Releases: stackhpc/ansible-slurm-appliance
v1.139
What's Changed
- Pull containers by @sjpb in #351:
- Container images pulled before service start and during fat image build.
- Fixes an issue where podman commands failed after reboot.
See PR for full details. - Use most-recent image in skeleton terraform if multiple found by @sjpb in #350
Full Changelog: v1.138...v.139
Deployment notes
No changes to galaxy roles/collections.
Image Details
New fat image openhpc-240116-1156-aa8dba7d, requires 12GB root disk.
CI changes
- FIP now used for build on Arcus to avoid docker.io rate limits.
v1.138
What's Changed
- Don't ignore image changes in skeleton terraform lifecycle by @sjpb in #334
- Cope with stale NFS file handles by @sjpb in #332
- Update fatimage base to RL8.9 with robust volume mounts by @sjpb in #341
- Remove cve-2023-41914 role by @sjpb in #337
- Avoid prompting user to accept hostkey in OOD shell by @sjpb in #331
- Fix removal of packer bundled ansible plugin by @sjpb in #346
Full Changelog: v1.137...v1.138
Deployment notes
The stackhpc.nfs
role has changed. To update this run:
dev/setup-env.sh
Image Details
New image openhpc-240102-1025-e533fd70, requiring 10GB root disk.
v1.137
What's Changed
Deployment Notes
No changes to galaxy roles/collections.
CI changes
None
Image Details
New image openhpc-231206-1648-9d6aa4e4, requiring 10GB root disk.
Full Changelog: v1.136...v1.137
v1.136
What's Changed
- Fix testuser password in CI image build by @sjpb in #335
- Fix #73: Fails late if no secrets defined by @sjpb in #329
- Use new TurboVNC repofile by @sjpb in #339: NB: turbovnc repofile has moved so this PR is required to deploy the appliance as of this date
Full Changelog: v1.135...v1.136
Deployment notes
No changes, no galaxy updates required.
Image Details
No new image provided at this time
v1.135
v1.134
What's Changed
- Updates to OpenHPC role and source image by @sjpb in #324
- Development quality-of-life improvements by @sjpb in #316
- Add support for freeipa clients by @sjpb in #241
Full Changelog: v1.133...v1.134
Deployment notes
The stackhpc.openhpc
role has changed. To update this run:
dev/setup-env.sh
Image Details
New image openhpc-231027-0916-893570de
v1.133
What's Changed
- Update to Open OnDemand v3.0.1 by @thomasbergernz and @sjpb in #314
- Make nvidia-driver install idempotent by @sjpb in #315
- Fix for Slurm CVE 2023 41914 by @sjpb in #320
CI changes
Deployment notes
The osc.ood
role has changed. To update this run:
ansible-galaxy role install --force -r requirements.yml -p ansible/roles
Image Details
New image openhpc-231020-1357-b5d8b056, requiring 10GB root disk.
New Contributors
- @thomasbergernz made their first contribution in #310
Full Changelog: v1.132...v1.133
v1.132
What's Changed
- Fix ssh ControlPath in skeleton by @sjpb in #297
- Fix issues when using GenericCloud image by @sjpb in #313
Full Changelog: v1.131...v1.132
CI changes
- "Fat" image build can now be done either on Arcus (using volume-backed instances -> 10GB virtual disk) or SMS-labs (using non-volume-backed instances - 12GB virtual disk)
Deployment notes
No galaxy-installed roles/collections have changed.
Image Details
Built a new image openhpc-230922-0940-434e190f
Now only requires a 10GB root disk.
v1.131
What's Changed
New features
Fixes and Enhancements
- Make etc_hosts role more flexible by @sjpb in #277
- Update prometheus-slurm-exporter version by @m-bull in #280
- Install out of tree openstack builder plugin by @m-bull in #285
- Remove warn parameter for ansible>=2.14 by @mkjpryor in #286
- Fix opensearch grafana plugin at last working version by @sjpb in #292
- Fix query type in the Slurm jobs Grafana dashboard by @mkarpiarz in #293
- Use Python3.9 for jupyter notebook server by @sjpb in #294
- Pin Terraform in CI to MPL licenced version by @sjpb in #302
- Update opensearch to 2.9.0 by @sjpb in #299
CI changes
- Make CI cloud selectable between SMSlabs and Arcus by @sjpb in #288
- Disable EESSI tests in CI and make them debuggable by @sjpb in #295
- Fix SMS ssh by @sjpb in #296
- Use
portal-internal
network (with normal-mode ports) for Arcus CI by @sjpb in #306
Deployment notes
Galaxy roles/collection versions have been changed so use ansible-galaxy {role,collection} install -f ...
after merging to force-update these.
Image Details
- Built a new image
openhpc-230811-1548-a49164d1
in #301 - Available from https://object.arcus.openstack.hpc.cam.ac.uk/swift/v1/AUTH_3a06571936a0424bb40bc5c672c4ccb1/openhpc-images/openhpc-230811-1548-a49164d1
- Now only requires a 12GB root disk
New Contributors
- @mkjpryor made their first contribution in #286
- @mkarpiarz made their first contribution in #293
Full Changelog: v1.130...v1.131
v1.130
What's Changed
New functionallity/roles/groups
- Support for EESSI - see
roles/eeessi
. By @JohnGarbutt in #252
Changes to Packer build functionality
- Allow Packer base images to be specified by either UUID or name by @m-bull in #266
- Support attaching a floating IP to the fatimage builder instance by @m-bull in #267
- Support using volume-backed instances for building and selecting the output image format by @m-bull in #269
- Allow specifying the packer manifest output path by @m-bull in #268
- Allow use of ephemeral SSH keys when building Packer images by @m-bull in #274
Other changes
- Support changing the
podman
user's uid by @sjpb in #264 - Fix to
proxy
role: now defaults to includinglocalhost
inno_proxy
by @sjpb in #270 - Add debug logging options for opensearch & filebeat by @sjpb in #271
- The UCX device to use for
hpctests
can now be defined per partition by @sjpb in #275 - Always delete resources on deploy failure in CI by @sjpb in #272
Full Changelog: v1.129...v1.130
Deployment notes
- No galaxy reinstalls required since last release.
Image details
- Provides a new image openhpc-230503-0944-bf8c3f63.qcow2 including EESSI support.
- Requires a 40GB root disk.