[UpdateWorkflow] Make login node use head-node-driven orchestration f…#7409

Draft
gmarciani wants to merge 1 commit into aws:develop from gmarciani:wip/mgiacomo/3160/uw-login-nodes-0526-1

gmarciani (Contributor) commented May 27, 2026

This PR depends on aws/aws-parallelcluster-cookbook#3188

Description of changes

Make login node use head-node-driven orchestration for the update workflow.

With this change, login nodes no longer depend on cfn-hup/cfn-init and use the same update mechanism already adopted for compute nodes. We expect this to make the update workflow more resilient.

The changes in this PR mirror the changes made for the compute fleet in the following PRs:

  1. Replace cfn-hup on compute nodes with systemd timers to signal updates aws-parallelcluster-cookbook#3070: this is the core change that removes cfn-hup
  2. [Scaling] Remove usage of cfn-init in Compute Fleet aws-parallelcluster-cookbook#2875: change in the cookbook to remove cfn-init, required by change 1 in this list
  3. [Scaling] Removing usage of cfn-init for compute fleet #6655: change in the CLI to remove cfn-init, required by change 1 in this list
  4. Remove cfn-hup.log from computefleet cloudwatch agent config aws-parallelcluster-cookbook#3093: minor change related to logs
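
For readers unfamiliar with the pattern, here is a minimal sketch of what a head-node-driven, timer-based update check could look like on a node. It is not the code in this PR or in the cookbook. It assumes, purely for illustration, that the head node publishes a version marker for the latest cluster update and that a periodic timer on each node invokes the check. All names (`NodeUpdateState`, `check_and_apply`, `read_published_version`, `apply_update`) are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class NodeUpdateState:
    """Hypothetical per-node record of the last update version applied."""
    applied_version: Optional[str] = None


def check_and_apply(
    state: NodeUpdateState,
    read_published_version: Callable[[], Optional[str]],
    apply_update: Callable[[str], None],
) -> bool:
    """Run one polling cycle; return True if an update was applied.

    Assumption: read_published_version returns the marker published by the
    head node, or None when nothing has been published yet.
    """
    published = read_published_version()
    # Nothing published, or this node is already up to date: do nothing.
    if published is None or published == state.applied_version:
        return False
    # Apply first, then record the version, so that a failure in
    # apply_update leaves the node eligible for a retry on the next cycle.
    apply_update(published)
    state.applied_version = published
    return True
```

In this sketch the node never decides on its own that an update exists; it only reacts to what the head node has published, which is the sense in which the orchestration is head-node-driven.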

Tests

  • ONGOING test_update_slurm

References

  • Link to impacted open issues.
  • Link to related PRs in other packages (i.e. cookbook, node).
  • Link to documentation useful to understand the changes.

Checklist

  • Make sure you are pointing to the right branch.
  • If you're creating a patch for a branch other than develop add the branch name as prefix in the PR title (e.g. [release-3.6]).
  • Check all commits' messages are clear, describing what and why vs how.
  • Make sure to have added unit tests or integration tests to cover the new/modified code.
  • Check if documentation is impacted by this change.

Please review the guidelines for contributing and Pull Request Instructions.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
