Skip to content

fix(controller): honor scaleDownDelaySeconds before scaling down the intermediate RS#4645

Open
andrewjamesbrown wants to merge 9 commits into
argoproj:masterfrom
andrewjamesbrown:ajb/abortScaleDownDelaySeconds
Open

fix(controller): honor scaleDownDelaySeconds before scaling down the intermediate RS#4645
andrewjamesbrown wants to merge 9 commits into
argoproj:masterfrom
andrewjamesbrown:ajb/abortScaleDownDelaySeconds

Conversation

@andrewjamesbrown

Copy link
Copy Markdown
Contributor

We are experiencing a problem with our use of Istio and Argo Rollouts. During an "interrupted canary", we see:

The issue occurs because the Istio control plane is async - it takes time for the configuration to sync to all the proxies, and during this time, we've started to scale down the intermediate replicaset. In this case, it's desirable to delay the scale-down of the intermediate replicaset until the Istio data plane is in sync.

We see this on a 1.9-equivalent release that we build internally.

Checklist:

  • Either (a) I've created an enhancement proposal and discussed it with the community, (b) this is a bug fix, or (c) this is a chore.
  • The title of the PR is (a) conventional with a list of types and scopes found here, (b) states what changed, and (c) suffixes the related issues number. E.g. "fix(controller): Updates such and such. Fixes #1234".
  • I've signed my commits with DCO
  • My builds are green. Try syncing with master if they are not.
  • I have written unit and/or e2e tests for my change. PRs without these are unlikely to be merged.
  • I have run all tests locally (including the flaky ones) and they pass on my workstation
  • I have used LLM/AI/Agent tools for this PR but I am responsible for all code of this PR
  • I understand what the code does and WHY/HOW it works in several scenarios
  • I know if my code is just adding new functionality or changing old functionality for existing users
  • My organization is added to USERS.md.

@andrewjamesbrown andrewjamesbrown changed the title fix(canary): honor scaleDownDelaySeconds before scaling down the intermediate RS fix(controller): honor scaleDownDelaySeconds before scaling down the intermediate RS Mar 12, 2026
@andrewjamesbrown andrewjamesbrown force-pushed the ajb/abortScaleDownDelaySeconds branch 2 times, most recently from 75ffe54 to c96475f Compare March 12, 2026 19:24
@github-actions

github-actions Bot commented Mar 12, 2026

Copy link
Copy Markdown
Contributor

Published E2E Test Results

  4 files    4 suites   3h 45m 23s ⏱️
120 tests 106 ✅  7 💤 7 ❌
488 runs  453 ✅ 28 💤 7 ❌

For more details on these failures, see this check.

Results for commit f67f89f.

♻️ This comment has been updated with latest results.

@github-actions

github-actions Bot commented Mar 12, 2026

Copy link
Copy Markdown
Contributor

Published Unit Test Results

2 474 tests   2 473 ✅  3m 19s ⏱️
  129 suites      0 💤
    1 files        1 ❌

For more details on these failures, see this check.

Results for commit f67f89f.

♻️ This comment has been updated with latest results.

@codecov

codecov Bot commented Mar 12, 2026

Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 85.06%. Comparing base (2ccdae8) to head (3c3e116).

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #4645      +/-   ##
==========================================
+ Coverage   85.03%   85.06%   +0.02%     
==========================================
  Files         164      164              
  Lines       18989    18992       +3     
==========================================
+ Hits        16148    16155       +7     
+ Misses       1993     1991       -2     
+ Partials      848      846       -2     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
  • 📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

@andrewjamesbrown andrewjamesbrown force-pushed the ajb/abortScaleDownDelaySeconds branch from eb6072a to 917b5f0 Compare March 19, 2026 20:37
…intermediate RS

Signed-off-by: Andrew Brown <andrew.brown@wealthsimple.com>
Signed-off-by: Andrew Brown <andrew.brown@wealthsimple.com>
Signed-off-by: Andrew Brown <andrew.brown@wealthsimple.com>
Signed-off-by: Andrew Brown <andrew.brown@wealthsimple.com>
@andrewjamesbrown andrewjamesbrown force-pushed the ajb/abortScaleDownDelaySeconds branch from 917b5f0 to cd62806 Compare March 19, 2026 20:38
@sonarqubecloud

Copy link
Copy Markdown

@Hariharasuthan99 Hariharasuthan99 left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@kostis-codefresh kostis-codefresh added the needs-follow-up Used when a maintainer needs to follow up label Apr 28, 2026
@andrewjamesbrown andrewjamesbrown requested a review from a team as a code owner June 8, 2026 13:00
@sonarqubecloud

sonarqubecloud Bot commented Jun 8, 2026

Copy link
Copy Markdown

@Hariharasuthan99

Copy link
Copy Markdown
Contributor

Hello @andrewjamesbrown, I see after you merged master, one of the unit test has failed, could you check and see if you could fix it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cherry-pick/release-1.9 needs-follow-up Used when a maintainer needs to follow up

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants