align probe exec timeouts with k8s probe timeouts#3784
align probe exec timeouts with k8s probe timeouts#3784openshift-merge-bot[bot] merged 2 commits intooperator-framework:masterfrom
Conversation
Signed-off-by: grokspawn <jordan@nimblewidget.com>
rashmigottipati
left a comment
There was a problem hiding this comment.
Have you noticed any issues with probe timeouts that motivated adding -rpc-timeout=5s? Or was this mainly to align with Kubernetes defaults?
Yeah, there are a lot of times in the logs/events where the catsrc pods would claim a startupProbe failure because the catalog size prevented its GRPC availability w/in 1s. The retries always recovered it, but the logs always include a statement about the 1s failure.
This is weird, because we don't configure any 1s thresholds, but this is because it's the default for grpc_health_probe. Functionally this change has no real effect. It will just result in avoiding the initial failure log/event for catalogs of a certain threshold size or higher, and the confusing message. |
rashmigottipati
left a comment
There was a problem hiding this comment.
@grokspawn Thanks for the context. This is a clean improvement. Changes look good to me.
/lgtm
rashmigottipati
left a comment
There was a problem hiding this comment.
To fix the failing unit test, you may have to update the reconciler_test.go to reflect the new probe args.
Signed-off-by: grokspawn <jordan@nimblewidget.com>
|
Thanks @rashmigottipati . Could I bother you for another lgtm? I had to fix the whitespace-only formatting on the test file. (Maybe wait until we see the sanity CI pass so you don't throw away the lgtm because for some reason I can't run |
|
/approve |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: rashmigottipati, tmshort The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
04fcb6e
into
operator-framework:master
Description of the change:
Motivation for the change:
Architectural changes:
Testing remarks:
Reviewer Checklist
/doc[FLAKE]are truly flaky and have an issue