[Metrics] Kubernetes Support & Maintenance #427

Merged
dcrankshaw merged 38 commits into ucbrise:develop from simon-mo:kube
Mar 17, 2018
@simon-mo
Contributor

@simon-mo simon-mo commented Mar 8, 2018

This PR replaces #411 with a cleaner commit history and a different integration test.

The integration test issue in #411 was hard to debug. It turned out to be:

  • Inside each pod, the kubelet needs time to set up the pod's network service. We just need to wait for the network service to come up, which takes 30 seconds to 2 minutes. This is fixed by retrying in a while loop inside the integration test for Kubernetes metrics.
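
The wait described above can be sketched roughly as follows. This is only an illustration of the retry-in-a-while-loop idea, not the code in this PR: `wait_until_ready`, its parameters, and the 180-second default are assumptions chosen to cover the 30 seconds to 2 minutes mentioned above.

```python
import time


def wait_until_ready(check, timeout_s=180.0, interval_s=5.0,
                     sleep=time.sleep, clock=time.monotonic):
    """Poll check() until it returns True or timeout_s elapses.

    Hypothetical helper: `check` is any zero-argument callable that
    returns True once the pod's metrics endpoint answers.
    """
    deadline = clock() + timeout_s
    while True:
        try:
            if check():
                return True
        except OSError:
            # Connection refused / unreachable: the network service
            # is not set up yet, so treat this as "not ready".
            pass
        if clock() >= deadline:
            return False
        sleep(interval_s)
```

In a test, `check` would typically issue the HTTP request against the metrics endpoint and return whether it succeeded; the test then fails only if `wait_until_ready` returns `False`.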

This PR also adds some maintenance:

  • Modify the Docker integration test as well so that it checks the health of specific nodes. It turns out the original test 1 calls an API that returns the same information no matter whether the node is healthy or not.
  • Make frontend-exporter more fault tolerant with some try-catch blocks. It is not a big fix, so it is included in this PR. The independent PR is [Metrics] Frontend-exporter fault tolerance quick fix #429.
  • Update the README.
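
As a rough sketch of the try-catch idea for the exporter (not the actual frontend-exporter code; `scrape_once`, `fetch_metrics`, and `update_metrics` are hypothetical names), one scrape cycle can be wrapped so that a failed fetch is logged and skipped instead of crashing the process:

```python
import logging

logger = logging.getLogger("frontend_exporter_sketch")


def scrape_once(fetch_metrics, update_metrics):
    """Run one scrape cycle; return True on success, False if skipped.

    Both arguments are hypothetical callables: fetch_metrics() pulls raw
    metrics from the frontend, update_metrics(raw) publishes them.
    """
    try:
        raw = fetch_metrics()
        update_metrics(raw)
        return True
    except (OSError, ValueError, KeyError) as err:
        # Frontend unreachable or payload malformed: keep the exporter
        # alive and try again on the next cycle.
        logger.warning("scrape skipped: %s", err)
        return False
```

The caller's polling loop keeps running either way, so one bad response only costs one sample.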

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1079/
Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1081/
Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1106/
Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1109/
Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1110/
Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1111/
Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1115/
Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1116/
Test FAILed.

@simon-mo
Contributor Author

Jenkins, test this please

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1117/
Test FAILed.

Passes if I use my own docker registry
@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1118/
Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1119/
Test FAILed.

@simon-mo
Contributor Author

jenkins test this please

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1121/
Test PASSed.

@@ -31,12 +31,12 @@
# TODO: Add kubernetes specific checks that use kubernetes API
Contributor

Is this TODO still outstanding?

Contributor

@dcrankshaw dcrankshaw left a comment

lgtm

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Clipper-PRB/1126/
Test PASSed.

3 participants