CI: Cilium E2E Upgrade: cilium-health-ep is failing #32431
Comments
Some more relevant timestamps:
It looks like the health endpoint is restored, deleted, and recreated before the daemon is running.
Seems like endpoint restoration might be taking a long time.
Hit on #32436 as well.
#32528 increases the wait-timeout for the Cilium upgrade task from 5 to 10 minutes. This should help fix the failures on that specific job, but it might not fix the root cause (EP restoration logic? compile times, which may have increased a little with the recent LLVM 17 update?). Hence, let's keep this open and monitor the situation.
Hmm, hit again even with the 10 minutes timeout: https://github.com/cilium/cilium/actions/runs/9188138331/job/25267307890
There seems to be something weird going on with the health endpoint restoration.
E2E Upgrade: cilium-health-ep is failing; connect: no route to host
This is occurring in the E2E Upgrade workflow, especially on config 7.
Output of the Upgrade Cilium step:
There is a long delay between agent startup and the Cilium Health API being served. Snippet of relevant logs below, full output in this gist.
Hit on PR #32403
Workflow: https://github.com/cilium/cilium/actions/runs/9003017786/attempts/2
Sysdump:
cilium-sysdumps.zip