Starts getting timeout errors with ECR registries until pod is restarted #679

Open
pdeva opened this issue Feb 24, 2024 · 1 comment
Labels
bug Something isn't working

pdeva commented Feb 24, 2024

Describe the bug
I'm seeing timeout errors with ECR from time to time. Restarting the image-updater pod fixes it, but the problem resurfaces at random hours later.

To Reproduce
This is the relevant config for the image updater Helm chart:

              - name: config.registries[0].name
                value: "ECR"
              - name: config.registries[0].api_url
                value: "https://xxx.dkr.ecr.us-east-1.amazonaws.com"
              - name: config.registries[0].prefix
                value: "xxx.dkr.ecr.us-east-1.amazonaws.com"
              - name: config.registries[0].default
                value: "true"
              - name: config.registries[0].ping
                value: "true"
              - name: config.registries[0].insecure
                value: "false"
              - name: config.registries[0].credentials
                value: "ext:/scripts/ecr_login.sh"
              - name: config.registries[0].credsexpire
                value: "11h"
              - name: "authScripts.enabled"
                value: "true"
              - name: 'authScripts.scripts.ecr_login\.sh'
                value: >
                  #!/bin/sh

                  aws ecr --region us-east-1 get-authorization-token --output text --query
                  'authorizationData[].authorizationToken' | base64 -d

Expected behavior
If the auth works once, it should always work. It shouldn't randomly time out, and it shouldn't need a pod restart to recover.

Version

0.12.2

Logs

Could not set registry endpoint credentials: error executing /scripts/ecr_login.sh: `/scripts/ecr_login.sh` failed timeout after 10s
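To narrow down whether the `aws` call itself is what hangs, one option is a diagnostic variant of the credentials script that bounds the network waits and logs how long the call took. This is only a sketch, not a confirmed fix: the function names `fetch_token` and `timed_login` are made up for illustration, the timeout values are arbitrary picks below the 10s limit from the log, and it assumes only stdout is read as the credentials while stderr is free for diagnostics.

```shell
#!/bin/sh
# Diagnostic sketch of ecr_login.sh; names and timeout values are assumptions.

REGION="${AWS_REGION:-us-east-1}"   # region from the config in this issue

# Print the base64-encoded ECR authorization token on stdout.
# --cli-connect-timeout / --cli-read-timeout are AWS CLI global options (seconds).
# CLI retries may still push the total time beyond these values.
fetch_token() {
  aws ecr --region "$REGION" get-authorization-token \
    --cli-connect-timeout 3 --cli-read-timeout 5 \
    --output text --query 'authorizationData[].authorizationToken'
}

# Run fetch_token, report exit status and elapsed seconds on stderr,
# then print the decoded token (same output as the original script) on stdout.
timed_login() {
  start=$(date +%s)
  token=$(fetch_token)
  status=$?
  end=$(date +%s)
  echo "ecr_login: aws exit=$status elapsed=$((end - start))s" >&2
  [ "$status" -eq 0 ] || return "$status"
  printf '%s' "$token" | base64 -d
}
```

To use it as the credentials script, end the file with a single `timed_login` line. If the stderr line shows the elapsed time creeping up before the failures start, that points at the AWS call or the network rather than at the image updater.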

pdeva added the bug label Feb 24, 2024

pdeva commented Feb 25, 2024

This seems related to #657.
