
Dealing with long running client #320

Closed
zhavir opened this issue Apr 16, 2024 · 4 comments

Comments

zhavir commented Apr 16, 2024

Hello, do you know what the best way is to deal with a long-running client? I'd like to use a boto3 client within an API call, but setting up the session and making the call takes forever every time. Any suggestions? Where is the best place to initialize the boto3 session?

@25thbamofthetower

AWS has docs about where to place it: it's usually outside of the Lambda handler function, and it's usually among the first search results. But I think what you're seeing might be the Lambda cold start.
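For reference, a minimal sketch of that pattern (the topic ARN environment variable and handler shape are hypothetical): the client is created at module level, once per execution environment, and reused by every invocation that lands on that container.

```python
import os

import boto3

# Created once per Lambda execution environment (outside the handler),
# then reused by every invocation that hits this container.
sns = boto3.client("sns")


def handler(event, context):
    # Hypothetical topic ARN pulled from an environment variable.
    sns.publish(TopicArn=os.environ["TOPIC_ARN"], Message="hello")
    return {"statusCode": 200}
```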

zhavir commented May 4, 2024

Yes, more or less. I've put my client init outside the handler, but even so, the first call to SNS (in my case) takes forever (3.5 s); subsequent calls take 200 ms. Interestingly, I set up a CloudWatch event that keeps the Lambda function warm (more or less); with this trick I was able to get the first call down to 1.2 s, while subsequent calls still take 200 ms. To reduce the time of the first call as well, I had to call list topics right after initializing the boto3 client.
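As a rough sketch, the warm-up trick described above amounts to issuing one cheap SNS call (list_topics here; any lightweight call would do) right after creating the client, so credential resolution and connection setup happen during init rather than on the first real request:

```python
import boto3

# Module-level init: runs once per container, outside the handler.
sns = boto3.client("sns")

# Warm-up call made right after initialization so the first real request
# doesn't pay for credential resolution and TLS connection setup.
sns.list_topics()


def handler(event, context):
    # Subsequent publishes reuse the already-warm client and connection.
    # Event keys here are hypothetical.
    sns.publish(TopicArn=event["topic_arn"], Message=event["message"])
```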

nitsujri commented May 14, 2024

> To reduce the time of the first call as well, I had to call list topics right after initializing the boto3 client.

This is the cold-start problem, right? That is, after you deploy a new version of your AWS Lambda container, it takes 3.5 to 5 seconds to cold start and fully answer a client?

If it is, we deal with this problem via blue/green deployments with CodeDeploy. For us, we use the CDK, specifically:

  1. aws_sam.CfnFunction.
    • Note this is not lambda.CfnFunction.
  2. This allows us to use the pre-deployment validation via deploymentPreference: { hooks: { preTraffic: ... } }.
  3. preTraffic is the same container but with a different entrypoint.
    • The entrypoint is a special "pre-warming" hook that fires a direct Lambda call to the main function.
    • We call the "incoming" version 5x in parallel (synchronously) with a special param.
    • The special param selects a code path that effectively calls print(User.objects.count()). This makes the container reach out to the DB, which both tests and exercises real calls and code paths.
    • The special code path also sleeps for a random 0.25-0.4 seconds, which simply helps ensure 5 parallel containers get pre-warmed.
  4. Once all the parallel direct Lambda calls have returned, we return the preTraffic hook with a positive result. If any pre-warm fails, we fail the preTraffic hook.

This way you can pre-warm the new version and only switch traffic to it after the cold start has been dealt with. It has the benefit of testing all the way down to a simple DB call, so if there's a failure the new version never gets switched in. A rough sketch of the hook is below.
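A minimal sketch of what such a preTraffic hook Lambda might look like in Python. The target function name and the "prewarm" payload key are hypothetical and stand in for the special param described above; the CodeDeploy status report uses the standard put_lifecycle_event_hook_execution_status API.

```python
import json
from concurrent.futures import ThreadPoolExecutor

import boto3

lambda_client = boto3.client("lambda")
codedeploy = boto3.client("codedeploy")

# Hypothetical name of the "incoming" Lambda version/alias to pre-warm.
TARGET_FUNCTION = "my-app-function:live"


def prewarm_once(i):
    """Invoke the target synchronously with a hypothetical 'prewarm' flag.

    The target's handler is expected to recognise this flag, touch the DB
    (e.g. print(User.objects.count())) and sleep 0.25-0.4 s before returning.
    """
    response = lambda_client.invoke(
        FunctionName=TARGET_FUNCTION,
        InvocationType="RequestResponse",
        Payload=json.dumps({"prewarm": True, "worker": i}),
    )
    if response.get("FunctionError"):
        raise RuntimeError(f"Pre-warm invocation {i} failed")


def handler(event, context):
    # CodeDeploy passes these IDs to the lifecycle hook so we can report back.
    deployment_id = event["DeploymentId"]
    execution_id = event["LifecycleEventHookExecutionId"]

    status = "Succeeded"
    try:
        # Fire 5 synchronous invocations in parallel so 5 containers warm up.
        with ThreadPoolExecutor(max_workers=5) as pool:
            list(pool.map(prewarm_once, range(5)))
    except Exception:
        status = "Failed"

    # Tell CodeDeploy whether to shift traffic to the new version.
    codedeploy.put_lifecycle_event_hook_execution_status(
        deploymentId=deployment_id,
        lifecycleEventHookExecutionId=execution_id,
        status=status,
    )
    return status
```

If the hook reports "Failed", CodeDeploy aborts the deployment and traffic stays on the old version, which is what gives the testing benefit described above.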

zhavir commented May 14, 2024

Thank you so much @nitsujri! It worked for me.

zhavir closed this as completed May 14, 2024