-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Spike] [2h] Explore and understand 'Split Cost Allocation Data' on EKS #4465
Comments
I tried to follow https://docs.aws.amazon.com/cur/latest/userguide/use-athena-cf.html but couldn't find the I set up another export in the meantime. |
I can't do the manual athena attempt either, because https://docs.aws.amazon.com/cur/latest/userguide/create-manual-table.html is asking me for a I've spent 15min on this so far. |
Aaaah, looking at https://docs.aws.amazon.com/cur/latest/userguide/dataexports-processing.html, I see:
|
I've enabled this export now with |
I can run SQL queries with Athena now!
which gives as output: So that's great. Next is to examine the split output columns to see if we can use those. |
I can verify that individual pod names actually do make it in here, as part of |
Running this query to see what kind of costs get allocated:
I see:
And unfortunately, only that. This means the following costs are unattributed:
While we could tag the hub db disks and prometheus, I'm not sure we can do the same for network base disks. Not being able to tag network requests presents both a smaller and bigger challenge. Smaller because almost all our egress goes through the proxy pods and ingress pods anyway, so per-namespace networking would be kinda 'off' regardless (everything would get attributed to nginx-ingress). But bigger challenge because it's possible for this to get really expensive, and we need to be careful to make sure we can track this information. |
I've activated the tags I'll explore our networking situation, as well as 'requestor pays' situation |
For |
Based on my exploration here, and on what was determined to be the things that would be valuble to admins right now (per #4384), I've made the following choices:
I'll proceed to refine more tasks based on this. Looking at my time tracking, this has taken about 90minutes spread out over 3 days, which isn't so bad :) |
AWS has recently enabled better kubernetes integration for its cost data exports. We should explore it to see if it will serve our needs - I suspect it may.
I've enabled it for the openscapes cluster, so we will have data to work with shortly.
Things to investigate:
Things we wanna track
Spike outcome
Based on my exploration here, and on what was determined to be the things that would be valuble to admins right now (per #4384), I've made the following choices:
The text was updated successfully, but these errors were encountered: