Add aws instance type to affinity terms in the pod template #3783

austinzh · 2024-01-19T22:21:31Z

When a pool has multiple instance types (e.g., a mixed CPU and GPU pool), we would like to be able to specify the instance type.

88manpreet · 2024-01-22T21:29:25Z

Code looks OK to me. Can you add unit tests and a few manual tests in the relevant ticket?

nemacysts

@austinzh just curious: how are we expecting folks to use this?

imo, the easiest (user-experience-wise, that is) approach would be to have a flag like --job-type X or something (where X could be things like generic, model-training, model-inference, etc) and ML Compute handles updating what instance types and whatnot those map to in the background - that way, Spark users don't need to worry about what instance types they need/want/can use

(that said, we would likely still want something like this for power-users and whatnot that want to run on specific hardware for whatever reason)
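
To make the --job-type idea concrete, here is a rough sketch of how such a flag could resolve to instance types while still letting power users override it. Everything in it is hypothetical: the mapping contents, the `resolve_instance_types` helper, and the choice of instance types are placeholders for illustration, not anything that exists in paasta_tools today.

```python
from typing import Dict, List

# Hypothetical mapping owned by ML Compute; the values are placeholders only.
# An empty list means "no instance-type constraint".
JOB_TYPE_TO_INSTANCE_TYPES: Dict[str, List[str]] = {
    "generic": [],
    "model-training": ["g4dn.2xlarge"],
    "model-inference": ["g4dn.xlarge"],
}


def resolve_instance_types(job_type: str, explicit: List[str]) -> List[str]:
    """Prefer explicitly requested instance types, else fall back to the job-type mapping."""
    if explicit:
        # Power users who pass instance types directly always win.
        return list(explicit)
    if job_type not in JOB_TYPE_TO_INSTANCE_TYPES:
        raise ValueError(f"unknown job type: {job_type}")
    return list(JOB_TYPE_TO_INSTANCE_TYPES[job_type])
```

With this shape, the mapping can change in the background without Spark users having to update their invocations.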

nemacysts · 2024-03-15T16:15:29Z

paasta_tools/cli/cmds/spark_run.py

@@ -265,6 +249,11 @@ def add_subparser(subparsers):
 default=default_spark_pool,
 )

+ list_parser.add_argument(
+ "--aws-instance-types",
+ help="AWS instance types for executor, seperate by comma(,)",


small wording edit:

Suggested change

- help="AWS instance types for executor, seperate by comma(,)",
+ help="AWS instance types for executor, separated by commas (,)",

it might also be nice to have argparse handle the splitting for us with something like:

Suggested change

- help="AWS instance types for executor, seperate by comma(,)",
+ help="AWS instance types for executor, separated by commas (,)",
+ type=lambda instances: [instance for instance in instances.split(",")],
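
For reference, a minimal standalone sketch of letting argparse do the splitting. The `comma_separated` helper name and the whitespace/blank handling are my own additions for illustration (not part of this PR); a named function is used instead of a lambda only so that it is easier to unit-test.

```python
import argparse
from typing import List


def comma_separated(value: str) -> List[str]:
    """Split a comma-separated string, trimming whitespace and dropping empty items."""
    return [item.strip() for item in value.split(",") if item.strip()]


parser = argparse.ArgumentParser()
parser.add_argument(
    "--aws-instance-types",
    help="AWS instance types for executor, separated by commas (,)",
    type=comma_separated,  # argparse calls this on the raw string value
    default=[],  # non-string defaults are used as-is, without going through `type`
)

# Example invocation with an explicit argv list
args = parser.parse_args(["--aws-instance-types", "g4dn.xlarge, g4dn.2xlarge"])
```

Downstream code then receives a list in `args.aws_instance_types` and never has to call `.split(",")` itself.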

nemacysts · 2024-03-15T16:21:53Z

paasta_tools/cli/cmds/spark_run.py

@@ -522,6 +511,47 @@ def should_enable_compact_bin_packing(disable_compact_bin_packing, cluster_manag
 return True


+# inplace add a low priority podAffinityTerm for compact bin packing
+def add_compact_bin_packing_affinity_term(pod: Dict, spark_pod_label: str):


suggestion: i'd probably rename pod here to pod_template to reduce confusion

suggestion: if y'all ever want to get rid of the incompletely typed Dict here, a possible option would be to use the models from the kubernetes client (e.g., https://github.com/kubernetes-client/python/blob/master/kubernetes/docs/V1PodTemplate.md) internally and then serialize to yaml at the very end :)

suggestion: imo, it's a little preferable to not mutate inputs in-place since pure functions are generally easier to work with/test - but it's not a particularly big deal :)

suggestion (if this remains an impure function): typing this as def add_compact_bin_packing_affinity_term(pod: Dict, spark_pod_label: str) -> None and removing the return would reduce confusion

(same points apply to add_node_affinity_terms() below)
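
A rough sketch of what the non-mutating variant could look like. The body of the real function is not shown in this diff, so the structure below is an assumption: it treats the template as a Pod-shaped dict with a top-level `spec`, uses a weight of 1 for "low priority", and uses a made-up label key (`spark-pod-label`). The function name is also hypothetical.

```python
import copy
from typing import Any, Dict


def with_compact_bin_packing_affinity_term(
    pod_template: Dict[str, Any], spark_pod_label: str
) -> Dict[str, Any]:
    """Return a copy of pod_template with a low-priority podAffinityTerm appended."""
    # Deep-copy so the caller's dict is never modified.
    new_template = copy.deepcopy(pod_template)
    preferred = (
        new_template.setdefault("spec", {})
        .setdefault("affinity", {})
        .setdefault("podAffinity", {})
        .setdefault("preferredDuringSchedulingIgnoredDuringExecution", [])
    )
    preferred.append(
        {
            "weight": 1,  # assumption: lowest weight stands in for "low priority"
            "podAffinityTerm": {
                "labelSelector": {
                    # hypothetical label key, for illustration only
                    "matchLabels": {"spark-pod-label": spark_pod_label},
                },
                "topologyKey": "kubernetes.io/hostname",
            },
        }
    )
    return new_template
```

Since the input is left untouched, a test can simply compare the original and the returned dict without having to rebuild fixtures between cases.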

nemacysts · 2024-03-15T16:27:41Z

paasta_tools/cli/cmds/spark_run.py

+ ].setdefault("nodeSelectorTerms", []).extend(
+ [
+ {
+ "key": "node.kubernetes.io/instance-type",


just curious: do we want users to specify an instance type (e.g., g4dn.xlarge vs g4dn.2xlarge) or would we be fine having them specify a family (e.g., g4dn) and letting karpenter spin up the most optimal instance type for the given requests at the time?
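
To illustrate the difference, a small sketch of the two selector terms side by side. It assumes Karpenter's AWS provider exposes the instance family through the `karpenter.k8s.aws/instance-family` node label (worth double-checking against the Karpenter version we run); `node_selector_term` is a hypothetical helper, not something in this PR.

```python
from typing import Any, Dict, List

INSTANCE_TYPE_LABEL = "node.kubernetes.io/instance-type"
# Assumption: family label as published by Karpenter's AWS provider.
INSTANCE_FAMILY_LABEL = "karpenter.k8s.aws/instance-family"


def node_selector_term(label: str, values: List[str]) -> Dict[str, Any]:
    """Build one nodeSelectorTerm that matches nodes whose label is in values."""
    return {
        "matchExpressions": [
            {"key": label, "operator": "In", "values": list(values)},
        ]
    }


# Pin exact instance types...
by_type = node_selector_term(INSTANCE_TYPE_LABEL, ["g4dn.xlarge", "g4dn.2xlarge"])
# ...or name only the family and leave sizing to the autoscaler.
by_family = node_selector_term(INSTANCE_FAMILY_LABEL, ["g4dn"])
```

Either dict could be appended to `nodeSelectorTerms`; the family variant leaves the choice of size within g4dn open.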

nemacysts · 2024-04-02T14:55:26Z

@austinzh do we still want to get this merged?

austinzh force-pushed the u/austinzh/add-template branch from cecffa3 to 5dafcb4 · January 19, 2024 22:26

Add aws instance type to affinity terms in the pod template

86537ac

austinzh force-pushed the u/austinzh/add-template branch from 5dafcb4 to 86537ac · January 19, 2024 22:31

nemacysts reviewed Mar 15, 2024
