Add support for NCCOM micro-benchmark for Kubernetes #433

Open
@bryantbiggs

Description

Similar to the NCCL tests for Kubernetes (https://github.com/aws-samples/awsome-distributed-training/tree/main/micro-benchmarks/nccl-tests/kubernetes), it would be great to have an equivalent Kubernetes test for NCCOM (https://github.com/aws-samples/awsome-distributed-training/tree/main/micro-benchmarks/nccom-tests).

  • Would this require a new Docker image, e.g. public.ecr.aws/hpc-cloud/nccom-tests:latest, analogous to the existing public.ecr.aws/hpc-cloud/nccl-tests:latest?
