Best practice for training LLaMA models in Megatron-LM
Annotations of interesting ML papers I read
Large-scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still a work in progress)*
A LLaMA1/LLaMA2 Megatron implementation.
Megatron-LM/GPT-NeoX compatible Text Encoder with 🤗Transformers AutoTokenizer.
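As a rough illustration of how such an encoder would plug into 🤗 Transformers, here is a minimal sketch using `AutoTokenizer`; the `gpt2` checkpoint name is only an example, and any vocabulary compatible with your Megatron-LM / GPT-NeoX model could be substituted.

```python
from transformers import AutoTokenizer

# Minimal sketch: load a GPT-style tokenizer via AutoTokenizer.
# "gpt2" is an illustrative checkpoint; swap in the tokenizer that
# matches your Megatron-LM / GPT-NeoX vocabulary.
tokenizer = AutoTokenizer.from_pretrained("gpt2")

# Encode a sample sentence into token IDs and decode it back.
ids = tokenizer("Megatron-LM makes large-scale training practical.")["input_ids"]
print(ids)
print(tokenizer.decode(ids))
```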
Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine