Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.9k 651

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.3k 149

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.5k 253

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 14k 1k

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 850 80

Repositories

Showing 10 of 523 repositories
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 3,144 Apache-2.0 429 18 25 Updated Sep 1, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 44 Apache-2.0 9 1 29 Updated Sep 1, 2025
  • sinonym Public

    Format and normalize Chinese names into Western forms

    allenai/sinonym’s past year of commit activity
    Python 2 Apache-2.0 1 0 0 Updated Sep 1, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 278 Apache-2.0 53 0 34 Updated Aug 31, 2025
  • regmixer Public
    allenai/regmixer’s past year of commit activity
    Jupyter Notebook 6 0 0 2 Updated Aug 31, 2025
  • beaker-gantry Public

    Gantry is a CLI that streamlines running experiments in Beaker

    allenai/beaker-gantry’s past year of commit activity
    Python 27 Apache-2.0 7 2 2 Updated Aug 31, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    allenai/olmocr’s past year of commit activity
    Python 13,954 Apache-2.0 1,030 19 6 Updated Aug 31, 2025
  • SimplerEnv Public
    allenai/SimplerEnv’s past year of commit activity
    Jupyter Notebook 12 MIT 4 0 0 Updated Aug 31, 2025
  • S2AND Public

    Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

    allenai/S2AND’s past year of commit activity
    Python 95 20 6 0 Updated Aug 29, 2025
  • allenai/rslearn_projects’s past year of commit activity
    Python 13 Apache-2.0 2 15 11 Updated Aug 29, 2025