Skip to content
View memray's full-sized avatar

Organizations

@salesforce

Block or report memray

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A community hub for collecting and sharing real-world issues with LLMs and other models to help improve their capabilities.

1 Updated Oct 21, 2024

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25]

Python 153 6 Updated Mar 7, 2025

A PyTorch Native LLM Training Framework

Python 747 41 Updated Dec 27, 2024

Salesforce open-source LLMs with 8k sequence length.

Python 716 38 Updated Jan 31, 2025

Unified Controllable Visual Generation Model

Python 634 35 Updated Jan 27, 2025
Jupyter Notebook 304 22 Updated Jan 27, 2025

A deep learning library for identifying keyphrases from text

Python 25 3 Updated Aug 1, 2022
Python 132 17 Updated Jul 5, 2023

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Python 5,024 391 Updated Jan 31, 2025

ACTER is a manually annotated dataset for term extraction, covering 3 languages (English, French, and Dutch), and 4 domains (corruption, dressage, heart failure, and wind energy).

19 2 Updated Apr 8, 2022

Automatically generate your résumé and various cover letters from YAML files.

Python 125 21 Updated Aug 14, 2024

Code to obtain the PMC-SA. A dataset for the summarization of scientific articles.

Python 6 4 Updated Mar 24, 2023

Everything you need to know for a Software Engineering interview

2,098 456 Updated Mar 7, 2023

BART summarization tool

Python 6 Updated Sep 11, 2020

Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.

Shell 144 29 Updated Jul 3, 2020
Jupyter Notebook 535 117 Updated Dec 30, 2021

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

14,339 1,432 Updated Feb 13, 2023

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

C++ 1,178 277 Updated Jan 27, 2022
Python 449 79 Updated Oct 26, 2022

Plot the vector graph of attention based text visualisation

Python 372 57 Updated Apr 12, 2019

Keyphrase Generation

Jupyter Notebook 218 33 Updated Jul 22, 2023

Python Keyphrase Extraction module

Python 1,578 291 Updated Jul 12, 2023

An open-source NLP research library, built on PyTorch.

Python 11,824 2,251 Updated Nov 22, 2022
Python 3 Updated Dec 2, 2018

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Python 1,528 247 Updated Nov 29, 2024

Full Python implementation of the ROUGE metric, producing same results as in the official perl implementation.

Perl 157 25 Updated Jul 10, 2019

A Python wrapper for the ROUGE summarization evaluation package

Python 251 71 Updated Feb 10, 2021

Unsupervised Language Modeling at scale for robust sentiment classification

Python 1,060 202 Updated Jun 28, 2020

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,900 495 Updated Feb 14, 2023

Facebook AI Research Sequence-to-Sequence Toolkit

Lua 3,739 613 Updated Sep 17, 2021
Next
Showing results