-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
which model should be used? #13
Comments
|
|
More research is needed to establish the exact benefit from document instructions. Here's the part from the paper that discusses it a bit:
|
Okay so use instructions for document retrieval, just for the query embedding side not the document embedding side. Thanks for the excerpt I understand the one sided instructions now. Do you have any other recommendations for finetuning the existing grit model for embedding only? |
Exactly. You can also use them for the document embedding side if you want, but the benefit is unclear to me. Would be interesting to know! If you are only interested in embedding performance, I would probably fine-tune from the embedding-only variant instead: https://huggingface.co/GritLM/emb_m7_nodes16_fast Other than that, I'd follow the recommendations in the paper (bidirectional attn, large batch size etc) |
Hi and thank you for sharing this amazing work.
i want to use Gritlm to produce embeddings to be stored in some vector database for document retrieval. But. there are many models on the huggingface.
The text was updated successfully, but these errors were encountered: