[Tracking] Sentence Embedding Model #2324

Open
1 task
tqchen opened this issue May 11, 2024 · 2 comments
Labels
status: tracking Tracking work in progress

Comments

@tqchen
Contributor

tqchen commented May 11, 2024

Overview

This is a global tracking issue to bring generic sentence embedding models to MLCEngine.

Action Items

  • Add support for Mistral-based sentence embedding

Links to Related Issues and PRs

@tqchen
Contributor Author

tqchen commented May 11, 2024

Context: #1744

@VoVAllen

bge-m3 is a good candidate since it also supports sparse embeddings. In our end-to-end pipeline, we found that turning text into embeddings took most of the time: text-to-embedding through the OpenAI API costs 100ms+, while the vector search part only needs 10ms. It would be nice to have an efficient embedding model that runs locally.
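
To put the latency figures above in proportion, here is a minimal sketch that computes each stage's share of total pipeline time. The helper name `stage_shares` and the stage labels are hypothetical, and 100 ms is used as a lower bound for the "100ms+" embedding call, so the real embedding share may be higher.

```python
def stage_shares(timings_ms):
    """Return each stage's fraction of the total pipeline latency."""
    total = sum(timings_ms.values())
    if total <= 0:
        raise ValueError("total latency must be positive")
    return {stage: ms / total for stage, ms in timings_ms.items()}


# Figures taken from the comment; 100 ms is a lower bound ("100ms+").
timings_ms = {"embedding": 100.0, "vector_search": 10.0}
shares = stage_shares(timings_ms)

for stage, share in shares.items():
    print(f"{stage}: {share:.0%}")
```

With these numbers the embedding call accounts for roughly 91% of the request time, which is why a faster local embedding model would matter more than further tuning the vector search.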
