AWQ-Bert / 4-bit Bert #95

Open
michaelfeil opened this issue Feb 10, 2024 · 2 comments
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@michaelfeil
Owner

Hoping to add an implementation of 4-bit BERT, potentially in casper-hansen/AutoAWQ#328. Contributions welcome.

@michaelfeil michaelfeil added the enhancement New feature or request label Mar 16, 2024
@casper-hansen

Hi @michaelfeil, any chance you will look more closely into quantizing BERT models with AWQ? Your PR was off to a great start, but it needs more experimentation to figure out how to scale a BERT model.

@michaelfeil michaelfeil added the help wanted Extra attention is needed label Jun 24, 2024
@michaelfeil
Owner Author

@casper-hansen I'm open to collaboration, but unfortunately there has been no further progress.
