[FEATURE REQUEST] Auto-detect llama2/llama3 from tokenizer.model in runner-aoti/runner-et #706
Comments
AOTI support added, ET support flagged in #1484
There's some remaining integration work that needs to be done, so maybe we should keep this open.
Basically, right now if we do a torchchat export with the llama3.2 vision model tag and AOTI, there's some breakage. I was hoping someone could enable that path.
Oh, uh, this issue is not talking about AOTI, I guess?
The original task is pretty old, so the context has slightly changed as well. I took it to mean removing the need to manually specify the tokenizer type during aoti_run for the text-only models (like Angela's AOTI PR and the follow-up issue #1484). Line 856 in 062dd87
Got it. Mind spinning up an issue for the 3.2 AOTI integration?
Here you go: #1497
Remove the `-l 2` and `-l 3` flags and auto-detect the model architecture from the tokenizer class. Issue a warning if the user-supplied `-v` does not match the vocab size inferred from `tokenizer.model`.
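One plausible way to do this detection (a sketch, not torchchat's actual implementation): llama3 ships a tiktoken-style `tokenizer.model` consisting of text lines of the form `<base64-token> <rank>`, while llama2 ships a SentencePiece protobuf, which is binary. The function name `detect_tokenizer_type` and the exact heuristic below are assumptions for illustration.

```python
import base64


def detect_tokenizer_type(model_path: str) -> str:
    """Heuristic tokenizer detection (illustrative sketch).

    llama3's tokenizer.model is tiktoken-style: UTF-8 text lines of
    '<base64 token> <integer rank>'. llama2's is a SentencePiece
    protobuf, which is binary and will fail the checks below.
    """
    with open(model_path, "rb") as f:
        head = f.read(4096)  # first chunk is enough to classify
    try:
        first_line = head.decode("utf-8").splitlines()[0]
        token_b64, rank = first_line.split()
        base64.b64decode(token_b64, validate=True)  # must be valid base64
        int(rank)                                   # must be an integer rank
        return "llama3"  # tiktoken-style BPE file
    except (UnicodeDecodeError, ValueError, IndexError):
        return "llama2"  # assume SentencePiece protobuf
```

With something like this in the runner's setup path, the `-l` flag becomes unnecessary, and the vocab size read from the detected tokenizer could be compared against a user-supplied `-v` to emit the warning the issue asks for.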