
Can't use large models with pipeline() #1179

Open · sroussey opened this issue Feb 1, 2025 · 6 comments

Labels: bug (Something isn't working)

Comments

sroussey (Contributor) commented Feb 1, 2025

System Info

Example:

        const p = await pipeline('text-generation', 'Xenova/Phi-3-mini-4k-instruct', {
            device: 'webgpu',
            dtype: 'q4',
        });

I see this error in the console:

Uncaught (in promise) Error: Can't create a session. ERROR_CODE: 1, ERROR_MESSAGE: Deserialize tensor model.layers.5.mlp.gate_proj.MatMul.weight_Q4 failed. Failed to load external data file "model_q4.onnx_data", error: Module.MountedFiles is not available.

Seeing that the external .onnx_data file is the issue, I figured I needed to pass use_external_data_format along, but it does not work.

I have tried:

        const p = await pipeline('text-generation', 'Xenova/Phi-3-mini-4k-instruct', {
            device: 'webgpu',
            dtype: 'q4',
            use_external_data_format: true,
        });

and

        const p = await pipeline('text-generation', 'Xenova/Phi-3-mini-4k-instruct', {
            device: 'webgpu',
            dtype: 'q4',
            session_options: {use_external_data_format: true},
        });

But neither of these will load the model correctly.

Environment/Platform

  • Website/web-app
  • Browser extension
  • Server-side (e.g., Node.js, Deno, Bun)
  • Desktop app (e.g., Electron)
  • Other (e.g., VSCode extension)

Description

See above.

Reproduction

See above.

@sroussey sroussey added the bug Something isn't working label Feb 1, 2025
sroussey (Contributor, Author) commented Feb 1, 2025

I altered the source to pass use_external_data_format through, and I see that the extra data file loads. However, it seems more difficult to use this way... like how do you use a chat template?

xenova (Collaborator) commented Feb 1, 2025

Thanks for #1180! I don't know why this wasn't added before 👀

> However, it seems more difficult to use this way... like how do you use a chat template?

You can just pass in the messages object (and it will do the templating for you). See here for example code.

If you want access to the tokenizer to do things yourself, you can access pipeline.tokenizer.apply_chat_template
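A minimal sketch of what this looks like, assuming the standard role/content message shape; the pipeline and generation calls are shown as comments because they require @huggingface/transformers and a model download:

```javascript
// Chat messages in the role/content shape that chat templates expect.
const messages = [
  { role: 'system', content: 'You are a helpful assistant.' },
  { role: 'user', content: 'Write a haiku about WebGPU.' },
];

// Passing the messages array directly lets the pipeline apply the
// model's chat template for you (requires @huggingface/transformers):
//
//   const generator = await pipeline('text-generation',
//     'Xenova/Phi-3-mini-4k-instruct', { device: 'webgpu', dtype: 'q4' });
//   const output = await generator(messages, { max_new_tokens: 64 });
//
// Or, to do the templating yourself via the tokenizer:
//
//   const prompt = generator.tokenizer.apply_chat_template(messages, {
//     tokenize: false, add_generation_prompt: true });

console.log(messages.map((m) => m.role).join(','));
```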

sroussey (Contributor, Author) commented Feb 1, 2025

BTW: if you run the code above without the fix, it loads most of the files, but not all, and so it fails.

Problem number 2: if you then fix the transformers code, it will still fail, since not all of the files were downloaded correctly, but the cache thinks they were.

I need to open DevTools, go to the Application tab, then Cache Storage, and delete transformers-cache by right-clicking and choosing Delete. Only then will the code work.

This seems brittle.
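For what it's worth, the cache can also be dropped programmatically instead of through DevTools. A hedged sketch using the browser Cache Storage API; 'transformers-cache' is the cache name mentioned above, and the feature-detect keeps this from throwing outside a browser:

```javascript
// Delete the transformers.js model cache so a broken partial download
// gets re-fetched on the next load. caches.delete() resolves to true
// if the named cache existed and was removed, false otherwise.
async function clearTransformersCache(cacheName = 'transformers-cache') {
  if (typeof caches === 'undefined') {
    // Not in a browser (no Cache Storage support): nothing to clear.
    return false;
  }
  return caches.delete(cacheName);
}
```

Calling this before retrying the pipeline() load would force a clean re-download instead of reusing the corrupted cache entry.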

benc-uk commented Feb 16, 2025

Exact same issue in #963

tangkunyin commented

@benc-uk @sroussey Other option parameters were being dropped as well, not only use_external_data_format; see #1200.

Hope it will be fixed as soon as possible. Thanks a lot @xenova

tangkunyin commented

Until the official update lands, anyone can use this as a temporary workaround:

"@huggingface/transformers": "git+https://github.com/tangkunyin/transformers.js.git#develop"

It works for me!
