
Failed to Run Benchmark for llama-3-8b-instruct and llama-3.1-8b-instruct Models. #822

Open
tim102187S opened this issue Sep 4, 2024 · 2 comments

tim102187S commented Sep 4, 2024

I attempted to run benchmarks for the llama-3-8b-instruct and llama-3.1-8b-instruct models on both CPU and GPU, but the process failed. (I successfully tested the llama2-7b-chatbot model.)

I followed the instructions in openvino_notebooks/llm-chatbot.ipynb to download the models and ensured that all necessary files (including the required tokenizer.model) were included. I am using the latest version of OpenVINO (2024.3.0) and have also upgraded the transformers library.

The command I executed is:
python benchmark.py -m {path}/openvino_notebooks/notebooks/llm-chatbot/llama-3-8b-instruct/INT4_compressed_weights -n 2 -d CPU -p "What is large language model (LLM)?"

And I received the following error output:
[Screenshot of the error output: Screenshot from 2024-09-04 16-56-13]

peterchen-intel (Collaborator) commented

Adding the -ic 512 option should work around this issue.
We have a PR to fix it, but it introduces a performance regression; analysis is still in progress.
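
For reference, a sketch of the workaround applied to the command from the issue description (the model path and prompt are copied from above; -ic 512 is assumed here to cap the number of generated tokens, and the value may need adjusting for your setup):

python benchmark.py -m {path}/openvino_notebooks/notebooks/llm-chatbot/llama-3-8b-instruct/INT4_compressed_weights -n 2 -d CPU -ic 512 -p "What is large language model (LLM)?"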

tim102187S (Author) commented

Thank you for your response. Adding the -ic 512 option indeed resolves the issue.

When can we expect the full solution to be available?

andrei-kochin added the category: llm_bench label on Oct 16, 2024