Errors when running Llama3 8b on executorch QNN backend

Jiawei_Liu1 · November 8, 2024, 6:25am

I was following the executorch tutorial sesion “Building and Running Llama 3 8B Instruct with Qualcomm AI Engine Direct Backend”. I’ve finished converting the llama3 8b and the llama_main runner compiling. But I got this error when I was running the llama_main runner on the phone through the command “./llama_main --model_path llama3_8b.pte --tokenizer_path tokenizer.model --prompt “What is the capital of France?” --seq_len 120 --warmup=1”:

I was pushing the model to meizu 21pro which is 8gen3 android phone with 16G RAM.
Did anyone have this question before?

Jiawei_Liu1 · November 8, 2024, 6:26am

Additional error log

Jiawei_Liu1 · November 8, 2024, 6:27am

Files in my phone directory:

dbort · December 2, 2024, 7:27pm

Hi @Jiawei_Liu1, sorry for the delay. Could you please file a github issue with instructions for reproducing the problem, along with information about your development environment?