I was following the executorch tutorial sesion “Building and Running Llama 3 8B Instruct with Qualcomm AI Engine Direct Backend”. I’ve finished converting the llama3 8b and the llama_main runner compiling. But I got this error when I was running the llama_main runner on the phone through the command “./llama_main --model_path llama3_8b.pte --tokenizer_path tokenizer.model --prompt “What is the capital of France?” --seq_len 120 --warmup=1”:
I was pushing the model to meizu 21pro which is 8gen3 android phone with 16G RAM.
Did anyone have this question before?