On-device ExecuTorch LLM API

Hi all,

So I want to make a mobile game where the NPCs are basically chatbots. To do so I need to quantize LLMs and deploy them onto mobiles. I have been reading ExecuTorch tutorials but not completely, and since my focus is on computer vision, this is a new field to me really.

To my understanding, I only need LLM APIs to get this idea to work, as opposed to compiling a chatbot app. I was wondering if this is possible? Many thanks in advance.

1 Like

Hi @Alex_Zhou1 thanks for your interest in ExecuTorch!

The high level flow of deploying LLM models to mobile application is:

For more questions please create an issue in Issues · pytorch/executorch · GitHub