This is an automated archive made by the Lemmit Bot.

The original was posted on /r/homeassistant by /u/Grandpa-Nefario on 2025-03-23 22:28:20+00:00.


There is a lot of variety in the LLMs being used with Home Assistant, as well as voice pipelines.

Because this tech moves pretty fast, I would be curious to hear what others are either using right now, and satified with, or if they are like me and waiting for the next shiny object.

I am using an older model, luna_7b served by LM Studio using the LM Studio gpt compatible API. Works pretty for the most part, however, it has a problem with musltiple commands at a time.

I have also used, the Qwen 2.5_7 and 14b variants with pretty good success; they are on average about 1.5 to 3 seconds slower in their response times than luna_7b, but don’t struggle with multipler commands at one time.

Chat GPT-4o, while not local, is still the fastest model I have used, but of course it is not local. FYI, I am using faster-whisper turbo on my server and haven’t tested Gemini or Claude