Welcome to Incremental Social! Learn more about this project here!
Check out lemmyverse to find more communities to join from here!

CubitOom , (edited )

So what is Home Assistant using for this?

If I were to build it myself I'd probably over complicate it by using multiple llm agents on a local server. Probably use whisper to do the speech to text and then Mistral fine tuned on the Rosetta code dataset to send the API calls to HA. However that wouldnt keep it from always listening to me and trying to interpret what I say into a command for HA. Is that just a prompting issue for whisper or would I need another agent to turn on whisper?

I could maybe get this to run without specialized hardware like a GPU but it would be better to have something for the llms to be a bit more responsive.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • technology@lemmy.world
  • random
  • incremental_games
  • meta
  • All magazines