bot@lemmy.smeargle.fans to Hacker News@lemmy.smeargle.fans · 1 year ago

Making my local LLM voice assistant faster and more scalable with RAG

johnthenerd.com

If you read my previous blog post, you probably already know that I like my smart home open-source and very local, and that certainly includes any voice assistant I may have. If you watched the video demo, you have probably also found out that it’s… slow. Trust me, I did too. Prefix caching helps, but it feels like cheating. Sure, it’ll look amazing in a demo, but as soon as I start using my LLM for other things (which I do, quite often), that cache is going to get evicted and that first prompt is still going to be slow.

HN Discussion
