Coming soon – offline speech recognition on your phone

tardigrada@beehaw.org · 14 days ago

Wistful@discuss.tchncs.de · 14 days ago

What does FUTO use? It works pretty well (based on my limited testing), and it works offline.

hendrik@palaver.p3x.de · edit-2 14 days ago

I think that’s based on OpenAI’s Whisper model (which seems to be the de facto standard these days).

Steve@communick.news · 14 days ago

That’s what I was thinking.
I’m pretty sure FUTO isn’t the only one either.
This doesn’t seem like new tech.

Markaos@discuss.tchncs.de · 13 days ago

Yeah, stock Google voice recognition also works offline if you download the language model beforehand.

Gormadt@lemmy.blahaj.zone · 14 days ago

I’ve only had issues on days when I swap my aligners but even my friends have a hard time those days lol

10/10 highly recommended

I also dig their keyboard, I just wish it supported like searching for gifs to put directly into messenger apps.

9/10

dan@upvote.au · 14 days ago

I think the Home Assistant community has been working on offline speech recognition too, as a fully open replacement to things like Google Assistant.

fmstrat@lemmy.nowsci.com · 13 days ago

Pretty sure they use Whisper, which is what FUTO Keyboard already uses on Android to keep it local to the phone.

I use Heliboard as a keyboard, then FUTO Voice connected to the mic button.

𝕸𝖔𝖘𝖘@infosec.pub · 14 days ago

This article may have been right 2 years ago, but not so much today.

I have an offline STT keyboard on my phone that uses Vosk. I used to have an STT digital assistant, too (can’t remember which model), but I didn’t need a “Siri” and ended up uninstalling it.

brisk@aussie.zone · 13 days ago

“This maneuver may sound simple, but it involves an entirely new and unique code for which the researchers have sought a patent.”

How to make your discovery worthless in a single, idiotic move.

hendrik@palaver.p3x.de · edit-2 14 days ago

I think the real deal would be to have that available as open source. Maybe integrated directly into the core AOSP. I mean, the technology is available. And my phone has like 8GB of RAM. The only issue is that none of that is really integrated into my phone. And I think I’d occasionally use speech to text, text to speech and machine translation… But I want it local and Free Software… Same for my computer. All the software is there. But it isn’t integrated into the desktop, and it takes half a day to set up all the different Python projects…

B0rax@feddit.org · 13 days ago

What? Local speech recognition is already integrated in most phones. Open source options are also freely available… I am not sure what the news is here…

Markaos@discuss.tchncs.de · 13 days ago

“Indeed, try switching your smartphone to airplane mode and see how far your voice commands get you.”

Did that (or rather disabled mobile data and WiFi, because airplane mode would still keep the WiFi on), and then I dictated this sentence after the parentheses. So Google’s voice input works offline just fine.

Or do they mean something like a smart assistant? In that case fair, but it’s not like it will work with text input either.

It is true, however, that Google Translate doesn’t do offline voice translation even if the language you’re trying to translate from is downloaded for system-wide voice recognition.

t3rmit3@beehaw.org · 14 days ago

“Coming soon”

Not to my phone it’s not!

MisterD@lemmy.ca · 13 days ago

Only to send the words back to Google? No thanks

Verito@lemm.ee · edit-2 13 days ago

But it saves so much money on server time and data costs to just send the final transcript!
/s