This piece was originally put together for the BCS OSSG (British Computer Society, Open Source Specialist Group) website on the 15th March 2005. Some of the info may be a bit old by now (21st July 2006), but I decided it would be handy to post it on the HantsLUG Wiki too.
A good number of people who were at the OSSG meeting yesterday (14th March 2005 for those reading this sometime after posting) will no doubt have discussed voice recognition on Linux in the break. It got me thinking a bit, since IBM ViaVoice has been discontinued on the Linux platform and I couldn't think of any current projects in the field. I had a vague feeling in the back of my mind that I had read something related to one of the desktops, probably Gnome as that's the one I've had most interest in (although I use XFCE myself), but this could easily have been a discussion that there should be something!
Anyway, I couldn't imagine that there was nothing at all on the subject, even if there was nothing useable, so I've delved into Google and my bookmarks and come up with a few useful links - and since I said I'd post anything I found on the site, here's a new forum as well.
OK, starting with the more generic stuff:
First up, there's an old article in the Linux Gazette, although this doesn't get into anything technical and is far too high level to be of any real use: http://linuxgazette.net/issue87/lodato.html
There's also an article in Linux Journal on using ViaVoice with XVoice. I've not read it yet, but since ViaVoice is no longer available it seems of little use: http://www.linuxjournal.com/article/6383
There's some discussion on integrating ViaVoice with KDE as well, but so far I've not found more than some basic discussion of whether it is a good idea. I've also come across comments on GVoice for Gnome, but nothing particularly useful on it yet, and I think it is basically dictation based.
There's a links page with various speech-related sites, some of which no longer exist, here: http://www.linux-sound.org/speech.html (not all voice recognition though, much is synthesis).
Getting into the more specific, implementation-based sites, I have the already mentioned XVoice:
This looks to me more dictation based, unfortunately, and also relies on ViaVoice.
The Open-Source Speech Recognition Initiative site looks pretty dead, but the list appears to still be active and may be worth a look: http://www.ossri.org/
There's a site on Automated Speech Recognition that looks to be research based with some code available, although I've not quite managed to get my head around exactly what is going on there yet: http://www.isip.msstate.edu/projects/speech/software/index.html
There are a couple of sites on CMU Sphinx, which looks interesting, but I'm not sure whether it is able to work with desktop/application control or not - it probably depends how much development work you're willing/able to put in. There are two links:
Most promising of the lot looks to be PerlBox, which acts as a front end to the above CMU Sphinx system (amongst others) and, from a reference article, looks to be able to control the desktop to some extent with PerlBox Voice. It is customisable, but looks to be mainly application-launching based, so I'm not sure what is involved in getting more application control. It also looks to be of most use if you are using KDE.
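To give a feel for what "application launching based" voice control amounts to, here is a minimal sketch in Python. It is not PerlBox's code or configuration format (I have not looked at either); the phrase table, the command names and the helper functions are all made up for illustration. It also assumes the recogniser has already turned the audio into a text phrase, which is the hard part and is not shown.

```python
import shlex
import subprocess

# Hypothetical phrase-to-command table; the entries are illustrative only
# and are not taken from PerlBox or any other real tool.
COMMANDS = {
    "open terminal": "xterm",
    "open browser": "firefox",
    "open editor": "gedit --new-window",
}


def normalise(phrase):
    """Lower-case the phrase and collapse runs of whitespace."""
    return " ".join(phrase.lower().split())


def lookup(phrase, table=COMMANDS):
    """Return the argument list for a recognised phrase, or None if unknown."""
    command = table.get(normalise(phrase))
    return shlex.split(command) if command else None


def launch(phrase, table=COMMANDS, runner=subprocess.Popen):
    """Start the program mapped to the phrase; return True if one was found.

    The runner parameter is an assumption added here so the function can be
    tried out without actually starting any programs.
    """
    argv = lookup(phrase, table)
    if argv is None:
        return False
    runner(argv)
    return True
```

Calling `launch("open terminal")` would try to start `xterm`, while an unknown phrase simply returns `False`. Anything beyond launching, such as driving menus inside a running application, would need a good deal more than a lookup table like this.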
Hopefully the above links will be a good starting point for further investigation. I've not delved far into any of them yet, but given time (a commodity more elusive than the Scarlet Pimpernel!) I may. I've just got to get sound working on my system first. I'm afraid I've never seen it as a high priority, and I'm not a boxed Linux user, so it's not thrown on by default - my systems are mainly CLI based only or a somewhat customised Debian desktop!