Posted by
Andreas Maier on
Aug 03, 2010; 11:45pm
URL: http://imagej.273.s1.nabble.com/Voice-Recognition-tp3684612p3684618.html
Hi Alejandro,
> This is designed for doing experiments on cell cultures. I also will be
> saving images as tiff files. Is there an image format that can use audio
> data?
>
I guess one could store some information in the private fields of the
tiff header. Maybe it would be better to reference the location of an
additional audio file... I'm no expert on image formats.
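A minimal sketch of the "reference an additional audio file" idea, assuming the audio is simply stored next to the image under the same base name. The class name AudioSidecar, the method audioPathFor and the .wav extension are illustrative choices, not part of ImageJ:

```java
import java.nio.file.Path;
import java.nio.file.Paths;

class AudioSidecar {
    /**
     * Returns the path of the audio file that accompanies an image:
     * same directory, same base name, ".wav" extension (an assumed convention).
     */
    static Path audioPathFor(Path image) {
        String name = image.getFileName().toString();
        int dot = name.lastIndexOf('.');
        // Strip the extension if there is one; a leading dot is not treated as an extension.
        String base = dot > 0 ? name.substring(0, dot) : name;
        return image.resolveSibling(base + ".wav");
    }

    public static void main(String[] args) {
        System.out.println(audioPathFor(Paths.get("data", "plate1.tif")));
    }
}
```

The resulting path (or just its file name) is what one would then write into the image's metadata, so the tiff only carries a reference and the audio stays in its own file.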
> As for voice recording in Java, do you mean just recording voice data and
> saving as a .wav file? I have to look closely to see how to use Dragon
> Naturally Speaking.
>
I guess you could just write your own speech recorder using the Java
Sound API:
http://java.sun.com/products/java-media/sound/
Alternatively, you could also use an open-source recorder such as:
http://www.quadmore.com/swingrecorder/
Both should give you the capability to record audio via Java.
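As a rough sketch of the "own recorder" route, the standard javax.sound.sampled package can capture from the default microphone and write a .wav file. The class name VoiceNoteRecorder, the method names and the 16 kHz, 16-bit mono format are assumptions made for this example; the format your sound card or a speech recognizer expects may differ:

```java
import javax.sound.sampled.AudioFileFormat;
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioInputStream;
import javax.sound.sampled.AudioSystem;
import javax.sound.sampled.LineUnavailableException;
import javax.sound.sampled.TargetDataLine;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.File;
import java.io.IOException;

class VoiceNoteRecorder {
    // Assumed format: 16 kHz, 16-bit, mono, signed, little-endian PCM.
    static final AudioFormat FORMAT = new AudioFormat(16000f, 16, 1, true, false);

    /** Captures roughly the given number of milliseconds from the default microphone. */
    static byte[] record(long millis) throws LineUnavailableException {
        TargetDataLine line = AudioSystem.getTargetDataLine(FORMAT);
        line.open(FORMAT);
        line.start();
        ByteArrayOutputStream pcm = new ByteArrayOutputStream();
        byte[] buffer = new byte[4096]; // a whole number of 2-byte frames
        long end = System.currentTimeMillis() + millis;
        try {
            while (System.currentTimeMillis() < end) {
                int n = line.read(buffer, 0, buffer.length);
                pcm.write(buffer, 0, n);
            }
        } finally {
            line.stop();
            line.close();
        }
        return pcm.toByteArray();
    }

    /** Wraps raw PCM bytes in a WAVE container and writes them to the given file. */
    static void writeWav(byte[] pcm, File out) throws IOException {
        long frames = pcm.length / FORMAT.getFrameSize();
        try (AudioInputStream stream =
                 new AudioInputStream(new ByteArrayInputStream(pcm), FORMAT, frames)) {
            AudioSystem.write(stream, AudioFileFormat.Type.WAVE, out);
        }
    }
}
```

A five-second note would then be saved with `VoiceNoteRecorder.writeWav(VoiceNoteRecorder.record(5000), new File("note.wav"))`. Buffering everything in memory keeps the sketch short; for long recordings one would stream to disk instead.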
>
> Andreas Maier wrote:
>
>> as far as I remember, Dragon Naturally Speaking has a batch mode to
>> process prerecorded files. So basically you would need a macro or a
>> custom plug-in which invokes the Dragon Speech Reco engine, parses the
>> output and puts it into the meta header.
>>
>> In a custom plug-in you could even do the voice recording via Java. For
>> someone who knows the respective APIs, I guess it would be two or three
>> days of work to develop such a plug-in.
>>
>>
> I'm still pretty new to Java and ImageJ plugins but is it really possible to
> invoke the Dragon Speech Reco engine? Would you happen to know how to do
> that?
>
> Thanks for all your help
>
> Alejandro
>
I guess you could use the batch interface of Dragon Naturally Speaking.
There should be some command line interface which you can invoke via
Runtime.exec.
There is also a complete Java speech recognition engine available:
http://cmusphinx.sourceforge.net/sphinx4/
But I guess it probably doesn't come with the nice acoustic models that
are supplied with Dragon's speech recognition engine. (And it will most
likely not be able to read Dragon's models.)
In any case it's quite a bit of work. (Using the Java engine will be
even more work, but of course the most beautiful solution.)
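For the command-line route, here is a minimal sketch of calling an external program from Java and collecting its output. It uses ProcessBuilder, the newer counterpart of Runtime.exec, because it makes merging stdout and stderr easy. The class name BatchRecognizer is made up for this example, and the executable name and arguments in the usage note are pure placeholders: I don't know what Dragon's batch interface actually looks like.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.List;

class BatchRecognizer {
    /**
     * Runs an external command, waits for it to finish and returns
     * everything it printed (stdout and stderr merged), trimmed.
     */
    static String run(List<String> command) throws IOException, InterruptedException {
        ProcessBuilder builder = new ProcessBuilder(command);
        builder.redirectErrorStream(true); // read a single combined stream
        Process process = builder.start();
        StringBuilder output = new StringBuilder();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(process.getInputStream(), StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                output.append(line).append('\n');
            }
        }
        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("Command exited with code " + exitCode + ": " + output);
        }
        return output.toString().trim();
    }

    public static void main(String[] args) throws Exception {
        // Harmless demo command: ask the running JVM's launcher for its version.
        String java = System.getProperty("java.home") + "/bin/java";
        System.out.println(run(Arrays.asList(java, "-version")));
    }
}
```

With a real recognizer the call would look something like `run(Arrays.asList("recognizer.exe", "note.wav"))`, where both the program name and its argument are hypothetical. The returned text is what the plug-in would then parse and put into the meta header.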
Best,
Andreas
--
Dr.-Ing. Andreas Maier
Stanford University
Department of Radiology
The Lucas Center for Imaging
Mail Code 5488, Route 8
Stanford, CA 94305
http://med.stanford.edu/profiles/Andreas_Maier/