Content Archived

This content is no longer current. Our recommendation for up-to-date content:

Audio Fundamentals (Beta 2 SDK)

The Discussion

  • This is awesome!!

  • But what about other languages, such as German, French, or Spanish?

    Are these supported?

  • How about a brief code sample of how general dictation might be used? When I try to modify the sample code to add


    It crashes on


    I've searched high and low for a solution, but it appears this is a general issue with the speech API (the dictation features don't work), so why is it there? Any help is much appreciated.



  • George Birbilis

    typo: visaul -> visual

  • @George Birbilis: fixed the typo

  • @TheZar: are you using the x86 or x64 speech APIs?

  • Very good tutorial, but do you have the code for speech recognition? Thanks.

  • Is there any good resource out there for learning the SRGS XML format? The W3C specification is too... specificationy, and all the tutorials I've found so far deal with the ABNF format rather than the XML format.

  • Hi, thanks for sharing such a good tutorial. Personally, though, I find it is not so difficult to record streaming audio from a microphone with a standalone audio recorder rather than a built-in one.

  • Hiva Javaher

    I'm trying to get both speech recognition and text-to-speech to work in a WPF app (C#).
    I have the recognition working, but the synthesizer part keeps giving the error "No voice installed on the system or none available with the current security setting."
    I have both "Microsoft Speech Platform - Software Development Kit (SDK) (Version 10.2)" and "Microsoft Speech Platform - Server Runtime (Version 10.2)" in x86 and x64 installed on my system.

    Can anyone tell me what's wrong? I would really appreciate it.


  • I am trying to add speech recognition to a WPF C# app. I am receiving video, skeletal, and depth data correctly, but whenever I start capturing the audio I receive the exception below. I can run the demo above correctly. Is there a reference or an extra step needed when using WPF?


    System.InvalidCastException was unhandled
      Message=Unable to cast COM object of type 'System.__ComObject' to interface type 'Microsoft.Research.Kinect.Audio.IMediaObject'. This operation failed because the QueryInterface call on the COM component for the interface with IID '{D8AD0F58-5494-4102-97C5-EC798E59BCF4}' failed due to the following error: No such interface supported (Exception from HRESULT: 0x80004002 (E_NOINTERFACE)).
           at System.StubHelpers.StubHelpers.GetCOMIPFromRCW(Object objSrc, IntPtr pCPCMD, Boolean& pfNeedsRelease)
           at Microsoft.Research.Kinect.Audio.IMediaObject.ProcessOutput(Int32 dwFlags, Int32 cOutputBufferCount, DMO_OUTPUT_DATA_BUFFER[] pOutputBuffers, Int32& pdwStatus)
           at Microsoft.Research.Kinect.Audio.KinectAudioStream.RunCapture(Object notused)
           at System.Threading.ThreadHelper.ThreadStart_Context(Object state)
           at System.Threading.ExecutionContext.Run(ExecutionContext executionContext, ContextCallback callback, Object state, Boolean ignoreSyncCtx)
           at System.Threading.ExecutionContext.Run(ExecutionContext executionContext, ContextCallback callback, Object state)
           at System.Threading.ThreadHelper.ThreadStart(Object obj)

  • For some reason I only have the Microsoft Lightweight Speech Recognizer v11.0 (SR_MS_ZXX_Lightweight_v11.0) showing up as an available speech recognizer. I've double-checked that I have everything installed correctly, and I'm referencing C:\Program Files\Microsoft SDKs\Speech\v11.0\Assembly\Microsoft.Speech.dll. Any ideas why I don't see the Kinect recognizer?
