Hi, I made a similar demo with Paolo Patierno.

We developed it in C#, using the Kinect for Windows SDK, the Windows Speech APIs, and the .NET Framework for the client, and the .NET Micro Framework on the Netduino board (the robot).

Gesture Recognition Video:
http://vimeo.com/58336449

Speech Recognition Video:
http://vimeo.com/58336020

The speech commands in the second video are in Neapolitan (recognized by UNESCO as a language and a heritage).