AV Robot Command recognition @ ICMI & ICASSP

Xavier Alameda-Pineda, Jordi Sanchez and Radu Horaud

We investigated the problem of choosing a classifier for audio-visual command recognition. Because such commands are culture- and user-dependant, methods need to learn new commands from a few examples. We benchmark three state-of-the-art discriminative classifiers based on bag of words and SVM. The comparison is made on monocular and monaural recordings of a publicly available dataset. We seek for the best trade off between speed, robustness and size of the training set. In the light of over 150,000 experiments, we conclude that this is a promising direction of work towards a flexible methodology that must be easily adaptable to a large variety of users.

Would you be so kind to help the @TheOfficialACM #SIGMM Records team by filling this short survey about #ACM #Multimedia? This feedback and opinion will be very valuable! 😉 https://t.co/8ZahCffpCV #ACMMM @ACM_MM2018 @ACMMM19 cc @mad_astronaut @xavirema