Start:
10:00
End:
11:00
Venue:
Campus Michel-Ange CNRS
City:
Paris
Producer:
-

Duration:
12:10
Type:
video/mp4
Size:
76.82 MB
Format:
mp4
Resolution:
768x576
Codec:
-

Session 4: Speech Recognition, Machine Translation and Gesture Localisation

In this paper we survey the state of the art in proprietary and free and open source software (FOSS) tools for automatic speech recognition (ASR), speech synthesis, and machine translation (MT). We also address the need for multimodal communication that includes gestures, and give some examples of 3D gesture recognition software. Our current experiment is based on interoperability between FOSS ASR, MT, and text-to-speech applications; future experiments will add gesture recognition tools. Our application environment is an ambient assisted living lab at the University of Bremen, suitable for elderly people and/or people with impairments. In short, our goal is to provide a single, uniform multimodal interface that combines FOSS speech processing, MT, and gesture recognition tools for people in need.

D. Anastasiou, University of Bremen
