Suppressing unintended invocation of the device because of the speech that sounds like wake-word, or accidental button presses, is critical for a good...
Existing novice-friendly machine learning (ML) modeling tools center around a solo user experience, where a single user collects only their own data to...
Existing novice-friendly machine learning (ML) modeling tools center around a solo user experience, where a single user collects only their own data to...
The adoption of multimodal interactions by Voice Assistants (VAs) is growing rapidly to enhance human-computer interactions. Smartwatches have now incorporated trigger-less methods of...
On-device automatic speech recognition systems face several challenges compared to server-based systems. They have to meet stricter constraints in terms of speed, disk...
Multimodal learning is defined as learning over multiple heterogeneous input modalities such as video, audio, and text. In this work, we are concerned...
*= Equal Contributors
Applications of large open-domain knowledge graphs (KGs) to real-world problems pose many unique challenges. In this paper, we present extensions to...