Monday, September 9, 2024

Artificial Intelligence news

Roblox is launching a...

Roblox plans to roll out a generative AI tool that will let...

What this futuristic Olympics...

The Olympic Games in Paris just finished last month and the Paralympics...

AI’s impact on elections...

This year, close to half the world’s population has the opportunity to...

Here’s how ed-tech companies...

This story is from The Algorithm, our weekly newsletter on AI. To...
HomeMachine LearningIntelligent Assistant Language...

Intelligent Assistant Language Understanding On-device



It has recently become feasible to run personal digital assistants on phones and other personal devices. In this paper, we describe a design for a natural language understanding system that runs on-device. In comparison to a server-based assistant, this system is more private, more reliable, faster, more expressive, and more accurate. We describe what led to key choices about architecture and technologies. For example, some approaches in the dialog systems literature are difficult to maintain over time in a deployment setting. We hope that sharing learnings from our practical experiences may…



Article Source link and Credit

Continue reading

Positional Description for Numerical Normalization

We present a Positional Description Scheme (PDS) tailored for digit sequences, integrating placeholder value information for each digit. Given the structural limitations of subword tokenization algorithms, language models encounter critical Text Normalization (TN) challenges when handling numerical tasks....

AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition

Audio-visual speech contains synchronized audio and visual information that provides cross-modal supervision to learn representations for both automatic speech recognition (ASR) and visual speech recognition (VSR). We introduce continuous pseudo-labeling for audio-visual speech recognition (AV-CPL), a semi-supervised method...

Novel-View Acoustic Synthesis From 3D Reconstructed Rooms

We investigate the benefit of combining blind audio recordings with 3D scene information for novel-view acoustic synthesis. Given audio recordings from 2-4 microphones and the 3D geometry and material of a scene containing multiple unknown sound sources, we...