Saturday, January 25, 2025

Application-Agnostic Language Modeling for On-Device ASR



On-device automatic speech recognition (ASR) systems face several challenges compared to server-based systems. They must meet stricter constraints on speed, disk size, and memory while maintaining the same accuracy. They often have to serve several applications with different distributions at once, such as communicating with a virtual assistant and speech-to-text. The simplest way to serve multiple applications is to build application-specific (language) models, but this increases memory use. Therefore, we explore different data- and architecture-driven language modeling…
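To make the memory trade-off concrete, here is a minimal sketch that compares the total footprint of shipping one language model per application against shipping a single shared model. The function name, the application names, and the sizes are hypothetical placeholders chosen for illustration; they are not figures or identifiers from the paper.

```python
def total_footprint_mb(model_sizes_mb):
    """Sum the size of every language model that ships on the device."""
    return sum(model_sizes_mb.values())


# Hypothetical sizes in megabytes, for illustration only.
# One model per application: the footprint grows with the number of applications.
application_specific = {"assistant": 50.0, "speech_to_text": 50.0}

# One shared (application-agnostic) model serving both applications.
application_agnostic = {"shared": 50.0}

specific_mb = total_footprint_mb(application_specific)
agnostic_mb = total_footprint_mb(application_agnostic)

print(f"application-specific: {specific_mb} MB")
print(f"application-agnostic: {agnostic_mb} MB")
```

Under these assumed sizes, the per-application setup costs twice as much as the shared one, and the gap widens with each additional application. The sketch says nothing about accuracy, which is the other half of the trade-off the abstract describes.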



