Monday, January 13, 2025

Artificial Intelligence news

Anthropic’s chief scientist on...

Agents are the hottest thing in tech right now. Top firms from...

What’s next for AI...

MIT Technology Review’s What’s Next series looks across industries, trends, and technologies...

How optimistic are you...

This story originally appeared in The Algorithm, our weekly newsletter on AI....

AI means the end...

We all know what it means, colloquially, to google something. You pop...
HomeMachine LearningApproximate Nearest Neighbor...

Approximate Nearest Neighbor Phrase Mining for Contextual Speech Recognition



This paper presents an extension to train end-to-end Context-Aware Transformer Transducer ( CATT ) models by using a simple, yet efficient method of mining hard negative phrases from the latent space of the context encoder. During training, given a reference query, we mine a number of similar phrases using approximate nearest neighbour search. These sampled phrases are then used as negative examples in the context list alongside random and ground truth contextual information. By including approximate nearest neighbour phrases (ANN-P) in the context list, we encourage the learned representation…



Article Source link and Credit

Continue reading

Accelerating LLM Inference on NVIDIA GPUs with ReDrafter

Accelerating LLM inference is an important ML research problem, as auto-regressive token generation is computationally expensive and relatively slow, and improving inference efficiency can reduce latency for users. In addition to ongoing efforts to accelerate inference on Apple...

ARMADA: Augmented Reality for Robot Manipulation and Robot-Free Data Acquisition

Teleoperation for robot imitation learning is bottlenecked by hardware availability. Can high-quality robot data be collected without a physical robot? We present a system for augmenting Apple Vision Pro with real-time virtual robot feedback. By providing users with...

BayesCNS: A Unified Bayesian Approach to Address Cold Start and Non-Stationarity in Search Systems at Scale

Information Retrieval (IR) systems used in search and recommendation platforms frequently employ Learning-to-Rank (LTR) models to rank items in response to user queries. These models heavily rely on features derived from user interactions, such as clicks and engagement...