Saturday, October 12, 2024

Artificial Intelligence news

Google DeepMind wins joint...

In a second Nobel win for AI, the Royal Swedish Academy of...

Adobe wants to make...

Adobe has announced a new tool to help creators watermark their artwork...

Geoffrey Hinton just won...

Geoffrey Hinton, a computer scientist whose pioneering work on deep learning in...

Geoffrey Hinton just won...

Geoffrey Hinton, a computer scientist whose pioneering work on deep learning in...
HomeMachine LearningTowards Real-World Streaming...

Towards Real-World Streaming Speech Translation for Code-Switched Speech



This paper was accepted at the EMNLP Workshop on Computational Approaches to Linguistic Code-Switching (CALCS).
Code-switching (CS), i.e. mixing different languages in a single sentence, is a common phenomenon in communication and can be challenging in many Natural Language Processing (NLP) settings. Previous studies on CS speech have shown promising results for end-to-end speech translation (ST), but have been limited to offline scenarios and to translation to one of the languages present in the source (monolingual transcription).
In this paper, we focus on two essential yet unexplored areas…



Article Source link and Credit

Continue reading

On the Limited Generalization Capability of the Implicit Reward Model Induced by Direct Preference Optimization

Reinforcement Learning from Human Feedback (RLHF) is an effective approach for aligning language models to human preferences. Central to RLHF is learning a reward function for scoring human preferences. Two main approaches for learning a reward model are...

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

We present a foundation model for zero-shot metric monocular depth estimation. Our model, Depth Pro, synthesizes high-resolution depth maps with unparalleled sharpness and high-frequency details. The predictions are metric, with absolute scale, without relying on the availability of...

Improving How Machine Translations Handle Grammatical Gender Ambiguity

Machine Translation (MT) enables people to connect with others and engage with content across language barriers. Grammatical gender presents a difficult challenge for these systems, as some languages require specificity for terms that can be ambiguous or neutral...