Posted by Dahun Kim and Weicheng Kuo, Research Scientists, Google
The ability to detect objects in the visual world is crucial for computer vision and machine intelligence, enabling applications like adaptive autonomous agents and versatile shopping systems. However, modern object detectors are limited by the manual annotations of their training data, resulting in a vocabulary size significantly smaller than the vast array of objects encountered in reality. To overcome this, the open-vocabulary detection (OVD) task has emerged, utilizing image-text pairs for training and incorporating new category names at test...
Posted by Susanna Ricco and Utsav Prabhu, co-leads, Perception Fairness Team, Google Research
Google’s Responsible AI research is built on a foundation of collaboration...
Posted by Sergio Boixo and Vadim Smelyanskiy, Principal Scientists, Google Quantum AI Team
A full-scale error-corrected quantum computer will be able to solve some...
Posted by Hattie Zhou, Graduate Student at MILA, and Hanie Sedghi, Research Scientist, Google
Large language models (LLMs), such as GPT-3 and PaLM, have shown...
Posted by Wenhao Yu and Fei Xia, Research Scientists, Google
Empowering end-users to interactively teach robots to perform novel tasks is a crucial capability...
Posted by Catherine Armato, Program Manager, Google
This week, the 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023) is being held...
Posted by Ziniu Hu, Student Researcher, and Alireza Fathi, Research Scientist, Google Research, Perception Team
There has been great progress towards adapting large language...
Posted by Hussein Hazimeh, Research Scientist, Athena Team, and Riade Benbaki, Graduate Student at MIT
Modern neural networks have achieved impressive performance across a...
Posted by Eltayeb Ahmed, Research Engineer, and Subhrajit Roy, Senior Research Scientist, Google Research
Reading has many benefits for young students, such as better...
Posted by Sandeep Tata, Software Engineer, Google Research, Athena Team
The last few years have seen rapid progress in systems that can automatically process...