Wednesday, November 6, 2024

FineRecon: Depth-aware Feed-forward Network for Detailed 3D Reconstruction



Recent works on 3D reconstruction from posed images have demonstrated that direct inference of scene-level 3D geometry without iterative optimization is feasible using a deep neural network, showing remarkable promise and high efficiency. However, the reconstructed geometries, typically represented as a 3D truncated signed distance function (TSDF), are often coarse without fine geometric details. To address this problem, we propose three effective solutions for improving the fidelity of inference-based 3D reconstructions. We first present a resolution-agnostic TSDF supervision strategy to…
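
For readers unfamiliar with the representation named above, the following is a minimal sketch of what a TSDF stores, using an analytic sphere sampled on a small voxel grid. It is not the FineRecon method or its supervision strategy. The function names (`sphere_tsdf`, `sample_grid`), the sphere, the truncation distance, and the sign convention (positive outside the surface) are all assumptions made for illustration.

```python
import math


def sphere_tsdf(point, center, radius, truncation):
    """Truncated signed distance from `point` to a sphere surface, scaled to [-1, 1].

    Assumed convention (not stated in the source): positive outside the
    surface, negative inside.
    """
    signed_distance = math.dist(point, center) - radius
    # Clip to the truncation band so only values near the surface vary.
    clipped = max(-truncation, min(truncation, signed_distance))
    return clipped / truncation


def sample_grid(resolution, extent=1.0):
    """Yield voxel-center coordinates of a cubic grid spanning [-extent, extent]^3."""
    step = 2.0 * extent / resolution
    for i in range(resolution):
        for j in range(resolution):
            for k in range(resolution):
                yield tuple(-extent + (idx + 0.5) * step for idx in (i, j, k))


# Hypothetical scene: a sphere of radius 0.5 at the origin, truncation 0.1.
tsdf_values = [
    sphere_tsdf(p, (0.0, 0.0, 0.0), 0.5, 0.1) for p in sample_grid(8)
]
```

Because the values here are computed from continuous point coordinates instead of being read from a fixed grid, the same function can be queried at any sampling density. A coarse grid such as the 8×8×8 one above loses most of the surface detail, which illustrates the general coarseness problem the abstract describes.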



