Machine Learning Evaluating Evaluation Metrics — The Mirage of Hallucination Detection AI News Team October 27, 2025 Hallucinations pose a significant obstacle to the reliability and widespread adoption of language models, yet…
Machine Learning Memory-Efficient Backpropagation for Fine-Tuning LLMs on Resource-Constrained Mobile Devices AI News Team October 27, 2025 Fine-tuning large language models (LLMs) with backpropagation — even for a subset of parameters such…
Machine Learning Inductive Domain Transfer In Misspecified Simulation-Based Inference AI News Team October 27, 2025 Simulation-based inference (SBI) is a statistical inference approach for estimating latent parameters of a physical…
Machine Learning Breaking Down Video LLM Benchmarks: Knowledge, Spatial Perception, or True Temporal Understanding? AI News Team October 27, 2025 This paper was accepted at the Evaluating the Evolving LLM Lifecycle Workshop at NeurIPS 2025.…
Machine Learning ODKE+: Ontology-Guided Open-Domain Knowledge Extraction with LLMs AI News Team October 27, 2025 Knowledge graphs (KGs) are foundational to many AI applications, but maintaining their freshness and completeness…
Machine Learning PrimeX: A Dataset of Worldview, Opinion, and Explanation AI News Team October 27, 2025 As the adoption of language models advances, so does the need to better represent individual…
Machine Learning Introducing vibe coding in Google AI Studio AI News Team October 26, 2025 Introducing vibe coding in Google AI Studio
Machine Learning Bias after Prompting: Persistent Discrimination in Large Language Models AI News Team October 25, 2025 A dangerous assumption that can be made from prior work on the bias transfer hypothesis…
Machine Learning Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping AI News Team October 24, 2025 We revisit scene-level 3D object detection as the output of an object-centric framework capable of…
Machine Learning New updates and more access to Google Earth AI AI News Team October 23, 2025 Earth AI is helping enterprises and cities with everything from environmental monitoring to disaster response.