News
Empirical Research Assistance (ERA): From Nature publication to catalyzing Computational Discovery
2+ hour, 49+ min ago (275+ words) Published today in Nature, Empirical Research Assistance (ERA) is an AI tool for expert-level scientific coding that helped build the Computational Discovery prototype, now available through a trusted tester program in Google Labs. As part of our wider science announcements…...
Improving the academic workflow: Introducing two AI agents for better figures and peer review
1+ mon, 1+ week ago (345+ words) Jinsung Yoon, Research Scientist, and Tomas Pfister, Director, Google Cloud. Introducing two AI agents to streamline academic research. These include: Paper Viz Agent, a visualizer agent for drawing academic figures, and Scholar Peer, a reviewer agent that automatically and rigorously…...
Building better AI benchmarks: How many raters are enough?
1+ mon, 2+ week ago (563+ words) Flip Korn and Chris Welty, Research Scientists, Google Research. We introduce an evaluation framework for ML models, based on "gold" ratings data, that optimizes the trade-off between the number of items and raters per item, providing a roadmap for building…...
Exploring the feasibility of conversational diagnostic AI in a real-world clinical study
2+ mon, 1+ week ago (417+ words) We present insights from a first-of-its-kind research study in partnership with Beth Israel Deaconess Medical Center towards prospective real-world assessment of AMIE, our conversational medical AI for clinical reasoning and dialogue. Translating AI systems into clinical practice requires assessment in…...
Teaching LLMs to reason like Bayesians
2+ mon, 2+ week ago (444+ words) Sjoerd van Steenkiste and Tal Linzen, Research Scientists, Google Research. We teach LLMs to reason in a Bayesian manner by training them to mimic the predictions of an optimal Bayesian model. The goal of the assistant was to recommend the…...
Towards a science of scaling agent systems: When and why agent systems work
3+ mon, 3+ week ago (720+ words) We strive to create an environment conducive to many different types of research across many different time scales and levels of risk. Our researchers drive advancements in computer science through both fundamental and applied research. We regularly open-source projects with…...
ATLAS: Practical scaling laws for multilingual models
3+ mon, 3+ week ago (483+ words)
Small models, big results: Achieving superior intent extraction through decomposition
3+ mon, 4+ week ago (635+ words)
Next generation medical image interpretation with MedGemma 1.5 and medical speech to text with MedASR
4+ mon, 6+ day ago (978+ words)
Introducing Nested Learning: A new ML paradigm for continual learning
6+ mon, 1+ week ago (633+ words)