News

arXiv.org
arxiv.org > abs > 2603.05488

Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought

5+ hour, 2+ min ago  (258+ words) We provide evidence of performative chain-of-thought (CoT) in reasoning models, where a model becomes strongly confident in its final answer, but continues generating tokens without revealing its internal belief. Our analysis compares activation probing, early forced answering, and a CoT…

arXiv.org
arxiv.org > abs > 2602.16800

Large-scale online deanonymization with LLMs

3+ week, 11+ hour ago  (343+ words) We show that large language models can be used to perform at-scale deanonymization. With full Internet access, our agent can re-identify Hacker News users and Anthropic Interviewer participants at high precision, given pseudonymous online profiles and conversations alone, matching what…

arXiv.org
arxiv.org > abs > 2512.15745

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

2+ mon, 3+ week ago  (241+ words) This paper presents LLaDA2.0 -- a tuple of discrete diffusion large language models (dLLM) scaling up to 100B total parameters through systematic conversion from auto-regressive (AR) models -- establishing a new paradigm for frontier-scale deployment. Instead of costly training from scratch, LLaDA2.0 upholds knowledge inheritance, progressive…

arXiv.org
arxiv.org > abs > 2512.08269

EgoX: Egocentric Video Generation from a Single Exocentric Video

2+ mon, 4+ week ago  (254+ words) Egocentric perception enables humans to experience and understand the world directly from their own point of view. Translating exocentric (third-person) videos into egocentric (first-person) videos opens up new possibilities for immersive understanding but remains highly challenging due to extreme camera…

arXiv.org
arxiv.org > abs > 2512.09742

Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs

3+ mon, 14+ hour ago  (328+ words) LLMs are useful because they generalize so well. But can you have too much of a good thing? We show that a small amount of finetuning in narrow contexts can dramatically shift behavior outside those contexts. In one experiment, we…

arXiv.org
arxiv.org > abs > 2509.21155

Learning the Wrong Lessons: Syntactic-Domain Spurious Correlations in Language Models

3+ mon, 1+ week ago  (86+ words)

arXiv.org
arxiv.org > abs > 2510.15510

Exploring Conditions for Diffusion models in Robotic Control

4+ mon, 2+ week ago  (100+ words)

arXiv.org
arxiv.org > html > 2510.15511v3

Language Models are Injective and Hence Invertible - GLADIA Research

4+ mon, 2+ week ago  (1674+ words) In this paper, we show that this intuition is misleading. Despite their apparent complexity, standard decoder-only Transformer language models (seen as maps from prompts to hidden states) are in fact almost-surely injective; for essentially all parameter settings and during the…

arXiv.org
arxiv.org > abs > 2510.04721

BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs

4+ mon, 2+ week ago  (86+ words)

arXiv.org
arxiv.org > abs > 2510.01171

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

4+ mon, 4+ week ago  (86+ words)