News
The Sequence AI of the Week #867: Thinking in Latents: Why Sapient's HRM-Text Is a Quiet Rebuke to Chain-of-Thought
6+ hour, 23+ min ago (310+ words) The Sequence The Sequence AI of the Week #867: Thinking in Latents: Why Sapient's HRM-Text Is a Quiet Rebuke to Chain-of-Thought One of the most impressive small models recently released. There is a particular sleight-of-hand at the heart of modern LLM…...
The Sequence Radar #861: Last Week in AI: IPOs, Interactive Models, and Recursive Dreams
1+ week, 3+ day ago (575+ words) We continue our series about trasnformer alternatives. In our opinion section, we discuss the thesis that "every agent needs a computer" In the AI of the Week section, we dive into Thinking Machines" interactive models which can think and listen…...
The Sequence AI of the Week #859: Reading Claude's Mind in English: A Note on Natural Language Autoencoders
2+ week, 5+ hour ago (205+ words) The Sequence The Sequence AI of the Week #859: Reading Claude's Mind in English: A Note on Natural Language Autoencoders Anthropic's fascinating new papers for the future of AI interpretability. There is a recurring fantasy in interpretability work, somewhere between a…...
The Sequence Knowledge #858: How State Space Models Went from Curiosity to Serious Transformer Competitor
2+ week, 1+ day ago (278+ words) The Sequence The Sequence Knowledge #858: How State Space Models Went from Curiosity to Serious Transformer Competitor Inside the core ideas, potential and challenges of SSMs " AI Concept of the Day: How State Space Models Went from Curiosity to Serious Transformer…...
The Sequence Radar #857: Last Week in AI: Inside the Machine, Outside the Text Box
2+ week, 3+ day ago (534+ words) We continue our series about alternatives to transformers. In the AI of the week, we dive into Anthropic's groundbreaking paper about natural language autoencoders. Our opinion section dives into an interesting idea: every company's last exam. So the week's lesson…...
The Sequence Knowledge #854: Return of the King: Unrolling the x LSTM Architecture
3+ week, 1+ day ago (162+ words) The Sequence The Sequence Knowledge #854: Return of the King: Unrolling the x LSTM Architecture An unexpected alternative to transformers. " AI Concept of the Day: Return of the King: Unrolling the x LSTM Architecture If you were training sequence models circa…...
The Sequence Knowledge #846: Beyond Transformer: A New Series
1+ mon, 6+ day ago (142+ words) The Sequence The Sequence Knowledge #846: Beyond Transformer: A New Series Let's explore every major viable alternative to the transformer architecture. " AI Concept of the Day: Beyond Transformer: A New Series If you have been watching the ar Xiv firehose lately,…...
The Sequence AI of the Week #843: The AI We Built But Can't Release: A Practical View Into the Claude Mythos Preview
1+ mon, 1+ week ago (154+ words) The Sequence The Sequence AI of the Week #843: The AI We Built But Can't Release: A Practical View Into the Claude Mythos Preview Some technical insides into Project Glasswing and Claude Mythos Welcome to another edition of The Sequence. Today,…...
The Sequence AI of the Week #839: Gemma 4 and the Compression of Intelligence
1+ mon, 2+ week ago (192+ words) The Sequence The Sequence AI of the Week #839: Gemma 4 and the Compression of Intelligence The model represents an impressive open source release. There is a recurring pattern in AI progress. First, a capability appears at the frontier in a form…...
The Sequence Knowledge #833: How to Build a World Model
1+ mon, 3+ week ago (139+ words) thesequence. substack. com The Sequence Knowledge #833: How to Build a World Model Inside the techniques that powers the latest generation of world models. " AI Concept of the Day: How to Build a World Model World models are the workaround. You…...