News

machinelearning. apple. com
machinelearning. apple. com > research > memoryllm

Memory LLM: Plug-n-Play Interpretable Feed-Forward Memory for Transformers

7+ hour, 30+ min ago  (119+ words) Authors Ajay Jaiswal, Lauren Hannah, Han-Byul Kim, Duc Hoang, Arnav Kundu, Mehrdad Farajtabar, Minsik Cho July 2, 2026research area Computer Visionconference ICML One Wide Feedforward is All You Need December 1, 2023research area Speech and Natural Language Processing Workshop at EMNLP This paper was…...

Symbols: nyse:xpev
machinelearning. apple. com
machinelearning. apple. com > research > anti-casual

Anti-Causal Domain Generalization: Leveraging Unlabeled Data

3+ hour, 23+ min ago  (85+ words) May 2, 2023research area Health, research area Methods and Algorithmsconference ICLR This paper was accepted at the workshop "Trustworthy Machine Learning for Healthcare Workshop" at the conference ICLR 2023. When analyzing robustness of predictive models under distribution shift, many works focus on tackling…...

Symbols: trg-rf,top-16
machinelearning. apple. com
machinelearning. apple. com > research > learning-structured-reasoning

Learning Structured Reasoning via Tractable Trajectory Control

7+ hour, 30+ min ago  (309+ words) Apple Machine Learning Research Learning Structured Reasoning via Tractable Trajectory Control Large language models can exhibit emergent reasoning behaviors, often manifested as recurring lexical patterns (e. g. , "wait," indicating verification). However, complex reasoning trajectories remain sparse in unconstrained sampling, and standard RL…...

Symbols: nasdaq:nvda
Google News
machinelearning. apple. com > research > spatial

From Where Things Are to What They're For: Benchmarking Spatial'Functional Intelligence for Multimodal LLMs - Apple Machine Learning Research

1+ mon, 3+ week ago  (330+ words) From Where Things Are to What They're For: Benchmarking Spatial'Functional Intelligence for Multimodal LLMs'Apple Machine Learning Research From Where Things Are to What They're For: Benchmarking Spatial'Functional Intelligence for Multimodal LLMs True spatial intelligence for multimodal agents transcends low-level geometric…...

Symbols: btc-usd,eth-usd
machinelearning. apple. com
machinelearning. apple. com > research > normalizing-flows-iterative-denoising

Normalizing Flows with Iterative Denoising

1+ mon, 3+ week ago  (125+ words) Apple Machine Learning Research Normalizing Flows with Iterative Denoising Authors Tianrong Chen, Jiatao Gu, David Berthelot, Joshua Susskind, Shuangfei Zhai Related readings and updates. STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis June 30, 2025research area Computer Vision, research area Methods…...

Symbols: btc-usd,eth-usd,xrp-usd
machinelearning. apple. com
machinelearning. apple. com > research > ladir

La Di R: Latent Diffusion Enhances LLMs for Text Reasoning

2+ mon, 3+ day ago  (134+ words) Authors Haoqiang Kang, Yizhe Zhang, Nikki Lijing Kuang, Nicklas Majamaki, Navdeep Jaitly, Yi-An Ma, Lianhui Qin Thinking into the Future: Latent Lookahead Training for Transformers March 25, 2026research area Methods and Algorithms Workshop at ICLR This paper was accepted at the Workshop…...

Symbols: btc-usd,eth-usd
machinelearning. apple. com
machinelearning. apple. com > research > large-scale-rnns

Para RNN: Large-Scale Nonlinear RNNs, Trainable in Parallel

2+ mon, 1+ week ago  (757+ words) To accelerate research in efficient sequence modeling and enable researchers and practitioners to explore new nonlinear RNN models at scale, the Para RNN codebase has been released as an open-source framework for automatic training-parallelization of nonlinear RNNs. The computational cost…...

Symbols: post-ln,btc-usd,eth-usd
machinelearning. apple. com
machinelearning. apple. com > research > iclr-2026

Apple Machine Learning Research at ICLR 2026

2+ mon, 1+ week ago  (461+ words) Apple researcher Stephan Richter presenting at ICLR 2025. During exhibition hours, attendees will be able to experience demonstrations of Apple's ML research in our booth #204, including local LLM inference on Apple silicon with MLX and Sharp Monocular View Synthesis in Less…...

Symbols: nasdaq:crwv
machinelearning. apple. com
machinelearning. apple. com > research > llm-context-understanding

Can Large Language Models Understand Context?

2+ mon, 1+ week ago  (309+ words) Apple Machine Learning Research Can Large Language Models Understand Context? Understanding context is key to understanding human language, an ability which Large Language Models (LLMs) have been increasingly seen to demonstrate to an impressive extent. However, though the evaluation of…...

machinelearning. apple. com
machinelearning. apple. com > research > lacy

La Cy: What Small Language Models Can and Should Learn is Not Just a Question of Loss

2+ mon, 3+ week ago  (149+ words) This paper was accepted at the Workshop on Memory for LLM-Based Agentic Systems at ICLR. Your LLM Knows the Future: Uncovering Its Multi-Token Prediction Potential August 8, 2025research area Methods and Algorithms, research area Speech and Natural Language Processing October 29, 2024research area Methods…...

Symbols: llms