vision-language models

Mamba‑3: Open Source State Space Models Challenge Transformers on Speed, Cost and Reasoning

Cary Huang
March 20, 2026
News, Technology

Since Google’s 2017 “Attention Is All You Need” paper, the Transformer architecture has been the default foundation for large language models (LLMs). It powers systems like ChatGPT and Gemini, but its quadratic compute and heavy memory footprint make large-scale inference… Read More »Mamba‑3: Open Source State Space Models Challenge Transformers on Speed, Cost and Reasoning

MIT’s Recursive Language Models: A Systems Approach to 10M-Token Contexts

Cary Huang
January 23, 2026January 23, 2026
News, Orchestration

MIT CSAIL researchers are proposing a different answer to the long-context problem in large language models: don’t keep stretching the context window—change how the system uses it. Their new Recursive Language Models (RLMs) framework treats long prompts as an external… Read More »MIT’s Recursive Language Models: A Systems Approach to 10M-Token Contexts

NeurIPS 2025: Five Papers That Show Why AI Progress Is Now Systems-Limited

Cary Huang
January 20, 2026January 20, 2026
Infrastructure, Machine Learning, News, Orchestration

NeurIPS has long been the place where new architectures, training tricks and evaluation benchmarks quietly change how real systems are built. The 2025 edition continued that pattern — but with a sharper message for anyone working on LLMs, agentic systems… Read More »NeurIPS 2025: Five Papers That Show Why AI Progress Is Now Systems-Limited

Nvidia’s Cosmos Reason 2 and Nemotron Upgrades Push ‘Physical AI’ Beyond the Chatbox

Cary Huang
January 7, 2026January 7, 2026
News, Orchestration

Nvidia is using CES 2026 to sharpen its pitch for what CEO Jensen Huang has called the “age of physical AI” – systems where agentic models don’t just reason over text or code, but drive robots, vehicles and other devices… Read More »Nvidia’s Cosmos Reason 2 and Nemotron Upgrades Push ‘Physical AI’ Beyond the Chatbox