MiniMax M2.7: A Self‑Evolving Chinese LLM Aiming to Automate Reinforcement Learning Research
MiniMax’s new M2.7 large language model is not just another entrant in the crowded frontier-model race. The Shanghai-based startup is using M2.7 and its predecessors as active participants in their own training loop, automating 30–50% of the reinforcement learning (RL)… Read More »MiniMax M2.7: A Self‑Evolving Chinese LLM Aiming to Automate Reinforcement Learning Research









