Adopting the Trajectory Level Aggregation for Faster Training

Adopting the Trajectory Level Aggregation for Faster Training Agent Lightning (AGL) Team Date: Dec. 2025 1. Introduction In the context of Multi-turn Agent Reinforcement Learning (RL), data collection relies on rollouts where an agent interacts with an environment over multiple sequential turns. The strategy used to process these rollouts into training samples is a critical architectural decision that fundamentally impacts both training efficiency and model performance. Currently, Agent Lightning supports two primary strategies for aggregating these interaction traces: Transition Aggregation and Trajectory Aggregation. ...

December 17, 2025 · 10 min

Tinker X Agent Lightning

Tuning ANY AI agent with Tinker X Agent-lightning Yuge Zhang Nov. 2025 Tinker is the first product built by an all-star company called Thinking Machine Lab, whose team members come from leading organizations such as OpenAI. Notable members include former OpenAI CTO Mira Murati; John Schulman, the first author of PPO; Barret Zoph, a leading scientist in AutoML (the area I previously worked in); and well-known Asian researchers like Danqi Chen and Lilian Weng. ...

November 19, 2025 · 32 min