Adopting the Trajectory Level Aggregation for Faster Training
Adopting the Trajectory Level Aggregation for Faster Training Agent Lightning (AGL) Team Date: Dec. 2025 1. Introduction In the context of Multi-turn Agent Reinforcement Learning (RL), data collection relies on rollouts where an agent interacts with an environment over multiple sequential turns. The strategy used to process these rollouts into training samples is a critical architectural decision that fundamentally impacts both training efficiency and model performance. Currently, Agent Lightning supports two primary strategies for aggregating these interaction traces: Transition Aggregation and Trajectory Aggregation. ...