InfGen: Scenario Generation as Next-Token-Group Prediction

Zhenghao Peng , Yuxin Liu , Bolei Zhou

University of California, Los Angeles

Image

InfGen is a unified transformer that treats the entire driving scene—map, lights, agent states and motions—as one long token stream and rolls it out autoregressively.

    :traffic_light: Generates both initial states, traffic light signals, and traffic participant trajectories in a single model.

    :mag: Supports dynamic agent injection for open-world, long-horizon simulation.

    :rocket: Autonomous driving planners trained in InfGen scenarios show ↑ robustness & generalization over log-replay baselines.