[1] Gemini Robotics Team, “Gemini Robotics: Bringing AI into the Physical World.” arXiv, 2025.
[2] NVIDIA, “GR00T N1: An Open Foundation Model for Generalist Humanoid Robots.” arXiv, 2025.
[3] Figure AI, “Helix: A Vision-Language-Action Model for Generalist Humanoid Control.” Figure AI, 2025.
[4] Physical Intelligence, “π?: A Vision-Language-Action Flow Model for General Robot Control.” arXiv, 2024.
[5] Physical Intelligence, “π?.?: A Vision-Language-Action Model with Open-World Generalization.” arXiv, 2025.
[6] Physical Intelligence, “π*?.?: A VLA That Learns From Experience.” arXiv, 2025.
[7] Physical Intelligence, “π?.?: A Steerable Generalist Robotic Foundation Model with Emergent Capabilities.” arXiv, 2026.
[8] D. Kim et al., “RLDX-1 Technical Report.” arXiv, 2026.
[9] S. Ye et al., “World Action Models are Zero-shot Policies.” arXiv, 2026.
[10] S. Gao et al., “DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos.” arXiv, 2026.
[11] N. Agarwal et al., “Cosmos World Foundation Model Platform for Physical AI.” arXiv, 2025.
[12] A. Ali et al., “World Simulation with Video Foundation Models for Physical AI.” arXiv, 2025.