[1] Jaech, Aaron, et al. "Openai o1 system card." arXiv preprint arXiv:2412.16720 (2024).
[2] Guo, Daya, et al. "Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning." arXiv preprint arXiv:2501.12948 (2025).
[3] OpenAI, "Introducing deep research." https://openai.com/index/introducing-deep-research
[4] Li, Xiaoxi, et al. "Search-o1: Agentic search-enhanced large reasoning models." arXiv preprint arXiv:2501.05366 (2025).
[5] Yao, Zijun, et al. "Are Reasoning Models More Prone to Hallucination?." arXiv preprint arXiv:2505.23646 (2025).
[6] Wu, Jialong, et al. "WebDancer: Towards Autonomous Information Seeking Agency." arXiv preprint arXiv:2505.22648 (2025).
[7] Yao, Shunyu, et al. "React: Synergizing reasoning and acting in language models." International Conference on Learning Representations (ICLR). 2023.