References

[1] Jaech, Aaron, et al. "OpenAI o1 system card." arXiv preprint arXiv:2412.16720 (2024).

[2] Guo, Daya, et al. "DeepSeek-R1: Incentivizing reasoning capability in LLMs via reinforcement learning." arXiv preprint arXiv:2501.12948 (2025).

[3] OpenAI, "Introducing deep research." https://openai.com/index/introducing-deep-research

[4] Li, Xiaoxi, et al. "Search-o1: Agentic search-enhanced large reasoning models." arXiv preprint arXiv:2501.05366 (2025).

[5] Yao, Zijun, et al. "Are Reasoning Models More Prone to Hallucination?" arXiv preprint arXiv:2505.23646 (2025).

[6] Wu, Jialong, et al. "WebDancer: Towards Autonomous Information Seeking Agency." arXiv preprint arXiv:2505.22648 (2025).

[7] Yao, Shunyu, et al. "ReAct: Synergizing reasoning and acting in language models." International Conference on Learning Representations (ICLR). 2023.