Beong-woo Kwak
Hello! I am a Ph.D. student in AI at Yonsei University, advised by Prof. Jinyoung Yeo.
My research interests are in language models and agents. In particular, I am interested in: (1) agents that can interact with various environments (tools, code, the web, …), (2) personalization and agent memory, and (3) open-world adaptation of agents: efficient and scalable training and evaluation mechanisms under unknown or evolving environments.
News
| Date | News |
|---|---|
| Sep 18, 2025 | |
| Aug 21, 2025 | |
| Jun 01, 2025 | |
| May 16, 2025 | |
| Jan 23, 2025 | |
| Sep 20, 2024 | |
| Jun 23, 2024 | |
| May 16, 2024 | |
| Jun 27, 2023 | |
Experience & Educational Timeline
- Jun 2025 - Dec 2025
Research Scientist Intern, Microsoft Research Asia (MSRA) @ Beijing, China
Worked on self-evolving tool agents and agentic RL
Mentors: Dr. Liang Wang, Dr. Nan Yang, Dr. Xingxing Zhang
Government-funded collaborative research program (IITP)
- Jun 2024 - Sep 2024
Applied Scientist Intern, Amazon AGI @ Sunnyvale, CA, USA
Return Internship
Worked on Code LLMs
- Jun 2023 - Sep 2023
Applied Scientist Intern, Amazon Alexa AI @ Sunnyvale, CA, USA
Worked on Tool-augmented LLMs
Mentors: Dr. Hann Wang, Dr. Nikolaos Malandrakis, Dr. Nagesh Panyam
- Mar 2022 - Current
Ph.D. Student, Artificial Intelligence, Yonsei University @ Seoul, Republic of Korea
Advised by Prof. Jinyoung Yeo
- Mar 2020 - Feb 2022
M.S., Artificial Intelligence, Yonsei University @ Seoul, Republic of Korea
Advised by Prof. Jinyoung Yeo
- Jan 2018 - Jul 2018
Exchange Student, University of California @ Santa Cruz, CA, USA
International exchange program
- Oct 2015 - Jul 2017
Sergeant, Republic of Korea Army @ Seoul, Republic of Korea
Compulsory military service during B.S. in the Republic of Korea Army
- Mar 2014 - Feb 2022
B.S., Computer Science, Yonsei University @ Seoul, Republic of Korea
Activities
- Reviewer
  - AAAI, EMNLP, ACL, NAACL
- Teaching Experiences
  - NVIDIA: Teaching Assistant on NLP
  - Teaching Assistant at Yonsei Univ: Text and Language Understanding, Big Data, Natural Language Processing
- Company Experiences
  - Microsoft Research Asia (MSRA), Beijing, China - Research Scientist Intern (Jun 2025 - Dec 2025)
  - Amazon AGI Foundational Models Group, Sunnyvale, CA, USA - Applied Scientist Intern (Jun 2024 - Sep 2024)
  - Amazon Alexa AI, Sunnyvale, CA, USA - Applied Scientist Intern (Jun 2023 - Sep 2023)
- Honors and Awards
  - Encouragement Prize, Korea Capstone Design Fair (As a representative of Yonsei University) - Ministry of Trade, Industry and Energy, Korea
  - Top Prize, Software Capstone Design - Computer Science Department, Yonsei Univ
  - Top Presentation Prize, Yonsei Creative Exhibition Presentation - College of Engineering, Yonsei Univ
  - Student Teaching Assistant Scholarship (Head Professor TA) - Department of Artificial Intelligence, Yonsei Univ
Publications
- Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance. arXiv, 2025.
- Web-Shepherd: Advancing PRMs for Reinforcing Web Agents. In NeurIPS (Spotlight), 2025. [NeurIPS 2025 (Spotlight)]
- ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions. In Findings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025. [EMNLP 2025 (Findings)]
- One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), Jul 2025. [ACL 2025 (Industry)]
- LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2025. [ACL 2025]
- Can You Share Your Story? Modeling Clients’ Metacognition and Openness for LLM Therapist Evaluation. In Findings of the Association for Computational Linguistics: ACL 2025, Jul 2025. [ACL 2025 (Findings)]
- Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics. In Findings of the Association for Computational Linguistics: NAACL 2025, 2025. [NAACL 2025 (Findings)]
- Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024. [EMNLP 2024]
- Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024. [EMNLP 2024]
- Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation Dataset. In Findings of the Association for Computational Linguistics: ACL 2024, 2024. [ACL 2024 (Findings)]
- Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jul 2022. [NAACL 2022]
- Dual Task Framework for Improving Persona-grounded Dialogue Dataset. In Proceedings of the AAAI Conference on Artificial Intelligence, 2022. [AAAI 2022]
- TrustAL: Trustworthy Active Learning using Knowledge Distillation. In Proceedings of the AAAI Conference on Artificial Intelligence, 2022. [AAAI 2022]