Beong-woo Kwak
Hello! I am a Ph.D student in AI at Yonsei University, advised by Prof. Jinyoung Yeo.
My research interest is in NLP, focusing on Conversational AI where knowledge, reasoning, and interaction come into play. In particular, I’m interested in:
- Personalized agents that can perform/develop/be maintained in a lifelong manner,
- Code / Tool-learning,
- Efficient, scalable training/evaluation under evolving environments.
News
| Sep 18, 2025 | |
|---|---|
| Aug 21, 2025 | |
| Jun 01, 2025 | |
| May 16, 2025 | |
| Jan 23, 2025 | |
| Sep 20, 2024 | |
| Jun 23, 2024 | |
| May 16, 2024 | |
| Jun 27, 2023 | |
Experience & Educational Timeline
- Jun 2025 - Dec 2025
Research Scientist Intern Microsoft Research Asia (MSRA)@Beijing, China
Worked on self-evolving tool agents and agentic RL
Mentors: Dr. Liang Wang, Dr. Nan Yang, Dr. Xingxing Zhang
Government-funded collaborative research program (IITP) - Jun 2024 - Sep 2024
Applied Scientist Intern Amazon AGI@Sunnyvale, CA, USA
Return Internship
Worked on Code LLMs - Jun 2023 - Sep 2023
Applied Scientist Intern Amazon Alexa AI@Sunnyvale, CA, USA
Worked on Tool-augmented LLMs
Mentors: Dr. Hann Wang, Dr. Nikolaos Malandrakis,
Dr. Nagesh Panyam - Mar 2022 - Current
Ph.D. Student, Artificial Intelligence Yonsei University@Seoul, Republic of Korea
Advised by Prof. Jinyoung Yeo - Mar 2020 - Feb 2022
M.S., Artificial Intelligence Yonsei University@Seoul, Republic of Korea
Advised by Prof. Jinyoung Yeo - Jan 2018 - Jul 2018
Exchange Student University of California@Santa Cruz, CA, USA
International exchange program - Oct 2015 - Jul 2017
Sergeant Republic of Korea Army@Seoul, Republic of Korea
Compulsory military service during B.S.
in the Republic of Korea Army - Mar 2014 - Feb 2022
B.S., Computer Science Yonsei University@Seoul, Republic of Korea
Activities
-
Reviewer
- AAAI, EMNLP, ACL, NAACL
-
Teaching Experiences
- NVIDIA: Teaching Assistant on NLP
- Teaching Assistant at Yonsei Univ: Text and Language Understanding, Big Data, Natural Language Processing
-
Company Experiences
- Microsoft Research Asia (MSRA), Beijing, China - Research Scientist Intern (Jun 2025 - Dec 2025)
- Amazon AGI Foundational Models Group, Sunnyvale, CA, USA - Applied Scientist Intern (Jun 2024 - Sep 2024)
- Amazon Alexa AI, Sunnyvale, CA, USA - Applied Scientist Intern (Jun 2023 - Sep 2023)
-
Honors and Awards
- Encouragement Prize, Korea Capstone Design Fair (As a representative of Yonsei University) - Ministry of Trade, Industry and Energy, Korea
- Top Prize, Software Capstone Design - Computer Science Department, Yonsei Univ
- Top Presentation Prize, Yonsei Creative Exhibition Presentation - College of Engineering, Yonsei Univ
- Student Teaching Assistant Scholarship (Head Professor TA) - Department of Artificial Intelligence, Yonsei Univ
Publications
- Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized AssistancearXiv, 2025arXiv arXiv 2025
- Web-Shepherd: Advancing PRMs for Reinforcing Web AgentsIn NeurIPS Spotlight, 2025arXiv NeurIPS 2025 (Spotlight)
- ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term InteractionsIn Findings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025arXiv EMNLP 2025 (Findings)
- One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RLIn Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), Jul 2025arXiv ACL 2025 (Industry)
- LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical StudyIn Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2025arXiv ACL 2025
- Can You Share Your Story? Modeling Clients’ Metacognition and Openness for LLM Therapist EvaluationIn Findings of the Association for Computational Linguistics: ACL 2025, Jul 2025arXiv ACL 2025 (Findings)
- Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with PsychometricsIn Findings of the Association for Computational Linguistics: NAACL 2025, Jul 2025arXiv NAACL 2025 (Findings)
- Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language ModelsIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Jul 2024arXiv EMNLP 2024
- Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous CodeIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Jul 2024arXiv EMNLP 2024
- Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation DatasetIn Findings of the Association for Computational Linguistics ACL 2024, Jul 2024arXiv ACL 2024 (Findings)
- Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense ReasoningIn Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jul 2022arXiv NAACL 2022
- Dual task framework for improving persona-grounded dialogue datasetIn Proceedings of the AAAI conference on artificial intelligence, Jul 2022arXiv AAAI 2022
- Trustal: Trustworthy active learning using knowledge distillationIn Proceedings of the AAAI conference on artificial intelligence, Jul 2022arXiv AAAI 2022