Publications
† indicates equal contribution.
Publications by Year
Top Venues (J+C)
2025
- Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized AssistancearXiv, 2025arXiv arXiv 2025
- Web-Shepherd: Advancing PRMs for Reinforcing Web AgentsIn NeurIPS Spotlight, 2025arXiv NeurIPS 2025 (Spotlight)
- ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term InteractionsIn Findings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025arXiv EMNLP 2025 (Findings)
- One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RLIn Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), Jul 2025arXiv ACL 2025 (Industry)
- LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical StudyIn Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Jul 2025arXiv ACL 2025
- Can You Share Your Story? Modeling Clients’ Metacognition and Openness for LLM Therapist EvaluationIn Findings of the Association for Computational Linguistics: ACL 2025, Jul 2025arXiv ACL 2025 (Findings)
- Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with PsychometricsIn Findings of the Association for Computational Linguistics: NAACL 2025, Jul 2025arXiv NAACL 2025 (Findings)
2024
- Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language ModelsIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Jul 2024arXiv EMNLP 2024
- Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous CodeIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Jul 2024arXiv EMNLP 2024
- Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation DatasetIn Findings of the Association for Computational Linguistics ACL 2024, Jul 2024arXiv ACL 2024 (Findings)
2022
- Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense ReasoningIn Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jul 2022arXiv NAACL 2022
- Dual task framework for improving persona-grounded dialogue datasetIn Proceedings of the AAAI conference on artificial intelligence, Jul 2022arXiv AAAI 2022
- Trustal: Trustworthy active learning using knowledge distillationIn Proceedings of the AAAI conference on artificial intelligence, Jul 2022arXiv AAAI 2022