Wang Haofen
Distinguished Researcher, Doctoral Supervisor
College of Design and Innovation, Tongji University
Research Interests
- Knowledge Graphs
- Natural Language Processing
- Retrieval-Augmented Generation
- Knowledge-Enhanced Large Language Models
Education
- 2007-09 to 2013-12, Shanghai Jiao Tong University, Computer Science and Engineering, PhD
- 2005-09 to 2007-06, Shanghai Jiao Tong University, Computer Science and Engineering, Master
- 2003-09 to 2006-06, Shanghai Jiao Tong University, Mathematics and Applied Mathematics, Bachelor of Science
- 2001-09 to 2005-06, Shanghai Jiao Tong University, Computer Science and Engineering, Bachelor of Science
Work Experience
- 2019-09 to Present, Tongji University, College of Design and Innovation, Distinguished Researcher
- 2018-07 to 2019-08, Shanghai Leyan Information Technology Co., Ltd. (valuation over USD 1 billion; AI-powered e-commerce customer service platform serving over 1 billion users), CTO
- 2016-02 to 2018-06, Shenzhen Goome Robotics Co., Ltd. (AI unicorn in emotional companion robots; launched the world’s first cultivatable virtual idol “Amber·XuYan”), CTO
- 2014-01 to 2016-01, East China University of Science and Technology, Lecturer
Teaching
- Undergraduate courses: Python Programming, Professional Design, Interaction Design
- Graduate courses: Innovation Design & Entrepreneurship Frontiers, Innovation Project Practice, Design Practice Research
Research Projects
- National Key R&D Program (Ministry of Science and Technology) — New Generation AI 2030 Major Project: Knowledge-Enhanced Scientific Embodied Agent Platform andApplications, Dec 2025 – Nov 2028, Principal Investigator of the subject (responsible for knowledge-enhanced simulated experimental environments and skill learning by agents)
- National Natural Science Foundation of China (NSFC) Key Project: Research on Large-Scale Systematic Knowledge Computation Platform Construction, Jan 2024 – Dec 2027, Principal Investigator
- NSFC General Program: Research on Multi-Hop Knowledge Question Answering Based on Explainable Neuro-Symbolic Reasoning, Jan 2022 – Dec 2025, Principal Investigator
- Shanghai Basic Research Special Zone Program: Urban Characteristic Style Shaping Based on Multimodal Knowledge-Enhanced Large Models, Jan 2024 – Dec 2027, Co-Principal Investigator
- Industry Project: Huawei Personal Intelligence Engine 2.0 Technology Collaboration, Oct 2023 – Dec 2025, Principal Investigator
- Industry Project: Datagrand Information Technology Knowledge Graph & Semantic Understanding Intelligent System Research, Jul 2021 – Jul 2025, Principal Investigator
- Industry Project: Samsung Multimodal Knowledge Construction & Reasoning for Personal Memory Systems from Long Videos, Oct 2025 – Dec 2025, Principal Investigator
- Industry Project: Meituan LLM Evaluation Dataset Construction Based on Crowdsourcing Competition, Sep 2023 – Dec 2023, Principal Investigator
- Industry Project: miHoYo Platform Public Opinion Monitoring and Guidance, Aug 2021 – Mar 2022, Principal Investigator
Publications
In the past five years, 90 papers have been published, including 50+ high-level papers in CCF-A/B or CAS Tier 1/2 journals and conferences. Total Google Scholar citations: 10,944; highest single-paper citations: 5,367. [Selected Recent Papers]
Monographs
- Retrieval-Augmented Generation: Theory and Practice, Electronic Industry Press, Wang Haofen, Wang Nan, Luo Yun, Gao Yunfan, January 2026
- Multi-Source Knowledge Fusion and Applications, Electronic Industry Press, Wang Xiaoling, Wang Haofen, Yang Xiaochun, March 2025
- Human-Intelligence Interaction: Interdisciplinary Integration for Human-Centered AI, Tsinghua University Press, Contributing Author (Chapter 5: Data and Knowledge Dual Driven Artificial Intelligence), September 2024
- Knowledge Graph / AI and Intelligent Education Series, Educational Science Press, Wang Haofen, Ding Jun, Hu Fanghuai, Yang Xiangdong, July 2022
- Knowledge Graph: Methods, Practice and Applications, Electronic Industry Press, Wang Haofen, Qi Guilin, Chen Huajun, August 2019
- Natural Language Processing in Practice: Principles and Applications of Chatbot Technology, Electronic Industry Press, Wang Haofen, Shao Hao, February 2019
Granted Patents
- Text Conversion Encoder, Text-to-SQL Query Analysis Method and System, Feb 21, 2025, China, ZL202210443248.X, Wang Haofen, Li Shuqin
- A Multi-Strategy Fusion Knowledge Question Answering Method and System, Feb 2, 2021, China, ZL201910153329.4, Zhou Yang, Wang Haofen
- A Knowledge Graph-Empowered Information Retrieval-Based Question Answering System and Method, Oct 2, 2020, China, ZL201910134021.5, Chu Shanbo, Wang Haofen
- A Visual Question Answering Method Based on Cognitive Dual-Channel Reasoning, Aug 1, 2025, China, ZL202210343042.X, Zhang Wenqiang, Zhang Kailei, Wang Haofen, Liu Weichen
- An Entity-Relation Joint Extraction Method Integrating Attention Mechanism and Segment Arrangement, Jul 1, 2025, China, ZL202210341776.4, Zhang Wenqiang, Zhang Chenglong, Wang Haofen
- A Generation Method and Device for Chinese General Knowledge Graph with Timestamps, Nov 3, 2020, China, ZL201710601438.9, Song Yanan, Qiu Nan, Wang Haofen, Shao Hao
- A Multi-Source Cross-Domain Data Query Method and System, Aug 15, 2025, China, ZL202510206906.7, Li Bohan, Wu Wenlong, Wen Hao, Wang Haofen, Yin Hailian, Li Jingbo, Zhuo Junnan, Zhao Xinzhe, Liu Yuanrui
- A Large Language Model Safety Detection Method Based on Automated Knowledge Graph Generation, Jul 18, 2025, China, ZL202510654123.5, Li Bohan, Zhao Xinzhe, Wu Wenlong, Zhuo Junnan, Huang Ruilong, Liu Liang, Wang Haofen, Ruan Guoyue
Open Source Projects
OpenKG [link]
OpenKG is an open knowledge graph initiative aimed at promoting the openness, interconnection, and crowdsourcing of knowledge graph data centered on Chinese, as well as the open-source development of knowledge graph tools, models, and platforms.
KAG [link]
KAG is a professional domain knowledge-augmented service framework specifically designed for building domain-specific knowledge bases.
MemOS [link]
MemOS is an intelligent memory operating system that enhances the personalization of large models. Through two key mechanisms—memory tiering and multi-granular scheduling—it enables continuous evolution and personalized responses of the models.
KaLM Embedding[link]
KaLM-Embedding, a multilingual embedding model, leverages high-quality training data and advanced techniques to achieve superior performance compared to other similarly sized models on the MTEB benchmark.
JoyAgent[link]
JoyAgent is the industry's first open-source, fully-featured, lightweight, and general-purpose multi-agent product.
AI-Ceping [link]
AI-Ceping is a cutting-edge platform dedicated to the evaluation and advancement of LLMs. As a pioneer in the field, AI-Ceping offers a comprehensive suite of tools and services designed to test, improve, and showcase LLMs capabilities.
Awards and Honors
- 2025 MUSE Design & Creativity Award, Silver Prize “AI-Ceping: Large Model Evaluation Platform”, First Contributor
- 2025 CCF Science and Technology Achievement Award, Third Prize for Technological Progress “Key Technologies and Applications of Knowledge-Enhanced Intelligent Decision-Making”, First Contributor
- 30th International Conference on Database Systems for Advanced Applications (DASFAA 2025) Best Student Paper “HBS-KGLLM: A General Framework for Generating Knowledge Graphs for Jailbreaking”
- 2024 First Prize, China Transportation Association Science and Technology Progress Award “Research and Application of Key Technologies for Intelligent Regulation of Shanghai Airport Operations”, Second Contributor
- 13th International Conference on Design, User Experience and Usability (HCII 2024) Best Paper “From Passive to Active: Towards Conversational In-Vehicle Navigation ThroughLarge Language Model”
- 7th China Health Information Processing Conference (CHIP 2021) Best Paper “Construction of a Linking Data Set of COVID-19 Knowledge Graphs: Development and Applications”
- 2020 First Prize, Outstanding Publication (Professional Category), China Industry & Information Technology Media Publishing Group “Knowledge Graph: Methods, Practice and Applications”, First Contributor
- 2018 Grand Prize, Startup Track, 4th National Youth AI Innovation and Entrepreneurship Conference “Leyu AI Customer Service Robot”, Second Contributor
- 2016 Shanghai Outstanding Doctoral Dissertation Award “Semantic Search over Large-Scale RDF Data”
Academic Roles
- Rotating Chair, OpenKG Knowledge Graph Community (2024–2026)
- Chair, Technical Committee, OpenMem Memory-Centric AI System Open-Source Community (2025–present)
- Executive Editor-in-Chief, Data Intelligence journal (2024–present)
- Associate Editor, Knowledge Engineering & Review journal (2025–present)
- Secretary-General, CCF (China Computer Federation) Technical Committee on Natural Language Processing (2024–2027)
- Chair, CCF Technical Frontier — Knowledge Graph Special Interest Group (2024–2027)
- Deputy Director, CCF Terminology Review Working Committee (2024–2026)
- Member, CCF Academic Affairs Committee (2024–2026)
- Standing Committee Member, CCF Technical Committee on Information Systems (2024–2027)
- Council Member, Chinese Information Processing Society of China (CIPS) (2021–2026)
- Deputy Secretary-General, CIPS Special Committee on Language and Knowledge Computing (2021–2026)
- Program Committee Chair, IJCNLP-AACL 2025 (International Joint Conference on Natural Language Processing & Asia-Pacific Chapter of the Association for Computational Linguistics, 2025)
- Chair, Large Knowledge-Enhanced Models Workshop @ IJCAI 2024
- Program Committee Chair, WISA 2024 (International Conference on Web Information Systems and Applications)
- General Chair, CCKS 2023 (China Conference on Knowledge Graph and Semantic Computing)
- Forum Chair, Sixth Knowledge Graph Forum @ CNCC 2022 — “Knowledge Graphs Empowering Big Data and Massive Computing”
- General Chair, IJCKG 2022 (International Joint Conference on Knowledge Graphs)
- Program Committee Vice-Chair, WISA 2022
- Program Committee Chair, IJCKG 2021
- Program Committee Chair, CCKS 2021
Guest Editor Roles
- CAS Tier 1 Journal Big Data Mining and Analytics — Special Issue: “Challenges and Opportunities in Retrieval-Augmented Generation for LLMs: Techniques, Trends and Applications”
- CCF-B Journal World Wide Web Journal — Special Issue: “Neuro-Symbolic Intelligence: Large Language Model Enabled Knowledge Engineering”
- CCF-B Chinese Journal Journal of Frontiers of Computer Science and Technology — Special Issue: “Construction and Application of Domain-Specific Large Language Models”
Selected Talks
- Keynote Speaker, “From RAG to KAG: Complex Reasoning under Structured Thinking Paradigms”, Tencent Cloud Architects Summit 2025. [Slides]
- Keynote Speaker, “RAG2.0: A New Paradigm for Knowledge Enhancement Integrating Graph, Reasoning, and Decision Making”, ADL158 “AI Search and Information Agents”. [Slides]
- Keynote Speaker, “KG+LLM: Reconstructing and Evolving Knowledge Graphs through Large Language Models”, DataFun Summit 2025. [Slides]
- Keynote Speaker, “The Era of Agentic RAG:DeepSeek drives knowledge retrieval enhancement with upgraded reasoning models”, Tencent Cloud Valuable Professional(TVP) Seminar 2025. [Slides]
- Keynote Speaker, “Theoretical Innovations and New Research Paradigms of Knowledge Graphs in the Era of Large Language Models”, the First International OpenKG Workshop Large Knowledge-Enhanced Models @IJCAI 2024 [Slides]
- Keynote Speaker, “Industry-level Knowledge Graph Platform for Large-scale, Diverse and Dynamic Scenarios”, 2024 International Workshop on LLM+KG: Data Management Opportunities in Unifying Large Language Models+Knowledge Graphs @VLDB 2024 [Slides]
- Keynote Speaker, “Towards Intelligent Systems Driven by Knowledge Graph and Large Language Model”, the International Conference on Computational Linguistics and Natural Language Processing (CLNLP 2023) [Slides]
- Keynote Speaker, “Emerging Technologies of Knowledge Graph in the Big Data Era”, 6th International Joint Conference, APWeb-WAIM 2022 [Slides]