About

Haitao Yuan (袁海涛 in Chinese) is currently a research fellow working with Prof. Gao Cong at the College of Computing and Data Science, NTU. Prior to that, he received his Ph.D. degree in computer science from Tsinghua University, supervised by Prof. Guoliang Li and Prof. Ling Feng. Haitao is also fortunate to enjoy a close academic collaboration with Prof. Zhifeng Bao.

Haitao’s research focuses on developing innovative database technologies that can effectively and efficiently utilize Big Data (e.g., tabular, text, image, and spatio-temporal data) and AI Models (e.g., LLM, GNN, and RL) to benefit people in areas such as transportation, healthcare, and education. To achieve this goal, Haitao concentrates on three key research areas (SIR):

  • Building Scalable multi-modal data management and retrieval systems
  • Creating Intelligent data manipulation and preparation pipelines
  • Developing Robust data prediction models and decision strategies for real-world applications

He has published 30+ papers in top DB/DM conferences and journals (SIGMOD, VLDB, ICDE, WWW, TKDE, CIKM, etc.).

Feel free to contact me if you are interested in discussing ideas or working together. 😜

Email: yhaitao45@163.com / haitao.yuan@ntu.edu.sg

Office: N4 #B3a-02, 50 Nanyang Avenue, Singapore 639798.

Research Interests

  • Spatio-temporal Data Preparation and Management (S&I): trajectory search and join, trajectory generation, road network generation, traffic imputation.
  • AI-powered Database Optimization (S): materialized view advisor, query rewrite, index advisor.
  • Interdisciplinary Applications (I&R): AI4Transportation (e.g., travel time estimation, traffic prediction), AI4Science (e.g., medical diagnosis), and AI4Education (e.g., mathematical exercise solver, knowledge tracing).
  • Retrieval-augmented Generation (S&I): multi-modal RAG, vector database.

What's New

  • 2024-08: One research paper was accepted by PVLDB 2025.
  • 2024-07: Two research papers were accepted by CIKM 2024.
  • 2024-06: Three papers ([SIGMOD] and [ICDE]) were selected for the 2024 Highly-Cited List (2019-2023).
  • 2024-03: One research paper was accepted by ICDE 2024.

Selected Awards

  • 2021 ACM SIGMOD China Doctoral Dissertation Award
  • 2021 Best Ph.D. Thesis of Tsinghua CS
  • 2021 Outstanding Graduate of Beijing
  • 2020 National Scholarship
  • 2020 Innovative Future Scholarship of Tsinghua CS
  • 2019 VMware Scholarship Award
  • 2019 Best Paper Award of DASFAA 2019

Program Committee Member and Reviewer

KDD, ICDE, CIKM, WWWJ, etc.

Invited Talks

I am happy to give a talk if you are interested in my work. 😊

  • Nuhuo: An Effective Estimation Model for Traffic Speed Histogram Imputation on A Road Network. PVLDB’24, 2024. 08 [Slides]

  • Route Travel Time Estimation on A Road Network Revisited: Heterogeneity, Proximity, Periodicity and Dynamicity. PVLDB’23, 2023. 09 [Slides]

  • Automatic Road Extraction with Multi-Source Data Revisited: Completeness, Smoothness and Discrimination. PVLDB’23, 2023. 09 [Slides]

  • Deep Learning for ETA Application on Baidu Map. Beijing University of Posts and Telecommunications, 2022. 05

  • Big Spatio-temporal Trajectory Data Management and Mining (In Chinese). Renmin University of China, 2022. 01