photo

Zeyi Wen

Assistant Professor

Contact

Data Science and Analytics (DSA) Thrust, Information Hub,
Hong Kong University of Science and Technology, Guangzhou (HKUST-GZ)
Email: myname@ust.hk, where myname=wenzeyi
Office address: Room 312, E2 Building, 1 DuXue Rd, Nansha District, Guangzhou, China

Short Bio

What's new

  • Dec 2024: Two papers on LLM inference and fine-tuning got accepted by AAAI'25.
  • Nov 2024: Our paper on GBDTs for High Dimensional Data was accepted by KDD'25.
  • Sep 2024: Our paper on SVMs for ABSA tasks was accepted by EMNLP'24 Findings.
  • Jun 2024: Our paper on Bayesian Network Learning was accepted by IEEE TPDS.
  • May 2024: Our paper on PGM inference acceleration was accepted by USENIX ATC'24.
  • Feb 2024: Our paper on Hyper-parameter Optimization with Adaptive Fidelity was accepted by CVPR'24.
  • Jan 2024: Our paper on Bandit-based Hyperparameter Optimization was accepted by ICDE'24.

Research Interests (publications)

  • Large Language Model Inference/Training/Fine-Tuning Acceleration
  • Machine Learning Systems, Automatic Machine Learning, and High-Performance Computing

Teaching

Current Courses at HKUST-GZ
  • Spring 2025, UFUG 2601 C++ Programming
  • Fall 2022-2024, DSAA 5003 Automatic Machine Learning
  • Spring 2023 & 2024, DSAA 6000H Data Warehousing
Past Courses at UWA
  • Semester 1, 2020 & 2021: CITS3401/5504 Data Warehousing
  • Semester 2, 2020 & 2021: CITS5507 High Performance Computing

Graduate Supervision

  • There is currently one PhD opening under Dr. Zeyi Wen's supervision. Students interested in efficiency optimization for Large Language Models (LLMs) can send their applications by email. An application should include a resume with transcript(s).
  • Current graduate students and their research
    NameResearch
    Guoli Wu (PhD student to start from Spring 2025)LLM inference optimization
    Haichao Fang (RBM MPhil student from Fall 2024)LLMs for health applications
    Chengxi Liao (PhD student since Fall 2024)Long context LLM inference
    Tong Yuan (PhD student since Fall 2024)Distributed LLM inference
    Ruijia Yang (PhD student since Fall 2024)Efficient LLM inference
    Rui Zhang (PhD student since Fall 2024)Accelerating kernel machine training
    Zeyuan Lin (Visiting MPhil student from Fall 2024)Kernels for CNNs
    Hangyu Yang (Visiting master student from Fall 2024)Efficient fine-tuning for LLMs
    Xinhao Huang (PhD student from Spring 2024)Model decomposition
    Youliang Huang (RBM MPhil student from Fall 2023)Efficient fine-tuning for LLMs
    Jihang Li (PhD student since Fall 2023)Learning operator acceleration with GPUs
    Minping Chen (PhD student since Fall 2023)Efficient fine-tuning for LLMs
    Yuebin Xu (PhD student since Fall 2023)Hyper-parameter optimization (HPO) for LLMs
    Xuemei Peng (PhD student since Spring 2023)Efficient LLM serving
    Yile Chen (Visiting PhD student since 2022)Hyper-parameter optimization (HPO)
    Hanfeng Liu (PhD student since Fall 2022)Parallel GBDT training

Selected Professional Activities

  • Program Committee or Reviewer
    • 2025: KDD, ICLR, ICDE, AAAI, ACL, EMNLP, DASFAA (demo track co-chair)
    • 2024: KDD, NeurIPS, AAAI, CIKM, EDBT, EMNLP
    • 2023: KDD, SC, AAAI, CIKM, WSDM, SIGIR, ECML-PKDD, CCGrid,
    • 2022: KDD, SC, AAAI, SIGIR, WSDM, CIKM, CVPR, ECCV