Kien T. Pham (TK)

Hi πŸ‘‹, I am currently a PhD student at The Hong Kong University of Science and Technology, co-advised by Prof. Long Chen in the LONG Group and Prof. Qifeng Chen in the Visual Intelligence Lab.

Previously, I completed my MPhil at HKUST under the guidance of Prof. Qifeng Chen, a journey filled with joy and success. Before that, I earned my BSc Degree with First Class Honors, double majoring in Data Science and Computer Science. I am also an alum of the S.S. Chern Class, an elite class for students with demonstrated mathematical excellence.

During my MPhil, I worked on creative applications of Diffusion Models such as Image Composition, as well as engaged in projects about Visual Perception for Autonomous Driving and Humanoid Robotics. For my current research, I'm mainly focused on Audio-Video Generation and Understanding.

Email  /  ResumΓ© / CV  /  Scholar  /  ORCID  /  GitHub  /  WeChat

profile photo

News

β€’ Oct. 2025 ✈️ I'm very excited to attend ACM MM 2025 in Dublin, Ireland to present our paper!
β€’ Jul. 2025 πŸ₯³ One paper accepted to ACM MM 2025.
β€’ Apr. 2025 ✈️ I'm very excited to attend ICLR 2025 in Singapore!
β€’ Oct. 2024 ✈️ I'm very excited to attend ACM MM 2024 in Melbourne, Australia to present our paper!
β€’ Jul. 2024 πŸ₯³ One paper accepted to ACM MM 2024.
β€’ Jun. 2024 πŸ₯³ One paper accepted to RO-MAN 2024.
β€’ May. 2024 ✈️ I'm very excited to attend ICRA 2024 in Yokohama, Japan to present our paper!
β€’ Jan. 2024 πŸ₯³ One paper accepted to ICRA 2024.

Publications

SteerFlow: Steering Rectified Flows for Faithful Inversion-Based Image Editing
Thinh Dao, Zhen Wang, Kien T. Pham, Long Chen
arXiv Preprint (arXiv), 2026
Code / arXiv
SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation
Kien T. Pham, Yingqing He, Yazhou Xing, Qifeng Chen, Long Chen
ACM International Conference on Multimedia (MM), 2025
Page / Code / arXiv / Paper
TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization
Kien T. Pham, Jingye Chen, Qifeng Chen
ACM International Conference on Multimedia (MM), 2024
Page / Code / arXiv / Paper
A Humanoid Robot Dialogue System Architecture Targeting Patient Interview Tasks
Yifan Shen*, Dingdong Liu*, Yejin Bang*, Ho Shu Chan, Rita Frieske, Hoo Choun Chung, Jay Nieles, Tianjia Zhang, Kien T. Pham, Wai Yi Rosita Cheng, Yini Fang, Qifeng Chen, Pascale Fung, Xiaojuan Ma, Bertram Shi
IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), 2024
✨ 3rd Place β€” Best Industry Application Award
Paper
Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving
Zhili Chen, Kien T. Pham, Maosheng Ye, Zhiqiang Shen, Qifeng Chen
IEEE International Conference on Robotics and Automation (ICRA), 2024
arXiv / Paper

Side Projects

Training-free Diffusion-based Medical Cancer Image Generation
Propose a method for generating highly realistic medical cancer images given any pairs of non-tumorous background and tumour object images, directly utilizing the pretrained Stable Diffusion.
Project Report, 2024
PDF

Education

The Hong Kong University of Science and Technology (HKUST)
Hong Kong  Β·  Sep. 2018 – Present

Alum of the S.S. Chern Class, an elite class for students with demonstrated mathematical excellence.

Industry Experience

Celia Large Model Application Lab @ HKRC
Hong Kong, On-site  Β·  Feb. 2026 – Present
Research Intern β€” Research on Unified Foundation Model for Audio-Video.
Mentor: Dr. Rui Liu and Dr. Haoxuan Che
DeepRoute.ai
Shenzhen, On-site  Β·  May. 2025 – Nov. 2025
Algorithm Research Intern β€” Post-training of VLA Model for Autonomous Driving.
Mentor: Dr. Zhili Chen
viAct
Hong Kong, Remote  Β·  Mar. 2021 – Aug. 2021
AI Trainee β€” Working on Video Analytics for ConTech Domain.
GMO-Z.com RUNSYSTEM
Hanoi, On-site  Β·  Mar. 2020 – May. 2020
AI Intern β€” Working on OCR-centric tasks.

Teaching

  • 2026 Spring: COMP2011 β€” Programming with C++
  • 2024 Fall, 2022 Spring: COMP1021 β€” Introduction to Computer Science
  • 2023 Fall: COMP4471/ELEC4240 β€” Deep Learning in Computer Vision

Community Services

Conferences: NeurIPS, ICCV, ICML, CVPR, ICLR, AISTAT, BMVC

Journals: TIP

Awards & Scholarships

  • Postgraduate Studentship (2022 – Present)
  • RedBird Academic Excellence Award, HKUST (2024 - 2025)
  • RTG Award, HKUST (Spring 2024, Fall 2024)
  • Chern Class Achievement Scholarship (2022)
  • Chern Class Talent Scholarship (2020 – 2022)
  • Chern Class Scholarship (2019)
  • Heart Free Lodge Scholarship (2018)
  • University Scholarship (2018 – 2022)
  • Dean's List (Spring 2018, 2020; Fall 2019, 2021)

Updated: 2026-03-15  Β·  By TK Pham πŸ˜„  Β·  View Source on GitHub