Member image
Suyeon Lee
이수연
Ph.D. Student
School of Electrical Engineering, KAIST

About me

I am a second-year Ph.D. student at KAIST Multimodal AI Lab, advised by Professor Joon Son Chung. My research focuses on multimodal AI, with a core emphasis on audio-visual speech and source separation. I investigate how cross-modal cues can be leveraged to isolate target signals and enhance auditory clarity in complex acoustic environments.

Education

Ph.D. in Electrical Engineering, KAIST

Mar. 2025 - Present

Advisor: Joon Son Chung

M.S. in Electrical Engineering, KAIST

Mar. 2023 - Feb. 2025

Advisor: Joon Son Chung

B.S. in Electrical Engineering, KAIST

Mar. 2018 - Feb. 2023

Selected Awards

2024
  • Achieved 1st Place (Winner) in the Audio Track and 4th Place in the Audio-Visual Track of NIST Speaker Recognition Evaluation (SRE)

Experience

Research Intern, AIRS Company, Hyundai Motor Group

Sep. 2021 - Feb. 2022

Publications

2026

Thumbnail
Cinematic Audio Source Separation Using Visual Cues
K. Zhang*, S. Lee*, A. Senocak, and J. S. Chung
IEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Paper Project Page
Thumbnail
Plug-and-Steer: Decoupling Separation and Selection in Audio-Visual Target Speaker Extraction
D. Kwak*, S. Lee*, and J. S. Chung
Preprint (Submitted to Interspeech)
Paper Project Page

2025

Thumbnail
Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation
K. Zhang*, T. X. Pham*, S. Lee, A. Niu, A. Senocak, and J. S. Chung
Conference on Neural Information Processing Systems (NeurIPS)
Paper Code

2024

Thumbnail
Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion Model
S. Lee*, C. Jung*, Y. Jang, J. Kim, and J. S. Chung
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Paper Project Page Code
Thumbnail
TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning
C. Jung*, S. Lee*, K. Nam, K. Rho, Y. Jang, and J. S. Chung
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Paper Code
Thumbnail
FlowAVSE: Efficient Audio Visual Speech Enhancement Models with Conditional Flow Matching
C. Jung, S. Lee, J. H. Kim, and J. S. Chung
Interspeech
Paper Code

Contact

  syun (at) mmai (dot) io
  Room 3103, N24 (LG Innovation Hall)

KAIST logo