Research Scientist Intern
- In Meta Reality Labs Research Audio team under Anurag Kumar.
- Robust multi-modal audiovisual speech representation learning research.
I am a Ph.D. graduate in Integrated Vision Language Lab. under the supervision of Professor Yong Man Ro in Electrical Engineering at Korea Advanced Institute of Science and Technology (KAIST).
I received a B.S. in Electrical Engineering from KAIST in 2019.
My major research area is multi-modal representation learning and human interactive learning.
I am more focusing on integrating audio, video, and text modalities in human dialogue systems, especially lipreading, lip-to-speech synthesis, and audio-visual speech recognition.
My research interests extend further like machine translation, speech enhancement, and speech separation, using multi-modal representations.
I am proud to highlight that I have been honored with the Outstanding Dissertation Award in the School of Electrical Engineering.
My thesis focuses on human speech understanding through multimodal representation learning.
Here is Curriculum Vitae for more information about me.
(* indicates equal contribution)
Se Jin Park, Chae Won Kim, Hyeongseop Rha, Minsu Kim, Joanna Hong, Jeonghun Yeo, Yong Man Ro
Association for Computational Linguistics (ACL), 2024 (Oral)
Joanna Hong, Se Jin Park, and Yong Man Ro
Findings of the Association for Computational Linguistics: EMNLP, 2023
Jeongsoo Choi*, Joanna Hong*, and Yong Man Ro
IEEE/CVF International Conference on Computer Vision (ICCV), 2023
Joanna Hong*, Minsu Kim*, Jeongsoo Choi, and Yong Man Ro
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
Minsu Kim*, Joanna Hong*, and Yong Man Ro
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023
Joanna Hong*, Minsu Kim, and Yong Man Ro
European Conference on Computer Vision (ECCV), 2022
Minsu Kim*, Joanna Hong*, Daehun Yoo, and Yong Man Ro
Interspeech, 2022 (Oral)
Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, and Yong Man Ro
AAAI Conference on Artificial Intelligence (AAAI), 2022 (Oral)
Minsu Kim, Joanna Hong, and Yong Man Ro
Conference on Neural Information Processing Systems (NeuIPS), 2021
Minsu Kim*, Joanna Hong*, Se Jin Park, and Yong Man Ro
IEEE/CVF International Conference on Computer Vision (ICCV), 2021
Minsu Kim, Joanna Hong, Junho Kim, Hong Joo Lee, and Yong Man Ro International Conference on Pattern Recognition (ICPR), 2021
Joanna Hong, Jung Uk Kim, Sangmin Lee, Yong Man Ro IEEE International Conference on Image Processing (ICIP), 2020
Joanna Hong, Hong Joo Lee, Yelin Kim, and Yong Man Ro International Conference on Multimedia Modeling (MMM), 2020
Joanna Hong, Minsu Kim, Se Jin Park, and Yong Man Ro
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2021
Minsu Kim, Joanna Hong, Se Jin Park, and Yong Man Ro
IEEE Transactions on Multimedia (TMM), 2021