Kangsan Kim (김강산) (kksan07 [at] kaist [dot] ac [dot] kr), and here is my CV (Curriculum Vitae).
I am a Ph.D. student in the Graduate School of AI at KAIST (MLAI lab), fortunate to be advised by Prof. Sung Ju Hwang.
My research focuses on developing multimodal large language models (MLLMs) that understand the world and interact with humans through visual data. I have previously worked on video understanding and multimodal Retrieval-Augmented Generation (RAG). I am also interested in embodied AI models that operate on egocentric video and require spatial reasoning capabilities.
🔥 News
- 2025.07: 🗽 Joined NYU as a visiting student under Prof. Mengye Ren.
- 2025.05: 📖 HoliSafe is released on arXiv.
- 2025.05: 🎉 VideoRAG got accepted to ACL Findings 2025.
- 2025.04: 📖 UniversalRAG is released on arXiv.
- 2025.02: 🎉 VideoICL got accepted to CVPR 2025.
📝 Publications
-
HoliSafe: Holistic Safety Benchmarking and Modeling with Safety Meta Token for Vision-Language Model
[project page] [paper] [code]
Youngwan Lee, Kangsan Kim, Kwanyong Park, Ilchae Jung, Sujin Jang, Seanie Lee, Yong-Ju Lee, Sung Ju Hwang
Arxiv 2025 -
UniversalRAG: Retrieval-Augmented Generation over Corpora of Diverse Modalities and Granularities
[project page] [paper] [code]
Woongyeong Yeo*, Kangsan Kim*, Soyeong Jeong, Jinheon Baek, Sung Ju Hwang
Arxiv 2025 -
VideoRAG: Retrieval-Augmented Generation over Video Corpus
[paper] [poster] [code]
Soyeong Jeong*, Kangsan Kim*, Jinheon Baek*, Sung Ju Hwang
Findings of the Association for Computational Linguistics (ACL Findings) 2025 -
VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding
[paper] [poster] [code]
Kangsan Kim*, Geon Park*, Youngwan Lee, Woongyeong Yeo, Sung Ju Hwang
Conference on Computer Vision and Pattern Recognition (CVPR) 2025
(*: equal contribution)
💻 Experiences
-
Visiting Student, New York University
2025.07 - Current, Brooklyn, NY, USA
Advisor: Prof. Mengye Ren
Studying question answering over egocentric video streams from multiple embodied agents. -
Computer Vision Engineer Intern, B GARAGE
2022.10 - 2023.07, San Jose, CA, USA
Developed an ultra-fast edge instance segmentation model that can segment anything in the warehouse. -
Machine Learning(NLP) Scientist Intern, NAVER
2021.07 - 2021.10, Remote
Built and improved end-to-end Korean-English speech translation model.
📖 Educations
- 2024.03 - Current, Ph.D. in Artificial Intelligence, Korea Advanced Institute of Science and Technology (KAIST).
- 2018.03 - 2024.02, B.S. in Computer Science, Korea Advanced Institute of Science and Technology (KAIST).
🏆 Honors and Awards
- 2023.06 Qualcomm-KAIST Innovation Award.
- 2020.09 Dean’s List, College of Engineering.