I am a Staff AI Research Scientist at Intel, working on multimodal AI, video understanding, and agentic systems.
My research interests include vision-language models, video and scene understanding, model robustness, and efficient adaptation of foundation models, with the goal of building AI systems that can reliably understand and reason about complex visual information. My work has been published at CVPR, ICCV, ECCV, NeurIPS, WACV, and ACL.
I received my Ph.D. from the University of California San Diego (UCSD), advised by Prof. Nuno Vasconcelos. Prior to UCSD, I completed my B.S. and M.S. at National Tsing Hua University (NTHU), where I worked with Prof. Min Sun on multimodal learning from video and wearable sensors.
Please see my CV for a complete list of publications and professional activities.