Tz-Ying (Gina) Wu
Staff AI Research Scientist

About Me


I am a Staff AI Research Scientist at Intel, working on multimodal AI, video understanding, and agentic systems. My research interests include vision-language models, video and scene understanding, model robustness, and efficient adaptation of foundation models, with the goal of building AI systems that can reliably understand and reason about complex visual information. My work has been published at CVPR, ICCV, ECCV, NeurIPS, WACV, and ACL.
I received my Ph.D. from the University of California San Diego (UCSD), advised by Prof. Nuno Vasconcelos. Prior to UCSD, I completed my B.S. and M.S. at National Tsing Hua University (NTHU), where I worked with Prof. Min Sun on multimodal learning through video and wearable sensors.
Please see my CV for a complete list of publications and professional activities.

CV [PDF]

News


  • 06/2026: Recognized as an Outstanding Reviewer at CVPR 2026.
  • 04/2026: One paper accepted to ACL 2026.
  • 02/2026: Invited talk at the EnCORE Workshop on Interpretability in Modern AI.
  • 10/2025: One paper accepted to WACV 2026.
  • 10/2025: Recognized as a Top Reviewer at NeurIPS 2025.
  • 10/2025: Three papers presented at ICCV 2025 Workshops (HiGen, SAUAFG, CVAM).
  • 10/2025: Co-organizing the CVAM Workshop at ICCV 2025.
  • 06/2025: One paper presented at the EgoVis Workshop at CVPR 2025.
  • 10/2024: One paper accepted to WACV 2025.
  • 08/2024: Joined Intel AI.
  • 06/2024: Successfully defended my PhD thesis.
  • 06/2024: Presented at the Doctoral Consortium at CVPR 2024.
  • 06/2024: Presented at the WiCV Workshop at CVPR 2024.
  • 02/2024: One paper accepted to CVPR 2024.
  • 06/2023: Interned at Intel AI.
  • 09/2022: One paper accepted to NeurIPS 2022.
  • 09/2022: Amazon Post-Internship Fellowship.
  • 06/2022: Outstanding Reviewer Award at CVPR 2022.
  • 02/2022: One paper accepted to CVPR 2022.
  • 07/2021: One paper accepted to ICCV 2021.
  • 06/2021: Interned at Amazon AWS AI.
  • 07/2020: One paper accepted to ECCV 2020.
  • 02/2020: Two papers accepted to CVPR 2020.
  • 09/2018: UCSD ECE Department Fellowship.
  • 07/2018: One paper accepted to ECCV 2018.
  • 07/2017: One paper accepted to ICCV 2017 (spotlight).
  • 09/2016: NTHU EE Admission Scholarship.

Projects


VC-Inspector: Advancing Reference-free Evaluation of Video Captions with Factual Analysis
Shubhashis Roy Dipta*, Tz-Ying Wu*, Subarna Tripathi (* indicates equal contribution)
The 64th Annual Meeting of the Association for Computational Linguistics (ACL), 2026
[ Arxiv | Website ]
Conference paper
Harnessing Object Grounding for Time Sensitive Video Understanding
Tz-Ying Wu, Sharath Nittur Sridhar, Subarna Tripathi
IEEE Winter Conference on Applications of Computer Vision (WACV), 2026
[ Arxiv ]
Conference paper
EASG-Bench: Video Q&A Benchmark with Egocentric Action Scene Graphs
Ivan Rodin*, Tz-Ying Wu*, Kyle Min, Sharath Nittur Sridhar, Antonino Furnari, Subarna Tripathi, and Giovanni Maria Farinella (* indicates equal contribution)
IEEE International Conference on Computer Vision (ICCV) Workshop, 2025
[ Arxiv | Website ]
Workshop paper
Toward Scalable Video Narration: A Training-free Approach Using Multimodal Large Language Models
Tz-Ying Wu*, Tahani Trigui*, Sharath Nittur Sridhar, Anand Bodas, and Subarna Tripathi (* indicates equal contribution)
IEEE International Conference on Computer Vision (ICCV) Workshop, 2025
[ Arxiv ]
Workshop paper
Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
Tz-Ying Wu, Kyle Min, Subarna Tripathi, Nuno Vasconcelos
IEEE Winter Conference on Applications of Computer Vision (WACV), 2025
[ Website | Arxiv | Supplemental | BibTeX | Code ]
Conference paper
ProTeCt: Prompt Tuning for Taxonomic Open Set Classification
Tz-Ying Wu*, Chih-Hui Ho*, Nuno Vasconcelos (* indicates equal contribution)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[ Website | Arxiv | Supplemental | BibTeX | Code ]
Conference paper
Single-Stage Visual Relationship Learning using Conditional Queries
Alakh Desai, Tz-Ying Wu, Subarna Tripathi, Nuno Vasconcelos
Conference on Neural Information Processing Systems (NeurIPS), 2022
[ Arxiv | Supplemental | BibTeX ]
Conference paper
Class-Incremental Learning with Strong Pre-trained Models
Tz-Ying Wu, Gurumurthy Swaminathan, Zhizhong Li, Avinash Ravichandran, Nuno Vasconcelos, Rahul Bhotika, Stefano Soatto
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[ Arxiv | Supplemental | BibTeX | Code ]
Conference paper
Learning of Visual Relations: The Devil is in the Tails
Alakh Desai*, Tz-Ying Wu*, Subarna Tripathi and Nuno Vasconcelos (* indicates equal contribution)
IEEE International Conference on Computer Vision (ICCV), 2021
[ Website | Arxiv | Supplemental | BibTeX | Code ]
Conference paper
Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier
Tz-Ying Wu, Pedro Morgado, Pei Wang, Chih-Hui Ho, and Nuno Vasconcelos
European Conference on Computer Vision (ECCV), 2020
[ Website | Arxiv | Supplemental | BibTeX | Code ]
Conference paper
Explainable Object-induced Action Decision for Autonomous Vehicles
Yiran Xu, Xiaoyin Yang, Lihang Gong, Hsuan-Chu Lin, Tz-Ying Wu, Yunsheng Li, Nuno Vasconcelos
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
[ Website | Arxiv | Code ]
Conference paper
Exploit Clues from Views: Self-Supervised and Regularized Learning for Multiview Object Recognition
Chih-Hui Ho, Bo Liu, Tz-Ying Wu, Nuno Vasconcelos
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
[ Website | Arxiv | Supplemental | BibTeX | Code ]
Conference paper
Liquid Pouring Monitoring via Rich Sensory Inputs
Tz-Ying Wu*, Juan-Ting Lin*, Tsun-Hsuang Wang, Chan-Wei Hu, Juan Carlos Niebles, Min Sun (* indicates equal contribution)
European Conference on Computer Vision (ECCV), 2018
[ Website | Arxiv ]
Conference paper
Anticipating Daily Intention using On-Wrist Motion Triggered Sensing
Tz-Ying Wu*, Ting-An Chien*, Cheng-Sheng Chan, Chan-Wei Hu, Min Sun (* indicates equal contribution)
IEEE International Conference on Computer Vision (ICCV Spotlight), 2017
[ Website | Arxiv | Code ]
Conference paper
Recognition from Hand Cameras: A Revisit with Deep Learning
Cheng-Sheng Chan, Ting-An Chien, Tz-Ying Wu, Min Sun
Asian Conference on Computer Vision (ACCV), 2016
[ Demo1 | Demo2 ]
Conference Demo

Experiences


Staff AI Research Scientist at Intel
Present
Graduate Student Researcher at Statistical Visual Computing Lab, UCSD
Fall 2018 - Spring 2024
Machine Learning Research Intern at Intel AI Lab
Summer 2023
Teaching Assistant of Machine Learning for Physical Applications at UCSD
Spring 2023, Spring 2024
Teaching Assistant of Introduction to Machine Learning Algorithms at UCSD
Winter 2023
Teaching Assistant of Statistical Learning I at UCSD
Fall 2021, Fall 2022
Applied Scientist Intern at Amazon AWS AI
Summer 2021
Teaching Assistant of Elements of Machine Intelligence at UCSD
Winter 2020
Teaching Assistant of Statistical Learning II at UCSD
Winter 2019
Graduate Research Assistant at Vision Science Lab, NTHU
2016 - 2018
Teaching Assistant of Deep Learning Lab at UNITEC - NTHU Summer School
Summer 2017
Teaching Assistant of Signals and Systems at NTHU
Spring 2015, Spring 2016
Teaching Assistant of Introduction to Programming at NTHU
Fall 2015
Intern at IMEC-Taiwan CO. (R&D department)
Summer 2015

Education


Ph.D. in ECE, University of California San Diego, USA
Sep. 2018 - Jun. 2024
M.S. in EE, National Tsing Hua University, Hsinchu, Taiwan
Jan. 2016 - Jan. 2018
B.S. in EE, National Tsing Hua University, Hsinchu, Taiwan
Sep. 2012 - Jan. 2016