Akshita Gupta
I am an ELLIS PhD student at TU Darmstadt, co-supervised by Prof. Marcus Rohrbach and Dr. Federico Tombari at Google Zurich.
I completed my MASc at the University of Guelph, where I was advised by Prof. Graham Taylor. During that time, I was also a student researcher at the Vector Institute.
I was fortunate to spend time as a research intern at Apple under Dr. Tatiana Likhomanenko, Microsoft under Gaurav Mittal and Mei Chen, Vector Institute under Dr. David Emerson, and as a scientist in residence at NextAI with Prof. Graham Taylor.
Before academia, I worked as a Data Scientist at Bayanat, where I focused on projects related to detection and segmentation. Prior to that, I was a Research Engineer at the Inception Institute of Artificial Intelligence (IIAI), working with Dr. Sanath Narayan, Dr. Salman Khan, and Dr. Fahad Shahbaz Khan.
Email / Google Scholar / Twitter / GitHub / Resume/CV
TU Darmstadt 2025-Present
Apple 2024-2025
University of Guelph 2022-2024
Vector Institute 2022-2024
Microsoft Research 2023-2024
NextAI 2024
Bayanat for Mapping & Surveying 2022
Inception Institute of Artificial Intelligence 2018-2022
What's New
[Mar 2025] | Excited to be an ELLIS PhD student at TU Darmstadt under Prof. Marcus Rohrbach and Dr. Federico Tombari (Google Zurich)!
[Oct 2024] | Defended my Master's thesis and graduated.
[Nov 2024] | Our paper Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis is now on arXiv!
[Jun 2024] | Joined Apple as a Research Intern!
[May 2024] | Serving as a Scientist-in-Residence at NextAI.
[Jan 2024] | Our paper Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization was accepted at WACV 2025 (Oral)!
[Dec 2023] | Our work Open-Vocabulary Temporal Action Localization using Multimodal Guidance was accepted at BMVC 2024!
[Jun 2023] | Our paper Generative Multi-Label Zero-Shot Learning was accepted at TPAMI 2023.
[Jun 2023] | Started interning at Microsoft with the ROAR team.
[Jan 2023] | Interned at the Vector Institute with the AI Engineering team.
[Sep 2022] | Joined Prof. Graham Taylor's lab and the Vector Institute.
[Mar 2022] | OW-DETR accepted at CVPR 2022.
[Sep 2021] | Reviewer for CVPR 2023, CVPR 2022, ECCV 2022, ICCV 2021, TPAMI.
[Jul 2021] | BiAM accepted at ICCV 2021.
[Feb 2021] | Serving as a reviewer for ML Reproducibility Challenge 2020.
[Jan 2021] | Paper out on arXiv: Generative Multi-Label Zero-Shot Learning.
[Jul 2020] | TF-VAEGAN accepted at ECCV 2020.
[Aug 2019] | A Large-scale Instance Segmentation Dataset for Aerial Images (iSAID) available for download.
[Aug 2018] | One paper accepted at Interspeech, CHiME Workshop 2018.
[May 2018] | Selected as an Outreachy intern with Mozilla.
Conference and Journal Reviewing
CVPR (2022–2025) |
ECCV (2022, 2024) |
ICCV (2021) |
TPAMI (Journal)
Invited Talks
- [Mar 2025] Gave a talk at the UCF CRCV lab. Thank you, Prof. Shah, for hosting me!
- [Dec 2021] Computer Vision Talks (YouTube Link)
Research Interests
I am broadly interested in building scalable multimodal models that combine vision, language, and speech, with a focus on efficient modeling, temporal understanding, and open-world generalization.
Publications