Google at ICCV 2025
Google is proud to be an Ultimate Sponsor of the International Conference on Computer Vision (ICCV 2025), a premier conference in the field, which is being held Sunday, October 19th through Thursday, October 23rd in Honolulu, Hawaii. This year, researchers from across Google are contributing at all levels, with 39 accepted papers, active involvement in 63 workshops and tutorials, and several in-booth demo sessions.
Attending ICCV 2025 in person? Stop by the Google booth to learn more about how we’re actively pursuing the latest innovations in computer vision. Visit the @GoogleResearch X and Google Research LinkedIn accounts for announcements about Google booth activities (e.g., demos and Q&A sessions, which are also listed below).
Continue below to learn more about how Google researchers are engaged at ICCV 2025 (Google affiliations highlighted in bold).
All session times are provided in HST. Dates and times may be subject to change.
Demos and Q&A at the Google Booth
Stop by the Google booth (#305) for more details.
-
Tue, Oct 21 | 12:00PM - 1:30PM
Discover Android XR
Presenters: Federico Tombari, Sean Fanello, Yannick Strümpler, Martin Sundermeyer
-
Tue, Oct 21 | 3:00PM - 3:30PM
Visual Intelligence: Video Models are Zero-Shot Learners and Reasoners
Presenter: Priyank Jaini
-
Tue, Oct 21 | 4:00PM - 4:30PM
Video Generation at YouTube
Presenters: Orly Liba, Mitchell McIntire, William Zhu
-
Wed, Oct 22 | 11:30AM - 12:00PM
Research to Reality: A Google Cloud AI Interactive
Presenter: Ran Li
-
Wed, Oct 22 | 12:30PM - 1:00PM
Efficient Model Training Through Coreset Selection
Presenter: Elisa Tsai
-
Wed, Oct 22 | 2:30PM - 3:00PM
Wizard of Oz: An Experiential Time Machine Powered by Google AI Now Playing at the Las Vegas Sphere
Presenters: Irfan Essa, Steven Hickson, Albert Shaw
-
Wed, Oct 22 | 3:30PM - 4:00PM
Nano Banana: The Latest Gemini Multimodal Generation Capabilities
Presenter: Qifei Wang
-
Thu, Oct 23 | 11:30AM - 12:00PM
Discover Android XR
Presenters: Federico Tombari, Sean Fanello, Yannick Strümpler, Martin Sundermeyer
Tutorials
-
Sun, Oct 19 | 1:00PM - 5:00PM
Benchmarking Egocentric Visual-Inertial SLAM at City Scale
Organizers: Paul-Edouard Sarlin
-
Sun, Oct 19 | 9:00AM - 5:45PM
Learning Deep Low-Dimensional Models from High-Dimensional Data: From Theory to Practice
Panelists: Berivan Isik
-
Mon, Oct 20 | 1:00PM - 5:10PM
RANSAC in 2025
Organizers: Daniel Barath
-
Mon, Oct 20 | 8:30AM - 12:00PM
Towards Comprehensive Reasoning in Vision-Language Models
Organizers: Ming-Hsuan Yang
Accepted Papers
4D Gaussian Splatting SLAM
Yanyan Li, Youxu Fang, Zunjie Zhu, Kunyi Li, Yong Ding, Federico Tombari
AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion
Liuyue Xie, Jiancong Guo, Ozan Cakmakci, Andre Araujo, László A. Jeni, Zhiheng Jia
Bolt3D: Generating 3D Scenes in Seconds
Stanislaw Szymanowicz, Jason Y. Zhang, Pratul Srinivasan, Ruiqi Gao, Arthur Brussee, Aleksander Hołyński, Ricardo Martin-Brualla, Jonathan T. Barron, Philipp Henzler
CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from a Single-View Image
Wonseok Roh, Hwanhee Jung, Jong Wook Kim, Seunggwan Lee, Innfarn Yoo, Andreas Lugmayr, Seunggeun Chi, Karthik Ramani, Sangpil Kim
CL-Splats: Continual Learning of Gaussian Splatting with Local Optimization
Jan Ackermann, Jonas Kulhanek, Shengqu Cai, Haofei Xu, Marc Pollefeys, Gordon Wetzstein, Leonidas J. Guibas, Songyou Peng
Consistent Time-of-Flight Depth Denoising via Graph-Informed Geometric Attention
Weida Wang, Changyong He, Jin Zeng, Di Qiu
Contrastive Test-Time Composition of Multiple LoRA Models for Image Generation
Tuna Han Salih Meral, Enis Simsar, Federico Tombari, Pinar Yanardag
Erasing More Than Intended? How Concept Erasure Degrades the Generation of Non-Target Concepts
Ibtihel Amara, Ahmed Imtiaz Humayun, Ivana Kajic, Zarana Parekh, Natalie Harris, Sarah Young, Chirag Nagpal, Najoung Kim, Junfeng He, Cristina Nader Vasconcelos, Deepak Ramachandran, Golnoosh Farnadi, Katherine Heller, Mohammad Havaei, Negar Rostamzadeh
FastPoint: Accelerating 3D Point Cloud Model Inference via Sample Point Distance Prediction
Donghyun Lee, Dawoon Jeong, Jae W. Lee, Hongil Yoon
From Image to Video: An Empirical Study of Diffusion Representations
Pedro Vélez, Luisa F. Polanía, Yi Yang, Chuhan Zhang, Rishabh Kabra, Anurag Arnab, Mehdi S. M. Sajjadi
From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition
Ling Lo, Kelvin C.K. Chan, Wen-Huang Cheng, Ming-Hsuan Yang
Global Motion Corresponder for 3D Point-Based Scene Interpolation under Large Motion
Junru Lin, Chirag Vashist, Mikaela Angelina Uy, Colton Stearns, Xuan Luo, Leonidas Guibas, Ke Li
Improving Rectified Flow with Boundary Conditions
Xixi Hu, Runlong Liao, Keyang Xu, Bo Liu, Yeqing Li, Eugene Ie, Hongliang Fei, Qiang Liu
LayerLock: Non-Collapsing Representation Learning with Progressive Freezing
Goker Erdogan, Nikhil Parthasarathy, Catalin Ionescu, Drew Hudson, Alexander Lerchner, Andrew Zisserman, Mehdi Sajjadi, João Carreira
Magic Insert: Style-Aware Drag-and-Drop
Nataniel Ruiz, Yuanzhen Li, Neal Wadhwa, Yael Pritch, Michael Rubinstein, David E. Jacobs, Shlomi Fruchter
MINERVA: Evaluating Complex Video Reasoning
Arsha Nagrani, Sachit Menon*, Ahmet Iscen, Shyamal Buch, Ramin Mehran, Nilpa Jha, Anja Hauth, Yukun Zhu, Carl Vondrick, Mikhail Sirotenko, Cordelia Schmid, Tobias Weyand
MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-Modal Bottleneck Fusion and Calibrated Decoder Pruning
Mattia Segù, Marta Tintore Gazulla, Yongqin Xian, Luc Van Gool, Federico Tombari
ModalTune: Fine-Tuning Slide-Level Foundation Models with Multi-Modal Information for Multi-Task Learning in Digital Pathology
Vishwesh Ramanathan, Tony Xu, Pushpak Pati, Faruk Ahmed, Maged Goubran, Anne L. Martel
MoMaps: Semantics-Aware Scene Motion Generation with Motion Maps
Jiahui Lei, Kyle Genova, George Kopanas, Noah Snavely, Leonidas Guibas
Motal: Unsupervised 3D Object Detection by Modality and Task-specific Knowledge Transfer
Nithin Gopalakrishnan Nair, Srinivas Kaza, Xuan Luo, Vishal M. Patel, Stephen Lombardi, Jungyeon Park
ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation
Daniel Winter, Asaf Shul, Matan Cohen, Dana Berman, Yael Pritch, Alex Rav-Acha, Yedid Hoshen
Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation
Akshay Krishnan*, Xinchen Yan, Vincent Casser, Abhijit Kundu
Prior2Former - Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation
Sebastian Schmidt, Julius Körner, Dominik Fuchsgruber, Stefano Gasperini, Federico Tombari, Stephan Günnemann
Radiant Foam: Real-Time Differentiable Ray Tracing
Shrisudhan Govindarajan, Daniel Rebain, Kwang Moo Yi, Andrea Tagliasacchi
RoMo: Robust Motion Segmentation Improves Structure from Motion
Lily Goli, Sara Sabour, Mark Matthews, Marcus Brubaker, Dmitry Lagun, Alec Jacobson, David J. Fleet, Saurabh Saxena, Andrea Tagliasacchi
SciVid: Cross-Domain Evaluation of Video Models in Scientific Applications
Yana Hasson, Pauline Luc, Liliane Momeni, Maks Ovsjanikov, Guillaume Le Moing, Alina Kuznetsova, Ira Ktena, Jennifer J. Sun, Skanda Koppula, Dilara Gokay, Joseph Heyward, Etienne Pot, Andrew Zisserman
Shape of Motion: 4D Reconstruction From a Single Video
Qianqian Wang, Vickie Ye, Hang Gao, Jake Austin, Zhengqi Li, Angjoo Kanazawa
Spectral Image Tokenizer
Carlos Esteves, Mohammed Suhail, Ameesh Makadia
SplatTalk: 3D VQA with Gaussian Splatting
Anh Thai*, Songyou Peng, Kyle Genova, Leonidas Guibas, Thomas Funkhouser
StochasticSplats: Stochastic Rasterization for Sorting-Free 3D Gaussian Splatting
Shakiba Kheradmand, Delio Vicini, George Kopanas*, Dmitry Lagun, Kwang Moo Yi, Mark Matthews, Andrea Tagliasacchi
TAB: Transformer Attention Bottlenecks Enable User Intervention and Debugging in Vision-Language Models
Pooyan Rahmanzadehgervi, Hung Huy Nguyen, Rosanne Liu, Long Mai, Anh Totti Nguyen
TAPNext: Tracking Any Point (TAP) as Next Token Prediction
Artem Zholus*, Carl Doersch, Yi Yang, Skanda Koppula, Viorica Patraucean, Xu Owen He, Ignacio Rocco, Mehdi S. M. Sajjadi, Sarath Chandar, Ross Goroshin
Toward Material-Agnostic System Identification from Videos
Yizhou Zhao, Haoyu Chen, Chunjiang Liu, Zhenyang Li, Charles Herrmann, Junhwa Hur, Yinxiao Li, Ming-Hsuan Yang, Bhiksha Raj, Min Xu
Understanding Museum Exhibits using Vision-Language Reasoning
Ada-Astrid Balauca, Sanjana Garai, Stefan Balauca, Rasesh Udayakumar Shetty, Naitik Agrawal, Dhwanil Subhashbhai Shah, Yuqian Fu, Xi Wang, Kristina Toutanova, Danda Pani Paudel, Luc Van Gool
UIP2P: Unsupervised Instruction-Based Image Editing via Edit Reversibility Constraint
Enis Simsar, Alessio Tonioni, Yongqin Xian, Thomas Hofmann, Federico Tombari
UniRes: Universal Image Restoration for Complex Degradations
Mo Zhou*, Keren Ye, Mauricio Delbracio, Peyman Milanfar, Vishal M. Patel, Hossein Talebi
Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images
Boyang Deng, Songyou Peng, Kyle Genova, Gordon Wetzstein, Noah Snavely, Leonidas Guibas, Thomas Funkhouser
Visual Intention Grounding for Egocentric Assistants
Pengzhan Sun, Junbin Xiao, Tze Ho Elden Tse, Yicong Li, Arjun Akula, Angela Yao
Workshops
-
Sun, Oct 19 | 8:00AM - 12:30PM
Affective & Behavior Analysis in-the-Wild
Organizer: Stefanos Zafeiriou
-
Sun, Oct 19 | 9:00AM - 5:00PM
Artificial Social Intelligence Workshop
Organizer: Leena Mathur
-
Sun, Oct 19 | 8:30AM - 12:15PM
Benchmarking Multi-Target Tracking: Towards Spatiotemporal Action Grounding in Videos
Organizer: Jindong Gu
-
Sun, Oct 19 | 9:00AM - 12:35PM
Binocular Egocentric-360 Multi-Modal Scene Understanding in the Wild
Speaker: Dima Damen
-
Sun, Oct 19 | 1:30PM - 5:00PM
Comic Intelligence Quotient: Advances and Challenges in AI-Driven Comic Analysis
Organizer: Andrew Zisserman
-
Sun, Oct 19 | 8:30AM - 5:20PM
Computer Vision for Automated Medical Diagnosis
Speaker: Shekoofeh Azizi
-
Sun, Oct 19 | 9:00AM - 5:00PM
Computer Vision for Developing Countries
Organizer: Du Tran
-
Sun, Oct 19 | 8:30AM - 12:10PM
Computer Vision for Physiological Measurement (CVPM)
Organizer: Daniel McDuff
-
Sun, Oct 19 | 8:30AM - 12:30PM
DataCV Workshop and Challenge
Organizer: José Lezama
Speaker: Dima Damen
-
Sun, Oct 19 | 9:00AM - 12:15PM
Driving Simulation from Real-World Data: How Well Can We Render and Drive?
Speaker: Jyh-Jing Hwang
-
Sun, Oct 19 | 9:00AM - 12:30PM
Foundation Data for Industrial Tech Transfer
Organizer: Keisuke Tateno
Speaker: Federico Tombari
-
Sun, Oct 19 | 1:00PM - 5:00PM
Graphic Design Understanding and Generation
Speaker: Tali Dekel
-
Sun, Oct 19 | 9:00AM - 5:30PM
Human-Interactive Generation and Editing
Speaker: Saining Xie
-
Sun, Oct 19 | 8:30AM - 12:30PM
Instance-Level Recognition and Generation Workshop
Organizers: Andre Araujo, Bingyi Cao, Kaifeng Chen, Guangxing Han
-
Sun, Oct 19 | 1:00PM - 5:00PM
Knowledge-Intensive Multimodal Reasoning Workshop
Organizer: Zhenting Qi
Speakers: Yilun Du, Kang-Fu Mei
-
Sun, Oct 19 | 1:00PM - 5:00PM
Multi-Modal Foundation Models for Cancer Detection and Prevention
Organizers: Shekoofeh Azizi, Yun Liu, Daniel Golden
Speaker: Daniel McDuff
-
Sun, Oct 19 | 1:00PM - 5:00PM
Multimodal Continual Learning
Organizer: Sayna Ebrahimi
Speaker: Marc'Aurelio Ranzato
-
Sun, Oct 19 | 1:00PM - 5:00PM
Neural SLAM Workshop
Speaker: Federico Tombari
-
Sun, Oct 19 | 1:30PM - 6:10PM
Open-World 3D Scene Understanding
Organizers: Johanna Wald, Federico Tombari, Leonidas Guibas
Speaker: Saining Xie
-
Sun, Oct 19 | 9:00AM - 5:00PM
P13N: Personalization in Generative AI Workshop
Organizer: Federico Tombari
Speaker: Nataniel Ruiz
-
Sun, Oct 19 | 9:00AM - 5:00PM
Perception Test Challenge
Organizers: Joe Heyward, Nikhil Parthasarathy, Tyler Zhu, Dima Damen, João Carreira, Andrew Zisserman, Viorica Patraucean
Physics-IQ Challenge Guest Track Contributors: Robert Geirhos, Priyank Jaini
-
Sun, Oct 19 | 1:00PM - 5:00PM
Recovering 6D Object Pose (R6D)
Organizer: Martin Sundermeyer
-
Sun, Oct 19 | 9:00AM - 3:15PM
Safe and Trustworthy Multimodal AI Systems
Organizer: Jindong Gu
Speaker: Yao Qin
-
Sun, Oct 19 | 8:40AM - 12:30PM
SLoMo: Story-Level Movie Understanding & Audio Description
Organizers: Junyu Xie, Tengda Han, Max Bain, Arsha Nagrani, Andrew Zisserman
Speaker: Amy Pavel
-
Sun, Oct 19 | 8:50AM - 5:30PM
Structural Priors for Vision
Organizers: Daniel Zoran, Leonidas Guibas
Speakers: João Carreira, Saining Xie
-
Sun, Oct 19 | 1:00PM - 5:00PM
Sustainability with Earth Observation and AI
Organizer: Gengchen Mai
Speaker: Christopher F. Brown
-
Sun, Oct 19 | 1:30PM - 5:00PM
Transparent & Reflective Objects in the Wild Challenges
Speaker: Andrea Tagliasacchi
-
Sun, Oct 19 | 1:00PM - 5:00PM
Visual Quality Assessment Competition
Organizer: Shuo Xing
Speaker: Balu Adsumilli
-
Mon, Oct 20 | 8:00AM - 12:30PM
AI for 3D Content Creation
Organizers: Nikos Kolotouros, Leonidas Guibas
Speaker: Philipp Henzler
-
Mon, Oct 20 | 9:00AM - 5:00PM
AI for Content Generation, Quality Enhancement and Streaming
Speaker: Agrim Gupta
-
Mon, Oct 20 | 1:00PM - 5:00PM
Audio-Visual Generation and Learning
Organizers: Rodrigo Mira, Ming-Hsuan Yang
Speaker: Bill Freeman
-
Mon, Oct 20 | 9:00AM - 5:30PM
Category-Level Object Pose Estimation for Robotic Manipulation
Organizer: Leonidas Guibas
-
Mon, Oct 20 | 8:30AM - 6:00PM
Closing the Loop Between Vision and Language (Decade Mark)
Speaker: Aishwarya Agrawal
-
Mon, Oct 20 | 8:30AM - 5:30PM
Continual Learning in Computer Vision
Speakers: Eleni Triantafillou, Tengda Han
-
Mon, Oct 20 | 8:20AM - 12:20PM
Distillation of Foundation Models for Autonomous Driving
Organizers: Mahmut Yurt, Xu Cao
-
Mon, Oct 20 | 9:00AM - 5:00PM
Ego-Exo Sensing for Smart Mobility
Organizer: Zhenzhen Liu
-
Mon, Oct 20 | 8:30AM - 12:30PM
Fairness and Ethics in AI: Facing the ChalLEnge through Model Debiasing
Speaker: Jindong Gu
-
Mon, Oct 20 | 9:00AM - 5:00PM
Findings of the ICCV
Organizer: Boqing Gong
-
Mon, Oct 20 | 8:30AM - 12:00PM
From Street to Space: 3D Vision AcrosS altiTudes
Speaker: Noah Snavely
-
Mon, Oct 20 | 8:50AM - 12:40PM
Generative Scene Completion for Immersive Worlds
Speakers: Andrea Tagliasacchi, Aleksander Hołyński
-
Mon, Oct 20 | 8:45AM - 6:00PM
GeoFreeNVS: Geometry-Free Novel View Synthesis and Controllable Video Models
Organizers: Andrea Tagliasacchi, Marcus Brubaker, Boyang Deng, Leonidas Guibas
Speakers: Ruiqi Gao, Agrim Gupta, Noah Snavely, Ali Eslami
-
Mon, Oct 20 | 1:30PM - 5:30PM
Human-Robot-Scene Interaction and Collaboration
Organizer: Fangchen Liu
-
Mon, Oct 20 | 1:00PM - 5:00PM
Large Scale Cross Device Localization
Organizer: Linfei Pan
-
Mon, Oct 20 | 8:00AM - 5:00PM
Large-Scale Video Object Segmentation
Speaker: Ming-Hsuan Yang
-
Mon, Oct 20 | 1:00PM - 5:00PM
Long Multi-Scene Video Foundations
Organizers: Regev Cohen, Sivan Doveh, Inbar Mosseri
-
Mon, Oct 20 | 8:50AM - 12:30PM
Mobile Intelligent Photography and Imaging
Speaker: Ming-Hsuan Yang
-
Mon, Oct 20 | 8:50AM - 6:00PM
Multi-Modal Reasoning for Agentic Intelligence
Speaker: Ranjay Krishna
-
Mon, Oct 20 | 8:00AM - 5:00PM
Multimodal Reasoning and Slow Thinking in Large Model Era: Towards System2 and Beyond
Speaker: Saining Xie
-
Mon, Oct 20 | 8:30AM - 12:30PM
Multimodal Representation and Retrieval
Speaker: Cordelia Schmid
-
Mon, Oct 20 | 1:00PM - 5:30PM
Multimodal Spatial Intelligence
Organizers: Songyou Peng, Kyle Genova, Thomas Funkhouser, Leonidas Guibas, Saining Xie
Speakers: Saining Xie, Ranjay Krishna
-
Mon, Oct 20 | 1:00PM - 4:50PM
Responsible Imaging
Speaker: Bill Freeman
-
Mon, Oct 20 | 9:00AM - 5:30PM
Robust and Interactable World Models in Computer Vision
Speakers: Tali Dekel, Jack Parker-Holder, Sherry Yang, Yilun Du
-
Mon, Oct 20 | 1:30PM - 5:30PM
Scene Graphs and Graph Representation Learning
Organizer: Federico Tombari
-
Mon, Oct 20 | 1:00PM - 5:50PM
TrustFM: Workshop on Trustworthy Foundation Models
Organizer: Mu Cai
Speaker: Yao Qin
-
Mon, Oct 20 | 8:25AM - 6:00PM
Vision Foundation Models and Generative AI for Accessibility: Challenges and Opportunities
Organizer: Jon Froehlich
-
Mon, Oct 20 | 1:30PM - 5:40PM
Visual Object Tracking and Segmentation Challenge Workshop
Speaker: Ming-Hsuan Yang
-
Mon, Oct 20 | 8:00AM - 12:00PM
What is Next in Multimodal Foundation Models?
Speaker: Saining Xie
Panelist: Saining Xie
-
Mon, Oct 20 | 9:00AM - 4:35PM
Wild3D: 3D Modeling, Reconstruction, and Generation in the Wild
Speaker: Noah Snavely
Board & Organizing Committee
-
Ramin Zabih
- General Chair
-
Saining Xie
- Broadening Participation Chair
-
Lijie Fan
- Tutorial Chair
-
Bohyung Han
- Program Chair
-
Deqing Sun
- Program Chair
-
Boqing Gong
- Workshop Chair
-
Nataniel Ruiz
- Publicity Chair