Stanford AI Lab Papers at ICCV 2021

The International Conference on Computer Vision (ICCV 2021) will be hosted virtually next week. We’re excited to share all the work from SAIL that will be presented, and you’ll find links to papers, videos and blogs below. Feel free to reach out to the contact authors directly to learn more about the work that’s happening at Stanford!

List of Accepted Papers

GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-efficient Medical Image Recognition


Authors: Mars Huang

Contact: mschuang@stanford.edu

Keywords: medical image, self-supervised learning, multimodal fusion


3D Shape Generation and Completion Through Point-Voxel Diffusion


Authors: Linqi Zhou, Yilun Du, Jiajun Wu

Contact: linqizhou@stanford.edu

Links: Paper | Video | Website

Keywords: diffusion, shape generation


CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds


Authors: Yijia Weng*, He Wang*, Qiang Zhou, Yuzhe Qin, Yueqi Duan, Qingnan Fan, Baoquan Chen, Hao Su, Leonidas J. Guibas

Contact: yijiaw@stanford.edu

Award nominations: Oral Presentation

Links: Paper | Video | Website

Keywords: category-level object pose tracking, articulated objects


Detecting Human-Object Relationships in Videos


Authors: Jingwei Ji, Rishi Desai, Juan Carlos Niebles

Contact: jingweij@cs.stanford.edu

Links: Paper

Keywords: human-object relationships, video, detection, transformer, spatio-temporal reasoning


Geography-Aware Self-Supervised Learning


Authors: Kumar Ayush, Burak Uzkent, Chenlin Meng, Kumar Tanmay, Marshall Burke, David Lobell, Stefano Ermon

Contact: kayush@cs.stanford.edu, chenlin@stanford.edu

Links: Paper | Website

Keywords: self-supervised learning, contrastive learning, remote sensing, spatio-temporal, classification, object detection, segmentation


HuMoR: 3D Human Motion Model for Robust Pose Estimation


Authors: Davis Rempe, Tolga Birdal, Aaron Hertzmann, Jimei Yang, Srinath Sridhar, Leonidas Guibas

Contact: drempe@stanford.edu

Award nominations: Oral Presentation

Links: Paper | Website

Keywords: 3d human pose estimation; 3d human motion; generative modeling


Learning Privacy-preserving Optics for Human Pose Estimation


Authors: Carlos Hinojosa, Juan Carlos Niebles, Henry Arguello

Contact: carlos.hinojosa@saber.uis.edu.co

Links: Paper | Website

Keywords: computational photography; fairness, accountability, transparency, and ethics in vision; gestures and body pose


Learning Temporal Dynamics from Cycles in Narrated Video


Authors: Dave Epstein, Jiajun Wu, Cordelia Schmid, Chen Sun

Contact: jiajunwu@cs.stanford.edu

Links: Paper | Website

Keywords: multi-modal learning, cycle consistency, video


Vector Neurons: A General Framework for SO(3)-Equivariant Networks


Authors: Congyue Deng, Or Litany, Yueqi Duan, Adrien Poulenard, Andrea Tagliasacchi, Leonidas Guibas

Contact: congyue@stanford.edu

Links: Paper | Video | Website

Keywords: point cloud network, rotation equivariance, rotation invariance


Neural Radiance Flow for 4D View Synthesis and Video Processing


Authors: Yilun Du, Yinan Zhang, Hong-Xing Yu, Joshua B. Tenenbaum, Jiajun Wu

Contact: jiajunwu@cs.stanford.edu

Links: Paper | Website

Keywords: 4d representation, neural rendering, video processing


Where2Act: From Pixels to Actions for Articulated 3D Objects


Authors: Kaichun Mo, Leonidas J. Guibas, Mustafa Mukadam, Abhinav Gupta, Shubham Tulsiani

Contact: kaichunm@stanford.edu

Links: Paper | Website

Keywords: 3d computer vision, robotic vision, affordance learning, robot learning


Low-Shot Validation: Active Importance Sampling for Estimating Classifier Performance on Rare Categories


Authors: Fait Poms*, Vishnu Sarukkai*, Ravi Teja Mullapudi, Nimit S. Sohoni, William R. Mark, Deva Ramanan, Kayvon Fatahalian

Contact: sarukkai@stanford.edu

Links: Paper | Blog | Video

Keywords: model evaluation, active learning



We look forward to seeing you at ICCV 2021!
