Georgios Pavlakos
pavlakos@cs.utexas.edu

I am an Assistant Professor in the Department of Computer Science at the University of Texas at Austin. Before that, I was a Postdoctoral Researcher at UC Berkeley, advised by Angjoo Kanazawa and Jitendra Malik. I completed my PhD in Computer Science at the University of Pennsylvania with my advisor, Kostas Daniilidis. I did my undergraduate studies at the National Technical University of Athens, where I worked with Petros Maragos. During my PhD, I spent time at the Max Planck Institute in Tübingen, working with Michael Black.

Email / CV / Google Scholar / GitHub

News

New papers accepted to CVPR 2026! Physically Plausible HOI, HumanNOVA and HTD-Refine!
STAR accepted to ICRA 2026!
Our ICCV 2025 paper, RayZer, received the Best Student Paper Honorable Mention!
New papers accepted to ICCV 2025 and CoRL 2025! RayZer, Real3D and COLLAGE!
New papers accepted to CVPR 2025! HSMR, MegaSynth, FIction, ExpertAF and EgoAllo!
Atlas Gaussians accepted to ICLR 2025 as a Spotlight!
New papers accepted to CoRL 2024 and NeurIPS 2024! OKAMI, EVA and CoFie!
New papers accepted to CVPR 2024! HaMeR, BUDDI, GART and MultiPhys!
I started as an Assistant Professor of Computer Science at UT Austin in January 2024!
4D Humans accepted to ICCV 2023!
New papers accepted to CVPR 2023! SLAHMR, LART and Animals3D!
I will be an Area Chair for CVPR 2024, ICCV 2023, BMVC 2023 and 3DV 2024!
New paper accepted to ECCV 2022!
New papers accepted to CVPR 2022! PHALP and Multishot!
Interview at Computer Vision News - Best of ICCV selection!
I was recognized as Outstanding Reviewer for ICCV 2021 and 3DV 2021!
I received the Morris and Dorothy Rubinoff Award for the Best Computer Science Dissertation at UPenn!
I started as a Postdoctoral Researcher at UC Berkeley with Angjoo Kanazawa and Jitendra Malik!

Publications

	Recovering Physically Plausible Human-Object Interactions from Monocular Videos Dingbang Huang, Etienne Vouga, Qixing Huang, Georgios Pavlakos Computer Vision and Pattern Recognition (CVPR), 2026 (Highlight Paper) project page / code / bibtex We reconstruct human-object interactions inside a physics simulator to guarantee physical plausibility by design.
	HumanNOVA: Photorealistic, Universal and Rapid 3D Human Avatar Modeling from a Single Image Hezhen Hu, Wangbo Zhao, Lanqing Guo, Hanwen Jiang, Jonathan C. Liu, Zhiwen Fan, Kai Wang, Zhangyang Wang, Georgios Pavlakos Computer Vision and Pattern Recognition (CVPR), 2026 (Highlight Paper) project page / code / hugging face / bibtex We train a Large Reconstruction Model that can recover a photorealistic 3D human avatar from a single image.
	Natural Human Motion Recovery by Aligning High-Order Temporal Dynamics from Monocular Videos Dingkun Wei, Zehong Shen, Yan Xia, Georgios Pavlakos, Yujun Shen, Xiaowei Zhou Computer Vision and Pattern Recognition (CVPR), 2026 (Oral Presentation) (Best paper candidate - Top 0.5%) project page / bibtex We refine existing Human Motion Recovery methods with estimated 3D velocity and 3D acceleration to recover natural human motion in global coordinates.
	Searching in Space and Time: Unified Memory-Action Loops for Open-World Object Retrieval Taijing Chen, Sateesh Kumar, Junhong Xu, Georgios Pavlakos, Joydeep Biswas*, Roberto Martín-Martín* International Conference on Robotics and Automation (ICRA), 2026 project page / code / benchmark / bibtex STAR is a framework that enables object retrieval in dynamic environments by unifying memory-based reasoning about past observations with real-time spatial search.
	RayZer: A Self-supervised Large View Synthesis Model Hanwen Jiang, Hao Tan, Peng Wang, Haian Jin, Yue Zhao, Sai Bi, Kai Zhang, Fujun Luan, Kalyan Sunkavalli, Qixing Huang, Georgios Pavlakos International Conference on Computer Vision (ICCV), 2025 (Oral Presentation - Best Student Paper Honorable Mention) project page / code / bibtex We develop a self-supervised framework for learning a large view synthesis model with zero 3D supervision (no scene! no cameras!).
	Real3D: Towards Scaling Large Reconstruction Models with Real Images Hanwen Jiang, Qixing Huang, Georgios Pavlakos International Conference on Computer Vision (ICCV), 2025 project page / code / demo / bibtex We use in-the-wild images to scale up the training data of Large Reconstruction Models.
	COLLAGE: Adaptive Fusion-based Retrieval for Augmented Policy Learning Sateesh Kumar, Shivin Dass, Georgios Pavlakos, Roberto Martín-Martín Conference on Robot Learning (CoRL), 2025 project page / code / bibtex COLLAGE is a few-shot imitation learning method that augments training with retrieved demonstrations from large-scale datasets. Retrieval fuses multiple cues to select demonstrations that are most similar to the target task.
	Reconstructing Humans with a Biomechanically Accurate Skeleton Yan Xia, Xiaowei Zhou, Etienne Vouga, Qixing Huang, Georgios Pavlakos Computer Vision and Pattern Recognition (CVPR), 2025 (Oral Presentation) project page / code / colab / demo / bibtex We reconstruct humans using the SKEL model which provides a biomechanically accurate skeleton.
	MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data Hanwen Jiang, Zexiang Xu, Desai Xie, Ziwen Chen, Haian Jin, Fujun Luan, Zhixin Shu, Kai Zhang, Sai Bi, Xin Sun, Jiuxiang Gu, Qixing Huang, Georgios Pavlakos, Hao Tan Computer Vision and Pattern Recognition (CVPR), 2025 project page / code / bibtex Multi-view reconstruction is largely non-semantic, enabling scalable training with non-semantic synthesized data.
	FIction: 4D Future Interaction Prediction from Video Kumar Ashutosh, Georgios Pavlakos, Kristen Grauman Computer Vision and Pattern Recognition (CVPR), 2025 (Highlight Paper) project page / code / bibtex Given a video of a human activity, we predict which objects the person will interact with in the next time period, at which 3D locations, and how they will interact with them.
	ExpertAF: Expert Actionable Feedback from Video Kumar Ashutosh, Tushar Nagarajan, Georgios Pavlakos, Kris Kitani, Kristen Grauman Computer Vision and Pattern Recognition (CVPR), 2025 EgoVis 2024/2025 Distinguished Paper Award project page / bibtex Given a video of a person doing a physical activity, we generate free-form expert commentary and a visual demonstration with the required corrections.
	Estimating Body and Hand Motion in an Ego-sensed World Brent Yi, Vickie Ye, Maya Zheng, Yunqi Li, Lea Müller, Georgios Pavlakos, Yi Ma, Jitendra Malik, Angjoo Kanazawa Computer Vision and Pattern Recognition (CVPR), 2025 (Highlight Paper) project page / code / bibtex Given a video captured from an egocentric sensor, we reconstruct the body and hand motion of the person wearing the sensor.
	Atlas Gaussians Diffusion for 3D Generation Haitao Yang, Yuan Dong, Hanwen Jiang, Dejia Xu, Georgios Pavlakos, Qixing Huang International Conference on Learning Representations (ICLR), 2025 (Spotlight Presentation) project page / code / bibtex A 3D shape is modeled as a union of patches, where each patch can decode infinite 3D Gaussians.
	CoFie: Learning Compact Neural Surface Representations with Coordinate Fields Hanwen Jiang, Haitao Yang, Georgios Pavlakos, Qixing Huang Conference on Neural Information Processing Systems (NeurIPS), 2024 project page / code / bibtex Introduces a local surface representation that improves fitting on novel shape instances.
	Expressive Gaussian Human Avatars from Monocular RGB Video Hezhen Hu, Zhiwen Fan, Tianhao Wu, Yihan Xi, Seoyoung Lee, Georgios Pavlakos, Zhangyang Wang Conference on Neural Information Processing Systems (NeurIPS), 2024 project page / code / bibtex Using a single video to train an Expressive Gaussian Human Avatar.
	OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation Jinhan Li, Yifeng Zhu, Yuqi Xie, Zhenyu Jiang, Mingyo Seo, Georgios Pavlakos, Yuke Zhu Conference on Robot Learning (CoRL), 2024 (Oral Presentation) project page / bibtex Using a single video with a human demonstration to teach a humanoid robot to perform the same manipulation skill.
	Reconstructing Hands in 3D with Transformers Georgios Pavlakos, Dandan Shan, Ilija Radosavovic, Angjoo Kanazawa, David Fouhey, Jitendra Malik Computer Vision and Pattern Recognition (CVPR), 2024 project page / code / data / demo / bibtex Scaling up data and models for hand mesh recovery from images and video.
	Generative Proxemics: A Prior for 3D Social Interaction from Images Lea Müller, Vickie Ye, Georgios Pavlakos, Michael J. Black, Angjoo Kanazawa Computer Vision and Pattern Recognition (CVPR), 2024 project page / code / demo / bibtex A 3D generative model of two people in close social interaction.
	GART: Gaussian Articulated Template Models Jiahui Lei, Yufu Wang, Georgios Pavlakos, Lingjie Liu, Kostas Daniilidis Computer Vision and Pattern Recognition (CVPR), 2024 (Highlight Paper) project page / code / bibtex 3D Gaussian Splatting for non-rigid articulated subject capturing and rendering from monocular videos.
	MultiPhys: Multi-Person Physics-aware 3D Motion Estimation Nicolas Ugrinovic, Boxiao Pan, Georgios Pavlakos, Despoina Paschalidou, Bokui Shen, Jordi Sanchez-Riera, Francesc Moreno-Noguer, Leonidas Guibas Computer Vision and Pattern Recognition (CVPR), 2024 project page / code / bibtex Using a physics simulator to recover physically plausible multi-person 3D motion.
	Humans in 4D: Reconstructing and Tracking Humans with Transformers Shubham Goel, Georgios Pavlakos, Jathushan Rajasegaran, Angjoo Kanazawa, Jitendra Malik International Conference on Computer Vision (ICCV), 2023 project page / code / demo / bibtex A fully "transformerized" design for Human Mesh Recovery achieves improved precision and remarkable robustness for 3D human reconstruction and tracking!
	Decoupling Human and Camera Motion from Videos in the Wild Vickie Ye, Georgios Pavlakos, Jitendra Malik, Angjoo Kanazawa Computer Vision and Pattern Recognition (CVPR), 2023 project page / code / bibtex Reasoning about the global motion of humans in the world by decoupling human and camera motion.
	On the Benefits of 3D Pose and Tracking for Human Action Recognition Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Christoph Feichtenhofer, Jitendra Malik Computer Vision and Pattern Recognition (CVPR), 2023 project page / code / bibtex Using 3D human reconstruction and tracking to recognize atomic actions in video.
	Learning Articulated Shape with Keypoint Pseudo-labels from Web Images Anastasis Stathopoulos, Georgios Pavlakos, Ligong Han, Dimitris Metaxas Computer Vision and Pattern Recognition (CVPR), 2023 project page / code / bibtex Training models for 3D animal recovery with minimal annotations using large-scale collections of web images.
	The One Where They Reconstructed 3D Humans and Environments in TV Shows Georgios Pavlakos, Ethan Weber, Matthew Tancik, Angjoo Kanazawa European Conference on Computer Vision (ECCV), 2022 project page / code & data / bibtex Recovering 3D humans, cameras and static structure in TV shows.
	Tracking People by Predicting 3D Appearance, Location & Pose Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Jitendra Malik Computer Vision and Pattern Recognition (CVPR), 2022 (Oral Presentation) (Best paper finalist - Top 0.4%) project page / code / video / bibtex Predicting pose, appearance and location of people in 3D for monocular tracking.
	Human Mesh Recovery from Multiple Shots Georgios Pavlakos, Jitendra Malik, Angjoo Kanazawa Computer Vision and Pattern Recognition (CVPR), 2022 project page / code & data / video / bibtex Using information from multiple shots to improve reconstruction of humans in edited media.
	Tracking People with 3D Representations Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Jitendra Malik Neural Information Processing Systems (NeurIPS), 2021 project page / code / video / bibtex Performing monocular tracking of people by lifting them to 3D and then using 3D representations of their appearance, pose and location.
	Probabilistic Modeling for Human Mesh Recovery Nikos Kolotouros, Georgios Pavlakos, Dinesh Jayaraman, Kostas Daniilidis International Conference on Computer Vision (ICCV), 2021 project page / supplementary / code / bibtex Interview at Computer Vision News - Best of ICCV selection Casting human mesh recovery as a regression from an image to a distribution of 3D poses, and showing the benefits on downstream tasks.
	Reactive Navigation in Partially Familiar Planar Environments Using Semantic Perceptual Feedback Vasileios Vasilopoulos, Georgios Pavlakos, Karl Schmeckpeper, Kostas Daniilidis, Daniel E. Koditschek International Journal of Robotics Research, 2021 bibtex Reactive navigation in unexplored environments cluttered with familiar or unknown objects.
	Monocular Expressive Body Regression through Body-Driven Attention Vasileios Choutas, Georgios Pavlakos, Timo Bolkart, Dimitrios Tzionas, Michael J. Black European Conference on Computer Vision (ECCV), 2020 project page / supplementary / code / long video / short video / bibtex Regression-based expressive capture of 3D humans from a single RGB image.
	Coherent Reconstruction of Multiple Humans from a Single Image Wen Jiang, Nikos Kolotouros, Georgios Pavlakos, Xiaowei Zhou, Kostas Daniilidis Computer Vision and Pattern Recognition (CVPR), 2020 project page / supplementary / code / bibtex End-to-end reconstruction of multiple people using two novel geometric losses that encourage coherent 3D estimates.
	Reactive Semantic Planning in Unexplored Semantic Environments Using Deep Perceptual Feedback Vasileios Vasilopoulos, Georgios Pavlakos, Sean L. Bowman, J. Diego Caporale, Kostas Daniilidis, George J. Pappas, Daniel E. Koditschek IEEE Robotics and Automation Letters (RA-L), 2020 International Conference on Intelligent Robots and Systems (IROS), 2020 video / code / bibtex Reactive human following with semantic targets in unexplored environments cluttered with familiar or unknown objects.
	TexturePose: Supervising Human Mesh Estimation with Texture Consistency Georgios Pavlakos, Nikos Kolotouros, Kostas Daniilidis International Conference on Computer Vision (ICCV), 2019 project page / supplementary / code / bibtex Leveraging texture consistency to train networks for human mesh estimation.
	Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop Nikos Kolotouros, Georgios Pavlakos*, Michael J. Black, Kostas Daniilidis International Conference on Computer Vision (ICCV), 2019 project page / supplementary / code / bibtex Improving model-based human pose and shape regression with automatic in-the-loop fitting.
	Expressive Body Capture: 3D Hands, Face and Body from a Single Image Georgios Pavlakos, Vasileios Choutas, Nima Ghorbani, Timo Bolkart, Ahmed A. A. Osman, Dimitrios Tzionas, Michael J. Black Computer Vision and Pattern Recognition (CVPR), 2019 (Oral Presentation) project page / supplementary / video / code / bibtex Expressive capture of bodies, hands and faces from a single RGB image.
	Convolutional Mesh Regression for Single-Image Human Shape Reconstruction Nikos Kolotouros, Georgios Pavlakos, Kostas Daniilidis Computer Vision and Pattern Recognition (CVPR), 2019 (Oral Presentation) (Best paper finalist - Top 1%) project page / supplementary / video / code / bibtex Estimating 3D human pose and shape using Graph Convolutional Networks.
	Ordinal Depth Supervision for 3D Human Pose Estimation Georgios Pavlakos, Xiaowei Zhou, Kostas Daniilidis Computer Vision and Pattern Recognition (CVPR), 2018 (Oral Presentation) project page / supplementary / video / code / data / bibtex Incorporating ordinal depth supervision in the training of end-to-end ConvNets for 3D human pose estimation.
	Learning to Estimate 3D Human Pose and Shape from a Single Color Image Georgios Pavlakos, Luyang Zhu, Xiaowei Zhou, Kostas Daniilidis Computer Vision and Pattern Recognition (CVPR), 2018 Invited to 3D HUMANS, CVPR Workshop, 2018 (Best Poster Award) project page / supplementary / bibtex A direct, end-to-end approach for the reconstruction of 3D human pose and shape from single images.
	Human Motion Capture Using a Drone Xiaowei Zhou, Sikang Liu, Georgios Pavlakos, Vijay Kumar, Kostas Daniilidis International Conference on Robotics and Automation (ICRA), 2018 video / data / bibtex Estimating the 3D human pose from a monocular video captured by a drone.
	MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior Xiaowei Zhou, Menglong Zhu, Georgios Pavlakos, Spyridon Leonardos, Konstantinos G. Derpanis, Kostas Daniilidis Pattern Analysis and Machine Intelligence (PAMI), 2018 code / bibtex Estimating the 3D pose of a human from a monocular video using a ConvNet to localize 2D keypoints and an EM optimization scheme to recover 3D pose over time.
	Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis Computer Vision and Pattern Recognition (CVPR), 2017 (Spotlight Presentation) project page / supplementary / video / training code / demo code / bibtex End-to-end learning for 3D human pose using a volumetric representation and casting the problem as 3D keypoint localization in a discretized 3D space.
	Harvesting Multiple Views for Marker-less 3D Human Pose Annotations Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis Computer Vision and Pattern Recognition (CVPR), 2017 (Spotlight Presentation) project page / supplementary / video / code / bibtex Estimating 3D human pose from multiple views and leveraging the estimates as automatic 3D annotations for human pose estimation tasks.
	6-DoF Object Pose from Semantic Keypoints Georgios Pavlakos, Xiaowei Zhou, Aaron Chan, Konstantinos G. Derpanis, Kostas Daniilidis International Conference on Robotics and Automation (ICRA), 2017 project page / journal version / code / video / bibtex Estimating the 6-DoF pose of an object from a single image using semantic keypoints and a deformable shape model.
	Reconstruction of 3D Pose for Surfaces of Revolution from Range Data Georgios Pavlakos, Kostas Daniilidis International Conference on 3D Vision (3DV), 2015 bibtex
	On Shape Recognition and Language Petros Maragos, Vassilis Pitsikalis, Athanasios Katsamanis, Georgios Pavlakos, Stavros Theodorakis Perspectives in Shape Analysis, 2016, edited by M. Breuss, A. Bruckstein, P. Maragos and S. Wuhrer bibtex
	Kinect-based multimodal gesture recognition using a two-pass fusion scheme Georgios Pavlakos, Stavros Theodorakis, Vassilis Pitsikalis, Athanasios Katsamanis, Petros Maragos International Conference on Image Processing (ICIP), 2014 bibtex
	Towards an intelligent robotic walker for assisted living using multimodal sensorial data Georgia Chalvatzaki, Georgios Pavlakos, Kevis Maninis, Xanthi S. Papageorgiou, Vassilis Pitsikalis, Costas S. Tzafestas, Petros Maragos International Conference on Mobile Communication and Healthcare (Mobihealth), 2014 bibtex
	Advances in intelligent mobility assistance robot integrating multimodal sensory processing Xanthi S. Papageorgiou, Costas S. Tzafestas, Petros Maragos, Georgios Pavlakos, Georgia Chalvatzaki, George Moustris, Iasonas Kokkinos, Angelika Peer, Bartlomiej Stanczyk, Evita-Stavroula Fotinea, Eleni Efthimiou International Conference on Universal Access in Human-Computer Interaction (HCII), 2014 bibtex
