Georgios Pavlakos
pavlakos@cs.utexas.edu

I am an Assistant Professor in the Department of Computer Science at the University of Texas at Austin. Before that, I was a Postdoctoral Researcher at UC Berkeley, advised by Angjoo Kanazawa and Jitendra Malik. I completed my PhD in Computer Science at the University of Pennsylvania with my advisor, Kostas Daniilidis. I did my undergraduate studies at the National Technical University of Athens, where I worked with Petros Maragos. During my PhD, I spent time at the Max Planck Institute in Tübingen, working with Michael Black.

Email  /  CV  /  Google Scholar  /  GitHub

georgios_pavlakos_res.jpg
News
Publications
Reconstructing Hands in 3D with Transformers
Georgios Pavlakos, Dandan Shan, Ilija Radosavovic, Angjoo Kanazawa, David Fouhey, Jitendra Malik
Computer Vision and Pattern Recognition (CVPR), 2024  
project page / code / demo / bibtex

Scaling up data and models for hand mesh recovery from images and video.

Generative Proxemics: A Prior for 3D Social Interaction from Images
Lea Müller, Vickie Ye, Georgios Pavlakos, Michael J. Black, Angjoo Kanazawa
Computer Vision and Pattern Recognition (CVPR), 2024  
project page / code / demo / bibtex

A 3D generative model of two people in close social interaction.

GART: Gaussian Articulated Template Models
Jiahui Lei, Yufu Wang, Georgios Pavlakos, Lingjie Liu, Kostas Daniilidis
Computer Vision and Pattern Recognition (CVPR), 2024  
project page / code / bibtex

3D Gaussian Splatting for non-rigid articulated subject capturing and rendering from monocular videos.

Humans in 4D: Reconstructing and Tracking Humans with Transformers
Shubham Goel, Georgios Pavlakos, Jathushan Rajasegaran, Angjoo Kanazawa*, Jitendra Malik*
International Conference on Computer Vision (ICCV), 2023  
project page / code / demo / bibtex

A fully "transformerized" deisgn for Human Mesh Recovery achieves improved precision and remarkable robustness for 3D human reconstruction and tracking!

Decoupling Human and Camera Motion from Videos in the Wild
Vickie Ye, Georgios Pavlakos, Jitendra Malik, Angjoo Kanazawa
Computer Vision and Pattern Recognition (CVPR), 2023  
project page / code / bibtex

Reasoning about the global motion of humans in the world by decoupling human and camera motion.

On the Benefits of 3D Pose and Tracking for Human Action Recognition
Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Christoph Feichtenhofer, Jitendra Malik
Computer Vision and Pattern Recognition (CVPR), 2023
project page / code / bibtex

Using 3D human reconstruction and tracking to recognize atomic actions in video.



Learning Articulated Shape with Keypoint Pseudo-labels from Web Images
Anastasis Stathopoulos, Georgios Pavlakos, Ligong Han, Dimitris Metaxas
Computer Vision and Pattern Recognition (CVPR), 2023
project page / code / bibtex

Training models for 3D animal recovery with minimal annotations using large-scale collections of web images.



The One Where They Reconstructed 3D Humans and Environments in TV Shows
Georgios Pavlakos*, Ethan Weber*, Matthew Tancik, Angjoo Kanazawa
European Conference on Computer Vision (ECCV), 2022
project page / code & data / bibtex

Recovering 3D humans, cameras and static structure in TV shows.

Tracking People by Predicting 3D Appearance, Location & Pose
Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Jitendra Malik
Computer Vision and Pattern Recognition (CVPR), 2022   (Oral Presentation)
(Best paper finalist - Top 0.4%)
project page / code / video / bibtex

Predicting pose, appearance and location of people in 3D for monocular tracking.



Human Mesh Recovery from Multiple Shots
Georgios Pavlakos, Jitendra Malik, Angjoo Kanazawa
Computer Vision and Pattern Recognition (CVPR), 2022  
project page / code & data / video / bibtex

Using information from multiple shots to improve reconstruction of humans in edited media.



Tracking People with 3D Representations
Jathushan Rajasegaran, Georgios Pavlakos, Angjoo Kanazawa, Jitendra Malik
Neural Information Processing Systems (NeurIPS), 2021  
project page / code / video / bibtex

Performing monocular tracking of people by lifting them to 3D and then using 3D representations of their appearance, pose and location.



Probabilistic Modeling for Human Mesh Recovery
Nikos Kolotouros, Georgios Pavlakos, Dinesh Jayaraman, Kostas Daniilidis
International Conference on Computer Vision (ICCV), 2021  
project page / supplementary / code / bibtex
Interview at Computer Vision News - Best of ICCV selection

Casting human mesh recovery as a regression from an image to a distribution of 3D poses, and showing the benefits on downstream tasks.



Reactive Navigation in Partially Familiar Planar Environments Using Semantic Perceptual Feedback
Vasileios Vasilopoulos, Georgios Pavlakos, Karl Schmeckpeper, Kostas Daniilidis, Daniel E. Koditschek
International Journal of Robotics Research, 2021  
bibtex

Reactive navigation in unexplored environments cluttered with familiar or unknown objects.



Monocular Expressive Body Regression through Body-Driven Attention
Vasileios Choutas, Georgios Pavlakos, Timo Bolkart, Dimitrios Tzionas, Michael J. Black
European Conference on Computer Vision (ECCV), 2020  
project page / supplementary / code / long video / short video / bibtex

Regression-based expressive capture of 3D humans from a single RGB image.



Coherent Reconstruction of Multiple Humans from a Single Image
Wen Jiang*, Nikos Kolotouros*, Georgios Pavlakos, Xiaowei Zhou, Kostas Daniilidis
Computer Vision and Pattern Recognition (CVPR), 2020  
project page / supplementary / code / bibtex

End-to-end reconstruction of multiple people using two novel geometric losses that encourage coherent 3D estimates.



Reactive Semantic Planning in Unexplored Semantic Environments Using Deep Perceptual Feedback
Vasileios Vasilopoulos, Georgios Pavlakos, Sean L. Bowman, J. Diego Caporale, Kostas Daniilidis, George J. Pappas, Daniel E. Koditschek
IEEE Robotics and Automation Letters (RA-L), 2020  
International Conference on Intelligent Robots and Systems (IROS), 2020  
video / code / bibtex

Reactive human following with semantic targets in unexplored environments cluttered with familiar or unknown objects.



TexturePose: Supervising Human Mesh Estimation with Texture Consistency
Georgios Pavlakos*, Nikos Kolotouros*, Kostas Daniilidis
International Conference on Computer Vision (ICCV), 2019  
project page / supplementary / code / bibtex

Leveraging texture consistency to train networks for human mesh estimation.



Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop
Nikos Kolotouros*, Georgios Pavlakos*, Michael J. Black, Kostas Daniilidis
International Conference on Computer Vision (ICCV), 2019
project page / supplementary / code / bibtex

Improving model-based human pose and shape regression with automatic in-the-loop fitting



Expressive Body Capture: 3D Hands, Face and Body from a Single Image
Georgios Pavlakos*, Vasileios Choutas*, Nima Ghorbani, Timo Bolkart, Ahmed A. A. Osman, Dimitrios Tzionas, Michael J. Black
Computer Vision and Pattern Recognition (CVPR), 2019   (Oral Presentation)
project page / supplementary / video / code / bibtex

Expressive capture of bodies, hands and faces from a single RGB image.

Convolutional Mesh Regression for Single-Image Human Shape Reconstruction
Nikos Kolotouros, Georgios Pavlakos, Kostas Daniilidis
Computer Vision and Pattern Recognition (CVPR), 2019   (Oral Presentation)
(Best paper finalist - Top 1%)
project page / supplementary / video / code / bibtex

Estimating 3D human pose and shape using Graph Convolutional Networks.



Ordinal Depth Supervision for 3D Human Pose Estimation
Georgios Pavlakos, Xiaowei Zhou, Kostas Daniilidis
Computer Vision and Pattern Recognition (CVPR), 2018   (Oral Presentation)
project page / supplementary / video / code / data / bibtex

Incorporating ordinal depth supervision in the training of end-to-end ConvNets for 3D human pose estimation.

Learning to Estimate 3D Human Pose and Shape from a Single Color Image
Georgios Pavlakos, Luyang Zhu, Xiaowei Zhou, Kostas Daniilidis
Computer Vision and Pattern Recognition (CVPR), 2018  
Invited to 3D HUMANS, CVPR Workshop, 2018   (Best Poster Award)
project page / supplementary / bibtex

A direct, end-to-end approach for the reconstruction of 3D human pose and shape from single images.

Human Motion Capture Using a Drone
Xiaowei Zhou, Sikang Liu, Georgios Pavlakos, Vijay Kumar, Kostas Daniilidis
International Conference on Robotics and Automation (ICRA), 2018
video / data / bibtex

Estimating the 3D human pose from a monocular video captured by a drone.

MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior
Xiaowei Zhou, Menglong Zhu, Georgios Pavlakos, Spyridon Leonardos, Konstantinos G. Derpanis, Kostas Daniilidis
Pattern Analysis and Machine Intelligence (PAMI), 2018
code / bibtex

Estimating the 3D pose of a human from a monocular video using a ConvNet to localize 2D keypoints and an EM optimization scheme to recover 3D pose over time.

Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose
Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
Computer Vision and Pattern Recognition (CVPR), 2017   (Spotlight Presentation)
project page / supplementary / video / training code / demo code / bibtex

End-to-end learning for 3D human pose using a volumetric representation and casting the problem as 3D keypoint localization in a discretized 3D space.

Harvesting Multiple Views for Marker-less 3D Human Pose Annotations
Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
Computer Vision and Pattern Recognition (CVPR), 2017   (Spotlight Presentation)
project page / supplementary / video / code / bibtex

Estimating 3D human pose from multiple views and leveraging the estimates as automatic 3D annotations for human pose estimation tasks.

6-DoF Object Pose from Semantic Keypoints
Georgios Pavlakos, Xiaowei Zhou, Aaron Chan, Konstantinos G. Derpanis, Kostas Daniilidis
International Conference on Robotics and Automation (ICRA), 2017
project page / journal version / code / video / bibtex

Estimating the 6-DoF pose of an object from a single image using semantic keypoints and a deformable shape model.

Reconstruction of 3D Pose for Surfaces of Revolution from Range Data
Georgios Pavlakos, Kostas Daniilidis
International Conference on 3D Vision (3DV), 2015
bibtex

On Shape Recognition and Language
Petros Maragos, Vassilis Pitsikalis, Athanasios Katsamanis, Georgios Pavlakos, Stavros Theodorakis
Perspectives in Shape Analysis, 2016, edited by M. Breuss, A. Bruckstein, P. Maragos and S. Wuhrer
bibtex

Kinect-based multimodal gesture recognition using a two-pass fusion scheme
Georgios Pavlakos, Stavros Theodorakis, Vassilis Pitsikalis, Athanasios Katsamanis, Petros Maragos
International Conference on Image Processing (ICIP), 2014
bibtex

Towards an intelligent robotic walker for assisted living using multimodal sensorial data
Georgia Chalvatzaki, Georgios Pavlakos, Kevis Maninis, Xanthi S. Papageorgiou, Vassilis Pitsikalis, Costas S. Tzafestas, Petros Maragos
International Conference on Mobile Communication and Healthcare (Mobihealth), 2014
bibtex

Advances in intelligent mobility assistance robot integrating multimodal sensory processing
Xanthi S. Papageorgiou, Costas S. Tzafestas, Petros Maragos, Georgios Pavlakos, Georgia Chalvatzaki, George Moustris, Iasonas Kokkinos, Angelika Peer, Bartlomiej Stanczyk, Evita-Stavroula Fotinea, Eleni Efthimiou
International Conference on Universal Access in Human-Computer Interaction (HCII), 2014
bibtex


This guy has an awesome website