Yang et al., 2021 - Google Patents

Viser: Video-specific surface embeddings for articulated 3d shape reconstruction

Yang et al., 2021

Document ID: 2525108969224127806
Author: Yang G; Sun D; Jampani V; Vlasic D; Cole F; Liu C; Ramanan D
Publication year: 2021
Publication venue: Advances in Neural Information Processing Systems

External Links

Cited by

Snippet

We introduce ViSER, a method for recovering articulated 3D shapes and dense3D trajectories from monocular videos. Previous work on high-quality reconstruction of dynamic 3D shapes typically relies on multiple camera views, strong category-specific priors, or 2D …

Continue reading at proceedings.neurips.cc (PDF) (other versions)

230000003287 optical 0 abstract description 19

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30004—Biomedical image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
- G06T15/205—Image-based rendering
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G06T2200/08—Indexing scheme for image data processing or generation, in general involving all processing steps from image acquisition to 3D model generation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/0068—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image for image registration, e.g. elastic snapping

Similar Documents

Publication	Publication Date	Title
Yang et al.	2021	Viser: Video-specific surface embeddings for articulated 3d shape reconstruction
Weng et al.	2022	Humannerf: Free-viewpoint rendering of moving people from monocular video
Yu et al.	2021	Function4d: Real-time human volumetric capture from very sparse consumer rgbd sensors
Su et al.	2020	Robustfusion: Human volumetric capture with data-driven visual cues using a rgbd camera
Gao et al.	2022	Monocular dynamic view synthesis: A reality check
Huang et al.	2017	Towards accurate marker-less human shape and pose estimation over time
De Aguiar et al.	2008	Performance capture from sparse multi-view video
US8384714B2 (en)	2013-02-26	Systems, methods and devices for motion capture using video imaging
Chen et al.	2022	Alignsdf: Pose-aligned signed distance fields for hand-object reconstruction
Tretschk et al.	2020	Demea: Deep mesh autoencoders for non-rigidly deforming objects
Yao et al.	2022	Lassie: Learning articulated shapes from sparse image ensemble via 3d part discovery
Jiang et al.	2022	Neuralhofusion: Neural volumetric rendering under human-object interactions
Xu et al.	2008	Animating animal motion from still
Zhang et al.	2023	NeuralDome: A neural modeling pipeline on multi-view human-object interactions
He et al.	2021	Challencap: Monocular 3d capture of challenging human performances using multi-modal references
Wu et al.	2022	Casa: Category-agnostic skeletal animal reconstruction
Yao et al.	2023	Hi-lassie: High-fidelity articulated shape and skeleton discovery from sparse image ensemble
Jiang et al.	2023	Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream
Kwon et al.	2020	Rotationally-temporally consistent novel view synthesis of human performance video
Venkat et al.	2019	HumanMeshNet: Polygonal mesh recovery of humans
Jung et al.	2023	Deformable 3d gaussian splatting for animatable human avatars
Habtegebrial et al.	2018	Fast view synthesis with deep stereo vision
Theobalt et al.	2010	Performance capture from multi-view video
Luo et al.	2023	Sparse RGB-D images create a real thing: A flexible voxel based 3D reconstruction pipeline for single object
Zhang et al.	2024	BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors