Yang et al., 2021 - Google Patents
Viser: Video-specific surface embeddings for articulated 3d shape reconstructionYang et al., 2021
View PDF- Document ID
- 2525108969224127806
- Author
- Yang G
- Sun D
- Jampani V
- Vlasic D
- Cole F
- Liu C
- Ramanan D
- Publication year
- Publication venue
- Advances in Neural Information Processing Systems
External Links
Snippet
We introduce ViSER, a method for recovering articulated 3D shapes and dense3D trajectories from monocular videos. Previous work on high-quality reconstruction of dynamic 3D shapes typically relies on multiple camera views, strong category-specific priors, or 2D …
- 230000003287 optical 0 abstract description 19
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30004—Biomedical image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
- G06T15/205—Image-based rendering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G06T2200/08—Indexing scheme for image data processing or generation, in general involving all processing steps from image acquisition to 3D model generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/0068—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image for image registration, e.g. elastic snapping
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yang et al. | Viser: Video-specific surface embeddings for articulated 3d shape reconstruction | |
Weng et al. | Humannerf: Free-viewpoint rendering of moving people from monocular video | |
Yu et al. | Function4d: Real-time human volumetric capture from very sparse consumer rgbd sensors | |
Su et al. | Robustfusion: Human volumetric capture with data-driven visual cues using a rgbd camera | |
Gao et al. | Monocular dynamic view synthesis: A reality check | |
Huang et al. | Towards accurate marker-less human shape and pose estimation over time | |
De Aguiar et al. | Performance capture from sparse multi-view video | |
US8384714B2 (en) | Systems, methods and devices for motion capture using video imaging | |
Chen et al. | Alignsdf: Pose-aligned signed distance fields for hand-object reconstruction | |
Tretschk et al. | Demea: Deep mesh autoencoders for non-rigidly deforming objects | |
Yao et al. | Lassie: Learning articulated shapes from sparse image ensemble via 3d part discovery | |
Jiang et al. | Neuralhofusion: Neural volumetric rendering under human-object interactions | |
Xu et al. | Animating animal motion from still | |
Zhang et al. | NeuralDome: A neural modeling pipeline on multi-view human-object interactions | |
He et al. | Challencap: Monocular 3d capture of challenging human performances using multi-modal references | |
Wu et al. | Casa: Category-agnostic skeletal animal reconstruction | |
Yao et al. | Hi-lassie: High-fidelity articulated shape and skeleton discovery from sparse image ensemble | |
Jiang et al. | Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream | |
Kwon et al. | Rotationally-temporally consistent novel view synthesis of human performance video | |
Venkat et al. | HumanMeshNet: Polygonal mesh recovery of humans | |
Jung et al. | Deformable 3d gaussian splatting for animatable human avatars | |
Habtegebrial et al. | Fast view synthesis with deep stereo vision | |
Theobalt et al. | Performance capture from multi-view video | |
Luo et al. | Sparse RGB-D images create a real thing: A flexible voxel based 3D reconstruction pipeline for single object | |
Zhang et al. | BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors |