[go: nahoru, domu]

Yang et al., 2021 - Google Patents

Viser: Video-specific surface embeddings for articulated 3d shape reconstruction

Yang et al., 2021

View PDF
Document ID
2525108969224127806
Author
Yang G
Sun D
Jampani V
Vlasic D
Cole F
Liu C
Ramanan D
Publication year
Publication venue
Advances in Neural Information Processing Systems

External Links

Snippet

We introduce ViSER, a method for recovering articulated 3D shapes and dense3D trajectories from monocular videos. Previous work on high-quality reconstruction of dynamic 3D shapes typically relies on multiple camera views, strong category-specific priors, or 2D …
Continue reading at proceedings.neurips.cc (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20112Image segmentation details
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30004Biomedical image processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/003D [Three Dimensional] image rendering
    • G06T15/10Geometric effects
    • G06T15/20Perspective computation
    • G06T15/205Image-based rendering
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2200/00Indexing scheme for image data processing or generation, in general
    • G06T2200/08Indexing scheme for image data processing or generation, in general involving all processing steps from image acquisition to 3D model generation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
    • G06T3/0068Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image for image registration, e.g. elastic snapping

Similar Documents

Publication Publication Date Title
Yang et al. Viser: Video-specific surface embeddings for articulated 3d shape reconstruction
Weng et al. Humannerf: Free-viewpoint rendering of moving people from monocular video
Yu et al. Function4d: Real-time human volumetric capture from very sparse consumer rgbd sensors
Su et al. Robustfusion: Human volumetric capture with data-driven visual cues using a rgbd camera
Gao et al. Monocular dynamic view synthesis: A reality check
Huang et al. Towards accurate marker-less human shape and pose estimation over time
De Aguiar et al. Performance capture from sparse multi-view video
US8384714B2 (en) Systems, methods and devices for motion capture using video imaging
Chen et al. Alignsdf: Pose-aligned signed distance fields for hand-object reconstruction
Tretschk et al. Demea: Deep mesh autoencoders for non-rigidly deforming objects
Yao et al. Lassie: Learning articulated shapes from sparse image ensemble via 3d part discovery
Jiang et al. Neuralhofusion: Neural volumetric rendering under human-object interactions
Xu et al. Animating animal motion from still
Zhang et al. NeuralDome: A neural modeling pipeline on multi-view human-object interactions
He et al. Challencap: Monocular 3d capture of challenging human performances using multi-modal references
Wu et al. Casa: Category-agnostic skeletal animal reconstruction
Yao et al. Hi-lassie: High-fidelity articulated shape and skeleton discovery from sparse image ensemble
Jiang et al. Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream
Kwon et al. Rotationally-temporally consistent novel view synthesis of human performance video
Venkat et al. HumanMeshNet: Polygonal mesh recovery of humans
Jung et al. Deformable 3d gaussian splatting for animatable human avatars
Habtegebrial et al. Fast view synthesis with deep stereo vision
Theobalt et al. Performance capture from multi-view video
Luo et al. Sparse RGB-D images create a real thing: A flexible voxel based 3D reconstruction pipeline for single object
Zhang et al. BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors