- research-article, November 2024
MVImgNet2.0: A Larger-scale Dataset of Multi-view Images
- Yushuang Wu,
- Luyue Shi,
- Haolin Liu,
- Hongjie Liao,
- Lingteng Qiu,
- Weihao Yuan,
- Xiaodong Gu,
- Zilong Dong,
- Shuguang Cui,
- Xiaoguang Han
ACM Transactions on Graphics (TOG), Volume 43, Issue 6, Article No.: 173, Pages 1–16
https://doi.org/10.1145/3687973
MVImgNet is a large-scale dataset that contains multi-view images of ~220k real-world objects in 238 classes. As a counterpart of ImageNet, it introduces 3D visual signals via multi-view shooting, making a soft bridge between 2D and 3D vision. This paper ...
- research-article, November 2024
StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal
- Chongjie Ye,
- Lingteng Qiu,
- Xiaodong Gu,
- Qi Zuo,
- Yushuang Wu,
- Zilong Dong,
- Liefeng Bo,
- Yuliang Xiu,
- Xiaoguang Han
ACM Transactions on Graphics (TOG), Volume 43, Issue 6, Article No.: 250, Pages 1–18
https://doi.org/10.1145/3687971
This work addresses the challenge of high-quality surface normal estimation from monocular colored inputs (i.e., images and videos), a field which has recently been revolutionized by repurposing diffusion priors. However, previous attempts still struggle ...
- Article, November 2024
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Abstract: In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in current human generative techniques. The ...
- Article, November 2024
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
Abstract: This paper enables high-fidelity, transferable NeRF editing by frequency decomposition. Recent NeRF editing pipelines lift 2D stylization results to 3D scenes while suffering from blurry results, and fail to capture detailed structures caused by ...
- Article, November 2024
High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding
Abstract: 3D vision is inherently characterized by sparse spatial structures, which propels the necessity for an efficient paradigm tailored to 3D generation. Another discrepancy is the amount of training data, which undeniably affects generalization if we ...
- Article, October 2024
An Optimization Framework to Enforce Multi-view Consistency for Texturing 3D Meshes
Abstract: A fundamental problem in the texturing of 3D meshes using pre-trained text-to-image models is to ensure multi-view consistency. State-of-the-art approaches typically use diffusion models to aggregate multi-view inputs, where common issues are the ...
- research-article, June 2024
Learning Spherical Radiance Field for Efficient 360° Unbounded Novel View Synthesis
IEEE Transactions on Image Processing (TIP), Volume 33, Pages 3722–3734
https://doi.org/10.1109/TIP.2024.3409052
Novel view synthesis aims at rendering any posed images from sparse observations of the scene. Recently, neural radiance fields (NeRF) have demonstrated their effectiveness in synthesizing novel views of a bounded scene. However, most existing methods ...
- research-article, October 2023
Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing
MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 4606–4615
https://doi.org/10.1145/3581783.3611857
Recently, Neural Radiance Fields (NeRF) has exhibited significant success in novel view synthesis, surface reconstruction, etc. However, since no physical reflection is considered in its rendering pipeline, NeRF mistakes the reflection in the mirror as a ...
- research-article, September 2023
Guiding image inpainting via structure and texture features with dual encoder
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 40, Issue 6, Pages 4303–4317
https://doi.org/10.1007/s00371-023-03083-7
Abstract: Image inpainting techniques have made rapid progress in recent years. Recent advancements focus mainly on generating realistic and semantically plausible structure and texture features in missing regions. However, current popular inpainting ...
- research-article, September 2021
Single-Shot is Enough: Panoramic Infrastructure Based Calibration of Multiple Cameras and 3D LiDARs
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Pages 8890–8897
https://doi.org/10.1109/IROS51168.2021.9636767
The integration of multiple cameras and 3D LiDARs has become a basic configuration of augmented reality devices, robotics, and autonomous vehicles. The calibration of multi-modal sensors is crucial for a system to properly function, but it remains tedious ...
- research-article, December 2016
Efficient Non-Consecutive Feature Tracking for Robust Structure-From-Motion
IEEE Transactions on Image Processing (TIP), Volume 25, Issue 12, Pages 5957–5970
https://doi.org/10.1109/TIP.2016.2607425
Structure-from-motion (SfM) largely relies on feature tracking. In image sequences, if disjointed tracks caused by objects moving in and out of the field of view, occasional occlusion, or image noise are not handled well, corresponding SfM could be ...
- article, January 2014
Efficient keyframe-based real-time camera tracking
Computer Vision and Image Understanding (CVIU), Volume 118, Pages 97–110
https://doi.org/10.1016/j.cviu.2013.08.005
We present a novel keyframe-based global localization method for markerless real-time camera tracking. Our system contains an offline module to select features from a group of reference images and an online module to match them to the input live video ...
- chapter, January 2012
Depth-Varying human video sprite synthesis
Transactions on Edutainment VII, January 2012, Pages 34–47
Video texture is an appealing method to extract and replay natural human motion from video shots. There has been much research on video texture analysis, generation and interactive control. However, the video sprites created by existing methods are ...
- research-article, December 2011
Interactive weathering of depth-inferred videos
VRCAI '11: Proceedings of the 10th International Conference on Virtual Reality Continuum and Its Applications in Industry, Pages 117–124
https://doi.org/10.1145/2087756.2087771
Aging or weathering is an important technique for generating natural images in computer graphics. In this paper, we propose a novel video weathering method which can synthesize the weathering effects for the real captured videos. We first recover the ...
- Article, September 2010
Efficient non-consecutive feature tracking for structure-from-motion
Structure-from-motion (SfM) is an important computer vision problem and largely relies on the quality of feature tracking. In image sequences, if disjointed tracks caused by objects moving in and out of the view, occasional occlusion, or image noise, ...
- article, June 2010
Adaptive voxels: interactive rendering of massive 3D models
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 26, Issue 6-8, Pages 409–419
https://doi.org/10.1007/s00371-010-0465-7
We present a novel approach for interactive rendering of massive 3D models. Our approach integrates adaptive sampling-based simplification, visibility culling, out-of-core data management and level-of-detail. We use a unified scene graph representation ...
- research-article, September 2009
Refilming with Depth-Inferred Videos
IEEE Transactions on Visualization and Computer Graphics (ITVC), Volume 15, Issue 5, Pages 828–840
https://doi.org/10.1109/TVCG.2009.47
Compared to still image editing, content-based video editing faces the additional challenges of maintaining the spatiotemporal consistency with respect to geometry. This brings up difficulties of seamlessly modifying video content, for instance, ...
- article, April 2006
Synthesizing trees by plantons
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 22, Issue 4, Pages 238–248
https://doi.org/10.1007/s00371-006-0002-x
In this paper, we present a two-level statistical model for characterizing the stochastic and specific nature of trees. At the low level, we define plantons, which are a group of similar organs, to depict tree organ details statistically. At the high ...