Author: Dong, Zilong : Search

18 Results for: Author: Dong, Zilong

Searched The ACM Guide to Computing Literature (3,834,857 records)

Showing 1 - 18 of 18 Results

research-article
November 2024
MVImgNet2.0: A Larger-scale Dataset of Multi-view Images
ACM Transactions on Graphics (TOG), Volume 43, Issue 6, Article No.: 173, Pages 1–16, https://doi.org/10.1145/3687973

MVImgNet is a large-scale dataset that contains multi-view images of ~220k real-world objects in 238 classes. As a counterpart of ImageNet, it introduces 3D visual signals via multi-view shooting, making a soft bridge between 2D and 3D vision. This paper ...
Total Citations: 0, Total Downloads: 93, Last 12 Months: 93, Last 6 weeks: 45
research-article
November 2024
StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal
ACM Transactions on Graphics (TOG), Volume 43, Issue 6, Article No.: 250, Pages 1–18, https://doi.org/10.1145/3687971

This work addresses the challenge of high-quality surface normal estimation from monocular colored inputs (i.e., images and videos), a field which has recently been revolutionized by repurposing diffusion priors. However, previous attempts still struggle ...
Total Citations: 0, Total Downloads: 88, Last 12 Months: 88, Last 6 weeks: 72
Article
November 2024
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Shenhao Zhu, Junming Leo Chen, Zuozhuo Dai, Zilong Dong, Yinghui Xu, Xun Cao, Yao Yao, Hao Zhu, Siyu Zhu
Computer Vision – ECCV 2024, Pages 145–162, https://doi.org/10.1007/978-3-031-73001-6_9
Abstract
In this study, we introduce a methodology for human image animation by leveraging a 3D human parametric model within a latent diffusion framework to enhance shape alignment and motion guidance in current human generative techniques. The ...
Total Citations: 0
Article
November 2024
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
Computer Vision – ECCV 2024, Pages 73–91, https://doi.org/10.1007/978-3-031-72940-9_5
Abstract
This paper enables high-fidelity, transferable NeRF editing by frequency decomposition. Recent NeRF editing pipelines lift 2D stylization results to 3D scenes while suffering from blurry results, and fail to capture detailed structures caused by ...
Total Citations: 0
Article
November 2024
High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding
Computer Vision – ECCV 2024, Pages 52–69, https://doi.org/10.1007/978-3-031-72684-2_4
Abstract
3D vision is inherently characterized by sparse spatial structures, which propels the necessity for an efficient paradigm tailored to 3D generation. Another discrepancy is the amount of training data, which undeniably affects generalization if we ...
Total Citations: 0
Article
October 2024
An Optimization Framework to Enforce Multi-view Consistency for Texturing 3D Meshes
Computer Vision – ECCV 2024, Pages 145–162, https://doi.org/10.1007/978-3-031-72764-1_9
Abstract
A fundamental problem in the texturing of 3D meshes using pre-trained text-to-image models is to ensure multi-view consistency. State-of-the-art approaches typically use diffusion models to aggregate multi-view inputs, where common issues are the ...
Total Citations: 0
research-article
June 2024
Learning Spherical Radiance Field for Efficient 360° Unbounded Novel View Synthesis
IEEE Transactions on Image Processing (TIP), Volume 33, Pages 3722–3734, https://doi.org/10.1109/TIP.2024.3409052
Novel view synthesis aims at rendering any posed images from sparse observations of the scene. Recently, neural radiance fields (NeRF) have demonstrated their effectiveness in synthesizing novel views of a bounded scene. However, most existing methods ...
Total Citations: 1
research-article
October 2023
Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing
MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 4606–4615, https://doi.org/10.1145/3581783.3611857

Recently, Neural Radiance Fields (NeRF) has exhibited significant success in novel view synthesis, surface reconstruction, etc. However, since no physical reflection is considered in its rendering pipeline, NeRF mistakes the reflection in the mirror as a ...
Total Citations: 11, Total Downloads: 224, Last 12 Months: 139, Last 6 weeks: 9
research-article
September 2023
Guiding image inpainting via structure and texture features with dual encoder
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 40, Issue 6, Pages 4303–4317, https://doi.org/10.1007/s00371-023-03083-7
Abstract
Image inpainting techniques have made rapid progress in recent years. Recent advancements focus mainly on generating realistic and semantically plausible structure and texture features in missing regions. However, current popular inpainting ...
Total Citations: 0
research-article
September 2021
Single-Shot is Enough: Panoramic Infrastructure Based Calibration of Multiple Cameras and 3D LiDARs
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Pages 8890–8897, https://doi.org/10.1109/IROS51168.2021.9636767
The integration of multiple cameras and 3D LiDARs has become basic configuration of augmented reality devices, robotics, and autonomous vehicles. The calibration of multi-modal sensors is crucial for a system to properly function, but it remains tedious ...
Total Citations: 1
research-article
December 2016
Efficient Non-Consecutive Feature Tracking for Robust Structure-From-Motion
IEEE Transactions on Image Processing (TIP), Volume 25, Issue 12, Pages 5957–5970, https://doi.org/10.1109/TIP.2016.2607425

Structure-from-motion (SfM) largely relies on feature tracking. In image sequences, if disjointed tracks caused by objects moving in and out of the field of view, occasional occlusion, or image noise are not handled well, corresponding SfM could be ...
Total Citations: 15
article
January 2014
Efficient keyframe-based real-time camera tracking
Computer Vision and Image Understanding (CVIU), Volume 118, Pages 97–110, https://doi.org/10.1016/j.cviu.2013.08.005

We present a novel keyframe-based global localization method for markerless real-time camera tracking. Our system contains an offline module to select features from a group of reference images and an online module to match them to the input live video ...
Total Citations: 2
chapter
January 2012
Depth-Varying human video sprite synthesis
Transactions on Edutainment VII, January 2012, Pages 34–47

Video texture is an appealing method to extract and replay natural human motion from video shots. There have been much research on video texture analysis, generation and interactive control. However, the video sprites created by existing methods are ...
Total Citations: 0
research-article
December 2011
Interactive weathering of depth-inferred videos
VRCAI '11: Proceedings of the 10th International Conference on Virtual Reality Continuum and Its Applications in Industry, Pages 117–124, https://doi.org/10.1145/2087756.2087771

Aging or weathering is an important technique for generating natural images in computer graphics. In this paper, we propose a novel video weathering method which can synthesize the weathering effects for the real captured videos. We first recover the ...
Total Citations: 1, Total Downloads: 170, Last 12 Months: 2, Last 6 weeks: 1
Article
September 2010
Efficient non-consecutive feature tracking for structure-from-motion
ECCV'10: Proceedings of the 11th European conference on Computer vision: Part V, Pages 422–435

Structure-from-motion (SfM) is an important computer vision problem and largely relies on the quality of feature tracking. In image sequences, if disjointed tracks caused by objects moving in and out of the view, occasional occlusion, or image noise, ...
Total Citations: 6
article
June 2010
Adaptive voxels: interactive rendering of massive 3D models
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 26, Issue 6-8, Pages 409–419, https://doi.org/10.1007/s00371-010-0465-7

We present a novel approach for interactive rendering of massive 3D models. Our approach integrates adaptive sampling-based simplification, visibility culling, out-of-core data management and level-of-detail. We use a unified scene graph representation ...
Total Citations: 1
research-article
September 2009
Refilming with Depth-Inferred Videos
IEEE Transactions on Visualization and Computer Graphics (ITVC), Volume 15, Issue 5, Pages 828–840, https://doi.org/10.1109/TVCG.2009.47

Compared to still image editing, content-based video editing faces the additional challenges of maintaining the spatiotemporal consistency with respect to geometry. This brings up difficulties of seamlessly modifying video content, for instance, ...
Total Citations: 7
article
April 2006
Synthesizing trees by plantons
The Visual Computer: International Journal of Computer Graphics (VISC), Volume 22, Issue 4, Pages 238–248, https://doi.org/10.1007/s00371-006-0002-x

In this paper, we present a two-level statistical model for characterizing the stochastic and specific nature of trees. At the low level, we define plantons, which are a group of similar organs, to depict tree organ details statistically. At the high ...
Total Citations: 1
