Search Results for author: Zilong Dong

Found 17 papers, 4 papers with code

Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition

no code implementations • 3 Apr 2024 • Yisheng He, Weihao Yuan, Siyu Zhu, Zilong Dong, Liefeng Bo, QiXing Huang

This paper enables high-fidelity, transferable NeRF editing by frequency decomposition.

Paper
Add Code

An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-to-Image Models

no code implementations • 22 Mar 2024 • Zhengyi Zhao, Chen Song, Xiaodong Gu, Yuan Dong, Qi Zuo, Weihao Yuan, Zilong Dong, Liefeng Bo, QiXing Huang

In particular, the third and fourth stages are iterated, with the cuts obtained in the fourth stage encouraging non-rigid alignment in the third stage to focus on regions close to the cuts.

Paper
Add Code

OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation

no code implementations • 19 Mar 2024 • Junhao Cai, Yisheng He, Weihao Yuan, Siyu Zhu, Zilong Dong, Liefeng Bo, Qifeng Chen

Derived from OmniObject3D, OO3D-9D is the largest and most diverse dataset in the field of category-level object pose and size estimation.

Object

Paper
Add Code

VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

no code implementations • 18 Mar 2024 • Qi Zuo, Xiaodong Gu, Lingteng Qiu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Rui Peng, Siyu Zhu, Zilong Dong, Liefeng Bo, QiXing Huang

Images from video generative models are more suitable for multi-view generation because the underlying network architecture that generates them employs a temporal module to enforce frame consistency.

Denoising

Paper
Add Code

Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation

no code implementations • 25 Jan 2024 • Minglin Chen, Weihao Yuan, Yukun Wang, Zhe Sheng, Yisheng He, Zilong Dong, Liefeng Bo, Yulan Guo

We propose a novel synchronized generation and reconstruction method to effectively optimize the NeRF.

3D Generation Text to 3D

Paper
Add Code

RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D

no code implementations • 28 Nov 2023 • Lingteng Qiu, GuanYing Chen, Xiaodong Gu, Qi Zuo, Mutian Xu, Yushuang Wu, Weihao Yuan, Zilong Dong, Liefeng Bo, Xiaoguang Han

Lifting 2D diffusion for 3D generation is a challenging problem due to the lack of geometric prior and the complex entanglement of materials and lighting in natural images.

3D Generation Text to 3D

Paper
Add Code

Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing

no code implementations • 7 Aug 2023 • Junyi Zeng, Chong Bao, Rui Chen, Zilong Dong, Guofeng Zhang, Hujun Bao, Zhaopeng Cui

Recently, Neural Radiance Fields (NeRF) has exhibited significant success in novel view synthesis, surface reconstruction, etc.

Neural Rendering Novel View Synthesis +1

Paper
Add Code

Fine-grained Text-Video Retrieval with Frozen Image Encoders

no code implementations • 14 Jul 2023 • Zuozhuo Dai, Fangtao Shao, Qingkun Su, Zilong Dong, Siyu Zhu

In the second stage, we propose a novel decoupled video text cross attention module to capture fine-grained multimodal information in spatial and temporal dimensions.

Decoder Retrieval +1

Paper
Add Code

PanoContext-Former: Panoramic Total Scene Understanding with a Transformer

no code implementations • 21 May 2023 • Yuan Dong, Chuan Fang, Liefeng Bo, Zilong Dong, Ping Tan

Panoramic image enables deeper understanding and more holistic perception of $360^\circ$ surrounding environment, which can naturally encode enriched scene context information compared to standard perspective image.

3D Object Detection object-detection +1

Paper
Add Code

3D Former: Monocular Scene Reconstruction with 3D SDF Transformers

1 code implementation • 31 Jan 2023 • Weihao Yuan, Xiaodong Gu, Heng Li, Zilong Dong, Siyu Zhu

In this work, we propose an SDF transformer network, which replaces the role of 3D CNN for better 3D feature aggregation.

Paper
Code

Dense RGB SLAM with Neural Implicit Maps

no code implementations • 21 Jan 2023 • Heng Li, Xiaodong Gu, Weihao Yuan, Luwei Yang, Zilong Dong, Ping Tan

To reach this challenging goal without depth input, we introduce a hierarchical feature volume to facilitate the implicit map decoder.

Decoder Simultaneous Localization and Mapping

Paper
Add Code

${S}^{2}$Net: Accurate Panorama Depth Estimation on Spherical Surface

no code implementations • 14 Jan 2023 • Meng Li, Senbo Wang, Weihao Yuan, Weichao Shen, Zhe Sheng, Zilong Dong

In this paper, we propose an end-to-end deep network for monocular panorama depth estimation on a unit spherical surface.

Decoder Monocular Depth Estimation

Paper
Add Code

RenderNet: Visual Relocalization Using Virtual Viewpoints in Large-Scale Indoor Environments

no code implementations • 26 Jul 2022 • Jiahui Zhang, Shitao Tang, Kejie Qiu, Rui Huang, Chuan Fang, Le Cui, Zilong Dong, Siyu Zhu, Ping Tan

Visual relocalization has been a widely discussed problem in 3D vision: given a pre-constructed 3D visual map, the 6 DoF (Degrees-of-Freedom) pose of a query image is estimated.

Image Retrieval Retrieval +1

Paper
Add Code

AR Mapping: Accurate and Efficient Mapping for Augmented Reality

no code implementations • 27 Mar 2021 • Rui Huang, Chuan Fang, Kejie Qiu, Le Cui, Zilong Dong, Siyu Zhu, Ping Tan

Secondly, we propose an AR mapping pipeline which takes the input from the scanning device and produces accurate AR Maps.

Paper
Add Code

DRO: Deep Recurrent Optimizer for Video to Depth

1 code implementation • 24 Mar 2021 • Xiaodong Gu, Weihao Yuan, Zuozhuo Dai, Siyu Zhu, Chengzhou Tang, Zilong Dong, Ping Tan

There are increasing interests of studying the video-to-depth (V2D) problem with machine learning techniques.

Paper
Code

UniFuse: Unidirectional Fusion for 360$^{\circ}$ Panorama Depth Estimation

1 code implementation • 6 Feb 2021 • Hualie Jiang, Zhe Sheng, Siyu Zhu, Zilong Dong, Rui Huang

Besides, we also designed a more effective fusion module for our fusion scheme.

Ranked #1 on Depth Estimation on Matterport3D

Depth Estimation

Paper
Code

ENFT: Efficient Non-Consecutive Feature Tracking for Robust Structure-from-Motion

3 code implementations • 27 Oct 2015 • Guofeng Zhang, Hao-Min Liu, Zilong Dong, Jiaya Jia, Tien-Tsin Wong, Hujun Bao

Our framework consists of steps of solving the feature `dropout' problem when indistinctive structures, noise or large image distortion exists, and of rapidly recognizing and joining common features located in different subsequences.

250

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.