Search Results for author: Zetong Yang

Found 10 papers, 4 papers with code

CN: Channel Normalization For Point Cloud Recognition

no code implementations • ECCV 2020 • Zetong Yang, Yanan sun, Shu Liu, Xiaojuan Qi, Jiaya Jia

In 3D recognition, to fuse multi-scale structure information, existing methods apply hierarchical frameworks stacked by multiple fusion layers for integrating current relative locations with structure information from the previous level.

Paper
Add Code

Improving Distant 3D Object Detection Using 2D Box Supervision

no code implementations • 14 Mar 2024 • Zetong Yang, Zhiding Yu, Chris Choy, Renhao Wang, Anima Anandkumar, Jose M. Alvarez

This mapping allows the depth estimation of distant objects conditioned on their 2D boxes, making long-range 3D detection with 2D supervision feasible.

3D Object Detection Depth Estimation +2

Paper
Add Code

Visual Point Cloud Forecasting enables Scalable Autonomous Driving

1 code implementation • 29 Dec 2023 • Zetong Yang, Li Chen, Yanan sun, Hongyang Li

To resolve this, we bring up a new pre-training task termed as visual point cloud forecasting - predicting future point clouds from historical visual input.

Motion Forecasting

202

Paper
Code

Fully Sparse 3D Occupancy Prediction

3 code implementations • 28 Dec 2023 • Haisong Liu, Yang Chen, Haiguang Wang, Zetong Yang, Tianyu Li, Jia Zeng, Li Chen, Hongyang Li, LiMin Wang

Occupancy prediction plays a pivotal role in autonomous driving.

Autonomous Driving

493

Paper
Code

Self-supervised Pre-training with Masked Shape Prediction for 3D Scene Understanding

no code implementations • CVPR 2023 • Li Jiang, Zetong Yang, Shaoshuai Shi, Vladislav Golyanik, Dengxin Dai, Bernt Schiele

Masked signal modeling has greatly advanced self-supervised pre-training for language and 2D images.

Scene Understanding

Paper
Add Code

A Unified Query-based Paradigm for Point Cloud Understanding

1 code implementation • CVPR 2022 • Zetong Yang, Li Jiang, Yanan sun, Bernt Schiele, Jiaya Jia

This is achieved by introducing an intermediate representation, i. e., Q-representation, in the querying stage to serve as a bridge between the embedding stage and task heads.

Ranked #7 on Semantic Segmentation on S3DIS

Autonomous Driving object-detection +2

118

Paper
Code

3D-MAN: 3D Multi-frame Attention Network for Object Detection

no code implementations • CVPR 2021 • Zetong Yang, Yin Zhou, Zhifeng Chen, Jiquan Ngiam

In this paper, we present 3D-MAN: a 3D multi-frame attention network that effectively aggregates features from multiple perspectives and achieves state-of-the-art performance on Waymo Open Dataset.

3D Object Detection Autonomous Driving +1

Paper
Add Code

3DSSD: Point-based 3D Single Stage Object Detector

2 code implementations • CVPR 2020 • Zetong Yang, Yanan sun, Shu Liu, Jiaya Jia

Our method outperforms all state-of-the-art voxel-based single stage methods by a large margin, and has comparable performance to two stage point-based methods as well, with inference speed more than 25 FPS, 2x faster than former state-of-the-art point-based methods.

Object

4,858

Paper
Code