Search Results for author: Zhiyu Tan

Found 10 papers, 7 papers with code

Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation

no code implementations • 8 Mar 2024 • Junyan Wang, Zhenhong Sun, Zhiyu Tan, Xuanbai Chen, Weihua Chen, Hao Li, Cheng Zhang, Yang song

Vanilla text-to-image diffusion models struggle with generating accurate human images, commonly resulting in imperfect anatomies such as unnatural postures or disproportionate limbs. Existing methods address this issue mostly by fine-tuning the model with extra images or adding additional controls -- human-centric priors such as pose or depth maps -- during the image generation phase.

Image Generation

Paper
Add Code

OVO: Open-Vocabulary Occupancy

1 code implementation • 25 May 2023 • Zhiyu Tan, ZiChao Dong, Cheng Zhang, Weikun Zhang, Hang Ji, Hao Li

Semantic occupancy prediction aims to infer dense geometry and semantics of surroundings for an autonomous agent to operate safely in the 3D environment.

Knowledge Distillation

Paper
Code

Entroformer: A Transformer-based Entropy Model for Learned Image Compression

2 code implementations • ICLR 2022 • Yichen Qian, Ming Lin, Xiuyu Sun, Zhiyu Tan, Rong Jin

One critical component in lossy deep image compression is the entropy model, which predicts the probability distribution of the quantized latent representation in the encoding and decoding modules.

Image Classification Image Compression +1

Paper
Code

GiraffeDet: A Heavy-Neck Paradigm for Object Detection

2 code implementations • ICLR 2022 • Yiqi Jiang, Zhiyu Tan, Junyan Wang, Xiuyu Sun, Ming Lin, Hao Li

This heavy-backbone design paradigm is mostly due to the historical legacy when transferring image recognition models to object detection rather than an end-to-end optimized design for object detection.

Object object-detection +1

Paper
Code

MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot NAS for Efficient Object Detection

1 code implementation • 26 Nov 2021 • Zhenhong Sun, Ming Lin, Xiuyu Sun, Zhiyu Tan, Hao Li, Rong Jin

Recent researches attempt to reduce this cost by optimizing the backbone architecture with the help of Neural Architecture Search (NAS).

Ranked #88 on Object Detection on COCO minival

Neural Architecture Search Object +2

345

Paper
Code

ZenDet: Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search

no code implementations • 29 Sep 2021 • Zhenhong Sun, Ming Lin, Zhiyu Tan, Xiuyu Sun, Rong Jin

Recent researches attempt to reduce this cost by optimizing the backbone architecture with the help of Neural Architecture Search (NAS).

Neural Architecture Search Object +2

Paper
Add Code

Interpolation variable rate image compression

1 code implementation • 20 Sep 2021 • Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Yichen Qian, Dongyang Li, Hao Li

Compression standards have been used to reduce the cost of image storage and transmission for decades.

Image Compression MS-SSIM +1

Paper
Code

Spatiotemporal Entropy Model is All You Need for Learned Video Compression

1 code implementation • 13 Apr 2021 • Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Dongyang Li, Yichen Qian, Hao Li

The framework of dominant learned video compression methods is usually composed of motion prediction modules as well as motion vector and residual image compression modules, suffering from its complex structure and error propagation problem.

Image Compression motion prediction +3

Paper
Code

Learning Accurate Entropy Model with Global Reference for Image Compression

2 code implementations • ICLR 2021 • Yichen Qian, Zhiyu Tan, Xiuyu Sun, Ming Lin, Dongyang Li, Zhenhong Sun, Hao Li, Rong Jin

In this work, we propose a novel Global Reference Model for image compression to effectively leverage both the local and the global context information, leading to an enhanced compression rate.

Image Compression

Paper
Code

Learning to Rank Proposals for Object Detection

no code implementations • ICCV 2019 • Zhiyu Tan, Xuecheng Nie, Qi Qian, Nan Li, Hao Li

Non-Maximum Suppression (NMS) is an essential step of modern object detection models for removing duplicated candidates.

Learning-To-Rank Object +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.