Search Results for author: Haomiao Ni

Found 11 papers, 8 papers with code

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models

1 code implementation • 25 Apr 2024 • Haomiao Ni, Bernhard Egger, Suhas Lohit, Anoop Cherian, Ye Wang, Toshiaki Koike-Akino, Sharon X. Huang, Tim K. Marks

To guide video generation with the additional image input, we propose a "repeat-and-slide" strategy that modulates the reverse denoising process, allowing the frozen diffusion model to synthesize a video frame-by-frame starting from the provided image.

Denoising Image to Video Generation

420

Paper
Code

3D-Aware Talking-Head Video Motion Transfer

no code implementations • 5 Nov 2023 • Haomiao Ni, Jiachen Liu, Yuan Xue, Sharon X. Huang

In this paper, we propose a novel 3D-aware talking-head video motion transfer network, Head3D, which fully exploits the subject appearance information by generating a visually-interpretable 3D canonical head from the 2D subject frames with a recurrent network.

Novel View Synthesis

Paper
Add Code

Synthetic Augmentation with Large-scale Unconditional Pre-training

1 code implementation • 8 Aug 2023 • Jiarong Ye, Haomiao Ni, Peng Jin, Sharon X. Huang, Yuan Xue

To further reduce the dependency on annotated data, we propose a synthetic augmentation method called HistoDiffusion, which can be pre-trained on large-scale unlabeled datasets and later applied to a small-scale labeled dataset for augmented training.

Paper
Code

Exploring Compositional Visual Generation with Latent Classifier Guidance

no code implementations • 25 Apr 2023 • Changhao Shi, Haomiao Ni, Kai Li, Shaobo Han, Mingfu Liang, Martin Renqiang Min

We show that this paradigm based on latent classifier guidance is agnostic to pre-trained generative models, and present competitive results for both image generation and sequential manipulation of real and synthetic images.

Image Generation

Paper
Add Code

Conditional Image-to-Video Generation with Latent Flow Diffusion Models

1 code implementation • CVPR 2023 • Haomiao Ni, Changhao Shi, Kai Li, Sharon X. Huang, Martin Renqiang Min

In this paper, we propose an approach for cI2V using novel latent flow diffusion models (LFDM) that synthesize an optical flow sequence in the latent space based on the given condition to warp the given image.

Image to Video Generation Optical Flow Estimation

420

Paper
Code

Semi-supervised Body Parsing and Pose Estimation for Enhancing Infant General Movement Assessment

2 code implementations • 14 Oct 2022 • Haomiao Ni, Yuan Xue, Liya Ma, Qian Zhang, Xiaoye Li, Xiaolei Huang

We collected a new clinical IMV dataset with GMA annotations, and our experiments show that SPN models for body parsing and pose estimation trained on the first two datasets generalize well to the new clinical dataset and their results can significantly boost the CRNN-based GMA prediction performance.

Data Augmentation Generative Adversarial Network +1

Paper
Code

Cross-identity Video Motion Retargeting with Joint Transformation and Synthesis

1 code implementation • 2 Oct 2022 • Haomiao Ni, Yihao Liu, Sharon X. Huang, Yuan Xue

The novel design of dual branches combines the strengths of deformation-grid-based transformation and warp-free generation for better identity preservation and robustness to occlusion in the synthesized videos.

motion retargeting

140

Paper
Code

Asymmetry Disentanglement Network for Interpretable Acute Ischemic Stroke Infarct Segmentation in Non-Contrast CT Scans

1 code implementation • 30 Jun 2022 • Haomiao Ni, Yuan Xue, Kelvin Wong, John Volpi, Stephen T. C. Wong, James Z. Wang, Xiaolei Huang

In this paper, we propose a novel Asymmetry Disentanglement Network (ADN) to automatically separate pathological asymmetries and intrinsic anatomical asymmetries in NCCTs for more effective and interpretable AIS segmentation.

Disentanglement Segmentation

Paper
Code

DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning

no code implementations • 24 Sep 2021 • Tongan Cai, Haomiao Ni, Mingli Yu, Xiaolei Huang, Kelvin Wong, John Volpi, James Z. Wang, Stephen T. C. Wong

In an emergency room (ER) setting, stroke triage or screening is a common challenge.

Attribute Multimodal Deep Learning +2

Paper
Add Code

SiamParseNet: Joint Body Parsing and Label Propagation in Infant Movement Videos

1 code implementation • 16 Jul 2020 • Haomiao Ni, Yuan Xue, Qian Zhang, Xiaolei Huang

In this paper, we propose a semi-supervised body parsing model, termed SiamParseNet (SPN), to jointly learn single frame body parsing and label propagation between frames in a semi-supervised fashion.

Paper
Code

Action Recognition Based on Optimal Joint Selection and Discriminative Depth Descriptor

1 code implementation • The 13th Asian Conference on Computer Vision 2016 • Haomiao Ni, Hong Liu, Xiangdong Wang, Yueliang Qian

This paper proposes a novel human action recognition using the decision-level fusion of both skeleton and depth sequence.

Action Recognition Dynamic Time Warping +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.