no code implementations • 22 May 2024 • Rui Xu, Jiepeng Wang, Hao Pan, Yang Liu, Xin Tong, Shiqing Xin, Changhe Tu, Taku Komura, Wenping Wang
We show that the space spanned by the combination of dimensions and attributes is insufficiently sampled by the existing training schemes of diffusion generative models, causing degraded test-time performance.
no code implementations • 19 May 2024 • Yinghao Huang, Leo Ho, Dafei Qin, Mingyi Shi, Taku Komura
We address the problem of accurate capture and expressive modelling of interactive behaviors happening between two persons in daily scenarios.
no code implementations • 24 Apr 2024 • Rui Xu, Longdu Liu, Ningna Wang, Shuangmin Chen, Shiqing Xin, Xiaohu Guo, Zichun Zhong, Taku Komura, Wenping Wang, Changhe Tu
In mesh simplification, common requirements such as accuracy, triangle quality, and feature alignment are often treated as a trade-off.
1 code implementation • 23 Apr 2024 • Rui Chen, Mingyi Shi, Shaoli Huang, Ping Tan, Taku Komura, Xuelin Chen
We present a novel character control framework that effectively utilizes motion diffusion probabilistic models to generate high-quality and diverse character animations, responding in real-time to a variety of dynamic user-supplied control signals.
no code implementations • 23 Jan 2024 • Zimeng Wang, Zhiyang Dou, Rui Xu, Cheng Lin, Yuan Liu, Xiaoxiao Long, Shiqing Xin, Taku Komura, Xiaoming Yuan, Wenping Wang
We introduce Coverage Axis++, a novel and efficient approach to 3D shape skeletonization.
no code implementations • 2 Jan 2024 • Guying Lin, Lei Yang, Yuan Liu, Congyi Zhang, Junhui Hou, Xiaogang Jin, Taku Komura, John Keyser, Wenping Wang
Sampling according to this intrinsic frequency, following the Nyquist–Shannon sampling theorem, allows us to determine an appropriate training sampling rate.
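The Nyquist–Shannon criterion referenced here can be illustrated with a minimal NumPy sketch. This is a generic, hedged example, not the paper's procedure: the function name `nyquist_sampling_rate` and the energy-fraction heuristic for picking a "highest relevant frequency" are assumptions introduced for illustration only.

```python
import numpy as np

def nyquist_sampling_rate(signal, dt, energy_frac=0.99):
    """Estimate a sampling rate satisfying the Nyquist-Shannon criterion.

    The 'intrinsic frequency' here is a stand-in: we take the highest
    frequency needed to capture `energy_frac` of the spectral energy,
    then double it, since Nyquist requires sampling above twice the
    highest frequency present in the signal.
    """
    spectrum = np.abs(np.fft.rfft(signal)) ** 2
    freqs = np.fft.rfftfreq(len(signal), d=dt)
    cumulative = np.cumsum(spectrum) / np.sum(spectrum)
    f_max = freqs[np.searchsorted(cumulative, energy_frac)]
    return 2.0 * f_max

# Example: a pure 5 Hz sine needs a sampling rate above 10 Hz.
t = np.arange(0, 1, 0.001)
rate = nyquist_sampling_rate(np.sin(2 * np.pi * 5 * t), dt=0.001)
```

Sampling a network's target function below this rate aliases its high-frequency content, which is the failure mode the entry's training-rate analysis guards against.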
no code implementations • 7 Dec 2023 • Weilin Wan, Yiming Huang, Shutong Wu, Taku Komura, Wenping Wang, Dinesh Jayaraman, Lingjie Liu
In this study, we introduce a learning-based method for generating high-quality human motion sequences from text descriptions (e.g., "A person walks forward").
no code implementations • 4 Dec 2023 • Wenyang Zhou, Zhiyang Dou, Zeyu Cao, Zhouyingcheng Liao, Jingbo Wang, Wenjia Wang, Yuan Liu, Taku Komura, Wenping Wang, Lingjie Liu
We introduce Efficient Motion Diffusion Model (EMDM) for fast and high-quality human motion generation.
no code implementations • 29 Nov 2023 • Yilin Wen, Hao Pan, Takehiko Ohkawa, Lei Yang, Jia Pan, Yoichi Sato, Taku Komura, Wenping Wang
We present a novel framework that concurrently tackles hand action recognition and 3D future hand motion prediction.
no code implementations • 29 Nov 2023 • Jiepeng Wang, Hao Pan, Yang Liu, Xin Tong, Taku Komura, Wenping Wang
Such a localized rewriting process enables probabilistic modeling of ambiguous structures and robust generalization across object categories.
no code implementations • 28 Nov 2023 • Weilin Wan, Zhiyang Dou, Taku Komura, Wenping Wang, Dinesh Jayaraman, Lingjie Liu
Controllable human motion synthesis is essential for applications in AR/VR, gaming, movies, and embodied AI.
no code implementations • 28 Nov 2023 • Zhengming Yu, Zhiyang Dou, Xiaoxiao Long, Cheng Lin, Zekun Li, Yuan Liu, Norman Müller, Taku Komura, Marc Habermann, Christian Theobalt, Xin Li, Wenping Wang
The experiments demonstrate the superior performance of Surf-D in shape generation across multiple modalities as conditions.
no code implementations • 20 Sep 2023 • Zhiyang Dou, Xuelin Chen, Qingnan Fan, Taku Komura, Wenping Wang
We present C$\cdot$ASE, an efficient and effective framework that learns conditional Adversarial Skill Embeddings for physics-based characters.
2 code implementations • 7 Sep 2023 • Yuan Liu, Cheng Lin, Zijiao Zeng, Xiaoxiao Long, Lingjie Liu, Taku Komura, Wenping Wang
In this paper, we present a novel diffusion model that generates multiview-consistent images from a single-view image.
no code implementations • 7 Sep 2023 • Kunkun Pang, Dafei Qin, Yingruo Fan, Julian Habekost, Takaaki Shiratori, Junichi Yamagishi, Taku Komura
Learning the mapping between speech and 3D full-body gestures is difficult due to the stochastic nature of the problem and the lack of a rich cross-modal dataset that is needed for training.
1 code implementation • 24 Aug 2023 • Paul Starke, Sebastian Starke, Taku Komura, Frank Steinicke
This paper introduces a novel data-driven motion in-betweening system that reaches target character poses by making use of phase variables learned by a Periodic Autoencoder.
1 code implementation • 27 May 2023 • Yuan Liu, Peng Wang, Cheng Lin, Xiaoxiao Long, Jiepeng Wang, Lingjie Liu, Taku Komura, Wenping Wang
We present a neural rendering-based method called NeRO for reconstructing the geometry and the BRDF of reflective objects from multiview images captured in an unknown environment.
1 code implementation • 15 May 2023 • Dafei Qin, Jun Saito, Noam Aigerman, Thibault Groueix, Taku Komura
We propose an end-to-end deep-learning approach for automatic rigging and retargeting of 3D models of human faces in the wild.
1 code implementation • 28 Mar 2023 • Peng Wang, Yuan Liu, Zhaoxi Chen, Lingjie Liu, Ziwei Liu, Taku Komura, Christian Theobalt, Wenping Wang
Based on our analysis, we further propose a novel space-warping method called perspective warping, which allows us to handle arbitrary trajectories in the grid-based NeRF framework.
1 code implementation • ICCV 2023 • Wenjia Wang, Yongtao Ge, Haiyi Mei, Zhongang Cai, Qingping Sun, Yanjun Wang, Chunhua Shen, Lei Yang, Taku Komura
Because single-view RGB images captured in the wild are hard to calibrate, existing 3D human mesh reconstruction (3DHMR) methods either use a constant large focal length or estimate one from the background environment context; neither approach can handle the torso, limb, hand, or face distortion caused by perspective camera projection when the camera is close to the human body.
Ranked #6 on 3D Human Pose Estimation on 3DPW
no code implementations • 11 Mar 2023 • Jiawei Huang, Akito Iizuka, Hajime Tanaka, Taku Komura, Yoshifumi Kitamura
The variance reduction speed of physically-based rendering is heavily affected by the adopted importance sampling technique.
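The variance effect named in this entry can be shown with a textbook Monte Carlo example (a hedged, generic illustration, not this paper's sampler): when the sampling pdf is proportional to the integrand, the ratio f/p is constant and the estimator's variance collapses.

```python
import numpy as np

rng = np.random.default_rng(0)

def mc_estimate(f, sampler, pdf, n):
    """Monte Carlo estimate of an integral over [0, 1]: mean of f(X)/p(X)."""
    x = sampler(n)
    return np.mean(f(x) / pdf(x))

# Integrand concentrated near x = 1: f(x) = 3x^2, exact integral = 1.
f = lambda x: 3.0 * x**2

# Uniform sampling vs. sampling proportional to the integrand
# (pdf p(x) = 3x^2, drawn by inverse-CDF sampling: x = u^(1/3)).
uniform = mc_estimate(f, lambda n: rng.random(n),
                      lambda x: np.ones_like(x), n=1000)
matched = mc_estimate(f, lambda n: rng.random(n) ** (1.0 / 3.0),
                      lambda x: 3.0 * x**2, n=1000)
# `matched` equals the exact integral with zero variance, since
# f(x)/p(x) is identically 1; `uniform` fluctuates around it.
```

Real renderers can only approximate the integrand's shape, which is why the choice of importance sampling technique governs how quickly variance falls.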
no code implementations • ICCV 2023 • Mingyi Shi, Sebastian Starke, Yuting Ye, Taku Komura, Jungdam Won
We present a novel motion prior, called PhaseMP, modeling a probability distribution on pose transitions conditioned by a frequency domain feature extracted from a periodic autoencoder.
no code implementations • CVPR 2023 • Peng Wang, Yuan Liu, Zhaoxi Chen, Lingjie Liu, Ziwei Liu, Taku Komura, Christian Theobalt, Wenping Wang
Existing fast grid-based NeRF training frameworks, like Instant-NGP, Plenoxels, DVGO, or TensoRF, are mainly designed for bounded scenes and rely on space warping to handle unbounded scenes.
no code implementations • CVPR 2023 • Xiaoxiao Long, Cheng Lin, Lingjie Liu, Yuan Liu, Peng Wang, Christian Theobalt, Taku Komura, Wenping Wang
In this paper, we propose to represent surfaces as the Unsigned Distance Function (UDF) and develop a new volume rendering scheme to learn the neural UDF representation.
no code implementations • ICCV 2023 • Zhiyang Dou, Qingxuan Wu, Cheng Lin, Zeyu Cao, Qiangqiang Wu, Weilin Wan, Taku Komura, Wenping Wang
We further demonstrate the generalizability of our method on hand mesh recovery.
1 code implementation • CVPR 2023 • Yilin Wen, Hao Pan, Lei Yang, Jia Pan, Taku Komura, Wenping Wang
Understanding dynamic hand motions and actions from egocentric RGB videos is a fundamental yet challenging task due to self-occlusion and ambiguity.
no code implementations • 10 Jul 2022 • Peng Wang, Yuan Liu, Guying Lin, Jiatao Gu, Lingjie Liu, Taku Komura, Wenping Wang
ProLiF encodes a 4D light field, which allows rendering a large batch of rays in one training step for image- or patch-level losses.
no code implementations • 27 Jun 2022 • Jiepeng Wang, Peng Wang, Xiaoxiao Long, Christian Theobalt, Taku Komura, Lingjie Liu, Wenping Wang
The key idea of NeuRIS is to integrate the estimated normals of indoor scenes as a prior in a neural rendering framework for reconstructing large texture-less shapes and, importantly, to do so adaptively so that irregular shapes with fine details can also be reconstructed.
no code implementations • 25 Jun 2022 • Weilin Wan, Lei Yang, Lingjie Liu, Zhuoying Zhang, Ruixing Jia, Yi-King Choi, Jia Pan, Christian Theobalt, Taku Komura, Wenping Wang
We also observe that an object's intrinsic physical properties are useful for object motion prediction, and thus design a set of object dynamic descriptors to encode such intrinsic properties.
1 code implementation • 12 Jun 2022 • Xiaoxiao Long, Cheng Lin, Peng Wang, Taku Komura, Wenping Wang
We introduce SparseNeuS, a novel neural rendering-based method for the task of surface reconstruction from multi-view images.
1 code implementation • 22 Apr 2022 • Yuan Liu, Yilin Wen, Sida Peng, Cheng Lin, Xiaoxiao Long, Taku Komura, Wenping Wang
In this paper, we present a generalizable model-free 6-DoF object pose estimator called Gen6D.
1 code implementation • 12 Jan 2022 • Ian Mason, Sebastian Starke, Taku Komura
In this work we present a style modelling system that uses an animation synthesis network to model motion content based on local motion phases.
1 code implementation • CVPR 2022 • Yingruo Fan, Zhaojiang Lin, Jun Saito, Wenping Wang, Taku Komura
Speech-driven 3D facial animation is challenging due to the complex geometry of human faces and the limited availability of 3D audio-visual data.
Ranked #1 on 3D Face Animation on VOCASET
no code implementations • 4 Dec 2021 • Yingruo Fan, Zhaojiang Lin, Jun Saito, Wenping Wang, Taku Komura
The existing datasets are collected to cover as many different phonemes as possible instead of sentences, thus limiting the capability of the audio-based model to learn more diverse contexts.
1 code implementation • 27 Jul 2021 • Yilin Wen, Xiangyu Li, Hao Pan, Lei Yang, Zheng Wang, Taku Komura, Wenping Wang
Scalable 6D pose estimation for rigid objects from RGB images aims at handling multiple objects and generalizing to novel objects.
6 code implementations • NeurIPS 2021 • Peng Wang, Lingjie Liu, Yuan Liu, Christian Theobalt, Taku Komura, Wenping Wang
In NeuS, we propose to represent a surface as the zero-level set of a signed distance function (SDF) and develop a new volume rendering method to train a neural SDF representation.
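The zero-level-set formulation above can be illustrated with a minimal NumPy sketch of the discrete opacity rule from the NeuS paper, α_i = max((Φ_s(f(p_i)) − Φ_s(f(p_{i+1}))) / Φ_s(f(p_i)), 0), where Φ_s is a logistic CDF applied to SDF values along a ray. The sharpness value and SDF samples below are illustrative stand-ins, not the paper's learned quantities.

```python
import numpy as np

def logistic_cdf(x, s):
    # Phi_s: sigmoid with sharpness s, turning an SDF value into [0, 1].
    return 1.0 / (1.0 + np.exp(-s * x))

def neus_alpha(sdf_vals, s=64.0):
    """Discrete opacity of successive ray segments from SDF samples,
    following NeuS's alpha formulation for unbiased, occlusion-aware
    volume rendering of the zero-level set."""
    phi = logistic_cdf(sdf_vals, s)
    alpha = (phi[:-1] - phi[1:]) / np.clip(phi[:-1], 1e-6, None)
    return np.clip(alpha, 0.0, 1.0)

# A ray crossing the surface: SDF goes from positive (outside) to
# negative (inside); opacity peaks near the zero crossing.
sdf = np.array([0.5, 0.2, 0.05, -0.05, -0.2])
alphas = neus_alpha(sdf)
```

Segments far from the surface contribute near-zero opacity, while the segment spanning the sign change dominates, which is what concentrates rendering weight on the zero-level set.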
no code implementations • 22 Jun 2020 • Mingyi Shi, Kfir Aberman, Andreas Aristidou, Taku Komura, Dani Lischinski, Daniel Cohen-Or, Baoquan Chen
We introduce MotioNet, a deep neural network that directly reconstructs the motion of a 3D human skeleton from monocular video. While previous methods rely on either rigging or inverse kinematics (IK) to associate a consistent skeleton with temporally coherent joint rotations, our method is the first data-driven approach that directly outputs a kinematic skeleton, which is a complete, commonly used, motion representation.
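The rotation-based skeleton representation mentioned above is consumed by standard forward kinematics. As a generic, hedged sketch (not MotioNet's network or data format; parent indices, offsets, and rotation matrices here are illustrative), per-joint local rotations and bone offsets yield global joint positions:

```python
import numpy as np

def forward_kinematics(parents, offsets, rotations):
    """Global joint positions of a kinematic skeleton.

    parents[i]  : index of joint i's parent (-1 for the root)
    offsets[i]  : bone offset of joint i in its parent's frame, shape (3,)
    rotations[i]: local rotation matrix of joint i, shape (3, 3)
    """
    n = len(parents)
    global_rot = [None] * n
    positions = np.zeros((n, 3))
    for i in range(n):  # assumes parents precede children
        if parents[i] == -1:
            global_rot[i] = rotations[i]
            positions[i] = offsets[i]
        else:
            p = parents[i]
            global_rot[i] = global_rot[p] @ rotations[i]
            positions[i] = positions[p] + global_rot[p] @ offsets[i]
    return positions

# Two-joint chain: child offset 1 unit along x; rotating the root
# 90 degrees about z swings the child to (0, 1, 0).
rz90 = np.array([[0.0, -1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]])
pos = forward_kinematics(
    parents=[-1, 0],
    offsets=np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]]),
    rotations=np.stack([rz90, np.eye(3)]),
)
```

Outputting rotations plus a fixed skeleton, rather than raw joint positions, is what keeps bone lengths consistent across a reconstructed motion.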
1 code implementation • 10/06 2020 • Sebastian Dorothee Starke, Yiwei Zhao, Taku Komura, Kazi A. Zaman
Training a bipedal character to play basketball and interact with objects, or a quadruped character to move in various locomotion modes, is difficult due to the fast and complex contacts that occur during motion.
no code implementations • 7 Feb 2020 • Chuanyu Yang, Kai Yuan, Wolfgang Merkt, Taku Komura, Sethu Vijayakumar, Zhibin Li
This paper presents a hierarchical framework for Deep Reinforcement Learning that acquires motor skills for a variety of push recovery and balancing behaviors, i.e., ankle, hip, foot-tilting, and stepping strategies.
no code implementations • 13 Dec 2019 • Pengpeng Hu, Edmond SL Ho, Nauman Aslam, Taku Komura, Hubert PH Shum
However, virtual clothing fit evaluation remains under-researched.
no code implementations • 16 Apr 2018 • Pengpeng Hu, Duan Li, Ge Wu, Taku Komura, Dongliang Zhang, Yueqi Zhong
A personalized mannequin is essential for apparel customization using CAD technologies.
no code implementations • 6 Nov 2017 • Pengpeng Hu, Taku Komura, Duan Li, Ge Wu, Yueqi Zhong
The purpose of this paper is to present a novel framework for reconstructing 3D textile models with synthesized texture.
no code implementations • 9 May 2017 • Pengpeng Hu, Taku Komura, Daniel Holden, Yueqi Zhong
In this paper, we propose a novel scanning-based solution for modeling and animating characters wearing multiple layers of clothes.