Search Results for author: Bo Dai

Found 192 papers, 78 papers with code

MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model

no code implementations • 30 Apr 2024 • Wenxun Dai, Ling-Hao Chen, Jingbo Wang, Jinpeng Liu, Bo Dai, Yansong Tang

By employing one-step (or few-step) inference, we further improve the runtime efficiency of the motion latent diffusion model for motion generation.

Paper
Add Code

PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios

no code implementations • 30 Apr 2024 • Jingbo Wang, Zhengyi Luo, Ye Yuan, Yixuan Li, Bo Dai

We address the challenge of content diversity and controllability in pedestrian simulation for driving scenarios.

Paper
Add Code

PhyRecon: Physically Plausible Neural Scene Reconstruction

no code implementations • 25 Apr 2024 • Junfeng Ni, Yixin Chen, Bohan Jing, Nan Jiang, Bin Wang, Bo Dai, Yixin Zhu, Song-Chun Zhu, Siyuan Huang

In this paper, we introduce PhyRecon, which stands as the first approach to harness both differentiable rendering and differentiable physics simulation to learn implicit surface representations.

3D Reconstruction Multi-View 3D Reconstruction

Paper
Add Code

TELA: Text to Layer-wise 3D Clothed Human Generation

no code implementations • 25 Apr 2024 • Junting Dong, Qi Fang, Zehuan Huang, Xudong Xu, Jingbo Wang, Sida Peng, Bo Dai

Previous works usually encode the human body and clothes as a holistic model and generate the whole model in a single-stage optimization, which makes them struggle for clothing editing and meanwhile lose fine-grained control over the whole generation process.

Disentanglement Virtual Try-on

Paper
Add Code

Efficient Duple Perturbation Robustness in Low-rank MDPs

no code implementations • 11 Apr 2024 • Yang Hu, Haitong Ma, Bo Dai, Na Li

The pursuit of robustness has recently been a popular topic in reinforcement learning (RL) research, yet the existing methods generally suffer from efficiency issues that obstruct their real-world implementation.

Reinforcement Learning (RL)

Paper
Add Code

Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint

no code implementations • 7 Apr 2024 • Haitong Ma, Zhaolin Ren, Bo Dai, Na Li

Moreover, to handle the sim-to-real gap in the dynamics, we propose a skill discovery algorithm that learns new skills caused by the sim-to-real gap from real-world data.

Representation Learning

Paper
Add Code

SemGrasp: Semantic Grasp Generation via Language Aligned Discretization

no code implementations • 4 Apr 2024 • Kailin Li, Jingbo Wang, Lixin Yang, Cewu Lu, Bo Dai

We introduce a discrete representation that aligns the grasp space with semantic space, enabling the generation of grasp postures in accordance with language instructions.

Grasp Generation Language Modelling +2

Paper
Add Code

CameraCtrl: Enabling Camera Control for Text-to-Video Generation

1 code implementation • 2 Apr 2024 • Hao He, Yinghao Xu, Yuwei Guo, Gordon Wetzstein, Bo Dai, Hongsheng Li, Ceyuan Yang

Controllability plays a crucial role in video generation since it allows users to create desired content.

Text-to-Video Generation Video Generation

225

Paper
Code

Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians

1 code implementation • 26 Mar 2024 • Kerui Ren, Lihan Jiang, Tao Lu, Mulin Yu, Linning Xu, Zhangkai Ni, Bo Dai

The recent 3D Gaussian splatting (3D-GS) has shown remarkable rendering fidelity and efficiency compared to NeRF-based neural scene representations.

Neural Rendering

396

Paper
Code

GSDF: 3DGS Meets SDF for Improved Rendering and Reconstruction

no code implementations • 25 Mar 2024 • Mulin Yu, Tao Lu, Linning Xu, Lihan Jiang, Yuanbo Xiangli, Bo Dai

We show on diverse scenes that our design unlocks the potential for more accurate and detailed surface reconstructions, and at the meantime benefits 3DGS rendering with structures that are more aligned with the underlying geometry.

Neural Rendering

Paper
Add Code

Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text

no code implementations • 25 Mar 2024 • Junshu Tang, Yanhong Zeng, Ke Fan, Xuheng Wang, Bo Dai, Kai Chen, Lizhuang Ma

Creating and animating 3D biped cartoon characters is crucial and valuable in various applications.

Question Answering Texture Synthesis

Paper
Add Code

LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation

no code implementations • 18 Mar 2024 • Yushi Lan, Fangzhou Hong, Shuai Yang, Shangchen Zhou, Xuyi Meng, Bo Dai, Xingang Pan, Chen Change Loy

The latent is decoded by a transformer-based decoder into a high-capacity 3D neural field.

3D Generation 3D Reconstruction +2

Paper
Add Code

GetMesh: A Controllable Model for High-quality Mesh Generation and Manipulation

no code implementations • 18 Mar 2024 • Zhaoyang Lyu, Ben Fei, Jinyi Wang, Xudong Xu, Ya zhang, Weidong Yang, Bo Dai

Mesh is a fundamental representation of 3D assets in various industrial applications, and is widely supported by professional softwares.

Paper
Add Code

Generalized Predictive Model for Autonomous Driving

1 code implementation • 14 Mar 2024 • Jiazhi Yang, Shenyuan Gao, Yihang Qiu, Li Chen, Tianyu Li, Bo Dai, Kashyap Chitta, Penghao Wu, Jia Zeng, Ping Luo, Jun Zhang, Andreas Geiger, Yu Qiao, Hongyang Li

In this paper, we introduce the first large-scale video prediction model in the autonomous driving discipline.

Autonomous Driving Video Prediction

384

Paper
Code

Stochastic Gradient Succeeds for Bandits

no code implementations • 27 Feb 2024 • Jincheng Mei, Zixin Zhong, Bo Dai, Alekh Agarwal, Csaba Szepesvari, Dale Schuurmans

We show that the \emph{stochastic gradient} bandit algorithm converges to a \emph{globally optimal} policy at an $O(1/t)$ rate, even with a \emph{constant} step size.

Paper
Add Code

BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models

1 code implementation • 13 Feb 2024 • Haotian Sun, Yuchen Zhuang, Wei Wei, Chao Zhang, Bo Dai

BBox-Adapter distinguishes target and source domain data by treating target data as positive and source data as negative.

Paper
Code

Beyond Expectations: Learning with Stochastic Dominance Made Practical

no code implementations • 5 Feb 2024 • Shicong Cen, Jincheng Mei, Hanjun Dai, Dale Schuurmans, Yuejie Chi, Bo Dai

Stochastic dominance models risk-averse preferences for decision making with uncertain outcomes, which naturally captures the intrinsic structure of the underlying uncertainty, in contrast to simply resorting to the expectations.

Decision Making Portfolio Optimization

Paper
Add Code

DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing

no code implementations • 12 Dec 2023 • Kaiwen Zhang, Yifan Zhou, Xudong Xu, Xingang Pan, Bo Dai

Our key idea is to capture the semantics of the two images by fitting two LoRAs to them respectively, and interpolate between both the LoRA parameters and the latent noises to ensure a smooth semantic transition, where correspondence automatically emerges without the need for annotation.

Image Generation Image Morphing

Paper
Add Code

EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion

no code implementations • 11 Dec 2023 • Zehuan Huang, Hao Wen, Junting Dong, Yaohui Wang, Yangguang Li, Xinyuan Chen, Yan-Pei Cao, Ding Liang, Yu Qiao, Bo Dai, Lu Sheng

Generating multiview images from a single view facilitates the rapid generation of a 3D mesh conditioned on a single image.

SSIM

Paper
Add Code

EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM

1 code implementation • 11 Dec 2023 • Chong Zhou, Xiangtai Li, Chen Change Loy, Bo Dai

It is also the first SAM variant that can run at over 30 FPS on an iPhone 14.

Decoder

695

Paper
Code

BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation

no code implementations • 4 Dec 2023 • Qihang Zhang, Yinghao Xu, Yujun Shen, Bo Dai, Bolei Zhou, Ceyuan Yang

Generating large-scale 3D scenes cannot simply apply existing 3D object synthesis technique since 3D scenes usually hold complex spatial configurations and consist of a number of objects at varying scales.

Scene Generation

Paper
Add Code

Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering

1 code implementation • 30 Nov 2023 • Tao Lu, Mulin Yu, Linning Xu, Yuanbo Xiangli, LiMin Wang, Dahua Lin, Bo Dai

Neural rendering methods have significantly advanced photo-realistic 3D scene rendering in various academic and industrial applications.

Neural Rendering

528

Paper
Code

Cinematic Behavior Transfer via NeRF-based Differentiable Filming

no code implementations • 29 Nov 2023 • Xuekun Jiang, Anyi Rao, Jingbo Wang, Dahua Lin, Bo Dai

In the evolving landscape of digital media and video production, the precise manipulation and reproduction of visual elements like camera movements and character actions are highly desired.

Pose Estimation

Paper
Add Code

SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models

1 code implementation • 28 Nov 2023 • Yuwei Guo, Ceyuan Yang, Anyi Rao, Maneesh Agrawala, Dahua Lin, Bo Dai

The development of text-to-video (T2V), i. e., generating videos with a given text prompt, has been significantly advanced in recent years.

Video Generation

8,961

Paper
Code

InterControl: Generate Human Motion Interactions by Controlling Every Joint

1 code implementation • 27 Nov 2023 • Zhenzhi Wang, Jingbo Wang, Yixuan Li, Dahua Lin, Bo Dai

Furthermore, we demonstrate that the distance between joint pairs for human-wise interactions can be generated using an off-the-shelf Large Language Model (LLM).

Language Modelling Large Language Model +1

Paper
Code

Point Cloud Pre-training with Diffusion Models

no code implementations • 25 Nov 2023 • Xiao Zheng, Xiaoshui Huang, Guofeng Mei, Yuenan Hou, Zhaoyang Lyu, Bo Dai, Wanli Ouyang, Yongshun Gong

This generator aggregates the features extracted by the backbone and employs them as the condition to guide the point-to-point recovery from the noisy point cloud, thereby assisting the backbone in capturing both local and global geometric priors as well as the global point density distribution of the object.

Point Cloud Pre-training

Paper
Add Code

Efficient Reinforcement Learning from Partial Observability

no code implementations • 20 Nov 2023 • Hongming Zhang, Tongzheng Ren, Chenjun Xiao, Dale Schuurmans, Bo Dai

In most real-world reinforcement learning applications, state information is only partially observable, which breaks the Markov decision process assumption and leads to inferior performance for algorithms that conflate observations with state.

Partially Observable Reinforcement Learning reinforcement-learning

Paper
Add Code

On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval

no code implementations • 1 Nov 2023 • Jiayi Chen, Hanjun Dai, Bo Dai, Aidong Zhang, Wei Wei

However, prior works for Few-shot VDER mainly address the problem at the document level with a predefined global entity space, which doesn't account for the entity-level few-shot scenario: target entity types are locally personalized by each task and entity occurrences vary significantly among documents.

Contrastive Learning Entity Retrieval +2

Paper
Add Code

MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond

no code implementations • ICCV 2023 • Yixuan Li, Lihan Jiang, Linning Xu, Yuanbo Xiangli, Zhenzhi Wang, Dahua Lin, Bo Dai

While most of recent neural rendering works focus on objects and small-scale scenes, developing neural rendering methods for city-scale scenes is of great potential in many real-world applications.

Neural Rendering

Paper
Add Code

OrthoPlanes: A Novel Representation for Better 3D-Awareness of GANs

1 code implementation • ICCV 2023 • Honglin He, Zhuoqian Yang, Shikai Li, Bo Dai, Wayne Wu

We present a new method for generating realistic and view-consistent images with fine geometry from 2D image collections.

225

Paper
Code

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

2 code implementations • 26 Sep 2023 • Yaohui Wang, Xinyuan Chen, Xin Ma, Shangchen Zhou, Ziqi Huang, Yi Wang, Ceyuan Yang, Yinan He, Jiashuo Yu, Peiqing Yang, Yuwei Guo, Tianxing Wu, Chenyang Si, Yuming Jiang, Cunjian Chen, Chen Change Loy, Bo Dai, Dahua Lin, Yu Qiao, Ziwei Liu

To this end, we propose LaVie, an integrated video generation framework that operates on cascaded video latent diffusion models, comprising a base T2V model, a temporal interpolation model, and a video super-resolution model.

Ranked #4 on Text-to-Video Generation on EvalCrafter Text-to-Video (ECTV) Dataset (using extra training data)

Text-to-Video Generation Video Generation +1

738

Paper
Code

Interpret Vision Transformers as ConvNets with Dynamic Convolutions

no code implementations • 19 Sep 2023 • Chong Zhou, Chen Change Loy, Bo Dai

There has been a debate about the superiority between vision Transformers and ConvNets, serving as the backbone of computer vision models.

Paper
Add Code

Unified Human-Scene Interaction via Prompted Chain-of-Contacts

1 code implementation • 14 Sep 2023 • Zeqi Xiao, Tai Wang, Jingbo Wang, Jinkun Cao, Wenwei Zhang, Bo Dai, Dahua Lin, Jiangmiao Pang

Based on the definition, UniHSI constitutes a Large Language Model (LLM) Planner to translate language prompts into task plans in the form of CoC, and a Unified Controller that turns CoC into uniform task execution.

Language Modelling Large Language Model

124

Paper
Code

DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

1 code implementation • 29 Aug 2023 • Xinqi Lin, Jingwen He, Ziyan Chen, Zhaoyang Lyu, Bo Dai, Fanghua Yu, Wanli Ouyang, Yu Qiao, Chao Dong

We present DiffBIR, a general restoration pipeline that could handle different blind image restoration tasks in a unified framework.

Ranked #1 on Blind Face Restoration on LFW

Blind Face Restoration Image Denoising +2

3,050

Paper
Code

X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events

1 code implementation • ICCV 2023 • Bo Dai, Linge Wang, Baoxiong Jia, Zeyu Zhang, Song-Chun Zhu, Chi Zhang, Yixin Zhu

Intuitive physics is pivotal for human understanding of the physical world, enabling prediction and interpretation of events even in infancy.

Paper
Code

MATLABER: Material-Aware Text-to-3D via LAtent BRDF auto-EncodeR

no code implementations • 18 Aug 2023 • Xudong Xu, Zhaoyang Lyu, Xingang Pan, Bo Dai

In this work, we propose Material-Aware Text-to-3D via LAtent BRDF auto-EncodeR (\textbf{MATLABER}) that leverages a novel latent BRDF auto-encoder for material generation.

3D Generation Text to 3D

Paper
Add Code

DF2: Distribution-Free Decision-Focused Learning

no code implementations • 11 Aug 2023 • Lingkai Kong, Wenhao Mu, Jiaming Cui, Yuchen Zhuang, B. Aditya Prakash, Bo Dai, Chao Zhang

However, existing end-to-end DFL methods are hindered by three significant bottlenecks: model mismatch error, sample average approximation error, and gradient approximation error.

Paper
Add Code

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering

1 code implementation • ICCV 2023 • Wei Cheng, Ruixiang Chen, Wanqi Yin, Siming Fan, Keyu Chen, Honglin He, Huiwen Luo, Zhongang Cai, Jingbo Wang, Yang Gao, Zhengming Yu, Zhengyu Lin, Daxuan Ren, Lei Yang, Ziwei Liu, Chen Change Loy, Chen Qian, Wayne Wu, Dahua Lin, Bo Dai, Kwan-Yee Lin

Realistic human-centric rendering plays a key role in both computer vision and computer graphics.

Camera Calibration Novel View Synthesis

201

Paper
Code

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

4 code implementations • 10 Jul 2023 • Yuwei Guo, Ceyuan Yang, Anyi Rao, Zhengyang Liang, Yaohui Wang, Yu Qiao, Maneesh Agrawala, Dahua Lin, Bo Dai

Once trained, the motion module can be inserted into a personalized T2I model to form a personalized animation generator.

Image Animation

8,961

Paper
Code

HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and Regime-Switch VAE

no code implementations • 5 Jun 2023 • Zikai Wei, Anyi Rao, Bo Dai, Dahua Lin

Factor model is a fundamental investment tool in quantitative investment, which can be empowered by deep learning to become more flexible and efficient in practical complicated investing situations.

Open-Ended Question Answering Stock Prediction

Paper
Add Code

Probabilistic Adaptation of Text-to-Video Models

no code implementations • 2 Jun 2023 • Mengjiao Yang, Yilun Du, Bo Dai, Dale Schuurmans, Joshua B. Tenenbaum, Pieter Abbeel

Large text-to-video models trained on internet-scale data have demonstrated exceptional capabilities in generating high-fidelity videos from arbitrary textual descriptions.

Language Modelling Large Language Model

Paper
Add Code

Controllable Motion Diffusion Model

no code implementations • 1 Jun 2023 • Yi Shi, Jingbo Wang, Xuekun Jiang, Bo Dai

To enable real-time motion synthesis with diffusion models in response to time-varying control signals, we propose the framework of the Controllable Motion Diffusion Model (COMODO).

Image Generation Motion Synthesis

Paper
Add Code

AdaPlanner: Adaptive Planning from Feedback with Language Models

1 code implementation • NeurIPS 2023 • Haotian Sun, Yuchen Zhuang, Lingkai Kong, Bo Dai, Chao Zhang

We propose a closed-loop approach, AdaPlanner, which allows the LLM agent to refine its self-generated plan adaptively in response to environmental feedback.

Decision Making Hallucination

Paper
Code

E2EAI: End-to-End Deep Learning Framework for Active Investing

no code implementations • 25 May 2023 • Zikai Wei, Bo Dai, Dahua Lin

Active investing aims to construct a portfolio of assets that are believed to be relatively profitable in the markets, with one popular method being to construct a portfolio via factor-based strategies.

Paper
Add Code

RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars

1 code implementation • NeurIPS 2023 • Dongwei Pan, Long Zhuo, Jingtan Piao, Huiwen Luo, Wei Cheng, Yuxin Wang, Siming Fan, Shengqi Liu, Lei Yang, Bo Dai, Ziwei Liu, Chen Change Loy, Chen Qian, Wayne Wu, Dahua Lin, Kwan-Yee Lin

It is a large-scale digital library for head avatars with three key attributes: 1) High Fidelity: all subjects are captured by 60 synchronized, high-resolution 2K cameras in 360 degrees.

2k Image Matting +2

219

Paper
Code

Towards Multi-Layered 3D Garments Animation

no code implementations • ICCV 2023 • Yidi Shao, Chen Change Loy, Bo Dai

In this paper, we propose a novel data-driven method, called LayersNet, to model garment-level animations as particle-wise interactions in a micro physics system.

Paper
Add Code

LEO: Generative Latent Image Animator for Human Video Synthesis

5 code implementations • 6 May 2023 • Yaohui Wang, Xin Ma, Xinyuan Chen, Antitza Dantcheva, Bo Dai, Yu Qiao

Our key idea is to represent motion as a sequence of flow maps in the generation process, which inherently isolate motion from appearance.

Disentanglement Video Editing

Paper
Code

HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks

no code implementations • 19 Apr 2023 • Zhuo Chen, Xudong Xu, Yichao Yan, Ye Pan, Wenhan Zhu, Wayne Wu, Bo Dai, Xiaokang Yang

While the use of 3D-aware GANs bypasses the requirement of 3D data, we further alleviate the necessity of style images with the CLIP model being the stylization guidance.

Attribute

Paper
Add Code

Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding

no code implementations • 8 Apr 2023 • Tongzheng Ren, Zhaolin Ren, Haitong Ma, Na Li, Bo Dai

This paper presents an approach, Spectral Dynamics Embedding Control (SDEC), to optimal control for nonlinear stochastic systems.

Paper
Add Code

Generative Diffusion Prior for Unified Image Restoration and Enhancement

no code implementations • CVPR 2023 • Ben Fei, Zhaoyang Lyu, Liang Pan, Junzhe Zhang, Weidong Yang, Tianyue Luo, Bo Zhang, Bo Dai

Besides, we devise hierarchical guidance and patch-based methods, enabling the GDP to generate images of arbitrary resolutions.

Colorization Deblurring +3

Paper
Add Code

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling

1 code implementation • ICCV 2023 • Zhitao Yang, Zhongang Cai, Haiyi Mei, Shuai Liu, Zhaoxi Chen, Weiye Xiao, Yukun Wei, Zhongfei Qing, Chen Wei, Bo Dai, Wayne Wu, Chen Qian, Dahua Lin, Ziwei Liu, Lei Yang

Synthetic data has emerged as a promising source for 3D human research as it offers low-cost access to large-scale human datasets.

Human Mesh Recovery Neural Rendering

175

Paper
Code

Grid-guided Neural Radiance Fields for Large Urban Scenes

no code implementations • CVPR 2023 • Linning Xu, Yuanbo Xiangli, Sida Peng, Xingang Pan, Nanxuan Zhao, Christian Theobalt, Bo Dai, Dahua Lin

An alternative solution is to use a feature grid representation, which is computationally efficient and can naturally scale to a large scene with increased grid resolutions.

Paper
Add Code

AssetField: Assets Mining and Reconfiguration in Ground Feature Plane Representation

no code implementations • ICCV 2023 • Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Bo Dai, Dahua Lin

Traditional modeling pipelines keep an asset library storing unique object templates, which is both versatile and memory efficient in practice.

Novel View Synthesis Object

Paper
Add Code

Task-Oriented Human-Object Interactions Generation with Implicit Neural Representations

no code implementations • 23 Mar 2023 • Quanzhou Li, Jingbo Wang, Chen Change Loy, Bo Dai

Generating task-oriented human-object interaction motions in simulation is challenging.

Human-Object Interaction Detection Motion Estimation +2

Paper
Add Code

3D Data Augmentation for Driving Scenes on Camera

no code implementations • 18 Mar 2023 • Wenwen Tong, Jiangwei Xie, Tianyu Li, Hanming Deng, Xiangwei Geng, Ruoyi Zhou, Dingchen Yang, Bo Dai, Lewei Lu, Hongyang Li

The proposed data augmentation approach contributes to a gain of 1. 7% and 1. 4% in terms of detection accuracy, on Waymo and nuScences respectively.

Autonomous Driving Data Augmentation +1

Paper
Add Code

Controllable Mesh Generation Through Sparse Latent Point Diffusion Models

no code implementations • CVPR 2023 • Zhaoyang Lyu, Jinyi Wang, Yuwei An, Ya zhang, Dahua Lin, Bo Dai

In this work, we design a novel sparse latent point diffusion model for mesh generation.

Paper
Add Code

Prototype-based Embedding Network for Scene Graph Generation

1 code implementation • CVPR 2023 • Chaofan Zheng, Xinyu Lyu, Lianli Gao, Bo Dai, Jingkuan Song

Current Scene Graph Generation (SGG) methods explore contextual information to predict relationships among entity pairs.

Graph Generation Relation +1

Paper
Code

Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production

no code implementations • 30 Jan 2023 • Anyi Rao, Xuekun Jiang, Yuwei Guo, Linning Xu, Lei Yang, Libiao Jin, Dahua Lin, Bo Dai

Amateurs working on mini-films and short-form videos usually spend lots of time and effort on the multi-round complicated process of setting and adjusting scenes, plots, and cameras to deliver satisfying video shots.

Paper
Add Code

The Role of Baselines in Policy Gradient Optimization

no code implementations • 16 Jan 2023 • Jincheng Mei, Wesley Chung, Valentin Thomas, Bo Dai, Csaba Szepesvari, Dale Schuurmans

Instead, the analysis reveals that the primary effect of the value baseline is to \textbf{reduce the aggressiveness of the updates} rather than their variance.

Paper
Add Code

LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis

no code implementations • ICCV 2023 • Jiapeng Zhu, Ceyuan Yang, Yujun Shen, Zifan Shi, Bo Dai, Deli Zhao, Qifeng Chen

This work presents an easy-to-use regularizer for GAN training, which helps explicitly link some axes of the latent space to a set of pixels in the synthesized image.

Image Generation

Paper
Add Code

Correspondence Distillation from NeRF-based GAN

no code implementations • 19 Dec 2022 • Yushi Lan, Chen Change Loy, Bo Dai

The neural radiance field (NeRF) has shown promising results in preserving the fine details of objects and scenes.

Paper
Add Code

Latent Variable Representation for Reinforcement Learning

no code implementations • 17 Dec 2022 • Tongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai

Theoretically, we establish the sample complexity of the proposed approach in the online and offline settings.

Model-based Reinforcement Learning reinforcement-learning +1

Paper
Add Code

3DHumanGAN: 3D-Aware Human Image Generation with 3D Pose Mapping

1 code implementation • ICCV 2023 • Zhuoqian Yang, Shikai Li, Wayne Wu, Bo Dai

We present 3DHumanGAN, a 3D-aware generative adversarial network that synthesizes photorealistic images of full-body humans with consistent appearances under different view-angles and body-poses.

Generative Adversarial Network Image Generation

225

Paper
Code

Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion

no code implementations • CVPR 2023 • Yushi Lan, Xuyi Meng, Shuai Yang, Chen Change Loy, Bo Dai

In this paper, we study the challenging problem of 3D GAN inversion where a latent code is predicted given a single face image to faithfully recover its 3D shapes and detailed textures.

3D Face Reconstruction

Paper
Add Code

Score-based Continuous-time Discrete Diffusion Models

no code implementations • 30 Nov 2022 • Haoran Sun, Lijun Yu, Bo Dai, Dale Schuurmans, Hanjun Dai

Score-based modeling through stochastic differential equations (SDEs) has provided a new perspective on diffusion models, and demonstrated superior performance on continuous data.

Paper
Add Code

Learning to Optimize with Stochastic Dominance Constraints

no code implementations • 14 Nov 2022 • Hanjun Dai, Yuan Xue, Niao He, Bethany Wang, Na Li, Dale Schuurmans, Bo Dai

In real-world decision-making, uncertainty is important yet difficult to handle.

Decision Making Management

Paper
Add Code

Oracle Inequalities for Model Selection in Offline Reinforcement Learning

no code implementations • 3 Nov 2022 • Jonathan N. Lee, George Tucker, Ofir Nachum, Bo Dai, Emma Brunskill

We propose the first model selection algorithm for offline RL that achieves minimax rate-optimal oracle inequalities up to logarithmic factors.

Model Selection Offline RL +2

Paper
Add Code

Factor Investing with a Deep Multi-Factor Model

no code implementations • 22 Oct 2022 • Zikai Wei, Bo Dai, Dahua Lin

Modeling and characterizing multiple factors is perhaps the most important step in achieving excess returns over market benchmarks.

Graph Attention Management +1

Paper
Add Code

Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows

no code implementations • 17 Oct 2022 • Anyi Rao, Xuekun Jiang, Sichen Wang, Yuwei Guo, Zihao Liu, Bo Dai, Long Pang, Xiaoyu Wu, Dahua Lin, Libiao Jin

The ability to choose an appropriate camera view among multiple cameras plays a vital role in TV shows delivery.

Paper
Add Code

Rethinking Trajectory Prediction via "Team Game"

no code implementations • 17 Oct 2022 • Zikai Wei, Xinge Zhu, Bo Dai, Dahua Lin

To accurately predict trajectories in multi-agent settings, e. g. team games, it is important to effectively model the interactions among agents.

Trajectory Prediction

Paper
Add Code

Improving GANs with A Dynamic Discriminator

no code implementations • 20 Sep 2022 • Ceyuan Yang, Yujun Shen, Yinghao Xu, Deli Zhao, Bo Dai, Bolei Zhou

Two capacity adjusting schemes are developed for training GANs under different data regimes: i) given a sufficient amount of training data, the discriminator benefits from a progressively increased learning capacity, and ii) when the training data is limited, gradually decreasing the layer width mitigates the over-fitting issue of the discriminator.

3D-Aware Image Synthesis Data Augmentation

Paper
Add Code

Spectral Decomposition Representation for Reinforcement Learning

no code implementations • 19 Aug 2022 • Tongzheng Ren, Tianjun Zhang, Lisa Lee, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai

Representation learning often plays a critical role in reinforcement learning by managing the curse of dimensionality.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Transformer with Implicit Edges for Particle-based Physics Simulation

1 code implementation • 22 Jul 2022 • Yidi Shao, Chen Change Loy, Bo Dai

Consequently, in this paper we propose a novel Transformer-based method, dubbed as Transformer with Implicit Edges (TIE), to capture the rich semantics of particle interactions in an edge-free manner.

Paper
Code

Monocular 3D Object Reconstruction with GAN Inversion

1 code implementation • 20 Jul 2022 • Junzhe Zhang, Daxuan Ren, Zhongang Cai, Chai Kiat Yeo, Bo Dai, Chen Change Loy

Reconstruction is achieved by searching for a latent space in the 3D GAN that best resembles the target mesh in accordance with the single view observation.

3D Object Reconstruction Object

Paper
Code

BRACE: The Breakdancing Competition Dataset for Dance Motion Synthesis

1 code implementation • 20 Jul 2022 • Davide Moltisanti, Jinyi Wu, Bo Dai, Chen Change Loy

Estimating human keypoints from these videos is difficult due to the complexity of the dance, as well as the multiple moving cameras recording setup.

Motion Synthesis Pose Estimation

Paper
Code

Making Linear MDPs Practical via Contrastive Representation Learning

no code implementations • 14 Jul 2022 • Tianjun Zhang, Tongzheng Ren, Mengjiao Yang, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai

It is common to address the curse of dimensionality in Markov decision processes (MDPs) by exploiting low-rank representations.

Representation Learning

Paper
Add Code

Discrete Langevin Sampler via Wasserstein Gradient Flow

no code implementations • 29 Jun 2022 • Haoran Sun, Hanjun Dai, Bo Dai, Haomin Zhou, Dale Schuurmans

It is known that gradient-based MCMC samplers for continuous spaces, such as Langevin Monte Carlo (LMC), can be derived as particle versions of a gradient flow that minimizes KL divergence on a Wasserstein manifold.

Paper
Add Code

Guided Diffusion Model for Adversarial Purification

2 code implementations • 30 May 2022 • Jinyi Wang, Zhaoyang Lyu, Dahua Lin, Bo Dai, Hongfei Fu

In this paper, we propose a novel purification approach, referred to as guided diffusion model for purification (GDMP), to help protect classifiers from adversarial attacks.

Denoising

Paper
Code

Accelerating Diffusion Models via Early Stop of the Diffusion Process

1 code implementation • 25 May 2022 • Zhaoyang Lyu, Xudong Xu, Ceyuan Yang, Dahua Lin, Bo Dai

By modeling the reverse process of gradually diffusing the data distribution into a Gaussian distribution, generating a sample in DDPMs can be regarded as iteratively denoising a randomly sampled Gaussian noise.

Denoising Image Generation

Paper
Code

Towards Diverse and Natural Scene-aware 3D Human Motion Synthesis

no code implementations • CVPR 2022 • Jingbo Wang, Yu Rong, Jingyuan Liu, Sijie Yan, Dahua Lin, Bo Dai

The ability to synthesize long-term human motion sequences in real-world scenes can facilitate numerous applications.

Motion Synthesis

Paper
Add Code

TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

1 code implementation • CVPR 2022 • Yanbo Xu, Yueqin Yin, Liming Jiang, Qianyi Wu, Chengyao Zheng, Chen Change Loy, Bo Dai, Wayne Wu

In this study, we highlight the importance of interaction in a dual-space GAN for more controllable editing.

Attribute Disentanglement +1

173

Paper
Code

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

1 code implementation • CVPR 2022 • Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, Bolei Zhou

To enhance the quality of synthesized gestures, we develop a contrastive learning strategy based on audio-text alignment for better audio representations.

Ranked #3 on Gesture Generation on TED Gesture Dataset

Contrastive Learning Gesture Generation

118

Paper
Code

DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation

1 code implementation • 16 Mar 2022 • Ailing Zeng, Xuan Ju, Lei Yang, Ruiyuan Gao, Xizhou Zhu, Bo Dai, Qiang Xu

This paper proposes a simple baseline framework for video-based 2D/3D human pose estimation that can achieve 10 times efficiency improvement over existing works without any performance degradation, named DeciWatch.

Ranked #1 on 2D Human Pose Estimation on JHMDB (2D poses only)

2D Human Pose Estimation 3D Human Pose Estimation +2

169

Paper
Code

SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition

no code implementations • 10 Feb 2022 • Dylan Slack, Yinlam Chow, Bo Dai, Nevan Wichers

However, we identify these techniques are not well equipped for safe policy learning because they ignore negative experiences(e. g., unsafe or unsuccessful), focusing only on positive experiences, which harms their ability to generalize to new tasks safely.

reinforcement-learning Reinforcement Learning (RL) +2

Paper
Add Code

Model Selection in Batch Policy Optimization

no code implementations • 23 Dec 2021 • Jonathan N. Lee, George Tucker, Ofir Nachum, Bo Dai

We formalize the problem in the contextual bandit setting with linear model classes by identifying three sources of error that any model selection algorithm should optimally trade-off in order to be competitive: (1) approximation error, (2) statistical complexity, and (3) coverage.

Model Selection

Paper
Add Code

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

no code implementations • CVPR 2022 • Yinghao Xu, Fangyun Wei, Xiao Sun, Ceyuan Yang, Yujun Shen, Bo Dai, Bolei Zhou, Stephen Lin

Typically in recent work, the pseudo-labels are obtained by training a model on the labeled data, and then using confident predictions from the model to teach itself.

Action Recognition

Paper
Add Code

BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering

no code implementations • 10 Dec 2021 • Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Anyi Rao, Christian Theobalt, Bo Dai, Dahua Lin

The wide span of viewing positions within these scenes yields multi-scale renderings with very different levels of detail, which poses great challenges to neural radiance field and biases it towards compromised results.

Paper
Add Code

Extract Free Dense Labels from CLIP

1 code implementation • 2 Dec 2021 • Chong Zhou, Chen Change Loy, Bo Dai

Contrastive Language-Image Pre-training (CLIP) has made a remarkable breakthrough in open-vocabulary zero-shot image recognition.

Ranked #3 on Unsupervised Semantic Segmentation with Language-image Pre-training on KITTI-STEP

Novel Concepts Open Vocabulary Panoptic Segmentation +5

363

Paper
Code

Towards understanding retrosynthesis by energy-based models

no code implementations • NeurIPS 2021 • Ruoxi Sun, Hanjun Dai, Li Li, Steven Kearnes, Bo Dai

In this paper, we propose a framework that unifies sequence- and graph-based methods as energy-based models (EBMs) with different energy functions.

Drug Discovery Retrosynthesis

Paper
Add Code

Neural Stochastic Dual Dynamic Programming

no code implementations • ICLR 2022 • Hanjun Dai, Yuan Xue, Zia Syed, Dale Schuurmans, Bo Dai

Stochastic dual dynamic programming (SDDP) is a state-of-the-art method for solving multi-stage stochastic optimization, widely used for modeling real-world process optimization tasks.

Stochastic Optimization

Paper
Add Code

A Free Lunch from the Noise: Provable and Practical Exploration for Representation Learning

no code implementations • 22 Nov 2021 • Tongzheng Ren, Tianjun Zhang, Csaba Szepesvári, Bo Dai

Representation learning lies at the heart of the empirical success of deep learning for dealing with the curse of dimensionality.

Reinforcement Learning (RL) Representation Learning

Paper
Add Code

Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data

2 code implementations • NeurIPS 2021 • Liming Jiang, Bo Dai, Wayne Wu, Chen Change Loy

Generative adversarial networks (GANs) typically require ample data for training in order to synthesize high-fidelity images.

251

Paper
Code

Generative Occupancy Fields for 3D Surface-Aware Image Synthesis

1 code implementation • NeurIPS 2021 • Xudong Xu, Xingang Pan, Dahua Lin, Bo Dai

In this paper, we propose Generative Occupancy Fields (GOF), a novel model based on generative radiance fields that can learn compact object surfaces without impeding its training convergence.

3D-Aware Image Synthesis Object

103

Paper
Code

Understanding the Effect of Stochasticity in Policy Optimization

no code implementations • NeurIPS 2021 • Jincheng Mei, Bo Dai, Chenjun Xiao, Csaba Szepesvari, Dale Schuurmans

We study the effect of stochasticity in on-policy policy optimization, and make the following four contributions.

Paper
Add Code

A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis

1 code implementation • NeurIPS 2021 • Xingang Pan, Xudong Xu, Chen Change Loy, Christian Theobalt, Bo Dai

Motivated by the observation that a 3D object should look realistic from multiple viewpoints, these methods introduce a multi-view constraint as regularization to learn valid 3D radiance fields from 2D images.

3D-Aware Image Synthesis 3D Shape Reconstruction +2

146

Paper
Code

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs

1 code implementation • 28 Oct 2021 • Hongyu Ren, Hanjun Dai, Bo Dai, Xinyun Chen, Denny Zhou, Jure Leskovec, Dale Schuurmans

There are two important reasoning tasks on KGs: (1) single-hop knowledge graph completion, which involves predicting individual links in the KG; and (2), multi-hop reasoning, where the goal is to predict which KG entities satisfy a given logical query.

Scheduling

159

Paper
Code

Understanding and Leveraging Overparameterization in Recursive Value Estimation

no code implementations • ICLR 2022 • Chenjun Xiao, Bo Dai, Jincheng Mei, Oscar A Ramirez, Ramki Gummadi, Chris Harris, Dale Schuurmans

To better understand the utility of deep models in RL we present an analysis of recursive value estimation using overparameterized linear representations that provides useful, transferable findings.

Reinforcement Learning (RL) Value prediction

Paper
Add Code

SAFER: Data-Efficient and Safe Reinforcement Learning Through Skill Acquisition

no code implementations • 29 Sep 2021 • Dylan Z Slack, Yinlam Chow, Bo Dai, Nevan Wichers

Though many reinforcement learning (RL) problems involve learning policies in settings that are difficult to specify safety constraints and sparse rewards, current methods struggle to rapidly and safely acquire successful policies.

reinforcement-learning Reinforcement Learning (RL) +2

Paper
Add Code

SiT: Simulation Transformer for Particle-based Physics Simulation

no code implementations • 29 Sep 2021 • Yidi Shao, Chen Change Loy, Bo Dai

However, they force particles to interact with all neighbors without selection, and they fall short in capturing material semantics for different particles, leading to unsatisfactory performance, especially in generalization.

Paper
Add Code

MeshInversion: 3D textured mesh reconstruction with generative prior

no code implementations • 29 Sep 2021 • Junzhe Zhang, Daxuan Ren, Zhongang Cai, Chai Kiat Yeo, Bo Dai, Chen Change Loy

Reconstruction is achieved by searching for a latent space in the 3D GAN that best resembles the target mesh in accordance with the single view observation.

Paper
Add Code

Combiner: Full Attention Transformer with Sparse Computation Cost

2 code implementations • NeurIPS 2021 • Hongyu Ren, Hanjun Dai, Zihang Dai, Mengjiao Yang, Jure Leskovec, Dale Schuurmans, Bo Dai

However, the key limitation of transformers is their quadratic memory and time complexity $\mathcal{O}(L^2)$ with respect to the sequence length in attention layers, which restricts application in extremely long sequences.

Ranked #2 on Language Modelling on Wiki-40B

Image Generation Language Modelling

32,966

Paper
Code

Safe Exploration by Solving Early Terminated MDP

no code implementations • 9 Jul 2021 • Hao Sun, Ziping Xu, Meng Fang, Zhenghao Peng, Jiadong Guo, Bo Dai, Bolei Zhou

Safe exploration is crucial for the real-world application of reinforcement learning (RL).

Reinforcement Learning (RL) Safe Exploration

Paper
Add Code

The Curse of Passive Data Collection in Batch Reinforcement Learning

no code implementations • 18 Jun 2021 • Chenjun Xiao, Ilbin Lee, Bo Dai, Dale Schuurmans, Csaba Szepesvari

In high stake applications, active experimentation may be considered too risky and thus data are often collected passively.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Optimization Variance: Exploring Generalization Properties of DNNs

1 code implementation • 3 Jun 2021 • Xiao Zhang, Dongrui Wu, Haoyi Xiong, Bo Dai

Unlike the conventional wisdom in statistical learning theory, the test error of a deep neural network (DNN) often demonstrates double descent: as the model complexity increases, it first follows a classical U-shaped curve and then shows a second descent.

Learning Theory

Paper
Code

Scene-aware Generative Network for Human Motion Synthesis

no code implementations • CVPR 2021 • Jingbo Wang, Sijie Yan, Bo Dai, Dahua Lin

We revisit human motion synthesis, a task useful in various real world applications, in this paper.

Motion Synthesis

Paper
Add Code

Leveraging Non-uniformity in First-order Non-convex Optimization

no code implementations • 13 May 2021 • Jincheng Mei, Yue Gao, Bo Dai, Csaba Szepesvari, Dale Schuurmans

Classical global convergence results for first-order methods rely on uniform smoothness and the \L{}ojasiewicz inequality.

BIG-bench Machine Learning

Paper
Add Code

Revisiting Skeleton-based Action Recognition

4 code implementations • CVPR 2022 • Haodong Duan, Yue Zhao, Kai Chen, Dahua Lin, Bo Dai

In this work, we propose PoseC3D, a new approach to skeleton-based action recognition, which relies on a 3D heatmap stack instead of a graph sequence as the base representation of human skeletons.

Ranked #1 on Action Recognition on NTU RGB+D 120

Group Activity Recognition Pose Estimation +1

3,926

Paper
Code

Unsupervised 3D Shape Completion through GAN Inversion

no code implementations • CVPR 2021 • Junzhe Zhang, Xinyi Chen, Zhongang Cai, Liang Pan, Haiyu Zhao, Shuai Yi, Chai Kiat Yeo, Bo Dai, Chen Change Loy

In contrast to previous fully supervised approaches, in this paper we present ShapeInversion, which introduces Generative Adversarial Network (GAN) inversion to shape completion for the first time.

Generative Adversarial Network valid

Paper
Add Code

Visually Informed Binaural Audio Generation without Binaural Audios

no code implementations • CVPR 2021 • Xudong Xu, Hang Zhou, Ziwei Liu, Bo Dai, Xiaogang Wang, Dahua Lin

Moreover, combined with binaural recordings, our method is able to further boost the performance of binaural audio generation under supervised settings.

Audio Generation

Paper
Add Code

On the Optimality of Batch Policy Optimization Algorithms

no code implementations • 6 Apr 2021 • Chenjun Xiao, Yifan Wu, Tor Lattimore, Bo Dai, Jincheng Mei, Lihong Li, Csaba Szepesvari, Dale Schuurmans

First, we introduce a class of confidence-adjusted index algorithms that unifies optimistic and pessimistic principles in a common framework, which enables a general analysis.

Value prediction

Paper
Add Code

Nearly Horizon-Free Offline Reinforcement Learning

no code implementations • NeurIPS 2021 • Tongzheng Ren, Jialian Li, Bo Dai, Simon S. Du, Sujay Sanghavi

To the best of our knowledge, these are the \emph{first} set of nearly horizon-free bounds for episodic time-homogeneous offline tabular MDP and linear MDP with anchor points.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

DeepStyle: User Style Embedding for Authorship Attribution of Short Texts

no code implementations • 14 Mar 2021 • Zhiqiang Hu, Roy Ka-Wei Lee, Lei Wang, Ee-Peng Lim, Bo Dai

Authorship attribution (AA), which is the task of finding the owner of a given text, is an important and widely studied research topic with many applications.

Authorship Attribution text-classification +1

Paper
Add Code

Off-Policy Imitation Learning from Observations

1 code implementation • NeurIPS 2020 • Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

To further accelerate the learning procedure, we regulate the policy update with an inverse action model, which assists distribution matching from the perspective of mode-covering.

Imitation Learning

Paper
Code

Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach

1 code implementation • EMNLP 2021 • Haoming Jiang, Bo Dai, Mengjiao Yang, Tuo Zhao, Wei Wei

An ideal environment for evaluating dialog systems, also known as the Turing test, needs to involve human interaction, which is usually not affordable for large-scale experiments.

Model-based Reinforcement Learning Off-policy evaluation +2

32,966

Paper
Code

Self-Supervised Continuous Control without Policy Gradient

no code implementations • 1 Jan 2021 • Hao Sun, Ziping Xu, Meng Fang, Yuhang Song, Jiechao Xiong, Bo Dai, Zhengyou Zhang, Bolei Zhou

Despite the remarkable progress made by the policy gradient algorithms in reinforcement learning (RL), sub-optimal policies usually result from the local exploration property of the policy gradient update.

Continuous Control Policy Gradient Methods +3

Paper
Add Code

BlockPlanner: City Block Generation With Vectorized Graph Representation

no code implementations • ICCV 2021 • Linning Xu, Yuanbo Xiangli, Anyi Rao, Nanxuan Zhao, Bo Dai, Ziwei Liu, Dahua Lin

City modeling is the foundation for computational urban planning, navigation, and entertainment.

valid

Paper
Add Code

Slow Control System for PandaX-III experiment

no code implementations • 24 Dec 2020 • Xiyu Yan, Xun Chen, Yu Chen, Bo Dai, Heng Lin, Tao Li, Ke Han, Kaixiang Ni, Fusang Wang, Shaobo Wang, Qibin Zheng, Xinning Zeng

The PandaX-III experiment uses high pressure gaseous time projection chamber to search for the neutrinoless double beta decay of $^{136}$Xe.

Anomaly Detection High Energy Physics - Experiment Instrumentation and Detectors

Paper
Add Code

Focal Frequency Loss for Image Reconstruction and Synthesis

1 code implementation • ICCV 2021 • Liming Jiang, Bo Dai, Wayne Wu, Chen Change Loy

In this study, we show that narrowing gaps in the frequency domain can ameliorate image reconstruction and synthesis quality further.

Ranked #6 on Image-to-Image Translation on Cityscapes Labels-to-Photo

Image Reconstruction Image-to-Image Translation

598

Paper
Code

Offline Policy Selection under Uncertainty

1 code implementation • 12 Dec 2020 • Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans

More importantly, we show how the belief distribution estimated by BayesDICE may be used to rank policies with respect to any arbitrary downstream policy selection metric, and we empirically demonstrate that this selection procedure significantly outperforms existing approaches, such as ranking policies according to mean or high-confidence lower bound value estimates.

Paper
Code

Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach

no code implementations • NeurIPS 2020 • Luofeng Liao, You-Lin Chen, Zhuoran Yang, Bo Dai, Mladen Kolar, Zhaoran Wang

We study estimation in a class of generalized SEMs where the object of interest is defined as the solution to a linear operator equation.

Paper
Add Code

Differentiable Top-k with Optimal Transport

no code implementations • NeurIPS 2020 • Yujia Xie, Hanjun Dai, Minshuo Chen, Bo Dai, Tuo Zhao, Hongyuan Zha, Wei Wei, Tomas Pfister

Finding the k largest or smallest elements from a collection of scores, i. e., top-k operation, is an important model component widely used in information retrieval, machine learning, and data mining.

Information Retrieval Retrieval

Paper
Add Code

Escaping the Gravitational Pull of Softmax

no code implementations • NeurIPS 2020 • Jincheng Mei, Chenjun Xiao, Bo Dai, Lihong Li, Csaba Szepesvari, Dale Schuurmans

Both findings are based on an analysis of convergence rates using the Non-uniform \L{}ojasiewicz (N\L{}) inequalities.

Paper
Add Code

Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration

no code implementations • NeurIPS 2020 • Hanjun Dai, Rishabh Singh, Bo Dai, Charles Sutton, Dale Schuurmans

In this paper we propose ALOE, a new algorithm for learning conditional and unconditional EBMs for discrete structured data, where parameter gradients are estimated using a learned sampler that mimics local search.

Language Modelling

Paper
Add Code

Do 2D GANs Know 3D Shape? Unsupervised 3D shape reconstruction from 2D Image GANs

1 code implementation • ICLR 2021 • Xingang Pan, Bo Dai, Ziwei Liu, Chen Change Loy, Ping Luo

Through our investigation, we found that such a pre-trained GAN indeed contains rich 3D knowledge and thus can be used to recover 3D shape from a single 2D image in an unsupervised manner.

3D Shape Reconstruction Object

570

Paper
Code

Named Entity Recognition for Social Media Texts with Semantic Augmentation

1 code implementation • EMNLP 2020 • Yuyang Nie, Yuanhe Tian, Xiang Wan, Yan Song, Bo Dai

In particular, we obtain the augmented semantic information from a large-scale corpus, and propose an attentive semantic augmentation module and a gate module to encode and aggregate such information, respectively.

Ranked #4 on Named Entity Recognition (NER) on WNUT 2016

Chinese Named Entity Recognition named-entity-recognition +3

Paper
Code

CoinDICE: Off-Policy Confidence Interval Estimation

no code implementations • NeurIPS 2020 • Bo Dai, Ofir Nachum, Yinlam Chow, Lihong Li, Csaba Szepesvári, Dale Schuurmans

We study high-confidence behavior-agnostic off-policy evaluation in reinforcement learning, where the goal is to estimate a confidence interval on a target policy's value, given only access to a static experience dataset collected by unknown behavior policies.

Off-policy evaluation valid

Paper
Add Code

Differentiable Top-$k$ with Optimal Transport

no code implementations • NeurIPS Workshop LMCA 2020 • Yujia Xie, Hanjun Dai, Minshuo Chen, Bo Dai, Tuo Zhao, Hongyuan Zha, Wei Wei, Tomas Pfister

The top-$k$ operation, i. e., finding the $k$ largest or smallest elements from a collection of scores, is an important model component, which is widely used in information retrieval, machine learning, and data mining.

Information Retrieval Retrieval

Paper
Add Code

Small Towers Make Big Differences

no code implementations • 13 Aug 2020 • Yuyan Wang, Zhe Zhao, Bo Dai, Christopher Fifty, Dong Lin, Lichan Hong, Ed H. Chi

A delicate balance between multi-task generalization and multi-objective optimization is therefore needed for finding a better trade-off between efficiency and generalization.

Multi-Task Learning

Paper
Add Code

Energy-based View of Retrosynthesis

no code implementations • 14 Jul 2020 • Ruoxi Sun, Hanjun Dai, Li Li, Steven Kearnes, Bo Dai

Retrosynthesis -- the process of identifying a set of reactants to synthesize a target molecule -- is of vital importance to material design and drug discovery.

Ranked #1 on Single-step retrosynthesis on USPTO-50k

Drug Discovery Retrosynthesis +1

Paper
Add Code

Off-Policy Evaluation via the Regularized Lagrangian

no code implementations • NeurIPS 2020 • Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, Dale Schuurmans

The recently proposed distribution correction estimation (DICE) family of estimators has advanced the state of the art in off-policy evaluation from behavior-agnostic data.

Off-policy evaluation

Paper
Add Code

Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach

no code implementations • 2 Jul 2020 • Luofeng Liao, You-Lin Chen, Zhuoran Yang, Bo Dai, Zhaoran Wang, Mladen Kolar

We study estimation in a class of generalized SEMs where the object of interest is defined as the solution to a linear operator equation.

Paper
Add Code

Unsupervised Landmark Learning from Unpaired Data

1 code implementation • 29 Jun 2020 • Yinghao Xu, Ceyuan Yang, Ziwei Liu, Bo Dai, Bolei Zhou

Recent attempts for unsupervised landmark learning leverage synthesized image pairs that are similar in appearance but different in poses.

Paper
Code

Video Representation Learning with Visual Tempo Consistency

1 code implementation • 28 Jun 2020 • Ceyuan Yang, Yinghao Xu, Bo Dai, Bolei Zhou

Visual tempo, which describes how fast an action goes, has shown its potential in supervised action recognition.

Action Anticipation Action Detection +3

Paper
Code

Scalable Deep Generative Modeling for Sparse Graphs

1 code implementation • ICML 2020 • Hanjun Dai, Azade Nazi, Yujia Li, Bo Dai, Dale Schuurmans

Based on this, we develop a novel autoregressive model, named BiGG, that utilizes this sparsity to avoid generating the full adjacency matrix, and importantly reduces the graph generation time complexity to $O((n + m)\log n)$.

Graph Generation

32,966

Paper
Code

Zeroth-Order Supervised Policy Improvement

no code implementations • 11 Jun 2020 • Hao Sun, Ziping Xu, Yuhang Song, Meng Fang, Jiechao Xiong, Bo Dai, Bolei Zhou

However, PG algorithms rely on exploiting the value function being learned with the first-order update locally, which results in limited sample efficiency.

Continuous Control Policy Gradient Methods +2

Paper
Add Code

Novel Policy Seeking with Constrained Optimization

1 code implementation • 21 May 2020 • Hao Sun, Zhenghao Peng, Bo Dai, Jian Guo, Dahua Lin, Bolei Zhou

In problem-solving, we humans can come up with multiple novel solutions to the same problem.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Intra- and Inter-Action Understanding via Temporal Action Parsing

no code implementations • CVPR 2020 • Dian Shao, Yue Zhao, Bo Dai, Dahua Lin

Current methods for action recognition primarily rely on deep convolutional networks to derive feature embeddings of visual and motion features.

Action Parsing Action Recognition +1

Paper
Add Code

Evolutionary Stochastic Policy Distillation

1 code implementation • 27 Apr 2020 • Hao Sun, Xinyu Pan, Bo Dai, Dahua Lin, Bolei Zhou

Solving the Goal-Conditioned Reward Sparse (GCRS) task is a challenging reinforcement learning problem due to the sparsity of reward signals.

Paper
Code

FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding

no code implementations • CVPR 2020 • Dian Shao, Yue Zhao, Bo Dai, Dahua Lin

To take action recognition to a new level, we develop FineGym, a new dataset built on top of gymnastic videos.

Action Recognition Action Understanding

Paper
Add Code

Temporal Pyramid Network for Action Recognition

3 code implementations • CVPR 2020 • Ceyuan Yang, Yinghao Xu, Jianping Shi, Bo Dai, Bolei Zhou

Previous works often capture the visual tempo through sampling raw videos at multiple rates and constructing an input-level frame pyramid, which usually requires a costly multi-branch network to handle.

Ranked #105 on Action Recognition on Something-Something V2

Action Recognition

3,926

Paper
Code

Self-Supervised Scene De-occlusion

2 code implementations • CVPR 2020 • Xiaohang Zhan, Xingang Pan, Bo Dai, Ziwei Liu, Dahua Lin, Chen Change Loy

This is achieved via Partial Completion Network (PCNet)-mask (M) and -content (C), that learn to recover fractions of object masks and contents, respectively, in a self-supervised manner.

Image Manipulation Scene Understanding

771

Paper
Code

Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations

1 code implementation • 1 Apr 2020 • Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

SAIL bridges the advantages of IL and RL to reduce the sample complexity substantially, by effectively exploiting sup-optimal demonstrations and efficiently exploring the environment to surpass the demonstrated performance.

Continuous Control Imitation Learning +1

Paper
Code

Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation

1 code implementation • ECCV 2020 • Xingang Pan, Xiaohang Zhan, Bo Dai, Dahua Lin, Chen Change Loy, Ping Luo

Learning a good image prior is a long-term goal for image restoration and manipulation.

Generative Adversarial Network Image Manipulation +2

474

Paper
Code

Energy-Based Processes for Exchangeable Data

1 code implementation • ICML 2020 • Mengjiao Yang, Bo Dai, Hanjun Dai, Dale Schuurmans

Recently there has been growing interest in modeling sets with exchangeability such as point clouds.

Denoising Point Cloud Generation

32,966

Paper
Code

Batch Stationary Distribution Estimation

1 code implementation • ICML 2020 • Junfeng Wen, Bo Dai, Lihong Li, Dale Schuurmans

We consider the problem of approximating the stationary distribution of an ergodic Markov chain given a set of sampled transitions.

Off-policy evaluation

Paper
Code

GenDICE: Generalized Offline Estimation of Stationary Values

1 code implementation • ICLR 2020 • Ruiyi Zhang, Bo Dai, Lihong Li, Dale Schuurmans

An important problem that arises in reinforcement learning and Monte Carlo methods is estimating quantities defined by the stationary distribution of a Markov chain.

Paper
Code

Differentiable Top-k Operator with Optimal Transport

no code implementations • 16 Feb 2020 • Yujia Xie, Hanjun Dai, Minshuo Chen, Bo Dai, Tuo Zhao, Hongyuan Zha, Wei Wei, Tomas Pfister

The top-k operation, i. e., finding the k largest or smallest elements from a collection of scores, is an important model component, which is widely used in information retrieval, machine learning, and data mining.

Information Retrieval Retrieval

Paper
Add Code

Real or Not Real, that is the Question

2 code implementations • ICLR 2020 • Yuanbo Xiangli, Yubin Deng, Bo Dai, Chen Change Loy, Dahua Lin

While generative adversarial networks (GAN) have been widely adopted in various topics, in this paper we generalize the standard GAN to a new perspective by treating realness as a random variable that can be estimated from multiple angles.

286

Paper
Code

Reinforcement Learning via Fenchel-Rockafellar Duality

1 code implementation • 7 Jan 2020 • Ofir Nachum, Bo Dai

We review basic concepts of convex duality, focusing on the very general and supremely useful Fenchel-Rockafellar duality.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Retrosynthesis Prediction with Conditional Graph Logic Network

1 code implementation • NeurIPS 2019 • Hanjun Dai, Chengtao Li, Connor W. Coley, Bo Dai, Le Song

Retrosynthesis is one of the fundamental problems in organic chemistry.

Ranked #11 on Single-step retrosynthesis on USPTO-50k

Retrosynthesis Single-step retrosynthesis

109

Paper
Code

AlgaeDICE: Policy Gradient from Arbitrary Experience

no code implementations • 4 Dec 2019 • Ofir Nachum, Bo Dai, Ilya Kostrikov, Yin-Lam Chow, Lihong Li, Dale Schuurmans

In many real-world applications of reinforcement learning (RL), interactions with the environment are limited due to cost or feasibility.

Reinforcement Learning (RL)

Paper
Add Code

Overcoming Catastrophic Forgetting by Generative Regularization

no code implementations • 3 Dec 2019 • Patrick H. Chen, Wei Wei, Cho-Jui Hsieh, Bo Dai

In this paper, we propose a new method to overcome catastrophic forgetting by adding generative regularization to Bayesian inference framework.

Bayesian Inference Continual Learning

Paper
Add Code

Energy-Inspired Models: Learning with Sampler-Induced Distributions

1 code implementation • NeurIPS 2019 • Dieterich Lawson, George Tucker, Bo Dai, Rajesh Ranganath

Motivated by this, we consider the sampler-induced distribution as the model of interest and maximize the likelihood of this model.

Variational Inference

32,977

Paper
Code

Learning with Social Influence through Interior Policy Differentiation

no code implementations • 25 Sep 2019 • Hao Sun, Bo Dai, Jiankai Sun, Zhenghao Peng, Guodong Xu, Dahua Lin, Bolei Zhou

In this work we model the social influence into the scheme of reinforcement learning, enabling the agents to learn both from the environment and from their peers.

Reinforcement Learning (RL)

Paper
Add Code

Recursive Visual Sound Separation Using Minus-Plus Net

1 code implementation • ICCV 2019 • Xudong Xu, Bo Dai, Dahua Lin

Sounds provide rich semantics, complementary to visual data, for many tasks.

Paper
Code

DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections

2 code implementations • NeurIPS 2019 • Ofir Nachum, Yin-Lam Chow, Bo Dai, Lihong Li

In contrast to previous approaches, our algorithm is agnostic to knowledge of the behavior policy (or policies) used to generate the dataset.

32,966

Paper
Code

Cooperative neural networks (CoNN): Exploiting prior independence structure for improved classification

no code implementations • NeurIPS 2018 • Harsh Shrivastava, Eugene Bart, Bob Price, Hanjun Dai, Bo Dai, Srinivas Aluru

We propose a new approach, called cooperative neural networks (CoNN), which uses a set of cooperatively trained neural networks to capture latent representations that exploit prior given independence structure.

General Classification text-classification +1

Paper
Add Code

Exponential Family Estimation via Adversarial Dynamics Embedding

1 code implementation • NeurIPS 2019 • Bo Dai, Zhen Liu, Hanjun Dai, Niao He, Arthur Gretton, Le Song, Dale Schuurmans

We present an efficient algorithm for maximum likelihood estimation (MLE) of exponential family models, with a general parametrization of the energy function that includes neural networks.

Paper
Code

Feature Intertwiner for Object Detection

2 code implementations • ICLR 2019 • Hongyang Li, Bo Dai, Shaoshuai Shi, Wanli Ouyang, Xiaogang Wang

We argue that the reliable set could guide the feature learning of the less reliable set during training - in spirit of student mimicking teacher behavior and thus pushing towards a more compact class centroid in the feature space.

Ranked #145 on Object Detection on COCO test-dev

Object object-detection +1

107

Paper
Code

Revisiting Auxiliary Latent Variables in Generative Models

no code implementations • ICLR Workshop DeepGenStruct 2019 • Dieterich Lawson, George Tucker, Bo Dai, Rajesh Ranganath

The success of enriching the variational family with auxiliary latent variables motivates applying the same techniques to the generative model.

Paper
Add Code

Learning to Defense by Learning to Attack

no code implementations • ICLR Workshop DeepGenStruct 2019 • Zhehui Chen, Haoming Jiang, Yuyang Shi, Bo Dai, Tuo Zhao

From the perspective of generative learning, our proposed method can be viewed as learning a deep generative model for generating adversarial samples, which is adaptive to the robust classification.

Adversarial Attack Robust classification

Paper
Add Code

Learning to Plan in High Dimensions via Neural Exploration-Exploitation Trees

1 code implementation • ICLR 2020 • Binghong Chen, Bo Dai, Qinjie Lin, Guo Ye, Han Liu, Le Song

We propose a meta path planning algorithm named \emph{Neural Exploration-Exploitation Trees~(NEXT)} for learning from prior experience for solving new path planning problems in high dimensional continuous state and action spaces.

Vocal Bursts Intensity Prediction

Paper
Code

Meta Architecture Search

1 code implementation • NeurIPS 2019 • Albert Shaw, Wei Wei, Weiyang Liu, Le Song, Bo Dai

Neural Architecture Search (NAS) has been quite successful in constructing state-of-the-art models on a variety of tasks.

Bayesian Inference Few-Shot Learning +1

Paper
Code

Predictive Approximate Bayesian Computation via Saddle Points

no code implementations • NeurIPS 2018 • Yingxiang Yang, Bo Dai, Negar Kiyavash, Niao He

Approximate Bayesian computation (ABC) is an important methodology for Bayesian inference when the likelihood function is intractable.

Bayesian Inference regression

Paper
Add Code

Coupled Variational Bayes via Optimization Embedding

1 code implementation • NeurIPS 2018 • Bo Dai, Hanjun Dai, Niao He, Weiyang Liu, Zhen Liu, Jianshu Chen, Lin Xiao, Le Song

This flexible function class couples the variational distribution with the original parameters in the graphical models, allowing end-to-end learning of the graphical models by back-propagation through the variational distribution.

Variational Inference

Paper
Code

Kernel Exponential Family Estimation via Doubly Dual Embedding

1 code implementation • 6 Nov 2018 • Bo Dai, Hanjun Dai, Arthur Gretton, Le Song, Dale Schuurmans, Niao He

We investigate penalized maximum log-likelihood estimation for exponential family distributions whose natural parameter resides in a reproducing kernel Hilbert space.

Paper
Code

Learning to Defend by Learning to Attack

no code implementations • 3 Nov 2018 • Haoming Jiang, Zhehui Chen, Yuyang Shi, Bo Dai, Tuo Zhao

Adversarial training provides a principled approach for training robust neural networks.

Adversarial Attack Adversarial Defense +3

Paper
Add Code

A Neural Compositional Paradigm for Image Captioning

1 code implementation • NeurIPS 2018 • Bo Dai, Sanja Fidler, Dahua Lin

Mainstream captioning models often follow a sequential structure to generate captions, leading to issues such as introduction of irrelevant semantics, lack of diversity in the generated captions, and inadequate generalization performance.

Image Captioning

Paper
Code

Neural Network Encapsulation

2 code implementations • ECCV 2018 • Hongyang Li, Xiaoyang Guo, Bo Dai, Wanli Ouyang, Xiaogang Wang

Motivated by the routing to make higher capsule have agreement with lower capsule, we extend the mechanism as a compensation for the rapid loss of information in nearby layers.

Paper
Code

Move Forward and Tell: A Progressive Generator of Video Descriptions

no code implementations • ECCV 2018 • Yilei Xiong, Bo Dai, Dahua Lin

We present an efficient framework that can generate a coherent paragraph to describe a given video.

Descriptive Sentence +1

Paper
Add Code

Rethinking the Form of Latent States in Image Captioning

no code implementations • ECCV 2018 • Bo Dai, Deming Ye, Dahua Lin

Taking advantage of this, we visually reveal the internal dynamics in the process of caption generation, as well as the connections between input visual domain and output linguistic domain.

Caption Generation Image Captioning

Paper
Add Code

Learning Deep Hidden Nonlinear Dynamics from Aggregate Data

no code implementations • 22 Jul 2018 • Yisen Wang, Bo Dai, Lingkai Kong, Sarah Monazam Erfani, James Bailey, Hongyuan Zha

Learning nonlinear dynamics from diffusion data is a challenging problem since the individuals observed may be different at different time points, generally following an aggregate behaviour.

Paper
Add Code

Learning Steady-States of Iterative Algorithms over Graphs

no code implementations • ICML 2018 • Hanjun Dai, Zornitsa Kozareva, Bo Dai, Alex Smola, Le Song

Many graph analytics problems can be solved via iterative algorithms where the solutions are often characterized by a set of steady-state conditions.

Paper
Add Code

Learning towards Minimum Hyperspherical Energy

4 code implementations • NeurIPS 2018 • Weiyang Liu, Rongmei Lin, Zhen Liu, Lixin Liu, Zhiding Yu, Bo Dai, Le Song

In light of this intuition, we reduce the redundancy regularization problem to generic energy minimization, and propose a minimum hyperspherical energy (MHE) objective as generic regularization for neural networks.

148

Paper
Code

Decoupled Networks

1 code implementation • CVPR 2018 • Weiyang Liu, Zhen Liu, Zhiding Yu, Bo Dai, Rongmei Lin, Yisen Wang, James M. Rehg, Le Song

Inner product-based convolution has been a central component of convolutional neural networks (CNNs) and the key to learning visual representations.

Paper
Code

Syntax-Directed Variational Autoencoder for Structured Data

1 code implementation • ICLR 2018 • Hanjun Dai, Yingtao Tian, Bo Dai, Steven Skiena, Le Song

Deep generative models have been enjoying success in modeling continuous data.

Decoder Translation +1

Paper
Code

Boosting the Actor with Dual Critic

no code implementations • ICLR 2018 • Bo Dai, Albert Shaw, Niao He, Lihong Li, Le Song

This paper proposes a new actor-critic-style algorithm called Dual Actor-Critic or Dual-AC.

Paper
Add Code

SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation

no code implementations • ICML 2018 • Bo Dai, Albert Shaw, Lihong Li, Lin Xiao, Niao He, Zhen Liu, Jianshu Chen, Le Song

When function approximation is used, solving the Bellman optimality equation with stability guarantees has remained a major open problem in reinforcement learning for decades.

Q-Learning reinforcement-learning +1

Paper
Add Code

Deep Hyperspherical Learning

no code implementations • NeurIPS 2017 • Weiyang Liu, Yan-Ming Zhang, Xingguo Li, Zhiding Yu, Bo Dai, Tuo Zhao, Le Song

In light of such challenges, we propose hyperspherical convolution (SphereConv), a novel learning framework that gives angular representations on hyperspheres.

Representation Learning

Paper
Add Code

Towards Black-box Iterative Machine Teaching

no code implementations • ICML 2018 • Weiyang Liu, Bo Dai, Xingguo Li, Zhen Liu, James M. Rehg, Le Song

We propose an active teacher model that can actively query the learner (i. e., make the learner take exams) for estimating the learner's status and provably guide the learner to achieve faster convergence.

Paper
Add Code

Contrastive Learning for Image Captioning

no code implementations • NeurIPS 2017 • Bo Dai, Dahua Lin

Specifically, via two constraints formulated on top of a reference model, the proposed method can encourage distinctiveness, while maintaining the overall quality of the generated captions.

Contrastive Learning Image Captioning

Paper
Add Code

Iterative Machine Teaching

2 code implementations • ICML 2017 • Weiyang Liu, Bo Dai, Ahmad Humayun, Charlene Tay, Chen Yu, Linda B. Smith, James M. Rehg, Le Song

Different from traditional machine teaching which views the learners as batch algorithms, we study a new paradigm where the learner uses an iterative algorithm and a teacher can feed examples sequentially and intelligently based on the current performance of the learner.

Paper
Code

Detecting Visual Relationships with Deep Relational Networks

1 code implementation • CVPR 2017 • Bo Dai, Yuqi Zhang, Dahua Lin

Relationships among objects play a crucial role in image understanding.

Ranked #3 on Visual Relationship Detection on VRD Phrase Detection

General Classification

200

Paper
Code

Towards Diverse and Natural Image Descriptions via a Conditional GAN

1 code implementation • ICCV 2017 • Bo Dai, Sanja Fidler, Raquel Urtasun, Dahua Lin

Despite the substantial progress in recent years, the image captioning techniques are still far from being perfect. Sentences produced by existing methods, e. g. those based on RNNs, are often overly rigid and lacking in variability.

Image Captioning

Paper
Code

Stochastic Generative Hashing

2 code implementations • ICML 2017 • Bo Dai, Ruiqi Guo, Sanjiv Kumar, Niao He, Le Song

Learning-based binary hashing has become a powerful paradigm for fast search and retrieval in massive databases.

Retrieval

Paper
Code

Learning from Conditional Distributions via Dual Embeddings

no code implementations • 15 Jul 2016 • Bo Dai, Niao He, Yunpeng Pan, Byron Boots, Le Song

In such problems, each sample $x$ itself is associated with a conditional distribution $p(z|x)$ represented by samples $\{z_i\}_{i=1}^M$, and the goal is to learn a function $f$ that links these conditional distributions to target values $y$.

Paper
Add Code

Discriminative Embeddings of Latent Variable Models for Structured Data

1 code implementation • 17 Mar 2016 • Hanjun Dai, Bo Dai, Le Song

Kernel classifiers and regressors designed for structured data, such as sequences, trees and graphs, have significantly advanced a number of interdisciplinary areas such as computational biology and drug design.

281

Paper
Code

Provable Bayesian Inference via Particle Mirror Descent

no code implementations • 9 Jun 2015 • Bo Dai, Niao He, Hanjun Dai, Le Song

Bayesian methods are appealing in their flexibility in modeling complex data and ability in capturing uncertainty in parameters.

Bayesian Inference Gaussian Processes

Paper
Add Code

Scalable Kernel Methods via Doubly Stochastic Gradients

1 code implementation • NeurIPS 2014 • Bo Dai, Bo Xie, Niao He, YIngyu Liang, Anant Raj, Maria-Florina Balcan, Le Song

The general perception is that kernel methods are not scalable, and neural nets are the methods of choice for nonlinear learning problems.

Paper
Code

Transductive Learning with Multi-class Volume Approximation

no code implementations • 3 Feb 2014 • Gang Niu, Bo Dai, Marthinus Christoffel du Plessis, Masashi Sugiyama

Given a hypothesis space, the large volume principle by Vladimir Vapnik prioritizes equivalence classes according to their volume in the hypothesis space.

Transductive Learning

Paper
Add Code

Robust Low Rank Kernel Embeddings of Multivariate Distributions

no code implementations • NeurIPS 2013 • Le Song, Bo Dai

Kernel embedding of distributions has led to many recent advances in machine learning.

BIG-bench Machine Learning Density Estimation

Paper
Add Code

Nonparametric Estimation of Multi-View Latent Variable Models

no code implementations • 13 Nov 2013 • Le Song, Animashree Anandkumar, Bo Dai, Bo Xie

We establish that the sample complexity for the proposed method is quadratic in the number of latent components and is a low order polynomial in the other relevant parameters.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.