Search Results for author: Jonathan Tremblay

Found 32 papers, 17 papers with code

Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects

1 code implementation • 1 Apr 2024 • Yijia Weng, Bowen Wen, Jonathan Tremblay, Valts Blukis, Dieter Fox, Leonidas Guibas, Stan Birchfield

We address the problem of building digital twins of unknown articulated objects from two RGBD scans of the object at different articulation states.

Object

Paper
Code

Snap-it, Tap-it, Splat-it: Tactile-Informed 3D Gaussian Splatting for Reconstructing Challenging Surfaces

no code implementations • 29 Mar 2024 • Mauro Comi, Alessio Tonioni, Max Yang, Jonathan Tremblay, Valts Blukis, Yijiong Lin, Nathan F. Lepora, Laurence Aitchison

Touch and vision go hand in hand, mutually enhancing our ability to understand the world.

Novel View Synthesis Surface Reconstruction

Paper
Add Code

Diff-DOPE: Differentiable Deep Object Pose Estimation

no code implementations • 30 Sep 2023 • Jonathan Tremblay, Bowen Wen, Valts Blukis, Balakumar Sundaralingam, Stephen Tyree, Stan Birchfield

We introduce Diff-DOPE, a 6-DoF pose refiner that takes as input an image, a 3D textured model of an object, and an initial pose of the object.

Object Pose Estimation +1

Paper
Add Code

HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions

no code implementations • 2 Aug 2023 • Andrew Guo, Bowen Wen, Jianhe Yuan, Jonathan Tremblay, Stephen Tyree, Jeffrey Smith, Stan Birchfield

We outline the usefulness of our dataset for 6-DoF category-level pose+scale estimation and related tasks.

Pose Estimation

Paper
Add Code

Partial-View Object View Synthesis via Filtered Inversion

no code implementations • 3 Apr 2023 • Fan-Yun Sun, Jonathan Tremblay, Valts Blukis, Kevin Lin, Danfei Xu, Boris Ivanovic, Peter Karkus, Stan Birchfield, Dieter Fox, Ruohan Zhang, Yunzhu Li, Jiajun Wu, Marco Pavone, Nick Haber

At inference, given one or more views of a novel real-world object, FINV first finds a set of latent codes for the object by inverting the generative model from multiple initial seeds.

Object

Paper
Add Code

TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation

no code implementations • CVPR 2023 • Taeyeop Lee, Jonathan Tremblay, Valts Blukis, Bowen Wen, Byeong-Uk Lee, Inkyu Shin, Stan Birchfield, In So Kweon, Kuk-Jin Yoon

Unlike previous unsupervised domain adaptation methods for category-level object pose estimation, our approach processes the test data in a sequential, online manner, and it does not require access to the source domain at runtime.

Object Pose Estimation +2

Paper
Add Code

BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects

1 code implementation • CVPR 2023 • Bowen Wen, Jonathan Tremblay, Valts Blukis, Stephen Tyree, Thomas Muller, Alex Evans, Dieter Fox, Jan Kautz, Stan Birchfield

We present a near real-time method for 6-DoF tracking of an unknown object from a monocular RGBD video sequence, while simultaneously performing neural 3D reconstruction of the object.

3D Object Tracking 3D Reconstruction +5

897

Paper
Code

MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare

no code implementations • 13 Dec 2022 • Yann Labbé, Lucas Manuelli, Arsalan Mousavian, Stephen Tyree, Stan Birchfield, Jonathan Tremblay, Justin Carpentier, Mathieu Aubry, Dieter Fox, Josef Sivic

Second, we introduce a novel approach for coarse pose estimation which leverages a network trained to classify whether the pose error between a synthetic rendering and an observed image of the same object can be corrected by the refiner.

6D Pose Estimation Object

Paper
Add Code

RGB-Only Reconstruction of Tabletop Scenes for Collision-Free Manipulator Control

no code implementations • 21 Oct 2022 • Zhenggang Tang, Balakumar Sundaralingam, Jonathan Tremblay, Bowen Wen, Ye Yuan, Stephen Tyree, Charles Loop, Alexander Schwing, Stan Birchfield

We present a system for collision-free control of a robot manipulator that uses only RGB views of the world.

Model Predictive Control

Paper
Add Code

One-Shot Neural Fields for 3D Object Understanding

no code implementations • 21 Oct 2022 • Valts Blukis, Taeyeop Lee, Jonathan Tremblay, Bowen Wen, In So Kweon, Kuk-Jin Yoon, Dieter Fox, Stan Birchfield

At test-time, we build the representation from a single RGB input image observing the scene from only one viewpoint.

3D Reconstruction Decoder +3

Paper
Add Code

Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation

1 code implementation • 18 Oct 2022 • Yunzhi Lin, Thomas Müller, Jonathan Tremblay, Bowen Wen, Stephen Tyree, Alex Evans, Patricio A. Vela, Stan Birchfield

We present a parallelized optimization method based on fast Neural Radiance Fields (NeRF) for estimating 6-DoF pose of a camera with respect to an object or scene.

Pose Estimation

Paper
Code

ProgPrompt: Generating Situated Robot Task Plans using Large Language Models

no code implementations • 22 Sep 2022 • Ishika Singh, Valts Blukis, Arsalan Mousavian, Ankit Goyal, Danfei Xu, Jonathan Tremblay, Dieter Fox, Jesse Thomason, Animesh Garg

To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information.

Paper
Add Code

Variable Bitrate Neural Fields

1 code implementation • 15 Jun 2022 • Towaki Takikawa, Alex Evans, Jonathan Tremblay, Thomas Müller, Morgan McGuire, Alec Jacobson, Sanja Fidler

Neural approximations of scalar and vector fields, such as signed distance functions and radiance fields, have emerged as accurate, high-quality representations.

Decoder

225

Paper
Code

Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation

1 code implementation • 23 May 2022 • Yunzhi Lin, Jonathan Tremblay, Stephen Tyree, Patricio A. Vela, Stan Birchfield

We propose a single-stage, category-level 6-DoF pose estimation algorithm that simultaneously detects and tracks instances of objects within a known category.

Pose Estimation Pose Tracking

257

Paper
Code

RTMV: A Ray-Traced Multi-View Synthetic Dataset for Novel View Synthesis

no code implementations • 14 May 2022 • Jonathan Tremblay, Moustafa Meshry, Alex Evans, Jan Kautz, Alexander Keller, Sameh Khamis, Thomas Müller, Charles Loop, Nathan Morrical, Koki Nagano, Towaki Takikawa, Stan Birchfield

We present a large-scale synthetic dataset for novel view synthesis consisting of ~300k images rendered from nearly 2000 complex scenes using high-quality ray tracing at high resolution (1600 x 1600 pixels).

Ranked #1 on Novel View Synthesis on RTMV

Novel View Synthesis

Paper
Add Code

6-DoF Pose Estimation of Household Objects for Robotic Manipulation: An Accessible Dataset and Benchmark

1 code implementation • 11 Mar 2022 • Stephen Tyree, Jonathan Tremblay, Thang To, Jia Cheng, Terry Mosier, Jeffrey Smith, Stan Birchfield

We propose a set of toy grocery objects, whose physical instantiations are readily available for purchase and are appropriately sized for robotic grasping and manipulation.

Pose Estimation Robotic Grasping

Paper
Code

Watch It Move: Unsupervised Discovery of 3D Joints for Re-Posing of Articulated Objects

1 code implementation • CVPR 2022 • Atsuhiro Noguchi, Umar Iqbal, Jonathan Tremblay, Tatsuya Harada, Orazio Gallo

Rendering articulated objects while controlling their poses is critical to applications such as virtual reality or animation for movies.

Object

Paper
Code

Efficient Geometry-aware 3D Generative Adversarial Networks

2 code implementations • CVPR 2022 • Eric R. Chan, Connor Z. Lin, Matthew A. Chan, Koki Nagano, Boxiao Pan, Shalini De Mello, Orazio Gallo, Leonidas Guibas, Jonathan Tremblay, Sameh Khamis, Tero Karras, Gordon Wetzstein

Unsupervised generation of high-quality multi-view-consistent images and 3D shapes using only collections of single-view 2D photographs has been a long-standing challenge.

Computational Efficiency Neural Rendering

3,132

Paper
Code

Single-Stage Keypoint-Based Category-Level Object Pose Estimation from an RGB Image

1 code implementation • 13 Sep 2021 • Yunzhi Lin, Jonathan Tremblay, Stephen Tyree, Patricio A. Vela, Stan Birchfield

Prior work on 6-DoF object pose estimation has largely focused on instance-level processing, in which a textured CAD model is available for each object being detected.

Object object-detection +2

257

Paper
Code

NViSII: A Scriptable Tool for Photorealistic Image Generation

2 code implementations • 28 May 2021 • Nathan Morrical, Jonathan Tremblay, Yunzhi Lin, Stephen Tyree, Stan Birchfield, Valerio Pascucci, Ingo Wald

We present a Python-based renderer built on NVIDIA's OptiX ray tracing engine and the OptiX AI denoiser, designed to generate high-quality synthetic images for research in computer vision and deep learning.

Image Generation Optical Flow Estimation +1

310

Paper
Code

DexYCB: A Benchmark for Capturing Hand Grasping of Objects

2 code implementations • CVPR 2021 • Yu-Wei Chao, Wei Yang, Yu Xiang, Pavlo Molchanov, Ankur Handa, Jonathan Tremblay, Yashraj S. Narang, Karl Van Wyk, Umar Iqbal, Stan Birchfield, Jan Kautz, Dieter Fox

We introduce DexYCB, a new dataset for capturing hand grasping of objects.

3D Hand Pose Estimation 6D Pose Estimation using RGB +2

141

Paper
Code

Fast Uncertainty Quantification for Deep Object Pose Estimation

no code implementations • 16 Nov 2020 • Guanya Shi, Yifeng Zhu, Jonathan Tremblay, Stan Birchfield, Fabio Ramos, Animashree Anandkumar, Yuke Zhu

Deep learning-based object pose estimators are often unreliable and overconfident especially when the input image is outside the training domain, for instance, with sim2real transfer.

Object Pose Estimation +1

Paper
Add Code

Indirect Object-to-Robot Pose Estimation from an External Monocular RGB Camera

1 code implementation • 26 Aug 2020 • Jonathan Tremblay, Stephen Tyree, Terry Mosier, Stan Birchfield

We present a robotic grasping system that uses a single external monocular RGB camera as input.

Robotics

966

Paper
Code

Guided Uncertainty-Aware Policy Optimization: Combining Learning and Model-Based Strategies for Sample-Efficient Policy Learning

no code implementations • 21 May 2020 • Michelle A. Lee, Carlos Florensa, Jonathan Tremblay, Nathan Ratliff, Animesh Garg, Fabio Ramos, Dieter Fox

Traditional robotic approaches rely on an accurate model of the environment, a detailed description of how to perform the task, and a robust perception system to keep track of the current state.

Paper
Add Code

PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification Using Highly Randomized Synthetic Data

8 code implementations • ICCV 2019 • Zheng Tang, Milind Naphade, Stan Birchfield, Jonathan Tremblay, William Hodge, Ratnesh Kumar, Shuo Wang, Xiaodong Yang

In comparison with person re-identification (ReID), which has been widely studied in the research community, vehicle ReID has received less attention.

Attribute Multi-Task Learning +3

334

Paper
Code

Contextual Reinforcement Learning of Visuo-tactile Multi-fingered Grasping Policies

no code implementations • 21 Nov 2019 • Visak Kumar, Tucker Herman, Dieter Fox, Stan Birchfield, Jonathan Tremblay

We propose a Grasping Objects Approach for Tactile (GOAT) robotic hands, learning to overcome the reality gap problem.

Robotics

Paper
Add Code

Camera-to-Robot Pose Estimation from a Single Image

2 code implementations • 21 Nov 2019 • Timothy E. Lee, Jonathan Tremblay, Thang To, Jia Cheng, Terry Mosier, Oliver Kroemer, Dieter Fox, Stan Birchfield

We show experimental results for three different camera sensors, demonstrating that our approach is able to achieve accuracy with a single frame that is better than that of classic off-line hand-eye calibration using multiple frames.

Robotics

139

Paper
Code

Few-Shot Viewpoint Estimation

no code implementations • 13 May 2019 • Hung-Yu Tseng, Shalini De Mello, Jonathan Tremblay, Sifei Liu, Stan Birchfield, Ming-Hsuan Yang, Jan Kautz

Through extensive experimentation on the ObjectNet3D and Pascal3D+ benchmark datasets, we demonstrate that our framework, which we call MetaView, significantly outperforms fine-tuning the state-of-the-art models with few examples, and that the specific architectural innovations of our method are crucial to achieving good performance.

Meta-Learning Viewpoint Estimation

Paper
Add Code

Deep Object Pose Estimation for Semantic Robotic Grasping of Household Objects

8 code implementations • 27 Sep 2018 • Jonathan Tremblay, Thang To, Balakumar Sundaralingam, Yu Xiang, Dieter Fox, Stan Birchfield

Using synthetic data generated in this manner, we introduce a one-shot deep neural network that is able to perform competitively against a state-of-the-art network trained on a combination of real and synthetic data.

Robotics

966

Paper
Code

Synthetically Trained Neural Networks for Learning Human-Readable Plans from Real-World Demonstrations

1 code implementation • 18 May 2018 • Jonathan Tremblay, Thang To, Artem Molchanov, Stephen Tyree, Jan Kautz, Stan Birchfield

We present a system to infer and execute a human-readable program from a real-world demonstration.

Robotics

556

Paper
Code

Training Deep Networks with Synthetic Data: Bridging the Reality Gap by Domain Randomization

1 code implementation • 18 Apr 2018 • Jonathan Tremblay, Aayush Prakash, David Acuna, Mark Brophy, Varun Jampani, Cem Anil, Thang To, Eric Cameracci, Shaad Boochoon, Stan Birchfield

We present a system for training deep neural networks for object detection using synthetic images.

Object object-detection +1

456

Paper
Code

Falling Things: A Synthetic Dataset for 3D Object Detection and Pose Estimation

no code implementations • 18 Apr 2018 • Jonathan Tremblay, Thang To, Stan Birchfield

We present a new dataset, called Falling Things (FAT), for advancing the state-of-the-art in object detection and 3D pose estimation in the context of robotics.

3D Object Detection 3D Pose Estimation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.