Search Results for author: Diana Wofk

Found 7 papers, 6 papers with code

L-MAGIC: Language Model Assisted Generation of Images with Coherence

1 code implementation3 Jun 2024 Zhipeng Cai, Matthias Mueller, Reiner Birkl, Diana Wofk, Shao-Yen Tseng, Junda Cheng, Gabriela Ben-Melech Stan, Vasudev Lal, Michael Paulitsch

However, the lack of global scene layout priors leads to subpar outputs with duplicated objects (e. g., multiple beds in a bedroom) or requires time-consuming human text inputs for each view.

Depth Estimation Language Modelling +2

LDM3D-VR: Latent Diffusion Model for 3D VR

no code implementations6 Nov 2023 Gabriela Ben Melech Stan, Diana Wofk, Estelle Aflalo, Shao-Yen Tseng, Zhipeng Cai, Michael Paulitsch, Vasudev Lal

Our models are fine-tuned from existing pretrained models on datasets containing panoramic/high-resolution RGB images, depth maps and captions.

MiDaS v3.1 -- A Model Zoo for Robust Monocular Relative Depth Estimation

2 code implementations26 Jul 2023 Reiner Birkl, Diana Wofk, Matthias Müller

We release MiDaS v3. 1 for monocular depth estimation, offering a variety of new models based on different encoder backbones.

Image Classification Monocular Depth Estimation

LDM3D: Latent Diffusion Model for 3D

2 code implementations18 May 2023 Gabriela Ben Melech Stan, Diana Wofk, Scottie Fox, Alex Redden, Will Saxton, Jean Yu, Estelle Aflalo, Shao-Yen Tseng, Fabio Nonato, Matthias Muller, Vasudev Lal

This research paper proposes a Latent Diffusion Model for 3D (LDM3D) that generates both image and depth map data from a given text prompt, allowing users to generate RGBD images from text prompts.

Monocular Visual-Inertial Depth Estimation

1 code implementation21 Mar 2023 Diana Wofk, René Ranftl, Matthias Müller, Vladlen Koltun

We evaluate on the TartanAir and VOID datasets, observing up to 30% reduction in inverse RMSE with dense scale alignment relative to performing just global alignment alone.

Depth Completion Monocular Depth Estimation

ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth

3 code implementations23 Feb 2023 Shariq Farooq Bhat, Reiner Birkl, Diana Wofk, Peter Wonka, Matthias Müller

Finally, ZoeD-M12-NK is the first model that can jointly train on multiple datasets (NYU Depth v2 and KITTI) without a significant drop in performance and achieve unprecedented zero-shot generalization performance to eight unseen datasets from both indoor and outdoor domains.

Ranked #16 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)

Monocular Depth Estimation Zero-shot Generalization

Cannot find the paper you are looking for? You can Submit a new open access paper.