Search Results for author: Nataniel Ruiz

Found 22 papers, 9 papers with code

ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs

1 code implementation • 22 Nov 2023 • Viraj Shah, Nataniel Ruiz, Forrester Cole, Erika Lu, Svetlana Lazebnik, Yuanzhen Li, Varun Jampani

Experiments on a wide range of subject and style combinations show that ZipLoRA can generate compelling results with meaningful improvements over baselines in subject and style fidelity while preserving the ability to recontextualize.

457

RealFill: Reference-Driven Generation for Authentic Image Completion

no code implementations • 28 Sep 2023 • Luming Tang, Nataniel Ruiz, Qinghao Chu, Yuanzhen Li, Aleksander Holynski, David E. Jacobs, Bharath Hariharan, Yael Pritch, Neal Wadhwa, Kfir Aberman, Michael Rubinstein

Once personalized, RealFill is able to complete a target image with visually compelling contents that are faithful to the original scene.

Platypus: Quick, Cheap, and Powerful Refinement of LLMs

2 code implementations • 14 Aug 2023 • Ariel N. Lee, Cole J. Hunter, Nataniel Ruiz

We present $\textbf{Platypus}$, a family of fine-tuned and merged Large Language Models (LLMs) that achieves the strongest performance and currently stands at first place in HuggingFace's Open LLM Leaderboard as of the release date of this work.

623

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

2 code implementations • 13 Jul 2023 • Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Wei Wei, Tingbo Hou, Yael Pritch, Neal Wadhwa, Michael Rubinstein, Kfir Aberman

By composing these weights into the diffusion model, coupled with fast finetuning, HyperDreamBooth can generate a person's face in various contexts and styles, with high subject details while also preserving the model's crucial knowledge of diverse styles and semantic modifications.

Diffusion Personalization Tuning Free

153

Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing

no code implementations • 30 Jun 2023 • Ariel N. Lee, Sarah Adel Bargal, Janavi Kasera, Stan Sclaroff, Kate Saenko, Nataniel Ruiz

We hypothesize that this power to ignore out-of-context information (which we name $\textit{patch selectivity}$), while integrating in-context information in a non-local manner in early layers, allows ViTs to more easily handle occlusion.

Data Augmentation Inductive Bias

StyleDrop: Text-to-Image Generation in Any Style

3 code implementations • 1 Jun 2023 • Kihyuk Sohn, Nataniel Ruiz, Kimin Lee, Daniel Castro Chin, Irina Blok, Huiwen Chang, Jarred Barber, Lu Jiang, Glenn Entis, Yuanzhen Li, Yuan Hao, Irfan Essa, Michael Rubinstein, Dilip Krishnan

Pre-trained large text-to-image models synthesize impressive images with an appropriate use of text prompts.

Text-to-Image Generation

551

DreamBooth3D: Subject-Driven Text-to-3D Generation

no code implementations • ICCV 2023 • Amit Raj, Srinivas Kaza, Ben Poole, Michael Niemeyer, Nataniel Ruiz, Ben Mildenhall, Shiran Zada, Kfir Aberman, Michael Rubinstein, Jonathan Barron, Yuanzhen Li, Varun Jampani

We present DreamBooth3D, an approach to personalize text-to-3D generative models from as few as 3-6 casually captured images of a subject.

3D Generation Text to 3D

Finding Differences Between Transformers and ConvNets Using Counterfactual Simulation Testing

no code implementations • 29 Nov 2022 • Nataniel Ruiz, Sarah Adel Bargal, Cihang Xie, Kate Saenko, Stan Sclaroff

One shortcoming of this is the fact that these deep neural networks cannot be easily evaluated for robustness issues with respect to specific scene variations.

counterfactual Object

Human Body Measurement Estimation with Adversarial Augmentation

no code implementations • 11 Oct 2022 • Nataniel Ruiz, Miriam Bellver, Timo Bolkart, Ambuj Arora, Ming C. Lin, Javier Romero, Raja Bala

Training of BMnet is performed on data from real human subjects, and augmented with a novel adversarial body simulator (ABS) that finds and synthesizes challenging body shapes.

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

10 code implementations • CVPR 2023 • Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein, Kfir Aberman

Once the subject is embedded in the output domain of the model, the unique identifier can be used to synthesize novel photorealistic images of the subject contextualized in different scenes.

Diffusion Personalization Image Generation

9,876

Examining the Human Perceptibility of Black-Box Adversarial Attacks on Face Recognition

no code implementations • ICML Workshop AML 2021 • Benjamin Spetter-Goldstein, Nataniel Ruiz, Sarah Adel Bargal

We also show how the $\ell_2$ norm and other metrics do not correlate with human perceptibility in a linear fashion, thus making these norms suboptimal at measuring adversarial attack perceptibility.

Adversarial Attack Face Recognition

Simulated Adversarial Testing of Face Recognition Models

no code implementations • CVPR 2022 • Nataniel Ruiz, Adam Kortylewski, Weichao Qiu, Cihang Xie, Sarah Adel Bargal, Alan Yuille, Stan Sclaroff

In this work, we propose a framework for learning how to test machine learning algorithms using simulators in an adversarial manner in order to find weaknesses in the model before deploying it in critical scenarios.

BIG-bench Machine Learning Face Recognition

MorphGAN: One-Shot Face Synthesis GAN for Detecting Recognition Bias

no code implementations • 9 Dec 2020 • Nataniel Ruiz, Barry-John Theobald, Anurag Ranjan, Ahmed Hussein Abdelaziz, Nicholas Apostoloff

Images generated using MorphGAN conserve the identity of the person in the original image, and the provided control over head pose and facial expression allows test sets to be created to identify robustness issues of a facial recognition deep network with respect to pose and expression.

Data Augmentation Face Generation +2

Protecting Against Image Translation Deepfakes by Leaking Universal Perturbations from Black-Box Neural Networks

no code implementations • 11 Jun 2020 • Nataniel Ruiz, Sarah Adel Bargal, Stan Sclaroff

In this work, we develop efficient disruptions of black-box image translation deepfake generation systems.

Face Swapping General Classification +1

Detecting Attended Visual Targets in Video

1 code implementation • CVPR 2020 • Eunji Chong, Yongxin Wang, Nataniel Ruiz, James M. Rehg

We address the problem of detecting attention targets in video.

Deep Attention

158

Disrupting Deepfakes: Adversarial Attacks Against Conditional Image Translation Networks and Facial Manipulation Systems

4 code implementations • 3 Mar 2020 • Nataniel Ruiz, Sarah Adel Bargal, Stan Sclaroff

This type of manipulated images and video have been coined Deepfakes.

Adversarial Attack Attribute +1

298

Leveraging Affect Transfer Learning for Behavior Prediction in an Intelligent Tutoring System

no code implementations • 12 Feb 2020 • Nataniel Ruiz, Hao Yu, Danielle A. Allessio, Mona Jalal, Ajjen Joshi, Thomas Murray, John J. Magee, Jacob R. Whitehill, Vitaly Ablavsky, Ivon Arroyo, Beverly P. Woolf, Stan Sclaroff, Margrit Betke

In this work, we propose a video-based transfer learning approach for predicting problem outcomes of students working with an intelligent tutoring system (ITS).

Math Transfer Learning

Learning To Simulate

no code implementations • ICLR 2019 • Nataniel Ruiz, Samuel Schulter, Manmohan Chandraker

Simulation is a useful tool in situations where training data for machine learning models is costly to annotate or even hard to acquire.

Learning to Localize and Align Fine-Grained Actions to Sparse Instructions

no code implementations • 22 Sep 2018 • Meera Hahn, Nataniel Ruiz, Jean-Baptiste Alayrac, Ivan Laptev, James M. Rehg

Automatic generation of textual video descriptions that are time-aligned with video content is a long-standing goal in computer vision.

Object Object Recognition

Connecting Gaze, Scene, and Attention: Generalized Attention Estimation via Joint Modeling of Gaze and Scene Saliency

no code implementations • ECCV 2018 • Eunji Chong, Nataniel Ruiz, Yongxin Wang, Yun Zhang, Agata Rozga, James Rehg

This paper addresses the challenging problem of estimating the general visual attention of people in images.

Multi-Task Learning

Fine-Grained Head Pose Estimation Without Keypoints

13 code implementations • 2 Oct 2017 • Nataniel Ruiz, Eunji Chong, James M. Rehg

Estimating the head pose of a person is a crucial problem that has a large amount of applications such as aiding in gaze estimation, modeling attention, fitting 3D models to video and performing face alignment.

Ranked #5 on Head Pose Estimation on AFLW

Face Alignment Gaze Estimation +1

1,526

Dockerface: an Easy to Install and Use Faster R-CNN Face Detector in a Docker Container

1 code implementation • 15 Aug 2017 • Nataniel Ruiz, James M. Rehg

Face detection is a very important task and a necessary pre-processing step for many applications such as facial landmark detection, pose estimation, sentiment analysis and face recognition.

Face Detection Face Recognition +3

189
