Search Results for author: Matthew Trager

Found 25 papers, 4 papers with code

NeRF-Insert: 3D Local Editing with Multimodal Control Signals

no code implementations • 30 Apr 2024 • Benet Oriol Sabat, Alessandro Achille, Matthew Trager, Stefano Soatto

We propose NeRF-Insert, a NeRF editing framework that allows users to make high-quality local edits with a flexible level of control.

Image Generation

Paper
Add Code

Multi-Modal Hallucination Control by Visual Information Grounding

no code implementations • 20 Mar 2024 • Alessandro Favero, Luca Zancato, Matthew Trager, Siddharth Choudhary, Pramuditha Perera, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto

In particular, we show that as more tokens are generated, the reliance on the visual prompt decreases, and this behavior strongly correlates with the emergence of hallucinations.

Hallucination Visual Question Answering (VQA)

Paper
Add Code

Interpretable Measures of Conceptual Similarity by Complexity-Constrained Descriptive Auto-Encoding

no code implementations • 14 Feb 2024 • Alessandro Achille, Greg Ver Steeg, Tian Yu Liu, Matthew Trager, Carson Klingenberg, Stefano Soatto

Quantifying the degree of similarity between images is a key copyright issue for image-based machine learning.

Descriptive text similarity

Paper
Add Code

Meaning Representations from Trajectories in Autoregressive Models

1 code implementation • 23 Oct 2023 • Tian Yu Liu, Matthew Trager, Alessandro Achille, Pramuditha Perera, Luca Zancato, Stefano Soatto

We propose to extract meaning representations from autoregressive language models by considering the distribution of all possible trajectories extending an input text.

Paper
Code

Towards Visual Foundational Models of Physical Scenes

no code implementations • 6 Jun 2023 • Chethan Parameshwara, Alessandro Achille, Xiaolong Li, Jiawei Mo, Matthew Trager, Ashwin Swaminathan, Cj Taylor, Dheera Venkatraman, Xiaohan Fei, Stefano Soatto

We describe a first step towards learning general-purpose visual representations of physical scenes using only image prediction as a training criterion.

Paper
Add Code

Prompt Algebra for Task Composition

no code implementations • 1 Jun 2023 • Pramuditha Perera, Matthew Trager, Luca Zancato, Alessandro Achille, Stefano Soatto

We investigate whether prompts learned independently for different tasks can be later combined through prompt algebra to obtain a model that supports composition of tasks.

Attribute Classification

Paper
Add Code

Function Space and Critical Points of Linear Convolutional Networks

no code implementations • 12 Apr 2023 • Kathlén Kohn, Guido Montúfar, Vahid Shahverdi, Matthew Trager

We study the geometry of linear networks with one-dimensional convolutional layers.

Paper
Add Code

Train/Test-Time Adaptation with Retrieval

no code implementations • CVPR 2023 • Luca Zancato, Alessandro Achille, Tian Yu Liu, Matthew Trager, Pramuditha Perera, Stefano Soatto

Second, we apply ${\rm T^3AR}$ for test-time adaptation and show that exploiting a pool of external images at test-time leads to more robust representations over existing methods on DomainNet-126 and VISDA-C, especially when few adaptation data are available (up to 8%).

Retrieval Test-time Adaptation

Paper
Add Code

Linear Spaces of Meanings: Compositional Structures in Vision-Language Models

no code implementations • ICCV 2023 • Matthew Trager, Pramuditha Perera, Luca Zancato, Alessandro Achille, Parminder Bhatia, Stefano Soatto

These vectors can be seen as "ideal words" for generating concepts directly within the embedding space of the model.

Disentanglement Retrieval

Paper
Add Code

À-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting

no code implementations • 15 Feb 2023 • Benjamin Bowman, Alessandro Achille, Luca Zancato, Matthew Trager, Pramuditha Perera, Giovanni Paolini, Stefano Soatto

During inference, models can be assembled based on arbitrary selections of data sources, which we call "\`a-la-carte learning".

Continual Learning

Paper
Add Code

A-La-Carte Prompt Tuning (APT): Combining Distinct Data via Composable Prompting

no code implementations • CVPR 2023 • Benjamin Bowman, Alessandro Achille, Luca Zancato, Matthew Trager, Pramuditha Perera, Giovanni Paolini, Stefano Soatto

During inference, models can be assembled based on arbitrary selections of data sources, which we call a-la-carte learning.

Continual Learning

Paper
Add Code

Geometry of Linear Convolutional Networks

no code implementations • 3 Aug 2021 • Kathlén Kohn, Thomas Merkh, Guido Montúfar, Matthew Trager

We study the family of functions that are represented by a linear convolutional neural network (LCN).

Paper
Add Code

Symmetry Breaking in Symmetric Tensor Decomposition

no code implementations • 10 Mar 2021 • Yossi Arjevani, Joan Bruna, Michael Field, Joe Kileel, Matthew Trager, Francis Williams

In this note, we consider the highly nonconvex optimization problem associated with computing the rank decomposition of symmetric tensors.

Tensor Decomposition

Paper
Add Code

Neural Splines: Fitting 3D Surfaces with Infinitely-Wide Neural Networks

1 code implementation • CVPR 2021 • Francis Williams, Matthew Trager, Joan Bruna, Denis Zorin

We present Neural Splines, a technique for 3D surface reconstruction that is based on random feature kernels arising from infinitely-wide shallow ReLU networks.

Surface Reconstruction

Paper
Code

Pure and Spurious Critical Points: a Geometric Study of Linear Networks

no code implementations • ICLR 2020 • Matthew Trager, Kathlén Kohn, Joan Bruna

The critical locus of the loss function of a neural network is determined by the geometry of the functional space and by the parameterization of this space by the network's weights.

Paper
Add Code

Gradient Dynamics of Shallow Univariate ReLU Networks

no code implementations • NeurIPS 2019 • Francis Williams, Matthew Trager, Claudio Silva, Daniele Panozzo, Denis Zorin, Joan Bruna

We show that the gradient dynamics of such networks are determined by the gradient flow in a non-redundant parameterization of the network function.

Paper
Add Code

Coordinate-Free Carlsson-Weinshall Duality and Relative Multi-View Geometry

no code implementations • CVPR 2019 • Matthew Trager, Martial Hebert, Jean Ponce

We present a coordinate-free description of Carlsson-Weinshall duality between scene points and camera pinholes and use it to derive a new characterization of primal/dual multi-view geometry.

Paper
Add Code

On the Expressive Power of Deep Polynomial Neural Networks

1 code implementation • NeurIPS 2019 • Joe Kileel, Matthew Trager, Joan Bruna

We study deep neural networks with polynomial activations, particularly their expressive power.

Paper
Code

On the Solvability of Viewing Graphs

1 code implementation • ECCV 2018 • Matthew Trager, Brian Osserman, Jean Ponce

A set of fundamental matrices relating pairs of cameras in some configuration can be represented as edges of a "viewing graph".

Paper
Code

Consistent sets of lines with no colorful incidence

no code implementations • 16 Mar 2018 • Boris Bukh, Xavier Goaoc, Alfredo Hubard, Matthew Trager

We consider incidences among colored sets of lines in $\mathbb{R}^d$ and examine whether the existence of certain concurrences between lines of $k$ colors force the existence of at least one concurrence between lines of $k+1$ colors.

3D Reconstruction

Paper
Add Code

Changing Views on Curves and Surfaces

no code implementations • 6 Jul 2017 • Kathlén Kohn, Bernd Sturmfels, Matthew Trager

Visual events in computer vision are studied from the perspective of algebraic geometry.

Paper
Add Code

General models for rational cameras and the case of two-slit projections

no code implementations • CVPR 2017 • Matthew Trager, Bernd Sturmfels, John Canny, Martial Hebert, Jean Ponce

The rational camera model recently introduced in [19] provides a general methodology for studying abstract nonlinear imaging systems and their multi-view geometry.

Paper
Add Code

Congruences and Concurrent Lines in Multi-View Geometry

no code implementations • 21 Aug 2016 • Jean Ponce, Bernd Sturmfels, Matthew Trager

We present a new framework for multi-view geometry in computer vision.

Paper
Add Code

Consistency of Silhouettes and Their Duals

no code implementations • CVPR 2016 • Matthew Trager, Martial Hebert, Jean Ponce

Silhouettes provide rich information on three-dimensional shape, since the intersection of the associated visual cones generates the "visual hull", which encloses and approximates the original shape.

Camera Calibration Object +1

Paper
Add Code

The Joint Image Handbook

no code implementations • ICCV 2015 • Matthew Trager, Martial Hebert, Jean Ponce

Given multiple perspective photographs, point correspondences form the "joint image", effectively a replica of three dimensional space distributed across its two-dimensional projections.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.