Search Results for author: Narendra Ahuja

Found 31 papers, 8 papers with code

Improving Multi-label Recognition using Class Co-Occurrence Probabilities

no code implementations • 24 Apr 2024 • Samyak Rawlekar, Shubhang Bhatnagar, Vishnuvardhan Pogunulu Srinivasulu, Narendra Ahuja

Multi-label Recognition (MLR) involves the identification of multiple objects within an image.

Paper
Add Code

Piecewise-Linear Manifolds for Deep Metric Learning

no code implementations • 22 Mar 2024 • Shubhang Bhatnagar, Narendra Ahuja

For this purpose, we propose to model the high-dimensional data manifold using a piecewise-linear approximation, with each low-dimensional linear piece approximating the data manifold in a small neighborhood of a point.

Image Retrieval Metric Learning +2

Paper
Add Code

Learning Implicit Representation for Reconstructing Articulated Objects

no code implementations • 16 Jan 2024 • Hao Zhang, Fang Li, Samyak Rawlekar, Narendra Ahuja

Our method simultaneously estimates the visible (explicit) representation (3D shapes, colors, camera parameters) and the implicit skeletal representation, from motion cues in the object video without 3D supervision.

3D Reconstruction Object

Paper
Add Code

CSL: Class-Agnostic Structure-Constrained Learning for Segmentation Including the Unseen

no code implementations • 9 Dec 2023 • Hao Zhang, Fang Li, Lu Qi, Ming-Hsuan Yang, Narendra Ahuja

Addressing Out-Of-Distribution (OOD) Segmentation and Zero-Shot Semantic Segmentation (ZS3) is challenging, necessitating segmenting unseen classes.

Domain Adaptation Segmentation +2

Paper
Add Code

Open-NeRF: Towards Open Vocabulary NeRF Decomposition

no code implementations • 25 Oct 2023 • Hao Zhang, Fang Li, Narendra Ahuja

Current techniques for NeRF decomposition involve a trade-off between the flexibility of processing open-vocabulary queries and the accuracy of 3D segmentation.

3D Reconstruction Segmentation

Paper
Add Code

Long-Distance Gesture Recognition using Dynamic Neural Networks

no code implementations • 9 Aug 2023 • Shubhang Bhatnagar, Sharath Gopal, Narendra Ahuja, Liu Ren

We demonstrate the performance of our method on the LD-ConGR long-distance dataset where it outperforms previous state-of-the-art methods on recognition accuracy and compute efficiency.

Gesture Recognition

Paper
Add Code

Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation

no code implementations • 29 Oct 2022 • Moitreya Chatterjee, Narendra Ahuja, Anoop Cherian

In this paper, we propose to use this connection between audio and visual dynamics for solving two challenging tasks simultaneously, namely: (i) separating audio sources from a mixture using visual cues, and (ii) predicting the 3D visual motion of a sounding source using its separated audio.

Audio Source Separation

Paper
Add Code

Tracking Grow-Finish Pigs Across Large Pens Using Multiple Cameras

1 code implementation • 22 Nov 2021 • Aniket Shirke, Aziz Saifuddin, Achleshwar Luthra, Jiangong Li, Tawni Williams, Xiaodan Hu, Aneesh Kotnana, Okan Kocabalkanli, Narendra Ahuja, Angela Green-Miller, Isabella Condotta, Ryan N. Dilger, Matthew Caesar

We identify the adjacent camera and the shared pig location on the floor at the handover time using inter-view homography.

Multi-Object Tracking object-detection +1

Paper
Code

A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction

no code implementations • ICCV 2021 • Moitreya Chatterjee, Narendra Ahuja, Anoop Cherian

Predicting the future frames of a video is a challenging task, in part due to the underlying stochastic real-world phenomena.

Video Generation Video Prediction

Paper
Add Code

Visual Scene Graphs for Audio Source Separation

no code implementations • ICCV 2021 • Moitreya Chatterjee, Jonathan Le Roux, Narendra Ahuja, Anoop Cherian

At its core, AVSGS uses a recursive neural network that emits mutually-orthogonal sub-graph embeddings of the visual graph using multi-head attention.

AudioCaps Audio Source Separation

Paper
Add Code

Unsupervised 3D Pose Estimation for Hierarchical Dance Video Recognition

1 code implementation • ICCV 2021 • Xiaodan Hu, Narendra Ahuja

Dance experts often view dance as a hierarchy of information, spanning low-level (raw images, image sequences), mid-levels (human poses and bodypart movements), and high-level (dance genre).

Ranked #3 on Weakly-supervised 3D Human Pose Estimation on Human3.6M

3D Pose Estimation Unsupervised 3D Human Pose Estimation +2

306

Paper
Code

Learning to Generate Videos Using Neural Uncertainty Priors

no code implementations • 1 Jan 2021 • Moitreya Chatterjee, Anoop Cherian, Narendra Ahuja

Predicting the future frames of a video is a challenging task, in part due to the underlying stochastic real-world phenomena.

Video Generation

Paper
Add Code

Sound2Sight: Generating Visual Dynamics from Sound and Context

no code implementations • ECCV 2020 • Anoop Cherian, Moitreya Chatterjee, Narendra Ahuja

To tackle this problem, we present Sound2Sight, a deep variational framework, that is trained to learn a per frame stochastic prior conditioned on a joint embedding of audio and past frames.

Multimodal Reasoning

Paper
Add Code

Coreset-Based Neural Network Compression

no code implementations • ECCV 2018 • Abhimanyu Dubey, Moitreya Chatterjee, Narendra Ahuja

We propose a novel Convolutional Neural Network (CNN) compression algorithm based on coreset representations of filters.

Neural Network Compression Quantization

Paper
Add Code

DeepMVS: Learning Multi-view Stereopsis

1 code implementation • CVPR 2018 • Po-Han Huang, Kevin Matzen, Johannes Kopf, Narendra Ahuja, Jia-Bin Huang

We present DeepMVS, a deep convolutional neural network (ConvNet) for multi-view stereo reconstruction.

328

Paper
Code

Joint Image Filtering with Deep Convolutional Networks

no code implementations • 11 Oct 2017 • Yijun Li, Jia-Bin Huang, Narendra Ahuja, Ming-Hsuan Yang

In contrast to existing methods that consider only the guidance image, the proposed algorithm can selectively transfer salient structures that are consistent with both guidance and target images.

Paper
Add Code

Tracking Persons-of-Interest via Unsupervised Representation Adaptation

2 code implementations • 5 Oct 2017 • Shun Zhang, Jia-Bin Huang, Jongwoo Lim, Yihong Gong, Jinjun Wang, Narendra Ahuja, Ming-Hsuan Yang

Multi-face tracking in unconstrained videos is a challenging problem as faces of one person often appear drastically different in multiple shots due to significant variations in scale, pose, expression, illumination, and make-up.

Clustering

Paper
Code

Fast and Accurate Image Super-Resolution with Deep Laplacian Pyramid Networks

7 code implementations • 4 Oct 2017 • Wei-Sheng Lai, Jia-Bin Huang, Narendra Ahuja, Ming-Hsuan Yang

However, existing methods often require a large number of network parameters and entail heavy computational loads at runtime for generating high-accuracy super-resolution results.

Image Reconstruction Image Super-Resolution

213

Paper
Code

Robust Visual Tracking Using Oblique Random Forests

1 code implementation • CVPR 2017 • Le Zhang, Jagannadan Varadarajan, Ponnuthurai Nagaratnam Suganthan, Narendra Ahuja, Pierre Moulin

Unlike conventional orthogonal decision trees that use a single feature and heuristic measures to obtain a split at each node, we propose to use a more powerful proximal SVM to obtain oblique hyperplanes to capture the geometric structure of the data better.

General Classification Image Classification +5

Paper
Code

Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution

1 code implementation • CVPR 2017 • Wei-Sheng Lai, Jia-Bin Huang, Narendra Ahuja, Ming-Hsuan Yang

Convolutional neural networks have recently demonstrated high-quality reconstruction for single-image super-resolution.

Ranked #40 on Image Super-Resolution on BSD100 - 4x upscaling

Image Super-Resolution

Paper
Code

Detecting Migrating Birds at Night

no code implementations • CVPR 2016 • Jia-Bin Huang, Rich Caruana, Andrew Farnsworth, Steve Kelling, Narendra Ahuja

In this paper, we present a vision-based system for detecting migrating birds in flight at night.

Paper
Add Code

A Comparative Study for Single Image Blind Deblurring

no code implementations • CVPR 2016 • Wei-Sheng Lai, Jia-Bin Huang, Zhe Hu, Narendra Ahuja, Ming-Hsuan Yang

Using these datasets, we conduct a large-scale user study to quantify the performance of several representative state-of-the-art blind deblurring algorithms.

Single-Image Blind Deblurring

Paper
Add Code

Superpixel Hierarchy

1 code implementation • 20 May 2016 • Xing Wei, Qingxiong Yang, Yihong Gong, Ming-Hsuan Yang, Narendra Ahuja

Quantitative and qualitative evaluation on a number of computer vision applications was conducted, demonstrating that the proposed method is the top performer.

Image Segmentation Segmentation +2

Paper
Code

Uncovering Interactions and Interactors: Joint Estimation of Head, Body Orientation and F-Formations From Surveillance Videos

no code implementations • ICCV 2015 • Elisa Ricci, Jagannadan Varadarajan, Ramanathan Subramanian, Samuel Rota Bulo, Narendra Ahuja, Oswald Lanz

We present a novel approach for jointly estimating tar- gets' head, body orientations and conversational groups called F-formations from a distant social scene (e. g., a cocktail party captured by surveillance cameras).

TAR

Paper
Add Code

On the Equivalence of Moving Entrance Pupil and Radial Distortion for Camera Calibration

no code implementations • ICCV 2015 • Avinash Kumar, Narendra Ahuja

Using a thick-lens setting, we show that such a back-projection is more accurate than the two-step method of undistorting an image pixel and then back-projecting it.

Camera Calibration

Paper
Add Code

Single Image Super-Resolution From Transformed Self-Exemplars

no code implementations • CVPR 2015 • Jia-Bin Huang, Abhishek Singh, Narendra Ahuja

However, the internal dictionary obtained from the given image may not always be sufficiently expressive to cover the textural appearance variations in the scene.

Image Super-Resolution

Paper
Add Code

Structural Sparse Tracking

no code implementations • CVPR 2015 • Tianzhu Zhang, Si Liu, Changsheng Xu, Shuicheng Yan, Bernard Ghanem, Narendra Ahuja, Ming-Hsuan Yang

Sparse representation has been applied to visual tracking by finding the best target candidate with minimal reconstruction error by use of target templates.

Visual Tracking

Paper
Add Code

Super-Resolving Noisy Images

no code implementations • CVPR 2014 • Abhishek Singh, Fatih Porikli, Narendra Ahuja

We then show that by taking a convex combination of orientation and frequency selective bands of the noisy and the denoised HR images, we can obtain a desired HR image where (i) some of the textural signal lost in the denoising step is effectively recovered in the HR domain, and (ii) additional textures can be easily synthesized by appropriately constraining the parameters of the convex combination.

Denoising Super-Resolution

Paper
Add Code

Generalized Pupil-Centric Imaging and Analytical Calibration for a Non-frontal Camera

no code implementations • CVPR 2014 • Avinash Kumar, Narendra Ahuja

The second approach based on decentering distortion modeling is approximate as it can only handle small tilts and cannot explicitly estimate the sensor tilt.

Camera Calibration

Paper
Add Code

Robust Orthonormal Subspace Learning: Efficient Recovery of Corrupted Low-rank Matrices

no code implementations • CVPR 2014 • Xianbiao Shu, Fatih Porikli, Narendra Ahuja

Low-rank matrix recovery from a corrupted observation has many applications in computer vision.

Paper
Add Code

Partial Occlusion Handling for Visual Tracking via Robust Part Matching

no code implementations • CVPR 2014 • Tianzhu Zhang, Kui Jia, Changsheng Xu, Yi Ma, Narendra Ahuja

The proposed part matching tracker (PMT) has a number of attractive properties.

Occlusion Handling Sparse Learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.