Search Results for author: Xuan Yang

Found 17 papers, 7 papers with code

A Concept-based Interpretable Model for the Diagnosis of Choroid Neoplasias using Multimodal Data

no code implementations • 8 Mar 2024 • Yifan Wu, Yang Liu, Yue Yang, Michael S. Yao, Wenli Yang, Xuehui Shi, Lihong Yang, Dongjun Li, Yueming Liu, James C. Gee, Xuan Yang, Wenbin Wei, Shi Gu

Diagnosing rare diseases presents a common challenge in clinical practice, necessitating the expertise of specialists for accurate identification.

Interpretable Machine Learning

Paper
Add Code

VideoPoet: A Large Language Model for Zero-Shot Video Generation

no code implementations • 21 Dec 2023 • Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Grant Schindler, Rachel Hornung, Vighnesh Birodkar, Jimmy Yan, Ming-Chang Chiu, Krishna Somandepalli, Hassan Akbari, Yair Alon, Yong Cheng, Josh Dillon, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, Mikhail Sirotenko, Kihyuk Sohn, Xuan Yang, Hartwig Adam, Ming-Hsuan Yang, Irfan Essa, Huisheng Wang, David A. Ross, Bryan Seybold, Lu Jiang

We present VideoPoet, a language model capable of synthesizing high-quality video, with matching audio, from a large variety of conditioning signals.

Ranked #3 on Text-to-Video Generation on MSR-VTT

Decoder Language Modelling +3

Paper
Add Code

PolyMaX: General Dense Prediction with Mask Transformer

1 code implementation • 9 Nov 2023 • Xuan Yang, Liangzhe Yuan, Kimberly Wilber, Astuti Sharma, Xiuye Gu, Siyuan Qiao, Stephanie Debats, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Liang-Chieh Chen

Despite this shift, methods based on the per-pixel prediction paradigm still dominate the benchmarks on the other dense prediction tasks that require continuous outputs, such as depth estimation and surface normal prediction.

Ranked #2 on Surface Normals Estimation on NYU Depth v2

Monocular Depth Estimation Semantic Segmentation +2

990

Paper
Code

SANPO: A Scene Understanding, Accessibility, Navigation, Pathfinding, Obstacle Avoidance Dataset

no code implementations • 21 Sep 2023 • Sagar M. Waghmare, Kimberly Wilber, Dave Hawkey, Xuan Yang, Matthew Wilson, Stephanie Debats, Cattalyya Nuengsigkapian, Astuti Sharma, Lars Pandikow, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko

All synthetic sessions and a subset of real sessions have temporally consistent dense panoptic segmentation labels.

Depth Estimation Domain Adaptation +5

Paper
Add Code

VideoGLUE: Video General Understanding Evaluation of Foundation Models

1 code implementation • 6 Jul 2023 • Liangzhe Yuan, Nitesh Bharadwaj Gundavarapu, Long Zhao, Hao Zhou, Yin Cui, Lu Jiang, Xuan Yang, Menglin Jia, Tobias Weyand, Luke Friedman, Mikhail Sirotenko, Huisheng Wang, Florian Schroff, Hartwig Adam, Ming-Hsuan Yang, Ting Liu, Boqing Gong

We evaluate existing foundation models video understanding capabilities using a carefully designed experiment protocol consisting of three hallmark tasks (action recognition, temporal localization, and spatiotemporal localization), eight datasets well received by the community, and four adaptation methods tailoring a foundation model (FM) for a downstream task.

Action Recognition Temporal Localization +1

76,633

Paper
Code

Spatial-temporal traffic modeling with a fusion graph reconstructed by tensor decomposition

no code implementations • 12 Dec 2022 • Qin Li, Xuan Yang, Yong Wang, Yuankai Wu, Deqiang He

This paper proposes reconstructing the binary adjacency matrix via tensor decomposition, and a traffic flow forecasting method is proposed.

Open-Ended Question Answering Tensor Decomposition

Paper
Add Code

MMGA: Multimodal Learning with Graph Alignment

no code implementations • 18 Oct 2022 • Xuan Yang, Quanjin Tao, Xiao Feng, Donghong Cai, Xiang Ren, Yang Yang

In this paper, we propose MMGA (Multimodal learning with Graph Alignment), a novel multimodal pre-training framework to incorporate information from graph (social network), image and text modalities on social media to enhance user representation learning.

Representation Learning

Paper
Add Code

On Label Granularity and Object Localization

1 code implementation • 20 Jul 2022 • Elijah Cole, Kimberly Wilber, Grant van Horn, Xuan Yang, Marco Fornoni, Pietro Perona, Serge Belongie, Andrew Howard, Oisin Mac Aodha

Weakly supervised object localization (WSOL) aims to learn representations that encode object location using only image-level category labels.

Object Weakly-Supervised Object Localization

Paper
Code

DropMessage: Unifying Random Dropping for Graph Neural Networks

2 code implementations • 21 Apr 2022 • Taoran Fang, Zhiqing Xiao, Chunping Wang, Jiarong Xu, Xuan Yang, Yang Yang

First, it is challenging to find a universal method that are suitable for all cases considering the divergence of different datasets and models.

Graph Representation Learning

Paper
Code

Coverage Control Algorithm for DSNs Based on Improved Gravitational Search

no code implementations • IEEE Sensors Journal 2022 • Yindi Yao, Huanmin Liao, Xiong Li, Student Member, IEEE, Feng Zhao, Xuan Yang, and Shanshan Hu

—In directional sensor networks (DSNs), coverage control is an important way to ensure efficient communication and reliable data transmission.

Position

Paper
Add Code

FCCDN: Feature Constraint Network for VHR Image Change Detection

2 code implementations • 23 May 2021 • Pan Chen, Danfeng Hong, Zhengchao Chen, Xuan Yang, Baipeng Li, Bing Zhang

Moreover, a self-supervised learning-based strategy is proposed to constrain feature learning.

Change Detection Decoder +2

343

Paper
Code

When Does Contrastive Visual Representation Learning Work?

no code implementations • CVPR 2022 • Elijah Cole, Xuan Yang, Kimberly Wilber, Oisin Mac Aodha, Serge Belongie

Recent self-supervised representation learning techniques have largely closed the gap between supervised and unsupervised learning on ImageNet classification.

Contrastive Learning Fine-Grained Image Classification +2

Paper
Add Code

An Attention-Fused Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery

no code implementations • 10 May 2021 • Xuan Yang, Shanshan Li, Zhengchao Chen, Jocelyn Chanussot, Xiuping Jia, Bing Zhang, Baipeng Li, Pan Chen

Semantic segmentation is an essential part of deep learning.

Semantic Segmentation

Paper
Add Code

A Fast and Precise Method for Large-Scale Land-Use Mapping Based on Deep Learning

no code implementations • 9 Aug 2019 • Xuan Yang, Zhengchao Chen, Baipeng Li, Dailiang Peng, Pan Chen, Bing Zhang

The land-use map is an important data that can reflect the use and transformation of human land, and can provide valuable reference for land-use planning.

Classification General Classification +1

Paper
Add Code

DNN Dataflow Choice Is Overrated

no code implementations • 10 Sep 2018 • Xuan Yang, Mingyu Gao, Jing Pu, Ankita Nayak, Qiaoyi Liu, Steven Emberton Bell, Jeff Ou Setter, Kaidi Cao, Heonjae Ha, Christos Kozyrakis, Mark Horowitz

Many DNN accelerators have been proposed and built using different microarchitectures and program mappings.

Distributed, Parallel, and Cluster Computing

Paper
Add Code

Programming Heterogeneous Systems from an Image Processing DSL

3 code implementations • 28 Oct 2016 • Jing Pu, Steven Bell, Xuan Yang, Jeff Setter, Stephen Richardson, Jonathan Ragan-Kelley, Mark Horowitz

We address this problem by extending the image processing language, Halide, so users can specify which portions of their applications should become hardware accelerators, and then we provide a compiler that uses this code to automatically create the accelerator along with the "glue" code needed for the user's application to access this hardware.

Software Engineering

Paper
Code

A Systematic Approach to Blocking Convolutional Neural Networks

1 code implementation • 14 Jun 2016 • Xuan Yang, Jing Pu, Blaine Burton Rister, Nikhil Bhagdikar, Stephen Richardson, Shahar Kvatinsky, Jonathan Ragan-Kelley, Ardavan Pedram, Mark Horowitz

Convolutional Neural Networks (CNNs) are the state of the art solution for many computer vision problems, and many researchers have explored optimized implementations.

Blocking

207

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.