Search Results for author: Yilin Wang

Found 55 papers, 21 papers with code

The CRECIL Corpus: a New Dataset for Extraction of Relations between Characters in Chinese Multi-party Dialogues

1 code implementation • LREC 2022 • Yuru Jiang, Yang Xu, Yuhang Zhan, WeiKai He, Yilin Wang, Zixuan Xi, Meiyun Wang, Xinyu Li, Yu Li, Yanchao Yu

We describe a new freely available Chinese multi-party dialogue dataset for automatic extraction of dialogue-based character relationships.

Relation Extraction

Paper
Code

SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing

no code implementations • 8 Apr 2024 • Jing Gu, Yilin Wang, Nanxuan Zhao, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Xin Eric Wang

Compared with existing methods for personalized subject swapping, SwapAnything has three unique advantages: (1) precise control of arbitrary objects and parts rather than the main subject, (2) more faithful preservation of context pixels, (3) better adaptation of the personalized concept to the image.

Image Generation Object

Paper
Add Code

KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on Large Language Models for Knowledge Graph Completion

1 code implementation • 26 Mar 2024 • Yilin Wang, Minghao Hu, Zhen Huang, Dongsheng Li, Dong Yang, Xicheng Lu

Previous methods for KGC re-ranking are mostly built on non-generative language models to obtain the probability of each candidate.

Knowledge Graph Completion Re-Ranking

Paper
Code

Bayesian Diffusion Models for 3D Shape Reconstruction

no code implementations • 11 Mar 2024 • Haiyang Xu, Yu Lei, Zeyuan Chen, Xiang Zhang, Yue Zhao, Yilin Wang, Zhuowen Tu

We present Bayesian Diffusion Models (BDM), a prediction algorithm that performs effective Bayesian inference by tightly coupling the top-down (prior) information with the bottom-up (data-driven) procedure via joint diffusion processes.

3D Reconstruction 3D Shape Reconstruction +1

Paper
Add Code

UniHuman: A Unified Model for Editing Human Images in the Wild

1 code implementation • 22 Dec 2023 • Nannan Li, Qing Liu, Krishna Kumar Singh, Yilin Wang, Jianming Zhang, Bryan A. Plummer, Zhe Lin

In this paper, we propose UniHuman, a unified model that addresses multiple facets of human image editing in real-world settings.

Paper
Code

TokenCompose: Grounding Diffusion with Token-level Supervision

1 code implementation • 6 Dec 2023 • ZiRui Wang, Zhizhou Sha, Zheng Ding, Yilin Wang, Zhuowen Tu

We present TokenCompose, a Latent Diffusion Model for text-to-image generation that achieves enhanced consistency between user-specified text prompts and model-generated images.

Denoising Object +1

Paper
Code

Learning Mutually Informed Representations for Characters and Subwords

1 code implementation • 14 Nov 2023 • Yilin Wang, Xinyi Hu, Matthew R. Gormley

In this paper, we introduce the entanglement model, aiming to combine character and subword language models.

named-entity-recognition Named Entity Recognition +4

Paper
Code

Dolfin: Diffusion Layout Transformers without Autoencoder

no code implementations • 25 Oct 2023 • Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhizhou Sha, Zhuowen Tu

In this paper, we introduce a novel generative model, Diffusion Layout Transformers without Autoencoder (Dolfin), which significantly improves the modeling capability with reduced complexity compared to existing methods.

Paper
Add Code

Quantum-Enhanced Forecasting: Leveraging Quantum Gramian Angular Field and CNNs for Stock Return Predictions

no code implementations • 11 Oct 2023 • Zhengmeng Xu, Yujie Wang, Xiaotong Feng, Yilin Wang, Yanli Li, Hai Lin

We propose a time series forecasting method named Quantum Gramian Angular Field (QGAF).

Time Series Time Series Classification +2

Paper
Add Code

Scalable Optimal Margin Distribution Machine

2 code implementations • 8 May 2023 • Yilin Wang, Nan Cao, Teng Zhang, Xuanhua Shi, Hai Jin

Optimal margin Distribution Machine (ODM) is a newly proposed statistical learning framework rooting in the novel margin theory, which demonstrates better generalization performance than the traditional large margin based counterparts.

Paper
Code

LightPainter: Interactive Portrait Relighting with Freehand Scribble

no code implementations • CVPR 2023 • Yiqun Mei, He Zhang, Xuaner Zhang, Jianming Zhang, Zhixin Shu, Yilin Wang, Zijun Wei, Shi Yan, HyunJoon Jung, Vishal M. Patel

Recent portrait relighting methods have achieved realistic results of portrait lighting effects given a desired lighting representation such as an environment map.

Paper
Add Code

MRET: Multi-resolution Transformer for Video Quality Assessment

no code implementations • 13 Mar 2023 • Junjie Ke, Tianhao Zhang, Yilin Wang, Peyman Milanfar, Feng Yang

No-reference video quality assessment (NR-VQA) for user generated content (UGC) is crucial for understanding and improving visual experience.

Video Quality Assessment Video Recognition +1

Paper
Add Code

IDM-Follower: A Model-Informed Deep Learning Method for Long-Sequence Car-Following Trajectory Prediction

no code implementations • 20 Oct 2022 • Yilin Wang, Yiheng Feng

Model-based and learning-based methods are two major types of methodologies to model car following behaviors.

Decoder Trajectory Prediction

Paper
Add Code

CONVIQT: Contrastive Video Quality Estimator

1 code implementation • 29 Jun 2022 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

Perceptual video quality assessment (VQA) is an integral component of many streaming and video sharing platforms.

Ranked #1 on Video Quality Assessment on LIVE-ETRI (using extra training data)

Self-Supervised Learning Video Quality Assessment

Paper
Code

On the Role of Generalization in Transferability of Adversarial Examples

no code implementations • 18 Jun 2022 • Yilin Wang, Farzan Farnia

We support our theoretical results by performing several numerical experiments showing the role of the substitute network's generalization in generating transferable adversarial examples.

Generalization Bounds

Paper
Add Code

Making Video Quality Assessment Models Sensitive to Frame Rate Distortions

no code implementations • 21 May 2022 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

We consider the problem of capturing distortions arising from changes in frame rate as part of Video Quality Assessment (VQA).

Video Quality Assessment Visual Question Answering (VQA)

Paper
Add Code

A Video Anomaly Detection Framework based on Appearance-Motion Semantics Representation Consistency

no code implementations • 8 Apr 2022 • Xiangyu Huang, Caidan Zhao, Yilin Wang, Zhiqiang Wu

Firstly, we design a two-stream encoder to encode the appearance and motion information representations of normal samples and introduce constraints to further enhance the consistency of the feature semantics between appearance and motion information of normal samples so that abnormal samples with low consistency appearance and motion feature representation can be identified.

Ranked #2 on Anomaly Detection on CUHK Avenue

Anomaly Detection Optical Flow Estimation +1

Paper
Add Code

Perceptual Quality Assessment of UGC Gaming Videos

no code implementations • 31 Mar 2022 • Xiangxu Yu, Zhengzhong Tu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

In recent years, with the vigorous development of the video game industry, the proportion of gaming videos on major video websites like YouTube has dramatically increased.

Video Quality Assessment Visual Question Answering (VQA)

Paper
Add Code

Subjective and Objective Analysis of Streamed Gaming Videos

no code implementations • 24 Mar 2022 • Xiangxu Yu, Zhenqiang Ying, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

A number of studies have been directed towards understanding the perceptual characteristics of professionally generated gaming videos arising in gaming video streaming, online gaming, and cloud gaming.

Video Quality Assessment Visual Question Answering (VQA)

Paper
Add Code

Interactive Portrait Harmonization

no code implementations • 15 Mar 2022 • Jeya Maria Jose Valanarasu, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Jose Echevarria, Yinglan Ma, Zijun Wei, Kalyan Sunkavalli, Vishal M. Patel

To enable flexible interaction between user and harmonization, we introduce interactive harmonization, a new setting where the harmonization is performed with respect to a selected \emph{region} in the reference image instead of the entire background.

Image Harmonization

Paper
Add Code

Lite Vision Transformer with Enhanced Self-Attention

1 code implementation • CVPR 2022 • Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zijun Wei, Zhe Lin, Alan Yuille

We propose Lite Vision Transformer (LVT), a novel light-weight transformer network with two enhanced self-attention mechanisms to improve the model performances for mobile deployment.

Panoptic Segmentation Segmentation

131

Paper
Code

Image Quality Assessment using Contrastive Learning

2 code implementations • 25 Oct 2021 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

We consider the problem of obtaining image quality representations in a self-supervised manner.

Ranked #2 on Video Quality Assessment on LIVE-ETRI (using extra training data)

Blind Image Quality Assessment Contrastive Learning +3

117

Paper
Code

Distributed Optimal Margin Distribution Machine

no code implementations • 29 Sep 2021 • Yilin Wang, Nan Cao, Teng Zhang, Hai Jin

Optimal margin Distribution Machine (ODM), a newly proposed statistical learning framework rooting in the novel margin theory, demonstrates better generalization performance than the traditional large margin based counterparts.

Paper
Add Code

High Frame Rate Video Quality Assessment using VMAF and Entropic Differences

no code implementations • 27 Sep 2021 • Pavan C Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

In this work we address the problem of frame rate dependent Video Quality Assessment (VQA) when the videos to be compared have different frame rate and compression factor.

Ranked #2 on Video Quality Assessment on LIVE-YT-HFR

Video Quality Assessment Visual Question Answering (VQA) +1

Paper
Add Code

SSH: A Self-Supervised Framework for Image Harmonization

1 code implementation • ICCV 2021 • Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang

Image harmonization aims to improve the quality of image compositing by matching the "appearance" (\eg, color tone, brightness and contrast) between foreground and background images.

Benchmarking Data Augmentation +1

Paper
Code

MUSIQ: Multi-scale Image Quality Transformer

2 code implementations • ICCV 2021 • Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, Feng Yang

To accommodate this, the input images are usually resized and cropped to a fixed shape, causing image quality degradation.

Ranked #3 on Image Quality Assessment on MSU NR VQA Database

Image Quality Assessment Video Quality Assessment

33,156

Paper
Code

Rich Features for Perceptual Quality Assessment of UGC Videos

no code implementations • CVPR 2021 • Yilin Wang, Junjie Ke, Hossein Talebi, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli, Peyman Milanfar, Feng Yang

Besides the subjective ratings and content labels of the dataset, we also propose a DNN-based framework to thoroughly analyze importance of content, technical quality, and compression level in perceptual quality.

Video Quality Assessment

Paper
Add Code

Making CNNs Interpretable by Building Dynamic Sequential Decision Forests with Top-down Hierarchy Learning

no code implementations • 5 Jun 2021 • Yilin Wang, Shaozuo Yu, Xiaokang Yang, Wei Shen

In this paper, we propose a generic model transfer scheme to make Convlutional Neural Networks (CNNs) interpretable, while maintaining their high classification accuracy.

Classification

Paper
Add Code

Multimodal Contrastive Training for Visual Representation Learning

no code implementations • CVPR 2021 • Xin Yuan, Zhe Lin, Jason Kuen, Jianming Zhang, Yilin Wang, Michael Maire, Ajinkya Kale, Baldo Faieta

We first train our model on COCO and evaluate the learned visual representations on various downstream tasks including image classification, object detection, and instance segmentation.

Cross-Modal Retrieval Image Classification +6

Paper
Add Code

Classifying Video based on Automatic Content Detection Overview

no code implementations • 29 Mar 2021 • Yilin Wang, Jiayi Ye

Video classification and analysis is always a popular and challenging field in computer vision.

Classification General Classification +3

Paper
Add Code

Regression or Classification? New Methods to Evaluate No-Reference Picture and Video Quality Models

no code implementations • 30 Jan 2021 • Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

Video and image quality assessment has long been projected as a regression problem, which requires predicting a continuous quality score given an input stimulus.

General Classification Image Quality Assessment +2

Paper
Add Code

RAPIQUE: Rapid and Accurate Video Quality Prediction of User Generated Content

1 code implementation • 26 Jan 2021 • Zhengzhong Tu, Xiangxu Yu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

However, these models are either incapable or inefficient for predicting the quality of complex and diverse UGC videos in practical applications.

Ranked #4 on Video Quality Assessment on LIVE Livestream

Video Quality Assessment

Paper
Code

Electronic Correlations and Absence of Superconductivity in the Collapsed Phase of LaFe$_2$As$_2$

no code implementations • 22 Dec 2020 • Jianzhou Zhao, Yilin Wang, Xiaolong Feng, Shengyuan A. Yang

Our results indicate that the electronic structures of LaFe$_2$As$_2$ and CaFe$_2$As$_2$ are not too different, which further suggest that superconductivity might also be induced in the collapsed phase of LaFe$_2$As$_2$ under similar non-hydrostatic conditions as for CaFe$_2$As$_2$.

Strongly Correlated Electrons Superconductivity

Paper
Add Code

Meticulous Object Segmentation

1 code implementation • 13 Dec 2020 • Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zhe Lin, Alan Yuille

To evaluate segmentation quality near object boundaries, we propose the Meticulosity Quality (MQ) score considering both the mask coverage and boundary precision.

2k 4k +5

Paper
Code

Mask Guided Matting via Progressive Refinement Network

1 code implementation • CVPR 2021 • Qihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan Yuille

We propose Mask Guided (MG) Matting, a robust matting framework that takes a general coarse mask as guidance.

Image Matting

322

Paper
Code

The Loewner-Kufarev Energy and Foliations by Weil-Petersson Quasicircles

no code implementations • 10 Dec 2020 • Fredrik Viklund, Yilin Wang

Moreover, if either of these two energies is finite they are equal up to a constant factor, and in this case, the foliation leaves are Weil-Petersson quasicircles.

Complex Variables Mathematical Physics Mathematical Physics Probability

Paper
Add Code

An Overview Of 3D Object Detection

no code implementations • 29 Oct 2020 • Yilin Wang, Jiayi Ye

Point cloud 3D object detection has recently received major attention and becomes an active research topic in 3D computer vision community.

3D Object Detection Object +2

Paper
Add Code

ST-GREED: Space-Time Generalized Entropic Differences for Frame Rate Dependent Video Quality Prediction

1 code implementation • 26 Oct 2020 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

We consider the problem of conducting frame rate dependent video quality assessment (VQA) on videos of diverse frame rates, including high frame rate (HFR) videos.

Ranked #1 on Video Quality Assessment on LIVE-YT-HFR

Video Quality Assessment Visual Question Answering (VQA)

Paper
Code

A Feasible Level Proximal Point Method for Nonconvex Sparse Constrained Optimization

no code implementations • NeurIPS 2020 • Digvijay Boob, Qi Deng, Guanghui Lan, Yilin Wang

We also establish new convergence complexities to achieve an approximate KKT solution when the objective can be smooth/nonsmooth, deterministic/stochastic and convex/nonconvex with complexity that is on a par with gradient descent for unconstrained optimization problems in respective cases.

Paper
Add Code

Adaptive Debanding Filter

1 code implementation • 22 Sep 2020 • Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik

Banding artifacts, which manifest as staircase-like color bands on pictures or video frames, is a common distortion caused by compression of low-textured smooth regions.

Quantization

Paper
Code

Shape Adaptor: A Learnable Resizing Module

1 code implementation • ECCV 2020 • Shikun Liu, Zhe Lin, Yilin Wang, Jianming Zhang, Federico Perazzi, Edward Johns

We present a novel resizing module for neural networks: shape adaptor, a drop-in enhancement built on top of traditional resizing layers, such as pooling, bilinear sampling, and strided convolution.

Image Classification Neural Architecture Search +1

Paper
Code

Subjective and Objective Quality Assessment of High Frame Rate Videos

1 code implementation • 22 Jul 2020 • Pavan C. Madhusudana, Xiangxu Yu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

We also conducted a holistic evaluation of existing state-of-the-art Full and No-Reference video quality algorithms, and statistically benchmarked their performance on the new database.

Vocal Bursts Intensity Prediction

Paper
Code

Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation

no code implementations • ECCV 2020 • Kenan E. Ak, Ning Xu, Zhe Lin, Yilin Wang

To our best knowledge, the proposed method is first to enable adversarial learning in autoregressive models for image generation.

Image Generation

Paper
Add Code

GIFnets: Differentiable GIF Encoding Framework

no code implementations • CVPR 2020 • Innfarn Yoo, Xiyang Luo, Yilin Wang, Feng Yang, Peyman Milanfar

DitherNet manipulates the input image to reduce color banding artifacts and provides an alternative to traditional dithering.

Paper
Add Code

Capturing Video Frame Rate Variations via Entropic Differencing

no code implementations • 19 Jun 2020 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

High frame rate videos are increasingly getting popular in recent years, driven by the strong requirements of the entertainment and streaming industries to provide high quality of experiences to consumers.

Ranked #3 on Video Quality Assessment on LIVE-YT-HFR

Video Quality Assessment

Paper
Add Code

UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content

5 code implementations • 29 May 2020 • Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

Recent years have witnessed an explosion of user-generated content (UGC) videos shared and streamed over the Internet, thanks to the evolution of affordable and reliable consumer capture devices, and the tremendous popularity of social media platforms.

Ranked #11 on Video Quality Assessment on YouTube-UGC

Benchmarking feature selection +2

123

Paper
Code

BBAND Index: A No-Reference Banding Artifact Predictor

no code implementations • 27 Feb 2020 • Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik

Banding artifact, or false contouring, is a common video compression impairment that tends to appear on large flat regions in encoded videos.

Video Compression

Paper
Add Code

YouTube UGC Dataset for Video Compression Research

1 code implementation • 13 Apr 2019 • Yilin Wang, Sasi Inguva, Balu Adsumilli

However, traditional metrics used in compression and quality assessment, like BD-Rate and PSNR, are designed for pristine originals.

Multimedia Image and Video Processing

Paper
Code

Multimodal Style Transfer via Graph Cuts

2 code implementations • ICCV 2019 • Yulun Zhang, Chen Fang, Yilin Wang, Zhaowen Wang, Zhe Lin, Yun Fu, Jimei Yang

An assumption widely used in recent neural style transfer methods is that image styles can be described by global statics of deep features like Gram or covariance matrices.

Style Transfer

Paper
Code

Generalizing Graph Matching beyond Quadratic Assignment Model

no code implementations • NeurIPS 2018 • Tianshu Yu, Junchi Yan, Yilin Wang, Wei Liu, Baoxin Li

Graph matching has received persistent attention over decades, which can be formulated as a quadratic assignment problem (QAP).

Graph Matching

Paper
Add Code

Improving Vision-based Self-positioning in Intelligent Transportation Systems via Integrated Lane and Vehicle Detection

no code implementations • 5 Apr 2017 • Parag S. Chandakkar, Yilin Wang, Baoxin Li

In the framework, the number of lanes, the vehicle's position in those lanes and the presence of other vehicles are considered as parameters.

Computational Efficiency Density Estimation +1

Paper
Add Code

Hierarchical Attention Network for Action Recognition in Videos

no code implementations • 21 Jul 2016 • Yilin Wang, Suhang Wang, Jiliang Tang, Neil O'Hare, Yi Chang, Baoxin Li

Understanding human actions in wild videos is an important task with a broad range of applications.

Action Recognition In Videos Action Understanding +1

Paper
Add Code

PPP: Joint Pointwise and Pairwise Image Label Prediction

no code implementations • CVPR 2016 • Yilin Wang, Suhang Wang, Jiliang Tang, Huan Liu, Baoxin Li

However, pointwise labels in image classification and tag annotation are inherently related to the pairwise labels.

Attribute General Classification +2

Paper
Add Code

Unsupervised Video Analysis Based on a Spatiotemporal Saliency Detector

no code implementations • 24 Mar 2015 • Qiang Zhang, Yilin Wang, Baoxin Li

Recently, the spectrum analysis based visual saliency approach has attracted a lot of interest due to its simplicity and good performance, where the phase information of the image is used to construct the saliency map.

Anomaly Detection Foreground Segmentation +5

Paper
Add Code

Stereo under Sequential Optimal Sampling: A Statistical Analysis Framework for Search Space Reduction

no code implementations • CVPR 2014 • Yilin Wang, Ke Wang, Enrique Dunn, Jan-Michael Frahm

We develop a sequential optimal sampling framework for stereo disparity estimation by adapting the Sequential Probability Ratio Test (SPRT) model.

Disparity Estimation Stereo Disparity Estimation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.