1 code implementation • LREC 2022 • Yuru Jiang, Yang Xu, Yuhang Zhan, WeiKai He, Yilin Wang, Zixuan Xi, Meiyun Wang, Xinyu Li, Yu Li, Yanchao Yu
We describe a new freely available Chinese multi-party dialogue dataset for automatic extraction of dialogue-based character relationships.
no code implementations • 8 Apr 2024 • Jing Gu, Yilin Wang, Nanxuan Zhao, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Xin Eric Wang
Compared with existing methods for personalized subject swapping, SwapAnything has three unique advantages: (1) precise control of arbitrary objects and parts rather than the main subject, (2) more faithful preservation of context pixels, (3) better adaptation of the personalized concept to the image.
1 code implementation • 26 Mar 2024 • Yilin Wang, Minghao Hu, Zhen Huang, Dongsheng Li, Dong Yang, Xicheng Lu
Previous methods for KGC re-ranking are mostly built on non-generative language models to obtain the probability of each candidate.
no code implementations • 11 Mar 2024 • Haiyang Xu, Yu Lei, Zeyuan Chen, Xiang Zhang, Yue Zhao, Yilin Wang, Zhuowen Tu
We present Bayesian Diffusion Models (BDM), a prediction algorithm that performs effective Bayesian inference by tightly coupling the top-down (prior) information with the bottom-up (data-driven) procedure via joint diffusion processes.
1 code implementation • 22 Dec 2023 • Nannan Li, Qing Liu, Krishna Kumar Singh, Yilin Wang, Jianming Zhang, Bryan A. Plummer, Zhe Lin
In this paper, we propose UniHuman, a unified model that addresses multiple facets of human image editing in real-world settings.
1 code implementation • 6 Dec 2023 • ZiRui Wang, Zhizhou Sha, Zheng Ding, Yilin Wang, Zhuowen Tu
We present TokenCompose, a Latent Diffusion Model for text-to-image generation that achieves enhanced consistency between user-specified text prompts and model-generated images.
1 code implementation • 14 Nov 2023 • Yilin Wang, Xinyi Hu, Matthew R. Gormley
In this paper, we introduce the entanglement model, aiming to combine character and subword language models.
no code implementations • 25 Oct 2023 • Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhizhou Sha, Zhuowen Tu
In this paper, we introduce a novel generative model, Diffusion Layout Transformers without Autoencoder (Dolfin), which significantly improves the modeling capability with reduced complexity compared to existing methods.
no code implementations • 11 Oct 2023 • Zhengmeng Xu, Yujie Wang, Xiaotong Feng, Yilin Wang, Yanli Li, Hai Lin
We propose a time series forecasting method named Quantum Gramian Angular Field (QGAF).
2 code implementations • 8 May 2023 • Yilin Wang, Nan Cao, Teng Zhang, Xuanhua Shi, Hai Jin
Optimal margin Distribution Machine (ODM) is a newly proposed statistical learning framework rooting in the novel margin theory, which demonstrates better generalization performance than the traditional large margin based counterparts.
no code implementations • CVPR 2023 • Yiqun Mei, He Zhang, Xuaner Zhang, Jianming Zhang, Zhixin Shu, Yilin Wang, Zijun Wei, Shi Yan, HyunJoon Jung, Vishal M. Patel
Recent portrait relighting methods have achieved realistic results of portrait lighting effects given a desired lighting representation such as an environment map.
no code implementations • 13 Mar 2023 • Junjie Ke, Tianhao Zhang, Yilin Wang, Peyman Milanfar, Feng Yang
No-reference video quality assessment (NR-VQA) for user generated content (UGC) is crucial for understanding and improving visual experience.
no code implementations • 20 Oct 2022 • Yilin Wang, Yiheng Feng
Model-based and learning-based methods are two major types of methodologies to model car following behaviors.
1 code implementation • 29 Jun 2022 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
Perceptual video quality assessment (VQA) is an integral component of many streaming and video sharing platforms.
Ranked #1 on Video Quality Assessment on LIVE-ETRI (using extra training data)
no code implementations • 18 Jun 2022 • Yilin Wang, Farzan Farnia
We support our theoretical results by performing several numerical experiments showing the role of the substitute network's generalization in generating transferable adversarial examples.
no code implementations • 21 May 2022 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We consider the problem of capturing distortions arising from changes in frame rate as part of Video Quality Assessment (VQA).
no code implementations • 8 Apr 2022 • Xiangyu Huang, Caidan Zhao, Yilin Wang, Zhiqiang Wu
Firstly, we design a two-stream encoder to encode the appearance and motion information representations of normal samples and introduce constraints to further enhance the consistency of the feature semantics between appearance and motion information of normal samples so that abnormal samples with low consistency appearance and motion feature representation can be identified.
Ranked #2 on Anomaly Detection on CUHK Avenue
no code implementations • 31 Mar 2022 • Xiangxu Yu, Zhengzhong Tu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
In recent years, with the vigorous development of the video game industry, the proportion of gaming videos on major video websites like YouTube has dramatically increased.
no code implementations • 24 Mar 2022 • Xiangxu Yu, Zhenqiang Ying, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
A number of studies have been directed towards understanding the perceptual characteristics of professionally generated gaming videos arising in gaming video streaming, online gaming, and cloud gaming.
no code implementations • 15 Mar 2022 • Jeya Maria Jose Valanarasu, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Jose Echevarria, Yinglan Ma, Zijun Wei, Kalyan Sunkavalli, Vishal M. Patel
To enable flexible interaction between user and harmonization, we introduce interactive harmonization, a new setting where the harmonization is performed with respect to a selected \emph{region} in the reference image instead of the entire background.
1 code implementation • CVPR 2022 • Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zijun Wei, Zhe Lin, Alan Yuille
We propose Lite Vision Transformer (LVT), a novel light-weight transformer network with two enhanced self-attention mechanisms to improve the model performances for mobile deployment.
2 code implementations • 25 Oct 2021 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We consider the problem of obtaining image quality representations in a self-supervised manner.
Ranked #2 on Video Quality Assessment on LIVE-ETRI (using extra training data)
no code implementations • 29 Sep 2021 • Yilin Wang, Nan Cao, Teng Zhang, Hai Jin
Optimal margin Distribution Machine (ODM), a newly proposed statistical learning framework rooting in the novel margin theory, demonstrates better generalization performance than the traditional large margin based counterparts.
no code implementations • 27 Sep 2021 • Pavan C Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
In this work we address the problem of frame rate dependent Video Quality Assessment (VQA) when the videos to be compared have different frame rate and compression factor.
Ranked #2 on Video Quality Assessment on LIVE-YT-HFR
1 code implementation • ICCV 2021 • Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang
Image harmonization aims to improve the quality of image compositing by matching the "appearance" (\eg, color tone, brightness and contrast) between foreground and background images.
2 code implementations • ICCV 2021 • Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, Feng Yang
To accommodate this, the input images are usually resized and cropped to a fixed shape, causing image quality degradation.
Ranked #3 on Image Quality Assessment on MSU NR VQA Database
no code implementations • CVPR 2021 • Yilin Wang, Junjie Ke, Hossein Talebi, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli, Peyman Milanfar, Feng Yang
Besides the subjective ratings and content labels of the dataset, we also propose a DNN-based framework to thoroughly analyze importance of content, technical quality, and compression level in perceptual quality.
no code implementations • 5 Jun 2021 • Yilin Wang, Shaozuo Yu, Xiaokang Yang, Wei Shen
In this paper, we propose a generic model transfer scheme to make Convlutional Neural Networks (CNNs) interpretable, while maintaining their high classification accuracy.
no code implementations • CVPR 2021 • Xin Yuan, Zhe Lin, Jason Kuen, Jianming Zhang, Yilin Wang, Michael Maire, Ajinkya Kale, Baldo Faieta
We first train our model on COCO and evaluate the learned visual representations on various downstream tasks including image classification, object detection, and instance segmentation.
no code implementations • 29 Mar 2021 • Yilin Wang, Jiayi Ye
Video classification and analysis is always a popular and challenging field in computer vision.
no code implementations • 30 Jan 2021 • Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Video and image quality assessment has long been projected as a regression problem, which requires predicting a continuous quality score given an input stimulus.
1 code implementation • 26 Jan 2021 • Zhengzhong Tu, Xiangxu Yu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
However, these models are either incapable or inefficient for predicting the quality of complex and diverse UGC videos in practical applications.
Ranked #4 on Video Quality Assessment on LIVE Livestream
no code implementations • 22 Dec 2020 • Jianzhou Zhao, Yilin Wang, Xiaolong Feng, Shengyuan A. Yang
Our results indicate that the electronic structures of LaFe$_2$As$_2$ and CaFe$_2$As$_2$ are not too different, which further suggest that superconductivity might also be induced in the collapsed phase of LaFe$_2$As$_2$ under similar non-hydrostatic conditions as for CaFe$_2$As$_2$.
Strongly Correlated Electrons Superconductivity
1 code implementation • 13 Dec 2020 • Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zhe Lin, Alan Yuille
To evaluate segmentation quality near object boundaries, we propose the Meticulosity Quality (MQ) score considering both the mask coverage and boundary precision.
1 code implementation • CVPR 2021 • Qihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan Yuille
We propose Mask Guided (MG) Matting, a robust matting framework that takes a general coarse mask as guidance.
no code implementations • 10 Dec 2020 • Fredrik Viklund, Yilin Wang
Moreover, if either of these two energies is finite they are equal up to a constant factor, and in this case, the foliation leaves are Weil-Petersson quasicircles.
Complex Variables Mathematical Physics Mathematical Physics Probability
no code implementations • 29 Oct 2020 • Yilin Wang, Jiayi Ye
Point cloud 3D object detection has recently received major attention and becomes an active research topic in 3D computer vision community.
1 code implementation • 26 Oct 2020 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We consider the problem of conducting frame rate dependent video quality assessment (VQA) on videos of diverse frame rates, including high frame rate (HFR) videos.
Ranked #1 on Video Quality Assessment on LIVE-YT-HFR
no code implementations • NeurIPS 2020 • Digvijay Boob, Qi Deng, Guanghui Lan, Yilin Wang
We also establish new convergence complexities to achieve an approximate KKT solution when the objective can be smooth/nonsmooth, deterministic/stochastic and convex/nonconvex with complexity that is on a par with gradient descent for unconstrained optimization problems in respective cases.
1 code implementation • 22 Sep 2020 • Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik
Banding artifacts, which manifest as staircase-like color bands on pictures or video frames, is a common distortion caused by compression of low-textured smooth regions.
1 code implementation • ECCV 2020 • Shikun Liu, Zhe Lin, Yilin Wang, Jianming Zhang, Federico Perazzi, Edward Johns
We present a novel resizing module for neural networks: shape adaptor, a drop-in enhancement built on top of traditional resizing layers, such as pooling, bilinear sampling, and strided convolution.
1 code implementation • 22 Jul 2020 • Pavan C. Madhusudana, Xiangxu Yu, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
We also conducted a holistic evaluation of existing state-of-the-art Full and No-Reference video quality algorithms, and statistically benchmarked their performance on the new database.
no code implementations • ECCV 2020 • Kenan E. Ak, Ning Xu, Zhe Lin, Yilin Wang
To our best knowledge, the proposed method is first to enable adversarial learning in autoregressive models for image generation.
no code implementations • CVPR 2020 • Innfarn Yoo, Xiyang Luo, Yilin Wang, Feng Yang, Peyman Milanfar
DitherNet manipulates the input image to reduce color banding artifacts and provides an alternative to traditional dithering.
no code implementations • 19 Jun 2020 • Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
High frame rate videos are increasingly getting popular in recent years, driven by the strong requirements of the entertainment and streaming industries to provide high quality of experiences to consumers.
Ranked #3 on Video Quality Assessment on LIVE-YT-HFR
5 code implementations • 29 May 2020 • Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Recent years have witnessed an explosion of user-generated content (UGC) videos shared and streamed over the Internet, thanks to the evolution of affordable and reliable consumer capture devices, and the tremendous popularity of social media platforms.
Ranked #11 on Video Quality Assessment on YouTube-UGC
no code implementations • 27 Feb 2020 • Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, Alan C. Bovik
Banding artifact, or false contouring, is a common video compression impairment that tends to appear on large flat regions in encoded videos.
1 code implementation • 13 Apr 2019 • Yilin Wang, Sasi Inguva, Balu Adsumilli
However, traditional metrics used in compression and quality assessment, like BD-Rate and PSNR, are designed for pristine originals.
Multimedia Image and Video Processing
2 code implementations • ICCV 2019 • Yulun Zhang, Chen Fang, Yilin Wang, Zhaowen Wang, Zhe Lin, Yun Fu, Jimei Yang
An assumption widely used in recent neural style transfer methods is that image styles can be described by global statics of deep features like Gram or covariance matrices.
no code implementations • NeurIPS 2018 • Tianshu Yu, Junchi Yan, Yilin Wang, Wei Liu, Baoxin Li
Graph matching has received persistent attention over decades, which can be formulated as a quadratic assignment problem (QAP).
no code implementations • 5 Apr 2017 • Parag S. Chandakkar, Yilin Wang, Baoxin Li
In the framework, the number of lanes, the vehicle's position in those lanes and the presence of other vehicles are considered as parameters.
no code implementations • 21 Jul 2016 • Yilin Wang, Suhang Wang, Jiliang Tang, Neil O'Hare, Yi Chang, Baoxin Li
Understanding human actions in wild videos is an important task with a broad range of applications.
no code implementations • CVPR 2016 • Yilin Wang, Suhang Wang, Jiliang Tang, Huan Liu, Baoxin Li
However, pointwise labels in image classification and tag annotation are inherently related to the pairwise labels.
no code implementations • 24 Mar 2015 • Qiang Zhang, Yilin Wang, Baoxin Li
Recently, the spectrum analysis based visual saliency approach has attracted a lot of interest due to its simplicity and good performance, where the phase information of the image is used to construct the saliency map.
no code implementations • CVPR 2014 • Yilin Wang, Ke Wang, Enrique Dunn, Jan-Michael Frahm
We develop a sequential optimal sampling framework for stereo disparity estimation by adapting the Sequential Probability Ratio Test (SPRT) model.