Search Results for author: Jinghui Qin

Found 22 papers, 17 papers with code

Mirror Gradient: Towards Robust Multimodal Recommender Systems via Exploring Flat Local Minima

1 code implementation • 17 Feb 2024 • Shanshan Zhong, Zhongzhan Huang, Daifeng Li, Wushao Wen, Jinghui Qin, Liang Lin

This strategy can implicitly enhance the model's robustness during the optimization process, mitigating instability risks arising from multimodal information inputs.

Multimodal Recommendation

ADASR: An Adversarial Auto-Augmentation Framework for Hyperspectral and Multispectral Data Fusion

1 code implementation • 11 Oct 2023 • Jinghui Qin, Lihuang Fang, Ruitao Lu, Liang Lin, Yukai Shi

Deep learning-based hyperspectral image (HSI) super-resolution, which aims to generate high spatial resolution HSI (HR-HSI) by fusing hyperspectral image (HSI) and multispectral image (MSI) with deep neural networks (DNNs), has attracted lots of attention.

Data Augmentation Super-Resolution

Understanding Self-attention Mechanism via Dynamical System Perspective

no code implementations • ICCV 2023 • Zhongzhan Huang, Mingfu Liang, Jinghui Qin, Shanshan Zhong, Liang Lin

The self-attention mechanism (SAM) is widely used in various fields of artificial intelligence and has successfully boosted the performance of different models.

SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models

1 code implementation • 9 May 2023 • Shanshan Zhong, Zhongzhan Huang, Wushao Wen, Jinghui Qin, Liang Lin

Our approach can make text-to-image diffusion models easier to use with better user experience, which demonstrates our approach has the potential for further advancing the development of user-friendly text-to-image generation models by bridging the semantic gap between simple narrative prompts and complex keyword-based prompts.

Knowledge Distillation Text-to-Image Generation

LSAS: Lightweight Sub-attention Strategy for Alleviating Attention Bias Problem

1 code implementation • 9 May 2023 • Shanshan Zhong, Wushao Wen, Jinghui Qin, Qiangpu Chen, Zhongzhan Huang

In computer vision, the performance of deep neural networks (DNNs) is highly related to the feature extraction ability, i.e., the ability to recognize and focus on key pixel regions in an image.

ASR: Attention-alike Structural Re-parameterization

no code implementations • 13 Apr 2023 • Shanshan Zhong, Zhongzhan Huang, Wushao Wen, Jinghui Qin, Liang Lin

This technique enables the mitigation of the extra costs for performance improvement during training, such as parameter size and inference time, through these transformations during inference, and therefore SRP has great potential for industrial and practical applications.

UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression

1 code implementation • 6 Dec 2022 • Jiaqi Chen, Tong Li, Jinghui Qin, Pan Lu, Liang Lin, Chongyu Chen, Xiaodan Liang

Naturally, we also present a unified multi-task Geometric Transformer framework, Geoformer, to tackle calculation and proving problems simultaneously in the form of sequence generation, which shows that reasoning ability can be improved on both tasks by unifying the formulation.

Ranked #3 on Mathematical Reasoning on PGPS9K

Geometry Problem Solving Logical Reasoning +1

Deepening Neural Networks Implicitly and Locally via Recurrent Attention Strategy

no code implementations • 27 Oct 2022 • Shanshan Zhong, Wushao Wen, Jinghui Qin, Zhongzhan Huang

More and more empirical and theoretical evidence shows that deepening neural networks can effectively improve their performance under suitable training settings.

Causal Inference for Chatting Handoff

1 code implementation • 6 Oct 2022 • Shanshan Zhong, Jinghui Qin, Zhongzhan Huang, Daifeng Li

However, most existing methods mainly focus on the dialogue context or assist with global satisfaction prediction based on multi-task learning, which ignore the grounded relationships among the causal variables, like the user state and labor cost.

Causal Inference Chatbot +2

Switchable Self-attention Module

1 code implementation • 13 Sep 2022 • Shanshan Zhong, Wushao Wen, Jinghui Qin

The attention mechanism has achieved great success in vision recognition.

Mix-Pooling Strategy for Attention Mechanism

1 code implementation • 22 Aug 2022 • Shanshan Zhong, Wushao Wen, Jinghui Qin

Recently, many effective attention modules have been proposed to boost model performance by exploiting the internal information of convolutional neural networks in computer vision.

Real-World Image Super-Resolution by Exclusionary Dual-Learning

1 code implementation • 6 Jun 2022 • Hao Li, Jinghui Qin, Zhijing Yang, Pengxu Wei, Jinshan Pan, Liang Lin, Yukai Shi

Real-world image super-resolution, a practical image restoration problem that aims to obtain high-quality images from in-the-wild input, has recently received considerable attention because of its tremendous application potential.

Image Restoration Image Super-Resolution

Unbiased Math Word Problems Benchmark for Mitigating Solving Bias

2 code implementations • Findings (NAACL) 2022 • Zhicheng Yang, Jinghui Qin, Jiaqi Chen, Xiaodan Liang

However, current solvers exhibit solving bias, which consists of data bias and learning bias caused by biased datasets and improper training strategies.

Math

LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning

2 code implementations • 17 May 2022 • Zhicheng Yang, Jinghui Qin, Jiaqi Chen, Liang Lin, Xiaodan Liang

To address this issue and make a step towards interpretable MWP solving, we first construct a high-quality MWP dataset named InterMWP which consists of 11,495 MWPs and annotates interpretable logical formulas based on algebraic knowledge as the grounded linguistic logic of each solution equation.

Math Math Word Problem Solving

Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks

1 code implementation • ACL 2021 • Jinghui Qin, Xiaodan Liang, Yining Hong, Jianheng Tang, Liang Lin

Previous math word problem solvers following the encoder-decoder paradigm fail to explicitly incorporate essential math symbolic constraints, leading to unexplainable and unreasonable predictions.

Decoder Math

GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning

1 code implementation • Findings (ACL) 2021 • Jiaqi Chen, Jianheng Tang, Jinghui Qin, Xiaodan Liang, Lingbo Liu, Eric P. Xing, Liang Lin

Therefore, we propose a Geometric Question Answering dataset GeoQA, containing 4,998 geometric problems with corresponding annotated programs, which illustrate the solving process of the given problems.

Ranked #4 on Mathematical Reasoning on PGPS9K

Math Mathematical Reasoning +1

Content-adaptive Representation Learning for Fast Image Super-resolution

no code implementations • 20 May 2021 • Yukai Shi, Jinghui Qin

In contrast to existing studies that ignore difficulty diversity, we adopt different stages of a neural network to perform image restoration.

Image Restoration Image Super-Resolution +1

Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems

1 code implementation • EMNLP 2020 • Jinghui Qin, Lihui Lin, Xiaodan Liang, Rumin Zhang, Liang Lin

A practical automatic textual math word problems (MWPs) solver should be able to solve various textual MWPs while most existing works only focused on one-unknown linear MWPs.

Ranked #10 on Math Word Problem Solving on ALG514

Decoder Math +1

GRADE: Automatic Graph-Enhanced Coherence Metric for Evaluating Open-Domain Dialogue Systems

1 code implementation • EMNLP 2020 • Lishan Huang, Zheng Ye, Jinghui Qin, Liang Lin, Xiaodan Liang

Capitalizing on the topic-level dialogue graph, we propose a new evaluation metric GRADE, which stands for Graph-enhanced Representations for Automatic Dialogue Evaluation.

Dialogue Evaluation

Dynamic Knowledge Routing Network For Target-Guided Open-Domain Conversation

1 code implementation • 4 Feb 2020 • Jinghui Qin, Zheng Ye, Jianheng Tang, Xiaodan Liang

Target-guided open-domain conversation aims to proactively and naturally guide a dialogue agent or human to achieve specific goals, topics or keywords during open-ended conversations.

Retrieval

Difficulty-aware Image Super Resolution via Deep Adaptive Dual-Network

1 code implementation • 11 Apr 2019 • Jinghui Qin, Ziwei Xie, Yukai Shi, Wushao Wen

To identify whether a region is easy or hard, we propose a novel image difficulty recognition network based on PSNR prior.

Image Super-Resolution

PIRM Challenge on Perceptual Image Enhancement on Smartphones: Report

no code implementations • 3 Oct 2018 • Andrey Ignatov, Radu Timofte, Thang Van Vu, Tung Minh Luu, Trung X. Pham, Cao Van Nguyen, Yongwoo Kim, Jae-Seok Choi, Munchurl Kim, Jie Huang, Jiewen Ran, Chen Xing, Xingguang Zhou, Pengfei Zhu, Mingrui Geng, Yawei Li, Eirikur Agustsson, Shuhang Gu, Luc van Gool, Etienne de Stoutz, Nikolay Kobyshev, Kehui Nie, Yan Zhao, Gen Li, Tong Tong, Qinquan Gao, Liu Hanwen, Pablo Navarrete Michelini, Zhu Dan, Hu Fengshuo, Zheng Hui, Xiumei Wang, Lirui Deng, Rang Meng, Jinghui Qin, Yukai Shi, Wushao Wen, Liang Lin, Ruicheng Feng, Shixiang Wu, Chao Dong, Yu Qiao, Subeesh Vasu, Nimisha Thekke Madam, Praveen Kandula, A. N. Rajagopalan, Jie Liu, Cheolkon Jung

This paper reviews the first challenge on efficient perceptual image enhancement with the focus on deploying deep learning models on smartphones.

Image Enhancement Image Super-Resolution
