Search Results for author: Sicong Leng

Found 6 papers, 5 papers with code

Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly

1 code implementation • 30 Apr 2024 • Hang Du, Sicheng Zhang, Binzhu Xie, Guoshun Nan, Jiayang Zhang, Junrui Xu, Hangyu Liu, Sicong Leng, Jiangming Liu, Hehe Fan, Dajiu Huang, Jing Feng, Linli Chen, Can Zhang, Xuhuan Li, Hao Zhang, Jianhang Chen, Qimei Cui, Xiaofeng Tao

We present a comprehensive benchmark for Causation Understanding of Video Anomaly (CUVA), which targets the what, why, and how of video anomalies.

Anomaly Detection

Constrained Layout Generation with Factor Graphs

no code implementations • 30 Mar 2024 • Mohammed Haroon Dupty, Yanfei Dong, Sicong Leng, Guoji Fu, Yong Liang Goh, Wei Lu, Wee Sun Lee

This paper addresses the challenge of object-centric layout generation under spatial constraints, which arises in multiple domains, including the floorplan design process.

Object

Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

2 code implementations • 28 Nov 2023 • Sicong Leng, Hang Zhang, Guanzheng Chen, Xin Li, Shijian Lu, Chunyan Miao, Lidong Bing

Large Vision-Language Models (LVLMs) have advanced considerably, intertwining visual recognition and language understanding to generate content that is not only coherent but also contextually attuned.

Hallucination, Object

Tell2Design: A Dataset for Language-Guided Floor Plan Generation

2 code implementations • 27 Nov 2023 • Sicong Leng, Yang Zhou, Mohammed Haroon Dupty, Wee Sun Lee, Sam Conrad Joyce, Wei Lu

We make multiple contributions to initiate research on language-guided floor plan generation.

Conditional Image Generation

Speaker-Oriented Latent Structures for Dialogue-Based Relation Extraction

1 code implementation • 11 Sep 2021 • Guoshun Nan, Guoqing Luo, Sicong Leng, Yao Xiao, Wei Lu

Dialogue-based relation extraction (DiaRE) aims to detect the structural information from unstructured utterances in dialogues.

Dialog Relation Extraction Relation

Interventional Video Grounding with Dual Contrastive Learning

1 code implementation • CVPR 2021 • Guoshun Nan, Rui Qiao, Yao Xiao, Jun Liu, Sicong Leng, Hao Zhang, Wei Lu

We introduce a dual contrastive learning approach (DCL) that better aligns text and video by maximizing the mutual information (MI) between the query and video clips, as well as the MI between the start/end frames of a target moment and the other frames in the video, in order to learn more informative visual representations.

Causal Inference, Contrastive Learning (+2 more)
