no code implementations • 15 Apr 2024 • Ziniu Zhang, Shulin Tian, Liangyu Chen, Ziwei Liu
To answer this question, we present MMInA, a multihop and multimodal benchmark for evaluating embodied agents on compositional Internet tasks, with several appealing properties: 1) Evolving real-world multimodal websites.
no code implementations • 9 Jul 2023 • Shulin Tian, YuFei Wang, Renjie Wan, Wenhan Yang, Alex C. Kot, Bihan Wen
In this work, we propose a novel approach to increase the visibility of images captured in low-light environments by removing the in-camera infrared (IR) cut-off filter. Removing the filter lets the sensor capture more photons and improves the signal-to-noise ratio by including information from the IR spectrum.