no code implementations • 5 Jan 2024 • Daoan Zhang, Junming Yang, Hanjia Lyu, Zijian Jin, Yuan YAO, Mingkai Chen, Jiebo Luo
In the development of Artificial General Intelligence (AGI), a critical task for such models is interpreting and processing information from multiple image inputs.
Ranked #3 on Visual Reasoning on Winoground
no code implementations • 5 Feb 2023 • Daoan Zhang, Mingkai Chen, Chenming Li, Lingyun Huang, JianGuo Zhang
Rather than learning domain-invariant features from source domains, we decouple the input images into Domain Expert Features and noise.