Search Results for author: Moyuru Yamada

Found 5 papers, 3 papers with code

GLoD: Composing Global Contexts and Local Details in Image Generation

no code implementations • 23 Apr 2024 • Moyuru Yamada

However, simultaneous control over both global contexts (e.g., object layouts and interactions) and local details (e.g., colors and emotions) remains a significant challenge.

Denoising, Object, +1

D3: Data Diversity Design for Systematic Generalization in Visual Question Answering

1 code implementation • 15 Sep 2023 • Amir Rahimi, Vanessa D'Amario, Moyuru Yamada, Kentaro Takemoto, Tomotake Sasaki, Xavier Boix

We demonstrate that this result is independent of the similarity between the training and testing data and applies to well-known families of neural network architectures for VQA (i.e., monolithic architectures and neural module networks).

Question Answering, Systematic Generalization, +1

Detect Only What You Specify : Object Detection with Linguistic Target

no code implementations • 18 Nov 2022 • Moyuru Yamada

We then propose a targeted detection task, where the detection targets are specified in natural language and the goal is to detect all of the target objects, and only those, in a given image.

Decoder, Object, +2

Transformer Module Networks for Systematic Generalization in Visual Question Answering

1 code implementation • 27 Jan 2022 • Moyuru Yamada, Vanessa D'Amario, Kentaro Takemoto, Xavier Boix, Tomotake Sasaki

We reveal that Neural Module Networks (NMNs), i.e., question-specific compositions of modules that each tackle a sub-task, achieve systematic generalization performance better than or similar to that of conventional Transformers, even though NMNs' modules are CNN-based.

Question Answering, Systematic Generalization, +1
