Multi-InteractionVQA

This repository implements "Multiple interaction learning with question-type prior knowledge for constraining answer search space in visual question answering". Our single model achieved 70.93 on VQA 2.0 (Test-standard). On the TDIUC dataset, our single model achieved 73.04 on the Arithmetic MTP metric and 66.86 on the Harmonic MTP metric.
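
For readers unfamiliar with the two TDIUC numbers, the sketch below illustrates how an arithmetic mean and a harmonic mean over per-question-type accuracies differ. It assumes that MTP denotes a mean taken over per-type accuracies. The function name, the type names, and the values are hypothetical and are not results or code from this repository.

```python
from statistics import fmean, harmonic_mean


def mean_per_type(per_type_accuracy):
    """Return (arithmetic, harmonic) means over per-question-type accuracies.

    `per_type_accuracy` maps a question-type name to its accuracy in percent.
    """
    values = list(per_type_accuracy.values())
    return fmean(values), harmonic_mean(values)


# Hypothetical per-type accuracies (percent), for illustration only.
example = {"counting": 50.0, "color": 100.0}
arithmetic, harmonic = mean_per_type(example)
print(f"arithmetic={arithmetic:.2f} harmonic={harmonic:.2f}")
```

The harmonic mean is pulled toward the weakest question type, so under this assumption a model that does poorly on even one type scores noticeably lower on the harmonic variant than on the arithmetic one.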

License: cc-by-4.0
Task: Visual Question Answering
Language: English
Author: @researchteam-1234

Last updated: 19 days ago


