Multi-InteractionVQA

Multi InteractionVQA

This repository is the implementation of Multiple interaction learning with question-type prior knowledge for constraining answer search space in visual question answering for the visual question answering task. Our single model achieved 70.93 (Test-standard, VQA 2.0). Moreover, in TDIUC dataset, our single model achieved 73.04 in Arithmetic MTP metric and 66.86 in Harmonic MTP metric.

cc-by-4.0
Visual Question Answering
English
by @researchteam-1234
0

Last updated: 19 days ago


Sign in to see model files

orCreate an account