面向视觉问答的上下文感知多模态交互网络
颜洪,黄青松,刘利军
面向视觉问答的上下文感知多模态交互网络
Context-aware Multi-modality Interactive Network for Visual Question Answering
{{custom_ref.label}} |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
/
〈 | 〉 |