A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Oscar and VinVL
InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.
Visual Question Answering in PyTorch
Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)
[ICCV 2021 Oral] Official PyTorch implementation of Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network, including examples for DETR and VQA.
Strong baseline for visual question answering
A curated list of Visual Question Answering (VQA) resources, covering image/video question answering, Visual Question Generation, Visual Dialog, Visual Commonsense Reasoning, and related areas.
PyTorch implementation for the Neuro-Symbolic Concept Learner (NS-CL).
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
PyTorch implementation of "Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning"
A lightweight, scalable, and general framework for visual question answering research
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4V, Gemini, QwenVLPlus, 40+ Hugging Face models, and 20+ benchmarks
TensorFlow implementation of Deeper LSTM + normalized CNN for Visual Question Answering