captioning

Star

Here are 64 public repositories matching this topic...

facebookresearch / mmf

Star

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

deep-learning dialog pytorch vqa pretrained-models captioning multimodal multi-tasking textvqa hateful-memes

Updated Mar 3, 2024
Python

ltguo19 / VSUA-Captioning

Star

Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019

nlp deep-learning pytorch captioning language-generation

Updated Oct 18, 2019
Python

drethage / fully-convolutional-point-network

Star

Fully-Convolutional Point Networks for Large-Scale Point Clouds

deep-neural-networks computer-vision deep-learning point-cloud point-clouds semantic-segmentation meshes 3d captioning

Updated Mar 22, 2019
Python

DavidHuji / CapDec

Star

CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)

clip zero-shot-learning captioning multimodal-deep-learning gpt-2 clipcap

Updated Jan 28, 2024
Python

wangleihitcs / MedicalReportGeneration

Star

A Base Tensorflow Project for Medical Report Generation

tensorflow-models captioning medical-report-generate

Updated Jun 16, 2019
Python

audio-captioning / clotho-dataset

Star

Python code for handling the Clotho dataset.

audio natural-language-processing deep-learning audio-signal-processing captioning audio-captioning clotho-dataset

Updated Nov 24, 2020
Python

ParitoshParmar / MTL-AQA

Star

What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]

pytorch video-processing lstm representation-learning action-recognition video-understanding c3d video-captioning captioning fine-grained-classification multitask-learning dilated-convolution action-quality-assessment mtl-aqa fine-grained-action-recognition dilated-c3d

Updated Nov 3, 2022
Python

HaydenFaulkner / Tennis

Star

A Tennis dataset and models for event detection & commentary generation

machine-learning video computer-vision mxnet dataset tennis gluon sportsanalytics fine-grained captioning eventdetection

Updated Aug 17, 2020
Python

deepgram-devs / video-chat

Star

Sample app to display live captioning to a WebRTC video session with the Deepgram API.

webrtc speech-recognition speech-to-text captioning deepgram

Updated Nov 22, 2021
JavaScript

aimagelab / camel

Star

CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022

computer-vision pytorch artificial-intelligence image-captioning captioning-images captioning

Updated Dec 1, 2022
Python

audio-captioning / dcase-2020-baseline

Star

Audio captioning baseline system for DCASE 2020 challenge.

machine-learning deep-neural-networks deep-learning signal-processing audio-signal-processing captioning dcase machine-listening audio-captioning dcase2020

Updated Aug 22, 2023
Python

alecwangcq / show-attend-and-tell

Star

captioning

Updated Nov 15, 2017
Jupyter Notebook

AdrianHsu / S2VT-seq2seq-video-captioning-attention

Star

S2VT (seq2seq) video captioning with bahdanau & luong attention implementation in Tensorflow

video deep-learning tensorflow seq2seq attention-mechanism captioning

Updated Apr 26, 2018
Python

ebu / ebu-tt-live-toolkit

Star

Toolkit for supporting the EBU-TT Live specification

python video live captions subtitles broadcast ebu-tt subtitling captioning

Updated Oct 11, 2023
Python

Mauville / MedCLIP

Star

Medical image captioning using OpenAI's CLIP

machine-learning deep-learning medical-imaging clip captioning what-a-challenge-this-was

Updated Mar 7, 2023
Jupyter Notebook

TheShadow29 / VidSitu

Star

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

nlp video vision srl captioning captioning-videos vision-and-language grounding video-language event-relations semantic-roles

Updated Aug 17, 2021
Python

nikhilkumarsingh / MemeGenerator

Star

Python program to generate memes.

python generator memes pillow captioning

Updated Oct 3, 2023
Jupyter Notebook

rayandrew / indonesian-image-captioning

Star

Indonesian Image Captioning using Attention-based Semantic Compositional Networks

pytorch indonesia attention image-captioning resnet indonesian captioning

Updated Jul 31, 2019
Jupyter Notebook

lucidrains / AoA-pytorch

Star

A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering

vqa attention attention-mechanism captioning visual-question-answering

Updated Nov 8, 2020
Python

Labbeti / aac-datasets

Star

Audio Captioning datasets for PyTorch.

audio deep-learning pytorch dataset caption datasets captioning audio-captioning

Updated May 7, 2024
Python

Improve this page

Add a description, image, and links to the captioning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the captioning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

captioning

Here are 64 public repositories matching this topic...

facebookresearch / mmf

ltguo19 / VSUA-Captioning

drethage / fully-convolutional-point-network

DavidHuji / CapDec

wangleihitcs / MedicalReportGeneration

audio-captioning / clotho-dataset

ParitoshParmar / MTL-AQA

HaydenFaulkner / Tennis

deepgram-devs / video-chat

aimagelab / camel

audio-captioning / dcase-2020-baseline

alecwangcq / show-attend-and-tell

AdrianHsu / S2VT-seq2seq-video-captioning-attention

ebu / ebu-tt-live-toolkit

Mauville / MedCLIP

TheShadow29 / VidSitu

nikhilkumarsingh / MemeGenerator

rayandrew / indonesian-image-captioning

lucidrains / AoA-pytorch

Labbeti / aac-datasets

Improve this page

Add this topic to your repo