video-captioning

This repository provides a tool for automatically generating subtitles for video content, improving accessibility and viewer experience by adding captions quickly and accurately.

subtitles video-processing video-editing video-captioning subtitle-generator media-editing video-tools subtitles-generator video-subtitles auto-captioning subtitle-tools media-tools auto-subtitles-generator video-accessibility auto-subtitling subtitle-automation content-accessibility video-enhancements subtitling-tools

Updated Feb 8, 2025

hmartelb / multimodal-video-captioning

Star

Multimodal Video Captioning project for the Natural Language Processing course at Tsinghua University, spring 2021

natural-language-processing video deep-learning pytorch video-captioning tsinghua-university multimodal

Updated Aug 30, 2022
Jupyter Notebook

ccc000-png / Tracker4Cap

Star

Frame-by-Frame Multi-object Tracking Guided Information Augmentation For Video Captioning

video-captioning

Updated Jan 20, 2025
Python

KARTASAR / video-captioning

Star

Video captioning | Video2Description

mlp video-captioning blip gpt-2

Updated Jan 28, 2023
Jupyter Notebook

Aavtic / thamburaan

Star

auto-caption program for generation word by word captioning on a green-screen video

rust text-to-speech tts video-captioning caption-generation

Updated Aug 13, 2024
Rust

HaydenFaulkner / Attributes_SVO_Video_Captioning

Star

LSTM RNN and Transformer networks video captioning on MSVD and MSR-VTT using attributes and SVOS

pytorch transformer lstm video-captioning msvd msr-vtt

Updated Feb 15, 2021
Jupyter Notebook

AkagawaTsurunaki / zerolan-core

Star

ZerolanCore integrates many open-source, locally deployable AI models, and aims to integrate a series of AI models such as large language model (LLM), automatic speech recognition (ASR), text-to-speech (TTS), image captioning, optical character recognition (OCR), video captioning, etc.

python nlp docker ocr anaconda cv tts image-captioning asr video-captioning llm

Updated Jan 29, 2025
Python

willyfh / msvd-indonesian

Star

MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).

deep-learning neural-network bahasa-indonesia video-captioning msvd video-retrieval video-description multimodal-dataset video-text indonesian-dataset msvd-indonesian

Updated Aug 4, 2023

AI-14 / video-captioning-for-arabic-sign-language-recognition-at-sentence-level

Star

An encoder-decoder deep learning model (with/without attention mechanism) where the input is an arabic sign-language video and the output is its translation in text format.

python deep-learning pytorch lstm vgg16 attention-mechanism video-captioning encoder-decoder-model

Updated Aug 29, 2023
Jupyter Notebook

vaishwi / CapSum-Joint-Video-Summarization-and-Video-Captioing

Star

AI based Video summarizer along with captioning.

video-summarization video-captioning gcp-storage gcp-sql gcp-app-engine

Updated Nov 7, 2020
Python

Pandla-Vijay / Video-Captioning-using-Spatio-temporal-features-and-Gaussian-Attention

Star

This project utilizes advanced deep learning techniques to automatically generate contextually relevant captions for videos by extracting spatial and temporal features, while incorporating Gaussian attention to focus on important regions. This enhances video indexing, retrieval, and accessibility for visually impaired individuals.

lstm gru video-captioning spatio-temporal-data msvd