This project is a simple tool that allows users to upload a video or audio file and extract text from it. It utilizes Streamlit for the user interface and ffmpeg for audio extraction, Whisper API for speech recognition, and yt-dlp for retrieving audio from a video url.
- Open https://transcribetool.streamlit.app/
- Upload a video | audio file or provide url of a youtube video.
- Wait for the transcription process to complete.
- View the text extracted from the file.
To run this project locally, follow these steps:
- Clone this repository to your local machine.
- Navigate to the project directory.
- Create and activate a virtual environment (optional but recommended):
python3 -m venv venv
source venv/bin/activate
- Install the required dependencies using pip:
pip install -r requirements.txt
- Run the following command to start the Streamlit app:
streamlit run main.py
- Open your web browser and go to the URL provided by Streamlit.
- Upload a video | audio file or provide url of a video.
- Wait for the transcription process to complete.
- View the text extracted from the file.
This project uses the following libraries:
- ffmpeg-python for audio extraction.
- Streamlit for building the web application.
- Whisper for speech recognition.
- yt-dlp for retrieving audio from a video url.
This project is licensed under the GPL-3.0 License - see the LICENSE file for details.