This project allows you to download YouTube video transcripts, audio, and video content.
- Python 3.x
- ffmpeg (required for audio conversion)
  - Ubuntu/Debian:
    sudo apt-get install ffmpeg
  - macOS:
    brew install ffmpeg
  - Windows: download from ffmpeg.org
- youtube-transcript-api >= 0.6.1
- yt-dlp >= 2025.1.26
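Before running the project, it can help to confirm that the requirements listed above are actually present. The following is a minimal sketch, not part of the repository: the function name `find_missing` is hypothetical, and it assumes Python 3.8 or later (for `importlib.metadata`). It only checks that the tools and packages can be found; it does not compare version numbers.

```python
import shutil
from importlib import metadata


def find_missing(tools=("ffmpeg",), packages=("youtube-transcript-api", "yt-dlp")):
    """Return the names of command-line tools and Python packages that cannot be found.

    `find_missing` is a hypothetical helper for illustration; it is not part of the project.
    """
    # shutil.which returns None when an executable is not on PATH.
    missing = [tool for tool in tools if shutil.which(tool) is None]
    for package in packages:
        try:
            # Raises PackageNotFoundError when the distribution is not installed.
            metadata.version(package)
        except metadata.PackageNotFoundError:
            missing.append(package)
    return missing


if __name__ == "__main__":
    missing = find_missing()
    print("Missing:", ", ".join(missing) if missing else "nothing")
```

Run it inside the virtual environment so that the package check reflects what the project will see.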
.
├── srcs/
│ ├── main.py
│ ├── youtube_service.py
│ └── urls.txt
├── audio/ # Downloaded audio files
├── video/ # Downloaded video files
├── transcripts/ # Generated transcripts
├── requirements.txt
└── setup.sh
- Clone the repository:
git clone https://github.com/afuma/Youtube_transcript.git
cd Youtube_transcript
- Run the setup script to create a virtual environment and install dependencies:
./setup.sh
- Add YouTube URLs to srcs/urls.txt, one URL per line.
- Run the main script:
python srcs/main.py
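The one-URL-per-line format of srcs/urls.txt can be read with a few lines of standard-library code. This is a sketch under stated assumptions, not the code in srcs/main.py: the helpers `read_urls` and `extract_video_id` are hypothetical names, skipping blank lines is an assumed behavior, and only the common `youtube.com/watch?v=...` and `youtu.be/...` URL shapes are handled.

```python
from pathlib import Path
from urllib.parse import parse_qs, urlparse


def read_urls(path):
    """Return the non-empty lines of a URL list file, stripped of whitespace.

    Hypothetical helper; skipping blank lines is an assumption.
    """
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    return [line.strip() for line in lines if line.strip()]


def extract_video_id(url):
    """Return the video ID from a watch or youtu.be URL, or None if not recognized."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host == "youtu.be":
        # Short links carry the ID as the path: https://youtu.be/<id>
        return parsed.path.lstrip("/") or None
    if host == "youtube.com" or host.endswith(".youtube.com"):
        # Watch links carry the ID in the "v" query parameter.
        ids = parse_qs(parsed.query).get("v")
        return ids[0] if ids else None
    return None
```

A typical use would be `[extract_video_id(u) for u in read_urls("srcs/urls.txt")]`, where a `None` entry marks a line that is not a recognized YouTube URL.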
The script will:
- Download video transcripts to the transcripts/ directory
- Download audio files to the audio/ directory
- Download video files to the video/ directory
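For readers curious how the audio and video downloads might map onto yt-dlp, here is a rough sketch. It is not the code in srcs/youtube_service.py: the function names are hypothetical, and the MP3 codec, the 192 kbps quality, and the title-based file names are assumptions chosen for illustration. The option keys themselves (`format`, `outtmpl`, `postprocessors`, `FFmpegExtractAudio`) are standard yt-dlp options, and the audio conversion step is where ffmpeg is needed. Transcript download is left out of this sketch.

```python
from pathlib import Path


def audio_options(out_dir="audio"):
    """Build yt-dlp options that save the best audio stream and convert it with ffmpeg.

    Hypothetical helper; the codec and quality values are assumptions.
    """
    return {
        "format": "bestaudio/best",
        "outtmpl": str(Path(out_dir) / "%(title)s.%(ext)s"),
        "postprocessors": [
            {
                "key": "FFmpegExtractAudio",  # requires ffmpeg on PATH
                "preferredcodec": "mp3",
                "preferredquality": "192",
            }
        ],
    }


def video_options(out_dir="video"):
    """Build yt-dlp options that save the best available video with audio."""
    return {
        "format": "bestvideo+bestaudio/best",
        "outtmpl": str(Path(out_dir) / "%(title)s.%(ext)s"),
    }


def download(urls, options):
    """Download every URL in `urls` using the given yt-dlp options."""
    import yt_dlp  # imported here so the option helpers work without yt-dlp installed

    with yt_dlp.YoutubeDL(options) as ydl:
        ydl.download(list(urls))
```

Calling `download(urls, audio_options())` followed by `download(urls, video_options())` would then fill the audio/ and video/ directories, assuming network access and a working ffmpeg install.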
When you're done, you can deactivate the virtual environment:
deactivate