This is a fork of [auto_subtitle](https://github.com/m1guelpf/auto-subtitle) that uses the [faster-whisper](https://github.com/SYSTRAN/faster-whisper) implementation.
This repository uses `ffmpeg` and [OpenAI's Whisper](https://openai.com/blog/whisper) to automatically generate and overlay subtitles on any video.
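For example, a minimal run looks like the following (assuming the package installs a CLI entry point named `faster_auto_subtitle`; check the installation instructions for the exact command name):

```bash
# Generate subtitles and overlay them on the video
faster_auto_subtitle /path/to/video.mp4
```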
The default setting (which selects the `small` model) works well for transcribing English. You can optionally use a bigger model for better results (especially with other languages). The available models are `tiny`, `tiny.en`, `base`, `base.en`, `small`, `small.en`, `medium`, `medium.en`, `large`, `large-v1`, `large-v2`, `large-v3`.
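For example, to pick a bigger model (the `--model` flag mirrors the upstream `auto_subtitle` CLI; the entry-point name is an assumption, as above):

```bash
# Use the medium model for better accuracy, e.g. on non-English audio
faster_auto_subtitle /path/to/video.mp4 --model medium
```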
Setting a higher `no_speech_threshold` can be useful for videos with a lot of background noise, as it stops Whisper from "hallucinating" subtitles for the noise.
In my experience, setting `condition_on_previous_text` to `False` dramatically increases accuracy for videos like TV shows with an intro song at the start.
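A sketch combining both tweaks (the flag spellings are assumed to match the option names above, the `0.8` value is only an illustration, and how the boolean is passed depends on the CLI's argument parsing):

```bash
# Raise the no-speech threshold for noisy audio and stop conditioning
# on previous text so an intro song doesn't derail later segments
faster_auto_subtitle /path/to/video.mp4 \
    --no_speech_threshold 0.8 \
    --condition_on_previous_text False
```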
You can use the `sample_interval` parameter to generate subtitles for just a portion of the video while experimenting with these parameters:
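For example (assuming `--sample_interval` takes a start and an end timestamp; check `--help` for the exact format):

```bash
# Transcribe only the 1:00-3:00 segment to test parameter choices quickly
faster_auto_subtitle /path/to/video.mp4 \
    --sample_interval 00:01:00 00:03:00 \
    --no_speech_threshold 0.8 \
    --condition_on_previous_text False
```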