Setup

Whisper Playground

Instantly build real-time speech2text apps in 99 languages using faster-whisper, Diart, and Pyannote

Try it via the online demo

Playground.Demo.mp4

Setup

Have Conda and Yarn on your device
Clone or fork this repository
Install the backend and frontend environment sh install_playground.sh
Review config.py to make sure the transcription device and compute type match your setup. Review config.js to make sure it conforms to the backend config and that the backend address is correct.
Run the backend cd backend && python server.py
In a different terminal, run the React frontend cd interface && yarn start

Access to Pyannote Models

This repository uses libraries based on pyannote.audio models, which are stored in the Hugging Face Hub. You must accept their terms of use before using them. Note: You need to have a Hugging Face account to use pyannote

Accept terms for the pyannote/segmentation model
Accept terms for the pyannote/embedding model
Accept terms for the pyannote/speaker-diarization model
Install huggingface-cli and log in with your user access token (can be found in Settings -> Access Tokens)

Parameters

Model Size: Choose the model size, from tiny to large-v2.
Language: Select the language you will be speaking in.
Transcription Timeout: Set the number of seconds the application will wait before transcribing the current audio data.
Beam Size: Adjust the number of transcriptions generated and considered, which affects accuracy and transcription generation time.
Transcription Method: Choose "real-time" for real-time diarization and transcriptions, or "sequential" for periodic transcriptions with more context.

Troubleshooting

On MacOS, if building the wheel for safetensors fails, install Rust brew install rust and try again.

Known Bugs

This repository hasn't been tested for all languages; please create an issue if you encounter any problems.

License

This repository and the code and model weights of Whisper are released under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
.github		.github
backend		backend
interface		interface
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
install_playground.sh		install_playground.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github

.github

backend

backend

interface

interface

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

install_playground.sh

install_playground.sh

requirements.txt

requirements.txt

Repository files navigation

Whisper Playground

Instantly build real-time speech2text apps in 99 languages using faster-whisper, Diart, and Pyannote

Try it via the online demo

Setup

Access to Pyannote Models

Parameters

Troubleshooting

Known Bugs

License

About

Releases 2

Sponsor this project

Contributors 3

Languages

License

saharmor/whisper-playground

Folders and files

Latest commit

History

Repository files navigation

Whisper Playground

Instantly build real-time speech2text apps in 99 languages using faster-whisper, Diart, and Pyannote

Setup

Access to Pyannote Models

Parameters

Troubleshooting

Known Bugs

License

About

Topics

Resources

License

Stars

Watchers

Forks

Sponsor this project

Languages