generated from nhcarrigan/template
feat: use whisperx to allow diarization
This commit is contained in:
@@ -6,6 +6,22 @@ This is a local recording, transcription, and summarisation script that listens
|
||||
|
||||
Gotta run it locally. Sorry!
|
||||
|
||||
1. Install Ollama, pull `llama3:8b`
|
||||
2. Install `uv`.
|
||||
3. Run `uvx python@3.12 whisperx` to install whisperx
|
||||
4. Run `uvx python@3.12 main.py`.
|
||||
|
||||
### Diarization
|
||||
|
||||
To download the models needed for diarization, you need to create a Hugging Face account. Then you should agree to the terms to access these two models:
|
||||
|
||||
1. https://huggingface.co/pyannote/segmentation-3.0
|
||||
2. https://huggingface.co/pyannote/speaker-diarization-3.1
|
||||
|
||||
Uncomment the `--hf_token` line and provide your Hugging Face token. This token needs read permissions for gated public repositories you have access to.
|
||||
|
||||
Once you have run the script with this token at least once, you can remove your token and the script should work as normal.
|
||||
|
||||
## Feedback and Bugs
|
||||
|
||||
If you have feedback or a bug report, please feel free to open a GitHub issue!
|
||||
|
||||
Reference in New Issue
Block a user