feat: use whisperx to allow diarization

2025-08-05 13:35:00 -07:00
parent 53f30437d9
commit 97e325c0b7
2 changed files with 24 additions and 4 deletions
+16
@@ -6,6 +6,22 @@ This is a local recording, transcription, and summarisation script that listens
Gotta run it locally. Sorry!
1. Install Ollama and pull `llama3:8b`.
2. Install `uv`.
3. Run `uvx python@3.12 whisperx` to install whisperx.
4. Run `uvx python@3.12 main.py`.
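Before running the steps above, it can help to confirm that the prerequisites are actually on your `PATH`. The following is a minimal sketch, not part of this repository: the helper name `missing_tools` is hypothetical, and it only checks that the executables can be found, not that `llama3:8b` has been pulled.

```python
import shutil
from typing import Callable, Iterable, Optional


def missing_tools(
    tools: Iterable[str],
    which: Callable[[str], Optional[str]] = shutil.which,
) -> list[str]:
    """Return the names in `tools` that cannot be found on PATH."""
    # `which` is injectable so the lookup can be replaced in tests.
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    # Assumption: the setup needs the `ollama`, `uv`, and `uvx` executables.
    missing = missing_tools(["ollama", "uv", "uvx"])
    if missing:
        print("Missing prerequisites: " + ", ".join(missing))
    else:
        print("All prerequisites were found on PATH.")
```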
### Diarization
To download the models needed for diarization, create a Hugging Face account, then accept the access terms for these two gated models:
1. https://huggingface.co/pyannote/segmentation-3.0
2. https://huggingface.co/pyannote/speaker-diarization-3.1
Uncomment the `--hf_token` line and provide your Hugging Face token. This token needs read permissions for gated public repositories you have access to.
Once you have run the script with this token at least once, you can remove the token and the script should keep working as normal, because the downloaded models should then be cached locally.
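To illustrate how an optional token might be passed, here is a rough sketch of building a whisperx command line with diarization enabled. It is an assumption about how such a call could be put together, not the code in `main.py`: the helper `build_whisperx_command`, the file name `meeting.wav`, and reading the token from an `HF_TOKEN` environment variable are all hypothetical. Only the `--diarize` and `--hf_token` flags come from the whisperx CLI.

```python
import os
from typing import Optional


def build_whisperx_command(audio_path: str, hf_token: Optional[str] = None) -> list[str]:
    """Build an argument list for the whisperx CLI with diarization enabled."""
    command = ["whisperx", audio_path, "--diarize"]
    # The token is only needed until the gated models have been downloaded,
    # so it is appended only when one is supplied.
    if hf_token:
        command += ["--hf_token", hf_token]
    return command


if __name__ == "__main__":
    # Assumption: the token, if any, lives in the HF_TOKEN environment variable.
    token = os.environ.get("HF_TOKEN")
    command = build_whisperx_command("meeting.wav", token)
    # Redact the token before printing so it does not end up in logs.
    printable = ["***" if token and part == token else part for part in command]
    print(" ".join(printable))
```

Reading the token from the environment rather than hard-coding it keeps it out of version control, and leaving the variable unset after the first successful run matches the "remove your token" step above.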
## Feedback and Bugs
If you have feedback or a bug report, please feel free to open a GitHub issue!