Releases: m-bain/whisperX

3.1.1

13 May 11:19
d8a2b4f
  • translate functionality added
  • fix word timestamp bug (words no longer have consecutive timestamps)

v3.1.0

07 May 19:34
1caddfb
  • 70x real-time transcription, <8GB GPU memory requirement ⚡️⚡️
  • each transcript segment is a sentence (using nltk.sent_tokenize)
  • diarization is now assigned per sentence (and written to the SRT output)
  • clean up alignment logic
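
As a rough illustration of per-sentence diarization, the sketch below gives each sentence the speaker whose diarization turns overlap it the longest, then writes SRT cues with the speaker label in front of the text. The notes above do not say how whisperX performs the assignment, so the maximum-overlap rule, the `Span` type, the function names, and the `[SPEAKER_00]: text` cue layout are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Span:
    """A time interval in seconds plus a label (sentence text or speaker id)."""
    start: float
    end: float
    label: str


def overlap(a: Span, b: Span) -> float:
    """Length in seconds of the intersection of two spans (0.0 if disjoint)."""
    return max(0.0, min(a.end, b.end) - max(a.start, b.start))


def assign_speakers(sentences, turns):
    """Return one speaker label per sentence (None if no turn overlaps it).

    Assumed rule: pick the speaker with the largest total overlap.
    """
    speakers = []
    for sentence in sentences:
        totals = {}
        for turn in turns:
            duration = overlap(sentence, turn)
            if duration > 0:
                totals[turn.label] = totals.get(turn.label, 0.0) + duration
        speakers.append(max(totals, key=totals.get) if totals else None)
    return speakers


def srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(sentences, turns) -> str:
    """Render sentences as SRT cues, prefixing the speaker when one was found."""
    cues = []
    speakers = assign_speakers(sentences, turns)
    for index, (sentence, speaker) in enumerate(zip(sentences, speakers), start=1):
        text = f"[{speaker}]: {sentence.label}" if speaker else sentence.label
        timing = f"{srt_time(sentence.start)} --> {srt_time(sentence.end)}"
        cues.append(f"{index}\n{timing}\n{text}\n")
    return "\n".join(cues)


if __name__ == "__main__":
    sentences = [Span(0.0, 2.0, "Hello there."), Span(2.0, 5.0, "How are you?")]
    turns = [Span(0.0, 2.5, "SPEAKER_00"), Span(2.5, 6.0, "SPEAKER_01")]
    print(to_srt(sentences, turns))
```

Summing overlap per speaker, rather than taking the first overlapping turn, keeps a sentence from being mislabeled when a short interjection by another speaker falls inside it.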

v3.0.2

04 May 19:53
Pre-release

torch 2.0, Python 3.10

v3.0.1

01 May 10:52
e24ca9e
Pre-release
  • fix pickling error (set num_workers=0 to perform data loading in the main process)
  • add basic diarization
  • pad audio for language detection if it is shorter than 30s
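
Whisper models work on 30-second windows of 16 kHz audio, so a shorter clip has to be extended before language detection can run on it. The sketch below shows one way to do that by appending silence (zeros) up to the window length. Zero-padding and the `pad_or_trim` helper name are assumptions for illustration; the note above does not describe the exact padding used. Plain lists keep the example dependency-free, and real code would more likely operate on NumPy or torch arrays.

```python
SAMPLE_RATE = 16_000                           # samples per second
WINDOW_SECONDS = 30                            # length of one Whisper window
WINDOW_SAMPLES = SAMPLE_RATE * WINDOW_SECONDS  # 480000 samples


def pad_or_trim(samples, length=WINDOW_SAMPLES):
    """Return exactly `length` samples: trim long audio, zero-pad short audio."""
    if len(samples) >= length:
        return list(samples[:length])
    # Assumption: missing audio is filled with silence at the end of the clip.
    return list(samples) + [0.0] * (length - len(samples))


if __name__ == "__main__":
    five_seconds = [0.1] * (5 * SAMPLE_RATE)
    window = pad_or_trim(five_seconds)
    print(len(five_seconds), "->", len(window))
```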

v3.0.0

28 Apr 15:48
cc7e168
Pre-release

batched inference with faster-whisper backend

v2.0.1

28 Apr 15:47
allow custom model_dir for torchaudio models

alpha

26 Feb 21:05
847a3cd
Merge pull request #96 from smly/fix-batch-processing

FIX: Assertion error in batch processing