Skip to content

Add WhisperX model download script and configuration files for version 3.4.2#1792

Draft
arash77 wants to merge 3 commits intobgruening:masterfrom
arash77:whisperx-data-tables
Draft

Add WhisperX model download script and configuration files for version 3.4.2#1792
arash77 wants to merge 3 commits intobgruening:masterfrom
arash77:whisperx-data-tables

Conversation

@arash77
Copy link
Contributor

@arash77 arash77 commented Feb 27, 2026

FOR CONTRIBUTOR:

  • I have read the CONTRIBUTING.md document and this tool is appropriate for the tools-iuc repo.
  • License permits unrestricted use (educational + commercial)
  • This PR adds a new tool or tool collection
  • This PR updates an existing tool or tool collection
  • This PR does something else (explain below)

Introduce a script and configuration files to facilitate the downloading of WhisperX models for version 3.4.2, enhancing the tool's functionality and model accessibility.

@arash77 arash77 force-pushed the whisperx-data-tables branch from c5e36c2 to 9a40bc8 Compare February 27, 2026 13:42
@arash77
Copy link
Contributor Author

arash77 commented Mar 2, 2026

In this PR, we only address the faster-whisper models, but the path should actually point tp the root directory for all models.
Should we add a row for each model to the data table, or should we go with the simplest option?

@arash77 arash77 marked this pull request as draft March 2, 2026 16:04
@arash77
Copy link
Contributor Author

arash77 commented Mar 2, 2026

As we already have the huggingface.loc file we could also use such thing:

# Columns: value <TAB> name <TAB> pipeline_tag <TAB> domain <TAB> free_tag <TAB> version <TAB> path
#
# --- version 3.4.2 ---
#
# ── ASR models ──────────────────────────────────────────────────────────────────────
whisperx_3.4.2_asr_turbo	Turbo (~8x faster than large)	asr	turbo	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_asr_tiny	Tiny (~10x faster than large)	asr	tiny	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_asr_base	Base (~7x faster than large)	asr	base	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_asr_small	Small (~4x faster than large)	asr	small	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_asr_medium	Medium (~2x faster than large)	asr	medium	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_asr_large	Large	asr	large	whisperx	3.4.2	/data/db/whisperx_models
#
# ── Alignment models — torchaudio (en, fr, de, es, it) ──────────────────────────────
whisperx_3.4.2_align_en	English	alignment	en	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_fr	French	alignment	fr	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_de	German	alignment	de	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_es	Spanish	alignment	es	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_it	Italian	alignment	it	whisperx	3.4.2	/data/db/whisperx_models
#
# ── Alignment models — HuggingFace ──────────────────────────────────────────────────────
whisperx_3.4.2_align_ja	Japanese	alignment	ja	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_zh	Chinese	alignment	zh	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_nl	Dutch	alignment	nl	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_uk	Ukrainian	alignment	uk	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_pt	Portuguese	alignment	pt	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_ar	Arabic	alignment	ar	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_cs	Czech	alignment	cs	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_ru	Russian	alignment	ru	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_pl	Polish	alignment	pl	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_hu	Hungarian	alignment	hu	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_fi	Finnish	alignment	fi	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_fa	Persian	alignment	fa	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_el	Greek	alignment	el	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_tr	Turkish	alignment	tr	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_da	Danish	alignment	da	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_he	Hebrew	alignment	he	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_vi	Vietnamese	alignment	vi	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_ko	Korean	alignment	ko	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_ur	Urdu	alignment	ur	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_te	Telugu	alignment	te	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_hi	Hindi	alignment	hi	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_ca	Catalan	alignment	ca	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_ml	Malayalam	alignment	ml	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_no	Norwegian	alignment	no	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_nn	Nynorsk	alignment	nn	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_sk	Slovak	alignment	sk	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_sl	Slovenian	alignment	sl	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_hr	Croatian	alignment	hr	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_ro	Romanian	alignment	ro	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_eu	Basque	alignment	eu	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_gl	Galician	alignment	gl	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_ka	Georgian	alignment	ka	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_lv	Latvian	alignment	lv	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_tl	Tagalog	alignment	tl	whisperx	3.4.2	/data/db/whisperx_models
whisperx_3.4.2_align_sv	Swedish	alignment	sv	whisperx	3.4.2	/data/db/whisperx_models

@bgruening what do you think about it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant