Skip to content
renatotn7 edited this page Mar 23, 2016 · 5 revisions

This project has a several subprojects for language processing

Statistics of unknowns words in captions This scripts make statistics of unknowns words in captions

at directory wordsFromCaptions there is the script:

  • wordsStatisticsFromCaptions.py

For this script, must exists in same directory the files

  • conhecidas.csv (file with previous known words)
  • legenda.srt(file with the captions)
  • referencia.txt (file with other captions that will basis for the statistics)
Clone this wiki locally