Home

Jump to bottom

renatotn7 edited this page Mar 23, 2016 · 5 revisions

This project has a several subprojects for language processing

Statistics of unknowns words in captions This scripts make statistics of unknowns words in captions

at directory wordsFromCaptions there is the script:

wordsStatisticsFromCaptions.py

For this script, must exists in same directory the files

conhecidas.csv (file with previous known words)
legenda.srt(file with the captions)
referencia.txt (file with other captions that will basis for the statistics)