Home
renatotn7 edited this page Mar 23, 2016 · 5 revisions
This project contains several subprojects for language processing.
Statistics of unknown words in captions
This script computes statistics on the unknown words found in captions and writes the result in CSV format.
The script is located in the wordsFromCaptions directory:
- wordsStatisticsFromCaptions.py
The following files must exist in the same directory as the script:
- conhecidas.csv (file with the previously known words)
- legenda.srt (file with the captions)
- referencia.txt (file with other captions that serve as the basis for the statistics)
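
To make the idea concrete, here is a minimal sketch of how unknown words in an SRT caption file could be counted against a list of known words and written out as CSV. It is not the code of wordsStatisticsFromCaptions.py: the function names (caption_words, unknown_word_counts, to_csv), the output columns (word, count), and the assumption that the known words are available as a plain list of strings are all hypothetical, and the role of referencia.txt is not modeled here.

```python
import csv
import io
import re
from collections import Counter

# Assumption: a "word" is a run of letters, optionally with one inner apostrophe.
WORD_RE = re.compile(r"[^\W\d_]+(?:'[^\W\d_]+)?")


def caption_words(srt_text):
    """Yield lowercase words from SRT text, skipping cue numbers and timing lines."""
    for line in srt_text.splitlines():
        line = line.strip()
        if not line or line.isdigit() or "-->" in line:
            continue
        line = re.sub(r"<[^>]+>", "", line)  # drop formatting tags such as <i>
        for word in WORD_RE.findall(line):
            yield word.lower()


def unknown_word_counts(srt_text, known_words):
    """Count caption words that are absent from the known-word list."""
    known = {w.lower() for w in known_words}
    return Counter(w for w in caption_words(srt_text) if w not in known)


def to_csv(counts):
    """Render counts as CSV text, most frequent first (hypothetical column names)."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["word", "count"])
    for word, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        writer.writerow([word, n])
    return buf.getvalue()
```

In practice, the caption text would be read from legenda.srt and the known words from conhecidas.csv; the exact layout of those files is not described on this page, so the loading step is left out of the sketch.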