# corpus_balancing

This repository hosts the program that balances our corpus according to the following criteria:

- number of tokens per author's gender
- number of tokens per temporal phase (decade)
- number of documents per author
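
As an illustration of what the three criteria measure, here is a minimal sketch that tallies them for a small in-memory corpus. It is not the program hosted in this repository: the function name `balance_report`, the record fields (`author`, `gender`, `year`, `text`), the whitespace tokenisation, and the sample documents are all assumptions made for the example.

```python
from collections import Counter


def balance_report(docs):
    """Tally a corpus along the three balancing criteria.

    `docs` is a list of dicts with the hypothetical keys
    "author", "gender", "year" and "text".
    """
    tokens_by_gender = Counter()
    tokens_by_decade = Counter()
    docs_by_author = Counter()

    for doc in docs:
        # Assumption: tokens are whitespace-separated words.
        n_tokens = len(doc["text"].split())
        # Map a year to the first year of its decade, e.g. 1843 -> 1840.
        decade = doc["year"] // 10 * 10

        tokens_by_gender[doc["gender"]] += n_tokens
        tokens_by_decade[decade] += n_tokens
        docs_by_author[doc["author"]] += 1

    return tokens_by_gender, tokens_by_decade, docs_by_author


# Invented sample records, for illustration only.
sample_docs = [
    {"author": "A", "gender": "f", "year": 1843, "text": "one two three"},
    {"author": "A", "gender": "f", "year": 1851, "text": "four five"},
    {"author": "B", "gender": "m", "year": 1849, "text": "six seven eight nine"},
]

by_gender, by_decade, by_author = balance_report(sample_docs)
```

A report like this only shows where the corpus is skewed. How documents are then added or removed to even out the counts is a separate decision that the sketch does not cover.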