Created a backup of the corpus and removed some faulty tokens

This commit is contained in:
Eduardo Cueto Mendoza 2020-08-17 10:45:28 -06:00
parent b208cacbf4
commit 6037759e37
2 changed files with 29853 additions and 913 deletions

File diff suppressed because it is too large Load Diff

29059
Corpus/CORPUS.txt.bak.txt Normal file

File diff suppressed because it is too large Load Diff