this is based on calsyslab project
 

6 lines
78 B

import nltk
nltk.download()
#corpora -> stopwords
#models -> punkt tokenizer