These are the exercise files used for Natural Language Processing with Python NLTK Training course.
The course outline can be found in
https://www.tertiarycourses.com.sg/natural-language-processing-nlp-nltk.html https://www.tertiarycourses.com.my/natural-language-processing-nlp-nltk-malaysia.html
Topic 1 Overview of Text Mining and Text Analytics
- Introduction to Natural Language Processing (NLP)
- Applications of Text Analytics and Text Mining for Business Intelligence
- Cross-Industry Standard Process for Data Mining (CRISP-DM)
Topic 2: Text Cleaning and Pre-processing
- Install Python NLTK Package
- Read In Text Corpus
- Remove Punctuation and Stop Words
- Pre-process Text using Tokenization, Stemming, Lemmatization
- Vectorize Text using Term Frequency (TF) Vectorization, N-gram and Inverse-Document Frequency (TF-IDF)
Topic 3 Text Analytics
- Part of Speech (POS) Tagging
- Name Entity Recognition (NER)
- Text Link Analysis and Feature Engineering
Topic 4: Sentimental Analysis
- Overview of Machine Learning
- Install Python Scikit Learn Package
- Build a Machine Learning Model for Sentimental Analysis
- Model Evaluation
Topic 5: Text Summarization
- Summarize Sentiment Analysis
- Visualize Text Summarization