Skip to content

tertiarycourses/NLTKTraining

Repository files navigation

Natural Language Processing with Python NLTK Training

These are the exercise files used for Natural Language Processing with Python NLTK Training course.

The course outline can be found in

https://www.tertiarycourses.com.sg/natural-language-processing-nlp-nltk.html https://www.tertiarycourses.com.my/natural-language-processing-nlp-nltk-malaysia.html

Topic 1 Overview of Text Mining and Text Analytics

  • Introduction to Natural Language Processing (NLP)
  • Applications of Text Analytics and Text Mining for Business Intelligence
  • Cross-Industry Standard Process for Data Mining (CRISP-DM)

Topic 2: Text Cleaning and Pre-processing

  • Install Python NLTK Package
  • Read In Text Corpus
  • Remove Punctuation and Stop Words
  • Pre-process Text using Tokenization, Stemming, Lemmatization
  • Vectorize Text using Term Frequency (TF) Vectorization, N-gram and Inverse-Document Frequency (TF-IDF)

Topic 3 Text Analytics

  • Part of Speech (POS) Tagging
  • Name Entity Recognition (NER)
  • Text Link Analysis and Feature Engineering

Topic 4: Sentimental Analysis

  • Overview of Machine Learning
  • Install Python Scikit Learn Package
  • Build a Machine Learning Model for Sentimental Analysis
  • Model Evaluation

Topic 5: Text Summarization

  • Summarize Sentiment Analysis
  • Visualize Text Summarization