text-normalization

Here are 45 public repositories matching this topic...

Aayshashukla / SentimentAnalysis

Twitter Sentiment Analysis using Natural Language Processing (NLP)

python nlp text-mining text-classification kaggle artificial-intelligence logistic-regression nlp-machine-learning twitter-data text-normalization

Updated May 17, 2024
Jupyter Notebook

Aalaa4444 / Text_Processing-and-Unique_Word_Extraction_fromHTML

Extract text content from an HTML page, process it, and extract unique words from the processed text. This notebook utilizes various text processing techniques including cleaning, normalization, tokenization, lemmatization or stemming, and stop words removal.

tokenizer text-extraction requests data-extraction beautifulsoup text-processing tokenization stemming lemmatization stopwords-removal text-cleaning text-normalization extract-html text-tokenization text-lemmatization

Updated Apr 5, 2024
Jupyter Notebook
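
The steps named in the description above (extraction, cleaning, normalization, tokenization, stop-word removal, unique-word extraction) can be sketched as follows. This is a minimal illustration and not the notebook's own code: the repository's tags mention requests and beautifulsoup, whereas this sketch uses only the Python standard library so that it runs without extra installs. The stop-word set, the class name `TextExtractor`, and the function name `unique_words` are assumptions made for the example, and lemmatization or stemming is deliberately left out.

```python
import re
import unicodedata
from html.parser import HTMLParser

# Hypothetical, deliberately tiny stop-word list for illustration only.
STOP_WORDS = {"a", "an", "and", "the", "is", "of", "to", "in"}


class TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


def unique_words(html):
    """Return the sorted unique non-stop-word tokens found in an HTML string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks)
    # Normalization: Unicode compatibility form plus case folding.
    text = unicodedata.normalize("NFKC", text).casefold()
    # Tokenization: runs of letters only (digits and punctuation dropped).
    tokens = re.findall(r"[^\W\d_]+", text)
    return sorted({t for t in tokens if t not in STOP_WORDS})


page = (
    "<html><head><style>p{color:red}</style></head>"
    "<body><p>The Cat and the cat sat.</p><script>var x = 1;</script></body></html>"
)
print(unique_words(page))
```

A real pipeline would likely fetch the page over HTTP and use a fuller stop-word list; those parts are omitted here to keep the sketch self-contained.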

alanbracco / twnorm

Text Normalization on tweets (Tweet Normalization)

twitter tweets text-normalization

Updated Nov 14, 2018
Python

amogh9594 / Sentiment-Analysis

Sentiment-Analysis

sentiment-analysis sentiment-classification text-normalization

Updated Mar 23, 2021
Jupyter Notebook

vn33 / Intensity-Analysis-EmotionClassification

Predict emotions (happiness, anger, sadness) from WhatsApp chat data using machine learning and deep learning models. Includes text normalization, vectorization (TF-IDF, BoW, Word2Vec, GloVe), and model evaluation.

machine-learning natural-language-processing deep-learning text-classification word2vec hyperparameter-tuning bidirectional-lstm countvectorizer glove-embeddings text-normalization emotion-classification tf-idf-vectorizer word2vec-embeddinngs

Updated May 28, 2024
Jupyter Notebook

weezymatt / text-scrapbook

Welcome to my text scrapbook! Here you will find examples of text tokenization, normalization, n-grams, and lots of text-adjacent stuff.

nlp perl n-grams tokenization text-normalization

Updated Dec 20, 2023
Jupyter Notebook

curegit / unicodecheck

Simple tool to check if Unicode text files are Unicode-normalized

unicode character-encoding text-normalization

Updated May 31, 2024
Python
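
As a rough illustration of what checking for Unicode normalization means, the sketch below uses Python's standard `unicodedata` module (`is_normalized` requires Python 3.8 or later). It is not taken from the unicodecheck tool, whose implementation and options are not described here; the helper name `is_nfc` and the sample strings are assumptions made for the example.

```python
import unicodedata


def is_nfc(text):
    """Return True if the text is already in Unicode NFC form."""
    return unicodedata.is_normalized("NFC", text)


# The same visible word, encoded two ways.
composed = "caf\u00e9"      # "é" as one precomposed code point
decomposed = "cafe\u0301"   # "e" followed by a combining acute accent

print(is_nfc(composed))
print(is_nfc(decomposed))
print(unicodedata.normalize("NFC", decomposed) == composed)
```

The two strings render identically but compare unequal until they are brought to the same normalization form, which is why such a check can matter for text files.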

Bonniface / Text-CLeaning-And-Classification

Text classification is a widely used natural language processing task in different business problems. Given a statement or document, the task involves assigning to it an appropriate category from a pre-defined set of categories. The dataset of choice determines the set of categories. Text classification has applications in emotion classification, n

text-classification nlp-machine-learning text-cleaning text-normalization text-preprocessing

Updated Nov 2, 2022
Jupyter Notebook

areeba0 / English-to-French-Translation-using-NLTK-and-Hugging-Face-Transformers-MarianMTModel

This repository provides a complete workflow for text processing using Hugging Face Transformers and NLTK. It includes modules for sentence normalization, spelling correction, word embedding generation, positional encoding computation, and English-to-French translation.

python nlp word-embeddings jupyter-notebook nltk text-normalization positional-encoding huggingface-transformers english-to-french-translation

Updated Jun 18, 2024
Jupyter Notebook

vn33 / Ecommerce-Product-Categorization

Accurate categorization of eCommerce products improves user experience and boosts search engine visibility. The project goal is to classify products into 14 predefined categories using their descriptions sourced from an eCommerce platform.

natural-language-processing ecommerce text-classification text-normalization streamlit-webapp