Underthesea - Vietnamese NLP Toolkit
Updated Oct 5, 2023 - Python
A Vietnamese natural language processing toolkit (NAACL 2018)
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for the most common Vietnamese NLP tasks.
PhoGPT: Generative Pre-training for Vietnamese (2023)
Vietnamese NLP Toolkit for Node
A Large-scale Vietnamese News Text Classification Corpus
Vietnamese question answering system with BERT
VietASR - Vietnamese Automatic Speech Recognition
Vietnamese Automatic Speech Recognition
Vietnamese Word Tokenize
Vietnamese sensitive words (including teencode), created by an ML algorithm
A Python wrapper for VnCoreNLP using a bidirectional communication channel.
COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)
We use LSTM, BiLSTM, BERT, and SVM with TF-IDF, Word2vec, and bag-of-words features to classify documents as positive (labeled as 1), neutral (labeled as 0), or negative (labeled as 2)
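As a rough illustration of the TF-IDF and label scheme described above, here is a minimal standard-library sketch. It is not the repository's code: it swaps the SVM for a simple nearest-centroid classifier over TF-IDF vectors, splits on whitespace instead of using a Vietnamese word segmenter, and trains on a handful of invented toy sentences. All function names (`tokenize`, `fit_idf`, `tfidf`, `fit_centroids`, `predict`) are hypothetical.

```python
import math
from collections import Counter

# Label scheme from the description: 1 = positive, 0 = neutral, 2 = negative.
LABELS = {1: "positive", 0: "neutral", 2: "negative"}


def tokenize(text):
    # Assumption: whitespace splitting; real Vietnamese text usually needs a word segmenter.
    return text.lower().split()


def fit_idf(docs):
    """Compute a smoothed inverse document frequency for every training term."""
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}


def tfidf(text, idf):
    """Return an L2-normalized sparse TF-IDF vector (dict), ignoring unseen terms."""
    counts = Counter(t for t in tokenize(text) if t in idf)
    vec = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {t: v / norm for t, v in vec.items()} if norm else {}


def fit_centroids(docs, labels, idf):
    """Average the TF-IDF vectors of each class (stand-in for training an SVM)."""
    sums, sizes = {}, Counter(labels)
    for doc, label in zip(docs, labels):
        acc = sums.setdefault(label, Counter())
        for t, v in tfidf(doc, idf).items():
            acc[t] += v
    return {
        label: {t: v / sizes[label] for t, v in acc.items()}
        for label, acc in sums.items()
    }


def predict(text, idf, centroids):
    """Pick the class whose centroid has the largest dot product with the text vector."""
    vec = tfidf(text, idf)

    def score(label):
        centroid = centroids[label]
        return sum(v * centroid.get(t, 0.0) for t, v in vec.items())

    return max(sorted(centroids), key=score)


# Invented toy data, for illustration only.
train_docs = [
    "sản phẩm rất tốt",
    "dịch vụ tuyệt vời",
    "sản phẩm bình thường",
    "giao hàng tạm được",
    "sản phẩm rất tệ",
    "dịch vụ quá kém",
]
train_labels = [1, 1, 0, 0, 2, 2]

idf = fit_idf(train_docs)
centroids = fit_centroids(train_docs, train_labels, idf)

for sentence in ["dịch vụ rất tốt", "sản phẩm quá tệ"]:
    print(sentence, "->", LABELS[predict(sentence, idf, centroids)])
```

In practice the nearest-centroid step would be replaced by a trained SVM or one of the neural models listed, and the tokenizer by a proper Vietnamese segmenter; the label mapping is the only part taken directly from the description.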