Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
Updated Jun 8, 2024 - Python
A repository collecting papers and code in the field of AI.
Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
Yet Another Transformer Implementation
Seq2SeqSharp is a tensor-based, fast, and flexible deep neural network framework written in .NET (C#). Its highlights include automatic differentiation, multiple network types (Transformer, LSTM, BiLSTM, and so on), multi-GPU support, cross-platform support (Windows, Linux, x86, x64, ARM), and multimodal models for text and images.
Developing Natural Language Processing tools to enhance Learning Analytics. Creating an automated dashboard that diagnoses strengths and weaknesses from educational data.
Sentiment analysis on the IMDB dataset using Bag of Words models (Unigram, Bigram, Trigram, Bigram with TF-IDF) and Sequence to Sequence models (one-hot vectors, word embeddings, pretrained embeddings like GloVe, and transformers with positional embeddings).
Extractive Nepali Question Answering System | Browser Extension & Web Application
A novel implementation fusing ViT with Mamba into a fast, agile, high-performance multimodal model. Powered by Zeta, the simplest AI framework ever.
This is the project repo associated with the paper "Disentangling and Integrating Relational and Sensory Information in Transformer Architectures" by Awni Altabaa, John Lafferty
Slides from my NLP course on the transformer architecture
This study investigates the effectiveness of three Transformers (BERT, RoBERTa, XLNet) in handling data sparsity and cold-start problems in recommender systems. We present a Transformer-based hybrid recommender system that predicts missing ratings and extracts semantic embeddings from user reviews to mitigate these issues.
Simple character level Transformer
Simple PyTorch implementation of the paper "Attention Is All You Need" - https://arxiv.org/abs/1706.03762
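Several of the repositories above implement the attention mechanism from "Attention Is All You Need". Its core operation, scaled dot-product attention, is softmax(QKᵀ/√d_k)·V; the following is a minimal pure-Python sketch for illustration, not code from any listed repository:

```python
import math

def scaled_dot_product_attention(Q, K, V):
    """Minimal scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V.
    Q, K, V are lists of row vectors (lists of floats)."""
    d_k = len(K[0])
    # Similarity of each query to each key, scaled by sqrt(d_k)
    scores = [[sum(q * k for q, k in zip(q_row, k_row)) / math.sqrt(d_k)
               for k_row in K] for q_row in Q]
    # Row-wise softmax (shifted by the row max for numerical stability)
    weights = []
    for row in scores:
        m = max(row)
        exps = [math.exp(s - m) for s in row]
        z = sum(exps)
        weights.append([e / z for e in exps])
    # Each output row is the attention-weighted sum of the value rows
    return [[sum(w * v_row[j] for w, v_row in zip(w_row, V))
             for j in range(len(V[0]))] for w_row in weights]
```

Real implementations batch this over heads and use tensor libraries such as PyTorch, but the arithmetic is the same.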
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
A desktop application to assist in learning languages, using a deep learning model to generate translations.
Inference Llama 2 in one file of pure 🔥
Transformer implementation from scratch for next-character prediction.
Facial attribute recognition using the Transformer architecture, achieving 91% accuracy on CelebA.
Official implementation of "Particle Transformer for Jet Tagging".