| Student name | CCID |
| --- | --- |
| Ravika Nagpal | ravika |
| Chirag Daryani | cdaryani |
| Rohan Verma | rverma3 |
| Karan Chadha | kchadha1 |
We are attempting the problem of Text Entailment: determining whether one piece of text logically follows from another. It is a Natural Language Inference (NLI) task in which, given pairs of short text snippets (one or more English sentences), known as Text-Hypothesis (T-H) pairs, we must decide whether the hypothesis is entailed by, contradicts, or is neutral with respect to the text. For example, the text "A soccer game with multiple males playing." entails the hypothesis "Some men are playing a sport."
In this project, we perform a quantitative analysis of prevalent models and embeddings for the task of text entailment, evaluating which model and embedding combination performs best.
For our task, we use the Stanford Natural Language Inference (SNLI) corpus, a collection of 570k human-written English sentence pairs manually labeled for balanced classification with the labels entailment, contradiction, and neutral. Each entry in the dataset is a premise-hypothesis pair with a gold label, along with the parse structure of each sentence in Penn Treebank format. There are 550,152 training pairs, 10,000 validation pairs, and 10,000 test pairs.
The complete dataset can be downloaded from the SNLI project page: https://nlp.stanford.edu/projects/snli/.
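Once extracted, each split ships as a JSON-lines file. A minimal sketch of reading it (the field names follow the official SNLI release; the extraction path is an assumption):

```python
import json

def read_snli(path):
    """Yield (premise, hypothesis, label) triples, skipping unlabelled pairs."""
    with open(path, encoding="utf-8") as f:
        for line in f:
            ex = json.loads(line)
            if ex["gold_label"] == "-":  # no annotator consensus; usually dropped
                continue
            yield ex["sentence1"], ex["sentence2"], ex["gold_label"]

# Assumes the zip has been extracted to ./snli_1.0/
train = list(read_snli("snli_1.0/snli_1.0_train.jsonl"))
print(len(train), train[0])
```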
- `GLoVe-all-models.ipynb`: All models built on GloVe embeddings (see the sketch after this list).
- `elmo-embedding-all-models.ipynb`: All models (CNN + LSTM + Bi-LSTM) built on ELMo embeddings.
- `BERT-model.ipynb`: Fine-tuned BERT model.
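To make the file list concrete, here is a minimal sketch of a Bi-LSTM sentence-pair classifier over pretrained word embeddings, the general shape of the GloVe-based models above. This is an assumed architecture, not the exact notebook code; the vocabulary size, sequence length, and layer widths are illustrative.

```python
from tensorflow import keras
from tensorflow.keras import layers

VOCAB_SIZE = 20000  # assumed vocabulary size
EMBED_DIM = 300     # GloVe 300-d vectors
MAX_LEN = 42        # assumed tokens per sentence

def make_encoder():
    # Shared embedding + Bi-LSTM encoder applied to both premise and hypothesis.
    # In practice the Embedding layer is initialised with the GloVe matrix.
    inp = keras.Input(shape=(MAX_LEN,), dtype="int32")
    x = layers.Embedding(VOCAB_SIZE, EMBED_DIM, mask_zero=True)(inp)
    x = layers.Bidirectional(layers.LSTM(128))(x)
    return keras.Model(inp, x)

premise = keras.Input(shape=(MAX_LEN,), dtype="int32")
hypothesis = keras.Input(shape=(MAX_LEN,), dtype="int32")
encoder = make_encoder()
p, h = encoder(premise), encoder(hypothesis)

# Common NLI matching features: concatenation, difference, elementwise product.
features = layers.Concatenate()(
    [p, h, layers.Subtract()([p, h]), layers.Multiply()([p, h])]
)
output = layers.Dense(3, activation="softmax")(features)  # entail / neutral / contradict

model = keras.Model([premise, hypothesis], output)
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
```

Swapping the Bi-LSTM encoder for a CNN or plain LSTM, or the embedding layer for ELMo, yields the other model variants covered in the notebooks.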
- https://github.com/DamianValle/deep-entailment
- https://towardsdatascience.com/lstm-text-classification-using-pytorch-2c6c657f8fc0
- https://colab.research.google.com/github/keras-team/keras-io/blob/master/examples/nlp/ipynb/semantic_similarity_with_bert.ipynb#scrollTo=BuS_XgmIaQrd
- https://github.com/PatrickBD/Natural_Language_Processing_Compilation