Contrastive Pre-Training of Transformer Models for Computational Framing Analysis

This repository contains the notebooks and the mCPT framework used for mCPT at SemEval-2023 Task 3 and for my thesis. Notebooks 0x contain an analysis of the data, and notebooks 3x and 4x correspond to the methodology and results sections of my thesis. The mcpt directory contains the mCPT PyTorch framework.

Ertl, A., Reiter-Haas, M., Innerebner, K., and Lex, E. (2023). mCPT at SemEval-2023 Task 3: Multilingual Label-Aware Contrastive Pre-Training of Transformers for Few- and Zero-shot Framing Detection. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 941–949, Toronto, Canada. Association for Computational Linguistics.

Datasets

GVFC (Liu, S., Guo, L., Mays, K., Betke, M., and Wijaya, D. T. (2019). Detecting Frames in News Headlines and Its Application to Analyzing News Framing Trends Surrounding U.S. Gun Violence. In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pages 504–514.)
LOCO (Miani, A., Hills, T., and Bangerter, A. (2021). LOCO: The 88-million-word language of conspiracy corpus. Behavior Research Methods, 54(4):1794–1817.)
SemEval (Piskorski, J., Stefanovitch, N., Da San Martino, G., and Nakov, P. (2023). SemEval-2023 Task 3: Detecting the Category, the Framing, and the Persuasion Techniques in Online News in a Multi-lingual Setup. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Toronto, Canada.)
US-Economic-News.csv (https://www.kaggle.com/datasets/heeraldedhia/us-economic-news-articles)
