Skip to content

NLP analysis of dialogue rules on Reddit conducted during BSc. Code sample includes latent Dirichlet allocation and hyperparameter optimisation conducted in R.

Notifications You must be signed in to change notification settings

DavidFeng-GitHub/reddit-nlp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 

Repository files navigation

Reddit Natural Language Processing

This research report was conducted as part of the PB312: Research Apprenticeship module during my BSc.

I applied NLP techniques including topic modeling (latent Dirichlet allocation) and supplementary co-occurrence network analysis to examine the latent structure of dialogue norms across 11,000 subreddits extracted using Reddit API. Models were refined using a range of hyperparameter optimisation techniques, including k-fold cross-validation with parallel computing.

This socio-technical investigation aimed to extend our understanding of the typology of community rules on online platforms essential to designing moderation practices and technologies.

Repository contains research report and analysis script coded in R.

About

NLP analysis of dialogue rules on Reddit conducted during BSc. Code sample includes latent Dirichlet allocation and hyperparameter optimisation conducted in R.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published