A basic R package for scraping Reddit data using the pushshift API

nathancunn/pushshiftR

pushshiftR

This is a very basic R package for fetching Reddit data using the pushshift API. At present, the package should suit general users, but it is not a general-purpose package.

Installation

devtools::install_github("https://github.com/nathancunn/pushshiftR")

Basic use

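To get top-level comments from /r/soccer posted after January 1st 2019 (the after argument is a Unix timestamp in seconds; 1546300800 is 2019-01-01 00:00:00 UTC):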

getPushshiftData(postType = "comment",
                 size = 1000,
                 after = "1546300800",
                 subreddit = "soccer",
                 nest_level = 1)
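
The after value in the call above is a Unix timestamp passed as a string. As a minimal sketch, a date can be converted to that form with base R. The helper name to_epoch is hypothetical and not part of pushshiftR, and the use of UTC is an assumption about how the timestamp is interpreted.

```r
# Hypothetical helper (not part of pushshiftR): convert a calendar date
# to a Unix timestamp string, assuming midnight in the given time zone.
to_epoch <- function(date_string, tz = "UTC") {
  # as.POSIXct parses the date; as.numeric gives seconds since 1970-01-01 UTC
  seconds <- as.numeric(as.POSIXct(date_string, tz = tz))
  # sprintf avoids scientific notation in the resulting string
  sprintf("%.0f", seconds)
}

after_2019 <- to_epoch("2019-01-01")
print(after_2019)
```

The resulting string could then be supplied as the after argument in place of the hard-coded value.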

Acknowledgments

This package is basically an R implementation of the code here and uses the pushshift API to download Reddit data. If you use this, you might consider donating to pushshift.
