Skip to content

a-luna/vigorish

Repository files navigation

PyPI version PyPI - Downloads PyPI - License PyPI - Python Version Maintainability codecov

vigorish

vigorish is a hybrid Python/Node.js application that scrapes MLB data from mlb.com, brooksbaseball.net and baseball-reference.com.

My goal is to capture as much data as possible — ranging from PitchFX measurements at the most granular level to play-by-play data (play descriptions, substitutions, manager challenges, etc) and individual player pitch/bat stats at the highest level.

Requirements

  • Python 3.6+
  • Node.js 10+ (Tested with Node.js 11-13)
  • Xvfb
  • AWS account (optional but recommended, used to store scraped data in S3)

Project Documentation

For a step-by-step install guide and instructions for configuring/using vigorish, please visit the link below:

Vigorish: Hybrid Python/Node.Js Web Scraper

Credits

vigorish either relies on the following projects listed below directly or as a dev dependency. It would not have been possible for me to create vigorish without these projects, thanks to all of the creators/maintainers for making these available (projects are listed alphabetically):