Skip to content

orianeido/SparkAB

Repository files navigation

SparkAB

A comprehensive analytics platform tailored for A/B testing using Apache Kafka and Spark.

Features

  • Real-time Data Integration: Utilizes Kafka for real-time data acquisition.
  • Analytics with Spark: Processes and analyzes data streams with Spark.
  • Insightful Dashboards: Translates datasets into actionable insights.

Getting Started

Prerequisites

  • Apache Kafka
  • Apache Spark
  • Python libraries: pyspark, csv, time, IPython, matplotlib, panda, scipy

Installation & Setup

  1. Clone the repository: git clone https://github.com/idooriane/SparkAB.git
  2. Navigate to the directory: cd SparkAB
  3. Database download - Click Here
  4. Open "Steps to Run SparkAB.pdf" for setup instructions

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

MIT