Skip to content
#

spark-dataframes

Here are 42 public repositories matching this topic...

Predict the success of Kickstarter campaigns using machine learning. Analyze project data including financial goals, pledge amounts, categories, and outcomes. Perform data cleaning, queries, visualizations, and build models to forecast campaign success, helping entrepreneurs optimize their funding strategies

  • Updated May 22, 2024
  • Jupyter Notebook

This repository contains the implementation of a wide variety of BigData Projects in different applications of NoSQL databases, Spark, Data Pipelines, and map-reduce. These projects include university projects and projects implemented due to interest in BigData.

  • Updated Sep 28, 2023
  • Jupyter Notebook

Use this project to join data from multiple csv files. Currently in this project we support one to one and one to many join. Along with this you can find how to use kafka producer efficiently with spark.

  • Updated Jul 1, 2022
  • Java

Improve this page

Add a description, image, and links to the spark-dataframes topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the spark-dataframes topic, visit your repo's landing page and select "manage topics."

Learn more