Skip to content

Use this project to join data from multiple csv files. Currently in this project we support one to one and one to many join. Along with this you can find how to use kafka producer efficiently with spark.

Notifications You must be signed in to change notification settings

mayankrawat/CSVJoin

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

CSVJoin

Use this project to join data from multiple csv files. Currently in this project we support one to one and one to many join. Along with this you can find how to use kafka producer efficiently with spark. Metdata for whole joining process is defined in #datasource.json.

Commandline arguments:

  1. Boolean value (true to unable kafka otherwise false)
  2. Kafka topic name (If one is true )

Plateforms:

  1. Spark 2.1
  2. Kafka and zookeeper (If Kafka is enable)

Releases

No releases published

Packages

No packages published

Languages