This repo contains R files.
It is spark SQL API.
####Select Select table and subset of columns. Also create new columns.
- Table name - Mandatory
- Boolean optional flag (TRUE or FALSE) to indicate if original columns should be kept or not - Optional
- Column names - Mandatory
####Join Usage jdf1 = Join(tpcds1gb.store_sales, tpcds1gb.store, tpcds1gb.customer)
It is json data. It contain orders. Orders has data about order, customer, shipping information. It has sales employee information and the territories that the employee covers. Each territory details is provided.
It also has Customers details. Id and complete address of the customers is provided.
OrderDetails has information about OrderDetails like Product, Categories, Shipping details etc.