These are the exercise files used for Big Data Analysis with Apache Hive course.
The course outline can be found in
https://www.tertiarycourses.com.sg/big-data-analysis-apache-hive.html
https://www.tertiarycourses.com.my/big-data-analysis-with-apache-hive-malaysia.html
Module 1: Get Started on Apache Hive
- What is Hive?
- How Hive Works with Hadoop
- Install CDH on VirtualBox
- Hue 4 UI Overview
Module 2: Basic Hive Operations
- Create and Drop Database
- Create and Drop Table
- Create Table from CSV File
- Alter Table
- Fix CSV File with Serde
- Load Data to Empty Table
- Partition Tables
Module 3: HiveQL
- Retrieve Data with SELECT
- SELECT Options
- Operators and Built In Functions
- Filter data with WHERE
Module 4: Aggregating Data
- Hive Aggregations
- Having
- Grouping Sets
- Cube & Rollup
Module 5: Joining Tables
- Combining Tables with JOIN
- EXIST & SEMI JOIN
- Joining Multiple Tables
Module 6: Data Analysis with Apache Hive
- Math Functions
- String Functions
- Date Functions
- Conditional Statements