An end-to-end data engineering pipeline built with Big Data and Machine Learning tools and technologies
Updated May 23, 2022 (Jupyter Notebook)
Modern Big Data Analysis: recommend which pair of United States airports should be connected with a high-speed passenger rail tunnel.
Performance benchmarking for Solr, Elastic, Impala, and SparkSQL (via SparkThrift) using JMeter
This repository walks through Impala, combining theoretical foundations with hands-on SQL tasks that cover data virtualization, database creation, and query optimization.
Real-time market basket analysis using Hive on dynamically updated data
Third-semester task for the DLUT Big Data Department and SensorsData, in cooperation with @Bellick
Big Data Engineering Capstone Project. Tech stack: Linux, MySQL, Sqoop, HDFS, Hive, Impala, SparkSQL, SparkML, Git
PyTorch implementation of various distributed reinforcement learning algorithms
Code for paper "Deep Reinforcement Learning based Multi-task Automated Channel Pruning for DNNs"