cloudera
Here are 178 public repositories matching this topic...
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3, LDAP, Code/Build Linting, pkg mgmt for Linux, Mac, Python, Perl, Ruby, NodeJS, Golang, Advanced dotfiles: .bashrc, .vimrc, .gitconfig, .screenrc, tmux..
Updated May 26, 2024 - Shell
CDP Public Cloud is an integrated analytics and data management platform deployed on cloud services. It offers broad data analytics and artificial intelligence functionality along with secure user access and data governance features.
Updated May 24, 2024 - Java
FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
Updated May 23, 2024
Ansible Execution Environment images for Cloudera Data Platform (CDP) Public and Private Cloud
Updated May 21, 2024 - Shell
This workshop uses a publicly available airlines data set to showcase how CDW can be used to build an Open Data Lakehouse with Apache Iceberg.
Updated May 6, 2024
DocGenius AI - Generative AI Chatbot for your Documents - Powered by Cloudera Machine Learning (CML)
Updated Apr 29, 2024 - Python
Perl Utility Library for my other repos
Updated Apr 23, 2024 - Perl
450+ AWS, Hadoop, Cloud, Kafka, Docker, Elasticsearch, RabbitMQ, Redis, HBase, Solr, Cassandra, ZooKeeper, HDFS, Yarn, Hive, Presto, Drill, Impala, Consul, Spark, Jenkins, Travis CI, Git, MySQL, Linux, DNS, Whois, SSL Certs, Yum Security Updates, Kubernetes, Cloudera etc...
Updated May 21, 2024 - Python
80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Kubernetes, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Updated Feb 9, 2024 - Shell
The objective of this project is to build a data pipeline, using Hive and Python, that analyses the MovieLens 25M database and shows the results in PowerBI.
Updated Feb 1, 2024 - Jupyter Notebook
The power of librdkafka for Python
Updated Jan 23, 2024 - Python
This repository serves as a hands-on implementation of a Big Data platform focused on processing parliamentary data from the website of the Moroccan Parliament. The project aims to calculate Key Performance Indicators (KPIs) to evaluate the engagement level of each government.
Updated Dec 8, 2023 - Jupyter Notebook
Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.
Updated Nov 28, 2023 - Shell