[PROJECT IS NO LONGER MAINTAINED] Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
-
Updated
Mar 22, 2022 - Scala
[PROJECT IS NO LONGER MAINTAINED] Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
A scalable, mature and versatile web crawler based on Apache Storm
Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a topology DSL.
[PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Fast Advanced Spam Analysis Tool
This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.
News crawling with StormCrawler - stores content as WARC
Docker image packaging for Apache Storm
Apache Pulsar Adapters
Battle-tested Apache Storm Multi-Lang implementation for Python
A framework for building spouts for Apache Storm and a Kafka based spout for dynamically skipping messages to be processed later.
Storm Debian Packaging with dpkg-buildpackage
A curated list of Pulsar tools, integrations and resources.
Apache Storm cluster on Docker
A dockerized image of Apache Storm (Zookeeper, Nimbus, Supervisor, Ui, Logviewer.)
Investigating the trade-offs of low latency responses over quality when applying machine learning algorithms over lambda architecture.
Resources for running StormCrawler with Docker services
CISC 5950 Big Data Programming Final Project. Storm cluster tutorial and application.
Real time computation system with Apache Storm, Apache Kafka and Google Guice
Add a description, image, and links to the apache-storm topic page so that developers can more easily learn about it.
To associate your repository with the apache-storm topic, visit your repo's landing page and select "manage topics."