A real-time streaming pipeline can stream live tweets from Twitter and ingest the data to the Apache Kafka clusters as a topic, and consumers consume the hashtag tweets as a message.
Create a developer account here to get the Twitter API key. Then change the Twitter API key configuration at the virtual environment level by the newly generated API key.
export CONSUMER_KEY="XXXXXXXXXXXXXXXXXXXXXXXXX"
export CONSUMER_SECRET="XXXXXXXXXXXXXXXXXXXXXX"
export ACCESS_KEY="XXXXXXXXXXXXXXXXXXXXXXXXXXX"
export ACCESS_SECRET="XXXXXXXXXXXXXXXXXXXXXXXX"
git clone https://github.com/saboye/Real-Time-Data-Streaming-Pipeline.
pip install -r requirements.txt
advertised.listeners=PLAINTEXT://localhost:9092
zookeeper.connect=localhost:2181
To run locally you need to installed Apache Kafka
bin/zookeeper-server-start.sh config/zookeeper.properties
JMX_PORT=8004 bin/kafka-server-start.sh config/server.properties
👤 SABOYE
Contributions, issues and feature requests are welcome!
Feel free to check issues page.
Give a ⭐️ if this project helped you!
This project is MIT licensed.