Kafka Athena Streaming Project

Project Description:

In this project I have implemented an End-To-End Data Engineering Project simulating real time data using stock market data using Kafka. The following technologies were used:

Architecture

EC2 and Kafka Setup:

Python code to simulate streaming stock data:

Python code get the data from consumer and upload to an S3 bucket:

Setting up a glue crawler to crawl the s3 bucket:

Exploring the data in Athena: