Dashboard
Real-time statistics
SF crime statistics with Spark Streaming

In this project, I simulate a real-time data streaming using a dataset extracted from Kaggle on San Francisco crime incidents. I created a Kafka server to produce the data and ingest it through Spark Structured Streaming, updating real-time statistics over the data arrival.

This project is connected to the Data Streaming Nanodegree, from Udacity. I used Python, Apache Spark, and Kafka.

VIEW PROJECT