Dataset: Crime Classification of San Francisco
to run the file run these commands at the concurrantly:
/opt/spark/bin/spark-submit bd-df2.py 2>log.txt
python3 stream.py -f crime -b 1000
for graph plots kindly install the following: matplotlib seaborn folium squarify