Techlabs

A project in collaboration with the local business Pickshare to help them better understand their customers and provide their business partners with optimal marketing strategies.

Techies:

  1. My Nguyen: Master's student in Digital Transformation at FH Dortmund
  2. Christoph Klippenstein: Studies Mechanical Engineering at TU Dortmund, specialized in mechanical simulations
  3. Shourav Deb Nath: Studies Manufacturing Technology (MS) at TU Dortmund

Make the code run:

  • To run the code, you need to change the paths of the .json files (users.json and packages.json). These files are in the "Used Data & Results.zip" archive, in the folder "Used_json_files".

    Example code:

    users = pd.read_json('**')       # ** = path of the users.json file

    packages = pd.read_json('***')   # *** = path of the packages.json file

  • You can adjust the size of all figures and of the text in them with the variables:

    my_font_size = 10

    figure_size = 9

    You have to adjust these two values separately to get good-looking figures.

    [Figures: Cities_have_the_most_order, Cities_have_the_most_users, packages_per_user]

  • There are two separate notebooks, which run independently of each other:

    • Pickshare_Time_Series_Analysis.ipynb

      [Figures: Top_5_most_ordered_senders_by_date, top_5_most_ordered_senders, Top_5_most_ordered_sorted_by_date, Top_5_cities_have_the_most_orders, Top_5_cities_have_the_most_by_day, Average_processing_time_by_day]

    • Pickshare_plz_&_maps.ipynb

    The three other files:

    • old_version.ipynb
    • old_version2.ipynb
    • techlabs_notebook_titanic.ipynb

    can be ignored, since they are only backups for the developers.
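
    If you prefer to set the data location in one place, the loading step could look roughly like the sketch below. It assumes that both .json files sit in the same folder and that they hold a list of records that pd.read_json can parse directly. The names DATA_DIR and load_pickshare_data are made up for this example and do not appear in the notebooks.

```python
from pathlib import Path

import pandas as pd

# Hypothetical location: wherever you extracted "Used Data & Results.zip".
DATA_DIR = Path("Used_json_files")


def load_pickshare_data(data_dir=DATA_DIR):
    """Load users.json and packages.json from data_dir into two DataFrames."""
    data_dir = Path(data_dir)
    frames = {}
    for name in ("users", "packages"):
        path = data_dir / f"{name}.json"
        if not path.is_file():
            # Fail early with a readable message instead of a parser error.
            raise FileNotFoundError(f"Expected {path}; adjust data_dir to your local copy.")
        frames[name] = pd.read_json(path)
    return frames["users"], frames["packages"]
```

    Calling users, packages = load_pickshare_data("path/to/Used_json_files") then replaces the two separate pd.read_json lines.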

Nice to know

  • Any changes to the figures are saved as .jpg files in the folder "/content" with a resolution of 300 dpi after you run the code.
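
    For orientation, here is a minimal sketch of how the two size variables and the 300 dpi export might fit together in matplotlib. The variable names my_font_size and figure_size come from this README; everything else is an assumption, including the helper name plot_and_save, the square figure, and the bar chart itself. The notebooks may do this differently.

```python
from pathlib import Path

import matplotlib
matplotlib.use("Agg")  # headless backend so the sketch also runs without a display
import matplotlib.pyplot as plt

my_font_size = 10  # text size in the figures
figure_size = 9    # assumed here to be both width and height, in inches


def plot_and_save(counts, name, out_dir="/content"):
    """Draw a bar chart from a {label: count} dict and save it as a 300 dpi .jpg."""
    fig, ax = plt.subplots(figsize=(figure_size, figure_size))
    ax.bar(list(counts.keys()), list(counts.values()))
    ax.set_title(name, fontsize=my_font_size)
    ax.tick_params(labelsize=my_font_size)
    out_path = Path(out_dir) / f"{name}.jpg"
    out_path.parent.mkdir(parents=True, exist_ok=True)
    fig.savefig(out_path, dpi=300)
    plt.close(fig)  # free the figure so repeated runs do not pile up memory
    return out_path
```

    Outside of Google Colab the folder "/content" usually does not exist, so pass a different out_dir there.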

Further improvements:

  • In the Data Processing section, the cleaning_text() function needs further improvement. Right now its main job is to get rid of unknown characters so that the code works. Further improvements could be to automate this procedure for all kinds of unknown characters and to unify the individual properties into a standard format.

  • Turn the maps into heat maps for better visualization.
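
    One possible direction for the first point, as a sketch only: this is not the current cleaning_text() from the notebook, but a hypothetical replacement built on the Python standard library. It assumes that "unknown characters" means control characters, invisible format characters and the Unicode replacement character, and that title case is an acceptable standard format for fields such as city names.

```python
import re
import unicodedata


def cleaning_text(value):
    """Normalize one text field: fix Unicode form, drop unprintable characters, unify spacing and case."""
    if not isinstance(value, str):
        return value  # leave missing values and numbers untouched
    # Compose characters such as "u" + combining diaeresis into a single "ü".
    text = unicodedata.normalize("NFKC", value)
    # Keep whitespace; drop control/format/unassigned characters (category "C*") and U+FFFD.
    text = "".join(
        ch for ch in text
        if ch.isspace() or (unicodedata.category(ch)[0] != "C" and ch != "\ufffd")
    )
    # Collapse runs of whitespace into single spaces and trim the ends.
    text = re.sub(r"\s+", " ", text).strip()
    return text.title()  # assumption: title case as the unified format
```

    Title case is a simplification, for example "Frankfurt am Main" becomes "Frankfurt Am Main", so a lookup table of known city names may be the better long-term fix.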

Known bugs and errors:

  • Average processing time: There are a lot of missing values and therefore a lot of nonsense data. When either the start time or the end time is missing, the calculation makes no sense.
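
    A possible workaround, sketched under assumptions: compute the processing time only for rows where both timestamps are present and the end is not before the start. The column names "starttime" and "endtime" are placeholders and may not match the real fields in packages.json, and the helper name processing_time is made up for this example.

```python
import pandas as pd


def processing_time(packages, start_col="starttime", end_col="endtime"):
    """Return processing durations for rows with both timestamps present and in order."""
    # Unparseable or missing values become NaT instead of raising an error.
    start = pd.to_datetime(packages[start_col], errors="coerce")
    end = pd.to_datetime(packages[end_col], errors="coerce")
    duration = end - start
    # Comparisons against NaT are False, so incomplete rows are filtered out as well.
    valid = start.notna() & end.notna() & (duration >= pd.Timedelta(0))
    return duration[valid]
```

    The result keeps the original row index, so it can be joined back to the packages table before averaging by day.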

Used sources in the code: