Java Data Retrieval and Model Training

This project contains all files required to retrieve data from GH Archive. The retrieved data can be used to train a tokenizer and a machine learning model. The resulting model is available on Hugging Face.
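
As a rough illustration of what retrieving data from GH Archive involves, here is a minimal sketch in Python. It is not this project's own retrieval code. It assumes GH Archive's public hourly dumps at `https://data.gharchive.org/YYYY-MM-DD-H.json.gz` (gzipped, one JSON event per line); the function names `archive_url`, `fetch_archive`, and `iter_events`, the `User-Agent` string, and the optional filter on the event `type` field are all hypothetical choices made for this example.

```python
import gzip
import json
import urllib.request
from datetime import datetime
from typing import Iterator, Optional

# Assumption: GH Archive's public hourly dump location.
BASE_URL = "https://data.gharchive.org"


def archive_url(hour: datetime) -> str:
    """Build the URL of one hourly archive (the hour is not zero-padded)."""
    return f"{BASE_URL}/{hour:%Y-%m-%d}-{hour.hour}.json.gz"


def fetch_archive(hour: datetime, timeout: float = 60.0) -> bytes:
    """Download one hourly archive and return its raw gzipped bytes."""
    # Hypothetical User-Agent; set explicitly in case the default one is rejected.
    request = urllib.request.Request(
        archive_url(hour), headers={"User-Agent": "gharchive-example"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()


def iter_events(gz_bytes: bytes, event_type: Optional[str] = None) -> Iterator[dict]:
    """Yield events from a gzipped, newline-delimited JSON archive.

    If event_type is given, only events whose "type" field matches are yielded.
    """
    for line in gzip.decompress(gz_bytes).splitlines():
        if not line.strip():
            continue  # skip blank lines
        event = json.loads(line)
        if event_type is None or event.get("type") == event_type:
            yield event
```

With network access, `iter_events(fetch_archive(datetime(2015, 1, 1, 15)), "PushEvent")` would iterate over the push events recorded in that hour. Which events and fields this project actually keeps for training is not stated here, so any filtering shown is illustrative only.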