Dataset handling is very inefficient #34

mvirgo · 2020-05-08T07:14:13Z

The current way the dataset is loaded for training is very inefficient: it loads the whole dataset into memory at once. I should consider no longer storing the dataset as a pickle file, and look at whether to use flow_from_directory or a similar technique to load it in batches.
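As a rough illustration of the batch-wise alternative, here is a minimal sketch of a generator that reads only one batch from disk at a time. It is not the repo's code. It assumes each image and label has already been saved to its own `.npy` file, and the name `batch_generator` and the `load_fn` parameter are hypothetical, chosen for this example.

```python
import numpy as np


def batch_generator(image_paths, label_paths, batch_size=32, load_fn=np.load):
    """Yield (images, labels) batches, reading one batch from disk at a time.

    Assumption: every path points to a single sample that load_fn can read
    (np.load on .npy files by default), and all samples share one shape.
    """
    if len(image_paths) != len(label_paths):
        raise ValueError("image_paths and label_paths must have the same length")

    for start in range(0, len(image_paths), batch_size):
        stop = start + batch_size
        # Only the files for this batch are read, so memory use is bounded
        # by batch_size rather than by the size of the whole dataset.
        images = np.stack([load_fn(p) for p in image_paths[start:stop]])
        labels = np.stack([load_fn(p) for p in label_paths[start:stop]])
        yield images, labels
```

This makes a single pass over the data. For training over several epochs it would need to be wrapped in a loop, ideally shuffling the paths each time.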


NickSotir · 2020-05-08T11:52:43Z

Are you, by any chance, able to provide the raw dataset (meaning the images and labels without being pickled)?

mvirgo · 2020-10-02T02:48:36Z

Sorry I missed your comment @NickSotir. When I first checked, it looked like I had deleted the raw data to save space, but it turns out I do still have the full-size images (1,978 images at 1280x720). I have uploaded them here.

It looks like I only have the 112x112 versions of the labels (the same as in the pickle file), although I think you won't lose much information if you re-size them as needed. Otherwise, you should be able to load the pickle files and use Pillow's Image.fromarray() function to save the labels out as separate files.

Note that the labels from the pickle file will look like essentially nothing on their own, since they are a single channel where each pixel is 0 for not lane or 1 for lane. I made a more "human viewable" version with a process similar to the top answer here. The human viewable version, which has the same channel stacked 3 times (for RGB) and scaled to 0 to 255, can be found here, or alternatively the "binary" version (what the model used) is here.
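For anyone wanting to reproduce the "human viewable" conversion described above, here is a minimal sketch, not the exact process that was used. It assumes each label is a NumPy array of 0s and 1s with shape (H, W) or (H, W, 1). The helper names `to_viewable` and `save_labels`, and the output file naming, are hypothetical.

```python
import os

import numpy as np


def to_viewable(label):
    """Turn a 0/1 single-channel mask into a uint8 RGB array of 0s and 255s."""
    mask = np.asarray(label)
    if mask.ndim == 3 and mask.shape[-1] == 1:
        mask = mask[..., 0]  # drop the trailing channel axis
    scaled = (mask > 0).astype(np.uint8) * 255  # 0 stays 0, 1 becomes 255
    return np.stack([scaled] * 3, axis=-1)  # same channel repeated for R, G, B


def save_labels(labels, out_dir):
    """Save each label as its own PNG using Pillow's Image.fromarray()."""
    # Imported here so to_viewable() can be used without Pillow installed.
    from PIL import Image

    os.makedirs(out_dir, exist_ok=True)
    for i, label in enumerate(labels):
        path = os.path.join(out_dir, f"label_{i:04d}.png")  # hypothetical naming
        Image.fromarray(to_viewable(label)).save(path)
```

To keep the "binary" version instead, the scaling and stacking can be skipped, though the saved images will then look almost entirely black.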
