DataTau | This is a tutorial-style post that offers an alternative approach to dealing wit...

DataTau

1 point by timehaven 2463 days ago | link | parent

This is a tutorial-style post that offers an alternative approach to dealing with large sets of training data, without resorting to copying and moving files to hard-coded directories named `train`, `validation` and `test`. Instead, you keep the files where they "naturally" reside on your system and track their locations with a Pandas DataFrame, feeding their names to the Keras generator. It scales well when dealing with millions of image files and hundreds of gigabytes of data.

RSS | Announcements