makeClasses is a simple script that loads face, torso, body and background samples from INRIA dataset and saves them in a ImageNet compatible format. This means, object are fairly centred and the sample size is 256 × 256 px².
In this first release, the dataset has to be downloaded manually from here and extracted into ~/Work/Datasets.
In the code there is my user name (
~=/Users/atcold). I will probably update it in a following commit.
What this script does is simply extract every face, torso and body from the positive and background from the negative folder, cropping them square and saving them into a Extracted-data folder, under a sub-directory representing their label.
Face: are sized 1/3 of the width of the bounding box;Torso: upper biggest square croppable from the bounding box;Body: square with side equal to the hight of the bounding box. If the limits of the image are exceeded (the image is too narrow), the first/last column of pixel is replicated to fill the gap;Bckg: squares of 128 px are cropped randomly from the negative samples.
And this is how they look like
After changing the folder pointers in the code (which I will make soon automatic), in the src directory, run
th makeClasses.lua
To have an idea of the information provided with the dataset, I wrote checkBox which gives a general idea of what we can do with the data. To run it, go to the src directory and type
qlua checkBox.lua




