Image Augmentations for Aerial Datasets
When creating computer vision models, data augmentation can improve model performance with an existing image dataset. Image augmentation increases the size and variability of a dataset, thereby improving model generalizability.
Image augmentations should be selected based on the domain problem one is tackling. Aerial data, specifically, often shares a number of characteristics that enable similar selection of image augmentation to be performed. Aerial data is any data captured from above looking down. Commonly, this includes drone and satellite imagery.
The below recommendations may not be applicable in every situation but are likely relevant in many aerial computer vision problems like when using drones for computer vision or satellite images for computer vision.
Brightness
Brightness image augmentation involves randomly increasing or decreasing an image – like applying a uniform +/- percentage change to all pixels in an image. Brightness as an image augmentation step for aerial data is often reasonable as aerial data may be captured in varying degrees of sunlight.
Flip
Flip image augmentation means mirroring an image about its y or x axis. (It is distinct from rotation.) For aerial data, objects below can often be mirrored about their y- or x-axis, making it a good choice.
90-Degree Rotations
Rotate image augmentation means moving an image about its center some number of degrees. A 90-degree rotation is a special type of rotation that only applies rotation in 90-degree multiples, thereby keeping bounding box size otherwise identical. (Non-90 degree image rotation augmentation can alter bounding box size.)
Rotating an image clockwise or counter-clockwise by 90-degrees is often a good choice for aerial datasets as the objects below could be viewed from any orientation above – there is not a fixed perspective from which they may be viewed.
Crop
Crop image augmentation involves randomly creating a subset of the overall source image and then resizing that subset to be the same size as other images in the dataset. In effect, random crop almost "zooms in" on random portions of the image. (We've also written more about how and why to use random crop image augmentation.)
For aerial data in particular, this can be massively useful as objects below will vary in size and, potentially, closeness to the camera perspective.
Color Changes (Hue)
Hue image augmentation randomly alters the colors of an image, thereby making a model more resilient to specific colors of objects. For aerial data, this can be useful for seasonal changes that alter the colors of objects.
The above list are fairly 'safe' bets to select for image augmentation for any given aerial problem. Depending on the problem you are tackling, it may also be relevant to include blur, noise, mosaic, or other image augmentations. As always, happy building!