Introducing Bounding Box Level Augmentations
Published Mar 18, 2020 • 3 min read

Having training data that matches the diversity of your task is paramount to the success of your models.

At Roboflow, we’re committed to providing you with state-of-the-art techniques that can improve your deep learning model’s performance -- without needing to collect any more data or even re-label images. We’re focused on assuring the data and annotations you’ve collected are of the highest quality and in the right format so that you can focus on the important parts of development, like fine tuning your computer vision models or determining how a model should be used for your business context.

Thus, we’re introducing a new augmentation technique unavailable anywhere else: bounding box level augmentation.

What is Bounding Box Level Augmentation

Image augmentation is the act of increasing your dataset size through manipulating existing training data. Fundamentally, it’s creating real-looking fake data. Image augmentation helps a model generalize better to a wide array of contexts. (More about that in our other blog posts.)

We’ve seen users leverage image augmentation to create production-ready models for industrial tasks with just eight (8!) original source images.

Bounding box augmentation takes this idea and goes one step further: what if we altered the content of an image, but only the content within a given bounding box? For example, vary the brightness or darkness of an object relative to its background. Or, perhaps blur an object relative to its background for tasks that are often capturing rapidly moving objects.

Bounding box level augmentation generating new training data by only altering the content of a source image’s bounding boxes. In doing so, developers have greater control over creating training data that is more suitable to their problem’s conditions.

Chess board annotated for object detection.
An original image.
Chess board annotated for object detection with the interiors of the bounding boxes augmented.
Bounding box only augmentation that has randomly horizontally flipped and altered the brightness of objects. Note that each individual bounding box receives its own settings.

The Genesis of Bounding Box Level Augmentation

We can’t take credit for being the first to consider bounding box only augmentation. A 2019 paper from Google researchers introduces the idea of using bounding box only augmentation to create optimal data for their models. In this paper, researchers showed bounding box only modifications generated systemic improvements, especially for models that were small datasets.

Visualization of bounding box augmentations.
Via the researchers' paper.

Using Bounding Box Level Augmentation

We envision many use cases for more specialized augmentation. Consider modifying colors of only objects in a OCR image, introducing blur for only the moving objects like pets or cars, rotating objects like board game pieces, and flipping the orientation of objects to produce a mirrored effect like those present in most camera situations.

Getting started is easy. Simply toggle on any options of interest, and generate a new dataset.

Roboflow Screenshot: selecting options for bounding box augmentations.
Here's the settings used for the chess images above.

Our advanced augmentations (like Bounding box level augmentation) are reserved for Roboflow Pro accounts. Get in touch if you’d like access. Free accounts can access all the standard augmentations though, so give it a try!

Cite this Post

Use the following entry to cite this post in your research:

Joseph Nelson. (Mar 18, 2020). Introducing Bounding Box Level Augmentations. Roboflow Blog: https://blog.roboflow.com/introducing-bounding-box-level-augmentations/

Discuss this Post

If you have any questions about this blog post, start a discussion on the Roboflow Forum.

Stay Connected
Get the Latest in Computer Vision First
Unsubscribe at any time. Review our Privacy Policy.

Written by

Joseph Nelson
Roboflow cofounder and CEO. On a mission to transform every industry by democratizing computer vision. Previously founded and sold a machine learning company.