Blog

Dataset Management

Latest Posts Case Studies Product Updates Logistics Manufacturing

data annotation guide

22 May 2025 • 11 min read

Data Annotation for High-Performing Computer Vision Models

Learn all about data annotation, from what it is and how it works, to common challenges, best practices, and the tools that can streamline the process.

5 Best Image Annotation Tools in 2025

9 Jan 2025 • 6 min read

5 Best Image Annotation Tools in 2025

Explore the top five image annotation tools you can use to label data for your next computer vision project.

Handling unbalanced classes

3 Jan 2025 • 7 min read

5 Strategies for Handling Unbalanced Classes in Machine Learning

Dealing with unbalance classes is a common challenge that can significantly impact the performance of your models. When one class dominates the dataset algorithms become biased, leading to inaccurate predictions. Suppose you're trying to teach an alien – like one of the crew mates from the wildly popular game

This is a template for making a rock paper scissors game created using artificial intelligence

2 Jan 2025 • 6 min read

Rock, Paper, Scissors with AI: How to Make Multiplayer Games

This is a template for making multiplayer games - such as rock paper scissors- that involve your hands and body using AI or computer vision. You can even submit new games to the repo and I will host them at https://handland.lol Included Multiplayer Games: The repo currently comes

Launch: Find Similar Images to Expand Vision Datasets

16 Dec 2024 • 3 min read

Launch: Find Similar Images to Expand Vision Datasets

Learn how to find similar images to use in your new computer vision datasets.

Count Objects on a Conveyor Belt Using Computer Vision

12 Dec 2024 • 4 min read

Count Objects on a Conveyor Belt Using Computer Vision

In many manufacturing environments, conveyor belts are used for transporting objects, especially small components such as bolts, nuts, or other fasteners through various stages of production. Being able to reliably count these objects in real-time improves inventory management, quality assurance, and overall efficiency. Introduction In this guide, we’ll walk

How to Fine-tune PaliGemma 2

10 Dec 2024 • 13 min read

How to Fine-tune PaliGemma 2

Learn how to fine-tune PaliGemma 2 to extract data from an image in JSON format.

What is Active Learning? The Ultimate Guide.

1 Sep 2024 • 11 min read

What is Active Learning? The Ultimate Guide.

In this guide, we discuss what active learning is, types of active learning, and walk through an example of active learning in practice.

How to Create a Workout Pose Correction Tool

14 Aug 2024 • 6 min read

How to Create a Workout Pose Correction Tool

0:00 /0:21 1× Introduction Computer vision is a useful tool when it comes to understanding and quantifying real-world activity happening in real-time. Tracking human movements with pose estimation is a common way to evaluate athletics or general body movement to help gain insight into proper form and technique.

How to Import Hugging Face Datasets to Roboflow

2 Aug 2024 • 2 min read

How to Import Hugging Face Datasets to Roboflow

Learn how to import a Hugging Face dataset into Roboflow for labeling, training, and deployment.

What is the Open Images Dataset? A Deep Dive.

16 Jul 2024 • 8 min read

What is the Open Images Dataset? A Deep Dive.

The Open Images Dataset was released by Google in 2016, and it is one of the largest and most diverse collections of labeled images. Since then, Google has regularly updated and improved it. The latest version of the dataset, Open Images V7, was introduced in 2022. Globally, researchers and developers

How to Train RT-DETR on a Custom Dataset with Transformers

11 Jul 2024 • 11 min read

How to Train RT-DETR on a Custom Dataset with Transformers

💡Looking for RF-DETR, the state-of-the-art real-time object detection model developed by Roboflow ? Check out the RF-DETR training guide. RF-DETR runs in real time, is the first model to achieve 60+ on COCO, and is state-of-the-art on the RF100-VL benchmark. RT-DETR, short for "Real-Time DEtection TRansformer", is a computer

databricks

2 Apr 2024 • 4 min read

Import Images from Databricks to Roboflow

Upload image data from Databricks SQL warehouse into Roboflow for training custom computer vision models.

How to Use YOLO-World With Active Learning to Train a Custom Model

29 Feb 2024 • 5 min read

How to Use YOLO-World With Active Learning to Train a Custom Model

In this guide, we demonstrate an approach where we can start using the benefits of YOLO-World now, while simultaneously collecting data to train a faster custom model later.

gaudi

20 Feb 2024 • 8 min read

Build Enterprise Datasets with CLIP for Multimodal Model Training Using Intel Gaudi2 HPUs

In this guide, learn how to use CLIP on Intel Gaudi2 HPUs to deduplicate datasets before training large multimodal vision models.

How to Use Multiple Models to Label Datasets with Autodistill

16 Feb 2024 • 5 min read

How to Use Multiple Models to Label Datasets with Autodistill

In this guide, we cover the benefits of and how to combine multiple models in order to automatically label a dataset of images.

Label Verification AI

6 Feb 2024 • 6 min read

Label Verification AI: How to Verify Label Placement on Packages

Learn how to build a system to verify label placement on packages using computer vision.

platform

26 Jan 2024 • 6 min read

How to Analyze a Folder of Videos from Google Cloud Platform

In this guide, we walk through how to analyze videos stored in Google Cloud Storage with computer vision models.

How to Use the Segment Anything Model (SAM)

22 Jan 2024 • 6 min read

How to Use the Segment Anything Model (SAM)

Segment Anything (SAM) is a computer vision model developed by Meta AI. In this guide, you will learn how to use SAM on your own data.

Launch: Model Prompting for Automated Labeling with Autodistill

19 Jan 2024 • 5 min read

Launch: Model Prompting for Automated Labeling with Autodistill

In this guide, learn how to use the Roboflow automated image labeling feature to label images in your computer vision datasets.

aws

12 Jan 2024 • 6 min read

How to Analyze a Folder of Videos from AWS S3

In this guide, learn how to analyze a folder of images with machine learning models using data stored in an AWS S3 bucket.

label

11 Jan 2024 • 3 min read

How to Label Outdoor Surveillance Data for Computer Vision Models

In this guide, learn how to effectively label outdoor surveillance data for use in training computer vision models.

How to Label Floor Plan Data for Computer Vision Models

11 Jan 2024 • 3 min read

How to Label Floor Plan Data for Computer Vision Models

In this guide, learn how to effectively label floor plan data for use in training computer vision models.

sports data annotation

11 Jan 2024 • 4 min read

Sports Data Annotation: How to Label Sports Data for Computer Vision Models

In this guide, we discuss tips on how to effectively label sports data for use in training computer vision models.

Stay Connected

Get the Latest in Computer Vision First