16 May 2025 • 20 min read Object Detection Metrics Understand what key object detection metrics are. Learn why they're used and how to use them.
9 May 2025 • 9 min read Using Computer Vision to Develop a Robotic Arm Poker Dealer Learn how the vision system of an acting poker dealer arm was built by the University of Manchester Robotics Society.
19 Feb 2025 • 5 min read How to Train a YOLOv12 Object Detection Model on a Custom Dataset Learn how to train a YOLOv12 object detection model on a custom dataset.
12 Dec 2024 • 5 min read Count Objects on a Conveyor Belt Using Computer Vision SUMMARY This tutorial builds an end-to-end system for counting bolts and nuts on a moving conveyor belt, covering every stage from data collection to local deployment. Video frames are annotated using Roboflow's auto-labeling with Grounding DINO, a custom detection model is trained in the cloud,
10 Dec 2024 • 13 min read How to Fine-tune PaliGemma 2 Learn how to fine-tune PaliGemma 2 to extract data from an image in JSON format.
14 Oct 2024 • 3 min read How to Remove Background of an Image with Computer Vision Learn how to build your own background removal application using computer vision.
8 Oct 2024 • 7 min read How to Train a YOLOv11 Object Detection Model on a Custom Dataset Learn how to train a YOLOv11 object detection model with a custom dataset.
24 Sep 2024 • 3 min read Motion Detection with Computer Vision SUMMARY This tutorial shows how to filter object detection to only moving objects by comparing consecutive video frames with a pixel-differencing function. The approach uses inferenceJS to run a Roboflow Universe model, then applies a bounding box drawn from detected motion to suppress detections of stationary objects. Configurable thresholds
3 Sep 2024 • 4 min read How to Draw a Bounding Box with Python In this post, we discuss how to use the Roboflow supervision library to draw and fill a bounding box in Python.
6 Aug 2024 • 7 min read Ball Tracking in Sports with Computer Vision Ball tracking is crucial for AI systems to analyze sports effectively, but it's challenging due to factors like the ball's small size, high velocity, complex backgrounds, similar-looking objects, and varying lighting. This tutorial will teach you how to overcome these challenges.
30 Jul 2024 • 8 min read What is Segment Anything 2 (SAM 2)? Learn about Meta AI's new Segment Anything 2 model and how you can use it for image and grounded image segmentation.
19 Jul 2024 • 8 min read People Counting Using Computer Vision SUMMARY Computer vision-based people counting tracks entry and exit flows in real time, giving organizations accurate occupancy data that manual headcounts cannot match at scale. This tutorial uses a people detection model from Roboflow to build a working counter that draws bounding boxes, overlays a running tally on each
16 Jul 2024 • 8 min read What is the Open Images Dataset? A Deep Dive. SUMMARY Open Images V7, released by Google in 2022, contains over nine million annotated images spanning nearly 20,000 categories and supports six annotation types: image-level labels, bounding boxes, segmentation masks, visual relationships, localized narratives, and point-level labels. That breadth makes it one of the few public datasets
11 Jul 2024 • 11 min read How to Train RT-DETR on a Custom Dataset with Transformers SUMMARY RT-DETR (Real-Time DEtection TRansformer), developed by Peking University and Baidu, is a transformer-based object detection model that targets competitive speed and accuracy compared to YOLO-family models. This tutorial covers fine-tuning RT-DETR on a custom dataset sourced from Roboflow Universe, using the HuggingFace Transformers
25 Jun 2024 • 12 min read How to Fine-tune Florence-2 for Object Detection Tasks This tutorial will show you how to fine-tune Florence-2 on object detection datasets to improve model performance for your specific use case.
20 Jun 2024 • 5 min read Florence-2: Vision-language Model Florence-2 is a lightweight vision-language model open-sourced by Microsoft under the MIT license.
3 May 2024 • 5 min read Realtime Video Stream Analysis with Computer Vision In this guide, we use computer vision to process multiple live video streams to perform analysis and gain insights.
12 Apr 2024 • 5 min read What is Handwriting Recognition? In this guide, we go over an overview of handwriting recognition, including the use cases, challenges, and ways of using of handwriting recognition, as well as a tutorial.
3 Apr 2024 • 10 min read What is OpenPose? A Guide for Beginners. In this guide, we discuss what OpenPose is, what you can do with the model, and how you can use OpenPose.
3 Apr 2024 • 11 min read Counting Rebar with Computer Vision Learn how to build an application that counts rebar using computer vision.
26 Mar 2024 • 5 min read How to Detect Objects with YOLOv5 SUMMARY YOLOv5 is an object detection model with an active deployment base, and Roboflow Inference provides a straightforward path to running it in production. This guide covers uploading a trained YOLOv5 model to Roboflow, then deploying it with the Inference SDK to run predictions on individual images and on live
26 Mar 2024 • 5 min read What is YOLOv3? An Introductory Guide. Learn what YOLOv3 is and the notable architectural eatures of this model.
26 Mar 2024 • 9 min read Build a Gesture-Based Light Controller with Computer Vision Learn how to build a gesture-based light controler using computer vision
21 Mar 2024 • 8 min read What is OpenCV? A Guide for Beginners. Learn what OpenCV is, what you can do with OpenCV, how OpenCV performs on various tasks when run on CPU vs. GPU, and more.