Model Training

What is Zero-Shot Classification?

Learn what zero-shot classification is, what zero-shot classification is used for, and how to use zero-shot classification to solve computer vision problems.

What is an Image Embedding?

Learn what image embeddings are and explore four use cases for embeddings: classifying images and video, clustering images, and image search.

DINO-GPT4-V: Use GPT-4V in a Two-Stage Detection Model

In this guide, we introduce DINO-GPT4V, a model that uses Grounding DINO to detect general objects and GPT-4V to refine labels.

What is Keypoint Detection?

In this guide, we discuss what keypoint detection is, common architectures used for keypoint detection, and the high-level steps to build a keypoint detection model.

Launch: Roboflow Logistics Pre-trained Object Detection Model

Starting a machine learning model from zero is computationally expensive and time-consuming. Pre-trained models solve this by offering a jump-start: they come with learned features from extensive training on large

Comparing Computer Vision Models On Custom Data

In this guide, show how to compare how two person detection models on Roboflow Universe perform using a benchmark dataset and supervision.

How to Use FastViT

In this guide, we talk about what FastViT is and show how to experiment with FastViT using the autodistill-fastvit wrapper.

Train a Computer Vision Model with AWS Rekognition Custom Labels

In this guide, you will learn how to train a computer vision model using AWS Rekognition Custom Labels.

Label a Computer Vision Dataset with Azure Custom Vision

In this guide, you will learn how to label a dataset in Azure Custom Vision for use in training a computer vision model.

Train a Computer Vision Model with Azure Custom Vision

In this guide, you will learn how to train a computer vision model using Azure Custom Vision.

Train a Computer Vision Model with Vertex AI

In this guide, you will learn how to train a computer vision model with Vertex AI. You will also learn how to deploy an endpoint to test your model.

Scaling up ViT Image Classification on Gaudi2 HPUs

Training large image transformers comes with heavy compute requirements - enter the Gaudi2 from Habana Labs, an Intel company.

How to Use Kaggle for Computer Vision

In this guide, we show how to use Kaggle Notebooks for computer vision tasks.

What is Mask2Former? The Ultimate Guide.

In this guide, we discuss what Mask2Former is, how the model works, and how Mask2Former performs on various computer vision tasks.

Ultimate Guide to Converting Bounding Boxes, Masks and Polygons

In this guide, we show how to convert bounding boxes (xyxy), masks, and polygons.

What is EfficientNet? The Ultimate Guide.

In this guide, we discuss what EfficientNet is, how it works, and how the compound scaling method is used in the model.

How to Train RTMDet on a Custom Dataset

Learn how to train a RTMDet computer vision model on a custom dataset.

Using Stable Diffusion and SAM to Modify Image Contents Zero Shot

Introduction Recent breakthroughs in large language models (LLMs) and foundation computer vision models have unlocked new interfaces and methods for editing images or videos. You may have heard of inpainting,

How to Use Computer Vision for Environment Monitoring

This article was contributed to the Roboflow blog by Abirami Vina. Measuring changes to our environment is an important part of understanding progress made toward a more sustainable world. Historically,

Detect and Segment Oil Spills Using Computer Vision

The article below was contributed by Timothy Malche, an assistant professor in the Department of Computer Applications at Manipal University Jaipur. Introduction An oil spill in the sea is a

Improve Accuracy: Polygon Annotations for Object Detection

In this blog post, we will explore how you can improve your object detection model performance by converting your bounding box annotations to polygon annotations. We will also discuss the

Pose Estimation Algorithms: History and Evolution

This article was contributed to the Roboflow blog by Abirami Vina. What is Pose Estimation? Pose estimation, also called keypoint detection, is a computer vision technique that pinpoints the key

How to Build a Semantic Image Search Engine with Supabase and OpenAI CLIP

Historically, building a robust search engine for images was difficult. One could search by features such as file name and image metadata, and use any context around an image (i.

Detecting Objects with DETIC vs Custom Training

Learn how to evaluate large foundation models and how custom model training can improve performance.

CVPR 2023 Highlights

Three members of the Roboflow team attended CVPR this year. Read our highlights from the conference and what trends we noticed.