13 Dec 2023 • 6 min read How to Use Grounded EdgeSAM Learn how to use Grounded EdgeSAM to auto-label data for use in training an image segmentation model.
7 Dec 2023 • 11 min read Google's Gemini Multimodal Model: What We Know In this guide, we are going to discuss what Gemini is, for whom it is available, and what Gemini can do (according to the information available from Google). We will also look ahead to potential applications for Gemini in computer vision tasks.
6 Dec 2023 • 7 min read How to Detect Objects with YOLOv8 Learn how to detect objects with YOLOv8 using pre-trained and custom-trained object detection models.
1 Dec 2023 • 5 min read How to Moderate Video Content Learn how to use the Roboflow Video Inference API to moderate video content.
1 Dec 2023 • 5 min read How to Deploy Computer Vision Models Offline In this guide, we walk through how to deploy computer vision models (i.e. YOLOv8) offline using Roboflow Inference.
1 Dec 2023 • 4 min read How to Blur People in Images and Videos with an API In this guide, we show how to use the Roboflow Video Inference API and supervision to blur people in images and videos.
1 Dec 2023 • 7 min read Automatically Label Product SKUs with Autodistill In this guide, we show how to automatically label product SKUs (with a manual review stage) using Autodistill.
28 Nov 2023 • 5 min read How to Load Image Embeddings into Pinecone In this guide, learn how to calculate CLIP embeddings with Roboflow Inference and save the results in a Pinecone vector database.
27 Nov 2023 • 5 min read How to Load CLIP Image Embeddings into LanceDB Learn how to calculate CLIP embeddings using Roboflow Inference and save them into LanceDB.
23 Nov 2023 • 7 min read GPT-4 Vision Alternatives Explore alternatives to GPT-4 Vision with Large Multimodal Models such as Qwen-VL and CogVLM, and fine-tuned detection models.
22 Nov 2023 • 4 min read How to Search Video Frames with Roboflow Build a search engine that lets you find frames in a video with text queries using Roboflow Inference.
21 Nov 2023 • 7 min read Launch: Roboflow Video Inference API In this post, we introduce the Roboflow Video Inference API, a hosted solution for running fine-tuned and foundation models on videos.
16 Nov 2023 • 5 min read What is Object Recognition? In this guide, we discuss what object recognition is, how it works, and how to start using object recognition to solve problems.
16 Nov 2023 • 5 min read What is Retrieval Augmented Generation? Learn what Retrieval Augmented Generation (RAG) is, how it works, and how RAG can be used in computer vision applications.
16 Nov 2023 • 5 min read What is Zero-Shot Classification? Learn what zero-shot classification is, what zero-shot classification is used for, and how to use zero-shot classification to solve computer vision problems.
16 Nov 2023 • 6 min read What is an Image Embedding? Learn what image embeddings are and explore four use cases for embeddings: classifying images and video, clustering images, and image search.
16 Nov 2023 • 4 min read What is Zero-Shot Object Detection? Learn what zero-shot object detection is, applications for zero-shot object detection, and how to get started with Grounding DINO, a zero-shot model.
15 Nov 2023 • 4 min read How to Use Roboflow with GPT-4 Vision Explore ways you can use Roboflow with GPT-4 Vision to solve computer vision problems.
7 Nov 2023 • 4 min read Distilling GPT-4 for Classification with an API In this guide, learn how to distill GPT-4V to train an image classification model.
7 Nov 2023 • 4 min read DINO-GPT4-V: Use GPT-4V in a Two-Stage Detection Model In this guide, we introduce DINO-GPT4V, a model that uses Grounding DINO to detect general objects and GPT-4V to refine labels.
7 Nov 2023 • 5 min read How CLIP and GPT-4V Compare for Classification In this post, we analyze how CLIP and GPT-4V compare for classification.
7 Nov 2023 • 5 min read Experiments with GPT-4V for Object Detection See our experiments that explore GPT-4V's object detection capabilities.
1 Nov 2023 • 8 min read How to Detect Text in Images with OCR This guide shows how to use the Roboflow OCR API as part of a two-stage detection system that identifies regions of interest and reads text in them.
25 Oct 2023 • 5 min read Launch: Advanced Dataset Search Filters, Operators, and Logic Learn how to use the new advanced dataset search filters, operators, and logic available in the Roboflow dataset management tool.