Latest Posts

Google's Gemini Multimodal Model: What We Know

In this guide, we are going to discuss what Gemini is, for whom it is available, and what Gemini can do (according to the information available from Google). We will also look ahead to potential applications for Gemini in computer vision tasks.

Comparing Custom Models to Google Cloud Vision API

In this guide, we go over how to evaluate object detection models on Roboflow Universe versus Google Cloud Vision.

How to Detect Objects with YOLOv8

Learn how to detect objects with YOLOv8 using pre-trained and custom-trained object detection models.

Use Cases for Computer Vision in Analytics

Learn how to use computer vision in your data analytics pipelines.

How to Moderate Video Content

Learn how to use the Roboflow Video Inference API to moderate video content.

How to Deploy Computer Vision Models Offline

In this guide, we walk through how to deploy computer vision models (i.e. YOLOv8) offline using Roboflow Inference.

How to Blur People in Images and Videos with an API

In this guide, we show how to use the Roboflow Video Inference API and supervision to blur people in images and videos.

Automatically Label Product SKUs with Autodistill

In this guide, we show how to automatically label product SKUs (with a manual review stage) using Autodistill.

Multimodal Maestro: Advanced LMM Prompting

Learn how to expand the range of LMMs' capabilities using Multimodal Maestro

Manufacturing to Computer Vision: Three Applications From Field Experience

In this article, we explore three applications of computer vision in the manufacturing industry, written by an expert with field experience.

How to Load Image Embeddings into Pinecone

In this guide, learn how to calculate CLIP embeddings with Roboflow Inference and save the results in a Pinecone vector database.

Launch: Synthetic Image Generation with DALL-E and GPT-4 Vision

In this guide, learn how to use Roboflow to generate synthetic data with DALL-E and GPT-4 Vision for use in training vision models.

Roboflow Video Inference with Custom Annotators

Performing real-time video inference is crucial for many applications like autonomous vehicles, security systems, logistics, and more. However, setting up a robust video inference pipeline can be time consuming. You

How to Load CLIP Image Embeddings into LanceDB

Learn how to calculate CLIP embeddings using Roboflow Inference and save them into LanceDB.

GPT-4 Vision Alternatives

Explore alternatives to GPT-4 Vision with Large Multimodal Models such as Qwen-VL and CogVLM, and fine-tuned detection models.

How to Search Video Frames with Roboflow

Build a search engine that lets you find frames in a video with text queries using Roboflow Inference.

Launch: Roboflow Video Inference API

In this post, we introduce the Roboflow Video Inference API, a hosted solution for running fine-tuned and foundation models on videos.

What is Optical Character Recognition (OCR)?

Learn what Optical Character Recognition is, what problems can be solved with OCR, and explore the approaches used by OCR algorithms to identify characters.

What is Object Recognition?

In this guide, we discuss what object recognition is, how it works, and how to start using object recognition to solve problems.

What is Retrieval Augmented Generation?

Learn what Retrieval Augmented Generation (RAG) is, how it works, and how RAG can be used in computer vision applications.

What is Zero-Shot Classification?

Learn what zero-shot classification is, what zero-shot classification is used for, and how to use zero-shot classification to solve computer vision problems.

What is an Image Embedding?

Learn what image embeddings are and explore four use cases for embeddings: classifying images and video, clustering images, and image search.

What is Zero-Shot Object Detection?

Learn what zero-shot object detection is, applications for zero-shot object detection, and how to get started with Grounding DINO, a zero-shot model.

How to Use Roboflow with GPT-4 Vision

Explore ways you can use Roboflow with GPT-4 Vision to solve computer vision problems.

Distilling GPT-4 for Classification with an API

In this guide, learn how to distill GPT-4V to train an image classification model.