Blog

Computer Vision

Latest Posts Case Studies Product Updates Logistics Manufacturing

OpenAI o3 and o4-mini: Multimodal and Vision Analysis

17 Apr 2025 • 6 min read

OpenAI o3 and o4-mini: Multimodal and Vision Analysis

Read our analysis of how OpenAI's O3 and O4-Mini models perform on a range of vision tasks.

OpenAI GPT-4.1: Multimodal and Vision Analysis

15 Apr 2025 • 5 min read

OpenAI GPT-4.1: Multimodal and Vision Analysis

Read our analysis of OpenAI's GPT-4.1 model on multimodal tasks like VQA, object detection, and more.

How to Monitor Red Zones with Computer Vision

10 Apr 2025 • 6 min read

How to Monitor Red Zones with Computer Vision

Learn how to monitor red zones for safety compliance using computer vision.

Computer Vision Augmentations: An Introduction

4 Apr 2025 • 11 min read

Computer Vision Augmentations: An Introduction

Learn about the most common augmentations used in computer vision and when each may be useful.

Understanding AUC and ROC

1 Apr 2025 • 13 min read

What Is AUC-ROC?

Understand what AUC ROC is and learn how to use it for computer vision models.

document processing AI

28 Mar 2025 • 8 min read

What Is Document Processing AI? The Ultimate Guide

Understand what is Document AI and learn how to use Roboflow Workflows to perform Document AI related tasks.

Launch: Train and Deploy RF-DETR Models with Roboflow

28 Mar 2025 • 7 min read

Launch: Train and Deploy RF-DETR Models with Roboflow

Learn how to train RF-DETR models with Roboflow Train and deploy RF-DETR models with Roboflow Workflows and Inference.

How to Set Up a Basler on a Mac

28 Mar 2025 • 5 min read

How to Set Up a Basler on a Mac

Learn how to set up a Basler Camera for use with a Mac, ideal for testing.

How to Train RF-DETR on a Custom Dataset

20 Mar 2025 • 7 min read

How to Train RF-DETR on a Custom Dataset

Learn how to train an RF-DETR model on a custom dataset.

RF-DETR: A SOTA Real-Time Object Detection Model

20 Mar 2025 • 7 min read

RF-DETR: A SOTA Real-Time Object Detection Model

Today we are releasing RF-DETR, a state-of-the-art real-time object detection model. Learn more about how RF-DETR works and how to use the model.

under 30 mins to learn AI

14 Mar 2025 • 4 min read

How I Taught My Dad Computer Vision with Roboflow in Under 30 Minutes!

Discover how I helped my dad build a custom computer vision model in under 30 minutes using Roboflow’s no-code AI tools.

How Deep Learning Solves Machine Vision’s Biggest Frustrations

14 Mar 2025 • 8 min read

How Deep Learning Solves Machine Vision’s Biggest Frustrations

Learn how deep learning can be used to solve difficult vision problems at which traditional techniques struggle.

Launch: Fine-Tune and Deploy Qwen2.5-VL Models with Roboflow

13 Mar 2025 • 7 min read

Launch: Fine-Tune and Deploy Qwen2.5-VL Models with Roboflow

Learn how to fine-tune and deploy Qwen2.5-VL models with Roboflow.

Launch: Roboflow Batch Processing

13 Mar 2025 • 7 min read

Launch: Roboflow Batch Processing

Learn how to use Batch Processing to run multi-step vision AI workflows on a folder of images or videos.

How to Scan Pallets using Computer Vision

13 Mar 2025 • 8 min read

How to Scan Pallets using Computer Vision

Learn how to build an automated pallet scanning system with computer vision technology.

Foundational Few-Shot Object Detection Challenge [CVPR 2025]

13 Mar 2025 • 1 min read

Foundational Few-Shot Object Detection Challenge [CVPR 2025]

Roboflow & Carnegie Mellon University are releasing the second iteration of the Foundational Few-Shot Object Detection Challenge at CVPR 2025.

Computer vision applications

11 Mar 2025 • 11 min read

Computer Vision Applications

Computer vision applications are reshaping industries across the globe, enhancing efficiency, safety, and quality in ways that were previously unimaginable.

SmolVLM2: Multimodal and Vision Analysis

11 Mar 2025 • 5 min read

SmolVLM2: Multimodal and Vision Analysis

Read our analysis of how SmolVLM2 performs on a range of multimodal vision tasks.

Moondream 2: Multimodal and Vision Analysis

11 Mar 2025 • 6 min read

Moondream 2: Multimodal and Vision Analysis

Read our analysis of how the multimodal Moondream 2 model performs on a range of vision tasks.

Computer Vision Trends Report 2025

7 Mar 2025 • 4 min read

Computer Vision Trends Report 2025

In our new report "Trends in Vision AI 2025," we delve into the key insights shaping the exciting field of computer vision, and explore how enterprises are successfully deploying vision AI to solve real-world challenges.

Top Multimodal Models: A Complete Guide

7 Mar 2025 • 6 min read

Top Multimodal Models: A Complete Guide

Read our guide to the best multimodal vision models for use in tasks like OCR, object detection, and image classification.

Computer vision platform

4 Mar 2025 • 11 min read

What Is A Computer Vision Platform?

Computer vision platforms play a crucial role in simplifying this process by offering tools and services that streamline the development pipeline.

Multimodal Benchmark Datasets

4 Mar 2025 • 4 min read

Multimodal Benchmark Datasets

When new multimodal models come out, they need to be tested on reliable benchmarks to see how well they perform across different tasks. Today we'll share some of the best multimodal benchmark datasets you can use to evaluate new models.

Cohere Aya Vision: Multimodal and Vision Analysis

4 Mar 2025 • 5 min read

Cohere Aya Vision: Multimodal and Vision Analysis

Learn how Cohere Aya Vision performs on a series of qualitative multimodal task evaluations.

Stay Connected

Get the Latest in Computer Vision First