Roboflow Blog

NeurIPS 2023 Papers Highlights

Introduction NeurIPS 2023, the conference and workshop on Neural Information Processing Systems, took place December 10th through 16th. The conference showcased the latest in machine learning and artificial intelligence. This

How to Deploy CogVLM on AWS

Guide on deploying a CogVLM Inference Server with 4-bit quantization on Amazon Web Services, covering setup of EC2 instances, configuring hardware and software requirements, and starting the inference server with Docker.

CogVLM Use Cases in Industry

Learn how you can use CogVLM, a multimodal language model with vision capabilities, for industrial use cases.

Detecting Complex and Amorphous Features of Marine Sponges

In this post, Andy Portalatin describes work done for his master's thesis toward detecting complex and amorphous features of marine sponges.

How to Detect Brand Logos in Videos

Learn how to detect brand logos in Roboflow with a custom-trained object detection model that you can run on videos.

Using Computer Vision to Understand Food and Cuisines

Learn how to use computer vision to identify the cuisine associated with different foods.

How to Deploy CogVLM

In this guide, learn how to deploy the CogVLM multimodal model on your own infrastructure with Roboflow Inference.

First Impressions with Google’s Gemini

In this guide, we evaluate Google's Gemini LMM against several computer vision tasks, from OCR to VQA to zero-shot object detection.

How to Use Grounded EdgeSAM

Learn how to use Grounded EdgeSAM to auto-label data for use in training an image segmentation model.

Monitoring Plant Growth using Computer Vision

In this blog post we will show how computer vision can be used to monitor plant growth. We will focus on the height measurement technique.

What is Few-Shot Learning?

In this blog post, we discuss what few-shot learning is, architectural approaches for implementing few-shot learning, and specific implementations of few-shot learning techniques.

Launch: Advanced Class Management with Roboflow

Learn how to use Roboflow's advanced ontology management features to manage classes in your project.

Google's Gemini Multimodal Model: What We Know

In this guide, we are going to discuss what Gemini is, for whom it is available, and what Gemini can do (according to the information available from Google). We will also look ahead to potential applications for Gemini in computer vision tasks.

Comparing Custom Models to Google Cloud Vision API

In this guide, we go over how to evaluate object detection models on Roboflow Universe versus Google Cloud Vision.

How to Detect Objects with YOLOv8

Learn how to detect objects with YOLOv8 using pre-trained and custom-trained object detection models.

Use Cases for Computer Vision in Analytics

Learn how to use computer vision in your data analytics pipelines.

How to Moderate Video Content

Learn how to use the Roboflow Video Inference API to moderate video content.

How to Deploy Computer Vision Models Offline

In this guide, we walk through how to deploy computer vision models (i.e. YOLOv8) offline using Roboflow Inference.

How to Blur People in Images and Videos with an API

In this guide, we show how to use the Roboflow Video Inference API and supervision to blur people in images and videos.

Automatically Label Product SKUs with Autodistill

In this guide, we show how to automatically label product SKUs (with a manual review stage) using Autodistill.

Multimodal Maestro: Advanced LMM Prompting

Learn how to expand the range of LMMs' capabilities using Multimodal Maestro

Manufacturing to Computer Vision: Three Applications From Field Experience

In this article, we explore three applications of computer vision in the manufacturing industry, written by an expert with field experience.

How to Load Image Embeddings into Pinecone

In this guide, learn how to calculate CLIP embeddings with Roboflow Inference and save the results in a Pinecone vector database.

Launch: Synthetic Image Generation with DALL-E and GPT-4 Vision

In this guide, learn how to use Roboflow to generate synthetic data with DALL-E and GPT-4 Vision for use in training vision models.

Roboflow Video Inference with Custom Annotators

Performing real-time video inference is crucial for many applications like autonomous vehicles, security systems, logistics, and more. However, setting up a robust video inference pipeline can be time consuming. You