Education

What is the JAX Deep Learning Framework?

You've probably heard of TensorFlow and PyTorch, and maybe you've even heard of MXNet - but there is a new kid on the block of machine learning frameworks - Google's JAX.

For the People, By the People

Computer vision, on the whole, is an ambitious undertaking. We are developing technology that can see the world as we see it - to recognize simple objects like trees and

How to Train MobileNetV2 On a Custom Dataset

In this post, we will walk through how you can train MobileNetV2 to recognize image classification data for your custom use case.

Building vs. Buying a Computer Vision Platform

“You could do what Roboflow does yourself but…why would you?”-Jack Clark, Co-Founder of Anthropic, former Policy Directory at OpenAI, It’s no secret that building a computer vision

How to Train with Microsoft Azure Custom Vision and Roboflow

Roboflow is a tool for building robust machine learning operations pipelines for computer vision: from collecting and organizing images, annotating, training, deploying, and creating active learning pipelines to rapidly create

Your Comprehensive Guide to the YOLO Family of Models

YOLO (You Only Look Once) is a family of computer vision models that has gained significant fanfare since Joseph Redmon, Santosh Divvala,  Ross Girshick, and Ali Farhadi introduced the novel

What Does "End to End" Really Mean?

Developing, deploying and optimizing computer vision models used to be a cumbersome, painful process. With Roboflow, we sought to democratize this technology, which (first and foremost) meant knocking down the

Prompt Engineering: The Magic Words to using OpenAI's CLIP

Featuring rock, paper, scissors. OpenAI's CLIP model (Contrastive Language-Image Pre-Training) is a powerful zero-shot classifier that leverages knowledge of the English language to classify images without having to be trained

License Plate Detection and OCR using Roboflow Inference API

In this post, we’ll walk you through creating a license plate detection and OCR model using Roboflow that you can programmatically use for your own projects.

Partnering with Luxonis and OpenCV for Seamless Deployment to OpenCV AI Kit

Deploying computer vision models to the edge is critical to unlocking new use cases like in places with limited internet connectivity or where minimal latency is essential. That might be

Image Augmentations for Aerial Datasets

When creating computer vision models, data augmentation can improve model performance with an existing image dataset. Image augmentation increases the size and variability of a dataset, thereby improving model generalizability.

How important is subject similarity for transfer learning?

Using transfer learning to initialize your computer vision model from pre-trained weights rather than starting from scratch (initializing randomly) has been shown to increase performance and decrease training time. It

Our Favorite Computer Vision Courses

A question we frequently receive at Roboflow is, "What is the best class for learning computer vision?" Like most questions, the answer does depend on your background and what you're

How to Use Roboflow with IBM Visual Recognition (IBM Watson vs Roboflow)

IBM recently announced they are shutting down IBM Visual Inspection, their product for creating custom computer vision models for classification and object detection. No new instances can be created and

Zero-Shot Content Moderation with OpenAI's New CLIP Model

When creating a platform on which people can create and share content, there’s often a question of content moderation. Content moderation can mean a whole host of different things,

Andrew Ng: "Deploying to production means you're halfway there."

Andrew Ng, the co-founder of Google Brain and Coursera and former Chief Scientist at Baidu, spoke at this week's Scale Transform conference on the transition from "big data" to "good

What is Embedded Machine Learning?

Machine learning – the software discipline of mapping inputs to outputs without explicitly programmed relationships – requires substantial computational resources. Traditionally, this limits where machine learning models can run to very powerful

How We Built Paint.wtf, an AI Game with 150,000+ Submissions that Judges Your Art

Paint.wtf is an online game that uses AI to score user-submitted digital drawings to zany prompts like, "Draw a giraffe in the arctic" or "Draw a bumblebee loves capitalism.

ELI5 CLIP: A Beginner's Guide to the CLIP Model

You may have heard about OpenAI's CLIP model. If you looked it up, you read that CLIP stands for "Contrastive Language-Image Pre-training." That doesn't immediately make much sense to me,

Roboflow and OpenCV Partner to Advance Computer Vision Capabilities for All Developers

OpenCV has been a key part of advancing computer vision capabilities for developers for over 20 years. The Open Source Computer Vision Library on Github has over 50,000 stars,

How to Use Roboflow and Streamlit to Visualize Object Detection Output

Building an app for blood cell count detection.The app in action.Most technology is designed to make your life, or your work, easier. If your work involves building computer

How to Use Google Earth Engine and Python API to Export Images to Roboflow

This is a guest post written by Ethan Arsht and Raluca Cîrju. Google Earth Engine is a powerful tool for analyzing and acquiring geographic data. Machine learning experts use Google

Liquid Neural Networks in Computer Vision

Excitement is building in the artificial intelligence community around MIT's recent release of liquid neural networks. The breakthroughs that Hasani and team have made are incredible. In this post, we will discuss the new liquid neural networks and what they might mean for the vision field.

How to Train and Deploy a License Plate Detector to the Luxonis OAK

In this post, we will leverage Roboflow and the Luxonis OAK to train and deploy a custom license plate model to your OAK device.

The Ultimate Guide to Object Detection

Object detection is a computer vision technology that localizes and identifies objects in an image. Due to object detection's versatility, object detection has emerged in the last few years as

Computer Vision Use Cases in Healthcare and Medicine

Computer vision technology continues to expand its use cases in healthcare and medicine. In this post, we will touch on some exciting example use cases for vision in healthcare and medicine and provide some resources on getting started applying vision to these problems.

A Primer on Transfer Learning

Teaching Friends to Skateboard 🛹Imagine you have two friends that you're trying to teach how to skateboard. Both have never skateboarded previously. Friend A, call them Anna, has snowboarded in

Using Computer Vision to Boost Cities' Efficiency by Reallocating Police Resources

The below post is a guest post written by data scientist Joseph Rosenblum. He is using computer vision to make cities more efficient and decrease the bias in traffic-related policing.

Football, Kaggle, Roboflow: Using Computer Vision to Tackle Helmet Safety

If you're searching for a dataset to use or are looking to improve your data science modeling skills, Kaggle is a great resource for free data and for competitions. For

5 Strategies for Handling Unbalanced Classes

Suppose you're trying to teach an alien – like one of the crewmates from the wildly popular game Among Us – to tell the difference between a human and a dog. "Purp

Scaled-YOLOv4 is Now the Best Model for Object Detection

(based on Microsoft COCO benchmarks) The object detection space remains white hot with the recent publication of Scaled-YOLOv4, establishing a new state of the art in object detection. Looking to

How to Run Jupyter Notebooks on an Apple M1 Mac

You've probably heard a lot about the MacBook that contains the new Apple M1 chip. Quick summary: It's fast. Like, really fast. You, a data scientist or related tech professional,

Seven Tips for Labeling Images for Computer Vision

Creating a high quality dataset for computer vision is essential to having strong model performance. In addition to collecting images that are as similar to your deployed conditions as possible,

Introduction to Computer Vision

After reading this post, you should have a good understanding of computer vision without a strong technical background and you should know the steps needed to solve a computer vision problem.

What is Active Learning?

Machine learning algorithms are exceptionally data-hungry, requiring thousands – if not millions – of examples to make informed decisions. Providing high quality training data for our algorithms to learn is an expensive

Google Researchers Say Underspecification is Ruining Your Model Performance. Here's Five Ways to Fix That.

We read that Google underspecification paper so you don't have to.

Bringing Street Murals to Life with Computer Vision

In Bedford–Stuyvesant, Brooklyn (BedStuy), Yuri Fukuda regularly walks by a mural that showcases prominent female leaders. Since October 2005, a stunning 3,300 square foot mural, When Women Pursue

YOLOv4 - Ten Tactics to Build a Better Model

The YOLO v4 repository is currently one of the best places to train a custom object detector, and the capabilities of the Darknet repository are vast. In this post, we discuss and implement ten advanced tactics in YOLO v4 so you can build the best object detection model from your custom dataset.

Evaluating Object Detection Models with mAP by Class

When evaluating an object detection model in computer vision, mean average precision is the most commonly cited metric for assessing performance. Remember, mean average precision is a measure of our

How to Use the Detectron2 Model Zoo (for Object Detection)

Detectron2 is Facebook's open source library for implementing state-of-the-art computer vision techniques in PyTorch. Facebook introduced Detectron2 in October 2019 as a complete rewrite of Detectron (which was implemented in

How This Fulbright Scholar is Using Computer Vision to Protect Endangered Species

The below post is by Kasim Rafiq, a conservationist, Fulbright Scholar, and National Geographic Explorer studying at UC Santa Cruz. Kasim holds a PhD in Wildlife Ecology from Liverpool John

How to Create Your Own Train Test Split in Roboflow

In order to ensure our models are generalizing well (rather than memorizing training data), it is best practice to create a train, test split. That is, absent rigor, our models

Luxonis OAK-D - Deploy a Custom Object Detection Model with Depth

We are pretty excited about the Luxonis OpenCV AI Kit (OAK-D) device at Roboflow, and we're not alone. Our excitement has naturally led us to create another tutorial on how to train and deploy a custom object detection model leveraging Roboflow and DepthAI, to the edge, with depth, faster.

Behind the Design of an Augmented Reality Board Game App

During the summer of 2019, I received a Facebook message from Roboflow co-founder Brad Dwyer asking me if I wanted to design a new mobile app he was working on.

How to Save and Load Model Weights in Google Colab

Google Colab is Google's hosted Jupyter Notebook product that provides a free compute environment, including GPU and TPU. Colab comes "batteries included" with many popular Python packages installed, making it

Using Computer Vision to Help Deaf and Hard of Hearing Communities

The below post is a lightly edited guest post by David Lee, a data scientist using computer vision to boost tech accessibility for communities that need it. David has open

Introducing an Improved Shear Augmentation

Today, we introduce a new and improved shear augmentation. We'll walk through some details on the change, as well as some intuition and results backing up our reasoning.

How Tesla Teaches Cars to Stop

Creating successful computer vision models requires handling an ever growing set of edge cases. At the 2020 Conference on Computer Vision and Pattern Recognition (CVPR), Tesla's Senior Director of AI,

Glossary of Common Computer Vision Terms

Computer Vision (and Machine Learning in general) is one of those fields that can seem hard to approach because there are so many industry-specific words (or common words used in novel ways) that it can feel a bit like you're trying to learn a new language when you're trying to get started.

What is a Label Map?

In this post, we will demystify the label map by discussing the role that it plays in the computer vision annotation process. Then we will get hands on with some real life examples using a label map.

Getting Started with LabelMe - Computer Vision Annotation

In this post, we will walk through how to jumpstart your image annotation process using LabelMe, a free, open source labeling tool. Labeling images from the public aerial maritime dataset

What the heck is an annotation group?

If you're wondering this, you're not alone. The annotation group is the category that encompasses all of the classes in your dataset. It answers the question "What kind of things

Elisha Odemakinde Hosts Roboflow for a Fireside Chat

Recently, Roboflow machine learning engineer Jacob Solawetz sat down with Elisha Odemakinde, an ML researcher and Community Manager at Data Science Nigeria, for a Fireside chat. During the conversation, Jacob

The Train, Validation, Test Split and Why You Need It

At Roboflow, we often get asked, what is the train, validation, test split and why do I need it? The motivation is quite simple: you should separate you data into train, validation, and test splits to prevent your model from overfitting and to accurately evaluate your model. T

How to Train a Custom Resnet34 Model for Image Classification in fastai and PyTorch

Can a computer tell the difference between a dandelion and a daisy? In this post we put these philosophical musings aside, and dive into the the code necessary to find

Fast.ai v2 Released - What's New?

Fastai, the popular deep learning framework and MOOC releases fastai v2 with new improvements to the fastai library, a new online machine learning course, and new helper repositories. fastai's layered

Tackling the Small Object Problem in Object Detection

Detecting small objects is one of the most challenging and important problems in computer vision. In this post, we will discuss some of the strategies we have developed at Roboflow

How to Train a Custom TensorFlow Lite Object Detection Model

In this post, we walk through the steps to train and export a custom TensorFlow Lite object detection model with your own object detection dataset to detect your own custom

PP-YOLO Surpasses YOLOv4 - State of the Art Object Detection Techniques

Baidu publishes PP-YOLO and pushes the state of the art in object detection research by building on top of YOLOv3, the PaddlePaddle deep learning framework, and cutting edge computer vision research.

Ontology Management for Computer Vision

As their projects mature and dataset sizes grow, most teams wrestle with their workflow. Slicing and dicing data is more of an art than a science and you will want

How to Train EfficientNet - Custom Image Classification

In this tutorial, we will train state of the art EfficientNet convolutional neural network, to classify images, using a custom dataset and custom classifications. To run this tutorial on your

What are Anchor Boxes in Object Detection?

Object detection models utilize anchor boxes to make bounding box predictions. In this post, we dive into the concept of anchor boxes and why they are so pivotal for modeling

Breaking Down YOLOv4

A thorough explanation of how YOLOv4 worksThe realtime object detection space remains hot and moves ever forward with the publication of YOLO v4. Relative to inference speed, YOLOv4 outperforms other

Getting Started with Data Augmentation in Computer Vision

Data augmentation in computer vision is not new, but recently data augmentation has emerged on the forefront of state of the art modeling. YOLOv4, a new state of the art

What is Mean Average Precision (mAP) in Object Detection?

What is mean average precision? How do we calculate mAP?

Breaking Down the Technology Behind Self-Driving Cars

In May 2016, Joshua Brown died in the Tesla's first autopilot crash. The crash was attributed to the self-driving cars system not recognizing the difference between a truck and the

YOLOv3 Versus EfficientDet for State-of-the-Art Object Detection

YOLOv3 is known to be an incredibly performant, state-of-the-art model architecture: fast, accurate, and reliable. So how does the "new kid on the block," EfficientDet, compare? Without spoilers, we were