Launch: RF-DETR Keypoint in Roboflow

Published Jun 22, 2026 • 4 min read

RF-DETR Keypoint is a real-time, end-to-end pose model that beats YOLO26-pose on both accuracy and speed, learns calibrated per-keypoint uncertainty from your data, and ships under Apache 2.0. You can label, train, and deploy it in Roboflow today.

Today we are launching RF-DETR Keypoint support in Roboflow. RF-DETR Keypoint is a real-time, end-to-end keypoint detection model that extends the RF-DETR family from boxes and masks. You can use RF-DETR Keypoints across the Roboflow platform: label your skeletons in Annotate, train on your own data with Train, and deploy with Inference and Workflows.

RF-DETR Keypoint is releasing as a preview, the same way our instance segmentation model did, because we want real-world usage to shape the final family of checkpoints. You can start building with it now.

For the architecture, the self-calibrating loss, and the full benchmark methodology, read the RF-DETR Keypoint technical deep dive.

0:00

/0:09

Source

Label, Train, and Deploy in Roboflow

RF-DETR Keypoint runs through the full Roboflow workflow:

Label your skeletons. Define the keypoints and the skeleton for your object in Annotate, whether that is a 17-point person or 20 points on a basketball court. There is no requirement to match COCO.

Train on your data. Fine-tuning RF-DETR Keypoint works the same way as it does for detection and segmentation.

Deploy where you run. Serve the model with Inference on the cloud or the edge, and chain it with logic and other models in Workflows to turn poses into applications: rep counting, ergonomic checks, sports analytics, robot guidance, gauge reading.

What is RF-DETR Keypoint?

RF-DETR Keypoint builds a keypoint head directly into the detection transformer. For every object it detects, it predicts a structured set of keypoints in a single forward pass: no NMS, no heatmaps, no post-hoc grouping of points into instances. Detection and pose are learned jointly, so the keypoints even feed back to make the detections better.

It is not limited to people. The COCO checkpoint covers the 17-keypoint human skeleton because that is the benchmark everyone can check, but the architecture supports arbitrary skeletons: any number of keypoints, on any object class, with multiple keypointed classes in one model. License-plate corners, surgical instrument tips, robot-arm joints, and gauge needles are the point.

How RF-DETR Keypoint Is Different

Three things set RF-DETR Keypoint apart from other pose models.

It beats the newest YOLO pose models on accuracy and speed. On COCO Keypoints, measured end-to-end on an NVIDIA T4 with TensorRT FP16, the preview checkpoint at 576x576 reaches 71.8 AP at 9.8 ms, ahead of YOLO26x-pose (the largest model in the newest YOLO pose family) at 71.0 AP and 10.6 ms. Scaled up to 888x888 it reaches 74.2 AP, within 0.6 AP of the state-of-the-art academic model GroupPose Swin-L while running roughly 13 times faster.

It learns its own keypoint uncertainty, and hands it to you. Most pose models depend on per-keypoint tolerance constants that were hand-measured on COCO and quietly fall back to a uniform guess the moment your skeleton is not COCO's.

RF-DETR Keypoint instead predicts a full distribution for each keypoint and calibrates that spread from your data, so the loss that produces the benchmark numbers is the same loss you fine-tune with. Each keypoint comes back with a confidence ellipse and a usable 2D covariance, a drop-in observation model for a tracker, a Kalman filter, or a skeleton fit, plus separate signals for whether a keypoint is findable at all versus merely occluded.

One checkpoint runs at every speed. Like the rest of the RF-DETR family, it is trained with weight-sharing neural architecture search, so a single set of weights runs across resolutions from about 4.5 ms to 26 ms with no retraining. Train once, get a family of options.

RF-DETR is Apache 2.0: Ship It Anywhere

RF-DETR Keypoint Preview is released under the Apache 2.0 license, code and weights, free for commercial use, with no copyleft obligations and deployable inside closed-source products.

The YOLO family are AGPL-3.0, which in practice means open-sourcing the application you build around the model or buying an enterprise license, even for many internal commercial uses. If licensing has been the thing standing between your team and shipping a pose model, it is not anymore.

Conclusion

RF-DETR Keypoint is launching as a preview. The architecture and training recipe are performing well in our evaluations, but real-world usage surfaces things benchmarks do not, and that feedback will shape what comes next: checkpoints across the full accuracy-latency curve, NAS-enabled training on the platform, and purpose-built models for popular keypoint datasets.

Start with RF-DETR Keypoint or read the technical deep dive.

Cite this Post

Use the following entry to cite this post in your research:

Contributing Writer. (Jun 22, 2026). Launch: RF-DETR Keypoint in Roboflow. Roboflow Blog: https://blog.roboflow.com/launch-rf-detr-keypoint-in-roboflow/

Written by

Contributing Writer

View more posts

Launch: RF-DETR Keypoint in Roboflow

Label, Train, and Deploy in Roboflow

What is RF-DETR Keypoint?

How RF-DETR Keypoint Is Different

RF-DETR is Apache 2.0: Ship It Anywhere

Conclusion

Cite this Post

Written by

Topics

More About Product Updates

Turn Predictions Into Better Models and Insights with Vision Events

Roboflow and Standard Bots Partner to Bring Custom Visual Intelligence to Every Robot

Real-Time Keypoint Detection with RF-DETR

Launch: Claude Fable 5 available in Roboflow

Launch: Use YOLO26 Semantic Segmentation with Roboflow

Train a Computer Vision Model with Roboflow Train