Last Updated on April 30, 2022
Table of Contents
- Intersection over Union for object detection
- What is Intersection over Union?
- Where are you getting the ground-truth examples from?
- Why do we use Intersection over Union?
- Implementing Intersection over Union in Python
- Comparing predicted detections to the ground-truth with Intersection over Union
- Alternative Intersection over Union implementations
- Summary
Intersection over Union (IoU) is used to evaluate the performance of object detection by comparing the ground-truth bounding box to the predicted bounding box, and IoU is the subject of this tutorial.
A solid understanding of IoU requires practical application. Access to a well-curated dataset enables learners to engage with real-world challenges, deepening their understanding of object detection and how IoU is applied to measure accuracy.
Today’s blog post is inspired by an email I received from Jason, a student at the University of Rochester.
Jason is interested in building a custom object detector using the HOG + Linear SVM framework for his final year project. He understands the steps required to build the object detector well enough — but he isn’t sure how to evaluate the accuracy of his detector once it’s trained.
His professor mentioned that he should use the Intersection over Union (IoU) method for evaluation, but Jason’s not sure how to implement it.
I helped Jason out over email by:
- Describing what Intersection over Union is.
- Explaining why we use Intersection over Union to evaluate object detectors.
- Providing him with some example Python code from my own personal archives to compute Intersection over Union on bounding boxes.
My email really helped Jason finish putting his final year project together, and I’m sure he’s going to pass with flying colors.
With that in mind, I’ve decided to turn my response to Jason into an actual blog post in hopes that it will help you as well.
To learn how to evaluate your own custom object detectors using the Intersection over Union evaluation metric, just keep reading.
- Update July 2021: Added a section on alternative Intersection over Union implementations, including IoU methods that can be used as loss functions when training deep neural network object detectors.
- Update April 2022: Added TOC and linked the post to a new Intersection over Union tutorial.
- Update December 2022: Removed the link to the dataset as the dataset is no longer publicly available, and refreshed the content.
Looking for the source code to this post?
Jump Right To The Downloads Section
Intersection over Union for object detection
In the remainder of this blog post I’ll explain what the Intersection over Union evaluation metric is and why we use it.
I’ll also provide a Python implementation of Intersection over Union that you can use when evaluating your own custom object detectors.
Finally, we’ll look at some actual results of applying the Intersection over Union evaluation metric to a set of ground-truth and predicted bounding boxes.
What is Intersection over Union?
Intersection over Union is an evaluation metric used to measure the accuracy of an object detector on a particular dataset. We often see this evaluation metric used in object detection challenges such as the popular PASCAL VOC challenge.
You’ll typically find Intersection over Union used to evaluate the performance of HOG + Linear SVM object detectors and Convolutional Neural Network detectors (R-CNN, Faster R-CNN, YOLO, etc.); however, keep in mind that the actual algorithm used to generate the predictions doesn’t matter.
Intersection over Union is simply an evaluation metric. Any algorithm that provides predicted bounding boxes as output can be evaluated using IoU.
More formally, in order to apply Intersection over Union to evaluate an (arbitrary) object detector we need:
- The ground-truth bounding boxes (i.e., the hand labeled bounding boxes from the testing set that specify where in the image our object is).
- The predicted bounding boxes from our model.
As long as we have these two sets of bounding boxes we can apply Intersection over Union.
Below I have included a visual example of a ground-truth bounding box versus a predicted bounding box:
In the figure above we can see that our object detector has detected the presence of a stop sign in an image.
The predicted bounding box is drawn in red while the ground-truth (i.e., hand labeled) bounding box is drawn in green.
Computing Intersection over Union can therefore be determined via:
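The equation itself appeared as a figure in the original post; written out (the box symbols are my notation, not from the original figure), it is:

```latex
\mathrm{IoU}
  = \frac{\text{area of overlap}}{\text{area of union}}
  = \frac{\left| B_{\mathrm{pred}} \cap B_{\mathrm{gt}} \right|}
         {\left| B_{\mathrm{pred}} \cup B_{\mathrm{gt}} \right|}
```

where \(B_{\mathrm{pred}}\) and \(B_{\mathrm{gt}}\) denote the predicted and ground-truth bounding boxes, respectively.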
Examining this equation you can see that Intersection over Union is simply a ratio.
In the numerator we compute the area of overlap between the predicted bounding box and the ground-truth bounding box.
The denominator is the area of union, or more simply, the area encompassed by both the predicted bounding box and the ground-truth bounding box.
Dividing the area of overlap by the area of union yields our final score — the Intersection over Union.
Where are you getting the ground-truth examples from?
Before we get too far, you might be wondering where the ground-truth examples come from. I’ve mentioned before that these images are “hand labeled”, but what exactly does that mean?
You see, when training your own object detector (such as the HOG + Linear SVM method), you need a dataset. This dataset should be broken into (at least) two groups:
- A training set used for training your object detector.
- A testing set for evaluating your object detector.
You may also have a validation set used to tune the hyperparameters of your model.
Both the training and testing sets will consist of:
- The actual images themselves.
- The bounding boxes associated with the object(s) in the image. The bounding boxes are simply the (x, y)-coordinates of the object in the image.
The bounding boxes for the training and testing sets are hand labeled, hence why we call them the “ground-truth”.
Your goal is to take the training images + bounding boxes, build an object detector, and then evaluate its performance on the testing set.
An Intersection over Union score > 0.5 is normally considered a “good” prediction.
Why do we use Intersection over Union?
If you have performed any previous machine learning in your career, particularly classification, you’ll likely be used to predicting class labels where your model outputs a single label that is either correct or incorrect.
This type of binary classification makes computing accuracy straightforward; however, for object detection it’s not so simple.
In all reality, it’s extremely unlikely that the (x, y)-coordinates of our predicted bounding box are going to exactly match the (x, y)-coordinates of the ground-truth bounding box.
Due to varying parameters of our model (image pyramid scale, sliding window size, feature extraction method, etc.), a complete and total match between predicted and ground-truth bounding boxes is simply unrealistic.
Because of this, we need to define an evaluation metric that rewards predicted bounding boxes for heavily overlapping with the ground-truth:
In the figure above I have included examples of good and bad Intersection over Union scores.
As you can see, predicted bounding boxes that heavily overlap with the ground-truth bounding boxes have higher scores than those with less overlap. This makes Intersection over Union an excellent metric for evaluating custom object detectors.
We aren’t concerned with an exact match of (x, y)-coordinates, but we do want to ensure that our predicted bounding boxes match as closely as possible — Intersection over Union is able to take this into account.
Implementing Intersection over Union in Python
Now that we understand what Intersection over Union is and why we use it to evaluate object detection models, let’s go ahead and implement it in Python.
Before we get started writing any code though, I want to provide the five example images we will be working with:
These images are part of the CALTECH-101 dataset, used for both image classification and object detection.
This dataset was publicly available, but as of December 2022 it is no longer public.
Inside the PyImageSearch Gurus course I show how to train a custom object detector to detect the presence of cars in images like the ones above using the HOG + Linear SVM framework.
I have provided a visualization of the ground-truth bounding boxes (green) along with the predicted bounding boxes (red) from the custom object detector below:
Given these bounding boxes, our task is to define the Intersection over Union metric that can be used to evaluate how “good” (or “bad”) our predictions are.
With that said, open up a new file, name it intersection_over_union.py, and let’s get coding:
```python
# import the necessary packages
from collections import namedtuple
import numpy as np
import cv2

# define the `Detection` object
Detection = namedtuple("Detection", ["image_path", "gt", "pred"])
```
We start off by importing our required Python packages. We then define a Detection object that will store three attributes:
- image_path: The path to our input image that resides on disk.
- gt: The ground-truth bounding box.
- pred: The predicted bounding box from our model.
As we’ll see later in this example, I’ve already obtained the predicted bounding boxes for our five respective images and hardcoded them into this script to keep the example short and concise.
For a complete review of the HOG + Linear SVM object detection framework, please refer to this blog post. And if you’re interested in learning more about training your own custom object detectors from scratch, be sure to check out the PyImageSearch Gurus course.
Let’s go ahead and define the bb_intersection_over_union function, which, as the name suggests, is responsible for computing the Intersection over Union between two bounding boxes:
```python
def bb_intersection_over_union(boxA, boxB):
    # determine the (x, y)-coordinates of the intersection rectangle
    xA = max(boxA[0], boxB[0])
    yA = max(boxA[1], boxB[1])
    xB = min(boxA[2], boxB[2])
    yB = min(boxA[3], boxB[3])
    # compute the area of the intersection rectangle
    interArea = max(0, xB - xA + 1) * max(0, yB - yA + 1)
    # compute the area of both the prediction and ground-truth
    # rectangles
    boxAArea = (boxA[2] - boxA[0] + 1) * (boxA[3] - boxA[1] + 1)
    boxBArea = (boxB[2] - boxB[0] + 1) * (boxB[3] - boxB[1] + 1)
    # compute the intersection over union by taking the intersection
    # area and dividing it by the sum of prediction + ground-truth
    # areas - the intersection area
    iou = interArea / float(boxAArea + boxBArea - interArea)
    # return the intersection over union value
    return iou
```
This method requires two parameters: boxA and boxB, which are presumed to be our ground-truth and predicted bounding boxes (the actual order in which these parameters are supplied to bb_intersection_over_union doesn’t matter).
Lines 11-14 determine the (x, y)-coordinates of the intersection rectangle, which we then use to compute the area of the intersection (Line 17).
The interArea variable now represents the numerator in the Intersection over Union calculation.
To compute the denominator we first need to derive the area of both the predicted bounding box and the ground-truth bounding box (Lines 21 and 22).
The Intersection over Union can then be computed on Line 27 by dividing the intersection area by the union area of the two bounding boxes, taking care to subtract the intersection area from the denominator (otherwise the intersection area would be doubly counted).
Finally, the Intersection over Union score is returned to the calling function on Line 30.
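Before moving on, a quick sanity check helps confirm the boundary behavior of the computation: identical boxes should score exactly 1.0, and non-overlapping boxes should score 0.0. The snippet below is a self-contained sketch (the function body is reproduced so it runs on its own, and the boxes are made-up coordinates, not from the example images):

```python
# sanity check for bb_intersection_over_union's boundary behavior
def bb_intersection_over_union(boxA, boxB):
    # (x, y)-coordinates of the intersection rectangle
    xA = max(boxA[0], boxB[0])
    yA = max(boxA[1], boxB[1])
    xB = min(boxA[2], boxB[2])
    yB = min(boxA[3], boxB[3])
    # intersection area, clamped to zero for disjoint boxes
    interArea = max(0, xB - xA + 1) * max(0, yB - yA + 1)
    # individual box areas
    boxAArea = (boxA[2] - boxA[0] + 1) * (boxA[3] - boxA[1] + 1)
    boxBArea = (boxB[2] - boxB[0] + 1) * (boxB[3] - boxB[1] + 1)
    # intersection divided by union
    return interArea / float(boxAArea + boxBArea - interArea)

# identical boxes: perfect overlap
print(bb_intersection_over_union((0, 0, 9, 9), (0, 0, 9, 9)))  # 1.0
# disjoint boxes: the max(0, ...) terms clamp the intersection to zero
print(bb_intersection_over_union((0, 0, 1, 1), (5, 5, 6, 6)))  # 0.0
```

The max(0, ...) clamps are what keep the score well-defined when the boxes do not overlap at all.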
Now that our Intersection over Union method is finished, we need to define the ground-truth and predicted bounding box coordinates for our five example images:
```python
# define the list of example detections
examples = [
    Detection("image_0002.jpg", [39, 63, 203, 112], [54, 66, 198, 114]),
    Detection("image_0016.jpg", [49, 75, 203, 125], [42, 78, 186, 126]),
    Detection("image_0075.jpg", [31, 69, 201, 125], [18, 63, 235, 135]),
    Detection("image_0090.jpg", [50, 72, 197, 121], [54, 72, 198, 120]),
    Detection("image_0120.jpg", [35, 51, 196, 110], [36, 60, 180, 108])]
```
As I mentioned above, in order to keep this example short(er) and concise, I have manually obtained the predicted bounding box coordinates from my HOG + Linear SVM detector. These predicted bounding boxes (and corresponding ground-truth bounding boxes) are then hardcoded into this script.
For more information on how I trained this exact object detector, please refer to the PyImageSearch Gurus course.
We are now ready to evaluate our predictions:
```python
# loop over the example detections
for detection in examples:
    # load the image
    image = cv2.imread(detection.image_path)
    # draw the ground-truth bounding box along with the predicted
    # bounding box
    cv2.rectangle(image, tuple(detection.gt[:2]),
        tuple(detection.gt[2:]), (0, 255, 0), 2)
    cv2.rectangle(image, tuple(detection.pred[:2]),
        tuple(detection.pred[2:]), (0, 0, 255), 2)
    # compute the intersection over union and display it
    iou = bb_intersection_over_union(detection.gt, detection.pred)
    cv2.putText(image, "IoU: {:.4f}".format(iou), (10, 30),
        cv2.FONT_HERSHEY_SIMPLEX, 0.6, (0, 255, 0), 2)
    print("{}: {:.4f}".format(detection.image_path, iou))
    # show the output image
    cv2.imshow("Image", image)
    cv2.waitKey(0)
```
On Line 41 we start looping over each of our examples (which are Detection objects).
For each of them, we load the respective image from disk on Line 43 and then draw the ground-truth bounding box in green (Lines 47 and 48) followed by the predicted bounding box in red (Lines 49 and 50).
The actual Intersection over Union metric is computed on Line 53 by passing in the ground-truth and predicted bounding boxes.
We then write the Intersection over Union value on the image itself and also print it to our console.
Finally, the output image is displayed on our screen on Lines 59 and 60.
Comparing predicted detections to the ground-truth with Intersection over Union
To see the Intersection over Union metric in action, make sure you have downloaded the source code + example images for this blog post by using the “Downloads” section found at the bottom of this tutorial.
After unpacking the archive, execute the following command:
$ python intersection_over_union.py
Our first example image has an Intersection over Union score of 0.7980, indicating that there is significant overlap between the two bounding boxes:
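We can reproduce that 0.7980 score by hand using the image_0002.jpg coordinates hardcoded in the script. This standalone sketch inlines the same arithmetic as bb_intersection_over_union rather than calling it:

```python
# ground-truth and predicted boxes for image_0002.jpg from the script,
# in (x1, y1, x2, y2) format
gt = (39, 63, 203, 112)
pred = (54, 66, 198, 114)

# intersection rectangle dimensions (same +1 pixel convention as the function)
inter_w = min(gt[2], pred[2]) - max(gt[0], pred[0]) + 1  # 198 - 54 + 1 = 145
inter_h = min(gt[3], pred[3]) - max(gt[1], pred[1]) + 1  # 112 - 66 + 1 = 47
inter_area = inter_w * inter_h                           # 145 * 47 = 6815

# individual box areas
gt_area = (gt[2] - gt[0] + 1) * (gt[3] - gt[1] + 1)            # 165 * 50 = 8250
pred_area = (pred[2] - pred[0] + 1) * (pred[3] - pred[1] + 1)  # 145 * 49 = 7105

# area of overlap divided by area of union
iou = inter_area / float(gt_area + pred_area - inter_area)  # 6815 / 8540
print("{:.4f}".format(iou))  # prints 0.7980
```

Working the numbers through by hand like this is a good way to convince yourself the numerator, denominator, and subtraction of the double-counted intersection all behave as described above.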
The same is true for the second image, which has an Intersection over Union score of 0.7899:
Notice how the ground-truth bounding box (green) is wider than the predicted bounding box (red). This is because our object detector is defined using the HOG + Linear SVM framework, which requires us to specify a fixed size sliding window (not to mention, an image pyramid scale and the HOG parameters themselves).
Ground-truth bounding boxes will naturally have a slightly different aspect ratio than the predicted bounding boxes, but that’s okay provided that the Intersection over Union score is > 0.5 — as we can see, this is still a great prediction.
The next example demonstrates a slightly “less good” prediction where our predicted bounding box is much less “tight” than the ground-truth bounding box:
The reason for this is because our HOG + Linear SVM detector likely couldn’t “find” the car in the lower layers of the image pyramid and instead fired near the top of the pyramid where the image is much smaller.
The following example is an extremely good detection with an Intersection over Union score of 0.9472:
Notice how the predicted bounding box nearly perfectly overlaps with the ground-truth bounding box.
Here is the final example of computing Intersection over Union:
Alternative Intersection over Union implementations
This tutorial provided a Python and NumPy implementation of IoU. However, there are other implementations of IoU that may be better for your particular application and project.
For example, if you are training a deep learning model using a popular library/framework such as TensorFlow, Keras, or PyTorch, then implementing IoU using your deep learning framework should improve the speed of the algorithm.
The following list provides my suggested alternative implementations of Intersection over Union, including implementations that can be used as loss/metric functions when training a deep neural network object detector:
- TensorFlow’s MeanIoU function, which computes the mean Intersection over Union for a sample of object detection results.
- TensorFlow’s GIoULoss loss metric, which was first introduced in Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression by Rezatofighi et al. Just as you train a neural network to minimize mean squared error, cross-entropy, etc., this method acts as a drop-in replacement loss function, potentially leading to higher object detection accuracy.
- A PyTorch implementation of IoU (which I have not tested or used), but which seems to be helpful to the PyTorch community.
- We have a great Mean Average Precision (mAP) Using the COCO Evaluator tutorial that will walk you through using Intersection over Union for evaluating YOLO performance. Learn the theoretical concepts of Mean Average Precision (mAP) and evaluate the YOLOv4 detector using the gold standard COCO Evaluator.
Of course, you can always take my Python/NumPy implementation of IoU and convert it to your own library, language, etc.
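As one example of such a conversion, here is a vectorized NumPy sketch (my own adaptation, not code from this post's downloads) that scores many box pairs at once using the same +1 pixel convention as the tutorial’s function:

```python
import numpy as np

def batch_iou(boxesA, boxesB):
    # boxesA and boxesB are (N, 4) arrays of (x1, y1, x2, y2) boxes;
    # row i of boxesA is compared against row i of boxesB
    boxesA = np.asarray(boxesA, dtype=float)
    boxesB = np.asarray(boxesB, dtype=float)
    # coordinates of the per-row intersection rectangles
    xA = np.maximum(boxesA[:, 0], boxesB[:, 0])
    yA = np.maximum(boxesA[:, 1], boxesB[:, 1])
    xB = np.minimum(boxesA[:, 2], boxesB[:, 2])
    yB = np.minimum(boxesA[:, 3], boxesB[:, 3])
    # clamp negative widths/heights to zero for non-overlapping pairs
    inter = np.maximum(0, xB - xA + 1) * np.maximum(0, yB - yA + 1)
    # individual box areas
    areaA = (boxesA[:, 2] - boxesA[:, 0] + 1) * (boxesA[:, 3] - boxesA[:, 1] + 1)
    areaB = (boxesB[:, 2] - boxesB[:, 0] + 1) * (boxesB[:, 3] - boxesB[:, 1] + 1)
    return inter / (areaA + areaB - inter)

# two made-up pairs: identical boxes, then disjoint boxes
print(batch_iou([[0, 0, 9, 9], [0, 0, 1, 1]],
                [[0, 0, 9, 9], [5, 5, 6, 6]]))
```

Because all N comparisons run as array operations, this avoids a Python-level loop when evaluating a whole testing set of detections.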
Happy hacking!
What's next? We recommend PyImageSearch University.
84 total classes • 114+ hours of on-demand code walkthrough videos • Last updated: February 2024
★★★★★ 4.84 (128 Ratings) • 16,000+ Students Enrolled
I strongly believe that if you had the right teacher you could master computer vision and deep learning.
Do you think learning computer vision and deep learning has to be time-consuming, overwhelming, and complicated? Or has to involve complex mathematics and equations? Or requires a degree in computer science?
That’s not the case.
All you need to master computer vision and deep learning is for someone to explain things to you in simple, intuitive terms. And that’s exactly what I do. My mission is to change education and how complex Artificial Intelligence topics are taught.
If you're serious about learning computer vision, your next stop should be PyImageSearch University, the most comprehensive computer vision, deep learning, and OpenCV course online today. Here you’ll learn how to successfully and confidently apply computer vision to your work, research, and projects. Join me in computer vision mastery.
Inside PyImageSearch University you'll find:
- ✓ 84 courses on essential computer vision, deep learning, and OpenCV topics
- ✓ 84 Certificates of Completion
- ✓ 114+ hours of on-demand video
- ✓ Brand new courses released regularly, ensuring you can keep up with state-of-the-art techniques
- ✓ Pre-configured Jupyter Notebooks in Google Colab
- ✓ Run all code examples in your web browser — works on Windows, macOS, and Linux (no dev environment configuration required!)
- ✓ Access to centralized code repos for all 536+ tutorials on PyImageSearch
- ✓ Easy one-click downloads for code, datasets, pre-trained models, etc.
- ✓ Access on mobile, laptop, desktop, etc.
Summary
In this blog post I discussed the Intersection over Union metric used to evaluate object detectors. This metric can be used to evaluate any object detector provided that (1) the model produces predicted (x, y)-coordinates [i.e., the bounding boxes] for the object(s) in the image and (2) you have the ground-truth bounding boxes for your dataset.
Typically, you’ll see this metric used for evaluating HOG + Linear SVM and CNN-based object detectors.
To learn more about training your own custom object detectors, please refer to this blog post on the HOG + Linear SVM framework along with the PyImageSearch Gurus course, where I demonstrate how to implement custom object detectors from scratch. If you’d like to dive deeper, consider studying computer vision with our free course.
Finally, before you go, be sure to enter your email address in the form below to be notified when future PyImageSearch blog posts are published — you won’t want to miss them!
Download the Source Code and FREE 17-page Resource Guide
Enter your email address below to get a .zip of the code and a FREE 17-page Resource Guide on Computer Vision, OpenCV, and Deep Learning. Inside you'll find my hand-picked tutorials, books, courses, and libraries to help you master CV and DL!