UPDATE: The introduction to this post may seen a little “out there”. For some context, I had just finished watching the South Park Black Friday episodes prior to writing this post so I definitely had some inspiration regarding zombie shoppers, Black Friday chaos, and Game of Thrones.
Black Friday is coming.
Hordes of angry shoppers. Stampedes of middle-aged midwestern women, their toothless gums bloodthirsty for 75% off the latest season of Game of Thrones at the local Wal-Mart.
Theyāll lineup outside the Wal-Mat doors on Thanksgiving at midnight. Theyāll rally, beating at the locked doors with their hands and heads until their bodies are raw and bloody, like zombies from 28 Days Later. But instead of human flesh, they crave petty consumer sustenance. Their war cries of discounts and sales will reach the heavens. And their thunderous footsteps will cause earthquakes across the Great Plains.
Of course, the media wonāt help — they will sensationalize every last little piece. From frostbitten families camping out all night in the blistering cold, to the little old lady trampled by raging bargain hunters as the doors open, akin to the Gallimimus stampede in Jurassic Park. All of this simply because she wanted to purchase the latest Halo game for Timmy, her little 9 year old grandson, whoās parents passed away this time last year. At a Wal-Mart. During Black Friday.
And I have to ask, is all this chaos and bedlam worth it?
Hell. No.
Any shopping I do this Black Friday will be from (safely) behind my laptop screen, likely nursing a hangover from the night before with a cup of coffee and a handful of Tylenol.
But if you decide you are going to venture out into the real-world and brave the bargain hunters, youāll want to download the source code to this blog post first…
Imagine how silly you would feel, standing in line, waiting to checkout, only to scan the barcode on the latest season of Game of Thrones only to find out that Target has it for $5 cheaper?
In the rest of this blog post Iāll show you how to detect barcodes in images using nothing but Python and OpenCV.
Looking for the source code to this post?
Jump Right To The Downloads SectionDetecting Barcodes in Images using Python and OpenCV
The goal of this blog post is to demonstrate a basic implementation of barcode detection using computer vision and image processing techniques. My implementation of the algorithm is originally based loosely on this StackOverflow question. I have gone through the code and provided some updates and improvements to the original algorithm.
Itās important to note that this algorithm will not work for all barcodes, but it should give you the basic intuition as to what types of techniques you should be applying.
For this example, we will be detecting the barcode in the following image:
Letās go ahead and start writing some code. Open up a new file, name it detect_barcode.py
, and letās get coding:
# import the necessary packages import numpy as np import argparse import imutils import cv2 # construct the argument parse and parse the arguments ap = argparse.ArgumentParser() ap.add_argument("-i", "--image", required = True, help = "path to the image file") args = vars(ap.parse_args())
The first thing weāll do is import the packages weāll need. Weāll utilize NumPy for numeric processing, argparse
for parsing command line arguments, and cv2
for our OpenCV bindings.
Then weāll setup our command line arguments. We need just a single switch here, --image
, which is the path to our image that contains a barcode that we want to detect.
Now, time for some actual image processing:
# load the image and convert it to grayscale image = cv2.imread(args["image"]) gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY) # compute the Scharr gradient magnitude representation of the images # in both the x and y direction using OpenCV 2.4 ddepth = cv2.cv.CV_32F if imutils.is_cv2() else cv2.CV_32F gradX = cv2.Sobel(gray, ddepth=ddepth, dx=1, dy=0, ksize=-1) gradY = cv2.Sobel(gray, ddepth=ddepth, dx=0, dy=1, ksize=-1) # subtract the y-gradient from the x-gradient gradient = cv2.subtract(gradX, gradY) gradient = cv2.convertScaleAbs(gradient)
On Lines 14 and 15 we load our image
off disk and convert it to grayscale.
Then, we use the Scharr operator (specified using ksize = -1
) to construct the gradient magnitude representation of the grayscale image in the horizontal and vertical directions on Lines 19-21.
From there, we subtract the y-gradient of the Scharr operator from the x-gradient of the Scharr operator on Lines 24 and 25. By performing this subtraction we are left with regions of the image that have high horizontal gradients and low vertical gradients.
Our gradient
representation of our original image above looks like:
Notice how the barcoded region of the image has been detected by our gradient operations. The next steps will be to filter out the noise in the image and focus solely on the barcode region.
# blur and threshold the image blurred = cv2.blur(gradient, (9, 9)) (_, thresh) = cv2.threshold(blurred, 225, 255, cv2.THRESH_BINARY)
The first thing weāll do is apply an average blur on Line 28 to the gradient image using a 9 x 9 kernel. This will help smooth out high frequency noise in the gradient representation of the image.
Weāll then threshold the blurred image on Line 29. Any pixel in the gradient image that is not greater than 225 is set to 0 (black). Otherwise, the pixel is set to 255 (white).
The output of the blurring and thresholding looks like this:
However, as you can see in the threshold image above, there are gaps between the vertical bars of the barcode. In order to close these gaps and make it easier for our algorithm to detect the āblobā-like region of the barcode, weāll need to perform some basic morphological operations:
# construct a closing kernel and apply it to the thresholded image kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (21, 7)) closed = cv2.morphologyEx(thresh, cv2.MORPH_CLOSE, kernel)
Weāll start by constructing a rectangular kernel using the cv2.getStructuringElement
on Line 32. This kernel has a width that is larger than the height, thus allowing us to close the gaps between vertical stripes of the barcode.
We then perform our morphological operation on Line 33 by applying our kernel to our thresholded image, thus attempting to close the the gaps between the bars.
You can now see that the gaps are substantially more closed, as compared to the thresholded image above:
Of course, now we have small blobs in the image that are not part of the actual barcode, but may interfere with our contour detection.
Letās go ahead and try to remove these small blobs:
# perform a series of erosions and dilations closed = cv2.erode(closed, None, iterations = 4) closed = cv2.dilate(closed, None, iterations = 4)
All we are doing here is performing 4 iterations of erosions, followed by 4 iterations of dilations. An erosion will āerodeā the white pixels in the image, thus removing the small blobs, whereas a dilation will ādilateā the remaining white pixels and grow the white regions back out.
Provided that the small blobs were removed during the erosion, they will not reappear during the dilation.
After our series of erosions and dilations you can see that the small blobs have been successfully removed and we are left with the barcode region:
Finally, letās find the contours of the barcoded region of the image:
# find the contours in the thresholded image, then sort the contours # by their area, keeping only the largest one cnts = cv2.findContours(closed.copy(), cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE) cnts = imutils.grab_contours(cnts) c = sorted(cnts, key = cv2.contourArea, reverse = True)[0] # compute the rotated bounding box of the largest contour rect = cv2.minAreaRect(c) box = cv2.cv.BoxPoints(rect) if imutils.is_cv2() else cv2.boxPoints(rect) box = np.int0(box) # draw a bounding box arounded the detected barcode and display the # image cv2.drawContours(image, [box], -1, (0, 255, 0), 3) cv2.imshow("Image", image) cv2.waitKey(0)
Luckily, this is the easy part. On Lines 41-44 we simply find the largest contour in the image, which if we have done our image processing steps correctly, should correspond to the barcoded region.
We then determine the minimum bounding box for the largest contour on Lines 47-49 and finally display the detected barcode on Lines 53-55.
As you can see in the following image, we have successfully detected the barcode:
In the next section weāll try a few more images.
Successful Barcode Detections
To follow along with these results, use the form at the bottom of this post to download the source code and accompanying images for this blog post.
Struggling with running the code in this section? Take a look at my command line arguments blog post.
Once you have the code and images, open up a terminal and execute the following command:
$ python detect_barcode.py --image images/barcode_02.jpg
No problem detecting the barcode on that jar of coconut oil!
Letās try another image:
$ python detect_barcode.py --image images/barcode_03.jpg
We were able to find the barcode in that image too!
But enough of the food products, what about the barcode on a book:
$ python detect_barcode.py --image images/barcode_04.jpg
Again, no problem!
How about the tracking code on a package?
$ python detect_barcode.py --image images/barcode_05.jpg
Again, our algorithm is able to successfully detect the barcode.
Finally, letās try one more image This one is of my favorite pasta sauce, Raoās Homemade Vodka Sauce:
$ python detect_barcode.py --image images/barcode_06.jpg
We were once again able to detect the barcode!
What's next? I recommend PyImageSearch University.
30+ total classes • 39h 44m video • Last updated: 12/2021
★★★★★ 4.84 (128 Ratings) • 3,000+ Students Enrolled
I strongly believe that if you had the right teacher you could master computer vision and deep learning.
Do you think learning computer vision and deep learning has to be time-consuming, overwhelming, and complicated? Or has to involve complex mathematics and equations? Or requires a degree in computer science?
Thatās not the case.
All you need to master computer vision and deep learning is for someone to explain things to you in simple, intuitive terms. And thatās exactly what I do. My mission is to change education and how complex Artificial Intelligence topics are taught.
If you're serious about learning computer vision, your next stop should be PyImageSearch University, the most comprehensive computer vision, deep learning, and OpenCV course online today. Here youāll learn how to successfully and confidently apply computer vision to your work, research, and projects. Join me in computer vision mastery.
Inside PyImageSearch University you'll find:
- ✓ 30+ courses on essential computer vision, deep learning, and OpenCV topics
- ✓ 30+ Certificates of Completion
- ✓ 39h 44m on-demand video
- ✓ Brand new courses released every month, ensuring you can keep up with state-of-the-art techniques
- ✓ Pre-configured Jupyter Notebooks in Google Colab
- ✓ Run all code examples in your web browser ā works on Windows, macOS, and Linux (no dev environment configuration required!)
- ✓ Access to centralized code repos for all 500+ tutorials on PyImageSearch
- ✓ Easy one-click downloads for code, datasets, pre-trained models, etc.
- ✓ Access on mobile, laptop, desktop, etc.
Summary
In this blog post we reviewed the steps necessary to detect barcodes in images using computer vision techniques. We implemented our algorithm using the Python programming language and the OpenCV library.
The general outline of the algorithm is to:
- Compute the Scharr gradient magnitude representations in both the xĀ and yĀ direction.
- Subtract the y-gradient from the x-gradient to reveal the barcoded region.
- Blur and threshold the image.
- Apply a closing kernel to the thresholded image.
- Perform a series of dilations and erosions.
- Find the largest contour in the image, which is now presumably the barcode.
It is important to note that since this method makes assumptions regarding the gradient representations of the image, and thus will only work for horizontal barcodes.
If you wanted to implement a more robust barcode detection algorithm, you would need to take the orientation of the image into consideration, or better yet, apply machine learning techniques such as Haar cascades or HOG + Linear SVM to āscanā the image for barcoded regions.
Download the Source Code and FREE 17-page Resource Guide
Enter your email address below to get a .zip of the code and a FREE 17-page Resource Guide on Computer Vision, OpenCV, and Deep Learning. Inside you'll find my hand-picked tutorials, books, courses, and libraries to help you master CV and DL!