Regardless of your experience level, you'll find tremendous value inside OCR with OpenCV, Tesseract, and Python. I guarantee it.
“I lead a large team here at GE, and I gotta say, your books are awesome. I've been able to considerably strengthen my knowledge about deep learning and machine vision, which in turn has enabled me to steer the team in entirely new directions. We're working on some really cool things thanks to what your literature has taught me.”
Magnus Akesson, CIO and VP at General Electric
“As a computer scientist who had no previous experience with ML (and forgot everything I learned in my math courses) the book was easy to follow/understand and at the same time covered almost all the important details. It helped me "catch up" very quickly to the point I feel like I can start building projects! In other words.. worth every penny!”
Andy Hin, Software Developer
“Phenomenal. The concepts on deep learning are so well explained that I will be recommending this book to anybody not just involved in computer vision but AI in general.”
Dr. Zig Zdziarski, PhD in CV and ML
The goal of OCR is to create a piece of software that accepts an input image, automatically recognizes the text in the image, and then converts it to machine-encoded text (i.e., a "string" data type).
For example, if I were to present the following image to my OCR algorithm, I would expect it to detect the text, recognize the text, and then encode the text as a string variable.
But despite being such an intuitive concept, OCR is incredibly hard. The field of computer vision has existed for over 50 years (with mechanical OCR machines dating back over 100 years), but we still have not "solved" OCR and created an off-the-shelf OCR system that works in nearly any situation.
My brand new book, OCR with OpenCV, Tesseract, and Python, is for developers, students, researchers, and hobbyists just like you who want to learn how to successfully apply Optical Character Recognition to your work, research, and projects.
Regardless of your current experience level with computer vision and OCR, after reading this book you will be armed with the knowledge necessary to tackle your own OCR projects.
OCR with OpenCV, Tesseract, and Python will teach you how to successfully apply Optical Character Recognition to your work, projects, and research. You will learn via practical, hands-on projects (with lots of code) so you can not only develop your own OCR projects, but feel confident while doing so.
Inside the book we will focus on:
This book isn't just for beginners. There are advanced concepts, algorithms, and techniques covered here as well:
Best of all, the algorithms, walkthroughs, and code can be directly applied to your current job and research.
The primary focus of this book is Tesseract, the world's most popular open source OCR engine. Simply put: if you're interested in learning how to apply OCR to your own projects, you need to learn how to operate the Tesseract OCR engine.
We'll be utilizing the Python programming language in this book. Python is an extremely easy language to learn. It also has easy-to-use libraries and packages that allow us to seamlessly interact with our computer vision, deep learning, and OCR APIs.
When we interact with Tesseract via Python, we'll use PyTesseract. The PyTesseract package interfaces with Tesseract, making it easy to OCR images using Python.
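To give a feel for what interacting with Tesseract from Python looks like, here is a minimal sketch. It is not taken from the book: the helper names `build_tesseract_config` and `ocr_image` are hypothetical, and the code assumes the `pytesseract` and `opencv-python` packages plus the Tesseract binary are installed on your machine.

```python
from typing import Optional


def build_tesseract_config(psm: int = 3, oem: int = 3,
                           whitelist: Optional[str] = None) -> str:
    """Assemble a Tesseract option string (hypothetical helper).

    --oem selects the OCR engine mode, --psm the page segmentation mode,
    and tessedit_char_whitelist restricts the characters Tesseract may output.
    """
    options = [f"--oem {oem}", f"--psm {psm}"]
    if whitelist:
        options.append(f"-c tessedit_char_whitelist={whitelist}")
    return " ".join(options)


def ocr_image(image_path: str, config: str = "") -> str:
    """Load an image from disk and return the text Tesseract finds in it."""
    # Imported lazily so build_tesseract_config works without these packages.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:
        # cv2.imread returns None instead of raising when loading fails.
        raise FileNotFoundError(image_path)

    # OpenCV loads images in BGR channel order; convert to RGB before OCR.
    rgb = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
    return pytesseract.image_to_string(rgb, config=config)
```

For example, `ocr_image("digits.png", build_tesseract_config(psm=7, whitelist="0123456789"))` would ask Tesseract to treat the image as a single line of text containing only digits (the file name is a placeholder).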
For computer vision and image processing, we'll be using OpenCV, the de facto standard library for image processing. You'll find OpenCV easy to use, especially with the hands-on projects covered in the text.
When training our own custom deep learning OCR models, we'll be using Keras and TensorFlow 2. Using Keras and TensorFlow 2 is the fastest, easiest way to go from idea, to experimentation, to result.
You'll also learn how to use cloud-based OCR APIs, including Amazon Rekognition, Microsoft Cognitive Services, and the Google Vision API.
“This is one of the best books (if not the BEST) I've found in this subject. Easy to read and easy to understand, with many practical examples. A good combination between theory and hands-on problems. I would recommend it to any person interested in Deep Learning and Computer Vision. Adrian, you are a great teacher!”
Sebastian Adolfo Prieto Sanchez, Professor at University of Alcalá
This book is for developers, students, researchers, and hobbyists who want to learn how to successfully code Optical Character Recognition projects (and have at least some programming/scripting experience).
If any of these descriptions fit you... you're the target student. I am writing this book for you.
You're ready to learn OCR, but you're not sure where to start. That's where my new book, OCR with OpenCV, Tesseract, and Python, comes in.
“I am a firm believer that everything can be explained to anybody with the right words. Adrian Rosebrock excels in this art, and solves complex problems in only a few lines of code, going straight to the point and making it so clear. He makes deep learning so accessible and so exciting!”
Aline Gendrin Brokmann, Data Scientist at AstraZeneca
“Great material, great courses, great support. Thanks, Adrian.”
Greg Chapman, Principal Engineer with Microsoft Game Studios
Since this book covers a huge amount of content, I've decided to break the book down into three volumes called "bundles". A bundle includes the eBook, video tutorials, and source code for a given volume.
Each bundle builds on top of the others and includes all content from the lower volumes. You should choose a bundle based on: (1) how in-depth you want to study Optical Character Recognition, OpenCV, and Tesseract and (2) your particular budget.
You can find a quick breakdown of the three bundles below; the full list of topics to be covered can be found later on this page:
The perfect starting point if you're a beginner to OCR. Here, you'll learn the basics of the Tesseract OCR engine and how you can use Tesseract and OpenCV to build real-world OCR projects, including OCR'ing and translating text, recognizing credit card numbers in images, automatically reading passports, detecting text in natural scene images, and more!
Includes everything in the "Intro to OCR" Bundle and is the ideal starting point for readers who want to learn more advanced OCR techniques. This bundle covers how to train your own custom OCR models with Keras/TensorFlow, apply image/document alignment and registration, OCR receipts and invoices, OCR license plates, improve OCR speed with GPUs, recognize handwriting, and even train/fine-tune Tesseract models on your custom datasets.
Includes everything in the previous two bundles, plus you'll have access to every chapter in the book, a physical, printed edition of both volumes, a Certificate of Completion, and access to my private community forums for additional help and support. If you're even remotely serious about learning OCR, go with this bundle.
Whether you're brand new to OCR, or have been working with OCR for years, this book will help you reach OCR mastery.
Each bundle includes:
To receive the certificate, you will need to complete all lessons and quizzes associated with the text.
After successfully completing all lessons/quizzes, you will receive your certificate and be able to embed it directly on your LinkedIn profile, thereby demonstrating your Optical Character Recognition skills.
Since this book covers a huge amount of content, I've decided to break the book down into three volumes called "bundles". Each bundle builds on top of the others and includes all content from the lower tiers. Use the list of topics below (broken down by bundle) to help you (1) identify which topics you would like to study and then (2) choose a bundle based on this list.
Core OCR guide
Solve real-world problems with OCR
Core OCR guide for experts
The "Intro to OCR" Bundle is right for you if:
Inside this bundle you will learn the fundamentals of Optical Character Recognition using Tesseract, OpenCV, and Python. And while this is the lowest tier bundle, you'll still be getting a great education with a lot of hands-on experience.
That said, for a more in-depth treatment of OCR, I would recommend either the "OCR Practitioner" Bundle or "OCR Expert" Bundle.
My Recommendation: The "Intro to OCR" Bundle is a great first step towards applying OCR to real-world projects. You'll learn the fundamentals of OCR and Tesseract, empowering you to apply OCR to your own projects.
Take Your First Steps
Your First OCR Project
OCR'ing and Translating Text
OCR'ing Digits
Read Passports with OCR
OCR Credit Card Digits
Text localization with Tesseract
Rotated text localization with OpenCV
Complete OCR Pipeline
The "OCR Practitioner" Bundle builds on the previous bundle and includes every chapter in the "Intro to OCR" Bundle. This bundle is geared towards more advanced OCR algorithms, techniques, and use cases, including:
My Recommendation: The "OCR Practitioner" Bundle gives you the best bang for your buck. You should choose this bundle if you want a super in-depth treatment of OCR, but cannot afford the "OCR Expert" Bundle.
Training OCR Model with Keras/TensorFlow
Denoising Images with Machine Learning
Image Alignment and Registration
OCR'ing Invoices and Forms
OCR'ing Receipts
Solving Sudoku Puzzles with OCR
ANPR and OCR
OCR Video Streams
OCR with GPUs
Handwriting Recognition
OCR with Cloud APIs
Training Custom Tesseract Model
The "OCR Expert" Bundle includes everything from both the "Intro to OCR" Bundle and "OCR Practitioner" Bundle.
This bundle also includes:
My Recommendation: You should go with the "OCR Expert" Bundle if you want to study OCR in-depth and you want additional help and support along the way. When it comes to learning Optical Character Recognition, you just can't beat this bundle!
Additionally, the "OCR Expert" Bundle includes a Certificate of Completion. To receive the certificate, you will need to complete all lessons and quizzes associated with the text.
After successfully completing all lessons/quizzes, you will receive your certificate and be able to embed it directly on your LinkedIn profile, thereby demonstrating your Optical Character Recognition skills.
All Bonus Chapters
Physical, Printed Edition of Text
Access to Private Community Forums
The perfect starting point if you're a beginner to OCR. Here, you'll learn the basics of the Tesseract OCR engine and how you can use Tesseract and OpenCV to build real-world OCR projects, including OCR'ing and translating text, recognizing credit card numbers in images, automatically reading passports, detecting text in natural scene images, and more!
While this is the lowest tier bundle, you'll still be getting a great education with a lot of hands-on experience. That said, for a more in-depth treatment of OCR, I would recommend either the "OCR Practitioner" Bundle or "OCR Expert" Bundle.
The ideal starting point if you want to learn more advanced OCR techniques. This bundle includes everything from the "Intro to OCR" Bundle, plus how to train your own custom OCR models with Keras/TensorFlow, apply image/document alignment and registration, OCR receipts and invoices, OCR license plates and perform Automatic License Plate Recognition, improve OCR speed with GPUs, recognize handwriting, and even train and fine-tune Tesseract models on your own custom datasets. You'll also learn my tips, suggestions, and best practices when building OCR projects.
The "OCR Practitioner" Bundle gives you the best bang for your buck. You should choose this bundle if you want a super in-depth treatment of OCR, but cannot afford the "OCR Expert" Bundle.
Master Optical Character Recognition with OpenCV and Tesseract. This bundle includes everything from both the "Intro to OCR" Bundle and "OCR Practitioner" Bundle.
The "OCR Expert" Bundle is the only bundle that includes a hardcopy edition of both volumes of OCR with OpenCV, Tesseract, and Python mailed to your doorstep.
This bundle also includes access to my private community forums, a Certificate of Completion, and all bonus chapters included in the text.
When it comes to studying Optical Character Recognition, you can't beat this bundle!
“At the age of 58, learning ML, Computer Vision and Python all in parallel with no prior programming background was a steep learning curve and without PyImageSearch this could not have been possible. PyImageSearch brought it all nicely together.”
Sam Ranade, IT Professional
“Just wanted to thank you [Adrian] for the great work you are doing with PyImageSearch blog. The site has often been our main go-to place for solving Computer Vision problems for image and video analysis. I particularly appreciate the code samples you provide which are easy to understand.”
Dattaraj Rao, Chief Architect at GE
After reading my book, if you haven't learned the fundamentals of Optical Character Recognition, then I don't want your money. That's why I offer a 100% Money Back Guarantee. Simply send me an email and ask for a refund, up to 30 days after your purchase. With all the copies I've sold, I count the number of refunds on one hand. My readers are satisfied and I'm sure you will be too.
OCR with OpenCV, Tesseract, and Python is the most in-depth, comprehensive, and hands-on guide to learning Optical Character Recognition with OpenCV and Tesseract. You cannot find any other book or course online that includes this level of intuitive explanations and thoroughly documented code.
Each bundle builds on top of the others and includes all content from lower volumes. You should choose a bundle based on (1) how in depth you want to study Optical Character Recognition, OpenCV, and Tesseract and (2) your particular budget. Use the "Here's the full breakdown of what you'll learn inside OCR with OpenCV, Tesseract, and Python" section above to help you decide which topics you want to learn, then pick a bundle based on your choices.
After you purchase your copy of OCR with OpenCV, Tesseract, and Python, you will (1) receive an email receipt for your purchase and (2) be able to download your books, code, datasets, etc. immediately.
I offer a 30 day Money Back Guarantee on all orders. If you haven't learned the fundamentals of Optical Character Recognition after reading my book, then I don't want your money. Simply send me an email and ask for a refund, up to 30 days after your purchase. With all the copies I've sold, I count the number of refunds on one hand. My readers are satisfied and I'm sure you will be too.
Yep, the hardcopies are indeed shipping! The "OCR Expert" Bundle is the only bundle that includes a hardcopy edition. After you purchase, you will receive an email with a link to enter your shipping information. Once I have your shipping address I can get your hardcopy edition in the mail, normally within 48 hours.
Python is an extremely easy language to learn and has a huge number of powerful packages. Python is the best way to apply Optical Character Recognition to your own projects.
Yes, this book covers Tesseract for Optical Character Recognition extensively. You'll start by learning the basics of the Tesseract OCR engine, including how to utilize the Tesseract binary from the command line. From there, you'll interface with Tesseract via the Python programming language, learn how to OCR images with Python and Tesseract, and even train and fine-tune Tesseract models on your own custom datasets!
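As a rough illustration of the "binary first, Python second" progression described above, the sketch below builds the same command you would type in a terminal and runs it with Python's standard library. It is only an assumption about how one might wire this up, not the book's code; `build_tesseract_command` and `run_tesseract` are hypothetical names, and the Tesseract binary must be on your PATH for `run_tesseract` to work.

```python
import shutil
import subprocess
from typing import List


def build_tesseract_command(image_path: str, lang: str = "eng",
                            psm: int = 3) -> List[str]:
    """Return the argument list for a Tesseract command-line call."""
    # Passing "stdout" as the output base makes Tesseract print the
    # recognized text instead of writing it to a .txt file.
    return ["tesseract", image_path, "stdout", "-l", lang, "--psm", str(psm)]


def run_tesseract(image_path: str, lang: str = "eng", psm: int = 3) -> str:
    """Run the Tesseract binary on an image and return the recognized text."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract binary not found on PATH")
    result = subprocess.run(
        build_tesseract_command(image_path, lang, psm),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tesseract exits with an error
    )
    return result.stdout
```

Calling `build_tesseract_command("invoice.png", psm=6)` corresponds to running `tesseract invoice.png stdout -l eng --psm 6` in a shell (the file name is a placeholder).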
The OpenCV library is a collection of functions, modules, and tools that allow you to perform image processing and computer vision. OpenCV includes functions that let you perform OCR, but the library is actually much more powerful than just OCR. Inside this book you'll not only learn how to perform OCR with OpenCV, but also use OpenCV's functions to help you obtain higher OCR accuracy.
It's okay if you are brand new to Optical Character Recognition! This book makes no assumptions on your prior experience with OCR, computer vision, or deep learning. You'll still be able to follow along with this book, even if you are brand new to the world of OCR.
That said, a little bit of OpenCV experience goes a long way, so if you're new to OpenCV, I highly recommend that you also grab a copy of Practical Python and OpenCV so that you can learn the basics.
This book isn't just for beginners — there are advanced concepts, algorithms, and techniques covered as well:
And you'll learn how to use OCR APIs in the cloud, including Amazon Rekognition, Microsoft Cognitive Services, and Google Vision.
Best of all, the algorithms, walkthroughs, and code can be directly applied to your current job and research.
This book assumes you have some prior programming experience (e.g., you know what a variable, function, loop, etc. are). You should have more skills than a complete beginner, but you certainly do not need to be an intermediate or advanced developer. As long as you understand basic programming logic flow, you'll be successful in reading (and understanding) the contents of this book.
You do not need to know the OpenCV library to be successful when going through this book. We only use OpenCV to facilitate basic image processing operations, such as loading an image from disk and displaying it to our screen, along with some basic image processing routines.
That said, a little bit of OpenCV experience goes a long way, so if you're new to OpenCV, I highly recommend that you also grab a copy of Practical Python and OpenCV so that you can learn the basics.
The "Intro to OCR" Bundle does not require any knowledge of deep learning. The "OCR Practitioner" Bundle and "OCR Expert" Bundle require a practitioner's knowledge of deep learning, meaning that you understand the basic idea of how a model is trained. If you're new to deep learning or want to level up your skills, make sure you grab a copy of the Deep Learning for Computer Vision with Python book. You can work through this OCR book and the deep learning book in tandem.
Yes, you can always upgrade your bundle to a higher one. For example, you could purchase the "Intro to OCR" Bundle and then upgrade to the "OCR Practitioner" Bundle or "OCR Expert" Bundle at a later date.
The cost to upgrade is simply the price difference between your current bundle and the bundle you want to upgrade to (you do not need to "repurchase" the content you already own). To upgrade your bundle, just send me an email and I can get you the upgrade link.
Everyone has the same amount of time in a day — we all have 24 hours to work, spend time with our families, sleep, and have fun. If you're interested in studying Optical Character Recognition, I challenge you to make it your goal. Take the time to invest in yourself and your OCR knowledge by grabbing a copy of OCR with OpenCV, Tesseract, and Python.
Ask yourself, how much time are you wasting because:
Your scripts error out, leaving you confused on how to proceed.
OCR with OpenCV, Tesseract, and Python solves these problems so you can stop wasting your time and money following paths that only lead to failure — let me guide you to success!
I have authored over 400 blog posts about computer vision, OpenCV, and deep learning over at PyImageSearch.com. Check out the posts to get a feel for my teaching and writing style (not to mention the quality and depth of the tutorials). I would also highly suggest that you sign up for the (free) Table of Contents and sample chapters I am offering using the form at the bottom-right corner of this page.
If you have any other questions, please send me a message, and I'll get back to you ASAP.
Who's behind this?
If studying deep learning and visual recognition sounds interesting to you, I hope you'll consider grabbing a copy of this book. You'll learn a ton about deep learning and computer vision in a practical, hands-on way. And you'll have fun doing it. See you on the other side!