1st Edition officialy released — Grab your copy now!

OCR with OpenCV, Tesseract, and Python is more than just a book...

...it's your complete training guide to mastering Optical Character Recognition.

OCR with Tesseract, OpenCV, and Python will teach you how to successfully apply Optical Character Recognition to your work, projects, and research.

You will learn via practical, hands-on projects (with lots of code) so you can not only develop your own OCR Projects, but feel confident while doing so.

Grab Your Copy Now

I lead a large team here at GE, and I gotta say, your books are awesome. I've been able to considerably strengthen my knowledge about deep learning and machine vision, which in turn has enabled me to steer the team in entirely new directions. We're working on some really cool things thanks to what your literature has taught me.”
Magnus AkessonCIO and VP at General Electric

The OCR with OpenCV, Tesseract, and Python IndieGoGo was a huge success!

855 backers

4,756% funded

$242,815 pledged of $5,000 goal

Take a sneak peek at what's inside...

Practical, real-world projects.

The chapters in this book cover real-world use cases, including aligning and OCR'ing invoices, Automatic License/Number Plate Recognition, OCR'ing and translating text, and handwriting recognition, just to name a few. What you learn in this book will directly translate to your own OCR projects.

Clear, easy-to-follow tutorials.

This book won't bury you under a pile of theory or pages and pages of complex equations. Instead, you'll learn through intuitive chapters that are super practical and present solutions to actual real-world OCR problems. By the end of this book you'll be able to successfully apply OCR to your own projects.

Expert advice.

Inevitability, when working on your projects you'll have a question. Questions are a good thing — they mean you're pushing the boundaries of your current knowledge, and are looking to expand your understanding. With our unparalleled support you know your question won't get left unanswered.

Hands-on coding.

In this book you will learn by doing. You'll roll up your sleeves and get your hands dirty with code. Each and every chapter starts with an empty directory. We then design the project and write the code from scratch. Along the way I explain what every line of code is doing, ensuring you not only understand we are doing but we are doing it.

Unparalleled support.

I hold myself and PyImageSearch to an incredibly high standard. When you buy a book or course from me, you're not just getting the content itself — you're also getting access to me and the PyImageSearch team. We are committed to helping you not only learn OCR, but successfully apply OCR to your projects as well. I'm ready to order my copy now

As a computer scientist who had no previous experience with ML (and forgot everything I learned in my math courses) the book was easy to follow/understand and at the same time covered almost all the important details. It helped me "catch up" very quickly to the point I feel like I can start building projects! In other words.. worth every penny!”
Andy HinSoftware Developer

Phenomenal. The concepts on deep learning are so well explained that I will be recommending this book to anybody not just involved in computer vision but AI in general.”
Dr. Zig ZdziarskiPhD in CV and ML

Optical Character Recognition (OCR) is a simple concept but is hard in practice.

The goal of OCR is to Create a piece of software that accepts an input image, automatically recognize the text in the image, and then convert it to machine encoded text (i.e., a "string" data type).

For example, if I were to present the following image to my OCR algorithm, I would expect it to detect the text, recognize the text, and then encode the text as a string variable.

But despite being such an intuitive concept, OCR is incredibly hard. The field of computer vision has existed for over 50 years (with mechanical OCR machines dating back over 100 years), but we still have not "solved" OCR and created an off-the-shelf OCR system that works in nearly any situation.

And worse, trying to code custom software that can perform OCR is even harder:

Open source OCR packages like Tesseract can be difficult to use if you are new to the world of OCR.

Computer vision and image processing libraries such as OpenCV and scikit- image can help you preprocess your images to improve OCR accuracy... but which algorithms and techniques do you use?

Obtaining high accuracy with Tesseract typically requires that you know which options, parameters, and configurations to use — and unfortunately there aren't many high-quality Tesseract tutorials or books online.

Deep learning is responsible for unprecedented accuracy in nearly every area of computer science. Which deep learning models, layer types, and loss functions should you be using for OCR?

If you've ever found yourself struggling to apply OCR to a project, or if you're simply interested in learning OCR, this book is for you.

I'm ready to order my copy now

My brand new book, OCR with OpenCV, Tesseract, and Python, is for developers, students, researchers, and hobbyists just like you who want to learn how to successfully apply Optical Character Recognition to your work, research, and projects.

Regardless of your current experience level with computer vision and OCR, after reading this book you will be armed with the knowledge necessary to tackle your own OCR projects.

Regardless of your experience level, you'll find tremendous value inside OCR with OpenCV, Tesseract, and Python, I guarantee it.

Grab Your Copy Now

Curious about what you’ll learn?

OCR with Tesseract, OpenCV, and Python will teach you how to successfully apply Optical Character Recognition to your work, projects, and research. You will learn via practical, hands-on projects (with lots of code) so you can not only develop your own OCR Projects, but feel confident while doing so.

Inside the book we will focus on:

Getting started with OCR
Learning the basics of the Tesseract OCR engine
Discovering how to improve OCR accuracy using Tesseract options and configurations
Interfacing with Tesseract via the Python programming language
Localizing and detecting text in images using both OpenCV and Tesseract
Using OpenCV and image processing techniques to improve OCR accuracy
Using machine learning to denoise our images for better OCR accuracy
Image/document registration and alignment to build an invoice scanning project

Training our own custom deep learning OCR models with Keras and TensorFlow
Solving Sudoku puzzles with OCR, OpenCV, and Keras/TensorFlow
Automatic License/Number Plate Recognition (ANPR)
Handwriting recognition
Performing OCR in real-time video streams
Utilizing GPUs for faster OCR inference
Using OCR engines in the cloud (i.e., Google Vision API, Amazon Rekognition, etc.)
Tips, suggestions, and best practices when performing OCR

Experienced OCR practitioner?

This book isn't just for beginners — there's advanced concepts, algorithms, and techniques covered here as well:

You'll learn how to train custom Keras / TensorFlow OCR models.
You'll build an Automatic License/Number Plate Recognition (ANPR) system.
You'll learn how to apply OCR to real-time video streams using GPUs for inference.

You'll train and fine-tune Tesseract models on your own custom OCR datasets.
And you'll learn how to use OCR APIs in the cloud, including Amazon Rekognition, Microsoft Cognitive Services, and Google Vision.

Best of all, the algorithms, walkthroughs, and code can be directly applied to your current job and research.

Libraries and Packages

The primary focus of this book is around Tesseract, which is the world's most popular open source OCR engine. Simply put — if you're interested in learning how to apply OCR to your own projects, you need to learn how to operate the Tesesract OCR engine.

We'll be utilizing the Python programming language in this book. Python is an extremely easy language. It also has easy-to-use libraries and packages that allow us to seamlessly interact with our computer vision, deep learning, and OCR APIs.

When we interact with Tesseract via Python, we'll use PyTesseract. The PyTesseract package interfaces Tesseract, making it easy to OCR images using Python.

For computer vision and image processing, we'll be using OpenCV, the de facto standard library for image processing. You'll find OpenCV easy to use, especially with the hands-on projects covered in the text.

When training our own custom deep learning OCR models, we'll be using Keras and TensorFlow 2. Using Keras and TensorFlow 2 is the fastest, easiest way to go from idea, to experimentation, to result.

You'll also learn how to use cloud-based OCR APIs, including Amazon Rekognition, Microsoft Cognitive Services, and the Google Vision API.

I'm ready to order my copy now

This is one of the best books (if not the BEST) I've found in this subject. Easy to read and easy to understand, with many practical examples. A good combination between theory and hands-on problems. I would recommend it to any person interested in Deep Learning and Computer Vision. Adrian, you are a great teacher!”
Sebastian Adolfo Prieto SanchezProfessor at University of Alcalá

You're probably wondering...
“Is this book right for me?”

This book is for developers, students, researchers, and hobbyists who want to learn how to successfully code Optical Character Recognition project (and have at least some programming/scripting experience).

If any of these descriptions fit you... you're the target student. I am writing this book for you.

You are a computer vision developer who needs to learn how to apply OCR to a work project.
You write code at your day job and are motivated to stand out from your coworkers by learning OCR.
You are an undergrad student doing your final project and want to utilize OCR (and you need to complete the project in order to successfully graduate).
You are a MSc or PhD student working on your thesis/dissertation and need a practical, hands-on education to compliment what you are learning in textbooks and research papers.

You are a computer science teacher who wants to teach your students how to apply OCR.
You are a computer vision researcher who has read OCR papers but are struggling to translate theory into code.
You are an entrepreneur in the computer vision/deep learning space and see a gap in the market that OCR could help solve.
You are a "computer vision hobbyist" who hacks around with OpenCV and now wants to learn OCR.

Download Free Sample Chapters
and Table of Contents

You're ready to learn OCR, but you're not sure where to start. That's where my new book, OCR with Tesseract, OpenCV, and Python comes in.

I am a firm believer that everything can be explained to anybody with the right words. Adrian Rosebrock excels in this art, and solves complex problems in only a few lines of code, going straight to the point and making it so clear. He makes deep learning so accessible and so exciting!"”
Aline Gendrin BrokmannData Scientist at AstraZeneca

Great material, great courses, great support. Thanks, Adrian.”
Greg ChapmanPrincipal Engineer with Microsoft Game Studios

A three volume book — customized to what you want to learn.

Since this book covers a huge amount of content, I've decided to break the book down into three volumes called "bundles". A bundle includes the eBook, video tutorials, and source code for a given volume.

Each bundle builds on top of the others and includes all content from the lower volumes. You should choose a bundle based on: (1) how in-depth you want to study deep learning, computer vision, and visual recognition and (2) your particular budget.

You can find a quick breakdown of the three bundles below —
the full list of topics to be covered can be found later on this page:

“Intro to OCR” Bundle

The perfect starting point if you're a beginner to OCR. Here, you'll learn the basics of the Tesseract OCR engine and how you can use Tesseract and OpenCV to build real-world OCR projects, including OCR'ing and translating text, recognizing credit card numbers in images, automatically reading passports, detecting text in natural scene images, and more!

See What's Included

“OCR Practitioner” Bundle

Includes everything in the "Intro to OCR" Bundle and is the ideal starting point for readers who want to learn more advanced OCR techniques. This bundle covers how to train your own custom OCR models with Keras/TensorFlow, apply image/document alignment and registration, OCR receipts and invoices, OCR license plates, improve OCR speed with GPUs, handwriting recognition, and even how to train/fine-tune Tesseract models on your custom datasets.

See What's Included

“OCR Expert” Bundle

Includes everything in the previous two bundles, plus you'll have access to every chapter in the book, a physical, printed edition of both volumes, a Certificate of Completion, and access to my private community forums for additional help and support. If you're even remotely serious about learning OCR, go with this bundle.

See What's Included

More than just a book — it's your complete training guide to mastering Optical Character Recognition.

Whether you're brand new to OCR, or have been working with OCR for years, this book will help you reach OCR mastery.

Each bundle includes:

The eBook files in PDF, .mobi, and .epub format.

Access to the OCR with OpenCV, Tesseract, and Python companion website so you can further your knowledge, even when you're done reading the book

High quality, well documented source code

Pre-configured Virtual Machine with all the necessary computer vision, deep learning, and OCR packages pre-configured and pre-installed

The "OCR Expert" Bundle includes a Certificate of Completion.

To receive the certificate, you will need to complete all lessons and quizzes associated with the text.

After successfully completing all lessons/quizzes, you will receive your certificate and be able to embed it directly on your LinkedIn profile, thereby demonstrating your Optical Character Recognition skills.

Regardless of your experience level, you'll find tremendous value inside OCR with OpenCV, Tesseract, and Python, I guarantee it.

Grab Your Copy Now

Here's the full breakdown of what you'll learn inside OCR with Tesseract, OpenCV, and Python

Since this book covers a huge amount of content, I've decided to break the book down into three volumes called "bundles". Each bundle builds on top of the others and includes all content from the lower tiers. Use the list of topics below (broken down by bundle) to help you (1) identify which topics you would like to study and then (2) choose a bundle based on this list.

Intro Bundle

Core OCR guide

Practitioner Bundle

Solve real-world problems with OCR

Expert Bundle

Core OCR guide for experts

The "Intro to OCR" Bundle is right for you if:

You are new to the world of OCR and Computer Vision
You are just testing the OCR waters
You are on a budget

Inside this bundle you will learn the fundamentals of Optical Character Recognition using Tesseract, OpenCV, and Python. And while this is the lowest tier bundle, you'll still be getting a great education with a lot of hands-on experience.

That said, for a more in-depth treatment of OCR, I would recommend either the "OCR Practitioner" Bundle or "OCR Expert" Bundle.

My Recommendation: The "Intro to OCR" Bundle is a great first step towards applying OCR to real-world projects. You'll learn the fundamentals of OCR and Tesseract, empowering you to apply OCR to your own projects.

Take Your First Steps

tip

Your First OCR Project

Tip

OCR'ing and Translating Text

Tip

OCR'ing Digits

Tip

Read Passports with OCR

Tip

OCR Credit Card Digits

Tip

Text localization with Tesseract

Tip

Rotated text localization with OpenCV

Tip

Complete OCR Pipeline

Tip

I'm Ready to Order my copy now

Order my copy now

The "OCR Practitioner" Bundle builds on the previous bundle and includes every chapter in the "Intro to OCR" Bundle. This bundle is geared towards more advanced OCR algorithms, techniques, and use cases, including:

Deep learning and OCR
Image/document alignment and registration
OCR in real-time video streams
OCR with GPUs
Cloud-based OCR APIs
...and more!

My Recommendation: The "OCR Practitioner Bundle" gives you the best bang for your buck. You should choose this bundle if you want a super in-depth treatment of OCR, but cannot afford the "OCR Expert" Bundle.

I'm ready to order my copy now

Everything in Intro Bundle

Training OCR Model with Keras/TensorFlow

Tip

Denoising Images with Machine Learning

Tip

Image Alignment and Registration

Tip

OCR'ing Invoices and Forms

Tip

OCR'ing Receipts

Tip

Solving Sudoku Puzzles with OCR

Tip

ANPR and OCR

Tip

OCR Video Streams

Tip

OCR with GPUs

Tip

Handwriting Recognition

Tip

OCR with Cloud APIs

Tip

Training Custom Tesseract Model

Tip

Order my copy now

The "OCR Expert" Bundle includes everything from both the "Intro to OCR" Bundle and "OCR Practitioner" Bundle.

This bundle also includes:

All bonus chapters and additional content.
A physical, printed edition of both volumes of OCR with Tesseract, OpenCV, and Python — this is the only bundle that includes a hardcopy edition.
Access to my private community forums for additional help and support. You'll get faster, more detailed answers to your questions and you'll be able to better connect with myself and other readers. (again, the other two bundles do not have access to these forums).

My Recommendation: You should go with the "OCR Expert" Bundle if you want to study OCR in-depth and you want additional help and support along the way. When it comes to learning Optical Character Recognition, you just can't beat this bundle!

Additionally, "OCR Expert" Bundle includes a Certificate of Completion. To receive the certificate, you will need to complete all lessons and quizzes associated with the text.

Everything in Intro Bundle

Everything in Practitioner Bundle

All Bonus Chapters

Tip

Physical, Printed Edition of Text

Tip

Access to Private Community Forums

Tip

Certificate
of Completion

I'm Ready to Order my copy now

Order my copy now

Trusted by members of top machine learning companies and schools. Join them in computer vision and OCR mastery.

Intro Bundle

$125

Learn the basics of OCR Order My Copy Now

The perfect starting point if you're a beginner to OCR. Here, you'll learn the basics of the Tesseract OCR engine and how you can use Tesseract and OpenCV to build real-world OCR projects, including OCR'ing and translating text, recognizing credit card numbers in images, automatically reading passports, detecting text in natural scene images, and more!

Enjoy a 100% money back guarantee.

After reading my book, if you haven't learned the fundamentals of Optical Character Recognition, then I don't want your money. That's why I offer a 100% Money Back Guarantee. Simply send me an email and ask for a refund, up to 30 days after your purchase. With all the copies I've sold, I count the number of refunds on one hand. My readers are satisfied and I'm sure you will be too.

Questions?

Why this book?

OCR with OpenCV, Tesseract, and Python is the most in-depth, comprehensive, and hands-on guide to learning Optical Character Recognition with OpenCV and Tesseract. You cannot find any other book or course online that includes this level of intuitive explanations and thoroughly documented code.

What bundle should I buy?

Each bundle builds on top of the others and includes all content from lower volumes. You should choose a bundle based on (1) how in depth you want to study Optical Character Recognition, OpenCV, and Tesseract and (2) your particular budget. Use the "Here's the full breakdown of what you'll learn inside OCR with OpenCV, Tesseract, and Python" section above to help you decide which topics you want to learn, then pick a bundle based on your choices.

What happens after I purchase?

After you purchase your copy of OCR with OpenCV, Tesseract, and Python you will (1) receive an email receipt for your purchase and (2) you will be able to download your books, code, datasets, etc. immediately.

What is your money back guarantee policy?

I offer a 30 day Money Back Guarantee on all orders. If you haven't learned the fundamentals of Optical Character Recognition after reading my book, then I don't want your money. Simply send me an email and ask for a refund, up to 30 days after your purchase. With all the copies I've sold, I count the number of refunds on one hand. My readers are satisfied and I'm sure you will be too.

Are the hardcopy editions shipping?

Yep, the hardcopies are indeed shipping! The "OCR Expert" Bundle is the only bundle that includes a hardcopy edition. After you purchase, you will receive an email with a link to enter your shipping information. Once I have your shipping address I can get your hardcopy edition in the mail, normally within 48 hours.

Why are we using the Python programming language?

Python is an extremely easy language to learn and has huge number of powerful packages. Python is the best way to apply Optical Character Recognition to your own projects.

Does this book cover OCR with Tesseract?

Yes, this book covers Tesseract for Optical Character Recognition extensively. You'll start by learning the basics of the Tesseract OCR engine, including how to utilize the Tesseract binary from the command line. From there, you'll interface with Tesseract via the Python programming language, learn how to OCR images with Python and Tesseract, and even train and fine-tune Tesseract models on your own custom datasets!.

Will this book show me how to apply OCR with OpenCV?

The OpenCV library is a collection of functions, modules, and tools that allow you to perform image processing and computer vision. OpenCV includes functions that let you perform OCR, but the library is actually much more powerful than just OCR. Inside this book you'll not only learn how to perform OCR with OpenCV, but also use OpenCV's functions to help you obtain higher OCR accuracy.

What if I'm a beginner at OCR?

It's okay if you are brand new to Optical Character Recognition! This book makes no assumptions on your prior experience with OCR, computer vision, or deep learning. You'll still be able to follow along with this book, even if you are brand new to the world of OCR.

That said, a little bit of OpenCV experience goes a long way, so if you're new to OpenCV, I highly recommend that you also grab a copy of Practical Python and OpenCV so that you can learn the basics.

What if I'm already experienced in OCR?

This book isn't just for beginners — there are advanced concepts, algorithms, and techniques covered as well:

You'll learn how to train custom Keras/TensorFlow OCR models.
You'll build an Automatic License/Number Plate Recognition (ANPR) system.
You'll learn how to apply OCR to real-time video streams using GPUs for inference.
You'll train and fine-tune Tesseract models on your own custom OCR datasets.
And you'll learn how to use OCR APIs in the cloud, including Amazon Rekognition, Microsoft Cognitive Services, and Google Vision.

Best of all, the algorithms, walkthroughs, and code can be directly applied to your current job and research.

Do I need any programming experience before reading this book?

This book assumes you have some prior programming experience (e.g. you know what a variable function, loop, etc. are). You should have more skills than a beginner but certainly not an intermediate or advanced developer. As long as you understand basic programming logic-flow, you'll be successful in reading (and understanding) the contents of this book.

Do I need to know OpenCV?

You do not need to know the OpenCV library to be successful when going through this book. We only use OpenCV to facilitate basic image processing operations such as loading an image from disk, displaying it to our screen, along with some basic image processing routines.

That said, a little bit of OpenCV experience goes a long way, so if you're new to OpenCV, I highly recommend that you also grab a copy of Practical Python and OpenCV so that you can learn the basics.

Do I need to know Deep Learning, Keras, TensorFlow, etc.?

The "Intro to OCR" Bundle does not require any knowledge over deep learning. The "OCR Practitioner" Bundle and "OCR Expert" Bundle require a practitioner's knowledge over deep learning, meaning that you understand the basic idea of how a model is trained. If you're new to deep learning or want to level up your skills, make sure you grab a copy of the Deep Learning for Computer Vision with Python book — you can work through this OCR book and the deep learning book in tandem.

Can I upgrade from the lower tier bundle to a higher one? How does the upgrade process work?

Yes, you can always upgrade your bundle to a higher one. For example, you could purchase the "Intro to OCR" Bundle and then upgrade to the "OCR Practitioner" Bundle or "OCR Expert" Bundle at a later date.

The cost to upgrade would simply be the price difference between your current bundle and the bundle you wanted to upgrade to (you would not need to "repurchase" the content you already own). To upgrade your bundle just send me an email and I can get you the upgrade link.

I'm just so busy right now...

Everyone has the same amount of time in a day — we all have 24 hours to work, spend time with our families, sleep, and have fun. If you're interested in studying Optical Character Recognition, I challenge you to make it your goal. Take the time to invest in yourself and your OCR knowledge by grabbing a copy of OCR with OpenCV, Tesseract, and Python.

Ask yourself, how much time are you wasting because:

You lack the fundamentals of OpenCV, Computer Vision, and Deep Learning.
You don’t understand what knobs and dials to tune to achieve high accuracy results.
Your scripts error out, leaving you confused on how to proceed.

OCR with OpenCV, Tesseract, and Python solves these problems so you can stop wasting your time and money following paths that only lead to failure — let me guide you to success!

Where can I learn more about you?

I have authored over 400+ blog posts about computer vision, OpenCV, and deep learning over at PyImageSearch.com. Check out the posts to get a feel for my teaching and writing style (not to mention the quality and depth of the tutorials). I would also highly suggest that you sign up for the (free) Table of Contents and sample chapters I am offering using the form at the bottom-right corner of this page.

I have another question.

If you have any other questions, please send me a message, and I'll get back to you ASAP.

Who's behind this?

Hey, I'm Adrian Rosebrock, a Ph.D and entrepreneur who has spent his entire adult life studying computer vision and machine learning. Over the past 5 years alone I have:

Started the PyImageSearch.com blog and published over 400+ tutorials and articles aimed at teaching computer vision, deep learning, and OpenCV.
Created PyImageSearch Gurus, an actionable, real-world course on computer vision and OpenCV. This course is the most comprehensive computer vision education online today, covering 13 modules broken out into 168 lessons with over 2,161 pages of content.
Authored Deep Learning for Computer Vision with Python, the most in-depth computer vision + deep learning book available today, including super practical walkthroughs, hands-on tutorials (with lots of code), and a no-nonsense teaching style that will help you master computer vision and deep learning.
Answered over 50,000+ emails and helped 10,000s of developers, researchers, and students just like yourself learn the ropes of computer vision and deep learning.

If studying deep learning and visual recognition sounds interesting to you, I hope you'll consider grabbing a copy of this book. You'll learn a ton about deep learning and computer vision in a practical, hands-on way. And you'll have fun doing it. See you on the other side!

OCR with OpenCV, Tesseract, and Python is more than just a book...

The OCR with OpenCV, Tesseract, and Python IndieGoGo was a huge success!

Take a sneak peek at what's inside...

Practical, real-world projects.

Clear, easy-to-follow tutorials.

Expert advice.

Hands-on coding.

Unparalleled support.

Optical Character Recognition (OCR) is a simple concept but is hard in practice.

And worse, trying to code custom software that can perform OCR is even harder:

If you've ever found yourself struggling to apply OCR to a project, or if you're simply interested in learning OCR, this book is for you.

Curious about what you’ll learn?

Experienced OCR practitioner?

Libraries and Packages

You're probably wondering...“Is this book right for me?”

A three volume book — customized to what you want to learn.

“Intro to OCR” Bundle

“OCR Practitioner” Bundle

“OCR Expert” Bundle

More than just a book — it's your complete training guide to mastering Optical Character Recognition.

The "OCR Expert" Bundle includes a Certificate of Completion.

Here's the full breakdown of what you'll learn inside OCR with Tesseract, OpenCV, and Python

Intro Bundle

Practitioner Bundle

Expert Bundle

Trusted by members of top machine learning companies and schools. Join them in computer vision and OCR mastery.

Enjoy a 100% money back guarantee.

Questions?

Hey, I'm Adrian Rosebrock, a Ph.D and entrepreneur who has spent his entire adult life studying computer vision and machine learning. Over the past 5 years alone I have:

You're probably wondering...
“Is this book right for me?”