OCR for Java

OCR for C++

OCR for Python via .NET

OCR for Python via Java

OCR for Javascript via C++

OCR for Node.js via C++

OCR for Python via C++

OCR Excellence with Aspose.OCR for Python via C++

Explore the power of OCR in Python with Aspose.OCR for Python via C++. Seamlessly convert images and PDFs into editable text with speed and accuracy.

Download Trial Purchase

See what’s new

import asposeocr

settings = asposeocr.RecognitionSettings()
input_data = [asposeocr.AsposeOCRInput()]
input_data[0].url = 'path/to/file'
recognize_result = AsposeOCRRecognize(input_data, settings)

> pip install aspose-ocr-python-cpp

Why Choose Aspose.OCR for Python via C++?

Enhance your OCR capabilities with Aspose.OCR for Python via C++. This feature-packed library seamlessly integrates OCR into Python applications with minimal code. Experience exceptional speed and accuracy, supporting 28 languages, including Latin, Cyrillic, and Chinese scripts. Recognize a variety of images, from scanned documents and smartphone photos to screenshots and scanned PDFs. Advanced pre-processing filters handle challenges like rotated, skewed, and noisy images, ensuring optimal performance by leveraging GPU processing.

Swift and Precise OCR

Achieve rapid and accurate OCR results with cutting-edge Python via C++ technology.

Multilingual Recognition

Recognize text in 28 languages, ensuring versatility for Python applications via C++ integration.

Comprehensive Image Support

Effortlessly process images from scanners, cameras, and smartphones with Python via C++.

Accurate Chinese Character Recognition

Precisely recognize over 6,000 Chinese characters in your Python projects via C++.

Maintain Font Styles and Formatting

Preserve font styles and formatting for accurate representation of recognized text in your Python applications via C++.

Live code sample

Initiate text recognition from images in just three lines of code. Experience the simplicity!

Ready to recognize Ready to recognize Drop a file here or click to browse *

* By uploading your files or using the service you agree with our Terms of use and Privacy Policy.

Recognition result

// Initialize OCR engine
var recognitionEngine = new Aspose.OCR.AsposeOcr();
// Extract text from image
string result = recognitionEngine.RecognizeImage("<file name>");
// Display the recognition result
Console.WriteLine(result);

OCR Integration for Python Applications

Aspose.OCR for Python via C++ seamlessly integrates with any platform supporting C++ - whether on desktop Windows, Windows Server, macOS, Linux, or the cloud.

Supported file formats

Aspose.OCR for Python via C++ can work with virtually any file you can get from a scanner or camera. Recognition results are returned in the most popular file and data exchange formats that can be saved, imported to a database, or analyzed in real time.

Images

JPEG
PNG
TIFF
BMP

Batch OCR

Multi-page PDF
ZIP
Folder

Recognition results

Text
PDF
Microsoft Word
Microsoft Excel
RTF
JSON
XML

Effortless Installation for Python via C++

Aspose.OCR for Python via C++ is delivered as a Python package or as a downloadable file with minimal dependencies. Easily install it into your project, and you’re ready to recognize texts in multiple supported languages and save recognition results in various formats.

Request a trial license to kickstart the development of a fully functional OCR application without limitations.

C++ Backend Integration for Python Applications

Our library seamlessly integrates with the C++ backend, enabling Python applications to run seamlessly on any platform – desktop Windows, Windows Server, macOS, Linux, and the cloud.

28 Recognition Languages

Support for diverse languages and scripts, including mixed languages:

Optimize recognition by letting the library detect language or define it for improved performance.

Extended Latin alphabet: Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Italian, Latvian, Lithuanian, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish;
Cyrillic alphabet: Belorussian, Bulgarian, Kazakh, Russian, Serbian, Ukrainian;
Chinese: Over 6,000 characters;
Hindi.

Key Features

Aspose.OCR for Python via C++ Explore advanced capabilities of Aspose.OCR for Python via C++.

Photo OCR

Extract text from smartphone photos with scan-level accuracy.

Searchable PDF

Convert any scan into a fully searchable and indexable document.

URL recognition

Recognize an image from URL without downloading it locally.

Bulk recognition

Read all images from multi-page documents, folders and archives.

Any font and style

Identify and recognize text in all popular typefaces and styles.

Fine-tune recognition

Adjust every OCR parameter for best recognition results.

Spell checker

Improve results by automatically correcting misspelled words.

Find text in images

Search for text or regular expression within a set of images.

Compare image texts

Compare texts on two images, regardless of the case and layout.

Limit recognition scope

Limit the set of characters the OCR engine will look for.

Detect image defects

Automatically find potentially problematic areas of image.

Recognize areas

Find and read only specific areas of an image, not all text.

Python Code Samples

Discover code samples to seamlessly integrate Aspose.OCR for Python via C++ into your applications.

Effortless Installation

As a lightweight Python package or a downloadable file with minimal dependencies, Aspose.OCR for Python via C++ ensures easy distribution. Integrate it into your project directly from Python, and you’re prepared to leverage complete OCR capabilities, saving recognition results in various formats.

Post-installation, promptly commence using Aspose.OCR for Python via C++, albeit with certain limitations. A temporary license removes all trial version restrictions for 30 days. Utilize this period to initiate the development of a fully functional OCR application, allowing you to make an informed decision on purchasing Aspose.OCR for Python via C++ later.

Document areas detection

An image scan or photograph of a text document often comprises multiple content blocks, including text paragraphs, tables, illustrations, formulas, and more. Successfully and accurately recognizing these areas of interest on a page is crucial for effective OCR. This pivotal process is known as document area detection, involving the identification, organization, and classification of distinct content regions.

Aspose.OCR offers several document areas detection algorithms, allowing you to choose the one that works best for your specific content.

Setup for varuous image content - Python OCR

settings = asposeocr.RecognitionSettings()
settings.areasMode = asposeocr.DetectAreasMode.TABLE
# DetectAreasMode.DOCUMENT / PHOTO / CURVED_TEXT / TABLE / COMBINE

input_data = [asposeocr.AsposeOCRInput()]
input_data[0].url = 'path/to/file'
recognize_result = asposeocr.AsposeOCRRecognize(input_data, settings)

Various output formats

OCR for C++ can save recognition results in multiple formats so they can be shared, stored in a database, displayed, or analyzed: plain text, PDF, Microsoft Word, Microsoft Excel, JSON, and XML. {index-content-python-cpp.code_samples.item3.content2}

Save recognition results to JSON - Python

recognize_result_txt 
    = asposeocr.AsposeOCRSerializeResult(
      recognize_result, 
      asposeocr.ExportFormat.json)