Analysis of Grid Computing as It Applies to High-Volume Document Processing and Optical Character Recognition

[article]

Summary:

In this paper we analyze the concept of grid programming as it applies to document imaging and processing. The paper mostly focuses on optical character recognition (OCR) processing mapped to a simple grid configuration. Alchemi .NET grid framework was used as the grid engine. Basic dependences of the OCR engine behavior in the grid configuration are discussed and the increase of the document processing rate is demonstrated. Based solely on the small sample and test regimen employed, we demonstrate that throughput in excess of 150 percent can be achieved even in this CPU intensive process by employing grid strategies.

File:

XUS7471838file1_0.doc

Archived

About the author

Stephen Pearson

Stephen Pearson has more than fifteen years of imaging experience including FileNet, Eastman, Open Text and IBM. Currently he works as freelance consultant.

About the author

Dmitri Ilkaev

Dmitri Ilkaev has more than twenty years of experience in software and technology development. He holds Ph.D. in Computer Sciences from Moscow Institute of Physics and Technology. He can be reached at [email protected].

AgileConnection is a TechWell community.

Through conferences, training, consulting, and online resources, TechWell helps you develop and deliver great software every day.

Jun 02	AI Con USA Bridging Minds and Machines
Sep 22	STARWEST Software Testing Conference in Anaheim & Online
Oct 13	Agile + DevOps USA The Conference for Agile and DevOps Professionals

May 16	Test Ahead of the Curve: Insights for Developing a Superior Test Coverage
May 23	How Generative AI Boosts Speed and Quality in Software Testing
On Demand	Building Confidence in Your Automation
On Demand	Leveraging Open Source Tools for DevSecOps
On Demand	Five Reasons Why Agile Isn't Working