In this paper we analyze the concept of grid programming as it applies to document imaging and processing. The paper mostly focuses on optical character recognition (OCR) processing mapped to a simple grid configuration. Alchemi .NET grid framework was used as the grid engine. Basic dependences of the OCR engine behavior in the grid configuration are discussed and the increase of the document processing rate is demonstrated. Based solely on the small sample and test regimen employed, we demonstrate that throughput in excess of 150 percent can be achieved even in this CPU intensive process by employing grid strategies.
Analysis of Grid Computing as It Applies to High-Volume Document Processing and Optical Character Recognition