DocIm2002: Optical Character Recognition Is Slow on Document of Low Image-Quality (290450)



The information in this article applies to:

  • Microsoft Office XP Document Imaging
  • Microsoft Office XP Document Scanning

This article was previously published under Q290450

SYMPTOMS

When you scan a document, and then recognize the text using Optical Character Recognition (OCR), the OCR portion of the scan takes longer than you expect.

CAUSE

This behavior can occur when you use OCR on a document that has a dark or "noisy" background; for example, a document that has been repeatedly photocopied.

WORKAROUND

To work around this behavior, scan the document again to obtain a clearer image with which to use OCR.

MORE INFORMATION

When you use OCR on a document of low image-quality, the OCR recognizer can interpret background marks as text that needs to be recognized. This may add more time to the recognition of the document.

Modification Type:MajorLast Reviewed:11/3/2003
Keywords:kbprb KB290450