Level Extreme platform
Subscription
Corporate profile
Products & Services
Support
Legal
Français
Pdf to a Character
Message
From
19/01/2010 02:49:04
 
 
To
18/01/2010 10:34:04
General information
Forum:
Visual FoxPro
Category:
Coding, syntax & commands
Environment versions
Visual FoxPro:
VFP 9 SP2
OS:
Windows 7
Network:
Windows 2008 Server
Database:
Visual FoxPro
Application:
Desktop
Miscellaneous
Thread ID:
01444558
Message ID:
01444686
Views:
53
>Hi all,
>
>I have a scanner that i use to put my docs into pdf files. These pdf are ocred. I wonder if it is possible to get the text or a part of the text via foxpro to a memo or a char field...

In most cases, if paper pages are scanned directly to PDFs, then they are really what I call "dumb" PDFs, in that they just contain a large image rather than text. If the intention is to try to extract the text, it's usually a good idea to skip the PDF creation step and OCR the original image file, say a JPG or TIFF, directly. Usually, the fewer the conversion steps, the better the result.

That said, the result of an OCR operation is a text file of some sort - could be as simple as a plain TXT file, or it might be structured like a Word DOC/DOCX. You often have a choice of output type.

If you can choose the output type, you could experiment with TXT, in which case a simple FILETOSTR() may be all you need to manipulate it in VFP. If it's a Word DOC you could experiment with Word automation.
Regards. Al

"Violence is the last refuge of the incompetent." -- Isaac Asimov
"Never let your sense of morals prevent you from doing what is right." -- Isaac Asimov

Neither a despot, nor a doormat, be

Every app wants to be a database app when it grows up
Previous
Reply
Map
View

Click here to load this message in the networking platform