>Hello.
>
>We have some PDF files that contain data. They are actually drug schedules (formularies), but the important fact is that there is information in the files that we would like to extract.
>
>Does anyone know how to read a PDF file? I can view/print the file with Acrobat Reader, but I am unable to extract the raw data. I tried printing to a file with the generic test only print driver, but that doesn't work. I think the letters are all printed like graphics.
>
>Does anyone know how to convert PDF files to an image file? As a less-favourable solution, we could OCR an image file to get the information.
>
>Any help would be appreciated,
>Joe
If the text is in the .PDF as text, and not as graphics, you may try and select all text, copy it, and paste it in a text document. It depends on how the text is formatted in the document, but maybe you will be able to extract the necessary information from it.
To automate the process, you could use DDE (but don't ask me how).
Also, using the .PDF Writer or Distiller from Acrobat, you can fully edit .PDF files.
Regards,
Marc
Précédent
Répondre
Voir le fil de ce thread
Voir le fil de ce thread à partir de ce message seulement
Voir tous les messages de ce thread
Voir tous les messages de ce thread à partir de ce message seulement