Level Extreme platform
Subscription
Corporate profile
Products & Services
Support
Legal
Français
Reading fields from PDF
Message
 
To
11/11/2011 10:44:59
General information
Forum:
ASP.NET
Category:
Other
Environment versions
Environment:
VB 9.0
OS:
Windows 7
Network:
Windows 2003 Server
Database:
MS SQL Server
Application:
Web
Miscellaneous
Thread ID:
01528652
Message ID:
01528814
Views:
44
Hi Michel

I don't know if it is too late but I have used some tools from http://www.aspose.com/categories/.net-components/aspose.pdf-for-.net/default.aspx and there support is excellent.


>Right now, we are using a very primitive PDFToTXT product from http://www.verypdf.com, which has been abandoned and taken over by another party which doesn't show any intention of offering technical support. As a matter of fact, the new site shows "Live Chat Offline", yes, you read that well "Live Chat Offline".
>
>This utility converts to TXT and then we apply all kinds of logic to parse whatever we need. Of course, we have problems with French characters sometimes as it only converts them at 75%. It seems their 2007 version has not been updated for new PDF format. Also, we end up with page markers and weird characters in the parsing and this makes our work very difficult to obtain a clean parsing of the data we want.
>
>So, is there a .NET PDF reader DLL we can have where we would be able to obtain a more advanced way of extracting the fields we want. As, I assume a PDF much have field names hidden. So, with a more advanced utility, I assume we could just use the PDF field name and obtain the value as is.
Éric Moreau, MCPD, Visual Developer - Visual Basic MVP
Conseiller Principal / Senior Consultant
Moer inc.
http://www.emoreau.com
Previous
Next
Reply
Map
View

Click here to load this message in the networking platform