Hi all,
I just stumbled onto this site and am glad I did.
Let me preface this by saying that I have searched high and low across the net for answers to my dilemma (including the "netiquette" for this board). My company literally has thousands of documents in PDF format. Most of this information is perishable and is updated every 6-18 weeks by divisions scattered from here to kingdom come. My task is to filter out the 2-4 pertinent paragraphs in a 200+ page document and present them in .php / .html format.
I have searched the message base but haven't found a clear answer. Is it possible to programmatically search through a PDF file (on disk) for a string posted from an HTML form, get the pertinent paragraphs, and paste them into an HTML page? Or do I have to write C code to do it (if that is even possible), pass the result to PHP, and then output it? Or would I be reinventing the wheel because someone has already cracked the code?
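To make the question concrete, here is a rough sketch of the search-and-output step I have in mind. It is in Python purely for illustration, and it assumes the PDF has already been converted to plain text by some external tool (a command-line utility such as pdftotext might do this, but I have not tried it on our files). The function names find_paragraphs and render_html are made up for this example, and it also assumes paragraphs are separated by blank lines in the extracted text, which may not hold for every PDF.

```python
import html
import re


def find_paragraphs(text, query):
    """Return the paragraphs of `text` that contain `query` (case-insensitive)."""
    needle = query.strip().lower()
    if not needle:
        return []
    matches = []
    # Assumption: paragraphs are separated by one or more blank lines.
    for para in re.split(r"\n\s*\n", text):
        # Collapse hard line breaks and repeated spaces left by the extraction.
        flat = " ".join(para.split())
        if flat and needle in flat.lower():
            matches.append(flat)
    return matches


def render_html(paragraphs):
    """Wrap each paragraph in <p> tags, escaping any HTML special characters."""
    return "\n".join("<p>{}</p>".format(html.escape(p)) for p in paragraphs)


if __name__ == "__main__":
    # Stand-in for text extracted from a PDF; the content is invented.
    sample = "Intro text.\n\nWidget pricing is\nupdated <weekly>.\n\nUnrelated."
    print(render_html(find_paragraphs(sample, "widget")))
```

The search string would come from the posted form, and the returned HTML fragment would be dropped into the page. The part I cannot work out is the extraction itself, i.e. getting reliable text and paragraph breaks out of the PDF in the first place.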
Thanks in advance for any information.