Hi, I am in desperate need of help with a "screen scraping" script. I'm currently using a web service that returns information in XML. This service will be switched off shortly and I have to revert to the "old technique" of reading information off an HTML page (all legal under the terms and conditions of the information provider).
I think I know what I want to achieve, I just don't know how... Can you point me to a tutorial or some examples of how to do it? This is what I understand I need to do:
- read the extract page from the external site (cURL?)
- dump content to a "buffer" (?)
- extract text that falls between <pre>...</pre> tags as rows
- remove column headers (always the same rows, repeated throughout the file)
- split individual rows into fields (space as delimiter)
- join with another bit of info based on a common name, taken from an external txt file
- write the lot into a MySQL database
- repeat the process at 20 min past every hour (cgi script?)
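To make the steps above more concrete, here is a rough sketch of the middle part only (pulling rows out of <pre> blocks, dropping the repeated header rows, splitting on spaces, and joining with the extra info by name). It is an illustration, not a finished script: the sample page, the header text "NAME VALUE UNITS", the "name,extra" layout of the txt file and all function names are made up, and fetching the page and writing to MySQL are left out.

```php
<?php
// Made-up sample standing in for the fetched page (not the provider's real format).
$html = "<html><body><pre>\n"
      . "NAME VALUE UNITS\n"
      . "alpha 10 kg\n"
      . "beta 20 kg\n"
      . "NAME VALUE UNITS\n"
      . "gamma 30 kg\n"
      . "</pre></body></html>";

// Made-up stand-in for the external txt file: one "name,extra" pair per line.
$lookupText = "alpha,north\nbeta,south\n";

// Return the non-empty, trimmed lines found inside every <pre>...</pre> block.
function extract_pre_rows($html) {
    $rows = array();
    // s: dot also matches newlines, i: ignore tag case, .*? stops at the first </pre>
    preg_match_all('/<pre\b[^>]*>(.*?)<\/pre>/si', $html, $matches);
    foreach ($matches[1] as $block) {
        foreach (preg_split('/\r\n|\r|\n/', $block) as $line) {
            $line = trim($line);
            if ($line !== '') {
                $rows[] = $line;
            }
        }
    }
    return $rows;
}

// Drop rows identical to the header line, then split the rest on whitespace.
function parse_rows($rows, $headerLine) {
    $records = array();
    foreach ($rows as $row) {
        if ($row === $headerLine) {
            continue; // repeated column header, skip it
        }
        $records[] = preg_split('/\s+/', $row);
    }
    return $records;
}

// Turn "name,extra" lines into an array keyed by name.
function load_lookup($text) {
    $lookup = array();
    foreach (preg_split('/\r\n|\r|\n/', $text) as $line) {
        $parts = explode(',', trim($line), 2);
        if (count($parts) == 2) {
            $lookup[trim($parts[0])] = trim($parts[1]);
        }
    }
    return $lookup;
}

// Append the looked-up value to each record, matching on the first field (the name).
// Names missing from the lookup get an empty string.
function join_records($records, $lookup) {
    $joined = array();
    foreach ($records as $fields) {
        $name = $fields[0];
        $fields[] = isset($lookup[$name]) ? $lookup[$name] : '';
        $joined[] = $fields;
    }
    return $joined;
}

$rows    = extract_pre_rows($html);
$records = join_records(parse_rows($rows, 'NAME VALUE UNITS'), load_lookup($lookupText));

foreach ($records as $record) {
    echo implode('|', $record), "\n";
}
```

In a real script $html would presumably come from cURL (curl_init, curl_setopt with CURLOPT_RETURNTRANSFER, curl_exec) if that extension is enabled on the host, and $lookupText from the txt file. Each record would then be inserted into MySQL, and the whole script could be run at 20 minutes past each hour by a cron entry starting with "20 * * * *" if the host offers cron.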
Phew, and it used to be so simple... I'm running my site on PHP 4.4.4. My knowledge of PHP is very limited.
ANY HELP MUCH APPRECIATED!