Hi,
Ive just posted a new script to my website, it is called Table Extractor and it can extract almost any table from any html document/page, and then convert that html table into a php array.
Here are the features:
-
Can extract almost any table from any html page.
-
Uses the table column headers as the keys in the array.
-
Strip unnecessary html tags out of the source.
-
Limit the number of columns and rows returned, as well as set the start row and start column.
-
Create extra columns based on data in others using regular expressions, eg. The URL of a link.
-
Splits colspans and rowspans into their original number of cells.
I am sure that some parts of it could be greatly improved, if anyone has any suggestions please let me know, cheers.
The script is here.
And a guide explaining the features and how to use them is here.
Thanks,