How can I scrape an online XHTML page and extract the contents of an XHTML table from it into a PHP5 array?
Next, this PHP5 array must be converted into an XML file, but I guess that step can be done with PEAR XML_Serializer?
It is the first step that is the problem: getting from the XHTML table to a PHP5 array.
How can Tidy (PHP5 extension) be used for this?
I've tried using the "output-xml" => true option in Tidy, but no XML is created.
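To make the attempt concrete, here is a minimal sketch of the kind of call I mean, not my exact code. The helper name tidyToXml, the sample markup, and the extra "numeric-entities" option are placeholders for illustration; in the real script the markup would come from cURL.

```php
<?php
// Hypothetical helper: run markup through Tidy with "output-xml" enabled
// and return whatever Tidy produces.
function tidyToXml($html)
{
    $config = array(
        'output-xml'       => true, // ask Tidy for well-formed XML
        'numeric-entities' => true, // emit entities as numeric references
    );
    // tidy_repair_string() returns the repaired markup as a string.
    return tidy_repair_string($html, $config, 'utf8');
}

// Made-up sample standing in for the page fetched with cURL.
$sample = '<html><head><title>Demo</title></head><body>'
        . '<table><tr><td>a</td><td>b</td></tr></table></body></html>';

$xml = tidyToXml($sample);
echo $xml;
```

The open question is how to get from the value in $xml to a PHP5 array of table rows and cells.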
My configuration:
PHP5 with cURL and Tidy
PEAR extensions - Tidy, XML_Serializer, XML_Parser, XML_Tree