I am tying to build a database of items that exsist on in HTML pages already. The issue I have is pretty simple though, as I can access a item on one page and have all the data, but parsing the correct values is not easy.
The problem with DOMX I have is that its generic div inside a a named div, I can of course grab the named Div data no problem and parse elements. Here is an example
<html>
<body>
<div id="page">
<div id="header">BLA BLA</div>
<div id="body">
<div id="item1">
<div class="spacer">
<div class="value">DATA I WANT</div>
</div>
</div>
<div id="item2">
<div class="spacer">
<div class="value">DATA I WANT</div>
</div>
</div>
<div id="item3">
<div class="spacer">
<div class="value">DATA I DONT WANT</div>
</div>
</div>
<div id="item4">
<div class="spacer">
<div class="value">DATA I WANT</div>
</div>
</div>
</div>
</div>
</body>
</html>
I can of course get a names Div, but when all the "data" divs are un named and use "value" cor the class its not going to work well.
Thanks,
Chris