Well, i am looking for an open source software that would be able to extract datas from a html file (the format of the file is varying) and to store the datas in a database. I would like to find a tool that would allow me : to specify the way how to identify the data from the formatting and to specify the way how to store the datas extracted. If you know a good tool (in PHP or any other languages) able to do this thanks in advance to drop me a note.
David