Hi to the community,
i'm using, with satsfaction, the
phpcrawl
classes to spider websites.
It is a very interesting tool and I can suggest to use it to everybody interested to do such a job.
What I need today is a class or a function
that help me to parse HTML input , so to
better indicize the content of sites I spider.
I tried with pear classes but I found it
too much heavy, and I would like to know if everybody coul suggest me any piece of code to parse simple elements of html page like <title><head><body> etc.
Any help would be very appreciated
Thank you
Max