Hi,
I have a collection of webpages that I'd like to parse. They are in a similar format to the page at:
http://www.leeds.ac.uk/students/ugmodules/comp1400.htm
What I'd like to do is identify, for example, which module the page is for, which semester the module is taught in, and the module objectives, and store each of these values in a separate variable.
Any ideas how I'd go about this?
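To make the goal concrete, here is a rough Python sketch of the kind of result I'm after, using only the standard library's html.parser. The HTML fragment, the "Semester" and "Objectives" labels, and the helper names (TextCollector, text_after, parse_module) are all made up for illustration. I'm assuming each value appears as the text right after its label, which may well not match the real pages:

```python
from html.parser import HTMLParser

# Hypothetical fragment: the real module pages may be structured differently.
SAMPLE_HTML = """
<html><body>
<h1>COMP1400 Example Module Title</h1>
<p><b>Semester:</b> 1</p>
<h2>Objectives</h2>
<p>Understand the example topic.</p>
</body></html>
"""


class TextCollector(HTMLParser):
    """Collect the non-empty text chunks of a page, in document order."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.chunks.append(text)


def text_after(chunks, label):
    """Return the chunk following the first chunk that starts with `label`, or None."""
    for i, chunk in enumerate(chunks[:-1]):
        if chunk.startswith(label):
            return chunks[i + 1]
    return None


def parse_module(html):
    """Pull the three values out of one page (assumed layout, see above)."""
    collector = TextCollector()
    collector.feed(html)
    collector.close()
    chunks = collector.chunks
    # Assumption: the first piece of text on the page is the module heading.
    module = chunks[0] if chunks else None
    semester = text_after(chunks, "Semester")
    objectives = text_after(chunks, "Objectives")
    return module, semester, objectives


module, semester, objectives = parse_module(SAMPLE_HTML)
```

Ideally I could then loop over all the saved pages and call something like parse_module on each one, but I'm not sure a label-based lookup like this would hold up across the whole collection.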
Thanks
Paul