Hi,
This is my problem, i wonder if anyone can help.
I am downloading remote html files from given URLs to my server and saving them on that machine. What i want to do is divide up the html file ie HTML123.html into its component parts ie HTML code only, content only and Javascript only. I then want to save this to these 3 respective text files ie CODE.txt, CONTENT.txt and JS.txt on the server so that i can perform different operations on each file.
Does anyone know of a good way to do this?
I know i would need to open the file and then read through looking for particular chars like <font> and then add that the closing </font> tag to a string for saving in the CODE.txt file, but i have no real idea how to go about the detail on this.
Thanks for reading,
Oaf