I am trying to remove all the html and text from a page except the text between the <a> and </a> tag. If it looked like this:
<a href="url">click here</a>, all I want is "click here", and no other html or text. There will be hundreds of these on each page, thats why I need to do this. Pretty much all I need is a text file of all the words between the anchor tags, one on each line if possible. I have tried searching, but havent found anything that does what I need. Any help would be appreciated.
Thanks,
Joe