does anybody know how to extract the hlinks from an html file?
preg_match_all("/<[ \r\n]?a[ \r\n]+?href[ \r\n]?=[ \r\n]?(?😢.+?)'|\"(.+?)\").?>/i",$htmlfile,$matches);
should give what you want as an array in $matches[1], assuming that the tags are HTML compliant and the URLs are quoted.