So im in a bit of confusion, see i used file_get_contents to get all the data from a website and managed to strip all the unwanted tags leaving only the html ones using strip_tags, so my question is what is the best way of making the script read all the contents of the page, select only the elements with html tags and putting those elements in an array?