Stephen - I'm not sure exactly what you want your program to do, but generally speaking, you can extract text from any file (.html, .txt, or even an HTML file at an http:// address) by opening it with fopen, reading it into variable(s), closing the file, then doing what you need to do with the data you read. Does this help? If not, give more specifics about what you are trying to do.
best,
Eric Mueller