sorry, here is the platform:
- system = linux and mysql
- files = word, excel, powerpoint, pdf
i need to read the text from these files, i dont care about the formatting.
i tried this code:
$filename = "1.doc";
$fd = fopen($filename, "rb");
$contents = fread ($fd, filesize ($filename));
fclose ($fd);
echo "$contents <p>";
but this prints a load of junk text apart from my text as well! how can i remove that junk text, for instance?
when i try with pdf, i only see junk.
would really appreciate any ideas!
thanx/erick