we have a simple cms that clients seem to love pasting data into from Word documents. we end up with loads of horrible characters in the data that word must generate. any ideas of how to strip these (binary?) characters from a whole string either before it gets entered into the db or on pageload??
You could use regular expressions like preg_replace to remove anything thats not valid.
trouble is there could be 1000's of the little blighters!
is there not a php function that can do this?