I copied this function from somewhere on the web to convert some of those M$ Word-specific characters, which might help:
<?php
function filterText($text)
{
$search = array (
'&',
'<',
'>',
'"',
chr(212),
chr(213),
chr(210),
chr(211),
chr(209),
chr(208),
chr(201),
chr(145),
chr(146),
chr(147),
chr(148),
chr(151),
chr(150),
chr(133)
);
$replace = array (
'&',
'<',
'>',
'"',
'‘',
'’',
'“',
'”',
'–',
'—',
'…',
'‘',
'’',
'“',
'”',
'–',
'—',
'…'
);
return str_replace($search, $replace, $text);
}
Also, you'll likely want/need to let your database connection know you're talking UTF-8 now. Here's some info I collected a few years ago: http://www.ebookworm.us/utf8-in-php-and-mysql/