I always have troubles with UTF8.
The scenario: Mulitple sources (like xml, csv, own cms), and one database: Mysql. Information needs to be displayed on a website.
The encoding for xml: encoding="utf-8". So I have words(in dutch) from the xml like: "première", and "één".
Do I need to store these like they are? Special settings needed (collocation?) in the mysql database?
To display these on a webpage I use a utf-8 header using php or a meta tag saying it is utf8.
But the problem then is, while I have words (from other sources) that are already encoded to something like: "één" are not displaying correctly. The webpage then shows: "??n".
What is the best consistent way to avoid these problems?