I am working on a little tool that looks for certains strings / tags in RTF files. Now, those RTF files are not "regular" rtfs. These are actually FrameMaker MIF files that were exported to RTF for translation (in TRADOS). The RTFs are in fact bilingual.
This poses problems when I want to display text from these RTFs using PHP. I can find the strings I'm looking for with PHP's pregmatch function (based on some FrameMaker tags) but I cannot display the results - a sentence in Portuguese or Russian for example - appropriately on a web-page, since I have no clue about character encoding / conversion for those files.
Example in RTF source:
The English string "Settings and Adjustments" is translated into Portuguese as "Programa\'e7\'f5\loch\f0 es e ajustes".
what you see in Word:
"Programações e ajustes", where
\'e7\'f5\loch\f0 = çõ
Does anyone know how to transform this into HTML entitities without using Word? Help much appreciated!
Thanks