Hi Srividya
I've had a lot of trouble with on-Roman characters and XML. I can't get UTF-8 to work as it's supposed to. What I do at the moment is do a search and replace for a whole long list of non-Roman characters in the XML doc and replace them with issue-free alternatives, e.g. Ä -- Aumlaut
The alternatives can't be letter combinations which could otherwise appear in normal words. When I output to HTML, I then replace these alternatives with HTML entities, e.g. Ä
I haven't yet found an elegant solution which would make all this unnecessary. I tried using utf8_encode(), but it did weird things and it didn't solve the problem either. I haven't had a problem with SOAP/WSDL, but then I haven't tried non-Roman characters yet. I use the Google API, but only for very simple stuff.
Let me know if you get this fixed, because I'd really like to know how to fix it.
Thanks
Norm