Hello,
I'm creating a word filtering script and I have a set of "bad words" that I'm currently filtering. The problem I have is that some users are using instead non-ascii characters to simulate these bad words. So I'm in need of a way to filter non-ascii characters to avoid this. Maybe just allow a-z, 0-9 and characters like @, ?, #, %,!, &,~, for example.
How can I accomplish this?
Thanks!