Forum Moderators: Robert Charlton & goodroi
What can i use to copy Chinese characters and paste them in Chinese?In this day and age you should be able to use anything--yes, even on Windows--but you have to tell the text editor (or whatever you're using) what encoding to use. Any self-respecting text editor should also have a way to reinterpret text, so if you find yourself staring at
Download the Chinese character set (simplified Chinese) and the characters will display properlyThe issue here is not with the ability to display characters. If your device is correctly reading UTF-8 but you happen not to have any Chinese font installed, you will instead see some type of generic I-can’t-display-that for each individual character. That's different from the string of  and à and so on that you get if the Chinese has been interpreted as Latin-1 (or analogous gibberish if your device defaults to a different one-byte encoding).
the webmasterworld web server does not render Chinese!Or anything else outside the Windows-Latin-1 range. But if you have a text editor that does HTML preview, just paste it in and Preview.
the url changes to the roman charactersPlease paste in a few examples of what you mean. I seriously doubt your browser, or even your server, is brilliant enough to transliterate from Chinese to Roman*, so it still has to be a question of reinterpreting from one encoding to another.
US version of ultrasonic scalpelOK then.
No matter what ediitor i use it remains in string 2 format.Then you'll need to route it via a bit of ajax or php. Or simply javascript, if the site already requires scripting to function; otherwise you may lose some users. In javascript--which is what I use for my local decoding function--the command is
decodeURIComponent(argument)
Someone hereabouts will know the equivalent command in php or whatever language you end up using. /(%[cd][\da-f]%[89ab][\da-f]|%e[\da-f]%[89ab][\da-f]%[89ab][\da-f]|%f[\da-f]%[89ab][\da-f]%[89ab][\da-f]%[89ab][\da-f])/iWhew. I think that's meant to cover all possible permutations of the percent-encoding of multi-byte characters. It's a heck of a lot easier to read in RegEx engines that have the shorthand \h = [a-f0-9] but hardly any of them do.