Forum Moderators: phranque

Message Too Old, No Replies

You decided not to migrate to UTF-8

         

sleidia

2:14 pm on Jun 20, 2007 (gmt 0)

10+ Year Member



What were your reasons for not doing so?

As for myself, I found that there is too much hassle to do the move :(
Plus, I found a bug with the trim() function of PHP on Japanese text: some kanjis got changed. So, I'm still using Shift-jis.

encyclo

12:40 am on Jun 21, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



The problem you mention is one of the common reasons. PHP doesn't handle UTF-8 well. I run a phpBB forum, and if you try UTF-8 the search function doesn't work with the multi-byte characters, nor does the text highlighting code. So I stick with ISO-8859-1 (in my case).

However, I think the biggest reasons for not moving to UTF-8 are the tools (read Windows text editors), a general lack of knowledge about character encoding, apathy and inertia.

bill

2:56 am on Jun 21, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Japanese can be quite problematic with UTF-8. Some sites are using it successfully but you've really got to know what you're doing in terms of the server setup. If you are on a shared server / virtual host it's doubtful they will have all the mbstring modules complied into PHP that you'll need.

Shift-JIS is a good encoding to use for a Japanese site.

sleidia

3:20 pm on Jun 21, 2007 (gmt 0)

10+ Year Member




Thanks a lot!

But really, theses answers come as a big surprise.
During all the past years, I've always used ISO-8859-1 for latin text and Shift-JIS for the Japanese ... and am still using them.

But recently, I've found many web articles pushing webmasters toward the UTF-8 trend because it's supposed to be "safer" and "more efficient" for multilingual sites. And you can display multiple languages on the same page.

I started to think that I was doing a very bad thing in sticking with the "good old encodings".

Where is the truth then? After some research, I came to the conclusion that UTF-8 is good for western sites. As for the Japanese sites, EUC-JP seems to be prefered to Shift-JIS by a huge majority ... and I still don't know why.

bill

2:11 am on Jun 22, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



We're having pretty much the same discussion in your other thread in the Asia/Pacific Forum: EUC-JP better than SHIFT-JIS? [webmasterworld.com]

UTF-8 works pretty well for Western character sets. It's the double byte languages like Japanese and Chinese that still have issues.