Forum Moderators: Robert Charlton & goodroi
Why does the "Active Posts" list persist in saying there are seven messages in this thread when in fact there are only three? The Google SEO forum listing has it right. I re-checked after posting this, and it's now correctly saying four. Curiouser and curiouser.
How does Google treat Unicode Languages in its Algo?
...Unicode is growing both in usage and in character coverage. We recently upgraded to the latest version of Unicode, version 5.2 [unicode.org] ...We're constantly improving our handling of existing characters... after extensive testing, we just recently turned on support for these and thousands of other characters; your searches will now also find these documents....
...We’ve long used Unicode as the internal format for all the text Google searches and process: any other encoding is first converted to Unicode. Version 6.1 just released with over 110,000 characters; soon we’ll be updating to that version and to Unicode’s locale data from CLDR 21 [cldr.unicode.org]....
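To make the "any other encoding is first converted to Unicode" point concrete, here is a minimal Python sketch. It is only an illustration of the general idea, not Google's actual pipeline; the helper name `to_unicode` and the sample strings are assumptions made up for this example.

```python
def to_unicode(raw: bytes, declared_encoding: str) -> str:
    """Decode bytes from a declared legacy encoding into a Unicode string.

    Hypothetical helper: undecodable bytes become U+FFFD instead of raising.
    """
    return raw.decode(declared_encoding, errors="replace")


# The same words as they might be stored by pages using different encodings.
french_latin1 = "café".encode("latin-1")
french_utf8 = "café".encode("utf-8")
arabic_cp1256 = "مرحبا".encode("cp1256")
chinese_gb2312 = "中文".encode("gb2312")

# The raw bytes differ between encodings...
assert french_latin1 != french_utf8

# ...but after conversion the text compares equal, so one index can serve both.
assert to_unicode(french_latin1, "latin-1") == to_unicode(french_utf8, "utf-8")

# Non-Latin scripts go through exactly the same step.
assert to_unicode(arabic_cp1256, "cp1256") == "مرحبا"
assert to_unicode(chinese_gb2312, "gb2312") == "中文"
```

Once everything is a Unicode string, the script a page is written in no longer matters for storage or comparison; how ranking then treats each language is a separate question that this sketch does not address.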
When I last reviewed it, the suggestion was to avoid mixing languages on a page if possible.
IME Google delivers slightly less "esoteric" answers in non-English searches (French at least), with less of the "we think that you really meant to search for" behaviour.
How does Google treat foreign languages in its algorithms?
How does it treat Latin-script languages compared to "Unicode" languages, e.g. Arabic or Chinese?
Do algorithm updates such as Panda affect all languages at once, or just Latin-script languages, or just English?
How does Googlebot differentiate between those different languages?
How come you do not hear about a Turkish, Arabic or Chinese Matt Cutts trying to tackle spam in their respective languages?
How does Google's algorithm treat a URL whose content is in both English and another language?