Forum Moderators: phranque
Obviously, the best solution would be to eliminate the "illegal" characters, but that's not a very easy fix (for various reasons).
So first question- are there newer RFCs that allow those characters (perhaps in relation to foreign character domains)? Second, assuming they are valid, how to get Firefox to recognize that they're valid?
Google doesn't seem to have a problem indexing those pages, so my feeling is it's not worth an overhaul at this point.
Google doesn't seem to have a problem indexing those pages
Now that's news to me. That's the opposite of some of my research.
It comes down to the resolver being used by the operating system. A malformed label portion of the domain name will often resolve on Windows but not on most, if not all, others. Invalid examples would be any of the following:
http://-www.example.com/
http://www-.example.com/
http://-www-.example.com/
I have run tests on malformed urls using the exact same browsers and versions on both Windows and Unix (CENTOS5, RHEL4/5) and although the example urls listed above would open in a browser from Windows I had difficulty using other tools to get the resource from that operating system. And on the Unix flavors listed here -- it flat out failed, could not reach the resource period. So, although your client may find some success with the malformed urls you should let them know they are losing traffic because of them. Best to follow standards in this situation.
The part that is catching my interest here is that your client's site has been indexed by Google. I have done a lot of testing and research and never found indexed content on the sites I was analyzing. Perhaps it's time I revive some old projects ...