Unicode In 5 Minutes - Second Life Wiki
http://wiki.secondlife.com/wiki/Unicode_In_5_Minutes
Nerdiness.Tinyarro.ws - Shortest URLs on Earth
SHORT URL
tinyarro.ws: shortest urls in the WORLD. LINKS WILL NEVER BE THE FRICKEN' SAME! (isn't it silly the things we come up w/? (h/t @elCatch22)絵文字が開いてしまった「パンドラの箱」第1回--日本の携帯電話キャリアが選んだ道:コラム - CNET Japan
十徳ナイフ的テキストエンコードツール 便利便利Unicode table for you
A list of the many challenges of internationalization. My favourite code-breaker is Russian pluralization: if you are saying "X new messages", then "messages" is spelled one way if X ends in 1, another if it ends if 2,3, or 4, and a third way if it ends in 5,6,7,8,9 or 0. I cannot imagine how annoying this must be to code.
It’s not always as simple as 'Just use Unicode'
'Let us start by looking at four main variables in the space : language, country, culture and technology. Language. Let us take that one first. How hard can that be? Well, what list of languages do you want to target? Please don't say "all of them". There is no such thing as a definitive list of languages and even if there was, the list of languages supported in Unicode changes across various incarnations of Unicode. Oh, and there are languages with unbounded sets of "characters" such as Chinese which literally cannot be fully described in Unicode.'
Some examples of why internationalization and localization are not as simple as "just use Unicode".
Nice explanation of the complexity of language support.UnicodeとUTF-8の違いは? - おつあり
(via mi-ka-n)A List Apart: Articles: Accent Folding for Auto-Complete
In this article we will skip through a bit of history and theory, then illustrate a neat hack called accent-folding. Accent-folding has its limitations but it can help make some important yet overlooked user interactions work better.Google Japan Blog: 絵文字のユニコード符号化: 符号化提案用のオープンソースデータ
When you talk about “text,” you’re probably thinking of “characters and symbols on my computer screen.” But computers don’t deal in characters and symbols; they deal in bits and bytes. Every piece of text you’ve ever seen on a computer screen is actually stored in a particular character encoding. Very roughly speaking, the character encoding provides a mapping between the stuff you see on your screen and the stuff your computer actually stores in memory and on disk. There are many different character encodings, some optimized for particular languages like Russian or Chinese or English, and others that can be used for multiple languages. In reality, it’s more complicated than that. Many characters are common to multiple encodings, but each encoding may use a different sequence of bytes to actually store those characters in memory or on disk. So you can think of the character encoding as a kind of decryption key.
Everything you thought you knew about strings is wrong.
also about UTF-32, UTF-16, UTF-8Repeat after me: Unicode is not UTF-\d{1,2} « Enjoy, you’re doing it wrong!
Unicode is a standard, UTF-8/16/32 are encodings.Neven Mrgan's tumbl
A web app to enter Dingbats symbols
glyphboard for iphoneGlyphboard
for whiteapple
glyph icons that you can copy to emailsGlyphboard
for whiteapple
glyph icons that you can copy to emailsFalling foul of special characters « simon r jones
These days its pretty standard to require support for multiple languages and special characters on your website. But it’s still terribly easy to trip up and make mistakes, usually indicated by weird characters popping up across your web content. Here’s a few tips on how to sort out your character encoding.Falling foul of special characters « simon r jones
These days its pretty standard to require support for multiple languages and special characters on your website. But it’s still terribly easy to trip up and make mistakes, usually indicated by weird characters popping up across your web content. Here’s a few tips on how to sort out your character encoding.Nice Entity — "Find your character!"
Quick references of most common XHTML entities.
Quick References of Most Common Entities