Unicode 5.2.0 Officially Released, Adding 6,648 Characters
On October 1, 2009, the Unicode Consortium officially released the latest version of its character encoding standard, Unicode 5.2.0. Compared with Unicode 5.1.0, Unicode 5.2.0 adds 6,648 characters. The main additions are several new blocks: Bamum, Javanese, Lisu, Meetei Mayek (the Manipuri script), Samaritan, Tai Tham, Tai Viet, and CJK Unified Ideographs Extension C, among others. The release also extends existing coverage for Abkhaz, Canadian Aboriginal Syllabics, Coptic, Devanagari, Khamti Shan, Malayalam, and Myanmar. Notably, Unicode 5.2.0 now supports Vedic Extensions; Vedic Sanskrit is one of the principal languages of India's traditional religions.
Complete list of character ranges added in Unicode 5.2.0
0524..0525 [2] CYRILLIC CAPITAL LETTER PE WITH DESCENDER..CYRILLIC SMALL LETTER PE WITH DESCENDER
0800..082D [46] SAMARITAN LETTER ALAF..SAMARITAN MARK NEQUDAA
0830..083E [15] SAMARITAN PUNCTUATION NEQUDAA..SAMARITAN PUNCTUATION ANNAAU
0900 DEVANAGARI SIGN INVERTED CANDRABINDU
094E DEVANAGARI VOWEL SIGN PRISHTHAMATRA E
0955 DEVANAGARI VOWEL SIGN CANDRA LONG E
0979..097A [2] DEVANAGARI LETTER ZHA..DEVANAGARI LETTER HEAVY YA
09FB BENGALI GANDA MARK
0FD5..0FD8 [4] RIGHT-FACING SVASTI SIGN..LEFT-FACING SVASTI SIGN WITH DOTS
109A..109D [4] MYANMAR SIGN KHAMTI TONE-1..MYANMAR VOWEL SIGN AITON AI
115A..115E [5] HANGUL CHOSEONG KIYEOK-TIKEUT..HANGUL CHOSEONG TIKEUT-RIEUL
11A3..11A7 [5] HANGUL JUNGSEONG A-EU..HANGUL JUNGSEONG O-YAE
11FA..11FF [6] HANGUL JONGSEONG KIYEOK-NIEUN..HANGUL JONGSEONG SSANGNIEUN
1400 CANADIAN SYLLABICS HYPHEN
1677..167F [9] CANADIAN SYLLABICS WOODS-CREE THWEE..CANADIAN SYLLABICS BLACKFOOT W
18B0..18F5 [70] CANADIAN SYLLABICS OY..CANADIAN SYLLABICS CARRIER DENTAL S
19AA..19AB [2] NEW TAI LUE LETTER HIGH SUA..NEW TAI LUE LETTER LOW SUA
19DA NEW TAI LUE THAM DIGIT ONE
1A20..1A5E [63] TAI THAM LETTER HIGH KA..TAI THAM CONSONANT SIGN SA
1A60..1A7C [29] TAI THAM SIGN SAKOT..TAI THAM SIGN KHUEN-LUE KARAN
1A7F..1A89 [11] TAI THAM COMBINING CRYPTOGRAMMIC DOT..TAI THAM HORA DIGIT NINE
1A90..1A99 [10] TAI THAM THAM DIGIT ZERO..TAI THAM THAM DIGIT NINE
1AA0..1AAD [14] TAI THAM SIGN WIANG..TAI THAM SIGN CAANG
1CD0..1CF2 [35] VEDIC TONE KARSHANA..VEDIC SIGN ARDHAVISARGA
1DFD COMBINING ALMOST EQUAL TO BELOW
20B6..20B8 [3] LIVRE TOURNOIS SIGN..TENGE SIGN
2150..2152 [3] VULGAR FRACTION ONE SEVENTH..VULGAR FRACTION ONE TENTH
2189 VULGAR FRACTION ZERO THIRDS
23E8 DECIMAL EXPONENT SYMBOL
269E..269F [2] THREE LINES CONVERGING RIGHT..THREE LINES CONVERGING LEFT
26BD..26BF [3] SOCCER BALL..SQUARED KEY
26C4..26CD [10] SNOWMAN WITHOUT SNOW..DISABLED CAR
26CF..26E1 [19] PICK..RESTRICTED LEFT ENTRY-2
26E3 HEAVY CIRCLE WITH STROKE AND TWO DOTS ABOVE
26E8..26FF [24] BLACK CROSS ON SHIELD..WHITE FLAG WITH HORIZONTAL MIDDLE BLACK STRIPE
2757 HEAVY EXCLAMATION MARK SYMBOL
2B55..2B59 [5] HEAVY LARGE CIRCLE..HEAVY CIRCLED SALTIRE
2C70 LATIN CAPITAL LETTER TURNED ALPHA
2C7E..2C7F [2] LATIN CAPITAL LETTER S WITH SWASH TAIL..LATIN CAPITAL LETTER Z WITH SWASH TAIL
2CEB..2CF1 [7] COPTIC CAPITAL LETTER CRYPTOGRAMMIC SHEI..COPTIC COMBINING SPIRITUS LENIS
2E31 WORD SEPARATOR MIDDLE DOT
3244..324F [12] CIRCLED IDEOGRAPH QUESTION..CIRCLED NUMBER EIGHTY ON BLACK SQUARE
9FC4..9FCB [8] CJK UNIFIED IDEOGRAPH-9FC4..CJK UNIFIED IDEOGRAPH-9FCB
A4D0..A4FF [48] LISU LETTER BA..LISU PUNCTUATION FULL STOP
A6A0..A6F7 [88] BAMUM LETTER A..BAMUM QUESTION MARK
A830..A839 [10] NORTH INDIC FRACTION ONE QUARTER..NORTH INDIC QUANTITY MARK
A8E0..A8FB [28] COMBINING DEVANAGARI DIGIT ZERO..DEVANAGARI HEADSTROKE
A960..A97C [29] HANGUL CHOSEONG TIKEUT-MIEUM..HANGUL CHOSEONG SSANGYEORINHIEUH
A980..A9CD [78] JAVANESE SIGN PANYANGGA..JAVANESE TURNED PADA PISELEH
A9CF..A9D9 [11] JAVANESE PANGRANGKEP..JAVANESE DIGIT NINE
A9DE..A9DF [2] JAVANESE PADA TIRTA TUMETES..JAVANESE PADA ISEN-ISEN
AA60..AA7B [28] MYANMAR LETTER KHAMTI GA..MYANMAR SIGN PAO KAREN TONE
AA80..AAC2 [67] TAI VIET LETTER LOW KO..TAI VIET TONE MAI SONG
AADB..AADF [5] TAI VIET SYMBOL KON..TAI VIET SYMBOL KOI KOI
ABC0..ABED [46] MEETEI MAYEK LETTER KOK..MEETEI MAYEK APUN IYEK
ABF0..ABF9 [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NINE
D7B0..D7C6 [23] HANGUL JUNGSEONG O-YEO..HANGUL JUNGSEONG ARAEA-E
D7CB..D7FB [49] HANGUL JONGSEONG NIEUN-RIEUL..HANGUL JONGSEONG PHIEUPH-THIEUTH
FA6B..FA6D [3] CJK COMPATIBILITY IDEOGRAPH-FA6B..CJK COMPATIBILITY IDEOGRAPH-FA6D
10840..10855 [22] IMPERIAL ARAMAIC LETTER ALEPH..IMPERIAL ARAMAIC LETTER TAW
10857..1085F [9] IMPERIAL ARAMAIC SECTION SIGN..IMPERIAL ARAMAIC NUMBER TEN THOUSAND
1091A..1091B [2] PHOENICIAN NUMBER TWO..PHOENICIAN NUMBER THREE
10A60..10A7F [32] OLD SOUTH ARABIAN LETTER HE..OLD SOUTH ARABIAN NUMERIC INDICATOR
10B00..10B35 [54] AVESTAN LETTER A..AVESTAN LETTER HE
10B39..10B55 [29] AVESTAN ABBREVIATION MARK..INSCRIPTIONAL PARTHIAN LETTER TAW
10B58..10B72 [27] INSCRIPTIONAL PARTHIAN NUMBER ONE..INSCRIPTIONAL PAHLAVI LETTER TAW
10B78..10B7F [8] INSCRIPTIONAL PAHLAVI NUMBER ONE..INSCRIPTIONAL PAHLAVI NUMBER ONE THOUSAND
10C00..10C48 [73] OLD TURKIC LETTER ORKHON A..OLD TURKIC LETTER ORKHON BASH
10E60..10E7E [31] RUMI DIGIT ONE..RUMI FRACTION TWO THIRDS
11080..110BC [61] KAITHI SIGN CANDRABINDU..KAITHI ENUMERATION SIGN
110BD KAITHI NUMBER SIGN
110BE..110C1 [4] KAITHI SECTION MARK..KAITHI DOUBLE DANDA
13000..1342E [1071] EGYPTIAN HIEROGLYPH A001..EGYPTIAN HIEROGLYPH AA032
1F100..1F10A [11] DIGIT ZERO FULL STOP..DIGIT NINE COMMA
1F110..1F12E [31] PARENTHESIZED LATIN CAPITAL LETTER A..CIRCLED WZ
1F131 SQUARED LATIN CAPITAL LETTER B
1F13D SQUARED LATIN CAPITAL LETTER N
1F13F SQUARED LATIN CAPITAL LETTER P
1F142 SQUARED LATIN CAPITAL LETTER S
1F146 SQUARED LATIN CAPITAL LETTER W
1F14A..1F14E [5] SQUARED HV..SQUARED PPV
1F157 NEGATIVE CIRCLED LATIN CAPITAL LETTER H
1F15F NEGATIVE CIRCLED LATIN CAPITAL LETTER P
1F179 NEGATIVE SQUARED LATIN CAPITAL LETTER J
1F17B..1F17C [2] NEGATIVE SQUARED LATIN CAPITAL LETTER L..NEGATIVE SQUARED LATIN CAPITAL LETTER M
1F17F NEGATIVE SQUARED LATIN CAPITAL LETTER P
1F18A..1F18D [4] CROSSED NEGATIVE SQUARED LATIN CAPITAL LETTER P..NEGATIVE SQUARED SA
1F190 SQUARE DJ
1F200 SQUARE HIRAGANA HOKA
1F210..1F231 [34] SQUARED CJK UNIFIED IDEOGRAPH-624B..SQUARED CJK UNIFIED IDEOGRAPH-6253
1F240..1F248 [9] TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-672C..TORTOISE SHELL BRACKETED CJK UNIFIED IDEOGRAPH-6557
2A700..2B734 [4149] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B734
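The listing above follows a regular layout: either a single code point followed by its name, or START..END, a bracketed count, and the first and last names. As a minimal sketch (the function names and the regular expression are illustrative assumptions, not part of any Unicode tooling), the following Python parses such lines, checks each bracketed count against the size of its range, and totals the characters:

```python
import re

# Assumed line layout, taken from the listing above:
#   "START..END [COUNT] FIRST NAME..LAST NAME"  or  "CODEPOINT NAME"
LINE_RE = re.compile(
    r"^(?P<start>[0-9A-F]{4,6})(?:\.\.(?P<end>[0-9A-F]{4,6}))?"
    r"(?:\s+\[(?P<count>\d+)\])?\s+(?P<names>.+)$"
)


def parse_range_line(line):
    """Return (start, end, size) for one line of the listing."""
    m = LINE_RE.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognized line: {line!r}")
    start = int(m.group("start"), 16)
    # A single code point is treated as a range of length one.
    end = int(m.group("end") or m.group("start"), 16)
    size = end - start + 1
    declared = m.group("count")
    if declared is not None and int(declared) != size:
        raise ValueError(f"count mismatch in line: {line!r}")
    return start, end, size


def total_characters(lines):
    """Sum the range sizes over all non-empty lines."""
    return sum(parse_range_line(l)[2] for l in lines if l.strip())


# A few lines copied from the listing, used here as sample input.
sample = [
    "0524..0525 [2] CYRILLIC CAPITAL LETTER PE WITH DESCENDER..CYRILLIC SMALL LETTER PE WITH DESCENDER",
    "0900 DEVANAGARI SIGN INVERTED CANDRABINDU",
    "A4D0..A4FF [48] LISU LETTER BA..LISU PUNCTUATION FULL STOP",
    "2A700..2B734 [4149] CJK UNIFIED IDEOGRAPH-2A700..CJK UNIFIED IDEOGRAPH-2B734",
]
print(total_characters(sample))
```

Run over the full listing rather than this four-line sample, the sizes should add up to the 6,648 characters quoted in the announcement.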
A note from 字客网: the 字客网 dictionary channel now fully supports looking up and browsing every Unicode 5.2.0 character.
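About the Unicode Consortium
The Unicode Consortium (French: le Consortium Unicode) is a non-profit organization that coordinates the development of Unicode. Its aim is for Unicode to eventually replace existing character encodings, which work poorly in multilingual computing environments and are limited in the number of characters they can represent. It also specifies several Unicode Transformation Format (UTF) encodings. The success of Unicode opened a new era in computing, and it has been built into many newer technologies, such as XML, the Java programming language, and modern operating systems. Representatives of several national governments and of the major software vendors take part in the Consortium. The Consortium works actively with standards bodies, including the International Organization for Standardization (ISO), the International Electrotechnical Commission (IEC), the World Wide Web Consortium (W3C), the Internet Engineering Task Force (IETF), and the European Computer Manufacturers Association (ECMA).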
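To make the Unicode Transformation Formats mentioned above concrete, here is a small sketch in Python showing how one code point maps to different byte sequences under UTF-8, UTF-16, and UTF-32. The helper name utf_encodings is a hypothetical choice for illustration; the two code points are taken from the list of ranges added in Unicode 5.2.0.

```python
def utf_encodings(code_point):
    """Return the hex bytes of one code point in three UTF encodings."""
    ch = chr(code_point)
    return {
        "UTF-8": ch.encode("utf-8").hex(),
        # Big-endian variants are used so that no byte order mark is emitted.
        "UTF-16BE": ch.encode("utf-16-be").hex(),
        "UTF-32BE": ch.encode("utf-32-be").hex(),
    }


# U+A4D0 (LISU LETTER BA) lies in the Basic Multilingual Plane;
# U+2A700 (first code point of CJK Extension C) lies outside it,
# so UTF-16 needs a surrogate pair to represent it.
for cp in (0xA4D0, 0x2A700):
    print(f"U+{cp:04X}", utf_encodings(cp))
```

The same character takes three bytes in UTF-8 and two in UTF-16 when it sits in the Basic Multilingual Plane, but four bytes in all three formats once it moves to a supplementary plane.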