• 字体
  • 字体家族
  • 字体公司
  • 字体设计师
  • 字体样张
  • 资讯
  • 帮助
字客网>资讯>详情

Standardized Variants—Part 1

时间:2012-12-26 13:44:57| 标准|浏览:140|来源:Adobe Blogs|作者:Dr. Ken Lunde
导语One problem that has been plaguing CJK Compatibility Ideographs i

One problem that has been plaguing CJK Compatibility Ideographs is the fact that they are adversely affected by normalization. Regardless of which of the four normalization forms is applied—NFC, NFD, NFKC, or NFKD—they are converted to their canonical equivalents, which are CJK Unified Ideographs. This is a problem, particularly for Japan, because 75 kanji in JIS X 0213:2004 kanji map to CJK Compatibility Ideograph code points. Furthermore, 57 of these 75 kanji correspond to Jinmei-yō Kanji (人名用漢字), meaning that they are used for personal names. The bottom-line problem with CJK Compatibility Ideographs is that any application of normalization, by any process, will permanently remove any distinctions between a CJK Compatibility Ideograph and its canonical equivalent. Not all processes are under one's direct control, meaning that it is impossible to guarantee that normalization will not be applied. My opinion is that it is prudent to assume that normalization will be applied, and that preemption is the best solution.

The background is that WG2 N4246 proposed 1,002 standardized variants that correspond to the 1,002 CJK Compatibility Ideographs, which are implemented as variation sequences. The first character in the variation sequence is a CJK Unified Ideograph that is also the canonical equivalent of the CJK Compatibility Ideograph, and the second character is one of the 16 Variation Selectors in the BMP (as opposed to the 240 in the Variation Selectors Supplement in Plane 14 that are reserved for Ideographic Variation Sequences).

The benefit of these standardized variants, of course, is that they are immune to the effects of normalization. In all fairness, I should note that an expert from Japan, Mr. Masahiro Sekiguchi, objected to these standardized variants, and the details of his objections, along with alternative solutions, are recorded in WG2 N4247. The reply from the UTC (Unicode Technical Committee) is recorded in WG2 N4309.

The current status of these 1,002 standardized variants is that they passed WG2 (short for ISO/IEC JTC1/SC2/WG2), and are likely to be incorporated in a future version of ISO/IEC 10646 (and thus Unicode). But, before it is a done deal, they must be included in a ballot. The Unicode Pipeline Table, which details proposed characters and their current standardization status, already lists them.

Stay tuned for Parts 2 and 3 of this article. Part 2 will compare these standardized variants with IVSes (Ideographic Variation Sequences), and Part 3 will detail font implementations.

0
  • 关注字客网公众号领取Z码
  • 关注字体先森公众号抽取SVIP
Standardized Variants—Part 1 网友点评
游客:文明上网,理性发言。 看不清?换一张
Standardized Variants—Part 1 最新评论
暂无相关评论