Current Allocation
The following tables give the statistics for currently unassigned
(reserved) code points in the Basic Multilingual Plane (BMP) in Unicode
4.0.
First, a few definitions.
- column
- A set of 16 Unicode code points that all have the same "div
16" value, e.g. U+2040..U+204F or U+1D120..U+1D12F. (The name
comes from the fact that they occupy a vertical column in the
character charts.)
- empty column
- A column where none of the code points are designated.
- partial column
- A column where some code points are designated and some are
reserved.
For practical reasons, the Unicode Technical Committee avoids
splitting character blocks across columns. For that reason, it is
important in new allocation to distinguish code points from these
sources:
- reserved code points not in blocks
- reserved code points in empty columns (within assigned blocks)
- reserved code points in partially allocated columns (within
assigned blocks)
- designated code points (includes assigned characters, private
use, surrogate code points, and noncharacters—all of which are
unavailable for assigning new characters)
Here is the breakdown in Unicode 4.0.
SUMMARY |
|
Reserved |
Designated |
|
Not in Blocks |
in Empty Columns |
in Partial Columns |
|
Code Points |
4,144 |
1,008 |
1,171 |
59,213 |
Columns |
259 |
63 |
n/a |
3,774 |
The following lists the code points in empty columns in more
detail. It is separated into two parts: empty columns in unassigned
blocks (or areas), and empty columns in assigned blocks. The (xx)
are the number of code points in empty columns in that block or
area.
Reserved Ranges Not in Blocks
0750..077F | 48 | | (General Scripts Area - Right to Left) |
07C0..08FF | 320 | | (General Scripts Area - Right to Left) |
1380..139F | 32 | | (General Scripts Area) |
18B0..18FF | 80 | | (General Scripts Area) |
1980..19DF | 96 | | (General Scripts Area) |
1A00..1CFF | 768 | | (General Scripts Area) |
1D80..1DFF | 128 | | (General Scripts Area) |
2C00..2E7F | 640 | | (Symbols Area) |
2FE0..2FEF | 16 | | (Symbols Area) |
31C0..31EF | 48 | | (CJK Phonetics and Symbols Area) |
A4D0..ABFF | 1840 | | (General Scripts Area) |
D7B0..D7FF | 80 | | (General Scripts Area) |
FE10..FE1F | 16 | | (Compatibility Area and Specials |
Empty Columns in Assigned Blocks
0240..024F | 16 | | [Latin Extended-B] |
0510..052F | 32 | | [Cyrillic Supplement] |
0C70..0C7F | 16 | | [Telugu] |
0CF0..0CFF | 16 | | [Kannada] |
0D70..0D7F | 16 | | [Malayalam] |
0DE0..0DEF | 16 | | [Sinhala] |
0E60..0E7F | 32 | | [Thai] |
0EE0..0EFF | 32 | | [Lao] |
0FD0..0FFF | 48 | | [Tibetan] |
1060..109F | 64 | | [Myanmar] |
1D70..1D7F | 16 | | [Phonetic Extensions] |
2090..209F | 16 | | [Superscripts and Subscripts] |
20C0..20CF | 16 | | [Currency Symbols] |
20F0..20FF | 16 | | [Combining Marks for Symbols] |
23E0..23FF | 32 | | [Miscellaneous Technical] |
2430..243F | 16 | | [Control Pictures] |
2450..245F | 16 | | [Optical Character Recognition] |
26B0..26FF | 80 | | [Miscellaneous Symbols] |
27C0..27CF | 16 | | [Miscellaneous Mathematical Symbols-A] |
2B10..2BFF | 240 | | [Miscellaneous Symbols and Arrows] |
9FB0..9FFF | 80 | | [CJK Unified Ideographs] |
FA70..FAFF | 144 | | [CJK Compatibility Ideographs] |
FBC0..FBCF | 16 | | [Arabic Presentation Forms-A] |
FD40..FD4F | 16 | | [Arabic Presentation Forms-A] |
Last updated:
- Wed Sep 17 05:21:19 2003
|