Commit a6ec955

Support Unicode 16.0.0
1 parent 797b1f0 commit a6ec955

22 files changed: +9040 -2016 lines changed

README.md

+1 -1
@@ -3,7 +3,7 @@ cpp-unicodelib
 
 [![](https://github.com/yhirose/cpp-unicodelib/workflows/CMake/badge.svg)](https://github.com/yhirose/cpp-unicodelib/actions)
 
-A C++17 single-file header-only Unicode library. (Unicode 15.1)
+A C++17 single-file header-only Unicode library. (Unicode 16.0.0)
 
 API
 ---

UCD/Blocks.txt

+15 -4
@@ -1,7 +1,8 @@
-# Blocks-15.1.0.txt
-# Date: 2023-07-28, 15:47:20 GMT
-# © 2023 Unicode®, Inc.
-# For terms of use, see https://www.unicode.org/terms_of_use.html
+# Blocks-16.0.0.txt
+# Date: 2024-02-02
+# © 2024 Unicode®, Inc.
+# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
+# For terms of use and license, see https://www.unicode.org/terms_of_use.html
 #
 # Unicode Character Database
 # For documentation, see https://www.unicode.org/reports/tr44/
@@ -217,6 +218,7 @@ FFF0..FFFF; Specials
 10500..1052F; Elbasan
 10530..1056F; Caucasian Albanian
 10570..105BF; Vithkuqi
+105C0..105FF; Todhri
 10600..1077F; Linear A
 10780..107BF; Latin Extended-F
 10800..1083F; Cypriot Syllabary
@@ -239,6 +241,7 @@ FFF0..FFFF; Specials
 10C00..10C4F; Old Turkic
 10C80..10CFF; Old Hungarian
 10D00..10D3F; Hanifi Rohingya
+10D40..10D8F; Garay
 10E60..10E7F; Rumi Numeral Symbols
 10E80..10EBF; Yezidi
 10EC0..10EFF; Arabic Extended-C
@@ -258,12 +261,14 @@ FFF0..FFFF; Specials
 11280..112AF; Multani
 112B0..112FF; Khudawadi
 11300..1137F; Grantha
+11380..113FF; Tulu-Tigalari
 11400..1147F; Newa
 11480..114DF; Tirhuta
 11580..115FF; Siddham
 11600..1165F; Modi
 11660..1167F; Mongolian Supplement
 11680..116CF; Takri
+116D0..116FF; Myanmar Extended-C
 11700..1174F; Ahom
 11800..1184F; Dogra
 118A0..118FF; Warang Citi
@@ -274,6 +279,7 @@ FFF0..FFFF; Specials
 11AB0..11ABF; Unified Canadian Aboriginal Syllabics Extended-A
 11AC0..11AFF; Pau Cin Hau
 11B00..11B5F; Devanagari Extended-A
+11BC0..11BFF; Sunuwar
 11C00..11C6F; Bhaiksuki
 11C70..11CBF; Marchen
 11D00..11D5F; Masaram Gondi
@@ -288,12 +294,15 @@ FFF0..FFFF; Specials
 12F90..12FFF; Cypro-Minoan
 13000..1342F; Egyptian Hieroglyphs
 13430..1345F; Egyptian Hieroglyph Format Controls
+13460..143FF; Egyptian Hieroglyphs Extended-A
 14400..1467F; Anatolian Hieroglyphs
+16100..1613F; Gurung Khema
 16800..16A3F; Bamum Supplement
 16A40..16A6F; Mro
 16A70..16ACF; Tangsa
 16AD0..16AFF; Bassa Vah
 16B00..16B8F; Pahawh Hmong
+16D40..16D7F; Kirat Rai
 16E40..16E9F; Medefaidrin
 16F00..16F9F; Miao
 16FE0..16FFF; Ideographic Symbols and Punctuation
@@ -308,6 +317,7 @@ FFF0..FFFF; Specials
 1B170..1B2FF; Nushu
 1BC00..1BC9F; Duployan
 1BCA0..1BCAF; Shorthand Format Controls
+1CC00..1CEBF; Symbols for Legacy Computing Supplement
 1CF00..1CFCF; Znamenny Musical Notation
 1D000..1D0FF; Byzantine Musical Symbols
 1D100..1D1FF; Musical Symbols
@@ -325,6 +335,7 @@ FFF0..FFFF; Specials
 1E290..1E2BF; Toto
 1E2C0..1E2FF; Wancho
 1E4D0..1E4FF; Nag Mundari
+1E5D0..1E5FF; Ol Onal
 1E7E0..1E7FF; Ethiopic Extended-B
 1E800..1E8DF; Mende Kikakui
 1E900..1E95F; Adlam
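
Reviewer note: every data line added to Blocks.txt above follows the `START..END; Block Name` shape, with `#` starting a comment. As a rough way to sanity-check the new ranges, a line of that shape could be parsed along the lines below. This is only a sketch: `BlockRange`, `trim`, `parse_hex`, and `parse_block_line` are hypothetical names invented for illustration, and nothing here is taken from cpp-unicodelib or its table generator.

```cpp
#include <cstddef>
#include <optional>
#include <string>
#include <string_view>

// Hypothetical record for one Blocks.txt entry (not a cpp-unicodelib type).
struct BlockRange {
  char32_t first;
  char32_t last;
  std::string name;
};

// Strip leading and trailing spaces, tabs, and carriage returns.
inline std::string_view trim(std::string_view s) {
  const auto b = s.find_first_not_of(" \t\r");
  if (b == std::string_view::npos) return {};
  const auto e = s.find_last_not_of(" \t\r");
  return s.substr(b, e - b + 1);
}

// Parse 1 to 6 hexadecimal digits into a code point value.
inline std::optional<char32_t> parse_hex(std::string_view s) {
  if (s.empty() || s.size() > 6) return std::nullopt;
  char32_t value = 0;
  for (char c : s) {
    char32_t digit = 0;
    if (c >= '0' && c <= '9') digit = static_cast<char32_t>(c - '0');
    else if (c >= 'A' && c <= 'F') digit = static_cast<char32_t>(c - 'A' + 10);
    else if (c >= 'a' && c <= 'f') digit = static_cast<char32_t>(c - 'a' + 10);
    else return std::nullopt;
    value = (value << 4) | digit;
  }
  return value;
}

// Parse "START..END; Block Name". Returns nullopt for comment lines,
// blank lines, and anything that does not match that shape.
inline std::optional<BlockRange> parse_block_line(std::string_view line) {
  line = trim(line.substr(0, line.find('#')));  // drop trailing comment
  if (line.empty()) return std::nullopt;

  const auto semi = line.find(';');
  const auto dots = line.find("..");
  if (semi == std::string_view::npos || dots == std::string_view::npos ||
      dots > semi) {
    return std::nullopt;
  }

  const auto first = parse_hex(trim(line.substr(0, dots)));
  const auto last = parse_hex(trim(line.substr(dots + 2, semi - dots - 2)));
  const auto name = trim(line.substr(semi + 1));
  if (!first || !last || *first > *last || name.empty()) return std::nullopt;

  return BlockRange{*first, *last, std::string(name)};
}
```

For example, `parse_block_line("105C0..105FF; Todhri")` should yield the range U+105C0 to U+105FF with the name `Todhri`, while the `# Blocks-16.0.0.txt` header line yields no entry.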

UCD/CaseFolding.txt

+31 -4
@@ -1,8 +1,8 @@
-# CaseFolding-15.1.0.txt
-# Date: 2023-05-12, 21:53:10 GMT
-# © 2023 Unicode®, Inc.
+# CaseFolding-16.0.0.txt
+# Date: 2024-04-30, 21:48:11 GMT
+# © 2024 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
-# For terms of use, see https://www.unicode.org/terms_of_use.html
+# For terms of use and license, see https://www.unicode.org/terms_of_use.html
 #
 # Unicode Character Database
 # For documentation, see https://www.unicode.org/reports/tr44/
@@ -603,6 +603,7 @@
 1C86; C; 044A; # CYRILLIC SMALL LETTER TALL HARD SIGN
 1C87; C; 0463; # CYRILLIC SMALL LETTER TALL YAT
 1C88; C; A64B; # CYRILLIC SMALL LETTER UNBLENDED UK
+1C89; C; 1C8A; # CYRILLIC CAPITAL LETTER TJE
 1C90; C; 10D0; # GEORGIAN MTAVRULI CAPITAL LETTER AN
 1C91; C; 10D1; # GEORGIAN MTAVRULI CAPITAL LETTER BAN
 1C92; C; 10D2; # GEORGIAN MTAVRULI CAPITAL LETTER GAN
@@ -1240,9 +1241,13 @@ A7C5; C; 0282; # LATIN CAPITAL LETTER S WITH HOOK
 A7C6; C; 1D8E; # LATIN CAPITAL LETTER Z WITH PALATAL HOOK
 A7C7; C; A7C8; # LATIN CAPITAL LETTER D WITH SHORT STROKE OVERLAY
 A7C9; C; A7CA; # LATIN CAPITAL LETTER S WITH SHORT STROKE OVERLAY
+A7CB; C; 0264; # LATIN CAPITAL LETTER RAMS HORN
+A7CC; C; A7CD; # LATIN CAPITAL LETTER S WITH DIAGONAL STROKE
 A7D0; C; A7D1; # LATIN CAPITAL LETTER CLOSED INSULAR G
 A7D6; C; A7D7; # LATIN CAPITAL LETTER MIDDLE SCOTS S
 A7D8; C; A7D9; # LATIN CAPITAL LETTER SIGMOID S
+A7DA; C; A7DB; # LATIN CAPITAL LETTER LAMBDA
+A7DC; C; 019B; # LATIN CAPITAL LETTER LAMBDA WITH STROKE
 A7F5; C; A7F6; # LATIN CAPITAL LETTER REVERSED HALF H
 AB70; C; 13A0; # CHEROKEE SMALL LETTER A
 AB71; C; 13A1; # CHEROKEE SMALL LETTER E
@@ -1525,6 +1530,28 @@ FF3A; C; FF5A; # FULLWIDTH LATIN CAPITAL LETTER Z
 10CB0; C; 10CF0; # OLD HUNGARIAN CAPITAL LETTER EZS
 10CB1; C; 10CF1; # OLD HUNGARIAN CAPITAL LETTER ENT-SHAPED SIGN
 10CB2; C; 10CF2; # OLD HUNGARIAN CAPITAL LETTER US
+10D50; C; 10D70; # GARAY CAPITAL LETTER A
+10D51; C; 10D71; # GARAY CAPITAL LETTER CA
+10D52; C; 10D72; # GARAY CAPITAL LETTER MA
+10D53; C; 10D73; # GARAY CAPITAL LETTER KA
+10D54; C; 10D74; # GARAY CAPITAL LETTER BA
+10D55; C; 10D75; # GARAY CAPITAL LETTER JA
+10D56; C; 10D76; # GARAY CAPITAL LETTER SA
+10D57; C; 10D77; # GARAY CAPITAL LETTER WA
+10D58; C; 10D78; # GARAY CAPITAL LETTER LA
+10D59; C; 10D79; # GARAY CAPITAL LETTER GA
+10D5A; C; 10D7A; # GARAY CAPITAL LETTER DA
+10D5B; C; 10D7B; # GARAY CAPITAL LETTER XA
+10D5C; C; 10D7C; # GARAY CAPITAL LETTER YA
+10D5D; C; 10D7D; # GARAY CAPITAL LETTER TA
+10D5E; C; 10D7E; # GARAY CAPITAL LETTER RA
+10D5F; C; 10D7F; # GARAY CAPITAL LETTER NYA
+10D60; C; 10D80; # GARAY CAPITAL LETTER FA
+10D61; C; 10D81; # GARAY CAPITAL LETTER NA
+10D62; C; 10D82; # GARAY CAPITAL LETTER PA
+10D63; C; 10D83; # GARAY CAPITAL LETTER HA
+10D64; C; 10D84; # GARAY CAPITAL LETTER OLD KA
+10D65; C; 10D85; # GARAY CAPITAL LETTER OLD NA
 118A0; C; 118C0; # WARANG CITI CAPITAL LETTER NGAA
 118A1; C; 118C1; # WARANG CITI CAPITAL LETTER A
 118A2; C; 118C2; # WARANG CITI CAPITAL LETTER WI
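
Reviewer note: the CaseFolding.txt lines added above all use the `code; status; mapping; # name` layout, where the status is one of `C`, `F`, `S`, or `T` and the mapping is one or more space-separated code points. A minimal sketch of reading one such line follows, assuming only that layout. `CaseFoldEntry`, `strip`, `hex_to_code_point`, and `parse_case_folding_line` are hypothetical names chosen for illustration and do not describe how cpp-unicodelib actually ingests this file.

```cpp
#include <cstddef>
#include <optional>
#include <string_view>
#include <vector>

// Hypothetical record for one CaseFolding.txt entry (not a cpp-unicodelib type).
struct CaseFoldEntry {
  char32_t code;
  char status;                    // 'C', 'F', 'S', or 'T'
  std::vector<char32_t> mapping;  // one code point for C/S/T, possibly more for F
};

// Strip leading and trailing spaces, tabs, and carriage returns.
inline std::string_view strip(std::string_view s) {
  const auto b = s.find_first_not_of(" \t\r");
  if (b == std::string_view::npos) return {};
  const auto e = s.find_last_not_of(" \t\r");
  return s.substr(b, e - b + 1);
}

// Parse 1 to 6 hexadecimal digits into a code point value.
inline std::optional<char32_t> hex_to_code_point(std::string_view s) {
  if (s.empty() || s.size() > 6) return std::nullopt;
  char32_t value = 0;
  for (char c : s) {
    char32_t digit = 0;
    if (c >= '0' && c <= '9') digit = static_cast<char32_t>(c - '0');
    else if (c >= 'A' && c <= 'F') digit = static_cast<char32_t>(c - 'A' + 10);
    else if (c >= 'a' && c <= 'f') digit = static_cast<char32_t>(c - 'a' + 10);
    else return std::nullopt;
    value = (value << 4) | digit;
  }
  return value;
}

// Parse "code; status; mapping; # name". Returns nullopt for comment lines,
// blank lines, and anything that does not match that layout.
inline std::optional<CaseFoldEntry> parse_case_folding_line(std::string_view line) {
  constexpr auto npos = std::string_view::npos;
  line = line.substr(0, line.find('#'));  // drop trailing comment
  if (strip(line).empty()) return std::nullopt;

  const auto s1 = line.find(';');
  if (s1 == npos) return std::nullopt;
  const auto s2 = line.find(';', s1 + 1);
  if (s2 == npos) return std::nullopt;
  const auto s3 = line.find(';', s2 + 1);
  if (s3 == npos) return std::nullopt;

  const auto code = hex_to_code_point(strip(line.substr(0, s1)));
  const auto status = strip(line.substr(s1 + 1, s2 - s1 - 1));
  const auto map_field = strip(line.substr(s2 + 1, s3 - s2 - 1));
  if (!code || status.size() != 1 ||
      std::string_view("CFST").find(status[0]) == npos) {
    return std::nullopt;
  }

  // Split the mapping field on spaces; each token is one code point.
  std::vector<char32_t> mapping;
  std::size_t pos = 0;
  while (pos < map_field.size()) {
    auto end = map_field.find(' ', pos);
    if (end == npos) end = map_field.size();
    if (end > pos) {
      const auto cp = hex_to_code_point(map_field.substr(pos, end - pos));
      if (!cp) return std::nullopt;
      mapping.push_back(*cp);
    }
    pos = end + 1;
  }
  if (mapping.empty()) return std::nullopt;

  return CaseFoldEntry{*code, status[0], std::move(mapping)};
}
```

Given the new line `1C89; C; 1C8A; # CYRILLIC CAPITAL LETTER TJE`, this sketch should produce an entry for U+1C89 with status `C` and a single mapped code point U+1C8A. Keeping the mapping as a vector leaves room for the multi-code-point `F` entries found elsewhere in the file.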

UCD/CompositionExclusions.txt

+5 -4
@@ -1,7 +1,8 @@
-# CompositionExclusions-15.1.0.txt
-# Date: 2023-01-05
-# © 2023 Unicode®, Inc.
-# For terms of use, see https://www.unicode.org/terms_of_use.html
+# CompositionExclusions-16.0.0.txt
+# Date: 2024-02-02
+# © 2024 Unicode®, Inc.
+# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
+# For terms of use and license, see https://www.unicode.org/terms_of_use.html
 #
 # Unicode Character Database
 # For documentation, see https://www.unicode.org/reports/tr44/
