Chromium Code Reviews
chromiumcodereview-hr@appspot.gserviceaccount.com (chromiumcodereview-hr) | Please choose your nickname with Settings | Help | Chromium Project | Gerrit Changes | Sign out
(7)

Unified Diff: source/data/translit/be_be_Latn_BGN.txt

Issue 2440913002: Update ICU to 58.1
Patch Set: Created 4 years, 2 months ago
Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.
Jump to:
View side-by-side diff with in-line comments
Download patch
« no previous file with comments | « source/data/translit/az_Upper.txt ('k') | source/data/translit/bg_bg_Latn_BGN.txt » ('j') | no next file with comments »
Expand Comments ('e') | Collapse Comments ('c') | Show Comments Hide Comments ('s')
Index: source/data/translit/be_be_Latn_BGN.txt
diff --git a/source/data/translit/Belarusian_Latin_BGN.txt b/source/data/translit/be_be_Latn_BGN.txt
similarity index 51%
rename from source/data/translit/Belarusian_Latin_BGN.txt
rename to source/data/translit/be_be_Latn_BGN.txt
index 057d771f026c483b85e04492537b3944727d5ce2..c5b9082ea09b30513f880440752bd138794121f7 100644
--- a/source/data/translit/Belarusian_Latin_BGN.txt
+++ b/source/data/translit/be_be_Latn_BGN.txt
@@ -1,14 +1,42 @@
-# ***************************************************************************
-# *
-# * Copyright (C) 2004-2015, International Business Machines
-# * Corporation; Unicode, Inc.; and others. All Rights Reserved.
-# *
-# ***************************************************************************
-# File: Belarusian_Latin_BGN.txt
-# Generated from CLDR
+# © 2016 and later: Unicode, Inc. and others.
+# License & terms of use: http://www.unicode.org/copyright.html#License
+#
+# File: be_be_Latn_BGN.txt
+# Generated from CLDR
+#
+
+#
+########################################################################
+# BGN/PCGN 1979 System
+#
+# The BGN/PCGN system for Belarusian (formerly Byelorussian) was
+# designed for use in romanizing names written in the Belarusian
+# Cyrillic alphabet. The Belarusian alphabet contains three
+# letters not present in the Russian alphabet: Іі, Ўў, ’.
+# One obsolete letter Ґґ is included.
+#
+# The Belarusian Alphabet as defined by the BGN (Page 23):
+#
+# АБВГДЕЁЖЗІЙКЛМНОПРСТУЎФХЦЧШЫЬЭЮЯҐ
+# абвгдеёжзійклмнопрстуўфхцчшыьэюя’ґ
+#
+# Originally prepared by Michael Everson <everson@evertype.com>
+########################################################################
+#
+# MINIMAL FILTER: Belarusian-Latin
#
:: [АБВГДЕЁЖЗІЙКЛМНОПРСТУЎФХЦЧШЩЪЫЬЭЮЯҐабвгдеёжзійклмнопрстуўфхцчшщъыьэюя’ґ] ;
:: NFD (NFC) ;
+#
+#
+########################################################################
+#
+########################################################################
+#
+# Define All Transformation Variables
+#
+########################################################################
+#
$prime = ʹ ;
$doublePrime = ʺ ;
$upperConsonants = [БВГДЖЗЙКЛМНПРСТЎФХЦЧШЬҐ] ;
@@ -18,7 +46,22 @@ $upperVowels = [АЕЁІОУЫЭЮЯ] ;
$lowerVowels = [аеёіоуыэюя] ;
$vowels = [$upperVowels $lowerVowels] ;
$lower = [$lowerConsonants $lowerVowels] ;
+#
+#
+# Use this $wordBoundary until bug 2034 is fixed in ICU:
+# http://bugs.icu-project.org/cgi-bin/icu-bugs/transliterate?id=2034;expression=boundary;user=guest
+#
$wordBoundary = [^[:L:][:M:][:N:]] ;
+#
+#
+########################################################################
+#
+########################################################################
+#
+# Start of Alphabetic Transformations
+#
+########################################################################
+#
А → A ; # CYRILLIC CAPITAL LETTER A
а → a ; # CYRILLIC SMALL LETTER A
Б → B ; # CYRILLIC CAPITAL LETTER BE
@@ -38,20 +81,56 @@ $wordBoundary = [^[:L:][:M:][:N:]] ;
Ж} $lower → Zh ; # CYRILLIC CAPITAL LETTER ZHE
Ж → ZH ; # CYRILLIC CAPITAL LETTER ZHE
ж → zh ; # CYRILLIC SMALL LETTER ZHE
+#
+#
+########################################################################
+#
+# BGN Page 23 Note 1
+#
+# The character sequences зг, цг, сг, тс, and кг may be romanized z·h,
+# k·h, s·h, t·s and ts·h in order to differentiate those romanizations
+# from the digraphs zh, kh, sh, ts, and the letter sequence tsh, which
+# are used to render characters ж, х, ш, ц, and the character sequence тш.
+#
+########################################################################
+#
ЗГ → Z·H ; # CYRILLIC CAPITAL LETTER ZE
Зг → Z·h ; # CYRILLIC CAPITAL LETTER ZE
зг → z·h ; # CYRILLIC SMALL LETTER ZE
З → Z ; # CYRILLIC CAPITAL LETTER ZE
з → z ; # CYRILLIC SMALL LETTER ZE
+#
+#
+########################################################################
+#
+# End Rule 1
+#
+########################################################################
І → I ; # CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I
і → i ; # CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
Й → Y ; # CYRILLIC CAPITAL LETTER I
й → y ; # CYRILLIC SMALL LETTER I
+#
+########################################################################
+#
+# BGN Page 23 Rule 1
+#
+# кг becomes k·h
+#
+########################################################################
+#
КГ → K·H ; # CYRILLIC CAPITAL LETTER KA
Кг → K·h ; # CYRILLIC CAPITAL LETTER KA
кг → k·h ; # CYRILLIC SMALL LETTER KA
К → K ; # CYRILLIC CAPITAL LETTER KA
к → k ; # CYRILLIC SMALL LETTER KA
+#
+#
+########################################################################
+#
+# End Rule 1
+#
+########################################################################
Л → L ; # CYRILLIC CAPITAL LETTER EL
л → l ; # CYRILLIC SMALL LETTER EL
М → M ; # CYRILLIC CAPITAL LETTER EM
@@ -64,16 +143,48 @@ $wordBoundary = [^[:L:][:M:][:N:]] ;
п → p ; # CYRILLIC SMALL LETTER PE
Р → R ; # CYRILLIC CAPITAL LETTER ER
р → r ; # CYRILLIC SMALL LETTER ER
+#
+########################################################################
+#
+# BGN Page 23 Rule 1
+#
+# сг becomes s·h
+#
+########################################################################
+#
СГ → S·H ; # CYRILLIC CAPITAL LETTER ES
Сг → S·h ; # CYRILLIC CAPITAL LETTER ES
сг → s·h ; # CYRILLIC SMALL LETTER ES
С → S ; # CYRILLIC CAPITAL LETTER ES
с → s ; # CYRILLIC SMALL LETTER ES
+#
+#
+########################################################################
+#
+# End Rule 1
+#
+########################################################################
+#
+########################################################################
+#
+# BGN Page 23 Rule 1
+#
+# тс becomes t·s
+#
+########################################################################
+#
ТС → T·S ; # CYRILLIC CAPITAL LETTER TE
Тс → T·s ; # CYRILLIC CAPITAL LETTER TE
тс → t·s ; # CYRILLIC SMALL LETTER TE
Т → T ; # CYRILLIC CAPITAL LETTER TE
т → t ; # CYRILLIC SMALL LETTER TE
+#
+#
+########################################################################
+#
+# End Rule 1
+#
+########################################################################
У → U ; # CYRILLIC CAPITAL LETTER U
у → u ; # CYRILLIC SMALL LETTER U
Ў → W ; # CYRILLIC CAPITAL LETTER SHORT U
@@ -83,12 +194,28 @@ $wordBoundary = [^[:L:][:M:][:N:]] ;
Х} $lower → Kh ; # CYRILLIC CAPITAL LETTER HA
Х → KH ; # CYRILLIC CAPITAL LETTER HA
х → kh ; # CYRILLIC SMALL LETTER HA
+#
+########################################################################
+#
+# BGN Page 23 Rule 1
+#
+# цг becomes ts·h
+#
+########################################################################
+#
ЦГ → TS·H ; # CYRILLIC CAPITAL LETTER TSE
Цг → Ts·h ; # CYRILLIC CAPITAL LETTER TSE
цг → ts·h ; # CYRILLIC SMALL LETTER TSE
Ц} $lower → Ts ; # CYRILLIC CAPITAL LETTER TSE
Ц → TS ; # CYRILLIC CAPITAL LETTER TSE
ц → ts ; # CYRILLIC SMALL LETTER TSE
+#
+#
+########################################################################
+#
+# End Rule 1
+#
+########################################################################
Ч} $lower → Ch ; # CYRILLIC CAPITAL LETTER CHE
Ч → CH ; # CYRILLIC CAPITAL LETTER CHE
ч → ch ; # CYRILLIC SMALL LETTER CHE
@@ -105,5 +232,22 @@ $wordBoundary = [^[:L:][:M:][:N:]] ;
Я → YA ; # CYRILLIC CAPITAL LETTER YA
я → ya ; # CYRILLIC SMALL LETTER YA
’ → $doublePrime ; # LEFT SINGLE QUOTATION MARK
+#
+########################################################################
+#
+# BGN Page 23 Note 2
+#
+# The obsolete character ґ should be romanized g.
+#
+########################################################################
+#
Ґ → G ; # CYRILLIC CAPITAL LETTER GHE WITH UPTURN
ґ → g ; # CYRILLIC SMALL LETTER GHE WITH UPTURN
+#
+#
+########################################################################
+#
+# End Note 2
+#
+########################################################################
+
« no previous file with comments | « source/data/translit/az_Upper.txt ('k') | source/data/translit/bg_bg_Latn_BGN.txt » ('j') | no next file with comments »

Powered by Google App Engine
This is Rietveld 408576698