Skip to content

Commit fa562ae

Browse files
committed
#137 add bgnpcgn-ron-cyrl-latn-2002
1 parent dd5b2fe commit fa562ae

File tree

1 file changed

+165
-0
lines changed

1 file changed

+165
-0
lines changed
+165
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,165 @@
1+
metadata {
2+
authority_id: bgnpcgn
3+
id: 2002
4+
language: iso-639-2:ron
5+
source_script: Cyrl
6+
destination_script: Latn
7+
name: CYRILLIC - ROMAN BGN/PCGN 2002 Agreement
8+
url: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/816780/TABLE_OF_CORRESPONDENCES_FOR_MOLDOVAN.pdf
9+
creation_date: 2002
10+
confirmation_date: 2019
11+
description: |
12+
In 1990, the Moldovan authorities adopted the Roman alphabet to replace the existing Cyrillic
13+
alphabet. The presentation below provides a table of correspondences between the former Cyrillic alphabet
14+
and the current Roman alphabet. When Moldovan Roman-alphabet spellings are not available, this table
15+
can be used to convert Moldovan Cyrillic spellings. It should be noted that Moldovan is little used as a
16+
language label now, and is usually considered as Romanian.
17+
18+
notes:
19+
- | # 1
20+
This Roman-script form is used before e, i and iu.
21+
22+
- | # 2
23+
This Roman-script form is used before a.
24+
25+
- | # 3
26+
The letter ‘i’ is used to represent Ь at the end of a word.
27+
28+
- | # 4
29+
An inventory of letter-diacritic combinations, with their Unicode encoding, in addition to the
30+
unmodified letters of the basic Roman script is:
31+
Ț (U+021A) ț (U+021B)
32+
Ș (U+0218) ș (U+0219)
33+
Î (U+00CE) î (U+00EE)
34+
Ă (U+0102) ă (U+0103)
35+
36+
- | # 5
37+
The Roman-script columns show only lowercase forms but, when applying the table, uppercase and
38+
lowercase Roman letters as appropriate should be used.
39+
}
40+
41+
tests {
42+
# https://www.tourister.ru/world/europe/moldova-republic-of/publications/507
43+
test "Бунэ диминеа'ца!", "Bună diminea'ţa!"
44+
test "Ну неам вэзут де ун кар де ань!", "Nu neam văzut de un car de ani!" # note[3]
45+
test "Перми'тецимь сэ вэ салут ын нумеле...", "Permi'teţimi să vă salut în numele..." # note[3]
46+
test "Сэ фий сэнэтос!", "Să fii sănătos!"
47+
test "Ку ун груп де туришть", "Cu un grup de turişti" # note[3]
48+
test "Пе соци'я меа", "Pe soţi'ea mea"
49+
test "хотелул", "hotelul"
50+
test "Че са ынтымплат?", "Ce sa întîmplat?"
51+
test "Драджь", "Dradzhi" # note[3]
52+
test "Че есте ачаста?", "Ce este aceasta?" # note[1] note[2]
53+
test "черюрй", "ceriuri" # note[1]
54+
test "феричире!", "fericire!"
55+
test "ӂэужык", "giăuzhîc"
56+
57+
test "БУНЭ ДИМИНЕА'ЦА!", "BUNĂ DIMINEA'ŢA!"
58+
test "НУ НЕАМ ВЭЗУТ ДЕ УН КАР ДЕ АНЬ!", "NU NEAM VĂZUT DE UN CAR DE ANI!" # note[3]
59+
test "ПЕРМИ'ТЕЦИМЬ СЭ ВЭ САЛУТ ЫН НУМЕЛЕ...", "PERMI'TEŢIMI SĂ VĂ SALUT ÎN NUMELE..." # note[3]
60+
test "СЭ ФИЙ СЭНЭТОС!", "SĂ FII SĂNĂTOS!"
61+
test "КУ УН ГРУП ДЕ ТУРИШТЬ", "CU UN GRUP DE TURIŞTI" # note[3]
62+
test "ПЕ СОЦИ'Я МЕА", "PE SOŢI'EA MEA"
63+
test "ХОТЕЛУЛ", "HOTELUL"
64+
test "ЧЕ СА ЫНТЫМПЛАТ?", "CE SA ÎNTÎMPLAT?"
65+
test "ДРАДЖЬ", "DRADZHI" # note[3]
66+
test "ЧЕ ЕСТЕ АЧАСТА?", "CE ESTE ACEASTA?" # note[1] note[2]
67+
test "ЧЕРЮРЙ", "CERIURI" # note[1]
68+
test "ФЕРИЧИРЕ!", "FERICIRE!"
69+
test "Ӂэужык", "GIăuzhîc"
70+
}
71+
72+
stage {
73+
74+
# RULES
75+
# note[1]
76+
sub "\u0413", "Gh", after: any("ЕеИиЙйЮю") # Г[ЕеИиЙйЮю]
77+
sub "\u0433", "gh", after: any("ЕеИиЙйЮю") # г[ЕеИиЙйЮю]
78+
sub "\u04C1", "Gh", after: any("ЕеИиЙйЮю") # Ӂ[ЕеИиЙйЮю]
79+
sub "\u04C1", "gh", after: any("ЕеИиЙйЮю") # ӂ[ЕеИиЙйЮю]
80+
sub "\u041A", "Ch", after: any("ЕеИиЙйЮю") # К[ЕеИиЙйЮю]
81+
sub "\u043A", "ch", after: any("ЕеИиЙйЮю") # к[ЕеИиЙйЮю]
82+
sub "\u0427", "C", after: any("ЕеИиЙйЮю") # Ч[ЕеИиЙйЮю]
83+
sub "\u0447", "c", after: any("ЕеИиЙйЮю") # ч[ЕеИиЙйЮю]
84+
85+
# note[2]
86+
sub "\u04C1", "Ge", after: any("Аа") # Ӂ[Aa]
87+
sub "\u04C1", "ge", after: any("Аа") # ӂ[Aa]
88+
sub "\u0427", "CE", after: any("А") # Ч[A]
89+
sub "\u0427", "Ce", after: any("а") # Ч[a]
90+
sub "\u0447", "ce", after: any("Аа") # ч[Aa]
91+
92+
# note[3]
93+
sub "\u042C", "I", after: boundary # Ь$
94+
sub "\u044C", "i", after: boundary # ь$
95+
96+
97+
# CHARACTERS
98+
parallel {
99+
sub "\u0410", "A" # А
100+
sub "\u0411", "B" # Б
101+
sub "\u0412", "V" # B
102+
sub "\u0413", "G" # Г note[1]
103+
sub "\u0414", "D" # Д
104+
sub "\u0415", "E" # Е
105+
sub "\u0416", "ZH" # Ж
106+
sub "\u04C1", "GI" # Ӂ note[1] note[2]
107+
sub "\u0417", "Z" # З
108+
sub "\u0418", "I" # И
109+
sub "\u0419", "I" # Й
110+
sub "\u041A", "C" # К note[1]
111+
sub "\u041B", "L" # Л
112+
sub "\u041C", "M" # М
113+
sub "\u041D", "N" # Н
114+
sub "\u041E", "O" # О
115+
sub "\u041F", "P" # П
116+
sub "\u0420", "R" # Р
117+
sub "\u0421", "S" # С
118+
sub "\u0422", "T" # Т
119+
sub "\u0423", "U" # У
120+
sub "\u0424", "F" # Ф
121+
sub "\u0425", "H" # Х
122+
sub "\u0426", "Ţ" # Ц
123+
sub "\u0427", "CI" # Ч note[1] note[2]
124+
sub "\u0428", "Ş" # Ш
125+
sub "\u042B", "Î" # Ы
126+
sub "\u042C", "’" # Ь note[3]
127+
sub "\u042D", "Ă" # Э
128+
sub "\u042E", "IU" # Ю
129+
sub "\u042F", "EA" # Я
130+
131+
sub "\u0430", "a" # а
132+
sub "\u0431", "b" # б
133+
sub "\u0432", "v" # b
134+
sub "\u0433", "g" # г note[1]
135+
sub "\u0434", "d" # д
136+
sub "\u0435", "e" # е
137+
sub "\u0436", "zh" # ж
138+
sub "\u04C2", "gi" # ӂ note[1] note[2]
139+
sub "\u0437", "z" # з
140+
sub "\u0438", "i" # и
141+
sub "\u0439", "i" # й
142+
sub "\u043A", "c" # к note[1]
143+
sub "\u043B", "l" # л
144+
sub "\u043C", "m" # м
145+
sub "\u043D", "n" # н
146+
sub "\u043E", "o" # о
147+
sub "\u043F", "p" # п
148+
sub "\u0440", "r" # р
149+
sub "\u0441", "s" # с
150+
sub "\u0442", "t" # т
151+
sub "\u0443", "u" # у
152+
sub "\u0444", "f" # ф
153+
sub "\u0445", "h" # х
154+
sub "\u0446", "ţ" # ц
155+
sub "\u0447", "ci" # ч note[1] note[2]
156+
sub "\u0448", "ş" # ш
157+
sub "\u044B", "î" # ы
158+
sub "\u044C", "’" # ь note[3]
159+
sub "\u044D", "ă" # э
160+
sub "\u044E", "iu" # ю
161+
sub "\u044F", "ea" # я
162+
}
163+
164+
}
165+

0 commit comments

Comments
 (0)