Additional digits Since the 1960s Shan digits have been used alongside Myanmar and European digits.

Similar documents
VOWEL SIGN CONSONANT SIGN SHAN MEDIAL WA contrasts with the

tone marks. (Figures 4, 5, 6, 7, and 8.)

N3976R L2/11-130R

N3976 L2/11-130)

This is a preliminary proposal to encode the Mandaic script in the BMP of the UCS.

This document requests an additional character to be added to the UCS and contains the proposal summary form.

Proposal to encode Al-Dani Quranic marks used in Quran published in Libya. For consideration by UTC and ISO/IEC JTC1/SC2/WG2

Proposal to Encode the Typikon Symbols in Unicode: Part 2 Old Rite Symbols

@ó 061A

Proposal to Encode the Typikon Symbols in Unicode

ISO/IEC JTC1/SC2/WG2 N3816

Proposal to encode svara markers for the Jaiminiya Archika. 1. Background

This document requests an additional character to be added to the UCS and contains the proposal summary form.

Proposal to encode Quranic marks used in Quran published in Libya (Narration of Qaloon with script Aldani)

Request to encode South Indian CANDRABINDU-s. Shriramana Sharma, jamadagni-at-gmail-dot-com, India 2010-Oct Background

This is a preliminary proposal to encode the Chakma script in the BMP of the UCS.

ISO/IEC JTC1/SC2/WG2 N25xx

If these characters were in second position in a cluster, would they interfere with searching operations? Example: vs.

JTC2/SC2/WG2 N 2190 Date:

A. Administrative. B. Technical -- General

Proposal to Encode the Typikon Symbols in Unicode

1 RAÑJANA encompasses: Rañjana (Figure 1, 2, 3) Wartu (Figure 4)

Ê P p P f Í Ṣ ṣ Ṣ ž? ˆ Š š Š č, ǰ. œ BI bi BI be. œ LIḄA lebba heart RḄH rabba great

ISO/IEC JTC1/SC2/WG2 N3767 L2/10-012R

ISO/IEC JTC1/SC2/WG2 N4283 L2/12-214

Proposal to Encode the Grantha Script in the Basic Multilingual Plane (BMP) of ISO/IEC 10646

ƒ Δ ~ ÀÃÕŒœ ÿÿ Ä Å Ç É Ñ Ö Ü á à â ä ã å ç èê ë í ì î ñ ó ô õ ß Ø ± π ª

Î 2CEB Ï 2CEC Ì 2CED Ó FE26 COMBINING CONJOINING MACRON

+ HETH ḥw = WAW. ḥr = RESH + HETH. br = RESH + BETH + HETH ḥd = DALETH

4. Radicals. The chief issue about which we would like feedback at this time is the question of the encoding of Jurchen radicals.

Summary. Background. Individual Contribution For consideration by the UTC. Date:

Revised proposal to encode Hanifi Rohingya in Unicode

typically extends beneath the killed letter and the letter following. A syllable is structured (and represented in the backing store) as follows:

Schema for the Transliteration of Sanskrit and Pāḷi

Proposal to Encode the Grantha Script in the Supplementary Multilingual Plane (SMP) of ISO/IEC 10646

Proposal to encode the Hanifi Rohingya script in Unicode

ƒ Δ ~ ÀÃÕŒœ ÿÿ Ä Å Ç É Ñ Ö Ü á à â ä ã å ç èê ë í ì î ñ ó ô õ ß Ø ± π ª

2. Processing. Imperial Aramaic is an alphabetic script written right-to-left, in scriptio continua or with spaces between words.

Bhagavad Gétä Chanting Guidelines

Bhagavad Gétä Chanting Guidelines (Chpt 17)

Title: Mongolian Glyph Table With Word Samples Author: Greg Eck Date: 2004-Nov-12 Related: See table of links at the end of the document

Dual-joining Psalter Pahlavi Characters Character X n X r X m X l. Right-joining Psalter Pahlavi Characters Character X n X r

Proposal to encode Grantha Chillu Marker sign in Unicode/ISO 10646

Dual-joining Manichaean Characters Character X n X r X m X l. Right-joining Manichaean Characters Character X n X r

Proposal to Encode Alternative Characters for Biblical Hebrew

ê LONG E e û LONG u U åˆ: OVERLONG AA åˆ LONG AA å î LONG i â: OVERLONG â LONG a o ă SHORT A

4. Shaping. Dual-joining Manichaean Characters Character Right-joining Manichaean Characters Character Left-joining Manichaean Characters Character

Response to the Proposal to Encode Phoenician in Unicode. Dean A. Snyder 8 June 2004

Arabic. Arabic Page 1

Proposal to add two Tifinagh characters for vowels in Tuareg language variants

Arabic. The previous UN-approved system is still found in considerable international usage.

L2/ Background. Proposal

, Historical Perspective on the Cryptography Technology: Its Primordial Form in Classical Sanskrit Literature

Responses to Several Hebrew Related Items

The Construction of Sanskrit Database for a Polyglot Buddhist Dictionary

Request for editorial updates to Indic scripts

Proposal to encode Vedic characters for the Grantha script. 1. Characters being proposed

Final Proposal to Encode the Khojki Script in ISO/IEC 10646

ISO/IEC JTC1/SC2/WG2 N2972

Easter Sunday Wonderful Day

Dual-joining Manichaean Characters Character X n X r X m X l. Right-joining Manichaean Characters Character X n X r

The Unicode Standard Version 7.0 Core Specification

ISO/IEC JTC1/SC2/WG2 Coded Character Set Secretariat: Japan (JISC)

daughters to recite Surah Waqiah everyday as the Holy Prophet (sa) said, whoever recites Surah Waqiah every night will never go hungry.

Title: Preliminary Proposal to Encode the Turkestani Script Author: Lee Wilson Date:

Yangon-Mechanical. 1 Arkar Myint U M-1 Mechanical 12/Ka Ma Ya(N) U Myint Tin

Some of the companions mentioned by Huzoor Anwar (aa) today are as follows:

Mandalay-Construction. 2 U Hla Htwe C-502 7/Na Tha La(N) Construction U Aung Sin

Automatic Recognition of Tibetan Buddhist Text by Computer. Masami Kojima*1, Yoshiyuki Kawazoe*2 and Masayuki Kimura*3

TOWARDS UNICODE STANDARD FOR URDU - WG2 N2413-1/SC2 N35891

Developing Database of the Pāli Canon

CODE COLLISSIONS IN THE PROPOSAL OF MICHAEL EVERSON! Working document with error samples from N3532

Issues in the Representation of Pointed Hebrew in Unicode

Lessons and Rulings from the Story about the Guests of Ibraaheem Taken from Tayseer al-kareem ar-rahmaan fee Tafseer Kalaam al-mannaan, by as-sa dee

Follow-up to Extended Tamil proposal L2/10-256R. 1. Encoding model of Extended Tamil and related script-forms

Everson Typography. 48B Gleann na Carraige, Cill Fhionntain Baile Átha Cliath 13, Éire. Computer Locale Requirements for Afghanistan TYPOGRAPHY

The Unicode Standard Version 8.0 Core Specification

The Unicode Standard Version 11.0 Core Specification

Brinton follows this with a discussion of Parra s characters by Otto Stoll; I have given them alongside modern transcriptions:

StoryTown Reading/Language Arts Grade 2

A SONG OF MILAREPA MILAREPA AND THE GESHE IN DRIN LOTSAWA TONY DUFF PADMA KARPO TRANSLATION COMMITTEE

Exhibit 1.Example used by Everson in proposal L2/ The example given is grammatically and orthographically Tamil. This is an example of the

Review of Bengali Khanda Ta and PRI-30 Feedback

BURMA: COMPARISON OF NEW GOVERNMENT OFFICIALS WITH THE COUNCIL OF THE EUROPEAN UNION LIST OF SANCTIONED REGIME MEMBERS 1

In alten, alten Zeiten, als die Menschen noch in ganz anderen Sprachen redeten, gab es in den warmen Ländern schon große und prächtige Städte.

Department of Tarbiyat

Part III HISTORY. Maximum : 80 Scores (Map of India to Accompany)

Wilson Fundations Scope and Sequence

Comments on Grantha OM

Discussion Draft. Basics of Veda Swaras and Vedic Recital-07. Ghana Paatam

Scott Foresman Reading Street Common Core 2013

Department of Tarbiyat

(Guha Ashtakam) am) Sree Narayana Guru

PY An 1. The text of the celebrated Pylos tablet An 1 reads as follows:

Check-List: Books for Pre-school. We Represent: Price (`)

Scott Foresman Reading Street Common Core 2013

Minnesota Academic Standards for Language Arts Kindergarten

Mandalay-Construction. 2 Aung Khaing Tun U C-802 Construction 2/LaKaNa(N) U Saw Lwin

CODE COLLISSIONS IN THE PROPOSAL OF MICHAEL EVERSON! Working document with error samples from N3532-N3697

Proposal to Encode the Mark's Chapter Glyph in theunicode Standard

Transcription:

ISO/IEC JTC1/SC2/WG2 N3277R3 L2/07-205R3 2007-08-28 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal for encoding additional Myanmar characters for Shan in the UCS Source: Michael Everson Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Date: 2007-08-28 New information has come to light about the Shan extensions under ballot in FPDAM4. Seventeen characters should be added to the ballot and the characters should be re-arranged as shown in the code table below. In that table, the characters on the current ballot are given with blue shading, and the new characters which should be added are given in yellow. Additional letters The letter Ä LETTER SHAN THA is used in Shan to represent a foreign sound. Four other such letters exist: LETTER SHAN GA contrasts with Burmese Ç LETTER GA; LETTER SHAN ZA is unique but in one Shan Pali orthography it answers to Burmese à LETTER JHA; LETTER SHAN DA contrasts with Burmese í LETTER DA; and ˇ LETTER SHAN BA contrasts with Burmese ó LETTER BA. Additional digits Since the 1960s Shan digits have been used alongside Myanmar and European digits. Additional symbols The character û SYMBOL SHAN ONE is used as a measure word; it is not a digit. Although it looks similar to @É VOWEL SIGN SHAN AA with @ SIGN ANUSVARA, it is not a combining character. The character ü SYMBOL SHAN EXCLAMATION is used to represent exclamations like English Oh! Alas! Ah!. Shan Council tones The analysis made of Shan Council tones in N3143 was incomplete. Shan Council orthography marked four tones: ıé ka 1 is unmarked, ıéã ka 2 uses SIGN SHAN COUNCIL TONE-2, ıéå ka 3 uses SIGN SHAN COUNCIL TONE-3, ıé ka 4 uses SIGN VISARGA, ıéâ ka 5 uses SIGN SHAN TONE-5. In FPDAM4 ıéã ka 2 SIGN SHAN COUNCIL TONE-2 is missing, and ıéå ka 3 is mistakenly named SIGN SHAN COUNCIL TONE-4 (it should be SIGN SHAN COUNCIL TONE-3). Glyph changes The letter LETTER SHAN NYA on the ballot is not shown with the preferred shape. should be used. The glyph of @å SIGN SHAN COUNCIL TONE-3 on the ballot is slightly too high. The correct proportions: @ @å @â. The Unicode Standard may wish to cross-reference U+1037, U+1085, and U+108B. Ordering The unified order for the Myanmar script incorporating the characters here (and those previously accepted for encoding) is given below. Ordering is syllable-based, so this is indicative of only one level of ordering. The Shan, Karen, and Kayah characters currently on the ballot are shown below in italics. 1

ka < shan-ka < kha < shan-kha < ga < shan ga < gha < nga < mon-nga < ca < shan-ca < cha < ja < shan za < jha < mon-jha < sgaw-karen-sha < nya < shan-nya < nnya < tta < ttha < dda < ddha < nna < eastern-pwo-karen-nna < ta < tha < da < shan da < dha < na < shan-na < pa < pha < shan-pha < shan-fa < rumai-palaung-fa < ba < shan ba < bha < ma < ya < ra < la < wa < shan-tha < sha < ssa < western-pwo-karen-tha < sa < great-sa < ha < shan-ha < lla < mon-bba < eastern-pwo-karen-ywa < eastern-pwo-karen-gwa < a < shan-a < kayah-oe < i < ii < u < kayah-u < kayah-ee < uu < vocalic-r < vocalic-rr < vocalic-l < vocalic-ll < e < mon-bbe < western-pwo-karen-pwa < mon-e < o < au Unicode Character Properties 1022;MYANMAR LETTER SHAN A;Lo;0;L;;;;;N;;;;; 1065;MYANMAR LETTER WESTERN PWO KAREN THA;Lo;0;L;;;;;N;;;;; 1066;MYANMAR LETTER WESTERN PWO KAREN PWA;Lo;0;L;;;;;N;;;;; 1067;MYANMAR VOWEL SIGN WESTERN PWO KAREN EU;Mc;0;L;;;;;N;;;;; 1068;MYANMAR VOWEL SIGN WESTERN PWO KAREN UE;Mc;0;L;;;;;N;;;;; 1069;MYANMAR SIGN WESTERN PWO KAREN TONE-1;Mc;0;L;;;;;N;;;;; 106A;MYANMAR SIGN WESTERN PWO KAREN TONE-2;Mc;0;L;;;;;N;;;;; 106B;MYANMAR SIGN WESTERN PWO KAREN TONE-3;Mc;0;L;;;;;N;;;;; 106C;MYANMAR SIGN WESTERN PWO KAREN TONE-4;Mc;0;L;;;;;N;;;;; 106D;MYANMAR SIGN WESTERN PWO KAREN TONE-5;Mc;0;L;;;;;N;;;;; 106E;MYANMAR LETTER EASTERN PWO KAREN NNA;Lo;0;L;;;;;N;;;;; 106F;MYANMAR LETTER EASTERN PWO KAREN YWA;Lo;0;L;;;;;N;;;;; 1070;MYANMAR LETTER EASTERN PWO KAREN GHWA;Lo;0;L;;;;;N;;;;; 1071;MYANMAR VOWEL SIGN GEBA KAREN I;Mn;0;NSM;;;;;N;;;;; 1072;MYANMAR VOWEL SIGN KAYAH OE;Mn;0;NSM;;;;;N;;;;; 1073;MYANMAR VOWEL SIGN KAYAH U;Mn;0;NSM;;;;;N;;;;; 1074;MYANMAR VOWEL SIGN KAYAH EE;Mn;0;NSM;;;;;N;;;;; 1075;MYANMAR LETTER SHAN KA;Lo;0;L;;;;;N;;;;; 1076;MYANMAR LETTER SHAN KHA;Lo;0;L;;;;;N;;;;; 1077;MYANMAR LETTER SHAN GA;Lo;0;L;;;;;N;;;;; 1078;MYANMAR LETTER SHAN CA;Lo;0;L;;;;;N;;;;; 1079;MYANMAR LETTER SHAN ZA;Lo;0;L;;;;;N;;;;; 107A;MYANMAR LETTER SHAN NYA;Lo;0;L;;;;;N;;;;; 107B;MYANMAR LETTER SHAN DA;Lo;0;L;;;;;N;;;;; 107C;MYANMAR LETTER SHAN NA;Lo;0;L;;;;;N;;;;; 107D;MYANMAR LETTER SHAN PHA;Lo;0;L;;;;;N;;;;; 107E;MYANMAR LETTER SHAN FA;Lo;0;L;;;;;N;;;;; 107F;MYANMAR LETTER SHAN BA;Lo;0;L;;;;;N;;;;; 1080;MYANMAR LETTER SHAN THA;Lo;0;L;;;;;N;;;;; 1081;MYANMAR LETTER SHAN HA;Lo;0;L;;;;;N;;;;; 1082;MYANMAR CONSONANT SIGN SHAN MEDIAL WA;Mn;0;NSM;;;;;N;;;;; 1083;MYANMAR VOWEL SIGN SHAN AA;Mc;0;L;;;;;N;;;;; 1084;MYANMAR VOWEL SIGN SHAN E;Mc;0;NSM;;;;;N;;;;; 1085;MYANMAR VOWEL SIGN SHAN E ABOVE;Mn;0;NSM;;;;;N;;;;; 1086;MYANMAR VOWEL SIGN SHAN FINAL Y;Mn;0;NSM;;;;;N;;;;; 1087;MYANMAR SIGN SHAN TONE-2;Mc;0;L;;;;;N;;;;; 1088;MYANMAR SIGN SHAN TONE-3;Mc;0;L;;;;;N;;;;; 1089;MYANMAR SIGN SHAN TONE-5;Mc;0;L;;;;;N;;;;; 108A;MYANMAR SIGN SHAN TONE-6;Mc;0;L;;;;;N;;;;; 108B;MYANMAR SIGN SHAN COUNCIL TONE-2;Mc;0;L;;;;;N;;;;; 108C;MYANMAR SIGN SHAN COUNCIL TONE-3;Mc;0;L;;;;;N;;;;; 108D;MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE;Mn;0;NSM;;;;;N;;;;; 108E;MYANMAR VOWEL SIGN RUMAI PALAUNG FA;Lo;0;L;;;;;N;;;;; 108F;MYANMAR SIGN RUMAI PALAUNG TONE-6;Mc;0;L;;;;;N;;;;; 1090;MYANMAR SHAN DIGIT ZERO;Nd;0;L;;0;0;0;N;;;;; 1091;MYANMAR SHAN DIGIT ONE;Nd;0;L;;1;1;1;N;;;;; 1092;MYANMAR SHAN DIGIT TWO;Nd;0;L;;2;2;2;N;;;;; 1093;MYANMAR SHAN DIGIT THREE;Nd;0;L;;3;3;3;N;;;;; 1094;MYANMAR SHAN DIGIT FOUR;Nd;0;L;;4;4;4;N;;;;; 1095;MYANMAR SHAN DIGIT FIVE;Nd;0;L;;5;5;5;N;;;;; 1096;MYANMAR SHAN DIGIT SIX;Nd;0;L;;6;6;6;N;;;;; 1097;MYANMAR SHAN DIGIT SEVEN;Nd;0;L;;7;7;7;N;;;;; 1098;MYANMAR SHAN DIGIT EIGHT;Nd;0;L;;8;8;8;N;;;;; 1099;MYANMAR SHAN DIGIT NINE;Nd;0;L;;9;9;9;N;;;;; 109E;MYANMAR SYMBOL SHAN ONE;Po;0;L;;;;;N;;;;; 109F;MYANMAR SYMBOL SHAN EXCLAMATION;Po;0;L;;;;;N;;;;; 2

Figures Figure 1. Sample from the 1997 Shan Reader for Kindergarten, showing LETTER SHAN ZA, LETTER SHAN DA, and ˇ LETTER SHAN BA. Figure 2. Sample from Sai Hkam Leik s 1999 Songs for the Highland, Volume I, showing ˇ LETTER SHAN BA, LETTER SHAN GA, LETTER SHAN ZA, and LETTER SHAN DA. 3

Figure 3. Sample from the 1985 Shan Reader for Kindergarten, showing ü SYMBOL SHAN EXCLAMATION. Figure 4. Sample from Hkwan Hseng s 1997 Who s Who in the Shan State, û SYMBOL SHAN ONE. Figure 5. Sample from the 1985 Shan Reader for Kindergarten, showing the preferred form of LETTER SHAN NYA. 4

Figure 6. Sample from the 1997 Shan Reader for Kindergarten, showing the preferred form of LETTER SHAN NYA. Figure 7. Sample from a Shan Council reader, showing ıé ka 1, ıéã ka 2 with SIGN SHAN COUNCIL TONE-2, ıéå ka 3 with SIGN SHAN COUNCIL TONE-3, ıé ka 4 with SIGN VISARGA, and ıéâ ka 5 with SIGN SHAN TONE-5. Note specifically the height of the dot in SIGN SHAN COUNCIL TONE-2 is the same as that of the dots in SIGN SHAN COUNCIL TONE-3, while the dot in SIGN SHAN TONE-5, is aligned to the baseline as is the lower dot of SIGN VISARGA. Shan Council orthography does not use the non-spacing SIGN DOT BELOW as other orthographies do. 5

Figure 8. Sample of a 2003 calendar showing Shan digits alongside European digits and Myanmar digits. Figure 9. Shan digits, from a 2005 Shan Reader.. 6

Proposal for encoding additional Myanmar characters for Shan in the UCS Michael Everson Row 10: MYANMAR 100 101 102 103 104 105 106 107 108 109 0 Ä ê @ @ Ä ê 1 Å ë ±@ @Ò Å ë 2 Ç í @ @ @Ú @Ç í 3 É ì @ @ @Û @É ì 4 Ñ î @ ƒ @ @Ù Ñ@ î 5 Ö ï @μ  ı @Ö ï 6 Ü ñ @ Δ @ Ê ˆ @Ü ñ 7 á ó ß @ «@ @Á @á ó 8 à ò @» @ÿ @Ë @à ò G = 00 P = 00 9 â ô π @Ÿ @È @â ô A ä ö @ ~ @Í @ä B ã õ @ @ª À @Î @ã C å ú @ º@ à @Ï @å D ç ù @ @ Õ @Ì @ç E é û @Æ @æ Œ @fi Ó é û F è ü @Ø ø œ @fl Ô ˇ @è ü 7

Proposal for encoding Shan characters in the UCS Michael Everson Row 10: MYANMAR hex Name hex Name 00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F 10 11 12 13 14 15 16 17 18 19 1A 1B 1C 1D 1E 1F 20 21 22 23 24 25 26 27 28 29 2A 2B 2C 2D 2E 2F 30 31 32 33 34 35 36 37 38 39 3A 3B 3C 3D 3E 3F 40 41 42 43 44 45 46 47 48 49 4A 4B 4C 4D 4E 4F 50 51 52 53 54 55 56 57 58 MYANMAR LETTER KA MYANMAR LETTER KHA MYANMAR LETTER GA MYANMAR LETTER GHA MYANMAR LETTER NGA MYANMAR LETTER CA MYANMAR LETTER CHA MYANMAR LETTER JA MYANMAR LETTER JHA MYANMAR LETTER NYA MYANMAR LETTER NNYA MYANMAR LETTER TTA MYANMAR LETTER TTHA MYANMAR LETTER DDA MYANMAR LETTER DDHA MYANMAR LETTER NNA MYANMAR LETTER TA MYANMAR LETTER THA MYANMAR LETTER DA MYANMAR LETTER DHA MYANMAR LETTER NA MYANMAR LETTER PA MYANMAR LETTER PHA MYANMAR LETTER BA MYANMAR LETTER BHA MYANMAR LETTER MA MYANMAR LETTER YA MYANMAR LETTER RA MYANMAR LETTER LA MYANMAR LETTER WA MYANMAR LETTER SA MYANMAR LETTER HA MYANMAR LETTER LLA MYANMAR LETTER A MYANMAR LETTER SHAN A MYANMAR LETTER I MYANMAR LETTER II MYANMAR LETTER U MYANMAR LETTER UU MYANMAR LETTER E MYANMAR LETTER MON E MYANMAR LETTER O MYANMAR LETTER AU MYANMAR VOWEL SIGN TALL AA MYANMAR VOWEL SIGN AA MYANMAR VOWEL SIGN I MYANMAR VOWEL SIGN II MYANMAR VOWEL SIGN U MYANMAR VOWEL SIGN UU MYANMAR VOWEL SIGN E MYANMAR VOWEL SIGN AI MYANMAR VOWEL SIGN MON II MYANMAR VOWEL SIGN MON O MYANMAR VOWEL SIGN E ABOVE MYANMAR SIGN ANUSVARA MYANMAR SIGN DOT BELOW MYANMAR SIGN VISARGA MYANMAR SIGN VIRAMA MYANMAR SIGN ASAT MYANMAR CONSONANT SIGN MEDIAL YA MYANMAR CONSONANT SIGN MEDIAL RA MYANMAR CONSONANT SIGN MEDIAL WA MYANMAR CONSONANT SIGN MEDIAL HA MYANMAR LETTER GREAT SA MYANMAR DIGIT ZERO MYANMAR DIGIT ONE MYANMAR DIGIT TWO MYANMAR DIGIT THREE MYANMAR DIGIT FOUR MYANMAR DIGIT FIVE MYANMAR DIGIT SIX MYANMAR DIGIT SEVEN MYANMAR DIGIT EIGHT MYANMAR DIGIT NINE MYANMAR SIGN LITTLE SECTION MYANMAR SIGN SECTION MYANMAR SYMBOL LOCATIVE MYANMAR SYMBOL COMPLETED MYANMAR SYMBOL AFOREMENTIONED MYANMAR SYMBOL GENITIVE MYANMAR LETTER SHA MYANMAR LETTER SSA MYANMAR LETTER VOCALIC R MYANMAR LETTER VOCALIC RR MYANMAR LETTER VOCALIC L MYANMAR LETTER VOCALIC LL MYANMAR VOWEL SIGN VOCALIC R MYANMAR VOWEL SIGN VOCALIC RR MYANMAR VOWEL SIGN VOCALIC L Group 00 Plane 00 Row 10 8 59 5A 5B 5C 5D 5E 5F 60 61 62 63 64 65 66 67 68 69 6A 6B 6C 6D 6E 6F 70 71 72 73 74 75 76 77 78 79 7A 7B 7C 7D 7E 7F 80 81 82 83 84 85 86 87 88 89 8A 8B 8C 8D 8E 8F 90 91 92 93 94 95 96 97 98 99 9A 9B 9C 9D 9E 9F MYANMAR VOWEL SIGN VOCALIC LL MYANMAR LETTER MON NGA MYANMAR LETTER MON JHA MYANMAR LETTER MON BBA MYANMAR LETTER MON BBE MYANMAR CONSONANT SIGN MON MEDIAL NA MYANMAR CONSONANT SIGN MON MEDIAL MA MYANMAR CONSONANT SIGN MON MEDIAL LA MYANMAR LETTER SGAW KAREN SHA MYANMAR LETTER SGAW KAREN EU MYANMAR SIGN SGAW KAREN HATHI MYANMAR SIGN SGAW KAREN KE PHO MYANMAR LETTER WESTERN PWO KAREN THA MYANMAR LETTER WESTERN PWO KAREN PWA MYANMAR VOWEL SIGN WESTERN PWO KAREN EU MYANMAR VOWEL SIGN WESTERN PWO KAREN UE MYANMAR SIGN WESTERN PWO KAREN TONE-1 MYANMAR SIGN WESTERN PWO KAREN TONE-2 MYANMAR SIGN WESTERN PWO KAREN TONE-3 MYANMAR SIGN WESTERN PWO KAREN TONE-4 MYANMAR SIGN WESTERN PWO KAREN TONE-5 MYANMAR LETTER EASTERN PWO KAREN NNA MYANMAR LETTER EASTERN PWO KAREN YWA MYANMAR LETTER EASTERN PWO KAREN GHWA MYANMAR VOWEL SIGN GEBA KAREN I MYANMAR VOWEL SIGN KAYAH OE MYANMAR VOWEL SIGN KAYAH U MYANMAR VOWEL SIGN KAYAH EE MYANMAR LETTER SHAN KA MYANMAR LETTER SHAN KHA MYANMAR LETTER SHAN GA MYANMAR LETTER SHAN CA MYANMAR LETTER SHAN ZA MYANMAR LETTER SHAN NYA MYANMAR LETTER SHAN DA MYANMAR LETTER SHAN NA MYANMAR LETTER SHAN PHA MYANMAR LETTER SHAN FA MYANMAR LETTER SHAN BA MYANMAR LETTER SHAN THA MYANMAR LETTER SHAN HA MYANMAR CONSONANT SIGN SHAN MEDIAL WA MYANMAR VOWEL SIGN SHAN AA MYANMAR VOWEL SIGN SHAN E MYANMAR VOWEL SIGN SHAN E ABOVE MYANMAR VOWEL SIGN SHAN FINAL Y MYANMAR SIGN SHAN TONE-2 MYANMAR SIGN SHAN TONE-3 MYANMAR SIGN SHAN TONE-5 MYANMAR SIGN SHAN TONE-6 MYANMAR SIGN SHAN COUNCIL TONE-2 MYANMAR SIGN SHAN COUNCIL TONE-3 MYANMAR SIGN SHAN COUNCIL EMPHATIC TONE MYANMAR VOWEL SIGN RUMAI PALAUNG FA MYANMAR SIGN RUMAI PALAUNG TONE-5 MYANMAR SHAN DIGIT ZERO MYANMAR SHAN DIGIT ONE MYANMAR SHAN DIGIT TWO MYANMAR SHAN DIGIT THREE MYANMAR SHAN DIGIT FOUR MYANMAR SHAN DIGIT FIVE MYANMAR SHAN DIGIT SIX MYANMAR SHAN DIGIT SEVEN MYANMAR SHAN DIGIT EIGHT MYANMAR SHAN DIGIT NINE (This position shall not be used) (This position shall not be used) (This position shall not be used) (This position shall not be used) MYANMAR SYMBOL SHAN ONE MYANMAR SYMBOL SHAN EXCLAMATION

A. Administrative 1. Title Proposal for encoding additional Myanmar characters for Shan in the UCS. 2. Requester s name Michael Everson 3. Requester type (Member body/liaison/individual contribution) Individual contribution. 4. Submission date 2007-08-28 5. Requester s reference (if applicable) 6. Choose one of the following: 6a. This is a complete proposal 6b. More information will be provided later B. Technical General 1. Choose one of the following: 1a. This proposal is for a new script (set of characters) Proposed name of script 1b. The proposal is for addition of character(s) to an existing block 1c. Name of the existing block Myanmar. 2. Number of characters in proposal 17 3. Proposed category (A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols) Category A. 4a. Proposed Level of Implementation (1, 2 or 3) Level 2 4b. Is a rationale provided for the choice? 4c. If YES, reference Brahmic Level 2 implementation. 5a. Is a repertoire including character names provided? 5b. If YES, are the names in accordance with the character naming guidelines in Annex L of P&P document? 5c. Are the character shapes attached in a legible form suitable for review? 6a. Who will provide the appropriate computerized font (ordered preference: True Type, or PostScript format) for publishing the standard? Michael Everson. 6b. If available now, identify source(s) for the font (include address, e-mail, ftp-site, etc.) and indicate the tools used: Michael Everson, Fontographer. 7a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? 7b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? 8. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? 9. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. See above. C. Technical Justification 1. Has this proposal for addition of character(s) been submitted before? If YES, explain. 2a. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? 9

2b. If YES, with whom? Sai Zin Di Di Zone, Khwaan Tai, Sai Murngzuen Hengtai. 2c. If YES, available relevant documents 3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? People in Myanmar. 4a. The context of use for the proposed characters (type of use; common or rare) Common. 4b. Reference 5a. Are the proposed characters in current use by the user community? 5b. If YES, where? In Myanmar. 6a. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? 6b. If YES, is a rationale provided? 6c. If YES, reference Contemporary use and accordance with the Roadmap. 7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? N/A. 8a. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? 8b. If YES, is a rationale for its inclusion provided? Some of the Shan tone marks look superficially like punctuation marks, but they are combining characters and typically have a hollow dot. 8c. If YES, reference 9a. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? 9b. If YES, is a rationale for its inclusion provided? 9c. If YES, reference 10a. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? 10b. If YES, is a rationale for its inclusion provided? 10c. If YES, reference See Issues above. 11a. Does the proposal include use of combining characters and/or use of composite sequences? 11b. If YES, is a rationale for such use provided? 11c. If YES, reference Brahmic vowel and consonant signs. 11d. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? 11e. If YES, reference 12a. Does the proposal contain characters with any special properties such as control function or similar semantics? 12b. If YES, describe in detail (include attachment if necessary) 13a. Does the proposal contain any Ideographic compatibility character(s)? 13b. If YES, is the equivalent corresponding unified ideographic character(s) identified? 10