This document requests an additional character to be added to the UCS and contains the proposal summary form.

Similar documents
This document requests an additional character to be added to the UCS and contains the proposal summary form.

This is a preliminary proposal to encode the Mandaic script in the BMP of the UCS.

Proposal to Encode the Typikon Symbols in Unicode: Part 2 Old Rite Symbols

Proposal to encode Al-Dani Quranic marks used in Quran published in Libya. For consideration by UTC and ISO/IEC JTC1/SC2/WG2

Proposal to Encode the Typikon Symbols in Unicode

ISO/IEC JTC1/SC2/WG2 N3816

Proposal to encode svara markers for the Jaiminiya Archika. 1. Background

L2/ Background. Proposal

Summary. Background. Individual Contribution For consideration by the UTC. Date:

Proposal to encode Quranic marks used in Quran published in Libya (Narration of Qaloon with script Aldani)

N3976R L2/11-130R

Request to encode South Indian CANDRABINDU-s. Shriramana Sharma, jamadagni-at-gmail-dot-com, India 2010-Oct Background

Responses to Several Hebrew Related Items

If these characters were in second position in a cluster, would they interfere with searching operations? Example: vs.

@ó 061A

N3976 L2/11-130)

Proposal to Encode the Typikon Symbols in Unicode

Additional digits Since the 1960s Shan digits have been used alongside Myanmar and European digits.

+ HETH ḥw = WAW. ḥr = RESH + HETH. br = RESH + BETH + HETH ḥd = DALETH

Issues in the Representation of Pointed Hebrew in Unicode

A. Administrative. B. Technical -- General

Proposal to Encode Alternative Characters for Biblical Hebrew

ISO/IEC JTC1/SC2/WG2 N25xx

VOWEL SIGN CONSONANT SIGN SHAN MEDIAL WA contrasts with the

Ê P p P f Í Ṣ ṣ Ṣ ž? ˆ Š š Š č, ǰ. œ BI bi BI be. œ LIḄA lebba heart RḄH rabba great

1 RAÑJANA encompasses: Rañjana (Figure 1, 2, 3) Wartu (Figure 4)

Proposal to encode Grantha Chillu Marker sign in Unicode/ISO 10646

JTC2/SC2/WG2 N 2190 Date:

tone marks. (Figures 4, 5, 6, 7, and 8.)

2. Processing. Imperial Aramaic is an alphabetic script written right-to-left, in scriptio continua or with spaces between words.

4. Radicals. The chief issue about which we would like feedback at this time is the question of the encoding of Jurchen radicals.

Response to the Proposal to Encode Phoenician in Unicode. Dean A. Snyder 8 June 2004

This is a preliminary proposal to encode the Chakma script in the BMP of the UCS.

Proposal to add two Tifinagh characters for vowels in Tuareg language variants

Typographic Concerns and the Hebrew Nomina Sacra

Î 2CEB Ï 2CEC Ì 2CED Ó FE26 COMBINING CONJOINING MACRON

Review of Bengali Khanda Ta and PRI-30 Feedback

MOVING TO A UNICODE-BASED LIBRARY SYSTEM: THE YESHIVA UNIVERSITY LIBRARY EXPERIENCE

The Unicode Standard Version 8.0 Core Specification

ISO/IEC JTC1/SC2/WG2 N4283 L2/12-214

The Unicode Standard Version 11.0 Core Specification

Proposal to Encode the Grantha Script in the Basic Multilingual Plane (BMP) of ISO/IEC 10646

Comments on Grantha OM

INTRODUCTION TO THE Holman Christian Standard Bible

ISO/IEC JTC1/SC2/WG2 N2972

Request for editorial updates to Indic scripts

ƒ Δ ~ ÀÃÕŒœ ÿÿ Ä Å Ç É Ñ Ö Ü á à â ä ã å ç èê ë í ì î ñ ó ô õ ß Ø ± π ª

The Unicode Standard Version 8.0 Core Specification

The Unicode Standard Version 11.0 Core Specification

TOWARDS UNICODE STANDARD FOR URDU - WG2 N2413-1/SC2 N35891

CONTENTS. Preface 13. Introduction 15. Chapter One: The Man and his Works against the Background of his Time 23

The Letter Alef Is The First Letter Of The Hebrew

CODE COLLISSIONS IN THE PROPOSAL OF MICHAEL EVERSON! Working document with error samples from N3532

The Unicode Standard Version 7.0 Core Specification

Chapter 40 The Hebrew Bible

Proposal to Encode the Mark's Chapter Glyph in theunicode Standard

ford residence southampton, ny

The Unicode Standard Version 10.0 Core Specification

Dual-joining Manichaean Characters Character X n X r X m X l. Right-joining Manichaean Characters Character X n X r

ISO/IEC JTC/1 SC/2 WG/2 N2474. Xerox Research Center Europe. 25 April 2002, marked revisions 17 May 2002

Mark McEntire Belmont University Nashville, Tennessee

Elaine Keown Fri, June 4, 2004 Tucson, Arizona

Follow-up to Extended Tamil proposal L2/10-256R. 1. Encoding model of Extended Tamil and related script-forms

Assignments. HEBR/REL-131 &132: Elementary Biblical Hebrew I, Spring Charles Abzug. Books and Other Source Materials for the Assignments:

Assignments. HEBR/REL-131 &132: Elementary Biblical Hebrew I, Spring Charles Abzug. Books and Other Source Materials for the Assignments:

Mikraot Gedolot haketer--biblia Rabbinica Behind the scenes with the project team

Tickling Ears (or) Seeking the Biblical Wow-Factor Tim Hegg TorahResource 2014

RBL 02/2005 Goshen-Gottstein, Moshe, and Shemaryahu Talmon, eds.; Galen Marquis, associate editor

The Making of the Hebrew Bible

ISO/IEC JTC1/SC2/WG2 Coded Character Set Secretariat: Japan (JISC)

CODE COLLISSIONS IN THE PROPOSAL OF MICHAEL EVERSON! Working document with error samples from N3532-N3697

THE TRANSMISSION OF THE OLD TESTAMENT. Randy Broberg, 2004

Assignments. HEBR/REL-131 & HEBR/REL-132: Elementary Biblical Hebrew I & II, Academic Year Charles Abzug

Dual-joining Manichaean Characters Character X n X r X m X l. Right-joining Manichaean Characters Character X n X r

Ruth 4:5 by Mark S. Haughwout

Proposal to encode Vedic characters for the Grantha script. 1. Characters being proposed

LA003B Biblical Hebrew B. Unit Outline. About this Unit Outline

Revised proposal to encode Hanifi Rohingya in Unicode

THE QUR AN VS. THE BIBLE. I. Textual Criticism of the Qur an and the Bible: A Direct Comparison

Dual-joining Psalter Pahlavi Characters Character X n X r X m X l. Right-joining Psalter Pahlavi Characters Character X n X r

The Persian Language and Arabic Script IDNs

"Fuldensis, Sigla for Variants in Vaticanus and 1Cor 14:34-5" NTS 41 (1995) Philip B. Payne

Xerox Research Center Europe. 25 April at the earliest opportunity to include four additional characters,

Developing Database of the Pāli Canon

1 The authors wish to acknowledge the support of the Universal Scripts Project (part of the

Tips for Using Logos Bible Software Version 3

Carolina Bachenheimer-Schaefer, Thorsten Reibel, Jürgen Schilder & Ilija Zivadinovic Global Application and Solution Team

The Alphabet Mark Francois 1. Hebrew Grammar. Week 1 (Last Updated Nov. 28, 2016)

The Eden Proverb 2004 by Gerry L. Folbré III Research

Introducing This Study Series Leading This Study SESSION 1: The Making of the Hebrew Bible... 11

Instructional Materials Required The Cambridge Introduction to Biblical Hebrew, Brian L. Webster. Cambridge University Press, 2009.

Strong's Dictionary Of The Bible PDF

Exhibit 1.Example used by Everson in proposal L2/ The example given is grammatically and orthographically Tamil. This is an example of the

Qu'ran fragment, in Arabic, before 911, vellum, MS M. 712, fols 19v-20r, 23 x 32 cm, possibly Iraq (The Morgan Library and Museum, New York)

Houghton Mifflin Harcourt Collections 2015 Grade 8. Indiana Academic Standards English/Language Arts Grade 8

Schema for the Transliteration of Sanskrit and Pāḷi

Brinton follows this with a discussion of Parra s characters by Otto Stoll; I have given them alongside modern transcriptions:

Some comments on the Arabic block in Unicode

Syllabus for GBIB 561 Old Testament Hermeneutics and Exegesis (Hebrew) 3 Credit Hours Fall 2010

1 Chronicles - Nehemiah: Up from the Ashes

Transcription:

ISO/IEC JTC1/SC2/WG2 N2840 L2/04-310 2004-07-29 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation еждународная организация по стандартизации Doc Type: Working Group Document Title: Proposal to add HEBREW POINT HOLAM HASER FOR VAV to the BMP of the UCS Source: Michael Everson & Mark Shoulson Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Date: 2004-07-29 This document requests an additional character to be added to the UCS and contains the proposal summary form. A. Administrative 1. Title Proposal to add HEBREW POINT HOLAM HASER FOR VAV to the BMP of the UCS. 2. Requester s name Michael Everson & Mark Shoulson. 3. Requester type (Member body/liaison/individual contribution) Individual contribution. 4. Submission date 2004-07-29 5. Requester s reference (if applicable) 6. Choose one of the following: 6a. This is a complete proposal 6b. More information will be provided later B. Technical General 1. Choose one of the following: 1a. This proposal is for a new script (set of characters) Proposed name of script 1b. The proposal is for addition of character(s) to an existing block 1b. Name of the existing block Hebrew 2. Number of characters in proposal 1 3. Proposed category (see section II, Character Categories) Category B.1 4a. Proposed Level of Implementation (1, 2 or 3) (see clause 14, ISO/IEC 10646-1: 2000) Level 3. 4b. Is a rationale provided for the choice? 4c. If YES, reference Combining character. 5a. Is a repertoire including character names provided? 1

5b. If YES, are the names in accordance with the character naming guidelines in Annex L of ISO/IEC 10646-1: 2000? 5c. Are the character shapes attached in a legible form suitable for review? 6a. Who will provide the appropriate computerized font (ordered preference: True Type, or PostScript format) for publishing the standard? Michael Everson. TrueType. 6b. If available now, identify source(s) for the font (include address, e-mail, ftp-site, etc.) and indicate the tools used: Michael Everson. Fontographer. 7a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes, see bibliography below. 7b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? 8. Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes, see below. 9. Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database http://www.unicode.org/public/unidata/ UnicodeCharacterDatabase.html and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard. Yes, see Unicode properties below. C. Technical Justification 1. Has this proposal for addition of character(s) been submitted before? If YES, explain. In document L2/04-193 this character is described as HEBREW POINT LEFT HOLAM. That document recommends a joinerbased solution for the problem; as noted below, we believe such a solution is not preferable to the new character proposed here. 2a. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? 2b. If YES, with whom? Discussion on the hebrew@unicode.org list. 2c. If YES, available relevant documents 3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? 4a. The context of use for the proposed characters (type of use; common or rare) Liturgical typesetting. 4b. Reference See examples below. 5a. Are the proposed characters in current use by the user community? 5b. If YES, where? See examples below. 6a. After giving due considerations to the principles in Principles and Procedures document (a WG 2 standing document) must the proposed characters be entirely in the BMP? 6b. If YES, is a rationale provided? 6c. If YES, reference All Hebrew points are in the BMP. 7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? N/A. 2

8a. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? While it is similar to HEBREW POINT HOLAM it contrasts in position and sometimes size and height. It it not a positional variant. It is functionally a different character. 8b. If YES, is a rationale for its inclusion provided? 8c. If YES, reference 9a. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? 9b. If YES, is a rationale for its inclusion provided? 9c. If YES, reference 10a. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? 10b. If YES, is a rationale for its inclusion provided? 10c. If YES, reference It is derived from HEBREW POINT HOLAM but it has a different placement, sometimes shape, and it has a different interpretation. 11a. Does the proposal include use of combining characters and/or use of composite sequences (see clauses 4.12 and 4.14 in ISO/IEC 10646-1: 2000)? 11b. If YES, is a rationale for such use provided? 11c. If YES, reference 12a. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? 12b. If YES, reference 13a. Does the proposal contain characters with any special properties such as control function or similar semantics? 13b. If YES, describe in detail (include attachment if necessary) 14a. Does the proposal contain any Ideographic compatibility character(s)? 14b. If YES, is the equivalent corresponding unified ideographic character(s) identified? 14c. If YES, reference D. Proposal The early Semitic letter VAV was originally used consonantally, but by the time the Square Hebrew script (encoded as Hebrew ) came into being, the use of VAV as one of the matres lectionis was well established. In unpointed Hebrew text, VAV can be read as a consonant [w] ([v] in modern dialects), or as an indicator of a vowel [o] or [u]. Various systems of pointing arose to allow for more precise readings. In the Tiberian system most widely used today, one vowel point relevant to the present proposal is U+05B9 HEBREW POINT HOLAM, which is used to indicate [o]. This point is unusual among the Hebrew vowel points in the Tiberian system, in that it is positioned above the letter which bears it, not below the letter like all the others. Its form is that of a dot, placed either above the left edge of the letter or over the space between it and the next letter. The original, proper position appears to have been between the letters, or at least overhanging a little; placing it on the top left of the letter which bears it may be a concession to printing technology. When HEBREW POINT HOLAM is used with VAV, the situation is more complex, because VAV can be read either with its original consonantal value, or as a mater lectionis. It must be understood that there is a terminological difference between holam (the vowel [o]) appearing with a consonant and HOLAM (the point indicating that vowel). So it is true to say that in pointed Hebrew, mem holam [mo] can be represented as MEM HOLAM or as ˆ MEM VAV HOLAM. The letter VAV is troublesome in this context because even when pointed, VAV plus POINT HOLAM can be ambiguous; it can be read [o] or [wo] ([vo]). A VAV used as a mater lectionis with a POINT HOLAM is called 3

holam male full holam ; when the POINT HOLAM is used on a consonant without the mater VAV, it is called holam haser deficient holam. But VAV can also be used as a consonant with holam haser in which case it can look just like the holam male. In order to make a distinction between these two uses of POINT HOLAM and VAV, writers of Hebrew manuscripts as early as the Aleppo Codex (ca. 900 CE) took to placing the dot on the VAV with holam haser further to the left of the dot on VAV when holam male. Indeed, some of the early manuscripts have the dot on the holam male decidedly to the right of the VAV, between it and the letter before, suggesting graphically that it is really a POINT HOLAM on the previous letter with a silent VAV mater written afterwards. In general, however, the holam male is generally drawn with the dot either squarely on top of the VAV or over a bit to the right of its centre (but still atop the VAV):. The VAV with holam haser has its dot over to the left, as POINT HOLAM is generally drawn on other letters:. The plain-text distinction made here, from a millennium ago in scribal manuscripts and later in careful typography, cannot be represented in Unicode and ISO/IEC 10646 at present. By far and away the most common use of VAV with POINT HOLAM is that of the mater lectionis vowel; the VAV with holam haser is the marked case, which is why we propose that a new character be added to represent that case. To quantify the matter, we compared the use of holam male vs. VAV with holam haser in the online Biblia Hebraica Stuttgartensia (ebhs): there are 34,699 instances of the former, and 421 instances of the latter in other words, holam male comprises 98.79% of the cases involving HOLAM on VAV and consonantal VAV with holam haser comprises only 1.21% of the cases in the same text. The proposal for a new character here is analogous to the proposal made in N2755R to distinguish QAMATS QATAN from QAMATS. In texts which do not distinguish the two, QAMATS is used as the generic mark; reading rules only distinguish them. In texts which do make a distinction, the QAMATS QATAN can be used. Here, we likewise recognize that in texts which do not distinguish the two, HOLAM is used as the generic mark; reading rules only distinguish them. In texts which do make a distinction, the HOLAM HASER FOR VAV can be used. The name of the character clearly indicates the intended use of the character. Discussion on the hebrew@unicode.org list has resulted in a different proposal for dealing with this to represent HOLAM HASER FOR VAV by means of a sequence VAV ZWNJ HOLAM. The precedent claimed for this use of ZWNJ after a base character and before a combining character is the sequence defined for Bengali Reph and Ya-phalaa in The Unicode Standard version 4.0.1. We contend that the precedent claimed in that proposal is not appropriate for the Hebrew usage described here. The sequence BENGALI LETTER RA + BENGALI SIGN VIRAMA + BENGALI LETTER YA is a normal sequence in a quintessentially ligating script where the RA takes a special form when ligated with YA. In the sequence BENGALI LETTER RA + ZWNJ + BENGALI SIGN VIRAMA + BENGALI LETTER YA the ZWNJ does not cause the VIRAMA to change its shape or position it simply indicates that the script s normal ligation behaviour is changed so that the RA retains its shape and the YA takes on a special shape. We contend that the proposal to use ZWNJ for Hebrew is an inappropriate extension of the use of the joiner because Hebrew is not a ligating script. The letter VAV is a base letter and the POINT HOLAM is a combining diacritical mark; is no more a ligature than ö is. We believe that the encoding principle accepted for QAMATS QATAN should be applied here as well a changed shape for the unmarked point should be reflected in plain text with the use of an explicitlyencoded character, for those users who prefer to make that distinction. 05B9 ü HEBREW POINT HOLAM used generically or as holam male with vav in orthography which distinguishes it from holam haser (05C7) û HEBREW POINT HOLAM HASER FOR VAV 4

Unicode Character Properties 05C7;HEBREW POINT HOLAM HASER FOR VAV;Mn;19;NSM;;;;;N;;*;;; Figures Here are some examples of texts which distinguish the two points. Holam male points are indicated with a featherless arrow, and VAV with holam haser points are indicated with a feathered arrow. Figure 1. From the Aleppo Codex, ca. 900 CE. The word is ba ăwono by his sin, from Joshua 22:20, with HOLAM HASER FOR VAV on the first (rightmost) VAV and with HOLAM on the second (leftmost) VAV. Figure 2. From the Leningrad Codex 1006. The word is ăwono his sin, from Leviticus 5:1, with HOLAM HASER FOR VAV on the first VAV and with HOLAM on the second VAV. Figure 3. From the Lisbon Codex, 1492. The word is ăwono his sin, Leviticus 5:1, with HOLAM HASER FOR VAV on the first VAV and with HOLAM on the second VAV. Figure 4. From the Biblia Rabbinica, Joseph ben Hayyim, 1524 5. The word is ăwono his sin, from Leviticus 5:1, with HOLAM HASER FOR VAV on the first VAV and with HOLAM on the second VAV. Figure 5. From Letteris 1866. The word is ăwono his sin, from Leviticus 5:1, with HOLAM HASER FOR VAV on the first VAV and with HOLAM on the second VAV. 5

Figure 6. From Feyerabend [1961]. The word is ăwon sin, with HOLAM on the first VAV and with HOLAM HASER FOR VAV on the second VAV. Figure 7. From the Biblia Hebraica Stuttgartensia, Schenker 1977. The word is ăwono his sin, with HOLAM HASER FOR VAV on the first VAV and with HOLAM on the second VAV. Figure 8. From Snaith [1995]. The word is ăwono his sin, from Leviticus 5:1, with HOLAM on both VAVs. This is an example of the many cases where no distinction is being made. Figure 9. From Scherman 1996. The word is ăwono his sin, with HOLAM HASER FOR VAV on the first VAV and with HOLAM on the second VAV. Figure 10. From The Jersualem Bible, Fisch 1997. The word is ăwono his sin, with HOLAM HASER FOR VAV on the first VAV and with HOLAM on the second VAV. Figure 11. From The Jersualem Bible, Fisch 1997. The word is ba ăwono by his sin, from Joshua 22:20, with HOLAM HASER FOR VAV on the first VAV and with HOLAM on the second VAV. 6

Figure 12. From Humash ha-menuqad, 1999. The words are miṣwotaj my commandments and ḥuqqotaj my laws, from Genesis 26:5, with HOLAM HASER FOR VAV on the first VAV and with HOLAM on the second VAV. Figure 13. From Humash ha-menuqad, 1999. The Horev Bible is very much intended for children; even all the commentaries are pointed. The examples here are from Rashi s commentary on Genesis 4:13: wətaḥtonīm, wa ăwonī ( and lower ones, and my sin ), with HOLAM on the first VAV and with HOLAM HASER FOR VAV on the second VAV. Figure 14. From Yardeni 2002. This text is in a presecriptive section describing best practice for designing inscriptions and typefaces. While Yardeni does not refer to holam when used with consonantal VAV, it is clear that she considers the position of holam in its more common usage to have a positioning behaviour which differs from that of holam with other letters. 7

Bibliography Feyerabend, Karl. [1961]. Langenscheidt s pocket Hebrew dictionary to the Old Testament. Hebrew- English. Langenscheidt. ISBN 3 468 97082 X Fisch, Harold, ed. 1997. The Holy Scriptures (The Jerusalem Bible). Jerusalem: Koren Publishers. Horev Publishing House. 1999. Humash ha-menuqad. Jerusalem: Horev Publishing House. Joseph ben Hayyim. 1524-25. Biblia Rabbinica. Venice: Daniel Bomberg. Letteris, Meir, ed. 1866. Sefer Torah, Nevi im u-khetuvim. London: British and Foreign Bible Society. Schenker, A., ed. 1977. Biblia Hebraica Stuttgartensia. 5th edition. Stuttgart: Deutsche Bibelgesellschaft. ISBN 3 438 05218 0 Scherman, Nosson, ed. 1996. The Stone Edition Tanach. New York: Mesorah Publications Ltd. Snaith, Norman Henry, ed. [1995?]. Sefer Torah, Nevi im u-khetuvim. London: British and Foreign Bible Society. ISBN 0 564 00029 9 Yardeni, Ada. 2002. The book of the Hebrew script: history, palaeography, script styoles, calligraphy & design. New Castle, Delaware: Oak Knoll Press; London: British Library. ISBN 1-58456-087-8 (Oak Knoll) and 0-7123-4793-3 (British Library) 8