This document requests an additional character to be added to the UCS and contains the proposal summary form.

Similar documents
This document requests an additional character to be added to the UCS and contains the proposal summary form.

This is a preliminary proposal to encode the Mandaic script in the BMP of the UCS.

Proposal to encode Al-Dani Quranic marks used in Quran published in Libya. For consideration by UTC and ISO/IEC JTC1/SC2/WG2

Proposal to Encode the Typikon Symbols in Unicode: Part 2 Old Rite Symbols

Proposal to Encode the Typikon Symbols in Unicode

ISO/IEC JTC1/SC2/WG2 N3816

Proposal to encode svara markers for the Jaiminiya Archika. 1. Background

Proposal to encode Quranic marks used in Quran published in Libya (Narration of Qaloon with script Aldani)

Request to encode South Indian CANDRABINDU-s. Shriramana Sharma, jamadagni-at-gmail-dot-com, India 2010-Oct Background

Proposal to Encode the Typikon Symbols in Unicode

If these characters were in second position in a cluster, would they interfere with searching operations? Example: vs.

Summary. Background. Individual Contribution For consideration by the UTC. Date:

N3976R L2/11-130R

@ó 061A

N3976 L2/11-130)

Responses to Several Hebrew Related Items

A. Administrative. B. Technical -- General

Issues in the Representation of Pointed Hebrew in Unicode

Additional digits Since the 1960s Shan digits have been used alongside Myanmar and European digits.

+ HETH ḥw = WAW. ḥr = RESH + HETH. br = RESH + BETH + HETH ḥd = DALETH

L2/ Background. Proposal

Proposal to Encode Alternative Characters for Biblical Hebrew

VOWEL SIGN CONSONANT SIGN SHAN MEDIAL WA contrasts with the

Typographic Concerns and the Hebrew Nomina Sacra

Ê P p P f Í Ṣ ṣ Ṣ ž? ˆ Š š Š č, ǰ. œ BI bi BI be. œ LIḄA lebba heart RḄH rabba great

ISO/IEC JTC1/SC2/WG2 N25xx

4. Radicals. The chief issue about which we would like feedback at this time is the question of the encoding of Jurchen radicals.

Response to the Proposal to Encode Phoenician in Unicode. Dean A. Snyder 8 June 2004

Proposal to encode Grantha Chillu Marker sign in Unicode/ISO 10646

Chapter 40 The Hebrew Bible

THE TRANSMISSION OF THE OLD TESTAMENT. Randy Broberg, 2004

The Making of the Hebrew Bible

Lesson 1- Formation of the Bible- Old Testament

Assignments. HEBR/REL-131 &132: Elementary Biblical Hebrew I, Spring Charles Abzug. Books and Other Source Materials for the Assignments:

Assignments. HEBR/REL-131 &132: Elementary Biblical Hebrew I, Spring Charles Abzug. Books and Other Source Materials for the Assignments:

JTC2/SC2/WG2 N 2190 Date:

Introducing This Study Series Leading This Study SESSION 1: The Making of the Hebrew Bible... 11

CONTENTS. Preface 13. Introduction 15. Chapter One: The Man and his Works against the Background of his Time 23

2. Processing. Imperial Aramaic is an alphabetic script written right-to-left, in scriptio continua or with spaces between words.

RBL 02/2005 Goshen-Gottstein, Moshe, and Shemaryahu Talmon, eds.; Galen Marquis, associate editor

The Bible's Many Voices. Study Guide/Syllabus

MOVING TO A UNICODE-BASED LIBRARY SYSTEM: THE YESHIVA UNIVERSITY LIBRARY EXPERIENCE

Scriptural Promise The grass withers, the flower fades, but the word of our God stands forever, Isaiah 40:8

Assignments. HEBR/REL-131 & HEBR/REL-132: Elementary Biblical Hebrew I & II, Academic Year Charles Abzug

ford residence southampton, ny

tone marks. (Figures 4, 5, 6, 7, and 8.)

1 RAÑJANA encompasses: Rañjana (Figure 1, 2, 3) Wartu (Figure 4)

Proposal to Encode the Mark's Chapter Glyph in theunicode Standard

Look Learn Understand & Respect. One We care for the earth God is the creator, he cares for us God is creator of the world

Î 2CEB Ï 2CEC Ì 2CED Ó FE26 COMBINING CONJOINING MACRON

Mark McEntire Belmont University Nashville, Tennessee

Tips for Using Logos Bible Software Version 3

This is a preliminary proposal to encode the Chakma script in the BMP of the UCS.

Mikraot Gedolot haketer--biblia Rabbinica Behind the scenes with the project team

How the Bible Came to Us

God s Ways and God s Words

Proposal to add two Tifinagh characters for vowels in Tuareg language variants

From Garden to Exile to Garden Again An Old Testament Survey: A Literary Approach Mako A. Nagasawa Last modified: October 15, 2017

"Fuldensis, Sigla for Variants in Vaticanus and 1Cor 14:34-5" NTS 41 (1995) Philip B. Payne

6JSC/LC/31/rev/Sec final 17 February 2015 Page 1 of 13

LET S STUDY ONKELOS. By Stanley M. Wagner and Israel Drazin

A1. The Bible Part 1 2 Timothy 3:14-17, Psalm 12:6, Psalm 119:160

ISO/IEC JTC1/SC2/WG2 N4283 L2/12-214

Proposal to Encode Shiva Linga Symbols in Unicode

Thank You and Many Blessings!

How did the Bible get chapters and verses?

Transmission: The Texts and Manuscripts of the Biblical Writings

CODE COLLISSIONS IN THE PROPOSAL OF MICHAEL EVERSON! Working document with error samples from N3532

AKC 4: The Physical Production of the Bible

*John H. Walton, Ancient Near Eastern Thought and the Old Testament: Introducing the Conceptual World of the Hebrew Bible

CODE COLLISSIONS IN THE PROPOSAL OF MICHAEL EVERSON! Working document with error samples from N3532-N3697

REL 011: Religions of the World

Thomas Römer University of Lausanne Lausanne, Switzerland CH-1004

The Unicode Standard Version 8.0 Core Specification

The Unicode Standard Version 11.0 Core Specification

Proposal to encode Vedic characters for the Grantha script. 1. Characters being proposed

ISO/IEC JTC1/SC2/WG2 N2972

REFLECTIONS ON MAIMONIDES' EIGHTH PRINCIPLE OF FAITH: ITS IMPLICATIONS FOR ORTHODOX BIBLE STUDENTS

INTRODUCTION TO THE Holman Christian Standard Bible

The Transmission of the OT Text

TOWARDS UNICODE STANDARD FOR URDU - WG2 N2413-1/SC2 N35891

An Orientation to Romans

ISO/IEC JTC1/SC2/WG2 Coded Character Set Secretariat: Japan (JISC)

by Mark S. Haughwout Copyright 2010 Mark S. Haughwout - all rights reserved Please include a link to this web page when quoting.

ƒ Δ ~ ÀÃÕŒœ ÿÿ Ä Å Ç É Ñ Ö Ü á à â ä ã å ç èê ë í ì î ñ ó ô õ ß Ø ± π ª

Advanced Hebrew Open Book Quiz on Brotzman s Introduction

The Unicode Standard Version 7.0 Core Specification

Follow-up to Extended Tamil proposal L2/10-256R. 1. Encoding model of Extended Tamil and related script-forms

St John s Theological College. Anglican Studies COURSE NUMBER BST 510 TITLE THE BIBLE STORY: OLD TESTAMENT COURSE LEVEL 5 NZQF CREDIT VALUE 15

Proposal to Encode the Grantha Script in the Basic Multilingual Plane (BMP) of ISO/IEC 10646

LISTENING AND VIEWING: CA 5 Comprehending and Evaluating the Content and Artistic Aspects of Oral and Visual Presentations

5 Year Discipleship Bible Reading Plan

The LORD s Progressive Revelation about His Plan of Redemption. The Seven Appointments of the LORD - 27

Mishnah and Tosefta RELS2100G CRN: 15529

Torah & Histories (BibSt-Fdn 3) Part 1 of a 2-part survey of the Hebrew Bible or Christian Old Testament Maine School of Ministry ~ Fall 2017

you trust? Why do you trust them? Have they ever lied to you? Do officers, etc. But they aren t perfect people. They ve all made mistakes.

The Unicode Standard Version 11.0 Core Specification

Index of Graphics 9. PART 1: INTRODUCTION AND OVERVIEW 1. Introduction to the Old Testament Overview of the Old Testament 18

OT 3XS3 SAMUEL. Tuesdays 1:30pm 3:20pm

Inspiration of the Bible - Demonstrated

Transcription:

ISO/IEC JTC1/SC2/WG2 N2708 L2/04-089 2004-02-04 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation еждународная организация по стандартизации Doc Type: Working Group Document Title: Proposal to add two Masoretic punctuation marks to the BMP of the UCS Source: Mark Shoulson, Peter Kirk, John Hudson, Michael Everson, and Peter Constable Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Date: 2004-02-04 This document requests an additional character to be added to the UCS and contains the proposal summary form. A. Administrative 1. Title Proposal to add two Masoretic punctuation marks to the BMP of the UCS. 2. Requester s name Mark Shoulson, Peter Kirk, John Hudson, Michael Everson, and Peter Constable 3. Requester type (Member body/liaison/individual contribution) Individual contribution. 4. Submission date 2004-02-04 5. Requester s reference (if applicable) 6. Choose one of the following: 6a. This is a complete proposal 6b. More information will be provided later B. Technical General 1. Choose one of the following: 1a. This proposal is for a new script (set of characters) Proposed name of script 1b. The proposal is for addition of character(s) to an existing block 1b. Name of the existing block Hebrew 2. Number of characters in proposal 2 3. Proposed category (see section II, Character Categories) Category B.1 4a. Proposed Level of Implementation (1, 2 or 3) (see clause 14, ISO/IEC 10646-1: 2000) Level 1. 4b. Is a rationale provided for the choice? 4c. If YES, reference Spacing characters. 5a. Is a repertoire including character names provided? 5b. If YES, are the names in accordance with the character naming guidelines in Annex L of ISO/IEC 10646-1: 2000? 1

5c. Are the character shapes attached in a legible form suitable for review? 6a. Who will provide the appropriate computerized font (ordered preference: True Type, or PostScript format) for publishing the standard? Michael Everson. TrueType. 6b. If available now, identify source(s) for the font (include address, e-mail, ftp-site, etc.) and indicate the tools used: Michael Everson. Fontographer. 7a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes, see bibliography below. 7b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? 8. Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes, see below. 9. Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database http://www.unicode.org/public/unidata/ UnicodeCharacterDatabase.html and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard. Yes, see Unicode properties below. C. Technical Justification 1. Has this proposal for addition of character(s) been submitted before? If YES, explain. 2a. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? 2b. If YES, with whom? Professor Aron Dotan, script expert 2c. If YES, available relevant documents 3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Printing of materials for scholars and Bible translators. 4a. The context of use for the proposed characters (type of use; common or rare) Scholarly and ritual use. 4b. Reference See examples below. 5a. Are the proposed characters in current use by the user community? 5b. If YES, where? See examples below. 6a. After giving due considerations to the principles in Principles and Procedures document (a WG 2 standing document) must the proposed characters be entirely in the BMP? 6b. If YES, is a rationale provided? 6c. If YES, reference Other Masoretic characters are also in the BMP. 7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? 8a. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? 8b. If YES, is a rationale for its inclusion provided? 8c. If YES, reference 2

9a. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? 9b. If YES, is a rationale for its inclusion provided? 9c. If YES, reference 10a. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? 10b. If YES, is a rationale for its inclusion provided? 10c. If YES, reference 11a. Does the proposal include use of combining characters and/or use of composite sequences (see clauses 4.12 and 4.14 in ISO/IEC 10646-1: 2000)? 11b. If YES, is a rationale for such use provided? 11c. If YES, reference 12a. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? 12b. If YES, reference 13a. Does the proposal contain characters with any special properties such as control function or similar semantics? 13b. If YES, describe in detail (include attachment if necessary) 14a. Does the proposal contain any Ideographic compatibility character(s)? 14b. If YES, is the equivalent corresponding unified ideographic character(s) identified? 14c. If YES, reference D. Proposal The traditional Masoretic text of the Hebrew Bible contains several anomolies of printing that have no parallel in other Hebrew usages. Many of these are already encoded in UCS, but there are a few that have thus far slipped through the cracks. 05C5 HEBREW MARK LOWER DOT lower punctum extraordinarium (Psalms 27:13) 05C6 HEBREW PUNCTUATION NUN HAFUKHA (glyph variants, ) does not historically derive from the letter nun 1. HEBREW MARK LOWER DOT In several places in the Bible, manuscripts show dots above and below the letters, even in unpointed texts. These dots, called by the scholarly community puncta extraordinaria, are of obscure meaning, but are always included in ritual and scholarly texts, as well as the ritual scrolls used in Jewish ceremony. They also appear in various non-tiberian documents (Tiberian orthography became the standard for Hebrew, but there had been others before it). The upper puncta appear 53 times in 16 words (Genesis 16:5; 18:9; 19:33; 33:4; 37:12; Numbers 3:39; 9:10; 21:30; 29:15; Deuteronomy 29:28,28; 2 Samuel 19:20; Psalms 27:13; Isaiah 44:9; Ezekiel 41:20; 46:22). And in fact, the upper punctum is already encoded in UCS, as U+05C4 HEBREW MARK UPPER DOT. But the lower punctum, which occurs only in Psalms 27:13 (see scans below), due to its even greater rarity, is as yet uncoded. In order to represent the text of the Bible properly, and for parity with the upper punctum, the lower one needs to be encoded also. Although there already is a codepoint for U+0323 COMBINING DOT BELOW, this is not acceptable for the lower punctum, because of the use of other dots above Hebrew letters when used as numbers in some contexts. The lower punctum is paired with the upper punctum, which is encoded in the Hebrew block: 05C4. It is probably acceptable to use 0307 and 0308 for Hebrew number dots, but 0323 would be confusing, especially if 0307 is used for hundreds and 05C4 is used for the upper punctum: it will make it more likely for people to get mixed up and think that 3

0307 is the puctum and 05C4 is for hundreds, especially if they're familiar with the single dot for hundreds but not the double dot for thousands. Also, many printers (for instance, see the BHS example below) make the puncta rather diamond-shaped instead of round like most other dots (Similarly, we suggest that the explanatory text Upper punctum extraordinarium be added to the listing for U+05C4). In general, the puncta are different from most other Hebrew marks in a qualitative sense. Hebrew vowels are optional, and accents are so optional as to be hardly written at all outside of restricted contexts. However, the laws regarding the writing of a Torah scroll are more precise in their prescription. In a ritual Torah scroll, only the letters are written: not the accents, not the vowels, not the sof-pasuq punctuation, not verse-numbers, nothing else. Just the letters, spacing (as indicated by traditional divisions) and the puncta (and the NUN HAFUKHA, see below). A ritually fit scroll must have them, and may not have anything else. Examples Since the lower punctum occurs only on one word in standard Bibles, it suffices to show examples of that word from various sources. Snaith Bible (1958) Letteris Bible (1866) Biblia Hebraica Suttgartensia (1997) IDF Tanakh (1982) Leningrad Codex (1009) Aleppo Codex (c. 920) Here is an example of the upper punctum in a Torah-scroll context, taken from a current use (note highlighted word). Since only the Pentateuch (and the Book of required to be written and read from a scroll, it is not common to find scrolls of Psalms, given must be from the Pentateuch (Numbers 3:39), and thus an upper punctum. special characteristic of the mark remain true for the lower punctum as well. 4

2. HEBREW PUNCTUATION NUN HAFUKHA In two places in the Bible, an unusual character appears, often in the form of a letter nun (U+05E0 HEBREW LETTER NUN) that is reversed or inverted or turned (depending on the particular scribe or printing). Although it is referred to in Judaic literature as nun hafukha ( reversed nun ), it plainly does not function as any sort of letter in the text. It is not part of a word, and it is not read aloud in any sense: it is just a mark that is written, and thus we classify it as punctuation and not a letter. In many manuscripts, it does not even resemble a transformed nun at all, and when it does, as mentioned, sometimes it appears reversed, sometimes inverted, sometimes turned 180, etc. All these glyphic variants make any official name such as REVERSED NUN or INVERTED NUN potentially misleading, so we choose simply to give it the name by which Rabbis have referred to it for centuries: nun hafukha. It might be appropriate to gloss this in the Unicode standard as not historically a nun, though conventionally named so and also called nun m zuneret. The NUN HAFUKHA appears in the book of Numbers, around verses 10:35-36, apparently setting them off from the surrounding text. According to the Talmud (Shabbat 115b-116a): Our Rabbis teach: And it came to pass, when the ark set forward, that Moses said... the Holy One, Blessed be He, made signs for this passage above and below, to show that this is not its place. Rabbi [Judah the Prince] says, not for this reason, but because it is an important book in its own right... Rabbi Jonathan says, Wisdom... hath hewn out her seven pillars... (Proverbs 9:1) these are the seven books of the Torah [i.e. Genesis, Exodus, Leviticus, Numbers up to 10:35, Numbers 10:35-36, Numbers after 10:36, Deuteronomy] (translation mine (Mark Shoulson)) It also appears seven times in the book of Psalms, chapter 107, separating verses (or, in some versions, in the margin). Stone Edition Tanakh (1996) J. H. Hertz Pentateuch (1980) 5

Biblia Hebraica Stuttgartensia (1997) Leningrad Codex (1009) Aleppo Codex (ca. 920) Psalm 107 6

Biblia Rabbinica (1525) Psalm 107 Modern Torah Scroll It is true that there are some scrolls and even printed editions (e.g. the Biblia Rabbinica, in Numbers, see below) which indicate the NUN HAFUKHA with actual inverted NUNs used as part of the text. We would argue, however, that such cases qualify not as instances of NUN HAFUKHA, which is never read, never part of a word, but is only a punctuation, but rather as instances of an ordinary NUN written in an idiosyncratic way. Indeed there are many such special letters in the Hebrew Bible, including (but not limited to) the various large and small letters, the famous broken letter VAV in Numbers 25:12, and also many lesserknown (and less-frequently observed) reversed, odd-shaped, or oddly-decorated letters in various traditions. These are all glyph variants of their respective letters, as they are treated as parts of words and read just like any other letters, thus they should be handled by special glyphs and/or higher-level markup. 7

Unicode Character Properties 05C5;HEBREW MARK LOWER DOT;Mn;220;NSM;;;;;N;;;;; 05C6;HEBREW PUNCTUATION NUN HAFUKHA;Po;0;R;;;;;N;;;;; 8