Response to the Proposal to Encode Phoenician in Unicode. Dean A. Snyder 8 June 2004

Similar documents
Elaine Keown Fri, June 4, 2004 Tucson, Arizona

This is a preliminary proposal to encode the Mandaic script in the BMP of the UCS.

TO: Unicode Technical Committee FROM: Deborah Anderson Subject: Expert Feedback on Phoenician DATE: 31 May 2004

Responses to Several Hebrew Related Items

This document requests an additional character to be added to the UCS and contains the proposal summary form.

Advanced Hebrew Open Book Quiz on Brotzman s Introduction

Proposal to encode Grantha Chillu Marker sign in Unicode/ISO 10646

ש פה Word Study: Language

Karljürgen G. Feuerherm

Proposal to encode svara markers for the Jaiminiya Archika. 1. Background

Request to encode South Indian CANDRABINDU-s. Shriramana Sharma, jamadagni-at-gmail-dot-com, India 2010-Oct Background

Proposal to encode Al-Dani Quranic marks used in Quran published in Libya. For consideration by UTC and ISO/IEC JTC1/SC2/WG2

DIRECTIONS: 1. Color the title 2. Color the three backgrounds 3. Use your textbook to discover the pictures; Color once you can identify them

Proposal to Encode the Typikon Symbols in Unicode

This document requests an additional character to be added to the UCS and contains the proposal summary form.

Xerox Research Center Europe. 25 April at the earliest opportunity to include four additional characters,

Certification. American University of Cairo, Egypt, 2007 Center for Arabic Study Abroad, Colloquial Egyptian and Modern Standard Arabic

Proposal to Encode the Typikon Symbols in Unicode: Part 2 Old Rite Symbols

Summary. Background. Individual Contribution For consideration by the UTC. Date:

Facets of Hebrew and Semitic linguistics Yale, week 5, September 24, 2013

ISO/IEC JTC/1 SC/2 WG/2 N2474. Xerox Research Center Europe. 25 April 2002, marked revisions 17 May 2002

1. STUDENTS WILL BE ABLE TO IDENTIFY THE MAJOR GEOGRAPHIC FEATURES OF MESOPOTAMIA AND THE SURROUNDING MIDDLE EAST

Transcription ICANN London IDN Variants Saturday 21 June 2014

The Letter Alef Is The First Letter Of The Hebrew

Hosted by Search News

WORD STUDY ELI ELI LAMA איל איל I SABACHTHANI PART למנא שׁבקתני

The Unicode Standard Version 10.0 Core Specification

Mesopotamian civilizations formed on the banks of the Tigris and Euphrates rivers in what is today Iraq and Kuwait.

LC Classification for Biblical Studies

Peoples in the Eastern Mediterranean WORLD HISTORY

MOVING TO A UNICODE-BASED LIBRARY SYSTEM: THE YESHIVA UNIVERSITY LIBRARY EXPERIENCE

ISO/IEC JTC1/SC2/WG2 N3816

Joel S. Baden Yale Divinity School New Haven, Connecticut

Chapter 2. Early Societies in Southwest Asia and the Indo-European Migrations. 2011, The McGraw-Hill Companies, Inc. All Rights Reserved.

OLD TESTAMENT (OT) Old Testament (OT) 1

Opener - According to the text what 3 things should you know by the end of Chapter 1?

The L o s t. Ge n e s i s. Ancient Cosmology and the Origins Debate

Near Eastern Languages and Civilizations

Review of Books on the Book of Mormon

Proposal to Encode the Typikon Symbols in Unicode

Study Description: The Moses Controversy Patterns of Evidence: The Moses Controversy PatternsOfEvidence.com/Moses

Book Review of. The Language of Contemporary Assyrians (Sureth)

HOW WE GOT THE BIBLE #1 THE BIBLE COMBS INTO BEING SYNOPSIS: The history of writing goes back to the remote past. Writing was being practised

The 48 Laws of Power Through Spirituality

Proposal to encode Quranic marks used in Quran published in Libya (Narration of Qaloon with script Aldani)

Chapter 4 The Hebrew Alphabet

Survey of the Old Testament

N3976R L2/11-130R

Chapter 3 PHILOSOPHICAL ETHICS AND BUSINESS CHAPTER OBJECTIVES. After exploring this chapter, you will be able to:

World History I Mrs. Rogers Sem

Babylonian, Dead For Millennia, Now Online

8. The word Semitic refers to A. a theocratic governmental form. B. a language type. C. a monotheistic belief system. D. a violent northern society

Etymological Study of Semitic Languages (Arabic and Hebrew) Chapter two. Semitic languages

WHAT SHOULD A COMMENTARY COMMENT ON? Richard Elliott Friedman

Houghton Mifflin Harcourt Collections 2015 Grade 8. Indiana Academic Standards English/Language Arts Grade 8

ON GENERAL CHARACTERISTICS OF THE WRITING CULTURE OF PRE-MASHTOTSIAN ARMENIA. Summary

Above: Tigris River Above: Irrigation system from the Euphrates River

What is Civilization?

Frequently Asked Questions about Mid American Indian Fellowships with answers given by MAIF Consultant/Helper Robert Francis

Chapter 2 Lesson 2 Peoples in the Eastern Mediterranean

The Personal, Professional and Spiritual Success Mastery Program Created by

World History Topic 2 Reading Guide Ancient Middle East and Egypt

8/6/2013. Why did civilizations. occur?

6.041SC Probabilistic Systems Analysis and Applied Probability, Fall 2013 Transcript Lecture 3

Salam! [Sah-lahm] Hello in Persian

The Unicode Standard Version 7.0 Core Specification

The Cosmopolitan Middle East, BCE

Richard L. W. Clarke, Notes REASONING

A. Administrative. B. Technical -- General

Differentiated Lessons

A note about the notion of exp 10 (log 10 (modulo 1))(x)

De Nomine Sancto (Concerning the Holy Name)

Proceedings of the Meeting & workshop on Development of a National IT Strategy Focusing on Indigenous Content Development

Ancient Mesopotamia: Cradle of Civilization

Proposal to add two Tifinagh characters for vowels in Tuareg language variants

A JERUSALEM MASTER'S PROGRAM IN ANCIENT PHILOLOGY

The Language Jesus Spoke, by Rick Melnick

TRANSCRIPT. IDN PDP Working Group 1 Meeting Costa Rica 15 March 2012

A Rough Timeline Covering the most of the time frame of the two books

Babylon was the center of the Babylonian empire. It s location made it a crossroads of trade making it a very rich city.

Comments on Grantha OM

The Vocation Movement in Lutheran Higher Education

Lucky to Know? the nature and extent of human knowledge and rational belief. We ordinarily take ourselves to

THE INTERCHANGE OF SIBILANTS AND DENTALS IN SEMITIC.

Name: Period 1: 8000 B.C.E. 600 B.C.E.

AN INTRODUCTION TO THE STUDY OF CHRISTIAN MISSIONS HAROLD R. COOK CHAPTER TWENTY-TWO. WHAT A MISSIONARY DOES (Concluded) LEADERSHIP TRAINING

An Old Babylonian Version of the Gilgamesh Epic. On the Basis of Recently Discovered Texts. And

This title is also available at major online book retailers. Copyright 2011 Dr. Adam Yacoub All rights reserved.

Assignments. HEBR/REL-131 & HEBR/REL-132: Elementary Biblical Hebrew I & II, Academic Year Charles Abzug

1 Chronicles - Nehemiah: Up from the Ashes

EXECUTIVE SUMMARY. A Seminary of Intentional Relationships Delivering Theological Education. For the 21 st Century

Copyright 1998, 2001 by Franklin Covey Co. All rights reserved.

Chattha Sangayana CD. Dhananjay Chavan, Vipassana Research Institute, India

STATEMENTS ON MY FAITH AS A BELIEVER IN JESUS CHRIST

Tel Dan Inscription. The Assyrian Empire.

NEJS 101a Elementary Akkadian-Fall 2015 Syllabus

HISTORY 4263: 005 A HISTORY OF JEWS IN THE MIDDLE EAST Dr. Nancy L. Stockdale Spring 2016 Time: Tuesdays 6:30 PM 9:20 PM Place: Wooten Hall 121

THE HISTORY OF WRITING. Anne Pallant. 13 June 2007

6. Considerable stimulus for international trade throughout the Near East.

THE TRANSMISSION OF THE OLD TESTAMENT. Randy Broberg, 2004

Transcription:

JTC1/SC2/WG2 N2792 Response to the Proposal to Encode Phoenician in Unicode Dean A. Snyder 8 June 2004 I am a member of the non-teaching, research faculty in the Department of Computer Science, Johns Hopkins University. My educational background includes 12 years of higher education in ancient and modern languages and linguistics, including 3 years of Classics at the University of Oklahoma and 5 years of graduate work in Comparative Semitics at the Oriental Institute of the University of Chicago. My employment history includes 12 years of commercial software engineering, including writing internationalization tools in C++ for a pioneering email application and networking and database integration layers for group-ware applications. My job at Hopkins has allowed me to apply my computing interests to my research interests in ancient languages. I originated, organized, participated in, and secured funding for the Initiative for Cuneiform Encoding, the group of international cuneiform scholars and Unicode experts responsible for the current Sumero-Akkadian cuneiform encoding proposal to Unicode. I originated and am managing at Johns Hopkins University the Digital Hammurabi Project, which received a 3-year, $1.65 million research grant from the National Science Foundation to develop both new hardware and new software to capture and render high-resolution 3D images of ancient cuneiform tablets and to complete the encoding of cuneiform. I therefore write this response against the current proposal (N2746R2) to encode Phoenician in Unicode based on my experience in these two areas - research experience working in the native scripts of several languages and dialects of the ancient Near East and Mediterranean - Hebrew, Aramaic, Ugaritic, Akkadian, Syriac, Arabic, Phoenician, Moabite, Greek, and Latin - and software engineering experience in text processing and internationalization. Basically, I believe the needs to encode texts in the ancient Northwest Semitic scripts are well met by the current Hebrew block in Unicode. ******************************** As of right now, I have, in the "Canaanite" folder of my email application, 1480 emails devoted to the topic of encoding Phoenician, collected over the last few months from both the Unicode and Hebrew email lists and from private emails. This has been a very controversial proposal. I have read almost all of these emails more than once; and I'm afraid that I, myself, have been guilty of contributing a significant number of them to this torrent. The debate has become heated and personal at times, but I have striven to limit my involvement to the technical issues at hand. And so, trying not to spill onto you too much froth from this verbal stream, let me give you as succinctly as I can, what I believe

to be the salient arguments for and against a separate Unicode encoding for Phoenician [1]. [Responses against separately encoding Phoenician, most of them my own, but some gathered from others, follow immediately each numbered claim made below in support of a separate encoding.] 1) Phoenician is a separate script. Phoenician is not a separate script. Phoenician is but one paleographic assemblage in the script continuum known as Northwest Semitic which also includes, for example, Imperial Aramaic and Jewish Hebrew diascripts [2]. These Ancient Northwest Semitic diascripts all: 1) have the same 22 (letter) characters, 2) in the same order, 3) with the same names; 4) are written in the same direction, right-to-left; 5) are used to write texts in dialects of the same or closely related languages; 6) are found in geographically contiguous areas; 7) are legible to the same ancient people; 8) and have been unified by Semitists for centuries. The combination of all these traits forms a compelling argument that the Northwest Semitic diascripts form one script, being merely paleographic variants of one another. 2) Phoenician is an historically important node on the West Semitic script tree, with other scripts, most notably Greek, descending from it; Greek does not derive from Hebrew. There is no doubt that Phoenician is historically important, but there are several lines of evidence supporting the view that the Greeks borrowed the alphabet much earlier than previously thought and borrowed it from some unknown, but non-phoenician, Canaanite people. 3) Phoenician is illegible to modern Hebrew readers. Legibility is not a determining factor in script differentiation. Respondents have mentioned, for example cyphers, Fraktur, Suetterlin, etc. I would add that I can't imagine a more disparate set of glyphs than those of archaic cuneiform compared to, say, Neo-Assyrian cuneiform, and yet I know of no one who would argue for encoding them separately. (Archaic cuneiform is in fact, technically not even cuneiform at all, since the signs were not formed by making wedge-shaped impressions in clay, but rather by drawing the cursive lines of archaic pictograms on the clay.)

But the real problem is that the illegibility argument is based on modern Hebrew readers, whereas the encoding is aimed at ancient texts, and we have evidence that both Phoenician and Jewish Hebrew diascripts were legible to ancient Jews for example, they employed both, contemporaneously, in writing Dead Sea scrolls, jar labels, coins, and documents.) Furthermore there are even examples of the use of Paleo-Hebrew script in modern Israel on coins and public art. 4) Hebrew has a whole slew of "baroque signs" that do not apply to Phoenician. The same can be said for the accents, minuscules, and symbols of modern Greek compared to ancient Greek, or modern European Latin scripts compared to Roman Imperial Latin. But no one wants to encode these separately even though their descendants are much more complex. 5) There are some modern users of Phoenician who would like to display and process Phoenician in plain text, in particular Indo-Europeanists and script historians. For example, online articles dealing with Greek and Phoenician, should show Phoenician letters, and not Hebrew. This kind of request is a slippery slope into paleographic quicksand (aka glyph-based encodings). And these are precisely the sorts of things for which text mark-up was designed. Cuneiformists, too, will want to visually differentiate, for example, archaic cuneiform from Neo-Assyrian cuneiform; and they will be "forced" to use fonts. 6) If Phoenician is considered a glyphic variation of Hebrew, then it can also be considered a glyphic variation of Greek. The relationship between Phoenician and Hebrew cannot be compared to the relationship between Phoenician and Greek: Phoenician and Hebrew have the same 22 consonants; Greek added letters, dropped letters, and re-deployed letters. Phoenician and Hebrew are written right-to-left; Greek was written right-to-left, left-toright, and boustrophedon. 7) Jews believe that the Phoenician script is so different that its use in writing a Torah scroll rendered that scroll unfit for ritual use. But Jews so regard any Hebrew script other than square Hebrew. Do we therefore want to separately encode, for example, cursive Hebrew? (Compare also the practise in Amish and Hutterite religious communities where the Bible is still printed in Fraktur instead of

Roman fonts.) But what is really remarkable about this argument is that it is proof that the Phoenician script was legible to literate Jews - the argument was not that no one would be able to read it or even that it would be difficult to read, but rather that it was an undesirable font. 8) Phoenician lacks Hebrew's final forms. Ancient Greek lacks word final sigma, along with all the minuscules - nevertheless we do not want to separately encode ancient Greek. 9) It's much harder in software to distinguish scripts that have been unified than to jointly process separated scripts. The same thing could be said about Fraktur and Latin, or, for that matter, Neo-Assyrian and Old Babylonian cuneiform. The difficulty in separately processing plain text written in these scripts is not incentive enough to justify separately encoding them. The classification of written materials for bibliographical use is different from the classification of writing systems for encoding. For a reader faced with the choice of locating a Fraktur or Roman edition of a German classic, or a Paleo-Hebrew or Jewish Hebrew edition of the Torah, having that information is clearly valuable and meaningful. On the other hand, for the purposes of encoding, being able to sort, transmit and search both texts the same way is more important and the distinction is properly relegated to fonts (i.e. to rich text) in this case. 10) Semitic scholars who don't want a separately encoded Phoenician in Unicode can simply ignore any text written in it, just as they now ignore multiple competing encodings and transliterations for Northwest Semitic scripts. But the current practice among Semitists of using multiple encodings and transliterations for Northwest Semitic scripts is a problem all acknowledge. Adding yet another encoding to Unicode will only make matters worse. But what makes the issue more serious is that Unicode is forever, while we can be rid of the current problematic encodings in, say, 10 years. So why would we want to perpetuate problems like ones we are trying to eliminate now? 11) Semitists should not be the only group to decide what should be encoded; even the needs of amateur script enthusiasts should be taken into account. Absoultely. But in making an informed and fair decision, the convenience that a separate encoding would give to script enthusiasts and even some scholars should be judiciously weighed against the negative impact a separate encoding would have on the scholarly

community that uses the script every day. Semitists typically study the ancient documents in this script together, with a very high degree of cross-pollinating research going on. The languages are similar, the diascripts are similar, and there are pervasive political, cultural, linguistic, and religious interactions over the centuries between the geographically proximate users of this script. That's why they study them together, and why there are many degree-granting programs around the world in Northwest Semitics. It's the combination of these factors, along with others previously mentioned, that I believe inform a wise decision to keep these diascripts together in one encoding and not separate them out. ******************************** In the process of hashing out and reviewing these ideas on the Unicode email lists over the last several weeks, I came, at one point, to the conclusion that what was really needed was a separate encoding for Archaic Greek. After all, it is very different from classical Greek, in fact much more different than Phoenician is from Jewish Hebrew. The glyphs are very different; the writing direction was all over the place - right-to-left, left-to-right, boustrophedon; and the character inventories varied from place to place and time to time during the archaic period. Furthermore, encoded Archaic Greek would give to Indo- Europeanists a much more powerful tool than encoded Phoenician. Such a proposal would indeed be exceedingly simple to write, but in the end I rejected the idea for reasons similar to why I think separately encoding Phoenician is not a good idea. Why would I want to search for ΕΡΟΣ in two places when I could find it in one? [1] Phoenician is an inaccurate name for the paleographic assemblage referenced in this proposal; a better name would be "Old Canaanite". Phoenician is but one of the languages written with this script variant, others being Old Hebrew, Old Aramaic, Ammonite, Moabite, and Edomite. Furthermore, even acknowledging that the ancient Greeks believed, as do some modern scholars, that they borrowed the foundation for their alphabet from the Phoenicians, there is evidence that this may not be correct, that the Greeks really borrowed the script much earlier from some non-phoenician Canaanite peoples, giving thus another reason to call this paleographic assemblage "Old Canaanite". Nevertheless, for the sake of simplicity and to avoid being overbearing I will use the proposal's term "Phoenician" in my remarks. [2] Diascript is to script as dialect is to language.