ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages"). Each script is given both a four-letter code and a numeric code.[1]
Where possible, the codes are derived from ISO 639-2 when the name of a script and the name of a language using the script are identical (for example, Gujarātī: ISO 639 guj, ISO 15924 Gujr). Preference is given to the 639-2 Bibliographic (B) codes, which differs from the otherwise often preferred use of the Terminology (T) codes.[1]
Four-letter ISO 15924 codes are incorporated into the IANA Language Subtag Registry for IETF language tags and so can be used in file formats that make use of such language tags. For example, they can be used in HTML and XML to help Web browsers determine which typeface to use for foreign text. This way one could differentiate, for example, between Serbian written in the Cyrillic (sr-Cyrl) or Latin (sr-Latn) script, or mark romanized or transliterated text as such.
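As a rough illustration of how software might read the script out of such a tag, the following Python sketch pulls the four-letter script subtag from an IETF language tag. The function name script_subtag is hypothetical, and the parsing is deliberately simplified: it assumes the tag is otherwise well-formed, and it does not consult the IANA registry to check that the code is actually registered.

```python
from typing import Optional


def script_subtag(tag: str) -> Optional[str]:
    """Return the script subtag of a language tag, or None if absent.

    Simplified sketch: the first four-letter alphabetic subtag after the
    primary language is treated as the script. Nothing is validated
    against the IANA Language Subtag Registry.
    """
    parts = tag.split("-")
    # Tags that start with a singleton (e.g. "x-...") are private use or
    # grandfathered; this sketch does not look for a script in them.
    if len(parts[0]) == 1:
        return None
    for part in parts[1:]:
        # A single-character subtag starts an extension or private-use
        # section, where a four-letter subtag is no longer a script.
        if len(part) == 1:
            break
        if len(part) == 4 and part.isascii() and part.isalpha():
            # Conventional casing for script codes: "Cyrl", "Latn".
            return part.title()
    return None


print(script_subtag("sr-Cyrl"))  # Cyrl
print(script_subtag("sr-Latn"))  # Latn
print(script_subtag("en-US"))    # None
```

Language tags are case-insensitive, so the sketch normalizes the result to the title-case form that ISO 15924 uses for its four-letter codes.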
ISO appointed the Unicode Consortium as the Registration Authority (RA) for the standard. The RA is responsible for appointing a registrar who works with a Joint Advisory Committee (JAC) in developing and implementing the standard. The registrar from 2004 to 2018 was Michael Everson, and from January 2019 the registrar has been Markus Scherer, a technical director of the Unicode Consortium.[2][3] The JAC consists of six members: one representative of the RA (Markus Scherer), one representative of ISO 639-2 (Randall K. Barry of the Library of Congress), one representative of ISO TC37 (Christian Galinski), one representative of ISO TC46 (Peeter Päll), and two representatives of ISO/IEC JTC 1/SC 2 (Rick McGowan and Ken Whistler, both also officers of the Unicode Consortium).[4]
Two four-letter codes are reserved at the request of the Common Locale Data Repository (CLDR) project:[8]
This list of codes is from the ISO 15924 standard.[8]
The following standards are referred to as indispensable by ISO 15924.
For definition of font and glyph the standard refers to
Around 160 scripts are defined in Unicode. Through a link called the "Property Value Alias", Unicode maintains a 1:1 correspondence between each script it defines and that script's ISO 15924 code. See Script (Unicode).
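The correspondence can be pictured as a two-way lookup table. The Python sketch below uses a small hand-copied subset of script names and their four-letter codes, not the full Unicode data. The names SCRIPT_ALIASES, iso_15924_code, and unicode_script_name are hypothetical, chosen only for this illustration.

```python
# Hand-copied subset of Unicode Script property values and their
# ISO 15924 four-letter codes. Illustrative only: the complete list
# lives in the Unicode Character Database, not in this table.
SCRIPT_ALIASES = {
    "Latin": "Latn",
    "Cyrillic": "Cyrl",
    "Gujarati": "Gujr",
    "Common": "Zyyy",
    "Inherited": "Zinh",
    "Unknown": "Zzzz",
}

# Invert the table. If two names shared one code, the inverse would be
# smaller than the original, so the assertion checks the 1:1 property
# for this subset.
ISO_TO_UNICODE = {code: name for name, code in SCRIPT_ALIASES.items()}
assert len(ISO_TO_UNICODE) == len(SCRIPT_ALIASES)


def iso_15924_code(unicode_script: str) -> str:
    """Look up the ISO 15924 code for a Unicode script name."""
    return SCRIPT_ALIASES[unicode_script]


def unicode_script_name(iso_code: str) -> str:
    """Look up the Unicode script name for an ISO 15924 code."""
    return ISO_TO_UNICODE[iso_code]


print(iso_15924_code("Gujarati"))   # Gujr
print(unicode_script_name("Zinh"))  # Inherited
```

Because the relation is one-to-one, either direction can be derived from the other, which is why the sketch stores only one table and builds the inverse from it.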
Inherited is the Unicode script property value of "characters that may be used with multiple scripts, and that inherit their script from a preceding base character. These include nonspacing combining marks and enclosing combining marks, as well as U+200C ZERO WIDTH NON-JOINER and U+200D ZERO WIDTH JOINER." The value Inherited is described as matching the ISO 15924 code Zinh.
Qaag is a special script code for identifying the non-standard use of Myanmar characters for display with the Zawgyi font. The purpose of the code is to enable migration to standard, interoperable use of Unicode by providing an identifier for Zawgyi for tagging text, applications, input methods, font tables, transformations, and other mechanisms used for migration.