egXML indicates the person, or group of people, to whom the element content is ascribed. indicates the person, or group of people, to whom a speech act or action is directed. provides an externally-defined means of identifying the entity (or entities) being named, using a coded value of some kind. (reference) provides an explicit means of locating a full definition or identity for the entity being named by means of one or more URIs. gives a minimum estimated value for the approximate measurement. (\-?[\d]+/\-?[\d]+) gives a maximum estimated value for the approximate measurement. (\-?[\d]+/\-?[\d]+) where the measurement summarizes more than one observation or a range, supplies the minimum value observed. (\-?[\d]+/\-?[\d]+) where the measurement summarizes more than one observation or a range, supplies the maximum value observed. (\-?[\d]+/\-?[\d]+) specifies the degree of statistical confidence (between zero and one) that a value falls within the range specified by @min and @max, or the proportion of observed values that fall within that range. names the unit used for the measurement Suggested values include: 1] cm (centimetres); 2] mm (millimetres); 3] in (inches); 4] line; 5] char (characters) cm (centimetres) mm (millimetres) in (inches) line lines of text char (characters) characters of text [^\p{C}\p{Z}]+ specifies the length in the units specified (\-?[\d]+/\-?[\d]+) indicates the size of the object concerned using a project-specific vocabulary combining quantity and units in a single string of words. characterizes the precision of the values specified by the other attributes. high medium low unknown where the measurement summarizes more than one observation, specifies the applicability of this measurement. Sample values include: 1] all; 2] most; 3] range [^\p{C}\p{Z}]+ points to a <handNote> element describing the hand considered responsible for the content of the element concerned. categorizes the cause of the damage, if it can be identified. Sample values include: 1] rubbing; 2] mildew; 3] smoke [^\p{C}\p{Z}]+ provides a coded representation of the degree of damage, either as a number between 0 (undamaged) and 1 (very extensively damaged), or as one of the codes high, medium, low, or unknown. The <damage> element with the @degree attribute should only be used where the text may be read with some confidence; text supplied from other sources should be tagged as <supplied>. high medium low unknown assigns an arbitrary number to each stretch of damage regarded as forming part of the same physical phenomenon. indicates whether or not the element bearing this attribute should be considered to mark the end of an orthographic token in the same way as whitespace. [^\p{C}\p{Z}]+ (canonical reference) specifies the destination of the pointer by supplying a canonical reference expressed using the scheme defined in a <refsDecl> element in the TEI header supplies the value of the date or time in a standard form, e.g. yyyy-mm-dd. specifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd. specifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd. indicates the starting point of the period in standard form, e.g. yyyy-mm-dd. indicates the ending point of the period in standard form, e.g. yyyy-mm-dd. The @when attribute cannot be used with any other att.datable.w3c attributes. The @from and @notBefore attributes cannot be used together. The @to and @notAfter attributes cannot be used together. indicates the system or calendar to which the date represented by the content of this element belongs. @calendar indicates the system or calendar to which the date represented by the content of this element belongs, but this element has no textual content. supplies a pointer to some location defining a named period of time within which the datable item is understood to have occurred. contains a PID (persistent identifier) that aligns the given element with the appropriate Data Category (or categories) in ISOcat. contains a PID (persistent identifier) that aligns the content of the given element or the value of the given attribute with the appropriate simple Data Category (or categories) in ISOcat. indicates whether or not this element is selected by default when its parent is selected. true This element is selected if its parent is selected false This element can only be selected explicitly, unless it is the only one of its kind, in which case it is selected if its parent is selected. identifies one or more declarable elements within the header, which are understood to apply to the element bearing this attribute and its content. specifies whether or not its parent element is fragmented in some way, typically by some other overlapping structure: for example a speech which is divided between two or more verse stanzas, a paragraph which is split across a page division, a verse line which is divided between two speakers. Y (yes) the element is fragmented in some (unspecified) respect N (no) the element is not fragmented, or no claim is made as to its completeness I (initial) this is the initial part of a fragmented element M (medial) this is a medial part of a fragmented element F (final) this is the final part of a fragmented element (organization) specifies how the content of the division is organized. composite no claim is made about the sequence in which the immediate contents of this division are to be processed, or their inter-relationships. uniform the immediate contents of this element are regarded as forming a logical unit, to be processed in sequence. indicates whether this division is a sample of the original source and if so, from which part. initial division lacks material present at end in source. medial division lacks material at start and end. final division lacks material at start. unknown position of sampled material within original unknown. complete division is not a sample. describes the status of a document either currently or, when associated with a dated element, at the time indicated. Sample values include: 1] approved; 2] candidate; 3] cleared; 4] deprecated; 5] draft; 6] embargoed; 7] expired; 8] frozen; 9] galley; 10] proposed; 11] published; 12] recommendation; 13] submitted; 14] unfinished; 15] withdrawn [^\p{C}\p{Z}]+ (duration) indicates the length of this element in time. (certainty) signifies the degree of certainty associated with the intervention or interpretation. high medium low unknown (responsible party) indicates the agency responsible for the intervention or interpretation, for example an editor or transcriber. indicates the nature of the evidence supporting the reliability or accuracy of the intervention or interpretation. Suggested values include: 1] internal; 2] external; 3] conjecture internal there is internal evidence to support the intervention. external there is external evidence to support the intervention. conjecture the intervention or interpretation has been made by the editor, cataloguer, or scholar on the basis of their expertise. [^\p{C}\p{Z}]+ indicates whether this is an instant revision or not. unknown inapplicable (rendition) indicates how the element in question was rendered or presented in the source text. [^\p{C}\p{Z}]+ contains an expression in some formal style definition language which defines the rendering or presentation used for this element in the source text points to a description of the rendering or presentation used for this element in the source text. specifies the source from which some aspect of this element is drawn. (identifier) provides a unique identifier for the element bearing the attribute. (number) gives a number (or other label) for an element, which is not necessarily unique within the document. (language) indicates the language of the element content using a tag generated according to BCP 47. provides a base URI reference with which applications can resolve relative URI references into absolute URI references. signals an intention about how white space should be managed by applications. default signals that the application's default white-space processing modes are acceptable preserve indicates the intent that applications preserve all white space gives a name or other identifier for the scribe believed to be responsible for this hand. points to a full description of the scribe concerned, typically supplied by a <person> element elsewhere in the description. characterizes the particular script or writing style used by this hand, for example secretary, copperplate, Chancery, Italian, etc. points to a full description of the script or writing style used by this hand, typically supplied by a <scriptNote> element elsewhere in the description. describes the tint or type of ink, e.g. brown, or other writing medium, e.g. pencil [^\p{C}\p{Z}]+ specifies how widely this hand is used in the manuscript. sole only this hand is used throughout the manuscript major this hand is used through most of the manuscript minor this hand is used occasionally in the manuscript (MIME media type) specifies the applicable multimedia internet mail extension (MIME) media type [^\p{C}\p{Z}]+ Where the media are displayed, indicates the display width [\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm) Where the media are displayed, indicates the display height [\-+]?\d+(\.\d+)?(%|cm|mm|in|pt|pc|px|em|ex|gd|rem|vw|vh|vm) Where the media are displayed, indicates a scale factor to be applied when generating the desired display size (\-?[\d]+/\-?[\d]+) (uniform resource locator) specifies the URL from which the media concerned may be obtained. (instances) points to instances of the analysis or interpretation represented by the current element. indicates the units used for the measurement, usually using the standard symbol for the desired units. Suggested values include: 1] m (metre); 2] kg (kilogram); 3] s (second); 4] Hz (hertz); 5] Pa (pascal); 6] Ω (ohm); 7] L (litre); 8] t (tonne); 9] ha (hectare); 10] Å (ångström); 11] mL (millilitre); 12] cm (centimetre); 13] dB (decibel); 14] kbit (kilobit); 15] Kibit (kibibit); 16] kB (kilobyte); 17] KiB (kibibyte); 18] MB (megabyte); 19] MiB (mebibyte) m (metre) SI base unit of length kg (kilogram) SI base unit of mass s (second) SI base unit of time Hz (hertz) SI unit of frequency Pa (pascal) SI unit of pressure or stress Ω (ohm) SI unit of electric resistance L (litre) 1 dm³ t (tonne) 10³ kg ha (hectare) 1 hm² Å (ångström) 10⁻¹⁰ m mL (millilitre) cm (centimetre) dB (decibel) see remarks, below kbit (kilobit) 10³ or 1000 bits Kibit (kibibit) 2¹⁰ or 1024 bits kB (kilobyte) 10³ or 1000 bytes KiB (kibibyte) 2¹⁰ or 1024 bytes MB (megabyte) 10⁶ or 1 000 000 bytes MiB (mebibyte) 2²⁰ or 1 048 576 bytes [^\p{C}\p{Z}]+ points to a unique identifier stored in the @xml:id of a <unitDef> element that defines a unit of measure. specifies the number of the specified units that comprise the measurement (\-?[\d]+/\-?[\d]+) indicates the substance that is being measured [^\p{C}\p{Z}]+ The @unit attribute may be unnecessary when @unitRef is present. may be used to specify further information about the entity referenced by this name in the form of a set of whitespace-separated values, for example the occupation of a person, or the status of a place. [^\p{C}\p{Z}]+ (reference to the canonical name) provides a means of locating the canonical form (nym) of the names associated with the object named by the element bearing it. names the notation used for the content of the element. [^\p{C}\p{Z}]+ specifies where this item is placed. Suggested values include: 1] below; 2] bottom; 3] margin; 4] top; 5] opposite; 6] overleaf; 7] above; 8] end; 9] inline; 10] inspace below below the line bottom at the foot of the page margin in the margin (left, right, or both) top at the top of the page opposite on the opposite, i.e. facing, page overleaf on the other side of the leaf above above the line end at the end of e.g. chapter or volume. inline within the body of the text. inspace in a predefined space, for example left by an earlier scribe. [^\p{C}\p{Z}]+ characterizes the element in some sense, using any convenient classification scheme or typology. [^\p{C}\p{Z}]+ provides a sub-categorization of the element, if needed [^\p{C}\p{Z}]+ The element should not be categorized in detail with @subtype unless also categorized in general with @type specifies the language of the content to be found at the destination referenced by @target, using a language tag generated according to BCP 47. @targetLang should only be used on if @target is specified. specifies the destination of the reference by supplying one or more URI References specifies the intended meaning when the target of a pointer is itself a pointer. all if the element pointed to is itself a pointer, then the target of that pointer will be taken, and so on, until an element is found which is not a pointer. one if the element pointed to is itself a pointer, then its target (whether a pointer or not) is taken as the target of this pointer. none no further evaluation of targets is carried out beyond that needed to find the element specified in the pointer's target. optionally specifies the identifiers of the elements within which all elements indicated by the contents of this element lie. (target function) describes the function of each of the values of the @target attribute of the enclosed <link>, <join>, or <alt> tags. [^\p{C}\p{Z}]+ [^\p{C}\p{Z}]+ [^\p{C}\p{Z}]+ characterizes the function of the segment. [^\p{C}\p{Z}]+ supplies the sort key for this element in an index, list or group which contains it. [^\p{C}\p{Z}]+ (edition) supplies a sigil or other arbitrary identifier for the source edition in which the associated feature (for example, a page, column, or line break) occurs at this point in the text. [^\p{C}\p{Z}]+ (edition reference) provides a pointer to the source edition in which the associated feature (for example, a page, column, or line break) occurs at this point in the text. indicates the end of a span initiated by the element bearing this attribute. The element indicated by @spanTo () must follow the current element identifies the language used to describe the rendition. css Cascading Stylesheet Language xslfo Extensible Stylesheet Language Formatting Objects free Informal free text description other A user-defined rendition description language supplies a version number for the style language provided in @scheme. [\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3} @schemeVersion can only be used if @scheme is specified. indicates the location within a temporal alignment at which this element begins. indicates the location within a temporal alignment at which this element ends. indicates the effect of the intervention, for example in the case of a deletion, strikeouts which include too much or too little text, or in the case of an addition, an insertion which duplicates some of the text already present. Sample values include: 1] duplicate; 2] duplicate-partial; 3] excessStart; 4] excessEnd; 5] shortStart; 6] shortEnd; 7] partial; 8] unremarkable [^\p{C}\p{Z}]+ documents the presumed cause for the intervention. [^\p{C}\p{Z}]+ (sequence) assigns a sequence number related to the order in which the encoded features carrying this attribute are believed to have occurred. specifies the date on which the source text was extracted and sent to the translator identifies the unit of information conveyed by the element, e.g. columns, pages, volume, entry. Suggested values include: 1] volume; 2] issue; 3] page; 4] line; 5] chapter; 6] part; 7] column; 8] entry volume the element contains a volume number. issue the element contains an issue number, or volume and issue numbers. page the element contains a page number or page range. line the element contains a line number or line range. chapter the element contains a chapter indication (number and/or title) part the element identifies a part of a book or collection. column the element identifies a column. entry the element identifies an entry number or label in a list of entries. [^\p{C}\p{Z}]+ specifies the starting point of the range of units indicated by the @unit attribute. [^\p{C}\p{Z}]+ specifies the end-point of the range of units indicated by the @unit attribute. [^\p{C}\p{Z}]+ An @formula is provided to describe a mathematical calculation such as a conversion between measurement systems. indicates whether the pronunciation or orthography applies to all or part of a word. Suggested values include: 1] full (full form); 2] pref (prefix); 3] suff (suffix); 4] inf (infix); 5] part (partial) full (full form) pref (prefix) suff (suffix) inf (infix) part (partial) [^\p{C}\p{Z}]+ indicates whether the name component is given in full, as an abbreviation or simply as an initial. yes the name component is spelled out in full. abb (abbreviated) the name component is given in an abbreviated form. init (initial letter) the name component is indicated only by one initial. specifies the sort order of the name component in relation to others within the name. (duration) indicates the length of this element in time. [0-9.,DHMPRSTWYZ/:+\-]+ provides a conventional name for the kind of section changing at this milestone. Suggested values include: 1] page; 2] column; 3] line; 4] book; 5] poem; 6] canto; 7] speaker; 8] stanza; 9] act; 10] scene; 11] section; 12] absent; 13] unnumbered page physical page breaks (synonymous with the <pb> element). column column breaks. line line breaks (synonymous with the <lb> element). book any units termed book, liber, etc. poem individual poems in a collection. canto cantos or other major sections of a poem. speaker changes of speaker or narrator. stanza stanzas within a poem, book, or canto. act acts within a play. scene scenes within a play or act. section sections of any kind. absent passages not present in the reference edition. unnumbered passages present in the text, but not to be included as part of the reference. [^\p{C}\p{Z}]+ (paragraph) marks paragraphs in prose. [3.1. Paragraphs 7.2.5. Speech Contents] Abstract model violation: Paragraphs may not occur inside other paragraphs or ab elements. Abstract model violation: Lines may not contain higher-level structural elements such as div, p, or ab. identifies a word or phrase as belonging to some language other than that of the surrounding text. [3.3.2.1. Foreign Words or Expressions] (emphasized) marks words or phrases which are stressed or emphasized for linguistic or rhetorical effect. [3.3.2.2. Emphatic Words and Phrases 3.3.2. Emphasis, Foreign Words, and Unusual Language] (highlighted) marks a word or phrase as graphically distinct from the surrounding text, for reasons concerning which no claim is made. [3.3.2.2. Emphatic Words and Phrases 3.3.2. Emphasis, Foreign Words, and Unusual Language] identifies any word or phrase which is regarded as linguistically distinct, for example as archaic, technical, dialectal, non-preferred, etc., or as forming part of a sublanguage. [3.3.2.3. Other Linguistically Distinct Material] specifies the sublanguage or register to which the word or phrase is being assigned [^\p{C}\p{Z}]+ specifies how the phrase is distinct diachronically specifies how the phrase is distinct diatopically specifies how the phrase is distinct diastratically (speech or thought) indicates passages thought or spoken aloud, whether explicitly indicated in the source or not, whether directly or indirectly reported, whether by real people or fictional characters. [3.3.3. Quotation] may be used to indicate whether the quoted matter is regarded as having been vocalized or signed. unknown inapplicable may be used to indicate whether the quoted matter is regarded as direct or indirect speech. unknown inapplicable (quotation) contains a phrase or passage attributed by the narrator or author to some agency external to the text. [3.3.3. Quotation 4.3.1. Grouped Texts] (quoted) contains material which is distinguished from the surrounding text using quotation marks or a similar method, for any one of a variety of reasons including, but not limited to: direct speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and passages that are mentioned but not used. [3.3.3. Quotation] may be used to indicate whether the offset passage is spoken or thought, or to characterize it more finely. Suggested values include: 1] spoken; 2] thought; 3] written; 4] soCalled; 5] foreign; 6] distinct; 7] term; 8] emph; 9] mentioned spoken representation of speech thought representation of thought, e.g. internal monologue written quotation from a written source soCalled authorial distance foreign distinct linguistically distinct term technical term emph rhetorically emphasized mentioned refering to itself, not its normal referent [^\p{C}\p{Z}]+ (cited quotation) contains a quotation from some other document, together with a bibliographic reference to its source. In a dictionary it may contain an example text with at least one occurrence of the word form, used in the sense being described, or a translation of the headword, or an example. [3.3.3. Quotation 4.3.1. Grouped Texts 9.3.5.1. Examples] marks words or phrases mentioned, not used. [3.3.3. Quotation] contains a word or phrase for which the author or narrator indicates a disclaiming of responsibility, for example by the use of scare quotes or italics. [3.3.3. Quotation] (description) contains a brief description of the object documented by its parent element, typically a documentation element or an entity. [22.4.1. Description of Components] Information about a deprecation should only be present in a specification element that is being deprecated: that is, only an element that has a @validUntil attribute should have a child <desc type="deprecationInfo">. characterizes the element in some sense, using any convenient classification scheme or typology. [^\p{C}\p{Z}]+ identifies a phrase or word used to provide a gloss or definition for some other word or phrase. [3.3.4. Terms, Glosses, Equivalents, and Descriptions 22.4.1. Description of Components] contains a single-word, multi-word, or symbolic designation which is regarded as a technical term. [3.3.4. Terms, Glosses, Equivalents, and Descriptions] (Latin for thus or so) contains text reproduced although apparently incorrect or inaccurate. [3.4.1. Apparent Errors] (correction) contains the correct form of a passage apparently erroneous in the copy text. [3.4.1. Apparent Errors] groups a number of alternative encodings for the same point in a text. [3.4. Simple Editorial Changes] (regularization) contains a reading which has been regularized or normalized in some sense. [3.4.2. Regularization and Normalization 12. Critical Apparatus] (original form) contains a reading which is marked as following the original, rather than being normalized or corrected. [3.4.2. Regularization and Normalization 12. Critical Apparatus] indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible. [3.4.3. Additions, Deletions, and Omissions] gives the reason for omission Suggested values include: 1] cancelled; 2] deleted; 3] editorial; 4] illegible; 5] inaudible; 6] irrelevant; 7] sampling cancelled deleted editorial for features omitted from transcription due to editorial policy illegible inaudible irrelevant sampling [^\p{C}\p{Z}]+ in the case of text omitted because of damage, categorizes the cause of the damage, if it can be identified. Sample values include: 1] rubbing; 2] mildew; 3] smoke [^\p{C}\p{Z}]+ (addition) contains letters, words, or phrases inserted in the source text by an author, scribe, or a previous annotator or corrector. [3.4.3. Additions, Deletions, and Omissions] (deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, or a previous annotator or corrector. [3.4.3. Additions, Deletions, and Omissions] contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source. [11.3.3.1. Damage, Illegibility, and Supplied Text 3.4.3. Additions, Deletions, and Omissions] indicates why the material is hard to transcribe. Suggested values include: 1] illegible; 2] inaudible; 3] faded; 4] background_noise; 5] eccentric_ductus illegible inaudible faded background_noise eccentric_ductus [^\p{C}\p{Z}]+ Where the difficulty in transcription arises from damage, categorizes the cause of the damage, if it can be identified. Sample values include: 1] rubbing; 2] mildew; 3] smoke [^\p{C}\p{Z}]+ (name, proper noun) contains a proper noun or noun phrase. [3.5.1. Referring Strings] (referencing string) contains a general purpose name or referring string. [13.2.1. Personal Names 3.5.1. Referring Strings] (electronic mail address) contains an email address identifying a location to which email messages can be delivered. [3.5.2. Addresses] contains a postal address, for example of a publisher, an organization, or an individual. [3.5.2. Addresses 2.2.4. Publication, Distribution, Licensing, etc. 3.11.2.4. Imprint, Size of a Document, and Reprint Information] (address line) contains one line of a postal address. [3.5.2. Addresses 2.2.4. Publication, Distribution, Licensing, etc. 3.11.2.4. Imprint, Size of a Document, and Reprint Information] contains a full street address including any name or number identifying a building as well as the name of the street or route on which it is located. [3.5.2. Addresses] (postal code) contains a numerical or alphanumeric code used as part of a postal address to simplify sorting or delivery of mail. [3.5.2. Addresses] (postal box or post office box) contains a number or other identifier for some postal delivery point other than a street address. [3.5.2. Addresses] (number) contains a number, written in any form. [3.5.3. Numbers and Measures] indicates the type of numeric value. Suggested values include: 1] cardinal; 2] ordinal; 3] fraction; 4] percentage cardinal absolute number, e.g. 21, 21.5 ordinal ordinal number, e.g. 21st fraction fraction, e.g. one half or three-quarters percentage a percentage [^\p{C}\p{Z}]+ supplies the value of the number in standard form. (\-?[\d]+/\-?[\d]+) contains a word or phrase referring to some quantity of an object or commodity, usually comprising a number, a unit, and a commodity name. [3.5.3. Numbers and Measures] specifies the type of measurement in any convenient typology. [^\p{C}\p{Z}]+ (measure group) contains a group of dimensional specifications which relate to the same object, for example the height and width of a manuscript page. [10.3.4. Dimensions] contains a symbol, a word or a phrase referring to a unit of measurement in any kind of formal or informal system. [3.5.3. Numbers and Measures] contains a date in any format. [3.5.4. Dates and Times 2.2.4. Publication, Distribution, Licensing, etc. 2.6. The Revision Description 3.11.2.4. Imprint, Size of a Document, and Reprint Information 15.2.3. The Setting Description 13.3.7. Dates and Times] contains a phrase defining a time of day in any format. [3.5.4. Dates and Times] (abbreviation) contains an abbreviation of any sort. [3.5.5. Abbreviations and Their Expansions] allows the encoder to classify the abbreviation according to some convenient typology. Sample values include: 1] suspension; 2] contraction; 3] brevigraph; 4] superscription; 5] acronym; 6] title; 7] organization; 8] geographic [^\p{C}\p{Z}]+ (expansion) contains the expansion of an abbreviation. [3.5.5. Abbreviations and Their Expansions] (pointer) defines a pointer to another location. [3.6. Simple Links and Cross-References 16.1. Links] Only one of the attributes @target and @cRef may be supplied on . (reference) defines a reference to another location, possibly modified by additional text or comment. [3.6. Simple Links and Cross-References 16.1. Links] Only one of the attributes @target' and @cRef' may be supplied on contains any sequence of items organized as a list. [3.7. Lists] The content of a "gloss" list should include a sequence of one or more pairs of a label element followed by an item element describes the nature of the items in the list. Suggested values include: 1] gloss; 2] index; 3] instructions; 4] litany; 5] syllogism gloss each list item glosses some term or concept, which is given by a <label> element preceding the list item. index each list item is an entry in an index such as the alphabetical topical index at the back of a print volume. instructions each list item is a step in a sequence of instructions, as in a recipe. litany each list item is one of a sequence of petitions, supplications or invocations, typically in a religious ritual. syllogism each list item is part of an argument consisting of two or more propositions and a final conclusion derived from them. [^\p{C}\p{Z}]+ contains one component of a list. [3.7. Lists 2.6. The Revision Description] contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary. [3.7. Lists] (heading) contains any type of heading, for example the title of a section, or the heading of a list, glossary, manuscript description, etc. [4.2.1. Headings and Trailers] (heading for list labels) contains the heading for the label or term column in a glossary list or similar structured list. [3.7. Lists] (heading for list items) contains the heading for the item or gloss column in a glossary list or similar structured list. [3.7. Lists] contains a note or annotation. [3.8.1. Notes and Simple Annotation 2.2.6. The Notes Statement 3.11.2.8. Notes and Statement of Language 9.3.5.4. Notes within Entries] indicates whether the copy text shows the exact place of reference for the note. points to the end of the span to which the note is attached, if the note is not embedded in the text at that point. (index entry) marks a location to be indexed for whatever purpose. [3.8.2. Index Entries] a single word which follows the rules defining a legal XML name (see ), supplying a name to specify which index (of several) the index entry belongs to. indicates the location of any form of external media such as an audio or video clip etc. [3.9. Graphics and Other Non-textual Components] (MIME media type) specifies the applicable multimedia internet mail extension (MIME) media type [^\p{C}\p{Z}]+ indicates the location of a graphic or illustration, either forming part of a text, or providing an image of it. [3.9. Graphics and Other Non-textual Components 11.1. Digital Facsimiles] marks a boundary point separating any kind of section of a text, typically but not necessarily indicating a point at which some part of a standard reference system changes, where the change is not represented by a structural element. [3.10.3. Milestone Elements] (page beginning) marks the beginning of a new page in a paginated document. [3.10.3. Milestone Elements] (line beginning) marks the beginning of a new (typographic) line in some edition or version of a text. [3.10.3. Milestone Elements 7.2.5. Speech Contents] (column beginning) marks the beginning of a new column of a text on a multi-column page. [3.10.3. Milestone Elements] (analytic level) contains bibliographic elements describing an item (e.g. an article or poem) published within a monograph or journal and not as an independent publication. [3.11.2.1. Analytic, Monographic, and Series Levels] (monographic level) contains bibliographic elements describing an item (e.g. a book or journal) published as an independent item (i.e. as a separate physical object). [3.11.2.1. Analytic, Monographic, and Series Levels] (series information) contains information about the series in which a book or other bibliographic item has appeared. [3.11.2.1. Analytic, Monographic, and Series Levels] in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority. [3.11.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement] contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc. [3.11.2.2. Titles, Authors, and Editors] (statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply. May also be used to encode information about individuals or organizations which have played a role in the production or distribution of a bibliographic work. [3.11.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.2. The Edition Statement 2.2.5. The Series Statement] (responsibility) contains a phrase describing the nature of a person's intellectual responsibility, or an organization's role in the production or distribution of a work. [3.11.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.2. The Edition Statement 2.2.5. The Series Statement] contains a title for any kind of work. [3.11.2.2. Titles, Authors, and Editors 2.2.1. The Title Statement 2.2.5. The Series Statement] classifies the title according to some convenient typology. Sample values include: 1] main; 2] sub (subordinate); 3] alt (alternate); 4] short; 5] desc (descriptive) [^\p{C}\p{Z}]+ indicates the bibliographic level for a title, that is, whether it identifies an article, book, journal, series, or unpublished material. a (analytic) the title applies to an analytic item, such as an article, poem, or other work published as part of a larger item. m (monographic) the title applies to a monograph such as a book or other item considered to be a distinct publication, including single volumes of multi-volume works j (journal) the title applies to any serial or periodical publication such as a journal, magazine, or newspaper s (series) the title applies to a series of otherwise distinct publications such as a collection u (unpublished) the title applies to any unpublished material (including theses and dissertations unless published by a commercial press) contains the formalized descriptive title for a meeting or conference, for use in a bibliographic description for an item derived from such a meeting, or as a heading or preamble to publications emanating from it. [3.11.2.2. Titles, Authors, and Editors] groups information relating to the publication or distribution of a bibliographic item. [3.11.2.4. Imprint, Size of a Document, and Reprint Information] provides the name of the organization responsible for the publication or distribution of a bibliographic item. [3.11.2.4. Imprint, Size of a Document, and Reprint Information 2.2.4. Publication, Distribution, Licensing, etc.] (scope of bibliographic reference) defines the scope of a bibliographic reference, for example as a list of page numbers, or a named subdivision of a larger work. [3.11.2.5. Scopes and Ranges in Bibliographic Citations] (cited range) defines the range of cited content, often represented by pages or other units [3.11.2.5. Scopes and Ranges in Bibliographic Citations] (publication place) contains the name of the place where a bibliographic item was published. [3.11.2.4. Imprint, Size of a Document, and Reprint Information] (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged. [3.11.1. Methods of Encoding Bibliographic References and Lists of References 2.2.7. The Source Description 15.3.2. Declarable Elements] (structured bibliographic citation) contains a structured bibliographic citation, in which only bibliographic sub-elements appear and in a specified order. [3.11.1. Methods of Encoding Bibliographic References and Lists of References 2.2.7. The Source Description 15.3.2. Declarable Elements] (citation list) contains a list of bibliographic citations of any kind. [3.11.1. Methods of Encoding Bibliographic References and Lists of References 2.2.7. The Source Description 15.3.2. Declarable Elements] contains or references some other bibliographic item which is related to the present one in some specified manner, for example as a constituent or alternative version of it. [3.11.2.7. Related Items] If the @target attribute on is used, the relatedItem element must be empty A relatedItem element should have either a 'target' attribute or a child element to indicate the related bibliographic item points to the related bibliographic element by means of an absolute or relative URI reference (verse line) contains a single, possibly incomplete, line of verse. [3.12.1. Core Tags for Verse 3.12. Passages of Verse or Drama 7.2.5. Speech Contents] Abstract model violation: Lines may not contain lines or lg elements. (line group) contains one or more verse lines functioning as a formal unit, e.g. a stanza, refrain, verse paragraph, etc. [3.12.1. Core Tags for Verse 3.12. Passages of Verse or Drama 7.2.5. Speech Contents] An lg element must contain at least one child l, lg, or gap element. Abstract model violation: Lines may not contain line groups. (speech) contains an individual speech in a performance text, or a passage presented as such in a prose or verse text. [3.12.2. Core Tags for Drama 3.12. Passages of Verse or Drama 7.2.2. Speeches and Speakers] contains a specialized form of heading or label, giving the name of one or more speakers in a dramatic text or fragment. [3.12.2. Core Tags for Drama] (stage direction) contains any kind of stage direction within a dramatic text or fragment. [3.12.2. Core Tags for Drama 3.12. Passages of Verse or Drama 7.2.4. Stage Directions] indicates the kind of stage direction. Suggested values include: 1] setting; 2] entrance; 3] exit; 4] business; 5] novelistic; 6] delivery; 7] modifier; 8] location; 9] mixed setting describes a setting. entrance describes an entrance. exit describes an exit. business describes stage business. novelistic is a narrative, motivating stage direction. delivery describes how a character speaks. modifier gives some detail about a character. location describes a location. mixed more than one of the above [^\p{C}\p{Z}]+ contains the whole of a TEI encoded corpus, comprising a single corpus header and one or more <TEI> elements, each containing a single text header and a text. [4. Default Text Structure 15.1. Varieties of Composite Text] specifies the version number of the TEI Guidelines against which this document is valid. [\d]+(\.[\d]+){0,2} (automatically generated text division) indicates the location at which a textual division generated automatically by a text-processing application is to appear. [3.8.2. Index Entries] specifies what type of generated text division (e.g. index, table of contents, etc.) is to appear. Sample values include: 1] index; 2] toc; 3] figlist; 4] tablist [^\p{C}\p{Z}]+ (text language) describes the languages and writing systems identified within the bibliographic work being described, rather than its description. [3.11.2.4. Imprint, Size of a Document, and Reprint Information 10.6.6. Languages and Writing Systems] (main language) supplies a code which identifies the chief language used in the bibliographic work. (other languages) one or more codes identifying any other languages used in the bibliographic work. specifies a regular expression against which the values of other attributes can be matched. specifies a replacement pattern, that is, the skeleton of a relative or absolute URI containing references to groups in the @matchPattern which, once subpattern substitution has been performed, complete the URI. (TEI header) supplies descriptive and declarative metadata associated with a digital resource or set of resources. [2.1.1. The TEI Header and Its Components 15.1. Varieties of Composite Text] (file description) contains a full bibliographic description of an electronic file. [2.2. The File Description 2.1.1. The TEI Header and Its Components] (title statement) groups information about the title of a work and those responsible for its content. [2.2.1. The Title Statement 2.2. The File Description] specifies the name of a sponsoring organization or institution. [2.2.1. The Title Statement] (funding body) specifies the name of an individual, institution, or organization responsible for the funding of a project or text. [2.2.1. The Title Statement] (principal researcher) supplies the name of the principal researcher responsible for the creation of an electronic text. [2.2.1. The Title Statement] (edition statement) groups information relating to one edition of a text. [2.2.2. The Edition Statement 2.2. The File Description] describes the particularities of one edition of a text. [2.2.2. The Edition Statement] describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units. [2.2.3. Type and Extent of File 2.2. The File Description 3.11.2.4. Imprint, Size of a Document, and Reprint Information 10.7.1. Object Description] (publication statement) groups information concerning the publication or distribution of an electronic or other text. [2.2.4. Publication, Distribution, Licensing, etc. 2.2. The File Description] supplies the name of a person or other agency responsible for the distribution of a text. [2.2.4. Publication, Distribution, Licensing, etc.] (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor. [2.2.4. Publication, Distribution, Licensing, etc.] (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way. [13.3.1. Basic Principles 2.2.4. Publication, Distribution, Licensing, etc. 2.2.5. The Series Statement 3.11.2.4. Imprint, Size of a Document, and Reprint Information] categorizes the identifier, for example as an ISBN, Social Security number, etc. Suggested values include: 1] ISBN; 2] ISSN; 3] DOI; 4] URI; 5] VIAF; 6] ESTC; 7] OCLC ISBN International Standard Book Number: a 13- or (if assigned prior to 2007) 10-digit identifying number assigned by the publishing industry to a published book or similar item, registered with the International ISBN Agency. ISSN International Standard Serial Number: an eight-digit number to uniquely identify a serial publication. DOI Digital Object Identifier: a unique string of letters and numbers assigned to an electronic document. URI Uniform Resource Identifier: a string of characters to uniquely identify a resource which usually contains indication of the means of accessing that resource, the name of its host, and its filepath. VIAF A data number in the Virtual Internet Authority File assigned to link different names in catalogs around the world for the same entity. ESTC English Short-Title Catalogue number: an identifying number assigned to a document in English printed in the British Isles or North America before 1801. OCLC OCLC control number (record number) for the union catalog record in WorldCat, a union catalog for member libraries in the Online Computer Library Center global cooperative. [^\p{C}\p{Z}]+ supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc. [2.2.4. Publication, Distribution, Licensing, etc.] supplies a code identifying the current availability of the text. free the text is freely available. unknown the status of the text is unknown. restricted the text is not freely available. contains information about a licence or other legal agreement applicable to the text. [2.2.4. Publication, Distribution, Licensing, etc.] (series statement) groups information about the series, if any, to which a publication belongs. [2.2.5. The Series Statement 2.2. The File Description] (notes statement) collects together any notes providing information about a text additional to that recorded in other parts of the bibliographic description. [2.2.6. The Notes Statement 2.2. The File Description] (source description) describes the source(s) from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence. [2.2.7. The Source Description] (fully-structured bibliographic citation) contains a fully-structured bibliographic citation, in which all components of the TEI file description are present. [3.11.1. Methods of Encoding Bibliographic References and Lists of References 2.2. The File Description 2.2.7. The Source Description 15.3.2. Declarable Elements] (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived. [2.3. The Encoding Description 2.1.1. The TEI Header and Its Components] (schema reference) describes or points to a related customization or schema file [2.3.10. The Schema Specification] the identifier used for the customization or schema (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected. [2.3.1. The Project Description 2.3. The Encoding Description 15.3.2. Declarable Elements] (sampling declaration) contains a prose description of the rationale and methods used in sampling texts in the creation of a corpus or collection. [2.3.2. The Sampling Declaration 2.3. The Encoding Description 15.3.2. Declarable Elements] (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text. [2.3.3. The Editorial Practices Declaration 2.3. The Encoding Description 15.3.2. Declarable Elements] (correction principles) states how and under what circumstances corrections have been made in the text. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements] indicates the degree of correction applied to the text. high the text has been thoroughly checked and proofread. medium the text has been checked at least once. low the text has not been checked. unknown the correction status of the text is unknown. indicates the method adopted to indicate corrections within the text. silent corrections have been made silently markup corrections have been represented using markup indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements] indicates the method adopted to indicate normalizations within the text. silent normalization made silently markup normalization represented using markup specifies editorial practice adopted with respect to quotation marks in the original. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements] On , either the @marks attribute should be used, or a paragraph of description provided (quotation marks) indicates whether or not quotation marks have been retained as content within the text. none no quotation marks have been retained some some quotation marks have been retained all all quotation marks have been retained summarizes the way in which hyphenation in a source text has been treated in an encoded version of it. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements] (end-of-line) indicates whether or not end-of-line hyphenation has been retained in a text. all all end-of-line hyphenation has been retained, even though the lineation of the original may not have been. some end-of-line hyphenation has been retained in some cases. hard all soft end-of-line hyphenation has been removed: any remaining end-of-line hyphenation should be retained. none all end-of-line hyphenation has been removed: any remaining hyphenation occurred within the line. describes the principles according to which the text has been segmented, for example into sentences, tone-units, graphemic strata, etc. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements] (standard values) specifies the format used when standardized date or number values are supplied. [2.3.3. The Editorial Practices Declaration 15.3.2. Declarable Elements] describes the scope of any analytic or interpretive information added to the text in addition to the transcription. [2.3.3. The Editorial Practices Declaration] specifies editorial practice adopted with respect to punctuation marks in the original. [2.3.3. The Editorial Practices Declaration 3.2. Treatment of Punctuation] indicates whether or not punctation marks have been retained as content within the text. none no punctuation marks have been retained some some punctuation marks have been retained all all punctuation marks have been retained indicates the positioning of punctuation marks that are associated with marked up text as being encoded within the element surrounding the text or immediately before or after it. internal punctuation marks found at the start or end of a marked up text component are included within its surrounding element; external punctuation marks found at the start or end of a marked up text component appear immediately before or after the surrounding element (tagging declaration) provides detailed information about the tagging applied to a document. [2.3.4. The Tagging Declaration 2.3. The Encoding Description] indicates whether the element types listed exhaustively include all those found within <text>, or represent only a subset. documents the usage of a specific element within a specified document. [2.3.4. The Tagging Declaration] (generic identifier) specifies the name (generic identifier) of the element indicated by the tag, within the namespace indicated by the parent <namespace> element. specifies the number of occurrences of this element within the text. (with unique identifier) specifies the number of occurrences of this element within the text which bear a distinct value for the global @xml:id attribute. supplies the formal name of the namespace to which the elements documented by its children belong. [2.3.4. The Tagging Declaration] specifies the full formal name of the namespace concerned. supplies information about the rendition or appearance of one or more elements in the source text. [2.3.4. The Tagging Declaration] where CSS is used, provides a way of defining pseudo-elements, that is, styling rules applicable to specific sub-portions of an element. Sample values include: 1] first-line; 2] first-letter; 3] before; 4] after [^\p{C}\p{Z}]+ contains a selector or series of selectors specifying the elements to which the contained style description applies, expressed in the language specified in the @scheme attribute. (style definition language declaration) specifies the name of the formal language in which style or renditional information is supplied elsewhere in the document. The specific version of the scheme may also be supplied. [2.3.5. The Default Style Definition Language Declaration] (references declaration) specifies how canonical references are constructed for this text. [2.3.6.3. Milestone Method 2.3. The Encoding Description 2.3.6. The Reference System Declaration] (canonical reference pattern) specifies an expression and replacement pattern for transforming a canonical reference into a URI. [2.3.6.3. Milestone Method 2.3.6. The Reference System Declaration 2.3.6.2. Search-and-Replace Method] (prefix definition) defines a prefixing scheme used in teidata.pointer values, showing how abbreviated URIs using the scheme may be expanded into full URIs. [16.2.3. Using Abbreviated Pointers] supplies a name which functions as the prefix for an abbreviated pointing scheme such as a private URI scheme. The prefix constitutes the text preceding the first colon. [a-z][a-z0-9\+\.\-]* (list of prefix definitions) contains a list of definitions of prefixing schemes used in teidata.pointer values, showing how abbreviated URIs using each scheme may be expanded into full URIs. [16.2.3. Using Abbreviated Pointers] (reference state) specifies one component of a canonical reference defined by the milestone method. [2.3.6.3. Milestone Method 2.3.6. The Reference System Declaration] specifies the fixed length of the reference component. (delimiter) supplies a delimiting string following the reference component. (classification declarations) contains one or more taxonomies defining any classificatory codes used elsewhere in the text. [2.3.7. The Classification Declaration 2.3. The Encoding Description] defines a typology either implicitly, by means of a bibliographic citation, or explicitly by a structured taxonomy. [2.3.7. The Classification Declaration] contains an individual descriptive category, possibly nested within a superordinate category, within a user-defined taxonomy. [2.3.7. The Classification Declaration] (category description) describes some category within a taxonomy or text typology, either in the form of a brief prose description or in terms of the situational parameters used by the TEI formal <textDesc>. [2.3.7. The Classification Declaration] (geographic coordinates declaration) documents the notation and the datum used for geographic coordinates expressed as content of the <geo> element elsewhere within the document. [2.3.8. The Geographic Coordinates Declaration] supplies a commonly used code name for the datum employed. Suggested values include: 1] WGS84 (World Geodetic System); 2] MGRS (Military Grid Reference System); 3] OSGB36 (ordnance survey great britain); 4] ED50 (European Datum coordinate system) WGS84 (World Geodetic System) a pair of numbers to be interpreted as latitude followed by longitude according to the World Geodetic System. MGRS (Military Grid Reference System) the values supplied are geospatial entity object codes, based on OSGB36 (ordnance survey great britain) the value supplied is to be interpreted as a British National Grid Reference. ED50 (European Datum coordinate system) the value supplied is to be interpreted as latitude followed by longitude according to the European Datum coordinate system. [^\p{C}\p{Z}]+ (unit declarations) provides information about units of measurement that are not members of the International System of Units. [2.3.9. The Unit Declaration] (unit definition) contains descriptive information related to a specific unit of measurement. [2.3.9. The Unit Declaration] defines how to calculate one unit of measure in terms of another. [2.3.9. The Unit Declaration] indicates a source unit of measure that is to be converted into another unit indicated in @toUnit. the target unit of measurement for a conversion from a source unit referenced in @fromUnit. (application information) records information about an application which has edited the TEI file. [2.3.11. The Application Information Element] provides information about an application which has acted upon the document. [2.3.11. The Application Information Element] supplies an identifier for the application, independent of its version number or display name. supplies a version number for the application, independent of its identifier or display name. [\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3} (text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting. [2.4. The Profile Description 2.1.1. The TEI Header and Its Components] contains a summary or formal abstract prefixed to an existing source document by the encoder. [2.4.4. Abstracts] contains information about the creation of a text. [2.4.1. Creation 2.4. The Profile Description] (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text. [2.4.2. Language Usage 2.4. The Profile Description 15.3.2. Declarable Elements] characterizes a single language or sublanguage used within a text. [2.4.2. Language Usage] (identifier) Supplies a language code constructed as defined in BCP 47 which is used to identify the language documented by this element, and which is referenced by the global @xml:lang attribute. specifies the approximate percentage (by volume) of the text which uses this language. (text classification) groups information which describes the nature or topic of a text in terms of a standard classification scheme, thesaurus, etc. [2.4.3. The Text Classification] contains a list of keywords or phrases identifying the topic or nature of a text. [2.4.3. The Text Classification] identifies the controlled vocabulary within which the set of keywords concerned is defined, for example by a <taxonomy> element, or by some other resource. (classification code) contains the classification code used for this text in some standard classification system. [2.4.3. The Text Classification] identifies the classification system in use, as defined by, e.g. a <taxonomy> element, or some other resource. (category reference) specifies one or more defined categories within some taxonomy or text typology. [2.4.3. The Text Classification] identifies the classification scheme within which the set of categories concerned is defined, for example by a <taxonomy> element, or by some other resource. (calendar description) contains a description of the calendar system used in any dating expression found in the text. [2.4. The Profile Description 2.4.5. Calendar Description] describes a calendar or dating system used in a dating formula in the text. [2.4.5. Calendar Description] (correspondence description) contains a description of the actions related to one act of correspondence. [2.4.6. Correspondence Description] (correspondence action) contains a structured description of the place, the name of a person/organization and the date related to the sending/receiving of a message or any other action related to the correspondence. [2.4.6. Correspondence Description] describes the nature of the action. Suggested values include: 1] sent; 2] received; 3] transmitted; 4] redirected; 5] forwarded sent information concerning the sending or dispatch of a message. received information concerning the receipt of a message. transmitted information concerning the transmission of a message, i.e. between the dispatch and the next receipt, redirect or forwarding. redirected information concerning the redirection of an unread message. forwarded information concerning the forwarding of a message. [^\p{C}\p{Z}]+ (correspondence context) provides references to preceding or following correspondence related to this piece of correspondence. [2.4.6. Correspondence Description] (non-TEI metadata) provides a container element into which metadata in non-TEI formats may be placed. [2.5. Non-TEI Metadata] (revision description) summarizes the revision history for a file. [2.6. The Revision Description 2.1.1. The TEI Header and Its Components] documents a change or set of changes made during the production of a source document, or during the revision of an electronic file. [2.6. The Revision Description 2.4.1. Creation 11.6. Identifying Changes and Revisions] points to one or more elements that belong to this change. groups a number of change descriptions associated with either the creation of a source text or the revision of an encoded text. [2.6. The Revision Description 11.6. Identifying Changes and Revisions] indicates whether the ordering of its child <change> elements is to be considered significant or not (TEI document) contains a single TEI-conformant document, combining a single TEI header with one or more members of the model.resource class. Multiple <TEI> elements may be combined within a <TEI> (or <teiCorpus>) element. [4. Default Text Structure 15.1. Varieties of Composite Text] specifies the version number of the TEI Guidelines against which this document is valid. [\d]+(\.[\d]+){0,2} contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample. [4. Default Text Structure 15.1. Varieties of Composite Text] (text body) contains the whole body of a single unitary text, excluding any front or back matter. [4. Default Text Structure] contains the body of a composite text, grouping together a sequence of distinct texts (or groups of such texts) which are regarded as a unit for some purpose, for example the collected works of an author, a sequence of prose essays, etc. [4. Default Text Structure 4.3.1. Grouped Texts 15.1. Varieties of Composite Text] (text division) contains a subdivision of the front, body, or back of a text. [4.1. Divisions of the Body] Abstract model violation: Lines may not contain higher-level structural elements such as div. Abstract model violation: p and ab may not contain higher-level structural elements such as div. contains a closing title or footer appearing at the end of a division of a text. [4.2.4. Content of Textual Divisions 4.2. Elements Common to All Divisions] contains the primary statement of responsibility given for a work on its title page or at the head or end of the work. [4.2.2. Openers and Closers 4.5. Front Matter] contains a brief description of the place, date, time, etc. of production of a letter, newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer. [4.2.2. Openers and Closers] groups together dateline, byline, salutation, and similar phrases appearing as a preliminary group at the start of a division, especially of a letter. [4.2. Elements Common to All Divisions] groups together salutations, datelines, and similar phrases appearing as a final group at the end of a division, especially of a letter. [4.2.2. Openers and Closers 4.2. Elements Common to All Divisions] (salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other division of a text, or the salutation in the closing of a letter, preface, etc. [4.2.2. Openers and Closers] (signature) contains the closing salutation, etc., appended to a foreword, dedicatory epistle, or other division of a text. [4.2.2. Openers and Closers] contains a postscript, e.g. to a letter. [4.2. Elements Common to All Divisions] (title page) contains the title page of a text, appearing within the front or back matter. [4.6. Title Pages] classifies the title page according to any convenient typology. [^\p{C}\p{Z}]+ (document title) contains the title of a document, including all its constituents, as given on a title page. [4.6. Title Pages] contains a subsection or division of the title of a work, as indicated on a title page. [4.6. Title Pages] specifies the role of this subdivision of the title. Suggested values include: 1] main; 2] sub (subordinate); 3] alt (alternate); 4] short; 5] desc (descriptive) main main title of the work sub (subordinate) subtitle of the work alt (alternate) alternative title of the work short abbreviated form of title desc (descriptive) descriptive paraphrase of the work [^\p{C}\p{Z}]+ (document author) contains the name of the author of the document, as given on the title page (often but not always contained in a byline). [4.6. Title Pages] contains a formal statement authorizing the publication of a work, sometimes required to appear on a title page or its verso. [4.6. Title Pages] (document edition) contains an edition statement as presented on a title page of a document. [4.6. Title Pages] (document imprint) contains the imprint statement (place and date of publication, publisher name), as given (usually) at the foot of a title page. [4.6. Title Pages] (document date) contains the date of a document, as given on a title page or in a dateline. [4.6. Title Pages] gives the value of the date in standard form, i.e. YYYY-MM-DD. (front matter) contains any prefatory matter (headers, abstracts, title page, prefaces, dedications, etc.) found at the start of a document, before the main body. [4.6. Title Pages 4. Default Text Structure] (back matter) contains any appendixes, etc. following the main part of a text. [4.7. Back Matter 4. Default Text Structure] (text description) provides a description of a text in terms of its situational parameters. [15.2.1. The Text Description] (participation description) describes the identifiable speakers, voices, or other participants in any kind of text or other persons named or otherwise referred to in a text, edition, or metadata. [15.2. Contextual Information] (setting description) describes the setting or settings within which a language interaction takes place, or other places otherwise referred to in a text, edition, or metadata. [15.2. Contextual Information 2.4. The Profile Description] (primary channel) describes the medium or channel by which a text is delivered or experienced. For a written text, this might be print, manuscript, email, etc.; for a spoken one, radio, telephone, face-to-face, etc. [15.2.1. The Text Description] specifies the mode of this channel with respect to speech and writing. s (spoken) w (written) sw (spoken to be written) e.g. dictation ws (written to be spoken) e.g. a script m (mixed) x (unknown or inapplicable) describes the internal composition of a text or text sample, for example as fragmentary, complete, etc. [15.2.1. The Text Description] specifies how the text was constituted. [^\p{C}\p{Z}]+ describes the nature and extent of originality of this text. [15.2.1. The Text Description] categorizes the derivation of the text. [^\p{C}\p{Z}]+ (domain of use) describes the most important social context in which the text was realized or for which it is intended, for example private vs. public, education, religion, etc. [15.2.1. The Text Description] categorizes the domain of use. [^\p{C}\p{Z}]+ describes the extent to which the text may be regarded as imaginative or non-imaginative, that is, as describing a fictional or a non-fictional world. [15.2.1. The Text Description] categorizes the factuality of the text. [^\p{C}\p{Z}]+ describes the extent, cardinality and nature of any interaction among those producing and experiencing the text, for example in the form of response or interjection, commentary, etc. [15.2.1. The Text Description] specifies the degree of interaction between active and passive participants in the text. [^\p{C}\p{Z}]+ specifies the number of active participants (or addressors) producing parts of the text. Suggested values include: 1] singular; 2] plural; 3] corporate; 4] unknown singular a single addressor plural many addressors corporate a corporate addressor unknown number of addressors unknown or unspecifiable [^\p{C}\p{Z}]+ specifies the number of passive participants (or addressees) to whom a text is directed or in whose presence it is created or performed. Suggested values include: 1] self; 2] single; 3] many; 4] group; 5] world self text is addressed to the originator e.g. a diary single text is addressed to one other person e.g. a personal letter many text is addressed to a countable number of others e.g. a conversation in which all participants are identified group text is addressed to an undefined but fixed number of participants e.g. a lecture world text is addressed to an undefined and indeterminately large number e.g. a published book [^\p{C}\p{Z}]+ describes the extent to which a text may be regarded as prepared or spontaneous. [15.2.1. The Text Description] a keyword characterizing the type of preparedness. [^\p{C}\p{Z}]+ characterizes a single purpose or communicative function of the text. [15.2.1. The Text Description] specifies a particular kind of purpose. [^\p{C}\p{Z}]+ specifies the extent to which this purpose predominates. high medium low unknown describes one particular setting in which a language interaction takes place. [15.2.3. The Setting Description] contains a brief informal description of the kind of place concerned, for example: a room, a restaurant, a park bench, etc. [15.2.3. The Setting Description] contains a brief informal description of what a participant in a language interaction is doing other than speaking, if anything. [15.2.3. The Setting Description] (facsimile) points to all or part of an image which corresponds with the content of the element. points to one or more <change> elements documenting a state or revision campaign to which the element bearing this attribute and its children have been assigned by the encoder. indicates the element within a transcription of the text containing at least the start of the writing represented by this zone or surface. gives the x coordinate value for the upper left corner of a rectangular space. (\-?[\d]+/\-?[\d]+) gives the y coordinate value for the upper left corner of a rectangular space. (\-?[\d]+/\-?[\d]+) gives the x coordinate value for the lower right corner of a rectangular space. (\-?[\d]+/\-?[\d]+) gives the y coordinate value for the lower right corner of a rectangular space. (\-?[\d]+/\-?[\d]+) identifies a two dimensional area by means of a series of pairs of numbers, each of which gives the x,y coordinates of a point on a line enclosing the area. (-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?) (-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?) (-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?) (-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?) contains a representation of some written source in the form of a set of images rather than as transcribed or encoded text. [11.1. Digital Facsimiles] contains a transcription or other representation of a single source document potentially forming part of a dossier génétique or collection of sources. [11.1. Digital Facsimiles 11.2.2. Embedded Transcription] defines a written surface as a two-dimensional coordinate space, optionally grouping one or more graphic representations of that space, zones of interest within that space, and transcriptions of the writing within them. [11.1. Digital Facsimiles 11.2.2. Embedded Transcription] describes the method by which this surface is or was connected to the main surface Sample values include: 1] glued; 2] pinned; 3] sewn [^\p{C}\p{Z}]+ indicates whether the surface is attached and folded in such a way as to provide two writing surfaces defines any kind of useful grouping of written surfaces, for example the recto and verso of a single leaf, which the encoder wishes to treat as a single unit. [11.1. Digital Facsimiles] defines any two-dimensional area within a <surface> element. [11.1. Digital Facsimiles 11.2.2. Embedded Transcription] indicates the amount by which this zone has been rotated clockwise, with respect to the normal orientation of the parent <surface> element as implied by the dimensions given in the <msDesc> element or by the coordinates of the <surface> itself. The orientation is expressed in arc degrees. defines any line passing through two or more points within a <surface> element. [11.1. Digital Facsimiles 11.2.2. Embedded Transcription] The first and last elements of this path are the same. To specify a closed polygon, use the zone element rather than the path element. identifies a line within the container or bounding box specified by the parent element by means of a series of two or more pairs of numbers, each of which gives the x,y coordinates of a point on the line. (-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?) (-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?) (-?[0-9]+(\.[0-9]+)?,-?[0-9]+(\.[0-9]+)?) (added span of text) marks the beginning of a longer sequence of text added by an author, scribe, annotator or corrector (see also <add>). [11.3.1.4. Additions and Deletions] The @spanTo attribute of is required. L'attribut spanTo est requis. contains an area of damage to the text witness. [11.3.3.1. Damage, Illegibility, and Supplied Text] (damaged span of text) marks the beginning of a longer sequence of text which is damaged in some way but still legible. [11.3.3.1. Damage, Illegibility, and Supplied Text] The @spanTo attribute of is required. L'attribut spanTo est requis. (deleted span of text) marks the beginning of a longer sequence of text deleted, marked as deleted, or otherwise signaled as superfluous or spurious by an author, scribe, annotator, or corrector. [11.3.1.4. Additions and Deletions] The @spanTo attribute of is required. L'attribut spanTo est requis. (editorial expansion) contains a sequence of letters added by an editor or transcriber when expanding an abbreviation. [11.3.1.2. Abbreviation and Expansion] (forme work) contains a running head (e.g. a header, footer), catchword, or similar material appearing on the current page. [11.5. Headers, Footers, and Similar Matter] classifies the material encoded according to some useful typology. [^\p{C}\p{Z}]+ contains one or more <handNote> elements documenting the different hands identified within the source texts. [11.3.2.1. Document Hands] marks the beginning of a sequence of text written in a new hand, or the beginning of a scribal stint. [11.3.2.1. Document Hands] indicates a <handNote> element describing the hand concerned. indicates restoration of text to an earlier state by cancellation of an editorial or authorial marking or instruction. [11.3.1.6. Cancellation of Deletions and Other Markings] indicates the location of a significant space in the text. [11.4.1. Space] (responsible party) (responsible party) indicates the individual responsible for identifying and measuring the space (dimension) indicates whether the space is horizontal or vertical. horizontal the space is horizontal. vertical the space is vertical. (substitution) groups one or more deletions (or surplus text) with one or more additions when the combination is to be regarded as a single intervention in the text. [11.3.1.5. Substitutions] must have at least one child add and at least one child del or surplus (substitution join) identifies a series of possibly fragmented additions, deletions, or other revisions on a manuscript that combine to make up a single intervention in the text [11.3.1.5. Substitutions] signifies text supplied by the transcriber or editor for any reason; for example because the original cannot be read due to physical damage, or because of an obvious omission by the author or scribe. [11.3.3.1. Damage, Illegibility, and Supplied Text] one or more words indicating why the text has had to be supplied, e.g. overbinding, faded-ink, lost-folio, omitted-in-original. [^\p{C}\p{Z}]+ marks text present in the source which the editor believes to be superfluous or redundant. [11.3.3.1. Damage, Illegibility, and Supplied Text] one or more words indicating why this text is believed to be superfluous, e.g. repeated, interpolated etc. [^\p{C}\p{Z}]+ (secluded text) Secluded. Marks text present in the source which the editor believes to be genuine but out of its original place (which is unknown). [11.3.1.7. Text Omitted from or Supplied in the Transcription] one or more words indicating why this text has been secluded, e.g. interpolated etc. [^\p{C}\p{Z}]+ contains the transcription of a topographic line in the source document [11.2.2. Embedded Transcription] supplies a list of transpositions, each of which is indicated at some point in a document typically by means of metamarks. [11.3.4.5. Transpositions] contains or describes any kind of graphic or written signal within a document the function of which is to determine how it should be read rather than forming part of the actual content of the document. [11.3.4.2. Metamarks] describes the function (for example status, insertion, deletion, transposition) of the metamark. [^\p{C}\p{Z}]+ identifies one or more elements to which the metamark applies. represents any kind of modification identified within a single document. [11.3.4.1. Generic Modification] indicates one or more cancelled interventions in a document which have subsequently been marked as reaffirmed or repeated. [11.3.4.4. Confirmation, Cancellation, and Reinstatement of Modifications] points to one or more elements representing the interventions which are being reasserted. contains a sequence of writing which has been retraced, for example by over-inking, to clarify or fix it. [11.3.4.3. Fixation and Clarification] describes a single textual transposition as an ordered list of at least two pointers specifying the order in which the elements indicated should be re-combined. [11.3.4.5. Transpositions] indicates one or more marked-up interventions in a document which have subsequently been marked for cancellation. [11.3.4.4. Confirmation, Cancellation, and Reinstatement of Modifications] points to one or more elements representing the interventions which are to be reverted or undone. indicates the kind of information held in this cell or in each cell of this row. Suggested values include: 1] label; 2] data label labelling or descriptive information only. data data values. [^\p{C}\p{Z}]+ indicates the number of rows occupied by this cell or row. (columns) indicates the number of columns occupied by this cell or row. contains text displayed in tabular form, in rows and columns. [14.1.1. TEI Tables] indicates the number of rows in the table. (columns) indicates the number of columns in each row of the table. contains one row of a table. [14.1.1. TEI Tables] contains one cell of a table. [14.1.1. TEI Tables] contains a mathematical or other formula. [14.2. Formulæ and Mathematical Expressions] encodes the presence of music notation in a text [14.3. Notated Music in Written Text] groups elements representing or containing graphic information such as an illustration, formula, or figure. [14.4. Specific Elements for Graphic Images] (description of figure) contains a brief prose description of the appearance or content of a graphic figure, for use when documenting an image without displaying it. [14.4. Specific Elements for Graphic Images] indicates type of entry, in dictionaries with multiple types. Suggested values include: 1] main; 2] hom (homograph); 3] xref (cross reference); 4] affix; 5] abbr (abbreviation); 6] supplemental; 7] foreign main a main entry (default). hom (homograph) groups information relating to one homograph within an entry. xref (cross reference) a reduced entry whose only function is to point to another main entry (e.g. for forms of an irregular verb or for variant spellings: was pointing to be, or esthete to aesthete). affix an entry for a prefix, infix, or suffix. abbr (abbreviation) an entry for an abbreviation. supplemental a supplemental entry (for use in dictionaries which issue supplements to their main work in which they include updated information about entries). foreign an entry for a foreign word in a monolingual dictionary. [^\p{C}\p{Z}]+ gives an expanded form of information presented more concisely in the dictionary gives the list of split values for a merged form gives a value which lacks any realization in the printed source text. indicates an <anchor> element typically elsewhere in the document, but possibly in another document, which is the original location of this component. gives a reference to another element, where the original appears as a merged form. (optional) indicates whether the element is optional or not groups a sequence of entries within any kind of lexical resource, such as a dictionary or lexicon which function as a single unit, for example a set of homographs. [9.1. Dictionary Body and Overall Structure] contains a single structured entry in any kind of lexical resource, such as a dictionary or lexicon. [9.1. Dictionary Body and Overall Structure 9.2. The Structure of Dictionary Entries] (unstructured entry) contains a single unstructured entry in any kind of lexical resource, such as a dictionary or lexicon. [9.1. Dictionary Body and Overall Structure 9.2. The Structure of Dictionary Entries] (homograph) groups information relating to one homograph within an entry. [9.2. The Structure of Dictionary Entries] groups together all information relating to one word sense in a dictionary entry, for example definitions, examples, and translation equivalents. [9.2. The Structure of Dictionary Entries] gives the nesting depth of this sense. (dictionary scrap) encloses a part of a dictionary entry in which other phrase-level dictionary elements are freely combined. [9.1. Dictionary Body and Overall Structure 9.2. The Structure of Dictionary Entries] (form information group) groups all the information on the written and spoken forms of one headword. [9.3.1. Information on Written and Spoken Forms] classifies form as simple, compound, etc. Suggested values include: 1] simple; 2] lemma; 3] variant; 4] compound; 5] derivative; 6] inflected; 7] phrase simple single free lexical item lemma the headword itself variant a variant form compound word formed from simple lexical items derivative word derived from headword inflected word in other than usual dictionary form phrase multiple-word lexical item [^\p{C}\p{Z}]+ (orthographic form) gives the orthographic form of a dictionary headword. [9.3.1. Information on Written and Spoken Forms] gives the type of spelling. [^\p{C}\p{Z}]+ (pronunciation) contains the pronunciation(s) of the word. [9.3.1. Information on Written and Spoken Forms] (hyphenation) contains a hyphenated form of a dictionary headword, or hyphenation information in some other form. [9.3.1. Information on Written and Spoken Forms] (syllabification) contains the syllabification of the headword. [9.3.1. Information on Written and Spoken Forms] contains the stress pattern for a dictionary headword, if given separately. [9.3.1. Information on Written and Spoken Forms] (grammatical information) within an entry in a dictionary or a terminological data file, contains grammatical information relating to a term, word, or form. [9.3.2. Grammatical Information] classifies the grammatical information given according to some convenient typology—in the case of terminological information, preferably the dictionary of data element types specified in ISO 12620. [^\p{C}\p{Z}]+ (gender) identifies the morphological gender of a lexical item, as given in the dictionary. [9.3.1. Information on Written and Spoken Forms] indicates grammatical number associated with a form, as given in a dictionary. [9.3.1. Information on Written and Spoken Forms 9.3.2. Grammatical Information] contains grammatical case information given by a dictionary for a given form. [9.3.1. Information on Written and Spoken Forms] (person) contains an indication of the grammatical person (1st, 2nd, 3rd, etc.) associated with a given inflected form in a dictionary. [9.3.1. Information on Written and Spoken Forms] (tense) indicates the grammatical tense associated with a given inflected form in a dictionary. [9.3.1. Information on Written and Spoken Forms] contains information about the grammatical mood of verbs (e.g. indicative, subjunctive, imperative). [9.3.1. Information on Written and Spoken Forms] (inflectional class) indicates the inflectional class associated with a lexical item. [9.3.1. Information on Written and Spoken Forms] indicates the type of indicator used to specify the inflection class, when it is necessary to distinguish between the usual abbreviated indications (e.g. inv) and other kinds of indicators, such as special codes referring to conjugation patterns, etc. [^\p{C}\p{Z}]+ (grammatical information group) groups morpho-syntactic information about a lexical item, e.g. <pos>, <gen>, <number>, <case>, or <iType> (inflectional class). [9.3.2. Grammatical Information] (part of speech) indicates the part of speech assigned to a dictionary headword such as noun, verb, or adjective. [9.3.2. Grammatical Information] (subcategorization) contains subcategorization information (transitive/intransitive, countable/non-countable, etc.) [9.3.2. Grammatical Information] (collocate) contains any sequence of words that co-occur with the headword with significant frequency. [9.3.2. Grammatical Information] (definition) contains definition text in a dictionary entry. [9.3.3.1. Definitions] (etymology) encloses the etymological information in a dictionary entry. [9.3.4. Etymological Information] (language name) contains the name of a language mentioned in etymological or other linguistic discussion. [9.3.4. Etymological Information] (usage) contains usage information in a dictionary entry. [9.3.5.2. Usage Information and Other Labels] classifies the usage information using any convenient typology. [^\p{C}\p{Z}]+ (label) contains a label for a form, example, translation, or other piece of information, e.g. abbreviation for, contraction of, literally, approximately, synonyms:, etc. [9.3.1. Information on Written and Spoken Forms 9.3.3.2. Translation Equivalents 9.3.5.3. Cross-References to Other Entries] classifies the label using any convenient typology. [^\p{C}\p{Z}]+ (cross-reference phrase) contains a phrase, sentence, or icon referring the reader to some other location in this or another text. [9.3.5.3. Cross-References to Other Entries] indicates the type of cross reference, using any convenient typology. [^\p{C}\p{Z}]+ (related entry) contains a dictionary entry for a lexical item related to the headword, such as a compound phrase or derived form, embedded inside a larger entry. [9.3.6. Related Entries] (orthographic-form reference) in a dictionary example, indicates a reference to the orthographic form(s) of the headword. [9.4. Headword and Pronunciation References] indicates the kind of typographic modification made to the headword in the reference. [^\p{C}\p{Z}]+ (pronunciation reference) in a dictionary example, indicates a reference to the pronunciation(s) of the headword. [9.4. Headword and Pronunciation References] (script statement) contains a citation giving details of the script used for a spoken text. [8.2. Documenting the Source of Transcribed Speech 2.2.7. The Source Description 15.3.2. Declarable Elements] (recording statement) describes a set of recordings used as the basis for transcription of a spoken text. [8.2. Documenting the Source of Transcribed Speech 2.2.7. The Source Description] (recording event) provides details of an audio or video recording event used as the source of a spoken text, either directly or from a public broadcast. [8.2. Documenting the Source of Transcribed Speech 15.3.2. Declarable Elements] the kind of recording. audio audio recording video audio and video recording provides technical details of the equipment and media used for an audio or video recording used as the source for a spoken text. [8.2. Documenting the Source of Transcribed Speech 15.3.2. Declarable Elements] describes a broadcast used as the source of a spoken text. [8.2. Documenting the Source of Transcribed Speech 15.3.2. Declarable Elements] describes the set of transcription conventions used, particularly for spoken material. [8.2. Documenting the Source of Transcribed Speech] supplies an identifier for the encoding convention, independent of any version number. supplies a version number for the encoding conventions used, if any. [\d]+[a-z]*[\d]*(\.[\d]+[a-z]*[\d]*){0,3} (utterance) contains a stretch of speech usually preceded and followed by silence or by a change of speaker. [8.3.1. Utterances] (transition) indicates the nature of the transition between this utterance and the previous one. smooth this utterance begins without unusual pause or rapidity. latching this utterance begins with a markedly shorter pause than normal. overlap this utterance begins before the previous one has finished. pause this utterance begins after a noticeable pause. marks a pause either between or within utterances. [8.3.2. Pausing] marks any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical backchannels, etc. [8.3.3. Vocal, Kinesic, Incident] indicates whether or not the phenomenon is repeated. unknown inapplicable marks any communicative phenomenon, not necessarily vocalized, for example a gesture, frown, etc. [8.3.3. Vocal, Kinesic, Incident] indicates whether or not the phenomenon is repeated. unknown inapplicable marks any phenomenon or occurrence, not necessarily vocalized or communicative, for example incidental noises or other events affecting communication. [8.3.3. Vocal, Kinesic, Incident] contains a passage of written text revealed to participants in the course of a spoken text. [8.3.4. Writing] indicates whether the writing is revealed all at once or gradually. unknown inapplicable marks the point at which some paralinguistic feature of a series of utterances by any one speaker changes. [8.3.6. Shifts] The @new attribute should always be supplied; use the special value "normal" to indicate that the feature concerned ceases to be remarkable at this point. a paralinguistic feature. Suggested values include: 1] tempo; 2] loud; 3] pitch; 4] tension; 5] rhythm; 6] voice tempo speed of utterance. loud loudness. pitch pitch range. tension tension or stress pattern. rhythm rhythmic qualities. voice voice quality. [^\p{C}\p{Z}]+ specifies the new state of the paralinguistic feature specified. [^\p{C}\p{Z}]+ groups together various annotations, e.g. for parallel interpretations of a spoken segment. [8.4.6. Analytic Coding] supplies the value of a date or time in some custom standard form. [^\p{C}\p{Z}]+ specifies the earliest possible date for the event in some custom standard form. [^\p{C}\p{Z}]+ specifies the latest possible date for the event in some custom standard form. [^\p{C}\p{Z}]+ indicates the starting point of the period in some custom standard form. [^\p{C}\p{Z}]+ indicates the ending point of the period in some custom standard form. [^\p{C}\p{Z}]+ supplies a pointer to some location defining a named point in time with reference to which the datable item is understood to have occurred supplies a pointer to a <calendar> element or other means of interpreting the values of the custom dating attributes. supplies the value of a date or time in a standard form. [0-9.,DHMPRSTWYZ/:+\-]+ specifies the earliest possible date for the event in standard form, e.g. yyyy-mm-dd. [0-9.,DHMPRSTWYZ/:+\-]+ specifies the latest possible date for the event in standard form, e.g. yyyy-mm-dd. [0-9.,DHMPRSTWYZ/:+\-]+ indicates the starting point of the period in standard form. [0-9.,DHMPRSTWYZ/:+\-]+ indicates the ending point of the period in standard form. [0-9.,DHMPRSTWYZ/:+\-]+ (organization name) contains an organizational name. [13.2.2. Organizational Names] (personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including one or more of the person's forenames, surnames, honorifics, added names, etc. [13.2.1. Personal Names] contains a family (inherited) name, as opposed to a given, baptismal, or nick name. [13.2.1. Personal Names] contains a forename, given or baptismal name. [13.2.1. Personal Names] (generational name component) contains a name component used to distinguish otherwise similar names on the basis of the relative ages or generations of the persons named. [13.2.1. Personal Names] (name link) contains a connecting phrase or link used within a name but not regarded as part of it, such as van der or of. [13.2.1. Personal Names] (additional name) contains an additional name component, such as a nickname, epithet, or alias, or any other descriptive phrase used within a personal name. [13.2.1. Personal Names] contains a name component which indicates that the referent has a particular role or position in society, such as an official title or rank. [13.2.1. Personal Names] contains an absolute or relative place name. [13.2.3. Place Names] contains the name of a geo-political unit consisting of two or more nation states or countries. [13.2.3. Place Names] contains the name of a geo-political unit, such as a nation, country, colony, or commonwealth, larger than or administratively superior to a region and smaller than a bloc. [13.2.3. Place Names] contains the name of an administrative unit such as a state, province, or county, larger than a settlement, but smaller than a country. [13.2.3. Place Names] contains the name of a settlement such as a city, town, or village identified as a single geo-political or administrative unit. [13.2.3. Place Names] contains the name of any kind of subdivision of a settlement, such as a parish, ward, or other administrative or geographic unit. [13.2.3. Place Names] marks that part of a relative temporal or spatial expression which indicates the direction of the offset between the two place names, dates, or times involved in the expression. [13.2.3. Place Names] (geographical name) identifies a name associated with some geographical feature such as Windrush Valley or Mount Sinai. [13.2.3. Place Names] (geographical feature name) contains a common noun identifying some geographical feature contained within a geographic name, such as valley, mount, etc. [13.2.3. Place Names] contains an informal description of a person's present or past affiliation with some organization, for example an employer or sponsor. [15.2.2. The Participant Description] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] sponsor; 2] recommend; 3] discredit; 4] pledged [^\p{C}\p{Z}]+ specifies the age of a person. [13.3.2.1. Personal Characteristics] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] western; 2] sui; 3] subjective; 4] objective; 5] inWorld (in world); 6] chronological; 7] biological; 8] psychological; 9] functional [^\p{C}\p{Z}]+ supplies a numeric code representing the age or age group contains information about a person's birth, such as its date and place. [15.2.2. The Participant Description] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] caesarean (caesarean section); 2] vaginal (vaginal delivery); 3] exNihilo (ex nihilo); 4] incorporated; 5] founded; 6] established [^\p{C}\p{Z}]+ contains information about the physical climate of a place. [13.3.4.3. States, Traits, and Events] contains information about a person's death, such as its date and place. [15.2.2. The Participant Description] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] proclaimed; 2] assumed; 3] verified; 4] clinical; 5] brain; 6] natural; 7] unnatural; 8] fragmentation; 9] dissolution [^\p{C}\p{Z}]+ contains a description of the educational experience of a person. [15.2.2. The Participant Description] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] primary; 2] secondary; 3] undergraduate; 4] graduate; 5] residency; 6] apprenticeship [^\p{C}\p{Z}]+ contains data relating to any kind of significant event associated with a person, place, or organization. [13.3.1. Basic Principles] indicates the location of an event by pointing to a <place> element specifies the faith, religion, or belief set of a person. [13.3.2.1. Personal Characteristics] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] practicing; 2] clandestine; 3] patrilineal; 4] matrilineal; 5] convert [^\p{C}\p{Z}]+ contains information about a person's period of activity. [13.3.2.1. Personal Characteristics] (geographical coordinates) contains any expression of a set of geographic coordinates, representing a point, line, or area on the surface of the earth in some notation. [13.3.4.1. Varieties of Location] (language knowledge) summarizes the state of a person's linguistic knowledge, either as prose or by a list of <langKnown> elements. [13.3.2.1. Personal Characteristics] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] listening; 2] speaking; 3] reading; 4] writing [^\p{C}\p{Z}]+ supplies one or more valid language tags for the languages specified (language known) summarizes the state of a person's linguistic competence, i.e., knowledge of a single language. [15.2.2. The Participant Description] supplies a valid language tag for the language concerned. a code indicating the person's level of knowledge for this language [^\p{C}\p{Z}]+ (list of organizations) contains a list of elements, each of which provides information about an identifiable organization. [13.2.2. Organizational Names] (list of events) contains a list of descriptions, each of which provides information about an identifiable event. [13.3.1. Basic Principles] (list of persons) contains a list of descriptions, each of which provides information about an identifiable person or a group of people, for example the participants in a language interaction, or the people referred to in a historical source. [13.3.2. The Person Element 15.2. Contextual Information 2.4. The Profile Description 15.3.2. Declarable Elements] (list of places) contains a list of places, optionally followed by a list of relationships (other than containment) defined amongst them. [2.2.7. The Source Description 13.3.4. Places] defines the location of a place as a set of geographical coordinates, in terms of other named geo-political entities, or as an address. [13.3.4. Places] contains an informal description of a person's present or past nationality or citizenship. [15.2.2. The Participant Description] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] birth; 2] naturalised; 3] self-assigned [^\p{C}\p{Z}]+ contains an informal description of a person's trade, profession or occupation. [15.2.2. The Participant Description] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] primary; 2] other; 3] paid; 4] unpaid [^\p{C}\p{Z}]+ indicates the classification system or taxonomy in use, for example by supplying the identifier of a <taxonomy> element, or pointing to some other resource. identifies an occupation code defined within the classification system or taxonomy defined by the @scheme attribute. (organization) provides information about an identifiable organization such as a business, a tribe, or any other grouping of people. [13.3.3. Organizational Data] specifies a primary role or classification for the organization. [^\p{C}\p{Z}]+ provides information about relationships identified amongst people, places, and organizations, either informally as prose or as formally expressed relation links. [13.3.2.3. Personal Relationships] provides information about an identifiable individual, for example a participant in a language interaction, or a person referred to in a historical source. [13.3.2. The Person Element 15.2.2. The Participant Description] specifies a primary role or classification for the person. [^\p{C}\p{Z}]+ specifies the sex of the person. [^\p{C}\p{Z}]+ specifies an age group for the person. [^\p{C}\p{Z}]+ provides information about one of the personalities identified for a given individual, where an individual has multiple personalities. [13.3.2. The Person Element] specifies a primary role or classification for the persona. [^\p{C}\p{Z}]+ specifies the sex of the persona. [^\p{C}\p{Z}]+ specifies an age group for the persona. [^\p{C}\p{Z}]+ (personal group) describes a group of individuals treated as a single person for analytic purposes. [15.2.2. The Participant Description] specifies the role of this group of participants in the interaction. [^\p{C}\p{Z}]+ specifies the sex of the participant group. [^\p{C}\p{Z}]+ specifies the age group of the participants. [^\p{C}\p{Z}]+ describes informally the size or approximate size of the group for example by means of a number and an indication of accuracy e.g. approx 200. [^\p{C}\p{Z}]+ contains data about a geographic location [13.3.4. Places] contains information about the population of a place. [13.3.4.3. States, Traits, and Events] (relationship) describes any kind of relationship or linkage amongst a specified group of places, events, persons, objects or other items. [13.3.2.3. Personal Relationships] One of the attributes 'name', 'ref' or 'key' must be supplied Only one of the attributes @active and @mutual may be supplied the attribute 'passive' may be supplied only if the attribute 'active' is supplied supplies a name for the kind of relationship of which this is an instance. [^\p{C}\p{Z}]+ identifies the active participants in a non-mutual relationship, or all the participants in a mutual one. supplies a list of participants amongst all of whom the relationship holds equally. identifies the passive participants in a non-mutual relationship. describes a person's present or past places of residence. [15.2.2. The Participant Description] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] primary; 2] secondary; 3] temporary; 4] permanent [^\p{C}\p{Z}]+ specifies the sex of a person. [13.3.2.1. Personal Characteristics] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] explicit; 2] implicit [^\p{C}\p{Z}]+ supplies a coded value for sex [^\p{C}\p{Z}]+ (socio-economic status) contains an informal description of a person's perceived social or economic status. [15.2.2. The Participant Description] characterizes the element in some sense, using any convenient classification scheme or typology. Sample values include: 1] atBirth; 2] atDeath; 3] dependent; 4] inherited; 5] independent [^\p{C}\p{Z}]+ identifies the classification system or taxonomy in use, for example by pointing to a locally-defined <taxonomy> element or by supplying a URI for an externally-defined system. identifies a status code defined within the classification system or taxonomy defined by the @scheme attribute. contains a description of some status or quality attributed to a person, place, or organization often at some specific time or for a specific date range. [13.3.1. Basic Principles 13.3.2.1. Personal Characteristics] contains information about the physical terrain of a place. [13.3.4.3. States, Traits, and Events] contains a description of some status or quality attributed to a person, place, or organization typically, but not necessarily, independent of the volition or action of the holder and usually not at some specific time or for a specific date range. [13.3.1. Basic Principles 13.3.2.1. Personal Characteristics] (name of an object) contains a proper noun or noun phrase used to refer to an object. [13.2.4. Object Names] contains a description of a single identifiable physical object. [13.3.5. Objects] (list of objects) contains a list of descriptions, each of which provides information about an identifiable physical object. [13.3.5. Objects] (object identifier) groups one or more identifiers or pieces of locating information concerning a single object. [13.3.5. Objects] An objectIdentifier must contain at minimum a single piece of locating or identifying information. (canonical name) contains the definition for a canonical name or name component of any kind. [13.3.6. Names and Nyms] points to constituent nyms (list of canonical names) contains a list of nyms, that is, standardized names for any thing. [13.3.6. Names and Nyms] (corresponds) points to elements that correspond to the current element in some way. (synchronous) points to elements that are synchronous with the current element. points to an element that is the same as the current element. points to an element of which the current element is a copy. points to the next element of a virtual aggregate of which the current element is part. (previous) points to the previous element of a virtual aggregate of which the current element is part. points to elements that are in exclusive alternation with the current element. selects one or more alternants; if one alternant is selected, the ambiguity or uncertainty is marked as resolved. If more than one alternant is selected, the degree of ambiguity or uncertainty is marked as reduced by the number of alternants not selected. defines an association or hypertextual link among elements or passages, of some type not more precisely specifiable by other elements. [16.1. Links] You must supply at least two values for @target or on (link group) defines a collection of associations or hypertextual links. [16.1. Links] (anonymous block) contains any arbitrary component-level unit of text, acting as an anonymous container for phrase or inter level elements analogous to, but without the semantic baggage of, a paragraph. [16.3. Blocks, Segments, and Anchors] Abstract model violation: ab may not occur inside paragraphs or other ab elements. Abstract model violation: Lines may not contain higher-level divisions such as p or ab. (anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element. [8.4.2. Synchronization and Overlap 16.5. Correspondence and Alignment] (arbitrary segment) represents any segmentation of text below the chunk level. [16.3. Blocks, Segments, and Anchors 6.2. Components of the Verse Line 7.2.5. Speech Contents] indicates a point in time either relative to other elements in the same timeline tag, or absolutely. [16.4.2. Placing Synchronous Events in Time] supplies an absolute value for the time. specifies the unit of time in which the @interval value is expressed, if this is not inherited from the parent <timeline>. Suggested values include: 1] d (days); 2] h (hours); 3] min (minutes); 4] s (seconds); 5] ms (milliseconds) d (days) h (hours) min (minutes) s (seconds) ms (milliseconds) [^\p{C}\p{Z}]+ specifies a time interval either as a number or as one of the keywords defined by the datatype teidata.interval regular irregular unknown identifies the reference point for determining the time of the current <when> element, which is obtained by adding the interval to the time of the reference point. provides a set of ordered points in time which can be linked to elements of a spoken text to create a temporal alignment of that text. [16.4.2. Placing Synchronous Events in Time] designates the origin of the timeline, i.e. the time at which it begins. specifies the unit of time corresponding to the @interval value of the timeline or of its constituent points in time. Suggested values include: 1] d (days); 2] h (hours); 3] min (minutes); 4] s (seconds); 5] ms (milliseconds) d (days) h (hours) min (minutes) s (seconds) ms (milliseconds) [^\p{C}\p{Z}]+ specifies a time interval either as a positive integral value or using one of a set of predefined codes. regular irregular unknown identifies a possibly fragmented segment of text, by pointing at the possibly discontiguous elements which compose it. [16.7. Aggregation] You must supply at least two values for @target on specifies the name of an element which this aggregation may be understood to represent. indicates whether the targets to be joined include the entire element indicated (the entire subtree including its root), or just the children of the target (the branches of the subtree). root the rooted subtrees indicated by the @target attribute are joined, each subtree become a child of the virtual element created by the join branches the children of the subtrees indicated by the @target attribute become the children of the virtual element (i.e. the roots of the subtrees are discarded) (join group) groups a collection of <join> elements and possibly pointers. [16.7. Aggregation] supplies the default value for the @result on each <join> included within the group. (alternation) identifies an alternation or a set of choices among elements or passages. [16.8. Alternation] specifies the destination of the reference by supplying one or more URI References states whether the alternations gathered in this collection are exclusive or inclusive. excl (exclusive) indicates that the alternation is exclusive, i.e. that at most one of the alternatives occurs. incl (inclusive) indicates that the alternation is not exclusive, i.e. that one or more of the alternatives occur. If @mode is excl, each weight states the probability that the corresponding alternative occurs. If @mode is incl each weight states the probability that the corresponding alternative occurs given that at least one of the other alternatives occurs. (alternation group) groups a collection of <alt> elements and possibly pointers. [16.8. Alternation] states whether the alternations gathered in this collection are exclusive or inclusive. excl (exclusive) indicates that the alternation is exclusive, i.e. that at most one of the alternatives occurs. incl (inclusive) indicates that the alternation is not exclusive, i.e. that one or more of the alternatives occur. Functions as a container element for linked data, contextual information, and stand-off annotations embedded in a TEI document. [16.10. The standOff Container] This element must have a @type attribute, since it is nested inside a contains a list of annotations, typically encoded as <annotation>, <annotationBlock>, or <note>, possibly organized with nested <listAnnotation> elements. [16.10. The standOff Container] represents an annotation following the Web Annotation Data Model. [16.10. The standOff Container] (identifier) provides a unique identifier for the element bearing the attribute. specifies the destination of the reference by supplying one or more URI References assessing intent is to assess the target resource in some way, rather than simply make a comment about it bookmarking intent is to create a bookmark to the target or part thereof classifying intent is to classify the target in some way commenting intent is to comment about the target describing intent is to describe the target, rather than (for example) comment on it editing intent is to request an edit or a change to the target resource highlighting intent is to highlight the target resource or a segment thereof identifying intent is to assign an identity to the target linking intent is to link to a resource related to the target moderating intent is to assign some value or quality to the target questioning intent is to ask a question about the target replying intent is to reply to a previous statement, either an annotation or another resource tagging intent is to associate a tag with the target (normalized) provides the normalized/standardized form of information present in the source text in a non-normalized form (original) gives the original string or is the empty string when the element does not appear in the source text. provides a lemma (base form) for the word, typically uninflected and serving both as an identifier (e.g. in dictionary contexts, as a headword), and as a basis for potential inflections. provides a pointer to a definition of the lemma for the word, for example in an online lexicon. (part of speech) indicates the part of speech assigned to a token (i.e. information on whether it is a noun, adjective, or verb), usually according to some official reference vocabulary (e.g. for German: STTS, for English: CLAWS, for Polish: NKJP, etc.). (morphosyntactic description) supplies morphosyntactic information for a token, usually according to some official reference vocabulary (e.g. for German: STTS-large tagset; for a feature description system designed as (pragmatically) universal, see Universal Features). when present, it provides information on whether the token in question is adjacent to another, and if so, on which side. The definition of this attribute is adapted from ISO MAF (Morpho-syntactic Annotation Framework), ISO 24611:2012. no (the token is not adjacent to another) left (there is no whitespace on the left side of the token) right (there is no whitespace on the right side of the token) both (there is no whitespace on either side of the token) overlap (the token overlaps with another; other devices (specifying the extent and the area of overlap) are needed to more precisely locate this token in the character stream) (analysis) indicates one or more elements containing interpretations of the element on which the @ana attribute appears. (s-unit) contains a sentence-like division of a text. [17.1. Linguistic Segment Categories 8.4.1. Segmentation] You may not nest one s element within another: use seg instead (clause) represents a grammatical clause. [17.1. Linguistic Segment Categories] (phrase) represents a grammatical phrase. [17.1. Linguistic Segment Categories] (word) represents a grammatical (not necessarily orthographic) word. [17.1. Linguistic Segment Categories 17.4.2. Lightweight Linguistic Annotation] (morpheme) represents a grammatical morpheme. [17.1. Linguistic Segment Categories] supplies the morpheme's base form. [^\p{C}\p{Z}]+ (character) represents a character. [17.1. Linguistic Segment Categories] (punctuation character) contains a character or string of characters regarded as constituting a single punctuation mark. [17.1.2. Below the Word Level 17.4.2. Lightweight Linguistic Annotation] indicates the extent to which this punctuation mark conventionally separates words or phrases strong the punctuation mark is a word separator weak the punctuation mark is not a word separator inter the punctuation mark may or may not be a word separator provides a name for the kind of unit delimited by this punctuation mark. [^\p{C}\p{Z}]+ indicates whether this punctuation mark precedes or follows the unit it delimits. associates an interpretative annotation directly with a span of text. [17.3. Spans and Interpretations] Only one of the attributes @target and @from may be supplied on Only one of the attributes @target and @to may be supplied on If @to is supplied on , @from must be supplied as well The attributes @to and @from on may each contain only a single value gives the identifier of the node which is the starting point of the span of text being annotated; if not accompanied by a @to attribute, gives the identifier of the node of the entire span of text being annotated. gives the identifier of the node which is the end-point of the span of text being annotated. (span group) collects together span tags. [17.3. Spans and Interpretations] (feature system declaration) provides a feature system declaration comprising one or more feature structure declarations or feature structure declaration links. [18.11. Feature System Declaration] (feature structure declaration) declares one type of feature structure. [18.11. Feature System Declaration] gives a name for the type of feature structure being declared. [^\p{C}\p{Z}]+ gives the name of one or more typed feature structures from which this type inherits feature specifications and constraints; if this type includes a feature specification with the same name as that of any of those specified by this attribute, or if more than one specification of the same name is inherited, then the set of possible values is defined by unification. Similarly, the set of constraints applicable is derived by combining those specified explicitly within this element with those implied by the @baseTypes attribute. When no @baseTypes attribute is specified, no feature specification or constraint is inherited. (feature system description (in FSD)) describes in prose what is represented by the type of feature structure declared in the enclosing fsDecl. [18.11. Feature System Declaration] (feature declaration) declares a single feature, specifying its name, organization, range of allowed values, and optionally its default value. [18.11. Feature System Declaration] a single word which follows the rules defining a legal XML name (see ), indicating the name of the feature being declared; matches the @name attribute of <f> elements in the text. indicates whether or not the value of this feature may be present. (feature description (in FSD)) describes in prose what is represented by the feature being declared and its values. [18.11. Feature System Declaration] (value range) defines the range of allowed values for a feature, in the form of an <fs>, <vAlt>, or primitive value; for the value of an <f> to be valid, it must be subsumed by the specified range; if the <f> contains multiple values (as sanctioned by the @org attribute), then each value must be subsumed by the <vRange>. [18.11. Feature System Declaration] (value default) declares the default value to be supplied when a feature structure does not contain an instance of <f> for this name; if unconditional, it is specified as one (or, depending on the value of the @org attribute of the enclosing <fDecl>) more <fs> elements or primitive values; if conditional, it is specified as one or more <if> elements; if no default is specified, or no condition matches, the value none is assumed. [18.11. Feature System Declaration] (feature structure) represents a feature structure, that is, a collection of feature-value pairs organized as a structural unit. [18.2. Elementary Feature Structures and the Binary Feature Value] specifies the type of the feature structure. [^\p{C}\p{Z}]+ (features) references the feature-value specifications making up this feature structure. (feature) represents a feature value specification, that is, the association of a name with a value of any of several different types. [18.2. Elementary Feature Structures and the Binary Feature Value] a single word which follows the rules defining a legal XML name (see ), providing a name for the feature. (feature value) references any element which can be used to represent the value of a feature. (binary value) represents the value part of a feature-value specification which can contain either of exactly two possible values. [18.2. Elementary Feature Structures and the Binary Feature Value] supplies a binary value. (symbolic value) represents the value part of a feature-value specification which contains one of a finite list of symbols. [18.3. Other Atomic Feature Values] supplies a symbolic value for the feature, one of a finite list that may be specified in a feature declaration. [^\p{C}\p{Z}]+ (numeric value) represents the value part of a feature-value specification which contains a numeric value or range. [18.3. Other Atomic Feature Values] supplies a lower bound for the numeric value represented, and also (if @max is not supplied) its upper bound. (\-?[\d]+/\-?[\d]+) supplies an upper bound for the numeric value represented. (\-?[\d]+/\-?[\d]+) specifies whether the value represented should be truncated to give an integer value. (string value) represents the value part of a feature-value specification which contains a string. [18.3. Other Atomic Feature Values] (value label) represents the value part of a feature-value specification which appears at more than one point in a feature structure. [18.6. Re-entrant Feature Structures] supplies a name identifying the sharing point. [^\p{C}\p{Z}]+ (collection of values) represents the value part of a feature-value specification which contains multiple values organized as a set, bag, or list. [18.7. Collections as Complex Feature Values] (organization) indicates organization of given value or values as set, bag or list. set indicates that the given values are organized as a set. bag indicates that the given values are organized as a bag (multiset). list indicates that the given values are organized as a list. (default feature value) represents the value part of a feature-value specification which contains a defaulted value. [18.9. Default Values] (value alternation) represents the value part of a feature-value specification which contains a set of values, only one of which can be valid. [18.8.1. Alternation] (feature library) assembles a library of <f> (feature) elements. [18.4. Feature Libraries and Feature-Value Libraries] (feature-value library) assembles a library of reusable feature value elements (including complete feature structures). [18.4. Feature Libraries and Feature-Value Libraries] provides the name of the character or glyph property being defined. provides the value of the character or glyph property being defined. specifies the version number of the Unicode Standard in which this property name is defined. Suggested values include: 1] 1.0.1; 2] 1.1; 3] 2.0; 4] 2.1; 5] 3.0; 6] 3.1; 7] 3.2; 8] 4.0; 9] 4.1; 10] 5.0; 11] 5.1; 12] 5.2; 13] 6.0; 14] 6.1; 15] 6.2; 16] 6.3; 17] 7.0; 18] 8.0; 19] 9.0; 20] 10.0; 21] 11.0; 22] 12.0; 23] 12.1; 24] unassigned 1.0.1 1.1 2.0 2.1 3.0 3.1 3.2 4.0 4.1 5.0 5.1 5.2 6.0 6.1 6.2 6.3 7.0 8.0 9.0 10.0 11.0 12.0 12.1 unassigned [^\p{C}\p{Z}]+ (character or glyph) represents a glyph, or a non-standard character. [5. Characters, Glyphs, and Writing Modes] points to a description of the character or glyph intended. (character declarations) provides information about nonstandard characters and glyphs. [5.2. Markup Constructs for Representation of Characters and Glyphs] (character) provides descriptive information about a character. [5.2. Markup Constructs for Representation of Characters and Glyphs] (character glyph) provides descriptive information about a character glyph. [5.2. Markup Constructs for Representation of Characters and Glyphs] (locally defined property) provides a locally defined character (or glyph) property. [5.2.1. Character Properties] (character mapping) contains one or more characters which are related to the parent character or glyph in some respect, as specified by the @type attribute. [5.2. Markup Constructs for Representation of Characters and Glyphs] (unihan property) holds the name and value of a normative or informative Unihan character (or glyph) property as part of its attributes. [5.2.1. Character Properties] specifies the normalized name of a unicode han database (Unihan) property kZVariant kAccountingNumeric kBigFive kCCCII kCNS1986 kCNS1992 kCangjie kCantonese kCheungBauer kCheungBauerIndex kCihaiT kCompatibilityVariant kCowles kDaeJaweon kDefinition kEACC kFenn kFennIndex kFourCornerCode kFrequency kGB0 kGB1 kGB3 kGB5 kGB7 kGB8 kGSR kGradeLevel kHDZRadBreak kHKGlyph kHKSCS kHanYu kHangul kHanyuPinlu kHanyuPinyin kIBMJapan kIICore kIRGDaeJaweon kIRGDaiKanwaZiten kIRGHanyuDaZidian kIRGKangXi kIRG_GSource kIRG_HSource kIRG_JSource kIRG_KPSource kIRG_KSource kIRG_MSource kIRG_TSource kIRG_USource kIRG_VSource kJIS0213 kJa kJapaneseKun kJapaneseOn kJinmeiyoKanji kJis0 kJis1 kJoyoKanji kKPS0 kKPS1 kKSC0 kKSC1 kKangXi kKarlgren kKorean kKoreanEducationHanja kKoreanName kLau kMainlandTelegraph kMandarin kMatthews kMeyerWempe kMorohashi kNelson kOtherNumeric kPhonetic kPrimaryNumeric kPseudoGB1 kRSAdobe_Japan1_6 kRSJapanese kRSKanWa kRSKangXi kRSKorean kRSUnicode kSBGY kSemanticVariant kSimplifiedVariant kSpecializedSemanticVariant kTGH kTaiwanTelegraph kTang kTotalStrokes kTraditionalVariant kVietnamese kXHC1983 kXerox specifies the value of a named Unihan property [^\p{C}\p{Z}]+ (unicode property) provides a Unicode property for a character (or glyph). [5.2.1. Character Properties] specifies the normalized name of a Unicode property. Age AHex Alpha Alphabetic ASCII_Hex_Digit bc Bidi_C Bidi_Class Bidi_Control Bidi_M Bidi_Mirrored Bidi_Mirroring_Glyph Bidi_Paired_Bracket Bidi_Paired_Bracket_Type blk Block bmg bpb bpt Canonical_Combining_Class Case_Folding Case_Ignorable Cased ccc CE cf Changes_When_Casefolded Changes_When_Casemapped Changes_When_Lowercased Changes_When_NFKC_Casefolded Changes_When_Titlecased Changes_When_Uppercased CI Comp_Ex Composition_Exclusion CWCF CWCM CWKCF CWL CWT CWU Dash Decomposition_Mapping Decomposition_Type Default_Ignorable_Code_Point Dep Deprecated DI Dia Diacritic dm dt ea East_Asian_Width EqUIdeo Equivalent_Unified_Ideograph Expands_On_NFC Expands_On_NFD Expands_On_NFKC Expands_On_NFKD Ext Extender FC_NFKC FC_NFKC_Closure Full_Composition_Exclusion gc GCB General_Category Gr_Base Gr_Ext Gr_Link Grapheme_Base Grapheme_Cluster_Break Grapheme_Extend Grapheme_Link Hangul_Syllable_Type Hex Hex_Digit hst Hyphen ID_Continue ID_Start IDC Ideo Ideographic IDS IDS_Binary_Operator IDS_Trinary_Operator IDSB IDST Indic_Positional_Category Indic_Syllabic_Category InPC InSC isc ISO_Comment Jamo_Short_Name jg Join_C Join_Control Joining_Group Joining_Type JSN jt kAccountingNumeric kCompatibilityVariant kIICore kIRG_GSource kIRG_HSource kIRG_JSource kIRG_KPSource kIRG_KSource kIRG_MSource kIRG_TSource kIRG_USource kIRG_VSource kOtherNumeric kPrimaryNumeric kRSUnicode lb lc Line_Break LOE Logical_Order_Exception Lower Lowercase Lowercase_Mapping Math na na1 Name Name_Alias NChar NFC_QC NFC_Quick_Check NFD_QC NFD_Quick_Check NFKC_Casefold NFKC_CF NFKC_QC NFKC_Quick_Check NFKD_QC NFKD_Quick_Check Noncharacter_Code_Point nt Numeric_Type Numeric_Value nv OAlpha ODI OGr_Ext OIDC OIDS OLower OMath Other_Alphabetic Other_Default_Ignorable_Code_Point Other_Grapheme_Extend Other_ID_Continue Other_ID_Start Other_Lowercase Other_Math Other_Uppercase OUpper Pat_Syn Pat_WS Pattern_Syntax Pattern_White_Space PCM Prepended_Concatenation_Mark QMark Quotation_Mark Radical Regional_Indicator RI SB sc scf Script Script_Extensions scx SD Sentence_Break Sentence_Terminal Simple_Case_Folding Simple_Lowercase_Mapping Simple_Titlecase_Mapping Simple_Uppercase_Mapping slc Soft_Dotted stc STerm suc tc Term Terminal_Punctuation Titlecase_Mapping uc UIdeo Unicode_1_Name Unified_Ideograph Upper Uppercase Uppercase_Mapping Variation_Selector Vertical_Orientation vo VS WB White_Space Word_Break WSpace XID_Continue XID_Start XIDC XIDS XO_NFC XO_NFD XO_NFKC XO_NFKD specifies the value of a named Unicode property. contains a single value for some property, attribute, or other analysis. [5.2.1. Character Properties] (Unicode property name) contains the name of a registered Unicode normative or informative property. [5.2.1. Character Properties] specifies the version number of the Unicode Standard in which this property name is defined. [\d]+(\.[\d]+){0,2} (locally-defined property name) contains a locally defined name for some property. [5.2.1. Character Properties] (character glyph name) contains the name of a glyph, expressed following Unicode conventions for character names. [5.2. Markup Constructs for Representation of Characters and Glyphs] (character property) provides a name and value for some property of the parent character or glyph. [5.2. Markup Constructs for Representation of Characters and Glyphs] (character name) contains the name of a character, expressed following Unicode conventions. [5.2. Markup Constructs for Representation of Characters and Glyphs]