Introduction

These are the encoding and technical documentation and guidelines for the Winnifred Eaton Archive project. This site is meant to serve as both a record of how the project encoded the texts as well as instructions for building and deploying the final site.
<listPerson> (list of persons) contains a list of descriptions, each of which provides information about an identifiable person or a group of people, for example the participants in a language interaction, or the people referred to in a historical source.
<person> provides information about an identifiable individual, for example a participant in a language interaction, or a person referred to in a historical source.
<persName> (personal name) contains a proper noun or proper-noun phrase referring to a person, possibly including one or more of the person's forenames, surnames, honorifics, added names, etc.
<forename> contains a forename, given or baptismal name.
<surname> contains a family (inherited) name, as opposed to a given, baptismal, or nick name.
<reg> (regularization) contains a reading which has been regularized or normalized in some sense.
<note> contains a note or annotation.
<listBibl> (citation list) contains a list of bibliographic citations of any kind.
<bibl> (bibliographic citation) contains a loosely-structured bibliographic citation of which the sub-components may or may not be explicitly tagged.
<head> (heading) contains any type of heading, for example the title of a section, or the heading of a list, glossary, manuscript description, etc.
<author> in a bibliographic reference, contains the name(s) of an author, personal or corporate, of a work; for example in the same form as that provided by a recognized bibliographic name authority.
<name> (name, proper noun) contains a proper noun or noun phrase.
<rs> (referencing string) contains a general purpose name or referring string.
<title> contains a title for any kind of work.
<publisher> provides the name of the organization responsible for the publication or distribution of a bibliographic item.
<biblScope> (scope of bibliographic reference) defines the scope of a bibliographic reference, for example as a list of page numbers, or a named subdivision of a larger work.
<date> contains a date in any format.
<distributor> supplies the name of a person or other agency responsible for the distribution of a text.
<idno> (identifier) supplies any form of identifier used to identify some object, such as a bibliographic item, a person, a title, an organization, etc. in a standardized way.
<body> (text body) contains the whole body of a single unitary text, excluding any front or back matter.
<div> (text division) contains a subdivision of the front, body, or back of a text.
<text> contains a single text of any kind, whether unitary or composite, for example a poem or drama, a collection of essays, a novel, a dictionary, or a corpus sample.
<opener> groups together dateline, byline, salutation, and similar phrases appearing as a preliminary group at the start of a division, especially of a letter.
<closer> groups together salutations, datelines, and similar phrases appearing as a final group at the end of a division, especially of a letter.
<byline> contains the primary statement of responsibility given for a work on its title page or at the head or end of the work.
<gap> indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible.
<supplied> signifies text supplied by the transcriber or editor for any reason; for example because the original cannot be read due to physical damage, or because of an obvious omission by the author or scribe.
<choice> groups a number of alternative encodings for the same point in a text.
<sic> (Latin for thus or so) contains text reproduced although apparently incorrect or inaccurate.
<corr> (correction) contains the correct form of a passage apparently erroneous in the copy text.
<abbr> (abbreviation) contains an abbreviation of any sort.
<expan> (expansion) contains the expansion of an abbreviation.
<q> (quoted) contains material which is distinguished from the surrounding text using quotation marks or a similar method, for any one of a variety of reasons including, but not limited to: direct speech or thought, technical terms or jargon, authorial distance, quotations from elsewhere, and passages that are mentioned but not used.
<p> (paragraph) marks paragraphs in prose.
<lg> (line group) contains one or more verse lines functioning as a formal unit, e.g. a stanza, refrain, verse paragraph, etc.
<floatingText> contains a single text of any kind, whether unitary or composite, which interrupts the text containing it at any point and after which the surrounding text resumes.
<salute> (salutation) contains a salutation or greeting prefixed to a foreword, dedicatory epistle, or other division of a text, or the salutation in the closing of a letter, preface, etc.
<signed> (signature) contains the closing salutation, etc., appended to a foreword, dedicatory epistle, or other division of a text.
<ptr> (pointer) defines a pointer to another location.
<list> contains any sequence of items organized as a list.
<item> contains one component of a list.
<abstract> contains a summary or formal abstract prefixed to an existing source document by the encoder.
<catRef> (category reference) specifies one or more defined categories within some taxonomy or text typology.
<profileDesc> (text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting.
<taxonomy> defines a typology either implicitly, by means of a bibliographic citation, or explicitly by a structured taxonomy.
<category> contains an individual descriptive category, possibly nested within a superordinate category, within a user-defined taxonomy.
<catDesc> (category description) describes some category within a taxonomy or text typology, either in the form of a brief prose description or in terms of the situational parameters used by the TEI formal textDesc.
<term> contains a single-word, multi-word, or symbolic designation which is regarded as a technical term.
<listEvent> (list of events) contains a list of descriptions, each of which provides information about an identifiable event.
<event> contains data relating to any kind of significant event associated with a person, place, or organization.
<ab> (anonymous block) contains any arbitrary component-level unit of text, acting as an anonymous container for phrase or inter level elements analogous to, but without the semantic baggage of, a paragraph.
<graphic> indicates the location of a graphic or illustration, either forming part of a text, or providing an image of it.
<desc> (description) contains a brief description of the object documented by its parent element, typically a documentation element or an entity.
<ref> (reference) defines a reference to another location, possibly modified by additional text or comment.
<TEI> (TEI document) contains a single TEI-conformant document, combining a single TEI header with one or more members of the model.resource class. Multiple elements may be combined within a (or teiCorpus) element.
att.global provides attributes common to all elements in the TEI encoding scheme.
att.global.rendition provides rendering attributes common to all elements in the TEI encoding scheme.
att.global.linking provides a set of attributes for hypertextual linking.
att.global.facs provides an attribute used to express correspondence between an element containing transcribed text and all or part of an image representing that text.
att.global.change supplies the change attribute, allowing its member elements to specify one or more states or revision campaigns with which they are associated.
att.global.responsibility provides attributes indicating the agent responsible for some aspect of the text, the markup or something asserted by the markup, and the degree of certainty associated with it.
att.global.source provides an attribute used by elements to point to an external source.
att.typed provides attributes which can be used to classify or subclassify elements in any way.
teidata.version defines the range of attribute values which may be used to specify a TEI or Unicode version number.
<teiHeader> (TEI header) supplies descriptive and declarative metadata associated with a digital resource or set of resources.
att.fragmentable provides an attribute for representing fragmentation of a structural element, typically as a consequence of some overlapping hierarchy.
model.pLike groups paragraph-like elements.
<quote> (quotation) contains a phrase or passage attributed by the narrator or author to some agency external to the text.
<sp> (speech) contains an individual speech in a performance text, or a passage presented as such in a prose or verse text.
<cell> contains one cell of a table.
<availability> supplies information about the availability of a text, for example any restrictions on its use or distribution, its copyright status, any licence applying to it, etc.
<change> documents a change or set of changes made during the production of a source document, or during the revision of an electronic file.
<correction> (correction principles) states how and under what circumstances corrections have been made in the text.
<editorialDecl> (editorial practice declaration) provides details of editorial principles and practices applied during the encoding of a text.
<encodingDesc> (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived.
<hyphenation> summarizes the way in which hyphenation in a source text has been treated in an encoded version of it.
<interpretation> describes the scope of any analytic or interpretive information added to the text in addition to the transcription.
<langUsage> (language usage) describes the languages, sublanguages, registers, dialects, etc. represented within a text.
<licence> contains information about a licence or other legal agreement applicable to the text.
<normalization> indicates the extent of normalization or regularization of the original source carried out in converting it to electronic form.
<prefixDef> (prefix definition) defines a prefixing scheme used in data.pointer values, showing how abbreviated URIs using the scheme may be expanded into full URIs.
<projectDesc> (project description) describes in detail the aim or purpose for which an electronic file was encoded, together with any other relevant information concerning the process by which it was assembled or collected.
<publicationStmt> (publication statement) groups information concerning the publication or distribution of an electronic or other text.
<punctuation> specifies editorial practice adopted with respect to punctuation marks in the original.
<quotation> specifies editorial practice adopted with respect to quotation marks in the original.
<seriesStmt> (series statement) groups information about the series, if any, to which a publication belongs.
<listRelation> provides information about relationships identified amongst people, places, and organizations, either informally as prose or as formally expressed relation links.
<occupation> contains an informal description of a person's trade, profession or occupation.
<org> (organization) provides information about an identifiable organization such as a business, a tribe, or any other grouping of people.
<place> contains data about a geographic location
<back> (back matter) contains any appendixes, etc. following the main part of a text.
<epigraph> contains a quotation, anonymous or attributed, appearing at the start or end of a section or on a title page.
<front> (front matter) contains any prefatory matter (headers, abstracts, title page, prefaces, dedications, etc.) found at the start of a document, before the main body.
<pc> (punctuation character) contains a character or string of characters regarded as constituting a single punctuation mark.
<address> contains a postal address, for example of a publisher, an organization, or an individual.
<cit> (cited quotation) contains a quotation from some other document, together with a bibliographic reference to its source. In a dictionary it may contain an example text with at least one occurrence of the word form, used in the sense being described, or a translation of the headword, or an example.
<emph> (emphasized) marks words or phrases which are stressed or emphasized for linguistic or rhetorical effect.
<foreign> identifies a word or phrase as belonging to some language other than that of the surrounding text.
<hi> (highlighted) marks a word or phrase as graphically distinct from the surrounding text, for reasons concerning which no claim is made.
<l> (verse line) contains a single, possibly incomplete, line of verse.
<label> contains any label or heading used to identify part of a text, typically but not exclusively in a list or glossary.
<lb> (line beginning) marks the beginning of a new (typographic) line in some edition or version of a text.
<milestone> marks a boundary point separating any kind of section of a text, typically but not necessarily indicating a point at which some part of a standard reference system changes, where the change is not represented by a structural element.
<num> (number) contains a number, written in any form.
<pb> (page beginning) marks the beginning of a new page in a paginated document.
<unclear> contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source.
<figure> groups elements representing or containing graphic information such as an illustration, formula, or figure.
<table> contains text displayed in tabular form, in rows and columns.
<biblFull> (fully-structured bibliographic citation) contains a fully-structured bibliographic citation, in which all components of the TEI file description are present.
<anchor> (anchor point) attaches an identifier to a point within a text, whether or not it corresponds with a textual element.
<seg> (arbitrary segment) represents any segmentation of text below the chunk level.
<listOrg> (list of organizations) contains a list of elements, each of which provides information about an identifiable organization.
<listPlace> (list of places) contains a list of places, optionally followed by a list of relationships (other than containment) defined amongst them.
<location> defines the location of a place as a set of geographical coordinates, in terms of other named geo-political entities, or as an address.
<orgName> (organization name) contains an organizational name.
<placeName> contains an absolute or relative place name.
<att> (attribute) contains the name of an attribute appearing within running text.
<code> contains literal code from some formal language such as a programming language.
<gi> (element name) contains the name (generic identifier) of an element.
<ident> (identifier) contains an identifier or name for an object of some kind in a formal language. is used for tokens such as variable names, class names, type names, function names etc. in formal programming languages.
<val> (value) contains a single attribute value.
<fw> (forme work) contains a running head (e.g. a header, footer), catchword, or similar material appearing on the current page.
teidata.enumerated defines the range of attribute values expressed as a single XML name taken from a list of documented possibilities.
model.pPart.editorial groups phrase-level elements for simple editorial interventions that may be useful both in transcribing and in authoring.
<addrLine> (address line) contains one line of a postal address.
<editor> contains a secondary statement of responsibility for a bibliographic item, for example the name of an individual, institution or organization, (or of several such) acting as editor, compiler, translator, etc.
<pubPlace> (publication place) contains the name of the place where a bibliographic item was published.
<resp> (responsibility) contains a phrase describing the nature of a person's intellectual responsibility, or an organization's role in the production or distribution of a work.
<speaker> contains a specialized form of heading or label, giving the name of one or more speakers in a dramatic text or fragment.
<figDesc> (description of figure) contains a brief prose description of the appearance or content of a graphic figure, for use when documenting an image without displaying it.
<authority> (release authority) supplies the name of a person or other agency responsible for making a work available, other than a publisher or distributor.
<creation> contains information about the creation of a text.
<edition> describes the particularities of one edition of a text.
<extent> describes the approximate size of a text stored on some carrier medium or of some other object, digital or non-digital, specified in any convenient units.
<language> characterizes a single language or sublanguage used within a text.
<rendition> supplies information about the rendition or appearance of one or more elements in the source text.
<collection> contains the name of a collection of manuscripts or other objects, not necessarily located within a single repository.
<institution> contains the name of an organization such as a university or library, with which a manuscript or other object is identified, generally its holding institution.
<repository> contains the name of a repository within which manuscripts or other objects are stored, possibly forming part of an institution.
<birth> contains information about a person's birth, such as its date and place.
<death> contains information about a person's death, such as its date and place.
<nationality> contains an informal description of a person's present or past nationality or citizenship.
<dateline> contains a brief description of the place, date, time, etc. of production of a letter, newspaper story, or other work, prefixed or suffixed to it as a kind of heading or trailer.
<docAuthor> (document author) contains the name of the author of the document, as given on the title page (often but not always contained in a byline).
<docDate> (document date) contains the date of a document, as given on a title page or in a dateline.
<titlePart> contains a subsection or division of the title of a work, as indicated on a title page.
<trailer> contains a closing title or footer appearing at the end of a division of a text.
teidata.pointer defines the range of attribute values used to provide a single URI, absolute or relative, pointing to some other resource, either within the current document or elsewhere.
<respStmt> (statement of responsibility) supplies a statement of responsibility for the intellectual content of a text, edition, recording, or series, where the specialized elements for authors, editors, etc. do not suffice or do not apply. May also be used to encode information about individuals or organizations which have played a role in the production or distribution of a bibliographic work.
model.addrPart groups elements such as names or postal codes which may appear as part of a postal address.
model.addressLike groups elements used to represent a postal or email address.
model.publicationStmtPart.detail groups the agency-specific child elements of the element of the TEI header.
model.milestoneLike groups milestone-style elements used to represent reference systems.
<docTitle> (document title) contains the title of a document, including all its constituents, as given on a title page.
<titlePage> (title page) contains the title page of a text, appearing within the front or back matter.
model.phrase.xml groups phrase-level elements used to encode XML constructs such as element names, attribute names, and attribute values
model.respLike groups elements which are used to indicate intellectual or other significant responsibility, for example within a bibliographic element.
<titleStmt> (title statement) groups information about the title of a work and those responsible for its content.
att.canonical provides attributes which can be used to associate a representation such as a name or title with canonical information about the object being named or referenced.
model.publicationStmtPart.agency groups the child elements of a element of the TEI header that indicate an authorising agent.
model.biblPart groups elements which represent components of a bibliographic description.
<divGen> (automatically generated text division) indicates the location at which a textual division generated automatically by a text-processing application is to appear.
att.docStatus provides attributes for use on metadata elements describing the status of a document.
model.biblLike groups elements containing a bibliographic description.
<relatedItem> contains or references some other bibliographic item which is related to the present one in some specified manner, for example as a constituent or alternative version of it.
<sourceDesc> (source description) describes the source(s) from which an electronic text was derived or generated, typically a bibliographic description in the case of a digitized text, or a phrase such as "born digital" for a text which has no previous existence.
<msIdentifier> (manuscript identifier) contains the information required to identify the manuscript or similar object being described.
<fileDesc> (file description) contains a full bibliographic description of an electronic file.
<notesStmt> (notes statement) collects together any notes providing information about a text additional to that recorded in other parts of the bibliographic description.
model.imprintPart groups the bibliographic elements which occur inside imprints.
att.datable provides attributes for normalization of elements that contain dates, times, or datable events.
att.datable.w3c provides attributes for normalization of elements that contain datable events conforming to the W3C XML Schema Part 2: Datatypes Second Edition.
att.datable.iso provides attributes for normalization of elements that contain datable events using the ISO 8601 standard.
att.datable.custom provides attributes for normalization of elements that contain datable events to a custom dating system (i.e. other than the Gregorian used by W3 and ISO).
att.dimensions provides attributes for describing the size of physical objects.
model.personPart groups elements which form part of the description of a person.
model.divWrapper groups elements which can appear at either top or bottom of a textual division.
model.pLike.front groups paragraph-like elements which can occur as direct constituents of front matter.
model.titlepagePart groups elements which can occur as direct constituents of a title page, such as , , docImprint, or .
att.pointing provides a set of attributes used by all elements which point to other elements by means of one or more URI references.
<textClass> (text classification) groups information which describes the nature or topic of a text in terms of a standard classification scheme, thesaurus, etc.
att.tableDecoration provides attributes used to decorate rows or cells of a table.
<row> contains one row of a table.
att.ascribed provides attributes for elements representing speech or action that can be ascribed to a specific individual.
<listChange> groups a number of change descriptions associated with either the creation of a source text or the revision of an encoded text.
<revisionDesc> (revision description) summarizes the revision history for a file.
model.quoteLike groups elements used to directly contain quotations.
model.encodingDescPart groups elements which may be used inside and appear multiple times.
model.divBottomPart groups elements which can occur only at the end of a text division.
teidata.word defines the range of attribute values expressed as a single word or token.
model.emphLike groups phrase-level elements which are typographically distinct and to which a specific function can be attributed.
model.pPart.transcriptional groups phrase-level elements used for editorial transcription of pre-existing source materials.
model.editorialDeclPart groups elements which may be used inside and appear multiple times.
model.dateLike groups elements containing temporal expressions.
model.descLike groups elements which contain a description of their function.
model.labelLike groups elements used to gloss or explain other parts of a document.
<listPrefixDef> (list of prefix definitions) contains a list of definitions of prefixing schemes used in data.pointer values, showing how abbreviated URIs using each scheme may be expanded into full URIs.
model.divLike groups elements used to represent un-numbered generic structural divisions.
model.divGenLike groups elements used to represent a structural division which is generated rather than explicitly present in the source.
model.frontPart groups elements which appear at the level of divisions within front or back matter.
teidata.temporal.w3c defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the W3C XML Schema Part 2: Datatypes Second Edition specification.
<egXML> (example of XML) a single XML fragment demonstrating the use of some XML, such as elements, attributes, or processing instructions, etc., in which the element functions as the root element.
model.egLike groups elements containing examples or illustrations.
<classDecl> (classification declarations) contains one or more taxonomies defining any classificatory codes used elsewhere in the text.
model.eventLike groups elements which describe events.
att.placement provides attributes for describing where on the source page or object a textual element appears.
model.global groups elements which may appear at any point within a TEI text.
model.qLike groups elements related to highlighting which can appear either within or between chunk-level elements.
att.personal (attributes for components of names usually, but not necessarily, personal names) common attributes for those elements which form part of a name usually, but not necessarily, a personal name.
model.persNamePart groups elements which form part of a personal name.
model.global.edit groups globally available elements which perform a specifically editorial function.
att.media provides attributes for specifying display and related properties of external media.
att.internetMedia provides attributes for specifying the type of a computer resource using a standard taxonomy.
att.resourced provides attributes by which a resource (such as an externally held media file) may be located.
model.graphicLike groups elements containing images, formulae, and similar objects.
model.headLike groups elements used to provide a title or heading at the start of a text division.
model.hiLike groups phrase-level elements which are typographically distinct but to which no specific function can be attributed.
model.nameLike groups elements which name or refer to a person, place, or organization.
model.lLike groups elements representing metrical components such as verse lines.
teidata.language defines the range of attribute values used to identify a particular combination of human language and writing system.
att.breaking provides an attribute to indicate whether or not the element concerned is considered to mark the end of an orthographic token in the same way as whitespace.
macro.paraContent (paragraph content) defines the content of paragraphs and similar elements.
model.divPart groups paragraph-level elements appearing directly within divisions.
model.availabilityPart groups elements such as licences and paragraphs of text which may appear as part of an availability statement
model.listLike groups list-like elements.
teidata.truthValue defines the range of attribute values used to express a truth value.
model.orgPart groups elements which form part of the description of an organization.
model.placeStateLike groups elements which describe changing states of a place.
att.milestoneUnit provides an attribute to indicate the type of section which is changing at a specific milestone.
teidata.text defines the range of attribute values used to express some kind of identifying string as a single sequence of Unicode characters possibly including whitespace.
model.nameLike.agent groups elements which contain names of individuals or corporate bodies.
model.persStateLike groups elements describing changeable characteristics of a person which have a definite duration, for example occupation, residence, or name.
model.noteLike groups globally-available note-like elements.
teidata.numeric defines the range of attribute values used for numeric values.
model.measureLike groups elements which denote a number, a quantity, a measurement, or similar piece of text that conveys some numerical meaning.
model.divTopPart groups elements which can occur only at the beginning of a text division.
model.personLike groups elements which provide information about people and their relationships.
att.linguistic provides a set of attributes concerning linguistic features of tokens, for usage within token-level elements, specifically w and in the analysis module.
model.segLike groups elements used for arbitrary segmentation.
teidata.sex defines the range of attribute values used to identify human or animal sex.
model.placeLike groups elements used to provide information about places and their relationships.
model.placeNamePart groups elements which form part of a place name.
att.patternReplacement provides attributes for regular-expression matching and replacement.
teidata.prefix defines a range of values that may function as a URI scheme name.
model.ptrLike groups elements used for purposes of location and reference.
att.ascribed.directed provides attributes for elements representing speech or action that can be directed at a group or individual.
att.msExcerpt (manuscript excerpt) provides attributes used to describe excerpts from a manuscript placed in a description thereof.
att.notated provides an attribute to indicate any specialised notation used for element content.
att.styleDef provides attributes to specify the name of a formal definition language used to provide formatting or rendition information.
teidata.count defines the range of attribute values used for a non-negative integer value used as a count.
model.resource groups separate elements which constitute the content of a digital resource, as opposed to its metadata.
model.pPart.data groups phrase-level elements containing names, dates, numbers, measures, and similar data.
model.inter groups elements which can appear either within or between paragraph-like elements.
model.divBottom groups elements appearing at the end of a text division.
macro.specialPara ('special' paragraph content) defines the content model of elements such as notes or list items, which either contain a series of component-level elements or else have the same structure as a paragraph, containing a series of phrase-level and inter-level elements.
model.common groups common chunk- and inter-level elements.
model.divTop groups elements appearing at the beginning of a text division.
model.highlighted groups phrase-level elements which are typographically distinct.
model.limitedPhrase groups phrase-level elements excluding those elements primarily intended for transcription of existing sources.
macro.phraseSeq (phrase sequence) defines a sequence of character data and phrase-level elements.
macro.phraseSeq.limited (limited phrase sequence) defines a sequence of character data and those phrase-level elements that are not typically used for transcribing extant documents.
model.phrase groups elements which can occur at the level of individual words or phrases.
macro.limitedContent (paragraph content) defines the content of prose elements that are not used for transcription of extant materials.
model.pPart.edit groups phrase-level elements for simple editorial correction and transcription.
teidata.temporal.iso defines the range of attribute values expressing a temporal expression such as a date, a time, or a combination of them, that conform to the international standard Data elements and interchange formats – Information interchange – Representation of dates and times.
teidata.certainty defines the range of attribute values expressing a degree of certainty.
teidata.probCert defines a range of attribute values which can be expressed either as a numeric probability or as a coded certainty value.
teidata.outputMeasurement defines a range of values for use in specifying the size of an object that is intended for display.
teidata.xTruthValue (extended truth value) defines the range of attribute values used to express a truth value which may be unknown.
teidata.pattern defines attribute values which are expressed as a regular expression.
teidata.replacement defines attribute values which contain a replacement template.
teidata.versionNumber defines the range of attribute values used for version numbers.