Jeffrey C. Witt (Loyola University Maryland) | @jeffreycwitt
Harvard University, September 29, 2017
Slide Deck: http://jeffreycwitt.com/slides/2017-0=9-29-intro-to-semantic-editing
XML semantically identifies pieces of data using opening and closing tags which are enclosed with angle brackets.
wrong
correct
Tags must be either siblings of another set of tags or children of a set of tags.
In addition to containing text and other elements, elements can take on "attributes".
The result of following these rules is a document whose content is nicely organized into a tree structure.
This means that XML actually does not specify any set of tags or element names. Anyone can make up their own set of elements and use them however they like.
It allows different industries and fields to create tags that meet their needs and their data.
If everyone can just make up their own tags, we can create confusion about what different tags mean and the datatypes they are encoding.
Someone might choose to tag something as <paragraph> and another person might choose to tag something as <para> and a third person might choose <p>.
How do we avoid this confusion?
Enter TEI
TEI is designed to privilege the descriptive markup of a text over presentational markup
Explore the TEI Guidelines http://www.tei-c.org/release/doc/tei-p5-doc/en/html/index.html
Question: What element (or elements) should I use to identify the manuscript I'm transcribing?
Question: What element should I use to encode a new page, a new column, or a new line in a manuscript?
Question: What element (or elements) should I use to record a scribal correction?
See: http://teibyexample.org/modules/TBED01v00.htm?target=generalstructure#generalstructure
See: http://teibyexample.org/modules/TBED02v00.htm
See: http://teibyexample.org/modules/TBED03v00.htm?target=structure#structure