Editing Holcot: Laying the Groundwork for the Future Use and Reuse of Holcot’s Corpus


Jeffrey C. Witt (Loyola University Maryland) | @jeffreycwitt


January 5, 2018

Paris, France

Slide Deck: http://lombardpress.org/slides/2018-01-05-holcot-use-and-reuse

## Outline

1. Introduction
2. Uses
    1. Dynamic Presentations
    2. Corpus Searches
    3. Network Analysis
    4. Topic Modeling
3. Concrete Proposal

# I. Introduction
#### Volume 2. Used Price between $186.00 and $224.00

![Rimini 1](https://s3.amazonaws.com/lum-faculty-jcwitt-public/Rimini-Book-Prices1.png)

#### Volume 1. Price $433.00

![Rimini 1](https://s3.amazonaws.com/lum-faculty-jcwitt-public/Gregorii_Ariminensis_vol1price.png)

#### 57 pages from Volume 1. Price $42.00

![Rimini 2](https://s3.amazonaws.com/lum-faculty-jcwitt-public/rimini_prologe_q1_price.png)

#### Reuse of 57 pages for 10 students in a classroom. Price $213.75

![Rimini 3](https://s3.amazonaws.com/lum-faculty-jcwitt-public/rimini_prologueq1_coursepack_cost.png)

#### The comparative ease of finding the 1522 Edition

[https://books.google.com/books?id=NbEscE0n_44C](https://books.google.com/books?id=NbEscE0n_44C)

![Rimini 3](https://s3.amazonaws.com/lum-faculty-jcwitt-public/Rimini-1522.png)

# II. Uses and Re-Uses
# 1. Dynamic Presentation
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/Holcot-diplomatic-list.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/Balliol-q16.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/balliol-with-collation.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/balliol-with-image.png)
[http://mirador.scta.info/?resourceid=holcotquodlibeta](http://mirador.scta.info/?resourceid=holcotquodlibeta)

![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/Mirador_Viewer.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/Mirador_Viewer2.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/Holcot-Pdfs.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/print-text-example.png)
# 2. Corpus Searches
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/HolcotCorpusTextSearch.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/HolcotQuestionSearch.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/Holcot-Mirador-Search.png)
# 3. Network Analysis
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/bonaventure_scholion.png)
#### Rimini 1981 Volume 1 Apparatus Fontium

![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/rimini-apparatus.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/dead-end-and-networked-connections1.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/dead-end-and-networked-connections2.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/dead-end-and-networked-connections3.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/dead-end-and-networked-connections4.png)

Vargas Example

Holcot Example

![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/dead-end-and-networked-connections5.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/dead-end-and-networked-connections6.png)
![connections](https://s3.amazonaws.com/lum-faculty-jcwitt-public/quotation-explorer.png)
# 4. Topic Modeling
![topic-modelling](https://s3.amazonaws.com/lum-faculty-jcwitt-public/topic_modelling1.png)
![topic-modelling](https://s3.amazonaws.com/lum-faculty-jcwitt-public/topic_modelling2.png)
# III. Concrete Proposal

### How can we publish our editorial work as data?

[http://scta.info](http://scta.info)

![scta.info](https://s3.amazonaws.com/lum-faculty-jcwitt-public/SCTA-cover.png)

Concretely, to make this work, we would need:

* A transcription of an individual witness (ideally a transcription of each witness) that records:
    * At a minimum:
        * the text as it appears in the witness in question
        * paragraph breaks that mirror paragraph breaks used in the critical text
        * quotations that mirror quotations identified in the critical text
    * Ideally also:
        * line breaks in the manuscript
        * original spellings
        * corrections such as additions and deletions

While the SCTA has established a long list of guidelines for recording these details in TEI XML ([see the LombardPress Schema](http://lombardpress.org/schema/docs)), this need not be a hurdle for those unfamiliar with these technologies.

We can accept transcriptions in Microsoft Word documents that record the most essential information as follows:

* paragraph breaks with a double hard return (two new line breaks)
* quotations with quotation marks
* line breaks with a single hard return (one new line break)

If these basic rules are followed, we can automate most of the conversion process, and further details about the witness (corrections, marginal notes, etc.) can be added later.
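
To make the three rules concrete, here is a minimal Python sketch of how such a conversion might be automated. It is an illustration only, not the SCTA's actual conversion pipeline. It assumes the Word document has already been saved as plain text, and the function name `transcription_to_tei` and the choice of the TEI elements `<p>`, `<lb/>`, and `<quote>` are assumptions made for this example; the LombardPress Schema may prescribe additional attributes or structure.

```python
import re
from xml.sax.saxutils import escape

# Matches a span enclosed in straight or curly double quotation marks.
QUOTE_RE = re.compile(r'["\u201c]([^"\u201c\u201d]*)["\u201d]')


def transcription_to_tei(text: str) -> str:
    """Turn a plain-text transcription into a sequence of TEI-like <p> elements.

    Hypothetical helper applying the three rules:
      * double hard return  -> paragraph break (<p>)
      * single hard return  -> manuscript line break (<lb/>)
      * quotation marks     -> quotation (<quote>)
    """
    # Normalize Windows line endings before splitting.
    text = text.replace("\r\n", "\n").strip()
    paragraphs = re.split(r"\n\s*\n", text)

    result = []
    for para in paragraphs:
        lines = [ln.strip() for ln in para.split("\n") if ln.strip()]
        # Escape &, <, > in the text, and mark the start of each line with <lb/>.
        body = "".join("<lb/>" + escape(ln) for ln in lines)
        # Wrap quoted spans; a quotation may run across a line break.
        body = QUOTE_RE.sub(r"<quote>\1</quote>", body)
        result.append(f"<p>{body}</p>")
    return "\n".join(result)


if __name__ == "__main__":
    # Made-up sample text, used only to exercise the rules.
    sample = 'Utrum deus sit.\nDicit "quod\nsic" Augustinus.\n\nContra & ultra.'
    print(transcription_to_tei(sample))
```

Because the quotation pattern is applied after the lines of a paragraph are joined, a quotation that continues over a manuscript line break still ends up inside a single `<quote>` element. Details such as corrections and marginal notes are deliberately left out, since the rules above defer them to a later pass.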