Dr James Cummings
James.Cummings@newcastle.ac.uk
@jamescummings
CC+BY (press space to cycle through slides)
Thesis: TEI is the right format for scholarly digital editions, but we still lack a de facto solution for collaborative creation, annotation, publication and analysis which understands why editors make particular decisions
Think about the uses for an italic font in any form of printed publication. Why might an author/publisher put some text into italics? What are they signalling about that text?
We can usually tell these types of things apart from context. If we want to use these categories, computers need to be told these things are different.
Some common uses include:
<element> Text </element>
<element attribute="value">
Text or child elements here
</element>
<element attribute="value"/>
"Opening Tag"
"Closing Tag"
"Empty Element"
Usually Encoded as TEI XML
(But I don't need everything the TEI provides, or want something it doesn't give me)
Possibilities of the
TEI Framework
Project B
Project A
New Elements
- @louburnard
(What kinds of texts is the TEI good for?)
The TEI takes a generalistic approach to overall text structure and this means it should be able to cope with texts of
any size, any language, any date, any complexity, or any writing system.
This document could be in any form (books, journals, manuscripts, postcards, letters, rolls of papyrus, clay tablets, web pages, gravestones, etc.) and contain any type of text (plays, prose, poems, dictionaries, linguistic corpora, letters, emails, metadata, etc.).
Holinshed's Chronicles: columns, marginal notes, woodcuts
Medieval Drama: (York Cycle, Noah & Flood): Dramatic texts, speeches, rhyme schemes, editorial corrections, etc.
Medieval Manuscripts: full description, translations, stylistic rendering, variants, critical apparatus, editorial commentary, etc.
First Folio:
forme-work, catchwords, decorative initials, etc.
Wilfred Owen: manuscripts, corrections, multiple versions
Wilfred Owen: Letters, codewords
George Herbert: Graphic text layout, poetry
William Godwin's Diary: diary structure, abbreviated texts
Modern Manuscripts: Genetic editing, many hands, text (re)use, location/orientation on page
Print and Digital Dictionaries: entries, sense, etymologies, quotations, etc.
Epigraphical Texts: partial letters, supplied text, physical description
Various writing systems: Unicode/non-Unicode characters, right-to-left, reversing lines, etc.
Thinking about this material, and indeed your own, what do you think are the things you would like to mark up?
Pretend an authoritarian anti-intellectual government has come to power and, through a series of bad decisions, has to slash your project funding by 50%. What do you do?
Repeat the exercise.
SAVE THE DATE:
(Do I have to see the XML markup?)
In the end the best method to use is that which enables you actually to create your rich digital texts
(How can I publish these TEI files?)
Your Edition or Web Page Template
Embedded divisions of custom HTML elements
CETEIcean
JavaScript
It isn't just that we want to record the abbreviations but also at least:
This uses <choice> with <abbr> (abbreviated word) and <expan> (expanded word) with <am> (abbreviation marker) and <ex> (expanded text); also <stage>, <note>, <sp>, <speaker>, <l>, <hi>, and others
It isn't just benefits for presentation that such markup provides but analysis as well. With it one could ask questions about:
My example of the kind of scholarly editor I have in mind could be a vague stereotype that should work for any editor, but I also base it on people like my colleague the wonderful Professor Jenny Richards because she:
For years I have said:
"It is easier to teach the subject specialist TEI than it is to teach a TEI expert another subject specialism".
This remains true, but for large scale adoption of an editing platform editors should not have to learn:
(Unless they want to of course!)
Dr James Cummings
James.Cummings@newcastle.ac.uk
@jamescummings
CC+BY (press space to cycle through slides)