Dr James Cummings
James.Cummings@newcastle.ac.uk
@jamescummings
CC+BY (press space to cycle through slides)
(The Text Encoding Initiative: A markup vocabulary for digital texts)
Markup is used in many different fields, for many different purposes: storing data, relating information, encoding understanding, preserving metadata
Think about the uses for an italic font in any form of printed publication. Why might an author/publisher put some text into italics? What are they signalling about that text?
We can usually tell these types of things apart from context. If we want to use these categories, computers need to be told these things are different.
Some common uses include:
... and many more
(What kinds of texts is the TEI good for?)
The TEI takes a generalistic approach to overall text structure and this means it should be able to cope with texts of any size, language, date, complexity, writing system, or media.
This could be in any form: books, journals, manuscripts, postcards, letters, rolls of papyrus, clay tablets, web pages, gravestones, etc. and contain any type of text.
Holinshed's Chronicles: columns, marginal notes, woodcuts
Medieval Drama: (York Cycle, Noah & Flood): Dramatic texts, speeches, rhyme schemes, editorial corrections, etc.
Medieval Manuscripts: full description, translations, stylistic rendering, variants, critical apparatus, editorial commentary, etc.
First Folio:
forme-work, catchwords, decorative initials, etc.
Wilfred Owen: manuscripts, corrections, multiple versions
Wilfred Owen: Letters, codewords
George Herbert: Graphic text layout, poetry
William Godwin's Diary: diary structure, abbreviated texts
Modern Manuscripts: Genetic editing, many hands, text (re)use, location/orientation on page
Print and Digital Dictionaries: entries, sense, etymologies, quotations, etc.
Epigraphical Texts: partial letters, supplied text, physical description
WW1 Propaganda: font, colour, glyph substitution, image classification and metadata
Various writing systems: Unicode/non-Unicode characters, right-to-left, reversing lines, etc.
Thinking about this material, and indeed your own, what do you think are the things you would like to mark up?
Pretend an authoritarian anti-intellectual government has come to power and, through a series of bad decisions, has to slash your project funding by 50%. What do you do?
Repeat the exercise.
(But I don't need everything the TEI provides, or want something it doesn't give me)
Possibilities of the
TEI Framework
Project B
Project A
New Elements
- @louburnard
(Do I have to see the XML markup?)
In the end the best method to use is that which enables you actually to create your rich digital texts
(How can I publish these TEI files?)
Your Edition or Web Page Template
Embedded divisions of custom HTML elements
CETEIcean
JavaScript
(What could possibly go wrong?)
Original URL:
http://www.cursus.uea.ac.uk/
Working URL:
http://www.cursus.org.uk/
(What is the use of failure
if we don't learn something?)
You never know who will leave/retire/die, create documentation for their replacement:
Projects tend to hide away their work, not wanting to show work-in-progress until it is finished. It is better in the long run if they work in the light, work openly making as many internal project materials available openly to the greater community. Where feasible minimal requirements should be:
Dr James Cummings
James.Cummings@newcastle.ac.uk
@jamescummings
CC+BY (press space to cycle through slides)
(A new inter-disciplinary project hoping to learn from some of these challenges)