Text Encoding and Analysis with TEI

This course provides a theoretical and practical introduction to the modelling, encoding, and analysis of humanities data using XML and the Text Encoding Initiative (TEI).

Instructor: Roman Bleier, Hans Clausen, Sarah Lang, Christopher Pollin

Course Overview

The course introduces students to the theoretical foundations and practical application of text encoding in the Digital Humanities, with a particular focus on XML and the Text Encoding Initiative (TEI). As a widely adopted standard for the semantic annotation and enrichment of humanities data, TEI plays a central role in digital editions, digital collections, and linguistic corpora.

Combining lectures, discussions, and practical exercises, the course familiarises students with XML, schema languages, data modelling, and text encoding using the TEI Guidelines. Students gain hands-on experience in designing and encoding humanities data while developing an understanding of the conceptual decisions involved in modelling textual sources for scholarly research.

This course was taught at the University of Vienna in Winter Semester 2020/2021 (co-taught with Roman Bleier, Hans Clausen and Christopher Pollin) and Winter Semester 2021/2022 (co-taught withChristopher Pollin).

A YouTube playlist is available that covers part of this class (in German).