TEI related resources

This page provides information and resources for representing spoken language transcriptions in a TEI compliant format.
The approach implemented here is described in:
Schmidt, Thomas (2011): A TEI-based Approach to Standardising Spoken Language Transcription. In: Journal of the Text Encoding Initiative (1). [Online text]

Example files

The EXMARaLDA Demo corpus has TEI versions of transcripts in ten different languages.
Consult the Corpus overview to download the corresponding audio and/or video files and to view HTML presentations of the transcripts
Many further examples can be found in several of the HZSK corpora
Here are direct links to the TEI files:

German AnneWill_TEI.xml
Helge_Schneider_Arbeitsamt_TEI.xml
EnglishTranslator_TEI.xml
Hubert_Fichte_Interview_TEI.xml
ForumWaffenrecht_TEI.xml
HartAberFair_TEI.xml
Rossau_TEI.xml
Rudi_Voeller_Wutausbruch_TEI.xml
Helge_Schneider_Tropfsteinhoehle_TEI.xml
English Beckhams_TEI.xml
MyTheory_TEI.xml
PaulMcCartney_TEI.xml
PearStory_TEI.xml
French royal_TEI.xml
Italian Gasperini_TEI.xml
Spanish savater_TEI.xml
Polish SzymonMajewski_TEI.xml
Swedish and Norwegian TeliaTelenor_TEI.xml
Turkish Tuerkisch_TEI.xml
Vietnamese NguyenNgocNgan_TEI.xml

Document grammars

These XML schemas, generated by ROMA, can be used to validate TEI transcriptions.

XSL Stylesheets

These stylesheets can be used to transform an EXMARaLDA Basic-Transcription to a TEI transcription and vice versa.

This stylesheet can be used to transform a FOLKER transcription following the cGAT conventions for a minimal transcript into a TEI transcription.

This stylesheet can be used to generate an HTML representation of a TEI transcription.

TEI Drop

TEI Drop is an application to produce and parse TEI ouput for different tool formats (CHAT, ELAN, EXMARaLDA, FOLKER and Transcriber) and different transcription conventions (HIAT, cGAT). It is part of the EXMARaLDA tools package, you can download the latest preview from the previews page.