<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE topic PUBLIC "-//OASIS//DTD DITA Topic//EN" "topic.dtd">
<topic id="topic_g2d_jxc_5y">
<title>Introduction to the DTABf</title>
<titlealts><navtitle>Introduction (English)</navtitle></titlealts>
<body>
<section><title>Introduction</title>
<p>
The structural annotation of all DTA texts follows the DTA ›Base Format‹ (DTABf).
The DTABf was developed in accordance with the P5 Guidelines of the
<xref href="http://www.tei-c.org/" scope="external" format="html">Text Encoding Initiative (TEI)</xref>. Because the TEI Guidelines
offer solutions for a very large number of tagging requirements, they are extensive
and flexible, and they are meant to be adjusted to the individual needs of projects working
with the TEI. For the DTA this was achieved by creating the DTABf, a proper subset of the
TEI P5 tagset, which fixes not only the set of elements but also the corresponding attributes
and (where applicable) their values. The DTABf tagset is fully conformant with the TEI P5 Guidelines,
i.e. the TEI tagset was only reduced, not extended in any way.
</p>
<p>
The DTABf is part of the DTA Guidelines, which also contain the General Guidelines
and the Transcription Guidelines. It is intended to
allow unrestricted tagging of all structural phenomena that may occur while at the
same time avoiding ambiguities in the tagging of similar phenomena. In this way
we want to ensure coherent text structuring across the whole DTA corpus. Given
the wide temporal coverage of the DTA corpus and the diversity of its text types and
genres, this aim is a considerable challenge, because
the heterogeneity of the texts is accompanied by great structural variability among
the original text sources.
</p>
<p>
With the DTABf we propose a standardized format for the structural annotation
of digitized historical texts. The advantage of such an approach is that diverse
TEI texts can not only be analyzed with the same methods but also be compared with
one another. The annotation guidelines underlying the DTABf are documented extensively,
which ensures that the tagging remains comprehensible. Thus, DTABf conformity not
only facilitates the integration of TEI texts into the DTA infrastructure but also
their re-use in other full-text archives.
</p></section>
<section><title>DTABf Documentation (German)</title>
<ul>
<li><xref href="ziel.dita" outputclass="nu">Introduction to the DTABf ⇗</xref></li>
<li><xref href="metadaten.dita" outputclass="nu">Structuring of Metadata ⇗</xref></li>
<li><xref href="transkription.dita" outputclass="nu">Transcription Guidelines ⇗</xref></li>
<li><xref href="texterschliessung_formal.dita" outputclass="nu">Structuring of Formal (Typographic) Phenomena ⇗ </xref></li>
<li><xref href="texterschliessung_inhaltlich.dita" outputclass="nu">Structuring of Semantic (Meaningful) Phenomena ⇗</xref></li>
<li>Special Text Types <ul outputclass="embedded">
<li outputclass="embedded"><xref href="manuskript.dita" outputclass="nu">Structuring of Manuscripts ⇗</xref></li>
<li outputclass="embedded"><xref href="zeitung.dita" outputclass="nu">Structuring of Newspapers and Journals ⇗</xref></li>
</ul></li>
<li>Overviews
<ul outputclass="embedded">
<li outputclass="embedded"><xref href="uebersichtHeader.dita" outputclass="nu">Overview of all DTABf elements within the <codeph>&lt;teiHeader&gt;</codeph> area ⇗</xref></li>
<li outputclass="embedded"><xref href="uebersichtText.dita" outputclass="nu">Overview of all DTABf elements within the <codeph>&lt;text&gt;</codeph> area ⇗</xref></li></ul></li>
<li><xref href="basisformat_template.xml" scope="external" format="xml" outputclass="nu">Boilerplate DTABf document ⇗</xref></li>
</ul>
</section>
<section>
<title>DTABf Schema</title>
<ul>
<li><xref href="basisformat.rng" scope="external" format="html" outputclass="nu">DTABf for prints: RNG schema ⇗</xref></li>
<li><xref href="basisformat.odd" scope="external" format="html" outputclass="nu">DTABf for prints: ODD ⇗</xref></li>
<li><xref href="basisformat_ms.rng" scope="external" format="html" outputclass="nu">DTABf for manuscripts: RNG schema ⇗</xref></li>
<li><xref href="basisformat_ms.odd" scope="external" format="html" outputclass="nu">DTABf for manuscripts: ODD ⇗</xref></li>
<li><xref href="basisformat.sch" scope="external" format="html" outputclass="nu">DTABf Schematron constraints set ⇗</xref></li>
</ul></section>
<section><title>Useful Tools and Applications</title>
<p><b>Webform for Metadata Entry:</b></p>
<p>
The DTA provides a web form that facilitates the creation of DTABf-conformant TEI headers.
Users do not have to write the rather complex TEI header by hand; instead, they can
fill out the form and automatically generate a DTABf-conformant TEI header.
</p>
<ul>
<li><xref href="http://www.deutschestextarchiv.de/dtae/submit/clarin" scope="external" format="html">Webform for Metadata Entry</xref></li>
</ul>
<p><b>Framework for Text Entry:</b></p>
<p>
For text transcription and DTABf-conformant annotation, the DTA offers a framework for the Author
mode of the oXygen XML Editor. This DTA oXygen framework (DTAoX) gives users an immediate
visualization of their annotated texts and lets them transcribe and annotate texts from scratch in
a WYSIWYG-like environment. DTAoX is available under the GNU Lesser General Public License (LGPL).
The current version has been optimized for oXygen versions 14.2 and 15.
</p>
<ul>
<li>Version 1.1.1 (November 29th, 2013): <xref href="http://www.deutschestextarchiv.de/files/DTAoX-1.1.1.zip" scope="external" format="html">Framework</xref> (.zip) </li>
</ul></section>
</body>
</topic>