DECTE | Corpus List

List of texts

The list of texts included in DECTE is a sequence of entity references defined by the interviews.ent file included in the DTD.

&decten1tlsg01;	&decten1tlsg02;	&decten1tlsg03;	&decten1tlsg04;	&decten1tlsg05;
&decten1tlsg06;	&decten1tlsg07;	&decten1tlsg08;	&decten1tlsg09;	&decten1tlsg10;
&decten1tlsg11;	&decten1tlsg12;	&decten1tlsg13;	&decten1tlsg14;	&decten1tlsg15;
&decten1tlsg16;	&decten1tlsg17;	&decten1tlsg18;	&decten1tlsg19;	&decten1tlsg20;
&decten1tlsg21;	&decten1tlsg22;	&decten1tlsg23;	&decten1tlsg24;	&decten1tlsg25;
&decten1tlsg26;	&decten1tlsg27;	&decten1tlsg28;	&decten1tlsg29;	&decten1tlsg30;
&decten1tlsg31;	&decten1tlsg32;	&decten1tlsg33;	&decten1tlsg34;	&decten1tlsg35;
&decten1tlsg36;	&decten1tlsg37;	&decten1tlsn01;	&decten1tlsn02;	&decten1tlsn03;
&decten1tlsn04;	&decten1tlsn05;	&decten1tlsn06;	&decten1tlsn07;	&decten1pvc01;
&decten1pvc02;	&decten1pvc03;	&decten1pvc04;	&decten1pvc05;	&decten1pvc06;
&decten1pvc07;	&decten1pvc08;	&decten1pvc09;	&decten1pvc10;	&decten1pvc11;
&decten1pvc12;	&decten1pvc13;	&decten1pvc14;	&decten1pvc15;	&decten1pvc16;
&decten1pvc17;	&decten1pvc18;	&decten2y07i001;	&decten2y07i002;	&decten2y07i003;
&decten2y07i004;	&decten2y07i005;	&decten2y07i006;	&decten2y07i007;	&decten2y07i008;
&decten2y07i009;	&decten2y07i010;	&decten2y07i011;	&decten2y07i012;	&decten2y07i013;
&decten2y07i014;	&decten2y08i001;	&decten2y08i002;	&decten2y08i003;	&decten2y08i004;
&decten2y10i001;	&decten2y10i002;	&decten2y10i003;	&decten2y10i004;	&decten2y10i005;
&decten2y10i006;	&decten2y10i007;	&decten2y10i008;	&decten2y10i009;	&decten2y10i010;
&decten2y10i011;	&decten2y10i012;	&decten2y10i013;	&decten2y10i014;	&decten2y10i015;
&decten2y10i016;	&decten2y10i017;	&decten2y10i018;	&decten2y10i019;	&decten2y10i020;
&decten2y10i021;	&decten2y10i022;	&decten2y10i023;	&decten2y10i024;	&decten2y10i025;
&decten2y10i026;

Each reference in the sequence denotes the text of a single constituent interview from the corpus, and each text is itself a TEI-conformant XML document.

All the text names begin with decte. Thereafter the names reflect the subcorpus to which the interview belongs: TLS (1970s), PVC (1990s), and NECTE2 (2000s). The table below illustrates how the naming system works for files from each subcorpus:

(a) TLS texts
decten1tlsg01 - decten1tlsg37; decten1tlsn01 - decten1tlsn07
decten1tlsg01	n1 indicates that the text comes from the NECTE corpus
decten1tlsg01	tls shows that the text is part of the TLS subcorpus
decten1tlsg01	g / n indicates that the interviewee is from Gateshead (g) or Newcastle (n)
decten1tlsg01	01: the two-digit number at the end ranks the texts
(b) PVC texts
decten1pvc01 - decten1pvc18
decten1pvc01	n1 indicates that the text comes from the NECTE corpus
decten1pvc01	pvc shows that the text is part of the PVC subcorpus
decten1pvc01	01: the two-digit number at the end ranks the texts
(c) NECTE2 texts
decten2y07i001 - decten2y07i014; decten2y08i001 - decten2y08i004; decten2y10i001 - decten2y10i026
decten2y07i001	n2 indicates that the text comes from the NECTE2 corpus
decten2y07i001	y07/y08/y10 indicates the year of the interview (y07=2007, y08=2008, y10=2010)
decten2y07i001	i001: the letter i (interview) followed by a three-digit number ranks the texts

All texts have the same structure:

<TEI xml:id="decten1tlsg01">

<teiHeader type="text">



</teiHeader>

<text>



</text>

</TEI>

where:

Each <TEI> element contains a single TEI-conformant document (i.e. one of the interviews) which in turn consists of a TEI header element and a <text> element. The identity of the interview in question is indicated in the opening tag of the <TEI> element by the @xml:id attribute, using the appropriate DECTE text entity reference (decten1tlsg01 in the example above).
The <teiHeader> element contains the metadata specific to the interview in question. This header has the same structure as the global header (discussed here), but it is much simpler since most characteristics of the individual interviews are described globally. Specifically, it contains only an empty <fileDesc> element (which is mandatory according to the TEI guidelines, even when empty), and a <profileDesc> element, which contains social data relating to the interviewee(s). The <profileDesc> element has the following internal structure:

<profileDesc>

<particDesc>



<person xml:id="interviewerTLSG01">

</person>

<person xml:id="informantTLSG01">

<age>31-40</age>

<sex>Female</sex>

<residence>Tyneside - Gateshead</residence>

<occupation>Housewife (previously Tailor)</occupation>

<education>Left school at 15</education>

</person>

</particDesc>

</profileDesc>

For some informants in some DECTE interviews, information is not available for all of the subelements within the <person> element. Where information is not available, the relevant subelement is left empty.

The <text> element contains the text of the interview. This element is relatively complex. Its internal structure and characteristics are described in the following sections: