The NECTE2 archive
Period: 2007-ongoing
As noted on the NECTE2 page, an average of seventy NECTE2 interviews have been recorded annually since 2007 by students in the School of English Literature, Language and Linguistics, as part of their modules in sociolinguistics and language variation and change.
In the period from 2010-2012, as part of the update process that turned NECTE into DECTE, 44 NECTE2 interviews were processed to become part of the TEI-conformant XML corpus: the transcriptions were revised, anonymized, and time-aligned to the corresponding audio, before the resulting text files were converted into TEI XML.
However, this did not account for all of the NECTE2 interviews that had been gathered between 2007 and 2012, and recordings have continued to be made and added to the collection in the years since 2012.
The tables below outline the current state of the full archive of NECTE2 interviews, from 2007 to 2017. Table 1 summarizes the number of interviews recorded in each year, and indicates the size of the associated transcriptions and audio files (in terms of word counts and audio durations). Tables 2 and 3 list the number of speakers recorded in each year by gender and by age group respectively.
It is our aim to process more of these materials, so that they can be added to the DECTE corpus proper, that is the TEI-conformant composite XML document described elsewhere on this website.
Table 1. NECTE2 Interviews: 2007-2017 |
Year |
Number of Interviews |
Word Count |
Audio (hrs:mins:secs) |
2007 |
63 |
728,262 |
68:04:45 |
2008 |
52 |
539,186 |
51:46:20 |
2009 |
69 |
722,591 |
64:43:13 |
2010 |
54 |
385,491 |
56:32:29 |
2011 |
110 |
707,352 |
96:26:48 |
2012 |
81 |
544,806 |
76:42:55 |
2013 |
55 |
339,538 |
53:08:09 |
2014 |
74 |
491,608 |
69:44:08 |
2015 |
60 |
372,492 |
61:11:57 |
2016 |
60 |
410,654 |
63:04:20 |
2017 |
104 |
679,553 |
111:23:24 |
Total |
782 |
5,921,533 |
772:48:28 |
Note:
Interview submissions are included in the calculations above where there is at least an audio file plus demographic information on both informants. Four of these submissions did not include student transcript files (one in 2008, two in 2015, one in 2017). There are also four additional interview audio files that were not accompanied by demographic information on both speakers (one in each of 2008, 2011, 2012 and 2017), making 786 interview audio files in total.
Table 2. NECTE2 Speakers: 2007-2017 — Gender |
Year |
Number of Speakers |
Female |
Male |
Non-binary |
2007 |
126 |
64 |
62 |
0 |
2008 |
104 |
52 |
52 |
0 |
2009 |
138 |
67 |
71 |
0 |
2010 |
108 |
66 |
42 |
0 |
2011 |
220 |
122 |
98 |
0 |
2012 |
162 |
92 |
70 |
0 |
2013 |
110 |
61 |
49 |
0 |
2014 |
148 |
96 |
52 |
0 |
2015 |
120 |
74 |
46 |
0 |
2016 |
120 |
80 |
40 |
0 |
2017 |
208 |
107 |
100 |
1 |
Total |
1,564 |
881 |
682 |
1 |
Table 3. NECTE2 Speakers: 2007-2017 — Age |
Year |
Number of Speakers |
15-20 |
21-30 |
31-40 |
41-50 |
51-60 |
61-70 |
71-80 |
81-90 |
2007 |
126 |
47 |
41 |
4 |
5 |
6 |
10 |
7 |
4 |
2008 |
104 |
44 |
26 |
6 |
11 |
8 |
5 |
4 |
0 |
2009 |
138 |
64 |
33 |
6 |
12 |
15 |
3 |
5 |
0 |
2010 |
108 |
43 |
36 |
5 |
12 |
9 |
1 |
1 |
1 |
2011 |
220 |
119 |
51 |
5 |
17 |
13 |
7 |
8 |
0 |
2012 |
162 |
91 |
33 |
1 |
10 |
12 |
9 |
6 |
0 |
2013 |
110 |
53 |
14 |
8 |
14 |
7 |
8 |
6 |
0 |
2014 |
148 |
83 |
32 |
7 |
10 |
4 |
3 |
7 |
2 |
2015 |
120 |
74 |
18 |
2 |
5 |
14 |
3 |
2 |
2 |
2016 |
120 |
74 |
16 |
7 |
8 |
8 |
4 |
3 |
0 |
2017 |
208 |
147 |
28 |
8 |
5 |
5 |
7 |
7 |
1 |
Total |
1,564 |
839 |
328 |
59 |
109 |
101 |
60 |
56 |
10 |
Note:
There is one interview from 2007 where the exact age of the two speakers is uncertain, though they are clearly in one of the older (i.e. 61+) categories. These two speakers are not currently included in the summary in Table 3.
|