The NECTE2 archive


Period: 2007-ongoing


As noted on the NECTE2 page, an average of seventy NECTE2 interviews have been recorded annually since 2007 by students in the School of English Literature, Language and Linguistics, as part of their modules in sociolinguistics and language variation and change.

In the period from 2010-2012, as part of the update process that turned NECTE into DECTE, 44 NECTE2 interviews were processed to become part of the TEI-conformant XML corpus: the transcriptions were revised, anonymized, and time-aligned to the corresponding audio, before the resulting text files were converted into TEI XML.

However, this did not account for all of the NECTE2 interviews that had been gathered between 2007 and 2012, and recordings have continued to be made and added to the collection in the years since 2012.

The tables below outline the current state of the full archive of NECTE2 interviews, from 2007 to 2017. Table 1 summarizes the number of interviews recorded in each year, and indicates the size of the associated transcriptions and audio files (in terms of word counts and audio durations). Tables 2 and 3 list the number of speakers recorded in each year by gender and by age group respectively.

It is our aim to process more of these materials, so that they can be added to the DECTE corpus proper, that is the TEI-conformant composite XML document described elsewhere on this website.

Table 1. NECTE2 Interviews: 2007-2017
Year Number of
Interviews
Word
Count
Audio
(hrs:mins:secs)
2007 63 728,262 68:04:45
2008 52 539,186 51:46:20
2009 69 722,591 64:43:13
2010 54 385,491 56:32:29
2011 110 707,352 96:26:48
2012 81 544,806 76:42:55
2013 55 339,538 53:08:09
2014 74 491,608 69:44:08
2015 60 372,492 61:11:57
2016 60 410,654 63:04:20
2017 104 679,553 111:23:24
Total 782 5,921,533 772:48:28

Note:

Interview submissions are included in the calculations above where there is at least an audio file plus demographic information on both informants. Four of these submissions did not include student transcript files (one in 2008, two in 2015, one in 2017). There are also four additional interview audio files that were not accompanied by demographic information on both speakers (one in each of 2008, 2011, 2012 and 2017), making 786 interview audio files in total.

Table 2. NECTE2 Speakers: 2007-2017 — Gender
Year Number of
Speakers
Female Male Non-binary
2007 126 64 62 0
2008 104 52 52 0
2009 138 67 71 0
2010 108 66 42 0
2011 220 122 98 0
2012 162 92 70 0
2013 110 61 49 0
2014 148 96 52 0
2015 120 74 46 0
2016 120 80 40 0
2017 208 107 100 1
Total 1,564 881 682 1
Table 3. NECTE2 Speakers: 2007-2017 — Age
Year Number of
Speakers
15-20 21-30 31-40 41-50 51-60 61-70 71-80 81-90
2007 126 47 41 4 5 6 10 7 4
2008 104 44 26 6 11 8 5 4 0
2009 138 64 33 6 12 15 3 5 0
2010 108 43 36 5 12 9 1 1 1
2011 220 119 51 5 17 13 7 8 0
2012 162 91 33 1 10 12 9 6 0
2013 110 53 14 8 14 7 8 6 0
2014 148 83 32 7 10 4 3 7 2
2015 120 74 18 2 5 14 3 2 2
2016 120 74 16 7 8 8 4 3 0
2017 208 147 28 8 5 5 7 7 1
Total 1,564 839 328 59 109 101 60 56 10

Note:

There is one interview from 2007 where the exact age of the two speakers is uncertain, though they are clearly in one of the older (i.e. 61+) categories. These two speakers are not currently included in the summary in Table 3.