THE NEWCASTLE ELECTRONIC CORPUS OF TYNESIDE ENGLISH

Documentation: speech tags

Chapter 11 of the TEI Guidelines deals with transcriptions of speech, and begins with the disclaimer that ‘the present proposals are not intended to support unmodified every variety of research undertaken upon spoken material now or in the future; some discourse analysts, some phonologists, and doubtless others may wish to extend the scheme presented here to express more precisely the set of distinctions they wish to draw in their transcriptions’. NECTE uses only a small selection of the tags that TEI provides:

  • <u>, the TEI ‘utterance’ tag. <u> is ‘a stretch of speech usually preceded or followed by silence or by a change of speaker’ (Guidelines 11.2). NECTE uses it in exactly this way to mark speaker utterances in all of its representational types. In each case the 'id' attribute identifies the speaker in question, for example <u id='informantTlsg01'>. Speaker 'id's are kept unique by suffixing to 'interviewer' or 'informant', as appropriate, the entity name for the interview in question, taken from the list defined by <!ENTITY % interviews SYSTEM 'interviews.ent'> %interviews; in the DOCTYPE declaration. Where there is more than one informant, which is invariably the case in the PVC-derived interviews and occasionally in the TLS-derived ones, a final 'a', 'b', 'c'... is appended, for example <u id='informantTlsg01a'>, <u id='informantTlsg01b'>, and so on.

  • <vocal> (Guidelines 11.2.3) marks ‘any vocalized but not necessarily lexical phenomenon, for example voiced pauses, non-lexical back-channels, etc’. NECTE uses this tag not only for voiced pauses but also for other phenomena such as laughter.

  • <event> (Guidelines 11.2) marks ‘any phenomenon or occurrence, not necessarily vocalized or communicative, for example incidental noises or other events affecting communication’. The main NECTE use of this tag is to note interruptions of one speaker by another.
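
As a rough illustration of how the three tags described above might combine, the fragment below sketches a short exchange between an interviewer and two informants. It is not taken from the corpus: the spoken words are invented placeholders, and the 'desc' attribute values ('laughter', 'interruption') are assumptions about how such phenomena could be labelled, based on the 'desc' attribute that the TEI Guidelines define for <vocal> and <event>. Only the 'id' pattern follows the scheme described above.

```xml
<!-- Hypothetical fragment for illustration only; not a complete or
     validated NECTE document. Utterance text and 'desc' values are
     invented. -->

<!-- 'interviewer' + interview entity name identifies the interviewer -->
<u id='interviewerTlsg01'>invented interviewer question</u>

<!-- first of several informants: a final 'a' is appended to the id -->
<u id='informantTlsg01a'>invented reply <vocal desc='laughter'/> rest of the reply <event desc='interruption'/></u>

<!-- second informant in the same interview: a final 'b' is appended -->
<u id='informantTlsg01b'>invented words from the second informant</u>
```

Here <vocal> and <event> are written as empty elements placed at the point in the utterance where the phenomenon occurs, which is one common way of positioning them; the actual NECTE files may attach further attributes or place these elements differently.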