Additional material for the Ukwabelana Zulu corpus
Sebastian Spiegler, Andrew van der Spuy, Peter A. Flach, Additional material for the Ukwabelana Zulu corpus. CSTR-10-003, University of Bristol. July 2010. PDF, 104 Kbytes.
In this document we describe the scheme used for labelling the open-source Ukwabelana Zulu corpus, as well as the rules employed by the part-of-speech (POS) tagger that assigns POS tags to morphologically analysed words. A detailed description of Zulu morphology, the corpus itself, and its generation is given in Spiegler et al. (2010). All resources can be downloaded from http://www.cs.bris.ac.uk/Research/MachineLearning/Morphology/Resources/.