Welcome to the Language Resource Management Agency website

NCHLT Afrikaans Annotated Text Corpora

Availability: Available for download

R0.00

Quick Overview

Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.

Additional Information

Contact persons and email addresses: Martin Puttkammer (Martin.Puttkammer@nwu.ac.za)
Affiliations: North-West University, Centre for Text Technology (CTexT)
Licensing: Creative Commons Attribution 2.5 South Africa License
Licensing details: http://creativecommons.org/licenses/by/2.5/za/legalcode
Names of principal developers: Martin Puttkammer, Martin Schlemmer, Ruan Bekker
Media type: Text
ISLRN: 139-586-400-050-9
Category: Monolingual text corpora: Annotated
Annotation details: Annotated with lemma, part of speech and morphological analyses
Citation information: Eiselen, E.R. & Puttkammer, M.J. 2014. Developing text resources for ten South African languages. (In Proceedings of the 9th International Conference on Language Resources and Evaluation, Reykjavik, Iceland. p. 3698-3703)
Description of background and purpose: Lemmatised, part of speech tagged and morphologically analysed corpora developed during the NCHLT Text project.
Distribution: RMA (www.rma.nwu.ac.za)
Source: Based on documents from the South African government domain crawled from gov.za websites and collected from various language units.
Stratum (structure of data): Details provided in documentation.
Size (number of tokens/duration): No
File size: No
Specialised software required: Spreadsheet software required for xls versions; LARA2 required for LARA2 versions.
Maturity: Released
Verification and proof of quality: Manually verified by language expert.
Compatibility with standards: A common standard and fully compliant
Details of documentation available: Readme and protocol included; project report available on request.
Standards compliance details: No
Contributors: No