Welcome to the Language Resource Management Agency of SADiLaR. This repository provides access to all of the collections, data sets, tools and other language resources that are distributed by SADiLaR.

The repository will eventually replace all of the functionality of the original RMA site, with all of the resources available from the RMA, also available from this repository.

Select a community to browse its collections.

Language Resource Management Agency [511]
  • Afrikaans lexical blends dataset 

    Trollip, Benito, et al. (North-West University, 2023-12)
    This a dataset of Afrikaans blend constructions that have been collected and analysed using the Levenshtein distance metric. This dataset serves as the ...
  • USAf National Language Resources Audit 2023 

    Van Dyk, T.J., et al. (South African Centre for Digital Language Resources, 2023-10)
    This report documents the findings of a comprehensive language resources audit conducted by the South African Centre for Digital Language Resources ...
  • Generic Multilingual Academic Wordlists with Definitions 

    Van Dyk, Tobie (SADiLaR; ICELDA, 2022)
    This multilingual generic academic wordlist has been developed to serve as a resource to students to assist with building a vocabulary and decoding ...
  • NCHLT isiZulu word2vec-Skipgram embeddings 

    Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2023-05-01)
    Static word embeddings for the Skipgram flavour of the word2vec (w2v) architecture (Mikolov et al., 2013). The embedding provides real-valued vector ...
  • NCHLT isiXhosa word2vec-Skipgram embeddings 

    Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2023-05-01)
    Static word embeddings for the Skipgram flavour of the word2vec (w2v) architecture (Mikolov et al., 2013). The embedding provides real-valued vector ...
  • NCHLT Tshivenḓa word2vec-Skipgram embeddings 

    Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2023-05-01)
    Static word embeddings for the Skipgram flavour of the word2vec (w2v) architecture (Mikolov et al., 2013). The embedding provides real-valued vector ...
  • NCHLT Xitsonga word2vec-Skipgram embeddings 

    Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2023-05-01)
    Static word embeddings for the Skipgram flavour of the word2vec (w2v) architecture (Mikolov et al., 2013). The embedding provides real-valued vector ...
  • NCHLT Setswana word2vec-Skipgram embeddings 

    Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2023-05-01)
    Static word embeddings for the Skipgram flavour of the word2vec (w2v) architecture (Mikolov et al., 2013). The embedding provides real-valued vector ...
  • NCHLT Sesotho word2vec-Skipgram embeddings 

    Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2023-05-01)
    Static word embeddings for the Skipgram flavour of the word2vec (w2v) architecture (Mikolov et al., 2013). The embedding provides real-valued vector ...
  • NCHLT Siswati word2vec-Skipgram embeddings 

    Roald Eiselen (North-West University; Centre for Text Technology (CTexT), 2023-05-01)
    Static word embeddings for the Skipgram flavour of the word2vec (w2v) architecture (Mikolov et al., 2013). The embedding provides real-valued vector ...

View more