Welcome to the South African Centre for Digital Language Resources website
Log In

Log In

Forgot Your Password?

Tray Subtotal: R0.00

Lwazi Sesotho Pronunciation Dictionary

Be the first to review this resource

Availability: Available for download

R0.00

Quick Overview

General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology systems, rather than phonetically accurate. Audio samples of all phonemes included. A letter-to-sound rule set for predicting the pronunciations of generic words included. (Separate entry describes rule sets.)

Lwazi Sesotho Pronunciation Dictionary

Double click on above image to view full picture

Zoom Out
Zoom In
R0.00
General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology systems, rather than phonetically accurate. Audio samples of all phonemes included. A letter-to-sound rule set for predicting the pronunciations of generic words included. (Separate entry describes rule sets.)

Write Your Own Review

Only registered users can write reviews. Please, log in or register

Additional Information

Contact persons and email addresses Karen Calteaux: KCalteaux@csir.co.za
Affiliations Meraka Institute, CSIR
Licensing Creative Commons Attribution 2.5 South Africa License
Licensing details http://creativecommons.org/licenses/by/2.5/za/legalcode
Names of principal developers Marelie Davel
Media type Speech
ISLRN 039-887-160-393-7
Category Pronunciation dictionaries
Annotation details No
Citation information M Davel and O Martirosian, "Pronunciation dictionary development in resource-scarce environments", In Proc. Interspeech, Brighton, UK, September 2009, pp 2851-2854.
Description of background and purpose General phonemic pronunciations for frequently occurring words in SA languages. Dictionaries were developed to be practically usable for speech technology systems, rather than phonetically accurate. Audio samples of all phonemes included. A letter-to-sound rule set for predicting the pronunciations of generic words included. (Separate entry describes rule sets.)
Distribution Downloadable
Source Studio recordings, Web
Stratum (structure of data) Frequently occurring words (top 5,000 from available word lists). No proper names or foreign words included.
Size (number of tokens/duration) Approximately 5,000 words
File size 760Kb zipped; 1.4Mb unzipped
Specialised software required N/A
Maturity Released
Verification and proof of quality Manually verified by language expert.
Compatibility with standards Some well-defined guidelines (in-house/external)
Details of documentation available

- M Davel and O Martirosian, "Pronunciation dictionary development in resource-scarce environments", In Proc. Interspeech, Brighton, UK, September 2009, pp 2851-2854.

- M Davel and E Barnard, "Pronunciation prediction with Default&Refine", Computer Speech and Language, 22(4), pp 374-393.

- Dictionary developer profiles at http://www.meraka.org.za/lwazi/oldsite/pdf/dictionary_developer_profiles.pdf

Standards compliance details No
Contributors No

Resource Tags

Use spaces to separate tags. Use single quotes (') for phrases.