Natural language processing projects
This section presents the most important national projects in the field of natural language processing.
ANNETA KÕNET
The ‘Anneta kõnet’ (‘Donate Speech’) project offers all adult Estonian speakers a chance to donate their speech, contributing to the preservation of the Estonian language and to the faster adoption of speech technology in daily life. People who speak Estonian as their mother tongue, as a foreign language, or in any of its dialects are invited to donate their speech. Donors can speak about a specified topic or a topic of their choice. We encourage donors to provide natural speech that reflects everyday language usage, including slang expressions. The project was funded by the European Union – NextGenerationEU.
SPONTANEOUS SPEECH MATERIAL TRANSCRIPTION PROJECT
The objective of the project is to transcribe approximately 400 hours of spontaneous Estonian speech from publicly available sources in order to advance the development of Estonian speech technology solutions (speech recognition, speech synthesis) in both the private and public sectors, as well as for research purposes. Speech technology solutions developed for the Estonian language can be used by public and private sector organisations and by private individuals. The transcribed materials will be made publicly available in the Estonian Open Data Portal. The project was funded by the European Union – NextGenerationEU.