en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

Revolutionizing Customer Engagement with AI Chatbots

From:Nexdata Date: 2024-08-14

Table of Contents
Voice assistants and pronunciation dictionaries
Pronunciation Dictionaries in Speech Recognition
Pronunciation Dictionaries for ASR

➤ Voice assistants and pronunciation dictionaries

In the modern field of artificial intelligence, the success of an algorithm depends on the quality of the data. As the importance of data in artificial intelligence models becomes increasingly prominent, it becomes crucial to collect and make full use of high-quality data. This article will help you better understand the core role of data in artificial intelligence programs.

A voice assistant is an intelligent application that helps users solve problems through smart conversations and instant question-and-answer interactions. Common voice assistants in daily life include "Siri" and "Xiao Du." These voice assistants come equipped with corresponding pronunciation dictionaries, which contain all the speech they can recognize.


➤ Pronunciation Dictionaries in Speech Recognition

A pronunciation dictionary is a dictionary that stores the pronunciation of all words and indicates their pronunciation. By using the pronunciation dictionary, a mapping relationship is established between the acoustic modeling units and the language modeling units, connecting the acoustic model and the language model. This creates a search state space that can be used by the decoder for decoding.


A sentence can be formed by combining several words, and the pronunciation dictionary allows us to obtain the phoneme sequence of the pronunciation of each word. The transition probabilities between adjacent words can be obtained through a language model, while the probability model of phonemes is mainly obtained through an acoustic model. This results in a probability model for a sentence.


In a speech recognition system, the larger the amount of data covered by the pronunciation dictionary, the higher the accuracy of speech recognition. When encountering new vocabulary, these words and their corresponding phonetic transcriptions can be added to the pronunciation dictionary, continuously expanding the vocabulary within it. It can be said that the three main factors for measuring the quality of a pronunciation dictionary are vocabulary size, phonetic labeling, and the accuracy of proofreading.


Currently, due to the need for professional control over the collection, labeling, and cleaning of pronunciation dictionaries, the performance of a speech recognition system can be impacted if there is not a large number of accurate pronunciation dictionaries that cover a wide range of vocabulary.


➤ Pronunciation Dictionaries for ASR

Nexdata Pronunciation Dictionary Corpus


80,279 Cantonese Pronunciation Dictionary

This pronunciation dictionary collects words with dialect characteristics in Guangdong cantonese regions. Each entry consists of three parts: words, pinyin and tones. The dictionary can be used to provide pronunciation reference for sound recording personnel, research and development of pronunciation recognition technology, etc.


101,702 Japanese Pronunciation Dictionary

The data contains 101,702 entries. All words and pronunciations are produced by Japanese linguists. It can be used in the research and development of Japanese ASR technology.


500,113 English Pronunciation Dictionary

The data contains 500,113 entries. All words and pronunciations are produced by English linguists. It can be used in the research and development of English ASR technology.


444,202 Korean Pronunciation Dictionary

The data contains 444,202 entries. All words and pronunciations are produced by Korean linguists. It can be used in the research and development of Korean ASR technology.

Data isn’t only the foundation of artificial intelligence system, but also the driving force behind future technological breakthroughs. As all fields become more and more dependent on AI, we need to innovate methods on data collection and annotation to cope with growing demands. In the future, data will continue to lead AI development and bring more possibilities to all walks of life.

4325ab75-c12c-4d6b-8476-8fdea0a7ebb2