From:Nexdata Date: 2024-08-15
The realization of multilingual AI speech recognition technology is inseparable from the support of data. Moreover, the richer the corpus, the better the quality of the language recognition model, and the higher the accuracy of the final speech recognition. Multilingual voice data covering a wide range of areas, many speakers, and a large demand have become a major bottleneck in speech recognition technology.
In response to the scarcity of Spanish speech recognition dataset, Nexdata has developed multiple sets of Spanish speech recognition datasets, covering multiple recording environments, multiple scenes, and multiple recording devices.
435 Hours - Spanish Speech Recognition Data by Mobile Phone
435 Hours - Spanish Speech Recognition Data by Mobile Phone. The data volume is 435 hours and is recorded by 989 Spanish native speakers. The recording text is designed by linguistic experts, which covers general interactive, in-car and home category. The texts are manually proofread with high accuracy. Recording devices are mainstream Android phones and iPhones.
227 Hours - Spanish Speech Recognition Data by Mobile Phone_R
The data volume is 227 hours. 227 Hours - Spanish Speech Recognition Data is recorded by Spanish native speakers from Spain, Mexico and Venezuela. It is recorded in quiet environment. The recording contents cover various fields like economy, entertainment, news and spoken language. All texts are manually transcribed. The sentence accurate is 95%.
343 People- Spanish Speech Recognition Data by Mobile Phone_Guiding
The 343 People- Spanish Speech Recognition Data is collected from 343 Spanish native speakers who from Spain, Mexico and Argentina. 50 sentences for each speaker, total 9.9 hours. The recording environment is quiet. All texts are manually transcribed with high accuracy. Recording devices are mainstream Android phones and iPhones.
338 Hours-Spanish Speech Recognition Data by Mobile Phone
The 338-hour Spanish Speech Recognition Data and is recorded by 800 Spanish-speaking native speakers from Spain, Mexico, Argentina. The recording environment is quiet. All texts are manually transcribed. The sentence accuracy rate is 95%.
762 Hours - Spanish (Latin America) Speech Recognition Data by Mobile Phone
762 Hours – Spanish Speech Recognition Data. 1,630 non-Spanish nationality native Spanish speakers such as Mexicans and Colombians participated in the recording with authentic accent. The recorded script is designed by linguists and cover a wide range of topics including generic, interactive, in-vehicle and home. The text is manually proofread with high accuracy.
500 Hours - Conversational Spanish Speech Recognition Data by Mobile Phone
The 500 Hours - Conversational Spanish Speech Recognition Data collected by phone involved more than 700 native speakers, developed with proper balance of gender ratio, Speakers would choose a few familiar topics out of the given list and start conversations to ensure dialogues' fluency and naturalness. The recording devices are various mobile phones. The audio format is 16kHz, 16bit, uncompressed WAV, and all the speech data was recorded in quiet indoor environments.
End
If you want to know more details about the Spanish speech recognition datasets or how to acquire, please feel free to contact us: [email protected].