Speech Synthesis Datasets,dataset results for Text-To-Speech Synthesis - Nexdata

en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

Home > All Category Datasets > Speech Synthesis Datasets

Voice Type

All

19

Average Tone

14

Female

4

Male

1

Language

All

19

Chinese Dialects

2

English

6

Japanese

2

Mandarin

1

Others

10

20 Hours - American English Male Voice TTS Dataset

This dataset contains 20 hours of American English male voice recordings. It is recorded by Americans (native English speakers) with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It is suitable for text-to-speech (TTS) model training, phoneme recognition research, and AI voice development.

TTS english dataset speech synthesis dataset TTS male voice dataset male voice dataset for tts American English speech synthesis dataset

19.46 Hours - American English Female Voice TTS Dataset

This dataset contains 19.46 hours of American English female voice recordings. It is recorded by American (native English speaker) with authentic accent and clear, sweet tone. The phoneme coverage is balanced. Professional phoneticians participate in the annotation. It is suitable for text-to-speech (TTS) model training, phoneme recognition, and AI voice development requiring natural-sounding female speech.

American English speech synthesis dataset female voice dataset for TTS American English female voice corpus speech synthesis training data female TTS dataset American English female speaker speech synthesis dataset TTS english dataset

10.4 Hours – Japanese Female Voice TTS Dataset

This dataset contains 10.4 hours of Japanese female voice recordings. It is recorded by Japanese native speaker with an authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. This corpus is ideal for tasks such as Japanese text-to-speech (TTS) training, speech synthesis research, and AI voice model development.

Japanese speech synthesis dataset Japanese tts dataset Japanese text-to-speech dataset female female japanese tts dataset

2 Speakers – Australian English TTS Dataset (Native Accent)

This dataset features recordings from 2 native Australian English speakers with authentic accents. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Australian English TTS dataset Australian speech dataset for AI Australian accent speech dataset Australian text to speech voices multi-speaker Australian English dataset Australian English phoneme balanced dataset

8 Hours – Spanish TTS Dataset with Native Castilian Accent

This dataset includes recordings from 2 native Spanish speakers with authentic Castilian accents. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Spanish speech dataset for TTS Spanish text to speech dataset Spanish voice dataset for AI models native Spanish accent dataset Castilian Spanish TTS dataset Spanish speech synthesis dataset

12 Hours – Italian TTS Dataset with Native Accent

This dataset includes recordings from 3 native Italian speakers with authentic accents. Covering both customer service and general speaking styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Italian speech dataset for TTS Italian text to speech dataset Italian voice dataset for AI Italian accent speech dataset multi-speaker Italian TTS dataset Italian TTS dataset

10 Speakers – British English TTS Dataset with Authentic Accent

This dataset contains recordings from 10 native British English speakers with an authentic UK accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the text-to-speech (TTS) systems and AI voice synthesis models.

British English speech synthesis dataset British English voice dataset for TTS British accent speech corpus UK English speech dataset female male natural British English voice dataset British English tts dataset

8 Hours - Canadian French TTS Dataset (Native Accent)

This dataset contains recordings from 2 native Canadian French speakers with authentic accents. It is ideal for researchers and developers seeking natural Canadian French voices.

Canadian French TTS dataset Canadian French speech dataset for AI Canadian French accent speech corpus Canadian French text to speech voices Canadian French speech dataset

6 Speakers – Taiwanese Mandarin Speech Dataset for TTS

This dataset includes recordings from 6 professional voice actors from Taiwan, covering news and colloquial speech. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Taiwanese Mandarin speech dataset Taiwan Mandarin TTS dataset Mandarin speech synthesis corpus native Taiwanese Mandarin corpus

loading

Tailor Your Data Now

Why off-the-shelf Datasets

Copyright
Clear Coyright and Ready to Check
Security
Properly Authorized Secure to Use
Professional
Designed and produced by AI data experts
Diversity
Collected from a varity of real scenes
Cost Effective
More Cost-Efficient Than Tailored Data
Efficiency
Ready-To-Go Deliver in Seconds

Subscribe to our newsletter

Be the first to receive Nexdata latest product releases, data solutions and enterprise news.

Off-the-Shelf Datasets: All Category Datasets; Embodied AI Datasets; LLM Datasets; Computer Vision Datasets; Speech Recognition Datasets; Speech Synthesis Datasets; OCR Datasets; Pronunciation Dictionary; NLU Datasets

Data Service: 3D Point Cloud Data; Street View Data; OCR Data; Behavior Recognition Data; Identity Recognition Data; Speech Recognition Data; Speech Synthesis Data; Multimodal Data

Industries: Embodied AI; Generative AI; Autonomous Vehicles; AR/VR; Conversational AI; Smart Home; Retail; Intelligent Healthcare

Company: About Us; News; Partners; Quality & Security; Event
Links: OPENMPD; DataPlus; Datarade

Platform: Platform
Competition: Competition
Resources: Sponsored Datasets

Sharpen Your AI with Better Data

+1(626)594-5598

[email protected]

nexdata_ai facebook

nexdata_ai twitter

nexdata_ai linkedin

nexdata_ai youtube

Copyright © 2023 NEXDATA TECHNOLOGY INC

Sitemap Terms and Conditions

We use cookies to enhance your browsing experience, serve personalized ads or content, and analyze our traffic. By clicking "Accept All", you consent to our use of cookies.

80bf4577-7550-4849-9666-0080d95ca654