en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

589 Hours - Italian(Italy) Spontaneous Dialogue Smartphone speech dataset

phone
Conversational Speech
Italian
Italian discuss data
Italian discuss dataset
Italian discuss collection
Italian small talk data
Italian small talk dataset
Italian small talk collection
Italian conversational data
Italian conversational dataset
Italian conversational collection
Italian chat data
Italian chat dataset
Italian chat collection
Italian communication data
Italian communication dataset
Italian communication collection
Italian speech data
Italian speech dataset
Italian speech collection
Italian talk data
Italian talk dataset
Italian talk collection
Italian conversation data
Italian conversation dataset
Italian conversation collection
italia discuss data
italia discuss dataset
italia discuss collection
italia small talk data
italia small talk dataset
italia small talk collection
italia conversational data
italia conversational dataset
italia conversational collection
italia chat data
italia chat dataset
italia chat collection
italia communication data
italia communication dataset
italia communication collection
italia speech data
italia speech dataset
italia speech collection
italia talk data
italia talk dataset
italia talk collection
italia conversation data
italia conversation dataset
italia conversation collection

Italian(Italy) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(728 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
16kHz, 16 bit, wav, mono channel;
Content category
Dialogue based on given topics;
Recording condition
Low background noise (indoor);
Recording device
Android smartphone, iPhone;
Speaker
728 native speakers in total, 44% male and 56% female;
Country
Italian(ITA);
Language(Region) Code
it-IT;
Language
Italian;
Features of annotation
Transcription text, timestamp, speaker ID, gender, noise, PII redacted.
Accuracy Rate
Word Accuracy Rate (WAR) 98%
Sample Sample
  • Audio

    Ah de dei bambini che utilizzano troppo social media eh

  • Audio

    Soddisfa il palazzo, sodi- soddisfa uno dei, dei centri del piacere più più remoti, no?

  • Audio

    Certo è vero.

  • Audio

    Perché mi è venuto un po' fame.

  • Audio

    ma sinceramente mi piacerebbe più parlare di cibo.

Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

1608a318-b748-4b9f-8e66-1ac78201e806

5b697fc4-1b8b-4d92-b368-644997c36abb