[{"@type":"PropertyValue","name":"Format","value":"16kHz, 16 bit, wav, mono channel"},{"@type":"PropertyValue","name":"Content category","value":"including interview, self-meida,variety show, etc."},{"@type":"PropertyValue","name":"Recording environment","value":"Low background noise"},{"@type":"PropertyValue","name":"Country","value":"Romania(ROU)"},{"@type":"PropertyValue","name":"Language(Region) Code","value":"ro-RO"},{"@type":"PropertyValue","name":"Language","value":"Romanian"},{"@type":"PropertyValue","name":"Features of annotation","value":"Transcription text, timestamp, speaker ID, gender, noise,PII redacted"},{"@type":"PropertyValue","name":"Accuracy","value":"Word Accuracy Rate (WAR) 98%"}]
{"id":1479,"datatype":"1","titleimg":"/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"839 Hours - Romanian(Romania) Real-world Casual Conversation and Monologue speech dataset","datazy":[{"title":"Format","value":"16kHz, 16 bit, wav, mono channel"},{"title":"Content category","value":"including interview, self-meida,variety show, etc."},{"title":"Recording environment","value":"Low background noise"},{"title":"Country","value":"Romania(ROU)"},{"title":"Language(Region) Code","value":"ro-RO"},{"title":"Language","value":"Romanian"},{"title":"Features of annotation","value":"Transcription text, timestamp, speaker ID, gender, noise,PII redacted"},{"title":"Accuracy","value":"Word Accuracy Rate (WAR) 98%"}],"datatag":"Romania,Romanian,Casual Conversation,Monologue,Asr","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_8.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=jJi5%2F%2B41nBuHaogZeDQHm6uxCO4%3D","/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_8.wav","sau o parte din lucrurile pe care le putem vedea printr-o lentilă feministă, atunci când ne uităm"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_2.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=B3sTI8la96NKBwQXp1lWRkeYU3w%3D","/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_2.wav","În acest episod, continuăm discuția noastră despre hipsteri și ne uităm cu precădere la ceea ce înseamnă"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_7.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=%2Fv7vu3Sya54HAnDu8R1GbP7DXN0%3D","/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_7.wav","cu Ciprian State să-l întreb cum l-a ales. Da, o să ne concentrăm în acest videoclip pe o parte din dinamicile de gen"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_6.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=EJdlDcKNBpuTC%2FgUoqGwHHw49I4%3D","/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_6.wav","Eroul ideal al clasei de mijloc sau țap ispășitor. Și cu cât mă uit mai mult la acest titlu, cu atât îmi doresc mai mult să vorbesc"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_1.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=ZmN%2FtdgnYxicUpJTRXNZ54o1QZ0%3D","/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_1.wav","Salut tuturor! Eu sunt Adriana Radu și mă bucur să vă găsesc aici la un nou episod din SEXUL VERSUS BARZA!"]],"officialSummary":"Romanian(Romania) Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks.Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":"","datakeyword":["Romania","Romanian","Casual Conversation","Monologue","Asr"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
[{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_8.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=jJi5%2F%2B41nBuHaogZeDQHm6uxCO4%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_2.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=B3sTI8la96NKBwQXp1lWRkeYU3w%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_7.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=%2Fv7vu3Sya54HAnDu8R1GbP7DXN0%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_6.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=EJdlDcKNBpuTC%2FgUoqGwHHw49I4%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240329010_demo1732701600229/APY240329010_demo/000024_1.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=ZmN%2FtdgnYxicUpJTRXNZ54o1QZ0%3D"}]
839 Hours - Romanian(Romania) Real-world Casual Conversation and Monologue speech dataset Romania
Romanian
Casual Conversation
Monologue
Asr
Romanian(Romania) Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks.Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied. This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
16kHz, 16 bit, wav, mono channel
Content category
including interview, self-meida,variety show, etc.
Recording environment
Low background noise
Language(Region) Code
ro-RO
Features of annotation
Transcription text, timestamp, speaker ID, gender, noise,PII redacted
Accuracy
Word Accuracy Rate (WAR) 98%
Sample
Audio sau o parte din lucrurile pe care le putem vedea printr-o lentilă feministă, atunci când ne uităm
Audio În acest episod, continuăm discuția noastră despre hipsteri și ne uităm cu precădere la ceea ce înseamnă
Audio cu Ciprian State să-l întreb cum l-a ales. Da, o să ne concentrăm în acest videoclip pe o parte din dinamicile de gen
Audio Eroul ideal al clasei de mijloc sau țap ispășitor. Și cu cât mă uit mai mult la acest titlu, cu atât îmi doresc mai mult să vorbesc
Audio Salut tuturor! Eu sunt Adriana Radu și mă bucur să vă găsesc aici la un nou episod din SEXUL VERSUS BARZA!
Recommended Dataset
799 Hours - Sichuan Dialect(China) Spontaneous Dialogue Smartphone speech dataset
Sichuan Dialect(China) Spontaneous Dialogue Smartphone speech dataset, transcribed with text content, timestamp, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Sichuan speech data Sichuan Natural Conversational Speech Data Sichuan dialects conversional speech data Sichuan dialects conversional speech dataset Sichuan dialects conversional audio data
Details 2,657 Hours - Mandarin(China) Spontaneous Dialogue Smartphone speech dataset
Mandarin(China) Spontaneous Dialogue Smartphone speech dataset, transcribed with text content, timestamp, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
mandarin dialogue speech data chinese conversional speech data chinese conversional speech dataset chinese conversional audio data
Details 607 Hours - Cantonese(China) Spontaneous Dialogue Smartphone speech dataset
Cantonese(China) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics. Transcribed with text content, timestamp, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Cantonese speech data Cantonese Natural Conversational Speech Data chinese dialects conversional speech data chinese dialects speech dataset
Details 1,136 Hours - English(the United States) Spontaneous Dialogue Smartphone speech dataset
English(the United States) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics, covering generic domain. Transcribed with text content, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers(1,416 Americans), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
American English dialogue voice data natural dialogue voice data natural dialogue data dialogue data set dialogue data dialogue voice conversational AI data AI dialogue voice data
Details 1,351 Hours - Mandarin Chinese(China) Spontaneous Dialogue (Smartphone+Voice Recorder) speech dataset
Mandarin Chinese(China) Spontaneous Dialogue (Smartphone+Voice Recorder) speech dataset, collected from dialogues based on given topics, covering dozens of generic domain. Transcribed with text content, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers(1,950 people in total), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Mandarin natural dialogue audio data Mandarin conversational data Mandarin conversational dataset Mandarin conversational video data Mandarin conversational video dataset Mandarin conversational graphical data Mandarin conversational graphical dataset Mandarin conversational recording data Mandarin conversational recording dataset Mandarin conversational visual data Mandarin conversational visual dataset Mandarin speech data Mandarin speech dataset
Details 1,420 Hours - Mandarin Chinese(China) Spontaneous Monologue Smartphone speech dataset
Mandarin Chinese(China) Spontaneous Monologue Smartphone speech dataset, collected from dialogues without given topics, close to casual conversation, covering generic domain. Transcribed with text content, noise and other attributes. Our dataset was collected from extensive and diversify speakers(700 Chinese in total), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Mandarin asr data Mandarin asr dataset Mandarin asr collection Mandarin language data Mandarin language dataset Mandarin language collection Mandarin speech data Mandarin speech dataset Mandarin speech collection Mandarin discuss asr data Mandarin discuss asr dataset Mandarin discuss asr collection Mandarin discuss language data Mandarin discuss language dataset Mandarin discuss language collection Mandarin discuss speech data Mandarin discuss speech dataset Mandarin discuss speech collection Mandarin small talk asr data Mandarin small talk asr dataset Mandarin small talk asr collection Mandarin small talk language data Mandarin small talk language dataset Mandarin small talk language collection Mandarin small talk speech data Mandarin small talk speech dataset Mandarin small talk speech collection Mandarin conversational asr data Mandarin conversational asr dataset Mandarin conversational asr collection Mandarin conversational language data Mandarin conversational language dataset Mandarin conversational language collection Mandarin conversational speech data Mandarin conversational speech dataset Mandarin conversational speech collection Mandarin chat asr data Mandarin chat asr dataset Mandarin chat asr collection Mandarin chat language data Mandarin chat language dataset Mandarin chat language collection Mandarin chat speech data Mandarin chat speech dataset Mandarin chat speech collection Mandarin communication asr data Mandarin communication asr dataset Mandarin communication asr collection Mandarin communication language data Mandarin communication language dataset Mandarin communication language collection Mandarin communication speech data Mandarin communication speech dataset Mandarin communication speech collection Mandarin speech asr data Mandarin speech asr dataset Mandarin speech asr collection Mandarin speech language data Mandarin speech language dataset Mandarin speech language collection Mandarin speech speech data Mandarin speech speech dataset Mandarin speech speech collection Mandarin talk asr data Mandarin talk asr dataset Mandarin talk asr collection Mandarin talk language data Mandarin talk language dataset Mandarin talk language collection Mandarin talk speech data Mandarin talk speech dataset Mandarin talk speech collection Mandarin conversation asr data Mandarin conversation asr dataset Mandarin conversation asr collection Mandarin conversation language data Mandarin conversation language dataset Mandarin conversation language collection Mandarin conversation speech data Mandarin conversation speech dataset Mandarin conversation speech collection Mandarin impromptu asr data Mandarin impromptu asr dataset Mandarin impromptu asr collection Mandarin impromptu language data Mandarin impromptu language dataset Mandarin impromptu language collection Mandarin impromptu speech data Mandarin impromptu speech dataset Mandarin impromptu speech collection Mandarin free speech asr data Mandarin free speech asr dataset Mandarin free speech asr collection Mandarin free speech language data Mandarin free speech language dataset Mandarin free speech language collection Mandarin free speech speech data Mandarin free speech speech dataset Mandarin free speech speech collection Mandarin natural speech asr data Mandarin natural speech asr dataset Mandarin natural speech asr collection Mandarin natural speech language data Mandarin natural speech language dataset Mandarin natural speech language collection Mandarin natural speech speech data Mandarin natural speech speech dataset Mandarin natural speech speech collection Mandarin common speech asr data Mandarin common speech asr dataset Mandarin common speech asr collection Mandarin common speech language data Mandarin common speech language dataset Mandarin common speech language collection Mandarin common speech speech data Mandarin common speech speech dataset Mandarin common speech speech collection Mandarin immediate monologue asr data Mandarin immediate monologue asr dataset Mandarin immediate monologue asr collection Mandarin immediate monologue language data Mandarin immediate monologue language dataset Mandarin immediate monologue language collection Mandarin immediate monologue speech data Mandarin immediate monologue speech dataset Mandarin immediate monologue speech collection Mandarin Spontaneous asr data Mandarin Spontaneous asr dataset Mandarin Spontaneous asr collection Mandarin Spontaneous language data Mandarin Spontaneous language dataset Mandarin Spontaneous language collection Mandarin Spontaneous speech data Mandarin Spontaneous speech dataset Mandarin Spontaneous speech collection chinese asr data chinese asr dataset chinese asr collection chinese language data chinese language dataset chinese language collection chinese speech data chinese speech dataset chinese speech collection chinese discuss asr data chinese discuss asr dataset chinese discuss asr collection chinese discuss language data chinese discuss language dataset chinese discuss language collection chinese discuss speech data chinese discuss speech dataset chinese discuss speech collection chinese small talk asr data chinese small talk asr dataset chinese small talk asr collection chinese small talk language data chinese small talk language dataset chinese small talk language collection chinese small talk speech data chinese small talk speech dataset chinese small talk speech collection chinese conversational asr data chinese conversational asr dataset chinese conversational asr collection chinese conversational language data chinese conversational language dataset chinese conversational language collection chinese conversational speech data chinese conversational speech dataset chinese conversational speech collection chinese chat asr data chinese chat asr dataset chinese chat asr collection chinese chat language data chinese chat language dataset chinese chat language collection chinese chat speech data chinese chat speech dataset chinese chat speech collection chinese communication asr data chinese communication asr dataset chinese communication asr collection chinese communication language data chinese communication language dataset chinese communication language collection chinese communication speech data chinese communication speech dataset chinese communication speech collection chinese speech asr data chinese speech asr dataset chinese speech asr collection chinese speech language data chinese speech language dataset chinese speech language collection chinese speech speech data chinese speech speech dataset chinese speech speech collection chinese talk asr data chinese talk asr dataset chinese talk asr collection chinese talk language data chinese talk language dataset chinese talk language collection chinese talk speech data chinese talk speech dataset chinese talk speech collection chinese conversation asr data chinese conversation asr dataset chinese conversation asr collection chinese conversation language data chinese conversation language dataset chinese conversation language collection chinese conversation speech data chinese conversation speech dataset chinese conversation speech collection chinese impromptu asr data chinese impromptu asr dataset chinese impromptu asr collection chinese impromptu language data chinese impromptu language dataset chinese impromptu language collection chinese impromptu speech data chinese impromptu speech dataset chinese impromptu speech collection chinese free speech asr data chinese free speech asr dataset chinese free speech asr collection chinese free speech language data chinese free speech language dataset chinese free speech language collection chinese free speech speech data chinese free speech speech dataset chinese free speech speech collection chinese natural speech asr data chinese natural speech asr dataset chinese natural speech asr collection chinese natural speech language data chinese natural speech language dataset chinese natural speech language collection chinese natural speech speech data chinese natural speech speech dataset chinese natural speech speech collection chinese common speech asr data chinese common speech asr dataset chinese common speech asr collection chinese common speech language data chinese common speech language dataset chinese common speech language collection chinese common speech speech data chinese common speech speech dataset chinese common speech speech collection chinese immediate monologue asr data chinese immediate monologue asr dataset chinese immediate monologue asr collection chinese immediate monologue language data chinese immediate monologue language dataset chinese immediate monologue language collection chinese immediate monologue speech data chinese immediate monologue speech dataset chinese immediate monologue speech collection chinese Spontaneous asr data chinese Spontaneous asr dataset chinese Spontaneous asr collection chinese Spontaneous language data chinese Spontaneous language dataset chinese Spontaneous language collection chinese Spontaneous speech data chinese Spontaneous speech dataset chinese Spontaneous speech collection
Details 190 Hours - French(France) Gaming Real-world Casual Conversation and Monologue speech dataset
French(France) Gaming Real-world Casual Conversation and Monologue speech dataset, covers spontaneous dialogue about popular and evergreen games, including player discussions on battle strategies, social interactions, esports news, etc., mirrors real-world interactions. Transcribed with text content, offenssive expressions, speaker's ID, gender, accent and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
French Spontaneous Dialogue Gaming
Details 217 Hours - Spanish Financial Entities Real-world Casual Conversation and Monologue speech dataset
Spanish Financial Entities Real-world Casual Conversation and Monologue speech dataset, covering various financial professional terminologies, primarily focuses on macroeconomics and microeconomics, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, common entities and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Spanish Entity Spontaneous Dialogue Financial
Details
Tell Us Your Special Needs
1026c2a5-dbbb-49c7-9e9e-dfefda42ae10