{"product_id":"text-speech-and-dialogue-9783032025470","title":"Text Speech and Dialogue","description":"\u003cb\u003eBook Synopsis\u003c\/b\u003e\u003cbr\u003e\u003cp\u003e.- \u003cstrong\u003eSpeech\u003c\/strong\u003e.\u003c\/p\u003e\u003cp\u003e.- Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR.\u003c\/p\u003e\u003cp\u003e.- An Empirical Analysis of Discrete Unit Representations in Speech Language Modeling Pre-training.\u003c\/p\u003e\u003cp\u003e.- Optimizing ASR Models with Semantic Information.\u003c\/p\u003e\u003cp\u003e.- Efficient Enhancement of Norwegian ASR Model.\u003c\/p\u003e\u003cp\u003e.- Towards Stable and Personalised Profiles for Lexical Alignment in Spoken Human-Agent Dialogue.\u003c\/p\u003e\u003cp\u003e.- Audio–Vision Contrastive Learning for Phonological Class Recognition.\u003c\/p\u003e\u003cp\u003e.- TOSD-Net: A CNN-Transformer Architecture for Robust Frame-Level Overlapping Speech Detection in Diverse Acoustic Conditions.\u003c\/p\u003e\u003cp\u003e.- An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS.\u003c\/p\u003e\u003cp\u003e.- Emotion-Aware Speech-Driven Facial Avatar Animation via Joint Blendshape Prediction and Emotion Recognition.\u003c\/p\u003e\u003cp\u003e.- Beyond Static Emotions: Leveraging Multitask Learning to Model Dynamics of Dimensional Affect in Speech.\u003c\/p\u003e\u003cp\u003e.- Implicit Speaker Group Encoding in Self-supervised Speech Recognition Models.\u003c\/p\u003e\u003cp\u003e.- Combining Temporal Visual Dynamics and Audio Representations for Robust Speaker Identification.\u003c\/p\u003e\u003cp\u003e.- Sentences vs Phrases in Neural Speech Synthesis: the Phrases Strike Back.\u003c\/p\u003e\u003cp\u003e.- Evaluating Phoneme-Level Pretraining in Czech Text-to-Speech Synthesis.\u003c\/p\u003e\u003cp\u003e.- Unifying Global and Near-Context Biasing in a Single Trie Pass.\u003c\/p\u003e\u003cp\u003e.- Synthesising Cross-Speaker Data for Low-Resource Pathological Speech Recognition with PEFT.\u003c\/p\u003e\u003cp\u003e.- Multilingual Stutter Event Detection for English, German, and Mandarin Speech.\u003c\/p\u003e\u003cp\u003e.- How Far Can Synthetic Speech Go? Enhancing ASR in Low-Resource Scenarios via Voice Cloning.\u003c\/p\u003e\u003cp\u003e.- Enhancing Detection of Parkinson-induced Dysarthria with Cross-lingual Transfer Learning.\u003c\/p\u003e\u003cp\u003e.- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks.\u003c\/p\u003e\u003cp\u003e.- Detection of Cognitive Disorders Using ASR-Based Nonsense Words Repetition.\u003c\/p\u003e\u003cp\u003e.- Mind the Gap: Entity-Preserved Context-Aware ASR for Structured Transcriptions.\u003c\/p\u003e\u003cp\u003e.- Boosting CTC-Based ASR Using LLM-Based Intermediate Loss Regularization.\u003c\/p\u003e\u003cp\u003e.- Robust Disfluency Labeling in Spontaneous Speech: Insights from Diverse Hungarian Corpora Including Mentally Ill Speakers.\u003c\/p\u003e\u003cp\u003e.- ParCzech4Speech: A New Speech Corpus Derived from Czech Parliamentary Data.\u003c\/p\u003e\u003cp\u003e.- Towards an Accurate Domain-Specific ASR: Transcription for Pathology.\u003c\/p\u003e\u003cp\u003e.- Automated Speaking Assessment for L2 Learners of Czech.\u003c\/p\u003e\u003cp\u003e.- Inclusive ASR for Critical Public Services: Debiasing with Actor-Simulated Speech.\u003c\/p\u003e\u003cp\u003e.- RECA-PD: A Robust Explainable Cross-Attention Method for Speech-based Parkinson's Disease Classification.\u003c\/p\u003e\u003cp\u003e.- Systematic FAIRness Assessment of Open Voice Biomarker Datasets for Mental Health and Neurodegenerative Diseases.\u003c\/p\u003e\u003cp\u003e.- When Silence Speaks: Understanding Open-Ended Responses via LLMs in Therapeutic Voice Interaction.\u003c\/p\u003e\u003cp\u003e.- Multilingual Domain Adaptation for Speech Recognition Using LLMs.\u003c\/p\u003e\u003cp\u003e.- Using Cross-attention For Conversational ASR Over The Telephone.\u003c\/p\u003e","brand":"Springer","offers":[{"title":"Default Title","offer_id":53195515068759,"sku":"9783032025470","price":104.49,"currency_code":"GBP","in_stock":true}],"url":"https:\/\/bookcurl.com\/products\/text-speech-and-dialogue-9783032025470","provider":"Book Curl","version":"1.0","type":"link"}