Description

Book Synopsis

.- Automatic Speech Recognition.
.- In-Domain SSL Pre-Training and Streaming ASR: Application to Air Traffic Control Communications.
.- Evaluating the Performance of Several ASR Systems in Environmental and Industrial Noise.
.- Ground Truth-Free WER Prediction for ASR via Audio Quality and Model Confidence Features.
.- Enhancing Speech Recognition through Text-to-Speech and Voice Conversion Augmentation.
.- Best Data is more Supervised Data - Even for Hungarian ASR.
.- Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-based Models.
.- Speech Processing for Under-Resourced Languages.
.- Effect of Increased Temporal Resolution on Speech Recognition for French Quebec using Features from Speech Self-Supervised Learning Models.
.- Modeling Intra-Word Code-Switching for Karelian ASR.
.- Improving Whisper-based Serbian ASR using Synthetic Speech.
.- Domain Knowledge and Language Embeddings for Low-Resource Multilingual Phoneme ASR.
.- Whistler Identification in Whistled Spanish (Silbo): A Case Study.
.- Digital Speech Processing.
.- PinkVocalTransformer: Neural Acoustic-to-Articulatory Inversion based on the Pink Trombone.
.- CrossMP-SENet: Transformer-based Cross-Attention for Joint Magnitude-Phase Speech Enhancement. 
.- Adaptive Singing Voice Enhancement for Live Stages.
.- Revealing the Hidden Temporal Structure of HubertSoft Embeddings based on the Russian Phonetic Corpus.
.- Natural Language Processing.
.- Analyzing Web-Scraped and Generated Inputs for Automatic and Scalable Intent Classification.
.- Enhancing Retrieval Performance via LLM Hard-Negative Filtering.
.- Sector-Wise Backpropagation for Low-Resource Text Classification in Deep Models.
.- High-Frequency Multiword Units and the Typological Distribution of Multiword Units in Spoken Russian.
.- Estimation of the Genre Composition of the English Subcorpus of the Google Books Ngram.
.- Multimodal Systems.
.- Ensembling Synchronisation-based and Face-Voice Association Paradigms for Robust Active Speaker Detection in Egocentric Recordings.
.- Phonetic and Visual Characteristics of Cognitive Load.
.- Cognitive Humor Processing in the Russian and English Internet Meme Chatting: EEG Study.
.- Saudi Sign Language Translation Using T5.

Speech and Computer

    Product form

    £66.49

    Includes FREE delivery

    RRP £6,999.00 – you save £6,932.51 (99%)

    Order before 4pm tomorrow for delivery by Wed 1 Jul 2026.

    A Paperback by Alexey Karpov

    15 in stock

      Trusted by thousands of customers. See 2,385+ Customer Reviews

      View other formats and editions of Speech and Computer by Alexey Karpov

      Publisher: Springer
      Publication Date: 11/12/2025
      ISBN13: 9783032079589, 978-3032079589
      ISBN10: 3032079586

      Description

      Book Synopsis

      .- Automatic Speech Recognition.
      .- In-Domain SSL Pre-Training and Streaming ASR: Application to Air Traffic Control Communications.
      .- Evaluating the Performance of Several ASR Systems in Environmental and Industrial Noise.
      .- Ground Truth-Free WER Prediction for ASR via Audio Quality and Model Confidence Features.
      .- Enhancing Speech Recognition through Text-to-Speech and Voice Conversion Augmentation.
      .- Best Data is more Supervised Data - Even for Hungarian ASR.
      .- Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-based Models.
      .- Speech Processing for Under-Resourced Languages.
      .- Effect of Increased Temporal Resolution on Speech Recognition for French Quebec using Features from Speech Self-Supervised Learning Models.
      .- Modeling Intra-Word Code-Switching for Karelian ASR.
      .- Improving Whisper-based Serbian ASR using Synthetic Speech.
      .- Domain Knowledge and Language Embeddings for Low-Resource Multilingual Phoneme ASR.
      .- Whistler Identification in Whistled Spanish (Silbo): A Case Study.
      .- Digital Speech Processing.
      .- PinkVocalTransformer: Neural Acoustic-to-Articulatory Inversion based on the Pink Trombone.
      .- CrossMP-SENet: Transformer-based Cross-Attention for Joint Magnitude-Phase Speech Enhancement. 
      .- Adaptive Singing Voice Enhancement for Live Stages.
      .- Revealing the Hidden Temporal Structure of HubertSoft Embeddings based on the Russian Phonetic Corpus.
      .- Natural Language Processing.
      .- Analyzing Web-Scraped and Generated Inputs for Automatic and Scalable Intent Classification.
      .- Enhancing Retrieval Performance via LLM Hard-Negative Filtering.
      .- Sector-Wise Backpropagation for Low-Resource Text Classification in Deep Models.
      .- High-Frequency Multiword Units and the Typological Distribution of Multiword Units in Spoken Russian.
      .- Estimation of the Genre Composition of the English Subcorpus of the Google Books Ngram.
      .- Multimodal Systems.
      .- Ensembling Synchronisation-based and Face-Voice Association Paradigms for Robust Active Speaker Detection in Egocentric Recordings.
      .- Phonetic and Visual Characteristics of Cognitive Load.
      .- Cognitive Humor Processing in the Russian and English Internet Meme Chatting: EEG Study.
      .- Saudi Sign Language Translation Using T5.

      Recently viewed products

      © 2026 Book Curl

        • American Express
        • Apple Pay
        • Diners Club
        • Discover
        • Google Pay
        • Maestro
        • Mastercard
        • PayPal
        • Shop Pay
        • Union Pay
        • Visa

        Login

        Forgot your password?

        Don't have an account yet?
        Create account