Speech recognition Books

11 products


  • Biometrics

    Oxford University Press Biometrics

    1 in stock

    Book Synopsis
    We live in a society which is increasingly interconnected, in which communication between individuals is mostly mediated via some electronic platform, and transactions are often carried out remotely. In such a world, traditional notions of trust and confidence in the identity of those with whom we are interacting, taken for granted in the past, can be much less reliable. Biometrics - the scientific discipline of identifying individuals by means of the measurement of unique personal attributes - provides a reliable means of establishing or confirming an individual's identity. These attributes include facial appearance, fingerprints, iris patterning, the voice, the way we write, or even the way we walk. The new technologies of biometrics have a wide range of practical applications, from securing mobile phones and laptops to establishing identity in bank transactions, travel documents, and national identity cards.

    This Very Short Introduction considers the capabilities of biometrics-based identity checking, from first principles to the practicalities of using different types of identification data. Michael Fairhurst looks at the basic techniques in use today, ongoing developments in system design, and emerging technologies, all aimed at improving precision in identification, and providing solutions to an increasingly wide range of practical problems. Considering how they may continue to develop in the future, Fairhurst explores the benefits and limitations of these pervasive and powerful technologies, and how they can effectively support our increasingly interconnected society.

    ABOUT THE SERIES: The Very Short Introductions series from Oxford University Press contains hundreds of titles in almost every subject area. These pocket-sized books are the perfect way to get ahead in a new subject quickly. Our expert authors combine facts, analysis, perspective, new ideas, and enthusiasm to make interesting and challenging topics highly readable.

    Table of Contents
    1: Are you who you say you are?
    2: Biometrics: where should I start?
    3: Making biometrics work
    4: Enhancing biometric processing
    5: An introduction to predictive biometrics
    6: Where are we going?
    Further reading
    Index


    £9.49

  • Voice User Interface Design

    APress Voice User Interface Design

    2 in stock

    Book Synopsis
    Design and implement voice user interfaces. This guide to VUI helps you make decisions as you deal with the challenges of moving from a GUI world to mixed-modal interactions with GUI and VUI. The way we interact with devices is changing rapidly and this book gives you a close view across major companies via real-world applications and case studies.

    Voice User Interface Design provides an explanation of the principles of VUI design. The book covers the design phase, with clear explanations and demonstrations of each design principle through examples of multi-modal interactions (GUI plus VUI) and how they differ from pure VUI. The book also differentiates principles of VUI related to chat-based bot interaction models. By the end of the book you will have a vision of the future, imagining new user-oriented scenarios and new avenues, which until now were untouched.

    What You'll

    Table of Contents
    Chapter 1: Introduction – What is VUI
    Chapter Goal: Defines VUI, its history and present day technology
    No of pages: 10
    Sub-Topics:
    1. What is VUI?
    2. When did it all start?
    3. The journey
    4. Current day

    Chapter 2: Modern day VUI landscape
    Chapter Goal: Understanding how major players are going forward with their VUI and device strategy
    No of pages: 25
    Sub-Topics:
    1. Major players and their unique positions
    2. Direction towards world domination
    3. Device strategy of major players
    4. Integration of AR, VR, digital assistance

    Chapter 3: Principles of VUI
    Chapter Goal: Laying down the guiding principles to create a delightful VUI
    No of pages: 70 - 150
    Sub-Topics:
    5. Principles with associated examples. This might be broken down into 5 separate chapters.

    Chapter 4: Multi modal interaction
    Chapter Goal: Understanding an ecosystem where different modes of interaction take place across devices simultaneously
    No of pages: 50 - 70
    Sub-Topics:
    1. GUI model and examples
    2. GUI + VUI as a model with examples
    3. VUI only as a model with examples
    4. Challenges

    Chapter 5: Future Ahead
    Chapter Goal: Giving an idea of the plethora of use cases and scenarios where VUI will affect human lives
    1. An idea about future tech
    2. Imagining a future case with the reader
    3. Creating potential scenarios, solving for VUI


    £29.69

  • Designing Voice User Interfaces

    O'Reilly Media Designing Voice User Interfaces

    3 in stock

    Book Synopsis
    Whether you're designing a mobile app, a toy, or a device such as a home assistant, this practical book guides you through basic VUI design principles, helps you choose the right speech recognition engine, and shows you how to measure your VUI's performance and improve upon it.


    £23.99

  • Voice Applications for Alexa and Google Assistant

    Manning Publications Voice Applications for Alexa and Google Assistant

    10 in stock

    Book Synopsis
    To create their own voice "skills," users need to learn some new device toolkits, the basics of Voice UI design, and some emerging best practices for building and deploying on these diverse platforms. Voice Applications for Alexa and Google Assistant guides readers in the exciting world of designing, building, and implementing voice-based applications for Amazon Alexa or Google Assistant! They learn how to build their own "skills" (the voice app term for actions the device can perform) from scratch.

    Key Features
    · Designing a voice interaction model
    · Fulfilling skills via a serverless platform like AWS Lambda
    · Connecting a skill to a database

    Audience
    Written for JavaScript developers interested in building voice-enabled applications. No prior experience required!

    Author Bio
    Dustin A. Coates is a web developer and web development instructor. He has taught hundreds of students online and offline at General Assembly. Dustin also developed popular courses for OneMonth.com and the European non-profit Konexio, which teaches refugees how to code.


    £47.99

  • Telecommunications Relay Services to Assist

    Nova Science Publishers Inc Telecommunications Relay Services to Assist

    1 in stock

    Book Synopsis


    £67.99

  • Build Talking Apps for Alexa: Creating

    The Pragmatic Programmers Build Talking Apps for Alexa: Creating

    1 in stock

    Book Synopsis
    Voice recognition is here at last. Alexa and other voice assistants have now become widespread and mainstream. Is your app ready for voice interaction? Learn how to develop your own voice applications for Amazon Alexa. Start with techniques for building conversational user interfaces and dialog management. Integrate with existing applications and visual interfaces to complement voice-first applications. The future of human-computer interaction is voice, and we'll help you get ready for it.

    For decades, voice-enabled computers have only existed in the realm of science fiction. But now the Alexa Skills Kit (ASK) lets you develop your own voice-first applications. Leverage ASK to create engaging and natural user interfaces for your applications, enabling them to listen to users and talk back. You'll see how to use voice and sound as first-class components of user-interface design.

    We'll start with the essentials of building Alexa voice applications, called skills, including useful tools for creating, testing, and deploying your skills. From there, you can define parameters and dialogs that will prompt users for input in a natural, conversational style. Integrate your Alexa skills with Amazon services and other backend services to create a custom user experience. Discover how to tailor Alexa's voice and language to create more engaging responses and speak in the user's own language. Complement the voice-first experience with visual interfaces for users on screen-based devices. Add options for users to buy upgrades or other products from your application. Once all the pieces are in place, learn how to publish your Alexa skill for everyone to use. Create the future of user interfaces using the Alexa Skills Kit today.

    What You Need: You will need a computer capable of running the latest version of Node.js, a Git client, and internet access.


    £36.57

  • Speech Recognition Technology and Applications

    Nova Science Publishers Inc Speech Recognition Technology and Applications

    2 in stock

    Book Synopsis
    Speech represents the most natural means of communication between humans. By using Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) systems, machines also become able to interact with humans using speech. This is of particular importance for building interactive robots or speech-enabled chatbots. This book starts by exploring state-of-the-art ASR and TTS approaches, making use of artificial neural networks, relevant also to low-resource scenarios. Then, it explores the application of speech technology to specific domains, such as the medical domain, human-robot interaction, and even interlinking of speech and text resources using linguistic linked open data (LLOD) principles. The book also provides punctuation restoration techniques, enabling the production of high-quality text transcripts. Included algorithms have low latency and can be parallelized, thus enabling their use in interactive systems. Chapter authors are professors and scientific researchers with experience in building and using natural language processing algorithms and speech applications.


    £113.59

  • Natural Language Processing and Computational

    ISTE Ltd and John Wiley & Sons Inc Natural Language Processing and Computational

    Out of stock

    Book Synopsis
    Natural Language Processing (NLP) is a scientific discipline at the intersection of fields such as Artificial Intelligence, Linguistics, and Cognitive Psychology. This book presents, in four chapters, the state of the art and the fundamental concepts of key NLP areas. The first chapter presents the fundamental concepts of lexical semantics, lexical databases, knowledge representation paradigms, and ontologies. The second chapter is about combinatorial and formal semantics. Discourse and text representation, as well as automatic discourse segmentation and interpretation and anaphora resolution, are the subject of the third chapter. Finally, in the fourth chapter, I will cover some aspects of large-scale applications of NLP, such as software architectures and their relation to cognitive models of NLP, as well as the evaluation paradigms of NLP software. I will also present in this chapter the main NLP applications, such as Machine Translation (MT) and Information Retrieval (IR), as well as Big Data and Information Extraction tasks such as event extraction, sentiment analysis, and opinion mining.

    Table of Contents
    Introduction

    Chapter 1 The Sphere of Lexicons and Knowledge
      1.1 Lexical semantics
        1.1.1 Extension of lexical meaning
        1.1.2 Paradigmatic relations of meaning
        1.1.3 Theories of lexical meaning
      1.2 Lexical databases
        1.2.1 Standards for encoding and exchanging data
        1.2.2 Standard character encoding
        1.2.3 Content standards
        1.2.4 Writing systems
        1.2.5 A few lexical databases
      1.3 Knowledge representation and ontologies
        1.3.1 Knowledge representation
        1.3.2 Ontologies

    Chapter 2 The Sphere of Semantics
      2.1 Combinatorial semantics
        2.1.1 Interpretive semantics
        2.1.2 Generative semantics
        2.1.3 Case grammar
        2.1.4 Rastier’s interpretive semantics
        2.1.5 Meaning–text theory
      2.2 Formal semantics
        2.2.1 Propositional logic
        2.2.2 First-order logic
        2.2.3 Lambda calculus
        2.2.4 Other types of logic

    Chapter 3 The Sphere of Discourse and Text
      3.1 Discourse analysis and pragmatics
        3.1.1 Fundamental concepts
        3.1.2 Utterance production
        3.1.3 Context, cotext and intertextuality
        3.1.4 Information structure in discourse
        3.1.5 Coherence
        3.1.6 Cohesion
        3.1.7 Ellipses
        3.1.8 Textual sequences
        3.1.9 Speech acts
      3.2 Computational approaches to discourse
        3.2.1 Linear segmentation of discourse
        3.2.2 Rhetorical structure theory and automatic discourse analysis
        3.2.3 Discourse interpretation: DRT
        3.2.4 Processing anaphora

    Chapter 4 The Sphere of Applications
      4.1 Software engineering for NLP software
        4.1.1 Lifecycle of an NLP software
        4.1.2 Software architecture for NLP
        4.1.3 Serial architectures
        4.1.4 Data-centered architectures
        4.1.5 Object-oriented architectures
        4.1.6 Multi-agent architectures
        4.1.7 Syntactic–semantic cooperation: from cognitive models to software architecture
        4.1.8 Programming languages for NLP
        4.1.9 Evaluation of NLP systems
      4.2 Machine translation (MT)
        4.2.1 Why is translation difficult?
        4.2.2 History of MT systems
        4.2.3 Typology of MT systems
        4.2.4 The use of MT
        4.2.5 MT techniques
        4.2.6 Example of a translation system: Verbmobil
      4.3 Information retrieval (IR)
        4.3.1 IR and related domains
        4.3.2 Lexical information and IR
        4.3.3 Information retrieval approaches
      4.4 Big Data (BD) and information extraction
        4.4.1 Structured, semi-structured and unstructured data
        4.4.2 Architectures of BD processing systems
        4.4.3 Role of NLP in BD processing
        4.4.4 Information extraction

    Conclusion
    Bibliography
    Index


    £125.06

  • Speech and Computer: 25th International

    Springer International Publishing AG Speech and Computer: 25th International

    3 in stock

    Book Synopsis
    The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29–December 2, 2023.

    The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.

    Table of Contents

    Automatic Speech Recognition
      Extreme Learning Layer: A Boost for Spoken Digit Recognition with Spiking Neural Networks
      EMO-AVSR: Two-Level Approach for Audio-Visual Emotional Speech Recognition
      Significance of Audio Quality in Speech-to-Text Translation Systems
      Everyday Conversations: a Comparative Study of Expert Transcriptions and ASR Outputs at a Lexical Level
      Improving Automatic Speech Recognition with Dialect-Specific Language Models
      Emotional speech recognition of Holocaust survivors with deep neural network models for Russian language

    Computational Paralinguistics
      Aggregation Strategies of Wav2vec 2.0 Embeddings for Computational Paralinguistic Tasks
      Rhythm Formant Analysis for Automatic Depression Classification
      Determining Alcohol Intoxication Based on Speech and Neural Networks
      Linear Frequency Residual Cepstral Coefficients for Speech Emotion Recognition
      Enhancing Stutter Detection in Speech using Zero Time Windowing Cepstral Coefficients and Phase Information
      Source and System-based Modulation Approach for Fake Speech Detection

    Digital Signal Processing
      Investigation of Different Calibration Methods for Deep Speaker Embedding based Verification Systems
      Learning to Predict Speech Intelligibility from Speech Distortions
      Sparse Representation Frameworks for Acoustic Scene Classification
      Driver Speech Detection in Real Driving Scenario
      Regularization based Incremental Learning in TCNN for Robust Speech Enhancement Targeting Effective Human Machine Interaction
      Candidate Speech Extraction from Multi-Speaker Single-Channel Audio Interviews
      Post-Processing of Translated Speech by Pole Modification and Residual Enhancement to Improve Perceptual Quality
      Region Normalized Capsule Network based Generative Adversarial Network for Non-Parallel Voice Conversion
      Speech Enhancement using LinkNet Architecture
      ATT: Adversarial Trained Transformer for Speech Enhancement
      Human Identification by Dynamics of Changes in Brain Frequencies Using Artificial Neural Networks

    Speech Prosody
      Analysis of Formant Trajectories of a Speech Signal for the Purpose of Forensic Identification of a Foreign Speaker
      Gestures vs. Prosodic Structure in Laboratory Ironic Speech
      Sounds of <sil>ence: Acoustics of Inhalation in Read Speech
      Prolongations as Hesitation Phenomena in Spoken Speech in First and Second Language
      Study of Indian English Pronunciation Variabilities Relative to Received Pronunciation
      Multimodal Collaboration in Expository Discourse: Verbal and Nonverbal Moves Alignment
      Association of Time Domain Features with Oral Cavity Configuration during Vowel Production and its Application in Vowel Recognition
      Prosodic Interaction Models in a Conversation

    Natural Language Processing
      Development and Research of Dialogue Agents with Long-Term Memory and Web Search
      Pre- and Post-Textual Contexts in Assessment of a Message as Offensive or Defensive Aggression Verbalization
      Boosting Rule-based Grapheme-to-Phoneme Conversion with Morphological Segmentation and Syllabification in Bengali
      Revisiting Assessment of Text Complexity: Lexical and Syntactic Parameters Fluctuations
      Analysis of Natural Language Understanding Systems with L2 Learner Specific Synthetic Grammatical Errors based on Parts-of-Speech
      On the Most Frequent Sequences of Words in Russian Spoken Everyday Language (Bigrams and Trigrams): An Experience of Classification

    Child Speech Processing
      Recognition of the Emotional State of Children by Video and Audio Modalities by Indian and Russian Experts
      Effect of Linear Prediction Order to Modify Formant Locations for Children Speech Recognition
      Gammatone-Filterbank based Pitch-Normalized Cepstral Coefficients for Zero-Resource Children’s ASR
      System Assisted Vocal Response Analysis and Assessment of Autism in Children: A Machine Learning Based Approach
      Addressing Effects of Formant Dispersion and Pitch Sensitivity for the Development of Children’s KWS System
      Development of Children’s KWS System Perceptual Experiment and Automatic Recognition by Video, Audio and Text Modalities
      Linear Frequency Residual Features for Infant Cry Classification

    Speech Processing for Medicine
      Identification of Voice Disorders: A Comparative Study of Machine Learning Algorithms
      Transfer Learning using Whisper for Dysarthric Automatic Speech Recognition
      Significance of Duration Modification in Reducing Listening Effort of Slurred Speech from Patients with Traumatic Brain Injury
      Respiratory Sickness Detection from Audio Recordings using CLIP Models
      Investigating the Effect of Data Impurity on the Detection Performances of Mental Disorders through Spoken Dialogues


    £75.99

  • Chatbots gestalten mit Praxisbeispielen der

    Springer Fachmedien Wiesbaden Chatbots gestalten mit Praxisbeispielen der

    1 in stock

    Book Synopsis
    More and more companies are building chatbots so that their customers and employees can communicate with the company's systems in natural language. Chatbots make it possible to automate dialogue-intensive processes and to tap new sources of knowledge. This essential introduces the fundamentals of chatbots and uses practical examples from Swiss Post to show where they can be applied, what to watch for when designing them, and which new skills a company needs in order to deploy them.

    The authors:
    Toni Stucki has many years of experience as a software architect. At Swiss Post he leads a software development team with which he has already implemented several chatbots.
    Dr. Sara D’Onofrio worked as an innovation manager at Swiss Post and took part in the chatbot development at IT Post. Her dissertation dealt with the topic of chatbots.
    Prof. Dr. Edy Portmann is Swiss Post Professor of Computer Science at the Human-IST Institute of the University of Fribourg, Switzerland, and works on questions concerning information systems, information processing, and information procurement.

    Table of Contents
    Chatbot fundamentals
    Chatbots at Swiss Post
    Field reports


    £9.99

  • Computational Linguistics, Speech And Image

    World Scientific Publishing Co Pte Ltd Computational Linguistics, Speech And Image

    Out of stock

    Book Synopsis
    This book encompasses a collection of topics covering recent advances that are important to the Arabic language in areas of natural language processing, speech and image analysis. This book presents state-of-the-art reviews and fundamentals as well as applications and recent innovations.

    The book chapters by top researchers present basic concepts and challenges for the Arabic language in linguistic processing, handwritten recognition, document analysis, text classification and speech processing. In addition, it reports on selected applications in sentiment analysis, annotation, text summarization, speech and font analysis, word recognition and spotting and question answering.

    Moreover, it highlights and introduces some novel applications in vital areas for the Arabic language. The book is therefore a useful resource for young researchers who are interested in the Arabic language and are still developing their fundamentals and skills in this area. It is also interesting for scientists who wish to keep track of the most recent research directions and advances in this area.


    £85.50

© 2026 Book Curl
