Description

Book Synopsis
This book addresses several knowledge discovery problems on multi-sourced data where the theories, techniques, and methods in data cleaning, data mining, and natural language processing are synthetically used. This book mainly focuses on three data models: the multi-sourced isomorphic data, the multi-sourced heterogeneous data, and the text data. On the basis of three data models, this book studies the knowledge discovery problems including truth discovery and fact discovery on multi-sourced data from four important properties: relevance, inconsistency, sparseness, and heterogeneity, which is useful for specialists as well as graduate students. Data, even describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Facing the daunting scale of data, it is unrealistic to expect humans to “label” or tell which data source is more reliable. Hence, it is crucial to identify trustworthy information from multiple noisy information sources, referring to the task of knowledge discovery. At present, the knowledge discovery research for multi-sourced data mainly faces two challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and define the knowledge discovery problem on different occasions. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflicts and design efficient algorithms to mine more valuable information using multiple clues. Existing knowledge discovery methods have defects on both the structural level and the algorithm level, making the knowledge discovery problem far from totally solved.

Table of Contents

Chapter 1 Introduction

1.1 Knowledge Discovery

1.2 Main Challenges

1.3 Book Overview

Chapter 2 Functional-dependency-based truth discovery for isomorphic data

2.1 Handling independent constraints

2.2 Handling inter-related constraints

2.3 Inter-source data aggregation

2.4 Update source weights

Chapter 3 Denial-constraint-based truth discovery for isomorphic data

Describe the truth discovery strategies for isomorphic data based on denial constraints

4.1 Denial constraint transformation

4.2 Optimized solution

4.3 Scalable strategies

Chapter 4 Pattern discovery for heterogeneous data

4.1 Problem definition for multi-source heterogeneous data

4.2 Optimization framework

4.3 PatternFinder algorithm

4.4 The optimized grouping strategy

Chapter 5 Deep fact discovery for text data

5.1 Fact extraction via mining patterns

5.2 The CNN-LSTM architecture

5.3 The fact encoder and pattern embedding

5.4 Training and inference

Knowledge Discovery from Multi-Sourced Data

Product form

£40.49

Includes FREE delivery

RRP £44.99 – you save £4.50 (10%)

Order before 4pm today for delivery by Fri 23 Jan 2026.

A Paperback / softback by Chen Ye, Hongzhi Wang, Guojun Dai

3 in stock


    View other formats and editions of Knowledge Discovery from Multi-Sourced Data by Chen Ye

    Publisher: Springer Verlag, Singapore
    Publication Date: 15/06/2022
    ISBN13: 9789811918780, 978-9811918780
    ISBN10: 9811918783

    Description

    Book Synopsis
    This book addresses several knowledge discovery problems on multi-sourced data where the theories, techniques, and methods in data cleaning, data mining, and natural language processing are synthetically used. This book mainly focuses on three data models: the multi-sourced isomorphic data, the multi-sourced heterogeneous data, and the text data. On the basis of three data models, this book studies the knowledge discovery problems including truth discovery and fact discovery on multi-sourced data from four important properties: relevance, inconsistency, sparseness, and heterogeneity, which is useful for specialists as well as graduate students. Data, even describing the same object or event, can come from a variety of sources such as crowd workers and social media users. However, noisy pieces of data or information are unavoidable. Facing the daunting scale of data, it is unrealistic to expect humans to “label” or tell which data source is more reliable. Hence, it is crucial to identify trustworthy information from multiple noisy information sources, referring to the task of knowledge discovery. At present, the knowledge discovery research for multi-sourced data mainly faces two challenges. On the structural level, it is essential to consider the different characteristics of data composition and application scenarios and define the knowledge discovery problem on different occasions. On the algorithm level, the knowledge discovery task needs to consider different levels of information conflicts and design efficient algorithms to mine more valuable information using multiple clues. Existing knowledge discovery methods have defects on both the structural level and the algorithm level, making the knowledge discovery problem far from totally solved.

    Table of Contents

    Chapter 1 Introduction

    1.1 Knowledge Discovery

    1.2 Main Challenges

    1.3 Book Overview

    Chapter 2 Functional-dependency-based truth discovery for isomorphic data

    2.1 Handling independent constraints

    2.2 Handling inter-related constraints

    2.3 Inter-source data aggregation

    2.4 Update source weights

    Chapter 3 Denial-constraint-based truth discovery for isomorphic data

    Describe the truth discovery strategies for isomorphic data based on denial constraints

    4.1 Denial constraint transformation

    4.2 Optimized solution

    4.3 Scalable strategies

    Chapter 4 Pattern discovery for heterogeneous data

    4.1 Problem definition for multi-source heterogeneous data

    4.2 Optimization framework

    4.3 PatternFinder algorithm

    4.4 The optimized grouping strategy

    Chapter 5 Deep fact discovery for text data

    5.1 Fact extraction via mining patterns

    5.2 The CNN-LSTM architecture

    5.3 The fact encoder and pattern embedding

    5.4 Training and inference

    Recently viewed products

    © 2026 Book Curl

      • American Express
      • Apple Pay
      • Diners Club
      • Discover
      • Google Pay
      • Maestro
      • Mastercard
      • PayPal
      • Shop Pay
      • Union Pay
      • Visa

      Login

      Forgot your password?

      Don't have an account yet?
      Create account