Data warehousing Books
Amazon Digital Services LLC - Kdp Lakehouse at Home
£19.58
Amazon Digital Services LLC - Kdp Vector Database Systems Engineering
£14.65
Amazon Digital Services LLC - Kdp Spatial Data Mastery with DuckDB
£18.32
Independently Published Implantação do Windows Server 2025
£13.66
Independently Published Build MultiEngine Lakehouse Catalogs with Apache Polaris
£21.38
Independently Published State of the Art Data Center Configuration
£123.46
Amazon Digital Services LLC - Kdp Aprenda Airflow
£13.28
Amazon Digital Services LLC - Kdp Learn Apache Airflow
£11.94
Amazon Digital Services LLC - Kdp Aprenda Terraform
£11.95
Amazon Digital Services LLC - Kdp Mastering Google Drive for Businesses
£11.39
Amazon Digital Services LLC - Kdp Dp700
£999.99
Independently Published Arquiteturas Emergentes E Paradigmas Disruptivos
£14.84
Amazon Digital Services LLC - Kdp DNA Digital Data Storage
£19.94
Amazon Digital Services LLC - Kdp Análise de Metodologias Para Dw
£20.01
Amazon Digital Services LLC - Kdp Roteiro Prático Para Projetos de Dw BemSucedidos
£21.91
Amazon Digital Services LLC - Kdp O Que É Governança de Dados Data Governance
£9.49
Amazon Digital Services LLC - Kdp Enabling Directing and Structural Pillars
£14.84
Amazon Digital Services LLC - Kdp Conceptos Fundacionales Orientadores Y Estructurantes
£17.86
Amazon Digital Services LLC - Kdp Aprenda Pydantic V2
£13.89
Amazon Digital Services LLC - Kdp Cloud Data Warehouses Demystified
£16.90
Amazon Digital Services LLC - Kdp ISO 11179 Unlocked
£15.79
Amazon Digital Services LLC - Kdp SQLite for Python Developers
£28.41
Independently Published NextGen Search with Vector Databases
£28.21
Amazon Digital Services LLC - Kdp Trilha de Implantacao de Governanca de Dados Para Ia
£20.01
Amazon Digital Services LLC - Kdp A Professional QA Guide for Snowflake Certifications
£20.00
Amazon Digital Services LLC - Kdp BMS Sensors for Data Centers
£23.47
Amazon Digital Services LLC - Kdp DynamoDB and PostgreSQL for RESTful API Development
£12.51
Amazon Digital Services LLC - Kdp Data Warehousing
£30.59
Amazon Digital Services LLC - Kdp RealTime Data Processing with Kafka
£12.10
Independently Published Bitcoin the NEW GOLD
£11.43
Independently Published Power Backup Systems for Data Centers
£28.15
Amazon Digital Services LLC - Kdp Simplificando o Big Data em 7 capítulos
£24.30
Amazon Digital Services LLC - Kdp Gestão de Big Data
£24.30
Amazon Digital Services LLC - Kdp Renewable Energy for Data Centers
£26.22
Amazon Digital Services LLC - Kdp Data Centers and IT Infrastructure
£15.40
Independently Published Mastering AWS Neptune Analytics
£28.49
Independently Published DynamoDB and PostgreSQL for RESTful API Development
£999.99
Amazon Digital Services LLC - Kdp O Glossário Definitivo Da Inteligência Artificial
£56.80
Whittles Publishing 3D Recording, Documentation and Management of
Book SynopsisDocumentation of our cultural heritage is experiencing an explosion of innovation. New tools have appeared in recent decades including laser scanning, rapid prototyping, high dynamic range spherical and infrared imagery, drone photography, augmented and virtual reality and computer rendering in multiple dimensions. These give us visualisations and data that are at once interesting, intriguing and yet sometimes deceptive. This text provides an objective and integrated approach to the subject, bringing together the techniques of conservation with management, photographic methods, various modelling techniques and the use of unmanned aerial systems. This interdisciplinary approach addresses the need for knowledge about deploying advanced digital technologies and the materials and methods for the assessment, conservation, rehabilitation and maintenance of the sustainability of existing structures and designated historic buildings. Furthermore, this book actively provides the knowhow to facilitate the creation of heritage inventories, assessing risk, and addressing the need for sustainability.In so doing it becomes more feasible to mitigate the threats from inherent and external causes, not only for the built heritage but also for moveable objects and intangible heritage that suffer abandonment and negligence as well as looting and illegal trafficking. The book is written by a team of international experts based upon their practical experience and expertise. It therefore creates a unique book that encapsulates the knowledge of this discipline required by anyone working in this field.Trade Review`...this new publication is a welcome addition, highlighting how these 3D techniques can be utilised... ...this well-illustrated volume represents a useful contribution for scholars wishing to gain a better understanding of the underpinnings of 3D recording and documentation’. Medieval Archaeology -------------------- `...I found this book very valuable. It can reach an eclectic audience in providing a broad spectrum of the subject. This book is of major importance for Cultural Heritage 3D recording and management and...an important resource handbook’. International Institute for Conservation of Historic and Artistic Works -------------------- '...this new, richly illustrated reference publication on recording and documenting cultural heritage. ... For anyone considering a digital camera for survey purposes ... this chapter [4] is essential reading, and is rightfully one of the best references currently available on the science behind imaging. ...manages to provide what is probably the most up-to-date reference book on 3D recording, documentation and management of cultural heritage. For any heritage professional, academic, student or interested individual considering applying, acquiring, undertaking or researching digital imaging, photogrammetry, Structure-from-Motion, laser scanning, GIS, BIM or RPAS/UAV within a conservation context, this book should be essential reading before embarking down any one of these rapidly developing technological routes'. Conservation and Management of Archaeological Sites -------------------- '...the images in this book, both in colour and high-resolution, play a critical role along with the text. This is a well produced book that is wonderful to read and view. ...I find this book exceptional for its publishing quality, content and production. It clearly includes cutting-edge knowledge, awareness and experience from many contributors involved in cultural heritage processes around the globe...would be very useful to anyone involved in cultural heritage, documentation of history and site preservation and conservation. It can readily serve as a course text in addition to being a reference text. ... I've nothing but positive things to say about this book - I think you will too'. 3D Visualization WorldTable of ContentsIntroduction - current trends in cultural heritage and documentation; Conservation techniques in cultural heritage; Cultural heritage management tools: The role of GIS and BIM; Basics of photography for cultural heritage imaging; Basics of image-based modelling techniques in cultural heritage 3D recording; Basics of range-based modelling techniques in cultural heritage 3D recording; Cultural heritage documentation with RPAS/UAV
£80.75
Wiley-VCH Verlag GmbH Data-Warehouse-Systeme für Dummies
Book SynopsisJede Business-Intelligence-Anwendung beruht letzten Endes auf einem Data Warehouse. Data Warehousing ist deshalb ein sehr wichtiges Gebiet der Angewandten Informatik, insbesondere im Zeitalter von Big Data. Das vorliegende Buch beleuchtet das Data Warehouse aus zwei Perspektiven: der des Entwicklers und der des Anwenders. Der zukünftige Entwickler lernt, ein Data Warehouse mit geeigneten Methoden selbst zu entwickeln. Für den zukünftigen Anwender geht der Autor auf die Themen Reporting, Online Analytical Processing und Data Mining ein. Das Lehrbuch ist auch zum Selbststudium geeignet. Kenntnisse über Datenbanksysteme sollten allerdings vorhanden sein.Trade Review"Das didaktisch gut aufgebaute Buch auf dem aktuellen Stand der Technik endet mit 10 Übungsaufgaben (mit Lösungen) und ist auch zum Selbststudium gut geeignet." (EKZ im Dezember 2018)Table of ContentsEinleitung 19 Über dieses Buch 19 Konventionen in diesem Buch 20 Was Sie nicht lesen müssen 20 Törichte Annahmen über den Leser 21 Wie dieses Buch aufgebaut ist 21 Teil I: Was ist ein Data Warehouse? 21 Teil II: Architektur eines Data-Warehouse-Systems 21 Teil III: Anwendungsbereiche für ein Data Warehouse 22 Teil IV: Modellierung eines Data-Warehouse-Systems 22 Teil V: Zugriff auf ein Data Warehouse 22 Teil VI: Speicherung und Optimierung auf Datenbankebene 22 Teil VII: Der Top-10-Teil 22 Symbole, die in diesem Buch verwendet werden23 Wie es weitergeht 23 TEIL I WAS IST EIN DATA WAREHOUSE? 25 Kapitel 1 Ein Beispiel zur Einführung 27 Daten und ihre Verarbeitung 27 Daten und Datenbanken 27 Die Verarbeitung von Daten 28 Analyse von Absatzmengen und Planzahlen als Beispiel 29 Besonderheiten analytischer Aufgabenstellungen 31 Wenn personenbezogene Daten ins Spiel kommen 34 Kapitel 2 Das Data Warehouse im Umfeld der betrieblichen Informationssysteme 35 Hierarchie betrieblicher Informationssysteme 35 Zusammenfassung: Analytische Informationssysteme 38 Beispiele für analytische Informationssysteme 39 Beispiel 1: Analytische Informationssysteme im CRM 39 Beispiel 2: Kennzahlen-Analysesysteme im Rechnungswesen 41 Beispiel 3: Website-Analysesysteme 43 Fazit: Data Warehouse und analytische Informationssysteme 45 Kapitel 3 Definition und Abgrenzung des Begriffs »Data Warehouse« 47 Die 3-Schichten-Architektur analytischer Informationssysteme 47 Definitionen des Begriffs Data Warehouse 50 Definition von Inmon 50 Definition von Kimball 52 Vergleich der beiden Definitionen 53 Anwendungsfall: Das Data Warehouse und Business Intelligence 54 TEIL II ARCHITEKTUR EINES DATA-WAREHOUSE-SYSTEMS 57 Kapitel 4 Überblick über die Architektur eines Data-Warehouse-Systems 59 Die Phasen des Data Warehousing 59 Ein allgemeines Data-Warehouse-Architekturmodell 61 Vorgehensweisen bei der Erstellung eines Data Warehouse 64 Projektdefinition und Machbarkeitsstudie 65 Analyse, Entwurf und Einführung für einen Anwendungsbereich 66 Kapitel 5 Der ETL-Prozess 69 Überblick 69 Ein einführendes Beispiel 70 Extraktion 71 Das Pull-Prinzip 71 Das Push-Prinzip 72 Beispiele 72 Transformation 77 Datenbestandsanalyse 77 Datenbereinigung 78 Datenintegration 80 Laden 82 Kapitel 6 Die Basisdatenbank 85 Merkmale der Basisdatenbank 85 Unterschied zwischen operativen Datenbanken und der Basisdatenbank 87 Die operativen Quellsysteme des Beispiels 88 Die Basisdatenbank des Beispiels 89 Kapitel 7 Das Analyse-Subsystem 93 Dimensionen und Fakten 93 Dimension oder Metrik? 95 Metriken als Dimension 96 Dimensionen als Metrik 97 Klassifizierung von Dimensionen 98 Fachliche Dimensionen 98 Kategorische Dimensionen 98 Strukturelle Dimensionen 99 Hierarchien von Dimensionswerten 99 Parallele Hierarchien 100 Unausgeglichene Hierarchiebäume 101 Strukturänderungen in Hierarchien 102 Slowly Changing Dimensions 102 Typ 1: Überschreiben 103 Typ 2: Neue Zeile 104 Typ 3: Spalten mit altem und neuem Wert 105 Typ 4: Mini-Dimension 105 Zusammenfassung 106 Verknüpfung von Dimensionen über Metriken 106 Aggregationstypen von Fakten 107 Die Themen Datenqualität und Datenschutz 108 Datenqualität 108 Datenschutz 109 Architekturvarianten für ein Analyse-Subsystem 109 Möglichkeiten für die Architektur 110 Die Hub-and-Spoke-Architektur 111 Auswertungen und Analysen 112 Kapitel 8 Metadaten 113 Was sind Metadaten?113 Metadaten im Data-Warehouse-Kontext 114 Das Metadaten-Management in einem Data-Warehouse-System 114 Standards für Data-Warehouse-Metadaten 118 Ein kleines Beispiel 119 TEIL III ANWENDUNGSBEREICHE FÜR EIN DATA WAREHOUSE 121 Kapitel 9 Reporting 123 Das Berichtswesen eines Unternehmens 123 Überblick und Definition 123 Erzeugung und Verteilung von Reports 125 Arten von Berichtssystemen 125 Was sich Anwender vom Reporting wünschen und wie die Wirklichkeit oft aussieht 126 Einige Tipps für die Report-Gestaltung 127 Graphische Darstellungen im Report 128 Die Hichert-Success-Regeln 131 Grundformen für Reports 132 Ist-Ist-Vergleiche 132 Plan-Ist-Vergleiche 133 Plan-Wird-Vergleiche 134 Berücksichtigung dynamischer Dimensionsstrukturen 135 Report as-is 136 Report as-of 136 Report as-posted 137 Ein praktisches Beispiel 137 Kapitel 10 Online Analytical Processing 139 Motivation und Definition 139 Charakteristika von OLAP 141 Abgrenzung OLAP und OLTP 141 Die Coddschen Regeln 142 FASMI 143 Spezielle OLAP-Operatoren 144 Pivotierung bzwRotation 144 Roll-up und Drill-down 145 Slice und Dice 146 Beispiel 148 Kapitel 11 Data Mining151 Einführung 151 CRISP-DM 153 Methoden und Verfahren beim Data Mining 154 Assoziationsanalyse 155 Clusteranalyse 160 Klassifikation mit der Diskriminanzanalyse 164 Entscheidungsbaumverfahren 166 Spezielle Data-Mining-Fragestellungen im Kontext von Data-Warehouse-Daten 171 Welche Artikel werden gemeinsam gekauft? 172 Unterscheiden sich gute, normale und schlechte Kunden? 172 Welche Kunden besitzen eine bestimmte Produktaffinität? 173 Praxisbeispiel »Predictive Analytics« 174 Kollaboratives Filtern 175 TEIL IV MODELLIERUNG EINES DATA-WAREHOUSE-SYSTEMS 177 Kapitel 12 Data Vault 179 Einführung 179 Hubs, Satelliten und Links 180 Hubs 180 Links 182 Satelliten183 Beispiel 185 Kapitel 13 Semantischer Entwurf eines Data Warehouse 191 Zur Wiederholung: das Entity-Relationship-Modell 191 Drei Schritte bei der Modellierung einer Datenbank 192 Das ER-Modell: Entitätstypen, Attribute und Beziehungen 192 Das multidimensionale ER-Modell 194 ADAPT 196 Kapitel 14 Relationale Modellierung der Datenwürfel 199 Einführung 199 Das Star-Schema 200 Beispiel 201 Besondere Merkmale des Star-Schemas 204 Das Snowflake-Schema 207 Vergleich von Star- und Snowflake-Schema 209 Das Galaxy-Schema 211 TEIL V ZUGRIFF AUF EIN DATA WAREHOUSE 213 Kapitel 15 Multidimensionale Abfragen mit SQL 215 Zugriff auf ein Data Warehouse mit SQL 215 Erzeugen der Tabellen 216 Typische analytische Fragestellungen 218 OLAP-Erweiterungen von SQL 220 Die WINDOW-Klausel 220 Erweiterungen der GROUP-BY-Option 225 Statistische Funktionen 228 Kapitel 16 Die Abfragesprache MDX 229 Einführung 229 Spezielle OLAP-Operatoren und Funktionen 233 Tupel und Sets 233 Member und Children 234 Kreuzprodukt mittels Crossjoin 234 Der WITH-Operator 235 Häufige Fragestellungen 236 Kapitel 17 Zusammenspiel von MDX und SQL 239 OLAP-Server 239 Der OLAP-Server Mondrian 241 MDX-Schema von Mondrian 241 Mondrian-Frontend-Tools 245 TEIL VI SPEICHERUNG UND OPTIMIERUNG AUF DATENBANKEBENE 247 Kapitel 18 ROLAP, MOLAP und anderes 249 ROLAP und MOLAP 249 Spaltenorientierte und In-Memory-Speicherung 252 NoSQL-Datenbanksysteme 255 Typen von NoSQL-Systemen 255 NoSQL-Datenbanken bei einem Data Warehouse 258 Beurteilung 263 Kapitel 19 Optimierungsmöglichkeiten bei relationalen Datenbanken 265 Einführung 265 Partitionierung 266 Partition by List 267 Partition by Range 268 Partition by Hash 268 Partition by Reference 269 Materialized Views 270 Klassische Views vsMaterialized Views 270 Materialized Views bei einem Data Warehouse 273 Indizierung 274 Klassischer Index 274 Bitmap-Index 275 Mehrdimensionale Indizes 276 TEIL VII DER TOP-10-TEIL 279 Kapitel 20 10 Schritte auf dem Weg zu Ihrem ersten Dashboard 281 Und so wird es gemacht 282 Festlegung der Datenquellen 282 Vorbereitung der Daten 283 Erstellung eines Dashboards 285 Daten aus mehreren Quellen 287 Integration von Landkarten 288 Kapitel 21 10 Schritte, die helfen, die richtige Data-Warehouse-Software zu finden 291 Marktanalyse für BI-Software 291 Definition der eigenen Anforderungen 292 Einbindung des Managements, Projektplan 293 Marktanalyse der infrage kommenden BI-Anbieter 293 Einholung von Angeboten 293 Durchführung von Testinstallationen 294 Bewertung der Systeme 294 Ermittlung der Kosten 295 Einholung von Referenzen, Anbieterqualifikation 296 Überprüfung der Lizenzvereinbarung 296 Kapitel 22 10 Übungsaufgaben zur Wiederholung 297 Aufgaben 297 Aufgabe 1: Assoziationsanalyse 297 Aufgabe 2: Diskriminanzanalyse 297 Aufgabe 3: Data Vault 298 Aufgabe 4: ADAPT 298 Aufgabe 5: MDX299 Aufgabe 6: Star-Schema 299 Aufgabe 7: OLAP mit SQL 299 Aufgabe 8: Snowflake-Schema 300 Aufgabe 9: Optimierung 300 Aufgabe 10: Multidimensionale Datenbank301 Lösungen 301 Lösung von Aufgabe 1 301 Lösung von Aufgabe 2 303 Lösung von Aufgabe 3 303 Lösung von Aufgabe 4 304 Lösung von Aufgabe 5 304 Lösung von Aufgabe 6 306 Lösung von Aufgabe 7 307 Lösung von Aufgabe 8 309 Lösung von Aufgabe 9 309 Lösung von Aufgabe 10 311 Literaturverzeichnis 313 Stichwortverzeichnis 317
£999.99
Springer-Verlag Berlin and Heidelberg GmbH & Co. KG E-Librarian Service: User-Friendly Semantic
Book SynopsisThis book introduces a new approach to designing E-Librarian Services. With the help of this system, users will be able to retrieve multimedia resources from digital libraries more efficiently than they would by browsing through an index or by using a simple keyword search. E-Librarian Services combine recent advances in multimedia information retrieval with aspects of human-machine interfaces, such as the ability to ask questions in natural language; they simulate a human librarian by finding and delivering the most relevant documents that offer users potential answers to their queries. The premise is that more pertinent results can be retrieved if the search engine understands the meaning of the query; the returned results are therefore logical consequences of an inference rather than of keyword matches. Moreover, E-Librarian Services always provide users with a solution, even in situations where they are unable to offer a comprehensive answer.Trade ReviewFrom the reviews:“The subtitle gives a much better idea of what this book is really about. … it offers a demonstration of their applicability within a narrow computer science framework. … the primary audience is computer science researchers. … It is interesting as an illustration of where information retrieval is heading, an explanation of the relationship between the semantic web and natural language processing, and a glimpse of the potential power of these new ways of representing knowledge.” (Toby Burrows, Australian Library Journal, Vol. 61 (2), May, 2012)Table of ContentsPart I: Information Retrieval in Digital Libraries.- Introduction to Digital Libraries.- Search Engines.- Part II: Key Technologies of E-Librarian Services.- Semantic Web and Ontologies.- Description Logics and Reasoning.- Natural Language Processing.- Multimedia Information Retrieval.- Part III: Design and Utilization of E-Librarian Services.- Ontological Approach.- Design of the Natural Language Processing Module.- Designing the Multimedia Information Retrieval Module.- Implementation, Configuration, and Deployment.- Best Practices.- Part IV: Appendix.- A - XML SChema Primitive Datatypes.- B - Reasoning Algorithms.- C - Syntactic Difference.- D - Brown Tag Set.- E - Part-of-Speech-Taggers and Parsers.- References.
£40.49
Springer-Verlag Berlin and Heidelberg GmbH & Co. KG Enabling Real-Time Business Intelligence: 5th International Workshop, BIRTE 2011, Held at the 37th International Conference on Very Large Databases, VLDB 2011, Seattle, WA, USA, September 2, 2011, Revised Selected Papers
Book SynopsisThis book constitutes the thoroughly refereed conference proceedings of the 5th International Workshop on Business Intelligence for the Real-Time Enterprise, BIRTE 2011, held in Seattle, WA, USA, in September 2011, in conjunction with VLDB 2011, the International Conference on Very Large Data Bases. The series of BIRTE workshops aims to provide a forum for researchers to discuss and advance the foundational science and engineering required to enable real-time business intelligence as well as novel applications and solutions based on these foundational techniques.The volume contains 6 research papers, which have been carefully reviewed and selected from 12 submissions, plus the 3 keynotes presented at the workshop. The topics cover all stages of the business intelligence cycle, including capturing of real-time data, handling of temporal or uncertain data, performance issues, event management, and the optimization of complex ETL workflows.The volume contains 6 research papers, which have been carefully reviewed and selected from 12 submissions, plus the 3 keynotes presented at the workshop. The topics cover all stages of the business intelligence cycle, including capturing of real-time data, handling of temporal or uncertain data, performance issues, event management, and the optimization of complex ETL workflows.Table of ContentsBlink: Not Your Father’s Database!.- MemcacheSQL – A Scale-Out SQL Cache Engine.- A Cost-Aware Strategy for Merging Differential Stores in Column-Oriented In-Memory DBMS.- Microsoft SQL Server Parallel Data Warehouse: Architecture Overview.- Relax and Let the Database Do the Partitioning Online.- Adaptive Processing of Multi-Criteria Decision Support Queries.- Scalable Social Graph Analytics Using the Vertical Analytic Platform.- A Near Real-Time Personalization for eCommerce Platform.
£37.99
Springer-Verlag Berlin and Heidelberg GmbH & Co. KG On the Move to Meaningful Internet Systems: OTM 2013 Workshops: Confederated International Workshops: OTM Academy, OTM Industry Case Studies Program, ACM, EI2N, ISDE, META4eS, ORM, SeDeS, SINCOM, SMS and SOMOCO 2013, Graz, Austria, Septembe
Book SynopsisThis volume constitutes the refereed proceedings of the international workshops, Confederated International Workshops: OTM Academy, OTM Industry Case Studies Program, ACM, EI2N, ISDE, META4eS, ORM, SeDeS, SINCOM, SMS and SOMOCO 2013, held as part of OTM 2013 in Graz, Austria, in September 2013. The 75 revised full papers presented together with 12 posters and 5 keynotes were carefully reviewed and selected from a total of 131 submissions. The papers are organized in topical sections on: On The Move Academy; Industry Case Studies Program; Adaptive Case Management and other non-workflow approaches to BPM; Enterprise Integration, Interoperability and Networking; Information Systems in Distributed Environment; Methods, Evaluation, Tools and Applications for the Creation and Consumption of Structured Data for the e-Society; Fact-Oriented Modeling; Semantics and Decision Making; Social Media Semantics; Social and Mobile Computing for collaborative environments; cooperative information systems; Ontologies, Data Bases and Applications of Semantics.Table of ContentsOn The Move Academy.- Industry Case Studies Program.- Adaptive Case Management and other non-workflow approaches to BPM.- Enterprise Integration, Interoperability and Networking.- Information Systems in Distributed Environment.- Methods, Evaluation, Tools and Applications for the Creation and Consumption of Structured Data for the e-Society.- Fact-Oriented Modeling.-Semantics and Decision Making.- Social Media Semantics.- Social and Mobile Computing for collaborative environments.- Cooperative information systems.- Ontologies, Data Bases and Applications of Semantics.
£42.74
John Wiley & Sons Inc The Microsoft Data Warehouse Toolkit With SQL
Book SynopsisThe techniques pioneered by the Kimball Group have become the industry standard for data warehouse design, development, and management. In this new edition of the Microsoft Data Warehouse Toolkit, the authors share best practices for using these techniques in SQL Server 2008 R2 and Office 2010.Table of ContentsForeword xxvii Introduction xxix Part 1 Requirements, Realities, and Architecture 1 Chapter 1 Defining Business Requirements 3 The Most Important Determinant of Long-Term Success 5 Adventure Works Cycles Introduction 6 Uncovering Business Value 6 Obtaining Sponsorship 7 Defining Enterprise-Level Business Requirements 8 Prioritizing the Business Requirements 22 Revisiting the Project Planning 25 Gathering Project-Level Requirements 26 Summary 28 Chapter 2 Designing the Business Process Dimensional Model 29 Dimensional Modeling Concepts and Terminology 30 Facts 31 Dimensions 33 Bringing Facts and Dimensions Together 34 The Bus Matrix, Conformed Dimensions, and Drill Across 36 Additional Design Concepts and Techniques 38 Surrogate Keys 38 Slowly Changing Dimensions 39 Dates 42 Degenerate Dimensions 43 Snowflaking 43 Many-to-Many or Multivalued Dimensions 44 Hierarchies 47 Aggregate Dimensions 49 Junk Dimensions 51 The Three Fact Table Types 52 Aggregates 53 The Dimensional Modeling Process 54 Preparation 55 Data Profiling and Research 60 Building Dimensional Models 63 Developing the Detailed Dimensional Model 66 Testing and Refining the Model 68 Reviewing and Validating the Model 68 Case Study: The Adventure Works Cycles Orders Dimensional Model 69 The Orders Fact Table 69 The Dimensions 69 Identifying Dimension Attributes and Facts for the Orders Business Process 72 The Final Draft of the Initial Orders Model 74 Detailed Orders Dimensional Model Development 75 Final Dimensional Model 77 Summary 77 Chapter 3 The Toolset 79 The Microsoft DW/BI Toolset 80 Why Use the Microsoft Toolset? 82 Architecture of a Microsoft DW/BI System 83 Why Analysis Services? 84 Why a Relational Store? 86 ETL Is Not Optional 86 The Role of Master Data Services 88 Delivering BI Applications 88 Overview of the Microsoft Tools 89 Which Products Do You Need? 90 SQL Server Development and Management Tools 92 Summary 97 Chapter 4 System Setup 99 System Sizing Considerations 100 Calculating Data Volumes 101 Determining Usage Complexity 102 Estimating Simultaneous Users 104 Assessing System Availability Requirements 105 How Big Will It Be? 105 System Configuration Considerations 105 Memory 106 Monolithic or Distributed? 106 Storage System Considerations 110 Processors 113 Setting Up for High Availability 114 Software Installation and Configuration 115 Development Environment Software Requirements 116 Test and Production Software Requirements 120 Operating Systems 122 SQL Server Relational Database Setup 122 Analysis Services Setup 126 Integration Services Setup 129 Reporting Services Setup 130 Summary 131 Part 2 Building and Populating the Databases 133 Chapter 5 Creating the Relational Data Warehouse 135 Getting Started 136 Complete the Physical Design 137 Surrogate Keys 138 String Columns 138 To Null, or Not to Null? 140 Housekeeping Columns 140 Table and Column Extended Properties 142 Define Storage and Create Constraints and Supporting Objects 142 Create Files and Filegroups 142 Data Compression 144 Entity and Referential Integrity Constraints 145 Initial Indexing and Database Statistics 147 Aggregate Tables 150 Create Table Views 151 Insert an Unknown Member Row 152 Example CREATE TABLE Statement 152 Partitioned Tables 153 Finishing Up 163 Staging Tables 163 Metadata Setup 163 Summary 164 Chapter 6 Master Data Management 165 Managing Master Reference Data 166 Incomplete Attributes 167 Data Integration 168 Systems Integration 170 Master Data Management Systems and the Data Warehouse 171 Introducing SQL Server Master Data Services 171 Model Definition Features 172 Data Management Features 174 User Interface: Exploring and Managing the Master Data 174 Importing and Updating Data 176 Exporting Data 177 Full Versioning of All Attributes 179 Creating a Simple Application 179 Summary 186 Chapter 7 Designing and Developing the ETL System 187 Round Up the Requirements 188 Develop the ETL Plan 191 Introducing SQL Server Integration Services 192 Control Flow and Data Flow 194 SSIS Package Architecture 197 The Major Subsystems of ETL 198 Extracting Data 199 Subsystem 1: Data Profiling 199 Subsystem 2: Change Data Capture System 200 Subsystem 3: Extract System 202 Cleaning and Conforming Data 206 Subsystem 4: Data Cleaning System 206 Subsystem 5: Error Event Schema 214 Subsystem 6: Audit Dimension Assembler 215 Subsystem 7: Deduplication System 216 Subsystem 8: Conforming System 217 Delivering Data for Presentation 218 Subsystem 9: Slowly Changing Dimension Manager 218 Subsystem 10: Surrogate Key Generator 223 Subsystem 11: Hierarchy Manager 223 Subsystem 12: Special Dimensions Manager 224 Subsystem 13: Fact Table Builders 225 Subsystem 14: Surrogate Key Pipeline 229 Subsystem 15: Multi-Valued Dimension Bridge Table Builder 235 Subsystem 16: Late Arriving Data Handler 235 Subsystem 17: Dimension Manager 238 Subsystem 18: Fact Provider System 238 Subsystem 19: Aggregate Builder 239 Subsystem 20: OLAP Cube Builder 239 Subsystem 21: Data Propagation Manager 240 Managing the ETL Environment 240 Summary 243 Chapter 8 The Core Analysis Services OLAP Database 245 Overview of Analysis Services OLAP 247 Why Use Analysis Services? 247 Why Not Analysis Services? 249 Designing the OLAP Structure 250 Planning 251 Getting Started 253 Create a Project and a Data Source View 255 Dimension Designs 257 Creating and Editing Dimensions 261 Creating and Editing the Cube 274 Physical Design Considerations 291 Understanding Storage Modes 293 Developing the Partitioning Plan 294 Designing Performance Aggregations 296 Planning for Deployment 298 Processing the Full Cube 299 Developing the Incremental Processing Plan 299 Summary 304 Chapter 9 Design Requirements for Real-Time BI 305 Real-Time Triage 306 What Does Real-Time Mean? 306 Who Needs Real Time? 307 Real-Time Tradeoffs 308 Scenarios and Solutions 311 Executing Reports in Real Time 313 Serving Reports from a Cache 313 Creating an ODS with Mirrors and Snapshots 314 Creating an ODS with Replication 314 Building a BizTalk Application 315 Building a Real-Time Relational Partition 315 Querying Real-Time Data in the Relational Database 317 Using Analysis Services to Query Real-Time Data 318 Summary 319 Part 3 Developing the BI Applications 321 Chapter 10 Building BI Applications in Reporting Services 323 A Brief Overview of BI Applications 324 Types of BI Applications 325 The Value of Business Intelligence Applications 326 A High-Level Architecture for Reporting 328 Reviewing Business Requirements for Reporting 328 Examining the Reporting Services Architecture 330 Using Reporting Services as a Standard Reporting Tool 332 Reporting Services Assessment 339 The Reporting System Design and Development Process 340 Reporting System Design 341 Reporting System Development 348 Building and Delivering Reports 351 Planning and Preparation 351 Creating Reports 354 Reporting Operations 368 Ad Hoc Reporting Options 369 The Report Model 370 Shared Datasets 371 Report Parts 371 Summary 372 Chapter 11 PowerPivot and Excel 375 Using Excel for Analysis and Reporting 376 The PowerPivot Architecture: Excel on Steroids 378 Creating and Using PowerPivot Databases 380 Getting Started 381 PowerPivot Table Design 381 Creating Analytics with PowerPivot 385 Observations and Guidelines on PowerPivot for Excel 392 PowerPivot for SharePoint 394 The PowerPivot SharePoint User Experience 394 Server-Level Resources 397 PowerPivot Monitoring and Management 397 PowerPivot’s Role in a Managed DW/BI Environment 400 Summary 401 Chapter 12 The BI Portal and SharePoint 403 The BI Portal 404 Planning the BI Portal 405 Impact on Design 406 Business Process Categories 407 Additional Functions 408 Building the BI Portal 409 Using SharePoint as the BI Portal 411 Architecture and Concepts 412 Setting Up SharePoint 417 Summary 426 Chapter 13 Incorporating Data Mining 429 Defining Data Mining 430 Basic Data Mining Terminology 432 Business Uses of Data Mining 433 Roles and Responsibilities 440 SQL Server Data Mining Architecture Overview 440 The Data Mining Design Environment 442 Build, Deploy, and Process 442 Accessing the Mining Models 443 Integration Services and Data Mining 443 Additional Features 444 Architecture Summary 445 Microsoft Data Mining Algorithms 445 Decision Trees 446 Naïve Bayes 447 Clustering 448 Sequence Clustering 448 Time Series 449 Association 449 Neural Network 449 The Data Mining Process 450 The Business Phase 451 The Data Mining Phase 453 The Operations Phase 460 Metadata 462 Data Mining Examples 463 Case Study: Categorizing Cities 463 Case Study: Product Recommendations 472 Summary 488 Part 4 Deploying and Managing the DW/BI System 491 Chapter 14 Designing and Implementing Security 493 Identifying the Security Manager 494 Securing the Hardware and Operating System 495 Securing the Operating System 495 Using Windows Integrated Security 496 Securing the Development Environment 497 Securing the Data 498 Providing Open Access for Internal Users 498 Itemizing Sensitive Data 500 Securing Various Types of Data Access 500 Securing the Components of the DW/BI System 502 Reporting Services Security 502 Analysis Services Security 505 Relational DW Security 514 Integration Services Security 520 Usage Monitoring 521 Summary 521 Chapter 15 Metadata Plan 523 Metadata Basics 524 The Purpose of Metadata 524 Metadata Categories 525 The Metadata Repository 526 Metadata Standards 526 SQL Server 2008 R2 Metadata 527 Cross-Tool Components 528 Relational Engine Metadata 532 Analysis Services 532 Integration Services 533 Reporting Services 533 Master Data Services 534 SharePoint 534 External Metadata Sources 534 Looking to the Future 535 A Practical Metadata Approach 535 Creating the Metadata Strategy 536 Business Metadata Reporting 538 Process Metadata Reporting 541 Technical Metadata Reporting 542 Ongoing Metadata Management 543 Summary 543 Chapter 16 Deployment 545 Setting Up the Environments 546 Testing 550 Development Testing 551 System Testing 555 Data Quality Assurance Testing 557 Performance Testing 559 Usability Testing 562 Testing Summary 563 Deploying to Production 564 Relational Database Deployment 565 Integration Services Package Deployment 567 Analysis Services Database Deployment 568 Reporting Services Report Deployment 571 Master Data Services Deployment 572 Data Warehouse and BI Documentation 573 Core Descriptions 573 Additional Documentation 575 User Training 576 User Support 579 Desktop Readiness and Configuration 580 Summary 581 Chapter 17 Operations and Maintenance 583 Providing User Support 584 Maintaining the BI Portal 585 Extending the BI Applications 586 System Management 587 Governing the DW/BI System 588 Performance Monitoring 593 Usage Monitoring 600 Managing Disk Space 602 Service and Availability Management 603 Performance Tuning the DW/BI System 604 Backup and Recovery 606 Executing the ETL Packages 611 Summary 611 Chapter 18 Present Imperatives and Future Outlook 613 Growing the DW/BI System 613 Lifecycle Review with Common Problems 615 Phase I — Requirements, Realities, Plans, and Designs 616 Phase II — Developing the Databases 616 Phase III — Developing the BI Applications and Portal Environment 617 Phase IV — Deploying and Managing the DW/BI System 618 Iteration and Growth 618 What We Like in the Microsoft BI Toolset 619 Future Directions: Room for Improvement 620 Conclusion 623 Index 625
£999.99
Princeton University Press Dark Data
Book SynopsisTrade Review"[A] penetrating study of missing (‘dark’) data and its impacts on decisions—skewing stats, enabling fraud, embedding inequity and triggering preventable catastrophes. Advocating ‘data science judo,’ Hand offers expert training, from recognizing when facts are being cherry-picked to designing randomized trials. A book illuminating shadowed corners in science, medicine and policy."---Barbara Kiser, Nature"A tour de force. . . . Hand is a good and able guide to take us through the many aspects of dark data that are potentially skewing our understanding of real world observations and potential scientific breakthroughs. He writes in an accessible and understandable way too."---Simon Cocking, Irish Tech News"Well-written and accessible."---Tim Harford, Undercover Economist"You need to read [Dark Data], and be convinced by David’s reasoning and his examples of cases in which unseen or unreported data play a critical and sometimes even a fatal role. You are likely to walk away with the feeling that the term dark data is indeed a very effective one to arouse both curiosity and suspicion, mixed with happiness that finally a great term was coined by a statistician—and sadness that the statistician is not you."---Xiao-Li Meng, IMS Bulletin"An exploration of a major problem in data analysis with an attempt of classification, analysing causes, mechanisms, and to some extent also suggest mitigations."---Adhemar Bultheel, European Mathematical Society"An excellent guide to the many reasons for caution in interpreting data."---Diane Coyle, Enlightened Economist
£22.50
Princeton University Press Dark Data
Book SynopsisTrade Review"[A] penetrating study of missing (‘dark’) data and its impacts on decisions—skewing stats, enabling fraud, embedding inequity and triggering preventable catastrophes. Advocating ‘data science judo,’ Hand offers expert training, from recognizing when facts are being cherry-picked to designing randomized trials. A book illuminating shadowed corners in science, medicine and policy."---Barbara Kiser, Nature"A tour de force. . . . Hand is a good and able guide to take us through the many aspects of dark data that are potentially skewing our understanding of real world observations and potential scientific breakthroughs. He writes in an accessible and understandable way too."---Simon Cocking, Irish Tech News"Well-written and accessible."---Tim Harford, Undercover Economist"You need to read [Dark Data], and be convinced by David’s reasoning and his examples of cases in which unseen or unreported data play a critical and sometimes even a fatal role. You are likely to walk away with the feeling that the term dark data is indeed a very effective one to arouse both curiosity and suspicion, mixed with happiness that finally a great term was coined by a statistician—and sadness that the statistician is not you."---Xiao-Li Meng, IMS Bulletin"An exploration of a major problem in data analysis with an attempt of classification, analysing causes, mechanisms, and to some extent also suggest mitigations."---Adhemar Bultheel, European Mathematical Society"An excellent guide to the many reasons for caution in interpreting data."---Diane Coyle, Enlightened Economist
£15.29
O'Reilly Media Amazon Redshift The Definitive Guide
Book SynopsisThis practical guide thoroughly examines this managed service and demonstrates how you can use it to extract value from your data immediately, rather than go through the heavy lifting required to run a typical data warehouse.
£47.99
O'Reilly Media How To Make Things Faster
Book SynopsisThis book explains in a clear and thoughtful voice why systems perform the way they do. It's for anybody who's curious about how computer programs and other processes use their time and about what you can do to improve them.
£33.74