You are previewing The Microsoft® Data Warehouse Toolkit: With SQL Server™ 2005 and the Microsoft® Business Intelligence Toolset.
O'Reilly logo
The Microsoft® Data Warehouse Toolkit: With SQL Server™ 2005 and the Microsoft® Business Intelligence Toolset

Book Description

This groundbreaking book is the first in the Kimball Toolkit series to be product-specific. Microsoft’s BI toolset has undergone significant changes in the SQL Server 2005 development cycle. SQL Server 2005 is the first viable, full-functioned data warehouse and business intelligence platform to be offered at a price that will make data warehousing and business intelligence available to a broad set of organizations. This book is meant to offer practical techniques to guide those organizations through the myriad of challenges to true success as measured by contribution to business value.

Building a data warehousing and business intelligence system is a complex business and engineering effort. While there are significant technical challenges to overcome in successfully deploying a data warehouse, the authors find that the most common reason for data warehouse project failure is insufficient focus on the business users and business problems. In an effort to help people gain success, this book takes the proven Business Dimensional Lifecycle approach first described in best selling The Data Warehouse Lifecycle Toolkit and applies it to the Microsoft SQL Server 2005 tool set.

Beginning with a thorough description of how to gather business requirements, the book then works through the details of creating the target dimensional model, setting up the data warehouse infrastructure, creating the relational atomic database, creating the analysis services databases, designing and building the standard report set, implementing security, dealing with metadata, managing ongoing maintenance and growing the DW/BI system. All of these steps tie back to the business requirements. Each chapter describes the practical steps in the context of the SQL Server 2005 platform.

Intended Audience

The target audience for this book is the IT department or service provider (consultant) who is:

  • Planning a small to mid-range data warehouse project;

  • Evaluating or planning to use Microsoft technologies as the primary or exclusive data warehouse server technology;

  • Familiar with the general concepts of data warehousing and business intelligence.

The book will be directed primarily at the project leader and the warehouse developers, although everyone involved with a data warehouse project will find the book useful. Some of the book’s content will be more technical than the typical project leader will need; other chapters and sections will focus on business issues that are interesting to a database administrator or programmer as guiding information.

The book is focused on the mass market, where the volume of data in a single application or data mart is less than 500 GB of raw data. While the book does discuss issues around handling larger warehouses in the Microsoft environment, it is not exclusively, or even primarily, concerned with the unusual challenges of extremely large datasets.

About the Authors

JOY MUNDY has focused on data warehousing and business intelligence since the early 1990s, specializing in business requirements analysis, dimensional modeling, and business intelligence systems architecture. Joy co-founded InfoDynamics LLC, a data warehouse consulting firm, then joined Microsoft WebTV to develop closed-loop analytic applications and a packaged data warehouse.

Before returning to consulting with the Kimball Group in 2004, Joy worked in Microsoft SQL Server product development, managing a team that developed the best practices for building business intelligence systems on the Microsoft platform. Joy began her career as a business analyst in banking and finance. She graduated from Tufts University with a BA in Economics, and from Stanford with an MS in Engineering Economic Systems.

WARREN THORNTHWAITE has been building data warehousing and business intelligence systems since 1980. Warren worked at Metaphor for eight years, where he managed the consulting organization and implemented many major data warehouse systems. After Metaphor, Warren managed the enterprise-wide data warehouse development at Stanford University. He then co-founded InfoDynamics LLC, a data warehouse consulting firm, with his co-author, Joy Mundy. Warren joined up with WebTV to help build a world class, multi-terabyte customer focused data warehouse before returning to consulting with the Kimball Group. In addition to designing data warehouses for a range of industries, Warren speaks at major industry conferences and for leading vendors, and is a long-time instructor for Kimball University. Warren holds an MBA in Decision Sciences from the University of Pennsylvania's Wharton School, and a BA in Communications Studies from the University of Michigan.

RALPH KIMBALL, PH.D., has been a leading visionary in the data warehouse industry since 1982 and is one of today's most internationally well-known authors, speakers, consultants, and teachers on data warehousing. He writes the "Data Warehouse Architect" column for Intelligent Enterprise (formerly DBMS) magazine.

Table of Contents

  1. Copyright
  2. About the Authors
  3. Credits
  4. Foreword
  5. Preface
  6. Acknowledgments
  7. Introduction
  8. Requirements, Realities, and Architecture
    1. Defining Business Requirements
      1. The Most Important Determinant of Long-Term Success
      2. Uncovering Business Value
      3. Business Requirements Example: Adventure Works Cycles
      4. Summary
    2. Designing the Business Process Dimensional Model
      1. Dimensional Modeling Concepts and Terminology
      2. Additional Design Concepts and Techniques
      3. The Dimensional Modeling Process
      4. Case Study: Creating the Adventure Works Cycles Orders Dimensional Model
      5. Summary
    3. The Toolset
      1. The Microsoft DW/BI Toolset
      2. Why Use the Microsoft Toolset?
      3. Architecture of a Microsoft DW/BI System
      4. Overview of the Microsoft Tools
      5. Summary
  9. Developing and Populating the Databases
    1. Setup and Physical Design
      1. System Sizing Considerations
      2. System Configuration Considerations
      3. Software Installation and Configuration
      4. Physical Data Warehouse Database Design
      5. Summary
    2. Designing the ETL System
      1. An Introduction to SQL Server Integration Services
      2. High-Level Planning
      3. Updating Analysis Services Databases
      4. ETL System Physical Design
      5. Developing a Detailed Specification
      6. Summary
    3. Developing the ETL System
      1. Getting Started
      2. Dimension Processing
      3. Fact Processing
      4. Tying It All Together
      5. Summary
    4. Designing the Analysis Services OLAP Database
      1. Why Analysis Services?
      2. Designing the OLAP Structure
      3. Physical Design Considerations
      4. Summary
  10. Developing the BI Applications
    1. Business Intelligence Applications
      1. Business Intelligence Basic Concepts
      2. The Value of Business Intelligence Applications
      3. Delivery Platform Options
      4. The BI Application Development Process
      5. Summary
    2. Building the BI Application in Reporting Services
      1. A High-Level Architecture for Reporting
      2. Building and Delivering Reports
      3. Summary
    3. Incorporating Data Mining
      1. Defining Data Mining
      2. SQL Server Data Mining Architecture Overview
      3. Microsoft Data Mining Algorithms
      4. The Data Mining Process
      5. Data Mining Examples
      6. Summary
  11. Deploying and Managing the DW/BI System
    1. Working with an Existing Data Warehouse
      1. The Current State of Affairs
      2. Conversion from SQL Server 2000
      3. Integrating with Non-SQL Server 2005 Components
      4. Summary
    2. Security
      1. Identifying the Security Manager
      2. Securing the Hardware
      3. Securing the Operating System
      4. Securing the Development Environment
      5. Securing the Data
      6. Usage Monitoring
      7. Protecting Privacy
      8. Summary
    3. Metadata Plan
      1. Metadata Basics
      2. Metadata Standards
      3. SQL Server 2005 Metadata
      4. A Practical Metadata Approach
      5. Summary
    4. Deployment
      1. System Deployment
      2. Data Warehouse and BI Documentation
      3. User Training
      4. User Support
      5. Desktop Readiness and Configuration
      6. Summary
    5. Operations and Maintenance
      1. Providing User Support
      2. System Management
      3. Summary
  12. Extending the DW/BI System
    1. ManagingGrowth
      1. Lifecycle Iteration: Growing the DW/BI System
      2. Marketing and Expectation Management
      3. System Interconnection
      4. Summary
    2. Real-Time Business Intelligence
      1. Making the Case For (and Against) Real-Time Data
      2. Executing Reports in Real Time
      3. Loading the DW/BI System in Real Time
      4. Using Analysis Services to Deliver Real-Time Data
      5. Summary
    3. Present Imperatives and Future Outlook
      1. The Big Risks in a DW/BI Project
      2. What We Like in the Microsoft BI Toolset
      3. Future Directions: Room for Improvement
      4. Conclusion