You are previewing KNIME Essentials.
O'Reilly logo
KNIME Essentials

Book Description

Perform accurate data analysis using the power of KNIME

  • Learn the essentials of KNIME, from importing data to data visualization and reporting

  • Utilize a wide range of data processing solutions

  • Visualize your final data sets using KNIME’s powerful data visualization options

In Detail

KNIME is an open source data analytics, reporting, and integration platform, which allows you to analyze a small or large amount of data without having to reach out to programming languages like R.

"KNIME Essentials" teaches you all you need to know to start processing your first data sets using KNIME. It covers topics like installation, data processing, and data visualization including the KNIME reporting features. Data processing forms a fundamental part of KNIME, and KNIME Essentials ensures that you are fully comfortable with this aspect of KNIME before showing you how to visualize this data and generate reports.

"KNIME Essentials" guides you through the process of the installation of KNIME through to the generation of reports based on data. The main parts between these two phases are the data processing and the visualization. The KNIME variants of data analysis concepts are introduced, and after the configuration and installation description comes the data processing which has many options to convert or extend it. Visualization makes it easier to get an overview for parts of the data, while reporting offers a way to summarize them in a nice way.

Table of Contents

  1. KNIME Essentials
    1. Table of Contents
    2. KNIME Essentials
    3. Credits
    4. About the Author
    5. About the Reviewers
    6. www.PacktPub.com
      1. Support files, eBooks, discount offers and more
        1. Why Subscribe?
        2. Free Access for Packt account holders
    7. Preface
      1. What this book covers
      2. What you need for this book
      3. Who this book is for
      4. Conventions
      5. Reader feedback
      6. Customer support
        1. Downloading the example code
        2. Errata
        3. Piracy
        4. Questions
    8. 1. Installing and Using KNIME
      1. Few words about KNIME
      2. Installing KNIME
        1. Installation using the archive
          1. KNIME for Windows
          2. KNIME for Linux
          3. KNIME for Mac OS X
        2. Troubleshooting
      3. KNIME terminologies
        1. Organizing your work
        2. Nodes
          1. Node lifecycle
        3. Meta nodes
        4. Ports
          1. Data tables
          2. Port view
        5. Flow variables
        6. Node views
          1. HiLite
        7. Eclipse concepts
          1. Preferences
          2. Logging
      4. User interface
        1. Getting started
        2. Setting preferences
          1. KNIME
          2. Other preferences
        3. Installing extensions
        4. Workbench
          1. Workflow handling
          2. Node controls
            1. HiLite
            2. Variable flows
          3. Meta nodes
          4. Workflow lifecycle
          5. Other views
      5. Summary
    9. 2. Data Preprocessing
      1. Importing data
        1. Importing data from a database
          1. Starting Java DB
        2. Importing data from tabular files
        3. Importing data from web services
          1. REST services
        4. Importing XML files
        5. Importing models
        6. Other formats
        7. Public data sources
      2. Regular expressions
        1. Basic syntax
        2. Partial versus whole match
        3. Usage from Java
        4. References and tools
        5. Alternative pattern description
      3. Transforming the shape
        1. Filtering rows
          1. Sampling
        2. Appending tables
        3. Less columns
          1. Dimension reduction
        4. More columns
        5. GroupBy
        6. Pivoting and Unpivoting
        7. One2Many and Many2One
        8. Cosmetic transformations
          1. Renames
          2. Changing the column order
          3. Reordering the rows
          4. The row ID
        9. Transpose
      4. Transforming values
        1. Generic transformations
          1. Java snippets
          2. The Math Formula node
        2. Conversion between types
          1. Binning
        3. Normalization
          1. Text normalization
            1. Regular expressions
        4. Multiple columns
        5. XML transformation
        6. Time transformation
        7. Smoothing
      5. Data generation
        1. Generating the grid
      6. Constraints
      7. Loops
      8. Workflow customization
      9. Case study – finding min-max in the next n rows
      10. Case study – ranks within groups
      11. Summary
    10. 3. Data Exploration
      1. Computing statistics
      2. Overview of visualizations
      3. Visual guide for the views
      4. Distance matrix
      5. Using visual properties
        1. Color
        2. Size
        3. Shape
      6. KNIME views
        1. HiLite
          1. Use cases for HiLite
        2. Row IDs
        3. Extreme values
      7. Basic KNIME views
        1. The Box plots
        2. Hierarchical clustering
        3. Histograms
        4. Interactive Table
        5. The Lift chart
        6. Lines
        7. Pie charts
        8. The Scatter plots
        9. Spark Line Appender
        10. Radar Plot Appender
        11. The Scorer views
      8. JFreeChart
        1. The Bar charts
        2. The Bubble chart
        3. Heatmap
        4. The Histogram chart
        5. The Interval chart
        6. The Line chart
        7. The Pie chart
        8. The Scatter plot
      9. Open Street Map
      10. 3D Scatterplot
      11. Other visualization nodes
        1. The R plot, Python plot, and Matlab plot
        2. The official R plots
        3. The RapidMiner view
        4. The HiTS visualization
      12. Tips for HiLiting
        1. Using Interactive HiLite Collector
        2. Finding connections
      13. Visualizing models
        1. Further ideas
      14. Summary
    11. 4. Reporting
      1. Installation of the reporting extensions
      2. Reporting concepts
      3. Importing data
        1. Sending data and images to a report
        2. Importing from other sources
        3. Joining data sets
      4. Preferences
      5. Using the designer
        1. In visible views
        2. Report properties
        3. Report items
          1. Label
          2. Text
            1. Binding
          3. Dynamic text
          4. Data
          5. Image
          6. Grid
          7. List
            1. Groups
            2. Sorting
            3. Filters
          8. Table
          9. Chart
          10. Cross Tab
            1. Setting up
            2. Changing
            3. Using data cubes
        4. Quick Tools
          1. Aggregation
          2. Relative time period
      6. Generating reports
      7. Using colors
      8. Using HiLite
      9. Using workflow variables
      10. Suggested readings
      11. Summary
    12. Index