Dimensionality reduction to improve performance

When we handle large volumes of data, some issues occur spontaneously. How does one build a representative model of a set of hundreds of variables? How does one view data across countless dimensions? To address these issues, we must adopt a series of techniques called dimensionality reduction. Dimensionality reduction is the process of converting a set of data with many variables into data with lesser dimensions while ensuring similar information. The aim is to reduce the number of dimensions in a dataset through either feature selection or feature extraction without significant loss of details. Feature selection approaches try to find a subset of the original variables. Feature extraction reduces ...

Get Hands-On Data Warehousing with Azure Data Factory now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.