Scale of features

Data scaling is a preprocessing technique usually employed before feature selection and classification. Many artificial intelligence-based systems use features that are generated by many different feature extraction algorithms, with different kinds of sources. These features may have different dynamic ranges. Popular distance measures, such as Euclidean distance, implicitly assign more weighting to features with large ranges than those with small ranges. Feature scaling is thus required to approximately equalize ranges of the features and make them have approximately the same effect in the computation of similarity.

In addition, in several data mining applications with huge numbers of features with large dynamic ranges, ...

Get Hands-On Machine Learning on Google Cloud Platform now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.