Chapter 8

Defect Prediction

Abstract

In this part of the book Data Science for Software Engineering: Sharing Data and Models, we offer some tutorial notes on commonly used software engineering applications of data mining, along with some tutorial material on data mining algorithms. Covered topics of SE problems include effort estimation and defect prediction. Covered aspects of data mining include discretization, column pruning (also known as feature selection), row pruning, clustering, contrast set learning, decision learning, and learning for continuous classes.

Defect prediction is the study of predicting which software “modules” are defective. Here “modules” means some primitive unit of a running system such as a function or a class. ...

Get Sharing Data and Models in Software Engineering now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.