Chapter 11

Bivariate Smoothing

Consider the bivariate data shown in Figure 11.1. If asked, virtually anyone could draw a smooth curve that fits the data well, yet most would find it surprisingly difficult to describe precisely how they had done it. We focus here on a variety of methods for this task, called scatterplot smoothing.

Figure 11.1 Predictor–response data. A smooth curve sketched through these data would likely exhibit several peaks and troughs.

img

Effective smoothing methods for bivariate data are usually much simpler than for higher-dimensional problems; therefore we initially limit consideration to the case of n bivariate data points (xi, yi), i = 1, . . ., n. Chapter 12 covers smoothing multivariate data.

The goal of smoothing is different for predictor–response data than for general bivariate data. With predictor–response data, the random response variable Y is assumed to be a function (probably stochastic) of the value of a predictor variable X. For example, a model commonly assumed for predictor–response data is Yi = s (xi) + imgi, where the imgi are zero-mean stochastic noise and s is a smooth function. In this case, the conditional distribution of Y|x describes how Y depends ...

Get Computational Statistics, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.