coleman {rrcov}R Documentation

Coleman Data Set

Description

Contains information on 20 Schools from the Mid-Atlantic and New England States, drawn from a population studied by Coleman et al. (1966). Mosteller and Tukey (1977) analyze this sample consisting of measurements on six different variables, one of which will be treated as a responce.

Usage

data(coleman)

Format

A data frame with 20 observations on the following 6 variables.

X1
staff salaries per pupil
X2
percent of white-collar fathers
X3
socioeconomic status composite deviation: means for family size, family intactness, father's education, mother's education, and home items
X4
mean teacher's verbal test score
X5
mean mother's educational level, one unit is equal to two school years
Y
verbal mean test score (y, all sixth graders)

For convenience, the data sets coleman.x, a matrix with the five (independent) variables of the data frame, and coleman.y, the numeric vector giving the sixth (dependent) variable, are provided as well.

Source

P. J. Rousseeuw and A. M. Leroy (1987) Robust Regression and Outlier Detection. Wiley, p.79, table 2.

Examples

data(coleman)
covMcd(coleman.x)
summary(lm.coleman <- lm(coleman.y ~ coleman.x))


[Package rrcov version 0.2-5 Index]