next up previous index
Next: Antje's slides about regression Up: Bivariate Data Previous: Scatterplots   Index

Correlation

In the above example the correlation of C1 and C2 = 0.983. The scatterplot shows a very definite linear relationaship, this is measured by a numerical summary: The correlation coefficient, it is symmetric in the two variables, is between -1 and +1, the larger it's absolute value the stronger the linear relation between the variables.

The two variables are usually measured in completely different units, the way to make the two comparable is to use our z-scale, ie deviations from the mean measured in standard deviations.

This gives the correlation coefficient defined as:

\begin{displaymath}r=\frac{\sum (x_i-\bar{x})(y_i-\bar{y})}{
\sqrt{\sum (x_i-\bar{x})^2}\sqrt{\sum (y_i-\bar{y})^2
}}\end{displaymath}

$r$ is not affected by:

Good places to visit.... happy trails:

  1. Guessing correlation
  2. About correlation
  3. NIST handbook
  4. Example with formulae
  5. Correlation, regression, outlier points
  6. Correlation lesson for psychologists
  7. With an ellipsoid

Outliers:
NIST handbook


next up previous index
Next: Antje's slides about regression Up: Bivariate Data Previous: Scatterplots   Index
Susan Holmes
2000-11-28