If yes, then it is as follows:
http://en.wikipedia.org/wiki/File:Boxplot_vs_PDF.svg
--
You received this message because you are subscribed to the ggplot2 mailing list.
Please provide a reproducible example: http://gist.github.com/270442
To post: email ggp...@googlegroups.com
To unsubscribe: email ggplot2+u...@googlegroups.com
More options: http://groups.google.com/group/ggplot2
IQR = Q3-Q1
upper whisker = max (Q3, Q3+1.5*IQR)
lower whisker = min (Q1, Q1-1.5*IQR)
upper whisker = min(max(x), Q3 + 1.5 * IQR)
lower whisker = max(min(x), Q1 - 1.5 * IQR)
Dennis
In other words...
upper whisker = min(max(x), Q3 + 1.5 * IQR)
lower whisker = max(min(x), Q1 - 1.5 * IQR)
Definitions:
What we now call the interquartile range (IQR) was called the
'H-spread' by Tukey.
A 'step' = 1.5 * IQR
The inner fences are one step above Q3 and one step below Q1.
The outer fences are two steps above Q3 and two steps below Q1.
An adjacent value is a value at each end closest to, but still inside,
an inner fence.
Values between an inner and outer fence are called 'outside values'.
Values beyond outer fences are said to be 'far out'.
Today the convention is to term any value outside the inner fence as
an outlier, but Tukey deliberately avoided that terminology.
On p. 47, Tukey suggested the following set of rules for a boxplot:
* outside and far out values should appear separately.
* whiskers should end at the adjacent values.
* 'far out values should be marked impressively and identified in
capital letters'.
* outside and adjacent values should all be identified in small letters.
The convention in R is not to label the adjacent, outside and far out
values, but to identify them as individual points. The user has the
option of identifying them with text annotation.
Dennis
Dennis