Case Studies in Biometry
The datasets in this collection come from the book CASE STUDIES IN
BIOMETRY, by Nicholas Lange, Louise Ryan, Lynne Billard, David
Brillinger, Loveday Conquest, and Joel Greenhouse (1994), Wiley,
ISBN 0-471-58925-X.
The ".txt" files contain a complete description of the data.
NB: Some of the larger data files are gzipped, and the transfer relies
on your browser to gunzip them. Most browsers cooperate, but if you get
a non-text file, try running gunzip on it.
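If you would rather handle this in a script than by hand, the check can be sketched in Python as follows. This is only an illustration: the function name read_data_bytes is hypothetical, and it assumes a file that arrived still compressed begins with the standard two-byte gzip signature.

```python
import gzip
from pathlib import Path

GZIP_MAGIC = b"\x1f\x8b"  # the first two bytes of every gzip stream


def read_data_bytes(path):
    """Return the file's contents, gunzipping only if it is still compressed."""
    raw = Path(path).read_bytes()
    if raw[:2] == GZIP_MAGIC:
        # The browser did not decompress the download, so do it here.
        return gzip.decompress(raw)
    # Already plain text; return it unchanged.
    return raw
```

Calling read_data_bytes on a downloaded .dat file should then give the same bytes whether or not the browser gunzipped it.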
=============== ORIGINAL README FILE=================
Data Diskette for
CASE STUDIES IN BIOMETRY
By Nicholas Lange, Louise Ryan, Lynne Billard, David Brillinger,
Loveday Conquest, and Joel Greenhouse.
Published by John Wiley & Sons, Inc., copyright 1994.
____________________________________________________________________
You have obtained all of the data used by all of the authors in the
twenty-one chapters of Case Studies in Biometry.
Please note permissions and usage information included in
the .txt file for each subset.
You will need approximately 7 MB of space for the files.
____________________________________________________________________
Each chapter contains either one or two data sets, except for Chapter 19,
which has seven data sets. Corresponding to each data set are ASCII files
with extensions ".dat" (the data), ".txt" (the data description file), and
".sas" (a simple SAS program to read the .dat file).
For chapters containing only one data set, the file name prefix is
"ch" followed by chapter number, and then the above extensions.
For instance, for Chapter 2 you will find files "ch2.dat",
"ch2.txt", and "ch2.sas".
For chapters containing two data sets, the letters "a" and "b" are
added after the chapter number. For instance, for Chapter 1, you will
find the files "ch1a.dat", "ch1a.txt", "ch1a.sas", "ch1b.dat",
"ch1b.txt", "ch1b.sas".
For Chapter 19, there are seven data sets, ch19a.dat
through ch19g.dat. Each has the same format but
contains a different subset of the whole data set.
Details are given in the single text file, ch19.txt.
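The naming rules above can be written out as a small Python sketch. The function name expected_files and its parameters are hypothetical, introduced only for illustration. The shared_docs flag models Chapter 19, assuming (from the file listing in this README) that its seven .dat files share one .sas file as well as one .txt file.

```python
import string


def expected_files(chapter, n_datasets=1, shared_docs=False):
    """List the file names the naming convention implies for one chapter.

    shared_docs=True models Chapter 19, where the lettered .dat files
    share a single .txt file and a single .sas file.
    """
    prefix = f"ch{chapter}"
    if n_datasets == 1:
        stems = [prefix]  # e.g. "ch2"
    else:
        # e.g. "ch1a", "ch1b" for a chapter with two data sets
        stems = [prefix + letter for letter in string.ascii_lowercase[:n_datasets]]
    if shared_docs:
        return [f"{stem}.dat" for stem in stems] + [f"{prefix}.txt", f"{prefix}.sas"]
    return [f"{stem}{ext}" for stem in stems for ext in (".dat", ".txt", ".sas")]
```

For example, expected_files(2) gives the three Chapter 2 names quoted above, expected_files(1, 2) gives the six Chapter 1 names, and expected_files(19, 7, shared_docs=True) gives ch19a.dat through ch19g.dat followed by ch19.txt and ch19.sas.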
Here is a short index of the chapters, including the methods used in those chapters.
- Chapter 1.
Spatial Pattern Analyses to Detect Rare Disease Clusters
Lance A. Waller, Bruce W. Turnbull, Larry C. Clark, and Philip Nasca.
2-D random walk
Cluster detection
Geographical analysis machine
Spatial point processes
Voronoi polygons
- Chapter 2.
Assessing Toxicity of Pollutants in Aquatic Systems
A. John Bailer, James T. Oris.
Dose-response
Extra-Poisson variation
Generalized linear models
Resampling
Poisson regression
- Chapter 3.
Prediction Models for Personal Ozone Exposure Assessment
David Wypij and L.-J. Sally Liu.
Multiple linear regression
Regression diagnostics
Model selection
Residuals analysis
- Chapter 4.
Measurement Error Models for Gypsy Moth Studies
John P. Buonaccorsi.
Errors-in-variables
Inverse prediction
Measurement error models
Residuals analysis
Sampling and subsampling
- Chapter 5.
Estimating Pine Seedling Response to Ozone and Acid Rain
John O. Rawlings and Susan E. Spruill.
Analysis of variance
Design
Dose-response
Estimated generalized least squares
Longitudinal data
Mixed effects
Repeated measures
Randomized complete blocks
Split-plot and whole-plot designs
Transformations
Variance components
- Chapter 6.
Geostatistical Estimates of Scallop Abundance
Mark D. Ecker and James F. Heltshe.
Cross-validation
Kriging
Semivariograms
Stratified sampling
Transformations
- Chapter 7.
Survival Analysis for Size Regulation of Atlantic Halibut
Stephen J. Smith, Ken G. Waiwood, John D. Neilson.
Cox proportional hazards regression
Kaplan-Meier survival curves
Model selection
Parametric survival analysis
Regression diagnostics
Survival analysis
- Chapter 8.
Mixture Fraction and Linkage Analyses for Hybrid Onions
Dennis L. Clason, Joe Corgan, Catherine M. Cryder, N. Scott Urquhart.
Genetic linkage analysis
Maximum likelihood estimation
Mixture models
Multinomial models
- Chapter 9.
Spatial Association Learning in Hummingbirds
Jinko Graham, A. John Petkau.
Analysis of variance
Binary time series
Generalized estimating equations
Longitudinal data
Repeated measures
Transformations
- Chapter 10.
Habitat Association Studies of the Northern Spotted
Owl, Field Grouse, and Flammulated Owl
Fred L. Ramsey, Marti McCracken, John A. Crawford, Martin S. Drut,
William J. Ripple.
Case-control studies
Logistic regression
Model selection
Response surface analysis
Sampling
- Chapter 11.
Time-Series Analyses of Beaver Body Temperatures
Penny S. Reynolds.
Longitudinal data
Model selection
Regression diagnostics
Sampling
Time series
- Chapter 12.
Parametric Duration Analysis of Nursing Home Usage
Carl N. Morris, Edward C. Norton, Xiao H. Zhou.
Cox proportional hazards regression
Model selection
Parametric survival analysis
Survival analysis
Transformations
- Chapter 13.
Analysis of Attitudes Towards Workplace Smoking Restrictions
Shelley B. Bull.
Model selection
Polychotomous logistic regression
Residuals analysis
Regression diagnostics
Survey sampling
- Chapter 14.
Interpretation of a Leukemia Trial Stopped Early
Scott S. Emerson, Phillip L. C. Banks.
Clinical trials
Group sequential methods
Survival analysis
- Chapter 15.
Early Lung Cancer Detection Studies
Betty J. Flehinger, Marek Kimmel.
Screening
Survival analysis
- Chapter 16.
Modeling Interrater Agreement for Pathological Features of Choroidal Melanoma
B. Michele Melia, Marie Diener-West.
Clinical trials
Interrater agreement
Loglinear models
Ordered categorical
- Chapter 17.
Quality Control for Bone Mineral Density Scans
Suzanna Wong, Nancy Lane.
Design
Factor analysis
Interrater agreement
Principal components
Quality control
- Chapter 18.
Modeling the Precursors of Cervical Cancer
Allison J. Kirby, David J. Spiegelhalter.
Event history analysis
Gibbs sampling
Graphical models
Longitudinal data
Markov chain Monte Carlo
Repeated measures
- Chapter 19.
Patterns of Lung Cancer Risk in Ex-Smokers
Brenda W. Gillespie, Michael T. Halpern, Kenneth E. Warner.
Cox proportional hazards regression
Logistic regression
Survey sampling
Survival analysis
Splines
- Chapter 20.
Two-Stage Sampling Designs for Adolescent Depression Studies
Cheryl L. Addy, Kirby L. Jackson, Robert E. McKeown,
Jennifer L. Waller, Carol Z. Garrison.
Design
Logistic regression
Two-stage survey sampling
- Chapter 21.
Dose-response surface analysis
Logistic regression
Here is an index of methods, with the chapters that use each:
2-D random walk 1
Analysis of variance 5, 9
Case-control studies 10
Clinical trials 14, 16
Cluster detection 1
Cox proportional hazards regression 7, 12, 19
Cross-validation 6
Design 5, 17, 20
Dose-response 2, 5, 21
Errors-in-variables 4
Estimated generalized least squares 5
Event history analysis 18
Extra-Poisson variation 2
Factor analysis 17
Generalized estimating equations 9
Generalized linear models 2
Genetic linkage analysis 8
Geographical analysis machine 1
Gibbs sampling 18
Graphical models 18
Group sequential methods 14
Interrater agreement 16, 17
Inverse prediction 4
Kaplan-Meier survival curves 7
Kriging 6
Logistic regression 10, 13, 19, 20, 21
Loglinear models 16
Longitudinal data 5, 9, 11, 18
Markov chain Monte Carlo 18
Maximum likelihood estimation 8
Measurement error models 4
Mixed effects 5
Mixture models 8
Model selection 3, 7, 10, 11, 12, 13
Multinomial models 8
Multiple linear regression 3
Ordered categorical 16
Poisson regression 2
Principal components 17
Quality control 17
Randomized complete blocks 5
Regression diagnostics 3, 7, 11, 13
Repeated measures 5, 9, 18
Resampling 2
Residuals analysis 3, 4, 13
Response surface analysis 10
Sampling 4, 6, 10, 11, 13, 19, 20
Screening 15
Semivariograms 6
Spatial point processes 1
Splines 19
Split-plots 5
Survival analysis 7, 12, 14, 15, 19
Time series 9, 11
Transformations 5, 6, 9, 12
Variance components 5
Voronoi polygons 1
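A method index of this kind is the chapter index turned inside out. As a rough Python sketch (the function name invert_index is hypothetical, and only three chapters from the chapter index are used as sample input), the inversion looks like this. Note that it matches method names exactly and does not merge related entries such as "Survey sampling" and "Sampling".

```python
from collections import defaultdict


def invert_index(chapter_methods):
    """Turn {chapter: [methods]} into {method: sorted list of chapters}."""
    by_method = defaultdict(set)
    for chapter, methods in chapter_methods.items():
        for method in methods:
            by_method[method].add(chapter)
    # Sort methods alphabetically and chapters numerically.
    return {method: sorted(chapters) for method, chapters in sorted(by_method.items())}


# Sample input: Chapters 14, 15 and 16 as listed in the chapter index.
sample = {
    14: ["Clinical trials", "Group sequential methods", "Survival analysis"],
    15: ["Screening", "Survival analysis"],
    16: ["Clinical trials", "Interrater agreement", "Loglinear models",
         "Ordered categorical"],
}
index = invert_index(sample)
```

With this sample, index["Survival analysis"] holds chapters 14 and 15, and index["Clinical trials"] holds chapters 14 and 16.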
The available files and their sizes are:
- ch1a.dat: 30,020 bytes
- ch1a.sas: 123 bytes
- ch1a.txt: 956 bytes
- ch1b.dat: 288 bytes
- ch1b.sas: 121 bytes
- ch1b.txt: 1,003 bytes
- ch2.dat: 864 bytes
- ch2.sas: 252 bytes
- ch2.txt: 535 bytes
- ch3a.dat: 2,294 bytes
- ch3a.sas: 168 bytes
- ch3a.txt: 1,209 bytes
- ch3b.dat: 6,054 bytes
- ch3b.sas: 157 bytes
- ch3b.txt: 2,252 bytes
- ch4a.dat: 1,791 bytes
- ch4a.sas: 92 bytes
- ch4a.txt: 525 bytes
- ch4b.dat: 229 bytes
- ch4b.sas: 102 bytes
- ch4b.txt: 728 bytes
- ch5.dat: 74,486 bytes
- ch5.sas: 182 bytes
- ch5.txt: 2,197 bytes
- ch6.dat: 5,239 bytes
- ch6.sas: 158 bytes
- ch6.txt: 931 bytes
- ch7.dat: 19,111 bytes
- ch7.sas: 161 bytes
- ch7.txt: 820 bytes
- ch8.dat: 828 bytes
- ch8.sas: 118 bytes
- ch8.txt: 382 bytes
- ch9.dat: 231,661 bytes
- ch9.sas: 120 bytes
- ch9.txt: 774 bytes
- ch10.dat: 2,284 bytes
- ch10.sas: 126 bytes
- ch10.txt: 897 bytes
- ch11a.dat: 2,295 bytes
- ch11a.sas: 120 bytes
- ch11a.txt: 603 bytes
- ch11b.dat: 2,102 bytes
- ch11b.sas: 120 bytes
- ch11b.txt: 661 bytes
- ch12.dat: 26,333 bytes
- ch12.sas: 137 bytes
- ch12.txt: 502 bytes
- ch13.dat: 116,615 bytes
- ch13.sas: 109 bytes
- ch13.txt: 4,906 bytes
- ch14.dat: 9,490 bytes
- ch14.sas: 267 bytes
- ch14.txt: 827 bytes
- ch15.dat: 56,760 bytes
- ch15.sas: 141 bytes
- ch15.txt: 1,207 bytes
- ch16a.dat: 7,080 bytes
- ch16a.sas: 89 bytes
- ch16a.txt: 1,013 bytes
- ch16b.dat: 6,120 bytes
- ch16b.sas: 94 bytes
- ch16b.txt: 693 bytes
- ch17.dat: 5,644 bytes
- ch17.sas: 205 bytes
- ch17.txt: 1,040 bytes
- ch18a.dat: 16,982 bytes
- ch18a.sas: 118 bytes
- ch18a.txt: 758 bytes
- ch18b.dat: 78,120 bytes
- ch18b.sas: 118 bytes
- ch18b.txt: 729 bytes
- ch19.sas: 249 bytes
- ch19.txt: 1,969 bytes
- ch19a.dat: 43,010 bytes
- ch19b.dat: 650,531 bytes
- ch19c.dat: 586,296 bytes
- ch19d.dat: 1,118,997 bytes
- ch19e.dat: 1,443,392 bytes
- ch19f.dat: 680,198 bytes
- ch19g.dat: 903,759 bytes
- ch20.dat: 162,640 bytes
- ch20.sas: 146 bytes
- ch20.txt: 1,592 bytes
- ch21a.dat: 32,448 bytes
- ch21a.sas: 113 bytes
- ch21a.txt: 933 bytes
- ch21b.dat: 36,722 bytes
- ch21b.sas: 109 bytes
- ch21b.txt: 960 bytes