STATLAB Datensatz

Dies ist ein empirischer Datensatz und wurde aus dem Buch J. L. Hodges, D. Krech, R. S. Crutchfield, STATLAB - An Empirical Introduction to Statistics, Mc Graw Hill, 1975, entnommen. Dies ist ein äußerst lesenswertes Buch. Der Datensatz hat 1296 Beobachtungen von 33 Variablen (18 qualitativ, 15 quantitativ) und ist in verschiedenen Formaten verfuegbar: [txt] [csv] [rda]

Die Story zum Datensatz

The STATLAB Census covers 1296 member families of the Kaiser Foundation Health Plan (a prepaid medical care program) living in the San Francisco Bay Area during the years 1961-1972. These families were participating members of the Child Health and Development Study conceived and directed by Jacob Yerushalmy, for many years Professor of Biostatistics in the School of Public Health, University of California, Berkeley.

On her first visit to the Oakland hospital of the Health Plan after pregnancy was diagnosed, each woman was interviewed intensively on a wide range of medical and socioeconomic matters relating both to herself and to her husband. In addition, various physical and physiological measures were made. When her child was born, further data about her and her newborn baby were recorded. Approximately 10 years later the child and mother were called in for follow-up testing, interviewing, and measurement. In some instances, the husband was also interviewed and measured.

The 1296 families of the STATLAB Census are divided into two equal subpopulations: 648 families consisting of a mother, father, and female child; and 648 families of a mother, father, and male child. The children were all born in the Kaiser Foundation Hospital, Oakland, California, between 1 April 1961 and 15 April 1963. The Census does not cover any other children who may also have existed in these families.

Of the multitude of available data, 32 variables have been selected for the Census. The following 36 Census pages list each of these 32 variables for each of the 1296 families. The first 18 pages cover the families with girls; the second 18 pages cover the families with boys. Within each of these two sets of pages the families are listed in order of mother's age, with the youngest mothers first and the oldest last.

Legende

The 32 variables pertaining to each STATLAB family are grouped by CHILD, MOTHER, FATHER, FAMILY. Certain of the data were collected at the time of birth (1961-1963) and certain other data at the time of test (1971-1972), thus resulting in seven clusters of data:

The description and codes (where relevant) for each of the variables for each of the seven clusters, starting at the left of the printout, are as follows.

CODE
The Census pages consist of printouts numbered in consecutive dice numbers, i.e., the Census pages are numbered 11, 12, 13, 14, 15, 16, 21, 22, ... , 65, 66). Similarly the 36 families listed on each page are designated in consecutive dice numbers from 11 to 66. The identification number (ID no.) for any given family consists of two pairs of dice numbers, the first pair indicating page and the second pair indicating family on the page. To select a family purely at random from the population of 1296, it is necessary to throw the pair of dice twice. If, for example, the first throw gives a red 2 and a green 6, this selects page 26. If the second throw gives a red 5 and a green 4, this selects family 54 on that page. Thus the ID number for this family is 26-54.

CBSEX
Gender of child with coding: 0 (female), 1 (male).

CBB
Blood type with coding:
1 0 Rh-negative
2 A Rh-negative
3 B Rh-negative
4 AB Rh-negative
5 0 Rh-positive
6 A Rh-positive
7 B Rh-positive
8 AB Rh-positive
9 Unknown

CBLGTH
Length of baby to tenth inch.

CBWGT
Weight of Baby to tenth pound

CBMO
Month (1-12) of baby's birth.

CBD
Day (Su=1, Mo=2, ... , Sa=7) of baby's birth.

CBHR
Hour (1=1am,..., 12=12noon, 13=1pm,..., 24= 12midnight) of baby's birth.

CTHGHT
Height of child to tenth inch.

CTWGT
Weight of child to nearest pound.

CTL
Laterality: Combination of left or right handedness (H), with left or right eyedness (E), the latter being measured on two occasions (E1 and E2) with coding:
  H E1 E2
1 R R R
2 R R L
3 R L R
4 R L L
5 L R R
6 L R L
7 L L R
8 L L L
For example, code 2 indicates that a right-handed child was right-eyed dominant on the first observation and left-eyed dominant on the second.

CTPEA
Score on the Peabody Picture Vocabulary Test.

CTRA
Score on the Raven Progressive Matrices Test.

MBB
Blood type with coding:
1 0 Rh-negative
2 A Rh-negative
3 B Rh-negative
4 AB Rh-negative
5 0 Rh-positive
6 A Rh-positive
7 B Rh-positive
8 AB Rh-positive
9 Unknown

MBAG
Age of mother at last birthday before baby's birth.

MBWGT
Weight of mother (to nearest pound) at time pregnancy was first diagnosed.

MBO
Occupation of mother with coding:
0 Housewife
1 Office/Clerical
2 Sales
3 Teacher/Counselor
4 Professional/Managerial
5 Services
7 Factory worker
8 All other

MBSM
Smoking history of mother with coding:
-1 Never smoked cigarettes
0 Has now quit smoking, but did smoke at one time
1-99 Number of cigarettes currently being smoked per day

MTHGHT
Height of mother to tenth inch.

MTWGT
Weight of mother to nearest pound.

MTE
Education of mother with coding:
0 less than 8th grade
1 8th to 12th grade
2 High school graduate
3 Some college
4 College graduate

MTO
Occupation of mother with coding:
0 Housewife
1 Office/Clerical
2 Sales
3 Teacher/Counselor
4 Professional/Managerial
5 Services
7 Factory worker
8 All other

MTSM
Smoking history of mother with coding:
-1 Never smoked cigarettes
0 Has now quit smoking, but did smoke at one time
1-99 Number of cigarettes currently being smoked per day

FBAG
Age of father at last birthday before baby's birth.

FBO
Occupation of father with coding:
0 Professional (Freiberufl., akad.)
1 Teacher / Counselor (Anwalt)
2 Manager / Official (Beamter)
3 Self-employed (selbständig, nicht akad.)
4 Sales
5 Clerical
6 Craftsman (Handwerker) / Operator (Maschinist)
7 Laborer (Arbeiter)
8 Service worker (Angestellter im Dienstleistungssektor)

FBSM
Smoking history of father with coding:
-1 Never smoked cigarettes
0 Has now quit smoking, but did smoke at one time
1-99 Number of cigarettes currently being smoked per day

FTHGHT
Height of father to tenth inch.

FTWGT
Weight of father to nearest pound.

FTE
Education of father with coding:
0 less than 8th grade
1 8th to 12th grade
2 High school graduate
3 Some college
4 College graduate

FTO
Occupation of father with coding:
0 Professional (Freiberufl., akad.)
1 Teacher / Counselor (Anwalt)
2 Manager / Official (Beamter)
3 Self-employed (selbständig, nicht akad.)
4 Sales
5 Clerical
6 Craftsman (Handwerker) / Operator (Maschinist)
7 Laborer (Arbeiter)
8 Service worker (Angestellter im Dienstleistungssektor)

FTSM
Smoking history of father with coding:
-1 Never smoked cigarettes
0 Has now quit smoking, but did smoke at one time
1-99 Number of cigarettes currently being smoked per day

FIB
Income of family at time of birth in hundreds of dollars.

FIT
Income of family at time of test in hundreds of dollars.

FC
Church attendance with coding:
1 Entire family attends fairly regularly
2 Mother and child attend fairly regularly
3 Child only attends fairly regularly
4 Sporadic attendance - anyone in family
5 Attend on Holy Days only - anyone in family
6 No one in family ever attends


Last change: 2005-12-19 by zeileis