From association to causation : some remarks on the history of statistics

David Freedman

Journal de la société française de statistique (1999)

  • Volume: 140, Issue: 3, page 5-32
  • ISSN: 1962-5197

How to cite


Freedman, David. "From association to causation : some remarks on the history of statistics." Journal de la société française de statistique 140.3 (1999): 5-32. <>.

author = {Freedman, David},
journal = {Journal de la société française de statistique},
keywords = {Association; causation; regression; history of statistics; modeling significance; epidemiology},
language = {eng},
number = {3},
pages = {5-32},
publisher = {Société française de statistique},
title = {From association to causation : some remarks on the history of statistics},
url = {},
volume = {140},
year = {1999},

AU - Freedman, David
TI - From association to causation : some remarks on the history of statistics
JO - Journal de la société française de statistique
PY - 1999
PB - Société française de statistique
VL - 140
IS - 3
SP - 5
EP - 32
LA - eng
KW - Association; causation; regression; history of statistics; modeling significance; epidemiology
UR -
ER -


  1. ABBOTT A. (1997), Of time and space : the contemporary relevance of the Chicago school. Social Forces 75 1149-82. 
  2. ALBERTS B., BRAY D., LEWIS J., RAFF M., ROBERTS K. and WATSON J. D. (1994). Molecular Biology of the Cell, 3rd. ed., Garland Publishing, New York. 
  3. AMERICAN CANCER SOCIETY (1997). Cancer Facts & Figures - 1997 Atlanta, Georgia. 
  4. ANGRIST J. D., IMBENS G. W. and RUBIN D. B. (1996). Identification of causal effects using instrumental variables. J. Amer. Statist. Assoc. 91 444-72 Zbl0897.62130
  5. AURELIAN L., SCHUMANN B., MARCUS R. L. and DAVIS H. J. (1973). Antibody to HSV-2 induced tumor specific antigens in serums from patients with cervical carcinoma. Science 181 161-64 
  6. BERKSON J. (1955). The statistical study of association between smoking and lung cancer. Proc. Mayo Clinic 30 319-48. 
  7. BLAU P. M. and DUNCAN O. D. (1967). The American Occupational Structure. Wiley, New York. Chapter 5. 
  8. BRESLOW N. and DAY N. E. (1980). Statistical Methods in Cancer Research, Vol. 1. International Agency for Research on Cancer, Lyon. Sci. Publ. No. 32. Distributed by Oxford University Press. MR614143
  9. BRESLOW N. and DAY N. E. (1987). Statistical Methods in Cancer Research, Vol. 2, International Agency for Research on Cancer, Lyon. Sci. Publ. No. 82. Distributed by Oxford University Press. 
  10. BROSS I. D. J. (1960). Statistical criticism. Cancer 13 394-400. 
  11. BUCK C., LLOPIS A., NAJERA E. and TERRIS M., eds (1989). The Challenge of Epidemiology : Issues and Setected Readings, Sci. Publ. No. 505, World Health Organization, Geneva. 
  12. CANNISTRA S. A. and NlLOFF J. M. (1996) Cancer of the uterine cervix. New Engl. J. Med. 334 1030-38. 
  13. CARMELLI D. and PAGE W. F. (1996) Twenty-four year mortality in World War II US male veteran twins discordant for cigarette smoking. Int. J. Epidemiol. 25 554-559. 
  14. CARPENTER K. J. (1981). Pellagra, Academic Press. 
  15. COLWELL R. R. (1996). Global climate and infectious disease : the cholera paradigm. Science 274 2025-31 
  16. COOK D. (1980). Smoking and lung cancer. In S. E. Fienberg and D. V. Hinkley, eds. R. A. Fisher, An Appreciation. Lecture notes in statistics, Vol 1, pp. 182-91, SpringerVerlag, New York. Zbl0436.62002MR578886
  17. COPAS J. B. and LI H. G. (1997). Inference for non-random samples. J. Roy. Statist. Soc. Ser. B 59 55-77. Zbl0896.62003MR1436555
  18. CORNFIELD J. (1951). A method for estimating comparative rates from clinical data. Applications to cancer of the lung, breast and cervix. J. Nat. Cancer Int. 11 1269-75. 
  19. CORNFIELD J., HAENSZEL W., HAMMOND E C., LILIENFELD A. M. , SHIMKIN M. B. and WYNDER E. L. (1959). Smoking and lung cancer : recent evidence and a discussion of some questions. J. Nat. Cancer Inst. 22 173-203. 
  20. COX D. (1977). The role of significance tests. Scand. J. Statist. 4 49-70. Zbl0358.62006MR448666
  21. DESROSIERES A. (1993). La politique des grands nombres : histoire de la raison statistique. Editions La Découverte, Paris. English translation by C. Naish ( 1998). The Politics of Large Numbers : A History of Statistical Reasoning. Harvard University Press MR1925229
  22. DICKERSIN K. (1997). How important is publication bias ?A synthesis of available data. AIDS Education and Prevention 9 Suppl. A 15-21 
  23. DIJKSTRA T. K., ed. (1988). On Model Uncertainty and its Statistical Implications. Lecture Notes No. 307 in Economics and Mathematical Systems, Springer. Zbl1114.62303
  24. DOLL R. (1987). Major epidemics of the 20th century : from coronary thrombosis to AIDS. J. Roy. Statist. Soc. Ser. A 150 373-95. 
  25. DOLL R., EVANS H. J. and DARBY S. C. (1994). Paternal exposure not to blame. Nature 367 678-80. 
  26. DOLL R. and HILL, A. B. (1950). Smoking and carcinoma of the lung : preliminary report. Br. Med. J. ii 739-48. 
  27. DOLL R. and HILL A. B. (1952). A study of the actiology of carcinoma of the lung. Br. Med. J. ii 1271-86 
  28. DOLL R. and HILL, A. B. (1954). The mortality of doctors in relation to their smoking habit : A preliminary report Br. Med. J. i 1451-55. 
  29. DUBOS R. (1988). Pasteur and Modern Science. Springer. 
  30. EHRENBERG A. S. C. and BOUND J. A. (1993). Predictability and prediction. J. Roy. Statist. Soc. Ser. A 156 Part 2 167-206 (with discussion. 
  31. EVANS A. S. (1993). Causation and Disease : A Chronological Journey. Plenum, New York 
  32. EVANS R. J. (1987). Death in Hamburg : Society and Politics in the Cholera Year. Oxford University Press. 
  33. FlNLAY B. B. , HEFFRON F. and FIALKOW S. (1989). Epithelial cell surfaces induce Salmonella proteins required for bacterial adherence and invasion. Science 243 940-42. 
  34. FISHER R. A. (1959). Smoking : The Cancer Controversy. Oliver and Boyd, Edinburgh 
  35. FREEDMAN D. (1983). A note on screening regression equations. Amer. Statistician 37 152-55. MR702208
  36. FREEDMAN D. (1987). As others see us : a case study in path analysis. J. Educational Statistics 12 101-223. 
  37. FREEDMAN D. (1991). Statistical models and shoe leather. In P. Marsden, ed., Sociol. Methodol. 
  38. FREEDMAN D. (1995). Some issues in the foundation of statistics. Foundations of Science 1 19-83. Zbl0945.62004MR1798108
  39. FREEDMAN D. (1997). From association to causation via regression. Adv. Appl. Math. 18 59-110. Zbl0873.90019MR1425950
  40. FREEDMAN D., GOLD L. S. and LIN T. H. (1996). Concordance between rats and mice in bioassays for carcinogenesis. Reg. Tox. Pharmacol. 23 225-32. 
  41. FREEDMAN D. and NAVIDI W. (1989). On the multistage model for carcinogenesis. Environ. Health Perspect. 81 169-88. 
  42. FREEDMAN D. and NAVIDI W. (1990). Ex-smokers and the multistage model for lung cancer. Epidemiol. 1 21-29 
  43. FREEDMAN D., PISANI R. , and PURVES R. (1997). Statistics. 3rd ed. Norton, New York. 
  44. FREEDMAN D. and ZEISEL H. (1988). From mouse to man : the quantitative assessment of cancer risks. Statistical Science 3 3-56 (with discussion). Zbl0955.62637MR959715
  45. FRIEDMAN M. (1953). Essays in Positive Economics. University of Chicago Press. 
  46. GAGNON F. (1950). Contribution to the study of the etiology and prevention of cancer of the cervix. Amer. J. Obstetrics and Gynecology 60 516-22. 
  47. GAIL M. H. (1996). Statistics in action. J. Amer. Statist. Assoc. 433 1-13 
  48. GAMBLE J. F. (1998). PM25 and mortality in long-term prospective cohort studies : cause effect or statistical associations? Environ. Health Perspect. 106 535-49. 
  49. GARDNER M. J , SNEE M. P., HALL A J., POWELL C. A. , DOWNES S. and TERRELL J. D. (1990). Results of case-control study of leukaemia and lymphoma among young people near Sellafield nuclear plant in West Cumbria. Br. Med. J. 300 423-33. Published erratum appears in BMJ 1992 305 715, and see letter in BMJ 1991 302 907. 
  50. GARDNER M J. (1992). Leukemia in children and paternal radiation exposure at the Sellafield nuclear site. Mon. Nat. Cancer Inst. 12 133-35. 
  51. GAUSS C. F. (1809). Theoria Motus Corporum Coelestium. Perthes et Besser, Hamburg. Reprinted in 1963 by Dover, New York 
  52. GAVARRET J. (1840). Principes généraux de statistique médicale, ou, Développement des règles qui doivent présider à son emploi. Bechet jeune et Labe, Paris. 
  53. GOLDTHORPE J. H. (1998). Causation, Statistics and Sociology. Twenthy-ninth Geary Lecture, Nuffield College, Oxford. Publ. by the Economic and Social Research Institute, Dublin, Ireland. 
  54. GREENLAND S., PEARL J., and ROBINS J. M. (1998). Causal diagrams for epidemiologic research. Epidemiol. 10 37-48 
  55. HAKAMA M., LEHTINEN M. , KNEKT P. , AROMAA A., LEINIKKI, P., MIETTINEN A., PAAVONEN J., PETO R. and TEPPO L. (1993). Serum antibodies and subsequent cervical ne oplasms : A prospective study with 12 years of followup. Amer. J. Epidemiol. 137 166-70 
  56. HODGES J. L. and LEHMANN E. L. (1964). Basic Concepts of Probability and Statistics. Holden-Day, San Francisco. Zbl0131.34706MR185709
  57. HOLLAND P. (1988). Causal inference, path analysis, and recursive structural equations models. In C. Clogg, ed., Sociol. Methodol.. 
  58. HOWARD-JONES N. (1975). The Scientific Background of the International Sanitary Conferences 1851-1938 . World Health Organization, Geneva. 
  59. HUMPHREYS P. and FREEDMAN D. (1996). The Grand Leap. Brit. J. Phil. Sci. 47 113-123 
  60. HUMPHREYS P. and FREEDMAN D. (1999). Are there algorithms that discover causal structure? Technical report no. 514, Department of Statistics, University of California, Berkeley. Synthese 121 29-54. Zbl1157.62310
  61. IARC (1986). Tobacco Smoking. International Agency for Research on Cancer, Monograph 38, Lyon. Distributed by Oxford University Press. 
  62. KANAREK M. S. , CONFORTI P. M., JACKSON L. A. , COOPER R. C. , and MURCHIO J. C. (1980). Asbestos in drinking water and cancer incidence in the San Francisco Bay Area. Amer. J. Epidemiol. 112 54-72. 
  63. KAPRIO J. and KOSKENVUO M. (1989). Twins, smoking and mortality : a 12-year prospective study of smoking-discordant twin pairs. Social Science and Medicine 29 1083-89. 
  64. KINLEN L. J. and JOHN S. M. (1994). Wartime evacuation and mortality from childhood leukaemia in England and Wales in 1945-9. Br. Med. J. 309 1197-1201. 
  65. LANG J. M., ROTHMAN K. J., and CANN C. I. (1998). That confounded P-value. Epidemiology 9 7-8. 
  66. LEGENDRE A. M. (1805). Nouvelles méthodes pour la détermination des orbites des comètes. Courcier, Paris. Reprinted in 1959 by Dover, New York. 
  67. LIEBERSON S. (1985). Making it Count. University of California Press, Berkeley. 
  68. LIU T. C. (1960). Under-identification, structural estimation, and forecasting. Econometrica 28 855-65. 
  69. LOMBARD H. L. and DOERING C. R. (1928). Cancer studies in Massachusetts : Habits, charactenstics and environment of individuals with and without lung cancer. New Engl. J. Med. 198 481-87. 
  70. LOUIS P. (1835). Recherches sur les effets de la saignée dans quelques maladies inflammatoires : et sur l'action de l'émétique et des vésicatoires dans la pneumonie. J. B. Baillière, Paris. Reprinted by The Classics of Medicine Library, Birmingham, Alabama, 1986. 
  71. LUCAS R. E. Jr. (1976). Econometric policy evaluation : a critique. In K. Brunner and A. Meltzer (eds.), The Phillips Curve and Labor Markets, vol. 1 of the Carnegie-Rochester Conferences on Public Policy, supplementary series to the Journal of Monetary Economics, North-Holland, Amsterdam, pp. 19-64. (With discussion.) 
  72. MANSKI C. F. (1995). Identification Problems in the Social Sciences. Harvard University Press. 
  73. MCKIM V. and TURNER S., eds. (1997). Causality m Crisis ? Proceedmgs of the Notre Dame Conference on Causahty, Notre Dame Press. 
  74. MlLL J. S. (1843) A System of Logic, Ratiocinative and Inductive John W. Parker, London. 8th ed. reprinted by Longman, Green and Co., Ltd., London ( 1965). See especially Book III Chapter VIII. Reprinted in 1974 by the University of Toronto Press. 
  75. MILLER J. F., MEKALANOS J. J. and FIALKOW S. (1989). Coordinate regulation and sensory transduction in the control of bacterial virulence. Science 243 916-22. 
  76. MULLER F. H. (1939). Tabakmissbrauch und Lungcarcinom. Zeitschrift fur Krebsforsuch 49 57-84. 
  77. NATIONAL RESEARCH COUNCIL (1997). Possible Health Effects of Exposure to Residential Electric and Magnetic Fields. National Academy of Science, Washington, DC. 
  78. NEYMAN J. (1923). Sur les applications de la théorie des probabilités aux expériences agricoles : Essai des principes. Roczniki Nauk Rolniczki 10 1-51, in Polish. English translation by D. Dabrowska and T. Speed, 1990. Statistical Science 5 463-80. 
  79. OTTENBACHER K. J. (1998). Quantitative evaluation of multiplicity in epidemiology and public health research. Amer. J. Epidemiol. 147 615-19. 
  80. PANETH N., VlNTEN-JOHANSEN P., BRODY H. and RIP M. (1998). A rivalry of foulness : official and unofficial investigations of the London cholera epidemic of 1854 . Amer. J. Publ. Health 88 1545-53. 
  81. PASTEUR L. (1878). La théorie des germes et ses applications à la médecine et à la chirurgie, lecture faite à l'Academie de Médecine le 30 avril 1878, par M. Pasteur en son nom et au nom de MM. Joubert et Chamberland, G. Masson, Paris. 
  82. PEARL J. (1995). Causal diagrams for empirical research. Biometnka 82 689-709. Zbl0860.62045MR1380809
  83. PERNEGER T. V. (1998). What's wrong with Bonferroni adjustments. Br. Med. J. 316 1236-38. 
  84. POPE C. A., SCHWARTZ J. and RANSOM M. R. (1992). Daily mortality and PM10 pollution in Utah Valley. Archives of Environmental Health 47 211-17. 
  85. QUETELET A. (1835). Sur l'homme et le développement de ses facultés, ou Essai de physique sociale. Bachelier, Paris. 
  86. RAUFMAN J. P. (1998). Cholera. Amer. J. Med. 104 386-94. 
  87. ROBINSON W. S. (1950). Ecological correlations and the behavior of individuals. Amer. Sociol. Rev. 15 351-7. 
  88. ROSENBERG C. E. (1962). The Cholera Years. Chicago University Press. 
  89. ROTHMAN K. J. (1990). No adjustments are needed for multiple comparisons. Epidemiol. 1 43-46. 
  90. ROTHMAN K. J. (1996). Lessons from John Graunt. Lancet 347 37-39. 
  91. ROTHMAN K. J. and GREENLAND S. , eds. (1998). Modern Epidemiology, 2nd. ed. Lippincott-Raven. 
  92. RUBIN D. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology 66 688-701. 
  93. RØJEL J. (1953). The Interrelation between Uterine Cancer and Syphilis. Copenhagen. 
  94. SCHARFSTEIN D. O., ROTNITZKY A. , and ROBINS J. M. (1999). Adjusting for non ignorable drop-out using semiparametric non-response models. J. Amer. Statist. Assoc. to appear. Zbl1072.62644MR1731478
  95. SEMMELWEISS I. (1867). The Etiology, Concept, and Prophylasis of Childbed Fever. Translated by K. C. Carter, University of Wisconsin Press, 1983. 
  96. SNOW J. (1855). On the Mode of Communication of Cholera. Churchill, London. Reprinted by Hafner, New York, 1965. 
  97. STIGLER S. M. (1986). The History of Statistics. Harvard University Press. Zbl0656.62005MR852410
  98. STOLLEY P. (1991). When genius errs. Amer. J. Epidemiol. 133 416-25. 
  99. STOREY A., THOMAS M., KALITA A., HARWOOD C. , GARDIOL D., MANTOVANI F., BREUER J., LEIGH I. M., MATLASHEWSKI G. and BANKS L. (1998). Role of a p53 polymorphism in the development of human papillomavirus-associated cancer. Nature 393 229-34. 
  100. STYER P., MCMILLAN N., GAO F., DAVIS J. and SACKS J. (1995). Effect of outdoor airborne particulate matter on daily death counts. Environ. Health Perspect. 103 490-97. 
  101. TAUBES G. (1995). Epidemiology faces its limits. Science 269, 14 July 1995, pp. 164-9. Letters : 8 Sep 1995, pp. 1325-8. 
  102. TAUBES G. (1998). The (political) science of salt. Science 281, 14 August 1998, pp. 898-907. 
  103. TERRIS M., ed. (1964). Goldberger on Pellagra. Louisiana State University Press. 
  104. VANDENBROUCKE J. P. and PARDOEL V. P. (1989). An autopsy of epidemiologic methods : the case of 'poppers' in the early epidemic of the acquired immunodeficiency syndrome (AIDS). Amer. J. Epidemiol. 129 455-7; and see comments. 
  105. WALD N. and NICOLAIDES-BOUMAN A., eds. (1991). UK Smoking Statistics. 2nd ed., Oxford University Press. 
  106. WlNKELSTElN W. (1995). A new perspective on John Snow's communicable disease theory. Amer. J. Epidemiol. 142 (9 Suppl.) S3-9. 
  107. WYNDER E. L. and GRAHAM E. A. (1950). Tobacco smoking as a possible etiological factor in bronchogenic carcinoma : a study of six hundred and eight-four proved cases. J. Amer. Med. Assoc. 143 329-36. 
  108. WYNDER E. L., CORNFIELD J., SCHROFF P. D. and DORAISWAMI K. R. (1954). A study of environmental factors in carcinoma of the cervix. American Journal of Obstetrics and Gynecology 68 1016-52. 
  109. YULE G. U. (1899). An investigation into the causes of changes in pauperism in England, chiefly during the last two intercensal decades. J. Roy. Statist. Soc. 62 249-95. 

NotesEmbed ?


You must be logged in to post comments.

To embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.

Only the controls for the widget will be shown in your chosen language. Notes will be shown in their authored language.

Tells the widget how many notes to show per page. You can cycle through additional notes using the next and previous controls.


Note: Best practice suggests putting the JavaScript code just before the closing </body> tag.