Records Commentary on: Medical Research Council (1948). Streptomycin treatment of pulmonary tuberculosis: a Medical Research Council investigation. BMJ 2:769-782. Cite as: Chalmers I (2010). Why the 1948 MRC trial of streptomycin for pulmonary tuberculosis used treatment allocation based on random numbers.
The James Lind Library (www.jameslindlibrary.org). Author contact details: Iain Chalmers, James Lind Initiative, Summertown Pavilion, Middle Way, Oxford OX2 7LG, UK.
Email: ichalmers@jameslindlibrary.org There is a view among some medical historians that the emergence of the randomized clinical trial originated from statistical thinking, and that the modern era of controlled trials was essentially ushered in with the iconic randomised trial of streptomycin for pulmonary tuberculosis reported by the British Medical Research Council in 1948. For example:
The conceptualisation of clinical trials as “a seminal statistical idea” which “can be traced back to RA Fisher’s work” has not been demonstrated by these writers or by others. The early history of clinical trials has little to do with statistical theory and much more to do with the more fundamental and less technical concept of a fair – that is, unbiased – test (Chalmers 1997; 1999; 2001; Edwards 2004; 2005; Edwards 2006; Chalmers 2009).
When quantitative methods began to be used at the beginning of the 18th century to assess the effects of variolation authors of the comparisons were sometimes reminded of the need to ensure that like was being compared with like. Thus Massey, challenging the interpretation of comparisons of mortality following variolation and after natural smallpox, wrote:
Several reports of prospective experiments were published during the eighteenth century. In the most celebrated of these James Lind notes that, apart from the treatments, the twelve patients he studied were otherwise similar: “They all in general had putrid gums, the spots and lassitude, with weakness of their knees. They lay together in one place, being a proper apartment for the sick in the fore-hold; and had one diet common to all” (Lind 1753). Lind does not tell us how he allocated his twelve patients to each of the six treatments he compared, but had he cast lots or used alternation or rotation it would not have been inconsistent with the use of these devices to make fair decisions in other contexts (Silverman and Chalmers 2002). At the beginning of the 19th century, Alexander Hamilton reported having used alternation to generate parallel comparison groups in a clinical trial of bloodletting done by him and two surgeon colleagues (Hamilton 1816). He described how sick soldiers had been “admitted, alternately (my emphasis), in such a manner that each of us had one third of the whole” and that “the sick were indiscriminately received”, and “attended as nearly as possible with the same care and accommodated with the same comforts” (Hamilton 1816). Although his report leaves several uncertainties (Milne and Chalmers 2002), it seems reasonable to speculate that he described the use of alternation to show that an effort had been made to generate comparable treatment groups. By the middle of the 19th century, the rationale for alternation was sometimes being made explicit. In 1854, Thomas Graham Balfour described his assessment of whether belladonna could prevent scarlet fever. He divided 151 boys into two comparison groups, “taking them alternately from the list, to avoid the imputation of selection” (my emphasis) (Balfour 1854). It is clear from these words that Balfour used alternation to control selection bias. This is not a statistical concept, and although Balfour was a distinguished statistician as well as a doctor, he can not be regarded as a theoretical statistician in the ‘Pearsonian/Fisherian’ sense (Chalmers and Toth 2009). There are further isolated examples of alternation being used to generate treatment comparison groups during the last half of the 19th century, but they became increasingly common during the first half of the 20th century (link to allocation bias records). Indeed, alternation as a feature of research design became referred to formally in English not only simply as ‘alternation’ (Bullowa 1928), but also as ‘the alternate method’, ‘rational alternation’ (Choksy 1908), and ‘the alternate case method’ (Choksy 1908; Cecil and Plummer 1930). In French it was referred to as ‘la méthode alternante’ (Cousin 1905; Netter 1906); and in German as ‘Simultanmethode’ (Wagner-Jauregg 1931). It is worth noting that designation of this methodological principle occurred before the theoretical statistical qualities of random allocation had been promoted in Ronald Fisher’s The Design of Experiments (Fisher 1935). Indeed, even though the word ‘random’ sometimes appeared in reports of controlled trials before the late 1940s, it was often actually alternation that was being used for allocation (Armitage 2002). Unsurprisingly therefore, the use of alternation was reflected in articles and a book published by the Lancet in 1937, written by the father of medical statistics in Britain, Austin Bradford Hill:
Of the two essential components of unbiased allocation – genesis of an unbiased sequence, and unbiased implementation of the sequence - the former remains a trivially easy task, while the latter will continue to pose challenges (Chalmers 2009). Hill was aware of this. In an internal report for the MRC dated 22 Dec 1933, he Hill expressed concern about the allocation of patients to comparison groups in a MRC study of serum treatment for pneumonia in which alternation should have been used (MRC Therapeutic trials Committee 1934). Imbalance in the sizes of the comparison groups made clear that alternation had not been strictly observed, prompting Hill to stress in his memorandum that greater effort should be taken “that the division of cases really did ensure a random selection.” In others words, to control allocation bias successfully, Hill realised that it is crucially important to conceal the allocation schedule from those involved in entering participants, thus preventing foreknowledge of allocations.
Note and Acknowledgments
References Armitage P (2002). Randomisation and alternation: a note on Diehl et al. The James Lind Library (www.jameslindlibrary.org). Balfour TG (1854). Quoted in West C. Lectures on the Diseases of Infancy and Childhood. London, Longman, Brown, Green and Longmans, p 600. Bullowa JGM (1928). The use of antipneumococcic refined serum in lobar pneumonia: data necessary for a comparison between cases treated with serum and cases not so treated, and the importance of a significant control series of cases. JAMA 90:1354-1358. Cecil RL, Plummer N (1930). Pneumococcus Type I pneumonia - a study of eleven hundred and sixty-one cases, with especial reference to specific therapy. JAMA 95:1547-1553. Chalmers I (1997). Assembling comparison groups to assess the effects of health care. Journal of the Royal Society of Medicine 1997;90:379-386. Chalmers I (1999). Why transition from alternation to randomisation in clinical trials was made. BMJ 1999;319:1372. Chalmers I (2001). Comparing like with like: some historical milestones in the evolution of methods to create unbiased comparison groups in therapeutic experiments. International Journal of Epidemiology 30:1156-1164. Chalmers I (2005). Statistical theory was not the reason that randomisation was used in the British Medical Research Council’s clinical trial of streptomycin for pulmonary tuberculosis. In: Jorland G, Opinel A, Weisz G, eds. Body counts: medical quantification in historical and sociological perspectives. Montreal: McGill-Queens University Press, pp 309-334. Chalmers I (2009). Explaining the unbiased creation of treatment comparison groups. Lancet 2009;374:1670-71. Chalmers I, Clarke M (2004). The 1944 Patulin Trial: the first properly controlled multicentre trial conducted under the aegis of the British Medical Research Council. International Journal of Epidemiology 32:253-260. Chalmers I, Toth B (2009). 19th century controlled trials to test whether belladonna prevents scarlet fever. The James Lind Library (www.jameslindlibrary.org). Choksy KBNH (1908). On recent progress in serum-therapy of plague. BMJ 1:1282-1284. Clarke M (2004). The 1944 patulin trial of the British Medical Research Council: an example of how concerted common purpose can get reliable answers to important questions very quickly. The James Lind Library (www.jameslindlibrary.org). Cousin M (1905). Des éruptions consecutives aux injections de sérum antidiphthérique et de leur traitement prophylactique par l’ingestion de clorure de calcium. Thèse pour le Doctorat en médicine. Paris: Jules Rousset, 36-44. Cox DR (2009). Randomization for concealment. James Lind Library (www.jameslindlibrary.org). Doll R (1984). The controlled trial. Postgraduate Medical Journal 60:719-724. Doll R (1991). Development of therapeutic trials in preventive and therapeutic medicine. Doll R (2002). The role of data monitoring committees. In L Duley, B Farrell, eds., Clinical Trials. London: BMJ Books, 97-104. Edwards MV (2004). Control and the therapeutic trial, 1918-1948. MD thesis, University of London. Edwards MV (2006). Control and the therapeutic trial. Amsterdam: Rodopi. Fisher RA (1935). The design of experiments. Edinburgh: Oliver and Boyd. Gaudillière J-P (2001). Beyond one-case statistics: mathematics, medicine, and the management of health and disease in the postwar era. In: U Bottazzini, AD Hamilton AL (1816). Dissertatio medica inauguralis de synocho castrensi (Inaugural medical dissertation on camp fever). Edinburgh: J Ballantyne. Hill AB (1937). Principles of medical statistics. London: Lancet. Lind J (1753). A treatise of the scurvy. In three parts. Containing an inquiry into the nature, causes and cure, of that disease. Together with a critical and chronological view of what has been published on the subject. Edinburgh: Printed by Sands, Murray and Cochran for A Kincaid and A Donaldson. Magnello E (2002). The introduction of mathematical statistics into medical research: the roles of Karl Pearson, Major Greenwood and Austin Bradford Hill. In: E Magnello and A Hardy (eds). The road to medical statistics. Rodopi: Amsterdam. Marks HM (2000). The progress of experiment. Cambridge: Cambridge University Press. Massey I (1723). A short and plain account of inoculation. With some remarks on the main argument made use of to recommend that practice, by Mr. Maitland and others. To which is added, a letter to the learned James Jurin, M.D.R.S. Secr. Col. Reg. Med. Lond. Soc. In answer to his letter to the learned Dr. Cotesworth, and his comparison between the mortality of natural and inoculated small pox. The second edition. London: W. Meadows. Medical Research Council (1944). Clinical trial of patulin in the common cold. Lancet 2:373-5. Medical Research Council (1948). Streptomycin treatment of pulmonary tuberculosis: a Medical Research Council investigation. BMJ 2:769-782. Medical Research Council Therapeutic Trials Committee (1934). The serum treatment of lobar pneumonia. BMJ 1:241-245. Milne I, Chalmers I (2002). Hamilton's report of a controlled trial of bloodletting, 1816. The James Lind Library (www.jameslindlibrary.org). Netter A (1906). Efficacité de l’ingestion de chlorure de calcium comme moyen préventif des éruptions consecutives aux injections de sérum. Séances et Mémoires de la Société de Biologie 58:279-280. Petrarca F (1364). Rerum Senilium Libri. Liber XIV, Epistola 1. Letter to Boccaccio (V.3). Podolsky SH (2006). Pneumonia before Antibiotics: Therapeutic Evolution and Evaluation in Twentieth-Century America. Baltimore: Johns Hopkins University Press.
Rosser Matthews J (1995). Quantification and the quest for medical certainty. Princeton, New Jersey: Princeton University Press. Silverman WA, Chalmers I (2002). Casting and drawing lots: a time-honoured way of dealing with uncertainty and for ensuring fairness. The James Lind Library (www.jameslindlibrary.org). Wagner-Jauregg J (1931). Ueber die Infektionsbehandlung der progressiven Paralyse [On infection treatment of progressive paralysis]. Münchener Medizinische Wochenschrift 78:4-7.
|
|||||||