1
THE BIRTH AND PROPAGATION OF THE OSCE
āThose who do not remember the past are condemned to repeat it.ā
George Santayana
In this opening chapter, our aim is to present a broad overview of the historical context of the birth of the OSCE, especially the factors that contributed to the development and propagation of the OSCE. We firmly believe that understanding the historical background is important for various reasons:
ā¢ It allows us to analyze the preceding events leading to the development of the OSCE.
ā¢ We develop a deeper understanding of the context of the OSCE in contemporary medical education and assessment.
ā¢ We learn from previous successes and mistakes, so that we can meaningfully implement the OSCE in our own settings.
ā¢ It prompts us to appreciate the enormous contributions of early doyens of medical education.
At the end of the chapter, we should be able to:
(i) Recognize how the shortcomings of traditional clinical examinations led to the birth of the OSCE.
(ii) Evaluate some of the seminal publications and events related to the development of the OSCE.
(iii) Compare and contrast early OSCE formats with contemporary OSCE formats.
The Birth
The development of the OSCE is rightly and widely credited to Professor Ronald McGregor Harden. In a seminal paper, published in the British Medical Journal in 1975, Harden et al. described a new approach to the assessment of clinical competence. However, even before the publication of this paper, there were considerable discussions in the medical literature highlighting the many shortfalls of traditional clinical examinations (Fleming et al., 1974; Harden et al., 1969; Wilson et al., 1969). In traditional clinical examinations, there are typically one or two long cases where the candidate takes a history, performs a physical examination, reviews investigations, generates a differential diagnosis, and discusses the findings, interpretation, and general management options with the examiners. The long cases are supplemented with a few short cases, where the candidateās task is more defined with a focused history or physical examination followed by a brief discussion with the examiners.
The traditional clinical examinations have several drawbacks. Firstly, it is difficult to have certains types of real patients such as patients with shortness of breath or acute chest pain during the clinical examination. Secondly, the decision about a candidateās competency in clinical settings is based on a very limited number of encounters. Thirdly, the lack of clear instructions to the candidates and to the examiners renders the marking of candidates vulnerable to the examinersā moods, personal preferences, and prejudices. Fourthly, the long cases, which typically use a patient from the ward or a previously known patient who has been called back from home, do not really represent the variety of patients a graduating doctor is expected to manage in real life. They are also heavily skewed towards in-patients. Finally, the logistical and practical difficulties of using real patients for the entire duration of the examination can be a daunting task and is ethically questionable.
Harden et al. (1975) summed up the above observations that traditional clinical examinations, with two examiners assessing the candidateās skills on only a few patients, are often āluck of the draw.ā The lack of clear instructions and prior discussions about the patient among the examiners also creates confusions regarding what should be tested and the expected level of competency required to pass the examination. In short, the crucial pass/fail decision generated during traditional clinical examinations is often arbitrary and depends on multiple confounding factors, including variability among the examiners, varying levels of complexity of patients, and the nature of illnesses. Simply speaking, such examinations are educationally, morally, and legally indefensible.
Harden was the Head of Division of Clinical Medical Education at the University of Dundee, UK (interview with Harden; Vimeo), when he and his colleagues wrote the groundbreaking paper on the OSCE in 1975. To counter some of the ills of the traditional clinical examination, Harden et al. (1975) proposed an āobjective structured examination.ā Interestingly, the term āOSCEā did not appear in the original paper. The format of the clinical examination first proposed by Harden et al. was different from what we might be familiar with now. It had 16 stations (the number 16 was chosen because it was āconvenientā). The duration at each station was 5 minutes. However, the stations included a mixture of clinical skills stations, observations/inspections of clinical materials, such as colored photographs, and written questions in the form of true/false multiple choice questions (MCQs). Thus, a typical configuration would look like this (Fig. 1.1).
Fig. 1.1. Basic configuration of the early OSCE.
Each of these stations had clear instructions to the candidate, such as āAuscultate the precordium for evidence of valvular disease,ā after which the candidate would move onto another station, where they would answer questions relating to their findings from the previous station. The questions could be open-ended or multiple choice type, although, for convenience, true/false multiple choice-type questions were often used (Harden et al., 1975). The examinersā checklist was rather simple in early examinations, with only āyesā and ānoā options for each of the tasks performed. The checklist was later modified to allow a āqualified yes.ā The new format of objective structured examination was piloted with a limited number of volunteer students and examiners. It received an enthusiastic response from the students and somewhat lukewarm response from the examiners. Students viewed the format as fairer and less dependent on luck (interview with Harden; Vimeo) and supported its implementation.
We should also review another early development that took place in North America that preceded the publication of the landmark paper by Harden et al. in 1975. This had a profound effect on the OSCE format as we practice now. Howard Barrows was an Assistant Professor of Neurology and Stephen Abrahamson was the Director of Research in Medical Education at the University of Southern California, School of Medicine, Los Angeles, CA, USA. They experimented with āprogrammed patientsā as a way of appraising studentsā performance in clinical neurology (Barrows and Abrahamson, 1964). They argued, among others, that patients are inherently prone to presenting their findings variably to the students and that an ideal patient that suits the needs of the examination is often hard to come by.
Therefore, they suggested training āprogrammed patients,ā which involved āsimulation of disease by a normal person who is trained to assume and present, on examination, the history and neurological findings of an actual patient in the manner of an actual patientā (Barrows and Abrahamson, 1964). Programmed patients, as we now know, have been expanded to include both āstandardized patientsā and āsimulated patients.ā Barrows and Abrahamson used programmed patients to āobtain appraisal of the studentās clinical performanceā ā a feat that might seem revolutionary to some, even four decades later. Further down the road, simulation and mannequins would become an integral part of the repository of clinical materials that can be tested during a clinical examination.
Programmed patients brought greater standardization to the clinical presentation of patients and reduced the variability that is inherent with the use of real patients. Harden et al. merged two emerging assessment techniques of that age: Programmed patients for standardization and multi-station examination by multiple examiners working independently in order to assess several domains at the same time (Hodges, 2003). Although these two ideas, taken separately, were neither novel nor revolutionary, marrying them together in a systematic manner to test clinical competency was unique (Hodges, 2003).
ā¢ Standardized patients: real patients or actors portraying findings in a standardized manner
ā¢ Simulated patients: trained actors acting as patients
ā¢ Simulation: enacting a clinical scenario
ā¢ Simulator: mechanical devises, mannequins with or without intelligent features
OSCE as a Global Phenomenon
Acceptance of the OSCE was further bolstered by continuous research and refinements, not only regarding the OSCE, but also regarding the nature of clinical competencies, appreciation of holistic roles of physicians, and psychometrics of assessment in medical education (Khan et al., 2013). Some of the crucial evidence related to the OSCE, as a method of student assessment, will be discussed in the subsequent chapters.
In the last two decades, anecdotal observations and professional experience support the notion that the pace of adoption of the OSCE as a preferred method of student assessment has accelerated considerably across the globe, from undergraduate to postgraduate settings. Many professional bodies and national licensing examinations have endorsed and adopted the OSCE or its variations as a standard clinical assessment across the spectrum of physician training. For example, after years of reliance on knowledge assessment through MCQs, most medical schools in the USA now use the OSCE or similar examinations for clinical assessment (Turner and Dankoski, 2008). National licensing authorities, including the National Board of Medical Examiners, USA (USMLE 2 CS), the Medical Council of Canada (Medical Council of Canada Qualifying Examination Part II), the General Medical Council, UK (PLAB Part 2), among others, are now regularly using the OSCE or its variants in high-stakes summative decision-making.
Key milestones
ā¢ 1979: Publication of āAssessment of Clinical Competence Using an Objective Structured Clinical Examination (OSCE)ā as an Association for the Study of Medical Education (ASME) Guide by Harden and Gleeson
ā¢ 1985: 1st Ottawa Conference on Assessment of Clinical Compe tency brought the idea of the OSCE to North America
ā¢ 1998: Australian Medical Council conducted an OSCE for the first time for licensing foreign medical graduates to practice in Australia
ā¢ 1998: Education Commission for Foreign Medical Graduates, USA (ECFMG) conducted large-scale OSCEs in the USA for the first time
ā¢ 2000: Publication of ACGME Toolbox of Assessment Methods (2000) endorsed the OSCE as the one of the preferred methods of assessment of clinical competency
Summary
The OSCE did not originate in a vacuum. Much impetus towards the development of the OSCE came from the recognition of many serious flaws associated with unstructured, unstandardized traditional clinical examinations, where the decision often depended on a sole examiner (or a few examiners) examining one or few aspects of clinical competency. The OSCE reduced bias, provided clearer instruction to the examiners and candidates, and changed the perception that āclinical examination is often a game of luck.ā
References
Barrows HS, Abrahamson S. (1964) The programmed patient: A technique for appraising studentsā performance in clinical neurology. J Med Educ 39: 802ā804.
Fleming PR, Manderson WG, Matthews MB, Sanderson PH, Stokes JF. (1974) Evolution of an examination: M.R.C.P. (U.K.). Br Med J 13; 2(5910): 99ā102.
Harden RM, Gleeson FA. (1979) Assessment of clinical competence using an objective structured clinical examination (OSCE). Medical Education Booklet No. 8. Med Educ 13(1): 41ā54.
H...