[1]
Archer, J. et al. 2008. mini-PAT (Peer Assessment Tool): A Valid Component of a National Assessment Programme in the UK? Advances in Health Sciences Education. 13, 2 (May 2008), 181–192. DOI:https://doi.org/10.1007/s10459-006-9033-3.
[2]
Archer, J.C. 2005. Use of SPRAT for peer review of paediatricians in training. BMJ. 330, 7502 (May 2005), 1251–1253. DOI:https://doi.org/10.1136/bmj.38447.610451.8F.
[3]
Association for the Study of Medical Education 2014. Understanding medical education: evidence, theory and practice. Wiley Blackwell.
[4]
Bandaranayake, R.C. 2008. Setting and maintaining standards in multiple choice examinations: AMEE Guide No. 37. Medical Teacher. 30, 9–10 (2008), 836–845. DOI:https://doi.org/10.1080/01421590802402247.
[5]
Black, H. D. et al. 1986. Assessment purposes: a study of the relationship between diagnostic assessment and summative assessment for certification. Scottish Council for Research in Education.
[6]
Bloom, Benjamin S. 1956. Taxonomy of educational objectives: the classification of educational goals, Handbook 1: Cognitive domain. Longman Group Ltd.
[7]
du Boulay, C. 2000. From CME to CPD: getting better at getting better? BMJ. 320, 7232 (Feb. 2000), 393–394. DOI:https://doi.org/10.1136/bmj.320.7232.393.
[8]
Breakwell, Glynis M. et al. 2012. Research methods in psychology. SAGE.
[9]
Jolly, B. 1999. Clinical logbooks: recording clinical experiences may not be enough. Medical Education. 33, 2 (Feb. 1999), 86–88. DOI:https://doi.org/10.1046/j.1365-2923.1999.00386.x.
[10]
Brigden, D. 1999. Constructing a learning portfolio. BMJ. 319, 7201 (Jul. 1999), 2a–2a. DOI:https://doi.org/10.1136/bmj.319.7201.2a.
[11]
Bullock, A.D. et al. 2009. How ratings vary by staff group in multi-source feedback assessment of junior doctors. Medical Education. 43, 6 (Jun. 2009), 516–520. DOI:https://doi.org/10.1111/j.1365-2923.2009.03333.x.
[12]
Campbell, L.M. et al. 1995. Use of videotaped consultations in summative assessment of trainees in general practice. British Journal of General Practice. 45, 392 (1995), 137–141.
[13]
Cangelosi, J.S. Designing tests for evaluating student achievement. Longman.
[14]
Challis, M. 1999. AMEE Medical Education Guide No.11 (revised): Portfolio-based learning and assessment in medical education. Medical Teacher. 21, 4 (Jan. 1999), 370–386. DOI:https://doi.org/10.1080/01421599979310.
[15]
Challis, M. 2001. Portfolios and assessment: meeting the challenge. Medical Teacher. 23, 5 (Jan. 2001), 437–440. DOI:https://doi.org/10.1080/01421590120075643.
[16]
Cleland, J.A. et al. 2008. Is it me or is it them? Factors that influence the passing of underperforming students. Medical Education. 42, 8 (Aug. 2008), 800–809. DOI:https://doi.org/10.1111/j.1365-2923.2008.03113.x.
[17]
Cohen-Schotanus, J. and van der Vleuten, C.P.M. 2010. A standard setting method with the best performing students as a point of reference: Practical and affordable. Medical Teacher. 32, 2 (2010), 154–160. DOI:https://doi.org/10.3109/01421590903196979.
[18]
Cox, M. et al. 2007. Assessment in Medical Education. New England Journal of Medicine. 356, 4 (2007), 387–396. DOI:https://doi.org/10.1056/NEJMra054784.
[19]
Cronbach, L.J. and Meehl, P.E. 1955. Construct validity in psychological tests. Psychological Bulletin. 52, 4 (1955), 281–302. DOI:https://doi.org/10.1037/h0040957.
[20]
Crossley, J. et al. 2005. Children and their parents assessing the doctor-patient interaction: a rating system for doctors’ communication skills. Medical Education. 39, 8 (Aug. 2005), 820–828. DOI:https://doi.org/10.1111/j.1365-2929.2005.02230.x.
[21]
Daelmans, H.E.M. et al. 2004. Feasibility and reliability of an in-training assessment programme in an undergraduate clerkship. Medical Education. 38, 12 (Dec. 2004), 1270–1277. DOI:https://doi.org/10.1111/j.1365-2929.2004.02019.x.
[22]
David, M.F.B. et al. 2001. AMEE Medical Education Guide No. 24: Portfolios as a method of student assessment. Medical Teacher. 23, 6 (Jan. 2001), 535–551. DOI:https://doi.org/10.1080/01421590120090952.
[23]
Davies, H. et al. 2008. Specialty-specific multi-source feedback: assuring validity, informing training. Medical Education. 42, 10 (2008), 1014–1020. DOI:https://doi.org/10.1111/j.1365-2923.2008.03162.x.
[24]
Dory, V. et al. 2010. Is case-specificity content-specificity? An analysis of data from extended-matching questions. Advances in Health Sciences Education. 15, 1 (Mar. 2010), 55–63. DOI:https://doi.org/10.1007/s10459-009-9169-z.
[25]
Downing, S.M. 2004. Reliability: on the reproducibility of assessment data. Medical Education. 38, 9 (Sep. 2004), 1006–1012. DOI:https://doi.org/10.1111/j.1365-2929.2004.01932.x.
[26]
Downing, S.M. et al. 2006. RESEARCH METHODOLOGY: Procedures for Establishing Defensible Absolute Passing Scores on Performance Examinations in Health Professions Education. Teaching and Learning in Medicine. 18, 1 (Jan. 2006), 50–57. DOI:https://doi.org/10.1207/s15328015tlm1801_11.
[27]
Downing, S.M. and Haladyna, T.M. 2004. Validity threats: overcoming interference with proposed interpretations of assessment data. Medical Education. 38, 3 (Mar. 2004), 327–333. DOI:https://doi.org/10.1046/j.1365-2923.2004.01777.x.
[28]
Driessen, E. et al. 2005. The use of qualitative research criteria for portfolio assessment as an alternative to reliability evaluation: a case study. Medical Education. 39, 2 (Feb. 2005), 214–220. DOI:https://doi.org/10.1111/j.1365-2929.2004.02059.x.
[29]
Driessen, E. et al. 2003. Use of portfolios in early undergraduate medical training. Medical Teacher. 25, 1 (Jan. 2003), 18–23. DOI:https://doi.org/10.1080/0142159021000061378.
[30]
Driessen, E.W. et al. 2006. Validity of portfolio assessment: which qualities determine ratings? Medical Education. 40, 9 (Sep. 2006), 862–866. DOI:https://doi.org/10.1111/j.1365-2929.2006.02550.x.
[31]
Driessen, E.W. et al. 2007. Web- or paper-based portfolios: is there a difference? Medical Education. 41, 11 (Nov. 2007), 1067–1073. DOI:https://doi.org/10.1111/j.1365-2923.2007.02859.x.
[32]
Epstein, R.M. 2007. Assessment in Medical Education. New England Journal of Medicine. 356, 4 (2007), 387–396. DOI:https://doi.org/10.1056/NEJMra054784.
[33]
Evans, R. 2004. Review of instruments for peer assessment of physicians. BMJ. 328, 7450 (May 2004). DOI:https://doi.org/10.1136/bmj.328.7450.1240.
[34]
Farmer, E.A. and Page, G. 2005. A practical guide to assessing clinical decision-making skills using the key features approach. Medical Education. 39, 12 (Dec. 2005), 1188–1194. DOI:https://doi.org/10.1111/j.1365-2929.2005.02339.x.
[35]
Farmer, E.A. and Page, G. 2005. A practical guide to assessing clinical decision-making skills using the key features approach. Medical Education. 39, 12 (2005), 1188–1194. DOI:https://doi.org/10.1111/j.1365-2929.2005.02339.x.
[36]
Freeman, Richard T. and Lewis, Roger 1998. Planning and implementing assessment. Kogan Page.
[37]
Friedman Ben-David, M. 2000. AMEE Guide No. 18: Standard setting in student assessment. Medical Teacher. 22, 2 (2000), 120–130. DOI:https://doi.org/10.1080/01421590078526.
[38]
Rasch, G. 1980. Probabilistic models for some intelligence and attainment tests. University of Chicago Press.
[39]
Gagnon, R. et al. 2006. The Cognitive Validity of the Script Concordance Test: A Processing Time Study. Teaching and Learning in Medicine. 18, 1 (Jan. 2006), 22–27. DOI:https://doi.org/10.1207/s15328015tlm1801_6.
[40]
Govaerts, M.J.B. et al. 2007. Broadening Perspectives on Clinical Performance Assessment: Rethinking the Nature of In-training Assessment. Advances in Health Sciences Education. 12, 2 (Feb. 2007), 239–260. DOI:https://doi.org/10.1007/s10459-006-9043-1.
[41]
Hart, I.R. 1992. Trends in clinical assessment. Approaches to the Assessment of Clinical Competence, Part 1 and 2.
[42]
Hays, R.B. 2004. Reflecting on learning portfolios. Medical Education. 38, 8 (Aug. 2004), 801–803. DOI:https://doi.org/10.1111/j.1365-2929.2004.01831.x.
[43]
Hill, F. et al. 2009. Implementing the undergraduate mini-CEX: a tailored approach at Southampton University. Medical Education. 43, 4 (Apr. 2009), 326–334. DOI:https://doi.org/10.1111/j.1365-2923.2008.03275.x.
[44]
Hurley, K.F. 2011. OSCE and clinical skills handbook. Elsevier/Saunders.
[45]
Jolly, Brian and Grant, Janet 1997. The good assessment guide: a practical guide to assessment and appraisal for higher specialist training. Joint Centre for Education in Medicine.
[46]
Kogan, J.R. et al. 2009. Tools for Direct Observation and Assessment of Clinical Skills of Medical Trainees. JAMA. 302, 12 (Sep. 2009). DOI:https://doi.org/10.1001/jama.2009.1365.
[47]
Liu, M. and Liu, K.-M. 2008. Setting Pass Scores for Clinical Skills Assessment. The Kaohsiung Journal of Medical Sciences. 24, 12 (Dec. 2008), 656–663. DOI:https://doi.org/10.1016/S1607-551X(09)70032-4.
[48]
Livingston, S.A. and Zieky, M.J. 1982. Passing scores. ETS.
[49]
Mathers, N.J. et al. 1999. Portfolios in continuing medical education - effective and efficient? Medical Education. 33, 7 (Jul. 1999), 521–530. DOI:https://doi.org/10.1046/j.1365-2923.1999.00407.x.
[50]
Messick, S. Validity. Educational Measurement (The American Council on Education/Macmillan series on higher education). Macmillan USA.
[51]
Miller, G.E. 1990. The assessment of clinical skills/competence/performance. Academic Medicine. 65, (1990), 63–67.
[52]
Moonen-van Loon, J.M.W. et al. 2013. Composite reliability of a workplace-based assessment toolbox for postgraduate medical education. Advances in Health Sciences Education. 18, 5 (Dec. 2013), 1087–1102. DOI:https://doi.org/10.1007/s10459-013-9450-z.
[53]
Murphy, D.J. et al. 2009. The reliability of workplace-based assessment in postgraduate medical education and training: a national evaluation in general practice in the United Kingdom. Advances in Health Sciences Education. 14, 2 (May 2009), 219–232. DOI:https://doi.org/10.1007/s10459-008-9104-8.
[55]
National Board of Medical Examiners. Constructing Written Test Questions for the Basic and Clinical Sciences.
[56]
Norcini, J.J. 2003. ABC of learning and teaching in medicine: Work based assessment. BMJ. 326, 7392 (Apr. 2003), 753–755. DOI:https://doi.org/10.1136/bmj.326.7392.753.
[57]
Norcini, J.J. 2003. Setting standards on educational tests. Medical Education. 37, 5 (May 2003), 464–469. DOI:https://doi.org/10.1046/j.1365-2923.2003.01495.x.
[58]
Norcini, J.J. 2003. Setting standards on educational tests. Medical Education. 37, (2003), 464–469. DOI:https://doi.org/10.1046/j.1365-2923.2003.01495.x.
[59]
Norcini, J.J. 1995. The Mini-CEX (Clinical Evaluation Exercise): A Preliminary Investigation. Annals of Internal Medicine. 123, 10 (Nov. 1995). DOI:https://doi.org/10.7326/0003-4819-123-10-199511150-00008.
[60]
Nunnally, Jum C. and Bernstein, Ira H. 1994. Psychometric theory. McGraw-Hill.
[61]
O’Sullivan, P.S. et al. 2004. Demonstration of Portfolios to Assess Competency of Residents. Advances in Health Sciences Education. 9, 4 (Dec. 2004), 309–323. DOI:https://doi.org/10.1007/s10459-004-0885-0.
[62]
Ozuah, P.O. and Reznik, M. 2008. Using unannounced standardised patients to assess residents’ professionalism. Medical Education. 42, 5 (May 2008), 532–533. DOI:https://doi.org/10.1111/j.1365-2923.2008.03083.x.
[63]
Pearson, D.J. and Heywood, P. 2004. Portfolio use in general practice vocational training: a survey of GP registrars. Medical Education. 38, 1 (Jan. 2004), 87–95. DOI:https://doi.org/10.1111/j.1365-2923.2004.01737.x.
[64]
Peile, E. 2006. Knowing and knowing about. BMJ. 332, 7542 (2006), 645–645. DOI:https://doi.org/10.1136/bmj.332.7542.645.
[65]
Pitts, J. et al. 1999. Educational portfolios in the assessment of general practice trainers: reliability of assessors. Medical Education. 33, 7 (Jul. 1999), 515–520. DOI:https://doi.org/10.1046/j.1365-2923.1999.00445.x.
[66]
Pitts, J. et al. 2001. Enhancing reliability in portfolio assessment: ‘shaping’ the portfolio. Medical Teacher. 23, 4 (Jan. 2001), 351–356. DOI:https://doi.org/10.1080/01421590120057021.
[67]
Pitts, John and Association for the Study of Medical Education 2007. Portfolios, personal development and reflective practice. ASME.
[68]
Postgraduate Medical Education and Training Board 2007. Developing and maintaining an assessment system - a PMETB guide to good practice. Postgraduate Medical Education and Training Board.
[69]
Postgraduate Medical Education and Training Board 2007. Developing and maintaining an assessment system - a PMETB guide to good practice. PMETB.
[70]
Postgraduate Medical Education and Training Board 2009. Workplace Based Assessment: A Guide for Implementation. PMETB.
[71]
Ramsey, P.G. 1993. Use of Peer Ratings to Evaluate Physician Performance. JAMA: The Journal of the American Medical Association. 269, 13 (Apr. 1993). DOI:https://doi.org/10.1001/jama.1993.03500130069034.
[72]
Rees, C. 2005. The use (and abuse) of the term ‘portfolio’. Medical Education. 39, 4 (Apr. 2005), 436–436. DOI:https://doi.org/10.1111/j.1365-2929.2005.02119.x.
[73]
Research Methods - Validity and Reliability in AllPsych Online: https://allpsych.com/research-methods/variablesvalidityreliability/validityreliability/.
[74]
Research Methods Knowledge Base: http://www.socialresearchmethods.net/kb/.
[75]
Rethans, J.-J. et al. 2002. The relationship between competence and performance: implications for assessing practice performance. Medical Education. 36, 10 (Oct. 2002), 901–909. DOI:https://doi.org/10.1046/j.1365-2923.2002.01316.x.
[76]
Rethans, J.-J. et al. 2002. The relationship between competence and performance: implications for assessing practice performance. Medical Education. 36, (2002), 901–909. DOI:https://doi.org/10.1046/j.1365-2923.2002.01316.x.
[77]
Richards, S.H. et al. 2009. A multi-method analysis of free-text comments from the UK General Medical Council Colleague Questionnaires. Medical Education. 43, 8 (Aug. 2009), 757–766.
[78]
Ricketts, C. 2009. A plea for the proper use of criterion-referenced tests in medical assessment. Medical Education. 43, 12 (Dec. 2009), 1141–1146. DOI:https://doi.org/10.1111/j.1365-2923.2009.03541.x.
[79]
Ringsted, C. et al. 2004. Educational impact of in-training assessment (ITA) in postgraduate medical education: a qualitative study of an ITA programme in actual practice. Medical Education. 38, 7 (Jul. 2004), 767–777. DOI:https://doi.org/10.1111/j.1365-2929.2004.01841.x.
[80]
Roberts, C. et al. 2002. Portfolio-based assessments in medical education: are they valid and reliable for summative purposes? Medical Education. 36, 10 (Oct. 2002), 899–900. DOI:https://doi.org/10.1046/j.1365-2923.2002.01288.x.
[81]
Rowntree, Derek 1987. Assessing students: how shall we know them? Kogan Page.
[82]
Messick, S. The Psychology of Educational Measurement. Journal of Educational Measurement. 21, 3, 215–237.
[83]
Schuwirth, Lambert W. T. et al. 2006. How to design a useful test: the principles of assessment. ASME.
[85]
Schuwirth, L.W. 2004. Assessing medical competence: finding the right answers. The Clinical Teacher. 1, 1 (Jun. 2004), 14–18. DOI:https://doi.org/10.1111/j.1743-498X.2004.00012.x.
[86]
Schuwirth, L.W. 2004. Assessing medical competence: finding the right answers. The Clinical Teacher. 1, 1 (2004), 14–18. DOI:https://doi.org/10.1111/j.1743-498X.2004.00012.x.
[87]
Schuwirth, L.W.T. 2003. ABC of learning and teaching in medicine: Written assessment. BMJ. 326, 7390 (Mar. 2003), 643–645. DOI:https://doi.org/10.1136/bmj.326.7390.643.
[88]
Schuwirth, L.W.T. and van der Vleuten, C.P.M. 2006. A plea for new psychometric models in educational assessment. Medical Education. 40, 4 (Apr. 2006), 296–300. DOI:https://doi.org/10.1111/j.1365-2929.2006.02405.x.
[89]
Schuwirth, L.W.T. and van der Vleuten, C.P.M. 2003. ABC of learning and teaching in medicine: Written assessment. BMJ. 326, 7390 (2003), 643–645.
[90]
Schuwirth, L.W.T. and van der Vleuten, C.P.M. 2011. General overview of the theories used in assessment: AMEE Guide No. 57. Medical Teacher. 33, 10 (Oct. 2011), 783–797. DOI:https://doi.org/10.3109/0142159X.2011.611022.
[91]
Schuwirth, L.W.T. and van der Vleuten, C.P.M. 2012. Programmatic assessment and Kane’s validity perspective. Medical Education. 46, 1 (Jan. 2012), 38–48. DOI:https://doi.org/10.1111/j.1365-2923.2011.04098.x.
[92]
Snadden, D. 1999. Portfolios - attempting to measure the unmeasurable? Medical Education. 33, 7 (Jul. 1999), 478–479. DOI:https://doi.org/10.1046/j.1365-2923.1999.00446.x.
[93]
Snadden, D. and Thomas, M.L. 1998. Portfolio learning in general practice vocational training - does it work? Medical Education. 32, 4 (Jul. 1998), 401–406. DOI:https://doi.org/10.1046/j.1365-2923.1998.00245.x.
[94]
Tighe, J. et al. 2010. The standard error of measurement is a more appropriate measure of quality for postgraduate medical assessments than is reliability: an analysis of MRCP(UK) examinations. BMC Medical Education. 10, 1 (2010). DOI:https://doi.org/10.1186/1472-6920-10-40.
[95]
Tweed, M. and Wilkinson, T. 2009. A randomized controlled trial comparing instructions regarding unsafe response options in a MCQ examination. Medical Teacher. 31, 1 (2009), 51–54. DOI:https://doi.org/10.1080/01421590802146018.
[96]
van der Vleuten, C.P.M. 1996. The assessment of professional competence: Developments, research and practical implications. Advances in Health Sciences Education. 1, 1 (1996), 41–67. DOI:https://doi.org/10.1007/BF00596229.
[97]
Webb, C. et al. 2003. Evaluating portfolio assessment systems: what are the appropriate criteria? Nurse Education Today. 23, 8 (Nov. 2003), 600–609. DOI:https://doi.org/10.1016/S0260-6917(03)00098-4.
[98]
Webb, C. et al. 2002. Models of portfolios. Medical Education. 36, 10 (Oct. 2002), 897–898. DOI:https://doi.org/10.1046/j.1365-2923.2002.01318.x.
[99]
Whitehouse, A. et al. 2007. 360 degree assessment (multisource feedback) of UK trainee doctors: Field testing of team assessment of behaviours (TAB). Medical Teacher. 29, 2–3 (2007), 171–176. DOI:https://doi.org/10.1080/01421590701302951.
[100]
Wilkinson, T.J. et al. 2002. The use of portfolios for assessment of the competence and performance of doctors in practice. Medical Education. 36, 10 (Oct. 2002), 918–924. DOI:https://doi.org/10.1046/j.1365-2923.2002.01312.x.
[101]
Wood, T.J. et al. 2006. Standard Setting in a Small Scale OSCE: A Comparison of the Modified Borderline-Group Method and the Borderline Regression Method. Advances in Health Sciences Education. 11, 2 (May 2006), 115–122. DOI:https://doi.org/10.1007/s10459-005-7853-1.
[102]
Developing and maintaining an assessment system. Postgraduate Medical Education and Training Board.
[103]
1999. Does a student log provide a means to better structure clinical education? Medical Education. 33, 2 (Feb. 1999), 89–94. DOI:https://doi.org/10.1046/j.1365-2923.1999.00285.x.