While scales measuring subjective constructs historically rely on agree-disagree (AD) questions, recent research demonstrates that construct-specific (CS) questions clarify underlying response dimensions that AD questions leave implicit and CS questions often yield higher measures of data quality. Given acknowledged issues with AD questions and certain established advantages of CS items, the evidence for the superiority of CS questions is more mixed than one might expect. We build on previous investigations by using cognitive interviewing to deepen understanding of AD and CS response processing and potential sources of measurement error. We randomized 64 participants to receive an AD or CS version of a scale measuring trust in medical researchers. We examine several indicators of data quality and cognitive response processing including: reliability, concurrent validity, recency, response latencies, and indicators of response processing difficulties (e.g., uncodable answers). Overall, results indicate reliability is higher for the AD scale, neither scale is more valid, and the CS scale is more susceptible to recency effects for certain questions. Results for response latencies and behavioral indicators provide evidence that the CS questions promote deeper processing. Qualitative analysis reveals five sources of difficulties with response processing that shed light on under-examined reasons why AD and CS questions can produce different results, with CS not always yielding higher measures of data quality than AD.
Anderson, L.A. and R.F. Dedrick. 1990. “Development of the Trust in Physician Scale: A Measure to Assess Interpersonal Trust in Patient-physician Relationships.” Psychological Reports 67: 1091–1100. Doi: https://doi.org/10.2466/pr0.1990.67.3f.1091.
Bassili, J.N. and B.S. Scott. 1996. “Response Latency as a Signal to Question Problems in Survey Research.” Public Opinion Quarterly 60: 390–399. Doi: https://doi.org/10.1086/297760.
Braunstein, J.B., N.S. Sherber, S.P. Schulman, E.L. Ding, and N.R. Powe. 2008. “Race, Medical Researcher Distrust, Perceived Harm, and Willingness to Participate in Cardiovascular Prevention Trials.” Medicine 87: 1–9. Doi: https://doi.org/10.1097/MD.0b013e3181625d78.
Carpenter, P.A. and M.A. Just. 1975. “Sentence Comprehension: A Psycholinguistic Processing Model of Verification.” Psychological Review 82: 45–73. Available at: http://psycnet.apa.org/doi/10.1037/h0076248 (accessed April 2019).
Dillman, D.A., J.D. Smyth, and L.M. Christian. 2014. Internet, Phone, Mail, and Mixed-Mode Surveys: The Tailored Design Method (4th edition). Hoboken, NJ: John Wiley.
Draisma, S. and W. Dijkstra. 2004. “Response Latency and (Para)linguistic Expression as Indicators of Response Error.” In Methods for Testing and Evaluating Survey Questionnaires, edited by S. Presser, J.M. Rothgeb, M.P. Couper, J.T. Lessler, E. Martin, J. Martin, and E. Singer, 131–148. New York: Springer-Verlag. Doi: https://doi.org/10.1002/0471654728.ch7.
Dykema, J., J.M. Lepkowski, and S. Blixt. 1997. “The Effect of Interviewer and Respondent Behavior on Data Quality: Analysis of Interaction Coding in a Validation Study.” In Survey Measurement and Process Quality, edited by L. Lyberg, P. Biemer, M. Collins, E. de Leeuw, C. Dippo, N. Schwarz, and D. Trewin, 287–310. N.Y: Wiley-Interscience. Available at: https://onlinelibrary.wiley.com/doi/10.1002/9781118490013.ch12 (accessed April 2019).
Dykema, J., N.C. Schaeffer, and D. Garbarski. 2012. “Effects of Agree-Disagree Versus Construct-Specific Items on Reliability, Validity, and Interviewer-Respondent Interaction.” Presented at the American Association for Public Opinion Research, May 17–20. 2012. Orlando, Florida, U.S.A.
Dykema, J., N.C. Schaeffer, and D. Garbarski. 2019. “Towards a Reconsideration of the Use of Agree-Disagree Questions in Measuring Subjective Evaluations.” Unpublished manuscript, University of Wisconsin-Madison, Madison-WI.
Edwards, D.F. 2015. “Voices Heard.” Presented at the Health Equity Leadership Institute, Madison, WI.
Egede, L.E. and C. Ellis. 2008. “Development and Testing of the Multidimensional Trust in Health Care Systems Scale.” Journal of General Internal Medicine 23: 808–815. Doi: https://doi.org/10.1007/s11606-008-0613-1.
Fleiss, J.L. 1981. Statistical Methods for Rates and Proportions, 2nd edition. New York: Wiley.
Fortune-Greeley, A.K., K.E. Flynn, D.D. Jeffery, M.S. Williams, F.J. Keefe, R.B. Reeve, G.B. Willis, and K.P. Weinfurt. 2009. “Using Cognitive Interviews to Evaluate Items for Measuring Sexual Functioning Across Cancer Populations: Improvements and Remaining Challenges.” Quality of Life Research 18: 1085–1093. Doi: https://doi.org/10.1007/s11136-009-9523-x.
Fowler, F.J. and C. Cosenza. 2009. “Design and Evaluation of Survey Questions.” In The Sage Handbook of Applied Social Research Methods, edited by L. Bickman and D.J. Rog, 375–412. Thousand Oaks, CA: Sage.
Hall, M.A., F. Camacho, E. Dugan, and R. Balkrishnan. 2002a. “Trust in the Medical Profession: Conceptual and Measurement Issues.” Health Services Research 37: 1419–1439. Doi: https://doi.org/10.1111/1475-6773.01070.
Hall, M.A., F. Camacho, J.S. Lawlor, V. DePuy, J. Sugarman, and K. Weinfurt. 2006. “Measuring Trust in Medical Researchers.” Medical Care 44: 1048–1053. Available at: http://www.jstor.org/stable/41219560 (accessed April 2019).
Hall, M.A., E. Dugan, B. Zheng, and A.K. Mishra. 2001. “Trust in Physicians and Medical Institutions: What is It, Can It be Measured, and Does It Matter?” Milbank Quarterly 79: 613–639. Doi: https://doi.org/10.1111/1468-0009.00223.
Hall, M.A., B. Zheng, E. Dugan, F. Camacho, K.E. Kidd, A. Mishra, and R. Balkrishnan. 2002b. “Measuring Patients’ Trust in their Primary Care Providers.” Medical Care Research and Review 59: 293–318.Doi: https://doi.org/10.1177/1077558702059003004.
Hayman, R.M., B.J. Taylor, N.S. Peart, B.C. Galland, and R.M. Sayers. 2001. “Participation in Research: Informed Consent, Motivation and Influence.” Journal of Paediatrics and Child Health 37: 51–54. Available at: https://doi.org/10.1046/j.1440-1754.2001.00612.x (accessed April 2019).
Henderson, G., J. Garrett, J. Bussey-Jones, M.E. Moloney, C. Blumenthal, and G. Corbie-Smith. 2008. “Great Expectations: Views of Genetic Research Participants Regarding Current and Future Genetic Studies.” Genetics in Medicine 10: 193–200. Doi: https://doi.org/10.1097/GIM.0b013e318164e4f5.
Höhne, J.K. and D. Krebs. 2018. “Scale Direction Effects in Agree/Disagree and Item-Specific Questions: A Comparison of Question Formats.” International Journal of Social Research Methodology 21: 91–103. Doi: https://doi.org/10.1080/13645579.2017.1325566.
Höhne, J.K. and T. Lenzner. 2018. “New Insights on the Cognitive Processing of Agree/Disagree and Item-Specific Questions.” Journal of Survey Statistics and Methodology 6: 401–417. Doi: https://doi.org/10.1093/jssam/smx028.
Höhne, J.K., S. Schlosser, and D. Krebs. 2017. “Investigating Cognitive Effort and Response Quality of Question Formats in Web Surveys Using Paradata.” Field Methods 29: 365–382. Doi: https://doi.org/10.1177/1525822x17710640.
Holbrook, A.L. 2008. “Recency Effect.” In Encyclopedia of Survey Research Methodology, edited by P.J. Lavrakas, 695–696. Newbury Park, CA: Sage.
Krosnick, J.A. and S. Presser. 2010. “Question and Questionnaire Design.” In Handbook of Survey Research, Second Edition, edited by P.V. Marsden and J.D. Wright, 263–313. Bingley, UK: Emerald Group Publishing Limited.
Kuru, O. and J. Pasek. 2016. “Improving Social Media Measurement in Surveys: Avoiding Acquiescence Bias in Facebook Research.” Computers in Human Behavior 57: 82–92. Available at: https://doi.org/10.1016/j.chb.2015.12.008 (accessed April 2019).
Landis, J.R. and G.G. Koch. 1977. “The Measurement of Observer Agreement for Categorical Data.” Biometrics 33: 159–174. Doi: https://doi.org/10.2307/2529310.
Lelkes, Y. and R. Weiss. 2015. “Much Ado about Acquiescence: The Relative Validity and Reliability of Construct-Specific and Agree-Disagree Questions.” Research and Politics 2: 1–8. Doi: https://doi.org/10.1177/2053168015604173.
Liu, M., S. Lee, and F.G. Conrad. 2015. “Comparing Extreme Response Styles between Agree-Disagree and Item-Specific Scales.” Public Opinion Quarterly 79: 952–975. Doi: https://doi.org/10.1093/poq/nfv034.
Mainous, A.G., D.W. Smith, M.E. Geesey, and B.C. Tilley. 2006. “Development of a Measure to Assess Patient Trust in Medical Researchers.” Annals of Family Medicine 4: 247–252. Doi: https://doi.org/10.1370/afm.541.
Schaeffer, N.C. and J. Dykema. 2011. “Response 1 to Fowler’s Chapter: Coding the Behavior of Interviewers and Respondents to Evaluate Survey Questions.” In Question Evaluation Methods: Contributing to the Science of Data Quality, edited by J. Madans, K. Miller, A. Maitland, and G. Willis, 23–39. Hoboken, NJ: John Wiley & Sons, Inc. Available at: https://doi.org/10.1002/9781118037003.ch3.
Scharff, D.P., K.J. Mathews, P. Jackson, J. Hoffsuemmer, E. Martin, and D. Edwards. 2010. “More than Tuskegee: Understanding Mistrust about Research Participation.” Journal of Health Care for the Poor and Underserved 21: 879–897. Doi: https://doi.org/10.1353/hpu.0.0323.
Schuman, H. and S. Presser. 1996. Questions and Answers in Attitude Surveys: Experiments on Question Form, Wording, and Context. Thousand Oaks, CA: Sage Publications, Inc.
Smith, T.W., P.V. Marsden, and M. Hout. 2013. General Social Survey, 1972–2010 [Cumulative File]. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2013-02-07. Doi: https://doi.org/10.3886/ICPSR31521.v1.
Streiner, D.L., G.R. Norman, and J. Cairney. 2015. Health Measurement Scales: A Practical Guide to Their Development and Use. Oxford, UK: Oxford University Press.
Sturgis, P., C. Roberts, and P. Smith. 2014. “Middle Alternatives Revisited: How the neither/nor Response Acts as a Way of Saying “I Don’t Know”?” Sociological Methods & Research 43: 15–38. Doi: https://doi.org/10.1177/0049124112452527.
Thompson, H.S., H.B. Valdimarsdottir, G. Winkel, L. Jandorf, and W.W. Redd. 2004. “The Group-Based Medical Mistrust Scale: Psychometric Properties and Association with Breast Cancer Screening.” Preventive Medicine 38: 209–218. Doi: https://doi.org/10.1016/j.ypmed.2003.09.041.
Tourangeau, R., M.C. Couper, and F. Conrad. 2004. “Spacing, Position, and Order: Interpretive Heuristics for Visual Features of Survey Questions.” Public Opinion Quarterly 68: 368–393. Doi: https://doi.org/10.1093/poq/nfh035.
Tourangeau, R., L.J. Rips, and K. Rasinski. 2000. The Psychology of Survey Response. Cambridge, England: Cambridge University Press.
Yan, T. and R. Tourangeau. 2008. “Fast Times and Easy Questions: The Effects of Age, Experience and Question Complexity on Web Survey Response Times.” Applied Cognitive Psychology 22: 51–68. Available at: https://onlinelibrary.wiley.com/doi/abs/10.1002/acp.1331 (accessed April 2019).
Zheng, B., M.A. Hall, E. Dugan, K.E. Kidd, and D. Levine. 2002. “Development of a Scale to Measure Patients’ Trust in Health Insurers.” Health Services Research 37: 185–200. Doi: https://doi.org/10.1111/1475-6773.00145.