Does online psychological test administration facilitate faking?
Highlights
► We examined whether online or traditional test administration influences fakability.
► Administration mode did not influence scores when faking good.
► Administration mode did not influence scores when faking bad.
► Online and pen-and-paper presentation appear equivalent when an individual is faking.
► Future research should investigate other measures and faking scenarios.
Introduction
The internet is increasingly used for psychological research and assessment in a number of contexts, including for vocational (Piotrowski & Armstrong, 2006) and clinical (Hedman et al., 2010) purposes. Much research has examined the equivalence of pen-and-paper and web-based versions of specific psychological measures in a variety of domains (e.g. Coles et al., 2007; Denniston et al., 2010; Hedman et al., 2010; Lewis et al., 2009; Templer & Lange, 2008). However, the equivalence of internet and pen-and-paper test presentation has not been examined with regard to the susceptibility of self-reports to faking. This study aimed to examine, for the first time, the influence of mode of delivery on the fakability of self-report psychological tests.
The internet has rapidly become a valuable medium for data collection as it is considered inexpensive, easily accessible, and discreet (Birnbaum, 2004). Other advantages of online data collection are that participants can be required to answer all items (thereby minimising missing data), and that data can be transferred electronically for analysis (thereby reducing data entry error) (Carlbring et al., 2007; Lewis et al., 2009). However, rather than assuming that internet and pen-and-paper administrations of psychological measures are interchangeable, it has been recommended that all psychological measures be evaluated to investigate whether internet and pen-and-paper administrations are comparable (Buchanan, 2002).
To date, a number of tests have been compared, including clinical measures (e.g. Carlbring et al., 2007; Coles et al., 2007; Herrero & Meneses, 2006), personality measures (e.g. Templer & Lange, 2008), ability measures (e.g. Ihme et al., 2009), and health- and risk-related behaviour measures (e.g. Horswill & Coster, 2001; Lewis et al., 2009; McCabe et al., 2006; Whittier et al., 2004). Overall, findings suggest that the internet is both a feasible and largely comparable method for conducting psychological testing. However, no extant research has examined the role of mode of delivery in the administration of self-report psychometric tests and the potential facilitation of faking behaviour.
Faking or malingering occurs when an individual strategically alters their self-representation on a particular test (Grieve & Mahar, 2010). Faking good is characterised by responses that enhance an individual’s actual state, making them appear psychologically superior (for example, in a job application), while faking bad occurs when an individual presents themselves as psychologically worse than they actually are (for example, to be diagnosed with a disorder).
Faking of psychological assessments may have a number of consequences. For example, in vocational contexts, faking will not only influence who gets hired (Mueller-Hanson, Heggestad, & Thornton, 2003), but can also impact the subsequent training and management of employees (Landers, Sackett, & Tuzinski, 2011). In clinical contexts, faking may influence access to therapy or medication (Suhr, Hammers, Dobbins-Buckland, Zimak, & Hughes, 2008).
This study aimed to build on the existing research regarding the validity of pen-and-paper and online testing methods by investigating whether administration mode influences an individual’s ability to fake a measure. To more fully address this aim, both faking good and faking bad scenarios were employed.
Previous research has shown that individuals are readily able to fake good in vocational contexts (for example, as if applying for a job) by maximising positive, job-relevant personality aspects and minimising negative personality aspects (Mahar et al., 2006). Therefore, an initial hypothesis was that participants would be able to alter their original personality profiles to a more positive faked profile when asked to complete a personality measure as if they were applying for a job. Specifically, it was anticipated that the faked profiles would score significantly higher than the original profiles on desirable employee characteristics (honesty/humility, extraversion, agreeableness, conscientiousness, and openness), and significantly lower on undesirable employee characteristics (emotionality).
The second hypothesis addressed the main research question. Given that most research into the equivalence of online and pen-and-paper personality testing has found that both modes of administration elicit similar test results (e.g. Templer & Lange, 2008), it was hypothesised that faked profile scores would be equivalent regardless of which mode of administration was used. While it is acknowledged that this is in fact testing the null hypothesis, and that it is difficult to ascertain whether a hypothesis of no difference is true (Nickerson, 2000), a hypothesis of this nature was required by the research question. It follows that, in order to test the second hypothesis, a close examination of effect size, rather than statistically significant differences alone, was indicated.
It has also been shown that individuals are able to fake bad in clinical contexts (for example, as if they have depression, see Grieve & Mahar, 2010). Therefore, it was hypothesised that when participants were asked to complete a depression measure as if they had depression, they would be able to alter their original scores on that measure to faked scores suggesting a provisional diagnosis of depression.
In order to address the main research question, scores were compared between groups of participants who faked the depression measure either online or using pen-and-paper. Again, as previous research has largely supported the equivalence of the two modes of administration for clinical measures (e.g. Carlbring et al., 2007), it was hypothesised that there would be no significant differences in faked depression scores as a function of administration method. Once more, as this prediction was testing the null hypothesis, close examination of the effect size was also undertaken.
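Because both equivalence hypotheses rest on effect size rather than on null-hypothesis significance tests alone, the comparison of faked scores across administration modes can be illustrated with a standardised mean difference. The sketch below (not from the paper; the scores are hypothetical) computes Cohen's d with a pooled standard deviation, the conventional index for judging whether a between-group difference is negligible:

```python
# Illustrative sketch: Cohen's d between two administration-mode groups.
# The data below are hypothetical, not the study's actual scores.
import math

def cohens_d(group_a, group_b):
    """Cohen's d using the pooled standard deviation of two independent groups."""
    n_a, n_b = len(group_a), len(group_b)
    mean_a = sum(group_a) / n_a
    mean_b = sum(group_b) / n_b
    var_a = sum((x - mean_a) ** 2 for x in group_a) / (n_a - 1)
    var_b = sum((x - mean_b) ** 2 for x in group_b) / (n_b - 1)
    pooled_sd = math.sqrt(((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2))
    return (mean_a - mean_b) / pooled_sd

# Hypothetical faked depression scores for each administration mode
online = [28, 30, 26, 32, 29, 31]
paper = [27, 31, 28, 30, 29, 30]
print(round(abs(cohens_d(online, paper)), 2))
```

By Cohen's conventional benchmarks, |d| below 0.2 is a negligible effect, so a small d alongside a non-significant test is what would support the equivalence prediction.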
Participants
The sample consisted of 223 participants (54 men, 169 women) who completed the questionnaire on the internet (63%) or on paper (37%). Participants were recruited from the student body of an Australian university (41.5%) and from the general public (51.8%); a further 6.7% did not report whether or not they were students. Participants were invited to participate via in-class announcements, word of mouth, and the social networking website Facebook. Participation was voluntary and no
Manipulation check
Answers to the manipulation check were reviewed to ensure that participants had understood and followed the experimental manipulation, and were dummy coded as either ‘followed instructions’ or ‘did not follow instructions’. Examples of responses coded as following instructions were “to bend the truth or lie straight out to get the job” and “trying to answer like a depressed person would”. Examples of responses coded as not following instructions included “to answer honestly and openly” and
Discussion
The aim of this study was to examine whether internet and pen-and-paper test administrations are equally susceptible to faking for measures of personality and depression. As predicted, participants were able to alter their test profiles to present themselves as a desirable employee, and also to appear as if a provisional diagnosis of depression was indicated. When faking good, emotionality tended to be under-reported, while honesty/humility, extraversion, agreeableness, conscientiousness and
Conclusions
This research examined for the first time the fakability of two measures (the HEXACO-60 and the DASS-21) as a function of test administration mode. The small effect sizes indicated that internet administration and pen-and-paper administration are largely equivalent when an individual is engaging in faking behaviours on these specific self-report measures of personality (faking good) and depression (faking bad). While future research should extend investigation to other contexts and measures,
Acknowledgements
The authors would like to thank Catherine McSwiggan for her assistance in data collection. We would also like to thank two anonymous reviewers for their useful comments.
References
- et al. (2007). Internet vs. paper and pencil administration of questionnaires commonly used in panic/agoraphobia research. Computers in Human Behavior.
- et al. (2007). Assessing obsessive compulsive symptoms and cognitions on the internet: Evidence for the compatibility of paper and internet administration. Behavior Research and Therapy.
- et al. (2010). Comparison of paper-and-pencil versus Web administration of the Youth Risk Behavior Survey (YRBS): Participation, data quality, and perceived privacy and anonymity. Computers in Human Behavior.
- et al. (2010). Internet administration of self-report measures commonly used in research on social anxiety disorder: A psychometric evaluation. Computers in Human Behavior.
- et al. (2006). Short Web-based versions of the perceived stress (PSS) and Center for Epidemiological Studies-Depression (CESD) Scales: A comparison to pencil and paper responses among Internet users. Computers in Human Behavior.
- et al. (1995). The structure of negative emotional states: Comparison of the Depression Anxiety Stress Scales (DASS) with the Beck Depression and Anxiety Inventories. Behavior Research and Therapy.
- et al. (2006). Stereotyping as a response strategy when faking personality questionnaires. Personality and Individual Differences.
- et al. (2006). Comparison of web and mail surveys for studying secondary consequences associated with substance use: Evidence for minimal mode effects. Addictive Behaviors.
- et al. (2008). The relationship of malingered test failure to self-reported symptoms and neuropsychological findings in adults referred for ADHD evaluation. Archives of Clinical Neuropsychology.
- et al. (2008). Internet testing: Equivalence between proctored lab and unproctored field conditions. Computers in Human Behavior.
- The HEXACO-60: A short measure of the major dimensions of personality. Journal of Personality Assessment.
- Human research and data collection via the Internet. Annual Review of Psychology.
- Statistical treatment of the Solomon Four Group Design: A meta-analytic approach. Psychological Bulletin.
- Online assessment: Desirable or dangerous? Professional Psychology: Research and Practice.