Combining psychometric and biometric measures of substance use

https://doi.org/10.1016/j.drugalcdep.2005.10.016Get rights and content

Abstract

This paper examines the need, feasibility, and validity of combining two biometric (urine and saliva) and three self-report (recency, peak quantity, and frequency) measures of substance use for marijuana, cocaine, opioids, and other substances (including alcohol and other drugs). Using data from 337 adults with substance dependence, we used structural equation modeling to demonstrate that these multiple measures are driven by the same underlying factor (substance use) and that no single measure is without error. We then compared the individual measures and several possible combinations of them (including one based on the latent factors and another based on the Global Appraisal of Individual Needs (GAIN) Substance Frequency Scale) to examine how well each predicted a wide range of substance-related problems. The measure with the highest construct validity in these analyses varied by drug and problem. Despite their advantages for detection, biometric measures were frequently less sensitive to the severity of other problems. Composite measures based on the substance-specific latent factors performed better than simple combinations of the biometric and psychometric measures. The Substance Frequency Scale from the GAIN performed as well as or better than all measures across problem areas, including the latent factor for any use. While the research was limited in some ways, it has important implications for the ongoing debate about the proper way to combine biometric and psychometric data.

Introduction

Although frequently used in research, the accuracy of measures of substance use collected by self-report is often questioned and perceived as biased. A major point of contention remains regarding the veracity of self-report and, to a lesser degree, peer report. Whether as a result of conscious distortion, unconscious denial, or measurement artifact, the literature clearly demonstrates systematic differences in self-report and biometric measures (Amsel et al., 1976, Buchan et al., 2002, Cook et al., 1995, Darke, 1998, Dennis et al., 2003a, Harrison, 1995, Hersh et al., 1999, Katz et al., 2005, Landry et al., 2003, McNagny and Parker, 1992, Messina et al., 2000, Mieczkowski, 1990, Nelson et al., 1998, Preston et al., 1997, Skog, 1992, Stephens, 1972, Stephens and Feucht, 1993, Weatherby et al., 1994, Wish et al., 1997). Biometric screening measures clearly identify people who denied recent use, but they also miss people who readily acknowledge use and can have limited utility for quantifying the extent of use of a long period of time—which is what is wanted in many clinical and clinical research settings.

A separate but closely related concern is how to best operationalize self-reported measures of substance use. Some of the options include days of use, amount of drugs consumed (over a period or on average), days of heavy use, times used, and peak use. Research comparing these measures has shown that they do not respond the same to demand characteristics but rather follow the logic of fuzzy number sets (Matt and Wilson, 1994, Matt et al., 2003). Lennox et al. (1996) combined quantity and frequency items into a single latent variable, using two quantity and two frequency items as measured indicators of the single heavy-drinking variables. This approach argues that the two types of measures can be considered fallible effect-indicators (Bollen and Lennox, 1991) of the same construct, that their intersection can be considered a more accurate measure of substance use, and that the latent variable can measure this alcohol use on a continuum of light to heavy drinking. Lennox et al. (1996) confirmed this structure in a sample from the National Household Survey on Drug Abuse and demonstrated that the latent variable model outperformed standard approaches to measuring alcohol use, including summed score of items and the multiplicative product of quantity and frequency in predicting work-related adverse consequences. Lennox et al. (1998) also used this approach to test the unique impact of heavy drinking and alcohol dependence on adverse alcohol-related consequences.

Compared to psychometric measures, biometric measures of substance use (e.g., from urine, saliva, blood, or hair) are often tacitly accepted as a more accurate assessment of use because they are not subject to various biases that often impact the accuracy of self-report. While they are certainly less likely to be biased due to individual demand characteristics, they too are impacted by several sources of error (Buchan et al., 2002, Cone and Weddington, 1989, Del Boca and Noll, 2000, Del Boca and Darkes, 2003, Dennis et al., 2003a, Dennis et al., 2003b, Dennis et al., 2004, Feucht et al., 1994, Mieczkowski et al., 1991, Visher and McFadden, 1991, Bennett et al., 2003). The greatest threat lies in a systematic error that comes from using a static snapshot of the biological state to infer a more general consumption level over a period of time (Goldstein and Brown, 2003). For example, blood tests may only reliably detect drugs that were used in a 1-h period prior to the test, saliva tests for substances used during the prior 1–2 days, and urine tests for use during the prior 1–7 days. As such, these tests may be insensitive to longer term and more devastating chronic drug use patterns.

A second problem is that researchers frequently ignore the large individual differences in the rates at which various drugs are metabolized. Published cut points are typically based on the 50th or 80th percentiles for how long people will continue to be positive after a given use. In some cases the actual distribution can go out days or weeks further (Buchan et al., 2002). Thus, there is no perfect way of comparing these measures of “metabolites” with self-reported measures of recency, frequency, or quantity of use. Other sources of error include relying on less expensive and/or faster screening tests that do not always agree with gas chromatography/mass spectrometry (or GC/MS, the gold standard) and often suffering from processing delays (particularly with unfrozen samples), handling errors, and participants tampering with the sample.

Rather than accepting one approach or the other, most researchers favor combining the results of multiple methods (Campbell and Fiske, 1959, Cone and Weddington, 1989, Hasin et al., 2003, Lennox and Dennis, 1994, Carroll, 1995, Kranzler et al., 1997). However, there is little research comparing the construct validity of different approaches for combining self-reported and biometric measures of substance use. Some alternatives include: (a) requiring each test to indicate substance use, (b) allowing any indication of substance use to be sufficient, and (c) combining the data into a dimensional measure (via structural equation modeling or a scale) that increases as there are more (and stronger) indications of use. Since no single approach can be considered a “criterion,” it is necessary to focus on the construct validity of the alternative approaches (Cronbach and Meehl, 1955). The construct validity of a measure is established by demonstrating that the measure is related to other measures in ways that are consistent with theory. No single test or relationship is sufficient to establish the validity of a measure, but rather a pattern of relationships is required to eliminate alternative interpretations to the measure.

A tangential issue that has been recently debated in this journal is whether information should be combined by substance or across multiple substances, especially when using the measure for prognosis and to track outcomes (see Rounsaville et al., 2003, O’Brien and Lynch, 2003, Strain, 2003, Conway et al., 2003). This is an important issue because over one third of the people with abuse or dependence in the community and two thirds of those in treatment have multiple substance use disorders (SAMH, 2003a, SAMH, 2003b). While it may be useful to identify individual patterns of use (e.g., offering methadone to an opioid user), it is likely that symptoms of withdrawal, abuse/dependence, and other substance-related problems are multiply determined by all of the substances being used. Thus, in addition to looking at how to combine psychometric and biometric information within single substances, it is also important to consider what happens when this information is combined across substances. In this article we will use data from a large cohort of chronic substance users to empirically examine the relationship of multiple psychometric and biometric measures of substance use and the construct validity of different approaches for combining them.

Section snippets

Data source

The data from this paper came from the 12-month post-intake wave of the early re-intervention (ERI) experiment2 (Dennis et al., 2003a, Dennis et al., 2003b, Scott et al., 2005), which was designed to evaluate the effectiveness of quarterly monitoring, checkups, and early re-intervention on long-term outcomes of persons with lifetime substance use dependence. As part of this experiment 448 adults

Comparing the measures

For each substance (marijuana, cocaine, opioid, and any drug) we conducted a confirmatory factor analysis to see if they were being driven by the same underlying factor (i.e., unobserved substance use). Table 2 presents the standardized loadings (all significant) for each item in each of the four models along with the goodness-of-fit statistics for each model. Factor loadings that are significantly different than 0 (all in this study) are “reliably” measured, with higher loadings more closely

Discussion

The results of this analysis support our contention that psychometrics and biometrics each primarily measures a single underlying latent variable (substance use) and they each do so imperfectly due to a combination of measurement error and differences in how they are operationalized. The results of the confirmatory factor analysis were mixed; however, the model fits sufficiently well to support the general strategy of combining biometric and psychometric in a single composite measure. The

Acknowledgements

This work was completed with support provided by the National Institute on Drug Abuse Grant No. DA 11323. The second author developed the GAIN, but no royalties are associated with the GAIN. The authors would like to thank Joan Unsicker and Tim Feeney for assistance in preparing the manuscript and the study staff and participants for their time and effort.

References (57)

  • R.D. Lennox et al.

    Latent variable models of alcohol-related constructs

    J. Subst. Abuse

    (1996)
  • G.E. Matt et al.

    Describing the frequency of marijuana use: fuzziness and context-dependent interpretation of frequency expressions

    Eval. Program Plann.

    (1994)
  • C.P. O’Brien et al.

    Can we design and replicate clinical trials with a multiple drug focus?

    Drug Alcohol Depend.

    (2003)
  • B.J. Rounsaville et al.

    Single versus multiple drug focus in substance abuse clinical trials research

    Drug Alcohol Depend.

    (2003)
  • C.K. Scott et al.

    Recovery management checkups to shorten the cycle of relapse, treatment re-entry, and recovery

    Drug Alcohol Depend.

    (2005)
  • P. Shane et al.

    Treatment outcomes among adolescents with substance abuse problems: the relationship between comorbidities and post-treatment substance involvement

    Eval. Program Plann.

    (2003)
  • E.C. Strain

    Single versus multiple drug focus in substance abuse clinical trials research: the devil is in the details

    Drug Alcohol Depend.

    (2003)
  • N.L. Weatherby et al.

    Validity of self-reported drug use among injection drug users and crack cocaine users recruited through street outreach

    Eval. Program Plann.

    (1994)
  • Z. Amsel et al.

    Reliability and validity of self-reported illegal activities and drug use collected from narcotic addicts

    Int. J. Addict.

    (1976)
  • J.L. Arbuckle

    AMOS 4.0 (Computer Software)

    (1999)
  • P.M. Bentler et al.

    Significance tests and goodness of fit in the analysis of covariance structures

    Psychol. Bull.

    (1980)
  • K. Bollen et al.

    Conventional wisdom on measurement: a structural equation perspective

    Psychol. Bull.

    (1991)
  • K.A. Bollen

    Structural Equations with Latent Variables

    (1989)
  • B.J. Buchan et al.

    Cannabis use: consistency and validity of self report, on-site urine testing, and laboratory testing

    Addiction

    (2002)
  • B.M. Byrne

    Structural Equation Modeling with AMOS: Basic Concepts, Applications, and Programming

    (2001)
  • D.T. Campbell et al.

    Convergent and discriminant validation by the multi-trait–multi-method matrix

    Psychol. Bull.

    (1959)
  • K.M. Carroll

    Methodological issues and problems in the assessment of substance use

    Psychol. Assess.

    (1995)
  • E.J. Cone et al.

    Prolonged occurrence of cocaine in human saliva and urine after chronic use

    J. Anal. Toxicol.

    (1989)
  • Cited by (54)

    • A randomized controlled trial of a therapeutic relational agent for reducing substance misuse during the COVID-19 pandemic

      2021, Drug and Alcohol Dependence
      Citation Excerpt :

      Previously, this measure was found sensitive to treatment effects (Prochaska et al., 2021). Prior research has supported the validity of frequency measures of substance use consumption, which provide a continuous measure of the severity of harmful, problematic use (Moss et al., 2012) and when combined across substances, appropriately take into account their collective contributions to substance-related problems (Lennox et al., 2006). From the list, participants identified their primary and secondary (if applicable) substance(s) of abuse.

    • Identifying optimal level-of-care placement decisions for adolescent substance use treatment

      2020, Drug and Alcohol Dependence
      Citation Excerpt :

      These three measures corresponded to the most important treatment outcomes identified by all stakeholder groups (SUD treatment service providers, policymakers, researchers, and parents). See (Lennox et al., 2006) for psychometric information on these measures, and full norms and psychometrics for all scales in the GAIN are available at http://gaincc.org/data-statistics/encyclopedia-norms-psychometrics/. Tailoring variables are the variables capturing current and past symptomatology and level of functioning that are used to personalize LOC to each adolescent.

    • Substance use outcomes for mindfulness based relapse prevention are partially mediated by reductions in stress: Results from a randomized trial

      2018, Journal of Substance Abuse Treatment
      Citation Excerpt :

      Participants respond with the number of days (range 0–14) they have used each substance. The SFS was reliable in both prior adolescent and young adult samples (Buchan, Dennis, Tims, & Diamond, 2002; Dennis et al., 2002; Lennox, Dennis, Scott, & Funk, 2006). Specifically, the SFS has shown strong reliability, validity, and strong relationship with abuse and dependence symptoms among a large sample of youth in substance use disorder treatment (Dennis et al., 2004).

    View all citing articles on Scopus
    1

    Tel.: +1 309 820 3805; fax: +1 309 829 4661.

    View full text