Cultural Consensus Theory: Comparing different concepts of cultural truth

https://doi.org/10.1016/j.jmp.2012.06.002Get rights and content

Abstract

Cultural Consensus Theory (CCT) is a model-based approach to aggregating the responses of informants (respondents) to questions (items) about some domain of their shared cultural knowledge. The purpose of CCT is to allow a researcher to discover consensus knowledge in cultural groups. This paper compares and contrasts two CCT models for items requiring a dichotomous, true/false answer. The first model is the General Condorcet Model (GCM). A special case of the GCM is already in wide use, especially in cultural anthropology, and this paper generalizes that version and provides new mathematical and statistical results for it. The character of the GCM is that of a general signal detection model, where the item-trial type (signal or noise) as well as the respondents’ hit and false alarm rates, are latent rather than observable. The second model, the Latent Truth Model (LTM), is a new model that allows cultural truth to assume continuous values in the unit interval rather than the two-valued truth assumption of the GCM. Both models are compared analytically, and hierarchical Bayesian inference for each is developed. A posterior predictive model check is established for both models that bears directly on the assumption that there is a single consensus truth. In addition, the similarities and differences between the models are illustrated both with mathematical and statistical results, as well as by analyzing real and simulated data sets, and a second posterior predictive check that tends to differentiate the models is also provided.

Highlights

► Two Cultural Consensus Theory models for dichotomous data are formalized. ► Mathematical and statistical properties of each model are provided. ► Hierarchical Bayesian Inference for both models on real and simulated data. ► Posterior predictive model checks are developed for both models.

Introduction

Cultural Consensus Theory (CCT) is an approach to information pooling (aggregation, data fusion) first presented in the mid-1980s (Batchelder and Romney, 1986, Batchelder and Romney, 1988, Batchelder and Romney, 1989, Romney et al., 1987, Romney et al., 1986). Since its inception, CCT has become a popular methodology for measurement and inference in the social and behavioral sciences, especially in cultural anthropology (e.g., Romney & Batchelder, 1999). CCT is designed for situations in which a researcher has access to a group of informants (respondents, experts) who share knowledge or beliefs and agree to provide answers to a set of questions (items) about some domain of their shared knowledge. While the researcher is assumed able to pose relevant questions to the informants, the nature of their shared knowledge, and more specifically the consensus, or ‘culturally correct’ answers to the questions, are unknown a priori to the researcher. Thus the primary goal of CCT is to infer consensus patterns of cultural truth that represent the informants’ shared cultural knowledge. To achieve this goal, CCT consists of cognitive response models, each of which is designed to accommodate a particular testing format (e.g. true/false, multiple choice, ranking, probability estimates). Each CCT model specifies parameters for the consensus answer to each item, as well as for the response characteristics of each informant.

In the case of dichotomous true/false items, the standard CCT model is the General Condorcet Model (GCM), first published in the mid-1980s (Batchelder and Romney, 1986, Batchelder and Romney, 1988, Romney et al., 1986), and it is described later in this paper. The GCM has become the most popular model used in applications of CCT to ethnographic studies in cultural anthropology, e.g. Weller (2007), and there are also the many articles that cite Romney et al. (1986). The popularity of the GCM is due in part to the availability of a software package for estimating a restricted case of the GCM developed by Steve Borgatti, which at the time of writing, is available in ANTHROPAC (Borgatti, 1996) as well as in UCINET (Borgatti, Everett, & Freeman, 2002). The original estimation method is based on an ad hoc blend of method-of-moments and Bayesian techniques developed in Batchelder and Romney (1988), and it applies only to a special case of the GCM. Later Bayesian estimation for a more general version of the GCM based on Markov Chain Monte Carlo (MCMC) methods is presented in Karabatsos and Batchelder (2003). One of the main goals of this paper is to provide new mathematical results as well as Bayesian hierarchical inference for the GCM, including a new posterior predictive check for the central assumption that the informants share the same basic cultural truths.

In general, questions appropriate for CCT are designed to tap knowledge or beliefs shared by the informants rather than their personal beliefs or opinions. For example, (1) “Is Indianapolis the capital of Indiana?” and (2) “Do Hoosiers like basketball?” (Hoosiers are folks living in Indiana) are appropriate CCT questions; but (3) “Do you like basketball?” is not appropriate because it invites an individual to express a personal preference. The GCM assumes that each question pertaining to the shared knowledge of the informants actually has a dichotomously-based ‘correct answer’, i.e. it is either true or false for the culture, as in two-valued logic. This dichotomous assumption about cultural truth is a strong and potentially limiting assumption in the GCM. For example, questions similar to (1) seem appropriate for two logical values, whereas questions like (2) may be better represented by a continuous representation of degrees of cultural truth, e.g. a value in [0, 1], as in fuzzy logic (Zadeh, 1965).

The primary purpose of this paper is to introduce and compare a new CCT model for dichotomous items, the Latent Truth Model (LTM). The new model weakens the GCM assumption that cultural truth is two-valued by assuming that cultural truth is specified by parameters representing truth-values coded continuously in [0, 1]. The LTM shares some of the structure of a CCT model by Batchelder, Strashny, and Romney (2010) for cases where informant responses are in the interval [0, 1], as in judgments of the probability of events or the similarity between pairs of semantic terms. In this paper, we contrast the strong distinction of using CCT to infer truths as continuous values, as with the LTM, versus the traditionally-used, two-valued cultural truth specification in the GCM, first analytically, then with simulated data, and finally by applying both to published data.

The paper is divided into six sections. After the introductory section, Section 2 presents a generalized version of the GCM along with several new mathematical properties, and then Section 3 presents the new LTM model along with several of its mathematical properties as well. Section 4 compares the two models analytically, and discusses similarities and differences between the roles of the parameters specified in the two models. In this section, a statistic of the response data is provided that tends to distinguish the two models. Section 5 develops hierarchical Bayesian inference and posterior predictive checks for both models based on MCMC methods, and applies these developments to both simulated and published data. This paper is the first paper that presents hierarchical Bayesian inference for CCT models, and we hope that it sets the stage for others to adopt this approach. Finally, in Section 6 we discuss suggested conditions that need to be met for the appropriate use of each model.

Section snippets

The GCM for dichotomous responses

In this section we present the assumptions for the GCM, and we show that it has the structure of a general signal detection model for dichotomous response data. In the case of the usual signal detection model, the hit and false alarm rates, as well as the ‘correct answers’, are manifest (observable); however for the GCM, these quantities are latent variables that play the role of parameters.

The LTM for dichotomous responses

The primary motivation for the LTM comes from a desire to represent cultural truth in terms of degrees rather than in terms of two-valued logic, as in the GCM. Of course certain cultural truths are dichotomous by their nature, such as statements of fact concerning explicit laws, moral codes, or food taboos; however other statements seem better represented as a degree of cultural truth: such as the perceived severity of crimes, the importance of observing certain moral codes, or culturally

Analytic comparison of the models

There are a number of similarities and differences in the properties of the GCM and LTM. Both models are latent variable models, in that an observable response variable depends probabilistically on latent (unobservable) variables. However, the models differ in the types of latent variables that characterize them. As pointed out in Batchelder and Romney (1988), the GCM is a latent-class model (e.g., Lazarsfeld & Henry, 1968). The model specifies two latent classes of items: items that are ‘true’

Statistical inference with the models

This section is divided into four subsections. In Section 5.1, we provide hierarchical Bayesian inference for both models, along with a strong model-check statistic that bears on the axiom of a single set of truth parameters that is assumed by both models. The next subsection shows that the approach to inference is able to recover parameters of simulated data to an adequate degree. Then in Section 5.3, we show how the models differ in interpreting data, especially as the number of informants is

Discussion

This paper presents and compares two alternative CCT models, the GCM and the LTM, for items requiring a dichotomous (true/false or yes/no) response. A restricted version (i,gi=1/2) of the GCM has frequently been used in social and behavioral research, especially in cultural anthropology. The main goal of these applications has been to examine informants’ responses to questions about their common culture and determine if there is a single cultural consensus behind these responses. If the model

Acknowledgments

We would like to thank John Kruschke and his two anonymous referees for many helpful suggestions made to earlier drafts of the paper. We would also like to thank Zita Oravecz for her helpful advice. Work on this paper was supported by grants to the first author from the Air Force Office of Scientific Research (AFOSR), the Army Research Office (ARO), and the Intelligence Advanced Research Projects Activity (IARPA).

References (44)

  • M.D. Lee

    How cognitive modeling can benefit from hierarchical Bayesian models

    Journal of Mathematical Psychology

    (2011)
  • E.C. Merkle et al.

    Hierarchical models of simple mechanisms underlying confidence in decision making

    Journal of Mathematical Psychology

    (2011)
  • L.A. Zadeh

    Fuzzy sets

    Information and Control

    (1965)
  • R. Aunger

    Reflexive ethnographic science

    (2004)
  • W.H. Batchelder et al.

    Consensus analysis of three-way social network data

    Journal of Mathematical Sociology

    (1997)
  • W.H. Batchelder et al.

    The statistical analysis of a general condorcet model for dichotomous choice situations

  • W.H. Batchelder et al.

    Test theory without an answer key

    Psychometrika

    (1988)
  • W.H. Batchelder et al.

    New results in test theory without an answer key

  • W.H. Batchelder et al.

    Cultural consensus theory: aggregating continuous responses in a finite interval

  • J.M. Bernardo et al.

    Bayesian theory

    (2000)
  • S.P. Borgatti

    ANTHROPAC 4.0. Columbia SC

    (1996)
  • S.P. Borgatti et al.

    UCINET for windows: software for social network analysis

    (2002)
  • G.E.P. Box et al.

    Empirical model-building and response surfaces

    (1987)
  • J.S. Bruner et al.

    A study of thinking

    (1956)
  • A.L. Comrey

    The minimum residual method of factor analysis

    Psychological Reports

    (1962)
  • L.M. de Menezes et al.

    Comparing fits of latent trait and latent class models

    Journal of Applied Statistics

    (2007)
  • O. Diekmann et al.

    Mathematical epidemiology of infectious diseases: model building, analysis, and interpretation

    (2000)
  • S. Embretson et al.

    Item response theory for psychologists

    (2000)
  • A. Gelman et al.

    Bayesian data analysis

    (2004)
  • L. Guttman

    The basis for scalogram analysis

  • D.J. Hruschka et al.

    When there is more than one answer key: cultural theories of postpartum hemorrhage in Matlab, Bangladesh

    Field Methods

    (2008)
  • E.B. Hunt et al.

    Experiments in induction

    (1966)
  • Cited by (54)

    • Hierarchical paired comparison modeling, a cultural consensus theory approach

      2021, Journal of Mathematical Psychology
      Citation Excerpt :

      CCT uses this information to assess a consensus ground truth while simultaneously providing measures for item difficulty and subject agreement. CCT item response models have been developed for binary response data (Batchelder & Anders, 2012), multiple choice (Romney et al., 1986), continuous closed interval responses (Batchelder et al., 2010), and ordinal data (Anders & Batchelder, 2015). For an overview of CCT, see Batchelder et al. (2018).

    • Beating the average forecast: Regularization based on forecaster attributes

      2020, Journal of Mathematical Psychology
      Citation Excerpt :

      In this paper, we develop a model-based approach that satisfies both of these aims by formally linking the unweighted average to an optimal forecaster weighting scheme. The model, inspired by Batchelder’s Cultural Consensus Theory (CCT; e.g., Batchelder & Anders, 2012; Batchelder & Romney, 1988; Karabatsos & Batchelder, 2003; Romney, Batchelder, & Weller, 1987; Romney, Weller, & Batchelder, 1986), allows us to estimate relevant forecaster and environment characteristics within a broad class of forecasting environments. Drawing on analytic work from Davis-Stober, Budescu, Broomell, and Dana (2015), we are then able to translate these characteristics into optimal forecaster weights, using mean squared error as an objective function.

    • Cultural Consensus Theory for the evaluation of patients’ mental health scores in forensic psychiatric hospitals

      2020, Journal of Mathematical Psychology
      Citation Excerpt :

      The property of CCT models to estimate the consensus truth from the data is ideal for psychiatric data, where a patient’s true state is unknown and a consensus from the raters is desired. CCT models can be applied to continuous data (e.g., the LTM;Batchelder & Anders, 2012), categorical data (e.g., the General Condorcet model; Batchelder & Romney, 1986), and ordinal data (AB). Since ratings are usually given on a Likert scale, we focus on a CCT model for ordinal data.

    View all citing articles on Scopus
    View full text