Smoothed empirical likelihood for the Youden index

doi:10.1016/j.csda.2017.03.014

Computational Statistics & Data Analysis

Volume 115, November 2017, Pages 1-10

https://doi.org/10.1016/j.csda.2017.03.014 Get rights and content

Abstract

For a continuous scale biomarker of binary disease status, the Youden index is a frequently used measurement of diagnostic accuracy in the context of the receiver operating characteristic curve and provides an optimal threshold for making diagnosis. The majority of existing inference methods for the Youden index are either parametric or bootstrap based. In the current paper, the empirical likelihood method for the Youden index is derived via defining novel smoothed estimating equations, and Wilks’ theorem for the empirical likelihood ratio statistic is established. Extensive simulation studies suggest that the chi-square calibrated empirical likelihood interval estimators are robust to model assumptions, enjoy computational efficiency and perform better than the bootstrap procedure almost uniformly across a variety of scenarios in terms of coverage probabilities.

Introduction

In the past few decades, the receiver operating characteristic (ROC) curve has become a standard statistical tool to evaluate the discriminatory ability of a diagnostic test to separate diseased subjects from non-diseased subjects. For a diagnostic test with continuous scale, sensitivity (true positive rate) and specificity (true negative rate) are inversely related, in the sense that the increase of the one is accompanied with the decrease of the other as the cutoff point moves along the real number line. The ROC curve is the plot of sensitivity against 1-specificity at all possible threshold points. For comprehensive reviews of ROC analysis, see Shapiro (1999), Zhou et al. (2009), Pepe (2004), and Zou et al. (2010).

Even though the area under the ROC curve (AUC) has been widely used for measuring the accuracy of a diagnostic test, the Youden index has its unique advantage in advising an optimal cutoff for the clinicians to make diagnosis. The Youden index, firstly proposed by Youden (1950), is defined as the maximum of the sum of sensitivity and specificity minus one. The cutoff point, where the maximum is achieved, provides an optimal threshold for the clinicians to use the diagnostic test for classification if equal weight is placed on sensitivity and specificity. The possible values of the Youden index range from 0 to 1 with 0 indicating no discriminatory ability and 1 indicating perfect diagnostic accuracy. Graphically, the Youden index is the maximum vertical distance between the ROC curve and the diagonal chance line.

A variety of approaches have been developed for the inference of the Youden index. Fluss et al. (2005) provided several estimators for the Youden index and the associated cutoff points based upon normal assumption, empirical distribution function or kernel smoothing. For interval estimation of the Youden index, Schisterman and Perkins (2007) developed parametric methods under either normal or gamma assumption. Lai et al. (2012) made some improvement by utilizing a generalized variable approach. Even though monotonic transformation, such as Box–Cox transformation, can be applied, these parametric interval estimators may still be not satisfactory given the departures from distribution assumptions, particularly when the diseased and non-diseased populations are not from the same family of distributions. Nonparametric interval estimators of the Youden index are mainly developed from bootstrap. Based upon the Youden index estimators derived from different methods, including estimators under normal assumption, estimators from delta method, and estimators derived from empirical distribution function and kernel density estimation, a range of commonly used bootstrap methods, such as percentile, normal approximation and bias correction and acceleration (BCa) adjustment, have been considered by Faraggi (2003), Fluss et al. (2005) and Schisterman and Perkins (2007), respectively. More recently, Zhou and Qin (2012) proposed an adjusted bootstrap procedure via an approximate method for interval estimation of a single proportion introduced by Agresti and Coull (1998). Via extensive simulation studies, the authors suggested that their modified bootstrap method was comparable to parametric methods when distribution assumption holds unless the Youden index is close to upper boundary ( $J \geq 0.90$ ) and their methods outperformed the previously developed bootstrap methods.

In this paper, we aim to propose a novel interval estimator of the Youden index via the empirical likelihood. Empirical likelihood, formally proposed by Owen, 1988, Owen, 1990, is an appealing nonparametric method with many desirable features such as automatic determination of the shape of the confidence regions by data, straightforward incorporation of side information and being Bartlett correctable in many cases; see Owen (2001) for a comprehensive review. Claeskens et al. (2003) derived empirical likelihood confidence regions for ROC curves over a certain range of specificity values. Molanes-López and Letón (2011) proposed an empirical likelihood approach for the Youden index and its associated optimal cutoff point from a quantile function point of view. As it is not ready to profile out the nuisance parameter, the authors had to propose a fairly complicated two-cycle bootstrap procedure and the resulting interval estimator seemed to be over-conservative as suggested by simulation studies. In this paper, we develop empirical likelihood based upon novel estimation equations using kernel smoothing methods.

The rest of the paper is organized as follows: In Section 2, the novel empirical likelihood method is introduced. We also establish the asymptotic properties of the empirical likelihood ratio statistic and discuss potential computation algorithms. In Section 3, we evaluate the empirical performance of our method through extensive simulation studies. We illustrate the proposed method in Section 4 via the application to a published data set. We draw conclusions and make discussions in Section 5.

Section snippets

Methodologies

Let $X_{1}$ and $X_{2}$ denote diagnostic biomarker values from the diseased (case) and non-diseased (control) populations with distribution functions $X_{1} \sim F_{1}$ and $X_{2} \sim F_{2}$ , respectively. Without loss of generality, we assume that $X_{2}$ is stochastically less than $X_{1}$ ( $X_{2} ⪯ X_{1}$ ); otherwise the proposed method is still applicable to the negative of the biomarker values.

Let $g (t) = F_{2} (t) - F_{1} (t)$ be the difference between the two cumulative distribution functions at a certain point $t$ . The Youden index can be expressed as $θ =$

Simulation study

In this section, the empirical performance of our empirical likelihood method is assessed by extensive simulation studies. We reran the simulation experiments published in Zhou and Qin (2012) as follows:

(i)
$X_{1} \sim N (μ_{1}, σ_{1}^{2})$ and $X_{2} \sim N (0, 1)$ , where the variance $σ_{1}^{2}$ is set to be 0.5, 1, 3 and 5. For each value of $σ_{1}^{2}$ , the mean $μ_{1}$ is chosen such that the Youden index is equal to 0.4, 0.6, 0.8 and 0.9.
(ii)
$X_{1} \sim Γ (α_{1}, β_{1})$ and $X_{2} \sim Γ (1.5, 1)$ , where the shape parameter $α_{1}$ is set to be 1.5, 2, 2.5 and 3. For each value of $α$

Examples

We illustrate our empirical likelihood method through a data set of prostate cancer patients from Miller et al. (1980). The data set, as shown in Table 6, consists of the acid phosphatase levels in blood serum of 53 prostate cancer patients: $n_{1} = 33$ of them without nodal involvement and $n_{2} = 20$ of them with nodal involvement. The data set was previously analyzed by Le (2006) and Zhou and Qin (2012). Neither normal nor gamma distribution was found to fit the data well even after Box–Cox

Conclusions and discussions

In this paper, we develop empirical likelihood for the Youden index via defining novel estimating equations and establish Wilks theorem for the empirical likelihood ratio statistics. Simulation studies suggest that for small to medium sample sizes, the empirical likelihood interval estimators calibrated by $χ_{1}^{2}$ distribution are robust under different distribution models. As compared to the bootstrap procedures, our empirical likelihood methods are more computational efficient and often have

Acknowledgments

We thank the editor-in-chief and two reviewers for their careful reading of the original manuscript and for their constructive comments which significantly improve the paper. The research of Yichuan Zhao is partially supported by NSF Grants (DMS-1406163 and DMS-1613176) and a NSA Grant (H98230-12-1-0209).

References (23)

C. Lai et al.
Exact confidence interval estimation for the Youden index and its corresponding optimal cut-point
Comput. Statist. Data Anal.
(2012)
A. Agresti et al.
Approximate is better than exact for interval estimation of binomial proportions
Amer. Statist.
(1998)
X. Chen et al.
Smoothed empirical likelihood confidence intervals for quantiles
Ann. Statist.
(1993)
G. Claeskens et al.
Empirical likelihood confidence regions for comparison distributions and roc curves
Canad. J. Statist.
(2003)
D. Faraggi
Adjusting receiver operating characteristic curves and related indices for covariates
Statistician
(2003)
R. Fluss et al.
Estimation of the Youden index and its associated cutoff point
Biom. J.
(2005)
C.T. Le
A solution for the most basic optimization problem associated with ROC curve
Stat. Methods Med. Res.
(2006)
R.G. Miller et al.
Biostatistics Casebook
(1980)
E.M. Molanes-López et al.
Inference of the Youden index and associated threshold using empirical likelihood for quantiles
Stat. Med.
(2011)
A. Owen
Empirical likelihood ratio confidences for single functional
Biometrika
(1988)

A. Owen

Empirical likelihood ratio confidence regions

Ann. Statist.

(1990)

Cited by (13)

A smooth nonparametric approach to determining cut-points of a continuous scale
2019, Computational Statistics and Data Analysis
Citation Excerpt :
Some intuitive approaches include considering arbitrary cut-points; specifying cut-points as sample quantiles (e.g. median) or according to clinicians’ experience (Altman et al., 1994); using cut-points that yield disease rates consistent with a known population disease prevalence, or the highest proportions of correct classification based on a gold standard (Altman, 1991; Mazumdar and Glassman, 2000). Another popular approach is to utilize the receiver operating characteristic (ROC) curve (Pepe, 2003) in conjunction with various accuracy measures, such as Youden index (Youden, 1950; Fluss et al., 2005; Zhou and Qin, 2005; Schisterman et al., 2008; Zhou and Qin, 2012; Lai et al., 2012; Wang et al., 2017), concordance probability methods (Liu, 2012) and the point closest-to-(0, 1) corner method (Perkins and Schisterman, 2006), among others. Dong et al.
The problem of determining cut-points of a continuous scale according to an established categorical scale is often encountered in practice for the purposes such as making diagnosis or treatment recommendation, determining study eligibility, or facilitating interpretations. A general analytic framework was recently proposed for assessing optimal cut-points defined based on some pre-specified criteria. However, the implementation of the existing nonparametric estimators under this framework and the associated inferences can be computationally intensive when more than a few cut-points need to be determined. To address this important issue, a smoothing-based modification of the current method is proposed and is found to substantially improve the computational speed as well as the asymptotic convergence rate. Moreover, a plug-in type variance estimation procedure is developed to further facilitate the computation. Extensive simulation studies confirm the theoretical results and demonstrate the computational benefits of the proposed method. The practical utility of the new approach is illustrated by an application to a mental health study.
Statistical inference for the two-sample problem under likelihood ratio ordering, with application to the ROC curve estimation
2023, Statistics in Medicine
A New Classifier for Imbalanced Data Based on a Generalized Density Ratio Model
2023, Communications in Mathematics and Statistics
A review of recent advances in empirical likelihood
2023, Wiley Interdisciplinary Reviews: Computational Statistics
Smoothed empirical likelihood for optimal cut point analysis
2023, Communications in Statistics - Theory and Methods
Confidence intervals and sample size planning for optimal cutpoints
2023, PLoS ONE

View all citing articles on Scopus

View full text

Smoothed empirical likelihood for the Youden index

Abstract

Introduction

Section snippets

Methodologies

Simulation study

Examples

Conclusions and discussions

Acknowledgments

Comput. Statist. Data Anal.

Approximate is better than exact for interval estimation of binomial proportions

Amer. Statist.

Smoothed empirical likelihood confidence intervals for quantiles

Ann. Statist.

Empirical likelihood confidence regions for comparison distributions and roc curves

Canad. J. Statist.

Adjusting receiver operating characteristic curves and related indices for covariates

Statistician

Estimation of the Youden index and its associated cutoff point

Biom. J.

A solution for the most basic optimization problem associated with ROC curve

Stat. Methods Med. Res.

Biostatistics Casebook

Inference of the Youden index and associated threshold using empirical likelihood for quantiles

Stat. Med.

Empirical likelihood ratio confidences for single functional

Biometrika

Empirical likelihood ratio confidence regions

Ann. Statist.