[R] Conditional assignment of values to vector using ifelse
Jason Stout, M.D.
jason.stout at duke.edu
Sat May 24 17:47:17 CEST 2014
This is obviously an oversimplified way of simulating different test characteristics, but for the specific purpose I need it will serve. The lesion is not in my understanding of clinical epidemiology. I am more interested in understanding how the code is working in this case.
Thanks.
Jason Stout, MD, MHS
Box 102359-DUMC
Durham, NC 27710
FAX 919-681-7494
________________________________________
From: David Winsemius [dwinsemius at comcast.net]
Sent: Saturday, May 24, 2014 11:35 AM
To: Jason Stout, M.D.
Cc: r-help at r-project.org
Subject: Re: [R] Conditional assignment of values to vector using ifelse
On May 24, 2014, at 6:58 AM, Jason Stout, M.D. wrote:
> Hi R-users,
>
> I'm trying to simulate the outcome of several diagnostic tests with binary outcomes (positive/negative) and different performance characteristics. What I would like to generate is a dataframe with the first column representing the result of a "perfect" test, and different columns to the right simulating tests with different combinations of sensitivity and specificity. Here's the code I used to attempt this:
>
> x3<-rmvbin(10000,margprob=0.2) # results of perfect test, disease prevalence 20%
> sens1<-0.7 # sensitivity of test 1
> spec1<-0.8 # specificity of test 1
> sens2<-0.8 # sensitivity of test 2
> spec2<-0.7 # specificity of test 2
> funcsensspec<-function(x,sens,spec) {
> result<-ifelse(x==1,ifelse(runif(1,0,1)<=sens,1,0),
> ifelse(runif(1,0,1)<=spec,0,1))
> return(result)
> }
> x4<-funcsensspec(x3,sens1,spec1)
> x5<-funcsensspec(x3,sens2,spec2)
> xx<-cbind(x3,x4,x5)
>
> The problem is that this is not behaving as I expected. Ideally I wanted R to randomly reassign values for each row with probability based on test characteristics, but what I think it is doing is generating one random value and using that to reassign the entire vector. Here is sample output:
I think you need to re-examine your understanding of sensitivity and specificity. They are mappings from data to a continuous variable in [0,1] for varying choice of a continuously variable threshold, rather than a sampling parameter. The typical display of ROC curves is if no help to physicians because it obscures this relationship by failing to include annotations with the threshold values.
--
David Winsemius, MD
>
>
>> head(xx,30)
> [,1] [,2] [,3]
> [1,] 0 1 0
> [2,] 0 1 0
> [3,] 0 1 0
> [4,] 0 1 0
> [5,] 0 1 0
> [6,] 0 1 0
> [7,] 1 1 1
> [8,] 0 1 0
> [9,] 1 1 1
> [10,] 0 1 0
> [11,] 1 1 1
> [12,] 0 1 0
> [13,] 0 1 0
> [14,] 1 1 1
> [15,] 0 1 0
> [16,] 1 1 1
> [17,] 0 1 0
> [18,] 0 1 0
> [19,] 0 1 0
> [20,] 0 1 0
> [21,] 0 1 0
> [22,] 0 1 0
> [23,] 0 1 0
> [24,] 1 1 1
> [25,] 0 1 0
> [26,] 0 1 0
> [27,] 0 1 0
> [28,] 0 1 0
> [29,] 0 1 0
> [30,] 0 1 0
>
> I know I could do this task by creating two actual vectors of random numbers and using them to assign probabilities, but am wondering if there is a simpler and more elegant way to accomplish the task (and this would provide some insight into how ifelse is working within this function).
>
> Thanks for any assistance.
>
>
> Jason Stout, MD, MHS
> Box 102359-DUMC
> Durham, NC 27710
> FAX 919-681-7494
>
> [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
David Winsemius
Alameda, CA, USA
More information about the R-help
mailing list