[BioC] Limma topTable; fold changes look completely different to the normalized data and Limma fold change
James W. MacDonald
jmacdon at uw.edu
Fri Jul 6 21:36:20 CEST 2012
Hi John,
On 7/6/2012 3:12 PM, john herbert wrote:
> Dear all,
> I have a problem with the log Fold changes calculated in Limma. I am
> using protein abundance index of proteomic data
> The log2 of this data is normally distributed and after log2, I use
> quantile normalization
>
> This is then the data matrix I use as input to Limma
>
>> class(norm_ctw)
> [1] "matrix"
>
>> dim(norm_ctw)
> [1] 683 9
> design<- model.matrix(~ 0+factor(c(1,1,1,2,2,2,3,3,3)))
> colnames(design)<- c("cam", "tumour", "wound")
> fit<- lmFit(norm_ctw, design)
>
> contrast.matrix<- makeContrasts(tumour-wound, tumour-cam, levels=design)
> fit2<- contrasts.fit(fit, contrast.matrix)
> fit2<- eBayes(fit2)
>
> topTable(fit2, coef=1, adjust="BH")
>
> Taking one gene as an example. NAMPT in tumour versus wound and
> calculating fold change by hand of normalized data;
>
>> norm_ctw["NAMPT",]
> cam1 cam2 cam3 tumour1 tumour2 tumour3 wound1
> wound2 wound3
> 19.80164 19.46355 19.26075 22.75347 22.62651 22.39521 16.17398 16.60262 16.72368
>
> In Excel, calculating log2 fold change using Average of Tumour/Average
> of wound =
> T1 22.75347 T2 22.62651 T3 22.39521 W1 16.17398 W2 16.60262 W3 16.72368
> Tumour average = 22.59173
> Wound average = 16.50009333
> Log2 Fold change = 0.453320567
Wait a minute... Are these data logged or not? You say above that you
take logs and then normalize, and then you present some data that would
be really big if they were log2 variates (but then I have no idea of the
scale for protein abundance data).
Anyway, you are acting like these data are not logged, whereas limma
assumes they are. So you either have to take logs before feeding into
limma, or you need to compute the fold change by subtraction (if the
data above are already logged).
Best,
Jim
>
>
> However, from TopTable....
>> topTable(fit2,coef=1)
> ID logFC AveExpr t P.Value adj.P.Val B
> 431 NAMPT 6.091632 19.53349 20.16810 2.688444e-09 1.750946e-06 11.409857
>
> > From toptable, NAMPT has an apparent log2 FC of 6 or 64 fold change
> but that is impossible right??
>
> Please can someone explain if I am using Limma wrong or how the fold
> change can be massively different between "by hand" and with Limma.
>
> Thank you very much for any advice.
>
> John.
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
--
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099
More information about the Bioconductor
mailing list