[BioC] Many missing GO terms from GOHyperG call using mgu74av2
Dick Beyer
dbeyer at u.washington.edu
Wed Jun 29 18:14:25 CEST 2005
Hi John,
Thanks for the good suggestion. Is there a way to tell what the date of the GO information is used in a particular build of an annotation package? I'm asking so I can figure out if doing my own build at any particular time is warranted.
Thanks very much,
Dick
*******************************************************************************
Richard P. Beyer, Ph.D. University of Washington
Tel.:(206) 616 7378 Env. & Occ. Health Sci. , Box 354695
Fax: (206) 685 4696 4225 Roosevelt Way NE, # 100
Seattle, WA 98105-6099
http://depts.washington.edu/ceeh/ServiceCores/FC5/FC5.html
http://staff.washington.edu/~dbeyer
*******************************************************************************
On Wed, 29 Jun 2005, John Zhang wrote:
>
>> In the course of using GOHyperG, I checked my results against results from
> NETAFFX and found large discrepancies in the number of GO terms returned for
> some probeids.
>
> Unless we build the annotation packages frequently (every week?), there will be
> descripancies. You may try to build your own annotation packages to minimize the
> descripancies.
>
>>
>> Here is an example code snippet that illustrates my point:
>>
>> library(affy)
>> library(mgu74av2)
>> library(GOstats)
>> library(annaffy)
>> smallList <- c("160102_at")
>> myLL <- unlist(mget(smallList, mgu74av2LOCUSID))
>> sum(duplicated(myLL))
>> length(myLLunique<- !duplicated(mget(smallList, mgu74av2LOCUSID)))
>> bphyper <- GOHyperG(myLL[myLLunique], lib="mgu74av2", what="BP")
>>
>> sort(names(bphyper$go2Affy))
>> [1] "GO:0006457" "GO:0007582" "GO:0008150" "GO:0008152" "GO:0009987"
> "GO:0019538"
>> [7] "GO:0043170" "GO:0050875"
>>
>> #From NETAFFX for 160102_at
>> #"GO:0006457" "GO:0007582" "GO:0008150" "GO:0008152" "GO:0009987" "GO:0019538"
>> #"GO:0043170" "GO:0050875"
>> #"GO:0044238" "GO:0044237" "GO:0044260" "GO:0044260" "GO:0044267
>>
>> Notice that NETAFFX returns 5 additional GO terms that are missing from the
> results from the GOHyperG call.
>>
>> Here is my sessionInfo:
>>
>>> sessionInfo()
>> R version 2.1.0, 2005-04-18, i386-pc-mingw32
>>
>> attached base packages:
>> [1] "splines" "tools" "methods" "stats" "graphics"
>> [6] "grDevices" "utils" "datasets" "base"
>>
>> other attached packages:
>> annaffy KEGG hgu95av2 GOstats multtest genefilter
>> "1.0.18" "1.6.8" "1.8.4" "1.1.2" "1.6.0" "1.6.1"
>> survival xtable RBGL annotate GO XML
>> "2.17" "1.2-5" "1.3.7" "1.5.15" "1.6.8" "0.95-6"
>> graph Ruuid cluster mgu74av2 affy reposTools
>> "1.5.0" "1.5.0" "1.9.8" "1.8.4" "1.6.7" "1.5.19"
>> Biobase
>> "1.5.12"
>>
>> Is this a problem with the mgu74av2 metadata, or with GOHyperG?
>>
>> Thanks much,
>> Dick
>> *******************************************************************************
>> Richard P. Beyer, Ph.D. University of Washington
>> Tel.:(206) 616 7378 Env. & Occ. Health Sci. , Box 354695
>> Fax: (206) 685 4696 4225 Roosevelt Way NE, # 100
>> Seattle, WA 98105-6099
>> http://depts.washington.edu/ceeh/ServiceCores/FC5/FC5.html
>> http://staff.washington.edu/~dbeyer
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at stat.math.ethz.ch
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>
> Jianhua Zhang
> Department of Medical Oncology
> Dana-Farber Cancer Institute
> 44 Binney Street
> Boston, MA 02115-6084
>
>
More information about the Bioconductor
mailing list