[BioC] annotation for TAqman assay IDs

James W. MacDonald jmacdon at uw.edu
Tue Jul 29 18:26:02 CEST 2014


Hi Bernard,

On 7/29/2014 11:46 AM, Bernard North wrote:
> I have a list of Taqman assay IDs that I am trying to annotate eg MSMB-Hs00159303_m1. They are all exon-spanning probe assays (ie ending _m1) as described here  http://www.lifetechnologies.com/uk/en/home/communities-social/blog/blogs/the-hows-whats-and-whys-of-gene-expression-assay-design.html
>
> The Life Technologies website does provides some information for each assay at  https://www.lifetechnologies.com/us/en/home/life-science/pcr/real-time-pcr/real-time-pcr-assays.html.  After searching for gene expression assays for MSMB and clicking on "gene expression" a list of assays appears eg  Hs00159303_m1 and when I click on "view details" I'm able to discover the Entrez ID is 4477, the UniGene ID is Hs.255462, as well as other useful information.
> I wondered if there was any automatic way via a bioconductor function of obtaining such useful annotation of Taqman assay IDs. Often the part of the assay name before the -Hs seems to be the HUGO gene symbol (so that lends itself to BIoMart) but that's not always the case eg NKX3-1-Hs00171834_m1 (NKX3-1 is not a HUGO symbol).
>

It appears that they are appending the HUGO symbol, as NKX3-1 is a HUGO 
symbol:

http://www.genenames.org/cgi-bin/gene_symbol_report?q=data/hgnc_data.php&hgnc_id=7838

You could use either biomaRt or the org.Hs.eg.db packages to annotate 
using the gene symbols.

 > select(org.Hs.eg.db, c("NKX3-1","MSMB"), 
c("ENTREZID","SYMBOL","GENENAME"), "ALIAS")
    ALIAS ENTREZID SYMBOL                  GENENAME
1 NKX3-1     4824 NKX3-1            NK3 homeobox 1
2   MSMB     4477   MSMB microseminoprotein, beta-

But using symbols is not guaranteed to get you 100% accurate mappings, 
so caveat emptor.

Best,

Jim


> Many thanks in advance
>
>
> 	[[alternative HTML version deleted]]
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
>

-- 
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099



More information about the Bioconductor mailing list