[BioC] Converting gene names into Illumina IDs
Aliaksei Holik
salvador at bio.bsu.by
Thu Sep 27 18:05:44 CEST 2012
Dear fellow Bioconductors,
I'm faced with a problem I can't get my head round and hope somebody
would be able to point me in the right direction.
I am trying to plot a heatmap using expression values from my array for
an external set of genes. I have used illuminaMousev2ALIAS2PROBE object
to extract IlluminaIDs corresponding to the gene symbols in the set.
Consistent with possibility of more than 1 probe per gene I got 1052
IlluminaIDs for 510 gene names. However, if I try to remove duplicates I
only get 259 IlluminaIDs, which makes no sense to me. I have checked and
I do indeed get a lot of duplicated probe IDs. I wonder where I go
wrong. Here's the code I used:
# Generate a list of gene symbols with corresponding Illumina IDs
xx <- as.list(illuminaMousev2ALIAS2PROBE)
# Subset all Illumina IDs for the genes present in SCSGenes vector
scs.probes.and.genes <- xx[SCSGenes]
# Generate a vector of probes while removing gene names
scs.probes <- as.character(unlist(scs.probes.and.genes)) #1058 probes
scs.probes <- na.omit(scs.probes) #1052 probes
# Remove duplicates
scs.probes <- scs.probes[!duplicated(scs.probes)] #259 probes
# end of code
Any help is much appreciated.
All the best,
Aliaksei.
More information about the Bioconductor
mailing list