[BioC] Inconsistency between GRangesList and TranscriptDb object
zhenjiang xu
zhenjiang.xu at gmail.com
Wed Apr 20 17:45:53 CEST 2011
Hi,
txdb reports 6717 transcripts, so why is the GRangesList returned by
transcriptsBy() only 6550 long? Thanks.
> txdb <- makeTranscriptDbFromUCSC(genome="sacCer2", tablename="sgdGene")
Download the sgdGene table ... OK
Download the sgdIsoforms table ... OK
Download the sgdCanonical table ... OK
Extract the 'transcripts' data frame ... OK
Extract the 'splicings' data frame ... OK
Download and preprocess the 'chrominfo' data frame ... OK
Prepare the 'metadata' data frame ... OK
Make the TranscriptDb object ... OK
> tx_by_gene <- transcriptsBy(txdb,'gene')
> txdb
TranscriptDb object:
| Db type: TranscriptDb
| Data source: UCSC
| Genome: sacCer2
| UCSC Table: sgdGene
| Type of Gene ID: ID of canonical transcript in cluster
| Full dataset: yes
| transcript_nrow: 6717
| exon_nrow: 7083
| cds_nrow: 7061
| Db created by: GenomicFeatures package from Bioconductor
| Creation time: 2011-04-20 11:20:56 -0400 (Wed, 20 Apr 2011)
| GenomicFeatures version at creation time: 1.2.3
| RSQLite version at creation time: 0.9-4
| DBSCHEMAVERSION: 1.0
> tx_by_gene
GRangesList of length 6550
$Q0010
GRanges with 2 ranges and 2 elementMetadata values
    seqnames       ranges strand |     tx_id     tx_name
       <Rle>    <IRanges>  <Rle> | <integer> <character>
[1]     chrM [3952, 4338]      + |       122       Q0010
[2]     chrM [4254, 4415]      + |       123       Q0017

$Q0032
GRanges with 1 range and 2 elementMetadata values
    seqnames         ranges strand |     tx_id     tx_name
       <Rle>      <IRanges>  <Rle> | <integer> <character>
[1]     chrM [11667, 11957]      + |       124       Q0032
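
The output above already hints at one source of the difference: $Q0010 holds two transcripts (Q0010 and Q0017), so the list seems to have one element per gene rather than one per transcript. A minimal base-R sketch of that grouping effect follows. It uses a toy data frame in place of the real TranscriptDb; the gene_id column is an assumption read off the printed grouping, and this may not account for the whole gap between 6717 and 6550.

```r
# Toy stand-in for the transcripts table, built from the three transcripts
# printed above. gene_id is an assumption taken from the $Q0010 / $Q0032
# grouping; it is not read from the real txdb.
tx <- data.frame(
  tx_id   = c(122L, 123L, 124L),
  tx_name = c("Q0010", "Q0017", "Q0032"),
  gene_id = c("Q0010", "Q0010", "Q0032"),
  stringsAsFactors = FALSE
)

# Grouping by gene gives one list element per gene, not per transcript.
by_gene <- split(tx, tx$gene_id)

n_transcripts <- nrow(tx)                          # rows in the table
n_genes       <- length(by_gene)                   # elements in the list
tx_per_gene   <- vapply(by_gene, nrow, integer(1)) # transcripts in each gene

print(n_transcripts)
print(n_genes)
print(tx_per_gene)
```

On the real objects the analogous check would be to compare length(tx_by_gene) with the total number of ranges inside the list, for example sum(elementLengths(tx_by_gene)), assuming elementLengths() is available in the installed IRanges version.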
--
Best,
Zhenjiang