[BioC] MEDIPS.createSet error
Vining, Kelly
Kelly.Vining at oregonstate.edu
Mon Mar 31 22:09:58 CEST 2014
Hi,
Thanks for the advice thus far! To confirm what is in my BSgenome variable, I did this:
> class(BSgenome)
[1] "BSgenome"
attr(,"package")
[1] "BSgenome"
> names(BSgenome)
[1] "Chr01" "Chr02" "Chr03" "Chr04" "Chr05" "Chr06" "Chr07" "Chr08" "Chr09"
[10] "Chr10" "Chr11" "Chr12" "Chr13" "Chr14" "Chr15" "Chr16" "Chr17" "Chr18"
[19] "Chr19" "scaf"
And then:
> print(BSgenome)
Black cottonwood genome
|
| organism: Populus trichocarpa (Black cottonwood)
| provider: Phytozome (JGI)
| provider version: 3.0
| release date: January 2010
| release name: Populus trichocarpa v3.0
|
| single sequences (see '?seqnames'):
| Chr01 Chr02 Chr03 Chr04 Chr05 Chr06 Chr07 Chr08 Chr09 Chr10 Chr11
| Chr12 Chr13 Chr14 Chr15 Chr16 Chr17 Chr18 Chr19
|
| multiple sequences (see '?mseqnames'):
| scaf
|
| (use the '$' or '[[' operator to access a given sequence)
So that looks ok. Interestingly, when I followed the vignette and did the equivalent of
BSgenome="BSgenome.Hsapiens.UCSC.hg19"
That didn't work for me. It only worked without quotes. If I included quotes, it just assigned a character vector to that variable.
Then, following your advice:
> si.bsg <- seqinfo(BSgenome.Ptrichocarpa.Phytozome.v3)
> si.bam <- seqinfo(BamFile("FallBud_brep1_aln_sorted.bam"))
Error in seqinfo(BamFile("FallBud_brep1_aln_sorted.bam")) :
error in evaluating the argument 'x' in selecting a method for function 'seqinfo': Error: could not find function "BamFile"
Hmmm....
Continuing down the rabbit hole...
--Kelly
________________________________________
From: mailinglist.honeypot at gmail.com [mailinglist.honeypot at gmail.com] on behalf of Steve Lianoglou [lianoglou.steve at gene.com]
Sent: Monday, March 31, 2014 12:20 PM
To: Vining, Kelly
Cc: Lukas Chavez; bioconductor at r-project.org
Subject: Re: [BioC] MEDIPS.createSet error
Hi,
On Mon, Mar 31, 2014 at 12:01 PM, Vining, Kelly
<Kelly.Vining at oregonstate.edu> wrote:
> Hi Lukas,
> Well, I'm basically back to the original error I had when I started this work back in December. At that point, I'd decided that my bam files must have had chromosome or scaffold names that didn't exist in the reference genome. So I went back to the original data set and redid all of the alignments to make sure the version of the reference genome I was using was the same. It seems that I something is still not matching up.
>
> How can I compare the set of chromosome names between the bam file and the reference genome?
>
>> budFall1_MEDIP = MEDIPS.createSet(file="FallBud_brep1_aln_sorted.bam", BSgenome=BSgenome, extend=300, uniq=TRUE, shift=0, window_size=1000)
> Reading bam alignment FallBud_brep1_aln_sorted.bam
> Total number of imported short reads: 21038273
> Extending reads...
> Creating GRange Object...
> Extract unique regions...
> Number of unique short reads: 17855868
> Error in as.environment(pos) :
> no item called "paste("package:", BSgenome, sep = "")" on the search list
> In addition: Warning message:
> In ls(paste("package:", BSgenome, sep = "")) :
> āpaste("package:", BSgenome, sep = "")ā converted to character string
I'm not so sure that's what this error is telling you.
First check if "BSgenome" is a real thing in your workspace -- you are
passing some variable named "BSgenome" as the BSgenome parameter to
the createSet function.
Skimming the vignette, it looks your BSgenoome variable should be set
to the name of a BSgenome package to use/load.
So, in the same R session where this call is failing, what is the
output of `print(BSgenome)`.
Note how the vignette has this line:
BSgenome="BSgenome.Hsapiens.UCSC.hg19"
Which uses the hg19 genome as the reference.
Anyway, if BSgenome is set correctly for you, try to load it first to
make sure it is installed, eg:
R> library(BSgenome, character.only=TRUE)
Finally, the BSgenome that you load and the BAM file you are parsing
need concordant names for their chromosomes. You can check the
chromosome info in each by invoking the `seqinfo` method.
If I was using the "BSgenome.Hsapiens.UCSC.hg19" package, I'd do:
R> library(Rsamtools)
R> library(BSgenome.Hsapiens.UCSC.hg19)
R> si.bsg <- seqinfo(BSgenome.Hsapiens.UCSC.hg19)
R> si.bam <- seqinfo(BamFile("FallBud_brep1_aln_sorted.bam"))
Now look to see that chromosomes (seqnames) in si.bsg and si.bam are concordant.
HTH,
-steve
--
Steve Lianoglou
Computational Biologist
Genentech
More information about the Bioconductor
mailing list