[BioC] Quick question about Chromosome names using Tophat2 and Cufflinks w/Ensembl NCBIM37
James W. MacDonald
jmacdon at uw.edu
Mon Mar 10 15:03:54 CET 2014
Hi Sindre,
Why are you asking about tophat and cufflinks on a BioC listserv? These
are not part of BioC, nor are they even R packages. In addition, they
have their own list where you can ask questions.
Best,
Jim
On 3/10/2014 3:36 AM, Sindre Lee wrote:
> Tophat2 cmd:
>
> tophat2 -o path/to
> --transcriptome-index=/Mus_musculus_Ensembl_NCBIM37/Mus_musculus/Ensembl/NCBIM37/Annotation/Genes/genes
> /Mus_musculus_Ensembl_NCBIM37/Mus_musculus/Ensembl/NCBIM37/Sequence/Bowtie2Index/genome
> 001.fastq.gz
>
> Cufflinks cmd:
>
> cufflinks --output-dir path/to --GTF-guide
> /Mus_musculus_Ensembl_NCBIM37/Mus_musculus/Ensembl/NCBIM37/Annotation/Archives/archive-2013-03-06-18-55-12/Genes/genes.gtf
> --frag-bias-correct
> /Mus_musculus_Ensembl_NCBIM37/Mus_musculus/Ensembl/NCBIM37/Sequence/Bowtie2Index/genome.fa
> --multi-read-correct --upper-quartile-norm --verbose accepted_hits.bam
>
> Cufflinks then names the chromosomes like this, is that correct? How
> can I control this or alter it?
>
> 20: MT
> 10: 18
> 11: 19
> 4: 12
> 5: 13
> 2: 10
> 3: 11
> 8: 16
> 9: 17
> 6: 14
> 7: 15
> 22: Y
> 21: X
> 19: 9
> 18: 8
> 17: 7
> 16: 6
> 15: 5
> 14: 4
> 13: 3
> 12: 2
> 1: 1
>
> Thank you!
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives:
> http://news.gmane.org/gmane.science.biology.informatics.conductor
--
James W. MacDonald, M.S.
Biostatistician
University of Washington
Environmental and Occupational Health Sciences
4225 Roosevelt Way NE, # 100
Seattle WA 98105-6099
More information about the Bioconductor
mailing list