Difference between revisions of "CH391L/UTpond"
From Marcotte Lab
< CH391L
(→Analysis tool) |
|||
(3 intermediate revisions by one user not shown) | |||
Line 5: | Line 5: | ||
* V3BC24: 562,803 (F3), 525,892 (F5), 588,304 (raw) | * V3BC24: 562,803 (F3), 525,892 (F5), 588,304 (raw) | ||
* V3BC25: 569,125 (F3), 531,015 (F5), 595,064 (raw) | * V3BC25: 569,125 (F3), 531,015 (F5), 595,064 (raw) | ||
+ | You can download CSFASTA files from [http://www.marcottelab.org/users/CH391L/UTpond/raw/ here]. | ||
== Databases == | == Databases == | ||
− | * Prokaryotes: | + | * Prokaryotes: http://www.marcottelab.org/users/CH391L/UTpond/NCBI.bacteria/NCBI_bacteria_rep.tgz <b>(835 MB)</b> (870 genomes) |
− | * Eukaryotes: | + | ** Source: ftp://ftp.ncbi.nih.gov/genomes/Bacteria/ (2011-Feb-28 version). |
− | * | + | ** Select one chromosome per species randomly. |
+ | * Eukaryotes (Organelle): http://www.marcottelab.org/users/CH391L/UTpond/ENA.organelle/ENA_organelle.fna.gz (2,982 genomes) | ||
+ | ** Source: http://www.ebi.ac.uk/genomes/organelle.html | ||
+ | == Mapping results (SAM file + read_freq) == | ||
+ | * Prokaryotes: http://www.marcottelab.org/users/CH391L/UTpond/NCBI.bacteria/ | ||
+ | * Eukaryotes: http://www.marcottelab.org/users/CH391L/UTpond/ENA.organelle/ | ||
== Analysis tool == | == Analysis tool == | ||
Line 17: | Line 23: | ||
** [http://bioinformatics.oxfordjournals.org/content/25/16/2078.long Paper] | ** [http://bioinformatics.oxfordjournals.org/content/25/16/2078.long Paper] | ||
* SHRiMP: Alignment tool http://compbio.cs.toronto.edu/shrimp/ | * SHRiMP: Alignment tool http://compbio.cs.toronto.edu/shrimp/ | ||
− | ** <b><font color='red'>Never, ever run this program on cygnus. It needs lots of memory, so easily kills the computer.</font></b> | + | ** <b><font color='red'>Never, ever try to run this program on cygnus. It needs lots of memory, so easily kills the computer.</font></b> |
** [http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1000386 Paper for original version] | ** [http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1000386 Paper for original version] | ||
** [http://bioinformatics.oxfordjournals.org/content/early/2011/01/28/bioinformatics.btr046 Paper for version 2] | ** [http://bioinformatics.oxfordjournals.org/content/early/2011/01/28/bioinformatics.btr046 Paper for version 2] |
Latest revision as of 16:43, 28 March 2011
Contents |
Number of reads
- V3BC21: 213,419 (F3), 199,158 (F5), 223,100 (raw)
- V3BC22: 634,431 (F3), 595,251 (F5), 661,120 (raw)
- V3BC23: 1,015,471 (F3), 951,357 (F5), 1,060,308 (raw)
- V3BC24: 562,803 (F3), 525,892 (F5), 588,304 (raw)
- V3BC25: 569,125 (F3), 531,015 (F5), 595,064 (raw)
You can download CSFASTA files from here.
Databases
- Prokaryotes: http://www.marcottelab.org/users/CH391L/UTpond/NCBI.bacteria/NCBI_bacteria_rep.tgz (835 MB) (870 genomes)
- Source: ftp://ftp.ncbi.nih.gov/genomes/Bacteria/ (2011-Feb-28 version).
- Select one chromosome per species randomly.
- Eukaryotes (Organelle): http://www.marcottelab.org/users/CH391L/UTpond/ENA.organelle/ENA_organelle.fna.gz (2,982 genomes)
Mapping results (SAM file + read_freq)
- Prokaryotes: http://www.marcottelab.org/users/CH391L/UTpond/NCBI.bacteria/
- Eukaryotes: http://www.marcottelab.org/users/CH391L/UTpond/ENA.organelle/
Analysis tool
- SAM : alignment report format
- SHRiMP: Alignment tool http://compbio.cs.toronto.edu/shrimp/
- Never, ever try to run this program on cygnus. It needs lots of memory, so easily kills the computer.
- Paper for original version
- Paper for version 2
$ gsmapper-cs foobar.csfasta db.fasta -E -N 4 -o 1000 -h 80% >foobar.gsmapper_sam 2>foobar.gsmapper_log