New, revolutionary de novo assembly algorithm

ASCII art...
Click to enlarge.
Up until now, de novo assembly of large genomes has required computers with hundreds of gigabytes of RAM. Until recently, all RAM was required to be installed on one computer, but this has changed with ABySS [Simpson et al., 2009], that enables the RAM to be distributed on a number of standard computers.
 

CLC bio’s new de novo assembly algorithm will change the standards of de novo assembly of large genomes by enabling de novo assembly of gigabase-size genomes on a single desktop computer in a reasonable amount of time.

Visit clcdenovo.com for more detail on our de novo assembler

Video interviews with customers on de novo assembly – see for yourself!

 

Single read:

Comparison of ABySS and CLC bio’s de novo assembly on an Illumina Genome Analyzer dataset consisting of approximately 3.6 billion reads resulting in 38 fold coverage of the human genome and a total dataset size of approximately 130 Giga bases in total
CLC bio ABySS
 Number of computers 21 
 Time spent (hours) 15 
 Total RAM needed (GB) 21  336 
 Contigs >= 1000 bp    
   - Number of contigs 638.661  549,522 
   - Mean size (bp) 1,918  1,703 
   - Max size (bp) 18,222  15,911 
   - N50 size (bp) 2,021  1,731 
   - Sum (Gbp) 1.2  0.9 

 

Paired-end reads:

Comparison of ABySS and CLC bio’s de novo assembly on an Illumina Genome Analyzer dataset consisting of approximately 3.6 billion reads resulting in 38 fold coverage of the human genome and a total dataset size of approximately 130 Giga bases in total
CLC bio ABySS
 Number of computers 21 
 Time spent (hours) 80 
 Total RAM needed (GB) 42  336 
 Contigs >= 1000 bp    
   - Number of contigs 655,413  680,230 
   - Mean size (bp) 2,090  2,093 
   - Max size (bp) 32,285  18,800 
   - N50 size (bp) 2,273  2,282 
   - Sum (Gbp) 1.4  1.4 

Details of the benchmarks are described in our white paper on the new de novo assembler.