<rdf:RDF xmlns:community="http://www.bibsonomy.org/ontologies/2008/05/community#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:admin="http://webns.net/mvcb/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:cc="http://web.resource.org/cc/" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" xmlns:swrc="http://swrc.ontoware.org/ontology#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns="http://purl.org/rss/1.0/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xml:base="http://www.bibsonomy.org/user/dzerbino/Software"><owl:Ontology rdf:about=""><rdfs:comment>BibSonomy publications for /user/dzerbino/Software</rdfs:comment><owl:imports rdf:resource="http://swrc.ontoware.org/ontology/portal"/></owl:Ontology><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2150cd10c40aace2c238aaa20c8480e08/dzerbino"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2150cd10c40aace2c238aaa20c8480e08/dzerbino"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Article"/><swrc:date>Mon Sep 17 20:19:41 CEST 2007</swrc:date><swrc:journal>BMC Bioinformatics</swrc:journal><swrc:month>Feb</swrc:month><swrc:pages>64</swrc:pages><swrc:title>Minimus: a fast, lightweight genome assembler</swrc:title><swrc:volume>8</swrc:volume><swrc:year>2007</swrc:year><swrc:keywords>Chromosome Software Alignment, Base Mapping, Algorithms, Data, User-Computer Analysis, Sequence, Sequence DNA, Design, Software, Interface Molecular </swrc:keywords><swrc:abstract>BACKGROUND: Genome assemblers have grown very large and complex in response to the need for algorithms to handle the challenges of large whole-genome sequencing projects. Many of the most common uses of assemblers, however, are best served by a simpler type of assembler that requires fewer software components, uses less memory, and is far easier to install and run. RESULTS: We have developed the Minimus assembler to address these issues, and tested it on a range of assembly problems. We show that Minimus performs well on several small assembly tasks, including the assembly of viral genomes, individual genes, and BAC clones. In addition, we evaluate Minimus&#039; performance in assembling bacterial genomes in order to assess its suitability as a component of a larger assembly pipeline. We show that, unlike other software currently used for these tasks, Minimus produces significantly fewer assembly errors, at the cost of generating a more fragmented assembly. CONCLUSION: We find that for small genomes and other small assembly tasks, Minimus is faster and far more flexible than existing tools. Due to its small size and modular design Minimus is perfectly suited to be a component of complex assembly pipelines. Minimus is released as an open-source software project and the code is available as part of the AMOS project at Sourceforge.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="1471-2105-8-64" swrc:key="pii"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="17324286" swrc:key="pmid"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA. dsommer@umiacs.umd.edu &lt;dsommer@umiacs.umd.edu&gt;" swrc:key="affiliation"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="English" swrc:key="language"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="papers://055852FE-1648-42FE-91D0-8CA474D2B905/Paper/p22" swrc:key="uri"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="file://localhost/Users/danielzerbino/Documents/Papers/2007/Sommer/BMC%20Bioinformatics%202007%20Sommer.pdf" swrc:key="url"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1186/1471-2105-8-64" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Daniel D Sommer"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Arthur L Delcher"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Steven L Salzberg"/></rdf:_3><rdf:_4><swrc:Person swrc:name="Mihai Pop"/></rdf:_4></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2ccc1431d60d762580edc392082a74be9/dzerbino"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2ccc1431d60d762580edc392082a74be9/dzerbino"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Article"/><swrc:date>Mon Sep 17 20:19:41 CEST 2007</swrc:date><swrc:journal>Genome Res</swrc:journal><swrc:month>Feb</swrc:month><swrc:number>2</swrc:number><swrc:pages>330--40</swrc:pages><swrc:title>ProbCons: Probabilistic consistency-based multiple sequence alignment</swrc:title><swrc:volume>15</swrc:volume><swrc:year>2005</swrc:year><swrc:keywords>Sequence, Algorithms, Acid Computational (Genetics) Software, Validation, Software Alignment, Benchmarking, Variation Molecular Protein, Data, Sequence Amino Biology, Databases, </swrc:keywords><swrc:abstract>To study gene evolution across a wide range of organisms, biologists need accurate tools for multiple sequence alignment of protein families. Obtaining accurate alignments, however, is a difficult computational problem because of not only the high computational cost but also the lack of proper objective functions for measuring alignment quality. In this paper, we introduce probabilistic consistency, a novel scoring function for multiple sequence comparisons. We present ProbCons, a practical tool for progressive protein multiple sequence alignment based on probabilistic consistency, and evaluate its performance on several standard alignment benchmark data sets. On the BAliBASE, SABmark, and PREFAB benchmark alignment databases, ProbCons achieves statistically significant improvement over other leading methods while maintaining practical speed. ProbCons is publicly available as a Web resource.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="15/2/330" swrc:key="pii"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="15687296" swrc:key="pmid"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="2" swrc:key="issue"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="Department of Computer Science, Stanford University, Stanford, California 94305, USA." swrc:key="affiliation"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="English" swrc:key="language"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="papers://055852FE-1648-42FE-91D0-8CA474D2B905/Paper/p2" swrc:key="uri"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="file://localhost/Users/danielzerbino/Documents/Papers/2005/Do/Genome%20Res%202005%20Do.pdf" swrc:key="url"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1101/gr.2821705" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Chuong B Do"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Mahathi S P Mahabhashyam"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Michael Brudno"/></rdf:_3><rdf:_4><swrc:Person swrc:name="Serafim Batzoglou"/></rdf:_4></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/21dc921e2ef4587944697d75bf48c2db4/dzerbino"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/21dc921e2ef4587944697d75bf48c2db4/dzerbino"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Article"/><swrc:date>Mon Sep 17 20:19:41 CEST 2007</swrc:date><swrc:journal>J Comput Biol</swrc:journal><swrc:month>May</swrc:month><swrc:number>4</swrc:number><swrc:pages>394--407</swrc:pages><swrc:title>Gene maps linearization using genomic rearrangement distances</swrc:title><swrc:volume>14</swrc:volume><swrc:year>2007</swrc:year><swrc:keywords>Algorithms, Computational Genome, Biology, Software Genomics, DNA, Analysis, Sequence </swrc:keywords><swrc:abstract>A preliminary step to most comparative genomics studies is the annotation of chromosomes as ordered sequences of genes. Different genetic mapping techniques often give rise to different maps with unequal gene content and sets of unordered neighboring genes. Only partial orders can thus be obtained from combining such maps. However, once a total order O is known for a given genome, it can be used as a reference to order genes of a closely related species characterized by a partial order P. Our goal is to find a linearization of P that is as close as possible to O, in term of a given genomic distance. We first prove NP-completeness complexity results considering the breakpoint and the common interval distances. We then focus on the breakpoint distance and give a dynamic programming algorithm whose running time is exponential for general partial orders, but polynomial when the partial order is derived from a bounded number of genetic maps. A time-efficient greedy heuristic is then given for the general case and is empirically shown to produce solutions within 10% of the optimal solution, on simulated data. Applications to the analysis of grass genomes are presented.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="17572019" swrc:key="pmid"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="4" swrc:key="issue"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="IGM-LabInfo, UMR CNRS 8049, Universit{\&#039;e" swrc:key="affiliation"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="English" swrc:key="language"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="papers://055852FE-1648-42FE-91D0-8CA474D2B905/Paper/p37" swrc:key="uri"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="file://localhost/Users/danielzerbino/Documents/Papers/2007/Blin/J%20Comput%20Biol%202007%20Blin.pdf" swrc:key="url"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1089/cmb.2007.A002" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Guillaume Blin"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Eric Blais"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Danny Hermelin"/></rdf:_3><rdf:_4><swrc:Person swrc:name="Pierre Guillon"/></rdf:_4><rdf:_5><swrc:Person swrc:name="Mathieu Blanchette"/></rdf:_5><rdf:_6><swrc:Person swrc:name="Nadia El-Mabrouk"/></rdf:_6></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2a94162d90a98876b571cb8460decb1c0/dzerbino"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2a94162d90a98876b571cb8460decb1c0/dzerbino"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Article"/><swrc:date>Mon Sep 17 20:19:41 CEST 2007</swrc:date><swrc:journal>Genome Res</swrc:journal><swrc:month>Jan</swrc:month><swrc:number>1</swrc:number><swrc:pages>149--59</swrc:pages><swrc:title>Hierarchical scaffolding with Bambus</swrc:title><swrc:volume>14</swrc:volume><swrc:year>2004</swrc:year><swrc:keywords>Benchmarking, anthracis Wolbachia, Haplotypes, Bacterial, Mapping, Brucella, Databases, Genetic, Dogs, Computational Software Algorithms, Biology, Genome, Validation, Shewanella, Animals, Contig Staphylococcus, Software, Bacillus Genomics, </swrc:keywords><swrc:abstract>The output of a genome assembler generally comprises a collection of contiguous DNA sequences (contigs) whose relative placement along the genome is not defined. A procedure called scaffolding is commonly used to order and orient these contigs using paired read information. This ordering of contigs is an essential step when finishing and analyzing the data from a whole-genome shotgun project. Most recent assemblers include a scaffolding module; however, users have little control over the scaffolding algorithm or the information produced. We thus developed a general-purpose scaffolder, called Bambus, which affords users significant flexibility in controlling the scaffolding parameters. Bambus was used recently to scaffold the low-coverage draft dog genome data. Most significantly, Bambus enables the use of linking data other than that inferred from mate-pair information. For example, the sequence of a completed genome can be used to guide the scaffolding of a related organism. We present several applications of Bambus: support for finishing, comparative genomics, analysis of the haplotype structure of genomes, and scaffolding of a mammalian genome at low coverage. Bambus is available as an open-source package from our Web site.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="14/1/149" swrc:key="pii"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="14707177" swrc:key="pmid"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="1" swrc:key="issue"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="The Institute for Genomic Research (TIGR), Rockville, Maryland 20850, USA. mpop@tigr.org" swrc:key="affiliation"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="English" swrc:key="language"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="papers://055852FE-1648-42FE-91D0-8CA474D2B905/Paper/p18" swrc:key="uri"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="file://localhost/Users/danielzerbino/Documents/Papers/2004/Pop/Genome%20Res%202004%20Pop.pdf" swrc:key="url"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1101/gr.1536204" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Mihai Pop"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Daniel S Kosack"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Steven L Salzberg"/></rdf:_3></rdf:Seq></swrc:author></rdf:Description></rdf:RDF>