Mapping sequenced E.coli genes by computer: Software, strategies and examples

Kenneth E. Rudd, Webb Miller, Craig Werner, James Ostell, Carolyn Tolstoshev, Steven G. Satterfield

Research output: Contribution to journalArticlepeer-review

53 Scopus citations

Abstract

Methods are presented for organizing and integrating DNA sequence data, restriction maps, and genetic maps for the same organism but from a variety of sources (databases, publications, personal communications). Proper software tools are essential for successful organization of such diverse data into an ordered, cohesive body of information, and a suite of novel software to support this endeavor is described. Though these tools automate much of the task, a variety of strategies is needed to cope with recalcitrant cases. We describe such strategies and illustrate their application with numerous examples. These strategies have allowed us to order, analyze, and display over one megabase of E. coli DNA sequence information. The integration task often exposes inconsistencies in the available data, perhaps caused by strain polymorphisms or human oversight, necessitating the application of sound biological judgment. The examples illustrate both the level of expertise required of the database curator and the knowledge gained as apparent inconsistencies are resolved. The software and mapping methods are applicable to the study of any genome for which a high resolution restriction map is available. They were developed to support a weakly coordinated sequencing effort involving many laboratories, but would also be useful for highly orchestrated sequencing projects.

Original languageEnglish (US)
Pages (from-to)637-647
Number of pages11
JournalNucleic acids research
Volume19
Issue number3
DOIs
StatePublished - Feb 11 1991

ASJC Scopus subject areas

  • Genetics

Fingerprint Dive into the research topics of 'Mapping sequenced E.coli genes by computer: Software, strategies and examples'. Together they form a unique fingerprint.

Cite this