Hostname: page-component-78c5997874-mlc7c Total loading time: 0 Render date: 2024-11-10T10:21:08.352Z Has data issue: false hasContentIssue false

RNAML: A standard syntax for exchanging RNA information

Published online by Cambridge University Press:  25 July 2002

ALLISON WAUGH
Affiliation:
Stanford Medical Informatics, Stanford University Medical Center, Stanford, California 94305, USA
PATRICK GENDRON
Affiliation:
Département d'Informatique et de Recherche Opérationnelle, Université de Montréal, Montréal, Québec H3C 3J7, Canada
RUSS ALTMAN
Affiliation:
Stanford Medical Informatics, Stanford University Medical Center, Stanford, California 94305, USA
JAMES W. BROWN
Affiliation:
Department of Microbiology, North Carolina State University, Raleigh, North Carolina 27695, USA
DAVID CASE
Affiliation:
Department of Molecular Biology, The Scripps Research Institute, La Jolla, California 92037, USA
DANIEL GAUTHERET
Affiliation:
Centre National de la Recherche Scientifique, Marseille 28809, France
STEPHEN C. HARVEY
Affiliation:
Department of Biochemistry, University of Alabama at Birmingham, Birmingham, Alabama 35294, USA
NEOCLES LEONTIS
Affiliation:
Department of Chemistry, Bowling Green State University, Bowling Green, Ohio 43403, USA
JOHN WESTBROOK
Affiliation:
Department of Chemistry, Rutgers University, Piscataway, New Jersey 08855, USA
ERIC WESTHOF
Affiliation:
IBMC-Centre National de la Recherche Scientifique, Strasbourg 67084, France
MICHAEL ZUKER
Affiliation:
Mathematical Sciences, Rensselaer Polytechnic Institute, Troy, New York 12180, USA
FRANÇOIS MAJOR
Affiliation:
Département d'Informatique et de Recherche Opérationnelle, Université de Montréal, Montréal, Québec H3C 3J7, Canada
Get access

Abstract

Analyzing a single data set using multiple RNA informatics programs often requires a file format conversion between each pair of programs, significantly hampering productivity. To facilitate the interoperation of these programs, we propose a syntax to exchange basic RNA molecular information. This RNAML syntax allows for the storage and the exchange of information about RNA sequence and secondary and tertiary structures. The syntax permits the description of higher level information about the data including, but not restricted to, base pairs, base triples, and pseudoknots. A class-oriented approach allows us to represent data common to a given set of RNA molecules, such as a sequence alignment and a consensus secondary structure. Documentation about experiments and computations, as well as references to journals and external databases, are included in the syntax. The chief challenge in creating such a syntax was to determine the appropriate scope of usage and to ensure extensibility as new needs will arise. The syntax complies with the eXtensible Markup Language (XML) recommendations, a widely accepted standard for syntax specifications. In addition to the various generic packages that exist to read and interpret XML formats, an XML processor was developed and put in the open-source MC-Core library for nucleic acid and protein structure computer manipulation.

Type
SYNTAX PROPOSAL
Copyright
© 2002 RNA Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)