This is an example of an ADAM RFC. It is meant to demonstrate syntax of an RFC as well as example headings and structure. You can view the source for this RFC online.
The following schema defines how to store FASTA formatted data in ADAM. The following schema captures all FASTA content.
1 2 3 4 5 6
All fields are optional and default to
end field can be elided as it can be inferred from the
However, for performance, a pushdown predicate on the start and end position would be
faster than materializing the sequence.
A commonly used predicate would be to find all sequences with a specific description that
end within a specified range.
- Reading the entire sequence
- Reading a portion of the sequence that falls in a specified range.
- Should we require that the
endfield is always specified for performance?
- Should we break up the description into the superset of all sequence identifiers?
Once coverted to ADAM, the file extension will be