RNAstructure logo

RNAstructure Command Line Help
DuplexFold

DuplexFold is used to predict lowest free energy structures containing two strands. The structures will not contain intramolecular pairs.
Note that output is written to a CT file where the sequences are concatenated, with an intermolecular linker between them ("III").

Optionally, --list indicates that a file with a list of sequence pairs will be read and the output will be the folding free energy changes.

USAGE: DuplexFold <seq file 1> <seq file 2> <ct file> [options]

Required parameters:

<seq file 1> The name of a sequence file containing input data for the first sequence or the filename of a text file list of oligonucleotide pairs (if --list is used).
Note that lowercase nucleotides are forced single-stranded in structure prediction.
<seq file 2> The name of a sequence file containing input data for the second sequence.
Note that lowercase nucleotides are forced single-stranded in structure prediction.
<ct file> The name of a CT file to which output will be written or the name of a text file to which folding free energies will be written (if --list is used).

Options that do not require added values:

-d, -D, --DNA Specify that the sequence is DNA, and DNA parameters are to be used.
Note that the backbone type must be the same for both sequences.
Default is to use RNA parameters.
-h, -H, --help Display the usage details message.
--list

Specify that list of oligonucleotide pairs will be used as input rather than two sequence files. <seq file 1> then specifies the list. <seq file 2>then is unused (but must be specified with some text). <ct file> is the name of a file that will contain the folding free energies for each pair of sequences. The input file format is specified below in Notes. This is substantially faster than invoking DuplexFold for each pair of sequences seperately because the thermodynamic parameters will only be read once.

Options that require added values:

-a, -A, --alphabet Specify the name of a folding alphabet and associated nearest neighbor parameters. The alphabet is the prefix for the thermodynamic parameter files, e.g. "rna" for RNA parameters or "dna" for DNA parameters or a custom extended/modified alphabet. The thermodynamic parameters need to reside in the at the location indicated by environment variable DATAPATH.
The default is "rna" (i.e. use RNA parameters). This option overrides the --DNA flag.
-l, -L, --loop Specify the maximum number of unpaired nucleotides in an internal or bulge loop.
Default is 6 unpaired nucleotides.
-m, -M, --maximum Specify a maximum number of structures. Note that suboptimal structures are generated until either the maximum number of structures are reached or the maximum percent difference is reached (below).
Default is 10 structures.
-p, -P, --percent Specify a maximum percent difference in folding free energy change in suboptimal structures. Note that suboptimal structures are generated until either the maximum percent free energy difference is reached or until the maximum number of structures is reached (above).
Default is 40 percent (specified as 40, not 0.4).
-t, -T, --temperature Specify the temperature at which calculation takes place in Kelvin.
Default is 310.15 K, which is 37 degrees C.
-w, -W, --window Specify a window size for generating suboptimal structures. Smaller windows (down to zero) will result in a larger set of structures that are similar. Larger windows result in fewer structures that are more different in predicted pairs.
Default is 0 nucleotides.

Notes

When a list of oligonucleotide pairs is provided (specifed by --list), the input is a plain text file with the format of (where any whitespace between the two sequences will work):

CGGGGAAG CUUCCCCG
GGGGGG CCCAACCCC
GGCGUGAGAAAC
GGCGUGAGAAAC

The output is then the folding free energies of duplexes predicted by free energy minimization.

References

  1. Reuter, J.S. and Mathews, D.H.
    "RNAstructure: software for RNA secondary structure prediction and analysis."
    BMC Bioinformatics, 11:129. (2010).
  2. Piekna-Przybylska, D., DiChiacchio, L., Mathews, D.H. and Bambara, R.A.
    "A sequence similar to tRNA3Lys gene is embedded in HIV-1 U3/R and promotes minus strand transfer."
    Nature Structural & Molecular Biology, 17:83-89. (2010).