DuplexFold is used to predict lowest free energy structures containing two strands. The structures will not contain intramolecular pairs.
Note that output is written to a CT file where the sequences are concatenated, with an intermolecular linker between them ("III").
Optionally, --list indicates that a file with a list of sequence pairs will be read and the output will be the folding free energy changes.
USAGE: DuplexFold <seq file 1> <seq file 2> <ct file> [options]
<seq file 1> |
The name of a sequence file containing input data for the first sequence or the filename of a text file list of oligonucleotide pairs (if --list is used).
Note that lowercase nucleotides are forced single-stranded in structure prediction.
|
<seq file 2> |
The name of a sequence file containing input data for the second sequence.
Note that lowercase nucleotides are forced single-stranded in structure prediction.
|
<ct file> |
The name of a CT file to which output will be written or the name of a text file to which folding free energies will be written (if --list is used). |
-d, -D, --DNA |
Specify that the sequence is DNA, and DNA parameters are to be used.
Note that the backbone type must be the same for both sequences.
Default is to use RNA parameters.
|
-h, -H, --help |
Display the usage details message. |
--list |
Specify that list of oligonucleotide pairs will be used as input rather than two sequence files. <seq file 1> then specifies the list. <seq file 2>then is unused (but must be specified with some text). <ct file> is the name of a file that will contain the folding free energies for each pair of sequences. The input file format is specified below in Notes. This is substantially faster than invoking DuplexFold for each pair of sequences seperately because the thermodynamic parameters will only be read once. |
-a, -A, --alphabet |
Specify the name of a folding alphabet and associated
nearest neighbor parameters. The alphabet is the prefix
for the thermodynamic parameter files, e.g. "rna" for RNA
parameters or "dna" for DNA parameters or a custom
extended/modified alphabet. The thermodynamic parameters
need to reside in the at the location indicated by
environment variable DATAPATH.
The default is "rna" (i.e. use RNA parameters). This
option overrides the --DNA flag. |
-l, -L, --loop |
Specify the maximum number of unpaired nucleotides in an internal or bulge loop.
Default is 6 unpaired nucleotides.
|
-m, -M, --maximum |
Specify a maximum number of structures. Note that suboptimal structures are generated until either the maximum number of structures are reached or the maximum percent difference is reached (below).
Default is 10 structures.
|
-p, -P, --percent |
Specify a maximum percent difference in folding free energy change in suboptimal structures. Note that suboptimal structures are generated until either the maximum percent free energy difference is reached or until the maximum number of structures is reached (above).
Default is 40 percent (specified as 40, not 0.4).
|
-t, -T, --temperature |
Specify the temperature at which calculation takes place in Kelvin.
Default is 310.15 K, which is 37 degrees C.
|
-w, -W, --window |
Specify a window size for generating suboptimal structures. Smaller windows (down to zero) will result in a larger set of structures that are similar. Larger windows result in fewer structures that are more different in predicted pairs.
Default is 0 nucleotides.
|
When a list of oligonucleotide pairs is provided (specifed by --list), the input is a plain text file with the format of (where any whitespace between the two sequences will work):
CGGGGAAG CUUCCCCG GGGGGG CCCAACCCC GGCGUGAGAAAC GGCGUGAGAAAC
The output is then the folding free energies of duplexes predicted by free energy minimization.
-
Reuter, J.S. and Mathews, D.H.
"RNAstructure: software for RNA secondary structure prediction and analysis."
BMC Bioinformatics, 11:129. (2010).
-
Piekna-Przybylska, D., DiChiacchio, L., Mathews, D.H. and Bambara, R.A.
"A sequence similar to tRNA3Lys gene is embedded in HIV-1 U3/R and promotes minus strand transfer."
Nature Structural & Molecular Biology, 17:83-89. (2010).
|