bifold is used to predict lowest free energy structures containing two strands. The structures can contain intramolecular pairs. A heuristic is used to predict suboptimal structures.
Note that output is written to a CT file where the sequences are concatenated, with an intermolecular linker between them ("III").
bifold-smp is a parallel processing version for use on multi-core computers, built using OpenMP.
Optionally, --list indicates that a file with a list of sequence pairs will be read and the output will be the folding free energy changes.
USAGE: bifold <seq file 1> <seq file 2> <ct file> [options]
OR: bifold-smp <seq file 1> <seq file 2> <ct file> [options]
<seq file 1> |
The name of a sequence file containing input data for the first sequence or the filename of a text file list of oligonucleotide pairs (if --list is used).
Note that lowercase nucleotides are forced single-stranded in structure prediction.
|
<seq file 2> |
The name of a sequence file containing input data for the second sequence.
Note that lowercase nucleotides are forced single-stranded in structure prediction.
|
<ct file> |
The name of a CT file to which output will be written or the name of a text file to which folding free energies will be written (if --list is used). |
-a, -A, --alphabet |
Specify the name of a folding alphabet and associated
nearest neighbor parameters. The alphabet is the prefix
for the thermodynamic parameter files, e.g. "rna" for RNA
parameters or "dna" for DNA parameters or a custom
extended/modified alphabet. The thermodynamic parameters
need to reside in the at the location indicated by
environment variable DATAPATH.
The default is "rna" (i.e. use RNA parameters). This
option overrides the --DNA flag. |
-d, -D, --DNA |
Specify that the sequence is DNA, and DNA parameters are to be used.
Note that the backbone type must be the same for both sequences.
Default is to use RNA parameters.
|
-h, -H, --help |
Display the usage details message. |
-i, -I, --intramolecular |
Forbid intramolecular pairs (pairs within the same strand).
Default is to allow intramolecular pairs.
|
--list |
Specify that list of oligonucleotide pairs will be used as input rather than two sequence files. <seq file 1> then specifies the list. <seq file 2>then is unused (but must be specified with some text). <ct file> is the name of a file that will contain the folding free energies for each pair of sequences. The input file format is specified below in Notes. This is substantially faster than invoking bifold for each pair of sequences seperately because the thermodynamic parameters will only be read once. |
-l, -L, --loop |
Specify a maximum internal/bulge loop size.
Default is 30 unpaired nucleotides.
|
-m, -M, --maximum |
Specify a maximum number of structures. Note that suboptimal structures are generated until either the maximum number of structures are reached or the maximum percent difference is reached (below).
Default is 20 structures.
|
-p, -P, --percent |
Specify a maximum percent energy difference in free energy of suboptimal structures above the lowest free energy change. Note that suboptimal structures are generated until either the maximum percent free energy difference is reached or until the maximum number of structures is reached (above).
Default is 10 percent (specified as 10, not 0.1).
|
-s, -S, --save |
Specify the name of a save file, needed for dot plots and refolding.
Default is not to generate a save file.
|
-t, -T, --temperature |
Specify the temperature at which calculation takes place in Kelvin.
Default is 310.15 K, which is 37 degrees C.
|
-w, -W, --window |
Specify a window size.
Default is 0 nucleotides.
|
When a list of oligonucleotide pairs is provided (specifed by --list), the input is a plain text file with the format of (where any whitespace between the two sequences will work):
CGGGGAAG CUUCCCCG GGGGGG CCCAACCCC GGCGUGAGAAAC GGCGUGAGAAAC
The output is then the folding free energies of duplexes predicted by free energy minimization.
bifold-smp, by default, will use all available compute cores for processing. The number of cores used can be controlled by setting the OMP_NUM_THREADS environment variable.
-
Reuter, J.S. and Mathews, D.H.
"RNAstructure: software for RNA secondary structure prediction and analysis."
BMC Bioinformatics, 11:129. (2010).
-
Mathews, D.H., Burkard, M.E., Freier, S.M., Wyatt, J.R. and Turner, D.H.
"Predicting oligonucleotide affinity to nucleic acid targets."
RNA, 5:1458-1469. (1999).
|