RNAstructure Command Line Help: bifold

RNAstructure Command Line Help
bifold and bifold-smp

bifold is used to predict lowest free energy structures containing two strands. The structures can contain intramolecular pairs. A heuristic is used to predict suboptimal structures.
Note that output is written to a CT file where the sequences are concatenated, with an intermolecular linker between them ("III"). bifold-smp is a parallel processing version for use on multi-core computers, built using OpenMP.

Optionally, --list indicates that a file with a list of sequence pairs will be read and the output will be the folding free energy changes.

USAGE: bifold <seq file 1> <seq file 2> <ct file> [options]

OR: bifold-smp <seq file 1> <seq file 2> <ct file> [options]

Required parameters:

<seq file 1>	The name of a sequence file containing input data for the first sequence or the filename of a text file list of oligonucleotide pairs (if --list is used). Note that lowercase nucleotides are forced single-stranded in structure prediction.
<seq file 2>	The name of a sequence file containing input data for the second sequence. Note that lowercase nucleotides are forced single-stranded in structure prediction.
<ct file>	The name of a CT file to which output will be written or the name of a text file to which folding free energies will be written (if --list is used).

Options that do not require added values:

-a, -A, --alphabet	Specify the name of a folding alphabet and associated nearest neighbor parameters. The alphabet is the prefix for the thermodynamic parameter files, e.g. "rna" for RNA parameters or "dna" for DNA parameters or a custom extended/modified alphabet. The thermodynamic parameters need to reside in the at the location indicated by environment variable DATAPATH. The default is "rna" (i.e. use RNA parameters). This option overrides the --DNA flag.
-d, -D, --DNA	Specify that the sequence is DNA, and DNA parameters are to be used. Note that the backbone type must be the same for both sequences. Default is to use RNA parameters.
-h, -H, --help	Display the usage details message.
-i, -I, --intramolecular	Forbid intramolecular pairs (pairs within the same strand). Default is to allow intramolecular pairs.
--list	Specify that list of oligonucleotide pairs will be used as input rather than two sequence files. <seq file 1> then specifies the list. <seq file 2>then is unused (but must be specified with some text). <ct file> is the name of a file that will contain the folding free energies for each pair of sequences. The input file format is specified below in Notes. This is substantially faster than invoking bifold for each pair of sequences seperately because the thermodynamic parameters will only be read once.

Options that require added values:

-l, -L, --loop	Specify a maximum internal/bulge loop size. Default is 30 unpaired nucleotides.
-m, -M, --maximum	Specify a maximum number of structures. Note that suboptimal structures are generated until either the maximum number of structures are reached or the maximum percent difference is reached (below). Default is 20 structures.
-p, -P, --percent	Specify a maximum percent energy difference in free energy of suboptimal structures above the lowest free energy change. Note that suboptimal structures are generated until either the maximum percent free energy difference is reached or until the maximum number of structures is reached (above). Default is 10 percent (specified as 10, not 0.1).
-s, -S, --save	Specify the name of a save file, needed for dot plots and refolding. Default is not to generate a save file.
-t, -T, --temperature	Specify the temperature at which calculation takes place in Kelvin. Default is 310.15 K, which is 37 degrees C.
-w, -W, --window	Specify a window size. Default is 0 nucleotides.

Notes for using --list:

When a list of oligonucleotide pairs is provided (specifed by --list), the input is a plain text file with the format of (where any whitespace between the two sequences will work):

CGGGGAAG CUUCCCCG
GGGGGG  CCCAACCCC
GGCGUGAGAAAC GGCGUGAGAAAC

The output is then the folding free energies of duplexes predicted by free energy minimization.

Notes for smp:

bifold-smp, by default, will use all available compute cores for processing. The number of cores used can be controlled by setting the OMP_NUM_THREADS environment variable.

References:

Reuter, J.S. and Mathews, D.H.
"RNAstructure: software for RNA secondary structure prediction and analysis."
BMC Bioinformatics, 11:129. (2010).
Mathews, D.H., Burkard, M.E., Freier, S.M., Wyatt, J.R. and Turner, D.H.
"Predicting oligonucleotide affinity to nucleic acid targets."
RNA, 5:1458-1469. (1999).