RNAstructure Command Line Help |
||||||
TurboFold predicts the common structure for multiple sequences (two or more), using one of three folding modes. It has two distinct executables, a serial program called TurboFold and a parallelized program called TurboFold-smp for use in shared memory environments. USAGE 1: TurboFold <configuration file>USAGE 2: TurboFold-smp <configuration file>Required parameters:
Options that do not require added values:
Options which require added values:NONEConfiguration file format:The following is a description of valid options allowed in the configuration file. ################################################################ # IMPORTANT CONFIG FILE FORMAT NOTES: # # Config file options described below are not case sensitive. # # Option lines may be specified by the option name followed by an equals sign and the option's desired value. # When specifying an option, there may be nothing else on the line. # <option> = <value> # # Specifying comment lines: # Comment lines must begin with "#" followed by a space. # There may not be more than one "#" in a comment line. # However, a comment line may be an unbroken string of "#", as in a divider between sets of options. # # Blank lines are skipped. ################################################################ ################################################################ # Input options ################################################################ # Mode specifies the resolving algorithm TurboFold uses after its initial fold. # A valid mode is required for TurboFold to run properly. # Valid modes can be one of three options: # 1. MEA (Maximum expected accuracy) # 2. ProbKnot (For pseudoknotted sequences) # 3. Threshold (Finding most probable pairs) # Modes should be specified as text strings: MEA, ProbKnot, or Threshold. # The default mode is MEA. Mode = <mode specification> # There are two separate ways to specify a group of input sequences: # 1. Place sequence file names in brackets separated by semicolons. # Note that there cannot be any spaces whatsoever between the brackets. # The list must hold "SequenceNumber" sequences. # 2. Each successive sequence "n" from 1 to SequenceNumber is specified as "Seq<n>." # Note that there cannot be any spaces in the file name. InSeq = {seq1;seq2;seq3;} Seq<n> = <seq file n> # There are two separate ways to specify a group of output CT files: # 1. Place CT file names in brackets separated by semicolons. # Note that there cannot be any spaces whatsoever between the brackets. # The list must hold "SequenceNumber" CT files. # 2. Each successive CT file "n" from 1 to SequenceNumber is specified as "CT<n>." # Note that there cannot be any spaces in the file name. OutCT = {ct1;ct2;ct3;} CT<n> = <ct file n> # SequenceNumber specifies the number of sequences given for calculation. # This is only needed if both sequences and CT files are specified individually. SequenceNumber = <number of sequences> # Note that sequence file, CT file, and sequence number options must appear in one of two valid combinations, depending if files are # specified singly or in a group. # # Singly: # Mode = <mode> # SequenceNumber = <n> # Seq1 = <seq file 1> # CT1 = <ct file n> # ... repeat single entries of Seq and CT until <n> is reached # # Groups: # Mode = <mode> # InSeq = {seq1;seq2;seq3;} # OutCT = {ct1;ct2;ct3;} # Save<1> - Save<SequenceNumber> specifies the partiton function save file names to be outputted for each successive sequence "n" # from 1 to SequenceNumber. # Note that there cannot be any spaces in the file name. # Save files are not required for any sequence. Save<n> = <save file n> # The output multiple sequence alignment filename can be specified. # Default is output.aln. OutAln = filename ################################################################ # TurboFold options ################################################################ # Note that TurboFold options affect output regardless of the mode specified. # Gamma specifies the TurboFold gamma value. # This should not be confused with MeaGamma (below). # Its default value is 0.3. Gamma = 0.3 # Iterations specifies the number of iterations TurboFold goes through. # This should not be confused with PkIterations (below). # Its default value is 3. Iterations = 3 # MaximumPairingDistance specified the maximum distance between nucleotides that can pair. # i.e. for nucleotide i to pair with j, [i - j| < MaximumPairingDistance. # This applies to each sequence. # Its default is no limit, which is indicated by a value of zero. MaximumPairingDistance = 0 # Temperature specifies the temperature at which TurboFold is run, in Kelvin. # Its default value is 310.15 K, which is 37 degrees C. Temperature = 310.15 # Processors specifies the number of processors TurboFold is run on. # Note that this flag only has an effect when TurboFold-smp, the parallel version of TurboFold, is run. # Its default value is 1. Processors = 1 # The format of output multiple sequence alignment can be choosen from Fasta or Clustal. # Default is Clustal. AlnFormat = Fasta # The number of columns of output multiple sequence alignment can be specified. # Default is 60 ColumnNumber = 60 ################################################################ # Maximum expected accuracy (MEA) mode options ################################################################ # Note that the following options only have an effect when MEA mode is specified. # If they are specified when TurboFold is in a different mode, they are ignored. # MaxPercent specifies the maximum percent energy difference. # Its default value is 50 (percent). MaxPercent = 50 # MaxStructures specifies the maximum number of structures to calculate. # Its default value is 1000 structures. MaxStructures = 1000 # MeaGamma specifies the MEA mode gamma value. # This should not be confused with Gamma (above). # Its default value is 1.0. MeaGamma = 1.0 # Window specifies the window size. # Its default value is 5 nucleotides. Window = 5 ################################################################ # Pseudoknot (ProbKnot) mode options ################################################################ # Note that the following options only have an effect when ProbKnot mode is specified. # If they are specified when TurboFold is in a different mode, they are ignored. # MinHelixLength is the minimum helix length allowed during folding. # Its default value is 3 nucleotides. MinHelixLength = 3 # Iterations specifies the number of iterations ProbKnot goes through. # This should not be confused with Iterations (above). # Its default value is 1. PkIterations = 1 ################################################################ # Probable Pairs (Threshold) mode options ################################################################ # Note that the following options only have an effect when Threshold mode is specified. # If they are specified when TurboFold is in a different mode, they are ignored. # Threshold specifies the probability threshold at which pairs are included in a structure. # If a threshold is explicitly specified, it should be expressed as a number >= 0.5 and <= 1.0. # Its default value is 0. # This signifies that structures should be generated at the following thresholds: # >= 0.99, >= 0.97, >= 0.95, >= 0.90, >= 0.80, >= 0.70, >= 0.60, >= 0.50 Threshold = 0 ################################################################ # Rsample mode options (using SHAPE data) ################################################################ # To apply SHAPE data on one or more sequences UseRsample = 1 # Place shape file names in brackets separated by semicolons. # The order is according to the list in InSeq. # Note that there cannot be any spaces whatsoever between the brackets. # For example, to apply SHAPE data on the first three sequences among five input sequences. SHAPEFiles = {shape1;shape2;shape3;;;} # Note that the following options only have an effect when ShapeFile and ShapeSeqIndex are specified. # If they are specified when TurboFold is in a different mode, they are ignored. # To specify a seed number for stochastic sampling in Rsample mode. Seed = 1 # Cparam and Offset specify the SHAPE constraints. # Cparam is used to establish the relationship between free energy and reactivities. # Offset is used to account for the fact that normalized reactivities can be less than zero. # The default values for Cparam is 0.5; for Offset is 1.1. Cparam = 0.5 Offset = 1.1 # NumSamples specify the number of samples for stochastic sampling in Rsample mode. # The default value is 10000. NumSamples = 10000 ################################################################ # Traditional way to use SHAPE data mode options ################################################################ # It is not required to specify any Rsample mode options. # SHAPEintercept specifies the SHAPE intercept used by TurboFold. # Note that if specified, this value is only used if one or more SHAPE files is also specified. # Its default value is -0.6 kcal/mol. SHAPEintercept = -0.6 # SHAPEslope specifies the SHAPE slope used by TurboFold. # Note that if specified, this value is only used if one or more SHAPE files is also specified. # Its default value is 1.8 kcal/mol. SHAPEslope = 1.8 References:
|
||||||
Visit The Mathews Lab RNAstructure Page for updates and latest information. |