RNAstructure logo

RNAstructure Command Line Help
ShapeKnots

ShapeKnots is used to predict a set of structures that can contain pseudoknots, modeled after HotKnots (see references below). It can take SHAPE mapping data as a restraint to increase structure prediction accuracy. It can also use user-specified restraints or DMS mapping data.

USAGE: ShapeKnots <seq file> <ct file> [options]

Required parameters:

<seq file> The name of a sequence file containing input data.
Note that lowercase nucleotides are forced single-stranded in structure prediction.
<ct file> The name of a CT file to which output will be written.

Options that do not require added values:

-h, -H, --help Display the usage details message.
-v, -V, --version Display version and copyright information for this interface.

Options that require added values:

-c, -C, --constraint Specify a constraints file to be applied. The constraint file format is defined here.
Default is to have no constraints applied.
-dms, -DMS, --DMS Specify a DMS restraints file to be applied. These restraints are pseudo free energy restraints.
Default is to have no restraints applied.
-dsh, -DSH, --DSHAPE Specify a differential SHAPE data file to be used to generate restraints in addition to SHAPE restraints specified by --SHAPE. These restraints specifically use SHAPE pseudoenergy restraints where the offset is zero. Also, the function for calculating the pseudo free energy for nucleotide i is = (differential slope) * (differential SHAPE for nucleotide i). These pseudoenergies are added to those generated with the --SHAPE option.
Default is no differential SHAPE data file specified.
-dsm, -DSM, --DSHAPEslope Specify a slope used with differential SHAPE restraints.
Default is 2.11 kcal/mol.
-dso, -DSO, --doubleOffset Specify a double-stranded offset file, which adds energy bonuses to particular double-stranded nucleotides.
Default is to have no double-stranded offset specified.
-im, -IM, --IMaximum

Specify a maximum number of internally generated structures for each call of the dynamic programming algorithm. Note that suboptimal structures are generated until either the maximum number of structures is reached or the maximum percent difference is reached (below).  This is not the maximum number of structures provided to the user, which is controlled by –m, -M, --maximum.

Default is 100 structures.

-ip, -IP, --IPercent

Specify a maximum percent difference in folding free energy change for internally generated suboptimal structures for each call of the dynamic programming algorithm. For example, 20 would indicate 20%. This is not the maximum percent difference in energy for structures provided to the user, which is controlled by –p, -P, --percent.

Default is 20%.
-iw, -IW, --IWindow

Specify a window size for the internally generated suboptimal structures for each call of the dynamic programming algorithm.  This is not the window for structures provided to the user, which is controlled by –w, -W, --window.
Default is determined by the length of the sequence.

-m, -M, --maximum Specify a maximum number of structures. Note that suboptimal structures are generated until either the maximum number of structures are reached or the maximum percent difference is reached (below).
Default is 20 structures.
-p, -P, --percent Specify a maximum percent difference in folding free energy change for generating suboptimal structures. For example, 20 would indicate 20%.
Default is determined by the length of the sequence.
-p1, -P1, --Penalty1

Specify a pseudoknot penalty 1.
Default is 0.35 kcal/mol.

-p2, -P2, --Penalty2

Specify a pseudoknot penalty 2.
Default is 0.65 kcal/mol.

-ph, -PH, --PseudoknottedHelices

Specify maximum number of helices to be processed.
Default is 100 helices.

-sh, -SH, --SHAPE Specify a SHAPE data file to be used to generate restraints. These restraints specificially use SHAPE pseudoenergy restraints.
Default is no SHAPE data file specified.
-si, -SI, --SHAPEintercept Specify an intercept used with SHAPE restraints.
Default is -0.6 kcal/mol.
-sm, -SM, --SHAPEslope Specify a slope used with SHAPE restraints.
Default is 1.8 kcal/mol.
-sso, -SSO, --singleOffset Specify a single-stranded offset file, which adds energy bonuses to particular single-stranded nucleotides.
Default is to have no single-stranded offset specified.
-w, -W, --window Specify a window size.
Default is determined by the length of the sequence.
--warnings, --warn Set the behavior for non-critical warnings (e.g. related to invalid
nucleotide positions or duplicate data points in SHAPE data). Valid values
are:
* on -- Warnings are written to standard output. (default)
* err -- Warnings are sent to STDERR. This can be used in automated scripts
etc to detect problems.
* off -- Do not display warnings at all (not recommended).
-x, -X, --experimentalPairBonus Input text file with bonuses (in kcal) as a matrix. As with SHAPE, bonuses will be applied twice to internal base pairs, once to edge base pairs, and not at all to single stranded regions.
Default is no experimental pair bonus file specified.
-xo Specify an intercept (overall offset) to use with the 2D experimental pair bonus file.
Default is 0.0 (no change to input bonuses).
-xs Specify a number to multiply the experimental pair bonus matrix by.
Default is 1.0 (no change to input bonuses).

Notes about Constraints:

ShapeKnots is capable of predicting pseudoknots, but user-specified constraints (using -c parameter) are not able to contain pseudoknots. The constraints are used directly by the underlying dynamic programming algorithm (shared with the Fold program). If a pseudoknot needs to be included in the base pairs, the nucleotides in "pseudoknotting pairs" should be specified as single-stranded and then the base pairs subsequently added manually to ct file that is output. Other pairs can be constrained to occur. The "pseudoknotting pairs" can be the considered as the minimum number of pairs that need to be removed to have no pseudoknot, and RemovePsuedoknots can help with this determination.

References:

  1. Hajdin, C.E., Bellaousov, S., Higgins, W., Leonard, C.W., Mathews, D.H. and Weeks, K.M.
    “Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots.”
    Proc. Natl. Acad. Sci. U.S.A., In Press. (2013).
  2. Reuter, J.S. and Mathews, D.H.
    "RNAstructure: software for RNA secondary structure prediction and analysis."
    BMC Bioinformatics, 11:129. (2010).
  3. Deigan, K.E., Li, T.W., Mathews, D.H. and Weeks, K.M.
    "Accurate SHAPE-directed RNA structure determination."
    Proc. Natl. Acad. Sci. U.S.A., 106:97-102. (2009).
  4. Ren, J., Rastegari, B., Condon, A, and Hoos, H. H.
    "HotKnots: Heuristic prediction of RNA secondary structures including pseudoknots."
    RNA. 11: 1494-1504. (2005).