Advanced parameters

General settings

Configure the settings of the simulation and the path to all dependencies in case of need (non-default installation).

  • usesrun: Use srun binary to run PELE. Only when using intel processors.

  • pele_exec: Use a pele executable that is not the default one. Needs to be used with pele_data and pele_documents. default: $PELE/bin/Pele_mpi

  • pele_data: Use a pele data folder that is not the default one. default: $PELE/Data

  • pele_documents: Use a pele documents folder that is not the default one. default: $PELE/Documents

  • singularity_exec: Use a singularity container that contains the pele executable. Cannot be used with pele_exec.

  • pele_license: Use a pele_license path that is not the default one. default: $PELE/licenses

  • schrodinger: Use a schrodinger path that is not the default one. default: $SCHRODINGER

usesrun: false
pele_exec: "/home/pele/bin/Pele_mpi"
pele_data: "/home/pele/Data/"
pele_documents: "/home/pele/Documents/"
singularity_exec: "/home/pele/pele_release.sif"
pele_license: "/home/pele/licenses"
schrodinger: "/home/pele/schrodinger2020-1/"

PELE general parameters

These flags are exclusive of the PELE modes not fragPELE

Advanced parameters of PELE:

  • log: Retrieve PELE logfiles during simulation. Default=False

  • verbose: Set to true to activate verbose mode in PELE. DEfault=False

  • anm_displacement: Angstrom to displace carbon alphas in each ANM movement. Default=0.75

  • anm_modes_change: Number of steps before we change to a new normal mode movement. Default=4

  • sidechain_res: Receptor sidechain resolution. Default=10

  • overlap_factor: Vanderwals overlap factor (More in PELE docs). Default=0.65

  • steric_trials: Number of steric trials (More in PELE docs). Default=250

  • sidechain_radius: Residues within specified radius from the ligand will be included in the side chain prediction protocol. Default=6

  • report: Change the name of the report file. Default: report

  • traj: Change the name of the trajectory file. Default: trajectory.pdb

log: true
verbose: true
anm_displacement: 0.5
anm_modes_change: 3
sidechain_res: 30
overlap_factor: 0.60
sidechain_radius: 8
steric_trials: 50
report: pele_report
traj: trajectory.xtc

Adaptive PELE parameters

Advanced parameters of Adaptive PELE:

  • density: Density type ([null, exitContinuous…]. More in AdaptivePELE docs). Default: null

density: "exitContinuous"

Preparation parameters

Configure the parameters of the PPP (Protein Pele Preparation)

  • skip_preprocess: Skip protein pele preparation. Default: False

  • noTERs: Don’t include TERs on preparation. Used if PPP gets confuse with insertion codes or other. Default: False

  • charge_ters: Charge terminals of the protein. Default: False

  • nonstandard: List of names of nonstandard residues that will be omitted in protein pele preparation. Default=[]

  • prepwizard: Run Prepwizard (Still on testing version). Default: False

preprocess_receptor: true
noTERs: false
charge_ters: false
nonstandard:
  - TPO
prepwizard: false

Residue parametrization

In order to run a simulation, PELE requires the following files for every non-standard molecule (i.e. any non-standard small molecule or residue):

  • IMPACT template: containing force field parameters, please refer to this site for further information.

  • rotamer library: optional file containing the list of rotatable bonds to sample by the side chain perturbation algorithm. If missing, the flexibility of the corresponding molecule will not considered. More information available here.

  • solvent template: some special solvents like “OBC” require extra parameters, which are set in this file. Mind that it is mandatory when using OpenFF, since it works with the OBC solvent only.

The platform currently has two implementations for building hetero molecule parameters - PlopRotTemp (soon to be deprecated) and Peleffy (PELE Force Field Yielder), which offers more functionality but is still in beta testing.

Please refer to the following table for the comparison of the two methods and available forcefields:

Builder

Forcefields

Solvents

Charge parametrization methods

PlopRotTemp

(default)

“OPLS2005”

“OBC”

“VDGBNP”

“OPLS2005”

Peleffy

(beta)

“OPLS2005” (default)

“openff-1.3.0”

“openff-1.2.1”

“openff-1.2.0”

“openff-1.1.1”

“openff-1.1.0”

“openff-1.0.1”

“openff-1.0.0”

“OBC”

“VDGBNP”

“gasteiger”

“am1bcc” (default for OpenFF)

“OPLS2005” (default for OPLS2005)

PlopRotTemp

To continue using PlopRotTemp, you do not need to make any changes to your YAML file, previously existing flags are still available:

  • gridres: Resolution of the rotamers when sampling them by the Side Chain prediction algorithm. Default=10 degrees

  • core: List of PDB atom names that will be included as part of the rigid core. In case it is not specified, the algorithm will pick up a set of non-rotatable atoms centered in the molecular structure. Default=None

  • exclude_terminal_rotamers: Exclude terminal rotamers during parametrization of non standard molecules if they belong to a small terminal group. Default=True

  • mae_lig: External MAE file with quantum charges generated with Schrödinger suite. When supplied, any charge calculated internally in the platform will be replaced by the charges from this file. Default=None

  • maxtorsion: Maximum number of rotamers per flexible side chain. Default=4

  • n: Maximum number of flexible side chains in a molecule. Default=None

solvent: "OBC"
maxtorsion: 4
n: 5
mae_lig: "/home/dsoler/lig.mae"
gridres: 10

Peleffy

In order to use Peleffy instead of PlopRotTemp, you need to set use_peleffy: true in input YAML.

You can use the following parameters to control the way peleffy will parametrize non-standard molecules for you:

  • forcefield: Forcefield used to parametrize hetero molecules, you can use one of:

    • “OPLS2005” (default)

    • “openff-1.3.0”

    • “openff-1.2.1”

    • “openff-1.2.0”

    • “openff-1.1.1”

    • “openff-1.1.0”

    • “openff-1.0.1”

    • “openff-1.0.0”

  • charge_parametrization_method: The method to use to assign partial charges to atoms:

    • “gasteiger”

    • “am1bcc” (default when using any “OpenFF” force field)

    • “OPLS2005” (default when using “OPLS2005”)

  • use_peleffy: You have to set it to True to use peleffy instead of the default parameters builder. Default=False

  • gridres: Resolution of the rotamers when sampling them by the Side Chain prediction algorithm. Default=10 degrees

  • core: List of PDB atom names that will be included as part of the rigid core. In case it is not specified, the algorithm will pick up a set of non-rotatable atoms centered in the molecular structure. Default=None

  • exclude_terminal_rotamers: Exclude terminal rotamers during parametrization of non standard molecules if they belong to a small terminal group. Default=True

  • mae_lig: External MAE file with quantum charges generated with Schrödinger suite. When supplied, any charge calculated internally in the platform will be replaced by the charges from this file. Default=None

Important: Peleffy requires CONECT lines in the PDB file, otherwise they are automatically added with Schrödinger Protein Preparation Wizard.

use_peleffy: true
charge_parametrization_method: "gasteiger"
forcefield: "openff-1.3.0"
gridres: 20
core:
    - "O1"
    - "C1"
    - "C2"
    - "N1"

Ligand parameters

Advanced parameters for the conformation perturbation algorithm:

  • overlap_factor_conformation: van der Waals overlap factor in conformation perturbation. Default = 0.65

overlap_factor_conformation: 0.60

Constraint parameters

Alternatively to constraint levels, advanced users can manipulate the constraint parameters individually at their own risk, using the following flags:

  • terminal_constr - sets the spring constant for the terminal C-alpha constraints, default = 5 kcal/mol

  • ca_constr - sets the spring constant for the remaining C-alphas in the backbone, default = 0.5 kcal/mol

  • ca_interval - interval at which the backbone C-alphas should be constrained, default = 10 (i.e. every 10 residues).

Take into account that specific modifiers of constraint parameters will prevail over the settings coming from the constraints levels and those predefined in each package.

Other advanced parameters related with constraints:

  • water_constr: Water constraints. Default=5

  • constrain_smiles: SMILES string to indicate what part of the molecule to constrain. Default=None

  • external_constraints: You can specify 2 types of constraints: positional constraints or atom-atom constraints, e.g.

    • The positional constraints are given either by:
      • springConstant-atomnumber. i.e. “10-17”

      • springConstant-chain:resnum:atomname. i.e. “5-A:1:H”

    • The atom-atom constraints are specified either by:
      • springConstant-equilibriumDistance-atomnumber1-atomnumber2. i.e. “50-2.34-17-4159”

      • springConstant-equilibriumDistance-chain1:resnum1:atomname1-chain2:resnum2:atomname2. i.e. “50-2.34-A:1:H-L:1:C21”

  • remove_constraints: Do not place constraints on the carbon-alpha of the protein. Default: False

terminal_constr: 10.5
ca_constr: 6.0
ca_interval: 3
water_constr: 5
constrain_smiles: "C2CCC1CCCCC1C2"
smiles_constr: 5
external_constraints:
- "10-17" #constrain of 10kcal/mol at atomnumber 17
- "5-A:1:H" ##constrain of 10kcal/mol at atom with chain A residuenumber 1 and atomname H
- "50-2.34-17-4159" #constrain of 50kcal/mol with equilibrium distance of 2.34 between atomnumbers 17 & 4159
- "50-2.34-A:1:H-L:1:C21" #constrain of 50kcal/mol with equilibrium distance of 2.34 between atoms with respective chain resnum and atomname
remove_constraints: true

Metal constraints

Algorithm to automatically set metal constraints around the ligand.

  • no_metal_constraints: Ignore all metals in the PDB file, no constraints will be set automatically. Default=False

  • permissive_metal_constr: Expand the search for coordinated atoms by allowing 35% deviation from “ideal” angles. If the algorithm finds a valid geometry it will include the metal constraint into the simulation. Default=False

  • constrain_all_metals: Constrain all atoms around the metal, regardless of the angles or coordination number. Default=False

  • external_constraints: Set a manual constraint containing a metal atom to disable search for this particular metal. Default=[]

no_metal_constraints: true
permissive_metal_constr: true
constrain_all_metals: true
external_constraints:
    - "50-2.34-A:1:H-L:1:MG" #constrain of 50kcal/mol with equilibrium distance of 2.34 between atoms with respective chain resnum and atomname
constrain_core: "CN(C)C(=O)c1ccc(F)cc1"  # SMILES or SMARTS pattern
constrain_core_spring: 30  # optional, default 50.0

aquaPELE parameters

Other advanced parameters to set up aquaPELE. Its usage is discouraged.

  • water_trials: Numerical trials on water perturbation. Default=10000

  • water_constr: COM constrain applied to th water molecule after perturbation. Default=0

  • water_overlap: Overlap factor of water. Default=0.78

water_trials: 500
water_constr: 0.5
water_overlap: 0.5

Analysis parameters

Advanced parameters of Analysis package.

  • analysis_nclust: Numbers of clusters out of the simulation, if using the gaussianmixture clustering method. Default: 10

  • be_column: Column of the binding energy in the reports starting by 1. Default: 5

  • te_column: Column of the total energy in the reports starting by 1. Default: 4

  • limit_column: Specify the column where your external metrics start. Default: 6

  • mae: To extract the best energy and cluster poses as .mae files with the metrics as properties (schrodinger need it). Default: false

  • clustering_method: If you want to override the default clustering method (meanshift), you can set this flag to gaussianmixture or HDBSCAN.

  • clustering_filtering_threshold: Percentage of output structures to filter our before clustering. Default = 0.25.

  • plot_filtering_threshold: Percentage of output structures to filter out before creating plots. Default = 0.02

analysis_nclust: 12
be_column: 5
te_column: 4
limit_column: 6
mae: true
clustering_method: "gaussianmixture"
clustering_filtering_threshold: 0.1
plot_filtering_threshold: 0.1

Note

In case of the meanshift algorithm, the bandwidth refers to the maximum RMSD allowed within the cluster, whereas in HDBSCAN to distances between your data points.

Output

Configure the output

  • output: Output folder of the simulation. Default=output

output: "output_sim"