Molecular modeling‎ > ‎docking‎ > ‎


Docking (molecular)

From Wikipedia, the free encyclopedia

Docking glossary
• Receptor or host – The "receiving" molecule, most commonly a protein or otherbiopolymer.
• Ligand or guest – The complementary partner molecule which binds to the receptor. Ligands are most often small molecules but could also be another biopolymer.
• Docking – Computational simulation of a candidate ligand binding to a receptor.
• Binding mode – The orientation of the ligand relative to the receptor as well as theconformation of the ligand and receptor when bound to each other.
• Pose – A candidate binding mode.
• Scoring – The process of evaluating a particular pose by counting the number of favorable intermolecular interactions such as hydrogen bonds and hydrophobic contacts.
• Ranking – The process of classifying which ligands are most likely to interact favorably to a particular receptor based on the predicted free-energy of binding.
Schematic diagram illustrating the docking of a small molecule ligand (brown) to a protein receptor(green) to produce a complex.
Small molecule docked to a protein.

In the field of molecular modeling,docking is a method which predicts the preferred orientation of one molecule to a second when bound to each other to form a stable complex.[1]Knowledge of the preferred orientation in turn may be used to predict the strength of association or binding affinity between two molecules using for example scoring functions.

The associations between biologically relevant molecules such as proteins,nucleic acidscarbohydrates, and lipidsplay a central role in signal transduction. Furthermore, the relative orientation of the two interacting partners may affect the type of signal produced (e.g., agonism vsantagonism). Therefore docking is useful for predicting both the strength and type of signal produced.

Docking is frequently used to predict the binding orientation of small molecule drug candidates to their protein targets in order to in turn predict the affinity and activity of the small molecule. Hence docking plays an important role in the rational design of drugs.[2] Given the biological andpharmaceutical significance of molecular docking, considerable efforts have been directed towards improving the methods used to predict docking .



[edit]Definition of problem

Molecular docking can be thought of as a problem of “lock-and-key”, where one is interested in finding the correct relative orientation of the “key” which will open up the “lock” (where on the surface of the lock is the key hole, which direction to turn the key after it is inserted, etc.). Here, the protein can be thought of as the “lock” and the ligand can be thought of as a “key”. Molecular docking may be defined as an optimization problem, which would describe the “best-fit” orientation of a ligand that binds to a particular protein of interest. However since both the ligand and the protein are flexible, a “hand-in-glove” analogy is more appropriate than “lock-and-key”.[3] During the course of the process, the ligand and the protein adjust their conformation to achieve an overall “best-fit” and this kind of conformational adjustments resulting in the overall binding is referred to as “induced-fit”.[4]

The focus of molecular docking is to computationally stimulate the molecular recognition process. The aim of molecular docking is to achieve an optimized conformation for both the protein and ligand and relative orientation between protein and ligand such that thefree energy of the overall system is minimized.

[edit]Docking approaches

Two approaches are particularly popular within the molecular docking community. One approach uses a matching technique that describes the protein and the ligand as complementary surfaces.[5][6] The second approach simulates the actual docking process in which the ligand-protein pairwise interaction energies are calculated.[7] Both approaches have significant advantages as well as some limitations. These are outlined below.

[edit]Shape complementarity

Geometric matching/ shape complementarity methods describe the protein and ligand as a set of features that make them dockable.[8] These features may include molecular surfacecomplementary surface descriptors. In this case, the receptor’s molecular surface is described in terms of its solvent-accessible surface area and the ligand’s molecular surface is described in terms of its matching surface description. The complementarity between the two surfaces amounts to the shape matching description that may help finding the complementary pose of docking the target and the ligand molecules. Another approach is to describe the hydrophobic features of the protein using turns in the main-chain atoms. Yet another approach is to use a Fourier shape descriptor technique.[9][10][11] Whereas the shape complementarity based approaches are typically fast and robust, they cannot usually model the movements or dynamic changes in the ligand/ protein conformations accurately, although recent developments allow these methods to investigate ligand flexibility. Shape complementarity methods can quickly scan through several thousand ligands in a matter of seconds and actually figure out whether they can bind at the protein’s active site, and are usually scalable to even protein-protein interactions. They are also much more amenable to pharmacophore based approaches, since they use geometric descriptions of the ligands to find optimal binding.


The simulation of the docking process as such is a much more complicated process. In this approach, the protein and the ligand are separated by some physical distance, and the ligand finds its position into the protein’s active site after a certain number of “moves” in its conformational space. The moves incorporate rigid body transformations such as translations and rotations, as well as internal changes to the ligand’s structure including torsion angle rotations. Each of these moves in the conformation space of the ligand induces a total energetic cost of the system, and hence after every move the total energy of the system is calculated. The obvious advantage of the method is that it is more amenable to incorporate ligand flexibility into its modeling whereas shape complementarity techniques have to use some ingenious methods to incorporate flexibility in ligands. Another advantage is that the process is physically closer to what happens in reality, when the protein and ligand approach each other after molecular recognition. A clear disadvantage of this technique is that it takes longer time to evaluate the optimal pose of binding since they have to explore a rather large energy landscape. However grid-based techniques as well as fast optimization methods have significantly ameliorated these problems.

[edit]Mechanics of docking

To perform a docking screen, the first requirement is a structure of the protein of interest. Usually the structure has been determined using a biophysical technique such as x-ray crystallography, or less often, NMR spectroscopy. This protein structure and a database of potential ligands serve as inputs to a docking program. The success of a docking program depends on two components: the search algorithm and the scoring function.

[edit]Search algorithm

The search space consists of all possible orientations and conformations of the protein paired with the ligand. With present computing resources, it is impossible to exhaustively explore the search space—this would involve enumerating all possible distortions of each molecule (molecules are dynamic and exist in an ensemble of conformational states) and all possible rotationaland translational orientations of the ligand relative to the protein at a given level of granularity. Most docking programs in use account for a flexible ligand, and several are attempting to model a flexible protein receptor. Each "snapshot" of the pair is referred to as apose. There are many strategies for sampling the search space. Here are some examples:

  • Use a coarse-grained molecular dynamics simulation to propose energetically reasonable poses
  • Use a "linear combination" of multiple structures determined for the same protein to emulate receptor flexibility
  • Use a genetic algorithm to "evolve" new poses that are successively more and more likely to represent favorable binding interactions.

[edit]Scoring function

The scoring function takes a pose as input and returns a number indicating the likelihood that the pose represents a favorable binding interaction.

Most scoring functions are physics-based molecular mechanics force fields that estimate the energy of the pose; a low (negative) energy indicates a stable system and thus a likely binding interaction. An alternative approach is to derive a statistical potential for interactions from a large database of protein-ligand complexes, such as the Protein Data Bank, and evaluate the fit of the pose according to this inferred potential.

There are a large number of structures from X-ray crystallography for complexes between proteins and high affinity ligands, but comparatively fewer for low affinity ligands as the later complexes tend to be less stable and therefore more difficult to crystallize. Scoring functions trained with this data can dock high affinity ligands correctly, but they will also give plausible docked conformations for ligands that do not bind. This gives a large number of false positive hits, i.e., ligands predicted to bind to the protein that actually don't when placed together in a test tube.

One way to reduce the number of false positives is to recalculate the energy of the top scoring poses using (potentially) more accurate but computationally more intensive techniques such as Generalized Born or Poisson-Boltzmann methods.[7]


A binding interaction between a small molecule ligand and an enzyme protein may result in activation or inhibition of the enzyme. If the protein is a receptor, ligand binding may result in agonism or antagonism. Docking is most commonly used in the field of drug design — most drugs are small organic molecules, and docking may be applied to:

  • hit identification – docking combined with a scoring function can be used to quickly screen large databases of potential drugs in silico to identify molecules that are likely to bind to protein target of interest (see virtual screening).
  • lead optimization – docking can be used to predict in where and in which relative orientation a ligand binds to a protein (also referred to as the binding mode or pose). This information may in turn be used to design more potent and selective analogs.
  • Bioremediation – Protein ligand docking can also be used to predict pollutants that can be degraded by enzymes.[12]

[edit]See also


  1. ^ Lengauer T, Rarey M (1996). "Computational methods for biomolecular docking". Curr. Opin. Struct. Biol. 6 (3): 402–6. doi:10.1016/S0959-440X(96)80061-3PMID 8804827.
  2. ^ Kitchen DB, Decornez H, Furr JR, Bajorath J (2004). "Docking and scoring in virtual screening for drug discovery: methods and applications". Nature reviews. Drug discovery 3 (11): 935–49. doi:10.1038/nrd1549PMID 15520816.
  3. ^ Jorgensen WL (1991). "Rusting of the lock and key model for protein-ligand binding". Science 254 (5034): 954–5. doi:10.1126/science.1719636.PMID 1719636.
  4. ^ Wei BQ, Weaver LH, Ferrari AM, Matthews BW, Shoichet BK (2004). "Testing a flexible-receptor docking algorithm in a model binding site". J. Mol. Biol. 337 (5): 1161–82. doi:10.1016/j.jmb.2004.02.015PMID 15046985.
  5. ^ Meng EC, Shoichet BK, Kuntz ID (2004). "Automated docking with grid-based energy evaluation". Journal of Computational Chemistry 13 (4): 505–524. doi:10.1002/jcc.540130412.
  6. ^ Morris GM, Goodsell DS, Halliday RS, Huey R, Hart WE, Belew RK, Olson AJ (1998). "Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function". Journal of Computational Chemistry 19 (14): 1639–1662. doi:10.1002/(SICI)1096-987X(19981115)19:14<1639::AID-JCC10>3.0.CO;2-B.
  7. a b Feig M, Onufriev A, Lee MS, Im W, Case DA, Brooks CL (2004). "Performance comparison of generalized born and Poisson methods in the calculation of electrostatic solvation energies for protein structures". Journal of Computational Chemistry 25 (2): 265–84. doi:10.1002/jcc.10378.PMID 14648625.
  8. ^ Shoichet BK, Kuntz ID, Bodian DL (2004). "Molecular docking using shape descriptors". Journal of Computational Chemistry 13 (3): 380–397.doi:10.1002/jcc.540130311.
  9. ^ Cai W, Shao X, Maigret B (January 2002). "Protein-ligand recognition using spherical harmonic molecular surfaces: towards a fast and efficient filter for large virtual throughput screening". J. Mol. Graph. Model. 20 (4): 313–28. doi:10.1016/S1093-3263(01)00134-6PMID 11858640.
  10. ^ Morris RJ, Najmanovich RJ, Kahraman A, Thornton JM (May 2005). "Real spherical harmonic expansion coefficients as 3D shape descriptors for protein binding pocket and ligand comparisons". Bioinformatics 21 (10): 2347–55. doi:10.1093/bioinformatics/bti337PMID 15728116.
  11. ^ Kahraman A, Morris RJ, Laskowski RA, Thornton JM (April 2007). "Shape variation in protein binding pockets and their ligands". J. Mol. Biol. 368 (1): 283–301. doi:10.1016/j.jmb.2007.01.086PMID 17337005.
  12. ^ Suresh PS, Kumar A, Kumar R, Singh VP (January 2008). "An in silico [correction of insilico] approach to bioremediation: laccase as a case study". J. Mol. Graph. Model. 26 (5): 845–9. doi:10.1016/j.jmgm.2007.05.005PMID 17606396.

[edit]External links