This page gives a summary of two preliminary studies into the use of GOLD to dock combinatorial libraries.
Method
A set of commercially available carboxylic acids was selected and converted to MOL2 format by Dr R C Glen (Tripos). Similarly, a set of amines was chosen. These may be regarded as two "monomer pools". Calculations were performed to determine whether GOLD might be used to select a subset of the acids and a subset of the amines, such that the resulting combinatorial library contains amides that are likely to be active.
The method chosen was to dock each of the acids and each of the amines into the protein of interest, and then determine which pairs of acids and amines were docked in such a way that an amide link could be formed between them in a reasonable geometry. If this were the case, it was considered that the resulting amide was likely to be an active compound.
In more detail, the procedure was as follows. A program was written to convert each acid to the corresponding methylcarbamoyl derivative (RCO2H -> RCONHMe) and each amine to the corresponding acetylamino derivative (R'NH2 -> R'NHCOMe). The resulting molecules were docked separately by GOLD into the active site of the enzyme lipase (PDB code 1CRL). Three dockings were performed on each monomer. However, if the first two solutions were within 1.0A RMSD, then the third solution was not generated. In view of the simplicity of most of the monomer structures, and in order to speed things up, a reduced GA parameterisation was used. Specifically, each GA run comprised 50000 operations only and the initial energy barriers were increased (this parameterisation has been verified on the standard GOLD test set of 100 PDB complexes). Finally, the methyl group terminating the amide bond of each docked molecule was deleted and replaced with a dummy atom.
A C program was written to compare each docked acid moiety with all the docked amine moieties. When it was possible to superimpose the linking amide-group atoms of a docked acid and a docked amine to within an RMSD of 1.5A, it was considered that the two components might form a product amide which could bind to the protein (the superposition included the dummy atom of each monomer and the corresponding carbon atom in the other monomer). A second screen was then applied to remove high energy compounds. Specifically, if the van der Waals energy of the product was greater than 500 kcal/mol, then the compound was rejected. After elimination of duplicates, this resulted in 129 "docked" products.
Results
| No of acids | 426 |
| No of amines | 105 |
| No of GOLD solutions for acids | 1088 |
| No of GOLD solutions for amines | 280 |
| Total no of possible superpositions | 304640 |
| No of superpositions found | 311 |
| No of compounds following energy screen | 237 |
| No of unique compounds. | 129 |
| CPU time to dock acids | 124h (5d 4h) |
| CPU time to dock amines | 33.7h (1d 9.7h) |
| CPU time for superimposition and energy screen | 1.5h |
| Total CPU time | 159h (6d 15h) |
All CPU timings are on an R4400 SGI Indigo II.
In order to test the success of this approach, the amides identified through the above superposition procedure were docked separately by GOLD, as complete molecules. The reduced GA parameterisation was used again, though 5 dockings were generated. The best of these was compared with the prediction obtained using the superposition method. When the discrepancy between the two exceeded 3.0A RMSD, the amide was docked again using the normal, full GA parameterisation to see if a better fit could be obtained.
This docking verification stage took 209 CPU hours.
The table below summarises the RMSD between the docked orientations obtained by joining together the separately docked monomers, and the orientations predicted by normal GOLD docking of the complete product amide:.
| RMS | Count | Cumulative total | Cumulative percent |
| under 0.5 | 1 | 1 | 0.78 |
| 0.5->1.0 | 39 | 40 | 31.01 |
| 1.0->1.5 | 34 | 74 | 57.36 |
| 1.5->2.0 | 18 | 92 | 71.32 |
| 2.0->2.5 | 8 | 100 | 77.52 |
| 2.5->3.0 | 13 | 113 | 87.60 |
| 3.0->3.5 | 6 | 119 | 92.25 |
| over 3.5 | 10 | 129 | 100.00 |
A similar set of calculations was performed on the same set of amines and a set of sulphonyl chlorides. As before, the "most active" product sulphonamides were predicted by: (i) docking the separate monomers; (ii) identifying pairs of amines and sulphonyl chlorides that were positioned in such a way that atoms of the sulphonamide linkage could be superimposed with an RMSD of 1.5A or less; (iii) eliminating "products" with high-energy conformations (> 500 kcal/mol).
Following the superposition stage, there were 369 pairs of sulphonyl chlorides and amines overlaid. After application of the energy screen, there were 348 compounds. Many of these were duplicates, obtained by using different solutions of the same monomer. These duplicates were removed by selecting the superposition whose monomer solutions had the largest combined GOLD score. Following this procedure, there were 190 unique superpositions (i.e. 190 sulphonamides predicted to be "active").
Results
| No of sulphonyl chlorides | 204 |
| No of amines | 105 |
| No of GOLD solutions for sulphonyl chlorides | 540 |
| No of GOLD solutions for amines | 280 |
| Total no of possible superpositions | 151200 |
| No of superpositions found | 369 |
| No of compounds following energy screen | 348 |
| No of unique compounds | 190 |
| CPU time to dock sulphonyl chlorides | 48.5h (2d 0.5h) |
| CPU time to dock amines | 39.5h (1d 15.5h) |
| CPU time for superimposition and energy screen | 1.5h |
| Total CPU time | 83.7h (3d 11.7h) |
As before, the sulphonamides predicted to bind by the above procedure were then docked separately by GOLD. This verification stage was done in two passes of independent dockings, as described above for the acids and amines.
The table below summarises the RMSD's between the docked positions obtained from the monomer superposition method and the position obtained by GOLD docking of the complete sulphonamide.
| RMS | Count | Cumulative total | Cumulative percent |
| under 0.5 | 3 | 3 | 1.58 |
| 0.5->1.0 | 25 | 28 | 14.74 |
| 1.0->1.5 | 29 | 57 | 30.00 |
| 1.5->2.0 | 40 | 97 | 51.05 |
| 2.0->2.5 | 27 | 124 | 65.26 |
| 2.5->3.0 | 21 | 145 | 76.32 |
| 3.0->3.5 | 10 | 155 | 81.58 |
| over 3.5 | 35 | 190 | 100.00 |
We can conclude that a combinatorial library could, in principle, be designed by using GOLD to dock the monomers rather than the complete library of products. This is computationally tractable and therefore a methodology worth further investigation. Obviously, the ultimate proof of the method requires synthesis and testing of the designed library - something not yet done.
The method works better on the amides rather than the sulphonamides; presumably, this is related to the torsional flexibility of the linkage.