Challenge #6

Hit Identification
Method type (check all that applies)
High-throughput docking
Physics-based
Hybrid of the above
Combination of cheminformatics, physics-based methods, pharmacophore screening, high-throughput docking, molecular
Description of your approach (min 200 and max 800 words)

We will use our joint/combined expertise in cheminformatics, molecular dynamics (MD), structure-based drug design (SBDD), pharmacophore modeling, and medicinal chemistry to generate hits for multiple subcavities of the histone binding groove of the SETDB1 triple Tudor domain (TTD). In a first step, we will use the crystal structures of the SETDB1 TTD provided by the CACHE committee (PDB IDs: 7CJT, 8UWP, 6AU3) and available ligand data to benchmark our state-of-the art docking program (published in scientific literature, proprietary code).[1] Whereas our benchmarking study will act primarily as a sanity check, it will also inform 1) which key amino acids are engaged by the co-crystallized ligands and 2) the development of multiple pharmacophoric models (one or more for each subcavity considered). These models will incorporate both protein and ligand data (i.e., co-crystallized ligand conformation, where available). We will then curate a library of commercially available compounds to screen against our pharmacophoric models. We will start from the Enamine REAL database (either full database or Diversity Set). Irrespective of the chosen starting point, we will filter our library according to Lipinski Ro5 and Veber Ro3 rules using our drug discovery platform (published in scientific literature, proprietary code).[1] To further tailor the library for screening, we will include additional filters like 1) removal of PAINS and reactive groups (i.e., covalent warheads), 2) inclusion of only high QED compounds (> 0.75), 3) removal of any insoluble compounds (predictions made with Simulation-Plus), 4) the Lilly med chem rules, 5) synthetic feasibility constraints (compounds in-stock or easy/medium synthesis), 6) principles outlined in the CACHE white paper, especially fraction of sp3 carbons and logD (pKa computed with our proprietary software) values.[2] For this tailored library, we will generate 25 conformers/compound for pharmacophore screening (based on prior successful in-house benchmark). We will then screen the ligand conformers against the pharmacophore models developed in the first step with our in-house pharmacophore screening tool. Only compounds with a root mean square deviation (RMSD) under a certain threshold (i.e., < 1.0Å) to the models will be kept for further analysis. The compounds that pass the initial pharmacophoric filtering will then be virtually screened with our docking program against the crystal structures of each subcavity we consider. We envision docking at most 250,000 compounds/subcavity. If the number of compounds passing the pharmacophoric filtering is > 250,000, we will cluster them to ensure diversity. Once the docking is completed, we will filter out those binding poses that do not interact strongly with the key amino acids identified during the benchmarking using our in-house protein analysis platform. Then, we will rank the remaining compounds by docking score and select up to 1000 compounds/subcavity for visual analysis. Using our combined/diverse expertise, we will form "hit picking parties" where we will visually assess the binding poses using the following criteria: 1) overall fit of the ligands in the subcavities, 2) relative ligand strain, and 3) number and type of fulfilled and unfulfilled interactions (i.e., hydrogen bonding, π-stacking, hydrophobic contacts, etc.). We will then assign visual scores to these poses, and up to 100 compounds/subcavity with the best visual scores will be chosen for the final refinement step. In this final step, we will use the λ-dynamics method [3] integrated in ATOMFORGE (proprietary software) to calculate the relative binding free energy of these compounds. λ-dynamics is an alchemical method that can simulate the relative binding free energy of multiple ligands in one single simulated system. The λ-dynamics will, on top of the docking and visual results, collect entropic and free energy landscape information from an MD simulation that dynamically samples an ensemble of SETDB1 TTD conformations and different chimera states of the ligands. The ligands will then be ranked based on their relative binding free energy obtained from the λ-dynamics simulation. The top ranked 150 compounds from this refinement step will be proposed for purchase and testing.


References:
1. Moitessier, N. et al., Acc. Chem. Res., 2016, 49(9), 1646-1657.
2. Ackloo, S. et al., Nat. Rev. Chem., 2022, 6, 287-295.
3. Knight, J.L., and Brooks, C.L. 3rd, J. Comput. Chem., 2009, 30, 1692-1700.

What makes your approach stand out from the community? (<100 words)

Leveraging our combined know-how of drug discovery, medicinal chemistry, structural biology, cheminformatics, computer science, molecular dynamics, and large-scale computations, we are uniquely situated to tackle this challenge. The synergy of our technologies and ability to pool our resources (i.e., in-house supercomputing capabilities) allows us to explore scientific avenues that neither team would be able to explore on their own. We believe that this collaborative approach will provide valuable insights to this challenge and will enable us to identify high quality hits for SETDB1. Additionally, in line with our participation in Challenges 1-5, we are committed to sharing our findings with the biotech community in the form of an open
access manuscript.

Commercial software packages used

   FORECASTER (proprietary software); ATOMFORGE (proprietary software)

Free software packages used

RDKit, OpenBabel, DataWarrior, BIOVIA Discovery Studio

Relevant publications of previous uses by your group of this software/method

Moitessier, N. et al., Acc. Chem. Res., 2016, 49(9), 1646-1657.
Therrien, E. et al., J. Chem. Inf. Model., 2012, 52(1), 210-224.
Labarre, A. et al., J. Chem. Inf. Model., 2022, 62(4), 1061-1077.
Burai-Patrascu, M. et al., ChemRxiv, 2022, https://doi.org/10.26434/chemrxiv-2022-ncqsj-v2
Nivedha, A.K. et al., ChemRxiv, 2023, https://doi.org/10.26434/chemrxiv-2023-5g38r
Kumar, S. B.V.S. et al., ChemRxiv, 2024, https://doi.org/10.26434/chemrxiv-2024-rw78s