- Open Access
In-silico activity prediction, structure-based drug design, molecular docking and pharmacokinetic studies of selected quinazoline derivatives for their antiproliferative activity against triple negative breast cancer (MDA-MB231) cell line
Bulletin of the National Research Centre volume 46, Article number: 2 (2022)
Cancer is a major health threat especially in unindustrialized nations. It surpasses coronary diseases and takes the number one killer position as a result of different global wide influences. Among many breast cancer substrates, triple-negative breast cancer (TNBC) is particularly devastating because it rapidly metastasize to other parts of the body, with a high risk of earlier recession and mortality.
In this research work, four (4) quantitative structure activity relationship (QSAR) models were developed using a series of quinazoline derivatives with activities against triple negative breast cancer cell line (MDA-MB231), model 1 was selected due to its statistical fitness with the following validation parameters: R2 = 0.875, Q2 = 0.837, R2 − Q2 = 0.038, Next test set = 5, and R2ext = 0.655. Molecular docking studies was performed for the quinazoline series as well as the reference drug (Gefitinib) and the active site of the epidermal growth factor receptor (EGFR) (pdb id = 3ug2). Eight compounds (6, 10, 13, 16, 17, 18, 19 and 20) were observed to have better docking score docking scores relative to Gefitinib. Compound number nineteen from the training set (pred pIC50 = 5.67, Residual = − 0.04 and MolDock score = − 123.238) was identified as the best compound since it has the best Moldock score and was excellently predicted by the selected model with least residual value, Hence was adopted as template for the design of Ten (10) new novel compounds with better activities and better docking scores. The inhibitive activities of the designed compounds were predicted by the selected model and most of them possess an improved activity relative to the template compound (19). The designed compounds were also redocked on to active pocket of the EGFR receptor and it was observed that they displayed better docking scores compared to the Template and the reference drug (Gefitinib) utilized in the design. Furthermore, the designed compounds were subjected to ADMET and drug-likeness studies using SWISSADME and pkCSM online web tools and they were observed to be pharmacologically active, easily synthesized and do not violate the Lipinski’s rule of five.
Hence, the designed compounds can be employed as inhibitors of MDA-MB231 cell line after passing through in vivo and in vitro evaluation.
Cancer is a challenging problem for the global health community, and its increasing burden necessitates seeking novel and alternatives therapies (Rajabi et al. 2021). It takes the number one killer position as a result of different global wide influences. Although considerable progress were made in the chemotherapeutic remedy of some victims, the unrelenting obligation to the difficult task of detecting new anti-cancer drugs is still crucial. Breast cancer is the most predominant class of cancer diagnosed in females around the globe, with an incidence that intensifies vividly with age. Among many breast cancer substrates, triple-negative breast cancer (TNBC) is particularly devastating because it rapidly disperse to other parts of the body, with a high risk of earlier recession and mortality (Hu et al. 2012). Annually, at least one million females are identified with breast tumor and TNBC is accountable for close to 15–20% of the complete breast cancer identified (Jo et al. 2019).
The epidermal growth factor receptor (EGFR) plays a crucial part in the control of cell growth and is regarded as one of the most seriously evaluated tyrosine kinase’s (TK) target inhibitors (El-Azab et al. 2010). Numerous TKs had a vital functions in cell propagation, division, metastasis and endurance, besides their uncontrolled triggering via processes such as point mutations leads to a substantial proportion of clinical cancers. EGFR is over expressed in numerous tumors, such as brain, lung, bladder, ovarian, colon, breast, head, and prostate tumors (Tiwari et al. 2015).
Components of the erbB class of EGFR-TKs, which comprise of erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4), are overexpressed in a substantial ratio of human tumors, and this is attributed to the miserable prognosis of the malady (Chandregowda et al. 2009). Hence, inhibitors of erbB1 and erbB2 were acknowledged as possible anticancer drugs (Hynes and Lane 2005).
Extermination of cancer cells without causing damage on other normal tissues or cells is the main purpose of anti-cancer drugs. However, the fact that some of these drugs usually destroys some other normal cells and the resistance to these drugs experienced by some patients during early period of treatment necessitates the global search for identifying new higher quality drugs that are safe for the prevention and remedy of cancer (Al-Suwaidan et al. 2016). Immediate recognition, understanding of the cause and pathway of this disorder, and improvement in remedy have played a pivotal part in curtailing breast cancer mortality rates over the past few years. Chemotherapy is still the central key to thorough therapy since it can exterminate tumor cells rapidly in the human system (Kaplan 2013).
In-silico approach of drug discovery have proven to accelerate the drug discovery process, as it lessen the time taken, resources and it enables the estimation of properties of new molecules such as toxicity and efficiency even before their synthesis. A mathematical relations that are able to establish a quantitative relationship between biological activities of a molecules and their molecular structures in form of linear equation is called Quantitative Structure–Activity Relationships (QSAR) (Abdullahi et al. 2021). Efficiency and safety of the drug to the system are the two major causes leading to drug failure. Therefore, it is compulsory to find potent molecules with better ADMET properties “drug-likeliness” (Lawal et al. 2021).
This research is mainly purposed in utilizing QSAR approach to compute the inhibitory activities of a series of quinazoline derivatives against MDA-MB231 breast cancer cell line, perform molecular docking studies to understand the nature of interaction between the compounds and the EGFR protein receptor, design new potent compounds based on their docking scores and examine their ADMET and drug likeness properties.
Data sets retrieval
A series of 23 quinazoline derivatives with inhibitory activities (IC50 in µg/ml) against Triple Negative Breast cancer cell line MDA-MB231 are retrieved from Abuelizz et al. (2017). The inhibitory activities were linearized by taking their negative logarithm to base 10 as shown in Eq. 1.
Chemical structure of the quinazoline analogs as well as their respective inhibitive capacity at 50% concentration (pIC50) are presented in Table 1.
Calculation of molecular descriptors
2D structures of the quinazoline analogs were sketched by utilizing Chemdraw version 16.0 and they were transformed to 3D format using Spartan 14 software. Molecular mechanics force field were employed to clean the 3D structures to eliminate all strain from the structure of the molecule as well as guaranteeing a well-defined conformer relationship within the compounds (Viswanadhan et al. 1989). Density Functional Theory (DFT) quantum mechanical calculation was employed for the geometry optimization using B3LYP/631G* basis set. The optimized structures were saved in Spatial Document File (SDF) format and then exported to PADEL descriptor calculation software to compute the molecular descriptors (Amin and Gayen 2016).
Data set partitioning
The data set were partitioned into two separate set: Modeling set (training set) and external validation set (test set). The modeling set consist of eighteen (18) compounds while the external validation set is made up of five (5) compounds. Models are built using the modeling set while the predictive ability of the built model was ascertained using the external validation set (Tropsha et al. 2003). This splitting certifies that an analogous standard can be engaged to predict the activity of the test set. Kennard–Stone Algorithm was applied for dividing dataset into a modeling and test set (Kennard and Stone 1969).
QSAR model building and external validation
The most important aspect of QSAR studies is the designation and sampling of descriptors that offers an ultimate information in activity disparities and have minimal co-linearity. Hence, genetic function algorithm (GFA) progresses the model accurateness while selecting relevant molecular descriptors (Leardi 1996). Multi linear regression (MLR) was utilized on the model building set to express the mathematical relations between the depending variable A (pIC50) and independent variable B (molecular descriptors). An exceptional feature of GFA algorithm is that it is able to generate multiple models rather than single model. Validation parameters that provides a guide in selecting the best QSAR model include correlation coefficient (R2), adjusted R2 (R2adj), cross-validation coefficient (Q2cv) and correlation coefficient of the external validation set (R2ext), all are expressed in Eqs. (2, 3, 4 and 5) respectively.
where P is the number of independent variables in the model and N is the sample size. Yexp, Ypred, and Ymtraining are the experimental activity, the predicted activity, and the mean experimental activity of the compounds in the modeling set, respectively (Tropsha et al. 2003). The least recommended values for these parameters are shown in Table 2.
In order to assess the robustness of the model and to affirm that the model was not obtained by chance correlation Y-randomization was performed on the model building set data (Tropsha et al. 2003). A new QSAR model was generated using the descriptor matrix by shuffling the activity matrix randomly. A built QSAR model is robust and reliable only when it has low values of R2 and Q2 for numerous trials. Another validation parameter is the coefficient of determination for Y-randomization cR2p, and it should exceed 0.5 for passing this test as in Eq. 6
cR2p is coefficient of determination for Y-randomization, R is the coefficient of determination for Y-randomization and Rr is average ‘R’ of random models.
Molecular docking studies
Ligand–Protein molecular docking studies was performed on all the quinazoline derivatives to study the nature of interactions between active pocket of the EGFR protein receptor and the ligands on HP laptop equipped with a dual-core Intel (R) PENTIUM (R) B940 CPU processor running at 2.0 GHz and 4.0 GB of RAM running on Windows 8 using Molegro Virtual Docker (MVD) software and Discovery studio.
The least energy optimized structures was saved in pdb file format prior to docking studies (Abdullahi et al. 2021).
Protein retrieval and Preparation
3D X-ray crystallized structure of the EGFR protein receptor (pdb id = 3ug2) was obtained from the protein data bank (https://www.rcsb.org/), and was prepared on the MVD workspace by eliminating water molecules and co-crystallized ligand enclosed in the crystal structure. The amino acid residues with structural error were repaired/rebuilt. The Fully prepared protein structure was also save in pdb format prior to docking process.
Docking of the ligands and receptor using molegro virtual docker (MVD)
Due to its ability to produce a better and consistent results relative to other docking softwares, Molegro Virtual Docker software program was utilized for the docking study in this research. Before the start of the process, the prepared protein was exported from its folder to the MVD work space, cavities were detected and surface was created. The active pocket of the EGFR receptor was anticipated and was set inside a regulated sphere of X: 0.91 Y: 50.08 and Z: 22.54 with 15 Å radius respectively. The prepared ligands were imported into the MVD work space and docking process was executed using a grid resolution of 0.30 Å. The Root Mean Square Deviation (RMSD) threshold was set as 2.00 Å for the multiple clusters poses with 100.00 energy penalty values. The docking algorithm was set for a maximum of 1500 iteration with a maximum population size of 50. The docking simulation was run for a minimum of 50 times for the 10 poses, and the best poses were determined based on the MolDock score scoring function (Thomsen and Christensen 2006).
Hydrogen bond, Hydrophobic, alkyl, pi-alkyl, Halo bonds and aryl intermolecular interactions were viewed with the aid of Discovery studio software.
ADMET and drug-likeness properties
pkCSM ((http:// structure. bioc.cam. ac.uk/pkcsm), and SwissADME (http:// www. swiss adme.ch/ index. php) are free and easily accessible online web sites that are utilized to explore the ADMET and medicinal properties of tiny molecules. The ADMET and Drug-resembleness properties of the compounds are obtained from these sites in the current research. At the pre-clinical phase of drug invention, the most crucial parameter is the Lipinski’s rule of five (Abdullahi et al. 2021), and it states that any molecule that violates more than two (2) of the following criteria is not easily permeable or readily absorbed into the body system. The criteria are; MW ˂ 500, HBD ˂ 5, HBA ˂ 10, Log p ˂ 5 and PSA (PSA) ˂140 A2.
Structure based drug design
The design of drugs based on structures is also called the direct drug design, a very important, powerful and useful method of drug discovery. This includes the collection of information on the three-dimensional structure of the target receptor (protein) via approaches such as X-ray crystallography, NMR spectroscopy, or homology modeling, followed by the design of promising drug candidates based on binding and selective efficiencies for their target groups. The method involves several steps, such as retrieval and preparation of protein structure, preparation of ligand archives and manual design of new novel compounds (Batool et al. 2019).
In this research work Genetic function algorithm was employed to generate the QSAR models due to its ability to produce a vast population of model instead of just a single model. Four models were generated from the model building set and the first one was chosen because of its statistical significance.
The results of various statistical parameters of the selected model are shown in Table 1, therefore, the model satisfies the least required values for the evaluation of a robust QSAR model. Additionally, the selected model was utilized to estimate the inhibitive capacity of the external validation set and it was found to have passed the external validation test with R2ext = 0.655 (Table 2). The structures, experimental as well as predicted inhibitive activities of the compounds in this research work are placed in Table 1 respectively.
A plot of experimental pIC50 against the predicted pIC50 of both the model building as well as the external validation set is shown in Fig. 1, while Fig. 2 represent a plot of experimental Pic50 of both the model building and the external validation set against their residuals. The selected model does not exhibit systematic error since the standardized residuals are randomly dispersed on either side of zero (Ibrahim et al. 2018).
Pearson’s correlation matrix of the selected model indicates that descriptors are not correlated to each other, this illustrates that they are very good (Table 3).
Result of Y-randomization test is shown in Table 4 and the test is used to confirm that a model was not obtained by coincidental correlation. The coefficient of determination for Y-randomization cR2p is the most crucial parameter for this test and for a robust QSAR model its value must exceed 0.5. Its value in this research work is 0.74, this illustrates that the model is powerful enough and was not purely due to chance and has satisfied the minimal requirement for robustness.
Mean effect of the descriptors in the selected model
The mean effect designate the individual function and the influence of each descriptor in a model, and it is computed for each of the molecular descriptors using the below equation:
MFj denotes the mean effect for the indicated descriptor j, the coefficient of the descriptor j is denoted by βj, dij is the value of the Target descriptor for each molecule and m population of descriptors in the model (Dimić et al. 2015).
The MF value offers essential information on the effect of each molecular descriptors in the picked model; the size and signals of these descriptors combined with their mean effects reveal their stability and path in inducing the activity of a molecule. The mean effect values are presented in Table 5. BCUTc-1l, MATS8c and SpMAD_Dzs were found to possess the most pronounced influence on the model performance due to their large and positive mean effect values. Their positive sign indicated that increase in their value increases the inhibitive activities of a compound against MDA-MB231 breast cancer cell line. The other descriptor, AATSC8c is negatively correlated with the inhibitive activities of the compounds against the breast cancer cell line, higher value of this descriptor will be responsible for hindering the potency of these compound.
Variance inflation factor (VIF)
The inter-correlation amongst molecular descriptors in a model is detected using their variation inflation factors (VIF), to check whether the descriptors are highly correlated with one another or not. computed VIF values less than 1 illustrates that there is no inter-correlation between the descriptors between 1 and 5, the model can be accepted; and if it is higher than 10, the model cannot be accepted. It can be calculated using the Eq. 8 below. In this research work VIF values for all the descriptors are less than 10, this demonstrates the fitness of the selected model and the descriptors were independent of one another (Table 3).
Williams’ plot of the selected model
A plot of leverage values against standardized residuals of a particular set of compounds is called Williams plot. It enables the detection of a sample that is outside the defined domain of applicability of a particular model. A compound having a leverage value that exceeds the cut-off leverage is tagged as influential as it may affect the performance of the model. In this research, the Williams plot for model 1 is shown in Fig. 3, the cut-off leverage is 0.833 hence, and only the external validation set compounds lies beyond the defined domain of applicability (leverage values > 0.833). These compounds affects the performance of the model but cannot be tagged as outliers since their standardized residual values lies within ± 3 region.
Molecular docking studies
Molecular docking studies is performed to have the knowledge on the nature of binding interactions and the amino acid residues that are accounted for inducing the biological activity of a molecule. In this research work docking simulation study was performed between all the studied quinazoline derivatives and the binding pocket of the EGFR protein receptor (pdb id = 3ug2) and the results are placed in Table 2. The reference drug (Gefitinib) was also redocked into the same binding pocket to revalidate the docking results. Eight compounds (6, 10, 13, 16, 17, 18, 19 and 20) were observed to have better docking score as well as experimental and predicted activities than Gefitinib as such they were tagged as potential hit compounds. Various types of Amino acid interactions between the potential hit compounds and the active site of the EGFR receptor are presented in Table 6.
Compound 6 is observed to have interacted with the binding site of the EGFR receptor via one (1) conventional Hydrogen bond, two (2) Carbon-Hydrogen bond, one (1) Pi-sulfur interaction and several Alkyl and Pi-Alkyl Interactions. The Carbonyl Oxygen of the quinazoline ring forms a conventional Hydrogen bond with MET793 at a distance 1.86 Å, and a Carbon-Hydrogen bond with LEU792 at a distance 2.46 Å. Other Carbon-Hydrogen bond is between the Hydrogen atom H8 and GLN791 amino acid residue at a distance 2.85 Å. The benzene ring intercalated in space and forms a π-Sulfur interaction with MET790 at a distance 3.46 Å. ALA743, LEU718, LEU792, CYS775, MET790, MET793, LEU844, VAL726 and LYS745 residues forms Alkyl interactions with the compound and ALA743, LEU844, LEU718, LYS745 and LEU788 amino acid residues forms π-Alkyl interactions. 2D and 3D binding nature of compound 6 in the binding pocket of the EGFR receptor is shown Fig. 4.
Compound 10 interacted with the binding pocket of the EGFR receptor through a single Conventional Hydrogen bond, double Carbon-Hydrogen bond, single electrostatic π-Cation Hydrogen bond, Hydrophobic π-Sigma, one π-Sulfur, and several Alkyl and π-Alkyl interactions. The quinazoline group carbonyl oxygen forms a conventional Hydrogen bond with MET793 at a bond distance 2.15 Å, and forms a Carbon-Hydrogen bond with LEU792 at a bond distance 2.79 Å, ALA743 forms the other Carbon-Hydrogen bond with the cyanide carbon at a bond distance 3.14 Å. The phenyl ring intercalated in space and forms a π-cation Hydrogen bond with LYS745 at a bond distance 3.10 Å. LYS745 also forms a single electrostatic π-Sigma with the benzene ring, CYS775 forms a π-Sulfur interaction with the benzene ring of the quinazoline scaffold at a distance 5.58 Å. CYS775, MET790, MET793 and LEU718 residues forms an Alkyl interactions while ALA743, MET790, LEU844, MET793 and LEU844 amino acid residues forms a π-Alkyl interactions with the compound. 2D and 3D binding mode of compound 10 in the binding site of the EGFR receptor is presented in Fig. 5 respectively.
Compound 13 was observed to have interacted with the EGFR receptor via a single conventional Hydrogen bond, three (3) Carbon-Hydrogen bonds, a single π-sigma and π-Sulfur interaction and several hydrophobic alkyl and π-Alkyl interactions. The carbonyl oxygen of the quinazoline group forms a conventional and Carbon-Hydrogen bonds with MET793 and LEU792 at a bond distances 1.61 and 2.45 Å, other Carbon-Hydrogen bond interactions are observed between Methoxy Hydrogen atom and GLU762 at a distance 2.89 Å, Hydrogen atom H8 and GLN791 at a bond distance 2.84 Å. The phenyl ring of the quinazoline scaffold intercalated in space to form a π-sigma hydrophobic interaction with LEU718, a π-Sulfur interaction is observed between the other benzene ring and MET790. LEU718, CYS775, MET790, MET793 and LEU844 residues forms Alkyl interactions while LEU718, VAL726, ALA743, LEU844, LYS745 and LEU788 forms a Pi-Alkyl hydrophobic interactions. 2D and 3D binding mode of compound 13 in the binding site of the EGFR receptor (pdb id = 3ug2) is pictured in Fig. 6.
The binding interactions of compound 16 is through a single conventional and Carbon-Hydrogen bond, one (1) π-Sulfur, and numerous alkyl as well as π-alkyl interactions. The carbonyl oxygen atom of the quinazoline group forms a conventional and Carbon-Hydrogen bond with MET793 and LEU792 at a bond distance 1.65 and 2.55 Å. The phenyl ring of the quinazoline scaffold forms a π-Sulfur interaction with CYS775 at a bond distance 4.35 Å. CYS775, MET793, LYS745, LEU788 and MET790 forms an alkyl interactions while VAL726, ALA743, LYS745, MET790, LEU844, ALA743, MET793 and LEU718 amino acid residues forms a Pi-Alkyl hydrophobic interactions. Figure 7 represent the 2D and 3D binding mode of compound 16 in the binding site of the EGFR receptor (pdb id = 3ug2).
The binding mode of compound 17 is through single conventional Hydrogen bond, triple Carbon-Hydrogen bonds, single Pi-Sulfur interactions and many Alkyl as well as Pi-Alkyl Hydrophobic interactions. The quinazoline carbonyl oxygen interacted with MET793 to form a conventional Hydrogen bond at a distance 2.35 Å, and also forms a carbon-hydrogen bond when interacted with LEU792 at a distance 2.85 Å, the Nitro group oxygen forms another Carbon-Hydrogen bond with ASN842 at a bond distance 2.73 Å, Hydrogen atom of the methyl group that connects the quinazoline scaffold forms the other Carbon-Hydrogen bond with GLN791 at distance 2.69 Å. MET790 forms a Pi-Sulfur interaction, LEU718 and LEU792 forms an Alkyl interaction while LEU844, VAL726, ALA743, LEU718, CYS775, MET790 and MET793 forms a hydrophobic Pi-Alkyl interactions. 3D and 2D Binding interactions of compound 17 in the active site of the EGFR receptor is placed in Fig. 8.
The binding mode of compound 18 in the active site of the EGFR receptor is through a single conventional Hydrogen bond, an electrostatic π-Anion and π-alkyl interactions. Oxygen atom of the oxadiazole group forms a conventional Hydrogen bond with SER719 at a bond distance 2.88 Å, the quinazoline benzene ring moiety intercalated in space and forms an electrostatic π-Anion interaction with ASP855. LEU844, VAL726, ALA743, LYS745 and MET790 amino acid residues forms Pi- Alkyl hydrophobic interactions. 2D and 3D binding mode of compound 18 in the binding site of the EGFR receptor (pdb id = 3ug2) are shown in Fig. 9.
Compound 19 is bound to the active site of the EGFR receptor via two (2) conventional Hydrogen bonds, a single Pi-cation Hydrogen bond, a hydrophobic Pi-Sigma interaction, a Pi-sulfur and numerous Pi-Alkyl hydrophobic interactions. LYS745 and MET793 forms conventional Hydrogen bonds at bond distances 2.92 and 2.96 Å, LYS745 forms a Pi-Cation Hydrogen bond and Hydrophobic Pi-Sigma interactions at distances 3.33 and 2.57 Å, MET790 forms a Pi-sulfur interaction. VAL726, ALA743, LYS745, MET790, LEU788, LEU718 and LEU792amino acid residues formed Pi- Alkyl hydrophobic interactions. 2D and 3D binding mode of compound 21 in the active site of the EGFR receptor (pdb id = 3ug2) are placed in Fig. 10 respectively.
Binding mode of compound 20 is through two (2) conventional Hydrogen bonds, single Carbon-Hydrogen bond, single electrostatic Pi-anion, single Pi-Sulfur and hydrophobic Pi-Alkyl Interactions. Carbonyl Oxygen atoms of the quinazoline scaffold and Isoindoline-1,3-dione group forms conventional Hydrogen bonds with MET793 and LYS745 at distances 2.16 and 1.98 Å, Hydrogen atom of the methyl group that connects the quinazoline scaffold to the phenyl ring forms a Carbon-Hydrogen bond with GLN791 at distance 1.83 Å. ASP855 forms an electrostatic Pi-Anion interaction, MET790 forms a Pi-Sulfur Interaction, while VAL726, ALA743, LEU844, VAL726, CYS775, MET790 and MET793 forms a hydrophobic Alkyl interactions. 3D and 2D binding interactions of compound 20 in the active site of the EGFR receptor is shown in Fig. 11 respectively.
To validate the docking approach the reference drug, Gefitinib was also docked onto the binding pocket of the EGFR receptor and was observed to interact with the protein kinase via a single conventional Hydrogen bond, eight (8) Carbon-Hydrogen bond, single Pi-Sulfur interaction, several Alkyl and Pi-Alkyl hydrophobic interaction. Hydrogen atom attached to the Amino group formed a conventional Hydrogen bond with GLN791 at a bond distance 2.81 Å, Hydrogen atom H3 of the quinazoline group formed a Carbon-Hydrogen bond withALA743 at distance 2.89 Å, Hydrogen atom of the Hydroxyl group attached to the Quinazoline frame forms another Carbon-Hydrogen bond with GLU762 at a distance 3.00 Å, Additionally, Hydrogen atom of the Morpholine group and that of a methyl group adjacent to it forms double Carbon-Hydrogen bonds with ASP855 at a bond distances 2.75 and 2.25 Å respectively, ARG841 forms double Carbon-Hydrogen bonds with two morpholine group Hydrogen atoms at distances 2.76 and 2.61 Å, the last one is with ASN842 at distance 2.95 Å. MET790 forms a Pi-Sulfur interactions, ARG841, LEU718, VAL726, LYS745, MET790, ALA743 and LEU844 forms Alkyl and Pi-Alkyl Interactions. Figure 12 represent the 3D and 2D binding mode of Gefitinb in the active site of EGFR receptor.
Structure-based drug design
In this research work, all the quinazoline derivatives were docked on to the binding site of the EGFR (pdb id = 3ug2). Compound 19 (pred pIC50 = 5.67, Residual = − 0.04 and MolDock score = − 123.238) was identified as the best compound since it has the best Moldock score and was excellently predicted by the model selected with least residual value and was within the defined applicability domain, hence, it is adopted as template for the design. Ten (10) novel compounds were designed by addition of various groups on the Meta, Para and Ortho positions of the isoindoline-1, 3-dione phenyl ring. The inhibitive activities of the designed compounds were predicted by the selected model and most of them possess an improved activity relative to the template compound. The structure, predicted activity and MolDock score of the designed compounds are presented in Table 7.
Molecular docking studies of the designed compounds
Molecular docking investigation was also performed for the designed compounds and the binding site of the EGFR receptor (pdb id = 3ug2) using Molegro Virtual Docker (MVD) software. The designed compounds were optimized to obtain the most stable and least energy conformer using DFT calculations utilizing B3LYP 631G* basis set and the optimized molecules were saved in pdb format. All the designed compounds displayed better docking scores compared to the Template and the reference drug (Gefitinib) utilized in the design. Types of amino acid interactions of the designed compounds and the active site of the EGFR receptor are presented in Table 8. The results of three (3) compounds with best docking scores are discussed in this research.
Designed compound number three (3) has the best docking score (MolDock score = − 159.63) and it is found to interact with the EGFR receptor via three (3) Carbon-Hydrogen bonds, two (2) Alkyl and many Pi-Alkyl hydrophobic interactions. GLU762 forms two (2) Carbon-Hydrogen bonds with dimethyl amine Hydrogen atoms at a distance 2.64 and 2.98 Å, THR854 residue forms the other Carbon-Hydrogen bond with the Hydrogen atom of the methyl benzene that is directly attached to the Quinazoline scaffold at a distance 2.51 Å. LYS745 and LEU788 forms the alkyl interactions while ALA743, LYS745, LEU788, MET790, CYS775, MET793, LEU844 and LYS852 residues forms the Pi-Alkyl interactions. The 3D and 2D binding mode of designed compound 3 in the active pocket of the EGFR receptor is shown in Fig. 13.
Designed compound 2 has the second best docking score (− 152.085) and it is found to have interacted with the EGFR receptor through three (3) conventional Hydrogen bonds, three (3) Carbon-Hydrogen bonds, single electrostatic Pi-Cation, Pi-Anion and Hydrophobic Pi-Sigma, two (2) Pi-Sulfur, single Alkyl and many Pi-Alkyl hydrophobic interactions. The Oxygen atom of the Isoindoline-1,3-dione forms a conventional Hydrogen bond with MET793 at a bond distance 1.66 Å, ASP855 forms another conventional Hydrogen bond with Hydrogen atom of the OH- group attached to the amino benzene group at a bond distance 1.87 Å, Carbonyl Oxygen of the quinazoline scaffold forms the other Conventional Hydrogen bond with CYS797 at a distance 2.26 Å. Similarly, Oxygen atom of the Isoindoline-1, 3-dione forms a Carbon-Hydrogen bond with LEU792 at a distance 2.77 Å, Methyl group Hydrogen atom attached to the Isoindoline-1,3-dione forms another Carbon-Hydrogen bond with MET793 at a distance 2.92 Å and the last one is between ASP800 and the Hydrogen atom of the methyl group that connects the phenyl ring to the main Quinazoline scaffold at a distance 2.87 Å. Additionally, the para-Hydroxy amino benzene intercalated in space to form an electrostatic Pi-Anion interaction with LYS745, ASP800 forms a Pi-Anion electrostatic interaction, GLY796 forms a Hydrophobic Pi-Sigma, MET790 and CYS797 forms a Pi-Sulfur interaction, LEU844 forms Alkyl and VAL726, ALA743 and LEU844 residues formed a hydrophobic Pi-Alkyl interactions. 3D and 2D binding pattern of designed compound 2 in the active site of the EGFR receptor (pdb id = 3ug2) is presented in Fig. 14.
Designed compound 6 also having a promising docking score (MolDock score = − 146.947) interacted with the active pocket of the EGFR receptor via two (2) conventional Hydrogen bonds, four (4) Carbon-Hydrogen bonds, two (2) Pi-Anion electrostatic interactions, and Pi-Alkyl hydrophobic interactions. LYS745 and ASP855 forms a conventional Hydrogen bonds with the Oxygen atoms of the Isoindoline-1,3-dione at a distances 1.54 Å and 2.68 Å, GLU762 forms a Carbon-Hydrogen bond with methyl group Hydrogen atoms attached directly to the Isoindoline group at distances 2.63 Å and 2.73 Å, Oxazolidine Hydrogen atoms forms another Carbon-Hydrogen bonds with ASN842 and ARG841 at distances 2.97 Å and 2.49 Å respectively. ASP855 forms double Pi-Anion electrostatic interactions, and lastly, VAL726, LYS745 and ALA743 forms a hydrophobic Pi-Alkyl interactions. 3D and 2D binding pattern of designed compound 6 in the active site of the EGFR receptor (pdb id = 3ug2) is presented in Fig. 15.
ADMET and pharmacokinetics studies of the designed compounds
The results of ADMET and Pharmacokinetics of the designed compounds are depicted in Tables 9 and 10 respectively. None of the compounds designed violate greater than two of the acceptable thresholds established by Lipinski’s rule of five filters for tiny molecules. Accordingly, they were expected to be permeable across the cell membrane, easily absorbed, transported and diffused (Ibrahim et al. 2020). In addition, the designed compounds were further evaluated for their synthetic accessibility, testing on a scale between 1 (simply synthesizable) to 10 (tediously synthesizable).The designed compound’s synthetic accessibility values ranges 3–4 (Table 10), consequently, they are easily synthesizable. Furthermore, all the designed compounds displayed excellent intestinal (human) absorbance values from 81 to 100% (Table 9). The results of ADMET studies of the designed compounds illustrates that they have the potential of crossing blood brain barrier and central nervous system since for a molecule, blood–brain barrier (BBB), and the penetrability of central nervous system (CNS) approved values are > 0.3 to < − 1 log BB and > − 2 to < − 3 log PS respectively (Clark 2003).
The class of super enzymes Cytochrome P450 (CYP450) have a vigorous role in drug metabolism since it is the key liver protein system responsible for phase-1 metabolism (oxidation). Up to today, only seventeen CYP families were recognized in humans, among which only (CYP1, CYP2, CYP3, and CYP4, respectively) are involved in the drug metabolism, with a CYP (1A2, 2C9, 2C19, 2D6 and 3A4, respectively) were known to be accountable for the biotransformation of at least 90% of the drugs that undertake phase-I metabolism (Šrejber et al. 2018), and have been predicted and presented in Table 9. Moreover, cytochrome CYP3A4 inhibition is the most important principle in the current study. The results of this research indicated that all the designed compounds are the substrates of CYP3A4 as well as inhibitors of CYP3A4, respectively (Table 9). Clearance designates the correlation between the body’s drug levels to its elimination per unit time. A minimal value of the total clearance suggests good perseverance of the drugs in the human body, and all the designed compounds displayed a promising persistence in the body for the drug. Additionally, it is advisable to explore the toxicity level of the designed compounds as it plays an important part in screening of drugs candidates. Based on the results of this research, all the designed compounds are found to be non-toxic (Table 9).
In this research work four (4) QSAR models were developed for a series of quinazoline derivatives with inhibitory activities against Triple Negative Breast Cancer cell line (MDA-MB231). Model 1 was chosen due to its statistical fitness with the following validation parameters R2 = 0.875, Q2 = 0.837, R2 − Q2 = 0.038, Next test set = 5, and R2ext = 0.655. The selected model was utilized for the prediction of the activities of both the model building as well as external validation set of quinazoline derivatives. Molecular docking studies was performed on all the quinazoline series and the reference drug (Gefitinib) and the active site of the EGFR receptor (pdb id = 3ug2).Eight compounds (6, 10, 13, 16, 17, 18, 19 and 20) were observed to have better docking score docking scores relative to Gefitinib. Compound 19 (pred pIC50 = 5.67, Residual = − 0.04 and MolDock score = − 123.238) was identified as the best compound since it has the best Moldock score and was excellently predicted by the model selected with least residual value and lies within the defined applicability domain, Hence, it was adopted as template for the design of Ten (10) new novel compounds with better activities and better docking scores. The inhibitive activities of the designed compounds were predicted by the selected model and most of them possess an improved activity relative to the template compound (19). The designed compounds were also redocked on to active pocket of the EGFR receptor and it was observed that they displayed better docking scores compared to the Template and the reference drug (Gefitinib) utilized in the design. Furthermore, the designed compounds were subjected to ADMET and drug-likeness studies using SWISSADME and pkCSM online web tools and they were observed to be pharmacologically active, easily synthesized and don not violate the Lipinski’s rule of five. The designed compounds are said to be well absorbed by the Human intestine values due to their high absorbance values from 81 to 100%, they have the potential of crossing blood–brain barrier and central nervous system, they are all substrate as well as inhibitors of the most vital CYP families 3A4, displayed a promising persistence in the body and are all non-toxic. Hence, the designed compounds can be employed as inhibitors of triple negative breast cancer cell line (MDA-MB231) after passing through in vivo and in vitro evaluation.
Availability of data and materials
Epidermal Growth Factor Receptor
Triple Negative Breast Cancer
Quantitative structure activity relationship
Density functional theory
Pharmaceutical Data Exploration Laboratory
Bee -3- Lee Yang Par
Genetic function algorithm-Multi linear regression
Absorption, Distribution, Metabolism, Excretion and Toxicity
Abdullahi SH, Uzairu A, Ibrahim MT, Umar AB (2021) Chemo-informatics activity prediction, ligand based drug design, Molecular docking and pharmacokinetics studies of some series of 4, 6-diaryl-2-pyrimidinamine derivatives as anti-cancer agents. Bull Natl Res Cent 45:167. https://doi.org/10.1186/s42269-021-00631-w
Abuelizz HA, Marzouk M, Ghabbour H, Al-Salahi R (2017) Synthesis and anticancer activity of new quinazoline derivatives. Saudi Pharma J 25(2017):1047–1054
Al-Suwaidan IA, Abdel-Aziz AA-M, Shawer TZ, Ayyad RR, Alanazi AM, El-Morsy AM, Mohamed MA, Abdel-Aziz NI, El-Sayed MA-A, El-Azab AS (2016) Synthesis, antitumor activity and molecular docking study of some novel 3-benzyl-4 (3H) quinazolinone analogues. J Enzyme Inhib Med Chem 31(1):78–89
Amin SA, Gayen S (2016) Modelling the cytotoxic activity of pyrazolo- triazole hybrids using descriptors calculated from the open source tool “PaDEL-descriptor.” J Taibah Univ Sci 10(6):896–905
Batool M, Ahmad B, Choi S (2019) A structure-based drug discovery paradigm. Int J Mol Sci 20(11):2783
Chandregowda V, Kush AK, Reddy GC (2009) Synthesis and in vitro antitumor activities of novel 4-anilinoquinazoline derivatives. Eur J Med Chem 44:3046–3055
Clark DE (2003) In silico prediction of blood–brain barrier permeation. Drug Discov Today 8(20):927–933
Dimić D, Mercader AG, Castro EA (2015) Chalcone derivative cytotoxicity activity against MCF-7 human breast cancer cell QSAR study. Chemom Intell Lab Syst 146:378–384. https://doi.org/10.1016/j.chemolab.2015.06.011
El-Azab AS, Al-Omar MA, Abdel-Aziz AAM, Abdel-Aziz NI, El-Sayed MAA, Aleisa AM, Sayed-Ahmed MM, Abdel-Hamide SG (2010) Design, synthesis and biological evaluation of novel quinazoline derivatives as potential antitumor agents: molecular docking study. Eur J Med Chem 45:4188–4198
Hu K, Law JH, Fotovati A, Dunn SE (2012) Small interfering RNA library screen identified polo-like kinase-1 (PLK1) as a potential therapeutic target for breast cancer that uniquely eliminates tumor-initiating cells. Breast Cancer Res 14(1):1–5
Hynes NE, Lane HA (2005) ERBB receptors and cancer: the complexity of targeted inhibitors. Nat Rev Cancer 5:341–354
Ibrahim MT, Uzairu A, Shallangwa GA, Ibrahim A (2018) In-silico studies of some oxadiazoles derivatives as anti-diabetic compounds. J King Saud Univ Sci. https://doi.org/10.1016/j.jksus.2018.06.006
Ibrahim MT, Uzairu A, Shallangwa GA, Uba S (2020) Structure-based design of some quinazoline derivatives as epidermal growth factor receptor inhibitors. Egypt J Med Hum Genet 21:63. https://doi.org/10.1186/s43042-020-00107-y
Jo J, Kim SH, Kim H, Jeong M, Kwak JH, Han YT, Jeong JY, Jung YS, Yun H (2019) Discovery and SAR studies of novel 2-anilinopyrimidine-based selective inhibitors against triple-negative breast cancer cell line MDA-MB-468. Bioorg Med Chem Lett 29(1):62–65
Kaplan W (2013) Background paper 6.5 cancer and cancer therapeutics. In: World Health Organization (ed) Priority medicines for Europe and the world: update, pp 5–6
Kennard RW, Stone LA (1969) Computer aided design of experiments. Technometrics 11(1):137–148
Lawal HA, Uzairu A, Uba S (2021) QSAR, molecular docking studies, ligand-based design and pharmacokinetic analysis on Maternal Embryonic Leucine Zipper Kinase (MELK) inhibitors as potential anti-triple-negative breast cancer (MDA-MB-231 cell line) drug compounds. Bull Natl Res Cent 45:90. https://doi.org/10.1186/s42269-021-00541-x
Leardi R (1996) Genetic algorithms in molecular modeling. Elsevier, pp 67–86
Rajabi S, Maresca M, Yumashev AV, Choopani R, Hajimehdipoor H (2021) The most competent plant-derived natural products for targeting Apoptosis in cancer therapy. Biomolecules 11(4):534. https://doi.org/10.3390/biom1104053
Šrejber M, Navrátilová V, Paloncýová M, Bazgier V, Berka K, Anzenbacher P, Otyepka M (2018) Membrane-attached mammalian cytochromes P450: an overview of the membrane’s effects on structure, drug binding, and interactions with redox partners. J Inorg Biochem 183:117–136
Thomsen R, Christensen MH (2006) MolDock: a new technique for high accuracy molecular docking. J Med Chem 49(11):3315–3321
Tiwari SK, Sachan S, Mishra A, Tiwari S, Pandey V (2015) The anticancer activity of some novel 4-anilino quinazoline derivatives as tyrosine kinase (EGFR) inhibitor and the quantitative structure activity relationships. Int J Pharm Life Sci 6:4819–4828
Tropsha A, Gramatica P, Gombar VK (2003) The importance of being earnest: validation is the absolute essential for successful application and interpretation of QSPR models. Mol Inf 22(1):69–77
Viswanadhan VN, Ghose AK, Revankar GR, Robins RK (1989) Atomic physicochemical parameters for three dimensional structure directed quantitative structure-activity relationships. 4. Additional parameters for hydrophobic and dispersive interactions and their application for an automated superposition of certain naturally occurring nucleoside antibiotics. J Chem Inf Comput Sci 29(3):163
The author sincerely acknowledges all the contributors of this exploratory group for their guidance and motivation at some stages of this research work and Ahmadu Bello University for providing the softwares and conducive environment utilized for this analysis.
Funds allocation was not received by the authors for this research.
Ethics approval and consent to participate
Not applicable, because this article does not contain any studies with animal or human subjects.
Consent of publication
The correspondents did not acknowledge competing of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Abdullahi, S.H., Uzairu, A., Shallangwa, G.A. et al. In-silico activity prediction, structure-based drug design, molecular docking and pharmacokinetic studies of selected quinazoline derivatives for their antiproliferative activity against triple negative breast cancer (MDA-MB231) cell line. Bull Natl Res Cent 46, 2 (2022). https://doi.org/10.1186/s42269-021-00690-z
- Density function theory
- Quantitative structure activity relationship
- Triple negative breast cancer
- Molecular docking
- Pharmacokinetic studies