In-silico activity prediction, structure-based drug design, molecular docking and pharmacokinetic studies of selected quinazoline derivatives for their antiproliferative activity against triple negative breast cancer (MDA-MB231) cell line

Background: Cancer is a major health threat especially in unindustrialized nations. It surpasses coronary diseases and takes the number one killer position as a result of different global wide influences. Among many breast cancer substrates, triple-negative breast cancer (TNBC) is particularly devastating because it rapidly metastasize to other parts of the body, with a high risk of earlier recession and mortality. Result: In this research work, four (4) quantitative structure activity relationship (QSAR) models were developed using a series of quinazoline derivatives with activities against triple negative breast cancer cell line (MDA-MB231), model 1 was selected due to its statistical fitness with the following validation parameters: R = 0.875, Q = 0.837, R − Q = 0.038, Next test set = 5, and Rext = 0.655. Molecular docking studies was performed for the quinazoline series as well as the reference drug (Gefitinib) and the active site of the epidermal growth factor receptor (EGFR) (pdb id = 3ug2). Eight compounds (6, 10, 13, 16, 17, 18, 19 and 20) were observed to have better docking score docking scores relative to Gefitinib. Compound number nineteen from the training set (pred pIC50 = 5.67, Residual = − 0.04 and MolDock score = − 123.238) was identified as the best compound since it has the best Moldock score and was excellently predicted by the selected model with least residual value, Hence was adopted as template for the design of Ten (10) new novel compounds with better activities and better docking scores. The inhibitive activities of the designed compounds were predicted by the selected model and most of them possess an improved activity relative to the template compound (19). The designed compounds were also redocked on to active pocket of the EGFR receptor and it was observed that they displayed better docking scores compared to the Template and the reference drug (Gefitinib) utilized in the design. Furthermore, the designed compounds were subjected to ADMET and druglikeness studies using SWISSADME and pkCSM online web tools and they were observed to be pharmacologically active, easily synthesized and do not violate the Lipinski’s rule of five. Conclusion: Hence, the designed compounds can be employed as inhibitors of MDA-MB231 cell line after passing through in vivo and in vitro evaluation. © The Author(s) 2021. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/. Open Access Bulletin of the National Research Centre *Correspondence: sagirwasai@gmail.com Department of Chemistry, Faculty of Physical Sciences, Ahmadu Bello University, P.M.B.1045, Zaria, Kaduna State, Nigeria Page 2 of 23 Abdullahi et al. Bulletin of the National Research Centre (2022) 46:2 Background Cancer is a challenging problem for the global health community, and its increasing burden necessitates seeking novel and alternatives therapies (Rajabi et  al. 2021). It takes the number one killer position as a result of different global wide influences. Although considerable progress were made in the chemotherapeutic remedy of some victims, the unrelenting obligation to the difficult task of detecting new anti-cancer drugs is still crucial. Breast cancer is the most predominant class of cancer diagnosed in females around the globe, with an incidence that intensifies vividly with age. Among many breast cancer substrates, triple-negative breast cancer (TNBC) is particularly devastating because it rapidly disperse to other parts of the body, with a high risk of earlier recession and mortality (Hu et  al. 2012). Annually, at least one million females are identified with breast tumor and TNBC is accountable for close to 15–20% of the complete breast cancer identified (Jo et al. 2019). The epidermal growth factor receptor (EGFR) plays a crucial part in the control of cell growth and is regarded as one of the most seriously evaluated tyrosine kinase’s (TK) target inhibitors (El-Azab et  al. 2010). Numerous TKs had a vital functions in cell propagation, division, metastasis and endurance, besides their uncontrolled triggering via processes such as point mutations leads to a substantial proportion of clinical cancers. EGFR is over expressed in numerous tumors, such as brain, lung, bladder, ovarian, colon, breast, head, and prostate tumors (Tiwari et al. 2015). Components of the erbB class of EGFR-TKs, which comprise of erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4), are overexpressed in a substantial ratio of human tumors, and this is attributed to the miserable prognosis of the malady (Chandregowda et al. 2009). Hence, inhibitors of erbB1 and erbB2 were acknowledged as possible anticancer drugs (Hynes and Lane 2005). Extermination of cancer cells without causing damage on other normal tissues or cells is the main purpose of anti-cancer drugs. However, the fact that some of these drugs usually destroys some other normal cells and the resistance to these drugs experienced by some patients during early period of treatment necessitates the global search for identifying new higher quality drugs that are safe for the prevention and remedy of cancer (AlSuwaidan et  al. 2016). Immediate recognition, understanding of the cause and pathway of this disorder, and improvement in remedy have played a pivotal part in curtailing breast cancer mortality rates over the past few years. Chemotherapy is still the central key to thorough therapy since it can exterminate tumor cells rapidly in the human system (Kaplan 2013). In-silico approach of drug discovery have proven to accelerate the drug discovery process, as it lessen the time taken, resources and it enables the estimation of properties of new molecules such as toxicity and efficiency even before their synthesis. A mathematical relations that are able to establish a quantitative relationship between biological activities of a molecules and their molecular structures in form of linear equation is called Quantitative Structure–Activity Relationships (QSAR) (Abdullahi et al. 2021). Efficiency and safety of the drug to the system are the two major causes leading to drug failure. Therefore, it is compulsory to find potent molecules with better ADMET properties “drug-likeliness” (Lawal et al. 2021). This research is mainly purposed in utilizing QSAR approach to compute the inhibitory activities of a series of quinazoline derivatives against MDA-MB231 breast cancer cell line, perform molecular docking studies to understand the nature of interaction between the compounds and the EGFR protein receptor, design new potent compounds based on their docking scores and examine their ADMET and drug likeness properties. Methods Data sets retrieval A series of 23 quinazoline derivatives with inhibitory activities (IC50 in μg/ml) against Triple Negative Breast cancer cell line MDA-MB231 are retrieved from Abuelizz et al. (2017). The inhibitory activities were linearized by taking their negative logarithm to base 10 as shown in Eq. 1. Chemical structure of the quinazoline analogs as well as their respective inhibitive capacity at 50% concentration (pIC50) are presented in Table 1. Calculation of molecular descriptors 2D structures of the quinazoline analogs were sketched by utilizing Chemdraw version 16.0 and they were transformed to 3D format using Spartan 14 software. Molecular mechanics force field were employed to clean the 3D structures to eliminate all strain from the structure of (1) pIC50 = −log10 ( IC50 × 10 −6 )

Page 2 of 23 Abdullahi et al. Bulletin of the National Research Centre (2022)

Background
Cancer is a challenging problem for the global health community, and its increasing burden necessitates seeking novel and alternatives therapies (Rajabi et al. 2021).
It takes the number one killer position as a result of different global wide influences. Although considerable progress were made in the chemotherapeutic remedy of some victims, the unrelenting obligation to the difficult task of detecting new anti-cancer drugs is still crucial. Breast cancer is the most predominant class of cancer diagnosed in females around the globe, with an incidence that intensifies vividly with age. Among many breast cancer substrates, triple-negative breast cancer (TNBC) is particularly devastating because it rapidly disperse to other parts of the body, with a high risk of earlier recession and mortality (Hu et al. 2012). Annually, at least one million females are identified with breast tumor and TNBC is accountable for close to 15-20% of the complete breast cancer identified (Jo et al. 2019). The epidermal growth factor receptor (EGFR) plays a crucial part in the control of cell growth and is regarded as one of the most seriously evaluated tyrosine kinase's (TK) target inhibitors (El-Azab et al. 2010). Numerous TKs had a vital functions in cell propagation, division, metastasis and endurance, besides their uncontrolled triggering via processes such as point mutations leads to a substantial proportion of clinical cancers. EGFR is over expressed in numerous tumors, such as brain, lung, bladder, ovarian, colon, breast, head, and prostate tumors (Tiwari et al. 2015).
Components of the erbB class of EGFR-TKs, which comprise of erbB2 (HER2), erbB3 (HER3), and erbB4 (HER4), are overexpressed in a substantial ratio of human tumors, and this is attributed to the miserable prognosis of the malady (Chandregowda et al. 2009). Hence, inhibitors of erbB1 and erbB2 were acknowledged as possible anticancer drugs (Hynes and Lane 2005).
Extermination of cancer cells without causing damage on other normal tissues or cells is the main purpose of anti-cancer drugs. However, the fact that some of these drugs usually destroys some other normal cells and the resistance to these drugs experienced by some patients during early period of treatment necessitates the global search for identifying new higher quality drugs that are safe for the prevention and remedy of cancer (Al-Suwaidan et al. 2016). Immediate recognition, understanding of the cause and pathway of this disorder, and improvement in remedy have played a pivotal part in curtailing breast cancer mortality rates over the past few years. Chemotherapy is still the central key to thorough therapy since it can exterminate tumor cells rapidly in the human system (Kaplan 2013).
In-silico approach of drug discovery have proven to accelerate the drug discovery process, as it lessen the time taken, resources and it enables the estimation of properties of new molecules such as toxicity and efficiency even before their synthesis. A mathematical relations that are able to establish a quantitative relationship between biological activities of a molecules and their molecular structures in form of linear equation is called Quantitative Structure-Activity Relationships (QSAR) (Abdullahi et al. 2021). Efficiency and safety of the drug to the system are the two major causes leading to drug failure. Therefore, it is compulsory to find potent molecules with better ADMET properties "drug-likeliness" (Lawal et al. 2021).
This research is mainly purposed in utilizing QSAR approach to compute the inhibitory activities of a series of quinazoline derivatives against MDA-MB231 breast cancer cell line, perform molecular docking studies to understand the nature of interaction between the compounds and the EGFR protein receptor, design new potent compounds based on their docking scores and examine their ADMET and drug likeness properties.

Data sets retrieval
A series of 23 quinazoline derivatives with inhibitory activities (IC 50 in µg/ml) against Triple Negative Breast cancer cell line MDA-MB231 are retrieved from Abuelizz et al. (2017). The inhibitory activities were linearized by taking their negative logarithm to base 10 as shown in Eq. 1.
Chemical structure of the quinazoline analogs as well as their respective inhibitive capacity at 50% concentration (pIC 50 ) are presented in Table 1.

Calculation of molecular descriptors
2D structures of the quinazoline analogs were sketched by utilizing Chemdraw version 16.0 and they were transformed to 3D format using Spartan 14 software. Molecular mechanics force field were employed to clean the 3D structures to eliminate all strain from the structure of (1) pIC 50 = −log 10 IC 50 × 10 −6 Keywords: Density function theory, Quantitative structure activity relationship, Triple negative breast cancer, Molecular docking, Pharmacokinetic studies  the molecule as well as guaranteeing a well-defined conformer relationship within the compounds (Viswanadhan et al. 1989). Density Functional Theory (DFT) quantum mechanical calculation was employed for the geometry optimization using B3LYP/631G * basis set. The optimized structures were saved in Spatial Document File (SDF) format and then exported to PADEL descriptor calculation software to compute the molecular descriptors (Amin and Gayen 2016).

Data set partitioning
The data set were partitioned into two separate set: Modeling set (training set) and external validation set (test set). The modeling set consist of eighteen (18) compounds while the external validation set is made up of five (5) compounds. Models are built using the modeling set while the predictive ability of the built model was ascertained using the external validation set (Tropsha et al. 2003). This splitting certifies that an analogous standard can be engaged to predict the activity of the test set. Kennard-Stone Algorithm was applied for dividing dataset into a modeling and test set (Kennard and Stone 1969).

QSAR model building and external validation
The most important aspect of QSAR studies is the designation and sampling of descriptors that offers an ultimate information in activity disparities and have minimal co-linearity. Hence, genetic function algorithm (GFA) progresses the model accurateness while selecting relevant molecular descriptors (Leardi 1996). Multi linear regression (MLR) was utilized on the model building set to express the mathematical relations between the depending variable A (pIC 50 ) and independent variable B (molecular descriptors). An exceptional feature of GFA algorithm is that it is able to generate multiple models rather than single model. Validation parameters that provides a guide in selecting the best QSAR model include correlation coefficient (R 2 ), adjusted R 2 (R 2 adj ), cross-validation coefficient (Q 2 cv ) and correlation coefficient of the external validation set (R 2 ext ), all are expressed in Eqs. (2, 3, 4 and 5) respectively.
where P is the number of independent variables in the model and N is the sample size. Y exp , Y pred , and Y mtraining are the experimental activity, the predicted activity, and the mean experimental activity of the compounds in the modeling set, respectively (Tropsha et al. 2003). The least recommended values for these parameters are shown in Table 2.

Y-randomization test
In order to assess the robustness of the model and to affirm that the model was not obtained by chance correlation Y-randomization was performed on the model building set data (Tropsha et al. 2003). A new QSAR model was generated using the descriptor matrix by shuffling the activity matrix randomly. A built QSAR model is robust and reliable only when it has low values of R 2 and Q 2 for numerous trials. Another validation parameter is the coefficient of determination for Y-randomization cR 2 p, and it should exceed 0.5 for passing this test as in Eq. 6 (2) cR 2 p is coefficient of determination for Y-randomization, R is the coefficient of determination for Y-randomization and Rr is average 'R' of random models.

Molecular docking studies
Ligand-Protein molecular docking studies was performed on all the quinazoline derivatives to study the nature of interactions between active pocket of the EGFR protein receptor and the ligands on HP laptop equipped with a dual-core Intel (R) PENTIUM (R) B940 CPU processor running at 2.0 GHz and 4.0 GB of RAM running on Windows 8 using Molegro Virtual Docker (MVD) software and Discovery studio.

Ligand preparation
The least energy optimized structures was saved in pdb file format prior to docking studies (Abdullahi et al. 2021).

Protein retrieval and Preparation
3D X-ray crystallized structure of the EGFR protein receptor (pdb id = 3ug2) was obtained from the protein data bank (https:// www. rcsb. org/), and was prepared on the MVD workspace by eliminating water molecules and co-crystallized ligand enclosed in the crystal structure. The amino acid residues with structural error were repaired/rebuilt. The Fully prepared protein structure was also save in pdb format prior to docking process.

Docking of the ligands and receptor using molegro virtual docker (MVD)
Due to its ability to produce a better and consistent results relative to other docking softwares, Molegro Virtual Docker software program was utilized for the docking study in this research. Before the start of the process, the prepared protein was exported from its folder to the MVD work space, cavities were detected and surface was created. The active pocket of the EGFR receptor was anticipated and was set inside a regulated sphere of X: 0.91 Y: 50.08 and Z: 22.54 with 15 Å radius respectively. The prepared ligands were imported into the MVD work space and docking process was executed using a grid resolution of 0.30 Å. The Root Mean Square Deviation (RMSD) threshold was set as 2.00 Å for the multiple clusters poses with 100.00 energy penalty values. The docking algorithm was set for a maximum of 1500 iteration with a maximum population size of 50. The docking simulation was run for a minimum of 50 times for the 10 poses, and the best poses were determined based on the MolDock score scoring function (Thomsen and Christensen 2006). Hydrogen bond, Hydrophobic, alkyl, pi-alkyl, Halo bonds and aryl intermolecular interactions were viewed with the aid of Discovery studio software.

ADMET and drug-likeness properties
pkCSM ((http:// structure. bioc.cam. ac.uk/pkcsm), and SwissADME (http:// www. swiss adme.ch/ index. php) are free and easily accessible online web sites that are utilized to explore the ADMET and medicinal properties of tiny molecules. The ADMET and Drug-resembleness properties of the compounds are obtained from these sites in the current research. At the pre-clinical phase of drug invention, the most crucial parameter is the Lipinski's rule of five (Abdullahi et al. 2021), and it states that any molecule that violates more than two (2) of the following criteria is not easily permeable or readily absorbed into the body system. The criteria are; MW ˂ 500, HBD ˂ 5, HBA ˂ 10, Log p ˂ 5 and PSA (PSA) ˂140 A 2 .

Structure based drug design
The design of drugs based on structures is also called the direct drug design, a very important, powerful and useful method of drug discovery. This includes the collection of information on the three-dimensional structure of the target receptor (protein) via approaches such as X-ray crystallography, NMR spectroscopy, or homology modeling, followed by the design of promising drug candidates based on binding and selective efficiencies for their target groups. The method involves several steps, such as retrieval and preparation of protein structure, preparation of ligand archives and manual design of new novel compounds (Batool et al. 2019).

Results
In this research work Genetic function algorithm was employed to generate the QSAR models due to its ability to produce a vast population of model instead of just a single model. Four models were generated from the model building set and the first one was chosen because of its statistical significance.

Discussion
The results of various statistical parameters of the selected model are shown in Table 1, therefore, the model satisfies the least required values for the evaluation of a robust QSAR model. Additionally, the selected model was utilized to estimate the inhibitive capacity of the external validation set and it was found to have passed the external validation test with R 2 ext = 0.655 (Table 2). The structures, experimental as well as predicted inhibitive activities of the compounds in this research work are placed in Table 1 respectively. A plot of experimental pIC 50 against the predicted pIC 50 of both the model building as well as the external validation set is shown in Fig. 1 (Ibrahim et al. 2018). Pearson's correlation matrix of the selected model indicates that descriptors are not correlated to each other, this illustrates that they are very good (Table 3).

Y-randomization test
Result of Y-randomization test is shown in Table 4 and the test is used to confirm that a model was not obtained by coincidental correlation. The coefficient of determination for Y-randomization cR 2 p is the most crucial parameter for this test and for a robust QSAR model its value must exceed 0.5. Its value in this research work is 0.74, this illustrates that the model is powerful enough and was not purely due to chance and has satisfied the minimal requirement for robustness.

Mean effect of the descriptors in the selected model
The mean effect designate the individual function and the influence of each descriptor in a model, and it is computed for each of the molecular descriptors using the below equation:  MF j denotes the mean effect for the indicated descriptor j, the coefficient of the descriptor j is denoted by β j , d ij is the value of the Target descriptor for each molecule and m population of descriptors in the model (Dimić et al. 2015).
The MF value offers essential information on the effect of each molecular descriptors in the picked model; the size and signals of these descriptors combined with their mean effects reveal their stability and path in inducing the activity of a molecule. The mean effect values are presented in Table 5. BCUTc-1l, MATS8c and SpMAD_Dzs were found to possess the most pronounced influence on the model performance due to their large and positive mean effect values. Their positive sign indicated that increase in their value increases the inhibitive activities of a compound against MDA-MB231 breast cancer cell line. The other descriptor, AATSC8c is negatively correlated with the inhibitive activities of the compounds against the breast cancer cell line, higher value of this descriptor will be responsible for hindering the potency of these compound.

Variance inflation factor (VIF)
The inter-correlation amongst molecular descriptors in a model is detected using their variation inflation factors (VIF), to check whether the descriptors are highly correlated with one another or not. computed VIF values less than 1 illustrates that there is no inter-correlation between the descriptors between 1 and 5, the model can be accepted; and if it is higher than 10, the model cannot be accepted. It can be calculated using the Eq. 8 below. In this research work VIF values for all the descriptors are less than 10, this demonstrates the fitness of the selected model and the descriptors were independent of one another (Table 3).   Fig. 3, the cut-off leverage is 0.833 hence, and only the external validation set compounds lies beyond the defined domain of applicability (leverage values > 0.833). These compounds affects the performance of the model but cannot be tagged as outliers since their standardized residual values lies within ± 3 region.

Molecular docking studies
Molecular docking studies is performed to have the knowledge on the nature of binding interactions and the amino acid residues that are accounted for inducing the biological activity of a molecule. In this research work docking simulation study was performed between all the studied quinazoline derivatives and the binding pocket of the EGFR protein receptor (pdb id = 3ug2) and the results are placed in Table 2. The reference drug (Gefitinib) was also redocked into the same binding pocket to revalidate the docking results. Eight compounds (6, 10, 13, 16, 17, 18, 19 and 20) were observed to have better docking score as well as experimental and predicted activities than Gefitinib as such they were tagged as potential hit compounds. Various types of Amino acid interactions between the potential hit compounds and the active site of the EGFR receptor are presented in Table 6.
Compound 6 is observed to have interacted with the binding site of the EGFR receptor via one (1) conventional Hydrogen bond, two (2) Carbon-Hydrogen bond, one (1) Pi-sulfur interaction and several Alkyl and Pi-Alkyl Interactions. The Carbonyl Oxygen of the quinazoline ring forms a conventional Hydrogen bond with MET793 at a distance 1.86 Å, and a Carbon-Hydrogen bond with LEU792 at a distance 2.46 Å. Other Carbon-Hydrogen bond is between the Hydrogen atom H8 and GLN791 amino acid residue at a distance 2.85 Å. The benzene ring intercalated in space and forms a π-Sulfur interaction with MET790 at a distance 3.46 Å. ALA743, LEU718, LEU792, CYS775, MET790, MET793, LEU844, VAL726 and LYS745 residues forms Alkyl interactions with the compound and ALA743, LEU844, LEU718, LYS745 and LEU788 amino acid residues forms π-Alkyl interactions. 2D and 3D binding nature of compound 6 in the binding pocket of the EGFR receptor is shown Fig. 4.
Compound 10 interacted with the binding pocket of the EGFR receptor through a single Conventional Hydrogen bond, double Carbon-Hydrogen bond, single electrostatic π-Cation Hydrogen bond, Hydrophobic π-Sigma, one π-Sulfur, and several Alkyl and π-Alkyl interactions. The quinazoline group carbonyl oxygen forms a conventional Hydrogen bond with MET793 at a bond distance 2.15 Å, and forms a Carbon-Hydrogen bond with LEU792 at a bond distance 2.79 Å, ALA743 forms the other Carbon-Hydrogen bond with the cyanide carbon at a bond distance 3.14 Å. The phenyl ring intercalated in space and forms a π-cation Hydrogen bond with LYS745 at a bond distance 3.10 Å. LYS745 also forms a single electrostatic π-Sigma with the benzene ring, CYS775 forms a π-Sulfur interaction with the benzene ring of the quinazoline scaffold at a distance 5.58 Å. CYS775, MET790, MET793 and LEU718 residues forms an Alkyl interactions while ALA743, MET790, LEU844, MET793 and LEU844 amino acid residues forms a π-Alkyl interactions with the compound. 2D and 3D binding mode of compound 10 in the binding site of the EGFR receptor is presented in Fig. 5 respectively. Compound 13 was observed to have interacted with the EGFR receptor via a single conventional Hydrogen bond, three (3) Carbon-Hydrogen bonds, a single π-sigma and π-Sulfur interaction and several hydrophobic alkyl and π-Alkyl interactions. The carbonyl oxygen of the quinazoline group forms a conventional and Carbon-Hydrogen bonds with MET793 and LEU792 at a bond distances 1.61 and 2.45 Å, other Carbon-Hydrogen bond interactions are observed between Methoxy Hydrogen atom and GLU762 at a distance 2.89 Å, Hydrogen atom H8 and GLN791 at a bond distance 2.84 Å. The phenyl ring of the quinazoline scaffold intercalated in space to form a π-sigma hydrophobic interaction with LEU718, a π-Sulfur interaction is observed between the other benzene ring and MET790. LEU718, CYS775, MET790, MET793 and LEU844 residues forms Alkyl interactions while LEU718, VAL726, ALA743, LEU844, LYS745 and LEU788 forms a Pi-Alkyl hydrophobic interactions. 2D and 3D binding mode of compound 13 in the binding site of the EGFR receptor (pdb id = 3ug2) is pictured in Fig. 6.
The binding interactions of compound 16 is through a single conventional and Carbon-Hydrogen  Figure 7 represent the 2D and 3D binding mode of  The binding mode of compound 17 is through single conventional Hydrogen bond, triple Carbon-Hydrogen bonds, single Pi-Sulfur interactions and many Alkyl as well as Pi-Alkyl Hydrophobic interactions. The quinazoline carbonyl oxygen interacted with MET793 to form a conventional Hydrogen bond at a distance 2.35 Å, and also forms a carbon-hydrogen bond when interacted with LEU792 at a distance 2.85 Å, the Nitro group oxygen forms another Carbon-Hydrogen bond with ASN842 at a bond distance 2.73 Å, Hydrogen atom of the methyl group that connects the quinazoline scaffold forms the other Carbon-Hydrogen bond with GLN791 at distance 2.69 Å. MET790 forms a Pi-Sulfur interaction, LEU718 and LEU792 forms an Alkyl interaction while LEU844, VAL726, ALA743, LEU718, CYS775, MET790 and MET793 forms a hydrophobic Pi-Alkyl interactions. 3D and 2D Binding interactions of compound 17 in the active site of the EGFR receptor is placed in Fig. 8.
The binding mode of compound 18 in the active site of the EGFR receptor is through a single conventional Hydrogen bond, an electrostatic π-Anion and π-alkyl interactions. Oxygen atom of the oxadiazole group forms a conventional Hydrogen bond with SER719 at a bond distance 2.88 Å, the quinazoline benzene ring moiety  intercalated in space and forms an electrostatic π-Anion interaction with ASP855. LEU844, VAL726, ALA743, LYS745 and MET790 amino acid residues forms Pi-Alkyl hydrophobic interactions. 2D and 3D binding mode of compound 18 in the binding site of the EGFR receptor (pdb id = 3ug2) are shown in Fig. 9. Compound 19 is bound to the active site of the EGFR receptor via two (2) conventional Hydrogen bonds, a single Pi-cation Hydrogen bond, a hydrophobic Pi-Sigma interaction, a Pi-sulfur and numerous Pi-Alkyl hydrophobic interactions. LYS745 and MET793 forms conventional Hydrogen bonds at bond distances 2.92 and 2.96 Å, LYS745 forms a Pi-Cation Hydrogen bond and Hydrophobic Pi-Sigma interactions at distances 3.33 and 2.57 Å, MET790 forms a Pi-sulfur interaction. VAL726, ALA743, LYS745, MET790, LEU788, LEU718 and LEU792amino acid residues formed Pi-Alkyl hydrophobic interactions.
2D and 3D binding mode of compound 21 in the active site of the EGFR receptor (pdb id = 3ug2) are placed in Fig. 10 respectively.
Binding mode of compound 20 is through two (2) conventional Hydrogen bonds, single Carbon-Hydrogen bond, single electrostatic Pi-anion, single Pi-Sulfur and hydrophobic Pi-Alkyl Interactions. Carbonyl Oxygen atoms of the quinazoline scaffold and Isoindoline-1,3-dione group forms conventional Hydrogen bonds with MET793 and LYS745 at distances 2.16 and 1.98 Å, Hydrogen atom of the methyl group that connects the quinazoline scaffold to the phenyl ring forms a Carbon-Hydrogen bond with GLN791 at distance 1.83 Å. ASP855 forms an electrostatic Pi-Anion interaction, MET790 forms a Pi-Sulfur Interaction, while VAL726, ALA743, LEU844, VAL726, CYS775, MET790 and MET793 forms a hydrophobic Alkyl interactions. 3D and 2D binding  interactions of compound 20 in the active site of the EGFR receptor is shown in Fig. 11 respectively. To validate the docking approach the reference drug, Gefitinib was also docked onto the binding pocket of the EGFR receptor and was observed to interact with the protein kinase via a single conventional Hydrogen bond, eight (8) Carbon-Hydrogen bond, single Pi-Sulfur    Figure 12 represent the 3D and 2D binding mode of Gefitinb in the active site of EGFR receptor.

Structure-based drug design
In this research work, all the quinazoline derivatives were docked on to the binding site of the EGFR (pdb id = 3ug2). Compound 19 (pred pIC 50 = 5.67, Residual = − 0.04 and MolDock score = − 123.238) was identified as the best compound since it has the best Moldock score and was excellently predicted by the model selected with least residual value and was within the defined applicability domain, hence, it is adopted as template for the design. Ten (10) novel compounds were designed by addition of various groups on the Meta, Para and Ortho positions of the isoindoline-1, 3-dione phenyl ring. The inhibitive activities of the designed compounds were predicted by the selected model and most of them possess an improved activity relative to the template compound. The structure, predicted activity and MolDock score of the designed compounds are presented in Table 7.

Molecular docking studies of the designed compounds
Molecular docking investigation was also performed for the designed compounds and the binding site of the EGFR receptor (pdb id = 3ug2) using Molegro Virtual Docker (MVD) software. The designed compounds were optimized to obtain the most stable and least energy conformer using DFT calculations utilizing B3LYP 631G* basis set and the optimized molecules were saved in pdb format. All the designed compounds displayed better docking scores compared to the Template and the reference drug (Gefitinib) utilized in the design. Types of amino acid interactions of the designed compounds and the active site of the EGFR receptor are presented in Table 8. The results of three (3) compounds with best docking scores are discussed in this research. Designed compound number three (3) has the best docking score (MolDock score = − 159.63) and it is found to interact with the EGFR receptor via three (3) Carbon-Hydrogen bonds, two (2) Alkyl and many Pi-Alkyl   Fig. 13. Designed compound 2 has the second best docking score (− 152.085) and it is found to have interacted with the EGFR receptor through three (3) conventional Hydrogen bonds, three (3) Carbon-Hydrogen bonds, single electrostatic Pi-Cation, Pi-Anion and Hydrophobic Pi-Sigma, two (2) Pi-Sulfur, single Alkyl and many Pi-Alkyl hydrophobic interactions. The Oxygen atom of the Isoindoline-1,3-dione forms a conventional Hydrogen bond with MET793 at a bond distance 1.66 Å, ASP855 forms another conventional Hydrogen bond with Hydrogen atom of the OH-group attached to the amino benzene group at a bond distance 1.87 Å, Carbonyl Oxygen of the quinazoline scaffold forms the other Conventional Hydrogen bond with CYS797 at a distance 2.26 Å. Similarly, Oxygen atom of the Isoindoline-1, 3-dione forms a Carbon-Hydrogen bond with LEU792 at a distance 2.77 Å, Methyl group Hydrogen atom attached to the Isoindoline-1,3-dione forms another Carbon-Hydrogen bond with MET793 at a distance 2.92 Å and the last one is between ASP800 and the Hydrogen atom of the methyl group that connects the phenyl ring to the main Quinazoline scaffold at a distance 2.87 Å. Additionally, the para-Hydroxy amino benzene intercalated in space to form an electrostatic Pi-Anion interaction with LYS745, ASP800 forms a Pi-Anion electrostatic interaction, GLY796 forms a Hydrophobic Pi-Sigma, MET790 and CYS797 forms a Pi-Sulfur interaction, LEU844 forms Alkyl and VAL726, ALA743 and LEU844 residues formed a hydrophobic Pi-Alkyl interactions. 3D and 2D binding pattern of designed compound 2 in the active site of the EGFR receptor (pdb id = 3ug2) is presented in Fig. 14.
Designed compound 6 also having a promising docking score (MolDock score = − 146.947) interacted with the active pocket of the EGFR receptor via two (2) conventional Hydrogen bonds, four (4) Carbon-Hydrogen bonds, two (2) Pi-Anion electrostatic interactions, and Pi-Alkyl hydrophobic interactions. LYS745 and ASP855 forms a conventional Hydrogen bonds with the Oxygen atoms of the Isoindoline-1,3-dione at a distances 1.54 Å and 2.68 Å, GLU762 forms a Carbon-Hydrogen bond with methyl group Hydrogen atoms attached directly to the Isoindoline group at distances 2.63 Å and 2.73 Å,  Fig. 15.

ADMET and pharmacokinetics studies of the designed compounds
The results of ADMET and Pharmacokinetics of the designed compounds are depicted in Tables 9 and 10 respectively. None of the compounds designed violate greater than two of the acceptable thresholds established by Lipinski's rule of five filters for tiny molecules. Accordingly, they were expected to be permeable across the cell membrane, easily absorbed, transported and diffused (Ibrahim et al. 2020). In addition, the designed

Availability of data and materials
Not applicable.

Declarations
Ethics approval and consent to participate Not applicable, because this article does not contain any studies with animal or human subjects.

Consent of publication
Not applicable.

Competing interests
The correspondents did not acknowledge competing of interest.