Utilization of Boron Compounds for the Modification of Suberoyl Anilide Hydroxamic Acid as Inhibitor of Histone Deacetylase Class II Homo sapiens

Histone deacetylase (HDAC) has a critical function in regulating gene expression. The inhibition of HDAC has developed into an interesting anticancer research area that targets biological processes such as the cell cycle, apoptosis, and cell differentiation. In this study, a commercially available HDAC inhibitor, suberoyl anilide hydroxamic acid (SAHA), was modified to improve its efficacy and reduce its side effects. The hydrophobic cap and zinc-binding group of the compound were substituted with boron-based compounds, whereas the linker region was substituted with p-aminobenzoic acid. Molecular docking analysis yielded 8 ligands with ΔG binding values more negative than those of the standards, SAHA and trichostatin A (TSA). These ligands were then analyzed on the basis of QSAR, pharmacological properties, and ADME-Tox in order to obtain a potent inhibitor of HDAC class II Homo sapiens. The screening process gave one best ligand, Nova2 (513246-99-6), which was then studied further by molecular dynamics simulations.


Introduction
Cervical cancer is caused by human papillomavirus (HPV) and ranks second as a cause of cancer death in women worldwide [1]. Cervical cancer occurs in the cervical region, which is located in the hollow area between the vagina and the uterus, commonly called the cervix. Cervical cancer can be contagious among all women; 1 out of every 4 women is likely to suffer from it [2].
Based on data from the World Health Organization, there were an estimated 530,232 cases of cervical cancer worldwide in 2008, with 275,008 deaths [3]. From these data, the estimated global mortality rate from cervical cancer is about 50% [2].
HPV is a virus of the family Papillomaviridae with a nonenveloped, icosahedral capsid and double-stranded circular DNA as its genetic material [4][5][6]. Its genome is 7,800-7,900 base pairs long, and the virion has a diameter of 55 nm [7,8]. HPV has more than 100 different genotypes, and over 40 of them can infect any part of the epithelial and mucosal lining of the anogenital tissue [9]. HPV is divided into two classes, namely, low-risk HPV (e.g., HPV-6 and HPV-11) and high-risk HPV (e.g., HPV-16 and HPV-18) [10]. Low-risk HPV usually causes raised lesions such as anogenital condylomata (warts), which usually grow on the cervix and vulva [11].
The HPV genome is divided into 3 regions, namely, the upstream regulatory region (URR, noncoding), the early gene region, and the late gene region [12]. The E6 and E7 oncoproteins can make HPV-infected cells immortal [13]. The E6 protein associates with a ubiquitin protein ligase, which in turn interacts with p53, resulting in its degradation in the proteasome. E6 also increases the activity of telomerase and induces the creation of immortal cells [14]. The E7 protein interacts with the retinoblastoma protein (Rb) and releases the E2F transcription factor, which induces the expression of genes involved in cell proliferation [15]. The E7 oncoprotein can also interact directly with the interferon regulatory factor 1 (IRF-1) tumor suppressor protein, which inhibits the release of E2F and E7, thus increasing the transcriptional activity of cells containing the HPV genome [16]. E6 and E7 activities generally cause epigenetic changes that interfere with cell regulation, apoptosis, DNA repair, hormonal response, and cell differentiation, which leads to cervical cancer [17,18].
The HPV oncoproteins E6 and E7 in particular are correlated with the enzymatic activity of histone deacetylase (HDAC) [19]. HDAC serves as a mediator that binds during oncogene-driven gene transcription, with the aim of transforming cellular processes into a medium for viral proliferation [20].
HDAC is an enzyme with EC number 3.5.1 that catalyzes histone deacetylation [21]. In eukaryotic cells, it removes acetyl groups from lysine residues on the histone tail and tightens the wrapping of DNA around the histones, thus interfering with gene transcription by binding with transcription factors [22,23]. In general, two processes regulate gene expression and DNA replication through chromatin structure [23]. Acetylation of histone and nonhistone proteins is carried out by histone acetyltransferases (HATs), and deacetylation by histone deacetylases (HDACs) [24]. These two enzymes work in opposition: HATs cause the chromatin structure to relax into euchromatin [25], which provides space for the specific enzymes or other protein complexes involved in gene expression and thereby increases transcription and DNA repair activity [26]. HDAC, in contrast, removes the acetyl group from N-acetyl lysine on the histone tail, which causes the DNA to coil tightly around the histones into heterochromatin [27]. Hence, the transcription of DNA is obstructed and gene expression does not occur properly, which causes the transformation of normal cells into cancer cells [17,28]. HDAC inhibition can inhibit the epigenetically driven transcription of HPV genes, so that the cancer cells undergo apoptosis [29].
Suberoyl anilide hydroxamic acid (SAHA) has passed clinical trials and been approved by the U.S. Food and Drug Administration (FDA) as a cancer drug [30]. SAHA inhibits HDAC by interacting with its metalloenzyme site [31], at the base of which lies the Zn²⁺ ion [32]. The following is an explanation of each unit in the design of HDAC inhibitors.
(1) Zinc Binding Group (ZBG). It is the site where a ligand (inhibitor) interacts with the Zn²⁺ cofactor of HDAC, which forms a charge relay system with amino acid residues [33]. In general, groups that can interact with the Zn²⁺ cofactor are nucleophilic, for example, hydroxyl, carbonyl, thiol, carboxylic, and sulfonyl groups [34].
(2) Linker. It is the connection between the CAP and the ZBG and consists of a short-chain hydrocarbon, a long-chain hydrocarbon, or an aromatic group, such as butane, fatty acids, γ-aminobutyric acid (GABA), p-aminobenzoic acid (PABA), furans, and others [35]. The linker is able to interact with amino acid residues found in the cylindrical pocket of HDAC enzymes [36].
(3) Hydrophobic Cap (CAP). It is generally a hydrophobic group with high lipophilicity [37]. It readily interacts with the surface of the active site and closes the entry point for the enzyme substrate. In general, the hydrophobic cap is composed of phenyl, benzyl, furan, polycyclic groups, and so forth [37,38].
Boron compounds were selected for substitution at the ZBG and CAP because clinical trials have shown that consumption of boron may prevent cervical cancer caused by HPV [37,39]. A boron intake of 84.1 mg per day could prevent cervical cancer [40]. The functional groups of boron compounds that have been proven to date to have therapeutic effects are diazaborine, boronic acid, boronic ester, and benzoxaborole [41]. Carborane has been found to be a good inhibitor with high lipophilicity, which is useful for binding at the hydrophobic active site of the enzyme [42]. Carborane in the closed (closo) form can increase receptor affinity and activity toward enzymes with a hydrophobic ligand-binding cavity. Therefore, it can inhibit the enzyme activity that contributes to the disease [43].

Materials and Methods
This research method was developed based on the established pipeline of our group [44][45][46][47][48]. This is an in silico study carried out entirely with computational tools, using both online and offline software: MOE 2012.10, ACDLabs, ChemSketch, Toxtree, and VegaZZ. Multiple sequence alignment was performed on the HDAC class II sequences of Homo sapiens to obtain the conserved region. Then, homology modeling of the HDAC class II Homo sapiens enzymes was carried out with the SWISS-MODEL server. Ligand inhibitors for HDAC class II Homo sapiens were then produced. After both the ligands and the enzymes were ready, molecular docking simulations were performed. The docking results were then analyzed against the existing parameters, namely, pharmacological analysis, ADMET testing, and bioavailability. A further test examined the thermodynamic stability of the ligand in the presence of solvent by molecular dynamics simulations. The boron compounds were taken from the website of the organoborons database (http://www.organoborons.com/) (Figure 1). The sequences of HDAC class II Homo sapiens were retrieved from the protein sequence database of the National Center for Biotechnology Information (NCBI). HDAC class II consists of six enzymes, namely, HDAC4, HDAC5, HDAC6, HDAC7, HDAC9, and HDAC10. All isoforms of the protein sequences are recorded in the NCBI Reference Sequence (NCBI RefSeq), GenBank, and UniProt Knowledge Base (UniProtKB)/SWISS-PROT databases.

Results and Discussion
After multiple sequence alignment, the conserved region sequences were obtained. The sequence codes of the HDAC enzymes are listed in Table 1. The sequences were then submitted to the Basic Local Alignment Search Tool (BLAST), which can be accessed through the NCBI website (http://blast.ncbi.nlm.nih.gov/Blast.cgi). BLAST is useful for comparing the sequences derived from the conserved region against the existing protein database. The BLAST protein codes are also given in Table 1.
Furthermore, 3D structure modeling was conducted using the SWISS-MODEL server. The predicted 3D structures of the HDAC class II Homo sapiens enzymes can be seen in Figures 2(a) to 2(f).
The modeling data were also used to determine the active site of each HDAC class II Homo sapiens enzyme. The active site of each enzyme is listed in Table 2.
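As a rough illustration of how conserved regions can be read off a multiple sequence alignment, the sketch below scans pre-aligned sequences for runs of columns in which every sequence carries the same residue. The function name `conserved_regions`, the `min_len` cutoff, and the toy sequences are hypothetical; the source does not state which alignment tool or conservation criterion was used.

```python
def conserved_regions(aligned, min_len=3):
    """Return (start, end, segment) for runs of fully conserved columns.

    `aligned` is a list of equal-length, already aligned sequences in
    which '-' marks a gap. `min_len` is an illustrative cutoff only.
    """
    if not aligned:
        return []
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("aligned sequences must have equal length")

    regions = []
    start = None
    # Iterate one position past the end so a trailing run is closed too.
    for i in range(length + 1):
        conserved = (
            i < length
            and aligned[0][i] != "-"
            and all(seq[i] == aligned[0][i] for seq in aligned)
        )
        if conserved and start is None:
            start = i
        elif not conserved and start is not None:
            if i - start >= min_len:
                regions.append((start, i, aligned[0][start:i]))
            start = None
    return regions


# Toy aligned sequences (not real HDAC data).
toy = ["MKT-HHDGDA", "MKTAHHDGNA", "MRT-HHDGDA"]
print(conserved_regions(toy))
```

A stricter or looser criterion (for example, allowing conservative substitutions) would change which segments are reported; the all-identical rule is used here only because it is the simplest to state.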
Molecular docking simulation was then conducted and produced the 8 best ligands. These ligands have ΔG binding values lower than those of the standards, SAHA and trichostatin A (TSA); they are listed in Table 3. A low ΔG binding value favors spontaneous binding of the ligand to the enzyme to form a stable complex. The inhibition constants of complex formation, derived from the ΔG binding values, are given in Table 4. The lower the ΔG binding value, the greater the pKi value. The docking results also included visualization of the interaction between the ligand and the target enzyme. The ligands interact with amino acid residues of the enzyme, including those of its active site. Table 5 presents the interactions between the ligands and the enzymes.
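The link between binding free energy and inhibition constant can be sketched as follows, assuming the standard thermodynamic relation ΔG = RT ln Ki. The source does not state the temperature or the exact formula used by the docking software, so the default of 300 K and the function names below are assumptions for illustration only.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def inhibition_constant(delta_g_kcal, temperature_k=300.0):
    """Ki (in mol/L) from a binding free energy in kcal/mol.

    Assumes delta_G = R * T * ln(Ki); the temperature is an assumed default.
    """
    return math.exp(delta_g_kcal / (R_KCAL * temperature_k))


def p_ki(delta_g_kcal, temperature_k=300.0):
    """pKi = -log10(Ki); grows as delta_G becomes more negative."""
    return -math.log10(inhibition_constant(delta_g_kcal, temperature_k))


# Hypothetical free energies, not values from the tables.
for dg in (-6.0, -8.0, -10.0):
    print(dg, inhibition_constant(dg), p_ki(dg))
```

Under this relation a more negative binding free energy always corresponds to a smaller inhibition constant and therefore a larger pKi, which is the trend described for the docked ligands.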
Furthermore, the modified ligands were forwarded for pharmacological analysis. The analysis applied Lipinski's rule of five and Egan's and Veber's rules to determine the best drug candidates in terms of stability and oral bioavailability. According to these rules, a drug should have a molecular weight of less than 500 Dalton (Da), a log P value of less than 5, fewer than 5 hydrogen bond donors, fewer than 10 hydrogen bond acceptors, a polar surface area of less than 140 Å², and fewer than 10 rotatable bonds [49,50]. The pharmacological analysis was conducted using the FAF-Drugs2 online software. Table 6 shows the results of the pharmacological analysis of each ligand. As seen in Table 7, the best 8 ligands have good oral bioavailability according to the parameters of the existing rules. Furthermore, the health impact of the ligands was analyzed by examining their absorption, distribution, metabolism, excretion, and toxicology (ADMET) properties. The analysis was carried out using the Toxtree 2.6.0 software and ACD/I-Lab with the Benigni-Bossa rule as the parameter. The method involves identifying ligands that contain fragments known to cause mutagenic and carcinogenic effects. Examples of compounds that could cause these effects are acyl halides, benzyl, esters, epoxides, aliphatic halogens, alkyl nitrites, quinones, hydrazine, polycyclic aromatic hydrocarbons, thiocarbamate, aromatic amines, hydroxylamine, and so forth. The Toxtree assessment is based on data obtained from tests on the organism Salmonella typhimurium, commonly referred to as the Ames test. Table 8 shows the results for the ligands analyzed with the Toxtree software. The best 8 ligands did not show mutagenic or carcinogenic effects. Furthermore, a probability analysis of the ligands' side effects on human health was conducted.
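The drug-likeness thresholds quoted in the pharmacological analysis above can be expressed as a simple filter. This is a minimal sketch that applies the limits exactly as worded in the text (strict "less than" comparisons); it is not the FAF-Drugs2 implementation, and the `Descriptors` class, the `rule_violations` function, and the example values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Descriptors:
    """Precomputed molecular descriptors for one ligand (illustrative)."""
    mol_weight: float          # Da
    log_p: float
    h_donors: int
    h_acceptors: int
    polar_surface_area: float  # square angstroms
    rotatable_bonds: int


def rule_violations(d):
    """Return the names of descriptors that break the stated thresholds."""
    checks = {
        "mol_weight": d.mol_weight < 500,
        "log_p": d.log_p < 5,
        "h_donors": d.h_donors < 5,
        "h_acceptors": d.h_acceptors < 10,
        "polar_surface_area": d.polar_surface_area < 140,
        "rotatable_bonds": d.rotatable_bonds < 10,
    }
    return [name for name, passed in checks.items() if not passed]


# Made-up descriptor values, not data from the tables.
candidate = Descriptors(350.0, 2.1, 3, 6, 95.0, 7)
print(rule_violations(candidate))
```

Returning the list of violated descriptors rather than a single pass/fail flag makes it easier to see which property a rejected ligand fails on.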
The computational analysis was carried out using the ACD/I-Lab software, and the results for the safest ligand candidates are shown in bold in Table 9. Table 9 shows that all of the modified ligands had adverse effects on the gastrointestinal tract, but this was not considered a problem because drug delivery technology could be used to deliver the drug to its target receptor. After passing these tests, Nova2 (513246-99-6) was determined to be the best ligand, because its ΔG binding value is lower than that of the standards and is almost the same across all of the HDAC class II enzymes. The best ligand was tested using molecular dynamics simulations to examine its stability under changes of solvent and temperature. The process was divided into three stages: initialization at a temperature of 300 K, equilibration with heating to 310 K, and production. The simulated stages correspond to the drug meeting the solvent, the temperature changing, and the drug being distributed and reaching its intended target. The molecular dynamics result for the best drug candidate, Nova2 (513246-99-6), against the target enzymes is shown in Figure 3 as a curve of RMSD versus time (ps).
The graph shows that the Nova2 ligand (513246-99-6) was stable at 5000 ps. However, the curve for HDAC6 rises because of the shallowness of its binding pocket; the ligand was somewhat less stable there because of the influence of the solvent outside the surface of the enzyme.
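For readers unfamiliar with the RMSD-versus-time curve, the sketch below shows how such a series can be computed from trajectory frames. It assumes that each frame is a list of (x, y, z) atom coordinates in the same atom order, that the first frame is the reference, and that no structural superposition is applied; the source does not describe how the simulation software computed RMSD, and the function names are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally sized coordinate sets."""
    if not coords_a or len(coords_a) != len(coords_b):
        raise ValueError("coordinate sets must be non-empty and equal in size")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


def rmsd_series(trajectory):
    """RMSD of every frame against the first frame (no superposition)."""
    reference = trajectory[0]
    return [rmsd(reference, frame) for frame in trajectory]


# Two toy frames of a two-atom system, not simulation output.
frames = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0)],
]
print(rmsd_series(frames))
```

A curve that levels off indicates that the complex has settled into a stable conformation, whereas a rising curve, as described for HDAC6, indicates continued drift away from the starting structure.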

Discussion.
Owing to their versatility, organoboron compounds have proved useful in the field of chemical science [51]. The synthesis of organoboron compounds has been shown to be feasible [52]. The flexibility of organoboron compounds as electrophiles and nucleophiles adds a tunable property to drug design [51]. The efficacy of organoboron compounds has been demonstrated in in vitro fungicide and bactericide experiments [53]. More specifically, closo-carborane compounds have become a logical choice for drug design because of their biological activity [43]. The hydrophobic property of closo-carborane makes chemical modification of various compounds feasible as a means of observing their pharmacochemical properties [54]. At present, Velcade© is the only boron-based therapeutic on the market; it is used to treat multiple myeloma, and several more are undergoing clinical trials, including Talabostat© as a lung cancer drug candidate [41,55]. It is therefore expected that there will be more boron-based drugs on the market. Computational docking of simple boronic acid based compounds has already been carried out and has paved the way for serious organoboron-based rational drug design [56]. The starting point for the development of our pipeline was the optimization of the docking method, which was improved later on [57]. Our experience in HPV drug design was based on the design of organic compounds as drug candidates [46,58]. However, as carbon- and boron-based compounds share some physicochemical properties, the use of boron as a carbon substitute in drug candidates becomes more feasible [59]. Now, closo-carborane and boronic acid based lead compounds have been successfully designed with our established methodology. The low toxicity of boronic acid and its degradation into the safer boric acid make it an environmentally friendly compound [60].
It has also been observed that closo-carborane could improve the hydrophobic interaction with the enzyme [43]. Owing to the robustness of organoboron compounds, the existing pipeline could be applied as is, without any major modification, to organometalloid compounds. Moreover, the designed organoboron compounds remain below the molecular weight threshold of Lipinski's rule, so they should be simpler to develop [59]. Thus, Nova2 (513246-99-6), a compound combining SAHA and (4-piperazin-1-ylphenyl)boronic acid, was chosen as the most feasible drug candidate [61]. The slightly acidic properties of the boronic acid functional group, combined with the electronegativity of its nitrogen atom, may contribute to its inhibitory activity at the Zn²⁺ metalloenzyme pocket [62]. However, the acidity of Nova2 (513246-99-6) should be taken into account for oral delivery, as it could pose a hazard for patients with heartburn. To this end, prodrug construction should be considered in the synthesis strategy [63]. Wet laboratory experiments on the interaction of proteins and organoboron compounds have already been shown to be feasible [64]. Moreover, organoboron compounds are just starting to be applied as radiotherapy agents [65]. Synthesis pathways for both boronic acid and closo-carborane derivatives have already been applied by some research groups [66,67]. Thus, in order to reduce the complexity of the synthesis, a prediction method will be used to evaluate synthetic accessibility [68][69][70]. By applying the information from the in silico results, the laboratory synthesis and bioassay experimentation for organoboron compounds are expected to be straightforward.