Science and Technology Development Journal

An official journal of Viet Nam National University Ho Chi Minh City, Viet Nam since 1997

Skip to main content Skip to main navigation menu Skip to site footer

 Special Issue

HTML

1294

Total

347

Share

Design and screening of HepG2 cancer cell line inhibitors from Triterpenoid derivatives of Paramignya Trimera






 Open Access

Downloads

Download data is not yet available.

Abstract

Currently, Artificial intelligence (AI) is a ubiquitous technology that provides effective support across all fields. The pharmaceutical industry in general and drug production and development, in particular, are enjoying a very good application for the opportunity when in silico models have emerged as powerful platforms for designing new drugs. The aim of this project is to develop new anti-cancer agents by designing novel Triterpenoid derivatives from Paramignya Trimera and predicting their efficacy against the Bcl-2 target receptor. The project used three main in silico models: QSARMLR, QSARPCR and QSARANN. The models can be used to estimate IC50 values for novel derivatives and Escin extracted from Paramignya Trimera. Finally, the new good-value derivatives were docked to the Bcl-2 receptor to assess responsiveness. As a result, newly designed 196 compounds from the structural framework of Triterpenoid compounds were designed by combined with potential substituents. From there, screening by the rule of Veber identified 138 substances that met the requirement of having the ability to make drugs. Successfully, built QSARMLR, QSARPCR, QSARANN models with results of statistical values: R2 = 0.849, R2adj = 0.826, Q2LOO = 0.789 for the QSARMLR model; QSARPCR model with R2 = 0.860, R2adj = 0.831, Q2LOO = 0.805, and the QSARANN model with the best results: R2train = 0.941, R2test = 0.915, R2cv = 0.912. The use of models can help predict the effectiveness of newly engineered compounds. In this study, 20 compounds were found to be more efficient than Escin. Molecular docking on the Bcl-2 receptor found T.new7 gave the most potential results with the binding energy E_binding = -7.933 (kcal.mol-1), RMSD = 1.915 (Ă…). The research has achieved its goal by finding T.new7, a newly designed compound with better anti-cancer ability than natural Escin.

INTRODUCTION

According to GLOBOCAN, liver cancer is one of the 5 deadliest cancers, with a high number of new cases and deaths each year in 2020. Figure 1 shows that liver cancer has the third highest number of deaths in the world and the highest number of deaths in Vietnam 1 . Worldwide, liver cancer is the third most common cause of cancer death, accounting for 8.3%, following lung and colorectal cancer. In Vietnam, liver cancer is the leading cause of death, accounting for 20.6% of all deaths. These data indicate that with the current situation of deaths from liver cancer, project implementation is extremely necessary. Liver cancer not only causes hundreds of thousands of deaths annually but also imposes a significant socioeconomic burden. Therefore, the search for liver cancer derivatives is extremely urgent for patients and for humanity in general.

Figure 1 . Estimated number of deaths by cancer in 2020; World & Vietnam; both sexes; all ages. Source: GLOBOCAN

Liver cancer is a type of cancer that starts in liver cells. The liver is a football-sized organ located in the upper right quadrant of the belly, beneath the diaphragm and above the stomach. The liver can develop several types of cancer. Hepatocellular carcinoma (HCC) is the most common type of liver cancer and is the main type of liver cell (hepatocyte). Cancer that spreads to the liver is more common than cancer that spreads to liver cells. Cancer that develops in another part of the body, such as the colon, lung, or breast, and spreads to the liver is referred to as metastatic cancer rather than liver cancer. This type of cancer is termed by the organ in which it began; for example, metastatic colon cancer describes cancer that begins in the colon and travels to the liver.

Triterpenes are a class of terpenes made up of six isoprene units with the chemical formula C 30 H 48 . These compounds can alternatively be thought of as three terpene units. Triterpenes are produced by animals, plants, and fungi and include squalene, the precursor to all steroids. Triterpenes have a wide range of structures. Almost 200 distinct skeletons have been identified. These skeletons can be roughly classified based on the number of rings present. Pentacyclic structures (5 rings) predominate in general. One of the uses of Triterpenoids in the human body is to help prevent and treat cancer as well as to combat cancer metastasis. According to a 2011 study by Watchtel-Galor and colleagues, the use of triterpenoids in genital mushrooms has anticancer effects in vivo according to animal studies (mouse studies). In addition, the study indicated that the ingredients also contain active substances that help prevent cancer cells from growing in vitro (in the test tube). Thus, Triterpenoids help inhibit many types of cancer cells, such as lung cancer, breast cancer, and skin cancer cells. In addition, cancer metastasis is quite complicated. Cancer cells separate from the primary tumor and begin to move to other parts of the body. From there, small tumors—secondary tumors—form 2 .

Triterpenoids are of interest because of their anti-inflammatory and analgesic properties, especially in anticancer cell lines, including HepG2 cells. Artificial intelligence facilitates the creation of virtual screening models for derivative compounds. This prospective study was designed to explore a synthetic compound with superior cancer-fighting properties compared to the natural substance found in Paramignya Trimera . The triterpenoid of the oleanolic acid (OA) subgroup, called escin ( Figure 3 ), is extracted from Paramignya trimera ( Figure 2 ) of the family Rutaceae . OA has various benefits, including anti-inflammatory, antiviral, and hypoglycemic effects, and has potential for use against cancer cells. A large number of Triterpenoids are active against various human cancer cell lines, such as HepG2, SMMC-772 (hepatocellular carcinoma), HL-60 (leukemia), A549 (hepatocellular carcinoma), MCF-7 (breast cancer), and SW-480 (colon carcinoma) 3 .

Figure 2 . (a) Image of the Paramignya Trimera 4 ; (b) Escin structure from the Paramignya Trimera 5

Furthermore, oleanolic acid (OA) affects cancer cells via many routes. Increasing Bcl-2 receptor inhibition is a strategy that promotes the proliferation of OA-treated HepG2 cancer cells. As a result, the Bcl-2 receptor was chosen as the target of interest. This research used in silico algorithms to predict novel synthetic chemicals. Chemicals that are more effective at inhibiting HepG2 cells are being found. The topic is focused on developing three models: QSAR MLR , QSAR PCR , and QSAR ANN . Using virtual screening procedures saves time, money, and human resources. This research is likely to yield a result that speeds up compound screening in research and new medicine manufacture compared to experimental trials.

METHODOLOGY

Data mining from experiments

The data collected from the experiments are divided into 2 datasets: the training subset and the external evaluation subset. The two subsets are completely independent datasets. The condition is that the compound has a Triterpenoid framework and was tested on HepG2 cell carcinoma cells with an IC 50 value.

Design of new compounds

Two R1 and R2 sites ( Figure 3 ) in the structural frame were selected for the attachment of substituents via the maximum design method. The binding group, which includes 14 cells labeled T1 to T14, has been shown to have anticancer activity. Therefore, 196 novel compounds were synthesized using the maximum design technique. Multilevel design: This method is used in drug design and helps generate a list of design compounds based on various taxonomic factors and material quantities 6 . The maximum design limits the possibility of missing significant compounds, thereby providing a complete dataset of possible designs when combining taxonomic elements (2 positions selected on the frame Escin structure) and corresponding materials (14 functional groups T1-14 were selected).

Figure 3 . Structural framework for designing new compounds

Figure 4 . The substitution groups used in the design of new molecules 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21

Figure 4 
<a class=21" width="300" height="200">

[Download figure]

Optimization of the structures

New derivatives were created using the ChemDraw program. Using molecular and quantum mechanics, all novel and experimental derivatives have been structurally optimized. These two types of software used include HyperChem with an MM+ force field and a gradient level of 0.05 and MOPAC with the semiempirical PM7 approach. This approach helps molecules determine the most stable structure and acquire descriptive variables, including partial charge, HOMO, LUMO, MW, DH, and so on.

Calculation of molecular descriptors

After the structural optimization procedure, MOE software was used to determine the molecular descriptor for all the datasets. The selection and calculation of all descriptors were investigated from 0 to 3D. When the results are made public. The variable screening procedure was used to exclude variables that were not important. When combined with the descriptor variables produced from the structural optimization process, a dataset of descriptive characteristics for each molecule is obtained, allowing QSAR models to be built.

Estimation of QSAR models

This study focused on the development of three QSAR models: multivariate linear regression (MLR), principal component regression (PCR), and artificial neural network (ANN) models.

QSARMLR model

The QSAR MLR model predicts the dependent variable Y based on the values of two or more independent variables X. The model is represented as follows:

Y = 0 + 1 × X 1 + 2 × X 2 + … + k × X k + e (1)

Here, Y is the dependent variable, and β 0 , β 1 , β 2 … β k are the regression parameters of the model. X k is the independent variable (k is the number of variables), and e is the random error. In this study, the dependent variable was the IC 50 value. The independent variable is the molecular descriptor 22 . Regression 2008 23 software was used to construct the QSAR MLR model.

QSARPCR model

The set {X,Y}, where X is a data group with m observations and n variables and Y is the dependent variable. The information is gathered but not previously processed. Although outcome Y has no direct association with X, it does have a relationship with the principal components, which is a property of principal component regression 22 . To create the QSAR PCR model, the XLSTAT 2016 24 program was used.

QSARANN model

Artificial neural networks (ANNs) perform the same learning process as the human brain 22 . The structure of an artificial neural network I( m )-HL( n )-O( k ) includes the following: the input layer I( m ) is the descriptive variable of the QSAR MLR model, the output layer O( k ) is the IC 50 value, and the hidden layer HL( n ) is investigated to determine the best QSAR ANN model 25 . The QSAR ANN model was trained on the MATLAB 2016 26 tool.

Drug-likeness

The rule of action for Lipinski-5, the earliest and most well-known rule for identifying substances with good oral absorption, was proposed in 1997 27 . Since then, several analogous rules based on molecular characteristics, such as those given by Ghose 28 and Veber 29 , have been established. According to Veber's rule, substances in this study must meet the following two criteria: rotatable bonds (nRB) ≤ 10 and polar surface area (tPSA) ≤ 140 Å 2 . Therefore, screening according to the same rules is aimed at finding compounds that have the potential to become more effective oral drugs according to Veber's rule under two conditions: rotatable bonds (nRB) ≤ 10 and polar surface area (tPSA) ≤ 140 Å 2 . Reduced molecular flexibility, as measured by the number of rotatable bonds and low polar surface area or total hydrogen bond count (sum of donors and acceptors), are important predictors of good oral bioavailability. A reduced polar surface area correlates better with an increased permeation rate than does lipophilicity (C log P), and an increased rotatable bond count has a negative effect on the permeation rate 29 .

Bioactivity prediction

Medicinal characteristics were determined by using three QSAR models to predict the bioactivity at the IC 50 for new synthetic compounds, and esin is a natural triterpenoid derived from Paramignya Trimera . Then, using Escin as a reference, we looked for derivatives with greater biological activity than Escin. Currently, research predicts and discovers derivatives with higher bioavailability than natural chemicals and the potential to become medications.

Molecular Docking

The main goal of molecular docking is to understand and predict molecular recognition both in terms of structure-finding bonds and energy-predicting affinity. Currently, the application of molecular docking methods is very diverse and includes structure-activity studies, optimization, and potential molecule searches via virtual screening 30 . In this study, we used the MOE2019 package to perform the molecular docking process.

Escin, a Triterpenoid derived from Paramignya Trimera , belongs to the oleanolic acid group. This process affects cancer cells in a variety of ways, including inducing cyclic death, controlling the cell cycle, and killing cancer cells. Bcl-2 normally prevents cell cycle death (apoptosis). The target of action in this investigation was chosen to be Bcl-2, which inhibits Bcl-2 receptors, hence boosting cancer cell cyclic death 31 . The Bcl-2 receptor, encoded 4D2M, was obtained from the Protein Data Bank (PDB) 32 .

RESULTS

The training and test datasets

Seventy-four chemicals were gathered from articles published in reputable journals and PubMed. The data were utilized to develop models and assess the external inhibitory concentration (IC 50 ) . The training set of 60 compounds was used to develop QSAR models, and the external validation set of 14 compounds was utilized to assess the predictive power of the biological activity of the model.

Figure 5 . Summary of the 60 experimental IC50 values in the training datasets 33 , 34 , 35 , 36 , 37 , 38 , 39 , 40 , 41 , 42 , 43 , 44 , 45 , 46 , 47

Figure 5 
<a class=47" width="300" height="200">

[Download figure]

Design of the new compound

Using the multilayer design method and ChemDraw tool, a total of 196 novel molecules were obtained. All of these derivatives were optimized using the proper molecular mechanics sequence, followed by PM7 quantum mechanics. They are then calculated descriptors in the following phase.

Optimization of the structure and calculation of descriptors

All the compounds, both experimental and newly created, were subjected to structural optimization and molecular descriptor computations. The findings generated 310 descriptive attributes for each molecule utilized to construct the QSAR model.

Construction of the QSAR-MLR models

Table 1 Results of building the QSAR MLR model

The model yields the following equation: R 2 = 0.849, R 2 adj = 0.826, and Q 2 LOO = 0.789:

IC 50 (µM) = 3.739 + 2.247× x 1 + 85.94× x 2 + 24.36× x 3 (3.13)+ 0.156×x 4 + 5.24× x 5 – 0.03695× x 6 + 0.933× x 7 - 0.131× x 8 (2)

Construction of the QSARPCR model

The QSAR PCR model was built based on the variables of the QSAR MLR model and yielded the following results: R 2 = 0.860, R 2 adj = 0.831, and Q 2 LOO = 0.805. The equations are represented as follows:

IC 50 (µM) = 8.406 + 2.48×x 1 + 73.319×x 2 + 25.836×x 3 + 0.135×x 4 + 3.965×x 5 - 0.037×x 6 + 1.250×x 7 - 0.130×x 8 (3)

Construction of the QSARANN Models

The QSAR ANN model was built using the QSAR-MLR model descriptors from equation (2). The training of models uses a back-propagation algorithm with transfer functions such as Logsig, Tansig and Purelin. Therefore, the architecture of the ANN models in this scenario is I(8)-HL( m )-O(1). QSAR ANN models were developed in two stages. First, using the training dataset, multiple designs of MLP networks with different m values were identified, and the results are shown in Table 2 .

Table 2 The results of initial screening for the ANN architecture

Second, using the same external evaluation dataset (Table 5) of the QSAR MLR and QSAR PCR models, the best network was chosen based on the MARE (%) and Q 2 EV values. As a result, the best model was I(6)-HL(6)-O(1) ( Figure 5 a), with the statistical parameters Q 2 EV-ANN = 0.866 and MARE = 62.1%, as shown in Figure 5 and Figure 12 , respectively, with the Purelin transfer function.

Figure 6 . The architecture of the QSPR ANN I(8)-HL(6)-O(1) model

The external validation

External evaluation is considered a test to validate the predictive ability of the built models. From there, the model that gives the closest prediction results to the experimental results is selected. The external evaluation dataset includes 14 compounds obtained from experiments and is an independent set from the set used to construct the QSAR model. Detailed information on these derivatives is presented in Figure 7 .

Figure 7 . The experimental values of IC 50,exp in the external evaluation dataset

The predicted values of the QSAR models for 14 substances in the external evaluation dataset are presented in Table 3 .

Table 3 The predicted IC 50,pred values of the three models in the external evaluation set

Drug-likeness

After screening the drug likeness of all 196 compounds using Veber's criteria, we found 138 compounds that met these criteria.

Bioactivity prediction

The study predicts the IC 50 for 138 newly designed substances and Escin. Then, 20 compounds with IC50s less than Escin were obtained, ordered from small to large based on QSAR ANN for the best predictability. The structures of the 20 potential substances are presented in Figure 8 .

Figure 8 . The structures of the 20 new compounds

The detailed prediction results of 20 compounds from each model are presented in Table 4 .

Table 4 The predicted IC 50,pred values of new derivatives from three QSAR models

Molecular Docking

To test the inhibitory ability of the peptides on HepG2 cancer cells, 20 drugs were docked with the appropriate IC50 values for the Bcl-2 receptor. This docking process helps evaluate the binding ability of the compound to the Bcl-2 target receptor by simulating the 3D structure of both the receptor and the compound. Substances that are considered well bound have an RMSD < 2.0 Ă… and an E_binding < -7.0 kcal.mol -1 . The results for the 6 compounds with good binding energies and RMSD values are presented in Figure 5 .

Figure 9 . Docking results of six compounds with the 3U6J–Bcl-2 system

DISCUSSION

QSAR models

Table 1 and Figure 10 show that the R 2 , R 2 adj , and Q 2 LOO values are proportional to the number of variables. This shows that when the number of variables increases, the model improves. This change was accompanied by a significant change from 7 to 8 variables, although before each increase, the variables did not change much. Therefore, 8 variables are needed, indicating that this is the most promising QSAR MLR model. The model consists of 8 variables as follows: LUMO, PEOE_RPC- 21 C vsurf_DD13 vsurf_CW4 SlogP_VSA3 vsurf_EWmin1 and SlogP_VSA4. LUMO is the lowest unoccupied molecular orbital; PEOE_RPC- is a relatively negative partial charge; 21C is the partial charge at position 21; vsurf_DD13 is the vsurf_EDmin1 – vsurf_EDmin3 distance; vsurf_CW4 is the capacity factor at -2.0; logP_VSA3 is the bin 3 SlogP (0.00, 0.10]; vsurf_EWmin1 is the lowest hydrophilic energy; and SlogP_VSA4 is the bin 4 SlogP (0.10, 0.15].

Figure 10 . The variation in the SE, R 2 train , and Q 2 LOO values in response to the k value

The QSAR MLR model results for R 2 = 0.849 > 0.6(47) showed that the model encoded 84.9% of the biological activity variables in the dataset. An R 2 adj = 0.826 > 0.5 represents an encoding of 82.6% of the active value variable in the data, and Q 2 LOO = 0.789 > 0.5(47). As a result of these findings, the model produced relatively good prediction outcomes.

Moreover, the QSAR PCR model results for R 2 = 0.860 > 0.6(47) demonstrated that the model encoded 86% of the biological activity variables in the dataset. An R 2 adj = 0.831 > 0.5 represents an encoding of 83.1% of the variable to the active value in the data. Q 2 LOO = 0.846 > 0.5(47). Based on these findings, the model produces accurate predictions.

The architecture of ANN I(8)-HL(6)-O(1) using the Purelin transfer function for R 2 train = 0.941, R 2 test = 0.916 and Q 2 cv = 0.912 shows that the model has good predictability with high correlation values. The results with an external evaluation set of 0.866 show that the predictive ability of this model is closest to reality.

Based on the above reasons, the QSAR models were chosen to develop the new design and Escin.

The contributions of the variables in the model were also investigated, and the results are presented in Figure 11 . All the descriptors contributed significantly to various degrees the most significant contributor was PEOE_RPC-, and the least significant contributor was vsurf_EWmin1, with contributions of 42.3% and 1.6%, respectively. The remaining variables also contributed to the QSAR MLR model in the following order: PEOE_RPC- > vsurf_CW4 > SlogP_VSA3 > LUMO > 21C > vsurf_DD13 > SlogP_VSA4 > vsurf_EWmin1.

Figure 11 . The contributions of the variables to the QSAR MLR model

The external validation

As mentioned above, external evaluation is used to construct the MLR and PCR models. In addition, the best ANN model was identified from the initial survey models, as shown in Table 2 . The two values used as a basis for evaluation are Q 2 EX (>0.5) and MARE (%). The results are fully presented in Figure 12 . The results show that the linear regression models meet the requirements, and the 6 th neural network model (ANN6) is selected for the project because the Q 2 EX value is 0.866, which is the highest, while the MARE (%) value is comparable to that of the other models.

Figure 12 . The MARE (%) and Q 2 EX values of the QSAR models

As depicted in Figure 13 , the Q 2 EX values of the QSAR MLR model for the relationships between the IC 50,pred and experimental IC 50,exp values are shown, for a value of 0.840. Similar to the QSAR PCR and the external evaluation set, the result is Q 2 EX = 0.846, and the QSAR ANN6 gives a result of 0.866. The conclusion that the above three models all give good correlation index results for the external evaluation set shows that the evaluation ability is reliable and can be used to predict a wide range of design substances.

Furthermore, one-way ANOVA showed that the differences between the experimental and predicted values from the three models, QSAR MLR , QSAR PCR , and QSAR ANN , were not significant when the results were F = 0.0269 < F 0.05 = 3.2381. Therefore, the predictive ability of the three models is appropriate.

Figure 13 . Correlations of experimental and predicted values on the external dataset of QSAR models

Bioactivity prediction

Under the same calculation conditions, the same models predicting the results obtained above for the 20 substances had better IC 50 values than did those of Escin. The present study used Escin as a base to select compounds with better biological activity to prove that this potential new substance has superior cancer cell inhibitory ability compared to natural active substances.

Prediction results of new molecules and predicted Escin values from three QSAR models, QSAR MLR , QSAR PCR , and QSAR ANN6 . There was no significant difference in the analysis of variance (F = 0.71595 < F 0.05 = 3.07606). Therefore, the predictive ability of the three models is consistent and reliable.

Molecular Docking

The full interaction results of the six new compounds on the Bcl-2 receptor are presented in Table 5 . Among the six compounds that gave good results

T.new1 binds to the Bcl-2 receptor via a hydrogen acceptor bond to ARG74 (distance = 2.97 Ă…, energy = -1.7 kcalmol -1 ), E_binding = -7.158 (kcalmol -1 ) and RMSD = 1.539 (Ă…).

T.new4 binds to the Bcl-2 receptor via a pi-cation bond to ARG154 (distance = 3.58 Ă…, energy = -1.6 kcalmol -1 ), E_binding = -7.817 (kcalmol -1 ) and RMSD = 1.696 (Ă…).

T.new7 binds to the Bcl-2 receptor via a hydrogen donor bond to CYS174 (distance = 3.37 Ă…, energy = -1.2 kcalmol -1 ), E_binding = -7.933 (kcalmol -1 ) and RMSD = 1.915 (Ă…).

T.new11 binds to the Bcl-2 receptor via a pi-cation bond to TYR79 (distance = 3.78 Ă…, energy = -0.9 kcalmol -1 ), E_binding = -7.166 (kcalmol -1 ) and RMSD = 1.388 (Ă…).

T.new12 binds to the Bcl-2 receptor via a hydrogen donor bond to CYS174 (distance = 3.23 Ă…, energy = -0.8 kcalmol -1 ), E_binding = -7.869 (kcalmol -1 ) and RMSD = 1.279 (Ă…).

T.new19 binds to the Bcl-2 receptor via a pi-cation bond to ILE146 (distance = 3.65 Ă…, energy = -0.9 kcalmol -1 ), E_binding = -7.367 (kcalmol -1 ) and RMSD = 1.846 (Ă…).

The full interaction results of the six new compounds on the Bcl-2 receptor are presented in Table 5 . Among the six compounds that gave good results, T.new7 had the best results: T.new7 binds to the Bcl-2 receptor by a hydrogen donor bond to CYS174 (distance = 3.37 Ă…, energy = -1.2 kcalmol -1 ), E_binding = -7.933 (kcalmol -1 ) and RMSD = 1.915 (Ă…).

Table 5 Detailed interaction results of new compounds on the Bcl-2 receptor

T.new7 had the best results: T.new7 binds to the Bcl-2 receptor by a hydrogen donor bond to CYS174 (distance = 3.37 Ă…, energy = -1.2 kcalmol -1 ) E_binding = -7.933 (kcalmol -1 ) and RMSD = 1.915 (Ă…).

On the BCL-2 receptor, the amino acids considered essential are ARG74, ARG154, CYS174, TYR79, CYS174, and ILE146 when sequentially linked by the 6 most potential compounds.

Since then, QSAR MLR , QSAR PCR , and QSAR ANN6 models have been successfully constructed to predict new engineered substances. Finally, the T.new7 compound was selected to inhibit HepG2 cancer cells.

CONCLUSION

This study applied the QSAR model to screen and develop new drugs, specifically triterpenoids, for use on HepG2 cancer cells. The final selected compound T.new7 showed better bioavailability than the naturally occurring substance and met the conditions for use as a drug according to Veber's rule. Biological activity prediction based on the screening process and statistical statistics is fair and reliable. This study provides the foundation for future T.new7 experimental studies. Based on these findings, T.new7 was created using the Escin structural framework, with R 1 as morpholine and R 2 as coumarin. T.new7 has an IC 50 (µM) of 0.938, 1.764, and 1.187 according to the 3 models QSARMLR, QSARPCR, and QSARANN, respectively. In this case, QSAR ANN produces the best prediction results, and all three values are lower than those of the natural parent chemical Escin. The results of molecular docking showed that E_binding = -7.933 kcal.mol -1 and RMSD = 1.915 Å. The T.new7 compound binds to the Bcl-2 receptor via an H-donor. Specifically, T.new7 gives amino acid CYS174 a hydrogen (distance d = 3.37 Å, energy = -1.2 kcal.mol -1 ). Therefore, T.new7 was selected as the best inhibitor of HepG2 cancer cells. This current study is limited by the fact that it involved only virtual screening. Despite its exploratory nature, this study offers T.new7 for further experiments. Consequently, further experimental studies are necessary to confirm the effectiveness of T.new7 against HepG2 cancer cells.

ABBREVIATIONS

ANOVA: Analysis of variance

ANN: Artificial neural network

Bcl-2: B-cell lymphoma 2

GLOBOCAN: Global Cancer Statistics

HepG2: Human liver cancer cell line

HOMO: Highest occupied molecular orbital

IC 50 : Half-maximal inhibitory concentration

LUMO: lowest unoccupied molecular orbital

MLR: Multiple linear regression

OA: Oleanolic Acid

PCR: Principal component regression

PDB: Protein Data Bank

PM7: Parameterized Model 7

QSAR: Quantitative structure-activity relationship

COMPETING INTERESTS

The authors declare that they have no competing interests.

AUTHOR CONTRIBUTIONS

All authors participated in study design, coordination, and manuscript drafting

FUNDING

None

References

  1. Cancer today [Internet]. [cited 2023 Jun 25]. . ;:. Google Scholar
  2. Trigos Ă, MedellĂ­n JS. Biologically active metabolites of the genus Ganoderma: Three decades of myco-chemistry research. 2011. . ;:. Google Scholar
  3. Chudzik M, Korzonek-Szlacheta I, KrĂłl W. Triterpenes as Potentially Cytotoxic Compounds. Molecules. 2015 Jan 19;20(1):1610-25. . ;:. PubMed Google Scholar
  4. Phi TCM, Chu HH, Trieu Le N, Nguyen DB. Phylogenetic relationship of Paramignya trimera and its relatives: an evidence for the wide sexual compatibility. Sci Rep. 2020 Dec 10;10(1):21662. . ;:. PubMed Google Scholar
  5. Nguyen VT, Sakoff JA, Scarlett CJ. Physicochemical Properties, Antioxidant and Antiâ€proliferative Capacities of Dried Leaf and Its Extract from Xao tam phan (Paramignya trimera). Chem Biodivers. 2017 Jun;14(6):e1600498. . ;:. PubMed Google Scholar
  6. Multilevel Categoric. Tutorials. 2021. . ;:. Google Scholar
  7. Jain S, Pattnaik S, Pathak K, Kumar S, Pathak D, Jain S, et al. Anticancer Potential of Thiazole Derivatives: A Retrospective Review. Mini-Rev Med Chem. 2018 Apr 16;18(8):640-55. . ;:. PubMed Google Scholar
  8. Urban M, Vlk M, Dzubak P, Hajduch M, Sarek J. Cytotoxic heterocyclic triterpenoids derived from betulin and betulinic acid. Bioorg Med Chem. 2012 Jun;20(11):3666-74. . ;:. PubMed Google Scholar
  9. Al-Naqeb G, Ismail M, Bagalkotkar G, Adamu HA. Vanillin rich fraction regulates LDLR and HMGCR gene expression in HepG2 cells. Food Res Int. 2010 Dec;43(10):2437-43. . ;:. Google Scholar
  10. Bennani FE, Doudach L, Karrouchi K, El Rhayam Y, Rudd CE, Ansar M, et al. Design and prediction of novel pyrazole derivatives as potential anticancer compounds based on 2D-QSAR study against PC-3, B16F10, K562, MDA-MB-231, A2780, ACHN and NUGC cancer cell lines. Heliyon. 2022 Aug;8(8):e10003. . ;:. PubMed Google Scholar
  11. Alam MM. 1,2,3â€Triazole hybrids as anticancer agents: A review. Arch Pharm (Weinheim). 2022 Jan;355(1):2100158. . ;:. PubMed Google Scholar
  12. Kang X, Hu J, Gao Z, Ju Y, Xu C. Synthesis, anti-proliferative and proapoptotic activity of novel oleanolic acid azaheterocyclic derivatives. MedChemComm. 2012;3(10):1245. . ;:. Google Scholar
  13. Li Y, Cao TT, Guo S, Zhong Q, Li CH, Li Y, et al. Discovery of Novel Allopurinol Derivatives with Anticancer Activity and Attenuated Xanthine Oxidase Inhibition. Molecules. 2016 Jun 20;21(6):771. . ;:. PubMed Google Scholar
  14. Vega-Granados K, Medina-O'Donnell M, Rivas F, Reyes-Zurita FJ, Martinez A, Alvarez De Cienfuegos L, et al. Synthesis and Biological Activity of Triterpene-Coumarin Conjugates. J Nat Prod. 2021 May 28;84(5):1587-97. . ;:. PubMed Google Scholar
  15. Al-Ghorbani M, Gouda MA, Baashen M, Alharbi O, Almalki FA, Ranganatha LV. Piperazine Heterocycles as Potential Anticancer Agents: A Review. Pharm Chem J. 2022 Apr;56(1):29-37. . ;:. Google Scholar
  16. Borková ML. Triterpenoids With Anticancer Properties and Their mechanism of Action. . ;:. Google Scholar
  17. Arshad F, Khan MF, Akhtar W, Alam MM, Nainwal LM, Kaushik SK, et al. Revealing quinquennial anticancer journey of morpholine: A SAR based review. Eur J Med Chem. 2019 Apr;167:324-56. . ;:. PubMed Google Scholar
  18. Casini A, Scozzafava A, Mastrolorenzo A, Supuran C. Sulfonamides and Sulfonylated Derivatives as Anticancer Agents. Curr Cancer Drug Targets. 2002 Mar 1;2(1):55-75. . ;:. PubMed Google Scholar
  19. Patel SS, Tripathi R, Chavda VK, Savjani JK. Anticancer Potential of Mefenamic Acid Derivatives with Platelet-Derived Growth Factor Inhibitory Property. Anticancer Agents Med Chem. 2020 Jul 24;20(8):998-1008. . ;:. PubMed Google Scholar
  20. Sharma P, LaRosa C, Antwi J, Govindarajan R, Werbovetz KA. Imidazoles as Potential Anticancer Agents: An Update on Recent Studies. Molecules. 2021 Jul 11;26(14):4213. . ;:. PubMed Google Scholar
  21. Dadashpour S, Emami S. Indole in the target-based design of anticancer agents: A versatile scaffold with diverse mechanisms. Eur J Med Chem. 2018 Apr;150:9-29. . ;:. PubMed Google Scholar
  22. Roy K, Kar S, Das RN. QSAR/QSPR Modeling: Introduction. In: A Primer on QSAR/QSPR Modeling [Internet]. Cham: Springer International Publishing; 2015 [cited 2023 Sep 21]. p. 1-36. (SpringerBriefs in Molecular Science). . ;:. Google Scholar
  23. Steppan D.D . Werner J,, Yeater P.R. Essential Regression and Experimental Design for Chemists and Engineers, Free Software Package, 1998. 1998. . ;:. Google Scholar
  24. XLSTAT. version 2016.02.28451. 2016; Available from: Addinsoft, USA. . ;:. Google Scholar
  25. Rojas R. A Systematic Introduction. Neural Networks. 1996. . ;:. Google Scholar
  26. MATLAB. R2016a 9.0.0.341360. [Internet]. 2016. Available from: MathWorks, USA. . ;:. Google Scholar
  27. Lipinski C, Lombardo F, Dominy BW, Feeney PJ. Experimental and omputational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev. 1997 Jan 1;3:23-5. . ;:. Google Scholar
  28. Ghose AK, Viswanadhan VN, Wendoloski JJ. A Knowledge-Based Approach in Designing Combinatorial or Medicinal Chemistry Libraries for Drug Discovery. 1. A Qualitative and Quantitative Characterization of Known Drug Databases. J Comb Chem. 1999 Jan 12;1(1):55-68. . ;:. PubMed Google Scholar
  29. Veber DF, Johnson SR, Cheng HY, Smith BR, Ward KW, Kopple KD. Molecular Properties That Influence the Oral Bioavailability of Drug Candidates. J Med Chem. 2002 Jun 1;45(12):2615-23. . ;:. PubMed Google Scholar
  30. Stanzione F, Giangreco I, Cole JC. Use of molecular docking computational tools in drug discovery. In: Progress in Medicinal Chemistry [Internet]. Elsevier; 2021 [cited 2023 Sep 21]. p. 273-343. . ;:. PubMed Google Scholar
  31. Gu Z, Lin S, Yan W, Chen D, Zeng Z, Chen L, et al. Enhanced Water Solubility and Anti-Tumor Activity of Oleanolic Acid through Chemical Structure Modification. Int J Mol Sci. 2022 Oct 31;23(21):13291. . ;:. PubMed Google Scholar
  32. Bank RPD. RCSB PDB. . ;:. Google Scholar
  33. Liang D, Hao ZY, Zhang GJ, Zhang QJ, Chen RY, Yu DQ. Cytotoxic Triterpenoid Saponins from Lysimachia clethroides. J Nat Prod. 2011 Oct 28;74(10):2128-36. . ;:. PubMed Google Scholar
  34. Li DQ, Wu J, Liu LY, Wu YY, Li LZ, Huang XX, et al. Cytotoxic triterpenoid glycosides (saikosaponins) from the roots of Bupleurum chinense. Bioorg Med Chem Lett. 2015 Sep;25(18):3887-92. . ;:. PubMed Google Scholar
  35. Wang O, Liu S, Zou J, Lu L, Chen L, Qiu S, et al. Anticancer Activity of 2α, 3α, 19β, 23β-Tetrahydroxyurs-12-en-28-oic Acid (THA), a Novel Triterpenoid Isolated from Sinojackia sarcocarpa. Bauer JA, editor. PLoS ONE. 2011 Jun 10;6(6):e21130. . ;:. PubMed Google Scholar
  36. Dang Thi TA, Kim Tuyet NT, Pham The C, Thanh Nguyen H, Ba Thi C, Doan Duy T, et al. Synthesis and cytotoxic evaluation of novel ester-triazole-linked triterpenoid-AZT conjugates. Bioorg Med Chem Lett. 2014 Nov;24(22):5190-4. . ;:. PubMed Google Scholar
  37. Yu L, Wang X, Wei X, Wang M, Chen L, Cao S, et al. Triterpenoid saponins from Xanthoceras sorbifolia Bunge and their inhibitory activity on human cancer cell lines. Bioorg Med Chem Lett. 2012 Aug;22(16):5232-8. . ;:. PubMed Google Scholar
  38. Quang TH, Ngan NTT, Minh CV, Kiem PV, Nhiem NX, Tai BH, et al. Anti-inflammatory Triterpenoid Saponins from the Stem Bark of Kalopanax pictus. J Nat Prod. 2011 Sep 23;74(9):1908-15. . ;:. PubMed Google Scholar
  39. Gao J, Li X, Gu G, Liu S, Cui M, Lou HX. Facile synthesis of triterpenoid saponins bearing β-Glu/Gal-(1→3)-β-GluA methyl ester and their cytotoxic activities. Bioorg Med Chem Lett. 2012 Apr;22(7):2396-400. . ;:. PubMed Google Scholar
  40. Mu LH, Huang CL, Zhou WB, Guo DH, Liu P. Methanolysis of triterpenoid saponin from Ardisia gigantifolia stapf. and structure-activity relationship study against cancer cells. Bioorg Med Chem Lett. 2013 Nov;23(22):6073-8. . ;:. PubMed Google Scholar
  41. Lu Y hong, Chen M cang, Liu F, Xu Z, Tian X ting, Xie Y, et al. Synthesis and Cytotoxic Activity of Novel C-23-Modified Asiatic Acid Derivatives. Molecules. 2020 Aug 14;25(16):3709. . ;:. PubMed Google Scholar
  42. Zhang HJ, Zhang GR, Piao HR, Quan ZS. Synthesis and characterization of celastrol derivatives as potential anticancer agents. J Enzyme Inhib Med Chem. 2018 Jan 1;33(1):190-8. . ;:. PubMed Google Scholar
  43. Wang XY, Gao H, Zhang W, Li Y, Cheng G, Sun XL, et al. Bioactive oleanane-type saponins from the rhizomes of Anemone taipaiensis. Bioorg Med Chem Lett. 2013 Oct;23(20):5714-20. . ;:. PubMed Google Scholar
  44. Du Z, Li G, Zhou X, Zhang J. Synthesis of MeON-Glycoside Derivatives of Oleanolic Acid by Neoglycosylation and Evaluation of Their Cytotoxicity against Selected Cancer Cell Lines. Molecules. 2021 Feb 2;26(3):772. . ;:. PubMed Google Scholar
  45. Moustafa GO, Shalaby A, Naglah AM, Mounier MM, El-Sayed H, Anwar MM, et al. Synthesis, Characterization, In Vitro Anticancer Potentiality, and Antimicrobial Activities of Novel Peptide-Glycyrrhetinic-Acid-Based Derivatives. Molecules. 2021 Jul 28;26(15):4573. . ;:. PubMed Google Scholar
  46. Wu J, Zhang ZQ, Zhou XD, Yao QY, Chen ZL, Chu LL, et al. New Terpenoids from Potentilla freyniana Bornm. and Their Cytotoxic Activities. Molecules. 2022 Jun 7;27(12):3665. . ;:. PubMed Google Scholar
  47. Golbraikh A, Tropsha A. Beware of q2! J Mol Graph Model. 2002 Jan;20(4):269-76. . ;:. PubMed Google Scholar


Author's Affiliation

 Copyright Info

Creative Commons License

Copyright: The Authors. This is an open access article distributed under the terms of the Creative Commons Attribution License CC-BY 4.0., which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

 How to Cite
Tran, U., Pham, T., & Nguyen, Q. (2024). Design and screening of HepG2 cancer cell line inhibitors from Triterpenoid derivatives of Paramignya Trimera. Science and Technology Development Journal, 26(SI), 1-16. https://doi.org/https://doi.org/10.32508/stdj.v26iSI.4192

 Cited by



Article level Metrics by Paperbuzz/Impactstory
Article level Metrics by Altmetrics

 Article Statistics
HTML = 1294 times
PDF   = 347 times
XML   = 0 times
Total   = 347 times