Abstract
The Zika virus (ZIKV), a member of the Flaviviridae family, has caused multiple widespread outbreaks, posing significant challenges to global health. This study explores the potential of compounds from Echinacea angustifolia (E. angustifolia) to inhibit the activity of ZIKVâs RNA-dependent RNA polymerase (RDRP), a key enzyme in the viral replication process and an ideal candidate for antiviral therapy. Utilizing computational techniques, we conducted a thorough virtual examination using the MTi-OpenScreen tool to identify potential RDRP inhibitors among E. angustifolia compounds. The top four compounds were further examined through re-docking procedures. To assess the robustness and effectiveness of these interactions, we performed molecular dynamics simulations along with calculations of the binding free energy and PCA analysis. This investigation highlighted four naturally occurring compounds, viz., Echinacoside, Rutin, Echinacin, and Cynaroside, demonstrating a notable affinity for binding to the allosteric site of ZIKV RDRP. These compounds showed strong hydrogen bond formation with crucial residues of the RDRP and presented favorable binding free energies. Our research sheds light on the viability of these E. angustifolia compounds as ZIKV RDRP inhibitors, laying a foundation for further experimental research in developing novel antiviral treatments against ZIKV infections.
Similar content being viewed by others
Introduction
The Zika virus (ZIKV), belonging to the Flaviviridae family, is a pathogen transmitted primarily by Aedes mosquitoes, prevalent in tropical and subtropical areas1,2. Besides mosquito bites, ZIKV can spread through sexual transmission, blood transfusions, and from pregnant women to their unborn children3,4,5,6. While most people infected with ZIKV show no symptoms, those who do may experience mild conditions like fever, rash, joint pain, and conjunctivitis. In some cases, especially among pregnant women, ZIKV can cause severe neurological disorders such as microcephaly and Guillain-Barré syndrome1,7. Treatment for ZIKV is currently limited to symptom management and supportive care. Preventive measures like using insect repellent, wearing protective clothes, and destroying mosquito breeding grounds are vital in curtailing the spread of the virus. The global health threat posed by ZIKV underscores the necessity for comprehensive research into its transmission, pathology, and potential treatments. A significant focus is on finding and confirming effective drug targets against ZIKV. One such target is the RNA-dependent RNA polymerase (RDRP), a key enzyme in the virusâs replication cycle8,9,10. The identification of inhibitors that specifically target the ZIKV RDRP could provide a novel therapeutic approach to combat the infection. Medicinal plants have been used for centuries to treat various ailments, including viral infections11,12,13. These plants are rich in bioactive components, offering a treasure trove for the development of antiviral pharmaceuticals14,15,16,17. Among these, E. angustifolia stands out for its renowned immune-enhancing qualities, a staple in the realm of traditional healing practices18. This plant has garnered attention in scientific circles for its efficacy against various viruses, such as Influenza, Herpes simplex, Japanese encephalitis, and even the recent SARS-CoV-219,20,21,22. The specific constituents of Echinacea that impart its antiviral capabilities remain an area of active research, though compounds like polysaccharides, alkamides, and caffeic acid derivatives are known to play a significant role23 .
This study delves into the potential of E. angustifolia as a natural reservoir of compounds effective against the Zika virus (ZIKV). Our primary objective is to isolate potential inhibitors targeting the ZIKV RDRP and to unravel their inhibitory mechanisms via detailed molecular dynamics simulations. This exploration is not just about identifying new inhibitors; itâs about deepening our understanding of how these natural compounds interact at a molecular level with the ZIKV, potentially paving the way for novel treatment strategies.
Moreover, this research extends beyond the scope of just finding inhibitors. It seeks to contribute to the broader field of phytotherapy, highlighting the untapped potential of plant-based compounds in modern medicine. By investigating E. angustifoliaâs components, this study aims to illuminate the intricate ways in which natural substances can interact with viral pathogens, offering insights that could revolutionize how we approach viral infections. The findings from this study could have far-reaching implications, not only providing new avenues for ZIKV treatment but also setting a precedent for the use of phytochemicals in combating other viral diseases. This research, therefore, stands at the forefront of bridging traditional herbal wisdom with cutting-edge scientific inquiry, potentially unlocking new frontiers in antiviral therapy.
Methodology
Structure data collection and preparation
The three-dimensional structure of ZIKV RDRP in complex with its inhibitor solved at 1.94 Ã resolution was obtained from protein data bank (PDB id: 6LD5)24,25. The phytochemical structures of E. angustifolia were downloaded from the PubChem database26. Preparing the protein structure is an important step before performing docking studies. The preparation of a protein structure was performed in UCSF Chimera using the Dock prep tool, which involved loading the structure, checking for errors, adding missing atoms and residues, removing unwanted molecules, optimizing the structure, and saving the optimized structure as a new PDB file27. This preparation protocol ensures the quality and accuracy of the docking results and aids in identifying potential ligands for further study.
Structure-based screening (SBVS) and re-docking
Virtual screening is a computational technique to predict small moleculesâ binding affinity to a protein target. MTiOpenScreen is a web server that offers a user-friendly platform for virtual screening, which was used in this study28. The docking grid was generated by selecting the inhibitor molecule binding residues present in the crystal structure of Zika RDRP. The center coordinate was set at 77.36 (x), -3.58 (y), and 13.19 (z) with the grid box size of 25âÃâ25âÃâ25. SBVS of E. angustifolia phytomolecules using MTiOpenScreen were involved in preparing the protein structure, uploading the structure to MTiOpenScreen, preparing the ligand database, setting up the virtual screening parameters, running the virtual screening, and analyzing the results. The first pose of each 4 selected phytomolecules was taken for re-docking studies using the Autodock vina UCSF Chimera plugin that can provide valuable insights into the interactions between small molecules and protein targets27,29. The same docking parameters were set as in the SBVS experiment to validate the screening results.
MD simulation experiments
MD simulation of Zika RDRP-E. angustifolia compounds using free academic Desmond-maestro inter-polarity tool involved in preparing the protein-ligand complex, parameterization of the ligand, solvation, and neutralization, energy minimization, equilibration, production run, and data analysis30,31.
Desmond is a widely used software package for molecular dynamics simulation of biomolecules. The following steps were followed to perform MD simulations of the Zika RDRP-E. angustifolia compounds complex using Desmond.
Preparation of protein-ligand complex
Each protein-ligand complex was prepared in the first step using free academic Desmond-maestro by adding missing atoms and residues, removing water molecules and other ligands, assigning bond orders and hydrogen atoms, and optimizing the protein-ligand complex geometry and energy.
System setup for protein-ligand complexes
Setting up a molecular dynamics (MD) system in Desmond involves several steps, including preparing the input files, parameterizing the system, solvating and neutralizing the system, and energy minimization. Each protein-ligand complex was solvated in a box of water molecules (TIP4P) and neutralized by adding counter ions to the simulation box. After system setup for each complex, it was minimized to remove any steric clashes or bad contacts. It was achieved using the Desmond minimization module.
Equilibration and production run
The system needs to be equilibrated to reach a stable conformation before the production run. This includes two steps: (a) NVT equilibration, which keeps the temperature constant and allows the water molecules and counter ions to adjust to the system, and (b) NPT equilibration, which keeps the temperature and pressure constant and allows the protein-ligand complex to adjust to the solvent environment. The default parameters in Desmond molecular dynamics module were selected for system equilibration.
After equilibration, each system was subjected to a production run for 200 ns, which is a long MD simulation that allows the protein-ligand complex to explore the conformational space and the ligand-binding pocket. The production run was also performed using the Desmond MD module with default parameters such as time step, temperature, and pressure.
Simulation interaction diagram analysis
The MD simulation data were analyzed using the Simulation Interaction Diagram Analysis tool of Desmond. The analysis includes tasks such as trajectory visualization, RMSD and RMSF calculation, and identification of key binding interactions between the Zika RDRP and E. angustifolia compounds.
MM/GBSA calculation
The MM/GBSA method was employed to calculate the binding free energy of the selected E. angustifolia and Zika-RDRP complexes using the prime tool of the Schrodinger suit32,33. The last 50 ns trajectories of each complex were taken for analysis. The molecular mechanics energy (EMM) was calculated for 50 ns simulation trajectories at each 1 ns interval using the OPLS-2005 force field. The Generalized Born (GB) solvation energy was calculated using the GB model34. The binding free energy (ÎGBind) was calculated using the following equation:
Where ÎGMMÂ is the molecular mechanics energy difference between the protein-ligand complex and the separated protein and ligand, ÎGBGBÂ is the generalized Born solvation energy difference, and ÎGSAÂ is the nonpolar solvation energy estimated by the solvent accessible surface area (SASA) of the protein-ligand complex.
Principal component analysis
Principal component analysis (PCA) analysis offers valuable information about the flexibility and structural changes observed in complex35. This helps to understand better how these complexes bind and behave over time, improving our interpretation of their dynamics. PCA was conducted using the Bio3D package in R to investigate the conformational dynamics of the selected phytochemical compounds obtained from E. angustifolia and the reference molecule that is docked with ZIKV RDRP after MD simulation36. Prior to the PCA analysis, initially, the atomic coordinates of each phytochemical complex were retrieved from their simulation trajectory files. These coordinates were loaded into the Bio3D package, where they were converted into a dcd file format and then loaded into R using the read.pdb function available in the above-mentioned package36. The loaded R structure and the initial structure of the complexes generated during the MD simulation were aligned and superimposed using the align function and the fit.xyz function available in the Bio3D package. The generated output trajectories of each complex were further statically calculated using cov.pca function to find the correlation between the atomic deviations obtained in each complex using the covariance matrix calculation method. The values generated after this calculation are represented in eigenvectors, and these eigenvectors are considered as principal components, which are plotted during the PCA analysis. The generated conformers of each protein-ligand complex which are projected in cluster format and these clusters were also analysed for PCA analysis.
DCCM analysis
The dynamic cross-correlation matrix (DCCM) analysis, performed using the bio3D package in Rstudio, examined the entire molecular dynamics trajectory to evaluate correlated motions between protein residues36. By calculating the cross-correlation of atomic displacements, the analysis identifies regions with coordinated or anti-correlated movements, providing insights into protein stability, flexibility, and potential allosteric communication pathways. This analysis is valuable for understanding the dynamic behavior of protein-ligand complexes, revealing structural rigidity and functional motions. The heatmap visualization aids in identifying key regions involved in protein function and interaction, making DCCM crucial for drug discovery and studying protein dynamics.
Secondary structure analysis
The secondary structure analysis was performed using Visual Molecular Dynamics (VMD) to track changes in the proteinâs secondary structural elements, such as alpha helices, beta sheets, and random coils, throughout the molecular dynamics (MD) simulation37,38. The Timeline plugin in VMD was used to assign and visualize these structures based on backbone dihedral angles, providing a clear, color-coded representation of their evolution during the simulation. This analysis is crucial for assessing the stability and flexibility of the protein, particularly in understanding how ligand binding or environmental changes affect structural elements. It also offers insights into conformational dynamics and protein folding mechanisms, key for understanding protein function and guiding drug discovery.
RG-RMSD based FEL
The RG-RMSD-based free energy landscape (FEL) analysis using the Geo-measure plugin in PyMOL was conducted to assess the stability and conformational changes of protein-ligand complexes during molecular dynamics simulations39. RMSD was used to track structural deviations, while RG measured the compactness of the complex. The 2D FEL plot, with color-coded energy states, highlighted stable conformations (low-energy minima) and transitional states (high-energy regions). This analysis helps identify the most stable binding conformations, providing insights into ligand stability and protein-ligand interactions. It aids in understanding conformational flexibility and potential allosteric sites, which are key for drug design. By pinpointing regions of low energy and compact structures, the analysis helps select lead compounds with stronger binding affinity and better therapeutic potential.
Results and discussion
Virtual screening experiment
To examine the potential of 45 phytomolecules from E. angustifolia against the RNA-dependent RNA polymerase (RDRP) of the Zika virus, we carried out an extensive in-silico investigation. The docking energy of these compounds was determined by employing a method known as virtual screening, which was carried out with the assistance of the MTi-OpenScreen website28. The docking energy ranged from â10 to â2 Kcal/mol, which indicated that the binding contacts were favourable. Among the chemicals that were evaluated, Echinacoside, Rutin, Echinacin, and Cynaros revealed very strong docking energies of â10.0, â9.6, â9.4, â9.3 kcal/mol, respectively and robust binding interactions with the RDRP of the Zika virus. Based on these encouraging findings, these compounds were chosen for additional research, drawing attention to their potential as lead candidates in the development of innovative anti-Zika treatments (Fig. 1, Supplementary Table S1).
Re-docking, and molecular interaction analysis
A re-docking and molecular interaction investigation was carried out to learn more about the binding interactions between Echinacoside, Rutin, Echinacin, and Cynaroside and the Zika virus RDRP. Using the appropriate docking tool (AutoDock vina), the compounds were placed back into the Proteinâs active site. According to the re-docking score obtained i.e., â10.1 kcal/mol for Echinacoside, 9.5 kcal/mol for Rutin, 9.4 kcal/mol for Echinacin, and â9.3 kcal/mol for Cynaroside, all these compounds retained their favourable binding within the active site. Also, the reference molecule (G8O-G8L) (Supplementary Fig. 1) displayed an acceptable re-docking score of â9.0 kcal/mol.
Several significant interactions that contribute to the selected compoundâs high binding affinities were identified by analyzing the molecular interactions between them and the RDRP of the Zika virus. With several crucial amino acid residues (like Arg731, Arg739, Ile799, Ser798, Trp797, Arg794, Tyr768, and Cys711) in the binding site, echinacoside created seven hydrogen bonds with the binding site of the viral protein and pi-pi stacking interaction was also displayed by echinacoside compound with Arg794 residue of the protein. Amino acid residues Ser663, Ile799, Tyr609, Gln605, and Arg483, rutin demonstrated five hydrogen bond interactions, and residues Lys458 and Arg473 demonstrated salt bridge formations that strengthened its binding. Echinacin produced five hydrogen bonds with Arg473, Arg731, Ser663, and Cys711. In contrast, Cynaroside showed seven hydrogen bond interactions with Ser798, Ile799, Ser663, Ash665, and Tyr668. Strong binding and complicated stabilization of the docked phytochemical complexes were made possible due to the formation of these hydrogen bonds. However, the reference molecule (G8O-G8L) demonstrated hydrogen bond interactions with only two residues (Trp797 and Arg731) of its binding site. Along with the hydrogen bond, several other interactions were also formed in each protein-ligand complex and are listed accordingly in (Table 1). These findings highlight the diverse molecular interactions involved in the binding of these compounds to the ZIKV RDRP, with each compound utilizing distinct hydrogen bonding patterns to achieve strong and specific binding (Fig. 2, Supplementary Fig. 2).
Overall, the molecular interaction and re-docking analyses supported the highly potent binding affinities of echinacoside, rutin, echinacin, and cynaroside to the RDRP of the ZIKV. These substances displayed advantageous hydrogen bonds and other contacts inside the reference ligand binding site, indicating their potential as strong protein-targeting ZIKV replication inhibitors. Further, to confirm and investigate their binding stability, MD simulation experiments were performed.
Molecular dynamics simulation
Molecular dynamics simulations were conducted to investigate the protein-ligand complexes of echinacoside, rutin, echinacin, cynaroside, and the reference molecule with the Zika virus RDRP. The root-mean-square deviation (RMSD) analysis was performed to assess the stability and conformational changes of the complexes throughout the simulation. The RMSF measures how much an atom or group of atoms moves compared to a fixed structure. It calculates the average displacement for all the atoms. An in-depth understanding of the structural conformational change of protein structure and binding and displacement pattern of the ligand molecule in the protein-ligand complex that takes place during a continuous series of dynamic motions till the end of a particular time frame can be possible through MD simulation analysis. This can be possible by generating the three-dimensional structure of the initial pose and final pose of the protein-ligand complex retrieved from the 200 ns simulation trajectory of each complex. In this study, during the structural analysis of the initial and final pose of the RDRP-phytochemical complex, it was observed that the RDRP protein in each complex, including the reference complex shows, exhibited a very minimum amount of structural conformational changes during the simulation, and the phytochemicals showed acceptable displacement by remaining bound to the proteinâs binding site. These three-dimensional structural observations help to conclude that the protein-ligand complex remains in a stable state during the period of MD simulation and may have the ability to inhibit the function of viral protein compared to the reference complex (Fig. 3). To further confirm these findings, the statistical analysis that includes the RMSD, RMSF, and molecular contacts formed in each RDRP-phytochemical complex was studied by retrieving the necessary data from the simulation trajectories.
RMSD analysis
Studied complexes were further analyzed using Root Mean Square Deviation (RMSD) and indicated different patterns of stability for protein as well as the ligand component as shown in (Fig. 4). Protein RMSD of complex 1 was found to be around 2.5 à and RMSD of ligand was between 1 and 2 à , implies moderate stability with slight feasibility in the position of ligand. On the other hand, Complex 2 had a slightly more stable structure with RMSD of 2 à for the protein and the ligand also oscillated marginally between 1 and 2 à indicating a stable protein-ligand complex. In Complex 3, the protein remained stable where the RMSD is less than 2 A throughout duration of 180ns. However, a prominent raise of RMSD more than 2.5 à was observed in the range of 180â200 ns, which might point to the structural flexibility or conformational changes in the protein. In this case, the stability was maintained at a relatively low fluctuation range of 1 to 1.5 à for the RMSD of the ligand. On the other hand, another complex maintained stable protein RMSD less than 2 à , while ligand RMSD fluctuated between 2 and 3 à . This implies that though the protein conformation did not change much during molecular dynamics simulation, the ligand molecule was significantly flexible. For the control complex, the protein kept an RMSD of 2.5 à , while the ligand exhibited more significant deviations, with RMSD values ranging from 2.5 to 4.5 à up to 160 ns. In 160 ns MD simulation, the ligand RMSD rose higher than 3 à , suggesting that the stability was affected, or the conformation has changed.
RMSF analysis
The root mean square fluctuation (RMSF) values for the protein residues from 190 to 210 are shown to exhibit different flexibility patterns for the protein residues in various complexes. In complex 1, this residue has shown an RMSF value less than 3 à confirming moderate flexibility in the backbone of protein. Interestingly, Complex 2 exhibited a slightly higher RMSF value of 3.5 à for the same residues range which pointed out higher fluctuations. A Complex 3 had an RMSF value of less than 4 à in residue 190â210, this implies that although flexibility occurred in the protein, its stability remained high. Likewise, Complex 4 had RMSF value of 4 à within the same range of residues revealing similar degree of fluctuations. On the other hand, the control complex had an RMSF less than 4 à for residues 190â210 similar to the range of values observed for the experimental complexes and hence similar flexibility patterns for the tested complex. Based on this analysis, all the complexes exhibit moderate variations of the protein in terms of its mobility within this range of residues. The RMSF analysis is shown in (Fig. 5).
Molecular contact analysis of protein-ligand complexes from MD trajectories
The molecular contact analysis of the RDRP-phytochemical compounds focuses on finding the major interaction, such as hydrogen bonds, hydrophobic bonds, and ionic bonds, along with water bridge formation formed between the protein-ligand complex during the MD simulation, as shown in (Figs. 6 and 7). These are retrieved from the simulation trajectory of each docked complex, and the significant bonds formed with the protein residues for the maximum amount of time during the simulation period are observed to analyse the stability of each docked complex during MD simulation. So, in the RDRP- Echinacoside complex, mostly hydrogen bonds and water bridges are formed for more than 50% of the simulation period. Trp797 residue contributes to both hydrogen formation of 200 ns simulation time. Also, Ile799 participates in water bridge formation for 50% of the simulation time. Likewise, in the RDRP- Rutin complex, Trp797, and Tyr477 residues are responsible for the hydrogen bond formation 50â100% of the total interaction fraction, while Pro744, Tyr609 residue contributes to hydrophobic bond and water bridge formation for more than 50% of the simulation time. Similarly, in RDRP- Echinacin, namely Ser798, Asp666 and Ser663, form hydrogen bonds for 100% of the total simulation period. The residue Trp797 contributes to hydrophobic bond for more than 80% of the overall simulation period. Gly604 residue participates in water bridge formation for 40% of the simulation time in this complex. Also, in RDRP- Cynaroside complex, Ile799 hydrophobic interact more than 30% during simulation. The residues also contribute to bond formations, but the mentioned residues play a significant role in the stabilisation of the complex. Moreover, in the reference complex residues, His713 contributed to hydrogen bond formation for 30% of the total simulation period, whereas Trp797 formed hydrophobic and water bridge interaction for more than 40% of the total simulation period.
MM/GBSA analysis
MM/GBSA analysis is a computational technique employed to calculate the free binding energy between a ligand and its target protein or receptor. It combines a molecular mechanics force field with a generalized Born or surface area-based solvation model to account for the intermolecular interactions and solvation effects. For estimating the free binding energy, the conformations extracted from the last 50 ns of the 200 ns MD simulation trajectories were utilized. The free binding energy is calculated by the total energy contribution of all the intermolecular interactions. For a better understanding of the individual contributions of these intermolecular interactions, such as covalent, hydrogen binding, Vander Waal etc., were also calculated. This helps to understand the significant role of each energy dissociation component in the stabilisation of each RDRP-phytochemical complex. The analysis of the overall binding free energy for the protein docked with the screened phytochemical compounds of E. angustifolia revealed significant energy values compared to the corresponding reference complexes. RDRP- Echinacin complex exhibited maximum binding free energy (ÎGBind) value of â98.77 kcal/mol, followed by RDRP- Echinacoside (â84.13 kcal/mol), RDRP- Cynaroside (â64.65 kcal/mol) and RdRp- Rutin (â61.46 kcal/mol). However, the reference complex exhibited a binding energy of â50.18 kcal/mol, which is less than the selected phytochemical complex. Also, in the energy dissociation component analysis, it was seen that ÎGBind Coulomb and ÎGBind vdW contribute maximum energy in the stabilisation of the RDRP-phytochemical complex. The free binding energy analysis finalises that the selected phytochemical compounds tend to stabilise the complex and may participate in inhibiting the viral protein (Fig. 8, Supplementary Fig. 3, and Table S2).
Principal component analysis (PCA)
PCA is a statistical technique used to identify the main modes of motion and highlight the most significant variations within the system. By performing PCA on the MD simulation trajectories spanning 200 ns, valuable insights were gained regarding the RDRP- phytochemical complex. The analysis demonstrated that the first three principal components, PC1, PC2, and PC3, accounted for a significant amount of the total variance observed in the dataset. Visualizing the results of PCA by generating a cluster plot revealed different clustering patterns of the poses generated during the simulation by each docked complex, indicating the presence of conformational dynamics induced by the phytochemical compounds. This suggests that the phytochemical compounds have the potential to induce different structural changes in the protein. The sum total of the first principal component (PC1â+âPC2â+âPC3) of each RDRP-phytochemical complexe i.e., RDRP- Echinacoside, RDRP- Rutin, RDRP- Echinacin, RDRP- Cynaroside, and reference complex poses, RDRP-G8O-G8L (Reference ligand) are 40.27, 40.62, 39.58, 38.44 and 35.80% respectively This implies that the RDRP- Echinacoside complex may exhibit maximum stability due to the presence of similar conformers compared to all other phytochemicals and reference molecule. The cluster analysis of these complexes and reference molecule states that all the complexes are stable as the clusters are mostly overlapped with minimum splitting, and the scree plot analysis of these complexes also shows that the plotting of eigenvalues is in a curve slope rather than steep slope elbow formation. These findings suggest that all the docked complexes are stable. So, according to the results obtained from PCA analysis, all the complexes display acceptable stability compared to the reference complex (Fig. 9 and Supplementary Fig. 4).
Dynamic cross corelation analysis (DCCM)
To determine how each of the four compounds affects the proteinâs dynamic interactions relative to the control, the DCCM analysis of the dynamic cross-correlation matrix (DCCM) is in (Fig. 10). In the control (e), there are strong and frequent motions, which suggests that the protein showed extensive coordinated movements. This allows comparing how each compound alters the proteinsâ inherent flexibility and motion. Compound Echinacoside displays the positive shift in residues and has variable coupling motion to harmonize the fluctuation to other residues but with less negative coupling motion. Such flexibility may compromise the function of the protein; while being more flexible than the constant structure, it may compromise the stability of the protein. Compound Rutin, on the other hand exhibits relatively fewer regions of high values of correlation coefficient and may be implying fewer dynamic interactions and possibly leads to a stiffness of the protein. This lowered flexibility could help strengthen structure rigidity of the protein, thus it has lesser conformational changes which can be useful in engineering more stable enzymes. Compound Echinacin shows highly anti-correlated motions and therefore, the specific fragments of the protein are moving in the opposite directions, indicating structural changes due to this compound. This could affect active or flexible sites of the protein which may be of particular importance for change in enzyme function or stability. The results for compound Cynaroside are lower, indicating a moderate flexibility and dynamic coordination and positive as well as negative dynamic correlations. This balance might help the protein for some of the loops to remain as flexible as before and at the same time it can fix some regions which is very important for increasing substrate specificity or increasing the turnover number.
Secondary structure analysis
The secondary structure analysis is shown in (Fig. 11). In the complex second protein in complex 1 showed structure predominantly alpha-helical conformation with some beta-strand regions interspersed throughout. Loops are connecting these secondary structures. Since the compound interacts mainly with loop regions, it may cause localized flexibility, but the overall protein stability seems unaffected. The secondary structure of the protein in the complex 2 also maintains a strong alpha-helical structure, but the loops appear slightly more extended, with the beta strands still present in similar regions. The compound might induce slight conformational shifts near loop-helical junctions, potentially influencing protein activity by altering surface accessibility. The secondary structure analysis of complex 3 revealed that beta strands are more prominent compared to the first two structures with clear well-formed beta sheets. Alpha helices are still prevalent but seem slightly more compact. This indicates that binding could stabilize certain regions of the protein through interactions with beta strands, leading to potential structural constraints that might enhance or suppress the proteinâs natural function. Secondary Structure of complex 4 exhibits a more balanced mix of alpha helices and beta sheet. The impact of this compound might be more on the proteinâs functional dynamics rather than its structural integrity, potentially affecting interactions with other proteins or substrates and, with loops extending further from the core structure. The control secondary structure showed the native structure of the protein maintaining its natural secondary structure conformation. The alpha helices, beta strands, and loops are well-defined and follow the expected alignment pattern. The protein remains structurally sound without external influences, serving as a benchmark to measure the effects of the other four compounds.
RG-RMSD based FEL
The RG-RMSD-based free energy landscape (FEL) analysis of the four complexesâRDRP-Echinacoside, RDRP-Rutin, RDRP-Echinacin, and RDRP-Cynarosideâand the control highlights key differences in energy transitions, stability, and the dynamic behavior of the protein-ligand interactions during molecular dynamics (MD) simulations as shown in (Fig. 12). For the RDRP-Echinacoside complex, the FEL plot reveals a highly stable lower energy state, indicating that the complex reaches a favorable conformation with minimal transitions to higher energy regions. This deep energy well signifies strong and stable interactions between the Echinacoside ligand and the RDRP protein. The lack of significant transitions to higher energy states suggests that the complex remains in a stable conformation throughout the simulation, allowing little conformational flexibility. This indicates tight binding and an effective interaction between the ligand and the protein. In the case of the RDRP-Rutin complex, the FEL shows a stable lower energy state, but the transitions to higher energy regions are slightly more gradual compared to RDRP-Echinacoside. This suggests that while the Rutin complex maintains a stable binding conformation, it allows for more flexibility within the protein-ligand complex. The energy landscape is more rugged, indicating the presence of alternative, less stable conformations that the system occasionally explores during the simulation. While still exhibiting stability, the Rutin complex allows for more dynamic behavior compared to Echinacoside. The RDRP-Echinacin complex demonstrates a broader lower energy state, reflecting a stable binding conformation but with more conformational diversity. The wider energy basin implies that the RDRP-Echinacin complex can adopt multiple conformations, which may lead to more flexibility in the system. This complex undergoes more frequent energy transitions, and higher energy states appear more often compared to RDRP-Echinacoside and RDRP-Rutin. This indicates that the interactions between Echinacin and the protein are slightly weaker, allowing for more conformational variability and less rigid binding. The RDRP-Cynaroside complex displays a well-defined lower energy state, similar to RDRP-Echinacoside. The sharp energy well indicates that the Cynaroside complex forms a strong, stable interaction with the RDRP protein, resulting in minimal conformational shifts. Like Echinacoside, Cynaroside restricts conformational flexibility, as the complex stays in a stable state throughout the simulation. The energy transitions are infrequent, and the appearance of higher energy states is sparse, which implies that Cynaroside effectively stabilizes the protein-ligand complex. In contrast, the control compound shows a shallow lower energy state with frequent transitions to higher energy regions. This broader energy landscape indicates that the control does not stabilize the protein-ligand complex as effectively as the test compounds. The frequent transitions suggest that the control complex samples a wide range of conformational states, reflecting weaker interactions and greater flexibility in the system. This indicates that the control compound does not achieve the same level of binding stability as the tested ligands.
Conclusion
In conclusion, this computational study investigates potential bioactive compounds from E. angustifolia as inhibitors of the ZIKV RDRP, one of the most critical viral enzymes in the replication process. Employing a virtual screening approach, molecular docking, and dynamic simulations, four promising candidate compounds were identified: echinacoside, rutin, echinacin, and cynaroside. These compounds had good binding affinities, stable interactions with key RDRP residues, and robust binding free energies, probably indicative of their effectiveness against RDRP. Further, advanced analyses such as PCA, MM/GBSA, and secondary structure assessment have proved the stability and efficacy of these complexes in disrupting the function of RDRP. The findings from this study indicate the value of compounds from E. angustifolia as candidates for antiviral applications, which needs experimental confirmation at a clinical level. This study provides a foundation for future research into plant-derived antiviral therapies, emphasizing the importance of phytochemicals in addressing emerging infectious diseases like ZIKV.
Data availability
The datasets generated and/or analysed during the current study are available upon request from the corresponding author.
References
Hamel, R. et al. Biology of Zika virus infection in human skin cells. J. Virol. 89, 8880â8896 (2015).
Kuno, G., Chang, G. J. J., Tsuchiya, K. R., Karabatsos, N. & Cropp, C. B. Phylogeny of the genus Flavivirus. J. Virol. 72, 73â83 (1998).
Li, M. I., Wong, P. S. J., Ng, L. C. & Tan, C. H. Oral susceptibility of Singapore Aedes (Stegomyia) aegypti (Linnaeus) to Zika virus. PLoS Negl. Trop. Dis. 6, e1792 (2012).
Marchette, N. J., Garcia, R. & Rudnick, A. Isolation of Zika virus from Aedes Aegypti mosquitoes in Malaysia. Am. J. Trop. Med. Hyg. 18, 411â415 (1969).
Fiorentino, D. G. & Montero, F. J. The Zika virus and pregnancy. Curr. Obstet. Gynecol. Rep. 5, 234â238 (2016).
Grischott, F., Puhan, M., Hatz, C. & Schlagenhauf, P. Non-vector-borne transmission of Zika virus: a systematic review. Travel Med. Infect. Dis. 14, 313â330 (2016).
Marbán-Castro, E., Goncé, A., Fumadó, V., Romero-Acevedo, L. & BardajÃ, A. Zika virus infection in pregnant women and their children: a review. Eur. J. Obstet. Gynecol. Reprod. Biol. 265, 162â168 (2021).
Kumar, S. et al. Discovery of bispecific lead compounds from Azadirachta indica against ZIKA NS2B-NS3 protease and NS5 RNA dependent RNA polymerase using molecular simulations. Molecules 27, 2562 (2022).
Ganeshpurkar, A., Gutti, G. & Singh, S. K. RNA-dependent RNA polymerases and their emerging roles in antiviral therapy. in Viral Polymerases 1â42 https://doi.org/10.1016/B978-0-12-815422-9.00001-2 (Elsevier, 2019).
Kumar, R., Mishra, S., Maurya, S. K. & Shreya & Recent advances in the discovery of potent RNA-dependent RNA-polymerase (RdRp) inhibitors targeting viruses. RSC Med. Chem. 12, 306â320 (2021).
Hassan, B. A. R. & Mohammed, A. H. Medicinal plants and infection. J. Innovat. Med. Res. 2, 9â11 (2023).
Abubakar, I. B. et al. Traditional medicinal plants used for treating emerging and re-emerging viral diseases in northern Nigeria. Eur. J. Integr. Med. 49, 102094 (2022).
Chaughule, R. S. & Barve, R. S. Role of herbal medicines in the treatment of infectious diseases. Vegetos https://doi.org/10.1007/s42535-022-00549-2 (2023).
Dwivedi, V. D., Tripathi, I. P. & Mishra, S. K. In silico evaluation of inhibitory potential of triterpenoids from Azadirachta indica against therapeutic target of dengue virus, NS2B-NS3 protease. J. Vector Borne Dis. 53, 156â161 (2016).
Dwivedi, V. D., Tripathi, I. P., Bharadwaj, S., Kaushik, A. C. & Mishra, S. K. Identification of new potent inhibitors of dengue virus NS3 protease from traditional Chinese medicine database. Virus Dis. 27, 220â225 (2016).
Mahajan, P. et al. A multi-target approach for discovery of antiviral compounds against dengue virus from green tea. Netw. Model. Anal. Health Inf. Bioinf. 9, 20 (2020).
Denaro, M. et al. Antiviral activity of plants and their isolated bioactive compounds: an update. Phytother. Res. 34, 742â768 (2020).
Dobrange, E., Peshev, D., Loedolff, B. & Van Den Ende, W. Fructans as immunomodulatory and antiviral agents: the case of Echinacea. Biomolecules 9, 615 (2019).
Yadav, P. et al. Discovery of small molecules from Echinacea angustifolia targeting RNA-dependent RNA polymerase of Japanese Encephalitis virus. Life 12, 952 (2022).
Binns, S. E., Hudson, J., Merali, S. & Arnason, J. T. Antiviral activity of characterized extracts from Echinacea spp. (Heliantheae: Asteraceae) against herpes simplex virus (HSV-I). Planta med. 68, 780â783 (2002).
Hudson, J. & Vimalanathan, S. EchinaceaâA source of potent antivirals for respiratory virus infections. Pharmaceuticals 4, 1019â1031 (2011).
Singh, N. A comparison of both water and ethanol extracts prepared from Echinacea purpurea and Echinacea angustifolia on the response to influenza A/PR/8/34 infection in mice. (2010).
Kumar, K. M. & Ramaiah, S. Pharmacological importance of Echinacea purpurea. Int. J. Pharma Bio Sci. 2, 304â314 (2011).
Burley, S. K. et al. RCSB protein data bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning. Nucleic Acids Res. 51, D488âD508 (2023).
Gharbi-Ayachi, A. et al. Non-nucleoside inhibitors of Zika virus RNA-dependent RNA polymerase. J. Virol. 94, e00794âe00720 (2020).
Kim, S. et al. PubChem 2023 update. Nucleic Acids Res. 51, D1373âD1380 (2023).
Pettersen, E. F. et al. UCSF ChimeraâA visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605â1612 (2004).
Labbé, C. M. et al. MTiOpenScreen: a web server for structure-based virtual screening. Nucleic Acids Res. 43, W448âW454 (2015).
Trott, O., Olson, A. J., AutoDock & Vina Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 31, 455â461 (2010).
Bowers, K. J. et al. Molecular dynamicsâScalable algorithms for molecular dynamics simulations on commodity clusters. in Proceedings of the ACM/IEEE conference on Supercomputing - SC â06 84 https://doi.org/10.1145/1188455.1188544 (ACM Press, 2006).
Schrödinger Release 2024-2: Desmond Molecular Dynamics System, D. E. Shaw Research, New York, NY, 2024. Maestro-Desmond Interoperability Tools, Schrödinger, New York, NY. (2024)
Schrödinger Release 2024-2: Prime, Schrödinger, LLC, New York, NY, 2024.
Li, J. et al. The VSGB 2.0 model: a next generation energy model for high resolution protein structure modeling. Proteins Struct. Funct. Bioinform. 79, 2794â2812 (2011).
Greenidge, P. A., Kramer, C., Mozziconacci, J. C. & Wolf, R. M. MM/GBSA binding energy prediction on the PDBbind data set: successes, failures, and directions for further improvement. J. Chem. Inf. Model. 53, 201â209 (2013).
Elhaik, E. Principal component analyses (PCA)-based findings in population genetic studies are highly biased and must be reevaluated. Sci. Rep. 12, 14683 (2022).
Grant, B. J., Skjærven, L. & Yao, X. Q. The Bio3D packages for structural bioinformatics. Protein Science: Publication Protein Soc. 30, 20 (2020).
Humphrey, W., Dalke, A. & Schulten, K. Visual molecular dynamics. J. Mol. Graph. 14, 33â38 (1996).
Feng, Z., Xia, F. & Jiang, Z. The effect of β-sheet secondary structure on All-β proteins by molecular dynamics simulations. Molecules. 29, 2967 (2024).
Kagami, L. P., Das Neves, G. M., Timmers, L. F. S. M., Caceres, R. A. & Eifler-Lima, V. L. Geo-measures: a PyMOL plugin for protein structure ensembles analysis. Comput. Biol. Chem. 87, 107322 (2020).
Acknowledgements
The authors acknowledge the generous charitable donation from the Late Sheikh Ibraheem Ahmed Azhar in the fund as a contribution to the scientific research community. The authors are also highly thankful to Dr. Amaresh Kumar Sahoo, Indian Institute of Information Technology, Prayagraj, India for providing his kind support for binding free energy calculation analysis using the Prime Schrödinger suite software package.
Author information
Authors and Affiliations
Contributions
Conceptualization, M.M.E.D, L.H.B., T.A.A., I.M.A., H.S.G., M.M.A., V.D.D. and E.I.A.; Data curation, M.M.E.D and V.D.D.; Formal analysis, M.M.E.D, L.H.B. and V.D.D.; Funding acquisition, E.I.A.; Investigation, M.M.E.D, V.D.D. and E.I.A.; Methodology, M.M.E.D, L.H.B., T.A.A., I.M.A., H.S.G., M.M.A., V.D.D. and E.I.A.; Project administration, E.I.A.; Resources, M.M.E.D and E.I.A.; Software, M.M.E.D, V.D.D. and E.I.A.; Supervision, V.D.D. and E.I.A.; Validation, M.M.E.D, L.H.B., T.A.A., I.M.A., H.S.G., M.M.A., V.D.D. and E.I.A.; Visualization, M.M.E.D, L.H.B., T.A.A., I.M.A., H.S.G., M.M.A., V.D.D. and E.I.A.; Writingâoriginal draft, M.M.E.D and V.D.D.; Writingâreview & editing, M.M.E.D, L.H.B., T.A.A., I.M.A., H.S.G., M.M.A., V.D.D. and E.I.A. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisherâs note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the articleâs Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the articleâs Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
El-Daly, M.M., Bajrai, L.H., Alandijany, T.A. et al. Exploring Echinacea angustifolia for anti-viral compounds against Zika virus RNA-dependent RNA polymerase: a computational study. Sci Rep 15, 4060 (2025). https://doi.org/10.1038/s41598-025-88481-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-88481-8