A quantitative model for the rate-limiting process of UGA alternative assignments to stop and selenocysteine codons

Yen-Fu Chen; Hsiu-Chuan Lin; Kai-Neng Chuang; Chih-Hsu Lin; Hsueh-Chi S. Yen; Chen-Hsiang Yeang

doi:10.1371/journal.pcbi.1005367

Abstract

Ambiguity in genetic codes exists in cases where certain stop codons are alternatively used to encode non-canonical amino acids. In selenoprotein transcripts, the UGA codon may either represent a translation termination signal or a selenocysteine (Sec) codon. Translating UGA to Sec requires selenium and specialized Sec incorporation machinery such as the interaction between the SECIS element and SBP2 protein, but how these factors quantitatively affect alternative assignments of UGA has not been fully investigated. We developed a model simulating the UGA decoding process. Our model is based on the following assumptions: (1) charged Sec-specific tRNAs (Sec-tRNA^Sec) and release factors compete for a UGA site, (2) Sec-tRNA^Sec abundance is limited by the concentrations of selenium and Sec-specific tRNA (tRNA^Sec) precursors, and (3) all synthesis reactions follow first-order kinetics. We demonstrated that this model captured two prominent characteristics observed from experimental data. First, UGA to Sec decoding increases with elevated selenium availability, but saturates under high selenium supply. Second, the efficiency of Sec incorporation is reduced with increasing selenoprotein synthesis. We measured the expressions of four selenoprotein constructs and estimated their model parameters. Their inferred Sec incorporation efficiencies did not correlate well with their SECIS-SBP2 binding affinities, suggesting the existence of additional factors determining the hierarchy of selenoprotein synthesis under selenium deficiency. This model provides a framework to systematically study the interplay of factors affecting the dual definitions of a genetic codon.

Author summary

The “code book” of protein translation maps 4³ = 64 triplets of RNA sequences (codons) into 20 canonical amino acids and the stop signal. This code book is universal in almost all organisms on earth. Selenoproteins consist of selenium-containing amino acids–selenocysteines (Sec)–that are not among the 20 canonical amino acids. The cells “borrow” a stop codon UGA to translate selenocysteines. Since UGA maps to two possible outcomes, the translation machinery can synthesize both full-length selenoproteins (when UGA encodes selenocysteine) and truncated peptide chains (when UGA encodes translational termination). Despite extensive study about selenoprotein synthesis mechanisms, a quantitative model for how cells allocate resources to synthesize each species is yet to appear. We propose a quantitative model that can explain the dependency of experimental observables such as protein stability and Sec incorporation efficiency by various factors such as selenium concentration and mRNA levels. Saturation of those quantities implies the existence of limiting factors such as mRNA transcripts and Sec-specific tRNAs. The match between model simulations and experimental data suggests that the cellular decision making of synthesizing the two species of proteins may follow simple first-order kinetics.

Citation: Chen Y-F, Lin H-C, Chuang K-N, Lin C-H, Yen H-CS, Yeang C-H (2017) A quantitative model for the rate-limiting process of UGA alternative assignments to stop and selenocysteine codons. PLoS Comput Biol 13(2): e1005367. https://doi.org/10.1371/journal.pcbi.1005367

Editor: Joshua B. Plotkin, University of Pennsylvania, UNITED STATES

Received: August 5, 2016; Accepted: January 18, 2017; Published: February 8, 2017

Copyright: © 2017 Chen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting Information files.

Funding: This work was supported by Career Development Award 101-CDA-L05 from Academia Sinica and Taiwan’s Ministry of Science and Technology (MOST) grant 103-2311-B-001-034-MY3 awarded to HCSY, and Career Development Award 104-CDA-M04 from Academia Sinica and MOST grant 103-2118-M-001-011-MY2 awarded to CHY. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Stop codons can be reassigned to encode amino acids [1, 2]. Failures in stop codon reassignment leads to the production of prematurely terminated proteins [3, 4], but how cellular factors influence alternative definitions of stop codons is not fully understood. While some stop codon reassignments are confined to certain species or organelles, redefinition of UGA to selenocysteine (Sec) in selenoprotein synthesis occurs in all three domains of life [5]. Selenoproteins are proteins that contain the Sec amino acid residue. Translating UGA to Sec requires Sec-tRNA^Sec (Sec-specific tRNA charged with Sec), the Sec insertion sequence (SECIS) element at the 3’ untranslated region (3’UTR) of selenoprotein mRNAs [4, 6, 7], and other regulatory factors such as SBP2 [8–10] and EFSec [11, 12]. Failed UGA to Sec decoding results in translation termination, with UGA being recognized by a release factor (RF) instead. RFs trigger the hydrolysis of ester bonds in peptidyl-tRNA and corresponding release of translated proteins from the ribosome [13, 14].

Translating UGA to Sec is inefficient [15–17] and influenced by the abundance of selenoprotein mRNA, Sec-tRNA^Sec, selenium, SBP2 and the intrinsic properties of SECIS elements [8, 17–21]. Overexpression of selenoprotein mRNA reduces UGA-to-Sec decoding [18, 22], but this effect could be rescued by co-expression of uncharged Sec-specific tRNA (tRNA^Sec) [18, 22] or SBP2 [8]. The efficiency of Sec incorporation has been shown to be positively correlated with tRNA^Sec or selenium supply in cells [20] yet differs among seleoproteins [23]. There are at least 25 selenoproteins in the human proteome [24] and their difference in Sec incorporation efficiency leads to a “selenoprotein hierarchy” under selenium deficiency [23]: proteins with higher Sec incorporation efficiency exploit more Sec-tRNA^Sec and are more rapidly synthesized. It is well known that hierarchical selenoprotein expression depends on the SECIS-SBP2 interaction [8], but whether this interaction is the sole determinant for selenoprotein hierarchy remains unclear.

Despite the aforementioned rich studies in selenoprotein translation, a systematic and quantitative characterization of the joint effects of various regulatory factors has not yet been reported. To fill this gap, we developed a simple mechanistic model that captures the quantitative characteristics of the UGA translation process and applied this model to experimental data to investigate how various regulatory factors influence the definition of UGA. We utilized differential protein half-lives from full-length and truncated selenoproteins, retrieved from a single-cell-based global protein stability (GPS) assay [25], to infer UGA definitions under cell culture conditions with variations in selenium supply and selenoprotein expression levels, and used those inferred quantities to estimate the model parameters. We found that the qualitative behavior of selenoprotein translation derived from our model closely resembles that from experimental data. Moreover, we re-capitulated the selenoprotein hierarchy by measuring and comparing the stability of proteins expressed from constructs with SECIS elements of four distinct selenoproteins. The estimated Sec incorporation rates are incongruent with the reported SECIS-SBP2 binding equilibrium constants, suggesting the existence of additional factors to explain selenoprotein hierarchy. Our model provides a framework to quantitatively study the regulation of UGA codon redefinition and selenoprotein synthesis.

Results

Experimental results

Inferring UGA definition using differential protein half-lives between full-length and UGA-terminated selenoproteins.

The UGA codon defines either Sec or translation termination during selenoprotein synthesis. Decoding UGA to Sec or translation termination results in the expression of full-length (P_L) or truncated (P_S) selenoproteins, respectively (Fig 1). UGA assignment can therefore be inferred from the ratio of P_L to P_S. We previously found that P_L is more stable than P_S, and that the protein stability of neither species is affected by selenium supply [3]. Thus, our deduction of UGA definition can be taken from the observed protein half-life of the total selenoprotein population (P_T), which is represented by the mixture of P_L and P_S. Intuitively, when a higher proportion of UGA is redefined as Sec, P_L is favored over P_S, resulting in a greater observed selenoprotein half-life and vice versa. This phenomenon can be depicted by the following formula:

Download:

Fig 1. A model of UGA decoding during selenoprotein synthesis.

The Sec-tRNA^Sec and RFs compete for a UGA site with association constants k₁ and k₂, respectively. The abundance of Sec-tRNA^Sec is determined by both selenium (Se) and uncharged tRNA^Sec with an equilibrium constant k_3. When Sec-tRNA^Sec binds to the UGA site, the mRNA will be translated into full length selenoproteins (P_L) with synthesis rate γ_L and degradation rate λ_L. When RF binds to the UGA site, the mRNA will be translated into truncated proteins (P_S) with synthesis rate γ_S and degradation rate λ_S. Φ denotes debris after protein degradation.

https://doi.org/10.1371/journal.pcbi.1005367.g001

where x and (1-x) are the proportions of UGA defined as Sec and translation termination, respectively, and λ_L, λ_S and λ_T are the degradation rates of P_L, P_S and P_T, respectively. The reciprocal of λ is proportional to the protein half-life. The half-life of P_T is a linear combination of that of P_L and P_S, and thus x can be deduced by measuring the half-lives of P_T, P_L and P_S.

Characterization of UGA assignments in SEPHS2 and SEPW1 syntheses.

We measured the protein half-lives of P_L, P_S and P_T using GPS, a single-cell-based dual-fluorescent reporter system [25]. GPS allows for simultaneous measurement of protein synthesis (mRNA level), abundance and half-life using red-fluorescent protein (RFP) signal, green-fluorescent protein (GFP) signal and the GFP/RFP ratio, respectively (S1 Fig). The GPS reporter was permanently integrated into the genome of cells in order to tightly control protein expression levels and to avoid artifacts resulting from transient expression.

We utilized the synthesis of selenoprotein SEPHS2 as a model to investigate the UGA decoding process. To measure the half-lives of P_L and P_S, we generated SEPHS2 mutant transcripts that exclusively express P_L or P_S (see Experimental Methods for details). Consistent with previous findings [3], P_L was much more stable than P_S (Fig 2A and 2B), and the half-lives of both species were not affected by either selenium supply (Fig 2C) or protein synthesis rate (Fig 2D). To reiterate the concept of deducing UGA definition using differential protein half-lives between P_L and P_S, a hypothetical curve for wild-type SEPHS2 transcripts (P_T) expressing both P_L and P_S is also shown (Fig 2B, the orange line), with the observed half-life lying between P_L and P_S. Favorable UGA to Sec or stop assignments shifts the curve closer to that of P_L or P_S, respectively.

Download:

Fig 2. Protein half-life analysis of full-length and truncated SEPHS2.

(a) Distributions of protein stability measurements of P_L or P_S by the GPS assay. P_L and P_S were expressed from SEPHS2 mutant transcripts that exclusively express one form of SEPHS2. % of Max indicates normalized cell counts such that the peak value of each distribution is 100%. (b) The relationship between protein synthesis and abundance for P_L and P_S. Each dot represents a single cell carrying the indicated GPS reporter with a corresponding protein synthesis (RFP) and protein abundance (GFP). The GFP/RFP ratio, or the slope of the protein synthesis-abundance plot passing through the origin, reflects the protein half-life. A hypothetical line for P_T, the total amount of proteins of both forms, is shown. (c-d) GPS analysis of P_L and P_S under various selenium concentrations (c) or synthesis levels (d). Relative mRNA levels are quantifications of the RFP signals in the GPS assay.

https://doi.org/10.1371/journal.pcbi.1005367.g002

To investigate how various factors affected the alternative translation of UGA, we measured the half-lives of P_T under different selenium concentrations and synthesis levels. In accordance with the hypothetical line in Fig 2B, the half-lives of P_T (i.e. the GFP/RFP ratio or slope) are situated between those of P_L and P_S (S2 Fig). As revealed by the corresponding increase in the half-lives of P_T (Fig 3A and 3B; S2 Fig), UGA to Sec translation is preferred with increasing selenium availability. In contrast, UGA to Sec translation is disfavored with increased SEPHS2 synthesis, as shown by the corresponding decrease in the half-lives of P_T (Fig 3C). Consistent with the idea of binding competition between Sec-tRNA^Sec and RFs at the UGA sites [26], decreasing the abundance of RFs promoted UGA to Sec decoding as revealed by the increase in the half-life of P_T (S3 Fig).

Download:

Fig 3. The effect of selenium supply and SEPHS2 synthesis level on UGA definition.

(a) The relationship between protein synthesis and abundance for P_T analyzed under various selenium concentrations. Both original and processed experimental results are presented in the graph and are represented by “original” and “mean”, respectively. The processed results present the mean abundance at each synthesis level. Since the half-life of P_S is not affected by selenium supply, only P_S analyzed at 40 nM selenium concentration is shown. (b) GPS assay of P_T under various selenium concentrations. (c) GPS assay of P_T under 40 nM selenium and various synthesis levels. Relative synthesis levels were estimated from the GPS assay. (d) The ratios of P_L and P_S abundance were quantified by Western blotting. Relative mRNA levels were estimated from the GPS assay.

https://doi.org/10.1371/journal.pcbi.1005367.g003

The abundance of P_T possesses a positive yet nonlinear relation with SEPHS2 synthesis (Fig 3A). The rate of increase for P_T abundance declines with elevated SEPHS2 synthesis. Both the half-life of P_T and the amount of protein synthesis that yields a declining rate of P_T abundance increase with increasing selenium supply (Fig 3A), suggesting selenium as a limiting factor for UGA to Sec translation. However, elevated selenium supply can only push saturation of P_T abundance toward higher protein synthesis but not eradicate it (Fig 3A), and the half-lives of P_T cannot reach that of P_L even at high selenium concentrations (Fig 3B; S2 Fig). Those observations suggest the existence of additional limiting factors beyond selenium supply.

We directly quantified the protein abundance of P_L and P_S by Western blotting as an alternative approach to investigate the UGA decoding process. The ratio of P_L/P_S abundance served as an indicator for UGA to Sec translation efficiency (Fig 3D). Consistent with results inferred from protein half-lives (Fig 3A–3C), UGA to Sec translation increased with selenium supply, yet became saturated at high selenium concentrations. The efficiency of UGA to Sec translation declined with synthesis (mRNA levels) at each fixed selenium concentration. Intriguingly, we observed the production of P_S even under ample selenium supply (data not shown), suggesting unavoidable binding competition between RFs and Sec-tRNA^Sec at the UGA sites.

We analyzed the UGA decoding process of another selenoprotein, SEPW1. The superior stability of the full-length proteins compared to truncated peptides was sustained (S4 Fig). Moreover, the relation between protein abundance and synthesis under various selenium concentrations also possessed similar qualitative characteristics for both SEPW1 (S5A Fig) and SEPHS2 (Fig 3A). In both proteins, protein abundance (GFP) increased with increasing protein synthesis (RFP) and selenium concentrations, yet the GFP-RFP curve slope declined with increasing RFP values. These results indicate that the qualitative behaviors of Sec incorporation are not idiosyncratic to SEPHS2.

Comparison of UGA assignments with four SECIS elements.

To investigate the role of SECIS elements on hierarchical selenoprotein translation, we replaced the SECIS element of the SEPHS2 transcript with those from three other selenoprotein transcripts–GPX1, SELK, and SEPX1 –and monitored SEPHS2 protein expressions by the GPS assay. We show the relations between protein abundance and protein synthesis under 40 nM selenium concentration (Fig 4A) and four selenium concentrations (S6A–S6D Fig). The data from those constructs exhibited a hierarchy of Sec incorporation efficiency. SEPHS2 and GPX1 had higher GFP-RFP slopes than SEPX1 and SELK, indicating superior Sec incorporation of the SECIS elements of SEPHS2 and GPX1 to those of SEPX1 and SELK.

Download:

Fig 4. Experimental and predicted Sec incorporation efficiencies in four SECIS constructs.

(a) The relationship between protein synthesis and abundance for P_T under 40 nM selenium concentration in the experimental data of four SECIS constructs: GPX1 (red), SEPHS2 (blue), SEPX1 (brown), and SELK (green). A solid circle indicates the mean GFP value for each RFP value. (b) The relationship between protein synthesis and abundance for P_T under 40 nM selenium concentration according to the models inferred from experimental data.

https://doi.org/10.1371/journal.pcbi.1005367.g004

Computational modeling results

A mathematical model of the Sec incorporation process.

We propose a simple mechanistic model of selenoprotein expression control that accounts for the aforementioned experimental characteristics:

P_L is more stable than P_S, and their half-lives are not affected by synthesis level or selenium supply.
Total selenoprotein abundance increases with both mRNA levels and selenium supply.
Additional limiting factors account for the saturation of P_T at high levels of selenoprotein mRNA and selenium.
UGA to Sec translation increases with selenium supply but decreases with selenoprotein mRNA levels, and it saturates at high selenium concentrations due to the existence of the same limiting factor.
Constituent binding competition between RFs and Sec-tRNA^sec occurs at UGA sites.

The model is schematically illustrated in Fig 1 and described below.

Basic reactions and hypotheses.

The model is based on the following simplifying assumptions:

Synthesis and degradation reactions of both P_L and P_S follow first-order kinetics, which stipulate that the reaction rates are proportional to the substrate concentrations.
P_L and P_S have distinct synthesis and degradation rates. P_S possess a considerably shorter half-life than P_L.
RFs and Sec-tRNA^Sec compete for UGA sites.
The total amount of selenoprotein mRNA is distributed among the transcripts participating in the translation of P_L (mRNA-Sec-tRNA^Sec), P_S (mRNA-RF), and free molecules.
The total amount of tRNA^Sec in a cell is fixed and distributed between free and charged tRNAs.
The conjugation of Sec to tRNA^Sec also follows first-order kinetics with respect to selenium and free tRNA^Sec molecules.

The selenoprotein constructs in our experiments are derived from intron-less cDNAs and thus are immune to nonsense-mediated mRNA decay (NMD), a well-known mRNA quality surveillance mechanism to eliminate mRNA with premature stop codons [27, 28]. Nevertheless, we have incorporated NMD regulation into our model (Eqs 5–7 in Materials and Methods).

Under those assumptions, we describe the following reactions at steady state in this model:

Sec-tRNA^Sec incorporation, P_L translation and degradation:
RF competition for the UGA site, P_S translation and degradation:
Sec-tRNA^Sec synthesis:

In addition to the aforementioned reactions, we also imposed three other constraints on the total amounts of selenoprotein mRNA and tRNA^Sec. The first constraint stipulates that the selenoprotein mRNAs are distributed among the molecules bound to Sec-tRNA^Sec, RFs, and free molecules (Eq 10). The second constraint stipulates that the total Sec-tRNA^Sec molecules are distributed between the charged tRNAs interacting with mRNAs and free molecules (Eq 11). The third constraint stipulates that the total tRNA^Sec molecules are distributed between charged and uncharged tRNAs (Eq 12).

The model consists of nine parameters: translation (γ) and degradation (λ) rates of P_L and P_S (i.e. γ_L/λ_L and γ_S/λ_S respectively); equilibrium constants for the interactions with Sec-tRNA^Sec and RFs (k₁ and k₂, respectively); the equilibrium constant of charging Sec to tRNA^Sec (k₃); the total amount of tRNA^Sec (T_total); and the total amount of RFs. The total amounts of mRNA and protein levels (m_total and P_total respectively) of each cell are measured by the RFP and GFP intensities in the GPS assay (Eq 13). Given those parameters as well as the equations and constraints derived from the hypotheses above, the relationship between total protein abundance and total mRNA levels can be expressed as a complex functional formula.

To estimate the model parameters, we further simplified the nine parameters in the model and combined them into six independent parameters: the ratios of synthesis and degradation rates γ_L/λ_L and γ_S/λ_S were calculated from the experimental data of P_L and P_S alone, respectively; k₂ and RF were combined into one parameter kF as they always co-occurred in the equations; we also introduced parameters ρ_p and ρ_m to specify the ratios of protein and mRNA abundance from GFP and RFP intensities, respectively, and replaced ρ_m with an equivalent parameter ρ = ρ_m/ρ_p. Consequently, only the following six parameters need to be estimated: k₁, kF, k₃, T_total, ρ and ρ_p. A detailed description of the model is reported in Materials and Methods.

Recapitulation of the qualitative characteristics of selenoprotein synthesis and degradation.

To verify the sensibility of this model, we examined if it could reproduce the qualitative properties observed from experimental data for SEPHS2. Moreover, to ensure that this model consists of all the essential requirements to explain the observed phenomena, we excluded the two constraints (mRNA and tRNA^Sec), both separately and together, and checked whether the reduced models could still recapitulate the same qualitative properties.

We selected a specific set of parameter values in the model ({k₁, k₃, k_f, T_total, ρ, ρ_p} = {3, 10, 0.1, 500, 10, 100}; [m_total] = 1∼4000), varied the amount of selenium supply and mRNA levels, and then generated simulated data for the GPS assay (Fig 5A) and the Western blot experiment (Fig 5B). We compared the simulation outcomes of four models: (1) the model with both mRNA and tRNA^Sec constraints; (2) the model with the mRNA constraint alone; (3) the model with the tRNA^Sec constraint alone; and (4) the model without mRNA and tRNA^Sec constraints. Only the model incorporating both constraints exhibits saturation of the total protein abundance (Fig 5A) and P_L/P_S (Fig 5B) with increased protein synthesis and selenium supply, respectively. At low mRNA (protein synthesis) levels, P_L formation dominates due to its superior stability. Hence, observed protein stability is higher, as indicated by the slope of the protein abundance-synthesis curve (Fig 5A, lower-right panel, left part of the curves) and the higher P_L/P_S (Fig 5B, lower-right panel, the red curve). As the mRNA level increases, Sec-tRNA^Sec molecule supply becomes exhausted and P_S formation dominates. Therefore, the observed protein stability approaches the lower rate of P_S (Fig 5A, lower-right panel, right part of the curves), and P_L/P_S becomes smaller (Fig 5B, lower-right panel, the purple curve). Similarly, at low selenium concentrations, there is an abundant supply of uncharged tRNA^Sec. Thus, the amount of charged Sec-tRNA^Sec is proportional to the selenium concentration, and the amount of P_L produced is roughly proportional to Sec-tRNA^Sec supply (Fig 5B, lower-right panel, left part of the curves). When selenium concentration increases, all tRNA^Sec molecules are charged. Thus, P_L formation depends only on the amount of tRNA^Sec and becomes insensitive to selenium concentration (Fig 5B, lower-right panel, right part of the curves). Increasing mRNA levels enhance incorporation of Sec and depletion of uncharged tRNA^Sec molecules, thereby pushing saturation of the P_L/P_S ratio towards lower selenium concentrations (Fig 5B, lower-right panel).

Download:

Fig 5. Prediction of Sec incorporation efficiencies under four different constraints.

(a) Protein abundance among various protein synthesis rates and selenium concentrations was simulated using mathematical models. Four models were compared using a specific parameter set ({k₁, k₃, k_f, T_total, ρ, ρ_p} = {3, 10, 0.1, 500, 10, 100}; [m_total] = 1∼4000). (b) Simulation of Sec incorporation efficiency (P_L/P_S ratio) using parameter sets identical to (a). The caption “mRNA total” indicates the number of mRNA molecules in the model.

https://doi.org/10.1371/journal.pcbi.1005367.g005

Both the mRNA and tRNA^Sec constraints are essential to reproduce the qualitative characteristics observed from experimental data. The model with only the mRNA constraint can account for the lower translational efficiency at higher mRNA levels due to the dominance of P_S (Fig 5A, upper-right panel), in accordance with our experimental results from GPS assay (Fig 3A). However, since the tRNA^Sec supply is unlimited, the charged Sec-tRNA^Sec abundance is proportional to the selenium concentration. P_L formation is therefore linearly dependent on the selenium concentration when it is high (Fig 5B, upper-right panel), which cannot explain the results from our Western blot experiment (Fig 3D). The intervals with zero P_L/P_S reflect the regimes where charged Sec-tRNA^Sec become a limiting factor. In contrast, the model with only the tRNA^Sec constraint can recapitulate the saturation of P_L formation at high selenium concentrations due to limited tRNA^Sec supply (Fig 5B, lower-left panel), in accordance with the results from our Western blot experiment (Fig 3D). However, since free mRNA supply is unconstrained, the maximum capacity to produce P_L is quickly reached (due to limited tRNA^Sec supply), and formation of P_S dominates subsequent protein synthesis. Thus, the protein abundance-synthesis curves are straight and are collapsed into a single line for all selenium concentrations (Fig 5A, lower-left panel), which cannot explain the experimental results from our GPS assay (Fig 3A). The model without either constraint does not exhibit non-linearity in either experiment (Fig 5A and 5B, upper-left panels).

Estimation of model parameters.

The six independent parameters were connected by complex nonlinear functional relationships. We developed an algorithm to estimate the parameters that fit the functional relationships between single-cell GFP and RFP intensities from our GPS experiments. In brief, each set of parameters π gave rise to a function GFP = f_π(RFP). We defined the loss function as the square error between measured and predicted GFP values, summing over all data points: Q²(π) = ∑_i(GFP_i − f_π(RFP_i))². A grid-search algorithm was employed to find the parameter values that minimized the loss function. The procedures for data processing and parameter estimation are described in Materials and Methods.

The parameter estimation algorithm can recover parameter values from simulations.

To see how precisely our algorithm recovered the parameter values, we performed a simulation test. We generated 100 random parameter combinations (Eq 18) and simulated the corresponding RFP versus GFP data points for each parameter set. The algorithm estimated the parameter values based on simulated data points. By comparing the input and predicted parameters, we evaluated the success rate of recovering correct parameters (see Material and Methods). The success rate varied between 70–100% with the highest resolution of grid search (Table 1, the last row). The average recovery rate ranged from 64% to 76% with grid densities increasing from 1024 (4 possible values for each parameter) to 248832 (12 possible values for each parameter) within the same parameter boundary. We also introduced noise in simulated data points and assessed the parameter recovery rates from noisy data (see Material and Methods, Eq 19). Experimental data indicated that the noise of GFP values for a given RFP value is proportional to the RFP signal level, and the standard deviation of the normalized noise is about 0.3. We varied standard deviation of the noise in simulated data from 0.3 to 5 and report the recovery rate in Table 2 (see Materials and Methods). The recovery rate varied from 70% to 33% as the normalized standard deviation of noise increased from 0 to 5. The recovery rate dropped below 50% when the normalized noise standard deviation is above 1.0. These results are intuitive, as it is hard to reconstruct a model when noise exceeds the signal level.

Download:

Table 1. Parameter recovery rate under different resolutions.

https://doi.org/10.1371/journal.pcbi.1005367.t001

Download:

Table 2. Parameter recovery rate under different levels of data noise.

https://doi.org/10.1371/journal.pcbi.1005367.t002

Estimated parameter values from GPS data.

We employed the grid search algorithm to estimate the six independent parameters from the SEPHS2 GPS data. Table 3 displays the top 10 parameter sets identified by the algorithm. They are grouped into two degenerate classes of solutions. Within each class, each parameter set gives rise to the same loss function value. Among them, the highest loss function value is 2.1-fold that of the lowest one. The differences between respective k₃, T_total and ρ_p values are all within 1.5-fold. Greater differences between minimum and maximum values occurred for k₁ (1.4-fold) and ρ_p (1.5-fold). Small differences between the top-ranking parameter values obtained from a global grid search suggest their closeness to the global optimum values.

Download:

Table 3. Top ten estimated SEPHS2 parameters from experimental data.

https://doi.org/10.1371/journal.pcbi.1005367.t003

We checked how well the model derived from the top-ranking parameter values fit the experimental data. Since the scattered plots of GFP-RFP intensities of the GPS data were noisy, we show the mean of GFP values corresponding to each single RFP value (Fig 6A). The GFP-RFP curves generated by the optimum parameter values (solid circles) fit well with the experimental data (dots) at high selenium concentrations (red, black and blue colors). At the lowest selenium concentration, the model underestimates the GFP value (protein abundance) with each fixed RFP value (mRNA level) (green dots and circles). This shift is likely due to the existence of endogenous selenium in cells with little or no external selenium supply. Beyond qualitative observations in Figs 3A, 3D and 5, we also compared two quantitative scores of goodness of fit (r² and root mean square error, RMSE) among three alternative models (with mRNA and tRNA constraints alone and a combination of both constraints) of the data from GPS (S1 Table) and Western blot (S2 Table) assays.

Download:

Fig 6. Comparison of experimental and predicted Sec incorporation efficiencies in SEPHS2.

(a) The relationship between protein synthesis and abundance for P_T analyzed under various selenium concentrations. Dots denote mean GFP values for each RFP value in the experimental data and are the same as Fig 3A. Solid circles denote the same quantities from model fitting. (b) The relationship between the full length protein quantities and mRNA levels under various selenium concentrations from model prediction.

https://doi.org/10.1371/journal.pcbi.1005367.g006

We also checked whether the estimated parameter values were within biologically sensible ranges according to prior studies (Table 4). In mammalian cells, the ratios of protein synthesis and degradation rates have a broad spectrum of values, ranging from 10⁻³ to 10⁴ [29]. The SEPHS2 protein synthesis/degradation ratio calculated from our control experiments varies from 70 to 80, which falls within this range. We also estimated the possible ranges of mRNA and protein copy numbers of SEPHS2. Previous studies have reported an SEPHS2 mRNA expression level of approximately 10² molecules per cell and a protein expression level of 10³ molecules per cell [29–31] (see Material and Methods). The mRNA and protein levels in our results are all within these ranges (Fig 6B).

Download:

Table 4. Physiological ranges of the model parameters.

https://doi.org/10.1371/journal.pcbi.1005367.t004

To justify the wider applicability of the model estimation algorithm, we estimated the model parameters of SEPW1 from the experimental data (S3 Table). Similar to SEPHS2, the GFP-RFP curves of SEPW1 generated by the inferred model (S5B Fig) recapitulates the qualitative characteristics of experimental data (S5A Fig).

Comparison of Sec incorporation rates and SECIS-SBP2 binding affinity in selenoproteins.

We replaced the SECIS element of the SEPHS2 transcript with those from three other selenoprotein transcripts to investigate the role of SECIS elements on hierarchical selenoprotein expression. We estimated the model parameters of the four SECIS constructs, compared their k₁ and kF values in Table 5, and reported all the inferred parameter values in S4 Table. While all the models possess a similar level of kF, their k₁s can be separated into two groups: SEPHS2 and GPX1 have higher values (17.0 and 12.2) than SEPX1 and SELK (5.7 and 5.7) (Fig 4B). This order is compatible with the order of GFP-RFP curves in experimental data (Fig 4A and S6 Fig). Similar levels of kF are consistent with the experimental setting, as all the constructs are derived from SEPHS2 and differ only in their SECIS elements. Their RF incorporation efficiency (k₂) and RF concentration should thus be invariant. Likewise, other parameters pertaining to the processing of alternative UGA codon assignments (k₃ and T_total) also exhibit similar levels (S4 Table).

Download:

Table 5. Comparison of estimated Sec incorporation strength of four SECIS elements and SECIS-SBP2 dissociation constants.

https://doi.org/10.1371/journal.pcbi.1005367.t005

However, the order of Sec incorporation efficiency among the four SECIS elements (k₁) is not compatible with their SECIS-SBP2 binding disassociation constants (K_d in Table 5). In particular, SELK possesses the lowest disassociation constant (thus the highest SECIS-SBP2 binding affinity), yet has the lowest Sec incorporation efficiency. The order of SECIS-SBP2 binding affinity among the remaining three SECIS elements (GPX1, SEPHS2, SEPX1) is roughly compatible with the order of their k₁ values (SEPHS2, GPX1, SEPX1).

Discussion

Selenoprotein synthesis serves as a remarkable model to study how cellular and environmental factors influence the definition of a dual-use codon. We have proposed a concise mathematical model of selenoprotein synthesis that matches well with both qualitative and quantitative characteristics of experimental results. By combining the power of biological experiments and computational modeling, we have revealed how multiple cis and trans regulatory factors collectively influence the definition of UGA.

The characteristics of experimental data can be explained by the competition between RF and Sec-tRNA^Sec for UGA codons of limited selenoprotein mRNAs, as well as the limited abundance of tRNA^Sec. We formulated these two types of resource limitation as a quantitative, mechanistic model. Simulations according to this model successfully reproduced qualitative characteristics of the experimental data (Fig 5). Beyond qualitative matching, we also proposed an algorithm to estimate model parameters from experimental data. The model derived from the estimated parameters fit well with the experimental data (Fig 6A, S1 Table and S2 Table).

Previous work on the importance of SECIS-SBP2 interactions for the selenoprotein expression hierarchy remains inconclusive. Some studies have indicated that SECIS-SBP2 interactions dictate the selenoprotein hierarchy [8], whereas others have suggested that those interactions alone are insufficient to determine Sec incorporation efficiency [21, 32]. Our deduced Sec incorporation rates attributed to distinct SECIS elements did not correlate well with reported SECIS-SBP2 binding affinities (Table 4). SEPHS2 and GPX1 had substantially higher Sec incorporation rates than SEPX1 and SELK, yet the SECIS-SBP2 binding of SELK was the strongest among the four SECIS elements. Thus, we provide evidence to support the presence of other determining factors for selenoprotein hierarchy.

The order of predicted GFP-RFP curves among the four SECIS elements is consistent with the order of the corresponding experimental curves except for zero selenium concentration (S6 Fig). At zero selenium concentration, the predicted curves of all SECIS elements coincide and are considerably lower than all the experimental curves. This is likely due to the existence of residual selenium in cells even at zero external selenium supply.

The parameters in our model conform to some of the fundamental quantitative features of cell biology, such as the translation and degradation rates of proteins, incorporation rates of Sec-tRNA^Sec and RFs, and the quantities of tRNA^Sec and RFs in cells. Few of these quantities have been reported for mammalian cells, so it is not possible to verify the accuracy of the estimated parameters from existing information. Thus, a thorough verification of the estimated parameter values remains to be conducted.

The concise selenoprotein synthesis model we propose circumvents detailed mechanistic description. It is now possible to build a more detailed, mechanistic model by including all the intermediate steps in the pathway. However, introducing additional free parameters without concomitant measurements merely complicates the model with little improvement in accuracy. Importantly, in our simplified equations, we reveal the existence of a limiting factor beyond selenium concentration in Sec-tRNA^Sec synthesis. Which enzymes or substrates constitute the true limiting factor warrants further investigation. Likewise, incorporation of tRNA^Sec or RFs at a UGA site involves binding of multiple molecules [8–12, 33, 34]. Some of them could possibly be limiting factors additional to excess mRNA and tRNA^Sec supplies.

Despite Sec incorporation being a very specialized process, the process of synthesizing and degrading multiple products with shared and limited resources is ubiquitous in biochemical systems. Some instances include dichotomy between growth and production of organisms, competitive binding of transcription factors and their repressors on promoters, and biosynthesis of metabolites from multiple pathways with shared substrates. Although the models capturing those phenomena may have very different formulations than the models described in this study, the methodology we introduced may be extended to other systems with similar characteristics. Furthermore, presence of multiple exogenous and endogenous limiting factors, such as selenium, selenoprotein transcripts and tRNA^Sec in our study, may yield a more complicated system behavior than the cases with single or no limiting factors.

Materials and methods

Plasmid construction

To generate the SEPHS2 and SEPW1 GPS reporter construct, SEPHS2 and SEPW1 cDNA from the Mammalian Gene Collection (GE Healthcare Dharmacon Inc., Lafayette, CO, USA) was cloned into a lentiviral vector carrying the RFP-IRES-GFP GPS cassette using Gateway technology (Life Technologies, Carlsbad, CA, USA). To generate SEPHS2 and SEPW1 mutants that exclusively express P_L or P_S, the TGA/Sec codon on SEPHS2 and SEPW1 cDNA was mutated into TGT/Cys or TAA/stop by site-directed mutagenesis (Stratagene, Santa Clara, CA, USA), respectively.

To replace the SECIS element of SEPHS2 with that of other selenoproteins, SECIS elements of GPX1, SELK and SEPX1 were amplified from corresponding selenoprotein cDNAs and cloned into the SEPHS2 reporter using Gibson Assembly (New England Biolabs Inc., Ipswich, MA, USA).

Tissue culture

HEK293T cells were maintained in DMEM with 10% fetal bovine serum (FBS, purchased from Hyclone Laboratories, Logan, UT, USA) and antibiotics in a 6% CO₂ atmosphere at 37°C. FBS is the main source of selenium in cell culture. To control selenium supply, cells were first depleted of selenium in FBS-free DMEM supplemented with 10 μg/mL insulin and 5 μg/mL transferrin for 24 hrs. Cells were then balanced with indicated concentrations of sodium selenite (Na₂SeO₃, Sigma-Aldrich, St. Louis, MO, USA) for another 24 hrs. All tissue culture media and supplements were purchased from Gibco Life Technologies, unless otherwise indicated.

To produce lentiviruses, HEK293T cells were transfected with pHAGE, pHIV gag/pol, pVsvg, pRev and pTat using TransIT-293 reagent (Mirus Bio LLC, Madison, WI, USA). Viruses were harvested 48 hrs after transfection.

Generation of GPS reporter cell lines and GPS assays

To generate GPS reporter cell lines, cells were infected with lentiviruses carrying GPS reporter constructs. Infection was carried out in media with 8 μg/mL polybrene (Sigma-Aldrich). To collect reporter cell lines with a series of SEPHS2 synthesis levels, cells were infected stepwise with lentiviruses carrying GPS reporter constructs. To prepare samples for FACS analysis, cells were washed with PBS, trypsinized and resuspended in medium containing 2% FBS and analyzed using a BD LSR Fortessa system (BD Biosciences, San Jose, CA, USA). 10⁶ cells were recorded for each sample. FlowJo (Ashland, OR, USA) was used for primary FACS data analysis.

Western blotting

Cells were harvested in cold PBS and lysed in RIPA buffer (150 mM NaCl, 1.0% IGEPAL®CA-630, 0.5% sodium deoxycholate, 0.1% SDS, and 50 mM Tris, pH 8.0). Standard procedures were used for Western blotting. Antibody against GFP (JL-8) was purchased from Clontech Laboratories (Mountain View, CA, USA).

Data processing

The single-cell-based GPS data consists of 10⁶ pairs of RFP-GFP intensities for individual cells. The RFP-GFP relationship in each cell manifests a high level of variation. However, for each small range of RFP values, the corresponding GFP values typically have a Gaussian distribution with a variance proportional to the RFP value. Therefore, we treated the GPS data as instantiations of the following random variables: y = f(x) + ϵ, where x denotes a random variable of RFP intensities with an unspecified distribution and y denotes a random variable of GFP intensities and is a function of x with an additive noise ϵ. ϵ∼N(0,xσ²) follows a Gaussian distribution with zero mean and xσ² variance.

To reduce data noise and size, we applied two filtering procedures to the GPS data. First, we divided the range of RFP and GFP values into 2000 grids and discarded the data points in grids comprising fewer than 30 data points. Second, we sorted the RFP values and selected 0.4% data points. The processed data thereby consisted of about 3000 pairs of RFP and GFP values for each selenium concentration.

A mathematical model of selenoprotein synthesis and degradation

The basic assumptions and reactions of the model are described in the Results and illustrated in Fig 1. Here, we demonstrate the mathematical formulation of the model. We first introduce the following notations:

m_total: concentration of total selenoprotein mRNA molecules
m_f: concentration of free selenoprotein mRNA molecules not interacting with Sec-tRNA^Sec or RFs.
SeT_f: concentration of free Sec-tRNA^Sec molecules
m − SeT₀: concentration of the mRNA-Sec-tRNA^Sec complex before mRNA degradation
m − SeT: concentration of the mRNA-Sec-tRNA^Sec complex after mRNA degradation
k₁: association constant of the reaction m_f + SeT_f ⇌ m − SeT
P_L: concentration of full-length selenoproteins
γ_L: translation rate of full-length selenoproteins
λ_L: degradation rate of full-length selenoproteins
RF: concentration of RFs
m − RF₀: concentration of the mRNA-RF complex before mRNA degradation
m − RF: concentration of the mRNA-RF complex after mRNA degradation
k₂: association constant of the reaction m_f + RF ⇌ m − RF
P_S: concentration of truncated selenoproteins
γ_S: translation rate of truncated selenoproteins
λ_S: degradation rate of truncated selenoproteins
Se: selenium concentration
T: concentration of uncharged tRNA^Sec
SeT_total: concentration of charged Sec-tRNA^Sec
T_total: concentration of all tRNA^Sec molecules (charged and uncharged combined)
k₃: association constant of the reaction T + Se ⇌ SeT_total
α_L: probability that an mRNA-Sec-tRNA^Sec complex escapes mRNA degradation
α_S: probability that an mRNA-RF complex escapes mRNA degradation
e₀: background mRNA decay rate
N: average number of proteins translated from one mRNA molecule during its life

Full-length protein synthesis and degradation.

At equilibrium, m − SeT₀ is proportional to the product of m_f and SeT_f prior to mRNA degradation: (1)

A fraction of m − SeT₀ complexes are degraded by the background mRNA decay process.

(2)

(3)

Likewise, at steady state, the total amounts of translated and degraded molecules are equal: (4)

Truncated protein synthesis and degradation.

The equations for truncated protein synthesis and degradation follow those of full-length proteins by replacing Sec-tRNA^Sec with RFs: (5) (6) (7) (8) where is attributed to NMD. Derivation of α_L and α_S is described in S1 File. Since mRNA degradation can be neglected in our system, we set α_L = α_S = 1.

Sec-tRNA^Sec synthesis.

We simplified the complicated process of Sec-tRNA^Sec synthesis to a first-order reaction that depends bilinearly on selenium concentration and uncharged tRNA^Sec: (9)

mRNA constraint.

The mRNA constraint simply states that the selenoprotein mRNAs are allocated among the mRNA-Sec-tRNA^Sec complexes, mRNA-RF complexes, and free mRNAs: (10)

tRNA constraints.

There are two constraints involving tRNA^Sec. First, the total amount of charged tRNA^Sec is distributed between the Sec-tRNA^Sec molecules interacting with mRNAs and the free Sec-tRNA^Sec molecules: (11)

Second, the total amount of tRNA^Sec is distributed between charged and uncharged species: (12)

Conversion of fluorescence intensities into mRNA and protein abundance.

The GPS assay measures fluorescence intensities rather than molecular abundance. To convert the RFP and GFP intensities into mRNA and protein abundance, we introduced two additional parameters: (13)

Reduction of model parameters.

The number of parameters appearing in Eqs 1–8 can be reduced in the following way. First, we collapsed k₂ ∙ RF into a single parameter kF as they always co-occurred in the equations. Second, only the translation/degradation rate ratios γ_L/λ_L and γ_S/λ_S are relevant in our experiments. Third, those ratios can be directly determined from the control experiments with complete full-length or truncated protein synthesis (Fig 2B): , , where SP_L and SP_S denote the slopes of the GFP-RFP curves from the two control experiments. After this reduction, we can express full-length and truncated protein concentrations in the following forms: (14) (15)

Combining Eqs 10 and 11 with Eqs 1–8, we specified the dependency of free mRNA concentration with total mRNA levels: (16)

With m_f, we can express P_L and P_S in analytic forms. Hence, the function of P_total with respect to m_total can be established.

A parameter estimation algorithm

We developed a grid-search algorithm to find the parameter values that best fit the experimental data. Among the six undetermined parameters, ρ_p is an arbitrary parameter that only affects the scale of selenoprotein expression but not the behavior of the translation process in simulation. Thus, we first excluded ρ_p in the fitting algorithm and manually adjusted ρ_p after fitting. We generated grids with different combinations of parameters and calculated the fitness of the predicted (RFP, GFP) intensities generated by these parameters with the experimental results. The grids were first generated by logarithmically dividing each parameter into 12 intervals within their boundaries (the range of each parameter value is shown in Table 4). These parameter sets were applied to the mathematical model to convert RFP values into P_L and P_S in the loss function Q²: (17)

The total loss function TQ² is summed over all data points indexed by i. P_total is calculated by transforming GFP intensities using ρ_p.

The loss function has a complicated nonlinear form and thus contains many local optima. Analytic algorithms such as gradient descent will likely find suboptimal solutions whose loss is far from the global minimum. We devised a variation of the divide-and-conquer heuristic approach to alleviate this problem. We started by partitioning the log-scale range of each parameter value by coarse-grained intervals. A small number of multi-dimensional grids were generated from the partitioned parameter space. We then recursively performed the following computations: (1) evaluation of loss function values of parameter configurations on the grids, (2) selection of the top 30 parameter configurations, and (3) subdivision of the selected grids into smaller intervals. Recursion stopped when the grid sizes reached the required resolution of parameter values. The criteria for selecting the parameter configuration from the top-ranking solutions are reported in S1 File. The Matlab codes of the parameter estimation algorithm are reported in S2 File. The GPS data of SEPHS2, GPX1, SEPX1, SELK and SEPW1 are reported in S3–S7 Files respectively. The top ranking solutions of the four SECIS element constructs and SEPW1 are reported in S8 File.

Parameter estimation of simulated data

We randomly generated 100 parameter sets within each parameter boundary by the following function: (18) Where UB and LB are the upper and lower bounds, respectively, of each parameter and X is a random number uniformly distributed on the open interval (0, 1). For each parameter set, about 1000 corresponding RFP and GFP values were generated by the mathematical model. The parameter estimation algorithm was applied to the simulated data, and the estimated parameter values were compared with the parameter values from which the simulated data were generated. We also introduced additive noise to the simulated data with the following formula: (19) Where GFP_original denotes the GFP values calculated from the model. NorR is randomly drawn from a normal distribution with a mean equal to 0. The standard deviation of NorR varied from 0.3 to 5.0 (Table 2).

The performance of our algorithm was evaluated by the log10 ratios between predicted and underlying parameter values: (20) Where P_predict denotes the parameters predicted by the algorithm and P_answer are the true parameters. A parameter value prediction was labeled successful if the error of at least one of the predicted parameter set was smaller than 1 among the top 15 answers reported by the algorithm. The recovering rate indicates the ratio of successful predictions among 100 test sets.

Parameter estimation of the experimental data.

We applied the parameter estimation algorithm to about 15,000 RFP-GFP pairs measured at five selenium concentrations. For ρ_p, we manually chose the value that yielded mRNA and protein levels within normal SEPHS2 expression ranges. We referred to MOPED [31] and BioGPS [30] to get the mRNA and protein expression levels of SEPHS2 relative to ACTN1 and ACTN2, and then converted the relative SEPHS2 expression level into absolute concentration using the dataset of absolute concentrations of ACTN1 and ACTN2 [29]. We estimated that the mRNA expression level of SEPHS2 falls within the order of 10² molecules per cell and the protein expression level within 10³ molecules per cell.

Supporting information

S1 Fig. The GPS assay system.

GPS is a dual fluorescent reporter system capable of simultaneous measurement of protein synthesis, abundance and stability in single cells [25]. In the GPS system, the reporter cassette enables translation of red fluorescent protein (RFP) and green fluorescent protein (GFP) from a single transcript via cap-dependent translation, as well as translation from the internal ribosome entry site (IRES). While RFP serves as a non-degradable internal control that reports protein synthesis, GFP is fused to the N-terminus of the protein of interest (e.g., SEPHS2) and reports protein abundance. The GFP/RFP ratio represents protein stability, measuring the relative steady-state abundance between RFP and GFP-fusion proteins. Single-cell fluorescent signals were recorded using fluorescence-activated cell sorting (FACS).

https://doi.org/10.1371/journal.pcbi.1005367.s001

(TIF)

S2 Fig. The relationship between experimental and simulated SEPHS2 expression under various selenium concentrations.

Each dot denotes the GFP (proportional to total protein abundance P_T) and RFP (proportional to total mRNA quantity) values of a single cell. Each solid circle denotes the simulated GFP value under each RFP value according to the inferred model. Yellow and orange dots denote the GPS data of mutants expressing only P_L and P_S, respectively. Their (P_L and P_S) mean GFP values under each RFP value are represented by solid circles of the corresponding colors.

https://doi.org/10.1371/journal.pcbi.1005367.s002

(TIF)

S3 Fig. The effect of release factor knockdown on UGA definition.

Distributions of GFP/RFP ratios of P_T with or without shRNA-mediated knockdown of RF1.

https://doi.org/10.1371/journal.pcbi.1005367.s003

(TIF)

S4 Fig. Protein half-life and UGA definition analysis of SEPW1.

(a) Protein stability measurement of P_L or P_S by the GPS assay. P_L and P_S were expressed from SEPW1 mutant transcripts that exclusively express one form of SEPW1. (b) The relationship between protein synthesis and abundance for P_L and P_S in SEPW1 analogous to Fig 2B. (c-d) GPS analysis of P_L and P_S in SEPW1 under various selenium concentrations (c) or synthesis levels (d). Relative mRNA levels represent quantifications of the RFP signals in the GPS assay.

https://doi.org/10.1371/journal.pcbi.1005367.s004

(TIF)

S5 Fig. Comparison of experimental and predicted Sec incorporation efficiencies in SEPW1.

(a) The relationship between protein synthesis and abundance for P_T analyzed under five selenium concentrations from experimental data. The style follows S2 Fig. (b) The relationship between P_T abundance and mRNA levels under five selenium concentrations from model prediction.

https://doi.org/10.1371/journal.pcbi.1005367.s005

(TIF)

S6 Fig. Comparison of the selenoprotein hierarchy under various selenium concentrations.

The relationship between protein synthesis and abundance for P_T analyzed under four selenium concentrations for four SECIS elements. The panels on the left column (a-d) indicate the results from experimental data. The panels on the right column (e-h) indicate the predictions from the inferred models. The selenium concentrations applied are indicated on the left.

https://doi.org/10.1371/journal.pcbi.1005367.s006

(TIF)

S1 Table. Quantitative evaluation of experimental and predicted protein abundances based on observed protein synthesis levels and selenium concentrations.

https://doi.org/10.1371/journal.pcbi.1005367.s007

(DOCX)

S2 Table. Quantitative evaluation of experiment and predicted P_L/P_S ratio corresponding to the relative mRNA levels from the Western blotting assay.

https://doi.org/10.1371/journal.pcbi.1005367.s008

(DOCX)

S3 Table. Estimated parameter values of SEPW1.

https://doi.org/10.1371/journal.pcbi.1005367.s009

(DOCX)

S4 Table. Estimated parameter values of constructs of four SECIS elements.

https://doi.org/10.1371/journal.pcbi.1005367.s010

(DOCX)

S1 File. Detailed description of the data processing protocol, parameter estimation algorithm, and an augmented model for incorporating mRNA degradation.

https://doi.org/10.1371/journal.pcbi.1005367.s011

(PDF)

S2 File. The Matlab codes of the parameter estimation algorithm.

https://doi.org/10.1371/journal.pcbi.1005367.s012

(ZIP)

S3 File. The GPS data of SEPHS2.

https://doi.org/10.1371/journal.pcbi.1005367.s013

(ZIP)

S4 File. The GPS data of the GPX1 SECIS element construct.

https://doi.org/10.1371/journal.pcbi.1005367.s014

(ZIP)

S5 File. The GPS data of the SEPX1 SECIS element construct.

https://doi.org/10.1371/journal.pcbi.1005367.s015

(ZIP)

S6 File. The GPS data of the SELK SECIS element construct.

https://doi.org/10.1371/journal.pcbi.1005367.s016

(ZIP)

S7 File. The GPS data of SEPW1.

https://doi.org/10.1371/journal.pcbi.1005367.s017

(ZIP)

S8 File. The top ranking solutions of four SECIS element constructs and SEPW1.

https://doi.org/10.1371/journal.pcbi.1005367.s018

(XLSX)

Acknowledgments

We thank John O’Brien for suggestions.

Author Contributions

Conceptualization: HCSY CHY.
Data curation: YFC KNC.
Formal analysis: YFC CHY KNC.
Funding acquisition: HCSY CHY.
Investigation: HCL KNC.
Methodology: CHY HCSY.
Project administration: HCSY CHY.
Resources: HCSY CHY.
Software: YFC KNC.
Supervision: HCSY CHY.
Validation: HCL YFC.
Visualization: YFC HCL HCSY CHL.
Writing – original draft: HCSY CHY YFC HCL KNC.
Writing – review & editing: HCSY CHY YFC HCL.

References

1. Lee BJ, Worland PJ, Davis JN, Stadtman TC, Hatfield DL. Identification of a selenocysteyl-tRNA(Ser) in mammalian cells that recognizes the nonsense codon, UGA. The Journal of biological chemistry. 1989;264(17):9724–7. pmid:2498338
- View Article
- PubMed/NCBI
- Google Scholar
2. Srinivasan G, James CM, Krzycki JA. Pyrrolysine encoded by UAG in Archaea: charging of a UAG-decoding specialized tRNA. Science. 2002;296(5572):1459–62. pmid:12029131
- View Article
- PubMed/NCBI
- Google Scholar
3. Lin H-C, Ho S-C, Chen Y-Y, Khoo K-H, Hsu P-H, Yen H-CS. CRL2 aids elimination of truncated selenoproteins produced by failed UGA/Sec decoding. Science. 2015;349(6243):91–5. pmid:26138980
- View Article
- PubMed/NCBI
- Google Scholar
4. Driscoll DM, Copeland PR. Mechanism and regulation of selenoprotein synthesis. Annual review of nutrition. 2003;23:17–40. pmid:12524431
- View Article
- PubMed/NCBI
- Google Scholar
5. Hatfield DL, Gladyshev VN. How Selenium Has Altered Our Understanding of the Genetic Code. Molecular and Cellular Biology. 2002;22(11):3565–76. pmid:11997494
- View Article
- PubMed/NCBI
- Google Scholar
6. Papp LV, Lu J, Holmgren A, Khanna KK. From selenium to selenoproteins: synthesis, identity, and their role in human health. Antioxidants & redox signaling. 2007;9(7):775–806. Epub 2007/05/19. pmid:17508906
- View Article
- PubMed/NCBI
- Google Scholar
7. Allmang C, Krol A. Selenoprotein synthesis: UGA does not end the story. Biochimie. 2006;88(11):1561–71. pmid:16737768
- View Article
- PubMed/NCBI
- Google Scholar
8. Low SC, Grundner-Culemann E, Harney JW, Berry MJ. SECIS-SBP2 interactions dictate selenocysteine incorporation efficiency and selenoprotein hierarchy. The EMBO journal. 2000;19(24):6882–90. pmid:11118223
- View Article
- PubMed/NCBI
- Google Scholar
9. Copeland PR, Driscoll DM. Purification, redox sensitivity, and RNA binding properties of SECIS-binding protein 2, a protein involved in selenoprotein biosynthesis. The Journal of biological chemistry. 1999;274(36):25447–54. pmid:10464275
- View Article
- PubMed/NCBI
- Google Scholar
10. Copeland PR, Fletcher JE, Carlson BA, Hatfield DL, Driscoll DM. A novel RNA binding protein, SBP2, is required for the translation of mammalian selenoprotein mRNAs. The EMBO journal. 2000;19(2):306–14. pmid:10637234
- View Article
- PubMed/NCBI
- Google Scholar
11. Tujebajeva RM, Copeland PR, Xu XM, Carlson BA, Harney JW, Driscoll DM, et al. Decoding apparatus for eukaryotic selenocysteine insertion. EMBO Rep. 2000;1(2):158–63. pmid:11265756
- View Article
- PubMed/NCBI
- Google Scholar
12. Berry MJ, Tujebajeva RM, Copeland PR, Xu XM, Carlson BA, Martin GW 3rd, et al. Selenocysteine incorporation directed from the 3'UTR: characterization of eukaryotic EFsec and mechanistic implications. Biofactors. 2001;14(1–4):17–24. pmid:11568436
- View Article
- PubMed/NCBI
- Google Scholar
13. Nakamura Y, Ito K, Isaksson LA. Emerging understanding of translation termination. Cell. 1996;87(2):147–50. pmid:8861897
- View Article
- PubMed/NCBI
- Google Scholar
14. Kisselev L, Ehrenberg M, Frolova L. Termination of translation: interplay of mRNA, rRNAs and release factors? The EMBO journal. 2003;22(2):175–82. pmid:12514123
- View Article
- PubMed/NCBI
- Google Scholar
15. Suppmann S, Persson BC, Bock A. Dynamics and efficiency in vivo of UGA-directed selenocysteine insertion at the ribosome. The EMBO journal. 1999;18(8):2284–93. pmid:10205181
- View Article
- PubMed/NCBI
- Google Scholar
16. Kollmus H, Flohe L, McCarthy JE. Analysis of eukaryotic mRNA structures directing cotranslational incorporation of selenocysteine. Nucleic acids research. 1996;24(7):1195–201. pmid:8614619
- View Article
- PubMed/NCBI
- Google Scholar
17. Mehta A, Rebsch CM, Kinzy SA, Fletcher JE, Copeland PR. Efficiency of mammalian selenocysteine incorporation. The Journal of biological chemistry. 2004;279(36):37852–9. pmid:15229221
- View Article
- PubMed/NCBI
- Google Scholar
18. Berry MJ, Harney JW, Ohama T, Hatfield DL. Selenocysteine insertion or termination: factors affecting UGA codon fate and complementary anticodon:codon mutations. Nucleic acids research. 1994;22(18):3753–9. pmid:7937088
- View Article
- PubMed/NCBI
- Google Scholar
19. Howard MT, Carlson BA, Anderson CB, Hatfield DL. Translational redefinition of UGA codons is regulated by selenium availability. The Journal of biological chemistry. 2013;288(27):19401–13. pmid:23696641
- View Article
- PubMed/NCBI
- Google Scholar
20. Jameson RR, Diamond AM. A regulatory role for Sec tRNA[Ser]Sec in selenoprotein synthesis. RNA. 2004;10(7):1142–52. pmid:15208449
- View Article
- PubMed/NCBI
- Google Scholar
21. Latreche L, Jean-Jean O, Driscoll DM, Chavatte L. Novel structural determinants in human SECIS elements modulate the translational recoding of UGA as selenocysteine. Nucleic acids research. 2009;37(17):5868–80. pmid:19651878
- View Article
- PubMed/NCBI
- Google Scholar
22. Low SC, Harney JW, Berry MJ. Cloning and functional characterization of human selenophosphate synthetase, an essential component of selenoprotein synthesis. The Journal of biological chemistry. 1995;270(37):21659–64. pmid:7665581
- View Article
- PubMed/NCBI
- Google Scholar
23. Schomburg L, Schweizer U. Hierarchical regulation of selenoprotein expression and sex-specific effects of selenium. Biochimica et biophysica acta. 2009;1790(11):1453–62. pmid:19328222
- View Article
- PubMed/NCBI
- Google Scholar
24. Kryukov GV, Castellano S, Novoselov SV, Lobanov AV, Zehtab O, Guigo R, et al. Characterization of mammalian selenoproteomes. Science. 2003;300(5624):1439–43. pmid:12775843
- View Article
- PubMed/NCBI
- Google Scholar
25. Yen HC, Xu Q, Chou DM, Zhao Z, Elledge SJ. Global protein stability profiling in mammalian cells. Science. 2008;322(5903):918–23. pmid:18988847
- View Article
- PubMed/NCBI
- Google Scholar
26. Mansell JB, Guevremont D, Poole ES, Tate WP. A dynamic competition between release factor 2 and the tRNA(Sec) decoding UGA at the recoding site of Escherichia coli formate dehydrogenase H. The EMBO journal. 2001;20(24):7284–93. pmid:11743004
- View Article
- PubMed/NCBI
- Google Scholar
27. Baker KE, Parker R. Nonsense-mediated mRNA decay: terminating erroneous gene expression. Curr Opin Cell Biol. 2004;16(3):293–9. pmid:15145354
- View Article
- PubMed/NCBI
- Google Scholar
28. Chang YF, Imam JS, Wilkinson MF. The nonsense-mediated decay RNA surveillance pathway. Annual review of biochemistry. 2007;76:51–74. pmid:17352659
- View Article
- PubMed/NCBI
- Google Scholar
29. Schwanhausser B, Busse D, Li N, Dittmar G, Schuchhardt J, Wolf J, et al. Global quantification of mammalian gene expression control. Nature. 2011;473(7347):337–42. pmid:21593866
- View Article
- PubMed/NCBI
- Google Scholar
30. Wu C, Orozco C, Boyer J, Leglise M, Goodale J, Batalov S, et al. BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources. Genome biology. 2009;10(11):R130. pmid:19919682
- View Article
- PubMed/NCBI
- Google Scholar
31. Kolker E, Higdon R, Haynes W, Welch D, Broomall W, Lancet D, et al. MOPED: Model Organism Protein Expression Database. Nucleic acids research. 2012;40(Database issue):D1093–9. pmid:22139914
- View Article
- PubMed/NCBI
- Google Scholar
32. Fletcher JE, Copeland PR, Driscoll DM, Krol A. The selenocysteine incorporation machinery: interactions between the SECIS RNA and the SECIS-binding protein SBP2. RNA. 2001;7(10):1442–53. pmid:11680849
- View Article
- PubMed/NCBI
- Google Scholar
33. Ding F, Grabowski PJ. Identification of a protein component of a mammalian tRNA(Sec) complex implicated in the decoding of UGA as selenocysteine. RNA. 1999;5(12):1561–9. pmid:10606267
- View Article
- PubMed/NCBI
- Google Scholar
34. Xu XM, Mix H, Carlson BA, Grabowski PJ, Gladyshev VN, Berry MJ, et al. Evidence for direct roles of two additional factors, SECp43 and soluble liver antigen, in the selenoprotein synthesis machinery. The Journal of biological chemistry. 2005;280(50):41568–75. pmid:16230358
- View Article
- PubMed/NCBI
- Google Scholar
35. Moran U, Phillips R, Milo R. SnapShot: key numbers in biology. Cell. 2010;141(7):1262–e1. pmid:20603006
- View Article
- PubMed/NCBI
- Google Scholar
36. Xu XM, Carlson BA, Mix H, Zhang Y, Saira K, Glass RS, et al. Biosynthesis of selenocysteine on its tRNA in eukaryotes. PLoS biology. 2007;5(1):e4. pmid:17194211
- View Article
- PubMed/NCBI
- Google Scholar
37. Wolfe RR, Song J, Sun J, Zhang XJ. Total aminoacyl-transfer RNA pool is greater in liver than muscle in rabbits. The Journal of nutrition. 2007;137(11):2333–8. pmid:17951466
- View Article
- PubMed/NCBI
- Google Scholar
38. Iben JR, Maraia RJ. tRNA gene copy number variation in humans. Gene. 2014;536(2):376–84. pmid:24342656
- View Article
- PubMed/NCBI
- Google Scholar
39. Dittmar KA, Goodenbour JM, Pan T. Tissue-specific differences in human transfer RNA expression. PLoS genetics. 2006;2(12):e221. Epub 2006/12/30. pmid:17194224
- View Article
- PubMed/NCBI
- Google Scholar
40. King MP, Attardi G. Post-transcriptional regulation of the steady-state levels of mitochondrial tRNAs in HeLa cells. The Journal of biological chemistry. 1993;268(14):10228–37. pmid:7683672
- View Article
- PubMed/NCBI
- Google Scholar
41. Lee HW, Ryu JY, Yoo J, Choi B, Kim K, Yoon TY. Real-time single-molecule coimmunoprecipitation of weak protein-protein interactions. Nature protocols. 2013;8(10):2045–60. pmid:24071910
- View Article
- PubMed/NCBI
- Google Scholar
42. Boisvert FM, Ahmad Y, Gierlinski M, Charriere F, Lamont D, Scott M, et al. A quantitative spatial proteomics analysis of proteome turnover in human cells. Molecular & cellular proteomics: MCP. 2012;11(3):M111 011429. Epub 2011/09/23. pmid:21937730; PubMed Central PMCID: PMC3316722.
- View Article
- PubMed/NCBI
- Google Scholar
43. Donovan J, Copeland PR. Selenocysteine insertion sequence binding protein 2L is implicated as a novel post-transcriptional regulator of selenoprotein expression. PLoS One. 2012;7(4):e35581. pmid:22530054
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Lee BJ, Worland PJ, Davis JN, Stadtman TC, Hatfield DL. Identification of a selenocysteyl-tRNA(Ser) in mammalian cells that recognizes the nonsense codon, UGA. The Journal of biological chemistry. 1989;264(17):9724–7. pmid:2498338
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Srinivasan G, James CM, Krzycki JA. Pyrrolysine encoded by UAG in Archaea: charging of a UAG-decoding specialized tRNA. Science. 2002;296(5572):1459–62. pmid:12029131
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Lin H-C, Ho S-C, Chen Y-Y, Khoo K-H, Hsu P-H, Yen H-CS. CRL2 aids elimination of truncated selenoproteins produced by failed UGA/Sec decoding. Science. 2015;349(6243):91–5. pmid:26138980
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Driscoll DM, Copeland PR. Mechanism and regulation of selenoprotein synthesis. Annual review of nutrition. 2003;23:17–40. pmid:12524431
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Hatfield DL, Gladyshev VN. How Selenium Has Altered Our Understanding of the Genetic Code. Molecular and Cellular Biology. 2002;22(11):3565–76. pmid:11997494
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Papp LV, Lu J, Holmgren A, Khanna KK. From selenium to selenoproteins: synthesis, identity, and their role in human health. Antioxidants & redox signaling. 2007;9(7):775–806. Epub 2007/05/19. pmid:17508906
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. Allmang C, Krol A. Selenoprotein synthesis: UGA does not end the story. Biochimie. 2006;88(11):1561–71. pmid:16737768
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref8] 8. Low SC, Grundner-Culemann E, Harney JW, Berry MJ. SECIS-SBP2 interactions dictate selenocysteine incorporation efficiency and selenoprotein hierarchy. The EMBO journal. 2000;19(24):6882–90. pmid:11118223
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref9] 9. Copeland PR, Driscoll DM. Purification, redox sensitivity, and RNA binding properties of SECIS-binding protein 2, a protein involved in selenoprotein biosynthesis. The Journal of biological chemistry. 1999;274(36):25447–54. pmid:10464275
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref10] 10. Copeland PR, Fletcher JE, Carlson BA, Hatfield DL, Driscoll DM. A novel RNA binding protein, SBP2, is required for the translation of mammalian selenoprotein mRNAs. The EMBO journal. 2000;19(2):306–14. pmid:10637234
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref11] 11. Tujebajeva RM, Copeland PR, Xu XM, Carlson BA, Harney JW, Driscoll DM, et al. Decoding apparatus for eukaryotic selenocysteine insertion. EMBO Rep. 2000;1(2):158–63. pmid:11265756
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref12] 12. Berry MJ, Tujebajeva RM, Copeland PR, Xu XM, Carlson BA, Martin GW 3rd, et al. Selenocysteine incorporation directed from the 3'UTR: characterization of eukaryotic EFsec and mechanistic implications. Biofactors. 2001;14(1–4):17–24. pmid:11568436
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref13] 13. Nakamura Y, Ito K, Isaksson LA. Emerging understanding of translation termination. Cell. 1996;87(2):147–50. pmid:8861897
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref14] 14. Kisselev L, Ehrenberg M, Frolova L. Termination of translation: interplay of mRNA, rRNAs and release factors? The EMBO journal. 2003;22(2):175–82. pmid:12514123
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref15] 15. Suppmann S, Persson BC, Bock A. Dynamics and efficiency in vivo of UGA-directed selenocysteine insertion at the ribosome. The EMBO journal. 1999;18(8):2284–93. pmid:10205181
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref16] 16. Kollmus H, Flohe L, McCarthy JE. Analysis of eukaryotic mRNA structures directing cotranslational incorporation of selenocysteine. Nucleic acids research. 1996;24(7):1195–201. pmid:8614619
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref17] 17. Mehta A, Rebsch CM, Kinzy SA, Fletcher JE, Copeland PR. Efficiency of mammalian selenocysteine incorporation. The Journal of biological chemistry. 2004;279(36):37852–9. pmid:15229221
View Article
PubMed/NCBI
Google Scholar

[66] View Article

[67] PubMed/NCBI

[68] Google Scholar

[ref18] 18. Berry MJ, Harney JW, Ohama T, Hatfield DL. Selenocysteine insertion or termination: factors affecting UGA codon fate and complementary anticodon:codon mutations. Nucleic acids research. 1994;22(18):3753–9. pmid:7937088
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref19] 19. Howard MT, Carlson BA, Anderson CB, Hatfield DL. Translational redefinition of UGA codons is regulated by selenium availability. The Journal of biological chemistry. 2013;288(27):19401–13. pmid:23696641
View Article
PubMed/NCBI
Google Scholar

[74] View Article

[75] PubMed/NCBI

[76] Google Scholar

[ref20] 20. Jameson RR, Diamond AM. A regulatory role for Sec tRNA[Ser]Sec in selenoprotein synthesis. RNA. 2004;10(7):1142–52. pmid:15208449
View Article
PubMed/NCBI
Google Scholar

[78] View Article

[79] PubMed/NCBI

[80] Google Scholar

[ref21] 21. Latreche L, Jean-Jean O, Driscoll DM, Chavatte L. Novel structural determinants in human SECIS elements modulate the translational recoding of UGA as selenocysteine. Nucleic acids research. 2009;37(17):5868–80. pmid:19651878
View Article
PubMed/NCBI
Google Scholar

[82] View Article

[83] PubMed/NCBI

[84] Google Scholar

[ref22] 22. Low SC, Harney JW, Berry MJ. Cloning and functional characterization of human selenophosphate synthetase, an essential component of selenoprotein synthesis. The Journal of biological chemistry. 1995;270(37):21659–64. pmid:7665581
View Article
PubMed/NCBI
Google Scholar

[86] View Article

[87] PubMed/NCBI

[88] Google Scholar

[ref23] 23. Schomburg L, Schweizer U. Hierarchical regulation of selenoprotein expression and sex-specific effects of selenium. Biochimica et biophysica acta. 2009;1790(11):1453–62. pmid:19328222
View Article
PubMed/NCBI
Google Scholar

[90] View Article

[91] PubMed/NCBI

[92] Google Scholar

[ref24] 24. Kryukov GV, Castellano S, Novoselov SV, Lobanov AV, Zehtab O, Guigo R, et al. Characterization of mammalian selenoproteomes. Science. 2003;300(5624):1439–43. pmid:12775843
View Article
PubMed/NCBI
Google Scholar

[94] View Article

[95] PubMed/NCBI

[96] Google Scholar

[ref25] 25. Yen HC, Xu Q, Chou DM, Zhao Z, Elledge SJ. Global protein stability profiling in mammalian cells. Science. 2008;322(5903):918–23. pmid:18988847
View Article
PubMed/NCBI
Google Scholar

[98] View Article

[99] PubMed/NCBI

[100] Google Scholar

[ref26] 26. Mansell JB, Guevremont D, Poole ES, Tate WP. A dynamic competition between release factor 2 and the tRNA(Sec) decoding UGA at the recoding site of Escherichia coli formate dehydrogenase H. The EMBO journal. 2001;20(24):7284–93. pmid:11743004
View Article
PubMed/NCBI
Google Scholar

[102] View Article

[103] PubMed/NCBI

[104] Google Scholar

[ref27] 27. Baker KE, Parker R. Nonsense-mediated mRNA decay: terminating erroneous gene expression. Curr Opin Cell Biol. 2004;16(3):293–9. pmid:15145354
View Article
PubMed/NCBI
Google Scholar

[106] View Article

[107] PubMed/NCBI

[108] Google Scholar

[ref28] 28. Chang YF, Imam JS, Wilkinson MF. The nonsense-mediated decay RNA surveillance pathway. Annual review of biochemistry. 2007;76:51–74. pmid:17352659
View Article
PubMed/NCBI
Google Scholar

[110] View Article

[111] PubMed/NCBI

[112] Google Scholar

[ref29] 29. Schwanhausser B, Busse D, Li N, Dittmar G, Schuchhardt J, Wolf J, et al. Global quantification of mammalian gene expression control. Nature. 2011;473(7347):337–42. pmid:21593866
View Article
PubMed/NCBI
Google Scholar

[114] View Article

[115] PubMed/NCBI

[116] Google Scholar

[ref30] 30. Wu C, Orozco C, Boyer J, Leglise M, Goodale J, Batalov S, et al. BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources. Genome biology. 2009;10(11):R130. pmid:19919682
View Article
PubMed/NCBI
Google Scholar

[118] View Article

[119] PubMed/NCBI

[120] Google Scholar

[ref31] 31. Kolker E, Higdon R, Haynes W, Welch D, Broomall W, Lancet D, et al. MOPED: Model Organism Protein Expression Database. Nucleic acids research. 2012;40(Database issue):D1093–9. pmid:22139914
View Article
PubMed/NCBI
Google Scholar

[122] View Article

[123] PubMed/NCBI

[124] Google Scholar

[ref32] 32. Fletcher JE, Copeland PR, Driscoll DM, Krol A. The selenocysteine incorporation machinery: interactions between the SECIS RNA and the SECIS-binding protein SBP2. RNA. 2001;7(10):1442–53. pmid:11680849
View Article
PubMed/NCBI
Google Scholar

[126] View Article

[127] PubMed/NCBI

[128] Google Scholar

[ref33] 33. Ding F, Grabowski PJ. Identification of a protein component of a mammalian tRNA(Sec) complex implicated in the decoding of UGA as selenocysteine. RNA. 1999;5(12):1561–9. pmid:10606267
View Article
PubMed/NCBI
Google Scholar

[130] View Article

[131] PubMed/NCBI

[132] Google Scholar

[ref34] 34. Xu XM, Mix H, Carlson BA, Grabowski PJ, Gladyshev VN, Berry MJ, et al. Evidence for direct roles of two additional factors, SECp43 and soluble liver antigen, in the selenoprotein synthesis machinery. The Journal of biological chemistry. 2005;280(50):41568–75. pmid:16230358
View Article
PubMed/NCBI
Google Scholar

[134] View Article

[135] PubMed/NCBI

[136] Google Scholar

[ref35] 35. Moran U, Phillips R, Milo R. SnapShot: key numbers in biology. Cell. 2010;141(7):1262–e1. pmid:20603006
View Article
PubMed/NCBI
Google Scholar

[138] View Article

[139] PubMed/NCBI

[140] Google Scholar

[ref36] 36. Xu XM, Carlson BA, Mix H, Zhang Y, Saira K, Glass RS, et al. Biosynthesis of selenocysteine on its tRNA in eukaryotes. PLoS biology. 2007;5(1):e4. pmid:17194211
View Article
PubMed/NCBI
Google Scholar

[142] View Article

[143] PubMed/NCBI

[144] Google Scholar

[ref37] 37. Wolfe RR, Song J, Sun J, Zhang XJ. Total aminoacyl-transfer RNA pool is greater in liver than muscle in rabbits. The Journal of nutrition. 2007;137(11):2333–8. pmid:17951466
View Article
PubMed/NCBI
Google Scholar

[146] View Article

[147] PubMed/NCBI

[148] Google Scholar

[ref38] 38. Iben JR, Maraia RJ. tRNA gene copy number variation in humans. Gene. 2014;536(2):376–84. pmid:24342656
View Article
PubMed/NCBI
Google Scholar

[150] View Article

[151] PubMed/NCBI

[152] Google Scholar

[ref39] 39. Dittmar KA, Goodenbour JM, Pan T. Tissue-specific differences in human transfer RNA expression. PLoS genetics. 2006;2(12):e221. Epub 2006/12/30. pmid:17194224
View Article
PubMed/NCBI
Google Scholar

[154] View Article

[155] PubMed/NCBI

[156] Google Scholar

[ref40] 40. King MP, Attardi G. Post-transcriptional regulation of the steady-state levels of mitochondrial tRNAs in HeLa cells. The Journal of biological chemistry. 1993;268(14):10228–37. pmid:7683672
View Article
PubMed/NCBI
Google Scholar

[158] View Article

[159] PubMed/NCBI

[160] Google Scholar

[ref41] 41. Lee HW, Ryu JY, Yoo J, Choi B, Kim K, Yoon TY. Real-time single-molecule coimmunoprecipitation of weak protein-protein interactions. Nature protocols. 2013;8(10):2045–60. pmid:24071910
View Article
PubMed/NCBI
Google Scholar

[162] View Article

[163] PubMed/NCBI

[164] Google Scholar

[ref42] 42. Boisvert FM, Ahmad Y, Gierlinski M, Charriere F, Lamont D, Scott M, et al. A quantitative spatial proteomics analysis of proteome turnover in human cells. Molecular & cellular proteomics: MCP. 2012;11(3):M111 011429. Epub 2011/09/23. pmid:21937730; PubMed Central PMCID: PMC3316722.
View Article
PubMed/NCBI
Google Scholar

[166] View Article

[167] PubMed/NCBI

[168] Google Scholar

[ref43] 43. Donovan J, Copeland PR. Selenocysteine insertion sequence binding protein 2L is implicated as a novel post-transcriptional regulator of selenoprotein expression. PLoS One. 2012;7(4):e35581. pmid:22530054
View Article
PubMed/NCBI
Google Scholar

[170] View Article

[171] PubMed/NCBI

[172] Google Scholar

Figures

Abstract

Author summary

Introduction

Results

Experimental results

Inferring UGA definition using differential protein half-lives between full-length and UGA-terminated selenoproteins.

Characterization of UGA assignments in SEPHS2 and SEPW1 syntheses.

Comparison of UGA assignments with four SECIS elements.

Computational modeling results

A mathematical model of the Sec incorporation process.

Basic reactions and hypotheses.

Recapitulation of the qualitative characteristics of selenoprotein synthesis and degradation.

Estimation of model parameters.

The parameter estimation algorithm can recover parameter values from simulations.

Estimated parameter values from GPS data.

Comparison of Sec incorporation rates and SECIS-SBP2 binding affinity in selenoproteins.

Discussion

Materials and methods

Plasmid construction

Tissue culture

Generation of GPS reporter cell lines and GPS assays

Western blotting

Data processing

A mathematical model of selenoprotein synthesis and degradation

Full-length protein synthesis and degradation.

Truncated protein synthesis and degradation.

Sec-tRNASec synthesis.

mRNA constraint.

tRNA constraints.

Conversion of fluorescence intensities into mRNA and protein abundance.

Reduction of model parameters.

A parameter estimation algorithm

Parameter estimation of simulated data

Parameter estimation of the experimental data.

Supporting information

S1 Fig. The GPS assay system.

S2 Fig. The relationship between experimental and simulated SEPHS2 expression under various selenium concentrations.

S3 Fig. The effect of release factor knockdown on UGA definition.

S4 Fig. Protein half-life and UGA definition analysis of SEPW1.

S5 Fig. Comparison of experimental and predicted Sec incorporation efficiencies in SEPW1.

S6 Fig. Comparison of the selenoprotein hierarchy under various selenium concentrations.

S1 Table. Quantitative evaluation of experimental and predicted protein abundances based on observed protein synthesis levels and selenium concentrations.

S2 Table. Quantitative evaluation of experiment and predicted PL/PS ratio corresponding to the relative mRNA levels from the Western blotting assay.

S3 Table. Estimated parameter values of SEPW1.

S4 Table. Estimated parameter values of constructs of four SECIS elements.

S1 File. Detailed description of the data processing protocol, parameter estimation algorithm, and an augmented model for incorporating mRNA degradation.

S2 File. The Matlab codes of the parameter estimation algorithm.

S3 File. The GPS data of SEPHS2.

S4 File. The GPS data of the GPX1 SECIS element construct.

S5 File. The GPS data of the SEPX1 SECIS element construct.

S6 File. The GPS data of the SELK SECIS element construct.

S7 File. The GPS data of SEPW1.

S8 File. The top ranking solutions of four SECIS element constructs and SEPW1.

Acknowledgments

Author Contributions

References

Sec-tRNA^Sec synthesis.

S2 Table. Quantitative evaluation of experiment and predicted P_L/P_S ratio corresponding to the relative mRNA levels from the Western blotting assay.