Difference between revisions of "Formation of homo-dimer R2"

From ISMOC
Jump to: navigation, search
(Parameters with uncertainty)
(Parameters)
 
(19 intermediate revisions by the same user not shown)
Line 26: Line 26:
 
|-
 
|-
 
|<math>K_{d6}</math>
 
|<math>K_{d6}</math>
|<math>35.3-114</math> <ref name="Majka2001"> [http://www.jbc.org/content/276/9/6243.full.pdf Majka J, Zakrzewska-Czerwiñska J, Messer W. ''Sequence recognition, cooperative interaction, and dimerization of the initiator protein DnaA of Streptomyces.'' J Biol Chem. 2001;276(9):6243-52.]</ref>   
+
|<math>2.58-42.9</math> <ref name="Majka2001"> [http://www.jbc.org/content/276/9/6243.full.pdf Majka J, Zakrzewska-Czerwiñska J, Messer W. ''Sequence recognition, cooperative interaction, and dimerization of the initiator protein DnaA of Streptomyces.'' J Biol Chem. 2001;276(9):6243-52.]</ref>   
 
| <math>nM</math>
 
| <math>nM</math>
 
| N/A
 
| N/A
Line 33: Line 33:
 
[[Image:Kd6-text.png|center|thumb|700px|Majka et al. 2001<ref name="Majka2001"></ref>]]
 
[[Image:Kd6-text.png|center|thumb|700px|Majka et al. 2001<ref name="Majka2001"></ref>]]
  
These values agree with Ozbabacan et al. <ref name="Saliha2011"> [http://peds.oxfordjournals.org/content/early/2011/06/15/protein.gzr025.full.pdf Saliha Ece Acuner Ozbabacan, Hatice Billur Engin, Attila Gursoy, and Ozlem Keskin. ''Transient protein–protein interactions.'' Protein Engineering, Design and Selection first published online June 15, 2011]</ref> who state that strong protein-protein interactions such as homodimerization have equilibrium dissociation constants <math> < 10^{-6} M</math> and mostly in the nanomolar range.
+
These values agree with Ozbabacan et al. <ref name="Saliha2011"> [http://peds.oxfordjournals.org/content/early/2011/06/15/protein.gzr025.full.pdf Saliha Ece Acuner Ozbabacan, Hatice Billur Engin, Attila Gursoy, and Ozlem Keskin. ''Transient protein–protein interactions.'' Protein Engineering, Design and Selection first published online June 15, 2011]</ref> and Nooren et al. <ref name="Nooren2003"> [http://ac.els-cdn.com/S0022283602012810/1-s2.0-S0022283602012810-main.pdf?_tid=15a3bb1a-6249-11e7-8218-00000aacb35d&acdnat=1499345349_1bcdcd4ec9fa588ea3b6c1e04a2189c1 Irene M.A Nooren, Janet M. Thornton. ''Structural Characterisation and Functional Significance of Transient Protein–Protein Interactions'' Journal of Molecular Biology 2003;325(5)]</ref> who state that strong protein-protein interactions such as homodimerization have equilibrium dissociation constants <math> < 10^{-6} M</math> and mostly in the nanomolar range.
 +
As the protein homodimer is quite stable, we will focus more on the lower <math>K_{d}</math> values (<43nM).
 
|-
 
|-
 
|<math>k^{-}_{6}</math>
 
|<math>k^{-}_{6}</math>
|<math>2.12-6.84</math> <ref name="Janin1997"> [http://onlinelibrary.wiley.com/doi/10.1002/(SICI)1097-0134(199706)28:2%3C153::AID-PROT4%3E3.0.CO;2-G/epdf Janin, Joel. ''The kinetics of protein-protein recognition.'' Proteins-Structure Function and Bioinformatics (1997): 153-161.]</ref> <ref name="Northrup1992"> [http://www.pnas.org/content/89/8/3338.full.pdf Northrup S.H. and Erickson H.P. ''Kinetics of protein-protein association explained by Brownian dynamics computer simulation.''PNAS 1992;89(8),3338-3342]</ref>
+
|<math>0.144-51.2</math> <ref name="Janin1997"> [http://onlinelibrary.wiley.com/doi/10.1002/(SICI)1097-0134(199706)28:2%3C153::AID-PROT4%3E3.0.CO;2-G/epdf Janin, Joel. ''The kinetics of protein-protein recognition.'' Proteins-Structure Function and Bioinformatics (1997): 153-161.]</ref> <ref name="Northrup1992"> [http://www.pnas.org/content/89/8/3338.full.pdf Northrup S.H. and Erickson H.P. ''Kinetics of protein-protein association explained by Brownian dynamics computer simulation.''PNAS 1992;89(8),3338-3342]</ref>
 
|<math>min^{-1}</math>
 
|<math>min^{-1}</math>
 
| N/A
 
| N/A
Line 42: Line 43:
 
[[Image:K3-text.png|center|thumb|350px|Northrup et al. 1992<ref name="Northrup1992"></ref>]]
 
[[Image:K3-text.png|center|thumb|350px|Northrup et al. 1992<ref name="Northrup1992"></ref>]]
  
Therefore, the range <math>0.5-5 \cdot 10^{6} M^{-1}s^{-1} (0.03-0.3 nM^{-1} min^{-1})</math> is used to generate the probability distribution of <math>k_{on6}</math> as described in the following section.
+
Additionally, according to Ozbabacan et al. the binding rates can reach <math>10^{9} M^{-1}s^{-1}</math> in fast binding reactions.
 +
[[Image:K3-text2.png|center|thumb|350px|Ozbabacan et al. 2011<ref name="Saliha2011"></ref>]]
  
Afterwards, the log-normal distribution for the dissociation rate <math>k^{-}_{3}</math> of the ScbR homo-dimer formation is derived from the distributions of <math>K_{d6}</math> and <math>k_{on6}</math>.
+
Therefore, the range <math>0.5 \cdot 10^{6} M^{-1}s^{-1}-10^{9} M^{-1}s^{-1} (0.03-60 nM^{-1} min^{-1})</math> is used to generate the probability distribution of <math>k_{on6}</math> as described in the following section.
 +
 
 +
Afterwards, the log-normal distribution for the dissociation rate <math>k^{-}_{6}</math> of the ScbR homo-dimer formation is derived from the distributions of <math>K_{d6}</math> and <math>k_{on6}</math>.
 
|}
 
|}
  
 
==Parameters with uncertainty==
 
==Parameters with uncertainty==
  
When deciding how to describe the uncertainty for this parameter we must take into consideration that the values reported in literature correspond to ''in vitro'' testing of different protein-protein interaction and dimerization reactions than ScbR, although they refer to another ''Streptomyces'' protein (DnaA). This means that there might be a difference between actual parameter values and the ones reported in literature. These facts influence the quantification of the parameter uncertainty and therefore the shape of the corresponding distributions.
+
When deciding how to describe the uncertainty for this parameter we must take into consideration that the values reported in literature correspond to ''in vitro'' testing of different protein-protein interaction and dimerization reactions than ScbR, although they refer to another ''Streptomyces'' protein (DnaA). This means that there might be a difference between actual parameter values and the ones reported in literature. These facts influence the quantification of the parameter uncertainty and therefore the shape of the corresponding distributions. By assigning the appropriate weights to the parameter values and using the method described [[Quantification of parameter uncertainty#Design of probability distributions|'''here''']], the appropriate probability distributions were designed.
  
More specifically, the weight of the sampling is kept at <math> 30 nM </math> (value that corresponds to the wild type protein homodimerization) which is set as the mode of the log-normal distribution for the  <math>K_{d6}</math>. However, we wish to explore the full nanomolar scale when sampling for parameter values and therefore the confidence interval factor is set to <math> 30 </math>. In this way, the range where 95.45% of the values are found is between <math>1</math> and <math>900 nM </math>.
+
More specifically, for <math>K_{d6}</math> the mode of the log-normal distribution is <math> 3.89 nM </math> (value that is close to the wild type protein homodimerization). However, in order to explore sa larger part of the nanomolar scale when sampling for parameter values, the Spread is calculated to be <math> 1.9 </math>. In this way, the range where 68.27% of the values are found is between <math>2</math> and <math>7.4nM </math>.
  
With regards to the parameter <math>k_{on6}</math>, in order to explore the full range of plausible values, the mode of the log-normal distribution is set to <math>0.1 nM ^{-1 } min^{-1}</math> and the confidence interval factor is <math> 3 </math>. Thus the range where 95.45% of the values are found is between <math>0.0333</math> and <math>0.3</math> <math>nM ^{-1 } min^{-1}</math>.
+
With regards to the parameter <math>k_{on6}</math>, in order to explore the full range of plausible values, the mode of the log-normal distribution is set to <math>0.3 nM ^{-1 } min^{-1}</math> and the Spread is <math> 15.1 </math>. Thus the range where 68.27% of the values are found is between <math>0.02</math> and <math>4.66</math> <math>nM ^{-1 } min^{-1}</math>.
  
Since the two parameters are interdependent, thermodynamic consistency also needs to be taken into account. This is achieved by creating a bivariate system as described [[Welcome to the In-Silico Model of γ-butyrolactone regulation in Streptomyces coelicolor#Parameter Overview|'''here''']]. Since no information was retrieved for <math>k^{-}_{6}</math> and therefore is the parameter with the largest geometric coefficient of variation, this is set as the dependent parameter as per: <math>k^{-}_{6}=k_{on6} \cdot K_{d6}</math>. The location and scale parameters of <math>k^{-}_{6}</math> (μ=2.7438 and σ=1.2827) were calculated from those of <math>K_{d3}</math> and <math>k_{on3}</math>.  
+
Since the two parameters are interdependent, thermodynamic consistency also needs to be taken into account. This is achieved by creating a bivariate system as described [[Quantification of parameter uncertainty#Parameter dependency and thermodynamic consistency|'''here''']]. Since no information was retrieved for <math>k^{-}_{6}</math> and therefore is the parameter with the largest geometric coefficient of variation, this is set as the dependent parameter as per: <math>k^{-}_{6}=k_{on6} \cdot K_{d6}</math>. The location and scale parameters of <math>k^{-}_{6}</math> (μ=2.5303 and σ=1.5324) were calculated from those of <math>K_{d6}</math> and <math>k_{on6}</math>.  
  
 
The probability distributions for the two parameters, adjusted accordingly in order to reflect the above values, are the following:
 
The probability distributions for the two parameters, adjusted accordingly in order to reflect the above values, are the following:
  
[[Image:Kd6.png|500px]] [[Image:K6.png|500px]] [[Image:Kon6.png|500px]]
+
[[Image:KD6u.png|500px]] [[Image:K6u.png|500px]] [[Image:Kon6u.png|500px]]
  
The correlation matrix which is necessary to define the relationship between the two marginal distributions (<math>k_{d3}</math>,<math>k^{-}_{3}</math>) of the bivariate system is derived by employing random values generated by the two distributions.  
+
The values retrieved from literature and their weights are indicated by the blue dashed lines, and the uncertainty for each value is indicated using the reported experimental error (green lines) or a default value of 10% error (orange lines). The correlation matrix which is necessary to define the relationship between the two marginal distributions (<math>k_{d6}</math>,<math>k^{-}_{6}</math>) of the bivariate system is derived by employing random values generated by the two distributions.  
  
The parameters of the distributions of the multivariate system are:
+
The parameter information of the distributions of the multivariate system is:
 
{|class="wikitable"
 
{|class="wikitable"
 
! Parameter
 
! Parameter
 +
! Mode
 +
! Spread
 
! μ
 
! μ
 
! σ
 
! σ
Line 71: Line 77:
 
|-
 
|-
 
|<math>k_{on6}</math>
 
|<math>k_{on6}</math>
|<math>-2.058</math>
+
|<math>0.307</math>
|<math>0.4947</math>
+
|<math>15.1</math>
 +
|<math>0.85865</math>
 +
|<math>1.4273</math>
 
|N/A
 
|N/A
 
|-
 
|-
 
|<math>K_{d6}</math>
 
|<math>K_{d6}</math>
|<math>4.8018</math>
+
|<math>3.89</math>
|<math>1.1835</math>
+
|<math>1.9</math>
|rowspan="2"|<math>\begin{pmatrix} 1 & 0.8444 \\
+
|<math>1.6716</math>
0.8444 & 1 \end{pmatrix}</math>
+
|<math>0.55779</math>
 +
|rowspan="2"|<math>\begin{pmatrix} 1 & 0.2176 \\
 +
0.2176 & 1 \end{pmatrix}</math>
 
|-
 
|-
 
|<math>k^{-}_{6}</math>
 
|<math>k^{-}_{6}</math>
|<math>2.7438</math>
+
|N/A
|<math>1.2827</math>
+
|N/A
 +
|<math>2.5303</math>
 +
|<math>1.5324</math>
 
|}
 
|}
  
 
The multivariate system of the normal distributions (<math>ln(k_{d6})</math> and <math>ln(k^{-}_{6})</math>) and the resulting samples of values are presented in the following figure:
 
The multivariate system of the normal distributions (<math>ln(k_{d6})</math> and <math>ln(k^{-}_{6})</math>) and the resulting samples of values are presented in the following figure:
  
[[Image:Multidist7.png|800px]]
+
[[Image:Multidist7.png|500px]]
  
 
In this way, a system of distributions is created where each distribution is described and constrained by the other two. Therefore, the parameters will be sampled by the two marginal distributions in a way consistent with our beliefs and with the relevant thermodynamic constraints.
 
In this way, a system of distributions is created where each distribution is described and constrained by the other two. Therefore, the parameters will be sampled by the two marginal distributions in a way consistent with our beliefs and with the relevant thermodynamic constraints.

Latest revision as of 16:39, 4 January 2018

Two ScbR (R) proteins bind together to form an ScbR homo-dimer (R2).

Go back to overview
About this image

Chemical equation

2R \rightleftharpoons R_{2}

Rate equation

 r= \frac{k^{-}_{6}}{K_{d6}}\cdot [R]^{2} - k^{-}_{6}\cdot [R_{2}]

Parameters

The parameters of this reaction are the dissociation constant for binding of one ScbR to another (K_{d6}) and the dissociation rate for binding of one ScbR to another (k^{-}_{6}).

Name Value Units Value in previous GBL models [1] [2] Remarks-Reference
K_{d6} 2.58-42.9 [3] nM N/A Majka et al. published a study on dimerization of the initiator Protein DnaA of Streptomyces and on its mutants, where they report dissociation constants in the range 35.3-114 nM.
Majka et al. 2001[3]

These values agree with Ozbabacan et al. [4] and Nooren et al. [5] who state that strong protein-protein interactions such as homodimerization have equilibrium dissociation constants  < 10^{-6} M and mostly in the nanomolar range. As the protein homodimer is quite stable, we will focus more on the lower K_{d} values (<43nM).

k^{-}_{6} 0.144-51.2 [6] [7] min^{-1} N/A According to Northrup et al. the k_{a} of protein-protein bond formations occur in the order of 10^{6} M^{-1}s^{-1}.
Northrup et al. 1992[7]

Additionally, according to Ozbabacan et al. the binding rates can reach 10^{9} M^{-1}s^{-1} in fast binding reactions.

Ozbabacan et al. 2011[4]

Therefore, the range 0.5 \cdot 10^{6} M^{-1}s^{-1}-10^{9} M^{-1}s^{-1} (0.03-60 nM^{-1} min^{-1}) is used to generate the probability distribution of k_{on6} as described in the following section.

Afterwards, the log-normal distribution for the dissociation rate k^{-}_{6} of the ScbR homo-dimer formation is derived from the distributions of K_{d6} and k_{on6}.

Parameters with uncertainty

When deciding how to describe the uncertainty for this parameter we must take into consideration that the values reported in literature correspond to in vitro testing of different protein-protein interaction and dimerization reactions than ScbR, although they refer to another Streptomyces protein (DnaA). This means that there might be a difference between actual parameter values and the ones reported in literature. These facts influence the quantification of the parameter uncertainty and therefore the shape of the corresponding distributions. By assigning the appropriate weights to the parameter values and using the method described here, the appropriate probability distributions were designed.

More specifically, for K_{d6} the mode of the log-normal distribution is  3.89 nM (value that is close to the wild type protein homodimerization). However, in order to explore sa larger part of the nanomolar scale when sampling for parameter values, the Spread is calculated to be  1.9 . In this way, the range where 68.27% of the values are found is between 2 and 7.4nM .

With regards to the parameter k_{on6}, in order to explore the full range of plausible values, the mode of the log-normal distribution is set to 0.3 nM ^{-1 } min^{-1} and the Spread is  15.1 . Thus the range where 68.27% of the values are found is between 0.02 and 4.66 nM ^{-1 } min^{-1}.

Since the two parameters are interdependent, thermodynamic consistency also needs to be taken into account. This is achieved by creating a bivariate system as described here. Since no information was retrieved for k^{-}_{6} and therefore is the parameter with the largest geometric coefficient of variation, this is set as the dependent parameter as per: k^{-}_{6}=k_{on6} \cdot K_{d6}. The location and scale parameters of k^{-}_{6} (μ=2.5303 and σ=1.5324) were calculated from those of K_{d6} and k_{on6}.

The probability distributions for the two parameters, adjusted accordingly in order to reflect the above values, are the following:

KD6u.png K6u.png Kon6u.png

The values retrieved from literature and their weights are indicated by the blue dashed lines, and the uncertainty for each value is indicated using the reported experimental error (green lines) or a default value of 10% error (orange lines). The correlation matrix which is necessary to define the relationship between the two marginal distributions (k_{d6},k^{-}_{6}) of the bivariate system is derived by employing random values generated by the two distributions.

The parameter information of the distributions of the multivariate system is:

Parameter Mode Spread μ σ Correlation matrix
k_{on6} 0.307 15.1 0.85865 1.4273 N/A
K_{d6} 3.89 1.9 1.6716 0.55779 \begin{pmatrix} 1 & 0.2176 \\
0.2176  & 1 \end{pmatrix}
k^{-}_{6} N/A N/A 2.5303 1.5324

The multivariate system of the normal distributions (ln(k_{d6}) and ln(k^{-}_{6})) and the resulting samples of values are presented in the following figure:

Multidist7.png

In this way, a system of distributions is created where each distribution is described and constrained by the other two. Therefore, the parameters will be sampled by the two marginal distributions in a way consistent with our beliefs and with the relevant thermodynamic constraints.

References