接觸序

蛋白質的接觸序(contact order,CO)是蛋白質天然狀態三級結構中氨基酸間接觸位置的量度。它計算方法為摺疊蛋白質中形成天然接觸的殘基之間的平均序列距離除以蛋白質的總長度。較高的接觸序表示較長的摺疊時間,^[1] ^[2]並且低接觸階數已被建議作為潛在下坡摺疊或在沒有自由能壘的情況下發生的蛋白質摺疊的預測指標。CO模型的提出表明蛋白質的摺疊速率主要由它的天然態結構 (native state) 所決定。一般來說, α螺旋蛋白要比β蛋白摺疊得快，用CO的理論解釋是因為α螺旋蛋白中含有大量的局部有效接觸，所以有較低的CO值、較高的摺疊速率。接觸序的理論說明, 摺疊成有效接觸多的三維結構, 這樣的蛋白質三維結構能量更穩定, 摺疊速率快。

接觸序 (CO) 的正式定義為：

CO={1 \over {L\cdot N}}\sum ^{N}\Delta S_{i,j}

其中N是接觸總數，Δ Si,j是序列分離，即在氨基酸殘基i和j之間的序列差， L是蛋白質中的殘基總數。^[1]對於單結構域蛋白質，接觸序的值通常在 5% 到 25% 之間，較低的接觸序主要屬於螺旋蛋白質，較高的接觸序屬於具有高 β-片層含量的蛋白質。

蛋白質結構預測方法在預測具有低接觸序的蛋白質結構方面更準確。這可能部分是因為低接觸序的蛋白質往往很小，但可能是由於在最小化能量函數的全局優化過程中要考慮的可能的遠程殘基-殘基相互作用的數量較少。^[3]與在實驗確定的蛋白質結構中觀察到的分佈相比，即使是Rosetta方法等成功的結構預測方法也會過度產生低接觸階結構預測。^[2]

原生摺疊接觸序的百分比也可用作摺疊過渡狀態「原生性」的量度。結合分子動力學的Phi 值分析產生了過渡態模型，其接觸序接近於小且快速摺疊的蛋白質中的摺疊狀態。^[4]此外，過渡狀態下的接觸序以及原始狀態下的接觸序與整體摺疊時間高度相關。^[5]

除了它們在結構預測中的作用外，還可以根據序列比對接觸序本身進行預測，這可用於對與已知序列具有某種程度同源性的新序列的摺疊進行分類。 ^[6]

參考

^ ^1.0 ^1.1 Plaxco, Kevin W; Simons, Kim T; Baker, David. Contact order, transition state placement and the refolding rates of single domain proteins. Journal of Molecular Biology. April 1998, 277 (4): 985–994. PMID 9545386. doi:10.1006/jmbi.1998.1645.
^ ^2.0 ^2.1 Bonneau, Richard; Ruczinski, Ingo; Tsai, Jerry; Baker, David. Contact order and ab initio protein structure prediction. Protein Science. August 2002, 11 (8): 1937–1944. PMC 2373674 . PMID 12142448. doi:10.1110/ps.3790102.
^ Mount DM. (2004). Bioinformatics: Sequence and Genome Analysis 2nd ed. Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY.
^ Pandit, AD; Jha, A; Freed, KF; Sosnick, TR. Small proteins fold through transition states with native-like topologies. J Mol Biol. 2006, 361 (4): 755–70. PMID 16876194. doi:10.1016/j.jmb.2006.06.041.
^ Paci, E; Lindorff-Larsen, K; Dobson, CM; Karplus, M; Vendruscolo, M. Transition state contact orders correlate with protein folding rates. J Mol Biol. 2005, 352 (3): 495–500. PMID 16120445. doi:10.1016/j.jmb.2005.06.081.
^ Shi, Yi; Zhou, Jianjun; Arndt, David; Wishart, David S.; Lin, Guohui. Protein contact order prediction from primary sequences. BMC Bioinformatics. 2008, 9: 255. PMC 2440764 . PMID 18513429. doi:10.1186/1471-2105-9-255.

[plaxco-1] 1.0 ^1.1 Plaxco, Kevin W; Simons, Kim T; Baker, David. Contact order, transition state placement and the refolding rates of single domain proteins. Journal of Molecular Biology. April 1998, 277 (4): 985–994. PMID 9545386. doi:10.1006/jmbi.1998.1645.

[Bonneau-2] 2.0 ^2.1 Bonneau, Richard; Ruczinski, Ingo; Tsai, Jerry; Baker, David. Contact order and ab initio protein structure prediction. Protein Science. August 2002, 11 (8): 1937–1944. PMC 2373674 . PMID 12142448. doi:10.1110/ps.3790102.

[Mount-3] Mount DM. (2004). Bioinformatics: Sequence and Genome Analysis 2nd ed. Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY.

[Pandit-4] Pandit, AD; Jha, A; Freed, KF; Sosnick, TR. Small proteins fold through transition states with native-like topologies. J Mol Biol. 2006, 361 (4): 755–70. PMID 16876194. doi:10.1016/j.jmb.2006.06.041.

[Paci-5] Paci, E; Lindorff-Larsen, K; Dobson, CM; Karplus, M; Vendruscolo, M. Transition state contact orders correlate with protein folding rates. J Mol Biol. 2005, 352 (3): 495–500. PMID 16120445. doi:10.1016/j.jmb.2005.06.081.

[Shi-6] Shi, Yi; Zhou, Jianjun; Arndt, David; Wishart, David S.; Lin, Guohui. Protein contact order prediction from primary sequences. BMC Bioinformatics. 2008, 9: 255. PMC 2440764 . PMID 18513429. doi:10.1186/1471-2105-9-255.

[1]

[2]

[3]

[4]

[5]

[6]