The Putative Secondary Structure of the putative coronavirus nsp1
 
1.氨基酸组成:(%)
%A: 7.4 %C: 3.0 %D: 5.1 %E: 6.0 %F: 4.8 %M: 2.3 %N: 4.8 %P: 3.9 %Q: 2.9 %R: 2.9
%G: 5.5 %H: 2.0 %I: 4.9 %K: 6.1 %L: 9.6 %S: 7.3 %T: 7.7 %V: 8.2 %W: 0.9 %Y: 4.7
 
2.Helix,Sheet等的预测结果:
方法:The PredictProtein server提交多肽序列.预测实现方法参见Rost B(1996)
结果:结果包括两个部分,Fig1为图象化以后的结果图,直接的预测结果请参看这里
image
Fig1.Helix,Strand预测结果
 
3.Motif预测:
方法:在prosit中通过序列比对,找到蛋白中可能含有的序列的motif.
结果:下表中列出了改序列中可能含有的motif.其中,数字表示motif的第一个氨基酸在整个序列中的位置,后面的字母表示氨基酸序列.
 
N-glycosylation site
Casein kinase II phosphorylation site
Protein kinase C phosphorylation site
98 NFSS
810 NHTK
878 NKTV
1081 NASF
1261 NTSI
1431 NSSN
1689 NNTK
1867 NVSL
7 TFGE
12 TVWE
47 SGTE
135 TEDD
171 SEIE
180 TPEE
372 SKTE
399 TTLE
435 SFLE
450 TSGD
496 TLEE
584 TSKE
598 SLNE
636 SSPD
654 TSEE
668 SYRD
699 SPVE
720 SLRE
731 TTVD
756 TYLD
796 TLDE
880 TVGE
922 TGVE
999 TAKE
1047 TYTE
1206 TSEE
1436 TTMD
1452 SGLD
1578 TRVE
1706 SKCD
1803 TDVD
1820 SDLE
1843 TPRD
1879 SLSE
1956 SIHD
1981 STDD
2103 SYSE
2126 TYLE
2136 TTFD
2332 STFE
2384 SYRE
2403 SGAD
100 SSR
152 TVR
328 TVR
408 TNK
461 SKK
543 TRK
584 TSK
626 SLK
651 SSK
668 SYR
720 SLR
726 TIK
785 TLR
812 TKK
825 SIK
877 SNK
902 SAK
999 TAK
1035 TIK
1040 SYK
1137 SFK
1157 TTK
1161 TFK
1174 STK
1265 TIK
1361 SVK
1379 SPK
1472 SYK
1746 SVK
1843 TPR
1905 TTR
1914 TTK
1918 SLK
1998 SQR
2003 SYK
2131 SVR
2160 SGR
2308 SLK
2384 SYR
N-myristoylation site Prokaryotic membrane lipoprotein lipid attachment site
Tyrosine kinase phosphorylation site
5 GVTFGE
48 GTEVTE
228 GGGVAG
312 GIFGAK
489 GQGCAG
760 GADVTK
821 GGLTSI
915 GQKTTT
978 GTFLCA
988 GNYQCG
1009 GAHLTK
1045 GVTYTE
1242 GVKVTQ
1400 GSLICV
1410 GVLLSN
1443 GSFPCS
1562 GCTSST
1588 GMKRSF
1787 GVLSTF
1799 GVVDTD
2002 GSYKND
2027 GLPGTV
2154 GICLST
2175 GVFCGV
2209 GGIIAI
2328 GVTFST
2378 GALDTT
253 LNGPLTVGGSC 687 KRGDKIVY
827 KWADNNCY
Cell attachment sequence Zinc carboxypeptidases, zinc-binding region 2 signature Mitochondrial energy transfer proteins signature
688 RGD 1284 HGIAAINSVPW 1764 PMEKLKALVA
 
4.穿膜片断预测结果
这是用The PredictProtein Server预测二阶结构的结果之一,下表列出了可能的穿膜片断以及整个肽链在膜内和膜外的分布情况.由于以下是序列扫描以后的结果,所有有些非膜蛋白也有可能被预测为有穿膜序列.
Positions Segments Explain
1- 1281 o1 outside region 1
1282- 1299 M1 membrane helix 1
1300- 1387 i1 inside region 1
1388- 1411 M2 membrane helix 2
1412- 1483 o2 outside region 2
1484- 1508 M3 membrane helix 3
1509- 1516 i2 inside region 2
1517- 1541 M4 membrane helix 4
1542- 1931 o3 outside region 3
1932- 1949 M5 membrane helix 5
1950- 2203 i3 inside region 3
2204- 2222 M6 membrane helix 6
2223- 2240 o4 outside region 4
2241- 2259 M7 membrane helix 7
2260- 2263 i4 inside region 4
2264- 2281 M8 membrane helix 8
2282- 2289 o5 outside region 5
2290- 2307 M9 membrane helix 9
2308- 2422 i5 inside region 5
具体的预测结果请参看这里
 
参考文献:
1.PHD: predicting one-dimensional protein structure by profile-based neural networks.
2.Prediction of protein secondary structure at better than 70% accuracy
3.Topology prediction for helical transmembrane proteins at 86% accuracy
4.The PROSITE database, its status in 1999.