DK949614 |
Clone id |
TST38A01NGRL0006_H09 |
Library |
TST38 |
Length |
690 |
Definition |
Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0006_H09. 5' end sequence. |
Accession |
DK949614 |
Tissue type |
prothallia |
Developmental stage |
gametophyte |
Contig ID |
CL935Contig1 |
Sequence |
GTTCTCATATTCTTCGACAATCCGGGGCTTTTTGTGCCTGTCAGTACTCGTGCTCGTCAT CTGCTTCGCTTCTGCGCGCCATCTTGAGTACAATGAAGACGATCTCGCTTCCGAAGATCG CCTGCTGCAGCTCTTCGAGAAATGGGCAACCAAGCACTCTAAGAACTACACCTCCCCCCA TGAATCCTCTCAGAAGCACTCGCGCTTTCAAGTCTTCAAGCAGAACCTTGCTTACATTCA CCAGCAGAATAGCAACAAACAGAAGGAGTCTTCCCACAGGCTGGGCTTGACCCGCTTCGC AGATCTCACCCTTAACGAGTTTAAAGCTCGACATTTTGGCTTCAGAAACCGCCCCAGCCC TGTTCCCCTTCAGGAATACAGCTCTGTCTGCGATACCAAGAAACTCCCTGCATCTGTTGA TTGGAGAAAGCATGGTGCTGTTACCCCAGTTAAAGATCAAGGAACATGCGGAAGCTGTTG GGCTTTCTCGTCTGTTGGTGCTATTGAGGGTGCACATGCTATAGCCATCGGGGAGCTTGT GAGCTTGTCTGAACAGGAGCTTGTCAGCTGTGTTCACACTAACTTTGGCTGCCATGGTGG CCTCATGAACCCCGCATTCAAATGGGTTATCAGGAATGGAGGCATCAACACTGAAGAAGG GTATCCTTATGTCAGTGGCACAGGTAGAAC |
■■Homology search results ■■ |
- |
Swiss-Prot (release 56.9) |
Link to BlastX Result : Swiss-Prot |
sp_hit_id |
O65493 |
Definition |
sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana |
Align length |
219 |
Score (bit) |
174.0 |
E-value |
4.0e-43 |
Report |
BLASTX 2.2.19 [Nov-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
Query= DK949614|Adiantum capillus-veneris mRNA, clone: TST38A01NGRL0006_H09, 5' (690 letters)
Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 total letters
Searching..................................................done
Score E Sequences producing significant alignments: (bits) Value
sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 174 4e-43 sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 164 4e-40 sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2 162 2e-39 sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1 161 3e-39 sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS... 160 6e-39 sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 159 2e-38 sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. G... 158 2e-38 sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. ... 158 3e-38 sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Br... 156 1e-37 sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 ... 156 1e-37 sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulg... 155 2e-37 sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulg... 155 2e-37 sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2 152 2e-36 sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2 152 2e-36 sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 ... 150 8e-36 sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=A... 148 3e-35 sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1 147 5e-35 sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium dis... 147 7e-35 sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4 144 4e-34 sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. j... 142 1e-33 sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1 142 1e-33 sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium dis... 142 1e-33 sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=A... 142 1e-33 sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1 140 6e-33 sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=A... 140 6e-33 sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2... 140 8e-33 sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp.... 140 8e-33 sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max P... 139 1e-32 sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis t... 139 2e-32 sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1 138 3e-32
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1 SV=1 Length = 355
Score = 174 bits (441), Expect = 4e-43 Identities = 100/219 (45%), Positives = 131/219 (59%), Gaps = 9/219 (4%) Frame = +2
Query: 56 VICFASARHLE---YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQN 226 ++C A AR Y + L + D+LL+LFE W ++HSK Y S E K RF+VF++N Sbjct: 22 LLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEE---KVHRFEVFREN 78
Query: 227 LAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG-----FRNRPSPVPLQEYSSVC 391 L +I Q+N+ + +S+ LGL FADLT EFK R+ G F + P Y + Sbjct: 79 LMHIDQRNN---EINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDIT 135
Query: 392 DTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSC 571 D LP SVDWRK GAV PVKDQG CGSCWAFS+V L SLSEQEL+ C Sbjct: 136 D---LPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDC 192
Query: 572 VHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 T N GC+GGLM+ AF+++I GG++ E+ YPY+ G Sbjct: 193 DTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEG 231
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1 SV=2 Length = 356
Score = 164 bits (415), Expect = 4e-40 Identities = 95/221 (42%), Positives = 129/221 (58%), Gaps = 8/221 (3%) Frame = +2
Query: 32 LCLSVLVLVICFASARHLE---YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHS 202 L LS L + FAS+ Y+ +DL S D+L++LFE W + K Y + E K Sbjct: 14 LALSAASLSLSFASSHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEE---KFL 70
Query: 203 RFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRN----RPSPVPL 370 RF+VFK NL +I + N +K S+ LGL FADL+ EFK + G + R Sbjct: 71 RFEVFKDNLKHIDETN---KKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSY 127
Query: 371 QEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLS 550 E++ D + +P SVDWRK GAV VK+QG+CGSCWAFS+V L +LS Sbjct: 128 AEFA-YRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLS 186
Query: 551 EQELVSCVHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPY 670 EQEL+ C T N GC+GGLM+ AF+++++NGG+ EE YPY Sbjct: 187 EQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPY 227
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2 Length = 362
Score = 162 bits (409), Expect = 2e-39 Identities = 94/223 (42%), Positives = 125/223 (56%), Gaps = 9/223 (4%) Frame = +2
Query: 44 VLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQ 223 VL + A ++++ DLASE+ L L+E+W + H T +KH RF VFK Sbjct: 10 VLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHH----TVSRSLGEKHKRFNVFKA 65
Query: 224 NLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--------FRNRPSPVPLQEY 379 NL ++H N+NK + ++L L +FAD+T +EF++ + G FR P Y Sbjct: 66 NLMHVH--NTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMY 122
Query: 380 SSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQE 559 V +P SVDWRK GAVT VKDQG CGSCWAFS+V LV+LSEQE Sbjct: 123 EKVVS---VPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQE 179
Query: 560 LVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 LV C N GC+GGLM AF+++ + GGI TE YPY + G Sbjct: 180 LVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEG 222
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1 Length = 362
Score = 161 bits (408), Expect = 3e-39 Identities = 94/223 (42%), Positives = 125/223 (56%), Gaps = 9/223 (4%) Frame = +2
Query: 44 VLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQ 223 VL L + A +++E DL SE+ L L+E+W + H T +KH RF VFK Sbjct: 10 VLSLSLVLGVANSFDFHEKDLESEESLWDLYERWRSHH----TVSRSLGEKHKRFNVFKA 65
Query: 224 NLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--------FRNRPSPVPLQEY 379 N+ ++H N+NK + ++L L +FAD+T +EF++ + G FR Y Sbjct: 66 NVMHVH--NTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMY 122
Query: 380 SSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQE 559 V +PASVDWRK GAVT VKDQG CGSCWAFS++ LVSLSEQE Sbjct: 123 EKV---GSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQE 179
Query: 560 LVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 LV C N GC+GGLM AF+++ + GGI TE YPY + G Sbjct: 180 LVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQEG 222
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana GN=GCP1 PE=2 SV=2 Length = 376
Score = 160 bits (405), Expect = 6e-39 Identities = 88/228 (38%), Positives = 137/228 (60%), Gaps = 15/228 (6%) Frame = +2
Query: 32 LCLSVLVLVICFASAR------HLEYNEDDL-ASEDRLLQLFEKWATKHSK-NYTSPHES 187 L L +L +V+ AS HL+ D +++ + ++ +W+ +H K N + Sbjct: 8 LSLLLLYVVVSLASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKTNNNNNGII 67
Query: 188 SQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVP 367 + + RF +FK NL +I N + K ++++LGLT+F DLT +E++ + G R P+ Sbjct: 68 NDQDKRFNIFKDNLRFIDLHNEDN-KNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRI 126
Query: 368 L------QEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXX 529 Q+YS+ + K++P +VDWR+ GAV P+KDQGTCGSCWAFS+ Sbjct: 127 AKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVT 186
Query: 530 XXLVSLSEQELVSCVHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPY 670 L+SLSEQELV C + N GC+GGLM+ AF+++++NGG+NTE+ YPY Sbjct: 187 GELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPY 234
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1 SV=1 Length = 462
Score = 159 bits (401), Expect = 2e-38 Identities = 87/197 (44%), Positives = 124/197 (62%), Gaps = 6/197 (3%) Frame = +2
Query: 110 SEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGL 289 SE ++ ++E W KH K S + +K RF++FK NL ++ + N +K S+RLGL Sbjct: 42 SEAEVMSIYEAWLVKHGKAQ-SQNSLVEKDRRFEIFKDNLRFVDEHN---EKNLSYRLGL 97
Query: 290 TRFADLTLNEFKARHFGFRNRPSP---VPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQ 460 TRFADLT +E+++++ G + L+ + V D +LP S+DWRK GAV VKDQ Sbjct: 98 TRFADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGD--ELPESIDWRKKGAVAEVKDQ 155
Query: 461 GTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSC-VHTNFGCHGGLMNPAFKWVIRN 637 G CGSCWAFS++ L++LSEQELV C N GC+GGLM+ AF+++I+N Sbjct: 156 GGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKN 215
Query: 638 GGINTEEGYPY--VSGT 682 GGI+T++ YPY V GT Sbjct: 216 GGIDTDKDYPYKGVDGT 232
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1 Length = 360
Score = 158 bits (400), Expect = 2e-38 Identities = 93/226 (41%), Positives = 137/226 (60%), Gaps = 7/226 (3%) Frame = +2
Query: 29 FLCLSVLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 208 F+ L+++ L + A+ + + E DLASED L L+EKW T H T + +K+ RF Sbjct: 6 FIALALVALSF-LSIAQSIPFTEKDLASEDSLWNLYEKWRTHH----TVARDLDEKNRRF 60
Query: 209 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--FRNRPSPVPLQEYS 382 VFK+N+ +IH+ N++K++ ++L L +F D+T EF++++ G ++ S +Q+ + Sbjct: 61 NVFKENVKFIHE--FNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNT 118
Query: 383 SVC---DTKKLPA-SVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLS 550 + LPA S+DWR GAVT VKDQG CGSCWAFS++ LVSLS Sbjct: 119 GSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLS 178
Query: 551 EQELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 EQELV C N GC+GGLM+ AF+++ +N GI TE+ YPY G Sbjct: 179 EQELVDCDTSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDG 223
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000 PE=1 SV=2 Length = 458
Score = 158 bits (399), Expect = 3e-38 Identities = 88/212 (41%), Positives = 121/212 (57%), Gaps = 3/212 (1%) Frame = +2
Query: 44 VLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQ 223 +L+L + A + Y E SE+ +L+ +W +H K+Y + E + R+ F+ Sbjct: 13 LLLLSLAAADMSIVSYGE---RSEEEARRLYAEWKAEHGKSYNAVGEEER---RYAAFRD 66
Query: 224 NLAYIHQQNSNKQKE-SSHRLGLTRFADLTLNEFKARHFGFRNRPSPV-PLQEYSSVCDT 397 NL YI + N+ S RLGL RFADLT E++ + G RN+P + + D Sbjct: 67 NLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADN 126
Query: 398 KKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSC-V 574 + LP SVDWR GAV +KDQG CGSCWAFS++ L+SLSEQELV C Sbjct: 127 EALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT 186
Query: 575 HTNFGCHGGLMNPAFKWVIRNGGINTEEGYPY 670 N GC+GGLM+ AF ++I NGGI+TE+ YPY Sbjct: 187 SYNEGCNGGLMDYAFDFIINNGGIDTEDDYPY 218
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1 Length = 328
Score = 156 bits (394), Expect = 1e-37 Identities = 81/196 (41%), Positives = 122/196 (62%), Gaps = 8/196 (4%) Frame = +2
Query: 125 LQLFEKWATKHSK-NYTSPHESSQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFA 301 + ++ +W+ +H K N S +Q+ RF +FK NL +I N N K ++++LGLT FA Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENN-KNATYKLGLTIFA 59
Query: 302 DLTLNEFKARHFGFRNRP------SPVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQG 463 +LT +E+++ + G R P + +YS+ + ++P +VDWR+ GAV +KDQG Sbjct: 60 NLTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119
Query: 464 TCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSCVHT-NFGCHGGLMNPAFKWVIRNG 640 TCGSCWAFS+ LVSLSEQELV C + N GC+GGLM+ AF+++++NG Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179
Query: 641 GINTEEGYPYVSGTGR 688 G+NTE+ YPY G+ Sbjct: 180 GLNTEKDYPYHGTNGK 195
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1 Length = 360
Score = 156 bits (394), Expect = 1e-37 Identities = 97/225 (43%), Positives = 129/225 (57%), Gaps = 9/225 (4%) Frame = +2
Query: 38 LSVLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVF 217 L L L + A +++E +L SE+ L L+E+W + H+ + S HE K RF VF Sbjct: 6 LLALSLALVLAITESFDFHEKELESEESLWGLYERWRSHHTVS-RSLHE---KQKRFNVF 61
Query: 218 KQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--------FRNRPSPVPLQ 373 K N ++H N+NK + ++L L +FAD+T +EF+ + G FR P Sbjct: 62 KHNAMHVH--NANKM-DKPYKLKLNKFADMTNHEFRNTYSGSKVKHHRMFRGGPRGNGTF 118
Query: 374 EYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSE 553 Y V DT +PASVDWRK GAVT VKDQG CGSCWAFS++ LVSLSE Sbjct: 119 MYEKV-DT--VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSE 175
Query: 554 QELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 QELV C N GC+GGLM+ AF+++ + GGI TE YPY + G Sbjct: 176 QELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDG 220
|
TrEMBL (release 39.9) |
Link to BlastX Result : TrEMBL |
tr_hit_id |
A9NUC2 |
Definition |
tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea sitchensis |
Align length |
230 |
Score (bit) |
177.0 |
E-value |
6.0e-43 |
Report |
BLASTX 2.2.19 [Nov-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
Query= DK949614|Adiantum capillus-veneris mRNA, clone: TST38A01NGRL0006_H09, 5' (690 letters)
Database: uniprot_trembl.fasta 7,341,751 sequences; 2,391,615,440 total letters
Searching..................................................done
Score E Sequences producing significant alignments: (bits) Value
tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea... 177 6e-43 tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea... 176 1e-42 tr|B4ESE7|B4ESE7_HORVD Papain-like cysteine proteinase OS=Hordeu... 175 3e-42 tr|Q6F6A3|Q6F6A3_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 174 4e-42 tr|Q3E9R1|Q3E9R1_ARATH Uncharacterized protein At4g35350.2 OS=Ar... 174 5e-42 tr|A9TQ45|A9TQ45_PHYPA Predicted protein OS=Physcomitrella paten... 173 8e-42 tr|B6TLC8|B6TLC8_MAIZE Xylem cysteine proteinase 2 OS=Zea mays P... 172 1e-41 tr|Q6ZHP9|Q6ZHP9_ORYSJ Os02g0715000 protein OS=Oryza sativa subs... 172 2e-41 tr|Q40922|Q40922_PSEMZ Pseudotzain OS=Pseudotsuga menziesii GN=P... 172 2e-41 tr|B4ESE6|B4ESE6_HORVD Papain-like cysteine proteinase OS=Hordeu... 172 2e-41 tr|A3AAP5|A3AAP5_ORYSJ Putative uncharacterized protein OS=Oryza... 172 2e-41 tr|A2X8X3|A2X8X3_ORYSI Putative uncharacterized protein OS=Oryza... 172 2e-41 tr|A9NW12|A9NW12_PICSI Putative uncharacterized protein OS=Picea... 172 2e-41 tr|A9NV34|A9NV34_PICSI Putative uncharacterized protein OS=Picea... 171 3e-41 tr|A9S553|A9S553_PHYPA Predicted protein OS=Physcomitrella paten... 171 5e-41 tr|Q94DH7|Q94DH7_ORYSJ cDNA clone:001-029-D05, full insert seque... 170 7e-41 tr|Q0JFN1|Q0JFN1_ORYSJ Os01g0971400 protein (Fragment) OS=Oryza ... 170 7e-41 tr|A2WZK0|A2WZK0_ORYSI Putative uncharacterized protein OS=Oryza... 170 7e-41 tr|Q94HK7|Q94HK7_ORYSA Putative cysteine proteinase OS=Oryza sat... 169 1e-40 tr|Q7XBA4|Q7XBA4_ORYSJ Os05g0108600 protein OS=Oryza sativa subs... 169 1e-40 tr|A2XZJ0|A2XZJ0_ORYSI Putative uncharacterized protein OS=Oryza... 169 1e-40 tr|O24323|O24323_PHAVU Cysteine proteinase OS=Phaseolus vulgaris... 169 2e-40 tr|A7QDJ6|A7QDJ6_VITVI Chromosome chr10 scaffold_81, whole genom... 168 3e-40 tr|A5HIJ6|A5HIJ6_ACTDE Cysteine protease Cp6 OS=Actinidia delici... 167 8e-40 tr|Q6RCL8|Q6RCL8_IRIHO Putative cysteine protease 2 OS=Iris holl... 166 1e-39 tr|A9PFF7|A9PFF7_POPTR Putative uncharacterized protein OS=Popul... 166 1e-39 tr|A7Y7Y0|A7Y7Y0_SOLLC KDEL-tailed cysteine endopeptidase OS=Sol... 166 1e-39 tr|A9TY71|A9TY71_PHYPA Predicted protein OS=Physcomitrella paten... 166 2e-39 tr|Q6F6A6|Q6F6A6_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 165 2e-39 tr|Q2AAC8|Q2AAC8_9ASTR Cysteine proteinase OS=Platycodon grandif... 165 2e-39
>tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea sitchensis PE=2 SV=1 Length = 463
Score = 177 bits (449), Expect = 6e-43 Identities = 102/230 (44%), Positives = 134/230 (58%), Gaps = 12/230 (5%) Frame = +2
Query: 32 LCLSVLVLVICFASARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQK 196 L +VL L SA + Y+ DL +D +++L+E W +H K Y E K Sbjct: 5 LLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGE---K 61
Query: 197 HSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGF------RNRPS 358 +RF VFK N YIHQ N+ Q S++LGL +FADL+ EFKA + G R S Sbjct: 62 QNRFSVFKDNFLYIHQHNN--QGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNS 119
Query: 359 PVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXL 538 P P +YS D + LP S+DWR+ GAVT VKDQG+CGSCWAFS+V L Sbjct: 120 PSPRYQYS---DGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNL 176
Query: 539 VSLSEQELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 SLSEQELV C N GC+GGLM+ AF+++I NGG+++E+ YPY + G Sbjct: 177 TSLSEQELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDG 226
>tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea sitchensis PE=2 SV=1 Length = 367
Score = 176 bits (446), Expect = 1e-42 Identities = 102/228 (44%), Positives = 129/228 (56%), Gaps = 17/228 (7%) Frame = +2
Query: 38 LSVLVLVICFASARHL---EYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 208 L + +IC SA Y D+ S + L++LF++W +H K Y S HE +K R Sbjct: 8 LLISATIICLVSAAKAVQHSYEVGDINSGNGLVRLFDRWLGRHGKLYGS-HE--EKARRL 64
Query: 209 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRP----------- 355 Q+F+ NL YIH N N SS RLGL +FADLT EFK R+FG ++ Sbjct: 65 QIFRTNLQYIHAHNKNSN--SSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEG 122
Query: 356 ---SPVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXX 526 PV Q S + + +S+DWRK GAVT VKDQ CGSCWAFS+ Sbjct: 123 AELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFIS 182
Query: 527 XXXLVSLSEQELVSCVHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPY 670 LVSLSEQELV+C TN+GC GG M+ AF WVI+NGGI+TE+ Y Y Sbjct: 183 TGKLVSLSEQELVACDATNYGCEGGDMDYAFTWVIQNGGIDTEKDYSY 230
>tr|B4ESE7|B4ESE7_HORVD Papain-like cysteine proteinase OS=Hordeum vulgare var. distichum GN=pap-5 PE=2 SV=1 Length = 351
Score = 175 bits (443), Expect = 3e-42 Identities = 105/230 (45%), Positives = 138/230 (60%), Gaps = 14/230 (6%) Frame = +2
Query: 38 LSVLVLVICFAS--ARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQK 196 LSV VL++C + AR+ + Y+E+DL+S DRL++LFEKW KH K Y S E K Sbjct: 5 LSVAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEE---K 61
Query: 197 HSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQE 376 RF+VFK NL I + N ++ +S+ LGL FADLT +EFK + G SP P + Sbjct: 62 LHRFEVFKDNLKLIDEIN---REVTSYWLGLNEFADLTHDEFKTTYLGL----SPPPARR 114
Query: 377 YSSVC------DTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXL 538 SS LP +VDWRK GAVT VK+QG CGSCWAFS+V L Sbjct: 115 SSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNL 174
Query: 539 VSLSEQELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 +LSEQEL+ C V N GC+GG+M+ AF ++ +GG++TEE YPY+ G Sbjct: 175 TALSEQELIDCSVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLMEEG 224
>tr|Q6F6A3|Q6F6A3_DAUCA Cysteine protease OS=Daucus carota GN=DcCysP8 PE=2 SV=1 Length = 460
Score = 174 bits (442), Expect = 4e-42 Identities = 95/230 (41%), Positives = 138/230 (60%), Gaps = 7/230 (3%) Frame = +2
Query: 20 IRGFLCLSVLVLVICFASARHLEYNEDDL--ASEDRLLQLFEKWATKHSKNYTSPHESSQ 193 I L LS+L + A + Y++ +++D ++ +E W KH K+Y + E Q Sbjct: 4 ILSLLSLSLLAAAVTAADMSIITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGEKEQ 63
Query: 194 KHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPL- 370 RFQ+FK N YI +QN+ K + S +LGL RFADLT E+++++ G R + S + Sbjct: 64 ---RFQIFKDNFLYIDEQNAAKDR--SFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVS 118
Query: 371 ---QEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLV 541 Q Y+S+ + LP SVDWR+HGAV VKDQG CGSCWAFS++ L+ Sbjct: 119 GKSQRYASLAG-ESLPESVDWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLI 177
Query: 542 SLSEQELVSCVHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTGR 688 +LSEQELV C + N GC+GGLM+ AF+++I NGGI+++ YPY G+ Sbjct: 178 TLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQ 227
>tr|Q3E9R1|Q3E9R1_ARATH Uncharacterized protein At4g35350.2 OS=Arabidopsis thaliana GN=At4g35350 PE=3 SV=1 Length = 288
Score = 174 bits (441), Expect = 5e-42 Identities = 100/219 (45%), Positives = 131/219 (59%), Gaps = 9/219 (4%) Frame = +2
Query: 56 VICFASARHLE---YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQN 226 ++C A AR Y + L + D+LL+LFE W ++HSK Y S E K RF+VF++N Sbjct: 22 LLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEE---KVHRFEVFREN 78
Query: 227 LAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG-----FRNRPSPVPLQEYSSVC 391 L +I Q+N+ + +S+ LGL FADLT EFK R+ G F + P Y + Sbjct: 79 LMHIDQRNN---EINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDIT 135
Query: 392 DTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSC 571 D LP SVDWRK GAV PVKDQG CGSCWAFS+V L SLSEQEL+ C Sbjct: 136 D---LPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDC 192
Query: 572 VHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 T N GC+GGLM+ AF+++I GG++ E+ YPY+ G Sbjct: 193 DTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEG 231
>tr|A9TQ45|A9TQ45_PHYPA Predicted protein OS=Physcomitrella patens subsp. patens GN=PHYPADRAFT_224348 PE=3 SV=1 Length = 463
Score = 173 bits (439), Expect = 8e-42 Identities = 99/230 (43%), Positives = 138/230 (60%), Gaps = 8/230 (3%) Frame = +2
Query: 23 RGFLCLSVLVLVICFASARH-------LEYNEDDLASEDRLLQLFEKWATKHSKNYTSPH 181 R L LS+++LVI ++Y + L S+D +L +F +W HS+ Y S Sbjct: 5 RRALGLSLVLLVIAIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWLETHSRVYRS-- 62
Query: 182 ESSQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSP 361 S+KH RFQ+FK+N YIH N +++ S+ LGL +F+DLT EF+A++ G +P Sbjct: 63 -LSEKHHRFQIFKENFLYIHAHN---KQQKSYWLGLNKFSDLTHQEFRAQYLG--TKPVN 116
Query: 362 VPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLV 541 +E + + + + VDWR GAVT VKDQG CGSCWAFS+V LV Sbjct: 117 RQRKEANFMYEDVEAEPKVDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVNAIKTGELV 176
Query: 542 SLSEQELVSCVH-TNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTGR 688 SLSEQELV C N GC+GGLM+ AF+++I+NGGI+TE+ YPY + GR Sbjct: 177 SLSEQELVDCDRKQNQGCNGGLMDYAFEFIIKNGGIDTEKDYPYKARDGR 226
>tr|B6TLC8|B6TLC8_MAIZE Xylem cysteine proteinase 2 OS=Zea mays PE=2 SV=1 Length = 385
Score = 172 bits (437), Expect = 1e-41 Identities = 104/236 (44%), Positives = 135/236 (57%), Gaps = 18/236 (7%) Frame = +2
Query: 32 LCLSVLVLVICFASARH------LEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQ 193 L +S+L C A AR + Y+E+DL+S + L +LFE+W ++H + Y S E Sbjct: 19 LSVSLLAGSSCLALARPSGDFSIVGYSEEDLSSHESLAELFERWLSRHRRAYASLEE--- 75
Query: 194 KHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNR------- 352 K RFQVFK NL +I + N +K SS+ LGL FADLT +EFKA + G R+ Sbjct: 76 KLRRFQVFKDNLHHIDETN---RKVSSYWLGLNEFADLTHDEFKATYLGLRSSVGDGGSG 132
Query: 353 ----PSPVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXX 520 P + Y V D LP SVDWR GAVT VK+QG CGSCWAFS+V Sbjct: 133 IDDDDEPEEEEGYEGV-DGASLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQ 191
Query: 521 XXXXXLVSLSEQELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 L +LSEQEL+ C N GC+GGLM+ AF ++ NGG++TEE YPY+ G Sbjct: 192 IVTGNLTALSEQELIDCDTDGNNGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEG 247
>tr|Q6ZHP9|Q6ZHP9_ORYSJ Os02g0715000 protein OS=Oryza sativa subsp. japonica GN=OJ1191_G08.11 PE=2 SV=1 Length = 366
Score = 172 bits (436), Expect = 2e-41 Identities = 99/224 (44%), Positives = 134/224 (59%), Gaps = 10/224 (4%) Frame = +2
Query: 44 VLVLVICFASARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 208 +L V C A+A H + Y+++DLA ++L+ LF W+ KHSK Y SP E K R+ Sbjct: 20 LLGFVACSATASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE---KVKRY 76
Query: 209 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQEYSSV 388 ++FK+NL +I + N ++ S+ LGL FAD+ EFKA + G + + Q + S Sbjct: 77 EIFKRNLRHIVETN---RRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGST 133
Query: 389 ----CDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQ 556 + LP +VDWRK GAVTPVK+QG CGSCWAFS+V LVSLSEQ Sbjct: 134 TFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQ 193
Query: 557 ELVSCVHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 EL+ C +T N GC GGLM+ AF +++ N GI TEE YPY+ G Sbjct: 194 ELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEG 237
>tr|Q40922|Q40922_PSEMZ Pseudotzain OS=Pseudotsuga menziesii GN=PM33cysP PE=2 SV=1 Length = 454
Score = 172 bits (436), Expect = 2e-41 Identities = 98/225 (43%), Positives = 130/225 (57%), Gaps = 7/225 (3%) Frame = +2
Query: 32 LCLSVLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQ 211 L LS + A + Y+ DL +D +++L+E W +H K Y E K +F Sbjct: 10 LALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDE---KQKKFS 66
Query: 212 VFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGF------RNRPSPVPLQ 373 VFK N YIHQ N+ Q S++LGL +FADL+ EFKA + G R SP P Sbjct: 67 VFKDNFLYIHQHNN--QGNPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLSRSPSPRY 124
Query: 374 EYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSE 553 +YS D LP S+DWR+ GAVT VK+QG+CGSCWAFS+V L SLSE Sbjct: 125 QYSVGED---LPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSE 181
Query: 554 QELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 QELV C N GC+GGLM+ AF+++I NGG+++E+ YPY + G Sbjct: 182 QELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNG 226
>tr|B4ESE6|B4ESE6_HORVD Papain-like cysteine proteinase OS=Hordeum vulgare var. distichum GN=pap-4 PE=2 SV=1 Length = 356
Score = 172 bits (436), Expect = 2e-41 Identities = 103/227 (45%), Positives = 137/227 (60%), Gaps = 11/227 (4%) Frame = +2
Query: 38 LSVLVLVICFAS--ARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQK 196 LS +L++C + AR+ + Y+E+DL+S +RL++LFEKW KH K Y S E K Sbjct: 10 LSGALLLLCVGACVARNSDFSIVGYSEEDLSSNERLVELFEKWLAKHQKAYASFEE---K 66
Query: 197 HSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQE 376 RF+VFK NL +I + N ++ +S+ LGL FADLT +EFKA + G P+ Sbjct: 67 LHRFEVFKDNLKHIDKIN---REVTSYWLGLNEFADLTHDEFKAAYLGLDAAPARRGSSR 123
Query: 377 ---YSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSL 547 Y V LP SVDWRK GAVT VK+QG CGSCWAFS+V L +L Sbjct: 124 SFRYEDV-SASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTAL 182
Query: 548 SEQELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685 SEQEL+ C V N GC+GGLM+ AF ++ +GG++TEE YPY+ G Sbjct: 183 SEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYLMEEG 229
|