DK949614
Clone id TST38A01NGRL0006_H09
Library
Length 690
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0006_H09. 5' end sequence.
Accession
Tissue type prothallia
Developmental stage gametophyte
Contig ID
Sequence
GTTCTCATATTCTTCGACAATCCGGGGCTTTTTGTGCCTGTCAGTACTCGTGCTCGTCAT
CTGCTTCGCTTCTGCGCGCCATCTTGAGTACAATGAAGACGATCTCGCTTCCGAAGATCG
CCTGCTGCAGCTCTTCGAGAAATGGGCAACCAAGCACTCTAAGAACTACACCTCCCCCCA
TGAATCCTCTCAGAAGCACTCGCGCTTTCAAGTCTTCAAGCAGAACCTTGCTTACATTCA
CCAGCAGAATAGCAACAAACAGAAGGAGTCTTCCCACAGGCTGGGCTTGACCCGCTTCGC
AGATCTCACCCTTAACGAGTTTAAAGCTCGACATTTTGGCTTCAGAAACCGCCCCAGCCC
TGTTCCCCTTCAGGAATACAGCTCTGTCTGCGATACCAAGAAACTCCCTGCATCTGTTGA
TTGGAGAAAGCATGGTGCTGTTACCCCAGTTAAAGATCAAGGAACATGCGGAAGCTGTTG
GGCTTTCTCGTCTGTTGGTGCTATTGAGGGTGCACATGCTATAGCCATCGGGGAGCTTGT
GAGCTTGTCTGAACAGGAGCTTGTCAGCTGTGTTCACACTAACTTTGGCTGCCATGGTGG
CCTCATGAACCCCGCATTCAAATGGGTTATCAGGAATGGAGGCATCAACACTGAAGAAGG
GTATCCTTATGTCAGTGGCACAGGTAGAAC
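
The BLASTX alignments in this record are reported with Frame = +2, meaning the query is translated starting at its second nucleotide. As a minimal sketch (not the BLAST implementation), the Python below translates the first 120 nucleotides of the sequence above with the standard genetic code and maps a residue back to its nucleotide coordinate. The names `CODON_TABLE`, `SEQ_PREFIX`, `translate`, and `residue_to_nt` are illustrative assumptions and are not part of the record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First two sequence lines of DK949614 (120 of the 690 nucleotides).
SEQ_PREFIX = (
    "GTTCTCATATTCTTCGACAATCCGGGGCTTTTTGTGCCTGTCAGTACTCGTGCTCGTCAT"
    "CTGCTTCGCTTCTGCGCGCCATCTTGAGTACAATGAAGACGATCTCGCTTCCGAAGATCG"
)


def translate(seq, frame):
    """Translate a forward reading frame (1, 2 or 3); unknown codons become 'X'."""
    start = frame - 1
    codons = (seq[i:i + 3] for i in range(start, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def residue_to_nt(residue_index, frame):
    """Return the 1-based nucleotide position of the first base of a 0-based residue index."""
    return frame + 3 * residue_index


protein = translate(SEQ_PREFIX, 2)
print(protein)
print(residue_to_nt(protein.index("VICFASARHLE"), 2))
```

Under these assumptions the second printed value is 56, which agrees with the `Query: 56 VICFASARHLE` coordinate shown in the first alignment of this record.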
Homology search results
sp_hit_id O65493
Definition sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana
Align length 219
Score (bit) 174.0
E-value 4.0e-43
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK949614|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0006_H09, 5'
(690 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters




Sequences producing significant alignments:    Score (bits)    E value

sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 174 4e-43
sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 164 4e-40
sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2 162 2e-39
sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1 161 3e-39
sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS... 160 6e-39
sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 159 2e-38
sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. G... 158 2e-38
sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. ... 158 3e-38
sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Br... 156 1e-37
sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 ... 156 1e-37
sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulg... 155 2e-37
sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulg... 155 2e-37
sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2 152 2e-36
sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2 152 2e-36
sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 ... 150 8e-36
sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=A... 148 3e-35
sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1 147 5e-35
sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium dis... 147 7e-35
sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4 144 4e-34
sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. j... 142 1e-33
sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1 142 1e-33
sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium dis... 142 1e-33
sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=A... 142 1e-33
sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1 140 6e-33
sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=A... 140 6e-33
sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2... 140 8e-33
sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp.... 140 8e-33
sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max P... 139 1e-32
sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis t... 139 2e-32
sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1 138 3e-32

>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis
thaliana GN=XCP1 PE=1 SV=1
Length = 355

Score = 174 bits (441), Expect = 4e-43
Identities = 100/219 (45%), Positives = 131/219 (59%), Gaps = 9/219 (4%)
Frame = +2

Query: 56 VICFASARHLE---YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQN 226
++C A AR Y + L + D+LL+LFE W ++HSK Y S E K RF+VF++N
Sbjct: 22 LLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEE---KVHRFEVFREN 78

Query: 227 LAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG-----FRNRPSPVPLQEYSSVC 391
L +I Q+N+ + +S+ LGL FADLT EFK R+ G F + P Y +
Sbjct: 79 LMHIDQRNN---EINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDIT 135

Query: 392 DTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSC 571
D LP SVDWRK GAV PVKDQG CGSCWAFS+V L SLSEQEL+ C
Sbjct: 136 D---LPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDC 192

Query: 572 VHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
T N GC+GGLM+ AF+++I GG++ E+ YPY+ G
Sbjct: 193 DTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEG 231
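
The `Identities`, `Positives` and `Gaps` values in each alignment header are counts over the alignment length, followed by a whole-number percentage. As a hedged illustration, the Python below extracts those counts from the header line of the XCP1_ARATH alignment above and recomputes the percentages. The names `STAT_PATTERN` and `parse_alignment_stats` are assumptions made for this sketch. The percentages printed in this record are consistent with truncation rather than rounding (100/219 is about 45.7% but is shown as 45%), which is why the sketch uses integer division.

```python
import re

# Matches fragments such as "Identities = 100/219 (45%)".
STAT_PATTERN = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line):
    """Return {name: (count, alignment_length, reported_percent)} for one header line."""
    stats = {}
    for name, count, length, percent in STAT_PATTERN.findall(line):
        stats[name.lower()] = (int(count), int(length), int(percent))
    return stats


header = "Identities = 100/219 (45%), Positives = 131/219 (59%), Gaps = 9/219 (4%)"
stats = parse_alignment_stats(header)
for name, (count, length, reported) in stats.items():
    # Integer division reproduces the truncated percentage shown in the report.
    print(name, count, length, reported, count * 100 // length)
```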


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis
thaliana GN=XCP2 PE=1 SV=2
Length = 356

Score = 164 bits (415), Expect = 4e-40
Identities = 95/221 (42%), Positives = 129/221 (58%), Gaps = 8/221 (3%)
Frame = +2

Query: 32 LCLSVLVLVICFASARHLE---YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHS 202
L LS L + FAS+ Y+ +DL S D+L++LFE W + K Y + E K
Sbjct: 14 LALSAASLSLSFASSHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEE---KFL 70

Query: 203 RFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRN----RPSPVPL 370
RF+VFK NL +I + N +K S+ LGL FADL+ EFK + G + R
Sbjct: 71 RFEVFKDNLKHIDETN---KKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSY 127

Query: 371 QEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLS 550
E++ D + +P SVDWRK GAV VK+QG+CGSCWAFS+V L +LS
Sbjct: 128 AEFA-YRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLS 186

Query: 551 EQELVSCVHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPY 670
EQEL+ C T N GC+GGLM+ AF+++++NGG+ EE YPY
Sbjct: 187 EQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPY 227


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362

Score = 162 bits (409), Expect = 2e-39
Identities = 94/223 (42%), Positives = 125/223 (56%), Gaps = 9/223 (4%)
Frame = +2

Query: 44 VLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQ 223
VL + A ++++ DLASE+ L L+E+W + H T +KH RF VFK
Sbjct: 10 VLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHH----TVSRSLGEKHKRFNVFKA 65

Query: 224 NLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--------FRNRPSPVPLQEY 379
NL ++H N+NK + ++L L +FAD+T +EF++ + G FR P Y
Sbjct: 66 NLMHVH--NTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMY 122

Query: 380 SSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQE 559
V +P SVDWRK GAVT VKDQG CGSCWAFS+V LV+LSEQE
Sbjct: 123 EKVVS---VPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQE 179

Query: 560 LVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
LV C N GC+GGLM AF+++ + GGI TE YPY + G
Sbjct: 180 LVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEG 222


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362

Score = 161 bits (408), Expect = 3e-39
Identities = 94/223 (42%), Positives = 125/223 (56%), Gaps = 9/223 (4%)
Frame = +2

Query: 44 VLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQ 223
VL L + A +++E DL SE+ L L+E+W + H T +KH RF VFK
Sbjct: 10 VLSLSLVLGVANSFDFHEKDLESEESLWDLYERWRSHH----TVSRSLGEKHKRFNVFKA 65

Query: 224 NLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--------FRNRPSPVPLQEY 379
N+ ++H N+NK + ++L L +FAD+T +EF++ + G FR Y
Sbjct: 66 NVMHVH--NTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMY 122

Query: 380 SSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQE 559
V +PASVDWRK GAVT VKDQG CGSCWAFS++ LVSLSEQE
Sbjct: 123 EKV---GSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQE 179

Query: 560 LVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
LV C N GC+GGLM AF+++ + GGI TE YPY + G
Sbjct: 180 LVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYTAQEG 222


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1
OS=Arabidopsis thaliana GN=GCP1 PE=2 SV=2
Length = 376

Score = 160 bits (405), Expect = 6e-39
Identities = 88/228 (38%), Positives = 137/228 (60%), Gaps = 15/228 (6%)
Frame = +2

Query: 32 LCLSVLVLVICFASAR------HLEYNEDDL-ASEDRLLQLFEKWATKHSK-NYTSPHES 187
L L +L +V+ AS HL+ D +++ + ++ +W+ +H K N +
Sbjct: 8 LSLLLLYVVVSLASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKTNNNNNGII 67

Query: 188 SQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVP 367
+ + RF +FK NL +I N + K ++++LGLT+F DLT +E++ + G R P+
Sbjct: 68 NDQDKRFNIFKDNLRFIDLHNEDN-KNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRI 126

Query: 368 L------QEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXX 529
Q+YS+ + K++P +VDWR+ GAV P+KDQGTCGSCWAFS+
Sbjct: 127 AKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVT 186

Query: 530 XXLVSLSEQELVSCVHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPY 670
L+SLSEQELV C + N GC+GGLM+ AF+++++NGG+NTE+ YPY
Sbjct: 187 GELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPY 234


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis
thaliana GN=RD21A PE=1 SV=1
Length = 462

Score = 159 bits (401), Expect = 2e-38
Identities = 87/197 (44%), Positives = 124/197 (62%), Gaps = 6/197 (3%)
Frame = +2

Query: 110 SEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGL 289
SE ++ ++E W KH K S + +K RF++FK NL ++ + N +K S+RLGL
Sbjct: 42 SEAEVMSIYEAWLVKHGKAQ-SQNSLVEKDRRFEIFKDNLRFVDEHN---EKNLSYRLGL 97

Query: 290 TRFADLTLNEFKARHFGFRNRPSP---VPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQ 460
TRFADLT +E+++++ G + L+ + V D +LP S+DWRK GAV VKDQ
Sbjct: 98 TRFADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGD--ELPESIDWRKKGAVAEVKDQ 155

Query: 461 GTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSC-VHTNFGCHGGLMNPAFKWVIRN 637
G CGSCWAFS++ L++LSEQELV C N GC+GGLM+ AF+++I+N
Sbjct: 156 GGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKN 215

Query: 638 GGINTEEGYPY--VSGT 682
GGI+T++ YPY V GT
Sbjct: 216 GGIDTDKDYPYKGVDGT 232


>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp.
GN=SEN102 PE=2 SV=1
Length = 360

Score = 158 bits (400), Expect = 2e-38
Identities = 93/226 (41%), Positives = 137/226 (60%), Gaps = 7/226 (3%)
Frame = +2

Query: 29 FLCLSVLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 208
F+ L+++ L + A+ + + E DLASED L L+EKW T H T + +K+ RF
Sbjct: 6 FIALALVALSF-LSIAQSIPFTEKDLASEDSLWNLYEKWRTHH----TVARDLDEKNRRF 60

Query: 209 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--FRNRPSPVPLQEYS 382
VFK+N+ +IH+ N++K++ ++L L +F D+T EF++++ G ++ S +Q+ +
Sbjct: 61 NVFKENVKFIHE--FNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNT 118

Query: 383 SVC---DTKKLPA-SVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLS 550
+ LPA S+DWR GAVT VKDQG CGSCWAFS++ LVSLS
Sbjct: 119 GSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLS 178

Query: 551 EQELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
EQELV C N GC+GGLM+ AF+++ +N GI TE+ YPY G
Sbjct: 179 EQELVDCDTSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDG 223


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp.
japonica GN=Os04g0650000 PE=1 SV=2
Length = 458

Score = 158 bits (399), Expect = 3e-38
Identities = 88/212 (41%), Positives = 121/212 (57%), Gaps = 3/212 (1%)
Frame = +2

Query: 44 VLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQ 223
+L+L + A + Y E SE+ +L+ +W +H K+Y + E + R+ F+
Sbjct: 13 LLLLSLAAADMSIVSYGE---RSEEEARRLYAEWKAEHGKSYNAVGEEER---RYAAFRD 66

Query: 224 NLAYIHQQNSNKQKE-SSHRLGLTRFADLTLNEFKARHFGFRNRPSPV-PLQEYSSVCDT 397
NL YI + N+ S RLGL RFADLT E++ + G RN+P + + D
Sbjct: 67 NLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADN 126

Query: 398 KKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSC-V 574
+ LP SVDWR GAV +KDQG CGSCWAFS++ L+SLSEQELV C
Sbjct: 127 EALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT 186

Query: 575 HTNFGCHGGLMNPAFKWVIRNGGINTEEGYPY 670
N GC+GGLM+ AF ++I NGGI+TE+ YPY
Sbjct: 187 SYNEGCNGGLMDYAFDFIINNGGIDTEDDYPY 218


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment)
OS=Brassica napus PE=2 SV=1
Length = 328

Score = 156 bits (394), Expect = 1e-37
Identities = 81/196 (41%), Positives = 122/196 (62%), Gaps = 8/196 (4%)
Frame = +2

Query: 125 LQLFEKWATKHSK-NYTSPHESSQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFA 301
+ ++ +W+ +H K N S +Q+ RF +FK NL +I N N K ++++LGLT FA
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENN-KNATYKLGLTIFA 59

Query: 302 DLTLNEFKARHFGFRNRP------SPVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQG 463
+LT +E+++ + G R P + +YS+ + ++P +VDWR+ GAV +KDQG
Sbjct: 60 NLTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119

Query: 464 TCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSCVHT-NFGCHGGLMNPAFKWVIRNG 640
TCGSCWAFS+ LVSLSEQELV C + N GC+GGLM+ AF+++++NG
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179

Query: 641 GINTEEGYPYVSGTGR 688
G+NTE+ YPY G+
Sbjct: 180 GLNTEKDYPYHGTNGK 195


>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1
SV=1
Length = 360

Score = 156 bits (394), Expect = 1e-37
Identities = 97/225 (43%), Positives = 129/225 (57%), Gaps = 9/225 (4%)
Frame = +2

Query: 38 LSVLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVF 217
L L L + A +++E +L SE+ L L+E+W + H+ + S HE K RF VF
Sbjct: 6 LLALSLALVLAITESFDFHEKELESEESLWGLYERWRSHHTVS-RSLHE---KQKRFNVF 61

Query: 218 KQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG--------FRNRPSPVPLQ 373
K N ++H N+NK + ++L L +FAD+T +EF+ + G FR P
Sbjct: 62 KHNAMHVH--NANKM-DKPYKLKLNKFADMTNHEFRNTYSGSKVKHHRMFRGGPRGNGTF 118

Query: 374 EYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSE 553
Y V DT +PASVDWRK GAVT VKDQG CGSCWAFS++ LVSLSE
Sbjct: 119 MYEKV-DT--VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSE 175

Query: 554 QELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
QELV C N GC+GGLM+ AF+++ + GGI TE YPY + G
Sbjct: 176 QELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDG 220


tr_hit_id A9NUC2
Definition tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea sitchensis
Align length 230
Score (bit) 177.0
E-value 6.0e-43
Report
BLASTX 2.2.19 [Nov-02-2008]



Query= DK949614|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0006_H09, 5'
(690 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters




Sequences producing significant alignments:    Score (bits)    E value

tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea... 177 6e-43
tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea... 176 1e-42
tr|B4ESE7|B4ESE7_HORVD Papain-like cysteine proteinase OS=Hordeu... 175 3e-42
tr|Q6F6A3|Q6F6A3_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 174 4e-42
tr|Q3E9R1|Q3E9R1_ARATH Uncharacterized protein At4g35350.2 OS=Ar... 174 5e-42
tr|A9TQ45|A9TQ45_PHYPA Predicted protein OS=Physcomitrella paten... 173 8e-42
tr|B6TLC8|B6TLC8_MAIZE Xylem cysteine proteinase 2 OS=Zea mays P... 172 1e-41
tr|Q6ZHP9|Q6ZHP9_ORYSJ Os02g0715000 protein OS=Oryza sativa subs... 172 2e-41
tr|Q40922|Q40922_PSEMZ Pseudotzain OS=Pseudotsuga menziesii GN=P... 172 2e-41
tr|B4ESE6|B4ESE6_HORVD Papain-like cysteine proteinase OS=Hordeu... 172 2e-41
tr|A3AAP5|A3AAP5_ORYSJ Putative uncharacterized protein OS=Oryza... 172 2e-41
tr|A2X8X3|A2X8X3_ORYSI Putative uncharacterized protein OS=Oryza... 172 2e-41
tr|A9NW12|A9NW12_PICSI Putative uncharacterized protein OS=Picea... 172 2e-41
tr|A9NV34|A9NV34_PICSI Putative uncharacterized protein OS=Picea... 171 3e-41
tr|A9S553|A9S553_PHYPA Predicted protein OS=Physcomitrella paten... 171 5e-41
tr|Q94DH7|Q94DH7_ORYSJ cDNA clone:001-029-D05, full insert seque... 170 7e-41
tr|Q0JFN1|Q0JFN1_ORYSJ Os01g0971400 protein (Fragment) OS=Oryza ... 170 7e-41
tr|A2WZK0|A2WZK0_ORYSI Putative uncharacterized protein OS=Oryza... 170 7e-41
tr|Q94HK7|Q94HK7_ORYSA Putative cysteine proteinase OS=Oryza sat... 169 1e-40
tr|Q7XBA4|Q7XBA4_ORYSJ Os05g0108600 protein OS=Oryza sativa subs... 169 1e-40
tr|A2XZJ0|A2XZJ0_ORYSI Putative uncharacterized protein OS=Oryza... 169 1e-40
tr|O24323|O24323_PHAVU Cysteine proteinase OS=Phaseolus vulgaris... 169 2e-40
tr|A7QDJ6|A7QDJ6_VITVI Chromosome chr10 scaffold_81, whole genom... 168 3e-40
tr|A5HIJ6|A5HIJ6_ACTDE Cysteine protease Cp6 OS=Actinidia delici... 167 8e-40
tr|Q6RCL8|Q6RCL8_IRIHO Putative cysteine protease 2 OS=Iris holl... 166 1e-39
tr|A9PFF7|A9PFF7_POPTR Putative uncharacterized protein OS=Popul... 166 1e-39
tr|A7Y7Y0|A7Y7Y0_SOLLC KDEL-tailed cysteine endopeptidase OS=Sol... 166 1e-39
tr|A9TY71|A9TY71_PHYPA Predicted protein OS=Physcomitrella paten... 166 2e-39
tr|Q6F6A6|Q6F6A6_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 165 2e-39
tr|Q2AAC8|Q2AAC8_9ASTR Cysteine proteinase OS=Platycodon grandif... 165 2e-39

>tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 463

Score = 177 bits (449), Expect = 6e-43
Identities = 102/230 (44%), Positives = 134/230 (58%), Gaps = 12/230 (5%)
Frame = +2

Query: 32 LCLSVLVLVICFASARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQK 196
L +VL L SA + Y+ DL +D +++L+E W +H K Y E K
Sbjct: 5 LLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGE---K 61

Query: 197 HSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGF------RNRPS 358
+RF VFK N YIHQ N+ Q S++LGL +FADL+ EFKA + G R S
Sbjct: 62 QNRFSVFKDNFLYIHQHNN--QGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNS 119

Query: 359 PVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXL 538
P P +YS D + LP S+DWR+ GAVT VKDQG+CGSCWAFS+V L
Sbjct: 120 PSPRYQYS---DGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNL 176

Query: 539 VSLSEQELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
SLSEQELV C N GC+GGLM+ AF+++I NGG+++E+ YPY + G
Sbjct: 177 TSLSEQELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDG 226


>tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 367

Score = 176 bits (446), Expect = 1e-42
Identities = 102/228 (44%), Positives = 129/228 (56%), Gaps = 17/228 (7%)
Frame = +2

Query: 38 LSVLVLVICFASARHL---EYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 208
L + +IC SA Y D+ S + L++LF++W +H K Y S HE +K R
Sbjct: 8 LLISATIICLVSAAKAVQHSYEVGDINSGNGLVRLFDRWLGRHGKLYGS-HE--EKARRL 64

Query: 209 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRP----------- 355
Q+F+ NL YIH N N SS RLGL +FADLT EFK R+FG ++
Sbjct: 65 QIFRTNLQYIHAHNKNSN--SSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEG 122

Query: 356 ---SPVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXX 526
PV Q S + + +S+DWRK GAVT VKDQ CGSCWAFS+
Sbjct: 123 AELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFIS 182

Query: 527 XXXLVSLSEQELVSCVHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPY 670
LVSLSEQELV+C TN+GC GG M+ AF WVI+NGGI+TE+ Y Y
Sbjct: 183 TGKLVSLSEQELVACDATNYGCEGGDMDYAFTWVIQNGGIDTEKDYSY 230


>tr|B4ESE7|B4ESE7_HORVD Papain-like cysteine proteinase OS=Hordeum
vulgare var. distichum GN=pap-5 PE=2 SV=1
Length = 351

Score = 175 bits (443), Expect = 3e-42
Identities = 105/230 (45%), Positives = 138/230 (60%), Gaps = 14/230 (6%)
Frame = +2

Query: 38 LSVLVLVICFAS--ARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQK 196
LSV VL++C + AR+ + Y+E+DL+S DRL++LFEKW KH K Y S E K
Sbjct: 5 LSVAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEE---K 61

Query: 197 HSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQE 376
RF+VFK NL I + N ++ +S+ LGL FADLT +EFK + G SP P +
Sbjct: 62 LHRFEVFKDNLKLIDEIN---REVTSYWLGLNEFADLTHDEFKTTYLGL----SPPPARR 114

Query: 377 YSSVC------DTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXL 538
SS LP +VDWRK GAVT VK+QG CGSCWAFS+V L
Sbjct: 115 SSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNL 174

Query: 539 VSLSEQELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
+LSEQEL+ C V N GC+GG+M+ AF ++ +GG++TEE YPY+ G
Sbjct: 175 TALSEQELIDCSVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLMEEG 224


>tr|Q6F6A3|Q6F6A3_DAUCA Cysteine protease OS=Daucus carota
GN=DcCysP8 PE=2 SV=1
Length = 460

Score = 174 bits (442), Expect = 4e-42
Identities = 95/230 (41%), Positives = 138/230 (60%), Gaps = 7/230 (3%)
Frame = +2

Query: 20 IRGFLCLSVLVLVICFASARHLEYNEDDL--ASEDRLLQLFEKWATKHSKNYTSPHESSQ 193
I L LS+L + A + Y++ +++D ++ +E W KH K+Y + E Q
Sbjct: 4 ILSLLSLSLLAAAVTAADMSIITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGEKEQ 63

Query: 194 KHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPL- 370
RFQ+FK N YI +QN+ K + S +LGL RFADLT E+++++ G R + S +
Sbjct: 64 ---RFQIFKDNFLYIDEQNAAKDR--SFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVS 118

Query: 371 ---QEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLV 541
Q Y+S+ + LP SVDWR+HGAV VKDQG CGSCWAFS++ L+
Sbjct: 119 GKSQRYASLAG-ESLPESVDWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLI 177

Query: 542 SLSEQELVSCVHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTGR 688
+LSEQELV C + N GC+GGLM+ AF+++I NGGI+++ YPY G+
Sbjct: 178 TLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDADYPYTGRDGQ 227


>tr|Q3E9R1|Q3E9R1_ARATH Uncharacterized protein At4g35350.2
OS=Arabidopsis thaliana GN=At4g35350 PE=3 SV=1
Length = 288

Score = 174 bits (441), Expect = 5e-42
Identities = 100/219 (45%), Positives = 131/219 (59%), Gaps = 9/219 (4%)
Frame = +2

Query: 56 VICFASARHLE---YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQVFKQN 226
++C A AR Y + L + D+LL+LFE W ++HSK Y S E K RF+VF++N
Sbjct: 22 LLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEE---KVHRFEVFREN 78

Query: 227 LAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFG-----FRNRPSPVPLQEYSSVC 391
L +I Q+N+ + +S+ LGL FADLT EFK R+ G F + P Y +
Sbjct: 79 LMHIDQRNN---EINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDIT 135

Query: 392 DTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQELVSC 571
D LP SVDWRK GAV PVKDQG CGSCWAFS+V L SLSEQEL+ C
Sbjct: 136 D---LPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDC 192

Query: 572 VHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
T N GC+GGLM+ AF+++I GG++ E+ YPY+ G
Sbjct: 193 DTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEG 231


>tr|A9TQ45|A9TQ45_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_224348 PE=3 SV=1
Length = 463

Score = 173 bits (439), Expect = 8e-42
Identities = 99/230 (43%), Positives = 138/230 (60%), Gaps = 8/230 (3%)
Frame = +2

Query: 23 RGFLCLSVLVLVICFASARH-------LEYNEDDLASEDRLLQLFEKWATKHSKNYTSPH 181
R L LS+++LVI ++Y + L S+D +L +F +W HS+ Y S
Sbjct: 5 RRALGLSLVLLVIAIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWLETHSRVYRS-- 62

Query: 182 ESSQKHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSP 361
S+KH RFQ+FK+N YIH N +++ S+ LGL +F+DLT EF+A++ G +P
Sbjct: 63 -LSEKHHRFQIFKENFLYIHAHN---KQQKSYWLGLNKFSDLTHQEFRAQYLG--TKPVN 116

Query: 362 VPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLV 541
+E + + + + VDWR GAVT VKDQG CGSCWAFS+V LV
Sbjct: 117 RQRKEANFMYEDVEAEPKVDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVNAIKTGELV 176

Query: 542 SLSEQELVSCVH-TNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTGR 688
SLSEQELV C N GC+GGLM+ AF+++I+NGGI+TE+ YPY + GR
Sbjct: 177 SLSEQELVDCDRKQNQGCNGGLMDYAFEFIIKNGGIDTEKDYPYKARDGR 226


>tr|B6TLC8|B6TLC8_MAIZE Xylem cysteine proteinase 2 OS=Zea mays PE=2
SV=1
Length = 385

Score = 172 bits (437), Expect = 1e-41
Identities = 104/236 (44%), Positives = 135/236 (57%), Gaps = 18/236 (7%)
Frame = +2

Query: 32 LCLSVLVLVICFASARH------LEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQ 193
L +S+L C A AR + Y+E+DL+S + L +LFE+W ++H + Y S E
Sbjct: 19 LSVSLLAGSSCLALARPSGDFSIVGYSEEDLSSHESLAELFERWLSRHRRAYASLEE--- 75

Query: 194 KHSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNR------- 352
K RFQVFK NL +I + N +K SS+ LGL FADLT +EFKA + G R+
Sbjct: 76 KLRRFQVFKDNLHHIDETN---RKVSSYWLGLNEFADLTHDEFKATYLGLRSSVGDGGSG 132

Query: 353 ----PSPVPLQEYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXX 520
P + Y V D LP SVDWR GAVT VK+QG CGSCWAFS+V
Sbjct: 133 IDDDDEPEEEEGYEGV-DGASLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQ 191

Query: 521 XXXXXLVSLSEQELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
L +LSEQEL+ C N GC+GGLM+ AF ++ NGG++TEE YPY+ G
Sbjct: 192 IVTGNLTALSEQELIDCDTDGNNGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEG 247


>tr|Q6ZHP9|Q6ZHP9_ORYSJ Os02g0715000 protein OS=Oryza sativa subsp.
japonica GN=OJ1191_G08.11 PE=2 SV=1
Length = 366

Score = 172 bits (436), Expect = 2e-41
Identities = 99/224 (44%), Positives = 134/224 (59%), Gaps = 10/224 (4%)
Frame = +2

Query: 44 VLVLVICFASARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRF 208
+L V C A+A H + Y+++DLA ++L+ LF W+ KHSK Y SP E K R+
Sbjct: 20 LLGFVACSATASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE---KVKRY 76

Query: 209 QVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQEYSSV 388
++FK+NL +I + N ++ S+ LGL FAD+ EFKA + G + + Q + S
Sbjct: 77 EIFKRNLRHIVETN---RRNGSYWLGLNHFADIAHEEFKASYLGLKPGLARRDAQPHGST 133

Query: 389 ----CDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSEQ 556
+ LP +VDWRK GAVTPVK+QG CGSCWAFS+V LVSLSEQ
Sbjct: 134 TFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKLVSLSEQ 193

Query: 557 ELVSCVHT-NFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
EL+ C +T N GC GGLM+ AF +++ N GI TEE YPY+ G
Sbjct: 194 ELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEG 237


>tr|Q40922|Q40922_PSEMZ Pseudotzain OS=Pseudotsuga menziesii
GN=PM33cysP PE=2 SV=1
Length = 454

Score = 172 bits (436), Expect = 2e-41
Identities = 98/225 (43%), Positives = 130/225 (57%), Gaps = 7/225 (3%)
Frame = +2

Query: 32 LCLSVLVLVICFASARHLEYNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQKHSRFQ 211
L LS + A + Y+ DL +D +++L+E W +H K Y E K +F
Sbjct: 10 LALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDE---KQKKFS 66

Query: 212 VFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGF------RNRPSPVPLQ 373
VFK N YIHQ N+ Q S++LGL +FADL+ EFKA + G R SP P
Sbjct: 67 VFKDNFLYIHQHNN--QGNPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLSRSPSPRY 124

Query: 374 EYSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSLSE 553
+YS D LP S+DWR+ GAVT VK+QG+CGSCWAFS+V L SLSE
Sbjct: 125 QYSVGED---LPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSE 181

Query: 554 QELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
QELV C N GC+GGLM+ AF+++I NGG+++E+ YPY + G
Sbjct: 182 QELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNG 226


>tr|B4ESE6|B4ESE6_HORVD Papain-like cysteine proteinase OS=Hordeum
vulgare var. distichum GN=pap-4 PE=2 SV=1
Length = 356

Score = 172 bits (436), Expect = 2e-41
Identities = 103/227 (45%), Positives = 137/227 (60%), Gaps = 11/227 (4%)
Frame = +2

Query: 38 LSVLVLVICFAS--ARHLE-----YNEDDLASEDRLLQLFEKWATKHSKNYTSPHESSQK 196
LS +L++C + AR+ + Y+E+DL+S +RL++LFEKW KH K Y S E K
Sbjct: 10 LSGALLLLCVGACVARNSDFSIVGYSEEDLSSNERLVELFEKWLAKHQKAYASFEE---K 66

Query: 197 HSRFQVFKQNLAYIHQQNSNKQKESSHRLGLTRFADLTLNEFKARHFGFRNRPSPVPLQE 376
RF+VFK NL +I + N ++ +S+ LGL FADLT +EFKA + G P+
Sbjct: 67 LHRFEVFKDNLKHIDKIN---REVTSYWLGLNEFADLTHDEFKAAYLGLDAAPARRGSSR 123

Query: 377 ---YSSVCDTKKLPASVDWRKHGAVTPVKDQGTCGSCWAFSSVXXXXXXXXXXXXXLVSL 547
Y V LP SVDWRK GAVT VK+QG CGSCWAFS+V L +L
Sbjct: 124 SFRYEDV-SASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTAL 182

Query: 548 SEQELVSC-VHTNFGCHGGLMNPAFKWVIRNGGINTEEGYPYVSGTG 685
SEQEL+ C V N GC+GGLM+ AF ++ +GG++TEE YPY+ G
Sbjct: 183 SEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYLMEEG 229