DK960359
Clone id TST39A01NGRL0007_D07
Library
Length 685
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0007_D07. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID -
Sequence
GGAAGAAGAAATCTGTATCCCTGCCTTAATGGCTACCTCCTCCGTCATATGGGCCCTCTT
GGGCGTTTTCGCCCTCTTCTTGACACTCTCCTCTGCCGATAACTCTCATGTGTTAAGCTA
CTCCCCTTCCGATCTCCACTCCCAAGCCAAGCTTGTTAGCCTCTTTGACGCATGGAACAT
GAAGCATGGAAAACATTACACAGCCGCCCAGAAGTTAGAGAAGCACAAGAGATTCCACAT
CTTCAGAGACAACCTGATGCGCATAGAGGCGCACAACAGCAAGGGATCTACTTTTAAGCT
TGGTCTCAACCGTTTCGCCGATTTGACTCAAGATGAATTCAAGCAGAGCCGGCGTCTTGG
TCTCAAGCTTCCTTCTGTCAAGCTTGGATCCCTCCGCAGGCGGTCCCACTTCCATCACAA
GTCTGAGACCCCTATGGTAACAGCTGAATCTTTGGACTGGAGAACCCTTGGCGCCGTTAC
CCCAGTGAAAGATCAGGGCATGTGTGGAAGCTGCTGGGCTTTCTCTGCCACAGGAGCTAT
TGAAGGAGCCAACGCTGTTGCAACAGGAAACCTTGTCAGTGTTTCGGAGGAAGAGCTTGT
GACATGCAGCAGTGAGAGTGGATGTGATGGGGGGCTGATGGATGATGCCTTTGAATGGGT
TATTGACAATGGCGGGGATTGCCAC
Homology search results
sp_hit_id P43297
Definition sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana
Align length 212
Score (bit) 184.0
E-value 4.0e-46
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK960359|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0007_D07, 5'
(685 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters


Sequences producing significant alignments:    Score (bits)    E-value

sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 184 4e-46
sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 178 2e-44
sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 175 2e-43
sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2 166 1e-40
sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS... 162 2e-39
sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. ... 161 3e-39
sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2 157 4e-38
sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1 153 9e-37
sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2 151 3e-36
sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. j... 150 5e-36
sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=A... 150 5e-36
sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=A... 149 1e-35
sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=A... 149 2e-35
sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 ... 148 3e-35
sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=A... 147 5e-35
sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Br... 146 9e-35
sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium dis... 146 1e-34
sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. G... 145 2e-34
sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2... 144 4e-34
sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulg... 144 4e-34
sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulg... 144 4e-34
sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 ... 142 1e-33
sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1 142 2e-33
sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4 142 2e-33
sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1 141 4e-33
sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp.... 140 5e-33
sp|Q94503|CYSP6_DICDI Cysteine proteinase 6 OS=Dictyostelium dis... 140 8e-33
sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium dis... 140 8e-33
sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis t... 137 4e-32
sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersi... 136 1e-31
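The relationship between the bit scores and E-values listed above can be sketched with the standard Karlin-Altschul approximation E ≈ m · n · 2^(−S'), where S' is the bit score, m the query length in residues, and n the database size in letters. This is only a rough sketch: the function name `approx_evalue` is hypothetical, the query length is assumed to be the 685-nt read divided by three, and raw lengths are used where BLAST applies effective (edge-corrected) lengths and an unrounded score. The result therefore agrees with the reported 4e-46 only to within about an order of magnitude.

```python
def approx_evalue(bit_score, query_len, db_len):
    """Approximate E-value as m * n * 2**(-S') using raw, uncorrected lengths."""
    return query_len * db_len * 2.0 ** (-bit_score)


# Assumption: one reading frame of the 685-nt query gives about 685 // 3 residues.
query_residues = 685 // 3
# Database size as printed in the report header for uniprot_sprot.fasta.
db_letters = 148_809_765

# Top hit (RD21A_ARATH) has a reported bit score of 184.
estimate = approx_evalue(184, query_residues, db_letters)
print(f"approximate E-value: {estimate:.1e}")
```

Because each additional bit halves the estimate, the 6-bit drop from the first hit (184) to the second (178) corresponds to roughly a 64-fold larger E-value, which is consistent with the step from 4e-46 to 2e-44 in the list.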

>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis
thaliana GN=RD21A PE=1 SV=1
Length = 462

Score = 184 bits (467), Expect = 4e-46
Identities = 97/212 (45%), Positives = 140/212 (66%), Gaps = 8/212 (3%)
Frame = +2

Query: 65 VFALFLTLSSADNSHVLSY------SPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKH 226
+F + +SSA + ++SY S + S+A+++S+++AW +KHGK + +EK
Sbjct: 11 LFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKD 70

Query: 227 KRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRS 406
+RF IF+DNL ++ HN K +++LGL RFADLT DE++ S+ LG K+ K G RR+
Sbjct: 71 RRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR-SKYLGAKME--KKGE--RRT 125

Query: 407 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 586
+++ ES+DWR GAV VKDQG CGSCWAFS GA+EG N + TG+L+++S
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185

Query: 587 EEELVTC--SSESGCDGGLMDDAFEWVIDNGG 676
E+ELV C S GC+GGLMD AFE++I NGG
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGG 217
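The `Frame = +2` annotation and the `Query:` coordinates in the alignment above can be checked directly against the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code; the helper names `translate_from` and `forward_frame` are illustrative and not part of BLAST. It translates the first 120 nucleotides of the EST starting at position 65 (the start of the RD21a alignment) and at position 29 (the start of several later alignments, where the ATG codon lies).

```python
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_from(seq, start):
    """Translate seq beginning at the 1-based nucleotide position start."""
    seq = seq.upper()
    codons = (seq[i:i + 3] for i in range(start - 1, len(seq) - 2, 3))
    # Codons containing anything other than A, C, G, T become "X".
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def forward_frame(start):
    """Forward reading frame (1, 2 or 3) implied by a 1-based start position."""
    return (start - 1) % 3 + 1


# First 120 nucleotides of DK960359, copied from the Sequence field.
EST_PREFIX = (
    "GGAAGAAGAAATCTGTATCCCTGCCTTAATGGCTACCTCCTCCGTCATATGGGCCCTCTT"
    "GGGCGTTTTCGCCCTCTTCTTGACACTCTCCTCTGCCGATAACTCTCATGTGTTAAGCTA"
)

print(forward_frame(65))               # reading frame of the alignment start
print(translate_from(EST_PREFIX, 65))  # residues beginning at Query: 65
print(translate_from(EST_PREFIX, 29))  # residues beginning at the ATG codon
```

The translation from position 65 begins with `VFALFLTLSSADNSHVLS`, matching the gap-free residues of the first `Query:` line, and positions 29 and 65 both fall in frame +2.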


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis
thaliana GN=XCP2 PE=1 SV=2
Length = 356

Score = 178 bits (452), Expect = 2e-44
Identities = 99/223 (44%), Positives = 138/223 (61%), Gaps = 5/223 (2%)
Frame = +2

Query: 23 ALMATSSVIWALLGVFALFLTLSSADNSH---VLSYSPSDLHSQAKLVSLFDAWNMKHGK 193
AL + S ++ L + A L+LS A +SH ++ YSP DL S KL+ LF+ W K
Sbjct: 2 ALSSPSRILCFALALSAASLSLSFA-SSHDYSIVGYSPEDLESHDKLIELFENWISNFEK 60

Query: 194 HYTAAQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLP 373
Y + EK RF +F+DNL I+ N KG ++ LGLN FADL+ +EFK+ LGLK
Sbjct: 61 AYETVE--EKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLSHEEFKKMY-LGLKTD 117

Query: 374 SVKLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGAN 553
V+ R + F ++ + +S+DWR GAV VK+QG CGSCWAFS A+EG N
Sbjct: 118 IVRRDEERSYAEFAYRDVEAV--PKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGIN 175

Query: 554 AVATGNLVSVSEEELVTCSS--ESGCDGGLMDDAFEWVIDNGG 676
+ TGNL ++SE+EL+ C + +GC+GGLMD AFE+++ NGG
Sbjct: 176 KIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218
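The `Identities`, `Positives` and `Gaps` figures in each alignment header are simple ratios over the alignment length. The sketch below parses one such line and recomputes the percentages; the function names `parse_stats` and `truncated_percent` are hypothetical. It assumes the percentages are truncated rather than rounded, which is what the values in this report suggest (for example, 138/223 is 61.9% but is printed as 61%).

```python
import re

# Matches fragments such as "Identities = 99/223 (44%)".
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(pct))
        for name, count, length, pct in STAT.findall(line)
    }


def truncated_percent(count, length):
    """Integer percentage with the fractional part dropped."""
    return 100 * count // length


# Statistics line of the XCP2_ARATH alignment, copied from the report.
line = "Identities = 99/223 (44%), Positives = 138/223 (61%), Gaps = 5/223 (2%)"
for name, (count, length, reported) in parse_stats(line).items():
    print(name, truncated_percent(count, length), reported)
```

The same check holds for the RD21a alignment (97/212, 140/212 and 8/212 give 45, 66 and 3).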


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis
thaliana GN=XCP1 PE=1 SV=1
Length = 355

Score = 175 bits (443), Expect = 2e-43
Identities = 96/219 (43%), Positives = 138/219 (63%), Gaps = 13/219 (5%)
Frame = +2

Query: 59 LGVFALFLTLSS--------ADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQK 214
L F+L + +S+ A + ++ Y+P L + KL+ LF++W +H K Y + +
Sbjct: 8 LSKFSLLVAISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVE- 66

Query: 215 LEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSL 394
EK RF +FR+NLM I+ N++ +++ LGLN FADLT +EFK R LGL P
Sbjct: 67 -EKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFK-GRYLGLAKPQFS---- 120

Query: 395 RRR---SHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVAT 565
R+R ++F ++ T + +S+DWR GAV PVKDQG CGSCWAFS A+EG N + T
Sbjct: 121 RKRQPSANFRYRDITDL--PKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITT 178

Query: 566 GNLVSVSEEELVTCSS--ESGCDGGLMDDAFEWVIDNGG 676
GNL S+SE+EL+ C + SGC+GGLMD AF+++I GG
Sbjct: 179 GNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGG 217


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352

Score = 166 bits (420), Expect = 1e-40
Identities = 98/221 (44%), Positives = 129/221 (58%), Gaps = 4/221 (1%)
Frame = +2

Query: 23 ALMATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYT 202
A M++ S I L + + LSSAD + + YS DL S +L+ LFD+W +KH K Y
Sbjct: 2 ATMSSISKIIFLATCLIIHMGLSSAD-FYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYE 60

Query: 203 AAQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVK 382
+ EK RF IFRDNLM I+ N K +++ LGLN FADL+ DEFK+ K
Sbjct: 61 SID--EKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKK------KYVGFV 112

Query: 383 LGSLRRRSHFHHKSET-PMVT--AESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGAN 553
HF ++ T VT +S+DWR GAVTPVK+QG CGSCWAFS +EG N
Sbjct: 113 AEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGIN 172

Query: 554 AVATGNLVSVSEEELVTCSSES-GCDGGLMDDAFEWVIDNG 673
+ TGNL+ +SE+ELV C S GC GG + ++V +NG
Sbjct: 173 KIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVANNG 213


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1
OS=Arabidopsis thaliana GN=GCP1 PE=2 SV=2
Length = 376

Score = 162 bits (410), Expect = 2e-39
Identities = 88/226 (38%), Positives = 138/226 (61%), Gaps = 10/226 (4%)
Frame = +2

Query: 29 MATSSVIWALLGVFALFLTLSSAD----NSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKH 196
MA S+ + +LL ++ + ++L+S D N H+ S + ++ S++ W+ +HGK
Sbjct: 1 MAPSTKVLSLLLLYVV-VSLASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKT 59

Query: 197 YTAAQKL--EKHKRFHIFRDNLMRIEAHN--SKGSTFKLGLNRFADLTQDEFKQSRRLGL 364
+ ++ KRF+IF+DNL I+ HN +K +T+KLGL +F DLT DE+++
Sbjct: 60 NNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGAR 119

Query: 365 KLPSVKLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIE 544
P+ ++ + + + + E++DWR GAV P+KDQG CGSCWAFS T A+E
Sbjct: 120 TEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVE 179

Query: 545 GANAVATGNLVSVSEEELVTC--SSESGCDGGLMDDAFEWVIDNGG 676
G N + TG L+S+SE+ELV C S GC+GGLMD AF++++ NGG
Sbjct: 180 GINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 225


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp.
japonica GN=Os04g0650000 PE=1 SV=2
Length = 458

Score = 161 bits (408), Expect = 3e-39
Identities = 92/207 (44%), Positives = 130/207 (62%), Gaps = 6/207 (2%)
Frame = +2

Query: 74 LFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKRFHIFRDN 253
L L+L++AD S ++SY S+ + L+ W +HGK Y A E+ +R+ FRDN
Sbjct: 14 LLLSLAAADMS-IVSYGE---RSEEEARRLYAEWKAEHGKSYNAVG--EEERRYAAFRDN 67

Query: 254 LMRIEAHNSKGS----TFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRSHFHHK 421
L I+ HN+ +F+LGLNRFADLT +E++ + LGL+ + R+ +
Sbjct: 68 LRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRDTY-LGLRNKPRR----ERKVSDRYL 122

Query: 422 SETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELV 601
+ ES+DWRT GAV +KDQG CGSCWAFSA A+EG N + TG+L+S+SE+ELV
Sbjct: 123 AADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELV 182

Query: 602 TC--SSESGCDGGLMDDAFEWVIDNGG 676
C S GC+GGLMD AF+++I+NGG
Sbjct: 183 DCDTSYNEGCNGGLMDYAFDFIINNGG 209


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362

Score = 157 bits (398), Expect = 4e-38
Identities = 92/221 (41%), Positives = 130/221 (58%), Gaps = 5/221 (2%)
Frame = +2

Query: 29 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAA 208
MAT ++W +L F+L L ++++ + H DL S+ L L++ W H+T +
Sbjct: 1 MATKKLLWVVLS-FSLVLGVANSFDFH-----DKDLASEESLWDLYERWR----SHHTVS 50

Query: 209 QKL-EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKL--PSV 379
+ L EKHKRF++F+ NLM + N +KL LN+FAD+T EF+ S G K+ P +
Sbjct: 51 RSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFR-STYAGSKVNHPRM 109

Query: 380 KLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAV 559
G+ F + E + S+DWR GAVT VKDQG CGSCWAFS A+EG N +
Sbjct: 110 FRGTPHENGAFMY--EKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167

Query: 560 ATGNLVSVSEEELVTCSSE--SGCDGGLMDDAFEWVIDNGG 676
T LV++SE+ELV C E GC+GGLM+ AFE++ GG
Sbjct: 168 KTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGG 208


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362

Score = 153 bits (386), Expect = 9e-37
Identities = 91/221 (41%), Positives = 127/221 (57%), Gaps = 5/221 (2%)
Frame = +2

Query: 29 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAA 208
MA ++W +L +L L L A++ + DL S+ L L++ W H+T +
Sbjct: 1 MAMKKLLWVVL---SLSLVLGVANS---FDFHEKDLESEESLWDLYERWR----SHHTVS 50

Query: 209 QKL-EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKL 385
+ L EKHKRF++F+ N+M + N +KL LN+FAD+T EF+ S G K+ K+
Sbjct: 51 RSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFR-STYAGSKVNHHKM 109

Query: 386 --GSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAV 559
GS F ++ + S+DWR GAVT VKDQG CGSCWAFS A+EG N +
Sbjct: 110 FRGSQHGSGTFMYEKVGSVPA--SVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQI 167

Query: 560 ATGNLVSVSEEELVTCSSE--SGCDGGLMDDAFEWVIDNGG 676
T LVS+SE+ELV C E GC+GGLM+ AFE++ GG
Sbjct: 168 KTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGG 208


>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348

Score = 151 bits (382), Expect = 3e-36
Identities = 87/213 (40%), Positives = 120/213 (56%), Gaps = 2/213 (0%)
Frame = +2

Query: 41 SVIWALLGVFALFLTLS-SADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKL 217
S+ L LF+ +S S + ++ YS DL S +L+ LF++W + H K Y
Sbjct: 6 SISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVD-- 63

Query: 218 EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLR 397
EK RF IF+DNL I+ N K +++ LGLN FADL+ DEF + + S+ ++
Sbjct: 64 EKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKY-----VGSLIDATIE 118

Query: 398 RRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLV 577
+ +E + E++DWR GAVTPV+ QG CGSCWAFSA +EG N + TG LV
Sbjct: 119 QSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLV 178

Query: 578 SVSEEELVTCSSES-GCDGGLMDDAFEWVIDNG 673
+SE+ELV C S GC GG A E+V NG
Sbjct: 179 ELSEQELVDCERRSHGCKGGYPPYALEYVAKNG 211


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp.
japonica GN=Os04g0670200 PE=1 SV=2
Length = 466

Score = 150 bits (380), Expect = 5e-36
Identities = 82/185 (44%), Positives = 116/185 (62%), Gaps = 6/185 (3%)
Frame = +2

Query: 140 SQAKLVSLFDAWNMKHGKHYTAAQKLEKHKRFHIFRDNLMRIEAHNSKGST---FKLGLN 310
++A+ + +D W ++G A E +RF +F DNL ++AHN++ F+LG+N
Sbjct: 44 TEAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMN 103

Query: 311 RFADLTQDEFKQSRRLGLKLPSVKLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVK 490
RFADLT +EF+ + LG K+ + R H E P ES+DWR GAV PVK
Sbjct: 104 RFADLTNEEFRATF-LGAKVAERSRAAGERYRH-DGVEELP----ESVDWREKGAVAPVK 157

Query: 491 DQGMCGSCWAFSATGAIEGANAVATGNLVSVSEEELVTCSS---ESGCDGGLMDDAFEWV 661
+QG CGSCWAFSA +E N + TG ++++SE+ELV CS+ SGC+GGLMDDAF+++
Sbjct: 158 NQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFI 217

Query: 662 IDNGG 676
I NGG
Sbjct: 218 IKNGG 222


tr_hit_id Q94BX1
Definition tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana
Align length 212
Score (bit) 186.0
E-value 1.0e-45
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= DK960359|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0007_D07, 5'
(685 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters


Sequences producing significant alignments:    Score (bits)    E-value

tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana... 186 1e-45
tr|Q56XI5|Q56XI5_ARATH Cysteine proteinase RD21A OS=Arabidopsis ... 184 5e-45
tr|A9NW12|A9NW12_PICSI Putative uncharacterized protein OS=Picea... 182 2e-44
tr|O24323|O24323_PHAVU Cysteine proteinase OS=Phaseolus vulgaris... 182 2e-44
tr|Q6Y1E9|Q6Y1E9_TRIRP Cysteine protease 14 OS=Trifolium repens ... 181 3e-44
tr|Q84M29|Q84M29_HELAN Cysteine protease-1 OS=Helianthus annuus ... 181 4e-44
tr|A5HIJ6|A5HIJ6_ACTDE Cysteine protease Cp6 OS=Actinidia delici... 181 4e-44
tr|Q9ST61|Q9ST61_SOLTU Cysteine protease OS=Solanum tuberosum GN... 178 3e-43
tr|Q0WT15|Q0WT15_ARATH Putative cysteine proteinase OS=Arabidops... 178 3e-43
tr|Q6Y1F0|Q6Y1F0_TRIRP Cysteine protease 14 OS=Trifolium repens ... 178 3e-43
tr|A9P285|A9P285_PICSI Putative uncharacterized protein OS=Picea... 176 1e-42
tr|O49877|O49877_SOLLC CYP1 (Cysteine protease TDI-65) OS=Solanu... 175 2e-42
tr|Q3E9R1|Q3E9R1_ARATH Uncharacterized protein At4g35350.2 OS=Ar... 175 3e-42
tr|Q2AAC8|Q2AAC8_9ASTR Cysteine proteinase OS=Platycodon grandif... 175 3e-42
tr|Q155L4|Q155L4_HEVBR Cysteine protease OS=Hevea brasiliensis G... 175 3e-42
tr|Q6F6A9|Q6F6A9_DAUCA Cysteine protease OS=Daucus carota GN=DcC... 174 6e-42
tr|Q52QX8|Q52QX8_MANES Cysteine protease CP1 OS=Manihot esculent... 174 6e-42
tr|A9NUC2|A9NUC2_PICSI Putative uncharacterized protein OS=Picea... 174 6e-42
tr|Q40922|Q40922_PSEMZ Pseudotzain OS=Pseudotsuga menziesii GN=P... 173 8e-42
tr|A9NV34|A9NV34_PICSI Putative uncharacterized protein OS=Picea... 173 8e-42
tr|A7P8S5|A7P8S5_VITVI Chromosome chr3 scaffold_8, whole genome ... 173 8e-42
tr|B4ESE6|B4ESE6_HORVD Papain-like cysteine proteinase OS=Hordeu... 173 1e-41
tr|A9PFF7|A9PFF7_POPTR Putative uncharacterized protein OS=Popul... 173 1e-41
tr|A5HIJ2|A5HIJ2_ACTDE Cysteine protease Cp2 OS=Actinidia delici... 172 1e-41
tr|A5B6Y2|A5B6Y2_VITVI Putative uncharacterized protein OS=Vitis... 172 2e-41
tr|Q75NB3|Q75NB3_DIACA Cysteine proteinase OS=Dianthus caryophyl... 172 2e-41
tr|A7QDJ6|A7QDJ6_VITVI Chromosome chr10 scaffold_81, whole genom... 172 2e-41
tr|B2LSD2|B2LSD2_MUCPR Mucunain OS=Mucuna pruriens PE=2 SV=1 171 3e-41
tr|B0ZRH2|B0ZRH2_PINSY Cysteine protease (Fragment) OS=Pinus syl... 171 5e-41
tr|Q9FMH8|Q9FMH8_ARATH Cysteine protease component of protease-i... 170 7e-41

>tr|Q94BX1|Q94BX1_ARATH F2G19.31/F2G19.31 OS=Arabidopsis thaliana
PE=2 SV=1
Length = 462

Score = 186 bits (472), Expect = 1e-45
Identities = 98/212 (46%), Positives = 141/212 (66%), Gaps = 8/212 (3%)
Frame = +2

Query: 65 VFALFLTLSSADNSHVLSY------SPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKH 226
+F +T+SSA + ++SY S + S+A+++S+++AW +KHGK + +EK
Sbjct: 11 LFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKD 70

Query: 227 KRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRS 406
+RF IF+DNL ++ HN K +++LGL RFADLT DE++ S+ LG K+ K G RR+
Sbjct: 71 RRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR-SKYLGAKME--KKGE--RRT 125

Query: 407 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 586
+++ ES+DWR GAV VKDQG CGSCWAFS GA+EG N + TG+L+++S
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185

Query: 587 EEELVTC--SSESGCDGGLMDDAFEWVIDNGG 676
E+ELV C S GC+GGLMD AFE++I NGG
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGG 217


>tr|Q56XI5|Q56XI5_ARATH Cysteine proteinase RD21A OS=Arabidopsis
thaliana GN=At1g47128 PE=2 SV=1
Length = 433

Score = 184 bits (467), Expect = 5e-45
Identities = 97/212 (45%), Positives = 140/212 (66%), Gaps = 8/212 (3%)
Frame = +2

Query: 65 VFALFLTLSSADNSHVLSY------SPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKH 226
+F + +SSA + ++SY S + S+A+++S+++AW +KHGK + +EK
Sbjct: 11 LFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKD 70

Query: 227 KRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRS 406
+RF IF+DNL ++ HN K +++LGL RFADLT DE++ S+ LG K+ K G RR+
Sbjct: 71 RRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR-SKYLGAKME--KKGE--RRT 125

Query: 407 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 586
+++ ES+DWR GAV VKDQG CGSCWAFS GA+EG N + TG+L+++S
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185

Query: 587 EEELVTC--SSESGCDGGLMDDAFEWVIDNGG 676
E+ELV C S GC+GGLMD AFE++I NGG
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGG 217


>tr|A9NW12|A9NW12_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 294

Score = 182 bits (462), Expect = 2e-44
Identities = 110/212 (51%), Positives = 136/212 (64%), Gaps = 5/212 (2%)
Frame = +2

Query: 56 LLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKRF 235
+L + L L SS ++Y+P DL S+ L+SLFD W HGK YTA Q+ RF
Sbjct: 7 ILKLVMLLLVFSSVT---AITYNPRDL-SENGLLSLFDRWCNHHGKTYTAKQR---PLRF 59

Query: 236 HIFRDNLMRIEAHNSKGS-TFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRR--RS 406
+F++NL I HNS+G+ TF LGLN F+DLT DEF+ ++++GL+ L S RR +S
Sbjct: 60 QVFKENLFYISEHNSRGNHTFWLGLNAFSDLTSDEFR-TQQMGLRGHPPSLKSRRREPKS 118

Query: 407 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 586
P SLDWR AVT VKDQG CG CWAFSATGAIEG N + TG+LVS+S
Sbjct: 119 GLLELYNIP----SSLDWRDKDAVTGVKDQGACGDCWAFSATGAIEGINKIVTGSLVSLS 174

Query: 587 EEELVTC--SSESGCDGGLMDDAFEWVIDNGG 676
E+EL C S SGCDGGLMD AF+WVI NGG
Sbjct: 175 EQELCDCDTSYNSGCDGGLMDYAFQWVIVNGG 206


>tr|O24323|O24323_PHAVU Cysteine proteinase OS=Phaseolus vulgaris
PE=3 SV=1
Length = 455

Score = 182 bits (461), Expect = 2e-44
Identities = 103/212 (48%), Positives = 139/212 (65%), Gaps = 8/212 (3%)
Frame = +2

Query: 65 VFALFLTLSSADNSHVLSYS-----PSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHK 229
+FALF LSSA + ++SY + + ++ SL++ W +KHGK Y A EK K
Sbjct: 3 LFALF-ALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALG--EKDK 59

Query: 230 RFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKL-PSVKLGSLRRRS 406
RF IF+DNL I+ N++ T+KLGLNRFADLT +E++ +R LG K+ P+ +LG
Sbjct: 60 RFQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEEYR-ARYLGTKIDPNRRLGRTPSNR 118

Query: 407 HFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVSVS 586
+ ET +S+DWR GAV PVKDQ CGSCWAFSA GA+EG N + TG+L+S+S
Sbjct: 119 YAPRVGET---LPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLS 175

Query: 587 EEELVTCSS--ESGCDGGLMDDAFEWVIDNGG 676
E+ELV C + GC+GGLMD AFE++I NGG
Sbjct: 176 EQELVDCDTGYNMGCNGGLMDYAFEFIIKNGG 207


>tr|Q6Y1E9|Q6Y1E9_TRIRP Cysteine protease 14 OS=Trifolium repens
PE=2 SV=1
Length = 351

Score = 181 bits (460), Expect = 3e-44
Identities = 98/215 (45%), Positives = 132/215 (61%), Gaps = 2/215 (0%)
Frame = +2

Query: 38 SSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKL 217
SS L LFL+L+ + ++ YS DL S KL+ LF++W +HGK Y +
Sbjct: 5 SSKTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIE-- 62

Query: 218 EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLR 397
EK RF +F+DNL I+ N S + LGLN FADL+ EFK ++ LGLK V L R
Sbjct: 63 EKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFK-NKYLGLK---VNLSQRR 118

Query: 398 RRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLV 577
S+ + + +S+DWR GAVTPVK+QG CGSCWAFS A+EG N + TGNL
Sbjct: 119 ESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 178

Query: 578 SVSEEELVTCSS--ESGCDGGLMDDAFEWVIDNGG 676
S+SE+EL+ C + +GC+GGLMD AF +++ NGG
Sbjct: 179 SLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGG 213


>tr|Q84M29|Q84M29_HELAN Cysteine protease-1 OS=Helianthus annuus
GN=scp1 PE=2 SV=1
Length = 461

Score = 181 bits (459), Expect = 4e-44
Identities = 99/219 (45%), Positives = 141/219 (64%), Gaps = 3/219 (1%)
Frame = +2

Query: 29 MATSSVIWALLGVFALFLTLSSADNSHVLSYSPS-DLHSQAKLVSLFDAWNMKHGKHYTA 205
MAT S + + A+ +++ + D +H+ S S S L + ++ +L+++W +KHGK Y A
Sbjct: 6 MATLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKTYNA 65

Query: 206 AQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKL 385
EK +RF IF+DNL I+ HNS T+KLGLN+FADLT +E++ + + K
Sbjct: 66 LG--EKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKKK 123

Query: 386 GSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVAT 565
S + + ++S + E +DWR GAVT VKDQG CGSCWAFS TG++EG N + T
Sbjct: 124 LSKMKSDRYAYRSGDSL--PEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVT 181

Query: 566 GNLVSVSEEELVTC--SSESGCDGGLMDDAFEWVIDNGG 676
G+L+SVSE+ELV C S GC+GGLMD AFE++I NGG
Sbjct: 182 GDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGG 220


>tr|A5HIJ6|A5HIJ6_ACTDE Cysteine protease Cp6 OS=Actinidia deliciosa
PE=2 SV=1
Length = 461

Score = 181 bits (459), Expect = 4e-44
Identities = 104/214 (48%), Positives = 136/214 (63%), Gaps = 6/214 (2%)
Frame = +2

Query: 53 ALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKLEKHKR 232
ALL +F+LF S+ D S + S S + ++++++++W +KHGK Y A EK KR
Sbjct: 11 ALLLLFSLFALSSALDMSIIGELSSS--RTDDEVMAMYESWLVKHGKSYNAIG--EKEKR 66

Query: 233 FHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLRRRSHF 412
F IF+DNL I+ HN++ T+K+GLNRFADLT DE++ S LG + GS RR S
Sbjct: 67 FQIFKDNLRFIDEHNAESRTYKVGLNRFADLTNDEYR-SMYLG-----ARTGSRRRLSTQ 120

Query: 413 HHKSETPMVTAESL----DWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLVS 580
V ESL DWR GAV VKDQG CGSCWAFS A+EG N + TG+L+S
Sbjct: 121 KRSDRYVPVAGESLPDSVDWREKGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLIS 180

Query: 581 VSEEELVTC--SSESGCDGGLMDDAFEWVIDNGG 676
+SE+ELV C S GC+GGLMD AFE++I NGG
Sbjct: 181 LSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGG 214


>tr|Q9ST61|Q9ST61_SOLTU Cysteine protease OS=Solanum tuberosum
GN=cyp PE=2 SV=1
Length = 466

Score = 178 bits (452), Expect = 3e-43
Identities = 105/225 (46%), Positives = 140/225 (62%), Gaps = 8/225 (3%)
Frame = +2

Query: 26 LMATSSVIWALLGVFALFLTLSSADNSHVLSYSPSDLH--SQAKLVSLFDAWNMKHGKHY 199
+ A SS + L + +F TLSSA + ++SY + +H S ++ +L+++W ++HGK Y
Sbjct: 1 MAAHSSTLTISLLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSY 60

Query: 200 TAAQKLEKHKRFHIFRDNLMRIEAHNS-KGSTFKLGLNRFADLTQDEFKQSRRLGLKLPS 376
A EK KRF IF+DNL I+ NS ++KLGL +FADLT +E++ S LG K
Sbjct: 61 NALG--EKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYR-SIYLGTK--- 114

Query: 377 VKLGSLRRRSHFHHKSETPMV---TAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEG 547
G R+ S P V ES+DWR G + VKDQG CGSCWAFSA A+E
Sbjct: 115 -SSGDRRKLSKNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMES 173

Query: 548 ANAVATGNLVSVSEEELVTC--SSESGCDGGLMDDAFEWVIDNGG 676
NA+ TGNL+S+SE+ELV C S GCDGGLMD AFE+VI+NGG
Sbjct: 174 INAIVTGNLISLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGG 218


>tr|Q0WT15|Q0WT15_ARATH Putative cysteine proteinase OS=Arabidopsis
thaliana GN=At1g20850 PE=2 SV=1
Length = 356

Score = 178 bits (452), Expect = 3e-43
Identities = 99/223 (44%), Positives = 138/223 (61%), Gaps = 5/223 (2%)
Frame = +2

Query: 23 ALMATSSVIWALLGVFALFLTLSSADNSH---VLSYSPSDLHSQAKLVSLFDAWNMKHGK 193
AL + S ++ L + A L+LS A +SH ++ YSP DL S KL+ LF+ W K
Sbjct: 2 ALSSPSRILCFALALSAASLSLSFA-SSHDYSIVGYSPEDLESHDKLIELFENWISNFEK 60

Query: 194 HYTAAQKLEKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLP 373
Y + EK RF +F+DNL I+ N KG ++ LGLN FADL+ +EFK+ LGLK
Sbjct: 61 AYETVE--EKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLSHEEFKKMY-LGLKTD 117

Query: 374 SVKLGSLRRRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGAN 553
V+ R + F ++ + +S+DWR GAV VK+QG CGSCWAFS A+EG N
Sbjct: 118 IVRRDEERSYAEFAYRDVEAV--PKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGIN 175

Query: 554 AVATGNLVSVSEEELVTCSS--ESGCDGGLMDDAFEWVIDNGG 676
+ TGNL ++SE+EL+ C + +GC+GGLMD AFE+++ NGG
Sbjct: 176 KIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218


>tr|Q6Y1F0|Q6Y1F0_TRIRP Cysteine protease 14 OS=Trifolium repens
PE=3 SV=1
Length = 351

Score = 178 bits (451), Expect = 3e-43
Identities = 98/215 (45%), Positives = 131/215 (60%), Gaps = 2/215 (0%)
Frame = +2

Query: 38 SSVIWALLGVFALFLTLSSADNSHVLSYSPSDLHSQAKLVSLFDAWNMKHGKHYTAAQKL 217
SS L LFL+L+ + ++ YS DL S KL+ LF++W +HGK Y +
Sbjct: 5 SSKTLVLTCSLCLFLSLAFGRDFSIVGYSSEDLKSMDKLIELFESWMSRHGKIYETIE-- 62

Query: 218 EKHKRFHIFRDNLMRIEAHNSKGSTFKLGLNRFADLTQDEFKQSRRLGLKLPSVKLGSLR 397
EK RF +F+DNL I+ N S + LGLN FADL+ EFK ++ LGLK V L R
Sbjct: 63 EKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFK-NKYLGLK---VDLSQRR 118

Query: 398 RRSHFHHKSETPMVTAESLDWRTLGAVTPVKDQGMCGSCWAFSATGAIEGANAVATGNLV 577
S+ + + +S+DWR GAVTPVK+QG CGSCWAFS A+EG N + TGNL
Sbjct: 119 ESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 178

Query: 578 SVSEEELVTCSS--ESGCDGGLMDDAFEWVIDNGG 676
S+SE+EL+ C + +GC+GGLMD AF ++ NGG
Sbjct: 179 SLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGG 213