DK951864
Clone id TST38A01NGRL0012_I14
Library
Length 664
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0012_I14. 5' end sequence.
Accession
Tissue type prothallia
Developmental stage gametophyte
Contig ID
Sequence
CGCCATTACAACTTCTTTAAAATAGACGGAGCTCTGCAGACGGTCCGCCATAGCCACTCT
CTGATGGGCTCGCATCTGTTTCTAGCCTCTTTGGCTATCTGCCTTCTCTTCATCTCCGCC
ACCGCAGCAGACACAGATCCTTTAATCAAGCAAGTTACCGACGCAGAGCTTGAGGGCTCC
TCGCTCTCTGGCATTCCGATTTCGCACATCTCGTCTCGGCAAGATGTAGAGAGCCTGTTC
AAGCGGTTTATCGACAGGTTTGGCAAAAGCTATGCGTCTCCTGAGGAGAGGGCGCACCGG
TTCCGTATCTTTGAGAGGAATTTGGTGAAGGCCGCGAAGATCCAGAGGCAGGACCCTTCG
GCGACGCACGGTGTCACAAAATTCTCTGATCTCACAGAAGAGGAGTTCTCACGTTACCTT
GGCCTGAAGACTCCCCACTTTCTCAAGCCCTCTCTTAACTCGGACGCCCCTATCCTGCCC
ACCAATGACCTCCCATCTGATTTTGATTGGCGCGATCGTGGGGCTGTCTCTGAAGTCAAG
AACCAGGGGAGTTGTGGATCATGCTGGACTTTCAGTACAACAGGAGCTGTGGAAGGTGCT
CATTTCGTCAAAACAGGAAAGCTATTGAGCCTTAGTGAGCAACAGCTAGTTGATTGTGAT
CATG
■■Homology search results ■■ -
sp_hit_id P43295
Definition sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabidopsis thaliana
Align length 203
Score (bit) 197.0
E-value 4.0e-50
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK951864|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0012_I14, 5'
(664 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabid... 197 4e-50
sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2... 186 7e-47
sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis t... 186 1e-46
sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 ... 182 2e-45
sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1 140 5e-33
sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 135 3e-31
sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1 132 1e-30
sp|Q26534|CATL_SCHMA Cathepsin L OS=Schistosoma mansoni GN=CL1 P... 132 2e-30
sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Dro... 130 5e-30
sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium dis... 130 6e-30
sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 128 3e-29
sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata mul... 124 4e-28
sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare ... 122 1e-27
sp|Q6VTL7|CATV_NPVCD Viral cathepsin OS=Choristoneura fumiferana... 122 2e-27
sp|Q91CL9|CATV_NPVAP Viral cathepsin OS=Antheraea pernyi nuclear... 121 4e-27
sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 120 6e-27
sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 ... 119 1e-26
sp|P14658|CYSP_TRYBB Cysteine proteinase OS=Trypanosoma brucei b... 117 4e-26
sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thal... 117 4e-26
sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsi... 116 9e-26
sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=A... 115 2e-25
sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2 115 2e-25
sp|P41715|CATV_NPVCF Viral cathepsin OS=Choristoneura fumiferana... 115 2e-25
sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2 115 3e-25
sp|Q9WGE0|CATV_NPVHC Viral cathepsin OS=Hyphantria cunea nuclear... 115 3e-25
sp|P41721|CATV_NPVBM Viral cathepsin OS=Bombyx mori nuclear poly... 115 3e-25
sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=... 113 8e-25
sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. ... 113 1e-24
sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=A... 113 1e-24
sp|Q8B9D5|CATV_NPVRO Viral cathepsin OS=Rachiplusia ou multiple ... 112 1e-24

>sp|P43295|A494_ARATH Probable cysteine proteinase A494
OS=Arabidopsis thaliana GN=At2g21430 PE=2 SV=2
Length = 361

Score = 197 bits (501), Expect = 4e-50
Identities = 112/203 (55%), Positives = 141/203 (69%), Gaps = 3/203 (1%)
Frame = +1

Query: 64 MGSHL-FLASLAICLLFISATAA-DTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESL 237
M HL L S+++ +F+S + D D LI+QV D E E LS +D +L
Sbjct: 1 MDYHLRVLFSVSLIFVFVSVSVCGDEDVLIRQVVD-ETEPKVLSS---------EDHFTL 50

Query: 238 FKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR- 414
FK+ +FGK Y S EE +RF +F+ NL++A + Q+ DPSA HGVT+FSDLT EF R
Sbjct: 51 FKK---KFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRK 107

Query: 415 YLGLKTPHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVE 594
+LG+K F P + APILPT +LP +FDWRDRGAV+ VKNQGSCGSCW+FSTTGA+E
Sbjct: 108 HLGVKGG-FKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALE 166

Query: 595 GAHFVKTGKLLSLSEQQLVDCDH 663
GAHF+ TGKL+SLSEQQLVDCDH
Sbjct: 167 GAHFLATGKLVSLSEQQLVDCDH 189


>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2
SV=1
Length = 363

Score = 186 bits (473), Expect = 7e-47
Identities = 104/199 (52%), Positives = 132/199 (66%), Gaps = 4/199 (2%)
Frame = +1

Query: 79 FLASLAICLLFISATAADT---DPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRF 249
FL +L + +A DT D +I+QV D E + H+ + E F F
Sbjct: 5 FLFALFLFAAVATAVTDDTNNDDFIIRQVVDNEED----------HLLN---AEHHFTSF 51

Query: 250 IDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGL 426
+F KSYA+ EE +RF +F+ NL+KA Q +DP+A HG+TKFSDLT EF R +LGL
Sbjct: 52 KSKFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGL 111

Query: 427 KTPHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHF 606
K L P+ APILPT +LP DFDWR++GAV+ VK+QGSCGSCW FSTTGA+EGAH+
Sbjct: 112 KKRLRL-PAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHY 170

Query: 607 VKTGKLLSLSEQQLVDCDH 663
+ TGKL+SLSEQQLVDCDH
Sbjct: 171 LATGKLVSLSEQQLVDCDH 189


>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis
thaliana GN=RD19A PE=2 SV=1
Length = 368

Score = 186 bits (472), Expect = 1e-46
Identities = 102/201 (50%), Positives = 134/201 (66%), Gaps = 5/201 (2%)
Frame = +1

Query: 76 LFLASLAICLLFISATAADT----DPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFK 243
L+ + + +S +++D D +I+QV G + + +D SLFK
Sbjct: 6 LYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVV----------GGAEPQVLTSEDHFSLFK 55

Query: 244 RFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEF-SRYL 420
R +FGK YAS EE +RF +F+ NL +A + Q+ DPSATHGVT+FSDLT EF ++L
Sbjct: 56 R---KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHL 112

Query: 421 GLKTPHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGA 600
G+++ L N APILPT +LP DFDWRD GAV+ VKNQGSCGSCW+FS TGA+EGA
Sbjct: 113 GVRSGFKLPKDANK-APILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGA 171

Query: 601 HFVKTGKLLSLSEQQLVDCDH 663
+F+ TGKL+SLSEQQLVDCDH
Sbjct: 172 NFLATGKLVSLSEQQLVDCDH 192


>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1
PE=2 SV=1
Length = 371

Score = 182 bits (461), Expect = 2e-45
Identities = 107/205 (52%), Positives = 128/205 (62%), Gaps = 9/205 (4%)
Frame = +1

Query: 76 LFLASLAICLLFISATAADTDPLIKQVT----DAELEGSSLSGIPISHISSRQDVESLFK 243
L L SLA +A A+ DPLI+QV D +LE + ES F
Sbjct: 6 LLLLSLASAAAVAAAVDAE-DPLIRQVVPGGDDNDLE---------------LNAESHFL 49

Query: 244 RFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YL 420
F+ RFGKSY +E A+R +F+ NL +A + Q DPSA HGVTKFSDLT EF R YL
Sbjct: 50 SFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYL 109

Query: 421 GL-KTPHFLKPSLNS---DAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGA 588
GL K+ L L +AP+LPT+ LP DFDWRD GAV VKNQGSCGSCW+FS +GA
Sbjct: 110 GLRKSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGA 169

Query: 589 VEGAHFVKTGKLLSLSEQQLVDCDH 663
+EGAH++ TGKL LSEQQ VDCDH
Sbjct: 170 LEGAHYLATGKLEVLSEQQFVDCDH 194


>sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1
Length = 462

Score = 140 bits (354), Expect = 5e-33
Identities = 77/145 (53%), Positives = 96/145 (66%), Gaps = 3/145 (2%)
Frame = +1

Query: 235 LFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDP-SATHGVTKFSDLTEEEFS 411
LFK F+ + ++Y S EE R +F RN+++A KIQ D +A +G+TKFSDLTEEEF
Sbjct: 164 LFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFH 223

Query: 412 R-YLGLKTPHFLKPSLNSDAPILPTNDL-PSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 585
YL P K S +P NDL P ++DWR +GAV+EVKNQG CGSCW FS TG
Sbjct: 224 TIYLN---PLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTG 280

Query: 586 AVEGAHFVKTGKLLSLSEQQLVDCD 660
VEG F+ G LLSLSEQ+L+DCD
Sbjct: 281 NVEGQWFLNRGTLLSLSEQELLDCD 305


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis
thaliana GN=XCP1 PE=1 SV=1
Length = 355

Score = 135 bits (339), Expect = 3e-31
Identities = 72/164 (43%), Positives = 98/164 (59%), Gaps = 4/164 (2%)
Frame = +1

Query: 181 SLSGIPISHISSRQDVESLFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPS 360
S+ G H+++ + LF+ ++ K+Y S EE+ HRF +F NL+ + + S
Sbjct: 32 SIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS 91

Query: 361 ATHGVTKFSDLTEEEFS-RYLGLKTPHFLK---PSLNSDAPILPTNDLPSDFDWRDRGAV 528
G+ +F+DLT EEF RYLGL P F + PS N + DLP DWR +GAV
Sbjct: 92 YWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAV 149

Query: 529 SEVKNQGSCGSCWTFSTTGAVEGAHFVKTGKLLSLSEQQLVDCD 660
+ VK+QG CGSCW FST AVEG + + TG L SLSEQ+L+DCD
Sbjct: 150 APVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCD 193


>sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1
Length = 484

Score = 132 bits (333), Expect = 1e-30
Identities = 74/145 (51%), Positives = 95/145 (65%), Gaps = 2/145 (1%)
Frame = +1

Query: 232 SLFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDP-SATHGVTKFSDLTEEEF 408
S+FK F+ + ++Y S EE R +F N+V+A KIQ D +A +GVTKFSDLTEEEF
Sbjct: 185 SIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEF 244

Query: 409 SRYLGLKTPHFLKPSLNSDAPILPTNDL-PSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 585
R + L T +P N DL P ++DWR +GAV++VK+QG CGSCW FS TG
Sbjct: 245 -RTIYLNTLLRKEPG-NKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTG 302

Query: 586 AVEGAHFVKTGKLLSLSEQQLVDCD 660
VEG F+ G LLSLSEQ+L+DCD
Sbjct: 303 NVEGQWFLNQGTLLSLSEQELLDCD 327


>sp|Q26534|CATL_SCHMA Cathepsin L OS=Schistosoma mansoni GN=CL1 PE=2
SV=1
Length = 319

Score = 132 bits (331), Expect = 2e-30
Identities = 75/150 (50%), Positives = 98/150 (65%), Gaps = 4/150 (2%)
Frame = +1

Query: 223 DVESLFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQR-QDPSATHGVTKFSDLTE 399
+V+ + +F ++ K Y E+ RF IF+ N++KA Q SA +GVT +SDLT
Sbjct: 15 NVDEKYVQFKLKYRKQYHETEDEI-RFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTT 73

Query: 400 EEFSRYLGLKTPHFLKPSLNSDAPIL---PTNDLPSDFDWRDRGAVSEVKNQGSCGSCWT 570
+EF+R T ++ PS S+ P N++P +FDWR++GAV+EVKNQG CGSCW
Sbjct: 74 DEFARTH--LTASWVVPSSRSNTPTSLGKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWA 131

Query: 571 FSTTGAVEGAHFVKTGKLLSLSEQQLVDCD 660
FSTTG VE F KTGKLLSLSEQQLVDCD
Sbjct: 132 FSTTGNVESQWFRKTGKLLSLSEQQLVDCD 161


>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163
OS=Drosophila melanogaster GN=CG12163 PE=2 SV=2
Length = 614

Score = 130 bits (328), Expect = 5e-30
Identities = 68/148 (45%), Positives = 94/148 (63%), Gaps = 3/148 (2%)
Frame = +1

Query: 226 VESLFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDP-SATHGVTKFSDLTEE 402
V+ LF +F RFG+ Y S ER R RIF +NL ++ + SA +G+T+F+D+T
Sbjct: 304 VDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSS 363

Query: 403 EFSRYLGLKTPHFLKPSLNSDAPILPT--NDLPSDFDWRDRGAVSEVKNQGSCGSCWTFS 576
E+ GL K + S A ++P +LP +FDWR + AV++VKNQGSCGSCW FS
Sbjct: 364 EYKERTGLWQRDEAKATGGS-AAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFS 422

Query: 577 TTGAVEGAHFVKTGKLLSLSEQQLVDCD 660
TG +EG + VKTG+L SEQ+L+DCD
Sbjct: 423 VTGNIEGLYAVKTGELKEFSEQELLDCD 450


>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium
discoideum GN=cprA PE=1 SV=2
Length = 343

Score = 130 bits (327), Expect = 6e-30
Identities = 78/172 (45%), Positives = 94/172 (54%), Gaps = 11/172 (6%)
Frame = +1

Query: 181 SLSGIPISHISSRQDVESLFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQ----R 348
S GIP+ +S F F D+F K Y S EE RF IF+ NL K ++
Sbjct: 17 SSRGIPLEE-------QSQFLEFQDKFNKKY-SHEEYLERFEIFKSNLGKIEELNLIAIN 68

Query: 349 QDPSATHGVTKFSDLTEEEFSRY-LGLKTPHFLKPSLNSDAPILP------TNDLPSDFD 507
GV KF+DL+ +EF Y L K F D P+ N +P+ FD
Sbjct: 69 HKADTKFGVNKFADLSSDEFKNYYLNNKEAIF-----TDDLPVADYLDDEFINSIPTAFD 123

Query: 508 WRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHFVKTGKLLSLSEQQLVDCDH 663
WR RGAV+ VKNQG CGSCW+FSTTG VEG HF+ KL+SLSEQ LVDCDH
Sbjct: 124 WRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDH 175


tr_hit_id A9NVM7
Definition tr|A9NVM7|A9NVM7_PICSI Putative uncharacterized protein OS=Picea sitchensis
Align length 197
Score (bit) 217.0
E-value 6.0e-55
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK951864|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0012_I14, 5'
(664 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|A9NVM7|A9NVM7_PICSI Putative uncharacterized protein OS=Picea... 217 6e-55
tr|A9NKS7|A9NKS7_PICSI Putative uncharacterized protein OS=Picea... 217 6e-55
tr|Q9XGH8|Q9XGH8_TOBAC Putative preprocysteine proteinase OS=Nic... 216 1e-54
tr|Q2QFR2|Q2QFR2_NICBE Cysteine proteinase glycinain type (Fragm... 215 2e-54
tr|Q43579|Q43579_TOBAC Tobacco pre-pro-cysteine proteinase OS=Ni... 214 3e-54
tr|Q84YH6|Q84YH6_TOBAC CPR2-like cysteine proteinase OS=Nicotian... 214 5e-54
tr|Q07491|Q07491_SOLLC Pre-pro-cysteine proteinase (Fragment) OS... 211 2e-53
tr|Q43580|Q43580_TOBAC Tobacco pre-pro-cysteine proteinase OS=Ni... 210 6e-53
tr|B1Q474|B1Q474_CAPCH Putative cysteine proteinase OS=Capsicum ... 206 8e-52
tr|Q9XED9|Q9XED9_SOLME Cysteine proteinase OS=Solanum melongena ... 204 3e-51
tr|Q75QV8|Q75QV8_ASTTR Cysteine protease OS=Aster tripolium GN=C... 200 6e-50
tr|A9P833|A9P833_POPTR Putative uncharacterized protein OS=Popul... 200 6e-50
tr|Q3L0K1|Q3L0K1_POPTO Cysteine proteinase OS=Populus tomentosa ... 200 8e-50
tr|Q5K4K8|Q5K4K8_GOSHI Putative papain-like cysteine proteinase ... 198 2e-49
tr|Q680L1|Q680L1_ARATH Putative cysteine proteinase OS=Arabidops... 197 5e-49
tr|Q058I7|Q058I7_ARATH At2g21430 OS=Arabidopsis thaliana PE=2 SV=1 197 5e-49
tr|A9TTL3|A9TTL3_PHYPA Predicted protein OS=Physcomitrella paten... 197 5e-49
tr|A9S6L4|A9S6L4_PHYPA Predicted protein OS=Physcomitrella paten... 196 8e-49
tr|A7P7V6|A7P7V6_VITVI Chromosome chr3 scaffold_8, whole genome ... 194 5e-48
tr|Q7PCC5|Q7PCC5_HORVU Putative cysteine proteinase OS=Hordeum v... 193 9e-48
tr|A9NU42|A9NU42_PICSI Putative uncharacterized protein OS=Picea... 192 1e-47
tr|A9UFX8|A9UFX8_VITVI Cysteine protease OS=Vitis vinifera GN=Cy... 192 2e-47
tr|Q945R8|Q945R8_SANAU Cysteine proteinase (Fragment) OS=Sanders... 192 2e-47
tr|A5HIJ3|A5HIJ3_ACTDE Cysteine protease Cp3 OS=Actinidia delici... 192 2e-47
tr|Q41698|Q41698_VICSA Cysteine proteinase OS=Vicia sativa PE=2 ... 190 6e-47
tr|Q8W179|Q8W179_BRAOL Senescence-associated cysteine protease O... 189 1e-46
tr|A9P971|A9P971_POPTR Putative uncharacterized protein OS=Popul... 189 1e-46
tr|Q7XW09|Q7XW09_ORYSJ OSJNBb0054B09.3 protein OS=Oryza sativa s... 189 2e-46
tr|Q0KJ00|Q0KJ00_PHAVU Cysteine proteinase CP2 OS=Phaseolus vulg... 189 2e-46
tr|Q01M10|Q01M10_ORYSA OSIGBa0148D14.11 protein (OSIGBa0130O15.4... 189 2e-46

>tr|A9NVM7|A9NVM7_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 366

Score = 217 bits (552), Expect = 6e-55
Identities = 110/197 (55%), Positives = 141/197 (71%), Gaps = 1/197 (0%)
Frame = +1

Query: 76 LFLASLAICLLFISATAADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFID 255
LF A ++F+S+ D LI+QVTD + + + S+ + E F+ FI
Sbjct: 7 LFSAFCIFSVIFLSSATKPDDDLIRQVTDEVVSDPQI----LDARSALFNAEVHFRHFIR 62

Query: 256 RFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFS-RYLGLKT 432
R+GK Y+ PEE HRF +F+ NL++A + Q+ DP A+HGVTKFSDLT+EEF +YLGL+
Sbjct: 63 RYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGLRA 122

Query: 433 PHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHFVK 612
P DAPILPTNDLP DFDWR++GAV+EVKNQGSCGSCW FSTTGA+EGA+F+K
Sbjct: 123 PPLRDAH---DAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179

Query: 613 TGKLLSLSEQQLVDCDH 663
TG+L+SLSEQQLVDCDH
Sbjct: 180 TGELVSLSEQQLVDCDH 196


>tr|A9NKS7|A9NKS7_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 366

Score = 217 bits (552), Expect = 6e-55
Identities = 110/197 (55%), Positives = 141/197 (71%), Gaps = 1/197 (0%)
Frame = +1

Query: 76 LFLASLAICLLFISATAADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFID 255
LF A ++F+S+ D LI+QVTD + + + S+ + E F+ FI
Sbjct: 7 LFSAFCIFSVIFLSSATRPDDDLIRQVTDEVVSDPQI----LDARSALFNAEVHFRHFIR 62

Query: 256 RFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFS-RYLGLKT 432
R+GK Y+ PEE HRF +F+ NL++A + Q+ DP A+HGVTKFSDLT+EEF +YLGL+
Sbjct: 63 RYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGLRA 122

Query: 433 PHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHFVK 612
P DAPILPTNDLP DFDWR++GAV+EVKNQGSCGSCW FSTTGA+EGA+F+K
Sbjct: 123 PPLRDAH---DAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEGANFLK 179

Query: 613 TGKLLSLSEQQLVDCDH 663
TG+L+SLSEQQLVDCDH
Sbjct: 180 TGELVSLSEQQLVDCDH 196


>tr|Q9XGH8|Q9XGH8_TOBAC Putative preprocysteine proteinase
OS=Nicotiana tabacum GN=cpr2 PE=2 SV=1
Length = 363

Score = 216 bits (550), Expect = 1e-54
Identities = 121/199 (60%), Positives = 144/199 (72%), Gaps = 3/199 (1%)
Frame = +1

Query: 76 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 252
LFL SL +LF SA A +D DPLI+QV +S SH+ + + SLFK
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQV---------VSETDDSHLLNAEHHFSLFK--- 51

Query: 253 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 429
+FGK YAS EE HRF++F+ NL +A + Q DPSA HG+TKFSDLT EF R YLGL
Sbjct: 52 SKFGKIYASEEEHDHRFKVFKANLRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLH 111

Query: 430 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHF 606
P KP LN++ APILPT+DLP+DFDWRD GAV+ VKNQGSCGSCW+FSTTGAVEGAHF
Sbjct: 112 KP---KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHF 168

Query: 607 VKTGKLLSLSEQQLVDCDH 663
+ TG+L+SLSEQQLVDCDH
Sbjct: 169 LATGELVSLSEQQLVDCDH 187


>tr|Q2QFR2|Q2QFR2_NICBE Cysteine proteinase glycinain type
(Fragment) OS=Nicotiana benthamiana PE=2 SV=1
Length = 355

Score = 215 bits (547), Expect = 2e-54
Identities = 121/199 (60%), Positives = 144/199 (72%), Gaps = 3/199 (1%)
Frame = +1

Query: 76 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 252
LFL SL LF SA A +D DPLI+QV +E E SH+ + + SLFK
Sbjct: 4 LFLLSLLAFALFSSAVAFSDEDPLIRQVV-SETETDD------SHLLNAEHHFSLFK--- 53

Query: 253 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 429
+FGK YAS EE HRF++F+ NL +A + Q DPSA HG+TKFSDLT EF R YLGL
Sbjct: 54 SKFGKIYASEEEHDHRFKVFKANLRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLH 113

Query: 430 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHF 606
P KP LN++ APILPT+DLP+D+DWRD GAV+ VKNQGSCGSCW+FSTTGAVEGAHF
Sbjct: 114 KP---KPKLNAEKAPILPTSDLPADYDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHF 170

Query: 607 VKTGKLLSLSEQQLVDCDH 663
+ TG+L+SLSEQQLVDCDH
Sbjct: 171 LATGELVSLSEQQLVDCDH 189


>tr|Q43579|Q43579_TOBAC Tobacco pre-pro-cysteine proteinase
OS=Nicotiana tabacum PE=2 SV=1
Length = 363

Score = 214 bits (546), Expect = 3e-54
Identities = 121/199 (60%), Positives = 143/199 (71%), Gaps = 3/199 (1%)
Frame = +1

Query: 76 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 252
LFL SL +LF SA A +D DPLI+QV +S SH+ + + SLFK
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQV---------VSETDDSHLLNAEHHFSLFK--- 51

Query: 253 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 429
+FGK YAS EE HRF++F+ NL +A Q DPSA HG+TKFSDLT EF R YLGL
Sbjct: 52 SKFGKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLH 111

Query: 430 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHF 606
P KP LN++ APILPT+DLP+DFDWRD GAV+ VKNQGSCGSCW+FSTTGAVEGAHF
Sbjct: 112 KP---KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHF 168

Query: 607 VKTGKLLSLSEQQLVDCDH 663
+ TG+L+SLSEQQLVDCDH
Sbjct: 169 LATGELVSLSEQQLVDCDH 187


>tr|Q84YH6|Q84YH6_TOBAC CPR2-like cysteine proteinase OS=Nicotiana
tabacum PE=3 SV=1
Length = 363

Score = 214 bits (544), Expect = 5e-54
Identities = 120/199 (60%), Positives = 143/199 (71%), Gaps = 3/199 (1%)
Frame = +1

Query: 76 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 252
LFL SL +LF SA A +D DPLI+QV +S SH+ + + SLFK
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQV---------VSETDDSHLLNAEHHFSLFK--- 51

Query: 253 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 429
+FGK YAS EE HRF++F+ N +A + Q DPSA HG+TKFSDLT EF R YLGL
Sbjct: 52 SKFGKIYASEEEHDHRFKVFKANRRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLH 111

Query: 430 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHF 606
P KP LN++ APILPT+DLP+DFDWRD GAV+ VKNQGSCGSCW+FSTTGAVEGAHF
Sbjct: 112 KP---KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHF 168

Query: 607 VKTGKLLSLSEQQLVDCDH 663
+ TG+L+SLSEQQLVDCDH
Sbjct: 169 LATGELVSLSEQQLVDCDH 187


>tr|Q07491|Q07491_SOLLC Pre-pro-cysteine proteinase (Fragment)
OS=Solanum lycopersicum PE=2 SV=1
Length = 361

Score = 211 bits (538), Expect = 2e-53
Identities = 117/199 (58%), Positives = 144/199 (72%), Gaps = 3/199 (1%)
Frame = +1

Query: 76 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 252
LFL S LF SA A +D DPLI+QV +SG +H+ + + SLFK
Sbjct: 2 LFLLSFLAFALFSSAIAFSDDDPLIRQV---------VSGNDDNHMLNAEHHFSLFKA-- 50

Query: 253 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 429
+FGK YAS EE HR ++F+ NL +A + Q DPSA HG+T+FSDLT EF R YLGL
Sbjct: 51 -KFGKIYASQEEHDHRLKVFKANLHRAKRHQLLDPSAEHGITQFSDLTPSEFRRTYLGLN 109

Query: 430 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHF 606
P +P+LN++ APILPT DLPSDFDWR++GAV++VKNQGSCGSCW+FSTTGAVEGAHF
Sbjct: 110 KP---RPNLNAEKAPILPTKDLPSDFDWREKGAVTDVKNQGSCGSCWSFSTTGAVEGAHF 166

Query: 607 VKTGKLLSLSEQQLVDCDH 663
+ TG+L+SLSEQQLVDCDH
Sbjct: 167 LATGELVSLSEQQLVDCDH 185


>tr|Q43580|Q43580_TOBAC Tobacco pre-pro-cysteine proteinase
OS=Nicotiana tabacum PE=2 SV=1
Length = 365

Score = 210 bits (535), Expect = 6e-53
Identities = 120/199 (60%), Positives = 142/199 (71%), Gaps = 3/199 (1%)
Frame = +1

Query: 76 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 252
LFL SL LF SA A D DPLI+QV +E E SH+ + + SLFK
Sbjct: 4 LFLLSLPRFALFSSAIAFPDEDPLIRQVV-SETETDD------SHLLNAEHHFSLFK--- 53

Query: 253 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 429
+FGK YAS EE HRF++F+ NL +A Q DPSA HG+TKFSDLT EF R YLGL
Sbjct: 54 SKFGKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLH 113

Query: 430 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHF 606
P KP +N++ APILPT+DLP+D+DWRD GAV+ VKNQGSCGSCW+FSTTGAVEGAHF
Sbjct: 114 KP---KPKVNAEKAPILPTSDLPADYDWRDHGAVTGVKNQGSCGSCWSFSTTGAVEGAHF 170

Query: 607 VKTGKLLSLSEQQLVDCDH 663
+ TG+L+SLSEQQLVDCDH
Sbjct: 171 LATGELVSLSEQQLVDCDH 189


>tr|B1Q474|B1Q474_CAPCH Putative cysteine proteinase OS=Capsicum
chinense PE=2 SV=1
Length = 367

Score = 206 bits (525), Expect = 8e-52
Identities = 113/200 (56%), Positives = 142/200 (71%), Gaps = 4/200 (2%)
Frame = +1

Query: 76 LFLASLAICLLFISATAA--DTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRF 249
LFL SL + +F S+ A D DPLI+QVT + ++ H+ + + SLFK
Sbjct: 4 LFLLSLLVFTIFSSSAFAFSDEDPLIRQVTSESDDNNN-------HLLNAEHHFSLFK-- 54

Query: 250 IDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGL 426
+FGK YA+ EE HR ++F+ NL +A + Q DP+A HG+TKFSDLT EF R YLGL
Sbjct: 55 -SKFGKIYATQEEHDHRLKVFKANLRRARRHQLLDPTAEHGITKFSDLTPSEFRRTYLGL 113

Query: 427 KTPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAH 603
P KP L++ APILPT+DLP DFDWR++GAV+ VKNQGSCGSCW+FSTTGAVEGAH
Sbjct: 114 HKP---KPKLSTTKAPILPTSDLPEDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAH 170

Query: 604 FVKTGKLLSLSEQQLVDCDH 663
F+ TG+L+SLSEQQLVDCDH
Sbjct: 171 FLATGELVSLSEQQLVDCDH 190


>tr|Q9XED9|Q9XED9_SOLME Cysteine proteinase OS=Solanum melongena
PE=2 SV=1
Length = 363

Score = 204 bits (520), Expect = 3e-51
Identities = 114/199 (57%), Positives = 141/199 (70%), Gaps = 3/199 (1%)
Frame = +1

Query: 76 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 252
LFL SL LF SA A +D DPLI+QV +S +H+ + + SLFK
Sbjct: 4 LFLLSLLAFALFSSAIAFSDDDPLIRQV---------VSETDDNHMLNAEHHFSLFK--- 51

Query: 253 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 429
++GK YAS EE HR ++F+ NL +A + Q DP+A HG+T+FSDLT EF R YLGL
Sbjct: 52 SKYGKIYASQEEHDHRLKVFKANLRRARRHQLLDPTAEHGITQFSDLTPSEFRRTYLGLH 111

Query: 430 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTGAVEGAHF 606
P +P LN+ APILPT+DLP DFDWR++GAV+ VKNQGSCGSCW+FSTTGAVEGAHF
Sbjct: 112 KP---RPKLNAQKAPILPTSDLPEDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHF 168

Query: 607 VKTGKLLSLSEQQLVDCDH 663
+ TG+L+SLSEQQLVDCDH
Sbjct: 169 LATGELVSLSEQQLVDCDH 187