DK955664
Clone id TST39A01NGRL0023_L09
Library
Length 548
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0023_L09. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID
Sequence
GACGGTCCGCCATAGCCACTCTCTGATGGGCTCGCATCTGTTTCTAGCCTCTTTGGCTAT
CTGCCTTCTCTTCATCTCCGCCACCGCAGCAGACACAGATCCTTTAATCAAGCAAGTTAC
CGACGCAGAGCTTGAGGGCTCCTCGCTCTCTGGCATTCCGATTTCGCACATCTCGTCTCG
GCAAGATGTAGAGAGCCTGTTCAAGCGGTTTATCGACAGGTTTGGCAAAAGCTATGCGTC
TCCTGAGGAGAGGGCGCACCGGTTCCGTATCTTTGAGAGGAATTTGGTGAAGGCCGCGAA
GATCCAGAGGCAGGACCCTTCGGCGACGCACGGTGTCACAAAATTCTCTGATCTCACAGA
AGAGGAGTTCTCACGTTACCTTGGCCTGAAGACTCCCCACTTTCTCAAGCCCTCTCTTAA
CTCGGACGCCCCTATCCTGCCCACCAATGACCTCCCATCTGATTTTGATTGGCGCGATCG
TGGGGCTGTCTCTGAAGTCAAGAACCAGGGGAGTTGTGGATCATGCTGGACTTTCAGTAC
AACAGGAG
■■Homology search results ■■ -
sp_hit_id P43295
Definition sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabidopsis thaliana
Align length 177
Score (bit) 151.0
E-value 2.0e-36
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK955664|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0023_L09, 5'
(548 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabid... 151 2e-36
sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis t... 143 6e-34
sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 ... 142 8e-34
sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2... 142 1e-33
sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1 110 4e-24
sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis ... 103 7e-22
sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1 102 1e-21
sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Dro... 101 2e-21
sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata mul... 99 1e-20
sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis ... 97 4e-20
sp|Q26534|CATL_SCHMA Cathepsin L OS=Schistosoma mansoni GN=CL1 P... 97 7e-20
sp|P14658|CYSP_TRYBB Cysteine proteinase OS=Trypanosoma brucei b... 96 9e-20
sp|Q6VTL7|CATV_NPVCD Viral cathepsin OS=Choristoneura fumiferana... 96 9e-20
sp|Q91CL9|CATV_NPVAP Viral cathepsin OS=Antheraea pernyi nuclear... 96 1e-19
sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare ... 94 6e-19
sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium dis... 93 7e-19
sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1 92 2e-18
sp|P56202|CATW_HUMAN Cathepsin W OS=Homo sapiens GN=CTSW PE=1 SV=1 91 3e-18
sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thal... 91 3e-18
sp|P41715|CATV_NPVCF Viral cathepsin OS=Choristoneura fumiferana... 91 4e-18
sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 ... 90 6e-18
sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsi... 90 6e-18
sp|P41721|CATV_NPVBM Viral cathepsin OS=Bombyx mori nuclear poly... 90 8e-18
sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis t... 89 2e-17
sp|Q9WGE0|CATV_NPVHC Viral cathepsin OS=Hyphantria cunea nuclear... 89 2e-17
sp|Q8B9D5|CATV_NPVRO Viral cathepsin OS=Rachiplusia ou multiple ... 88 3e-17
sp|P25775|LMCPA_LEIME Cysteine proteinase A OS=Leishmania mexica... 87 7e-17
sp|P25783|CATV_NPVAC Viral cathepsin OS=Autographa californica n... 86 9e-17
sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersi... 86 1e-16
sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. ... 86 2e-16

>sp|P43295|A494_ARATH Probable cysteine proteinase A494
OS=Arabidopsis thaliana GN=At2g21430 PE=2 SV=2
Length = 361

Score = 151 bits (382), Expect = 2e-36
Identities = 90/177 (50%), Positives = 116/177 (65%), Gaps = 3/177 (1%)
Frame = +2

Query: 26 MGSHL-FLASLAICLLFISATAA-DTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESL 199
M HL L S+++ +F+S + D D LI+QV D E E LS +D +L
Sbjct: 1 MDYHLRVLFSVSLIFVFVSVSVCGDEDVLIRQVVD-ETEPKVLSS---------EDHFTL 50

Query: 200 FKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR- 376
FK+ +FGK Y S EE +RF +F+ NL++A + Q+ DPSA HGVT+FSDLT EF R
Sbjct: 51 FKK---KFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRK 107

Query: 377 YLGLKTPHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
+LG+K F P + APILPT +LP +FDWRDRGAV+ VKNQGSCGSCW+FSTTG
Sbjct: 108 HLGVKGG-FKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTG 163


>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis
thaliana GN=RD19A PE=2 SV=1
Length = 368

Score = 143 bits (360), Expect = 6e-34
Identities = 81/175 (46%), Positives = 109/175 (62%), Gaps = 5/175 (2%)
Frame = +2

Query: 38 LFLASLAICLLFISATAADT----DPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFK 205
L+ + + +S +++D D +I+QV G + + +D SLFK
Sbjct: 6 LYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVV----------GGAEPQVLTSEDHFSLFK 55

Query: 206 RFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEF-SRYL 382
R +FGK YAS EE +RF +F+ NL +A + Q+ DPSATHGVT+FSDLT EF ++L
Sbjct: 56 R---KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHL 112

Query: 383 GLKTPHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
G+++ L N APILPT +LP DFDWRD GAV+ VKNQGSCGSCW+FS TG
Sbjct: 113 GVRSGFKLPKDANK-APILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATG 166


>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1
PE=2 SV=1
Length = 371

Score = 142 bits (359), Expect = 8e-34
Identities = 88/179 (49%), Positives = 106/179 (59%), Gaps = 9/179 (5%)
Frame = +2

Query: 38 LFLASLAICLLFISATAADTDPLIKQVT----DAELEGSSLSGIPISHISSRQDVESLFK 205
L L SLA +A A+ DPLI+QV D +LE + ES F
Sbjct: 6 LLLLSLASAAAVAAAVDAE-DPLIRQVVPGGDDNDLE---------------LNAESHFL 49

Query: 206 RFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YL 382
F+ RFGKSY +E A+R +F+ NL +A + Q DPSA HGVTKFSDLT EF R YL
Sbjct: 50 SFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLLDPSAEHGVTKFSDLTPAEFRRTYL 109

Query: 383 GL-KTPHFLKPSLNS---DAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
GL K+ L L +AP+LPT+ LP DFDWRD GAV VKNQGSCGSCW+FS +G
Sbjct: 110 GLRKSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASG 168


>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2
SV=1
Length = 363

Score = 142 bits (357), Expect = 1e-33
Identities = 83/173 (47%), Positives = 107/173 (61%), Gaps = 4/173 (2%)
Frame = +2

Query: 41 FLASLAICLLFISATAADT---DPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRF 211
FL +L + +A DT D +I+QV D E + H+ + E F F
Sbjct: 5 FLFALFLFAAVATAVTDDTNNDDFIIRQVVDNEED----------HLLN---AEHHFTSF 51

Query: 212 IDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGL 388
+F KSYA+ EE +RF +F+ NL+KA Q +DP+A HG+TKFSDLT EF R +LGL
Sbjct: 52 KSKFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRDPTAEHGITKFSDLTASEFRRQFLGL 111

Query: 389 KTPHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
K L P+ APILPT +LP DFDWR++GAV+ VK+QGSCGSCW FSTTG
Sbjct: 112 KKRLRL-PAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTG 163


>sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1
Length = 462

Score = 110 bits (275), Expect = 4e-24
Identities = 61/120 (50%), Positives = 77/120 (64%), Gaps = 3/120 (2%)
Frame = +2

Query: 197 LFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDP-SATHGVTKFSDLTEEEFS 373
LFK F+ + ++Y S EE R +F RN+++A KIQ D +A +G+TKFSDLTEEEF
Sbjct: 164 LFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFH 223

Query: 374 R-YLGLKTPHFLKPSLNSDAPILPTNDL-PSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
YL P K S +P NDL P ++DWR +GAV+EVKNQG CGSCW FS TG
Sbjct: 224 TIYLN---PLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTG 280


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis
thaliana GN=XCP1 PE=1 SV=1
Length = 355

Score = 103 bits (256), Expect = 7e-22
Identities = 56/137 (40%), Positives = 78/137 (56%), Gaps = 4/137 (2%)
Frame = +2

Query: 143 SLSGIPISHISSRQDVESLFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPS 322
S+ G H+++ + LF+ ++ K+Y S EE+ HRF +F NL+ + + S
Sbjct: 32 SIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS 91

Query: 323 ATHGVTKFSDLTEEEFS-RYLGLKTPHFLK---PSLNSDAPILPTNDLPSDFDWRDRGAV 490
G+ +F+DLT EEF RYLGL P F + PS N + DLP DWR +GAV
Sbjct: 92 YWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAV 149

Query: 491 SEVKNQGSCGSCWTFST 541
+ VK+QG CGSCW FST
Sbjct: 150 APVKDQGQCGSCWAFST 166


>sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1
Length = 484

Score = 102 bits (254), Expect = 1e-21
Identities = 58/120 (48%), Positives = 76/120 (63%), Gaps = 2/120 (1%)
Frame = +2

Query: 194 SLFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDP-SATHGVTKFSDLTEEEF 370
S+FK F+ + ++Y S EE R +F N+V+A KIQ D +A +GVTKFSDLTEEEF
Sbjct: 185 SIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEF 244

Query: 371 SRYLGLKTPHFLKPSLNSDAPILPTNDL-PSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
R + L T +P N DL P ++DWR +GAV++VK+QG CGSCW FS TG
Sbjct: 245 -RTIYLNTLLRKEPG-NKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTG 302


>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163
OS=Drosophila melanogaster GN=CG12163 PE=2 SV=2
Length = 614

Score = 101 bits (252), Expect = 2e-21
Identities = 54/123 (43%), Positives = 75/123 (60%), Gaps = 3/123 (2%)
Frame = +2

Query: 188 VESLFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDP-SATHGVTKFSDLTEE 364
V+ LF +F RFG+ Y S ER R RIF +NL ++ + SA +G+T+F+D+T
Sbjct: 304 VDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSS 363

Query: 365 EFSRYLGLKTPHFLKPSLNSDAPILPT--NDLPSDFDWRDRGAVSEVKNQGSCGSCWTFS 538
E+ GL K + S A ++P +LP +FDWR + AV++VKNQGSCGSCW FS
Sbjct: 364 EYKERTGLWQRDEAKATGGS-AAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFS 422

Query: 539 TTG 547
TG
Sbjct: 423 VTG 425


>sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata
multicapsid polyhedrosis virus GN=VCATH PE=3 SV=1
Length = 324

Score = 99.4 bits (246), Expect = 1e-20
Identities = 48/117 (41%), Positives = 70/117 (59%), Gaps = 1/117 (0%)
Frame = +2

Query: 200 FKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEE-FSR 376
F+ F+ +F K+Y+S E+ HRF+IF+ NL + + D +A + + KFSDL++EE S+
Sbjct: 28 FEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQNDSTAQYEINKFSDLSKEEAISK 87

Query: 377 YLGLKTPHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
Y GL PH + P + P +FDWR V+ VKNQG CG+CW F+T G
Sbjct: 88 YTGLSLPHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCGACWAFATLG 144


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis
thaliana GN=XCP2 PE=1 SV=2
Length = 356

Score = 97.4 bits (241), Expect = 4e-20
Identities = 54/136 (39%), Positives = 76/136 (55%), Gaps = 3/136 (2%)
Frame = +2

Query: 143 SLSGIPISHISSRQDVESLFKRFIDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPS 322
S+ G + S + LF+ +I F K+Y + EE+ RF +F+ NL + ++ S
Sbjct: 32 SIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKS 91

Query: 323 ATHGVTKFSDLTEEEFSR-YLGLKTPHFLKPSLNSDAPIL--PTNDLPSDFDWRDRGAVS 493
G+ +F+DL+ EEF + YLGLKT + S A +P DWR +GAV+
Sbjct: 92 YWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVA 151

Query: 494 EVKNQGSCGSCWTFST 541
EVKNQGSCGSCW FST
Sbjct: 152 EVKNQGSCGSCWAFST 167


tr_hit_id A9NVM7
Definition tr|A9NVM7|A9NVM7_PICSI Putative uncharacterized protein OS=Picea sitchensis
Align length 171
Score (bit) 173.0
E-value 6.0e-42
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK955664|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0023_L09, 5'
(548 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|A9NVM7|A9NVM7_PICSI Putative uncharacterized protein OS=Picea... 173 6e-42
tr|A9NKS7|A9NKS7_PICSI Putative uncharacterized protein OS=Picea... 173 6e-42
tr|Q9XGH8|Q9XGH8_TOBAC Putative preprocysteine proteinase OS=Nic... 171 3e-41
tr|Q2QFR2|Q2QFR2_NICBE Cysteine proteinase glycinain type (Fragm... 169 7e-41
tr|Q43579|Q43579_TOBAC Tobacco pre-pro-cysteine proteinase OS=Ni... 169 9e-41
tr|Q84YH6|Q84YH6_TOBAC CPR2-like cysteine proteinase OS=Nicotian... 168 1e-40
tr|Q07491|Q07491_SOLLC Pre-pro-cysteine proteinase (Fragment) OS... 166 7e-40
tr|Q43580|Q43580_TOBAC Tobacco pre-pro-cysteine proteinase OS=Ni... 165 2e-39
tr|B1Q474|B1Q474_CAPCH Putative cysteine proteinase OS=Capsicum ... 161 2e-38
tr|Q9XED9|Q9XED9_SOLME Cysteine proteinase OS=Solanum melongena ... 159 9e-38
tr|Q7PCC5|Q7PCC5_HORVU Putative cysteine proteinase OS=Hordeum v... 157 3e-37
tr|Q75QV8|Q75QV8_ASTTR Cysteine protease OS=Aster tripolium GN=C... 156 6e-37
tr|A9P833|A9P833_POPTR Putative uncharacterized protein OS=Popul... 156 8e-37
tr|Q3L0K1|Q3L0K1_POPTO Cysteine proteinase OS=Populus tomentosa ... 155 1e-36
tr|Q5K4K8|Q5K4K8_GOSHI Putative papain-like cysteine proteinase ... 155 1e-36
tr|A7P7V6|A7P7V6_VITVI Chromosome chr3 scaffold_8, whole genome ... 154 3e-36
tr|A9UFX8|A9UFX8_VITVI Cysteine protease OS=Vitis vinifera GN=Cy... 152 9e-36
tr|Q945R8|Q945R8_SANAU Cysteine proteinase (Fragment) OS=Sanders... 151 2e-35
tr|Q680L1|Q680L1_ARATH Putative cysteine proteinase OS=Arabidops... 151 2e-35
tr|Q058I7|Q058I7_ARATH At2g21430 OS=Arabidopsis thaliana PE=2 SV=1 151 2e-35
tr|Q5MB22|Q5MB22_WHEAT Cysteine protease OS=Triticum aestivum GN... 150 3e-35
tr|A9S6L4|A9S6L4_PHYPA Predicted protein OS=Physcomitrella paten... 150 3e-35
tr|A9TTL3|A9TTL3_PHYPA Predicted protein OS=Physcomitrella paten... 149 9e-35
tr|A5HIJ3|A5HIJ3_ACTDE Cysteine protease Cp3 OS=Actinidia delici... 149 1e-34
tr|Q7XW09|Q7XW09_ORYSJ OSJNBb0054B09.3 protein OS=Oryza sativa s... 147 3e-34
tr|Q01M10|Q01M10_ORYSA OSIGBa0148D14.11 protein (OSIGBa0130O15.4... 147 3e-34
tr|A9NU42|A9NU42_PICSI Putative uncharacterized protein OS=Picea... 147 5e-34
tr|Q8W179|Q8W179_BRAOL Senescence-associated cysteine protease O... 146 6e-34
tr|O81930|O81930_CICAR Cysteine proteinase OS=Cicer arietinum PE... 146 6e-34
tr|A9P971|A9P971_POPTR Putative uncharacterized protein OS=Popul... 146 6e-34

>tr|A9NVM7|A9NVM7_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 366

Score = 173 bits (438), Expect = 6e-42
Identities = 89/171 (52%), Positives = 115/171 (67%), Gaps = 1/171 (0%)
Frame = +2

Query: 38 LFLASLAICLLFISATAADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFID 217
LF A ++F+S+ D LI+QVTD + + + S+ + E F+ FI
Sbjct: 7 LFSAFCIFSVIFLSSATKPDDDLIRQVTDEVVSDPQI----LDARSALFNAEVHFRHFIR 62

Query: 218 RFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFS-RYLGLKT 394
R+GK Y+ PEE HRF +F+ NL++A + Q+ DP A+HGVTKFSDLT+EEF +YLGL+
Sbjct: 63 RYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGLRA 122

Query: 395 PHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
P DAPILPTNDLP DFDWR++GAV+EVKNQGSCGSCW FSTTG
Sbjct: 123 PPLRDAH---DAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTG 170


>tr|A9NKS7|A9NKS7_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 366

Score = 173 bits (438), Expect = 6e-42
Identities = 89/171 (52%), Positives = 115/171 (67%), Gaps = 1/171 (0%)
Frame = +2

Query: 38 LFLASLAICLLFISATAADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFID 217
LF A ++F+S+ D LI+QVTD + + + S+ + E F+ FI
Sbjct: 7 LFSAFCIFSVIFLSSATRPDDDLIRQVTDEVVSDPQI----LDARSALFNAEVHFRHFIR 62

Query: 218 RFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFS-RYLGLKT 394
R+GK Y+ PEE HRF +F+ NL++A + Q+ DP A+HGVTKFSDLT+EEF +YLGL+
Sbjct: 63 RYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRASHGVTKFSDLTQEEFRHQYLGLRA 122

Query: 395 PHFLKPSLNSDAPILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
P DAPILPTNDLP DFDWR++GAV+EVKNQGSCGSCW FSTTG
Sbjct: 123 PPLRDAH---DAPILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTG 170


>tr|Q9XGH8|Q9XGH8_TOBAC Putative preprocysteine proteinase
OS=Nicotiana tabacum GN=cpr2 PE=2 SV=1
Length = 363

Score = 171 bits (432), Expect = 3e-41
Identities = 99/173 (57%), Positives = 119/173 (68%), Gaps = 3/173 (1%)
Frame = +2

Query: 38 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 214
LFL SL +LF SA A +D DPLI+QV +S SH+ + + SLFK
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQV---------VSETDDSHLLNAEHHFSLFK--- 51

Query: 215 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 391
+FGK YAS EE HRF++F+ NL +A + Q DPSA HG+TKFSDLT EF R YLGL
Sbjct: 52 SKFGKIYASEEEHDHRFKVFKANLRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLH 111

Query: 392 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
P KP LN++ APILPT+DLP+DFDWRD GAV+ VKNQGSCGSCW+FSTTG
Sbjct: 112 KP---KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTG 161


>tr|Q2QFR2|Q2QFR2_NICBE Cysteine proteinase glycinain type
(Fragment) OS=Nicotiana benthamiana PE=2 SV=1
Length = 355

Score = 169 bits (429), Expect = 7e-41
Identities = 99/173 (57%), Positives = 119/173 (68%), Gaps = 3/173 (1%)
Frame = +2

Query: 38 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 214
LFL SL LF SA A +D DPLI+QV +E E SH+ + + SLFK
Sbjct: 4 LFLLSLLAFALFSSAVAFSDEDPLIRQVV-SETETDD------SHLLNAEHHFSLFK--- 53

Query: 215 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 391
+FGK YAS EE HRF++F+ NL +A + Q DPSA HG+TKFSDLT EF R YLGL
Sbjct: 54 SKFGKIYASEEEHDHRFKVFKANLRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLH 113

Query: 392 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
P KP LN++ APILPT+DLP+D+DWRD GAV+ VKNQGSCGSCW+FSTTG
Sbjct: 114 KP---KPKLNAEKAPILPTSDLPADYDWRDHGAVTGVKNQGSCGSCWSFSTTG 163


>tr|Q43579|Q43579_TOBAC Tobacco pre-pro-cysteine proteinase
OS=Nicotiana tabacum PE=2 SV=1
Length = 363

Score = 169 bits (428), Expect = 9e-41
Identities = 99/173 (57%), Positives = 118/173 (68%), Gaps = 3/173 (1%)
Frame = +2

Query: 38 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 214
LFL SL +LF SA A +D DPLI+QV +S SH+ + + SLFK
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQV---------VSETDDSHLLNAEHHFSLFK--- 51

Query: 215 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 391
+FGK YAS EE HRF++F+ NL +A Q DPSA HG+TKFSDLT EF R YLGL
Sbjct: 52 SKFGKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLH 111

Query: 392 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
P KP LN++ APILPT+DLP+DFDWRD GAV+ VKNQGSCGSCW+FSTTG
Sbjct: 112 KP---KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTG 161


>tr|Q84YH6|Q84YH6_TOBAC CPR2-like cysteine proteinase OS=Nicotiana
tabacum PE=3 SV=1
Length = 363

Score = 168 bits (426), Expect = 1e-40
Identities = 98/173 (56%), Positives = 118/173 (68%), Gaps = 3/173 (1%)
Frame = +2

Query: 38 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 214
LFL SL +LF SA A +D DPLI+QV +S SH+ + + SLFK
Sbjct: 4 LFLLSLLAFVLFSSAIAFSDEDPLIRQV---------VSETDDSHLLNAEHHFSLFK--- 51

Query: 215 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 391
+FGK YAS EE HRF++F+ N +A + Q DPSA HG+TKFSDLT EF R YLGL
Sbjct: 52 SKFGKIYASEEEHDHRFKVFKANRRRARRHQLLDPSAEHGITKFSDLTPSEFRRTYLGLH 111

Query: 392 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
P KP LN++ APILPT+DLP+DFDWRD GAV+ VKNQGSCGSCW+FSTTG
Sbjct: 112 KP---KPKLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFSTTG 161


>tr|Q07491|Q07491_SOLLC Pre-pro-cysteine proteinase (Fragment)
OS=Solanum lycopersicum PE=2 SV=1
Length = 361

Score = 166 bits (420), Expect = 7e-40
Identities = 95/173 (54%), Positives = 119/173 (68%), Gaps = 3/173 (1%)
Frame = +2

Query: 38 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 214
LFL S LF SA A +D DPLI+QV +SG +H+ + + SLFK
Sbjct: 2 LFLLSFLAFALFSSAIAFSDDDPLIRQV---------VSGNDDNHMLNAEHHFSLFKA-- 50

Query: 215 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 391
+FGK YAS EE HR ++F+ NL +A + Q DPSA HG+T+FSDLT EF R YLGL
Sbjct: 51 -KFGKIYASQEEHDHRLKVFKANLHRAKRHQLLDPSAEHGITQFSDLTPSEFRRTYLGLN 109

Query: 392 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
P +P+LN++ APILPT DLPSDFDWR++GAV++VKNQGSCGSCW+FSTTG
Sbjct: 110 KP---RPNLNAEKAPILPTKDLPSDFDWREKGAVTDVKNQGSCGSCWSFSTTG 159


>tr|Q43580|Q43580_TOBAC Tobacco pre-pro-cysteine proteinase
OS=Nicotiana tabacum PE=2 SV=1
Length = 365

Score = 165 bits (417), Expect = 2e-39
Identities = 98/173 (56%), Positives = 117/173 (67%), Gaps = 3/173 (1%)
Frame = +2

Query: 38 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 214
LFL SL LF SA A D DPLI+QV +E E SH+ + + SLFK
Sbjct: 4 LFLLSLPRFALFSSAIAFPDEDPLIRQVV-SETETDD------SHLLNAEHHFSLFK--- 53

Query: 215 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 391
+FGK YAS EE HRF++F+ NL +A Q DPSA HG+TKFSDLT EF R YLGL
Sbjct: 54 SKFGKIYASEEEHDHRFKVFKANLRRARLNQLLDPSAEHGITKFSDLTPSEFRRTYLGLH 113

Query: 392 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
P KP +N++ APILPT+DLP+D+DWRD GAV+ VKNQGSCGSCW+FSTTG
Sbjct: 114 KP---KPKVNAEKAPILPTSDLPADYDWRDHGAVTGVKNQGSCGSCWSFSTTG 163


>tr|B1Q474|B1Q474_CAPCH Putative cysteine proteinase OS=Capsicum
chinense PE=2 SV=1
Length = 367

Score = 161 bits (407), Expect = 2e-38
Identities = 91/174 (52%), Positives = 117/174 (67%), Gaps = 4/174 (2%)
Frame = +2

Query: 38 LFLASLAICLLFISATAA--DTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRF 211
LFL SL + +F S+ A D DPLI+QVT + ++ H+ + + SLFK
Sbjct: 4 LFLLSLLVFTIFSSSAFAFSDEDPLIRQVTSESDDNNN-------HLLNAEHHFSLFK-- 54

Query: 212 IDRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGL 388
+FGK YA+ EE HR ++F+ NL +A + Q DP+A HG+TKFSDLT EF R YLGL
Sbjct: 55 -SKFGKIYATQEEHDHRLKVFKANLRRARRHQLLDPTAEHGITKFSDLTPSEFRRTYLGL 113

Query: 389 KTPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
P KP L++ APILPT+DLP DFDWR++GAV+ VKNQGSCGSCW+FSTTG
Sbjct: 114 HKP---KPKLSTTKAPILPTSDLPEDFDWREKGAVTGVKNQGSCGSCWSFSTTG 164


>tr|Q9XED9|Q9XED9_SOLME Cysteine proteinase OS=Solanum melongena
PE=2 SV=1
Length = 363

Score = 159 bits (402), Expect = 9e-38
Identities = 92/173 (53%), Positives = 116/173 (67%), Gaps = 3/173 (1%)
Frame = +2

Query: 38 LFLASLAICLLFISATA-ADTDPLIKQVTDAELEGSSLSGIPISHISSRQDVESLFKRFI 214
LFL SL LF SA A +D DPLI+QV +S +H+ + + SLFK
Sbjct: 4 LFLLSLLAFALFSSAIAFSDDDPLIRQV---------VSETDDNHMLNAEHHFSLFK--- 51

Query: 215 DRFGKSYASPEERAHRFRIFERNLVKAAKIQRQDPSATHGVTKFSDLTEEEFSR-YLGLK 391
++GK YAS EE HR ++F+ NL +A + Q DP+A HG+T+FSDLT EF R YLGL
Sbjct: 52 SKYGKIYASQEEHDHRLKVFKANLRRARRHQLLDPTAEHGITQFSDLTPSEFRRTYLGLH 111

Query: 392 TPHFLKPSLNSD-APILPTNDLPSDFDWRDRGAVSEVKNQGSCGSCWTFSTTG 547
P +P LN+ APILPT+DLP DFDWR++GAV+ VKNQGSCGSCW+FSTTG
Sbjct: 112 KP---RPKLNAQKAPILPTSDLPEDFDWREKGAVTGVKNQGSCGSCWSFSTTG 161