DK951787
Clone id TST38A01NGRL0012_F08
Library
Length 669
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0012_F08. 5' end sequence.
Accession
Tissue type prothallia
Developmental stage gametophyte
Contig ID
Sequence
CAGCTCGGCACGTGTGAATCCGACCTCCACCTCAGAGGATGCGTTCACTGAGGCATTCTC
GCAGAGGCTAGTGAGAGACTCGGCACGTGTGAAGGGCATAGAGGAGAGACTGGAAGTGGC
ACGCATGGGGGCAAATACTGCGGAATGGAAGCCGGCGAATGTGGATGGGTTAAGCCAGGA
GGAGATATCGAGCCCCCTTGTGTCAGGGCTGAGCCAGGGCAGCGGGGAGTACTTTGCGCG
ATTCGGCGTGGGAACGCCCCCTAAAGAATCGTATTTTGTGATCGACACAGGGAGTGACTT
ATCCTGGACACAGTGCGCCCCCTGCAATAGCTGCTACAAGCAGACTGACCCCATCTTCAA
TCCAGCGGACTCCTCTTCTTACAATCCTGTCTCTTGCAGTGATGCCCTATGCTCGCAGCT
GCTCGTGCGCGGCTGCCAACGCAATCAGTGCCTCTACCAAGTCAGCTACGGCGACGGCTC
CTTCACAGTCGGCGATTTCGCCGTCGAAACGTTCACTTTCTCCGGTAATCAAGTAGAGAG
AGTCGCCCTGGGCTGCGGCCATGACAACGAGGGTCTCTTTATTGGTGCGTCGGGCTTGCT
GGGTCTGGGCGCCGGCCGCCTATCTCTTCCCACTCAGCTGAAGCAGGAATTCTCCTACTG
CCTTCCCGA
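The BLASTX alignments reported below all use reading frame +2 of this sequence. As a minimal sketch, the first 60 nt can be translated in that frame with the standard genetic code to reproduce the start of the aligned query. The helper name `translate` and the way the codon table is built are illustrative assumptions, not part of the record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(seq, frame=1):
    """Translate a forward frame (1, 2 or 3); unknown codons become 'X'."""
    start = frame - 1
    codons = (seq[i:i + 3] for i in range(start, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(c, "X") for c in codons)

# First line (60 nt) of the sequence above.
first_line = "CAGCTCGGCACGTGTGAATCCGACCTCCACCTCAGAGGATGCGTTCACTGAGGCATTCTC"
protein = translate(first_line, frame=2)
print(protein)  # SSARVNPTSTSEDAFTEAF
```

The frame +2 translation starts at nucleotide 2, so dropping its first residue gives the peptide that begins at nucleotide 5, which is where the top Swiss-Prot alignment starts (`SARVNPTSTSEDAFTEAF...`).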
■■ Homology search results ■■
sp_hit_id Q766C2
Definition sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis
Align length 197
Score (bit) 134.0
E-value 4.0e-31
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK951787|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0012_F08, 5'
(669 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters




Sequences producing significant alignments (score in bits, then E-value):

sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 OS=Nepent... 134 4e-31
sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 OS=Nepent... 124 5e-28
sp|Q9S9K4|ASPL2_ARATH Aspartic proteinase-like protein 2 OS=Arab... 70 8e-12
sp|Q0IU52|ASP1_ORYSJ Aspartic proteinase Asp1 OS=Oryza sativa su... 70 8e-12
sp|P69477|NEP2_NEPDI Aspartic proteinase nepenthesin-2 (Fragment... 65 4e-10
sp|A2ZC67|ASP1_ORYSI Aspartic proteinase Asp1 OS=Oryza sativa su... 64 7e-10
sp|P69476|NEP1_NEPDI Aspartic proteinase nepenthesin-1 (Fragment... 60 8e-09
sp|P25796|CATE_CAVPO Cathepsin E OS=Cavia porcellus GN=CTSE PE=1... 58 5e-08
sp|Q64411|PEPC_CAVPO Gastricsin OS=Cavia porcellus GN=PGC PE=2 SV=1 54 6e-07
sp|P70269|CATE_MOUSE Cathepsin E OS=Mus musculus GN=Ctse PE=1 SV=1 54 6e-07
sp|Q9LX20|ASPL1_ARATH Aspartic proteinase-like protein 1 OS=Arab... 54 7e-07
sp|P16228|CATE_RAT Cathepsin E OS=Rattus norvegicus GN=Ctse PE=1... 53 2e-06
sp|P16476|PEPE_CHICK Embryonic pepsinogen OS=Gallus gallus PE=2 ... 50 8e-06
sp|P03955|PEPC_MACFU Gastricsin (Fragment) OS=Macaca fuscata fus... 50 1e-05
sp|P20142|PEPC_HUMAN Gastricsin OS=Homo sapiens GN=PGC PE=1 SV=1 50 1e-05
sp|Q6DLW5|RENI_MACMU Renin OS=Macaca mulatta GN=REN PE=2 SV=2 49 2e-05
sp|Q6DLS0|RENI_MACFA Renin OS=Macaca fascicularis GN=REN PE=2 SV=1 49 2e-05
sp|P11489|PEPA_MACMU Pepsin A OS=Macaca mulatta GN=PGA PE=2 SV=1 49 2e-05
sp|P03954|PEPA1_MACFU Pepsin A-1 OS=Macaca fuscata fuscata GN=PG... 49 2e-05
sp|P81214|CARP_SYNRA Syncephapepsin OS=Syncephalastrum racemosum... 49 3e-05
sp|P60016|RENI_PANTR Renin OS=Pan troglodytes GN=REN PE=3 SV=1 48 4e-05
sp|P00797|RENI_HUMAN Renin OS=Homo sapiens GN=REN PE=1 SV=1 48 4e-05
sp|Q9N2D3|PEPC_CALJA Gastricsin OS=Callithrix jacchus GN=PGC PE=... 48 4e-05
sp|Q9GMY7|PEPA_RHIFE Pepsin A OS=Rhinolophus ferrumequinum GN=PG... 48 4e-05
sp|Q9GMY6|PEPA_CANFA Pepsin A OS=Canis familiaris GN=PGA PE=2 SV=1 48 4e-05
sp|P04073|PEPC_RAT Gastricsin OS=Rattus norvegicus GN=Pgc PE=1 SV=1 48 5e-05
sp|P81497|PEPA_SUNMU Pepsin A OS=Suncus murinus GN=PGA PE=1 SV=2 47 7e-05
sp|P00790|PEPA_HUMAN Pepsin A OS=Homo sapiens GN=PGA3 PE=1 SV=1 47 7e-05
sp|P28713|PEPA4_RABIT Pepsin II-4 OS=Oryctolagus cuniculus PE=2 ... 47 7e-05
sp|P27678|PEPA4_MACFU Pepsin A-4 OS=Macaca fuscata fuscata GN=PG... 47 7e-05

>sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 OS=Nepenthes
gracilis GN=nep2 PE=1 SV=1
Length = 438

Score = 134 bits (337), Expect = 4e-31
Identities = 72/197 (36%), Positives = 100/197 (50%), Gaps = 8/197 (4%)
Frame = +2

Query: 5 SARVNPTSTSEDAFTEAFSQRLVRDSARVKGIEE--------RLEVARMGANTAEWKPAN 160
SA V PTS++ Q+ + RV +E+ + E+ + E + +
Sbjct: 16 SAIVAPTSSTSRGTLLHHGQKRPQPGLRVD-LEQVDSGKNLTKYELIKRAIKRGERRMRS 74

Query: 161 VDGLSQEEISSPLVSGLSQGSGEYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYK 340
++ + Q SS + + + G GEY +GTP ++DTGSDL WTQC PC C+
Sbjct: 75 INAMLQS--SSGIETPVYAGDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFS 132

Query: 341 QTDPIFNPADSSSYNPVSCSDALCSQLLVRGCQRNQCLYQVSYGDGSFTVGDFAVETFTF 520
Q PIFNP DSSS++ + C C L C N+C Y YGDGS T G A ETFTF
Sbjct: 133 QPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTF 192

Query: 521 SGNQVERVALGCGHDNE 571
+ V +A GCG DN+
Sbjct: 193 ETSSVPNIAFGCGEDNQ 209
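The percentages in the header of this alignment appear to be integer-truncated ratios over the alignment length: 72/197 is about 36.5% but is printed as 36%. A small sketch that reproduces the three values, using a hypothetical helper named `percent`:

```python
def percent(count, length):
    """Integer-truncated percentage, as the values in the report appear to be."""
    return count * 100 // length

align_len = 197  # alignment length of the nepenthesin-2 hit
for label, count in [("Identities", 72), ("Positives", 100), ("Gaps", 8)]:
    print(f"{label} = {count}/{align_len} ({percent(count, align_len)}%)")
```

The same truncation is consistent with the other hits in this report, for example 98/209 printed as 46%.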


>sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 OS=Nepenthes
gracilis GN=nep1 PE=1 SV=1
Length = 437

Score = 124 bits (311), Expect = 5e-28
Identities = 72/209 (34%), Positives = 98/209 (46%), Gaps = 12/209 (5%)
Frame = +2

Query: 74 RDSARVKGIEERLEVARMGANTAEWK---PANVDGLSQEEISSPLVSGLS-------QGS 223
R A+V G + LE G N +++ A G + + +++G S G
Sbjct: 33 RHEAKVTGFQIMLEHVDSGKNLTKFQLLERAIERGSRRLQRLEAMLNGPSGVETSVYAGD 92

Query: 224 GEYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSCSD 403
GEY +GTP + ++DTGSDL WTQC PC C+ Q+ PIFNP SSS++ + CS
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSS 152

Query: 404 ALCSQLLVRGCQRNQCLYQVSYGDGSFTVGDFAVETFTFSGNQVERVALGCGHDNE-XXX 580
LC L C N C Y YGDGS T G ET TF + + GCG +N+
Sbjct: 153 QLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQ 212

Query: 581 XXXXXXXXXXXXXXXXPTQLK-QEFSYCL 664
P+QL +FSYC+
Sbjct: 213 GNGAGLVGMGRGPLSLPSQLDVTKFSYCM 241


>sp|Q9S9K4|ASPL2_ARATH Aspartic proteinase-like protein 2
OS=Arabidopsis thaliana GN=At1g65240 PE=1 SV=2
Length = 475

Score = 70.5 bits (171), Expect = 8e-12
Identities = 43/129 (33%), Positives = 60/129 (46%), Gaps = 15/129 (11%)
Frame = +2

Query: 224 GEYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTD-----PIFNPADSSSYNP 388
G YF + +G+PPKE + +DTGSD+ W C PC C +T+ +F+ SS+
Sbjct: 72 GLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKK 131

Query: 389 VSCSDALCSQLLVR-GCQ-RNQCLYQVSYGDGSFTVGDFAVETFTF--------SGNQVE 538
V C D CS + CQ C Y + Y D S + G F + T +G +
Sbjct: 132 VGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQ 191

Query: 539 RVALGCGHD 565
V GCG D
Sbjct: 192 EVVFGCGSD 200


>sp|Q0IU52|ASP1_ORYSJ Aspartic proteinase Asp1 OS=Oryza sativa
subsp. japonica GN=ASP1 PE=2 SV=1
Length = 410

Score = 70.5 bits (171), Expect = 8e-12
Identities = 41/124 (33%), Positives = 61/124 (49%), Gaps = 10/124 (8%)
Frame = +2

Query: 224 GEYFARFGVGTPPKESYFVIDTGSDLSWTQC-APCNSCYKQTDPIFNPADSSSYNPVSCS 400
G +F +G P K + IDTGS L+W QC APC +C ++ P V+C+
Sbjct: 36 GHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKL---VTCA 92

Query: 401 DALCSQLLV------RGCQRNQCLYQVSYGDGSFTVGDFAVETFTFS---GNQVERVALG 553
D+LC+ L R + QC Y + Y D S ++G ++ F+ S G +A G
Sbjct: 93 DSLCTDLYTDLGKPKRCGSQKQCDYVIQYVDSS-SMGVLVIDRFSLSASNGTNPTTIAFG 151

Query: 554 CGHD 565
CG+D
Sbjct: 152 CGYD 155


>sp|P69477|NEP2_NEPDI Aspartic proteinase nepenthesin-2 (Fragments)
OS=Nepenthes distillatoria PE=1 SV=1
Length = 178

Score = 64.7 bits (156), Expect = 4e-10
Identities = 35/88 (39%), Positives = 41/88 (46%)
Frame = +2

Query: 296 DLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSCSDALCSQLLVRGCQRNQCLYQVSYGD 475
DL WTQC PC C+ Q DSSS++ + C C L C C Y YGD
Sbjct: 20 DLIWTQCEPCTQCFSQ--------DSSSFSTLPCESQYCQDLPSETC---DCQYTYGYGD 68

Query: 476 GSFTVGDFAVETFTFSGNQVERVALGCG 559
GS T G A E G+ V +A GCG
Sbjct: 69 GSSTQGYMAXE----DGSSVPNIAFGCG 92


>sp|A2ZC67|ASP1_ORYSI Aspartic proteinase Asp1 OS=Oryza sativa
subsp. indica GN=ASP1 PE=2 SV=2
Length = 410

Score = 63.9 bits (154), Expect = 7e-10
Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 10/124 (8%)
Frame = +2

Query: 224 GEYFARFGVGTPPKESYFVIDTGSDLSWTQC-APCNSCYKQTDPIFNPADSSSYNPVSCS 400
G +F +G P K + IDTGS L+W QC PC +C K ++ P + V C+
Sbjct: 36 GHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLYKPELKYA---VKCT 92

Query: 401 DALCSQLL------VRGCQRNQCLYQVSYGDGSFTVGDFAVETFTF---SGNQVERVALG 553
+ C+ L ++ +NQC Y + Y GS ++G V++F+ +G +A G
Sbjct: 93 EQRCADLYADLRKPMKCGPKNQCHYGIQYVGGS-SIGVLIVDSFSLPASNGTNPTSIAFG 151

Query: 554 CGHD 565
CG++
Sbjct: 152 CGYN 155


>sp|P69476|NEP1_NEPDI Aspartic proteinase nepenthesin-1 (Fragments)
OS=Nepenthes distillatoria PE=1 SV=1
Length = 164

Score = 60.5 bits (145), Expect = 8e-09
Identities = 37/114 (32%), Positives = 48/114 (42%)
Frame = +2

Query: 218 GSGEYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSC 397
G GEY +GTP + ++DTGSDL WTQ P + Q+D P SSS++ + C
Sbjct: 13 GDGEYLMXLSIGTPAQPFSAIMDTGSDLIWTQXQPXTQXFXQSD----PQGSSSFSTLPC 68

Query: 398 SDALCSQLLVRGCQRNQCLYQVSYGDGSFTVGDFAVETFTFSGNQVERVALGCG 559
YGD S T G ETFTF + + G G
Sbjct: 69 ----------------------GYGD-SETQGSMGTETFTFGSVSIPNITFGXG 99


>sp|P25796|CATE_CAVPO Cathepsin E OS=Cavia porcellus GN=CTSE PE=1
SV=1
Length = 391

Score = 57.8 bits (138), Expect = 5e-08
Identities = 44/149 (29%), Positives = 61/149 (40%), Gaps = 7/149 (4%)
Frame = +2

Query: 107 RLEVARMGANTAEWKPANVDGLSQEEISS---PLVSGLSQGSGEYFARFGVGTPPKESYF 277
R ++ G T WK N++ I S PL++ L EYF +G+PP+
Sbjct: 33 RKKLRAQGQLTELWKSQNLNMDQCSTIQSANEPLINYLDM---EYFGTISIGSPPQNFTV 89

Query: 278 VIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSCSDALCSQLLVRGCQRNQCLY 457
+ DTGS W C S QT P+F+P+ SS+Y V S +
Sbjct: 90 IFDTGSSNLWVPSVYCTSPACQTHPVFHPSLSSTYREVGNS------------------F 131

Query: 458 QVSYGDGSFT----VGDFAVETFTFSGNQ 532
+ YG GS T +VE T G Q
Sbjct: 132 SIQYGTGSLTGIIGADQVSVEGLTVVGQQ 160


>sp|Q64411|PEPC_CAVPO Gastricsin OS=Cavia porcellus GN=PGC PE=2 SV=1
Length = 394

Score = 54.3 bits (129), Expect = 6e-07
Identities = 45/162 (27%), Positives = 63/162 (38%), Gaps = 6/162 (3%)
Frame = +2

Query: 86 RVKGIEERL-EVARMGANTAEWKPANVDGLSQEEIS-----SPLVSGLSQGSGEYFARFG 247
++K I E L E +G KP + + ++ S L +S YF +
Sbjct: 25 KIKSIREVLREKGLLGDFLKNHKPQHARKFFRNRLAKTGDFSVLYEPMSYMDAAYFGQIS 84

Query: 248 VGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSCSDALCSQLLV 427
+GTPP+ + DTGS W C+S T FNP DSS+Y S
Sbjct: 85 LGTPPQSFQVLFDTGSSNLWVPSVYCSSLACTTHTRFNPRDSSTYVATDQS--------- 135

Query: 428 RGCQRNQCLYQVSYGDGSFTVGDFAVETFTFSGNQVERVALG 553
+ + YG GS T G F +T T QV + G
Sbjct: 136 ---------FSLEYGTGSLT-GVFGYDTMTIQDIQVPKQEFG 167


>sp|P70269|CATE_MOUSE Cathepsin E OS=Mus musculus GN=Ctse PE=1 SV=1
Length = 397

Score = 54.3 bits (129), Expect = 6e-07
Identities = 44/176 (25%), Positives = 67/176 (38%), Gaps = 21/176 (11%)
Frame = +2

Query: 68 LVRDSARVKGIEERLEVAR----------MGANTAEWKPANVDGLSQEE-------ISSP 196
L+ D A+ +G R+ + R G + W+ N+D E ++ P
Sbjct: 11 LLLDLAQAQGALHRVPLRRHQSLRKKLRAQGQLSEFWRSHNLDMTRLSESCNVYSSVNEP 70

Query: 197 LVSGLSQGSGEYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSS 376
L++ L EYF +GTPP+ + DTGS W C S + P+F+P+ S
Sbjct: 71 LINYLDM---EYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCTSPACKAHPVFHPSQSD 127

Query: 377 SYNPVSCSDALCSQLLVRGCQRNQCLYQVSYGDGSFT----VGDFAVETFTFSGNQ 532
+Y V + + YG GS T +VE T G Q
Sbjct: 128 TYTEVGNH------------------FSIQYGTGSLTGIIGADQVSVEGLTVDGQQ 165


tr_hit_id A9NV06
Definition tr|A9NV06|A9NV06_PICSI Putative uncharacterized protein OS=Picea sitchensis
Align length 223
Score (bit) 219.0
E-value 1.0e-55
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= DK951787|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0012_F08, 5'
(669 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters




Sequences producing significant alignments (score in bits, then E-value):

tr|A9NV06|A9NV06_PICSI Putative uncharacterized protein OS=Picea... 219 1e-55
tr|A9NUE4|A9NUE4_PICSI Putative uncharacterized protein OS=Picea... 213 9e-54
tr|A7P351|A7P351_VITVI Chromosome chr1 scaffold_5, whole genome ... 206 1e-51
tr|Q9C6M0|Q9C6M0_ARATH Putative uncharacterized protein F2J7.6 (... 203 7e-51
tr|A7PGQ3|A7PGQ3_VITVI Chromosome chr17 scaffold_16, whole genom... 203 7e-51
tr|Q9LS40|Q9LS40_ARATH CND41, chloroplast nucleoid DNA binding p... 203 9e-51
tr|Q8RXI1|Q8RXI1_ARATH Putative chloroplast nucleoid DNA-binding... 202 2e-50
tr|A9PCK2|A9PCK2_POPTR Putative uncharacterized protein OS=Popul... 196 1e-48
tr|A9NV19|A9NV19_PICSI Putative uncharacterized protein OS=Picea... 192 2e-47
tr|Q9LNJ3|Q9LNJ3_ARATH F6F3.10 protein (At1g01300) (Chloroplast ... 190 6e-47
tr|A5AKV9|A5AKV9_VITVI Putative uncharacterized protein (Chromos... 190 8e-47
tr|Q8L9B9|Q8L9B9_ARATH Chloroplast nucleoid DNA binding protein,... 189 1e-46
tr|B8LLE4|B8LLE4_PICSI Putative uncharacterized protein OS=Picea... 185 2e-45
tr|Q9AWT3|Q9AWT3_ORYSJ cDNA clone:J013170M01, full insert sequen... 182 2e-44
tr|A2WKH1|A2WKH1_ORYSI Putative uncharacterized protein OS=Oryza... 182 2e-44
tr|Q9M356|Q9M356_ARATH Putative uncharacterized protein F15G16.2... 180 6e-44
tr|Q94BT8|Q94BT8_ARATH AT3g61820/F15G16_210 OS=Arabidopsis thali... 180 6e-44
tr|B8LN55|B8LN55_PICSI Putative uncharacterized protein OS=Picea... 178 2e-43
tr|A9NX87|A9NX87_PICSI Putative uncharacterized protein OS=Picea... 178 2e-43
tr|A7QNM3|A7QNM3_VITVI Chromosome undetermined scaffold_133, who... 171 4e-41
tr|B8LP16|B8LP16_PICSI Putative uncharacterized protein OS=Picea... 171 5e-41
tr|Q94D20|Q94D20_ORYSJ Nucleoid DNA-binding protein cnd41-like (... 169 1e-40
tr|A2ZV28|A2ZV28_ORYSJ Putative uncharacterized protein OS=Oryza... 169 1e-40
tr|A2WS70|A2WS70_ORYSI Putative uncharacterized protein OS=Oryza... 169 1e-40
tr|Q9LHE3|Q9LHE3_ARATH Nucleoid chloroplast DNA-binding protein-... 168 3e-40
tr|A2Y830|A2Y830_ORYSI Putative uncharacterized protein OS=Oryza... 168 3e-40
tr|Q6L558|Q6L558_ORYSJ Unknow protein OS=Oryza sativa subsp. jap... 166 9e-40
tr|Q6I5E1|Q6I5E1_ORYSJ Os05g0590000 protein OS=Oryza sativa subs... 166 9e-40
tr|Q7XKB9|Q7XKB9_ORYSJ OSJNBa0064G10.12 protein OS=Oryza sativa ... 166 2e-39
tr|Q259J3|Q259J3_ORYSA H0402C08.5 protein OS=Oryza sativa GN=H04... 166 2e-39

>tr|A9NV06|A9NV06_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 497

Score = 219 bits (558), Expect = 1e-55
Identities = 109/223 (48%), Positives = 139/223 (62%), Gaps = 9/223 (4%)
Frame = +2

Query: 23 TSTSEDAFTEAFSQRLVRDSARVKGIEERLEVARMGANTAEWKPANVDGLSQ----EEIS 190
+S++ E +RL RD+ARV I R+++A MG + AE KP N + ++ S
Sbjct: 80 SSSNTSLVKEILQERLKRDAARVDSINARVQLAAMGVSKAEMKPLNGSSIDARFDAKDFS 139

Query: 191 SPLVSGLSQGSGEYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPAD 370
S ++SGL+QGSGEYF R GVGTPP+ +Y V+DTGSD+ W QC PC CY QTDP+FNPA
Sbjct: 140 SSIISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAA 199

Query: 371 SSSYNPVSCSDALCSQLLVRGCQ-RNQCLYQVSYGDGSFTVGDFAVETFTFSGNQVERVA 547
SS+Y V C+ LC +L + GC+ + C YQVSYGDGSFTVGDF+ ET TF G + RVA
Sbjct: 200 SSTYRKVPCATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTFRGQVIRRVA 259

Query: 548 LGCGHDNEXXXXXXXXXXXXXXXXXXXPT----QLKQEFSYCL 664
LGCGHDNE P+ Q + FSYCL
Sbjct: 260 LGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCL 302


>tr|A9NUE4|A9NUE4_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 485

Score = 213 bits (542), Expect = 9e-54
Identities = 108/223 (48%), Positives = 137/223 (61%), Gaps = 7/223 (3%)
Frame = +2

Query: 17 NPTSTSEDAFTEAFSQRLVRDSARVKGIEERLEVARMGANTAEWKPANVDG--LSQEEIS 190
N +E ++ E QRL RD+ARV I RLE+A G + KP + +++ +
Sbjct: 72 NSNKNNELSYAERMQQRLKRDAARVAAINSRLELAVNGIKRSSLKPDSSSSFTMAESDFQ 131

Query: 191 SPLVSGLSQGSGEYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPAD 370
SP+VSG+ QGSGEYF+R GVG P ++ V+DTGSD++W QC PC+ CY+Q+DPI+NPA
Sbjct: 132 SPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPAL 191

Query: 371 SSSYNPVSCSDALCSQLLVRGCQRN-QCLYQVSYGDGSFTVGDFAVETFTFSGNQVERVA 547
SSSY V C LC QL V GC RN CLYQVSYGDGS+T G+FA ET T G ++ VA
Sbjct: 192 SSSYKLVGCQANLCQQLDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTLGGAPLQNVA 251

Query: 548 LGCGHDNEXXXXXXXXXXXXXXXXXXXPTQLKQE----FSYCL 664
+GCGHDNE P+QL E FSYCL
Sbjct: 252 IGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCL 294


>tr|A7P351|A7P351_VITVI Chromosome chr1 scaffold_5, whole genome
shotgun sequence OS=Vitis vinifera GN=GSVIVT00030305001
PE=4 SV=1
Length = 491

Score = 206 bits (523), Expect = 1e-51
Identities = 102/221 (46%), Positives = 135/221 (61%), Gaps = 2/221 (0%)
Frame = +2

Query: 8 ARVNPTSTSEDAFTEAFSQRLVRDSARVKGIEERLEVARMGANTAEWKPANVDGLSQEEI 187
+R + +S + RL RDS RV+ + R+++A G ++ KP + L E +
Sbjct: 82 SRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKE-LEAEAL 140

Query: 188 SSPLVSGLSQGSGEYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPA 367
+PLVSG SQGSGEYF+R G+G+PPK Y V+DTGSD++W QCAPC CY+Q DPIF P+
Sbjct: 141 ETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPS 200

Query: 368 DSSSYNPVSCSDALCSQLLVRGCQRNQCLYQVSYGDGSFTVGDFAVETFTFSGN-QVERV 544
SSSY P++C C L V C+ + CLY+VSYGDGS+TVGDFA ET T G+ + V
Sbjct: 201 FSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGSASLNNV 260

Query: 545 ALGCGHDNEXXXXXXXXXXXXXXXXXXXPTQLK-QEFSYCL 664
A+GCGHDNE P+Q+ FSYCL
Sbjct: 261 AIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCL 301


>tr|Q9C6M0|Q9C6M0_ARATH Putative uncharacterized protein F2J7.6
(Putative uncharacterized protein At1g25510)
OS=Arabidopsis thaliana GN=At1g25510 PE=2 SV=1
Length = 483

Score = 203 bits (517), Expect = 7e-51
Identities = 102/221 (46%), Positives = 133/221 (60%), Gaps = 2/221 (0%)
Frame = +2

Query: 8 ARVNPTSTSEDAFTEAFSQRLVRDSARVKGIEERLEVARMGANTAEWKPANVDGLSQEE- 184
+RV+ T + RL RD+ARVK + RL++A + A+ KP + ++E+
Sbjct: 73 SRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQD 132

Query: 185 ISSPLVSGLSQGSGEYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNP 364
I +PL+SG +QGSGEYF R G+G P +E Y V+DTGSD++W QC PC CY QT+PIF P
Sbjct: 133 IEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEP 192

Query: 365 ADSSSYNPVSCSDALCSQLLVRGCQRNQCLYQVSYGDGSFTVGDFAVETFTFSGNQVERV 544
+ SSSY P+SC C+ L V C+ CLY+VSYGDGS+TVGDFA ET T V+ V
Sbjct: 193 SSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNV 252

Query: 545 ALGCGHDNEXXXXXXXXXXXXXXXXXXXPTQLK-QEFSYCL 664
A+GCGH NE P+QL FSYCL
Sbjct: 253 AVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCL 293


>tr|A7PGQ3|A7PGQ3_VITVI Chromosome chr17 scaffold_16, whole genome
shotgun sequence OS=Vitis vinifera GN=GSVIVT00017643001
PE=4 SV=1
Length = 496

Score = 203 bits (517), Expect = 7e-51
Identities = 99/202 (49%), Positives = 130/202 (64%), Gaps = 2/202 (0%)
Frame = +2

Query: 65 RLVRDSARVKGIEERLEVARMGANTAEWKPANVDGLSQEEISSPLVSGLSQGSGEYFARF 244
RL RDSARVK I +L++A G + ++ P + + L ++ S+P+ SG SQGSGEYF R
Sbjct: 105 RLARDSARVKAINTKLQLAVSGTDKSDLVPMDTEILHPQDFSTPVTSGTSQGSGEYFLRV 164

Query: 245 GVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSCSDALCSQLL 424
G+G P K Y VIDTGSD++W QC PC+ CY+Q DPIF+PA SSS++ + C C L
Sbjct: 165 GIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQCRNLD 224

Query: 425 VRGCQRNQCLYQVSYGDGSFTVGDFAVETFTF-SGNQVERVALGCGHDNEXXXXXXXXXX 601
V C+ + CLYQVSYGDGS+TVGDFA ET +F + V++VA+GCGHDNE
Sbjct: 225 VFACRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGSVDKVAIGCGHDNEGLFVGAAGLI 284

Query: 602 XXXXXXXXXPTQLK-QEFSYCL 664
+Q+K FSYCL
Sbjct: 285 GLGGGPLSLTSQIKASSFSYCL 306


>tr|Q9LS40|Q9LS40_ARATH CND41, chloroplast nucleoid DNA binding
protein-like (Putative chloroplast nucleoid DNA-binding
protein) OS=Arabidopsis thaliana GN=At3g18490 PE=2 SV=1
Length = 500

Score = 203 bits (516), Expect = 9e-51
Identities = 103/204 (50%), Positives = 132/204 (64%), Gaps = 4/204 (1%)
Frame = +2

Query: 65 RLVRDSARVKGIEERLEVARMGANTAEWKPA-NVDGLSQ-EEISSPLVSGLSQGSGEYFA 238
RL RDS+RV GI ++ A G + ++ KP N D Q E++++P+VSG SQGSGEYF+
Sbjct: 105 RLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFS 164

Query: 239 RFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSCSDALCSQ 418
R GVGTP KE Y V+DTGSD++W QC PC CY+Q+DP+FNP SS+Y ++CS CS
Sbjct: 165 RIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSL 224

Query: 419 LLVRGCQRNQCLYQVSYGDGSFTVGDFAVETFTF-SGNQVERVALGCGHDNEXXXXXXXX 595
L C+ N+CLYQVSYGDGSFTVG+ A +T TF + ++ VALGCGHDNE
Sbjct: 225 LETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAG 284

Query: 596 XXXXXXXXXXXPTQLK-QEFSYCL 664
Q+K FSYCL
Sbjct: 285 LLGLGGGVLSITNQMKATSFSYCL 308


>tr|Q8RXI1|Q8RXI1_ARATH Putative chloroplast nucleoid DNA-binding
protein OS=Arabidopsis thaliana GN=At3g18490 PE=1 SV=1
Length = 500

Score = 202 bits (513), Expect = 2e-50
Identities = 102/204 (50%), Positives = 132/204 (64%), Gaps = 4/204 (1%)
Frame = +2

Query: 65 RLVRDSARVKGIEERLEVARMGANTAEWKPA-NVDGLSQ-EEISSPLVSGLSQGSGEYFA 238
RL RDS+RV GI ++ A G + ++ KP N D Q E++++P+VSG SQGSGEYF+
Sbjct: 105 RLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFS 164

Query: 239 RFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSCSDALCSQ 418
R GVGTP K+ Y V+DTGSD++W QC PC CY+Q+DP+FNP SS+Y ++CS CS
Sbjct: 165 RIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSL 224

Query: 419 LLVRGCQRNQCLYQVSYGDGSFTVGDFAVETFTF-SGNQVERVALGCGHDNEXXXXXXXX 595
L C+ N+CLYQVSYGDGSFTVG+ A +T TF + ++ VALGCGHDNE
Sbjct: 225 LETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAG 284

Query: 596 XXXXXXXXXXXPTQLK-QEFSYCL 664
Q+K FSYCL
Sbjct: 285 LLGLGGGVLSITNQMKATSFSYCL 308


>tr|A9PCK2|A9PCK2_POPTR Putative uncharacterized protein OS=Populus
trichocarpa PE=2 SV=1
Length = 499

Score = 196 bits (498), Expect = 1e-48
Identities = 96/202 (47%), Positives = 126/202 (62%), Gaps = 2/202 (0%)
Frame = +2

Query: 65 RLVRDSARVKGIEERLEVARMGANTAEWKPANVDGLSQEEISSPLVSGLSQGSGEYFARF 244
RL RD+ R + RL++A + ++ KP + + E++S+P+ SG SQGSGEYF R
Sbjct: 107 RLHRDTVRFNSLTARLQLALEDISKSDLKPLETE-IKPEDLSTPVTSGTSQGSGEYFTRV 165

Query: 245 GVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSCSDALCSQLL 424
GVG P ++ Y V+DTGSD++W QC PC CY+QTDPIF+P SS+Y PV+C CS L
Sbjct: 166 GVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQCSSLE 225

Query: 425 VRGCQRNQCLYQVSYGDGSFTVGDFAVETFTF-SGNQVERVALGCGHDNEXXXXXXXXXX 601
+ C+ QCLYQV+YGDGS+T GDFA E+ +F + V+ VALGCGHDNE
Sbjct: 226 MSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNVALGCGHDNEGLFVGAAGLL 285

Query: 602 XXXXXXXXXPTQLK-QEFSYCL 664
QLK FSYCL
Sbjct: 286 GLGGGPLSLTNQLKATSFSYCL 307


>tr|A9NV19|A9NV19_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 496

Score = 192 bits (488), Expect = 2e-47
Identities = 94/212 (44%), Positives = 123/212 (58%), Gaps = 4/212 (1%)
Frame = +2

Query: 41 AFTEAFSQRLVRDSARVKGIEERLEVARMGANTAEWKPANVDGLSQEEISSPLVSGLSQG 220
++ ++L R++ARV+ +E+R+E NV G++ E S +VSG+ QG
Sbjct: 92 SYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYENVAGVTAE-FGSEVVSGMEQG 150

Query: 221 SGEYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSCS 400
SGEYF R G+GTP +E Y V+DTGSD+ W QC PC CY Q DPIFNP+ S S++ V C
Sbjct: 151 SGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCD 210

Query: 401 DALCSQLLVRGCQRNQCLYQVSYGDGSFTVGDFAVETFTFSGNQVERVALGCGHDNEXXX 580
A+CSQL C CLY+VSYGDGS+TVG +A ET TF ++ VA+GCGHDN
Sbjct: 211 SAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCGHDNVGLF 270

Query: 581 XXXXXXXXXXXXXXXXPTQLKQE----FSYCL 664
P QL + FSYCL
Sbjct: 271 VGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCL 302


>tr|Q9LNJ3|Q9LNJ3_ARATH F6F3.10 protein (At1g01300) (Chloroplast
nucleoid DNA binding protein, putative) OS=Arabidopsis
thaliana GN=At1g01300 PE=2 SV=1
Length = 485

Score = 190 bits (483), Expect = 6e-47
Identities = 108/212 (50%), Positives = 123/212 (58%), Gaps = 7/212 (3%)
Frame = +2

Query: 50 EAFSQRLVRDSARVKGIEERLEVARMGAN-TAEWKPANVDGLSQEEISSPLVSGLSQGSG 226
E FS RL RDS RVK I L G N T +P SS +VSGLSQGSG
Sbjct: 90 ELFSSRLQRDSRRVKSIAT-LAAQIPGRNVTHAPRPGG--------FSSSVVSGLSQGSG 140

Query: 227 EYFARFGVGTPPKESYFVIDTGSDLSWTQCAPCNSCYKQTDPIFNPADSSSYNPVSCSDA 406
EYF R GVGTP + Y V+DTGSD+ W QCAPC CY Q+DPIF+P S +Y + CS
Sbjct: 141 EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSP 200

Query: 407 LCSQLLVRGC--QRNQCLYQVSYGDGSFTVGDFAVETFTFSGNQVERVALGCGHDNEXXX 580
C +L GC +R CLYQVSYGDGSFTVGDF+ ET TF N+V+ VALGCGHDNE
Sbjct: 201 HCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGCGHDNEGLF 260

Query: 581 XXXXXXXXXXXXXXXXPTQ----LKQEFSYCL 664
P Q Q+FSYCL
Sbjct: 261 VGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCL 292