DK950847
Clone id TST38A01NGRL0009_M16
Library
Length 663
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0009_M16. 5' end sequence.
Accession DK950847
Tissue type prothallia
Developmental stage gametophyte
Contig ID -
Sequence
GTGGAATAACGGGAAGCTTTGCCTTCAAACCAAGATCCTTGTTAAGTGGAGAATCAACAG
CATTCTATATTGCATCTCCAGCAGAGGACAAGCTGCCAAAGGACTCGCAAATAGGTACTT
CGCTTCTTGGAAAAATAACATATGGAAGAATTCAACTGCACAGTACCAAGGATGAGAAGA
ATGGCCAAGTTTGCCCAGCTTCTTATCCATTGACATGCATAGTCCCTCCAACAGCAAAGA
ACGAAGAGAAACAGAAAGAAAAGGATACAGGAAAGAAGACTCTTGCACAAAGTTTTGAGG
AGCAGGTACGAGATGCCAAGATAAAAGCATTGTCAAATCTTTCAAGAGGGACCCCAGAGG
AACGTAAAGAGTGGAACGATCTGGCTTCAGCCCTCAAGCTCGAGTTTCCAAAAAATTTAA
AGCTTTTGCATGAAGTTTTAATCAAGGTCTCTGCAACTCAAAAGAAAGAAGACGGAAAGG
ATGCTATCAATGAGGTTATTGATGCAGCAGATCAAGTAATTGATGTTATTGATAAAGAGG
CACTGGCCAAGTTCTTCGGAATGAAATCTGTAGCCGAAGAACCAGAAGCAATGAAAATTG
ATCAAGAAATGGAGAAGCAGCACGATGCATTGGTGGATGCTCTGTACAAAAAAGGCCTTG
CAT
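
Note: most of the BLASTX alignments in this record are reported in Frame = +3, meaning the query is translated on the forward strand starting at nucleotide 3. The Python sketch below is illustrative only and is not part of the original record. It translates the first 60 nt line of the sequence in that frame using the standard genetic code. The function name translate_forward_frame and the table-building approach are assumptions made for this example; unknown codons are rendered as "X".

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (64 entries).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_forward_frame(dna, frame):
    """Translate a DNA string in forward frame 1, 2 or 3 (hypothetical helper)."""
    if frame not in (1, 2, 3):
        raise ValueError("frame must be 1, 2 or 3")
    start = frame - 1  # frame +3 starts at 0-based index 2
    dna = dna.upper()
    # Stop before any trailing partial codon.
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")
        for i in range(start, len(dna) - 2, 3)
    )


# First sequence line of this record (nucleotides 1-60).
first_line = "GTGGAATAACGGGAAGCTTTGCCTTCAAACCAAGATCCTTGTTAAGTGGAGAATCAACAG"
print(translate_forward_frame(first_line, 3))  # GITGSFAFKPRSLLSGEST
```

Residues 2 onward of this translation (ITGSFAFKPRSLLSGEST...) match the start of the query line in the alignments that begin at query position 6.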
Homology search results
sp_hit_id P29144
Definition sp|P29144|TPP2_HUMAN Tripeptidyl-peptidase 2 OS=Homo sapiens
Align length 202
Score (bit) 80.5
E-value 7.0e-15
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK950847|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0009_M16, 5'
(663 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Sequences producing significant alignments:    Score (bits)    E value

sp|P29144|TPP2_HUMAN Tripeptidyl-peptidase 2 OS=Homo sapiens GN=... 80 7e-15
sp|Q64560|TPP2_RAT Tripeptidyl-peptidase 2 OS=Rattus norvegicus ... 80 1e-14
sp|A5PK39|TPP2_BOVIN Tripeptidyl-peptidase 2 OS=Bos taurus GN=TP... 80 1e-14
sp|Q64514|TPP2_MOUSE Tripeptidyl-peptidase 2 OS=Mus musculus GN=... 75 3e-13
sp|Q9V6K1|TPP2_DROME Tripeptidyl-peptidase 2 OS=Drosophila melan... 40 0.011
sp|Q0I6V9|NU1C_SYNS3 NAD(P)H-quinone oxidoreductase subunit 1 OS... 37 0.072
sp|Q7VE30|NU1C_PROMA NAD(P)H-quinone oxidoreductase subunit 1 OS... 37 0.094
sp|Q7V4D7|NU1C_PROMM NAD(P)H-quinone oxidoreductase subunit 1 OS... 35 0.36
sp|A2CD61|NU1C_PROM3 NAD(P)H-quinone oxidoreductase subunit 1 OS... 35 0.36
sp|A5GP52|NU1C_SYNPW NAD(P)H-quinone oxidoreductase subunit 1 OS... 35 0.47
sp|P87377|VEGT_XENLA T-box protein VEGT OS=Xenopus laevis GN=veg... 34 0.60
sp|Q640E9|WIBG_XENLA Protein wibg homolog OS=Xenopus laevis GN=w... 34 0.79
sp|O77215|UNC4_DROME Homeobox protein unc-4 OS=Drosophila melano... 34 0.79
sp|B0YPP2|CEMA_ANEMR Plastid envelope membrane protein OS=Aneura... 34 0.79
sp|Q46HL3|NU1C_PROMT NAD(P)H-quinone oxidoreductase subunit 1 OS... 33 1.0
sp|A2BZY7|NU1C_PROM1 NAD(P)H-quinone oxidoreductase subunit 1 OS... 33 1.0
sp|Q3AGY8|NU1C_SYNSC NAD(P)H-quinone oxidoreductase subunit 1 OS... 33 1.4
sp|Q7SXU0|EI3JA_DANRE Eukaryotic translation initiation factor 3... 33 1.8
sp|Q9P4Z1|TOM1_NEUCR E3 ubiquitin-protein ligase TOM1-like OS=Ne... 32 2.3
sp|Q09006|STM1A_XENLA Stathmin-1-A OS=Xenopus laevis GN=stmn1-A ... 32 2.3
sp|P14873|MAP1B_MOUSE Microtubule-associated protein 1B OS=Mus m... 32 2.3
sp|Q7U402|NU1C_SYNPX NAD(P)H-quinone oxidoreductase subunit 1 OS... 32 2.3
sp|Q24595|XPC_DROME DNA repair protein complementing XP-C cells ... 32 3.0
sp|P48035|FABP4_BOVIN Fatty acid-binding protein, adipocyte OS=B... 32 3.0
sp|P58132|RPOC2_ASTLO DNA-directed RNA polymerase subunit beta''... 32 3.9
sp|P03169|VIE1_HCMVT 55 kDa immediate-early protein 1 OS=Human c... 31 5.1
sp|P13202|VIE1_HCMVA 55 kDa immediate-early protein 1 OS=Human c... 31 5.1
sp|Q8VZ42|SAP11_ARATH Zinc finger AN1 and C2H2 domain-containing... 31 5.1
sp|Q6FN49|HSE1_CANGA Class E vacuolar protein-sorting machinery ... 31 5.1
sp|Q09874|YAGB_SCHPO Uncharacterized protein C12G12.11c OS=Schiz... 31 6.7
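
Note: each summary line above ends with two numeric columns, the bit score and the E-value, and starts with a db|accession|entry identifier. The Python sketch below is illustrative only and is not part of the original report. It shows one way such lines could be split into fields and filtered by E-value. The function name parse_hit_line and the 1e-3 cutoff are assumptions chosen for the example, not values taken from this record.

```python
def parse_hit_line(line):
    """Split one BLAST summary line into fields (hypothetical helper).

    Assumes the first token is 'db|accession|entry' and the last two
    tokens are the bit score and the E-value, as in the table above.
    """
    parts = line.split()
    db, accession, entry = parts[0].split("|")
    return {
        "db": db,
        "accession": accession,
        "entry": entry,
        "description": " ".join(parts[1:-2]),
        "bits": float(parts[-2]),
        "evalue": float(parts[-1]),
    }


# Three lines copied from the summary table above.
sample = [
    "sp|P29144|TPP2_HUMAN Tripeptidyl-peptidase 2 OS=Homo sapiens GN=... 80 7e-15",
    "sp|Q9V6K1|TPP2_DROME Tripeptidyl-peptidase 2 OS=Drosophila melan... 40 0.011",
    "sp|Q0I6V9|NU1C_SYNS3 NAD(P)H-quinone oxidoreductase subunit 1 OS... 37 0.072",
]

hits = [parse_hit_line(line) for line in sample]
# Illustrative cutoff only; an appropriate threshold depends on the analysis.
strong = [h["entry"] for h in hits if h["evalue"] < 1e-3]
print(strong)  # ['TPP2_HUMAN']
```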

>sp|P29144|TPP2_HUMAN Tripeptidyl-peptidase 2 OS=Homo sapiens GN=TPP2
PE=1 SV=4
Length = 1249

Score = 80.5 bits (197), Expect = 7e-15
Identities = 66/202 (32%), Positives = 97/202 (48%), Gaps = 2/202 (0%)
Frame = +3

Query: 63 FYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNGQVCPASYPLTCIVPPT-AK 239
F++ S +DK+PK + G L G +T + +L K V P Y L I PPT K
Sbjct: 949 FFVTSLPDDKIPKGAGPGCYLAGSLTLSKTELG-----KKADVIPVHYYL--IPPPTKTK 1001

Query: 240 NEEKQKEKDTGK-KTLAQSFEEQVRDAKIKALSNLSRGTPEERKEWNDLASALKLEFPKN 416
N K KEKD+ K K L + F E +RD KI+ ++ L +D+ + LK +P
Sbjct: 1002 NGSKDKEKDSEKEKDLKEEFTEALRDLKIQWMTKLDS---------SDIYNELKETYPNY 1052

Query: 417 LKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEALAKFFGMKSVAEEPEAMK 596
L L L ++ A +++ +NE + ALA + MK+ P+A
Sbjct: 1053 LPLYVARLHQLDAEKER---MKRLNEIVDAANAVISHIDQTALAVYIAMKT-DPRPDAAT 1108

Query: 597 IDQEMEKQHDALVDALYKKGLA 662
I +M+KQ LVDAL +KG A
Sbjct: 1109 IKNDMDKQKSTLVDALCRKGCA 1130


>sp|Q64560|TPP2_RAT Tripeptidyl-peptidase 2 OS=Rattus norvegicus
GN=Tpp2 PE=2 SV=3
Length = 1249

Score = 79.7 bits (195), Expect = 1e-14
Identities = 66/202 (32%), Positives = 96/202 (47%), Gaps = 2/202 (0%)
Frame = +3

Query: 63 FYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNGQVCPASYPLTCIVPPT-AK 239
F++ S +DK+PK + G L G +T + +L K V P Y L I PPT K
Sbjct: 949 FFVTSLPDDKIPKGAGPGCYLAGSLTLSKTELG-----KKADVIPVHYYL--IPPPTKTK 1001

Query: 240 NEEKQKEKDTGK-KTLAQSFEEQVRDAKIKALSNLSRGTPEERKEWNDLASALKLEFPKN 416
N K KEKD+ K K L + F E +RD KI+ ++ L D+ + LK +P
Sbjct: 1002 NGSKDKEKDSEKEKDLKEEFTEALRDLKIQWMTKLDS---------TDIYNELKETYPAY 1052

Query: 417 LKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEALAKFFGMKSVAEEPEAMK 596
L L L ++ A +++ +NE + ALA + MK+ P+A
Sbjct: 1053 LPLYVARLHQLDAEKER---MKRLNEIVDAANAVISHIDQTALAVYIAMKT-DPRPDAAT 1108

Query: 597 IDQEMEKQHDALVDALYKKGLA 662
I +M+KQ LVDAL +KG A
Sbjct: 1109 IKNDMDKQKSTLVDALCRKGCA 1130


>sp|A5PK39|TPP2_BOVIN Tripeptidyl-peptidase 2 OS=Bos taurus GN=TPP2
PE=2 SV=1
Length = 1249

Score = 79.7 bits (195), Expect = 1e-14
Identities = 66/202 (32%), Positives = 97/202 (48%), Gaps = 2/202 (0%)
Frame = +3

Query: 63 FYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNGQVCPASYPLTCIVPPT-AK 239
F++ S +DK+PK + G L G +T + +L K V P Y L I PPT K
Sbjct: 949 FFVTSLPDDKIPKGAGPGCYLTGSLTLSKTELG-----KKADVIPVHYYL--ISPPTKTK 1001

Query: 240 NEEKQKEKDTGK-KTLAQSFEEQVRDAKIKALSNLSRGTPEERKEWNDLASALKLEFPKN 416
N K KEKD+ K K L + F E +RD KI+ ++ L +D+ + LK +P
Sbjct: 1002 NGSKDKEKDSEKEKDLKEEFTEALRDLKIQWMTKLDS---------SDIYNELKETYPNY 1052

Query: 417 LKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEALAKFFGMKSVAEEPEAMK 596
L L L ++ A +++ +NE + ALA + MK+ P+A
Sbjct: 1053 LPLYVARLHQLDAEKER---MKRLNEIVEAANAVISHIDQTALAVYIAMKT-DPRPDAAI 1108

Query: 597 IDQEMEKQHDALVDALYKKGLA 662
I +M+KQ LVDAL +KG A
Sbjct: 1109 IKNDMDKQKSTLVDALCRKGCA 1130


>sp|Q64514|TPP2_MOUSE Tripeptidyl-peptidase 2 OS=Mus musculus GN=Tpp2
PE=2 SV=3
Length = 1262

Score = 75.1 bits (183), Expect = 3e-13
Identities = 64/210 (30%), Positives = 97/210 (46%), Gaps = 10/210 (4%)
Frame = +3

Query: 63 FYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNGQ--------VCPASYPLTC 218
F++ S +DK+PK + G L G +T + +L + + V P Y L
Sbjct: 949 FFVTSLPDDKIPKGAGPGCYLAGSLTLSKTELGKKAGQSAAKRQGKFKKDVIPVHYYL-- 1006

Query: 219 IVPPTA-KNEEKQKEKDTGK-KTLAQSFEEQVRDAKIKALSNLSRGTPEERKEWNDLASA 392
I PPT KN K KEKD+ K K L + F E +RD KI+ ++ L D+ +
Sbjct: 1007 IPPPTKIKNGSKDKEKDSEKEKDLKEEFTEALRDLKIQWMTKLDS---------TDIYNE 1057

Query: 393 LKLEFPKNLKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEALAKFFGMKSV 572
LK +P L L L ++ A +++ +NE + ALA + MK+
Sbjct: 1058 LKETYPAYLPLYVARLHQLDAEKER---MKRLNEIVDAANAVISHIDQTALAVYIAMKT- 1113

Query: 573 AEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
P+A I +M+KQ L+DAL +KG A
Sbjct: 1114 DPRPDAATIKNDMDKQKSTLIDALCRKGCA 1143


>sp|Q9V6K1|TPP2_DROME Tripeptidyl-peptidase 2 OS=Drosophila
melanogaster GN=TppII PE=1 SV=2
Length = 1441

Score = 40.0 bits (92), Expect = 0.011
Identities = 33/101 (32%), Positives = 47/101 (46%), Gaps = 3/101 (2%)
Frame = +3

Query: 369 EWNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGK---DAINEXXXXXXXXXXXXXKE 539
E N L S L L F N + SA ++KED K A+ E
Sbjct: 1239 ESNQLKSQLPLTFV-NAQKTSPPEAGESADKQKEDQKKVRSALERIVKLADKVIQETDSE 1297

Query: 540 ALAKFFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
AL ++G+K+ +A KI M+KQ + L++AL KKG+A
Sbjct: 1298 ALLSYYGLKNDTRA-DAAKIKTNMDKQKNTLIEALSKKGIA 1337


>sp|Q0I6V9|NU1C_SYNS3 NAD(P)H-quinone oxidoreductase subunit 1
OS=Synechococcus sp. (strain CC9311) GN=ndhA PE=3 SV=1
Length = 372

Score = 37.4 bits (85), Expect = 0.072
Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 9/83 (10%)
Frame = -1

Query: 432 HAKALNFLETRA*GLK--------PDRS-TLYVPLGSLLKDLTMLLSWHLVPAPQNFVQE 280
+A AL L+ A GLK PDR+ ++ LG +L + ++LSW +VP QN +
Sbjct: 67 YAGALGVLQPLADGLKLLVKEDIIPDRADSILFTLGPVLVVVPVILSWLIVPFGQNLLIS 126

Query: 279 SSFLYPFLSVSLRSLLLEGLCMS 211
+ FL +SL S+ GL MS
Sbjct: 127 DVGVGIFLWISLSSVQPIGLLMS 149


>sp|Q7VE30|NU1C_PROMA NAD(P)H-quinone oxidoreductase subunit 1
OS=Prochlorococcus marinus GN=ndhA PE=3 SV=2
Length = 372

Score = 37.0 bits (84), Expect = 0.094
Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 9/83 (10%)
Frame = -1

Query: 432 HAKALNFLETRA*GLK--------PDRS-TLYVPLGSLLKDLTMLLSWHLVPAPQNFVQE 280
+A AL L+ A GLK PD++ L LG +L + ++LSW ++P QN +
Sbjct: 67 YAGALGILQPMADGLKLLVKEDIIPDKADNLLFTLGPVLVLIPVILSWLIIPFGQNLLIS 126

Query: 279 SSFLYPFLSVSLRSLLLEGLCMS 211
+ + FL ++L S+ GL MS
Sbjct: 127 NVGIGIFLWIALSSIQPIGLLMS 149


>sp|Q7V4D7|NU1C_PROMM NAD(P)H-quinone oxidoreductase subunit 1
OS=Prochlorococcus marinus (strain MIT 9313) GN=ndhA
PE=3 SV=2
Length = 372

Score = 35.0 bits (79), Expect = 0.36
Identities = 30/83 (36%), Positives = 43/83 (51%), Gaps = 9/83 (10%)
Frame = -1

Query: 432 HAKALNFLETRA*GLK--------PDRST-LYVPLGSLLKDLTMLLSWHLVPAPQNFVQE 280
+A AL L+ A GLK P R+ L LG +L + ++LSW +VP QN +
Sbjct: 67 YAGALGVLQPMADGLKLLVKEDVIPVRADGLLFTLGPVLVLVPVILSWLIVPFGQNLLIS 126

Query: 279 SSFLYPFLSVSLRSLLLEGLCMS 211
+ + FL +SL S+ GL MS
Sbjct: 127 NVGIGIFLWISLSSIQPIGLLMS 149


>sp|A2CD61|NU1C_PROM3 NAD(P)H-quinone oxidoreductase subunit 1
OS=Prochlorococcus marinus (strain MIT 9303) GN=ndhA
PE=3 SV=2
Length = 372

Score = 35.0 bits (79), Expect = 0.36
Identities = 30/83 (36%), Positives = 43/83 (51%), Gaps = 9/83 (10%)
Frame = -1

Query: 432 HAKALNFLETRA*GLK--------PDRST-LYVPLGSLLKDLTMLLSWHLVPAPQNFVQE 280
+A AL L+ A GLK P R+ L LG +L + ++LSW +VP QN +
Sbjct: 67 YAGALGVLQPMADGLKLLVKEDVIPVRADGLLFTLGPVLVLVPVILSWLIVPFGQNLLIS 126

Query: 279 SSFLYPFLSVSLRSLLLEGLCMS 211
+ + FL +SL S+ GL MS
Sbjct: 127 NVGIGIFLWISLSSIQPIGLLMS 149


>sp|A5GP52|NU1C_SYNPW NAD(P)H-quinone oxidoreductase subunit 1
OS=Synechococcus sp. (strain WH7803) GN=ndhA PE=3 SV=2
Length = 372

Score = 34.7 bits (78), Expect = 0.47
Identities = 30/83 (36%), Positives = 43/83 (51%), Gaps = 9/83 (10%)
Frame = -1

Query: 432 HAKALNFLETRA*GLK--------PDRST-LYVPLGSLLKDLTMLLSWHLVPAPQNFVQE 280
+A AL L+ A GLK P R+ L LG +L + ++LSW +VP QN +
Sbjct: 67 YAGALGVLQPLADGLKLLVKEDIIPARADGLLFTLGPVLVVVPVILSWLIVPFGQNLLIS 126

Query: 279 SSFLYPFLSVSLRSLLLEGLCMS 211
+ + FL +SL S+ GL MS
Sbjct: 127 NVGVGIFLWISLSSVQPIGLLMS 149


tr_hit_id A9RCQ9
Definition tr|A9RCQ9|A9RCQ9_PHYPA Predicted protein OS=Physcomitrella patens subsp. patens
Align length 221
Score (bit) 185.0
E-value 2.0e-45
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= DK950847|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0009_M16, 5'
(663 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Sequences producing significant alignments:    Score (bits)    E value

tr|A9RCQ9|A9RCQ9_PHYPA Predicted protein OS=Physcomitrella paten... 185 2e-45
tr|A9TSA0|A9TSA0_PHYPA Predicted protein OS=Physcomitrella paten... 160 5e-38
tr|A7QXF8|A7QXF8_VITVI Chromosome undetermined scaffold_222, who... 151 4e-35
tr|Q6ESI7|Q6ESI7_ORYSJ Os02g0664300 protein OS=Oryza sativa subs... 146 1e-33
tr|A3A9W2|A3A9W2_ORYSJ Putative uncharacterized protein OS=Oryza... 146 1e-33
tr|B8AGD0|B8AGD0_ORYSI Putative uncharacterized protein OS=Oryza... 146 1e-33
tr|Q8L640|Q8L640_ARATH Putative uncharacterized protein At4g2085... 141 4e-32
tr|Q9SUC7|Q9SUC7_ARATH Putative uncharacterized protein AT4g2085... 126 1e-27
tr|A9U481|A9U481_PHYPA Predicted protein OS=Physcomitrella paten... 91 8e-17
tr|A4A0H0|A4A0H0_9PLAN Pyrolysin OS=Blastopirellula marina DSM 3... 84 6e-15
tr|Q5D072|Q5D072_MOUSE Tpp2 protein OS=Mus musculus GN=Tpp2 PE=2... 79 2e-13
tr|Q3U4M7|Q3U4M7_MOUSE Putative uncharacterized protein OS=Mus m... 79 2e-13
tr|Q3TW28|Q3TW28_MOUSE Putative uncharacterized protein OS=Mus m... 79 2e-13
tr|Q5VZU9|Q5VZU9_HUMAN Tripeptidyl peptidase II OS=Homo sapiens ... 77 1e-12
tr|Q922K4|Q922K4_MOUSE Tpp2 protein (Fragment) OS=Mus musculus G... 75 4e-12
tr|B7FXC9|B7FXC9_PHATR Predicted protein OS=Phaeodactylum tricor... 74 8e-12
tr|Q6GQZ3|Q6GQZ3_XENLA MGC83244 protein OS=Xenopus laevis GN=tpp... 72 4e-11
tr|Q93WW2|Q93WW2_MUSAC Putative uncharacterized protein (Fragmen... 71 7e-11
tr|Q6ESI6|Q6ESI6_ORYSJ Putative tripeptidyl peptidase II OS=Oryz... 66 2e-09
tr|Q4SHY6|Q4SHY6_TETNG Chromosome 5 SCAF14581, whole genome shot... 63 1e-08
tr|B7Z920|B7Z920_HUMAN cDNA FLJ61714, highly similar to Tripepti... 57 1e-06
tr|B6LJQ7|B6LJQ7_BRAFL Putative uncharacterized protein OS=Branc... 53 1e-05
tr|B6LJS5|B6LJS5_BRAFL Putative uncharacterized protein OS=Branc... 52 3e-05
tr|B7PQH8|B7PQH8_IXOSC Tripeptidyl-peptidase II, putative OS=Ixo... 43 0.015
tr|B4GB55|B4GB55_DROPE GL10584 OS=Drosophila persimilis GN=GL105... 42 0.026
tr|Q290W3|Q290W3_DROPS GA24414 OS=Drosophila pseudoobscura pseud... 42 0.034
tr|Q3TB11|Q3TB11_MOUSE Putative uncharacterized protein (Fragmen... 41 0.075
tr|B4QDY3|B4QDY3_DROSI GD25789 OS=Drosophila simulans GN=GD25789... 39 0.29
tr|B4LMI6|B4LMI6_DROVI GJ21137 OS=Drosophila virilis GN=GJ21137 ... 39 0.37
tr|B4P4U7|B4P4U7_DROYA GE13397 OS=Drosophila yakuba GN=GE13397 P... 38 0.49

>tr|A9RCQ9|A9RCQ9_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_111640 PE=4 SV=1
Length = 1293

Score = 185 bits (469), Expect = 2e-45
Identities = 98/221 (44%), Positives = 140/221 (63%), Gaps = 2/221 (0%)
Frame = +3

Query: 6 ITGSFAFKPRSLLSGESTAFYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNG 185
ITG FK L +GE+ FYI +P +D++PKD+ G+ LLG+I+YG + + + G
Sbjct: 958 ITGGGPFKSALLAAGETRPFYIVAPTDDRIPKDATPGSLLLGEISYGEVSV-GNRGGNGG 1016

Query: 186 QVCPASYPLTCIVPPTAKNEEKQKEKDTG-KKTLAQSFEEQVRDAKIKALSNLSRGTPEE 362
CP+ +T +VPP K+E+K KEK+ G KK ++ + EE+VRDAKIK LS+ T EE
Sbjct: 1017 AACPSKARITFVVPPPPKSEDKAKEKEDGVKKNVSDTLEEEVRDAKIKVLSSFLLSTKEE 1076

Query: 363 RKEWNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGK-DAINEXXXXXXXXXXXXXKE 539
R+EW LA + K ++PK+L+L+ E+L KVS Q KE+ K + + +
Sbjct: 1077 REEWEKLAESFKADYPKHLQLMIEILNKVSGLQGKEEEKFSVVKQIIEAADDVIRLVNTD 1136

Query: 540 ALAKFFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
+LA+ F MK AE+ +A K+ +EMEKQ DAL DALYKKGLA
Sbjct: 1137 SLARHFAMKCEAEDADAAKVKKEMEKQRDALADALYKKGLA 1177


>tr|A9TSA0|A9TSA0_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_149765 PE=4 SV=1
Length = 1192

Score = 160 bits (406), Expect = 5e-38
Identities = 88/219 (40%), Positives = 138/219 (63%)
Frame = +3

Query: 6 ITGSFAFKPRSLLSGESTAFYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNG 185
ITG+ F L +GES FYI +PA++K+PK++ +G+ LLG+ITYG+++ ++
Sbjct: 961 ITGAEPFTNVQLAAGESRPFYIVAPADEKIPKEATLGSVLLGEITYGKVESDNS------ 1014

Query: 186 QVCPASYPLTCIVPPTAKNEEKQKEKDTGKKTLAQSFEEQVRDAKIKALSNLSRGTPEER 365
PA ++ +VPP K E KE+D KKT++Q+ +E+VR+AKIK LS+LS T EE
Sbjct: 1015 ---PAQSTISFVVPPPPK--ENPKEEDDTKKTVSQALDEEVRNAKIKVLSSLSLETKEEL 1069

Query: 366 KEWNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEAL 545
++W LA +LK+ +P L+L+ E+L K+ +Q + K ++ + L
Sbjct: 1070 EDWERLADSLKVNYPNYLQLMVEILNKMYGSQGIGEAKFSVAKVIKAADNVIRLVDTGDL 1129

Query: 546 AKFFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
A++F MK+ +E+ A K+ +EMEK+ D+L DALYKKGLA
Sbjct: 1130 ARYFSMKNESEDANAAKVRKEMEKKRDSLADALYKKGLA 1168


>tr|A7QXF8|A7QXF8_VITVI Chromosome undetermined scaffold_222, whole
genome shotgun sequence OS=Vitis vinifera
GN=GSVIVT00009157001 PE=4 SV=1
Length = 1284

Score = 151 bits (381), Expect = 4e-35
Identities = 87/220 (39%), Positives = 126/220 (57%), Gaps = 1/220 (0%)
Frame = +3

Query: 6 ITGSFAFKPRSLLSGESTAFYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNG 185
I G+ AFK L+ G +FY+ P +DKLPK+ G+ LLG I+YG + + KN
Sbjct: 935 IMGNGAFKTSVLVPGVKESFYVGPPNKDKLPKNISEGSVLLGAISYGVLSFGGEEGGKNP 994

Query: 186 QVCPASYPLTCIVPPTAKNEEKQK-EKDTGKKTLAQSFEEQVRDAKIKALSNLSRGTPEE 362
+ P SY ++ +VPP +EEK K + K++++ EE+VRDAKIK L +L GT EE
Sbjct: 995 KKNPVSYQISYLVPPNKVDEEKGKGSSPSCTKSVSERLEEEVRDAKIKILGSLKHGTDEE 1054

Query: 363 RKEWNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEA 542
R EW LA++LK E+PK LL ++L + + ED E ++
Sbjct: 1055 RSEWRKLAASLKSEYPKYTPLLAKILEGLVSESNAEDKICHDEEVIDAANEVVCSIDRDE 1114

Query: 543 LAKFFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
LAK+F +KS E+ EA K+ ++ME D L +ALY+KGLA
Sbjct: 1115 LAKYFSLKSDPEDEEAEKMKKKMETTRDQLAEALYQKGLA 1154


>tr|Q6ESI7|Q6ESI7_ORYSJ Os02g0664300 protein OS=Oryza sativa subsp.
japonica GN=P0461B08.4-1 PE=4 SV=1
Length = 1359

Score = 146 bits (369), Expect = 1e-33
Identities = 79/217 (36%), Positives = 125/217 (57%)
Frame = +3

Query: 12 GSFAFKPRSLLSGESTAFYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNGQV 191
G+ FK L+ GE AFY+ P+ +KLPK+ G+ L+G ITYG + S KD++N Q
Sbjct: 1034 GNGTFKSSILVPGEPEAFYVGPPSREKLPKNVLPGSVLVGSITYGAVSSFSKKDDQN-QH 1092

Query: 192 CPASYPLTCIVPPTAKNEEKQKEKDTGKKTLAQSFEEQVRDAKIKALSNLSRGTPEERKE 371
PASY ++ ++PP+ + +K+K +G+K++++ +++VRD KIK LS ++ T +++
Sbjct: 1093 APASYSISYLIPPSKVDNDKEKGVSSGRKSISERLDDEVRDTKIKFLSGFNQETEDDKSS 1152

Query: 372 WNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEALAK 551
W L ++LK E+PK LL ++L + +D E KE LAK
Sbjct: 1153 WTALVASLKPEYPKYTPLLAKILECIVQKATSDDKFSHQKEIIAAADEVVDSIDKEDLAK 1212

Query: 552 FFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
+K E+ EA K ++ME+ D L DALY+KGLA
Sbjct: 1213 SLSLKPDPEDEEAQKNKKKMEETRDQLADALYQKGLA 1249


>tr|A3A9W2|A3A9W2_ORYSJ Putative uncharacterized protein OS=Oryza
sativa subsp. japonica GN=OsJ_007584 PE=4 SV=1
Length = 1323

Score = 146 bits (369), Expect = 1e-33
Identities = 79/217 (36%), Positives = 125/217 (57%)
Frame = +3

Query: 12 GSFAFKPRSLLSGESTAFYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNGQV 191
G+ FK L+ GE AFY+ P+ +KLPK+ G+ L+G ITYG + S KD++N Q
Sbjct: 998 GNGTFKSSILVPGEPEAFYVGPPSREKLPKNVLPGSVLVGSITYGAVSSFSKKDDQN-QH 1056

Query: 192 CPASYPLTCIVPPTAKNEEKQKEKDTGKKTLAQSFEEQVRDAKIKALSNLSRGTPEERKE 371
PASY ++ ++PP+ + +K+K +G+K++++ +++VRD KIK LS ++ T +++
Sbjct: 1057 APASYSISYLIPPSKVDNDKEKGVSSGRKSISERLDDEVRDTKIKFLSGFNQETEDDKSS 1116

Query: 372 WNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEALAK 551
W L ++LK E+PK LL ++L + +D E KE LAK
Sbjct: 1117 WTALVASLKPEYPKYTPLLAKILECIVQKATSDDKFSHQKEIIAAADEVVDSIDKEDLAK 1176

Query: 552 FFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
+K E+ EA K ++ME+ D L DALY+KGLA
Sbjct: 1177 SLSLKPDPEDEEAQKNKKKMEETRDQLADALYQKGLA 1213


>tr|B8AGD0|B8AGD0_ORYSI Putative uncharacterized protein OS=Oryza
sativa subsp. indica GN=OsI_08378 PE=4 SV=1
Length = 1359

Score = 146 bits (368), Expect = 1e-33
Identities = 79/217 (36%), Positives = 125/217 (57%)
Frame = +3

Query: 12 GSFAFKPRSLLSGESTAFYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNGQV 191
G+ FK L+ GE AFY+ P+ +KLPK+ G+ L+G ITYG + S KD++N Q
Sbjct: 1034 GNGTFKSSILVPGEPEAFYVGPPSREKLPKNVLPGSVLVGSITYGVVSSFSKKDDQN-QH 1092

Query: 192 CPASYPLTCIVPPTAKNEEKQKEKDTGKKTLAQSFEEQVRDAKIKALSNLSRGTPEERKE 371
PASY ++ ++PP+ + +K+K +G+K++++ +++VRD KIK LS ++ T +++
Sbjct: 1093 APASYSISYLIPPSKVDNDKEKGVSSGRKSISERLDDEVRDTKIKFLSGFNQETEDDKSS 1152

Query: 372 WNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEALAK 551
W L ++LK E+PK LL ++L + +D E KE LAK
Sbjct: 1153 WTALVASLKSEYPKYTPLLAKILECIVQKATSDDKFSHQKEIIAAADEVVDSIDKEDLAK 1212

Query: 552 FFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
+K E+ EA K ++ME+ D L DALY+KGLA
Sbjct: 1213 SLSLKPDPEDEEAQKNKKKMEETRDQLADALYQKGLA 1249


>tr|Q8L640|Q8L640_ARATH Putative uncharacterized protein At4g20850
(Fragment) OS=Arabidopsis thaliana GN=At4g20850 PE=2 SV=1
Length = 1346

Score = 141 bits (355), Expect = 4e-32
Identities = 82/219 (37%), Positives = 122/219 (55%), Gaps = 1/219 (0%)
Frame = +3

Query: 9 TGSFAFKPRSLLSGESTAFYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNGQ 188
TG+ AFK L+ G AFY+ P +DKLPK++ G+ L+G+I+YG++ K+ KN +
Sbjct: 1015 TGNGAFKSSVLMPGVKEAFYLGPPTKDKLPKNTPQGSMLVGEISYGKLSFDE-KEGKNPK 1073

Query: 189 VCPASYPLTCIVPPTAKNEEKQKEK-DTGKKTLAQSFEEQVRDAKIKALSNLSRGTPEER 365
P SYP++ +VPP E+K+ T K++++ E++VRD KIK L NL + T EER
Sbjct: 1074 DNPVSYPISYVVPPNKPEEDKKAASAPTCSKSVSERLEQEVRDTKIKFLGNLKQETEEER 1133

Query: 366 KEWNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEAL 545
EW L + LK E+P LL ++L + + D E + L
Sbjct: 1134 SEWRKLCTCLKSEYPDYTPLLAKILEGLLSRSDAGDKISHHEEIIEAANEVVRSVDVDEL 1193

Query: 546 AKFFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
A+F K+ E+ EA K+ ++ME D L DALY+KGLA
Sbjct: 1194 ARFLLDKTEPEDDEAEKLKKKMEVTRDQLADALYQKGLA 1232


>tr|Q9SUC7|Q9SUC7_ARATH Putative uncharacterized protein AT4g20850
OS=Arabidopsis thaliana GN=T13K14.10 PE=4 SV=1
Length = 1396

Score = 126 bits (316), Expect = 1e-27
Identities = 77/218 (35%), Positives = 115/218 (52%)
Frame = +3

Query: 9 TGSFAFKPRSLLSGESTAFYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNGQ 188
TG+ AFK L+ G AFY+ P +DKLPK++ G+ L+G+I+YG++ DEK G+
Sbjct: 1070 TGNGAFKSSVLMPGVKEAFYLGPPTKDKLPKNTPQGSMLVGEISYGKLSF----DEKEGK 1125

Query: 189 VCPASYPLTCIVPPTAKNEEKQKEKDTGKKTLAQSFEEQVRDAKIKALSNLSRGTPEERK 368
P P + + ++K T K++++ E++VRD KIK L NL + T EER
Sbjct: 1126 N-PKDNPHRLVKLDAPEEDKKAASAPTCSKSVSERLEQEVRDTKIKFLGNLKQETEEERS 1184

Query: 369 EWNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEALA 548
EW L + LK E+P LL ++L + + D E + LA
Sbjct: 1185 EWRKLCTCLKSEYPDYTPLLAKILEGLLSRSDAGDKISHHEEIIEAANEVVRSVDVDELA 1244

Query: 549 KFFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
+F K+ E+ EA K+ ++ME D L DALY+KGLA
Sbjct: 1245 RFLLDKTEPEDDEAEKLKKKMEVTRDQLADALYQKGLA 1282


>tr|A9U481|A9U481_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_101807 PE=4 SV=1
Length = 339

Score = 90.5 bits (223), Expect = 8e-17
Identities = 49/119 (41%), Positives = 74/119 (62%)
Frame = +3

Query: 306 VRDAKIKALSNLSRGTPEERKEWNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGKDA 485
VR+AKIK LS+LS T EE ++W LA +LK+ +P L+L+ E+L K+ +Q + K +
Sbjct: 23 VRNAKIKVLSSLSLETKEELEDWERLADSLKVNYPNYLQLMVEILNKMYGSQGIGEAKFS 82

Query: 486 INEXXXXXXXXXXXXXKEALAKFFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKKGLA 662
+ + LA++F MK+ +E+ A K+ +EMEK+ D+L DALYKKGLA
Sbjct: 83 VAKVIKAADNVIRLVDTGDLARYFSMKNESEDANAAKVRKEMEKKRDSLADALYKKGLA 141


>tr|A4A0H0|A4A0H0_9PLAN Pyrolysin OS=Blastopirellula marina DSM 3645
GN=DSM3645_25512 PE=4 SV=1
Length = 1267

Score = 84.3 bits (207), Expect = 6e-15
Identities = 63/216 (29%), Positives = 100/216 (46%)
Frame = +3

Query: 6 ITGSFAFKPRSLLSGESTAFYIASPAEDKLPKDSQIGTSLLGKITYGRIQLHSTKDEKNG 185
I G+ + + GE AFY+ P+ LPK LGKI YG+ Q KD
Sbjct: 945 IRGAGKLATQKIADGEMEAFYLGRPSASSLPKTVGARDRFLGKIYYGKEQ----KDVAGS 1000

Query: 186 QVCPASYPLTCIVPPTAKNEEKQKEKDTGKKTLAQSFEEQVRDAKIKALSNLSRGTPEER 365
P + L P + KE T +K Q+ E+ D ++ L+ L++ ++
Sbjct: 1001 GRRPGGFELVYFPAP-----PEPKEAATTEKAKEQTLAEKQFDLRVTELARLAK--EKDP 1053

Query: 366 KEWNDLASALKLEFPKNLKLLHEVLIKVSATQKKEDGKDAINEXXXXXXXXXXXXXKEAL 545
E+NDLA AL E P+NL++L L ++ ++D K + E + L
Sbjct: 1054 AEFNDLADALLAEEPENLQVLQARLHQLD----RDDRKTHLPEVVDAADEIIRLIPQNKL 1109

Query: 546 AKFFGMKSVAEEPEAMKIDQEMEKQHDALVDALYKK 653
A++FG K + E + ++EM ++ D L DALY+K
Sbjct: 1110 ARYFGRKHDPKTEEEKEREKEMTRRRDLLTDALYRK 1145