DK962682
Clone id TST39A01NGRL0014_I10
Library
Length 575
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0014_I10. 5' end sequence.
Accession DK962682
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID
Sequence
CCTCTCTCTCTCTCTCTCTCTCACACACACACAGACACACACACACACGCCCGCGCGCGC
ACTCTGCAGGCTGGAGGACTTGTGTATAGTACTGTGTAGATCTCTCTCTCGCCCCTCCCC
CCACATACACACGAAGACAGAAGACTTCTCGTGTATAGTGTATCAGATCTCTGCGCCCCC
CTCCCCCCCCCTCTCTCTCTCTCTCTCTCTCCTCTGTGTGTACAGTTCTCTTCTCCTGGC
ATTTTAATTTGACAGAAAATCTATTCTTATTTATTTCACGCGGTTCCGCTCCTCCATCGC
TTAGAGAACCCGGGGTTTCCAGCTGCCGTGCAGCTTCAGAGCATTGGGCCAACTTGAAAA
TTTCATGTATTGCACATGTTAGACTCAGAGGCGTCAAAGTCGAGCAGGAGAAAGTTGAAG
CAGGGAGGAAAAGAGGGTAGAGATTTGAGGAAGACGAGGGGGAGCAGCACATGGCCTCTG
TGAGTGGAGGAGGAGGAGCCAGAAAGCCTATTATCAGCTCAAGTGTCACAGCTGGCCGAG
ACGCATACTACCTGCGGAAAGAAATGAATGACATG
Homology search results
sp_hit_id Q9P7X5
Definition sp|Q9P7X5|PPK32_SCHPO Protein kinase domain-containing protein ppk32 OS=Schizosaccharomyces pombe
Align length 42
Score (bit) 32.3
E-value 1.7
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK962682|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0014_I10, 5'
(575 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters




                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q9P7X5|PPK32_SCHPO Protein kinase domain-containing protein p...    32   1.7
sp|Q9D7I0|SHSA5_MOUSE Protein shisa-5 OS=Mus musculus GN=Shisa5 ...    32   2.3
sp|Q9EPH1|A1BG_RAT Alpha-1B-glycoprotein OS=Rattus norvegicus GN...    31   3.9
sp|Q19LI2|A1BG_MOUSE Alpha-1B-glycoprotein OS=Mus musculus GN=A1...    31   3.9
sp|Q54IT1|Y7983_DICDI Putative uncharacterized protein DDB_G0288...    31   5.1
sp|Q9BXC0|GPR81_HUMAN Probable G-protein coupled receptor 81 OS=...    30   6.6
sp|P43555|EMP47_YEAST Protein EMP47 OS=Saccharomyces cerevisiae ...    30   6.6
sp|P03204|EBNA6_EBV Epstein-Barr nuclear antigen 6 OS=Epstein-Ba...    30   8.6

>sp|Q9P7X5|PPK32_SCHPO Protein kinase domain-containing protein
ppk32 OS=Schizosaccharomyces pombe GN=ppk32 PE=1 SV=1
Length = 749

Score = 32.3 bits (72), Expect = 1.7
Identities = 16/42 (38%), Positives = 24/42 (57%)
Frame = -2

Query: 544 ASRPAVTLELIIGFLAPPPPLTEAMCCSPSSSSNLYPLFLPA 419
AS PAVT + + P L+ +PSSS++LYP +P+
Sbjct: 669 ASTPAVTAKSSFHYATPTSGLSNFNSVTPSSSASLYPPLIPS 710


>sp|Q9D7I0|SHSA5_MOUSE Protein shisa-5 OS=Mus musculus GN=Shisa5
PE=2 SV=1
Length = 235

Score = 32.0 bits (71), Expect = 2.3
Identities = 22/82 (26%), Positives = 33/82 (40%), Gaps = 4/82 (4%)
Frame = -2

Query: 544 ASRPAVTLELIIGFLAPPPPLTEAMCCSPSSSSNLYPLFLPASTFSCSTLTPLSLTCAIH 365
A P++ L++ L PPPP C P N P+F P F C + + + C
Sbjct: 3 APAPSLWTLLLLLLLLPPPPGAHGELCRPFGEDNSIPVFCP--DFCCGSCS--NQYCCSD 58

Query: 364 EIFKL----AQCSEAARQLETP 311
+ K+ C E + TP
Sbjct: 59 VLRKIQWNEEMCPEPESRFSTP 80


>sp|Q9EPH1|A1BG_RAT Alpha-1B-glycoprotein OS=Rattus norvegicus
GN=A1bg PE=2 SV=2
Length = 513

Score = 31.2 bits (69), Expect = 3.9
Identities = 18/49 (36%), Positives = 23/49 (46%)
Frame = +3

Query: 222 TVLFSWHFNLTENLFLFISRGSAPPSLREPGVSSCRAASEHWANLKISC 368
TVL W F L L++ GS P EP ++ E WANL + C
Sbjct: 6 TVLLLWGFTLGPGNALWLDSGSEPELRAEP-----QSLLEPWANLTLVC 49


>sp|Q19LI2|A1BG_MOUSE Alpha-1B-glycoprotein OS=Mus musculus GN=A1bg
PE=1 SV=1
Length = 512

Score = 31.2 bits (69), Expect = 3.9
Identities = 18/51 (35%), Positives = 23/51 (45%)
Frame = +3

Query: 216 VCTVLFSWHFNLTENLFLFISRGSAPPSLREPGVSSCRAASEHWANLKISC 368
+ TVL W F L L + GS P EP ++ E WANL + C
Sbjct: 4 LATVLLLWGFTLGPGNTLMLDSGSEPKLWAEP-----QSLLEPWANLTLVC 49


>sp|Q54IT1|Y7983_DICDI Putative uncharacterized protein DDB_G0288537
OS=Dictyostelium discoideum GN=DDB_G0288537 PE=4 SV=1
Length = 846

Score = 30.8 bits (68), Expect = 5.1
Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Frame = -2

Query: 508 GFLAPPPPLTEAMCCSPSSSSNLYPLFLP--ASTFSCSTLTPLSLTCAIHEIFK 353
G ++PP PL+ ++ SSN+ PL P +S+ S S L+P +LT + F+
Sbjct: 404 GNISPPLPLSISLGLCRGGSSNISPLPSPSLSSSSSGSALSPATLTLIMPSTFQ 457


>sp|Q9BXC0|GPR81_HUMAN Probable G-protein coupled receptor 81
OS=Homo sapiens GN=GPR81 PE=2 SV=1
Length = 346

Score = 30.4 bits (67), Expect = 6.6
Identities = 17/44 (38%), Positives = 24/44 (54%)
Frame = -1

Query: 539 SASCDT*ADNRLSGSSSSTHRGHVLLPLVFLKSLPSFPPCFNFL 408
S++CD L + S T+ +L PLV+ S PSFP +N L
Sbjct: 249 SSACDPSVHGALHITLSFTYMNSMLDPLVYYFSSPSFPKFYNKL 292


>sp|P43555|EMP47_YEAST Protein EMP47 OS=Saccharomyces cerevisiae
GN=EMP47 PE=1 SV=1
Length = 445

Score = 30.4 bits (67), Expect = 6.6
Identities = 19/50 (38%), Positives = 24/50 (48%)
Frame = +3

Query: 285 SAPPSLREPGVSSCRAASEHWANLKISCIAHVRLRGVKVEQEKVEAGRKR 434
S PP+ E G S+ A +E N K + +L V VEQEK KR
Sbjct: 350 SKPPANNEKGTSTDDAIAEDKENFKDFLSINQKLEKVLVEQEKYREATKR 399


>sp|P03204|EBNA6_EBV Epstein-Barr nuclear antigen 6 OS=Epstein-Barr
virus (strain B95-8) GN=EBNA6 PE=1 SV=1
Length = 992

Score = 30.0 bits (66), Expect = 8.6
Identities = 22/78 (28%), Positives = 31/78 (39%)
Frame = -2

Query: 505 FLAPPPPLTEAMCCSPSSSSNLYPLFLPASTFSCSTLTPLSLTCAIHEIFKLAQCSEAAR 326
F PP PL ++M SS P AS +S TPL + + ++ + S
Sbjct: 893 FPPPPMPLQDSMAVGCDSSGTACPSMPFASDYSQGAFTPLDINATTPKRPRVEESSHG-- 950

Query: 325 QLETPGSLSDGGAEPREI 272
P S AE +EI
Sbjct: 951 ----PARCSQATAEAQEI 964


tr_hit_id B6KE50
Definition tr|B6KE50|B6KE50_TOXGO Protein kinase, putative OS=Toxoplasma gondii ME49
Align length 63
Score (bit) 37.4
E-value 0.59
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= DK962682|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0014_I10, 5'
(575 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters




                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

tr|B6KE50|B6KE50_TOXGO Protein kinase, putative OS=Toxoplasma go...    37   0.59
tr|Q8X0L7|Q8X0L7_NEUCR Putative uncharacterized protein B11B23.0...    37   1.0
tr|A9U702|A9U702_PHYPA Predicted protein (Fragment) OS=Physcomit...    36   1.7
tr|Q68AN0|Q68AN0_PSOTE ABI-3 homolog OS=Psophocarpus tetragonolo...    35   3.8
tr|B3MPJ3|B3MPJ3_DROAN GF14619 OS=Drosophila ananassae GN=GF1461...    35   3.8
tr|Q2H647|Q2H647_CHAGB Putative uncharacterized protein OS=Chaet...    35   3.8
tr|B6KGP1|B6KGP1_TOXGO DNA-dependent RNA polymerase, putative OS...    34   6.5
tr|Q6K6K0|Q6K6K0_ORYSJ Putative uncharacterized protein P0047E05...    33   8.6
tr|B6ED08|B6ED08_9ROSI Putative lacasse/diphenol oxidase OS=Cast...    33   8.6
tr|Q7QI55|Q7QI55_ANOGA AGAP005291-PB OS=Anopheles gambiae GN=AGA...    33   8.6
tr|B4JJ07|B4JJ07_DROGR GH12924 OS=Drosophila grimshawi GN=GH1292...    33   8.6
tr|A0NBQ2|A0NBQ2_ANOGA AGAP005291-PA OS=Anopheles gambiae GN=AGA...    33   8.6

>tr|B6KE50|B6KE50_TOXGO Protein kinase, putative OS=Toxoplasma
gondii ME49 GN=TGME49_024480 PE=4 SV=1
Length = 1678

Score = 37.4 bits (85), Expect = 0.59
Identities = 26/63 (41%), Positives = 33/63 (52%)
Frame = -2

Query: 514 IIGFLAPPPPLTEAMCCSPSSSSNLYPLFLPASTFSCSTLTPLSLTCAIHEIFKLAQCSE 335
++ F APPP + A C P SSS LYP FL A++ S + LT S CA+ L S
Sbjct: 601 LLAFQAPPPHGSHAGCAVPYSSS-LYPPFLKANSASPALLT--SPLCAVPAAAALPAVSS 657

Query: 334 AAR 326
R
Sbjct: 658 GCR 660


>tr|Q8X0L7|Q8X0L7_NEUCR Putative uncharacterized protein B11B23.080
(Predicted protein) OS=Neurospora crassa GN=B11B23.080
PE=4 SV=1
Length = 527

Score = 36.6 bits (83), Expect = 1.0
Identities = 32/102 (31%), Positives = 46/102 (45%), Gaps = 8/102 (7%)
Frame = +3

Query: 249 LTENLFLFISRGSAPPSLREPGVSSCRAASEHWANLKI----SCIAHVRLRGVKVEQEKV 416
LT L+LF++ A SL P SCR A E ++ + +A V + E +KV
Sbjct: 412 LTATLYLFLATADAWWSL--PASDSCREAREVMRSIAEKFLRNYLALVLICETNAEAQKV 469

Query: 417 EAGRKRG*RFEEDEGEQHMASVSGGG----GARKPIISSSVT 530
E E+ + EQH A SG G G +P++ VT
Sbjct: 470 EELETNESEAEQPKAEQHKAEGSGAGESTPGDSEPVVEPLVT 511


>tr|A9U702|A9U702_PHYPA Predicted protein (Fragment)
OS=Physcomitrella patens subsp. patens
GN=PHYPADRAFT_103630 PE=4 SV=1
Length = 762

Score = 35.8 bits (81), Expect = 1.7
Identities = 28/77 (36%), Positives = 35/77 (45%), Gaps = 1/77 (1%)
Frame = +3

Query: 312 GVSSCRAASEHWANLKISCIAHVRLRGVKVEQEKVEAGRKRG*RFEE-DEGEQHMASVSG 488
G SC E + + GV+VE++ E R+ G EE G ASV G
Sbjct: 342 GAGSCGVREEVGGRCSVEDVRGECSDGVEVEKQGGERLRRTGEGEEEVGRGPAREASVDG 401

Query: 489 GGGARKPIISSSVTAGR 539
GGG P + SSV AGR
Sbjct: 402 GGGNGSP-VRSSVEAGR 417


>tr|Q68AN0|Q68AN0_PSOTE ABI-3 homolog OS=Psophocarpus tetragonolobus
GN=wbABI3 PE=2 SV=1
Length = 751

Score = 34.7 bits (78), Expect = 3.8
Identities = 20/38 (52%), Positives = 23/38 (60%)
Frame = -2

Query: 505 FLAPPPPLTEAMCCSPSSSSNLYPLFLPASTFSCSTLT 392
F A PPL + C S SSSS+ P LPA T +CST T
Sbjct: 59 FYADFPPLPDFPCMSSSSSSSSAPP-LPAKTMACSTTT 95


>tr|B3MPJ3|B3MPJ3_DROAN GF14619 OS=Drosophila ananassae GN=GF14619
PE=4 SV=1
Length = 2240

Score = 34.7 bits (78), Expect = 3.8
Identities = 20/52 (38%), Positives = 26/52 (50%)
Frame = -2

Query: 535 PAVTLELIIGFLAPPPPLTEAMCCSPSSSSNLYPLFLPASTFSCSTLTPLSL 380
P VT I PP +TEA +P+++ L P LP T TL PL+L
Sbjct: 920 PVVTAAPIPPATVVPPQVTEAPTAAPTATPTLPPQTLPPQTLPPPTLPPLTL 971


>tr|Q2H647|Q2H647_CHAGB Putative uncharacterized protein OS=Chaetomium
globosum GN=CHGG_05868 PE=4 SV=1
Length = 1190

Score = 34.7 bits (78), Expect = 3.8
Identities = 23/87 (26%), Positives = 41/87 (47%)
Frame = +3

Query: 315 VSSCRAASEHWANLKISCIAHVRLRGVKVEQEKVEAGRKRG*RFEEDEGEQHMASVSGGG 494
V + + HW K+ +A V L +++ RG R +E+ G QH V G G
Sbjct: 858 VDAALGLTRHWTLNKLQSMA-VLLPAASLDE--------RGTRLQEEGGRQHFQYVGGEG 908

Query: 495 GARKPIISSSVTAGRDAYYLRKEMNDM 575
G RK ++ + A +D + L+ ++ +
Sbjct: 909 GTRKSVV---IHAIKDMFRLKSGLHTL 932


>tr|B6KGP1|B6KGP1_TOXGO DNA-dependent RNA polymerase, putative
OS=Toxoplasma gondii ME49 GN=TGME49_046060 PE=4 SV=1
Length = 2084

Score = 33.9 bits (76), Expect = 6.5
Identities = 15/29 (51%), Positives = 17/29 (58%)
Frame = -2

Query: 490 PPLTEAMCCSPSSSSNLYPLFLPASTFSC 404
PP TEA C SP +S+ P LPA SC
Sbjct: 463 PPSTEASCASPPTSAGASPPSLPACVLSC 491


>tr|Q6K6K0|Q6K6K0_ORYSJ Putative uncharacterized protein P0047E05.26
OS=Oryza sativa subsp. japonica GN=P0047E05.26 PE=4 SV=1
Length = 190

Score = 33.5 bits (75), Expect = 8.6
Identities = 25/84 (29%), Positives = 37/84 (44%), Gaps = 3/84 (3%)
Frame = +3

Query: 303 REPGVSSCRAASEHWANLKISCIAHVRLRGV---KVEQEKVEAGRKRG*RFEEDEGEQHM 473
RE + CR SE W + +S A VR GV KV + +R + E EG H
Sbjct: 24 RESTMEGCRVTSESWPSSPVSLAAGVRDAGVDACKVGHRCQDGESERLGKERESEGRDHA 83

Query: 474 ASVSGGGGARKPIISSSVTAGRDA 545
+ G G+ ++++ G DA
Sbjct: 84 DVAAEGFGSLAALVAAG-GGGSDA 106


>tr|B6ED08|B6ED08_9ROSI Putative lacasse/diphenol oxidase
OS=Castanea mollissima PE=2 SV=1
Length = 567

Score = 33.5 bits (75), Expect = 8.6
Identities = 26/59 (44%), Positives = 30/59 (50%), Gaps = 11/59 (18%)
Frame = -1

Query: 554 QVVCVSASCDT*ADNRLSGS----SSSTHRGHVL------LPLVFLKSLPSFPPC-FNF 411
Q+VC +ASC + NRLS S S T VL LP VF K P+ PP FNF
Sbjct: 368 QIVCANASCGGPSGNRLSASLNNISFVTPSTDVLQAYYKSLPNVFDKDFPNLPPYKFNF 426


>tr|Q7QI55|Q7QI55_ANOGA AGAP005291-PB OS=Anopheles gambiae
GN=AGAP005291 PE=4 SV=4
Length = 1033

Score = 33.5 bits (75), Expect = 8.6
Identities = 22/71 (30%), Positives = 33/71 (46%)
Frame = -2

Query: 496 PPPPLTEAMCCSPSSSSNLYPLFLPASTFSCSTLTPLSLTCAIHEIFKLAQCSEAARQLE 317
PPPPL + S +S PL +P C + P+ C++ E+ AQ + AA +
Sbjct: 200 PPPPLPRNIRNPVSKASKTVPLLIP-----CVNVAPMVKPCSVKEV---AQKTNAAPAAQ 251

Query: 316 TPGSLSDGGAE 284
P + S AE
Sbjct: 252 PPPAESGKSAE 262
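
Note on the Frame values: each alignment in the reports above gives a reading frame (for example Frame = -2 for Query 544..419, Frame = +3 for Query 222..368). The sketch below shows one way to recompute that frame from the 1-based query coordinates and the query length of 575, assuming the usual BLASTX convention that minus-strand frames are counted from the 3' end of the query. The function name blastx_frame is hypothetical and is not part of BLAST.

```python
def blastx_frame(query_start: int, query_end: int, query_len: int) -> int:
    """Return the reading frame implied by 1-based BLASTX query coordinates.

    Assumption: plus-strand frames (+1..+3) are counted from the first base
    of the query, minus-strand frames (-1..-3) from its last base.
    """
    if query_start <= query_end:
        # Plus strand: offset of the first aligned base from position 1.
        return (query_start - 1) % 3 + 1
    # Minus strand: offset of the first aligned base from the 3' end.
    return -((query_len - query_start) % 3 + 1)


QUERY_LEN = 575  # "(575 letters)" in the query header

# (query start, query end, frame reported in the alignment)
reported = [(544, 419, -2), (222, 368, 3), (539, 408, -1)]
for start, end, frame in reported:
    print(start, end, "computed:", blastx_frame(start, end, QUERY_LEN),
          "reported:", frame)
```

The three coordinate pairs are taken from the PPK32_SCHPO, A1BG_RAT and GPR81_HUMAN alignments; the same check can be applied to any other Query line in the record.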