DK959047
Clone id TST39A01NGRL0003_K01
Library
Length 692
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0003_K01. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID
Sequence
CAAAATGGTGAGGCAGCATTGTCTGAGTGCAGTGGTGGTGCTGGCGTTTCTGGTACCGCT
CGTAGCAGCGCGTGCAGTAGGCCGCCTCTGCGACACCTCCTCCTTCTTTCCAGACCCCAA
CAATGATCGCACTGTGACGTTCCTTGGCGCTAATGCAGGCTCCGCAGGTTTCAAGCTCTT
CTCTTTTGCCAGCAATGGCTTCCTCGGGGCTCCTGCCAATCTTGTCGAGGTAAGACTCCT
TAACCTGCACAGGAAAATGCTCTTCAACCCCACCCTTTCACACTCCCTGGATCTGCGCAT
CAAGCCAGATAAACCCTTTGTTGATGCTCCTGCCCGAAATCAAGTGGTCCAGACATGGAG
CTTCTCCACCAGCAACCCCCAGCATCCCATCATATGGATCAATATCCGCTCTCGCCGTGC
CCTTTGGCGCCCTGATCTTGCGGTTCCTCATGTTGAGCAGAAGCCCCTGAGTTTGGATTA
CGGTACCAATCTGTCGGTATTTGTGATTTCAGTTGTCTTGGGTTTTGTTTGCTTCTGGGC
TGCGGCAGCGTTAATACGAGATGTTCTGCGGAACTTTTTCATCCGCTCCCGTGAAGATCA
CGATATTGGCTTGCTCTATGGATATCAGGAATTGCCTTCTTCAGAAGCATCTGAGAAGGA
GGTTGCACAGAAGGAGTCTGCCAAGGTAAGTG
■■Homology search results ■■
sp_hit_id P16467
Definition sp|P16467|PDC5_YEAST Pyruvate decarboxylase isozyme 2 OS=Saccharomyces cerevisiae
Align length 65
Score (bit) 33.5
E-value 1.1
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK959047|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0003_K01, 5'
(692 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|P16467|PDC5_YEAST Pyruvate decarboxylase isozyme 2 OS=Sacchar... 33 1.1
sp|P34734|PDC_HANUV Pyruvate decarboxylase OS=Hanseniaspora uvar... 33 1.9
sp|P06169|PDC1_YEAST Pyruvate decarboxylase isozyme 1 OS=Sacchar... 32 2.5
sp|P0C1D7|CSP1_CORML Protein PS1 OS=Corynebacterium melassecola ... 32 2.5
sp|P0C1D6|CSP1_CORGL Protein PS1 OS=Corynebacterium glutamicum G... 32 2.5
sp|Q810F1|CTL2_CAVPO Choline transporter-like protein 2 OS=Cavia... 32 3.2
sp|A6L1V8|LEUC_BACV8 3-isopropylmalate dehydratase large subunit... 31 5.5
sp|Q24143|HR96_DROME Nuclear hormone receptor HR96 OS=Drosophila... 31 5.5
sp|Q9UV64|MOCOS_EMENI Molybdenum cofactor sulfurase OS=Emericell... 30 9.4
sp|Q8MII8|LRC25_BOVIN Leucine-rich repeat-containing protein 25 ... 30 9.4
sp|Q9ET78|JPH2_MOUSE Junctophilin-2 OS=Mus musculus GN=Jph2 PE=1... 30 9.4

>sp|P16467|PDC5_YEAST Pyruvate decarboxylase isozyme 2
OS=Saccharomyces cerevisiae GN=PDC5 PE=1 SV=4
Length = 563

Score = 33.5 bits (75), Expect = 1.1
Identities = 24/65 (36%), Positives = 37/65 (56%)
Frame = +2

Query: 200 FLGAPANLVEVRLLNLHRKMLFNPTLSHSLDLRIKPDKPFVDAPARNQVVQTWSFSTSNP 379
+LG PANLV+ LN+ K+L P +DL +KP+ DA A +VV+T +
Sbjct: 164 YLGLPANLVD---LNVPAKLLETP-----IDLSLKPN----DAEAEAEVVRTVVELIKDA 211

Query: 380 QHPII 394
++P+I
Sbjct: 212 KNPVI 216
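
The "Frame = +2" value and the query coordinates in the alignment above can be checked by hand against the Sequence field. The sketch below is illustrative only and is not part of the BLAST output; the helper names (`forward_frame`, `translate`) are hypothetical. It assumes the standard genetic code, and the 30-nucleotide segment is copied from query positions 200-229.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def forward_frame(start):
    """Return the forward reading frame (1, 2 or 3) for a 1-based start."""
    return (start - 1) % 3 + 1


def translate(dna):
    """Translate complete codons only; unrecognised codons become 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


# Query nucleotides 200-229, copied from the Sequence field of this record.
segment = "TTCCTCGGGGCTCCTGCCAATCTTGTCGAG"
print(forward_frame(200), translate(segment))  # prints: 2 FLGAPANLVE
```

The translated peptide matches the first ten residues of the query line (FLGAPANLVE), and a start at nucleotide 200 falls in frame +2, in agreement with the report.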


>sp|P34734|PDC_HANUV Pyruvate decarboxylase OS=Hanseniaspora uvarum
GN=PDC PE=3 SV=1
Length = 564

Score = 32.7 bits (73), Expect = 1.9
Identities = 23/65 (35%), Positives = 37/65 (56%)
Frame = +2

Query: 200 FLGAPANLVEVRLLNLHRKMLFNPTLSHSLDLRIKPDKPFVDAPARNQVVQTWSFSTSNP 379
+LG PANLV+ LN+ K+L +DL +K + DA A N+VV+T ++
Sbjct: 164 YLGLPANLVD---LNVPAKLL-----ETKIDLALKAN----DAEAENEVVETILALVADA 211

Query: 380 QHPII 394
++P+I
Sbjct: 212 KNPVI 216


>sp|P06169|PDC1_YEAST Pyruvate decarboxylase isozyme 1
OS=Saccharomyces cerevisiae GN=PDC1 PE=1 SV=7
Length = 563

Score = 32.3 bits (72), Expect = 2.5
Identities = 21/65 (32%), Positives = 36/65 (55%)
Frame = +2

Query: 200 FLGAPANLVEVRLLNLHRKMLFNPTLSHSLDLRIKPDKPFVDAPARNQVVQTWSFSTSNP 379
+LG PANLV+ LN+ K+L P +D+ +KP+ DA + +V+ T +
Sbjct: 164 YLGLPANLVD---LNVPAKLLQTP-----IDMSLKPN----DAESEKEVIDTILALVKDA 211

Query: 380 QHPII 394
++P+I
Sbjct: 212 KNPVI 216


>sp|P0C1D7|CSP1_CORML Protein PS1 OS=Corynebacterium melassecola
GN=csp1 PE=4 SV=1
Length = 657

Score = 32.3 bits (72), Expect = 2.5
Identities = 10/35 (28%), Positives = 23/35 (65%)
Frame = -1

Query: 575 SSAEHLVLTLPQPRSKQNPRQLKSQIPTDWYRNPN 471
++++H++LT+ + P +++ +P DWY +PN
Sbjct: 101 ATSKHVILTIQSAAMPERPIKVQLLLPRDWYSSPN 135


>sp|P0C1D6|CSP1_CORGL Protein PS1 OS=Corynebacterium glutamicum
GN=csp1 PE=3 SV=1
Length = 657

Score = 32.3 bits (72), Expect = 2.5
Identities = 10/35 (28%), Positives = 23/35 (65%)
Frame = -1

Query: 575 SSAEHLVLTLPQPRSKQNPRQLKSQIPTDWYRNPN 471
++++H++LT+ + P +++ +P DWY +PN
Sbjct: 101 ATSKHVILTIQSAAMPERPIKVQLLLPRDWYSSPN 135


>sp|Q810F1|CTL2_CAVPO Choline transporter-like protein 2 OS=Cavia
porcellus GN=SLC44A2 PE=1 SV=1
Length = 705

Score = 32.0 bits (71), Expect = 3.2
Identities = 26/79 (32%), Positives = 34/79 (43%), Gaps = 13/79 (16%)
Frame = -1

Query: 680 QTPSVQPPSQMLLKKAIPDIH---------RASQYRDLHGSG*KSSAEHLVLTLPQPRSK 528
Q P+V PS+ L ++ PDIH A+ Y D HGS + + LV Q
Sbjct: 156 QCPAVLIPSKPLAQRCFPDIHAHKGVIMVGNATTYEDGHGS--RKNITELVEGAKQANGI 213

Query: 527 QNPRQLKSQIPTD----WY 483
RQL +I D WY
Sbjct: 214 LEARQLAMRIFEDYTVSWY 232


>sp|A6L1V8|LEUC_BACV8 3-isopropylmalate dehydratase large subunit
OS=Bacteroides vulgatus (strain ATCC 8482 / DSM 1447 /
NCTC 11154) GN=leuC PE=3 SV=1
Length = 460

Score = 31.2 bits (69), Expect = 5.5
Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 4/47 (8%)
Frame = -2

Query: 271 GVEEHFPVQVKESY----LDKIGRSPEEAIAGKREELETCGACISAK 143
G+ EH PV K + LD +G P E++ GK+ + GAC + +
Sbjct: 299 GITEHIPVDDKSASFKKSLDYMGFQPGESLLGKKIDYVFLGACTNGR 345


>sp|Q24143|HR96_DROME Nuclear hormone receptor HR96 OS=Drosophila
melanogaster GN=Hr96 PE=1 SV=1
Length = 723

Score = 31.2 bits (69), Expect = 5.5
Identities = 16/33 (48%), Positives = 23/33 (69%)
Frame = -3

Query: 210 APRKPLLAKEKSLKPAEPALAPRNVTVRSLLGS 112
A RKPLL KE ++KPA PA + ++S+LG+
Sbjct: 315 ADRKPLLDKEPAVKPAAPA-ERADTVIQSMLGN 346


>sp|Q9UV64|MOCOS_EMENI Molybdenum cofactor sulfurase OS=Emericella
nidulans GN=hxB PE=2 SV=2
Length = 839

Score = 30.4 bits (67), Expect = 9.4
Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 3/86 (3%)
Frame = +2

Query: 359 SFSTSNPQHPIIWINIRSRRALWRPDLAVPHVEQ-KPLSLDYGTNLSVFVISVVLGFVCF 535
S+ ++ Q PI+ N+R+ R +W V + K + + GT + ++ LG
Sbjct: 356 SYDDASSQGPILAFNLRNSRGMWIGKSEVERLASIKNIQIRSGTLCNPGGTALSLG---- 411

Query: 536 WAAAALIRDVLRNFF--IRSREDHDI 607
W A D+LR+F +R +DHDI
Sbjct: 412 WTGA----DMLRHFSAGMRCGDDHDI 433


>sp|Q8MII8|LRC25_BOVIN Leucine-rich repeat-containing protein 25
OS=Bos taurus GN=LRRC25 PE=2 SV=1
Length = 307

Score = 30.4 bits (67), Expect = 9.4
Identities = 23/83 (27%), Positives = 32/83 (38%)
Frame = +1

Query: 145 WR*CRLRRFQALLFCQQWLPRGSCQSCRGKTP*PAQENALQPHPFTLPGSAHQAR*TLC* 324
WR CR R Q L + W + +S G+ P + + P P
Sbjct: 191 WRFCRHRMDQNL--SKTWASQDGSRSGSGRQPRYSSQGRRPKSPANTP------------ 236

Query: 325 CSCPKSSGPDMELLHQQPPASHH 393
P+SS PD E + PPA+ H
Sbjct: 237 ---PRSSTPDYENVFVGPPAARH 256
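
The "Identities", "Positives" and "Gaps" lines in the alignments above follow a fixed "n/total (pct%)" pattern. A minimal sketch for reading them is given below; the function names are hypothetical. The percentages in this record appear consistent with truncation rather than rounding (for example 24/65 is shown as 36%), and the code treats that as an assumption.

```python
import re

# Matches e.g. "Identities = 26/79 (32%)" anywhere in a statistics line.
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {name: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(n), int(total), int(pct))
        for name, n, total, pct in STAT.findall(line)
    }


def truncated_percent(n, total):
    """Integer percentage with the fraction dropped (assumed convention)."""
    return 100 * n // total


line = "Identities = 26/79 (32%), Positives = 34/79 (43%), Gaps = 13/79 (16%)"
stats = parse_stats(line)
for name, (n, total, reported) in stats.items():
    print(name, n, total, reported, truncated_percent(n, total))
```

Lines without a "Gaps" field simply yield no entry for that key, so callers can treat a missing key as zero gaps.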


tr_hit_id Q5AYC2
Definition tr|Q5AYC2|Q5AYC2_EMENI Putative uncharacterized protein OS=Emericella nidulans
Align length 60
Score (bit) 38.1
E-value 0.53
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= DK959047|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0003_K01, 5'
(692 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|Q5AYC2|Q5AYC2_EMENI Putative uncharacterized protein OS=Emeri... 38 0.53
tr|A1DCR1|A1DCR1_NEOFI Dihydrolipoamide acetyltransferase compon... 38 0.53
tr|Q2USG5|Q2USG5_ASPOR Dihydrolipoamide acetyltransferase OS=Asp... 37 0.91
tr|B8MX81|B8MX81_ASPFL Pyruvate dehydrogenase complex, dihydroli... 37 0.91
tr|Q2GMK9|Q2GMK9_CHAGB Putative uncharacterized protein OS=Chaet... 37 1.2
tr|B7X395|B7X395_COMTE Phosphoesterase PA-phosphatase related OS... 36 2.7
tr|B6VUD2|B6VUD2_9BACE Putative uncharacterized protein OS=Bacte... 35 5.9
tr|B3P624|B3P624_DROER GG11545 OS=Drosophila erecta GN=GG11545 P... 35 5.9
tr|Q2GY41|Q2GY41_CHAGB Putative uncharacterized protein OS=Chaet... 35 5.9
tr|Q5RII0|Q5RII0_DANRE Novel protein similar to vertebrate spect... 34 7.7
tr|A4KBN9|A4KBN9_PICV Capsid protein OS=Pigeon circovirus GN=Cap... 34 7.7
tr|B4KKD6|B4KKD6_DROMO GI17798 OS=Drosophila mojavensis GN=GI177... 34 7.7
tr|A2R4K6|A2R4K6_ASPNC Similarity to hypothetical protein SPBC12... 34 7.7

>tr|Q5AYC2|Q5AYC2_EMENI Putative uncharacterized protein
OS=Emericella nidulans GN=AN6708.2 PE=3 SV=1
Length = 488

Score = 38.1 bits (87), Expect = 0.53
Identities = 22/60 (36%), Positives = 37/60 (61%), Gaps = 6/60 (10%)
Frame = -3

Query: 216 AGAPRKPLLAKEKSLKPAEPALA-PRNVTVRSLLGSGK-----KEEVSQRRPTARAATSG 55
+G +P L +E ++ PA ALA + V +++L G+G+ KE+V + +PTA AA +G
Sbjct: 192 SGEKLQPSLDREPAISPAAKALALEKGVPIKALKGTGRGGQITKEDVEKYKPTAAAAAAG 251


>tr|A1DCR1|A1DCR1_NEOFI Dihydrolipoamide acetyltransferase component
of pyruvate dehydrogenase OS=Neosartorya fischeri
(strain ATCC 1020 / DSM 3700 / NRRL 181) GN=NFIA_026950
PE=3 SV=1
Length = 484

Score = 38.1 bits (87), Expect = 0.53
Identities = 24/73 (32%), Positives = 42/73 (57%), Gaps = 6/73 (8%)
Frame = -3

Query: 201 KPLLAKEKSLKPAEPALA-PRNVTVRSLLGSGK-----KEEVSQRRPTARAATSGTRNAS 40
+P L +E ++ PA ALA + V +++L G+G+ KE+V + +P+ AAT+ T
Sbjct: 194 QPSLDREPNISPAAKALALEKGVPIKALKGTGRGGQITKEDVEKYKPSVSAATAPTYEDI 253

Query: 39 TTTALRQCCLTIL 1
T++R+ T L
Sbjct: 254 PLTSMRKTIATRL 266


>tr|Q2USG5|Q2USG5_ASPOR Dihydrolipoamide acetyltransferase
OS=Aspergillus oryzae GN=AO090005000436 PE=3 SV=1
Length = 459

Score = 37.4 bits (85), Expect = 0.91
Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 6/72 (8%)
Frame = -3

Query: 216 AGAPRKPLLAKEKSLKPAEPALA-PRNVTVRSLLGSGK-----KEEVSQRRPTARAATSG 55
+G +P L +E ++ PA ALA + V +++L G+G+ KE+V + +P+A AA
Sbjct: 164 SGEKLQPSLDREPTISPAAKALALEKGVPIKALKGTGRGGQITKEDVEKYKPSASAAAGP 223

Query: 54 TRNASTTTALRQ 19
T T++R+
Sbjct: 224 TYEDIPLTSMRK 235


>tr|B8MX81|B8MX81_ASPFL Pyruvate dehydrogenase complex,
dihydrolipoamide acetyltransferase OS=Aspergillus flavus
NRRL3357 GN=AFLA_076680 PE=4 SV=1
Length = 485

Score = 37.4 bits (85), Expect = 0.91
Identities = 23/72 (31%), Positives = 41/72 (56%), Gaps = 6/72 (8%)
Frame = -3

Query: 216 AGAPRKPLLAKEKSLKPAEPALA-PRNVTVRSLLGSGK-----KEEVSQRRPTARAATSG 55
+G +P L +E ++ PA ALA + V +++L G+G+ KE+V + +P+A AA
Sbjct: 190 SGEKLQPSLDREPTISPAAKALALEKGVPIKALKGTGRGGQITKEDVEKYKPSASAAAGP 249

Query: 54 TRNASTTTALRQ 19
T T++R+
Sbjct: 250 TYEDIPLTSMRK 261


>tr|Q2GMK9|Q2GMK9_CHAGB Putative uncharacterized protein
OS=Chaetomium globosum GN=CHGG_10795 PE=4 SV=1
Length = 298

Score = 37.0 bits (84), Expect = 1.2
Identities = 28/96 (29%), Positives = 41/96 (42%), Gaps = 4/96 (4%)
Frame = -1

Query: 680 QTPSVQPPSQMLLKKAIPDIHRASQYRDLH-GSG*KSSAEHLVLTLP---QPRSKQNPRQ 513
+T + PP + K + D Q DL G G K +A ++LT Q R + RQ
Sbjct: 179 ETETWVPPLDLYFNKRLADFEIRLQRTDLDDGQGGKRTAGGIILTACRKIQQRLRPEERQ 238

Query: 512 LKSQIPTDWYRNPNSGASAQHEEPQDQGAKGHGESG 405
R SG S +H+ D G++ HG+ G
Sbjct: 239 QGPAANRGTARAHGSGESREHDRALDGGSRRHGQCG 274


>tr|B7X395|B7X395_COMTE Phosphoesterase PA-phosphatase related
OS=Comamonas testosteroni KF-1 GN=CtesDRAFT_PD5272 PE=4
SV=1
Length = 191

Score = 35.8 bits (81), Expect = 2.7
Identities = 25/81 (30%), Positives = 33/81 (40%)
Frame = +2

Query: 317 FVDAPARNQVVQTWSFSTSNPQHPIIWINIRSRRALWRPDLAVPHVEQKPLSLDYGTNLS 496
F DAP + F +N Q P+ WI + W P L V L+L G S
Sbjct: 3 FFDAPV-------FQFLNANTQSPLWWIQASRFASAWLPGLCALPVIAAMLALGKGWRRS 55

Query: 497 VFVISVVLGFVCFWAAAALIR 559
+ + +L C W A LIR
Sbjct: 56 LQL--ALLSMACAWVACRLIR 74


>tr|B6VUD2|B6VUD2_9BACE Putative uncharacterized protein
OS=Bacteroides dorei DSM 17855 GN=BACDOR_00905 PE=4 SV=1
Length = 319

Score = 34.7 bits (78), Expect = 5.9
Identities = 19/74 (25%), Positives = 36/74 (48%), Gaps = 6/74 (8%)
Frame = -2

Query: 256 FPVQVKESYLDKIGRSPEEAIA------GKREELETCGACISAKERHSAIIVGVWKEGGG 95
FP+ V + D I + PE + K +E+++CG + +S ++G+WKE
Sbjct: 96 FPMPVVDLLFDSIEKLPESVVCISPATYFKNDEIQSCGRPRHSFLTYSLFLMGLWKEQKI 155

Query: 94 VAEAAYCTRCYERY 53
++ Y T ++ Y
Sbjct: 156 KSKYTYQTGVFQTY 169


>tr|B3P624|B3P624_DROER GG11545 OS=Drosophila erecta GN=GG11545 PE=4
SV=1
Length = 569

Score = 34.7 bits (78), Expect = 5.9
Identities = 25/65 (38%), Positives = 31/65 (47%), Gaps = 4/65 (6%)
Frame = -3

Query: 213 GAPRKPLLAKEKSLKPAEPALAPRNVTVRSLLGSGKKEEV----SQRRPTARAATSGTRN 46
G+P LA+ KS K +P L RNV V G G + + S P +AATS T
Sbjct: 396 GSPTNQYLARVKSPKAKQPTLKVRNVRVE---GEGLEASIGVPPSTATPKPKAATSTTTP 452

Query: 45 ASTTT 31
TTT
Sbjct: 453 KPTTT 457


>tr|Q2GY41|Q2GY41_CHAGB Putative uncharacterized protein
OS=Chaetomium globosum GN=CHGG_07113 PE=4 SV=1
Length = 381

Score = 34.7 bits (78), Expect = 5.9
Identities = 21/60 (35%), Positives = 32/60 (53%)
Frame = -3

Query: 213 GAPRKPLLAKEKSLKPAEPALAPRNVTVRSLLGSGKKEEVSQRRPTARAATSGTRNASTT 34
G LLA+ ++ K AE + RSL G GKK+E +Q PT++ + TR +T+
Sbjct: 33 GQSESELLARREASKAAEGD--EKKKKKRSLFGFGKKKEPAQPAPTSKPTPATTRTTATS 90


>tr|Q5RII0|Q5RII0_DANRE Novel protein similar to vertebrate spectrin
repeat containing, nuclear envelope 2 (SYNE2) (Fragment)
OS=Danio rerio GN=syne2a PE=4 SV=1
Length = 999

Score = 34.3 bits (77), Expect = 7.7
Identities = 17/68 (25%), Positives = 30/68 (44%)
Frame = -1

Query: 629 PDIHRASQYRDLHGSG*KSSAEHLVLTLPQPRSKQNPRQLKSQIPTDWYRNPNSGASAQH 450
PD++ + R+ S +L+ PQ RS + IP +W + G S+ H
Sbjct: 524 PDVYDGERSRETQSPPSSSQPSMCLLSPPQERSGRETPVSVDSIPLEWDHTGDVGGSSSH 583

Query: 449 EEPQDQGA 426
+E ++ A
Sbjct: 584 DEEEEDAA 591