DK951556
Clone id TST38A01NGRL0011_L14
Library
Length 680
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0011_L14. 5' end sequence.
Accession
Tissue type prothallia
Developmental stage gametophyte
Contig ID
Sequence
CTCTTTTCTCATCAACCGGGAGAAGGCCAACCTCCCCTTAGATAATCTCTGCTTTGCGCG
CCCGCAGAGTTTGTGTGTTTGTGCGGCGTGCTCGCGTGTGGTTGAGAGAGAGAGAGAGAG
AGAGAGAGAGAGAGAGAGAGACCATGAACCTGCTTACCCAGTTTAAACTCTGCTTTCAAT
GCCTTCCTGTTTTCTCCTCTGAGTCGCCTCCCCCTTCCCCACGCCCATTCTCAGCAGGAG
CCGTCGCAAACTCAGAAAAGGATGTGCATCTCCATGGCTCTGCGACGACTCCTCCGGTAA
AGGAAGGCGCAATTGCGCCAGAAGGCGAGACCCCATCTTTTTCTGATGCCTCTCTCACAC
ATGACAATGAACAGAAAGATCTGCCCATTTCCCAGCAAGCAAAGGTTGAGCTTCTTGGTG
CACCCGGTTCGCAGCATATTGCATCGCCCCTGGAAACTACATATTCCCCGAAAGGAGAGG
AAGGTAATGCTTCTAAGCAGCAGACTACCTCTGGATTGATGCCGCTTCCGTCTCCCATCT
CTCCGAGCCCCATCGGTTTGGGGTATATTGTCGGCGAGCATGCTTCTTCTCATCACATTC
ATCATCAGGTCAGCTCTCAACACCATGATGGGGGCCACTCATTCTTCCATTTACACAGGC
ATGACCAGGCCCATCCTCTG
■■ Homology search results ■■
sp_hit_id Q5SWP3
Definition sp|Q5SWP3|NACAD_MOUSE NAC-alpha domain-containing protein 1 OS=Mus musculus
Align length 100
Score (bit) 34.3
E-value 0.63
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK951556|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0011_L14, 5'
(680 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

sp|Q5SWP3|NACAD_MOUSE NAC-alpha domain-containing protein 1 OS=M...    34   0.63
sp|Q9C086|IN80B_HUMAN INO80 complex subunit B OS=Homo sapiens GN...    33   1.4
sp|Q99PT3|IN80B_MOUSE INO80 complex subunit B OS=Mus musculus GN...    33   1.8
sp|Q7Y1H8|IAA14_ORYSJ Auxin-responsive protein IAA14 OS=Oryza sa...    33   1.8
sp|A7MJ10|MOAA_ENTS8 Molybdenum cofactor biosynthesis protein A ...    32   2.4
sp|Q8N0Y2|ZN444_HUMAN Zinc finger protein 444 OS=Homo sapiens GN...    32   3.1
sp|A2ASS6|TITIN_MOUSE Titin OS=Mus musculus GN=Ttn PE=1 SV=1           32   4.1
sp|Q0P5X1|LRIQ1_MOUSE IQ domain-containing protein LRRIQ1 OS=Mus...    32   4.1
sp|A6T6M5|MOAA_KLEP7 Molybdenum cofactor biosynthesis protein A ...    31   6.9
sp|Q6MG48|BAT2_RAT Large proline-rich protein BAT2 OS=Rattus nor...    31   6.9
sp|Q7TSC1|BAT2_MOUSE Large proline-rich protein BAT2 OS=Mus musc...    31   6.9
sp|Q8JZM8|MUC4_MOUSE Mucin-4 OS=Mus musculus GN=Muc4 PE=2 SV=1         30   9.1

>sp|Q5SWP3|NACAD_MOUSE NAC-alpha domain-containing protein 1 OS=Mus
musculus GN=Nacad PE=2 SV=1
Length = 1504

Score = 34.3 bits (77), Expect = 0.63
Identities = 27/100 (27%), Positives = 41/100 (41%), Gaps = 5/100 (5%)
Frame = -2

Query: 496 LEALPSSPFGEYVVSRGDAICCEPGAPRSSTFACWEMGRSFCSLSCVREASEKD-----G 332
L++ P+SP G Y+ + GD+ P CSLS + A D G
Sbjct: 276 LDSTPASPSGSYITADGDSWASSPS----------------CSLSLLDPAEGLDFPSDWG 319

Query: 331 VSPSGAIAPSFTGGVVAEPWRCTSFSEFATAPAENGRGEG 212
+SPSG++A A P +S S + + + EG
Sbjct: 320 LSPSGSVADDLEPHPAAPPEPPSSESSLSADSSSSWSQEG 359


>sp|Q9C086|IN80B_HUMAN INO80 complex subunit B OS=Homo sapiens
GN=INO80B PE=1 SV=1
Length = 343

Score = 33.1 bits (74), Expect = 1.4
Identities = 15/44 (34%), Positives = 20/44 (45%)
Frame = -2

Query: 436 CCEPGAPRSSTFACWEMGRSFCSLSCVREASEKDGVSPSGAIAP 305
C PG P +AC G++ CSL C R + P G +P
Sbjct: 296 CSVPGCPHPRRYACSRTGQALCSLQCYRINLQMRLGGPEGPGSP 339


>sp|Q99PT3|IN80B_MOUSE INO80 complex subunit B OS=Mus musculus
GN=Ino80b PE=1 SV=1
Length = 345

Score = 32.7 bits (73), Expect = 1.8
Identities = 12/28 (42%), Positives = 15/28 (53%)
Frame = -2

Query: 436 CCEPGAPRSSTFACWEMGRSFCSLSCVR 353
C PG P +AC G++ CSL C R
Sbjct: 298 CSVPGCPHPRRYACSRTGQALCSLQCYR 325


>sp|Q7Y1H8|IAA14_ORYSJ Auxin-responsive protein IAA14 OS=Oryza
sativa subsp. japonica GN=IAA14 PE=2 SV=1
Length = 195

Score = 32.7 bits (73), Expect = 1.8
Identities = 20/85 (23%), Positives = 40/85 (47%)
Frame = -2

Query: 448 GDAICCEPGAPRSSTFACWEMGRSFCSLSCVREASEKDGVSPSGAIAPSFTGGVVAEPWR 269
GD + + SST G + C + R+ +DG SP+ + G +R
Sbjct: 20 GDGVAAKKRRSASSTVKSEASGTACCGGAGARDV--EDGASPASKV--QVVGWPPVGSYR 75

Query: 268 CTSFSEFATAPAENGRGEGGGDSEE 194
++F +++ A +G+GGG++++
Sbjct: 76 RSTFQSSSSSTAAAAKGKGGGETDQ 100


>sp|A7MJ10|MOAA_ENTS8 Molybdenum cofactor biosynthesis protein A
OS=Enterobacter sakazakii (strain ATCC BAA-894) GN=moaA
PE=3 SV=1
Length = 329

Score = 32.3 bits (72), Expect = 2.4
Identities = 15/36 (41%), Positives = 20/36 (55%)
Frame = -1

Query: 455 FQGRCNMLRTGCTKKLNLCLLGNGQIFLFIVMCERG 348
F CN LR KL+LCL G+G + L ++ E G
Sbjct: 256 FCASCNRLRVSSVGKLHLCLFGDGGVDLRDLLAEEG 291


>sp|Q8N0Y2|ZN444_HUMAN Zinc finger protein 444 OS=Homo sapiens
GN=ZNF444 PE=1 SV=1
Length = 327

Score = 32.0 bits (71), Expect = 3.1
Identities = 18/47 (38%), Positives = 20/47 (42%), Gaps = 3/47 (6%)
Frame = -2

Query: 403 FACWEMGRSFCSLSCVREASEKDG---VSPSGAIAPSFTGGVVAEPW 272
FACWE G+ F V G S GA+AP GG PW
Sbjct: 278 FACWECGKGFGRREHVLRHQRIHGRAAASAQGAVAPGPDGGGPFPPW 324


>sp|A2ASS6|TITIN_MOUSE Titin OS=Mus musculus GN=Ttn PE=1 SV=1
Length = 35213

Score = 31.6 bits (70), Expect = 4.1
Identities = 18/53 (33%), Positives = 26/53 (49%)
Frame = +3

Query: 243 VANSEKDVHLHGSATTPPVKEGAIAPEGETPSFSDASLTHDNEQKDLPISQQA 401
V + DV LHGS ++ V+ A E SFS +S + E K +S Q+
Sbjct: 35024 VLKTSSDVSLHGSVSSQSVQMSASKQEASFSSFSSSSASSMTEMKFASMSAQS 35076


>sp|Q0P5X1|LRIQ1_MOUSE IQ domain-containing protein LRRIQ1 OS=Mus
musculus GN=Lrriq1 PE=2 SV=1
Length = 814

Score = 31.6 bits (70), Expect = 4.1
Identities = 18/52 (34%), Positives = 24/52 (46%)
Frame = +3

Query: 240 AVANSEKDVHLHGSATTPPVKEGAIAPEGETPSFSDASLTHDNEQKDLPISQ 395
AV N++ VH G A G +AP E S S +L E +D P S+
Sbjct: 668 AVINADASVHTEGEADLQDSASGKLAPSEEAGSHSANNLLATEEVEDSPKSE 719


>sp|A6T6M5|MOAA_KLEP7 Molybdenum cofactor biosynthesis protein A
OS=Klebsiella pneumoniae subsp. pneumoniae (strain ATCC
700721 / MGH 78578) GN=moaA PE=3 SV=1
Length = 329

Score = 30.8 bits (68), Expect = 6.9
Identities = 15/34 (44%), Positives = 18/34 (52%)
Frame = -1

Query: 455 FQGRCNMLRTGCTKKLNLCLLGNGQIFLFIVMCE 354
F CN LR KL+LCL G G + L +M E
Sbjct: 256 FCATCNRLRVSSVGKLHLCLFGEGGVDLRDLMAE 289


>sp|Q6MG48|BAT2_RAT Large proline-rich protein BAT2 OS=Rattus
norvegicus GN=Bat2 PE=1 SV=1
Length = 2161

Score = 30.8 bits (68), Expect = 6.9
Identities = 24/73 (32%), Positives = 33/73 (45%), Gaps = 12/73 (16%)
Frame = +3

Query: 252 SEKDVHLHGSATTPPVKEGAIAP----EGETPSFSDAS--------LTHDNEQKDLPISQ 395
S D L GSA PP + G +P GE+ S S+ S +H+ E+K+LP Q
Sbjct: 1670 SSPDGGLKGSAEGPPRRPGGPSPLKAVPGESSSASEPSEPHRRRPPASHEGERKELPREQ 1729

Query: 396 QAKVELLGAPGSQ 434
+G SQ
Sbjct: 1730 PLPPGPIGTERSQ 1742


tr_hit_id A6GH35
Definition tr|A6GH35|A6GH35_9DELT Putative lipoprotein OS=Plesiocystis pacifica SIR-1
Align length 47
Score (bit) 35.4
E-value 3.4
Report
BLASTX 2.2.19 [Nov-02-2008]



Query= DK951556|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0011_L14, 5'
(680 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

tr|A6GH35|A6GH35_9DELT Putative lipoprotein OS=Plesiocystis paci...    35   3.4
tr|B8CF60|B8CF60_THAPS Predicted protein OS=Thalassiosira pseudo...    35   5.8
tr|A9FXL7|A9FXL7_SORC5 Putative uncharacterized protein OS=Soran...    34   7.5
tr|Q76K84|Q76K84_ARATH MADS-box protein (Fragment) OS=Arabidopsi...    34   7.5
tr|Q766C0|Q766C0_ARATH MADS-box protein OS=Arabidopsis thaliana ...    34   7.5
tr|Q295C4|Q295C4_DROPS GA19931 OS=Drosophila pseudoobscura pseud...    34   7.5
tr|B0DXY5|B0DXY5_LACBS Predicted protein OS=Laccaria bicolor (st...    34   7.5
tr|Q8PU81|Q8PU81_METMA Dipeptide/oligopeptide-binding protein OS...    34   7.5
tr|Q2T102|Q2T102_BURTA PhoH family protein OS=Burkholderia thail...    34   9.8
tr|B3MFC8|B3MFC8_DROAN GF12391 OS=Drosophila ananassae GN=GF1239...    34   9.8
tr|A7TCV6|A7TCV6_NEMVE Predicted protein (Fragment) OS=Nematoste...    34   9.8

>tr|A6GH35|A6GH35_9DELT Putative lipoprotein OS=Plesiocystis
pacifica SIR-1 GN=PPSIR1_14285 PE=4 SV=1
Length = 676

Score = 35.4 bits (80), Expect = 3.4
Identities = 19/47 (40%), Positives = 20/47 (42%)
Frame = -2

Query: 499 CLEALPSSPFGEYVVSRGDAICCEPGAPRSSTFACWEMGRSFCSLSC 359
C P P GE GD C+PG P S AC G FC SC
Sbjct: 573 CDPLAPDCPRGETCGRDGDTFVCQPGPPLSEGEAC---GGLFCGASC 616


>tr|B8CF60|B8CF60_THAPS Predicted protein OS=Thalassiosira
pseudonana CCMP1335 GN=THAPSDRAFT_11654 PE=4 SV=1
Length = 818

Score = 34.7 bits (78), Expect = 5.8
Identities = 28/102 (27%), Positives = 41/102 (40%), Gaps = 8/102 (7%)
Frame = -2

Query: 466 EYVVSR-GDAICCEPGAPRSSTFACWEMGRSFCSLSCVREASEKDGVSPSGAIAPSFTGG 290
E+V+S +A+ C PR AC E F S C + + SPS + + S +
Sbjct: 325 EWVISHCEEAVPC----PRGDAAACPENHACFASTPCTKAPTLTPTDSPSSSPSDSPSAS 380

Query: 289 VVAEPWRCTSFSEFATAPA-------ENGRGEGGGDSEEKTG 185
PW T+F EF +N G+ D +K G
Sbjct: 381 PTKAPWSNTAFLEFLYGEETTDGSNNQNNDGQLASDVNDKLG 422


>tr|A9FXL7|A9FXL7_SORC5 Putative uncharacterized protein
OS=Sorangium cellulosum (strain So ce56) GN=sce5364 PE=4
SV=1
Length = 311

Score = 34.3 bits (77), Expect = 7.5
Identities = 21/63 (33%), Positives = 31/63 (49%)
Frame = -2

Query: 373 CSLSCVREASEKDGVSPSGAIAPSFTGGVVAEPWRCTSFSEFATAPAENGRGEGGGDSEE 194
C+L+ E+ +G + GA P+ +GG +S +E AT+ A G G GGG E
Sbjct: 13 CALAACSESPADEGGAGGGA-GPAGSGGSGGAAEASSSAAEGATSSASTGAGAGGGGGGE 71

Query: 193 KTG 185
G
Sbjct: 72 PEG 74


>tr|Q76K84|Q76K84_ARATH MADS-box protein (Fragment) OS=Arabidopsis
thaliana GN=At1g69540 PE=2 SV=1
Length = 318

Score = 34.3 bits (77), Expect = 7.5
Identities = 20/71 (28%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Frame = +3

Query: 297 VKEGAIAPEGETPSFSDASLTHDNEQKDLPISQQAKVELLGAPGSQHIASPLETTYSPK- 473
++ G+I P+ ++L+ N+QK + Q A+ LLG+P +++ LE +Y P+
Sbjct: 235 LETGSIPGTSADPNQQFSNLSFLNDQK---LKQLAEWNLLGSPADYYVSQILEASYKPQI 291

Query: 474 GEEGNASKQQT 506
G + N + +T
Sbjct: 292 GGKNNGASSET 302


>tr|Q766C0|Q766C0_ARATH MADS-box protein OS=Arabidopsis thaliana
GN=At1g69540 PE=2 SV=1
Length = 344

Score = 34.3 bits (77), Expect = 7.5
Identities = 20/71 (28%), Positives = 39/71 (54%), Gaps = 1/71 (1%)
Frame = +3

Query: 297 VKEGAIAPEGETPSFSDASLTHDNEQKDLPISQQAKVELLGAPGSQHIASPLETTYSPK- 473
++ G+I P+ ++L+ N+QK + Q A+ LLG+P +++ LE +Y P+
Sbjct: 261 LETGSIPGTSADPNQQFSNLSFLNDQK---LKQLAEWNLLGSPADYYVSQILEASYKPQI 317

Query: 474 GEEGNASKQQT 506
G + N + +T
Sbjct: 318 GGKNNGASSET 328


>tr|Q295C4|Q295C4_DROPS GA19931 OS=Drosophila pseudoobscura
pseudoobscura GN=GA19931 PE=4 SV=2
Length = 951

Score = 34.3 bits (77), Expect = 7.5
Identities = 17/43 (39%), Positives = 23/43 (53%)
Frame = -2

Query: 430 EPGAPRSSTFACWEMGRSFCSLSCVREASEKDGVSPSGAIAPS 302
+P AP+ T C E G+S+ L C A+ SPS A +PS
Sbjct: 367 QPAAPQQ-TIRCAENGKSYLDLGCSSAAAAAGATSPSSAASPS 408


>tr|B0DXY5|B0DXY5_LACBS Predicted protein OS=Laccaria bicolor
(strain S238N-H82) GN=LACBIDRAFT_295769 PE=4 SV=1
Length = 515

Score = 34.3 bits (77), Expect = 7.5
Identities = 21/75 (28%), Positives = 34/75 (45%)
Frame = +3

Query: 279 SATTPPVKEGAIAPEGETPSFSDASLTHDNEQKDLPISQQAKVELLGAPGSQHIASPLET 458
S +PPV+ IAPE E + E +D+P + Q ++E + S + S
Sbjct: 207 SDLSPPVRADVIAPETEKVQEPETEKIQQVEPEDIPAAPQMQIERSNSTSSYEVVS---- 262

Query: 459 TYSPKGEEGNASKQQ 503
+P+ EE S Q+
Sbjct: 263 --APQEEEVQLSSQE 275


>tr|Q8PU81|Q8PU81_METMA Dipeptide/oligopeptide-binding protein
OS=Methanosarcina mazei GN=MM_2460 PE=4 SV=1
Length = 559

Score = 34.3 bits (77), Expect = 7.5
Identities = 17/44 (38%), Positives = 26/44 (59%)
Frame = +3

Query: 261 DVHLHGSATTPPVKEGAIAPEGETPSFSDASLTHDNEQKDLPIS 392
DV G+ T ++EG + +GE + +D T D EQK++PIS
Sbjct: 108 DVSPDGTEYTVHLREGVVWSDGEPFTANDVKFTFDYEQKNVPIS 151


>tr|Q2T102|Q2T102_BURTA PhoH family protein OS=Burkholderia
thailandensis (strain E264 / ATCC 700388 / DSM 13276 /
CIP 106301) GN=BTH_I0590 PE=4 SV=1
Length = 358

Score = 33.9 bits (76), Expect = 9.8
Identities = 29/85 (34%), Positives = 37/85 (43%), Gaps = 3/85 (3%)
Frame = -2

Query: 562 PKPMGLGEMGDGSGINPEVVCCLEALPSSPFGEYVVSRGDAICCEPGAPRSSTFACWEMG 383
P+ G G GDGSG PEV P PF E VV GD + AP+ T G
Sbjct: 91 PRRNGNGN-GDGSG-QPEVDVRFRGDPDHPFDEPVVRIGDEPHADEAAPKLYTRRADLRG 148

Query: 382 RSFCSLSCVREASEKD---GVSPSG 317
R+ +++ D GV P+G
Sbjct: 149 RTPAQREYLKQILSHDVTFGVGPAG 173


>tr|B3MFC8|B3MFC8_DROAN GF12391 OS=Drosophila ananassae GN=GF12391
PE=4 SV=1
Length = 3047

Score = 33.9 bits (76), Expect = 9.8
Identities = 27/73 (36%), Positives = 31/73 (42%)
Frame = -2

Query: 400 ACWEMGRSFCSLSCVREASEKDGVSPSGAIAPSFTGGVVAEPWRCTSFSEFATAPAENGR 221
A G S SLS + A G P G+I+ S GG P S A A +G
Sbjct: 816 AAGAQGNSLQSLSSITAALGGAGGMPGGSISGS--GGTSPSP---ASAGAGAGASGGSGS 870

Query: 220 GEGGGDSEEKTGR 182
G GGG S K GR
Sbjct: 871 GSGGGSSSYKEGR 883