DK956057
Clone id TST39A01NGRL0024_L22
Library
Length 272
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0024_L22. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID -
Sequence
GTCTATCATTGTGGCTTGACTAGCTTTGGATCAGCTGACCGCGCCGTAAAGACTCCCGTG
GGTTCTACTGCAGCGCTCGTTGAAGCTTATAACCGCGCCGCATTTACACACCCTCGATCA
AAGTCCTTGTCAACATGGACATGAGCTTACATTCAATGGAGGTTGTCAATAGACTCACAA
CGCATGTAGACTTACCTACAGAGTTAGTACTCATGTATATTTCCAACTGTACATCTTCTT
GCGAAAATATCAAGGACAAGTACATGCAAAAT
■■Homology search results ■■ -
sp_hit_id Q9CWN7
Definition sp|Q9CWN7|CB029_MOUSE Uncharacterized protein C2orf29 homolog OS=Mus musculus
Align length 55
Score (bit) 87.4
E-value 2.0e-17
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK956057|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0024_L22, 5'
(272 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

sp|Q9CWN7|CB029_MOUSE Uncharacterized protein C2orf29 homolog OS...    87   2e-17
sp|Q9UKZ1|CB029_HUMAN Uncharacterized protein C2orf29 OS=Homo sa...    87   2e-17
sp|Q7S3S2|MIA40_NEUCR Mitochondrial intermembrane space import a...    31   1.7
sp|P14232|TGA1A_TOBAC TGACG-sequence-specific DNA-binding protei...    30   3.8
sp|P13429|SFAG_ECOL5 S-fimbrial adhesin protein sfaG OS=Escheric...    30   4.9
sp|Q54HP3|DRG1_DICDI Developmentally-regulated GTP-binding prote...    30   4.9
sp|Q6CUC6|TAF4_KLULA Transcription initiation factor TFIID subun...    29   6.4
sp|A9WSW6|EFG_RENSM Elongation factor G OS=Renibacterium salmoni...    29   6.4

>sp|Q9CWN7|CB029_MOUSE Uncharacterized protein C2orf29 homolog
OS=Mus musculus GN=D1Bwg0212e PE=2 SV=1
Length = 505

Score = 87.4 bits (215), Expect = 2e-17
Identities = 42/55 (76%), Positives = 45/55 (81%)
Frame = +3

Query: 108 TPSIKVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
T VLVNMDMSLHSMEVVNRLTT VDLP E + +YISNC S+CE IKDKYMQN
Sbjct: 387 TEYFSVLVNMDMSLHSMEVVNRLTTAVDLPPEFIHLYISNCISTCEQIKDKYMQN 441
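
The `Frame = +3` line and the query coordinates 108 to 272 refer to nucleotide positions in the EST, read in the third forward reading frame. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and using a hypothetical helper name `translate_region`, the query line of the alignment above can be reproduced from the sequence in this record:

```python
from itertools import product

# The 272-nt EST sequence from this record (DK956057).
EST = (
    "GTCTATCATTGTGGCTTGACTAGCTTTGGATCAGCTGACCGCGCCGTAAAGACTCCCGTG"
    "GGTTCTACTGCAGCGCTCGTTGAAGCTTATAACCGCGCCGCATTTACACACCCTCGATCA"
    "AAGTCCTTGTCAACATGGACATGAGCTTACATTCAATGGAGGTTGTCAATAGACTCACAA"
    "CGCATGTAGACTTACCTACAGAGTTAGTACTCATGTATATTTCCAACTGTACATCTTCTT"
    "GCGAAAATATCAAGGACAAGTACATGCAAAAT"
)

# Standard genetic code (assumed), codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_region(seq, start, end):
    """Translate nucleotides start..end (1-based, inclusive) codon by codon."""
    region = seq[start - 1:end]
    return "".join(
        CODON_TABLE[region[i:i + 3]] for i in range(0, len(region) - 2, 3)
    )


# Forward reading frame implied by the start coordinate (1, 2, or 3).
frame = (108 - 1) % 3 + 1
query_peptide = translate_region(EST, 108, 272)
print(frame, query_peptide)
```

The same helper applied to positions 1 to 171 gives the frame +1 translation used in the lower-scoring hits, including the `*` stop codon shown in those query lines.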


>sp|Q9UKZ1|CB029_HUMAN Uncharacterized protein C2orf29 OS=Homo
sapiens GN=C2orf29 PE=1 SV=1
Length = 510

Score = 87.4 bits (215), Expect = 2e-17
Identities = 42/55 (76%), Positives = 45/55 (81%)
Frame = +3

Query: 108 TPSIKVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
T VLVNMDMSLHSMEVVNRLTT VDLP E + +YISNC S+CE IKDKYMQN
Sbjct: 392 TEYFSVLVNMDMSLHSMEVVNRLTTAVDLPPEFIHLYISNCISTCEQIKDKYMQN 446


>sp|Q7S3S2|MIA40_NEUCR Mitochondrial intermembrane space import and
assembly protein 40 OS=Neurospora crassa GN=mia-40 PE=3
SV=1
Length = 298

Score = 31.2 bits (69), Expect = 1.7
Identities = 17/57 (29%), Positives = 27/57 (47%)
Frame = +1

Query: 1 VYHCGLTSFGSADRAVKTPVGSTAALVEAYNRAAFTHPRSKSLSTWT*AYIQWRLSI 171
+Y L SA RA+++ +A + R A T K STW A ++W L++
Sbjct: 1 MYRTALRPSQSALRAIRSTTSPSALVSSGARRFASTTSAPKKKSTWKGAAVRWGLAV 57


>sp|P14232|TGA1A_TOBAC TGACG-sequence-specific DNA-binding protein
TGA-1A OS=Nicotiana tabacum GN=TGA1A PE=1 SV=1
Length = 359

Score = 30.0 bits (66), Expect = 3.8
Identities = 14/37 (37%), Positives = 21/37 (56%)
Frame = +3

Query: 159 EVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQ 269
E++ LT H++L TE L + N T SC+ +D Q
Sbjct: 229 ELLKVLTPHLELLTEQQLREVCNLTQSCQQAEDALSQ 265


>sp|P13429|SFAG_ECOL5 S-fimbrial adhesin protein sfaG OS=Escherichia
coli O6:K15:H31 (strain 536 / UPEC) GN=sfaG PE=3 SV=1
Length = 175

Score = 29.6 bits (65), Expect = 4.9
Identities = 17/33 (51%), Positives = 20/33 (60%), Gaps = 5/33 (15%)
Frame = +1

Query: 1 VYHCGLTSFGSADRAVK-----TPVGSTAALVE 84
V+H GLTS GSA RAVK TP A L++
Sbjct: 75 VFHVGLTSCGSAVRAVKLTFTGTPDNQEAGLIQ 107
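
The `Identities`, `Positives`, and `Gaps` fields give counts over the alignment length followed by a percentage. Judging only from the values in this record, the percentage appears to be truncated rather than rounded (20/33 is about 60.6% but is reported as 60%). A minimal Python sketch for extracting these fields, where the regular expression and the name `parse_stats` are illustrative assumptions rather than part of BLAST:

```python
import re

# Matches fields such as "Identities = 17/33 (51%)" in a BLAST statistics line.
STATS = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (count, alignment_length, reported_percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STATS.findall(line)
    }


line = "Identities = 17/33 (51%), Positives = 20/33 (60%), Gaps = 5/33 (15%)"
stats = parse_stats(line)
for name, (count, length, reported) in stats.items():
    # Integer division drops the fractional part, matching the reported value here.
    print(name, count, length, reported, 100 * count // length)
```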


>sp|Q54HP3|DRG1_DICDI Developmentally-regulated GTP-binding protein
1 homolog OS=Dictyostelium discoideum GN=drg1 PE=3 SV=1
Length = 370

Score = 29.6 bits (65), Expect = 4.9
Identities = 18/41 (43%), Positives = 21/41 (51%)
Frame = -2

Query: 202 SVGKSTCVVSLLTTSIECKLMSMLTRTLIEGV*MRRGYKLQ 80
SVGKST + L TS E T T I GV +G K+Q
Sbjct: 74 SVGKSTLLTKLTGTSSEVASYEFTTLTCIPGVINYKGAKIQ 114


>sp|Q6CUC6|TAF4_KLULA Transcription initiation factor TFIID subunit
4 OS=Kluyveromyces lactis GN=TAF4 PE=3 SV=1
Length = 366

Score = 29.3 bits (64), Expect = 6.4
Identities = 15/50 (30%), Positives = 28/50 (56%)
Frame = -2

Query: 193 KSTCVVSLLTTSIECKLMSMLTRTLIEGV*MRRGYKLQRALQ*NPRESLR 44
K+T ++ L++T+ E + ++T +LI + R+G KL + SLR
Sbjct: 188 KNTDILGLMSTACELYMRDVITNSLILSIHRRKGVKLNTGRRSEVSRSLR 237


>sp|A9WSW6|EFG_RENSM Elongation factor G OS=Renibacterium
salmoninarum (strain ATCC 33209 / DSM 20767 / IFO 15589)
GN=fusA PE=3 SV=1
Length = 704

Score = 29.3 bits (64), Expect = 6.4
Identities = 10/24 (41%), Positives = 17/24 (70%)
Frame = +2

Query: 191 LTYRVSTHVYFQLYIFLRKYQGQV 262
L ++V+TH +F +F+R Y GQ+
Sbjct: 320 LAFKVATHPFFGQLVFIRVYSGQI 343


tr_hit_id A9T502
Definition tr|A9T502|A9T502_PHYPA Predicted protein OS=Physcomitrella patens subsp. patens
Align length 55
Score (bit) 94.0
E-value 2.0e-18
Report
BLASTX 2.2.19 [Nov-02-2008]



Query= DK956057|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0024_L22, 5'
(272 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

tr|A9T502|A9T502_PHYPA Predicted protein OS=Physcomitrella paten...    94   2e-18
tr|A9SFI3|A9SFI3_PHYPA Predicted protein OS=Physcomitrella paten...    92   7e-18
tr|A7QR14|A7QR14_VITVI Chromosome undetermined scaffold_147, who...    88   9e-18
tr|A9NXQ8|A9NXQ8_PICSI Putative uncharacterized protein OS=Picea...    92   1e-17
tr|B6MF65|B6MF65_BRAFL Putative uncharacterized protein OS=Branc...    91   3e-17
tr|B6MDT4|B6MDT4_BRAFL Putative uncharacterized protein OS=Branc...    91   3e-17
tr|A9SIG7|A9SIG7_PHYPA Predicted protein OS=Physcomitrella paten...    88   5e-17
tr|A4QP78|A4QP78_DANRE Zgc:163002 protein OS=Danio rerio GN=zgc:...    87   3e-16
tr|Q8C9T0|Q8C9T0_MOUSE Putative uncharacterized protein (Fragmen...    87   3e-16
tr|B0BNA9|B0BNA9_RAT Similar to DNA segment, Chr 1, Brigham & Wo...    87   3e-16
tr|A9UP30|A9UP30_MONBE Predicted protein OS=Monosiga brevicollis...    87   3e-16
tr|B3KNB0|B3KNB0_HUMAN cDNA FLJ14159 fis, clone NT2RM2001360, hi...    87   3e-16
tr|B7QNU9|B7QNU9_IXOSC Putative uncharacterized protein (Fragmen...    86   9e-16
tr|A7RPS5|A7RPS5_NEMVE Predicted protein OS=Nematostella vectens...    85   2e-15
tr|Q16W68|Q16W68_AEDAE Putative uncharacterized protein OS=Aedes...    85   2e-15
tr|B0WAH0|B0WAH0_CULQU Putative uncharacterized protein OS=Culex...    84   3e-15
tr|B8BNK5|B8BNK5_ORYSI Putative uncharacterized protein OS=Oryza...    84   4e-15
tr|Q2QWE6|Q2QWE6_ORYSJ Expressed protein (Putative uncharacteriz...    84   5e-15
tr|Q0IPI4|Q0IPI4_ORYSJ Os12g0197900 protein (Fragment) OS=Oryza ...    84   5e-15
tr|B3S5S5|B3S5S5_TRIAD Putative uncharacterized protein OS=Trich...    82   1e-14
tr|B4LP10|B4LP10_DROVI GJ20443 OS=Drosophila virilis GN=GJ20443 ...    80   5e-14
tr|B4KLX2|B4KLX2_DROMO GI20696 OS=Drosophila mojavensis GN=GI206...    80   5e-14
tr|B4J671|B4J671_DROGR GH21707 OS=Drosophila grimshawi GN=GH2170...    80   5e-14
tr|Q0ITL9|Q0ITL9_ORYSJ Os11g0240900 protein OS=Oryza sativa subs...    79   6e-14
tr|B8BJU8|B8BJU8_ORYSI Putative uncharacterized protein OS=Oryza...    79   6e-14
tr|A3CA32|A3CA32_ORYSJ Putative uncharacterized protein OS=Oryza...    79   6e-14
tr|Q9W1C6|Q9W1C6_DROME CG13567, isoform B OS=Drosophila melanoga...    80   7e-14
tr|Q8MLP8|Q8MLP8_DROME CG13567, isoform A OS=Drosophila melanoga...    80   7e-14
tr|B4QBH3|B4QBH3_DROSI GD24978 OS=Drosophila simulans GN=GD24978...    80   7e-14
tr|B4PAS3|B4PAS3_DROYA GE11493 OS=Drosophila yakuba GN=GE11493 P...    80   7e-14

>tr|A9T502|A9T502_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_168095 PE=4 SV=1
Length = 430

Score = 94.0 bits (232), Expect(2) = 2e-18
Identities = 46/55 (83%), Positives = 48/55 (87%)
Frame = +3

Query: 108 TPSIKVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
T K+LVNMD+SLHSMEVVNRLTT VDLPTE V MYISNC SSCENIKDKYMQN
Sbjct: 323 TDYFKMLVNMDISLHSMEVVNRLTTTVDLPTEFVHMYISNCISSCENIKDKYMQN 377



Score = 21.6 bits (44), Expect(2) = 2e-18
Identities = 11/36 (30%), Positives = 16/36 (44%)
Frame = +1

Query: 1 VYHCGLTSFGSADRAVKTPVGSTAALVEAYNRAAFT 108
V+HCGLT + P + A L++ N T
Sbjct: 288 VFHCGLTPRRLPELVDHNPFIAVAVLLKLMNSNQIT 323


>tr|A9SFI3|A9SFI3_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_184387 PE=4 SV=1
Length = 430

Score = 92.0 bits (227), Expect(2) = 7e-18
Identities = 44/55 (80%), Positives = 49/55 (89%)
Frame = +3

Query: 108 TPSIKVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
T +K+LVNMD+SLHSMEVVNRLT+ VDLPTE V MYISNC SSCENIKDKY+QN
Sbjct: 323 TDYLKMLVNMDISLHSMEVVNRLTSTVDLPTEFVHMYISNCISSCENIKDKYLQN 377



Score = 21.9 bits (45), Expect(2) = 7e-18
Identities = 11/36 (30%), Positives = 16/36 (44%)
Frame = +1

Query: 1 VYHCGLTSFGSADRAVKTPVGSTAALVEAYNRAAFT 108
VYHCGLT + +P + L++ N T
Sbjct: 288 VYHCGLTPRRLPELVNNSPFIAVEVLLKLMNSNQIT 323


>tr|A7QR14|A7QR14_VITVI Chromosome undetermined scaffold_147, whole
genome shotgun sequence OS=Vitis vinifera
GN=GSVIVT00003636001 PE=4 SV=1
Length = 184

Score = 87.8 bits (216), Expect(2) = 9e-18
Identities = 41/51 (80%), Positives = 47/51 (92%)
Frame = +3

Query: 120 KVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
+VLV+MDMSLHSMEVVNRLTT V+LPTE V MYI+NC SSC+NIKD+YMQN
Sbjct: 81 RVLVSMDMSLHSMEVVNRLTTAVELPTEFVHMYITNCISSCQNIKDRYMQN 131



Score = 25.8 bits (55), Expect(2) = 9e-18
Identities = 12/33 (36%), Positives = 16/33 (48%)
Frame = +1

Query: 1 VYHCGLTSFGSADRAVKTPVGSTAALVEAYNRA 99
VYHCGLT D P+ + L++ N A
Sbjct: 42 VYHCGLTPRKLPDLVENNPLIAVEVLIKLINSA 74


>tr|A9NXQ8|A9NXQ8_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 433

Score = 92.0 bits (227), Expect(2) = 1e-17
Identities = 44/51 (86%), Positives = 46/51 (90%)
Frame = +3

Query: 120 KVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
KVLVNMDMSLHSMEVVNRLTT VDLPT+ + YISNC SSCENIKDKYMQN
Sbjct: 331 KVLVNMDMSLHSMEVVNRLTTAVDLPTQFIHTYISNCISSCENIKDKYMQN 381



Score = 21.2 bits (43), Expect(2) = 1e-17
Identities = 10/31 (32%), Positives = 15/31 (48%)
Frame = +1

Query: 1 VYHCGLTSFGSADRAVKTPVGSTAALVEAYN 93
VYHCGL+ + PV + L++ N
Sbjct: 292 VYHCGLSPRLLPELVENNPVIAVEVLLKLMN 322


>tr|B6MF65|B6MF65_BRAFL Putative uncharacterized protein
OS=Branchiostoma floridae GN=BRAFLDRAFT_122946 PE=4 SV=1
Length = 446

Score = 90.9 bits (224), Expect = 3e-17
Identities = 44/55 (80%), Positives = 47/55 (85%)
Frame = +3

Query: 108 TPSIKVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
T VLVNM+MSLHSMEVVNRLTT VDLPTE V +YISNC S+CENIKDKYMQN
Sbjct: 330 TEYFSVLVNMEMSLHSMEVVNRLTTAVDLPTEFVHLYISNCISTCENIKDKYMQN 384


>tr|B6MDT4|B6MDT4_BRAFL Putative uncharacterized protein
OS=Branchiostoma floridae GN=BRAFLDRAFT_224116 PE=4 SV=1
Length = 441

Score = 90.9 bits (224), Expect = 3e-17
Identities = 44/55 (80%), Positives = 47/55 (85%)
Frame = +3

Query: 108 TPSIKVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
T VLVNM+MSLHSMEVVNRLTT VDLPTE V +YISNC S+CENIKDKYMQN
Sbjct: 325 TEYFSVLVNMEMSLHSMEVVNRLTTAVDLPTEFVHLYISNCISTCENIKDKYMQN 379


>tr|A9SIG7|A9SIG7_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_185306 PE=4 SV=1
Length = 430

Score = 88.2 bits (217), Expect(2) = 5e-17
Identities = 42/51 (82%), Positives = 46/51 (90%)
Frame = +3

Query: 120 KVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
K+LVNMD+SLHSMEVVNRLTT VDLPTE V MY+SNC SSCENIKDK +QN
Sbjct: 327 KMLVNMDISLHSMEVVNRLTTAVDLPTEFVHMYVSNCISSCENIKDKCLQN 377



Score = 22.7 bits (47), Expect(2) = 5e-17
Identities = 8/8 (100%), Positives = 8/8 (100%)
Frame = +1

Query: 1 VYHCGLTS 24
VYHCGLTS
Sbjct: 288 VYHCGLTS 295


>tr|A4QP78|A4QP78_DANRE Zgc:163002 protein OS=Danio rerio
GN=zgc:163002 PE=2 SV=1
Length = 445

Score = 87.4 bits (215), Expect = 3e-16
Identities = 42/55 (76%), Positives = 45/55 (81%)
Frame = +3

Query: 108 TPSIKVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
T VLVNMDMSLHSMEVVNRLTT VDLP E + +YISNC S+CE IKDKYMQN
Sbjct: 327 TEYFSVLVNMDMSLHSMEVVNRLTTAVDLPPEFIHLYISNCISTCEQIKDKYMQN 381


>tr|Q8C9T0|Q8C9T0_MOUSE Putative uncharacterized protein (Fragment)
OS=Mus musculus GN=D1Bwg0212e PE=2 SV=1
Length = 292

Score = 87.4 bits (215), Expect = 3e-16
Identities = 42/55 (76%), Positives = 45/55 (81%)
Frame = +3

Query: 108 TPSIKVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
T VLVNMDMSLHSMEVVNRLTT VDLP E + +YISNC S+CE IKDKYMQN
Sbjct: 174 TEYFSVLVNMDMSLHSMEVVNRLTTAVDLPPEFIHLYISNCISTCEQIKDKYMQN 228


>tr|B0BNA9|B0BNA9_RAT Similar to DNA segment, Chr 1, Brigham &
Womens Genetics 0212 expressed (Similar to DNA segment,
Chr 1, Brigham & Womens Genetics 0212 expressed
(Predicted), isoform CRA_a) OS=Rattus norvegicus
GN=RGD1560909 PE=2 SV=1
Length = 504

Score = 87.4 bits (215), Expect = 3e-16
Identities = 42/55 (76%), Positives = 45/55 (81%)
Frame = +3

Query: 108 TPSIKVLVNMDMSLHSMEVVNRLTTHVDLPTELVLMYISNCTSSCENIKDKYMQN 272
T VLVNMDMSLHSMEVVNRLTT VDLP E + +YISNC S+CE IKDKYMQN
Sbjct: 386 TEYFSVLVNMDMSLHSMEVVNRLTTAVDLPPEFIHLYISNCISTCEQIKDKYMQN 440