DK961032 |
Clone id |
TST39A01NGRL0009_A09 |
Library |
TST39 |
Length |
675 |
Definition |
Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0009_A09. 5' end sequence. |
Accession |
DK961032 |
Tissue type |
prothallia with plantlets |
Developmental stage |
gametophytes with sporophytes |
Contig ID |
- |
Sequence |
GGAACTTAGTGTTCGTCATCGGAGTGCACTGTCAAGGGGCTATGGCGCTGTTGCACAGGC GCAAGTCCTTCTCTGTCATTTCGCGCTATCGCTTATGTGGGCGCTTTTACTCTTCTCTGG ACTCTGGGAGCGTTTCTTCTAAGGCTGTCTTTGCAGGGCTTTCCGCCCCCTCTTCTTCGA CCCAGATTCATTCATTGCGGCAGCTCTTGCCGGCGATGGCGGTGGCGATGAAGAGCTCTT TAATACATAGTGCCCACAGCTACACAACATTGGCCACCTGTCTTGGAAGCCGGCATAAAA CATCTTGCTGCTACACGCACAAATGTACGTCTGAGTGTGGTTGCCTTTGTACAAGTGCAG GGGCCTTATGCATCGTGGCTAGGAACTGTACGACTACAGCATACCCTCCTCCTGCCTCTC CAGGAGAGTACGCAGGCGGAGTGGAAGAAGCTGAGGATGGGGCTCCCTTGGATGTGGAGG AAGCAGCAGGGCCCCCTATAGGAGCTCAAGACTTTAACATGCCCCCCTACGATGATTCCN AGTATGCTGAAGATGATGATGGTGACTATGAAGATGAGGACCCTCCTCCAGATCCCGAGG AGAAGCTTCTGTTCAGCCCATCAGATGAGGATTGAATACACATTTACCCTTGCTGTCGTA GTCAATGAGTAAACC |
■■Homology search results ■■ |
- |
Swiss-Prot (release 56.9) |
Link to BlastX Result : Swiss-Prot |
sp_hit_id |
Q626H5 |
Definition |
sp|Q626H5|PLX2_CAEBR Plexin-2 OS=Caenorhabditis briggsae |
Align length |
91 |
Score (bit) |
35.8 |
E-value |
0.21 |
Report |
|
TrEMBL (release 39.9) |
Link to BlastX Result : TrEMBL |
tr_hit_id |
B8MDR2 |
Definition |
tr|B8MDR2|B8MDR2_9EURO Mucin family signaling protein Msb2, putative OS=Talaromyces stipitatus ATCC 10500 |
Align length |
137 |
Score (bit) |
42.4 |
E-value |
0.027 |
Report |
BLASTX 2.2.19 [Nov-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
Query= DK961032|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0009_A09, 5' (675 letters)
Database: uniprot_trembl.fasta 7,341,751 sequences; 2,391,615,440 total letters
Searching..................................................done
Score E Sequences producing significant alignments: (bits) Value
tr|B8MDR2|B8MDR2_9EURO Mucin family signaling protein Msb2, puta... 42 0.027 tr|B4L5L3|B4L5L3_DROMO GI21746 OS=Drosophila mojavensis GN=GI217... 38 0.67 tr|Q4U8V4|Q4U8V4_THEAN Eukaryotic translation initiation factor ... 37 0.87 tr|Q4D2A9|Q4D2A9_TRYCR Dispersed gene family protein 1 (DGF-1), ... 37 0.87 tr|A1TW92|A1TW92_ACIAC Putative uncharacterized protein OS=Acido... 37 1.1 tr|Q3V4B7|Q3V4B7_MOUSE Novel member of the keratin associated pr... 37 1.5 tr|Q09DX3|Q09DX3_STIAU Putative uncharacterized protein OS=Stigm... 37 1.5 tr|Q4N1J6|Q4N1J6_THEPA Eukaryotic translation initiation factor ... 37 1.5 tr|Q4CX06|Q4CX06_TRYCR Dispersed gene family protein 1 (DGF-1), ... 37 1.5 tr|A8I9U4|A8I9U4_CHLRE Centriole proteome protein OS=Chlamydomon... 36 1.9 tr|Q4D3R4|Q4D3R4_TRYCR Putative uncharacterized protein OS=Trypa... 36 1.9 tr|Q8SS34|Q8SS34_ENCCU Ribosomal protein S27 OS=Encephalitozoon ... 36 1.9 tr|A2A5X3|A2A5X3_MOUSE Putative novel member of the keratin asso... 36 2.5 tr|Q8PLP8|Q8PLP8_XANAC Putative uncharacterized protein XAC1745 ... 35 3.3 tr|Q08S86|Q08S86_STIAU Protein kinase OS=Stigmatella aurantiaca ... 35 4.3 tr|Q4D802|Q4D802_TRYCR Dispersed gene family protein 1 (DGF-1), ... 35 4.3 tr|Q4CQB2|Q4CQB2_TRYCR Dispersed gene family protein 1 (DGF-1), ... 35 4.3 tr|A8Q2T0|A8Q2T0_MALGO Putative uncharacterized protein OS=Malas... 35 4.3 tr|Q1DAU9|Q1DAU9_MYXXD Putative uncharacterized protein OS=Myxoc... 35 5.6 tr|Q1BQV7|Q1BQV7_BURCA Putative uncharacterized protein OS=Burkh... 35 5.6 tr|A0B2X2|A0B2X2_BURCH Putative uncharacterized protein OS=Burkh... 35 5.6 tr|B0JC38|B0JC38_RHILT Putative uncharacterized protein OS=Rhizo... 35 5.6 tr|Q4D824|Q4D824_TRYCR Dispersed gene family protein 1 (DGF-1), ... 35 5.6 tr|B6QGF4|B6QGF4_PENMA Mucin family signaling protein Msb2, puta... 35 5.6 tr|A9GDZ6|A9GDZ6_SORC5 Putative uncharacterized protein OS=Soran... 34 7.4 tr|Q4DUI9|Q4DUI9_TRYCR Dispersed gene family protein 1 (DGF-1), ... 34 7.4 tr|Q4DQA7|Q4DQA7_TRYCR Dispersed gene family protein 1 (DGF-1), ... 34 7.4 tr|A2F9M0|A2F9M0_TRIVA Beige/BEACH domain containing protein OS=... 34 7.4 tr|A8P5U5|A8P5U5_COPC7 Putative uncharacterized protein OS=Copri... 34 7.4 tr|Q9IGT6|Q9IGT6_ADEP3 162R OS=Porcine adenovirus A serotype 3 G... 34 9.6
>tr|B8MDR2|B8MDR2_9EURO Mucin family signaling protein Msb2, putative OS=Talaromyces stipitatus ATCC 10500 GN=TSTA_120390 PE=4 SV=1 Length = 715
Score = 42.4 bits (98), Expect = 0.027 Identities = 40/137 (29%), Positives = 52/137 (37%), Gaps = 3/137 (2%) Frame = +3
Query: 111 SSLDSGSVSSKAVFAGLSAPSSSTQ--IHSLRQLLPAMAVAMKSSLIHSAHSYTTLATCL 284 S SG+V++K + SA SSS + S L A SS+ + HS TT T Sbjct: 339 SPSSSGAVTTKVPGSSTSASSSSASASVSSSAPLTATSATVAPSSI--AVHSSTTQTTFS 396
Query: 285 GSRHKTSCCYTHKCTSECGCLCTSAGALCIVARNCTTTAYPPPA-SPGEYAGGVEEAEDG 461 S TS T+A + TT PPP SP E + G Sbjct: 397 ASETSTS---------------TTASTATSASSTQTTQYIPPPVQSPTETSTGTTATSSS 441
Query: 462 APLDVEEAAGPPIGAQD 512 P + +A PP G D Sbjct: 442 GPTQIPQAISPPGGTPD 458
>tr|B4L5L3|B4L5L3_DROMO GI21746 OS=Drosophila mojavensis GN=GI21746 PE=4 SV=1 Length = 601
Score = 37.7 bits (86), Expect = 0.67 Identities = 32/121 (26%), Positives = 47/121 (38%), Gaps = 5/121 (4%) Frame = +3
Query: 111 SSLDSGSVSSKAVFAGLSAPSSSTQIHSLRQLLPAMAVAMKSSLIHSAHSYTTLATCLGS 290 S LD+ + ++ A A SAPSSST + + A S+ +A + +AT Sbjct: 109 SPLDAAAAAAAAAAAAASAPSSSTATTAAATVTAAATTTTTSAAAAAAAAADNIATTAAG 168
Query: 291 RHKTSC-----CYTHKCTSECGCLCTSAGALCIVARNCTTTAYPPPASPGEYAGGVEEAE 455 KT+ T TS T+A A A T+ + AS A EA Sbjct: 169 SSKTAATETTTAATTTATSTATATTTTAAAAAAAAAATTSASAAAEASAATSAASKREAS 228
Query: 456 D 458 + Sbjct: 229 E 229
>tr|Q4U8V4|Q4U8V4_THEAN Eukaryotic translation initiation factor 3 (Subunit 8), putative OS=Theileria annulata GN=TA10240 PE=4 SV=1 Length = 939
Score = 37.4 bits (85), Expect = 0.87 Identities = 34/99 (34%), Positives = 43/99 (43%) Frame = +3
Query: 3 NLVFVIGVHCQGAMALLHRRKSFSVISRYRLCGRFYSSLDSGSVSSKAVFAGLSAPSSST 182 N VFV G A LH + S L G +Y + D + S+ A S ST Sbjct: 565 NFVFVYGTQRDKIRACLHLAYNKS------LHGHYYEAKDLLAASNLTEVA--SETDIST 616
Query: 183 QIHSLRQLLPAMAVAMKSSLIHSAHSYTTLATCLGSRHK 299 QI R L A ++ LI AHSY + CL +RHK Sbjct: 617 QILVNRNLAQLGICAFRAGLISEAHSY-LMDMCLQNRHK 654
>tr|Q4D2A9|Q4D2A9_TRYCR Dispersed gene family protein 1 (DGF-1), putative OS=Trypanosoma cruzi GN=Tc00.1047053510235.10 PE=4 SV=1 Length = 3467
Score = 37.4 bits (85), Expect = 0.87 Identities = 33/126 (26%), Positives = 53/126 (42%), Gaps = 14/126 (11%) Frame = +3
Query: 87 YRLCGRFYSSLDSGSVSS----KAVFAGLSAPSSSTQIHSLRQLLPAMAVAMKSSL---- 242 ++ C + S+D SV + K GLS P S + R + + VA + Sbjct: 1507 FKDCDPTFVSIDGSSVVTLTGCKMGSTGLSGPLLSQAVAGYRFVAGCLTVAGRVLTTAAE 1566
Query: 243 --IHSAHSYTTLATCLGSRHKTSCCYTHKCTS--ECGCLCTSAGA--LCIVARNCTTTAY 404 +H ++ TT+A C G K C+ T+ +C C C + G +C+ A Sbjct: 1567 LELHGINNVTTVAAC-GECTKEGDCFAPLTTAVIDCKCQCAAGGHGDVCVPAPVPAGPPP 1625
Query: 405 PPPASP 422 PPP +P Sbjct: 1626 PPPTTP 1631
>tr|A1TW92|A1TW92_ACIAC Putative uncharacterized protein OS=Acidovorax avenae subsp. citrulli (strain AAC00-1) GN=Aave_4695 PE=4 SV=1 Length = 293
Score = 37.0 bits (84), Expect = 1.1 Identities = 27/106 (25%), Positives = 48/106 (45%) Frame = +1
Query: 67 PSLSFRAIAYVGAFTLLWTLGAFLLRLSLQGFPPPLLRPRFIHCGSSCXXXXXXXXAL*Y 246 P L+ R +A + TL+W L +++L + GFPP R + G L + Sbjct: 2 PVLTCRQLAVLVVLTLVWGLNWPVMKLGVTGFPPLTFRVLVLGLGLPLLGAVLAALRLPF 61
Query: 247 IVPTATQHWPPVLEAGIKHLAATRTNVRLSVVAFVQVQGPYASWLG 384 VP A +W P+ G+ ++ L+++A ++ A+ LG Sbjct: 62 AVPRA--YWWPLAGLGLANMVVWYV---LAILAIPELSSGRAAILG 102
>tr|Q3V4B7|Q3V4B7_MOUSE Novel member of the keratin associated protein 4 (Krtap4) family OS=Mus musculus GN=1110054P19Rik PE=2 SV=1 Length = 195
Score = 36.6 bits (83), Expect = 1.5 Identities = 16/49 (32%), Positives = 23/49 (46%) Frame = +3
Query: 279 CLGSRHKTSCCYTHKCTSECGCLCTSAGALCIVARNCTTTAYPPPASPG 425 C+ S + SCC + C +C C+ + +C C TT Y P S G Sbjct: 145 CISSCCRPSCCVSSCCRPQC-CISSCCRPICCQTTCCRTTCYRPACSSG 192
>tr|Q09DX3|Q09DX3_STIAU Putative uncharacterized protein OS=Stigmatella aurantiaca DW4/3-1 GN=STIAU_5552 PE=4 SV=1 Length = 979
Score = 36.6 bits (83), Expect = 1.5 Identities = 42/162 (25%), Positives = 62/162 (38%), Gaps = 23/162 (14%) Frame = +3
Query: 96 CGRFYSSLDSGSVSSKAVFA-----GLSAPSSST-QIHSLRQLLPAMAVAMKSSLIHSAH 257 CG F D+ S+ V A G +PSS+T + + + P A A + S Sbjct: 570 CGGFSDFCDTTGTQSRTVTASTCGTGTCSPSSTTWETQACTRAAPDTACAQPNYGEWSPC 629
Query: 258 SYTTLATCLGSRHKTSCCYTHKC-TSEC----------------GCLCTSAGALCIVARN 386 SY+ G++H+T Y + C T +C G C AG +C VA Sbjct: 630 SYSDTCAQTGTQHRTVKRYAYSCATGQCEESTFTETQACTRSTAGVQC-RAGGVCDVAEY 688
Query: 387 CTTTAYPPPASPGEYAGGVEEAEDGAPLDVEEAAGPPIGAQD 512 C+ + P A A ++ G D + AG G D Sbjct: 689 CSNGSCPADAKMPTGA-SCDDGNAGTTGDRCDGAGVCTGCGD 729
>tr|Q4N1J6|Q4N1J6_THEPA Eukaryotic translation initiation factor 3 subunit 8, putative OS=Theileria parva GN=TP04_0739 PE=4 SV=1 Length = 951
Score = 36.6 bits (83), Expect = 1.5 Identities = 33/99 (33%), Positives = 44/99 (44%) Frame = +3
Query: 3 NLVFVIGVHCQGAMALLHRRKSFSVISRYRLCGRFYSSLDSGSVSSKAVFAGLSAPSSST 182 N VFV G + A LH + S L G +Y + D + S+ A S ST Sbjct: 577 NFVFVYGTQREKIRACLHLAYNKS------LHGHYYEAKDLLAASNLTDVA--SETDIST 628
Query: 183 QIHSLRQLLPAMAVAMKSSLIHSAHSYTTLATCLGSRHK 299 QI R L A ++ LI AHSY + C+ +RHK Sbjct: 629 QILVNRNLAQLGICAFRAGLISEAHSY-LMDMCVQNRHK 666
>tr|Q4CX06|Q4CX06_TRYCR Dispersed gene family protein 1 (DGF-1), putative (Fragment) OS=Trypanosoma cruzi GN=Tc00.1047053506045.19 PE=4 SV=1 Length = 2335
Score = 36.6 bits (83), Expect = 1.5 Identities = 35/119 (29%), Positives = 49/119 (41%), Gaps = 14/119 (11%) Frame = +3
Query: 108 YSSLDSGSVSS----KAVFAGLSAPSSSTQIHSLRQLLPAMAVAMK------SSLIHSAH 257 + S+DS SV + K GLS P S + + VA + +H Sbjct: 1533 FVSIDSSSVVTLAGCKMGLTGLSRPLLSQSEAGYQFFAGCLTVAGRLVTTAAELALHGIT 1592
Query: 258 SYTTLATCLGSRHKTSCCYTHKCTSECGCLCTSA----GALCIVARNCTTTAYPPPASP 422 + TT+A C G K S C+ T+ GC C A G +C+ A A PPP +P Sbjct: 1593 NVTTVAVC-GECTKESDCFAPLTTAVIGCKCQCAAGGHGDVCVPA---PVPAGPPPPTP 1647
>tr|A8I9U4|A8I9U4_CHLRE Centriole proteome protein OS=Chlamydomonas reinhardtii GN=POC18 PE=4 SV=1 Length = 2287
Score = 36.2 bits (82), Expect = 1.9 Identities = 28/103 (27%), Positives = 38/103 (36%) Frame = +3
Query: 186 IHSLRQLLPAMAVAMKSSLIHSAHSYTTLATCLGSRHKTSCCYTHKCTSECGCLCTSAGA 365 + +R A A A S S SY+T A S + TS + TS+ AGA Sbjct: 1277 LEQVRLARSAAAAAASSGQHTSQTSYSTSAAAASSSNTTSATTSAAATSQAAASTAGAGA 1336
Query: 366 LCIVARNCTTTAYPPPASPGEYAGGVEEAEDGAPLDVEEAAGP 494 + R+ + P +PG GG A G GP Sbjct: 1337 GGVGGRDAAQSRPAPNGAPGSAVGG--HAAGGTAAAARPQTGP 1377
|