DK960483 |
Clone id |
TST39A01NGRL0007_I18 |
Library |
TST39 |
Length |
648 |
Definition |
Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0007_I18. 5' end sequence. |
Accession |
DK960483 |
Tissue type |
prothallia with plantlets |
Developmental stage |
gametophytes with sporophytes |
Contig ID |
CL4230Contig1 |
Sequence |
CTTCTATGGAAGGCACAGATAAACATTTCCCAATATCCCTTTTAGGGACGCAAATACAGG TTTTCACTGCAGACATAAAGTCAGAGGGGACAGGAAAGCAGGTAGTTGAAATGTCAAGTA CTGACCCAGTCCAAACAGAAATACCTTCACAAGGGACAGGAAAGGAGGTACTCGAAATGT CAAGTACTTACCCATTCAAAGCAGATCCAGGTACAGAGACTTTACCAAGCGCTCCTTTGC TTGGCGCGTCAGAAGCAGCAGGGCAGACCGATGTCTCAGGAGCAGAGAAGCCAGATGAAC AGAAAACTGGAAATGCTGATGATGTACCTGAGACAAAGGAGCTTGATGCATTGGAGGCCA GGGAACGTGATATGACAAAAGCAGGGGATGAGGGGCGGTCTTTGCCAGACATGGGAACGG AGATCCCCCCTCTGGACTTGTCTCAAGCAGCACGGGAAGAATCAAATCAGCAGGGAATTG ATTCCAAGAGTAAGATAGAAAAAGGAGAAATCACAGCACATAAACAAGAGGGTGCTTCGA AGACATTGGGTTTAGTTACAAAAGATCCATTGCCAGTAGTAGGAGGAGGAGGGCAGAAGG TTGTGCCACCCAACGAACTACGGGCTTCCCCTTCTCAGGCACAAGGTA |
■■Homology search results ■■ |
- |
Swiss-Prot (release 56.9) |
Link to BlastX Result : Swiss-Prot |
sp_hit_id |
Q13428 |
Definition |
sp|Q13428|TCOF_HUMAN Treacle protein OS=Homo sapiens |
Align length |
194 |
Score (bit) |
40.0 |
E-value |
0.01 |
Report |
BLASTX 2.2.19 [Nov-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
Query= DK960483|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0007_I18, 5' (648 letters)
Database: uniprot_sprot.fasta 412,525 sequences; 148,809,765 total letters
Searching..................................................done
Score E Sequences producing significant alignments: (bits) Value
sp|Q13428|TCOF_HUMAN Treacle protein OS=Homo sapiens GN=TCOF1 PE... 40 0.010 sp|Q6PGL7|FAM21_MOUSE Protein FAM21 OS=Mus musculus GN=Fam21 PE=... 38 0.040 sp|P17691|NEUM_CARAU Neuromodulin OS=Carassius auratus GN=gap43 ... 38 0.052 sp|P19334|TRP_DROME Transient receptor potential protein OS=Dros... 36 0.15 sp|Q54UW4|Y0777_DICDI Bromodomain-containing protein DDB_G028077... 35 0.34 sp|P49321|NASP_HUMAN Nuclear autoantigenic sperm protein OS=Homo... 35 0.44 sp|P31568|YCF2_OENPI Protein ycf2 (Fragment) OS=Oenothera picens... 34 0.58 sp|Q28062|PGCB_BOVIN Brevican core protein OS=Bos taurus GN=BCAN... 34 0.58 sp|Q86X53|ERIC1_HUMAN Glutamate-rich protein 1 OS=Homo sapiens G... 34 0.58 sp|Q14524|SCN5A_HUMAN Sodium channel protein type 5 subunit alph... 34 0.75 sp|Q4L3F8|RL25_STAHJ 50S ribosomal protein L25 OS=Staphylococcus... 33 0.98 sp|P48997|INVO_MOUSE Involucrin OS=Mus musculus GN=Ivl PE=1 SV=1 33 0.99 sp|Q8WXI7|MUC16_HUMAN Mucin-16 OS=Homo sapiens GN=MUC16 PE=1 SV=2 33 1.7 sp|Q06277|IBPA_HAES2 High molecular weight immunoglobulin-bindin... 33 1.7 sp|Q60432|HYOU1_CRIGR Hypoxia up-regulated protein 1 OS=Cricetul... 33 1.7 sp|P34616|CADH3_CAEEL Cadherin-3 OS=Caenorhabditis elegans GN=cd... 33 1.7 sp|Q86AF2|Y6864_DICDI Putative uncharacterized protein DDB_G0271... 33 1.7 sp|B0Z587|YCF2_OENGL Protein ycf2 OS=Oenothera glazioviana GN=yc... 32 2.2 sp|O88974|SETB1_MOUSE Histone-lysine N-methyltransferase SETDB1 ... 32 2.2 sp|P77161|GLXR_ECOLI 2-hydroxy-3-oxopropionate reductase OS=Esch... 32 2.2 sp|Q92994|TF3B_HUMAN Transcription factor IIIB 90 kDa subunit OS... 32 2.9 sp|Q3UQS8|RBM20_MOUSE Probable RNA-binding protein 20 OS=Mus mus... 32 2.9 sp|Q9W2U4|PP4R2_DROME Serine/threonine-protein phosphatase 4 reg... 32 2.9 sp|Q9HCI5|MAGE1_HUMAN Melanoma-associated antigen E1 OS=Homo sap... 32 2.9 sp|Q8K201|KCT2_MOUSE Keratinocytes-associated transmembrane prot... 32 2.9 sp|P14922|CYC8_YEAST General transcriptional corepressor CYC8 OS... 32 2.9 sp|Q69YN4|VIR_HUMAN Protein virilizer homolog OS=Homo sapiens GN... 32 3.7 sp|Q5NVP7|NEUM_PONAB Neuromodulin OS=Pongo abelii GN=GAP43 PE=2 ... 32 3.7 sp|Q5IS67|NEUM_PANTR Neuromodulin OS=Pan troglodytes GN=GAP43 PE... 32 3.7 sp|Q95K78|NEUM_MACFA Neuromodulin OS=Macaca fascicularis GN=GAP4... 32 3.7
>sp|Q13428|TCOF_HUMAN Treacle protein OS=Homo sapiens GN=TCOF1 PE=1 SV=2 Length = 1488
Score = 40.0 bits (92), Expect = 0.010 Identities = 47/194 (24%), Positives = 70/194 (36%), Gaps = 24/194 (12%) Frame = +3
Query: 39 LLGTQIQVFTADIKSEGTGKQVVEMSSTDPVQTEIPSQGTGKEVLE-MSSTYPFK-ADPG 212 L T V AD+ S K E +P TGK V +S P K A+P Sbjct: 105 LASTNSSVLGADLPSSMKEKAKAETEKAGKTGNSMPHPATGKTVANLLSGKSPRKSAEPS 164
Query: 213 --------TETLPSAPLLGASEAAGQTDVSGAEKPDEQKTGNADDV---------PETKE 341 TE S P GA+ G A+ E + ++D+ P + Sbjct: 165 ANTTLVSETEEEGSVPAFGAAAKPGMVSAGQADSSSEDTSSSSDETDVEGKPSVKPAQVK 224
Query: 342 LDALEARERDMTKA----GDEGRSLPDM-GTEIPPLDLSQAAREESNQQGIDSKSKIEKG 506 ++ +E KA G G P + G +PP ++ EES S+S+ E Sbjct: 225 ASSVSTKESPARKAAPAPGKVGDVTPQVKGGALPPAKRAKKPEEESESSEEGSESEEEAP 284
Query: 507 EITAHKQEGASKTL 548 T + + + K L Sbjct: 285 AGTRSQVKASEKIL 298
>sp|Q6PGL7|FAM21_MOUSE Protein FAM21 OS=Mus musculus GN=Fam21 PE=1 SV=1 Length = 1334
Score = 38.1 bits (87), Expect = 0.040 Identities = 36/134 (26%), Positives = 56/134 (41%), Gaps = 3/134 (2%) Frame = +3
Query: 156 TGKEVLEMSSTYPFKADPGTETLPSAPLLGASEAAGQTDVSGAEKPDEQKTGNADDVPET 335 +G + + T A E++P PLL + E ++V KP++ K NA PE Sbjct: 693 SGSSLFGLPPTSVPSATTKKESVPKVPLLFSDEE--DSEVPSGVKPEDLKVDNARVSPEV 750
Query: 336 KELD-ALEARERDMTKAGDEGRSLP-DMGTEIPPLDLSQAAREESNQQGID-SKSKIEKG 506 D A A++ + A D+ P D+ + PLD R + D + K+E Sbjct: 751 GSADVASIAQKEGLLPASDQEAGGPSDIFSSSSPLDKGAKGRTRTVLSLFDEDEDKVEDE 810
Query: 507 EITAHKQEGASKTL 548 T Q+G K L Sbjct: 811 SSTCAPQDGREKGL 824
>sp|P17691|NEUM_CARAU Neuromodulin OS=Carassius auratus GN=gap43 PE=2 SV=1 Length = 213
Score = 37.7 bits (86), Expect = 0.052 Identities = 32/147 (21%), Positives = 59/147 (40%), Gaps = 1/147 (0%) Frame = +3
Query: 66 TADIKSEGTGKQVVEMSSTDPVQTEIPSQGTGKEVLEMSSTYPFKADPGTETLPSAPLLG 245 TA +S T ++ +S ++ E+ ++ + P P T SAP Sbjct: 63 TAPDESAETEEKEERVSPSEEKPVEVSTETAEESKPAEQPNSPAAEAPPTAATDSAPSDT 122
Query: 246 ASEAAGQTDVSGAEKPDE-QKTGNADDVPETKELDALEARERDMTKAGDEGRSLPDMGTE 422 ++ Q + AE+P E + T ADD+ KE + E E + + + +PD + Sbjct: 123 PTKEEAQEQLQDAEEPKETENTAAADDITTQKEEEKEEEEEEEEEEEEAKRADVPD---D 179
Query: 423 IPPLDLSQAAREESNQQGIDSKSKIEK 503 P SQ + ++ +D E+ Sbjct: 180 TPAATESQETDQTDKKEALDDSKPAEE 206
>sp|P19334|TRP_DROME Transient receptor potential protein OS=Drosophila melanogaster GN=trp PE=1 SV=3 Length = 1275
Score = 36.2 bits (82), Expect = 0.15 Identities = 38/148 (25%), Positives = 60/148 (40%), Gaps = 9/148 (6%) Frame = +3
Query: 78 KSEGTGKQVVEMSSTDPVQTEIPSQGTGKEVLEMSSTYPFKADPGTETLPSAPLLGASEA 257 K E K+ E SS + G K + + P A P ++ P A GA EA Sbjct: 1083 KPEAAAKK--EESSKTEASKPAATNGAAKSA---APSAPSDAKPDSKLKPGAA--GAPEA 1135
Query: 258 AGQTDVSGAEKPDEQKTG-------NADDVP--ETKELDALEARERDMTKAGDEGRSLPD 410 T+ GA KPDE+K+G D P + K+ D ++D D+ + D Sbjct: 1136 TKATN--GASKPDEKKSGPEEPKKAAGDSKPGDDAKDKDKKPGDDKDKKPGDDKDKKPAD 1193
Query: 411 MGTEIPPLDLSQAAREESNQQGIDSKSK 494 + P D + ++ +++ D K K Sbjct: 1194 NNDKKPADDKDKKPGDDKDKKPGDDKDK 1221
>sp|Q54UW4|Y0777_DICDI Bromodomain-containing protein DDB_G0280777 OS=Dictyostelium discoideum GN=DDB_G0280777 PE=4 SV=1 Length = 1823
Score = 35.0 bits (79), Expect = 0.34 Identities = 21/86 (24%), Positives = 45/86 (52%) Frame = +1
Query: 82 QRGQESR*LKCQVLTQSKQKYLHKGQERRYSKCQVLTHSKQIQVQRLYQALLCLARQKQQ 261 Q+ Q+ + + Q Q +Q+ + Q+++ + Q L H +Q+Q+Q+ Q L +Q+QQ Sbjct: 1503 QQLQQQQLFQQQQQQQQQQQQQQQQQQQQQQQQQQLLHPQQMQIQQNLQQPLQQIQQQQQ 1562
Query: 262 GRPMSQEQRSQMNRKLEMLMMYLRQR 339 + Q+Q Q ++ + +Q+ Sbjct: 1563 IQQQQQQQLQQQQQQQQQQQQQQQQQ 1588
Score = 33.1 bits (74), Expect = 1.3 Identities = 19/77 (24%), Positives = 41/77 (53%) Frame = +1
Query: 115 QVLTQSKQKYLHKGQERRYSKCQVLTHSKQIQVQRLYQALLCLARQKQQGRPMSQEQRSQ 294 Q+L+Q +Q+ Q++ L +Q+Q Q+L Q L +Q+QQ + Q+Q+ Q Sbjct: 1477 QILSQPQQQLQQLQQQQ-------LQQQQQLQQQQLQQQQLFQQQQQQQQQQQQQQQQQQ 1529
Query: 295 MNRKLEMLMMYLRQRSL 345 ++ + +++ +Q + Sbjct: 1530 QQQQQQQQLLHPQQMQI 1546
>sp|P49321|NASP_HUMAN Nuclear autoantigenic sperm protein OS=Homo sapiens GN=NASP PE=1 SV=2 Length = 788
Score = 34.7 bits (78), Expect = 0.44 Identities = 36/147 (24%), Positives = 63/147 (42%), Gaps = 21/147 (14%) Frame = +3
Query: 126 PVQTEIPSQGTGKEVLEMSSTYPFKADPGTETLPSAPLLGASEAAGQTDVSGAEKPDE-- 299 P + E+ S GK E+ K+ GT+ G E G+ VS EKP E Sbjct: 222 PNEAEVTS---GKPEQEVPDAEEEKSVSGTDVQEECREKGGQEKQGEVIVSIEEKPKEVS 278
Query: 300 --------QKTGNADDV------PETKELD--ALEARERDMTKAGDEGRSLPD--MGTEI 425 +K G A +V P K +D E E+ +T + G+++ + +G E+ Sbjct: 279 EEQPVVTLEKQGTAVEVEAESLDPTVKPVDVGGDEPEEKVVTSENEAGKAVLEQLVGQEV 338
Query: 426 PPLDLS-QAAREESNQQGIDSKSKIEK 503 PP + S + E + +++ S++ + Sbjct: 339 PPAEESPEVTTEAAEASAVEAGSEVSE 365
>sp|P31568|YCF2_OENPI Protein ycf2 (Fragment) OS=Oenothera picensis GN=ycf2 PE=3 SV=1 Length = 721
Score = 34.3 bits (77), Expect = 0.58 Identities = 42/183 (22%), Positives = 76/183 (41%), Gaps = 15/183 (8%) Frame = +3
Query: 6 MEGTDKHFPISLLGTQIQVFTADIKSEGTGKQVVEMSSTDPVQTEIPSQGTGKEVLEMSS 185 +EGT++ GT+ +V + + EGT + VE + + TE +GT +EV Sbjct: 186 VEGTEEEVE----GTEEEVEGTEEEVEGTEDEEVEGTEEEVEGTEEEVEGTEEEVEGTEE 241
Query: 186 TYPFKADPGTETLPSAPLLGASEAAGQT--DVSGAEK------------PDEQKTGNADD 323 D E + G E T +V G E+ DE+ G ++ Sbjct: 242 EVEGTEDEEVEGTEDEEVEGTEEEVEGTEEEVEGTEEEVEGTEEEVEGTEDEEVEGTEEE 301
Query: 324 VPET-KELDALEARERDMTKAGDEGRSLPDMGTEIPPLDLSQAAREESNQQGIDSKSKIE 500 V T +E++ E E + T+ EG GTE ++ ++ E + ++ ++ ++E Sbjct: 302 VEGTEEEVEGTEDEEVEGTEEEVEGTEEEVEGTEDEEVEGTEEEVEGTEEEVEGTEEEVE 361
Query: 501 KGE 509 E Sbjct: 362 GTE 364
>sp|Q28062|PGCB_BOVIN Brevican core protein OS=Bos taurus GN=BCAN PE=1 SV=1 Length = 912
Score = 34.3 bits (77), Expect = 0.58 Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 7/99 (7%) Frame = +3
Query: 180 SSTYPFKADPGTETLPSAPLLGASEAAGQT---DVSGAEKPDEQKTGNADDVPETKELDA 350 + T P + + P + L+GA E +T ++SGA + + ++TG+++D P Sbjct: 541 TKTLPTPREGNLASPPPSTLVGAREIEEETGGPELSGAPRGESEETGSSEDAPSLLPATR 600
Query: 351 LEARERDM-TKAGDEGRSLPDMGTEI---PPLDLSQAAR 455 RD+ T + + R GT + P L A+R Sbjct: 601 APGDTRDLETPSEENSRRTVPAGTSVRAQPVLPTDSASR 639
>sp|Q86X53|ERIC1_HUMAN Glutamate-rich protein 1 OS=Homo sapiens GN=ERICH1 PE=1 SV=1 Length = 443
Score = 34.3 bits (77), Expect = 0.58 Identities = 25/97 (25%), Positives = 46/97 (47%) Frame = +3
Query: 243 GASEAAGQTDVSGAEKPDEQKTGNADDVPETKELDALEARERDMTKAGDEGRSLPDMGTE 422 G EA + V +E ++ +DV +T+E D +A E D+T+A E + D E Sbjct: 201 GVGEACEEDGVDTSE--EDPTLAGEEDVKDTREEDGADASEEDLTRARQEEGA--DASEE 256
Query: 423 IPPLDLSQAAREESNQQGIDSKSKIEKGEITAHKQEG 533 P + ++ + G+D+ IE+ A +++G Sbjct: 257 DPTPAGEEDVKDAREEDGVDT---IEEDLTRAGEEDG 290
Score = 32.3 bits (72), Expect = 2.2 Identities = 22/90 (24%), Positives = 42/90 (46%), Gaps = 3/90 (3%) Frame = +3
Query: 279 GAEKPDEQKT-GNADDVPETKELDALEARERDMTKAGDEG--RSLPDMGTEIPPLDLSQA 449 GA+ +E T +DV + +E D ++ E D+T+AG+E + + G + D + A Sbjct: 250 GADASEEDPTPAGEEDVKDAREEDGVDTIEEDLTRAGEEDGKDTREEDGADASEEDPTWA 309
Query: 450 AREESNQQGIDSKSKIEKGEITAHKQEGAS 539 EE G + + + + T ++ S Sbjct: 310 GEEEGADSGEEDGADASEEDDTITNEKAHS 339
>sp|Q14524|SCN5A_HUMAN Sodium channel protein type 5 subunit alpha OS=Homo sapiens GN=SCN5A PE=1 SV=2 Length = 2016
Score = 33.9 bits (76), Expect = 0.75 Identities = 31/115 (26%), Positives = 45/115 (39%), Gaps = 7/115 (6%) Frame = +3
Query: 102 VVEMSSTDPVQTEIPSQGTGKEVLEMSSTYPF----KADPGTETLPSAPLLGASEA---A 260 V E + D + E S GT +E + + P +A P + T +SEA A Sbjct: 1051 VAESDTDDQEEDEENSLGTEEESSKQQESQPVSGGPEAPPDSRTWSQVSATASSEAEASA 1110
Query: 261 GQTDVSGAEKPDEQKTGNADDVPETKELDALEARERDMTKAGDEGRSLPDMGTEI 425 Q D K + Q G ET E E DMT + +PD+G ++ Sbjct: 1111 SQADWRQQWKAEPQAPGCG----ETPEDSCSEGSTADMTNTAELLEQIPDLGQDV 1161
|
TrEMBL (release 39.9) |
Link to BlastX Result : TrEMBL |
tr_hit_id |
Q3IS97 |
Definition |
tr|Q3IS97|Q3IS97_NATPD Probable cell surface glycoprotein OS=Natronomonas pharaonis (strain DSM 2160 / ATCC 35678) |
Align length |
165 |
Score (bit) |
45.8 |
E-value |
0.002 |
Report |
BLASTX 2.2.19 [Nov-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.
Query= DK960483|Adiantum capillus-veneris mRNA, clone: TST39A01NGRL0007_I18, 5' (648 letters)
Database: uniprot_trembl.fasta 7,341,751 sequences; 2,391,615,440 total letters
Searching..................................................done
Score E Sequences producing significant alignments: (bits) Value
tr|Q3IS97|Q3IS97_NATPD Probable cell surface glycoprotein OS=Nat... 46 0.002 tr|A0NDN5|A0NDN5_ANOGA AGAP004411-PA OS=Anopheles gambiae GN=AGA... 42 0.024 tr|B0WNG0|B0WNG0_CULQU Putative uncharacterized protein OS=Culex... 42 0.032 tr|Q5B1D1|Q5B1D1_EMENI Putative uncharacterized protein OS=Emeri... 42 0.032 tr|Q51912|Q51912_PEPMA Putative uncharacterized protein OS=Pepto... 42 0.042 tr|B3MIK2|B3MIK2_DROAN GF11088 OS=Drosophila ananassae GN=GF1108... 41 0.055 tr|Q3IQ19|Q3IQ19_NATPD Probable cell surface glycoprotein OS=Nat... 41 0.071 tr|B6NLE1|B6NLE1_BRAFL Putative uncharacterized protein OS=Branc... 40 0.093 tr|B3M1H3|B3M1H3_DROAN GF18404 OS=Drosophila ananassae GN=GF1840... 40 0.12 tr|A7SM46|A7SM46_NEMVE Predicted protein OS=Nematostella vectens... 40 0.12 tr|B4E111|B4E111_HUMAN cDNA FLJ57346, highly similar to Homo sap... 40 0.12 tr|A0JLU0|A0JLU0_HUMAN TCOF1 protein (Fragment) OS=Homo sapiens ... 40 0.12 tr|Q8QFM0|Q8QFM0_DANRE Novel protein similar to mouse silencing ... 40 0.16 tr|Q8QFL9|Q8QFL9_DANRE Novel protein similar to mouse silencing ... 40 0.16 tr|A5PMW9|A5PMW9_DANRE Novel protein (Zgc:65960) (Fragment) OS=D... 40 0.16 tr|Q2CJG4|Q2CJG4_9RHOB Outer membrane protein, OmpA/MotB family ... 40 0.16 tr|Q7SCV4|Q7SCV4_NEUCR Predicted protein (Putative uncharacteriz... 40 0.16 tr|Q82EY2|Q82EY2_STRAW Putative uncharacterized protein OS=Strep... 39 0.21 tr|Q2FBJ5|Q2FBJ5_MACMU Treacle (Fragment) OS=Macaca mulatta GN=T... 39 0.27 tr|Q299Y6|Q299Y6_DROPS GA17283 OS=Drosophila pseudoobscura pseud... 39 0.27 tr|B4M9G3|B4M9G3_DROVI GJ17918 OS=Drosophila virilis GN=GJ17918 ... 39 0.27 tr|B7PD97|B7PD97_IXOSC Putative uncharacterized protein OS=Ixode... 39 0.35 tr|Q8BRP9|Q8BRP9_MOUSE Putative uncharacterized protein (Fragmen... 38 0.46 tr|Q17D61|Q17D61_AEDAE WOC protein, putative OS=Aedes aegypti GN... 38 0.46 tr|B3MWM8|B3MWM8_DROAN GF22424 OS=Drosophila ananassae GN=GF2242... 38 0.46 tr|B5DVY1|B5DVY1_DROPS GA26240 OS=Drosophila pseudoobscura pseud... 38 0.60 tr|A7TR48|A7TR48_VANPO Putative uncharacterized protein OS=Vande... 38 0.60 tr|B4MIG7|B4MIG7_DROWI GK10256 OS=Drosophila willistoni GN=GK102... 38 0.61 tr|A8R208|A8R208_9MURI ALEX (Fragment) OS=Mastomys huberti GN=Gn... 37 0.79 tr|Q0A9T7|Q0A9T7_ALHEH Putative uncharacterized protein OS=Alkal... 37 0.79
>tr|Q3IS97|Q3IS97_NATPD Probable cell surface glycoprotein OS=Natronomonas pharaonis (strain DSM 2160 / ATCC 35678) GN=NP1800A PE=4 SV=1 Length = 1241
Score = 45.8 bits (107), Expect = 0.002 Identities = 44/165 (26%), Positives = 67/165 (40%), Gaps = 10/165 (6%) Frame = +3
Query: 3 SMEGTDKHFPISLLGTQIQ-------VFTADIKSEGTGKQVVEMSSTDPVQTEIPSQGTG 161 ++E D +F SL G ++ V T + +G G+Q V +S D TE + G Sbjct: 524 TLEAGDPNFEASLEGDAVEAGEEVDLVGTVENTGDGAGEQDVTLSVADEENTETLALAVG 583
Query: 162 KEVLEMSSTYPFKADPGTETLPSAPLLGASEAAGQTDVSGAEKPDEQKTGNADDVPETKE 341 + + S D G T L +A + +V+ +E D+ +T + DDV Sbjct: 584 DDETILLSWETDADDDGEYTAE----LDTGDATAEAEVTVSEA-DDNETDSGDDVLTATS 638
Query: 342 LDALEARERDM---TKAGDEGRSLPDMGTEIPPLDLSQAAREESN 467 A D T A DEG SLPD P++L + N Sbjct: 639 QGGFIAFTEDTSSETTASDEGLSLPDEDDGDTPIELEADYNPDDN 683
>tr|A0NDN5|A0NDN5_ANOGA AGAP004411-PA OS=Anopheles gambiae GN=AGAP004411 PE=4 SV=1 Length = 1119
Score = 42.4 bits (98), Expect = 0.024 Identities = 45/172 (26%), Positives = 75/172 (43%), Gaps = 7/172 (4%) Frame = +3
Query: 45 GTQIQVFTADIKSEGTGKQVVEMSSTDPVQTEIPSQGTGKEVLEMSSTYPFKADPGTETL 224 G Q++V + EGT + E+ D TEI G + +E A+ +ET Sbjct: 300 GPQVEVQPKSVGDEGTSQHAHEVPKDDDENTEIQMVGQQQVDIETRPAAVEGAEVDSETK 359
Query: 225 PSAPLLGASEAAGQTDVSGAEKPDEQ------KTGNADDVPETKELDALEARE-RDMTKA 383 P P+ G +E D G+E+ +++ ++G D+ T E+ E E + Sbjct: 360 P-IPVEG-TEVQTTADFDGSEENNDKEDKIDNESGKQDEKLVTNEIKPEENDEIVEENSI 417
Query: 384 GDEGRSLPDMGTEIPPLDLSQAAREESNQQGIDSKSKIEKGEITAHKQEGAS 539 ++G L D T+ P +D +++ + D KS I K EI K E A+ Sbjct: 418 EEKGNQLSDPATD-PVIDQEVTGEQQATESSTD-KSDIAKPEI-EEKSESAN 466
>tr|B0WNG0|B0WNG0_CULQU Putative uncharacterized protein OS=Culex quinquefasciatus GN=CpipJ_CPIJ008676 PE=4 SV=1 Length = 1372
Score = 42.0 bits (97), Expect = 0.032 Identities = 34/152 (22%), Positives = 65/152 (42%) Frame = +3
Query: 105 VEMSSTDPVQTEIPSQGTGKEVLEMSSTYPFKADPGTETLPSAPLLGASEAAGQTDVSGA 284 V S D Q E S + K+ + T ++ P +S V Sbjct: 623 VSASQDDEKQQEDESSQSPKKSASIEETNKPTSEDQITQDDEEPAEESSSPKKSASVEDQ 682
Query: 285 EKPDEQKTGNADDVPETKELDALEARERDMTKAGDEGRSLPDMGTEIPPLDLSQAAREES 464 E + K + ++ P+T+ D ++ E K+G P+ ++ L+ +++ S Sbjct: 683 ETSEPAKQDSQNEEPQTE--DNPKSGEGSPEKSGSIEDPAPETNSKESSLEKTESEPNVS 740
Query: 465 NQQGIDSKSKIEKGEITAHKQEGASKTLGLVT 560 +Q +S +K +G+ TA KQ+GAS+ G ++ Sbjct: 741 GEQSKESSAKESEGDATASKQDGASEEAGEIS 772
>tr|Q5B1D1|Q5B1D1_EMENI Putative uncharacterized protein OS=Emericella nidulans GN=AN5649.2 PE=4 SV=1 Length = 1720
Score = 42.0 bits (97), Expect = 0.032 Identities = 41/149 (27%), Positives = 62/149 (41%), Gaps = 3/149 (2%) Frame = +3
Query: 108 EMSSTDPVQTEIPSQGTGKEVLEMSSTYPFKADPGTETLPSAPLLGASEAAGQTDVSGAE 287 E +P T P +G + D T+ P+AP SEAA +T + A Sbjct: 300 EPEPAEPETTATPEEGA-------------EPDAATQPEPAAPEAAESEAAAET--APAT 344
Query: 288 KPDEQKTGNADDVPETKELDALEARERDMTKAGDEGRSLPDMGTEIPPLDLSQ---AARE 458 KP E T PE + A EA+ + E P E+ P+ ++ AA E Sbjct: 345 KP-EAATQPEPAAPEAEPEAAAEAKPTTEPETVLEAAPAPAAEPELEPVQKTEEPAAAPE 403
Query: 459 ESNQQGIDSKSKIEKGEITAHKQEGASKT 545 E N++ S K +K E K+ GA+++ Sbjct: 404 ELNEESTASSGKKKKKEKKKKKKGGATQS 432
>tr|Q51912|Q51912_PEPMA Putative uncharacterized protein OS=Peptostreptococcus magnus PE=1 SV=1 Length = 719
Score = 41.6 bits (96), Expect = 0.042 Identities = 38/144 (26%), Positives = 60/144 (41%), Gaps = 1/144 (0%) Frame = +3
Query: 120 TDPVQTEIPSQGTGKEVLEMSSTYPFKADPGTETLPSAPLLGASEAAGQTDVSGAEKP-D 296 ++P ++PS + ++ ST ++P T +PS P +E EKP + Sbjct: 584 SEPSTPDVPSNPSNPSTPDVPSTPDVPSNPSTPEVPSNPSTPGNE----------EKPGN 633
Query: 297 EQKTGNADDVPETKELDALEARERDMTKAGDEGRSLPDMGTEIPPLDLSQAAREESNQQG 476 EQK GN E + + K G+E + G E P S+ +EE+ + G Sbjct: 634 EQKPGN-------------EQKPGNEQKPGNEQKP----GNEQKPDQPSKPEKEENGKGG 676
Query: 477 IDSKSKIEKGEITAHKQEGASKTL 548 +DS K EK + E TL Sbjct: 677 VDSPKKKEKAALPKAGSEAEILTL 700
>tr|B3MIK2|B3MIK2_DROAN GF11088 OS=Drosophila ananassae GN=GF11088 PE=4 SV=1 Length = 1323
Score = 41.2 bits (95), Expect = 0.055 Identities = 43/185 (23%), Positives = 74/185 (40%), Gaps = 25/185 (13%) Frame = +3
Query: 30 PISLLGTQIQVFTADIKSEGTGK-QVVEMSSTDPVQTEIPSQGTGKEV------------ 170 P LG TA IK E V+ + +DP+ + Q G+++ Sbjct: 35 PTPALGKTSISSTASIKDEKIANGDEVDDTVSDPITESVKDQNAGRDLDALLDKISSIVD 94
Query: 171 --------LEMSSTYPFKADPGTETLPSAPLLGASEAAGQTDVSGAEKPD--EQKTGNAD 320 L+ S K+D G+E +A ++ G+ + GAEK + E+ + A Sbjct: 95 RSPKNSDELDNSDKLSDKSDAGSENQKTAAEPVENQLEGKEEEIGAEKENKKEEDSSAAT 154
Query: 321 DVPETKELDALEARERDMTKAGDEGRSLPDMGTEIPPLDLSQAAREESNQQGID--SKSK 494 DV E ++ + D G + + D ++P + S A+EE ++ + S S Sbjct: 155 DVIELSNVETESELDADNKSEGKDVEVIEDPDQDVPAPESSTDAKEEESEPKPEENSTSA 214
Query: 495 IEKGE 509 E GE Sbjct: 215 AEVGE 219
>tr|Q3IQ19|Q3IQ19_NATPD Probable cell surface glycoprotein OS=Natronomonas pharaonis (strain DSM 2160 / ATCC 35678) GN=NP3376A PE=4 SV=1 Length = 1184
Score = 40.8 bits (94), Expect = 0.071 Identities = 43/166 (25%), Positives = 68/166 (40%), Gaps = 11/166 (6%) Frame = +3
Query: 3 SMEGTDKHFP-ISLLGTQIQ-------VFTADIKSEGTGKQVVEMSSTDPVQTEIPSQGT 158 ++E D +F ++++G ++ + T + EG G+Q V +S D TE G Sbjct: 463 TLESGDAYFEAVAVVGDDVEAGEEAEILGTVENTGEGAGEQDVTLSVADEEVTETLEVGF 522
Query: 159 GKEVLEMSSTYPFKADPGTETLPSAPLLGASEAAGQTDVSGAEKPDEQKTGNADDVPETK 338 +E S D G T L +A + +V+ +E D+ +T + DDV Sbjct: 523 EEETTIDLSWETDADDDGEYTAE----LDTGDATAEAEVTVSEA-DDNETDSGDDVLTAT 577
Query: 339 ELDALEARERDM---TKAGDEGRSLPDMGTEIPPLDLSQAAREESN 467 A D T A DEG SLPD P++L + N Sbjct: 578 SQGGFIAFTEDTSSETTASDEGLSLPDEDDGDTPIELEADYNPDDN 623
>tr|B6NLE1|B6NLE1_BRAFL Putative uncharacterized protein OS=Branchiostoma floridae GN=BRAFLDRAFT_96524 PE=4 SV=1 Length = 662
Score = 40.4 bits (93), Expect = 0.093 Identities = 35/153 (22%), Positives = 61/153 (39%), Gaps = 5/153 (3%) Frame = +3
Query: 66 TADIKSEGTGKQVVEMSSTDPVQTEIPSQGTGKEVLEMSSTYPFKADPGTETLPSAPLLG 245 T++ ++E G E + E +GT + +E T +A+P + P A G Sbjct: 109 TSEPEAEPVGTSEPEAEPEGSSEPEAEPEGTSEPEIEPEGTSEPEAEPEGTSEPEAEPEG 168
Query: 246 ASEAAGQTDVSGAEKPDEQKTGNADDVPETKELDALEARERDMT--KAGDEGRSLPDM-- 413 SE + + G +P+ + G ++ E K EA + +A EG S P+ Sbjct: 169 TSEP--EAEPKGTSEPEAEPEGTSEPEAEPKGTSEPEAEPEGTSGPEAEPEGTSEPEAEP 226
Query: 414 -GTEIPPLDLSQAAREESNQQGIDSKSKIEKGE 509 GT P + + E+ +G +GE Sbjct: 227 EGTSEPEAEPEETPEPETEPEGASEPEAEPEGE 259
Score = 34.7 bits (78), Expect = 5.1 Identities = 31/140 (22%), Positives = 56/140 (40%), Gaps = 3/140 (2%) Frame = +3
Query: 66 TADIKSEGTGKQVVEMSSTDPVQTEIPSQGTGKEVLEMSSTYPFKADPGTETLPSAPLLG 245 T++ + EGT + E T + E +GT + E T +A+P + P A G Sbjct: 71 TSEAEPEGTSEPEGEPEGTSEPEAE--PEGTSEPEAEPEGTSEPEAEPVGTSEPEAEPEG 128
Query: 246 ASEAAGQTDVSGAEKPDEQKTGNADDVPETKELDALEARERDMTKAGDEGRSLPDM---G 416 +SE + + + + + + T + PE E +A +G S P+ G Sbjct: 129 SSEPEAEPEGTSEPEIEPEGTSEPEAEPEGTSEPEAEPEGTSEPEAEPKGTSEPEAEPEG 188
Query: 417 TEIPPLDLSQAAREESNQQG 476 T P + + E+ +G Sbjct: 189 TSEPEAEPKGTSEPEAEPEG 208
Score = 34.3 bits (77), Expect = 6.7 Identities = 33/136 (24%), Positives = 56/136 (41%), Gaps = 5/136 (3%) Frame = +3
Query: 84 EGTGKQVVEMSSTDPVQTEIPSQGTGKEVLEMSSTYPFKADPGTETLPSAPLLGASEAAG 263 EGT + E T + E +GT + E T +A+P + P A G SE Sbjct: 87 EGTSEPEAEPEGTSEPEAE--PEGTSEPEAEPVGTSEPEAEPEGSSEPEAEPEGTSEP-- 142
Query: 264 QTDVSGAEKPDEQKTGNADDVPETKELDALEARERDMTK--AGDEGRSLPD---MGTEIP 428 + + G +P+ + G ++ E + EA + ++ A EG S P+ GT P Sbjct: 143 EIEPEGTSEPEAEPEGTSEPEAEPEGTSEPEAEPKGTSEPEAEPEGTSEPEAEPKGTSEP 202
Query: 429 PLDLSQAAREESNQQG 476 + + E+ +G Sbjct: 203 EAEPEGTSGPEAEPEG 218
Score = 34.3 bits (77), Expect = 6.7 Identities = 33/148 (22%), Positives = 58/148 (39%), Gaps = 13/148 (8%) Frame = +3
Query: 72 DIKSEGTGKQVVEMSSTDPVQTEIPSQGTGKEVLEMSSTYPFKADPGTETLPSAPLLGAS 251 +I+ EGT + E T + E +GT + E T +A+P + P A G S Sbjct: 143 EIEPEGTSEPEAEPEGTSEPEAE--PEGTSEPEAEPKGTSEPEAEPEGTSEPEAEPKGTS 200
Query: 252 E--------AAGQTDVSGAEKPDEQKTGNADDVPETKELDALEARERDMTK--AGDEGRS 401 E + + + G +P+ + G ++ E +E E ++ A EG S Sbjct: 201 EPEAEPEGTSGPEAEPEGTSEPEAEPEGTSEPEAEPEETPEPETEPEGASEPEAEPEGES 260
Query: 402 LPDM---GTEIPPLDLSQAAREESNQQG 476 P+ G P + + E+ +G Sbjct: 261 EPEAEPEGISEPEAEPEGTSEPEAEPEG 288
>tr|B3M1H3|B3M1H3_DROAN GF18404 OS=Drosophila ananassae GN=GF18404 PE=4 SV=1 Length = 382
Score = 40.0 bits (92), Expect = 0.12 Identities = 47/182 (25%), Positives = 69/182 (37%), Gaps = 18/182 (9%) Frame = +3
Query: 48 TQIQVFTADIKSEGTGKQV----VEMSSTD-PVQTEIPSQGTGKEVLEMSSTYPFKADPG 212 T V T D E T V + +TD PV+T + T E E S + ++ Sbjct: 180 TDAPVETTDAPVETTEAPVETTEAPVETTDAPVETTEAPEETTTESEEGSGS----SEET 235
Query: 213 TETLPSAPLLGASEAAGQTDVSGAEKPDEQKTGNADD---VPETKELDA----LEARERD 371 T P + E +G E P+E T A+D PE DA +A E Sbjct: 236 TTQEPEDSTTDSDEGSGSDTTQAPEDPEESTTDAAEDTTQAPEESTTDAGEETTQAPEES 295
Query: 372 MTKAGDEGRSLPDMGTEIPPLDLSQAAREESNQQGIDSKSKIEK------GEITAHKQEG 533 T AG+E P+ T + +QA E + G ++ E+ GE T +E Sbjct: 296 TTDAGEETTQAPEESTTDAGEETTQAPEESTTDAGEETTQAPEESTTDAGGETTQAPEES 355
Query: 534 AS 539 + Sbjct: 356 TT 357
>tr|A7SM46|A7SM46_NEMVE Predicted protein OS=Nematostella vectensis GN=v1g172038 PE=3 SV=1 Length = 938
Score = 40.0 bits (92), Expect = 0.12 Identities = 30/88 (34%), Positives = 41/88 (46%) Frame = +3
Query: 288 KPDEQKTGNADDVPETKELDALEARERDMTKAGDEGRSLPDMGTEIPPLDLSQAAREESN 467 KPDE K +DDV E K D +E D + E PD E ++ E Sbjct: 602 KPDEPKDEKSDDVNE-KPKDEEAEKEEDKKEEKKEEEKKPDEKKE-------DESKSEGK 653
Query: 468 QQGIDSKSKIEKGEITAHKQEGASKTLG 551 ++G +SKS+ +K E TA K + A K G Sbjct: 654 KEGDESKSEGKKEEPTADKDKKAEKADG 681
|