BP913386
Clone id YMU001_000029_F09
Library
Length 352
Definition Adiantum capillus-veneris mRNA. clone: YMU001_000029_F09.
Accession
Tissue type prothallium
Developmental stage -
Contig ID
Sequence
CCACTCATAGAAATCACCCTGACACACGAACTCCAATCCATGGAATCCATCCTCGTGTCC
ATTGCATGTCACATGCGCACCAAGAGCATCTGTCTGTTCTGAACCCTGTGTGGTAAGAGC
AATTCCTAGCTCTAACTCACATACATCAGCCTCTGGCCTAAGCATATCCATGAATGACTG
AATCATGGCTGAATCCTCAACACTCAAGTCATCCATATCCGTCATCGTATCCACGAATGA
CATGCCCTGGTGCACAACTGGCTCAACATGCTCACAAACTAGAGCATCATGCAACCTATC
CGCAAAGGAGACATGCCTCATCTCTCTGCATGGCTGCATACTCACGAAAACT
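The Frame values in the BLASTX alignments of this record (+2, +3, -2) name the reading frame in which the 352-nt query was translated before comparison with protein sequences. The sketch below shows one way to reproduce those translated query strings. It assumes the standard genetic code (NCBI translation table 1); the names `translate`, `reverse_complement`, and `CODON_TABLE` are illustrative and are not part of the BLAST software.

```python
# Minimal sketch: translate the EST in a BLASTX-style reading frame.
# Assumption: standard genetic code (NCBI table 1). Helper names are hypothetical.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Build the 64-codon lookup from the compact TCAG-ordered amino acid string.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Sequence of clone YMU001_000029_F09 (BP913386), copied from the record.
SEQ = (
    "CCACTCATAGAAATCACCCTGACACACGAACTCCAATCCATGGAATCCATCCTCGTGTCC"
    "ATTGCATGTCACATGCGCACCAAGAGCATCTGTCTGTTCTGAACCCTGTGTGGTAAGAGC"
    "AATTCCTAGCTCTAACTCACATACATCAGCCTCTGGCCTAAGCATATCCATGAATGACTG"
    "AATCATGGCTGAATCCTCAACACTCAAGTCATCCATATCCGTCATCGTATCCACGAATGA"
    "CATGCCCTGGTGCACAACTGGCTCAACATGCTCACAAACTAGAGCATCATGCAACCTATC"
    "CGCAAAGGAGACATGCCTCATCTCTCTGCATGGCTGCATACTCACGAAAACT"
)


def reverse_complement(dna):
    """Return the reverse complement of an upper-case A/C/G/T string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna, frame):
    """Translate dna in frame +1..+3 or -1..-3; stop codons become '*'."""
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +1, +2, +3, -1, -2, -3")
    strand = dna if frame > 0 else reverse_complement(dna)
    start = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(start, len(strand) - 2, 3))
    # Codons containing anything other than A/C/G/T are reported as 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


if __name__ == "__main__":
    print(len(SEQ))                    # 352
    # Query position 26 in frame +2 is residue index (26 - 2) / 3 = 8.
    print(translate(SEQ, 2)[8:23])     # TNSNPWNPSSCPLHV
```

Residue indices map back to query coordinates as `start + 3 * index` for forward frames, which is why the frame +2 alignments begin at positions such as 26, 35, and 113.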
Homology search results
sp_hit_id Q5SSG8
Definition sp|Q5SSG8|MUC21_HUMAN Mucin-21 OS=Homo sapiens
Align length 67
Score (bit) 31.2
E-value 1.7
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= BP913386|Adiantum capillus-veneris mRNA, clone:
YMU001_000029_F09.
(352 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|Q5SSG8|MUC21_HUMAN Mucin-21 OS=Homo sapiens GN=MUC21 PE=1 SV=1 31 1.7
sp|P17053|G168_PARPR G surface protein, allelic form 168 OS=Para... 31 2.2
sp|A8DZE6|WTIP_DANRE Wilms tumor protein 1-interacting protein h... 30 2.8
sp|A6VTV8|URE1_MARMS Urease subunit alpha OS=Marinomonas sp. (st... 30 3.7
sp|Q5KYM1|URE1_GEOKA Urease subunit alpha OS=Geobacillus kaustop... 30 3.7
sp|Q54KX0|GTAN_DICDI GATA zinc finger domain-containing protein ... 30 3.7
sp|Q6NI74|GLMU_CORDI Bifunctional protein glmU OS=Corynebacteriu... 30 3.7
sp|Q16RY9|LRC50_AEDAE Leucine-rich repeat-containing protein 50 ... 30 4.8
sp|P13837|G156_PARPR G surface protein, allelic form 156 OS=Para... 30 4.8
sp|Q8D2L3|FUMC_WIGBR Fumarate hydratase class II OS=Wigglesworth... 30 4.8
sp|P70323|TBX1_MOUSE T-box transcription factor TBX1 OS=Mus musc... 30 4.8
sp|Q9C105|YKT4_SCHPO Chitinase-like protein PB1E7.04c OS=Schizos... 29 6.3
sp|A1SYY1|URE1_PSYIN Urease subunit alpha OS=Psychromonas ingrah... 29 6.3
sp|Q9Z0J1|RECK_MOUSE Reversion-inducing cysteine-rich protein wi... 29 6.3
sp|Q8TFG9|YL61_SCHPO Uncharacterized serine/threonine-rich prote... 29 8.2
sp|Q9KG59|URE1_BACHD Urease subunit alpha OS=Bacillus halodurans... 29 8.2
sp|Q6BEA0|PLXA4_DANRE Plexin-A4 OS=Danio rerio GN=plxna4 PE=2 SV=1 29 8.2
sp|Q68FE6|FA65A_MOUSE Protein FAM65A OS=Mus musculus GN=Fam65a P... 29 8.2
sp|Q6ZS17|FA65A_HUMAN Protein FAM65A OS=Homo sapiens GN=FAM65A P... 29 8.2

>sp|Q5SSG8|MUC21_HUMAN Mucin-21 OS=Homo sapiens GN=MUC21 PE=1 SV=1
Length = 566

Score = 31.2 bits (69), Expect = 1.7
Identities = 19/67 (28%), Positives = 33/67 (49%)
Frame = +2

Query: 113 VRAIPSSNSHTSASGLSISMND*IMAESSTLKSSISVIVSTNDMPWCTTGSTCSQTRASC 292
V + +S HT++SG+S + N +E ST S IS+ ++ + ST + + +S
Sbjct: 72 VSIVTNSEFHTTSSGISTATN----SEFSTASSGISIATNSESSTTSSGASTATNSESST 127

Query: 293 NLSAKET 313
S T
Sbjct: 128 PSSGAST 134



Score = 28.9 bits (63), Expect = 8.2
Identities = 26/96 (27%), Positives = 41/96 (42%)
Frame = +2

Query: 26 TNSNPWNPSSCPLHVTCAPRASVCSEPCVVRAIPSSNSHTSASGLSISMND*IMAESSTL 205
TNS PSS T + ++ S +S+S T++SG S + N +ESST
Sbjct: 286 TNSESSTPSSGANTATNSESSTTSSG---ANTATNSDSSTTSSGASTATN----SESSTT 338

Query: 206 KSSISVIVSTNDMPWCTTGSTCSQTRASCNLSAKET 313
S S ++ + ST + + +S S T
Sbjct: 339 SSGASTATNSESSTTSSGASTATNSGSSTTSSGTST 374


>sp|P17053|G168_PARPR G surface protein, allelic form 168
OS=Paramecium primaurelia GN=168G PE=2 SV=1
Length = 2704

Score = 30.8 bits (68), Expect = 2.2
Identities = 30/115 (26%), Positives = 45/115 (39%), Gaps = 14/115 (12%)
Frame = +2

Query: 35 NPWNPSSCPLHVTCAPR-------ASVCSEPCVVR------AIPSSNSHTSASGLSISMN 175
N +P C + +CA + S C A+ S +S+T+ +G +
Sbjct: 2330 NSSDPKVCKPYTSCADAFYTTHSDCQIASSKCTTNGTTGCIALGSCSSYTAQAGCYFN-- 2387

Query: 176 D*IMAESSTLKSSISVIVSTNDMPWCTTGSTC-SQTRASCNLSAKETCLISLHGC 337
+ TL +S VI ST W TT S+C Q+ A + TC L C
Sbjct: 2388 -----DKGTLYTS-GVITSTGICTWDTTSSSCRDQSCADLTGTTHATCSSQLSTC 2436


>sp|A8DZE6|WTIP_DANRE Wilms tumor protein 1-interacting protein
homolog OS=Danio rerio GN=wtip PE=3 SV=1
Length = 648

Score = 30.4 bits (67), Expect = 2.8
Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 7/47 (14%)
Frame = +2

Query: 44 NPSSCPLH--VTCAPRASVCSEP-----CVVRAIPSSNSHTSASGLS 163
+P S +H +PR+S+CS+P CVV S +SH+S S S
Sbjct: 275 DPESAAVHGIPMASPRSSICSQPAVAANCVVSPRSSISSHSSRSSRS 321


>sp|A6VTV8|URE1_MARMS Urease subunit alpha OS=Marinomonas sp.
(strain MWYL1) GN=ureC PE=3 SV=1
Length = 568

Score = 30.0 bits (66), Expect = 3.7
Identities = 12/50 (24%), Positives = 25/50 (50%)
Frame = -2

Query: 342 SMQPCREMRHVSFADRLHDALVCEHVEPVVHQGMSFVDTMTDMDDLSVED 193
S P R H + + L +VC H++ + + ++F D+ + ++ ED
Sbjct: 297 STNPTRPYTHNTVDEHLDMLMVCHHLDSNIEEDVAFADSRIRKETIAAED 346


>sp|Q5KYM1|URE1_GEOKA Urease subunit alpha OS=Geobacillus
kaustophilus GN=ureC PE=3 SV=1
Length = 569

Score = 30.0 bits (66), Expect = 3.7
Identities = 13/50 (26%), Positives = 25/50 (50%)
Frame = -2

Query: 342 SMQPCREMRHVSFADRLHDALVCEHVEPVVHQGMSFVDTMTDMDDLSVED 193
S P R + + L +VC H++P V + ++F D+ + ++ ED
Sbjct: 299 STNPTRPYTKNTLDEHLDMLMVCHHLDPSVPEDIAFADSRIRKETIAAED 348


>sp|Q54KX0|GTAN_DICDI GATA zinc finger domain-containing protein 14
OS=Dictyostelium discoideum GN=gtaN PE=4 SV=1
Length = 953

Score = 30.0 bits (66), Expect = 3.7
Identities = 19/68 (27%), Positives = 35/68 (51%)
Frame = +2

Query: 38 PWNPSSCPLHVTCAPRASVCSEPCVVRAIPSSNSHTSASGLSISMND*IMAESSTLKSSI 217
P +PS+ P+ ++ P ASV +P + SS++H S I+ I + S ++
Sbjct: 126 PGSPSASPIPISSIPTASVYRQPF---SSSSSSNHDQGSYPIINTKSIIPSASQLQSQNL 182

Query: 218 SVIVSTND 241
++I S N+
Sbjct: 183 NIINSINN 190


>sp|Q6NI74|GLMU_CORDI Bifunctional protein glmU OS=Corynebacterium
diphtheriae GN=glmU PE=3 SV=1
Length = 484

Score = 30.0 bits (66), Expect = 3.7
Identities = 18/60 (30%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Frame = -2

Query: 189 AMIQSFMDMLRPEADVCELELGIALTTQGSEQTDALGAHVTCNGHE-DGFHGLEFVCQGD 13
A+I + +RPE + + ++ EQ + G V C HE +GF G V GD
Sbjct: 54 AVIGHRREQVRPEVEAVAATVDCEISVAIQEQQNGTGHAVQCAMHELEGFEGTVVVTNGD 113


>sp|Q16RY9|LRC50_AEDAE Leucine-rich repeat-containing protein 50
homolog OS=Aedes aegypti GN=AAEL010772 PE=3 SV=1
Length = 1107

Score = 29.6 bits (65), Expect = 4.8
Identities = 15/55 (27%), Positives = 30/55 (54%), Gaps = 2/55 (3%)
Frame = -2

Query: 291 HDALVCEHVEPVVHQGM--SFVDTMTDMDDLSVEDSAMIQSFMDMLRPEADVCEL 133
+D L + +E ++ Q + S D ++ DD S + + FMD +RP+ ++ E+
Sbjct: 815 NDNLAQQIIEKIISQSVKTSSEDLKSNFDDSSESSDSAEEDFMDSIRPDHNLLEI 869


>sp|P13837|G156_PARPR G surface protein, allelic form 156
OS=Paramecium primaurelia GN=156G PE=2 SV=1
Length = 2715

Score = 29.6 bits (65), Expect = 4.8
Identities = 30/115 (26%), Positives = 44/115 (38%), Gaps = 14/115 (12%)
Frame = +2

Query: 35 NPWNPSSCPLHVTCAPR-------ASVCSEPCVVR------AIPSSNSHTSASGLSISMN 175
N +P C + +CA + S C A+ S +S+T +G +
Sbjct: 2341 NSSDPKVCKPYTSCADAFYTTHSDCQIASSKCTTNGTTGCIALGSCSSYTVQAGCYFN-- 2398

Query: 176 D*IMAESSTLKSSISVIVSTNDMPWCTTGSTC-SQTRASCNLSAKETCLISLHGC 337
+ TL +S VI ST W TT S+C Q+ A + TC L C
Sbjct: 2399 -----DKGTLYTS-GVITSTGICTWDTTSSSCRDQSCADLTGTTHATCSSQLSTC 2447


>sp|Q8D2L3|FUMC_WIGBR Fumarate hydratase class II OS=Wigglesworthia
glossinidia brevipalpis GN=fumC PE=3 SV=1
Length = 464

Score = 29.6 bits (65), Expect = 4.8
Identities = 16/38 (42%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Frame = +2

Query: 161 SISMND*IMAESSTLKS-SISVIVSTNDMPWCTTGSTC 271
S+S D I+ STLK+ SIS++ +ND+ W ++G C
Sbjct: 267 SLSTCDAIVKVHSTLKNLSISIMKISNDIRWLSSGPRC 304


tr_hit_id Q5KDH4
Definition tr|Q5KDH4|Q5KDH4_CRYNE Putative uncharacterized protein OS=Cryptococcus neoformans
Align length 44
Score (bit) 39.3
E-value 0.098
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= BP913386|Adiantum capillus-veneris mRNA, clone:
YMU001_000029_F09.
(352 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|Q5KDH4|Q5KDH4_CRYNE Putative uncharacterized protein OS=Crypt... 39 0.098
tr|Q55PY8|Q55PY8_CRYNE Putative uncharacterized protein OS=Crypt... 39 0.098
tr|A4QTC8|A4QTC8_MAGGR Putative uncharacterized protein OS=Magna... 37 0.49
tr|A4HM87|A4HM87_LEIBR Proteophosphoglycan ppg4 OS=Leishmania br... 37 0.63
tr|A9RFP6|A9RFP6_PHYPA Predicted protein OS=Physcomitrella paten... 35 1.8
tr|A9UX20|A9UX20_MONBE Predicted protein OS=Monosiga brevicollis... 35 2.4
tr|Q2HG84|Q2HG84_CHAGB Putative uncharacterized protein OS=Chaet... 34 3.1
tr|B6KIX7|B6KIX7_TOXGO Ribonuclease II RNB family protein OS=Tox... 34 4.1
tr|Q28Y80|Q28Y80_DROPS GA11188 OS=Drosophila pseudoobscura pseud... 33 5.4
tr|Q8CAY5|Q8CAY5_MOUSE Novel member of the keratin associated pr... 33 7.0
tr|B4GCU2|B4GCU2_DROPE GL11676 OS=Drosophila persimilis GN=GL116... 33 7.0
tr|Q4V9G6|Q4V9G6_DANRE Zgc:113338 OS=Danio rerio GN=zgc:113338 P... 33 9.1
tr|B5X266|B5X266_SALSA Granulins OS=Salmo salar GN=GRN PE=2 SV=1 33 9.1
tr|Q7ULR2|Q7ULR2_RHOBA Probable swi/snf family helicase 2 OS=Rho... 33 9.1
tr|Q7F0Z0|Q7F0Z0_ORYSJ Putative speckle-type POZ protein OS=Oryz... 33 9.1
tr|Q0J766|Q0J766_ORYSJ Os08g0226800 protein OS=Oryza sativa subs... 33 9.1
tr|A2YSK2|A2YSK2_ORYSI Putative uncharacterized protein OS=Oryza... 33 9.1
tr|Q234N8|Q234N8_TETTH Putative uncharacterized protein OS=Tetra... 33 9.1
tr|A7F608|A7F608_SCLS1 Putative uncharacterized protein OS=Scler... 33 9.1

>tr|Q5KDH4|Q5KDH4_CRYNE Putative uncharacterized protein
OS=Cryptococcus neoformans GN=CNG03970 PE=4 SV=1
Length = 346

Score = 39.3 bits (90), Expect = 0.098
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 10/44 (22%)
Frame = +3

Query: 18 PDTRTPI---HGIHPRVHCMSHAH-------QEHLSVLNPVW*E 119
P+T+ PI HG +VHCM H H QE +S LNP+W E
Sbjct: 140 PNTKPPILELHGTLAKVHCMKHRHEQSRDEYQEQISRLNPIWDE 183


>tr|Q55PY8|Q55PY8_CRYNE Putative uncharacterized protein
OS=Cryptococcus neoformans GN=CNBG0760 PE=4 SV=1
Length = 361

Score = 39.3 bits (90), Expect = 0.098
Identities = 20/44 (45%), Positives = 25/44 (56%), Gaps = 10/44 (22%)
Frame = +3

Query: 18 PDTRTPI---HGIHPRVHCMSHAH-------QEHLSVLNPVW*E 119
P+T+ PI HG +VHCM H H QE +S LNP+W E
Sbjct: 147 PNTKPPILELHGTLAKVHCMKHRHEQSRDEYQEQISRLNPIWDE 190


>tr|A4QTC8|A4QTC8_MAGGR Putative uncharacterized protein
OS=Magnaporthe grisea GN=MGG_12939 PE=4 SV=1
Length = 593

Score = 37.0 bits (84), Expect = 0.49
Identities = 29/88 (32%), Positives = 40/88 (45%), Gaps = 6/88 (6%)
Frame = +2

Query: 56 CPLHVTCAPRASVCSEPC------VVRAIPSSNSHTSASGLSISMND*IMAESSTLKSSI 217
C +H C S C + C V + SS+S TSAS S+S A SS +S
Sbjct: 430 CSVHGWCGSTDSYCGDGCQPVPLSVASSSVSSSSSTSASPTSVSTTS---ASSSQSSTST 486

Query: 218 SVIVSTNDMPWCTTGSTCSQTRASCNLS 301
S IV + +T ++ S T AS + S
Sbjct: 487 STIVPLDTSSIDSTSTSASSTSASVSSS 514


>tr|A4HM87|A4HM87_LEIBR Proteophosphoglycan ppg4 OS=Leishmania
braziliensis GN=LbrM34_V2.0540 PE=4 SV=1
Length = 5384

Score = 36.6 bits (83), Expect = 0.63
Identities = 28/103 (27%), Positives = 51/103 (49%), Gaps = 3/103 (2%)
Frame = +2

Query: 14 SP*HTNSNPWNPSSCPLHVTCAPRASVCSEPCVVRAIPSSNSH---TSASGLSISMND*I 184
+P ++S P + SS P +CAP +S S P + PSS+S +S+S S +
Sbjct: 4802 APSSSSSAPSSSSSAPSSSSCAPSSSSSSAPSSSSSAPSSSSSSAPSSSSSAPSSSSSSA 4861

Query: 185 MAESSTLKSSISVIVSTNDMPWCTTGSTCSQTRASCNLSAKET 313
+ SS+ SS S ++ ++ S+ + + +SC S+ +
Sbjct: 4862 PSSSSSAPSSSSSSAPSSSSSAPSSSSSSAPSSSSCAPSSSSS 4904


>tr|A9RFP6|A9RFP6_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_65494 PE=4 SV=1
Length = 597

Score = 35.0 bits (79), Expect = 1.8
Identities = 15/33 (45%), Positives = 21/33 (63%)
Frame = +2

Query: 41 WNPSSCPLHVTCAPRASVCSEPCVVRAIPSSNS 139
W P++C + VTC PR+ C C+VR +P S S
Sbjct: 540 WVPNACDVIVTCTPRSCAC---CIVRHLPISLS 569


>tr|A9UX20|A9UX20_MONBE Predicted protein OS=Monosiga brevicollis
GN=24645 PE=4 SV=1
Length = 1267

Score = 34.7 bits (78), Expect = 2.4
Identities = 28/102 (27%), Positives = 44/102 (43%), Gaps = 2/102 (1%)
Frame = +2

Query: 50 SSCPLHVTCAPRASVCSEPCVVRAIPSSNSHTSASGLSISMND*IMAESSTLKSSISVIV 229
SS T + +S S +++S TS++ S S + SST S + V V
Sbjct: 514 SSTTSSTTSSTTSSTTSSTTSSTTSSTTSSTTSSTSSSTSSSTSSSTTSSTTTSGVQVPV 573

Query: 230 STNDM--PWCTTGSTCSQTRASCNLSAKETCLISLHGCILTK 349
+T+ P TT CS +C A + +L GCI+ +
Sbjct: 574 TTSASTNPPATTSQACSPVDCNCKDDAYKAYHRNLAGCIVCR 615


>tr|Q2HG84|Q2HG84_CHAGB Putative uncharacterized protein
OS=Chaetomium globosum GN=CHGG_00770 PE=4 SV=1
Length = 586

Score = 34.3 bits (77), Expect = 3.1
Identities = 21/91 (23%), Positives = 39/91 (42%)
Frame = +2

Query: 44 NPSSCPLHVTCAPRASVCSEPCVVRAIPSSNSHTSASGLSISMND*IMAESSTLKSSISV 223
NP+ C + C P +C+ S+ S T++S S + + + SST K++ +
Sbjct: 271 NPADCYVERGCQPAFGICASNSNSTTTSSATSGTTSSATSTTSSTKTTSSSSTTKTTSTS 330

Query: 224 IVSTNDMPWCTTGSTCSQTRASCNLSAKETC 316
++ P TT + S + +TC
Sbjct: 331 TTPSSSTPGTTTAPSTEIPGVSNLPACGQTC 361


>tr|B6KIX7|B6KIX7_TOXGO Ribonuclease II RNB family protein
OS=Toxoplasma gondii ME49 GN=TGME49_002720 PE=4 SV=1
Length = 1882

Score = 33.9 bits (76), Expect = 4.1
Identities = 19/42 (45%), Positives = 26/42 (61%)
Frame = +2

Query: 32 SNPWNPSSCPLHVTCAPRASVCSEPCVVRAIPSSNSHTSASG 157
S P +PSS P HV+ +P + S P +V A SS+S +S SG
Sbjct: 63 SEPHSPSSLPCHVSLSPE-NASSAPSLVSASSSSSSASSDSG 103


>tr|Q28Y80|Q28Y80_DROPS GA11188 OS=Drosophila pseudoobscura
pseudoobscura GN=GA11188 PE=4 SV=2
Length = 891

Score = 33.5 bits (75), Expect = 5.4
Identities = 24/108 (22%), Positives = 46/108 (42%), Gaps = 2/108 (1%)
Frame = +2

Query: 23 HTNSNPWNPSSCPLHVTCAPRASVCSEPCVVRAIPSSNSHTSASGLSISMND*IMAESST 202
H+ S P + ++ P C+ RA+ ++ V+++ + I + +
Sbjct: 96 HSLSTPSSTNNSPTTTPCSSRATSYAQLLSVQSLVHN----------ILPQSELQTDQEA 145

Query: 203 LKSSISVIVSTNDMPWCTTGSTC--SQTRASCNLSAKETCLISLHGCI 340
K + D+ TTGS+ S + +SCN+S + TC+ S I
Sbjct: 146 TKDEVDRAAKLPDLLTITTGSSSIISSSNSSCNISNRSTCIASASSSI 193


>tr|Q8CAY5|Q8CAY5_MOUSE Novel member of the keratin associated
protein (Krtap) family OS=Mus musculus
GN=OTTMUSG00000002177 PE=2 SV=1
Length = 177

Score = 33.1 bits (74), Expect = 7.0
Identities = 30/118 (25%), Positives = 50/118 (42%), Gaps = 19/118 (16%)
Frame = +2

Query: 26 TNSNPWNPSSCPLHVTCAPRASVCSEPCVVRAIPSSNSHTSASGLSI----SMND*IMAE 193
++S + SSCP+ + CAP S CS PC ++I ++ T S + + D +
Sbjct: 19 SSSESSSESSCPVFICCAP--SWCSTPCCCKSICCHSTKTVNSCSQLCCPPTCCDPASCD 76

Query: 194 SSTLKSSISVIVSTND-------MPWCTTGS--------TCSQTRASCNLSAKETCLI 322
S+ K + I + +P C S TC +T +S K +C+I
Sbjct: 77 SNCCKPTCVTICCSTPCCQPSCCVPTCCQPSLFQLCCQPTCCETSCCKTISFKPSCVI 134