BP912577
Clone id YMU001_000020_E07
Library
Length 419
Definition Adiantum capillus-veneris mRNA. clone: YMU001_000020_E07.
Accession
Tissue type prothallium
Developmental stage -
Contig ID
Sequence
TCTGTGCTTCCTACGTCGCTTCTTCTGAACAGGCATACTTGAGCTACCTAGCTCGGGAAT
TGTATGCTTTTTTGAGCTGTCTAAGCTTGCATGAGAAAGGACACTGAAAGAGTCTGCATT
GGCCTTTGAAGCAATCACAGACCTTGCAAAGCCTCTGGTCGACTCTGGCTGACTTCTTGC
TTGCTGATGTTTTCCACGAGCTTCGTGACACACTACTTGAGGGAGCTGCAAATTGTGGCG
TCTGTCTCGCAATACGTTCCGTGATATTCTTTTCCTTTGAATTTGGATGTTGCATTTGGT
CATCCAAATTTCGACCCTTTATCTTGTTATTCGAACCTACTGGATAGTGCCTTTGACTAT
CCTGCACTAAGTCCTTATTTCTGAAAATTGGGCCAGGTCCTTGCTTGCGTGATTTTGAG
Homology search results
sp_hit_id O88466
Definition sp|O88466|ZF106_MOUSE Zinc finger protein 106 OS=Mus musculus
Align length 92
Score (bit) 39.7
E-value 0.005
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= BP912577|Adiantum capillus-veneris mRNA, clone:
YMU001_000020_E07.
(419 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters




Sequences producing significant alignments: Score (bits), E-value

sp|O88466|ZF106_MOUSE Zinc finger protein 106 OS=Mus musculus GN... 40 0.005
sp|Q11BH8|TYPH_MESSB Putative thymidine phosphorylase OS=Mesorhi... 31 1.7
sp|Q80KJ6|CAPSD_MASV1 Capsid polyprotein OS=Mink astrovirus 1 GN... 31 2.2
sp|Q07950|YEH2_YEAST Sterol esterase 2 OS=Saccharomyces cerevisi... 30 4.9
sp|O74808|CLR1_SCHPO Cryptic loci regulator protein 1 OS=Schizos... 30 4.9
sp|Q46JH4|TRPC_PROMT Indole-3-glycerol phosphate synthase OS=Pro... 29 6.4
sp|A2C462|TRPC_PROM1 Indole-3-glycerol phosphate synthase OS=Pro... 29 6.4
sp|O46232|HUNB_DROAD Protein hunchback (Fragments) OS=Drosophila... 29 6.4
sp|P72481|UVRA_STRMU UvrABC system protein A OS=Streptococcus mu... 29 8.4
sp|O88563|MRP3_RAT Canalicular multispecific organic anion trans... 29 8.4
sp|P38138|GLU2A_YEAST Glucosidase 2 subunit alpha OS=Saccharomyc... 29 8.4
sp|Q54Q69|DHKG_DICDI Hybrid signal transduction histidine kinase... 29 8.4

>sp|O88466|ZF106_MOUSE Zinc finger protein 106 OS=Mus musculus
GN=Zfp106 PE=1 SV=2
Length = 1888

Score = 39.7 bits (91), Expect = 0.005
Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 10/92 (10%)
Frame = -3

Query: 258 NVLRDRRHNLQLPQ----VVCHEARGKHQQARSQPESTR------GFARSVIASKANADS 109
N+ RRH+ QLP V H AR H Q RS P S R G S+ ++ ++ +
Sbjct: 945 NIPTQRRHSAQLPSGHIMPVMHSARDLHSQERSTPLSERHAQESTGEGNSLSSNASSGHA 1004

Query: 108 FSVLSHASLDSSKKHTIPELGSSSMPVQKKRR 13
S L+ A+ DSS + S ++KKRR
Sbjct: 1005 VSSLADAATDSSCTSGAEQTDGHS--IRKKRR 1034
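
The percentages printed on the Identities, Positives and Gaps lines of this report are consistent with integer truncation of count / alignment length rather than rounding (for example, 44/92 is about 47.8% but is printed as 47%). A minimal Python sketch of that arithmetic follows; the function name `blast_percent` is hypothetical and is not part of BLAST itself.

```python
def blast_percent(count: int, align_length: int) -> int:
    """Percentage as it appears in this report: truncated, not rounded."""
    if align_length <= 0:
        raise ValueError("align_length must be positive")
    return (100 * count) // align_length


# Counts taken from the ZF106_MOUSE alignment above (alignment length 92).
identities = blast_percent(31, 92)
positives = blast_percent(44, 92)
gaps = blast_percent(10, 92)
print(
    f"Identities = 31/92 ({identities}%), "
    f"Positives = 44/92 ({positives}%), "
    f"Gaps = 10/92 ({gaps}%)"
)
```

The same truncation reproduces the other alignments in this record, such as 17/27 positives reported as 62%.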


>sp|Q11BH8|TYPH_MESSB Putative thymidine phosphorylase
OS=Mesorhizobium sp. (strain BNC1) GN=Meso_3880 PE=3
SV=1
Length = 509

Score = 31.2 bits (69), Expect = 1.7
Identities = 22/67 (32%), Positives = 32/67 (47%)
Frame = -3

Query: 270 RISRNVLRDRRHNLQLPQVVCHEARGKHQQARSQPESTRGFARSVIASKANADSFSVLSH 91
R+ R +L + + + CH R + ARSQ T G AR V A+ D +VLSH
Sbjct: 16 RVRRLLLHTQHQPVVVMHTDCHVCRSEGLAARSQVLLTNG-ARQVHATLFQVDGDAVLSH 74

Query: 90 ASLDSSK 70
+ S+
Sbjct: 75 DEVGLSE 81


>sp|Q80KJ6|CAPSD_MASV1 Capsid polyprotein OS=Mink astrovirus 1
GN=ORF2 PE=3 SV=1
Length = 775

Score = 30.8 bits (68), Expect = 2.2
Identities = 17/49 (34%), Positives = 25/49 (51%)
Frame = -2

Query: 418 SKSRKQGPGPIFRNKDLVQDSQRHYPVGSNNKIKGRNLDDQMQHPNSKE 272
S R QGPG + +K RH P +NNK R +D++++ KE
Sbjct: 30 SAQRNQGPGKRWNSK-----KGRHMPKNNNNKGMKRTVDNEVKQKLKKE 73


>sp|Q07950|YEH2_YEAST Sterol esterase 2 OS=Saccharomyces cerevisiae
GN=YEH2 PE=1 SV=1
Length = 538

Score = 29.6 bits (65), Expect = 4.9
Identities = 17/52 (32%), Positives = 24/52 (46%)
Frame = -2

Query: 412 SRKQGPGPIFRNKDLVQDSQRHYPVGSNNKIKGRNLDDQMQHPNSKEKNITE 257
S K+G IF D + +P+ N+ G NLD+ H N K +N E
Sbjct: 423 SFKKGAEKIF------PDKKTWFPIAKNDDDSGNNLDNNKLHLNPKRQNSEE 468


>sp|O74808|CLR1_SCHPO Cryptic loci regulator protein 1
OS=Schizosaccharomyces pombe GN=clr1 PE=1 SV=1
Length = 1238

Score = 29.6 bits (65), Expect = 4.9
Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 6/65 (9%)
Frame = -3

Query: 213 VCHEARGKHQQARSQPESTRGFARSVIASKANADSFSVLSHASLD------SSKKHTIPE 52
VC+ R KH + +Q S+ G +S+ + ++ + +S + D SKK I +
Sbjct: 487 VCYPTRNKHSEISAQSSSSLGVTKSLASEVYSSSTVDTISKLNTDKDNYLIKSKKEPIQQ 546

Query: 51 LGSSS 37
SS
Sbjct: 547 KSVSS 551


>sp|Q46JH4|TRPC_PROMT Indole-3-glycerol phosphate synthase
OS=Prochlorococcus marinus (strain NATL2A) GN=trpC PE=3
SV=1
Length = 295

Score = 29.3 bits (64), Expect = 6.4
Identities = 14/27 (51%), Positives = 17/27 (62%)
Frame = -2

Query: 331 NNKIKGRNLDDQMQHPNSKEKNITERI 251
N IK NL+ + HP+SK KNI E I
Sbjct: 9 NPSIKVANLEYAIPHPDSKPKNILEEI 35


>sp|A2C462|TRPC_PROM1 Indole-3-glycerol phosphate synthase
OS=Prochlorococcus marinus (strain NATL1A) GN=trpC PE=3
SV=1
Length = 295

Score = 29.3 bits (64), Expect = 6.4
Identities = 14/27 (51%), Positives = 17/27 (62%)
Frame = -2

Query: 331 NNKIKGRNLDDQMQHPNSKEKNITERI 251
N IK NL+ + HP+SK KNI E I
Sbjct: 9 NPSIKVANLEYAIPHPDSKPKNILEEI 35


>sp|O46232|HUNB_DROAD Protein hunchback (Fragments) OS=Drosophila
adiastola GN=hb PE=3 SV=2
Length = 192

Score = 29.3 bits (64), Expect = 6.4
Identities = 19/60 (31%), Positives = 31/60 (51%)
Frame = -3

Query: 234 NLQLPQVVCHEARGKHQQARSQPESTRGFARSVIASKANADSFSVLSHASLDSSKKHTIP 55
NLQL Q + H+ + + QQ + QP T A ++ S +N D S L+ L + + +P
Sbjct: 59 NLQLEQYLKHQQQQQQQQHQQQPMDTL-CAAAMTPSPSNNDQNSPLTWPGLPNPMQSIMP 117


>sp|P72481|UVRA_STRMU UvrABC system protein A OS=Streptococcus
mutans GN=uvrA PE=3 SV=2
Length = 943

Score = 28.9 bits (63), Expect = 8.4
Identities = 18/66 (27%), Positives = 30/66 (45%)
Frame = -3

Query: 264 SRNVLRDRRHNLQLPQVVCHEARGKHQQARSQPESTRGFARSVIASKANADSFSVLSHAS 85
+RNV+R + +LP CH R Q + G ++ + AD S+L+H
Sbjct: 392 TRNVMRSYMN--ELPCATCHGYRLNDQALSVRVGGKEGLNIGQVSELSIADHLSLLTHLE 449

Query: 84 LDSSKK 67
L ++K
Sbjct: 450 LSENEK 455


>sp|O88563|MRP3_RAT Canalicular multispecific organic anion
transporter 2 OS=Rattus norvegicus GN=Abcc3 PE=2 SV=1
Length = 1522

Score = 28.9 bits (63), Expect = 8.4
Identities = 14/34 (41%), Positives = 20/34 (58%)
Frame = -3

Query: 141 SVIASKANADSFSVLSHASLDSSKKHTIPELGSS 40
SVI + F VLS A +DS++K T P + S+
Sbjct: 1154 SVIRAYGRVQDFKVLSDAKVDSNQKTTYPYIASN 1187


tr_hit_id A2AKH3
Definition tr|A2AKH3|A2AKH3_MOUSE Zinc finger protein 106 OS=Mus musculus
Align length 92
Score (bit) 40.0
E-value 0.058
Report
BLASTX 2.2.19 [Nov-02-2008]



Query= BP912577|Adiantum capillus-veneris mRNA, clone:
YMU001_000020_E07.
(419 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters




Sequences producing significant alignments: Score (bits), E-value

tr|A2AKH3|A2AKH3_MOUSE Zinc finger protein 106 OS=Mus musculus G... 40 0.058
tr|Q4DLC7|Q4DLC7_TRYCR Dispersed gene family protein 1 (DGF-1), ... 35 1.4
tr|B7BPM8|B7BPM8_9CLOT Putative uncharacterized protein OS=Clost... 35 1.9
tr|B6KK30|B6KK30_TOXGO Putative uncharacterized protein OS=Toxop... 35 1.9
tr|Q5C6L3|Q5C6L3_SCHJA SJCHGC02391 protein (Fragment) OS=Schisto... 34 4.1
tr|Q4D8D4|Q4D8D4_TRYCR Dispersed gene family protein 1 (DGF-1), ... 33 5.4
tr|Q4PEC0|Q4PEC0_USTMA Putative uncharacterized protein OS=Ustil... 33 7.0
tr|Q4DRQ0|Q4DRQ0_TRYCR Dispersed gene family protein 1 (DGF-1), ... 33 9.2

>tr|A2AKH3|A2AKH3_MOUSE Zinc finger protein 106 OS=Mus musculus
GN=Zfp106 PE=4 SV=1
Length = 1888

Score = 40.0 bits (92), Expect = 0.058
Identities = 31/92 (33%), Positives = 44/92 (47%), Gaps = 10/92 (10%)
Frame = -3

Query: 258 NVLRDRRHNLQLPQ----VVCHEARGKHQQARSQPESTR------GFARSVIASKANADS 109
N+ RRH+ QLP V H AR H Q RS P S R G S+ ++ ++ +
Sbjct: 945 NIPTQRRHSAQLPSGHIMPVMHSARDLHSQERSTPLSERHAQESTGEGNSLTSNASSGHA 1004

Query: 108 FSVLSHASLDSSKKHTIPELGSSSMPVQKKRR 13
S L+ A+ DSS + S ++KKRR
Sbjct: 1005 VSSLADAATDSSCTSGAEQTDGHS--IRKKRR 1034


>tr|Q4DLC7|Q4DLC7_TRYCR Dispersed gene family protein 1 (DGF-1),
putative (Fragment) OS=Trypanosoma cruzi
GN=Tc00.1047053506595.10 PE=4 SV=1
Length = 483

Score = 35.4 bits (80), Expect = 1.4
Identities = 25/84 (29%), Positives = 37/84 (44%)
Frame = +3

Query: 6 ASYVASSEQAYLSYLARELYAFLSCLSLHEKGH*KSLHWPLKQSQTLQSLWSTLADFLLA 185
A+ V + L A ++A +SC S E+G +L + L L LW + + LLA
Sbjct: 119 AAVVTGGPGSILEMQALGVFARMSCASAQERGSTVALPYFLSVFAALDPLWMVVGNALLA 178

Query: 186 DVFHELRDTLLEGAANCGVCLAIR 257
VF G +CGV A +
Sbjct: 179 AVF---------GCVHCGVTAAFQ 193
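
The Frame values and Query coordinates in these alignments can be checked by translating the nucleotide query directly. The sketch below is an illustration, not the BLASTX implementation: it assumes the standard genetic code and the convention that frame +k starts at query base k while frame -k starts at base k of the reverse complement. The names `reverse_complement` and `translate_frame` are hypothetical helpers.

```python
# Query sequence copied from the Sequence field of this record (419 nt).
QUERY = (
    "TCTGTGCTTCCTACGTCGCTTCTTCTGAACAGGCATACTTGAGCTACCTAGCTCGGGAAT"
    "TGTATGCTTTTTTGAGCTGTCTAAGCTTGCATGAGAAAGGACACTGAAAGAGTCTGCATT"
    "GGCCTTTGAAGCAATCACAGACCTTGCAAAGCCTCTGGTCGACTCTGGCTGACTTCTTGC"
    "TTGCTGATGTTTTCCACGAGCTTCGTGACACACTACTTGAGGGAGCTGCAAATTGTGGCG"
    "TCTGTCTCGCAATACGTTCCGTGATATTCTTTTCCTTTGAATTTGGATGTTGCATTTGGT"
    "CATCCAAATTTCGACCCTTTATCTTGTTATTCGAACCTACTGGATAGTGCCTTTGACTAT"
    "CCTGCACTAAGTCCTTATTTCTGAAAATTGGGCCAGGTCCTTGCTTGCGTGATTTTGAG"
)

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_frame(seq: str, frame: int) -> str:
    """Translate seq in reading frame +1..+3 or -1..-3 ('*' marks a stop)."""
    if frame not in (1, 2, 3, -1, -2, -3):
        raise ValueError("frame must be one of +/-1, +/-2, +/-3")
    strand = seq if frame > 0 else reverse_complement(seq)
    start = abs(frame) - 1
    codons = (strand[i:i + 3] for i in range(start, len(strand) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


plus3 = translate_frame(QUERY, 3)
minus3 = translate_frame(QUERY, -3)

# Query base 6 is the second codon of frame +3, so skip the first residue.
print(plus3[1:19])
# In frame -3 the codon that starts at query base 258 has index (417 - 258) // 3.
first = (len(QUERY) - 2 - 258) // 3
print(minus3[first:first + 14])
```

Under these assumptions the two printed fragments should match the start of the Query lines in the Frame = +3 and Frame = -3 alignments reported for this clone.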


>tr|B7BPM8|B7BPM8_9CLOT Putative uncharacterized protein
OS=Clostridium hylemonae DSM 15053 GN=CLOHYLEM_02478
PE=4 SV=1
Length = 137

Score = 35.0 bits (79), Expect = 1.9
Identities = 18/38 (47%), Positives = 25/38 (65%), Gaps = 4/38 (10%)
Frame = -2

Query: 154 RLCKVCDCFKGQ--CRLFQCP-FSCKL-RQLKKAYNSR 53
R CK+ DC KG+ F+CP + CKL + L+K+YN R
Sbjct: 48 RKCKIKDCIKGKGLAYCFECPDYPCKLIKNLEKSYNKR 85


>tr|B6KK30|B6KK30_TOXGO Putative uncharacterized protein
OS=Toxoplasma gondii ME49 GN=TGME49_033440 PE=4 SV=1
Length = 1460

Score = 35.0 bits (79), Expect = 1.9
Identities = 26/92 (28%), Positives = 41/92 (44%), Gaps = 1/92 (1%)
Frame = -3

Query: 276 RKRISRNVLRDR-RHNLQLPQVVCHEARGKHQQARSQPESTRGFARSVIASKANADSFSV 100
R+R S++ R R + + P + AR H+ S S F+ S ++S + + S S
Sbjct: 715 RRRHSQSQFRGLFRSSRRYPFLSLAHARLHHRSTSSLSSSASSFSSSSVSSSSTSSSLSS 774

Query: 99 LSHASLDSSKKHTIPELGSSSMPVQKKRRRKH 4
S + SS + L SSS RR+H
Sbjct: 775 FSSLASSSSDSLSSSSLSSSSRTSSSASRRRH 806


>tr|Q5C6L3|Q5C6L3_SCHJA SJCHGC02391 protein (Fragment)
OS=Schistosoma japonicum PE=2 SV=2
Length = 165

Score = 33.9 bits (76), Expect = 4.1
Identities = 18/49 (36%), Positives = 23/49 (46%)
Frame = -3

Query: 312 EIWMTKCNIQIQRKRISRNVLRDRRHNLQLPQVVCHEARGKHQQARSQP 166
EI+ K N I R+ LR RHN+Q VC RG Q+ + P
Sbjct: 38 EIYQKKINRTIIETRLKAKALRKTRHNVQKQTDVCRSDRGSVQEIQFLP 86


>tr|Q4D8D4|Q4D8D4_TRYCR Dispersed gene family protein 1 (DGF-1),
putative OS=Trypanosoma cruzi GN=Tc00.1047053509925.70
PE=4 SV=1
Length = 3452

Score = 33.5 bits (75), Expect = 5.4
Identities = 25/84 (29%), Positives = 35/84 (41%)
Frame = +3

Query: 6 ASYVASSEQAYLSYLARELYAFLSCLSLHEKGH*KSLHWPLKQSQTLQSLWSTLADFLLA 185
A V + L A ++A +SC S E+ +L + L L LW L + LLA
Sbjct: 3055 AMVVTGGLGSILEMQALGVFASMSCASSQERASTVALQYFLSVFAALDPLWMVLGNALLA 3114

Query: 186 DVFHELRDTLLEGAANCGVCLAIR 257
VF G +CGV A +
Sbjct: 3115 AVF---------GCVHCGVTAAFQ 3129


>tr|Q4PEC0|Q4PEC0_USTMA Putative uncharacterized protein OS=Ustilago
maydis GN=UM01543.1 PE=4 SV=1
Length = 619

Score = 33.1 bits (74), Expect = 7.0
Identities = 28/93 (30%), Positives = 43/93 (46%)
Frame = -3

Query: 318 RVEIWMTKCNIQIQRKRISRNVLRDRRHNLQLPQVVCHEARGKHQQARSQPESTRGFARS 139
R E+W K Q++R VL R N QLP + H A +Q + ++
Sbjct: 142 RPELWF-KAVKQLER------VLEATRPNAQLPLLSPHNGSANDVAAVAQVRAVAEACKN 194

Query: 138 VIASKANADSFSVLSHASLDSSKKHTIPELGSS 40
++ASK S+ VL HA++ SS + L +S
Sbjct: 195 IVASKLR--SYLVLPHATIRSSVTTNLQVLQTS 225


>tr|Q4DRQ0|Q4DRQ0_TRYCR Dispersed gene family protein 1 (DGF-1),
putative OS=Trypanosoma cruzi GN=Tc00.1047053510175.60
PE=4 SV=1
Length = 3464

Score = 32.7 bits (73), Expect = 9.2
Identities = 22/73 (30%), Positives = 34/73 (46%)
Frame = +3

Query: 39 LSYLARELYAFLSCLSLHEKGH*KSLHWPLKQSQTLQSLWSTLADFLLADVFHELRDTLL 218
L A ++A +SC+S+ E+ +L + L L LW + + LLA VF
Sbjct: 3078 LEMQALGVFARMSCVSVQERASTVALPYFLSVFAALDPLWMVVGNALLAAVF-------- 3129

Query: 219 EGAANCGVCLAIR 257
G +CGV A +
Sbjct: 3130 -GCVHCGVTAAFQ 3141