BP913724
Clone id YMU001_000033_E10
Library
Length 536
Definition Adiantum capillus-veneris mRNA. clone: YMU001_000033_E10.
Accession
Tissue type prothallium
Developmental stage -
Contig ID
Sequence
CAAAATTGCGGGACAATACCTTACTGCTCAAACAAGAGGACCATTCTTTTGCCACACGCT
TTGTACAAACAGACCATGTATGGTAACCTCTACCGTCGCCACATGCACCAGCTCAGCGAT
ATTCCACAATCGTGATCAAGAAAATATGGATTAAAAAATCCATCATGCAAATCAATCCTA
AGCAATTGTGGATACCGAGAACGGACAAAACAATACAAGTCCACCTACATTGCCTCCATC
GCAAATCCGACCTCGCCAAATTGCCTACGGATTTTGTCAGACAACCTAGGCCAATGGAAC
AACACCCTTGCAAACCACACAATGTCAAGTGGGTATGGAAACCAAAACAAACCCGCTCAC
CAGAGGCCTCTTCTTGGAAATACATACACAAGCAATTTCATTGGCAACCAAAACAAAGCC
CTCAAGAAAAACAACCAGTAGCATCACAAATTCTACAGATCATAGTCAAACACAGGACGT
GTCTTCTACAATCACTACTGTTCGGACCACGTAGCTTCAATGTGACCTCGCCACCA
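
The sequence above can be checked against the stated length and against the translated query segments that BLASTX reports further down. What follows is a minimal sketch, not part of the original record: it assumes the standard genetic code (NCBI translation table 1), and the names SEQ, CODON_TABLE and translate_span are hypothetical helpers introduced only for illustration.

```python
# Sketch: verify the record's length field and translate a 1-based,
# inclusive nucleotide span using the standard genetic code (assumed).
# SEQ, CODON_TABLE and translate_span are illustrative names only.

SEQ = (
    "CAAAATTGCGGGACAATACCTTACTGCTCAAACAAGAGGACCATTCTTTTGCCACACGCT"
    "TTGTACAAACAGACCATGTATGGTAACCTCTACCGTCGCCACATGCACCAGCTCAGCGAT"
    "ATTCCACAATCGTGATCAAGAAAATATGGATTAAAAAATCCATCATGCAAATCAATCCTA"
    "AGCAATTGTGGATACCGAGAACGGACAAAACAATACAAGTCCACCTACATTGCCTCCATC"
    "GCAAATCCGACCTCGCCAAATTGCCTACGGATTTTGTCAGACAACCTAGGCCAATGGAAC"
    "AACACCCTTGCAAACCACACAATGTCAAGTGGGTATGGAAACCAAAACAAACCCGCTCAC"
    "CAGAGGCCTCTTCTTGGAAATACATACACAAGCAATTTCATTGGCAACCAAAACAAAGCC"
    "CTCAAGAAAAACAACCAGTAGCATCACAAATTCTACAGATCATAGTCAAACACAGGACGT"
    "GTCTTCTACAATCACTACTGTTCGGACCACGTAGCTTCAATGTGACCTCGCCACCA"
)

# Standard code laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_span(seq, start, end):
    """Translate seq[start..end] (1-based, inclusive) on the forward strand.

    Trailing bases that do not fill a codon are ignored; codons containing
    anything other than A/C/G/T are rendered as 'X'.
    """
    sub = seq[start - 1:end].upper()
    usable = len(sub) - len(sub) % 3
    return "".join(
        CODON_TABLE.get(sub[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    print(len(SEQ))                       # 536, matching the Length field
    print(translate_span(SEQ, 210, 266))  # TIQVHLHCLHRKSDLAKLP
```

Position 210 is a multiple of 3, so a codon starting there lies in forward frame 3, and the translated residues agree with the start of the Frame = +3 query line in the first Swiss-Prot alignment.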
■■ Homology search results ■■
sp_hit_id A6NE52
Definition sp|A6NE52|K1875_HUMAN WD repeat-containing protein KIAA1875 OS=Homo sapiens
Align length 73
Score (bit) 33.9
E-value 0.51
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= BP913724|Adiantum capillus-veneris mRNA, clone:
YMU001_000033_E10.
(536 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters




Sequences producing significant alignments: Score (bits) E-value

sp|A6NE52|K1875_HUMAN WD repeat-containing protein KIAA1875 OS=H... 34 0.51
sp|P50090|KEL2_YEAST Kelch repeat-containing protein 2 OS=Saccha... 33 0.87
sp|P13545|HMB1_STRPU Homeobox protein HB1 OS=Strongylocentrotus ... 32 1.9
sp|Q60519|SEM5B_MOUSE Semaphorin-5B OS=Mus musculus GN=Sema5b PE... 30 5.7
sp|Q9P283|SEM5B_HUMAN Semaphorin-5B OS=Homo sapiens GN=SEMA5B PE... 30 7.4
sp|P18488|EMS_DROME Homeotic protein empty spiracles OS=Drosophi... 30 7.4
sp|P45951|ARP_ARATH Apurinic endonuclease-redox protein OS=Arabi... 30 7.4
sp|P18948|VIT6_CAEEL Vitellogenin-6 OS=Caenorhabditis elegans GN... 30 9.7
sp|Q6C1W9|TRM10_YARLI tRNA (guanine-N(1)-)-methyltransferase OS=... 30 9.7
sp|P27508|PQQF_KLEPN Coenzyme PQQ synthesis protein F OS=Klebsie... 30 9.7
sp|P09814|POLG_TVMV Genome polyprotein OS=Tobacco vein mottling ... 30 9.7
sp|P92524|M880_ARATH Uncharacterized mitochondrial protein AtMg0... 30 9.7

>sp|A6NE52|K1875_HUMAN WD repeat-containing protein KIAA1875 OS=Homo
sapiens GN=KIAA1875 PE=2 SV=2
Length = 1622

Score = 33.9 bits (76), Expect = 0.51
Identities = 18/73 (24%), Positives = 30/73 (41%)
Frame = +3

Query: 210 TIQVHLHCLHRKSDLAKLPTDFVRQPRPMEQHPCKPHNVKWVWKPKQTRSPEASSWKYIH 389
T+Q H HCL +P V Q + P + W+W+P+ P + W+
Sbjct: 1030 TVQPHKHCLRPICFPGYVPNSAVLQQMWLNAEPGASQDALWLWRPR----PSQTQWQ--R 1083

Query: 390 KQFHWQPKQSPQE 428
K W ++ +E
Sbjct: 1084 KLLQWMGEKPGEE 1096


>sp|P50090|KEL2_YEAST Kelch repeat-containing protein 2
OS=Saccharomyces cerevisiae GN=KEL2 PE=1 SV=1
Length = 882

Score = 33.1 bits (74), Expect = 0.87
Identities = 16/69 (23%), Positives = 32/69 (46%), Gaps = 1/69 (1%)
Frame = +3

Query: 285 PRPMEQHPCKPHNVK-WVWKPKQTRSPEASSWKYIHKQFHWQPKQSPQEKQPVASQILQI 461
P P+ H ++ K WV+ + ++ +++Y Q W ++ EK P + +
Sbjct: 254 PPPLTNHTMVAYDNKLWVFGGETPKTISNDTYRYDPAQSEWSKVKTTGEKPPPIQEHASV 313

Query: 462 IVKHRTCLL 488
+ KH C+L
Sbjct: 314 VYKHLMCVL 322


>sp|P13545|HMB1_STRPU Homeobox protein HB1 OS=Strongylocentrotus
purpuratus PE=2 SV=2
Length = 308

Score = 32.0 bits (71), Expect = 1.9
Identities = 18/65 (27%), Positives = 32/65 (49%), Gaps = 2/65 (3%)
Frame = +1

Query: 160 PSCKSILSNCGYRE--RTKQYKSTYIASIANPTSPNCLRILSDNLGQWNNTLANHTMSSG 333
PS LS+C + + +T Y S+ S+ + P C + L TL+N + ++G
Sbjct: 27 PSGSYELSSCAFSKNPKTSSYSSSSSPSLVATSKPPCTQQLGAATFYGGGTLSNFSTTAG 86

Query: 334 YGNQN 348
YG+ +
Sbjct: 87 YGDHS 91


>sp|Q60519|SEM5B_MOUSE Semaphorin-5B OS=Mus musculus GN=Sema5b PE=2
SV=1
Length = 1093

Score = 30.4 bits (67), Expect = 5.7
Identities = 20/76 (26%), Positives = 28/76 (36%), Gaps = 8/76 (10%)
Frame = -2

Query: 505 PNSSDCRRHVLCLTMICRICDATGCFS*GLCFGCQ*NCLCMYFQEEAS--------GERV 350
P D + C C + A C++ C +C ++Q S GE +
Sbjct: 833 PCVGDAAEYQDCNPQACPVRGAWSCWT--AWSQCSASCGGGHYQRTRSCTSPAPSPGEDI 890

Query: 349 CFGFHTHLTLCGLQGC 302
C G HT LC Q C
Sbjct: 891 CLGLHTEEALCSTQAC 906


>sp|Q9P283|SEM5B_HUMAN Semaphorin-5B OS=Homo sapiens GN=SEMA5B PE=2
SV=4
Length = 1151

Score = 30.0 bits (66), Expect = 7.4
Identities = 20/76 (26%), Positives = 28/76 (36%), Gaps = 8/76 (10%)
Frame = -2

Query: 505 PNSSDCRRHVLCLTMICRICDATGCFS*GLCFGCQ*NCLCMYFQEEAS--------GERV 350
P D + C C + A C++ C +C ++Q S GE +
Sbjct: 891 PCVGDAAEYQDCNPQACPVRGAWSCWT--SWSPCSASCGGGHYQRTRSCTSPAPSPGEDI 948

Query: 349 CFGFHTHLTLCGLQGC 302
C G HT LC Q C
Sbjct: 949 CLGLHTEEALCATQAC 964


>sp|P18488|EMS_DROME Homeotic protein empty spiracles OS=Drosophila
melanogaster GN=ems PE=2 SV=2
Length = 497

Score = 30.0 bits (66), Expect = 7.4
Identities = 34/140 (24%), Positives = 53/140 (37%), Gaps = 2/140 (1%)
Frame = +3

Query: 66 QTDHVW*PLPSPH-APAQRYSTXXXXXXXXXXXXXXXNPKQLWIPRTDKTIQVHLHCLHR 242
Q H+ P P PH +PAQ++ + P++ + IQ L LH
Sbjct: 102 QQQHLQAPHPHPHLSPAQQH---------VLHQHLLMQHQHPGTPKSHQDIQELLQRLHH 152

Query: 243 KSDLAKLPTDFVRQPRPMEQHPCKPHNVKWVWKPKQTRSPEASSWKYIHKQFHWQPKQ-S 419
+ +A + + P + P ++K P +A + + H PK S
Sbjct: 153 NAAMASGLSPLQTRLSPETEQPQMAVSLKRERSPAPPAMEQAENPAQRIQPPHTPPKSVS 212

Query: 420 PQEKQPVASQILQIIVKHRT 479
PQ QP +S L I H T
Sbjct: 213 PQSSQPSSSPTLLISSPHAT 232


>sp|P45951|ARP_ARATH Apurinic endonuclease-redox protein
OS=Arabidopsis thaliana GN=ARP PE=2 SV=2
Length = 536

Score = 30.0 bits (66), Expect = 7.4
Identities = 13/38 (34%), Positives = 23/38 (60%)
Frame = +1

Query: 205 TKQYKSTYIASIANPTSPNCLRILSDNLGQWNNTLANH 318
T ++ S Y+ + P S + L+ LS + +W+ TL+NH
Sbjct: 377 TAEFDSFYLINTYVPNSGDGLKRLSYRIEEWDRTLSNH 414


>sp|P18948|VIT6_CAEEL Vitellogenin-6 OS=Caenorhabditis elegans
GN=vit-6 PE=1 SV=5
Length = 1651

Score = 29.6 bits (65), Expect = 9.7
Identities = 12/34 (35%), Positives = 20/34 (58%)
Frame = +3

Query: 393 QFHWQPKQSPQEKQPVASQILQIIVKHRTCLLQS 494
Q H+QP+ S E++PV +I K C+++S
Sbjct: 1313 QIHYQPRNSRYEQKPVMEKIAHHASKQANCVVKS 1346


>sp|Q6C1W9|TRM10_YARLI tRNA (guanine-N(1)-)-methyltransferase
OS=Yarrowia lipolytica GN=TRM10 PE=3 SV=1
Length = 371

Score = 29.6 bits (65), Expect = 9.7
Identities = 20/68 (29%), Positives = 29/68 (42%), Gaps = 8/68 (11%)
Frame = +3

Query: 288 RPMEQHPCKPHNVKWVWKPKQTRSPEASS---WKYIHKQFHWQPKQS-----PQEKQPVA 443
R E HP P NV + K PE S WK K+ W+ K+ +EK+ A
Sbjct: 29 RTSEPHPKNPQNVPAPYNTKTAVIPEGMSKNEWKKAQKKAIWESKKDEIAAVKKEKKKAA 88

Query: 444 SQILQIIV 467
+ Q+ +
Sbjct: 89 RKRKQLAI 96


>sp|P27508|PQQF_KLEPN Coenzyme PQQ synthesis protein F OS=Klebsiella
pneumoniae GN=pqqF PE=3 SV=1
Length = 761

Score = 29.6 bits (65), Expect = 9.7
Identities = 17/62 (27%), Positives = 33/62 (53%)
Frame = +3

Query: 240 RKSDLAKLPTDFVRQPRPMEQHPCKPHNVKWVWKPKQTRSPEASSWKYIHKQFHWQPKQS 419
R +L + DF+RQ PM++ +P + + + +R PEA + + +Q + P+ +
Sbjct: 669 RAGELLRCGKDFLRQLAPMDEATFRPLQQRLAAQIRASRPPEARALSAL-RQEYGLPELT 727

Query: 420 PQ 425
PQ
Sbjct: 728 PQ 729
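
Each alignment in the report carries a "Score = ... bits (...), Expect = ..." line followed by an "Identities = a/b (p%)" line. The sketch below shows one way such a pair of lines could be turned into numbers; it is an illustration only, the function name parse_hsp_header and the returned keys are hypothetical, and the patterns assume the spacing seen in this record.

```python
import re

# Patterns written against the line layout seen in this record (assumption:
# other BLAST versions or output formats may lay these lines out differently).
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s+bits\s+\((\d+)\),\s+Expect\s*=\s*(\S+)"
)
IDENT_RE = re.compile(r"Identities\s*=\s*(\d+)/(\d+)\s+\((\d+)%\)")


def parse_hsp_header(score_line, ident_line):
    """Extract score, E-value and identity counts from two HSP header lines."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(ident_line)
    if s is None or i is None:
        raise ValueError("lines do not look like a BLAST HSP header")

    evalue_text = s.group(3).rstrip(",")
    # Some BLAST outputs abbreviate small values as e.g. "e-10";
    # float() needs a leading digit, so add one in that case.
    if evalue_text.lower().startswith("e"):
        evalue_text = "1" + evalue_text

    matches, length = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(evalue_text),
        "identities": matches,
        "align_length": length,
        "percent_identity": 100.0 * matches / length,
        "reported_percent": int(i.group(3)),
    }


if __name__ == "__main__":
    hsp = parse_hsp_header(
        "Score = 33.9 bits (76), Expect = 0.51",
        "Identities = 18/73 (24%), Positives = 30/73 (41%)",
    )
    print(hsp["bits"], hsp["evalue"], hsp["align_length"])
```

For the top Swiss-Prot hit, 18/73 is about 24.7%, which the report shows as 24%; the percentages in this record are consistent with rounding down rather than rounding to nearest.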


tr_hit_id A7TJ78
Definition tr|A7TJ78|A7TJ78_VANPO Putative uncharacterized protein OS=Vanderwaltozyma polyspora (strain ATCC 22028 / DSM 70294)
Align length 69
Score (bit) 37.4
E-value 0.5
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= BP913724|Adiantum capillus-veneris mRNA, clone:
YMU001_000033_E10.
(536 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters




Sequences producing significant alignments: Score (bits) E-value

tr|A7TJ78|A7TJ78_VANPO Putative uncharacterized protein OS=Vande... 37 0.50
tr|Q6MKE3|Q6MKE3_BDEBA Putative uncharacterized protein OS=Bdell... 36 1.4
tr|A7ZFI8|A7ZFI8_CAMC1 Formate dehydrogenase, cytochrome B subun... 36 1.4
tr|B7VP70|B7VP70_VIBSP Putative transcriptional regulator, LysR ... 36 1.4
tr|A3Y1G5|A3Y1G5_9VIBR Transcriptional regulator, LysR family pr... 36 1.4
tr|Q54NK4|Q54NK4_DICDI Putative uncharacterized protein OS=Dicty... 36 1.4
tr|A8X8R2|A8X8R2_CAEBR CBR-CRM-1 protein OS=Caenorhabditis brigg... 35 1.9
tr|O96859|O96859_HELER Hox-type homeodomain protein (Fragment) O... 35 2.5
tr|A0DDS8|A0DDS8_PARTE Chromosome undetermined scaffold_47, whol... 35 2.5
tr|A7V6C3|A7V6C3_BACUN Putative uncharacterized protein OS=Bacte... 34 4.2
tr|A8HPW4|A8HPW4_CHLRE Predicted protein (Fragment) OS=Chlamydom... 34 5.5
tr|Q9NTA2|Q9NTA2_HUMAN Putative uncharacterized protein DKFZp434... 34 5.5
tr|A2U8G4|A2U8G4_BACCO Putative uncharacterized protein OS=Bacil... 33 7.2
tr|B4IAR9|B4IAR9_DROSE GM22129 OS=Drosophila sechellia GN=GM2212... 33 7.2
tr|A6CVJ7|A6CVJ7_9VIBR Probable LysR-family transcription regula... 33 9.3
tr|Q17C39|Q17C39_AEDAE Cyclin-dependent kinase 5 activator OS=Ae... 33 9.3
tr|B5VJH8|B5VJH8_YEAS6 YGR238Cp-like protein (Fragment) OS=Sacch... 33 9.3
tr|B3LI02|B3LI02_YEAS1 Kelch repeat-containing protein 2 OS=Sacc... 33 9.3
tr|A6ZUP6|A6ZUP6_YEAS7 Conserved protein OS=Saccharomyces cerevi... 33 9.3

>tr|A7TJ78|A7TJ78_VANPO Putative uncharacterized protein
OS=Vanderwaltozyma polyspora (strain ATCC 22028 / DSM
70294) GN=Kpol_1004p21 PE=4 SV=1
Length = 755

Score = 37.4 bits (85), Expect = 0.50
Identities = 26/69 (37%), Positives = 33/69 (47%)
Frame = +1

Query: 118 DIPQS*SRKYGLKNPSCKSILSNCGYRERTKQYKSTYIASIANPTSPNCLRILSDNLGQW 297
DIPQ S K L+NP IL N Y ER + +S + N PN L + +N G
Sbjct: 222 DIPQDNSNKKDLRNP----ILINLEYNERNNKNESNF----QNEIQPNNLNVNYNNNGNS 273

Query: 298 NNTLANHTM 324
NN A+ M
Sbjct: 274 NNYSADEDM 282


>tr|Q6MKE3|Q6MKE3_BDEBA Putative uncharacterized protein
OS=Bdellovibrio bacteriovorus GN=Bd2455 PE=4 SV=1
Length = 214

Score = 35.8 bits (81), Expect = 1.4
Identities = 17/39 (43%), Positives = 24/39 (61%)
Frame = +3

Query: 396 FHWQPKQSPQEKQPVASQILQIIVKHRTCLLQSLLFGPR 512
F W KQSP+E++ V S L+++ K +TC L L F R
Sbjct: 114 FEWNAKQSPEEQRFVYSHFLELVPKDQTCELVRLDFSWR 152


>tr|A7ZFI8|A7ZFI8_CAMC1 Formate dehydrogenase, cytochrome B subunit
OS=Campylobacter concisus (strain 13826) GN=Ccon26_17090
PE=4 SV=1
Length = 315

Score = 35.8 bits (81), Expect = 1.4
Identities = 12/22 (54%), Positives = 18/22 (81%)
Frame = -2

Query: 193 IHNCLGLICMMDFLIHIFLITI 128
IHNCLG++C + FL+HI++ I
Sbjct: 245 IHNCLGIVCAVFFLVHIYMAAI 266


>tr|B7VP70|B7VP70_VIBSP Putative transcriptional regulator, LysR
family OS=Vibrio splendidus LGP32 GN=VS_1634 PE=4 SV=1
Length = 311

Score = 35.8 bits (81), Expect = 1.4
Identities = 20/67 (29%), Positives = 28/67 (41%)
Frame = +3

Query: 210 TIQVHLHCLHRKSDLAKLPTDFVRQPRPMEQHPCKPHNVKWVWKPKQTRSPEASSWKYIH 389
+I + L + D+A+ P P Q K N+ PK RSP + Y H
Sbjct: 231 SIPIQLSNFNVAPDMAEATDIIFHLPTPFAQQAAKQRNLVCKRVPKAIRSPSEDVYLYWH 290

Query: 390 KQFHWQP 410
K+FH P
Sbjct: 291 KRFHNDP 297


>tr|A3Y1G5|A3Y1G5_9VIBR Transcriptional regulator, LysR family
protein OS=Vibrio sp. MED222 GN=MED222_15327 PE=4 SV=1
Length = 306

Score = 35.8 bits (81), Expect = 1.4
Identities = 20/67 (29%), Positives = 28/67 (41%)
Frame = +3

Query: 210 TIQVHLHCLHRKSDLAKLPTDFVRQPRPMEQHPCKPHNVKWVWKPKQTRSPEASSWKYIH 389
+I + L + D+A+ P P Q K N+ PK RSP + Y H
Sbjct: 226 SIPIQLSNFNVAPDMAEATDIIFHLPTPFAQQAAKQRNLVCKRVPKAIRSPSEDVYLYWH 285

Query: 390 KQFHWQP 410
K+FH P
Sbjct: 286 KRFHNDP 292


>tr|Q54NK4|Q54NK4_DICDI Putative uncharacterized protein
OS=Dictyostelium discoideum GN=DDB_0186385 PE=4 SV=1
Length = 1177

Score = 35.8 bits (81), Expect = 1.4
Identities = 19/56 (33%), Positives = 29/56 (51%)
Frame = +3

Query: 282 QPRPMEQHPCKPHNVKWVWKPKQTRSPEASSWKYIHKQFHWQPKQSPQEKQPVASQ 449
QP+P Q P +P + +P+Q++ P+ S Q QP+QS Q +QP Q
Sbjct: 626 QPQPQPQQPQQPQQSQQPQQPQQSQQPQQSQQSQ-QSQQPQQPQQSQQSQQPQPQQ 680


>tr|A8X8R2|A8X8R2_CAEBR CBR-CRM-1 protein OS=Caenorhabditis briggsae
GN=Cbr-crm-1 PE=4 SV=2
Length = 884

Score = 35.4 bits (80), Expect = 1.9
Identities = 18/49 (36%), Positives = 24/49 (48%)
Frame = -2

Query: 304 CCSIGLGCLTKSVGNLARSDLRWRQCRWTCIVLSVLGIHNCLGLICMMD 158
CC + LGC T++ L R D W++ T S LG H C +C D
Sbjct: 223 CCPVCLGCQTENQTKLERGD-TWQKDDCTSCTCSELGAHMCEKYMCKTD 270


>tr|O96859|O96859_HELER Hox-type homeodomain protein (Fragment)
OS=Heliocidaris erythrogramma GN=HeHbox1 PE=4 SV=1
Length = 181

Score = 35.0 bits (79), Expect = 2.5
Identities = 18/59 (30%), Positives = 32/59 (54%), Gaps = 2/59 (3%)
Frame = +1

Query: 178 LSNCGYRE--RTKQYKSTYIASIANPTSPNCLRILSDNLGQWNNTLANHTMSSGYGNQN 348
LS+C + + +T Y S+ S+A T P+C + L TL+N + ++GYG+ +
Sbjct: 33 LSSCAFTKNPKTSSYSSSSSPSLAATTKPSCTQQLGAATFYGGGTLSNFSTTAGYGDHS 91


>tr|A0DDS8|A0DDS8_PARTE Chromosome undetermined scaffold_47, whole
genome shotgun sequence OS=Paramecium tetraurelia
GN=GSPATT00016036001 PE=4 SV=1
Length = 2697

Score = 35.0 bits (79), Expect = 2.5
Identities = 33/106 (31%), Positives = 41/106 (38%), Gaps = 12/106 (11%)
Frame = -2

Query: 457 CRICDATGCFS*GLCFGCQ---*NCLC---------MYFQEEASGERVCFGFHTHLTLCG 314
C+ CDATGC S C G + +C+C MY + A C G T C
Sbjct: 1172 CKTCDATGCTS---CLGDRITVPSCVCPDGYFDSYQMYCTQCAKKCITCNGSAELCTACS 1228

Query: 313 LQGCCSIGLGCLTKSVGNLARSDLRWRQCRWTCIVLSVLGIHNCLG 176
S GC G SDL +QC TC + G +C G
Sbjct: 1229 GIRVSSPICGCPN---GYFENSDLDCQQCDSTCKDCNQYGCLSCFG 1271


>tr|A7V6C3|A7V6C3_BACUN Putative uncharacterized protein
OS=Bacteroides uniformis ATCC 8492 GN=BACUNI_03136 PE=3
SV=1
Length = 1079

Score = 34.3 bits (77), Expect = 4.2
Identities = 24/86 (27%), Positives = 37/86 (43%)
Frame = +1

Query: 148 GLKNPSCKSILSNCGYRERTKQYKSTYIASIANPTSPNCLRILSDNLGQWNNTLANHTMS 327
G KNPS +SN T YK+ Y++ + N T + N+G W+ L N+ +
Sbjct: 926 GSKNPSFLMSMSN------TLTYKNFYLSFLLNGTFKVTRELNEANIGSWSYNLYNYLHN 979

Query: 328 SGYGNQNKPAHQRPLLGNTYTSNFIG 405
+ Y P H + +NF G
Sbjct: 980 ADYWT---PEHTNSKYASPAYNNFDG 1002