BP918286
Clone id YMU001_000111_F10
Library
Length 520
Definition Adiantum capillus-veneris mRNA. clone: YMU001_000111_F10.
Accession
Tissue type prothallium
Developmental stage -
Contig ID -
Sequence
GTGTGTCAGAGACTATTTTGAGATGGTATATAGTCCGCTTTATGGCTATTATTTGCTATT
GAGATGGTGCTTATGGGTTGATTAGGAGCACTTTTCGGTATCGAACAGTCTGTCTCCGAG
TTTTTCCAGGCATCCATCTTCTCACAGGTTCTTGCATACATACATGACATCCAAATTTCT
TACAGGTTGTAGTGTTTTGCATCAGCATAGATTTCAGTGTTAGTATTTGATGCGGATATG
CATTGTTTGGGCACTTTTGATAGTGCTTGAGCTCTTTTGAGAGCGGTTTTACTCCCCGCA
GGGATTTGTTTACTTCTTGCACATTTTGTTTGGCGCTTGTTTGAGCGCGTATACTACACC
TTGCAGTATTGTCGACAGTGGATGTACGTTGAGTTGCGGAAACTGCCATATGTACCTTTT
GTAGAGCCGATTTGGCTTGTTTCTAGCACTTCCGACCGTTTCTGAAACTCGGTTCGGAGG
CCCTTTCTGAGATTGAGCTGGTTTCTTGTCTGCATAACCT
Homology search results
sp_hit_id P43236
Definition sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus
Align length 50
Score (bit) 32.7
E-value 1.0
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= BP918286|Adiantum capillus-veneris mRNA, clone:
YMU001_000111_F10.
(520 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters




Sequences producing significant alignments:          Score (bits)  E value

sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTS... 33 1.0
sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1 33 1.0
sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 ... 33 1.0
sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK ... 33 1.0
sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1 33 1.0
sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2... 32 1.4
sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=... 32 1.4
sp|Q6FKB1|SET1_CANGA Histone-lysine N-methyltransferase, H3 lysi... 32 1.8
sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=... 32 2.3
sp|Q5E968|CATK_BOVIN Cathepsin K OS=Bos taurus GN=CTSK PE=2 SV=2 32 2.3
sp|O13586|YP092_YEAST Putative uncharacterized protein YPR092W O... 30 5.2
sp|Q17WQ8|IF2_HELAH Translation initiation factor IF-2 OS=Helico... 30 5.3
sp|P47033|PRY3_YEAST Protein PRY3 OS=Saccharomyces cerevisiae GN... 30 6.8
sp|P55097|CATK_MOUSE Cathepsin K OS=Mus musculus GN=Ctsk PE=2 SV=2 30 6.8
sp|Q8WHW9|YCF2_PSINU Protein ycf2 OS=Psilotum nudum GN=ycf2 PE=3... 30 6.9
sp|Q19157|PIN2_CAEEL LIM domain-containing protein pin-2 OS=Caen... 30 6.9
sp|Q4QXJ8|POLN_EEEVF Non-structural polyprotein OS=Eastern equin... 26 8.0
sp|Q306W8|POLN_EEEV8 Non-structural polyprotein OS=Eastern equin... 26 8.0
sp|Q306W6|POLN_EEEV1 Non-structural polyprotein OS=Eastern equin... 26 8.0

>sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTSK
PE=1 SV=1
Length = 329

Score = 32.7 bits (73), Expect = 1.0
Identities = 17/50 (34%), Positives = 27/50 (54%)
Frame = -3

Query: 335 RQTKCARSKQIPAGSKTALKRAQALSKVPKQCISASNTNTEIYADAKHYN 186
           +  KC   ++IP G++ ALKRA A        I AS T+ + Y+   +Y+
Sbjct: 217 KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYD 266
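
As a cross-check on the Frame = -3 alignments, the sketch below re-derives the translated query segment from the nucleotide sequence in the Sequence field. It assumes the usual BLASTX convention that negative frames are read on the reverse complement, that Query coordinates (335 down to 186) are 1-based positions on the submitted strand, and that the standard genetic code applies. The function names are illustrative and not part of any BLAST tooling.

```python
# EST BP918286, copied from the Sequence field (520 nt).
SEQ = (
    "GTGTGTCAGAGACTATTTTGAGATGGTATATAGTCCGCTTTATGGCTATTATTTGCTATT"
    "GAGATGGTGCTTATGGGTTGATTAGGAGCACTTTTCGGTATCGAACAGTCTGTCTCCGAG"
    "TTTTTCCAGGCATCCATCTTCTCACAGGTTCTTGCATACATACATGACATCCAAATTTCT"
    "TACAGGTTGTAGTGTTTTGCATCAGCATAGATTTCAGTGTTAGTATTTGATGCGGATATG"
    "CATTGTTTGGGCACTTTTGATAGTGCTTGAGCTCTTTTGAGAGCGGTTTTACTCCCCGCA"
    "GGGATTTGTTTACTTCTTGCACATTTTGTTTGGCGCTTGTTTGAGCGCGTATACTACACC"
    "TTGCAGTATTGTCGACAGTGGATGTACGTTGAGTTGCGGAAACTGCCATATGTACCTTTT"
    "GTAGAGCCGATTTGGCTTGTTTCTAGCACTTCCGACCGTTTCTGAAACTCGGTTCGGAGG"
    "CCCTTTCTGAGATTGAGCTGGTTTCTTGTCTGCATAACCT"
)

# Standard genetic code (assumed), built in T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(dna):
    """Return the reverse complement of an uppercase ACGT string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate_minus_strand(dna, high, low):
    """Translate 1-based positions high..low, read on the reverse strand."""
    segment = reverse_complement(dna[low - 1:high])
    return "".join(
        CODON_TABLE[segment[i:i + 3]] for i in range(0, len(segment) - 2, 3)
    )


# Minus-strand frame implied by the highest query coordinate (assumed convention).
frame = -((len(SEQ) - 335) % 3 + 1)
peptide = translate_minus_strand(SEQ, 335, 186)
print(frame, peptide)
```

If these assumptions hold, the printed peptide should match the Query line of the cathepsin K alignments character for character.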


>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
Length = 330

Score = 32.7 bits (73), Expect = 1.0
Identities = 17/50 (34%), Positives = 27/50 (54%)
Frame = -3

Query: 335 RQTKCARSKQIPAGSKTALKRAQALSKVPKQCISASNTNTEIYADAKHYN 186
           +  KC   ++IP G++ ALKRA A        I AS T+ + Y+   +Y+
Sbjct: 218 KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYD 267


>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1
SV=1
Length = 329

Score = 32.7 bits (73), Expect = 1.0
Identities = 17/50 (34%), Positives = 27/50 (54%)
Frame = -3

Query: 335 RQTKCARSKQIPAGSKTALKRAQALSKVPKQCISASNTNTEIYADAKHYN 186
           +  KC   ++IP G++ ALKRA A        I AS T+ + Y+   +Y+
Sbjct: 217 KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYD 266


>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK
PE=2 SV=1
Length = 329

Score = 32.7 bits (73), Expect = 1.0
Identities = 17/50 (34%), Positives = 27/50 (54%)
Frame = -3

Query: 335 RQTKCARSKQIPAGSKTALKRAQALSKVPKQCISASNTNTEIYADAKHYN 186
           +  KC   ++IP G++ ALKRA A        I AS T+ + Y+   +Y+
Sbjct: 217 KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYD 266


>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
Length = 329

Score = 32.7 bits (73), Expect = 1.0
Identities = 17/50 (34%), Positives = 27/50 (54%)
Frame = -3

Query: 335 RQTKCARSKQIPAGSKTALKRAQALSKVPKQCISASNTNTEIYADAKHYN 186
           +  KC   ++IP G++ ALKRA A        I AS T+ + Y+   +Y+
Sbjct: 217 KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYD 266


>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2
SV=1
Length = 329

Score = 32.3 bits (72), Expect = 1.4
Identities = 17/50 (34%), Positives = 27/50 (54%)
Frame = -3

Query: 335 RQTKCARSKQIPAGSKTALKRAQALSKVPKQCISASNTNTEIYADAKHYN 186
           +  KC   ++IP G++ ALKRA A        I AS T+ + Y+   +Y+
Sbjct: 217 KAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDASLTSFQFYSRGVYYD 266


>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2
SV=1
Length = 330

Score = 32.3 bits (72), Expect = 1.4
Identities = 17/50 (34%), Positives = 27/50 (54%)
Frame = -3

Query: 335 RQTKCARSKQIPAGSKTALKRAQALSKVPKQCISASNTNTEIYADAKHYN 186
           +  KC   ++IP G++ ALKRA A        I AS T+ + Y+   +Y+
Sbjct: 218 KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYD 267


>sp|Q6FKB1|SET1_CANGA Histone-lysine N-methyltransferase, H3
lysine-4 specific OS=Candida glabrata GN=SET1 PE=3 SV=1
Length = 1111

Score = 32.0 bits (71), Expect = 1.8
Identities = 25/88 (28%), Positives = 37/88 (42%), Gaps = 6/88 (6%)
Frame = -2

Query: 510 DKKPAQSQKGPPNRVSETVGSARNKPNRLYKRYI------WQFPQLNVHPLSTILQGVVY 349
+KK A+ K + S NKP R KRY+ + PQ + PL+ +L V
Sbjct: 622 EKKKAEQAKSKDFNIFNLYASYTNKPKRRQKRYLSDEEAEEELPQKKIKPLAHLLDEV-- 679

Query: 348 ALKQAPNKMCKK*TNPCGE*NRSQKSSS 265
+ +P + +P N S SSS
Sbjct: 680 -REDSPTSGLESTDDPTEGDNMSTSSSS 706
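
The percentages on the statistics lines in this report appear to be truncated rather than rounded: 6/88 is about 6.8% but is reported as 6%. The sketch below parses one such line and compares each reported percentage with the integer-truncated ratio. The regular expression and the function name are illustrative assumptions, not part of BLAST.

```python
import re

# Matches fields such as "Identities = 25/88 (28%)" on an HSP statistics line.
HSP_STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def check_hsp_line(line):
    """Return {field: (count, length, reported_pct, truncated_pct)}."""
    out = {}
    for name, num, den, pct in HSP_STAT.findall(line):
        num, den, pct = int(num), int(den), int(pct)
        # Integer division drops the fractional part instead of rounding.
        out[name] = (num, den, pct, 100 * num // den)
    return out


line = "Identities = 25/88 (28%), Positives = 37/88 (42%), Gaps = 6/88 (6%)"
stats = check_hsp_line(line)
for name, (num, den, reported, truncated) in stats.items():
    print(name, f"{num}/{den}", reported, truncated)
```

When comparing hits by percent identity, recomputing the ratio from the raw counts avoids the loss of up to one percentage point that truncation introduces.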


>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2
SV=1
Length = 331

Score = 31.6 bits (70), Expect = 2.3
Identities = 17/52 (32%), Positives = 26/52 (50%)
Frame = -3

Query: 344 SNKRQTKCARSKQIPAGSKTALKRAQALSKVPKQCISASNTNTEIYADAKHY 189
           S KR   C++  ++P GS+ ALK A A        I AS+ +  +Y    +Y
Sbjct: 217 SKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYY 268


>sp|Q5E968|CATK_BOVIN Cathepsin K OS=Bos taurus GN=CTSK PE=2 SV=2
Length = 329

Score = 31.6 bits (70), Expect = 2.3
Identities = 17/50 (34%), Positives = 26/50 (52%)
Frame = -3

Query: 335 RQTKCARSKQIPAGSKTALKRAQALSKVPKQCISASNTNTEIYADAKHYN 186
           +  KC   ++IP G++ ALKRA A        I AS T+ + Y    +Y+
Sbjct: 217 KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRKGVYYD 266


tr_hit_id A6M965
Definition tr|A6M965|A6M965_9VIRU Putative O-antigen polymerase OS=Geobacillus virus E2
Align length 59
Score (bit) 36.2
E-value 1.0
Report
BLASTX 2.2.19 [Nov-02-2008]


Query= BP918286|Adiantum capillus-veneris mRNA, clone:
YMU001_000111_F10.
(520 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters




Sequences producing significant alignments:          Score (bits)  E value

tr|A6M965|A6M965_9VIRU Putative O-antigen polymerase OS=Geobacil... 36 1.00
tr|B6LEE1|B6LEE1_BRAFL Putative uncharacterized protein (Fragmen... 36 1.0
tr|Q8JIG2|Q8JIG2_DANRE BHLH-PAS transcription factor OS=Danio re... 35 1.7
tr|B8IN94|B8IN94_METNO Secretion protein HlyD family protein OS=... 35 1.7
tr|B3DH92|B3DH92_DANRE Clock homolog 3 (Mouse) OS=Danio rerio GN... 35 2.2
tr|B6LED9|B6LED9_BRAFL Putative uncharacterized protein (Fragmen... 35 2.2
tr|Q4Y2F2|Q4Y2F2_PLACH Putative uncharacterized protein OS=Plasm... 34 5.0
tr|B6LEC1|B6LEC1_BRAFL Putative uncharacterized protein (Fragmen... 34 5.0
tr|B4JQJ6|B4JQJ6_DROGR GH13708 OS=Drosophila grimshawi GN=GH1370... 33 6.5
tr|B3DM60|B3DM60_XENTR Rlf protein OS=Xenopus tropicalis GN=rlf ... 33 6.5
tr|B0YV76|B0YV76_9HIV1 Envelope glycoprotein OS=Human immunodefi... 33 8.5
tr|Q54RX7|Q54RX7_DICDI Putative uncharacterized protein OS=Dicty... 33 8.5
tr|Q0Q4G9|Q0Q4G9_PIG Cathepsin K (Fragment) OS=Sus scrofa GN=CTS... 33 8.5

>tr|A6M965|A6M965_9VIRU Putative O-antigen polymerase OS=Geobacillus
virus E2 PE=4 SV=1
Length = 496

Score = 36.2 bits (82), Expect = 1.00
Identities = 21/59 (35%), Positives = 30/59 (50%)
Frame = +2

Query: 176 FLTGCSVLHQHRFQC*YLMRICIVWALLIVLELF*ERFYSPQGFVYFLHILFGACLSAY 352
+L GC + QH FQ + +C+ AL+I + Y+P V FL +L G CL Y
Sbjct: 8 WLGGCILYLQHSFQKVFFCLLCVAVALIIGI----LTAYNPIASVLFLLLLIGVCLFVY 62


>tr|B6LEE1|B6LEE1_BRAFL Putative uncharacterized protein (Fragment)
OS=Branchiostoma floridae GN=BRAFLDRAFT_204802 PE=4 SV=1
Length = 400

Score = 36.2 bits (82), Expect = 1.0
Identities = 32/137 (23%), Positives = 52/137 (37%), Gaps = 3/137 (2%)
Frame = -3

Query: 428 GSTKGT--YGSFRNSTYIHCRQYCKV*YTRSNKRQTKCARSKQIPAGSKTALKRAQALSK 255
GST G YG N +C Y S + C + T ++A +
Sbjct: 106 GSTSGDSYYGDVVN-------YHCDPGYEISGDEERTCQSDQTWSGTQPTCNRKACPVLS 158

Query: 254 VPKQCISASNTNTEIYADAKHYNL*EIW-MSCMYARTCEKMDAWKNSETDCSIPKSAPNQ 78
VP + S T +Y D Y+ E + + RTC+ +W ++ +CS P
Sbjct: 159 VPN---NGSRTEGHLYGDKVTYSCNEGYELIGSENRTCQANQSWSGTQPNCSRTSCPPLP 215

Query: 77 PISTISIANNSHKADYI 27
P+ S + S+ D +
Sbjct: 216 PVEHGSTSGGSYYGDVV 232


>tr|Q8JIG2|Q8JIG2_DANRE BHLH-PAS transcription factor OS=Danio rerio
GN=clock3 PE=2 SV=1
Length = 813

Score = 35.4 bits (80), Expect = 1.7
Identities = 28/106 (26%), Positives = 52/106 (49%), Gaps = 3/106 (2%)
Frame = -1

Query: 502 TSSISERASEPSFRNGRKC*KQAKSALQKVHMAVSATQRTSTVD--NTARCSIRAQTSAK 329
TSS S S + + C S K+HM VS++ R +TVD + R S+ Q+ +
Sbjct: 418 TSSRSSHKSSHTAISDNTC----TSTPSKLHMDVSSSPRPATVDMISQRRSSVSTQSMSS 473

Query: 328 QNVQEVNKSLRGVKPLSKELKHYQ-KCPNNAYPHQILTLKSMLMQN 194
QN + + ++ +S++ +H Q + P P+ + + + MQ+
Sbjct: 474 QNTTQTDSPVQ----ISQQTQHQQPQQPQQVQPNALFSAQLNAMQH 515


>tr|B8IN94|B8IN94_METNO Secretion protein HlyD family protein
OS=Methylobacterium nodulans ORS 2060 GN=Mnod_7473 PE=4
SV=1
Length = 390

Score = 35.4 bits (80), Expect = 1.7
Identities = 22/73 (30%), Positives = 37/73 (50%), Gaps = 3/73 (4%)
Frame = -1

Query: 517 LCRQETSSISERASEPSFRNGRKC*KQAKSALQKVHMAVSATQRTSTVDNTARCSIRA-- 344
L R S ++ ASE F+ K+A +A++K H + A +R V +T +RA
Sbjct: 158 LTRYRQLSANQYASEQRFQQAEADHKKAAAAVEKAHATLDAAERQLAVIDTQTRQVRAAR 217

Query: 343 -QTSAKQNVQEVN 308
Q A+Q++ +N
Sbjct: 218 EQARAEQDIARLN 230


>tr|B3DH92|B3DH92_DANRE Clock homolog 3 (Mouse) OS=Danio rerio
GN=clock3 PE=2 SV=1
Length = 820

Score = 35.0 bits (79), Expect = 2.2
Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 2/95 (2%)
Frame = -1

Query: 502 TSSISERASEPSFRNGRKC*KQAKSALQKVHMAVSATQRTSTVD--NTARCSIRAQTSAK 329
TSS S S + + C S K+HM VS++ R +TVD + R S+ Q+ +
Sbjct: 418 TSSRSSHKSSHTAISDNTC----TSTPSKLHMDVSSSPRPATVDMISQRRSSVSTQSMSS 473

Query: 328 QNVQEVNKSLRGVKPLSKELKHYQKCPNNAYPHQI 224
QN + + ++ +S++ +H Q+ P P Q+
Sbjct: 474 QNTTQTDSPVQ----ISQQTQHQQQQPQQ--PQQV 502


>tr|B6LED9|B6LED9_BRAFL Putative uncharacterized protein (Fragment)
OS=Branchiostoma floridae GN=BRAFLDRAFT_204816 PE=4 SV=1
Length = 320

Score = 35.0 bits (79), Expect = 2.2
Identities = 25/115 (21%), Positives = 45/115 (39%), Gaps = 1/115 (0%)
Frame = -3

Query: 368 YCKV*YTRSNKRQTKCARSKQIPAGSKTALKRAQALSKVPKQCISASNTNTEIYADAKHY 189
+C Y S + C + T ++A + VP + S T +Y D Y
Sbjct: 7 HCDPGYEISGDEERTCQSDQTWSGTQPTCNRKACPVLSVPN---NGSRTEGHLYGDKVTY 63

Query: 188 NL*EIW-MSCMYARTCEKMDAWKNSETDCSIPKSAPNQPISTISIANNSHKADYI 27
+ E + + RTC+ +W ++ +CS P P+ S + S+ D +
Sbjct: 64 SCNEGYELIGSENRTCQANQSWSGTQPNCSRTSCPPLPPVEHGSTSGGSYYGDVV 118



Score = 34.3 bits (77), Expect = 3.8
Identities = 30/135 (22%), Positives = 53/135 (39%), Gaps = 1/135 (0%)
Frame = -3

Query: 428 GSTKGTYGSFRNSTYIHCRQYCKV*YTRSNKRQTKCARSKQIPAGSKTALKRAQALSKVP 249
GST G GS+ + +C Y S + C + T + A + VP
Sbjct: 106 GSTSG--GSYYGDVVTY---HCDPGYEISGDEERTCQSDQTWSGTQPTCSRTACPVLPVP 160

Query: 248 KQCISASNTNTEIYADAKHYNL*EIW-MSCMYARTCEKMDAWKNSETDCSIPKSAPNQPI 72
+ S T +Y D ++ E + + RTC+ +W ++ +CS +P P+
Sbjct: 161 N---NGSRTEGHLYGDKVTFSCDEGYELIGSENRTCQANQSWSGTQPNCSRKPCSPLLPV 217

Query: 71 STISIANNSHKADYI 27
S + S+ D +
Sbjct: 218 EHGSTSGGSYYGDVV 232


>tr|Q4Y2F2|Q4Y2F2_PLACH Putative uncharacterized protein
OS=Plasmodium chabaudi GN=PC000645.01.0 PE=4 SV=1
Length = 629

Score = 33.9 bits (76), Expect = 5.0
Identities = 17/41 (41%), Positives = 24/41 (58%)
Frame = -3

Query: 497 LNLRKGLRTEFQKRSEVLETSQIGSTKGTYGSFRNSTYIHC 375
           +NL+      FQKR+E  ++S I S  G + SF NS +I C
Sbjct: 540 VNLKAKFSLHFQKRTEGYKSSDIHSFAGLFMSFMNSIFILC 580


>tr|B6LEC1|B6LEC1_BRAFL Putative uncharacterized protein (Fragment)
OS=Branchiostoma floridae GN=BRAFLDRAFT_204663 PE=4 SV=1
Length = 432

Score = 33.9 bits (76), Expect = 5.0
Identities = 30/135 (22%), Positives = 52/135 (38%), Gaps = 1/135 (0%)
Frame = -3

Query: 428 GSTKGTYGSFRNSTYIHCRQYCKV*YTRSNKRQTKCARSKQIPAGSKTALKRAQALSKVP 249
GST G GS+ + +C Y S + C + T + A + VP
Sbjct: 173 GSTSG--GSYYGDVVTY---HCDPGYEISGDEERTCQSDQTWSGTQSTCNRTACPILPVP 227

Query: 248 KQCISASNTNTEIYADAKHYNL*EIW-MSCMYARTCEKMDAWKNSETDCSIPKSAPNQPI 72
+ S T +Y D ++ E + + RTC+ +W ++ +CS AP +
Sbjct: 228 N---NGSRTEGHLYGDKVTFSCNEGYELIGSENRTCQANQSWSGTQPNCSRTSCAPLMAV 284

Query: 71 STISIANNSHKADYI 27
S + S+ D +
Sbjct: 285 EHGSTSGGSYYGDVV 299


>tr|B4JQJ6|B4JQJ6_DROGR GH13708 OS=Drosophila grimshawi GN=GH13708
PE=4 SV=1
Length = 1822

Score = 33.5 bits (75), Expect = 6.5
Identities = 16/42 (38%), Positives = 26/42 (61%)
Frame = -1

Query: 346 AQTSAKQNVQEVNKSLRGVKPLSKELKHYQKCPNNAYPHQIL 221
AQT+ QN +++ KSL + P K++ H P AYPH+++
Sbjct: 198 AQTNGLQNGKKLTKSLSEI-PNGKDMMHQHYPPGFAYPHELI 238


>tr|B3DM60|B3DM60_XENTR Rlf protein OS=Xenopus tropicalis GN=rlf
PE=2 SV=1
Length = 1323

Score = 33.5 bits (75), Expect = 6.5
Identities = 23/87 (26%), Positives = 39/87 (44%), Gaps = 2/87 (2%)
Frame = -3

Query: 341 NKRQTKCARSKQIPAGSKTALKRAQALSKVPKQC--ISASNTNTEIYADAKHYNL*EIWM 168
N Q C +K +P+ KT + + C + NT E+ KH NL +
Sbjct: 494 NTEQLNCV-NKCVPSSDKTGSAEDCGETHLSSVCHRSTCENTINELLTSLKHLNL-KNSN 551

Query: 167 SCMYARTCEKMDAWKNSETDCSIPKSA 87
SC+ + +A +S DC++PK++
Sbjct: 552 SCITTSGAQDSEANASSSIDCAVPKNS 578