DK963529
Clone id TST39A01NGRL0016_M13
Library
Length 621
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0016_M13. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID
Sequence
CTTTCCCCCCCATACAAGGTATGTAGCGAAGCCATTGGAAACTTGTAGCGTTCTCTCTCT
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCACACACACACACACAGGCACACATACC
CAAGGAAATGAGTAAAGAACGACGTTGGGAGAAGCAGATCGAGCATGACGAGCATCTGCT
ACACAAGAATATAGAAAGAAAGGAGCAGCAGCAACGTCGACAAGGACATTATTTCGCAGA
GCTTCCGGCCATAAACATGACATATACACGCACCACCAGCTGCAGGAACTCCATGTTGCA
GGACTCAGCTTTCGCCCGTGATCAGCATACTCATGTCTCCAGCCTCAAAGCCAGCAACCT
TTTGCAGATTCTACGGAGATGGAGAGCGCAAAGCCTGAAGCAAACCACCCTCAGTGGCGC
GTCCATGGCGTCGCGCAGATCTGCTGCCCGCTCGCACTCCTGCTTAAGTTTGGGGCTCAC
CTTAGCGCCATTGAGGTGCTTAAGCATGCAATTGCATTCTGATGATGAAGATGAAGGACA
AAATGGAGCCAAACTTTTAACAGCGCCAAGCCGTGGAAGCCCGAGGCAAATTTGCGGCTC
TAGGGGCCAGGGTAGTACTCT
■■Homology search results ■■ -
sp_hit_id Q86U86
Definition sp|Q86U86|PB1_HUMAN Protein polybromo-1 OS=Homo sapiens
Align length 66
Score (bit) 35.8
E-value 0.18
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK963529|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0016_M13, 5'
(621 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|Q86U86|PB1_HUMAN Protein polybromo-1 OS=Homo sapiens GN=PBRM1... 36 0.18
sp|Q90941|PB1_CHICK Protein polybromo-1 OS=Gallus gallus GN=PBRM... 35 0.31
sp|Q8BSQ9|PB1_MOUSE Protein polybromo-1 OS=Mus musculus GN=Pbrm1... 33 1.2
sp|Q6NS45|CCD66_MOUSE Coiled-coil domain-containing protein 66 O... 33 1.6
sp|Q54RP6|DHKL_DICDI Hybrid signal transduction histidine kinase... 32 2.0
sp|Q9FLI0|BH120_ARATH Transcription factor bHLH120 OS=Arabidopsi... 32 2.0
sp|A2RUB6|CCD66_HUMAN Coiled-coil domain-containing protein 66 O... 32 3.5
sp|Q04791|SASB_ANAPL Fatty acyl-CoA hydrolase precursor, medium ... 30 7.7
sp|Q6BSZ4|PACC_DEBHA pH-response transcription factor pacC/RIM10... 30 7.7
sp|Q8MY86|FGFR1_DUGJA Fibroblast growth factor receptor 1 OS=Dug... 30 7.7

>sp|Q86U86|PB1_HUMAN Protein polybromo-1 OS=Homo sapiens GN=PBRM1 PE=1
SV=1
Length = 1689

Score = 35.8 bits (81), Expect = 0.18
Identities = 18/66 (27%), Positives = 36/66 (54%)
Frame = +2

Query: 98 THTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPAINMTYTRTT 277
++T H+ ++ KER+ + E +E L + E++E ++ A L ++ TY++
Sbjct: 892 SYTTKHLHNDVEKERKEKLPKEIEEDKLKREEEKREAEKSEDSSGAAGLSGLHRTYSQDC 951

Query: 278 SCRNSM 295
S +NSM
Sbjct: 952 SFKNSM 957


>sp|Q90941|PB1_CHICK Protein polybromo-1 OS=Gallus gallus GN=PBRM1
PE=1 SV=1
Length = 1633

Score = 35.0 bits (79), Expect = 0.31
Identities = 17/66 (25%), Positives = 37/66 (56%)
Frame = +2

Query: 98 THTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPAINMTYTRTT 277
++T H+ ++ KE++ + E +E L + E++E ++ A L +++ TY++
Sbjct: 890 SYTTKHLHNDVEKEKKEKLPKEIEEDKLKREEEKREAEKSEDSSGSAGLSSLHRTYSQDC 949

Query: 278 SCRNSM 295
S +NSM
Sbjct: 950 SFKNSM 955


>sp|Q8BSQ9|PB1_MOUSE Protein polybromo-1 OS=Mus musculus GN=Pbrm1 PE=1
SV=3
Length = 1634

Score = 33.1 bits (74), Expect = 1.2
Identities = 16/66 (24%), Positives = 35/66 (53%)
Frame = +2

Query: 98 THTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPAINMTYTRTT 277
++T H+ ++ KE++ + E +E L + E++E ++ L ++ TY++
Sbjct: 892 SYTTKHLHNDVEKEKKEKLPKEIEEDKLKREEEKREAEKSEDSSGTTGLSGLHRTYSQDC 951

Query: 278 SCRNSM 295
S +NSM
Sbjct: 952 SFKNSM 957


>sp|Q6NS45|CCD66_MOUSE Coiled-coil domain-containing protein 66
OS=Mus musculus GN=Ccdc66 PE=1 SV=3
Length = 935

Score = 32.7 bits (73), Expect = 1.6
Identities = 20/92 (21%), Positives = 43/92 (46%), Gaps = 2/92 (2%)
Frame = +2

Query: 134 KERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPAINMTYTRTTSCRNSMLQDSAF 313
+ERR +KQ+EH + ++ + E + +++ + E + + R ++
Sbjct: 454 RERRRQKQLEHQKAIMAQVEENRRKKRLEEEQRKKEEQELELRLAREREEMQRQYEEDIL 513

Query: 314 ARDQHTHVSSLKASNLLQILRRWR--AQSLKQ 403
+ Q + +LK + L ++R + AQ LKQ
Sbjct: 514 KQRQREEIMTLKTNELFHTMQRAQELAQRLKQ 545


>sp|Q54RP6|DHKL_DICDI Hybrid signal transduction histidine kinase L
OS=Dictyostelium discoideum GN=dhkL PE=3 SV=1
Length = 1709

Score = 32.3 bits (72), Expect = 2.0
Identities = 12/45 (26%), Positives = 28/45 (62%)
Frame = +2

Query: 95 HTHTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGH 229
H H Q H+ ++++ E +H++H+ + ++++QQQ++Q H
Sbjct: 1156 HHHQQVHL----QQQQQHEHDAQHNQHIQQQQQQQQQQQQQQQQH 1196


>sp|Q9FLI0|BH120_ARATH Transcription factor bHLH120 OS=Arabidopsis
thaliana GN=BHLH120 PE=3 SV=1
Length = 209

Score = 32.3 bits (72), Expect = 2.0
Identities = 24/92 (26%), Positives = 43/92 (46%), Gaps = 7/92 (7%)
Frame = +2

Query: 98 THTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPA-INMTYTRT 274
T Q+H+P+E + ++ +K LLH+NIER+ +Q+ FA L + + + Y +
Sbjct: 10 TRHQSHMPQERDETKKEKK-------LLHRNIERQRRQE--MAILFASLRSQLPLKYIKA 60

Query: 275 TSCRNSMLQDS------AFARDQHTHVSSLKA 352
S + +F +D T + L A
Sbjct: 61 LSSQGKRAMSDHVNGAVSFIKDTQTRIKDLSA 92


>sp|A2RUB6|CCD66_HUMAN Coiled-coil domain-containing protein 66
OS=Homo sapiens GN=CCDC66 PE=1 SV=4
Length = 948

Score = 31.6 bits (70), Expect = 3.5
Identities = 20/92 (21%), Positives = 43/92 (46%), Gaps = 2/92 (2%)
Frame = +2

Query: 134 KERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPAINMTYTRTTSCRNSMLQDSAF 313
++RR +KQ+EH + + + E++ ++Q + E + + ++
Sbjct: 457 RDRRRQKQLEHQKAITAQVEEKRRKKQLEEEQRKKEEQEEELRLAQEREEMQKQYEEDIL 516

Query: 314 ARDQHTHVSSLKASNLLQILRRWR--AQSLKQ 403
+ Q + +LK + L Q ++R + AQ LKQ
Sbjct: 517 KQKQKEEIMTLKTNELFQTMQRAQELAQRLKQ 548


>sp|Q04791|SASB_ANAPL Fatty acyl-CoA hydrolase precursor, medium
chain OS=Anas platyrhynchos PE=1 SV=1
Length = 557

Score = 30.4 bits (67), Expect = 7.7
Identities = 15/33 (45%), Positives = 18/33 (54%)
Frame = -2

Query: 224 LVDVAAAPFFLYSCVADARHARSASPNVVLYSF 126
L+D A P F++S V ARH R A V Y F
Sbjct: 409 LLDSIADPLFVFSAVEVARHHRDAGNPVYFYEF 441


>sp|Q6BSZ4|PACC_DEBHA pH-response transcription factor pacC/RIM101
OS=Debaryomyces hansenii GN=RIM101 PE=3 SV=2
Length = 617

Score = 30.4 bits (67), Expect = 7.7
Identities = 16/44 (36%), Positives = 24/44 (54%)
Frame = +2

Query: 101 HTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHY 232
H H PK++ K R +Q EHD +++ EQ+Q RQ H+
Sbjct: 251 HADDH-PKKLKKAHR-RQQEEHDHEREYEHEHEHEQEQYRQSHF 292


>sp|Q8MY86|FGFR1_DUGJA Fibroblast growth factor receptor 1
OS=Dugesia japonica GN=FGFR1 PE=2 SV=1
Length = 854

Score = 30.4 bits (67), Expect = 7.7
Identities = 17/50 (34%), Positives = 25/50 (50%)
Frame = +2

Query: 200 KEQQQRRQGHYFAELPAINMTYTRTTSCRNSMLQDSAFARDQHTHVSSLK 349
K Q R G+Y +LP+ + TR S R+S L++ F + SLK
Sbjct: 471 KRLSQFRDGNYGEQLPSTSTDRTRLESTRHSQLENEVFECGSGNNSLSLK 520


tr_hit_id Q7YYM3
Definition tr|Q7YYM3|Q7YYM3_CRYPV Putative uncharacterized protein OS=Cryptosporidium parvum
Align length 100
Score (bit) 36.6
E-value 1.2
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK963529|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0016_M13, 5'
(621 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|Q7YYM3|Q7YYM3_CRYPV Putative uncharacterized protein OS=Crypt... 37 1.2
tr|Q5CWQ4|Q5CWQ4_CRYPV Putative uncharacterized protein OS=Crypt... 37 1.2
tr|B4MX27|B4MX27_DROWI GK14540 OS=Drosophila willistoni GN=GK145... 37 1.2
tr|A7AW30|A7AW30_BABBO Adenosylhomocysteinase (S-adenosyl-L-homo... 37 1.2
tr|Q9NUX9|Q9NUX9_HUMAN cDNA FLJ11064 fis, clone PLACE1004824 OS=... 36 2.1
tr|B4M1I9|B4M1I9_DROVI GJ19315 OS=Drosophila virilis GN=GJ19315 ... 35 4.7
tr|B4JM34|B4JM34_DROGR GH24569 OS=Drosophila grimshawi GN=GH2456... 35 4.7
tr|A9VZY8|A9VZY8_METEP Integrase family protein OS=Methylobacter... 34 6.1
tr|B4MY21|B4MY21_DROWI GK10478 OS=Drosophila willistoni GN=GK104... 34 7.9
tr|B4LBK3|B4LBK3_DROVI GJ11337 OS=Drosophila virilis GN=GJ11337 ... 34 7.9
tr|O30328|O30328_ACEEU Aldehyde dehydrogenase subunit III OS=Ace... 34 8.0
tr|Q3U0E6|Q3U0E6_MOUSE Putative uncharacterized protein OS=Mus m... 30 8.5

>tr|Q7YYM3|Q7YYM3_CRYPV Putative uncharacterized protein
OS=Cryptosporidium parvum GN=1MB.594 PE=4 SV=1
Length = 1000

Score = 36.6 bits (83), Expect = 1.2
Identities = 22/100 (22%), Positives = 48/100 (48%)
Frame = +2

Query: 95 HTHTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPAINMTYTRT 274
HT TQ+ + + + ++Q + +HL H+ +QQQ+++ F++ N T T+T
Sbjct: 219 HTRTQSQPVAMVHQHQYQQQQQQQVQHLNHQQYFHSQQQQQQRMQTFSQSQTQNQTQTKT 278

Query: 275 TSCRNSMLQDSAFARDQHTHVSSLKASNLLQILRRWRAQS 394
+ +S + + + + + + QIL + + QS
Sbjct: 279 QAQIHSQSHSQSHVKAPNQTQTQTQVHSQTQILSQSQPQS 318


>tr|Q5CWQ4|Q5CWQ4_CRYPV Putative uncharacterized protein
OS=Cryptosporidium parvum Iowa II GN=cgd6_4170 PE=4 SV=1
Length = 1000

Score = 36.6 bits (83), Expect = 1.2
Identities = 22/100 (22%), Positives = 48/100 (48%)
Frame = +2

Query: 95 HTHTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPAINMTYTRT 274
HT TQ+ + + + ++Q + +HL H+ +QQQ+++ F++ N T T+T
Sbjct: 219 HTRTQSQPVAMVHQHQYQQQQQQQVQHLNHQQYFHSQQQQQQRMQTFSQSQTQNQTQTKT 278

Query: 275 TSCRNSMLQDSAFARDQHTHVSSLKASNLLQILRRWRAQS 394
+ +S + + + + + + QIL + + QS
Sbjct: 279 QAQIHSQSHSQSHVKAPNQTQTQTQVHSQTQILSQSQPQS 318


>tr|B4MX27|B4MX27_DROWI GK14540 OS=Drosophila willistoni GN=GK14540
PE=4 SV=1
Length = 281

Score = 36.6 bits (83), Expect = 1.2
Identities = 25/96 (26%), Positives = 42/96 (43%), Gaps = 2/96 (2%)
Frame = +2

Query: 95 HTHTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPAINMTYTRT 274
H + H EM+ R +++ E D H ++IER+ + P+I+ + R+
Sbjct: 28 HNRDRDHKSSEMNHHGRRDRERERDRHRSDRHIERERD--------YRHSPSISKSRKRS 79

Query: 275 TSCRNSMLQDSAFARDQH--THVSSLKASNLLQILR 376
+S +S D R +H T L N LQ+ R
Sbjct: 80 SSSSDSPYSDHESQRSRHKRTRFKKLDEQNQLQVER 115


>tr|A7AW30|A7AW30_BABBO Adenosylhomocysteinase
(S-adenosyl-L-homocysteinehydrolase) OS=Babesia bovis
GN=BBOV_I001740 PE=3 SV=1
Length = 491

Score = 36.6 bits (83), Expect = 1.2
Identities = 16/45 (35%), Positives = 23/45 (51%), Gaps = 2/45 (4%)
Frame = -2

Query: 278 WWCVYMSCLWPEALRNNVLVDVAAAPFFL--YSCVADARHARSAS 150
WWCVY S WP A ++VD + + + Y + RHA+ S
Sbjct: 121 WWCVYQSLRWPNADGPQLIVDDGCSAYTMIKYGLDLERRHAKDGS 165


>tr|Q9NUX9|Q9NUX9_HUMAN cDNA FLJ11064 fis, clone PLACE1004824
OS=Homo sapiens PE=2 SV=1
Length = 289

Score = 35.8 bits (81), Expect = 2.1
Identities = 18/66 (27%), Positives = 36/66 (54%)
Frame = +2

Query: 98 THTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPAINMTYTRTT 277
++T H+ ++ KER+ + E +E L + E++E ++ A L ++ TY++
Sbjct: 49 SYTTKHLHNDVEKERKEKLPKEIEEDKLKREEEKREAEKSEDSSGAAGLSGLHRTYSQDC 108

Query: 278 SCRNSM 295
S +NSM
Sbjct: 109 SFKNSM 114


>tr|B4M1I9|B4M1I9_DROVI GJ19315 OS=Drosophila virilis GN=GJ19315
PE=4 SV=1
Length = 1200

Score = 34.7 bits (78), Expect = 4.7
Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 5/79 (6%)
Frame = +2

Query: 95 HTHTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQ-----RRQGHYFAELPAINM 259
H Q H P+ +++ ++ +H +H LH+N + ++ QQ +R + F E +
Sbjct: 564 HLEQQQHFPQRYHQQQAQQQHQQHQQHQLHQNCQHQQPQQLVHRGKRYANGFNEAAQLRR 623

Query: 260 TYTRTTSCRNSMLQDSAFA 316
T + R + +A A
Sbjct: 624 TEPTHVALRRELAAVAATA 642


>tr|B4JM34|B4JM34_DROGR GH24569 OS=Drosophila grimshawi GN=GH24569
PE=4 SV=1
Length = 971

Score = 34.7 bits (78), Expect = 4.7
Identities = 19/63 (30%), Positives = 32/63 (50%), Gaps = 4/63 (6%)
Frame = +2

Query: 104 TQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQ----GHYFAELPAINMTYTR 271
TQAH +++ + HL H++ ++++QQQ++Q H LPA T T
Sbjct: 591 TQAHAAGDLNMNVNQAAPVVQQPHLQHQHQQQQQQQQQQQQQQPAHMLLLLPATTTTATT 650

Query: 272 TTS 280
TT+
Sbjct: 651 TTA 653


>tr|A9VZY8|A9VZY8_METEP Integrase family protein OS=Methylobacterium
extorquens (strain PA1) GN=Mext_0473 PE=4 SV=1
Length = 377

Score = 34.3 bits (77), Expect = 6.1
Identities = 19/49 (38%), Positives = 29/49 (59%)
Frame = +2

Query: 125 EMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAELPAINMTYTR 271
E KE RWE Q++ ++ LLH N E +EQ +R+ ++P I + Y R
Sbjct: 208 EAVKELRWE-QVDFEDGLLHLNPEEREQTSKRRP--VVKMPPILIAYLR 253


>tr|B4MY21|B4MY21_DROWI GK10478 OS=Drosophila willistoni GN=GK10478
PE=4 SV=1
Length = 466

Score = 33.9 bits (76), Expect = 7.9
Identities = 15/50 (30%), Positives = 32/50 (64%)
Frame = +2

Query: 95 HTHTQAHIPKEMSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAEL 244
H+HTQ H ++++ + +H +HL H+ ++++QQQ++Q H+ +L
Sbjct: 214 HSHTQQH-----QQQQQQQLHQQHQQHLQHQ--QQQQQQQQQQQHHAVQL 256


>tr|B4LBK3|B4LBK3_DROVI GJ11337 OS=Drosophila virilis GN=GJ11337
PE=4 SV=1
Length = 644

Score = 33.9 bits (76), Expect = 7.9
Identities = 15/55 (27%), Positives = 33/55 (60%), Gaps = 4/55 (7%)
Frame = +2

Query: 128 MSKERRWEKQIEHDEHLLHKNIERKEQQQRRQGHYFAEL----PAINMTYTRTTS 280
+ ++++ ++Q++H + + H ++ QQQ++Q H+FA++ P TYT S
Sbjct: 347 LQQQQQQQQQMQHMQQMQHMQQMQQMQQQQQQQHHFAQVQQQHPVAGPTYTHCCS 401