DK959976
Clone id TST39A01NGRL0006_C10
Library
Length 639
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0006_C10. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID
Sequence
ACTGTACTAAGGCCTCATATAGGCCAATAACTATTGCAAAAGTTTATGTGTATGCCACTC
CCACTGACGATCATCCATGGTAAGCTTCTTCTAATAAGTGGGCCGGTTCGTGTGTTATTG
AGAGCATCCTGAGGCCGGGCAGAGGTAGCCTACTACCATTAGAACAATGAAGAGGCTCAC
AGGTTCATGCGACCTCGCCCGCATTTTGACAAGGCCACGACACCTTTACTGGCAAAGCAG
GCAGATGGTCACCACTTGTTATGATGTAGAGATGGCCAGAAATGAGCAACAGGAAAACCA
CAGGCTTGCAAACCCTCAGTTTAAGAACTTTGCTAGCACGAACACGTCCGCGCTTGGGCA
TCCGATTGGTGTGGGTAATTCGAATGTTGGGGATGGTCATATTCAATCAAACAGCGTAGC
AAGCAGCTTGGCTAAGTTCGCTGACACAAAAGCTGAGAGTTCTAGAGCCGTTAAACCTGA
CTTGCCACCAGAAAAACGAAAAATCCGTCCTGATGAGTATAATGTACAGAAAGCACGAAT
ACATGCAAACTTGTGGCAACAGGAGATCAAGAAACATGTGGGAGAGTCGACTTCTATTGA
TGGAGATGGAGAGTCTGGATTGAGGGGAGACTCTGTTTG
■■Homology search results ■■ -
sp_hit_id O75592
Definition sp|O75592|MYCB2_HUMAN Probable E3 ubiquitin-protein ligase MYCBP2 OS=Homo sapiens
Align length 45
Score (bit) 32.3
E-value 2.1
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK959976|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0006_C10, 5'
(639 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|O75592|MYCB2_HUMAN Probable E3 ubiquitin-protein ligase MYCBP... 32 2.1
sp|Q63805|A1AG3_MOUSE Alpha-1-acid glycoprotein 3 OS=Mus musculu... 32 2.8
sp|Q54HS1|Y9416_DICDI Putative uncharacterized protein DDB_G0289... 31 6.2
sp|Q60929|MEF2A_MOUSE Myocyte-specific enhancer factor 2A OS=Mus... 31 6.2
sp|Q8BR86|KIRR3_MOUSE Kin of IRRE-like protein 3 OS=Mus musculus... 31 6.2
sp|Q8IZU9|KIRR3_HUMAN Kin of IRRE-like protein 3 OS=Homo sapiens... 31 6.2
sp|Q8SX83|SPEN_DROME Protein split ends OS=Drosophila melanogast... 30 8.1
sp|O25424|ASPG_HELPY Probable L-asparaginase OS=Helicobacter pyl... 30 8.1

>sp|O75592|MYCB2_HUMAN Probable E3 ubiquitin-protein ligase MYCBP2
OS=Homo sapiens GN=MYCBP2 PE=1 SV=3
Length = 4640

Score = 32.3 bits (72), Expect = 2.1
Identities = 16/45 (35%), Positives = 28/45 (62%)
Frame = +2

Query: 347 SALGHPIGVGNSNVGDGHIQSNSVASSLAKFADTKAESSRAVKPD 481
SA G G+GNS G+I ++S +S + ++ ++ SR++KPD
Sbjct: 2663 SAQGFDYGLGNSKGDRGNISTSSKPASTSGKSELSSKHSRSLKPD 2707


>sp|Q63805|A1AG3_MOUSE Alpha-1-acid glycoprotein 3 OS=Mus musculus
GN=Orm3 PE=2 SV=1
Length = 206

Score = 32.0 bits (71), Expect = 2.8
Identities = 36/155 (23%), Positives = 60/155 (38%), Gaps = 15/155 (9%)
Frame = +2

Query: 149 PTTIRTMKRLTGSCDLARILTRPRHLYWQSRQMVTTCY----------DVEMARNEQQEN 298
P T T+ L+G L + Y Q Q V T + +E+ +++
Sbjct: 30 PITNETLSWLSGKWFLIAVADSDPD-YRQEIQKVQTIFFYLTLNKINDTMELREYHTKDD 88

Query: 299 HRLANPQFKNFASTNTSALGHPIGVGNSNVGDGHIQSNSVASSLAKFADTKAESSRAV-- 472
H + N F N + + V N + H++ ++ F D K E R +
Sbjct: 89 HCVYNSNLLGFQRENGTLFKYEGEVENPS----HLRVLEKHGAIMLFFDLKDEKKRGLSL 144

Query: 473 ---KPDLPPEKRKIRPDEYNVQKARIHANLWQQEI 568
+PD+PPE R++ QKA H + + EI
Sbjct: 145 SARRPDIPPELREV------FQKAVTHVGMDESEI 173


>sp|Q54HS1|Y9416_DICDI Putative uncharacterized protein DDB_G0289263
OS=Dictyostelium discoideum GN=DDB_G0289263 PE=4 SV=1
Length = 815

Score = 30.8 bits (68), Expect = 6.2
Identities = 15/46 (32%), Positives = 25/46 (54%), Gaps = 1/46 (2%)
Frame = +2

Query: 281 NEQQENHRLANPQFK-NFASTNTSALGHPIGVGNSNVGDGHIQSNS 415
N+ NH NP F N + N++ +GH N+NV + +I +N+
Sbjct: 236 NQLYNNHNNGNPNFYINNNNNNSNNIGHNNNNNNNNVNNNNISNNN 281


>sp|Q60929|MEF2A_MOUSE Myocyte-specific enhancer factor 2A OS=Mus
musculus GN=Mef2a PE=1 SV=2
Length = 498

Score = 30.8 bits (68), Expect = 6.2
Identities = 26/89 (29%), Positives = 40/89 (44%), Gaps = 20/89 (22%)
Frame = +2

Query: 341 NTSALGHPIGVGNSNVGDGHIQS--------NSVASSLAKFADTKAE--------SSRAV 472
+T+ L P G GNS VG+G + S N+ A+SL K TK+ +
Sbjct: 208 STTDLTVPNGAGNSPVGNGFVNSRASPNLIGNTGANSLGKVMPTKSPPPPGGGSLGMNSR 267

Query: 473 KPDL----PPEKRKIRPDEYNVQKARIHA 547
KPDL PP + + P ++ ++A
Sbjct: 268 KPDLRVVIPPSSKGMMPPLSEEEELELNA 296


>sp|Q8BR86|KIRR3_MOUSE Kin of IRRE-like protein 3 OS=Mus musculus
GN=Kirrel3 PE=2 SV=1
Length = 778

Score = 30.8 bits (68), Expect = 6.2
Identities = 15/29 (51%), Positives = 16/29 (55%)
Frame = +3

Query: 453 LRVLEPLNLTCHQKNEKSVLMSIMYRKHE 539
LR +PLNLTCH N K I RK E
Sbjct: 160 LRAGDPLNLTCHADNAKPAASIIWLRKGE 188


>sp|Q8IZU9|KIRR3_HUMAN Kin of IRRE-like protein 3 OS=Homo sapiens
GN=KIRREL3 PE=1 SV=1
Length = 778

Score = 30.8 bits (68), Expect = 6.2
Identities = 15/29 (51%), Positives = 16/29 (55%)
Frame = +3

Query: 453 LRVLEPLNLTCHQKNEKSVLMSIMYRKHE 539
LR +PLNLTCH N K I RK E
Sbjct: 160 LRAGDPLNLTCHADNAKPAASIIWLRKGE 188


>sp|Q8SX83|SPEN_DROME Protein split ends OS=Drosophila melanogaster
GN=spen PE=1 SV=2
Length = 5560

Score = 30.4 bits (67), Expect = 8.1
Identities = 21/84 (25%), Positives = 31/84 (36%), Gaps = 2/84 (2%)
Frame = +2

Query: 182 GSCDLARILTRPRHLYWQSRQMVTTCYDVEMARNEQQENHRLANPQFKNFASTNTSALGH 361
GSC + L P YW+S QQ NH+ + Q +S+NT +
Sbjct: 1374 GSCSGSTFLPSPSSRYWRSSSH----------HQNQQNNHQQQSQQLHGSSSSNTCLMAS 1423

Query: 362 PIGVG--NSNVGDGHIQSNSVASS 427
P +SN D + + S
Sbjct: 1424 PARPRSLSSNSSDSDVPGQNAGGS 1447


>sp|O25424|ASPG_HELPY Probable L-asparaginase OS=Helicobacter pylori
GN=ansA PE=3 SV=1
Length = 330

Score = 30.4 bits (67), Expect = 8.1
Identities = 24/73 (32%), Positives = 35/73 (47%), Gaps = 3/73 (4%)
Frame = +2

Query: 332 ASTNTSALGHPI-GVGNSNVGDGHIQSNSVASSLAK--FADTKAESSRAVKPDLPPEKRK 502
AS N+ A G I GVGN NV G +++ AS + ++ S ++ +K
Sbjct: 237 ASLNSHAKGVVIAGVGNGNVSAGFLKAMQEASQMGVVIVRSSRVNSGEITSGEI-DDKAF 295

Query: 503 IRPDEYNVQKARI 541
I D N QKAR+
Sbjct: 296 ITSDNLNPQKARV 308


tr_hit_id Q4CM22
Definition tr|Q4CM22|Q4CM22_TRYCR Mucin-associated surface protein (MASP), putative OS=Trypanosoma cruzi
Align length 106
Score (bit) 37.4
E-value 0.77
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK959976|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0006_C10, 5'
(639 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|Q4CM22|Q4CM22_TRYCR Mucin-associated surface protein (MASP), ... 37 0.77
tr|B4MRB3|B4MRB3_DROWI GK15804 OS=Drosophila willistoni GN=GK158... 36 2.3
tr|Q1JSB3|Q1JSB3_TOXGO Putative uncharacterized protein OS=Toxop... 35 3.8
tr|B6KC50|B6KC50_TOXGO Putative uncharacterized protein OS=Toxop... 35 3.9
tr|Q7JLC3|Q7JLC3_CAEEL Protein C52E4.6a, confirmed by transcript... 35 5.0
tr|A8X6M5|A8X6M5_CAEBR Putative uncharacterized protein OS=Caeno... 35 5.0
tr|A5K0D5|A5K0D5_PLAVI Putative uncharacterized protein OS=Plasm... 35 5.0
tr|Q1YPE8|Q1YPE8_9GAMM Putative uncharacterized protein OS=gamma... 35 5.1
tr|A4FDS9|A4FDS9_SACEN Putative Glycopeptide antibiotics resista... 34 6.5
tr|Q8W398|Q8W398_ORYSA Putative mucin OS=Oryza sativa GN=OSJNBa0... 34 6.5
tr|Q10KL7|Q10KL7_ORYSJ Os03g0379400 protein OS=Oryza sativa subs... 34 6.5
tr|A3AII1|A3AII1_ORYSJ Putative uncharacterized protein OS=Oryza... 34 6.5
tr|A2XHC8|A2XHC8_ORYSI Putative uncharacterized protein OS=Oryza... 34 6.5
tr|B4IXF0|B4IXF0_DROGR GH15227 OS=Drosophila grimshawi GN=GH1522... 34 6.5
tr|A7APE4|A7APE4_BABBO DNA repair protein rhp16, putative OS=Bab... 34 6.5
tr|A5DYL6|A5DYL6_LODEL Putative uncharacterized protein OS=Lodde... 34 6.5
tr|B1BB88|B1BB88_CLOBO Putative phage related transcriptional re... 34 8.6
tr|B6T3M1|B6T3M1_MAIZE Putative uncharacterized protein OS=Zea m... 34 8.6
tr|B4FTX8|B4FTX8_MAIZE Putative uncharacterized protein OS=Zea m... 34 8.6
tr|A4HHB5|A4HHB5_LEIBR Putative uncharacterized protein OS=Leish... 34 8.6
tr|A8PR31|A8PR31_MALGO Putative uncharacterized protein OS=Malas... 34 8.6

>tr|Q4CM22|Q4CM22_TRYCR Mucin-associated surface protein (MASP),
putative OS=Trypanosoma cruzi GN=Tc00.1047053457979.10
PE=4 SV=1
Length = 350

Score = 37.4 bits (85), Expect = 0.77
Identities = 32/106 (30%), Positives = 46/106 (43%), Gaps = 5/106 (4%)
Frame = +2

Query: 332 ASTNTSALGHPIGVGNSNVGDGHIQSNSVASSLAKFADTKAESSRAVKPDLPP--EKRKI 505
A++N +++G P G NSN G G + +L K A + E K PP + K
Sbjct: 160 ANSNAASVGLPAGSENSNTGCGEVSDCGSQGNLEK-ATVEKEDFTVSKAQKPPATQNTKS 218

Query: 506 RPDEYNVQKA---RIHANLWQQEIKKHVGESTSIDGDGESGLRGDS 634
+ +E N +A QQEIK V ES S D + + S
Sbjct: 219 KDNEENPAEASSTESENTEPQQEIKTPVDESDSTSTDASTAVAARS 264


>tr|B4MRB3|B4MRB3_DROWI GK15804 OS=Drosophila willistoni GN=GK15804
PE=4 SV=1
Length = 532

Score = 35.8 bits (81), Expect = 2.3
Identities = 27/79 (34%), Positives = 36/79 (45%), Gaps = 8/79 (10%)
Frame = +2

Query: 362 PIGVGNSNVGDGHIQSNSVA-SSLAKFADTKAES-------SRAVKPDLPPEKRKIRPDE 517
P+ +VG HIQ N V L KF D + S+ DLPPEK K+ D
Sbjct: 268 PVSYNMISVGGLHIQPNKVLPEDLQKFLDGATDGAIYFSLGSQVRSADLPPEKLKVFLDV 327

Query: 518 YNVQKARIHANLWQQEIKK 574
+ K R+ LW+ E +K
Sbjct: 328 FGSLKQRV---LWKFEDEK 343


>tr|Q1JSB3|Q1JSB3_TOXGO Putative uncharacterized protein OS=Toxoplasma
gondii RH GN=TgIb.2400 PE=4 SV=1
Length = 4189

Score = 35.0 bits (79), Expect = 3.8
Identities = 29/107 (27%), Positives = 45/107 (42%)
Frame = +2

Query: 317 QFKNFASTNTSALGHPIGVGNSNVGDGHIQSNSVASSLAKFADTKAESSRAVKPDLPPEK 496
Q F TS +GV S +G H + SSLA+ T+ R P+ P +
Sbjct: 3987 QGDGFVLRETSQELSALGVVTSAIGCSHA---APGSSLAEAGQTQKTEERVELPEKPSRE 4043

Query: 497 RKIRPDEYNVQKARIHANLWQQEIKKHVGESTSIDGDGESGLRGDSV 637
+ +E ++Q A E ++ E S + +GE+ L DSV
Sbjct: 4044 SAAKVEEVDMQHVGGEAGTKTLEEREQSVEIQSEEQEGEAALLNDSV 4090


>tr|B6KC50|B6KC50_TOXGO Putative uncharacterized protein
OS=Toxoplasma gondii ME49 GN=TGME49_062840 PE=4 SV=1
Length = 178

Score = 35.0 bits (79), Expect = 3.9
Identities = 21/66 (31%), Positives = 33/66 (50%)
Frame = -1

Query: 630 SPLNPDSPSPSIEVDSPTCFLISCCHKFACIRAFCTLYSSGRIFRFSGGKSGLTALELSA 451
SP P P P + S + ++CC F+ + + L+SS F F+G SGL+ L S+
Sbjct: 86 SPAPPSHPLPCRWLSSVS---LACCSFFSFVCSTHRLFSSSCCFSFTGSPSGLSVLSTSS 142

Query: 450 FVSANL 433
+L
Sbjct: 143 LALPSL 148


>tr|Q7JLC3|Q7JLC3_CAEEL Protein C52E4.6a, confirmed by transcript
evidence OS=Caenorhabditis elegans GN=cyl-1 PE=2 SV=1
Length = 480

Score = 34.7 bits (78), Expect = 5.0
Identities = 21/71 (29%), Positives = 36/71 (50%)
Frame = +2

Query: 401 IQSNSVASSLAKFADTKAESSRAVKPDLPPEKRKIRPDEYNVQKARIHANLWQQEIKKHV 580
+++ +A +LAK A +S+ V + + RK+ PD N K R A+ ++E +H
Sbjct: 357 VKAKEIAENLAKMAPDGEKSTSTVT--IGKDSRKVSPDRKNGTKDRGEADRGKKEKDRHR 414

Query: 581 GESTSIDGDGE 613
S DG G+
Sbjct: 415 RRSNDRDGRGD 425


>tr|A8X6M5|A8X6M5_CAEBR Putative uncharacterized protein
OS=Caenorhabditis briggsae GN=CBG08470 PE=4 SV=1
Length = 892

Score = 34.7 bits (78), Expect = 5.0
Identities = 23/89 (25%), Positives = 39/89 (43%), Gaps = 6/89 (6%)
Frame = +2

Query: 374 GNSNVGDGHIQSNSVASSLAKFADTKAESSRAVKPDLPPEKRKIRPDEYNVQKARI-HAN 550
G+ N DG + + VAS +D ++ S D + DE+ + RI H++
Sbjct: 615 GSGNGSDGIEEEDEVASDEGNDSDESSDESSEDDVDSDDVSDEASTDEHKADRERIFHSD 674

Query: 551 LWQQEIK-----KHVGESTSIDGDGESGL 622
+W+Q K + G S G+G G+
Sbjct: 675 IWEQYRKSRCYDERAGSKQSGSGNGSDGI 703


>tr|A5K0D5|A5K0D5_PLAVI Putative uncharacterized protein
OS=Plasmodium vivax GN=PVX_084210 PE=4 SV=1
Length = 805

Score = 34.7 bits (78), Expect = 5.0
Identities = 17/57 (29%), Positives = 32/57 (56%)
Frame = +2

Query: 284 EQQENHRLANPQFKNFASTNTSALGHPIGVGNSNVGDGHIQSNSVASSLAKFADTKA 454
+++ + L+ + +F S+ S +GVG S+VG H+ SN VAS+ A +++
Sbjct: 232 KRKYDKNLSEDSWPHFGSSPRSLSSPSLGVGGSHVGGNHLASNCVASNSANAGSSQS 288


>tr|Q1YPE8|Q1YPE8_9GAMM Putative uncharacterized protein OS=gamma
proteobacterium HTCC2207 GN=GB2207_02940 PE=3 SV=1
Length = 504

Score = 34.7 bits (78), Expect = 5.1
Identities = 26/85 (30%), Positives = 38/85 (44%), Gaps = 7/85 (8%)
Frame = -1

Query: 573 FLISCCHKFACIRAFCTLYSS------GRIFRFSGGKSGLTALELSAFVSAN-LAKLLAT 415
F ++ ++F +RAFC S +F G S LT L A A LAK+
Sbjct: 86 FALALGYRFTLVRAFCAFIRSIHELFWALLFMQIAGLSSLTGLLAIAIPYAGTLAKIYGE 145

Query: 414 LFD*I*PSPTFELPTPIGCPSADVF 340
LF+ + P+P LP P ++ F
Sbjct: 146 LFEEVDPAPANNLPVNKRRPLSEFF 170


>tr|A4FDS9|A4FDS9_SACEN Putative Glycopeptide antibiotics resistance
protein OS=Saccharopolyspora erythraea (strain NRRL
23338) GN=SACE_2926 PE=4 SV=1
Length = 376

Score = 34.3 bits (77), Expect = 6.5
Identities = 23/65 (35%), Positives = 34/65 (52%), Gaps = 3/65 (4%)
Frame = +3

Query: 12 ASYRPITIAKVYVYA---TPTDDHPW*ASSNKWAGSCVIESILRPGRGSLLPLEQ*RGSQ 182
A + P+T+ V+ T DH S + AG CV+++ +RP P E RG+Q
Sbjct: 309 ALWLPLTLIWAAVHGWGVLSTPDHR--GLSGRLAGLCVVDARIRPRTVQSEPREAHRGAQ 366

Query: 183 VHATS 197
VHA+S
Sbjct: 367 VHASS 371


>tr|Q8W398|Q8W398_ORYSA Putative mucin OS=Oryza sativa
GN=OSJNBa0013O08.2 PE=4 SV=1
Length = 309

Score = 34.3 bits (77), Expect = 6.5
Identities = 16/38 (42%), Positives = 23/38 (60%)
Frame = +2

Query: 497 RKIRPDEYNVQKARIHANLWQQEIKKHVGESTSIDGDG 610
+K + DEY +K R+HA LW +EI+K E + G G
Sbjct: 127 KKAQADEYRARKQRVHAALWVKEIEKM--EEARLGGGG 162