DK960377
Clone id TST39A01NGRL0007_E02
Library
Length 661
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0007_E02. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID -
Sequence
GAGTGTAGGGACAGCTCCATGGAGTTTGGGCGTTTCGCTCACGGGCCAACCTGCGTTGCG
CCATAGCTCACGCAGCCGACCACTAGTGAGCGTGCCATTGTCGCTCTCTTTTCCCCGTCT
CGTCACAGCTGCTGCGGCTGCCGCTGCGACTTCTCAAGTTGCTGCTGACGCCCCTCCCGC
CTCTTCTCCTCCTCTCGCTTCTCCTCCGTCTCCCTCTTCTGTTGAAGATGAAGAGGGCGG
TCCTATCCAGCTCCCAGATTCTGGGGACCCCTTGTCCTTTACGTCTGAAAAGATTACACC
TCTGCAGTCTGCAGCGAGCATCTTGCTCACTGGAATTATTGCAGTTCTCCTCTACCGGTC
CTTGCGTCGCAGGATGAAGCTCGCCAAGGAAATGAAAGTTCGCTCGACTGGTTTAGAAAA
CATTAATGAAGCACCCAAGAACGCATCTGTGGTGATTGAGAAAACAGAGAAGACAATGGC
AAAGGCTCCTCTGACGGCCTGGCAAACTTTCCAAGGGGCAGTCATTGCTGGTGCCATTGC
TTTTGTTCTCTACAGGTTCGCAACCTATGTGGAAGCAGGATTTGCCTTAAAACCTGTGTC
AAATGTTTATACGATACGACAGTTGACAATAACCGTCAGGACGATCTTGAATGGAATGTG
C
Homology search results
sp_hit_id O28575
Definition sp|O28575|Y1698_ARCFU UPF0098 protein AF_1698 OS=Archaeoglobus fulgidus
Align length 44
Score (bit) 30.4
E-value 8.7
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK960377|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0007_E02, 5'
(661 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters


Sequences producing significant alignments: Score (bits) E-value

sp|O28575|Y1698_ARCFU UPF0098 protein AF_1698 OS=Archaeoglobus f... 30 8.7
sp|Q6ZR08|DNHD2_HUMAN Dynein heavy chain domain-containing prote... 30 8.7

>sp|O28575|Y1698_ARCFU UPF0098 protein AF_1698 OS=Archaeoglobus
fulgidus GN=AF_1698 PE=3 SV=1
Length = 287

Score = 30.4 bits (67), Expect = 8.7
Identities = 11/44 (25%), Positives = 19/44 (43%)
Frame = +1

Query: 454 D*ENREDNGKGSSDGLANFPRGSHCWCHCFCSLQVRNLCGSRIC 585
D R + G+G F S +C C C+ ++ G ++C
Sbjct: 6 DYSQRREEGEGRGSNFTRFSCWSCDFCRCLCNAEIAGESGGKVC 49


>sp|Q6ZR08|DNHD2_HUMAN Dynein heavy chain domain-containing protein
2 OS=Homo sapiens GN=DNHD2 PE=2 SV=1
Length = 1093

Score = 30.4 bits (67), Expect = 8.7
Identities = 15/47 (31%), Positives = 24/47 (51%), Gaps = 2/47 (4%)
Frame = +1

Query: 262 WGPLVLYV*KDYTSAVCSEHLAHW--NYCSSPLPVLASQDEARQGNE 396
W P++ + +D+TS C+ W +Y SS PV Q+ + NE
Sbjct: 563 WMPMLEKICEDFTSETCNSSFRLWLTSYPSSKFPVTILQNGVKMTNE 609


tr_hit_id A9SV52
Definition tr|A9SV52|A9SV52_PHYPA Predicted protein OS=Physcomitrella patens subsp. patens
Align length 147
Score (bit) 123.0
E-value 9.0e-27
Report
BLASTX 2.2.19 [Nov-02-2008]



Query= DK960377|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0007_E02, 5'
(661 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters


Sequences producing significant alignments: Score (bits) E-value

tr|A9SV52|A9SV52_PHYPA Predicted protein OS=Physcomitrella paten... 123 9e-27
tr|Q8L954|Q8L954_ARATH Putative uncharacterized protein OS=Arabi... 115 2e-24
tr|Q2HIU0|Q2HIU0_ARATH At3g15110 OS=Arabidopsis thaliana PE=2 SV=1 115 3e-24
tr|A9PIQ9|A9PIQ9_POPJC Putative uncharacterized protein OS=Popul... 114 5e-24
tr|A7P1U3|A7P1U3_VITVI Chromosome chr19 scaffold_4, whole genome... 114 5e-24
tr|Q6Z0W7|Q6Z0W7_ORYSJ Os02g0307800 protein OS=Oryza sativa subs... 111 3e-23
tr|B6T671|B6T671_MAIZE Putative uncharacterized protein OS=Zea m... 110 8e-23
tr|B7FJ85|B7FJ85_MEDTR Putative uncharacterized protein OS=Medic... 110 1e-22
tr|B8AG73|B8AG73_ORYSI Putative uncharacterized protein OS=Oryza... 93 1e-17
tr|A4RWZ1|A4RWZ1_OSTLU Predicted protein (Fragment) OS=Ostreococ... 69 3e-10
tr|Q9LIM4|Q9LIM4_ARATH Arabidopsis thaliana genomic DNA, chromos... 63 1e-08
tr|A8IF56|A8IF56_CHLRE Predicted protein (Fragment) OS=Chlamydom... 53 2e-05
tr|B8HWT7|B8HWT7_9CHRO Putative uncharacterized protein OS=Cyano... 45 0.004
tr|B1XI35|B1XI35_SYNP2 Conserved hypothetical membrane protein O... 41 0.058
tr|B0C4K0|B0C4K0_ACAM1 Putative uncharacterized protein OS=Acary... 41 0.075
tr|B4WM34|B4WM34_9SYNE Putative uncharacterized protein OS=Synec... 40 0.098
tr|A0YKB3|A0YKB3_9CYAN Putative uncharacterized protein OS=Lyngb... 39 0.22
tr|B1WTZ0|B1WTZ0_CYAA5 Putative uncharacterized protein OS=Cyano... 36 2.4
tr|A7H6V2|A7H6V2_ANADF Putative uncharacterized protein OS=Anaer... 35 5.4
tr|Q5N2C0|Q5N2C0_SYNP6 Putative uncharacterized protein OS=Synec... 34 7.1
tr|Q31RZ2|Q31RZ2_SYNE7 Putative uncharacterized protein OS=Synec... 34 7.1
tr|A8YXU2|A8YXU2_LACH4 Putative uncharacterized protein OS=Lacto... 34 9.2
tr|A4T522|A4T522_MYCGI Aldehyde dehydrogenase OS=Mycobacterium g... 34 9.2

>tr|A9SV52|A9SV52_PHYPA Predicted protein OS=Physcomitrella patens
subsp. patens GN=PHYPADRAFT_188722 PE=4 SV=1
Length = 277

Score = 123 bits (309), Expect = 9e-27
Identities = 76/147 (51%), Positives = 97/147 (65%), Gaps = 1/147 (0%)
Frame = +2

Query: 224 EDEEGGPIQLP-DSGDPLSFTSEKITPLQSAASILLTGIIAVLLYRSLRRRMKLAKEMKV 400
E+ GPI+LP + DPL+ + +PLQ AAS++LTG+I VLL RSLRRR K AKE +
Sbjct: 84 EEISDGPIELPPELLDPLAIP--EASPLQVAASVVLTGLITVLLIRSLRRRSKKAKETRF 141

Query: 401 RSTGLENINEAPKNASVVIEKTEKTMAKAPLTAWQTFQGAVIAGAIAFVLYRFATYVEAG 580
RSTG E +A K+A ++ K + P +A QTF GAV+AG IA VLY+F VE
Sbjct: 142 RSTG-EIKEDARKSAMALLNKAPEVETPPP-SALQTFSGAVVAGFIALVLYKFTVTVEGS 199

Query: 581 FALKPVSNVYTIRQLTITVRTILNGMC 661
F K VS Y+IR LTITVRTI+ G+C
Sbjct: 200 FTGKAVSMNYSIRNLTITVRTIVTGLC 226


>tr|Q8L954|Q8L954_ARATH Putative uncharacterized protein
OS=Arabidopsis thaliana PE=2 SV=1
Length = 266

Score = 115 bits (289), Expect = 2e-24
Identities = 66/154 (42%), Positives = 95/154 (61%), Gaps = 8/154 (5%)
Frame = +2

Query: 224 EDEEGGPIQLPDSG-------DPLSFTSEKITPLQSAASILLTGIIAVLLYRSLRRRMKL 382
E EE GPI+LP S + + TS+ TPLQ A S+LLTG I V L RS+RRR K
Sbjct: 53 EVEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAKR 112

Query: 383 AKEMKVRSTGLE-NINEAPKNASVVIEKTEKTMAKAPLTAWQTFQGAVIAGAIAFVLYRF 559
AKE++ RSTG + ++ E + + T + +A Q F GA+ AG IA +LY+F
Sbjct: 113 AKELQFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYKF 172

Query: 560 ATYVEAGFALKPVSNVYTIRQLTITVRTILNGMC 661
VE+G + +S+ +++RQ+T+TVRTI+NG+C
Sbjct: 173 TVTVESGLNRQTISDNFSVRQITVTVRTIINGIC 206


>tr|Q2HIU0|Q2HIU0_ARATH At3g15110 OS=Arabidopsis thaliana PE=2 SV=1
Length = 266

Score = 115 bits (287), Expect = 3e-24
Identities = 66/154 (42%), Positives = 94/154 (61%), Gaps = 8/154 (5%)
Frame = +2

Query: 224 EDEEGGPIQLPDSG-------DPLSFTSEKITPLQSAASILLTGIIAVLLYRSLRRRMKL 382
E EE GPI+LP S + + TS+ TPLQ A S+LLTG I V L RS+RRR K
Sbjct: 53 EVEEDGPIELPTSSTSPFSSTNSIFATSDDPTPLQLATSVLLTGAITVFLIRSVRRRAKR 112

Query: 383 AKEMKVRSTGLE-NINEAPKNASVVIEKTEKTMAKAPLTAWQTFQGAVIAGAIAFVLYRF 559
AKE+ RSTG + ++ E + + T + +A Q F GA+ AG IA +LY+F
Sbjct: 113 AKELTFRSTGAKKSLKEEAMDNLKALSSTPIEGGNSTPSAAQAFLGAIAAGVIALILYKF 172

Query: 560 ATYVEAGFALKPVSNVYTIRQLTITVRTILNGMC 661
VE+G + +S+ +++RQ+T+TVRTI+NG+C
Sbjct: 173 TVTVESGLNRQTISDNFSVRQITVTVRTIINGIC 206


>tr|A9PIQ9|A9PIQ9_POPJC Putative uncharacterized protein OS=Populus
jackii PE=2 SV=1
Length = 238

Score = 114 bits (285), Expect = 5e-24
Identities = 62/146 (42%), Positives = 92/146 (63%)
Frame = +2

Query: 224 EDEEGGPIQLPDSGDPLSFTSEKITPLQSAASILLTGIIAVLLYRSLRRRMKLAKEMKVR 403
E EE GPI++ S + TS+ + +Q A S+LLTG I+V L+RSLRRR K +KE+K R
Sbjct: 68 EPEEEGPIEILRSSPSIFATSDDPSSIQVATSVLLTGAISVFLFRSLRRRAKRSKELKFR 127

Query: 404 STGLENINEAPKNASVVIEKTEKTMAKAPLTAWQTFQGAVIAGAIAFVLYRFATYVEAGF 583
S+G + + S+ + K P + Q F GA+ AG IA +LY+F T +EA
Sbjct: 128 SSGAKKTLKEEALDSLKTFGSAPIDVKKPPSPVQAFLGAISAGVIALILYKFTTTIEASL 187

Query: 584 ALKPVSNVYTIRQLTITVRTILNGMC 661
+ VS+ +++RQ+T+TVRTI+NG+C
Sbjct: 188 NRQTVSDNFSVRQITVTVRTIVNGLC 213


>tr|A7P1U3|A7P1U3_VITVI Chromosome chr19 scaffold_4, whole genome
shotgun sequence OS=Vitis vinifera GN=GSVIVT00027754001
PE=4 SV=1
Length = 277

Score = 114 bits (285), Expect = 5e-24
Identities = 61/143 (42%), Positives = 92/143 (64%)
Frame = +2

Query: 233 EGGPIQLPDSGDPLSFTSEKITPLQSAASILLTGIIAVLLYRSLRRRMKLAKEMKVRSTG 412
E GPI+LP S + T++ TPLQ A S+LLTG I+V L+RS+RRR+K AKE++ RS+G
Sbjct: 64 EEGPIELPPSSSSIFATNDDPTPLQVATSVLLTGAISVFLFRSIRRRVKRAKELRFRSSG 123

Query: 413 LENINEAPKNASVVIEKTEKTMAKAPLTAWQTFQGAVIAGAIAFVLYRFATYVEAGFALK 592
++ + S+ + A AP + Q G + AG IA +LY+F +EA +
Sbjct: 124 VKKTLKEEALDSLKAMGSGSVKA-APPSPVQALLGGITAGVIALILYKFTITIEASLNRQ 182

Query: 593 PVSNVYTIRQLTITVRTILNGMC 661
VS+ +++RQ+TIT+RTI+NG+C
Sbjct: 183 TVSDNFSVRQITITIRTIINGLC 205


>tr|Q6Z0W7|Q6Z0W7_ORYSJ Os02g0307800 protein OS=Oryza sativa subsp.
japonica GN=OSJNBb0026D20.9 PE=2 SV=1
Length = 273

Score = 111 bits (278), Expect = 3e-23
Identities = 63/150 (42%), Positives = 94/150 (62%), Gaps = 4/150 (2%)
Frame = +2

Query: 224 EDEEG-GPIQLPDSGDPLSFT-SEKITPLQSAASILLTGIIAVLLYRSLRRRMKLAKEMK 397
+D EG GP++L P F+ E TPLQ+A S+LLTG I+V L+RS+RRR++ AKE++
Sbjct: 58 DDGEGDGPVELRT---PTLFSIDENPTPLQTATSVLLTGAISVFLFRSIRRRVRRAKELR 114

Query: 398 VRSTGLENINEAPKNA--SVVIEKTEKTMAKAPLTAWQTFQGAVIAGAIAFVLYRFATYV 571
VRS G+E N K A + + P + Q G + AG IA +LY+F T +
Sbjct: 115 VRSGGVEKPNNLSKEALEGLRLVSASPIEVDKPPSPVQALLGGIAAGVIALILYKFTTTI 174

Query: 572 EAGFALKPVSNVYTIRQLTITVRTILNGMC 661
EA + +S+ +++RQ+TIT+RTI+NG+C
Sbjct: 175 EAALNRQTISDSFSVRQITITIRTIINGIC 204


>tr|B6T671|B6T671_MAIZE Putative uncharacterized protein OS=Zea mays
PE=2 SV=1
Length = 285

Score = 110 bits (275), Expect = 8e-23
Identities = 66/154 (42%), Positives = 97/154 (62%), Gaps = 7/154 (4%)
Frame = +2

Query: 221 VEDEEGGPIQLPDSGDPLSFTSEKITPLQSAASILLTGIIAVLLYRSLRRRMKLAKEMKV 400
VE E GP++L L T + TPLQ+A S+LLTG I+V L+R+LRRR + AKE++V
Sbjct: 63 VEAEGQGPVEL--RAPTLFSTDDNPTPLQTATSLLLTGAISVFLFRALRRRARRAKELRV 120

Query: 401 RSTGLENINEAPKNA-------SVVIEKTEKTMAKAPLTAWQTFQGAVIAGAIAFVLYRF 559
RS+GL+ N + A S +TEK A +P+ Q G + AG IA +LY+F
Sbjct: 121 RSSGLKKPNNLTEEALERLRLMSASPIETEK--ATSPI---QALLGGIAAGVIALILYKF 175

Query: 560 ATYVEAGFALKPVSNVYTIRQLTITVRTILNGMC 661
+T VEA + +S+ +++RQ+TIT+RTI+ G+C
Sbjct: 176 STTVEAALNRQTISDSFSVRQITITIRTIVTGLC 209


>tr|B7FJ85|B7FJ85_MEDTR Putative uncharacterized protein OS=Medicago
truncatula PE=2 SV=1
Length = 271

Score = 110 bits (274), Expect = 1e-22
Identities = 60/141 (42%), Positives = 91/141 (64%), Gaps = 1/141 (0%)
Frame = +2

Query: 239 GPIQLP-DSGDPLSFTSEKITPLQSAASILLTGIIAVLLYRSLRRRMKLAKEMKVRSTGL 415
GPI++P DS L +++ + +Q AAS+LLTG I+VLL+RS RRR K K+ + RS+G
Sbjct: 58 GPIEIPYDSTPSLLSSTDDPSFIQIAASLLLTGAISVLLFRSFRRRAKRLKQTQFRSSGE 117

Query: 416 ENINEAPKNASVVIEKTEKTMAKAPLTAWQTFQGAVIAGAIAFVLYRFATYVEAGFALKP 595
+++ E K P + QTF GA+ AG I+ +LY+FAT +EAG + +
Sbjct: 118 KSVKEEALETLKATGTASIETTKGPPSPVQTFLGAISAGVISLILYKFATIIEAGLSRQT 177

Query: 596 VSNVYTIRQLTITVRTILNGM 658
+S+ ++ RQ+TITVRTI+NG+
Sbjct: 178 ISDDFSARQITITVRTIINGL 198


>tr|B8AG73|B8AG73_ORYSI Putative uncharacterized protein OS=Oryza
sativa subsp. indica GN=OsI_06901 PE=4 SV=1
Length = 325

Score = 93.2 bits (230), Expect = 1e-17
Identities = 62/180 (34%), Positives = 92/180 (51%), Gaps = 34/180 (18%)
Frame = +2

Query: 224 EDEEG-GPIQLPDSGDPLSFT-SEKITPLQSAASILLTGIIAVLLYRSLRRRMKLAKEMK 397
+D EG GP++L P F+ E TPLQ+A S+LLTG I+V L+RS+RRR + AKE+
Sbjct: 80 DDGEGDGPVELRT---PTLFSIDENPTPLQTATSVLLTGAISVFLFRSIRRRARRAKELV 136

Query: 398 V------------------------------RSTGLENINEAPKNA--SVVIEKTEKTMA 481
+ RS G+E N K A + +
Sbjct: 137 IPSLLIHLRCAFSTGEHTWSTNPISFRDTEGRSGGVEKPNNLSKEALEGLRLVSASPIEV 196

Query: 482 KAPLTAWQTFQGAVIAGAIAFVLYRFATYVEAGFALKPVSNVYTIRQLTITVRTILNGMC 661
P + Q G + AG IA +LY+F T +EA + +S+ +++RQ+TIT+RTI+NG+C
Sbjct: 197 DKPPSPVQALLGGIAAGVIALILYKFTTTIEAALNRQTISDSFSVRQITITIRTIINGIC 256


>tr|A4RWZ1|A4RWZ1_OSTLU Predicted protein (Fragment) OS=Ostreococcus
lucimarinus (strain CCE9901) GN=OSTLU_92691 PE=4 SV=1
Length = 181

Score = 68.9 bits (167), Expect = 3e-10
Identities = 44/131 (33%), Positives = 68/131 (51%)
Frame = +2

Query: 266 DPLSFTSEKITPLQSAASILLTGIIAVLLYRSLRRRMKLAKEMKVRSTGLENINEAPKNA 445
D +E+ TPLQ+ +L TG R++++R A+E +V N P+
Sbjct: 3 DSYGVPTEEATPLQNVFGVLFTGFSFWYFARAVQKRSGKAREFRVA-------NRLPEEE 55

Query: 446 SVVIEKTEKTMAKAPLTAWQTFQGAVIAGAIAFVLYRFATYVEAGFALKPVSNVYTIRQL 625
I++ E+ LTA Q+F G + I+ VLY FA+ V A F K + T+RQ+
Sbjct: 56 RAKIDE-ERAKKTKELTAMQSFTGGLTGIGISVVLYAFASKVSASFDGKALPESETVRQI 114

Query: 626 TITVRTILNGM 658
+ITVRTI+ G+
Sbjct: 115 SITVRTIVEGL 125