DK960277
Clone id TST39A01NGRL0006_P16
Library
Length 619
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0006_P16. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID -
Sequence
GTATCACTGAAGGAAATAACCACAAGAACCACAATTTTTTTTTGCTTGTTGCGGTCGCAC
GTAACCAAGATTTCTTGGGCGTGTCAATTTCTACTCTTGTTTTTGAAGAGTGACAGGAAA
CCCAAGAAAGAAGGGCGCCTTGTGCCTCTTTTCACTCATTGAGAGCGGACCGTGCACTTA
GCGGCTGACAAATCTACGTGAGGTGGGAAAAAGCAACCTTGGTGTGAGCGCAAGGCCGCC
AAGGGGTGCGCCTGCGGAAAAGAGCCGAAATCGGGTGAAGAGAAAATCCTTCTCTAACAC
AGATTGTAAGCGAAAGAAGAACCTTAAGAACCTCAAGAACTCAAGAACCCCGAGAACTAA
CACCCTCAAGAACCTCAAGAACTCAAGACCCAAGATCCACAAGAAACCAAGTTCCTCGTG
AACTTAAGAACCAAAGATCTTAGTGGATGGTCATCACCCGAACCGGAATGGATACTTTGC
ATGAAGGTGATGAGCATGAGGGTGGTGAAGAAGAATTTTCCCCACTAGGAGGTTTTTCCG
TCCCTCCCCTTTCCCCATCTACTCCAAGGATTGCACGAACTCCTATGGAGGAGTTGAAGC
AATTGTGAAAAGGAAGTTA
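
The record declares Length 619, and the sequence is wrapped at 60 bases per line (ten full lines plus a final line of 19). The declared length can be checked against the wrapped lines with a short script. This is a minimal sketch, not part of the original record; the helper names join_sequence_lines and base_counts are hypothetical and chosen only for illustration.

```python
from collections import Counter


def join_sequence_lines(lines):
    """Concatenate wrapped sequence lines into one uppercase string.

    Whitespace around each line is dropped; no other validation is done.
    """
    return "".join(line.strip() for line in lines).upper()


def base_counts(seq):
    """Return a Counter mapping each base letter to its number of occurrences."""
    return Counter(seq)


if __name__ == "__main__":
    # Only the last wrapped line of the record is used here for brevity;
    # pass all eleven lines to check the declared length of 619.
    tail = join_sequence_lines(["AATTGTGAAAAGGAAGTTA"])
    print(len(tail), dict(base_counts(tail)))
```

Comparing len(join_sequence_lines(all_lines)) with the Length field is a quick way to catch truncated or doubly pasted records before running a homology search.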
Homology search results
sp_hit_id P18583
Definition sp|P18583|SON_HUMAN SON protein OS=Homo sapiens
Align length 68
Score (bit) 35.4
E-value 0.24
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK960277|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0006_P16, 5'
(619 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters




                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

sp|P18583|SON_HUMAN SON protein OS=Homo sapiens GN=SON PE=1 SV=3       35   0.24
sp|Q8S9J8|CAP1_ARATH Putative clathrin assembly protein At4g3228...    35   0.31
sp|Q9QX47|SON_MOUSE SON protein OS=Mus musculus GN=Son PE=1 SV=1       35   0.40
sp|Q6FX33|MSC3_CANGA Meiotic sister-chromatid recombination prot...    33   1.5
sp|P12036|NFH_HUMAN Neurofilament heavy polypeptide OS=Homo sapi...    31   5.9
sp|A0QHG7|LGT_MYCA1 Prolipoprotein diacylglyceryl transferase OS...    30   7.7

>sp|P18583|SON_HUMAN SON protein OS=Homo sapiens GN=SON PE=1 SV=3
Length = 2426

Score = 35.4 bits (80), Expect = 0.24
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Frame = +3

Query: 249 RLRKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRTQDPR----STRNQVP 416
R R+ +G R + S++ R R SRT R TPSR SRT R S R++ P
Sbjct: 1936 RRRRSRSVGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTP 1995

Query: 417 RELKNQRS 440
+ RS
Sbjct: 1996 SRRRRSRS 2003
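
In a BLASTX alignment the Query coordinates are nucleotide positions and the Query line is their conceptual translation, so the Frame value follows from the start coordinate: position 249 falls in forward frame +3, and position 316 (the CAP1_ARATH hit) in frame +1. The sketch below illustrates this with the standard genetic code. It is an illustration only and not the code BLAST itself uses; the names translate, forward_frame and translate_span are hypothetical, and reverse-strand frames are not handled.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt):
    """Translate a nucleotide string codon by codon; '*' marks a stop codon.

    Codons containing letters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, usable, 3))


def forward_frame(start):
    """Return the forward reading frame (1, 2 or 3) for a 1-based start position."""
    return (start - 1) % 3 + 1


def translate_span(seq, start, end):
    """Translate the 1-based inclusive nucleotide span start..end of seq."""
    return translate(seq[start - 1:end])


if __name__ == "__main__":
    # Query nucleotides 249..278 of this record, copied from the sequence above.
    segment = "CGCCTGCGGAAAAGAGCCGAAATCGGGTGA"
    print(forward_frame(249), translate(segment))
```

The translated segment corresponds to the first ten residues of the Query line in the alignment above, including the in-frame stop codon shown as an asterisk.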


>sp|Q8S9J8|CAP1_ARATH Putative clathrin assembly protein At4g32285
OS=Arabidopsis thaliana GN=At4g32285 PE=1 SV=1
Length = 635

Score = 35.0 bits (79), Expect = 0.31
Identities = 24/49 (48%), Positives = 27/49 (55%), Gaps = 5/49 (10%)
Frame = +1

Query: 316 EEP*EPQELKN---PEN*HPQEPQ--ELKTQDPQETKFLVNLRTKDLSG 447
EEP + E+K PEN P P E K Q PQ T LVNLR D+SG
Sbjct: 379 EEPVDMNEIKALPPPENHTPPPPPAPEPKPQQPQVTDDLVNLREDDVSG 427


>sp|Q9QX47|SON_MOUSE SON protein OS=Mus musculus GN=Son PE=1 SV=1
Length = 2404

Score = 34.7 bits (78), Expect = 0.40
Identities = 23/64 (35%), Positives = 31/64 (48%)
Frame = +3

Query: 249 RLRKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRTQDPRSTRNQVPRELK 428
R R+ +G R + S++ R R SRT R TPSR SRT S R++ P +
Sbjct: 1921 RRRRSISVGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTP---SRRSRTPSRRR 1977

Query: 429 NQRS 440
RS
Sbjct: 1978 RSRS 1981


>sp|Q6FX33|MSC3_CANGA Meiotic sister-chromatid recombination protein
3 OS=Candida glabrata GN=MSC3 PE=3 SV=1
Length = 834

Score = 32.7 bits (73), Expect = 1.5
Identities = 28/107 (26%), Positives = 47/107 (43%)
Frame = +3

Query: 63 NQDFLGVSISTLVFEE*QETQERRAPCASFHSLRADRALSG*QIYVRWEKATLV*AQGRQ 242
N + GVS S + R+ P +SLR+DRA S I + AQ R+
Sbjct: 57 NPNATGVSRSYSLMHSYNPAAARQPPAGRTYSLRSDRASS---ITSNSRRPAAGKAQVRR 113

Query: 243 GVRLRKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRT 383
++RA + + Q + T+ T++ ++P+ T S T +T
Sbjct: 114 ASTQQRRASVDQELGTGVNQPRTNSITVTTTKVRDPQGRTKSITKKT 160


>sp|P12036|NFH_HUMAN Neurofilament heavy polypeptide OS=Homo sapiens
GN=NEFH PE=1 SV=3
Length = 1026

Score = 30.8 bits (68), Expect = 5.9
Identities = 24/79 (30%), Positives = 35/79 (44%)
Frame = +1

Query: 226 ERKAAKGCACGKEPKSGEEKILL*HRL*AKEEP*EPQELKNPEN*HPQEPQELKTQDPQE 405
E++ AK A K P+ + AKEE P E K+PE + P E+K+ P++
Sbjct: 534 EKEEAKSPAEVKSPEKAKSP--------AKEEAKSPPEAKSPEKEEAKSPAEVKS--PEK 583

Query: 406 TKFLVNLRTKDLSGWSSPE 462
K K + SPE
Sbjct: 584 AKSPAKEEAKSPAEAKSPE 602


>sp|A0QHG7|LGT_MYCA1 Prolipoprotein diacylglyceryl transferase
OS=Mycobacterium avium (strain 104) GN=lgt PE=3 SV=1
Length = 440

Score = 30.4 bits (67), Expect = 7.7
Identities = 18/48 (37%), Positives = 28/48 (58%)
Frame = +1

Query: 259 KEPKSGEEKILL*HRL*AKEEP*EPQELKNPEN*HPQEPQELKTQDPQ 402
+EP+S E + A EEP EP E + PE +EP+E +T++P+
Sbjct: 355 EEPESEETE--------AAEEPGEP-EAEEPEEPEAEEPEEPETEEPE 393


tr_hit_id Q80TM4
Definition tr|Q80TM4|Q80TM4_MOUSE MKIAA1019 protein (Fragment) OS=Mus musculus
Align length 68
Score (bit) 35.4
E-value 2.7
Report
BLASTX 2.2.19 [Nov-02-2008]


Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters




                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

tr|Q80TM4|Q80TM4_MOUSE MKIAA1019 protein (Fragment) OS=Mus muscu...    35   2.7
tr|Q59A30|Q59A30_BOVIN SON DNA-binding protein OS=Bos taurus GN=...    35   2.7
tr|B4USW0|B4USW0_OTOGA SON DNA-binding protein isoform F (Predic...    35   2.7
tr|B1MT65|B1MT65_CALMO SON DNA-binding protein isoform F (Predic...    35   2.7
tr|B0VXG9|B0VXG9_CALJA SON DNA-binding protein isoform F (Predic...    35   2.7
tr|A9CB01|A9CB01_PAPAN SON DNA binding protein, isoform f (Predi...    35   2.7
tr|B3EX17|B3EX17_SORAR SON protein (Predicted) OS=Sorex araneus ...    35   3.5
tr|Q501G2|Q501G2_ARATH At4g32285 OS=Arabidopsis thaliana PE=2 SV=1     35   3.6
tr|Q8C9T5|Q8C9T5_MOUSE Putative uncharacterized protein (Fragmen...    35   4.6
tr|Q6IEB6|Q6IEB6_MOUSE Putative ISG12(B2) protein OS=Mus musculu...    34   6.1
tr|Q8VC49|Q8VC49_MOUSE Putative uncharacterized protein OS=Mus m...    34   7.9

>tr|Q80TM4|Q80TM4_MOUSE MKIAA1019 protein (Fragment) OS=Mus musculus
GN=Son PE=2 SV=1
Length = 480

Score = 35.4 bits (80), Expect = 2.7
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Frame = +3

Query: 249 RLRKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRTQDPR----STRNQVP 416
R R+ +G R + S++ R R SRT R TPSR SRT R S R++ P
Sbjct: 102 RRRRSRSVGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTP 161

Query: 417 RELKNQRS 440
+ RS
Sbjct: 162 SRRRRSRS 169


>tr|Q59A30|Q59A30_BOVIN SON DNA-binding protein OS=Bos taurus GN=son
PE=4 SV=1
Length = 2136

Score = 35.4 bits (80), Expect = 2.7
Identities = 24/66 (36%), Positives = 32/66 (48%), Gaps = 4/66 (6%)
Frame = +3

Query: 255 RKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRTQDPR----STRNQVPRE 422
R+R+ G R + S++ R R SRT R TPSR SRT R S R++ P
Sbjct: 1615 RRRSRSGGRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSR 1674

Query: 423 LKNQRS 440
+ RS
Sbjct: 1675 RRRSRS 1680


>tr|B4USW0|B4USW0_OTOGA SON DNA-binding protein isoform F (Predicted)
OS=Otolemur garnettii GN=SON PE=4 SV=2
Length = 2411

Score = 35.4 bits (80), Expect = 2.7
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Frame = +3

Query: 249 RLRKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRTQDPR----STRNQVP 416
R R+ +G R + S++ R R SRT R TPSR SRT R S R++ P
Sbjct: 1921 RRRRSRSVGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTP 1980

Query: 417 RELKNQRS 440
+ RS
Sbjct: 1981 SRRRRSRS 1988


>tr|B1MT65|B1MT65_CALMO SON DNA-binding protein isoform F (Predicted)
OS=Callicebus moloch GN=SON PE=4 SV=1
Length = 2422

Score = 35.4 bits (80), Expect = 2.7
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Frame = +3

Query: 249 RLRKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRTQDPR----STRNQVP 416
R R+ +G R + S++ R R SRT R TPSR SRT R S R++ P
Sbjct: 1932 RRRRSRSVGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTP 1991

Query: 417 RELKNQRS 440
+ RS
Sbjct: 1992 SRRRRSRS 1999


>tr|B0VXG9|B0VXG9_CALJA SON DNA-binding protein isoform F (Predicted)
OS=Callithrix jacchus GN=SON PE=4 SV=1
Length = 2454

Score = 35.4 bits (80), Expect = 2.7
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Frame = +3

Query: 249 RLRKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRTQDPR----STRNQVP 416
R R+ +G R + S++ R R SRT R TPSR SRT R S R++ P
Sbjct: 1931 RRRRSRSVGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTP 1990

Query: 417 RELKNQRS 440
+ RS
Sbjct: 1991 SRRRRSRS 1998


>tr|A9CB01|A9CB01_PAPAN SON DNA binding protein, isoform f (Predicted)
OS=Papio anubis GN=SON PE=4 SV=1
Length = 2426

Score = 35.4 bits (80), Expect = 2.7
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Frame = +3

Query: 249 RLRKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRTQDPR----STRNQVP 416
R R+ +G R + S++ R R SRT R TPSR SRT R S R++ P
Sbjct: 1936 RRRRSRSVGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTP 1995

Query: 417 RELKNQRS 440
+ RS
Sbjct: 1996 SRRRRSRS 2003


>tr|B3EX17|B3EX17_SORAR SON protein (Predicted) OS=Sorex araneus
GN=SON PE=4 SV=1
Length = 1825

Score = 35.0 bits (79), Expect = 3.5
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Frame = +3

Query: 249 RLRKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRTQDPR----STRNQVP 416
R R+ +G R + S++ R R SRT R TPSR SRT R S R++ P
Sbjct: 1326 RRRRSRSVGRRRSFSVSPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRSRTP 1385

Query: 417 RELKNQRS 440
+ RS
Sbjct: 1386 SRRRRSRS 1393


>tr|Q501G2|Q501G2_ARATH At4g32285 OS=Arabidopsis thaliana PE=2 SV=1
Length = 635

Score = 35.0 bits (79), Expect = 3.6
Identities = 24/49 (48%), Positives = 27/49 (55%), Gaps = 5/49 (10%)
Frame = +1

Query: 316 EEP*EPQELKN---PEN*HPQEPQ--ELKTQDPQETKFLVNLRTKDLSG 447
EEP + E+K PEN P P E K Q PQ T LVNLR D+SG
Sbjct: 379 EEPVDMNEIKALPPPENHTPPPPPAPEPKPQQPQVTDDLVNLREDDVSG 427


>tr|Q8C9T5|Q8C9T5_MOUSE Putative uncharacterized protein (Fragment)
OS=Mus musculus GN=Son PE=2 SV=1
Length = 488

Score = 34.7 bits (78), Expect = 4.6
Identities = 23/64 (35%), Positives = 31/64 (48%)
Frame = +3

Query: 249 RLRKRAEIG*RENPSLTQIVSERRTLRTSRTQEPRELTPSRTSRTQDPRSTRNQVPRELK 428
R R+ +G R + S++ R R SRT R TPSR SRT S R++ P +
Sbjct: 323 RRRRSRSVGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTP---SRRSRTPSRRR 379

Query: 429 NQRS 440
RS
Sbjct: 380 RSRS 383


>tr|Q6IEB6|Q6IEB6_MOUSE Putative ISG12(B2) protein OS=Mus musculus
GN=1810023F06Rik PE=2 SV=1
Length = 283

Score = 34.3 bits (77), Expect = 6.1
Identities = 19/34 (55%), Positives = 25/34 (73%), Gaps = 1/34 (2%)
Frame = +1

Query: 313 KEEP*EPQELKNPEN*HPQEPQEL-KTQDPQETK 411
++EP EPQEL+ + PQEPQEL K Q+ QET+
Sbjct: 236 QKEPQEPQELQKQQE--PQEPQELQKQQETQETQ 267