DK950764
Clone id TST38A01NGRL0009_I23
Library
Length 673
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0009_I23. 5' end sequence.
Accession
Tissue type prothallia
Developmental stage gametophyte
Contig ID -
Sequence
CACATAAATCCATCAAGTATATCGCGTCACAAGGCTTTTGTGTCAGTCCGATGAAGATCC
AAGAGCACGGAAATAGGATGCCTGGCTCTCTGTGTTCAAACGATGCACTCTTGTTCTATC
AACATTCTTGCAGCAGTGCTGGGCCTTCTAGCGTACATATTTCGGAAAATGTCCTCTGTT
TTAAAGCAGTTGACGTGGTTGAATGCCCACAATGCAGCATCAAGCAGGACACAAGCCAAT
CATGTAGCTGCCCAGTTAGCACTTCATGTTTGCTCCCATTATTAAGTTCTGGAGAGGCAT
TTATTGGCATGAGAATAAAGCAATCAAGTGTAAAGCATTGTCGTGATGCGCCAAATGATC
CTATTTGTGAGCGTAACCCGGTATTTGTTGGAGATGTAAGAGTCTTTTCAAACAGCCTGC
AGCTGACTAACTACTGGCCACAGCAAATTGGGGATTTTGGCTTCTGTTTCATGCTGTTTC
GGCACCTTGAAAGCCTTCTTACGTACACCTTCTCTCTGTCAGCCGCAATGGCCCTGCTCA
ATAGTGCCCTGGTATTTTATCTTGATGGAGAGCAAATTTTCCAAGCCGGGCTCGTCTTGA
ACAAGTCATGGGTTCCCCCCAAGAGACATGCGAAGATCATGCGCACTGTGCTTACTTTAG
GAACAATTCTCAT
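
The BLASTX reports in this record compare translations of the sequence above against protein databases. As a minimal sketch of that translation step, assuming the standard genetic code and using hypothetical helper names (`CODON_TABLE`, `translate`) that are not part of the record, the first 60 nt can be translated in forward frame 3:

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, frame: int = 1) -> str:
    """Translate a forward reading frame (1, 2 or 3).

    Codons that are not in the table (for example, containing N) become 'X'.
    A trailing partial codon is ignored.
    """
    if frame not in (1, 2, 3):
        raise ValueError("frame must be 1, 2 or 3")
    dna = dna.upper()
    start = frame - 1
    codons = (dna[i:i + 3] for i in range(start, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 60 nt of the record, copied from the Sequence field.
first_line = "CACATAAATCCATCAAGTATATCGCGTCACAAGGCTTTTGTGTCAGTCCGATGAAGATCC"
print(translate(first_line, frame=3))
```

The printed peptide can be compared with the start of the `Query: 3` line in the B4FN90 alignment further down, which is reported in frame +3.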
■■Homology search results ■■ -
sp_hit_id Q5IJ48
Definition sp|Q5IJ48|CRUM2_HUMAN Crumbs homolog 2 OS=Homo sapiens
Align length 70
Score (bit) 33.5
E-value 1.0
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK950764|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0009_I23, 5'
(673 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q5IJ48|CRUM2_HUMAN Crumbs homolog 2 OS=Homo sapiens GN=CRB2 P...    33   1.0
sp|P09837|WAP_CAMDR Whey acidic protein OS=Camelus dromedarius G...    31   5.2
sp|Q9ERS8|SEBOX_RAT Homeobox protein SEBOX OS=Rattus norvegicus ...    31   6.8
sp|Q8VIK5|PEAR1_MOUSE Platelet endothelial aggregation receptor ...    31   6.8
sp|Q0D5P3|FH11_ORYSJ Formin-like protein 11 OS=Oryza sativa subs...    30   8.9
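
Each row of the hit summary carries a database identifier, a truncated description, a bit score and an E-value. A minimal parsing sketch follows, assuming the last two whitespace-separated fields are always the score and E-value; `Hit` and `parse_hit_line` are hypothetical names introduced here for illustration:

```python
from typing import NamedTuple


class Hit(NamedTuple):
    db: str          # "sp" (Swiss-Prot) or "tr" (TrEMBL)
    accession: str   # e.g. "Q5IJ48"
    entry_name: str  # e.g. "CRUM2_HUMAN"
    bits: float
    evalue: float


def parse_hit_line(line: str) -> Hit:
    """Split one summary row into identifier parts, bit score and E-value."""
    desc, bits, evalue = line.rsplit(None, 2)
    identifier = desc.split(None, 1)[0]
    db, accession, entry_name = identifier.split("|")
    # Assumption: some BLAST versions print very small E-values as "e-100";
    # prepend "1" so float() accepts them.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return Hit(db, accession, entry_name, float(bits), float(evalue))


row = "sp|Q5IJ48|CRUM2_HUMAN Crumbs homolog 2 OS=Homo sapiens GN=CRB2 P... 33 1.0"
print(parse_hit_line(row))
```

Splitting from the right keeps the parser independent of how many words the truncated description contains.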

>sp|Q5IJ48|CRUM2_HUMAN Crumbs homolog 2 OS=Homo sapiens GN=CRB2 PE=1
SV=2
Length = 1285

Score = 33.5 bits (75), Expect = 1.0
Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 6/70 (8%)
Frame = +3

Query: 195 VVECPQ------CSIKQDTSQSCSCPVSTSCLLPLLSSGEAFIGMRIKQSSVKHCRDAPN 356
+ CP+ CS++ Q +CP++ +C +P+ SG S V HC +
Sbjct: 381 ICRCPETWGGRDCSVQLTGCQGHTCPLAATC-IPIFESG--------VHSYVCHCPPGTH 431

Query: 357 DPICERNPVF 386
P C +N F
Sbjct: 432 GPFCGQNTTF 441
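
The `Identities`, `Positives` and `Gaps` figures in each alignment header are counts over the alignment length, followed by a percentage. The sketch below extracts them and recomputes the percentages; it assumes integer truncation, which is consistent with the values printed in this record (6/70 appears as 8%). `parse_stats` and `truncated_percent` are hypothetical helper names:

```python
import re

# Matches fragments such as "Identities = 19/70 (27%)".
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line: str) -> dict:
    """Return {name: (count, alignment_length, printed_percent)}."""
    return {
        name: (int(num), int(den), int(pct))
        for name, num, den, pct in STAT_RE.findall(line)
    }


def truncated_percent(count: int, length: int) -> int:
    """Percentage with the fractional part dropped (assumed BLAST behaviour)."""
    return 100 * count // length


line = "Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 6/70 (8%)"
for name, (count, length, printed) in parse_stats(line).items():
    print(name, count, length, printed, truncated_percent(count, length))
```

Headers without a `Gaps` field, as in the ungapped alignments of this record, simply yield a dictionary without that key.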


>sp|P09837|WAP_CAMDR Whey acidic protein OS=Camelus dromedarius
GN=WAP PE=1 SV=1
Length = 117

Score = 31.2 bits (69), Expect = 5.2
Identities = 22/78 (28%), Positives = 33/78 (42%), Gaps = 7/78 (8%)
Frame = +3

Query: 165 ENVLCFKAVDVVECPQ---CSIKQDTSQSCSCPVSTSCLLPLLSSGEA-FIGMRIKQSSV 332
+N V+ CPQ C + S+SC+ P+ S P+L G ++ + +
Sbjct: 21 DNACIISCVNDESCPQGTKCCARSPCSRSCTVPLMVSSPEPVLKDGRCPWVQTPL---TA 77

Query: 333 KHC---RDAPNDPICERN 377
KHC D D CE N
Sbjct: 78 KHCLEKNDCSRDDQCEGN 95


>sp|Q9ERS8|SEBOX_RAT Homeobox protein SEBOX OS=Rattus norvegicus
GN=Sebox PE=2 SV=1
Length = 188

Score = 30.8 bits (68), Expect = 6.8
Identities = 11/26 (42%), Positives = 18/26 (69%)
Frame = +3

Query: 237 QSCSCPVSTSCLLPLLSSGEAFIGMR 314
           Q+ +CP  TSCL P+L  G+++ G +
Sbjct: 119 QTSACPPQTSCLAPILGPGQSWSGAK 144


>sp|Q8VIK5|PEAR1_MOUSE Platelet endothelial aggregation receptor 1
OS=Mus musculus GN=Pear1 PE=1 SV=1
Length = 1034

Score = 30.8 bits (68), Expect = 6.8
Identities = 29/99 (29%), Positives = 39/99 (39%), Gaps = 13/99 (13%)
Frame = +3

Query: 138 AGPSSVHISENVLCFKAVDVVECPQC------SIKQDTSQSCSCPVSTSCLLPLLSSGEA 299
AGPS NV C + D CP+ + Q + SCSCP ++ L E
Sbjct: 210 AGPSC-----NVPCSQGTDGFFCPRTYPCQNGGVPQGSQGSCSCPPGWMGVICSLPCPEG 264

Query: 300 FIGMRIKQSSVKHCRDAPNDPICER-------NPVFVGD 395
F G Q CR N +C+R P ++GD
Sbjct: 265 FHGPNCTQ----ECR-CHNGGLCDRFTGQCHCAPGYIGD 298


>sp|Q0D5P3|FH11_ORYSJ Formin-like protein 11 OS=Oryza sativa subsp.
japonica GN=FH11 PE=2 SV=1
Length = 929

Score = 30.4 bits (67), Expect = 8.9
Identities = 19/65 (29%), Positives = 31/65 (47%)
Frame = -1

Query: 508 VYVRRLSRCRNSMKQKPKSPICCGQ*LVSCRLFEKTLTSPTNTGLRSQIGSFGASRQCFT 329
           +++  L    +++KQ   +     Q L + RLF K L +   TG R  +G+F    Q F
Sbjct: 629 LFMANLPEEASNVKQSFATLEVACQELRNSRLFMKLLEAVLKTGNRMNVGTFRGGAQAFR 688

Query: 328 LDCFI 314
           LD  +
Sbjct: 689 LDTLL 693
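
The `Frame` value of each alignment can be cross-checked against its query coordinates. The sketch below assumes the usual BLASTX convention: forward frames are numbered from the first nucleotide of the query, and reverse frames from the last nucleotide of the reverse complement. `blastx_frame` is a hypothetical helper name:

```python
def blastx_frame(q_start: int, q_end: int, query_len: int) -> int:
    """Infer the reading frame from 1-based query coordinates.

    q_start > q_end indicates a hit on the reverse strand.
    """
    if q_start <= q_end:
        return (q_start - 1) % 3 + 1
    return -((query_len - q_start) % 3 + 1)


QUERY_LEN = 673  # from the Length field of this record

print(blastx_frame(195, 356, QUERY_LEN))  # CRUM2_HUMAN alignment, first block
print(blastx_frame(508, 329, QUERY_LEN))  # FH11_ORYSJ alignment, first block
```

Only the first block of an alignment is needed, since every block of one alignment shares the same frame.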


tr_hit_id B4FN90
Definition tr|B4FN90|B4FN90_MAIZE Putative uncharacterized protein OS=Zea mays
Align length 258
Score (bit) 52.0
E-value 3.0e-05
Report
BLASTX 2.2.19 [Nov-02-2008]



Query= DK950764|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0009_I23, 5'
(673 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

tr|B4FN90|B4FN90_MAIZE Putative uncharacterized protein OS=Zea m...    52   3e-05
tr|Q22NZ6|Q22NZ6_TETTH Insect antifreeze protein OS=Tetrahymena ...    37   1.1
tr|Q23C44|Q23C44_TETTH Putative uncharacterized protein OS=Tetra...    35   3.3
tr|Q23C50|Q23C50_TETTH Putative uncharacterized protein OS=Tetra...    35   4.3
tr|Q22NZ3|Q22NZ3_TETTH Putative uncharacterized protein OS=Tetra...    35   4.3
tr|Q0P4X8|Q0P4X8_XENTR Putative uncharacterized protein MGC14524...    35   5.6
tr|Q22CP5|Q22CP5_TETTH Putative uncharacterized protein OS=Tetra...    34   9.5

>tr|B4FN90|B4FN90_MAIZE Putative uncharacterized protein OS=Zea mays
PE=2 SV=1
Length = 431

Score = 52.0 bits (123), Expect = 3e-05
Identities = 56/258 (21%), Positives = 105/258 (40%), Gaps = 37/258 (14%)
Frame = +3

Query: 3 HKSIKYIASQGFCVSPMKIQEHGN------RMPGSLCSNDALLFYQHSCS---------S 137
H+ + +++G+CV I N ++P C ++ + F + C+ S
Sbjct: 165 HRYVATSSTKGYCVPDSWIDASKNLWQIRDKLP---CPDELIAFEKVICNVSTILTEKTS 221

Query: 138 AGPSSVHISENVLCFKAVDVVECPQCS----IKQDTSQSCSCPVSTSCLLPLLSSGEAFI 305
G + E C A DVV+ +C + +D SC+C CL+P+L+ G ++I
Sbjct: 222 IGSDQKEV-EGKYCLIAKDVVKLRRCGNGWHMTEDDESSCACFEDEHCLVPVLNPGFSWI 280

Query: 306 GMRIKQSSVKHCRDAPNDPI-------------CERNPVFVGDVRVFSNSLQLTNYWPQQ 446
+ + C + + CE + V+VGD+ + S++L+ Y P+
Sbjct: 281 EVSYARPYSLRCLQKGGNLLSSHTANSNFGQSPCEGSFVYVGDLSSAARSVRLSRYRPR- 339

Query: 447 IGDFGFCFMLF-----RHLESLLTYTFXXXXXXXXXXXXXVFYLDGEQIFQAGLVLNKSW 611
+ +LF LE+ L+ V++LDGE I + L +W
Sbjct: 340 -----WALLLFIADIPYILENGLSSLLHASAALAVINCLPVYFLDGEAILETTLSY-VAW 393

Query: 612 VPPKRHAKIMRTVLTLGT 665
P+ +I++ + T
Sbjct: 394 FTPRLQRRILKVCSVVWT 411


>tr|Q22NZ6|Q22NZ6_TETTH Insect antifreeze protein OS=Tetrahymena
thermophila SB210 GN=TTHERM_00417970 PE=4 SV=1
Length = 2162

Score = 37.0 bits (84), Expect = 1.1
Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 2/57 (3%)
Frame = +3

Query: 84 GSLCS--NDALLFYQHSCSSAGPSSVHISENVLCFKAVDVVECPQCSIKQDTSQSCS 248
GS C+ N L FY+ SC++A P + + +N +C K V C C C+
Sbjct: 930 GSSCTSCNSGLFFYEGSCTAAQPQNTYCGQNQVCTKC--TVNCSSCDSTLKVCNQCN 984


>tr|Q23C44|Q23C44_TETTH Putative uncharacterized protein
OS=Tetrahymena thermophila SB210 GN=TTHERM_00222410 PE=4
SV=1
Length = 2029

Score = 35.4 bits (80), Expect = 3.3
Identities = 19/64 (29%), Positives = 29/64 (45%), Gaps = 3/64 (4%)
Frame = +3

Query: 114 FYQHSCSSAGPSSVHISENVLCFKAVDVVECPQCSIKQDTSQSC---SCPVSTSCLLPLL 284
+YQ+ C S+ PSS + N +C+ C CS + QSC S +CL +
Sbjct: 339 YYQNQCLSSQPSSTYCDSNFICYSCDPY--CASCSNSSNNCQSCKANSYLYQNNCLAKIC 396

Query: 285 SSGE 296
S +
Sbjct: 397 LSNQ 400


>tr|Q23C50|Q23C50_TETTH Putative uncharacterized protein
OS=Tetrahymena thermophila SB210 GN=TTHERM_00222350 PE=4
SV=1
Length = 1034

Score = 35.0 bits (79), Expect = 4.3
Identities = 16/51 (31%), Positives = 28/51 (54%)
Frame = +3

Query: 114 FYQHSCSSAGPSSVHISENVLCFKAVDVVECPQCSIKQDTSQSCSCPVSTS 266
FY +SC S+ P+S + N +C K+ D + C QC D + C ++++
Sbjct: 439 FYNNSCLSSQPNSTYCDSNFIC-KSCD-LSCSQCVAPGDANSCTQCNINSA 487


>tr|Q22NZ3|Q22NZ3_TETTH Putative uncharacterized protein
OS=Tetrahymena thermophila SB210 GN=TTHERM_00418000 PE=4
SV=1
Length = 1341

Score = 35.0 bits (79), Expect = 4.3
Identities = 28/108 (25%), Positives = 44/108 (40%), Gaps = 7/108 (6%)
Frame = +3

Query: 84 GSLCS--NDALLFYQHSCSSAGPSSVHISENVLCFKAVDVVECPQCSIKQDTSQSCSCPV 257
GS C+ N L FYQ +C+S P + + N LC C +CS Q++ +C
Sbjct: 655 GSTCTVCNTGLYFYQGACTSQQPDNTYCDNNFLCQSCSS--NCSKCS-SQNSCTTCQSGF 711

Query: 258 -----STSCLLPLLSSGEAFIGMRIKQSSVKHCRDAPNDPICERNPVF 386
S + P + ++ + + QSS C D + C F
Sbjct: 712 YLFQGSCTSTQPNNTYCDSNLVCQKCQSSCSQCTDGSSCTACNSGQYF 759


>tr|Q0P4X8|Q0P4X8_XENTR Putative uncharacterized protein MGC145244
OS=Xenopus tropicalis GN=MGC145244 PE=2 SV=1
Length = 582

Score = 34.7 bits (78), Expect = 5.6
Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 5/62 (8%)
Frame = -1

Query: 316 ILMPINASPELNNGSKHEVLTGQ---LHDWLVSCLMLHC--GHSTTSTALKQRTFSEICT 152
I+MP NA P+L N + + L G+ LHD SCL +C G ST ++ Q+
Sbjct: 177 IVMPTNAEPDLCNSNMIKELMGEKIPLHDKQTSCLGRNCSVGGSTVTSVQSQKRKESSLP 236

Query: 151 LE 146
LE
Sbjct: 237 LE 238


>tr|Q22CP5|Q22CP5_TETTH Putative uncharacterized protein
OS=Tetrahymena thermophila SB210 GN=TTHERM_01026340 PE=4
SV=1
Length = 1632

Score = 33.9 bits (76), Expect = 9.5
Identities = 22/79 (27%), Positives = 35/79 (44%), Gaps = 2/79 (2%)
Frame = +3

Query: 33 GFCVSPMKIQEHGNRMPGSLCSNDALL--FYQHSCSSAGPSSVHISENVLCFKAVDVVEC 206
G CVSP GN + C+ ++ FY + C S+ PSS N +C K C
Sbjct: 513 GECVSP------GNATSCTSCNINSTYKYFYNNQCFSSKPSSTFCDSNFICQKCNQT--C 564

Query: 207 PQCSIKQDTSQSCSCPVST 263
+C + + SC +++
Sbjct: 565 GECISPGNATSCTSCDITS 583