DK955556
Clone id TST39A01NGRL0023_G16
Library
Length 531
Definition Adiantum capillus-veneris mRNA. clone: TST39A01NGRL0023_G16. 5' end sequence.
Accession
Tissue type prothallia with plantlets
Developmental stage gametophytes with sporophytes
Contig ID -
Sequence
CAGTCACTTACACGCAGAGCCAGGGGAGAGAGCGGGGCTCCAAGATGCAATCAGAGGGCC
ATGCCGCAGGATTTCAAAACCATAGTCAGTTTCCTCGTGTCAACATGCCTATCTATACTG
AAGATTGGGAAACATTAAATGCCCTCGGCAGAGATCCCTTCAGAAGGGTATTTGTAGGGG
AGACTTTACAACAGATGGCAAGTCTGCAAATTGAACACAGAATAAAAAGACAAAACTGTG
GAAACAATGTGTCCCTACATCACGATTCGATATTTGGAAGCAGGGACCAGGTAGGGTCCA
GTTTTTCTGTTGCAAATGGTGAGGCTGAAGGGAGAATGAGGTACCCTCCCAGGACCATGT
CCCCTGGCTTTTATGGCTTGACAGGCGAAGGCAATAGAGCATTCACTGACAGAAATTTTC
CACCTGCTGAAATTGGAAGCAGCAGTTCCTTTGGGTTTCAATCCCACAATGGAATTCTTT
TGAGGAGTTCAGTTACAGAAAGTGAAATTGTACACTATCCAGGGAGCGACT
■■Homology search results ■■ -
sp_hit_id P12110
Definition sp|P12110|CO6A2_HUMAN Collagen alpha-2(VI) chain OS=Homo sapiens
Align length 45
Score (bit) 32.7
E-value 1.1
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK955556|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0023_G16, 5'
(531 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|P12110|CO6A2_HUMAN Collagen alpha-2(VI) chain OS=Homo sapiens... 33 1.1
sp|Q9EP78|CHST7_MOUSE Carbohydrate sulfotransferase 7 OS=Mus mus... 32 2.5
sp|Q02788|CO6A2_MOUSE Collagen alpha-2(VI) chain OS=Mus musculus... 31 3.2
sp|Q6XQG8|CHST7_RAT Carbohydrate sulfotransferase 7 OS=Rattus no... 31 4.3
sp|Q8BZH4|POGZ_MOUSE Pogo transposable element with ZNF domain O... 30 5.5
sp|Q7Z3K3|POGZ_HUMAN Pogo transposable element with ZNF domain O... 30 5.5
sp|Q9R159|ADA25_MOUSE ADAM 25 OS=Mus musculus GN=Adam25 PE=2 SV=1 30 5.5
sp|O94296|YOOC_SCHPO Probable phospholipid-transporting ATPase C... 30 7.2
sp|Q80U04|PJA2_MOUSE E3 ubiquitin-protein ligase Praja2 OS=Mus m... 30 9.4
sp|Q28084|CO4A3_BOVIN Collagen alpha-3(IV) chain (Fragment) OS=B... 30 9.4

>sp|P12110|CO6A2_HUMAN Collagen alpha-2(VI) chain OS=Homo sapiens
GN=COL6A2 PE=1 SV=4
Length = 1019

Score = 32.7 bits (73), Expect = 1.1
Identities = 18/45 (40%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Frame = +3

Query: 342 YPPRTMSPGFYG-LTGEGNRAFTDRNFPPAEIGSSSSFGFQSHNG 473
YP SPG G G+G+ R PP EIG+ S G+Q ++G
Sbjct: 356 YPGEAGSPGERGDQGGKGDPGRPGRRGPPGEIGAKGSKGYQGNSG 400


>sp|Q9EP78|CHST7_MOUSE Carbohydrate sulfotransferase 7 OS=Mus
musculus GN=Chst7 PE=2 SV=1
Length = 484

Score = 31.6 bits (70), Expect = 2.5
Identities = 15/34 (44%), Positives = 23/34 (67%)
Frame = +1

Query: 226 KDKTVETMCPYITIRYLEAGTR*GPVFLLQMVRL 327
+DK E+ CP +++R LEA R PV +++ VRL
Sbjct: 219 EDKACESTCPPVSLRALEAECRKYPVVVIKDVRL 252


>sp|Q02788|CO6A2_MOUSE Collagen alpha-2(VI) chain OS=Mus musculus
GN=Col6a2 PE=2 SV=3
Length = 1034

Score = 31.2 bits (69), Expect = 3.2
Identities = 20/67 (29%), Positives = 29/67 (43%), Gaps = 1/67 (1%)
Frame = +3

Query: 276 GSRDQVGSSFSVANGEAEGRMRYPPRTMSPGFYGLTG-EGNRAFTDRNFPPAEIGSSSSF 452
G ++G + + G YP SPG G G +G+ R PP + G S
Sbjct: 349 GKLGRIGPPGCKGDPGSRGPDGYPGEAGSPGERGDQGAKGDSGRPGRRGPPGDPGDKGSK 408

Query: 453 GFQSHNG 473
G+Q +NG
Sbjct: 409 GYQGNNG 415


>sp|Q6XQG8|CHST7_RAT Carbohydrate sulfotransferase 7 OS=Rattus
norvegicus GN=Chst7 PE=2 SV=1
Length = 485

Score = 30.8 bits (68), Expect = 4.3
Identities = 15/34 (44%), Positives = 22/34 (64%)
Frame = +1

Query: 226 KDKTVETMCPYITIRYLEAGTR*GPVFLLQMVRL 327
+DK E+ CP + +R LEA R PV +++ VRL
Sbjct: 220 EDKACESTCPPVPLRALEAECRKYPVVVIKDVRL 253


>sp|Q8BZH4|POGZ_MOUSE Pogo transposable element with ZNF domain
OS=Mus musculus GN=Pogz PE=1 SV=2
Length = 1409

Score = 30.4 bits (67), Expect = 5.5
Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Frame = -3

Query: 244 FPQ-FCLFILCSICRLAICCKVSPTNTLLKGSLPR 143
FP F ++ CS+CR + CC + N ++ +PR
Sbjct: 759 FPNHFPTYVHCSLCRYSTCCSRAYANHMINNHVPR 793


>sp|Q7Z3K3|POGZ_HUMAN Pogo transposable element with ZNF domain
OS=Homo sapiens GN=POGZ PE=1 SV=2
Length = 1410

Score = 30.4 bits (67), Expect = 5.5
Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 1/35 (2%)
Frame = -3

Query: 244 FPQ-FCLFILCSICRLAICCKVSPTNTLLKGSLPR 143
FP F ++ CS+CR + CC + N ++ +PR
Sbjct: 763 FPNHFPTYVHCSLCRYSTCCSRAYANHMINNHVPR 797


>sp|Q9R159|ADA25_MOUSE ADAM 25 OS=Mus musculus GN=Adam25 PE=2 SV=1
Length = 760

Score = 30.4 bits (67), Expect = 5.5
Identities = 13/36 (36%), Positives = 18/36 (50%)
Frame = +2

Query: 92 SSCQHAYLY*RLGNIKCPRQRSLQKGICRGDFTTDG 199
S+C ++Y + KC R+ KGI RG DG
Sbjct: 390 SNCSYSYYWATYATAKCMRKEKKSKGILRGKLCGDG 425


>sp|O94296|YOOC_SCHPO Probable phospholipid-transporting ATPase
C887.12 OS=Schizosaccharomyces pombe GN=SPBC887.12 PE=2
SV=1
Length = 1258

Score = 30.0 bits (66), Expect = 7.2
Identities = 12/22 (54%), Positives = 14/22 (63%)
Frame = -3

Query: 226 FILCSICRLAICCKVSPTNTLL 161
F L S+CR ICC+VSP L
Sbjct: 870 FELASLCRAVICCRVSPLQKAL 891


>sp|Q80U04|PJA2_MOUSE E3 ubiquitin-protein ligase Praja2 OS=Mus
musculus GN=Pja2 PE=1 SV=2
Length = 707

Score = 29.6 bits (65), Expect = 9.4
Identities = 26/98 (26%), Positives = 42/98 (42%), Gaps = 3/98 (3%)
Frame = +3

Query: 246 NVSLHHDSIFGSRDQVG---SSFSVANGEAEGRMRYPPRTMSPGFYGLTGEGNRAFTDRN 416
++ L +++ G R Q G + F + NGEAE +SP L GE + AF + +
Sbjct: 179 SLQLGAEAVEGGRHQKGLGRAVFELENGEAEIYA-----DLSPSVPSLNGEISEAFEELD 233

Query: 417 FPPAEIGSSSSFGFQSHNGILLRSSVTESEIVHYPGSD 530
P E +SS ++E+VH G +
Sbjct: 234 SAPLE-----------------KSSTADAELVHQNGQE 254


>sp|Q28084|CO4A3_BOVIN Collagen alpha-3(IV) chain (Fragment) OS=Bos
taurus GN=COL4A3 PE=1 SV=1
Length = 471

Score = 29.6 bits (65), Expect = 9.4
Identities = 24/88 (27%), Positives = 32/88 (36%), Gaps = 4/88 (4%)
Frame = +3

Query: 276 GSRDQVGSSFSVANGEAEGRMRYPPRTMSPGFYGLTG----EGNRAFTDRNFPPAEIGSS 443
G +G S + A G P PGFYG G +GN F PP + G
Sbjct: 80 GPPGAIGDMGSPGHPGAPGVPGQPGARGDPGFYGFPGMKGKKGNSGFPGPPGPPGQSGPK 139

Query: 444 SSFGFQSHNGILLRSSVTESEIVHYPGS 527
G + G + +I+ PGS
Sbjct: 140 GPPGVRGEPGTV--------KIISLPGS 159


tr_hit_id B3DSF4
Definition tr|B3DSF4|B3DSF4_BIFLD Superfamily I DNA and RNA helicase OS=Bifidobacterium longum (strain DJO10A)
Align length 79
Score (bit) 35.0
E-value 2.4
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK955556|Adiantum capillus-veneris mRNA, clone:
TST39A01NGRL0023_G16, 5'
(531 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|B3DSF4|B3DSF4_BIFLD Superfamily I DNA and RNA helicase OS=Bif... 35 2.4
tr|P95959|P95959_SULSO Orf c04020 protein (Esterase, tropinester... 35 3.1
tr|Q9PEA9|Q9PEA9_XYLFA Putative uncharacterized protein OS=Xylel... 34 4.1
tr|A3XIG9|A3XIG9_9FLAO Deoxyribodipyrimidine photolyase-class I ... 34 5.3
tr|B6JWI4|B6JWI4_SCHJP Meiotic coiled-coil protein OS=Schizosacc... 34 5.3
tr|B5DPK7|B5DPK7_DROPS GA23508 OS=Drosophila pseudoobscura pseud... 33 6.9
tr|B3M053|B3M053_DROAN GF17761 OS=Drosophila ananassae GN=GF1776... 33 6.9
tr|B2ESX3|B2ESX3_9BACT Peptidase M56 BlaR1 OS=bacterium Ellin514... 33 9.0
tr|Q9VMV5|Q9VMV5_DROME Viking OS=Drosophila melanogaster GN=vkg ... 33 9.0
tr|O18407|O18407_DROME Collagen type IV alpha 2 OS=Drosophila me... 33 9.0
tr|B6KSC2|B6KSC2_TOXGO Putative uncharacterized protein OS=Toxop... 33 9.0

>tr|B3DSF4|B3DSF4_BIFLD Superfamily I DNA and RNA helicase
OS=Bifidobacterium longum (strain DJO10A) GN=uvrD2 PE=4
SV=1
Length = 900

Score = 35.0 bits (79), Expect = 2.4
Identities = 25/79 (31%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Frame = +3

Query: 240 GNNVSLHHDSIFGSRDQVGSSFSVANGEAEGRMRYPPRTMSPGFY-GLTGEGNRAFTDRN 416
G+ S S +GSR + GSS+ G G Y R+ S G G + G+R+ + +
Sbjct: 730 GSRGSSGSGSSYGSRSRYGSSY----GSGSGSSSYGSRSSSYGSRSGSSSYGSRSRSGSS 785

Query: 417 FPPAEIGSSSSFGFQSHNG 473
+ + GS SS+G +S +G
Sbjct: 786 YGSSGSGSRSSYGSRSRSG 804


>tr|P95959|P95959_SULSO Orf c04020 protein (Esterase, tropinesterase
related protein) OS=Sulfolobus solfataricus GN=orf
c04020 PE=4 SV=2
Length = 231

Score = 34.7 bits (78), Expect = 3.1
Identities = 17/71 (23%), Positives = 34/71 (47%)
Frame = -2

Query: 440 ASNFSRWKISVSECSIAFACQAIKARGHGPGRVPHSPFSLTICNRKTGPYLVPASKYRIV 261
A ++ WK + + S+ A RGHG P+SP+++ + LV + V
Sbjct: 7 AGSYKSWKFVIPKLSLDNTVVAYDLRGHGRSSTPNSPYNIEDHSNDLRRLLVQLGIEKPV 66

Query: 260 M*GHIVSTVLS 228
+ GH + ++++
Sbjct: 67 LIGHSIGSLIA 77


>tr|Q9PEA9|Q9PEA9_XYLFA Putative uncharacterized protein OS=Xylella
fastidiosa GN=XF_1119 PE=4 SV=1
Length = 128

Score = 34.3 bits (77), Expect = 4.1
Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 1/57 (1%)
Frame = -1

Query: 381 SSHKSQGTWSWEGTSFSLQPHHLQQKNWTL-PGPCFQISNRDVGTHCFHSFVFLFCV 214
SS +++ TWSW G+S H K TL GP + + C H++ F CV
Sbjct: 50 SSIRTKATWSWIGSSSRSTHHRYVAKLTTLETGPINTKQSAHINCPCSHTYTFRECV 106


>tr|A3XIG9|A3XIG9_9FLAO Deoxyribodipyrimidine photolyase-class I
OS=Leeuwenhoekiella blandensis MED217 GN=MED217_06467
PE=3 SV=1
Length = 434

Score = 33.9 bits (76), Expect = 5.3
Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 1/54 (1%)
Frame = +3

Query: 105 MPIYTEDWETLNALGRDPFRRVFVGETLQQMA-SLQIEHRIKRQNCGNNVSLHH 263
+PI+ D E LN L D R F+ ETLQ+M LQ +H G++++L+H
Sbjct: 32 LPIFIFDKEILNELPEDDARVTFIFETLQKMRDELQEKH-------GSSIALYH 78


>tr|B6JWI4|B6JWI4_SCHJP Meiotic coiled-coil protein
OS=Schizosaccharomyces japonicus yFS275 GN=SJAG_00759
PE=4 SV=1
Length = 670

Score = 33.9 bits (76), Expect = 5.3
Identities = 31/105 (29%), Positives = 39/105 (37%), Gaps = 8/105 (7%)
Frame = +3

Query: 210 IEHRIKRQNCGNNVSLHHDSIFGSRDQVGSSFSVANGEAEGRMRYPPRTMSPGFYGLTGE 389
I R+NC N S RD S SV NG + PP SP LTG
Sbjct: 175 ISDHFGRENCFN------PSSSSGRDNKLSLVSVQNGFWQSSTHAPPVAESPLSNSLTGS 228

Query: 390 GNRAFTDRNFPPA--------EIGSSSSFGFQSHNGILLRSSVTE 500
F R P+ ++ S FG Q ++G RS+ E
Sbjct: 229 DTFPFRSRGAAPSVTAKPFSPQLVSPRGFGKQLYSGTTSRSNAFE 273


>tr|B5DPK7|B5DPK7_DROPS GA23508 OS=Drosophila pseudoobscura
pseudoobscura GN=GA23508 PE=4 SV=1
Length = 441

Score = 33.5 bits (75), Expect = 6.9
Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 5/96 (5%)
Frame = +3

Query: 93 PRVNMPIYTEDWETLNALGRDPFRRVFVGETLQQMASLQIEHRIKRQNCGNNVSLHHDSI 272
P ++P+ +D E + A GR P +G T QQ Q++H+ K+Q + H+
Sbjct: 286 PARDLPVTADDLEIIAATGRSPSSDSGLGMTQQQQQQQQLKHQ-KQQQHQHQQQQHYPPH 344

Query: 273 FGSRDQVG----SSFSVAN-GEAEGRMRYPPRTMSP 365
S +G +S S N G R +P + P
Sbjct: 345 HASLRHLGGININSISSGNVGSVPVRHHHPHEQLGP 380


>tr|B3M053|B3M053_DROAN GF17761 OS=Drosophila ananassae GN=GF17761
PE=4 SV=1
Length = 2044

Score = 33.5 bits (75), Expect = 6.9
Identities = 28/103 (27%), Positives = 44/103 (42%)
Frame = +3

Query: 165 RVFVGETLQQMASLQIEHRIKRQNCGNNVSLHHDSIFGSRDQVGSSFSVANGEAEGRMRY 344
++ + ASL +H NC N+ S F + VGSS S+ + +
Sbjct: 194 QIMSNSSTSSAASLLNQHN-NNSNCSNS------SNFSTASSVGSSISIGSNSSNSNSN- 245

Query: 345 PPRTMSPGFYGLTGEGNRAFTDRNFPPAEIGSSSSFGFQSHNG 473
+ S G + L+ +G T R+ P + G SSS G H+G
Sbjct: 246 DSNSSSGGTHSLSFKG----TGRHVPGSVNGVSSSVGASCHSG 284


>tr|B2ESX3|B2ESX3_9BACT Peptidase M56 BlaR1 OS=bacterium Ellin514
GN=CflavDRAFT_5522 PE=4 SV=1
Length = 771

Score = 33.1 bits (74), Expect = 9.0
Identities = 22/75 (29%), Positives = 33/75 (44%)
Frame = -2

Query: 509 NFTFCN*TPQKNSIVGLKPKGTAASNFSRWKISVSECSIAFACQAIKARGHGPGRVPHSP 330
NFT N + + +SI+ + P G ++ R + S C A G+ PGR+ P
Sbjct: 413 NFTLINRSQKSSSIIYIAPNGLCTNSLPRGTLFFSAC----------AEGYAPGRI--GP 460

Query: 329 FSLTICNRKTGPYLV 285
+ NR G LV
Sbjct: 461 INTRSSNRVEGLELV 475


>tr|Q9VMV5|Q9VMV5_DROME Viking OS=Drosophila melanogaster GN=vkg PE=2
SV=1
Length = 1940

Score = 33.1 bits (74), Expect = 9.0
Identities = 23/70 (32%), Positives = 32/70 (45%), Gaps = 3/70 (4%)
Frame = +3

Query: 276 GSRDQVGSSFSVANGEA--EGRMRYPPRTMSPGFYGLTGE-GNRAFTDRNFPPAEIGSSS 446
G R VG S +G A +G + P PG +GL G+ G+R + P E G+
Sbjct: 1044 GQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQGRPGEPGAKG 1103

Query: 447 SFGFQSHNGI 476
G+ NGI
Sbjct: 1104 LGGYPGRNGI 1113


>tr|O18407|O18407_DROME Collagen type IV alpha 2 OS=Drosophila
melanogaster GN=vkg PE=1 SV=1
Length = 1761

Score = 33.1 bits (74), Expect = 9.0
Identities = 23/70 (32%), Positives = 32/70 (45%), Gaps = 3/70 (4%)
Frame = +3

Query: 276 GSRDQVGSSFSVANGEA--EGRMRYPPRTMSPGFYGLTGE-GNRAFTDRNFPPAEIGSSS 446
G R VG S +G A +G + P PG +GL G+ G+R + P E G+
Sbjct: 1042 GQRGPVGDSQPALDGVAGRKGEVGSPGPNGLPGRHGLKGQRGDRGLPGQQGRPGEPGAKG 1101

Query: 447 SFGFQSHNGI 476
G+ NGI
Sbjct: 1102 LGGYPGRNGI 1111