DK950751
Clone id TST38A01NGRL0009_I08
Library
Length 661
Definition Adiantum capillus-veneris mRNA. clone: TST38A01NGRL0009_I08. 5' end sequence.
Accession
Tissue type prothallia
Developmental stage gametophyte
Contig ID
Sequence
ATCGATATGCTGCACAGCTTCTAAACATCCATGACAACATGAACCCTGTCTTTTCGAGGA
GTTTTCTGGTTCAAGCGCTGAAGAAGTCCTCCTCTCACTCCAACCTCAGCTGCACCACTT
CTGTTGGGGTTTCTTCAAAAGCTTTCAATTATTCTACGTCTTGCCAGCCATCTCTGGAAG
GGGACATTTCGAACGGTGAGGTGCATGAGGAAAGTCAAATGCAACAGGCTGGTCTTGTTA
AAGAACATGTTGAAAGAAGCAGAGAGGAACTCATATCAAAGACACGGCTGGTCCAAAATA
TGCAGTCCCATCTCAAGCCAGATCAGGAAGAACATTTGAAGCAGTTTCCACTAAAAGTTG
TAGTTCAGAGTTGTGTGAAACGCTGGTTCAAGGAGCATTTGAGAAATGCAGAGAAGGGTG
ATTCAGGTGCTCAAGTTATGGTTGGACAGATGATCTCCCATGGGTATGGAGTTCCGAAGG
ATGTTCTGAAGGGAAAAGCATGGATTAAGAGAGGACACAGCAGACAAAGCCAGATTAAAG
AGCTGTGTGAAAAAAAGAGGTCGGAGGAAGAGGAACTCTATTTTATGTGATGATATTTTG
TGGAGCCTGTTACCTGCGTGCCAAACAAAGTTTCTCAGAGCCAAACATTCAATTGGTAAC
A
■■ Homology search results ■■
sp_hit_id Q6AY96
Definition sp|Q6AY96|T2FA_RAT General transcription factor IIF subunit 1 OS=Rattus norvegicus
Align length 41
Score (bit) 33.5
E-value 1.0
Report
BLASTX 2.2.19 [Nov-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= DK950751|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0009_I08, 5'
(661 letters)

Database: uniprot_sprot.fasta
412,525 sequences; 148,809,765 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

sp|Q6AY96|T2FA_RAT General transcription factor IIF subunit 1 OS... 33 1.0
sp|Q3THK3|T2FA_MOUSE General transcription factor IIF subunit 1 ... 33 1.0
sp|P35269|T2FA_HUMAN General transcription factor IIF subunit 1 ... 33 1.0
sp|Q5FWF7|FBX48_HUMAN F-box only protein 48 OS=Homo sapiens GN=F... 33 1.3
sp|Q80W03|TOX3_MOUSE TOX high mobility group box family member 3... 33 1.7
sp|Q5VV42|CDKAL_HUMAN CDK5 regulatory subunit-associated protein... 33 1.7
sp|Q5EA53|T2FA_BOVIN General transcription factor IIF subunit 1 ... 32 3.0
sp|A2SHT9|MDH_METPP Malate dehydrogenase OS=Methylibium petrolei... 32 3.9
sp|A6H751|FOLC_BOVIN Folylpolyglutamate synthase, mitochondrial ... 31 5.1
sp|Q95337|INVO_TUPGL Involucrin OS=Tupaia glis GN=IVL PE=2 SV=1 31 6.6

>sp|Q6AY96|T2FA_RAT General transcription factor IIF subunit 1
OS=Rattus norvegicus GN=Gtf2f1 PE=2 SV=1
Length = 508

Score = 33.5 bits (75), Expect = 1.0
Identities = 15/41 (36%), Positives = 26/41 (63%)
Frame = +3

Query: 147 NYSTSCQPSLEGDISNGEVHEESQMQQAGLVKEHVERSREE 269
N++T Q LE D+SN ++++E +M ++G E + REE
Sbjct: 38 NFATWNQARLERDLSNKKIYQEEEMPESGAGSEFNRKLREE 78
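
As a cross-check on the Frame = +3 coordinates reported above, the following Python sketch translates query nucleotides 147..269 with the standard genetic code. It is an illustration only and not part of the BLASTX output. The names QUERY_PREFIX and translate_region are hypothetical, and QUERY_PREFIX holds just the first 300 nucleotides of the sequence listed in this record.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only the first 300 nt of DK950751 are needed for this region.
QUERY_PREFIX = (
    "ATCGATATGCTGCACAGCTTCTAAACATCCATGACAACATGAACCCTGTCTTTTCGAGGA"
    "GTTTTCTGGTTCAAGCGCTGAAGAAGTCCTCCTCTCACTCCAACCTCAGCTGCACCACTT"
    "CTGTTGGGGTTTCTTCAAAAGCTTTCAATTATTCTACGTCTTGCCAGCCATCTCTGGAAG"
    "GGGACATTTCGAACGGTGAGGTGCATGAGGAAAGTCAAATGCAACAGGCTGGTCTTGTTA"
    "AAGAACATGTTGAAAGAAGCAGAGAGGAACTCATATCAAAGACACGGCTGGTCCAAAATA"
)


def translate_region(seq, start, end):
    """Translate 1-based, inclusive forward-strand coordinates codon by codon."""
    region = seq[start - 1:end]
    # Unknown or incomplete codons are reported as 'X'.
    return "".join(
        CODON_TABLE.get(region[i:i + 3], "X")
        for i in range(0, len(region) - 2, 3)
    )


def forward_frame(start):
    """Forward reading frame (1, 2 or 3) implied by a 1-based start position."""
    return (start - 1) % 3 + 1


if __name__ == "__main__":
    print(forward_frame(147))
    print(translate_region(QUERY_PREFIX, 147, 269))
```

The translated string can be compared directly with the Query line of the alignment. A start position of 147 corresponds to forward frame 3 because (147 - 1) mod 3 = 2.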


>sp|Q3THK3|T2FA_MOUSE General transcription factor IIF subunit 1
OS=Mus musculus GN=Gtf2f1 PE=1 SV=1
Length = 508

Score = 33.5 bits (75), Expect = 1.0
Identities = 15/41 (36%), Positives = 26/41 (63%)
Frame = +3

Query: 147 NYSTSCQPSLEGDISNGEVHEESQMQQAGLVKEHVERSREE 269
N++T Q LE D+SN ++++E +M ++G E + REE
Sbjct: 38 NFATWNQARLERDLSNKKIYQEEEMPESGAGSEFNRKLREE 78


>sp|P35269|T2FA_HUMAN General transcription factor IIF subunit 1
OS=Homo sapiens GN=GTF2F1 PE=1 SV=2
Length = 517

Score = 33.5 bits (75), Expect = 1.0
Identities = 15/41 (36%), Positives = 26/41 (63%)
Frame = +3

Query: 147 NYSTSCQPSLEGDISNGEVHEESQMQQAGLVKEHVERSREE 269
N++T Q LE D+SN ++++E +M ++G E + REE
Sbjct: 38 NFATWNQARLERDLSNKKIYQEEEMPESGAGSEFNRKLREE 78


>sp|Q5FWF7|FBX48_HUMAN F-box only protein 48 OS=Homo sapiens
GN=FBXO48 PE=2 SV=1
Length = 155

Score = 33.1 bits (74), Expect = 1.3
Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 9/63 (14%)
Frame = +1

Query: 154 LRLASHLWKGTFRTVR--CMRKVKCN-------RLVLLKNMLKEAERNSYQRHGWSKICS 306
+R + LWK TVR C R++ + R++LL+N K ++ + +S ICS
Sbjct: 69 IRNSDSLWKPHCMTVRAVCRREIDDDLESGYSWRVILLRNYQKSKVKHEWLSGRYSNICS 128

Query: 307 PIS 315
PIS
Sbjct: 129 PIS 131


>sp|Q80W03|TOX3_MOUSE TOX high mobility group box family member 3
OS=Mus musculus GN=Tox3 PE=2 SV=1
Length = 575

Score = 32.7 bits (73), Expect = 1.7
Identities = 20/91 (21%), Positives = 45/91 (49%), Gaps = 7/91 (7%)
Frame = +3

Query: 153 STSCQPSLEGDISNGEVHEESQ-------MQQAGLVKEHVERSREELISKTRLVQNMQSH 311
ST PS++ ++ ++ Q MQQ L + + + ++ + + +MQ H
Sbjct: 428 STQVSPSVQTQQHQMQLQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQHFQHHMQQH 487

Query: 312 LKPDQEEHLKQFPLKVVVQSCVKRWFKEHLR 404
L+ Q++HL+Q + Q +++ ++HL+
Sbjct: 488 LQQQQQQHLQQ----QLSQQQLQQQLQQHLQ 514


>sp|Q5VV42|CDKAL_HUMAN CDK5 regulatory subunit-associated protein
1-like 1 OS=Homo sapiens GN=CDKAL1 PE=1 SV=1
Length = 579

Score = 32.7 bits (73), Expect = 1.7
Identities = 25/118 (21%), Positives = 52/118 (44%)
Frame = +3

Query: 153 STSCQPSLEGDISNGEVHEESQMQQAGLVKEHVERSREELISKTRLVQNMQSHLKPDQEE 332
S SC L+ DI + E+S+ Q V R++++ K R +N Q +L+ ++
Sbjct: 3 SASCDTLLD-DIEDIVSQEDSKPQDRHFV-------RKDVVPKVRR-RNTQKYLQEEENS 53

Query: 333 HLKQFPLKVVVQSCVKRWFKEHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLKGKAWI 506
+ + + ++ W H +S + M GQ+ ++GY + ++ W+
Sbjct: 54 PPSDSTIPGIQKIWIRTWGCSH------NNSDGEYMAGQLAAYGYKITENASDADLWL 105


>sp|Q5EA53|T2FA_BOVIN General transcription factor IIF subunit 1
OS=Bos taurus GN=GTF2F1 PE=2 SV=1
Length = 517

Score = 32.0 bits (71), Expect = 3.0
Identities = 15/41 (36%), Positives = 25/41 (60%)
Frame = +3

Query: 147 NYSTSCQPSLEGDISNGEVHEESQMQQAGLVKEHVERSREE 269
N +T Q LE D+SN ++++E +M ++G E + REE
Sbjct: 38 NLTTWNQARLERDLSNKKIYQEEEMPESGAGSEFNRKLREE 78


>sp|A2SHT9|MDH_METPP Malate dehydrogenase OS=Methylibium
petroleiphilum (strain PM1) GN=mdh PE=3 SV=1
Length = 328

Score = 31.6 bits (70), Expect = 3.9
Identities = 14/34 (41%), Positives = 22/34 (64%)
Frame = +3

Query: 393 EHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLKG 494
+H+R+ G +GA V +G + YG+PKDV+ G
Sbjct: 251 DHMRDWALGTNGAWVTMGVPSNGEYGIPKDVMFG 284


>sp|A6H751|FOLC_BOVIN Folylpolyglutamate synthase, mitochondrial
OS=Bos taurus GN=FPGS PE=2 SV=1
Length = 585

Score = 31.2 bits (69), Expect = 5.1
Identities = 26/94 (27%), Positives = 41/94 (43%), Gaps = 1/94 (1%)
Frame = +3

Query: 165 QPSLEGDISNGEVHEESQMQQAGLVKEHVE-RSREELISKTRLVQNMQSHLKPDQEEHLK 341
+PSL G + V + + Q GL H E R +L+ + L +
Sbjct: 330 RPSLPGQLPLAPVFQPTPRMQQGL--RHTEWPGRTQLLRRGPLTWYLDGAHTTSS----- 382

Query: 342 QFPLKVVVQSCVKRWFKEHLRNAEKGDSGAQVMV 443
+Q+CV RWF++ L E+ DSG++V V
Sbjct: 383 -------MQACV-RWFRQALHRCERPDSGSEVRV 408


>sp|Q95337|INVO_TUPGL Involucrin OS=Tupaia glis GN=IVL PE=2 SV=1
Length = 400

Score = 30.8 bits (68), Expect = 6.6
Identities = 20/60 (33%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Frame = +3

Query: 204 HEESQMQQAGLVKEHVERSREELISKTRLVQNMQSHLKPDQEEHLK-QFPLKVVVQSCVK 380
H+ESQ Q+ L ++ E ++EL + Q + HLK Q+E K + L+ Q C K
Sbjct: 158 HQESQEQKLHLEQQQQEPQKQELHLGQQEAQEEELHLKQQQQECQKEEVHLEQQQQECQK 217


tr_hit_id A9P1L8
Definition tr|A9P1L8|A9P1L8_PICSI Putative uncharacterized protein OS=Picea sitchensis
Align length 104
Score (bit) 81.3
E-value 5.0e-14
Report
BLASTX 2.2.19 [Nov-02-2008]



Query= DK950751|Adiantum capillus-veneris mRNA, clone:
TST38A01NGRL0009_I08, 5'
(661 letters)

Database: uniprot_trembl.fasta
7,341,751 sequences; 2,391,615,440 total letters

Searching..................................................done



Score E
Sequences producing significant alignments: (bits) Value

tr|A9P1L8|A9P1L8_PICSI Putative uncharacterized protein OS=Picea... 81 5e-14
tr|Q9FLB8|Q9FLB8_ARATH Genomic DNA, chromosome 5, TAC clone:K18I... 75 4e-12
tr|Q84W16|Q84W16_ARATH Putative uncharacterized protein At5g0536... 75 4e-12
tr|Q3E9L2|Q3E9L2_ARATH Uncharacterized protein At5g05360.2 (At5g... 75 4e-12
tr|A7Q3R7|A7Q3R7_VITVI Chromosome chr13 scaffold_48, whole genom... 74 6e-12
tr|A7QFH8|A7QFH8_VITVI Chromosome chr8 scaffold_88, whole genome... 74 8e-12
tr|O80906|O80906_ARATH Expressed protein (At2g38450) OS=Arabidop... 67 1e-09
tr|Q5W6N1|Q5W6N1_ORYSJ Os05g0342900 protein OS=Oryza sativa subs... 67 1e-09
tr|A2Y3E1|A2Y3E1_ORYSI Putative uncharacterized protein OS=Oryza... 67 1e-09
tr|A8IVI2|A8IVI2_CHLRE Predicted protein OS=Chlamydomonas reinha... 65 4e-09
tr|Q8LEQ5|Q8LEQ5_ARATH Putative uncharacterized protein OS=Arabi... 65 5e-09
tr|B6SI80|B6SI80_MAIZE Putative uncharacterized protein OS=Zea m... 64 8e-09
tr|B6TG32|B6TG32_MAIZE Putative uncharacterized protein OS=Zea m... 63 2e-08
tr|B6T1M8|B6T1M8_MAIZE Putative uncharacterized protein OS=Zea m... 61 7e-08
tr|B6TNI0|B6TNI0_MAIZE Putative uncharacterized protein OS=Zea m... 59 2e-07
tr|Q10MR3|Q10MR3_ORYSJ Os03g0298600 protein OS=Oryza sativa subs... 59 3e-07
tr|A3AH17|A3AH17_ORYSJ Putative uncharacterized protein OS=Oryza... 59 3e-07
tr|A9THX4|A9THX4_PHYPA Predicted protein OS=Physcomitrella paten... 56 2e-06
tr|A5Z8T5|A5Z8T5_9FIRM Putative uncharacterized protein OS=Eubac... 46 0.002
tr|A4SB24|A4SB24_OSTLU Predicted protein OS=Ostreococcus lucimar... 44 0.007
tr|B4U9J8|B4U9J8_HYDS0 Sel1 domain protein repeat-containing pro... 44 0.012
tr|A8HVT5|A8HVT5_CHLRE Predicted protein OS=Chlamydomonas reinha... 43 0.020
tr|A2UVT6|A2UVT6_SHEPU Sel1-like repeat OS=Shewanella putrefacie... 41 0.075
tr|Q89HG7|Q89HG7_BRAJA Bll6024 protein OS=Bradyrhizobium japonic... 40 0.097
tr|B3ESE2|B3ESE2_AMOA5 Putative uncharacterized protein OS=Amoeb... 40 0.097
tr|Q1E2G0|Q1E2G0_COCIM Putative uncharacterized protein OS=Cocci... 40 0.097
tr|B8M2Y1|B8M2Y1_9EURO Ubiquitin-protein ligase Sel1/Ubx2, putat... 40 0.097
tr|Q3J7N8|Q3J7N8_NITOC Sel1-like repeat protein OS=Nitrosococcus... 40 0.13
tr|B8IML9|B8IML9_METNO Sel1 domain protein repeat-containing pro... 40 0.13
tr|B6C4U5|B6C4U5_9GAMM Sel1 repeat family OS=Nitrosococcus ocean... 40 0.13
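
The one-line hit summaries above can be filtered by E-value with a short script. The sketch below is a minimal, hedged example in Python: it assumes each summary line ends with two whitespace-separated fields (bit score, then E-value) and begins with a db|accession|entry-name identifier, as the lines in this report do. The function names and the 1e-5 cutoff are hypothetical choices for illustration, not settings taken from this record.

```python
def parse_hit_line(line):
    """Split one BLAST summary line into identifier parts, score and E-value."""
    # The last two whitespace-separated fields are the bit score and E-value.
    head, bits, evalue = line.rsplit(None, 2)
    ident, _, description = head.partition(" ")
    db, accession, entry_name = ident.split("|")
    return {
        "db": db,
        "accession": accession,
        "entry_name": entry_name,
        "description": description.strip(),
        "bits": float(bits),
        "evalue": float(evalue),
    }


def filter_hits(lines, max_evalue=1e-5):
    """Keep hits whose E-value is at or below max_evalue (assumed cutoff)."""
    hits = [parse_hit_line(line) for line in lines if line.strip()]
    return [hit for hit in hits if hit["evalue"] <= max_evalue]


# Three summary lines copied from the TrEMBL hit list above.
SAMPLE_LINES = [
    "tr|A9P1L8|A9P1L8_PICSI Putative uncharacterized protein OS=Picea... 81 5e-14",
    "tr|A9THX4|A9THX4_PHYPA Predicted protein OS=Physcomitrella paten... 56 2e-06",
    "tr|A5Z8T5|A5Z8T5_9FIRM Putative uncharacterized protein OS=Eubac... 46 0.002",
]

if __name__ == "__main__":
    for hit in filter_hits(SAMPLE_LINES):
        print(hit["accession"], hit["bits"], hit["evalue"])
```

With this cutoff the plant hits near the top of the list are retained and the weaker hits further down are dropped. The same parser applies to the Swiss-Prot summary lines, where every E-value is 1.0 or higher and no hit would pass.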

>tr|A9P1L8|A9P1L8_PICSI Putative uncharacterized protein OS=Picea
sitchensis PE=2 SV=1
Length = 144

Score = 81.3 bits (199), Expect = 5e-14
Identities = 38/104 (36%), Positives = 61/104 (58%)
Frame = +3

Query: 225 QAGLVKEHVERSREELISKTRLVQNMQSHLKPDQEEHLKQFPLKVVVQSCVKRWFKEHLR 404
+AG V E V + ++ +V ++K E + PLK VV C +RWF++ L+
Sbjct: 16 KAGNVAETVVKVKKTAKFVDNIVSTSSINMKMQSGEEQNRLPLKEVVADCARRWFQDSLK 75

Query: 405 NAEKGDSGAQVMVGQMISHGYGVPKDVLKGKAWIKRGHSRQSQI 536
A+ GD+G QV+VGQM GYGV +D+ +GKAW ++ +S++
Sbjct: 76 EAKAGDTGMQVLVGQMYCSGYGVHRDIQRGKAWFQKAAKSRSRV 119


>tr|Q9FLB8|Q9FLB8_ARATH Genomic DNA, chromosome 5, TAC clone:K18I23
OS=Arabidopsis thaliana PE=4 SV=1
Length = 179

Score = 75.1 bits (183), Expect = 4e-12
Identities = 32/65 (49%), Positives = 45/65 (69%)
Frame = +3

Query: 336 LKQFPLKVVVQSCVKRWFKEHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLKGKAWIKRG 515
+ + PL VV+ CV+RWF++ L+ A+ GD G QV+VGQM GYG+PKD KG+AWI +
Sbjct: 75 INRVPLAQVVEDCVRRWFQDTLKEAKSGDVGMQVLVGQMYCSGYGIPKDENKGRAWINKA 134

Query: 516 HSRQS 530
+S
Sbjct: 135 SRTRS 139


>tr|Q84W16|Q84W16_ARATH Putative uncharacterized protein At5g05360
(Fragment) OS=Arabidopsis thaliana GN=At5g05360 PE=2
SV=1
Length = 157

Score = 75.1 bits (183), Expect = 4e-12
Identities = 32/65 (49%), Positives = 45/65 (69%)
Frame = +3

Query: 336 LKQFPLKVVVQSCVKRWFKEHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLKGKAWIKRG 515
+ + PL VV+ CV+RWF++ L+ A+ GD G QV+VGQM GYG+PKD KG+AWI +
Sbjct: 69 INRVPLAQVVEDCVRRWFQDTLKEAKSGDVGMQVLVGQMYCSGYGIPKDENKGRAWINKA 128

Query: 516 HSRQS 530
+S
Sbjct: 129 SRTRS 133


>tr|Q3E9L2|Q3E9L2_ARATH Uncharacterized protein At5g05360.2
(At5g05360) OS=Arabidopsis thaliana GN=At5g05360 PE=2
SV=1
Length = 153

Score = 75.1 bits (183), Expect = 4e-12
Identities = 32/65 (49%), Positives = 45/65 (69%)
Frame = +3

Query: 336 LKQFPLKVVVQSCVKRWFKEHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLKGKAWIKRG 515
+ + PL VV+ CV+RWF++ L+ A+ GD G QV+VGQM GYG+PKD KG+AWI +
Sbjct: 75 INRVPLAQVVEDCVRRWFQDTLKEAKSGDVGMQVLVGQMYCSGYGIPKDENKGRAWINKA 134

Query: 516 HSRQS 530
+S
Sbjct: 135 SRTRS 139


>tr|A7Q3R7|A7Q3R7_VITVI Chromosome chr13 scaffold_48, whole genome
shotgun sequence OS=Vitis vinifera GN=GSVIVT00029389001
PE=4 SV=1
Length = 156

Score = 74.3 bits (181), Expect = 6e-12
Identities = 34/73 (46%), Positives = 46/73 (63%)
Frame = +3

Query: 312 LKPDQEEHLKQFPLKVVVQSCVKRWFKEHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLK 491
LK + E ++ PL VV C KRWF++ L+ A+ GD+ QV+VGQM GYGV +D K
Sbjct: 57 LKMESSEGQRRMPLAQVVSDCAKRWFQDTLKEAKAGDTTMQVLVGQMYFSGYGVSRDAQK 116

Query: 492 GKAWIKRGHSRQS 530
G+AWI R +S
Sbjct: 117 GRAWISRASKSRS 129


>tr|A7QFH8|A7QFH8_VITVI Chromosome chr8 scaffold_88, whole genome
shotgun sequence OS=Vitis vinifera GN=GSVIVT00037427001
PE=4 SV=1
Length = 100

Score = 73.9 bits (180), Expect = 8e-12
Identities = 34/72 (47%), Positives = 45/72 (62%)
Frame = +3

Query: 321 DQEEHLKQFPLKVVVQSCVKRWFKEHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLKGKA 500
D+ H + PL VV CVKRWF++ LR A+ GD QV+VGQM GYGVP+D KG+
Sbjct: 7 DRSHH--RVPLSEVVSDCVKRWFQDTLREAKSGDVSMQVLVGQMYYSGYGVPRDAQKGRV 64

Query: 501 WIKRGHSRQSQI 536
W+ R +S +
Sbjct: 65 WMTRASRTRSSV 76


>tr|O80906|O80906_ARATH Expressed protein (At2g38450) OS=Arabidopsis
thaliana GN=At2g38450 PE=2 SV=2
Length = 105

Score = 67.0 bits (162), Expect = 1e-09
Identities = 30/63 (47%), Positives = 39/63 (61%)
Frame = +3

Query: 348 PLKVVVQSCVKRWFKEHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLKGKAWIKRGHSRQ 527
PL VV C KRWFK+ L A+ G+ QV++GQM GYGVPKD KG+ WI + +
Sbjct: 23 PLSSVVSDCAKRWFKDTLEEAKAGNITMQVLLGQMYYSGYGVPKDARKGRLWITKASRVR 82

Query: 528 SQI 536
S +
Sbjct: 83 SSV 85


>tr|Q5W6N1|Q5W6N1_ORYSJ Os05g0342900 protein OS=Oryza sativa subsp.
japonica GN=P0015F11.6 PE=4 SV=1
Length = 103

Score = 66.6 bits (161), Expect = 1e-09
Identities = 28/55 (50%), Positives = 39/55 (70%)
Frame = +3

Query: 348 PLKVVVQSCVKRWFKEHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLKGKAWIKR 512
PL VV CV+RWF++ L+ A +GDS QV+V QM GYG+PK+ KG+AW ++
Sbjct: 15 PLSEVVGDCVQRWFQDALKEARRGDSAMQVLVAQMYHSGYGIPKNEHKGRAWAEK 69


>tr|A2Y3E1|A2Y3E1_ORYSI Putative uncharacterized protein OS=Oryza
sativa subsp. indica GN=OsI_19526 PE=4 SV=1
Length = 103

Score = 66.6 bits (161), Expect = 1e-09
Identities = 28/55 (50%), Positives = 39/55 (70%)
Frame = +3

Query: 348 PLKVVVQSCVKRWFKEHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLKGKAWIKR 512
PL VV CV+RWF++ L+ A +GDS QV+V QM GYG+PK+ KG+AW ++
Sbjct: 15 PLSEVVGDCVQRWFQDALKEARRGDSAMQVLVAQMYHSGYGIPKNEHKGRAWAEK 69


>tr|A8IVI2|A8IVI2_CHLRE Predicted protein OS=Chlamydomonas
reinhardtii GN=CGL10 PE=4 SV=1
Length = 136

Score = 65.1 bits (157), Expect = 4e-09
Identities = 39/104 (37%), Positives = 52/104 (50%)
Frame = +3

Query: 213 SQMQQAGLVKEHVERSREELISKTRLVQNMQSHLKPDQEEHLKQFPLKVVVQSCVKRWFK 392
S +G K +RS EL R + +H P Q + + PLK VVQ VKRWF+
Sbjct: 26 SSNYSSGRPKSSNKRSAPEL---ERPGSSQAAHAHPSQMQP-EPIPLKYVVQEAVKRWFE 81

Query: 393 EHLRNAEKGDSGAQVMVGQMISHGYGVPKDVLKGKAWIKRGHSR 524
+ L A++GD Q +VG+M GYG KD K W + SR
Sbjct: 82 DTLLEAQRGDVKQQALVGEMYKEGYGCQKDARAAKEWSDKAASR 125