SFC123
Library SF
(Link to library)
Clone ID SFC123
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16368-1
Original site URL
Representative seq. ID SFC123E
(Link to Original site)
Representative DNA sequence
>SFC123 (SFC123Q) /CSM/SF/SFC1-A/SFC123Q.Seq.d/
ATTCTATTGTGAAGAACAAGGTGGTAGTGCATGCTGCGTACCACATCATGACGGTTGTGG
TAATATTCAATGTCCATGGGGCCACTATTGTGTCAATGAACATGGTAAATGCAGATGTGT
ACCACACAGACCACCACCACGTCCACCAGTTGACCAATGCCGTAATCAACATTGTCCACA
TGGTTATTCATGTCGTGTCATTAAAGGTTGTGCCACTTGTGTCAGAGATGCCCGTCCACC
ACATAACCTTTGTCGTGGTTTTGGTTGTCCAGAAGGCTCCCATTGTGAAGTTCTTGAAAA
ACATCCAGTTTGTGTTAGAAATCATGTTCCACCACACCCACCACCACCACCACAAATTTG
CGGTAGTGTAAATTGTGGTCCAGGTTATATTTGTACAATTATCAATGGTCATCCAACTTG
TATTCGTGGTGATGGTTATTTATGTAATCAAACCCGTTGTCCACACGATTATCAATGTGA
AACCATTAGCACCAATATCGTGAAATGTTCACCAAAGAATGACGAATGTAAATGGTATCG
TTGTCCACCAGGTTCCAGCTGCTTCAATAGTAGAAACGGTCCACACTGTCTTGCCAATAA
TGTATTCCCACAACTTTGTAAAGTTACTCAATGTCCAACTGATTTCTCTTGTAAAATGAT
TAGAGGTAATCCAACTTGTATTAAAGCAAGACCACCGGTACCACCACCACATTGCTCAAC
TTGTGCAGAGCTATCATCAGCATGTAATCACGTTGGAATGATTTGTATTCAAGTACCAAG
CAATTGTACCAATACTAGATTCCCATGTTGCCCATCTCATCCAATTTGTATTCATCCATC
AACTACTGCTGCTTCAACCATTGCAACAACTGCATCAACTGTCGCAACTACAACCTCTGC
AACTACTGCAGGTACAACAACTGGTGGAACTACAACTGGTGGTTCAACTTCTGATAGTAG
TGCTGCTTCCTCAGCCGATAGTAGCGCTGCCTCCTCATCACCATCAAGCAGTGCTGCTTC
AAGTGCCGCTTCAAGTGAACCATCAAGTAGCGCCGCTTCAAGTAGTGCTCCATCCTCTGC
TTCAAGTAGCGCTCCATCCTCTGCCTCAAGTAGTGCTCCATCATCATCAGCTTCAAGTTC
AGCTGCATCATCAGCTGCAAGCTCAGAAAGCTCAGAAAGCTCATCA
sequence update 2001. 6. 9
Translated Amino Acid sequence
FYCEEQGGSACCVPHHDGCGNIQCPWGHYCVNEHGKCRCVPHRPPPRPPVDQCRNQHCPH
GYSCRVIKGCATCVRDARPPHNLCRGFGCPEGSHCEVLEKHPVCVRNHVPPHPPPPPQIC
GSVNCGPGYICTIINGHPTCIRGDGYLCNQTRCPHDYQCETISTNIVKCSPKNDECKWYR
CPPGSSCFNSRNGPHCLANNVFPQLCKVTQCPTDFSCKMIRGNPTCIKARPPVPPPHCST
CAELSSACNHVGMICIQVPSNCTNTRFPCCPSHPICIHPSTTAASTIATTASTVATTTSA
TTAGTTTGGTTTGGSTSDSSAASSADSSAASSSPSSSAASSAASSEPSSSAASSSAPSSA
SSSAPSSASSSAPSSSASSSAASSAASSESSESSS


Translated Amino Acid sequence (All Frames)
Frame A:
ill*rtrw*cmlrtts*rlw*ysmsmgpllcq*tw*mqmcttqttttsts*pmp*stlst
wlfmsch*rlchlcqrcpstt*plswfwlsrrlpl*ss*ktsslc*kscsttptttttnl
r*cklwsrlylynyqwssnlysw*wlfm*snplstrlsm*nh*hqyremftke*rm*mvs
lstrfqllq**krstlscq*cipttl*sysmsn*fll*nd*r*snly*skttgttttlln
lcraiism*srwndlysstkqlyqy*ipmlpissnlyssinyccfnhcnncincrnynlc
nycrynnwwnynwwfnf***ccflsr**rcllitikqccfkcrfk*tik*rrfk*csilc
fk*rsilclk*csiiisfkfsciiscklrklrkli


Frame B:
FYCEEQGGSACCVPHHDGCGNIQCPWGHYCVNEHGKCRCVPHRPPPRPPVDQCRNQHCPH
GYSCRVIKGCATCVRDARPPHNLCRGFGCPEGSHCEVLEKHPVCVRNHVPPHPPPPPQIC
GSVNCGPGYICTIINGHPTCIRGDGYLCNQTRCPHDYQCETISTNIVKCSPKNDECKWYR
CPPGSSCFNSRNGPHCLANNVFPQLCKVTQCPTDFSCKMIRGNPTCIKARPPVPPPHCST
CAELSSACNHVGMICIQVPSNCTNTRFPCCPSHPICIHPSTTAASTIATTASTVATTTSA
TTAGTTTGGTTTGGSTSDSSAASSADSSAASSSPSSSAASSAASSEPSSSAASSSAPSSA
SSSAPSSASSSAPSSSASSSAASSAASSESSESSS


Frame C:
sivknkvvvhaayhimtvvvifnvhgativsmnmvnadvyhtdhhhvhqltnavinivhm
vihvvslkvvplvsempvhhitfvvvlvvqkapivkflkniqfvleimfhhthhhhhkfa
vv*ivvqvifvqlsmviqlvfvvmviyvikpvvhtiinvkplapis*nvhqrmtnvngiv
vhqvpaasivetvhtvlpimyshnfvkllnvqlislvk*leviqlvlkqdhryhhhiaql
vqsyhqhvitle*fvfkyqaivpildshvahliqfvfihqllllqplqqlhqlsqlqplq
llqvqqlvelqlvvqllivvllpqpivalpphhhqavllqvplqvnhqvaplqvvlhpll
qvalhplpqvvlhhhqlqvqlhhqlqaqkaqkah


Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

SFC123 (SFC123Q) /CSM/SF/SFC1-A/SFC123Q.Seq.d/ 1651 0.0
SFF631 (SFF631Q) /CSM/SF/SFF6-B/SFF631Q.Seq.d/ 1643 0.0
CFH216 (CFH216Q) /CSM/CF/CFH2-A/CFH216Q.Seq.d/ 1635 0.0
SFH368 (SFH368Q) /CSM/SF/SFH3-C/SFH368Q.Seq.d/ 1629 0.0
SFE782 (SFE782Q) /CSM/SF/SFE7-D/SFE782Q.Seq.d/ 1550 0.0
CFI758 (CFI758Q) /CSM/CF/CFI7-C/CFI758Q.Seq.d/ 1489 0.0
SFL185 (SFL185Q) /CSM/SF/SFL1-D/SFL185Q.Seq.d/ 1487 0.0
SFK752 (SFK752Q) /CSM/SF/SFK7-C/SFK752Q.Seq.d/ 1487 0.0
SFD672 (SFD672Q) /CSM/SF/SFD6-C/SFD672Q.Seq.d/ 1487 0.0
CFI753 (CFI753Q) /CSM/CF/CFI7-C/CFI753Q.Seq.d/ 1487 0.0

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

D13973|D13973.1 Dictyostelium discoideum DNA for Dp87 protein, complete cds. 1643 0.0 5
BJ170760|BJ170760.1 Physcomitrella patens subsp. patens cDNA clone:pph26m16, 3' end, single read. 190 3e-44 1
AF279135|AF279135.1 Dictyostelium discoideum spore coat structural protein SP65 (cotE) gene, complete cds. 64 5e-06 1
AC099379|AC099379.5 Rattus norvegicus clone CH230-42H5, *** SEQUENCING IN PROGRESS ***, 5 unordered pieces. 42 0.32 7
C26876|C26876.1 Oryza sativa (japonica cultivar-group) cDNA, partial sequence (C50317_1A). 36 0.65 2
AK100095|AK100095.1 Oryza sativa (japonica cultivar-group) cDNA clone:J023004B12, full insert sequence. 36 3.8 2
BX284669|BX284669.3 Zebrafish DNA sequence *** SEQUENCING IN PROGRESS *** from clone CH211-280E18. 44 4.4 1
AC147507|AC147507.1 Zea mays clone ZMMBBc0304D20, WORKING DRAFT SEQUENCE, 2 ordered pieces. 44 4.4 1
CB645342|CB645342.1 ---. 44 4.4 1
AE017095|AE017095.1 Oryza sativa (japonica cultivar-group) chromosome 10, section 49 of 77 of the complete sequence. 44 4.4 1
dna update 2003.12.18
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

(Q04503) RecName: Full=Prespore protein Dp87; Flags: Precursor; 540 e-152
D13973_1(D13973|pid:none) Dictyostelium discoideum gene for Dp87... 540 e-152
(Q54QX2) RecName: Full=Probable spore coat protein DDB_G0283555;... 110 1e-22
AF245486_1(AF245486|pid:none) Polysphondylium pallidum spore coa... 92 4e-17
T22274(T22274)hypothetical protein F46B3.9 - Caenorhabditis eleg... 91 1e-16
AC117267_7(AC117267|pid:none) Dictyostelium discoideum chromosom... 80 1e-13
S07638(S07638;A60942;B60942) spore coat protein SP96 precursor -... 76 2e-12
(P14328) RecName: Full=Spore coat protein SP96; &AC117075_49(AC... 76 2e-12
(P15270) RecName: Full=Spore coat protein SP60; Flags: Precursor... 75 5e-12
(P15269) RecName: Full=Spore coat protein SP70; AltName: Full=Pr... 75 5e-12
protein update 2009. 7. 3
PSORT

psg: 0.75 gvh: 0.28 alm: 0.45 top: 0.53 tms: 0.00 mit: 0.19 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.00 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

44.0 %: nuclear
36.0 %: cytoplasmic
8.0 %: cytoskeletal
4.0 %: vacuolar
4.0 %: plasma membrane
4.0 %: vesicles of secretory system

>> prediction for SFC123 is nuc

5' end seq. ID SFC123F
5' end seq.
>SFC123F.Seq
ATTCTATTGTGAAGAACAAGGTGGTAGTGCATGCTGCGTACCACATCATGACGGTTGTGG
TAATATTCAATGTCCATGGGGCCACTATTGTGTCAATGAACATGGTAAATGCAGATGTGT
ACCACACAGACCACCACCACGTCCACCAGTTGACCAATGCCGTAATCAACATTGTCCACA
TGGTTATTCATGTCGTGTCATTAAAGGTTGTGCCACTTGTGTCAGAGATGCCCGTCCACC
ACATAACCTTTGTCGTGGTTTTGGTTGTCCAGAAGGCTCCCATTGTGAAGTTCTTGAAAA
ACATCCAGTTTGTGTTAGAAATCATGTTCCACCACACCCACCACCACCACCACAAATTTG
CGGTAGTGTAAATTGTGGTCCAGGTTATATTTGTACAATTATCAATGGTCATCCAACTTG
TATTCGTGGTGATGGTTATTTATGTAATCAAACCCGTTGTCCACACGATTATCAATGTGA
AACCATTAGCACCAATATCGTGAAATGTTCACCAAAGAATGACGAATGTAAATGGTATCG
TTGTCCACCAGGTTCCAGCTGCTTCAATAGTAGAAACGGTCCACACTGTCTTGCCAATAA
TGTATTCCCACAACTTTGTAAAGTTACTCAATGTCCA----------
Length of 5' end seq. 637
3' end seq. ID SFC123Z
3' end seq.
>SFC123Z.Seq
----------TTTATGTAATCAAACCCGTTGTCCACACGATTATCAATGTGAAACCATTA
GCACCAATATCGTGAAATGTTCACCAAAGAATGACGAATGTAAATGGTATCGTTGTCCAC
CAGGTTCCAGCTGCTTCAATAGTAGAAACGGTCCACACTGTCTTGCCAATAATGTATTCC
CACAACTTTGTAAAGTTACTCAATGTCCAACTGATTTCTCTTGTAAAATGATTAGAGGTA
ATCCAACTTGTATTAAAGCAAGACCACCGGTACCACCACCACATTGCTCAACTTGTGCAG
AGCTATCATCAGCATGTAATCACGTTGGAATGATTTGTATTCAAGTACCAAGCAATTGTA
CCAATACTAGATTCCCATGTTGCCCATCTCATCCAATTTGTATTCATCCATCAACTACTG
CTGCTTCAACCATTGCAACAACTGCATCAACTGTCGCAACTACAACCTCTGCAACTACTG
CAGGTACAACAACTGGTGGAACTACAACTGGTGGTTCAACTTCTGATAGTAGTGCTGCTT
CCTCAGCCGATAGTAGCGCTGCCTCCTCATCACCATCAAGCAGTGCTGCTTCAAGTGCCG
CTTCAAGTGAACCATCAAGTAGCGCCGCTTCAAGTAGTGCTCCATCCTCTGCTTCAAGTA
GCGCTCCATCCTCTGCCTCAAGTAGTGCTCCATCATCATCAGCTTCAAGTTCAGCTGCAT
CATCAGCTGCAAGCTCAGAAAGCTCAGAAAGCTCATCA
Length of 3' end seq. 748
Connected seq. ID SFC123P
Connected seq.
>SFC123P.Seq
ATTCTATTGTGAAGAACAAGGTGGTAGTGCATGCTGCGTACCACATCATGACGGTTGTGG
TAATATTCAATGTCCATGGGGCCACTATTGTGTCAATGAACATGGTAAATGCAGATGTGT
ACCACACAGACCACCACCACGTCCACCAGTTGACCAATGCCGTAATCAACATTGTCCACA
TGGTTATTCATGTCGTGTCATTAAAGGTTGTGCCACTTGTGTCAGAGATGCCCGTCCACC
ACATAACCTTTGTCGTGGTTTTGGTTGTCCAGAAGGCTCCCATTGTGAAGTTCTTGAAAA
ACATCCAGTTTGTGTTAGAAATCATGTTCCACCACACCCACCACCACCACCACAAATTTG
CGGTAGTGTAAATTGTGGTCCAGGTTATATTTGTACAATTATCAATGGTCATCCAACTTG
TATTCGTGGTGATGGTTATTTATGTAATCAAACCCGTTGTCCACACGATTATCAATGTGA
AACCATTAGCACCAATATCGTGAAATGTTCACCAAAGAATGACGAATGTAAATGGTATCG
TTGTCCACCAGGTTCCAGCTGCTTCAATAGTAGAAACGGTCCACACTGTCTTGCCAATAA
TGTATTCCCACAACTTTGTAAAGTTACTCAATGTCCA----------TTTATGTAATCAA
ACCCGTTGTCCACACGATTATCAATGTGAAACCATTAGCACCAATATCGTGAAATGTTCA
CCAAAGAATGACGAATGTAAATGGTATCGTTGTCCACCAGGTTCCAGCTGCTTCAATAGT
AGAAACGGTCCACACTGTCTTGCCAATAATGTATTCCCACAACTTTGTAAAGTTACTCAA
TGTCCAACTGATTTCTCTTGTAAAATGATTAGAGGTAATCCAACTTGTATTAAAGCAAGA
CCACCGGTACCACCACCACATTGCTCAACTTGTGCAGAGCTATCATCAGCATGTAATCAC
GTTGGAATGATTTGTATTCAAGTACCAAGCAATTGTACCAATACTAGATTCCCATGTTGC
CCATCTCATCCAATTTGTATTCATCCATCAACTACTGCTGCTTCAACCATTGCAACAACT
GCATCAACTGTCGCAACTACAACCTCTGCAACTACTGCAGGTACAACAACTGGTGGAACT
ACAACTGGTGGTTCAACTTCTGATAGTAGTGCTGCTTCCTCAGCCGATAGTAGCGCTGCC
TCCTCATCACCATCAAGCAGTGCTGCTTCAAGTGCCGCTTCAAGTGAACCATCAAGTAGC
GCCGCTTCAAGTAGTGCTCCATCCTCTGCTTCAAGTAGCGCTCCATCCTCTGCCTCAAGT
AGTGCTCCATCATCATCAGCTTCAAGTTCAGCTGCATCATCAGCTGCAAGCTCAGAAAGC
TCAGAAAGCTCATCA
Length of connected seq. 1385
Full length Seq ID SFC123E
Full length Seq.
>SFC123E.Seq
ATTCTATTGTGAAGAACAAGGTGGTAGTGCATGCTGCGTACCACATCATGACGGTTGTGG
TAATATTCAATGTCCATGGGGCCACTATTGTGTCAATGAACATGGTAAATGCAGATGTGT
ACCACACAGACCACCACCACGTCCACCAGTTGACCAATGCCGTAATCAACATTGTCCACA
TGGTTATTCATGTCGTGTCATTAAAGGTTGTGCCACTTGTGTCAGAGATGCCCGTCCACC
ACATAACCTTTGTCGTGGTTTTGGTTGTCCAGAAGGCTCCCATTGTGAAGTTCTTGAAAA
ACATCCAGTTTGTGTTAGAAATCATGTTCCACCACACCCACCACCACCACCACAAATTTG
CGGTAGTGTAAATTGTGGTCCAGGTTATATTTGTACAATTATCAATGGTCATCCAACTTG
TATTCGTGGTGATGGTTATTTATGTAATCAAACCCGTTGTCCACACGATTATCAATGTGA
AACCATTAGCACCAATATCGTGAAATGTTCACCAAAGAATGACGAATGTAAATGGTATCG
TTGTCCACCAGGTTCCAGCTGCTTCAATAGTAGAAACGGTCCACACTGTCTTGCCAATAA
TGTATTCCCACAACTTTGTAAAGTTACTCAATGTCCAACTGATTTCTCTTGTAAAATGAT
TAGAGGTAATCCAACTTGTATTAAAGCAAGACCACCGGTACCACCACCACATTGCTCAAC
TTGTGCAGAGCTATCATCAGCATGTAATCACGTTGGAATGATTTGTATTCAAGTACCAAG
CAATTGTACCAATACTAGATTCCCATGTTGCCCATCTCATCCAATTTGTATTCATCCATC
AACTACTGCTGCTTCAACCATTGCAACAACTGCATCAACTGTCGCAACTACAACCTCTGC
AACTACTGCAGGTACAACAACTGGTGGAACTACAACTGGTGGTTCAACTTCTGATAGTAG
TGCTGCTTCCTCAGCCGATAGTAGCGCTGCCTCCTCATCACCATCAAGCAGTGCTGCTTC
AAGTGCCGCTTCAAGTGAACCATCAAGTAGCGCCGCTTCAAGTAGTGCTCCATCCTCTGC
TTCAAGTAGCGCTCCATCCTCTGCCTCAAGTAGTGCTCCATCATCATCAGCTTCAAGTTC
AGCTGCATCATCAGCTGCAAGCTCAGAAAGCTCAGAAAGCTCATCA
Length of full length seq. 1186