FC-IC1110
Library FC-IC
(Link to library)
Clone ID FC-IC1110
Atlas ID -
NBRP ID -
dictyBase ID -
Link to Contig Contig-U16527-1
Original site URL
Representative seq. ID FC-IC1110P
(Link to Original site)
Representative DNA sequence
>FC-IC1110 (FC-IC1110Q) /CSM/FC-IC/FC-IC1110Q.Seq.d/
ACCATATTTAATAAGGTAACGAGATTCAAGTGCTGCACTACTAGCAAATGCCCAACATGA
TCCGCATTGTCCTTGATCTCTTATTGGAGTTTGGTATGAAGTCCAATCTACTGTAAGAGT
TGATGTTGGGGCTGGTGTTGTTGGGGCTGGTGNTGNTGNTTTTGAAGTAATTNGGAGCTG
GNGTGGTTGGTTTTGAGTGGTTGAGNCTGGTTTXXXXXXXXXXGATNGTGNTGTTGGGTN
NGGAGTANTTGTNCCTGGTGTNGATCGTTTTGNTAGTGGTTGGTAGATGGTGNTGTTGGT
TTATGGGNNGGTTGGAATCTGGTGNTGTATGGTTTTGCAGCAGTTGGAGCTGGTGTGTGT
TGGTTTTGGTGTGGTTGGAGCTGGTGCAGTTGNAGNTNNGGNTGTTGTTTTTGGAGTATA
AGCAAGAACACCCTTAAATCCACCNGCTGGGTAATATGAACATGAAACTTCAGATTTATC
TGGNCATATTGTCTTTGCGCAACCATATGAAGT
sequence update 2001.11. 5
Translated Amino Acid sequence
hi**gneiqvlhy*qmpnmirivldlllefgmksnll*elmlglvllglvxxxlk*XGAG
VVGFEWLXLV---

---XVXLGXEXLXLVXIVLXVVGRWXCWFMGXLESGXVWFCSSWSWCVLVLVWLELVQLX
XXXLFLEYKQEHP*ihxlgnmnmklqiylxilslrnhmk


Translated Amino Acid sequence (All Frames)
Frame A:
tifnkvtrfkccttskcpt*salslisywslv*spiycks*cwgwccwgwxxxf*snxel
xwlvlsg*xwf---

---dxxvgxgvxvpgvdrfxsgw*mvxlvygxvgiwxcmvlqqlelvcvgfgvvgagavx
xxxvvfgv*artplnppag*yehetsdlsghivfaqpye

Frame B:
pylir*rdssaallanaqhdphcp*sligvwyevqstvrvdvgagvvgagxxxfevixsw
xgwf*vvexg---

---XVXLGXEXLXLVXIVLXVVGRWXCWFMGXLESGXVWFCSSWSWCVLVLVWLELVQLX
XXXLFLEYKQEHP*ihxlgnmnmklqiylxilslrnhmk

Frame C:
hi**gneiqvlhy*qmpnmirivldlllefgmksnll*elmlglvllglvxxxlk*XGAG
VVGFEWLXLV---

---xxcwvxsxcxwcxsfx*wlvdgxvglwxgwnlvxygfaavgagvcwfwcgwswcsxx
xgccfwsiskntlkstxwvi*t*nfrfiwxyclcati*s

Homology vs CSM-cDNA

Score E
Sequences producing significant alignments: (bits) Value

FC-IC1110 (FC-IC1110Q) /CSM/FC-IC/FC-IC1110Q.Seq.d/ 809 0.0
FC-IC1760 (FC-IC1760Q) /CSM/FC-IC/FC-IC1760Q.Seq.d/ 333 2e-90
FC-IC1748 (FC-IC1748Q) /CSM/FC-IC/FC-IC1748Q.Seq.d/ 333 2e-90
FC-IC1665 (FC-IC1665Q) /CSM/FC-IC/FC-IC1665Q.Seq.d/ 333 2e-90
FC-IC1648 (FC-IC1648Q) /CSM/FC-IC/FC-IC1648Q.Seq.d/ 333 2e-90
FC-IC1642 (FC-IC1642Q) /CSM/FC-IC/FC-IC1642Q.Seq.d/ 333 2e-90
FC-IC1631 (FC-IC1631Q) /CSM/FC-IC/FC-IC1631Q.Seq.d/ 333 2e-90
FC-IC1622 (FC-IC1622Q) /CSM/FC-IC/FC-IC1622Q.Seq.d/ 333 2e-90
FC-IC1583 (FC-IC1583Q) /CSM/FC-IC/FC-IC1583Q.Seq.d/ 333 2e-90
FC-IC1527 (FC-IC1527Q) /CSM/FC-IC/FC-IC1527Q.Seq.d/ 333 2e-90

own update 2004.12.25
Homology vs DNA

Score E
Sequences producing significant alignments: (bits) Value N

AB088483|AB088483.1 Dictyostelium discoideum gene for gamete and mating-type specific protein A, complete cds. 250 e-149 5
U28755|U28755.1 Entamoeba invadens cysteine proteinase gene, partial cds. 60 3e-05 1
BH165563|BH165563.1 ENTSB84TR Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 60 3e-05 1
AZ686636|AZ686636.1 ENTFZ42TR Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 60 3e-05 1
M94163|M94163.1 Entamoeba histolytica (clone gEh-CPp2) cysteine proteinase (CPp2) gene, complete cds. 60 3e-05 1
AZ535019|AZ535019.1 ENTBP24TR Entamoeba histolytica Sheared DNA Entamoeba histolytica genomic, DNA sequence. 58 1e-04 1
CD451153|CD451153.1 USDA-FP_103216 Adult Alate Brown Citrus Aphid Toxoptera citricida cDNA clone WHWTC-42_H04 5', mRNA sequence. 52 0.008 1
S58670|S58670.1 ACP2=cysteine proteinase [Entamoeba histolytica, HM-1, Genomic/mRNA, 933 nt]. 52 0.008 1
AV201503|AV201503.1 Caenorhabditis elegans cDNA clone:yk388h3 : 5' end, single read. 48 0.12 1
AV201502|AV201502.1 Caenorhabditis elegans cDNA clone:yk388h2 : 5' end, single read. 48 0.12 1
dna update 2003.12.20
Homology vs Protein

Score E
Sequences producing significant alignments: (bits) Value

AB088483_1(AB088483|pid:none) Dictyostelium discoideum gmsA gene... 141 8e-33
BC080004_1(BC080004|pid:none) Xenopus laevis MGC81823 protein, m... 54 2e-06
BT050087_1(BT050087|pid:none) Salmo salar clone ssal-evd-563-114... 54 2e-06
AY220615_1(AY220615|pid:none) Hydra vulgaris cathepsin L precurs... 54 2e-06
AB091670_1(AB091670|pid:none) Pandalus borealis PbCtL mRNA for c... 54 2e-06
FJ807676_1(FJ807676|pid:none) Dicentrarchus labrax cathepsin L m... 54 3e-06
KHCHL(S00081;A25654;A26818;B25654)cathepsin L (EC 3.4.22.15) - c... 53 4e-06
AY126275_28(AY126275|pid:none) Mamestra configurata nucleopolyhe... 53 5e-06
AX497240_1(AX497240|pid:none) Sequence 7 from Patent WO0229058. 53 5e-06
(Q5E998) RecName: Full=Cathepsin L2; EC=3.4.22.43; Flag... 52 6e-06
protein update 2009. 5.21
PSORT

psg: 0.75 gvh: 0.61 alm: 0.33 top: 0.53 tms: 0.00 mit: 0.09 mip: 0.00
nuc: 0.00 erl: 0.00 erm: 0.20 pox: 0.00 px2: 0.00 vac: 0.00 rnp: 0.00
act: 0.00 caa: 0.00 yqr: 0.00 tyr: 0.00 leu: 0.00 gpi: 0.00 myr: 0.00
dna: 0.00 rib: 0.00 bac: 0.00 m1a: 0.00 m1b: 0.00 m2 : 0.00 mNt: 0.00
m3a: 0.00 m3b: 0.00 m_ : 1.00

56.0 %: cytoplasmic
20.0 %: mitochondrial
16.0 %: nuclear
8.0 %: peroxisomal

>> prediction for FC-IC1110 is cyt

5' end seq. ID FC-IC1110F
5' end seq.
>FC-IC1110F.Seq
ACCATATTTAATAAGGTAACGAGATTCAAGTGCTGCACTACTAGCAAATGCCCAACATGA
TCCGCATTGTCCTTGATCTCTTATTGGAGTTTGGTATGAAGTCCAATCTACTGTAAGAGT
TGATGTTGGGGCTGGTGTTGTTGGGGCTGGTGNTGNTGNTTTTGAAGTAATTNGGAGCTG
GNGTGGTTGGTTTTGAGTGGTTGAGNCTGGTTTNNNNNNNNNN
Length of 5' end seq. 223
3' end seq. ID FC-IC1110Z
3' end seq.
>FC-IC1110Z.Seq
NNNNNNNNNNGATNGTGNTGTTGGGTNNGGAGTANTTGTNCCTGGTGTNGATCGTTTTGN
TAGTGGTTGGTAGATGGTGNTGTTGGTTTATGGGNNGGTTGGAATCTGGTGNTGTATGGT
TTTGCAGCAGTTGGAGCTGGTGTGTGTTGGTTTTGGTGTGGTTGGAGCTGGTGCAGTTGN
AGNTNNGGNTGTTGTTTTTGGAGTATAAGCAAGAACACCCTTAAATCCACCNGCTGGGTA
ATATGAACATGAAACTTCAGATTTATCTGGNCATATTGTCTTTGCGCAACCATATGAAGT
Length of 3' end seq. 300
Connected seq. ID FC-IC1110P
Connected seq.
>FC-IC1110P.Seq
ACCATATTTAATAAGGTAACGAGATTCAAGTGCTGCACTACTAGCAAATGCCCAACATGA
TCCGCATTGTCCTTGATCTCTTATTGGAGTTTGGTATGAAGTCCAATCTACTGTAAGAGT
TGATGTTGGGGCTGGTGTTGTTGGGGCTGGTGNTGNTGNTTTTGAAGTAATTNGGAGCTG
GNGTGGTTGGTTTTGAGTGGTTGAGNCTGGTTT----------GATNGTGNTGTTGGGTN
NGGAGTANTTGTNCCTGGTGTNGATCGTTTTGNTAGTGGTTGGTAGATGGTGNTGTTGGT
TTATGGGNNGGTTGGAATCTGGTGNTGTATGGTTTTGCAGCAGTTGGAGCTGGTGTGTGT
TGGTTTTGGTGTGGTTGGAGCTGGTGCAGTTGNAGNTNNGGNTGTTGTTTTTGGAGTATA
AGCAAGAACACCCTTAAATCCACCNGCTGGGTAATATGAACATGAAACTTCAGATTTATC
TGGNCATATTGTCTTTGCGCAACCATATGAAGT
Length of connected seq. 503
Full length Seq ID -
Full length Seq. -
Length of full length seq. -