Amino Acids Repeated Periodically at Every Fifth Position
in Proteins from SWISS-PROT Database Release 38 (July 1999), Total Entries: 80,000
---------------------------------------------------------------------------
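Each hit in this listing appears to follow the pattern
"<count>-<residue> @every Fifth position @<start> :<segment>", with the
segment wrapped onto further lines that begin with "-". The sketch below
shows one way to read such a hit and recount the residue at offsets
0, 5, 10, ... of the segment. The field meanings, the role of the leading
"-", and the names parse_hit and count_every_fifth are assumptions inferred
from the records themselves, not a documented specification of this file.

```python
import re

# Assumed layout of a hit line, inferred from the listing itself:
#   "<count>-<residue> @every Fifth position @<start> :<segment>"
HIT_RE = re.compile(r"^(\d+)-([A-Z]) @every Fifth position @(\d+) :([A-Z]+)$")


def parse_hit(lines):
    """Parse one hit: the first line plus its '-'-prefixed wrapped lines."""
    match = HIT_RE.match(lines[0].strip())
    if match is None:
        raise ValueError("not a hit line: %r" % lines[0])
    count, residue, start, segment = match.groups()
    # Assumption: a leading '-' only marks a wrapped line; it is not a residue.
    for extra in lines[1:]:
        segment += extra.strip().lstrip("-")
    return int(count), residue, int(start), segment


def count_every_fifth(segment, residue, period=5):
    """Count how often `residue` occupies offsets 0, 5, 10, ... of `segment`."""
    return sum(1 for aa in segment[::period] if aa == residue)


# First hit of entry P09031, copied from the listing.
hit = [
    "11-A @every Fifth position @35 :AVADPAAAAAAAVADTASDAAAAAAATAAAAAKAA",
    "-ADTAAAAAKAAADTAAAAAEAAAAT",
]
count, residue, start, segment = parse_hit(hit)
observed = count_every_fifth(segment, residue)
print(count, residue, start, len(segment), observed)
```

For this hit the recount agrees with the stated count. The segment covers
12 fifth-position offsets and one of them (offset 15) holds T rather than A,
which suggests that the reported window tolerates occasional mismatches;
that reading rests on the records inspected and is not confirmed by the file.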
>SWISS-PROT: P09031 (sequence length 97)
ANP_LIMFE (P09031) ANTIFREEZE PROTEIN PRECURSOR (AFP).
11-A @every Fifth position @35 :AVADPAAAAAAAVADTASDAAAAAAATAAAAAKAA
-ADTAAAAAKAAADTAAAAAEAAAAT
>SWISS-PROT: P40602 (sequence length 534)
APG_ARATH (P40602) ANTHER-SPECIFIC PROLINE-RICH PROTEIN APG PRECURSOR.
13-P @every Fifth position @67 :PKPVAPPGPSPKPVAPPGPSPCPSPPPKPQPKPPP
-APSPSPCPSPPPKPQPKPVPPPACPPTPPKPQPKP
-APPPA
>SWISS-PROT: P40603 (sequence length 449)
APG_BRANA (P40603) ANTHER-SPECIFIC PROLINE-RICH PROTEIN APG (PROTEIN C
12-P @every Fifth position @2 :PKPQPKPPPKPQPKPPPAPTPSPCPPQPPKPQPKP
-PPAPTPSPCPPQPPKPQPKPPPAPGPSPKPGPSPS
>SWISS-PROT: P26436 (sequence length 265)
ASPX_HUMAN (P26436) ACROSOMAL PROTEIN SP-10 PRECURSOR (ACROSOMAL VESIC
21-E @every Fifth position @67 :EHGSSEHGSSKHTVAEHTSGEHAESEHASGEPAAT
-EHAEGEHTVGEQPSGEQPSGEHLSGEQPLSELESG
-EQPSDEQPSGEHGSGEQPSGEQASGEQPSGEHASG
-EQASGAPISSTSTGT
13-S @every Fifth position @115 :SGEQPSGEHLSGEQPLSELESGEQPSDEQPSGEHG
-SGEQPSGEQASGEQPSGEHASGEQASGAPISSTST
-GTILN
>SWISS-PROT: Q06990 (sequence length 285)
ASPX_PAPPA (Q06990) ACROSOMAL PROTEIN SP-10 PRECURSOR (ACROSOMAL VESIC
25-E @every Fifth position @67 :EHGSSEHGSREHTVAEHTPGEHAESEHASGEPAAT
-GHAEGEHTVGEQPSGEQPSGEHLSGEQSLGEHASG
-EQPSDEQLSGEHASGEQPSGEHASGEQPSGEQPSG
-EHASGEQSLGEHALSEKPSGEQPSGAPISSISTGT
-ILNCY
15-G @every Fifth position @106 :GEHTVGEQPSGEQPSGEHLSGEQSLGEHASGEQPS
-DEQLSGEHASGEQPSGEHASGEQPSGEQPSGEHAS
-GEQSLGEHALSEKPS
12-S @every Fifth position @115 :SGEQPSGEHLSGEQSLGEHASGEQPSDEQLSGEHA
-SGEQPSGEHASGEQPSGEQPSGEHASGEQSLGEHA
>SWISS-PROT: P53353 (sequence length 349)
ASPX_VULVU (P53353) SPERM ACROSOMAL PROTEIN FSA-ACR.1 PRECURSOR (FRAGM
42-E @every Fifth position @51 :ETAAGENTLSEHTSGEHTSVEHASAEHSSTEHTSG
-EHASGEHTSGERATGEHTSSEHATSEHTSGEQPSG
-EQPSGEKSSGEQPSGEKSSGEQPSGEKSLGEQPSG
-EQSSGEKSSAEQTSGEQAVAEKPSGEHAVAEKPSG
-EQAVAERPSGEQAVAEKPLGEQAVAERPSGEQASI
-EKASSEQASAEQASAEQASSEQASGEKPLGEQPSG
-IPPSSTFSGPILNCHTCSYMNDQGKCLRGE
11-S @every Fifth position @114 :SGEQPSGEQPSGEKSSGEQPSGEKSSGEQPSGEKS
-LGEQPSGEQSSGEKSSAEQTSGEQAVAEKP
11-G @every Fifth position @115 :GEQPSGEQPSGEKSSGEQPSGEKSSGEQPSGEKSL
-GEQPSGEQSSGEKSSAEQTSGEQAVAEKPS
>SWISS-PROT: Q01851 (sequence length 423)
BR3A_HUMAN (Q01851) BRAIN-SPECIFIC HOMEOBOX/POU DOMAIN PROTEIN 3A (BRN
11-G @every Fifth position @133 :GAGGAGAAAGGGGAHDGPGGGGGPGGGGGPGGGGP
-GGGGGGGPGGGGGGPGGGLLGGSAHPHPHM
>SWISS-PROT: P48988 (sequence length 606)
CENB_CRIGR (P48988) MAJOR CENTROMERE AUTOANTIGEN B (CENTROMERE PROTEIN
12-E @every Fifth position @407 :EEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGE
-GEEVGEEEEVEEEGDESDEEEEEEEEEEEESSSEG
>SWISS-PROT: O61735 (sequence length 1023)
CLOC_DROME (O61735) CIRCADIAN LOCOMOTER OUTPUT CYCLES KAPUT PROTEIN (D
11-Q @every Fifth position @780 :QQQHQSHSQLQQHTQQQHQQQQQQQQQQQQQQQQQ
-QQQQQQQQQQQQQQQQLQLQQQNDILLRED
>SWISS-PROT: Q07202 (sequence length 204)
CORA_MEDSA (Q07202) COLD AND DROUGHT-REGULATED PROTEIN CORA.
13-G @every Fifth position @59 :GYNGGGYNHGGGYNHGGGGYHNGGGGYNHGGGGYN
-GGGGHGGHGGGGYNGGGGHGGHGGGGYNGGGGHGG
-HGGAE
>SWISS-PROT: P33240 (sequence length 577)
CST2_HUMAN (P33240) CLEAVAGE STIMULATION FACTOR, 64 KD SUBUNIT (CSTF 6
13-R @every Fifth position @408 :RGIDARGMEARAMEARGLDARGLEARAMEARAMEA
-RAMEARAMEARAMEVRGMEARGMDTRGPVPGPRGP
-IPSGM
>SWISS-PROT: O70133 (sequence length 1380)
DDX9_MOUSE (O70133) ATP-DEPENDENT RNA HELICASE A (NUCLEAR DNA HELICASE
11-G @every Fifth position @1179 :GGGGYGGGGYGGGYGSGGFGGGFGSGGGFGGGFNS
-GGGGFGSGGGGFGSGGGGFGGGGGGFSGGG
11-G @every Fifth position @1230 :GGFGGGGGGFSGGGGGGFGGGRGGGGGGFGGSGGF
-GNGGGGYGVGGGGYGGGGGGGYGGGSGGYG
12-G @every Fifth position @1237 :GGFSGGGGGGFGGGRGGGGGGFGGSGGFGNGGGGY
-GVGGGGYGGGGGGGYGGGSGGYGGGGYGGGEGYSI
11-G @every Fifth position @1244 :GGGFGGGRGGGGGGFGGSGGFGNGGGGYGVGGGGY
-GGGGGGGYGGGSGGYGGGGYGGGEGYSISP
>SWISS-PROT: P03211 (sequence length 641)
EBN1_EBV (P03211) EBNA-1 NUCLEAR PROTEIN.
13-G @every Fifth position @191 :GGAGGAGAGGGAGAGGAGGAGGAGAGGAGAGGGAG
-GAGGAGAGGAGAGGAGAGGAGAGGAGGAGAGGAGG
-AGAGG
11-G @every Fifth position @209 :GAGGAGAGGAGAGGGAGGAGGAGAGGAGAGGAGAG
-GAGAGGAGGAGAGGAGGAGAGGAGGAGAGG
>SWISS-PROT: P03204 (sequence length 992)
EBN6_EBV (P03204) EBNA-6 NUCLEAR PROTEIN (EBNA-3C) (EBNA-4B).
16-P @every Fifth position @551 :PPTVSPSDTGPPAVGPPAAGPPAAGPPAAGPPAAG
-PPAAGPPAAGPRILAPLSAGPPAAGPHIVTPPSAR
-PRIMAPPVVRMFMRERQLPQ
>SWISS-PROT: P19470 (sequence length 212)
EGG1_SCHJA (P19470) EGGSHELL PROTEIN 1 PRECURSOR.
13-G @every Fifth position @32 :GGGGGGGGGYGGWCGGSDCYGGGNGGGGGGGGGNG
-GEYGGGYGDVYGGSYGGGSYGGGGYGDVYGGGCGG
-PDCYG
>SWISS-PROT: P19469 (sequence length 207)
EGG2_SCHJA (P19469) EGGSHELL PROTEIN 2A PRECURSOR.
12-G @every Fifth position @32 :GGGGGGGGGYGGWCGGSDCYGGGNGGGGGGGGGNG
-GEYGGGYGDVYGGSYGGGEYGDVYGGGCGGPDCYG
>SWISS-PROT: P04985 (sequence length 747)
ELS_BOVIN (P04985) ELASTINS A/B/C PRECURSOR (TROPOELASTIN).
12-G @every Fifth position @333 :GLPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGV
-GVPGVGVPGVGVPGVGVPGVGVPGALSPAATAKAA
13-P @every Fifth position @335 :PGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGV
-PGVGVPGVGVPGVGVPGVGVPGALSPAATAKAAAK
-AAKFG
12-G @every Fifth position @336 :GVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVP
-GVGVPGVGVPGVGVPGVGVPGALSPAATAKAAAKA
11-V @every Fifth position @337 :VGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPG
-VGVPGVGVPGVGVPGVGVPGALSPAATAKA
11-V @every Fifth position @339 :VPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVG
-VPGVGVPGVGVPGVGVPGALSPAATAKAAA
>SWISS-PROT: P07916 (sequence length 750)
ELS_CHICK (P07916) ELASTIN PRECURSOR (TROPOELASTIN) (FRAGMENT).
11-V @every Fifth position @449 :VPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVG
-VPGVGVPGVGVPGVGVPGLVPGAGPAAAAK
11-P @every Fifth position @450 :PGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGV
-PGVGVPGVGVPGVGVPGLVPGAGPAAAAKA
11-G @every Fifth position @451 :GVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVP
-GVGVPGVGVPGVGVPGLVPGAGPAAAAKAA
>SWISS-PROT: P54320 (sequence length 860)
ELS_MOUSE (P54320) ELASTIN PRECURSOR (TROPOELASTIN).
12-A @every Fifth position @712 :AAAAAAAKAAAKAAQYGLGGAGGLGAGGLGAGGLG
-AGGLGAGGLGAGGLGAGGLGAGGLGAGGGVSPAAA
>SWISS-PROT: P35550 (sequence length 327)
FBRL_MOUSE (P35550) FIBRILLARIN (NUCLEOLAR PROTEIN 1).
14-G @every Fifth position @17 :GFGDRGGRGGGRGGRGGFGGGRGGFGGGGRGRGGG
-GGGFRGRGGGGVRGGGFQSGGNRGRGGGRGGKRGN
-QSGKNVMVEP
>SWISS-PROT: P32768 (sequence length 1537)
FLO1_YEAST (P32768) FLOCCULATION PROTEIN FLO1 PRECURSOR (FLOCCULIN 1).
11-S @every Fifth position @1121 :SSVISSSVTSSLFTSSPVISSSVISSSTTTSTSIF
-SESSKSSVIPTSSSTSGSSESETSSAGSVS
>SWISS-PROT: P38894 (sequence length 1075)
FLO5_YEAST (P38894) FLOCCULATION PROTEIN FLO5 PRECURSOR (FLOCCULIN 5).
11-S @every Fifth position @670 :SSVISSSVTSSLVTSSSFISSSVISSSTTTSTSIF
-SESSTSSVIPTSSSTSGSSESKTSSASSSS
>SWISS-PROT: P10419 (sequence length 435)
FMRA_ANTEL (P10419) ANTHO-RFAMIDE NEUROPEPTIDE PRECURSOR.
14-F @every Fifth position @61 :FWKGRFSDPQFWKGRFSDPQFWKGRFSDPQFWKGR
-FSDPQFWKGRFSDPQFWKGRFSDPQFWKGRFSDGT
-KRENDPQYWK
>SWISS-PROT: P13709 (sequence length 2038)
FSH_DROME (P13709) FEMALE STERILE HOMEOTIC PROTEIN (FRAGILE-CHORION M
11-Q @every Fifth position @1518 :QQTHQQQQQHQQQHHQQQQQQLTQQQLQQQQQQQQ
-QQQHLQQQQHQQQHHQAANKLLIIPKPIES
>SWISS-PROT: P13816 (sequence length 678)
GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR.
13-K @every Fifth position @375 :KEGEHKEEEHKEGEHKEGEHKEEEHKEEEHKKEEH
-KSKEHKSKGKKDKGKKDKGKHKKAKKEKVKKHVVK
-NVIED
12-E @every Fifth position @549 :EDKKEESKEVQEESKEVQEDEEEVEEDEEEEEEEE
-EEEEEEEEEEEEEEEEEEEEEEDEDEEDEDDAEED
>SWISS-PROT: P36417 (sequence length 708)
GBF_DICDI (P36417) G-BOX BINDING FACTOR (GBF).
20-Q @every Fifth position @145 :QQPQHHQQMQQQQHHQQMQQQQQHHQQMQQQQHHQ
-QMQHHQLQQHQHQHQQQQQQQQHQQQHHQQQQQQQ
-QQHHQQQQHHQHSQPQQQHQHNQQQQHQHNQQQHQ
-QQQNQIQMVP
>SWISS-PROT: P34689 (sequence length 707)
GLH1_CAEEL (P34689) ATP-DEPENDENT RNA HELICASE GLH-1.
16-G @every Fifth position @21 :GGGFGGGNNGGSGFGGGKNGGTGFGGGNTGGSGFG
-GGNTGGSGFGGGKTGGSGFGGGNTCGSFGGGNSGF
-GEGGHGGGERNNNCFNCQQP
12-G @every Fifth position @25 :GGGNNGGSGFGGGKNGGTGFGGGNTGGSGFGGGNT
-GGSGFGGGKTGGSGFGGGNTCGSFGGGNSGFGEGG
>SWISS-PROT: Q05966 (sequence length 169)
GR10_BRANA (Q05966) GLYCINE-RICH RNA-BINDING PROTEIN 10.
13-G @every Fifth position @90 :GGGRGGGGYGGRGGGGYGGGGGGYGDRRGGGGYGS
-GGGGRGGGGYGSGGGGYGGGGGRRDGGGYGGGDGG
-YGGGS
>SWISS-PROT: P09789 (sequence length 384)
GRP1_PETHY (P09789) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1 PRECUR
12-G @every Fifth position @72 :GGGAGGGAGGGLGGGGGLGGGGGAGGGGGLGGGGG
-AGGGFGGGAGGGAGGGLGGGGGLGGGGGGGAGGGG
23-G @every Fifth position @106 :GAGGGFGGGAGGGAGGGLGGGGGLGGGGGGGAGGG
-GGVGGGAGSGGGFGAGGGVGGGAGAGGGVGGGGGF
-GGGGGGGVGGGSGHGGGFGAGGGVGGGAGGGLGGG
-VGGGGGGGSGGGGGIGGGSGHGGGF
14-G @every Fifth position @198 :GVGGGAGGGLGGGVGGGGGGGSGGGGGIGGGSGHG
-GGFGAGGGVGGGVGGGAAGGGGGGGGGGGGGGGGL
-GGGSGHGGGF
11-G @every Fifth position @255 :GGGGGGGGGGGGLGGGSGHGGGFGAGGGVGGGAAG
-GVGGGGGFGGGGGGGVGGGSGHGGGFGAGG
13-G @every Fifth position @293 :GGGGFGGGGGGGVGGGSGHGGGFGAGGGVGGGAGG
-GLGGGGGAGGGGGIGGGHGGGFGVGVGIGIGVGVG
-AGAGH
>SWISS-PROT: P10495 (sequence length 252)
GRP1_PHAVU (P10495) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.0 PREC
11-G @every Fifth position @65 :GGYGEGAGGGEGAGAGYGAAGGGHGGGGGNGGGGG
-GGADGGGYGGGAGKGGGEGYGGGGANGGGY
11-G @every Fifth position @86 :GGHGGGGGNGGGGGGGADGGGYGGGAGKGGGEGYG
-GGGANGGGYGGGGGSGGGGGGGAGGAGSGY
22-G @every Fifth position @122 :GGANGGGYGGGGGSGGGGGGGAGGAGSGYGGGEGS
-GAGGGYGGANGGGGGGNGGGGGGGSGGAHGGGAAG
-GGEGAGQGAGGGYGGGAAGGGGRGSGGGGGGGYGG
-GGARGSGYGGGGGSGEGGGH
>SWISS-PROT: Q99069 (sequence length 142)
GRP1_SORVU (Q99069) GLYCINE-RICH RNA-BINDING PROTEIN 1 (FRAGMENT).
12-G @every Fifth position @70 :GGGGGGGYGGGRGGGGGYGRRDGGGGGYGGGGGGY
-GGGRGGYGGGGYGGGGGGYGGGSRGGGGYGNSDGN
>SWISS-PROT: P27484 (sequence length 214)
GRP2_NICSY (P27484) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 2 PRECUR
12-G @every Fifth position @77 :GAAVQGGRGGGGGGGGRGGGGYGGGSGGYGGGGRG
-GSRGYGGGDGGYGGGGGYGGGSRYGGGGGGYGGGG
14-G @every Fifth position @86 :GGGGGGGRGGGGYGGGSGGYGGGGRGGSRGYGGGD
-GGYGGGGGYGGGSRYGGGGGGYGGGGGYGGGGSGG
-GSGCFKCGES
>SWISS-PROT: P29834 (sequence length 183)
GRP2_ORYSA (P29834) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 2 PRECUR
13-G @every Fifth position @60 :GGGSGGAAGGGYGRGGGGGGGGGEGGGSGSGYGSG
-QGSGYGAGVGGAGGYGSGGGGGGGQGGGAGGYGQG
-SGYGS
13-G @every Fifth position @102 :GVGGAGGYGSGGGGGGGQGGGAGGYGQGSGYGSGY
-GSGAGGAHGGGYGSGGGGGGGGGQGGGSGYGSGSG
-YGSGY
>SWISS-PROT: P10496 (sequence length 465)
GRP2_PHAVU (P10496) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.8 PREC
16-G @every Fifth position @128 :GGGYGAGGEHGIGYGGGGGSGAGGGGGYNAGGAQG
-GGYGTGGGAGGGGGGGGDHGGGYGGGQGAGGGAGG
-GYGGGGEHGGGGGGGQGGGA
18-G @every Fifth position @368 :GGGAGGGYGTGGEHGGGYGGGQGGGGGYGAGGDHG
-AAGYGGGEGGGGGSGGGYGDGGAHGGGYGGGAGGG
-GGYGAGGAHGGGYGGGGGIGGGHGG
>SWISS-PROT: Q99070 (sequence length 168)
GRP2_SORVU (Q99070) GLYCINE-RICH RNA-BINDING PROTEIN 2.
11-G @every Fifth position @89 :GGGGGGGYGGGGGGYGGREGGGYGGGGGGYGGRRE
-GGGGYGGGGYGGGGGGYGGREGGGGYGGGG
12-G @every Fifth position @90 :GGGGGGYGGGGGGYGGREGGGYGGGGGGYGGRREG
-GGGYGGGGYGGGGGGYGGREGGGGYGGGGGYGGNR
>SWISS-PROT: P10979 (sequence length 157)
GRPA_MAIZE (P10979) GLYCINE-RICH RNA-BINDING, ABSCISIC ACID-INDUCIBLE
11-G @every Fifth position @88 :GGGGGGGGYGGGRGGGGYGGGRRDGGYGGGGGYGG
-RREGGGGGYGGGGGYGGRREGGGGGYGGGG
>SWISS-PROT: P17816 (sequence length 200)
GRP_HORVU (P17816) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN PRECURSO
11-G @every Fifth position @46 :GGGHHGHGRGGHGGGGYGGGGGYGGGGGGYPGGGG
-GYGGGGGGYPGHGGEGGGGYGGGGGYPGHG
14-G @every Fifth position @48 :GHHGHGRGGHGGGGYGGGGGYGGGGGGYPGGGGGY
-GGGGGGYPGHGGEGGGGYGGGGGYPGHGGEGGGGY
-GGGGGYHGHG
16-G @every Fifth position @105 :GYPGHGGEGGGGYGGGGGYHGHGGEGGGGYGGGGG
-YHGHGGEGGGGYGGGGGGYPGHGGGGGHGGGRCKW
-GCCGHGFLHHGCRCCARADE
>SWISS-PROT: P40274 (sequence length 90)
H162_TRYCR (P40274) HISTONE H1.M6.2.
11-K @every Fifth position @9 :KKASPKKAAAKKASPKKAAAKKASPKKAAARKTAA
-KKTAKKPAVRKPAAKKRAAPKKKPAAAKKP
>SWISS-PROT: Q07134 (sequence length 244)
H1O_CHITH (Q07134) HISTONE H1, ORPHON.
11-K @every Fifth position @157 :KKVVKKPAAKKPEAKKATKAAKPATKKVVAKPASK
-KAAAPKPKAAKPAAKKPEAKKATKAAAKKP
>SWISS-PROT: P11910 (sequence length 88)
H82_NEIGO (P11910) OUTER MEMBRANE PROTEIN H.8 PRECURSOR.
13-A @every Fifth position @16 :AACGGEKAAEAPAAEASSTEAPAAEAPAAEAPAAE
-AAAAEAPAAEAPAAEAPAAEAAATEAPAAEAPAAE
12-A @every Fifth position @23 :AAEAPAAEASSTEAPAAEAPAAEAPAAEAAAAEAP
-AAEAPAAEAPAAEAAATEAPAAEAPAAEAA
12-E @every Fifth position @25 :EAPAAEASSTEAPAAEAPAAEAPAAEAAAAEAPAA
-EAPAAEAPAAEAAATEAPAAEAPAA
>SWISS-PROT: P04196 (sequence length 525)
HRG_HUMAN (P04196) HISTIDINE-RICH GLYCOPROTEIN PRECURSOR (HISTIDINE-P
13-H @every Fifth position @342 :HNNNSSDLHPHKHHSHEQHPHGHHPHAHHPHEHDT
-HRQHPHGHHPHGHHPHGHHPHGHHPHGHHPHCHDF
-QDYGP
11-H @every Fifth position @350 :HPHKHHSHEQHPHGHHPHAHHPHEHDTHRQHPHGH
-HPHGHHPHGHHPHGHHPHGHHPHCHDFQDY
>SWISS-PROT: Q28640 (sequence length 526)
HRG_RABIT (Q28640) HISTIDINE-RICH GLYCOPROTEIN PRECURSOR (HISTIDINE-P
13-P @every Fifth position @339 :PPHGHHPHGPPPHGHPPHGPPPRHPPHGPPPHGHP
-PHGPPPHGHPPHGPPPHGHPPHGPPPHGHPPHGHG
-FHDHG
11-P @every Fifth position @348 :PPPHGHPPHGPPPRHPPHGPPPHGHPPHGPPPHGH
-PPHGPPPHGHPPHGPPPHGHPPHGHGFHDH
11-H @every Fifth position @365 :HGPPPHGHPPHGPPPHGHPPHGPPPHGHPPHGPPP
-HGHPPHGHGFHDHGPCDPPSHKEGPQDLHQ
>SWISS-PROT: P46593 (sequence length 634)
HWP1_CANAL (P46593) HYPHAL WALL PROTEIN 1 (CELL ELONGATION PROTEIN 2).
15-P @every Fifth position @103 :PCDYPQQPQEPCDNPPQPDVPCDNPPQPDVPCDNP
-PQPDIPCDNPPQPDIPCDNPPQPDQPDDNPPIPNI
-PTDWIPNIPTDWIPD
>SWISS-PROT: P28284 (sequence length 825)
ICP0_HSV2H (P28284) TRANS-ACTING TRANSCRIPTIONAL PROTEIN ICP0 (VMW118
11-A @every Fifth position @569 :ASAGAAPPSASPSSQAAVAAASSSSASSSSASSSS
-ASSSSASSSSASSSSASSSSASSSAGGAGG
>SWISS-PROT: Q01042 (sequence length 407)
IE68_HSVSA (Q01042) IMMEDIATE-EARLY PROTEIN.
34-E @every Fifth position @60 :EEQRREEVEEEGEERERRGEEEREGEGGEEGEGRE
-EAEEEEAEEKEAEEEEAEEAEEEAEEEEAEEAEAE
-EEEAEEEEAEEEEAEEAEEEEAEEAEEEAEEEEAE
-EEAEEEAEEAEEAEEEAEEEAEEAEEAEEAEEAEE
-EAEEAEEEAEEAEEEAEEAEEAEEAEEAEEEAEEA
-EEEEEEAGPSTPRLPHYKVV
15-E @every Fifth position @97 :EEEEAEEKEAEEEEAEEAEEEAEEEEAEEAEAEEE
-EAEEEEAEEEEAEEAEEEEAEEAEEEAEEEEAEEE
-AEEEAEEAEEAEEEA
>SWISS-PROT: P55875 (sequence length 1054)
IF2_STIAU (P55875) TRANSLATION INITIATION FACTOR IF-2.
11-P @every Fifth position @144 :PAAEAPKATAPVAPEPTVEAPKAAAPVAPEPTVEA
-PKTEAPVAAAPIAEAPTPPARTEVPVTSGR
>SWISS-PROT: P02537 (sequence length 486)
K1C0_XENLA (P02537) KERATIN 3, TYPE I CYTOSKELETAL 51 KD (51 KD CYTOKE
11-G @every Fifth position @36 :GGEGDFGGMGGFGACGAGYGGGAGYGGGAGGAGYG
-GGAGGGGAGYGGGFGGGSGAGYGGGFGGGA
>SWISS-PROT: P13645 (sequence length 593)
K1CJ_HUMAN (P13645) KERATIN, TYPE I CYTOSKELETAL 10 (CYTOKERATIN 10) (
15-G @every Fifth position @455 :GEGSSGGGGRGGGSFGGGYGGGSSGGGSSGGGYGG
-GHGGSSGGGYGGGSSGGGSSGGGYGGGSSSGGHGG
-GSSSGGHGGSSSGGY
12-G @every Fifth position @461 :GGGRGGGSFGGGYGGGSSGGGSSGGGYGGGHGGSS
-GGGYGGGSSGGGSSGGGYGGGSSSGGHGGGSSSGG
>SWISS-PROT: P02535 (sequence length 569)
K1CJ_MOUSE (P02535) KERATIN, TYPE I CYTOSKELETAL 10 (CYTOKERATIN 10) (
11-G @every Fifth position @88 :GGSSFGGGYGGSSFGGAGFGGGGSFGGGSFGGGSY
-GGGFGGGGFGGDGGSLLSGNGRVTMQNLND
12-G @every Fifth position @458 :GGGGGRRGGSGGGSYGGSSGGGSYGGSSGGGGSYG
-GSSGGGGSYGGGSSGGGSHGGSSGGGYGGGSSSGG
16-G @every Fifth position @477 :GGGSYGGSSGGGGSYGGSSGGGGSYGGGSSGGGSH
-GGSSGGGYGGGSSSGGAGGHGGSSGGGYGGGSSSG
-GQGGSGGFKSSGGGDQSSKG
>SWISS-PROT: P04264 (sequence length 643)
K2C1_HUMAN (P04264) KERATIN, TYPE II CYTOSKELETAL 1 (CYTOKERATIN 1) (K
11-G @every Fifth position @82 :GGGRGSGFGGGYGGGGFGGGGFGGGGFGGGGIGGG
-GFGGFGSGGGGFGGGGFGGGGYGGGYGPVC
11-G @every Fifth position @84 :GRGSGFGGGYGGGGFGGGGFGGGGFGGGGIGGGGF
-GGFGSGGGGFGGGGFGGGGYGGGYGPVCPP
11-G @every Fifth position @86 :GSGFGGGYGGGGFGGGGFGGGGFGGGGIGGGGFGG
-FGSGGGGFGGGGFGGGGYGGGYGPVCPPGG
12-G @every Fifth position @90 :GGGYGGGGFGGGGFGGGGFGGGGIGGGGFGGFGSG
-GGGFGGGGFGGGGYGGGYGPVCPPGGIQEVTINQS
>SWISS-PROT: P04104 (sequence length 581)
K2C1_MOUSE (P04104) KERATIN, TYPE II CYTOSKELETAL 1 (CYTOKERATIN 1) (6
11-G @every Fifth position @1 :GGGGSFCGGFGGGSYGRGGFGGGSYGGGGFGGGSF
-GGGGFGGSGFGGGSGGGGGFGSGGGFGGGR
13-G @every Fifth position @3 :GGSFCGGFGGGSYGRGGFGGGSYGGGGFGGGSFGG
-GGFGGSGFGGGSGGGGGFGSGGGFGGGRFGGYGPV
-CSPSG
>SWISS-PROT: P34099 (sequence length 648)
KAPC_DICDI (P34099) CAMP-DEPENDENT PROTEIN KINASE CATALYTIC SUBUNIT (E
16-Q @every Fifth position @140 :QQQPQQQQPQQQQPQQQQPQQQQQQQPQQQQQPQQ
-QLQQNNQQQQQQLQQQQLQQQLQQQQQQQQQQQQQ
-QQQKQQKQQQQQQQHLHQDG
15-Q @every Fifth position @144 :QQQQPQQQQPQQQQPQQQQQQQPQQQQQPQQQLQQ
-NNQQQQQQLQQQQLQQQLQQQQQQQQQQQQQQQQK
-QQKQQQQQQQHLHQD
12-Q @every Fifth position @163 :QQQPQQQQQPQQQLQQNNQQQQQQLQQQQLQQQLQ
-QQQQQQQQQQQQQQQKQQKQQQQQQQHLHQDGIVN
>SWISS-PROT: P38020 (sequence length 207)
KARP_CHLTR (P38020) HISTONE H1-LIKE PROTEIN KARP.
12-K @every Fifth position @7 :KRSTRKTAARKTVVRKPAAKKTAAKKASVRKVAAK
-KTVARKTVAKKAVAARKPAAKKTAAKKAPVRKVAA
>SWISS-PROT: P06719 (sequence length 657)
KNOB_PLAFN (P06719) KNOB-ASSOCIATED HISTIDINE-RICH PROTEIN PRECURSOR (
14-T @every Fifth position @547 :TKEASTSKEATKEASTSKEATKEASTSKEATKEAS
-TSKGATKEASTTEGATKGASTTAGSTTGATTGANA
-VQSKDETADK
>SWISS-PROT: P08131 (sequence length 181)
KR2D_SHEEP (P08131) KERATIN, HIGH-SULFUR MATRIX PROTEIN, B2D.
12-Q @every Fifth position @25 :QPTCCQTSCCQPTSIQTSCCQPTSIQTSCCQPTSI
-QTSCCQPISIQTSCCQPTCLQTSGCETGCGIGGSI
>SWISS-PROT: P26371 (sequence length 169)
KRUC_HUMAN (P26371) KERATIN, ULTRA HIGH-SULFUR MATRIX PROTEIN (UHS KER
12-C @every Fifth position @42 :CKPVCCCVPACSCSSCGKRGCGSCGGSKGGCGSCG
-CSQCSCCKPCCCSSGCGSSCCQCSCCKPYCSQCSC
>SWISS-PROT: P18160 (sequence length 1584)
KYK1_DICDI (P18160) NON-RECEPTOR TYROSINE KINASE SPORE LYSIS A (EC 2.7
12-N @every Fifth position @450 :NNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNSSNT
-NNNNINNTTNNNNSNSNNNNNNNNSNSNSNSNNNN
11-N @every Fifth position @451 :NNNNNNNNNNNNNNNNNNNNNNNNNNNNSNSSNTN
-NNNINNTTNNNNSNSNNNNNNNNSNSNSNS
>SWISS-PROT: P23490 (sequence length 316)
LORI_HUMAN (P23490) LORICRIN.
12-S @every Fifth position @91 :SGGGGSSGGGSGCFSSGGGGSGCFSSGGGGSSGGG
-SGCFSSGGGGSSGGGSGCFSSGGGGFSGQAVQCQS
>SWISS-PROT: P18165 (sequence length 481)
LORI_MOUSE (P18165) LORICRIN.
11-G @every Fifth position @34 :GSGCGGGSSGGGSSCGGGGGGSYGGGSSCGGGGGS
-GGGVKYSGGGGGSSCGGGYSGGGGGSSCGG
12-G @every Fifth position @138 :GGSGGGVKYSGGGGGGGSSCGGGSSGGGGGGSSCG
-GGSGGGGSYCGGSSGGGSSGGCGGGSGGGKYSGGG
11-G @every Fifth position @367 :GSSGGGGSCGGGSSGGGGGGGCYSSGGGGSSGGCG
-GGYSGGGGGCGGGSSGGSGGGCGGGSSGGS
>SWISS-PROT: P15714 (sequence length 255)
LP61_EIMTE (P15714) ANTIGEN LPMC-61 (FRAGMENT).
12-Q @every Fifth position @146 :QQQQQQQWPEQPEQQQQQQWPEQQQQQWSDQNQQQ
-QAQQWQAQQQQQWPQQQQQPQQQQQQQQQQDLGPD
>SWISS-PROT: P11746 (sequence length 286)
MCM1_YEAST (P11746) PHEROMONE RECEPTOR TRANSCRIPTION FACTOR (GRM/PRTF
11-Q @every Fifth position @189 :QPQQQQQQQPQQQMSQQQMSQHPRPQQGIPHPQQS
-QPQQQQQQQQQLQQQQQQQQQQPLTGIHQP
>SWISS-PROT: P36027 (sequence length 376)
MID2_YEAST (P36027) MATING PROCESS PROTEIN MID2 (SERINE-RICH PROTEIN S
13-S @every Fifth position @54 :SILSSSMVSSSSADSSSLTSSTSSRSLVSHTSSST
-SIASISFTSFSFSSDSSTSSSSSASSDSSSSSSFS
-ISSTS
>SWISS-PROT: Q05049 (sequence length 662)
MUC1_XENLA (Q05049) INTEGUMENTARY MUCIN C.1 (FIM-C.1) (FRAGMENT).
11-T @every Fifth position @217 :TKAPTTIQIATTTTTPTTTTTTTKATPTTTTTTKA
-TPTTTTTTKATTTTTTPTTTTTTTKATTTP
14-T @every Fifth position @229 :TTTPTTTTTTTKATPTTTTTTKATPTTTTTTKATT
-TTTTPTTTTTTTKATTTPTTTTTTTPTTTTTKATT
-TTTTTSGECK
13-T @every Fifth position @404 :TTTPTTTTTPTTTTTTKATTTTPTTTTTTPTTTTT
-TTTTTKATTTTPTTTTPTTTTTKATTTTPTTTTTT
-PTTTT
14-T @every Fifth position @418 :TTKATTTTPTTTTTTPTTTTTTTTTTKATTTTPTT
-TTPTTTTTKATTTTPTTTTTTPTTTTTKATTTTPT
-TTTTTPTTTT
>SWISS-PROT: P19706 (sequence length 1147)
MYSB_ACACA (P19706) MYOSIN HEAVY CHAIN IB (MYOSIN HEAVY CHAIN IL).
18-G @every Fifth position @989 :GGPGMGRGGPGMGGPGAGRGGPGMGGPGGPGRGGP
-GGPGAGRGGPGGPGAGRGGPGMGGPGGAGRGGPGA
-GRGGPGMGGPGAGRGGPGAGRGAAPAPAPAAPAKP
11-G @every Fifth position @1017 :GPGRGGPGGPGAGRGGPGGPGAGRGGPGMGGPGGA
-GRGGPGAGRGGPGMGGPGAGRGGPGAGRGA
>SWISS-PROT: Q40361 (sequence length 93)
N12A_MEDSA (Q40361) EARLY NODULIN 12A PRECURSOR (N-12A) (EARLY NODULIN
13-P @every Fifth position @20 :PQGFAEYYLNPAYRPPQTEPPVHKPPHKEPPVHKP
-PHKDPPVNKPPQKEPPVHKPPRKEPPTHRHPPSED
>SWISS-PROT: P20799 (sequence length 110)
N12A_PEA (P20799) EARLY NODULIN 12A PRECURSOR (N-12A).
13-P @every Fifth position @20 :PQGLAQYHLNPVYEPPVNGPPVNKPPQKETPVHKP
-PQKETPVHKPPQKEPPRHKPPQKEPPRHKPPHKKS
-HLHVT
>SWISS-PROT: Q40339 (sequence length 113)
N12B_MEDSA (Q40339) EARLY NODULIN 12B PRECURSOR (N-12B).
13-P @every Fifth position @34 :PPQTKPPVNKPSHKEPPVHKPPHKEPPVNKPRHKE
-PPVHKPPHKDPPVNKPPQKESPVHKPPRKEPPTHK
-HPPAE
11-P @every Fifth position @50 :PVHKPPHKEPPVNKPRHKEPPVHKPPHKDPPVNKP
-PQKESPVHKPPRKEPPTHKHPPAED
>SWISS-PROT: P30365 (sequence length 103)
NO12_MEDTR (P30365) EARLY NODULIN 12 PRECURSOR (N-12).
13-P @every Fifth position @30 :PAYRPPQTKPPVNKPSHKEPPVNKPPHKEPPVHKP
-PHKDPPVNKPPQKESPVHKPPRKESPTHRHPPAED
>SWISS-PROT: Q41701 (sequence length 100)
NO12_VICSA (Q41701) EARLY NODULIN 12 PRECURSOR (N-12).
11-P @every Fifth position @20 :PQGLAQYHLNPVYEAPVNGPPVNKPPQKETPVQKP
-PQKEPPVHKSPRNEPPRHKPPHKKSHLHVT
>SWISS-PROT: Q02937 (sequence length 365)
OMLA_ACTPL (Q02937) OUTER MEMBRANE LIPOPROTEIN A PRECURSOR.
11-P @every Fifth position @56 :PQADNSKAEEPKEMAPQVDSPKAEEPKNMAPQMGN
-PKLNDPQVMAPKMDNPQKDAPKGEELSKDK
>SWISS-PROT: P12348 (sequence length 1241)
PER_DROPS (P12348) PERIOD CIRCADIAN PROTEIN.
12-A @every Fifth position @687 :ANTSAAFNIAANTSAADNFGADTSAADTSGADTSA
-ADNYGPGNFGAENSCADNSGAENSCADNSGVDNSR
18-N @every Fifth position @724 :NYGPGNFGAENSCADNSGAENSCADNSGVDNSRPD
-NSGADNSAADNFGPDNSGADNSGPDNTGPDNSGAE
-NSRAENSRADNSRPDHPRPDISGASNSRPDKTGPD
>SWISS-PROT: P51524 (sequence length 212)
PF11_PIG (P51524) PROPHENIN-1 PRECURSOR (PF-1) (C6) (FRAGMENT).
17-P @every Fifth position @120 :PFLRRPRLRRQAFPPPNVPGPRFPPPNFPGPRFPP
-PNFPGPRFPPPNFPGPRFPPPNFPGPPFPPPIFPG
-PWFPPPPPFRPPPFGPPRFP
12-F @every Fifth position @132 :FPPPNVPGPRFPPPNFPGPRFPPPNFPGPRFPPPN
-FPGPRFPPPNFPGPPFPPPIFPGPWFPPPPPFRPP
13-P @every Fifth position @133 :PPPNVPGPRFPPPNFPGPRFPPPNFPGPRFPPPNF
-PGPRFPPPNFPGPPFPPPIFPGPWFPPPPPFRPPP
-FGPPR
>SWISS-PROT: P51525 (sequence length 228)
PF12_PIG (P51525) PROPHENIN-2 PRECURSOR (PF-2) (PR-2) (C12) (PROPHEN
17-P @every Fifth position @136 :PFLRRPRLRRQAFPPPNVPGPRFPPPNVPGPRFPP
-PNFPGPRFPPPNFPGPRFPPPNFPGPPFPPPIFPG
-PWFPPPPPFRPPPFGPPRFP
13-P @every Fifth position @149 :PPPNVPGPRFPPPNVPGPRFPPPNFPGPRFPPPNF
-PGPRFPPPNFPGPPFPPPIFPGPWFPPPPPFRPPP
-FGPPR
>SWISS-PROT: P06600 (sequence length 211)
PR33_DAUCA (P06600) PROLINE RICH 33 KD EXTENSIN-RELATED PROTEIN PRECUR
11-P @every Fifth position @11 :PSLADFHSHPPIHKPPVYTPPVHKPPIHKPPVYTP
-PVHKPPVYTPPVHKPPSEYKPPVEATNSVT
>SWISS-PROT: P29617 (sequence length 1403)
PRO_DROME (P29617) PROTEIN PROSPERO.
11-Q @every Fifth position @717 :QQQQQQQQQQQQQQQQQQEQQRRFEQEQQEQQRRK
-EEQQQQIQRQQQHLQQLQQQQMEQQHVATA
>SWISS-PROT: P50493 (sequence length 1153)
PVDB_PLAKN (P50493) DUFFY RECEPTOR, BETA FORM PRECURSOR (ERYTHROCYTE B
12-Q @every Fifth position @886 :QTSSDQTSSDQTSSNQTSSDQTSSNQTSSDQTSSD
-QISSDQTSSDQTSSNQTSSDQTIDTEEHHRDNVRN
11-T @every Fifth position @887 :TSSDQTSSDQTSSNQTSSDQTSSNQTSSDQTSSDQ
-ISSDQTSSDQTSSNQTSSDQTIDTEEHHRD
11-S @every Fifth position @888 :SSDQTSSDQTSSNQTSSDQTSSNQTSSDQTSSDQI
-SSDQTSSDQTSSNQTSSDQTIDTEEHHRDN
11-S @every Fifth position @889 :SDQTSSDQTSSNQTSSDQTSSNQTSSDQTSSDQIS
-SDQTSSDQTSSNQTSSDQTIDTEEHHRDNV
>SWISS-PROT: P51968 (sequence length 373)
RO31_XENLA (P51968) HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN A3 HOMOLOG
11-G @every Fifth position @270 :GNYGGGPGYGGRGYGGSPGYGNQGGGYGGGGGGYD
-GYNESGNFGGGNYNDFGNYGGQQQSNYGPM
>SWISS-PROT: P51992 (sequence length 385)
RO32_XENLA (P51992) HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN A3 HOMOLOG
13-G @every Fifth position @219 :GGNYGGGDGGGGGNFGRGGGFGNRGGYGGGGGRGG
-GYGGGGDGYNGFGGDGGNYGGGPGYGGRGYGGSPG
-YGNQG
>SWISS-PROT: P21522 (sequence length 342)
ROA1_SCHAM (P21522) HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN A1, A2/B1
12-G @every Fifth position @187 :GDAPGGRGGGGRGGVGGGAGGGWGGGRGDWGGSAG
-GGGGGGWGGADPWENGRGGGGDRWGGGGGGMGGGD
>SWISS-PROT: P27625 (sequence length 2339)
RPC1_PLAFA (P27625) DNA-DIRECTED RNA POLYMERASE III LARGEST SUBUNIT (E
14-N @every Fifth position @1094 :NDSNMNNINNNDSNMNSIHNNNSNMNNIHNNDSNR
-SIIHNNDSNMNSIHNNDSNMNSIHNNNSNMNNIHN
-NDSNRSIIHN
>SWISS-PROT: P16960 (sequence length 5035)
RYNR_PIG (P16960) RYANODINE RECEPTOR, SKELETAL MUSCLE (SKELETAL MUSC
11-E @every Fifth position @1870 :EVFTEEEEEEEEEEEEEEEDEEEKEEDEEEEAREK
-EDEEKEEEETAEGEKEEYLEEGLLQMKLPE
>SWISS-PROT: Q14242 (sequence length 412)
SEPL_HUMAN (Q14242) P-SELECTIN GLYCOPROTEIN LIGAND 1 PRECURSOR (PSGL-1
14-T @every Fifth position @114 :TDSAAMEIQTTQPAATEAQTTQPVPTEAQTTPLAA
-TEAQTTRLTATEAQTTPLAATEAQTTPPAATEAQT
-TQPTGLEAQT
>SWISS-PROT: P13730 (sequence length 328)
SGS3_DROER (P13730) SALIVARY GLUE PROTEIN SGS-3 PRECURSOR.
19-T @every Fifth position @107 :TTKRATTRRTTVRATTKRATTRRTTTKRAPTRRTT
-TKRATTRRNPTRRTTTRRAPTKRATTKRATTRRNP
-TKRKTTRRTTVRATKTTKRATTKRAPTKRATTKRA
-PTKRV
15-R @every Fifth position @114 :RRTTVRATTKRATTRRTTTKRAPTRRTTTKRATTR
-RNPTRRTTTRRAPTKRATTKRATTRRNPTKRKTTR
-RTTVRATKTTKRATT
16-T @every Fifth position @193 :TKRATTKRAPTKRATTKRAPTKRVTTKRAPTKRAT
-TKRAPTKRATTKRAPTKRATTKRAPTKRATTKRAP
-TKRATTKRATARPTSKPCGC
16-K @every Fifth position @194 :KRATTKRAPTKRATTKRAPTKRVTTKRAPTKRATT
-KRAPTKRATTKRAPTKRATTKRAPTKRATTKRAPT
-KRATTKRATARPTSKPCGCK
16-R @every Fifth position @195 :RATTKRAPTKRATTKRAPTKRVTTKRAPTKRATTK
-RAPTKRATTKRAPTKRATTKRAPTKRATTKRAPTK
-RATTKRATARPTSKPCGCKP
15-A @every Fifth position @196 :ATTKRAPTKRATTKRAPTKRVTTKRAPTKRATTKR
-APTKRATTKRAPTKRATTKRAPTKRATTKRAPTKR
-ATTKRATARPTSKPC
>SWISS-PROT: P02840 (sequence length 307)
SGS3_DROME (P02840) SALIVARY GLUE PROTEIN SGS-3 PRECURSOR.
14-T @every Fifth position @49 :TTTTTTCAPPTQQSTTQPPCTTSKPTTPKQTTTQL
-PCTTPTTTKATTTKPTTTKATTTKATTTKPTTTKQ
-TTTQLPCTTP
35-T @every Fifth position @81 :TQLPCTTPTTTKATTTKPTTTKATTTKATTTKPTT
-TKQTTTQLPCTTPTTTKQTTTQLPCTTPTTTKPTT
-TKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTT
-TKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTT
-TKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTT
-PKPCGCKSCGPGGEPCNGCAKRDAL
25-T @every Fifth position @129 :TTTKQTTTQLPCTTPTTTKPTTTKPTTTKPTTTKP
-TTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKP
-TTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKP
-TTTKPTTTKPTTTKPTTTKPTTPKPCGCKSCGPGG
-EPCNG
25-T @every Fifth position @130 :TTKQTTTQLPCTTPTTTKPTTTKPTTTKPTTTKPT
-TTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPT
-TTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPT
-TTKPTTTKPTTTKPTTTKPTTPKPCGCKSCGPGGE
-PCNGC
24-P @every Fifth position @143 :PTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTK
-PTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTK
-PTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTK
-PTTTKPTTPKPCGCKSCGPGGEPCNGCAKR
24-K @every Fifth position @147 :KPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTT
-KPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTT
-KPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTT
-KPTTPKPCGCKSCGPGGEPCNGCAKRDALC
>SWISS-PROT: P13728 (sequence length 263)
SGS3_DROYA (P13728) SALIVARY GLUE PROTEIN SGS-3 PRECURSOR.
15-T @every Fifth position @100 :TTTTTTTRRPTTRSTTTRHTTTTTTTTRRPTTTTT
-TTRRPTTTTTTTRRPTTTTTTTRLPTTRSTTTRHT
-TKSTTSKRPTHETTT
14-T @every Fifth position @101 :TTTTTTRRPTTRSTTTRHTTTTTTTTRRPTTTTTT
-TRRPTTTTTTTRRPTTTTTTTRLPTTRSTTTRHTT
-KSTTSKRPTH
>SWISS-PROT: P18480 (sequence length 905)
SNF5_YEAST (P18480) TRANSCRIPTION REGULATORY PROTEIN SNF5 (SWI/SNF COM
12-Q @every Fifth position @204 :QRQQQQQFRHHVQIQQQQQKQQQQQQQHQQQQQQQ
-QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQGQIPQ
13-Q @every Fifth position @210 :QFRHHVQIQQQQQKQQQQQQQHQQQQQQQQQQQQQ
-QQQQQQQQQQQQQQQQQQQQQQQQGQIPQSQQVPQ
-VRSMS
11-Q @every Fifth position @218 :QQQQQKQQQQQQQHQQQQQQQQQQQQQQQQQQQQQ
-QQQQQQQQQQQQQQQQGQIPQSQQVPQVRS
>SWISS-PROT: P37963 (sequence length 575)
SP6D_BACSU (P37963) STAGE VI SPORULATION PROTEIN D.
12-E @every Fifth position @206 :ETEKAESEPPESVASEPEAREDVKEEEESEELAVP
-ETEVRAESETEESEPEPDPSEIEIQEIVKAKKETA
>SWISS-PROT: P14328 (sequence length 600)
SP96_DICDI (P14328) SPORE COAT PROTEIN SP96.
13-S @every Fifth position @470 :SSSAASSSPSSSAASSSPSSSASSSSSPSSSASSS
-SAPSSSASSSSAPSSSASSSSASSSSASSAATTAA
-TTIAT
11-S @every Fifth position @479 :SSSAASSSPSSSASSSSSPSSSASSSSAPSSSASS
-SSAPSSSASSSSASSSSASSAATTAATTIA
>SWISS-PROT: P19837 (sequence length 747)
SPD1_NEPCL (P19837) SPIDROIN 1 (DRAGLINE SILK FIBROIN 1) (FRAGMENT).
11-G @every Fifth position @53 :GAGQGGYGGLGSQGAGRGGQGAGAAAAAAGGAGQG
-GYGGLGSQGAGRGGLGGQGAGAAAAAAAGG
11-G @every Fifth position @381 :GAGQGGYGGLGSQGAGRGGQGAGAAAAAAGGAGQR
-GYGGLGNQGAGRGGLGGQGAGAAAAAAAGG
13-G @every Fifth position @472 :GAGQGGYGGLGSQGAGRGGQGAGAAAAAAVGAGQE
-GIRGQGAGQGGYGGLGSQGSGRGGLGGQGAGAAAA
-AAGGA
13-G @every Fifth position @569 :GVRQGGYGGLGSQGAGRGGQGAGAAAAAAGGAGQG
-GYGGLGGQGVGRGGLGGQGAGAAAAGGAGQGGYGG
-VGSGA
>SWISS-PROT: P46804 (sequence length 627)
SPD2_NEPCL (P46804) SPIDROIN 2 (DRAGLINE SILK FIBROIN 2) (FRAGMENT).
12-G @every Fifth position @320 :GQQGLGGYGPGQQGPGGYGPGQQGPGGYGPGSASA
-AAAAAGPGQQGPGGYGPGQQGPSGPGSASAAAAAA
>SWISS-PROT: P21997 (sequence length 485)
SSGP_VOLCA (P21997) SULFATED SURFACE GLYCOPROTEIN 185 (SSG 185).
14-P @every Fifth position @230 :PPSPQPTASSRPPSPPPSPRPPSPPPPSPSPPPPP
-PPPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSP
-PRKPPSPSPP
12-P @every Fifth position @231 :PSPQPTASSRPPSPPPSPRPPSPPPPSPSPPPPPP
-PPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSPP
13-P @every Fifth position @248 :PRPPSPPPPSPSPPPPPPPPPPPPPPPPPSPPPPP
-PPPPPPPPPPPPPSPSPPRKPPSPSPPVPPPPSPP
-SVLPA
12-P @every Fifth position @254 :PPPSPSPPPPPPPPPPPPPPPPPSPPPPPPPPPPP
-PPPPPPPSPSPPRKPPSPSPPVPPPPSPPSVLPAA
>SWISS-PROT: P03186 (sequence length 3149)
TEGU_EBV (P03186) LARGE TEGUMENT PROTEIN.
11-A @every Fifth position @321 :ARYSPAKTNSPPSSPASAAPASAAPASAAPASAAP
-ASAAPASAAPASAAPASAAPASSPPLFIPI
11-P @every Fifth position @325 :PAKTNSPPSSPASAAPASAAPASAAPASAAPASAA
-PASAAPASAAPASAAPASSPPLFIPIPGLG
>SWISS-PROT: P29720 (sequence length 384)
TMPB_TREPH (P29720) TREPONEMAL MEMBRANE PROTEIN B PRECURSOR (ANTIGEN T
16-K @every Fifth position @151 :KAAADKAAAEKAAKEKAAREKSAKDKAAKEKAAKE
-KAAKDKAAKEKAAKEKAAKDKAAKEKAAKEKAARE
-MAAKEKAAKDKAAKEEAARK
19-A @every Fifth position @152 :AAADKAAAEKAAKEKAAREKSAKDKAAKEKAAKEK
-AAKDKAAKEKAAKEKAAKDKAAKEKAAKEKAAREM
-AAKEKAAKDKAAKEEAARKAAEEAAARKAAEEAAA
-RKAAE
18-A @every Fifth position @153 :AADKAAAEKAAKEKAAREKSAKDKAAKEKAAKEKA
-AKDKAAKEKAAKEKAAKDKAAKEKAAKEKAAREMA
-AKEKAAKDKAAKEEAARKAAEEAAARKAAEEAAAR
12-K @every Fifth position @174 :KDKAAKEKAAKEKAAKDKAAKEKAAKEKAAKDKAA
-KEKAAKEKAAREMAAKEKAAKDKAAKEEAARKAAE
>SWISS-PROT: P19934 (sequence length 421)
TOLA_ECOLI (P19934) TOLA PROTEIN.
11-A @every Fifth position @240 :AEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKK
-AAAAKAAAEKAAAAKAAAEADDIFGELSSG
>SWISS-PROT: Q00130 (sequence length 670)
VG50_HSVI1 (Q00130) HYPOTHETICAL GENE 50 PROTEIN.
13-T @every Fifth position @355 :TVVTTTPAMPTGATDTVVTTTPAMPTGATDTVVTT
-TPAMPTGATDTVVTTTPAKPAGANGTVVTTTPAMP
-AGAND
>SWISS-PROT: P28968 (sequence length 797)
VGLX_HSVEB (P28968) GLYCOPROTEIN X PRECURSOR.
15-T @every Fifth position @146 :TTTATATATSTPTTTTPTSTTTTTATTTVPTTAST
-TTDTTTAATTTAATTTAATTTAATTTAATTTAATT
-TAATTTAATTSSATT
19-T @every Fifth position @180 :TTTDTTTAATTTAATTTAATTTAATTTAATTTAAT
-TTAATTTAATTSSATTAATTTAATTTAATTTAATT
-TAATTTAATTTGSPTSGSTSTTGASTSTPSASTAT
-SATPT
17-T @every Fifth position @184 :TTTAATTTAATTTAATTTAATTTAATTTAATTTAA
-TTTAATTSSATTAATTTAATTTAATTTAATTTAAT
-TTAATTTGSPTSGSTSTTGASTSTPSASTA
14-A @every Fifth position @187 :AATTTAATTTAATTTAATTTAATTTAATTTAATTT
-AATTSSATTAATTTAATTTAATTTAATTTAATTTA
-ATTTGSPTSG
>SWISS-PROT: Q10778 (sequence length 678)
Y04H_MYCTU (Q10778) HYPOTHETICAL PROTEIN RV1547C PRECURSOR.
12-G @every Fifth position @193 :GNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNL
-GNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNT
34-N @every Fifth position @201 :NLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDL
-NLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSG
-NIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNL
-NVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSG
-NLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDN
-QIGFGALNSGSGNLGFGNSG
24-G @every Fifth position @263 :GTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNF
-GGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNT
-GDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNI
-GFGLTGDNQIGFGALNSGSGNLGFGNSGNG
11-G @every Fifth position @434 :GFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNS
-GTTNTGWFNSGTTNTGIGNSGGNLVTGSMG
12-N @every Fifth position @491 :NLVTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNV
-NTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNS
11-G @every Fifth position @498 :GLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNS
-GNINTGFFNSGDLNTGLFNSVNQPVQNSGW
>SWISS-PROT: Q93074 (sequence length 2124)
Y192_HUMAN (Q93074) HYPOTHETICAL PROTEIN KIAA0192 (FRAGMENT).
14-Q @every Fifth position @1998 :QQQQQQQQQQQQQQQQQQQQQQQQQQYHIRQQQQQ
-QILRQQQQQQQQQQQQQQQQQQQQQQQQQQHQQQQ
-QQQAAPPQPQ
13-Q @every Fifth position @2002 :QQQQQQQQQQQQQQQQQQQQQQYHIRQQQQQQILR
-QQQQQQQQQQQQQQQQQQQQQQQQQQHQQQQQQQA
-APPQP
11-Q @every Fifth position @2040 :QQQQQQQQQQQQQQQQQQQQQQQHQQQQQQQAAPP
-QPQPQSQPQFQRQGLQQTQQQQQTAALVRQ
>SWISS-PROT: P39712 (sequence length 1322)
YAG3_YEAST (P39712) HYPOTHETICAL 138.1 KD PROTEIN IN FLO9-GDH3 INTERGE
14-S @every Fifth position @894 :SSSVISSSDTSSLVISSSVTSSLVTSSPVISSSFI
-SSPVISSTTTSASILSESSKSSVIPTSSSTSGSSE
-SETGSASSAS
>SWISS-PROT: P38190 (sequence length 124)
YBF3_YEAST (P38190) VERY HYPOTHETICAL 13.2 KD PROTEIN IN PTC3-SAS3 INT
12-S @every Fifth position @41 :SFSSCSCPFLFPSSSSSLSSSYVSSSSSFSSDICS
-SSMSSSRVKSSSSSSSSLAFSPTYNSVSTSFSTSS
>SWISS-PROT: P38216 (sequence length 128)
YBM6_YEAST (P38216) HYPOTHETICAL 14.6 KD PROTEIN IN TTP1-KAP104 INTERG
12-Q @every Fifth position @44 :QQYYQQQQQHPGYYNQQGYNQQGYNQQGYNQQGYN
-QQGYNQQGYNQQGHQQPVYVQQQPPQRGNEGCLAA
>SWISS-PROT: P53214 (sequence length 551)
YG1F_YEAST (P53214) HYPOTHETICAL 57.5 KD PROTEIN IN VMA7-RPS25A INTERG
15-S @every Fifth position @157 :STVASSTLSTSSSLVISTSSSTFTFSSESSSSLIS
-SSISTSVSTSSVYVPSSSTSSPPSSSSELTSSSYS
-SSSSSSTLFSYSSSF
13-S @every Fifth position @224 :SYSSSSSSSTLFSYSSSFSSSSSSSSSSSSSSSSS
-SSSSSSYFTLSTSSSSSIYSSSSYPSFSSSSSSNP
-TSSIT
>SWISS-PROT: P42611 (sequence length 517)
YHS6_MYCTU (P42611) HYPOTHETICAL 50.6 KD PROTEIN IN HSP65 3'REGION.
17-G @every Fifth position @211 :GQINLGFGNTGSGNIGNNNIGNNNIGNNNIGSGNT
-GTGNIGSGNTGSGNLGLGNLGDGNIGFGNTGSGNI
-GFGITGDHQMGFGGFNSGSGNIGFGNSGTG
14-N @every Fifth position @214 :NLGFGNTGSGNIGNNNIGNNNIGNNNIGSGNTGTG
-NIGSGNTGSGNLGLGNLGDGNIGFGNTGSGNIGFG
-ITGDHQMGFG
22-G @every Fifth position @243 :GNTGTGNIGSGNTGSGNLGLGNLGDGNIGFGNTGS
-GNIGFGITGDHQMGFGGFNSGSGNIGFGNSGTGNV
-GLFNSGSGNIGIGNSGSLNSGIGTSGTINAGLGSA
-GSLNTSFWNAGMQNAALGSA
>SWISS-PROT: P35732 (sequence length 738)
YKF4_YEAST (P35732) HYPOTHETICAL 84.0 KD PROTEIN IN NUP120-CSE4 INTERG
12-Q @every Fifth position @410 :QPQQPQQPQQQLQQQQQQQQQPVQAQAQAQEEQLS
-QNYYTQQQQQQYAQQQHQLQQQYLSQQQQYAQQQQ
>SWISS-PROT: P21260 (sequence length 141)
YPRO_OWEFU (P21260) HYPOTHETICAL PROLINE-RICH PROTEIN (FRAGMENT).
11-P @every Fifth position @13 :PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
-PPPPPPPPPPPRRARIHHNIPLFLRFFKKS
>SWISS-PROT: Q09950 (sequence length 482)
YSR2_CAEEL (Q09950) HYPOTHETICAL 54.6 KD PROTEIN F59B10.2 IN CHROMOSOM
11-S @every Fifth position @214 :SSFESSSDSSSTSESSTSSESSSSASESESESKSE
-SQVSSSKTSTSKASSSKAYGSDFESEKSSS
>SWISS-PROT: Q10940 (sequence length 112)
YWS4_CAEEL (Q10940) HYPOTHETICAL 13.1 KD PROTEIN B0310.4 IN CHROMOSOME
17-R @every Fifth position @7 :RGESHRGETHRGETHRGETHRGETHRGKTHRGETH
-RGETHRGETHRGETHRGETHRGKTHRGETHRGETH
-RGETRRGETHRGKTQNFGGKFKFSEKNILA
17-G @every Fifth position @8 :GESHRGETHRGETHRGETHRGETHRGKTHRGETHR
-GETHRGETHRGETHRGETHRGKTHRGETHRGETHR
-GETRRGETHRGKTQNFGGKFKFSEKNILAG
15-H @every Fifth position @11 :HRGETHRGETHRGETHRGETHRGKTHRGETHRGET
-HRGETHRGETHRGETHRGKTHRGETHRGETHRGET
-RRGETHRGKTQNFGG
16-T @every Fifth position @15 :THRGETHRGETHRGETHRGKTHRGETHRGETHRGE
-THRGETHRGETHRGKTHRGETHRGETHRGETRRGE
-THRGKTQNFGGKFKFSEKNI
>SWISS-PROT: Q10540 (sequence length 443)
YZ06_MYCTU (Q10540) HYPOTHETICAL 43.6 KD PROTEIN CY31.06C.
25-G @every Fifth position @213 :GIGNIGNNNVGSGNTGDYNFGIGNIGNANLGNGNI
-GNANLGSGNAGFFNFGNGNDGNTNFGSGNAGFLNI
-GSGNEGSGNLGFGNAGDDNTGWGNSGDTNTGGFNS
-GDLNTGIGSPVTQGVANSGFGNTGTGHSGFFNSGN
-SGSGF
22-N @every Fifth position @216 :NIGNNNVGSGNTGDYNFGIGNIGNANLGNGNIGNA
-NLGSGNAGFFNFGNGNDGNTNFGSGNAGFLNIGSG
-NEGSGNLGFGNAGDDNTGWGNSGDTNTGGFNSGDL
-NTGIGSPVTQGVANSGFGNT
---------------------------------------------------------------------------
Total entries in the DataBase... 80000
Total amino acid residues....... 29085965
Total repeats detected.......... 108
Minimum repeating units......... 12
Maximum mismatch ............... 10%
---------------------------------------------------------------------------
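
The listing does not state how a label such as 17-R is computed. One reading, which is an assumption here and not a documented rule, is that the number counts how often the residue occurs at every fifth position of the printed segment, starting from its first residue. The Python sketch below checks that reading against the 17-R record of Q10940 above; the function names parse_record and count_periodic are hypothetical helpers, not part of the program that produced this listing.

```python
# Sketch only: assumes the label count is the number of matches found when
# sampling the printed segment at every fifth position (an assumption).

def parse_record(lines):
    """Split one repeat record into (labelled count, residue, segment).

    lines[0] looks like '17-R @every Fifth position @7 :SEQ';
    continuation lines start with '-' followed by more sequence.
    """
    header, _, first = lines[0].partition(":")
    count_str, rest = header.split("-", 1)
    residue = rest.split()[0]
    segment = first.strip() + "".join(
        line.lstrip("-").strip() for line in lines[1:]
    )
    return int(count_str), residue, segment


def count_periodic(segment, residue, period=5):
    """Return (matches, units) for positions 0, period, 2*period, ..."""
    sampled = segment[::period]
    return sampled.count(residue), len(sampled)


# The 17-R record of Q10940, copied from the listing above.
record = [
    "17-R @every Fifth position @7 :RGESHRGETHRGETHRGETHRGETHRGKTHRGETH",
    "-RGETHRGETHRGETHRGETHRGKTHRGETHRGETH",
    "-RGETRRGETHRGKTQNFGGKFKFSEKNILA",
]

labelled, residue, segment = parse_record(record)
matches, units = count_periodic(segment, residue)
print(labelled, residue, matches, units)
```

For this record the sampled positions contain 17 R out of 20, which agrees with the label. How the printed segment boundaries relate to the 10% mismatch limit in the summary is not specified in the listing, so the sketch does not attempt to reproduce the detection step itself.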