Amino Acids Repeated Periodically at every Fifth Position
in Proteins from SWISS-PROT DataBase Release: 38 (July 1999) Total Entries: 80,000

 --------------------------------------------------------------------------- 

 >SWISS-PROT: P09031   (sequence length  97) 
  ANP_LIMFE (P09031) ANTIFREEZE PROTEIN PRECURSOR (AFP).                
  11-A @every Fifth   position @35    :AVADPAAAAAAAVADTASDAAAAAAATAAAAAKAA
                                      -ADTAAAAAKAAADTAAAAAEAAAAT

 >SWISS-PROT: P40602   (sequence length 534) 
  APG_ARATH (P40602) ANTER-SPECIFIC PROLINE-RICH PROTEIN APG PRECURSOR. 
  13-P @every Fifth   position @67    :PKPVAPPGPSPKPVAPPGPSPCPSPPPKPQPKPPP
                                      -APSPSPCPSPPPKPQPKPVPPPACPPTPPKPQPKP
                                      -APPPA

 >SWISS-PROT: P40603   (sequence length 449) 
  APG_BRANA (P40603) ANTER-SPECIFIC PROLINE-RICH PROTEIN APG (PROTEIN C 
  12-P @every Fifth   position @2     :PKPQPKPPPKPQPKPPPAPTPSPCPPQPPKPQPKP
                                      -PPAPTPSPCPPQPPKPQPKPPPAPGPSPKPGPSPS

 >SWISS-PROT: P26436   (sequence length 265) 
  ASPX_HUMAN (P26436) ACROSOMAL PROTEIN SP-10 PRECURSOR (ACROSOMAL VESIC
  21-E @every Fifth   position @67    :EHGSSEHGSSKHTVAEHTSGEHAESEHASGEPAAT
                                      -EHAEGEHTVGEQPSGEQPSGEHLSGEQPLSELESG
                                      -EQPSDEQPSGEHGSGEQPSGEQASGEQPSGEHASG
                                      -EQASGAPISSTSTGT
  13-S @every Fifth   position @115   :SGEQPSGEHLSGEQPLSELESGEQPSDEQPSGEHG
                                      -SGEQPSGEQASGEQPSGEHASGEQASGAPISSTST
                                      -GTILN

 >SWISS-PROT: Q06990   (sequence length 285) 
  ASPX_PAPPA (Q06990) ACROSOMAL PROTEIN SP-10 PRECURSOR (ACROSOMAL VESIC
  25-E @every Fifth   position @67    :EHGSSEHGSREHTVAEHTPGEHAESEHASGEPAAT
                                      -GHAEGEHTVGEQPSGEQPSGEHLSGEQSLGEHASG
                                      -EQPSDEQLSGEHASGEQPSGEHASGEQPSGEQPSG
                                      -EHASGEQSLGEHALSEKPSGEQPSGAPISSISTGT
                                      -ILNCY
  15-G @every Fifth   position @106   :GEHTVGEQPSGEQPSGEHLSGEQSLGEHASGEQPS
                                      -DEQLSGEHASGEQPSGEHASGEQPSGEQPSGEHAS
                                      -GEQSLGEHALSEKPS
  12-S @every Fifth   position @115   :SGEQPSGEHLSGEQSLGEHASGEQPSDEQLSGEHA
                                      -SGEQPSGEHASGEQPSGEQPSGEHASGEQSLGEHA

 >SWISS-PROT: P53353   (sequence length 349) 
  ASPX_VULVU (P53353) SPERM ACROSOMAL PROTEIN FSA-ACR.1 PRECURSOR (FRAGM
  42-E @every Fifth   position @51    :ETAAGENTLSEHTSGEHTSVEHASAEHSSTEHTSG
                                      -EHASGEHTSGERATGEHTSSEHATSEHTSGEQPSG
                                      -EQPSGEKSSGEQPSGEKSSGEQPSGEKSLGEQPSG
                                      -EQSSGEKSSAEQTSGEQAVAEKPSGEHAVAEKPSG
                                      -EQAVAERPSGEQAVAEKPLGEQAVAERPSGEQASI
                                      -EKASSEQASAEQASAEQASSEQASGEKPLGEQPSG
                                      -IPPSSTFSGPILNCHTCSYMNDQGKCLRGE
  11-S @every Fifth   position @114   :SGEQPSGEQPSGEKSSGEQPSGEKSSGEQPSGEKS
                                      -LGEQPSGEQSSGEKSSAEQTSGEQAVAEKP
  11-G @every Fifth   position @115   :GEQPSGEQPSGEKSSGEQPSGEKSSGEQPSGEKSL
                                      -GEQPSGEQSSGEKSSAEQTSGEQAVAEKPS

 >SWISS-PROT: Q01851   (sequence length 423) 
  BR3A_HUMAN (Q01851) BRAIN-SPECIFIC HOMEOBOX/POU DOMAIN PROTEIN 3A (BRN
  11-G @every Fifth   position @133   :GAGGAGAAAGGGGAHDGPGGGGGPGGGGGPGGGGP
                                      -GGGGGGGPGGGGGGPGGGLLGGSAHPHPHM

 >SWISS-PROT: P48988   (sequence length 606) 
  CENB_CRIGR (P48988) MAJOR CENTROMERE AUTOANTIGEN B (CENTROMERE PROTEIN
  12-E @every Fifth   position @407   :EEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGE
                                      -GEEVGEEEEVEEEGDESDEEEEEEEEEEEESSSEG

 >SWISS-PROT: O61735   (sequence length 1023) 
  CLOC_DROME (O61735) CIRCADIAN LOCOMOTER OUTPUT CYCLES KAPUT PROTEIN (D
  11-Q @every Fifth   position @780   :QQQHQSHSQLQQHTQQQHQQQQQQQQQQQQQQQQQ
                                      -QQQQQQQQQQQQQQQQLQLQQQNDILLRED

 >SWISS-PROT: Q07202   (sequence length 204) 
  CORA_MEDSA (Q07202) COLD AND DROUGHT-REGULATED PROTEIN CORA.          
  13-G @every Fifth   position @59    :GYNGGGYNHGGGYNHGGGGYHNGGGGYNHGGGGYN
                                      -GGGGHGGHGGGGYNGGGGHGGHGGGGYNGGGGHGG
                                      -HGGAE

 >SWISS-PROT: P33240   (sequence length 577) 
  CST2_HUMAN (P33240) CLEAVAGE STIMULATION FACTOR, 64 KD SUBUNIT (CSTF 6
  13-R @every Fifth   position @408   :RGIDARGMEARAMEARGLDARGLEARAMEARAMEA
                                      -RAMEARAMEARAMEVRGMEARGMDTRGPVPGPRGP
                                      -IPSGM

 >SWISS-PROT: O70133   (sequence length 1380) 
  DDX9_MOUSE (O70133) ATP-DEPENDENT RNA HELICASE A (NUCLEAR DNA HELICASE
  11-G @every Fifth   position @1179  :GGGGYGGGGYGGGYGSGGFGGGFGSGGGFGGGFNS
                                      -GGGGFGSGGGGFGSGGGGFGGGGGGFSGGG
  11-G @every Fifth   position @1230  :GGFGGGGGGFSGGGGGGFGGGRGGGGGGFGGSGGF
                                      -GNGGGGYGVGGGGYGGGGGGGYGGGSGGYG
  12-G @every Fifth   position @1237  :GGFSGGGGGGFGGGRGGGGGGFGGSGGFGNGGGGY
                                      -GVGGGGYGGGGGGGYGGGSGGYGGGGYGGGEGYSI
  11-G @every Fifth   position @1244  :GGGFGGGRGGGGGGFGGSGGFGNGGGGYGVGGGGY
                                      -GGGGGGGYGGGSGGYGGGGYGGGEGYSISP

 >SWISS-PROT: P03211   (sequence length 641) 
  EBN1_EBV (P03211) EBNA-1 NUCLEAR PROTEIN.                             
  13-G @every Fifth   position @191   :GGAGGAGAGGGAGAGGAGGAGGAGAGGAGAGGGAG
                                      -GAGGAGAGGAGAGGAGAGGAGAGGAGGAGAGGAGG
                                      -AGAGG
  11-G @every Fifth   position @209   :GAGGAGAGGAGAGGGAGGAGGAGAGGAGAGGAGAG
                                      -GAGAGGAGGAGAGGAGGAGAGGAGGAGAGG

 >SWISS-PROT: P03204   (sequence length 992) 
  EBN6_EBV (P03204) EBNA-6 NUCLEAR PROTEIN (EBNA-3C) (EBNA-4B).         
  16-P @every Fifth   position @551   :PPTVSPSDTGPPAVGPPAAGPPAAGPPAAGPPAAG
                                      -PPAAGPPAAGPRILAPLSAGPPAAGPHIVTPPSAR
                                      -PRIMAPPVVRMFMRERQLPQ

 >SWISS-PROT: P19470   (sequence length 212) 
  EGG1_SCHJA (P19470) EGGSHELL PROTEIN 1 PRECURSOR.                     
  13-G @every Fifth   position @32    :GGGGGGGGGYGGWCGGSDCYGGGNGGGGGGGGGNG
                                      -GEYGGGYGDVYGGSYGGGSYGGGGYGDVYGGGCGG
                                      -PDCYG

 >SWISS-PROT: P19469   (sequence length 207) 
  EGG2_SCHJA (P19469) EGGSHELL PROTEIN 2A PRECURSOR.                    
  12-G @every Fifth   position @32    :GGGGGGGGGYGGWCGGSDCYGGGNGGGGGGGGGNG
                                      -GEYGGGYGDVYGGSYGGGEYGDVYGGGCGGPDCYG

 >SWISS-PROT: P04985   (sequence length 747) 
  ELS_BOVIN (P04985) ELASTINS A/B/C PRECURSOR (TROPOELASTIN).           
  12-G @every Fifth   position @333   :GLPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGV
                                      -GVPGVGVPGVGVPGVGVPGVGVPGALSPAATAKAA
  13-P @every Fifth   position @335   :PGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGV
                                      -PGVGVPGVGVPGVGVPGVGVPGALSPAATAKAAAK
                                      -AAKFG
  12-G @every Fifth   position @336   :GVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVP
                                      -GVGVPGVGVPGVGVPGVGVPGALSPAATAKAAAKA
  11-V @every Fifth   position @337   :VGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPG
                                      -VGVPGVGVPGVGVPGVGVPGALSPAATAKA
  11-V @every Fifth   position @339   :VPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVG
                                      -VPGVGVPGVGVPGVGVPGALSPAATAKAAA

 >SWISS-PROT: P07916   (sequence length 750) 
  ELS_CHICK (P07916) ELASTIN PRECURSOR (TROPOELASTIN) (FRAGMENT).       
  11-V @every Fifth   position @449   :VPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVG
                                      -VPGVGVPGVGVPGVGVPGLVPGAGPAAAAK
  11-P @every Fifth   position @450   :PGVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGV
                                      -PGVGVPGVGVPGVGVPGLVPGAGPAAAAKA
  11-G @every Fifth   position @451   :GVGVPGVGVPGVGVPGVGVPGVGVPGVGVPGVGVP
                                      -GVGVPGVGVPGVGVPGLVPGAGPAAAAKAA

 >SWISS-PROT: P54320   (sequence length 860) 
  ELS_MOUSE (P54320) ELASTIN PRECURSOR (TROPOELASTIN).                  
  12-A @every Fifth   position @712   :AAAAAAAKAAAKAAQYGLGGAGGLGAGGLGAGGLG
                                      -AGGLGAGGLGAGGLGAGGLGAGGLGAGGGVSPAAA

 >SWISS-PROT: P35550   (sequence length 327) 
  FBRL_MOUSE (P35550) FIBRILLARIN (NUCLEOLAR PROTEIN 1).                
  14-G @every Fifth   position @17    :GFGDRGGRGGGRGGRGGFGGGRGGFGGGGRGRGGG
                                      -GGGFRGRGGGGVRGGGFQSGGNRGRGGGRGGKRGN
                                      -QSGKNVMVEP

 >SWISS-PROT: P32768   (sequence length 1537) 
  FLO1_YEAST (P32768) FLOCCULATION PROTEIN FLO1 PRECURSOR (FLOCCULIN 1).
  11-S @every Fifth   position @1121  :SSVISSSVTSSLFTSSPVISSSVISSSTTTSTSIF
                                      -SESSKSSVIPTSSSTSGSSESETSSAGSVS

 >SWISS-PROT: P38894   (sequence length 1075) 
  FLO5_YEAST (P38894) FLOCCULATION PROTEIN FLO5 PRECURSOR (FLOCCULIN 5).
  11-S @every Fifth   position @670   :SSVISSSVTSSLVTSSSFISSSVISSSTTTSTSIF
                                      -SESSTSSVIPTSSSTSGSSESKTSSASSSS

 >SWISS-PROT: P10419   (sequence length 435) 
  FMRA_ANTEL (P10419) ANTHO-RFAMIDE NEUROPEPTIDE PRECURSOR.             
  14-F @every Fifth   position @61    :FWKGRFSDPQFWKGRFSDPQFWKGRFSDPQFWKGR
                                      -FSDPQFWKGRFSDPQFWKGRFSDPQFWKGRFSDGT
                                      -KRENDPQYWK

 >SWISS-PROT: P13709   (sequence length 2038) 
  FSH_DROME (P13709) FEMALE STERILE HOMEOTIC PROTEIN (FRAGILE-CHORION M 
  11-Q @every Fifth   position @1518  :QQTHQQQQQHQQQHHQQQQQQLTQQQLQQQQQQQQ
                                      -QQQHLQQQQHQQQHHQAANKLLIIPKPIES

 >SWISS-PROT: P13816   (sequence length 678) 
  GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR.             
  13-K @every Fifth   position @375   :KEGEHKEEEHKEGEHKEGEHKEEEHKEEEHKKEEH
                                      -KSKEHKSKGKKDKGKKDKGKHKKAKKEKVKKHVVK
                                      -NVIED
  12-E @every Fifth   position @549   :EDKKEESKEVQEESKEVQEDEEEVEEDEEEEEEEE
                                      -EEEEEEEEEEEEEEEEEEEEEEDEDEEDEDDAEED

 >SWISS-PROT: P36417   (sequence length 708) 
  GBF_DICDI (P36417) G-BOX BINDING FACTOR (GBF).                        
  20-Q @every Fifth   position @145   :QQPQHHQQMQQQQHHQQMQQQQQHHQQMQQQQHHQ
                                      -QMQHHQLQQHQHQHQQQQQQQQHQQQHHQQQQQQQ
                                      -QQHHQQQQHHQHSQPQQQHQHNQQQQHQHNQQQHQ
                                      -QQQNQIQMVP

 >SWISS-PROT: P34689   (sequence length 707) 
  GLH1_CAEEL (P34689) ATP-DEPENDENT RNA HELICASE GLH-1.                 
  16-G @every Fifth   position @21    :GGGFGGGNNGGSGFGGGKNGGTGFGGGNTGGSGFG
                                      -GGNTGGSGFGGGKTGGSGFGGGNTCGSFGGGNSGF
                                      -GEGGHGGGERNNNCFNCQQP
  12-G @every Fifth   position @25    :GGGNNGGSGFGGGKNGGTGFGGGNTGGSGFGGGNT
                                      -GGSGFGGGKTGGSGFGGGNTCGSFGGGNSGFGEGG

 >SWISS-PROT: Q05966   (sequence length 169) 
  GR10_BRANA (Q05966) GLYCINE-RICH RNA-BINDING PROTEIN 10.              
  13-G @every Fifth   position @90    :GGGRGGGGYGGRGGGGYGGGGGGYGDRRGGGGYGS
                                      -GGGGRGGGGYGSGGGGYGGGGGRRDGGGYGGGDGG
                                      -YGGGS

 >SWISS-PROT: P09789   (sequence length 384) 
  GRP1_PETHY (P09789) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1 PRECUR
  12-G @every Fifth   position @72    :GGGAGGGAGGGLGGGGGLGGGGGAGGGGGLGGGGG
                                      -AGGGFGGGAGGGAGGGLGGGGGLGGGGGGGAGGGG
  23-G @every Fifth   position @106   :GAGGGFGGGAGGGAGGGLGGGGGLGGGGGGGAGGG
                                      -GGVGGGAGSGGGFGAGGGVGGGAGAGGGVGGGGGF
                                      -GGGGGGGVGGGSGHGGGFGAGGGVGGGAGGGLGGG
                                      -VGGGGGGGSGGGGGIGGGSGHGGGF
  14-G @every Fifth   position @198   :GVGGGAGGGLGGGVGGGGGGGSGGGGGIGGGSGHG
                                      -GGFGAGGGVGGGVGGGAAGGGGGGGGGGGGGGGGL
                                      -GGGSGHGGGF
  11-G @every Fifth   position @255   :GGGGGGGGGGGGLGGGSGHGGGFGAGGGVGGGAAG
                                      -GVGGGGGFGGGGGGGVGGGSGHGGGFGAGG
  13-G @every Fifth   position @293   :GGGGFGGGGGGGVGGGSGHGGGFGAGGGVGGGAGG
                                      -GLGGGGGAGGGGGIGGGHGGGFGVGVGIGIGVGVG
                                      -AGAGH

 >SWISS-PROT: P10495   (sequence length 252) 
  GRP1_PHAVU (P10495) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.0 PREC
  11-G @every Fifth   position @65    :GGYGEGAGGGEGAGAGYGAAGGGHGGGGGNGGGGG
                                      -GGADGGGYGGGAGKGGGEGYGGGGANGGGY
  11-G @every Fifth   position @86    :GGHGGGGGNGGGGGGGADGGGYGGGAGKGGGEGYG
                                      -GGGANGGGYGGGGGSGGGGGGGAGGAGSGY
  22-G @every Fifth   position @122   :GGANGGGYGGGGGSGGGGGGGAGGAGSGYGGGEGS
                                      -GAGGGYGGANGGGGGGNGGGGGGGSGGAHGGGAAG
                                      -GGEGAGQGAGGGYGGGAAGGGGRGSGGGGGGGYGG
                                      -GGARGSGYGGGGGSGEGGGH

 >SWISS-PROT: Q99069   (sequence length 142) 
  GRP1_SORVU (Q99069) GLYCINE-RICH RNA-BINDING PROTEIN 1 (FRAGMENT).    
  12-G @every Fifth   position @70    :GGGGGGGYGGGRGGGGGYGRRDGGGGGYGGGGGGY
                                      -GGGRGGYGGGGYGGGGGGYGGGSRGGGGYGNSDGN

 >SWISS-PROT: P27484   (sequence length 214) 
  GRP2_NICSY (P27484) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 2 PRECUR
  12-G @every Fifth   position @77    :GAAVQGGRGGGGGGGGRGGGGYGGGSGGYGGGGRG
                                      -GSRGYGGGDGGYGGGGGYGGGSRYGGGGGGYGGGG
  14-G @every Fifth   position @86    :GGGGGGGRGGGGYGGGSGGYGGGGRGGSRGYGGGD
                                      -GGYGGGGGYGGGSRYGGGGGGYGGGGGYGGGGSGG
                                      -GSGCFKCGES

 >SWISS-PROT: P29834   (sequence length 183) 
  GRP2_ORYSA (P29834) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 2 PRECUR
  13-G @every Fifth   position @60    :GGGSGGAAGGGYGRGGGGGGGGGEGGGSGSGYGSG
                                      -QGSGYGAGVGGAGGYGSGGGGGGGQGGGAGGYGQG
                                      -SGYGS
  13-G @every Fifth   position @102   :GVGGAGGYGSGGGGGGGQGGGAGGYGQGSGYGSGY
                                      -GSGAGGAHGGGYGSGGGGGGGGGQGGGSGYGSGSG
                                      -YGSGY

 >SWISS-PROT: P10496   (sequence length 465) 
  GRP2_PHAVU (P10496) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.8 PREC
  16-G @every Fifth   position @128   :GGGYGAGGEHGIGYGGGGGSGAGGGGGYNAGGAQG
                                      -GGYGTGGGAGGGGGGGGDHGGGYGGGQGAGGGAGG
                                      -GYGGGGEHGGGGGGGQGGGA
  18-G @every Fifth   position @368   :GGGAGGGYGTGGEHGGGYGGGQGGGGGYGAGGDHG
                                      -AAGYGGGEGGGGGSGGGYGDGGAHGGGYGGGAGGG
                                      -GGYGAGGAHGGGYGGGGGIGGGHGG

 >SWISS-PROT: Q99070   (sequence length 168) 
  GRP2_SORVU (Q99070) GLYCINE-RICH RNA-BINDING PROTEIN 2.               
  11-G @every Fifth   position @89    :GGGGGGGYGGGGGGYGGREGGGYGGGGGGYGGRRE
                                      -GGGGYGGGGYGGGGGGYGGREGGGGYGGGG
  12-G @every Fifth   position @90    :GGGGGGYGGGGGGYGGREGGGYGGGGGGYGGRREG
                                      -GGGYGGGGYGGGGGGYGGREGGGGYGGGGGYGGNR

 >SWISS-PROT: P10979   (sequence length 157) 
  GRPA_MAIZE (P10979) GLYCINE-RICH RNA-BINDING, ABSCISIC ACID-INDUCIBLE 
  11-G @every Fifth   position @88    :GGGGGGGGYGGGRGGGGYGGGRRDGGYGGGGGYGG
                                      -RREGGGGGYGGGGGYGGRREGGGGGYGGGG

 >SWISS-PROT: P17816   (sequence length 200) 
  GRP_HORVU (P17816) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN PRECURSO 
  11-G @every Fifth   position @46    :GGGHHGHGRGGHGGGGYGGGGGYGGGGGGYPGGGG
                                      -GYGGGGGGYPGHGGEGGGGYGGGGGYPGHG
  14-G @every Fifth   position @48    :GHHGHGRGGHGGGGYGGGGGYGGGGGGYPGGGGGY
                                      -GGGGGGYPGHGGEGGGGYGGGGGYPGHGGEGGGGY
                                      -GGGGGYHGHG
  16-G @every Fifth   position @105   :GYPGHGGEGGGGYGGGGGYHGHGGEGGGGYGGGGG
                                      -YHGHGGEGGGGYGGGGGGYPGHGGGGGHGGGRCKW
                                      -GCCGHGFLHHGCRCCARADE

 >SWISS-PROT: P40274   (sequence length  90) 
  H162_TRYCR (P40274) HISTONE H1.M6.2.                                  
  11-K @every Fifth   position @9     :KKASPKKAAAKKASPKKAAAKKASPKKAAARKTAA
                                      -KKTAKKPAVRKPAAKKRAAPKKKPAAAKKP

 >SWISS-PROT: Q07134   (sequence length 244) 
  H1O_CHITH (Q07134) HISTONE H1, ORPHON.                                
  11-K @every Fifth   position @157   :KKVVKKPAAKKPEAKKATKAAKPATKKVVAKPASK
                                      -KAAAPKPKAAKPAAKKPEAKKATKAAAKKP

 >SWISS-PROT: P11910   (sequence length  88) 
  H82_NEIGO (P11910) OUTER MEMBRANE PROTEIN H.8 PRECURSOR.              
  13-A @every Fifth   position @16    :AACGGEKAAEAPAAEASSTEAPAAEAPAAEAPAAE
                                      -AAAAEAPAAEAPAAEAPAAEAAATEAPAAEAPAAE
  12-A @every Fifth   position @23    :AAEAPAAEASSTEAPAAEAPAAEAPAAEAAAAEAP
                                      -AAEAPAAEAPAAEAAATEAPAAEAPAAEAA
  12-E @every Fifth   position @25    :EAPAAEASSTEAPAAEAPAAEAPAAEAAAAEAPAA
                                      -EAPAAEAPAAEAAATEAPAAEAPAA

 >SWISS-PROT: P04196   (sequence length 525) 
  HRG_HUMAN (P04196) HISTIDINE-RICH GLYCOPROTEIN PRECURSOR (HISTIDINE-P 
  13-H @every Fifth   position @342   :HNNNSSDLHPHKHHSHEQHPHGHHPHAHHPHEHDT
                                      -HRQHPHGHHPHGHHPHGHHPHGHHPHGHHPHCHDF
                                      -QDYGP
  11-H @every Fifth   position @350   :HPHKHHSHEQHPHGHHPHAHHPHEHDTHRQHPHGH
                                      -HPHGHHPHGHHPHGHHPHGHHPHCHDFQDY

 >SWISS-PROT: Q28640   (sequence length 526) 
  HRG_RABIT (Q28640) HISTIDINE-RICH GLYCOPROTEIN PRECURSOR (HISTIDINE-P 
  13-P @every Fifth   position @339   :PPHGHHPHGPPPHGHPPHGPPPRHPPHGPPPHGHP
                                      -PHGPPPHGHPPHGPPPHGHPPHGPPPHGHPPHGHG
                                      -FHDHG
  11-P @every Fifth   position @348   :PPPHGHPPHGPPPRHPPHGPPPHGHPPHGPPPHGH
                                      -PPHGPPPHGHPPHGPPPHGHPPHGHGFHDH
  11-H @every Fifth   position @365   :HGPPPHGHPPHGPPPHGHPPHGPPPHGHPPHGPPP
                                      -HGHPPHGHGFHDHGPCDPPSHKEGPQDLHQ

 >SWISS-PROT: P46593   (sequence length 634) 
  HWP1_CANAL (P46593) HYPHAL WALL PROTEIN 1 (CELL ELONGATION PROTEIN 2).
  15-P @every Fifth   position @103   :PCDYPQQPQEPCDNPPQPDVPCDNPPQPDVPCDNP
                                      -PQPDIPCDNPPQPDIPCDNPPQPDQPDDNPPIPNI
                                      -PTDWIPNIPTDWIPD

 >SWISS-PROT: P28284   (sequence length 825) 
  ICP0_HSV2H (P28284) TRANS-ACTING TRANSCRIPTIONAL PROTEIN ICP0 (VMW118 
  11-A @every Fifth   position @569   :ASAGAAPPSASPSSQAAVAAASSSSASSSSASSSS
                                      -ASSSSASSSSASSSSASSSSASSSAGGAGG

 >SWISS-PROT: Q01042   (sequence length 407) 
  IE68_HSVSA (Q01042) IMMEDIATE-EARLY PROTEIN.                          
  34-E @every Fifth   position @60    :EEQRREEVEEEGEERERRGEEEREGEGGEEGEGRE
                                      -EAEEEEAEEKEAEEEEAEEAEEEAEEEEAEEAEAE
                                      -EEEAEEEEAEEEEAEEAEEEEAEEAEEEAEEEEAE
                                      -EEAEEEAEEAEEAEEEAEEEAEEAEEAEEAEEAEE
                                      -EAEEAEEEAEEAEEEAEEAEEAEEAEEAEEEAEEA
                                      -EEEEEEAGPSTPRLPHYKVV
  15-E @every Fifth   position @97    :EEEEAEEKEAEEEEAEEAEEEAEEEEAEEAEAEEE
                                      -EAEEEEAEEEEAEEAEEEEAEEAEEEAEEEEAEEE
                                      -AEEEAEEAEEAEEEA

 >SWISS-PROT: P55875   (sequence length 1054) 
  IF2_STIAU (P55875) TRANSLATION INITIATION FACTOR IF-2.                
  11-P @every Fifth   position @144   :PAAEAPKATAPVAPEPTVEAPKAAAPVAPEPTVEA
                                      -PKTEAPVAAAPIAEAPTPPARTEVPVTSGR

 >SWISS-PROT: P02537   (sequence length 486) 
  K1C0_XENLA (P02537) KERATIN 3, TYPE I CYTOSKELETAL 51 KD (51 KD CYTOKE
  11-G @every Fifth   position @36    :GGEGDFGGMGGFGACGAGYGGGAGYGGGAGGAGYG
                                      -GGAGGGGAGYGGGFGGGSGAGYGGGFGGGA

 >SWISS-PROT: P13645   (sequence length 593) 
  K1CJ_HUMAN (P13645) KERATIN, TYPE I CYTOSKELETAL 10 (CYTOKERATIN 10) (
  15-G @every Fifth   position @455   :GEGSSGGGGRGGGSFGGGYGGGSSGGGSSGGGYGG
                                      -GHGGSSGGGYGGGSSGGGSSGGGYGGGSSSGGHGG
                                      -GSSSGGHGGSSSGGY
  12-G @every Fifth   position @461   :GGGRGGGSFGGGYGGGSSGGGSSGGGYGGGHGGSS
                                      -GGGYGGGSSGGGSSGGGYGGGSSSGGHGGGSSSGG

 >SWISS-PROT: P02535   (sequence length 569) 
  K1CJ_MOUSE (P02535) KERATIN, TYPE I CYTOSKELETAL 10 (CYTOKERATIN 10) (
  11-G @every Fifth   position @88    :GGSSFGGGYGGSSFGGAGFGGGGSFGGGSFGGGSY
                                      -GGGFGGGGFGGDGGSLLSGNGRVTMQNLND
  12-G @every Fifth   position @458   :GGGGGRRGGSGGGSYGGSSGGGSYGGSSGGGGSYG
                                      -GSSGGGGSYGGGSSGGGSHGGSSGGGYGGGSSSGG
  16-G @every Fifth   position @477   :GGGSYGGSSGGGGSYGGSSGGGGSYGGGSSGGGSH
                                      -GGSSGGGYGGGSSSGGAGGHGGSSGGGYGGGSSSG
                                      -GQGGSGGFKSSGGGDQSSKG

 >SWISS-PROT: P04264   (sequence length 643) 
  K2C1_HUMAN (P04264) KERATIN, TYPE II CYTOSKELETAL 1 (CYTOKERATIN 1) (K
  11-G @every Fifth   position @82    :GGGRGSGFGGGYGGGGFGGGGFGGGGFGGGGIGGG
                                      -GFGGFGSGGGGFGGGGFGGGGYGGGYGPVC
  11-G @every Fifth   position @84    :GRGSGFGGGYGGGGFGGGGFGGGGFGGGGIGGGGF
                                      -GGFGSGGGGFGGGGFGGGGYGGGYGPVCPP
  11-G @every Fifth   position @86    :GSGFGGGYGGGGFGGGGFGGGGFGGGGIGGGGFGG
                                      -FGSGGGGFGGGGFGGGGYGGGYGPVCPPGG
  12-G @every Fifth   position @90    :GGGYGGGGFGGGGFGGGGFGGGGIGGGGFGGFGSG
                                      -GGGFGGGGFGGGGYGGGYGPVCPPGGIQEVTINQS

 >SWISS-PROT: P04104   (sequence length 581) 
  K2C1_MOUSE (P04104) KERATIN, TYPE II CYTOSKELETAL 1 (CYTOKERATIN 1) (6
  11-G @every Fifth   position @1     :GGGGSFCGGFGGGSYGRGGFGGGSYGGGGFGGGSF
                                      -GGGGFGGSGFGGGSGGGGGFGSGGGFGGGR
  13-G @every Fifth   position @3     :GGSFCGGFGGGSYGRGGFGGGSYGGGGFGGGSFGG
                                      -GGFGGSGFGGGSGGGGGFGSGGGFGGGRFGGYGPV
                                      -CSPSG

 >SWISS-PROT: P34099   (sequence length 648) 
  KAPC_DICDI (P34099) CAMP-DEPENDENT PROTEIN KINASE CATALYTIC SUBUNIT (E
  16-Q @every Fifth   position @140   :QQQPQQQQPQQQQPQQQQPQQQQQQQPQQQQQPQQ
                                      -QLQQNNQQQQQQLQQQQLQQQLQQQQQQQQQQQQQ
                                      -QQQKQQKQQQQQQQHLHQDG
  15-Q @every Fifth   position @144   :QQQQPQQQQPQQQQPQQQQQQQPQQQQQPQQQLQQ
                                      -NNQQQQQQLQQQQLQQQLQQQQQQQQQQQQQQQQK
                                      -QQKQQQQQQQHLHQD
  12-Q @every Fifth   position @163   :QQQPQQQQQPQQQLQQNNQQQQQQLQQQQLQQQLQ
                                      -QQQQQQQQQQQQQQQKQQKQQQQQQQHLHQDGIVN

 >SWISS-PROT: P38020   (sequence length 207) 
  KARP_CHLTR (P38020) HISTONE H1-LIKE PROTEIN KARP.                     
  12-K @every Fifth   position @7     :KRSTRKTAARKTVVRKPAAKKTAAKKASVRKVAAK
                                      -KTVARKTVAKKAVAARKPAAKKTAAKKAPVRKVAA

 >SWISS-PROT: P06719   (sequence length 657) 
  KNOB_PLAFN (P06719) KNOB-ASSOCIATED HISTIDINE-RICH PROTEIN PRECURSOR (
  14-T @every Fifth   position @547   :TKEASTSKEATKEASTSKEATKEASTSKEATKEAS
                                      -TSKGATKEASTTEGATKGASTTAGSTTGATTGANA
                                      -VQSKDETADK

 >SWISS-PROT: P08131   (sequence length 181) 
  KR2D_SHEEP (P08131) KERATIN, HIGH-SULFUR MATRIX PROTEIN, B2D.         
  12-Q @every Fifth   position @25    :QPTCCQTSCCQPTSIQTSCCQPTSIQTSCCQPTSI
                                      -QTSCCQPISIQTSCCQPTCLQTSGCETGCGIGGSI

 >SWISS-PROT: P26371   (sequence length 169) 
  KRUC_HUMAN (P26371) KERATIN, ULTRA HIGH-SULFUR MATRIX PROTEIN (UHS KER
  12-C @every Fifth   position @42    :CKPVCCCVPACSCSSCGKRGCGSCGGSKGGCGSCG
                                      -CSQCSCCKPCCCSSGCGSSCCQCSCCKPYCSQCSC

 >SWISS-PROT: P18160   (sequence length 1584) 
  KYK1_DICDI (P18160) NON-RECEPTOR TYROSINE KINASE SPORE LYSIS A (EC 2.7
  12-N @every Fifth   position @450   :NNNNNNNNNNNNNNNNNNNNNNNNNNNNNSNSSNT
                                      -NNNNINNTTNNNNSNSNNNNNNNNSNSNSNSNNNN
  11-N @every Fifth   position @451   :NNNNNNNNNNNNNNNNNNNNNNNNNNNNSNSSNTN
                                      -NNNINNTTNNNNSNSNNNNNNNNSNSNSNS

 >SWISS-PROT: P23490   (sequence length 316) 
  LORI_HUMAN (P23490) LORICRIN.                                         
  12-S @every Fifth   position @91    :SGGGGSSGGGSGCFSSGGGGSGCFSSGGGGSSGGG
                                      -SGCFSSGGGGSSGGGSGCFSSGGGGFSGQAVQCQS

 >SWISS-PROT: P18165   (sequence length 481) 
  LORI_MOUSE (P18165) LORICRIN.                                         
  11-G @every Fifth   position @34    :GSGCGGGSSGGGSSCGGGGGGSYGGGSSCGGGGGS
                                      -GGGVKYSGGGGGSSCGGGYSGGGGGSSCGG
  12-G @every Fifth   position @138   :GGSGGGVKYSGGGGGGGSSCGGGSSGGGGGGSSCG
                                      -GGSGGGGSYCGGSSGGGSSGGCGGGSGGGKYSGGG
  11-G @every Fifth   position @367   :GSSGGGGSCGGGSSGGGGGGGCYSSGGGGSSGGCG
                                      -GGYSGGGGGCGGGSSGGSGGGCGGGSSGGS

 >SWISS-PROT: P15714   (sequence length 255) 
  LP61_EIMTE (P15714) ANTIGEN LPMC-61 (FRAGMENT).                       
  12-Q @every Fifth   position @146   :QQQQQQQWPEQPEQQQQQQWPEQQQQQWSDQNQQQ
                                      -QAQQWQAQQQQQWPQQQQQPQQQQQQQQQQDLGPD

 >SWISS-PROT: P11746   (sequence length 286) 
  MCM1_YEAST (P11746) PHEROMONE RECEPTOR TRANSCRIPTION FACTOR (GRM/PRTF 
  11-Q @every Fifth   position @189   :QPQQQQQQQPQQQMSQQQMSQHPRPQQGIPHPQQS
                                      -QPQQQQQQQQQLQQQQQQQQQQPLTGIHQP

 >SWISS-PROT: P36027   (sequence length 376) 
  MID2_YEAST (P36027) MATING PROCESS PROTEIN MID2 (SERINE-RICH PROTEIN S
  13-S @every Fifth   position @54    :SILSSSMVSSSSADSSSLTSSTSSRSLVSHTSSST
                                      -SIASISFTSFSFSSDSSTSSSSSASSDSSSSSSFS
                                      -ISSTS

 >SWISS-PROT: Q05049   (sequence length 662) 
  MUC1_XENLA (Q05049) INTEGUMENTARY MUCIN C.1 (FIM-C.1) (FRAGMENT).     
  11-T @every Fifth   position @217   :TKAPTTIQIATTTTTPTTTTTTTKATPTTTTTTKA
                                      -TPTTTTTTKATTTTTTPTTTTTTTKATTTP
  14-T @every Fifth   position @229   :TTTPTTTTTTTKATPTTTTTTKATPTTTTTTKATT
                                      -TTTTPTTTTTTTKATTTPTTTTTTTPTTTTTKATT
                                      -TTTTTSGECK
  13-T @every Fifth   position @404   :TTTPTTTTTPTTTTTTKATTTTPTTTTTTPTTTTT
                                      -TTTTTKATTTTPTTTTPTTTTTKATTTTPTTTTTT
                                      -PTTTT
  14-T @every Fifth   position @418   :TTKATTTTPTTTTTTPTTTTTTTTTTKATTTTPTT
                                      -TTPTTTTTKATTTTPTTTTTTPTTTTTKATTTTPT
                                      -TTTTTPTTTT

 >SWISS-PROT: P19706   (sequence length 1147) 
  MYSB_ACACA (P19706) MYOSIN HEAVY CHAIN IB (MYOSIN HEAVY CHAIN IL).    
  18-G @every Fifth   position @989   :GGPGMGRGGPGMGGPGAGRGGPGMGGPGGPGRGGP
                                      -GGPGAGRGGPGGPGAGRGGPGMGGPGGAGRGGPGA
                                      -GRGGPGMGGPGAGRGGPGAGRGAAPAPAPAAPAKP
  11-G @every Fifth   position @1017  :GPGRGGPGGPGAGRGGPGGPGAGRGGPGMGGPGGA
                                      -GRGGPGAGRGGPGMGGPGAGRGGPGAGRGA

 >SWISS-PROT: Q40361   (sequence length  93) 
  N12A_MEDSA (Q40361) EARLY NODULIN 12A PRECURSOR (N-12A) (EARLY NODULIN
  13-P @every Fifth   position @20    :PQGFAEYYLNPAYRPPQTEPPVHKPPHKEPPVHKP
                                      -PHKDPPVNKPPQKEPPVHKPPRKEPPTHRHPPSED

 >SWISS-PROT: P20799   (sequence length 110) 
  N12A_PEA (P20799) EARLY NODULIN 12A PRECURSOR (N-12A).                
  13-P @every Fifth   position @20    :PQGLAQYHLNPVYEPPVNGPPVNKPPQKETPVHKP
                                      -PQKETPVHKPPQKEPPRHKPPQKEPPRHKPPHKKS
                                      -HLHVT

 >SWISS-PROT: Q40339   (sequence length 113) 
  N12B_MEDSA (Q40339) EARLY NODULIN 12B PRECURSOR (N-12B).              
  13-P @every Fifth   position @34    :PPQTKPPVNKPSHKEPPVHKPPHKEPPVNKPRHKE
                                      -PPVHKPPHKDPPVNKPPQKESPVHKPPRKEPPTHK
                                      -HPPAE
  11-P @every Fifth   position @50    :PVHKPPHKEPPVNKPRHKEPPVHKPPHKDPPVNKP
                                      -PQKESPVHKPPRKEPPTHKHPPAED

 >SWISS-PROT: P30365   (sequence length 103) 
  NO12_MEDTR (P30365) EARLY NODULIN 12 PRECURSOR (N-12).                
  13-P @every Fifth   position @30    :PAYRPPQTKPPVNKPSHKEPPVNKPPHKEPPVHKP
                                      -PHKDPPVNKPPQKESPVHKPPRKESPTHRHPPAED

 >SWISS-PROT: Q41701   (sequence length 100) 
  NO12_VICSA (Q41701) EARLY NODULIN 12 PRECURSOR (N-12).                
  11-P @every Fifth   position @20    :PQGLAQYHLNPVYEAPVNGPPVNKPPQKETPVQKP
                                      -PQKEPPVHKSPRNEPPRHKPPHKKSHLHVT

 >SWISS-PROT: Q02937   (sequence length 365) 
  OMLA_ACTPL (Q02937) OUTER MEMBRANE LIPOPROTEIN A PRECURSOR.           
  11-P @every Fifth   position @56    :PQADNSKAEEPKEMAPQVDSPKAEEPKNMAPQMGN
                                      -PKLNDPQVMAPKMDNPQKDAPKGEELSKDK

 >SWISS-PROT: P12348   (sequence length 1241) 
  PER_DROPS (P12348) PERIOD CIRCADIAN PROTEIN.                          
  12-A @every Fifth   position @687   :ANTSAAFNIAANTSAADNFGADTSAADTSGADTSA
                                      -ADNYGPGNFGAENSCADNSGAENSCADNSGVDNSR
  18-N @every Fifth   position @724   :NYGPGNFGAENSCADNSGAENSCADNSGVDNSRPD
                                      -NSGADNSAADNFGPDNSGADNSGPDNTGPDNSGAE
                                      -NSRAENSRADNSRPDHPRPDISGASNSRPDKTGPD

 >SWISS-PROT: P51524   (sequence length 212) 
  PF11_PIG (P51524) PROPHENIN-1 PRECURSOR (PF-1) (C6) (FRAGMENT).       
  17-P @every Fifth   position @120   :PFLRRPRLRRQAFPPPNVPGPRFPPPNFPGPRFPP
                                      -PNFPGPRFPPPNFPGPRFPPPNFPGPPFPPPIFPG
                                      -PWFPPPPPFRPPPFGPPRFP
  12-F @every Fifth   position @132   :FPPPNVPGPRFPPPNFPGPRFPPPNFPGPRFPPPN
                                      -FPGPRFPPPNFPGPPFPPPIFPGPWFPPPPPFRPP
  13-P @every Fifth   position @133   :PPPNVPGPRFPPPNFPGPRFPPPNFPGPRFPPPNF
                                      -PGPRFPPPNFPGPPFPPPIFPGPWFPPPPPFRPPP
                                      -FGPPR

 >SWISS-PROT: P51525   (sequence length 228) 
  PF12_PIG (P51525) PROPHENIN-2 PRECURSOR (PF-2) (PR-2) (C12) (PROPHEN  
  17-P @every Fifth   position @136   :PFLRRPRLRRQAFPPPNVPGPRFPPPNVPGPRFPP
                                      -PNFPGPRFPPPNFPGPRFPPPNFPGPPFPPPIFPG
                                      -PWFPPPPPFRPPPFGPPRFP
  13-P @every Fifth   position @149   :PPPNVPGPRFPPPNVPGPRFPPPNFPGPRFPPPNF
                                      -PGPRFPPPNFPGPPFPPPIFPGPWFPPPPPFRPPP
                                      -FGPPR

 >SWISS-PROT: P06600   (sequence length 211) 
  PR33_DAUCA (P06600) PROLINE RICH 33 KD EXTENSIN-RELATED PROTEIN PRECUR
  11-P @every Fifth   position @11    :PSLADFHSHPPIHKPPVYTPPVHKPPIHKPPVYTP
                                      -PVHKPPVYTPPVHKPPSEYKPPVEATNSVT

 >SWISS-PROT: P29617   (sequence length 1403) 
  PRO_DROME (P29617) PROTEIN PROSPERO.                                  
  11-Q @every Fifth   position @717   :QQQQQQQQQQQQQQQQQQEQQRRFEQEQQEQQRRK
                                      -EEQQQQIQRQQQHLQQLQQQQMEQQHVATA


 >SWISS-PROT: P50493   (sequence length 1153) 
  PVDB_PLAKN (P50493) DUFFY RECEPTOR, BETA FORM PRECURSOR (ERYTHROCYTE B
  12-Q @every Fifth   position @886   :QTSSDQTSSDQTSSNQTSSDQTSSNQTSSDQTSSD
                                      -QISSDQTSSDQTSSNQTSSDQTIDTEEHHRDNVRN
  11-T @every Fifth   position @887   :TSSDQTSSDQTSSNQTSSDQTSSNQTSSDQTSSDQ
                                      -ISSDQTSSDQTSSNQTSSDQTIDTEEHHRD
  11-S @every Fifth   position @888   :SSDQTSSDQTSSNQTSSDQTSSNQTSSDQTSSDQI
                                      -SSDQTSSDQTSSNQTSSDQTIDTEEHHRDN
  11-S @every Fifth   position @889   :SDQTSSDQTSSNQTSSDQTSSNQTSSDQTSSDQIS
                                      -SDQTSSDQTSSNQTSSDQTIDTEEHHRDNV

 >SWISS-PROT: P51968   (sequence length 373) 
  RO31_XENLA (P51968) HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN A3 HOMOLOG
  11-G @every Fifth   position @270   :GNYGGGPGYGGRGYGGSPGYGNQGGGYGGGGGGYD
                                      -GYNESGNFGGGNYNDFGNYGGQQQSNYGPM

 >SWISS-PROT: P51992   (sequence length 385) 
  RO32_XENLA (P51992) HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN A3 HOMOLOG
  13-G @every Fifth   position @219   :GGNYGGGDGGGGGNFGRGGGFGNRGGYGGGGGRGG
                                      -GYGGGGDGYNGFGGDGGNYGGGPGYGGRGYGGSPG
                                      -YGNQG

 >SWISS-PROT: P21522   (sequence length 342) 
  ROA1_SCHAM (P21522) HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN A1, A2/B1 
  12-G @every Fifth   position @187   :GDAPGGRGGGGRGGVGGGAGGGWGGGRGDWGGSAG
                                      -GGGGGGWGGADPWENGRGGGGDRWGGGGGGMGGGD

 >SWISS-PROT: P27625   (sequence length 2339) 
  RPC1_PLAFA (P27625) DNA-DIRECTED RNA POLYMERASE III LARGEST SUBUNIT (E
  14-N @every Fifth   position @1094  :NDSNMNNINNNDSNMNSIHNNNSNMNNIHNNDSNR
                                      -SIIHNNDSNMNSIHNNDSNMNSIHNNNSNMNNIHN
                                      -NDSNRSIIHN

 >SWISS-PROT: P16960   (sequence length 5035) 
  RYNR_PIG (P16960) RYANODINE RECEPTOR, SKELETAL MUSCLE (SKELETAL MUSC  
  11-E @every Fifth   position @1870  :EVFTEEEEEEEEEEEEEEEDEEEKEEDEEEEAREK
                                      -EDEEKEEEETAEGEKEEYLEEGLLQMKLPE

 >SWISS-PROT: Q14242   (sequence length 412) 
  SEPL_HUMAN (Q14242) P-SELECTIN GLYCOPROTEIN LIGAND 1 PRECURSOR (PSGL-1
  14-T @every Fifth   position @114   :TDSAAMEIQTTQPAATEAQTTQPVPTEAQTTPLAA
                                      -TEAQTTRLTATEAQTTPLAATEAQTTPPAATEAQT
                                      -TQPTGLEAQT

 >SWISS-PROT: P13730   (sequence length 328) 
  SGS3_DROER (P13730) SALIVARY GLUE PROTEIN SGS-3 PRECURSOR.            
  19-T @every Fifth   position @107   :TTKRATTRRTTVRATTKRATTRRTTTKRAPTRRTT
                                      -TKRATTRRNPTRRTTTRRAPTKRATTKRATTRRNP
                                      -TKRKTTRRTTVRATKTTKRATTKRAPTKRATTKRA
                                      -PTKRV
  15-R @every Fifth   position @114   :RRTTVRATTKRATTRRTTTKRAPTRRTTTKRATTR
                                      -RNPTRRTTTRRAPTKRATTKRATTRRNPTKRKTTR
                                      -RTTVRATKTTKRATT
  16-T @every Fifth   position @193   :TKRATTKRAPTKRATTKRAPTKRVTTKRAPTKRAT
                                      -TKRAPTKRATTKRAPTKRATTKRAPTKRATTKRAP
                                      -TKRATTKRATARPTSKPCGC
  16-K @every Fifth   position @194   :KRATTKRAPTKRATTKRAPTKRVTTKRAPTKRATT
                                      -KRAPTKRATTKRAPTKRATTKRAPTKRATTKRAPT
                                      -KRATTKRATARPTSKPCGCK
  16-R @every Fifth   position @195   :RATTKRAPTKRATTKRAPTKRVTTKRAPTKRATTK
                                      -RAPTKRATTKRAPTKRATTKRAPTKRATTKRAPTK
                                      -RATTKRATARPTSKPCGCKP
  15-A @every Fifth   position @196   :ATTKRAPTKRATTKRAPTKRVTTKRAPTKRATTKR
                                      -APTKRATTKRAPTKRATTKRAPTKRATTKRAPTKR
                                      -ATTKRATARPTSKPC

 >SWISS-PROT: P02840   (sequence length 307) 
  SGS3_DROME (P02840) SALIVARY GLUE PROTEIN SGS-3 PRECURSOR.            
  14-T @every Fifth   position @49    :TTTTTTCAPPTQQSTTQPPCTTSKPTTPKQTTTQL
                                      -PCTTPTTTKATTTKPTTTKATTTKATTTKPTTTKQ
                                      -TTTQLPCTTP
  35-T @every Fifth   position @81    :TQLPCTTPTTTKATTTKPTTTKATTTKATTTKPTT
                                      -TKQTTTQLPCTTPTTTKQTTTQLPCTTPTTTKPTT
                                      -TKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTT
                                      -TKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTT
                                      -TKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTT
                                      -PKPCGCKSCGPGGEPCNGCAKRDAL
  25-T @every Fifth   position @129   :TTTKQTTTQLPCTTPTTTKPTTTKPTTTKPTTTKP
                                      -TTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKP
                                      -TTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKP
                                      -TTTKPTTTKPTTTKPTTTKPTTPKPCGCKSCGPGG
                                      -EPCNG
  25-T @every Fifth   position @130   :TTKQTTTQLPCTTPTTTKPTTTKPTTTKPTTTKPT
                                      -TTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPT
                                      -TTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPT
                                      -TTKPTTTKPTTTKPTTTKPTTPKPCGCKSCGPGGE
                                      -PCNGC
  24-P @every Fifth   position @143   :PTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTK
                                      -PTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTK
                                      -PTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTTK
                                      -PTTTKPTTPKPCGCKSCGPGGEPCNGCAKR
  24-K @every Fifth   position @147   :KPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTT
                                      -KPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTT
                                      -KPTTTKPTTTKPTTTKPTTTKPTTTKPTTTKPTTT
                                      -KPTTPKPCGCKSCGPGGEPCNGCAKRDALC

 >SWISS-PROT: P13728   (sequence length 263) 
  SGS3_DROYA (P13728) SALIVARY GLUE PROTEIN SGS-3 PRECURSOR.            
  15-T @every Fifth   position @100   :TTTTTTTRRPTTRSTTTRHTTTTTTTTRRPTTTTT
                                      -TTRRPTTTTTTTRRPTTTTTTTRLPTTRSTTTRHT
                                      -TKSTTSKRPTHETTT
  14-T @every Fifth   position @101   :TTTTTTRRPTTRSTTTRHTTTTTTTTRRPTTTTTT
                                      -TRRPTTTTTTTRRPTTTTTTTRLPTTRSTTTRHTT
                                      -KSTTSKRPTH

 >SWISS-PROT: P18480   (sequence length 905) 
  SNF5_YEAST (P18480) TRANSCRIPTION REGULATORY PROTEIN SNF5 (SWI/SNF COM
  12-Q @every Fifth   position @204   :QRQQQQQFRHHVQIQQQQQKQQQQQQQHQQQQQQQ
                                      -QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQGQIPQ
  13-Q @every Fifth   position @210   :QFRHHVQIQQQQQKQQQQQQQHQQQQQQQQQQQQQ
                                      -QQQQQQQQQQQQQQQQQQQQQQQQGQIPQSQQVPQ
                                      -VRSMS
  11-Q @every Fifth   position @218   :QQQQQKQQQQQQQHQQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQQQQQQQQQQQQQGQIPQSQQVPQVRS

 >SWISS-PROT: P37963   (sequence length 575) 
  SP6D_BACSU (P37963) STAGE VI SPORULATION PROTEIN D.                   
  12-E @every Fifth   position @206   :ETEKAESEPPESVASEPEAREDVKEEEESEELAVP
                                      -ETEVRAESETEESEPEPDPSEIEIQEIVKAKKETA

 >SWISS-PROT: P14328   (sequence length 600) 
  SP96_DICDI (P14328) SPORE COAT PROTEIN SP96.                          
  13-S @every Fifth   position @470   :SSSAASSSPSSSAASSSPSSSASSSSSPSSSASSS
                                      -SAPSSSASSSSAPSSSASSSSASSSSASSAATTAA
                                      -TTIAT
  11-S @every Fifth   position @479   :SSSAASSSPSSSASSSSSPSSSASSSSAPSSSASS
                                      -SSAPSSSASSSSASSSSASSAATTAATTIA

 >SWISS-PROT: P19837   (sequence length 747) 
  SPD1_NEPCL (P19837) SPIDROIN 1 (DRAGLINE SILK FIBROIN 1) (FRAGMENT).  
  11-G @every Fifth   position @53    :GAGQGGYGGLGSQGAGRGGQGAGAAAAAAGGAGQG
                                      -GYGGLGSQGAGRGGLGGQGAGAAAAAAAGG
  11-G @every Fifth   position @381   :GAGQGGYGGLGSQGAGRGGQGAGAAAAAAGGAGQR
                                      -GYGGLGNQGAGRGGLGGQGAGAAAAAAAGG
  13-G @every Fifth   position @472   :GAGQGGYGGLGSQGAGRGGQGAGAAAAAAVGAGQE
                                      -GIRGQGAGQGGYGGLGSQGSGRGGLGGQGAGAAAA
                                      -AAGGA
  13-G @every Fifth   position @569   :GVRQGGYGGLGSQGAGRGGQGAGAAAAAAGGAGQG
                                      -GYGGLGGQGVGRGGLGGQGAGAAAAGGAGQGGYGG
                                      -VGSGA

 >SWISS-PROT: P46804   (sequence length 627) 
  SPD2_NEPCL (P46804) SPIDROIN 2 (DRAGLINE SILK FIBROIN 2) (FRAGMENT).  
  12-G @every Fifth   position @320   :GQQGLGGYGPGQQGPGGYGPGQQGPGGYGPGSASA
                                      -AAAAAGPGQQGPGGYGPGQQGPSGPGSASAAAAAA

 >SWISS-PROT: P21997   (sequence length 485) 
  SSGP_VOLCA (P21997) SULFATED SURFACE GLYCOPROTEIN 185 (SSG 185).      
  14-P @every Fifth   position @230   :PPSPQPTASSRPPSPPPSPRPPSPPPPSPSPPPPP
                                      -PPPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSP
                                      -PRKPPSPSPP
  12-P @every Fifth   position @231   :PSPQPTASSRPPSPPPSPRPPSPPPPSPSPPPPPP
                                      -PPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSPP
  13-P @every Fifth   position @248   :PRPPSPPPPSPSPPPPPPPPPPPPPPPPPSPPPPP
                                      -PPPPPPPPPPPPPSPSPPRKPPSPSPPVPPPPSPP
                                      -SVLPA
  12-P @every Fifth   position @254   :PPPSPSPPPPPPPPPPPPPPPPPSPPPPPPPPPPP
                                      -PPPPPPPSPSPPRKPPSPSPPVPPPPSPPSVLPAA

 >SWISS-PROT: P03186   (sequence length 3149) 
  TEGU_EBV (P03186) LARGE TEGUMENT PROTEIN.                             
  11-A @every Fifth   position @321   :ARYSPAKTNSPPSSPASAAPASAAPASAAPASAAP
                                      -ASAAPASAAPASAAPASAAPASSPPLFIPI
  11-P @every Fifth   position @325   :PAKTNSPPSSPASAAPASAAPASAAPASAAPASAA
                                      -PASAAPASAAPASAAPASSPPLFIPIPGLG

 >SWISS-PROT: P29720   (sequence length 384) 
  TMPB_TREPH (P29720) TREPONEMAL MEMBRANE PROTEIN B PRECURSOR (ANTIGEN T
  16-K @every Fifth   position @151   :KAAADKAAAEKAAKEKAAREKSAKDKAAKEKAAKE
                                      -KAAKDKAAKEKAAKEKAAKDKAAKEKAAKEKAARE
                                      -MAAKEKAAKDKAAKEEAARK
  19-A @every Fifth   position @152   :AAADKAAAEKAAKEKAAREKSAKDKAAKEKAAKEK
                                      -AAKDKAAKEKAAKEKAAKDKAAKEKAAKEKAAREM
                                      -AAKEKAAKDKAAKEEAARKAAEEAAARKAAEEAAA
                                      -RKAAE
  18-A @every Fifth   position @153   :AADKAAAEKAAKEKAAREKSAKDKAAKEKAAKEKA
                                      -AKDKAAKEKAAKEKAAKDKAAKEKAAKEKAAREMA
                                      -AKEKAAKDKAAKEEAARKAAEEAAARKAAEEAAAR
  12-K @every Fifth   position @174   :KDKAAKEKAAKEKAAKDKAAKEKAAKEKAAKDKAA
                                      -KEKAAKEKAAREMAAKEKAAKDKAAKEEAARKAAE

 >SWISS-PROT: P19934   (sequence length 421) 
  TOLA_ECOLI (P19934) TOLA PROTEIN.                                     
  11-A @every Fifth   position @240   :AEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKK
                                      -AAAAKAAAEKAAAAKAAAEADDIFGELSSG

 >SWISS-PROT: Q00130   (sequence length 670) 
  VG50_HSVI1 (Q00130) HYPOTHETICAL GENE 50 PROTEIN.                     
  13-T @every Fifth   position @355   :TVVTTTPAMPTGATDTVVTTTPAMPTGATDTVVTT
                                      -TPAMPTGATDTVVTTTPAKPAGANGTVVTTTPAMP
                                      -AGAND

 >SWISS-PROT: P28968   (sequence length 797) 
  VGLX_HSVEB (P28968) GLYCOPROTEIN X PRECURSOR.                         
  15-T @every Fifth   position @146   :TTTATATATSTPTTTTPTSTTTTTATTTVPTTAST
                                      -TTDTTTAATTTAATTTAATTTAATTTAATTTAATT
                                      -TAATTTAATTSSATT
  19-T @every Fifth   position @180   :TTTDTTTAATTTAATTTAATTTAATTTAATTTAAT
                                      -TTAATTTAATTSSATTAATTTAATTTAATTTAATT
                                      -TAATTTAATTTGSPTSGSTSTTGASTSTPSASTAT
                                      -SATPT
  17-T @every Fifth   position @184   :TTTAATTTAATTTAATTTAATTTAATTTAATTTAA
                                      -TTTAATTSSATTAATTTAATTTAATTTAATTTAAT
                                      -TTAATTTGSPTSGSTSTTGASTSTPSASTA
  14-A @every Fifth   position @187   :AATTTAATTTAATTTAATTTAATTTAATTTAATTT
                                      -AATTSSATTAATTTAATTTAATTTAATTTAATTTA
                                      -ATTTGSPTSG

 >SWISS-PROT: Q10778   (sequence length 678) 
  Y04H_MYCTU (Q10778) HYPOTHETICAL PROTEIN RV1547C PRECURSOR.           
  12-G @every Fifth   position @193   :GNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNL
                                      -GNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNT
  34-N @every Fifth   position @201   :NLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDL
                                      -NLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSG
                                      -NIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNL
                                      -NVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSG
                                      -NLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDN
                                      -QIGFGALNSGSGNLGFGNSG
  24-G @every Fifth   position @263   :GTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNF
                                      -GGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNT
                                      -GDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNI
                                      -GFGLTGDNQIGFGALNSGSGNLGFGNSGNG
  11-G @every Fifth   position @434   :GFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNS
                                      -GTTNTGWFNSGTTNTGIGNSGGNLVTGSMG
  12-N @every Fifth   position @491   :NLVTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNV
                                      -NTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNS
  11-G @every Fifth   position @498   :GLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNS
                                      -GNINTGFFNSGDLNTGLFNSVNQPVQNSGW

 >SWISS-PROT: Q93074   (sequence length 2124) 
  Y192_HUMAN (Q93074) HYPOTHETICAL PROTEIN KIAA0192 (FRAGMENT).         
  14-Q @every Fifth   position @1998  :QQQQQQQQQQQQQQQQQQQQQQQQQQYHIRQQQQQ
                                      -QILRQQQQQQQQQQQQQQQQQQQQQQQQQQHQQQQ
                                      -QQQAAPPQPQ
  13-Q @every Fifth   position @2002  :QQQQQQQQQQQQQQQQQQQQQQYHIRQQQQQQILR
                                      -QQQQQQQQQQQQQQQQQQQQQQQQQQHQQQQQQQA
                                      -APPQP
  11-Q @every Fifth   position @2040  :QQQQQQQQQQQQQQQQQQQQQQQHQQQQQQQAAPP
                                      -QPQPQSQPQFQRQGLQQTQQQQQTAALVRQ

 >SWISS-PROT: P39712   (sequence length 1322) 
  YAG3_YEAST (P39712) HYPOTHETICAL 138.1 KD PROTEIN IN FLO9-GDH3 INTERGE
  14-S @every Fifth   position @894   :SSSVISSSDTSSLVISSSVTSSLVTSSPVISSSFI
                                      -SSPVISSTTTSASILSESSKSSVIPTSSSTSGSSE
                                      -SETGSASSAS

 >SWISS-PROT: P38190   (sequence length 124) 
  YBF3_YEAST (P38190) VERY HYPOTHETICAL 13.2 KD PROTEIN IN PTC3-SAS3 INT
  12-S @every Fifth   position @41    :SFSSCSCPFLFPSSSSSLSSSYVSSSSSFSSDICS
                                      -SSMSSSRVKSSSSSSSSLAFSPTYNSVSTSFSTSS

 >SWISS-PROT: P38216   (sequence length 128) 
  YBM6_YEAST (P38216) HYPOTHETICAL 14.6 KD PROTEIN IN TTP1-KAP104 INTERG
  12-Q @every Fifth   position @44    :QQYYQQQQQHPGYYNQQGYNQQGYNQQGYNQQGYN
                                      -QQGYNQQGYNQQGHQQPVYVQQQPPQRGNEGCLAA

 >SWISS-PROT: P53214   (sequence length 551) 
  YG1F_YEAST (P53214) HYPOTHETICAL 57.5 KD PROTEIN IN VMA7-RPS25A INTERG
  15-S @every Fifth   position @157   :STVASSTLSTSSSLVISTSSSTFTFSSESSSSLIS
                                      -SSISTSVSTSSVYVPSSSTSSPPSSSSELTSSSYS
                                      -SSSSSSTLFSYSSSF
  13-S @every Fifth   position @224   :SYSSSSSSSTLFSYSSSFSSSSSSSSSSSSSSSSS
                                      -SSSSSSYFTLSTSSSSSIYSSSSYPSFSSSSSSNP
                                      -TSSIT

 >SWISS-PROT: P42611   (sequence length 517) 
  YHS6_MYCTU (P42611) HYPOTHETICAL 50.6 KD PROTEIN IN HSP65 3'REGION.   
  17-G @every Fifth   position @211   :GQINLGFGNTGSGNIGNNNIGNNNIGNNNIGSGNT
                                      -GTGNIGSGNTGSGNLGLGNLGDGNIGFGNTGSGNI
                                      -GFGITGDHQMGFGGFNSGSGNIGFGNSGTG
  14-N @every Fifth   position @214   :NLGFGNTGSGNIGNNNIGNNNIGNNNIGSGNTGTG
                                      -NIGSGNTGSGNLGLGNLGDGNIGFGNTGSGNIGFG
                                      -ITGDHQMGFG
  22-G @every Fifth   position @243   :GNTGTGNIGSGNTGSGNLGLGNLGDGNIGFGNTGS
                                      -GNIGFGITGDHQMGFGGFNSGSGNIGFGNSGTGNV
                                      -GLFNSGSGNIGIGNSGSLNSGIGTSGTINAGLGSA
                                      -GSLNTSFWNAGMQNAALGSA

 >SWISS-PROT: P35732   (sequence length 738) 
  YKF4_YEAST (P35732) HYPOTHETICAL 84.0 KD PROTEIN IN NUP120-CSE4 INTERG
  12-Q @every Fifth   position @410   :QPQQPQQPQQQLQQQQQQQQQPVQAQAQAQEEQLS
                                      -QNYYTQQQQQQYAQQQHQLQQQYLSQQQQYAQQQQ

 >SWISS-PROT: P21260   (sequence length 141) 
  YPRO_OWEFU (P21260) HYPOTHETICAL PROLINE-RICH PROTEIN (FRAGMENT).     
  11-P @every Fifth   position @13    :PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
                                      -PPPPPPPPPPPRRARIHHNIPLFLRFFKKS

 >SWISS-PROT: Q09950   (sequence length 482) 
  YSR2_CAEEL (Q09950) HYPOTHETICAL 54.6 KD PROTEIN F59B10.2 IN CHROMOSOM
  11-S @every Fifth   position @214   :SSFESSSDSSSTSESSTSSESSSSASESESESKSE
                                      -SQVSSSKTSTSKASSSKAYGSDFESEKSSS

 >SWISS-PROT: Q10940   (sequence length 112) 
  YWS4_CAEEL (Q10940) HYPOTHETICAL 13.1 KD PROTEIN B0310.4 IN CHROMOSOME
  17-R @every Fifth   position @7     :RGESHRGETHRGETHRGETHRGETHRGKTHRGETH
                                      -RGETHRGETHRGETHRGETHRGKTHRGETHRGETH
                                      -RGETRRGETHRGKTQNFGGKFKFSEKNILA
  17-G @every Fifth   position @8     :GESHRGETHRGETHRGETHRGETHRGKTHRGETHR
                                      -GETHRGETHRGETHRGETHRGKTHRGETHRGETHR
                                      -GETRRGETHRGKTQNFGGKFKFSEKNILAG
  15-H @every Fifth   position @11    :HRGETHRGETHRGETHRGETHRGKTHRGETHRGET
                                      -HRGETHRGETHRGETHRGKTHRGETHRGETHRGET
                                      -RRGETHRGKTQNFGG
  16-T @every Fifth   position @15    :THRGETHRGETHRGETHRGKTHRGETHRGETHRGE
                                      -THRGETHRGETHRGKTHRGETHRGETHRGETRRGE
                                      -THRGKTQNFGGKFKFSEKNI

 >SWISS-PROT: Q10540   (sequence length 443) 
  YZ06_MYCTU (Q10540) HYPOTHETICAL 43.6 KD PROTEIN CY31.06C.            
  25-G @every Fifth   position @213   :GIGNIGNNNVGSGNTGDYNFGIGNIGNANLGNGNI
                                      -GNANLGSGNAGFFNFGNGNDGNTNFGSGNAGFLNI
                                      -GSGNEGSGNLGFGNAGDDNTGWGNSGDTNTGGFNS
                                      -GDLNTGIGSPVTQGVANSGFGNTGTGHSGFFNSGN
                                      -SGSGF
  22-N @every Fifth   position @216   :NIGNNNVGSGNTGDYNFGIGNIGNANLGNGNIGNA
                                      -NLGSGNAGFFNFGNGNDGNTNFGSGNAGFLNIGSG
                                      -NEGSGNLGFGNAGDDNTGWGNSGDTNTGGFNSGDL
                                      -NTGIGSPVTQGVANSGFGNT

 ---------------------------------------------------------------------------
 Total entries in the DataBase...     80000 
 Total amino acid recidues.......     29085965 
 Total repeats detected..........     108
 Minimum repeating units.........     12  
 Maximum mismatch ...............     10% 
 ---------------------------------------------------------------------------

(c) Division of Biochemical Sciences, National Chemical Laboratory, Pune 411008, INDIA