Amino Acids Repeated Periodically at every Third Position
in Proteins from SWISS-PROT DataBase Release: 38 (July 1999) Total Entries: 80,000

 --------------------------------------------------------------------------- 

 >SWISS-PROT: P22620   (sequence length 743) 
  ABRA_PLAFC (P22620) 101 KD MALARIA ANTIGEN (P101) (ACIDIC BASIC REPEAT
  16-D @every Third   position @228   :DEDDVNDEEDTNDDEDTNDEEDTNDDEDTNDDEDTN
                                      -DEEDTNDEEDHENNNATA

 >SWISS-PROT: P23745   (sequence length 600) 
  ABRA_PLAFG (P23745) 101 KD MALARIA ANTIGEN (P101) (ACIDIC BASIC REPEAT
  16-D @every Third   position @145   :DEDDVNDEEDTNDDEDTNDEEDTNDDEDTNDDEDTN
                                      -DEEDTNDEEDHENNNATA

 >SWISS-PROT: Q15848   (sequence length 244) 
  ACR3_HUMAN (Q15848) 30 KD ADIPOCYTE COMPLEMENT-RELATED PROTEIN PRECURS
  22-G @every Third   position @42    :GIPGHPGHNGAPGRDGRDGTPGEKGEKGDPGLIGPK
                                      -GDIGETGVPGAEGPRGFPGIQGRKGEPGEGAYVYRS
                                      -AFS

 >SWISS-PROT: Q60994   (sequence length 247) 
  ACR3_MOUSE (Q60994) 30 KD ADIPOCYTE COMPLEMENT-RELATED PROTEIN PRECURS
  22-G @every Third   position @45    :GIPGHPGHNGTPGRDGRDGTPGEKGEKGDAGLLGPK
                                      -GETGDVGMTGAEGPRGFPGTPGRKGEPGEAAYMYRS
                                      -AFS

 >SWISS-PROT: Q03637   (sequence length 471) 
  ACTP_TORMA (Q03637) ACETYLCHOLINESTERASE COLLAGENIC TAIL PEPTIDE PRECU
  55-G @every Third   position @118   :GPPGPSGPQGPQGIQGIMGPKGEIGEIGRPGRKGRP
                                      -GVRGPRGMPGSPCSPGPIGPRGEKGDIGLTGLPGAR
                                      -GPMGPKGLTGQKGEKGIIGEKGQQGIKGEMGVMGLP
                                      -GMLGQKGEMGPKGVSGAPGHRGPVGRPGKRGKTGLK
                                      -GDIGPPGIMGPSGPPGPSGLPVMSGSGHLMVGPKGE
                                      -RGLPGP

 >SWISS-PROT: P42568   (sequence length 568) 
  AF9_HUMAN (P42568) AF-9 PROTEIN.                                      
  15-S @every Third   position @145   :SIHTSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
                                      -SSSSSSSSSSTSFSK
  15-S @every Third   position @149   :SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
                                      -SSSSSSTSFSKPHKL
  15-S @every Third   position @150   :SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
                                      -SSSSSTSFSKPHKLM

 >SWISS-PROT: Q28462   (sequence length 202) 
  AMEL_MONDO (Q28462) AMELOGENIN.                                       
  16-Q @every Third   position @106   :QPGQQPYQPQPAQQPQPHQPIQPIQPIQPIQPMQPM
                                      -QPMQPMQPMQPMQPQTPV
  14-P @every Third   position @116   :PAQQPQPHQPIQPIQPIQPIQPMQPMQPMQPMQPMQ
                                      -PMQPQTPVHAVR

 >SWISS-PROT: P02817   (sequence length 213) 
  AMEX_BOVIN (P02817) AMELOGENIN, CLASS I PRECURSOR.                    
  21-P @every Third   position @137   :PHQPLQPHQPLQPMQPMQPLQPLQPLQPQPPVHPIQ
                                      -PLPPQPPLPPIFPMQPLPPMLPDLPLEAWPATDKTK

 >SWISS-PROT: P27951   (sequence length 1164) 
  BAG_STRAG (P27951) IGA FC RECEPTOR PRECURSOR (BETA ANTIGEN) (B ANTIGE 
  40-P @every Third   position @827   :PETPDTPKIPELPQAPDTPQAPDTPHVPESPKAPEA
                                      -PRVPESPKTPEAPHVPESPKAPEAPRVPESPKTPEA
                                      -PHVPESPKTPEAPKIPEPPKTPDVPKLPDVPKLPDV
                                      -PKLPDAPKLPDGLNKVGQAVFTSTDGN

 >SWISS-PROT: P02851   (sequence length 210) 
  BAR2_CHITE (P02851) BALBIANI RING PROTEIN 2 (GIANT SECRETORY PROTEIN I
  16-K @every Third   position @61    :KPSKPSKHSKPSKHSKPSKHSKPSKHSKPSKHSKPS
                                      -KHSKPEKCGSAMKRTEAA
  14-K @every Third   position @135   :KPSKPSKHSKPSKHSKPSKHSKPSKHSKPSKHSKPE
                                      -KCGSAMKRTEAA

 >SWISS-PROT: P17208   (sequence length 421) 
  BR3A_MOUSE (P17208) BRAIN-SPECIFIC HOMEOBOX/POU DOMAIN PROTEIN 3A (BRN
  14-G @every Third   position @139   :GGGGAHDGPGGGGGPGGGGGPGGGGPGGGGGGGGPG
                                      -GGGGAPGGGLLG

 >SWISS-PROT: P02745   (sequence length 245) 
  C1QA_HUMAN (P02745) COMPLEMENT C1Q SUBCOMPONENT, A CHAIN PRECURSOR.   
  16-G @every Third   position @62    :GIQGLKGDQGEPGPSGNPGKVGYPGPSGPLGARGIP
                                      -GIKGTKGSPGNIKDQPRP

 >SWISS-PROT: P98086   (sequence length 245) 
  C1QA_MOUSE (P98086) COMPLEMENT C1Q SUBCOMPONENT, A CHAIN PRECURSOR.   
  16-G @every Third   position @62    :GIRGFKGDPGESGPPGKPGNVGLPGPSGPLGDSGPQ
                                      -GLKGVKGNPGNIRDQPRP

 >SWISS-PROT: P02746   (sequence length 251) 
  C1QB_HUMAN (P02746) COMPLEMENT C1Q SUBCOMPONENT, B CHAIN PRECURSOR.   
  27-G @every Third   position @31    :GPPAIPGIPGIPGTPGPDGQPGTPGIKGEKGLPGLA
                                      -GDHGEFGEKGDPGIPGNPGKVGPKGPMGPKGGPGAP
                                      -GAPGPKGESGDYKATQKIAFS

 >SWISS-PROT: P14106   (sequence length 253) 
  C1QB_MOUSE (P14106) COMPLEMENT C1Q SUBCOMPONENT, B CHAIN PRECURSOR.   
  28-G @every Third   position @31    :GPPGIPGIPGVPGVPGSDGQPGTPGIKGEKGLPGLA
                                      -GDLGEFGEKGDPGIPGTPGKVGPKGPVGPKGTPGPS
                                      -GPRGPKGDSGDYRATQKVAFSALR

 >SWISS-PROT: P31721   (sequence length 253) 
  C1QB_RAT (P31721) COMPLEMENT C1Q SUBCOMPONENT, B CHAIN PRECURSOR.     
  28-G @every Third   position @31    :GSPGIPGVPGIPGVPGSDGKPGTPGIKGEKGLPGLA
                                      -GDHGELGEKGDAGIPGIPGKVGPKGPVGPKGAPGPP
                                      -GPRGPKGGSGDYKATQKVAFSALR

 >SWISS-PROT: P02747   (sequence length 245) 
  C1QC_HUMAN (P02747) COMPLEMENT C1Q SUBCOMPONENT, C CHAIN PRECURSOR.   
  27-G @every Third   position @31    :GCYGIPGMPGLPGAPGKDGYDGLPGPKGEPGIPAIP
                                      -GIRGPKGQKGEPGLPGHPGKNGPMGPPGMPGVPGPM
                                      -GIPGEPGEEGRYKQKFQSVFT

 >SWISS-PROT: Q02105   (sequence length 246) 
  C1QC_MOUSE (Q02105) COMPLEMENT C1Q SUBCOMPONENT, C CHAIN PRECURSOR.   
  27-G @every Third   position @32    :GCYGIPGMPGMPGAPGKDGHDGLQGPKGEPGIPAVP
                                      -GTQGPKGQKGEPGMPGHRGKNGPRGTSGLPGDPGPR
                                      -GPPGEPGVEGRYKQKHQSVFT

 >SWISS-PROT: P02453   (sequence length 779) 
  CA11_BOVIN (P02453) COLLAGEN ALPHA 1(I) CHAIN (FRAGMENTS).            
 247-G @every Third   position @17    :GPMGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGAS
                                      -GPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPPGPQ
                                      -GARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPK
                                      -GEPGSPGENGAPGQMGPRGLPGFPGPKGAAGEPGKA
                                      -GERGVPGPPGAVGPAGKDGEAGAQGPPGPAGPAGER
                                      -GEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDL
                                      -GAPGPSGARGERGFPGERGVEGPPGPAGPRGANGAP
                                      -GNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLP
                                      -GPKGDRGDAGPKGADGAPGKDGVRGLTGPIGPPGPA
                                      -GAPGDKGEAGPSGPAGTRGAPGDRGEPGPPGPAGFA
                                      -GPPGADGQPGAKGEPGDAGAKGDAGPPGPAGPAGPP
                                      -GPIGNVGAPGPKGARGSAGPPGATGFPGAAGRVGPP
                                      -GPSGNAGPPGPPGPAGKEGSKGPRGETGPAGRPGEV
                                      -GPPGPPGPAGEKGAPGADGPAGAPGTPGPQGIAGQR
                                      -GVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGER
                                      -GPPGPMGPPGLAGPPGESGREGAPGAEGSPGRDGSP
                                      -GAKGDRGETGPAGAPGPPGAPGAPGPVGPAGKSGDR
                                      -GETGPAGPIGPVGPAGARGPAGPQGPRGBKGZTGZZ
                                      -GBRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPA
                                      -GPRGPPGSAGSPGKDGLNGLPGPIGPPGPRGRTGDA
                                      -GPAGPPGPPGPPGPPGPPSGGYDLSFLPQPPQQZKA
                                      -HDGGRY

 >SWISS-PROT: P02457   (sequence length 1453) 
  CA11_CHICK (P02457) COLLAGEN ALPHA 1(I) CHAIN PRECURSOR.              
  16-G @every Third   position @102   :GVEGPKGDTGPRGDRGLPGPPGRDGIPGQPGLPGPP
                                      -GPPGPPGLGGNFAPQMSY
 339-G @every Third   position @168   :GPMGPAGPRGLPGPPGAPGPQGFQGPPGEPGEPGAS
                                      -GPMGPRGPAGPPGKNGDDGEAGKPGRPGQRGPPGPQ
                                      -GARGLPGTAGLPGMKGHRGFSGLDGAKGQPGPAGPK
                                      -GEPGSPGENGAPGQMGPRGLPGERGRPGPSGPAGAR
                                      -GNDGAPGAAGPPGPTGPAGPPGFPGAAGAKGETGPQ
                                      -GARGSEGPQGSRGEPGPPGPAGAAGPAGNPGADGQP
                                      -GAKGATGAPGIAGAPGFPGARGPSGPQGPSGAPGPK
                                      -GNSGEPGAPGNKGDTGAKGEPGPAGVQGPPGPAGEE
                                      -GKRGARGEPGPAGLPGPAGERGAPGSRGFPGADGIA
                                      -GPKGPPGERGSPGAVGPKGSPGEAGRPGEAGLPGAK
                                      -GLTGSPGSPGPDGKTGPPGPAGQDGRPGPAGPPGAR
                                      -GQAGVMGFPGPKGAAGEPGKPGERGAPGPPGAVGAA
                                      -GKDGEAGAQGPPGPTGPAGERGEQGPAGAPGFQGLP
                                      -GPAGPPGEAGKPGEQGVPGNAGAPGPAGARGERGFP
                                      -GERGVQGPPGPQGPRGANGAPGNDGAKGDAGAPGAP
                                      -GNEGPPGLEGMPGERGAAGLPGAKGDRGDPGPKGAD
                                      -GAPGKDGLRGLTGPIGPPGPAGAPGDKGEAGPPGPA
                                      -GPTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAK
                                      -GETGDAGAKGDAGPPGPAGPTGAPGPAGZVGAPGPK
                                      -GARGSAGPPGATGFPGAAGRVGPPGPSGNIGLPGPP
                                      -GPAGKZGSKGPRGETGPAGRPGEPGPAGPPGPPGEK
                                      -GSPGADGPIGAPGTPGPQGIAGQRGVVGLPGQRGER
                                      -GFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLA
                                      -GPPGEAGREGAPGAEGAPGRDGAAGPKGDRGETGPA
                                      -GPPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPP
                                      -GPAGARGPAGPQGPRGDKGETGEQGDRGMKGHRGFS
                                      -GLQGPPGPPGAPGEQGPSGASGPAGPRGPPGSAGAA
                                      -GKDGLNGLPGPIGPPGPRGRTGEVGPVGPPGPPGPP
                                      -GPPGPPSGGFDFSFLPQPPQEKAHDGGRYYRADDAN
                                      -VMRDRDLEVDTTLKSLSQQIENIRSPEGTRKNPART
                                      -CRDLKMCHGDWKSGEYWIDPNQGCNLDAIKVYCNME
                                      -TGETCVYPTQATIAQKNWYLSKNPKEKKHV

 >SWISS-PROT: P04258   (sequence length 1049) 
  CA13_BOVIN (P04258) COLLAGEN ALPHA 1(III) CHAIN.                      
 343-G @every Third   position @15    :GIAGYPGPAGPPGPPGPPGTSGHPGAPGAPGYQGPP
                                      -GEPGQAGPAGPPGPPGAIGPSGKDGESGRPGRPGPR
                                      -GFPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGEP
                                      -GAPGLKGENGVPGEDGAPGPMGPRGAPGERGRPGLP
                                      -GAAGARGNDGARGSDGQPGPPGPPGTAGFPGSPGAK
                                      -GEVGPAGSPGSSGAPGQRGEPGPQGHAGAPGPPGPP
                                      -GSDGSPGGKGEMGPAGIPGAPGLIGARGPPGPPGTN
                                      -GVPGQRGAAGEPGKNGAKGDPGPRGERGEAGSPGIA
                                      -GPKGEDGKDGSPGEPGANGLPGAAGERGVPGFRGPA
                                      -GANGLPGEKGPPGDRGGPGPAGPRGVAGEPGRNGLP
                                      -GGPGLRGIPGSPGGPGSNGKPGPPGSQGETGRPGPP
                                      -GSPGPRGQPGVMGFPGPKGNDGAPGKNGERGGPGGP
                                      -GPQGPAGKNGETGPQGPPGPTGPSGDKGDTGPPGPQ
                                      -GLQGLPGTSGPPGENGKPGEPGPKGEAGAPGIPGGK
                                      -GDSGAPGERGPPGAGGPPGPRGGAGPPGPEGGKGAA
                                      -GPPGPPGSAGTPGLQGMPGERGGPGGPGPKGDKGEP
                                      -GSSGVDGAPGKDGPRGPTGPIGPPGPAGQPGDKGES
                                      -GAPGVPGIAGPRGGPGERGEQGPPGPAGFPGAPGQN
                                      -GEPGAKGERGAPGEKGEGGPPGAAGPAGGSGPAGPP
                                      -GPQGVKGERGSPGGPGAAGFPGGRGPPGPPGSNGNP
                                      -GPPGSSGAPGKDGPPGPPGSNGAPGSPGISGPKGDS
                                      -GPPGERGAPGPQGPPGAPGPLGIAGLTGARGLAGPP
                                      -GMPGARGSPGPQGIKGENGKPGPSGQNGERGPPGPQ
                                      -GLPGLAGTAGEPGRDGNPGSDGLPGRDGAPGAKGDR
                                      -GENGSPGAPGAPGHPGPPGPVGPAGKSGDRGETGPA
                                      -GPSGAPGPAGSRGPPGPQGPRGDKGETGERGAMGIK
                                      -GHRGFPGNPGAPGSPGPAGHQGAVGSPGPAGPRGPV
                                      -GPSGPPGKDGASGHPGPIGPPGPRGNRGERGSEGSP
                                      -GHPGQPGPPGPPGAPGPCCGAGGV

 >SWISS-PROT: P02461   (sequence length 1466) 
  CA13_HUMAN (P02461) COLLAGEN ALPHA 1(III) CHAIN PRECURSOR.            
 352-G @every Third   position @168   :GLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPP
                                      -GEPGQAGPSGPPGPPGAIGPSGPAGKDGESGRPGRP
                                      -GERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEK
                                      -GETGAPGLKGENGLPGENGAPGPMGPRGAPGERGRP
                                      -GLPGAAGARGNDGARGSDGQPGPPGPPGTAGFPGSP
                                      -GAKGEVGPAGSPGSNGAPGQRGEPGPQGHAGAQGPP
                                      -GPPGINGSPGGKGEMGPAGIPGAPGLMGARGPPGPA
                                      -GANGAPGLRGGAGEPGKNGAKGEPGPRGERGEAGIP
                                      -GVPGAKGEDGKDGSPGEPGANGLPGAAGERGAPGFR
                                      -GPAGPNGIPGEKGPAGERGAPGPAGPRGAAGEPGRD
                                      -GVPGGPGMRGMPGSPGGPGSDGKPGPPGSQGESGRP
                                      -GPPGPSGPRGQPGVMGFPGPKGNDGAPGKNGERGGP
                                      -GGPGPQGPPGKNGETGPQGPPGPTGPGGDKGDTGPP
                                      -GPQGLQGLPGTGGPPGENGKPGEPGPKGDAGAPGAP
                                      -GGKGDAGAPGERGPPGLAGAPGLRGGAGPPGPEGGK
                                      -GAAGPPGPPGAAGTPGLQGMPGERGGLGSPGPKGDK
                                      -GEPGGPGADGVPGKDGPRGPTGPIGPPGPAGQPGDK
                                      -GEGGAPGLPGIAGPRGSPGERGETGPPGPAGFPGAP
                                      -GQNGEPGGKGERGAPGEKGEGGPPGVAGPPGGSGPA
                                      -GPPGPQGVKGERGSPGGPGAAGFPGARGLPGPPGSN
                                      -GNPGPPGPSGSPGKDGPPGPAGNTGAPGSPGVSGPK
                                      -GDAGQPGEKGSPGAQGPPGAPGPLGIAGITGARGLA
                                      -GPPGMPGPRGSPGPQGVKGESGKPGANGLSGERGPP
                                      -GPQGLPGLAGTAGEPGRDGNPGSDGLPGRDGSPGGK
                                      -GDRGENGSPGAPGAPGHPGPPGPVGPAGKSGDRGES
                                      -GPAGPAGAPGPAGSRGAPGPQGPRGDKGETGERGAA
                                      -GIKGHRGFPGNPGAPGSPGPAGQQGAIGSPGPAGPR
                                      -GPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGERGSE
                                      -GSPGHPGQPGPPGPPGAPGPCCGGVGAAAIAGIGGE
                                      -KAGGFAPYYGDEPMDFKINTDEIMTSLKSVNGQIES
                                      -LISPDGSRKNPARNCRDLKFCHPELKSGEYWVDPNQ
                                      -GCKLDAIKVFCNMETGETCISANPLNVPRKHWWTDS
                                      -SAEKKHVWFGESMDGGFQFSYGNPELPEDVLDVQLA

 >SWISS-PROT: P30754   (sequence length 1027) 
  CAFF_RIFPA (P30754) FIBRIL-FORMING COLLAGEN ALPHA CHAIN.              
 336-G @every Third   position @13    :GPIGPRGPPGPPGSPGQQGYQGLRGEPGDSGPMGPI
                                      -GKRGPPGPAGIAGKSGDDGRDGEPGPRGGIGPMGPR
                                      -GAGGMPGMPGPXGHRGFRGLSGSXGEQGKSGNQGPD
                                      -GGPGPAGPSGPIGPRGQTGERGRDGKSGLPGLRGVD
                                      -GLAGPPGPPGPIGSTGSPGFPGTPGSKGDRGQSGIX
                                      -GAQGLQGPVGLSGQPGVAGENGHPGMPGMDGANGEP
                                      -GASGESGLPGPSGFPGPRGMPGTAGSPGQAGAXGDG
                                      -GPTGEQGRPGAPGVXGSSGPPGDVGAPGHAGEAGKR
                                      -GSPGSPGPAGSPGPQGDRGLPGSRGLPGMTGASGAM
                                      -GIPGEKGPSGEPGAKGPTGDTGRQGNQGTPGIAGLP
                                      -GNPGSDGRPGKDGRPGIRGKDGKQGEQGPQGPQGLA
                                      -GLQGRAGPPGARGEPGKNGAPGEPGAHGEQGDAGKD
                                      -GETGAAGPPGAAGPTGARGPPGPRGQQGFQGLAGAQ
                                      -GTPGEAGKTGERGAVGATGPSGPAGPGGERGAPGDR
                                      -GNVGPRGMPGERGATGPAGPTGSPGVAGAKGQGGPP
                                      -GPAGLVGLPGERGPKGVGGSXGSRGDIGPRGKAGER
                                      -GKDGERGERGENGLPGPSGLAASXGERGDMGSPGER
                                      -GSPGPAGERGPAGSQGIQGQPGPPGDAGPAGTXGDI
                                      -GFPGERGTRGATGKQGARGPRGLAGKRGLRGAGGSR
                                      -GETGAQGEIGLPGSPGQPGLPGPSGQPGPSGPAGTA
                                      -GKQGVXGARGSPGLVGKQGDRGSDGEPGRDGTXGER
                                      -GEDGPPGVSGPTGAPGQQGERGMPGMVGLRGETGPM
                                      -GGQGMXGDGGPPGPSGDRGERGNAGPQGPTGPSGQA
                                      -GAPGQEGAPGKDGLPGLAGRPGERGEPGVAGRAGSQ
                                      -GLAGLMGQRGLPGAAGPPGDRGERGEPGGQGVQGPV
                                      -GAPGSQGPAGIMGMXGEAGGKGAXGDKGWTGLPGLQ
                                      -GLQGTPGHSGESGPPGAPGPRGARGEAGGRGSQGPP
                                      -GKDGQPGPSGRVGPRGPSGDDGRSGPPGPPGPPGPP
                                      -GNSDYG

 >SWISS-PROT: P02661   (sequence length 284) 
  CAS1_RAT (P02661) ALPHA CASEIN PRECURSOR.                             
  17-A @every Third   position @142   :ASLAQQASLAQQASLAQQALLAQQPSLAQQAALAQQ
                                      -ASLAQQASLAQQASLAQKHHPRLS

 >SWISS-PROT: P18833   (sequence length 282) 
  CC08_CAEEL (P18833) CUTICLE COLLAGEN 8.                               
  42-G @every Third   position @141   :GCPGPRGPSGLVGPAGPAGDQGRHGPPGPTGGQGGP
                                      -GEQGDAGRPGAAGCPGPPGPRGEPGTEYRPGQAGRA
                                      -GPPGPRGPPGPEGNPGGAGEDGNQGPVGHPGVPGRP
                                      -GIPGKSGTCGEHGGPGEPGPDAGYCPCPGRSYK

 >SWISS-PROT: P42916   (sequence length 301) 
  CL43_BOVIN (P42916) COLLECTIN-43 (CL-43).                             
  38-G @every Third   position @29    :GHDGRDGKEGPQGEKGDPGPPGMPGPAGREGPSGRQ
                                      -GSMGPPGTPGPKGEPGPEGGVGAPGMPGSPGPAGLK
                                      -GERGAPGPGGAIGPQGPSGAMGPPGLKGDRGDPGEK
                                      -GARGETSVLEVDTLRQRMRNL

 >SWISS-PROT: O61735   (sequence length 1023) 
  CLOC_DROME (O61735) CIRCADIAN LOCOMOTER OUTPUT CYCLES KAPUT PROTEIN (D
  14-Q @every Third   position @788   :QLQQHTQQQHQQQQQQQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQQQQLQLQQ
  15-Q @every Third   position @790   :QQHTQQQHQQQQQQQQQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQQLQLQQQNDIL

 >SWISS-PROT: P98085   (sequence length 419) 
  COLE_LEPMA (P98085) INNER EAR-SPECIFIC COLLAGEN PRECURSOR (SACCULAR CO
  40-G @every Third   position @58    :GPMGPMGERGLPGPPGERGPLGLPGEKGETGLRGPP
                                      -GPAGLPGANGLNGDIGEKGDQGPVGLPGVPGIPGKP
                                      -GEKGDPGLKGDKGERGFSGLKGDPGERGEPGLNGTK
                                      -GSIGREGPMGPGLAGTKGLKGEQGLKG
  26-G @every Third   position @197   :GEKGERGPPGLRGEMGLNGTDGVKGERGEPGPLGGK
                                      -GDTGARGPPGPPGGRGMAGLRGEKGLKGVRGPRGPK
                                      -GPPGESVEQIRSAFSVGL

 >SWISS-PROT: P25050   (sequence length 105) 
  COLL_HSVS7 (P25050) COLLAGEN-LIKE PROTEIN.                            
  22-G @every Third   position @15    :GDRGPQGPPGPPGPQGPPGPQGPPGPQGPPGPQGPP
                                      -GPQGPPGPQGPPGPPGPPGPSGLPGLFVTNLLLGII
                                      -ILL
  18-P @every Third   position @19    :PQGPPGPPGPQGPPGPQGPPGPQGPPGPQGPPGPQG
                                      -PPGPQGPPGPPGPPGPSGLPGLFVTNL

 >SWISS-PROT: P22576   (sequence length 102) 
  COLL_HSVSC (P22576) COLLAGEN-LIKE PROTEIN.                            
  21-G @every Third   position @15    :GDRGPPGPPGPPGPQGPPGPQGPPGPQGPPGPPGPP
                                      -GPPGPPGPPGPPGPPGPPGLPGLFVTNLLLGIIVLL
  17-P @every Third   position @19    :PPGPPGPPGPQGPPGPQGPPGPQGPPGPPGPPGPPG
                                      -PPGPPGPPGPPGPPGLPGLFVTNL

 >SWISS-PROT: P23805   (sequence length 371) 
  CONG_BOVIN (P23805) CONGLUTININ PRECURSOR.                            
  56-G @every Third   position @46    :GLPGHDGQDGRECPHGEKGDPGSPGPAGRAGRPGWV
                                      -GPIGPKGDNGFVGEPGPKGDTGPRGPPGMPGPAGRE
                                      -GPSGKQGSMGPPGTPGPKGETGPKGGVGAPGIQGFP
                                      -GPSGLKGEKGAPGETGAPGRAGVTGPSGAIGPQGPS
                                      -GARGPPGLKGDRGDPGETGAKGESGLAEVNALKQRV
                                      -TILDGHLRR

 >SWISS-PROT: P33525   (sequence length 509) 
  CRU3_BRANA (P33525) CRUCIFERIN CRU1 PRECURSOR (11S GLOBULIN) (12S STOR
  18-Q @every Third   position @121   :QPMQGQQQGQPWQGQQGQQGQQGQQGQQGQQGQQGQ
                                      -QGQQGQQGQQGQQQQGFRDMHQKVEHV

 >SWISS-PROT: P08675   (sequence length 378) 
  CSP_PLACL (P08675) CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS).           
  16-A @every Third   position @155   :ARAEDGARAADGARAADGARAADGARAADGARAADG
                                      -ARAADGARAADGARAEDG

 >SWISS-PROT: P06914   (sequence length 367) 
  CSP_PLAYO (P06914) CIRCUMSPOROZOITE PROTEIN PRECURSOR (CS).           
  35-P @every Third   position @141   :PGAPQGPGAPQGPGAPQGPGAPQGPGAPQGPGAPQG
                                      -PGAPQGPGAPQGPGAPQGPGAPQGPGAPQGPGAPQG
                                      -PGAPQGPGAPQGPGAPQEPPQQPPQQPPQQPPQQPP
                                      -QQPPQQPPQQPR

 >SWISS-PROT: P80684   (sequence length 161) 
  CUC1_TENMO (P80684) PUPAL CUTICLE PROTEIN C1B (TM-C1B) (TM-PCP C1B).  
  16-A @every Third   position @86    :AAAPVAAYAAPIATAAYAAPVAHAAYAAPVAHAAYA
                                      -APVAHAAYAAPVARTIGV

 >SWISS-PROT: P04487   (sequence length 161) 
  DNB_HSV11 (P04487) DNA-BINDING PROTEIN (VMW21).                       
  26-R @every Third   position @80    :RPPTIPRTPRVPREPRVPRPPREPREPRVPRAPRDP
                                      -RVPRDPRDPRQPRSPREPRSPREPRSPREPRTPRTP
                                      -REPRTARGS
  25-P @every Third   position @82    :PTIPRTPRVPREPRVPRPPREPREPRVPRAPRDPRV
                                      -PRDPRDPRQPRSPREPRSPREPRSPREPRTPRTPRE
                                      -PRTARG

 >SWISS-PROT: P03211   (sequence length 641) 
  EBN1_EBV (P03211) EBNA-1 NUCLEAR PROTEIN.                             
  39-G @every Third   position @101   :GGAGAGGGAGAGGGAGGAGGAGGAGAGGGAGAGGGA
                                      -GGAGGAGAGGGAGAGGGAGGAGAGGGAGGAGGAGAG
                                      -GGAGAGGGAGGAGAGGGAGGAGGAGAGGGAGAGGAG
                                      -GAGGAGAGGAGAGGGAGGAGGAGA

 >SWISS-PROT: P12978   (sequence length 487) 
  EBN2_EBV (P12978) EBNA-2 NUCLEAR PROTEIN.                             
  14-P @every Third   position @60    :PPLPPPPPPPPPPPPPPPPPPPPPPPPPPSPPPPPP
                                      -PPPPPQRRDAWT
  14-P @every Third   position @61    :PLPPPPPPPPPPPPPPPPPPPPPPPPPPSPPPPPPP
                                      -PPPPQRRDAWTQ

 >SWISS-PROT: P23241   (sequence length 519) 
  ELAV_DROVI (P23241) ELAV PROTEIN.                                     
  14-Q @every Third   position @99    :QQQQQQQQQQQQQVVQQQQVQQVQQAVVAVQQQQQQ
                                      -QQQQQQQQQVVQ

 >SWISS-PROT: P07916   (sequence length 750) 
  ELS_CHICK (P07916) ELASTIN PRECURSOR (TROPOELASTIN) (FRAGMENT).       
  16-G @every Third   position @371   :GIGGVPGVPGVPGVPGVPGVPGVPGVPGVPGVPGVP
                                      -GVPGVVPGVGVGGPAAAA
  14-V @every Third   position @375   :VPGVPGVPGVPGVPGVPGVPGVPGVPGVPGVPGVPG
                                      -VVPGVGVGGPAA

 >SWISS-PROT: P54320   (sequence length 860) 
  ELS_MOUSE (P54320) ELASTIN PRECURSOR (TROPOELASTIN).                  
  17-G @every Third   position @131   :GLGGVGGVPGGVGVGGVPGGVGVGGVPGGVGVGGVP
                                      -GGVGGIGGIGGLGVSTGAVVPQVG
  15-G @every Third   position @544   :GVGGVPGGVGVGGIPGGVGVGGVPGGVGPGGVTGIG
                                      -AGPGGLGGAGSPAAA

 >SWISS-PROT: Q99372   (sequence length 864) 
  ELS_RAT (Q99372) ELASTIN PRECURSOR (TROPOELASTIN) (FRAGMENT).         
  14-G @every Third   position @124   :GLGGIGGVPGGVGVGGVPGAVGVGGVPGAVGGIGGI
                                      -GGLGVSTGAVVP

 >SWISS-PROT: P13816   (sequence length 678) 
  GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR.             
  16-K @every Third   position @120   :KKDKKEKKHKKDKKEKKEKKDKKEKKDKKEKKHKKE
                                      -KKHKKDKKKKENSEVMSL
  15-K @every Third   position @121   :KDKKEKKHKKDKKEKKEKKDKKEKKDKKEKKHKKEK
                                      -KHKKDKKKKENSEVM
  17-E @every Third   position @561   :ESKEVQEDEEEVEEDEEEEEEEEEEEEEEEEEEEEE
                                      -EEEEEEEEEDEDEEDEDDAEEDED
  16-E @every Third   position @571   :EVEEDEEEEEEEEEEEEEEEEEEEEEEEEEEEEEED
                                      -EDEEDEDDAEEDEDDAEE

 >SWISS-PROT: P36417   (sequence length 708) 
  GBF_DICDI (P36417) G-BOX BINDING FACTOR (GBF).                        
  14-Q @every Third   position @161   :QMQQQQQHHQQMQQQQHHQQMQHHQLQQHQHQHQQQ
                                      -QQQQQHQQQHHQ
  16-Q @every Third   position @192   :QHQQQQQQQQHQQQHHQQQQQQQQQHHQQQQHHQHS
                                      -QPQQQHQHNQQQQHQHNQ

 >SWISS-PROT: P10387   (sequence length 648) 
  GLT0_WHEAT (P10387) GLUTENIN, HIGH MOLECULAR WEIGHT SUBUNIT DY10 PRECU
  14-Q @every Third   position @376   :QLGQGQQTGQPGQKQQPGQGQQTGQGQQPEQEQQPG
                                      -QGQQGYYPTSLQ

 >SWISS-PROT: P08488   (sequence length 660) 
  GLT3_WHEAT (P08488) GLUTENIN, HIGH MOLECULAR WEIGHT SUBUNIT 12 PRECURS
  14-Q @every Third   position @386   :QLGQGQQIGQPGQKQQPGQGQQTGQGQQPEQEQQPG
                                      -QGQQGYYPTSLQ

 >SWISS-PROT: P08489   (sequence length 838) 
  GLT4_WHEAT (P08489) GLUTENIN, HIGH MOLECULAR WEIGHT SUBUNIT PW212 PREC
  23-Q @every Third   position @217   :QPGQLQQPAQGQQGQQPGQGQQGQQPGQGQQPGQGQ
                                      -QGQQPGQGQQPGQGQQGQQLGQGQQGYYPTSLQQSG
                                      -QGQPGY
  16-Q @every Third   position @316   :QPGQGQQPGQLQQPAQGQQPEQGQQGQQPGQGQQGQ
                                      -QPGQGQQPGQGQPGYYPT
  16-Q @every Third   position @448   :QSGQGQQPGQLQQSAQGQKGQQPGQGQQPGQGQQGQ
                                      -QPGQGQQGQQPGQGQPGY
  31-Q @every Third   position @550   :QPGQGQQPGQLQQPAQGQQGQQLAQGQQGQQPAQVQ
                                      -QGQQPAQGQQGQQLGQGQQGQQPGQGQQPAQGQQGQ
                                      -QPGQGQQGQQPGQGQQPGQGQPWYYPTSPQESG

 >SWISS-PROT: P10388   (sequence length 839) 
  GLT5_WHEAT (P10388) GLUTENIN, HIGH MOLECULAR WEIGHT SUBUNIT DX5 PRECUR
  23-Q @every Third   position @212   :QPGQLQQPAQGQQGQQPGQAQQGQQPGQGQQPGQGQ
                                      -QGQQPGQGQQPGQGQQGQQLGQGQQGYYPTSLQQSG
                                      -QGQPGY
  16-Q @every Third   position @311   :QPGQGQQPGQLQQPAQGQQPGQGQQGQQPGQGQQGQ
                                      -QPGQGQQPGQGQPGYYPT
  16-Q @every Third   position @443   :QSGQGQQPGQLQQSAQGQKGQQPGQGQQPGQGQQGQ
                                      -QPGQGQQGQQPGQGQPGY
  34-Q @every Third   position @545   :QPGQGQQPGQLQQPAQGQQGQQLAQGQQGQQPAQVQ
                                      -QGQQPAQGQQGQQLGQGQQGQQPGQGQQGQQPAQGQ
                                      -QGQQPGQGQHGQQPGQGQQGQQPGQGQQPGQGQPWY
                                      -YPTSPQESG

 >SWISS-PROT: P09789   (sequence length 384) 
  GRP1_PETHY (P09789) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1 PRECUR
  14-G @every Third   position @66    :GAGGGFGGGAGGGAGGGLGGGGGLGGGGGAGGGGGL
                                      -GGGGGAGGGFGG
  14-G @every Third   position @70    :GFGGGAGGGAGGGLGGGGGLGGGGGAGGGGGLGGGG
                                      -GAGGGFGGGAGG
  16-G @every Third   position @118   :GAGGGLGGGGGLGGGGGGGAGGGGGVGGGAGSGGGF
                                      -GAGGGVGGGAGAGGGVGG
  16-G @every Third   position @152   :GFGAGGGVGGGAGAGGGVGGGGGFGGGGGGGVGGGS
                                      -GHGGGFGAGGGVGGGAGG
  32-G @every Third   position @206   :GLGGGVGGGGGGGSGGGGGIGGGSGHGGGFGAGGGV
                                      -GGGVGGGAAGGGGGGGGGGGGGGGGLGGGSGHGGGF
                                      -GAGGGVGGGAAGGVGGGGGFGGGGGGGVGGGSGHGG
  14-G @every Third   position @280   :GGGVGGGAAGGVGGGGGFGGGGGGGVGGGSGHGGGF
                                      -GAGGGVGGGAGG

 >SWISS-PROT: P27484   (sequence length 214) 
  GRP2_NICSY (P27484) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 2 PRECUR
  20-G @every Third   position @82    :GGRGGGGGGGGRGGGGYGGGSGGYGGGGRGGSRGYG
                                      -GGDGGYGGGGGYGGGSRYGGGGGGYGGGGGYGG

 >SWISS-PROT: P10496   (sequence length 465) 
  GRP2_PHAVU (P10496) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN 1.8 PREC
  14-G @every Third   position @37    :GHGTGGGYGGAAGSYGGGGGGGSGGGGGYAGEHGVV
                                      -GYGGGSGGGQGG

 >SWISS-PROT: P17816   (sequence length 200) 
  GRP_HORVU (P17816) GLYCINE-RICH CELL WALL STRUCTURAL PROTEIN PRECURSO 
  14-G @every Third   position @127   :GGEGGGGYGGGGGYHGHGGEGGGGYGGGGGGYPGHG
                                      -GGGGHGGGRCKW

 >SWISS-PROT: P37218   (sequence length 287) 
  H1_LYCES (P37218) HISTONE H1.                                         
  14-A @every Third   position @152   :AAVKPKAKPAAKAKPAAKAKPAAKAKPAAKAKPAAK
                                      -AKPAAKAKPVAK

 >SWISS-PROT: Q06576   (sequence length 215) 
  HP25_TAMAS (Q06576) HIBERNATION-ASSOCIATED PLASMA PROTEIN HP-25 PRECUR
  15-G @every Third   position @34    :GNSEPCGPPGPPGPPGIPGFPGAPGALGPPGPPGVP
                                      -GIPGPQGPPGDVEKC

 >SWISS-PROT: P05227   (sequence length 332) 
  HRP1_PLAFA (P05227) HISTIDINE-RICH PROTEIN PRECURSOR (CLONE PFHRP-II).
  81-A @every Third   position @57    :AHHAHHVADAHHAHHAHHAADAHHAHHAADAHHAHH
                                      -AADAHHAHHAADAHHAHHAADAHHAHHAADAHHAHH
                                      -AADAHHAHHAADAHHAHHAADAHHAHHAAYAHHAHH
                                      -ASDAHHAADAHHAAYAHHAHHAADAHHAADAHHAAY
                                      -AHHAHHAADAHHAADAHHATDAHHAHHAADAHHATD
                                      -AHHAADAHHAADAHHATDAHHAADAHHATDAHHAAD
                                      -AHHAADAHHATDSHHAHHAADAHHAAAHHATDAHHA
                                      -AAHHATDAHHAAAHHEAATHC

 >SWISS-PROT: P05228   (sequence length 241) 
  HRP2_PLAFA (P05228) HISTIDINE-RICH PROTEIN PRECURSOR (CLONE PFHRP-III)
  32-A @every Third   position @66    :AHHAHHVADAHHAHHAANAHHAANAHHAANAHHAAN
                                      -AHHAANAHHAANAHHAANAHHAANAHHAANAHHAAN
                                      -AHHAANAHHAANAHHAANAHHAADANHGFHFNLHDN

 >SWISS-PROT: P04929   (sequence length 351) 
  HRPX_PLALO (P04929) HISTIDINE-RICH GLYCOPROTEIN PRECURSOR.            
  15-H @every Third   position @206   :HHHHHHHAPHHHHHHHHGHHHHHHHHHGHHHHHHHH
                                      -HGHHHHHHHHHDAHH
  15-H @every Third   position @216   :HHHHHHHGHHHHHHHHHGHHHHHHHHHGHHHHHHHH
                                      -HDAHHHHHHHHDAHH
  15-H @every Third   position @226   :HHHHHHHGHHHHHHHHHGHHHHHHHHHDAHHHHHHH
                                      -HDAHHHHHHHHDAHH

 >SWISS-PROT: P04930   (sequence length 221) 
  HRP_PLAFF (P04930) SMALL HISTIDINE-ALANINE-RICH PROTEIN PRECURSOR (SH 
  31-A @every Third   position @54    :AGDAHHAHHVADAHHAHHAANAHHAANAHHAANAHH
                                      -AANAHHAANAHHAANAHHAANAHHAANAHHAANAHH
                                      -AANAHHAANAHHAANAHHAADANHGFHFNLHDN

 >SWISS-PROT: P18127   (sequence length 1567) 
  ICEN_XANCT (P18127) ICE NUCLEATION PROTEIN.                           
  14-T @every Third   position @139   :TQATSATLPTPATPSTQATPSTQSTQSTQSTEATQS
                                      -TEATPVATVAAA

 >SWISS-PROT: P33417   (sequence length 597) 
  IXR1_YEAST (P33417) INTRASTRAND CROSSLINK RECOGNITION PROTEIN (STRUCTU
  15-Q @every Third   position @290   :QMQQQLQLQQQQQLQQQQQLQQQHQLQQQQQLQQQH
                                      -HHLQQQQQQQQHPVV

 >SWISS-PROT: Q01546   (sequence length 638) 
  K22O_HUMAN (Q01546) KERATIN, TYPE II CYTOSKELETAL 2 ORAL (CYTOKERATIN 
  16-G @every Third   position @105   :GFGGGRGVGSGFGGAGGFGGAGGFGGPGVFGGPGSF
                                      -GGPGGFGPGGFPGGIQEV

 >SWISS-PROT: P12035   (sequence length 629) 
  K2C3_HUMAN (P12035) KERATIN, TYPE II CYTOSKELETAL 3 (CYTOKERATIN 3) (K
  21-G @every Third   position @102   :GGGFGGGFGGGRGMGGGFGGAGGFGGAGGFGGAGGF
                                      -GGPGGFGGSGGFGGPGSLGSPGGFAPGGFPGGIQEV

 >SWISS-PROT: P34099   (sequence length 648) 
  KAPC_DICDI (P34099) CAMP-DEPENDENT PROTEIN KINASE CATALYTIC SUBUNIT (E
  15-Q @every Third   position @182   :QQQQQLQQQQLQQQLQQQQQQQQQQQQQQQQKQQKQ
                                      -QQQQQQHLHQDGIVN

 >SWISS-PROT: P53894   (sequence length 756) 
  KNQ1_YEAST (P53894) PROBABLE SERINE/THREONINE-PROTEIN KINASE YNL161W (
  14-Q @every Third   position @208   :QQQNSRQQQQQLQYQQQQQQQQQQQHMQIQQQQQQQ
                                      -QQQQQSQSPVQS

 >SWISS-PROT: P25692   (sequence length 127) 
  KRCL_CHICK (P25692) KERATIN, CLAW (C-KER).                            
  14-G @every Third   position @70    :GMGGTFGRGAGFGGYGGLGGYGGYGGLGGYGGYGGF
                                      -GSCGYGGFGRGY

 >SWISS-PROT: P08472   (sequence length 779) 
  M130_STRPU (P08472) MESENCHYME-SPECIFIC CELL SURFACE GLYCOPROTEIN PREC
  15-G @every Third   position @274   :GQGGQGGQGGQGGQGQYPGQGGQGGQGGQGGQGGQG
                                      -GYPGQGGQGGPGYYP

 >SWISS-PROT: P04934   (sequence length 1726) 
  MSP1_PLAFC (P04934) MEROZOITE SURFACE PROTEIN 1 PRECURSOR (MEROZOITE S
  21-S @every Third   position @64    :SAQSGTSGTSGTSGTSGTSGTSGTSAQSGTSGTSAQ
                                      -SGTSGTSAQSGTSGTSGTSGTSPSSRSNTLPRSNTS

 >SWISS-PROT: P13819   (sequence length 1701) 
  MSP1_PLAFF (P13819) MEROZOITE SURFACE PROTEIN 1 PRECURSOR (MEROZOITE S
  15-S @every Third   position @69    :SSGSVTSGGSVASVASVASGGSGGSVASGGSGNSRR
                                      -TNPSDNSSDSNTKTY

 >SWISS-PROT: P08569   (sequence length 1701) 
  MSP1_PLAFM (P08569) MEROZOITE SURFACE PROTEIN 1 PRECURSOR (MEROZOITE S
  15-S @every Third   position @69    :SSGSVTSGGSVASVASVASGGSGGSVASGGSGNSRR
                                      -TNPSDNSSDSNTKTY

 >SWISS-PROT: P50495   (sequence length 1726) 
  MSP1_PLAFP (P50495) MEROZOITE SURFACE PROTEIN 1 PRECURSOR (MEROZOITE S
  21-S @every Third   position @64    :SAQSGTSGTSGTSGTSGTSGTSGTSAQSGTSGTSAQ
                                      -SGTSGTSAQSGTSGTSGTSGTSPSSRSNTLPRSNTS

 >SWISS-PROT: P04933   (sequence length 1639) 
  MSP1_PLAFW (P04933) MEROZOITE SURFACE PROTEIN 1 PRECURSOR (MEROZOITE S
  17-S @every Third   position @69    :SKGSVASGGSGGSVASGGSVASGGSVASGGSVASGG
                                      -SGNSRRTNPSDNSSDSDAKSYADL

 >SWISS-PROT: P21758   (sequence length 453) 
  MSRE_BOVIN (P21758) MACROPHAGE SCAVENGER RECEPTOR TYPES I AND II (MACR
  24-G @every Third   position @272   :GPPGPPGEKGDRGPPGQNGIPGFPGLIGTPGLKGDR
                                      -GISGLPGVRGFPGPMGKTGKPGLNGQKGQKGEKGSG
                                      -SMQRQSNTV

 >SWISS-PROT: P21757   (sequence length 451) 
  MSRE_HUMAN (P21757) MACROPHAGE SCAVENGER RECEPTOR TYPES I AND II (MACR
  23-G @every Third   position @273   :GPPGPPGEKGDRGPTGESGPRGFPGPIGPPGLKGDR
                                      -GAIGFPGSRGLPGYAGRPGNSGPKGQKGEKGSGNTL
                                      -TPFTKV

 >SWISS-PROT: P30204   (sequence length 458) 
  MSRE_MOUSE (P30204) MACROPHAGE SCAVENGER RECEPTOR TYPES I AND II (MACR
  27-G @every Third   position @277   :GPPGPQGEKGDRGLTGQTGPPGAPGIRGIPGVKGDR
                                      -GQIGFPGGRGNPGAPGKPGRSGSPGPKGQKGEKGSV
                                      -GGSTPLKTVRLVGGSGAHEGR

 >SWISS-PROT: Q05585   (sequence length 454) 
  MSRE_RABIT (Q05585) MACROPHAGE SCAVENGER RECEPTOR TYPES I AND II (MACR
  24-G @every Third   position @273   :GPPGPPGEKGDRGPTGESGPPGVPGPVGPPGLKGDR
                                      -GSIGFPGSRGYPGQSGKTGRTGYPGPKGQKGEKGSG
                                      -SILTPSATV

 >SWISS-PROT: Q05049   (sequence length 662) 
  MUC1_XENLA (Q05049) INTEGUMENTARY MUCIN C.1 (FIM-C.1) (FRAGMENT).     
  14-T @every Third   position @410   :TTTPTTTTTTKATTTTPTTTTTTPTTTTTTTTTTKA
                                      -TTTTPTTTTPTT
  14-T @every Third   position @436   :TTTTTTTTKATTTTPTTTTPTTTTTKATTTTPTTTT
                                      -TTPTTTTTKATT
  17-T @every Third   position @470   :TTTTPTTTTTKATTTTPTTTTTTPTTTTTKATTTTP
                                      -TTTTTTTTTTKATTTTTSGECKME

 >SWISS-PROT: P16053   (sequence length 857) 
  NFM_CHICK (P16053) NEUROFILAMENT TRIPLET M PROTEIN (160 KD NEUROFILAM 
  20-P @every Third   position @658   :PEKPTTPEKVVSPEKPASPEKPRTPEKPASPEKPAT
                                      -PEKPRTPEKPATPEKPRSPEKPSSPLKDEKAVV

 >SWISS-PROT: P19338   (sequence length 706) 
  NUCL_HUMAN (P19338) NUCLEOLIN (PROTEIN C23).                          
  15-G @every Third   position @655   :GRGGFGGRGGGRGGRGGFGGRGRGGFGGRGGFRGGR
                                      -GGGGDHKPQGKKTKF

 >SWISS-PROT: P09405   (sequence length 706) 
  NUCL_MOUSE (P09405) NUCLEOLIN (PROTEIN C23).                          
  15-G @every Third   position @655   :GRGGFGGRGGGRGGRGGFGGRGRGGFGGRGGFRGGR
                                      -GGGGDFKPQGKKTKF

 >SWISS-PROT: P13383   (sequence length 712) 
  NUCL_RAT (P13383) NUCLEOLIN (PROTEIN C23).                            
  15-G @every Third   position @661   :GRGGFGGRGGGRGGRGGFGGRGRGGFGGRGGFRGGR
                                      -GGGGDFKPQGKKTKF

 >SWISS-PROT: P20397   (sequence length 650) 
  NUCL_XENLA (P20397) NUCLEOLIN (PROTEIN C23).                          
  14-G @every Third   position @585   :GRGGFGRGGGFRGGRGGRGGGGGRGFGGRGGGRGRG
                                      -GFGGRGGGGFRG

 >SWISS-PROT: Q29438   (sequence length 262) 
  ODFP_BOVIN (Q29438) OUTER DENSE FIBER PROTEIN.                        
  15-P @every Third   position @208   :PCNPCNPCNPCSPCSPCNPCNPCSPCSPCSPCNPCD
                                      -PCNPCYPCGSRFSCR
  16-C @every Third   position @209   :CNPCNPCNPCSPCSPCNPCNPCSPCSPCSPCNPCDP
                                      -CNPCYPCGSRFSCRK

 >SWISS-PROT: Q29077   (sequence length 262) 
  ODFP_PIG (Q29077) OUTER DENSE FIBER PROTEIN.                          
  15-P @every Third   position @208   :PCNPCNPCSPCNPCNPCNPCNPCSPCSPCSPCNPCD
                                      -PCNPCYPCGSRFSCR
  16-C @every Third   position @209   :CNPCNPCSPCNPCNPCNPCNPCSPCSPCSPCNPCDP
                                      -CNPCYPCGSRFSCRK

 >SWISS-PROT: P21769   (sequence length 245) 
  ODFP_RAT (P21769) OUTER DENSE FIBER PROTEIN (RT7 PROTEIN) (RTS 5/1).  
  14-P @every Third   position @194   :PCNPCNPCSPCSPCGPCGPCGPCGPCGPCGPCDPCN
                                      -PCYPCGSRFSCR
  15-C @every Third   position @195   :CNPCNPCSPCSPCGPCGPCGPCGPCGPCGPCDPCNP
                                      -CYPCGSRFSCRK

 >SWISS-PROT: P54674   (sequence length 1858) 
  P3K2_DICDI (P54674) PHOSPHATIDYLINOSITOL 3-KINASE 2 (EC 2.7.1.137) (PI
  14-N @every Third   position @185   :NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
                                      -NNNNNNTTSTTT
  14-N @every Third   position @186   :NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
                                      -NNNNNTTSTTTT
  14-N @every Third   position @187   :NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
                                      -NNNNTTSTTTTT
  15-N @every Third   position @1006  :NKENKDSSSNNNNNNNNNNNNNNNNNNNNNNNNNNN
                                      -NGNNNGNNSNNNSNS

 >SWISS-PROT: P54675   (sequence length 1585) 
  P3K3_DICDI (P54675) PHOSPHATIDYLINOSITOL 3-KINASE 3 (EC 2.7.1.137) (PI
  16-N @every Third   position @345   :NENNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNEE
                                      -LINNNNNNNNDENYKIEE
  14-N @every Third   position @347   :NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNEELI
                                      -NNNNNNNNDENY

 >SWISS-PROT: P32521   (sequence length 1480) 
  PAN1_YEAST (P32521) PAN1 PROTEIN.                                     
  15-Q @every Third   position @1084  :QPTQPVQSTQPVQPTQPVQPTQPVQPTQPVQPTQPV
                                      -QPTQPVQNVYNAKQE

 >SWISS-PROT: P27177   (sequence length 273) 
  PRIO_CHICK (P27177) MAJOR PRION PROTEIN HOMOLOG PRECURSOR (PR-LP) (ACE
  18-P @every Third   position @44    :PSYPRQPGYPHNPGYPHNPGYPHNPGYPHNPGYPHN
                                      -PGYPQNPGYPHNPGYPGWGQGYNPSSG

 >SWISS-PROT: P35242   (sequence length 248) 
  PSPA_MOUSE (P35242) PULMONARY SURFACTANT-ASSOCIATED PROTEIN A PRECURSO
  14-G @every Third   position @22    :GTEVCAGSPGIPGTPGNHGLPGRDGRDGIKGDPGPP
                                      -GPMGPPGGMPGL

 >SWISS-PROT: P35246   (sequence length 369) 
  PSPD_BOVIN (P35246) PULMONARY SURFACTANT-ASSOCIATED PROTEIN D PRECURSO
  58-G @every Third   position @46    :GLPGRDGRDGREGPRGEKGDPGSPGPAGRAGMPGPA
                                      -GPIGLKGDNGSAGEPGPKGDTGPPGPPGMPGPAGRE
                                      -GPSGKQGSMGPPGTPGPKGDTGPKGGVGAPGIQGSP
                                      -GPAGLKGERGAPGEPGAPGRAGAPGPAGAIGPQGPS
                                      -GARGPPGLKGDRGTPGERGAKGESGLAEVNALRQRV
                                      -GILEGQLQRLQNAFSQYK

 >SWISS-PROT: P35247   (sequence length 375) 
  PSPD_HUMAN (P35247) PULMONARY SURFACTANT-ASSOCIATED PROTEIN D PRECURSO
  59-G @every Third   position @46    :GLPGRDGRDGREGPRGEKGDPGLPGAAGQAGMPGQA
                                      -GPVGPKGDNGSVGEPGPKGDTGPSGPPGPPGVPGPA
                                      -GREGALGKQGNIGPQGKPGPKGEAGPKGEVGAPGMQ
                                      -GSAGARGLAGPKGERGVPGERGVPGNTGAAGSAGAM
                                      -GPQGSPGARGPPGLKGDKGIPGDKGAKGESGLPDVA
                                      -SLRQQVEALQGQVQHLQAAFS

 >SWISS-PROT: P50404   (sequence length 374) 
  PSPD_MOUSE (P50404) PULMONARY SURFACTANT-ASSOCIATED PROTEIN D PRECURSO
  59-G @every Third   position @45    :GLPGRDGRDGREGPRGEKGDPGLPGPMGLSGLQGPT
                                      -GPVGPKGENGSAGEPGPKGERGLSGPPGLPGIPGPA
                                      -GKEGPSGKQGNIGPQGKPGPKGEAGPKGEVGAPGMQ
                                      -GSTGAKGSTGPKGERGAPGVQGAPGNAGAAGPAGPA
                                      -GPQGAPGSRGPPGLKGDRGVPGDRGIKGESGLPDSA
                                      -ALRQQMEALKGKLQRLEVAFS

 >SWISS-PROT: P35248   (sequence length 374) 
  PSPD_RAT (P35248) PULMONARY SURFACTANT-ASSOCIATED PROTEIN D PRECURSO  
  59-G @every Third   position @45    :GLPGRDGRDGREGPRGEKGDPGLPGPMGLSGLPGPR
                                      -GPVGPKGENGSAGEPGPKGERGLVGPPGSPGISGPA
                                      -GKEGPSGKQGNIGPQGKPGPKGEAGPKGEVGAPGMQ
                                      -GSAGAKGPAGPKGERGAPGEQGAPGNAGAAGPAGPA
                                      -GPQGAPGSRGPPGLKGDRGAPGDRGIKGESGLPDSA
                                      -ALRQQMEALNGKLQRLEAAFS

 >SWISS-PROT: P54637   (sequence length 989) 
  PTP3_DICDI (P54637) PROTEIN-TYROSINE PHOSPHATASE 3 (EC 3.1.3.48) (PROT
  20-N @every Third   position @131   :NTMIIKNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
                                      -NNNNNNNNNNNNNNNNNNNNSNSNIEINVPSIQ
  17-N @every Third   position @138   :NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
                                      -NNNNNNNNNNNNNSNSNIEINVPS
  17-N @every Third   position @139   :NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
                                      -NNNNNNNNNNNNSNSNIEINVPSI

 >SWISS-PROT: P08116   (sequence length 224) 
  PVA_PLAFA (P08116) PROCESSED VARIABLE ANTIGEN (FRAGMENT).             
  38-E @every Third   position @1     :ETGESKETGESKETGESKETGESKETGESKETGESK
                                      -ETGESKETGESKETGESKETGESKETGESKETGESK
                                      -ETGESKETGESKETGESKETGESKETGESKETRIYE
                                      -ETKYNKITSEFRETENVKITE

 >SWISS-PROT: P75578   (sequence length 237) 
  RL23_MYCPN (P75578) 50S RIBOSOMAL PROTEIN L23.                        
  21-K @every Third   position @156   :KVAKEVKEVKVEKPVKVEKPTKPAKVAKEAKTTKVA
                                      -KETKAEKSVQTTKVAKETKTEKSAKTTKTTATKTTK

 >SWISS-PROT: P51992   (sequence length 385) 
  RO32_XENLA (P51992) HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN A3 HOMOLOG
  16-G @every Third   position @225   :GDGGGGGNFGRGGGFGNRGGYGGGGGRGGGYGGGGD
                                      -GYNGFGGDGGNYGGGPGY

 >SWISS-PROT: O26116   (sequence length 258) 
  RS3_METTH (O26116) 30S RIBOSOMAL PROTEIN S3P.                         
  14-E @every Third   position @206   :EASEASEVVEDLEEVEDLEEIEDLEEVEDLEEVEDL
                                      -EDTEAEKKDADG

 >SWISS-PROT: P11716   (sequence length 5037) 
  RYNR_RABIT (P11716) RYANODINE RECEPTOR, SKELETAL MUSCLE (SKELETAL MUSC
  14-E @every Third   position @1869  :EVFTEEEEEEEEEEEEEEEEEEDEEEKEEDEEEEEK
                                      -EDAEKEEEEAPE

 >SWISS-PROT: P21750   (sequence length 142) 
  SALA_DROME (P21750) PROTEIN SPALT-ACCESSORY.                          
  15-G @every Third   position @21    :GQVGQGGYGGQGGFGGFGGIGGQAGFGGQIGFTGQG
                                      -GVSGQVGIGQGGVHP

 >SWISS-PROT: P21748   (sequence length 142) 
  SALA_DROOR (P21748) PROTEIN SPALT-ACCESSORY.                          
  14-G @every Third   position @21    :GQGGQGPYGGQGGFGGYGGLGGQAGFGGQIGFNGQG
                                      -GVGGQLGVGQGG

 >SWISS-PROT: P21749   (sequence length 139) 
  SALA_DROSI (P21749) PROTEIN SPALT-ACCESSORY.                          
  14-G @every Third   position @21    :GQGGYGGQGGFGGFGGLGGQAGFGGQIGFNGQGGVG
                                      -GQVGIGQGGVHP

 >SWISS-PROT: P18480   (sequence length 905) 
  SNF5_YEAST (P18480) TRANSCRIPTION REGULATORY PROTEIN SNF5 (SWI/SNF COM
  21-Q @every Third   position @216   :QIQQQQQKQQQQQQQHQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQQQQQQQQQQQQQQGQIPQSQQVPQVRSMSGQP
  18-Q @every Third   position @218   :QQQQQKQQQQQQQHQQQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQQQQQQQQQQQQGQIPQSQQVPQV
  16-Q @every Third   position @220   :QQQKQQQQQQQHQQQQQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQQQQQQQQQQGQIPQ

 >SWISS-PROT: P54705   (sequence length 685) 
  SNWA_DICDI (P54705) SNWA PROTEIN.                                     
  16-R @every Third   position @428   :RDNIDSRDNRDSRDSRDSRDSRDSRDSRDSRDNRDS
                                      -RDSRDNRDNRDNRRRDDS
  18-D @every Third   position @429   :DNIDSRDNRDSRDSRDSRDSRDSRDSRDSRDNRDSR
                                      -DSRDNRDNRDNRRRDDSNDRDRYSKRR

 >SWISS-PROT: P32583   (sequence length 406) 
  SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40.                         
  29-S @every Third   position @25    :SSSSSSSSSSSSSSSSSSSSSSSSSGESSSSSSSSS
                                      -SSSSSDSSDSSDSESSSSSSSSSSSSSSSSDSESSS
                                      -ESDSSSSGSSSSSSSSSDESSSESESE
  15-S @every Third   position @26    :SSSSSSSSSSSSSSSSSSSSSSSSGESSSSSSSSSS
                                      -SSSSDSSDSSDSESS

 >SWISS-PROT: P21997   (sequence length 485) 
  SSGP_VOLCA (P21997) SULFATED SURFACE GLYCOPROTEIN 185 (SSG 185).      
  18-P @every Third   position @242   :PSPPPSPRPPSPPPPSPSPPPPPPPPPPPPPPPPPS
                                      -PPPPPPPPPPPPPPPPPPSPSPPRKPP
  21-P @every Third   position @255   :PPSPSPPPPPPPPPPPPPPPPPSPPPPPPPPPPPPP
                                      -PPPPPSPSPPRKPPSPSPPVPPPPSPPSVLPAATGF

 >SWISS-PROT: Q01443   (sequence length 826) 
  SSP2_PLAYO (Q01443) SPOROZOITE SURFACE PROTEIN 2 PRECURSOR.           
  33-P @every Third   position @294   :PEKPSNPEEPVNPNDPNDPNNPNNPNNPNNPNNPNN
                                      -PNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNN
                                      -PNNPNNPNNPNNPNNPNDPSNPNNHPKRRNPKRRNP
                                      -NKPKPN
  27-N @every Third   position @307   :NDPNDPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNP
                                      -NNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNP
                                      -NNPNDPSNPNNHPKRRNPKRR
  26-N @every Third   position @314   :NPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPN
                                      -NPNNPNNPNNPNNPNNPNNPNNPNNPNNPNNPNDPS
                                      -NPNNHPKRRNPKRRNPNK
  38-P @every Third   position @460   :PNEPLNPNEPSNPNEPSNPNAPSNPNEPSNPNEPSN
                                      -PNEPSNPNEPSNPNEPSNPKKPSNPNEPSNPNEPLN
                                      -PNEPSNPNEPSNPNEPSNPEEPSNPKEPSNPNEPSN
                                      -PEEPNPEEPSNPKEPSNPEEP

 >SWISS-PROT: P54683   (sequence length 1905) 
  TAGB_DICDI (P54683) PRESTALK-SPECIFIC PROTEIN TAGB PRECURSOR (EC 3.4.2
  15-N @every Third   position @95    :NINNNNNNNNKLNNNNNNNNNNNNNNNNNNNNNNNN
                                      -NNNNYYNSIEYYSSF
  15-Q @every Third   position @1813  :QEQQEQQEQQQQQQQEQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQQQQQQQQQNDQ
  17-Q @every Third   position @1815  :QQEQQEQQQQQQQEQQQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQQQQQQQNDQPPNDYDQVPPP

 >SWISS-PROT: P20226   (sequence length 339) 
  TF2D_HUMAN (P20226) TRANSCRIPTION INITIATION FACTOR TFIID (TATA-BOX FA
  14-Q @every Third   position @55    :QQRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQQAVAAAAV
  14-Q @every Third   position @56    :QRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQAVAAAAVQ

 >SWISS-PROT: P49456   (sequence length 504) 
  TPM5_DROME (P49456) TROPOMYOSIN 1, FUSION PROTEIN 34.                 
  14-A @every Third   position @408   :AAPAEGAPPAEGAPAAEGAAPAEGAPAAEGAPPAEG
                                      -APAPAPAEGEAA

 >SWISS-PROT: P19351   (sequence length 396) 
  TRT_DROME (P19351) TROPONIN T, SKELETAL MUSCLE (UPHELD PROTEIN) (INTE 
  15-E @every Third   position @346   :EDDEEVEEEVVEEEDEEDEEDEEEEEEEEEEEEEEE
                                      -EEEEEEEEEEEE
  14-E @every Third   position @350   :EVEEEVVEEEDEEDEEDEEEEEEEEEEEEEEEEEEE
                                      -EEEEEEEEE

 >SWISS-PROT: P87498   (sequence length 1912) 
  VIT1_CHICK (P87498) VITELLOGENIN I PRECURSOR (MINOR VITELLOGENIN) [CON
  26-S @every Third   position @1151  :SSSGSSSSSSSSSSSSSDSSSSSRSSSSSDSSSSSS
                                      -SSSSSSSSKSKSSSRSSKSNRSSSSSNSKDSSSSSS
                                      -KSNSKGSSSSSSKASGTR
  19-S @every Third   position @1152  :SSGSSSSSSSSSSSSSDSSSSSRSSSSSDSSSSSSS
                                      -SSSSSSSKSKSSSRSSKSNRSSSSSNSKDS

 >SWISS-PROT: Q90508   (sequence length 1704) 
  VIT1_FUNHE (Q90508) VITELLOGENIN I PRECURSOR (VTG I) [CONTAINS: LIPOVI
  19-S @every Third   position @1122  :SSSSSSSRRSRSSSSSSSSSSSSSSSSSSSSRRSSS
                                      -SSSSSSSSSSRSSRRVNSTRSSSSSSRTSS

 >SWISS-PROT: P02845   (sequence length 1850) 
  VIT2_CHICK (P02845) VITELLOGENIN II PRECURSOR (MAJOR VITELLOGENIN) [CO
  15-S @every Third   position @1184  :SKSSNSSKRSSSSSSSSSSSSRSSSSSSSSSSNSKS
                                      -SSSSSKSSSSSSRSR
  24-S @every Third   position @1194  :SSSSSSSSSSSRSSSSSSSSSSNSKSSSSSSKSSSS
                                      -SSRSRSSSKSSSSSSSSSSSSSSKSSSSRSSSSSSK
                                      -SSSHHSHSH

 >SWISS-PROT: Q98893   (sequence length 1687) 
  VIT2_FUNHE (Q98893) VITELLOGENIN II PRECURSOR (VTG II) [CONTAINS: LIPO
  14-S @every Third   position @1088  :SSSSSGSSRSSRSRSSSSSSSSSSSSSSRSSSSSSR
                                      -SSSSLRRNSKML

 >SWISS-PROT: Q10637   (sequence length 603) 
  Y03A_MYCTU (Q10637) HYPOTHETICAL GLYCINE-RICH 49.6 KD PROTEIN CY130.10
  16-G @every Third   position @183   :GAGGAGGLLFGSGGAGGPGGVGNTGTGGLGGDGGAA
                                      -GLFGAGGIGGAGGPGFNG
  14-G @every Third   position @266   :GGTGGTGGTGGGGGLFSNGGAGGAGGFGVSGSAGGN
                                      -GGTGGDGGIFTG
  17-G @every Third   position @497   :GDGGAGGNAGLLNGDGGAGGAGGLGIAGDGGNGGKG
                                      -GKAGMVGNGGDGGAGGASVVANGG

 >SWISS-PROT: Q50615   (sequence length 498) 
  Y0DP_MYCTU (Q50615) HYPOTHETICAL GLYCINE-RICH 40.8 KD PROTEIN CY1A11.2
  18-G @every Third   position @389   :GATGVGGAGGNGGTAGLLFGAGGAGGFGFGGAGGAG
                                      -GLGGKAGLIGDGGDGGAGGNGTGAKGG

 >SWISS-PROT: Q93074   (sequence length 2124) 
  Y192_HUMAN (Q93074) HYPOTHETICAL PROTEIN KIAA0192 (FRAGMENT).         
  14-Q @every Third   position @2028  :QQQQQQILRQQQQQQQQQQQQQQQQQQQQQQQQQQH
                                      -QQQQQQQAAPPQ

 >SWISS-PROT: P22577   (sequence length 103) 
  YDH3_HSVSC (P22577) HYPOTHETICAL 9.5 KD PROTEIN IN DHFR 3'REGION (ORF3
  20-P @every Third   position @12    :PGSPGGPGGPGGPGGPGGPGGPGGPGGPGGPCGPGG
                                      -PCGPGGPCGPGGPGGPGGPRSPVSSIGYLRFGS
  18-G @every Third   position @17    :GPGGPGGPGGPGGPGGPGGPGGPGGPCGPGGPCGPG
                                      -GPCGPGGPGGPGGPRSPVSSIGYLRFG

 >SWISS-PROT: P40002   (sequence length 666) 
  YEA7_YEAST (P40002) HYPOTHETICAL 72.5 KD PROTEIN IN GCN4-WBP1 INTERGEN
  14-N @every Third   position @330   :NTHVNNNNNNSNNSSNSNNSNNNNNNNNNNNNNNNN
                                      -NINNINNVNTNA

 >SWISS-PROT: O13695   (sequence length 536) 
  YEN1_SCHPO (O13695) HYPOTHETICAL 52.9 KD SERINE-RICH PROTEIN C11G7.01 
  15-S @every Third   position @58    :SSSSSSSPLSSSSFTSPASSSFITSLVSSSSQQSSS
                                      -SSASLTSSSSATLTS

 >SWISS-PROT: P53214   (sequence length 551) 
  YG1F_YEAST (P53214) HYPOTHETICAL 57.5 KD PROTEIN IN VMA7-RPS25A INTERG
  20-S @every Third   position @236   :SYSSSFSSSSSSSSSSSSSSSSSSSSSSSYFTLSTS
                                      -SSSSIYSSSSYPSFSSSSSSNPTSSITSTSASS

 >SWISS-PROT: P38835   (sequence length 840) 
  YHT1_YEAST (P38835) HYPOTHETICAL 95.1 KD PROTEIN IN ACT5-YCK1 INTERGEN
  14-D @every Third   position @793   :DDGDGDDGEDDDDDDDDDDDDDDDEDDDDDDDDDDD
                                      -DDDDDDDGQ

 >SWISS-PROT: P40467   (sequence length 964) 
  YIN0_YEAST (P40467) PUTATIVE 108.8 KD TRANSCRIPTIONAL REGULATORY PROTE
  14-N @every Third   position @845   :NHNNNNNDNNNNNNNNNNNNNNNNNSGNSSNNNNNN
                                      -NNNKNNNDFGIK

 >SWISS-PROT: P47179   (sequence length 1161) 
  YJ9P_YEAST (P47179) HYPOTHETICAL 118.4 KD PROTEIN IN BAT2-DAL5 INTERGE
  55-T @every Third   position @125   :TSTTTTKSSTSTTPTTTITSTTSTTSTTPTTSTTST
                                      -TPTTSTTSTTPTTSTTSTTPTTSTTSTTPTTSTTST
                                      -TPTTSTTSTTPTTSTTSTTPTTSTTSTTPTTSTTPT
                                      -TSTTSTTSQTSTKSTTPTTSSTSTTPTTSTTPTTST
                                      -TSTAPTTSTTSTTSTTSTISTAPTTSTTSSTFSTSS
                                      -ASASSV
  51-T @every Third   position @136   :TTPTTTITSTTSTTSTTPTTSTTSTTPTTSTTSTTP
                                      -TTSTTSTTPTTSTTSTTPTTSTTSTTPTTSTTSTTP
                                      -TTSTTSTTPTTSTTSTTPTTSTTPTTSTTSTTSQTS
                                      -TKSTTPTTSSTSTTPTTSTTPTTSTTSTAPTTSTTS
                                      -TTSTTSTISTAPTTSTTSSTFSTSSASASS

 >SWISS-PROT: P34340   (sequence length 305) 
  YK61_CAEEL (P34340) PUTATIVE CUTICLE COLLAGEN C29E4.1.                
  41-G @every Third   position @142   :GPAGPPGQQGPVGPQGFPGVVGTCGPSGDDGQPGPA
                                      -GPLGDKGAQGPKGFDGADGPDGMPGTAYFPGAVGQP
                                      -GEPGWLGQPGLPGKHGEPGQDGEEGPKGAPGTPGSN
                                      -GRDAYPGQPGKAGEPGAVGKDANYCPCPARRDS

 >SWISS-PROT: P35732   (sequence length 738) 
  YKF4_YEAST (P35732) HYPOTHETICAL 84.0 KD PROTEIN IN NUP120-CSE4 INTERG
  14-Q @every Third   position @376   :QANTVPQPQQQSQQPQQPQQPQQPQQPQQPQQQQQP
                                      -QQPQQPQQQLQQ
  15-Q @every Third   position @386   :QSQQPQQPQQPQQPQQPQQPQQQQQPQQPQQPQQQL
                                      -QQQQQQQQQPVQAQA

 >SWISS-PROT: Q03825   (sequence length 758) 
  YM38_YEAST (Q03825) HYPOTHETICAL 85.0 KD PROTEIN IN HLJ1-SMP2 INTERGEN
  16-Q @every Third   position @280   :QQIQQPQHQPQHQPQQQQQQQQQQQQQQQQQQQQQQ
                                      -QQQQQQQQHQQQQQTPYP

 >SWISS-PROT: P21260   (sequence length 141) 
  YPRO_OWEFU (P21260) HYPOTHETICAL PROLINE-RICH PROTEIN (FRAGMENT).     
  17-P @every Third   position @9     :PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
                                      -PPPPPPPPPPPPPPRRARIHHNIP
  17-P @every Third   position @10    :PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
                                      -PPPPPPPPPPPPPRRARIHHNIPL
  16-P @every Third   position @11    :PPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPPP
                                      -PPPPPPPPPPPPRRARIH

 >SWISS-PROT: P71933   (sequence length 778) 
  YQ04_MYCTU (P71933) HYPOTHETICAL 63.1 KD GLYCINE-RICH PROTEIN CY441.04
  18-G @every Third   position @529   :GNGGTGGAGGNGGRGGMLIGNGGAGGAGGTGGTGGG
                                      -GAAGFAGGVGGAGGEGLTDGAGTAEGG
  23-G @every Third   position @591   :GTGGLGGLGGVGGTGGMGGSGGVGGNGGAAGSLIGL
                                      -GGGGGAGGVGGTGGIGGIGGAGGNGGAGGAGTTTGG
                                      -GATIGG
  16-G @every Third   position @693   :GSGGAGGLIGWAGAAGGTGAGGTGGQGGLGGQGGNG
                                      -GNGGTGATGGQGGDFALG
  14-G @every Third   position @713   :GGTGGQGGLGGQGGNGGNGGTGATGGQGGDFALGGN
                                      -GGAGGAGGSPGG

 >SWISS-PROT: Q10707   (sequence length 434) 
  YY38_MYCTU (Q10707) HYPOTHETICAL 36.5 KD GLYCINE-RICH PROTEIN RV2098C.
  16-G @every Third   position @147   :GDLGAGGGGGDGGLGGRAGLIGHGGAGGNGGDGGHG
                                      -GSGKAGGSGGSGGFGQFG
 -----------------------------------------------------------------------------
 Total entries in the DataBase...     80000 
 Total amino acid recidues.......     29085965 
 Total repeats detected..........     131
 Minimum repeating units.........     15  
 Maximum mismatch ...............     10% 
 ---------------------------------------------------------------------------

(c) Division of Biochemical Sciences, National Chemical Laboratory, Pune 411008, INDIA