TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Streptomyces coelicolor, A3(2)
Gene type: CDS

Number of genes found: 304

Free access
Sort by:

 



# Streptomyces coelicolor, A3(2)

>gid:871351  23.ORF4  hypothetical protein
MAPKQGLLKIADLTPGPVGIHGGILSGVDTTIKIGAETRDKLAALAEARN
MSMRALIEEFAATALTPAQLRERAERTDAFLAAEFGHRVSDDEAGALRER
MRQAQNAGQTRARGTAA
>gid:387477  SCO0005  putative transposase
MVEVGAHVAYRGQTWQVAALQGQQVYLLQEDGTETRLLLGRLFADPGFEV
VGARAPDTVPQWGLFETVPLAAQQRALAWLPHIREVETGWPHPEGSREGQ
AMRPEYDPERWTLAQRDAAKAKELAALGFARVTRTTVERMRHGYRKQGLW
GLVDKRAVPARGRHPTGYADERVVAAVLEALRRQRGRSKGTVKGLQVLVG
QILEDTHGRGVVEMPSRSTFYRLVSVLADPADRPGRPARTATAPARASSA
PVVLRPGEQVQIDTTRLDIMAILEDGSLGRPELTIAVDVATRSILAAVLR
PYSTKAVDAALLLAEMAVPHPARPTWPSALHLSRAEVPYERMLSLDERLE
GAAARPVVVPETIVVDRGKIYLSQGFVAACEMLGVSVQPAPPRRPQAKAV
VERTFGAINDLFCQHVAGHTGSNPQRRGLTTAAETRWTIPQLQDFLDEWI
TCGWQNRPHDGLRHPVLPKTALTPNQMWAALITVSGYVPVPLSGADYLEL
LPVRWQPITERGIRLDYRTYNHDILDPHRGQRSPVASKDGKWEVHHNPHD
ARQIFVRLTDGQLHEIPWIHRDHVHQPFNEAIWRHVQAEVEQRGDRDQHE
ADLADALDQLLRRTRHLAETEQKTRRRRATRSGTAAQLPDLPGQRRPFDA
ETAPAPAPDWSESLDDLISVDTAAQTGTSEMEGASVPPAEAGGYGLWDAE
AEAEQW
>gid:387500  SCO0020  putative transposase
MGSLLLCSVELVSLLPQLSDVHVVSVEVSDAVVAVYARTRSGEPAGCTGC
GRLSEWCHSRYARRLADVTLAGRPLRIDLSVRRLYCENTTCPKVTFTEQV
PGLTVRYQRRTSRLQSLVEDVGVVLAGRGGSRMLRILGIKLSRVALLSQL
MRVPLPPLVTPQVLGVDDFALDGGTYGTLLVDATTRLPLTLWEGRDAKQL
GRWLREHPGIDIVCRDGSLTYRQGITAGAPEAVQVSDRFHLWQGLSRRVQ
DIASAHRGCLPAALPTVSEDDHPPVEETTQNAAADSRAGRHAQSLFEAIQ
ALTCTGRSHSSVARELGLDRRTVRKYARARTWQEVMRRPPRKPSMLDPYL
DYLRQRWDDGQHSAKILHEELQTKGYLGHYQRVKMAVAPLRRGLPIDEPC
ERPPSPRETARWITTHPHRRSPHVNERLPRLLDHCPELKLTHDLVRRFAT
MLDNRDAAPLPGWLGDLKKSGLGPLVGFAGALHEDRHAVAQGITTPYNSG
VNEGRITDVKLQKRLMAGRAGVRLLRHRVVLIAHLRRRHADRPTVPPR
>gid:387505  SCO0022  putative IS element ATP-binding protein
RLSRSSHHKTLDEYDFSFQPDLDPRKVRDLATLSFVEGRANAALLGVLVA
DEVGYQPLERAEARLVFQVISKRYEKGSIILASNKTFSEWGQVGPGFGDE
VLATALLGRLLDHCDVISINGPGYRLRNRLKAIERENDVA
>gid:387506  SCO0025  putative IS element ATP-binding protein
MDPAVIHTPATCEWGKEGLPLCLIGDSGTGKSHLLSALGTEASMAGYRAS
GTFRTRGDRALPGLGFLEHVPV
>gid:387581  SCO0083  putative transposase
MPCPARELSVRFGVGTSPWIVSDELWDRVEPLLPQRERRFRHPGRKPLPD
RDVLCGILYVLHTGIPWEYLPRGLGFGSGMPCWRRLRDWNEAGVRQRLHE
ILLAELNAAARLDRSRCVVDSPRQGAKGGQHTGPSPVDRGRAGSKHHLIT
DGHGTPLAVLLTGGNRNDVTQLLPLLDAIPPVRGPVGRPRRKPDSLFADR
GYDHDIYRDQARTRGIVPAIARRGTLHGAALGTYRWVVERSLAWLHGFRR
LRIRWERRADIHEALLRLACCLITHRQLRALR
>gid:387594  SCO0091  putative insertion element IS1652 transposase
MVHRNAPLTETGRLRLARCVVEDGWPVRRAAERFQVSHTTASRWARRYRQ
LGVTGMSDRSSRPHHQPRRTAAAVEEHVLRLRREHRIGPLRLAVRCGIAA
STAHRILVRHGLPPLAALDRATGEPVRRYERARPGELVHIDVKKLGRIPD
GGGHKTLGRAEGHRSRTNGAGWAYLHTALDDHSRIAYTEDLPDETAPTCA
AFLVRATAYFASLGIRIERVLTDNAWAYSKNTWRNTCRDLDISPRWTRPW
RPQTNGKVERFHRTLLDEWAYQKPYTSDHERREAFTHWLHWYNYHRPHTG
IGGHTPASRGTNLSEQHS
>gid:387601  SCO0096  putative noncomposite transposon transposase
MLRSEVVDVQQLAEYRYRTVREVLGGSPIGEVAARYGTSRQTLHTWRRRF
VLEGMPGLLDRSRRPRNSPAGLSAEMEAEICELRRRHLRWGARRIAHELS
VRGLESAPSRATVHRVLSRNGLVRAQEQQHPRKYRRWQREAPMHLWQMDL
VGGVPLADGRECKMVTGIDDHSRFVVIASVVAVPSARAVCSAFAAAMRRY
GVPFEVLTDNGKQFTGRHQRPQPVEVLFERICRENGITQRLTKPRSPTTT
GKIERSHRTLREEFLDHVVPFESLAAAQQAIDG
>gid:387606  SCO0098  putative transposase
MSERKPYPSDLSDARWALIEPTLTAWRNARLERRPTGKPAQVDLRDVFNA
ILYLNRTGIPWKYLPHDFPGHGTVYFYYAAWRDEGIFTQLNYDLTALARV
KEGRKPEPTASVIDTQSVKTSTNVPLTSQGTDAAKKIVGRKRGILTDTIG
LILAVTVTGAGLSENAVGIRLLDQAKRTYPTIVKSWVDTGFKNAVIEHGA
TLGIDVEVVNRNPEKRGFHVVKRRWVVERSIGWIMMHRRLARDYETLTTS
SEAMIHIASIDNLAKRITDETTPTWRGTY
>gid:387625  SCO0106  putative insertion element transposase
MSLTDAQWARIEPLLPDRTPRRAGQWRDHRQVIDAIVFKYRTGTPWMELP
ERFGSWKGAHNRLRMWAVDGTWEKVFTALLAQADAEGDLDWVIPVDSAVV
RAHQHAAGARQKGLRPASRTTMPSDGPAAD
>gid:387981  SCO0278  MutT domain containing protein
MTSSDHPLKASLDEPRAHLLDLIGAVEPWDDLEHTHLESARHWIAGGAPL
YRVRKPDVPPMHLVSYFAVLDDTRGQLLLVAHRKAGLWLPAGGHVESGED
PWAAVVRECREELGIEAVASPVAGELPFFLTVTGTRGQGPHTDVSLWYLL
DADAHTVTDYDRGEFSAVRWLTHEQVLAASAELLDPHMHRFARKLAWARA
GRGG
>gid:387985  SCO0280  conserved hypothetical protein SCF85.08c
MKARPGVDAPDHRGRTGLDRAGRELDRNPGVVVRDVELTSQGWHVLRRTT
FDYRRRDGRWETQARETYDRGNGAVVLPYDTGRGRVLLTRQFRYPAYVND
HPDGMLVEAAAGLLDADDAPAAVRREAAEELGVALGPLTHVLDAYMSPGS
VTERLHFFAAPYTPADRTGTGGGVEEEGEDIEVLEMPFTEALAMTRDGRI
ADGKTILLLQWAALHGPFAPHPADMADADNTGT
>gid:388085  SCO0325  hypothetical protein
MRVTGTSAHERDGVDVIVRMSGSAYDSWAGFEEITVRSCFTVRVSPASRW
REDPDDVDCPEDPALTFAPPVRGNHFDPQDCLLARVVPGATEVRGAAPHP
PDARSGRPQVACALDPAAAPH
>gid:388171  SCO0368  transposase
MVHRNAPLTETGRLRLARCVVEDGWPVRRAAERFQVSHTTASRWARRYRQ
LGVTGMSDRSSRPHHQPRRTAAAVEEHVLRLRREHRIGPLRLAVRCGIAA
STAHRILVRHGLPPLAALDRATGEPVRRYERARPGELVHIDVKKLGRIPD
GGGHKTLGRAEGHRSRTNGAGWAYLHTALDDHSRIAYTEDLPDETAPTCA
AFLVRATAYFASLGIRIERVLTDNAWAYSKNTWRNTCRDLDISPRWTRPW
RPQTNGKVERFHRTLLDEWAYQKPYTSDHERREAFTHWLHWYNYHRPHTG
IGGHTPASRGTNLSEQHT
>gid:388389  SCO0474  putative lipoprotein
MEPTVRTHPTRRLPTALVLTAALLATGCSEQSDDGREPAATESSGTPHGY
VEGAREAAEEQSRLILGDPGTGASRVLDLVTGKVHESEPVTGADGLSTDG
RFGFFHTDRGTHVVDGGAWTVDHGDHVHYYRAAIRHVGDLPGGRGAEIRS
DAAVTAATDPEGRTTLYDRTALEKGEVGPARTLDGVHAGAVVPYEEHLVA
LAGDDESAEVAVYDRSGRRVASPDATCERPAGDAVTRRGVVLGCADGALL
VSAEDGTFTAERIPYGQDVPEKERATAFRHRAGSDTLTAPAGERAVWVLD
VTDRDWTRVRTTGPVLAANTAGEGSPLLVLESDGALHGYDIATGRHTART
EPLLTRTAKTPAGAAAPVVEVDRSRAYVNDRTGKRVYEIDYNDALRVARS
FDLDIAPGLMAETGR
>gid:388466  SCO0504  putative DEAD-box RNA-helicase
MNAKPTASFDGLGLPPVLVETMTSLGVTRPFPIQAATLPEALAGRDVLGR
ARTGSGKTLAFGLALLAGTAGRRAEPKRPLALVLVSTRELAQQVSDALAP
YARALGVRLTTVVGGLSINRQTQALRDGAEVVVATPGRLTDLVSRRDCHL
NQVRITVLDEADQMCDLGFLPQVSGILDQVPSDGQRLLFSATLDGDVDQL
VRDHLHDPVPVSVDPASASVSTMEHHVLTVHPADKYATATEIAARDGRVL
MFLDTKAGVDRFTRELRAAGVSAGALHSGKSQPQRTHTLARFVEGGVTVL
VATNVAARGIHVDDLDLVVNVDPPADAKDYLHRGGRTARAGRAGSVVTLV
TPDQRREVNRMMSEAGIRPTVTPVRSGEQKLTDLTGAKRPPAGRGKESGN
APFRGMGTRPAGAAKGSRKAVEARRAAEARAAARVRKGR
>gid:388622  SCO0567  insertion element transposase
MSERKPYPSDLSDARWALIEPTLTAWRNARLERRPTGKPAQVDLRDVFNA
ILYLNRTGIPWKYLPHDFPGHGTVYFYYAAWRDEGIFTQLNYDLTALARV
KEGRKPEPTASVIDTQSVKTSTNVPLTSQGTDAAKKIVGRKRGILTDTIG
LILAVTVTGAGLSENAVGIRLLDQAKRTYPTIVKSWVDTGFKNAVIEHGA
TLGIDVEVVNRNPEKRGFHVVKRRWVVERSIGWIMMHRRLARDYETLTTS
SEAMIHIASIDNLAKRITDETTPTWRGTY
>gid:389227  SCO0842  putative deoxyribodipyrimidine photolyase
MATSVVLFTRDLRLHDHPPLRAALDRSDAVVPLFVRDRAVDAAGFAAPNR
LALLADCLRDLDSGLRDRGGRLVVRSGDLVEEVCAVAGEAEADEVHLAAD
VSAHAHRREELLRSALSARGCRLHVHDAVTTVLAPGSVTPASSDHFAVFT
PYFRRWSERAVREPLAAPRRVRVPDGVGSEPLPARGDLTGLSPGLPTGGE
RDARKRLTAWLRGGIADYADHHDDLAGDATSRLSPDLHLGALSPVELVHR
ARRAGGPGAEAFVRQLAWRDFHHQVLAARPHASAADYRTRHDRWRTGAAA
EADAAAWRDGRTGYPVVDAAMRQLRHEGWMHNRGRLLVASFLTKTLYVDW
RVGAWHFLELLVDGDVANNQLNWQWAAGTGTDTRPHRVLNPTTQARRFDP
GGTYVRRWVPELAGLAGRSVHEPWRLARAARAAYDAYPDPIVDLSEGLDR
FRRARGRD
>gid:389368  SCO0918  putative excinuclease ABC subunit A
MHSPHDPYVRVRDAREHNLKGVDVDVPRDVLAVFTGVSGSGKSSLAFGTV
YAEAQRRYFESVAPYARRLIHQIGAPKVGGITGLPPAVSLQQRRATPTSR
SSVGTVTNLSNSLRMLFSRAGTYPPGAERLDSDAFSPNTAAGACPECHGL
GRVHRTTEELLVPDPSLSIRDGAIAAWPGAWQGKNLRDVLDALGYDVDRP
WRELPAEAREWILFTDEQPVVTVHPVREAGRIQRPYQGTYMSARRYVLKT
FADSKSATLRAKAERFLNSAPCPVCGGGRLRAEALAVTVGGRTVAELAAL
PLAELPRLLPTEGEAAKVLAEDLISRIAPVVELGLGYLSLDRPTPTLSAG
ELQRLRLATQLRSGLFGVVYVLDEPSAGLHPADTEALLTVLERLKAAGNS
VFVVEHHLGVMRGADWIVDVGPLAGEHGGRVLYSGPVDGLARVAESATAR
HLFDRSPAPVREVRTPRGAVTIGPVTRHNLRGVTVRVPLGVLTAVTGVSG
SGKSTLVGEITEDLPGVDRLVSVDQRPIGRTPRSNLATYTGLFDVVRKVF
AATDRARERGFGVGRFSFNVPGGRCETCQGEGFVSVELLFLPSTYAPCPD
CGGARYNPETLEVAYRGRNIAEVLDLTVEGAAQFFTDTPAVARSLAALLD
VGLGYLRLGQPATELSGGEAQRIKLASELQRGRRGHTLYLLDEPTTGLHP
ADVEVLMRQLHGLVDAGHTVVVVEHDMSVVAGADHVIDLGPGGGDAGGRI
VAEGTPARVAGAAGSATAPYLARALDGGERR
>gid:389444  SCO0945  putative formamidopyrimidine-DNA glycosylase
MPELPEVEALRDFLTEHLTGREIVRVLPVAISVLKTYDPPLTALEGHRVA
AVHRHGKFLDVETAGGPHLVTHLARAGWLHWKDSLPSGLPKPGKGPLALR
VALETGAGFDLTEAGTQKRLAVYVVADPRQVPGVARLGPDPLAADFDEAR
FAALLDGERRQLKGALRDQSLIAGVGNAYSDEVLHAAKMSPFKLAASLTD
EETARLYAALRDTLTEAVERSRGIAAGRLKAEKKSGLRVHGRTGEPCPVC
GDTIREVSFSDSSLQYCPTCQTGGKPLADRRMSRLLK
>gid:389514  SCO0972  putative transposase remnant
MADRGYDHDTYRRVLWQRAIRPDIARRHEPHGTGPGLFRYVVERTIAWLH
GLRRPRIRRNDATTSTKPPSDSPSARSLTATSNGVAGNSKRPGQGSGTQV
AHPGEDGGAVVLHR
>gid:389586  SCO1000  integrase
MGSFFKECGCSRPTRCPHPYTIRFRDALGKQREEAGYDRQDDAIERLTQI
YAEKKITAPSVAEVRRELGQQTIVEYAKQWRPRQRKMTEYSTGWHVDSSI
NVHIVPRLGSRKLNSVTPIVVERFLDELETDGVGRGNQVNIFRVLKAILR
DAYGKGAMAADPVRGVQEPEYVREKVVIPSLAYVKKALAAADEHLAVEIV
MMVGCGLRNGEARAVNVNNVVADDVYRVHEQIHSNTHRPAKLKHRRAGEF
REVPLPRSVREAMERYEEKHGTTKEGYLLRGPSGYYTEPMERRRVQKLFK
GLPAEDGVGMYGFRHYFASNALGNGIPITDVAEWMGHKSIEETYRTYRHL
MPGSITKAARVLDAGLWDAA
>gid:389597  SCO1005  insertion element
MFDTEDVGVFLGLDVGKTAHHGHGLTPAGKKVLDKQLPNSEPRLRAVFDK
LAAKFGTVLVIVDQPASIGALPLTVARDAGCKVAYLPGLAMRRIADLYPG
EAKTDAKDAAVIADAARTMAHTLRSLELTDEITAELSVLVGFDQDLAAEA
TRTSNRIRGLLTQFHPSLERVLGPRLDHQAVTWLLERYGSPAALRKAGRR
RLVELVRPKAPRMAQRLIDDIFDALDEQTVVVPGTGTLDIVVPSLASSLT
AVHEQRRALEAQINALLEAHPLSPVLTSMPGVGVRTAAVLLVTVGDGTSF
PTAAHLASYAGLAPTTKSSGTSIHGEHAPRGGNRQLKRAMFLSAFACMNA
DPASRTYYDRQRARGKTHTQALLRLARQRISVLFAMLRDGTFYESRMPAG
VELAA
>gid:389617  SCO1013  putative mut-like protein
MATPDFIRDLRASAGHQLLWLPGVTAVVFDDEGRVLLGRRSDNGRWSLIG
GIPEPGEQPAACAVREVEEETAVQCVVERLVLVQALKPVTYDNGDVCQFM
DITFRCRAVGGEARVNDDESLEVGWFEVDALPDIKEFGQTRVKQALSDAP
TWFEPTGSF
>gid:389672  SCO1040  putative DNA repair protein
MDAELFPRTRAETAPGAVHLPDWLSPGQQRELLDACREWARPPAGLRTVR
TPGGGTMTARQVCLGRHWYPYGYAATAVDGDGAPVKPFPARLDGLARRAV
TDALGAEAVAPAPYDIALINFYDADARMGMHRDADERTDAPVVSLSLGDT
CVFRFGNPETRTRPYTDTELRSGDLFVFGGPSRLAYHGVPRVHPGTAPPE
LGLRGRLNITLRVSGF
>gid:389763  SCO1075  putative bifunctional protein (ATP/GTP binding protein/MutT-like)
MTVVWINGAFGAGKTTTARELIELIPNSTLFDPEVVGGALAHLLPPKRLA
EVGDFQDLPIWRRLVIDTAAALLADLGGTLVVPMTLLRQEYRDEIFGGLA
ARRIEVRHILLAPAETILRERIASREVPRDLPDGELRTRQWSYDHIEPYR
AALASWLTADAHPVDTGALTPYEAAARVAEAVGSGAVPACEIVQTPEPTG
ETVAAGVLLFDERDRVLLVDPTYKPGWEFPGGVVEPGEAPARAGMREVAE
ETGLSLRDVPALLVVDWEPPAPPAYGGLRLLFDGGRLDAADAGRVLLPGP
ELRDWRFATEDEAAGLLPPVRYERLRWALRARERGAAHYLEAGTPVG
>gid:389854  SCO1114  uracil DNA glycosylase
MAPRPLNEIVEAGWAKALEPVAGQITSMGEFLRAEIAAGRTYLPAGANVL
RAFQQPFDDVRVLIVGQDPYPTPGHAVGLSFSVAPEVRPLPGSLVNIFRE
LNTDLNLPQPANGDLTPWTRQGVLLLNRALTTAPRKPAAHRGKGWEEVTE
QAIRALAARGKPMVSILWGRDARNLRPLLGDLPALESAHPSPMSADRGFF
GSRPFSRANDLLMRLGGQPVDWRLP
>gid:389950  SCO1152  putative helicase
MRTADSPITTRDRQDGGVTLIDQLPRTADPDALYEAFEAWAQERGLTLYP
HQEEALIEVVSGANVIVSTPTGSGKSMIAAGAHFAALARDEVTFYTAPIK
ALVSEKFFELCKIFGTENVGMLTGDASVNSDAPVICCTAEVLASIALRDG
KHADIGQVVMDEFHFYAEPDRGWAWQIPLLELPQAQFVLMSATLGDVSFF
EKDLARRTDRPTAVVRSATRPVPLSYEFRYTPLTETLTDLLAARQAPVYI
VHFTQAQAVERAQALMSINMCTREEKQRIAELIGNFRFTTKFGRNLSRYV
RHGIGVHHAGMLPKYRRLVEKLAQAGLLKVICGTDTLGVGVNVPIRTVLF
TALTKYDGTRVRTLRAREFHQIAGRAGRAGFDTEGFVVAQAPEHVVENEK
ALAKAGDDPKKRRKVVRKKAPEGFVAWSESTFDKLIGSQPEPLTSRFRVT
HTMLLSVIARPGDAFSAMRHLLEDNHEPRRQQLRHIRRAIAIYRSLLDGG
IVEKLDQPDAEGRIVRLTVDLQQDFALNQPLSTFALAAFELLDPESPSYA
LDMVSVVESTLDDPRQILAAQQNKARGEAVAAMKADGVEYEERMERLQDI
TYPKPLEELLFHAYDTYRRSHPWVGDHPLSPKSVIRDMYERALTFTELVS
HYELARTEGIVLRYLASAYKALDHTVPDDLKSEDLQDLIEWLGEMVRQVD
SSLLDEWEQLANPAEMTAEEAQEKADQVRPVTANARAFRVLVRNAMFRRV
ELAALDQVGELGEMDADAGWDADAWGEAMDKYWDEYDDLGTGPNARGPKL
LVIEEEPQNALWRVRQIFDDPNDDHDWGISAEVDLTASDAEGRAVVRVTD
VGQL
>gid:389982  SCO1167  putative helicase
MEAVSVPAVLLPVRAALPVLTRARARANGHRASVFWGAVAVEALHLVARG
LLLPGLSAAEHDAWRVGPLDAEGTERVRHLAAAMPPEAHAVPVGGAGPLR
LPDPEPLVRSFLDAVADTLPRSPAAELLTAGPAYASWAPRPSPELRGWAL
DVAAGHDTGVRLSLRVEVRGLSAAGPQNARPTFRAVPQVHSVSDPGLVAD
TAQVWGGTVGEAFGTRARMDALLALRRAARAWPSLTPLLSATVPDAVELT
DEEITELLGAGSRALAAAGVDVHWPRELARDLTERAEVGPPDGSRASRAA
GPEAGPSFLSADALLAFNWTFALGDRTLTREELDLLAEANRPLVRLRDQW
VLVDPEEAHRARARQDHKVTPVDALAAALTGSAEVDGHRVEVRPTGWLAS
VRERLADPETQEPVAQPEALDATLRDYQRRGLNWLARMTSLGLGCCLADD
MGLGKTITLIALHLHRQSDAEAAGPTLVVCPTSLMGNWQREIERFAPGTP
VRRFHGPRRDLDGLADGEFVLTTYGTMRLDAERLSAVSWGMVVADEAQHV
KNPYSETARRLRSIGARARVALTGTPVENNLSELWAVLDWTTPGLLGRLG
SFRRHYAQAVEEGQDPAAAERLARLVRPFLLRRRKSDPGIAPELPPKTET
DHPVSLTTEQTGLYEAVVREALAEIAGADHMARRGMIVKLLTNLKQICNH
PAQFLKEDRPKITGRSGKLELLDELLDTILSEQASVLVFTQYVQMARLLE
QHLAARGVSSLFLHGGTSVTARESLVRRFQDGDAPVFLLSLKAAGTGLNL
TRAEHVVHYDRWWNPAVEAQATDRAYRIGQTRPVQVHRLIAEGTIEDRIA
ALLNRKRELADAVLGSGEAALTELTDAELADLVELRGGTR
>gid:390076  SCO1202  putative DNA ligase
MLLARLAQVSREVAATSARSRKTVLLAELFREAEAADVPVVIPYLAGRLP
QGRIGVGWKVLSRRVPPADAPTLTVRDVDARLTRLGAVSGAGSQAERTRL
VGELMGAATEDEQRFLIGLLTGEVRQGALDAAAVEGLAAATDAPPADVRR
AVMLAGSLQTVAEALLADGPGALDRFRLTVGQPVLPMLAHSASSVAEAVG
KLGAAAVEEKLDGIRVQVHRDGGTVRIYTRTLDDITDRLPEVTEAALALP
GERFILDGEAISLDADGRPRSFQETAGRVGSRTDVATAARAVPVSAVFFD
VLSVDGRDLLDLPLTERHAELARLVPEPLRVRRTLVHGPEDTGAAEEFLA
RTLARGHEGVVVKGLDAAYSAGRRGASWLKVKPVHTLDLVVLAAEWGHGR
RTGKLSNLHLGARTADGSFAMLGKTFKGMTDALLTWQTERLKELAVEEHG
WGVTVRPELVVEIAYDGLQRSTRYPAGVTLRFARVVRYREDKRPEDADTV
DTLLAAHPGVAP
>gid:390079  SCO1203  putative MutT-like protein
MRRSAGLLLHRRAPGGGLQVLLGHMGGPFYSRRDAGAWTVPKGEYDSGEP
AWEAARREFEEELGLPPPEGEAVPLGEVRQAGGKLVTVWAVAADLDPATV
VPGTFRMEWPPRSGRTEEFPELDRVAWFGLDRAREVIVKAQAAFLDRLAE
HSH
>gid:390220  SCO1255  G/U mismatch-specific DNA glycosylase
MLFCGINPGLMTAATGHHFARPGNRFWPVLHLSGFTPRLLKPSEQDELPS
YGLGITNVVARASARADELTAEEYREGGRLLARKVARLRPGWLAVVGVTA
YRAAFDEPKARVGPQERTFGSTRVWVLPNPSGLNAHWTAQTMAQEYARLR
EAAHGTEAQA
>gid:390341  SCO1300  putative exonuclease
MRLHRLDITAFGPFGASQSVDFDDLSAAGLFLLHGPTGAGKTSVLDAVCY
ALYGSVPGARQGGTGQGMTLRSDHAAVGTRTEVRLELTVAGRRLEVTRQP
PWERPKLRGKGTTVDKAQTWLREYDATTRAWKDLSRSHQEIGEEITQLLG
MSREQFCQVVLLPQGDFARFLRADAEARGKLLGRLFDTRRFAEVEKRLAE
RRRTTEARVREGDAALLADAHRMQQAAGDAMALPELAPGEPGLAEAVLEA
AAVARGTARELLTVADCRLTAAESAQAAAERRLADARELDRLQRRFAQAR
EHAARLAERADAHRAAEERMERARKAEAVAPALELRDAADSEHRRAAAAE
ASARALLPPGLADAGAAGLAAAARRAAEELGGLESARRAERRLAELVEER
AGLDRQERADDEVRQDAETWLADWETTRTGLQSRVDSAQEAAARAEQLAL
QREPARRRLDAARLRDRLTDDTEEARRRALAAAEDAVEARAHWLDLKEQR
LHGIAAELAANLTDGAPCAVCGATEHPAPARKTAGHVDRDAEERALAGHQ
AADERRAKAERHLGTVREALAAATAEAGDAATDRLAEEAEELEGTYARAR
ATASTLHAAQEELRRAEGEREQRVAAQQQAVVRSASRVAGRDRLEREQAS
LEEELARARDGAESVTARAAQLERQAALLTRAAETARVAEDTAQRLKDAD
ARLADAAFRARFDTPADAAAALLDDTAHRELQRRLDAWQSEDAAVRAVLA
EADTAEAARRPPADLAAAERAAADAGRRLREAASARDAAARCCAELDRLS
AHATTSVRRLAPLREEHTRVARLASLAAGTSVDNERRMRLESYVLAARLE
QVAAAATARLLRMSSGRYTLVHSDGRTGRGRSGLGLHVVDAWTGRERDTA
TLSGGETFFASLALALGLADVVTDEAGGVRLDTLFIDEGFGSLDDQTLDE
VLDVLDALRERDRSVGIVSHVADLRRRVHAQLEVVKGRSGSVLRQRGAG
>gid:390343  SCO1301  putative exonuclease
MRLLHTSDWHLGRAFHRVNMLGAQAGFVGHLVETVREHSVDAVVVSGDVY
DRAVPPLAAVELFDDALHRLAGLGVPTVMISGNHDSARRLGVGAGLIDRA
GIHLRTDPAGCGTPVVLGDAHGDVAFYGLPYLEPALVKTEFGVEKAGHEA
VLAAAMDRVRADLATRAAGTRSVVLAHAFVTGGEQSDSERDITVGGVAAV
PAGVFDGVDYVALGHLHGCQTLTERVRYSGSPLPYSFSEHRHRKSMWLVD
LGADGSVTAERIDCPVPRALARLRGTLADLLADPELTPHEDAWVEATLTD
PVRPDEPMARLTERFPHTLSLVFDPERAPDEPGVSYARRLADRSDQQIAE
DFVTHVRGAGPDDDEQAVLRDAFDAVRADETVREVAR
>gid:390400  SCO1322  hypothetical protein
MSVTARPIPDAPHSALADTVLERITLEYPAAADPERAVGLRAYMKDVAPF
LGMTSPVRRSLSRAVLAGVPRPDEPDCTAVALRCWRLPEREYHYFAVDYL
RRYVTHCSSGFLPVVRHLLTTVPWWDTVDLLAAHVVGAWSPPTAASRPTW
TPGSMTRTAGWSARPSSTSCGTRNAPTPTGSSATACAGPATATSSSARPS
AGACASTPGPTRTPCAPSWPSTAPASRRCPRGRRCGPSARSAGAVPLSNG
RSIKNHSTCDEASAMICLMFRYAFHLAASAVADAPKAAVAHFTAAVDGAR
S
>gid:390446  SCO1343  uracil-DNA glycosylase (EC 3.2.2.-)
MTDIAMLPESWREVLGGELQQPYFKELMEFVEEERANGPVYPPREEVFAA
LDATPFDRVKVLVLGQDPYHGEGQGHGLCFSVRPGVKVPPSLRNIYKEMH
AELDTPIPDNGYLMPWAEQGVLLLNAVLTVRAGEANSHKSRGWELFTDAV
IRAVAARTDPAVFVLWGNYAQKKLPLIDEARHVVVKGAHPSPLSAKKFFG
SRPFTQINEAVAGQGHEPIDWTIPNLG
>gid:390550  SCO1380  putative DNA damage inducible protein
MDAFFASVEQASKPSLRGKAVVVGGLGPRGVVATCSYEARVFGVHSAMPM
GQARRLAPHAAYLVPRFELYRSISEQVMRLLRELSPLVEPLSLDEAFVDL
DAGGAARDAETARLAGTKLRTDIRTVTGLTGSVGLAASKMLAKIASEAAK
PDGLVLIPPGTERAMLEPMTVRTLPGVGPATGDHLRRAGITTVGEIAEAG
EDELVRLLGKAHGHALYAMALARDERPVVAERETKSVSVEDTYDVDIHDR
VRVGVEVGRLADRCVRRLRASGLSGRTIVLKVRRYDFSTLTRSETLRGPT
DDPAVVREAAARLLDSVDTTGGVRLLGVGVSGLADYTQEDLFAQAAGDRA
EEPAEEPGTEPAEAHSPSPAERRWPSGHDVRHTELGHGWVQGSGLGRVTV
RFETPYSGVGRVRTFLVDDPELTPADPLPLVADTEGGAGQPSSGPLPLPA
SLPKSWSGGGGAAATSRP
>gid:390757  SCO1468  putative serine/threonine protein kinase
MVTGFCDTCFRRPLAPEPTATPPDPDPDPAGPDARGPRPAQGPGAPAAGE
LDRDGLLVLPHLPSPDPSEAADTVARPPTGGRRCGVDNCAGTIGVSYDGG
PAPDHGFCPECGTEYSFRPKLRPGDRVAGHYAVLGYLAVGGHGWVYLAED
TRVPGLHVVLKGLINTGDAVARRAAVEERRSLTTLHHRDIVRIVTHVQHQ
VPGDAEPTGYIVMEYVGGRSLSWIRFAPEEELARLFGTGGFEFGHVITYG
CKILGALEYLHDRGLLYCDMKPENVIHYGREIKVIDLGAIRRIDDRSSGL
VHTHGYAPPKRERDRRGLDVDSDLYTVGRTLKVLAERAARPAGLAARSFE
ALIRRATHPEPAARFRSAAEMSRQLWEVLREDQALGGREPYPERSTRFEP
TAAVFGAALGTVPALQWWTRRPGTGTPELPAGAPEPRAAARALPVPLPDA
SDPAAVLLGGLAADTPDRIAERSAGDPALRTVETALWLCRAYLEAGDAAR
AEEWVARAKGWSGDYDWRIFWHRGLIHLTRDAVDKAEDEFAATYAALPGE
AAPKLALGFCAEYLAERPRESAGEGGGPADAQAAARMRARQAQAEEFYEA
VLRRDPTQGSAAFGLARVRLRRAGRRPAVDVLDGVPTTSRHYDAARVAAV
RILTGRLPDRPAPLAAELREAAERLAGLHLDGSGSWDRLVTELREHALAC
RPPGGWGSGFPAGELCGPQDTEEALRRLLSASLRRLADQAGGVGERGDLL
DTAYAVLPAPAGLRELVRGWRRTA
>gid:390762  SCO1471  putative transposase
MGEGLQVRCGLRHLHRVQEALVVHRNAPLTETGRLRLARCVVEDGWPVRR
AAERFQVSHTTASRWARRYRQLGVTGMSDRSSRPHHQPRRTAAAVEEHVL
RLRREHRIGPLRLAVRCGIAASTAHRILVRHGLPPLAALDRATGEPVRRY
ERARPGELVHIDVKKLGRIPDGGGHKTLGRAEGHRSRTNGAGWAYLHTAL
DDHSRIAYTEDLPDETAPTCAAFLVRATAYFASLGIRIERVLTDNAWAYS
KNTWRNTCRDLDISPRWTRPWRPQTNGKVERFHRTLLDEWAYQKPYTSDH
ERREAFTHWLHWYNYHRPHTGIGGHTPASRGTNLSEQHS
>gid:390772  SCO1475  putative primosomal protein n'
MSSENGPEQGGAQDAPPEQLALIRETVRRTAAPRAKPRTWRGAALAKELP
VARVLVDKGVLHLDRYFDYAVPEELDAEAQPGVRVRVRFGAGRHRVRDGR
REGGGLIDGYLIERLAESDYAGPLAALAQVVSPERILDEELLGLVRAVAD
RYAGSVADVLQLAVPPRNARAEKRASPQPLPAPPVPEPGSWARYEQGAAF
VAALASGGAPRAVWNALPGPQWTDELARAVAATLASGRGALVVLPDGRAV
ARADAALTALLGEGRHAVLTADAGPEKRYAQWLAVRRGAVRAVIGTRAAM
FAPVRDLGLVALWDDGDDSHSEPHAPQPHAREVLLLRAAQDRCAFLLGGW
SCTVEAAQLVETGWARPLIAAREQVRAAVPLVRTVGDQDLARDEAARAAR
LPTLAWQAVRDGLRHGPVLVQVPRRGYVPRMACAACRTPARCRHCSGPLE
GQESGSALRCGWCGREESAWHCPECGAFRLRAQVVGARRTAEELGRAFPA
VPVRTSGREHVLDTVSETPALVVSTPGAEPVAEGGYAAALLLDGWAMLGR
PDLRAGEDALRRWLAAAALVRPQSAGGTVVAVAEPTLRPVQALVRWDPVG
HALRELAERAELGFPPVSRMAAVAGPPDAVTGFLDAVELPREAEVLGPVP
LPVTPAGRPRRVGAPPPGEHWERALVRVPPGRGAALAGALKAAQAARTAR
GSDTAVWVRIDPPDIG
>gid:390852  SCO1500  conserved hypothetical protein
MSTPENDQADGPRMRRGRRLAVDVGDARIGVASCDPDGILATPVETVPGR
DVPAAHRRLRQLVAEYEPIEVVVGLPRSLKGGEGPAAAKVRRFTQELAKG
IAPVPVRLVDERMTTVTASQGLRASGVKSKKGRSVIDQAAAVIILQQALE
SERVSGRPPGEGVEVVI
>gid:390865  SCO1506  conserved ATP/GTP binding protein
MEPDLFTAAAEERQEKDPTGSPLAVRMRPRTLDEVVGQQHLLKPGSPLRR
LVGEGASGPAGPSSVILWGPPGTGKTTLAYVVSKATNKRFVELSAITAGV
KEVRAVIDGARRAVGGYGKETVLFLDEIHRFSKAQQDSLLPAVENRWVTL
IAATTENPYFSVISPLLSRSLLLTLEPLTDDDVRDLLRRALTDERGLKGA
VTLPGDTEEHLLRIAGGDARRALTALEAAAGAALDKGEAEVGLTTLEETV
DRAAVKYDRDGDQHYDVASALIKSIRGSDVDAALHYLARMVEAGEDPRFI
ARRLMISASEDIGLADPNALQIAVAAAQAVAMIGFPEAALTLSHATIALA
LAPKSNAATTAIGAALDDVRKGHTGPVPAHLRDSHYKGAGKLGHGQGYVY
PHDLPEGIAAQQYAPDGLKDRTYYTPTRHGAEARYADAVEWTRGHLGRKP
S
>gid:390895  SCO1518  holliday junction DNA helicase
MNWDDTTDAEAAAERLVGAAADGEDQAVEAALRPKDLGEFIGQEKVREQL
DLVLRAARARGATADHVLLSGAPGLGKTTLSMIIAAEMEAPIRITSGPAI
QHAGDLAAILSSLQEGEVLFLDEIHRMSRPAEEMLYMAMEDFRVDVIVGK
GPGATAIPLELPPFTLVGATTRAGLLPPPLRDRFGFTAHMEFYGPAELER
VIHRSAGLLDVEIDPTGAAEIAGRSRGTPRIANRLLRRVRDYAQVKADGL
ITQEIAAAALAVYEVDARGLDRLDRGVLEALLKLFGGGPVGLSTLAVAVG
EERETVEEVAEPFLVREGLLARTPRGRVATPAAWAHLGLTPPRPQSSGNG
QPDLFGA
>gid:390898  SCO1519  holliday junction DNA helicase
MIAFVSGTVAALAPDAAVIEVGGVGMAVQCTPNTLSTLRLGKPAKLATSL
VVREDSLTLYGFADDDERQVFELLQTASGVGPRLAQAMLAVHQPDALRRA
VATGDEKALTAVPGIGKKGAQKLLLELKDRLGEPIGAPAVGAPVSTGWRD
QLHAALIGLGYATREADEAVSAVAPQAEAAGGTPQVGALLKAALQTLNRA
R
>gid:390900  SCO1520  crossover junction endodeoxyribonuclease
MRVLGVDPGLTRRGIGVVEGVAGRPLTMIGVGVVRTPADADLGHRLVAVE
QGIEQWLDEHRPEFVAVERVFSQHNVRTVMGTAQASAVAILCASRRGIPV
ALHTPSEVKAAVTGSGRADKAQVGAMVTRLLRLAAPPKPADAADALALAI
CHIWRAPAQNRLQQAVALHTAQGPRRPHKLHPSKGRPA
>gid:390935  SCO1534  putative DNA polymerase III
MTSWFEGPLAAFDTETTGVDTETDRIVSAALVVQDAPGLRPRVTRWLVNP
GVPVPESATAVHGLTEEYVQRHGRWPAPVMYEMAEALTEQARAGRPLVVM
NAPFDLTLLDRELRRHRASSLGRWLERTPLHVLDPHVLDKHLDRYRKGRR
TLTDLCAHYGVELAGAHDAAADAQAALEVVRAVGRRFQARLERLSPAELH
TLQAVWHAGQARGLQAWFALNGTEEAVDPAWPLRPDLPAAA
>gid:390979  SCO1551  putative eukaryotic-type protein kinase
MARHRDRAAGMNMAMMRLRREDPRVVGSFRLHRRLGAGGMGVVYLGSDKK
GQRVALKVIRPDLAEDQEFRSRFAREVSAARRIRGGCTARLVAADLEADR
PWFATQYVPGPSLHDKVADGGPLGAADVAAVGAALSEGLVAVHEAGVVHR
DLKPSNILLSPKGPRIIDFGIAWATGASTLTHVGTAVGSPGFLAPEQVRG
AAVTPATDVFSLGATLAYASMGDSPFGHGSSEVMLYRVVHEEAQLHGVPD
ALAPLVRACLAKDPEERPSTLQLSLRLKEIAAREAQGMADVRPPAPRSGE
ADVPTGRLADTYPERAQRRPQGRPGPQGTPAPRGTSASRGSAPRGSVPSR
GGAPSRGGTPSRGGTPSRGGTSSRGGGTGARSGSRPTPTRDTTGRNSSRN
DGGRPAPRGGSGRTAPRTTGTGWRRPANPRLLRQRLFVFVVVTLLVALGI
AAAQGCQGPARGLGDERGGGVRATQRDQVDPPGGLPEPRTPGR
>gid:391073  SCO1592  hypothetical protein
MQWAKQSEQNVYSNRWFSVNLADVLLPDGRHLDHFLIRMRPVAVATVVNE
ADEVLLLWRHRFITDSWGWELAAGVVEDGEDVAVAAARELEEETGWRPGP
LHHLMSVEPSNGLTDARHHVYWAEEGTYVGHPVDDFESERREWVPLKLVP
DLVARGEVPAANMAAALLLLHHLRLGRDQP
>gid:391106  SCO1603  putative transposase
MWEDSLTVFCGIDWAERHHDVAIVDDTGTLLAKARITDDVAGYNKLLDLL
AEHGDSSATPIPVAIETSHGLLVAALRTGSRKVFAINPLAAARYRDRHGV
SRKKSDPGDALVLANILRTDMHAHRPLPADSELAQAITVLARAQQDAVWN
RQQVANQVRSLLREYYPAALHAFQSKDGGLTRPDARVILTMAPTPAKAAK
LTLAQLRAGLKRSGRTRAFNTEIERLRGIFRSEYARQLPAVEDAFGHQLL
ALLRQLDATCLAADDLAKAVEDAFREHADSEILLSFPGLGPLLGARVLAE
IGDDRSRFTDARALKSYAGSAPITRASGRKHFVGRRFVKNNRLMNAGFLW
AFAALQASPGANAHYRRRREHGDWHAAAQRHLLNRFLGQLHHCLQTRQHF
DEQRAFAPLLQAAA
>gid:391157  SCO1622  putative 5'-3' exonuclease
MLLDTASLYFRAYFGVPDSVKAPDGTPVNAVRGLLDFIDRLVKDHRPEHL
VACMDADWRPHWRVELIPSYKAHRVAEERPAGPDAEEVPDTLSPQVPVIE
AVLDALGIARVGVAGYEADDVIGTYTARATGPVDIVTGDRDLYQLVDDAR
GVRVLYPVKGVGTLNLVDEAALREKYGVDGAGYADLALLRGDPSDGLPGV
PGIGEKTAAKLLAEFGDLAGIQAAVDDPKARLTPTQRKRLTEAGPYLAVA
PKVVRVAADVPLPDTGTALPHGPRDAAALEALAARWGLGGSLQRLLTTLT
A
>gid:391176  SCO1631  putative helicase
MIVRLSVRAGTLESTMTEDLSPAERYAAARQRAVEQATALASFREMYDFG
LDPFQIEACQALEAGKGVLVAAPTGSGKTIVGEFAVHLALQQGRKCFYTT
PIKALSNQKYADLCRRYGTDKVGLLTGDNSVNSEAPVVVMTTEVLRNMLY
AGSQTLLGLGHVVMDEVHYLSDRFRGAVWEEVIIHLPESVTLVSLSATVS
NAEEFGDWLDTVRGDTQVIVSEHRPVPLFQHVLAGRRMYDLFEEAEGHKK
AVNPDLTRMARLEASRPSYQDRRRGRAMKEADRERERRQRSRVWTPSRPE
VIERLDSEGLLPAITFIFSRAGCEAAVQQCLYAGLRLNDEGARERVRALV
EERTSSIPREDLHVLGYYEWLEGLERGIAAHHAGMLPTFKEVVEELFVRG
LVKAVFATETLALGINMPARSVVLEKLVKWNGEQHADITPGEFTQLTGRA
GRRGIDVEGHAVVLWQRAMNPEHLAGLAGTRTYPLRSSFKPSYNMAVNLV
DQFGRHRSRELLETSFAQFQADKSVVGISRQVQRNEEGLEGYKASMTCHL
GDFDEYARLRRELKDREQELARQGANQRRAEAAVALEKLKPGDVIHVPTG
KYAGLALVLDPGLPAGRSNGHRGFDHHDGPRPLVLTAERQVKRLASIDFP
VPVEALDRMRIPKSFNARSPQSRRDLASALRSKAGHITPERARKKRSQAA
DDREINRLRKAIRAHPCHGCDDREDHARWAERYHRLLRDTSQLERRIEGR
TNTIARTFDRIVALLTELDYLRGDEVTEHGKRLARLYGELDLLASECLRE
GVWEGLSPAELAACVSALVFESRAADDATAPKVPSGRAKAALGETVRIWG
RLDALEEDFRISQTEGVGQREPDLGFAWAAYMWASGKGLDEVLREVEMPA
GDFVRWCKQVIDVLGQISAAAPGAGSTVPKNARKAVDELLRGVVAYSSVG
>gid:391231  SCO1653  conserved hypothetical protein SCI41.36
MATAPASLSPSRAGDFMQCPLLYRFRVIDRLPEKPSEAATRGTLVHAVLE
RLFDAPAAERTAPTARALVPGQWDRLRESRPEVGELFADDPEGERLAQWL
AEAERLVERWFSLEDPSRLEPAERELFVEAELDSGLRLRGIIDRVDVAPT
GEVRIVDYKTGKAPRPQYAEGALFQMKFYALVVWRLKKVVPRRLQLVYLG
SGDVLTYDPVLADLERVERKLLALWEAIRLATETGDWRPRPTKLCGWCDH
QAHCPEFGGTPPPYPLPVRAADSGAVAQGRMGPD
>gid:391285  SCO1671  hypothetical protein
MSTEPEPVTEDTTDAVDAGPETPGDATAPATEAAGETVRPEGEPETADAG
ADSGTRPVEPAGQGPDAGGEGEGEAAAQVSEVDAELAAQRLERERIERRK
AEKQGPIDAGGKLSGTAADLLAAVRAVESGEKPAAAVFGAPEPARRPAPE
PVRPSRPTPAEPAAASAGSAGPAPETVQTVRRVLAEGGAPEALAPQTAAL
LGEGAQEALRADPWQLLRVGGVRPEQADGFARALLGAECGPDDERRGRAV
TVWLLEQAALAGHTALELPRLVATLAQRGVPDPDAAVQSTLAEGEALAFQ
DALEETGAHREPAASGAAGPDRAEEQDEGGEERPVRVLIGLERYALAEES
LADGLARLVNSTPKQDGSAADWEQAAASAQGSGAELIRAVAAHGLVLHTG
GEASLAEPAALLHAAHALGLRAWAAAPGPLGRDRFGALLGDPSAEPPSPG
SPAAPAAVTVAGLLTGAEGPGRDADGALDLDLLVVLDAPQLDVEAGALLA
ESLPDGARLVLAGDPAVLWSVGPGRVFADLLAARVCPQIASRRPDPGPLG
ELVSGIGVGELSQVEAPGKEVVIVPVRDAGEAVHRTVQLVADSVPRAIGV
PAEETQVITPGHGGAVGTRALNAALKERLNPGPGRFGGFDPGDRVVHTPA
PGRVLPGRVVGADADGLRLSCAGETVVVPRDRVEGSVRHGWALTAHQALG
GRWPAVVVVLPGDAVPALSRPWIYTAFGRAARHLSVVHGVEQALPRAVAE
VPAKPRTTRLPVLLAPQTPAAG
>gid:391317  SCO1686  putative NTP pyrophosphohydrolase
MDGLMSSADEILDIVDENDRVVGQARRGDAYARGLRHRCVFVWARDPEGR
VFVHRRTATKLVFPAPYDMFVGGVVGAGESYDDAALREAEEELGASGLPR
PEFLFKFLYDDGAGRTWWSAVYEVRVAGAVSPQVEEVAWHGFLPEAELEG
RLGEWEFVPDGLSAYARLRAWRGASGPGSVG
>gid:391388  SCO1724  putative serine/threonine protein kinase
MSGEPDGERVIAGRYRLLSPLGEGGMGTVWRARDEVLRREVAVKEVRAPA
GLSQPDVGRMYARLEREAWAAARISHPNVVTVYDVATDGGRPWIVMELVR
GLSLADLLDAEGPLEPRRAALIGAEVLAALRAAHAAGVLHRDVKPANVLL
ANDGRVVLTDFGIARVEGSEALTMTGEVVGSPEFLAPERALGRTPGAASD
LWSLGVLLYATVEGVSPFRQGTPLSTLRAIVDEAVPPPRRAGALGPVVEG
LLRKDPAERLPAEEAERALRLVGAGGAPPGRGPRTGAPPSGAFAPTVVAA
HPGPPTAPTPPMPVPAAGDTSAAGAPGAPGGRDRRARAVLLVGLAVLVLA
LAGLTYSLLDRSDGGGGTEGSGGSPGPGATSAATSGGAGEPSPPASSGTD
GGQSGPAAQSVEVTVVGSRTHYSGACPPPHDRAPAFTATFTVGRLPAEVG
YRWVTADGSVSDPGWRTLSFPSGGERSRQDGVVVTTHDDTGAFESAVRVE
VRSPVRATSDAVAFSVTCGTGTETETGTGTPSGGASHSSSPSAPSSSPSA
>gid:391414  SCO1738  conserved hypothetical protein
MTILCVRFQLPPTREAALPELLGLLEEFTPVVEALPPDGALADLRGAERY
FGRDAVALASVIRVRALALHGVDCVIGAGPGPMLARMALRDARPGLTRAV
PGGEERAFLDGKPVAALPGVGTATARTLCEYGLDTLGRVAAAPLSTLQRL
VGARTGRELHEKAQGVDRGRVVPNGVSRSLAADRPFDRDELDPDRHRRAL
LSAAGDLGARLRAVDKVCRTLTLTVRYADRSATTRSRTLREPTAHSAALT
RTAYDLYEALGLQRARVRSIALRAESLTSAEHASHQLTFDPVDEKVRRIE
EVADRARAKFGPRAVMPGSLAA
>gid:391415  SCO1739  putative DNA polymerase III, alpha chain
MPGFTHLHTVSGFSARYGASHPERLAERAFERGMDALALTDRDTLAGTVR
FAKACAKAGVRPLFGAELAVAAPESAAADRTETSVRRDRRRAPVRGGAFV
DESAPRVTFLARDGARGWADLCRLVSAAHTAEGSPLLSWPDNHADGLTVL
LGPDSDVGRALAAGRPDRAARLLVPWREVYGDALRLEAVWHGRTGIGPGS
LRLAARTVGFAAEQGVRPVLSNAVRYADPGQGEVADVLDAARRLVPIGAT
KELDSGEAWLKDAGAMRHAAERIVESAGFRRDTAHRLLEQTRATAAECLV
DPEDDLGMGAVHFPEPHLVGAGRRTAQRALASRAAAGMVRRGYDRRRDHR
EYWERMHHELDIIAHHGFASYFLTVAQVVDDVRHMGIRVAARGSGAGSLV
NHLLGIAHADPVEHGLLMERFLSKERVVLPDIDIDVESARRLEVYRAIIG
RFGTERVATVAMPETYRVRHAIRDVGAALSMDPAEIDRVAKSFPHIRARD
ARAALEELPELKELAGESRRDGGGRYGRLWELVEGLDALPRGVAMHPCGV
LLSDASLLSRTPVVPTSGEGLPMAQFDKDDVEDLGLLKLDVLGVRMQSAM
AHAVAEVKRATGTEVDLDAVPEGDPATYRMIRSAETLGCFQIESPGQRDL
VGRLQPATFHDLVVDISLFRPGPVAADMVRPFIEARHGRAPVRHPHEDLA
EPLAGTYGVVVFHEQIIDIVAIMTGCGRGEADRVRRGLSDPESQGRIKVW
FAQHATANGYDAETIQRTWEIVEAFGSYGFCKAHAVAFAVPTYQSAWLKA
HHPAAFYAGLLTHDPGMYPKRLLLADARRRGVPILPLDVNESGVAHRIEL
VSESGSRKPATWGLRLALSDVHGISEAEAERIATGQPYASLLDFWERARP
SRPLAGRLAQVGALDAFGANRRDLQLHLTELHRGARGGRGDQLPLTGGRK
TASAGLPDLTSEEKLSAELGVLSMDASRNLMDDHRTFLRELGVVSAKRLR
EARHGETVLVAGAKAATQTPPVRSGRRVIFSTLDDGSGLVDLAFFDDSHD
ACAHTVFHSWLLLVRGVVQRRGPRSLSVVGSAAWNLADLLEIRREEGLEG
VAARLAAAGGSQEGDAPGGGPARRRLAGSDGRPAPAGSAAQDPMEQRKIR
MSTGYEMHPWADLRPAGEGPAVGRKLWHQSPGSAG
>gid:391506  SCO1775  conserved hypothetical protein
MTGSTIKDTPEEWEIRGSQTPFRGKKTSVRTDDVVMPDGSVVTRDYQVHP
GSVAVLALDGEGRVLVIRQYRHPVREKLWEIPAGLLDVPGENPLHAAQRE
LYEEAHVKAEDWRVLTDVYTTPGGCDEAVRIFLARDLSEAAGERFEVEDE
EADMELARVPVADLVRGVLAGELHNNCLVVGVLALVAAERGDGLDALRPA
LAPWPARPFEA
>gid:391514  SCO1780  putative DNA repair protein
MRIRSLGVIDDAVVELSPGFTAVTGETGAGKTMVVTSLGLLLGGRADAAL
VRIGAKNAVVEGRIAVPGDAAVAVRAEEAGAELDDGALLISRTVSAEGRS
RAHLGGRSVPVGMLAELADELVAVHGQTDQQGLLKLNRQRQALDRYAGDA
VAGPLAKYAEAYRRLRAVVRELEEITTRARERAQEADLLRYGLDEIAAVE
PRAGEDVELAEEAERLGHAEALASAATVAHAALAGNPEDPEGVDGATLVA
GAQRALDAVRSHDPALAALAERIGEVGILLRDVAGELAGYADDLDADPLR
LAAVEERRAALTALTRKYGEDIAAVLSWAEQSAARLTELDGDDERIGELT
AERDALRAELGGLAQALTDARTEAAERFAAAVTAELASLAMPHARVSFAI
RQTEDPEGVEIGGRTVAYGPSGADEVELLLAPHPGAPARPIAKGASGGEL
SRVMLAVEVVFAGTDPVPTYLFDEVDAGVGGKAAVEIGRRLARLARSAQV
VVVTHLPQVAAFADRQLLVEKTNDGSVTRSGVKVLEGEERVRELSRMLAG
QEDSETARAHAEELLETARADR
>gid:391546  SCO1792  putative DNA glycosylase
MIASPDRTPLPREFFDRPVLEVAPDLLGRILVRTGPDGPITLRLTEVEAY
DGQNDPGSHAYRGRTPRNEVMFGPPGHVYVYFTYGMWFCMNLVCGPEGRS
SAVLLRAGEIIDGAELARTRRLSARNDKELAKGPARLATALGVDRALNGT
DACTSQETPLRILTGTPVPGDQVRNGPRTGVAGEGGVHPWRYWVADDPTV
SPYRAHVPRKRRS
>gid:391624  SCO1827  putative DNA polymerase III, epsilon chain (EC 2.7.7.7).
MLEDRTTAAPSATAWPAAYPQGYAVVDVETTGLARDDRIISAAVYRLDAR
GEVEDHWYTLVNPQRDPGPVWIHGLTSDVLRDAPLFPDVAEEFAARLDGR
VLVAHNAVFDWQMIAREYARAEREAPVRQRLCTIALSKALDLPLPNHKLE
SLAAHFGVVQQRAHHALDDARVLAEAFRPSLLAAAAGDVRLPLHECRPLT
EWSDRPAPRIGQQAGYGSYRPTSWRPSRKRPACPHPNPGRYEEGKRLKQG
MRVAFSGDTSVERDLLEDRAVEAGLHVATSLSRLTSLLVTNDPDSGTSKV
VKARQFGTPVVDEAAFGQLLADVEPADG
>gid:391873  SCO1953  ABC excision nuclease subunit C
MADPSSYRPRPGEIPDSPGVYRFRDEHRRVIYVGKAKSLRQRLANYFQDL
AHLHPRTRTMVTTAASVEWTVVSTEVEALQLEYSWIKEYDPRFNVKYRDD
KSYPYLAVTMNEEFPRVQVMRGQKKKGVRYFGPYGHAWAIRDTVDLLLRV
FPVRTCSAGVFKNAARTGRPCLLGYIGKCSAPCVGRITPDDHWDLADEFC
DFMAGRTGTYLRRLERQMAEAAEEMEYERAARLRDDIGALKKAMEKSAVV
LADATDADLIAVAEDELEAAVQIFHVRGGRVRGQRGWVTDKVEEITTGAL
VEHALQQLYGEEKGDAVPKEVLVPALPDPVEPVQQWLAERRGSGVSLRIP
QRGDKKALMETVQRNAQQALVLHKTKRASDLTTRSRALEEIAEALDLDSA
PLRIECYDISHLQGDDVVASMVVFEDGLARKSEYRRFQIKGFEGQDDVRS
MHEVITRRFRRYLAEKERTGEWADGEGLVDDARHPNGDAAPNGDAAPNDG
AAPDDGAARTDGRGLTDGQELTDGPALKDDDGRPKRFAYPPQLVVVDGGQ
PQVAAAQRALDELGIDDIAVCGLAKRLEEVWLPREDDPVVLPRTSEGLYL
LQRVRDEAHRFAITYQRAKRAKRFRAGPLDDVPGLGETRKQALIKHFGSV
KKLRSATIDQICEVPGIGRKTAETVAVALARATPAAPAVNTATGEIMDDD
DGAPETTADAPGEPVSAGTPDERRGQER
>gid:391886  SCO1958  ABC excision nuclease subunit A
MADRLIVRGAREHNLKNVSLDLPRDSLIVFTGLSGSGKSSLAFDTIFAEG
QRRYVESLSSYARQFLGQMDKPDVDFIEGLSPAVSIDQKSTSRNPRSTVG
TITEVYDYLRLLFARIGKPHCPECGRPISRQSPQAIVDKVLELPEGSRFQ
VLSPLVRERKGEFVDLFADLQTKGYSRARVDGETVQLSNPPTLKKQEKHT
IEVVVDRLTVKDSAKRRLTDSVETALGLSGGMVVLDFVDLPEDDPERERM
YSEHLYCPYDDLSFEELEPRSFSFNSPFGACPDCSGIGTRMEVDAELIVP
DEDKSLDEGAIHPWSHGHTKDYFGRLIGALADALGFRTDIPFAGLPLRAR
KALLYGHKTQVEVRYRNRYGRERRYTTAFEGAIPFVKRRHSEAESDASRE
RFEGYMREVPCPTCQGTRLKPLVLAVTVMGKSIAEVSAMSISDCADFLGE
LTLNARDKKIAERVLKEVNERLRFLVDVGLDYLSLNRAAGTLSGGEAQRI
RLATQIGSGLVGVLYVLDEPSIGLHQRDNHRLIETLVRLRDMGNTLIVVE
HDEDTIKVADWIVDIGPGAGEHGGKVVHSGSVKELLDNAESQTGLYLSGR
KAIPLPDIRRPQDPSRRLTVHGARENNLQDIDVSFPLGVFTAVTGVSGSG
KSTLVNDILYTHLARELNGARNVPGRHTRVDGDDLVDKVVHVDQSPIGRT
PRSNPATYTGVFDHIRKLFAETTEAKVRGYLPGRFSFNVKGGRCENCAGD
GTIKIEMNFLPDVYVPCEVCHGARYNRETLEVHYKGKSIADVLNMPIEEA
TDFFEAVPAISRHMKTLKDVGLGYVRLGQSATTLSGGEAQRVKLASELQR
RSTGRTVYVLDEPTTGLHFEDISKLLTVLGGLVDKGNTVIVIEHNLDVIK
TADWVVDMGPEGGAGGGLVVAEGTPEQVAGVPASHTGKFLRDVLGADRVS
DAAPVTRPRKAAKTVAAKAAAKKTATKTVTGTAAKKATATRTAKTAVKKA
AKPAAKKTTRTSKA
>gid:391903  SCO1966  ABC excision nuclease subunit B
MRPVSQIERTVAPFEVVSPYQPSGDQPTAIAELARRVQAGEKDVVLLGAT
GTGKSATTAWMIEKLQRPTLVMAPNKTLAAQLANEFRELLPNNAVEYFVS
YYDYYQPEAYVPQSDTYIEKDSSINEEVERLRHSATNSLLTRRDVIVVAS
VSCIYGLGTPQEYVDRMVPLRVGEEHDRDELLRRFVDIQYTRNDMAFARG
TFRVRGDTIEIFPVYEELAVRIEMFGDEIEALSTLHPVTGEIISEDQQLY
VFPASHYVAGPERLERAVNDIEKELAERLTELEKQGKLLEAQRLRMRTTY
DIEMLRQIGSCSGVENYSMHFDGRSPGSPPNTLLDYFPDDFLLVIDESHV
TVPQIGAMYEGDASRKRTLVDHGFRLPSALDNRPLKWEEFQERIGQTVYL
SATPGAYELSRSDGAVEQIIRPTGLVDPEVVVKPTEGQIDDLVHEIRRRT
EKDERVLVTTLTKKMAEDLTDYFVELGIQVRYLHSDVDTLRRVELLRELR
AGEYDVLVGINLLREGLDLPEVSLVAILDADKEGFLRSGTSLIQTIGRAA
RNVSGQVHMYADKITPAMEKAIDETNRRREKQVAFNKANGVDPQPLRKKI
NDIVAQIAREDVDTEQLLGSGYRQTKEGKGAKAPVPALGGQKTGGAKAAR
GRAKETAVTDRPAAELAEQIEDLTTRMRAAAADLQFEIAARLRDEVSEMK
KELRQMREAGLA
>gid:391910  SCO1969  putative DNA-methyltransferase
MDSHGQYEQQVVWTVVGTDIGPLLLAATHDGLVNVVFHATDATRGRALER
LAARLGTEPVEAPGSPLLAEAIRQVEAYFAGRRRDFELPLDWSLISGFNR
QVLRELASGVPYGSVVGYGDLAGRVGQPGAAQAVGMAMGANPLPVVVPCH
RVVESDGGIGGFGGGVDTKRRLLALEGVLPEPLF
>gid:391957  SCO1990  conserved hypothetical protein
MDTGGLSVLDRRIEGCRACPRLVEWREEVARTKRAAFADWTYWGRPVPGF
GPPDARLLIVGLAPAAHGGNRTGRMFTGDRSGDVLYQALYDVGLASQPTA
VRVDDGLELYGVRVTSPVHCAPPANKPTPAERDTCRSWLVQELGLLRPTL
RAVVVLGAFGWQAALPAFAGAGWTVPRPRPAFAHGTQVTLDAADGPDLHL
FGCFHVSQRNTFTGRLTPEMLRDVLRTAAETAGLPAR
>gid:391966  SCO1997  conserved hypothetical protein
MLDPQDLYTWEPKGLAVVDMALAQESAGLVMLYHFDGYIDAGETGDQIVD
QVLDSLPHQVVARFDHDRLVDYRARRPLLTFKRDTWSDYEEPTIEVRLVQ
DATGAPFLFLSGPEPDVEWERFAAAVGQIVERLGVRLSVSFHGIPMGVPH
TRPVGITPHGSRTDLVPGHRSPFEEAQVPGSAEALVEYRLAQAGHDVLGV
AAHVPHYVARSAYPDAALTVLEAITAATGLVLPGIAHSLRTDAHRTQTEI
DRQIQEGDEELIALVQGLEHQYDAAAGAETRGNMLAEPVEIPSADEIGRE
FERFLAEREGDG
>gid:391973  SCO2000  putative ATP-binding RNA helicase
MIRYDALDALPVRGALPALHDALEEHGTAVLVAPPGTGKTTLVPLALAGL
LGGEGVPARRVVVAEPRRIAARAAARRMAWLLGERPGASVGYTVRGERVV
GRRARVEVVTTGVLLQRLQRDQELAGVDAVVLDECHERHLDADTSAAFLW
DVRQALRPELRLVAASATTDAEGWSRLLGGAPVVEAHGVSCPVETVWAPP
ARAVRPPHGMRVDPALLAHVASVVRRALAERGGDVLCFLPGVGEISRVAG
LLGSLEDVDVLQVHGRAPAAVQDAVLAPGARRRVVLATSVAESSLTVPGV
RVVVDAGLAREPRVDHARGLSALTTVRASRAAARQRAGRAGREAPGVVYR
CWTEAEDARLPRFPAPEIKVADLTAFALQAACWGDPDASGLALLDPPPGG
AMTAARSVLEAVGAVDPAGRATERGVRLSRLGLHPRLGRALLDAAEPGAG
VSPRTAPGAAAPAGRSGAGTGASPTRTSAPGHAGGGTAGPAAAGAGVPGA
DGLRSGGAEVVAEVVGLLSEEAPREYGDDLVSVLRAARRGGDAYGARWRA
EVRRLRAAAEGGDDRRGQRATVESGDDRRRQRATVESGDDRRRQRATVES
GDDRRRQRATVESGDDRRRQRATVESGDDRRRQRATVESGDDRRRQRATV
ESGDDRRRQRATVESGDGGRGERVGEDVLAGLVAALAFPERVARKDGGSY
LMVSGTRAELPQASALRGAPWIAVAVADRPVGRGHARVQLGAAVDEETAR
LAAGALLTERDEVHWADGDVVARRVERLGAVELAVRPLADADPVLVRGAL
LEGLRREGLGLLRRSADAGLLRKRLAFLRRRLGEPWPDVTDDALLARTDE
WLEPELSRARRRADLGRIDAGQALTRLLPWASGEATRLDELAPERILVPS
GSRIRIDYDDPEQPVLAVKLQEMFGLQRSPEIAGVPLLVHLLSPAGRPAA
VTADLASFWKDGYTGVRAELRGRYPKHPWPEDPATAEPTRYTKARLGK
>gid:391978  SCO2003  DNA polymerase I
MAKTASKKTDSTSGGRPRLMLMDGHSLAYRAFFALPAENFTTATGQPTNA
IYGFASMLANTLRDEAPTHFAVAFDVSRKTWRSEEFTEYKANRSKTPDEF
KGQVELIGELLDSMHVPRFAVEGFEADDVIATLATEAEAEGFDVLIVTGD
RDSFQLVSEHTTVLYPTKGVSELTRFTPEKVFEKYGLTPAQYPDFAALRG
DPSDNLPGIPGVGEKTAAKWINQFGSFAELVERVDEVKGKAGQNLRDHLE
SVKLNRRLTELERRVELPRTVTDLERTAYDRKGVAVILDTLEIRNPSLRE
RLYAVDPGAEEAEATPVVADGVELDGTVLGTGELAGWLAEHGAQPLGVAT
VDTWALGTGSVTEVALAASDGKAAWFDPTELDEADETAFRSWLSDPDRPK
VFHNAKGAMRVLAEHGWSVAGVGMDTALAAYLVKPGRRSFDLDALSLEYL
HRELAPAAAADGQLAFGADEGAEAEVLMVQARAILDLGETFESRLADVGA
ADLLRDMELPTSALLARMERHGIAADREHLQAMEQMFAGAVQQAVKEAHA
AAGREFNLGSPKQLQEVLFGELNLPKTKKTKTGYTTDADALAWLAAQTDN
ELPVIMLRHREQAKLRVTVEGLIKTIAADGRIHTTFNQTVAATGRLSSTD
PNLQNIPVRTDEGRAIRRGFVVGEGFESLMTADYSQIELRVMAHLSEDAG
LTEAFTSGEDLHTTAAAQVFSVEQSAVDAEMRRKIKAMSYGLAYGLSAFG
LSQQLNIEAAEARGLMDAYFERFGGVRDYLRRVVDEARATGYTATLFGRR
RYLPDLNSDNRQRREAAERMALNAPIQGTAADIVKIAMLNVDKALREADL
KSRMLLQVHDEIVLEIAPGERAAAEELVRREMANAVQLRVPLGVSVGAGP
DWESAAH
>gid:392082  SCO2064  DNA polymerase III alpha chain
MSKPPFTHLHVHTQYSLLDGAARLKDMFDACNEMGMSHIAMSDHGNLHGA
YDFFHSAKKAGVTPIIGIEAYVAPESRRNKRKIQWGQPHQKRDDVSGSGG
YTHKTMWATNSKGLHNLFRLSSDAYAEGWLQKWPRMDKETISQWSEGIVA
STGCPSGEVQTRLRLGHFDEALKAAADYQDIFGKDRYFLELMDHGIEIEH
RVRDGLLEIGRKLGIPPLVTNDSHYTYAHEATAHDALLCIQTGKNLSDPD
RFRFDGTGYYLKSTDEMYAIDSSDAWQEGCANTRLIAEMIDTTGMFEKRD
LMPKFDIPEGFTEITWFQEEVRRGMERRFPGGVPEDRQKQAEYEMDVIIQ
MGFPGYFLVVADFIMWAKNQGIAVGPGRGSAAGSIVAYAMGITDLDPIPH
GLIFERFLNPERVSMPDVDIDFDERRRVEVIRYVTEKYGADKVAMIGTYG
KIKAKNAIKDSARVLGYPYAMGDRLTKAMPADVLGKGIDLNGITDPTHPR
YSEAGEIRSMYESEPDVKKVIDTAKGVEGLVRQMGVHAAGVIMSSEPIVD
HAPIWVRHTDGVTITQWDYPQCESLGLLKMDFLGLRNLTIMDDAVKMVKS
NKGIDLDLLSLPLDDPTTFELLQRGDTLGVFQFDGGPMRSLLRLMKPDNF
EDISAVSALYRPGPMGMDSHTNYALRKNGLQEITPIHKELEEPLQEVLAV
TYGLIVYQEQVQKAAQIIAGYSLGEADILRRVMGKKKPDELAKNFVLFQA
GARKNGYSDEAIQALWDVLVPFAGYAFNKAHSAAYGLVSYWTGYLKANYP
AEYMAALLTSVKDDKDKSAVYLNECRRMGIKVLPPNVNESMSNFAAQGDD
VILFGLSAVRNVGTNVVESIIKCRKAKGKYASFPDYLDKVEAVVCNKRTT
ESLIKAGAFDEMGHTRKGLTAQYEPMIDNVVAVKRKEAEGQFDLFGGMGD
EQSDEPGFGLDVVFGEDEWDKTYLLAQEREMLGLYVSDHPLFGLEHVLSD
KADAGISQLTGGDFGDGAVVTIGGIISGLQRKMTKQGNAWAIATVEDLAG
SLECMFFPATYQLVSTQLVEDAVVFVKGRLDKREDVPRLVAMELMIPDLS
NAGTNAPVVLTIPATRITPPMVSRLGEILTHHRGDSEVRIKLQGPTKTTV
LRLDRHRVKPDPALFGDLKVLLGPSCLAG
>gid:392168  SCO2111  putative endonuclease
MSTASSSSSLPRNPVGGHVPVAGGLHSVGLSYARELKAEAVQVFVANPRG
WATPAGNPKQDEAFREACAAGSVPAYVHAPYLINFGSHTGATVERSVESL
RHSLRRGRAIGALGVVVHTGSATGGRERPVALKQVREHMLPLLDELTHDD
DPYLLLESTAGQGASLCSRTWDFGPYFEALDAHPKLGVCLDTCHIFAAGH
DLTGPSGMHQTLDLLVDTVGEGRLRLIHANDSKDVAGAHKDRHENIGAGH
IGEDPFRALMTHPATDGVPLVIETPGGKEGHAADVARLKKLRDG
>gid:392449  SCO2244  probable serine/threonine protein kinase.
MSRSASGIPAEIGPYRLERLLGEGGMGRVYLGRTPAGSAVAVKVVHRAYA
ADPEFRRRFALEVAAARRVQGLYTVPVVAADLDADEPWLATAYAPGPSLQ
QAVGERGPLPAAEVLALTAGVAEALETIHAAGVIHRDLKPSNIVLTADGP
KVIDFGIARAADVTALTATGMRAGTPAYMAPEYIRGQEVTEAGDVFALGL
VAHFAATGRLAFGGGSDHGVAYRILEASPDLDGCPESVRGVVALCLEKDP
ARRPTPAEIVRLCGRAANGDFDDGRTPTVVSTPPATGPDAPTATAPARTP
APDADPETPSTPPYAALLGVVAAIALVVVLVVTLLPSSGKPPKSKQPYPV
VAATAIFARGTSGVAFSPDGETLATGGQDGKVRLWDAATRKVRATLVEKG
WYGPSRVVGVTFSADGKTLVTRTGTHLVGVWDVARRREVRRIEESAYSLA
LSPDGKWIALGDSAGANLWDLGRRGEDPRAHLSQNLHATDLAFSPDGRTL
ASVGDFSDQRYQVENEPAKLWDLTRLDPRPYGQGDPRHNLALEDVVYAVA
FSPDGKTLATGGQGGGVRLWDAATGRPKATLTHKFVTEARDLAFSPDGST
LAVTAEGRVLLWNLADRKPSAILADDETGFGADINELAFSPDGRFLAGTT
TGGGAPEETAAPGTTEADAAKDSGVRLWKVPTKTAH
>gid:392617  SCO2320  hypothetical protein
MSRSHPCGAVRLTVRESRGGSAGGSRLRVSAGIAPASPRTGDDDAATLPV
SRVRAEGRPGRGRRPGDTFPGMRTGRLIGAGRSADVYEADAGRVLRRDRE
GVGDAVAEGAVMEHVRKHGYPVPRVWPGVTESPRTDLVMERLTGPTMLRA
WQDGALTPQEAGGMLAGLLRRLHRVPAYRSADPGARVLHLDLHPDNVILT
VDGPRVIDWSNAEEGDPGLDWGMTAVILAQVAAAGGPVSGPVEGALAALL
ADPRALTPDGLAEALRRRAANPTMSRDEVGLLPAAEALIRSRLD
>gid:392664  SCO2339  hypothetical protein
MTPLCRTPDTASPVDLSCGGDKATDLEGAHVLELTMAAVTAADAGATAGM
QMADAPSEPDAVLRVGRDKSVCRLSTPDDWLFVSRVHLEFRCGADGTWRL
SWLRGSQADPSSEVRLTIGEHRQALPYGGTVPLPRGGSGEVVVQDRAEPR
SVNVGFYHEV
>gid:392909  SCO2450  putative serine/threonine protein kinase (regulator)
MPTDRPGTPDPPPLPLLADLLGRAATGARPTPLELAELLWLAGHMEPPEQ
DPPDGPASGTRPAEPPPAPEGTREQGQEGDGQRERDRKPERDRHGDRERD
RDGDRDRGPGRGPWGDGPGRPSTRSEAPRTPLRLPSPAPAPGTSAAQPHS
ALLAPAPPMLRHTLALQRALRPLKRRADAPVGHEVDEAATADRIARLGAG
PEWWLPVLRPVRERWLRLHLVHDAGPTMPVWRPLVRELQAALAQSGVFRT
VTLHRADPDGTVRGDGAQIPADGRTVMLLISDCMGPQWRAGPDGVRWFAT
LRRWARRAPLAVLQPLPEQLWRDTALPPVPGRLSAPHRAAPSASLAFTPY
DTAAPRAPEGTVHVPVLEPGPEWLANWAALVASPGGTPYPGAAAALHRPL
PADADDRTDVARLSPEELVLRFRASASPQAYRLAGHLALGRPDLPVMRLV
QAAVEPDPRPQHLAEVILGGLLTTVAGPPGSYAFRPGVRELLLRGLPRTA
RNRTHDLLLRTGGLIDDRAGRSPGEFRALIPSRKGTERAGPSESFATISE
ESVRQLTVRERPSAPSPFPPGLGARYRPTRRLTPSGRIWLAEDTGPDRTA
PDRTPADRTAAHPVPTNRTVAIRLHDPATGPAARQTFLRNARRLTRLSHP
NLVTVLDAGIEGDVPYVVMEHLDGIALGALTRSGDGRLPVPLTVSVGAQL
ARALTALHRAGLAHGGLEASRVVLLPDGTVRLSLFEPALALGPGARSADL
RALCEVLLRLTSGTSRPAVPVDSRRLHRLPTALRIHYAHAFDQLLSPSPA
AQARGLDLLGDPALLARAGEAYAPRRYRALGPLRVGLPDGPADLPRDVRA
LLAMLLLKHGRTVTHEELRWGLWDPGNEPRNPRSETTRLAKRLAEILGPG
VLATAAHGYALHTSADELDLVRCDELVRRADAARLEGALAEAHDLVSGAL
SLWGDAEPLAGVPGPAARTARTRLLRLRLALHTKRAELDLVLGEYDRAAA
ELSGLLRAHPHREDFRRLCLIALRRQGRVEEALEVYEEYELSGGRSPALT
ALGRELREEYAEPVDDAPPWTEYEQRPVEPGAPESTVSAPDELPEGPFPT
EDGLWTPLLDDDAEHDTAQARAEADEREERAESRAALREAMRAEGLEPEP
ELDEDPGPEEEEEPDEPPWIDFDTGFRACARYTLADGPVDRDARAALHGL
VTELLADSGADAAAYELVDEPGGPLVLLAPRAEAAPLLRATVEGLPGRLA
RLAGLRLRIEFWQVEFGLDGNGEQSLGRADADAVGAVLDASAAQAVVVLS
DSLYYDEVREEGQYGPAFPPDLFRPLADDTGWYHLVEGGGRPGPAADLR
>gid:392955  SCO2468  DNA primase
MAGRINDEDVKAVRDAVPIDAVVSEYLQLRNAGGGNLKGLCPFHDEKSPS
FQVSPSKGFFHCFGCQEGGDTITFVMKIDHLTFSEAVERLAGQAGITLRY
EEGGYNPSHQRGERIRLVEAHKIAAQWYAEQLATGPEADTGRAFLADRGF
DQAAAEHFGVGYSPQGWDHLTRFLRGKGFSDKELLLSGLSQEGRRGPIDR
FRGRLMWPIRDIGGDVVGFGARKLYEADNGPKYLNTPDTAIYKKSQVLYG
IDLAKKDIAKASRAVVVEGYTDVMACHLAGVTTAIATCGTAFGGDHIKIL
RRLLMDNGSARVIFTFDGDAAGQKAALRAFEDDQKFAAETYIAIAPDGMD
PCDLRLAKGDDAVADLVEPRTPLFEFALRQIVARYDLDTPAGRAAALDEA
APVVARIKNSGAQHEVAVQLAGMLGILDTQFVVKRIAQLARWARDRGGKG
PAPDQRQRGGGPQQQAGPMTATPRGPALNLRNPVFATERELLKLALQRPE
LVSPAFDAYGVDEFTAPPYAAVREAIMEAGGAEFGVQDPQDYLVRVREAA
PDDTVRAMVTELAVEAIMLHRGVKGVDEVYAGAQLVTVRRRAVERRIRDI
TGRLTRLSGHGDPAELAAVQNELWILQQYDQNLREHGAAAL
>gid:393059  SCO2510  conserved hypothetical protein SCC121.13c
MGGMSLFRDDGIVLRTQKLGEADRIITLLTRGHGRVRAVARGVRRTKSKF
GARLEPFSHVDVQFFSKGSELVGRGLPLCTQSETIAPYGGGIVTDYARYT
AGTAMLETAERFTDHEGEPAVQQYLLLVGALRTLARGEHAPTLVLDAFLL
RSLAVNGYAPTFGDCAKCGMPGPNRFFSVGSGGSVCVDCRVPGSVVPSPQ
ALELLGALLTGDWGTADAAEPRYVREGSGLVSAYLHWHLERGLRSLRYVE
K
>gid:393172  SCO2564  putative DNA-binding protein
MRAMLVAMARKTANDDPLAPVTLAVGQEDLLLDRAVQEVVAAAKAADADT
DVRDLTPDQLQPGTLAELTSPSLFAERKVVVVRNAQDLSADTVKDVKAYL
GAPAEEITLVLLHAGGAKGKGVLDAGRKAGAREVACPKMTKPADRLAFVR
AEFRTAGRSATPEACQALVDAIGSDLRELASAVSQLTADVEGTVDEAVVG
RYYTGRAEASSFTVADRAVEGRAAEALEALRWSLATGVAPVLITSALAQG
VRAIGKLSSARGGRPADLARELGMPPWKIDRVRQQMRGWTPDGVSVALRA
VAEADAGVKGGGDDPEYALEKAVVIIARAARSRGRT
>gid:393177  SCO2568  putative DNA-binding protein
MALRSRSRTASATSGPGRAPASDGRLAHRRAPGSRTHARHRSHARHGRRH
AAPEELRRRAETLFAERAEGYDHAGHEGAHGETGKGPPLPGLDAPARQGS
PLPGLDAPTGPGTAWRERAGSALRERMPLWLQTRCGLERRSVAALSVLLV
VAAVFAVQHFWTGRTHPVAAPEVVREAAAYGAGKPEPTAEDRDTAGGSGP
KAAATATAGPEIVVDVGGKVRDPGVHSLPAGSRVADALRAAGGVRPGTKT
DGLNRARFLVDGEQVIVGAPAPVPRPGAGPAPDGPTGAAGPAAPVSLSTA
TTDQLDTLPGVGPVLAQHIIDYRTQHGGFRSVDELREVNGIGERRFADLR
DLVRP
>gid:393193  SCO2575  hypothetical protein SCC123.13c.
MAESNGRKGTAARREGVAMPGSILEEVGHLLGGAMARNTVTRLSCPSCGS
GHVAQVLGDNGGISYVCTACGHSWS
>gid:393254  SCO2603  putative integrase
MEAEWTEADLALLEELTRAEALLPQNAPRALLSIRLSVLTDDTTSPVRQE
LDLRILARERGSRVVGVASDLNVSATKVPPWKRKELGEWLNNRSPEFDEL
LFWKLDRFVRRLSDLSTMIEWSLKRGKNLVSKNDSLDLTTTAGKIMVTII
GGIAEIEAANTSTRVASLWDYAKSQEDWIIGKPAYGYVTDEDDTGKVVLV
IDPEAAKALHWARRMALRGRSAGFMVRCLKRSGLMTQGLTVATLHRRLRN
PALLGYRVEEDKNGGQRRSKPLLGKDGRPIKVAPPLFTEEEFETLGAALD
KRRKSQPPRRVGGATQFLGVLLCADCKTNMTVQITNNTHGTYQYLRCRNC
KSGGLGAPNPERVYERLVQDVLKVLGDFPVQVRQYAEGAEARKEIKRLQE
TIALYMKDLEPGGRYTKTRFTKEQAEATLDKLISELEAINPETAKDRWVN
VHGGKTFREHWQEGGMEAMSADLYRVGIRCEVTRTKVPKVRAPKVHLRLL
IPKDVRERLVIREDDFAQ
>gid:393314  SCO2626  putative DNA repair hydrolase (fragment)
MPEGHTIHRLAQDCTAAFARTAVRVTSPQGKFADSAALLDGTVLTTADAH
GKHLFLGFGAAGNAAEDAAENPAWVHIHLGLFGKVAFGPVPAPPPTDTVR
LRLANDTAHVDLRGPTTCALITDPEKRAIHDRLGPDPLRPDADPAAAHRR
ISRSRTAIAALLMDQKVIAGVGNVYRAEVLFRHGIDPYRPGKDLTPAEWD
TIWQDLTALMREGVRNNRIDTVRPEHTPEAMGRPPRVDDHGGEVYVYRRA
NQPCHLCGGPISTAGLAARNLFWCPTCQKR
>gid:393326  SCO2632  putative transposase
MGEGLQVRCGLRHLHRVQEALVVHRNAPLTETGRLRLARCVVEDGWPVRR
AAERFQVSHTTASRWARRYRQLGVTGMSDRSSRPHHQPRRTAAAVEEHVL
RLRREHRIGPLRLAVRCGIAASTAHRILVRHGLPPLAALDRATGEPVRRY
ERARPGELVHIDVKKLGRIPDGGGHKTLGRAEGHRSRTNGAGWAYLHTAL
DDHSRIAYTEDLPDETAPTCAAFLVRATAYFASLGIRIERVLTDNAWAYS
KNTWRNTCRDLDISPRWTRPWRPQTNGKVERFHRTLLDEWAYQKPYTSDH
ERREAFTHWLHWYNYHRPHTGIGGHTPASRGTNLSEQHT
>gid:393415  SCO2666  putative serine/threonine protein kinase
MSEAGRTCQRPGCGGTYEDMGGGELYCDTCGLAPVVAAGGALGATPTGVT
GGGGSGGSRGSRGASGSGGSRSSARSSRTSSQSSRSSKSRRSVSGRLSRA
VSGRSTGRSVSVRSSGSSAGSTGRGRLGVGLVEVPAVPRPDPRVMVMDHP
EVPERKRFCSRSDCGAPVGRSRGERPGRTEGFCTKCGHPYSFVPKLKAGD
VVHGQYEVVGCLAHGGLGWIYLAVDKAVSDRWVVLKGLLDTGDQDAMAAA
ISERRFLAEIEHANIVRIYNFVEHLDQRTGSLDGYIVMEYVGGKSLKEIA
NARRSPQGRRDPLPVEQACAYGIEALEALGHLHSRKLLYCDFKVDNAIQT
EDQLKLIDMGAVRRMDDDESAIYGTVGYQAPEVADVGPSVASDLYTVGRT
LAVLSFDFQGYTTVYADSLPDPDSIEVFRQYESFYRLLVRATDPDPARRF
ASAQEMAEQLTGVLREVVSLQTGRARPAVSTLFGPEVRVTDTELFPRLDG
EVSRLGARVPPARGRRGGSGAALPGGAGAAPALPGGSGPAVGAPGGTVAH
APPVGAGTPVAASGGASASLPGAVSSAGSAGWLGTGASGLVKEADAPTAS
LTLPVPRVDAGDPNAGFLAGLMASAPAELLTALAAAPAPSTETRLRQIRA
RLENGDASGALEALGALEGERPDDWRVVWYRGLAALVTGAHEDAALAFDA
IYDAFPGEIAPKLALGLCAEVLGQLDNAAEYYRLVWSSDPSHVSAAFGLA
RVQLAAGDRAAAVRTLESVPESSVHCTAARVAAVRARLRQRTAAAGDLRF
LDDLIAAARQVEALDVYGLDPARREQLSAEVLGCALDWVLSGGRGSVPPA
AGGRTLLGSGLDERGLRFGLERSYRTLARLARGGEERIDLVERANRYRPR
TWV
>gid:393455  SCO2683  putative single-strand DNA-binding protein
MNETMICAVGNVATTPVFRDLANGPSVRFRLAVTARYWDREKNAWTDGHT
NFFTVWANRQLATNASGSLAVGDPVVVQGRLKVRTDVREGQSRTSADIDA
VAIGHDLARGTAAFRRTARTEASTSPPRPEPNWEVPAGGTPGEPVPEQRP
DPVPVG
>gid:393577  SCO2737  putative deoxyribonuclease.
MPTQPPAAPGERRLATLEGVLERVTYANEENGYTVARVDTGRGAGDLLTV
VGALLGAQVGESLRMEGRWGSHPQYGKQFHVENYTTLLPATVQGIRRYLG
SGLVKGIGPIFADRITQHFGLDTLTIIEEEPKRLIEVPGLGPKRTKKIAD
AWEEQKAIKEVMLFLQTVEVSTSIAVRIYKKYGDASISVVKNQPYRLAAD
VWGIGFLTADKIAQSVGIPHDSPERVKAGLQYALSQATDQGHCYLPEEKL
IADAVKLLQVDTGLVIECLGELAAEPEDPDGDPGVVREKVPHSEGGEPVT
AVYLVPFHRAELALAGQLRRLLHTDEDRMPGFRDVAWDNALGWLKRRTGA
DLAPEQEEAVKLALSEKVAVLTGGPGCGKSFTVRSIVELARAKKAKVVLA
APTGRAAKRLAELTGAEASTVHRLLELKPGGDAAYDRERPLDADLVVVDE
ASMLDLLLANKLAKAVPPGAHLLLVGDVDQLPSVGAGEVLRDLLAPGSPV
PAVRLTRVFRQAQQSGVVTNAHRINAGQHPLTDGMKDFFLFVEDDTEEAG
RLTVDVAARRIPAKFGLDPRRDVQVLAPMHRGPAGAGTLNGLLQQAVTPG
RPDVPEKRFGGRVFRVGDKVTQIRNNYEKGENGVFNGTVGVVTSLDPVDQ
RLTVLTDEDEEVPYDFDELDELAHAYAVTIHRSQGSEYPAVVIPVTTGAW
MMLQRNLLYTAVTRAKRLVVLVGSRKAIGQAVRTVSAGRRCTALDFRLAG
PRT
>gid:393632  SCO2759  probable transposase.
MFDTEDVGVFLGLDVGKTAHHGHGLTPAGKKVLDKQLPNSEPRLRAVFDK
LAAKFGTVLVIVDQPASIGALPLTVARDAGCKVAYLPGLAMRRIADLYPG
EAKTDAKDAAVIADAARTMAHTLRSLELTDEITAELSVLVGFDQDLAAEA
TRTSNRIRGLLTQFHPSLERVLGPRLDHQAVTWLLERYGSPAALRKAGRR
RLVELVRPKAPRMAQRLIDDIFDALDEQTVVVPGTGTLDIVVPSLASSLT
AVHEQRRALEAQINALLEAHPLSPVLTSMPGVGVRTAAVLLVTVGDGTSF
PTAAHLASYAGLAPTTKSSGTSIHGEHAPRGGNRQLKRAMFLSAFACMNA
DPASRTYYDRQRARGKTHTQALLRLARQRISVLFAMLRDGTFYESRMPAG
VELAA
>gid:393651  SCO2766  putative secreted ribonuclease
MLAIRTRRRKAAALATAAVLAAVAAPSLTATPATATTTTASTASTASTAS
TASTTNTDYDSTYYKDAIGKTGASLKSSLHTIISDQTKLSYSAVWDALKA
TDEDPDNSGNVILLYSGVSRSKSLNGGDTGDWNREHVWAKSHGDFGTSTG
PGTDIHHLRPSDVRVNSVRGNKDFDNGGSAVSEGGGSLTDSDSFEPRDAV
KGDVARMIFYMAVRYEGGDGFADLEVNGQVDNGSNPYIGKLPVLKAWNDE
DPPDAFEEHRNQVIYDDYQHNRNPFVDHPEWVESIW
>gid:393782  SCO2810  hypothetical protein
MTVKALSASAPASFDPGAEAARATAAILHDTLHGTERGVVVDSPPGAGKS
TLVVRAALELAEAGRPLMVVAQTNAQVDDLVLRLAEKNPDLPVGRLHSSD
ADPYDKALDALGNVRKSAKAADLAGQAVVLSTAAKWAHVKVDEPWRHAIV
DEAYQMRSDSLLAVAGLFERALFVGDPGQLDPFATVGSEQWAGLAYDPSS
SAVTTLLAHNPGLPQHRLPVSWRLPASAARLVSDAFYPYTPFRSGTRHGD
RSLAFAVPSDGSGPDRVIDEAAASGWGLLELPARHTPRTDPEAVRAVAAV
VRRLLDREGRATSERSPDPAPLTAARIAVGTAHRDQAAAVRAELAGLGVH
DVTVDTANRLQGREYDVTVVLHPLSGRPDATAFHLETGRLCVLASRHRHA
CIVVCREGVGDLLDDYPSTEPVQLGTLVKFPDGWEANHAVLAHLAEHRAA
WRP
>gid:393887  SCO2863  putative helicase
MLALAEQEAPGAYAVRPLTPLSETSLLTNSPEDLNLGSELRAELATADRI
DLLCAFVKWYGIRVLEDALLAAKARGVPIRIITTTYMGATDRRALDRFVR
EFGATVKVNYETRSTRLHAKAWLFRRGTGFDTAYVGSSNLSRAALLDGLE
WNVRLSSVATPAVMDKFEATFDAYWNDVAFETYDPDVDGEHLDAALAQAG
GTASTTDFKINLSGLQVHPFPHQRDMLERLSAEREIRGRHRNLLVAATGT
GKTVMAALDYRALSNQATSGRPRLLFVAHRKEILKQSLRTYREVLDDASF
GELLYGGADPHEWSHVFASVQSLNVQRLEQLAPDHFDVIVIDEFHHATAG
TYRRVIEHFTPKELLGLTATPERMDGLNVQDEFFDGRIAAEMRLWEALEN
DLLSPFHYFGIPDGTDLTNLTWQKGSYADQELGNLLTANDARARIIVKQV
RDKISDPGAMRALGFCVTKAHAHFMADYFRRAGFQAAALDSDSSSEVRAQ
ALRDLQDGKLQVIFSVDLFNEGLDIPDVDTLLLLRPTNSATVFLQQLGRG
LRRTDTKPVLTVLDFIGQHRAEFRFEEQFRAMTNLSRNRLVEHIERGFPQ
LPSGCQIILEGKAKSLVLDNIRTQLAATVKTLVKEVKEYSTPLLADYLRE
SRREIKELYKNGNSWTVVLRRAGLAEEPAPPGEEALLKRVHAFLHVDDPE
RAQAYLSLLADDAPSYEELDQTGQAYARMLFFNLWDNAGGYTSYAQGLAA
LRPQRALRDELRQVLSYVIDRADHVPIPLSDGLGTVPLKVHSSYNRSEIL
AALGIARFGGQMPRSFAQGVQWAEETQTDALLITLEKDEKDFSPTVRYKD
YAISPNLFHWESQNATSPDSATGKRYQQHAGRGSHVLLFMRRYATTDTGK
SQPWMLLGPATYVRHTGSKPMAITWHLDHDLPADVWTYSAAIQAS
>gid:394009  SCO2929  putative transposase
MAFAKDLKRRNVLQEAVYHRIKADFDLGAQAAVRTVKKVCDAYATFKANL
RAGNYGLEGSKRRIKAESKPVRFRETSAQPFDDRMLTWNLDTKMVSVWTV
AGRLKGIPFVCSPEAMRLLAQRKGESDLVMRDGMFFLLATIDIPEPEVSG
PDGFLGVDLGIVNIATTSDGRIMSSRQVNRYRQRKRDLRGKLQKKRTKSA
ARVLKRQRRKEARYATQRNHIIARKLVHTAERTSRGIGLEDLTGIRQRVT
AREDQRARLHSWAFAQLGAFVEYKAKRAGVAVVHVDPRNTSRQCSECWHT
HRTNRVTRARFVCRSCGIVLHADHNGSRNIAHRADAAWQRGAANRPRTP
>gid:394024  SCO2936  conserved hypothetical protein
MTTTASSSTSHHLSPAFPGRAPWGTAGKLRAWQQGAMEKYLQDQPRDFLA
VATPGAGKTTFALTLASWLLHHHVVQQVTVVAPTEHLKKQWAEAAARIGI
KLDPEYSAGPLSREYQGIAITYAGVGVRPMLHRNRVEQRKTLVILDEIHH
AGDSKSWGEACLEAFEPATRRLALTGTPFRSDTNPIPFVAYEEGNDGIRR
SAADYTYGYGSALADGVVRPVIFMSYSGNMRWRTKAGDEIAARLGEPMTK
DAVSQAWRTALDPRGEWMPAVLRAADQRLTEVRKAIPDAGALVIAADQDS
ARAYAKLIREITGDKATVVLSDDTGASKNIDAFSASTDRWMVAVRMVSEG
VDVPRLAVGVYATTISTPLFFAQAVGRFVRSRRRGETASVFLPTVPDLLT
FANEMERERDHVLDKPKKEGEEDPYAESEKEMDEANREQDEDTGEQEQFA
FEALESEATFDRVMYNGAEFGMQAHPGSEEEQDYLGIPGLLEPDQVQLLL
QKRQARQIAHSRKKPDDEADLLELPAERRPVVSHKELMELRKQLNTMVGA
YVHQSGKPHGVIHTELRRVCGGPPSAEATGGQLRQRIAKVQEWATRMK
>gid:394056  SCO2950  DNA-binding protein Hu (hs1)
MNRSELVAALADRAEVTRKDADAVLAAFAEVVGDIVSKGDEKVTIPGFLT
FERTHRAARTARNPQTGEPIQIPAGYSVKVSAGSKLKEAAKGK
>gid:394115  SCO2973  serine/threonine protein kinase
MARKIGSRYTAHQILGRGSAGTVWLGEGPDGPVAIKLLREDLASDQELVS
RFVQERTALLGLDHPHVVSVRDLVVDGNDLALVMDLVRGTDLRTRLDRER
RLAPEAAVAVVADVADGLAAAHAAGVVHRDVKPENVLLDMQGPLGPGGSH
PALLTDFGVAKLIDTPRRTRATKIIGTPDYLAPEIVEGLPPRAAVDIYAL
ATVLYELLAGFTPFGGGHPGAVLRRHVTETVVPLPGIPDELWQLLVQCLA
KAPASRLRASELSARLRELLPMLAGMAPLDVDEPDAEQPEDAPDASAASP
AAPVSTAEPVRRRGAVPLVPGAKPADSNRDTHTSMRVPAPDELAGGARGT
ARAPRASGAPRPGSARNRAATRRRRIAVGAGAVALVAAIGVGTWLATGGD
EDGGGPQDTRNSAPAAP
>gid:394121  SCO2974  serine/threonine protein kinase
MRPVGSKYLLEEPLGRGATGTVWRARQRETAGAEAAVAGQPGETVAIKVL
KEELASDADIVMRFLRERSVLLRLTHPNIVRVRDLVVEGELLALVMDLID
GPDLHRYLRENGPLTPVAAALLTAQIADALAASHADGVVHRDLKPANVLL
KQTGGEMHPMLTDFGIARLADSPGLTRTHEFVGTPAYVAPESAEGRPQTS
AVDVYGAGILLYELVTGRPPFGGGSALEVLHQHLSAEPRRPSTVPDPLWT
VIERCLRKNPDDRPSAENLARGLRVVAEGIGVHANSAQIGAAENVGALLA
PDPAPAQVPGAPDAAYDPNGATSVLPHTSGPAGAADPTAVLPSTGAPDPT
AVMPPVPPGQPGAPGQGGPEDPHPWQNQLRAARDRNEQTQVQYLDPNQDP
LRRRPQRQVSRPPQQPRQAPQGPPPQQPGYGYPQQQQPQRYATPQPQQPQ
RYAPPPAPEPQQPRREPRPPRQRSANPMRIPGLGCLKGCLVTVVVLFVAG
WLVWELSPLQEWIGTGKGYWDQLTDWFTTVTDWIGDLGGSGGG
>gid:394260  SCO3038  conserved hypothetical protein SCE34.20
MAGRFAPRPPRAVVRDGIPRQALAPGRVRVWAPDGPLDVGLVLGPLRRGP
GDPTFRTMPDGSVWRTGRTPAGPGTLRVTARGGEVRGEAWGPGAEWLLER
LPLMLGAEDDPSAFVPRHRLLAVTAHRRPGLRLTRTGLVLESLIPSILEQ
KVTTLEAYQAWRLLVRKFGEPAPGPAPGRMCVMPAARTWALIPSWEWHLA
GVDDKRASTVLRAVRVAARLEEAAGMEPAAARERLELVPGVGPWTSAETV
QRSHGAPDAVTVGDLHLPGIVGFALGGDRYADDAEMLRLLEPYAGQRHRA
ARLVLLSGRVPERRAPRMAPRGIERL
>gid:394423  SCO3102  eukaryotic-type protein kinase
MGQVWTAYDRRLDRRVAVKLLRPDKVAGAEADELRRRFVRECRITAQVDH
PGLVTVHDAGSEGEELFLVMQYVDGADLSDHLAEHDPYPWQWAVAVAAQL
CAVLSAVHAVPIVHRDLKPRNVMVKQDGTVTVLDLGVASVMDADTTRLTH
TGTPIGSPAYMAPEQAMGGAVGPYTDLYALGVLLHELLSGDVPFAGSTAL
GVLHRHLYEPPLPVRRIRPEVPEALEALVLRLLAKDPQHRPDSAQEVYEH
LALLLPALGVPTGGPLDPTRPFVRPHAPWPDRARTPAPQPAPVPPAAEAA
KPDVARAVDDVKRLLGEGRITQAVDVLGAILPAAAEQHGERSPVVRTLRR
QYAATLMDDGQYRRALPELRRLADERAAEAGQADPQCLRHRYDAAQCLEQ
LGEPAAALAEYRALLPYYENQYVAGDPDLAHDVRRRIGHLLLALGDRAAA
HDTLARLLHDVERVHGPGHPLAADIRRTLQWLGRMHG
>gid:394440  SCO3109  putative transcriptional-repair coupling factor
MSLHGLLDAVVKDTALAEAIAAAADGNRMHVDLVGPPAARPFAIAALARE
TGRPVLAVTATGREAEDLAAALRSLLPPEGIVEYPSWETLPHERLSPRSD
TVGRRLAVLRRLAHPRPDDPETGPVSVVVAPVRSVLQPQVKGLGDLEPVA
LRTGQGADLEEIVQALAAAAYARVELVEKRGEFAVRGGILDVFPPTEEHP
LRVEFWGDDVEEIRYFKVADQRSLEVAEHGLWAPPCRELLLTDDVRERAR
VLAEDHPELGELLGKIAEGIAVEGMESLAPVLVDDMELLLDVLPKGSMSV
VCDPERVRTRAADLVATSQEFLQASWAATAGGGEAPIDVDAASLWSIADV
RERARELDMMWWSVSPFAADETLTSDLDAEGDSDTLKLGMHAPETYRGDT
AKALADTKGWLAEGWRAVYLTEGHGPASRTVEVLGGEGIAARLDNDLEAL
SPSVVHVSCGSIDHGFVDPALKLAVLTETDLTGQKAAGREGARMPARRRK
TIDPLTLETGDYIVHEQHGVGRYIEMVQRTVQGATREYLVVEYAPAKRGQ
PGDRLYIPTDQLEQITKYVGGEAPTLHRLGGADWTKTKARAKKAVKEIAA
DLIKLYSARMAAPGHAFGSDTPWQRELEDAFPYVETPDQLTTIAEVKDDM
EKTVPMDRLICGDVGYGKTEIAVRAAFKAVQDGKQVAVLVPTTLLVQQHF
GTFSERYAQFPVSVKALSRFQSDTESKATLEGLREGSVDIVIGTHRLFSS
ETKFKDLGLVIVDEEQRFGVEHKEQLKKLRANVDVLTMSATPIPRTLEMA
VTGIREMSTITTPPEERHPVLTFVGPYEHRQIGAAIRRELLREGQVFYIH
NRVESIDRAAAKLREIVPEARIATAHGQMSEQALEQVVVDFWEKKFDVLV
STTIVESGIDISNANTLIVERGDNFGLSQLHQLRGRVGRGRERGYAYFLY
PPEKPLTETAHERLATIAQHTEMGAGMYVAMKDLEIRGAGNLLGGEQSGH
IAGVGFDLYVRMVGEAVADYRASLEGGVEEEPPLEVKIELPVDAHVPHDY
APGERLRLQAYRSIASANSEEDVKAVREELVDRYGKLPEPVENLLLVAGL
RMLARACAVGEIVLQGNNIRFAPVELRESQELRLKRLYPGSVIKPGTHQV
LVPRPKTAKVGGKPLVGRELLGWVGEFLTSILGS
>gid:394452  SCO3112  
PLQVREWTCTACGTVHDRDHNAAINVKQAAGLAAS
>gid:394556  SCO3151  conserved hypothetical protein SCE87.02c
MPSNASGRSDKDGAPPLPEPLRVPVADSHTHLDMQSGTVEEALAKAASVG
VTTVVQVGCDLAGSRWAAETAAAHDAVHATVALHPNEAPRIVHGDPDGWS
RQGAREPGGDAALDDALAEIDRLAALPQVKGVGETGLDYFRTGPEGKEAQ
ERSFRAHIEIAKRHGKTLVIHDRDAHTDVLRVLKEEGAPERTVFHCYSGD
AEMAGICARAGYYMSFAGNVTFKNAQNLRDALAVAPPELVLVETDAPFLT
PAPYRGRPNAPYLVPVTVRAMAEVRGVDEDTLATALAANTARAFGY
>gid:394605  SCO3174  putative exodeoxyribonuclease (EC 3.1.11.2) (putative secreted protein)
MCRYRAGMLTVTSVNVNGLRAAAKKGFVEWLAGTSADVLCLQEVRAEPHQ
LPDHAGAPEGWHVTHAPAAAKGRAGVSLYTRREPDAVRVGFGSTEFDTSG
RYVEADLPGVTVASLYLPSGEVGTERQDEKVRFMGEFLAYLKELRERAAA
QGREVVVCGDWNIAHREADLKNWRANKKNSGFLPEEREWLGRVLDPAEGG
YVDVVRALHPDVEGPYSWWSYRGRAFGNDAGWRIDYHVSTPGLAAKAVKG
YVERAATHAERWSDHAPVTVVYDR
>gid:394797  SCO3238  hypothetical protein
MRLRHRDGTTVHLAYCTNVHAAEDLDGVLAQLARYGEPVRERLGADRIGL
GLWLAAPVVTALAADRSALDLLRKELDLRGIEVVTLNAFPYAGFHAPTVK
KAVYRPDWTERPRLDHTLACARVLAELLPPDAARGSVSTLPLAWRTPWTP
RRDDLARRHLDLLSQGLAALNADTGRTVRVGFEPEPGCLIETTGQAVARL
AGADPERLGVCVDTCHLAVAFEEPGPALTRLAAALPVVKTQASCAVHADR
PADPAARAALAAFAERRFLHQTRQAAPGGPSAVDDLPEALGGALNGDAAW
RIHYHVPVQRDLPPPLRSTRPELVAALTTLLGGPTALTDHVEVETYTWPV
LPGAPDGGGLVDGIAGELAWTRSALTALGLTEESTP
>gid:394816  SCO3250  putative integrase
MAGPRKRNPNGAGTITKRKDGRYQCAVYVLQPDGTRARKFAYGKTWAECD
VKRRELLAKVDQGVPVPTKSAKLSEWMPYWLDNVIKPRRKLSTYDKYEAH
VRLYLVPLLGAKRLESLGVADVRRFLVRLEKETTAATAKESHRVLRSALT
SACREELITRNVAKLVEPPRTDSRELKPWTLDETLDFLAASRKDPLYAAF
VLAIAMGLRRGEIIGLRWSDLDLDNRVLYVRQQTQRRRGVLYDDDPKSRR
RRAVPLPALCIAPLRWHRMRQAAARIKAGEQWQESGYVFTTRTGRQVEPR
NVYRSFTRVAESAGLRVIRLHDARHGTATLLTAAGVAPRVVMEILGHSQI
SITMDVYTHVVQDTQREAMSHMDRLLRKRRPDRG
>gid:394830  SCO3260  MutT-like protein
MREDGRLLAIRRADNGTWELPGGVLELNETPEAGVAREVWEETGIHVEVD
ELTGVYKNTTRGIVALVFRCKPSGGTERTSSESTAVSWLTPDEVSDRMAE
VYAIRLLDALDGNGPHVRSHDGKHLIPTG
>gid:395008  SCO3355  putative adenine glycosylase
MTAPTKPSPGGPSDPAASGVPLHAPVIDWFDEHARDLPWRRPEAGAWGVM
VSEFMLQQTPVSRVQPVYEQWLARWPRPADLAAEAPGEAVRAWGRLGYPR
RALRLHGAAAAITERHGGDVPADHAQLLALPGIGEYTAAAVASFAYGQRH
AVLDTNVRRVLARAVTGVQYPPNATTAAERKLARALLPEEQERAARWAAA
SMELGALVCTAKKESCHRCPIAAQCAWRLAGKPAHDGPPRRGQTYAGTDR
QVRGRLLAVLRDAAGPVPQAALDQVWQEPVQRARALDGLVADGLVEPLAD
GLYRLPLS
>gid:395160  SCO3434  putative DNA polymerase I
MAERWALAVAEGGGVEVAPLGPDGLPTGPVRREPGLAEAVRARPDVARWV
WRSTAETAPRLLAAGVRVERCYDVEAAETLLLGHEGRYGEPRSAAAALAR
LRGGPVPPDPPQRSAEPGAQSSLFEPQAVHLPLSDLLAVYAEQQRRHDRS
ALPDRLRLLTAAESAGMLVAAEMNRAGLPWRADVHREVLHDLLGERYAGG
GEPRRLAELAEEVSAAFGRRVRPDLPADVVKAFGQAGVKVGSTRRWELES
LDHPAVKPLIEYKKLYRIWVAHGWSWLQDWVRDGRFRPEFLAGGTVTGRW
VTNGGGGLQIPKVIRRAVVADPGWRLVVADADQMEPRVLAAISRDPGLME
VAGRETDLYQSVSDRAFSGDRDQAKLAVLGAVYGQTSGDGLKNLAALRRR
FPRAVAYVDDAARAGEEGRLVRTWLGRTCPPAAGAADGTEEAGLPQDEPA
GAGDRAQEWVPGYASTNARARGRFARNFVVQGSAADWALLLLAALRQTCA
DMAAELVFFQHDEVIVHCPEEEAATVVAAIRDAAELAGRLTFGATPVRFP
FTTAVVECYADAK
>gid:395231  SCO3466  transposase
MSERKPYPSDLSDARWALIEPTLTAWRNARLERRPTGKPAQVDLRDVFNA
ILYLNRTGIPWKYLPHDFPGHGTVYFYYAAWRDEGIFTQLNYDLTALARV
KEGRKPEPTASVIDTQSVKTSTNVPLTSQGTDAAKKIVGRKRGILTDTIG
LILAVTVTGAGLSENAVGIRLLDQAKRTYPTIVKSWVDTGFKNAVIEHGA
TLGIDVEVVNRNPEKRGFHVVKRRWVVERSIGWIMMHRRLARDYETLTTS
SEAMIHIASIDNLAKRITDETTPTWRGTY
>gid:395235  SCO3467  transposase
MSERKPYPSDLSDARWALIEPTLTAWRNARLERRPTGKPAQVDLRDVFNA
ILYLNRTGIPWKYLPHDFPGHGTVYFYYAAWRDEGIFTQLNYDLTALARV
KEGRKPEPTASVIDTQSVKTSTNVPLTSQGTDAAKKIVGRKRGILTDTIG
LILAVTVTGAGLSENAVGIRLLDQAKRTYPTIVKSWVDTGFKNAVIEHGA
TLGIDVEVVNRNPEKRGFHVVKRRWVVERSIGWIMMHRRLARDYETLTTS
SEAMIHIASIDNLAKRITDETTPTWRGTY
>gid:395240  SCO3468  transposase
MFSGLSALVIEEVADDGEVIRVAARTRDVPSPCPVCGVLTGKVHGYHGRP
MADVPVDGRKVVVHVQVRRLVCPVLECRRQTFREQVPGLMERLQRRTNRL
TSQVSAVVKELCGRAAARLSRSLAVPMSFATALRLLRRIPIPVVRTPRVV
GVDDFALRRRHRYATIIIDAETGERIDVLPGREAAGLEAWLLEHPGVETV
CRDGSATYAEAIRRALPDAVQVSDRWHLWRNLCDKVLAEVRAHASCWATV
NPARPGGVHEQTIRERWHQIHDLLGRGVGLLECSRRLDLALNTVKRYSRI
PEPPADRIAPRYRTTLVDPYREHLRIRRATEPAVAVTQLFHEIKAQGYTG
SHNLLVRYLTQGRAEGNRPVITPRHATRLLLTHPEHLWTKDTALLATLTS
ACPETAELAKFVGTFAQLLTPEKGNDARLTEWIADVRAADLPHLHSFCNG
LELDRAAVNAGLTLPHHNGRTEGVNTRTKRIMRQMHGRAGFDLLRHRILL
S
>gid:395245  SCO3469  transposase
MSVESLNELVATAFSGISPLVIEDVVDEGERVVVRARTPGSTAVCPACGA
LSERVHGYHWRTVADLPIDGRRVVVRVRVRRLVCPTRGCRHTFREQLPGV
LERYQRRTARLTRQIKAVVKELAGRAGSRLLAKLAIGLSRHTALRTLLRI
ALPTGRVPRVIGVDDFALRRRHRYATVVIDAETHERIDVLPDRTADTLEA
WLRENPGVEVVCRDGSATYAEAIRRALPDAVQVTDRWHLWHNLCETALSE
VKAHSTCWAAVLDTPIYEGPRAQTTLERWHQVHGLLQQRVSLLECARRLQ
LSLNTVKRYARADRPERMLRVPKYRASLVDPYRDHLRRRRAEDPAVPVQH
LFEEIKALGFTGCLNLLHKYINQGRADADRSHISPRRLARMLLTRPENLK
SGHRDLLDQLTAACPEMTHLATSVRTFAQLLKPRPENVDALDHWITQVRA
ADLPHLHAFTRGLERDNDAVIAALTLPYSNGPTEGVNTKTKRIARQMHGR
AGFNLLRHRILLG
>gid:395255  SCO3472  putative transposase remnant
MGSKYTKRYLEEYKRDAIELVRSSGRTVTEAARELGISRSEPFQVRYGFG
RDFSRTPETRGRCVGASAGHPHLPRRALGMPCTGRPA
>gid:395317  SCO3490  transposase
MSVESLNELVATAFSGISPLVIEDVVDEGERVVVRARTPGSTAVCPACGA
LSERVHGYHWRTVADLPIDGRRVVVRVRVRRLVCPTRGCRHTFREQLPGV
LERYQRRTARLTRQIKAVVKELAGRAGSRLLAKLAIGLSRHTALRTLLRI
ALPTGRVPRVIGVDDFALRRRHRYATVVIDAETHERIDVLPDRTADTLEA
WLRENPGVEVVCRDGSATYAEAIRRALPDAVQVTDRWHLWHNLCETALSE
VKAHSTCWAAVLDTPIYEGPRAQTTLERWHQVHGLLQQRVSLLECARRLQ
LSLNTVKRYARADRPERMLRVPKYRASLVDPYRDHLRRRRAEDPAVPVQH
LFEEIKALGFTGCLNLLHKYINQGRADADRSHISPRRLARMLLTRPENLK
SGHRDLLDQLTAACPEMTHLATSVRTFAQLLKPRPENVDALDHWITQVRA
ADLPHLHAFTRGLERDNDAVIAALTLPYSNGPTEGVNTKTKRIARQMHGR
AGFNLLRHRILLG
>gid:395355  SCO3508  putative maturase-related protein
MNTDELEHRRFDDLFSLVADPGFLLVAWDLVRGNKGASAAGVDGSTASSI
ALWVGVEKFLDMLRSQIKDRSFQPMPVRERMIPKAGGKLRRLGIATITDR
VVQASLKLARFGEAAPGKRTGRKTGTAPRADFTPAIGCAFRYSPV
>gid:395356  SCO3509  conserved hypothetical protein
MQRSREGALVAELMMFRRDSDGRDVELSSSTVALEVELQRRVEAGLEQML
GVRFLASEYPTGPWHRGRIDTLGLDENGSPVVIEFKKGSDSGVMSQAVSY
LSWLESAHHEFEALVRKVLGAEAAESVDWRRPRMICIAAGFSHHDRVAVQ
RLPERIDLVRYRIFDGGLLGLLLVDSATGFPSAASSRRVRERAPVVDSVP
TASVVSPSAGGGAVPECLRDLYAELDEALTAWGEVEVASLRHYIAYRRLV
NVASVIFRPKHEAILMYLRLDPDTVELEEGFTRDMRGIGHLGTGDLEVRI
VSAADLEKAAPLIRRAFEAA
>gid:395358  SCO3510  putative DNA methylase
MSYTLHRGDALTVLKSLPDESVQAVITDPPYNSGGRTSSDRTGRTARAKY
VTSNSAHDLANFPGENRDQRSYRSWLTELLTEAYRASTEHAVTMVFTDWR
QEPTTSDALQMAGWTWSGTIPWIKPSSRPRKGGPKQDSEFIIWGVKGSLD
NTRDLYLPGHYIASQPRKGRVHITQKPVEVMQQLVQVCPEGGTVLDPFTG
SGSTGVAALREGRHFVGVELSAHYADVAEERLRAELTKDDFELAGPEA
>gid:395382  SCO3528  conserved hypothetical protein
MSAEQAVLGAVLLDPEQLTHLQWLAAGHFYRPVHQALFDALRKLRDDGHP
ALSADGPLPLSWVTDAVGEAGQHVRGLTAAYAHTLIQACPRTEHAPVYGR
MVLEGAIHRTVAQHAIRLHQAARADAVQGEVEGALRTADALAGVLTDLAR
RWGTDPRPVPPTAGPSAATDTLPPARSGQVAEDERFLLAVLAEQPGAMGE
VVAWLRPGDFADPTHGQLYRCLGALHHRGEPIDRITLLWEAQRRGLLADG
TVSSEQLTAVCEGMVPGSADWFGQRVMRSSVTRTAAASARGIRTLAQDEV
LGPGRLINHALHELGPLDEVRARWATANSSPAPKTTASAPSAAEPPPDRV
KAARARSTPRPGASPPAPASRLPTLSAARPPSRGHP
>gid:395396  SCO3539  putative transposase
MSERKPYPSDLSDARWALIEPTLTAWRNARLERRPTGKPAQVDLRDVFNA
ILYLNRTGIPWKYLPHDFPGHGTVYFYYAAWRDEGIFTQLNYDLTALARV
KEGRKPEPTASVIDTQSVKTSTNVPLTSQGTDAAKKIVGRKRGILTDTIG
LILAVTVTGAGLSENAVGIRLLDQAKRTYPTIVKSWVDTGFKNAVIEHGA
TLGIDVEVVNRNPEKRGFHVVKRRWVVERSIGWIMMHRRLARDYETLTTS
SEAMIHIASIDNLAKRITDETTPTWRGTY
>gid:395400  SCO3541  putative DNA polymerase
MTVWDDLVGQEKVCEPLAAAARDADAFVTAAATAGPLPQSTSMTHAWLFT
GPPGSGVARTARAFAAALQCVSPDRALGGVPGCGFCDGCHTALVGTHADV
STVVAMGAEIRAQDMRDTVRKSFTSPANGRWQIILVEEAERLNEKSANAV
LKAVEEPAPRTVWLLCAPSVEDVLPTIRSRCRHLNLRTPSVEAVADMLVR
REGIEPDVAAAAARATQGHIDRARRLATDKAARDRRAAVLKLPLRVEDVG
GALRAAQELVDAAAEDAKQLAEEMDAKETEELKAALGAAQGGRLPRGTAG
VIKDLEADQKRRKARTQRNSLDLALSELTGFYRDVLALQLGSRVAIANAD
AEDALERLARGSTPESTLRRIEAVAACGEALDRNVPPLLAVEAMTMALRS
G
>gid:395406  SCO3543  probable DNA topoisomerase I
MSPTSETAKGGRRLVIVESPAKAKTIKGYLGPGYVVEASVGHIRDLPSGA
AEVPEKYTGEVRRLGVDVEHDFQPIYVVNADKKSQVKKLKDLLKESDELF
LATDEDREGEAIAWHLQEVLKPKIPVKRMVFHEITKDAIRAAVANPRELN
QKLVDAQETRRILDRLYGYEVSPVLWKKVMPRLSAGRVQSVATRLVVERE
RERIAFRSAEYWDLTGTFATGRAGDASDPSSLVARLQTVDGRRVAQGRDF
DSLGQLKSANTLHLDEANARALAAALENTRFAVRSVESKPYRRSPYAPFR
TTTLQQEASRKLGFGAKSTMQVAQKLYENGYITYMRTDSTTLSDTAVSAA
RAQVTQLYGADYLPPQPRTYAGKVKNAQEAHEAIRPSGDRFRTPAETGLT
GDQFKLYELIWKRTVASQMKDATGNSVTVKIGGAASDGRDVEFSASGKTI
TFHGFLKAYVEGADDPNAELDDRERRLPQVAEGDALTAEEITVDGHATKP
PARYTEASLVKELEEREIGRPSTYASIIGTILDRGYVFKKGTALVPSFLS
FAVVNLLEKHFGRLVDYDFTARMEDDLDRIARGEAQSVPWLRRFYFGEGD
GTGGGGAADAGNGDGDHLGGLKELVTDLGAIDAREVSSFPVGNDIKLRVG
RYGPYVERGEKDAENHQRADVPEDLAPDELSVELAEELLAKPSGDFELGT
DPATGHAIVAKDGRYGPYVTEVLPEGTPKTGKNAVKPRTASLFKSMSLDT
VTLDDALKLMSLPRVVGADAEGVEITAQNGRYGPYLKKGTDSRSLQTEDQ
LFEITLEEALAIYAQPKQRGRAAAKPPLKELGTDPVSEKPVVVKDGRFGP
YVTDGETNATLRSDDSVEEITPERGYELLAEKRAKGPAKKTAKKAVKKTA
AKKAPAKKAAATKKTAAAKTTAAKKTAAKSTAKKTTAKTAAKKATASKTS
ED
>gid:395453  SCO3568  conserved hypothetical protein
MTRASDTQGPARDRTRGAMLSKEGLPGWLDPVVRAVETIRPTQLSRFLPP
ADGAGRQSAVLILFGDGTRAGEGGEEAGRGPELLLMERAGSLRSHPGQPA
FPGGALDPEDGDPRGDGPLRAALREAEEETGLDPAGVQLFGVLPKLYIPV
SGFVVSPVLAWWREPTPVGVVDPAETARVFTVPVADLTDPANRVTTIHPS
GHRGPAFLVESALVWGFTAGIIDRLLHFAGWERPWDREKQVPLDWRA
>gid:395455  SCO3569  putative endonuclease
MRRARRINRELAEVYPYAHPELDFENPFQLVVATVLSAQTTDLRVNQTTP
ALFAKYPTPEDLAAAVPEEVEEILRPTGFFRAKTKSVIGLSKALTEDFGG
EVPGRLEDLVKLPGVGRKTAFVVLGNAFGRPGITVDTHFQRLVRRWRWTE
ETDPDKIEAAVGALFPKSDWTDLSHHVIWHGRRICHARKPACGACPIAPL
CPAYGEGETDPEKAKKLLKYEKGGFPGQRLKPPQAYLDAGGKPAPPLGAG
>gid:395467  SCO3573  conserved hypothetical protein
MANGQANGQWYPPEWPDRIRALAAGTLTPVTPKRAATVMLLKDTGSGTES
TGAGAGPAVHMLRRRTSMAFAGGAYAYPGGGVDPRDDDRAVGWAGPTRAW
WADRLGVDEAGAQAIVCAAVRETYEETGVLLAGPTGDSVVGDTTGADWEA
DRAALVDRELSFAEFLDRRGLVLRSDLLGAWARWITPEFESRRYDTWFFV
AALPAGQRTRNASTEADRTVWITPADATAGYDKGELLMMPPTVATLRGLA
GCASAAEALASAPGRDMTPVLARARIVDGEIVLSWPGHEEFTKHVPSTAP
ATPTDPTGGAPA
>gid:395552  SCO3618  putative recomination protein
MYEGVVQDLIDELGRLPGVGPKSAQRIAFHILQAEPVDVRRLAQALMEVK
AKVRFCATCGNVAQEEQCNICRDTRRDPSVICVVEEPKDVVAIERTREFR
GRYHVLGGAISPIDGVGPDDLRIRELLARLADGSVTELILATDPNLEGEA
TATYLARMIKPMGLKVTRLASGLPVGGDLEYADEVTLGRAFEGRRLLDV
>gid:395559  SCO3621  putative serine-threonine protein kinase
MRRLGDGDPVGVPPGVQHWGNIGVYRLLGRLGTGGMGHVYLARSDRGRTV
AVKLVREELAALEEFRERFRHEVESARRVGGHWTAPVLDADTEAAVPWVA
TGYVAGPSLQQVVGHDHGALPERSVRTLGAGLAHALQDIHAAGIVHRDLK
PSNVLVTIDGPRVIDFGIARALQTVADGGLTRTGALVGSPGFMAPEQVRG
DRVTPACDVFCLGSVLAYAATGKLPFGSANSGAHALMFRIAQEEPDLEGV
PEGIADLVRDCLRKDPAARPALADVLERTGAQDTVTGGRSRDPWLPSALV
AQLGRHAVQLLDTENPEDPAHPAGGPDPSRTPGAAAATPGDALPDGASAP
AAGVRPEGAGHGGVAPGSAGPGGGGRGGVGPGGAGPGGVGPGGVGPGGVG
PGGVGSGGVGPGGAGPGGVGPGGAGSGGVGPGGAGSGGVGPGGAGSGGVG
PGGAGSGSAGPDGADPGGVGPGGAWPGGGGARGGGSGGEGAAQGADPASG
GPPPPREPGSGGGAPLNHLPTQVAGRHPATPPPGAHGHAQPPAPGYDPAP
AWHASQPGHQHPYDGGGGLGPTPPQGPPPPYGPPHEPVRNGRSTALLIVV
ALVVALGAGGSVYALMRGDDDGRAGGDPTPTRSTGAPQDPGTGASGSSGA
TGGRSPSTGPAADEGTVPTGYLGDWSTSIDNASGTHPRTLSIVPGEVGDT
VLTLVADGPTDTGTYHCVFEAALTAEPGSGGPLRLGPSAVRTGPASSCAP
GGPSTVTLRPDGSLERTSDDTGESLLYTRAQGY
>gid:395649  SCO3660  hypothetical protein
MKRVLAVQRISRDTEASSALTRQDEALTRAIRQGTYREVGRVVDATVSGA
VHLDQRPSLRRWLAEPLVHEWDVLMVTEQDRITRDDMHWWHFVDWTLKNG
KDVEVLDDPTFDIQSEDGRMLAGIKAAQAAKYRKAVQAKQLDRTQFFREN
RLWNGGVWPFGYRAEVFSHLGERRKRLTVDPCTSRLVREAYDRLVHGEGT
VYAVARDWNLRGVPSARDHQRREQNKDLPEQEHRPEKGTRWSVTPLRNIL
RSPALMGVMMHRSEPVLDAEGQPVVWAEPVLTSEEFAELQEVVLTEQRTA
ESGPGKRWRTPLLGVVFCMCGRPMYVRHQKNHLADGTTAVLTYYQCRSVS
EMNRCEAPSTWRAESLCAVVERDFLDQAGDRTEMKRTYVPGRDHTAAIAE
LRSALANLTVAIGTATAPAAVAVLTRQMDEHARTITRLEAEPVVHARWKE
EPTGAAYREQWRAVKDWESRAALLAKAGVRFFCEGTHKSGSVHMYLPSAS
QRRVSGVSLSKEAVEVVGIGPLEEEARRYFTDLRRTKGMGPGFWYSR
>gid:395751  SCO3714  putative transposase
MQLRYAFRVYPDAGQRLALARAFGCARVVFNDVVRAREDARKAGRPFQTA
AGLSRKLITEAKRTAERSWLGEVSAVVLQQSLRDAETAYRNFFASLKGTR
KGPRVGPPRYKSRKDARQSIRFTANARWSITDSGRLNLPKIGAVKVKWSR
TLPTIPTSVTVIKDAAGRYFASFVVDTDPAADRVRMPDADRTVGIDLGLT
HFAVLSDGTKIGSPRFLRRAEKKLKKAQQELSRKQMGSKNRDRARLKVAR
AHAKVADARREFHHQLSTRLISENQGIAVEDLSVAGLARTRLAKSVHDAG
WSSFVGMLKYKAERYGRTLVVIGRFEPTSQTCSTCGVKDGPKPLQVREWT
CTACGTVHDRDHNAAINVKQAAGLAASACGAPVRPGAIPAQREETGSHGL
PTEPRAA
>gid:395786  SCO3732  putative DEAD-box RNA helicase
MNRARTNDRRRAGDGPRRSRSAGRPQNSGRRPAAAPQGGEFALPKTITPA
LPAVETFAELDLPARMLAALGDQGVTEPFPIQAATLPNSLAGRDVLGRGR
TGSGKTLAFGLALLARTAGKRAEPRRPLALVLVPTRELAQQVTDALTPYA
RAVGLRSATVVGGMSIGRQAGALRSGAEVVVATPGRLKDLIDRGDCALGD
VTITVLDEADQMTDMGFMPQVTALLDQVAADGQRMLFSATLDRNVDKLVR
RYLTDPVVHSVDPSAGAVTTMEHHVLHVQDEDKQRATIEIAARDGRVIMF
LDTKHRVDRLVKHLLKSGVRAAGLHGGKSQPQRTRTLAQFKDGQVTALVA
TNVAARGIHVDNLDLVVNVDPPGDHKDYLHRGGRTARAGESGSVVTLVTP
DQRREMTRLMSLAGITPQVTPVRSGEAELARITGAQTPSGVPVVITAPVV
ERPRRAAAGAGSPSRGRRGRSAQGRSGGQGTGTQARTGAAQSRTAGQSRT
AAQGRPTGESPRRRPRRQSTGGSTGSAA
>gid:396006  SCO3820  putative serine/threonine protein kinase
MSDAPENWGNGGLVGDGRYRLTRRLGRGGMAEVFAAEDVRLGRTVAVKLL
RADLAEDPVSKARFTREAQSVAGLNHHAIVAVYDSGEDVVGGQSVPYIVM
EIVEGRTIRDLLLNAEAPGPEQALIIVSGVLEALAYSHQHGIVHRDIKPA
NVIITNTGAVKVMDFGIARALHGAQSTMTQTGMVMGTPQYLSPEQALGKA
VDHRSDLYATGCLLYELLALRPPFTGETPLSVVYQHVQDIPTPPSEVSDA
TPPELDGLVMRSLAKEPDDRFQTAEEMRGLVQYGLQMLYEQGGHTGTWNT
GPVAAHDGRHTPSAGLAGTTVMPHPADHGSSGTQQIPQPILPGRYDGDDG
GFEGAGNKGTGRGKLWILAVLAVIAIAAGVALALNNGDDGKGGTETDKSP
SATTSQSTGEESPSSSPSDEATQETTDPGTEQGSGGGGTGDGDWDKPYTP
TWSPSETATDDPTGDPTGDPTGDPTGDPTGDPTGGGEPTGGGEPTGGGEP
TGGGDPTGGATGAPGGSEGGEG
>gid:396091  SCO3860  putative serine/threonine protein kinase
MGEVFAGRYELVDPIGRGGVGAVWRAWDHRRRRYVAAKVLQQRDAHSLLR
FVREQALRIDHPHVLAPASWAADDDQVLFTMDLVTGGSLVHLVGDYGPLP
PEFVCTLLDQLLSGLAAVHGEGVVHRDIKPANLLLEATGTGRPRLRLSDF
GIAMRLGEPRLTETNLVVGTPGYLAPEQMMGAEPDFPSDLFAVGLVALYL
LEGAKPDAKVLVQHFAEHGTPPAPRGIPEPLWQVVATLLQPDPSARFRTA
TGARKALAAAVELLPEPGPDDELIEIFDQVGPLPTGFGPEGPFKRASGVD
DVPTDPRRPAGTPPGVDPSPPATPPTPPPAPPWQGTPPAGPSSGLDRPSP
GSPGPPPTGPDSTPASPPPGTPVTATGTPSAPGLPPASDQGWTPSTPSGP
TAPPSAPSAPSAPSAPGPTRPAPHGTHSEEVPLAERPGAMSETGSFHLPP
PQPTVTPTSDAAASDAAAEAAQPPAPHPAFTGGPRGLPPDRAPGRSQHPA
PHGPPLTARSLAPSPARRADVPTAAYTARNPRSAPPAQHRGARRRRRPGP
PARVALPVLLLALACYAVGFWALTRI
>gid:396135  SCO3873  DNA gyrase subunit A
MTTPEGDALAMRVEPVGLETEMQRSYLDYAMSVIVSRALPDVRDGLKPVH
RRVLYAMYDGGYRPERGFYKCARVVGDVMGNYHPHGDSSIYDALVRLAQP
WSMRMPLVDSNGNFGSPGNDPAAAMRYTECKMAPLSMEMVRDIDEETVDF
TDNYDGRSQEPTVLPARFPNLLINGSAGIAVGMATNIPPHNLREVAAGAQ
WYLENYEASHEELLDALIERIKGPDFPTGALVVGRKGIEEAYRTGRGSIT
MRAVVEVEEIQNRQCLVVTELPYQTNPDNLAQKIADLVKDGKVGGIADVR
DETSSRTGQRLVIVLKRDAVAKVVLNNLYKHTDLQSNFGANMLALVDGVP
RTLSLDAFIRHWVNHQIEVIVRRTRFRLRKAEERAHILRGLLKALDAIDE
VIALIRRSDTVEIARGGLMDLLEIDEIQANAILEMQLRRLAALERQKIVR
EHDELQAKITEYNEILASPVRQRGIVSEELTALVEKYGDDRKTKLIPYEG
DMSIEDLIAEEDIVVTVTRGGYIKRTKTDDYRAQKRGGKGVRGTKLKEDD
IVNHFFVSTTHHWLLFFTNKGRVYRAKAYELPDAGRDARGQHVANLLAFQ
PDETIAQIRAIRDYEAVPYLVLATKAGLVKKTPLKDYDSPRSGGVIAINL
REQADGSDDELIGAELVSAEDDLLLISKKAQSIRFTASDDTLRPMGRATS
GVKGMSFREGDELLSMNVVRAGTFVFTATDGGYAKRTSVDEYRVQGRGGL
GIKAAKIVEDRGSLVGALVVEEHDEILAITLSGGVIRTRVNGVRETGRDT
MGVQLINLGKRDAVVGIARNAEAGREAEEVDGDVAVDETAEGAATTGTDE
GEAPSAE
>gid:396137  SCO3874  DNA gyrase subunit B
MADSGNPNENNPSTDTGVNDAVSTSHGDASASYDASAITVLEGLDAVRKR
PGMYIGSTGERGLHHLVQEVVDNSVDEALAGHADTIDVTILPDGGVRVVD
NGRGIPVGIVPSEGKPAVEVVLTVLHAGGKFGGGGYAVSGGLHGVGVSVV
NALSTRVAVEVKTDGYRWTQEYKLGVPTASLARHEATEETGTTVTFWADG
DIFETTDYSFETLSRRFQEMAFLNKGLKINLTDERESAKATAGADEAGED
EKHEVKSVSYHYEGGIVDFVTYLNSRKGELVHPTVIDLEAEDKDKSLSLE
VAMQWNGGYTEGVYSFANIIHTHEGGTHEEGFRAALTSLINKYARDKKLL
REKDDNLTGDDIREGLTAIISVKLAEPQFEGQTKTKLGNTEVKTFVQKVV
YEHLTDWLDRNPNEAADIIRKGIQAAHARVAARKARDLTRRKGLLESASL
PGKLSDCQSNDPTKCEIFIVEGDSAGGSAKSGRNPQYQAILPIRGKILNV
EKARIDRILQNQEIQAMISAFGTGVHEDFDIEKLRYHKIILMADADVDGQ
HINTLLLTFLFRFMRPLVESGHVYLSRPPLYKIKWGRDDFEYAYSDRERD
ALIEMGRQAGKRIREDSVQRFKGLGEMNAEELRITTMDQEHRVLGQVTLD
DAAQADDLFSVLMGEDVEARRAFIQRNAKDVRFLDI
>gid:396142  SCO3876  DNA replication protein
MHVTHLSLADFRSYARVEVPLDPGVTAFVGPNGQGKTNLVEAVGYLATLG
SHRVSSDAPLVRMGAERAVIRAQVRQGERQQLIELELNPGRANRARVNRS
SQVKPRDVLGIVRTVLFAPEDLALVKGDPGERRRFLDELITARSPRMAGV
RSDYDRVLKQRNTLLKSAALARRHGGRTMDLSTLDVWDQHLARAGAELLA
QRLDLIASVQPLADKAYEQLAPGGGPVALEYKPSAPGEAHTREDLYEQLM
AALAEARKQEIERGVTLVGPHRDDLLLKLGSLPAKGYASHGESWSYALAL
RLASFDLLRAEGNEPVLVLDDVFAELDARRRERLAELVAPGEQVLVTAAV
DDDVPHVLAGARFTVAEGTVERV
>gid:396156  SCO3878   DNA polymerase III, beta chain
MKIRVERDVLAEAVAWAARSLPARPPAPVLAGLLLKAEEGQLSLSSFDYE
VSARVSVEAEIEEEGTVLVSGRLLADISRALPNRPVEISTDGVRATVVCG
SSRFTLHTLPVEEYPALPQMPEATGTVPGEVFASAVQQVAIAAGRDDTLP
VLTGVRIEIEGDSVTLASTDRYRFAVREFLWKPENPDISAVALVPAKTLQ
DTAKALTSGDQVILALSGSGAGEGLIGFEGAGRRTTTRLLEGDLPKYKTL
FPTEFNSVAVIETAPFVEAVKRVALVAERNTPVRLSFEQGVLILEAGSSD
DAQAVERVDAQLEGDDISIAFNPTFLLDGLSAIDSPVAQLSFTTSTKPAL
LSGRPAVDAEADEAYKYLIMPVRLSG
>gid:396162  SCO3879  chromosomal replication initiator protein
MADVPADLAAVWPRVLEQLLGEGRGQGVESKDEHWIRRCQPLALVADTAL
LAVPNEFAKGVLEGRLAPIVSETLSRECGRPIRIAITVDDTAGEPAGPAP
QAPQSPPSRPQHRYEEPELPAPGQGGREEYRDRDEYEGYGRNRADQLPTA
RPAYPQEYQRPEPGSWPRPAQQDDYGWQQQRLGFPERDPYASPNQEPYGQ
EPPPPYSHENRTSYQQDYRPQPPERPSYDAQRGDYEQARGEYEQPRGDYD
KPRGDYDQQRGDYDQRGPRRDLPEPPPGSGHVHRGGPVGPGPATGAPGPL
AAQPAPATGPGEPTARLNPKYLFDTFVIGASNRFAHAAAVAVAEAPAKAY
NPLFIYGESGLGKTHLLHAIGHYARSLYPGTRVRYVSSEEFTNEFINSIR
DGKGDSFRKRYREMDILLVDDIQFLADKESTQEEFFHTFNTLHNANKQIV
LSSDRPPKQLVTLEDRLRNRFEWGLITDVQPPELETRIAILRKKAVQEQL
NAPPEVLEFIASRISRNIRELEGALIRVTAFASLNRQPVDLGLTEIVLKD
LIPGGEDSAPEITSTAIMGATADYFGLTVEDLCGTSRGRALVTARQIAMY
LCRELTDLSLPKIGALFGGRDHTTVMHADRKIRNLMAERRSIYNQVTELT
NRIKNG
>gid:396201  SCO3893  hypothetical protein
MAERSTAAVDVADTSGDEPLTAQADQSTADGVANNRERDTDSDEAQGNPT
SEEPGKTSPPELHSGHKLARRYRLEECVTRLDGFSSWRAVDEKLRRAVGV
HLLPADHARARSVLAAARSSALLGDPRFVQVLDAVEDNDLVYVVHEWLPD
ATELTALLAAGPLEVYDAYQMVSQVASAMAAAHREGLAHLRLSPNAVLRT
STGQWRIRGLAVNAALRGISSDTPQRTDTEAIGALLYATLTQRWPYESDA
HGLAGLPKDIGLIPPDQVRAGVHRGLSELAMRALANDGATASRHESPCTT
PEELVKAIGEMPRIRPPEPAYTAPPQYQRTPYQQGGYSRPAPHPGVTQPV
PTPPPPLQSRTGKALKWGVSALLIAALGLGSWQLADALMERGGKADDPNQ
TQTVDGDNDKSPKEPVSRPISIAGAHDYDPFGSDGSEYPENVGKAYDGDP
GTYWQTSHYASADFGRLKPGVGIVVDLGKVQQVGKVALTFGGDTSVELRS
AGATDSEPQSFEGYQKIAGGNGTTVDLKPDKAVKTRYLLVWLTKLPVTDG
QFRGRVADIKVTS
>gid:396225  SCO3907  putative single-strand DNA-binding protein
MAGETVITVVGNLVDDPELRFTPSGAAVAKFRVASTPRTFDRQTNEWKDG
ESLFLTCSVWRQAAENVAESLQRGMRVIVQGRLKQRSYEDREGVKRTVYE
LDVDEVGASLRSATAKVTKTSGQGRGGQGGYGGGGGGQGGGGWGGGPGGG
QQGGGAPADDPWATGGAPAGGQQGGGGQGGGGWGGGSGGGGGYSDEPPF
>gid:396237  SCO3911  putative replicative DNA helicase
MSISEPLDDPWADSGPSDRLPASRRRTEGGRGRDEQHERGPDSGWDGGGA
AFERVPPQDLDAEQSVLGGMLLSKDAIADVVEILKGHDFYKPAHETVYQA
ILDVYAKGEPADPITIAAELTKRGEINKVGGASYLHTLVQTVPTAANAAY
YAEIVHERAVLRRLVEAGTRITQMGYAADDDVDEIVNRAQAEIYAVTEQR
TSEDYLPLGDIMEGALDEIEAIGSRSGEMTGVPTGFTDFDSLTNGLHPGQ
MVVIAARPAMGKSTLALDFARAASIKNNLPSVIFSLEMGRNEIAMRLLSA
EARVALHHMRSGTMTDDDWTRLARRMPDVSAAPLYIDDSPNLSMMEIRAK
CRRLKQRNDLKLVVIDYLQLMQSGGSKRAESRQQEVSDMSRNLKLLAKEL
EIPVIALSQLNRGPEQRTDKKPMVSDLRESGSIEQDADMVILLHREDAYE
KESPRAGEADLIVAKHRNGPTATITVAFQGHYSRFVDMAQT
>gid:396290  SCO3937  putative integrase /recombinase
MLAADPNLKCVVCYARISFDGRVKDAHGIEDQHRDMSEAARRFGWLIVYR
YTDNDKSASKESVVRDDFEQLLADLAAGATPEGYPVHGVMAVNDDRLYRR
PSDWERYLKAFTSQDGRVYHDSNGLQDLYAEGFEIKGLVGVAMSLSETRK
KQRRSRNSHRSRAIRGQSVSAWRPFGWEDDKVTLRPDEAEAIRTAVHDVI
AGASISEITRRWKEAGFITSRGNPFQYQTVKQVLVNARLCGYREIKGEIV
RDGDDQPIVGEWEAIVTPKQWFAVTAKIRERGHGTGTPRGGLVHKYLLTN
ILRCGNVLEDGTVCNNKMIGIKANDWLKYQHAYMCKKTVDGGCNKTYKRG
DKTDKIIEELVIAKLERDAATKAQDVPDWDKAEALERALQSRRELERRWH
DDEDTDIDDEAFFRNLPVLERRIKELRVDQKAHEALKAEAEEAEADIRKS
WGAKTLTQKREAMKKVLGAVIALPGGKGNKTFDPDLLKPVWKTSE
>gid:396400  SCO3997  putative integrase
MAGKRKSRRGFGRVRKLPSGRFQARYPGPDGVLRPADRTFATTTDADRWL
MRKRIEIEEGCWLDPAEGQTTVRDWAARWLAAVSPQLKHKTQASYRSLIN
SLINPVLGDRELSSLRPITVTEWVGAMRTKGLSASRIRQAYRVLSQIMRS
AVDNDMIPQTPCRGVRLPRMPQTEPHILTPLEASRIVKGATKPHDLLIAL
LAYAGLRVGEAFALRRVDVDVSGGFVLVDENLAEANGVLVFDTPKSHQKR
LLRVGPSLAKRLGRHLETLPGGDDALLFTTPGGKPLRYNQWRKAYFDPAV
SAAGLTDVTPHDLRASHGTWVADRYGVMTAAHRLGHSNASVTTRHYARPV
AGRDDQVAEAADSWLSGSEDNDDGSAAVPA
>gid:396524  SCO4047  conserved hypothetical protein
MDVTLHLAQDPEADELLGRSPLAALVGMLLDQQVPMEWAFKGPSTIARRM
GAEDLDAHDIAAYDPESFAALLSDKPAVHRYPGSMAGRVQQLCRYLVETY
DGDAEAVWRGVSTGKELLKRLQELPGFGKQKAQIFLALLGKQLGVRPEGW
REAAGAYGEADSFRSVADIRGPESLTKVRAHKQEMKAAAKAAKASGK
>gid:396567  SCO4065  putative transposase
MNIQGDLPPDSDRHSRFSWCARHFLRGLLLDGGRKSVEPMAARLGEDGNR
QALAHFVTSSPWDAAHVRARLAWRMQPVIKPTALIIDDTGFLKDGDASAC
VTRQYTGTAGKVTDCQDGVSLHLASNGASAAVNWRLFLPGSWDSASPKAD
PAKVARRARCAIPAEVGHVEKWQLALDMIDETRSWGIEVPQVIADGGYRD
TAAFRLGLEERGLDYVVGISTTTTAQPEHAQPCTPAYSGLGRRPAPAYPE
PAQTVKSLVIAAGKRAARPVQWREGSRPGNGRSGHKRMYSRFVVLRVRPA
GREIRKATAGTGLPVRWLLAEWPADQDEPVQFWLSNLPEATSLPALVRTA
KLRWRIENDYREMKQTLGLAHFEGRTWPARHHHVTLVSVAHAFCTLQRLS
RSPKETASA
>gid:396571  SCO4067  DNA polymerase III subunit gamma
MSSLALYRRYRPESFAEVIGQGHVTDPLQQALRNNRVNHAYLFSGPRGCG
KTTSARILARCLNCEQGPTPTPCGECQSCRDLARNGPGSIDVIEIDAASH
GGVDDARDLREKAFFGPASSRYKIYIIDEAHMVSPQGFNALLKVVEEPPE
HLKFIFATTEPEKVIGTIRSRTHHYPFRLVPPGTLRDYLAEVCGREGIAV
EEGVLPLVVRAGAGSVRDSMSVMDQLLAGAGADGVTYDMTTSLLGYTDGS
LLDSVVESFVSGDGSAAFGVVDHVIEGGHDPRRFVTDLLERLRDLVILAA
VPDAAEKGLIDAPADVVERMQEQARSFGAAELSRAADIVNEGLTEMRGAH
SPRLQLELICARVMLPAAYGDERSVMARLDRIERGVQFSGGAGAPAMGYV
PGPEAHAGAGAAAPVAAPVPGGGPAAARAAVRGQGAGTPPGGDPSADVAA
GAGDGAGAGTGPASDAGGAPPGGGHPQAAPSSADVAPVPAPPSPATPSPA
AAPAPAPASAPGAWPSAAPAGGGRRPGGWPTATPAGGGQPRTPAAPASGP
AATAAQAPAPAAAAPAPVSPPSGASGVDPRSLWPNILEAVKNRRRFTWIL
LSQNAQVTGFDGTSLQLGFVNAGARDNFVSSGSEDVLRQALAEQFNVQWK
IDAVVDPSGGGAAPPPAGGPSGPSGGLGGSGGGGFGGGSAGAGGFGGGGG
FGGGAPAQRPSASASTPAAPPQAPASAPAPAPAPRPSAPEPPPVSPEDDI
PEDDDPDLDESALSGKELLVRELGATVVEEITNE
>gid:396622  SCO4091  putative DNA-binding protein
MTARTPDAEPLLTPAEVATMFRVDPKTVTRWAKAGKLTSIRTLGGHRRYR
EAEVRALLAGIPQQRSEA
>gid:396624  SCO4092  ATP-dependent helicase
MSTHPAPALGTLAPRLTELSLRDAHRLGRRLEGARKIRKPEARAAVLAEI
ETEVAKAEQRIGERRARVPAVSYPEQLPVSQKKDEIAAAIRDHQVVIVAG
ETGSGKTTQIPKICVELGRGVRGMIGHTQPRRIAARTVAERVADELDTPL
GETVGWKVRFTDQVNPESTFIKLMTDGILLAEIQTDRELRAYDTIIIDEA
HERSLNIDFLLGYLAQLLPKRPDLKVVITSATIDPERFSRHFGDAPIVEV
SGRTYPVEVRYRPLLEEDGDDADRDQITAITDAVEELMGEGKGDILVFLS
GEREIRDTADALEKKKYRFTEVLPLYARLSHAEQHRVFQQHTGRRIVLAT
NVAETSLTVPGIKYVIDPGFARISRYSHRTKVQRLPIEPVSQASANQRKG
RCGRTSDGICIRLYSEDDFTARPEFTDAEILRTNLASVILQMTAAGLGEI
EKFPFIDPPDHRNIRDGVQLLQELGALDPAQKDVRKRLTDTGRKLAQLPV
DPRLARMVLEADKNGCVREVMVIAAALSIQDPRERPAEKQAQADQQHARF
KDESSDFLAFLNLWRYVREQQKERGSSSFRRMCKQEYLNFLRIREWQDIY
TQLRTVAKQMGIHLNDEEHPAPDDRVHVSLLAGLLSHVGMKDVKDAGGEG
GRNTGKNEYLGARNAKFAIFPGSALFKKPPRFVMSAELVETSRLWARVNA
KIEPEWVEPLAGHLLKRTYSEPHWEKDQAAVMAYEKVTLYGVPIVAQRKV
NYGRIDPEISRELFIRNALVEGDWRTHHKFFADNRRLLTEVEELEHRARR
RDILVDDETLYDFYDQRVPEHVVSGAHFDSWWKHKRHEQPDFLDFEREML
INEKAGAVTKDDYPDSWRQGPLKFRVTYQFEPGADADGVTVHVPLQVLNQ
VSDEGFDWQIPGLREQVVTELIRSLPKPVRRNYVPAPNYAQAFLDRAVPL
QEPLTVTMARELKRMVGVPFDAEDFDWSKVPDHLKITFRIVDERRRKLAE
DKDLEALKLRLRPKARKALSQAAAATAERSGGESLERSGLTDWTIGTLTR
VFETRRAGQPVKAYPALVDDGSKADTVSVRLFDTEAEQAQAMWRGTRRLI
LRNIPVNPAKFASEKLTNQQKLGLSANPHGSIQALFDDCATAAADKLIAD
FGGPAWDEESYRKLYDRVRAEIVDTTVRTVGQVQQVLAAWQACERRLKAV
RSPALLANLQDVRGQLDALVKPGFVTEAGIKRLPDLMRYLVAADRRLQQM
PTGVQRDTSRMEKVHEMRDEYAWLLEQMPQGRPVPRQVLEVRWMIEELRV
SYFAHALGTAYPVSDKRIVKAIDALAP
>gid:396636  SCO4096  ATP-dependent RNA helicase
MPTRDSGRFRIRMSMTSTDHVVVPGNAEGAEGAPEAVEAVDASATTESPE
AAEAPEAAPEPTFADLGLPEGVVRKLAQNGVTTPFPIQAATIPDALAGKD
ILGRGRTGSGKTLSFGLPTLATLAGGRTEKHKPRAVILTPTRELAMQVAD
ALQPYGDVLGLKMKVVCGGTSMGNQIYALERGVDVLVATPGRLRDIINRG
ACSLENVQIAVLDEADQMSDLGFLPEVTELLDQVPAGGQRMLFSATMENE
IKTLVDRYLKDPALHEVDAAQGAVTTMSHHILVVKPKDKAPVTAAIASRK
GRTIIFVRTQLGADRVAEQLRDAGAKADALHGGMTQGARTRTLADFKDGY
VNVLVATDVAARGIHVDGIDLVLNVDPAGDHKDYLHRAGRTARAGRTGTV
VSLSLPHQRRQIFRLMEDAGVDATRHIIQGGAAFDPEVAEITGARSMTEV
QAESAGNAAQQAEREVGQLTKELERAQRRANELREEADRLVARVARERGE
DPETVLAEVAATVAEPEVSLPEQSGARDVEKAERGGNDRERSQDRRDYRR
DDRGDRGGRSFDRRDDRRDDRGGRSFERRDDRGDRGGRSFERRDDRGGFR
RDDRGDRGGRSFERRDDRGDRGGRSFDRRDDRRDDRGGRSFERRDDRGDR
GGRSFERRDDRGGFRRDDRGDRGGRSFERRDDRGERGGHRGSDRPFNRDR
QGDRPGFRSGGHDRPYGRRDEHRGSSFGRRDDKPRWKRNG
>gid:396744  SCO4143  putative mutT-like protein
MSPADDDTTVRAAGCVLWRPAPQAAPHGRELCLVHRPKYDDWSHPKGKLK
PGEDPLAGALREVAEETGYAAVPGAELTTVRYLANGRPKEVRYWAAAAGT
GAFAPSDEVDRILWLPPEAARARLTQPRDRALVDELLTTRLP
>gid:396846  SCO4183  putative transposase
MGEGLQVRCGLRHLHRVQEALVVHRNAPLTETGRLRLARCVVEDGWPVRR
AAERFQVSHTTASRWARRYRQLGVTGMSDRSSRPHHQPRRTAAAVEEHVL
RLRREHRIGPLRLAVRCGIAASTAHRILVRHGLPPLAALDRATGEPVRRY
ERARPGELVHIDVKKLGRIPDGGGHKTLGRAEGHRSRTNGAGWAYLHTAL
DDHSRIAYTEDLPDETAPTCAAFLVRATAYFASLGIRIERVLTDNAWAYS
KNTWRNTCRDLDISPRWTRPWRPQTNGKVERFHRTLLDEWAYQKPYTSDH
ERREAFTHWLHWYNYHRPHTGIGGHTPASRGTNLSEQHT
>gid:396905  SCO4211  putative integrase, partial CDS
MGDGCGLRQGEILGVAVDAIDFDSDTLHVVQQLKLSRSKAVFAPPKGGKL
RDVPLPRPVADALRAHTRRFPPVEITLPWKVADGPPVTKRLVFTGPRGGH
VWRTSLNEEAWKPALAAAGVIPAPERGRPYAESRENGMHALRHFYASVLL
DAGENIKALAEYLGHSGPGLTLRVYAHLMPSSRERTSRAVSDVYSKLLHP
EP
>gid:396916  SCO4218  putative small hydrophilic protein
MSAGGEARARLQQMRDKAQELKAASERTSDPDERKRLQEKARRLQSQSEE
ESMERGGDIYPSA
>gid:397145  SCO4316  putative ATP/GTP binding protein
MSTPGHDDPLSKERSHLAASRSALRAMREDVESLDITDVTANWVNAAVLE
SQIEQRIKALADLSETPLFFGRLDYLHAPGAEQAEGGDGERFYVGRRHVH
DADGDPMVIDWRAPVSQPFYRASKKDPQDVSLRRRFGYTGGDLTAYEDEH
LLDPAEAATTSRLLQQEIERPRVGPMRDIVATIQPEQDEIVRAGLAGSVC
VQGGPGTGKTAVGLHRVAYLLYAHRERLARTGTLVIGPNRSFLHYIEQVL
PALGELTVRQATVDDLVAHVEVRGTDEAATAVIKGDARMAEVLRRALYSH
VVPPTEGVVVVRGSRRWRVPVYELEEIVRELLARDIRYGAAREALPQRIA
HAVLVQMERAGEAPDDRVQNAVARNAAVKALVKSVWPQVDPARLVLRLLA
DADFLAEHAEGILTEDEQKAVLWVKPARSVKSAPWSPADAVLIDEATDLI
ERTHSLGHVVLDEAQDLSPMQYRAVGRRCTTGSATVLGDLAQGTTPWATR
SWAQALGHLGKGEAVVEELTAGFRVPTDVIAYASRLLPHIAPGLTPVASI
RENPGFFDIRTAPGGTADVVAACRELLEREGSVGLIAADARVPELAAALA
AAGIGHVGPGEETTRTTRLTLVPASLAKGLEYDYVVLDEPQAVVDGEPDE
RTGLRRLYVALTRAVSGLIVTHATGLPQQLA
>gid:397154  SCO4321  conserved hypothetical protein SCD12A.03c
MSTEEKSAAPRSLAEALRVRDDVSLAALLRSRPDLITPVPTDLTQLATRA
GTRASVVRALERLDRFALQTAEALAVAPDPASYGELLALMGGDEQDPAVA
AALPRAAALLREQALVWGADDRLRLVRTARELLAPSPQHPSPTGLGPTVR
EATAGMSPGRIQDILAAVGLPSTHDAVSAVSALGSLFADRRRMAALLAEL
PWESREVLDRLVWGPPYGQVTHDPAAHLRALLDRGLLLPTAPGTVVLPRE
VALHLRAGRAHRAPEPVPPQVEAAATHRPQVVDATAAGQALAALATVDEL
LKEWDEGGPTVLRAGGLSVRDLKRTAVALDVPEPVAAFWVELAYGAGLIA
SDGEAEERYAATPAYDEWRELPPAERWARLAGTWLTATRTPGVVGGRDAK
DRTLSALGPNLDRSAAPEVRHRVLALLAGLPEGASPVAESVLARLRWERP
LRGPQQQRATGAGAAREDDLRSRIARWTLSEAELLGVTGRGALAAPGRAL
IGAPEAPRPATANDTAGPGGPGDKLPVHHHRTPPVTAPPTPAERAAATAT
AARLLAPLFPEPLDHVLLQADLTAVAPGPLERGLADVLGVLADVESKGGA
TVYRFTPGSVRRALDAGQSAADLHAFLARHSRTPVPQPLTYLIDDVARRH
GRLRVGAASAYVRCDDDATLDEILADKRAAGLGLRRLAPTVLAAQADPAA
LLDGLRAIGFAPAAESAAGDVLIARADSHRTPPRAAPEPVPDGPPAPDDT
LLAAAIRAVRAGDLASTTPRKPGPGDGEGGGESGMPGGPLPRTGAAETLA
TMQAAVLTGEALWIGYVNAEGAASQRVIAPIRVEGGFVTAYDHTADEVRT
YPLHRVTGVAELADDAG
>gid:397189  SCO4340  putative integrase
MARVLGVVRLSRVSDETTSPERQRRSIQRWVDQEGHVVVGWVEDIDVSGG
IEPWKRPEFGKWLPSTIGKEVSAIEHRIAAEESRADEYDIICALKIDRLS
RRVLHVHTLLEWCEKNGKEVATVEDGINLNTQMGKLLLSLIASFAEGELE
AIKARAKSSYNHLVKEGRWRGGRTPYGYREEKQETGDGWKLVPDDYGTDT
AGTLREIVRRLIAGESANSIAQWLNEDVSKTPTSLDAQMIRSGKTPKGSR
WTAANTAKVVRSRCILGQMEVSEEMMVEGRKTTRRRVVRDADGQPLQRAE
PLITHEEWELANKKLDENTSKRNGNRKGGSPLLRVAFCTCGEPAYLGPGR
NWPYYRCASRTTHKPCPTGSKGIAAHTLEDAVGKAFLLAAGDVEIVRKVF
RPGVDYTRDIEEVNRALSDLREDREAGLYSSELGKQEYREAYKRLDARRE
QLIAQPTRPDTWEEIPTGETYRERWSTLSTQHEKGRELRAAGVKAVIHAE
PIPGMTAAQLMAPDGHDGMWQHPVGRVQVLIPMDFKQRLRNMAAVHSEG
>gid:397194  SCO4344  putative transposase
MVAEPVHVRRLTDQEGYELQQIVRRGSTNSVRYRRSMMVLSSAGGNGVPV
IAKLVQADEDTVRNVIHRFNEIGLACLDPRWAGGRPRLLSGDDGDYVVAT
ATTRPARLGQPFTRWSIRKLAAYLRRVHGHVIKIGREALRCLLARRGITF
QRTKTWKESPDPERDAKLDRIEEVLEHFPDRVFAFDEFGPLGIRPTAGSC
WAEQSRPERHPATYHRTHGIRYFHGCWSVGDDTLWGVNRRKKGAANTLAA
LRSIRAARQDGAPIYVILDNLSAHRGDTIRR
>gid:397197  SCO4346   hypothetical protein
MQRYGLKTELVDRLCNGPLEGVTDLEAAQALTRLVHTELENNGTGGGEKL
KNEEMAEALRTLKFLLLRLKIDLKAPFDDFQSFRKYWIREGMGGGGGYAK
RRSYLDGLFYPVREKLDEMEVTASSPTAYRGVDGEIKNIIFAPTGPTKPD
IFLEDALSNIIKVANEDKCLVYNRPLTDAGLTWGDLMAWWTEKNGLEDAS
DYEVAQSLWLQLLESVPSSSPPARALFMTYCRRHISGGVVERNQPALLPE
VYLHFDPLTKIQRGKLGKPRRLVRERMDFLLLLPGGVRIVIEVDGKHHYA
REVPEASRNWKAAPDRYAEMVAEDRALRLKGYEVFRFGGKEIKENDASGL
VGKFFDGLEARFGAKVAAT
>gid:397208  SCO4350  putative integrase
MPVLIDEDLCFEDAAGPRPTMVMSLWLRELPLSGAPSPKSWRTYAQALKS
WAEFLDARRIPVFADRQRLREALSMYAEYRLSGPLEARLSPASWNLAVRT
LSSFYQWAAAEGHAPTVPFSYVRQSMTCPDGARVEVTRNLATVRTGNAHA
TRKYLEKPYADLLMRALAGNDPTGERDVSFRGRETGRNAAVIGLALSSGL
RSQEFTYLTVYEVPPLRRRRTAVPVSLVLAPPTAKGRKGRSTWIGSEDLA
RVHDYIGWERAAAAEGSRWRPRDPLFVEAPTHDGALINGTRRRWHSLTPA
ERLRLVAPGGGSALLAVQAGGKPFVDWATVLRRTAQRIRDRFEPSFPHVH
PHVTRHTFAMATLERLVRGYYQQAAQLVVDAGGDDALALYLTKADPLLVL
RDLLGHTSAVTTQAYLHLLDTQRIYRDAYATAGGALAVDSDVVAEFEDEV
>gid:397211  SCO4351  putative DNA invertase
MQRDTLTAAGCARTFEDKASGKNADRPELRSALDYARAGDTLCVWKLDRF
ARSLIDLVTMVDTLRERGIGFKVLTGALANIDPGTADGRLMLQVVGAMAE
FERSLIKERTRAGLDAAKAQGRTGGRPSVVNEDVLTVARARKAKRESVSA
VAKALGVSRATLYRHLADDS
>gid:397249  SCO4370  putative transposase
MGEGLQVRCGLRHLHRVQEALVVHRNAPLTETGRLRLARCVVEDGWPVRR
AAERFQVSHTTASRWARRYRQLGVTGMSDRSSRPHHQPRRTAAAVEEHVL
RLRREHRIGPLRLAVRCGIAASTAHRILVRHGLPPLAALDRATGEPVRRY
ERARPGELVHIDVKKLGRIPDGGGHKTLGRAEGHRSRTNGAGWAYLHTAL
DDHSRIAYTEDLPDETAPTCAAFLVRATAYFASLGIRIERVLTDNAWAYS
KNTWRNTCRDLDISPRWTRPWRPQTNGKVERFHRTLLDEWAYQKPYTSDH
ERREAFTHWLHWYNYHRPHTGIGGHTPASRGTNLSEQHT
>gid:397267  SCO4377  putative serine-threonine protein kinase
MNHTAEVFQPLQGDDPRTVAGYRLAARLGAGGMGRVYLSHTRGGRPVAIK
VVRSELADDATFRRRFGREITAARRVKGAYTAELIDADPDGTPPWLATLY
VPGPSLAGAVARSGPLPVPAVLWLMAGVAEALQAIHAAGIVHRDLKPANV
LLAADGPRVIDFGISLAADSTAHTATGTTIGTPQYMAPEQASAGAITAAT
DVFSLGQTAAFAALGKPLYGDGPAATVLYRIVHSEPDLSELPERLRDLLG
RCLATAPEERATPAEIVEWCRRELGRDGGEGAGPAGWREIAGPPVTVPPP
AAATGPATAAAPVAAPGPTAVHTTPWTVPEGNVAPGTVPPPWPGAAGPGG
PFVPRRPDGPKERRRRRRNAVLITAAAVVACGLVIAAGTALLDAAGRGLD
RARDAVASASGTPRPVAEGAASAGEAESSTGPGAASSADTSDTGSDGPTA
TTTAPAAHAYYAPMLGNDNSLILHNGKESKDRKGDIRFGCENAGCELKSD
TSVMVENVLGPGATYETCRRLTSDPEASRELLLADKAAGSEICVKHRNGD
IALLVIQVKSTAMREDGFLTFDMTVWPAAG
>gid:397350  SCO4407  hypothetical protein
MPGGRLTQQERQQIALGLADGLAYAEIARRLDRPTSTITREVMRNGGPTA
YRADLAHRATERRAHRRRQAAPRGPQALPPAHGRDAEAVREYEETLTTVF
IQQGTPKMMARVMACLCVSDSGSLTASELVRHLQVSPASVSKAVAFLDEQ
GLVRRERDASRRERYVIDDDVWYQSMVASARGTLLVAETARQGVGVLGSG
TPAAERLENIARFLDFISESIVRAADQAREVLHTAPAAPPEGDATSGSDR
PPRTGRP
>gid:397528  SCO4481  putative serine/threonine protein kinase (fragment)
MLELDGSGAEPLRDGDPRWIGPIPLIGRLGSGGMGRVYLGVHEGRYAAVK
QLLPSVVAEDEDFLRRFGHELDNLARLPGEATAPLLAGDREARPPWFATA
YVPGLTLTEALEVHGGPLPAGALWLLLREAAKGLAAVHALDMVHRDLKPS
NVMLTLEGLTLIDFGVARAAEQSRLTRTGMVVGTPAYMSPEQASGRRASS
GAVDVFALGSVLAYAASGRPPFGDESGHAVLYRIVHEEPDLGPLRELDPE
LADVVASCLDKDHEGRPTAAELLETAERHGPYEPPLWPEALTERLTERAA
FVTKLPERADLPEPEPEPERDPGPGPGSAVLGRRDPGRSDHRRRHRVLFA
VVPVVVAAGATLAVQLLPYTFAPGDRADASPSSSAPVSSAPAASPAGPGK
ASRTATPDQDRSASPSRSPGEQGDRRGDADGGGGAAGGSQGDADAGGAGS
GSGSGSGSAGDGGAGAGSGAGSGDGSGSDSGSGSGADSGSGAGSGSGSGS
GSGSGDGGASGSGVFTLKNAKNGRCLTTAHADAPYDGDCTGDAATWTFRS
RPDGTVWIINGLLGRCIYTAAIGHPVFSFPCGRTTGQEWRVGSGGTLKNA
ATGGCVTVSAMSGTGVRNEACEQSAAQRWTRS
>gid:397545  SCO4487  serine/threonine protein kinase
MSALEPDDPRSVGEYRLLGRLGAGGMGRVFLGRSPGGRLVAVKVVHAELL
RRPEFRDRFRREVQAARMVSGAFTAPVVDADPDAPLPWLVTSYIAGPSLE
QAVAERGPFDPQAVLTLAAGLAEALVSIHAAHLVHRDLKPSNVLLAEDGP
RVIDFGIVRSVDADSLTGSGHMAGSPGFMSPEQVNGDEVTWASDVFCLGA
VLAFAATGTNPFGAGPTPALLYRVVHNAPDVAAVADPALRSLIADCLAKD
PAHRPAPREILARIGPLGGESATALPHAQQWTPAARPTRADAVPTRIVPP
VAAPPAHQHTRVDTSPAQVYPPAPAPADVRPTATGDGGRRSRRAFLFSGA
GALAALGVGTGFWLNRPADPDPAEGSAPAPSPSSAPSPPPGPVGLWPLDE
ASGQVARDTAGGHDGTVTGVAWQGAGEGAAFDGTGSQIVTAGPVLQTGAG
RSFTVAAWVRLSAVPGVFATAVSQDSADASGFYLQYSSEDQGWAFARPGL
RAVGRTAPAAHVWTHLTGVCDGPARKLHLYVNGVQEAVVEDTGPAPATGA
FMIGRASFDGQPRDFFPGAIRDVRAFDRALGPARVAQLAQLG
>gid:397550  SCO4488  putative serine/threonine protein kinase
MSGEAGSVLTGSGAEPLEDDDPRRIGPIPLLGRLGAGGMGRVYLGVHEGR
YAAVKQVLPSVAGEDKDFLRRFGHELDNLARLPEEATAPLLAGDREARPP
WFATAYVPGLTLREAVDLHGPLPAEALWLVLREAATGLAAVHALDMVHRD
LKPSNVMLTLDGLTLIDFGVARAADQSQLTRTGMVVGTPAYMSPEQASGK
RASSGVVDVFALGSVIAYAASGRPPFGDESGHAVLYRIVHEEPDLQPLRD
LDPELADVVASCLDKDHEGRPTAAELVERAAAHGPSAAAPWPQDITERLS
ERAAFAAREPVHPPSGPDAPAPPPPSASAPVVTGGRPEKRERRRRTKVLA
FVIPVTVTGATLGFTLLPYAMNDADQDGNAAAPPAATATAAPGPAASATG
PSASASPSPDGGKKDGQDKDKDKDRKQGQDGDGGAPGGGGSGDAAAAAQG
ASGGGSDSGASGSGGSSGGGGSSGAGGTSDSGTSAGSGSGSGADGDTGAG
SEGGGSASDSFMLKNGSSGKCMYVGTPVDDGACSGDVAVFRFQSTSGGAF
RILNVGTAQCIHSRSANAWLVKDTCGSFGGEWQEGASGSLRNLNTGGCLD
LNRRASMIGLTTTTCTGSDSQRWTRT
>gid:397568  SCO4495  putative DNA polymerase related protein
MATEDDYTAQPFVPDRGGLPALRAAAAECRGCPLHRDATQTVFGAGKASA
RVMLVGEQPGDQEDRQGKPFVGPAGHLLDRALAEAGLDPADAYVTNAVKH
FKFTRAEPRKRRIHKAPTLRETAACGPWLAAELDRVEPELIVVLGATAGK
ALLGSSFRVTRVRGTVLEEEIHGRPQRLVPTVHPSAVLRADDREAAYRGL
LSDLEVAARALA
>gid:397598  SCO4507  putative serine/threonine protein kinase
MRPLEDDEPTVVGPYRLLGRLGSGGMGRVYLGRSAGGRTVAVKIVHPHFA
LDEEFRARFRREVAAARRVGGAWTAPVLDADPEARVPWVATAYAAGPSLT
AAVADGGPLPAHSVRALGAGLGEALAAVHELGLVHRDVKPSNVLLTLDGP
LLIDFGIARATGGTSPTQSGGGTASLTSTGVSIGSPGYMSPEQILGKGVT
GAADVFSLGAVLAYATTGQPPFPGDSSAALLYKVVHEEPNLDGLDDGELR
ELVASCLAKDPSARPAPAEVARRLAPEGAARLVTGGWLPGALVERVSRSA
VRLLNLEAGESATGASGPVGFSRPSVTAGGDAGVATPASGVFGPPPVMPA
PTSPSYPPSPPAPAPAPSPALLSVPGPRHPGPPDDDPGAGTGTTRPPGRL
ALSVAATSTQGAQGRGRRISCTVALAVAGALAAVTVGSVFVLDLLPGSGN
DANNAGGNEASHDPPAATPGGLPARYLGTWEGQAAALDGKLPLGTFRITI
KQAKAGQELGRLRQTDALGGVCVDVITLKKVTEKQLVAGSAGAEGNHDGC
NPAPTTVTFTPVGDDLDYASKSEESGRPTARLSKVG
>gid:397623  SCO4520  conserved hypothetical protein
MSFLKFFSDDVKEMAKALENSGGRMKEASQEMKRADSSQVGHSELQSACD
DFADSWDYGFGQLSKLTKGVSKFADKASEEFLKMDQTLYDELKKSATKPK
K
>gid:397643  SCO4533  hypothetical protein
MGDQEADIDRIRESARALKRIRGTFAERANPAEGYGVGEIGSQKILDAFD
KFGSNWKIHRRQLTDELEKLHGITKAAADSYDKIDHELAEALRRANEKGK
SGKKGGDR
>gid:397726  SCO4577  putative helicase
MGGTGVIGEMAETAQVRDGDSEALATLHRVFGYDAFRGEQEAIVEHVIAG
GDAVVLMPTGGGKSLCYQIPSLVRPGTGIVVSPLIALMQDQVDALRALGV
RAGFMNSTQDFDERRVTEAEFLAGELDLLYLAPERLRLDSTLDLLSRGKI
SLFAIDEAHCVSQWGHDFRPDYLALSLLGERWPDVPRLALTATATDATHR
EITERLHMPAARHFVASFDRPNIQYRVVPKSDPKKQLLSFLREEHPGDAG
IVYCLSRNSVDKTAEFLSRNGVEAVPYHAGLDAGTRAAHQSRFLREEGLV
VVATIAFGMGIDKPDVRFVAHLDLPKSVEGYYQETGRAGRDGLPSTAWMA
YGLNDVIQQRKLIQSGEGDEAFRRRAQSHLDAMLALCETARCRRGQLLAY
FGQDPDGSACGNCDTCLTPPETWDGTVAAQKVLSTVVRLKRERGQKFGAG
QIIDILLGKRTAKVIQFDHDQLSVFGIGEELTEGEWRGVARQLLAQGLLA
VEGEYGTLVLTDTSGEVLRREREVPLRKEPKKPAAAKSAGGGRGERKAKA
AAAELPAELVPAFEALRAWRAEQAREQGVPAYVIFHDATLREIVTVWPTS
VGQLGGISGVGEKKLATYGEGVVEALAGLEGPGSAPASAPAPAPAPSDGP
GTGSGAKAGAVDWPEHEPEPEPDDWI
>gid:397820  SCO4615  integrase
MGKTYDVRIWSVRQRKDRGQTSAELRWKTGGTPHSQTFRTKTLAEGRRAE
LLRAAHAGEPFDESTGAPLSELRQRNDLSWYQHAREYIEMKWQHSPGSTR
RTLAEAMATVTPALVRDTKGMADPRTVRTALYSWAFNVSRRDQEPPDEVA
AVLAWLERKSLPTSALADRMQVRAALDALTKKLDGTTAAASTIRRKRAIF
HNALGYAVDAGRLTDNPLPQVQWKSPEQVAEELDPASVPDPRQALALLDA
VRTQSPRGRRLVAFFGCMYYAAARPAEVIGLRLQDCDLPRRGWGTLRLRE
TRPRSGSAWTDSGEAHDRRGLKHRPRKAVRTVPIPPDLVNLLRWHVTAYG
VAPDGRLFRTQRGGLIQDTGYGEVWAEARSRALTPAQCASLLAKRPYDLR
HAAVSTWLSSGVEPQEVAARAGHSVAVLFRVYAKCLDGGAATANARIERA
LKNGS
>gid:398001  SCO4685  putative DEAD-box RNA helicase
MNRTGMNDRMNDRPARTGKARTRALAVQGEFAHPETLTPALPAAARFADL
DMPAELLAALESQGVTVPFPIQAATLPNSLAGRDVLGRGRTGSGKTLAFG
LALLARTAGRRAEPGQPLGLVLVPTRELAQQVTDALRPYARSVKLRLTAV
VGGMSIGRQASALRGGVEVVVATPGRLKDLIDRGDCRLNQVSVTVLDEAD
QMADMGFMPQVTALLDQVRPEGQRMLFSATLDRNVDLLVRRYLSDPVVHS
VDPSAGAVTTMEHHVLHVHGADKHAATTEIAARDGRVLMFLDTKHAVDRL
TDHLLNSGVRAAALHGGKSQSQRTRTLAQFKTGHVTVLVATNVAARGIHV
DNLDLVVNVDPPSDHKDYLHRGGRTARAGESGSVVTLVTPNQRRAMTRLM
TTAGIVPQTTPVRSGTEALHRVTGAQAPSGIPVVVTAPVAERAERGATSR
GRRRPAPATRRGSVRRSVTDAAA
>gid:398026  SCO4698  putative insertion element IS1652 transposase
MGEGLQVRCGLRHLHRVQEALVVHRNAPLTETGRLRLARCVVEDGWPVRR
AAERFQVSHTTASRWARRYRQLGVTGMSDRSSRPHHQPRRTAAAVEEHVL
RLRREHRIGPLRLAVRCGIAASTAHRILVRHGLPPLAALDRATGEPVRRY
ERARPGELVHIDVKKLGRIPDGGGHKTLGRAEGHRSRTNGAGWAYLHTAL
DDHSRIAYTEDLPDETAPTCAAFLVRATAYFASLGIRIERVLTDNAWAYS
KNTWRNTCRDLDISPRWTRPWRPQTNGKVERFHRTLLDEWAYQKPYTSDH
ERREAFTHWLHWYNYHRPHTGIGGHTPASRGTNLSEQHI
>gid:398246  SCO4772  putative transposase
MAQAGADETGRHVRYTYRLRVSSAARASLAAEWGRCRWLWNECVAKSRAV
HLRNRATGEKATCGPAQLDRMLTEARARTPWLREGSCVPQQQVIRDFGRS
RAKAHKDITEGLPVARRAGMPTWKTRRESPATLNYTRRGFRLKDGRLHLA
GGIVLNVVWSRELPDEPSSVRVYQDSLGHWYCSFVVPAHVQPLPATGRVL
GVDWGVRETATTTSDAHDLPHPSHGGKARARLARYDRMMARRRRKKGTAA
SSGYRAAKRLRAKAYKKVARQRADTGRKWAKKVVRDHDAVAVEDFRPKFL
ARSTMARKAADAAIGATKAALIEMGRKHGRDIRLVHPAYTTMDCAQCGAR
AKHALPLGERTHTCTACGTTSPRDKNSARVMLARAGLNPAGAEGGRPPGA
PLQEAA
>gid:398255  SCO4775  serine/threonine protein kinase
MSEAERAGTSRTDKSARLLAGRYRLGDVLGRGGMGTVWRAEDETLGRTVA
VKELRFPGNIDEEEKRRLITRTLREAKAIARIRNNSAVTVYDVVEEDDRP
WIVMELVEGKSLAEAIREDGLLEPRRAAEVGLAVLDVLRSAHREGILHRD
VKPSNVLIAEDGRVVLTDFGIAQVEGDPSITSTGMLVGAPSYISPERARG
HKPGPAADLWSLGGLLYAAVEGTPPYDRGSAIATLTAVMTENLEEPKNAG
PLRDVIYGLLTKDPDQRLDDAGARAMLNKVIHAPEQTAGTEPVDATKVVP
LPPQPDGRRRRGGGAQGSGGKLGEEAGEKLRGALRSVRKAAVGAGAAGAA
AASRAKPGDGGRSAAGAAGTPTAHGSTVHGSTAQGSTVRHGSTAQGATAH
GPAGTAAAPPAPRAGSASGPGASAGTRPGAVGPGAGAANAAGGARSSGWP
VVPPPDLPARPVPRAPLTDVVPRRTLIIIAVVVALAVLGTVLALTLGGGD
DDKGAEGGNGGKAVASAGASGDTKQDEQSGTGTDGSASDPAANGDQTGAP
DTAASASGETGGESDDAGKSDDDEDVATTHQGGQGYRIGLPEGWKFASTG
SSGDRFTGPRGQKLLVAWTSTPKGDPVADWKSQEQYMVRSNYQKVRIEKV
GYRDWNAADWEFTYTDGGTKYRTVDRGFVVNDHQGYALMYTAKAADWGDD
LRRDTWRTLSKTFEPKS
>gid:398260  SCO4776  putative serine/threonine protein kinase
MDDYAGRVLADRYRLPLPPSDEYELTETRAFDTYSGQEVLVRQVPLPEVV
EAEVLDTDGLPDGFTARDGGVRHSGRRPTAAGGPRSPADPVVRRAVEAAQ
AAAAVPDHPRLDQVFDVFAEGGSLWIASELVAARPLAALLTEQTLTPYRA
AEVASDVLLALRVLHSYGWVHRNITARTVLVCDDGRVMLTGLAVGAAEEA
LCGYDPVPPPPDGEPDGAPERDAVSGAGGGVPRAGVFGPGDADPEAARRA
AIEARGVGGLPVPGTPVPGTAPAGLAPASLDAAADARAARAGAIAAYRAG
ARAAARIQEAQNGRAALPGARPAPDGATQAGGPGTSGGRPGLAPGPGSSY
DDDDGAGRPPHGSAPGGARPGHGVEAAGGAPGARPGQGSGRAAFAAGSGQ
APYAIGPDAASGAPWDEDSVGDPATSGAAVPPGQITDPYGVGTTAWHGAT
PRTPGDPASSAARPTGDPAFPAPRPAGEPAAGSGPAAGRALPGARPEDTA
GSAVPWRETTAGPAARSRETASEAAAGPWRAAPPRAADPEREGGPRPAAG
WDDLAGRAPGQRNPATALAAERARQTRMAVVGPVTERWAPEQAGPVHENW
QLAAPIGPATDLWALGALLFRAVQGHAPYPEESTAELVQIVCAEPPAFAE
ECGPLRPVVESLLRQDPTERIDFEELSGWLRSLVRSAPEPEAGLHVISAP
PVETGRLPIVRRRGELVRRRRAGLPAHHGRHKRGRAETDRSPRSLGRTLL
LLILLAMAGAVAYAMFFMPKNDTNGAGAPDRTGAAGEASQAPPAKSPDAG
GESRPDRSSPAPDPGASRTAGGSTEPQSNVADGFTLRKDAEGFRVAVAEG
WNRTPRNGSGQVVYGKGDFELIVVPGRDRASEFGDDPMAYQRDSEPELAS
YRASSWATSSGLKTIEVGGRTMAEGQFTWTGAGGELYVRNVAVLIDGRYH
VVQVRGPEGERDEVTRLFEQASATYQYTR
>gid:398265  SCO4777  protein serine/threonine kinase
MNQMQGRLVAGRYRLGEAIGSGGMGRVWRAHDEVLHRTVAIKELTAALYV
SESDQAILLARTRGEARAAARINHSAVVTVHDVLEHDGRPWIVMELVEGR
SLADAVKEEERVDPREAARVGLWVLRALRAAHTAGVLHRDVKPGNVLLAD
DGRVLLTDFGIAQIEGDSTITRTGEVVGSVDYLAPERVRGHDPGPSSDLW
ALGATLYTAVEGRSPFRRTSPLTTMQAVVEEEATEPRYAGALAPVISALL
RKDPAERPDATEAEHLLAQAAEGRRPDAAQAYVPTTRYEGPPRAGDTAVQ
GMPAGGSGATPYPQAGGSGATPYPPTTGPTGAGPAPTGHTRGGYTQVGHT
EGGYTAPGYAPAPAVGGAAPGRARRVRMRTLALVVAVAALIGAGTAVVLH
QRDEGGSSAGADPTQGPTGSAAPSPSPSASATTGKGPGGSVPADWTRRDD
PLGFSLYLPENWQRRDFDGDSGELRQIDYTPDGGNHVLRISVDTAPDYND
PYAHQQDLDAQLLQRLVDYRRVKLERAGYRDRDSARWEYTWTALAKDTEF
PGPRRAVSQMYMSRDGVEYALNMTGPAGDWPTTQRRFKAVLQGWQEETG
>gid:398269  SCO4778  serine/threonine protein kinase
MGRMVTEGAGGRVIAGRYRLHERLGRGGMGIVWRATDQLLAREVAVKALP
LDESLSAAEARRRRERTLREARAVAQLRHPHVIVVHDVVEDDGRAYMVME
LVDGGSLADRVLTRGPVDAVEAARIGVALLDALDTAHASGILHRDVKPSN
VLVADDGRVVLTDFGVAQVAGATTLTESGSFVGSPEYTAPERMSGAGTGP
ESDLWSLGVLLCAVLSGASPFHRDSLGGVLHAVVTEEIRPPAQAGPLLPV
VRGLLERDPRRRLDAASAQRMLRAFLSTGRTPATPEEATAARPLRARRSV
KRSVLLAVLLVVAAAGAGVSAAALFADGGDDGGGTPTSSVPATPTSTPPT
STPPTSTPTTATPTTANPGPPAASGSGNGT
>gid:398275  SCO4779  serine/threonine protein kinase
MPHPGNPYATPTQVVPPRSQTPPSTPAAAPPAAPPATPSSTPAAAVPDPG
AGRLIAGRYRLIAKLGHGGMGTVWRAKDETVDREVAVKEPRVPDHLPGRE
RANAFERMRREARAAARLDHPAVVDVHDVAVVDDQPWIVMELVRGRSLGD
ALQEGTLSAREAARIGLEVLGALEAAHAAGVLHRDVKPDNVLLGRHDRVV
LTDFGIAQIEGETSLTDTGGFVGSPEYIAPERVLGQRPGPASDLWSLGVV
LYAATEGVSPFRRSNTPATLQSVLNATPAPPASAQGPLAEVITGLLDKDP
ARRPDAARVRALLAAAVNPPAPPPTQVVRVDGPEGPGAAPASRWSVRLGR
NAWIALGSVVVAAAVASYLVLADPFAGPLPDGWKTPHEKQLAATLAVPEN
YKRTVPEEAGQHWVTYTDESGAVWIGLTLDKKREDTLGNIAGSAAAEMYD
DDGKYKESGAYDLAMPENPKTAPRETEYRGGKSAENTVVYTTTDSQNPRP
RELRIFYYRTSAGDMYKLTVSYPGKGDFTARGREVASTAVANLDIDGT
>gid:398314  SCO4797  putative ATP-dependent DNA helicase II
MSSLFDDSFLASLQTPRGHEEEPPPPPEDDHGPEPVPHDLFGGKFDAPPQ
RDTHYRDGAPRPALDAAALLEGLNENQRAAVVHSGSPLLIVAGAGSGKTR
VLTHRIAHLLAERNVHPGQILAITFTNKAAGEMKERVEQLVGPRANAMWV
MTFHSACVRILRRESKKLGFTSSFSIYDAADSKRLMALVCRDLDLDPKRF
PPKSFSAKISNLKNELIDEEDFAAQAADGFEKTLAQAYAMYQSRLREANA
LDFDDLIMTTVNLLRAFPDVAEHYRRRFRHVLVDEYQDTNHAQYALVREL
VGVPSEHPVDVPPEAEVPPAELCVVGDADQSIYAFRGATIRNILQFEEDY
PDATTILLEQNYRSTQTILSAANAVIERNESRRPKNLWTNQGSGAQITGY
VADTEHDEAQFVADEIDRLTDAGEAKAGDVAVFYRTNAQSRVFEEVFIRT
GLPYKVVGGVRFYERKEVRDVLAYLRVLANPEDSVPLRRILNVPKRGIGD
RAEAMIDALSQREKISFPQALRRVDEAYGMAARSANAVKRFNTLMEDLRT
IVESGAGPATVLEAILERTGYLAELQASTDPQDETRIENLQELAAVALEF
EQERAEGEEAGTLADFLEKVALVADSDQIPDEEDGDGVVTLMTLHTAKGL
EFPVVFLTGMEDGVFPHMRALGQAKELEEERRLAYVGITRARERLYLTRS
TLRSAWGQPSYNPPSRFLEEIPAPHLEWKRTGANGPAPSAPVSGVAASLS
SSRSRSRSSASGASGFATGRSAGAEQPTVSLAVGDRVTHDQFGLGTVVGV
KGSGSNAEATIDFGDTKPKRLLLRYAPVEKL
>gid:398360  SCO4820  putative serine/threonine protein kinase
MTEPYAVPVPRGYRVGVWEVHAPIATGAFGSVYAARRTGGDDTKASPTRP
GGDDNTKAPPTRPGGDGTGHRPSRPGTTDTAGTDDSHGTGTGTGTGTPSR
AQTDTAHPAEGDTDDPGHTSGNGTTGTDGTRRSHGTGTGTPSRAHTDTAH
PAEGDTDNPGHTSGNGTTGTDGTGRSHGTGTGTGTHNPSRVQTGAAHPAK
GDTDNPGRTGGDGTARRPGRAGVVGAGRGGDAGGEVPDTVALKFLPTGTG
TPRQLAHLRDLVEREVELLRRLRRPRLIRMYETLTVDDPAHPRLDGATVL
VLERAEGSLSALLAATPRPPAGPALLAQVCEGLQQLHRAGWVHGDLKPAN
VLLMADGSARLADFNMAAELEGTHAYTPAFATPDYTPPELLWSEIGERGR
RIRPSADVWAFGVLAHLVLTGSFPLPGGTPTARRDAAVAYARGGHELRLS
PEPPPGWREIVRACLTRTHADRIGTDALLRRVTGTTEGASGGAGFSPRTR
ARPRRRVLAALAAGLLALAALGYGVARWAGDGREPAAGPRPGGTGSVAAA
SYGAAELRTDRDVPPAYRLLIVETAHDCDREEVSPALIAAMLKVESDFDP
DLADPAKDEYGIARWTPSVLRWWMNEDGTPGETVPQPPFPPAESVPAMGR
YLCWIAPRLDAGLPGDRSVLVAVAYRTSYRKVNDAGGVPPKYRDYADRVA
HHLKEYTPRRGK
>gid:398587  SCO4911  putative bifunctional protein
MHNDDGINGAGGGADGHPEDRPPVPHTVIEGRYELLEPIGSGGMGEVWKA
HDRRLRRFVAVKGLLDRRAMTPDTQKAAMQRARREAEALAKIEHQNVVTV
HDQIETADQVWIVMKLLEGRSLADLLSRDRVLGVPRAAEIGLQMAQGLRA
VHEASVLHRDVKPGNVLVRDGGQVVLVDFGIATFEGADRVTRHGGIIGTP
PYLAPELFAPAAPGPTSASDLWALGVTLYEMVEGRLPFGGNEVWEVQANI
QQAPDPVLRYAGPLGPVIQGLLTTDPDRRLDAAGAEEMLRDVLADPGGPT
PARAATPAHPPTARPTPPASGPVPAVTAAVPAASSSGPVPSAAVPSEPSP
VASGGGRGRLSGWKVAAAAACVVLLAGGGWLVSQGDSGGKQDDASGQQGG
GAAGGDAEQAWEQWKSARKRLTIGAKEDQPGLSFYNKDTGVWSGFDVDIA
YALAGKLGYGKAQVDFYGVTTANRASKLKNGEVDLVVASYSMTPEREKRD
GISFVGPYYKAGSSLLVRKNSAKYDLGEAVDVKRNRVEVCTARDSTYADR
LEEDGYTTGKWQPDTYKECVERLLDKRSSVYAVASDDVLLAGYAQNDPAH
LKLLPSGAGTEPYGVAMRKDDPLLKSKVCSGLREILAGKEWAEMYMKDLS
PLTGRKTAPSRPEPRPCSAE
>gid:398746  SCO4969  putative regulatory protein
MGLLPSQISGCGHRGQERNSSIFPAGADPPPQGLLVAGHLTRAQPGKDPG
MGRKGPWDAWGHGGARGDPAGHRCRRRGRDMRDSHRAEAEGLLRRAVEEE
VRRSGGRTDGNVLLSRARAALDAMAGTAGEEYSAYTHALDEAAAGQLTFR
QRYAREGAGTPLLVAAVAAVAAAVADVALGTGTGTAVGAGVTVGVVGAAA
TVVKVVGAHVPAAHHRAGAVSQPGGPEQLRLQWLTALEVRGIRPFLDQQR
MLAASTGPAKTGPKLRGADKSAAARGRSALEQSFGQLPEPGDAFAGRRAE
MARLRQWVQAARASTETKPTVVVLHGAPGSGRTTLAVRAVHELRDYFRGA
CLVDLRGAGSAAGGSTAGGGGSGGPGPAGSVEGGESPLSTRDALLHLLNR
LGAPREQLLFRERSSADQQVKRLSELYHQHLTGVPVTVVLDDASDPEQVC
TLVPERSDSLVLVTARQALRLPPDLPARVYDLPVEALDAAGAEELLGAAA
EDASGPYDAESSEQIRQLCGGLPLALRVAGSSLGPRSPRALATDLAAYGP
VEPVERALWLRYTDQSEPARRLLRRLALAGRTSLGAAAAAALLATDETEA
NRQLAALSRAGLLDHVRGSRYRLHDVVRAFARARLADEEEPGERTAAQER
LIANYAELADSVLRLVDGNMSTRSNQFGQYGFTSLDEALRWLDDESSFIT
ATLRHAEGVDQGTVLNLLGALCDYCLLRGDLYRLGEISELAQSVDQGLLV
RSVQWRTGIAARQLGELDKARTTLASVVDLYRDAHHDAGAARALCSLGIT
LHHQGNLTDAAAKLQEALGLQAAPELATDRAWTLHALAAVQRDRARLAEA
LELLTESLVLHRAGESVHGQAWAHFQLGQLHLRMGDVPGAESDLRVALDL
YGRTQDARGEAWALTQLARARLVDGDVSAAVEGLRRAETRHRENADARGK
AWSIYYLGQALEETGALDQSVRALERSRTMFSRMRDVYGLACARHHSARV
TRDQRAAQTGSLRNSGFARQLLVDARADFQRIGVGHGEAWTCLELAVVDA
GNARTPQALALCDEAAALFAGYGDRRGEDWARFLRCTLLPYAAPGGVEIG
TAVAQEELTQLSRARHPARDTKLDDYVDAYQLLLERGVQLEAGWQAWRLG
MVPGRQAREVMGVAVTA
>gid:398855  SCO5016  putative integral membrane protein
MGEVAAVVSAITAVAGLLLAVFGIPLVGNSSPIGRSVADSGPTSGAAPAA
SHSTADAGSEGASASASASASASASGSSKVPSSTAPSGSAPTGQPPAGGD
EPRSSASASSCSSSLPQGWRRVEVPALTVCFGRPGGWGEKPAGELQSSWG
SPDGVYDLTVKRDRTYGTTARAASAGQLAWYRDTSESSMAGVEVTTHKTR
QNGRDALWLEIDYHWEKQSAPRKRLEVFVAGKAGYVYQLLVDTEATSQRL
AEQSRLFASARKNLLIDVSTG
>gid:398916  SCO5045  conserved hypothetical protein
MDLHKHAPAAAPRTSELRASDADRDRVADMLREALAEGRLTADEHAERVE
GVLAAKTVGELDVFVRDLPAAHRGRETTAPAPHRPTAGAIPAEPDENVVA
VFSSAVRKGRWRANRRIHAYAVFGSIEIDLSEAVFEYQQVVIKAFSVFGS
VEVRVPENVSVRGAGGSVLGSFEVHTLDSSEAEAPVIYVDGWAVLGSVEA
RPKRGKVVADILDRVHRRVEKGLRKHV
>gid:398933  SCO5055  putative exoribonuclease
MTSEVEQPTAMSEALGYEQARDELIEVVRRLEAGGTTLEESLALWERGEE
LAEVCRRRLDGARARLDAALAEEADPEDGASGADGGGA
>gid:398934  SCO5056  putative exoribonuclease large subunit
MAANSTPEGPLPVGEVSRLIGGWIDRLGAVWVEGQITQLSRRPGAGVVFL
TLRDPSYDISVSVTCYRQVFDAVADVVGEGARVVVHAKPEWYAPRGQLSL
RAAEIKPVGVGELLARLEQLKKALAREGLFAAERKQPLPFLPQLIGLVCG
RASAAERDVLENARHRWPAVRFEVRNVPVQGVHAVPQVVQAVKELDARDD
VDVIVVARGGGSVEDLLPFSDEQLVRAVAACRTPVVSAIGHEPDSPLLDL
VADLRASTPTDAAKKVVPDVGEEYERVRLLRDRARRCVAAFVDREERGLA
HALARPSIQDPHRMIEERAEQVTALLDRGRRSLRHHLDRADSELTHTHAR
VVALSPAATLKRGYAVLQRADGHAVRDPGEVEPGETLRARVSEGDFSVRV
DA
>gid:398952  SCO5064  putative bifunctional protein
MVPILPAVVSLGGEWLPRGSVDQQGAVNARRGPVWAVERCGEGQRGAGSV
GRGERDTVARGTRLYEAAQAVVGDHARAVEAVRAALKPIHDEAVKRELDA
IPVARLQDVTEGRLRLGSVEKSGLRTLGRVLEAGPYRLRQIPGVGQRTVD
QILAAARRLSEAVHETVAVHIDVDRPEPRTTALVMALHVLVEAGPEARRA
VDKATVLTERLGPLLADAGPAAGRVRMLLAGREKKARAWAAVAATRSLVD
EAEQAGLPGLLAQASVDLLRGTASDVAAWVDFELRSAEYYSLLAEISGRP
PDAAAVEGFLPDEVAERVRTQHLDDTHRRVSLRGYQAFGARFALAQRKVI
LGDEMGLGKTIQAIAVLAHLAAEGQSHFMVVCPASVLVNWTREIEARSAL
RVMVLHGPDRHYAFADWKGRGGVGVTTFDALRGFPAPGGGEVGLLVVDEA
HSVKNPKAKRSQAVGLWAERCERTLFMTGTPMENRVAEFRNLVQMLDGDV
ADSLGERDALAGSVTFRKAVAPVYLRRNQEDVLTELPSLQQTDEWEELSA
SDEEAYREAVRAGNFMAMRRAAYMSSGNSAKLERLREIVQEAGENGQKTV
VFSNFKDVLAVVKEALAVETTRVTPVFGPLTGGVPAQRRQEIVDDFAGVQ
GPAVLLGQIQAAGVGLNMQAASVVVICEPQIKPTIEHQAVARAHRMGQVR
PVRVHRLLATGGVDERMVKMLEAKTRLFDAYARRSAVAEATPDAVDVSDT
ELARRIVEEEQARLGMTDERPTSSEWGVPRSAASPQDPLNGRRESTNANE
>gid:399064  SCO5102  putative mutT-like protein
MTERIVVGAALLDGGRLLAARRSAPAELAGRWELPGGKVEPGETPEAALV
RELREELGVAAEAGGRVPGQWPLRAPFVLQVWTARLRPGSAAPAPLEDHD
ELRWLTPGQIWDVPWLDQDVPAVERVLAHLGLEAGTGRNGTGPG
>gid:399087  SCO5109  hypothetical protein SCBAC31E11.05c
MPPAVHRRAPANRCTSLPSDWCDARHSQAHERSPPRPAAHCRVRRKRGHV
LSVHDDLSSVQRSLDELSRTVTRLEQQLGSGDLEVRRVRTDADHLRESVA
LLRAATAAPQAPRRPDLVPIPDTPYDGSLWTDSDDEGLGARDRRAP
>gid:399192  SCO5143  DNA-3-methyladenine glycosylase I
MSAGEAVAGPDGALRCPWALSTADYVTYHDDEWGRPVHGDDALYERLSLE
AFQSGLSWITILRRRTGFRSAFAGFEIAKVAAFTDTDRERLLADTGIIRN
RAKIDATLTNARVLAEWAPGDLDELIWSHAPDPAGRPAPKTLTDVPAVTP
ESTALSKALKKRGLRFVGPTTAYALMQACGLVDDHLAACVARRP
>gid:399241  SCO5166  putative helicase
MTLPVALSGKDVIGQAKTGTGKTLGFGLPLLERVTVPADVEAGRAAPESL
TDAPQALVVVPTRELCQQVTNDLLTAGKVRNVRVLAIYGGRAYEPQVEAL
KQGVDVIVGTPGRLLDLAGQNKLSLKHIKSLVLDEADEMLDLGFLPDVEK
IINMLPARRQTMLFSATMPGAVIGLARRYMSQPTHISATSPDDEGATVAN
TKQFIYRAHNMDKPEMVARILQADGRGLAMVFCRTKRTAADLADQLKQRG
FASGAVHGDLGQGAREQALRAFRNGKVDVLVCTDVAARGIDVEGVTHVIN
YQSPEEEKTYLHRIGRTGRAGAKGTAITLVDWDDIPRWQLINKALDLGFN
DPPETYSTSPHLYTDLGIAEGTKGVLPRSERTRAGLDAEELEDLGEPGGR
GGRGRGDRGDRGGRGGRDDSRSGDRERDRSSRTTPRRRRRMRGGAPVDAE
ASVAPAAETVTVTDGAAGATDAGTDTDKAPRTLRRRRRTRSGEPSRRQET
AATLGTDGPAAQAAEDAALAPSEAPVVETVAAEPVAAPEAVEAPAKPRRR
TRTRKAAETPAAPLETAPTAAPTATEESIVAPAVVETPTVEAPAEKPRRR
TRKATAAEATVETAEGAAEPAPEPAETKPRRRTRKAAEPAAQAPDAEAEA
EAKPRRTRKTTATKTAAAKAEAAADTAEAAEAKPKARRTRKAAEPAVQAA
PEGEAEAEAKPRRTRKTTATKTAAAKAEAAADTAEAAEAKPKARRTRKAA
EPAVQAAPEGEAETEAKPRRTRKTTATKTAAAKAEAAAKAEAAADTAEAA
EAKPKARRARKTAAAVEATAEIPAQASQEPEAAPRRRTRKAAVAVEAPAA
GADTAEAKPKARRTRKTAAAAQPAEAGES
>gid:399304  SCO5182  conserved hypothetical protein
MVACRVRLGRHQGARAGRTVCPKHRLRARDAGSRECAALSAGCGRMGRMS
EQSLPDGVRPQNADALPEYAERVLDMTELIPPGRVMTYGDVAEYLEEGGP
RQVGRVMSLYGGGVPWWRVVRADGVLLAGHELEALDRYREEGTPLKEASR
AAEGHLPRLDLKRARWDGEGRPQGHGGRAQGHT
>gid:399305  SCO5183  putative ATP-dependent DNA helicase
MSSSSSSGHPSHPQVRRGSRGAYRLVRTPPARTDPPRLDAAQRAVVDHRS
GPLLVLAGPGTGKTTTLVESVADRIARGGDPERILVLTFSRKAAVELRDR
MALRIGAARAPRATTFHSFGYALVRAHQDSDLFVEPLRLLSGPEQDVTVR
ELLAGQVDLERLGLAHVRWPDELRACLTTRGFADEVRAVLARSRELGLGP
DALAAFARRIGRPDWRAAAVFLAEYLDVLDLQGVLDYAELVHRAVLLARR
PEVAAHLTAQYDAVYVDEYQDTDPAQVRLLHALADGGRTLVAFGDPDQSI
YAFRGADVNGILDFPTAFPRADGRPAPVGVLRTARRSGSGLLAATRLLTQ
RMPLTRLPADKVRAHRELTPVRGGGRVEAYTYPTAGTELDNIADILRRAH
LEDGVPWRDMAVLVRAGSRTIPTLRRALTSAGVPLDIDGDDLPLRHEPAV
APLLTALRAVAEAEADTGARTGERFGALGSEADADPRADADADADADADA
YVDADADAYVDEGGDAATKLSRGEHAYRADEPCWLSTETALTLLTSPLAG
MDAADLRRLGRALRDERRAAGNPLPPPSDELLAQALAEPERLAVHDPAYA
RGAQRLGALLRKARERLAGGGSAEEALWDLWDGTPWPTRLERSARRGGAA
GRNADRDLDAVCALFATAARAEERTGGRGALNFLEEIDAEDIAADTLARR
AVRPDAVRLMTAHRSKGLEWRLVVVAGVQEGLWPDLRRRGSLLEADRIGR
DGLAEPLSPGALLAEERRLFYVAATRARERLVVTAVKAPADDGDQPSRFL
TELGVEPADVTGRPRRPLAVAALVAELRATTVDPRVSEPLREAAARRLAR
LAALTDEDGRPLVPSAHPYRWWGMWDPTESKVPLRDRDQPVTLSGSALDQ
LANTCALQWFLGREVKADAPATTAQGFGNVVHVLADEVASGHTPADLAVL
MERLDSVWNALAFDAPWKSAQEKANARVALERFLKWHVMDRAGRTPVASE
HDFDVTLEAGEYEVRIRGQMDRVETDADGRAYVVDFKTGKQAPTSSEVAR
HPQLAVYQLAVREGAVDEAFDGVRPQPGGAELVHLRQGAPKRDGGETLPK
VQAQESQEGPEGEWVGDLLATAAGKVLDERFTPTAGQHCTHCSFRASCSA
RPEGRHVVE
>gid:399309  SCO5184  putative ATP-dependent DNA helicase
MWPDRSRPGPPEPPRRAAGVTRRTTRADLHFPPTREDIRHQLSVAAASLS
DVPAHITDPEQLKELLGIPFTPEQTACIVAPPAPQVIVAGAGSGKTTVMA
ARVVWLVGTGQVAPEQVLGLTFTNKAAGELAERVRTALIRAGVTDPDVID
PDHPPGEPAISTYHAFAGRLLTDHGLRLGLEPTSRLLADATRYQLAARVL
REAPGPYPALTRSFADLVSDLLALDGELAEHLVRPEDLRAWDAGLLDTLD
GVRLSNADLRKVPEAAAARRELADLVIRYRAAKRRQDLLDFGDQIALSAQ
LAGLPEVGRVLRDDYRVVLLDEYQDTSVAQRILLAGLFGGGTGHPVTAVG
DPCQAIYGWRGASVANLDDFPEHFAHADGRPAARQALSENRRSGGRLLDL
ANGLAEPLRAMHAGVEALRPAPGAERDGMVRCALLSTHAEEIDWIADSVA
HLVRTGKAPGEIAVLCRTATDFAEIQGALVARDVPVEVVGLSGLLHLPEV
ADLVAVCEVLQDPGANASLVRLLSGPRWRVGPRDLALLGRRARLLVSHAR
VEGDDDPDRRLAEAVEGVDPSEVISLADALDTFLETPLDGTGDDDGLPFS
ADARVRFARLATELRELRRALSDPLMDVLHRVLAVTGLEVELSASPHALA
ARRRETLSNFLDVAASFAAGDGEATLLAFLGFLRTAAQYEKGLDNALPGG
ENTVKVLTAHKSKGLEWDVVAVPGLVTGTFPSGQGREKWTAQGKVLPHGL
RGDAETLPDVASWDARGLKAFHEAMKDHQHTEELRLGYVTFTRPRSLLLG
SGHWWGPSQKKPRGPSDFLTALYEHCAAGHGEIEVWADEPAEDEENPALH
RATADEVWPLPLDDAALARRRSAAETVLAHLDGLAAREDGPPSAPATYED
PDWPPPPEDDEGLPEEAEPDRAGDPAHWDSWTTERPAAAREAATAPESPA
APQAGPTVPHQAPPPEQPPGTAPVPAPARLTPEEARTVASWDRDLDALTG
ELLRARESVTEVPLPASLTASQLLSLAADPDGFAQELARPMPRPPQPAAR
RGTRFHAWVEARFEPLTLPLLEPEELPGGDAEIADDHDLEFLKDAFERTE
YARRTPFRVEAPFQLSLAGRVVRGRIDAVYKEGDGDTATYEIVDWKTNRA
ATADPLQLALYRIAWAEQQHVPPASVTAAFLYVRTGEVVRPEGLPDRAAL
EKLLLAEPVGDEPHDRGVRAGR
>gid:399314  SCO5186  conserved hypothetical protein
MEAPVTTWTDHTADRPISLTAPSGVDRAAHHRLDEAWLAAAWSHPSTRCF
VVSGGQVLIDETPDGATELVMTPSFEAPLTEAHRYFLGTDEDGTSYFALQ
KDSLPGRMDQSARPAGLREAGLLLSPRDAGLMAHAVALENWQRLHRFCSR
CGERTVIAAAGHIRRCPACGAEHYPRTDPAVIMAVTDGEDRILLGRQVHW
ARGPLLDPRRFRGARRVHRAVGAPRGPGGGRRHRRPGRVRREPAVAFPSS
LMLGFMAHATSTEIDVDGDEIHEARWFSREELGAAFESGEVLPPYGISIA
ARLIELWYGRPLPTRSAF
>gid:399318  SCO5188  putative ATP-dependent DNA helicase
MRSHGVNVRGGTTIPVAHSHRVMPVALIRLSVRACGQPTHPSRETWQHGG
VTAATHSTLFPQVPDSADAVLEGLDLEQREVATALRGPVCVLAGAGTGKT
RAITHRIAYGVRAGILQPSSVLAVTFTNRAAGEMRGRLRQLGASGVQART
FHSAALRQLQYFWPKAIGGSLPRLVDRKIQLVADAAAACRIRLDRGELRD
VTAEIEWSKVTQTVPADYAPAAAKAGREAPRDPAEIAQLYAAYEDLKRAR
SVIDFEDVLLLTVAVLQDRHDVAEQVRAQYQHFVVDEYQDVSPLQQRLLE
LWLGDRDDLCVVGDASQTIYSFTGATPDHLLDFRGRHPGATVVKLVRDYR
STPQVVHLANGLLAQARGRAADHRLELVSQRPAGPEPGYAEYSDEPAEAE
GAARRIRELIDSGVPAAEIAVLFRTNSQSETYEQALADAGVPYQLRGAER
FFDRPEVRKAGSALRAAARFGGNDSLLDDAVDLPSQVRAVLSGEGWTTQP
PAGSGAVRERWESLAALVGLAQDFAAARAGATLSDLVVELDERAGAQHAP
TVQGVTLASLHSAKGLEWDVVFLVGVAEGMMPITYAKTDEQIEEERRLLY
VGVTRARERLHLSWALARSPGGRPNRRPSRFLKGLRPGSGTAGGQAAAAG
AGGVERGIRGGGGGGAAAAPRRTQRTPARCRVCGRTLTDAGEMKLMRCEG
CPSDMDEGVYERLREWRAVQAGRSGQPAFCVFTDKTLMAIAESVPEDERE
LARIPGVGMRKLNRYGTDVLAICAGQEGVGLDEDD
>gid:399339  SCO5198  hypothetical protein
MSLHDDAVLVLKAYEGQDELRQVYLDHLATHPDGMWKACADGHVTASALV
IDPSRERVLLTLHKKLRMWLQMGGHCEPVDETLARAALREGTEESGIAGL
ALLAGGPVRLDRHHTPCAWHLDVQYAAVAPPGAVEAISDESLDLRWFPYA
EVADVADDSVVRLLEATRARL
>gid:399598  SCO5308  conserved hypothetical protein
MAPMTEVEGRRVALSNLDKVLYPAAGLTKGELLHYYATTAEVLLPHLRDR
AVSFLRYPDGPDGQVFFTKNVPPGTPDWVTTAEVPRSEGPARMVLVQDLA
SLMWAANLVTEFHTHQWTVDDPGEADRLVFDLDPGPPATVVQCCEVALWL
RERLAADGIEAYAKTSGAKGLHLLAGVRGASSERVSEYAKGLAVEAERAL
PELVVHRMTRSLRPGKVFVDWSQNAARKTTAAPYTVRARGVPAVSTPVTW
EEVAGCGRAERFVFLTPDVGRRVRDHGDLLAPLFDRRRAAALP
>gid:399604  SCO5312  conserved hypothetical protein
MTTTPRFDPRVLLAESRLGVLATIKSDGRPQLSPVMPAYDPEAGVIRVST
REGLAKTANLRRDPRAALEVTAPDGRSWATAEGVATLTGPGADPHGPEVE
ALVEYYRAAAGEHPDWDEYRSTMVSDRRVLLTITVERVYGADIG
>gid:399688  SCO5349  putative integrase
MKSLDVKVWGVRKRNTKKSSYDVRWTVAGNVFSEQFRTKGLADHYRSKLL
RAAHGGEEFDTVTGLPDSMVEKAASMSWYAFALRYLAMKWPHAAPNTRNG
INESLTAVTMTLLDDRPGRPPEELIRRALRNWAFVLPGPDDRELPDEIAH
ALHWVSKASRPLADLGDAAIARAVLDALKLKLDGTAAAAETVRRKRRTLV
NALHYAVDLGEFKENPITGIRWKKPKVAGEVDPRVVANPEQARSLLTAVS
YVGGYGRARGRRLVGLFACMYYGAFRPAEAVGLTAADLKLPETGWGTALL
NRTRPSAGKQWTDSGETHDDRGLKNRPAEEVRLVPIPPQLVAILRRHLDT
FGTAEDGRLFTNERGGVVGSSTYYRVWQEARAFALLPAAVASPLAARPYD
LRHSALSTWLNAGVDPTEVAARAGNSVEVLLSRYAKCIDGRQEVVNRKIE
ELLREYE
>gid:399817  SCO5388  conserved hypothetical protein
MGEVGQRTPVPGYGRIASGPARRVALPTMRLVIARCSVDYAGRLTAHLPS
APRLILVKADGSVSIHADDRAYKPLNWMSPPCALKEGTGEEEGVWTVVNK
AGEKLIITMEEILHDSSHELGVDPGLIKDGVEAHLQELLADRIDTLGEGY
TLIRREYMTAIGPVDILCRDAQGGTVAVEIKRRGEIDGVEQLTRYLELLN
RDPHLAPVRGVFAAQEIKPQARVLATDRGIGCQVLDYDALRGIEDDKLRL
F
>gid:399832  SCO5396  putative cellulose-binding protein
MSDTSPYGFELVRRGYDRAQVDERISKLVSDRDSALARITALEKRIEELH
LETQNAQAQVNDAEPSYAGLGARVEKILRLAEEEAKDLREEARRAAEQHR
ELAESSAQQVRNDAESYAAERKAKAEDEGVRIVEKAKGDASQLRSEAQKD
AQSKRDEADALFEETRAKAAQAAADFETNLAKRREQSERDLASRQAKAEK
RLAEIEHRAEQLRLEAEKLRTDAERRARQTVETAQRQSEDIVADANAKAD
RIRSESERELAALTNRRDSINAQLTNVREMLASLTGAAVAAAPSVEDESV
SRGVPAQQSR
>gid:399865  SCO5411  putative integrase/recombinase
MEHMTEQRAVIAIYLRLSRESDDSTSLETQRSFAHRWLLAHGYDPADAVE
YIDASVSGAKPLEARKGMAALMAARPTVVVAWKLDRFARSVSDFLRLVAW
AEAHGASIATTDNAIDTTTATGRMVATVLAALAEWERNAIASRITDGHAT
RRSQGRWSSGRPPFGYRIERRDGAAYLAIDDDQAAKIRKAVAALVNGSTV
AATARLTGLSEPQWRRLLKSPTLRGQRAHKGELVCEVDGITPVKFAEPIL
SAAELLKVRERMLALATGQDRAPRRATPMCSGMAFCHRCAGKLNGGTSDK
GVPLYRCKAGHVTIYAETLDSRVESEFLTTFGSYAETVIRLEGGNDLSAE
LLEAKEQAERLAARMATAGPLMLGTLESLAQDLEDTHARLRAAHDPDVRE
VQVATGRTMAQAWEMYDVPDRSRLLAGMGLRVSLHPRQLADRLTITWGPT
PEEAPEAEWDGLALSEAL
>gid:400093  SCO5494  putative DNA ligase
MAGDKQGDKQAETTSVPAEARERHAQLAEQIEEHRFRYYVNDAPVVSDAE
FDRLLRTLEELEERHPELRTPESPTQKVAGAYATEFTAVQHPTRMLSLDN
TFNDDELAAWFERIARELGEQEYHFLCELKVDGLAVNLTYERGRLVRAAT
RGDGRTGEDITPNVRTIAEIPDRLAGDKVPDLVEIRGEVYFPMEKFQELN
ARLNEAGDKPFANARNAAAGSLRQKDPRVTASRPLHMVVHGIGTLEGYSG
LTRLSQAYDLLKAWGLPTSPHNRVVDGLDGVREFIAYYGENRHSVAHEID
GVVVKVDEIRLQGRLGSTARAPRWAIAYKYAPEEVNTKLVDIKVGVGRTG
RVTPYAQVEPVTVAGSEVEFATLHNQEVVKAKGVLIGDTVVLRKAGDVIP
EILGPVADLRDGSEREFVMPAECPECGTPLKAMKEGDIDLRCPNARACPA
QLRERVAYLAGRECLDIEHFGGVVAAALTGPLEPSEPPLVDEGDLFDLTV
EKLLPIKAYVLDPDSGLPKRDPKTGEEKIATVFANKEGEPKKNTLALLQH
IEEAKTRPLARFINGLSIRYVGPVAAQALAREFRSIDRIEQATEEELAST
DGVGGAIATAVKEWFAVDWHREIVRKWKAAGVPLEDRSTGEDEGPRPLEG
LTVVVTGTLENFTRDGAKEALQSRGAKVTGSVSKKTSFVVVGDNPGSKYD
KAMQLKVPVLNEDGFAVLLEQGPEAAADVALSAEE
>gid:400219  SCO5556  histone-like DNA binding protein
MNKAQLVEAIADKLGGRQQAADAVDAVLDALVRAVVAGDRVSVTGFGSFE
KVDRPARYARNPQTGERVRVKKTSVPRFRAGQGFKDLVSGSKKLPKNDIA
VKKAPKGSLSGPPPTISKAAGKKAAAKKATGAAKKTTGAAKKTSAAAKKT
TAKKTTGAAKTTAKKTTAKKSAAKTTTAAAKKTAAKKAPAKKATAKKAPA
KKSTARKTTAKKATARKK
>gid:400237  SCO5566  putative ATP-dependent DNA helicase
MDPVPALQEPLKKVLGPATAKVMAEHLGLHTVGDLLHHYPRRYEERGQLT
HLADLPMDEHVTVVAQVADARLHTFASSKAPRGKGQRLEVTITDGSGRLQ
LVFFGNGVHKPHKELLPGTRAMFAGKVSVFNRRLQLAHPAYELLRGGDDE
GEAAESVESWAGALIPLYPATAKLESWKLAKAIQTVLPSAQEAVDPLPGS
LREGRGLVSLPEALLKIHRPHTKADIEDARARLKWDEAFVLQVALARRRH
AESQLPAVPRKPGADGLLTAFDDRLPFTLTDGQRKVSREIFDDLATDHPM
HRLLQGEVGSGKTLVALRAMLAVVDSGGQAVMLAPTEVLAQQHHRSVVEM
MGELAEGGMLGGAEHATKVVLLTGSMGAAARRHALLDLATGEAGIVIGTH
ALIEDKVQFHDLGLVVVDEQHRFGVEQRDALRGKGKQPPHLLVMTATPIP
RTVAMTVFGDLETSVLDQLPAGRSPIASHVVPAADKPHFLARAWERVREE
VSNGHQAYVVCPRIGDEDDDPGKGAKQSKQPPEGDADKRPPLAVLDVAEQ
LARGPLQGLGVEVLHGRMQPDDKDAVMRRFAAGETDVLVATTVIEVGVNV
PNATAMVIMDADRFGVSQLHQLRGRVGRGSAPGLCLLVSEMPEASAARQR
LNAVASTRDGFELSRIDLEQRREGDVLGQAQSGARTSLRVLAVIEDEEII
AEARQEAAAVVAADPELTGLPGLRTALEALLDEEREQYLEKG
>gid:400240  SCO5567  putative DNA methylase
MTRVIAGKAGGRRLAVPPGTGTRPTSDRAREGLFSTWQSLLGGPLDGERV
LDLYAGSGAVGLEALSRGAGHVLLVEADARAARTVRANVDSLGLPGAEVR
AGRAEQIIRTPAPAEPYDVVFLDPPYAVSDDDLREILLTLRTEGWLGTEA
LVTVERSTRGGEFRWPHGFEAIRARRYGEGTFWYGRAASTCEDAR
>gid:400252  SCO5573  formamidopyrimidine-DNA glycosylase
MPELPEVEVVRRGLERWAAHRTVADVEVLHPRAVRRHVAGPDDFAHRLKD
HRIGTPSRRGKYLWLPLEDTDQAVLAHLGMSGQLLVQPHETPAEKHLRIR
VRFADALGTELRFVDQRTFGGLSLHDTSADGLPDVIAHIARDPLDPLFDD
EAFHHALRRKRTTIKRALLDQSLISGVGNIYADEALWRARLHYERPTATL
TRPRTTELLGHVRDVMNAALAVGGTSFDSLYVNVNGESGYFDRSLDAYGR
EGMPCRRCATPMRRRPWMNRSSYFCPKCQRPPRVTP
>gid:400323  SCO5600  conserved hypothetical protein SC2E1.17
MTPAGQDTYEGGLRRVARVVLLDPEDRILLLHGHEPDDPADDWWFTPGGG
VEGDETRAEAARRELLEETGITDVELGPVLWRRRCSFPFAGRRWDQDEWY
YLARTARTATEAVGPGLTELERRSVAGARWWTCEELTRARETVYPTRLAE
LLTTLLDEGPPAGPVTLDTEIV
>gid:400327  SCO5602  hypothetical protein
MIVGAGGGADMNARGAMGRYGETLAARRLTGAGMTVLERNWRCGRTGEID
IVARDGDVLVVCEVKTRRGGAFEHPMAAVTPDKAERLRRLAERWIQTHGG
APPGGVRIDLVGVLLPQRGAPVVEHARGVA
>gid:400330  SCO5604  conserved hypothetical protein SC2E1.21
MSGNTRPPEGAAPSGDTGPGTSGTLFPEAAVSFGGGSPAEGGAMTPPGAG
DGPDGGEARWAGPGPGDGDGAVTGAGVAARGRSAAGTCGTGGGGAGSGAR
GGGPEDRELLGRVFLARVFEPGDEAGGRWVRERGAPEVVRRLREGGRALP
GVSEKRWAGLCARAGRADPGRDLAVARSAGARFVVPGTAEWPGQLDDLGD
ARPLGLWVRGGPSLRMWALRSVAVVGARACTEYGAHMAATLAAGLAERGW
VVVSGGAYGIDGAAHRGALGAGGATAAVLACGVDRPYPPGHTALITRIAE
QGLVVGELPPGDHPTPSRFILRNRVIAALTRGTVVVEAAHRSGSLVTARA
ARRLGRHVMGVPTPLSGGAALAPIRFRLATAT
>gid:400359  SCO5620  putative recombinase
MYQRNTDALDVAMGVVERAVQPVRLRAVAYARVSTEEQAKGYGVQAALKK
ILRYIGRKDWDHVGTYTDEGISGSLEAADREDLKRLMADAHRQPKPFDIV
VVSEGRAIGRVGRAFWRWVWALEDIGIYVGVVEDDYDNSTPEGRKKMRRD
ADYAETEWETIRKRTQGGLQEKAEDGGWPGGQPPYGYEIACQGQKGKSHL
VQCAAEVKTLRMAWAMVVEEKLNTREVAARLNALNRFTRSGVPWSHSNLR
AKLLSESTINARVIFRNPNRAHAGHGAKFGEDGKPLHGETVIIKLKPIFE
PGEVAALQVTLAQTSKGAGRPKPKPYPLSKRLFNEHTGCNSHHVGMARTS
REGRWYRCTGLAAKYPGDPTCTCKMVDADAMESAVWGEVVSLLGDPDRLR
AMAAEWVGMAAGDQVQHADRIADFDKQIANLDRAIASMATECVKAGLPAA
AIAQASAALLEERRQLADLRDEAAAWLEETEAAEQRARDLEALATAARTR
LADMDPAEQGEVLALLDVKVTITGPVPKPKLGLTCSLAEWFKVAGRLVPS
ELTDDMWAAAEPVVKAWEPSNHKLRPGRLMLDAMFYKARTGCRWDDLPER
FGPWKGIHSRYKTWRNCGVWDEIMAALPVDGPGYRPVPEINLVPPFRVEG
RVDPRVLAGPDIQEEVVRPETGVPGPATSALSAGVHEMLRGDAVLVGDAA
EVVELVGAMGELPPERRGPVLPTDLLESRTRQVLAGLPGRGTATAGEVAL
RAQTTQDDAVARLYELRALGYVERHGDGWKLTRQAMISVRGGRSPC
>gid:400400  SCO5633  traSA:integrase fusion protein
MTWLMVAVVVVVAAAGLLRWRRPAWYWLTFGALVATVRILVRYASVMEAC
GLTVPPSRWRLALARMTNRPAPESRPPRILRLRPTRTGLVLRLKLQPGQD
AFDVAAATDRLRHSFGVYGVTSRELRSGVVEVRMTGYDVLQRVQMPAPAE
PRPMRIPVALREDGAVHYRDYRAVPHGLTLGATESGKSVYQRNLVAGLAP
HHVALVGIDCKQGVELFPLARRFSALADNPDTALDLLEALVGHMKDVYQL
IRAEQRISVAVPDAEIAADIWDLREDLRAVPVVVLVDEVAELALFASKDE
EKRRDRIITALVRLAQLGRAAGIYLEICGQRFGSELGKGITMLRAQLTGR
TAHRVNDETSADMAFGDLSPDAVLAAIQLPTDTPGIAVTGDSTGGWARIR
APHTTKSFPDRQKRLAELWLIEIASDMSRGRYVDPRAARVTFKGYAVKWL
ETHGIDPASQVVVEQRLRLHAFRLIGSRPLDSFRPEHIRGLVSALENDPA
VSGGYARNIYGDVRAVLSAAVDDGLLPRNPCSAKSVRPPAVEQRRVVPWL
PEQVQAVRAALPQRYRPMVDMGAGCGLRQGEIVGLAEDAVDFASGIVRVL
RQVKLIRGKAVFAPPKCNKERDVPLPPSVADALPAHMDAFKPVEITLPWR
KPDGPKVSARLLFTNTASGLVWRSNFNVQEWKPALAAAGLISEAGADGKY
ESAREHGMHALRHFYASVLLDAGESIKAVSEYLGHADPGLTLRVYAHLMP
SSQERTRSAIDQSLRFSG
>gid:400421  SCO5641  transposase
MVHRNAPLTETGRLRLARCVVEDGWPVRRAAERFQVSHTTASRWARRYRQ
LGVTGMSDRSSRPHHQPRRTAAAVEEHVLRLRREHRIGPLRLAVRCGIAA
STAHRILVRHGLPPLAALDRATGEPVRRYERARPGELVHIDVKKLGRIPD
GGGHKTLGRAEGHRSRTNGAGWAYLHTALDDHSRIAYTEDLPDETAPTCA
AFLVRATAYFASLGIRIERVLTDNAWAYSKNTWRNTCRDLDISPRWTRPW
RPQTNGKVERFHRTLLDEWAYQKPYTSDHERREAFTHWLHWYNYHRPHTG
IGGHTPASRGTNLSEQHS
>gid:400720  SCO5760  DNA glycosylase
MPEGDTVWQAARRLHDALAGRVLTRSDFRVPRYATVDLTGRTVLDVTPRG
KHLLTRVEGGLTVHSHLRMDGSWKVFAPGQRWSGGPAHQIRVILGTADRT
AVGYRLPVLDILRTAEEQRAVGHLGPDLLGPDWDPERALDNLRADPPRAL
GEALLDQRNLAGIGNVYKSELCFLLGVTPWLPVGELPADRAARLPTLAKK
LLEANRDRPVRRTTGLRGQDLFVYGRAPRPCLRCGTSVRVADQGDGSRER
PTYWCPTCQAGPAPRPGGRTGVRPRR
>gid:400741  SCO5769  recombinase A
MAGTDREKALDAALAQIERQFGKGAVMRMGDRTNEPIEVIPTGSTALDVA
LGVGGIPRGRVVEVYGPESSGKTTLTLHAVANAQKAGGQVAFVDAEHALD
PEYAKKLGVDIDNLILSQPDNGEQALEIVDMLVRSGALDLIVIDSVAALV
PRAEIEGEMGDSHVGLQARLMSQALRKITSALNQSKTTAIFINQLREKIG
VMFGSPETTTGGRALKFYASVRLDIRRIETLKDGTDAVGNRTRVKVVKNK
VAPPFKQAEFDILYGQGISREGGLIDMGVENGFVRKAGAWYTYEGDQLGQ
GKENARNFLKDNPDLANEIEKKIKQKLGVGVHPEESATEPGADAASAAPA
DAAPAVPAPTTAKATKSKATAAKS
>gid:400832  SCO5802  putative ATP-dependent helicase
MTKPSLPELLHAAVTAVGGTERPGQVAMAEAVEEAIDGGSHLLVQAGTGT
GKSLGYLVPALAHGERVVVATATLALQRQLVERDLPRTVDALHPQLRRRP
EFAMLKGRSNYLCLHRLHEGVPQDEEEGLFDQFEAAAPTSKLGQDLLRMR
DWADEAETGDRDDLTPGVSDRAWAQVSVSSRECLGASKCAYGAECFAETA
RERAKLSEVVVTNHALLAIDAIEGAPVLPQHEVLIVDEAHELVSRVTGVA
TGELTPGQVNRAVRRAAKLVNEKAADQLQTAAEGFERLMELALPGRLEEV
PEDLGYALMALRDACRTVISAIGTTRDKSVQDEDAVRKQALASVESVHDV
AERITNGSEWDVVWYERHDRFGASLRVAPMSVSGLLREKLFTDRSVVLTS
ATLKLGGDFNGVGASLGLAPEGTEGDDVPQWKGVDVGSPFDYRKQGILYV
AKHLARPARDGDRGDMLDELTELIQAAGGRTLGLFSSMRAAQLAAEELRS
RIPEYPILLQGEETLGELIKNFAADPQTCLFGTLSLWQGVDVPGPSCQLV
VMDKIPFPRPDDPLMSARQKAVEEAGGNGFMAVAATHAALLMAQGAGRLV
RASGDRGVVAVLDQRLATARYGSYLKASLPDFWFTTDRNQVRKSLAAIDA
KARQTEAAGQPESG
>gid:400858  SCO5812  probable ribonuclease HII
MPYEPPTHTVERSLRATTGAKIIAGVDEVGRGAWAGPVTVCAAITGLRRP
PVGLTDSKLLTIKRRTELEVELRTWVTSYALGHASPEEIDAMGMTAALRL
AAVRALGTLPVRPDAVILDGKHDYLGAPWRVRTVIKGDQSCIAVAAASVL
AKVQRDKMMAELGVDHADFGFADNAGYPSPVHKAALEERGPTPHHRLSWA
YLDALPQWRHLKKVRSWVEGSVPEIEGQLGFDF
>gid:400862  SCO5815  putative ATP-dependent DNA helicase
MDHVELRTEADAVLAELVGDREGSARLREDQWQAVAALVEEHRRALVVQR
TGWGKSAVYFVATALLRRRGAGPTVIISPLLALMRNQVEAAARAGIRART
INSANPEDWEAIYGEVERGETDVLLVSPERLNSVDFRDQVLPRLAATTGL
LVVDEAHCISDWGHDFRPDYRRLRTMLAELPAGVPVLATTATANARVTAD
VAEQLGTGAGDALVLRGPLDRESLRLGVLVLPDAAHRLAWLGERLGELPG
SGIIYTLTVAAAEEIAAFLRQRGYPVASYTGKTENADRLQAEEDLLANRV
KALVATSALGMGFDKPDLGFVVHVGSPSSPIAYYQQVGRAGRGVDHADVL
LLPGREDEAIWAYFASVGFPPEEQVRRTLAVLEEAGRPMSLPALEPLVDL
RRSRLETMLKVLDVDGAVKRVKGGWAATGQAWAYDAERYAWVARQRQAEQ
QAMREYVSTTRCRMEFLQRQLDDEKAAPCGRCDTCAGPWLDPAVSAGALA
AATGELDRPGVEVEPRKMWPTGLAAVGMDLKGRIPAGRQALTGRALGRLS
DIGWGNRLRPLLSAQAADGPVPDDVLRAVVTVLADWARSPGGWATGSPDA
VARPVGVVAVPSRTRPQLVGSLAEGVARVGRLPLLGSLAHTPHADEYAAH
RSNSAQRLRALAESFTVPGELAAALAATDGPVLLVDDFTDSGWTLAVGAR
LLRQAGADDVLPLVLALAG
>gid:400893  SCO5822  putative DNA gyrase subunit B
MTAETSVPSTALLAGADRDGSNYTARHLLVLEGLEAVRKRPGMYIGSTDS
RGLMHCLWEIIDNSVDEALGGYCDHIEVILHDDASVEVRDNGRGIPVDVE
PKTGLSGVEVVMTKLHAGGKFGGGAYAASGGLHGVGASVVNALSARLDIE
VDLGGHTHAISFRRGVPGTFAKAGPDAKFEAGSGLRKVKKVPKSRRGTRV
RYWADRQIFLKDAKLSLDNLHQRARQTAFLVPGLTIVVRDEFGLGDGGSK
GEESFRFDGGISEFCEFLAGDRPVCDVLRFSGQGSFKETVPVLDEDGQMT
PSEVTRDLGVDVALRWGTGYDTTVRSFVNIIATPKGGTHVAGFEQAVAKT
MNEVLRTKKMLRVAEDDIVKDDALEGLTAVVTVRLAEPQFEGQTKEVLGT
SAARRIVNNVISRELKAFLTSTKRDAAQQARVVMEKAVAAARTRIAARQH
KDAQRRKTALESSSLPAKLADCRSDDVDRSELFIVEGDSALGTAKLARNS
EFQALLPIRGKILNVQKSSVTDMLKNAECGAIIQVIGAGSGRTFDIDAAR
YGKIIMMTDADVDGSHIRTLLLTLFHRYMRPMVEAGRVFAAVPPLHRIEL
VQPKKGQDKYVYTYSDRELRDKLMEFQSKNIRYKDSIQRYKGLGEMDADQ
LAETTMDPRRRTLRRINLTDLDSAEQVFDLLMGNDVAPRKEFISSSAATL
DRSRIDA
>gid:400928  SCO5836  DNA gyrase-like protein
MARRSTKTPPPDDSYEERILDIDVVDEMQGSFLEYAYSVIYSRALPDARD
GLKPVHRRIVYQMNEMGLRPERGYVKCARVVGEVMGKLHPHGDASIYDAL
VRMAQSFSMRVPLVDGHGNFGSLGNDDPPAAMRYTECRMAEAAGLMTESI
DEDTVDFAPNYDGQEQEPVALPAAFPNLLVNGASGIAVGMATNMPPHNLR
EVIAAARHLIRYPNADLDALMKHVPGPDLPTGGRIVGLPGIRDAYETGRG
TFKIRATVSVETVTARRKGLVVTELPFAVGPEKVISKIKDLVGAKKIQGI
ADVKDLTDRAHGLRLVIEIKNGFVPEAVLEQLYKLTAMEESFGINNVALV
DGQPLTLGLKELLEVYLDHRFTVVRRRSEFRRSKRRDRLHLVEGLLTALV
DIDEVIRLIRSSENSAQAKQRLMERFSLSDVQTQYILDTPLRRLTKYDRI
ELESEKDRLNAEIEELTRILDSDAELRKLVSAELAAVAKKFGTDRRTTLL
ESSGAPVAAVPLQVADDPCRVLLSSTGLLARTANDEPLVTEAGAKRVKHD
LIVSAVPATARGEVGVVTSGGRLLRVNVVDLPQLPEAMPTPNLAGGAPLA
EFVSLEDDEDVVCLTTLDESSPGLALGTEQGVVKRVVPDYPSNKDELEVI
TLKDGDRIVGGVELRTGDEDLVFITDDAQLLRYQASQVRPQGRPAGGVAG
VKLADGAKVISFTAVDPAADAVVFTVAGSRGTLDDSVQTTAKLTPFDQYP
RKGRATGGVRCQRFLKGEDCLAFAWAGATPALAAQKNGTPAQLPDTDPRR
DGSGVSLPKTVSVVAGPV
>gid:401119  SCO5920  probable DEAD-box RNA helicase
MNRTRTNDRFARTRHGGADSGKGGSRFGSPAPRRPAGPSRSGGYGRRPGA
VQGEFALPRTITPALPAAEGFADLDMPGELLAALGQQGVTVPFPIQAATL
PNSLAGRDIMGRGRTGSGKTLAFGLALLARTAGRRAEPRQPLGLVLVPTR
ELAQQVTDALTPYARSVKLRLATVVGGMSIGRQASALRGGAEVVVATPGR
LKDLIDRGDCRLNQVSVTVLDEADQMADMGFMPQVTALLDQVRPEGQRML
FSATLDRNVDLLVRRYLSDPVVHSVDPSAGAVTTMEHHVLHVHGADKHAA
TTEIAARDGRVIMFLDTKHAVDRLTRDLLNSGVRAAALHGGKSQPQRTRT
LAQFKTGHVTVLVATNVAARGIHVDNLDLVVNVDPPTDHKDYLHRGGRTA
RAGESGSVVTLVTPNQRRGMVRLMSEAGIRPQTTQVSPGDEALSRITGAQ
APTGIPVVITAPATERPKKRGATSRGRRRPASATRRTPALKSTAGAAA
>gid:401369  SCO6015  hypothetical protein SC1C3.03c
MEDNAPLLVVVDAANVVGSVPDGWWRDRRGAAERLRDRLAADGVPGRAGP
VDIVLVVEGAARGVESVPGVRVESAPGSGDDHMVGLVARAADDRPVLVVT
ADRELRRRVTGLGAEVAGPRTVRPV
>gid:401502  SCO6084  putative DNA polymerase
MGWHRELLIGFDLETTGTDPREARIVTGAVIEVRGSEPLGRREWLADPGV
EIPADAVAVHGISNERAAREGRPADQVADAIATVLVDHWKAGVPVVAYNA
AFDLTLLAAELRRHALPSLRERLGGLDPAPVIDPYTIDRSVDRYRRGKRN
LEAVCREYGVRLDAAHDATADALAAARLACAIADRHPKVAALGPADLHRR
QIEWYAEWAADFQSFLRRKGDATAVVDGTWPLRESAGEPADERV
>gid:401647  SCO6150  putative ADA-like regulatory protein
MTPQTVQPAEHADAREDVRYEAVRSRDARFDGAFFFAVETTGIYCRPSCP
AVTPKRRNVRFFATAAAAQGSGFRACRRCRPDAVPGSADWNVRADVVGRA
MRLIGDGVVDREGVAGLAGRLGYSARQVQRQLTAEVGAGPVALARAQRAH
TARVLLQTTVLPVTEIAFASGFASVRQFNDTIRAVYAATPSELRAAAPAR
DRAARRTATPSAGVPLRLAHRGPYQAGPVFDLLQREAVTGVEEVSGETGR
RLYRRTLRLPYGTGIVAVQERPGRAGTGSGGWLEARLHLTDLRDLTTSVQ
RLRRLFDLDADPYAVDERLGADPRLAPLVAARPGLRSPGTADPAELAVRA
LVGRTEAERLVQRYGKALDAPCGTLTHLFPEPDVLAGAAPHGTPGALAAA
LADGAVRLDPGADRDDAERALLAVPGLDARTVAVVRTRALGDPDVAPPGA
AVPDTWRPWRSYALNHLRAAGEWENDR
>gid:401650  SCO6151  putative methylated-DNA-protein-cysteine methyltransferase
MTTTTPTTTTTSIPAETYWHEVDSPVGPLLLTAGSDGALTSLSVPGQKGG
RSVRDGWRHDAGPFRVAEEQLGAYFAGELTEFSLPLRAQGTAFRERVWAA
LDDVPYGATTTYGEIAARIGASRPAVRAVGGAIGANPLLILRPCHRVIGA
DGSLTGYAGGLERKTRLLSLEGAPLSRPVPLPATPR
>gid:401853  SCO6262  putative helicase
MGDMAVTEAGRAGATTASVPVGLAAVFLPAPLPREGRIAFWNPDGGPVDA
AEDFGGAAAGDGRGHGDGERSGGPTELTVVRRHGAGVRRGTAPALSLPLA
EALPHLVRARHDRAAHPATACWGAAALHALRLTARGRLLPGLTPTGHDAW
RAGPLDPDDIAHLRAVAAALPPEGHAVPLDGPGPIRLPEPEALVRAFLDA
VADTLPRTPAAPHASGRPFAAREARRLPDAHDWAAEVAAGMDAGVRISLR
LDLSAYDLFDREGEGAPGSGSEGVGARNAGAAVVQVHSLADPTLVADAAD
LWSGTADAAFGARTRVDTALAVRRAARVWPPLDRLTDQDVPDVLALSEEE
VTDLLGVAAGRLAAAGVAVHWPRDLAQDLTATAVIRTAPGSATDGTGFFE
SEDLLQFRWQLAIGGDPLTEAEMDTLAEAHRPVVRLRDRWVLVDPALVRR
ARKRDLGLLDPVDALSVALTGSAETDGETVEVVPVGALAALRDRLTAGVR
PAEAPPGLHATLRDYQLRGLAWLDLMTSLGLGGCLADDMGLGKTVTVIAL
HLRRARTEPTLVVCPASLLGNWQREINRFAPGVPVRRFHGPDRTLDDLTG
GFVLTTYGTMRSAATTLAEQPWGMVVADEAQHVKNPYSATAKALRTIPSP
ARVALTGTPVENNLSELWALLDWTTPGLLGPLKSFRARHARAVENGEDDQ
AVERLARLIRPFLLRRRKSDPGIVPELPPKTETDHPVPLTREQAALYEAV
VRESMLAIEEAEGIGRRGLVLKLLTSLKQICDHPALFLKEEHPPGGTDRM
TARSGKLALLDELLDTVLAEDGSVLVFTQYVGMARLITSHLAARAVPVDL
LHGGTPVPERERMVDRFQSGATPVLVLSLKAAGTGLNLTRAGHVVHFDRW
WNPAVEEQATDRAYRIGQTQPVQVHRLITEGTVEDRIAEMLQSKRALADA
ILGSGESALTELTARELSDLVSLRRPS
>gid:402074  SCO6337  putative transposase
MTPSASSGCAGYSRSRGPASTGGWPARTRGPAGRGRTPSSPNASPGSIRN
RTAPTGSARVTAELKDSGRRVNHKRGERVMRKFHIVGLHLRKKVRTTIPE
PSATPVPDLLQRGIAAQMPNTKNVGDITCLPVGNGQFLYLATVLDLCSKR
LTGWSIADHMPTSLVTDVLRAAARARGGDGLRGAISHSGNGAQYVSKEFA
QVCSELGVTRSRGAVGTSADKAAAESLNTTMKRGTLQGRKRWNGASEARL
AVFRWATRYNTRRRTPASARSARSPTNGDQLRWPPPHDNRCPRSQGRPSS
VRAGGWPSFS
>gid:402075  SCO6338  putative transposase
MGTSKYSPEFRADAVALYHASPGGTYASVAKDLGVNHETLRTWVRDAEQA
GRPGAVEATAMDKENRQLRARVKELELEREILRRAAKYFAAETSW
>gid:402083  SCO6341  putative exonuclease
MRIATWNVNSITARLPRLLAWLESSGTDVLCLQEAKVAEAQFPFDALREA
GYEAAVHATGRWNGVAVLSRVGLEDVVKGLPGDPGYEGVQEPRAISATCG
PVRVWSVYVPNGREVDHPHYAYKLQWFEALRAAVEGDAAGGRPFAVLGDY
NVAPTDDDVYDRAAFEGATHVTPAERAALASLRGAGLSDVVPRPLKYDHP
FTYWDYRQLCFPKNRGMRIDLVYGNEPFAKAVTDSYVDREERKGKGASDH
APVVVDLDL
>gid:402164  SCO6380  hypothetical protein SC4A2.16c
MKIVPADVGPIEVKGPRDTAGVFEPQMVKKRRQQRRTGLDEGVLSLSAQE
RRPADSGRESTTHGPPEAGQRSASVRALAGPADGGQAWKDVPGQCRQGEG
QQERDQKEEDHGLQQLTQ
>gid:402176  SCO6387  hypothetical protein SC3C8.06
MIHTRDHLRQHWEEGEHTATVLHQEIAAKGYGGHYQRDKMAIAPLRRGLP
IDTPMRATAITPPGHPLIATAPSRRDLHTTEALRRLLDHCRELARTDSLV
RRFAAMLAARDARPLAHWLEQLSDAGLPALASLANAIPEDQPSEGVGSPL
SAMANPAGSMALTLPAARRPGGQSGSRADVCPPAFTGP
>gid:402182  SCO6391  putative IS110 transposase/integrase
MFDTEDVGVFLGLDVGKTAHHGHGLTPAGKKVLDKQLPNSEPRLRAVFDK
LAAKFGTVLVIVDQPASIGALPLTVARDAGCKVAYLPGLAMRRIADLYPG
EAKTDAKDAAVIADAARTMAHTLRSLELTDEITAELSVLVGFDQDLAAEA
TRTSNRIRGLLTQFHPSLERVLGPRLDHQAVTWLLERYGSPAALRKAGRR
RLVELVRPKAPRMAQRLIDDIFDALDEQTVVVPGTGTLDIVVPSLASSLT
AVHEQRRALEAQINALLEAHPLSPVLTSMPGVGVRTAAVLLVTVGDGTSF
PTAAHLASYAGLAPTTKSSGTSIHGEHAPRGGNRQLKRAMFLSAFACMNA
DPASRTYYDRQRARGKTHTQALLRLARQRISVLFAMLRDGTFYESRMPAG
VELAA
>gid:402183  SCO6392  putative transposase
MTPPRPAPTTVAPLEKENNRVTMLAEQVDGVIGVDTHRDTLAAAAVSPIG
AALATTDAPANARGYRRLLEFARQHIPGRRCWALEGVGSYGAGLAAFLDQ
AGEQVVEVLRPKRSAVRGGRKTDMLDAIRAGKEALASEHLIQPRARGERE
ALRVLLVTRHGAVLASTAAINQFKGLIVSAPDDLRAELRKLKRPAQINRC
AQLRDRPAQSIEHRMTVRALRSTAQRVQTLQAEARDLEGEIISLVRQMAP
ELLELQGVGPITAAQVLVSWSHPGRFRSEAAFASFAGVSPIPASSGLTNR
HRINRSGDRQLNRALHTITLIRMRLDPATKTYVARRISEGKTSRDAQRCL
KRAICRQLFKILERSGRTPTTSSEPLDSI
>gid:402186  SCO6393  putative transposase
MSLLLSVRGGREALTVVLDPHRWLELRRFRPLYESGAMSLREIAKETGLN
RRTVSKYLKDPASLAPPKREVADQRPRRVVDEVAPLIDAMLRSEILLKGR
VIHERLVQEYGVAINYQRVKLYLQEARPRIAEELGISPGELAGLHRRFEV
VPGAQAQVDWGDEGKILAHVGIPKVYSFHMTLSYSRDPFCCFTTSQDLAT
FFDCHRKAFAHFGGVPMSVVYDRTKTVVRRHVAPGEAVPLHPEAVAFAGH
YDFDIDVLAAYRPQGKGRVERQVGIVRDHVLAGRAFSSVEEMNAAFAAWV
PFRRAKVHGTHGEVIGHRAVRDHMALRPLPRTPYVVAQRHLRHVGKDCLV
AFDANLYSVPARKIRPRQLVEIRATKSQVSLHSTVPDPSGRTLLAVHPRA
VGRGARIVDETHWDGLPTGAGRRVTTGDALPSPRRGQPAGPETGPLQALL
NRAAAANVEVGRRPLSVYDELTGTRPFTAPAPTKEAR
>gid:402187  SCO6394  putative IS element ATP binding protein
MSELTSNRIRTTAAKLGLPHLAEALNQYVQRADEAKMGYLDFLDLVLAEE
LAVRDDRRFRNGLRLSKLPHHKTLEDYDFSFQPDLDPRKVKDLATLSFIE
DKANVALLGPPGVGKTHIAVALAVAACRAGYSIYFTSLDDMVRHLKAAED
QGRLISKLTSYLRPAVLVVDEVGYQPLERAEANLVFQVISKRYEKGSIIL
TSNKTFGEWGQVFGDEVLATAILDRLLHHCEVVSINGNSYRLKNRLQAIE
RDTDVA
>gid:402199  SCO6400  putative IS117 transposase
MWEDSLTVFCGIDWAERHHDVAIVDDTGTLLAKARITDDVAGYNKLLDLL
AEHGDSSATPIPVAIETSHGLLVAALRTGSRKVFAINPLAAARYRDRHGV
SRKKSDPGDALVLANILRTDMHAHRPLPADSELAQAITVLARAQQDAVWN
RQQVANQVRSLLREYYPAALHAFQSKDGGLTRPDARVILTMAPTPAKAAK
LTLAQLRAGLKRSGRTRAFNTEIERLRGIFRSEYARQLPAVEDAFGHQLL
ALLRQLDATCLAADDLAKAVEDAFREHADSEILLSFPGLGPLLGARVLAE
IGDDRSRFTDARALKSYAGSAPITRASGRKHFVGRRFVKNNRLMNAGFLW
AFAALQASPGANAHYRRRREHGDWHAAAQRHLLNRFLGQLHHCLQTRQHF
DEQRAFAPLLQAAA
>gid:402209  SCO6405  putative DNA recombinase
MGKALTRAVDTALRTTGTAPLRAVDYLRVSTEEQADGYGIAYTGKKTVRY
IQKKGWKHVGTYADEGFSGSLEADDRPDLRRLMKDARKTPRPFDMVVVNE
GRGIGRTGRAFWKWVWDLEDLGVYVAVVKKDYDNSTPAGRSQMRKDADYA
EEERELIRERTQGGIQEKAEDGLYPGGMVPFGWDVAERGKKGASHYVVHK
EEAATLRRAREVFLEKRSWEETALILNSEKRLTRSGNGWTAKNIRCRLLG
DAALESRVTWRGHDAQRDQDGNPIYGETVVINLPSIFTDEEVQELKSAME
SRRPSTKTRSRIYLLTGLMTSPCGKTYEGHQHRPTEVIYRCKGRHESYAG
AGDRCTCPYLEATSIEKQVWKDVMATLGDADRMKAMAQDWLNATSHKRMD
YTKRIAELDQRIAETEDLIDITAATAAKRALRRGLSREEAEEAAERAAKP
HEDELAGLEKLRREAEAWQREAAETGRWLQNLERLAEVAGRNLQDVEPAE
KAELLRMMKTRAEVLRCAPRRKGVACAVREWFVAAGRCVPVLTDDAWERI
APLMGGPRCTIDRRVMVEALLEKATSGARYKELAPKYGVDWKTLQTQANR
WLNRGVWAEAMKVLADAECVPVWQPDPVEIKVTFAPLALESRVGVEERDG
SNRARPHRTPYVRRNPGRPCPG
>gid:402336  SCO6461  putative ADA-like regulatory protein
MHTDTERCVRAVQSKDARFDGWFFTAVLTTGIYCRPSCPVVPPKPGNMTF
YPSAAACQQAGFRACKRCRPDTSPGSPEWNRRADLTARAMRLIADGVVDR
EGVPGLASRLGYSTRQVERQLLAELGAGPLALARAQRAQTARLLIETTPL
PMADIAFAAGFSSIRTFNDTVREVYALSPSELRTRAPRNRRAATAPGALS
LRLPFRAPLNPDNLFGHLAATAVPGVEEWRDGAYRRTLRLPYGHGIVALT
PNPDHIACRLTLSDLRDLTVAISRCRRLLDLDADPTAIDDQLRADPLLAP
LVDKAPGRRVPRTVDEAEFAVRAVLGQQVSTAAARTHAARLVTAHGDPVD
DPEGGLTHLFPSTEALAAVDPETLAMPRTRRTTFTTLVAHLADGSVNPGV
ESDWAETRARLLALPGFGPWTADVIAMRALGDPDAFLPTDLGIRRAAAEL
GLPSTPAALTARAAAWRPWRAYAVQYLWATDDHPINFLPV
>gid:402340  SCO6462  putative methylated-DNA-protein-cysteine methyltransferase
MKQHTVIDSPYGALTLVAEDGALCGLYMTDQRHRPDEETFGARDERPFAE
TEEQLEAYFSGELKDFTLGLRLNGTPFQRMVWTQLRKIPYGETRSYGELA
AALGNPAASRAVGLANGRNPIGIIVPCHRVIGASGGLTGYGGGLERKQRL
LDFERGTAVPEALF
>gid:402345  SCO6465  hypothetical protein
MTTADYATYIAGLPRVLAGAAAVFRDAAGRVLLVEPNYREGWALPGGTIE
SGDGESPRQGAWRETLEEIGLDVRIGRLLAVDWSNGAGRPPIVAYLYDGG
VLSEDDLKAIRLQEEELLSWRLVPRAELGAHLLGSLHGRLLAALDVLADG
SGTAELEDGVRVDR
>gid:402371  SCO6481  conserved hypothetical protein
MRWENLTDSDHARAADTALFGADAVVTRTFDTPEFRGITFHEVRARSILN
RVPGASRMPFEWTVNPYRGCTHACVYCFARKTHSYLDLDTGLGFDSQIVV
KVNAPEVLRRQLASRRWQGEHVAMGTNVDCYQRAEGRYRLMPGIIEALTE
RANPFSILTKGTLILRDLDLLTRAARVTEVGISVSVGFTDAELWRTVEPG
TPAPERRLEVVRALGERGIGCGVLMAPVIPFLGDEPAQLRATVRAIAAAG
ATSVTPLVLHLRPGAREWFMAWLGRHHPHLVRRYERLYADGAYAPKWYQR
RITRQVHDLAREYGIGPARAGMPRRVAEPEHSRPAEPAPPEPVQLSLI
>gid:402372  SCO6482  conserved hypothetical protein
MAQVEATTERIVAADAETVFDTLADYSGTREKLLPEHFSEYEVREGGDGE
GTLVHWKLQATSKRVRDCLLEVTEPTDGELVEKDRNSSMVTTWRVTPAGE
GKSRVVVLTTWQGAGGVGGFFEKTFAPKGLARIYDALLAKLAAEVEK
>gid:402396  SCO6498  conserved hypothetical protein SC1E6.07
MTGSGGTGGGGRGGGARTVRAGRHTVEVHRPDKVLFPVGEGGTEEYTKGD
LVDYHRAAAPFMLPHLRGRPLMLERHPDGVDGPRFMQKNTPEHYPEWIDR
VEVGKEGGTVCHTVCDDSATLVYLADQAALTLHRWLSRTGRLDRPDRMVF
DLDPAQDDFEAVRAAARLLAGLLDELKLPSAPMTTGSRGLHVVVPLDGRQ
DFDDVREFARAVADVLAAAHPDRLTTAARKKDRGDRLYLDIQRNAYAQTA
VAPLTVRARPGAPVATPISWDQLDDPALHARRWTVADAVEQARTRPWAGI
MNSPRALGPARRRLNALHG
>gid:402430  SCO6517  uvrA-like protein
MSSAKRPGTPGPGSHVADSHDLIRVHGARENNLKDVSVDIPKRRLTVFTG
VSGSGKSSLVFNTIAAESQRLINETYSAFVQGFMPTLARPEVDVLDGLTT
AIIVDQQRMGADPRSTVGTATDVNAMLRILFSRLGEPRIGPPSAYSFNTA
SVRASGAITVERGNKKAVRATFERTGGMCTHCEGRGTVSDIDLTQLYDDS
KSLAGGAFTIPGWKSDSQWTVQVYAQSGFVDPDKPIREYTEKELRDFLYG
EPVKVKVNGVNLTYEGLIPKIQKSFLSKDKEAMQPHIRAFVERAVTFTTC
PECEGTRLSEGARSSKIKKISIADACAMEIRDLAEWVRDLTEPSVAPLLT
ALRDTLDSFVEIGLGYLSLDRPAGTLSGGEAQRVKMIRHLGSSLTDTTYV
FDEPTVGLHPHDIQRMNDLLLRLRDKGNTVLVVEHKPEAIAIADHVVDLG
PGAGTAGGTVCFEGTVEELRAADTVTGRHLDDRAVLKESVRKPAGALEIR
DARTHNLQGVDVDVPLGVLCVVTGVAGSGKSSLIHGSVPAGADVVSVDQS
PIKGSRRSNPATYTGLLDPIRKAFAKANGVKPALFSANSEGACPTCNGAG
VIYTDLAMMAGVATPCEDCEGKRFQPAVLEYRFGGRDISEVLAMSVDQAE
EFFGAGEARTPAAHKILQRLSDVGLGYLTLGQPLTTLSGGERQRLKLATH
MGEKGGVYVLDEPTTGLHLADVEQLLGLLDRLVDAGKSVIVIEHHQAVMA
HADWIIDLGPGAGHDGGRVVFEGTPADLVADRSTLTGEHLAAYVGA
>gid:402701  SCO6626  putative protein kinase
MREGRWVTVTESEFEHERRGLEAIRQKLPDGDPWRAWSNFTFTANTGHVR
EVDLLVVAPGGLCMVELKDWHGSVTSENGTWVQTTPGGRRRTHGNPLHLV
NRKAKELAGLLAQPGAKRVWVAEAVCFTDNGLRVRLPAHDQNGVYTVDEL
VDMLKQAPSDERRRVTAIGSREVAAALKNIGIRKSDAQYKVGPYELERKS
FDSGPTWADYLARHSDLPEAARVRIYLSERGSDASLRQSVENAARREAAV
LGRFKHPGAVQLKQYFPSGHAAGPALIFDYHPHTQKLDEYLVQYGEKLDI
LGRMALVRQLAETVRSAHASRIHHRALAARSVLVVPRSRGGKGRAVGEEA
AWLTPQLQISDWQIATQRSGDSSQGQGMTRFAPTALSAMHLADDADAYLA
PELTALNPDPVYLDVYGLGVLTYLLVTGKAPAASQAELLARLEAGEGLRP
SSLVDGLSEDVDELVQAATAYRPGQRLSSVDEFLELLEVVEDSLTAPAAA
LDGPAEDETGASADKDPLEVVAGDLLAGRWEVRRRLGTGSTSRAFLVRDL
EAETRRTRPLAVLKVALSDSRGEILVREAEAMRRLRPHSGIIRLAEPEPL
HIGGRTVLALEYVGDERDDDGPGAEGATRPRRREETVARQLREHGRLPVD
QLEAYGDYLFGAVDFLEGEGIWHRDIKPDNIAVRIRPNRTRELVLIDFSL
AGYPAKNTDAGTDGYLDPFVDVITRGSYDSHAERYAVAVTLHQMASGELP
KWGDGSVLPRMTDPKEWPYPTIAAEAFDPAVRDGLVAFFQKALHRDAGKR
FPELKPMRDAWRKVFLDASQTVPSSHRTRPAAPADGAAPAEGAAAGIADA
EPETAEQQRDRLAAEVTRDTPLTVSGLTPAAQSFLYGLGITTVGELLDYS
RRKLVNAPGLGAKTRNEVQQRQREWGERLREAPVSPLTPKGRAEAKEELE
QLTAAESALVGQLATGESAGALSARTLRSVSLDTLATVLVPAVNNNGSNR
NKAEMVRLLLRLPDEHGVLPGIGVWPKQKDVADALGLSHGRIPQMLKDER
KRWKAEPAVQALRDEIIELLASMGRVASAVEIADALAVRRGTHLAGREQR
RAMALAAVRAVVEVEQLVPQEVEFQHQPNRKATDESLGAGLLALDVREDD
APDTPTAPGLLDYATRLGKTADRLARLDTLPTAATVLAELGALTVPPGAV
DWDERRMVELAAAASVNAAATPRLEIYPRDLSLVRALRLTQAGLVRWIPG
VPEGRQPGLTGEDVHERVRARFPELVVPDGRGGTAHELPTAGPLTKALRD
AGFELSLSMREDTGTLRYLPTRVDEASSYLTTGAWRQSTRTGTVTRYADD
PQLAGAVRAEERLLASAHRDGYRVLTVRQQLVRDAVRELGAERLGGQAVS
VTELFLEALHGQVTPGTKPTWETLLKADAAEPGSKGAVRFAEYARTAWGS
VEPRIAELLGDGGGGAGPVLLTEAGVFARYDAMGVLDRLASAARRGGRGL
WLLVPQSDPSREPRLGQVAVPYQAGLGEWIQLPDTWVGNRHRGSGEVVAS
GVEGDAK
>gid:402737  SCO6641  conserved hypothetical protein
MVTVARLNVAVTRARHRVEVVASFHGADLPDNANKSVQHLKRYLQYAEQG
PTVLAPAAPDAEAAPESPFEEDVLAVLRDWGYDVQPQGGVAGFRIDMAVR
HPGAPGAYALGIECDGAMYHSSRAARDRDRLREEILRGLGWNLHRIWGTD
WYRNRKDAQRRLREAVEEACAADPYAPEPVSTPVPSTEQTPAEIAIVPVA
ESDRSEWSRPYRALGWEKPYELKDTLSTAAGLPGVDLHDPAAKAVVAEVA
HHIITMEGPIEEDVLIGRVRSAWLLDRSGQIVQSSVRDALSRLRKKNKAV
RSGTVWDLPAREVTFARTPTPDFDRKKVSQVPSAERRIALFGILSESPGM
RREELARETARFFGWLRLGPDIKAAFDQDIEELIGGGLVTSGSSGLLPVE
GSAG
>gid:402829  SCO6681  putative serine/threonine protein kinase
MTAATVRGGTPRDSVVSVSNRWEGAGVNKGYAVYCDADPYFYDAPHRTAD
RTGAARSRYAAASSPVPEGWQRHESGDWLALRPADADLPAQGWKIHVSAC
LDNAESVLDRVWRHCVDGGTAFKFVPSRYLLHQRNAKYADRAGSGKFVTV
YPADEAEFERLVGELSELLAGEPGPHILSDLRIGDGPVHVRYGGFTRRDC
YDADGELRPAVSGPDGVLVPDLRGPVFRIPEWVDPPAFLRPHLDARSAVT
VTGMPYTVESALHFSNGGGVYLARDTRTGARVVLKEARPHAGLAADGADA
VTRLHRERRALERLSGLACTPEVLDHRTVGEHHFLVLEHIDGKPLNTFFA
RRHPLIEADPGERRLAEYTDWALDVHARVERAVAEVHARGVVFNDLHLFN
IMVRDDDSVALLDFEAAHHVDEAGRQIVANPGFVAPPDRRGVAVDRYALA
CLRIVLFLPLTSLLAVDRHKAAHLAEVVAEQFPVDRAFLDAAVEEITRVD
GSTRVDGSTRADETTRADETTRLDVTTRVHGAPDAARRPAGPVAPVRPDD
WPRSRDSMAAAIRASATPSRTDRLFPGDIAQFATAGGGLAFAHGAAGVLY
ALAESGAGRDEDGEQWLLERTKRPPSGMPLGFHDGLAGLAWTLERLGHRD
RALDLAELLLDQPLDHLGPDLHGGTAGLGLALESLAATTGQAALHSAALH
CAELAADGLPGGSVPADRVSRGRARAGLLYGGAGRALLFLRLFERTRDSA
LLDLARDALRQDLARCVRGAGGALQVDEGWRTMPYLGAGSVGIGMVLDDY
LAHRADEEFARAANEIVAAAQAMFYAQPGLYRGVAGMVLHLGRTTATAPG
TGPRAVRRQLDALSWHAMSYRDRLAFPGEQMMRLSMDLSTGTAGCLLAVA
SVLGDAPAGLPFLPPPRRSGGPLTRPHQEP
>gid:402906  SCO6707  putative DNA ligase
MDLPVMPPVKPMLAKSVAKIPPGMHYEAKWDGFRAIVYRDGDEVELGSRT
GKPLTRYFPELVVAVRERLPERCVVDGEIVIARDGHLDFDALTERIHPAD
SRVRTLAERTPASLVAFDLLALDDASLLDVGLADRRALLVRALSGVTPPV
HVAPATDDIEVARRWFEQYEGAGLDGVVAKPLDLRYRQDERAMYKIKHER
TADVVVAGYRFHKSGPVVGSLLLGLYDDRGLLQHVGVSAAFTMKRRAELV
AELEPLRMDDARGHPWAAWAEEAAHESARLPGAPSRWSGKKDLSWVPLRP
ERVVEVAYDHMENGARFRHTARFRRWRPDRTPESCTYAQLEEPVRYDLAE
ILGGP
>gid:402912  SCO6709  conserved hypothetical protein
MAEAVELEAGGRSVRLSSPGKVFFPEHGYTKLDVARYYQAVAPGVLRALR
ERPTTLQRYPDGITGEWFYQKRAPKGMPDWIPTAHITFPSGRSADEMCPT
EEAAVLWAAQYGTLTFHPWPVRRDDVDHPDELRIDLDPQPGTDYDDAARA
AHELRAVLEEFGGLRGWPKTSGGRGIHVFVPIEPRWTFTQVRRAAIAVGR
EMERRMPERVTIKWWKEERGERIFIDYNQTARDRTIASAYSVRPRPNAPV
SAPLTWDEVGVAHPRDFDITTMPARFAELGDVHAGMDDTRYSLDALLELA
TKDEHDHGLGDLPYPPEYPKMPGEPSRVQPSRARKKAPGPD
>gid:402932  SCO6719  UvrA-like ABC transporter
MSEFISITGARENNLQDVTLRIPKGRLTVFTGVSGSGKSSVVFDTIAVES
RRQLNETFTWFVRNRLPKYERPHADALEGLTPAIVVDQRPVGGHSRSTVG
TMTDIHSVLRVLFSRHGTPGAGGATAYSFNDPSGMCPGCDGLGRRVQPDW
DRILDPARSLADGAVRFPPFAAGTWQGQTYTNTEELDTGKPVGDFTAAER
AFLMRGRPGSKVTVSGSGGTWSTEYEGLADRFERLYLKRDLSGMSERTRD
LVRGFLVEARCPDCGGARLNAAALASRIDGHSIADCSRMQITDLIAVLRG
IDDPVALPVAGAAVAALERVEAIGLGYLSLDRETATLSGGEGQRLKTVRH
LGSSLTGMTYIFDEPSVGLHPRDVGRLGDLLLRLRDKGNTVLVVEHDPDV
IALADHVVDMGPRAGADGGRVVFEGTPAGLAASDTLTGRCLGRRTAVKDT
VRAPTGELWVKGAERHNLREVTVAFPTGVLTAVTGVAGSGKSTLVAELTG
AHPDAVVVDQSAIGISARSTPATYLGIMDTVRKVFARETGAEPGFFSFNS
AGACGTCEGRGIIHTDLAFMDPVTTTCHDCEGRRFREEVLRLTVDGRSVA
DVLAMTAGQALGFFSDPGVRRRLRALRDVGLTYLTLGQPLSTLSGGERQR
IKLATRLHRTGAVYVLDEPTTGLHMSDVEGLLALLDRLVDAGNTVVVVEH
NLDVVAHADRVIDLGPDGGRDGGRVIFEGTPRELLAARGSSTAEHLRRAT
RR
>gid:402948  SCO6726  putative endonuclease
MTTVSVQIPAGWPATEERARAVQDELRARVVLDEPGPPPGTGRVTGVDVA
YDDERDVVAAAAVVLDAGTLAVVAEATAVGRISFPYVPGLLAFREIPTVL
AALEALPCPPGLVVCDGYGLAHPRRFGLASHLGVLTGLPTIGVAKNPFTF
THDDPDTPRGSTSPLLAGAEEVGRAVRTRDGVKPVFVSVGHRVGLGNACA
HTLALTPAYRLPETTRRADALCRAALRDAAYRA
>gid:403117  SCO6806  putative phage integrase.
MADKTSVPAGVRLSTDIEYRPDRPTPYRARVRWNDPTSKRRQSLSEGKGS
EEEAQEWLQDIIEAAQAGLSPSVATMKLAEYGEANMDLALRGLELKTLDP
YLSGWRMRVVPALGHLSVRMITNGVVDRTVQNWIADEHSRSTVKNTIAVL
VRVMEQAVRDGIIKVNPARVTGWQKLYKQAEDELLDPRALALPDWETLVQ
LADALVAASHDQYRGWGDVVLFAACTAARIGEVSGCRIGDIDTSQWIWTV
RRQTTPAPGGLTDKGTKGKRARKVPIVEEIRPLVAQRILSAGPDPDARLF
TGPRGGRISTAVLRDATHWDDVVTKLGYEHLRRHDLRHTGLTWFADAGVQ
VHVLRRIAGHGSLTTTQRYLHPDVHKITAAGAALSAHLSVLRAPRSLPVP
IVMTR
>gid:403203  SCO6841  conserved hypothetical protein SC3D9.09
MEESRIPGDRGTRASSVGPHKESHTIFTPPKPLAETIPYLIRVGDLRAEE
RKVSGLKLFRTDTTTRGMTEVIARLAGIEAEVQDLVEIHMETLLGVRFLA
SEYSTGPVHGGRIDSLGLDENGSPVIIEYKRGTDAGVINQGLFYLAWLMD
HRAEFEHLVHVRLGATAASQVLWSGPRLICIAGDFTRYDVHAVREHRRSI
DLVRYRIFGSDLLGLETVASVSGAMQVARRVRRQAVAGAAAGAQGAAMRE
LAGAVDEFLLGLGDGVNRVERKTYRAYQRLRNFACLCPPQRSKLLVYLKV
DPKDVDLVPGFTRDVSGLGHHGTGDLEVQLRTPRDVERAQDLFRASYAAA
>gid:403205  SCO6843  conserved hypothetical protein SC3D9.11c
MPHTPEPDEDDLDAIAPPQPVFHAEQALLGALLLDPHRLDDVSGIAADSF
STAAHAALYTALSTLPRPDPAEHAKNTKWLDRVLTAGREQARGLTATYLH
SLVQVCPWPRHAPAYARMVEAEHARRRLLTAAERLIHTVHDASLPLPVQT
VLTETDTLAAVVDDIADRFPPRSGVLPRTPAPPPVLAPDHTEAVEEEQLL
LATATARPDDIDAVRWLLPDDLTLPLHAGLWQCLTTLARRGEPVDPVTVL
WEAQQRGLLDDGCEPGTVLRMLGEPAGSVGHWGEGALQRSLLATAEHTGR
RIEAYAGDPANTPFQLVVGARRALADIAAVRTRWQNATRAAPPQQRRPAP
AIRGGPPTTTAAHAARSTRATR
>gid:403206  SCO6844  putative DNA methylase.
MILDLFAGPGGWSRAVHVLGMRDIGLEWDQWACKTRAAAGQLTIRCDVAR
YPAWPFIGRTRGVIASPPCQAWSMAGKRLGLVDQPLVHAAVEDLAAGRDT
RERLLSACRDERSLLAAEPMRYLYALNTVGEPDWVAMEEVPDVLPLWKQY
AAVLRRWGFSVWYGILNAADFGVPQTRKRAILLASRVRTAQPPTPTHAQL
AEPESLFGPGRARWVSMAEALGWGATDRPVPTVCAGGGPGGGPEPFPSGS
RKTLSDARKRGTWMPRPDRGVLQSCREGTGWAARHRIRESRAADAPAPTF
TAEPHRWSWSLRSNNQANATVRSIQEPAGTLFFGHRANECTWVAEPATPP
AEEAEVPAVPEPIRITAREAGLLQTFPADYPWAGNKGQQFSQIGNAVPPL
LAGHLLAPHLGVALDPDDFTLAA
>gid:403222  SCO6854  conserved hypothetical protein SC7F9.06c
MKQRQWQGWGPGEQAEQQHYLVQPRALAGGGDIRHVSEFLRASGWRDKSK
TCGPLLMESPDRTVRVAYDPYILPGGWTIHGRTGAKGEWSAHLGLQTPVE
IVAGLTDALTRPRSAHAPSVWEPLQEQNWHTRFADQNYTATSPDGAAWMQ
YRQDADGSAMWWTGAKDEEGNGWTAHFTPNTPMHLVQAFSTELAGPDPVM
RPRGRVPHSAQIRTWSVSVKPSQLSAWQQARITAARAATWARSSTRGTRP
RTTARPHTPTGGARTRR
>gid:403237  SCO6861  protein kinase-like protein.
MPLPSPDSSRVVLVGVADYDHLPPLPGIRNNLAALAEFFTSPQGWGLPPE
HCTVVTNPCQSSEFIDPVQRAAAEATDTLLVYYSGHGHLDDELRYSVSVT
GSRQDQPWTCLPYSWLKSVLIQTRAQRRVVILDSCFSGQAHGLMTDAADA
LRVQVATSGAVVISSARHDLPALAPVGETYTAFTGELLDVLTHGIPGGSP
EISVDAAYAHVKAALAARGRPTPDRTGTDTSGALIIARNPGFRPHTPAAT
LRPVPGTHLRALAERLYPLSADARSSRRHTKPAEAGQRTEPDSAGERYEI
LSPLGRGGMGTVSVAYDRLLDRRVALKSIEIYGGDEPFARQVFWSEVRIA
AALNHPNVAAIYDVVEGKTGRFFVMELVPGTDLSTAIKEGPLSPDQAAEV
VHDVLSGLEAAHQRGVVHCDVKPGNIMLTPNGRIKILDFGVSHVIAEGEH
GPLHRGDGTMVGTPLYMAPETLLGQLPTVAADIYGTGVVFYQLCTGRPPF
PVSHFAEIASLKHAGPPTPPSEIHPAVPATYQTIIMRALERDPKDRYSSA
AEMRAAIAAALYGAN
>gid:403275  SCO6885  putative DNA methylase
MPFSLHQGDALAVLSGLPDGCVDSVITDPPYNSGGRTAKERTSRSAKQKY
TSADVKNDLADFTGENMDQRSYGFWLTQIMTEAHRLTKTGGTALLFTDWR
QLPTTTDAIQAAGWLWRGVLAWHKPQARPQKGRFTQNCEFIVWASKGPID
GSRNPVYLPGMYSASQPSGSQRQHITQKPVEVMRELVKISPEGGTVLDFC
AGSGSTGVAALLEGRDFIGVEKTEHYASIAADRLTETIRATLTQDDVVLA
V
>gid:403311  SCO6907  putative DNA ligase.
MLCRLAETVRLQARSGRDVTTPWMDLAVAGMALPPGVVLDGEAVVVVDGR
VSFEAAQARAASSPARSRELAARHPALLIVWDVLALPTGDVRGRPYEERR
VLMLDVVAGLPEGSPIQAVSATEDVEIAQVWYETLAGSGVDGVVAKSGRS
PYRAGRSSSWKKVRHAETLDAEVVGFTGTASRPRALAVRLPDGRVALSQR
LGAQLAAQVAAALVNNAPAARGRAAGGESFKATEPGLLVEVLAGTTRHAV
VTVTRVR
>gid:403316  SCO6910  insertion element transposase.
MSERKPYPSDLSDARWALIEPTLTAWRNARLERRPTGKPAQVDLRDVFNA
ILYLNRTGIPWKYLPHDFPGHGTVYFYYAAWRDEGIFTQLNYDLTALARV
KEGRKPEPTASVIDTQSVKTSTNVPLTSQGTDAAKKIVGRKRGILTDTIG
LILAVTVTGAGLSENAVGIRLLDQAKRTYPTIVKSWVDTGFKNAVIEHGA
TLGIDVEVVNRNPEKRGFHVVKRRWVVERSIGWIMMHRRLARDYETLTTS
SEAMIHIASIDNLAKRITDETTPTWRGTY
>gid:403665  SCO7074  putative membrane protein
MGGRPPAFDREAYKQRNTVERCINRLKQWRGIATRYEKTATVHLAGLHIA
GIFFWSAR
>gid:403673  SCO7078  putative insertion element transposase (fragment).
SWIVPDGLWETAKPLIPPSKVRPRGGGTQDTPDETLFAAIIYVLVSGCAW
HALPPCFGTSKSTAHRRFLIWSRAGVRGRLHEAVLHRLDDAGLIDVTRAV
LDTAHVRARRGANTQVRAPWTTSPPPAPGEGLWWWPSRGRAGTSS
>gid:404027  SCO7240  putative serine/threonine-protein kinase
MGDGTLIQGRYRLLERIGRGGMGEVWRARDESLGRRIAVKCLKPLGTQHD
HSFTRVLRERFRREARVAAALQHRGVTVVHDFGEWDGVLFLVMELLEGND
LSRLLEDNKGHPLPVADVVDIAEQVASALAYTHEQGIVHRDLKPANIVRT
ADGTVKICDFGIARLGHDAGFTARLTGTGIAMGTPHYMSPEQIGGDEVDR
RSDLYSLGCVLYEMATGVPPFDLGDAWAILVGHRDTEPEPPRTHRAELPR
YLDRIILDLLAKRPEQRPDDAGELGRRITAGRTAPERVPALATARPRTPR
QVGGRAVLPSWTHGMTTGHKATGPGPRATPPDPGAGLSGEWIPRPTGDGT
DGGRTVGPVPDEWPEPASEAAAGLAERHNAALSLGRLGRWEEAGRAHRAV
AAERERLLGPDHPDTLASRYEAAFTLGRTGRAADALRAYKGVAQARIRTL
GADHADTLAVRQEMAYTLGRLGRHTDAHQVYTSVLTARERTMGADHPDTL
RCRHNLAFNLGRLGRTEDAHRMAREVAAARARVLGPGHPDTLVTRYEVAY
ELGRLDRWQEALATYREVAAARARALGPDHPDTFAARYETGVALGRLGRC
AEALTAYEALIADRARAQGADHPETLRARHGRGVNLGRLGRWGEALEEAR
AVCAARGRVLGPDHPDTLVSHREVAVALGWLGRWAAALTEYRLVAAARER
VLGPDRPDTLVARDDEAHCLEQLGRAAEAAELYRRVAALRSVAAR
>gid:404112  SCO7284  putative ribonuclease H
MIERMRERAVAACDGASKGNPGPAGWAWVVADASENPVRWEAGPLGKATN
NIAELTALERLLASTDPDVPLEVRMDSQYAMKAVTTWLPGWKRNGWKTAA
GKPVANRELVVRIDELLDGRSVEFRYVPAHQVDGDRLNDFADRAASQAAV
VQEAAGSALGSPEPPPAPDVPAARRAPRRGSSGAARKGGGGSSARTIKAK
FPGRCLCGRPYAAGEPIAKNDQGWGHPECRTVAAG
>gid:404124  SCO7291  putative serine/threonine protein kinase
MGRAHVSTHELVAGRYRLFEVVQRETNRVCWSGEDATTGRPCLVTRIELP
EGRAGEAARRAPGRVIRTGETMASLCPGRIAPVLDAVVADGMLWTVTEWV
AGVPLGDLLDRRGAFGCARAARVGLELLAVLEAAHTHGVTHGELSPGQVF
VREEGSVLVTGFGLAGATLAPRLTAPAYASPEQARDERIGPAADLWTLGA
ILYTMVEGRPPFRDRGRPEATLKGVDRLPLRTPVRAGPLAQAVTGLLRKN
SRERPTRPVVRAALARALAEDPGTAVTEVTTGPGVRGGYAAARTAGRDRS
RRTVAAGTALAVVTVAVAVLADLAVTDGLPGRGDGAAAGARQRPSASAAP
PTGSAPAAASPGSPSSPAGRPSASASPSKPGVPAPAGFRRYDAPEGFSVA
LPEGWRRLDTASAPGGAYRVVFGASGDPRTLAVTYSRRAGADPVVVWRDD
VEPGLARSDGYRRIGEIRSTTYRGRAAADMEWLVRDDGTRLRTFGRGFLL
GGGRSFSLRWTTPAGDWDDSANERALAAFLGTFRDGAA
>gid:404150  SCO7302  conserved hypothetical protein SC5F8.12c
MTTHHLQGSLFDQTDEVRLGPLDGLRRTRLGPGAWIDLLPGWLSGADALF
ERLAAEVPWRAERREMYERVVDVPRLLAFYGAGDALPHPLLAEARDALSA
HYAEELGEPFTTAGLCHYRDGRDSVAWHGDRIGRGARQDTMVAILSVGAP
RDLLLRPAGGGGETVRRPLGHGDLIVMGGSCQRTWEHCIPKSTRAAGPRI
SVQFRPHGVR
>gid:404236  SCO7345  probable ATP-dependent DNA ligase.
MELPPIPPMLATPGTLPPAGQDTRWAYETKQDGQRVVVYLPGDGSVLLRA
RSGQDITAAYPELAPLATALGGTPAVLDGEVLALDEQGRADFQLLQSRMG
LAHTPARAAHRAAKVPVHLVLFDALHLAGRSLLRLPYTGRRERLTDLGLN
GPSWSTPAALVGHGAQALRATREHGLEGLVCKRLDSVYEPGVRSRAWIKI
RNMRSEDVVVGGWLPGKGRLGGLPGAVLVGQRAAGRLRYVGGVGTGWSAG
ERTELAALLAAAASDVCPFDPVPRVPGARWVVPRLVGEVRYSTRTREGML
RQPSWLRLRPDLAPEESAADLPDDLA
>gid:404281  SCO7355  conserved hypothetical protein SC9H11.09c
MDGEDDGLRDYHGKRDFGRTGEPAGPGAPGEGGAPGGDGPPRFVVQIHDA
STLHFDFRLQVADVLKSWSVPKGPSADPRDKRLAVPTEDHPLEYEEFEGV
IPAGEYGGGTVIVWDHGTYEPLSHDRRGRPVDFAESLAHGHARFRLHGAK
LRGEYALTRFRGGPDGDAGADAAWLLVKTAGGGRGHGTPDPRRARSARTG
RTLARVAAEHGEE
>gid:404375  SCO7402  putative lyase.
MHDPSTAPDDGFGTLFGLDALTPGPPPVAGPRFRDSAAARRLLPVREIHA
EPAAAASPRGRQILARFPDARVVEVDSHWGIPGLHGNEGNVERWVRVKGE
TLVLGERKTLATRPNGRSADWIAPGASNGCAMACAYCYVPRRKGYANPVT
VFTNIERIVAHLGRHIARQGPKREPNQCDTEAWVYDIGENGDCSVDALIC
DNTADLVHAFRQWPTAKASFATKFVNPDLLALDPRGRTRVRFSLMPPDDS
RLLDVRTSPVAERIAAAGDFLEAGYEVHFNLSPVVLRPGWEEAWAQLLRQ
LDDVLPDRVKRQAAAEVIMLTHNLGLHEVNLGWHPRAEEVLWRPEVQQAK
RSENGALNVRYRADVKAGAVARLRALVAAHAPWLRIRYAF
>gid:404641  SCO7522  putative DNA ligase
MSAAPQAERPPRGGADPLSVVGGMIRAMTTPAAVIVDATAYAQAVEDAAH
AAAAYYAGGSSPLDDDAYDRLARGIAAWEAEHPDEVLPDSPTGKVAGGAA
EGDVPHTVPMLSLDNVFSPEEFTVWTASLARRIDREVTRFSVGPKLDGLA
VAARYREGRLTRLITRGDGTAGEDVSHAIGTVEGLPGTLAEPVTVEVRGE
ILMTTAQFEHANEVRIRHGGQPFANPRNAAAGTLRAKERAYTVPMTFFGY
GLLALPGTDAALAGRLEELPYSELMETAAGLGVHTSAGTAVPDVVVGTPE
QVVARVQEIAALRAELPFGIDGIVVKADLAADRRAAGSGSRAPRWAIAYK
LPAVEKITRLLEVEWNVGRTGIIAPRGVLEPVVIEGSTITYATLHNPADI
TRRGLRLGDHVMVHRAGDVIPRIEAPVAHLRTGEEQPIVFPEACPRCGSD
IDTSEERWRCAQGRNCHLVASLAYAAGRDQLDIEGLGTTRVVQLVEAGLV
ADLADLFLLRREQLLALERMGETSTDNLLAALARAKEQPLSRVLCALGVR
GTGRSMSRRIARYFATMDHIRAADAEAMQRVDGIGVEKAPSVVAEIAELA
PLIDRLAAAGVNMTEPGATPPRPADTDGADGATAEAPGDGGPLAGMKVVV
TGAMTGNLERLSRNEMNELIERAGGRSSSSVSKNTSLVVAGEGAGSKRAK
AEQLGVRLATPEEFAVLVAGLLS
>gid:404864  SCO7604  hypothetical protein
MGTAVAMCVHTEEAAVLKWGGSRLLVCAVALACVALVGPSAPGGALSGRS
LPVEAVKDVVPDRVMTWNLCNPCDASNLDRAAEIAAHAPQVIGMQEACAR
DVDRIRDYLEAFHGLVYHVAHGSVLQNWSRCGGAPWNPGGFGQAILSAAP
ITDAVNVEYPDGGSEDRGYLAVTTEVGGRSVRVFNTHLAQRRQEEFRTDQ
VRVLAKEVARHERAIVVGDFNAVPEASELDPMWSLATDTDPQCHPAPGGT
CEPTTDWQSKFDYVFLRGVAPREHRVVRTAHSDHDLLYADLDVT
>gid:405000  SCO7663  conserved hypothetical protein
MTVVDEVRDAVCSIPAGAVASYGDVGQRVGVGARQVGRAMSLLDENVPWW
RVVYTDGTPPSCHGGRAPVLLRDEGTPMRGARVDMTRARHHWTP
>gid:405198  SCO7743  hypothetical protein SC8D11.34c
MVFRHPDGDYAITAIYSVPDDAWYLELDLVAGQRNLVTAIIPDEDPAREP
TVCFNPSAGHTDVPYEVMRWFMHQVDEEIRSSRAWMRLRPELVETVYQLR
QEHMGVIDDDDFPQVLADVRTTVPEEDLPAVLEAAFGRNPDGTTMNHPQT
PQPVGDQGDTL
>gid:405201  SCO7746  putative insertion element transposase
MSVRLVITDAMWDRIKPLMPVEPVRGRRWANRRRTLETIAWKFRTGSPWR
ALPDELGSFQTAHKRLLRWAAYGIWERILAALPAAADGVDDIGWTVSVDS
TVCRAHQHAAGARIKGHHGTISG
>gid:405265  SCO7782  probable IS110 transposase
MFDTEDVGVFLGLDVGKTAHHGHGLTPAGKKVLDKQLPNSEPRLRAVFDK
LAAKFGTVLVIVDQPASIGALPLTVARDAGCKVAYLPGLAMRRIADLYPG
EAKTDAKDAAVIADAARTMAHTLRSLELTDEITAELSVLVGFDQDLAAEA
TRTSNRIRGLLTQFHPSLERVLGPRLDHQAVTWLLERYGSPAALRKAGRR
RLVELVRPKAPRMAQRLIDDIFDALDEQTVVVPGTGTLDIVVPSLASSLT
AVHEQRRALEAQINALLEAHPLSPVLTSMPGVGVRTAAVLLVTVGDGTSF
PTAAHLASYAGLAPTTKSSGTSIHGEHAPRGGNRQLKRAMFLSAFACMNA
DPASRTYYDRQRARGKTHTQALLRLARQRISVLFAMLRDGTFYESRMPAG
VELAA
>gid:405272  SCO7786  hypothetical protein
MDGEPEVVFCMHEFGPLNLVPNPSRQWAERGGKLEDPAREPRRRRRATYN
RYGGVRHLFAALDLARDKLYGYIKPIKKRTQFLEFCRYLRTLYPAQVRIA
VVCDNFSPHLTTRKCRRVGDRATASNVEIAYTPTNSARLNRIEAQFAAPR
YFTLDGTDHTTHKEQGSMIRRYVIWRNRHADDQHLQEVVDRANVA
>gid:405294  SCO7798  putative transposase
MRPPGGGSAGGEALGPSRGGFTTKIHLSADGRCRVLSLLLTAGQRADCTQ
FKPLMKRIRVPRLGPGRPRTTPDSVSADKAYSNRRTRRYLRRHGIRHVIL
EKSIQAGIRLRRGRVGGRPPGFDKERYKERNTVERAINRLKNYRAVPTRY
DKRAYVYLGTNTVAVTIIWLRT
>gid:405296  SCO7799  putative transposase
MQALTARAGHATYTRPHGIRHLFAAYDRHVINGILHRVRTGMQWRYLPER
YGPRKTLYERHRRWPAVRTWERLLQKVQAQVDAAGGLDWDVSVDTTSVRG
HQHAAGVRKAPPPAASKGAAAMAHRSGRSGARLCARLVEVVQEVRRSVPP
EEASPPRST
>gid:405303  SCO7803  putative insertion element transposase
MVHRNAPLTETGRLRLARCVVEDGWPVRRAAERFQVSHTTASRWARRYRQ
LGVTGMSDRSSRPHHQPRRTAAAVEEHVLRLRREHRIGPLRLAVRCGIAA
STAHRILVRHGLPPLAALDRATGEPVRRYERARPGELVHIDVKKLGRIPD
GGGHKTLGRAEGHRSRTNGAGWAYLHTALDDHSRIAYTEDLPDETAPTCA
AFLVRATAYFASLGIRIERVLTDNAWAYSKNTWRNTCRDLDISPRWTRPW
RPQTNGKVERFHRTLLDEWAYQKPYTSDHERREAFTHWLHWYNYHRPHTG
IGGHTPASRGTNLSEQHT
>gid:405352  SCO7827  putative transposase.
MMGSLLLCSVELVSLLPQLSDVHVVSVEVSDAVVAVYARTRSGEPAGCTG
CGRLSEWCHSRYARRLADVTLAGRPLRIDLSVRRLYCENTTCPKVTFTEQ
VPGLTVRYQRRTSRLQSLVEDVGVVLAGRGGSRMLRILGIKLSRVALLSQ
LMRVPLPPLVTPQVLGVDDFALDGGTYGTLLVDATTRLPLTLWEGRDAKQ
LGRWLREHPGIDIVCRDGSLTYRQGITAGAPEAVQVSDRFHLWQGLSRRV
QDIASAHRGCLPAALPTVSEDDHPPVEETTQNAAADSRAGRHAQSLFEAI
QALTCTGRSHSSVARELGLDRRTVRKYARARTWQEVMRRPPRKPSMLDPY
LDYLRQRWDDGQHSAKILHEELQTKGYLGHYQRVKMAVAPLRRGLPIDEP
CERPPSPRETARWITTHPHRRSPHVNERLPRLLDHCPELKLTHDLVRRFA
TMLDNRDAAPLPGWLGDLKKSGLGPLVGFAGALHEDRHAVAQGITTPYNS
GVNEGRITDVKLQKRLMAGRAGVRLLRHRVVLIAHLRRRHADRPTVPPR
>gid:405377  SCO7842  putative transposase.
MVEVGAHVAYRGQTWQVAALQGQQVYLLQEDGTETRLLLGRLFADPGFEV
VGARAPDTVPQWGLFETVPLAAQQRALAWLPHIREVETGWPHPEGSREGQ
AMRPEYDPERWTLAQRDAAKAKELAALGFARVTRTTVERMRHGYRKQGLW
GLVDKRAVPARGRHPTGYADERVVAAVLEALRRQRGRSKGTVKGLQVLVG
QILEDTHGRGVVEMPSRSTFYRLVSVLADPADRPGRPARTATAPARASSA
PVVLRPGEQVQIDTTRLDIMAILEDGSLGRPELTIAVDVATRSILAAVLR
PYSTKAVDAALLLAEMAVPHPARPTWPSALHLSRAEVPYERMLSLDERLE
GAAARPVVVPETIVVDRGKIYLSQGFVAACEMLGVSVQPAPPRRPQAKAV
VERTFGAINDLFCQHVAGHTGSNPQRRGLTTAAETRWTIPQLQDFLDEWI
TCGWQNRPHDGLRHPVLPKTALTPNQMWAALITVSGYVPVPLSGADYLEL
LPVRWQPITERGIRLDYRTYNHDILDPHRGQRSPVASKDGKWEVHHNPHD
ARQIFVRLTDGQLHEIPWIHRDHVHQPFNEAIWRHVQAEVEQRGDRDQHE
ADLADALDQLLRRTRHLAETEQKTRRRRATRSGTAAQLPDLPGQRRPFDA
ETAPAPAPDWSESLDDLISVDTAAQTGTSEMEGASVPPAEAGGYGLWDAE
AEAEQW
>gid:871156  SCP1.124c  hypothetical protein
MTPRHSRPEGAEKRLEEAVRAAKRQRDKAVRQAETTFWTEIAELKQSYRG
AQTDIASVLGVTRDAILKSVNKYAGGQEE
>gid:871239  SCP1.163  probable transposase
MVAEPVRVRRLTDQEGPKLQQIVRRGSTSTVRFRRAMMLLASAGGNRVPV
IAKLVQADEDTVRDVIHRFNEIGLACLDPQWAGDRPRLLSDDDEDFVIQT
ATTRPTKLGQPFTRWSIRKLAAYLRRVDGRVFRIGREALRCLLARRGITF
QRTKTWKESPDPDRDAKLDRIEHVLERFPDRVFAFDEFGPLGIRPAAGSC
WAEQGRPERHPATYHRTHGVRYFHGCYAIGDDTLWGVNRRKKGADNTLAA
LKSIRAARPDGAPIYVILDNLSAHKGARTRAWATKNRVELCFTPTYASWA
NPIAAHFGPLRQFTVANSHHRSHPTQTRALHAYLRWRNKNARHPEVLAAQ
RKERARVRSEKGTRWGGRPITEAA
>gid:870929  SCP1.18c  putative DNA integrase/recombinase
MTAEPATAAVPPRPFGDLSSSSSRSIAKLVVDTWDGGTSTRQQRGQAAGL
LLDYLAGYPGLTWQERWDASPVGQGLADVNDLECRRSPSVKLTTGLRALI
CLRVIQPSLVAFRRHHMTGFAAHFISAQSDPLLDEFAKQVEVHQHTYAHR
TETLFDLCVLLAVQGVALSDVTAPALLHFAQENRRAWSVVDPGNKAGNRL
RGQGVWHVLHAVGHLPPTVPATMRAALMRGQKSVEELVGLYPIRNQAVRG
LLIDYFTRRRADTDYATLKNLVLHLAHHFWEKIERLNPDQADLRISPEIY
AAWRQMIAVKDNGAPRAGADSLVISVRSFYYDLHTWAAQEPERWATWVAP
CPVPPSDLHGLGARRRRINERSADRTRQRQPLLPVLVEHVESRYDHFRQL
LQLAEEAVEGESFTFAGTSYTRVLSEADRKLSRHTAPIPPRLRNNDSGEM
VHIATEEETTFWDWAAVETLRHSGVRVEELCELTHLSIRQYQRPGGEVIA
LLVIAPSKTDRERVIPMSAELFHVIASIIRRHARTGRPIPLVTRYDPHDK
VWSPPMPFLFQRQNGTVPAVFNPGTIQKMIERRCLALAEMHPGFKGLKFT
PHDFRRIFITELVNSGLPIHIGAALLGHLNVQTTRGYVAVFDEDVIRHYQ
EHLEHRRQIRPTDEYRDTSSDEWTEFEEHFDRRKVELGSCGRPYGTPCQH
EHACIRCPMLHVNPKMLARLSELEADLLQRRTQAEGEGWIGEIEGIDLTL
TFLRAKRDDTQRRAQRPAVHLGIPARRRPQESE
>gid:871287  SCP1.190c  putative integrase/recombinase
MPTVVQLPAGKALTVRAAADAFLDSLRNPNTVRSYSTGIGKTAERIGEAR
PLGSVADDEIGEALELLWGEAAVNTWNARRAAVLSWLGWCEEYGYDGPSV
PAWTKRLAVPDSETPARSKMAVDRLIARREVHLREKTLWRMLYETAGRAE
EILSVNIEDLDLAARRCPVKAKGARSKARRRGQAREDFVLETVYWDAGTA
RLLPRLLKGVPAARCSSPTAARPGEGRQPARRVPGHWPGPPLLRSGPCAA
GRAHCRARAGNRLGPARVPPFRSDPPR
>gid:870932  SCP1.19c  putative DNA integrase/recombinase
MSVIAVKESEVDLRHELMEGRAQLPRAGAVVETGEAHPPFMVVNGLGREV
EPVTAYLRDLALSDSSPLTARSYGFGMLRWFRLLWLLDVAWERATEAETA
VLTGWLRTAPNPQRHRRNPQSPPPGTVNPRTGKPYLGPGYAPTTINHALS
VVSGFYAFHAHYGRGPVANPVPVSPQRRRALSHLSPLEPKPVLGRARLRQ
KVTGRPPRSIPDPLWDELFAAMGCERDRALLDCYVSSGARAEELLGVETG
DIHWAKPGLYVITKGTRERELVPISPEACVRLARYLDSTGTPEAGEPVWR
TRHGEDRPLTYWAMRRVVQRAQQNLRTNWTLHDLRHTAAHRMANGGKLTL
PEVQAVLRHANIQTTSRYLAVRVEEIFDALLEHYNQPRPQVSYPTGYATD
DIAAVFGA
>gid:871310  SCP1.205c  hypothetical protein
MPLNWDLVGTGDSEALLRPRDIYAGLASRPWPYLRHEQGEVLDKWFDRRE
QRDVVIKQNTGGGKTAVGLLIAQSTLNEGVGKAVYLTPDNYLAKQVREEA
ARLSIATTDDPYDPAFTTAQAVLVTNYQKLLNGRSAFGVVGDGKEPVDLG
IVVVDDAHAALARTEDQFRLRVPASHEAYGELLELFAEDLRHQSANAWAG
LEDKDPTALAPIPFWAWANKHDEVLKILRPHRTERAFEFVWPLLAEVLHL
CTATATATAIEVRPSCPPIDRITSFVRARRRVYLTATLADDSALVTDFGA
DPELVAQPVTPGSAADLGERMILAPVALNPDLDDEAVRVLARQFADGDQD
GDGIAETDPVNVVVLVPSERAAAAWTRHAHHIWRVDDLEAGVKKLQSGRP
VGLVVLVNKYDGVNLPADACRLLVLDQIPRPLDSVERREAVALADSTVRL
AREVQRIEQGMGRGVRDGEDYCAVLLLGAKLATAIHDARHLALFSPATQA
QLKLSRDIAGQIKGEGLDAVRQALRSCLTRVPQWRERSRRALAEVRYAEH
GTVRPEAVALREAFDFAATGRPQAAAERVQKAVNDIGENDKALRGWLREQ
KAAYLHLTDPVAAQQALAGALADNTFVLRSVHGAAPVRLKAAAVQSRAAA
EFLADRYRDGVRLRLGVQALFEEVVWGDDERSDDAEAAWQSLGEHLGFAS
SRPEKLYGKGPDNLWALTAARQAVIELKTGCTTDTISKKDMDQLGGSVRW
LNDHNPEVEAVPVMLHSSRAADAKATAVPGMRVITPELFEKLKDAVTAYA
AALASSPDRWANEQAVREQLAHHKLTGDRFLTAYAEPVSPS
>gid:871326  SCP1.214  putative transposase
MGRGELTDAAWERIAPLLPGVGGRGRPWRDHRQVINGVLWRLRTGAPWRD
LPERFGPWQTVYERFARWETDGTWARLLEEVQVRDDAVGAVEWTVSVDST
VSRAHQHAAGARKKGRRRGTNWKIRHARRLVRRLAAPAAG
>gid:871328  SCP1.215  putative transposase
MTTKVHLACDGRGLPLAVAVTPGNVNDSTVFDAVLNAVRVPRAGVGRPRR
RPDVVIADKAYSSRAIRQRLRRRGIRAVIPERTDQKANRTRRGRTGGRPP
AFDRDLYKARNVVERCFNRLKQFRAVATRFDKLAARYRAGLHLAALILWL
RKPVSA
>gid:871424  SCP1.259  putative transposon transposase
MLSVVNDDGTTPNGSLIDEIVREGARRMLAAALEAEVNAYVAELADQRDE
AGRRLVVRNGYHQPRKVTTAAGAVEVKAPRVNDKRVDAETGERKRFSSAI
LPPWCRKSPKFSEVLPLLYLHGLSSGDFVPALEQFLGSSAGLSPATVTRL
TQQWQADHAAFMDRYLPEVDYVYVWADGIHLNVRLEEAKACVLVLVGVRA
DGSKELVALKDGYRESAEAWADLMRDCARRGMRAPVLAVGDGALGFWKAL
AEVFPATREQRCWVHKTANVLDAMPKSAQPGAKKAIQDIYNAEDKQHAQA
AVKTFSQLYGAKFPKAVKKITDDQAELLTFFDFPAEHWIHLRTTNPIEST
FATVRLRTKVTRGAGSRTAALAMVFKLVESAQQRWRAVNAPHLVALVRAG
ARFEGGQLIERSARAA
>gid:871452  SCP1.271  hypothetical protein
MGKFRYVVEQTSALLHHFKRLAVRWKRGLELHNAFLSLASSLICWRRLRK
TGS
>gid:871462  SCP1.276  Insertion element IS466S transposase
MSVESLNELVATAFSGISPLVIEDVVDEGERVVVRARTPGSTAVCPACGA
LSERVHGYHWRTVADLPIDGRRVVVRVRVRRLVCPTRGCRHTFREQLPGV
LERYQRRTARLTRQIKAVVKELAGRAGSRLLAKLAIGLSRHTALRTLLRI
ALPTGRVPRVIGVDDFALRRRHRYATVVIDAETHERIDVLPDRTADTLEA
WLRENPGVEVVCRDGSATYAEAIRRALPDAVQVTDRWHLWHNLCETALSE
VKAHSTCWAAVLDTPIYEGPRAQTTLERWHQVHGLLQQRVSLLECARRLQ
LSLNTVKRYARADRPERMLRVPKYRASLVDPYRDHLRRRRAEDPAVPVQH
LFEEIKALGFTGCLNLLHKYINQGRADADRSHISPRRLARMLLTRPENLK
SGHRDLLDQLTAACPEMTHLATSVRTFAQLLKPRPENVDALDHWITQVRA
ADLPHLHAFTRGLERDNDAVIAALTLPYSNGPTEGVNTKTKRIARQMHGR
AGFNLLRHRILLG
>gid:871491  SCP1.291c  hypothetical protein
MPTLSINFDPVVIAHLLKASGDDTVHVTLSLSTGAQTSSSSPVTPHGPLA
ELMQADLIKAGTVLTFHQRRAKRSGRAVVTADGQLIVDGHASPFPSPSKA
AEAVTGNVINGWTLWHVEGVGRTLDDLRRELDSRTSR
>gid:871503  SCP1.296  SCP1.57c
MTGPAAPLPAGERMMQRLVVSAGAASEPAADMADLAEVVRSLMAVQTYRD
TRVRLGGGLFDVVITGSGAQARFVVALLDEIQDGCGPVIEDPCNGWLYWL
VPPGSAGQWAPHSHAVCLGGPHTITLPSLNRAVPPGPFWHRPPASDRLVP
LGPLREALAQLSPEPTPHAALADRLGITL
>gid:871547  SCP1.316  putative transposase
MRHRLYPSDMTDAEWALVEPLLPPPACDTARGGRPEKHPRREIVDAIRYV
VDTGCKWRALPADFPPWRTVWGFMARWAAVGVIGQLRDALAQRIRRDMGR
GPRAVATIIDSQSVKAASTVGKDSRGYDAGKRINGRKRHMVVDTKGLPLM
VMVTPADLHDSAVAKEVLFRLRLTHPEITLVWADSAYAGKLVTWAKKHLN
LTIKTVSRPKDTSGWVLLPRRWVVERSLAWMMNARRHARDYERLIQHSET
LITWAAITVMTKRLTRTGPTGWSKKPKATADSSPQVSPAATAGRPAGSAT
EEPARAAPLSARQPQPAGRF
>gid:871560  SCP1.321  hypothetical protein
MEVVVVGDAQAMAGRVPTVMHVRCPDRLREGVYRQVLELLTDLSPVVQAL
PPTAALVELKGALRYHGSEVHRLGEVLRVRTLSRLGVDVRVGIGPSITIA
ATASAQVPLPGGVLTVAPDDVTDWLAPLPGEALHGLGPRQAGALHEYGIH
TVGLLAAVSPETVQRLLGGKAGRLAADRARGRDPRPVVPRALPAAATVRM
RFDRHVLDGADVRAALLDLVVQLGRLLRRRGQAARAVSLTLQFAGGGSWE
KSRRLPEPSAHDEDLRTMAYRLIDAAGLQRGRLTGLFLKADDLVDADRVA
QQISLDQAREARLVAEATVDRVRDKFGPGVIGPAATARRAS
>gid:871583  SCP1.335  putative DNA integrase/recombinase
MSVIAVKESEVDLRHELMEGRAQLPRAGAVVETGEAHPPFMVVNGLGREV
EPVTAYLRDLALSDSSPLTARSYGFGMLRWFRLLWLLDVAWERATEAETA
VLTGWLRTAPNPQRHRRNPQSPPPGTVNPRTGKPYLGPGYAPTTINHALS
VVSGFYAFHAHYGRGPVANPVPVSPQRRRALSHLSPLEPKPVLGRARLRQ
KVTGRPPRSIPDPLWDELFAAMGCERDRALLDCYVSSGARAEELLGVETG
DIHWAKPGLYVITKGTRERELVPISPEACVRLARYLDSTGTPEAGEPVWR
TRHGEDRPLTYWAMRRVVQRAQQNLRTNWTLHDLRHTAAHRMANGGKLTL
PEVQAVLRHANIQTTSRYLAVRVEEIFDALLEHYNQPRPQVSYPTGYATD
DIAAVFGA
>gid:871586  SCP1.336  putative DNA integrase/recombinase
MTAEPATAAVPPRPFGDLSSSSSRSIAKLVVDTWDGGTSTRQQRGQAAGL
LLDYLAGYPGLTWQERWDASPVGQGLADVNDLECRRSPSVKLTTGLRALI
CLRVIQPSLVAFRRHHMTGFAAHFISAQSDPLLDEFAKQVEVHQHTYAHR
TETLFDLCVLLAVQGVALSDVTAPALLHFAQENRRAWSVVDPGNKAGNRL
RGQGVWHVLHAVGHLPPTVPATMRAALMRGQKSVEELVGLYPIRNQAVRG
LLIDYFTRRRADTDYATLKNLVLHLAHHFWEKIERLNPDQADLRISPEIY
AAWRQMIAVKDNGAPRAGADSLVISVRSFYYDLHTWAAQEPERWATWVAP
CPVPPSDLHGLGARRRRINERSADRTRQRQPLLPVLVEHVESRYDHFRQL
LQLAEEAVEGESFTFAGTSYTRVLSEADRKLSRHTAPIPPRLRNNDSGEM
VHIATEEETTFWDWAAVETLRHSGVRVEELCELTHLSIRQYQRPGGEVIA
LLVIAPSKTDRERVIPMSAELFHVIASIIRRHARTGRPIPLVTRYDPHDK
VWSPPMPFLFQRQNGTVPAVFNPGTIQKMIERRCLALAEMHPGFKGLKFT
PHDFRRIFITELVNSGLPIHIGAALLGHLNVQTTRGYVAVFDEDVIRHYQ
EHLEHRRQIRPTDEYRDTSSDEWTEFEEHFDRRKVELGSCGRPYGTPCQH
EHACIRCPMLHVNPKMLARLSELEADLLQRRTQAEGEGWIGEIEGIDLTL
TFLRAKRDDTQRRAQRPAVHLGIPARRRPQESE
>gid:870956  SCP1.33c  hypothetical protein
MEVVVVGDAQAMAGRVPTVMHVRCPDRLREGVYRQVLELLTDLSPVVQAL
PPTAALVELKGALRYHGSEVHRLGEVLRVRTLSRLGVDVRVGIGPSITIA
ATASAQVPLPGGVLTVAPDDVTDWLAPLPGEALHGLGPRQAGALHEYGIH
TVGLLAAVSPETVQRLLGGKAGRLAADRARGRDPRPVVPRALPAAATVRM
RFDRHVLDGADVRAALLDLVVQLGRLLRRRGQAARAVSLTLQFAGGGSWE
KSRRLPEPSAHDEDLRTMAYRLIDAAGLQRGRLTGLFLKADDLVDADRVA
QQISLDQAREARLVAEATVDRVRDKFGPGVIGPAATARRAS
>gid:870968  SCP1.38c  putative transposase
MRHRLYPSDMTDAEWALVEPLLPPPACDTARGGRPEKHPRREIVDAIRYV
VDTGCKWRALPADFPPWRTVWGFMARWAAVGVIGQLRDALAQRIRRDMGR
GPRAVATIIDSQSVKAASTVGKDSRGYDAGKRINGRKRHMVVDTKGLPLM
VMVTPADLHDSAVAKEVLFRLRLTHPEITLVWADSAYAGKLVTWAKKHLN
LTIKTVSRPKDTSGWVLLPRRWVVERSLAWMMNARRHARDYERLIQHSET
LITWAAITVMTKRLTRTGPTGWSKKPKATADSSPQVSPAATAGRPAGSAT
EEPARAAPLSARQPQPAGRF
>gid:871014  SCP1.57c  hypothetical protein
MTGPAAPLPAGERMMQRLVVSAGAASEPAADMADLAEVVRSLMAVQTYRD
TRVRLGGGLFDVVITGSGAQARFVVALLDEIQDGCGPVIEDPCNGWLYWL
VPPGSAGQWAPHSHAVCLGGPHTITLPSLNRAVPPGPFWHRPPASDRLVP
LGPLREALAQLSPEPTPHAALADRLGITL
>gid:871027  SCP1.62  hypothetical protein
MPTLSINFDPVVIAHLLKASGDDTVHVTLSLSTGAQTSSSSPVTPHGPLA
ELMQADLIKAGTVLTFHQRRAKRSGRAVVTADGQLIVDGHASPFPSPSKA
AEAVTGNVINGWTLWHVEGVGRTLDDLRRELDSRTSR
>gid:871066  SCP1.82  putative transposon Tn5714 transposase
MPLTSRQRRYPSDTTVAEWALIEPLLPVPACRTKTGGRPEKWHRREIVDA
IRYVVDNGIKWRALPADYPPWQTVYYHFARWHRAGVVAFLRDQLRRQIRT
GQGRCPWPVTLIVDSQSVKGAETVSKATRGFDAGKKINGRKRHIAVDTLG
LPVMITVTPADTTDRDAARELLWRLRVMQPQITQIWADSAYAGQLTTWSD
DFLNMTLKTVSRPRGAKGFVVLPRRWKVERTLGWIMKARRNVRDYERLPQ
HSEAHLTWALITLMTRRLTRKGSRPNWSRKR
>gid:871349  dnaE  putative DNA polymerase III, alpha chain (EC 2.7.7.7)
MRLHETALGGARLVADSFVHLHNHTEYSMLDGAQKLKPMFAEVDRQGMPA
VAMSDHGNMFGAYEFQQVAKGFGSVKPIIGIEAYVAPSSRRNRRQEFWGP
GGQRAVSDDGEGSKDVSGGGRFTHMTMWATGAQGLRNLFYMSTEASYTGQ
FPAGKPRMDMELIAEHAEGVVATTGCPSGAVQTRLRLNQYDEARQVASAY
QDIFGKENYFLELMDHGLSIERDVREGLLRLARELSLPLLATNDAHYIHE
SQADAHDNLLCIGVGKNKADEKRFRFSGSGYYLKTAAEMRALFAELPEAC
DNTLLIAERVGSYDEVFDNVDEMPQFPDVPDGETQESWLRKEVLTGLAMR
YGDPVPTEVLERFETEMSVIGPMGFSSYFLVVADICKYARDNGVPVGPGR
GSATGSIVAYATRITELCPLEHGLLFERFLNPERINPPDVDLDFDDRQRD
KMVRYVTEKYGDEYTAMVNTFGKIKAKNAIKDSSRILGYPFSHGERITKA
LPPDIMGKSIPLDGIFDPEHPRHGEAGEIRTMYENEPDVKTVIDTARGVE
GLTRGTGVHAAAVILSKTRLTDRIPLHMRASDGVKITGFDYPSCENMGLV
KMDFLGLRNLGVIDHAIENIRENRGVRLATVDPMDGDSGTVVIPLDDKKT
FELLGRGDTFGVFQLDGGGMRALLKLMEPSRFEDIAAALALYRPGPMAAN
AHTNYALRQNGKQEPDPIHPELKEVLDPILGSTHHLLIFQEQIMAIARIL
AGYTLGGADMLRRAMGKKKPEVLAAEWEKFHDGMKANDYSEEAIKAIWDV
MLPFSGYAFNKSHTAGYGLVSYWTAYLKANYPAEYMAALLTSVGDDKDKA
GIYLADARKLGVTVLPPDVNESVAEFTAVGDDVRFGLRSVRNVGDNVIDT
VITARKTKGKFTSFADFLDKADLPALNKRAVESLIKAGAFDSLGHTRRGL
CAVHETAIDAVVPLKKAAAYGQDDLFAGLGGGTGDDGESGFGLDVRIDDG
EWSRKQLLSTEREMLGMYVSAHPLDGTEHILAAHRDTTIPDLLASGRTEG
TVRLSGLITGVQNKVTKQGNAWAIVNLADRDGTIEVLFFPAAYQLVLGAL
VEDSVISVQGRINDRDGAISIFGQELKVLDVTAAERNGAAPVQLAFPYHR
VNEPMVSELRRIMGAHPGESPVHLFVRGPQKTTVYLLRSTVNAATIASDI
KGSFGQDTWHGVA
>gid:871147  dnaN  putative DNA-polymerase III, beta chain
MKLTIDHTAFAAAVSYAVRSLPARPPVPILAGLLLDASSSGLRISAFDYE
ISADTTAAATVTETGRALVSGRLLSDIVCTVRGDVHLELRGPRLVLKAGS
ARFILPTLPLDEYPALPEPGTITGTLDGPAFAEAVAQVACAVSREESLPT
LTGIGLHHDSEAATLTLYATDRYRFAVRTLSWKDANLPDTSAVIPGRALS
DAAKATADDTTVDLALPTGDAGLFTLRSDLTTTAIRALEGELPKYDALFP
TEFAHTATVEIAPLKAAAQRVALVSAKKDAPIKLTFTADSTLVLEGGTGD
DAQAVDSVDTSLTGGELTIAFNPAFLLDGLNALMAESVQFDFISPTKPAV
LRAHDGDDQALRYLLMPIRLTN