TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Sinorhizobium meliloti 1021, 1021
Gene type: CDS

Number of genes found: 237

Free access
Sort by:

 



# Sinorhizobium meliloti 1021, 1021

>gid:374168  ISRm28.3  PUTATIVE TRANSPOSASE NUMBER 3 FOR INSERTION SEQUENCE ISRM28
MPLRPDPLPQDAAQLSRIILSLDAENADLKARVAFLEGQLFGPKSEKMTA
IDPTQATLELGDLSDIPEAANDDVAPVAEGPRQERRSPSRNIGRLPKHLP
RYEELIEPESKICPCCSFELHCIGTDISEALDIVPAVVRVKRTIRPRYAC
RACESVIVQAPAPARMMDGGMVTTAFAAHIAVSKFAWHLPLHRQAQMLAS
CGVIIDRGTLGAWVTRLRDEQVGPPPQASWPGPERRGSIPRRADWR
>gid:871632  SMb20008  putative DNA ligase protein
MASSAHGKPTKKAAPSSAAKKAGGFPLPVSTAPMEARSATELPGGDAWQY
EPKWDGFRCLAFKSGDEVDIRAKSGKPLGRFFPEIVALLRRLEPDMFVLD
GELVIEVDGRLSFDALQMRLHPAASRVQKLSQETPAKLILFDMLAGTDGT
ILTGEPLQTRRACLEAFAKKNAVAEMLESSPFTLDIKEAERWLTSWEAGA
TDGVVAKRRDGVYECGERAMVKVKRLKTADCVVGGFRYESDSSLVGSLLL
GLYDTEGLLNHVGFTATISDKERPALTNQLEALREPPGFTGKAPGGPSRW
SSERSGEWEPVRPELVVEVRFDHVSNRRFRHGTKLMRWRPDKDPRQCTYQ
QIMPS
>gid:871690  SMb20064  HYPOTHETICAL PROTEIN
MDLASEGYSMPWREVSTMGERRGFVRLPLEEGVNRRELCRRFGISPDMRY
KWLARWEAGDGELADRSRRPHISPMRCNEAVEAEVLAMRDAHRAWGTWAD
DYMLTGKHDTLPYFDASRGRPLLYKCRWASTRPCRMRMMSIRSPRSRK
>gid:871749  SMb20119  HYPOTHETICAL PROTEIN
MTTTNPSRAACRQRRHGGNTLASWYEPHGVRQPRSVAARSHMVSTAFAGI
ADFERSLIVERTSAGRIAAKARGVRFGPSPALSAAQIENARKLIKDEEKT
GPRSLGSWVCIVRPFIGR
>gid:872023  SMb20399  HYPOTHETICAL PROTEIN
MQGDLFGNPPGNLPEGFRYQAGIVPRKLQEALLEALPELPFKPFDFHGYE
GKRRVVSFGWKYDFDTETVRRIDAIPPLLLPLRQLAADFAGLPAEGLQQA
LVTEYDVGAPIGWHRDKAVFGDVVGISLLTSCTFRLRRKRAGRWERTSVI
LEPGSAYLLSGAARSEWEHSIPPVERLRYSVTFRELRSGIGAGGRA
>gid:872147  SMb20528  HYPOTHETICAL PROTEIN
MCEVRDCFLEAADDLDYPAPFVAGLLDPACPTPTLLTGPHSKAADRRFAV
YRNNVTVSLIEALAATFPATRRITGDAFFRAMARFHIREVPPKSPLLFEY
GRDFPDFIERYAYARPMPWLADIARIERAWLDAYHAADAAALLPHVLATA
KPTELANLVFEPHPATGVVRSAYPAVTVFLANRGDGPVGRIEASVPETAL
ITRPSLEVEVRALSRGIDIFVGSLLEGEPFGRAAVAGHACCPNFDLATAI
RVMLEAGAFAAIRHGG
>gid:872160  SMb20541  putative protein in ISRm14
MSSPLDLSLFPNLPPEVVKAFAAMQFELSVERAARQHEQAVVAEKDAFIA
ELKELIEKLEGQVHDYRRTKFGPKSEKLDPAQMELALEDLETAIAETQAR
IAAVEKKIEASASDPDKVAPRKERKARALPEHLPRVERVIEPESIVCPCG
CGNMVRIGEDRTERLDRIPARYEVIVTIRPKYACPKGRTGVVQARAPAHL
LEGSWPTEALLAEIAVSKHSEHMPLNRQAEVMARHGVPIDRTVLADWMGR
TGSEIAPVVDHMAKRLLWESTRLYVDETTAPVLDPGRGKTKTGYLWAVLR
DDRGWNGSAPPGVVFHYRPGRKGEYAAEILDGFNGTIQVDAYGGYSHLAT
LDRVGGDPLKLAFCWAHGRRKLIKATPKSGSPIVDEALVRIAALYKIEDS
IRGSDPEHRRAVRQELSLPLVDAFFAWLAAQAKRVSRKSDLGKALAYMLT
RQDGFRLFLDDGHVDIDSNLVENAIRRPAMNRRNALFAGHDEGGRNWARF
ASLIGTCKMNGVEPYAYLCDLFTRLANGHLAKDIDALMPWAYAARITASQ
>gid:872161  SMb20542  putative protein in ISRm14
MIVAGQRLPILIATRPVDFRCGHQALALMVQTELKLDPHSGVTVIFRSKR
GDRLKILVWDGTGMVLTYKILEHGSFAWPKVQDGTMRLSRGQYEALFEGL
DWRRVMAQRVTAPSAAG
>gid:872163  SMb20543  putative protein in ISRm14
MAGDGFVGRYEVVEPRRGNRRWPDAVKARIVAESLEPGVRVVDVARRHDV
VPHQLSFWRRQAREGILALPFEAMPGLSESGDAEPAFVPLAIAAEPSEAV
NVLAPPLSEAVSSVLTLEIGPDVVLRVPGDVPVERVAALVRAMRAPV
>gid:873024  SMb20665  putative partial transposase of insertion sequence ISRm17 protein
MRQERTVQGSIFDLFAEHEIGRELEAMSQWLDAHRDLLNLVTSDLRRQGV
TETGRQGLPSEAVLRCALLKQYRQLSYEELAFHLEDSASFRAFARLPWGW
SPKKTISAIRADTWEAVNKMLLASARQERLESGRVVRVDSTVTAALIHEP
SDSSLLWDCVRVMVRLLQQADSDTESFTTFLMQGAGRRMRGGRASPNPPR
RTTGAVHFWCAAATNLS
>gid:873046  SMb20685  conserved hypothetical protein
MATPKLEAYRKKRDFSKTPEPVGNLTAGGNRFVVHKHHATADHYDLRLEV
DGVLKSWAVPKGPSLNPADKRLAVETEDHPLDYIDFEGVIPEGEYGGGPM
IVWDKGVWAPMGDVDDDLRKGAFKFRLAGEKLNGGWMLARLKRRPGEEDQ
RNWLLFKERDPAADTSIDILAARPESVKSGRRIEELVEKPKAPPKPVKLN
PGALAGAVKAAQPERIEPQLATQTILPPEAEKDSKKPERWLHEIKFDGYR
TMAHVADGKVRLVTRGGLDWTKRYGDLPEVFRRLPCRDVIIDGEIVVLDE
TGVSRFALLQDALSTGAGNSLVFYAFDLLHLNGWNLFDVPLEKRKALLKQ
LLAGQVSSRSAIQFSDHVLGEGRALYDRASEMGLEGIVSKRISAPYRSGR
SKTWTKTKALKAEDFVIVGYTVSEAAEGIASLALGEWADGELEYRGKVGT
GFDAQMLKELLARLEPLRAGAAKLEGAPREIIWVRPLLRAHIHYGNRTTD
NVLRHAVFKGLRDVELSTPVSMSRKRLISDADLAGISITNPTRRLFGKSG
PTKLDLAVYYAMIGDFMLPHILGRPVSLVRCPTGRTQDCFFQRHPFTGMP
PSVATFQATNSEGEAKTYLSVEDARGYLALAQFGVVEFHNWGTTRKLLGK
PDRVVFDLDPGEGIAWREVVEAAIHIKAELEALALVPFVKTSGGRGVHVV
VPVVPKLDWKKFHQATSALATRLAATAPATFTTTMGKDNRIRRIFIDFHR
NARSHTWAAPYSLRARTNLPASTPLSWADLETIDAPADLNYSSLPGLLAT
SGDPWADIDDFARDLPIL
>gid:873069  SMb20708  putative methylated-DNA--protein-cysteine methyltransferase
MNHYHIFETARGFCGIAWSDAGITRFQLPTRSAESTERLLLRRLPDGAPH
EPPPRVAEAVAAVKDYFEGRETDFSRFDLDLQGQGAFFENIYQAARKIPW
GSTTTYGALAKELGAGPEAARDVGQAMAKNPVALIIPCHRVLAAGGKIGG
FSAPGGSSTKLGMLEMEGVKVAPPEPAQQSLGF
>gid:873070  SMb20709  putative 3-methyladenine DNA glycosylase protein
MHRLTSDVDFFARSAVQVAADLIGADFTVSGVGGTIVETEAYLPDDAASH
SFAGTTARNRAMFGPPAHAYIYLSYGLHWCLNFVCLPGSAVLIRAIEPRW
GIDTMRARRGVREERLLCSGPGRVGQALAISRELDGLPLGEDPFRLTLPS
TKPPLAAGIRVGITKAVEQPWRFGLAGSSFVSRKF
>gid:873132  SMb20780  putative reverse transcriptasematurase of intron RmInt1 protein
MTSESTTDKPFRIEKRRVYEAYKAVKANRGAAGVDGQTLEIFEKDLAANL
YKIWNRMSSGTYFPPPVRAVSIPKKAGGERVLGVPTVSDRIAQMVVKQMI
EPDLDSLFLPDSYGYRPGKSALDAVGVTRQRCWKYDWVLEFDIKGLFDNL
PHDLLLKAVRKDVKCNWALLYIERWLTAPMEKNGEVIERSRGTPQGGVVS
PILANLFLHYAFDLWMTRTHPDLPWCRYADDGLVHCQSEQQAEALKVELS
SRLAACGLQMHPTKTKIVYCKDQRRREAYPNVTFDFLGYQFRPRRVANTQ
WDEFFCGYTPAVSPTALKSMRATIKSLNIPRQTPGTLAEIAKQLNPLLRG
WIAYYGRYSRSALSTLADYVNQKLRAWIRRKFKRFQSHKTRASLFLRKLA
RENPGLFVHWKAFGTNTFT
>gid:872184  SMb20804  putative membrane-anchored protein
MGSWISDARKRIYRNLKYRIMRPDPPAAPFRFNSPVVVVGSAPVSNRPAG
LDESFRIITVNGSQSVIAKWGVDAPDITMMMFNQVEGTTANAIEVRRVLK
GQRTGTLYVFLWRKDDRARLEEGLRAFDYKFDRLEIVDRYERMALLDRVA
DLRSLEMDADSKCSNGMNAVLFALYNGAPAVIVTGINPNSSGHVYNSTGL
TRLHVQMDKVLVSKLISEGRPIFTADPPVSKELGIPLWSGKNR
>gid:872191  SMb20811  putative protein
MQSPGGGSLHFEDDGLRVATLEFLVKVLATKRLRDDPDADLKQDVLSALE
RIPLQFVSSEVERAEIRAKMHAEANAILNWAVEGRTTTISPIAGRPLSAS
HYRQWR
>gid:872206  SMb20826  putative protein, probably encoded by an unidentifed IS element
MSKHSRAFKQSVVEFYLRGNDGYGKVGAEFGIDHSTVLKWVAIYEAHGVA
GLSKKFSSYDAEFKLSVLKRMWEDGLSCRQTAALFNIRNAGCLSDWERRY
EIGGIDALAPRRRGRPRTMPEPPLPKQPQAPQNDETKSRAELLAELNYLR
MENAYLKKLEALTREQPAPRKRKSSRR
>gid:872207  SMb20827  putative transposase, probably encoded by an unidentified IS element protein
MVEMGLKSLVRPKKYRSYKGDAGRAAPDLLRRQFAADAANQKWVTDVTEF
NVAGEKLYLSPVMDLFNGEIIAFETARRPAFKLVEGMLNKALERLKSKDR
PILHSDQGWHYQMPAYQRRLQARNIAESMSRKGNCLDNAAMESFFAVLKS
EFFHPNRFDSIQSLKDGISDYNHQRIKLKLKGLSPVQYRTQSLIS
>gid:872848  SMb20905  putative transposase protein
MQPHGQTSQIRCAGSGWREIAVLCREFGISRKTGHKIISRYNACGLEGLT
DRSRRPYRHANQLPFQIEKLIVRAKQEKPNWGAPKIRERLARLYPDVQTP
AISTVHAVLDRNGLVEHRKRRRNRAQGTPLSQPCRANDLWCADYKGEFML
ADKRYCYPLTITDFASRYLLACEALHTTKEVYAFTVFERVFKEFGLPRAI
RTDNGVPFASPNALFNLSRLSVWWLRLGIDIERIKPGNPQQNGRHERMHL
TLKLETTKPAGANFLQQQARFDDFVEEFNIERPHRALNMACPAECYSVSP
RPYRGLPSLDYPFHDQAVTVTTCGRICYKRKKINLSQVFAGQTVGIKQVE
DHIWLTSFMQYDLGYFDDETCRLEPLQNPFGPKVLPMSQE
>gid:872857  SMb20912  putative ATP-dependent DNA ligase protein
MARASSKKPRDTPPLDPMPARIDPCLASLVDRPPKGPDWAFEVKWDGYRI
AVHIEPGRVRILTRGGYDWTERFPTIVDDARRLAVKTAILDGEAVVLDDK
GRSDFGMLQRALGRLPSAVEAGAIVFYAFDLLYLDGHDLRRLPLRERRRL
LEPLVAAREGAVRLSEELQADGDEFFRVACAHGLEGIIAKHIEKPYRSGR
GEWWQKITCKSRDSFVIVGFEPSTVPGHLGRLLLAAREGDEPVYVGGCGT
GWSNKLSRELRKLLEGMATKSPAVDLRRRAAVSVEPVHVADVEYRAWTDD
GKLRHASFKGIGSGRMTWRLFSGTLVVNSGCTEFTPNAVLESCRLTFVKR
FATRGGDNTPKIYQ
>gid:872759  SMb20988  conserved hypothetical protein
MRKTECGSVASAQFQTGRSKMPKKSAGVLLYRYEAGNLLVLLVHPGGPFW
MNRDLGAWSIPKGEYDADEEPEAAARREFLEETGIAITGPLEFLGEERQK
GGKLVTAYAHESEFDVASLRSNVFETEWPPRSGRMQVFPEVDRAGWFTLE
EARSKISVSQRPFIDRLEASRNANRSAGSAS
>gid:872256  SMb21044  putative ATP-dependent DNA ligase protein
MDLEWRAILRQRAKAPHPREGVPNGAQGRKAAVGSRWTASPVDRLANLGA
TQSAFPAFIEPCHPTLKKRPPDGADWVHEIKLDGYRAQLHINGGKITLYT
RTGLDWTTEFEAIATAAEDLAGHDVVMDGEVTVFGKTGLPDFQALRRELA
KRSSPNLTFQAFDLLHLDGYDLRRVPLIERKRVLRELIGDAAGTIAYVDY
LELEEGEPVYRHACKMGLEGIVSKRKDAPYRSGRQEIWTKTKCTKRDAFP
IVAFVEKLGAQPRRIASLYLGRWEGDRLVYAGKAQTGYTLTAAREVRERL
NPLIIGKSPLSHPINKPKATWVEPQVYAEIDYGGVTDDGLLREPVFKGLQ
DVSRAAGKPKPTTAPASIRVPPANILQLLPEAVVPSKEELANYWTRVADR
ALHFLGGRPLKLVRHIHGTTFYHKGPLPPIPPEVHQLRIEKREGGAGVRL
WVDDLAGLLGLVEIGAVELHPWAASVDDIEHADALIFDLDPGEGVSWAFV
VETALRLREFLEAEGFKTWPKLTGGKGVHLMSPLPAKMTHNAAHAYAKRL
AQQFASTDPDRYVTSAQLSRRPGKLFLDYLRNGRGTTAVGTYSPRARSGF
PVAAPVTWRDIERGIKPDAFTIGRPPKRRAVLASAS
>gid:872266  SMb21045  putative reverse transcriptasematurase protein
MTSESTTDKPFRIEKRRVYEAYKAVKANRGAAGVDGQTLEIFEKDLAANL
YKIWNRMSSGTYFPPPVRAVSIPKKAGGERVLGVPTVSDRIAQMVVKQMI
EPDLDSLFLPDSYGYRPGKSALDAVGVTRQRCWKYDWVLEFDIKGLFDNL
PHDLLLKAVRKDVKCNWALLYIERWLTAPMEKNGEVIERSRGTPQGGVVS
PILANLFLHYAFDLWMTRTHPDLPWCRYADDGLVHCQSEQQAEALKVELS
SRLAACGLQMHPTKTKIVYCKDQRRREAYPNVTFDFLGYQFRPRRVANTQ
RDEFFCGYTPAVSPTALKSMRATIKSLNIPRQTPGTLAEIAKQLNPLLRG
WIAYYGRYSRSALSTLADYVNQKLRAWIRRKFKRFQSHKTRASLFLRKLA
RENPGLFVHWKAFGTNTFT
>gid:872305  SMb21083  hypothetical protein encoded by ORF1 of ISRm14, IS66 family
MADDGFVGRYEVVEPRRGNRRWPDAVKARIVAESLEPGVRVVDVARRHDV
VPHQLSFWRRQAREGILALPFEAMPGLSESGDAEPAFVPLAIAAEPSEAV
NVLAPPLSEAVSSVLTLEIGPDVVLRVPGDVPVERVAALVRAMRAPV
>gid:872306  SMb21084  hypothetical protein encoded by ORF2 of ISRm14, IS66 family
MIVAGQRLPILIATRPVDFRCGHQALALMMQTELKLDPHSGVTVIFRSKR
GDRLKILVWDGTGMVLTYKILEHGSFAWPKVQDGTMRLSRGQYEALFEGL
DWRRVMAQRVTAPSAAG
>gid:872307  SMb21085  hypothetical protein encoded by ORF3 of ISRm14, IS66 family
MSSPLDLSLFPNLPPEVVKAFAAMQFELSVERAARQHEQAVVAEKDAFIA
ELKELVEKLEGQVHDYRRTKFGPKSEKLDPAQMELALEDLETAIAETQAR
IAAVEKKIEASASDPEKVAPRKERKARALPEHLPRVERVIEPESIVCPCG
CGNMVRIGEDRTERLDRVPARYEVIVTIRPKYACPKGRTGVVQARAPAHL
LEGSWPTEALLAEIAVSKHSEHMPLNRQAEVMARHGVPIDRTVLADWMGR
TGAAIAPVVDHMAKRLLWESTRLYVDETTAPVLDPGRGKTKTGYLWAVLR
DDRGWNGSAPPGVVFHYRPGRKGEYAAEILDGFNGTIQVDAYGGYSHLAT
LDRVGGDPLKLAFCWAHGRRKLIKATPKSGSPIVDEALVRIAALYKIEDS
IRGSDPEHRRAVRQDLSLPLVGAFFAWLAAQAKRVSRKSDLGKALAYMLT
RQDGFRLFLDDGHVDIDSNLVENAIRRPAMNRRNALFAGHDEGGRNWARF
ASLIGTCKMNGVEPYAYLCDLFTRLANGHLAKDIDALMPWAYAARITASQ
>gid:872479  SMb21167  putative reverse transcriptasematurase protein
MQEARHQMPARAGRSGGRQGEALSEPGSDEATCPRCDAQSTGPGLLEAAL
ARENLQRAWKRVKANKGAAGADGLSIEATAAHLRTSWPDIRERVLAGTYR
PMPVRRVTIPKPDGGERELGIPTVTDRLIQQALLQVLQPLLDPAFSEHSH
GFRPGRSAHDAVLEAQSYVQSGRRIVVDVDLEKFFDRVNHDILIDRLSKR
ISDRRVIRLIRAYLNSGIMDHGVVQERVMGTPQGGPLSPLLANVLLDEVD
KELERRGHCFVRYADDCNVYVGSRKAGERVMALLRRLYGRLHLTINEGKS
AVTSVFGRKFLGFSFWRGRDGAVKRRVADKPLNAFKRRVRQLTRRSGGRS
MAEVAERLRVYVLGWKAYFRLAQTPRLFKELEEWMRHRVRAIQLKHWKRG
KTTYKALLAKGAKPEAARQIAVNSRRWWRNSGMLLNSVLTLRWMDALRIP
RLA
>gid:872884  SMb21409  HYPOTHETICAL PROTEIN
MNGPGRISPSQIKNTSRSELRRTRPVHLQLCCVVTPFRCLGNGGFHRPGH
RPTTHRLATNPSSTKHLFRLRSRRIPPHRLKRSAELGALGRASLFQPLKL
STSGRKTVTHFPGSALLNVGKSRFLDVRGSEHQELVQQPRRGRARRGIPA
PDAAETVDAPLVVRHGKPEQRTTCQVGLYEWERHVSAGRFPRATALLHAK
IRKPTRAVHEKIPACGLLHAPERVMGCRHWAIDTEPAAMHSRLERRSPRD
RRCR
>gid:872922  SMb21445  putative DNA topoisomerase I protein
MPEETGLVYVSDSEPGIRRQRRGKGFVYRMPDGSIVTDPSIKSRIAALGL
PPAYDNVWICLEERGHLQATGYDARGRKQYRYHSEWQALRSADKFAQLVE
FGKALPKIRRTIRRHMQGDVENMQTVLAALVALLDEAHLRTGNQAYVQAN
GSYGATTLLKRHLRLGDGFIELKFIGKGGKRVQRLLRRPKLQQLLEEIAD
LPGRQLFVWKDENDVLRPVDSGRLNRYLSDVAGTAVSAKTFRTWGGTLAA
FTAARTSIEQGEWPTIKQMSEAAASVLHNTPAISRSSYIHPDVLALADKS
ASISARQLQARGRLDSELRVEEQRLLSFLQRSARSRARQAKSVAEERRAS
S
>gid:872925  SMb21448  putative DNA polymerase related protein
MPKQLAKLGPALAGEHAEATTIASLRRQAEGCERCDLYKNATQLVFGEGP
VDARVVLVGEQPGDREDLAGRPFVGPAGRILDECLHEAGIDRSECYLTNA
VKHFKFEQRGKRRMHSRPNAGEIQACAWWLGAELDELRPEIVVALGATAL
MSLLGRSVGVTRNRGQLLTAPGGFSVLVTIHPSYLLRIRGRSDPEAERAR
FVDDLAKVATRIG
>gid:872951  SMb21477  putative reverse transcriptasematurase protein
MKANKGAAGADGLSIEATAAHLRTAWPGIRERVLAGTYRPMPVRRVTIPK
PDGGERELGIPTVTDRLIQQALLQVLQPLLDPAFSEHSHGFRPGRSAHGA
VLAAQSLVQSGRRIVVDVDLEKFFDRVNHDILIDRLSKRISDKRVIRLIR
AYLNSGIMDHGVVQERVMGTPQGGPLSPLLANVLLDEVDKELERRGHCFV
RYADDCNVYVGSRKAGERVMALLRRLYGRLHLTINEGKSAVTSVFGRKFL
GFSFWRGRDGAVKRRVADKPLNAFKRRVRQLTRRSGGRSMAEVAERLRVY
VLGWKAYFRLAQTPRLFKELEEWMRHRVRAIQLKHWKRGKTTYKALLAKG
AKPEVARQIAVNSRRWWRNSGMLLNSVLTLRWMGALRIPRLA
>gid:372659  SMc00132  HYPOTHETICAL PROTEIN
MTPGPASTAEEIIEHLRALGSDENIAGMARFGIATETALGLSNVQLRRVA
RMVKTDHARALDLWTSGIREARLLAAFTADPKALTLAEARQWANACNSWE
VVDTVADLFVAARLEQVLIPEFAADEREFVRRLAFAMIATAAVHLKKEPD
STLLAWLPLIETHAADERNFVKKSVNWALRQIGKRSPACHGPALALAQKL
AASTDRTARWTGKDALRELTNGKVRERLGLPA
>gid:372583  SMc00190  HYPOTHETICAL TRANSMEMBRANE PROTEIN
MATKKTNESIDEKAFQALEAALKIDFDDLKSALNDETSLEEPVPENVSET
ARQAAGSGEARAARAQQEAPKPARGAEAPRSFASEMAPKQPPLAPANDDT
RKSPAAMLRSLEVRSSRAAIRVAAIISLVWTVAGLGVAHLLYAPGIWQIR
SLGDLAAMPGAIGILLGIALPVMLFFSFAIMIARAQELRSAARSMAEVAM
RLAEPETNAADRVMTVGQAVRREVSAMNEGIERTIARATELEALVHSEVS
ALERSYSENELRVRTLVQELGLEREAIIGHSDRIRTAIAGAHTKLKDDLE
TASEDIASRIAVSGEAFASLIDTRAAALTDKSDHALENLSTMLTTRTDAL
LSGLTTAGVALSNEFDARLDALSDNLTQRGEQLLSQFETRASTLDANTEK
LNAALNERARQLNETLIARTRDLNESLRIGQQAISGGLDDVLSSLNSALD
EKGASFRQSLKSSADDAIMDLDLRGGFFEEKLQTTVGQLASAFDERFHEF
ASAFDKRASQLDTKLMESLHRINETVSGGSEAIGGALDSSVDKINSALSE
QSLTLATALGATQDFIEETIGSRTSELSSLIGNAHNRIESVLSDKTGSLM
GALTEAQERIENGFGQRADALANALTTSERSLTDGLDSRTSAFIEGLQSA
HARIEQTLTGSTDEITSAIAASQHRLDNTLSERTAALSTALTSGASVIES
AVGGTADRLERVLSERGQTITDALSTQTAALDGVLAERATQINSTMSARA
SEMADSLSRHAEDVADSLTFRAMAVAETMTDRVGEIENKLSQSVSEVAEN
LGGRVSLIADTLTDTSARIAEDLSSRVGKISETLAGTSAEIAEALTARTS
EATASLAGKAAEIEQTLTGRASHLRDTLTTTHDQIRSTLDDRITAINLAV
GQGREQLEELLSDQSMAMATTLATSASMLEMSLEERQASIAGAIERSTEA
LAARTSEATASLADKATEIDRTLSGKAAQLRDTLATTHEQLRTTLDDRIN
TINLSVGQGRERLEELLSDQSIAMATTLATSASMLEMSLEERQASIAGAI
DRSAEALDSRMRSTTGEIAERLAETADQISLAADTLTNRVDISISGVNSR
LDETGARIETSLGSLEERIRGSVGDVDAILGETGSRIETSLGSLEERIRG
SVGDVDAILGDTGARIETSLGSLEERIRDSVGSVNAIVDNAGQRIADSLG
ERAGEIDRISEAAATRISTAIETGTGRIEERLGTMDRALNIGLENVNRTI
EGKAAALVTSLRGAVSDATQEIDAEAARSAGLLSKAGADFAGALAASNAE
FASSIEQTASATAARHADLARSVAEAADTATARLASTNSQIATHAKSIQQ
SLTEAEKALDARGQSIRSTLDESTRELNSMLAGRSMELSRLLDEQARPVI
EQYAATGKEAAERIASLTQESADRLRAENAALINAITERTGETLDAISLR
AEETAKAMKMVENRLQSTAMGLIDQLASSNSAIATVIDQASGNLGDMDQR
LEATAAKVSETARQASDMLSTSTRLIEGKVDKLSDISSSTLSQIGGIVGR
FEDHSKVLGQASDLLAAAQSNLVSTLEERQDALRTLSVGLVQRSEEIERT
MRALEGFVDGAFQRAEERSGQVAGNLRSGIQSSFSDVGRLLSNTEQRATE
AAEALRDTLVKASDEAAASVEGVFSQAEERSRQIADRLRSGVETSFADAN
KTLSQVEGRALSASEALRQVMAKTGEEAGQALEGAFASVEERAKDAASRL
RGTIGASVSDVERMLAESGKKSDGVAAQLREAVRQAIEDAIGRFNGATDD
IRRSAGEIRKELDMTREELKRGAFDLPEEAKENAALMRRAVGEQIKALQE
LSEIIGKSSSQLEVAQPVRQQEAPAAPAARVVAPQAAAPQVAAPQPAAPQ
PAPNAALRGSLGIEQATRPLQPARPPATEERAEEGGGWMRDLLRAASREE
EPAAARPRSAESQPVAKAGDSRNPRHVVESLNSLSVDIARAIDHDASVEL
WRRYQRGERDVFTRRLYTLKGQQTFDEIKRKYDREPEFRTAVDRYIADFE
KLLADVARNDPDKRITQTYLTSDTGKVYTMLAHAAGRFS
>gid:370720  SMc00344  CONSERVED HYPOTHETICAL PROTEIN
MKGYVYILASKRNGTLYTGVTRDLPRRLLEHQNDLTPGFTSRYGVKTLVW
FEEFDLLTSAITREKTMKKWPRKWKLNLIEELNPEWEDISHHLHGL
>gid:370763  SMc00378  PUTATIVE EXODEOXYRIBONUCLEASE PROTEIN
MTSFFDSESPSNVAEYSVSELSGSIKRTVEQAFEHVRVRGEISGYRGPHS
SGHAYFALKDDRARIDAVVWKTTFARLKFRPEEGMEVIATGRVTTFPGSS
KYQIVIDSLEPAGAGALMALIEERKRKLAAEGLFDAGRKRPLPFMPRVVG
VVTSPTGAVIRDILHRIADRFPVHVIVWPVRVQGDGACEEIVAAIEGFNA
LQPGGSIPRPDVLIVARGGGSLEDLWCFNDEAMVRAAAASAIPLISAVGH
ETDWTLIDHAADQRAPTPTGAAEMAVPVKADLEAQLASLAARLKGAAARQ
MDNRRQTLRSLARALPSLDQLLALPRRRFDEAAAGLGRGLQMNTANKRRS
FERNAAHLRPELLTARIVDRRQRVLDAVNRAERIVERKVQRGAQRISSAD
ASLRVLPSRLIGQIHRASDRVSSLGRRGDAAIAADLRRMKSALAAQDRVL
QSLSYHSVLQRGFALVRDAAGEPVKQAAAVHPGMALSLEFADGRIAAVAG
EDGTAPQSPKKRPARSGEPTKQGSLF
>gid:370806  SMc00418  CONSERVED HYPOTHETICAL PROTEIN
MKAERPDTRKLKALKRGHIAEYRAALCLMLKGYRIVAMRYRTRLGEIDII
ARRGDLVACVEVKARVSLEDAVFAVTDTAQRRIRAASDLWLSRQGDFHRL
SVRYDIVAVTPWRWPRHLPDAF
>gid:371767  SMc00586  CONSERVED HYPOTHETICAL PROTEIN
MTEILFYHLTESKLEDALPPLLDKSIERGWRVVVQTIDAERRDALDTHLW
VYRDDSFLPHGTDAGEFAAVQPILVVADDGNRNAATVRFVVDGAEPPPVS
EYERVVFMFDGYDQQQLEAARDQWKRLKGEGHNLTYWQQNRDGRWEKKA
>gid:371796  SMc00610  CONSERVED HYPOTHETICAL PROTEIN
MTKNADARFRIIDRKTVWDGFINLEQITIEQEMSDGSTARLVREVHDHGR
AATILLFDPERQVVVLVRQLRLPVFLQGETGYLLEAPAGLLDGEAPEVAI
CREAMEETGYRIETAMHLFDAYMSPGSITERTSFFLGLIDISKKVAAGGG
LAHEGEDIEVLEISFDEAVARIGTGEICDAKTIMLLQWAMLNRASLAK
>gid:371803  SMc00617  CONSERVED HYPOTHETICAL PROTEIN
MKRHLSEQLPLVFGHAPATGRDDLLVSDRLSAAISIVDHWPAWPSPVVII
CGPVGSGKSHLAGIWREKARAEPIHPFAGSNAADIAAEKPVLFEDADRQG
FDDAALFHVINSVRQNGTALLMTSRLWPMSWPVGLPDLKSRLKAATVVEI
GEPDDELLVQVLTKLFADRQLLVDERLVGYVVARMERSLEAAQTIVERID
HLALARGTRPTRALAAEVLEELANTSLSD
>gid:373567  SMc00736  PUTATIVE DNA-3-METHYLADENINE GLYCOSIDASE II PROTEIN
MRIIRTHEDIEAGLAGLVMLDARLEHVIDKAGPVPLRRTDPGYRGLANII
VSQMVSKASAAAIWNRMEASLGEITADAVLALDDDDCRRFGLSRAKADTL
RRVAAAATAGEIDLDAICNEEAVKAIHELTAIKGIGRWTAEVYLLFCAGH
PDVFPAGDVALQNAIGHALGLELRPTASEVDSLAGGWSPWRSVAARLFWA
YYAQEMRRDALPVTP
>gid:371321  SMc00828  CONSERVED HYPOTHETICAL PROTEIN
MNFLHRLASDVQLMLRRPARMQYAALCYRLARKTNALEILVITSRDTGRW
VIPKGWPMQGKQAHEVAEREAYEEAGVKGKVQRAAIGAYVYQKRKDHGLE
ISCKVQVHALEVEDFCKNFPEKGSRRLEWVDYREAAKRVAEPSLKELILD
FGRRVDPDPEQRPKASSN
>gid:371397  SMc00866  HYPOTHETICAL PROTEIN
MNDLSQLKQGVYAPGNSVDVAEALVAGSGIRIDVDRRRGRGAALNISGRF
EQKSREVFDDGWQTLEELPPFKTEVQIEKPKTAITRNESPDISFDRSINP
YRGCEHGCIYCFARPTHAYMGLSAGLDFEAKLFAKPDAPRLLERELAKPD
YKLRPIAIGTNTDPYQPIEKEWRIMRQILEVLKEANHPVMIVTKSAMVTR
DIDLLAPMAEKGLARVGISVTTLDRKLARSMEPRASTPTKRLEALRAISE
AGIPAGVLVAPIIPALNDHEIERVLDSAKSAGASDASYVLLRLPLEVSPL
FRDWLLRNYPDRYRHVMSLIRSMRCGKDYDAEFGKRMKGSGPYAWQIGRR
FELAAKRLGLNLTRRQLRSDLFVPPLGMGVQLSLL
>gid:371353  SMc00902  CONSERVED HYPOTHETICAL PROTEIN
MRIVGGEFRGRTLAAPKSDDIRPTTDRARESLFNILSHAYPEALDGTRVL
DLFAGTGAIGLEALSRGCRQVLFVEQGVEGRGLLRINIEALGLQGRAKIF
RRDATDLGPVGTMEPFHLVFADPPYGKGLGERALSAAARGGWLVPGALAI
LEERADVRPQFSESFESVDERAFGDTLMHFLRFRGV
>gid:371442  SMc00970  PUTATIVE EXODEOXYRIBONUCLEASE PROTEIN
MNNNAQPDVSALSFEQAVEELERIVSALERGDVALDKSIEIYERGEALKK
HCEALLKAAEDRIEKIRLDRAGRPQGVEPLDAE
>gid:372329  SMc01190  PUTATIVE DNA POLYMERASE III, DELTA' SUBUNIT PROTEIN
MTAEQPRVLEGALAPATNSKLFGHQEAEAFLAQSYRSGKGHHAVLIEGPE
GIGKATLAFRFANHILSHPDPADAPERLADPDPASMVTRQLASGASHNLL
HLARPVDEKTGRAKGAITVDEVRRAGKFFGQTSGTGNWRIVIIDPADDLN
RNAANAILKMLEEPPRRSLFLVLTHAPGKLLPTIRSRCLPLRLKPLDTQA
LRQALAHLGLDLAGPDAERVLAAASGSVSEALKLINYGGLDIAGAFDDIL
AGQGPAARKNMHKLADVLSGKDSETIFDFFSSLLSGRIMQLAREAAIAGD
VGRAERFARLSSSVGERLAVSNAYNLDRKQTILSLLDDLKDAL
>gid:372325  SMc01193  CONSERVED HYPOTHETICAL PROTEIN
MLIDTHCHLDFPDFEAERDAIIERARDAGVGQMVTISTRVKRFDTILAIA
ERYPNVFCSVGTHPHNADEELDVTADDLVRLSAHPKVVAIGEAGLDYFYD
NAPRDAQAVGLRRHIAAARTTGLPLVIHSRSADDDMAAILTEESGKGAFP
FLLHCFSSGPDLARIGVELGGYVSFSGILTFPKSEELRDIARTVPRDRMI
VETDAPYLAPKPFRGKRNEPAYVAHTAEVLAQAIGVSREEIAEITTENAF
RIFSKMPRL
>gid:372321  SMc01195  PUTATIVE PARTIAL TRANSPOSASE PROTEIN
MPLCYSQLTLSDRRRLHQLVERKVPVGEIARQLGRHRSTIYRELKRNTFH
DAEFPEYSGYYSGIANDISKERRRRLRKLSRHPQLRELVIEQLKALWSPE
QIAGRLLADGVSAVRVCTETIYRFIYGKEDCALELF
>gid:372053  SMc01279  CONSERVED HYPOTHETICAL PROTEIN
MSDLFAPHEPPEIASARPLADRLRPRTLAEVTGQEHLTGPDGVLTRMIAS
GSLGSMIFWGPPGTGKTTVARLLSGEAGLAFEQISAIFSGVADLKKVFES
ARARRMSGRQTLLFVDEIHRFNRAQQDSFLPVMEDGTVILVGATTENPSF
ELNAALLSRARVLTFKPHDEASLEELLKRAEAAEGKPLPLDDEARASLIR
MADGDGRAVLTLAEEVWRAARREEIFDSAGLQEIVQRRAPVYDKGQDGHY
NLISALHKSVRGSDPDAALYYLCRMFDAGEDPLYIGRRLVRMAVEDIGLA
DPQALAICNAAKDAYDYLGSPEGELALAEACVYLATAPKSNAVYTAYKAA
MRAAKENGSLLPPKHILNAPTKLMKGEGYGDGYRYDHDEPDAFSGQDYFP
EKMGRKTFYDPPERGFEREIRKRLEWWNKLRRDRNS
>gid:371957  SMc01355  CONSERVED HYPOTHETICAL PROTEIN
MAILTMEELAERLPPYQSVAGLDLGTKTIGISVSDLGRRFATPREVIRRV
KFGADAQALLSFAEKEKIAAFIIGLPVNMDGSEGPRCQATRAFVRNMGEK
TDIPFVLWDERLSTVAAERVLIEMDVSRKKRAERIDSAAASFILQGALDR
LASLARGASGDRDAP
>gid:371945  SMc01367  CONSERVED HYPOTHETICAL PROTEIN
MTPEPLGGRAFALEAEAASSPTVRTRDAASIMLLDRSGKDIRVLMGKRHS
AHVFMPDLYVFPGGRRDPGDHRLRFDGDLHPAVLRSLRAGDGRAITEARA
RALALAAARELYEEAGVSLGRTRERQGAVLPFLPDLSNLRYMARAITPPG
LPRRFDTRFFAVFADEAEIDPSLVAESRELQDLQWIDVNDFSALRVPEIT
AIILSDLRNDLMSDPSLPPERTVPFYFTRHGRFHRTLL
>gid:372913  SMc01414  HYPOTHETICAL PROTEIN
MRTCGRRLAARLTSFVGWPAKSALAPGGRILTLKVMQFAPQQDEALKAVS
QWLKGGRSQLFRLFGYAGTGKTTLARHFAENVDGEVVFAAFTGKAAQVLR
SKGATNAKTIHSLIYRPRGEEEVEDEETGKTSIAPMFAINRQSPVAKAAL
IIVDECSMVDEALGKDLMSFGTPILVLGDPGQLPPVSGGGYFTNHEPDFL
LTDIHRQARDNPIIQLAMQVREGKEIMHGDYGTAQVISKGQVTQPLVLEA
DQVLVGTNRTRRRYNQRLRELKGFTSEYPQSGDKLVCLRNDPAKGLLNGS
LWQVMSSSKETVKPGINLMIRPEDDDMDRGAAKIKLLKAAFEDVEGEIPW
STRKRYDEFDYGYALTVHKAQGSQWNNVVLFDESWAFRDTRERWLYTAIT
RAAERLTIVR
>gid:373167  SMc01749  HYPOTHETICAL PROTEIN
MLLVAGRPGHGKTTLGLQLLLDAARDGRKAVFFTLEFTEQQARRHLRSLD
EGRHGLCDKLQILTSDDISADYIIRHLSGSERGTVAVIDYLQILDQQRSK
PALSDQVLALGDFARQTGVVFGFISQVDRSFDPESKRLPDIRDIRLPNLV
DLRLFNKAYFLHNGEARLQDVA
>gid:371894  SMc01909  CONSERVED HYPOTHETICAL PROTEIN
MVRPAMPFRFVHTADLHLDSPLRSLALRNAELAGLVRSATRNALVRIVDL
CIAESVDALLIAGDLYDGSQTSMNTALFLAGELRRLDEAGIRTFIIRGNH
DSQSQVTRELTLPPSVHVFAGRSRAVHVKTLANGRTVHVHGVSFADPHAP
ESLLPQFHPPVAGGINIGMLHTSLSGSAAHDPYAPCSVAELQRHGFDYWA
LGHVHQRQVHCEKPFIVMPGMPQGRDINEAGTKGVTVVTIDDEGLVTLDE
RPTGTAAFERHEIDVSGIADWRDMLDAVTSELILLRKNMAADNLILRLTL
TGATPLAWRLRRDADLLEAEIVNIAAGIGGCWIENTEMSCQATGNAEQNA
ADPVGELAALVENDVLPSFGFRAELSGVAKELLQQLPPELRQVLAGDEEA
LERLALEAALAGSADVLAHLHGRAGAGGTD
>gid:372211  SMc02088  CONSERVED HYPOTHETICAL PROTEIN
MAGYVYIVTNHKRGTLYIGVTSDLERRIFEHREGSTPGFASKYGCNRLVW
YEEHLQIGTAIQREKSLKRWYREWKIELIEKMNPDWRDLYFQLW
>gid:371010  SMc02152  PUTATIVE HELICASE PROTEIN
MKLEEIKSGQSLSGIEPSQIVTVVAIVSLGEGAVQLIYRTPEGSMKERFL
SRGDEPSVEVATVERPFSFDGDGASFQLACEAKRIDLAFLFDPMMAVHSS
NVEPLPHQITAVYESLLPRQPLRFVLADDPGAGKTIMAGLYIRELIMRAD
SHRILIVAPGSLVEQWRDELHEKFGLEFYVYSSLLEQTSPSGNPFEDYPR
LIVRLDQISRNEELQDKLCAPGWDLAVFDEAHKLSAHYFGSKLEKTGRFR
FAEKLGAHVRHLLLMTATPHNGKEEDFQLFLSLLDSDRFYGKFRDGVHKV
DASDLMRRMVKEELVKFDGTPLFPERKAYTVNYELSPIEAALYEAVTNYV
QTEMGKADQLEGARKGSVGFALTALQRRLASSPEAIFQSLKRRRERLENR
LRDEKLGIKGRNALAETYATAPEDEDELSAEEQESLEENLIDDATAAKTV
AELEAEIVILKGLEAQAKSVVASGQDRKWDELSRILQNNPEMRDASGRQR
KIIIFSEHRDTLNYLQARIAGVLGNPDAIVTIHGGTHRDERRRLQALFRS
DLDVRVLVATDAAGEGVNLQNANLMVNYDLPWNPNRLEQRFGRIHRIGQT
EVCHLWNLVAKETREGDVYHRLLLKLEVESQALHGRVFDILGEVFEETSL
KDLLVEAIRYGDRPDVRARLTQQIDKALDHDHLKSLLNRNALAQETMSPE
RLFAVKEEMERAEARRLQPFFVRAFFSKALDALGGTAHPREAGRYEISHV
PSTIRERDRRLTGRNRRGHEPVLRRYSRICFERQAIQPLDKAGLERAVLM
HPGHPLMLAMSDMILEQHTNLLRQGSILVDPSDEGIDPALLFLLTHEIKS
GDNTVLSKRLQFVRVGPDGQATFAGWAPHLDLEPLPDSERPLFKDVLDAP
WLSSGQEARALSLAATTLVPEHYGEVAQRRIEHVEKTLNAVHERLSKEIA
FWQDRWMKLKDDAEAGKDVRLNLQNVERTLGDLQGRLDNRKKELQAMRHV
VNGTPVVVGAALIVPAGLMNKLRGDEPADPLAAAFAADAAARSRIEHLAL
WAVRQAEEARGCRVVDVSAAKCGWDLTSYPPPVDGRQPDPKHIEVKGRVK
GATTITITRNEMLYAFNQGDKFVLAIAMVGEDGAIDGPHYIPSPFDREPG
WGVASINFHLSDLLAKADAR
>gid:371007  SMc02154  HYPOTHETICAL PROTEIN
MMTANVRTPKKLIEVALPLDAINEAAAREKSIRHGHPSTLHLWWARRPLA
AARAVIFAQMVNDPSWKWELEHPGEIPPNNIKASWAASRNRLFAIIKDLV
EWENTTNEVVLEKARAEIRKSWRETCDLNKDHPQAAGLFNPERLPAFHDP
FAGGGALPLEAQRLGLASYASDLNPVAVLINKATIEIPPKFAGRPPVNPE
ARTSRDAWSKQWFRAQGLAEDVRYYGRWMRMVAQKRIGHLYPPVEITSDM
AKERPDLRPLVGQQLTVIAWLWARTVKSPNPAFRHVDVPLASSFVLSTKT
GAEAYVDPVIDKDTYHFAVRSGRAPSQAREGTKFSRGNFRCLLSQAPIDG
DYIKAEAKAGRMGERLMAIVAEGRNGRIYLSPSSDQENIASAAHPDWKPD
VEFFQQALGFRVGNYGMTKWSDLFTARQLVALTTFTELIAEVRKQIVADA
IAAGMIDDQTGLDKGGEGASAYAEAVSVYLAIALSRLTDICNALCRWEVT
KTQVRNLFSRQAIPMLWDFAENNVFGGAAGDYIISLGNMVKALEKLPARD
PGVSRQQDAQTQSISAGKAISTDPPYYDNIGYADLSDFFYVWLREPLRSI
YPGLFATVVTPKAEELVATPARHGGSEAAELFFLGGMTAAMQRLAELAHP
STPVTIYYAFKQSETESDTGTSSTGWETFLDAVIRSGLALTGTWPMRTEL
GNRMRGQDSNALASSIVMVCRPRPATAETVSRRAFLRELNQVLPEALDEM
TRGSGDDRSPVAPVDLSQAIIGPGMAVFSKYAAVLEADGTPMTVQAALRL
INRFLAEDDFDHDSQFCLHWFEQYGWKEGRFGEADTLARAKGTSVDGVKQ
SGVIYASGGIVRLLKWAEYPSDWDPVGDGRLPVWEALHHLIRIFKSEGES
GAGNVLAAVAAKAEPARQLAYRLYTLCERAGWAEDARAYNDIITSWSAIE
SAAAAAPKAQQGNLFG
>gid:370998  SMc02187  PUTATIVE INTEGRASE DNA PROTEIN
MPRPLYKLTAVAVKNAPPGKYSDGGGLWLHKREDGGAQWILRVNVHGRRR
EMGLGSVSEVSLKEAREAAERWRSLVRGGLDPIKERQRQRREAARNLHCL
DDIARDAFESRKAELKGDGVAGRWFSPLEIHVLPRLGKVPVVDIDQTDIR
DVLAPIWHSKAETARKALNRLAICLKHAAALGLSVDLQATDKARALLGKQ
RHKPENIPAMPWRDVPIFYASLSDGTVTHLALRLLILTGVRSAPLRFLRE
EQINGDVWTIPGEAMKGRRGATADFRVPLSDAALEVIEQARRHAREGFLF
PSVKKGVISDATMSRLMERAGMAARPHGFRSSLRDWIAEVTEASHDVAET
TLGHIIGGAVERAYRRTDFLEQRRALMVRWARHVTGQRGQVIALGKRSRN
DA
>gid:371047  SMc02230  HYPOTHETICAL TRANSMEMBRANE PROTEIN
MNGSRSTSQRMGSPSYGDRPSLDALNRTIEGLEARIEGLMSSVSRETRQP
ERPRPAPSEAVSEIIDRQRALNAARERSPLRERAAPSESDRLAPTRLAAE
ARYQPPQAAAAPSGRSSAAADIAEALVGLRHELKRDLDDGLSREMHALRS
EIRGIKAEAAQDRRFAEDVRHDIERLSAGIKELGRQASPAEADALRIEFD
DLRSMLEGLAREDSMRRMENRWTGVEDRLNAFDQNRDDELVALAYRLDEI
KAQINSLDRGAGVEVLESKLVAVAQAIEMLGRQIQPDERRLAPQFADLDK
RLDEISRAIAAGSRNVAGTDGSFVDRLENRLGDLSRQIDTLSNPVDSGLG
ARIEALAARVEDLAGEKAAARLEERLDQLSAVLEHNRPGAEPDLTDRLAD
ISRKIEALSGDSITDALAERLDDLARRIDGLAGNVEPSADTRFDRLEDRL
AGIAQRLEETHAAPFDDRQALRNLEAQIGNLSTLISQSHGETAGVPAEFE
SRMNTLEDYLATSDEYIIEAARQAAEAVMEAYSRNTAPQMAASTDMAAIS
ALAEDLRTLEELSRSSDERTARTFEALHETLVHIAEKLERLEEREPPASH
APMPKAAPLEAAGTLTKAERDEDVERAAHALAAESETSIHAPVAAAAEAN
DAAEVAMVEDGEIKVEARAAKTGLLAGLTRRFAPKQNEGMPIQARQMVEP
APSIDPSEMLAPEDANQLLEPGSGVPDVKKILERVRAGQIARGAQEANEG
EKADFIAAARRAAQLAVEESDTLNRVKESKAPSAVGGALARHRRPILIAV
GAVLLAIMSYPLVSTVLKGKEAPSAPPVAAIERRAEVAEPKAAVDRVVQT
AATAKIGEEPRAPAGAEKTDAPAIAAKQPPTQPAAAEAGKIDEAIGGAGD
GQAALMRPAEAQTPGPISTLQPSSESSAAEPAKMSEIVLPEGFGPPALVT
AAKGGDPLAF
>gid:371113  SMc02287  PUTATIVE DNA-INVERTASE PROTEIN
MIVGYARVSSTGQDHQTQVDRLKAVGCEKVFSEKLSGLDRDRPELQKALE
FVREGDVLVITKLDRLARSASHLHQIVDGLTAKGVGFKVLDDSGIDTTTR
SGKLLFGILASFAEFETALRRERQLDGIAKAKAEGKTGGRPALITSELKA
RIATMKKEGASVRQIAAQVGFSKSAVQKVIEETR
>gid:371120  SMc02297  PUTATIVE INTEGRASE/RESOLVASE RECOMBINASE PROTEIN
MDRRFAEHRSRRLEPLLPELSDAQIKHIEEVYYSHLLDEDDDLRLDGMDE
QEFAYYQELIHTMETLDRQGIARGLPPVYARSEAEEVLTWDSVDLKLDPA
SPSWPRLIRAILSASVRAAIAKRQRNEGTPVETPAMPQGKMTTSSPAAST
IIEAWVTEKSRAKGGWTPATAKANQLWAERFIAMAGDRPMSEYTKADARE
FKSALLALPPNWTKIKELEKLSMKEAGRRATTLGLKPMAGKNVNKVLGFV
RAFWNWAEANYDAIPSNPFDGLNVKISGKARDERQPFTFAELSAIFKSPI
YTGCQSVRYHNSPGDLIPNDNGVYWVPLVGLFTGCRSGEIIQLRTEDVKI
EGGIAYIEVTDDGEDLSLKNSGSRRRIPVHRTLKEIGFLRFAERQRKLGH
KRLFPDFPKTKDGTYSTAYSRKFSNLLKALEVKHDKISFHSFRHSFEDAC
RNSRIPLDFINALQGHSQQGMAGRYGNGLYGLQLLNEEMEKLRYEGLDLS
HLSKAALANKAVA
>gid:371129  SMc02303  PUTATIVE PARTIAL TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM1
MAEAFVKTFKRDYVAVNPTPDAETVMAQLPFWFEHYNNLHPHSALGYQSP
REFISSQSQT
>gid:371619  SMc02375  CONSERVED HYPOTHETICAL PROTEIN
MISANDLSRSELAALLAFHAEAGVEWLLEEEPVDRLAEFEAMKSRRAQSR
AVPQTGATAAEPQQSAPPAAARERRPAPPAPPPVVAIPDEQAIKEAQFVA
DAARSLGELRTAMEAFAGCNLRNSARNLVFAEGSAASGVMIIGPMPYADD
DRDGRPFAGRHGEMLERMLSGIGLSREDVLLANAVPWRPPGNRVPSAREA
DICRPFIERQIALAEPKQLLLLGNFTARFFFGGETIHQLRGEWRELTFGG
TSIPTLATLHPQDLVAAPINKRFAWLDLLAFKSRLG
>gid:371643  SMc02397  PUTATIVE INTEGRASE PROTEIN
MTSDAEKQVGDLGFPDMGSGYEVNYTFGRQGPIGVVKETTLAELLQRYAD
ILWDEGRHKYNVKAFIGELDEILLGARFSAFSQELLDGLIGALRKRGNSN
ATINRKMAALGKLLRKAYKMGDIHNLPEFKRQKEKAGRLRFLEADEEEVL
FREIASRSELYLYLSIFLVDTGARLGEAIALKWNDIHEGRATFWVTKSGR
SRTVPLTVRAKDALKRVADRSPGPFSRIDQQKYRAVWNAAKVDAGLGNED
DLVPHILRHTCASRLVRGGIDLRRVQMWLGHQTLEMTMRYAHLASHDLDM
CVPVLERHLK
>gid:374068  SMc02489  PROBABLE INTEGRASE/RECOMBINASE PROTEIN
MLRAWISELTALMKPPGPMMNELLAIGHPEVMAERRRWLASLAEERRLSE
KTVDAYERDTRQFLTFLTGHLAGPPRLSDICALRPADLRGFLAQRRKGGA
GARTLGRGLAGLRSFLRYLERNGLANAAGAGAVRSPKQPKSLPKALTDRE
ALKVVTADAQLAEEPWIAARNAAVLTLLYGCGLRIAEALDLTPADFSGPV
TSLRVTGKGGKTRIVPMIAAAAEAVETYRKLCPYHIEPEEPIFRGARGAK
LQPAIIQREMQKLRAALGLPDSATPHALRHSFATHLLAGGGDLRTIQELL
GHASLSTTQVYTGVDSARLLEIYDRAHPRA
>gid:371683  SMc02635  HYPOTHETICAL PROTEIN
MLRFPRRVYASFMQWSDEAIILGIRRHGESSVIAEVMTPGHGRHLGLVRS
GRSRTMQPVLQPGNSVEVSWRARLDEHLGEFRVEPLQLRAASLIETATSV
YGIQALGALLRLLPERDPHPHLYEALAVIVDHLQDPADAGELFVRFELAV
LNDLGFGLDLSRCGATGARSELVYVSPKSGRAICREAGAPYADRMLALPD
FLSGGSRAADHESLAAAFRLTAYFLNRHVYEPRGVDAASARDGFVHATLK
ALKSASSAA
>gid:373244  SMc02701  CONSERVED HYPOTHETICAL PROTEIN
MSSHPYSADEFRRRALQQAGGPIETSWRDHGDFLLNPGIVSYLESLHLKD
AAVLVPVVDDGEDASVIFTQRTSNLRKHSGQVAFPGGAVDPEDHSIEVAA
LREAEEEIGLDPRFVETVARLPHYMAMSGFRITPVLAVVQPGFVLEPNPE
EVESVFEVPLSFLMNPRNHGRGSSHWQGAERHFYRMPYGERNIWGITAGI
VRMLYERLYA
>gid:373255  SMc02711  HYPOTHETICAL PROTEIN
MDESRGPEMPSWRLRFLTRFAHVYFALARGMTLGVRAACFDDEGRVFLVR
HSYLPGWHLPGGGLDRNETAVEGLARELREEGNLVLTTAPLLFQVYYNRR
TSKRDHVIFFRCDNVRQERPKRADLEIAAAGFFPLEDLPADTTPATRRRL
AELAGDVVPDRFW
>gid:370427  SMc02759  HYPOTHETICAL PROTEIN
MPGHPNVFTIPAGLPFLRTVAERLCNGELTPGFRYDPAQPLALAGVTIFV
PTRRSARVLRSEFVDLLGGRSAILPMIRALGETDDDSGFFDAEVPAILDL
APPLSGTARLIELGRLILAWRNRLPQVVLDIHAESPLIAPASPADAIWLA
RNLAELIDAIETEELDWDALDGLDGGEHALWWQLTLAFLKIARTYWPERL
AELKHSSPARHRNAVLKAETQRIAAGKVSGPIIIAGSTGSIPATAALIAA
VKTLPNGTIVLPGLDGSMSDAEWRLIAGETSGSGTSARNPASRTHPQYGF
HRLLKRMGIELGDVPALASAADDLDYRSAVLSRALLPADATDVWMEARRS
FDAERLLSAFADVALIEAANEREEATAIAIALRLALEADEESQAALITPD
RGLARRVGTELARFGIEADDSAGIPLSATPAGALARLLLEATLRPDDPVA
LVALLKHPLARFGRTADDARRAADVLELAALRGGTDAADVSALEAVLDKA
LDRQKTDRHPPPWRAGIGPDDIAMARALARRIASAVGPLSSNLSADPQTG
RHRSTVLPLSDWAERTGRALEAAAIDEQGNLGALWASEAGEILATLLRGI
IETDGQMEADGPQWCDILEALAASEAVKPRSMRHPRVFIFGALESRLQSV
DLVVLGGMNEGTWPGQTSNDPFLSRTMKAGIGLEPPERRIGQLAHDFQMA
CGTRRLILSRSMRQGSAPTVASRWLQRLQALGGERLTALLKANGADYLHW
MRILDQGERQPLSERPEPKPPAELQPRKYSFSEVTRLRRDPYAVYARRIL
RLEPIQPFNRDPGAAERGLLYHRIVDRFVKGGFDPAAREGEEAMIRLTDE
AFDEEKLPAHIDTIWRPRFQAVGRAFLEWERERRHGIVKSFTEVPAAMDL
GIGDIRLTGIADRLDRLADGTIDIIDYKTGSSPSPKEARALLDPQLALEA
AALRAGAFRVIGPAQPHSLRYVRLKPGSRFAVDTVNNEGAGSKETKSTGQ
LADESLAELRKLLSALMSGRFGFASRLIVQKQRDYGGEYDHLARVAEWAT
ADGEDDDEE
>gid:370426  SMc02760  PUTATIVE ATP-DEPENDENT NUCLEASE/HELICASE PROTEIN
MRSDATGPEERSPEGWLDWTTQRQALASDPACSAWVSANAGSGKTHVLTQ
RVIRLLLAGCRPSAILCLTYTKAAASEMSNRVFEKLAEWATLDDTTLEKR
IEAIEGKRPPTAKIQEARRLFARALETPGGLKIQTIHAFCEALLHQFPLE
ANVAGHFSVLDDRAAAVLLADARRALLTATAAASDGELAEAFATVLDLAD
DTGLEKLLAAIVANRAPIQAFLDHASGRGGMEAHLRAALGLEPGETAGTV
MAAVWPLAGLNGPALDDYIDLGLRLGGAKPSAIADGLRAVRAIDDAATRY
SKLVELFFNGGGKPKAESAFLNAAMRRAAPQLELRVEEARSHMLACVDRL
SIVQMYGATRAALVLAERLNRDYEALKKARSQLDFEDLIHRTAALLARSD
VGAWVHYKLDQGIDHILVDEAQDTSPAQWTIIQSLAADFFAGETARADDR
TIFAVGDEKQSIYSFQGARPERFSRESTLTERRVRAGNKHFSPIRLQLSF
RSTVDVLSAVDTVFANPGNARGLSARSEAIVHASNRIGQPGAVDLWDVIA
PEPAASEEDWTAPFDATPERAPVNILARRIAAVLEDWIGRETVIEKGVRR
AMRPGDVIVLVRKRDAFVNALTRALKRRGNIPVAGADRLVLTSHIAVQDL
IALGRFVLLPEDDLSLAALLKSPLLDLGEEDVFELAARRTEGESLWRRLR
QAGAEETSRYHEAVRTLSRYSGLARELLPHDFYARVLGADGGRRAFLARL
GSEVSDILDEFLTFALDHERNGLPGLQAFISTLEIEAPTVKREQDKERDE
VRVMTVHAAKGLEAPVVFLVDGGGEAFVRQQVSDLRFLEKAQVDHSTLTV
PVWRAPGSAPNSLIAADNERLKKLAEEEYRRLLYVGMTRAADRLIVCGYR
GQRQNTDTWHSMVQGTLAQDLKGRGTPRLFRAGSEEWQGIAWREAHVPRD
LPTSEGRSEESQRPTAGLPTALFTPPAAPRRLPRPLAPSGTTIAIDDPDA
EAIVGSALFAGKSAPKFSMLRGAILHRLLQVLPSVDGPERLAAAERYLAR
SVPRWPEAERRALAGTVMDVLDHADLQPLFGEHSRAEVSVMGTLKLGVRE
FAVSGRIDRLAVTGDMVTIADYKTNREIPETPEEIAPVYRNQLAIYRELL
KPLYPGKRFRCVLIFTEGPAIRVLPEPMLDRSLEELATK
>gid:374432  SMc02802  CONSERVED HYPOTHETICAL PROTEIN
MSEVKSHEFDSFLQRSAAIYRLFLLYGPDRGLVSERASELAGSFGIPLDD
PFAVVKLDAATLQGAGSVLDEVNAIGLFGGDKLVWVRGASAEKALTEALQ
ILAADPPAGARLIVEAGDLKKGAALRKVGETSRSVASIACYSDDIRGLQA
LVDTELSEAGLRIGPAARERLVEALGGDRMASRNELRKLALYCRGKDMVD
EDDVLGIVGDASAISVDDAVDAVLKGDADALLHAMKKITTSKTPPFLVLQ
ACLRQFQQLDVMRAEMDASRQSAGQVVASLGRGLHFRRKPVVEAALKHWT
GPAIRRELGRLQATIYQSRSRQSLEESLVIQNLLAITIQSARR
>gid:370594  SMc02841  PUTATIVE METHYLATED-DNA--PROTEIN-CYSTEINE METHYLTRANSFERASE TRANSCRIPTION REGULATOR
MNVVTSIPTDITPEGTDYDTVRRVIAMLTEDYREQPSLESLARRLGQSPT
QLQKVFTRWAGLSPKAFLQAVTLDHAKRLLRQEDMPLLETSIEVGLSGPS
RLHDLFVTHEAMSPGEWKARGAGLTIRYGFHPSPFGTALVMVTDRGLAGL
AFADSGEERASFEDMAARWPNATYVEDSAATARYAARIFDPDRWCAEEPL
RIFLIGSDFQIRVWQTLLQIPLGKATTYSRIAENLGQPTASRAVGAAVGR
NPISFVVPCHRALGKTGDLTGYHWGLTRKRAILGWEAGKA
>gid:370670  SMc02903  HYPOTHETICAL PROTEIN
MTKTIFDLHSPHPEPSTLVAFAENHLDRKSEHRPDDCIEAALKHPGAHFL
AFSGAKLIVKHDEKIIDPLFAPYELAGLEPTMDDAVLLGFLPNGEPRLAV
PSGLTEETVPEPFKIADARMLYRQQMLPENLLGQFAQASSLIVWNAGNRF
CGRCGGPMDGAAGGYRRICTACGHMVFPRTDPVVIMLTIDIERDQCLLGR
SPHFTPGMYSCLAGFVEPGETIENAVRRETLEESGIRIGRVRYHASQPWP
LPHSLMIGCYAEAKSTAIKRDEQELEDVRWFTRAETEAMLERATGVADTG
DEHIPPPKGAIAHQLMRDWLAWPERS
>gid:373898  SMc03177  PUTATIVE DNA LIGASE PROTEIN
MKAFAELLDRLVLTPQRNAKIRLLVDYFRGAPDPSRGYALAAIAGTLSLN
TVKPALIRDLLLERMDDVLFHYSYDYVGDLAETVSLAWEPPPDVALQDIP
LGEVVERLQRAGRSEVRSLVRDFLDRLDTSGRFALLKLATGGLRIGVSAR
LAKQALAEMGGKEVSEIETLWHGLEPPYLPLFLWLTGEAEMPVLKTPAVF
HSVMLATAVGDGDLDGLDPADFAAEWKWDGIRVQLANVGGARRLYSRSGD
EISSAFPEIIEAADITGVIDGELLVGGTMRSNRATGTFADLQQRLNRKTV
SRKLMDEYPAFIRAYDILFSGERDIRPEPFRVRREALSSLIEAASPQHFD
LSPLVGFSSWKELDELRSSPPDPVIEGVMLKRLDSPYMAGRAKGPWFKWK
RAPFNIDAVLMYAQRGHGKRSSYYSDFTFGVWAEGEDGASLVPVGKAYFG
FTDAELEVLDRFVRDNTVERFGPVRAVRAEPDSGFVVEVAFEGLNRSTRH
KSGVAMRFPRIARLRPDKLPRDADRLETLQAMMGTQR
>gid:374138  SMc03246  PUTATIVE INTEGRASE DNA PROTEIN
MPINKLTDKQVRGIKKPGNHPDGGNLYLRVRENGSKSFIVKATIDKKQRE
WTIGSYGSEEHHFSLAEARKRRDEIMAVIRAGGISEPTGKSVTQSVEPES
TTFPAPLFGPFSIELIQQIEGGFRNAKHRAQWRSTLETYCKSIWEKRVDR
ITTDDVLAILRPIWSTKAETASRVRGRIERVLNAAKVRGFRSGENPAAWQ
GHLQLLLPKRQKLQRGHHPALPFQDLPTLWQKLTSIDTIASAALQFLILT
AARSGELRGAKWEEFDLEKRLWTIPASRMKAAREHLVPLSDSCIEILKRM
QKIRHSDFVFPGTREHAPLSDMTLSKVLHGLCKGYTVHGFRSTFRDWCGE
KTDFSREHAEACLAHTIGSAVERAYRRGNSLEPRRRIMTAWEAFILTGRS
PEDQVDKAA
>gid:374139  SMc03247  PUTATIVE INTEGRASE/RECOMBINASE PROTEIN
MAAADPASFGVAQRVLTIPIKRAHIEVIHHLTNAEVDAVIAAPDQRTPRG
RRDRAFLLFLARTGARVSEAIGVNADDLQLERPRSQVLLHGKGRRDRVIP
VPQDLARALAALLRERGIANHEPRPIFIGAHNERLTRFGATHIVRRAAAM
AVAIRPDLAGKPISPHIFRHSLAMKLLQSGVDLLTIQAWLGHAQVATTHR
YAAADVEMMRNGLEKAGIEDDYRTHFRPTDAVLQMLNSI
>gid:374140  SMc03248  PUTATIVE TRANSPOSASE PROTEIN
MIDRTHRLSVVRQAKLLGFSRGSVYYSPRPVSDGDLALMRRIDELHLDYP
FAGSRMLQGLLKVEGLQAGRLHVATLMKKMGIEAIYRRPNTSKPAPGHKV
YPYLLRKLAVTRPNQVWAMDLTYIRMARGFVYLCAVVDWFSRRVLSWRLS
ITMETAFCIEAVEEALARYGKPDIFNTDQGSQFTSVDFTAVLKKAEIAIS
MDGKGAWRDNVFVERLWRSIKYEEVYLHAYKTVSEARVGIGRYLTFYNTR
RPHSSLDRQTPDQAYFNALTPMMAAA
>gid:374143  SMc03250  HYPOTHETICAL PROTEIN
MDTKWWKKVKKLVVRLVGRNGRRRFDPASKDRLIAACLEPGASVSKLALG
HGVNANLVWKWIRKRTQAPPFSPSSTSAFLPVQITAASSKFASEMPATDG
EELPPKAERIGPLSSPAKVSASLPNGVRLTLECGDANALAAIIGALGNVQ
TGR
>gid:374148  SMc03256  HYPOTHETICAL PROTEIN
MRQYESHLDCRRAVGLKIAREAPDRIGALYDVGRDIAGQPANFRLAARQK
HSKAKAEAFRVWAEAQLTRIPGKGDLAQAFHYGLSRGHSFCLFLEDGRGR
HGYNAAERANGNYQV
>gid:374183  SMc03293  HYPOTHETICAL PROTEIN
MRVEILGQERRRRWGDAKKLDVVMSIGLDGATVTEVAHRHDVSRQQIYAW
RHELKKKGLLPSKRRCAAGRGYGQGSARAQRTRSWYLLPRLRW
>gid:374142  SMc03751  HYPOTHETICAL PROTEIN
MFKLGADLQVYLHREPIDFRAGINSLAVLVQETMALDPFAPAVFAFCNRR
RDRMKLLFFDRSGFVLVLKRLAEDKFRWPRHQETVMRLTTERCAGFSTAS
ISMRWCAIRCGNTNRWLGLLNCAVDAAVRRAGSGNLNSGDKWNFCLTLA
>gid:374195  SMc03763  PUTATIVE CYTOSINE-SPECIFIC METHYLTRANSFERASE PROTEIN
MKKYSFADLFSGCGGLSLGLTQAGLKGQFAIERDAMAFRTFATNFIDARG
ASDRFEWPAWLERRAWGIEELLEYHGKELLGLRGTIDVLAGGPPCQGFSF
AGRRNEDDPRNLLFKKYVEMVEALQPQALVIENVPGMRVAHARRNVVELQ
AADPGGRKISFYDKLVESLSAAGYDVDAMLVDSASFGVPQKRSRLIAIGV
SKDLCKWLDGGILRAFKLLEEARVEQLAELGLPPLVTASDAISDLEITGR
PLVECVDPESPKRFMEASYGGPRTPYQTLMHHAHVGSMDSMRLARHTDEV
KDRFAKILAECRKGVRMDDASRRTYGLKKHRIYPMAEREPAPTVTTLPDD
ILHYREPRILTVRECARLQSFPDRFIFKGKYTTGGERRTKECPRYTQVGN
AVPPLLARAIGNALTRLLDEVSVAQLDSDNAATQLALTMA
>gid:374222  SMc03789  HYPOTHETICAL PROTEIN
MTAASTHSMSAGAPLFRQTMSQRILSIHFPHLPTDRVARSRWGASWLSRG
RPDHPPVVFAAKIDNAMRLVALDTFAERIGLKRGQGAAEARAMCPSLDVI
AADPAADQAFLEALADWCDRYTPLVALDGTDGLFLDITGCAHLHGGEKAL
ISDVLSRLFSLGVEARATVSSAAGLSWAVTRYGDADVIAPGDGDPVLAPL
PVAALRLPADTAAALERVGLKQIGDLLEAPRAPLARRFGSSLLLRLDQAR
GFEDEPLSPRLPVPSFSAERRLAEPLQDEEHILELTRHLARDVRSSLERH
GEGGRLFELLLFRVDGRVFRVRAHAAVPLNDAVRIAALFRERLQAVHDDL
DAGYGFEIVRLSVLRSERLDPAQQDFSGAPDESQPLAAFVDKVSARFGPG
CLMQATLAESHLPERAGGFAAVTDVAAVLRTAALEEKAFPENRPLRIFAH
PEPVEATAEVPEGAPRSFNWRKTRYRVARAEGPERIAAEWWIDGEDHPTR
DYFRIEDQEGRRFWLFREGLYGRETASPRWFMHGVFA
>gid:374333  SMc03877  CONSERVED HYPOTHETICAL PROTEIN
MSFQSMILSGRGVTAVLGPTNTGKTHYAIERMIAHDSGVIGLPLRLLARE
VYTRLVEKVGHHNVALITGEEKIAPHRARYSVCTVEAMPRETTASFVAID
EVQLAGDLERGHIFTDRILHLRGRGETLLLGAATMRPILEYLLPGITVVE
RPRMSQLLYAGSKKITRLPNRSAIVAFSADEVYAIAELIRRQRGGAAVVL
GALSPRTRNAQVALYQEGDVDYLVATDAIGMGLNLDVDHVAFAQDRKFDG
YQYRNLNPAELAQVAGRAGRHVRDGTFGVTGRVDPFDEDLVHRIESHEFD
PVRVLQWRSKALDFSSLKALRKSLEAAPAVSGLARALPAVDQQALEHLTR
YPEIVDVATASERVEKLWEACALPDYRRITPAQHADLISTIYADLVRHGT
VNEDFMAEQVRRADHTDGEIDTLSARIAQIRTWTYVSNRPGWLADPTHWQ
EKTREIEDRLSDALHERLTKRFVDRRTSVLMKRLRENAMLEAEISVNGDV
FVEGHHVGQLAGFRFTLAAGSEGTDAKAVQGAAHKALALEFEARAARLHA
AGNGDLALSSDGLVRWLGDPVARLTASDHVMRPRVILLADEQLQANAREH
VLARIERFVNHHISTVLKPLDDISRAEDLEGLAKGLAFQIVENLGVLFRR
DVAEEVKSLDQESRASIRRYGVRFGAYHIFLPALLKPAPAELITLLWALK
NDGLDKPGYGELIPMLAAGRTSVVTDPSFERTFYKLAGFRFLGKRAVRID
ILERLADLIRPLLQWKPGTSPRPDGAYDGRRFVATTSMLSILGATPDDME
EILKGLGYRADAVTAEEAAAFLASQNGETAPAEEAGASVADAGSVEAEAA
DAPTAETEATGTAAAEAPAAPAAETEAADTAAAETQAAPAGDDAAAPAEP
EAPAEAKPVLLWRPGTRQDNQRQGGR
>gid:373698  SMc03959  CONSERVED HYPOTHETICAL PROTEIN
MAARNEPLSEYNRRRDFTRTSEPKGAVARRSGDNRMRFLVQKHAATRLHY
DFRLEWEGVLKSWAVTRGPSLNPEDKRLAVRTEDHPLAYGDFEGTIPKGE
YGGGTVMLWDTGWWEPEDDPSKALKKGKLSFKLHGSRMKGGWALVRMRPR
EGEKRENWLLVKETDDVASDDGESLINENITSIVTGRTMEEIAEGRGEKR
ARVWHSNKSISANLKAGAIAENGNAGKRATRKASGKLPAFKAPQLATLVT
KVPAGDAWLNEAKFDGYRLVCAVGAGTVRCYTRNGLDWTEKFPAIAAALA
ELDCQSALIDGEVVALSEGGSTFSALQKALRTGASTRLYAFDLIELDGKD
LSRKPLVERKERLEALLQTLGATSTVQFSEHVRGNGEHVLSAICKAGQEG
IIAKEADAPYRSGRSRSWLKVKCTKRQEFVIGGYTPSSKKGRAFASLLVG
TFEGGKLIYRGGVGTGFSGKTMEDLAAAFAKRKRDTSPFDSVPRERMRNS
VWLEPDLVAEVDFAEFTADGHVRHGSFEGLREDKEARAVKLETPKPAEAE
PETAKSKSSARTRKTPPVQGDADVLGIRISHPDRVLFEGQGITKIDLARY
YAVVAERMLPFAADHPVSLVRCPQGGERHCFFQKHASDGFPEAIREVPIT
ESSGDTENYMYIHDAKGLVAAVQMGTLEFHIWGSSIDRLEKPDRLVFDLD
PDPSVDFETVKAAALRLRDELAEIGLKTVPMVTGGKGVHVIVPLRPHAEW
EEAKGFAKALARSIAERDPDNFVATMSKAKRKGKIFIDWLRNDRGATAIA
PYSTRARSGGPVATPVGWDELQGLEAANGFQIPDIIERIEAGTDPWREIG
KISQSLTKKILNSVE
>gid:372821  SMc04190  CONSERVED HYPOTHETICAL SIGNAL PEPTIDE PROTEIN
MPRIANLYFRTAIVFLILGISIGLHMSITGNHAATGAHAHANLLGWVTMA
IFGGYHALNPQKAARRLAMIQYAVYTFGVTMLIPSLYLLLSGNTAVEPIV
AISSLIAFAGVLLFAVIIFSSSEASVSARVAPTH
>gid:372829  SMc04197  HYPOTHETICAL PROTEIN
MVENSELARQTRHYLAVRERLARPGDAAGRSARIKELEGQLADLASDNEA
KGRRIARLEADLADAGARLLAQARILLGGRDTGASNEDGGDRAPIEEIVA
AVLEDFPGVSWDDIISVRRERRLVKPRHACMRAVYERRRDLSLAGIGRIF
HRDHTTVLAVVNDGGAGSGTAS
>gid:372661  SMc04210  CONSERVED HYPOTHETICAL PROTEIN
MTEEQLQALRAAISVCRRCRDEPARGEGHRLPHEPRPVAVLSASARILIA
GQAPGLRVHESGLPFNDASGDRLRQWLSVDRAAFYDQRNFAIVPMGFCFP
GYDRHGSDLPPRSECAPLWRQRAMDAMPQIELVLAVGHYAQRWHLGTDCP
KSMTETVRNWQRYAKRNSGISVLPLPHPSWRNTGWLRRHPWFEAELLPFL
RERVQALTI
>gid:372662  SMc04211  HYPOTHETICAL TRANSMEMBRANE PROTEIN
MAGGWIFLVLLGIGAYIAAQLPAPERPVTGELSGPASASDGDSLRLDGRR
IRIEGIDAPEIGQMCRRGETAWDCGAQARRRLVALVAGTTTVCRLHGRDR
YGRELGVCAAGGADLGREMVLSGHAVSYGLYRDEEETARTGRLGLWGGDF
VRPQEWRRSNGGAEEAPHRAGDWLEIIIQWLQEHSSAIMARIGGD
>gid:372849  SMc04272  CONSERVED HYPOTHETICAL PROTEIN
MTLPETDQAQWPLDGTIFPVAEIDIEVSPEPHPFHLAEAERARESWGREI
AANPHLFDGRMVLQRSVRITDGRISARAHIVPYSTFLWWRKTRAAGASHI
FGMPMLVSSDGALIAIRMGAHTANPGRVYSPGGSLEPEDIIDGRCDVARN
IAREVMEETGISLSEAIAEPGWHAIRMDGTLTVFRVFRLAATAEEILARV
AAHVAADPHPEIDEAVAIRGPEPAAHNYPEFIPPILEWLFARVRGQQTA
>gid:372888  SMc04359  PUTATIVE TRANSMEMBRANE PROTEIN
MGRWYLAAAVIVGGIVAYEHRGEFGHLPGASKLTALLSHAEKPAEKRAQA
KKPAEKKPPKKVELATIPIPFRPKEQTGSIAPKTALIPPAPVELPVRQSL
EQASLRPAESGTFYFCGIRHDNCVLDGDTFLYQGQRILIADIDAPETKLA
KCDEERSRGSYAKARLRELLNSGNFALVASETSPGDEPGKRRLVVRNGRS
LGDILISEGLARKRTGQPQSWCGQSTARVSG
>gid:374182  SMc04428  PUTATIVE PARTIAL TRANSPOSASE PROTEIN
RRRVCCRPKDDAPPAAGMDKAARERKGLDPGTCCPDCGGELRLVGEDASE
ILDMIAAQMKVIEVARLKKSCRCCEKMVQLPAPSPPIPGSMAGAGLLAYI
LVSKFDDHLPLYRLNEIFARMGATFRTARWSIGAVAPCRCSSR
>gid:372988  SMc04453  HYPOTHETICAL PROTEIN
MTTLIGYARVSKADGSQLHDHQRDALIKAGVLQEHLHRNAASGRRDRMRL
RQA
>SMb21046 TRm10-1b-2, putative transposase of insertion sequence ISRm10-1, orfB C-terminus protein
GLSTKMARLRGRALRGERCRAGVPHGHWKTTTFTGALRLTGMTAPFVYDG
AMNGNVFLAYVEQVLLPTLQAGDVVVMDNLPAHKTSGVRDAIERAGAKLM
FLPPYSPDFNPIENAFSKLKAMLRGRAERKIDALWDAVGALIPRFTPDEC
ANYFRAAGYDPD
>gid:371368  TRm11a  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAESSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>gid:371645  TRm11a  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAELSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>gid:374137  TRm11a  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAELSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>gid:371116  TRm11a  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAELSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>gid:371974  TRm11a  PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAELSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>gid:374022  TRm11a  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAELSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>gid:374021  TRm11b  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
KKTLVASERERPDVARHRARWLKHCPGIDPARLVFIDETWTKTNMAPLRG
WAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGERFRIYV
QQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKYSPDLNP
IEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFKEAGYER
A
>gid:371115  TRm11b  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
KKTLVASERERPDVARHRARWLKHCPGIDPARLVFIDETWTKTNMAPLRG
WAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGERFRIYV
QQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKYSPDLNP
IEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFKEAGYER
A
>gid:374135  TRm11b  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
KKTLVASERERPDVARHRARWLKHCPGIDPARLVFIDETWTKTNMAPLRG
WAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGERFRIYV
QQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKYSPDLNP
IEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFKEAGYER
ACIHTALVTLRHARAYEAG
>gid:371975  TRm11b  PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
KKTLVASERERPDVARHRARWLKHCPGIDPARLVFIDETWTKTNMAPLRG
WAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGERFRIYV
QQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKYSPDLNP
IEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFKEAGYER
A
>gid:371367  TRm11b  TRANSPOSASE FOR ISRM11/ISRM2011-2 PROTEIN
KKTLVASERERPDVARHRARWLKHCPGIDPARLVFIDETWTKTNMAPLRG
WAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGERFRIYV
QQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKYSPDLNP
IEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFKEAGYER
A
>gid:371646  TRm11b  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM11/ISRM2011-2
KKTLVASERERPDVARHRARWLKHCPGIDPARLVFIDETWTKTNMAPLRG
WAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGERFRIYV
QQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKYSPDLNP
IEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFKEAGYER
A
>gid:374154  TRm17  PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ISRM17
MRQERTVQGSIFDLFAEHEIGRELEAMSQWLDAHRDLLNLVTSDLRRQGV
TETGRQGLPSEAVLRCALLKQYRQLSYEELAFHLEDSASFRAFARLPWGW
SPKKSVLHKTISAIRADTWEAVNKMLLASARQERLESGRVVRVDSTVTAA
LIHEPSDSSLLWDCVRVMVRLLQQADSLGSTIPWHDHCRAAKKRARVIEY
TRGRPKRVQHYRALLRIARNTLDYLQQAAAQLPLAAGPAGKLWQAQVRHY
QPLITQIIAQTERRVLAGEAVPAGEKLVSLFEPHADIIVKGSRDVDYGHK
LNLTTGRSGLILDLVIEAGNPADSERLLPLLERHIAFYGEAPRQAAADGG
YASRENLRQAKAWGVRDMAFHKKSGLRIEDMVRSRWVYRKLRNFRAGIEA
GISCLKRTYGLARCTWRGLDHFKTYVWSSVVAYNLALFARLRPT
>gid:373525  TRm17  PUTATIVE TRANSPOSASE FOR ISRM17 PROTEIN
MRQERTVQGSIFDLFAEHEIGRELEAMSQWLDAHRDLLNLVTSDLRRQGV
TETGRQGLPSEAVLRCALLKQYRQLSYEELAFHLEDSASFRAFARLPWGW
SPKKSVLHKTISAIRADTWEAVNKMLLASARQERLESGRVVRVDSTVTAA
LIHEPSDSSLLWDCVRVMVRLLQQADSLGSTIPWHDHCRAAKKRARVIEY
TRGRPKRVQHYRALLRIARNTLDYLQQAAAQLPLAAGPAGKLWQAQVRHY
QPLITQIIAQTERRVLAGEAVPAGEKLVSLFEPHADIIVKGSRDVDYGHK
LNLTTGRSGLILDLVIEAGNPADSERLLPLLERHIAFYGEAPRQAAADGG
YASRENLRQAKAWGVRDMAFHKKSGLRIEDMVRSRWVYRKLRNFRAGIEA
GISCLKRTYGLARCTWRGLDHFKTYVWSSVVAYNLALFARLRPT
>gid:373797  TRm17C  PUTATIVE PARTIAL TRANSPOSASE FOR ISRM17 PROTEIN
MPAGEKLVSLFEPHADIIVKGSRDVDYGHKLNLTTGRSGLILDLVIEAGN
PADSERLLPLLERHIAFYGEAPRQAAADGGYASRENLRQAKAWGVRDMAF
HKKSGLRIEDMVRSRWVYRKLRNFRAGIEAGISCLKRTYGLARCTWRGLD
HFKTYVWSSVVAYNLALFARLRPT
>gid:373796  TRm17N  PUTATIVE PARTIAL TRANSPOSASE FOR ISRM17 PROTEIN
MRQERTVQGSIFDLFAEHEIGRELEAMSQWLDAHRDLLNLVTSDLRRQGV
TETGRQGLPSEAVLRCALLKQYRQLSYEELAFHLEDSASFRAFARLPWGW
SPKKSVLHKTISAIRADTWEAVNKMLLASARQERLESGRVVRVDSTVTAA
LIHEPSDSSLLWDCVRVMVRLLQQADSLGSTIPWHDHCRAAKKRARVIEY
TRGRPKRVQHYRALLRIARNTLDYLQQAAAQLPLAAGPAGKLWQAQVRHY
>gid:374150  TRm18  PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ISRM18
MRKVSMATRAELVAAISCRYVLGGRAEKARMLDEFVALTGFHRKHAMRLL
RGDCEAAKGGPRPGRRVYGDDVRAALIVVWEASDRICGKRLHPLLPTLIE
AMERHGHGDMDSETRRQLLTMSPATIDRALKEIKASATGPRRRKGSTAIR
RSVPVRTFSDWDDPAPGFVEADLVSHSGPYAKGAFSQTLVLTDIATGWTE
CAPLLVREQTVLITALAELRKLLPFPLLGFDTDNDSVFMNESVHEYCLRD
NIELTRCRPYRKNDQAFVEQKNGAIVRKIVGYRRFEGLRATRELAKLYSS
MRLFVNFFQPSFKLKEKHRDGAKVIKRYHRPATPYQRLLDDARTPEDTCL
RLKAMYLTLDPVRLLRDMRLAQERLVEIADKPDSSPATDGGALPLEDFLS
GLRIAWRGGEVKPTARSKPAAKRERRRPDPLLAVTAELEEWFEAEPWRTS
RELLERLQIKYPGVYPDGLIRTVQRRMKIWRSTQANALVFGPFADAARQT
QIVKVVQ
>gid:374173  TRm18  PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ISRM18
MRKVSMATRAELVAAISCRYVLGGRAEKARMLDEFVALTGFHRKHAMRLL
RGDCEAAKGGPRPGRRVYGDDVRAALIVVWEASDRICGKRLHPLLPTLIE
AMERHGHGDMDSETRRQLLTMSPATIDRALKEIKASATGPRRRKGSTAIR
RSVPVRTFSDWDDPAPGFVEADLVSHSGPYAKGAFSQTLVLTDIATGWTE
CAPLLVREQTVLITALAELRKLLPFPLLGFDTDNDSVFMNESVHEYCLRD
NIELTRCRPYRKNDQAFVEQKNGAIVRKIVGYRRFEGLRATRELAKLYSS
MRLFVNFFQPSFKLKEKHRDGAKVIKRYHRPATPYQRLLDDARTPEDTCL
RLKAMYLTLDPVRLLRDMRLAQERLVEIADKPDSSPATDGGALPLEDFLS
GLRIAWRGGEVKPTARSKPAAKRERRRPDPLLAVTAELEEWFEAEPWRTS
RELLERLQIKYPGVYPDGLIRTVQRRMKIWRSTQANALVFGPFADAARQT
QIVKVVQ
>SMb20777 TRm19, putative transposase of insertion sequence ISRm19 protein
MTEISMKAAGIDTGKIWLDVATYPVSDKQKVPNNADGWQTLADWLERQGI
GRVGIEASGGYERDVIAYLHQRGFEVVLLQPRQVRAFGLYKLRRAKNDQL
DAALIAECAARSDARCHAPDSRLIAFGEWLLFIEQIEADIACLKTRRERF
TDRWILEEIDRSIGELKSRCKAQLALLQAAVREHDDLARKLDLIESIDGI
GIRTALTLVILMPELGRVDREEIAALTGVAPYDDQSGKREGERHIAGGRA
RVRRALFNAALPASQRWNETLVELYDRLTAKGKSHKAALIACVRKLIIFA
NTVVKRQTPWTKSAPQQNSCA
>SMb20636 TRm19, putative transposase of insertion sequence ISRm19 protein
MTEISMKAAGIDTGKTWLDVATYPVSDKQKVPNNADGWQTLADWLERQGI
GRVGIEASGGYERDVIAYLHRRGFEVVLLQPRQVRAFGLYKLRRAKNDQL
DAALIAECAARSDARCHAPDSRLIAFGEWLLFIEQIEADIACLKTRRERF
TDRRILEEIDRSIGELKSRCKAQLALLQAAVREHDDLARKLDLIESIDGI
GIRTALTLVILMPELGRVDREEIAALTGVAPYDDQSGKREGERHIAGGRA
RVRRALFNAALPASQRWNETLVELYDRLTAKGKSHKAALIACVRKLIIFA
NTVVKRQTPWTKSAPQQNSCA
>gid:370402  TRm19  PUTATIVE TRANSPOSASE FOR INSERTION SEQUENCE ISRM19
MTEISMKAAGIDTGKIWLDVATYPVSDKQKVPNNADGWQTLADWLERQGI
GRVGIEASGGYERDVIAYLHQRGFEVVLLQPRQVRAFGLYKLRRAKNDQL
DAALIAECAARSDARCHAPDSRLIAFGEWLLFIEQIEADIACLKTRRERF
TDRWILEEIDRSIGELKSRCKAQLALLQAAVREHDDLARKLDLIESIDGI
GIRTALTLVILMPELGRVDREEIAALTGVAPYDDQSGKREGERHIAGGRA
RVRRALFNAALPASQRWNETLVELYDRLTAKGKSHKAALIACVRKLIIFA
NTVVKRQTPWTKSAPQQNSCA
>gid:370913  TRm19C  CTERM FRAGMENT OF A PUTATIVE TRANSPOSASE PROTEIN
MRLIAFGEWLLFIEQIEADIACLKTRRERFTDRRILEEIDRSIGELKSRC
KAQLALLQAAVREHDDLARKLDLIESIDGIGIRTALTLVILMPELGRVDR
EEIAALTGVAPYDDQSGKREGERHIAGGRARVRRALFNAALPASQRWNET
LVELYDRLTAKGKSHKAALIACVRKLIIFANTVVKRQTPWTKSAPQQNSC
A
>gid:370914  TRm19N  NTERM FRAGMENT OF A PUTATIVE TRANSPOSASE PROTEIN
MKAAGIDTGKTWLDVATYPVSDKQKVPNNADGWQTLADWLERQGIGRVGI
EASGGYERDVIAYLHQRGFEVVLLQPRQVRAFGLYKLRRAKNDQLDAALI
AECAARSDAADRVWRMAAVHRADRSRYSLPQDPPRAFHRQADPRGDRSFH
R
>SMb21234 TRm1a, probable transposase of insertion sequence ISRm1 orfA protein
MGRSNVGNMTSSNFKMEVLSGPERRRRWSTAEKLAIIHETYEADATVSIV
ARRHGIQPNQLFAWRKLASQGALTATAAEEEVVPASEYRALQAQVKELQR
LLGKKTMESEILKEALEIAGSPKKHLLRSLSLPRGILG
>SMb20918 TRm1a, transposase of insertion sequence ISRm1 orfA protein
MTSSNFKMEVLSGPERRRRWSTAEKLAIIHETYEADATVSIVARRHGIQP
NQLFAWRKLASQGALTATAAEEEVVPASEYRALQAQVKELQRLLGKKTME
SEILKEALEIAGSPKKHLLRSLSLPRGILG
>gid:373400  TRm1a  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM1
MEVLSGPERRRRWSTAEKLAIIHETYEADATVSIVARRHGIQPNQLFAWR
KLASQGALTATAAEEEVVPASEYRALQAQVKELQRLLGKKTMESEILKEA
LEIAGSPKKHLLRSLSLPRGILG
>gid:371122  TRm1a  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM1
MTSSNFKMEVLSGPERRRRWSTAEKLAIIHETYEADATVSIVARRHGIQP
NQLFAWRKLASQGALTATAAEEEVVPASEYRALQAQVKELQRLLGKKTME
SEILKEALEIAGSPKKHLLRSLSLPRGILG
>gid:374363  TRm1a  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM1
MTSSNFKMEVLSGPERRRRWSTAEKLAIIHETYEADATVSIVARRHGIQP
NQLFAWRKLASQGALTATAAEEEVVPASEYRALQAQVKELQRLLGKKTME
SEILKEALEIAGSPKKHLLRSLSLPRGILG
>gid:374186  TRm1a  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM1
MTSSNFKMEVLSGPERRRRWSTAEKLAIIHETYEADATVSIVARRHGIQP
NQLFAWRKLASQGALTATAAEEEVVPASEYRALQAQVKELQRLLGKKTME
SEILKEALEIAGSPKKHLLRSLSLPRGILG
>SMb21233 TRm1b, probable transposase of insertion sequence ISRm1 orfB protein
MKSVCETLGVARSNIAARAAGSPSRARGRPPLPDRELVEDIKAVIADMPT
YGYRRVHAILRRNARKLGRSWPNAKRVYRVMKLHNLLLVRHTGAVDNRLH
EGQVAVERSNIRWCSDGFEIGCDNKEKVRVAFALDCCDREAIAHVATTEG
IKSQDVQDLVITAVENRFGRINMLSEPIEWLTDNGSCFIAKDTASLLRDI
GMEPCTTPVRSPQSNGMAEAFVKTFKRDYVAVNPTPDAETVMAQLPFWFE
HYNNLHPHSALGYQSPREFISSQSQT
>gid:374187  TRm1b  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM1
MKSVCETLGVARSNIAARAAGSPSRARGRPPLPDRELVEDIKAVIADMPT
YGYRRVHAILRRNARKLGRSWPNAKRVYRVMKLHNLLLVRHTGAVDNRLH
EGQVAVERSNIRWCSDGFEIGCDNKEKVRVAFALDCCDREAIAHVATTEG
IKSQDVQDLVITAVENRFGRINMLSEPIEWLTDNGSCFIAKDTASLLRDI
GMEPCTTPVRSPQSNGMAEAFVKTFKRDYVAVNPTPDAETVMAQLPFWFE
HYNNLHPHSALGYQSPREFISSQSQT
>gid:374362  TRm1b  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM1
MKSVCETLGVARSNIAARAAGSPSRARGRPPLPDRELVEDIKAVIADMPT
YGYRRVHAILRRNARKLGRSWPNAKRVYRVMKLHNLLLVRHTGAVDNRLH
EGQVAVERSNIRWCSDGFEIGCDNKEKVRVAFALDCCDREAIAHVATTEG
IKSQDVQDLVITAVENRFGRINMLSEPIEWLTDNGSCFIAKDTASLLRDI
GMEPCTTPVRSPQSNGMAEAFVKTFKRDYVAVNPTPDAETVMAQLPFWFE
HYNNLHPHSALGYQSPREFISSQSQT
>gid:373399  TRm1b  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM1
MKSVCETLGVARSNIAARAAGSPSRARGRPPLPDRELVEDIKAVIADMPT
YGYRRVHAILRRNARKLGRSWPNAKRVYRVMKLHNLLLVRHTGAVDNRLH
EGQVAVERSNIRWCSDGFEIGCDNKEKVRVAFALDCCDREAIAHVATTEG
IKSQDVQDLVITAVENRFGRINMLSEPIEWLTDNGSCFIAKDTASLLRDI
GMEPCTTPVRSPQSNGMAEAFVKTFKRDYVAVNPTPDAETVMAQLPFWFE
HYNNLHPHSALGYQSPREFISSQSQT
>gid:371123  TRm1b  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM1
MKSVCETLGVARSNIAARAAGSPSRARGRPPLPDRELVEDIKAVIADMPT
YGYRRVHAILRRNARKLGRSWPNAKRVYRVMKLHNLLLVRHTGAVDNRLH
EGQVAVERSNIRWCSDGFEIGCDNKEKVRVAFALDCCDREAIAHVATTEG
IKSQDVQDLVITAVENRFGRINMLSEPIEWLTDNGSCFIAKDTASLLRDI
GMEPCTTPVRSPQSKEQTSRCTLLDWLSVN
>SMb20919 TRm1b, transposase of insertion sequence ISRm1 orfB protein
MKSVCETLGVARSNIAARAAGSPSRARGRPPLPDRELVEDIKAVIADMPT
YGYRRVHAILRRNARKLGRSWPNAKRVYRVMKLHNLLLVRHTGAVDNRLH
EGQVAVERSNIRWCSDGFEIGCDNKEKVRVAFALDCCDREAIAHVATTEG
IKSQDVQDLVITAVENRFGRINMLSEPIEWLTDNGSCFIAKDTASLLRDI
GMEPCTTPVRSPQSNGMAEAFVKTFKRDYVAVNPTPDAETVMAQLPFWFE
HYNNLHPHSALGYQSPREFISSQSQT
>gid:372159  TRm20  PUTATIVE TRANSPOSASE PROTEIN
MPWREVSTMGERREFVRLALEEGVNRRELCRRFGISPDIGYKWLARWEAG
DRELADRSRRPHISPMRCNEAVETEVLGVRDAHPAWGARKIVGYLERQGT
HPPAVSTVHAILKRHDRIVAPPGGAPALQRFEKEAPNQLWQMDFKGWVQL
ADATLCHPLTVIDDHSRFVPCLMACADQRGATVRGHLERTFRRYGLPDAM
FVDNGAPWGDPSGEGWTGLGVWLLKLGVALLHSRPYHPQSRGKNERFHRT
LKAEVFAFDRFRDLAAVQRAFDAWRELYNFERPHGALDHDVPASRYHPSP
RAMPDRLPEPVYDEGEIVRKVSATKAYVSFKGRLWKVPKAFCGERLAIRP
LDRDGHYGAFFGAHHIATINLTNKQSVSDVSEQVSAMSPG
>gid:370716  TRm20  PUTATIVE TRANSPOSASE PROTEIN
MPWREVSTMGERREFVRLALEEGVNRRELCRRFGISPDIGYKWLARWEAG
DRELADRSRRPHISPMRCNEAVETEVLGVRDAHPAWGARKIVGYLERQGT
HPPAVSTVHAILKRHDRIVAPPGGAPALQRFEKEAPNQLWQMDFKGWVQL
ADATLCHPLTVIDDHSRFVPCLMACADQRGATVRGHLERTFRRYGLPDAM
FVDNGAPWGDPSGEGWTGLGVWLLKLGVALLHSRPYHPQSRGKNERFHRT
LKAEVFAFDRFRDLAAVQRAFDAWRELYNFERPHGALDHDVPASRYHPSP
RAMPDRLPEPVYDEGEIVRKVSATKAYVSFKGRLWKVPKAFCGERLAIRP
LDRDGHYGAFFGAHHIATINLTNKQSVSDVSEQVSAMSPG
>gid:372709  TRm20  PUTATIVE TRANSPOSASE PROTEIN
MPWREVSTMGERREFVRLALEEGVNRRELCRRFGISPDIGYKWLARWEAG
DRELADRSRRPHISPMRCNEAVETEVLGVRDAHPAWGARKIVGYLERQGT
HPPAVSTVHAILKRHDRIVAPPGGAPALQRFEKEAPNQLWQMDFKGWVQL
ADATLCHPLTVIDDHSRFVPCLMACADQRGATVRGHLERTFRRYGLPDAM
FVDNGAPWGDPSGEGWTGLGVWLLKLGVALLHSRPYHPQSRGKNERFHRT
LKAEVFAFDRFRDLAAVQRAFDAWRELYNFERPHGALDHDVPASRYHPSP
RAMPDRLPEPVYDEGEIVRKVSATKAYVSFKGRLWKVPKAFCGERLAIRP
LDRDGHYGAFFGAHHIATINLTNKQSVSDVSEQVSAMSPG
>SMb20304 TRm2011-2C, probable ISRm2011-2 transposase protein, C-terminal portion
KKTLVASERERPDVARHRARWLKHCPGIDPARLVFIDETWTKTNMAPLRG
WAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGERFRIYV
QQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKYSPDLNP
IEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFKEAGYER
A
>SMb20305 TRm2011-2N, probable ISRm2011-2 transposase protein, N-terminal portion
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAELSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>SMb20783 TRm2011-2a, putative transposase of insertion sequence ISRm2011-2, orfA protein
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAELSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>SMb20778 TRm2011-2b-2, putative transposase of insertion sequence ISRm2011-2, orfB C-terminus protein
WTKTNMAPLRGWAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDG
PINGERFRIYVQQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLF
FLPKYSPDLNPIEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQEC
AAYFKEAGYERA
>gid:371881  TRm20C  PUTATIVE PARTIAL TRANSPOSASE PROTEIN
MGYLERQGTHPAVSTVHAILKRHDRIVAPPGGAPALQRFEKEAPNQLWQM
DFKGWVQLADATLCHPLTVIDDHSRFVPCLMACADQRGATVRGHLERTFR
RYGLPDAMFVDNGAPWGDPSGEGWTGLGVWLLKLGVALLHSRPYHPQSRG
KNERFHRTLKAEVFAFDRFRDLAAVQRAFDAWRELYNFERPHGALDHDVP
ASRYHPSPRAMPDRLPEPVYDEGEIVRKVSATKAYVSFKGRLWKVPKAFC
GERLAIRPLDRDGHYGAFFGAHHIATINLTNKQSVSDVSEQVSAMSPG
>gid:371880  TRm20N  PUTATIVE PARTIAL TRANSPOSASE PROTEIN
MGERREFVRLALEEGVNRRELCRRFGISPDIGYKWLARWEAGDRELADRS
RRPHISPMRCNEAVETEVLGVRDAHPAWGRAKLWAIWNGREPTRRFRPFM
PSSSAMIGS
>gid:374354  TRm21  PUTATIVE TRANSPOSASE PROTEIN
MSRFAACFEDLPDPRGRNARHPLTSILFIAVAAIVCGAESCTDMADFGVA
KKKWLKTIVPLPYGIPSHDTFSTVFRHLDPDAFDAAFRRLTASFAQGLEG
VVAIDGKAVRGAYRRAAKATPLHFVNVWAAGPGLVIGQKLAPGRNEVQGA
LDALALLALEGSIVTADALHCRPDTARAILAAGGDYALALKANQPGLLAQ
ALARIEDADHVESIQIAAETAHDRTETRRASVVAVDDINFPGLQAIGCVE
TTSRHTNGHLTSHVRYFLLSTTMSPSALIEVVRTHWQIENKLHWVLDVHF
REDAARNRKDNGPQNIAFLRKIALNLLRSHPDKASIRRKIKKAGWDDQFL
TSLIAHMR
>gid:372980  TRm21  PUTATIVE TRANSPOSASE PROTEIN
MSRFAACFEDLPDPRGRNARHPLTSILFIAVAAIVCGAESCTDMADFGVA
KKKWLKTIVPLPYGIPSHDTFSTVFRHLDPDAFDAAFRRLTASFAQGLEG
VVAIDGKAVRGAYRRAAKATPLHFVNVWAAGPGLVIGQKLAPGRNEVQGA
LDALALLALEGSIVTADALHCRPDTARAILAAGGDYALALKANQPGLLAQ
ALARIEDADHVESIQIAAETAHDRTETRRASVVAVDDINFPGLQAIGCVE
TTSRHTNGHLTSHVRYFLLSTTMSPSALIEVVRTHWQIENKLHWVLDVHF
REDAARNRKDNGPQNIAFLRKIALNLLRSHPDKASIRRKIKKAGWDDQFL
TSLIAHMR
>SMb20421 TRm21, putative Transposase for insertion sequence ISRm21 protein
MSRFAACFEDLPDPRGRNARHPLTSILFIAVAAIVCGAESCTDMADFGVA
KKKWLKTIVPLPYGIPSHDTFSTVFRHLDPDAFDAAFRRLTASFAQGLEG
VVAIDGKAVRGAYRRAAKATPLHFVNVWAAGPGLVIGQKLAPGRNEVQGA
LDALALLALEGSIVTADALHCRPDTARAILAAGGDYALALKANQPGLLAQ
ALARIEDADHVESIQIAAETAHDRTETRRASVVAVDDINFPGLQAIGCVE
TTSRHTNGHLTSHVRYFLLSTTMSPSALIEVVRTHWQIENKLHWVLDVHF
REDAARNRKDNGPQNIAFLRKIALNLLRSHPDKASIRRKIKKAGWDDQFL
TSLIAHMR
>gid:374344  TRm21  PUTATIVE TRANSPOSASE PROTEIN
MSRFAACFEDLPDPRGRNARHPLTSILFIAVAAIVCGAESCTDMADFGVA
KKKWLKTIVPLPYGIPSHDTFSTVFRHLDPDAFDAAFRRLTASFAQGLEG
VVAIDGKAVRGAYRRAAKATPLHFVNVWAAGPGLVIGQKLAPGRNEVQGA
LDALALLALEGSIVTADALHCRPDTARAILAAGGDYALALKANQPGLLAQ
ALARIEDADHVESIQIAAETAHDRTETRRASVVAVDDINFPGLQAIGCVE
TTSRHTNGHLTSHVRYFLLSTTMSPSALIEVVRTHWQIENKLHWVLDVHF
REDAARNRKDNGPQNIAFLRKIALNLLRSHPDKASIRRKIKKAGWDDQFL
TSLIAHMR
>gid:372962  TRm22  PROBABLE TRANSPOSASE PROTEIN
MRFTPSIFAQLLKAIDRRSFQAIVDRHAGDAYDKCFTSWDHLVALIYAQL
SATTSLRGLEASFNANSQHHYHLGSGRLMRSTLSDANRRRPVAVFAETFA
LLAGQLDRQTRREGTKMLRLIDSTPIPLGKLYDWAKSNGRIRGMKLHVVY
DPKADCPGILDITDANVNDAQIGRTITIEKGATYVFDKGYCHYGWWTAIA
EAGASFVTRPKTNMGLALVAERPVEQPQGDGFLVLEDSQVSLASKGDSKL
PIGLRRVIVKRQDGDTITLLTNDLERSAVEIGQLYKDRWQIELLFRWIKQ
HLKIRKFLGNNDNAIRLQIFAAMIAYALLRIAARLARVPLPILRFTDLVT
QCLFQRKSIAEIHKPPQVNPSRPKPRTIPNQMVFRYA
>gid:370563  TRm22  PUTATIVE TRANSPOSASE PROTEIN
MRFTPSIFAQLLKAIDRRSFQAIVDRHAGDAYDKCFTSWDHLVALIYAQL
SATTSLRGLEASFNANSQHHYHLGSGRLMRSTLSDANRRRPVAVFAETFA
LLAGQLDRQTRREGTKMLRLIDSTPIPLGKLYDWAKSNGRIRGMKLHVVY
DPKADCPGILDITDANVNDAQIGRTITIEKGATYVFDKGYCHYGWWTAIA
EAGASFVTRPKTNMGLALVAERPVEQPQGDGFLVLEDSQVSLASKGDSKL
PIGLRRVIVKRQDGDTITLLTNDLERSAVEIGQLYKDRWQIELLFRWIKQ
HLKIRKFLGNNDNAIRLQIFAAMIAYALLRIAARLARVPLPILRFTDLVT
QCLFQRKSIAEIHKPPQVNPSRPKPRTIPNQMVFRYA
>gid:370424  TRm22  PROBABLE TRANSPOSASE PROTEIN
MRFTPSIFAQLLKAIDRRSFQAIVDRHAGDAYDKCFTSWDHLVALIYAQL
SATTSLRGLEASFNANSQHHYHLGSGRLMRSTLSDANRRRPVAVFAETFA
LLAGQLDRQTRREGTKMLRLIDSTPIPLGKLYDWAKSNGRIRGMKLHVVY
DPKADCPGILDITDANVNDAQIGRTITIEKGATYVFDKGYCHYGWWTAIA
EAGASFVTRPKTNMGLALVAERPVEQPQGDGFLVLEDSQVSLASKGDSKL
PIGLRRVIVKRQDGDTITLLTNDLERSAVEIGQLYKDRWQIELLFRWIKQ
HLKIRKFLGNNDNAIRLQIFAAMIAYALLRIAARLARVPLPILRFTDLVT
QCLFQRKSIAEIHKPPQVNPSRPKPRTIPNQMVFRYA
>gid:373281  TRm22  PUTATIVE TRANSPOSASE PROTEIN
MRFTPSIFAQLLKAIDRRSFQAIVDRHAGDAYDKCFTSWDHLVALIYAQL
SATTSLRGLEASFNANSQHHYHLGSGRLMRSTLSDANRRRPVAVFAETFA
LLAGQLDRQTRREGTKMLRLIDSTPIPLGKLYDWAKSNGRIRGMKLHVVY
DPKADCPGILDITDANVNDAQIGRTITIEKGATYVFDKGYCHYGWWTAIA
EAGASFVTRPKTNMGLALVAERPVEQPQGDGFLVLEDSQVSLASKGDSKL
PIGLRRVIVKRQDGDTITLLTNDLERSAVEIGQLYKDRWQIELLFRWIKQ
HLKIRKFLGNNDNAIRLQIFAAMIAYALLRIAARLARVPLPILRFTDLVT
QCLFQRKSIAEIHKPPQVNPSRPKPRTIPNQMVFRYA
>gid:370551  TRm22  PUTATIVE TRANSPOSASE PROTEIN
MRFTPSIFAQLLKAIDRRSFQAIVDRHAGDAYDKCFTSWDHLVALIYAQL
SATTSLRGLEASFNANSQHHYHLGSGRLMRSTLSDANRRRPVAVFAETFA
LLAGQLDRQTRREGTKMLRLIDSTPIPLGKLYDWAKSNGRIRGMKLHVVY
DPKADCPGILDITDANVNDAQIGRTITIEKGATYVFDKGYCHYGWWTAIA
EAGASFVTRPKTNMGLALVAERPVEQPQGDGFLVLEDSQVSLASKGDSKL
PIGLRRVIVKRQDGDTITLLTNDLERSAVEIGQLYKDRWQIELLFRWIKQ
HLKIRKFLGNNDNAIRLQIFAAMIAYALLRIAARLARVPLPILRFTDLVT
QCLFQRKSIAEIHKPPQVNPSRPKPRTIPNQMVFRYA
>gid:371240  TRm22  PUTATIVE TRANSPOSASE PROTEIN
MRFTPSIFAQLLKAIDRRSFQAIVDRHAGDAYDKCFTSWDHLVALIYAQL
SATTSLRGLEASFNANSQHHYHLGSGRLMRSTLSDANRRRPVAVFAETFA
LLAGQLDRQTRREGTKMLRLIDSTPIPLGKLYDWAKSNGRIRGMKLHVVY
DPKADCPGILDITDANVNDAQIGRTITIEKGATYVFDKGYCHYGWWTAIA
EAGASFVTRPKTNMGLALVAERPVEQPQGDGFLVLEDSQVSLASKGDSKL
PIGLRRVIVKRQDGDTITLLTNDLERSAVEIGQLYKDRWQIELLFRWIKQ
HLKIRKFLGNNDNAIRLQIFAAMIAYALLRIAARLARVPLPILRFTDLVT
QCLFQRKSIAEIHKPPQVNPSRPKPRTIPNQMVFRYA
>gid:373585  TRm22  PUTATIVE TRANSPOSASE PROTEIN
MRFTPSIFAQLLKAIDRRSFQAIVDRHAGDAYDKCFTSWDHLVALIYAQL
SATTSLRGLEASFNANSQHHYHLGSGRLMRSTLSDANRRRPVAVFAETFA
LLAGQLDRQTRREGTKMLRLIDSTPIPLGKLYDWAKSNGRIRGMKLHVVY
DPKADCPGILDITDANVNDAQIGRTITIEKGATYVFDKGYCHYGWWTAIA
EAGASFVTRPKTNMGLALVAERPVEQPQGDGFLVLEDSQVSLASKGDSKL
PIGLRRVIVKRQDGDTITLLTNDLERSAVEIGQLYKDRWQIELLFRWIKQ
HLKIRKFLGNNDNAIRLQIFAAMIAYALLRIAARLARVPLPILRFTDLVT
QCLFQRKSIAEIHKPPQVNPSRPKPRTIPNQMVFRYA
>gid:373266  TRm22  PUTATIVE TRANSPOSASE PROTEIN
MRFTPSIFAQLLKAIDRRSFQAIVDRHAGDAYDKCFTSWDHLVALIYAQL
SATTSLRGLEASFNANSQHHYHLGSGSTLSDANRRRPVAVFAETFALLAG
QLDRQTRREGTKMLRLIDSTPIPLGKLYDWAKSNGRIRGMKLHVVYDPKA
DCPGILDITDANVNDAQIGRTITIEKGATYVFDKGYCHYGWWTAIAEAGA
SFVTRPKTNMGLALVAERPVEQPQGDGFLVLEDSQVSLASKGDSKLPIGL
RRVIVKRQDGDTITLLTNDLERSAVEIGQLYKDRWQIELLFRWIKQHLKI
RKFLGNNDNAIRLQIFAAMIAYALLRIAARLARVPLPILRFTDLVTQCLF
QRKSIAEIHKPPQVNPSRPKPRTIPNQMVFRYA
>SMb20766 TRm22, putative transposase of insertion sequence ISRm22 protein
MRFTPSIFAQLLKAIDRRSFQAIVDRHAGDAYDKCFTSWDHLVALIYAQL
SATTSLRGLEASFNANSQHHYHLGSGRLMRSTLSDANRRRPVAVFAETFA
LLAGQLDRQTRREGTKMLRLIDSTPIPLGKLYDWAKSNGRIRGMKLHVVY
DPKADCPGILDITDANVNDAQIGRTITIEKGATYVFDKGYCHYGWWTAIA
EAGASFVTRPKTNMGLALVAERPIEQPQGDGFLVLEDSQVSLASKGDSKL
PIGLRRVIVKRQDGDTITLLTNDLERSAVEIGQLYKDRWQIELLFRWIKQ
HLKIRKFLGNNDNAIRLQIFAAMIAYALLRIAARLARVPLPILRFTDLVT
QCLFQRKSIAEIHKPPPVNPSRPKPRTIPNQMVFRYT
>gid:374192  TRm26.1  PUTATIVE TRANSPOSASE NUMBER 1 FOR INSERTION SEQUENCE ISRM26
MSDSVNQPRTFEVLTAAPVRARRKPRDWPNEEKERLIAETLLPGANVSAI
ARAEGLDPSQLYGWRRKALSSGLVAPLTETTKKEVKFARVEPVASSAVEI
VLGDVVVRVGGDIESDHLVKILRAVRKA
>gid:374191  TRm26.2  PUTATIVE TRANSPOSASE NUMBER 2 FOR INSERTION SEQUENCE ISRM26
MIASGVVVYVSCQPVDFRKGAASLMALVRDGGLDPFNGALYVFRSKRADR
VRIVWWDGSGVCLYSKTLEEQSFCWPGISAARIRLDHSQLMALLAGLDWK
KIRPTKVRRPLLTG
>gid:374190  TRm26.3  PUTATIVE TRANSPOSASE NUMBER 3 FOR INSERTION SEQUENCE ISRM26
MDLPLSDLPDDVDALKAMVLALAREQAAKEVRLKVAEIARLEAVEKSANE
RIANLTLIMKVLQRTQNGKRSERLRLGVNDEQVSFAFEEVETGLSAIRSE
LDRAAKDKPKRAPRPRKGFAAHLERIEEVIEPEIPAGCEGLAKVLIGEDR
SERLDVVPPKFRVIVTRRPKYAFRGSDGVVQALAPAHIIEGGLPTERLLA
YIAVSKYADGLPLYRQEAIYLRDGVEISRSLMAQWMGHLGFELQMLADYI
LERVKEGERIFADETTLPTLAPGSGKTTKAWLWAYARDDRPYGGTSPPMV
AYRFEDSRGADCVTRHLSGFTGILQVDGYSAYTNLAKTRAKTGSNETVQL
AGCWAHLRRKFYDLHISGVSQAATDTVLAMTELWRIEDEVRGKDADSRAA
RRQEKSSTTAASLFELWEKELGKVSGKSKTAEAIRYALTRREALERFLTD
GRIEIDSNIVERAIRPQTITRKNSLFAGSEGGGRTWAAVATLLQTCKMNG
VDPLDWLSQTLTRIAQGWPASEIEALMPWNFRSDAVS
>gid:374169  TRm27.2  PUTATIVE TRANSPOSASE NUMBER 2 FOR INSERTION SEQUENCE ISRM27
MKNVHNTIDEARLGIMLNELRLPTIKTLWPQFAEQADREGWPAARFLSAI
AEHELAERAHRRIERHLAEAHLPPGKTLESFAFDAVPMVSKAQVMAIAAG
DSWLAKGASILLFGPPGGGKSHLAAAIGLALIENGWRVLFTRTTDLVQKL
QVARRELQLESAIAKLDKFDLLILDDLAYVTKDQAETSVLFELISARYER
RSIMITANQPFGEWNRVFPDPAMTLAAVDRLVHHATASPRRQACRDR
>gid:374166  TRm28.1  PUTATIVE TRANSPOSASE NUMBER 1 FOR INSERTION SEQUENCE ISRM28
MVGDRAGAMLGVMDEARHDGVYRRIEVITGRRQRRNWSDEEKARILAESA
EPDVNISAVARRWGVNRGLLNVWRRQAGLTARRSVQACAQQAMFVPVTVV
GERAPPQSASSDIASVASGRIEIEIAGARMTVIGSVAPELAQAIVAALRA
RW
>gid:374167  TRm28.2  PUTATIVE TRANSPOSASE NUMBER 2 FOR INSERTION SEQUENCE ISRM28
MIGLSPNGVKIMVATQPVDFRRGMNGLVALVASAPSADPYCGDVFVFRAK
RCDRLRCIYWDGSGMILATKWLEAGKFVWPPIRDGAMQMSSQEFSLLLAG
IDWTRVKRNLVKRPTKVG
>gid:373988  TRm3  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM3
MAIEKELLDQLLAGRDPSEVFGKDGLLDDLKKALSERILNAELDDHLDVE
RLEGGPANRRNGSSKKTVLTGTSKMTLTIPRDRAGTFDPKLIARYQRRFP
DFDDKIISMYARGMTVREIQGHLEELYGIDVSPDLISAVTDTVLEAVGEW
QNRPLELCYPLVFFDAIRVKIRDEGFVRNKAVYVALAVLADGSKEILGLW
IEQTEGAKFWLRVMNELKNRGCQDILIAVVDGLKGFPEAITAVFPQTIVQ
TCIVHLIRHSLEFVSYKDRRTVVPALRAIYRARDAEAGLKALEAFEEGYW
GQKYPAIAQSWRRNWEHVVPFFAFPEGVRRIIYTTNAIEALNSKLRRAVR
SRGHFPGDEAAMKLLYLVLNNAAEQWKRAPREWVEAKTQFAVIFGERFFN
>gid:370587  TRm3  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM3
MAIEKELLDQLLAGRDPSEVFGKDGLLDDLKKALSERILNAELDDHLDVE
RLEGGPANRRNGSSKKTVLTGTSKMTLTIPRDRAGTFDPKLIARYQRRFP
DFDDKIISMYARGMTVREIQGHLEELYGIDVSPDLISAVTDTVLEAVGEW
QNRPLELCYPLVFFDAIRVKIRDEGFVRNKAVYVALAVLADGSKEILGLW
IEQTEGAKFWLRVMNELKNRGCQDILIAVVDGLKGFPEAITAVFPQTIVQ
TCIVHLIRHSLEFVSYKDRRTVVPALRAIYRARDAEAGLKALEAFEEGYW
GQKYPAIAQSWRRNWEHVVPFFAFPEGVRRIIYTTNAIEALNSKLRRAVR
SRGHFPGDEAAMKLLYLVLNNAAEQWKRAPREWVEAKTQFAVIFGERFFN
>gid:372826  TRm3  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM3
MAIEKELLDQLLAGRDPSEVFGKDGLLDDLKKALSERILNAELDDHLDVE
RLEGGPANRRNGSSKKTVLTGTSKMTLTIPRDRAGTFDPKLIARYQRRFP
DFDDKIISMYARGMTVREIQGHLEELYGIDVSPDLISAVTDTVLEAVGEW
QNRPLELCYPLVFFDAIRVKIRDEGFVRNKAVYVALAVLADGSKEILGLW
IEQTEGAKFWLRVMNELKNRGCQDILIAVVDGLKGFPEAITAVFPQTIVQ
TCIVHLIRHSLEFVSYKDRRTVVPALRAIYRARDAEAGLKALEAFEEGYW
GQKYPAIAQSWRRNWEHVVPFFAFPEGVRRIIYTTNAIEALNSKLRRAVR
SRGHFPGDEAAMKLLYLVLNNAAEQWKRAPREWVEAKTQFAVIFGERFFN
>gid:371127  TRm30.2  PUTATIVE TRANSPOSASE NUMBER 2 FOR INSERTION SEQUENCE ISRM30
MTKEGKPDRLRQMGALNPKPEGVRAPWFREAGFFDPLDLVQVKYEMLRHA
REEGTNKADAAALFGLSRQTYYQAEAAFERDGMSGLLPRTRGPKSAHKLT
GEVMRLVEEHLDANGQLQARSLADLVHARLGISVHPRSIERAVARKKKR
>gid:371125  TRm30.4  PUTATIVE TRANSPOSASE NUMBER 4 FOR INSERTION SEQUENCE ISRM30
MSTDNAMKISADHLRRDAFLYVRQSSLRQVFENTESTKRQYALRDRAVAL
GWPIERVHVIDNDLGLSGAQSQDRDGFQRLVTEVAMGHAGIVLGLEVSRL
ARNNADWHRLLELAAMSRTLIMDEDGVYDAASFNDRMLLGLKGTMSEAEL
HILKSRLQGGILNKARRGELELPLPIGLVYTPDMRVVLDPDRQIQDTVRM
LFDTFREVGSACAVVRRLRSEKILFPRRIRRGIGKGDVLWSEIDHSRVIQ
ILHNPRYAGAFAYGRTRTIYNAKLKSVQQKMPRSDWQVLIPQAHEGYISW
DEFERNQTSLEQNAVGFSPGLRGRMPRQGNGLLQGRVLCGRCGARMRVHY
EQFEGNLRPYYICNEAVVRHAGKACQWARGPAIDEAVSALLLEAMAPTAI
EVALAVQEEISQRVEQAASLRDKQLQRARYEAELARRRYLKVDPDNRLVA
DALEADWNGKLRDLDTLQREHERRNETDQSLLDGAMQERIRALAADFPGI
WNNERTSPVERKRMLGLLIEDVTLLVDEQINMHIRWRGGRTQSLAVARPR
PMAVIRKTPEAVVALINELLETDNDQQIASRLNALGHRNWRGEAFTLKKV
MLVRRAYGLKTRFERLRESGMLTGEEVARRFGVSATTVHQLGRDGVLKRH
RYATNHRYLYEPPGNVRLAKGVGGRYGSRKPRLIDAQPIQQGAS
>gid:372268  TRm5  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM5
MTKTEGRTASAAVKDILLSNPDGLREVIRTVMQEVLEAEMDEALGAAKGE
RTPERLGYRSGHYGRTLITRVGKLELRVPQDRSGHFSTELFERYQRSERA
LVATLAEMYVQGVSTRKVKAITEELCGHAFSASSISAINKRLDESLKAFA
ERSLEEPFAYLILDARYEKVREAGVVMSQAVLIAVGIDWDGRRQILSVEM
AGRESRSAWKDFLVRLKGRGLKGVELVVSDDHAGLVAAIGEVIPEAAWQR
CYVHFLRNALDHLPRKHGDDCLQELRWLYDRRDLDEAKADLAAWLGKWSV
RYPRLTSWVEETIEQTLTFFRLPRQHHKHLKSTNMLERLNEEIRRRTYVV
RIFPNTESCLRLVRALAVETHENWMEANRYINMDDLREHKKLALRQAA
>gid:372458  TRm5  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM5
MTKTEGKTASAAVKDILLSNPDGLREVIRTVMQEVLEAEMDEALGAAKGE
RTPERLGYRSGHYGRTLITRVGKLELRVPQDRSGHFSTELFERYQRSERA
LVATLAEMYVQGVSTRKVKAITEELCGHAFSASSISAINKRLDESLKAFA
ERSLEEPFAYLILDARYEKVREAGVVMSQAVLIAVGIDWDGRRQILSVEM
AGRESRSAWKDFLVRLKGRGLKGVELVVSDDHAGLVAAIGEVIPEAVWQR
CYVHFLRNALDHLPRKHGDDCLQELRWLYDRRDLDEAKADLAAWLGKWSV
RYPRLTSWVEETIEQTLTFFRLPRQHHKHLKSTNMLERLNEEIRRRTYVV
RIFPNTESCLRLVRALAVETHENWMEANRYINMDDLREHKKLALRQAA
>gid:373301  TRm5  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM5
MTKTEGKTASAAVKDILLSNPDGLREVIRTVMQEVLEAEMDEALGAAKGE
RTPERLGYRSGHYGRTLITRVGKLELRVPQDRSGHFSTELFERYQRSERA
LVATLAEMYVQGVSTRKVKAITEELCGHAFSASSISAINKRLDESLKAFA
ERSLEEPFAYLILDARYEKVREAGVVMSQAVLIAVGIDWDGRRQILSVEM
AGRESRSAWKDFLVRLKGRGLKGVELVVSDDHAGLVAAIGEVIPEAAWQR
CYVHFLRNALDHLPRKHGDDCLQELRWLYDRRDLDEAKADLAAWLGKWSV
RYPRLTSWVEETIEQTLTFFRLPRQHHKHLKSTNMLERLNEEIRRRTYVV
RIFPNTESCLRLVRALAVETHENWMEANRYINMDDLREHKKLALRQAA
>gid:373726  TRm5  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM5
MTKTEGKTASAAVKDILLSNPDGLREVIRTVMQEVLEAEMDEALGAAKGE
RTPERLGYRSGHYGRTLITRVGKLELRVPQDRSGHFSTELFERYQRSERA
LVATLAEMYVQGVSTRKVKAITEELCGHAFSASSISAINKRLDESLKAFA
ERSLEEPFAYLILDARYEKVREAGVVMSQAVLIAVGIDWDGRRQILSVEM
AGRESRSAWKDFLVRLKGRGLKGVELVVSDDHAGLVAAIGEVIPEAAWQR
CYVHFLRNALDHLPRKHGDDCLQELRWLYDRRDLDEAKADLAAWLGKWSV
RYPRLTSWVEETIEQTLTFFRLPRQHHKHLKSTNMLERLNEEIRRRTYVV
RIFPNTESCLRLVRALAVETHENWMEANRYINMDDLREHKKLALRQAA
>gid:373071  TRm5  TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM5
MTKTEGKTASAAVKDILLSNPDGLREVIRTVMQEVLEAEMDEALGAAKGE
RTPERLGYRSGHYGRTLITRVGKLELRVPQDRSGHFSTELFERYQRSERA
LVATLAEMYVQGVSTRKVKAITEELCGHAFSASSISAINKRLDESLKAFA
ERSLEEPFAYLILDARYEKVREAGVVMSQAVLIAVGIDWDGRRQILSVEM
AGRESRSAWKDFLVRLKGRGLKGVELVVSDDHAGLVAAIGEVIPEAAWQR
CYVHFLRNALDHLPRKHGDDCLQELRWLYDRRDLDEAKADLAAWLGKWSV
RYPRLTSWVEETIEQTLTFFRLPRQHHKHLKSTNMLERLNEEIRRRTYVV
RIFPNTESCLRLVRALAVETHENWMEANRYINMDDLREHKKLALRQAA
>SMb20060 TRm5, probable ISRm5 transposase protein
MTKTEGRTASAAVKDILLSNPDGLREVIRTVMQEVLEAEMDEALGAAKGE
RTPERLGYRSGHYGRTLITRVGKLELRVPQDRSGHFSTELFERYQRSERA
LVATLAEMYVQGVSTRKVKAITEELCGHAFSASSISAINKRLDESLKAFA
ERSLQEPFAYLILDARYEKVREAGVVMSQAVLIAVGIDWDGRRQILSVEM
AGRESRSAWKDFLVRLKGRGLKGVELVVSDDHAGLVAAIGEVIPEAVWQR
CYVHFLRNALDHLPRKHGDDCLQELRWLYDRRDLDEAKADLAAWLGKWSV
RYPRLTSWVEETIEQTLTFFRLPRQHHKHLKSTNMLERLNEEIRRRTYVV
RIFPNTESCLRLVRALAVETHENWMEANRYINMDDLREHKKLALRQAA
>gid:370585  TRm5C  PARTIAL TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM5
VRYPRLTSWVEETIEQTLTFFRLPRQHHKHLKSTNMLERLNEEIRRRTYV
VRIFPNTESCLRLVRALAVETHENWMEANRYINMDDLREHKKLALRQAA
>gid:373986  TRm5C  PARTIAL TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM5
WVEETIEQTLTFFRLPRQHHKHLKSTNMLERLNEEIRRRTYVVRIFPNTE
SCLRLVRALAVETHENWMEANRYINMDDLREHKKLALRQAA
>gid:373989  TRm5N  PARTIAL TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM5
MTKTEGKTASAAVKDILLSNPDGLREVIRTVMQEVLEAEMDEALGAAKGE
RTPERLGYRSGHYGRTLITRVGKLELRVPQDRSGHFSTELFERYQRSERA
LVATLAEMYVQGVSTRKVKAITEELCGHAFSASSISAINKRLDESLKAFA
ERSLEEPFAYLILDARYEKVREAGVVMSQAVLIAVGIDWDGRRQILSVEM
AGRESRSAWKDFLVRLKGRGLKGVELVVSDDHAGLVAAIGEVIPEAAWQR
CYVHFLRNALDHLPRKHGDDCLQELRWLYDRRDLDEAKADLAAWLGKWSV
RYPRLTSWVGDCQEFCAVGHDDEKERIITWLSRKNFWTSSWLDVIHPRFS
ARTVCWTI
>gid:370588  TRm5N  PARTIAL TRANSPOSASE FOR INSERTION SEQUENCE ELEMENT ISRM5
MTKTEGKTASAAVKDILLSNPDGLREVIRTVMQEVLEAEMDEALGAAKGE
RTPERLGYRSGHYGRTLITRVGKLELRVPQDRSGHFSTELFERYQRSERA
LVATLAEMYVQGVSTRKVKAITEELCGHAFSASSISAINKRLDESLKAFA
ERSLEEPFAYLILDARYEKVREAGVVMSQAVLIAVGIDWDGRRQILSVEM
AGRESRSAWKDFLVRLKGRGLKGVELVVSDDHAGLVAAIGEVIPEAAWQR
CYVHFLRNALDHLPRKHGDDCLQELRWLYDRRDLDEAKADLAAWLGKWSV
RYGTVRNSVRWAGFIS
>gid:370928  ada  PUTATIVE TRANSCRIPTION REGULATOR PROTEIN
MLFDLPDDDILYDALLARSSDYEGQAFVCVKSTGIFCRLSCPARKPKREN
TLFFDSISACVNSGFRPCQRCRPLEGASGKDPLVKELIELLDRRPEHRWT
EGDLVRRGFDPSTVRRAFKRSLGVTFLDLARQRRMGEAARQLSGGASVIE
AQIDAGYESPSGFRAAFGRLIGQAPAKSQGRVLLFADWIETPLGPMVAVA
DQTHLHLIEFHDRKALPAEVENLKRKTHSAVVPGRTPPIDQVERELNDYF
AGRSADFRTPLALDGSAFERQVWAELVAIPVGETRSYSDIARKVATPQAV
RAVARANGSNCLAIVVPCHRCIGADGSLTGYGGGLRRKQWLLRHEGRMRP
VGLFREWDGEKPVQAAAPTM
>gid:370446  alkB  PROBABLE DNA REPAIR SYSTEM SPECIFIC FOR ALKYLATED DNA PROTEIN
MPPMLVLPKGVRHIPGFLDRSRQEELVEAVRAVVAEAPLFAPAMPKTGKP
MSVRMTNCGALGWVTDRERGYRYQATHPVTGKPWPPIPGMLQDIWNAVAG
SDKSPEACLVNFYSAEARMGLHQDRDERDLETAVVSISLGDDCLFRVGGR
TRGGQTVSFRLESGDVVVLGGEGRLAFHGVDRIYPNTSTLLRSGGRLNLT
LRRVNP
>gid:371494  ccrM  ADENINE DNA METHYLTRANSFERASE PROTEIN
MSSVVSLAEISRAARPLNWLDSIIKGDCVAALNALPDHSVDVVFADPPYN
LQLGGTLHRPDQSLVDAVDDDWDQFASFEAYDAFTRAWLLACRRVLKPTG
TLWVIGSYHNIFRVGAILQDLHFWVLNDIIWRKTNPMPNFKGRRFQNAHE
TLIWATPNAKAKGYTFNYEAMKAANDDVQMRSDWLFPICSGSERLKGDDG
KKVHPTQKPEALLARILMASTKPGDVVLDPFFGSGTTGAVAKRLGRHFVG
IEREQDYIDAAAERIAAVEPLGKATLSVMTGKKAEPRVAFNTLVESGLIK
PGTVLTDAKRRYSAIVRADGTLASGGEAGSIHRLGAKVQGLDACNGWTFW
HFEEGSVLKPIDELRSVIRNDLAKLN
>gid:370922  deaD  PUTATIVE ATP-DEPENDENT RNA HELICASE PROTEIN
MTEFAGIAPAIAEALAKRGYETLTPVQQAMLDPSLGTADALVSAQTGSGK
TVAFGLALAPTLIGSARKFAQAGAPLALVIAPTRELALQVMRELDWLYEM
AGATIASCVGGMDMRSERRTLERGAHIVVGTPGRLCDHIRRGSLDISGLK
AVVLDEADEMLDLGFREDLEFILEASPADRRTLMFSATVPRSIATLAKSY
QRDAVRIGVTSEQKQHGDIEYRSLLVAPSDRENAIINVLRFYEARNAIVF
CSTRAAVNHLTARFNNRGFSVVALSGELSQNERTHALQAMRDGRARVCIA
TDVAARGIDLPGLELVIHADLPTNSDTLLHRSGRTGRAGQKGVSALIVPV
NARRKAERLLENARITAAWAKPPLADEVARRDDERIAADPALAEAPREDE
QAIVETLISRYGAEKIAAAFVRQFRSNRSAPEELVDVAVYDDRRKPRRDG
PAFTGDSEPAPRADFADGQWFSLSVGRKQNAEPRWLIPMLCRYGKLSKRD
IGAIRMQPEETYVEMTADGAERLLAAIGPNRMLEKGIRVKTLPGAPDSSR
PRQDKPDFEKKRPPATAAEREHKDFEPKRKFDKQPPVAQDAHSDSRGDKP
WSKKKGKPEAQKTGDFKPGAKRNNAKKRQP
>gid:371939  dinP  PUTATIVE DNA-DAMAGE-INDUCIBLE PROTEIN P
MIDSAAPPSGFCRDCLKEQAAHSRRCLACGSPRLLRHSELYRLTLAHIDC
DAFYASVEKRDNPELADKPVIIGGGKRGVVSTACYIARIHGVRSAMPMFK
ALEACPQAVVIKPDMEKYVRVGREVRAMMQELTPLVQPLSIDEAFLDLSG
TERLHHDPPARTLARFAKRVEQEIGITVSVGLSYCKFLAKVASDLQKPRG
FSVIGQAEAADFLKAKPVTLIWGVGKAFAATLERDGIRAIGQLQTMEEAD
LMRRYGTMGRRLYRLSRGLDERSVEIDGEAKSVSSETTFNDDLARQEDLV
AHLRGLSEQVAFRLRKSALAGQTVVLKLKTADFKTRTRNRRLESPTRLAD
RIFRTGLQLLEKEVDGTKYRLIGIGVSDLVDPDLADPPDLVDPQASRRAA
AEDAINRLRDKFGKTSVETGYTFGKGRRGQ
>gid:370847  dnaA  CHROMOSOMAL REPLICATION INITIATOR PROTEIN
MRMNLATAPGGFQAGSNQSQAAGEKHDMRHDALFERVSARLKAQVGPDVF
ASWFGRLKLHSVSKSVVRLSVPTTFLKSWINNRYLDLITTLVQQEDSEIL
KVEILVRTATRGHRPTAPEESVAAAAEAAVVPPSRRSAAPTVAIAAAAVA
AAPARPVQAPLFGSPLDQRYGFDSFVEGSSNRVALAAARTIAEAGAGAVR
FNPLFIHSSVGLGKTHLLQAIALAALQSARAPRVVYLTAEYFMWRFATAI
RDNDALSLKESLRNIDLLIIDDMQFLQGKSIQHEFCHLLNMLLDSAKQVV
VAADRAPWELESLDSRVRSRLQGGVAIEMEGPDYEMRLEMLKRRLEAARQ
DDASLEIPLEILSHVARNVTASGRELEGAFNQLLFRRSFEPQLSIERVDE
LLGHLVNAGEPRRVRIEDIQRVVAKHYNVSRQELVSNRRTRVIVKPRQIA
MYLSKTLTPRSFPEIGRRFGGRDHTTVLHAVRKIEELISADTKLSHEIEL
LKRLINE
>gid:371737  dnaB  PROBABLE REPLICATIVE DNA HELICASE PROTEIN
MNDAARKLAPLAKDQAEQHYREAPNNLEAEQALLGAILVNNDAFYRVSDF
LKPVHLYEPLHRRIFEIAGEIIRMGKTANPVTVKTFLKADEKVGDLTVAQ
YLARLAAEAVSIINAEDYGRAIYDLALRRSLITIGEDMVNIAYDAPLDMP
PQSQIEDAERRLFELAETGRYDGGFQSFNDAVALAIDMAGQAFERDGHLS
GISTGIHSLDGKMGGLQRSDLIILAGRPGMGKTSLATNIAYNIAAAYEPE
VQPDGSFKAKNGGVVGFYSLEMSSEQLATRIISEQTEVSSSKIRRGDISE
ADFEKLVACSQMMQKVPLYIDQTGGISIAQLAARARRLKRQRGLDVLVVD
YIQLMTGSKKSGENRVQEITEITTGLKALGKELNVPIIALSQLSRAVESR
EDKRPQLSDLRESGSIEQDADVVLFVFREEYYVKNMEPRDEFDPKYEEWK
MQMEKVKGTADVIIAKQRHGPTGTVKLAFQSEFTRFSDLADPSFTQYEH
>gid:371934  dnaE1  PROBABLE DNA POLYMERASE III, ALPHA CHAIN PROTEIN
MTGNQGIGQGIGHGTGAAADPQFVHLRVHSAYSLLEGALPLKKIIGKAVA
DDQPAIGIADTNNLFAALEFSQKAADDGLQPIIGCQLSIDMEDEAEGERR
GHAHQFVKLPAIVLIAATEDGYARLVELVSRAYLEGEGHQQTRISRSWLA
AGGTTGLIALTGAGAGPVDMALKSGSPALAEARLKALIELFGDRLYVELQ
RHGNYDRRHENRMIDLAYRLDIPLVATNEAFFPSPSDYDAHDALMAVAHN
AMVSDDSRFRLTPDHYLKSRKEMAALFADLPEALENTIEVARRCSFMLKT
RGPILPRFTGASDDPEEAERAEVAELRRQAEEGLEERLAKLGMAPGYKEE
DYRERLAFELGVIQRMKFPGYFLIVADFIKWAKQHDIPVGPGRGSGAGSL
VAYALTITDVDPMRFSLLFERFLNPERVSMPDFDIDFCQDRREEVIRYVQ
QKYGREQVAQIITFGSLQARAALRDVGRVLEMPYGQVDKICKLVPNNPAN
PTPLSKAIEEEPRLREEAEKEPVVARLLDIAQKIEGLYRHASTHAAGIVI
GDRPLSQLVPMYRDPRSDMPVTQFNMKWVEQAGLVKFDFLGLKTLTVLKT
AIDFVGKRGIHIDLASIPLDDPKTYETLSRGETVGVFQVESAGMRKALIG
MRPDCIEDIIALVALYRPGPMENIPVYNARKHGEEEIESIHPKIDYLLKE
TQGVIVYQEQVMQIAQVLSGYSLGEADLLRRAMGKKIKAEMDKQRARFVD
GAVKNGVSKPQADLIFDLLAKFANYGFNKSHAAAYAIVSYQTAYMKAHYP
VEFLAASMTLDMSNTDKINDFRQDAMRLGIQVVAPSVQTSHRHFETGDNR
IYYSLAALKGVGESAVDHIVAVRGDRPFASLEDFCLRIDPKLLNRRVFES
LIAAGAFDCFGYDRAELIGGLDRILGFAQRAQENKVSGQSDMFGAGAATG
PEKIALPPYTPWLASEKLHREFQVLGFYLSAHPLDTYNNLLAKMRVQTFA
DFSAAVKKGAAAGRLAGTVTSKQERKTRTGNKMGIVAFSDASGQFEAVLF
SEMLNQYRDLLEPGKSLVMTVDAEERPEGIGLRIRTLRSLEEESLQMQKA
LRVYVRDCGPLRSIASHLNAKGDGLVSFIVIKDNGQREIEVELNEKYRIS
PEIAAALRSAPGVVDVELV
>gid:374221  dnaE2  PUTATIVE DNA POLYMERASE III ALPHA CHAIN PROTEIN
MSADAVFCELGARTNFSFLEGAAPAEEMVVFAKKAGLAGLGIADRNSVAG
VVRAHAKAKVEGYPFQPGARLVFADGTPDILAYPKNRRGWGHLCRLLSAG
NLRSKKGDCTLHLADLLEWQEELLLIVMQGEGRPEPESLEVLLGTLKEHA
GNRLYLGLAPHYDGFDRHDFAVLAAIARKAGIGLLATNDALYHDPHYRPL
ADVVTSIREHVPIAGAGFLLQKNAERHLKGPREMARLFSDYPEAIANTRK
FFRELAFSLDELSHQYPDENADGETPAESLRRLVAEGAAERYPEGVPEKV
MRQIDYELELIHDKKYEPYFLTVHKLVKFARSVNILCQGRGSAANSSVCF
CLGITDVDPQKFTLLFDRFLSKDRDEPPDIDVDFEHERREEVIQYIYRTY
GKEHAGLTAAVISYRSRSAGREVAKAFGLSEDVQSALVSSIWGWGTSPFT
EEQAKGAGLDAADPLTRRVLAYASLLMNFPRHLSQHVGGFVITRDRLDEV
VPIMNTAMPDRYMIEWDKDDLDELKILKVDVLALGMLTCLAKGFKLLEAH
YGEPITLAEIYQDHRDAVYDMICRADTVGVFQIESRAQMSMLPRLQPREM
YDLVIEVAIVRPGPIQGNMVHPYLKRREAQRRGEAVVYPSPELKAVLERT
LGVPLFQEQAMQIAITAAGFSPSEADRLRRAMATFKRTGTIHTFERKMVE
GMVANDYEREFAERCFNQIKGFGEYGFPESHAASFASLVYASAWLKTYYP
DIFCAALLNAQPMGFYAPAQLVRDAREHGVRMLPVDINHSDWDALLEGEG
AFDKNAVHPRHASMREVIKTRKAVRLGFRLVKGLKQTDMKALVARRGEGY
RSVHDLWLRSGLSRSVLERLADADAFRSIGLDRRAALWAVKALDEQSAVE
RLPLFEGAGSDDLQIEPKVALPDMPAGEQVIHDYRTLTLSLKAHPVSFMR
EDFSRRGILRSRDLAATATGRWVTVAGLVLVRQRPGSANGVIFMTIEDET
GIANIIVWEKTFQKYRRQVMGSRLVKVRGRLQNQSGVIHVVADHLEDITP
MLGLLRREARRFGVNDRADGALRPSADAREKKKLRQLRLGLPARAAPEGE
AAAQVAEVMPKGRNFH
>gid:373191  dnaG  PROBABLE DNA PRIMASE PROTEIN
MRFSPSFLDEIRDRVPISDVIGKRVTWDRRKTNVSRGDYWACCPFHGEKS
PSFHCEDRKGRYHCFGCGVSGDHFRFLTELEGLSFPEAVQQIADLAGVPM
PQPDPQAEQRERERTSLLDVMELATQFFQDQLQTANGAKARAYLRERGLT
GRTIETFRLGFAPDSRNALKEFLAGKGIGKEQIEACGLVVHGDGIPVSYD
RFRDRIMFPIPSAREKVIAFGGRAMSPDAPAKYLNSNETELFHKGNVLFN
FARARRASQGADGAGTIIAVEGYMDVIALHQAGIENAVAPLGTALTENQL
DLLWKMTTQPVLCFDGDGAGVRAAHRAVDLALPHLKPGRSVRFALLPEGK
DPDDVVRHDGREPFDKVLANARSLADMVWQREVQGGDFDTPEKRAELEAR
LRQVTSVIADESVRRHYGQDMRDRLNAFLQGSAPFRGERRPFERGGRQAG
RQGGRGWSPAPNPVMGAAVEAGTAGSSKLNQMLKAGGSLRPPVLRESVLA
LTIVNHPQLLFDEYDEIATIEFDHRDLQRCWAAVLNAAAANGLRLTRETL
MEQLEAEGFSALIGALDQQVRYARLWTATAAAAPEDAREGYLQALTLHRR
TKALLWQKRELEREIAEATASEDVEQGQRLVRAMEEVQLELHRMDKLEAI
IEGFGVLSGRVKGPAAR
>gid:370804  dnaN  PROBABLE DNA POLYMERASE III, BETA CHAIN PROTEIN
MRITLERSNLLKSLNHVHRVVERRNTIPILSNVLLRSDGASLEMKATDLD
LEITEATPAQVEQAGATTVPAHLLYDIVRKLPDGSEVRLATNAEGTAMTV
ASGRSKFSLQCLPQSDFPDLTAGTFSHSFRLKAPDLKMLIDRTQFAISTE
ETRYYLNGIFVHTVESNGDLKLRAVATDGHRLARADVEAPSGSEGMPGII
IPRKTVSELQKLLDSPDVVVTVEVSDAKIRLTIGSIVMTSKLIDGTFPDY
QRVIPASNDKELRVDCQSFSQAVDRVSTISSERGRAVKLALADGQMTLTV
NNPDSGSATEELPVGYESDPLEIGFNAKYLLDITGQLTGGEAVFLLADPG
SPTLVRDLAAEDALYVLMPMRV
>gid:370390  dnaQ  PROBABLE DNA POLYMERASE III, EPSILON CHAIN PROTEIN
MREIIFDTETTGLDNREDRVIEIGGIELENQFPTGRTIHIYINPGERKVH
PEALAVHGITDEFLKDKPPFADVAQEIVDFFGDARWVAHNATFDIGFINA
EFERLGLPPIGSDRVIDTLALARRKHPMGPNSLDALCRRYGVDNSHRTRH
GALLDSELLAEVYIEMIGGRQAALGLVVTEAGDRPIEADDGPVVVVTRER
PLRPRLTEAEIAAHAALVAKIGANAIWSKYSEADDVLRSEAV
>gid:370672  dnaX  PUTATIVE DNA POLYMERASE III SUBUNIT TAU PROTEIN
MSDEAPTSSPILPVEKPAAYRVLARKYRPKDFSDLMVGQEPMVRTLTNAF
ETGRIAQAYMLTGVRGVGKTTTARILARALNYKTAEIDKPTIDLRVPGEH
CQAIMDGRHVDVIEMDAASHTGIDDIREIIEQVRYRPVSARYKVYIIDEV
HMLSTQAFNGLLKTLEEPPEHVKFIFATTEIRKVPITVLSRCQRFDLRRI
SASDLVGLFSTILGKEGVPFDPEALAMVARAAEGSARDGLSLLDQAIAHG
GGSVEIETVRSMLGLADRARIVDLFEHVIKGDVAGALDEFAAQYEAGANP
TVVLTDLADFTHLVTRLKYVPDAVNDQSLSEIERTRGAEFAGSVAVTTLS
RVWQMLLKGIPEAESSARPAGAAEMVLIRLAHAAHLPSPEEAARRLLDLS
GGEGGGRPAPNGGGGGGGAQAPAGTPVEARAVETAVSRPSGNGATMLRAV
PSSAPPQPISVGRIEERTVASAAAKPEPKVPVNSIGDIADLCAKNRDIKL
KTMVRGFLRLVHIEPGRLDVNLPEDAPKTLLNELAVKLKEWTGIHWVVSY
SREQGEPTLVEAEQRAQEQRVNDARQDPDVAAILARFPGARITDVRIRAA
EEELESLAPAAAESEDGDIVPDDDIE
>SMb20951 exoI, putative periplasmatic protein
MTRIKSAVAAGGRRAPHSARLGSASTRTIGAVLAALLMTHDAGAAEPIIG
QASVIDGDTIEIAGERVQLNSVDAPEEWQVCLDERGADYRCGKESASALD
AFLSASRPTRCEFAGRDRYGRFVGTCFRADGKDVNRWLIESGNAVDRDTD
NKGLYASAQQTAKSNGAGIWRAQPEHACAARVGRVNRKPSC
>SMb21662 exoI2, putative periplasmic protein
MNSHRSIGVTLALLLFAHFAFAADPITGRATVIDGDTIEIRGERIRLHGV
DAPESWQKCENADGSSYQCGREAAQELDRFLAESRPVRCQFVQRDRYKRF
VGVCFRADGRDVNHWLVESGNAVDWTRYSNGVYANAQELARSHRAGIWRG
NFELPCNARASRAKREASC
>gid:370844  fpg  PROBABLE FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE PROTEIN
MPELPEVETVKRGLAPTMEGALLVRAELRRPDLRFPFPENFANAVAGRRI
IALSRRAKYLMIELEGGDVIIAHLGMSGSFRIEKGPIEAGADPATPGAFH
HPRGKDEKHDHVVFHLDGGSGPARVIYNDPRRFGFMDLARRSALADHVFL
RGLGEEPTGNALDAAYLATRFAGKIQPLKAALLDQRTIAGLGNIYVCEAL
WRSGLSPKRSAGTLVDKRGRPKQALIALTERIRAVIADAIAAGGSSLKDH
IQADGSLGYFQHSFSVYDREGEACRTPGCHGTVARIVQAGRSTFYCPHCQ
K
>gid:372277  gyrA  PROBABLE DNA GYRASE SUBUNIT A PROTEIN
MTEQSTPGGGKNPPGIEPISIIEEMQRSYLDYAMSVIVSRALPDVRDGLK
PVHRRILYGMSELGIDWNKKYVKCARVTGDVMGKYHPHGNAAIYDALARM
AQDWSLRLPLIDGQGNFGSVDGDPPAAERYTECRLQKAAHSLLDDLDKDT
VDFRDNYDGTLHEPVVVPAKFPNLLVNGAGGIAVGMATNIPPHNLVEVID
GCIALIDNPAIELPELMQIIPGPDFPTGALILGRSGIRQAYETGRGSVIM
RGRAHIEPMRGDREQIIITEIPYQVNKATMIEKMAELVREKRIEGISDLR
DESDRQGYRVVIELKRDANAEVILNQLYRYTPLQTSFGCNMVALNGGKPE
QMTLLDMLRAFVSFREDVVSRRTKYLLRKARERAHVLVGLAIAVANIDEV
IKLIRQAPDPQTAREQLMERRWPAHDVDALIRLIDDPRHRINDDATYNLS
EEQARAILDLRLQRLTALGRDEIGDELNKIGEEIKDYLDILSSRLRIMQI
VKNELTAVRDEFGTPRRTEIAEGGPDMDDEDLIAQEDMVVTVSHLGYIKR
VPLTTYRAQRRGGKGRSGMATRDEDFVTRLFVANTHTPVLFFSSRGIVYK
EKVWRLPIGTPQSRGKALINMLPLEPGERITTIMPLPEDETTWENLDVMF
STTRGTVRRNKLSDFVQVNRNGKIAMKLEEEGDEILSVDTCTEFDDVVLT
TALGQCIRFPVADVRVFAGRNSIGVRGISLGDGDRIISMAIVAHVEAEPW
ERAAYLKRSAAERRALTGEEEEIVLVGEEVTNGGELTNERYEELKAREQF
ILTVSMKGFGKRSSSYDFRTSGRGGKGIRATDTSKTAEIGELVAAFPIEH
NDQIMLVSDGGQLIRVPVEGIRLASRATKGVTIFSTAKDEKVVSVERITE
PDGDEEIEAVGAEGVMAEPESPPDTTSTPDE
>gid:370397  gyrB  PROBABLE DNA GYRASE SUBUNIT B PROTEIN
MTDISETEAGAIAEYGADSIKVLKGLDAVRKRPGMYIGDTDDGSGLHHMV
YEVVDNAIDEALAGHADIVTVTLNPDGSVTVTDNGRGIPTDIHREEGVSA
AEVIMTQLHAGGKFDQNSYKVSGGLHGVGVSVVNALSVSLKLKIRRAGKI
HEMSFTHGVADGPLKVTGDAGGETGTEVTFTPSEQTFSNIEFEFGTLEHR
LRELAFLNSGVRIVLTDKRHSDIRREEMMYDGGLEAFVAYLDRAKKPLVQ
KPVSIRGEKDGITVEVAMWWNDSYHENVLCFTNNIPQRDGGTHMAGFRGA
LTRQITSYADTSGITKKEKVSLTGDDCREGLTAVLSVKVPDPKFSSQTKD
KLVSSEVRPVVESLVNEALSVWLEEHPSDAKILVGKVVEAAAAREAARKA
RELTRRKGALDISSLPGKLADCSERDPAKSELFLVEGDSAGGSAKQGRSR
ENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTSIGKDEFNADKL
RYHKIIIMTDADVDGAHIRTLLLTFFFRQMPELIERGHLYIAQPPLYKVA
RGKSVQYLKDEKALEDYLISMGLEEASLELASGEVRVGQDLREVINDALR
LRSLMEGLHSRYNRSVVEQAAIAGALNVELNGERDEYQLIAAEVARRLDV
IAEETERGWEAAVTAEGGLKLERMVRGVKEAAVLDMALIGSSDARHIDQL
KARLKEVYGAPPVLRRRDGTQEISGPRALLDAIFAAGRKGLTMQRYKGLG
EMNAEQLWETTLDPNVRSLLQVKVTDATDADGLFSRLMGDEVEPRRDFIQ
ENALSVANLDI
>gid:370470  helO  ATP-DEPENDENT HELICASE PROTEIN
MRALPALPIREILPGLGDALASAGSVVLSAPPGAGKTTLVPLFLLDQPWR
GDGKIILLEPRRLAARAAAGRMAELLGEKVGETVGYRMRLDNRVSGKTRI
EVVTEGVFSRMILDDPELSGVSAVLFDEFHERSLDADFGLALALDVQSAL
REDLRIVVMSATLDVERIAGLVGDAPVLKSEGRSFPIDIRYENRAAGESV
EDAMVRTIAEAHRSEKGSILAFLPGQAEIARTAARLADRFGEATAIVPLY
GNLSQKEQDAAIRPAPKGTRKIVLATSIAETSITIDGVRIVVDSGLQRLP
VFEASTGITRLETVRVSRASADQRAGRAGRTEPGIAIRLWHSGQTAALAA
FTPPQILASDLSGLLLDLAHWGVADPSTLRFLDPPPETTLREARGLLLEL
GAIDSRGALTPRGRRIRDLALPVRLAAMAVAAAEEGRAQEACLLAVMLTE
QGLGGNGIDIEERLRRFRSERSDRAKAARGLARRMAAELGASKNAGPKPV
LPGALLMHAFPDRIALQRGGRGRFVMANGRGAEIPETERLAAAGMLVIAD
LTGRAGAQRVLAAAEIDRSDVEGHMPEAIVTEEQSFFDRASRQVRARRVT
RLGAIIFEEKPLPRPSGEAAARALADGIRQLGLAAVPFPKDVEQLRDRIG
FLHRSIGEPWPDMSDTGLISRLDEWFVPFQGGAGGIDGIKGRDLAEGLMS
LVPYELQRDLARLAPTHFEAPTGQRHPIHYDGDEPLLSIRVQELFGLKTH
PAIGDGRLPLLLELISPGHRPIQTTRDLPGFWAGSWKDVRAEMRGRYPKH
PWPEDPANAMPTTRAKPRGT
>gid:371851  himA  PROBABLE INTEGRATION HOST FACTOR ALPHA-SUBUNIT PROTEIN
MSGKTVTRADLAESVFRKVGLSRTESAELVETVIDEICNAIVRGESVKLS
SFATFQVRDKNERIGRNPKTGEEVPISPRRVMTFKASNVLKQRVLKAHLS
RKSKLKPSNPAG
>gid:371889  hrm  HISTONE-LIKE PROTEIN
MNKNELVAAVADKAGLSKADASSAVDAVFETIQGELKNGGDIRLVGFGNF
SVSRREASKGRNPSTGAEVDIPARNVPKFTAGKGLKDAVN
>gid:374216  ialA  PUTATIVE INVASION PROTEIN A (ADENOSINE 5'-TETRAPHOSPHO-5'-ADENOSINE PYROPHOSPHATASE)
MTAEDLPYRPCVGVMVLNRQGLVWAGHRLAVGNSEYDGSPQLWQMPQGGI
DEGEDPLEAACRELYEETGIRSVSLLAEAPDWIHYDLPSHLIGIGLKGKY
RGQRQRWYAFRFEGDESEIAINPPPGGHEPEFDAWEWKPMHELPGSIVPF
KRRAYEEVVAAFSHLVR
>gid:370871  ihfB  PROBABLE INTEGRATION HOST FACTOR BETA-SUBUNIT PROTEIN
MIKSELVQIVAARNPHLYHRDVENIVNAVLDEITDALAAGNRVELRGFGA
FSVKNRPSRSGRNPRTGDSVFVEEKWVPFFKTGKELRERLNPGMNDNNNG
EDD
>gid:373000  ligA  PROBABLE DNA LIGASE PROTEIN
MLNQRKSVEQLNEAEAAEELAFLAAELARHDMLYHGKDAPEISDADYDAL
KRRNDLIEERFPVLVREDSPSRKVGAAPSLTFAPVVHARPMLSLDNTFSD
EDARAFVAGVYRFLGKLPDGSIAFTAEPKIDGLSMSLRYENRRLVTAATR
GDGTTGENVTANVRTIGMIPQTLPADAPDVVEIRGEIYMAKSDFAALNAE
MAAQGRPLYVNPRNTASGSLRQLDAKVTANRKLRFFAYAWGEMSAMPADT
QLGMVETFKAWGFPVNPLMQRFFSADELLEHYHHIERERPDLDYDIDGVV
YKVDRLDLQARLGFRSRSPRWATAHKFPAEQAFTRLKGIDIQVGRTGALT
PVARLEPITVGGVVVTNATLHNEDYIRGVGNTGEPIRDGRDVRIGDMVIV
QRAGDVIPQIVDVVMDERPEGAEPYRFPTTCPICGSHAVRDINEKTGKVD
AVRRCTGGFVCRAQAVEHLKHFVSRNAFDIEGLGSKQIEFFFESEDENLR
IRTAPEIFTLERRQEASLNKLENTDGFGKVSVRKLYEAINARRSIALHRL
IYALGIRHVGETTAKLLARSYGSYEHFGAAMTEAAGFSGDAWNELNSIDG
IGEVVARAIVEFYKEPRNLKVVSELLQEVTPESAELPVATDSPVAGKTVV
FTGSLEKMTREEAKAKAESLGAKVAGSVSKKTDIVVAGPGAGSKLDKARE
LGVQTMDEDEWLALIGG
>gid:372386  mfd  PROBABLE TRANSCRIPTION-REPAIR COUPLING FACTOR (TRCF) PROTEIN
MTMIPGFDPRKILAATREITIGPVPTGAEALVLAELARAGAPVAYILSDG
QKVADLEQVLGFVAPDIPVLTLPGWDCLPYDRVSPSADTSARRLAALSAL
IAHRRKPHPAIVLVTINAALQRISPQDVIESLAFTARPGNQIRMDDLAAR
LERNGFERVPTVREMGEFAVRGGILDVYVPGSGEPLRLDFFGDTLEAIRS
FDPASQRTIGQVRSLDLNPMSEVSLTPETISHFRKQYLSLFGAATRDDAL
YQAVSEGRRYAGMEHWLPLFYDRLETVFDYLDGFRIVTDHLAREAAAERS
KLVLDYYDARLASASPGKSQVTQGTPYKPVPPDMLYLTAKGFGEALNDLN
AVRLSPFTEHEGEARQVVNIEARQGLRWAKPAGEADNDGTRTNVFDQAVK
HIAEKRAKGAKVIVSGWTEGSLDRLLQVLAEHGLANIRPVKALSDIGSLK
PGEAASAVLSLEAGFETGDLVVIGEQDILGDRLVRRSKRRKRGADFIAEV
TGLDEGSYVVHAEHGIGRFVGLRTIEAAGAPHDCLELVYADDAKLFLPVE
NIELLSRYGSEGTDAVLDKLGGVAWQARKAKLKKRLLDMAGGLIRIAAER
HTRHAPVLAAQDGVYDEFAARFPYDETEDQLNSIDAVRDDLGRGRPMDRL
VCGDVGFGKTEVALRAAFIAAMNGVQVAVVVPTTLLARQHFKTFSDRFRG
LPIRIQQASRLVGSKDLALTKKEVAEGKTDIVVGTHALLGSSIKFANLGL
LIIDEEQHFGVKHKERLKELKTDVHVLTLSATPIPRTLQLALTGVRELSL
ITTPPVDRMAVRTFISPFDALVIRETLMREHYRGGQSFYVCPRVSDLPEI
HDFLKSDVPELKVAVAHGQMPATELEDIMNAFYEGRYDVLLSTTIVESGL
DVPTANTLIVHRADMFGLAQLYQLRGRVGRSKVRAFALFTLPVNKTLTGP
AERRLKVLQSLDTLGAGFQLASHDLDIRGAGNLLGEEQSGHIKEVGFELY
QQMLEEAVAELKGEEEIHDTGWSPQISVGTPVMIPEEYVPDLNLRLGLYR
RLGELTDLKEIDGFGAELIDRFGPLPTEVQHLLKIVYVKSLCRTANVEKL
DAGPKGVVVQFRNKEFPNPAALVGYIAKQGTVAKIRPDQSIFFQRELATP
EKRLSGAAMVMTQLAALAKAG
>gid:371370  mutL  PROBABLE DNA MISMATCH REPAIR PROTEIN
MAIKQLSETLINQIAAGEVIERPASAAKELIENALDAGATRIEIATAGGG
KTLLRVTDNGIGMSPADLELAIRRHCTSKLNDSLADIRTLGFRGEALPSI
GSVARLSITTRTAEAREGAAITVTGGRSEPARPSAAIVGTVVEVRDLFFA
TPARLKFMKSEKAEAAAISEVVRRMAIAFPRVRFVLSGSDRTTLEFPATG
DDRLARMAQVLGRDFRDNAIEIDAEREGARLTGFAGVPTFNRGNSLQQYA
FVNGRPVQDKLIMSALRAAYAETIPQGRYPIAVLSITLDPALVDVNVHPA
KSDVRFRDPGLIRGLIIGAIREALTREGDRAATTGAHGLMRAFRPEFHRA
GQQRPQEPWSAAASPHRPLRFEEAARGFAEAPQAAFSDFAQPSARSAAAA
VEATRATDGQAASFPLGAARAQLHENYIVAQTDDGLVIVDQHAAHERLVF
ETMRTALHARPVPAQALLIPEIVGLPEDDCDRLMAHAEEFTRLGLAIERF
GPAAVAVRETPAMLGEMDAAGLVRQLADELAEWDTASGLAGRLEYLAATM
ACHGSVRSGRRLRTEEMNALLRQMEATPGSGQCNHGRPTYIELKLADIER
LFGRS
>gid:370881  mutS  PROBABLE DNA MISMATCH REPAIR PROTEIN
MNFLMDASNRSGDVLSVSDLASEESRSTATPMMEQFIEIKANNPDSLLFY
RMGDFYELFFQDAVEASRALGITLTKRGQHMGQEIPMCGVPVHAADDYLQ
KLIASGYRVAVCEQVEDPAEAKKRGSKSVVRRDVVRLVTPGTITEDKLLS
PSESNYLMALARIRSGSEPAYALAWIDISTGIFRLAETAESRLLADILRI
EPRELILPDTVFHDPDLRPVFDVLGRVAVPQPAVLFDSATAEGRISRYYG
VGTLDGFGSFSRAELAAASAAVSYVEKTQLQERPALGIPERESAASTLFI
DPATRANLELAKTLSGSRDGSLLKSLDRTMTSGGARLLAERLMSPLTDPE
RINQRLDSIEVLADQPRFTTDVRDALRRAPDMPRALSRLALGRGGPRDLG
AIQAGMRAAAAISALLSGAELSAELTEARDAIAALPGELLARLDATLAEE
LPLLKRDGGFVREGASAELDEMRALRDQSRRVIAGLQLQYCEETGIKSLK
IKHNNVLGYFIEVTAGNAGSMTDTDAGRARFIHRQTMANAMRFTTTELAE
LETKIANAADRALAIELETFEAMVREVVAEAEAIKAAALALATIDVSAGL
AVLAEEQNYTRPTVDRSRMFAIDGGRHPVVEQALRRQAANPFVANGCDLS
PPNGEQGGAIWLLTGPNMGGKSTFLRQNALIAIMAQTGSFVPAAAAHIGV
VDRLFSRVGASDDLARGRSTFMVEMVETAAILNQATDRSLVILDEIGRGT
ATFDGLSIAWAAVEHLHEVNRCRGLFATHFHELTVLSEKLVRLSNATMRV
KEWDGDVIFLHEVGPGAADRSYGIQVARLAGLPASVVARARDVLAKLEDA
DRKNPASQLIDDLPLFQVAVRREEAARASSGPSKVEEALKALNPDDMTPR
EALDALYALKKELSNR
>gid:373553  mutT  PUTATIVE MUTATOR PROTEIN 7,8-DIHYDRO-8-OXOGUANINE-TRIPHOSPHATASE
MQGKKIVLVAACALVDADGRVLLAQRPEGKPLAGLWEFPGGKVESGETPE
ETLIRELEEELGIRTKVACLAPLTFASHGYDDFHLLMPLYICRRYEGFAE
GREGQAIKWVRPKALRDYAMPPADEPLIPFLMDLL
>gid:371496  mutY  PROBABLE A/G-SPECIFIC ADENINE GLYCOSYLASE PROTEIN
MMRDLPQAASAALLLEWYDRHHRDLPWRVPPAAARKGAVADPYRVWLSEV
MLQQTTVQAVKAYFEKFLALWPTVGDLAAADTEDVMKAWAGLGYYARARN
LKKCAEAVARDHGGRFPDSEEGLKALPGIGDYTAAAIAAIAFNRASAVLD
GNVERVISRLHAVETPLPAAKPEMRALVQALTPADRPGDFAQAMMDLGAT
ICTPKRPACSLCPFRTDCRALKTADPETFPRKAAKKEKPLRLGAAFVAVD
GLEAVYLRKRPETGLLGGMTEVPGTDWTSRRDGDTSIDAHPFPAEWEPCG
TVNHVFTHFELHLSVFRARVGRADIGEARTDTSGWWEPLASLRAQALPTV
MKKAIAKAIPHAFEAG
>gid:370596  nth  PROBABLE ENDONUCLEASE III PROTEIN
MKNVKLKSPPQEPKPAKATTRRTGGRTGPRSAYRTAEVEEIFRRFSVQRP
EPKGELEHVNPFTLVVAVALSAQATDAGVNKATRQLFAVADTPEKMLALG
EERVRDYIKTIGLYRNKAKNVIALSEKLIADFGGEVPRTREELVTLPGVG
RKTANVVLSMAFGQPTMAVDTHIFRIANRIRLAPGKTPDEVEAHLLRVIP
EHYLFHAHHWLILHGRYVCKARRPECERCVIADLCKSPEKTCDIPAPLVE
LPPQAISVAARSR
>gid:371822  parC  PROBABLE TOPOISOMERASE IV SUBUNIT A PROTEIN
MGQSLLPPSGGDDNIQPVDLKAALEERYLAYALSTIMHRALPDVRDGLKP
VHRRIIHAMSEMGLRPNSSFKKCARIVGDVIGKFHPHGDQSVYDALVRLA
QDFSQRYPVVDGQGNFGNIDGDNAAAYRYTEAKMTEVAALLLEGIDQDAV
DFRPTYNEEDQEPTVLPGAFPNLLANGASGIAVGMATSIPPHNAHELCDA
ALHLIKHPDATVEDLLFDPANPQRGGIEGPDFPTGGVIVESRASMAESYR
TGRGGFRVRARWAVEDLGRGGFQIVVTEIPYQVQKSRLIEKIAELLIARK
LPLLEDIRDESAEDVRVVLVPKSRSVDANILMESLFKLTELESRIPLNMN
VLSMGRVPRVMALNEVLSEWLAHRREVLQRRSRHRLAAIDRRLEILGGYL
IAYLNIDEVIRIIREEDEPKAVMIERFGLTDVQAEAILNMRLRSLRKLEE
FEIRTEFDSLSKEKAEIEALLASGDKQWQAVAWEIGEVKKKFAKATELGK
RRSTFSDAPDADVEAIQQAMIEKEPITVVISEKGWIRALKGHIADTSSLQ
FKDGDGLKVSFPAQTTDKILIFTTGGKAYTLGGDKLPGGRGHGEPLRIMV
DMENDQDVLTAFVHDPARKLIVSSTAGNGFVVTESDIVANTRKGKQVMNV
TMPDEAKLVVPVKGDHLAVVGENRKMLVFPLVQVPEMARGKGVRLQRYKD
GGVSDIRSFAIAEGLTWEDSAGRVFTKTRDELIEWMGDRAGAGRVVPKGF
PRSGKFSG
>gid:372118  parE  PROBABLE TOPOISOMERASE IV SUBUNIT B PROTEIN
MDDSSDLFSAMPQQPKPADAAPRKPAAPAAGNEASRPAPKTSDGSDYDAS
AIEVLEGLEPVRRRPGMYIGGTDEKALHHLFAEVIDNSMDEAVAGHANFI
DVNLDADGYLTVTDNGRGIPVENHPKFPGKSTLEVVMTVLHAGGKFDGKA
YETSGGLHGVGVSVVNALSDDLEVEVARNRRLYRQRFSRGIAQGGLEDLG
DVHNRRGTRVRFHPDPQIFGPHARFEPARLFRMARSKAYLFGGVEIRWSC
DPALLPEGSEIPDRAVFHFPGGLKDYLAATLGKEFTVTREIFAGKSEKSG
GHGSLEWAVTWYGGDQQIHSYCNTIPTPEGGTHEAGLRIALTKGLKNYAE
LTQNKRAAIVTTDDVMISAAGMLSVFIREPEFVGQTKDKLATVEAQRIVE
NALRDPFDHYLADNPAEAAKLLDWVVERAEERVRRRKEKEVSRKTAVRKL
RLPGKLADCAQNTAEGAELFIVEGDSAGGSAKQARNRANQAILPLRGKIL
NVASAGREKLGANQQIGDLVQALGCGTRSKYREEDLRYERIIIMTDADVD
GAHIASLLITFFYQEMPELIRGGHLYLAVPPLYRLSQGSKTLYARDDAHR
EELMRTEFNGRGKVELGRFKGLGEMLPAQLKETTMDPAKRTLLKVEIDDV
DFEGTRDAVDSLMGTKAEARFRFIQERAVFADNLDI
>gid:370605  polA  PROBABLE DNA POLYMERASE I PROTEIN
MKNGDHLFLVDGSGFIFRAFHAIPPLNRKSDGLPVNAVAGFCNMLWKLLT
DARDTSVGVTPTHLAVIFDYSSKTFRNGLYDQYKANRTAPPEDLIPQFGL
IRHATRAFNLPCIEKEGYEADDLIATYARLAEEAGADVTIVSSDKDLMQL
VTPKVSMYDSMKDKQITVPDVIEKWGVPPEKMIDLQAMTGDSTDNVPGIP
GIGPKTAAQLLEEYGDLDTLLARAGEIKQQKRRESIIANADLARLSRELV
TLKKDTPLDVPPEDFRLDSQDGPKLIAFLKAMEFTTLTRRVAAATDTDAE
AIEPAHVPVEWGAQAHGPDLDVGEAGGPPPSPQSSSATPPRGNAARAAVS
FLSSGQDADTTGATPTGLAEARAAYFGKAPFDHSGYRTIRDIDTLERWIA
DAREAGLVGFDTQATSPDAMRADLVGFSLAVADYANDPSGSRIRAAYVPL
AHKSGVSDLLGGGPVDSQVPGRETLSRLKELLEDPSVLKVGQNLKYGYLV
MKRHGIAMRSFDDTMLMSYVLDAGNGAHGMDSLAERWLGHTPIAYKDVTG
TGRSSLTFDFVDIDKATAYAAEDADIALRLWHVLKPRLAAKGLTRVYERL
ERPLISVLAGMEERGITVDRQILSRLSGELAQGAAALEDEIYRLAGETFT
IGSPKQLGDILFGKMGLPGGSKTKTGQWSTSAQVLEDLAAAGHDLPRKIV
DWRQLTKLKSTYTDALPGFVHPETKRVHTCFAMAATTTGRLSSSDPNLQN
IPIRTGEGRKIRTAFVATPGHKLVSADYSQIELRVLAHVADIPQLRQAFA
DGVDIHAMTASEMFGVPVDGMPSEIRRRAKAINFGIIYGISAFGLANQLS
IERSEAGDYIKRYFERFPGIRDYMENTKAFARENGYVETIFGRRAHYPDI
RSSNPSMRAFNERASINAPIQGSAADIIRRAMVKMEPALEAAKLSARMLL
QVHDELIFEVEDGEIERTIPVIISVMENAAMPALDMRVPLKVDARAAHNW
DEAH
>gid:374061  priA  PROBABLE PRIMOSOMAL PROTEIN N' (REPLICATION FACTOR Y)
MPMPAPRAYSYAVPDGMAVEPGSIVQVPLGPRFVVGVVWDGEDDGGVDPK
KLKQIEKVFDCPPLGPDMRAFLDWVAAYTLTPPGLVVRMALRAPTAFDPE
PMVEALRLTEMRPERMTAARERVLATASDGLSWTRSGLAHAAGVSSSVID
GLTSQGVFETVFMPPPPVVAAPDPAFAAPRLEGPQKQAAADLLAAVEERK
FSVSLIDGITGSGKTEVYFEAIAATLRAGKQVLILLPEIALTASFLERFQ
DRFGAKPAEWHSDLAPRMREKVWRQVTTGQVRVVAGARSALFLPFEDLGL
IIVDEEHDPAYKQEDRVFYNARDMAVVRGRIGNFPVVLVSATPSVESRVN
GEVGRYRPIHLPTRFGDAALPDLGIVDMRRHPPERGGFLSPVLVNQIGKT
IGRREQALLFLNRRGYAPLTLCRVCGHRFQCPDCSSWLVEHRFRGQIQCH
HCGYSERTPEACPECGTLDHLVACGPGVERIAEEVERHFPDARTIVLSSD
LMGVKRLRLELEAIARGEADIVIGTQLVAKGHNFPNMTLVGIVDADLGLA
NGDPRAAERTFQLLSQVTGRAGRTGLKSLGLLQTYQPQHPVMQAIVSGDS
DAFYEREIHERERAVLPPFGRLASVIVSADTRGDAEGHARGLRNAAPRVD
GIAILGPAEAPLALIRGRHRFRLLVHGRRNSDMQSFVRAMIAAGPKERGS
VSVQLDIDPQSFL
>gid:372479  radC  PROBABLE DNA REPAIR PROTEIN
MTKIPPGEPDELFDAADERGFFPEKRTRNSPATAPAPATDTHYHGHRDRL
RARYREHGDAALADYEILELILFRLIPRRDTKPIAKALLDRFGTLAAVFG
APLHLLQEVKGVGESVALDLKLVATASHRMLRSELRNKQVLSSWSAVIDY
CHAAMAHETKEQFRILFLDKRNTLIADEVQQQGTIDHTPVYPREVVKRAL
ELSATALILVHNHPSGDPTPSRADIDMTKLIAEAAKPLGIALHDHVIIGK
DGHVSLKGLRLF
>gid:372559  recA  DNA STRAND EXCHANGE AND RECOMBINATION PROTEIN
MDKSKALEAALSQIERSFGKGSIMKLGAKDSVVEIETVSTGSLGLDIALG
IGGLPKGRIIEIYGPESSGKTTLALQTIAEAQKKGGICGFVDAEHALDPV
YARKLGVDLENLLISQPDTGEQALEITDTLVRSGAIDILVIDSVAALVPR
AEIEGEMGDSLPGMQARLMSQALRKLTASISKSNCMVIFINQIRMKIGVM
FGSPETTTGGNALKFYASVRLDIRRIGSVKEREEVVGNQTRVKVVKNKMA
PPFKQVEFDIMYGEGVSKTGELIDLGVKAGIVEKSGAWFSYNSQRLGQGR
ENAKLFLRENPELLREIETALRQNAGLIADRFLENGGPESDGDEAADM
>gid:370620  recF  DNA REPAIR PROTEIN
MPHKVFLTRLKLSDFRNYATLALDLDQRHVVLTGENGAGKTNLMEGVSFL
SPGRGLRRAAYADVARVGAPDGFSVFAAVDGMEGSVEIGTGTQGTEEGQS
RRLRINGTAARTVDELTDHLRVLWLTPAMDGLFTGPSADRRRFLDRLVLS
LDPEHGRRASEFDRAMRSRNRLLSEFRPDPAWLSAIEREMAGLGISMALA
RQEMLGLLSALVERSRSDGTFPSASLSLAGFLDDCAGIPAFELEERYLAM
LAEGRARDAAAGRTLDGPHRSDLLIRHREKDIEAERCSTGEQKALLVGLV
LAHARLVGDMTGHAPVLLLDEIAAHLDQGRRAALFDLVDGLGGQSFMTGT
DRAMFDALGERAQYLAVANGRVSG
>gid:372388  recG  PROBABLE ATP-DEPENDENT DNA HELICASE PROTEIN
MRPALLDPLFSPLDTLPGIGPKTGELYARLLGRETVEDCRVVDLLFHIPH
SLIDRRRQPGIAHAPNGAIVTITGRVDRHQPAPSGRSNVPYRVFLHDETG
ELALTFFRVRGNWLEKALPIDETVIVSGKVDWFNRRASMVHPDYMVRAAE
SENMPLVEPVYGLTAGLTSRPLRKSIEAAVARVPDLPEWLDEALLRQQGF
KSAKESFQRLHEPRDETDIDAQAPARRRIAYDEFLAGQLSLSLVRQRLRK
VAGTPIHPTGRLSGPVIAALPFSLTNSQSAAVDEILADMSGADRMLRLLQ
GDVGSGKTAVALMAMLAAVESGGQAVLMAPTEILARQHHATLSRMAAPAG
ITIDILTGRTKGKERDAILERIASGETQLVIGTHALFQDAVIYRQLVLAV
VDEQHRFGVHQRLRLTAKGISPHMLVMTATPIPRTLVLAAFGDMDVSKLT
EKPAGRKPIQTVTIPNERTDEIVERLDAALRQGKKAYWICPLVEESEETD
AMSADERYQSLARRFGKDVGLVHGRMAGPEKDAVMLAFKNGEIRLLVATT
VVEVGVDVPDATIMVIEHAERFGLAQLHQLRGRVGRGDEASTCILLYKSP
LSEAGRARLSVLRESEDGFLIAEEDLKLRGEGELLGTRQSGTPGFLIASL
EAHADLLEMARKDAAYVIDRDPELTSERGQALRTLLYLFRRDEAIRFLRA
G
>gid:372471  recJ  PROBABLE SINGLE-STRANDED-DNA-SPECIFIC EXONUCLEASE PROTEIN
MDVLADPIQRAFLGVERSVSGQRWVSRLDQAGQNRALAISQVHGYSELIA
RVLAGRGVGLDDAAAFLEPTLRALMPDPDTLTDGRKAAERLADAIRRREK
VVIFGDYDVDGAASSALMARFLRHFDITAEIYIPDRIFEGYGPNPQAIDQ
LIDRGSELIVTVDCGSTSHEALAVAAERRTDVVVIDHHQVGSVLPPCHAL
VNPNREDDLSGQGHLCAAGVVYLVLVNTLRVLRLRGDPRAASFDLLSLLD
LVALATVCDVVPLKGLNRAYVVKGLLAARHMGNAGLAALLRKAAIGGPVT
PYHLGFLLGPRINAGGRIGDAALGSRLLTLDVAAAAEAIASQLDELNRDR
QAMEAAMLAEAEAEVLAEYGTGEGASVIVTARQNWHPGIVGLIAARIKEK
FRRPAFAIAFDPNGRGTGSGRSINGFDMGRLVRAAVDNGLLVKGGGHAMA
AGLTVERGKLGQLRQFFEERAETAIRDLVAVQTLKVDGALAAAGATLDLV
DLLDQAGPYGSGHPQPLFAFPQHRLRDSRQVGANHVKVTLEGQDGSRMEG
IAFRAAETPLGDFLLGNRGATVHVAGSVSADLWQGTRKVQLRVTDAAKAN
>gid:373001  recN  PROBABLE DNA REPAIR PROTEIN
MLAQLAIRDIVLIERLDLSFDAGLSVLTGETGAGKSILLDSLSLALGGRG
DGSLVRHGEDRGQVTAVFDVPAGHTARLFLRENGIDDDGDLIFRRVQSAD
GRTKAFINDQPVSVQLMRQVGQTLVEIHGQHDDRALVDTDAHRTLLDAFG
GTTEAAEDVAAFYRAWKDAERCLKKHREKVEAAAREADYLRSSVEELETL
SPRDGEEEELAENRARMMKAERIAGDISEASEFLNGNASPVPLIASLVRR
LERKSHEAPGLLEETVELLDGALNQLADAQMAVERALRDTEFDPKELERV
EERLFALRAASRKYSVPVTELPALAARMIADLADLDAGEEKLKQLEIQVA
EARAAFDVAARSLSEKRHNTAAALSAAVMEELPALKLERARFMVEVTSDP
GSPTADGIDAVEFHVQTNPGTRPGPIMKVASGGELSRFLLALKVALADRG
SAPTLVFDEIDTGVGGAVADAIGQRLKRLSKTVQVLSVTHAPQVAARAAT
HLLISKGPSAEKTEMIATRVARMDEAARTEEIARMLAGASITEEARAAAA
RLLAGNG
>gid:373190  recQ  PROBABLE ATP-DEPENDENT DNA HELICASE PROTEIN
MPSISLDSILRAPDESDSMPESHNTGRLFETEGVSNPLDLLKRIYGYSTF
RGQQQAVVDHVVAGGDAVVLFPTGAGKSLCFQIPALCRRGVGIVVSPLIA
LMRDQVEALKQLGIRAAALNSSLTRDEAIAVRRALSRDELDLLYVTPERA
VTDGFAEMIADADIALFAIDEAHCVSQWGHDFRPEYRGLGCLAERFPGVP
RIALTATADPHTRDDMIERLGLGGARVFASSFDRPNIAYEIVERDQPRQQ
LLRFLSRFKDASGIVYCLSRAKVEDTAEWLDAQGIRALPYHAGMERAARD
AHQDAFLKEENLCLVATVAFGMGIDKPDVRYVAHLDLPGSVEAYYQETGR
AGRDGLPSEVWMAYGMADVIQRRRMIDEGGAPEEIKRIERAKLNSLLAIC
ETAGCRRQAILAHFGEAHPGGCGHCDTCLKPVETWDGTEAAIKALAAVYR
TGERFGTGHLIDVLTGSVNEKTERFGHVDMPVFGAGKDLPARTWQSIFRQ
LLAAGLISVDHAAFGALKLEPEARSVFRRERQVLFRKDRPSSGKAKTARG
SKPASERSDLAGSDLELFERLRSERLSLAREMDVPPYVVFPDTTLIALAK
RRPRDFEELLDVPGIGESKRERYGEAFLAVIEAFEG
>gid:370676  recR  PROBABLE RECOMBINATION PROTEIN
MAKRVTGPEIEKLIQLLAKVPGLGPRSARRAALHLVKKKEQLLGPLAEAM
GEAHRKVKICSCCGNVDTIDPCTVCTDERRDQAVIIVVEDVADLWALERA
GAMNAAYHVLGGTLSPLDGIGPEDLNIRGLIDRVAKGGVRELIIAVNATV
EGQTTAHYITDQLEGMEVRITRLAHGVPVGGELDYLDEGTLAAALRARTV
I
>gid:372498  rhlE1  PUTATIVE ATP-DEPENDENT RNA HELICASE PROTEIN
MTNFADLGLSQKVLSAVTDAGYTIPTPIQVGAIPPALQRRDILGIAQTGT
GKTASFVLPMLTLLEKGRARARMPRTLILEPTRELAAQVAENFDKYGKNH
KLNIALLIGGVSFDEQDRKLERGADVLICTPGRLLDHCERGKLLMTGVEI
LVIDEADRMLDMGFIPDIERIAKLIPFTRQTLFFSATMPPEIQKLADRFL
QNPERLEVARRSSTAITVTQRFVAAHGKDYEKRAVLRELIRSQEDLKNAI
IFCNRKKDVAELFRSLDRHGFSVGALHGDMDQRSRMAMLANFKDGNIKLL
VASDVAARGLDIPDVSHVFNFDVPIHAEDYVHRIGRTGRAGRSGAAFTIV
TKRETKFSDAIEKLIGQKVEWFNGDLSELPEPMESHDSRRDRGSDGRKDR
KRERPGKVHADRKPVTASGNNTEELETVPERVDAVKIERNADSKDRHNGR
SDRPGRAQNAANDDNRDRRQRNRRDHDDGPTPVGFGDDIPAFMLIVGKA
>SMb20880 rhlE2, putative ATP-dependent RNA helicase protein
MSTFKELGLSEHIVATLSANGFEKPTPIQAQAIPLVLKDHDLIGLAQTGT
GKTAAFGLPMIEKLVADGRRPDPRNIRALVLAPTRELVNQIAANLKLFVK
KSPLKIGLVVGGVSINKQTEQLARGVDILVATPGRLLDLVSRKAVTLTQA
RYLVLDEADQMLDLGFIHDLRKISKLVPKNRQTLLFSATMPKLIAELAGE
YLTDPVKVEVSPPGKAADKVEQYVHFVPGKDLKTTILKQTLTANPDGLSL
IFSRTKHGAEKLMKHLDHVGFKAASIHGNKSQGQRERALKAFRDGEIRVL
VATDVAARGIDIPGVTHVYNYDLPEVPDAYVHRIGRTARNGRDGIAIAFC
APDEIRLLRDIEKLMGIQIAVASGEAPADQARPSKGRGGRGNGQPRGNGQ
PRGNGAGQRQGGPRRDRPQRQAAAGGFAGDELLRDERSHERRDQRAAGHG
PADGRPEGHRNHKAQKHHGRPGPQQARRGSEQSGERNGGGNRPQRADGRG
HQR
>gid:371477  rnhA1  PROBABLE RIBONUCLEASE HI PROTEIN
MKHVHIFTDGACSGNPGPGGWGAVLRYGDVEKEMSGGEAETTNNRMELLA
AISALNALRQPCEVDLHTDSKYVMDGISKWIHGWKRNGWKTGDRKPVKNG
ELWQALDEARNRHNVTWHWVKGHAGHPENERADELARKGMEPFKKARRAD
AVK
>gid:371110  rnhA2  PROBABLE RIBONUCLEASE HI PROTEIN
MTTEVLPPVQDTRRYIVHTDGACRNNPGPGGWGAVLQLEEEGEIIKEKDL
SGTTISETTNNRMELSAVLGGLRRLKDKTIPITVRSDSKYVVNGMSTWLE
GWKRNGWRKADKTTVLNMDLWQELDQLQETLGPISWVWVKGHSGDPMNDR
CDRLANVAIDNALKRSNAA
>gid:371396  rnhB  PROBABLE RIBONUCLEASE HII PROTEIN
MSRRKQPDSPLFPLQAPVPDFTFERAAHRDGFWPVAGADEAGRGPLAGPV
VAAAVILDPDAIPAGLNDSKLLTAEQREALFEEILATSTVSIASSSSARI
DTTDILKASLDAMRRAVHGLELAARIVLVDGRDVPPGLSCHAKAIVKGDS
RSVSIAAASIVAKVTRDRMMARADATFPLYGFAHHAGYATVKHRTAIESH
GPCSLHRMSFRPFRQV
>gid:373707  ruvA  PROBABLE HOLLIDAY JUNCTION DNA HELICASE PROTEIN
MIGKLKGTIDEIGEDHVVLDVHGVGYVAHCSARTLGKLGSAGEAAVLFIE
TYVREDQLKLFGFLSALEREWFRLLQSVQGVGSKVALAVLSTLTPGELAN
AIALQDKTSISRAPGVGPKVAVRIVTELKNKAPAFSGEMAPSIGLKQELG
EGVAAAPVADAVSALTNLGYSRDQAANAVAAALKNGGEGGDSAKLIRLGL
KELSR
>gid:373706  ruvB  PROBABLE HOLLIDAY JUNCTION DNA HELICASE PROTEIN
MSEAARLIAPEKRGEDLDATMRPQTLDEFTGQAEARANLKIFIEAARNRG
EALDHVLFVGPPGLGKTTLAQIMAKELGVNFRSTSGPVIAKAGDLAALLT
NLEERDVLFIDEIHRLNPAVEEILYPAMEDFQLDLIIGEGPAARSVKIDL
AKFTLVAATTRLGLLTTPLRDRFGIPVRLNFYTVEELELIVRRGARLMGL
GMTDEGAREIARRARGTPRIAGRLLRRVRDFAEVARAEAVTLKIADEALT
RLLVDSMGLDQLDRRYLTMIAQNFGGGPVGIETIAAGLSEPRDAIEDIIE
PYLIQQGFIQRTPRGRVLTANAWKHLGLNPPRDVEASQFRLTLEDD
>gid:373708  ruvC  PROBABLE HOLLIDAY JUNCTION ENDODEOXYRIBONUCLEASE PROTEIN
MQNTIRIIGIDPGLRRTGWGVIETLGNSLRFVASGTVTSDGELDLASRLC
QLHDGLAEVVHGYQPHEAAVEQTFVNKDATATLKLGQARGIAMLVPARAG
LRVAEYAPNAVKKAVIGVGHGEKQQIHMMLKVLMPKAEFKGNDAADALAI
AICHAHNRQAVTSRLAALAG
>gid:371949  smf  CONSERVED HYPOTHETICAL PROTEIN
MPLGGARRTGTALSERQKIAWLRLIRSDNVGPATFRDLISHFGNAEAALE
ALPELSRRGGADRSFRIATVDEAERELEAAHRFGAVFVGIGEPDYPDALR
QIDGAPPLLATKGNLNATARPSLGIVGSRNASVSGAKFAAMIARDAGAAG
YVITSGLARGIDTAAHRASLRTGTIAVLAGGLDRPYPPENLGLLQEIVSG
EGLAVSEMPFGWEPRARDFPRRNRLVAGISLGVAIVEAANRSGSLITARY
AADFGRLVFAVPGSPLDPRCHGTNDLLKQGATVTTSSADVIEALAPLSRD
DLFSRLEANEPSAEEPRPMPQPPDDTDRSRVVEALGPTPVAIDDLIRYTG
LAAPQIHMVLVELDLAGQLCRHGGNLVSLATAE
>gid:372275  ssb  PROBABLE SINGLE-STRAND BINDING PROTEIN
MAGSVNKVILIGNVGADPEIRRTQDGRPIANLRIATSETWRDRNSGERRE
KTEWHTVVVFNEGLCKVVEQYVKKGAKLYIEGQLQTRKWQDQTGNDRYST
EVVLQGFNSTLTMLDGRGGEGGGAGRGGSDYGGGGYEEYDQSRPSSGGAR
SGGQSNQPNQGGNFSRDLDDDIPF
>gid:371332  tag  PROBABLE DNA-3-METHYLADENINE GLYCOSYLASE I PROTEIN
MAARGLITGDDGLDRCAWHGNLEDYRRYHDEEWGRPVADDHRLFEKICLE
GFQSGLSWLTILRKRDAFRAAFAGFDFDKVAEFGEEDVERCLADAGIVRH
RGKIVSTINNARRAKELRAEFGSLASYFWSHEPDGSERPQIVDFETLIAN
PTTPASVRISKDLKKRGWTFVGPTTVYAFMQAMGLVNDHIEGCFCRAPIE
DLRRRFARPAA
>gid:371948  topA  PUTATIVE DNA TOPOISOMERASE I PROTEIN
MFQRMSMNVVVVESPSKAKTINKYLGPGYKVLASFGHVRDLPAKDGSVRP
DEDFEMSWEVDGASAKRMKDIADAVKSSDGLILATDPDREGEAISWHVLD
LLRKKKVIGDKPVKRVVFNAITKKAVLDAMAEPRDIDASLVDAYLARRAL
DYLVGFNLSPVLWRKLPGARSAGRVQSVALRLVCDREAEIERFVTEEYWN
ISALLKTPRGDEFEARLVSADGKRLPPKAIGNGEEANRLKSLLDGASYVV
ESVEAKPVKRNPSPPFTTSTLQQAASSKLGFSASRTMQVAQKLYEGVDIG
GETVGLITYMRTDGVQMAPEAIEAARQAIGSQFGERYLPEKPRFYSTKAK
NAQEAHEAIRPTDFDRTPDQVRRFLDGDMLRLYDLVWKRGIASQMASAEI
ERTTAEIVADNAGKKAGLRATGSVIRFDGFIAAYTDMKEDGEQADDGDED
GRLPEINARENLAKQKINASQHFTEPPPRYSEATLIKKMEELGIGRPSTY
AATVTTLIDRDYVEIDKRKLVPQAKGRLVTAFLESFFTRYVEYDFTASLE
EKLDQISAGELNWKDVLRDFWKDFFSQIEDTKELRVTNVLDALNEELAPL
VFPKREDGGDPRICQVCGTGKLSLKLGKYGAFVGCSNYPECNYTRQLSSD
SNGDAEAAAANEPQSLGKDPHTGEEITLRNGRFGPYVQRGDGKEAKRASL
PKGWTPAAIDHEKALALLSLPRDIGPHPESGKMISTGIGRYGPFVLHNGT
YANLESVEDVFSIGLNRAISVLADKQSKGAGGGRASAAALKELGEHPDGG
AITVRDGRFGPYVNWGKVNATLPRGKDPQSVTVEEALALIAEREAKGGVT
KGKAAKGKPTGGKSAGTKSAKAASAGTAKAEKPKSAAKTKTKAAAKAKKD
>SMb21087 traA2, putative conjugal transfer protein
MAIMFVRAQVIGRGAGRSIVSAAAYRHRTRMIDEQAGTSFSYRGGASELV
HEELALPDDIPAWLKAAIAGRSVAKASEALWNAVEAHETRADAQLARELI
IALPEELTRAENIALVREFVRDNLTSKGMVADWVYHDKDGNPHIHLMTAL
RPLTEQGFGPKKVPVLGEDGEPLRVVTPDRPNGKIVYKLWAGDKETIKAW
KIAWAETANRHLALAGHEIRLDGRSYAEQGLDGIAQKHLGPEKAALARKG
IAMYFAPADLARRQEMADRLLAEPGLLLKQLGNERSTFDERDIAKALHRY
VDDPVDFANIRARLMASDELVLLKPQQIDAETGKAKQPAVFTTREMLRLE
YAMAQSAEVLSRRKGFGVSNARAAAAVRSIETADTEKPFRLDLEQVDAVR
HVTRDNAIAAVVGLAGAGKSTLLAAARAAWEGEGRRVIGAALAGKAAEGL
EDSSGIRSRTLASWELAWESGREQLQRGDVLVIDEAGMVSSQQMARVLKA
VEDAGAKAVLVGDAMQLQPIEAGAAFRAISERIGFAELAGVRRQRDAWAR
DASRLFARGKVEEGLDAYAQQGRIVETETRAEIVDRIVADWANARRDLLQ
KSADGEHPGRLRGDELLVLAHTNDDVRKLNEALRNVMIGEGALTGAREFQ
TARGLREFAAGDRIIFLENARFVEPRARRLGPQYVKNGMLGTVVSTGDRR
GDTLLSVRLDSGRDVVISQDSYRNVDHGYAATIHKSQGSTVDRTFVLATG
MMDQHLTYVAMTRHRDRADLYAAKEDFEPKPEWGRKPRVDHAAGVTGELV
EEGMAKFRPNDEDADESPYADIRTDDGTVQRLWGVSLPKALKDAGAAEGD
TITLRKDGVERVKVQVPIVDEQTGEKRFEERQVDRNVWSASQLETAAARR
ERIERESHRPQLFKQLVERLSRSGAKTTTLDFEGEAGYQAQARDFARRRG
LYHLSLVAAGMEAEVLRRWAGIAEKREQVAKLWERASVALGFAIERERRV
SYNEERTETLSTGIPSDGKYLVPPTTTFSRSVAEDARLAQLSSQRWKERE
AILHPVLAKIYRDPDGALSALNALASDAAIEPRKLAEDLGLAPDRLGRLR
GSELVVDGRAARDERTAATVALSELLPLARAHATEFRRNAERFGIREQQR
RAHMALSVPALSKTAMARLVEIEAVRKQGGDDAYRTAFAFAVEDRLLVQE
VKAVNEALTARFGWSAFTAKADVIAERNIAERMPEDLAPERREKLTRLFA
VIRRFAEEQHLAERQDRSKIVAGASVELGKGTFAVLPMLAAVTEFKTTVD
EEARERALAAPHYAHHRAALVETATRVWRDPADAIGKIEDLIVKGFAAER
IAAAVTNDPAAYGALRGSDRIMDKLLAVGRERKGALQAVPEAASRIRSLG
ASYASALDAETRSITEERRRMAVAIPGLSPAAEDALKRLAAQIKNKDGKL
DVAAGSLDPRIAREFAKVSRALDERFGRNAILRGETDVINRVSPAQRRAF
EAMRDRLTILQQAVRVQSSEKIVSERRQRAINQSRGIRM
>gid:373916  ung  PROBABLE URACIL-DNA GLYCOSYLASE PROTEIN
MDATIRLEESWKAVLGGEFRHGYMAELKRFLLEEKQQGRQIFPRGVEYFR
ALDLTPLDRVRVVILGQDPYHGDGQAHGLCFSVRPGVRTPPSLVNIYKEL
QEDLGIPPARHGFLESWARQGVLLLNSVLTVERGRAASHQGRGWERFTDA
VIRAVNEQAQPVVFMLWGSYAQRKAAFVDRSRHLVLTAPHPSPLSAHAGF
FGCRHFSKANAFLTSKGLDPIDWRLPEDPPLAVERQMAPNC
>gid:372274  uvrA  PROBABLE EXCINUCLEASE ABC SUBUNIT A (DNA REPAIR ATP-BINDING) PROTEIN
MSELKTISIRGAREHNLKGIDLDLPRNKLIVMTGLSGSGKSSLAFDTIYA
EGQRRYVESLSAYARQFLEMMQKPDVDQIDGLSPAISIEQKTTSRNPRST
VGTVTEIYDYLRLLFARVGVPYSPATGLPIESQTVSQMVDRVLEFGEGTR
LYMLAPLVRGRKGEYRKELAELMKKGFQRVKVDGQFYEIADVPALDKKYK
HDIDVVVDRVVVRPDIGTRLADSIETCLTLADGLAIAEFADRPLPPEETS
AGGSANKSLNETHERVLFSEKFACPVSGFTIPEIEPRLFSFNNPFGACTT
CDGLGSQQKIDEALIVPEPNRTLRDGAIAPWAKSTSPYYNQTLEALGTVF
GFKLGSRWSELSEEAQEAILHGTKDKITFHYQDGARSYNTTKTFEGIVPN
LERRWKETDSAWAREEIERYMSAAPCPACAGYRLKPEALAVKIHALHIGE
VSEMSIRAARDWFEVLPEHLSTKQNEIAVRILKEIRERLRFLNDVGLEYL
SLSRNSGTLSGGESQRIRLASQIGSGLTGVLYVLDEPSIGLHQRDNARLL
DTLRHLRDIGNTVIVVEHDEDAILTADYVVDIGPAAGIHGGEVIAEGTPS
DIMSNPKSLTGKYLSGELSVAVPGERRKPKKKKEVTVVGARANNLKNVTA
SIPLGVFTAVTGVSGGGKSTFLIETLYKAAARRVMGARENPAEHDRIDGF
EHIDKVIDIDQSPIGRTPRSNPATYTGAFTPIRDWFAGLPEAKARGYQPG
RFSFNVKGGRCEACQGDGVIKIEMHFLPDVYVTCDVCHGKRYNRETLDVH
FKGKSIADVLDMTVEEGVEFFAAVPAVRDKLVTLNQVGLGYIKIGQQANT
LSGGEAQRVKLAKELSKRSTGRTLYILDEPTTGLHFHDVAKLLEVLHELV
NQGNSVVVIEHNLEVIKTADWIIDFGPEGGDGGGEVIAQGTPEEVVKEPR
SYTGQFLKELLERRPVKKVVAAE
>gid:372682  uvrB  PROBABLE EXCINUCLEASE ABC SUBUNIT B PROTEIN
MARSTKNSPKKSSAPGGFEEAPQAPLSGTPLSGNVSDWVKQLESEAEAST
FESRREVASKAGRHRKKVEISASKSARGTSMGGTTDPKTRAAAGLNPVAG
LDVALEDADKLTSGSGVTATVEALAKLIESGNPLFKDGKLWTPHRPARPE
KSEGGIAIRMQSDYEPAGDQPTAIADLVDGLTSGERNQVLLGVTGSGKTF
TMAKVIEATQRPAVILAPNKTLAAQLYSEFKNFFPDNAVEYFVSYYDYYQ
PEAYVPRSDTYIEKESSINEQIDRMRHSATRSLLERDDVIIVASVSCIYG
IGSVETYTAMTFQMSVGDRLDQRQLLADLVAQQYKRRDMDFQRGSFRVRG
DTIEIFPAHLEDAAWRISMFGDEIDSITEFDPLTGQKTGDLKSVKIYANS
HYVTPRPTLNAAIKAIKEEMSQRLAELERAGRLLEAQRLEQRTRYDIEML
EATGSCQGIENYSRYLTGRRPGEPPPTLFEYIPDNALIFIDESHVTIPQI
GGMYRGDFRRKATLAEYGFRLPSCMDNRPLRFEEWDAMRPDTIAVSATPG
AWEMEQAGGVFAEQVIRPTGLIDPPVEVRSAKTQVDDVLGEIRETAAAGY
RTLVTVLTKRMAEDLTEYLHEQGVRVRYMHSDIDTLERIEIIRDLRLGAF
DVLVGINLLREGLDIPECGFVAILDADKEGFLRSETSLIQTIGRAARNVD
GKVILYADTITGSMQRAMEETSRRREKQMAYNAEHGITPESVKAKISDIL
DSVYERDHVRADISGVAGKGFADGGHLVGNNLQAHLNALEKQMRDAAADL
DFEKAARLRDEIKRLKAAELAVLDDPMAREEAKSQESSKRASKGVPGTTD
SLPLVGSVGEAKSDGEGKTQSYFQKPSLDNMGPGTDTERPLFRKPELDEM
GRDVAIPSSRGREEGARKADEGQSLFRKNTLDEMTVGRTEKPVPGQAPEK
PTLTRAKPGVGSYEDPAEEKRRKSRTKGKTGRPGR
>gid:371785  uvrC  PROBABLE EXCINUCLEASE ABC SUBUNIT C PROTEIN
MNGQTPTDGGILYDATETDDEDDLVEVTEAERPAPAIGWAESLPEAAGLK
GAELIQAFVKRLPNGPGVYRMLNEAGDVLYVGKARSLKKRVSNYAQGRGH
SNRIARMVRETAHMEFVTTRTEIEALLLEANLIKRLRPRFNVLLRDDKSF
PYIVVTGDTRAPALYKHRGARSRKGDYFGPFASAGAVGRTINSLQRAFLL
RTCTDSVFETRTRPCLLYQIKRCSAPCTNEVSDADYAELVSEAKDFLSGK
SQAVKATIASAMAEASENLDFERAALYRDRLAALSHVQSHQGINPAGVEE
ADVFATHHEGGISCIQVFFFRTGQNWGNRAYFPKADPSIPPAEVLSAFLA
QFYDDKPCPRQVLLCAPVEEQELLAQALSEKSGYKVSILVPQRGEKKDLV
EHALANAREAHGRKLAETASQGRLLEGFAATFQLPYVPRRIEIYDNSHIM
GTNAVGGMVVAGPEGFVKGQYRKFNIKSTDITPGDDFGMMREVMTRRFSR
LLKEEGKPDRSAEPGEDAGFPAWPDVILIDGGQGQMTAVRTILKELGIED
CVTAIGVAKGVDRDAGRERFFAEGRESFTLPPRDPVLYFIQRLRDEAHRF
AIGSHRARRKKEMVKNPLDEIAGIGPTRKRALLTHFGTAKAVSRAGINDL
MSVNGISETVARIVYEHFHEDAAK
>gid:372972  uvrD1  PROBABLE DNA HELICASE II PROTEIN
MTKGFDDIPFFDEEPAPRKPAAADGGIAARAMAARDKAQRPDYVSGLNPE
QREAVEALEGPVLVLAGAGTGKTRVLTTRIAHILSTGRAYPSQILAVTFT
NKAAREMKERIGVLVGHAVEGMPWLGTFHSIGVKLLRRHAELVGLRSDFT
ILDTDDVVRLIKQLIQAEGLDDKRWPAKQFAGMIDTWKNKGLDPSQIPEG
DARAFANGKGRELYAAYQNRLLTLNACDFGDLLLHPIRMFRANPDVLKEY
HAKFRYILVDEYQDTNTAQYMWLRLLAQQTQASRNRPSSGLPATFSPSDG
EKGQAARSQSPLSPSERGEGRGEGQPTVNICCVGDDDQSIYGWRGAEVDN
ILRFEKDFPGAKVIKLERNYRSTEHILGAAAHLIAHNEGRLGKTLFTERT
NPDDEKVHVHAAWDSEEEARAIGEEIEQLQRKKHNLNDISILVRASFQMR
EFEDRFVTLGLNYRVIGGPRFYERLEIRDAMAYFRLVCQPADDLAFERIV
NTPKRGLGETTVRTLHDYARARDIPMLAAASDIVETDELKPKARKGLFDV
VADFRRWQTLLETTPHTELAERILDESGYTAMWQADKSAEAPGRLENLKE
LIRSMEAFESMRGFLEHVALVMDAEQNENMDAVSIMTLHSAKGLEFDTVF
LPGWEEGLFPHQRALDEGGRAGLEEERRLAYVGITRAKRRCHIWFVSNRR
IHGLWQSTLPSRFLDELPIAHVEVAEQEVSYGGYGRGGYGQSRFDKADPF
ENNYQTPGWKRAQQHRSEATRDNWGTRSGHAIERIGYGESGPRTRTIEGE
LVAKSTSAEPSRFNVGDRVFHIKFGNGNIAAIEGNKLTIDFDRAGQKRVL
DGFVERV
>gid:374196  vsr  PROBABLE DNA MISMATCH ENDONUCLEASE, PATCH REPAIR PROTEIN
MVDTLSPAQRSERMSRIRSNSTKPEMALRRALHRLGFRFRVQGTGLRGKP
DIVLAKYTTVIFVHGCFWHRHPGCKVATTPKSNTQFWTEKFSRNVKRDER
VVKQLEDEGWRVIVVWECEVNALSKAASVAEGIAKVLRSGRPGDD
>gid:373617  xerD  PROBABLE INTEGRASE/RECOMBINASE DNA RECOMBINATION PROTEIN
MTDMSAAYVEAFLEMMSAERGAAANTLQSYERDLEDARSFLRSRGTGLTD
ASADDLRSYLSHLAGQGFKASSQARRLSALRQFYKFLYAEGLRTDDPTGI
LDAPKKARTLPKTLSIEDVTRLIGQAEAEAKSGSDDVMAKLRMHALIELL
YATGMRVSELVSLPASVLAQNGRFLIIRGKGNKERLVPLSQAAIRAMRAY
GEALQEESADSPWLFPSNGKSGHLPRQVFARDLKSLAARAGIRVAAISPH
VLRHAFASHLLANGADLRAVQELLGHSDISTTQIYTHVLEERLHDLVQNH
HPLAKQAKKQD
>gid:372383  xthA1  PROBABLE EXODEOXYRIBONUCLEASE III PROTEIN
MKIATYNVNGVNGRLGVLLRWLEEASPDVVCLQELKAPDPKFPVKAIEAA
GYGAIWHGQKSWNGVAILARDREPTLTRKGLPGDPDDTHSRYIEAAVEGM
VIGCLYLPNGNPYPGPKFEYKLAWFHRLTAYAAELLELDVPVILAGDYNV
MPTELDVYKPERWVNDALFRIEVRDAYHRLLEQGWTDALRQLHPGERVYT
FWDYFRNAFARDAGLRIDHLLLSPHVTLRLSAAGVDRHVRGWEHTSDHAP
AWIELSDGPAEEDQ
>gid:372229  xthA2  PUTATIVE EXODEOXYRIBONUCLEASE III PROTEIN
MKIATWNINGVKARLDGLVGWLRESNPDIACLQEIKSVDETFPRGEIEAL
GYHVETHGQKGFNGVALLSKVRPDEINRGLPGDPADEQSRFIEGVFSVNG
GALRVCCLYLPNGNPVETEKYPYKLAWMRRLAAFAEQRLVLEEPLILAGD
YNVIPEAHDCWDVKVWRNDALYLPETRAAFRRLRNLGFTDAVRATSDEAP
LYSFWDYQAGCWQKNFGIRIDHLMLSPEAADKLVSTSIEKHVRAWEKPSD
HVPVTAEFAFETA
>gid:374264  xthA3  PUTATIVE EXODEOXYRIBONUCLEASE III PROTEIN
MPLSIATWNINSVRLRMPLVEHFLKTWQPDILCLQEIKCRNDQFPSAPLR
KLGYEYIEMHGQKGYHGVATISRLPLHELSDRRDYCGVGDARHLSVVFAA
GGKTIRLHNFYVPAGGDEPDRTVNPKFGHKLDFVDEMKLLHAEAEAGVSS
ILVGDLNIAPLEHDVWSHKQLLKIVSHTPVETEGLVSVMTGGAWVDLMRR
HAPPPEKLYTWWSYRAKDWTAADRGRRLDHIWSSADLASQLVRVEILKEA
RGWERPSDHVPVIAHFEL
>SMb20689 xthA4, probable exodeoxyribonuclease III protein
MKIATFNINGVNKRLDILLTWLGTAEPDVVCLQELKATDGQFPRAAIEAA
GYGAVWRGQAAWNGVAILARDREPVLTRTGLPGDSSDTQSRYIEAAVNGI
LIASLYAPNGNPQPGPKFDYKLAWHLRFNQHAAALLETGVPVVLAGDYNV
VPEPRDIYPTRSYDDNALVQPESRAAFRSLVDQGWLDALRKIHPKEELFT
FWDYRRNRWQRDAGLRLDHILLSRKLRRRLTGAGIDREIRALEGSSDHAP
VWVSMRD