TitleGenColors Logo

Gene list

Applied filters:

Organism: Sinorhizobium meliloti, 1021
Gene type: CDS

Number of genes found: 1294

Free access
Sort by:

 



# Sinorhizobium meliloti, 1021

>SMa1846 Putative
MKSYSIALIPGDGIGQDVTDAAWQVLSTVARHSGFTLTGTSFPWSCAFYK
ETGAMMPADGIEALRPFDAILLGSVGWPAEVPDSISLHGLLLPIRKAFVQ
YANIRPHRLLSGVEGPLKASDFDILCVRENTEGEYSGAGGRVHQGTADEV
AVETSIFTRAGVERILRFAFEQARQRRGKLASVTKSNAQKHSMVFWDEVT
RQLADEYPDVEVTSYHIDAMAARVVMAPESLDVVVASNLFGDILTDLGAA
IQGGLGFAASANINPDRSAPSMFEPVHGSAPDIAHLGIANPIAAIWSGAM
MLEHLGEREAAGMIMAALERTTIRGIGTVPGKDRTRSITAAVLAALD
>SMa0552 allantoinase, putative
MSSSNFQRKRPVARDRKGNPSIDAMTTFQGTAMCDLFVRNATVVTEERTF
DGGISVSNGRIDELVVGQRDIAAARQIIDASGLLLLPGLVDAHVHFSEPG
RGHWEGFETGSRAAAAGGITTFVEMPLNAQPATIDAAALVMKKAAAKQSH
IDYALWGGLVDDNLDDLADLRRGGVVGFYRERHCRGGGRGLAGAT
>SMa0917 hypothetical protein
MPEEHRELFAAEIDRLARPRGMTMSREFLPGQVIAYPYLWAWQHEHGETE
GRKTRPDRVVVAVRDANDGLTHLALLAFTTQPPQADRIALEVSDIECRRA
GLSDRKRC
>SMa0116 putative DnaJ/CbpA-type protein
MHSTCPLNWWRCSVTDDPYQILGVPRTGKPDEIRKAYRKRAKELHPDLHP
GDKEVETKFKALSAAYHLLSDPEQRARFDRGEIDASGAERPQQHFYRHYA
DADHARRYGSAGGTGEFEDVSDIFADLFGQGGAGGRFKARGQDRHYHLEL
EFLDAVNGARRRITFPDGNTLDLAIPAGTRDGSTLRLRGKGTPGIAGGEP
GDALIEISVRSHSVFRREGNDIEIDLPITLYEAVLGAKIEVPTISGRVSM
TIPKGSNTGDILRLRGKGVKSQHGPAGDQRVTLKVVLPTATDPDLERFME
TWRRSHAYDPREALGRAT
>SMa1095 Conserved hypothetical protein
MSNAKSVHVAMVDPTATPRQSGDEPGADVAAYLARHGIDVTVDALPSAGR
SVAQVLQRHANDVAADMIVMGAYGHSPLRELVFGGVTRSMLDEARLPVLM
AR
>SMa0160 putative GntR-family transcription regulator
MSENIFYKLRDDIENGIVTGEFEPGERLDETQLAMRFGVSRTPIREALMQ
LSAIGLVEIRPRRGAVVVDPAPQRVYEMFEVMAELEGMAGALAARRHTEE
DSAALLAAHEKCREAVLSEETDRYYYENEIFHRAIYTASHSGFLEEQCTT
LHRRLRPYRRLQLRVRNRMKVSFREHGEIVEAILSGNTDGAREGLRSHVA
VQGDRFGDLVANLSNRERRSGRLG
>SMa0155 hypothetical protein
MLKAYSRAVGNASRGLAAVATALLIAAMLVVCQMIMQRYVFRQATIWQTD
FVVFSATAAMFLGAPYVLLKGGHVGIDVVEMVVGERTRYVLRIIAGLLGL
LFCAVMLIATWIQFHDAWAGNWKHSSVWAPPLWVPLAALPVSFAMLCLQY
VAQILTLLTAPAVPAAMGHGAAEPGSAPGADRHPQEIIQ
>SMa2187 putative integrase/recombinase
MLSQLLADHAALHQALGFKFRTPGVLLRNFVAFAEHRGEHVITTATVHEW
ALQAPSPEQRRNRLLTVRRFALSLHAEDPRHEVPSADLFGRASRKRRTPY
IYSPEEIRRLIDAARRLGPDDSIRSLTYATMFGLIAATGMRVSEAIAIRL
HDVTDDGLIIAQTKFKKSRLLPLHTTTRRALDGYLATRLKAASQSDALFI
SHQGTPLAYPTVITVFLQIIRSIGLRGAPGSRGPRIHDLRHTFAVRSLEH
CAHDSQAVARHITALSTYLGHAHVTDTYWYLQSTPELMAHMSEAGETLHR
GVTA
>SMa0882 hypothetical protein
MCNDYRLMVDVASIVEDFADLKIKVRFGEGAPNLEPREDIKITDVGPIVR
TLDGAPDEVELVQRRWSWPGPNKRPVYNFRSEGREFNSNRCLIIADGFYE
FTEPKDPKKKRKDKWLFTKKDEPIFCIAGIWREMPGVGQAFTMLTMEPGP
DIAPYHDRQIVILERCAWADWLDPTVSAKSLIRPLPIRTLSVEQVG
>SMa0451 hypothetical protein
MELIRERQVGNEKRSTNQPSTAFTSNASNPTAVMDEIHSIILTRSAGRIA
ILLLCTPGEETYPLLNCCRPASSRTWPKATVTV
>SMa1513 Putative ABC transporter permease
MRIAAPVIATALTFLAGSVLFAALGYDPLATLHAFFVAPINSTNGLSEWL
LKASPLILIACGLAVGFRANIWNIGAEGQLIIGAIAACGVGLFYPDPESP
LLIPLMFLAGAGAGMAWAAIPAFLRARMNTNEILVTLMLTYIATLLLSFL
VHGPWRDPAGFNYPQTALLPAAAMFEPFDYSYRLNPSIFITAVAVVMMWL
FTDRSFLGYKMSVSGAAPLAARYAGFRESSAVWTGLLAGGAAAGIAGMAE
VAGPLGQLSPQISPGYGFAAIIVAFIGRLNAFGIVLGGLLMSLLFLGGET
VQMTLGLPAALTRIFQGILLFFLLAADFFIYYRLRLPEHA
>SMa1387 Putative LysR-type transcriptional regulator
MPTNRRNRHLPSISLSLVQTFYQLGRTGSYSAAARELNLSHPSVANHIRR
LEHLLGERLVVAERGARRVGLTPRGLALYELIRPEFDIMLTRLTRLMESQ
RPVLRIGMPQGIFHDLFPRVVRKFHESRPDVELVVYERDTALSDLVRQGS
LDIFIAERHFGDSVVIQQLIGRYSLSLVYPRTWGPAPAEGDIPEWARGRP
FVSHEPGQIIRDIATGFLSSKGAVVEPLISVSSSVSIKRYVSEGLGFSIL
PTWSVGPDDDTITRVELLSLTPVPIYFGTAHFLKDNETVRAFFSHCQFEL
SGRS
>SMa1787 Hypothetical protein
MNDALVSLERDFAALYSPIGRPSIAPEKLLRAMLLQAFYSIRSERLLMER
LEYDLLFRWFVDLGIDDPAWDHSVFSKNRDRLLDGDIAAKFLGAIPRRMG
LHLRGRRL
>SMa1919 Hypothetical protein
MKANSMRDHTFPIGLSVRLKDRTYISPGAAETYRITAKLPWTSNWPQYRI
RNDELGQERVSGEDNLEPIEWGMASPH
>SMa1483 Putative
MRSHAQAVVIGGGLIGCSILYHLTKLGWSDVVLLERSELTSGSTWHAAAN
IHGLHDSTNISLLQHYTMALYKELEVETGQGCGIFQPGSLYLAQTEAREH
QLRLQGAKARRYKMNFYEIGRDEAERLHPLVNFDGIRCIMYEPEGGNVDP
SGVTMAYAAGARRRGAEIHRFTPVTGTEAQADGSWIVRTPKGDIRTRWVV
NAAGLWGREVAAMAGLELPLMPTEHQYFVTETIAEIAALDRRLPSVADRD
GEYYLRQEGLGLLIGAYERDMRFWAEDGTPLGFGHELFPDDLERIEENMM
RAIDRVPVVGTAGIKRVINGPMIWSPDSAVLFGPVPEMTNYFCCNGIIPG
FSQSGGMGKLAAEWMIEGEPSLDMFGWDMARFGHWANKAFTKARVQDQYS
HRFKIHFPNEERAAGRPVRTRPAYEKQKAMGAVFGLNFGWEHPLWFSAEG
EPKEETIGFTRQNWWAPVGREARMLRESAGIIDISNFAKYAVKGAGASDW
LNALFANRMPTVVGRSCLTPLIGKRGGIAGDFTVTKLGDDEFMIFGSGMA
ERYHQRFFNAVPLPDDTTFTSLTERLCAFNIAGPKSRELLMRLTNDDLSN
ENFSFMRSRRMRVAGVEVIALRVSFTGDLGWELYCDAERQVALYDALLEA
GADLGAGPVGSRALASLRIEKGYGSWSREYSPEYWPQECALDRLIKLDKD
AFLNKAPYQEIAGKLPRQKLAMISIDATDADATGGEPIFLRDGTPIGQVS
SGAYGYTVGMSLALCYIKAEMAKPGNKVSVAILGRAHDAVILERPPFDPA
GERLRM
>SMa0538 hypothetical protein
MALLDEILEIERRLWRNDADVYAATYLPQAVLIFPGIGRIDLDTAVEAIR
GENAAGRHWAEVSFSAETAVEVAAGTCLVAYHADARWNDQPTAEAVDCLT
VYVKRDGRWRVAAHQQTASA
>SMa2323 hypothetical protein
MPGIVLVCLLSAALIIVTSATHFEIMTGVAILRRRKVLSKRAEMVVFLGS
AFVAHLFGVGLYALAFAWMHYHPEFGSLAGLAGSNATDFFYFSLTCYSTM
GFGDVYATGDMRILAGLEGLNGLVLIAWSASFTFIAVERIWRDDPK
>SMa2381 hypothetical protein
MLLKAMKIGRQSLYDTFGDKWKLYCLAVERYSASETSAHIALLRGKPRAL
EGIRLVMERVVDNANQACLGVNSICEFGQSRPDLAALHHAADRRLKNVVV
ERIREAQAAGDLAVSLPAEAVADFLVANIAGIRIAARGGARREHLQSLSR
LALRAVT
>SMa0580 hypothetical protein
MSSRALQRSTVRGGLGQFWVAKASFRARSDCSSRTACFDGAIKGALVGAT
KPEPRSRQTSALQLADGFVRG
>SMa2327 probable sensor protein
MPHNDRDLKSRPDPDALLALAEQGRRGKLTVFLGAAPGVGKTFAMLTRAR
RLKEEGGDIVIGLVETHGRGETAALLEGLEVLPRRQVLHNGRTLHEFDLD
AALARRPRVILVDELAHTNFGESRHPKRYQDIEELVDAGIDVWTALNIQH
LESLSDVVAHIAGVPVRERVPDTVLNRADDVLLVDLPPAELIERLKEGKV
YLPDNAKRAADRFFRLGNLTALRELALRRTADRVDDQMVDYLRQNAIEGP
WGAAERLLVCIGPDPLSEKVVRTASRLASSLNADWMVVSVERAEAESSGA
MRQLDETFRLAEQLGAETRRIIGNDFVEEILKLARREHATQIVIGTRRHY
FPLRLFRRSLPDALAARAAGIAIHLVTDGSAPAVKPAARRRSTLPDGWGR
GVAIATGTAAAATALGLLIEQFVVLQNISLLFLLAVLVSATYAGYVAAIA
AALISILAYNFCFIEPVGTFTVAEPHEVFALFVFLAAAMLAGGLASRVRE
QAKTARRRAAATQALYDFSRKLSGTANAEDVLWAAVTQIHATLRRNAALL
LPEDGDVRLMAAWPPDTELGLTDTMAGRWAFEKKEAAGNGTGTLPNSPFQ
FRPLMSPHGVVGVFGFLQEDKPLEINEERALAAILDQTAIAVDRARLSRE
SLDQAAQLEGEKFLAALLSSISHDLRTPLATITGAVTSLRQLGERMSEES
RDDLLKSIEEESGRLTRFVANLLDMTRIEAGTVNAKRDWVDVADVVHSAV
ERARKYFPGRVFETSIAPDLPLIRGDSVLLGQVIFNLLDNANRFGGDEPV
SIYGRREEDEIVLSVTDLGKGIAPADLDRVFDKFFRKGKPDGRSLGTGLG
LSISKGFVEAMDGRIKAESPAMKRRGTRISMRFPVAETGITEKERG
>SMa1767 Hypothetical protein
MINFRVDDLDGLMAKLRASGIAVETRADWNSEVGRFARIHDPEGNPVELW
EQTHE
>SMa1262 Conserved hypothetical protein
MGGFVVRVNMPGWLKPRPTDHGHGPLAMIVESSLDPGRPIAMHEHRNDEI
ISWVPFGVMRHDDKTTGRLVTDSKHLLVMNAGRSFWHSEETLSSDPPLRM
LQIFVRPRAVDLDPRIQHGPIPLRRPNTWRHLVGPEGGDAPFHIRNTIDL
FDIRLEPGARLVFPHMRGRDLYFYVYSGLLFAAGQTFAEGAQGLLLSDRE
LSVESKTQSTVVAFLIDPHAPITRKGTVGDHRKIPPVILIRMLRKWRQLW
KWRRSY
>SMa0861 conserved hypothetical protein
MLSDSEALIASLELAIEKLKRELRGLRSGRTALIDQMELQLEELVMAVTE
DEVAAQAAAAKTSSVRSFTRKRPVRKPWPEDIERERVVIDPPVAYACCGG
SRLSKLGEDVTETLKEIPRRFKVIETVREKFTCRDCEAISQPPAPFHATP
RGFIGPHLLATIPLRQVWNAYAAHPPEHPVQMRGPRAFDLDAGRPGRVWN
GRAPATRGAQNPAALLGWLPSLALVPATTTCTEAESASWPPSSTSSPSTM
GPNAQRRPGIPRGVRPKLQQSGQWKYHQRRNQCR
>SMa0077 hypothetical protein
MPAIELSNGEARLLARPDLGAGLTAFDVLHDGVWQAIFRRVDPSTAHPFA
LSNILLVPFSGRVSGGGFTFDGTFHALPRNVETELYPIHGNGFSAAWDVA
SLSADLLTLTLSAEGPGPFRYDATMSYRLQGAGLLMELSLVNRARIRLPY
GAGFHPWFVREPDTTLHAPARGVWLEQSDHLPKAHDAIAAQADLDFNRPK
PLPARWINNWFDGWDGKARIHWPSRGLAVDVDASEGLRQYVVFSPAAAAD
FFCFEPVTHPVDAFNLPGKAEAHGLKILEPDESLAVSARVAADFG
>SMa0059 putative
MSDFNGKSIVVTGGSLGMGLACAHRFAAGGGKVTIVANDKASVDEAVTSI
GDNAAGFVGDVRSKADMNAAVQAAVSRHGGVDILACCAGIQRYGTVVDTA
DEVWDDVLDINLKGIFLASKFAIPEMRKRGGGAIVAISSVQAYASQTGVA
AYTASKGAINALVRAMALDHAGDNITVNAVCPASIDTPMLRWAADLWKGE
GTVEATLETWGKGHPLGRVGKPSEVAELVAFLASEKARFITGADIKIDGG
VLSKLGIVIPD
>SMa0894 hypothetical protein
MVGAGSACLSRDRLEYLRWIFREDLRKEFIRAYGNDAVGKVAEIAHVEVT
MTVARQRLLPRQHSDPWDRSACPALFFPSFQPASGNASSMTVRILLPG
>SMa0752 possible dioxygenase reductase subunit
MTQFKQLSFWSDAEPLECVTRTPEAPNVVTFSFQSPSGALFNHDPGQFVT
LELPAPGGPLYRTYTISSAPSRPTALTITVKAQDGSTGTRWMLDNLHKGM
RIRAIGPAGKFSIVHHPADKYLFISAGSGITPMVAMTTWLYDSGREPDVV
FINCARRPSEIILRDRMELMASRIVGIDLKWVVEEPDPFRSWTGYRGMFN
QIMLGLMAQDYLEREVFCCGPEPFMRAVREALAGLGYDMSRYHQESFTAE
PGHAEDVPEDVIPDEQNHAEIAFALSGVTTRCSETDTILAAAKAAGLVIP
SGCSMGICGTCKVRKTEGQVHMVHNGGITDEDVEDGYILACCSKPLRRVS
VEA
>SMa0990 hypothetical protein
MKPELEAMVEDALDVSVREFTWRCFFRLFAWENSLPISLMNKVTAGPSVS
FWPRTIVACRLPHPALPQPIAASGIMSDRSRGQPSKSGMSHREYLRHIIT
QRMNLNHCFEGRSLLMSVQDRSSQSPLHIKADRRKYTERYDPASRSLLSI
SKPLSAVIG
>SMa1779 Hypothetical protein
MRSDSRPHTYALGHHGFSGEWASLGFVVHDPYGSERMDLALTGVDGRSDR
TSSRTSDGVDRVAADIVEAGGAAIGIVCEVGELDQITAAVDQVVAAYGRI
DILVNNAAGRSAVLSTILDLSIEQLQRNFDTGAIAYIRFMQSGGLRLL
>SMa0044 hypothetical protein
MCRNIKPLFNFDPPATDEEVRDAALQFVRKLSGTTKPSERNEAAFERAVD
SIAACARELLDALETSQPPRNREEVAARARERSAMRFA
>SMa0369 hypothetical protein
MYPEMPRAISRSTMDVETDDKSQTIVRGILALAHGLGMRVTAEGVETADQ
ANWLRNQGCDRLQGNLFSAPIPAGSMEKFLRQSSMASQ
>SMa0590 hypothetical protein
MICGPYHDSVRFAANCAAGCLSYLSDTRGRFGRLPGEAGAQFVARHVVSP
AREGGTDDGRGGSGYHAPTPNPRILASQTQAQDQSDHRQERTDSLSRHRS
RQSGRQRAAFALTGAPHPSMARRRGRRAPTAAMRAVKTYPRPRPGVCAIW
PS
>SMa1021 putative cytochrome C-like protein
MTLGSLNRCPNGISTGRILINASRIRTAFVLLVVGAFATTIPAVTQSIAT
DNVAQRQDGMKAMAAAAKTIDGMFKGSSVYDANAFKAAAETIRSHAGEKL
SSLFDRSIAAPGSKASVNIEAERQSFDKLAADLGVYASALAAAAERYPDV
ISPQMRMQSGDAMIGGPLARKAKADRDVRSIPAEHAFHLMLQTCTSCHAQ
FRVRAE
>SMa2015 Putative transcriptional regulator
MEQPNLKELEAIIAIPRRGTSRAAAIDLGMSTTALSHTVGRLEAGLGVRL
FNRTTRSVSLTDAGRLFMQQVAPSLQDLRTALETVRSQRETPSGTIRINA
APFAARAIISPLVLEFLRPYPDMHVDIVTEGKLVDVVGDGFDLGVRVAGL
VPTDMIAVSLGRPQRHAVVGSPE
>SMa0457 hypothetical protein
MGGALTGAGAARRGGARAALHNPRLDPSGGKAWGECHRTHRPLDPGRAEV
NILIKIPIGWHDLISCRCRATRGFHVVSNLYEAIERQWMHFRLPTRKGKA
QRITKMVRHFCQDILRMGAFPGGTNTS
>SMa1288 Conserved hypothetical protein
MYRSDQVTRRFSDLQDFAAMLERRGRLLRISRPVSLVHEVTEIHRRVLLA
GGPALLFERPVDASGRVCDIPLLANLFGTLERIEWGFGLPAGGLPGLAEM
LAELREPRSPQSLSDAWGKLPLLKAALAMRPRNVSSPPVQDTVWRGAEAD
LSRLPIQWCWPGEPAPLVTWPLVITRAPDDPSDVNVGIYRMQVLGPNRLV
LRWLAHRGGARHHRMWQQRREDMPVAVAIGSDPATILAAVMPLPESMNEL
AFAGLLKAERQPVAQAVTVPLSVPANAEIVLEGVVSADETAEEGPYGDHT
GYYNSVERFPVMSLSAITMRHRPFYLSTFTGRPPDEPSKLGEAMMELFLP
LVKRQFPEIIDLYLPPEACSYRAMVVSIDKRYPGQAKRIMMGLWSMLPQF
NYTKLIVAVDPDIHVRNWSDVIWALSTRFDASRDVTILNDTPIDYLDFAS
PKPGLGGKLGLDATRKVGPETNREWGRVLEMPAEVVGKIDQFWAELGLGE
AP
>SMa1438 putative ABC transporter, periplasmic solute-binding protein
MGNKSVRVIAGAMVVAGWAGYAAAQSASEIKIVLPEQPANLEPCGTIITN
VGQILSRNVVEPLTIIDPKSGQPTPGLATEWKQTDPNTWQLKLREGVKFQ
DGAAFDAEAVKFSIERMTGGKLTCSNIAKFGDAKLTVTPLDELTVEIKSD
KPQPILPTLLSVVMIVSPNTPADKAVNDPVGTGPFKLSSFTPQTVVLEAF
DGYWGEKPAIAKASYVWRPESSIRAAMVETGEADLTPSIAIQDATNPETD
FAYLNSETTAIRIDAGFAPLDDVRIRKALNLAIDWDGLAQLFGEDVQRAS
QMVVTGINGHDDKLAPWTFDAEKARALIAEAKAAGVPVDTEIELIGRNGI
YPNGTEAMEAMMAMWQDVGLNVKLTMLDVNDWLRYLQKPFPESRGPNLLQ
MMHDNNKGDAAFTIPIFYTSAGSYSTFSDAALDKEVADAMAATGEDRTAK
FKAIFAKVHDDLAVDIPMFHMIGYTRVGSRLEWKPDITTNSEIPLANIAL
KD
>SMa1403 Putative
MHPGEKRKFGRVDLDVTAFGFGTAPLGNIFREIDEETSQSMFELAWDAGV
RFFDTAPMYGHGLAELRTGQGLRWRDRDEYVLSSKVGRLLTPAKRSTIDF
APWVNAAPFSMRFDYSYDGTMRSFEDSLQRLGLERMDICFIHDIDVFTRG
SEQPEVFGQAMDGTWRALEKLRSEGLVKAIGVGVNEWEVCHEALKQRDFD
CFLLAGRYTLLEQDALEEFLPLCEERGAAVVVGGGFNSGILATGAREGAK
YNYAPAPKAILEKVARIEAVCRTHDVPLGAAALQFVVAHPAVPSFMAGTR
TIEQLRQNLSWFSHPIPAGFWTELKSKGLLREDAPIPA
>SMa1331 Hypothetical protein
MAKEEYFLYPTDVESFGFDWGRLALTVAPEVNGAERFSGGVVDLPSGEGH
TRHNHPGAEEIIFVVSGEGEQMVEDENGDPVTQRVGPGCTIYVPESRFHS
TRNTGPGPMQLFVVYSPAGPERALRDLPDFRLIPPGT
>SMa1500 Putative oxidoreductase/oxygenase
MLDTAKTRWHPVAASYDLPFRHIFHAQLLGREFAVWRADDGYVNIWENRC
LHRGVRLSIGINDGRELKCQYHGWRYSNRTAGCTYIPAHPADAPARTITN
RTFASVERYGLVWTAEEPQGDVPEVTGLAEGDLLTLRGIPVNAPADVVVA
ALTGYRFQPNGRLEGRAADMSLKASDGFSVALTAREEGAETLAVFFVQPV
DSNRSVIRGVLDSSPRGAERLTVLHHHNERLSKLREIVEREAQAAPQPAP
LEPVIERVSPELAELPEMTAHGRKATIRVTVARKWMAADGIAAFELRPIK
GLLPTFQPGAHIDVHMPNGLIRQYSITNGPGESDSYVIGVKLERESKGGS
RCMHETLRAGDVLAISEPRNNFPLRRDAEKTIFVAGGIGATPLIAMAQAL
KNQSLDFAFHYFAQNQAQLAFPEKTALLGEALKPQLGLDPEGTEAKLKDI
LSGYRPGMHVYLCGPGPMLEAARRIAAEVGWPETAVHFEYFKNTNTIDDS
SSFEVALARSCVTFKVPAGRTILDVMREVGIDMPSSCEQGACGTCLATVI
EGEPDHQDVYLNDAERKSGTKIMTCVSRAKSARLVLDL
>SMa1855 Putative aminotransferase
MYSNSLIELDRAHLIHPVASYRGHEKLGVRVLASAKGATVTDASGRQLID
GFAGLWCVNAGYGQETIVEAAAKQMRELSYATAYFGLGSEPAIRLASELA
ERAPGNLNHVYFTLGGSDAVDSTIRFIRYYWTARGEPQRDQFISVEQGYH
GSSTVGAGLTALPAFHTGFGIPFDWQHKIPSHYAYRNPVGDDPQAIIAAS
LTALRRKVEEIGPERVAAFYAEPIQGSGGVLVPPRGWMKAMRELCRDLGI
LFVADEVITGFGRTGPLFASTENEIVPDFITTAKGLTSGYVPMGAVFMAD
HIYQTIADGAGASAVGHGYTYSAHPVSAAVGLEVLRLYENGLLENGVKAG
ARLMEGLGSLRDHPLVGDVRGRGMLAAIELVVDKAQKTPLPAAAEPARRI
FDRAWENGLVIRAFANGVLGYAPPLCCSETEIDAIIERTRQSLDETLEDP
DVRRALKT
>SMa2025 Putative oxidoreductase
MPVGRRILPRSMSISSSFATTTSATLPELIYGGTHVKLYTHDENVPANTS
FAHNTVGDVFDFIVCGAGSSGSVVAARLAENGNASVLLLEAGGDHKSETV
LNPAQWPLNLGSSRDWGFVGQPTPGLDGRRLPLSMGKGLGGGSSINVMVW
ARGHKEDWNHFAAEAGDDAWGYQSILGYYRRIEIWQGAPDPTRRGVGGPA
YVAQPTSPQAVAEALLNAASAIGIPLYNTTNGEMMEASGGASIAELRIRE
GKRETVFESYVDPLLSRPNLMVITEALVTRLIFDGKRVRGVEALIDGRRR
QFMAHCETVLSLGAINTPKVLMQSGIGPENELQSHGIPVVQHLPGVGRNH
QDHLAFGCTWAYRKPEAVGGSGCEANLYWKSDARLSQPDVLQCQLEFAVP
SPLEAGLETPEHGWTMFNGLAQPKSRGRLRLSGPDINDPVLIEPNSLSEP
EDMAAALAAVELCRELGNSDGFRPLVTGETAPGQRDRSGMIDFIRRSAVT
YWHQSCTAKMGRDNMSVVDNELKVYGIDGLRIADSSIMPRITTGNTMAPC
VVIGERAAVLIRNTHGLIAVSEPLAQSI
>SMa0747 hypothetical protein
MRLPGLSLSEKVPAAKDKPAKLAQKDRDARWTVKYSKVNATDENAVIWRD
QAV
>SMa1979 Putative LysR-family transcriptional regulator
MTFGFDLDLLRAFTAVVETGGFTRAAERVHRTQSTISQQIKKLETNLGHV
LLIRDRATGSVRTTEEGELLMSYARRILSVSAEANEALGRSMPSPKTVRL
GVPEDFAGRRMIDLLSGFARASPRTRLDTISGWSFELRRLLQAGEIDLAL
VKREPGDGACIAKWEERLVWIGDPLLAVGNDPVPLAVFPSGCIYRERVIR
AVERSGKAWRIAYSSQGLMGVQAAVASGLGISLLPNDAVLPEHRLLHETD
GFLPEPASELALITVAKKLDPALQDLVDYLLASLKAMKFT
>SMa1473 Putative aminopeptidase
MSIVVFDPDSVDDVDFKDRMRHPETADPAGGMWLSDTEPSFIDADALRNG
RLKKLRDWMRAAGYGAVVLFDPYNQRYATGSRNMFGYFLRNSTRYFFIPT
EGPVVLFEYPQSYHVSMVLDTIDEARPSKLVWSSVSGKDDETAGPFADEI
TDLLKQHGGGSMKLGMDRCSHLQALALEKRGCEVKDCQGEILAVRAVKTP
EEIKCLQVSMAGAEAAVAAVREAIKPGVFETKLFAIMYHEVIRQGGEFIE
TRLLSSGQRTNPWFNEASGRKIRPGELVALDTDTIGCYGYYSDFSRTFRC
GPGKPTPYQKSLYRMAYDQVQHNIDIVKPGMAFREIAEKAWKIPDRFVDQ
RYTSVMHGVGMHGETPFIAHAMDYETYGRDGYLVPGMVVSVESYIGEKGG
REGVKLEDEILITENGTELLSRFPYEDEFLSGET
>SMa2047 TRm24 putative transposase
MSQCYLQLTLPDRRRVHQLLERKVPIAEIARQLGRHRSTIYRELKRNTFH
DAEFPEYSGYYSGIANDISKERRRRLRKLSRHPQLRELVIEQLKALWSPE
QIAGRLLADGVSAVRVCTETIYRFIYSKEDYALELYQHLPEGRRKRRPRR
SRKPRDGSIPLDCRISQRPDFIADRSQFGHWEGDLLIFRRDLGEANVTSL
VERKSRYTVMIKNGSRHSRPLIDKIIDAFSPLPAFARQSFTFDRGTEFRG
FKALEDGLGARSWFCDPNSPWQKGAVENTNKRIRRFVPSDTDLSAVSQPQ
LVALAHHLNSLPRKCLGYRTPAEVFMAHLRDCG
>SMa1600 putative transport protein
MPHTPLIATLVAGLGLAFILGTLANRLRLSPLVGYLLAGVLIGPFTPGFV
ADQALARQLAELGVILLMFGIGLHFSLHDLLSVRTIAVPGAFGQMALVTS
LGFIVTQAIGWPIGAGLVFGLALSVASTVVVLRALQEKRQLETDGGRIAV
GWLVVEDVAMILALVLLPAFADVLGGTANRAEPENSGMLTFFEPHTISGA
LGLTLAKLAAFFALMAIVGRRVIPAILHYVAHTGSRELFRLAVLAIALGV
AFGAAELFGVSFALGAFFAGMILAESQLSQRAAQETLPLRDAFAVLFFVS
VGMLFNPMILVEQPLLVAATFLIIVIGNAAAASAIAVMFGYSLPIAVTLG
LSLAQIGEFSFILAGLGVELNLLPETGRDLVLAGAILSILINPLLFAGLD
RLMPRLENRAPVRTEEEGRIDITPKLTTTSLTDHAILVGYGRVGRLVAET
LQNAGQPYLIVEERQVVADQLRAGGVDVISGNAAQPGLLEAANVNSAKWL
ISAIPNPFESGNFIEHARATNPKLEIIARAHSDAEVEYLKRLGANLIIMG
EKEIARSISEHILSNINAPDTTTPSDSESRLTSE
>SMa2387 hypothetical protein
MIRARRRAGGRPTREEAEALTRRLLDSARSTFARKGIANSSMEEIAAELG
ISKHTLYRRYPNRQALLEAVVERDLVRFRKTLAEAAGQGEAPLAALRDMA
FRYFRFGTDRDYSAFYLSVTAEAVFSLPLRERLAAWSSAALEPLVQAIIS
AQAAGLVVPGSTIEICHVLIDLLEGANNRVRLCLSESPDASERLRLFESR
WAVFQTAMMPEPNRPLGV
>SMa1351 Conserved hypothetical protein
MPCGLRAARLGGEMRIKTVQAWWVRIPIEANRQHQSDFGRLTTFDAAILR
IETDDGIVGWGEGKNAAGSAGSYGTLVHMLNYEVGPRLVGRDAADISAVW
EMLYNGVRHERAAMSGHAMPELSRRGLSIAAISAVDIALWDILGKSLGVP
VWKLLGGRKADRLPAYASGGWESAEKIGGQLQSYLASGGFKAVKMRVGAM
DGAPYVSAARVRAARKALGPSVDIMVDAHGTYTVADAKRFIQLVRDCDLA
WFEEPVIADDKAGMAEVRAAGNVPIATGESEATRFAFRDLAVLRSADIFQ
PDPAFCGGITEAMRIGAIASAFNLRLAPHLWAGAPCFFSGLHICAASPAS
FVVEYSVGANPMIHDLVEETVAVKDGMLEIPDKPGLGFTINERVLETHAQ
RL
>SMa1326 Conserved hypothetical protein
MSKLRIHAFSISLDGYGAGPDQSLDNPLGRGGEALHEWFLPTRTFQRMSG
KEQGTTGIDEDFAARGFDNIGAWILGRNMFGPVRGPWPDQSWRGWWGENP
PYHCPVFVLTNHARPSLQMEGGTVFHFITDGIEAALARAREAADGRDVRL
GGGSATVRQYLNARLVDEMHIAISPILLGSGEALFAGMDLPLLGYEVTEY
VPSQKSTHIVVSRRA
>SMa0699 hypothetical protein
MAEKQHIDLEAAPASAKDLRMQLLQRQMEEMEKERQFKAVQEQKLTDFAG
VFLEEHVTPEEVAVVRRLVANAVRDGKFEAMVYSFPSNLCTDSGRAINSA
DRDWPQTLQGKAKEFYERYQKFGKPQGFKLKAMIVNFPGGMPGDVGFFLN
WAPDDA
>SMa1132 hypothetical protein
MTRQQRAVVWAKPAKEVARRLHPHFVREEEFALPPLSLLGALATGKLAPG
MTDVLALTDRLEAELSGMLGEHKEIVAALGDLVAAVKAENMPKYTVFAQK
LVLHARTEEEVLYPAAILVGHYVKRVLGR
>SMa0714 putative ABC sugar transport ATP binding protein, carboxyl terminus
MTMADRVVIMRDGAIQQIADPDTLFAKPENLFVAGFIGSPGMNFLRARIE
AGKLTLFGQTFNAAVGVGAGEIIVGIRPEHLALGPGDVTFTVKPTLVESL
GSEKYVYFEPGEHAYRADARDEERGKGLIARIAHAGPIREGEELVLSFNA
SEVHLFDAKTEKAVN
>SMa2083 probable ABC transporter, permease protein
MSTTDFGAFPAVTGGAYAAPATNEQATAPERPAWSNMKAPDARSQQTAAV
AITNGRRALKVFLANPNALFGTAFLALVIAVALLAPVLYPGDPLSMVGKP
FLWPGQHPAFPLGTDSLGRDVLAGILHGSRISLFVGLMATALGLTFGVVV
GAIAGYFSGWIDDLLVRLIEIFQTLPSFVLLVVLVAIVQPSATTVTLAIA
VISWPTVARLTRAEFRAIREKDFVMAARSLGFGHGRIIVREILPNALPPI
IVTSSVMVATAILMESALSFMGLGDPNVVSWGSMIGTGRELVRTAWYLTA
LPGLAIVFTVLALNLIGDGLNDALNPRFTQDR
>SMa2249 conserved hypothetical protein
MKKRLPSVEHVETLSLLAMRRLVGGLVEELHALKAEVATLRSENEALRED
NAHLRLDNSRLKAENQQLRDEIARLKNLPPRPPFRPSGMEKATELGNGDH
AAGKSPRGPKRDRNRITRTVTLRVDAPEGSRFKGYKSFFVRNLVLGAELV
NYRRERWLTPEGEVIVAPLPEGVSSGFGRNLRRACLALHTQGQVTTPRLT
AILNSIGVEISKRQVVRLLTADLEQFVEEDNAVLHAGLVSAPFVTVDDTG
ARHNRRNAFTTQIGGERFSTFRTSFSKSRLNFLSVLRAGHQDYVLNDEAM
NWLKAQGFEHAIMGRLKTNPPAVFTDQVAFLEHLASKGIDILDRQLLRPL
GEAATGAPSAITACSAGP
>SMa2032 Putative non-heme chloroperoxidase
MAAGHDGEAPRRTTMASRPSRKPTKTDDLKAITVPTPVLHGEDDQVVPIA
ASALKAVKLLPNGSLKTYPGFSHGMLTVNADVLNADLLEFITNRSDCGA
>SMa2157 probable oxidoreductase
MGAALPVRPAGAQQASRTPIRRPIPKSGEMIPAIGLGTFETFDILPGEPR
DDLRDVIRLFHENGGRVIDTSPLYGTAEVCVGDFIMDLGIADDIFITNKT
WTTGDYLSDNSHSERQLRQSRERLWRERIDVLQVHSLENHDQVRHWLAHK
KAEGSIRYIGITQWSPEYYDTMERLVNTGTLDFVQIAYTIVTRAAEQRLL
DACSANGVAVQVNTPFEKARLFTPVAGQPVPDFARELGVETWAQYFLKWI
ISHPAVTNVIPATSQPEHVVDNMGALYGDLPDQAMRKRMTDHYTGLTGVA
DALKQPPYPGKQYGGVVKWPFPQPKRT
>SMa1361 Hypothetical protein
MADDKTKTAADRRLVSGTQKYEVDYFARKHGIAAADARRIIKQHGSDRDA
ADKAASRLKG
>SMa1608 hypothetical protein
MATITHVFTINHVAEMLGEDLELLEAIVSNSDNLTYGSIISVVHGDDETI
TALTDDGIDELRQMLADARRSPEAWNDFLDCFVDDEEVIARVKAKSPR
>SMa2325 probable transcriptional regulator
MTAERILVADDEPQIQRFLRPALAAAGYEVIEAANGAQALKAAVTAAPDV
VILDLGLPDMDGKDVVANIRAWSQVPIIILSARDRESEKIAALDLGADDY
IEKPFGIGELTARIRAALRHRIQMAGGQAQLSADGLSIDMVKRVVTRDGA
ALRLTPKEYDLLVMLAHHAGRVVTHRTLLTSVWGLAHGEDLHYLRVFIGQ
LRGKIERDPGNPKIVRTEPGVGYRFVGDED
>SMa0719 putative
MTGSSRGIGASIAQAYAAYGARVVLHGQRPGATAEIEKAIRAAGGDAVSI
HRELSPPSAGRDLIAAAEGAAGPLDILVINASAQINGALHDVTPEDFATQ
IDVNLRSTVEMLQAALPAMAERGWGRVVNIGSINQLRPKSIVSIYAATKA
AQHNLIQSLARDYASRGVLLNTLAPGLIDTDRNAARRDGDPEAWSNYVRT
LNWIGRAGRPDEMVGAALFLASDACSFMTGEAVVLSGGF
>SMa0803 putative ABC-transporter ATP-binding protein
MNRPFLQIRGIRKEYGPVTAVQDVTLDVAQGEFLTFLGPSGSGKSTTLYI
LAGFENPTRGDILLNGETLLATPSHKRNIGMVFQRYTLFPHLSVGENIAF
PLKVRRLPKAEIDAKVRAMLKLVRLEGFEDRKPAQMSGGQQQRVALARAL
AYDPPVLLMDEPLSALDKKLREEIQHEIRRIHQQTEVTILYVTHDQEEAL
RLSDRIAVFSKGVIDQIGTGPELYANPATRFVAEFIGDSDFLPCDVVSTV
NGRAEIAVCGSMTFSNIPLHGTASLGSKAALMLRPERLLLSKTKSDIGLP
VTVSDITFLGNNVHVATKTRKGNDLSVRLPFGHEAISGLNRGDAVWLRFD
AGSAHVFGQ
>SMa2207 putative ABC transporter, ATP-binding protein
MSNDSIQVEGITKRYGAMTAVDNVSFDVGQGEFVSLLGPSGCGKTTTLRM
IAGFVDPSGGLLRVRGRDVTHLPPEKRDLGFVFQNYALWPHMTVAENVAF
GLKLRKKNRNFIRDKVAEALTTTGLSGYENRLPRQLSGGQQQRVALARAL
ALEPQVLLMDEPLSNLDRALRVTMRRELKELQARMKMTTLYVTHDQEEAL
SMSNRVVIMNKGQVEQVAAPFELYENPATGFVADFVGITNFLGGIVTSAE
GGGVEVKLGSGQMLRAVTKQPVALGTKVRALIRPERVTLKTTAADGDNIL
TGQVVLAEYFGALLRYSVRLDSGDILRAEVHNFDSFIESGSRVWVCVSPE
HLRLIPDERNA
>SMa0742 hypothetical protein
MRGGRYAERDICSKSQGRSRAVKTLISHRGTKPAVTLVARSRVSGRMESG
PFDGRYKPPQALANTPQIVQLLRVICLRYIFEIVDDEGQSRAYVASHMYR
KADKLIVLHRGAEVGIDICRICDWCAVDEDEAQRAGTAQTGWLHDC
>SMa0279 conserved hypothetical protein
MGIYRDVILPRLCDLSMRNERLRPYRERVIGAAQGRVLEIGVGSGLNLPF
YGPVVGEVLALEPSAGLVAMAREAPRSDLPVSFIDASAEAIPLDDKSVDT
VVTTWTLCTIPDAAAALTEMRRVLRPGGKLLFVEHGLAPDRGVRWWQDTL
TPVWRRISGGCHLNRPIRSMIECGGFRMERVETGYMQGPKPMTFMYEGSA
RPE
>SMa2077 probable oxidoreductase
MVEYRYLGRSALKVSPLTLGSMMFGNQTPDDVAFRIIDKAREQGINFIDT
ADVYHDGKSEEVVGRGIKAHRDHWVVATKFVNSRQKGPNVGGYSRKWVYQ
TVENSLRNLGTDYIDILYFHRAVFDAPLEEPVRAIADLIKAGKLRYFGVS
NFRGWRIAEVAHLAYQLGIDRPIASQPLYNIVNRTAEAEQLPAAAAYGLG
AVSYSPLARGVLTGKYNPGEAPAADTRAGRGDKRMHDVEFREESISIAKQ
IAVHAQAKGIAAADFALAWVLNNRLITSTIAGPRTEEHWDGYIRALDVKL
DAEDEALVDRLVAPGHPSTPGFTDPGHPLEGREPRSGATEAEIIPLARSQ
RVA
>SMa0091 hypothetical protein
MKTALAVFGGFVLSLALFLSGAVVAVLFLTGKPARQPQLDVNQSEVWTKQ
PRAVDRTAQQFERLPARPAPSDVNASREPETPAMANEAEKERSPEPLDRM
ATASIQSPPAEEEPTASTVPAAHAEWCARRYRSYRPFDNTYRSFSGGRRS
CNSPYMDAAWGPSEDPSPVPRGIDAEDADDPSTQMEYSAGDGEAIRLTQE
HVAYCFSRYRSYRPEDNSYQPYSGGPRRQCR
>SMa0470 putative ABC transporter, ATP-binding protein
MPITLPLEIVVERGETIAIVGESGSGKSLTARAIVGILPPGINAKGAVTL
DGVPLMRLAERELRTIRGSRVSMLMQDPFTMLNPLMRSGDHIDEMLRDRP
EFASRAVRADEVKRRLAEVGIVDEDVARRMPFQLSGGMCQRVALAAALAR
DPELLIADEPSTALDVTTQAEIIKLLRRIQRERNMAVILITHNLRLAFST
CQRIYVLYAGSMLEVGDAAAVERQPFHPYTLGLLLSEPPVDIRVPRLVAI
RGSVPRAADVIDSCGFADRCEWAKQICRAGKPSLAARDASRFTACIRQDE
IQGELDALRSATLSATPETPRRGGTAGALVHVDALVKTFAGRRGRPICAI
RDVSLHIMAGESVGLVGESGSGKTTIGRCLVGLETPTDGDIRINGIAAAD
FGAMAKADRDRVRRTIQMIFQDPYSTLNPKHSVGQALREALGASAGAPSP
APQERIASLLAEVGLSAAYATRRPASLSGGERQRVAIARALAVKPAILVC
DEPVSALDVSVQAQVLNLFRRLQVEHELSYLFITHDLAVVRQIAERIYVL
YLGEIVEEGPTERVISNPQHPYTRRLIESIVRSAIQRAP
>SMa1147 Conserved hypothetical protein
MTYKTVLLVLDANQYEADLAAAAELCAAANAHLSVFLVKVAAPSRFGDYA
ALSVAWLDIRAAEFEQLDEAVEAARTTLKDLDLSFDVAGEYSEPAWADDL
VGERARYADVTLVGTSMDPSFRARAIEGALFYSPCPVLLAPRRQSVTLLP
KRILLAWNSSLESKRAAREALDMMKNAEGVNVVLVDPAASRWNGHEPGAD
VATYLARHGIKVTVDRLPSAGRRIDEVLNQHAIDTSAELIVIGAYGHTRL
RQRIFGGVTKAMIEAPVVPVLMAR
>SMa1056 Putative transcriptional regulator
MGMSTDLEAVLKAGLSPAEMASRAGEVAKLLKTLSHPARLMIVCTLVQGE
YSVGELEEKVDVHQPHLSQHLTVLRGSGIVQTRRDGKQIFYRLTGEKAAR
LIAALYDIFCVKEDK
>SMa1149 Conserved hypothetical protein
MTYKTILLIIGVNDQENDLAAAADLSAAAGAHLSVLVVQLAAPPPVGDYA
ELSVAWLDERAEDMKHLDDAVRRARATLKDLGISFDANGIYCETAWADDD
VGSRARYADVTLIGASLQTDPSLRNRAIDGALFYSARPVLLATSRQSVTL
RPKKILLAWNSTIESARAAREAMELMESAEEVNVVLVDPSAAPARNGEEP
GADIATYLARHVVNVTVDRLPSAGRRVEEVLNQHAIDTSADLIVMGAYGH
TRLRERIFGGVTKAMIERPIVPVLMVH
>SMa0380 conserved hypothetical protein
MATRRSFLGGASSLAFANFFSPAKAADPNQTGVDTMHPDLILHNGRVTTL
DRTNPNATAIAIKDGLFLEVGSDSEVMALAGSGTKIVDLKGKGVLPGLID
NHTHVVRGGLNYNMELRWDGVRSLADAMDMLKRQVAITPAPQWVRVVGGF
TEHQFAEKRLPTIEEINAVAPDTPVFLLHLYDRALLNGAALRAVGYTRDT
PNPPGGEITRDANGNPTGLLLAKPNAGILYSTLAKGPKLPLDYQVNSTRH
FMRELNRLGVTGVIDAGGGFQNYPDDYEVIQKLSDENQMTVRLAYNLFTQ
KPKEEKQDFLNWTQSVKYKQGNDYFRHNGAGEMLVFSAADFEDFRQPRPE
MAPEMEGELEEVLRVLAENRWPWRLHATYDETISRALDVFEKVNKDIPLE
GLNWFFDHAETISDRSIDRIAALGGGIATQHRMAYQGEYFAERYGHGVAE
ATPPIRRMLDKGVNVSAGTDATRVASYNPWVSLSWMVTGKTVGGMQLYPR
ANCLDRETALRMWTEKVTWFSNEEGKKGRIEKGQFADLVVPDKDFFSCAE
DEISFIVSELTMVGGKIVYGAGDFKTLDENEIPPAMPDWSPVRKFGGYAA
WGEPERAGARSLRRTAISTCGCASDCGVHGHDHAGAWTSKLPIADLKGFF
GALGCSCWAV
>SMa2305 putative ABC transporter, periplasmic solute-binding protein
MQATHCVAGAGILFAGGQPWEETGRAPRSSHSREDYRVTFNWIERSLSRR
TFLKGTAGVSTALSGFPIPALAQSSEVTIISAESNANTAEVLRRIAADFG
KAAGTKVVVNNMDHEAHKTAIRNYLVAGAPDVCFWFSGNRMRAFVTRGLF
DDISDLFEKEKYADVLGATAGSVTVEGKQYGLPTGGTLWGMFYRKDVFDE
HGLKVPTSWDDLLAFGEKCKSASLTPVAMGTKDLWPAAGWFDQMNLRING
LDKHFALMNGEMAYTDETLKPVFEHWEELIKQGFFTPDHTSFGWQEAGAF
LAQKRAGMMNLGAFVRAAFPRQDLPQLTFAPFPVIEDGLGRYEEFSLNSV
HIPTNAKNKPGAREFLTHFYKPENLAAYLEPGGNVPPRNDLPPSKDPLVN
AAVESLKAVAGTSQFYDRDTDPDMAQAGLVGFQEFMARPERRDAILQRLE
GTRRRIFKL
>SMa2233 hypothetical protein
MVPEEAPVSIPQSTKRIRKQDVAVAFLHRRYSFASRLGDCMADFDFRKYS
DDERLLLISFENGGPMTAAQVAAILKALDADYRRMTGRELVLARLELGST
WIWLADIATSAGGWIKGTAAVVKATQDLASFAKKLREGFKPKQEPVPLNQ
LGAFDDSVDRSITAMAKASEETHSTIRMRKTITTDAGTETIDIEVTPKEA
KEARKRVKDKPKTATLAAPDVPKAIAHHDELVGTMRALPQLTGELEAVVR
ALVNAAIVNGGAYLVEQVAGTLEAEGRWDIASIIRQHLDGGRGFVRVDN
>SMa0751 putative aromatic-ring hydroxylating dioxygenase, alpha-subunit
MTANPTSIHQRLDRRLSGFSLEQPFYTSPEVYALDLQHIFYKQWLYAVPV
CQLAKAGSYTTLRVGAYEVVIVRSRDGEVRAFHNSCRHRGSLICKARQGQ
VAKLVCPYHQWTYELDGKLIWANDMGPDFDASKYGLKPVNLRNLDGLIYI
CLSDTPPDFQTFAQLARPYLEVHDLKDAKVAFTSTIIEKGNWKLVWENNR
ECYHCSSNHPALCRSFPLDPEVAGVQADGGVSKKLQAHFDRCEAAGTPAQ
FVLAGDGQYRLARMPLQEKALSYTMDGKAAVSRHLGRVAPPDAGTLLMFH
YPSTWNHFLPDHSLTFRVMPISPTETEVTTTWLVHKDAVEGVDYDLKRLT
EVWIATNDEDREIVETNQQGILSPAYVPGPYSPGQESGVMQFVDWYAASL
ERALAPRQVAAE
>SMa0548 conserved hypothetical protein
MPGNLHVRNLDDDLISKLKIRAARHGRSAEAEHREILRQALASEGGPDFE
ELAADLRKLTASRKQTPSEVLLRESRDER
>SMa1503 Hypothetical protein
MSRTALCSDRVLLNDWHVVADLTNLSSTAPFHTRLLGVDLTIRCGGRYRR
QVVRSDGGEPVNSDSRYGFLWACLGKPERDIVFVPEANEADRYLVTGGSI
AVNVSGLRAVENFLDMGHFPFIHTGWLGEEPHTEVAPYKVELTDADEVVA
TECKFYQPVASPTAKEGFVVDYIYKVIRPYTVALYKSNPVHKARLDVITL
FVQPVDEERCIAHPFLCYLKEGVSEASIRSFMQLIFAQDKPILENQLPKR
LPLDPRAETPIRADAVSVYYRRWLRDRAVTFGAIPARM
>SMa1041 putative copper oxidase, possibly exported
MPMFNQPKDHTMKALIFGLLVAAHASPALAAGSHAGGHGEAMVVGEPGKK
AQATQTIQVTMKETDDGKMIFTPSTFNVSKGQTIRFAIKNAGELDHEFVL
DQEDKIMEHKAVMEKFPDMEHDDPNAIRLAAGESGEIVWKFTNDGTFKIA
CLVPGHYDAGMHGDVTVAKK
>SMa1356 TRm3 transposase
MAIEKELLDQLLAGRDPSEVFGKDGLLDDLKKALSERILNAELDDHLDVE
RLEGGPANRRNGSSKKTVLTGTSKMTLTIPRDRAGTFDPKLIARYQRRFP
DFDDKIISMYARGMTVREIQGHLEELYGIDVSPDLISAVTDTVLEAVGEW
QNRPLELCYPLVFFDAIRVKIRDEGFVRNKAVYVALAVLADGSKEILGLW
IEQTEGAKFWLRVMNELKNRGCQDILIAVVDGLKGFPEAITAVFPQTIVQ
TCIVHLIRHSLEFVSYKDRRTVVPALRAIYRARDAEAGLKALEAFEEGYW
GQKYPAIAQSWRRNWEHVVPFFAFPEGVRRIIYTTNAIEALNSKLRRAVR
SRGHFPGDEAAMKLLYLVLNNAAEQWKRAPREWVEAKTQFAVIFGERFFN
>SMa2375 TRm2011-2b transposase
MAPLRGWAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGE
RFRIYVQQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKY
SPDLNPIEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFK
EAGYERA
>SMa1461 Putative muconate cycloisomerase
MVKISNVRVRPLVLPLKQPYHWSYGIRESFAVNLIEIEADDGTVGIGECT
VAPDQTGTAAILYRLAKHLVGHSPHDVAPLIARIFHQEYLGHGANIMRAA
NQIFSGIDMAMWDLQGKLAGLPVHQLLGGAHRKAVGYFYFLQGETAEELA
RDAAVGHAQGERVFYLKVGRGEKLDLEITAAVRGEIGDARLRLDANEGWS
VHDAINMCRKLEKYDIEFIEQPTVSWSIPAMAHVREKVGIPIVADQAAFT
LYDVYEICRQRAADMICIGPREIGGIQPMMKAAAVAEAAGLKICIHSSFT
TGITTCAEHHIGLAIPNLDDGNQIMWQLVQEDIVSSPDLTPKNGWLDAFR
KPGLGFQLAEDLVAEGEGRYAASR
>SMa0323 hypothetical protein
MSAYLIVDLDIHDMKSIEDYRSKALPLVAKAGGKLIAIDESPLELEGWIA
TNMLIIEFPDKDAIRQLFASPEYAPLAAQRQAAAASRIIAINGV
>SMa0526 probable maltose/trehalose-like ABC transport protein
MIQIESLHKQFGSYEAVQGVSLSVPKGAFLVLVGPSGCGKSTILRMLAGL
EAPTSGTITFGGNTVSDGGRGWVIEPSRRDTGLVFQSYALWPHMTVAGNI
DWPLKVAGLDRDKRCSRVGEVLDLLGIGQLAARYPNEISGGQQQRVAIAR
MIAPKPGILLFDEPLSNLDAKLRVEMRTELLRVHRATGATSVYVTHDQVE
AMTMATHVAVMNAGRIEQFGSPGELVARPRTAFVATFVGTPPANLVPVVN
GAYCGRLADAALAGRNGSAMFRPEELTLAESPSDRTLTLDYAEASPMAGR
VMVTGIRADLRLTAIVDSLPSFSVGHEVHFQLPPAPSMFFTPEGVRAQ
>SMa2223 putative opine oxidase subunit
MSDRCELLVIGGGLIGSAIAWGAARKGARVTLLDEGDIAYRASRANFGLV
WLQGKGLGHPDYMHWSIRAGQLWPELSTILLKETGIDIGWRGGGGLHFCL
SESEMAARRALIARSSAEGAAIRIQLLDRDALHEIVPDIGPEVRGASLSD
LDGEANPLLTLNALQLAFQQNGGRLVVQFAARNIRSEPARGFVVSDANGD
EISGRRVVLAAGLGNNELARQVGLKLGLTPERGQIVVTDRIAPFLRYPSN
AIRQTREGTVLLGSSHEDAGFSTGTDVETIARLCRIGTRVFPALRAARLI
RAWGALRIMSPDGLPIYEEAPEMPGAYVVTCHSGVTLASLHALELGPALA
EGHLGPAPRSMRSTRFAL
>SMa0097 putative transcriptional regulator
MEIKWLEDFVTLADTSSFSRAAELRNVTQPAFSRRIKQLESWLGATLISR
ATMPAELTPAGRNFLPVAQEAIRTFYAAREVLRLPHEPGLIRFAALHTLT
VTFFPRWLKTLEAAGGSFSTSLIPDRGGIEANLDALVGDEADFFLTYAHP
EVPFHLDSGQFSSLTVAHDRLIPLVAAEIVLSGDPQPGLNLLDRAIAQPR
LAIPYLSYGFNSFFGVALSRLLLRRPPFRRRTTHENTISAGLMNMAVTGA
GVCWLPESLAREEIDARRLVPASADEGWNLDLEIRLYRHAASRNRKVEEL
WATAQRLLEPAPA
>SMa1377 Putative amidase
MTALENLSIRKLAAMVRAREISAVDVTTHFLGRIVAYDDALCGFNVPAPE
AALDAAHDLDTYLNAGGEAGALAGIPLSIKDTADVAGLPSAGASASRSGR
TATADATVVARMKAAGGIVLGKANCHELAFGGPSFDLPFPPARNPWNLDR
FPGGSSSGSGVTVAAGLCLGSLATDTAGSIRLPATMCGVFGLKPGHDTLP
LDGIAVLAATMDHVGPVARTADDTRILFDVMAGRSPGSPFVGSLKGLRIG
VPENEWDVGRLIHPDVRAAIDGAIEVARSEGADIVPLRLPSLEDFHAPGT
VLMMCEVAAEHAASVRAAWDKFGAIFRARALVGEGIAVHDFRLAERLRPV
LRKRLLDAMSNVDCLLVPGALAPPGPLASVDPFYFMKDPIPNIVANYTGF
PALAFPAGFGGDGMPVGVQIMGVPRSEHRLLDIAESFERADPSRFAARTP
PGLEGRPEPCLFKLELPARAM
>SMa1664 putative drug resistance protein
MFKTAFRSVDFVLGVSGLLLCSAGDGVAQTGPVVGVMTVQVENVSPAHEF
VGRVEALNAVDIRARVEGFLERRLFAEGQNVEKGQDLFTLERTTYELALE
DAQATLVGAQTNFDNAERQLQRNRALSQRTVSQAVIEESHAARDIARASV
LSAQTRVNQAELNLGYTHIKAPIDGRIGRAAYSVGSLVSPSSEPLARVVQ
TDPIRVVFSVSDRTILDLRTIAGGAGKDELAKGYALKLRLSNGEPYQQSG
KLEFFGNEIDVQTGTLPIRALFANAQSLLMPGQFVTVIVEPEEREERPVV
PVGSVEQDREGRFVLVVDGESRAAVRRIRASVQVGQNWVVEEGLQGGEKL
IVEGLQRVSPGAVVEAQSVSAGDAATDTAAPAPRLSSQ
>SMa1092 hypothetical protein
MPRIVKVPLKTATSMSVSSIPGSSQVISHLLSVSAKSTAGATSNSDSRGT
RRMRAEAKGRCHRFGAKSSNSRSISDLRLSKGSQDSARRSTSFSSDLMGS
LLAGSAMAGVLLQSGGSRLVSPRVAYGRGRKPVWPTSTGASPPAATPDEI
IKTPCCGSLIDWRQRPGDL
>SMa1677 hypothetical protein
MIVMCQKTPGSRQLARCNAVVHAVTRAEALFLLARPVFTLLRSGREMLAA
ALVLGLCATASADDVLFENVRIFDGKGAVLSAPSNVLVKGNVIAAISTSP
IEGEGAERIAGDGRTLMPGLIDAHWHAMLAASSPAEAMGDVSFASILAGE
EATDTLMRGFTTVRDLGGPAFGLKRAIDQGIIPGPRIYASGAMITVTSGH
GDFRQLTDLPRRTGGLLTPMETVGGAMVVDSPDEVRLRVREQFMQGAVLI
KLTAGGGVSSPFSPLDVTTFTEPELRAAVEIAENWGTYVAAHAFTSDAIR
KAIAAGVKCIEHGFLMDEATARLIAENDIWLSLQPLPELMRTGLRDGSVE
RAKADEVWPGIGRTYELAKRYKIKTAWGTDVLFSRALAKQQGAILASLVR
WYTPAEALVMATATNAELLALSGQRNPYPGKLGVIEEDALADLLLVEGNP
LENIDLVADPANSFKIIMKDGVIYKNTLTP
>SMa1913 Putative transport protein
MNDSLSRLPREPADRLTKPFMRFLRIEATAGIILLLSTLLALGLANTAWS
SSFLAFWEMPAGVRLGDIGIYRSLKHWINDGLMTFFFFVIALELKRELVL
GELRNPRMAALPVAAALGGMAAPAGIYLLLVGGGPGASGWGTVMSTDTAF
VIGCLALLGSRVPGSLRLFLLSLAIFDDIGAILIVAVGYGEPLNWVALGT
GGLGFAFVAGIALLGIRSIPVYFAMGSAIWLAFDASGVHATLVGVILGLM
TPARRWVSEIRLHAILDRVIAHPPGDHRSRDTAARSDLHRAGVATREAVS
PIERLEIALHPWVAFAIMPLFAVSNAGIPIEDANFDVPLTIAIVVAFVVG
KPAGIVLFSFLAVKLRLASRPEQLSWSLLAAGSLLTGIGFTMALFIAELA
FEPELLIPVKLGVLGASVISAALGFMALTLLTSPNRR
>SMa0572 hypothetical protein
MKTIIPRQLARSDVEAAIDYYAREAGTEVTHGFIGALQAAYASIASHPEA
GSLRYAYELGLPDLRSVSLKRYPYLIFYRDQPDHVDVWRVLHAKRDNPQW
MQEPNNH
>SMa2231 conserved hypothetical protein
MILADTSIWIDHFRHTDAELRRIIEDDRLLCHPAVIGELALGSLRERSSV
IAFLMAQREALVATHQEVMMMIDRHAIFSMGIGYTDAHLLASVLLDQRMA
LWTRDKRLQAAAEKAGASLHTPAHTRN
>SMa1636 TRm17a putative transposase
MRQERTVQGSIFDLFAEHEIGRELEAMSQWLDAHRDLLNLVTSDLRRQGV
TETGRQGLPSEAVLRCALLKQYRQLSYEELAFHLEDSASFRAFARLPWGW
SPKKSVLHKTISAIRADTWEAVNKMLLASARQERLXSGRVVRVDSTVTAA
LIHEPSDSSLLWDCVRVMVRLLQQADSLGSTIPWHDHWPRGEXAGSGDRV
YPRSPETGSALPRSAQDRPEHPGLSAAGGGTVAAGGRPGGQTLAGPGSPL
SXTDHADHRPRPSGGC
>SMa0292 hypothetical protein
MDCRLRLDFRPPRILPRPLNQRANRKLLAAHEMDEWPIVIEALVQWPGSC
TASAR
>SMa1552 putative methyltransferase-chemotaxis
MTRHGPSLELAEQVAGRVGLSFSASRKETVASAIDRVMVRRGIGEGRLLL
DRLGSDEDLTEDVISAVTVAETHFFRGPEQFELIRQTILPELLRRRPEGS
PLRIWSVGCATGEEPYSLAILCQEEGLLDGVRIDAADISRRALAAAKTGD
YGEWALRNTGPRLRQRYFTSCEGRFRLNERLRGQVDFAHLNLGTDALPAP
EKRLSEFDLILCRNVLVYLEASAVRRIARQLLDCLSDGGWLLTAPTDPPL
WKYAPFETSITPAGIVYRRIMAPDTRKSSTACVAPVQRQTGNPVAGVPLA
RARAAASRKTTGDDTAKAIARQIRSRFDLGEPREAARLAARAIERHPLSA
ELHFLEGVSRMASGETDAAKAALRRVAYLDGKLAAAQFLLGVCLKDSDPR
AALRALENALSSCLSRSPREHVELMPEATAGHLAQRARREIATIRQHLGS
EAG
>SMa1493 Putative LysR-type transcription factor
MPKEDFNDLLWFLAVAEERSFTRAAAKLGITQSTLSHTIKRLETRMGLRL
LTRTTRSVAVTEAGERLLRSLAPRISAIRDEIAALMAFRDKPSGTVKITL
SDHALESLVWPKLQPVLLEYPDIRLELSRDNGFRNIVEDGFDAGIRLGES
VEKDMIAVRIGPDWRLLAVASPDYLADRPLPEHPQELIQHNCINSRQATG
GGHYVWEFEKNGQELRVRVDGQLTFNSSYPMVDAALKGHGIAYLPEDLVA
GHIAAGRLVALLEPWSPPFAGYYIYYPSRHHNSPAFKVIVDALRHHGTPD
SEGTRDA
>SMa1158 Conserved hypothetical protein
MLFSYLGLISVGRASVGPQAGGIDMVQNIKSILIGLTKEFGRDETSSALA
YGISLAQQAGAHATIQAASIKLSLSSAGVTKVVAGLVREENQRLHSLAQA
AARAAEAQASSAGVACSTESPHLSYPELLAVFRTQARLHDLTIVDAESES
LAIDRGLIEALLMESGRPLLVVPATQDVFRAKRIILAWDGSGRAARSAAD
ALPFLRAAEAVEVVAVSGEKDLPATPTCSDAALHLSRHGVHVKAQTLPVL
NGDVAETLRKHAMLIHADLIVMGGYVHSRLRELVFGGVTQSLLKQSPVPL
FMAY
>SMa1194 hypothetical protein
MSSIWRAPYRPLFFLAGLWALIVPIVWLLPEQLVPDRVEWHSRELLFGMG
GAAAGGYLLTALPAWTRGAVPPAATVIATCLWCAARLTGAFSDHLPLIAA
AIGVSGYFAFLTAMLMRGVVVSRAWARCWAPLGTGALGVNAFVSIADGPS
PTPLLFAALIVVVGGRAVPAFTGSWLYRTAGGKSLRNRRELSHLAVAGIL
IATFLHGDWSPTLPGLLLLFSGAMLLWQMSEWRSLGTRGYPALFILHVAF
AWTPTALLLGGLSATISGHVPADDALHALTMGAMGTMIAAFMMRPAMVRD
GESLILGGTMAGAFSLVSLSALLRTSGGWLDADHFEPEAAAAICWMAGWT
LFLIAYLPAMSGPVPRPAFSAALGNQVKRGNGTPPWKAGCLCP
>SMa0537 hypothetical protein
MTHTKAPITAEDTIRSWAGAVAACDIEAVCYADPLLVFDVVGQLQREGKA
LYRRAWEDEFFPWHGGTGKFALRELSVHAGGDIAFATGLLDCGGTEGGRP
AEYTLRLTLGLAKTPNGWYIVHEHSSEPTSFNEKVVG
>SMa0287 hypothetical protein
MNKGSTKAKRQENSRCENEEPTRKPGTSIAEWIVAGVSCLALLAVLGYLI
LDGLSGRNGAADIIVLPAEVTAMNDGYVVEFAANNRAGKSVAAVEIKGEL
RGGEEVVEESGVTIDYIPQKSERKGALIFRSDPEGYELRIFASGYSEP
>SMa2307 putative ABC transporter, permease
MTSLWRRHRWWLTPTLLILPGAILFAVVILASSVQSLWISLHDWDGFGPM
IWIGLGNYRELINDPQFYVSLKNNLIWLVMFMLAPPIGLAIALLVNQKIK
GMRVVIPLVLASVAVGVVFTWVYTPEFGLLALIFRAFGANAPALLSDEHL
VTFAIVIAALWPQIAFCMVLYLAGLNNLSEELIGAGRVDGARGWNMLRHI
VLPQLTQVTFIAIAVTVVGALRSFDMISVMTRGGPFGSSSVLAYQMFEQS
IFSYRFGYGAAIASVLFVIMAVLYLVPHAHHPRGRKGRLMYPRPVPETAQ
CEPQGCT
>SMa0809 hypothetical protein
MASLPFNSNEGRGTFGSRETRLVDPINAEPLNGRVEVALPEIQRANGTKV
EISQHRVADTPTEAVAQTKERHTPSQRLSTSGTARNRTKRLHDLKVGHEG
RKAQTRTVDDPISLDELAALGADNKLLKRLLAELLLAQNLWLKKMLERFD
AERDVAPLASGRKAREPEFRG
>SMa0759 hypothetical protein
MVVEVKKAGAVDFIEKPFENTVIIEVIERASEHLVVPQADADEVNDIRAR
LQTLSERERQVLSAVVADLPNKSIAYDLDIQSSLRRGASRERHGQNEGEE
PSPPRANGSHRWLRSFLSRRVDLAQCGALPQAA
>SMa1745 putative iron uptake protein
MGPEGLREAAQRGSQEVAMTAGIIKTSDGRWSPVSRSNRMRSLWLAALFA
LVIVLLAASVTVGTRNVGWDDVAAAMGGAQNNIDRASVALRIPRTVLALL
AGGALGLAGAIMQGVTRNPLADPGILGVNMGASLAVVVGVAWFGMSSLYA
YIWVAILGAGCAAIFVYAIGSLGRGGATPLKMALAGAATSVAFASMVIAV
VLPRGDIAGGIHSWQIGGVGGATYERILPVLPFLLVGFVISLLSARKLNS
LALGDELATGLGESVATARAVASLGAILLCGATTAICGPIGFLGLVVPHL
CRLLVGVDHRWLLPFSTLAGACLLLAADVLGRIVARPAEIDVGIVTAMIG
APFFIWIVRRQRVRDL
>SMa0443 hypothetical protein
MVEAEVIQNKLKIAAIGFEVDVEDIAKHRHTSRNRVKTDIDQHLYELVVR
HAETPSLVDDDETDGGGRKIADAGDQAKDRVSPERHACTGNSERRIHQPR
QSAYPAKTRELFDGYVGLLHSFCVVSARRFKCHAAALCLPSRLMACSCGR
CPS
>SMa0994 hypothetical protein
MRSFLRAFIGFVLEIRPNFLQRGVVLALTLKRSKSFQIQHVDFLSRQRSL
TSIASYWACVPTNRTKIRFDRNATSPTMRYLLPPTSKITLLPATISAELN
VFFNSLKLLQLAFDAVVYHSIIAASAVSLGAPFGKLLQKFTSADLAITFT
RTPN
>SMa1840 Hypothetical protein
MSLLKTIETNPSFAPRESSPLPERLISGNPAFKTWAQDVARGEMIQTGVW
EASPGETRSVKGETFEFCHILSGVVELTPENGKPVVYKSGDSFVMKPGFV
GIWKTIETVRKIYVTVM
>SMa0722 conserved hypothetical protein
MRFDRWNTAFCDMDFQILSLSYRTHRVTAGTQEPRSNELRSRSVEVISGE
DRFRIDADAIRSASWMKRIAGIVGRADAQAPRPGRITFRVRNHGWTEESP
LSFMHERVGGWSYVIATDGQCVEMLEDDYQQLPIGLVGRHRRCRYDPASK
NRLVAACLEPFASAARLALEHRVNANLLCKRIKKRAEPASRPVLITGDYP
SSV
>SMa0039 probable LysR-family protein
MARKTAQSKRGVFNMDRLTSMAVFVKAVDLGSFAAAAEALGLSGPMVGKH
VRSLEDRLGVQLINRTTRRQSLTEFGYVYYERCRVVLAEAQAADALASDQ
LSEPRGKLRVTMPVHFGRHCVTPVLLELARKYPALELDLSFNDRIIDLAE
DGFDLSIRTGVLEDRAGLIARRVARQPMIVCASPSYLESHGCPEAPEDLA
QHAAVVYRRSGPVSPWLFPRPGQSSSEVTPLHRFRLDDLDAIADAAARGL
GLAWLPYWLVRGRIEAGTLVRLLPEQPPFLYDCHAVWLQTPHLPLKVRVA
VDALAAQLPSFMT
>SMa0343 hypothetical protein
MEKTPDTFVDLGRGTPAWFTFAARILPVGILGQLLSAGLALFRDSSLWGL
HAALGGALSFPAVSLVAGALFVSRLRGFGWWAGLTFLLYLLQVILAAGGM
PLLLSFHPVNGAFLLAASLVLLANVERRRSQSLAEDNATAKPGK
>SMa1776 Hypothetical protein
MVNSHPVALLDLGRTSTSYIKQVGLLNFFWAAVGFLLLFCLAGCATQHVE
PPTLPNTESAALHRTQTVRSHPSSGTFDLLYITDRAPITASDTALSYGAE
RAIFLSFGSVSIASTRKPLTSKNELRVSAVSETGRFPATPYGVEATANGL
SRSLEAVTAHDQAAASLQAEVARRLSTSDRKEVVVFIHGYGNSFDAAALT
TGEICRSLQNQFVCIVLTWPAGGSGGFFFGYNIDRESSEFSVADLKKAIR
IIAETQGLERLHLLAHSRGTDVLASVVQQLSIEAYVSQSSMWQRYKIANV
VFFAPDIDLDVASSKMFAWVSDPDLAFGNKPRPSTVPPQGPLHLTVYSSP
RDKALGASTLLFGSALRLGQLAVDRLPENRSEAASRWAGSQMGELVDFIE
FPGGGFIGHSYFLSNPAVKADFIALIRARAKAGDPRRQIVEIKRPFWRVS
DVQQAW
>SMa2359 conserved hypothetical protein
MEDFILFALVGFLAQVIDGALGMAYGVICSTVLLAFGVPPAQASASVHAA
ELFTTAASGSAHLYHRNIDWKLFWRLIPFGIAGGMLGAFVVTSFDGDQVK
PFVTAYLAVIGAWLLYRSFHRIPTNPVKLRIVAPLGATAGFLDAAGGGGW
GPVATTGLLGAGGQPRFVIGTVNASEFLIALSVSLSFLATVLTGHWEQAG
DFRDHLTSIGGLITGGVVAAPFAGWVVKALKEKTLLRLVGSLITLLAGYQ
TLELTGFL
>SMa1951 Hypothetical protein
MKVVAVAQAVLFRRMRAVMLRPHDKGLIATTLNFDYEVRSAKEAFKEIPD
IKIEADMPDLARHIIGMKKGTSSAEECDDRYEPHSPS
>SMa1417 Putative
MKAAVLVEPRRFEVREVGIPEIGPADVLIRVTRAGICGTDLHIFNGHYAA
DRLPIVPGHEFCGTIAEIGASVTHLKTGMRVVADINIGCGNCYWCRRNEV
LNCGEVEQIGIGRDGAFAQYVALPGRLVLPVPDGVPEAVLALVEPVACVV
RAARKAGAAFGRSGVVLGAGPIGNLHVQMMRLVGMAPIIVADLSSERCRM
AVEAGADAAVSEPATLRSKVLEMTGGRGADFVVESVGSSKLYRQAFDLVR
KGGHVAFFGITPPGETIPIEILRTVLEENSLKGSVAGMGEDMHDALTLLS
HGRFRTAAFTAAHYPLERIQEAFETIPARTGHLKTQILLDA
>SMa0303 putative LysR type regulator
MVRRNLPPLTTLRAFEAAARLMSFKAAAEELRVTQSAVSHQVASLERNLG
TPLFVRLPGRVELSQEGTVYFPVVQDALDKIALTTDLIRKTNTSASLTVQ
VYVTVAVRWLIPRLQAFKEASPEIAVNLDASLLDWEFNPDRADVGFIYTR
APNRPNLTYTLLRRERLVGVCSPAIARAIKTAEDLRHFSFLSVSGTTDDA
ETWAASVGVAGLSQKSSPLFDSNLLAIEAAANGQGVVVVPQFLVEGDIAN
GNLVAPLASDVMQPGGWYLVHLQRRGNEKAIRKFLQWIQSQT
>SMa2271 hypothetical protein
MHLAGSPQPDELGLVLVVTIRTCLRELLLYERLPAELSSTASSNRSRSGK
SRSVSMPKAERNVQKQLGHDPQIAAPTRPI
>SMa1900 Hypothetical protein
MMDLVRSFMGPTWAPLRAMVSGSQIEAKSEAELILGTDLSLSAEAALYFP
RHQLDTSRPMRSAQPSSAVSCSRQIGDDLLSCIEHLILLDLLDDRPTIDR
MDRRLGMSRRTLQRRLAEQGTSFEATLKAVLEGQAEMDARYTGFLDFADC
LSTRINRPCALHSSFHGLEGDVSAGMAACLRHSPE
>SMa0806 probable SyrB-like regulator
MVDESNAGPVAPAVVADAEVKAPTGKKRSSSRPQKAPPEPAQPKMPAAKR
RGYSEQERSEKLRLIETKVSEGNTLKDAIKSAGISEQTYYHWKGAAKSAA
REDIERTRPLSAGDEFAELVQLEEENQRLRKQLAEKLRTENTELRKRLGL
D
>SMa1907 Hypothetical protein
MKRRRLLFAWPRRSVPNFGSKYYGRCCLRASRPWPQGKEDVMSSIENNIS
SVDFEMIRSVLDDAGYDASVLVEDQCLFDTAALLVTKLFLSGVDSRSALA
AKLECQLGRAGTHRHMLSLSLWRYAI
>SMa2369 putative ABC transporter, permease
MNAVTDIFASAGLWAAVLRIATPLILGTLGALLCERSGVLNLGIEGIMTF
GAMIGWLAVYNGADLWTGILVAGLSGGIFGLLHAGLTVTLGLSQHVSGLG
VTLFASSFSYYVFRLLVPVAGTPPTIEPFQPIDVPALSSLPFLGPALFTQ
TPPTYVAILLALVLGYVLFRTPLGLAIRMTGENPHAAEAQGINPMAIRFG
SVIVGSALMGIGGAFLTLSAFNSFFPTMVQGRGWICIALVVFASWRPGRA
LVGALLFALFDGFQLRLQTRLSGVVPYQIFLMIPYLLSIAALALMARRAR
VPQALMQPYRRGER
>SMa0620 hypothetical protein
MSGTGKLQRWVGEVSGIEETWNPKWQRHLPAQAPFEWLAKGWRDLITYPM
LSLSYGVAVFVVSFLIIWLLFATGRDYFLFPAVAGFMIIAPLLATGLYLK
SSRLERSEPVSLGSMLRVRPVAGAQVFFTGLLLCMLMLLWMRAAVLVYAL
FFGVRPFPGLGHITQLLLTTPTGWAMLAVGIFIGALFAGFSFAISVFSIP
MLLDQRIDAFTAMGVSVALVWNNLRPMLVWGAIVLGMFLVSVATAMIGLV
VIFPLLGHATWHAYRAVR
>SMa0180 conserved hypothetical protein
MVELIAGLIDVFADVPLTGNPLAVVQDADGLTDDQMRRIAGEFNQAETTF
LMRSTRADWKLRSFTASGAEVFGAGHNALGAWLWLAENGDLGSLTAARTF
QQEIGRDVLPIELESVGGRIHGRMRQVPLRLSDPLDDVAPLADALGLDPR
DILPEPPARPADTGATHLMVRVLNVDSVDRALPVADKLLAVLEKTPAEGC
YIYALDADAPDTAYARFFNPSVGLWEDAATGTAAGPLAAYLAATGNLTNN
ELVIEQGTKMGRRSILRIRLAPLPELSGAGIVVVKGVIRL
>SMa0226 hypothetical protein
MRLGRRLETGLTPLQIRSSAIQGAFGGVAMSVSFPRAGFLALVMVVALAF
GVPFDAARGELLARVPAAREMAVCGDTLFVGTKGSSVYAVSLPGGRARRV
ASGFSNANGVACSRGRLFVASRSSITAFEIGRGGTLSGRRDIRRDLPNSG
AHSFRYIALGPDARLYVSLGSPCNICVPGGLQGTIVSMNQDGSDLRRVAW
GVRNSVGFDWRGGRMFFTDNGADRMGDDVPPDELNALRPGGFYGFPYFGG
QVPLTGFEDAMPPARQIPPVFNFQAHVAALGIHFFRSLGGDALVAQHGSW
NRSVPVGYQVVRVRFRGGRPVSAATFLRNVGRPVDVKEAPDGAILVSDDA
GGAIHIFRR
>SMa1386 putative oxidoreductase
MAVPAVRGHSRRGRYGEQGQGRKAQLRQVVSFFITRALVLLSTRFCRARY
DLSRDLLARRSDQVREQHSSPPVIVIGAGIVGAATASFLALAGTPVRLLD
ASTPASGATGAADGAVSVASKRPGPMMTMARAGAALYRELAEEGLFQGLF
HRRPTFLVATSDAEADVLSDHSEALAEVGAPVLWLTRDMAADRLAPLSRK
TVAVLEVEDEGHAIGYSIVSRLISAAGLKVERNSPVAALEYDGRSGRVSG
VRVGEAVIEASAVVVAAGGGAGGLIGLPDVSRPRRGQQLVTERAPTLNAA
LPGSLLSCSYLLSKKQGGDADQRGYGVVIDPLETGQFLIGSTREEGRNIP
ENDIDAVAHLAASAGDMVPALGRLRILRCFAGIRTAICDGLPMVGRMPGI
DNLFVAAGFEGDGICLGPLTGRIVADLVRGEEPEIDVSPFDPGRFAGRSI
AA
>SMa2163 hypothetical protein
MVTLNGGAAYHAFRTDDGYRYEGRDAKGMVSFLPAGCRRELILRDVAWEW
GAIAIDPAISSPRLANIKSFLVGRDDFIYGMTAQMGGVFHRDGSLDATYG
STMALALSEYLSNRIAGQRQPDASARYTLTRRQLNDTYERIDAMLAAPIA
IADLSIPLGISEGHFFRAFRGATGETPLQAISNRRMDHAARLLSETDLQI
IEVAAHCGIESPSHFARLFRTRKGHSPSEWRKRSDAGFLRDR
>SMa0683 hypothetical protein
MRQAAAPGTGVWFVPYLEGDMCNEQTFLRASIAATVVVAAFGIILGLLSG
SFSITFDGVYSLADAGMTVLALWVSRLIAVSATGDALSARMRDRFTMGFW
HLEPIVLLLNGTLLMAIAVYALINALTSVLKGGHQLQFGFAIAYAAVTVF
VCAMMAVIGARANRGLRSNFIALDVKAWIMSGGIASALLVAFIIGHAVQE
TALHWMTLYVDPVVLAFVCIVIIPLPIGTVKSALADILLITPNELRARVE
RIADETVRKQGFLSYRAYVARVGRAKQIELHFIVPSNFPPQPVESWDQIR
DEIGIAIGDEGHNRWLTVAFTADERWAE
>SMa1797 Hypothetical protein
MSPKSRAHRQAYRPAQGGIMADAQKMKKLIEDCERDIAACLHPESGIPET
HLLQQLLSRLDGQQAKEALGDDWQGRWHDPEGDDDMSPSHPLWWTQPEFF
GTIWRLKI
>SMa1017 hypothetical protein
MLRWRGDKEMAKANDYPGMPRWVKISGLIVAVLISLAAMIVVFDIGGPHG
PGRHMSPNSEMPPAGDRP
>SMa1363 putative ABC transporter, permease protein
MSRARSEERSGWLFVLPFAISMMLFFAYAIVRTAYYSFTDFDLFKAPSFV
GMSNYTALVSDELFLLALRNTIGFSLIVTTTQTVLALGLAVLVNHAVRAR
GLIRTVFYLPSIMSSAAMTLIFLWLFQRNGFMTAIVSVVLAYRQHILLFL
VGMAALQGALVLNARRSYEGISIFDPFFLLVAAVGALALAASCALAGLLS
VFDNNLLISWLNTQRHFLFMPVTLWSVALMNVFTTVPTLMLLFLAGLQSI
PSSLYDAAEVDGANAFQRFRHITVPALRPVTFAVVTMGVIGTLQMFDQVA
ILGDAAPLASRVTLAYYVYENAFPAGASSRIGMASAAALVLGILTIAAVY
VQKSVGVNEKGE
>SMa2008 Putative transcriptional regulator
MSQQHMSPPKRRGRPPNQLAQATILKAAHDILTEDGFGRLTVEAVAARSG
VGKPTIYRHWANASELAMAALMSGDPGIFAEGGTSLRSALAGQMRSLIQA
FATTRGRQIAMTLAAADPESEFTKAFRNQVILSSREAGRAMLLEAAARGE
ITLQQDLEVLLDMIYGPVFYRLLVGHRPLDTGFADSIVAIALEAVAASPQ
S
>SMa1696 hypothetical protein
MEAAKVSKPDVSTVYAVDLSQRPVPQGDAAREKAALLQLARRMHDAPGEM
LPRFVELAMELTGGISAGISLLEQTEPSPVFRWHHLKGILSPFNGATTPR
DFSPCGITLERSATTLTIHPERVYDWIPPGLSLPEVLLVPLYIGRTEPLG
TLWIVADRIGHFHCGHGATMQELAGFIGIALKMVRSEQELQQALEQQELL
TREMSHRLKNLFTIVDGMIRISARSTDNKDDLVALLSGRLHALAAAHSLV
RPSFSDVQGAASNLAELLSIVTEPHEPPATGGRRRLSLRGPSVLCGEQSV
NGLALVFHELATNAAKYGALCSENGRVDVLWQIDGDDLSITWSEDRGTQI
SIPPASKGFGSTLVEATVTHQFGGTLSYDWRPVGLSVNIVMPLSILAR
>SMa2049 putative LacI-family transcription regulator
MRVHVYEQRTRAWNRPWPRQPTRGCRKVSKPNYRDIARHAGVGTATVERV
LNGRGGVRPELVEKVVVAARALNYPRTLPDAHRGLLRIEVLLVRPETTFY
RRLSKAFERIADTLDPLVVVHRSFAEEMKPEEIARRILSADLTRAGLILA
VPSSPVISAAVEAVVERGLPVVHVVTRASVDKGEFVGIDNYAAGRTAAHF
IARMARAEGPAVGLCHPIYQVHRDRIRGFSDYFRDKPGPIAFDWLGFTRD
EEHYSAETLSTALEIYPNLVGLYNAGGANSALIDVLRRHRRGRDVLFVGH
ELTEYTRAALREGIMDVVLDQAPEAQAQRALDLILRRIGLTAIEPDYAPI
RFITITPEGL
>SMa0736 hypothetical protein
MKGLDDADIVVAPPKDLLDSTMSAADFAQLFGVYTQGGMSWETFYERGQA
DGIFRLSGTRKTNTPSSILRAQRTSGRQPSSDLEHSWPFV
>SMa2055 hypothetical protein
MKTSFVAAVLAAALMAGTASAETIGVSMQSFDNNFQTLLREGLDARASKV
SGVNLQIEDAQADISNSAARWTTSSPRGWTQLS
>SMa1740 conserved hypothetical protein
MNAVHSFKLSGVAVPGSAADMLDEICEHFVEHAKVERRDDLAVLQSELGV
ARISIENGRLLIELDCPTREKLHMSRTILAEHLFYFAEGQPFELTWSEPT
SLSVLPNLHEVTVVSAHDVTPHMRRVIFSCVDVTPFVGSDMHVRLLVPPK
GKPPVWPGYREDGRIAWPEGENELLVRVYTIRAVDLDRSELCIDFLQHPA
PGVPTPGADFARDAQPGDVAALLGPGAGGLPAERSILLIGDESALPAIAR
IAAEAPAETHIRAIIEVEDKAEEQPLLTDGVLDVRWLHRGSYPGDAADIL
VSEAKAAISAVDDETFVWVACERTDIRAIRTFLKARQHDRRKMYVAWYWE
RDVKIA
>SMa2317 putative aminoglycoside adenylyltransferase (C-term)
MLTLARMWRTSTTGDFITKDAAATWAANQMPDQEAGTLIHAREAYLGKVR
DDWGNRQSASERTATFLRQRVLELL
>SMa0964 hypothetical protein
MSKIFDNTAVGEQSPRKDGFWQKQQQEEVVEAAIGLYGSEAATAAAYCAL
EAWTERRDSDYGFWFEVFLRLRDRLT
>SMa1367 Putative dehdyrogenase
MRLFLHTGTNAEGLRTVAKEVSARGAEVATELGDLSDPTVPGHLVQAARA
AFGGLDQIVSNAGRAQRSSFGQLTDADLQTAFDMMPMAFFRLVDAALPDL
RTSMQGRVVAVSSFVAHGFGTNGMHFPASGAAKAALEALAKSLAAQLAPV
GVTVNCVAPGFTRKDTGGHAATSSAAMESARAVTPNGRLGEPIDVAELVA
FLLSPGARHITGQVMHVDGGLLLA
>SMa1358 Hypothetical protein
MPLIAETCHQDTDASAREDRCSQVDIARCAAKVCSERIFVRQAHILQEKL
MFDHVKFGVSDYAASRAFFLKALEPLGVAVVSEGLPAYGVELSPKGKASL
CLYQTEEKPAHLHLAFTADNRQQVEAFYRAALEAGGKDNGAPGLRPHYHA
NYYAAFVIGPDGHNIEVVCHEAEA
>SMa0921 hypothetical protein
MRDILILENEMRSLRHGLRHEKVEGIAVIKRKSRKSDKMRICNIQPVESP
ARQYRKNLFYIGIKFADAQLHQFPRRMRH
>SMa1471 Hypothetical protein
MKKPLRIDRAKAAAIFIDLQEEHRRDERYLVEGYDTVLGNAARLQSAARR
SGIPVFHCAYIVDLADNLRPFHPIGPDGRSAFSDKDDPLTAICPEVAPLP
SERLLVKNEASAFGKSPLIGELRDAGVEWLVVAGVWTEACVDATVKDAIN
LGLHVLLVKDGCGSGGLAMHQTGILNLANRLYGGAVVGTEAACALLEGDT
VEAWRVEGSVPLRYTFENAAKLYGEL
>SMa0360 putative transposase
MTAADVALLATEVETSRATAYRLIKLFRAGGTVMSLVDRKRGRPEGHRVL
DDKREEIIRTTINRHYLTRNRPTVSQLVRDVQTNCISAGLKRPHRRTIKA
RLEEIEPQRRAKRRGETEIVKQTQAVPGVFAASRPLQVVQVDHTKADIFV
VDEETRQPIGRPWLTLAMDVCSRMVTGFYLTMDAPSRLSTRGVAP
>SMa1445 Conserved hypothetical protein
MEAEYYLRGTTGKGIDDSDAGSDAGPVTGGKQVSFDTPAIGEFMQEVAEN
ELAHVRFYRKTLADQAVPRPAIDFDAGFAAVAKSAGLGEDFDPFGNETNF
VLGGMLFEDVGVTAYAGAATVLKNKDFLAAAAGILAVEAYHMGMARSTLY
RKGEEAWKAAQAVSDARDKIDGPEDKDQGLQVDGKANIVPSTPDAIAFTR
TPQEVLRIVYISDKEGASKGGFYPNGMNGKIKST
>SMa2043 TRm2011-2a transposase
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGHRHQHRDRLGAAVSRDGQ
RRTRPDGWAQAPQAFRSAPGLAALPLPRARLHAARTCRRVERARPEGGFI
GAVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>SMa0223 putative transcriptional regulator
MSSRPNKTQPPAKRVRLSREQRRRQLLDLAWHLVREEGSDALTLPRLSQE
AAVAKPVVYDHFGTRNGLLIALYRDFDARQTEMIDAALAGSGPTLEDRAR
VIATSYVDCVLAQGREMPGVLAALAGSPELEAVKRDYQLAFIEKCRRALA
AFVGRRGIEPAGFWAMLGAANALSDAASTGEITAEQAQDELFETIVAMIE
RSHP
>SMa1623 conserved hypothetical protein
MRVEILGQERRRRWGDAKKLDIVMSIGLDGATVTEVAHRHDVSRQQIYAW
RHELKKKGLLPSTADTVFLPVDMAAMHGAPLVREDVARVSAMIELRLNDG
RSLRFDSGVDPAVLTRLIRAVEAA
>SMa1860 Putative ABC transporter, periplasmic solute-binding protein
MNDKITNWTRSDDAMVETAIRRGATRRELLQMMLAGGAALSAGSLVLGRA
GNAVAATPVSGGTLRAAGWSASTADTLDPAKASLSTDYVRCCSFYNRLAF
LDKGGTPQMELAEAIETKDAKTWTVKLRKGVTFHDGKPLAADDVIFSLKR
HLDPSVGSKVAKIAAQMTGFKAVDKQTVEITLASPNADLPTILSMHHFMI
VADGTTDFSKGNGTGAFVREVFEPGVRSVGIKNKNYWKSGPNVDSFEYFA
ISDDSSRVNALLSGDIHLAATINPRSMRLVESQGAGFVLSKTTSGNYTNL
NMRLDMEPGSKRDFVEGMKYLVNREQIVKSALRGLGEVGNDQPVSPANFY
HNPDLKPRAFDPEKAKFHFEKAGMLGQSIPVVASDAANSAVDMAMIIQAS
AAEIGLKLDVQRVPADGYWDNYWLKAPVHFGNINPRPTPDILFSLLYSSE
APWNESQYKSEKFDKMLIEARGSLDQERRKAIYNEMQVMVASEAGTIIPA
YISNVDAITAKLKGLEANPLGGQMGYAFAEYVWLEA
>SMa0277 hypothetical protein
MEKHRIYSVSVASVYPHYVAKAEKKGRTKAEVDEIICWLTGYGPQALDDQ
LAKNTSLENFFAQAPRMNPSRSLITGVICGVRVEEIQDTTMREIRYLDKL
IDELAKGKAMDKILRK
>SMa2241 hypothetical protein
MVRKRYKWEDGAQLDEHSKRKHKILHEYVLQYIVVRCKHPQQPRFRFAIV
DGFAGAGRYAGGAPGSPLIFVDRVKAAAEEINLARAIEGLPFIDLECHLI
LNDADPVAASLLKQNIAPLLAEVKETSKNLRIQTSFLNEVFEDAYPKIKT
ILTSAKITNALFNLDQCGHSYVDRTTLVDIIQTYQSEIFYTFSIQSFLAF
LSRTNPELLLSQLAFLNLPMRDIEQLNFPMSNERWLGTAERLVFDAFQVC
APFVSPFSIHNPDGWRYWFIHFAKSYRARQVYNNVLHDNSSAQAHFGRAG
LNMLSYNSRHTDGHLYLFDRAGRTTAVDQLRDDIPRLLTDRGHAINVLDF
YEGIYNLTPAHSADILSSLVGHPDLKVTTKSGGERRKASTIKVTDTISLN
PQRTFHQILRPIENR
>SMa1101 hypothetical protein
MCAKHPPGLDEGAATAIVVSPSGSRAGRQALESMTARAVFTDPLPVLNAY
HWISFCRKDVTFRPVGSNSFWKTASLAFESLPLFMRINDTAAEEGSNGLI
RGESWRPFDSSSVRRLRSAPVASSLCFRVHASDAAGRVD
>SMa2309 putative ABC transporter, permease
MRAARLYMTVVGAILIIWLAPLFAVILTSFRSMADVMSGNLWGWPTEIAV
VENYTAVFTQTPMAGYFLNSLVITIPSVIGVLSLSTLAGFVLARYRFPGN
MVIFALFVGGNFLPHQIMMIPVRDLMVRLNLYDTTTALIIFHVAFQTGFA
TLFMRNFIAALPDELFQAARAEGATPFQTLIHVVVPLVRPAWAALAILLF
TFIWNDYFWAVVLTVSDNVKPVTAGLANLRGEWVSAWNLISAGTIIVAVP
PVVMFFLMQKHFIAGLTMGAVKG
>SMa0664 conserved hypothetical protein
MTIAGRGSVRAGSPTALPTNTEILVQLDRIRLSAEFDVPDRARKFLAYIV
GEAIAGRADRIKAYSIATEVFGRDSSFDAQTDPVVRIEAGRIRRALERYY
FVAGSNDPIVIKIPKGGYAPAFEKRGGAPYQLSSGQAANVQSRSMSLEQT
ALWVSVATVGLLTCGLLANAFFGSAATTIESLTKPGGTRPNIPKLMVMPF
EDLSQTPQSAMITRGLTDEVISNIAKFKEIVVVAGPAAPNPHSAEREYPA
FALEGRVRLDGDKLRLGIRLVQHSDGSVVWANTYDEVLQPRKIIELQQNA
AAAVASAIAQPYGIVFQANATHFMRSVPDDWQAYACTLAYYGYRGDLNPQ
THASVQECLQHATTQFPDYATAWALLSLTYVDELRFRYRLNRSTTVSLSH
AIEAAARAVELDPQNVRALQAEMLTLFFRGEVNAALTVGARAYAINPNDT
ELSGEYGFRLALSGQWRSGCDLVSKTVASNPGPVGYFEAALAVCCYIEHD
YVAAERWARSADLHANPVYHVILLAILGKLGKMDLARAEREWLEINVPGF
LENARNEVALRIHRPEDQKHFIEGLRQAGVPIPGK
>SMa0196 putative gluconolactonase precursor
MAEASIYEIHDPRFRQMIVTSAGLDELYSGCRWAEGPVWFNDANQLLWSD
IPNQRILRWTPEGGVSVYRQPSNFTNGHTRDRRGRLISCEHGTRRVTRTE
VDGSITVLADRFEGRRLNSPNDVVVKSDGTIWFTDPTYGIMSDYEGYHAD
PEQPTRNVYRLDPETGELSAVVTDFTQPNGLAFSPDEKILYVADSSASHD
DRLPRHIRAFDLTDGGRLANGRVFCVIDKGIPDGIRTDANGNLWSSAGDG
VHCFDTAGKLIGKIRVPQTVANLTFGGPRRNRLFIAATRSLYSVYVAVTG
SQVP
>SMa2353 probable oxidoreductase
MMRFDTPATTNPIDQGKVIGKPITRIDGPLKTTGKAIYAYEWHDPNTRYA
YGYIVGSAIAKGRIRSMDVAAARNAPGVIAVVTSDGVGELKKGKYNTAKL
FGGTEIQHYHQAIAVVVAETFEEARAAAALVKVDYAEEKGAFDLAAAKDS
AVKPEGGGDSGAGDFDAAFKAAPVKLDQVYTTPDQSHAMMEPHASIAAWN
GDDLTVWTSSQMIDWWRTDLATTLGIDKDKVHLMSPFIGGGFGVKLFLRA
DAVLAALAAREAKRPVKVALPRPFLMNNTTHRPATIQRIRIGAGRDGKIT
AIAHESWSGDLPGGGPEVAVQQTRLLYAGENRMTAMRLATLDLPEGNAMR
APGEAPGMMALEIAIDEMAESLALDPVEFRIINDTQVDPENPERPFSHRN
LVGCLRTGAERFGWRERSKQPGARREGNWLIGMGVAAAFRNNLVLNSGAR
VRLDREGIVTVETDMTDIGTGSYTIIAQTAAEMLGVPIEKVAVSLGDSRF
PVSSGSGGQFGGNCSTAGVYAACAKLREAVAQKLGFNSADDPIFAEGEVR
SGDRRMPLAQAAGDEGLVAEDQIEFGDLTKTHQQSTFGAHFVEVAVDVAT
GETRIRRMLAVCAAGRILNPITARSQVIGAMTMGVGGALSEELVVDKERG
FFVNHDLAAYEVPVHADIPHQEVVFLDETDPMSSPMKAKGIAELGICGVA
AAVANAIYNATAIRVREYPITLDKLINELPEIS
>SMa0113 putative sensory transduction histidine kinase
MKSRESPVYMSNVEEFPLTGRVLTEYEALITSAPTALDAIPGAVYVCDHE
GWLVGYNAEAAELWGRQPSLSAPRERFCGSQRLFLSDGSPMDHQECPMAE
AIKTGVSTRNAEVIIERPDGSRIFALVNIRPLRDHRGVIQGAINCFQDIS
QQKRIEAEVNCKSKDLEDFFENSAIGLHIVSAAGIILRANKAELELLGYP
ADEYVGRHIAEFHADAPVIGDILDRLSCGEKLDRYPARLRARDGSIKHVL
ITSNSRFEDGKFVNTRCFTTDITSLHETENAWRETEERLAATYQAATIGI
AESDADGRLLRVNDAFCTMLGRSREQLLNMTFLDYTHEDDRDEEARCYAR
QVSGETDTYAIRKRAVKADGTVIYLDVYSSTVRDRTGRFRYGVRVLLDVT
EAKRMDDRRRESEQHMRDLLEALPAAVYTTDAEGRITFFNKAAVEMAGRT
PQIGDKWCVTWRLYRPDGTYLPHDQCPMAVTLKEDRPVRGEQAVAERPDG
TRVPFIPYPTPLHDVAGKLVGAVNMLVDISDREKAAEYAERLASIVRFSD
DAIVSKDTQGIIQTWNKGAERLFGYSAEEVIGKPINILIPPDRQGEEPGI
LERIRRDEHIDHYETVRIRKDGSLIDVSLTVSPLKDARGRVVGASKIARD
ITERRRSEEHRKLLVNELNHRVKNTLATVQSLAAQTFRGVNASEDFGRFQ
SRLVALARAHDVLTRESWQGADLGEVLHATINPICVEPQQRVQASGPPLR
LRPKMALALSMAFHELCTNAAKYGALTNDGGLIKVNWHVSNIESVSHLHL
QWEEIGGPSVMVPARTGFGTRLLERALARELGGKVDLVFAPSGVRFHIEA
PLT
>SMa0726 hypothetical protein
MDPEHRRKHIHTALFNALRDKAKEQGNVVSITCVTHATRRKPHSWPEQQT
VTNGHWVRLSPRRLRLGHQAPEANRTHFMLGTSR
>SMa1450 Probable thiolase
MNKQDPVVIVGQARTPLGSFQGELKDLSAADLGAAAIVDALKRAGLAPDA
VDEVMFGCVLTAGQGQAPARQAALGAGLPPGVGATTVNKMCGSGMKAAML
AHDLIKAESASIVVAGGMESMTNAPYLLDRARQGYRIGHQKVLDHMFLDG
LEDAYDKGRLMGSFAEDCAEAYQFTRSAQDEYAIASLEKAQKASADGSFA
EEIVPLSIASGKGERTVNLDEQPQKARLDKIPLLKPAFRDGGTITAANAS
SISDGAAALVLMRRSAADKQGIGPLAVICGHATHADAPSLFPTAPIGAIK
ALCRRIGWDIGEVDLFEINEAFAVVPMAAMRELGLDAEKVNVHGGACALG
HPIGASGARVIVTLVNALRRRGLRRGIASVCIGGGEATAVAVEISG
>SMa0347 putative LysR-type regulator
MNLRSLDLNLLVVLDALLDEAHVSRAADRLGLSQPAASAALQRCRHLFRD
ELLERGRGTMRLTSRAEALRAPLKSLLASMMELIGPPEIPLTEIRQVLRI
TMADYPALFVIAQLQRELQPSAPGIDLVIQSWHGGDAARSALVDGTADLA
VSVFTAPDDDLHREELLTEHYVVVMRAGHPAAEAFDLNAWLSYPHILVSG
RGDTTTPLDAELSRFGLSRRVGLVVPNFQMVPSLLQDSDMIAMLPSRATP
SGGSLVSFLPPIPVPGFQLHMAWHRRRAKDAALQHVARILGALLR
>SMa1564 hypothetical protein
MATSLLFPVFVFLLASTSIGGMMVAAFYPRVSKASAYRQRFERISARAED
KRSEPAETDGRDRRRSVEKTLREIEEKRQANARKGKVTLSARLRQSGLHW
SRKAYFLVCAGATLATWVVMLLLLGLGPLVSVGFAIAGGLLLPHLYVNMK
RNARFTRFAAEFPNAVDVIVRGLKAGLPMPDCLRVIAMEAQEPVKGEFLA
IVQDQTLGIPVDEAVKRMSERMPLAEANFFAIVVAIQSRTGGSLSEALGN
LSKVLRERKKMKGKIKAMSSEAKSSAGIIGALPFLVAGAVYFMSPDYMAL
LFTTMIGKVVVVGCGLWMGIGILVMRKMINFDF
>SMa0478 probable NAD-dependent formate dehdyrogenase
MEMAKVACVLYDDPVDGYPTAYARDGLPTLERYPGGQTLPTPKAIDFEPG
ALLGSVSGELGLRKFLEGQGHTLVVTSDKDGPDSVFERELVDAEIVISQP
FWPAYLTAERIVKAARLKLAITAGIGSDHVDLQAAIDRGITVAEVTYCNS
ISVSEHVVMMILSLARNYIPSYQWVVKGGWNVADCVARSYDIEGMDIGTV
GAGRIGTAVLRRLKPFDVKLHYTDRHRLPDEVAKELGVTFHQTAAEMVPV
CDVVTINAPLHPETENLFNEAMIGKMKRGAYLVNTARGKICNRDAVARAL
ESGQLAGYAGDVWFPQPAPKDHPWRSMPHHGMTPHISGSSLSAQARYAAG
TREILECWFEGRPIREEYLIVSGGKLAGAGAHSYSAGDATRGSEEAAHFK
T
>SMa0414 conserved hypothetical protein
MADNLSKDRAKRDFKKTREPSGEAQVKPSNRRRFVIQKHDATRLHYDLRL
ELDGVFKSWAVTKGPSLDPSDKRLAVEVEDHPLDYGDFEGTVPKGQYGGG
TVMLWDRGYWEPEGRKSPEEALKKGDFKFTLHGKRLHGSFVLVRMRNDRD
GGKRTNWLLIKHRDDYSVDENGAAILEENATSVASGRSMEQIAEGTGRKP
RPFMMANADVEADAVWDSKHGLAAEERKKGSRRDVATSTAADLPDFIEPQ
LCETLARPPASDDWLHEIKFDGYRIQMRIADGKVTLKTRKGLDWTAKYPE
IADAASELPDCIIDGEICALDDNGAPDFAALQAALSEGKTGNLVYFAFDL
LFDGGEDLRSMRLVERKKRLEDFLAAGSDDPRIRYVDHFESGGDAVLRSA
CKLSLEGIVSKQMDAPYQSGRTDTWAKSKCRAGHEVVIGGYATTNGKFRS
LLVGVHRGDHFVYVGRVGTGFGAAKVERFFPKLKALEASKSPFTGIGAPK
KEKEVTWLPDGRALPACRVRSSTDVVRFRRSGRSPRALRSDRSEGRASCP
RRLSPV
>SMa2239 conserved hypothetical protein
MAETQIEWTDATWNPVAGCTIMSAGCTNCYAMAMAKRLEAMHVDKYVGLT
RTSGARTVWTGVVREDEAALLIPHTWKKPRKIFVNSMSDLFHDSVTDEFI
LKVWQVMRDTPHHNYQILTKRPDRMATVVRQVIGEVLPNVWLGTSIENGA
VAERVDHLRQVPAAIRFISFEPLIGSVGAIDLTNIHWAIVGGESGKSARP
IQEEWIDEIYGRCLDHETAFFFKQWGTWGKDNKRRSKKANGREYKGRTWD
QMPTHPAVL
>SMa1580 hypothetical protein
MSRSSKGAVLLLAMSALVRPAAADEQKTAIIPVQQRSTGAKGVERLVVDF
AKTVALPRPASTVIIGNTGIAQASLSDDRTVILTGKTPGSTNLIVIDSDG
AEVANLVLDVVASSGRLVTVHQGARRATFTCARRCDPVLLVGDDADHFNA
TASQIAARNGFSAPSPDNQQ
>SMa1476 Putative AraC-family transcription regulator
MRDTQEIRFAVLMFPNFPLMAFSSVIEPLRAANTLSGRRCYSWLTVAAGE
KISASNGIGIEPDFHVRNAPEVDRIVVCSGGDAEHLVADEEMAWIRKNLR
AGAQLGAVADAAFFLARKGLLDGHSCTLHWTSQPAFKEAFPHLDMRSDLY
VIDRRRFTSVGGIGSLDMMLDMIGRDYGAELADGVAGWFMHSPLRPDADR
RKLTLRIRSGIADDLVLSAVAMMEDAIEDVLRIEDLASRLNVSSDKLERA
FKAELGVSPNSYYRNLRLGHAADMLTHSNLKVNEVAVACGFVNAANFSRA
FKEQFGYVPHSVRRRVSRAGEAPRAVAGLK
>SMa0168 methyltransferase-like protein
MAMASDVLARSFGKEAFGLDPQNYHTARPAYPELVWDALRNRAGLRRGIS
ILEIGAGTGLATERLLEDRPHRLLAVEPDRRLARFLRGRLDKEELEVVET
PFEKLKVPEKSFDLVVSATAFHWIDAAPALRRIHRLLRAGGTVALFWNVF
GDGVRPDPFHRATAHLFSGHRTSPSGGGTTKTPYGLNVGARLGELAEAGF
TADEPELIDWTLALDPPAVRRLYATYSNVTALPADERERLLSGLEKIAET
EFAGVVTRNMTTSVYTGRRE
>SMa0779 conserved hypothetical protein with localized similarity
MACRQNEALSAPRMERVRNLKLYAGLSVSTNCSPGLERRRKWKLRFVSSN
VGCWAGCAIAHSIVWPRSMRRLANCSMISMISAFCAVSAPPAANCSRSLI
VRPLRPLPVERYVFAEWRIRRAGLDYHVEIERHYYSVPYRFAREQVEARI
TANTIEIFHKGERIAAHRRSSGNGKHTTIPDHMPSAHRRFADWTIERIER
EASAMGPDVALLCERILADRPHPEQGFRACLGIIRLNKSFGRDRVNAACG
RALEIGARTYGSVRSILDNHLDRTAASNGAAPHEPIHHANIRGPRYYH
>SMa1867 Putative AsnC (lrp)-family transcriptional activator
MKLDRIDIKILYELQKNGRITNVELAELVNLSPSPCLMRVKKLQSEGYID
GYSAQINVGKLGQTLTVFTEITLKNHRQIDFARFLAAIEKVDQVIECHLV
SGGYDYLLKFVTAGINEYQTIMERLTDMDVGIDKYFSFVVLKSPIVKAHM
PLTSLFRV
>SMa0400 putative zinc-binding
MKAVRLYDIRDLRVEEVAELAAPPPGFVNLEVRAAGICGSDLHNYRTGQW
ISRRPSTAGHEFCGRVTAIGEGVSHLVRGDVVSADSRMWCGTCPACASGR
SNVCETLGFLGEVCDGGFAEAVQLPSRLVFRHDPKLSPHVAAMAEPLAVA
LHAVRRLAVPDGAPVLVMGCGTIGGLSALLLSRLHQGPLLLTDLNADKAA
LVAEVTGGVVVALDGAAIEEALPGTRLRHALDATGSIQAIARALDILSGG
GALALVGIGHGKLDLDPNILVEREISLVGCHAFAGELPEAIELLADLAPA
LQRFIEVLPTLDDVPEAYERLLRGESNALKTIIEVAG
>SMa2379 catalase/peroxidase
MDQKSDSAGKCPVAHTAPRGRSNRDWWPDQLDVQVLHRHSGLSDPLGNTF
NYAEEFKKLDLDALKRDLRALMTDSQDWWPADFGHYGGLFIRMAWHSAGT
YRITDGRGGAGQGQQRFAPLNSWPDNANLDKARRLLWPIKQKYGNRISWA
DLLILTGNVALESMGFKTFGFAGGRVDVWEPEELFWGPEGTWLGDERYSG
ERQLSEPLAAVQMGLIYVNPEGPNGNPDPVAAARDIRETFARMAMNDEET
VALIAGGHTFGKTHGAGDPSFIGADPEGGAIEDQGLGWKSTFGTGVGKDA
ITGGPEVTWSQTPTRWSNHFFENLFNHEWELTKSPAGAYQWKAKNAEATI
PDAYDPSRKHVPTRLTTDLSLRFDPAYEKISRRFLENPDEFADAFARAWF
KLTHRDMGPKVRYLGPEVPAEDLIWQDVIPAVDHRLVDETDIAGLKAKII
ASGLSVQELVSTAWASASTFRGSDKRGGANGARIRLAPQKDWEVNRPAQL
ARVLSVLEGIQRDFNAAQTDGKKISLADLIVLAGGAAVEKAAKAGGHDIT
VPFTPGRMDASEAQTDAASFAALEPRADGFRNYVSTTRQQFMKPEEALVD
RAQLLTLTAPEMTVLVGGLRVLKAGEPKHGVFTSRPEALTNDFFVNLLDM
GTQWSPIEGEEGVYEGRDRRTGAARWTGTRVDLIFGSHSQLRAFAEVYAQ
SDAREKFVKDFVAAWTKVMNADRFDLV
>SMa0643 hypothetical transposase, partial match
MNPGKLNNWIDDAAISGLVAIARFARVLHRDLDAVCSAIELLWSNGQAEG
QINPLKTIKRAMYGRAGPELLRARMLPLDQNYRHKK
>SMa2227 Putative histidine utilization repressor protein
MSQQQIISERNRRFTHWEPGLETRLEQRVTSILLLRRVRGQVHTKRDTPQ
TISLSRPNMPESDPHLYEKIKETIDRRVEAEEWPAEFQVPAEVELAEEFG
ASPLTVRRALRELQAEGVLIRIQGRGTFVVGRRMQCAVFNISDMAEEIAL
SGGAHTSKLINLGVIARDSSESNMLVLGPDGVVFHVRLLHLEDGTPIQLE
DRYVNATEAPSFLEQDFTKITPHAYLLRETTVTSVDNTIRAIRPDEEACR
LLQIDSSQPCLLLDRSTWREGVPVTRSRFIYPGDRYRLRSSHEARQSRIA
TTRGPAASKSLKKIK
>SMa1579 hypothetical protein
MQRFRVTICLRLRETWMSKRIDAIGEASRSPRRQRLARRFLTAEDGAVAV
IAAVAFPVLVGAMGLGVETGYWYLEKRKLQHAADVSAYAAAVRHRAGDQQ
SALETAARRVAGGSGFSPGGLTVSTAPGSAGGSNKVTVELTETHPRMFSS
VFGTGTITMKARAVAQVTGGSKACVLALSNSASGAVTVTGSTEVLLSGCS
VVSNSNAADAFLMKNGSALMSTDCVYTVGEAVTTTGLTLTGCSKPVQQVP
PTPDPFASVAEPDKLHIQQLPCRDLTYVSNSTYVFDRLASGFEAIRFCGG
VDIKGTITLKPGLYIIDGGELTITAGAKLTGEGVTFFFTNSAAANLLGNA
DIDLSAPTGGPFAGLLFFSSRQSAGVLHKITGNSESTLVGNLYAPTGKID
FTGNSTVSGGCTQIIADQVTFTGNSTMETCASPTEEILVGRTVSLIE
>SMa1974 Hypothetical protein
MDATHFVHHPRTSAMRQRLAALIGAPRSLHTMFYFPFSDRENIRFDPIDE
PMGAAGDMAWYSMRAVVEYLAPDALEDIKVFAEWDPHSGAVIRATGVLQF
NAGKTATFDVGYTAGAAVMELTDAGTSGVLTQDDFVLDWQSGFGFDNPDI
QTGFIHRSGLATRNDFAFIETPSERSQHALMIDNFADLIRNGDAAEREIW
LSATEKTQVFLDAIWQEIQTQTRASPAMTPQVNMREIP
>SMa1005 hypothetical protein
MWPGIQFRNNPEPSVMKWGFILVTLYLGPVGLLLYVLADKERVPGTHEEF
IKPLWKQGVGSTIHCVAGDATGIIVAAVVTAVLGLPMWLDIIEYAAGFAL
GLFIFQALFMKNMMGGSYWENVRKSFMPEFISMNAMMAGMAPVLAILMMG
RDMRAMWPSEMLFWGVMALGVGVGFLAAYPFNVWLVAKGMKHGLMTDRSE
HSSATPAATAMPPKLLTPQRIASGENTTPSTGNTRRRSPRRERRSLPWRR
SRRSSSLPGSRIPSPR
>SMa0890 hypothetical protein
MVIARSGSKRWISAADTAATKLGLRVGMPAAKAQALVQGLVMIDADTAAD
LAALERLALWAFSQYSPVVAMDPPDGIVMDTEGADHLQGGEDLMLSSLVN
RFRGRGLAARAAIADTWGAAHALARTTDRETVIVGRGDAARAANRLPLSS
LRLPAETVQSLRTLGFTTVGDLAATPRAPLTLRFGLEVGRRLDQMFGHIA
EPIDPIRPAELVEVQKSFAEPIGAPETIEKYVSRLVRQLCLELERRGLGV
RRADLIVHRVDNTIQALRAGTAKPVRDIAWLTKLFRDRIDRIEPGFGIEK
LGLAAILAEPLIEAQSASSLIEEQVTDVTPLIDVLGNRGGQRIFRVAPVA
SDVPERSVRRIAPIADEDGATWPLNWPRPPRLLARPELIEVIALLPDHPP
VSFTWRSKRRRVKRADGPERIFGEWWKHSSEWVAVRDYFVVEDDRGERFW
IFRSGDGVDAETGSHKWFMHGMFA
>SMa0339 putative
MSSLFSNKVVTVTGAGSGIGRAIALGLARDGATVHLADRDADGLTQTAEL
IRAEDGRAFTTELDVASELQVVGWIEQIGSTSGRLDAAFNNAGITGPAKR
IEDYPLEDFQRVIAVNLQSVFLGMKYQIPLIKRNGGGSIVNTASIAALTG
PGGMSAYAASKHGVQGLTRVVAMENAAHGIRVNAIAPGWTETPMVAANSQ
QNPAFAALAQNAIPAKRGGKPEEIAAAAIWLASDAASYVTGHMLTVDGGM
TIGGFEL
>SMa1488 Putative oxidoreductase
MNDMTPVTSEITASEIGKPFKLPRRGFLGASLGALVLGVTLPAGRARAQA
AAAAITPGTRISAFLEILPNETVLFRSAFIEGGQGIFTAMAQIVGEELDV
DPMQFVVEGAPPGPDYLLTGGGRFTGGSMSVRMSYDAMRKLGASARHMLI
QAAAVRLRVPVSELSTEPGRVVHGASGRTLPYGEIADAAAGLPLPTNVVL
RDRADFRWIGKPVARLDVRDKSTGKARYAIDLKVDRMLHAAVQHSPRLGG
EPGALQNEADVRGMPGVHSIHSLPGTVAVVADSWWRARMAAEALQVTWTE
PTRGTAHVMPADFSTEAHMAMLKATPGEGVAYETVGNAATALGDAARVVE
ATYDAPYLVHGQLEPPSALARWNDDGSLDLWVPNQAPEMFQAEAAKVAGI
APEKVTIHSPMLGGFFGRHFLYQTANPFPQAILLAKAVARPVKLIWSREE
EFLRDTLRPMGAVRFRAGLDAEGLPVALEAVAVGEGPTGRWFGRQPDKVD
SSSVEGIAGKVYAIPNRHIGQVHVDDPAIIGFWRSVGHSMNDFFYETFFD
EMADAGQQDPYELRRRLLADSPRHRTLLEAVAELSGGWRRGPFIADDGTR
RARGVAMASPFGSEVATIAEVSLRSGEVVVHDVWVAIDPGSIVNPAIIEA
QVNSAVALGLSSALLEEVVYVDGMPQARNYDGYPILTPDRMPRVHVRIVE
SGAPMGGIGEPGLPGVPPAIANAVSVLAGRRVRSLPLSKHDFKGVDG
>SMa0563 aldehyde or keto oxidase, probable
MQKRKLGQGLEVSALSLGCMGYGKARDIPDRPQMIELLRRAVDLGMDFFD
TAEVYGPWTNEEMVGEAFAGMRDKVKIATKFGWDIDQSTGEHGGGVNSKP
TQIRSAVEGSLKRLRTDFIDLLYQHRVDPDVPMEDVAGTVKDLIAEGKVR
YFGLSEAGAESIRRAHAVQPVAALQSEYSLWTREPEAEIIPTLEELGIGL
VPFSPLGKGFLAGKIDASTAFAANDFRSQIPRFAPEAREANQALVDLIRS
VGERRSATPAQVALAWLMAQKPWIVPLFGTRKLERLEENLGALSVTLSDD
DLEQIESGAAAIRIEGARYPEEMLRRSGR
>SMa0607 hypothetical protein
MKIARKYGPIAAAFVAALAITTSYAQSPTPGYNTKIPEQILTPDKVESSI
GTLNFADGVPTAETAGKIYDYLDTLRGVEVFLNFMPAASLEALRMGNAEM
GATKANQALIFDQLLDSNPLLLTGNTDTVYCSVFLDLETDGPTVVEVPPG
TGPGTVNDAFFRFVIDMGAPGPDQGKGGKYLIVPADYKGDLPKDKSEGGE
YYVARSPSHVNWLILRGFLVDGKPDAASKLFREGLKVYPLAKKSNPPKME
FLDGSKVAFNTVHANTFEFYKELDHVIQKEPIDLFDPELRGLAAGIGIRK
GRSFAPDDRMTKILTDAVGIGNATARSIAFHNRDPRSPLYPNSQWRSGFV
GSDYRWIDLDGVSGRNKDARTNFFYMATVNTPAMAAKLIGKGSQYALITA
DATGNAFDGAKTYRLNVPSNPPAKDFWSVVLYDPQTRSELQTSQPFPSRN
SKRDKLVANADGSVDLYFGPKAPAGKDSNWIETVPGKGWFSLLRLYGPLE
PWFDKTWRPGEIEEVR
>SMa0466 putative ABC transporter, periplasmic solute-binding protein
MAIEFTRRSILAAGMAHAAFVSVTGTFVIARAAKAAEQIELVTWALPSIP
DTLFIPHAWTTYTGAIMSLVQEGLLGFGEDLGLETALADRWEEVDPTTIK
YHLRGGVKFGDGSPLAADDIVATFKYHMSPDSPSQLASFYNSVATVEATA
PDEVTVKLKAPNVQFAFTPAHMAGFVFKKEQLADKNIGTPEALPLGTGAY
KLVEFVPTDRVILEARDDYSGPKPAVKRITFKAIPDRPARLLAMQQGEID
GTFDLAISDIDQWKALGNVDVVAAPSLGVYLLTLDHSAPPFDDIHVRRAV
AYSVDRNGLVQALLKGNGEAATALNPPEIWSGVLPPDEVRAFYATLLPNY
AFDLNKAKAELAQSNHPDGFDVTIPASTADPYMVNILQSVTENLKQIGIR
AHIQESDNNTWFTNYVTHDKLGMQIMAYYPDFADAANYPSLFFSSANAEK
GGMNGSNFNNREVDAKLKIANENADPKARADALKKVFTIANEEVAVVPIF
WPASAMAINNKYKLTGYNAFWYSIPWAMRGFGPK
>SMa2145 probable aminomethyltransferase
MDDIKFTTGLTTAQGARVAFRGTPFVERTAPLNQNALWMRWDRNMVVDAY
SDMVAELSAIRTAVAMGDMSPLSKYVIAGPDAEAMMDRLIPRDIRKLQVG
QIYYAPWCDENGYVVGDGLVFRMDENTFRVSADPGFTWWRQHAEGLDLQV
TDITDTYGILTLQGPRSREVLEAATEAGFQELPFSRLAVVTIAGRQVEIL
RQGFTGEHGYELWVKAEDGPTVWDAVEAAGRPFSIRPAGAWALDVARLEA
GLLIVGYDYTSAGPDHGGAGIQASGKFRASPFDLGLGRLVDFKKSDFIGR
TALERLSKYGQHRQLVGLEIDWKQIAGTGLESEEPGNLRRVRWYPVPVFG
GSVEIGHASSVAWSPTLRKLIGFGHLQQAFGEIGTQVTLRWEDDGTTRDV
AARVVALPFHSLRRTASN
>SMa0278 hypothetical protein
MILGRVGPVARKTSDRVMTTFTGRSPALRDPAASHLGRVWSAEPARIAGA
SGLIASSTSMTCSSGSYCTRINRAASCASNAEVARAAPASSLRPWLVQNL
RQGATWRLEWQGAFGT
>SMa1410 Putative oxidoreductase
MGGPMSVYFEKSYQRGFGTYPLKGEPLKAAVREAITVGYRAFDTAQMYGN
EAETGEALAESGLARDELCITTKVHPDNYSEEAFLPSVEASLKALRVDQA
DVLMLHWPEINGENARSLRLLQKAFDIGLARNIGVSNYTAPMMREAQSIV
EAPLVTNQVEFHPLIDQSRLLDAAEETKIALSSYCSVARGEVFKHPVFAE
IGARYGKTAAQTVLRWILQKGVSMNTMSTKPENIRANFEILDFALSPHDM
KRIDAMNATNYRILKAGMLPWVPDWDR
>SMa2263 hypothetical protein
MLQIHHHESFDVDIFLDDPQLLPYLNPKTQGYALDINPDGYESDGSRTLM
IVFENVGEIDLCPQPAGNPAVRAEVRGRQVLLEAPGEIIAKKVYTAAPRC
SPATCSTSLAS
>SMa0609 hypothetical protein
MITKRDLLRSAAIGALVAATANSTTVIAQDKAEWPSPLEAKDIAEEGFIY
GLPLVMNYAVMQEFAVNRDSGQFKAPFNEINNMHQLASPEDTAIITPNSD
TPYSILWLDLRVEPVVVSVPAVDKERYYSVQLIDGNTYNFGYIGSRATGT
EPGSYLVVGPDWKARSPRASSRSSDRPPRSYSPTFGPSLST
>SMa1231 conserved hypothetical protein
MTYKTILLVVGISQLEDDLRAAADLCASEGAHLSVLVSKIATLPPMGDLA
AISAAWIDSRDGDMEQLRQSVREAREILGSAGISYDVVGRYSETMRLGQD
VGERAWYADVTLVGTSLRVDDLLRRGVIEGALFYSARPVLLAANLRSVTL
EPKNVLLAWNSTIESARAARESLDMMQNAEGVHLVLVDPKTKNGEEPGVD
VATYLARHGVKVAVDQLASAGRPVEEVLAQHARDTSADLIVIGAYGHSRI
RERVFGGVTKSMIDAPMLPVLMVR
>SMa0128 hypothetical protein
MRHPSDNGFAERRNAAADAKRQLLTKFASAPKPTDPEMREKLAAREAASR
ARDARRAEREALKTAENERILAEAAASAAAAEAEQRAEAEARQAEISNRV
SRVVADEAARKAERDRRYAARKARRA
>SMa2117 putative oxidoreductase
MLEDARTRWHPIAATDDLPLRHVFHGQLLGREFAVWRADDGYVNVWENRC
LHRGVRLSIGINDGRELKCQYHGWRYSNRTAGCTYIPAHPADAPARTITN
RSFQAVERYGLVWSSEDPRGDLPVVEGIGEDDLLVLRAIPVNASADLVVE
RLQSYRFQPNSEIAGAGANVELVAATQAAVALRSYQGRAETLGVFFVQPV
DSGRSVIRGVVSGRPNDAQLTVVLRCHNEALSTLRADLEREAAALPAPTP
IEPIFERVSEELASLPELDSRGRKAAIRVQVARKWQTADGIMAFQLRPVR
GLLPTFQPGAHIDVHLPNGLVRQYSLTNGPGETDCFTIGVKLDPASRGGS
QCLHDSVREGDVLAISEPRNNFPLRRDALKTIFVAGGIGVTPLLAMAQTL
NNQSLDYELHYFAQNEQQLAFSECRQALGDAVKPHLGLSPGDTVKELRRL
LSAYLPDTQLYVCGPGPMLESTRSLAAEAGWPEAAVHFEYFKNTNVIDDS
SSFEVALARSCLTIKVAAGQSILEAMREAGVDLPSSCEQGACGTCLATVI
EGEPDHQDVYLSPSERASGTKIMTCVSRSKSARLVLDA
>SMa0940 probable response regulator of two-component system
MLAASGFQIESYSSGEELLARLPRGSGCILLDLQMPGLSGLELQARLADL
APLLPVVFLTGRGDIGITVRAMKAGAHDFLEKPASSAAVLEAVERALQLC
ETRRKEHDRAQALHTLLASLTPRESQVSDQIVRGKRNKQIAFELNTSERT
VKAHRHKVMEKLGVRSLAEVVSMAERLGLLEKTT
>SMa0316 conserved hypothetical protein
MTMTHIEQQLENLASPGDRQKHLRGLAVLKQIGGENFGGPVSQLASFSED
LARFTIQYPYGDVLSRDGLDLRTRQILTAATLLAHGSAQSQLSFHLYGLL
NAGGTRADVVDLLFISAGLLGFPTAINAVPIVRDILADRDETKNVAGAQT
SPAVPDMPSDRFAVLDRIAPDFVNWREHTLGEEIFGAVHLEPRLAHLASA
AMLAARGKVGASFDAHIVSALAAGATDSDIVEMVIQLSVYSGFPAALNAA
GRARNVLAAPERPQASTQKGVDTTKDDEKRFMRGVATLAETSGGSGADVV
DSFKDVAPGLGRLIVAHCYGDIFCRPALDPKVRELGAISALAAQGTVAAE
KPLGVHIDAALNLGATPEEIVETLFNIIPYAGYPLIEKALLIAHDRISLF
DEKQVDGSPSDHRTLDMCSG
>SMa0800 putative ABC transporter, permease protein
MVSQTETGSGRPYSPAQVRRLTDFDLTLPALGLLILFFVLPVAMLLTRSV
TEPVPGLGNYAELLGSSTYLRIFANTFIVSSLVTLVSLLIGFPVAWALAI
MPSRAASIVFAILLLSMWTNLLARTYAWMVLLQRTGVINKMLLGMGLIDT
PLPLVNNLTGVTIGMTYIMLPFIILPLYGVIRKIDPAILQAAALCGANRW
QSLVRVLLPLAMPGMAAGALMVFVMSLGYFVTPALLGGTSNMMLAELIAQ
FVQSLVNWGMGGAAALVLLVVTLALYAVQLRFFGTNRIGGR
>SMa0567 acetoacetyl CoA synthetase, carboxyl terminus
MRSSLRGLSINSFFETTSYLEFRSSYSDESARIDQVAEPAFDVSQHGGIV
IHGRSDATLNPGGVRIGTAEIYSQVEQLHEVAESLCIGQDWDDDIRVILF
VLLRDGFDLTEELQAKIKAKIRTGASPRHVPTKIVQVSDIPRTKSGKIVE
LAVRDVDHGRPVVNKEVLANPEALDQFVAMVELSV
>SMa1073 TRm23b IS ATP-binding protein
MLAHPTLDKLNAMGLAGMAKAFSELVANGESEHLSHAEWLGLLLEREWSS
RYDRKLAARLRFAKLRHQATPEDVDYRAERGLDRALFMKLLGGDWINAHD
NLAICGPSGVGKSWLACALGHKACRDDRSVLYQRVPRLFAQLALARGDGR
YARLQRTLGHVQLLILDDWGLEPLNEQARHDLLEILEDRYGRKSTIITSQ
LPVSAWHGVIGDPTYADAILDRLVHNAHRIELSGDSLRRNLPRKA
>SMa1614 TRm1b transposase
MKSVCETLGVARSNIAARAAGSPSRARGRPPLPDRELVEDIKAVIADMPT
YGYRRVHAILRRNARKLGRSWPNAKRVYRVMKLHNLLLVRHTGAVDNRLH
EGQVAVERSNIRWCSDGFEIGCDNKEKVRVAFALDCCDREAIAHVATTEG
IKSQDVQDLVITAVENRFGRINMLSEPIEWLTDNGSCFIAKDTASLLRDI
GMEPCTTPVRSPQSNGMAEAFVKTFKRDYVAVNPTPDAETVMAQLPFWFE
HYNNLHPHSALGYQSPREFISSQSQT
>SMa0677 glutamate/aspartate transport protein, putative
MRSREALGGCNMQAAPGKGRQSGRRIVRFGLTLGAVTVSMWASAEAQTLD
RIRSSSTVKLGYDATARPFSFKAEGESATGYAVSLCMEVVEELKRELGIA
DLAVEWIELTREAADHAIRQGSADLFCGASPVTLTRRKEVSFSIAIFPSG
TGAVLNASAPLALREVLTQGRPSDRPIWRGSPARTVLNQKTFSPIAGTTS
EDWLAERIKTFQLSATIAAVENYDQGIANILNGESDVLFGDLPLLLDAAA
RGENSGDLIVLKRHFTYEPLALVLARNDEDFRIVVDRALSRTYRSEDFPA
FFSEWFGPPDDTIVTFFRQTTLPE
>SMa2161 hypothetical protein
MEVRLDLAFDLSRTTSPGGLKPATGPFGFKMVLGRLRQLPVFGFHRLLHW
YTTRLASRYNVSSGQDAERMAG
>SMa0501 putative ABC transporter permease
MDYTFQFGPLLNYLPQIVSGLWLTIGLSFVGIFGGIALGIFCAVMSTLSS
PIPRLLVRLYVEVVRNTPLLVQIFVIFFGLPNIGIRLSPLTSVFIALVLN
NGGYVAEIVRGGIEATHRSQVEAAESLGLSYFQTLRYVILPPALEKVFTP
VVSQCVLLMLSTSLVSAIGVEDLTGAAMIASSETFRTMEIYLVVAMVYVV
LNFVFRAVLNVAGLMIFTRSRRRLLGRA
>SMa1435 Probable ABC transporter permease
MTDLALPSVRSPGALRQLAIYLTRDKLALCAAVFLLVLIVCVLVGPLFVG
ELAGKLGLRQRNLPPFALEHGFIYILGADTLGRPILARLIVGAQNTLGIA
AAAVFCSMALGGILGLAAGYSERWYSHVILRLADVVMSFPSLLLALIVLY
TLGPSMTNLVIVLAITRMPIFIRTARAEVLELRERMFVSAARSMGASTGR
ILFRHIAPLVLPTLVTIAAIDFATVILAESSLSFLGLGIQPPDFTWGAMV
ANGRGYLKTAWWIAFWPGLAILLTTLSLNILASWARTVADPLQRWRLQSL
RKAPR
>SMa1918 Hypothetical protein
MFARTKHRTVHFDKPFWISGLSEMVTAGDYVVDEEEELIEGLSWSAYRRV
ATFITLPATTENKYRMRLVPIDPEELEGLIAFDQRDAAASSNL
>SMa1036 hypothetical protein
MLSCAAAISYSDKTMKPLFFVAALPLFASGCASTLPPDVIASGDLAASQA
QVRPLRYTSPVSGYTHRVPVDPQPWRNQNDAQTPEGDAS
>SMa1028 conserved hypothetical protein
MYRRYGRWSLLLSWMPLFGDALTVLAGVLREPLWSFLLLVTVAKTGRYLV
LAAATLGTKAAFGV
>SMa2261 hypothetical protein
MFDDAASLFKQVAPSADRVFDQLRRPRQREGVSGKYGELPSQIGGGRRIN
VHFVQERDLKTPVGLGKEGKC
>SMa0067 putative ABC transporter, periplasmic solute-binding protein
MIAGTALGLTLMAPAASAEGLSIAFISHSSASNTFWQAVKKGYDDACEKV
GASCQLILTQTEGAVEQAVANLQAAIASRPDAIFVAIVDNNAYDNVIKEA
VDAGILVLAVNVDDSEGAKGNARKAFIGQGFTAAGYSLAKAQSENFPKDG
PLNLLVGVNAPGQTWSEQRAGGVTKFLEEYKAEHSDREVNITRIDSATDL
ALTADRIGAYLNANPDTAAYFDTGYWHAGVAKVLKDRGIEPGKVLLGGFD
LVPEVLQQMEAGYVQVQVDQQPYMQGFIPVMQAYLWKTAGLTPSDVDTGQ
GIVTPKDVPTILELAKQGLR
>SMa1086 Conserved hypothetical protein
MLARDIMKKRVLSISPDHSVSHAARAMLENQISGLPVCDDRGRLVGMLSE
GDLLRRAELGLVSRRDIAGVRAKPEAFIKGHSWRVGDVMTQPVVTVDEDM
PVGRVAELMAAKGIKRIPVMRAEEMVGIISRSDILRAVTASLPDVIANGD
EAVRRAVLARLCSDLGLEKGAIDVTIENGTVSLSGQVESEALREAARVAA
ETISGAGGVRNRLRIVANGGASDG
>SMa0522 hypothetical protein
MKRMMMIGAVLVALAAPAMAQDVDTLSPEALLTLAQKEGKVTVYSFTSRI
ARVEKAFEEAYPGIDLIGFDMSSTEMITRLRTEAAAGITNADVVYVSDAP
VVLSDLLETGLLKNYVPPRIADKLDTAFKSPLLAQRLSTKVLMYNEAAYP
NGAPIKNLWDLTTPEWKGKVLMVDPLQRGDYLDLMTEFVLRSDEMAKAYE
ALFRKPIELDDGVETAGHQFIVDLFENDLVLLADGRCECGGW
>SMa1547 probable Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
MKPMNEEHLAVLRRHMVEVVAIYADLASEELGKAALDERVMAAMLRVPRH
LFVPAQAAPFAYQDMPLPIGFDKTVSQPFMVALMTDLLAPKPHEAVLEIG
TGLGYQTAILAQLAGKIWSVEIIEEFASHAEALLHGLGMSNVGIRIGDGS
RGWPEHAPFDKILVTAAAEEPPPALLEQLKPMGRLVLPVGSEEQVLTVID
KDSEGQFLARQLIPVRFSKLEAV
>SMa0447 conserved hypothetical protein
MSMVTQGKLIGQLLAFGALGLLAIGCTLVLWPFLSAILWAAVICFSTWPA
YRLFERAVGGYRALAAAAMTVLVVVVIVAPLALLATTLADNISSLVAGVT
HVLEQGPPAPPDWVRGLPIAGEGLATYWEGLAHNAPAFTIELKKVIGPFA
DVALIGGTLFGAGLLELALSIFIGFFLFLHGRRMTALTRQIAERVAGARA
RRLLSVVGVTVTGVVYGLIGTALAQGLLAGVGFWIAGVPQALLLGCLTFV
LSFVPAGPPFVWGPVALWLFMQESVWWGIFVAIWGLLLVSSIDNFLRPYL
LGRNTNLPVLLGLFGLIGGVLAFGLIGLFLGPTLLAVAHSLFREWIAAEL
EERRQPPSSSTGRDQRSGPRQGG
>SMa0265 putative
MRLPPARLRNLSVALLEKRGVPADSARLQANLLLEAELRGLPSHGLQRLP
LLLSRLDKGLANPTTRGNGTWRRASFLSVDGERGLGPVVMMDAMRVTRRI
LKETGLAIAAIRNANHMGMLAYYAEAAARDGLIGIVMSTSEALVHPFGGT
QALIGTNPVAIGIPAAGHPFVLDLATSIVSMGKINNHAMRGLAIPPGWAV
DRDGRATTDPHAAQAGAIAPFGDAKGYGLGLAIELLVAALAGSNLAPDVN
GTLDDIHPANKGDLLILIDPSAGAGSIPALAAYLDRLRLSRPLDPTQPVA
IPGDGARARRAAAAKTGIELPQPLFDHLTALEAA
>SMa1050 Hypothetical protein
MSTSILAAVGSGGIVGFMLGLLGGGGSILATPLLLYVVGVTQPHVAIGTG
ALAVSVNAFANFASHAIKGHVWWRCAAVFSALGVLGALGGSSLGKAMDGD
RLIFLFGILMVVVIGGGIVGGVLGMLLATRLSAYKNILHRLFAALIFVVA
AYILYQSARQAGAHQSLLDPHVVFDGAHAPS
>SMa1943 Putative transcriptional regulator
MTHNDELPPLEALQAVLSAYRLGSFSAAAAALGISHGAVSRRVAATERWA
GLRLFGRHGRGVRATLDGERFAARIELALAMLHDSRGMGRSDHGLDTVRV
GLVQSFARLWLTPRLAALEGTPPDLRIEMDIDNAHMALSDARIAIRLGRG
G
>SMa1491 Putative oxidoreductase
MLLLPSQLKQFARCFHARFPASCIEDTAMELTINGSRHQVDIEPDTPLLW
VLRDELGMTGTKYGCGLAQCGACTVLVDGQATRSCVTPVESVAGSEIMTI
EAITEDPVGQKVVEAWVSNQVPQCGYCQSGQVMAATALIKQSPRPSDEDI
AGAMINLCRCGTYNAIAAAVRQAAEARSGATP
>SMa0211 hypothetical protein
MLGDWDMDDRDTNHNGEKNSASDLLEIAAQLAALAEDIKTLAAMPIEQIS
TLIEPGDRPEASSE
>SMa2253 conserved hypothetical protein
MKVLLDSHAVYWWTIGSDRLSLTARSLIEDKANTILVSAVSFYELDNKMR
LKKLDLKPQELRAAVSASGLQTLAITDLHAELAAAFEWDHRDPWDRILAA
QARLEHCALVSVDGAFDAVLHKRVW
>SMa1539 hypothetical protein
MKMRAGWTKSIIVVLAPHQPGAMAEKLGNEDEQHAGAEAVRRRPARQALP
QVFGVVAGNQEVAGDDQDEHQIEQPPAQPRILRVTKEIDDRLDHRITPSA
RVTK
>SMa1822 Hypothetical protein
MNFVQDRFPGRGDDAAIEGDSAHLSGGCLLKPRSAQGLGPIPMFVVERFS
LQVSSKYGTQGMTDVSSIAATKRHDSLGCSMDRLDDDRTIEGRDMSFHRK
RSASAPPGRVTTPATGRGYLLGLSAAGGHSRRIIKEHHSTVHDFEQNAVY
LRNFADDYCADLSGSFDFMLLEINHNALERIADAADLRSVSELRPVAAHA
DPVLGGMLGALFATVDGSADRSALFVDQLSIAIGVHVVQQYGNGRGNVAA
SGRRLSSRCQARIKDLVQSQLNGELTVEQLASACNLSQATFLRAFRETMG
KTPHRWLQQQRIEKAVDLLQFSQTQLSEIASVCGFSDQSHFTRAFVQAMG
ATPGAWRRSRQLSA
>SMa0429 hypothetical protein
MADRFLALSKDDRREALAVAAARTGRPIHLLEKDVWVVWTLETLYSSKLG
EHLVFKGGTSLSKAYGAIRRFSEDIDLTYDIRALAPDLVGDNDEALPKTR
SEEKRWTSEVRKRLPVWVAESVEPVIAAAVRGQSLPARIRIEGDTLYLDY
EAVSSGSGYVAPSVMLEFGARSTGEPASIRDVGCDAAGHVDGVEFPKASP
RAMHAERTFWEKATAIHVFCLQERPRGDRFARHWHDVVRLDDAGFADKAF
AERQLANAVAKHKSMFFAEKAADRSPIDYAAAGHLVLTPSGDGLRALSED
YAKMVDDGLLLGDSEPFEQLIERCALIQERANKVGTTE
>SMa2313 putative oxidoreductase
MNEPTRIRWGILGPGNIAKDFFAGALQSANGKVVAIGARNPAKSGLAEDF
PGARIVDGYDALIDDPGIDAVYIATVHPLHAEWAIKAAEKGKHVLCEKPM
GLSTAEADAMFEAARKAGTFMAEAYMYRLHPLTARIVELVKNGMVGDVRK
IQSSFGFAKLPFDEGHRLFSNEMAGGGILDVGGYTTSMARLIAGIGTSSG
VMEPAEVTALGHLGRTGVDEWTSALLSFPNGIIAELSCSVSLEQENVLRI
LGTKGRIEVDQFWFAGGKPGGTSIIRIVHADGRQEEVPLVEPRHLYSFEV
EAAGDAIRAGRTEFAYPGMSRADTLGNLRVMDKWRAAIGLEYEGEKHTTR
TRTVRGDKLARKTSLVRSGRIDGLQKEISHAALGLMEFSTFSSAAIVLDA
FFEAGGNLVDTAFLYGNGVQDRLVGEWMRSRGVRQETVVIAKGAHSPLCY
PDVIGKQLTTSLERMGTDYVDIYFMHRDNPDIPVGEFVDAMDAEVAAGRI
RGPIGGSNWTRERFEEAIAYAERAGKTKPSVLSNNFSLAEMVQPVWAGCI
SSSDDAWMRWLEENDVTNFAWSSQARGFFTDRAGRGKLDDLELARSWYSE
GNFARRDRAIALGRKLGKDPIQIALAYVLAQKGRVIPLIGPRLLAELNHS
LDAFAVTLSPGDVQWLRDGDPGESSAA
>SMa1615 TRm1a transposase
MTTSNFKMEVLSGPERRRRWSTAEKLAIIHETYEADATVSIVARRHGIQP
NQLFAWRKLASQGALTATAAEEEVVPASEYRALQAQVKELQRLLGKKTME
SEILKEALEIAGSPKKHLLRSLSLPRGILG
>SMa2000 Putative ABC transporter, periplasmic solute-binding protein
MSTIIRKIRASVLLAGVFFATNSFAQEVPLLEGVTTRAENNPTVEEGKYK
KDAPWVIGMSSFGVNANTWTVQVAHEAQAAADNDKRITKFILLDAGFDQK
KQVADIEDLIAQKVDAIIVQPVTSTSANASIEKAVAAGIPVVLHTGRIES
EAYTTEIQGGAEHFGKVMGDFLVKELGGKGNIWVLRGLAGHPEDTNRYNG
LKQSLEGTEIKIAAEEHGDWQYDKAKKVCETLYLSDPNVDGIWSSGADMT
RACVDVFKQFGSPIPPISGEGNNGFFGQWIADGFKSISSEYSPAQGAAGI
RAAVALLEGKALHKHYDYNPPGWDLEKVKKYYRDDLSANVWWPSELSEEQ
LKEFYAKQ
>SMa1261 Hypothetical Protein
MPNLRTEQNSRAGTQFETVSPIVCSPSKCDCPRGDRFGKFRSGSSRRRIG
MHEGRSTENSTTMTETQPKPSIDVRTVPPRERHPRIFGMLGALPAGGSML
ITSDHDPRPLRYQLETNFPGEFGWDYLEKGPEVWRVEIARLEEGAGCECC
CGSDH
>SMa0492 putative ABC transporter, permease
MNIGIVFDAIPRMLGGIVMTFQLLLLSLAIGTMIAVLLLLMRISGRWWLS
WPAQFYTYVFRGTPILVQIFIVYYGLPQFEWIRESIFWPILRDPFGCAIL
ALSLNTGAYLSEIFRGGVLAVERGLLEAGAALGMSATHRFIYITTPLAIR
IALPAYGNEVISLMKSTALASTITLVDMTGIGRTIVAETFAPYQVFLSLA
IVYVAITWIIQRSVKRLEVYLGRSTAR
>SMa1354 Putative epimerase
MGRRYRGSEKREEGTMKIGMCMFLWTTSVSRKHEKLLRDIRATGFDGVEI
PIFEGTPDDYRRLGALLDGIGLERTAVSAMSDPSMNLISPDTASRKRGIA
YMQWAIDCAAALGAHALSGPLHSTLGQFSGTGPTAAELRRSISSQRTIGD
HAAQRGVTVALEALNRFECYLVNTMDDLTAHIDAIGHPNIRAMYDTFHSN
IEEADPVGAFTRNADRIVHVHISENDRGVPGHGNIPWPETFKAIRASGYD
GWLTIEAFGRALKDLAAATKVWRDFSETPEVVYRDGYRHIRDGWKAAA
>SMa2057 hypothetical protein
MPLLARSATLITMMWTWRKGVSVLREKTARQDIPLSQFMAIVERKSEHAP
VQVPRTAIFLTATPDTTARGG
>SMa1625 putative LysR-family transcriptional regulator
MLNEIDLSRADLNLLVLYEMVLEERNVGRAAERLNLSASAVSHGLGRLRR
LLNDPLFLKTPKGVVPTERARALAAPIADILARVRRVISTAEPFDPAHTR
RRFTIGTADGFSVFLPPLLDEIARKAPGIDIVVRHMQMESALSDLDERLI
DVAVAPFYELPARFAAQRLYEEEFVVAARIGHPFLKNPTLENYCRMQHLL
VAPRGDPRGLVDQLLESRNLARRVALAVPNFMLALDLIVKTELVSVLPKR
FIEMHAERFAVATARLPFDLAISSIQAVTPKAALMDAGLAWLLQVLGRAD
CDAESSL
>SMa2159 hypothetical protein
MAFPATQAYLTWVVFLVAPPSALGASDRIAIGDFAIDRNEVTIGDFSTFA
EATKLKTAAEQEGGGREWGSGWERRPGWTFRTPYGEPPLDTLEPAVHVSW
HEARGYCAWVGGRLPTREEWRLAAYREHGGGSAAGFVTGTTYQYPTGATP
EGANTNDDDAWARHAPRGATRQGVNGLYDMGGNVWEWLADREGGDALTAG
GSWWYGPDKMQEESMQWKPADFYVAYVGFRCAYDPKS
>SMa0217 putative ABC transporter, permease
MGMPAGAIFLVFMTLQIVCIAGALLYPDEFRYLSPQNLTILMKAIPVLGC
LALGAGVLMIAGEFDLSIGSVYTFTAVLMASLVNAGLSAFIAAPIAILTG
LLIGSLNGHITLRFGLPSFIVTLGGLLFWRGAVLLYNGAVQVRFDPEPVF
TSLFSGTLFGVNAAFIWIVLFVTGFHLLLHRHRFGNHVFATGGNRGAAEA
IGINTSRVKLIAFAIAGGMAAVAGILATARVGSVQPGQGAGLELQAIAAC
VIGGLSLRGGRGSIIGIFLGVLLIHTITDVLLLLRAPGFYLDMFIATLIV
LAAIFNHLIERRGLA
>SMa1415 probable aldehyde
MDQLNNFLSPPAAPRDFGFFVDGKWQSGHDFFVRHSPGHGVAVTRTAKCS
VDDLNAAVAAARRAFEDRRWSGLPGGSRASVLLRVAEILRTRRDELAYWE
TLENGKPIAQARGEIDHCIACFEVGAGAARLLHGDSFNSLGDGLFGMVLR
EPIGVVGLITPWNFPFLILCERVPFILASGCTMVVKPSEVTSATTLILAE
VLAEAGLPDGVYNVITGSGRTIGQAMSEHPDIDMLSFTGSTAVGRSCVHA
AADSNFKKLGLELGGKNPIIVFADSDLEDAADGAAFGISFNTGQCCVSSS
RLIVERSVAREFEALLAEKMKRIRVGDPLDETTQVGAITTEAQNTTILDY
IAKGKTEGAELVTGGTAIDLGRGQYIAPTLFSGVSREMAIARDEIFGPVL
CSMTFDTVEQAVELANDTVYGLAASVWTKNIDKALTVTRRVRAGRFWVNT
MMAGGPEMPLGGFKQSGWGREAGMYGVEEYTQVKSVHVEIGKRTHWIS
>SMa0439 hypothetical protein
MRAPRILFIRQLPSKPRTACGGGRVHSFLSVRDLASARHCKAPRGSYLFT
RRSTEKRAMKELVINIVSDDMAEIVVPSGNNGRLRCSVVCLCCQRADQPM
DDDGCGICDACLDLPVRAMDALDGLELPTPFPHLSPTTRNQ
>SMa1459 Putative
MTPVETSWEEIEAVCLEALTLHGAAAETARAVAGAITRAEADGNRVCGLY
YLPIFCRHLAIGKVDGEAVPEVTTRGVTVTVDARSGFAHPAIAAGTPALI
DLARQAGLAAMAVRNSYNCLALGHHVRPLADAGLIGICVSNAPASVAPPG
ATRALFGTNPLAFAVPSKEGAPTIVVDQSMSAVTKTEMILRRDRGEAIPI
GWAQDGNGQPTTDAATGLEGSLLPAGGRKGANIALLVEVLAAALTGSALS
TEASAFGNEEGGPPHVGQFLIAIDPDHFAAGHFSEAMDNLVASHDAAGVR
LPGHFGRKQPVCVDADLWKKAVLLSKSKNRQKPG
>SMa1264 Conserved hypothetical protein
MASERNPPGRERPCSITGRRSRSSWTAPTCMLPRRRSASTSITASCSRLF
GSAPICCGGNYYAPLVEDQETPTIRLLIDWLDYNGYQMVTKPIREFTDTL
GRRRIKGNMDIDLAIDAIELAKTADHLVIFSGDGNFTSVVAALQRKGCRV
TVVSTMATRPPMISGELRREADHFIDLAKLRGEIAREHAEVGPVREKDAV
GEVETEM
>SMa1869 Putative deaminase
MPAPLKFVHTTCELPDAADAVVIGGGIIGVFSAYYLARRGLKVALVEKGR
IGAEQSSRNWGWCRQQNRDARELPMATKSLDLWERFAAENGGDTGFRRCG
LFYLSNSEEELAGWARWRDFARTVGVTTHMLDSAQATERGRATGKPWKGG
VFSPTDGTADPASAAPAVARAILELGGTIHQSCAARGIETEGGRLSGVVT
EHGAIRTKTAILAGGAWASSFCRQLGIRFPQAAIRSSILAVSPGVTSLPD
ALHTAAVSVTRRSDGGYSLAISGRGRIDPTPQQLRFAPQFLPMFVKRWRS
LAPGGLEGFRSGHETLVRWRLDAPTPMERMRILDPAVDNATIRLTHSRAL
DLLPALKNTRITAAWAGYIDSTPDGVPGIGEIAAIPGFILAAGFSGHGFG
IGPGAGHLIADIVTGSEPIVDPHPYHPDRFGKSAWGKVADF
>SMa2349 probable oxidoreductase
MELDMQSPGALEISRRDLLAASAATVTVVSAHSLGHAQTNASTAPQSTKV
TFTVNDERRELQLDNRTTLLDALREHLHLTGTKKGCDHGQCGACTVLIEG
RRVNACLTLAIMHEGDSITTLEGLGQPENLHPMQAAFVKHDGFQCGYCTP
GQICSSVAMLDEIKANIPSHVTVDLTAPAEITPAEIRERMSGNICRCGAY
SNIVEAITEVAGRKA
>SMa2053 MocE-like protein
MTWVSACKLDDIEQEGAIRFDHGGRTYAIYRGPDDSVYCTAGLCTHEAIH
LADGLVMDFEVECPKHSGAFDYRTGEAIRLPACENLKTYPAEVVDGEVRV
ALA
>SMa0288 hypothetical protein
MTPAEGVVGPTPREFWIGLGRAFAGALIFAVPVLMTMEAWALGFHLHPLR
LALLLVATVPTLVLLHKYGGFRKSVGLRDRIADAFVALLVAAIAASAILF
SFGIVDADMPLREIVGKVAVQIVPGSLGASLARAQLGPSPLEGDAVPEPA
YAGELFLMVVGALFLSVNIAPTEEVVLIAYKMNPWHEIALALGTLGLMHV
FVYELEFRGTHNPEPGAGFVNIFFRYTIVGYCLVMLVNFYILWTFGRTDG
VGFSETLSAVVVLSFPGALGAAVARLIL
>SMa1418 Putative ABC transporter permease protein
MDIQRLKPHLPWITLTVLVAIVGMADPGFLKPQNLMSLAGDIVPLFIMAL
GLTFAIYIGGIDLSAQSMANMVTVIASVYLASMGAWVALLCVAAGFLLGT
LSGYITTRLYVPSFISTLAVGGVAFSVAQWLSGQRALNMDAAQRNETFGW
MIGRTWGVPNELLIAAVLLLVCLFIERRTILGRALKAVGAGELAAAASGL
NVARYKILAFAISGALAAIAGLLFAVKLSGGAPTIANGFLLPAIVAVLVG
GTPLTGGVGGVLNTVIGTLIVAVIRASMLYFEIDATQQQMVFGIVLIGAI
ALTIDRAKLRTVK
>SMa0473 conserved hypothetical protein
MRLVWARYALDDRDTIFSYIERENPRAAVHVDEEIVSAVRRLLDFPESGR
PGRIAGTRELVIPRTPYIAAYMVMEDRIRILRVLHGAQKWPSELDDG
>SMa1965 Hypothetical protein
MTPTVPGVRTIRKPRSWRARSLIIDFAEWISTKDWRSDKKGRGVARQPSK
DFRLYCPGQALEEVTRGGNLVDLDDEARHQLRITAKKLRYAAEFFSPLCM
GKAETKRHKRFITAMEGLQDQLGSLNDLATAPDMLSKLALSGVPGAGDLV
SAADKARF
>SMa0237 putative D-threonine
MSGKTKIAFLGTGLMGAPMARRLLGAGFSVTVWNRDAAKAEPLAADGADI
AASPADAVAGAAIVFTMLTNGQAVSEVLFERGVADSLAEGRIVVDCSSIA
PQIAREHARRLAEKGIRHLDAPVSGGVVGAAAGTLAIMAGGDGAAVESLK
EVFAVLGRVTHVGPSGAGQVCKLANQQIVAVTIGAVAEAMVLVEAGGASR
AAFRDAIRGGFAESRILELHGARMVERNFAPGGASNNQLKDLNAVMAMAD
ELSLELPLTRQVRQEFADFVESGGGEQDHSGLLLQLEKLNPRN
>SMa0489 putative ABC transporter, ATP-binding protein
MIEATFMNKPNDTMIQLIGVGKRFGQFEALKQVSLEVRRGEKIVLCGPSG
SGKSTLIRCINRMEEHTSGRIIIDGRELTDRTKDINAVRREVGMVFQSFN
LFPHMTILKNLTIAQRLVRKTPEKEAKEVAMHYLKRVKIPEQASKYPVQL
SGGQQQRVAIARALCMKPQIMLFDEPTSALDPEMISEVLDVMVDLARDGM
TMICVTHEMGFARSVADRVMFMDGGQLIEEGDPETFFANPRNERTALFLR
QILRH
>SMa0952 AttA2-like ABC transporter, permease protein
MTRSLASLPHSVSFRRSSSARFYSPIASMALVTPLLAAMLAGFLYPVARL
IALSFSGGTFSHYRRIFTEPLHLEVLFSTIEVAFVVTVAGLLLGFPVAYL
MARLSRGLAMAVAACVFVPLWTSVLIRSYAWVVLLQRNGIVNKLLADTGV
TEGPLKLIYTQGAVILAMTHVLMPFMILPIYSALRALPPDYVRAARNLGA
GPIRAFVTVTLPLSLPGIFAGSVMCFVLALGFYITPALVGGPSSMLMATL
IGQQTTVLLDWPFAAALSTVLLAVTLLFVLMFRRTLSLSKGLNSVY
>SMa1081 hypothetical protein
MIACRGYRYALVAPTCRDFHPDFRPDLTPAEMLALGVFGGKYMTDCRDEF
PNS
>SMa0498 putative LysR-type regulator
MTLSFSALESFYWVSQLRSFNAAANKLNVSQPTVSYRIRELEERLGVSLF
VRQRRQLVLTSEGEALKHYAESMIAIARDIESNIKTRNTRLPTLRVGVID
SFAAVCLPSLLDELDIRFAGARIAATVDTSHKLADQLSEGLLDIAVLSTP
PSHDNVALELLGRQSVDWIASHKLGLPQTIVSDEELLRQRIFATPAPSNL
HSLTTGFLAATAGAGLRLNVCNSLGTILNLVESGTGISILPSRLLQEQIR
HGTIQVLKTRTTLPLQEVFIGTNKGAIVRALPQVSQMIRKVSASVGFCV
>SMa0740 hypothetical protein
MFKSSGSHLAGKLEIAAPNVNARRPSCLLRTRIRAMAVPDPVQSIAALDE
PPIYFAAVGAKTDTDEPLISIALSDVAPDGRDAARIDQARLPRPFRTDRC
RHLRLSIFANFPAHLHRAAHTVACDF
>SMa0139 hypothetical protein
MLTVDFTVAGILNGGPAFKHNEAFSFQIATDDQEETDRYWNAIVGNGGQE
SACGWCKDKWGISWQITPRVLAEAMAAGGDEAKRAFEAMMTMKKIDVAAI
EAARRG
>SMa2191 conserved hypothetical protein
MRYGMRVEILGQERRRRWGDAKKLDIVMSIGLDGATVTEVAHRHDVSRQQ
IYAWRHELKKKGLLPSTADTVFLPVDMAAMHGAPLVREDVARVSAMIELR
LNDGRSLRFDSGVDPAVLTRLIRAVEAA
>SMa1413 Hypothetical protein
MSVKSSISLTDQQDAFARSLVESGRYSSLSSVLQQGLELLRQKTEAEAVE
TAALREVIQRRLTGPMISAAEMEGRVAAMIERKRRTVRVDT
>SMa1967 Putative short chain alcohol
MDGAVSLISAASPSASPSELLMRPQRGLVSFTRTWALELAQTGITVNAVA
PGPTEPNCEGEYQYLTGVPMHRLGRPDEIAAAIQFLLSEDAGFITGQTLF
VYCGASIGKALL
>SMa0172 conserved hypothetical protein
MSNSKDEVERIDWLEAELADTIDEDYELELSEPTLSEKIREIYRKAHPPA
LPRMDYFRALLALQAELIKLQDWVVYHKQKVVVIFEGRDAAGKGGVIKRI
TQRLNPRIVRTVALPAPSDREKTQWYFQRYVPHLPAGGEIVLFDRSWYNR
CGVERVMGFATEEEVEQFFDDVPEFERMLVRSGVRLVKYWFSITDEEQQL
RFLTRIHDPLKQWKLSPMDLQSRVRWEAYTKAKEETFARTNIREAPWHIV
EANDKKRARLNCIDHLLKQIPYEDVPHEDITLPERIFNPNYERKVLPPEL
YVPAKY
>SMa1887 Putative transcriptional regulator
MRVSRAQAEANREAVIDVASRLFRKHGFDGIGLKDLMKGAGLTQGGFYKQ
FESKDDLAAQASRRALESALRRWSAAAAANPQHPFGAVVAFYLSMAHREE
KMDGCPVVALGSDAARQGVDVKASFEAGIREYLEVLGEWIGAADGEEPDG
KAMAILSTMVGAVLLSRAVNDEQMSKRFLKTAAESVLKASAADAEPGARQ
>SMa0444 TRm1b transposase
MEDIKAVIADMPTYGYRRVHAILRRNARKLGRSWPNAKRVYRVMKLHNLL
LVRHTGAVDDRLHEGQVAVERSNIRWCSDGFEIGCDNKEKVRVAFALDCC
DREAIAHVATTEGIKSQDVQDLVITAVENRFGRINMLSEPIEWLTDNGSC
FVARDTASLLRDIGMEPCTTPVRSPQSNGMAEAFVKTFKRDYVSVNPTPD
AETVMAQLPFWFEHYNNLHPHSALGYQSPREFISSQSQT
>SMa1558 putative histidine kinase, chemotaxis
MITPEELDPGLLTMFTQEVRERASEMENAVLAIEASGDADRKRHLQEQLL
RGAHSLKGAAGLLQVRGVETICHWMEEILSMAANVGLVLERSRLDLLLSA
ADAIRDAANLLESGEIPSPAHGQDVVEKLKAVATAGSDDGLKEAHHHPPM
AEPEIAIRATDTDGSMRVSAARLDALLYRSGEMLTFNAVMRRHAEQASSL
REEARKMRAPGSDPTGQVASVEGGLRQLAAFLRQDVRLMQSAVTALDEEI
RLARTQPFAEACKGLGRIVRDVAAASGKRAELEISGGELEVDRSIVSALQ
DSLRHLVRNAVAHGIQTPEERRTAGRPEKGRILVSAAMRGNRMEVRVEDD
GRGLNLALLSAATGESTHGKQAEADILRRVFEPGVSTSATVTSLSGRGIG
LDIVKRNVEKLRGAVDVSQVPTGGAAFTLTLPLTLATVRVLEVLAGGHVF
TIDTTSVQRVIRIDRGDFTLIEGRNFVRTPAGPMPFVDVSLWLRLQPNRS
PASETTPAVVVDSPSGSTAVLVDEITGEQELLARSLGPRLANVRRYSRGM
VLPDGRIALLLNVAALAEAAAEVRPRNEVGARPSVAAARRKVLVVDDSKY
VRTLVKLILEGAGYDVTMATDGTEALKQLRDHGADVVVADVDMPSMNGFE
LTRAIRQSDRFTGTPVVLVTGRESLEDKVKGLRAGANAYLRKDQFDAHDF
LETMRQVV
>SMa0781 conserved hypothetical protein
MTKEGKPDRLRQMGALNPKPEGVRAPWFREAGFFDPLDLVQVKYEMLRHA
REEGTNKADAAALFGLSRQTYYQAEAAFERDGMSGLLPRTRGPKSAHKLT
GEVMRLVEEHLDANGQLQARSLADLVHARLGISVHPRSIERAVARKKKR
>SMa0516 probable
MTNRVALIGAGAMGGSIGARLVETGNRLTVFDPGPDKVQALVDKGAFAAP
SAAEAAAVSDYVILSLNAPAIVRQAVFGDAGVAAGAQAGTLIIDMSSIDP
NATKQLAADAAEKGLRWVDSPLSGGAPKALIGELTLMAGGTAQDVKDAHA
VLRHVASNYTHMGSVGAGQTTKLINQVLCGLGFLAVAEATQLALDAGVDA
SKIPQALMGGRADSAILQEYMPRFVTKDYRHTGRIDNMVKDLAGAQDLAR
RTNTAMPLTAACAEIHRMLTAAGLGGEDQAALMEFFRGPNKENFK
>SMa0078 putative LacI-family transcriptional regulator
MSSQERKKSRVTLLDVARHASVSRATASLVLRKSPLVGSETRARVEQAMR
DLGYVYNIGAARLRVERSQIIGAIVPNLTNPFFAELLSGIEEAIGATGKV
VILANSGERVERQSMLLQRMREHGVDGVVLCPAAGTEPDLSEQLAAWGMP
VVQVLRHISVDMDYVGVDYAGGMRQAVDYLASLGHEKIAFAVHGPFHSAY
RERVDGFRDAMLAKAFDPEILIHLPAQLGEIADSTHLLFAKATQPTAVIC
FNHLVALGLAAGLHDCGLTIGRDLSLIGFDDVADAEAIRPRLTSVSTGPT
AIGEMAARLLVERIANPDLPPRRVVNDTTLSVRQSCGRPA
>SMa0750 probable LysR-family transcriptional activator
MPRPYEFSSMTALVCFEAAARNASFKKAAQEMNVTPAAISHQIKALEMDL
KCSLFLRHHRGVELTEKGALLFVVIQRGFETISETLTQIRERPETVDVTI
GATTAFSSLWLTPKISAFWKIHPSITVSQVVSDVPGMTSRCDLTIHYGNP
QENGVEYRKLFQDHIIALGTTRFAAEHRIARLEDLLKAPLIHSSSNETGW
TAWDDWFAVLGCPAPKGRSFYVNNYMIALQAAQDDVGAVLGWDGLVGSLV
NEGRLVKLVQESIPSPVGFHLRIHRRATAKARLFADWLVEAT
>SMa0535 hypothetical protein
MKAIPFVTRIDSLAGEIDLLVSVDADSIDSVEDVRRQVASVPGIATVTTA
LVLRRHL
>SMa1153 Hypothetical protein
MDNDVRLRQDILDELEYEPTIAANIGVAVEDGIVTLTGHVRSYAEKHAAE
RIAERVKGVRAIAEEIDVRLPEHKKTADDEIAARVLKILAWGAAISDPED
INVKVEKGFVTLNGTVDWHFQRSAAENSVRVLTGVTGIDNQLRIRPRMNV
VDVRHGIREALKRNAETEAENIDVEVSGSHVILHGKVQSLRARAMAERAA
WSAPGVTAVEDRLRIEDARVALGA
>SMa2215 putative GntR-family transcriptional regulator
MENAVASDQPLYEQIKSAIDRRIETEEWPPNFQVPSEQDLASEFGASRLT
VRRALRELQTDGVLLRVQGRGTFVIGPRMQCAVFNLADISEEIASSGAAH
SCRVLQHSILGKDGLGRNMLQLGAEDTIFHTRLLHLEDGSPIQLEDRYVN
AAEAPTYIEQDFSKMSPHSWLLRETTVTTVDNTIRAIRVDEEIRQHLRID
ASQPCLLLDRSTWRDGIPVTRSRFIYPGDRYRMRSSHEARTNRIVTIPAT
AGTGR
>SMa1545 putative oxidoreductase
MRREFVGERSASMHQGFFLLFATVSVVTALTLVLARNPVHSALALMACFL
QISAIFVMLGAPLLAVIQIFVYVGAIMVLFLFVIMMVDVREAVLQRFMPG
GNLPALALLVLLGAEMVVLVLWSDRFIESPPVAERGGDAIRDLSKTLFSD
YLLPFEVASVILLAALVGAIVLARKEPG
>SMa2285 hypothetical protein
MRPRLTTWSILTRWRGLTGAFGAPSLESALRLAVLASARPRQRKSRKAVT
GDVLAKLLEACASDRLVDVRHRALFLTAFASGGRPRSEVAGLRVDDLVDD
DPVLGDPAGSWPKIRLSDRGGKPRREAMQQSLHKSVMQAATYYNNAERKQ
GRAARLLI
>SMa0909 hypothetical protein
MNVTSFRAANAAAVPLAPSVRADRAGKNLVAGHQLLPFLERGQAIGTTDL
RATLTNVVGGSDAEGFWAWKDAFAASEVAQILFLRKLGAAISARANAPQA
ALIMLKNVVGLILTHTRRSEESQQLQQFSTPLRLGLVAAHAAGVMAIVQP
PRFRMIATNVKAERRFLSTIARRLDALDAITRGQRQTGGLSRLSRNFGLD
QTEVINADQVWQSLLGGSSVVALAGDMPLRRVRLMSDHRVALSGSTDGMR
DRLKAMSLFSETIAWKLRFFIPTSEGGSAILARLIERHPIVDVTGPV
>SMa1053 Conserved hypothetical protein
MSAYLPSLAGGMLIGASAVMLLLLNGRIAGISGIVGRLLQGVGMTTNLAF
VLGLLLGPLAYLLMFGSWPAVQITAGWPLIIIAGLLVGFGSRMGSGCTSG
HGVLGLARVSPRSMVAVATFLTAGVAAVALLRGLAL
>SMa0503 putative ABC transporter permease
MRTFGSPEFLFIVYALRWTLTLTVLAFVGGGIMGIVLALLRIARIRAFSA
ITTFYMQVIQGLPLLVLLFLCYYAPSLFGIEIAALTAAAIALTINSSAFL
GAIWESALRAIPKAQWESADALALTPYKTLRFVIAPQAIRLALPSTVGFL
VQIIKQTSLASIIGFIEITRAGQLVSNATFEPLKAFLSVAALYFAVCFPL
TQLSLWLERRTACRGART
>SMa0711 putative ABC transporter permease protein, MalFG family
MQRTTLQTLSLYAGLAVVCGVLLFPIYWLFVTALSTLAEIRQLPPSFWPA
EPQWSTFAKVGTERPIFLWLWNSTLAALGSVALSMVVSVFAGYSLSRFSV
KGGRSLGLFILTAKMLPATLLVIPLFGIFRSMGLIGSLWSLVLAHATLII
PFTTWMLKGYFDTIPRELEQAAMVDGCSPLGALFRVVLPVATPGLAATAL
YAFVLSWADYAYARTFLTNAQGSWTANLGITTMKGEYVTDWNEISAAAVF
IALPIILIYLFLERYLVGGLTAGAEK
>SMa2377 putative transmembrane transport protein
MTVNVSPTVMFLLISFGTVLGIAGTDLVLPAIPAMPTALGGTAALAQMVL
AAYAGGTLVGLLTFGELGARYSRRKLLVWSLGLFAVTSLLSAYAPTLEWL
VILRFAQGAFGSGPAVFAPGFIHGLYPGDKAPSMFGRLGSIESLTPALAP
IAGAYLMTVGGWQTSFLMLAGLAILCAVGSWAYRQSLPDRLEALEVHQSY
MSIIRNGDFLRHGLSQALSLGSILIFVFGAPAVMTGALGMTIGDFILLQV
FGIALFILASNASNALARRFGIERMIMIGTGSLVLGFLLILLYTSLGGRS
LTVLVPLWMTANGAFGIRGPIGFHQAIVASRGDHSRGAALVVAAILGITA
GGTAAAAPFINVGWWPLALASSLAALLALLCLKLIGSTA
>SMa0144 hypothetical protein
MTRADFSDEILMRFADGELDTGTAAEVERAMETDDALVARVALFIETRHA
AKAAMEPLLDEPVPSELKRAVERMVMEKTSPPSVSGAGILPFPRRAANES
HRSRWLAPVAASLAAVVAGFGGYWLRGSVEAPIEGGLGVAAIGSPALDEA
LATVAAGEERRLPGSDQRFRAIATFRDNAQALCREFELDSDDRSTVVSVA
CRAGSEWRVTFAVIAPGATGGYAPASSTETLDAYLTAIDAGAPLEAAEEA
RALSELQGAQR
>SMa0374 putative dioxygenase
MIKDIKGLHHVTSMASDARQNNRFFTDTLGLRRVKQTVNFDDPSVYHLYY
GDETGSAGTVMTYFPFPNMMLGRPGVGEVGETQFSVPKGSLKFWQDRFTT
QGVDGLERDTVFGADRLRFMGPDGDSFALIESADDKRAPWLADGIPDDAA
IRGFAGARFSLHDSAATEELLGFMGYERAEKEGDVVRFIISNGNGADTID
LLALPKTPFARQGAGSVHHIAFAVDNREKQLEVRKALMDTGYQVTPVIDR
DYFWAIYFRTPGGILFEVATNEPGFNRDEDTAHLGEALKLPVRYEEYRGQ
IQANLVPLAA
>SMa0649 conserved domain of hypothetical protein
MPELVDLVMARPLVSASMVAKTLDVTPQAARRIVLELGLREIALVTVDDA
ASLVLRIVNTLQECFR
>SMa1082 Hypothetical protein
MARICRRYATGDGWRPPLTLSDGYLANLIRVNVDRSSRRTLEASRSPEVR
RSEMTTLRTILLASTILTLATPGFAEDAHHPEATGGQAGAEAVEPAAPAA
NAVAAMPGGMMCRDMMGGMMQMMPGGMMAAGQGQTGQAGMSAMALMIAPE
HIEGRIAFLKAELRITPEQEPLWNAFAEVLRANSRGMDGMMQTQAAMGQL
AGAAATPLQRVDVGERALASRLESVRKLKAALVPLYQLFEGAQKQAADKL
LMPPMMGIM
>SMa0950 probable AttC
MISPLSRSTTVLKTAIAGLLSLLPISLPALGAEKVIIASTGGAYDKALRE
AWFDPFTKATGIEVVTVAATNSEMRAKAAAMVKTGNVNWDLYLDGEIQAA
SKAHRKVTEDLTEFCRQFADREDLSVNACSAGGALLQSTATLLAYRTGKH
GDSVPATWADMWDTRKFPGERAFPNFDDPWRVMAAALLADGVTRDELFPL
DIDRALAKLDQIKPSVSLWWKTGDQSVQGFRNGEYSLGQIWLTRAKAMQS
EGLPIAWSYKAAFLVGDRIALLKGAPHRDNALKLITFWLNSPAAQAKACE
VLSCTPPSSDAIAMMSDEARKTMPQGDDVRDYVIIPDAEWINANAAMMLQ
RWNEWIR
>SMa0462 hypothetical protein
MVTKKKARAKQTPKRLDVMKATKNKAETANAAAFEIPEQAGKFLRDTSIP
QGFANPQPIVLAELQKRVTEMNLRPTVSLEQFLASANRIKPVDLPIEELS
DDVKESIPALKPEIRRFHGVKIPLYWFPFPWLSSVCSDRFGYMSSAATRA
ATKLPFNVATQALLGQLGNMMGDAGRDPNPASHNPADAGVSSLPAGFTYF
AQFVDHDITFDVSSTLDADIDANTVNNMRSPALDLDSLYGRGPGLDPFLY
VFPTSGPATAIKLHRGTNTPVGPGGPSNNGNPSGMLQQTNWDVPRMQGTN
TAVTGDPRNDENLIIVQFHHAMLRFHNAVVDLLLAAAFAGDIFAEAKRIV
THHYQWAVVHDFLERICGVATVNNAIASVSAPIGSSFRMPVEFAVAAYRF
GHSMIRDTYWVNFNFPNATLGQVFEFNRIPRLPVFSNWVVDFNAFFDTGV
PVPVHNKARKIDSLMASGLESLPGFSGMMAILATRNLRRALALGLPSGQG
MANSFGIAPMTAAQLIFGLPPAEVAVLNASGGLLLNKTPLWYYVLREAAV
LAGGNQLGAVGGRIVAETFVRILKRDASSYLNVAGGFTPILPSSTPGNFT
VADLVAFVGVTQP
>SMa0771 hypothetical protein with local similarity
MHHHHRDGVAGKIQVEQSKRARVNLSTRRVSIVWKEEVAGKRSDPGELVR
AISERGHETHIFTHGEGEGDAVGEPISVSSGDAVEAGTLNLTGPLVAPGD
GKRAQLLPCRDHGADGGGRRREGAKLVSSTARIKVIQTIDFTNHQVCSIG
DKAKEFK
>SMa1200 hypothetical protein
MATAHTSLETEEAPVGASYKAVFRVPHGCEGKPTNVVRVQIPEGVIAVKP
MPKPGWTLEKVSGAYEKSYDNHGSPTTEGVKEVMWKGGNLGDDEYDEFVV
RVYLTPDLPVGKVIYFPTVQECSEGAVERWIEVPAEGQSGDDLEFPAPGV
KLLEKAGGH
>SMa2167 hypothetical protein
MGAGGFRPERGRSHIMTTLKVGIASYEDYKKRTMAVARGQIKPGADDPKV
WFTSMESFAKVLSDRNRELLALIAATKPSSMNELAEKTGRAPSNLSRTLR
TMERFGLVRFEKGAGKTRAPRVDYSDIILDVPLKRAAGWQCERCVDVDSS
DLLFP
>SMa1750 hypothetical protein
MTSHNASGRSCDYRCLSHRSTTTDQRADCMENPVALNRIAENSVFRKGDV
FVLFGELFGRGYATGLLDEARRAGMEIVGITVGRRDENNALRPLNAEELS
AAEERLGGRIINIPLMAGFDLDAPTGGPTPTDLLATMTLESWELERLDWG
YIEQCRDIATSRFTNALSQVMAVLDGMIAEGRNVFFAHTMAGGIPKAKVF
LAVANRIYKGRGPRHMSSQALLDSEMGKLILQNFDEVTAITFRHLIDFSA
GIRERVEASGAQVRYTAYGYHGTAILIDGSYRWQTYTNYTQGYAKMRLEC
IAQEAWAAGVKATVYNCPEIRTNSSDVFTGIELPLIPLLLALRKENGGQW
AEDQWQACQELLADGFTMKDVFQKIADMQVNEVMRPFYDFSAWPMANSQA
QADLTIGTSNEITQMHRDSKAMISDLLSALVVEATGQLIFGASSDPSGPI
QWLNHDIVARRLNASHLQRKAPAPLLAQAAKDSQLELA
>SMa1819 Conserved hypothetical protein
MPTRRGFLGAASALALPNLFSPARAADPVSTSGDKPMSADLILHHGLVTT
LDRTNPNATAIAIRDSKFLAVGDDRDIMALAGPETKVIDLKGKRVLPGLI
DNHTHVVRGGLNYNIELRWDGVRSLADAMDMLKRQVAITPPPQWVRAVGG
FTEHQFVEKRLPTIDEINAVAPDTPVFLLHLYDRALLNGAALRAVGYTKD
TPNPPGGEIIRDASGNPTGMLLAKPNAAILYSTLAKGPKLPFEYQVNSTR
HFMRELNRLGVTGVIDAGGGYQNYPDDYAVIQKLADDGQMTVRLAYNLFT
QKPKEEKQDFLNWTSSVKYKQGDDYFRHNGAGEMLVFSAADFEDFRQPRP
DMPPEMEGDLEEVVRVLAENRWPWRMHATYDETISRALDVFEKVNQDMPL
EGLNWFFDHAETISERSIDRIAALGGGIATQHRMAYQGEYFVERYGHGAA
EATPPIAKMLEKGVHVSAGTDATRVASYNPWVSLSWMITGKTLGGMQLYP
RANCLDRETALRMWTENVTWFSNEEGKKGRIEKGQFADLIVPDKDFFACP
EDEISFITSELTMVGGKIVYGTGTFADFNENDVPPAMPDWSPVRMFGGYA
AWGEPEGAGKRSLRRTAMATCGCASNCNVHGHDHAGAWTSKLPISDVKGF
FGALGCSCWAV
>SMa1347 putative
MRRPMSFIFSTHRLHPDAEAMLKAASDLRVASAPDPETLLREGEGAEIVI
VRAPIPPAFFGNVPALRAVVRHGAGLDMIPYDAATAAGVLIANVPAVNAP
TVAEHVFMVTLALLRQFRPMDRDLRNMGWSTGRAHSDRALDLAGRIMGVI
GMGNVGKAVFRIAKYGFQLEIVANSRSPESLPDGVRFLSVDDLLSTADIV
VLCCPLTPKTTGLLSRERIARMKPGAILVNVSRGPVVDDAALIEALERGR
IGGAALDVFSTQPLPPEHPYFRQDNVIVTPHLAGISEESMMRMGKGAAAE
AIRVMEGGLPVNLRNPEVVEHYRRRFPG
>SMa1755 Putative ABC transporter, periplasmic solute-binding protein
MKTIFKSGVLVAAVAVLSASLAAPAFARDLTVVSWGGNYQDAQRDIYFKP
FAEKSGKPVLDETWDGGIGVIQSKVKAGAPNWDAVQVEAEELALGCADGL
YETIDWDKMGGKGKFLESAVNDCGVGAIVWSTAIAYDGDKLKEGPVSWAD
FWNVEKFPGKRSLRKGPKYTLEFALLADGVSKDELYDVLGTDEGVERAFK
KLGELKPHIVWWESGAQPLQFLASGEVAMTSAYNGRITGINRTEGKNFKV
VWPGSIYAVDSWVVLKDAENKDAAQDFIAFASLPEHQAKLPEFVAYGLPN
KEAAARVPPEFAKDLPTDPVNMKEAISLNVDFWIDNAETLTQRFNAWLAQ
>SMa0493 putative ABC transporter, permease
MEALNGWWDDYLLASVTVAKVFVCSLILMVIFGLLGASAKLSSNRLANAV
GNAYTVFFRGTPEILVILLLYFGSAVSLTTIARVFDPSVAFVDIPPFWAG
SIAIALVVGSYATETFRGAFNGVKSGSIEAARALGMNGLQTFFYIRIPEM
WRIALPPFGNHMLSLIKDTALISIIGLNETLFVAKQAASTTGKPFTMYIV
VGLIYLGFSTAITISVLLLEALANRHIQRRPS
>SMa1995 Putative ABC transporter, permease protein
MSALVNSTGFRRISKVPPSYYLLAMLLVILFVARPQMLNANVLGVFVRQV
VPLGILVLGQLLVMRVKSIDLSGGGVILLINYCISSGIFPGASLGFYVAL
ALTTGLVIGLFNGVMVAKRRVSAVIVTLALSIVLVGFVQYLSSGKPPGDV
PKLFADLFNTRFAGLPSPVILWIAVTALMALLLSQTVFGRFVAAVGESMP
AAHFSGVPVERTVILAHTLAGLMAAIAALVQTASIAVGSVRVGLDLPVLS
VAATILGGVVFGRGEGGVWGPFFGVLCFAFLFVAMTTFGVGDAGKLVAQG
MIILLAAIFYGLRAGK
>SMa0637 hypothetical protein
MNARVSFAREPLYRSQVAGAASWGVSALEDQQLAGLIMWVPAGMIYVAAA
LAIAAALIGGSNTSFAGESNDDAGARGGKLPCVTAAFKTARITLSVRRGS
SGQCQIHASCPTRVRRLSRSLRAKRHGRR
>SMa0247 hypothetical protein
MSGGFLVSFEQALSAESIQPADASSAMLVGRVWSKTAGGPCPVLISEGEV
FDLTPLAATISALLEIDGLVDALRDPSRFASLGSLDAFLRGEAGDLLAPA
DLQAVKAAGVTFADSMLERVIEEQAKGDPLRAQEIRGRLAPVLGDNLKGL
VAGSDKAAEVKKLLQELGLWSQYLEVGIGPDAEIFTKAQPMSSVGCGAYI
GIHPKSDWNNPEPEVVLAVTSKGKIVGATLGNDVNLRDFEGRSALLLSKA
KDNNASCSIGPFIRLFDGAFTIEDVKQAEVSLVVDGKEGFKMTGISPMSA
ISRSPEDLVSQLLNDNHQYPDGVVFFLGTMFAPVKDRRGTGLGFTHEIGD
RVEISTPRLGRLVNWVDHSDRCPKWSFGLGALMKNLAERGLLQAKREG
>SMa1629 putative
MIEFLNLRGKRALITAGTKGAGAATVSLFLELGAQVLTTARARPEGLPEE
LFVEADLTTKEGCAIVAEATRQRLGGVDVIVHMLGGSSAAGGGFSALSDD
DWYNELSLNLFAAVRLDRQLVPDMVARGSGVVVHVTSIQRVLPLPESTTA
YAAAKAALSTYSKAMSKEVSPKGVRVVRVSPGWIETEASVRLAERLAKQA
GTDLEGGKKIIMDGLGGIPLGRPAKPEEVANLIAFLASDRAASITGAEYT
IDGGTVPTA
>SMa0669 conserved hypothetical protein
MLEFLICSLLTIFPDYLVRRYVQGKRVGREINLYSMWFELRYGITACLIL
TISLITMIFYFHPSTSNVTAVFRTVTIMPESTGRVAEVYVDLNEKVSAGA
PLFRLDDSEQRAALETARRRVAEIEAEAIVAQSELASADGLIAQAEGDYL
QALNELETQVELNQRNPNIVARREIERLQVAVDGRDGARAASISNKRTLE
TKIASLLPAQKASAEAALTQAQVELDKTIVRAGTAGVVQQFALRAGDIVN
PMIRSAGILVPEQRRIGLIAGFGQIEAQVMKAGMIAEATCIGKPFTIIPM
VVTEVQDVIAAGQIRPTDQLVDAQQLASPGTLTAYLEPLYEGQFSGVPPG
SSCIANAYTSNHDALHAPGISTPHWLFLHVVDAVGLVHAMILRLQALLLP
VQTLVFKGH
>SMa0112 hypothetical protein
MQAGGVSMTNAQCACGALRLMLNEPPQLTALCHCLACQRRTGAPFSANAF
YSIDCVEISGVSTEYIRTAESGRKVRMHFCPTCGSTLFWKADASPSWIGV
AVGSFADPAFAPPAMSVFERSKHKWVQLEGTVEHFQDLPIGQY
>SMa0026 hypothetical protein
MEQAFPNIVGGDEKEIAMSNLAPTRPSACVLAAFCLLIALAPNPASAQNT
NERADCAAAAGKAEPATPPDRASADGTAPGNSGSTGWTGGTGGAHIGTNP
QGATAGSTTWQPPTARGIDLATAPEIPAPETAAPGTQTPTDC
>SMa0943 putative arylsulfatase
MSDKHANSRRHPSRREVLLAGGSLLAVTALGSLTGAGAKAQTAGKKPNIL
VIFGDDIGWWNTSAYNRGQMGYQTPNIDRIADEGAIFTDLYAQQSCTAGR
AAFITGQSCFRTGLLKVGLPGAKEGLSDKDPTLAELLKPHGYATGQFGKN
HLGDRNEFLPTVHGFDEFFGNLYHLNAEEEPENPDYPKDPQFRAKFGPRG
VLRCVASDTDDPTEDPRFGRVGKQKIEDTGPLDKRRMETVDGEFLGAAMN
FIDKNQKAGKPFFCWFNATRMHIYTHLKPESQGKTGLGLVADGMTEFDGM
VGQLLKQLDDLGIADNTIVLFTTDNGAEVFSWPDGGSTPFHGEKNTNWEG
GYRVPGVMRWPGVIKPGTEINDIVSHEDWVPTFVAAAGEPDVKQKLLTGY
EAAGKTFKVHLDGYNQRELLAGSGPSSRKEYFYWTDDGNLAALRYDRWKL
VFMEQRAEGLDVWQDPLITLRFPKLIDLRADPFEIAQHVAGDYDRWRVEH
AFALVPAQAFVVRHLQTYVDFPPRQTPGSFALNEVMAKLQEGGRH
>SMa1556 putative methyl-accepting chemotaxis protein
MSRSPPATRRKGPSSMSPRPANSLYGALGLAMLLFVAASALLVGSSFIAL
ERVRADLAATSSLGKERLAYQMIYTASHLERAEGPARAAATDELRRLMAR
NERLLASLANSEEGIGLAAANDPAALTQLEQARQQWLDEVRPGLESVMAS
APLSRGALDELDPEIRAFAARLDGLISRIEQAGVTRLQRSQLLQFGFSAL
ALLLLLHVLRVVRRLARRTRALAVLAEKVSAGDLAQKAFLEGSDELAVLG
DSFNAMTARLAGMIDNERGSRERLEKLLATISETAQHLSSSAAEILAGTT
QQVEGMREQSSAVAQTVTSVDEVLQTSEQAAQRAQHVAASYDNAVKISNE
GRRALDDTVQVMNAVSARTETIAADILSLAENSLEIGEIVSVVAEIADQT
NLLALNAAIEASRAGEHGRGFNVVASEIRTLADQSKSATTRVRRILMEIQ
KSTNSAVIGAEDGSKSVSRALETVSEAGETIRQLEAIVADSARSVAQIAA
SAGQQRAGMKQIHEAMHYIEQTSSQNLSAIRQAEEAAKDLNELGSRLKEM
LTDHGNEHDNT
>SMa1172 Hypothetical protein
MRTAEGKLHLFVAIDRTSKFAYAELHEKAGNMAAAKFLRNLVAAVPYAIH
TCSPTTESSSPTASTSSTASATTTASSTGSPK
>SMa0907 conserved hypothetical protein
MKRCPCTFGRSKGRIEVRNFAIGDQTLDESSPEFRALLPHAYEQKLRPMC
MCKEPPVPMYIARLDDQYLIKRMPLSGRDHDPACPSYEPPYELSGLGPLI
GNAIQIGASGKADLKLDFSLTKRGPRSAPGTSAEAAEPGIRSEPKKLSLR
AMLHYLWEAGELTEWRSTWTGRRGWGRVRTSLINAASQMTARGGPLSDMI
FVPEVFHPDDKDGIAARRAAALKSIQTTGSGTRKLMMIVAEVKEFAGARE
GHRIVVRHLPFPLMIEDGAWRRLAGRYDTELELWRSSEALHLIVIATFGV
STAGIASVDEVALMVVNEHWLPFEDTHELRLLEKLSHLKRKSVKGLRFNL
PRDAPIVSVTLPEQKPVPVAMFIVPASAGDEYEQALTNMIDARPEIAPWI
WRVAEGEMPRLP
>SMa2029 Hypothetical protein
MVVNHTTALAKLARAFNVPTILTSVIAARGGLLFKQITDVFPDQEVIDRT
WVNTWQDENVVNAVKATGRKQLIIAGLWTEVCVAMPVIQAAGEGWDVTVI
TDASGGISKESHEVAIQRMAAAGANVMTVMALAGEWQRDWARTEHVEELT
EILIQHFGGSGIAYLWEQQLLNTPVPSEG
>SMa1501 Hypothetical protein
MNDMSPTKCQDQVVLDLWHPLAALEEMPARTVQDTVLLEERISYVSDGEG
KAAAWHSRPELPAGSRVDIDTLDGGLPVKMAYGYIWTSLGTPPAELFAIP
EYAESDRRRLNAASIGVNVSAPRAIENFLDMGHFPYVHTDILGAEPHTEV
KEYDVELSVERDEIVATRCRFFQPMASTASTGGADVEYIYRVPHPYCSVL
YKSSPVDESRLDVIAVFLQPVDQEHVRAHMMLCVLDEENEDKVIKRFQQT
IFGQDKPILENQFPKRLPLDPRAETPIRADKSAIAYRRWLSQKGVTYGVI
PAAT
>SMa1011 putative nucleotide binding protein
MVALDMIAIMSDQILVYLQTKAVAKKSLEKNEHPFHRDDPVHALFLVTEG
CVNLLRYQEDGSPAVLQRSGGRSILAEASVFSDHYHCDAVAATQTSILVV
PVGVVRALLNNEPAFAAAWARHLSFELQSARKRAEIVSRAR
>SMa0754 hypothetical protein
MRVPRSSIWIERHKRAAGFWLAEDYFTGTKCIGVILRAGGEVLRNRDCAG
QGHSARQPRDYRSGEEVTSARQADWEKLRRMPADDTLVAPGWWFRSHQVW
YLQARGRW
>SMa1462 Putative ABC transport protein
MNFAPKLAKTLLSAAAIALVTTTAWAATITVGGKNFTEQLIIAEITKQLL
ESKGHTVDKKDGMGTKIVRAALENGEVDLYWEYTGTSLITFNKVTERLSP
EETYNRVKELDGEKGLVWLAPSAANNTYAYVVKPGNAKTEGMETISDLAK
AYNDGKDILMGTTAEFPKRPDGLIGLEKVYGFETGRANVRPMDLGLAYNA
LANGDLDTIAAQATDGQIAALGLKTLKDDKGFFPNYALTPVVRKEVLDAN
PDLKETLETVSKKLDDATMQRLNSQVDVEKKTIETVAADYLKSLGM
>SMa1811 Hypothetical protein
MTTRRSVLKGTLSLMLAPTAMTTLAPPGAAAKAQQVKIQAPGYYRMMLGD
FEITALSDGTAKFPAETLYAGAKDQVAALLAKAFLDSPVELSVNAFLVNT
NERLVLIDAGAGGFFGPALGKLVPNLVAAGYQPEQIDDIILTHAHVDHLG
GLVAGEKIVFLNATVHLNQRDADFWLSSANRDAAPEAKKEFFSMAVQALS
PYRDSGRLKTFADEAEPVPGFKTVLRAGHTPGHSAVAMESKGQKLVFWGD
ITHGDVVQFEEPDVTIGFDENPAAAASARDAAFAEAVKEGYLIAGAHTRF
PGIGHVGTDSDKFDWVPLNYRATL
>SMa0657 cytochrome c binding protein, probable amino terminus
MTLGGFVGGVIFWGAFNTALELTNTEEFCVSCHEMRANVYEELRRTVHFS
NRSGPGVLSRLPRAA
>SMa1617 putative integrase/recombinase
MDANSRNRWFLDPGPLSSWIDQFADDLAAQRYTPLTIEGYTASARHFAAW
LGCAGISIDLIDDDVVRRFAEHRCRCPGRRQWLRISPKYSRRARRFVVFL
QKEGVARPPLKVASPYPLLDDYQSWLRVHRGLAERTIARHLRHLHKLLPE
LGTPTLDYDAALIRNVVREWRERTGPADLRTITSALRSYLRFLAGVGLCR
PNLDHAIPPVLQWRLSSLPRYLAAADVERVIASCDQLTRGRLRDRAILLL
LARLGLRAGDVAGLRLSDIEWTSGMLRLSGKARRQVRLPLPQDVGDALLA
YIEQERPRMHQEAVFLTMIAPYRSFAQSSHVSTIVALALKRAGISDPPST
GACLLRHSAATSMLRSGATLEAVGTVLRHRSLDMTAHYAKVDAAMLEQVA
QPWPGELPC
>SMa0888 hypothetical protein
MTHVRFENKYRTYCTISSRKRVAVKEIMSAAVATSNPVLHELRGKIASLE
GAGSRNRSILPFGVPELDAQLPGGGLAFGALHEVAGGGNGSIDGAAAALF
IGGIAARTTGKVVWCLTRFDLFFPALAKVGLHPDRVIFVECYKEETVLAS
FEEALRYGGLGAVVAELVRLPMTASRRLQLAAEKSGTLGLVIRRWRRQTE
ASDFGMPTAAATRWRISVLPSEPLPVPGVGRARWLAELMRVRAGEGGEFI
IGACDGQGRICLSSETANGPDQAGRSFAIGRKAAGGHRQERFEAMDFGS
>SMa2221 hypothetical protein
MSHSRTTVHGTIVGVIVLDTGFRRLAGDIAHAATWPFPVQFRIVHGVRPR
DVIEGDPRHSLDAFRAAIDDLVALGCTAITTSCGFLAALQGELTLHSPVP
FLSSALLQIPMIEHILPAGKKPGLILSDPHSITERHLHAVGAAPGLPMAA
LPVDGPLLRNMREQATEVDAAAQEADVMATVAELMAHHSEVGALVFECAN
LPPYSAAVSRRFGLPVFDIVTLVRWMQLSLAPPDYFK
>SMa1174 Hypothetical protein
MGQVLHGSATTTEAIRRAIQNSQESLRTLAKRYGINQKTVAKWKKRPSVA
DLPTGPREPRSTMLSLEEEAIIVAFRKHTLLPLDDCLYALQPTIPHLKRS
SLHRCLQRHGISRLPEVEGDKPGKKEVQALPDRLFPHRHRRSAHGRRQAP
PLRSHRPDLQVRLCGTA
>SMa0405 hypothetical protein
MGPHFGLTPKKCQSSESDHSGRISKIGDGSVRTALYEAANVILTRPVKGS
DLKGWALAVARRAGPRKARVALARKLAVVLHCMLRDRTNFIAHKGAPALA
A
>SMa0064 conserved hypothetical protein
METYIHEVVAPKLVGRDPLEIDRISKDLTGYLGFRSTGAEMRGNSAVDIG
LWDLFGKATNLPIAQLLGGFSRREIRTYNTCAGNTYMRDAKGQQTANYGI
GGPRRDYDDLNGFLERADELAEDLLSEGITAMKIWPFDIAAEKSGGQYIS
GPDLRKALEPFEKIRKRVGDRIDIMVEFHSMWQLTPAIQIARALEPYATF
WHEDPIKMDSLSSLKRYAAASRAPLCASETLATRWAFRDLMETDAAGVVM
LDLSWCGGISEAKKISTMAEAWHLPVAPHDCTGPVVLAASTHLSLNAPNA
LVQESVRAYYKTWYADLTTQLPTVTNGMITIPPGAGHGVDLAPDLDRKFE
VSRRSSQIED
>SMa2153 hypothetical protein
MRSPWPLGHLGVRVALPRPLLWPQPPHCFCLLGSALGHPVEGHPQGPARE
QARASPLMDRRRGLLQDVPPNGIQQVE
>SMa0785 putative site-specific recombinase
MSTDNAMKISADHLRRDAFLYVRQSSLRQVFENTESTKRQYALRDRAVAL
GWPIERVHVIDNDLGLSGAQSQDRDGFQRLVTEVAMGHAGIVLGLEVSRL
ARNNADWHRLLELAAMSRTLIMDEDGVYDAASFNDRMLLGLKGTMSEAEL
HILKSRLQGGILNKARRGELELPLPIGLVYTPDMRVVLDPDRQIQDTVRM
LFDTFREVGSACAVVRRLRSEKILFPRRIRRGIGKGDVLWSEIDHSRVIQ
ILHNPRYAGAFAYGRTRTIYNAKLKSVQQKMPRSDWQVLIPQAHEGYISW
DEFERNQTSLEQNAVGFSPGLRGRMPRQGNGLLQGRVLCGRCGARMRVHY
EQFEGNLRPYYICNEAVVRHAGKACQWARGPAIDEAVSALLLEAMAPTAI
EVALAVQEEISQRVEQAASLRDKQLQRARYEAELARRRYLKVDPDNRLVA
DALEADWNGKLRDLDTLQREHERRNETDQSLLDGAMQERIRALAADFPGI
WNNERTSPVERKRMLGLLIEDVTLLVDEQINMHIRWRGGRTQSLAVARPR
PMAVIRKTPEAVVALINELLETDNDQQIASRLNALGHRNWRGEAFTLKKV
MLVRRAYGLKTRFERLRESGMLTGEEVARRFGVSATTVHQLGRDGVLKRH
RYATNHRYLYEPPGNVRLAKGVGGRYGSRKPRLIDAQPIQQGAS
>SMa0142 possible protease
MLTKLKLALIAPIAAIALVSGDILAPGNMIVREVIGTGLALADDDDDDGG
SGGNRGSGGSGRSGSYGAGAGWSGGKSLFPFREFLPRRSIPRRSRAAAPA
APIRAPDEIVGLGFSPTELGELTATGFEVLERNTMTSFNAEVIKLRIPRR
LTLEAARQRARAAAPQAVIDFNHYFRPEQHPDAPCVTSDCLARDVIGWPS
AQTGLSNCAAGVRIGLVDTAINPDHLAFEARNIEIVRLVEEELPESGRQH
GTAVAALLVGSASSRTPGLIPGGKLIAVDAFHRGERQDDRSAAFDLARAL
DLLTRRQVQVINLSLAGPPNLLLEQAVMKAGERGIIMVAAAGNDGPKAEP
VYPAAYEEVIAVTATDRRKRPYRRAGRGEHIDFAAPGVAVWTAASVSGAR
PKTGTSFAAPFVTAAVAMMKASEPDLAPEMIHSRLSGHAEDLGDPGKDAV
FGWGLLNARAICKTKS
>SMa0162 hypothetical protein
MIDREKFDEISTESPFADAGGLTPMPVHRATTKQKNAKREPLDLVMVPVF
LHDTRHNVALS
>SMa1626 hypothetical protein
MSDFTWYIESLALRSLPATQAQSPAAADPVFNRRLREALAAPAEAAAVID
FWREAGLTRWFAKDAEFDRAFRDRFLTAYEAARRGELFKWTASPEKALAL
LILLDQFPRNAFRGTPRMYDTDPLALGVARAAVDAGNDLKGPPDLQLFFY
LPFGHSEKLADQERSVELAGRLGEPSLSNAKRHHDIIHRFGRFPHRNVIL
GRMMTEEEQRFLDEGGFAG
>SMa1653 conserved hypothetical protein
MEFDFAELPEKDRYRLLCAFVGPRPIALVTTIDEQGCKNAAPMSFFNVFS
HDPPLLILGMQTRPDGNSKDTVANIRRSGEFVVHMVDMAIAKEMIITGIN
FPSDVDEIQVSGLTSVSSVKVAPPRIQESPCAMECRVSQILNYGRRSIVI
GEVLQMYVRDECLDASGRYVLPEVYQPIARLHANNYIVADNQFVLTKPDE
FAHHDNAAGYGGSVHEAKGGSTIRIASAADGQKLDETVEQP
>SMa0636 conserved hypothetical protein
MTTKVVKLSPDDSVRQAAKLMFDHHVSGVPVVDDDGHLLGVISEGDLIRR
AELCSEASVLMADMAIDPDDRANAFIRRCSWRVGDVMTANPVTIEEEAPL
ARVAGLMQERGIKRIPVVRDGELVGIVSRADLLQAIFSTKPDETAAGDEA
IRRSILVRLGENTSLEELDVTVTVTEGIVHFWGQVETAGCRRAARIMAES
VHGVRGIVEHFPDPYTQ
>SMa0659 cytochrome c binding protein, probable carboxyl terminus
MQASKEVWGKIFGSINTREKFLDHRLELAKHEWARLKANDSLECRNCHSS
AAMDLSKQTQRAAEIHTRYLLPGKVTCIDCHKGIAHELPNMQGVEPGWKL
PPELEGETLPSASATDELKKVMNDSHTVAFGN
>SMa1791 Hypothetical protein
MDVGLPCISSPPGCGCSRAAATAGRNVPRRLSTRVAARGCCRPIQTLPSA
VRTTEPDDDVHLELHEENHSACWHMINVLPNVIRKVVIDLENIRLDLRRS
FHNTRPVEPVAPRSAV
>SMa1076 Hypothetical protein
MLSKIERGQMFPTLPTLLRIAMVFGVGLDHFFNADKEEPLIAIVRKEQRL
KLPSPPGEKHPAFLFESLDYPASDRRMEAFYAEFPVDSPPSEPHQHGSAE
FIYVLSGRLIVNVNGKECALDTGDAVYFDSSVPHSYRREGGEISTAIVVT
SS
>SMa1741 putative iron uptake ABC transporter, ATP-binding protein
MTDHLLVASGLTAGYDKTEILHALDLTIPPRKITVIVGANACGKSTFLRT
LSRLIAPSKGQVLLDGKSIHRTPSRDLARTLGLLPQSPIAPEGITVVDLV
SRGRHPHQSLFSGWTRRDDEAVDSALRATKTFDLADRPIDELSGGQRQRV
WIAMALAQQTDILLLDEPTTFLDINHQIEVLDLLTDLNSARRTTVVMVLH
DLNLAARYADHLVAIAGGRVHISGTPEEVLTEETVRHVFGLDSRVISDPT
SGRPIMLPIGRHRTAVIDDMGDAPQKERSA
>SMa0790 hypothetical protein
MVRKPAHVSSGHVKRRFSCVQRMVFRRRHNPKDVPLGNTHHIMLHKRYGL
PNECNQKKGSASPLTANEQI
>SMa2245 conserved hypothetical protein
MVDFRKRLGSTEAKKVVDPVALYETLDRATDKGPLRPAQEAVLGDWFKNY
GGDVSGGSKRDVIIKLHTGQGKTLIGLLILQSRLNDNRRPCVYLCPDNFL
IEQTCEQASQFGIKVSTVEDDLPDDFLAGKSILVTSVQKLFNGLTKFGLH
RQSIEIDTILMDDAHACSDRIRDACKIKIPKDEPAYHALFKLFSTELELQ
GVGTFADLENGKRDALLPVPYWAWMAKEGEVAAILSAAADKKSIKFTWPL
LKDRLRLCQCIFSGAALEIEPHIAPLEDFGSYARAKHRIFMSATVTDDSF
LVKGLQLSPDTISNPLTYAKETWSGEKMVLIPSMMHEDLDRAKIVAWLAP
VNPKLKFGIVALVPSFARNKDWGAYGAKTVDKDSVSEAVSDLKKGQYGTP
LVLANRYDGIDLPDNTCRVLVFDSRPFSENLTDLYQEHCRPESEATLMRT
IRSIEQGMGRSVRGEKDYSVVVAIGADLVRTLRDVSSRRYLSSQMATQIE
IGLEIAEMAREEIAAGKEPLAALVGLINQCLKRDDGWKDFYADQMKKVAP
KGANKEILELYSRELAAEQAYAAGDYNRAEQTIQKLLDDGLAHPDDRGWY
LQERARYLHDGNRVEAQKLQVAAHRNNKLLLKPPTGVTVTKLTIVSQGRT
ERIANWVNKFESYADLDATVSDILGRLVFGTKAEKFESALNELAFALGFA
GERPDAEWKEGPDNLWALNDIQYLLFECKSEVDTTRSEIHKRETEQMNRS
AAWFDKHYLGMKVKRLIVHPANKIQSAAAFTHEVDGMLDSNLKAFVRSAR
AFFKSFENQNLKDLSVLHIQGLIDAHHLSVDNLINRYCSKLKNVK
>SMa0329 putative
MSKRFDGKVAIVTGGGSGIGAAIANRLLEEGASVMMSGRTEKRLSDVASK
MPADRSGIFVANVSSRPDCDALVAATVERFGRIDTVVNAAGMNFVGTIQE
TSDQDWDECIASDLSGVFYMSRAAVPHLKETKGSIVNIGSVSSLGGGWSH
AAYNAAKGGVANLTRSAACDLGKFGVRANTVAPGLTVTGMVEAIMDDDAL
LEKAWDRIPLRRAGQPASAVAFLASDEAAWITGIVLPVDGGQTCTDGGPE
WGK
>SMa0082 putative ABC transporter, periplasmic solute-binding protein
MRREGLMNISRRIAISVLGAAIVAGLVAPAAAQTVETIKSAGTVKVGMLV
DFPPFGIMDANNQPDGYDADVAKLLAKELGVEVTIVPVTGPNRIPYLQSN
QVDLLVASLGITEERAKNVDFSQPYAGISIGVFGAADLAVTKPEDLAGKT
IGVARASTQDTATTKIAPQDANIQRFDDDASAVQALLSGQVELIGVSNVV
AAQIEAAAPGRFNQKLQLSQQVQGIAVRKGSSEMLKFVNGFLDKVKADGQ
LNAIHEKWLGSPLPEFVTEAK
>SMa1737 conserved hypothetical protein
MSDRTEQVATSDIPRPTQNLRTRVVSYFPGLAVAVLIAISAQFLSEHYGA
PATLMALLLGMSLNFLSESGARTVPGIHFASRAVLRFGVALLGARVSLEV
LSDLGVSLLCLVTTALACTILFAIIVGKFAGMDWRLSLLTGGAVAICGAS
AAVALNAVLPPRQNSDRDLALTIVAITLLSTSAMVLYPVLASHLQFDAKE
SGVFIGGTIHDVAQVVGAGFAMSEETGQIATLVKIVRVSLLAPTIIAVLI
MVTVLGAGAGQKPQKLGQVIPGFVLGFAFLAALKSMGFLPAAAGDVANDL
SRWLLLIALGAVGLKTSVKEFASIRPSHVTLALLATAFLAAFIVVGLLWY
RG
>SMa0346 conserved hypothetical protein
MRRNLLLVATQETTVTKTLILLFHPDLKRSKANAALAGAAAKLDGVEVAD
MQAAYPDSMDMFRDGEREARRLLAADRIVLQFPIQWYSTPPLMKAWQDGV
LTRMFYVTYETEGRALEGTPLMLAATAGNVPESYRPGGRNMFTMEALLAP
LRATAHRCGLSCTAPFIIYQADKLEAEELEAAASNYAATLKNWIAGPLVT
RQEAV
>SMa0197 putative ABC transporter, permease
MVTGGAGTLGLSQAALTFAAFSVIVGIGQMFVITLGPGNIDLSVPATMTL
AGTVALKLMNVENGMVLPGLLLAVFIGLGVGLCNYTLIKALRIPPIIATL
SMSFVVQSGAIWTNRGLRIKPPSVLAEFTTTNTLGVPNVAIVAVLISVLA
WILLEKTIYGRWISAIGQSMPAARMAGIPVDGTRFVTYLFCAVLASISGY
LLACFSGGAALNMGAEYLLMSIAVVVIGGTAVAGGDSNVPGIWGASLFMF
LVVSMLNTYGLGAGIRLIMTGLIIISVIMLAGGRRGMR
>SMa0389 putative
MAEAGAHVAVTARTVEGLAETRALIEKTGRRAVALAQDVRDVEACASVTR
AAAEGLGGLDILVNNAGFENVRPSFDVDEALWDTIVSTNLKGAFFCAQAA
GRIMADANGGAIVNLCSLTSYVGIPTAVPYGASKSGLLGVTRALATEWAA
HNIRVNAIAPGYFRTAMTAGFYEDEDWQSRMLEKIPQRRFGKESDIGGVA
VFLCSDAAAYITGHCIPADGGYLASI
>SMa0495 putative ABC transporter, periplasmic solute-binding protein
MNAMKNWQTSLTGLITAVLLSAAPANADTLRVGMECTYAPFNYRTSDGKL
EGYDVDVAKGISEIIGVDFEYVCQEWDGMIPALLANKFDLIIASMSITDK
RKEQIDFSSPYRNSVGRIVGPVGKDLKLFDDKGQPVVGNFDGLRIGVERA
STYFEWFSAKLPKADLVLYDSNEAMYLDLKNGRVDVIMTNPMKAHLSFLS
GEGKGKYEFIGPEVNEPKFFGPGVGVGLRKGNDELRDKISAAIRKLIREG
KLKEYALKIFPFQIHDDAWAEE
>SMa1646 putative ABC transporter, ATP-binding protein
MRAQPATNDSNRRQQRFRGLPNRQPRKGRRMTHAPVLKVENLQTRFKSVQ
RGKYVHAVDDVSIELYPGEIVGLVGESGCGKSTLGRTIVGLEKATSGRVL
LDGVDLSSLSGAALRRSRRALQYVFQDPYSSLNDRQTVGETIDEALLIDG
TYSLDERSRRTKELLDQVGLANAVKDRHTRELSGGQRQRVAIARSLAVNP
RVLICDEPVSALDLSIRAQVMNLFLRLQKDLGVACLFIAHDLALVRQAAS
RVYVMYLGKIVEHGPSQQLYDRPSHPYTQMLLASVPEVDPRVEKLRSGPL
LMGEIPSPINPPSGCRFRTRCPLAVDECAGKAPPSHRLSPDHNAACVFAP
ELYGGKRSALVQQSALAASV
>SMa1784 Hypothetical protein
MARVSPFAKAFAGRWRIVEMDNWDNDVLDLVEEAHLTFQGAADGEIAFVA
LKGFLDVRYGARDGSACAEFSWEGQDESDPVCGRGWAALGSAGRLVGHIY
VHNGDDSGFVCERD
>SMa1937 putative transmembrane transport protein
MIVSQFTSGGSAWPRISLVVGAGVVSAFQVGKAPAALAAVQGDLALSLAA
ASWLISAFAILGALAGAPIGLAVDRIGARTMASLGLLLQAAGSALGALSP
GFTALLATRVIEGLGFLCLVVAAPALIAGLAPIQIRDRAMALWASFMPVG
LTVIMLAAPLLSIVTWRGFWFLNASILVSYAMLLRWGLHPPPNHPRPYRK
IHQDIGEALVSPGPWVLGGLFTAFSAIFFAVFGLLPPLLSQRLGISNETA
SMLSALAIAASGVGNLVCGQLLARGFQPARLLNFSFGIMALCGIGIFSHT
LWAIASYALCVVFSFAGGLIPVVIFDSAPRQAPGAELVGVTIGFAMQGNN
LGLIIGPAAASGLAGAFGWPMVSVAVVGIAFVAALLVLPFNRRQLTEAAS
GQSPLTRAGGRF
>SMa0191 hypothetical protein
MTADKISPDMLRFERILAAPAATVWQYLVDPELRARWFMSGPTDLQVGGA
FGLTMDHDRLSDEVVPTPDRYKPYVGHRWHERITRYEPPHLLAFTWEDGK
AGEVTFALTEIDSGTTRLVLTHTGLRGSEDALNFGGGWHAHLAVLEKRIA
GISIPDFWALHAVAEREMEIALVSEA
>SMa0337 hypothetical protein
MSDHHEASYEALKAVVRRNTLEVQSGGNFELFDELFADDFLDHTPQPGGT
PDKEGARRLYHALREAFPDFHAEIHWQAVDGDIVTTFKTYHGTHQGVVLG
SRQRAARSSSKPSTPCAFATARSSNTGGSPTSTIFCSNSTRCPPRPPKCR
NIKDHHDEHQ
>SMa0299 putative ABC transporter permease
MSVITTIPARRFSRFNAERWRATPGPFKVGAVILVAHALFAILGVFWTPY
GFAEMGAGLPLSGASWQHPMGLDQIGRDVFSRFMHGAHIVLMLSFAGTML
GMVAGTTLGLLSGYIGGWFDEVTQRIVEAMISIPFLALALVMIIAAGPAL
AGKPVLIVFVVGLVYAPRIVRIARAAAMDIATREYVAVAELRGEGTWSIV
FREILPNAANVLLVEFSLRLSYAPILIGALGFLGFGIRPPTPEWGLMISE
NRNLLIASPVTVFGPGLGLASLVIGFNLFTDGLSRMLGQRPVAGA
>SMa0797 hypothetical protein
MFYEIRTYRLRNGAIPTYLKVVEEEGIAIQRKHLGELVGYFFSEIGPINE
IVHIWAYPSLDERERRRAALMNDAAWRDFLPKVRDLIEVAENKIMKAARF
SPTGAVAS
>SMa0189 hypothetical protein
MRTDLVNRPQDGSAGRSALLVASAVNAVAAIYHIIGGTPEVMYPVYSANL
PPSSAGVLDILWYQMAALIVGSAVATLVAAFRSDWRWPVAWIIGGHFLVV
SGICLFFTFVWFGNPWGLIQWAIFGPVGLIIFWAAARPAERAGAPTL
>SMa1151 Conserved hypothetical protein
MFVKEMSRHECNSVIQAGHVARLACCKEGMPYIVPINYAFTGQCLYGFSM
PGQKTDWMRENPHVCLEIEEISGERQWKSVLVFGRYQELPPEGQWHNECM
HAWSLLQSRPNWWEPGGLKPGKPEIAAASPHVFFCVDIDEITGRAAFEGD
E
>SMa1323 Hypothetical protein
MTSALNVHWQSQLFAAYALQEVLDKPIDGETPQSRLKQLGMMTVLYMMHQ
SHEKLTLSNIVKITGLTRNAVAESVDPLVERGILTETIVKNSMGRGTARQ
FEFCPEIFDRLRSGSEERKRGA
>SMa1192 hypothetical protein
MVERGEGEDAGTCRLRVKIGNRVWTAILNSEDRVITEEMTIYRASSRSEI
TLASDAS
>SMa1091 hypothetical protein
MTYAHTAQAIADVATSADVLFDYLDDQASLGSHMQKPSMMMLGGRMSYEF
DEARGRTVGSVIRMRGNILGLVLSVEEVITERQPPRRKVWETRGRSNLLA
IGAYRMGFEIIALGRAASRLRVFIDYDYPAAIAAKFLGPMFGPIYARWCV
NRMANDATNAFEGSARS
>SMa1589 hypothetical protein
MNESCPVTKARHDSLASVGTNVAFPVRVPPIQFYELAFQRKETVLIRQLA
DIYRAYLDCLNRQAWDELGHFVDNEIQHNGRLLRISGYREMLVKDFEDIP
NLQFNIQLLVCEPPRLAARLSFNCSPKGEFLGLSVNGQQVSFTENVFYEF
VGSKIVSVWSVIDKSAIEAQLS
>SMa0914 conserved hypothetical protein
MSLTSSLDDAFEPHHASSPADRFIYEMQIYGHRPFQDEPDPRPLPEEPVV
QSALTAMFDAVFEMLGDTRLEPDLEDLLWSTVNLFHRAGERIQRELQRNE
EAQRAGQSEQDGSEVKSVELERLIAEGITLLERRNAFEFMRDYAADLFEA
QTGSAWRPRTGSKVSHANMTAAMIDSRDFLSARRRAETEVLIPAGTKIAF
GGGIDYNDHERIWAKLDQALAKHPDMVLLHGGSPKGAERIAACWAEARKV
TQIAFKPNWTKHAKAAPFRRNDDMLSVMPSGVVIFPGSGITENLADKARR
LGIPVWQASGSGA
>SMa1969 Conserved hypothetical protein
MKPIRTFALALVLGTSAFTAQAANVLVVLSDSDHLDLKDGKVFETGFYLN
ELMQPVKALTEAGHDITFATPKGTAPTLDKSSVDNMYFGGDEAAMQESIA
MLDKLKLTSDSSSPVLSLARVEQIGYDHFDAVYVPGGHAPMQDLLVSPEL
GKLLADFHAKGKTTALACHGPIALLSTLPDASAFTTKLETSGSAKAEGWI
YAGYKMTVISNQEEEIAKGLLNGGKMKFYPQTALEAAGGDFVSNEAPWAS
NVVTDRELITGQNPASAPAVATELLKRLK
>SMa1702 HYPOTHETICAL 21.7 KDA PROTEIN IN SYRB 5'REGION (ORF4)
MKTPWKFLARLASRQPSGKTQESSAGNDTGSKTLEHTSALPPSPTVAASP
LARNEDVSVDQGPIASDKPAGHNGVAQALEPPIHADEAQTTARDEADQSG
AEANSLAPKSTASTKSQRKPRIKRRERGKRANARVDAQSAVVQKHYQNLQ
QSSSRDLFFHELATLDEEIKMLRTQLAQKLHLQNVQLKKMLERFELS
>SMa0601 conserved hypothetical protein
MRPPAGGKAGVFSFTYGGAVIHAAASEAEKPQSTDSRCLAQLARIRQSAE
FDATGREHRFLQYVVEETLAGRGSRIKAYTVAVEVFGRDSTFDPQNDPIV
RIAASHLRRSLERYYLTAGKSDPIVIGIPKGGYLPTFSERGSPEDANAAE
SSMPTMQAQAAGPSDASPVAARPPPGPGPGPGPDDVRAQFERIVSSKEFH
GGGRGDALLRYIIEETLAGRAERIKGYSIAIEVFKRDKSFTQDDPVVRIE
AARLRRALERYYLVAGQNDPLRIEVPKGGYGPTFSWKEAVRAESDRTAVP
DASGPIVSARRRGRVLLTVGVVAVAAAAILGYWTIDRPGSVSSLRAGSVS
VPDGPTLVIAPFANLGEGPNAELYTDGVTEELLTALPRFKEIKVFGRETS
KSLPPDVDVSQVRDELGARYLLAGGVRVSGSRIRVTARLVDASDGAILWS
EDYDNDLQSRDLFAIQSDVASKVATAVAQPYGIIAQTDAANPPPDDLGAY
SCTLSFYDYRAELSAERHAKVSACLESAVARYPGYATAWAMLSIAHLDEE
RFKFNPKSGAPMAMERALQAARRAVQLDPGNTRGLQALMTALFFNGQYAE
AMRTGEQALAMNSNDTELMGELGTRVAMGGQWQRGAALLDRAIALNPGGA
GYYHGTRALAAHMLGDHPAAVAGIRQADLQKFPLFHAVASVIYAEAGMLH
EARRAGETFMRRRPDFVPNLQAEFMMRNLQPKDQLRLVSGLRKAGFSIPD
GVEASIAAAEAADAKSR
>SMa0911 hypothetical protein
MIAMLSASQLADRLARDAEAVCRHYLSAGHRAGNYWIVGDVANSKGKSLY
VHLSGPRAGRWTDAATSQHGDLLDLVRETCGLVDFRDVADEVRRFLSLPH
PAPSHRGDAHTDPPVDRPAGERARRLFRMTKPLAGTLADSYLRQRGILRG
DLHQALRFHPSCFYRDLVTGRTFSYPALIAAVTDPSGAITGVQRTWLDPG
GDGKAKVDDPRRALGGLLGNAVRFWFPAHGPVPVTAAGEGLESILSVAHV
MPGMPMAAALTANHLAAFRLPDGCRRLYIAADADAAGRHGAERLSRRAQA
LGILPLVLAPELGDFNEDLLRLGSDRLTVHLRAQLAPEDAEAFLPT
>SMa2067 probable sulfate/thiosulfate binding protein
MEVRVESLRKEFGRFPALVDVTLDILSGELIALLGPSGSGKTTLLRLIAG
LESPTEGMIFFGDEDASKRTVQERNIGFVFQHYALFRHMTVLDNVTFGLK
VRPANRRPPAAEIRRRALDLIDLVQLSGLEKRYPAQLSGGQRQRVALARA
MAVEPSVLLLDEPFGALDAQVRKELRRWLREIHDRTGYTTLFVTHDQEEA
LELADRVVVMSKGAIEQVGTPDEIYDHPVSPFVYGFIGQSNCLNVTLANG
EIWFEGRPIGLRAANEPDGQATLFFRPHDVELIEGGSGCLAGRVTASRRV
AGTRHLELDLGKTQSSIEVELPPELASSADRTRIALRPTKWKLFCGE
>SMa1978 Probable haloacid dehalogenase-like hydrolase
MSILLPVKQESDMIRSGQRPEWLTFDCYGTLIQWDEGLLAAMERILAGKN
RSIERDAFISVYDRYEHRLERERPHRSFKNVSATALALAMGEFSLDVSPD
DADILTSSISRMPPFPEVVATLTRLKAAGFKLAIISNTDDAIIAGNVAQL
GGSVDRVITAEQAGAYKPARQIFQHAWRELGIEKEQLVHICASPHLDLAA
ARELGFRTIWVDRGTGRKPLADYVPNETVARLDEVCGLLSAAGWME
>SMa1891 Putative thioredoxin reductase
MLAAGDVIDSVFRQAITAAGMGSMAALEAEKFLAEHSPDPAVRPIVPHET
EMVGAQVWGTRPRAGPLRKRGPAHPQFICEGPRFPTPSGCPCIQTAPE
>SMa0392 ABC transporter, periplasmic solute-binding protein
MSHEKFLSAQIGRRTLLASMAAAGASAGLSTLGVSRAFAQEPEKPAEIIV
RAWGGSWVDALKAGVSDSFTKMTGIAVRHDLTEDNEIQPKVWAAVAQKRV
PPIHINWDTTTNATKSALRGVTEDLSDLPNLKNATDLAKPVGLDGYPIVN
TYGYVYVLAYRPSAFPNGAPKSWKDLLDPKLKGRIALYNDGIGFHFPAQV
AGGGKLEDIPANMQPAWDFISKIKEQQPLLGEDPDFTTWFQKGEIDAACT
ISTNAREAKKNGIEIAWVVPEEGAKFDTDGLWIPKGLPENELYWAKQYIN
HALTKEAQQIWLDGLGLPGVIPGITPPADLVGDPSYPTTEEDFKQLIRIS
AKVQVENESQWFAKFKEIMQG
>SMa0581 Nitrate transport ATP binding protein, probable
MTKSYLSLELLDKSFERGGTRTEVLKQVSLTVDKGEFISIIGHSGCGKST
LLNIVGGLTQATTGVVLLDGKVVDEPGPDRAVVFQNHSLLPWLTVYENVR
LAVDKVFSRTRNKQERHEWTMRNLELVQMAHAAEKHPSEVSGGMKQRVGI
ARALAMEPKVLLLDEPFGALDALTRAHLQDQVMQIHATLGNTVLMITHDV
DEAVLLSDRIVMMTNGPSARVGEILDVPLARPRRRIELASDRTYLTCRES
VLKFLYERHRFVEAAE
>SMa2027 Putative LysR-family transcriptional regulator
MDIEDLQTFVAVADAGGVSAAARRLGISKSIVSRRLLRVEAELGVQLLAR
TTRGAALTEAGITFRDHAARASAEIDTAKETILPTGELRGRLRVAMPLTF
GPTHFAPVLADMARQHPQLHIHTSYSDRFVDLIAEGFDCAIRGAYLQDSN
LIAKRVGPIHGKLVASPDYIKAHGSPGTLDELVTHQALMQGTEAWQFMDG
DKIVTVQPQGRFKADSATALAAAAAAGLGIAWLPDCITYGYVASGALVPI
MTRYPVPPGAAYVVRPPGQHPTRKVRALTEMLTEYFKRNPDVWGLDR
>SMa1245 putative crp/fnr-like transcriptional regulator
MSDADLDRMLAHATARRVPQGDAVFEQGQRATSFFLLLHGRLKVTQVTED
GQQIIVRVVHPGDLFGFAKALQRSDYPGTATAATESLALSWPTDLWPQFV
EQNPHLAVSTMQTIGQRLEEAHTRIREMSTQEVERRVAHAVLRLSRQAGK
QEKGGVRIDFPISRQDIAEMTGTTLHTVSRILSAWEQKGLVEGGRQKLII
CDLSGLAALADGGRD
>SMa1329 Putative proline dipeptidase
MEEAGIDVLLATSKHNTQYLLGGYKFIFFAAMDAIGHSRYLPIVVYEKGS
PDHSAYIGNRMEGGEHQNNPFWTPAVHTATWGTLDAAALAVEHLRKIGKA
GARVGIEPSFLPADARDLFASRLDGARFVDATHTLERLRSIKTPQELEKL
KVASELITDSMLATIAAAREGSTKGEIIERLRREETNRGLHFEYCLLTLG
ASHNRAASPQAWAKGEVLSIDSGGNYQGYIGDLCRMGVLGEPDAELEDLL
AEVDSIQKAAFARIKAGATGSEMIASAEEILKSSPSAAFTDFFCHGMGLI
SHEAPFLMTNHPVAYEGVDADRPLEVGMIISVETTMLHPRRGFIKLEDTV
AVTTDGYEMFGNRGRGWNPGGVRSGEILRKTEPGARWHAGLVVCHVW
>SMa2389 putative stress-induced protein Ohr
MCLHPGTISGVSQTFPIRSPGKTHVVKCRDRATPRTECLMTTQEKVLYTG
KTHTTGGREGFARSDDAQLDVRLSPPGGGKAGTNPEQLFAAGWSACFIGA
LGLAASKHKLTLPAETAVDAEVDLAKSDGGFFLQARLAVSLPGIDADLAR
SLIAEAHQTCPYSKATRGNIEVDLTLA
>SMa0988 hypothetical protein
MVSFVLRAAPSTPSVAFVRRSADFSNQLTIQPKPSHSRPSSGSAKALVLF
LRCWVTSSLYTIVHCSSISFLRHAVWPECGSWRLEDVDAQAKGTFQR
>SMa0275 conserved hypothetical protein
MSGKKILMLTGEFTEEYEIYVFQKGMEAVGHTVHVVCPDKKAGDRIKTSL
HDFEGDQTYTEKLGHYADINKTFSEVRPEEYDAVYAAGGRGPEYIRTDKR
VQDMVRHFHDTGKPIFTICHGVQILMAVPGVLKGRKVAGLGACEPEVTAV
GGTYIDVEPTGAYVDGNMVSAKGWTGLAAFMRECLNVLGTKITHT
>SMa1572 hypothetical protein
MRAPPAILRRLFRSQSGATAVEFALVCLPLLLLVFGIIEFGRAFYVRNEL
SHAVDVAARRVLIGQIARDATDSEALTKLAGAVRESFHSGDPTLLTIAVT
KETVDGIAFRVLSIRYPFTFVVPGLAQSPVSLNLSRRIPIG
>SMa2337 putative transmembrane transport protein
MLAAVVQGSDPIMTIAQTSPAVREGSTAAGAGRLYAVLGGLYLAQGIPTY
LLLVALPPLMRESGASRTAIGLFSLLMLPLVLKFAVAPLVDRWAPWPGLG
HRRGWVVPTQLLVSAGIASMALVEPDRAGTLFAIGICITLLSSVQDIATD
GYAVRHLNGRTLAIGNAVQAGSIALGVIVGGTLTLVLFHKIGWRPTILLV
ACLSLLPLVAAIWMKDRAVASPEAPLRRRASLFGFFRRPNAWMILAFALT
YRASEGLVRGMEGSYLVDSKVPTEWIGYMSGAAAATAGLLGALIAALIIR
KAGLTATLILLGGLRSLCFLAFALNAFGIWPGIAVAMSASAFQTLIRYME
LVAIYSFFMASSSDDQPGTDFTILSCAELVVYLIGTSIAGYVADRFGYAT
LFSSATVISVLGIGLSVWMLERLKARPSRSR
>SMa1620 putative integrase/recombinase
MTQLAQHLTAFLREHLPRERRASVHTCDAYAYSFQLLVTFAARRLSKRPC
LLQIEDIDVPMILAFLEHIEETRGNKARSRNARLAAVKSFFRYLEHRVPA
VLDQALRVHAMPMKKIDEALVASLSRTEVQALLNAPDRRSLSGIRDRAML
HLAFAGGLRVSELVGLTLDQFDGRSPASIHIIGKGRRERVLPLWQETAAA
IRAWIAVRPKNGDTALFLNNAGRMMTRSGFEYILEKHAAAAVSVAPTLAT
KSISPHVLRHSCAMHMLQATRDIRKVALWLGHASLQSTEIYLRADPTEKL
EMLDALAPLGIKPGKFRPPDKLIAMLATR
>SMa0967 hypothetical protein
MLHEEFADALKEGTADFKKLGRILKIILFGSYARGTWVDEPHTKKGYKSD
YDLLIVVNNRKLTDFSSYWQKAQDRLMHLPEIRTPVSLIVHSRREVNTAL
YDGEYFFVEIRRDGILLYELDDEPLAEPRPRGPADALRIAKDYFEDRLPH
AKTFVEGTQFFVSRGRRKEAAFLLHQSIEQTYAALLLVLTSYSPASHNLR
HLRSLAEERDQRLAEVWPRDQHQYVAWFNILNEAYVKSRYSKHYEISEDA
LAWLLERAHQLIADVEAICVEHLDRLRKQAEDDVD
>SMa2267 conserved hypothetical protein
MRKTPIKCQTTPGCPVSGHLKRPLLDQTIPCRLERHPGDWNVALKRMDNV
GIVVEDLEGTIEFFRELGLELEGKATIGGEWAGRVTGLGDQHVEIAMMRT
PDGHSRLELSRFLTPPIVADHRNAPVNSLGYLRVMFTVDDISETLERLRT
RGAQLVGEVVDYEDVYRLCYIRGPGGLLIGLAQELG
>SMa1294 Hypothetical Protein
MRGAVLSSAVDCECRDRVDGALRDLERLERDRIVQRLLAAADEQRRRIEA
LLVLLADFDPKESAVLDDGMIVEAGLLFGDIAAAAELGSSLLRQSRQLRF
ANDMVQEVAESASCEFPDIDK
>SMa1322 Hypothetical protein
MNTSLVRAFVAAASLASFYLVFDGKARADDWGCQVILCLSNPGGPTQYAE
CRPPIEKLWRVLAKGHSFPACSGVGFQASRPGYEPYYCNDGYRLTTRYGD
RGREASCISTTPQIVSSHECYFDNDRSTSPRWQRSEGRIKCQRYVTTRPN
IRPQPHYVDVTIDGGGKQRVWY
>SMa0137 putative kinase/esterase
MSSLSGGVRMRFVAGKVAVFRWMLPLAIAMALMGGIYLAVRWSAQQSDAV
AVKRQQHLVELVISKMQGSIAHDQESVTVWDDAVRKVSQEWDPRWVDSNL
GSWMNSYFRHDGAFVISPDRKPLYAFLAGQTNEQEAFSEIGPEAMPLIAK
LQERLAAGDEGGTSEQVLSIGESDLVRIGGRPAIISVKPIVSDTGDIVQE
PGREYLHLAARFLDGDFLTHLGDDYGFEDLRFSVLPELGRQRSYAPIVSS
SRETIGYFSWLPFRPGADVMKATAPVLLVAGALLFALTSALSVVLRRRSR
RLQESQAELDHLARHDPLTGLANRASFNRLLARVVATSTIDQANALLYLD
LDRFKQVNDTLGHPVGDRLMVEVAKRLKETAAGAAISRIGGDEFTIIVAR
TRQDEVEKLCDALIAAIRRPFEIDGQPILIGLSIGVAVATGNDADPIEIT
RKADIALYHAKSAGRNRYAVFGPHMDELIRTKRDLEQDLRAALDAGNQLE
VFYQPVYSAQSHEISSLEALIRWRHPSKGPIAPDAFVPLAETAGLIDRLG
AFVLKEACSTAREFPGLDIAVNVSAVELAQRHYAAQVLSLLRQFGIEPSR
LELEITETALLDEAGVCEKNITALREFGVKFALDDFGTGFSSFGRLQRLE
VDRIKIDRSFVHAFDRPGGGVAVVQSIVGIAHAKGLRITAEGVETEEQSE
ILRKLGCDELQGFHLSRPMSKNSLRQLLGADKAQGTSSSSRS
>SMa2335 hypothetical protein
MITLGPHSLSFLYVAAKPSQSIIDAARTHHPDLGSIWRMPMSDFLFLAAG
IGGLAALALYARALSRL
>SMa0383 conserved hypothetical protein
MSAAKSSGGTFAPLAQPVFAVLWTATVLGNTGSFMRDVASAWLMTDLSAS
PAAVAMVQAAGTLPIFLLAIPAGVLTDILDRRKFLIAVQLLLASVSISLM
VLSQTGMLSVSSLIGLTFLGGIGAALMGPTWQAIVPELVKREDVKSAVAL
NSLGINIARSIGPAAGGLLLAAFGAGITYGADVASYIVVIAALVWWPRAK
NANDALQENFFGAFRAGLRYTRSSTPLHVVLLRAAIFFAFASAVWALLPL
VARQLLGGDAGFYGILLGAVGAGAIGGALVMPKLRERLSSDGLLLGAALI
TAAVMGVLALAPPKVVAIIVLLFLGGAWITALTTLNGTAQSVLPNWVRGR
GLAVYLTVFNGAMTAGSLGWGAVGEAVGIQSTLIIGAIGLLIAGLIMHRV
KLPAGDADLVPSNHWPEPLVAEPIAHDRGPVLILIEYKVEKQHRTAFLHA
IDHLSRERRRDGAYGWGVTEDSADPEKIVEWFMVESWAEHLRQHKRVSNA
DADLQGKVLAYHVGPDKPVVRHFMTINRPGAA
>SMa0394 putative ABC transporter, permease
MRAPRAAYPLTWRVMDVLERLAAIVWPSSFQRGLPYLMLMPALVLVGLLV
LGLVQIGDTSLRTLDTNTFLMSESYTLANYQRVLTESFFATVAGRSLVGS
VIVTVITLLLAFPYAYLMVRTPSSALRKFLLVALFLPFFIGQVVRAYGWL
IILGNQGMVNEALGLVGVPPIRLLYNYPAVLFGLVQYMLPFAVLMLAPAL
TAIPSELEAAAASLGAGWTRTFRHIVLPLSRPGLVGAGLVVLTLSLTDFA
IPAILGGGTQDFIANAIYDQFFRTSDQGMGATLSLMLVAVGSMLVGVVFM
LFGAGTLAMTGDRK
>SMa1727 putative hydrolase
MTTTILKLALAVGLLFAAGPGFATEIPVPTQTEWAAAKKTVDLPNGIKLA
YSEMGNVEGKPLLLIHGYTDNSRSWSLVAPYLKNHHIYAIDLRGHGKSSA
PECCYTYLDFANDAFLFLEAMKIEQADVVGHSLGSLAVQMLAAQHPEKVR
KVVLISSTLNTGGGPGTWLWDNIKPLQPPIDPNGKFMTDWYWNPNPVDER
FIKPEREESAAVPIHVWKGVLWGTTTGDLGKISSLIKAPVMIFWGDQDQL
MNAPQQAKLKAAFPKARFETFPGAGHNMFWERPEKAAELINSFLSE
>SMa1717 putative integral membrane transporter
MMFKMGRNTMNYQSKVAGDPASPEKPGPGGAIDRLFEVTRSGSTIRTEII
AALTTFLAASYVIVVNPAILQNAGIPFSGGVTATVLVSFIGSCAMGLYAR
SPILVAPGMGINALFAYTMVMGAKVPLEIALGCVFWAGVLFTILAILNLR
TAVIEAVPKDLRYGIACGIGLFIALIGLENAKFIVASPDTIVALTQFTPV
TLTFIAGFIITAALVVRRIPGAMMTGMIITTVLAIPIGRLWGDGSAFAGG
TPDVQTLVNWSGLFAAPDFSFVGRIDLLGALQVAYAPFIFVFLFTNFVEA
LSTFLGLAEAANLKDESGMPRNIKESMHVDAVAALISAPLGTSPATVYLE
SGAGIAQGGRTGLVAFIAGLLFLPFLFLSPLLSLVPTIATAPVLILTGLF
MSAPMGQINWADMEDAIPAFLAIVLIPLTFSITLGLSLAIIAFVMMKLAL
GKVSEVKPVMWFVAVLAAMLVMQVQ
>SMa2095 conserved hypothetical protein
MSERPILVTGATGKLGRLVVERLVAFGQPVRVFTRRPEAAGALFGKTVQI
AAGDFGDRTSLETAVRGVARLLLLSPISARLVTDQVAAADAAVAANVARI
VKISGSDWTIEPAGNSISGDAHAAVERHLGSLPIEAVSLRPNAWMQVSLA
NIIRQVVTSDQVVAANLDAGIGYIGARDIADVAVQQLLADRLDGRTLNLT
GPDIVSFRQIASLMAAALKRSVAALEEPPQIVPQDGDFEHRAVAQFVRLI
AAGRAATTTDVVRTLLARSPRTVAAFILEQVAPAAAVPEYQP
>SMa0021 hypothetical protein
MYRLYNHAIARFSLDCFEGLTGDKTCRLKAQAWLIAELHQLGYRDMPQKR
CCAYRGLRHSYRKIAKEPRSEQTSSCEEADMKPATILLAEDEALLLLDYE
AALADAGFVVVAVARGGKAIEVWRSADSEVAGVATDIRFYELPNGWSVAR
VAREIYPGIPIVYGTGHGALEWRSRGVANSILLEKPFALAQLVTAVSELL
NEPVLLSVAPDPNP
>SMa2061 conserved hypothetical protein
MNVAERKKQTVVSSGISGLDEILRGGLPASNLYIVQGAPGSGKTTAALQF
LRAGVALGEPCIYVSLSQTRAELEAIALSHGWTLDGIRVEELSASDSINR
AADQTIFQTTELRLDETRKAIESAIEEHKPSRLVYDSLLEIRLITADSPR
FRRELIAFKAFLAQRNIVALLLDTQTSETDRTGEEVDGIAHGVIRLDKSL
EEYGGVRRRIEVSKMRGVPVADGYHDMAIREGEGVVVFPRIVPGAAARTV
KPQLIKSGVDTLDEMFGGGQESGTTTLVIGQAGTGKSTMASLYATAALKR
GESVGLFLFEERIETFFRRSEGLGMALRQFHEDGQLILRDFNPNEISPGE
FGQIVQQSVNSEKVKVVVIDSFTGYLNSLPHREKAVRDIQSLLKYLARAG
VLTILIVAQHGLLGQGVGIDVDVSFLGDTVLLLRIAEHEGRLRRSITVVK
KRHGPHDLDVRELFIQSSGVSVIAYNPLPEV
>SMa2361 conserved hypothetical protein
MTRDLMMSRRNVLASGLVLGVSALAPAVRASAPIKVAGVHASPVENAWNS
VLHKALQDAAAEGVIEYVFSEGISGTDYPRAMREYAEQGAKLIIGEAYAV
EKQAREVAADYPETAFVLGSSGKESGDNFGVFGTWNHDGAYLAGMLAGKM
TKSNVVGSVGAMPIPEVNMLINAFAAGVKAVNPDAKHLVTFIGTFFDPPK
AREAGLAQIDAGADILFGERIGTADAAKERGLKSVGSLIDYTPRYPDTVF
ANALWGFRPILNAAIADVSAGKPVGKDYTAFGLLKEGGSDVAYVKGVAPA
DAEAAMEAKRAEIKSGAFEVPRITDEPK
>SMa0665 conserved hypothetical protein
MALCDRIERTSLKNQADELLFPALQARCAMDDNPSSQRNPDPVNPAARPS
IEPRITEIARIGLVGLFAYWSFTLIAPFAIILIWAAILAVALYPAYAALS
AILGQRPRVAALVITMLGLLVIVAPLAAIAFSFAEGLQVVLARLNDRSLL
ISAPPDSIRSFPLIGERIYSVWSMASDNLEAVLQQIKPALLQAGSKALGK
IASIGADLLSFVVSVLVAGFLFGSGARLANSAQGFASRMGGDRGVGFLQL
AAATIRNVARGVIGVALLQAFLCILILSLFKVPAPGAIAFVVLILCIIQI
GPALALLPVIVWAWTSMEFGMAALFTILLIPLLIIDNVMKPILVARGLST
PTLVILLGVLGGTLSYGLIGLFLGPIVLSVFHSLLLIWMNTDTVGSEGLR
LGNPDYRIRREKT
>SMa0656 TRm24 transposase
MSQCYLHLTLPDRRRVHQLLERKAPTAEIARQLGRHRSTIYRELKRNTFH
DAEFPEYSGYYSGIANDISKERRRRLRKLSRHPQLRELVIEQLKALWSPE
QIAGRLLADGVSAVRVCTETIYRFIYSKEDYALELYQHLPEGRRKRRPRR
SRKPRDGSIPLDCRISQRPDFIADRSQFGHWEGDLLIFRRDLGEANVTSL
VERKSRYTVMIKNGSRHSRPLIDKIIDAFSPLPAFARQSFTFDRGTEFRG
FKALEDGLGARSWFCDPNSPWQKGAVENTNKRIRRFVPSDTDLSAVSQPQ
LVALAHHLNSLPRKCLGYRTPAEVFMAHLRDCG
>SMa0286 conserved hypothetical protein
MSLFCGSGGRRRAVIHLFSRDLAQGKVPAGHTSCQTPSFRSTMAATSWRG
PVHSSPSILRARAASRPRSDCLPSAPQGPTGNSAHTPLGGPVARGLDGSH
EFLAIDLVPVMRDASERLDLAAAREIVTASPDVLGGTPVIRGTPVPVYDV
AAS
>SMa0480 hypothetical protein
MGLQVESVIRRFRLLEGISPEVAEMLKNTPCSMKVFDVLRQMNAVRQIEA
ADLMIGQHNFTLMFARAIRAATPDSQLVAAKKMTGAAASTPTGQQIARME
RELHHFRLK
>SMa1789 putative adenylate cyclase
MDIAAWLRSLGLEEYASAFRDNDIDAQLLLHLKAEDLKELGVASIGHRRK
LIDAIADLRDEDARSSNVSVDRPLPAPTAASPEAGAERRQLTVMFCDLVG
STALASRLDVEDLREIIGAYQQCVSDTIKRFGGFVAKYMGDGVLVYFGYP
QAHEDDAERAVRAGLALIDIVNELELSDPLQVRIGIATGIVVVGDIVGFG
EARERGVIGETPNIAARLQGLAEPDTVVIGERTHHLLGHLFDFRDLGTLE
VKGYSEPIRAYQVLRPSILDSRFEALHGERLTPLVGRENEIEALRHCWQR
AKGIEGQVILLVGEPGIGKSRITVAVLEEIANEQRTHLCYFCSPHHSGSA
LYPIIRQLERAAEMSPDDDTSTKLDKLETLLAATSTAAEDCSLVSDLLSL
PSSGRFPTFDLSPQQRKSRTLQALVRQLEALAGREPVVMIFEDVHWIDPT
SLELLDRTVERIRMLPVLLVVTFRPEFTPPWSGQPHVSMMTLGRLGQRDG
VALVEHVLGQQDLPAEAIQGIVERTDGVPLYLEEFTKAVAEMCAEGDHAQ
STASSASLGIPATLHASLMARLDRLGAAKNVAQIAAVIGREFSHDLLAAV
APYAATELRAFIDQLTSSGMVFRRGTAADSLYLFKHALVQDAAYNTLLRR
PRQQLHGKIARTLEELFPGRAAREPEVLAHHFAKAGQAARAIDYWLIAGK
QAAQRSANLEAIDHFSRGLKALEALPLGSEKDWKELALQTALGTALISVH
GYAASQTGAAYARARILCQKFGDAAALHATLSGEFVYHFVRGDRAMMRRL
TKEARRTAESTGDDAFQLAGHRMDGITAMYDGSFIEASHEFETILSLYDP
DRHRPPPVLYIHDPKISALAYLAILKWILGQSDTARGFAIEALRYAEELK
QANLTAHVRTYAGAGLHELLGETSAVRRHADAIVALADQHSLHYWRLNGL
FLRGWAIAQGASVEEGLAVMRQNLRARSALGVSWYHVRYLCMLATTLQKS
GAAESALAVVAEAKDQAACHCEHMWDAEVERIEAEMLEVCARSTVECEAR
FQSALVTARRQSAKSFELRAALGLAKLWAEHGRGDEARDLLWPLYESFTE
GFNTRDLTQARQLLDKLH
>SMa1115 Putative manganese transport protein
MSCACTAGTDEIAPTITKKIIARTIKPNLGPFVSHGSERYDTAICQPAPL
NGSRSPSFRPPDDGGGSSNLRCCLRPTWTARRDYIALTVWTADNLYVEIG
PEAPRMRRKSWHVDNDDRVQSGLLAKPVVYRLRRAALLDKLGPGLITGAA
DDDPSGIATYSQAGAQFGANMLWMMFFLYPLMCTMQMISARIGRVSGHGL
AANMRRIFPSWVVTSLVALLFIANTINIGADLAAMGAAAELVLGWGRHLF
TLVFAVASLTVQVLVPYHRYVLYLKWLTLVLFAYVGVVFTIEIDWSETAL
RMVTPQLALTRETAIMVVAVFGTTISPYLLFWQASEEVEDDEADPTTDPL
IDHPEQALVQLSRIRWDTYIGMAFANLVAFFIILTTALTLHAAGVTEIET
SADAAEALRPIAGDLAFALFSLGIIGTGLLAVPILAGSAAYAVCESRGWP
IGLEHKPREAVGFYTVIGLATLIGLAVDYSDLDPIRALFWSAVLNGVVSV
PLMAAMMIVVSRKDEMGQFVASFRLRVMGWFATACMAAAAITMFILS
>SMa0998 probable ISRm25b transposase
MEGISTRSVDDLVKAMGMSGISKSQVSRLCEEIDGKVKAFLDRPIEGDWP
YLWIDATYLKVRRGGRIVSVAVIIAVGVNSDGRREVLGMEVGTSEAEPIW
TEFLRKLTRRGLRGVKLVVSDAHEGLKAAVTKVLSATWQRCRVHFMRNVL
AHAGKSGRREIGAGTCAAPNRRSECRKRSSLDPRSFAPL
>SMa1637 TRm17b putative transposase
MSACSNRHADIIVKGSRDVDYGHKLNLTTGRSGLILDLVIEAGNPADSER
LLPLLERHIAFYGEAPRQAAADGGYASRENLRQAKAWGVRDMAFHKKSGL
RIEDMVRSRWVYRKLRNFRAGIEAGISCLKRTYGLARCTWRGLDHFKTYV
WSSVVAYNLALFARLRPT
>SMa1198 possible Copper export protein
MNKPRLDGRRWDAPARMLGISVLACSAWLWLAATAFAHASLVETIPADNA
VLAESPATFSMTFSEAVSPLSLKLVGPDGSSVSLERYEPRDRTLEVEPPS
SLVRGTHVLVWRVISEDGHPIGGSVIFSIGPPGATPRAAAVKIDGEVGTA
IWLAKVALYLGLFLGIGGSFALSWLGRVERSGTVTVHIILGIGLFGALLS
VGFQGLDALAAPLRRLADSATWQAGMSTSFGRTSVVAVLASAMAIFALVA
KGGWGRLLSLAALIGTGLALALSGHASAAEPQWVTRPMVFLHGVGIAFWT
GALIPLGLALARRTPESGYMLRRFSNTIPLVLALLIIAGMVLAVVQVRNL
SALVETAYGAVLLAKLALFVLLFALAVFNRLRLTEPAERRDAPAARRLAR
SIAIETVVAVLIFGVAAVWRFTPPPRALEIAAAQPATVHLHAPRAMANVR
LSPGRAGQVAASIEVFSKDAKVLTPKEVTLVLSNPASGIEAIRRPAQRAG
EANWRVDGFVVPLPGTWHVRLDLLVSDFELVKLEGEVDIRR
>SMa1699 hypothetical protein
MLAQGRREREGLSQCDALRHGQARSKVVLWFYLGVLYTTGRANGLRLGWK
ANTPAPNAEVHGSPGCACSAIWLGPWGTAVDRYGPSGRKDGSISIDLMRS
CTRSVQAETQRPKGLPSQPAAEGGE
>SMa1332 Conserved hypothetical protein
MPLIPRKTILEKFHGMISAGKPIIGGGAGTGISAKAEEAGGIDLIIIYNS
GRYRMAGRGSAAGLLAYGNANEIVKEMALEVLPVVKATPVLAGVNGTDPF
ILMPQFLAELKAMGFSGVQNFPTIGLFDGRMRRGFEETGMGYGLEVDMVA
EAHRLDLLTTPYVFNEEEAIAMTKAGADIVVAHMGVTTGGAIGATSAISL
DDCVSEIDAIAAAARSVRKDVIVLCHGGPISMPEDARYILDRCPGCNGFY
GASSMERLPAEVAIRRQTEEFKALAISTVV
>SMa1538 putative oxidoreductase
MRKAYLVLIPVLAIVAVTAIEPGSYGETQFLGQDILVAKADKLSIVFATV
FTIMALIGTVYALHLSGAGQHVAAFVYVGSALGVVFAGDYLTLYLFWEGM
AFASAYLVFAQGGRQAIRAAFRYLMVHVTGGVVLLGGILLHGAAAGSLLF
GPVEGPLGAGAYLILAGFLLNAAVPPLNAWLTDAYPEATVTGAVFMSAFT
TKTAVYVLARAFPGIELLVWLGTVMALYGVIYAVLENDCRRLLAYHIVSQ
VGYMVAGIGIGTELAVNGATSHAFAHILYKGLLFMGAGAVIYVTGRRKLT
ELGGLYKAMPLTVALYMIGAFAISAFPFFSGFVSKSMVVAAAGQDHRALV
MLALTMASSGTFLHTGLKLPYYMFFGTDRGLQAREPPGNMLAAMGMAAAL
CIAIGVFPGPLYALLPYPVVFEPYTGVHVTESLGILMFTALGFVIFLRAL
DPENTISLDTDWFYRKGARAFMWFAERPLARYEKAVSDVSETTVLPFLHG
SARTGMRLDLNGVDGVVNGVARSILEGGGVLRRLQTGVITHYVLAMIAGL
IAAILVFAVVWR
>SMa1864 Putative ABC Transporter, ATP-binding protein
MGNLVEIRDLKVEATTDTGRRVEIIKGVSLDVAEGEIVALIGESGSGKTT
IALTLMGYARPGCRISGGSVLVAGNDLVTLTEKQRAKVRGTEVTYVPQSA
AAAFNPAATIMDQVIEVTRIHGLMAAAEARARAVELFRALSLPEPETIGS
RYPHQVSGGQLQRLSAAMALISDPKLVIFDEPTTALDVTTQIEVLRAFKS
VMKKGGIAGVYVSHDLAVVAQIADHIVVLKGGEVQEVGTTEEILSSAKHP
YTRELLSAFEPKPREAADAAERAPAPLLKIENLVAGYGASKTDGLPLVRA
VEDVSLKVEKGRNLGIIGESGCGKSTLARAIAGILPAAVGKIVFDGKELG
RSARERTRDQLREMQIVFQYADTALNPAKSVEDILDRPLVFYHGMNARAR
SLRIDELLDMVRLPRNLRHRRPGELSGGQKQRVNFARALAADPKLILCDE
ITSALDTVVAAAVIELLKELQRELGLSYIFISHDLSLVEAICDEIVVMYG
GKKVEDITPAKINAPHHPYSQLLFSSVPKLDPSWLDGLEQDPELVRAYCR
R
>SMa1632 putative LysR-family transcriptional regulator
MNKWDKLACGIEQIGTMEQANLKELEAVIAIARRGTFRAAAIDLGMSTTA
LSHTISRLEAALGVRLFNRTTRSVSLTDAGRLFVQQVAPSLQDLYAALDS
VRSQRETPSGTIRINAAPFAARAIISPLVLQFLRRYPDMNVDIVTEGKMV
DIVKDGFDLGVRVAGLVPSDMIALSLGRPQRHAVVGSPKYFEQHGKPIVP
PDLLNHRCIRVRLPDGSLFRWRFEKDGETLQIDVRGPITLDEASIVRTAV
LESTGVGYTMEQEVLPDIKAGRLVRVLEDWTPPYPGLCLYYPGRRNLSAG
IRAFLELAREFSRRAAE
>SMa2203 putative ABC transporter, permease
MTRWDISSIALRLVTFSMMAFLIFPIAVTLIVAVNPREFVLPPNGFTLDW
FRAAWSSKTFLRGMGVSLVLGLAAALIANALALPAAIALARTNFPGKGAI
NLILMTPLLIPTTILSLALYIYFVRVGYGSGLVPLLVGHAIHIMPYAVRI
LTASLLNFDPSYEEAARNVGAGRIRTLFSVTLPVIRTGLISSLTLCFILS
WNDFPISVFLAPPSWTPLPVELYSYIKFQYDAVAAALASSLILLSAVAMV
VIDRLAGLRRVLRS
>SMa1985 Hypothetical protein
MVQTVSTHVGNRAANRTALVLGATGGVGGAIAARLMREGWQVRALCRNAD
AARSGWRHDCPAPQFVTGDAMDGASVVRAATLGDGVAAIVHAVNPPGYRN
WSSLVLPMIDNTLAAARAAGGARIVLPGTVYNYDPVQTPVIDENTPQNAR
TTKGRIRVALERKLAEASPEVSSLILRAGDFFGPGTRASWFAQAMVQPGR
PVRKFTSMASGVPHAYAYLPDLAAAFAGLMAIPERLRAYETVQFAGHWDP
TGTQMRDAVRRAVAQDVPERAFPWWMMRLAAPFGGFPKETLEIEPAWKHP
MRLDNQRLVDLLGVEPHTPLDQAIAAALTDMGCLARSQSHARLRHA
>SMa2081 probable ABC transporter, ATP-binding protein
MAPLLQVDNLTIGFPRAEPVRNLSFEVGAGETLAIVGESGSGKSLTALAL
MQLLPRAAQVTSGRIIFDDRDLLDLDAREMRRLRGREIAMIFQEPMTSLN
PVMSIGRQIGEVLKVHEKASARAARERAIELLKLVRIPAAEKRIDDYPHQ
LSGGMRQRVMIAMAVACRPKLLIADEPTTALDVTIQAQVLDLLDTLRREL
QMAVVLITHDLGVVAQWADKVVVMYAGRKVEQALPGDLFNDPLHPYTRGL
LSASPRLKADFHYLRGPLNEIPGSIVSAAGEAGCPFRPRCDQARASCALQ
VPPLIAQTPDRLVACPFTSSLKAVPDAAHLSL
>SMa2063 hypothetical protein
MDATDSPLDWVLTMAPYRRDAEHLEALLAQNGLPVRRAEGVEELTALLEQ
GPGVLVATHEALNPEVLEVIRRYLIDQPEWSEMPLVILLDRAVPQARVRA
ELSAAWPRSRQLYYQRPVTTLELLSGIQSILLARLRQRDIRDRIEREIEL
RRELNHRVKNILASISSIFQMTRRRAVSLDEFAEDFAGRLAALANVHSAV
FQADGEAVEFSSVVELTFSPYRVRGTDRLVFTGPTVLLRPDAATTLALCL
HELATNALKYGALSTPEGRVSLQWSVSEEADLLSLEWVESGGPSVSEPSR
SGYGTRYIRSALSSQFGTPPVILFHPQGLRCIVSGPLSRVSPRQPEV
>SMa0130 putative fatty acid desaturase
MSAHVYPPASLVEDNAGAWLKTLAKYRQPRLGRSAFELFVTLVPFAIFWA
AACFSLANGFWPGLIAILPASAFLLRLFMIQHDCGHGSFFSRRGLDDWTG
RLLGVLTLTPYDYWRRAHAAHHATAGNLDERGVGDITTLTITEYCALSPI
KRLGYRLYRHPLVMFGIGPAWLFLFKQRLPFGMMNSGALPWISTMATNFA
IVTLAALMVWAVGLGTFLLIHLPVVLLAGAAGVWLFYVQHQFEETHWSAG
EDWRFPQAALHGASHYDLPPVLRWLTGNIGIHHVHHLSSRIPYYRLPEVL
RDHPQLAGIGRITLWDSLKCVRLVLWDDRRRRLVSFHDAAGSLRRSLTED
GRKTK
>SMa0448 hypothetical protein
MVGLLTGLELMAILGVGPVVAAGWLAATAAGAVAGAVAGGAAGGLIGALT
DSGVDEEDAHVFAKGVRRGGTFVTARVDDTLAPEAQAILQDLNRVDPAAR
WGVFAQEEWTRFDENADPYTLEAVPLYDAVTASKDW
>SMa0151 hypothetical protein
MFSGIANLAYVSIPMFVLMGAAVASSPAGSDLYTSLDRWLNRIPGGLILS
NIGACAIFSGMTGSSPATCAAIGKMGIPEMMRRGYPASVASGSIAAGGTL
GILIPPSVTLIVYGIATETSIGRLFMAGILPGILLTIMFMTWAVIDCKRK
GYEFEARLVRYSMKERLAVLPRILPFLLIIAGTLYVLYGGIATPSEAAGA
GAFLTLAVVIVAYRLFRFRPVAGIFGSAMKESVMIMMIMAAAELFAFALS
SLFITQTVAAAIADMEVNRWVLMAVINVFLLICGMFLPPVAVIVMTSPML
FPIVTQAGFDPYWFAIVLTINMEVGLITPPIGLNLFVINAIAPQIPTKDI
LWGSLPYVLVMFLAIILLCVFPDVATWLPNQMLGDRPMNTTSFFSDMLQS
IAERGRRFLSLGPARNGDVNPVGTMEALCDTLLSSRGEASGMALAKNILD
RWQGFDQDKRRDFMLALLSRFGPDIERLERAIDAYRADPTPKALLEMSMA
AEPRRQELIRRLNLAPNGIATLVRMRADLLELKAQNPDLEAVDTDFAHLF
GSWFNRGFLVLRPISWSTPADILEKIIRYEAVHHIGGWDELRRRLAPEDR
RCFAFFHPQLVDDPLIFVEVALTREMPPNIADLLKEDRAPIRATDATTAV
FYSISNCQEGLRGISFGNFLIKQVVEDLRRDLPRLDTFVTLSPVPGFADW
LSRERQAETSNALSAADRSRLAALDEPDWADQPEIAAAIQPSLTAAAAWY
FLRARNRNGKTVDPVARFHLGNGARLERINFLGDRSERAMRQAHGLMVNY
LYKLDDIETNHEAFATRGEVVAAPAIRRLIPADRSSRSLVPGPNVFPPGV
RSADAPGRKGDKEFTS
>SMa1996 Putative ABC transporter, permease protein
MTSSSSTFEKLLTSRGSRRGLWLLPLVIAIAMALWLAATTSQFGQSSNLA
NLVAQGMPLLITAVGQMFVVLVGGLDLSIGSVVSFTTAILALDLPGFVTI
PAVFVLAGLIGATNGYCVTRLSVHPIVATLSMQYIVLGITRVLRPVSGGA
VPDSVRWMVEGSLFGIPLPVFWGIVTMLVAWKLLYGSRYGLHLFAIGGGV
ASGSADAARNFGIPANRNILLAYVICTLFAALAGVFLAGRIVSGDPNVGL
LMELDAITAVAIGGTQLSGGVGSLHGTVIGVAALALLSNGMNLLNVTPFV
QTAIKGILLMAVVALQSRKKIGL
>SMa1353 Putative epimerase
MHLSTHNWMRAEPLAVTLKRIKKYGYESIEISGEPAQYDIKDTRALLKEH
GIRCWGAVTLTLGERNLAAKDESQRAKSVDYVKSVITMVSELEGEIVTLV
PATVGKVVPDGTEEEEWQWVVDATKECFAHAREKGVRLAIEPLNRFETYL
FNRAAQALALADAVDPDCGVCLDAFHLNIEEEDIYDAIRLAGNRLFDFHV
ADNNRFAAGLGHLDWPKIVATLKEIGYDGALTNEFVAPVDRTPAAKYPDM
VERNPVDIPPEQLKFIQDHGSSLLTEKFYDDQMRITAETILPLIK
>SMa1018 hypothetical protein
MTMPPALRKFTLVVHVTSSVSSLGAVACFLALGLVGLASADVDTAGSLYV
AMDAIARLVIVPMVLASPSLITGLVQALGTVWGLFRHYWVLAKFLPTVLT
AVVLLLQLDLIGYVAGEAAT
>SMa1371 Probable ABC transporter ATP-binding protein
MPLLEIDNLHVCFDTRAGTVQALRGVSLTVAPGETLGIVGESGSGKSVTA
QAVMGLIDVPGRISDGEILWEGKPLAGFAVANAARDIWGREITMIFQNPM
TSLNPLMTVGAQIAEVIEVHMGSSRRAARRRAAELLSAVGISGAERRLDQ
YPHEFSGGMRQRVMIAMGIACEPKLLIADEPTTALDVTIQAQILELLAEL
QEKMGLAIVLITHDLGIVAGLCHRVAVMYAGQIVETGPVDAIFENPSHPY
TQGLIRSTPGLDADEERLTAIDGAPPGLLQPPSGCAFLPRCPIGDEGCQG
PQVLRAVGAGTVACRKAGEQAWREAV
>SMa2125 probable ABC transporter, permease protein
MISVQRRSGNSLSWNFACYAAAMLCALVSAAALLHLSGGDVAKAFSSLIV
GAFGSQKALLGSLAKATPLLLVGLGTVIAFRAKIWNIGQEGQVLAGAMCA
YWASLWIGPLPYWIAFTVLVLAGLAGGGALGVLAGVLKTRFGTSEIISTV
MLNYIVIFLLAYLLDGGPWMETGVTVAYHQSPPVNAMLEWPTLLGQGAHK
LHFGFLLALVATVLCAVLLERTPLGYEIRAFGSNPTALRFRGTDISRLLL
VVMLVSGALAGLAGAGELFGTSHRLRAETLLGIGSSGIVVAMVGGLRPSG
AMLAALFFGALKSGAIYMRLQSGTPAGLVSAMEGLVLLFFLCAAVATRIH
ITVRSEAHA
>SMa2373 TRm2011-2a transposase
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAELSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>SMa2237 hypothetical protein
MATSFQVSSDLLPSSGHTPTFSAAGLGTVLASSFDANLLANRSRFPFLTR
GRTLLLVADFGGHHQKQHFDTYTFLILDLAKNQEWLAQQRRFRNAILPNR
RRMSFKALNDGMRRQALVPFMQAAAGIEGYLAQFAISKAGEALFTGLAED
EVGAQLLKRWKPSVQERLLRVLHLSAFLLSGLSSPGQDVLWIIDEDDIAA
NVNLLTDLTQLFMRVMTSYFSHSLGHIRCSTTGIADDGSLVLEDLAAIAD
LTTGALGELGTGFVNEKVFPRKSLITPLPKQLTWKTSLLASWKATPGFPI
RRHTTILELGSGAKNTRISTLGWRIYERNFAAAP
>SMa2141 probable formyltetrahydrofolate deformylase
MTNVAQPFVLTLTCDDRAGIVAAVTSQLHVLGANIVESSQYWDRATNRFF
MRIAFNPSGETSADLIERGLAPVLGQFEMQGRLIDCRKPQKIVIMISRFD
HAFLHLLYQIRVGWLDAEVVAVISNHDDSRETAAWAGIPYHFLPINRENK
KKQEDRIFAIVQETEADLVVLARYMQVFSDDIAGRLFGKVINIHHSFLPS
FKGARPYHQAHEHGVKLIGATAHYVTADLDEGPIIEQETERVSHAMSVED
FVAAGRDIESRVLARAVKRHLEARVMLNGRKTVVFA
>SMa0013 hypothetical protein
MAEALARAQAGDYGSALRIWEPLARAGVARAQNNIGACFAEGLGVPENRE
LACKWLRLAAEAGDPVGQRNYAALHMQGLAGTDADYGIAAEYYRRAAEKG
DAPAQDMLSWLLLEGEIMTADPLEARRWAECAAEAGIASSMTRLGMLHHN
ALGVERDAQKAVYWWLKAAERGDADAQAMLGAACHMGAGTIRNGVTALVW
LIRATEGGSTLAKPFMGPVRDSLSPAEIQEAERRAQEPLSRRAP
>SMa2123 probable ABC transporter, permease protein
MLELFQGPLFISILAAMVRIATPLLFSAMGELVTQRAGIWNISVEGTMLL
GAVVAYVIASSTGSPWLALFVAVLACALLSVILSFVTIVLKSEQFIAGLA
LNLLASGLTLFWFQTYIIGRDPPKFAGFEAVEIQYLSDIPVLGTVLFSQR
VLTYVSFLLPLAVWFFLYRTRYGLEVRCVGENPKALDVKGLSVGSRRCLA
IMFGSVMSGFGGAFLMLGYSDRFVPDLIAGRGWLVVVAIIAGNWMPFRVV
GAIFIFALLEAVGIHAQVVGVSVPHHVFLVLPYVASLVLLAGLRSRTHQP
AALGIPYRRE
>SMa1875 Putative reverse transcriptase
MDTDHRTDKVWVLGIQRKLYQWSKANPDDQWRDMWGWLTDLRVLRHAWQR
VASNKGGRTAGVDGMTVGRIRNRSEHRFLVDLQADLRSGAYRPSPARRKL
IPKAGKPGQFRPLGIPTIRDRVVQGAAKILLEPIFEAQFWHVSYGFRPGR
NTHGALEYIRRAALPQKRDEDTRRNRLPYPWVIEGDIKGCFDNINHHHLL
ERMRKRIGDRRVVRLVGLFLKAGVLTEDQFLRTDAGTPQGGIISPLLANI
ALSAIEERYERWTYHRKKTQARRKSNGVAAAASARDSDRIAGRCVYLPVR
YADDFVVLVSGSLEEAMAEKSALADYLIKTTGLTLLPEKTKVTAMTEGFE
FLGFRFSVHWDKRYGYGPRVEIPKAKAANLRHKVKQLTQRDSISVSLGEK
LRGVNAITSGWANYYRYCVGAGRVFVALDWYIGLRLYCWLHKKRPKATPS
ELWGSKQPSRRRATRRVWREGSVEQHVLGWTPVDRYRLAWMDMPDFAMSS
GEPDA
>SMa1394 Conserved hypothetical protein
MRDFSSDHSALALNTASLGHNLEGHGAGWTSEQVIDACAERGFGSIVFWR
REIGERAVKIGERVRSAGLTVAGLCRTPFLVGTQATDVESVMDDFKRSID
MAAALGAPVLTIVVGGVHPGTKGVAESLKIVADRVAEAAPCAQASGVKLA
LEPLNPVYAGNRSCLTTLRDAVDLCDRIAAPNVGIAVDVYHVWWDTELER
QLKRAGAERIYGYHLCDWLAETNDVLLDRGMMGDGVADLKAIRSFVEGAG
YRGPCEVEIFSANDWWKRDPGEVLDVMVERFRNCC
>SMa0630 conserved hypothetical membrane protein
MASVGSFLAFEWPPLLRKIILTLLLAVIVFRVVRAIGKLLFALSGASGIA
DQPPSVFESDAARRFWLSRISIIAGFLLFGWAIASLMPALSFSNEVTRLA
AFLFGLGILVTAVEVVWRRPDKPAPLVVKSLLTIYLVVLWSAWVAGLLGL
LWLGIYSLLLPPLVRGVGDVAQALATRAQRTGPPGIVLSVVIVRGARAAV
IAAAVAWLAYIWRIRAAALAGSEPGAIVIPGLLNGIIALLVADLLWQMSK
ALIAYRMGLGPKDGSNVDELARSGRLRTLLPIFRNVLAVFIGAVTVLTIL
SGLGVQIAPLIAGAGIFGVAIGFGSQTLVKDVLSGVFYMLDDAFRVGEYI
QSGSYKGTVESFSLRSVRLRHHRGPIFTVPFGELGAVQNMSRDWVIDKMT
LNVTYDADVDLARKLIKKVGQELAADPEFAADTIEPLKMQGVDSFGDFAI
VLRMKLMTKPGAQFTIKRRAFMMIKKAFDENGIKIAVPTVHVSGGSDNAA
AAQQALNMSKAAELAANKAPVGL
>SMa0065 putative GntR-family transcriptional regulator
MQNIKPQRVLSAIEAVGPQVYRILREQIIQAELVPGARISEAEIARSLSI
SRQPVREAFIKLAEEGLVQVLPQRGTYVTRISTASVMDVRFVREAIEADI
VRQVAGEHPAAIVDELREQIARQKQVPHDDRAAFLRLDELFHHTLATAAG
RAHAWSVIESVKAQMDRVRFLSVDDLQIGRLIEQHERIVDAIAARDVGGA
EQALRMHLREILKSLPEIARSREEFFDGTG
>SMa0308 conserved hypothetical protein
MPSRRSILKSALTGLIVAPFVAPSVTFAKAPFAVVQAPGYYRLKIGSVEV
TALSDGTVALPLAKLYTNTSEHDAQNALKDAFLPDMVPTSVNAFLVNTGE
RLVLIDAGTGGYLGASLGKLVSNIEASGYKVGDIDDVILTHIHTDHSGGL
MSNGKRTFPNATLRVNEREAKFWLSSANAMTATGTVKQHFGEADQCVTPY
VKAGKFETFADNAAPVPGLGSILYAGHTPGHSAITLESESQKIVFWGDIT
HGDILQFDEPGVAIEFDIDQKAAVAARNTAFKQAVEGKYLVAGAHIAFPG
IGHVREDSKNYDWLPINYA
>SMa1408 Putative dehydratase
MSEPVRLKIGEGVAWLTISRPDKLNALDGEMVDAIVPACREIERSAAKVA
ILTGEGERSFCAGGDIDAWSNASPEAFGRHWVRDGHAAFDALARLAVPLI
AVLNGHTLGGGLELAACADLRIAEAHVRIGQPETGLGIIPGWSGTQRAVR
RFGPQLVRRMALFGHVYGAEEALALGLVDQVVATGEGRAAAEIAAAGVMK
RSARATELTKMLINAAEGEESERVVEALAGAVAAASDELQEGLSAFRERR
PARFGQ
>SMa1428 putative TetR/AcrR-family transcriptional regulator
MRTTSAQKAKSAGKPRLRDAEATKARILEAAKKEFAKNGLGGARVDVIAE
KANANKRMIYHYFDGKDHLFQTVLEDAYNGIRTAEQKLNLNDLEPKAALE
KLVRFTWEYYLKHPEFITLVNSENLHRAKHLKKSQVVRATSRKFVGMVKE
LLERGVSEGVFRPGIDPVQLNLTIAAIGYYYITNRFTGSIVFERDLMAKD
ALEERVQFNIETIMRMVCA
>SMa1163 Putative cation transport P-type ATPase
MLAGIVTWLAGRPDLSAASWAAGTAVILASLLTEIAISLGRKEFGLDLIA
ALAMGGALILGEYLTGTIVALMYTGGEALEDFAQRRARRELTALLNRVPR
TAVRYADGQLQEVSIEELNPGDRILIRRGEVVPVDGSVMDGIAVLDESAL
TGEAMPIRRRSGEPVTSGTTNAGDAFEMVASSAASDSTYAAIVRLVEAAK
AAKAPIVRLADRYALGFLAVTLLLAGGAWAISGEPVRALAVLVIATPCPL
ILAVPVAIIAGVSRCAGKGVLVKGGGALEMLARIKTVILDKTGTITDGRA
HLIELKSRTDLDPLEVLRLAASLDQGSHHVIARALVAAARERGLQLVAPS
GTRESAGSGVSGNIDGHEIAVGGWDFISERIDETAFSRDIRTWIRRDGVV
SVLAAMDGVLAGAFLLADEVRPEVGSVLRQLREAGVRRIVLATGDRTELA
ESLQSFLRLDNVAAELKPEDKTRIVEAERAAGPVMMVGDGVNDAPALAAA
DVGVAMGARGAAASSEAADVVILVDRLDRLVSAIRIAHRSRGIALQSVYM
GMALSAAGMVAAAFGYLTPVQGALLQEAIDIVAILNALRALGDPMRGWRK
TTRLEHAELLKLEAEHRALMDVVDEIRHTSARIQDLPEDEVRNQLAHLDT
LLRQRLLPHERQDDEEVYPRLRRQAGAPDAFAGMSRTHMEIQRQVHSLTS
LRQAFGEHASGPAQRYEIQRLLHGLEAITRLHFAQELEIYRSLEHE
>SMa1957 Hypothetical protein
MYVVLGANGRAGRETAHALIDLRKPVRVVLRRPEQAEKWTKLGASVAISS
IEDVPSLAAALSGASGAFLLSPPPVSGDPYKRADEIGSALAEAVRQSGLS
KVVALSSVGAQHQTGTGVIATLNSLEKHLAEVAPSTTFLRPGYFVETWGE
IAPAVIADGVLPSFLEPSQKIPMVSTIDVGRTAACLLSDEFSGKRIVELR
GPQDWSANDVAAAFGRVLDRNVVTAFVPPGARAAVLAQEGVPAKVADALL
GMYDGIANGRVEHEEGAQQRRGSVALARAVERIVREVGEDASGLAT
>SMa0717 putative calcium binding transcriptional regulatory protein
MDDKAGLGGLEDRGAPAGAAALAKGLALLDLIAEAPKPLRFADLQKMSGV
PKPTLARMLKTLMVFRLIRQDETTGAYLLGHRFVELSHRVWDKFDLVSAA
IPELDRLASELGETVALCRLDGQRVVYLEERSSGGLGVLIEVGRRVPVHC
TAAGKVLLAFQEPSFARSLAGQITYDRFTPNTITDSQALEADLVLTRARG
YAVSYEEHLAGVNSVAAPIAGRDGVPLGALVVLGPASRLDSSAIHPAGRE
LMAAARRITGTVGAVAISSGPRPRTRSGGFSDVQCVLPWGAQLGEAPVWV
EREKRLYWVDILHPAVHRFDPVTGKNESCNVAKLVSAVLPTRNEGLIVAS
QDGVEHFDFDRGDFNPFAEPEPGLPENRLNDAKVDPSGRLWVGSMRLDVS
RPTGSLYRLTSAGEVTRAGSGFTVANGLAWSPDSSTFYFVDTVPGIIYAY
DFDAREGSIANRRVFVTVPEAEGRPDGLAVDADGGVWCAIWDGWRVNRYR
PDGRLDRAVELPVPRPTSVAFGGDELATLFITSARTRLPASTLTEAPLSG
GIFACNPGARGLPTSLFGV
>SMa0198 putative ABC transporter, permease
MTFRLSSDAIRLVIPALSLSLLLAAVFWLQPRAMSYIGLNLLFNLAVPIA
LATIAQMLVMAVNDLDLSMGAFVSFVACVTATFLRDAPAIGILILAGAVA
AYAAIGVVIHLRNLPSIVVTLGMSFVWGGLAVLLLPAPGGQAPDWVRWLM
TVKPPLAPMAIVASIIIAVIAHFIVKRSSLGVLIRGVGGNQRSVERAGWS
IVAARATAYALAGLFAVLAGIALVGLTTSADANIALRYTLLSIAGVILGG
GEFTGGRVSPIGAVIGALTLTLASSFLSFLRISPDWQIGAQGAILIIVLA
LRLMLNRLERREKRR
>SMa0391 putative ABC transporter, ATP-binding protein
MGAVAAGLRPTGSVNPGAVSVKDVGMAFGDVHAVRNASFDLPKGRFLTIL
GPSGSGKTTLLRMIAGFDRPTHGEIFINGRPVSAVPPHKRAIGMVFQKLA
LFPHMTAAENVAFPLKMRRHDARTIPEKVERYLDLVRLGGYGDRRINELS
GGQQQRVAIARALVFEPDLLLLDEPLAALDRKLREEMQLEFRRIQKELGV
TTINVTHDQREALVVSDEIIIMNGGAIQQKARPVDAYRAPSNAFVANFIG
VTNFLEGRIVELTSTQAVFETNGVRLVGIAADAALAPGLSCSGALRAEQI
RIAPMGGRLDELETLVDGQVVDCIFEGDRVVYEIRVPDLAGVLMRVFDHD
PESHLQFGPGDEVRLGWNARDMHVFQK
>SMa2085 probable ABC transporter, permease protein
MPVLIRIARTLRRTLLQAVPTILGIVILNFFLLQLAPGDAADVLAGEAGS
ATEESVAALRARFGLDSPVLEQLATYLGNLAQFSLGFSPRYGMPVADLIA
QRLPGTLTLMAVALFIAILLGVVLGSLMASFAGRIPDRLLSIFSLLFYSI
PSFWIGLMLIVLFSVKLGWLPSGGAATIGSQLKGFPALLDKARYMVLPAT
SLALFYVAIYARLTRAAMLEVKNHDFVRTAYAKGLTPFGVTARHILRNAL
IPITTMAGMHIGGMLGGAVVVETVYSWPGLGRLAFEAVMGRDFTVLLGIL
LLSSLLVIIANAAVDLVHAWLDPRIGAR
>SMa1089 hypothetical protein
MILDRKWTLFAASLAVIFAFFILREHWQHALGLAPYLLLLACPLMHLFHG
HGGHGHADHRIKADDIT
>SMa0146 hypothetical protein
MSGPRSLVAPSTFPQSQRPVCHSLPGRYDRSDRFELELDERFEEEFDELF
EDELELEFEELLDEELELELLEELELELLEELELELDELLPATMISPSVR
PVWAVCVEVRSTPGNKGAYSLASAAEPTNAARPAAIALFVQFFAISSTPC
ENRTGSSVRRSNGRCAPLFHDRNF
>SMa0645 hypothetical protein
MQNDIGGELVHLEDLVLDDATKNVRTPTHELTIPLGLSAAGLRSLRGRDF
VSGSPAASSAADGTVEVTTDPDVDRSTEGRDESNGDVSVFGAEFVAIDAV
LA
>SMa1635 hypothetical protein
MFTLLAFERTVKIWIAIGSWSERGPATIPIVNLIRGNEIMTLDQILERTK
PYAIGLVVGLIAAPIIGFNAGWITTTNASTLAAETARVDALSGICSTAAG
RMATASSTDLAALKGYDNRAKRDELVAVIMADIQVPADVLGKVSTSCSRS
LS
>SMa1434 Probable ABC transporter, ATP-binding protein
MNAPDKPILRIDKLTVDFLSEGDPVRAVDDVSFDVCPGETLVILGESGSG
KSVSTGTVMGLIDCPPGDIVSGSLVFDGTDLSRLDDEGRRELNGRRIAMI
FQDPLAYLNPVYTVGRQIAEVFESHGEGEGGAVRDKVVRLLERVGIPEAD
ERIDYYPHQFSGGQRQRVMIAMAIALKPDILIADEPTTALDVSVQAQILE
LLRDLQRETGMALIMITHDLEVAAAMADRIIVMNGGKVVESGKAEDVFTN
PSHAYTRRLMSAVPHADAPKAPRNAAQGEVLLQVAHLSKHYKLGSGPFSP
KREFRAVDDVSFTLRRGETVGIVGESGSGKSSIARMLLRLNEPTSGAALF
AGEDIFELKGKALDGFRRRVQMVFQDPFGSMNPRMNVRSIISEPWAIHRD
ILPRERWNERVVELLELVGLKAEHAARHPHQFSGGQRQRIAIARALASEP
ELIVCDEAVSALDVSIQMQVIELLADLRQRLGLSYVFITHDLPIVRQFAD
RILVMQRGRIVEEGETEALFVSPQHEYTQALLRAVPQPKWLRSDPAPIAG
>SMa0018 TRm2011-2a transposase
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAELSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>SMa2143 probable 5,10-methylenetetrahydrofolate reductase
MVAALAAANGICLFENPGEAMTNPAEACTDTRVHPVDGADTTISIELAPE
QVRNFAPQADTLALGSRVFLTHLAGKPEVLQVEAAARIIEMGYVAVPHLA
ARNFKSERDYISHVEAHSRNGIDEVLFLGGNPALFPGPLGESAELLAHPV
LSDSSIRTAFVAGYPEGHPNISEARLKDALKRKLEICARRSLEPRIVSQF
AFDGAMIGAWAKRLHDDFPEVPIHVGLAGVTSLTKLIRFAAMCGVGPSIA
ALRRSVSGLLNIVADRNPADVIDAVSKAYPDTVTPLHLHFFPFGGWEKTL
TWLGDYRVARQLQAGVR
>SMa1201 hypothetical protein
MARQRTGLRRGLALVAACMLALQSVVHAFAGQPRDILPFDAFGNSLCVTG
ASHSDTAHGGGDHDKLPECCILGCAMSPALPVAPSGGTSLLEPPRSSVAV
LPPCCEVFVRRSDHEPGGARGPPPIA
>SMa0939 probable sensor histidine kinase of two-component system
MIARSQARFGKIRVSPSCLNASAFICLLLVVIFAATGICSAQGAGPVPRV
LILYPYDERIPATNIVGEAARTRLVEATSGKVEVFSEFLDLSRFPKKGHV
DRIARYLAEKYSDGRPDLVIALGEESTRFIVANRNAIAPDAKIVFGGFGN
ATAEELRLPNDVVGALTEFEIRKTVEMAIGLQPDARQVVVIAGSAEFDKA
WIAAAQEDLIGLPSDIETTYLTGLSIEEFVERVAALPPDTIVIVLTVLSD
RTGRNFVPRNSLEKIAKAASAPVYGPYSTYLDHGVVGGNAATFESTGTAV
AGLAIDALAGKAITDITVPQSYMADSRQLRRWGLSEADLPLGTAVLFKER
SLWQEYWKEIIAISAFVAAQSLVILSLVLERRRRAAAEFQARHRLLEVIH
LNQSATAGALSASIAHELNQPLGAIRINAETAALMLERPNPDLKLIQQIL
ADIRDDDQRAGDIIDRMRGLLRKRSEIDWQEFDLNDVVRSAIHILHPEAQ
RRDVTVSSVASAQGLPVRADRVHVQQVILNLAINAMDAMLQAPTAERNLM
FQTAMVERNEKVEVSISDTGSGIPSDKLSNIFDAFYTTKPTGTGLGLSIA
RVIIEIYGGKLWADNRPQGGAVFRFVLPLARPG
>SMa1597 hypothetical protein
MTISRVCGSRTEAMLTNGQEIAMTSILKSTGAVALLLLYTLTANATSLMI
SPSSIERVAPDRAAVFHLRNQMDRPISIKVRVFRWSQKGGVEKLEPTGDV
VASPISAQLSPNGNRAVRVVRVSKEPLRSEEGYRVVIDEADPTRNTPEAE
SLSARHVLPVLFRPPDVLGPEIELSLTRSDGWLMLVVENKGASRLRLSDV
TLAQGSAVIARREGFVGYVLPGLTRHWRVGREDSYSGGIVTVSANSSGGA
IGEQLVVSGR
>SMa2193 putative oxidoreductase
MRTRQGIGPSQKWDLIVIGGGIVGCSTAFYAARLGLRVLIVERDTPGSGQ
SGRNLGFVRQQGRDFRELPLAIAALRLWNDLEEDLGRKVGWLRGGNVVLA
ANDRETARQADWQSKAKDFGLDTILLTSAQTREKLPFLSDKASLCGAMFT
ASDGRAEPGRTTRAFFEAAVELGVSVILGAHVTQLELQAGNVHGVWVNDK
LYRADRVLCSAGTGSGKLLRGVGYNLPQERIRATVARTLPAPGLTLTSCI
SLPFTGLRQDEKGAFIFSVAGGEYDVRFDSWRYLHHYRETRRANPDAARV
NYFGPILRFMSSRIASPIADIAPSSEGARPAHYRVRQAQDELRQFVPQLA
ELAIEAVWAGVIDALPDVVPVMGHVYERPGLLLATGFSGHGFGLGPMAGK
IMAGLAAGHSASVDISGLSPSRFSHLSVTARLASPAG
>SMa1466 Probable ABC transporter ATP binding protein
MIRLENLTKHYGPAHDPLIAVDNVSLDLPTGEICVLLGPSGCGKTTTMKM
INRLIQPTSGKVFINGKDTSTIDPIKLRRTIGYVIQQIGLFPNKTVEENI
CVVPDLLGWDRRKSRARAKELLELVGLQPDLFLKRYPKELSGGQQQRVGV
LRALAADPPVMLMDEPFGAIDPINREAIQEEFLKMQREIRKTIIFVSHDL
DEAVKMADKIAIFRSGRLEQYAAPDELLARPANSFIEDFLGSDRALKRLR
LVSVRDAMETGFITVRSSDSVEHALERMRSSRSAAVFVLNADGAPQSLLS
EQVAELRSGTVGDHAEPVKSAVPTTGDLRQAVSIMFAHDMPLLPCVDEGG
RMAGVMSYRSIVHYLAHGAKA
>SMa0402 putative GntR-type regulator
MNTLSDSASPLYEKVKDFVLGNIGSGKWARNSRLPSENELVSALGVSRMT
VHRALRELTSEGHLRRIQGVGTFVAPPRPRSTLIEISNIITEIKTRGSRH
RAEVVVLERIARPEPELLLAFEFETVKPVDHSVVIHFENDLPVQLEERYV
NPNLVSGYIDQDFAAVATYDYLQNATPLTEVEHLISALPANAEQARLLNV
RPGDCCLILHRKTWTGPVVATVNTFTYVGSRYSLGSRYLHGGK
>SMa2023 Conservative hypothetical protein
MPANAADTIPESQAVKNVVLVHGAFADGSGWKGVYDNLTKRGYRVTIVQN
PLTSLEDDVAATRRALERQDGPVILVGHSWGGTVITETGIDPKVAGLVYV
SALSPDAGETTAQQYEGFAPAAEFVIETTKDGFGYVSPAKFKAGFAHDVS
DADVAFMRDAQVPINMSAFATKLENAAWRTKPSWAVIATEDKAFDQAMLI
HMAERIKAKITKVSASHALFMTQPAAIADTIDQAAKTVSAKKQ
>SMa0224 putative transmembrane-transport protein
MPSHAVRRGFVPMTDTSLPRTVDRTAWLGLIAILPLVLLVAMDGSILYLA
MPHVTSALMPTADQALWILDIYGFVVGSLLIAFGNIGDRYGRLKLIITGA
AVFGAGSLGAAYSQTPEQLIASRALMGLGGATLLPSGLAIVSALFPDPRL
RAQAIGIFAATFAAGFAIGPLIGGMLLRQFAWGAVFLINVPVVIGFMIGA
PILLREVRSTVGGSIDLASLVLSFAGILLFTYSLKNAAAYGFTPTQIVAG
AAGIFALALFARRQTKLEYPLLDLGLFRDRIFSIAILTGLLSLVVWSAAG
YLSGVYLQSVLGIDVFAAALLTLPGAIVLTATCVATARIVERIGRKTALV
ATHLLIGAGVFLLLFTTTETGIAVFIASTMIAGIGYGLSFSLVADIAVSA
VPANRAGAAGSIAETSNELGNALGISLLGSLATLSFRLFGPGVAGTLDET
LDQPGLAHQSLIQAQEAFLTGMHVAIGTGGLLTLAVGMVAWLWLPSKLPE
>SMa0527 phosphoglycerate mutase, putative
MKRIILVRHGESAWNSVRRLQGQADIGLSARGEAQATALRATIEAMRPDH
VIASDLLRARHTAALLGYPHAQLSPALREIDVGDWTGRAIGDLMAEDQDA
YLGWRAGTYAPRGGERWQEFRDRVTAGLGKAVSIPGERLLVVCHGGVIRA
LLDGLLGLPPKRIIPVGPASVTVLADKPGGMRLETFNFSPDGPVFDAPD
>SMa1885 Putative membrane efflux protein
MSGTRQQGSSDMKRKFLLTAIGLVAAAAGAAFAFVFETPARDAESADPRL
APPLVRVAEATRPERAERAFTGTIAARVQSNLGFRVPGKIVERLVDVGQQ
VKAGQALMRIDETDLRLALTAKRNAVAAARAVLVQAIADERRYATLVKGG
VAATPQRYEQAKAALDTAAAQLAAAEAEARVAENETTYSVLVAGADGTVV
ATLGEPGQVVAAGQAVVQLAHAGLREALVALPETVRPAIGSEAEASVYGS
DGRRGRARMRQISDAADPQTRTYEARYVLDGDAASAPLGATVTISIRNGE
GQSEVAVPVGAVLNDGSRTGVWVVDQASATARFVPIEIKRLGEETASVTG
IELGEQVVALGAHLLKDGAPVRTELQAEVSNR
>SMa2275 hypothetical protein
MSVKSSISLTDQQDAFARSLVESGRYSSMSSVLQQGLELLRQKTETETVE
TAALGELVQRRLNGPMISATEMESRVAAMIERKRRVVRVDS
>SMa0218 putative ABC transporter, periplasmic solute-binding protein
MIIYLDPSVQFFNPVVKGAQDAAAQFGVDLDVQYANNDPVRQNDLIESAT
ASGVDGIAVAISSSDAFDESICAAVKAGIIVIGFNNDDLDGAKGNCRQAY
VGMDELASGYELGNRMIKEFGLKSGDVVFNPREIPEASFAVARGGGIEKA
MTENGIKVETVRAGLDPAEAQNIIAQFLIANPNVKALFGTGSVTSTVGAG
AIKDAGVDIPFGGFDLAVEIVNAVESGAMYATMDQQPYLQGYYPIAQIAL
AKKYGLTPTDIDTGQGAFLDKTRIGSVKPLIGSYR
>SMa2031 Putative non-heme chloroperoxidase
MAKAVLVAAIPPLMLKNDDDPEGTPMEVFDGFRTALAGNRAQFFRDVPSG
PFSGFNREGAAVHEGVLQNWWRQGMMGKRQGALRWRQGLLGNRPRPMTSR
RSPCQRRCCTVRTTRSFRSPPRL
>SMa1162 Conserved hypothetical protein
MDPRTVVLILRNIPQEPHRRGKKTHDNGNRGVVRSASRACNSAPGLLLLG
GGSLGSSQMTIQRRRRLIKRQRSSVGLALVAAISFLAGMTDAMGLMSLGD
FVSFMSGNTTRASVALIQADVARGVLLIGGLSTFVIGNAVGVMVSIRFRP
HSVLLFVSVLLACAALLADRREMQFLLLILAMGAINASVEQIEGLPVGLT
YVTGALSRFGRGLGRWVMGIRNTHWVIQIVPWLGMMAGAISGAALVQEIG
SLALWIPSAAALLLTAVAIQIPRRWQSRFIQHR
>SMa2037 Putative oxidoreductase
MWHLHTRLPHCRKGSAVEESGSDRRGNTLRPCRKSLSLHRLRQDRARRSG
RSRRHERSLTMNFDPSHSKRSFSSVGTRPIRPDGVDKVTGRARYGADFNM
PGQLVGRILRSPHAHAIIRKIDTLKAAQLPGVKAVLTAADLPDLTDGDAA
MYDILDNCMARKKALYDGHAVAAVAAIDARTAKQALKLIEVEYELLPHVT
DVDQAFAPDAPLINDTIFTTGVDPKPDRPSNVSMRSQFGHGDVDAGMARA
DFVVERTFKTEQTHQGYIEPHACVASVSSDGTADLWVCTQGHFVYRQHCA
QLLGMDASKLRVTSSEIGGGFGGKTHVWAEPVALALSRKAGRPVKLVMTR
DEVFRASGPTSATSIDVKIGALKDGTIIAADATLRYSCGPYAGAWAEVGA
MTAFACYKLENVRTVGYEVLVNRPKTAAYRAPSAPMAAFAVESAIDELAK
KVGMNPIDFRIRNAAQEGTKASYGPVYGPIGIGPTLEAVKNHPHMKAPLG
RNQGRGMACGFWFNFGGQTCTDLNIGMDGTVSLAVGTVDVGGSRASLSLV
AAEELGIDYSQVRTVIADTSSLGYNDMTDGSRGTFSSSMATISAARNAIK
ILRERAAQMWDIPVEDVTWEKGHAVAKGETHGNLPKLSLKEIAAASGTTG
GPIAGHSEIVADGAGVSFATHICDIEVDPETGATKVLRYTVVQDAGKAVH
PTYVEGQYQGGAAQGIGWALNEEYIYGNDGRLQNAGFLDYRIPVCSDLPM
IDTQILEIPNPNHPYGVRGVGETSIVPPLAAVANAVSNAVGVRMLHIPMS
PPRILAALEA
>SMa1971 Hypothetical protein
MTNSIKVVARVMITPGNEDAFETYAAELSVATRAEAGCLSYHLHRHLNQK
GVYVFVEEWASRQVWEQHMSGEAIRAFNRQLPAGTIAHIEIHPLEQIA
>SMa0341 hypothetical protein
MAKGLLGGGEMKFYPQDALTQAGGRFSQSDAPFSAHVVTDRELITGQNPA
SAPAVAQELLKRLK
>SMa1253 Hypothetical protein
MGAESNVRLLRELCLHGRHISATQLAERSGLVRNSTRNALNSLRQHGLVV
EEGTDGARLFRFNSEHPLGDSIGQLFGAESEGFKISSKG
>SMa1956 Putative LysR-family transcriptional regulator
MECIEAMREVDLRHADLNLLVVLNALLDERSVTRAAMRLGMSQPAVSRAL
ARLRALFSDALLVDGPGGYLLTSRAEDLRPLLRNTLAGVSELLDGRTFDP
MQATGSVRLLMLDLEAAVLAPRLIASLAVQAPAVDLQVVPPGLRPLEALE
ADAVDALIGVVEDAPAGVKKRKLYQDNFVTLMRAEHPTAARKLTLERFLE
LDHVVVSITGTGRAWVDEILARSGRKRRVKVRVPSFFAAVEIAARSDLVM
TLPSSLARTAADMRRFVMASPPLDLGSVVISLAWHARHQDAPRHVWLRRT
IAAAVADIGL
>SMa1998 Putative ABC transporter, ATP-binding protein
MSHLIEMAAISKAFGGSVALRDVSLALAPGTVHALMGENGAGKSTLMKIL
AGVHQPDSGEVRRAGRTVSFANPRAALEAGISTVFQELSLLPNMTIAENM
FLGREPTGCFGGIDRRRMRVGTKDALARLGLTLDPDTLVSELSIAERQFV
EIAHGIDSDADVFILDEPTAALNAADVEVLNRHIRSLREAGKAIVYISHR
MDEIFAICDVVTVLKDGQLVGTRPLSEMTPASLIAMMVGRELEDLFPERG
QGEGAAALSVSGLRLHTDSQPFSFTVRKGEIVGFAGLEGQGQQKAVRALV
GQFAPVEGTASRRGETIQLPVPKESGVRRWQALGGAFVPEDRKDEGLFLG
HSVGQNIVAALHAGRPTLKAAKRYGDVITETMRRLNIKASGPSAIVGALS
GGNQQKVLLGRYLATDADLLLIEEPTRGVDVGAKAEIYRILRDFAKAGGA
VLVLSRETIELIGLCERLYVIHGNTAVSEIRAVDATEHSILNAALSA
>SMa0467 putative ABC transporter, permease
MHRAAPSTGLTAISGGANGGSSLVVFITRRFAVTIPLLLIISFGVFALIH
IAPGDPVSSLLGARASDPATLAAIRARYHLDDSLLVQYGTWLSQVIRGDL
GVSILGNRSVTSTIADRLGVTIFLSLMSTTLVLGLGILLGALAAFRRGTG
LDRTVVMFSVFGISSPAFVTGIFFLYVFGVLLHWFPTFGAGTGFLDRAWH
LALPALALAASTMAIVVKITRAAVIEELARDYVTFARARGVSSRRILLAY
VLRNSLIPVITAAGLIVIGILAGAIYVEVTFSLPGLGALMIDAVQKRDIP
TIQGITLLFSAFVVLVNLAVDVIYTLIDPRIRFGRVGS
>SMa2295 probable penicillin-binding protein
MLPHYKQQAVIDSGLAAVGFRPENQRIYPNGSLAAHILGAVDIDNFGIAG
IEKWIDANWLADLRGAGLALKGRDLQPVELSIDLRVQYAMEQELGKAVKK
FGAAAGAGLLLDVTNGEILALASYPAFDPNNPVDAFKPDRINRVTAGVFE
MGSTFKALTTAMALESGRFDLESVVDASRPLSFGRQRIHDYRGKYRPLTV
PEAFIHSSNIVMAKMAMTLGPEALRTFLGRFGVLSQLATELPESAAPLRP
SSWSEVTTATVSFGHGIAVTPLQASMAVAALANGGRLIRPTFIKDSPVSE
RTIAESLLSANTSEAIRHLLRLNAAVGSATKADVPGYVIGGKTGTAEKVV
QGRYSKVLNVTSFMGIVPADNPRYLLLTLLDEPRGLAETYGNRTSGWNAV
PLGGALLHRILPMLLKPVFPSPSSAEAVPVATGEPSVDDSSSD
>SMa1297 Hypothetical Protein
MTDAIGLMSIGDFVSFMSGNTTRASVALVQGDAAQGLLLIGGLVSFVLGN
AAGVMISIRFRPQAALLFVSALLACAALQEGQPELRFVSLIFAMGAVNAS
VEQIEGLPVGLTYVTGALSRFGRGLGRWAMGVRNTQWIIQIVPWLGMFAG
AIMGAVLVREAGDLALWVPSLAALLLTAAAFQIPRRWQSRFIQSR
>SMa2257 possible integrase-like protein
MTIVPSTPIASRAEELDALDAILPFDRRDQLAALLTDDDVATLKHLAQEG
MGENTLRALASDLGYLEAWCRLATGDPLPWPAPEALLLKFVAHHLWDPVE
RAEDPSHGMPADVEVGLRAERLLRADGPHAPGTVQRRLTSWSILTRWRGL
SGAFGAPSLKSALRLAVRASNRPRQRKSKKAVTLDILTKLLEACAGDRLV
DARDRALLLAAFASGGRRRSEVAALRVEDLADEEPVRANPADENSPPLPC
LSIRLGRTKTTTADENEHVLLIGRPVTVLKRWLSDAQIKDGPVFRRIDQW
GNIDRRALTPQSVNLVLKARCEQAGLDPALFSAHGLRSGYLTEAANRGIP
LPEAMQQSLHKSVTQAASYYNNAERKNGRAARLIV
>SMa1874 Hypothetical protein
MTDQTREAVGAWIESRRLGERDYLFPSRVHTKPHLSTRQYSRVVERWVSS
IGLDPKRYGTHSMRRTKVAHIYKKTGNLRAVQLLLGHKKLESTVQYLGTE
VDDALAISEQVEL
>SMa1043 hypothetical protein
MKSLIKLTLATALAIGTVSGAFAQEFTKATVKKVDAKAKKVTLIHEELKS
LEMPAMTMVFRVQDDAILEKLKEGANVEFVAERVNGKLTVTQVK
>SMa1753 Putative Transporter
MLSLLPRSAVVLTDWDRKSPPTVPMQMALMEDLKAVDDDQALGDMVRRLN
SARSGFRTLMSKTTRALDDTANPPANLVSIDKRWEKPEFWLAIADALSPL
TDRNLLAAVDLGRNSTGDIEHLPADQSVNRVILVRTFWIAALVTFACAGI
GFPYAMIAASLTGWKRDLMLAAVLLPLWTSLLVRTAAWFILLQEKGLIND
LLQTLGLINAPLPLIFNRTGVVIAMTHVLLPFMVLPIYSVLITIPKNLMP
AAASLGAPPWRAFLRVLLPLSLRGLASGSLLVFISAIGYYITPALIGGPG
DQMISSIIAFYAMGSANWGMAGALGVVLLVATLLLYGVYARLSTGEPGRR
>SMa0758 hypothetical protein with local conservation
MAGARGGNDMKADVFDARALREAFGAFPTAVTAITASDPAGRPVGFTANS
FTSVSLDPPLLLVCVAKTARDYSTMTAAEHFAINILSEAQKDVSIKFARP
LEDRFAAVDWARAPNGCPIFAQVAAWFECSMHDVIEAGDHVMMVGRVTAF
KSSGLNGLGYARGGYFAPSVAAKANSSAAGGEIGAVAVLERHAALFPLGD
>SMa1452 Probable
MQLKSRVFIVTGASSGLGAAVTRMLAQEGATVLGLDLKPPAGEEPAAELG
AAVRFRNADVTNEADATAALAFAKQEFGHVHGLVNCAGTAPGEKILGRSG
PHALDSFARTVAVNLIGTFNMIRLAAEVMSQGEPDADGERGVIVNTASIA
AFDGQIGQAAYAASKGGVAALTLPAARELARFGIRVVTIAPGIFDTPMMA
GMPQDVQDALAASVPFPPRLGRAEEYAALVKHICENTMLNGEVIRLDGAL
RMAPR
>SMa2069 probable ABC transporter, sulfate-binding protein
MRTDRLAEVVKRALLVGMLQLGYIGLAQADTTILNVSYDSTRKLYKEFNA
AFAEKWQADTGETVTIQTSHGGSGKQARLVIDGLNADVVTLALEADIVAI
AQATGKIPVDWKTRLENDSAPYTSTVVFLVRKGNPKGIRDWSDLTKEGIQ
VITPNPKTSGGARWNFLAAWAWARATNNGDDAKAEEYVAQLFKHVLALDT
GAWGAMTTFVQRGLGDVLLAWENEAYLALEELGPDNFDIVTPSISIRAEP
PVALVDGNVDRKGTRKMAEAYLNYLYSDVGQKIVAKHYYRPFKPEAADQK
DIDRFADLTLVTIDEFGGWKEAQPKFFGAGGIFDRIYSPGR
>SMa0476 TRm17b putative transposase
MPLLERHIAFYGEAPRQAAADGGYASRENLRQAKAWGVRDMAFHKKSGLR
IEDMVRSRWVYRKLRNFRAGIEAGISCLKRTYGLARCTWRGLDHFKTYVW
SSVVAYNLALFARLRPT
>SMa1801 Reverse transcriptase, maturase
MTSESTTDKPFRIEKRRVYEAYKAVKANRGAAGVDGQTLEIFEKDLAANL
YKIWNRMSSGTYFPPPVRAVSIPKKAGGERVLGVPTVSDRIAQMVVKQMI
EPDLDSLFLPDSYGYRPGKSALDAVGVTRQRCWKYDWVLEFDIKGLFDNL
PHDLLLKAVRKDVKCNWALLYIERWLTAPMEKNGEVIERSRGTPQGGVVS
PILANLFLHYAFDLWMTRTHPDLPWCRYADDGLVHCQSEQQAEALKVELS
SRLAACGLQMHPTKTKIVYCKDQRRREAYPNVTFDFLGYQFRPRRVANTQ
WDEFFCGYTPAVSPTALKSMRATIKSLNIPRQTPGTLAEIAKQLNPLLRG
WIAYYGRYSRSALSTLADYVNQKLRAWIRRKFKRFQSHKTRASLFLRKLA
RENPGLFVHWKAFGTNTFT
>SMa1442 Putative GntR-family transcription factor
MGIFMPSTKKGGGRPSLSSQVADAIKAQIEAGYYAPGDKLPTEPALIDKF
GFSRTVVREAIAALRADGLVESRQGSGVFVIGPRQSDPGLKLFTGETDKI
SDIIEELELRIGIEVEAAGLAAARSSPAQEAEIQAQVERFAQLVAEGKPT
DEADFQFHMAIATATNNTRFRTFLEHVGRRMIPRVKFRTVMGGVDPLPNR
DEPILLEHREIAEAILARDPDRAREAMRRHLVTGIKRYRSLTTWRPSEKT
DAVD
>SMa0117 hypothetical protein
MKLTERQVLEQFDIVTRKQLRQWVHSGWIVPAQGERGPSFDDVDVARIRL
VCELRRDMNVNDDAVPIILALLDQLYGLRRELRTIATALLEQPDEVRQQL
KQALLSPIGQQH
>SMa0748 putative mucR family transcriptional regulatory protein
MWRRSLTESHSNELRLELTSRIVSAYLSRNVIAPSELPHLIQQTYGSLGK
TSEPTKTPATVEEQRPAVPIKKSVTDDFIVCLEDGKSFKSLKRHLMAKYA
LTPEQYREKWKLPADYPMVAPNYARKRSELARATGLGKKSAANLSPSLQA
VRSA
>SMa1299 Conserved hypothetical protein
MNFGSTSHRRVGRPREFDINQALDAAIRIFSEKGYHGTSIAELKTEMGLT
AGSIYKAFRDKRDVFVAAYDRYKQIRADLLDQALFSASTGREKIARVVGF
YAMSACGETGRRGCLAIGAAVELVLSDAEIADRVGRHNRKLVARLEGFIR
DGQLDGSIRQDVDPAATALALFSYLQGVRIAGKTAELENQMLPSAEAVLR
ILD
>SMa1400 Probable fatty acid acyl-CoA
MAESLDREERFPAELYGEMAKLGLFGIGVPEHLGGPGFDTLTYAVVMEEL
SRGYASVADQCGLVELISTLLVRHGTEGQQRMLPDVLNMSAKVAYCITEP
EAGTDVSGIRTTAERDGDGWMLNGGKIWIHNAPVADVGFVLARTDKEAGN
RGMSIFIVDLNSAGVERGPKEHKMGQRASQVGALTFTDVRLPGGALLGQE
GRGFHMMMSVLDKGRVGIAALAVGIAQAGLEAAVDYAGTRKQFGKAISDF
QGVQWLLADMAKDIEAARLLVHSAASKIDRGLDATKACSIAKCFAGDMAV
QRTADAVQVFGGSGYIRGFEVERLYRDAKITQIYEGTNQIQRMIIARELL
KKGARA
>SMa2129 conserved hypothetical protein
MKQEEYLMSTSITRRTAILGSMGAFLLSALDRTFAAGPTSLKIALVLESR
TDIGWTRTLLDALEQVKQARPDGLDISWEYTDPLWNDDAENAMRFYAEGG
EYDIIWAHGRYSDQVKKLSAEFPDIMFVVTGSGNLPLGANQYWLYKRLHE
PSYLLGMLAGRTTKSGVVGLVGTFAADDVNDQINAFLDGARSVRPDVRHR
VSFIGSWSDNALAAEHANVQIASGADVVFMLTDNFKPCQEHRIICFANIN
DQSKLAPDAIASSAIIDWQPDIKWIISEWLKHKAGAPYDGNTEPKWFSMS
QGGVDIAPYHDFDAKLPVAVKEELAATRQKIISGEFVVPLNTAEVK
>SMa1683 putative arylsulfatase
MNRRLMRSVGAFVASTVLWCTASSPQAQEARPKPNILFIVSDDTGYGDLG
PYGGGEGRGMPTPNIDRLADEGMTFFSFYAQPSCTPGRAAMQTGRIPNRS
GMTTVAFQGQGGGLPAAEWTLASVLKRGGYQTYFTGKWHLGEADYALPNA
QGYDEMKYVGLYHLNAYTYADPTWFPDMDAETRALFQKVTKGSLSGKAGG
EVTEDFKINGQYVDTPVIDGKPGVVGIPFFDGYVEKAAIEFLDAAAKKPD
QPFFINVNFMKVHQPNLPAPEFQHKSLSKSKYADSVVELDTRIGRILDKL
RETGMDKNTLVFYTTDNGAWQDVYPDAGYTPFRGTKGTVREGGNRVPAIA
FWPGKIQPGSRSHDVVGGLDLMATFASAAGVPLPDRDREDKPIIFDSYDM
TPVLLGTGESARKNWFYFTENELTPGAARVGHYKAVFNLRGDNGQATGGL
AVDTNLGWKGAQSYVAIVPQVFDLWQDPQERYDVFMNNYTEHTWSLVSIS
AAIKDLMKTYVEYPPRKLQSMGYDGPIELSKYQMLQSVREQFEKEGVRLA
MPTGN
>SMa0020 TRm2011-2b transposase
MAPLRGWAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGE
RFRIYVQQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKY
SPDLNPIEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFK
EAGYERA
>SMa2291 putative enoyl reductase
MNAPAIIPSPVFTGTSLKGLDFANRLSVAPMTRISASCAGVPGERMVRYY
QRFARGGFSLVMSEGIYTDKAYAQTYSGQPGLTDREQADAWRRAVDAVHA
AGGSIFAQLMHGGALSQGNPYRSDTIGPSAIRPKGSQMTGYRGDGPYAMP
REISEAEIAAAIEGFAGAARLAIEVAGFDGIEIHGANGYLLDQFFTDYTN
HRQDRWGGDIVARFGLSLEVLKAVRAAIGPSVPLGLRLSQGKVNDFRHKW
AEREQGAARVFEALAESPADFAHVTEFEAWRPAFEGGEISLVSLARRHAP
RLFLIANGSLHELARAEQVMESGADMIALGRGALVNPDWPHLTAARSALR
PFELSMLAPISDIKDSELAL
>SMa1890 Putative oxidoreductase
MRRPPMAETLPFTLLPPEESKVGLLATPEPRFCNLTNTVHGGWIMTMLDT
VMALAAQTTLSAGGGNTAVQEALYLSNLASKVTVVHRRDRFRAEPILQDR
LPAKPNVEVIWNHVIDEVIGEQEPRKSVTGIRIREVNTDDVKELTTHGLF
VAIGHDPATALFRGQLDMDDAGYIKVGSWLTKTTVPGGAGCRRCHRLGIP
SGNHRCRHGKHGGP
>SMa0723 conserved hypothetical protein
MAAADLAGTCDGVRGSALEKSCAFSSPTTVSASLPNGVMLECGHVDALMA
VIGALGVHFALAMLTLGGGASRPKKKQLSVITGPLTVGASAAMLCRRKDS
AKLDVPSHYFTGELLVKPRISVLTLGVADLERSLAFYRDGLGLHTPGIVG
REFEHGAVAFFDLSNGVKLAIWDQDDLAHDSGLTKAPASSTSFSIGHNVS
QRSEVDEVMEQAREAGAAIVKPAQETFYGGYAGYFTDPDGHLWEIVWNPG
NLAERTDLSPLP
>SMa1694 hypothetical protein
MTIRTVFIATHVMLLARRSVIRSYPIPTEDGGHRGGGASDKHRLSFCRTH
VGKLGKKRPLGSAGHFDRTKQVRPKQFPGGSARWPEAASDIKVEGLDGKL
FLARANGDAIFHLYPVAGAVSRLLAEPLGKAEAAKLIHAAFSDADRRRVR
RDIRILFED
>SMa1724 conserved hypothetical protein
MTYAKRMTDAVPLASDACGGGQKISHAQYSRGLSAVVTEDSDLQCIFHAV
HTAPVFVNYATHMVDRDLLNQDIVWKGYPMSKRAIIVVDLQNDYLTTGKF
PLVGIDKALENAARLVDAARRSGDLVVNIRHESPAGAPFFVSGTEGAEII
PNIAPQHGEAVVTKRYPNSFRETELASLLSSAGVDEVTVIGAMSHMCIDA
TARAASDLGYKTTVVEDACATRDLEFRGEVVPAAKVHAAYMSALAFGYGQ
VVSTRDYLAK
>SMa1916 Conserved hypothetical protein
MTQPQAFAFAILIGLMIAFIWGRLRYDLIAVLALLAAVFTGIVPHKVAFS
GFGDDIVIIVASALVVSAAVERSGVIEAFLHRVAPRVTSVRDQVVILVAA
VTVLSVFIKNIGALAMMIPVAFQMARRSKASPSSFLMPMAFGSLLGGLIT
LVGTSPNIVVSRMREELTGQPFSMFDFTPVGIGIAAAGVVFLAFGYRLLP
RDREAVPTMEKALSIKDYMTEARITGNSQVVGKSIRHLSALLDDEVMVTG
LVRNQVERLLPLPDLVLQVDDIVLLEGDPEALERGIRRARLELEGHNRPT
EAKGVDEEIAGIEVVVGPRSVLIEQTAKRLALHNRFNINLLAVSRSSQRF
TERLRDIVLRSGDVLVLKGDLVLLPTKLMELGLLPLAEREIRLGNPRERL
MPVLILAGAMAFTALGVLPVATAFFAAAVLTVLFGALSLREAYEAVDWPI
VIMLGALIPVSESIRTTGGADLIAGALAQLAGALPVFGALAMIMVVAMAA
TPFLNNAATVLVVAPIAVTLAHRLGYNPDAFLMAVAVGAACDFLTPVGHQ
CNTLVLGPGGYRFGDYWKLGLPLSCIVVVLGVPLILLFWPTV
>SMa1894 Hypothetical protein
MTYRKTDEEVSKLTPEQYRVTQQNGTERPFTGEYTDNKRPGIYVDIVSGE
PLFASADKFDSGCGWPSFTKPIVPANVNELRDNSHGMIRTEVRSAHGDSH
LGHVFPDGPQDRGGLRYCINSAALRFIPREEMEAEGYGGYFNQVEDI
>SMa0193 hypothetical protein
MLAALTSGEKPISELARPYQMTLAGAAKHVAILARAGLIERRKVGRQNIC
RLNAGNLKEANDWLAQWQRFWNVRLDALEKALKEELSQ
>SMa1259 Hypothetical Protein
MPGATLSPWTMSYFAAACLFLVVGQCAMVAGYGYPFAAVESPATLALVHV
VAIGWLGLLMTGALLQFVPVLVAAPLRAGRLALPALALLIPGLLLLVGGF
AALGGAEGVSPAMLPSGALLLAMGFGLVIIMLATTLLTVRPLPLPARFVA
VGLGALIAAVLVGGAFTLVLSGTVTNYAAIGLLLKGVPLHATLGLGGWLT
FSAIGVSYRLLPMFMLARDDMWPTSRAVWWAGAAALAIVAARIVSIAVDS
DGMEGAESVAAFLAVLAAVLYSADVLRLFRERRRKLVELNVRASYAAFAA
LFASVVMSALPTTRAAAGEGAAALVYLFVFGWLTGLGLAQLYKIVPFLTW
LECYGPVLGRVPVPRVQDLVEEKRARLWFFVYYLAVIAATLSLFAQSPAA
FRVAVLAQLCATVGLMVELVRARMLSSVATPLRLPAGAPRPHLFLPISRP
QE
>SMa0629 hypothetical protein
MNQFKIISAMDSHNFERKRASGPTKRDTLMLGREARRKTTLPLWFRGFVA
ADRGDRPRLALGLVGARTAERLRRNARQISGAHSRPLDRRLGLAERHFVA
RCNYAIQSNALVRNEFETRRREVLE
>SMa1939 Putative oxygenase
MTEHAVIIAGGGPTGLMLACELTLTGIDVAIVEKRVSPAIVGSRAGGLHS
RTIEVLDQRGIADRFLSEGQVAQVTGFAATKLDISDLPTRHPYGLGLWQN
HIERILASRADELDVTIYRGSAVSDFEQDDIGVDMKLSNGQSLRAEYLVG
CDGGRSLVRKKVGIEFSGWESTTSSLIAEVEMTEQPPMGTHHTPLGIHSF
GRLEYEIRDGEVVYKDEGPVRVLVPEQHVRGAAGEATLRELSAALIAACG
TDYGVHSPTWISRFTDMARQAASYRKGRVLLAGDAAHVHSPVGGQGLNTG
VQDAVNLGWKLAQVVSGTSPDTLLDTYHSERHPVAARVLRNTLAQVALLR
PDDRTEAARSVVAELLQMDEPRKRFGAMMSGLDIHYDLGEGHPLLGRRMP
DLNLVTASGQVRVYTLLHSARPILLNLGKPGAFDIKPWIDRVQLVDAQYA
GAWELPVFGKVTAPGAVLVRPDGYVAWVGDSTRRGLVDALWTWFGAPADY
RPGAHNEGKPL
>SMa0041 probable quinone oxidoreductase
MPEPGPHEVRIRVKALGLNRAEALLRSGAYIETATFPSGLGLEAAGFVEK
VGPGVQGFIPGDPVSVLPPKSMIRWPAYGELAIFPAALLVRHPPSLSFEE
AAAVWMQYLTAYGGLVDIGGLRRGDFVAITAASSSVGLAAIQIANMVGAI
PVAVTRTSAKRQGLLEAGAAHVIASMEEDLEAQLKRVSGQHGIRVVFDPV
GGPIFEPLAAAMAWGGILVEYGGLSPEKTPFPLFAVLSKSLTLRGYLVHE
LLADPGRLERAKAFILDGLVSGALRPIIARAFPFDQIVEAHRFLESNEQF
GKIVVTV
>SMa1657 hypothetical protein
MMKRQRSYTHCGSPRPKHILTARAHGGDSSTSTDLSELPTMSASKRWRDR
SATFVWRMATSPFSIGVYIAIAIWISFLVSADTGIPAYVLMAAQFLLVIF
VDPPKRVAQYRDRRYGRPARSPPNSRTVPHGRQHR
>SMa1057 Conserved hypothetical protein
MPITAISDKLSVSPQLSVEDIPSLRDKGFKTLINNRPHKEDTFQPNTQAE
RQEVKHCGLTYAFIPVTADTITEADVRAFQRAVDESDGPVLAHCQTGGRS
LNLYLIGEVLDGRMSADEADAFGRSRGFDTSVAAAWLKQHAARRPQVKGF
FDKRTWSVQYVVSDPETGKCAIIDPVLDFDERAGATATINADAILDYVRD
NGLTVEWILDTHPHADHFSAAQYLKEKTGAQTAIGERVVDVQKLWQKIYN
WPELATDGSQWDRLFADGEDFKIGSIDAKVLFSPGHTLASITYVIGDAAF
VHDTLFLPDSGTARADFPGGDARVLWNSIQEILALPDETRIFTGHDYQPD
GRAPRWESTVAEQKKSNPHLAGVSEKEFVALRTKRDKTLPMPKLILHALQ
VNICGGRLPEPEANGKRYLKFPLDALQGAAWE
>SMa1033 hypothetical protein
MGMVRITLQKMLVLLRLAIVMSLTVYSLPTASAAMHGAWSNPEVTQTDDH
HPEIAGGAHTHGDQKSSPDDSQKLVKTDCCKGFCVSMALAAETNTVGGLR
VTSIREFVDDARTKGELPLLHRPPNI
>SMa2339 putative siderophore biosynthesis protein
MDRGDRGQWGSTPGAGGGYAGPEVQDGRAFRSFQLHPCGPVELYEAGSIL
SASASGHRARFAFAEGELRIDDLRSGNAEGLRLACAAFEYLFATRRHLAS
IRLAGEGWNVLSRELKRRGLLVENATAISHRTIFAEMFWQVPEIWMASPS
VTFPRRDHFDGRTEHPLRPPKPAGCVYARFIPWLSGTLSLHVATLNDLPD
IHRWMNNPRVNEFWNEAGSKAAHGRYLERMFADPHTIPLIGRFNARAFSY
FEIYWAKEDVIGPFSGAGDYDRGCHVIVGEESCRGKPWFTAWLPSLLHLM
FLDDPRTERIVQEPSAAHHRQLGNLQRSGFSHTRTVDLPTKRAAIMSISR
QRFFPNRLWHPAADPDRSNS
>SMa1820 Hypothetical protein
MLKGMTMKMYLLSLGAGLLVGVIYSLLNVRSPAPPVIALVGLLGILVGEQ
AVPLVKRLMSGNPVNISWFNSHCIPHVFGELPTGAAHNPVAARDEPSARE
IS
>SMa1948 Hypothetical protein
MAIPPEAAVDNQLLDLLPEPDYNQIAPDLDYVKVTRGALIATAGAPIDHV
YYLTSGIGSLVASTPEGNKAEAGIFGSEGYIPTSAAMGVELSVHDIIMQI
EGNAYRMEFSAFRKWMDRNRNFARVMIRCIEAFSIQLTYTAISNALHGVD
ERLARWLLMCHDRVSGDEIPLTHEFISLMLAVRRPSVTTSLHVLEGNGFI
RSNRGNIIIRNRQALEEFARDAYGKPEAEYRRLMTALF
>SMa1025 hypothetical protein
MDRYRVLQPIVMRLHYRRAMVCRLKIWRGRKAMEADHCEAKDGNPDEPGN
CSQNAHRG
>SMa0079 putative ABC transporter permease
MYEFNFAPVLASFDQLLVGAWLTVRLSCAAMLIGLVVSVFCAWGKTAGPA
ILRHVIDGYIEIIRNTPFLVQIFFIFFGMPSLGLRLSPNSAALLALVVNF
GAYGTEIIRAGIESIHKGQVEAGWALGLSRPQIFRYVVMKPALRTVYPAL
TSQFIYLMLNTSVVSVISADDLAAAGNDLQSATFASFEVYIAVTLIYLAL
SVGFSALFYALEKAAFKYPLGR
>SMa0483 putative inositol monophosphatase
MPPMQFLRARRSPADNRLSIGIAIPTCRRWTCSPAGGTVPPTTIKGVQNS
MFDPPLGEFASFAHDIADLARQTISSAAGVRREPIAKSDASPVTETDRAV
EKCLRERIADHFPDHGVLGEEFGAEGLGNEFVWVIDPIDGTKAFVAGLPV
YGTLISLTRGGTPILGLIDNPMTGDRWLGVSGQPTTLNNVPIRTASTTAL
ATAFIANGNPDAFSPADKSRVESLRRITRWCVYGGSCIAYGRVADGSVDI
SIDGGLDPYDYCALVPVITGAGGCITDWQGRPLTLNSGGLCVATATDLLH
RHVLEILA
>SMa0661 hypothetical protein
MRATWAIPVAALVLLLDSSDGTAGQPGKSADQPSKYVPSVGEIMNKIDLS
RSKLWYSVKLRNWALAGYQLDQVKTGFDDVSRHHPDKSNFDFANVQKIAN
LIEEAIVARSDEQFQQSFALMTTECNDCHRAVGKPFLHVGAPKVPSPYSD
KMLEPGAFR
>SMa1078 Conserved hypothetical protein
MRFINIFTLVLVIVGGLNWGLVGLFSFDLVAAIFGVGSGLARIVYILVGL
SAAWQIIPLFSVMGSGEFAAGQNR
>SMa1301 putative transmembrane transport protein
MAAACGLIAANLYYTQPLAGPIAVDIGLPAEATGLIVTLTQIGYGLGLLL
LVPLGDLVENRRLIVTMIGVVTLALIAAGLSTTPGPFLTASLAIGVGSVA
VQMIVPFAANLAPDAARGRVVGNIMSGLMVGIMMARPISGLIAGLSSWHA
VFYISAIVMVGLGTLLWVQLPIRMPTARLSYGQLLKSMAQLLAAQPVLQR
RAAYQAFQFAAFSLFWTVTPLYLAGPRFGLGHNGIALFALAGVAGAIASP
IAGRLADKGLVRPATAFGLLSVGVAFLVTQIASEGSAIALTLLTLAAILL
DFGVTMTLVTGQRSIYELGAELRSRLNGLFMAIFFTGGAIGSALGAWAFA
SGGWWFASMIGFALPATAVAIFLTEKHGQERSLQH
>SMa0050 hypothetical protein
MQVGFYDDFAERLAAKVRKMKVDNGFEPGADTGPLISQKALARVQEHISD
AVAKGRECGLFFEPTVLMGGRWR
>SMa1660 putative acetyltransferase
MKSPTVKVMAAAEEDLAVETVMLAFAADPMARWTWPHAHQYLAAMPRMIR
AFGSRAFSNGSAFCTDDYAGTALWLSPGVHSDEEGLGAVLESTVARSLAP
ETAAIFEQMAAYHPTEPHWYLPLIGVDPAHQGKGHGDALMAYALERCDRD
HAPAYLESSNPRNIPFYRRYGFEPLGAIQFGSSPTLVPMLRRPR
>SMa0270 probable ABC transporter, ATP-binding protein
MNAALPSNAVAPPPNRAVAALAAEGLGKAYGPITVLSDVTLEVHAGEVHA
IIGENGAGKSTLMKLLSGHVVPTAGHLLLEGKSVEFRNAVEAENAGIVLV
HQEILLASDLTVAENLYLGREVGRGLLVNDKAMNSRAAELLARVGSAARP
RDRVGELPLAQRQLVQIARALLDERKVIIFDEPTAVLANDEVAALLDIVR
SLRDHGVAVLYISHRLDEVQALADRITVLRDGRMIGTWPAAGLGQREMAE
LMVGRELDMLYPHKRSATTAAPILSVTNLAIDHGSQTVSFSVSPGEVLGI
GGMVGAGRTELIEGLMGLRPSEAESIVLNGREIGSRSVRTLMDAGLVYLT
EDRKGKGLLLEEKLGPNLTLQALDTINPGVFLDKQGELSRLRKAVADYDI
RVRSMRLEASQLSGGNQQKLLLAKVMMADPSVIIIDEPTRGIDIGNKSQI
YDFIDGMVRAGKACIVISSEMPELVGLADRVLVMRAGRIVAELRGDEINE
ENVVYAATTGGCADVGEEKHDRSDSNTRTN
>SMa1643 hypothetical protein
MADRAMSELAVDAGLFLIALAAATILLMQSEAGGPARCRLHALAAPRHRE
PRQRPRSVVNWLIDRDRERFRQKRWFPVSDAGLRAARSTGIHRYGKWSLL
LSWMAIIGDPLTVAAAS
>SMa0953 AttA1-like ABC transporter, ATP binding protein
MIGQSLSLAGLQKRYGEALAVRELSLEIAAGEFVSLLGPSGSGKTTALTM
IAGFESPNAGKIAIGGRDVTFLAPNHRNIGMVFQKYALFPHLTIRQNVAF
PLRMRGRMQKTAIAQRVEEMLDLVQLSSYAERYPNQLSGGQQQRVALARA
LAFEPPVLLMDEPLGALDNKLREAMQFEIKRLQERLGATVVYVTHDQDEA
MTMSDRVAIMSNGGLIQVGTPTELYRQPKTEFVADFIGRMNFLDGDCLDC
TAEQTVVRFSERTVLRLRAAANERARKYDVGTALRVAIRPERMRLAKRGD
GGADALPGVVDAAVFIGSTYIFLVRLADRPEVSLQVQVAADGLLPFQRDD
EVDIVLDGAAMHVFPVAERCAA
>SMa1612 conserved hypothetical protein
MVVRMMSNASQNLPDDPAFLKAMIAALQAENAKMSATLQAHDQLIGELRL
RIAKLKKQVFGKSSEKIEREIQQLELALEDLLIAAAENSTKPLDEVDAAV
PAAPVASRPEKTMRRRPRVSEKAARERKELDPGTCCPDCGGELRLVGEDV
SEILDMIAAQMKVIEVARLKKSCRCCEKMVQLPAPSRPIPGSMAGAGLLA
YILVSKFDDHLPLYRLNEIFARMGADIPDSTLVDWCGRAMQVLQPLIERI
EAVVMSSDLLHADDTPIRVLDRSLRDKGLGKGVKKGRIWTYVRDQRPWAG
AAPPGAVYYFAPDWKEEHVHHHLRQTSGILQADGYKGYGKLYEPGADGIG
RFREAACWAHWRRDFHDIVDLEQIRDRARGSRPYRRALRHRARHCRPACR
YPPCCSSEAQHSKGRSLARLG
>SMa1850 Putative oxidoreductase
MKFLSYWHDTAPAFAGAAQGSVEGHFDVAVIGGGFTGLAAARQLAKAGSK
VVVLEAEKVGWGASGRNGGHLNNGLAHSYLAAKAELGKERAIALYKALDD
SIDTIEALIAEEGIDCSLRRAGKLKLASKPQHFETIARNFEAVHAEVDPD
TALLSADDLKQELGAPFFGAMLSKKSAMMHMGRYVVGLAEAATRHGATIF
EQAAVTEHRQEGGGRHALKTTRGNVTADAVLVATGAYTPSAFGYFRRRII
AVGSFIIATRPLTAEEIEATMPGNRTCVTSMNIGNYFRLSPDSRLIFGGR
ARFSATSDQRSDAKSGAILKASLAEIFPQLANVDIDYCWGGLADMTKDRF
PRAGYHDGVWYAMGYSGHGAQLSTHLGMIIADAILGRPDRNPLKGFEWPA
VPGHFGKPWFLPLVGMYYKMLDRVR
>SMa0525 putative ABC-type iron transport system protein
MIRVPPSPSGRGASALSDILMSTQSRNRSFKDAFLSPGLGLKAAVLVLLT
ALVAAPLLKVFGATLAPGAWSAWSDVLASNLSRNLFWLPLANTMILGAGV
ATGCVLVGGFLAWLVVMTDVPFRRTIGLLATLPFMIPSFATALAWGSLFR
NARVGGQIGFLEGLGFSVPDWLAWGMVPTLVVLMAHYYSLAFTVIAAALA
TVNSDLVEAAQMTGAGRRRIFLGIVLPVALPALVAGASLTFAGAVSNFAA
PALLGLPVRMQTLATRLYGMIEIGQAERGYVLAILLILVSAFFLWAGNRV
ISGRRSYATITGKGGRSKRFALGTARLPLFVAAASICVLTTVVPVVILIA
SSLAPSSSALFSDWSLHYWIGASDPAIARGQAGIWNNPLILSATGVTVGL
GVTVAFSASLVGLLVAFVLARSRSGFLSAAINQISFLPLLVPGIAFGAAY
VALLGAPIGPLPALYGTFLLLVIAATAYLVPFAVQTGRAVIQQVSGDLDE
SARMTGAGFLRRLFAITVPLAIRGLSAGALIVFVKIVRDLSLVVLLFTPT
MPLLSVLAYRYASDGFTQFANALTVVVLVISVAATLFANRLQAKSQPWLQ
S
>SMa2034 Hypothetical protein
MVEVTLWGSLAAAAEGNSKVEIEAKNIRELFARLSERFPRLEPLMARGIA
VAIDGTIYRDTWSKELPTGAEIYLLPRLAGG
>SMa1146 Conserved hypothetical protein
MSIRNLHHALDPTSLAIIGASDRDGSLGRVVIENVIRAGFEGEIWPINPK
HDQVAGHRCYRRVADVPGVPDLAVIVTPPQTVPALIHDLGIRGTRAAVII
TAGISADQDLRQAMLDAAKPFLLRIIGPNTVGLIVPSAKLNASFAHLQAQ
PGGIALLSQSGAIATSLIDWAADNDVGFSKVVSLGDMADADAGDFLDLLA
GDPETHAIVMYLEAISNPRKFLSAARAAARVKPIVAIKAGRHAEAAKAAA
THTGALSGADRVVDAALRRAGILRIEGLGELFDATETIARFPPLEHSRVA
IVTNGGGAGVLAVDRLIDFGCALADLSPETVGTLDRNLPANWSRANPVDI
IGDAPPHRYKTAVETIVRDVGVDILIVMNCPTGLASPVDAAHAIASLAQS
GTISGKPVLTCWLGGRTAREGRTLLQQAGLANYDTPSDVALAASYLAKWS
KAQQALVRVPEGRDDEVHCNRDLGRSVLQRVAAEGRRMLNEPEAKAVLAA
YGIPVPQTIIATSPKEAEAIAGLLLAGAPKLVVKLISKSITHKSDVGGVV
LDILSPVAAREAAEAIVARLKAHDPIAVVDGFAVQPMIERKHAWELLLGV
TRDPIFGPVVLFGSGGVSVEVVADTAVALPPLDAVLAGDLIDETRVGKLL
AGFRNEPAADRAAICKALTALSQLIVDFPCVLSMDINPLVASAEGVIALD
GRIEINPRAVTRPGPNRDLAIRPYPSEWQKQVTLAERRYHLRPIRPADAA
LYPDFLAKTSPADIRFRFLSSRKRFQDQMLVRLTQIDYEREMAFVALDSE
TGELVGISRLYADPDHEVAEYGLLIRTDLQGHGLGWALLAYLREYASADG
LKRIEGLILGDNAKMLKLCREFGFSISTHPGDATLRIATLALQSQASVS
>SMa1817 Hypothetical protein
MTTISKETRGAAYARPLLTSPPIRFLALLALCSAYIQGPLMKIYDFEGAI
AEMNHFGLTPAPLFAVGVIVFELAMSALILLGIFRKVAALCLAVFTVAAT
FLAFRFWELPSGMERMMATNGFFEHLGLAGGFVLVAWHDLHERTVSGKVR
AL
>SMa0058 conserved hypothetical protein
MKPIPDFRSKDFLLAHMREIMDFYHPICLNEEDGGYYNEYRDDGFITDRK
TQHLVSTTRFIFNYATAAVLFERPDFAEAAAHGVRYLDEVHRDPEHGGYY
WLMRGRDAVDATKHCYGHAFVLLAYATAMKAGIPGTGARVSQTWDLLENR
FWEPDRELYKDEVSRDWGATSPYRGQNANMHMTEAMLAAYEATGEIRYLD
RAETLARRICVELAANTQDVVWEHYRQDWSVDWDYNKDDPKHLFRPYGYQ
PGHMTEWTKLLLILERYRPQDWLLPKALLLYETALAKSADLEFGGMHYSY
GPEGKLYDLDKYHWVHCETIAAAAALAGRTGRERYWQDYDRLWRYSWRHL
IDHEYGCWFRILSPDGVKQSDIKSPSGKTDYHPFGACYEILRVLGEAQ
>SMa0596 hypothetical protein
MCVTKITCRLFLAWQMQFASTVNMLSLSRLSCCRIGLIFQINDFDIEFAT
PDLEGADIALAVNGFKLFKLREG
>SMa1793 Hypothetical protein
MEAVNRRSTLALGLTMAATPLIAWVTPAAAQTYGPDEGEEIGPGVRVVAL
GERASVIPAYKMVKLRDVVIQAGAKTPDNVMTNDMLCHMTEGELSVVQNE
KKFTVKKGDVWTCAKADTTEGTQNTSNSVAIMRIIDLMTS
>SMa1427 putative sugar ABC transporter, periplasmic solute-binding protein
MRMSTFIKNFIERETISRRHFLLASAAGLGAAVLPNPLGGSARAALTDPA
IAWSYRDRASAYWNSVVSGGEAFVESLGKPKSALVSLINEGSSEKSLADI
KAFLAKNNGNCALACDANDSPNARPIVEAVAAAGGYISTIWNKTDDLHPW
DFGDNYVSHMTWSDEKPAEETARILFEAMGGEGGVVHLGGIAANNPAVER
LNGLKNALKDFPNIELLDAQPADWDTQKGAALMSSFLTRYGDRIKGVHCA
NDNIAYGVIEALRAEGIEGMPIVSYDGNPEAVQMVMDGQLLATVFTNPHW
GGGITSALAYHAATGSFKPSEEPKEHREFYGPTVLVSKKDAEEFKAKYLD
SVPKYDWADFWGPSNGQIQYRS
>SMa1029 TRm1b transposase
MKSVCETLGVARSNIAARAAGSPSRARGRPPLPDRELVEDIKAVIADMPT
YGYRRVHAILRRNARKLGRSWPNAKRVYRVMKLHNLLLVRHTGAVDNRLH
EGQVAVERSNIRWCSDGFEIGCDNKEKVRVAFALDCCDREAIAHVATTEG
IKSQDVQDLVITAVENRFGRINMLSEPIEWLTDNGSCFIAKDTASLLRDI
GMEPCTTPVRSPQSNGMAEAFVKTFKRDYVAVNPTPDAETVMAQLPFWFE
HYNNLHPHSALGYQSPREFISSQSQT
>SMa1058 hypothetical protein
MTGSGFHINIKYNIMYTNKHSRSGRELCACPVVPGGRGETTMATTANRVS
AQTSDEINRLLRWQMEERLAYYETHADEIDTRLAELDREWDIERTLEANA
STLAFTGTMLAATGDRRWLALPAIVTGFLFQHAIQGWCPPLPILRRLGFR
TAEEINQERYALKALRGDFEAQSDNKLDAVLRAVGIRRGGA
>SMa1654 putative GntR-family transcriptional regulator
MFKDDRFTEKQTSWWPIYVALRDAIVSHRLAPGTKLPEDELASIYDVSRT
VIRSALQALTHDRLAQLQPNRGTFISSPTKQEAREVFEARLLIEPKIAAI
AAGVAKKSDIAKLRKHMQAEHEAVASGGTSDAIAASAQFHIEIAEIANHT
VLTNFVRELVSSSSLVVALYWKKRETTCESHAHAALVEAIAEGNAVQSAE
LMKSHLEDVLSGLDGGLAATKQEGLADILRSS
>SMa1166 Putative hydrolase protein
MAAMGRIGWTLSAISIAIAVAGGMVFLSYSNDIDRARSAVANGARVANTA
AGPIEYAERGEGTPLLSIHGAGGGWDQGLTNVADLVGRGFRVIAPSRFGY
LGTPIPADASPSAQADAHVALLSKLEINKTVVVGVSAGARSAIELALRHP
DKVSALVLIVPGTYAPESPVMLEGSRGSAFAFWLVNAGADFAWCATEKIA
PSVLIRFLGVPPELVEAAPAQDRNRVMAIIRGVEPLSRRFPGINMDSAPD
LHRLPLEKIAAPTLVVSAQDDLFNTLPAAIFAARSSPGAKLVVYDTGGHL
LVGQGGKVKKVVSDFLAQTGTMQPFGSGAGTSVRPKAPAPTVSLTRS
>SMa0653 hypothetical protein
MARRNSQNRRGALRLRAGTGAHNSFHLRAHSPAGSASLPIASFGKNHAVI
RISKRHHGLVPAHPPANESQPVTVLGPQAAGNKDAVFVISIGKTLQQALP
QETAASTLIVAVSDQIHELIPAVPPRGDQVHEPFAPVAAVADQFHQVIRA
VPPRGKEVPKPLEPVAAVAHQIAAVSEKI
>SMa1065 hypothetical protein
MKAHVLATALLISVAADSRAIAADLQQPFIVPEANPASTEGWAFAATPYF
WGAGILGDVEQFGLPAVHLESDFGDVLKDLDFGFMAVGEARYDRFSIFGD
IIYTKVSSGAATPLGVVAERVEATSETFAGLAAAGYAVFQDGRSNIDIVA
GARVWLASTEISFSGGVLGRISGRDSATWVDAMAAFRGRYFLTDHFYLNA
WGLVGAGQADLDWDLAAGVGYEFNDRISTVAGYRALGIDYNNDRFVFDVV
QQGPILGLLVRF
>SMa1658 hypothetical protein
MHGLSRGCWRSVLVRSHNVSSGTNKGLRGTGTGNVIFVEAKTRTLVSLRR
SLQRGNRVFANFWIRLVQIEATADHCAAATGVTSSGRSDSLAQPRQSRPC
RPGQLDRGSGFAARGVTLVGP
>SMa0486 putative cyclodeaminase
MTKVFGIEEIRAATASMDLTSVMEAGFVAYSDGKVVVPPVGELLFEEPPG
DTHIKYGYIRDDRVFVIKIASGFYGNAALGLPSSSGLMLVFSQKTGFLES
VLLDEGYLTNVRTALAGRIAARYLAPKEVKAIGVFGTGTMARMQVTYLST
ETDCQNIVAWGRSDDSLRRYCDDMAALGYHVTTTRDAREVAEACNLIIMT
TPSTKPLLMASQLQPGTHITAMGSDTPEKQELDSLILARADIVVADSLEQ
CLSRGEIFHAVRAGHIAAAQVRELGGQIRAGTRVRTSQDQITVADLTGVA
VQDIQIAKAVCERLSS
>SMa0203 putative ABC transporter
MQHGGFLQKRQAPQPAAIALPVSARGSAAVLRKPAACTAKPANILPPCAC
TGPPISRMLPLLRGGSSVQEQSHSNRLAGTGEAIACQAIMRERDMTIRKM
LLASAAITCAAMPASAFADTSAKKIALSNNYAGNSWRQAMLTSWEKVTGE
AVKAGIVASADAFTTAENQATEQAAQIQNMILQGYDAIVLNAASPTALNG
AVKEACDAGITVVSFDGIVTEPCAWRIAVDFKEMGRSQVEYLSNKLPDGG
NLLEIRGLAGVFVDDEISAGIHEGVKQFPQFKIAGSVHGDWAQDVAQKAV
AGILPSLPDIVGVVTQGGDGYGAAQAIAATDRKMPIIVMGNREDELKWWK
EQKDANGYETMSVSIAPGVSTLAFWVAQQILDGKEVKKDLVVPFLRIDQD
NLEANLANTQAGGVANVEYTQEDAIKVIESAK
>SMa1878 Putative transposase
MAENALTAVIQEAYIQGISTRSVDDLVKGHGYEWHLQEPGQPAVRGDRRQ
G
>SMa0886 hypothetical protein with local similarity
MPRSDLRQTTFRSLQGHLVADPRDIRRVHADDRAAIARRGLSRCFRNPGG
HGDSHRHRPGNPGKDQSDDRAQRLRRHLYNKFLAKMASGQNKPDGQFVIT
PKNGPAFVEQLPIKKFQGVGPATAEKMHRLGIETRADLKEQTLEFLVEHF
GESGPCFYGIARGIDNCQVKADRVRKSVGAEDTSSEDIHSFEAAREGLQP
LIEKVWSYCEANEISAKTVPLKVKYADFPDHPEQDGCGTFANDR
>SMa2345 hypothetical protein
MRDYAAAAQRHFGSGPAQRSLGTRLVVVDNRVIGGEIEKSFGLDHRSVLH
QEYPDELISVRRSRSRGCRSVEPDDDRVSIGPDMMHPWHQRGCETADQGS
RGPLDEILDATVSAGHGSGTVDCPDDIWREKLCKDVAPCSPFLECRAHRG
SILRLDIGWRDGIGMRRSNAAHSRRSDQQHADAAGEAIWQ
>SMa1339 probable ABC transporter, permease protein
MAQVRPAPGSGRRSTLATVVMTSLDSKSRAASRGLSDIKIRNLFIIPTIL
FLIVFNIFPLIYSLGYSFTDFRASTNAPATFVGLQNYRELLNDPFIWANF
AITAKYVIVSVTGQVVVGFGTAMLLNREIPFKGLITTLLLLPMMLSMAVV
GLFWKLLYDPSFGIINYALGLGSFEWLANPEMALYAVAITDIWMWSPFVM
LLSLAGLSAVPRHLYEAAAIDRAGPFYTFFRITLPLVAPILMIAIIFRTM
EAFKTFDLAYILTSQPTTEVISIRLYKMAFQEWQTGRSCALAYIVLIMVL
AITNIYVKYLNRVKER
>SMA2213 putative aldehyde
MQNLPKLSMYIDGQWVAPASGEYIETVDPFTARPWALVPRGNAEDADRAV
RAAHRAFSQGPWGKMHPTERGRIIQRFAALIEEHADALADIEVRDNGRLL
AEMTHQIRYIPRWYHYYAGFADKIEGTLHPCDKPALSFSRHEPLGVCVGI
VPWNAPLLLFSLKAAPALAAGNTLVMKPAEFTSATALKLMELVEKAGFPT
GVINVVTGYGPEVGEPLVTHPLTRHVGFTGSTKTGAHLYSLAAKDVKRVS
LELGGKSPNIVFGDADLDNAVRGVVGGIFGAVGQTCIAGSRLLVHRSIHD
EFLEKLAVFTKTARIGDPRKVETQIGPIANSMQFEKVLGYIDIARREGAE
LILGGGRPDLEECGTGYFIEPTIFAGVSNDMRIAREEVFGPVLSAIVFDE
PEEALAIANDSEFGLGAGVWTSDMRLALKMSERLEAGSVWVNTYRDISYT
TPFGGYKKSGIGRENGVAGIYEYLQTKAVWLSTAEEIANPFVIG
>SMa0081 putative ABC transporter permease
MIRAFGWNEFLVIVAAAQWTIALSAIAFAGGSVGGLLVALMRVSETRALR
LFATGFIRVFQGTPLLMQLFLVFFGMNIFGFAINPWIAATIALALHASAF
LGEIWRGCIEAVPKGQREAATALGLRYFRSMRHVILPQAARIAVAPTVGF
LVQLIKGTSLASIIGFTELTRQGQIINNATFSPFLVFGTVAAVYFLLCWP
LSLIARRMETRFSRATAR
>SMa2297 hypothetical protein
MWWHHACLQGVENMKSTLLKIVTTGLLVLAPAIAQAAEGFATANVNMRAG
PSTAYPAVTVIPAGESIEIYGCLADVPWCDVEFYDGRGWVHGRYIQALYQ
QRRISVGPRYYRPLGIPVVVFSFGSYWDRHYRDRDFYRDRDRWRRGPDFY
RSPDRRAEPYRPPGRRPGFEPRPAPRPEFDRGLERSPDFSRTPERRRDFD
DRPDRLPGLDRAPSRRPNVDTQPDSDRELRNRGDRNRQNFERRGDNGDRV
IRRGDDNRNRGGDSDRRRPQRPVCQPGEPGCPN
>SMa2347 conserved hypothetical protein
MTKWASCNIVCYMSESDRRMRFPSFEGPAFTAAHASSYVEGTSRKVPGLA
ALHRMTSMLVAERAPVQARVLVLGAGGGMELKALADENSDWSFCGIDPSA
DMLRVAEQTVGPHLLRVHLQQGYIGAAPEGPFDAAVCLLTLHFVGRAQRL
DTLEQIRRRLVPGAPFVVAHISFPQSEPERSTWIARHVAFGGTASGEAES
ARQAIATKLSVLSPEEDEAVLRKAGFSDVRLFYAAMTFRGWVGYA
>SMa0887 hypothetical protein
MNAELSRLVAGGRHHAAGGRIADGDRQAPQVGIAALFDRCVEGIHVHMHD
FARPLAAFVVCHGLASIRTICIAQTWEVRAAFVTLRRLYRLAAHRHAPAD
HSEGHASAKRFHQQLALKRAVAMTWAQMNSTIGFRRFFSFGARSFIGCRG
RMRKIPIPAIDLNPLAEFSDGQCRSTSARHMRIDSCSL
>SMa1397 Conserved hypothetical protein
MKIALPDAEGRLAEYSLSGNAIAGAALGPGRARVVYSAAHVVADPFTANE
PSGRATVDWPKTLEFRRYLAGLGLGIAEAMDTAQRGMGLDWAGALELIRR
TKAELPDALVANGCGTDHLDLSRSHSIDDVRRAYLDQVEAVQKLGGRIIL
MASRALVRAARGPDDYISVYSDVLDACDHPVILHWLGDMFDPQLAGYWGS
QTFQPAMQTALAVIGANVRKVDGIKISLLDKDKEIVMRRLLPAGVKMYTG
DDFNYPELIEGDEQGFSHALLGIFDPLAPAAAFAVQRLGEGDVSAFRATL
DPTVPLARLIFRAPTQHYKTGVVFLAWLNGFQDHFVMLNGAQAMRPLPYF
TEVFRLADQCGLLRDPEVAVTRMKRLLAVYGV
>SMa0799 putative ABC transporter, periplasmic solute-binding protein
MPATAADLVFTSWGGTTQDAQKIAWAEKFTEKTGINVLQDGPTDYGKLKA
MVEANAVTWDVVDVEGDYAAQAGKKGLLEKLDFSVIDKSKLDPRFVTDYS
VGSFYYSFVIGCNKDAVDACPKTWADLFDAQKFPGMRAFYKWSAPGVIEA
ALLADGVSPDKLYPLDLDRAFKKLDTIKSDIIWWSGGAQSQQLLASAEAP
FGSVWNGRMTALAQSGINVETSWEQNITAADALVVPKGSRNKEAAMQFIA
LATSPEAQADLAKITGYAPINLDSPKMMDPELANTLPDAQTASQVNADMN
YWAENRDAIGERWYAWQAK
>SMa0326 putative
MSTQNPKVWLVTGCSTGFGRYIAEHLLEVGEKVVVTARKADKIADLEQKG
DALILPLDVIDRDQCQKVVDAAEAHFGRIDVLINNAGIGFFGAIEETDES
NARRLFDVNFFGTANTIHSVLPHMRARRSGTIVNLTSIGGLVGYTGVGYY
CATKFAVEGLSDTLRNEVAPLGINVMTVEPSAFRTEWAGSSNEVSASIED
YEATAGEARRAYHTSVGKQAGDPARAAKAIREAVLAQQPPHHLPLGNDAA
DAALKKAEDLKANVLAWEALSRSADFPAN
>SMa2012 Hypothetical protein
MYRRERQGLPLTAPPIGTARRPVSDRFHWAATLLGNACRREPFDDAAPIA
PDCRIFYLPLQDTHLVRSSSRRRFLGEARMRKKGSRNGVTALAVAGTNVV
VLGWDMSEKDIRTRGILGFAIQRTRHEDGEKIWLSGLKTFESVDPHPDPG
VPVSSFWHPLQTFQWSDYTPSPGKKYTYRIVAMGGQPGALVEAADVSLEV
TTERIDQGKHAIFFNRGAIASQEYARRFQNLAPNQVGQAAYDWLSRGLVE
GLEAFLSQAGQGDELYGAIFEFENKRIHVAIRAAHDRGAKIKILYDGDSQ
REGNEDALKGSGIAGLTKARTRSGQFAHNKFFVLRRAGKFSEVWTGSTNL
SDNGIFGHSNNAHIIRDQKIAEAYVAYWEVLNKDKTVRPTATASTAISPT
PPQQINTSGDTVAVFSPRTDLAALDWYAQLAGNAERALFTTFAFGMNSRF
VTVYDQTDDVLRFALMEKKGNGRNYKVQAAEVDRIRKRPNTTVAVGNYIT
TNAFDRWLKEIDRVQDDVHVRFVHTKYMLIDPLGSKPIVIVGSANFSKAS
TDTNDENMLVIEDNDAVSDIYLGEFMRLFSHYAFRESLTFKKSNKPADIL
RRKHLKEDHSWIDGDGGNSGYFVQGFDRALRRLYFSGQ
>SMa1742 putative iron uptake protein
MTAPSSTLAVVIAHRRKRARRHHAIIATLLTLVAVTFGVTLSIGQSITPP
SDVLRVLLGEPVPGASFTVGQLRLPRAVLSILAGLCFGLGGVAFQVMLRN
PLASPDIIGITSGAGAAAVFAIVVLSMTGPMVSVIAVVAGLGVALLVYAL
SFRNGVAGTRLILVGIGVSAMLQSVIAYILQSAPAWNLQEAMRWLTGSVN
GAQLGQALPLLLALIFFGGLLLVRGRDLETLRLGDDTAAALGTRVSNTRM
LVIVAAVGLIASATAASGPIAFVAFLSGPIAGRIVRNDGSVLIPSALTGA
VLVLAADYVGQHLLPSRYPVGVVTGALGAPYLLYLIVRINRIGGS
>SMa1349 Putative GntR-family transcriptional regulator
MKENNLLSDLAAHLFSTSSGNGRTPSERELAEHFAVSRGQIREALAILEA
MRIVERRPKSGIYLTTTEASVEAIALFARAGVPLDPIVIYETVELRKIHE
IKAAELACARATEENYERLREILAASETKIAAGEGLAREDRDFHLEIVRA
TKNSVFHRICSVYYTMGEQRLPIYFADIARSRRSHEEHIRIYEALLARDG
NLAQALMNAHLQGAESYWKGLIGGPATAAE
>SMa1178 hypothetical protein
MRQINLIKFNSVQCVSGPEDRAQIAHLARATGRAHMPQKRTEPWVMASMT
ELLSFARAMEQEAVDGYVALAARMRAEGRPDLAAVFERLIAEEEGHLGKV
DQWLGERAPQPVSLLEPFFDDEGAGVVAPELLTSYRAFSMAVRNEERAFV
FWTYAAAHAPSEEIRQAAERLAREELGHVATLRRERRRAFHEMRHAESGS
IRDDLPTLEGRLAELLIRTPAAPLGETAERLRGLANEAQERATALTATPP
GETPLLQHVPGNVTGRLVPLCEFLLDCYLDLAEHESTESARARVQTFASD
IIRCLYAVREL
>SMa1252 conserved hypothetical protein
MSGSKVAKREGYVPRGFARTGPVLFSYGFRPFFLGAAVWAVVAMTLWIAA
LVGHLEVAGSYGAHAWHAHEMLFGFAPAVLAGFLLTAVPNWTGRLPVSGW
PLAGLFTLWLAGRAALLSPDVIGIPPAAAIDGLFLPALLLICAREVIAGR
KWKDLKVLGGLLALSLANACFHFAVVTGDHVHIAMRLGISAYVALVTIIG
GRILPSFTRNWLNRAGRTEFPVPYNHFDTVAILAGIAALGAWTLAPDHPV
TAVPAFAAALLHTVRLARWRGWRTWPEMLLVILHVAYAFVPLGFAATGIG
ALGFVEELSVMHVLTVGAIAAMMLAVMTRASRGHTGYPLTASRLTAASYA
AVVLSALLRPLAEMLPEIAPTLYAVSGSAWILAFALFCIEYGPILVRKRR
AVQ
>SMa1368 Hypothetical protein
MELLISPQGIAAAVALILAGAVQGSTGFGFNMLAAPMLAIIDPAFVPGPM
LAMAIAVSAGGTVREWSDVNRQDLAFSLTGRLLAAGAAAFCLQLLSPDAF
AAVFGFGVLFAVALSLAGLRIDTTRSSLFLAGVLSGFMGTLTSIGAPPMA
MVYQNTGGARMRATLNAFFVVGGIISIGALFVAGSFGLSDLLLAATMLPF
AFLGFLLSGWGRRLVDRGHVKVIVLIVSAASALVLLLRAFS
>SMa0554 hypothetical protein
MVWREQLDLLLGALSALHIGGLQAEGAEIWRDPEGQFVWELLCHPAVIAY
YERHYPFAPPLLLRAAGDRRLPDTYRSQWQAELEQEGFDAAYRQFLHLNA
RFISNDVIGYFIELLDGFYVFDTHIDEFRRELEQPARLGGWLTRPDRWQL
LEGMASFYEFALDLDQYLAALEFPMLRGHVWLHFAYWFGNGGARMEEVAL
WLQNAVAHAAEDESIDGAELGEALARLRAPQRYPLVLIEQTAEVLGPWLE
SSGVGEQLSAGSRSL
>SMa1585 hypothetical protein
MDNLDLQAFRDSLAEPQPPVGLSPALEALWWDGKGDWNKAHERAQEHENR
AGMHVHAYLHRKEGDQSNAEYWYRRCDIVPSTLTVDEEWEELARALLKQG
>SMa1004 hypothetical protein
MRGCGPAIRRDSAGYNRWKVNPRFWNIAGEPPGFPDPGVRIVQLHYMAWV
ATSSMLPAAIQTAIAIIIRFSVSEIKPRGTLLDPPRQAHFSSTLSCFNER
LFPIQNVRQGFFCPVTNVTA
>SMa1045 hypothetical protein
MTSSLHPQWDFAREFRSIILRFVAFAILVANFLLGGNEGAEGTHSIVIVS
YLAISIAAVATARYVPGRSWLKAFFVVLDALLVTLILYAHILAGR
>SMa1832 Hypothetical protein
MVTKILTGSGSAERGANSSDGSRSPRASRLACSLSVAKGSHNEAIGAHQL
YARSTLAARAVAENVHVRAINEHPASAPPGASSTIPRFCSWSSAAFTVGT
VGPLRLDADIDEPPFYSRNCSPCRDPDASGRPEPSPDCFACLARSSPAWP
>SMa2051 possible desaturase
MDDLKYGTRNKRGDWAPNQPVKTAPLFAFPPRLKAVLKWLPHYFFPWNMI
FAASAVAYWAWVIPPVERLQTLGIEWIAWLYVVNAISVFLFYGAFELHLY
VFKRQENRFKYNGRFPADQKSKAFWFESQNLDNILRTFLSGVTIWTAVEV
AMLWAYANGYAPWLDFAENPWTLALIALVVPIIHEFHFFCIHRLIHTPLL
YKWVHSVHHNSVNPSPWSSLSMHPVEHLLYFGTAFYHLILPSNPVLMLYQ
LHYAGFGAIPGHVGFDKVEIGEDKLVDSHAYAHYLHHKYFEVNYGDTLIP
LDRWFGTWHDGSAEGEARMQERYRRRKEKLAVRKARVGIGEAAE
>SMa2311 probable ABC transporter, ATP-binding protein
MTAASSPIFATPRNRNERNTMKSLELHRIEKSYGAYHALRGIDLSVEEGE
FIVMVGPSGCGKSTLLKTIAGLETISSGQILISGRNVTKEEPGDRGIAMV
FQSYALYPHMTVAENMGFGLRMAKRPKEEIDAAVARAAKILRITDQLDKR
PKQLSGGQRQRVAIGRAITRSPDVFLFDEPLSNLDAALRTQMRVELSGLH
AELGATMIYVTHDQVEAMTMASRIVVLNRGAIEQVGSPLDLYRNPANLFV
AGFLGAPRMNFFDVTVDRVSGATAAISAPGLAPMTVSLADGVALKPGDRA
TLGIRPENIRLSPDDTTRAAISGKVRLVEHLGRETILYVDAGALQCVSSE
SGTGNVTVQIGQVTPKAADTPVSLSFHPHDAYLFAGDGQRTVTVRKAIHS
NQTKVGTI
>SMa1328 Probable MtbA protein
MAVQVPTDFRRVIVAASVGNIIEWYDFYIFGSLAAVLSVKFFEQSHPVAA
LLSTIALFTAGFLIRPLGAFLFGWMGDRVGRKYTFLITLTGMGLGTGAIG
LIPTYESIGLTAAFLLFSLRMIQGLCLGGEYGGAITYVAEHVPDERRGYY
TGWLQTSPTLGIVVSLAVIIAARTYFGSEAFDAWAWRVPFLVSFLLVGIA
IYIRLQLQETPIFQEIKAKGQMTQNPWREAFLSSNIKYVGIATIVLIGQG
VVWYSGQFWALYFLQQVSKVDPLNSAYIVGAALLLATPSLILFGWLSDII
GRKPVILGGMLLAALTYYPLYLWLGAVTQPDNINYPIAIFIIFILVCYVG
MVYGPVGAFLAEYFPGRIRYTSVSVPYHIGNGWGGGLVPFITSAAFAATG
SIGYALIYPIAVPAVCFVLAIFLMPETRRISIWQPIEPRT
>SMa1254 Hypothetical Protein
MSSGCFPSKTNASAMQLFHHGSSSRATLFRSLKSSPTTFGRKKRLSLVSR
RCSRTASTISAGSSRSCDQTNIDVAAGVILSTGEAPVQPHSGNATAKLYA
AGLYPFEDILKPSLSAPKSWPIESPKGCSELNRNRRAPSVPSSTTRPCCR
RLFSAFRVLFLTNPLLSASCVAEMCLPCRQSSLSRRTFDSAPKIAFSRYR
IVHSARKAPAV
>SMa0564 conserved hypothetical protein, possible oxidoreductase
MRKRLICTTLSILVGGCCLSGTSMAQDASTPILDRGEGAWSSSVLADGLD
YPWDIVRDGERLILTEKAGTVVIIEGGNVQRSTLQTSDPLRTEGGAGLLG
IALAPDFADSGQAFFYYSYSSGSEPANRIVAARFDGNTWRETAVLVDAIP
GHRLYNGGRIAIGPDDHLYVTTGWTENYERPQDLQSLAGKVLRLTLAGGV
PEDNPFQGSLVYSFGHRNPQGLAWNAEGELFVSEHGQAALDEINLIAPGA
NYGWPIISGDETQEGMQPPFVHSGGDTWAPSGIAFAGNELLVTALQGRGL
YVLDRQARTLQPVVSLGERVRHVLPVGDDLLLITTNRSPRGQGPSKDRLV
RLSAQN
>SMa0136 hypothetical protein
MVAALISAAATMIMFFMWLPLQKTAIDPPSQKHAGRRLVPPRCYGYHAGG
RLGEKATAAVPCNGASAGRRPTLPYLLANKFLACWSQPGRPLGDVVEGAV
LKFSRGIDQDRAATAGGLQQIAFARGSAAAAQPSRLPTLVATAAAVAVLY
FARDVFLPLAIAILLTFALAPLVSRLRRVGCPRSVAVIGTVTTAFLFLSA
FGVVIAMQVSEVAQNLPTYQYNIVEKVRTLKETGSESQILERIGRVIERI
STEISRPEPEVRASPEPTPETKPLLVEIFSPQRPIETLKNIINPLLGPLA
TTGLVIVVVIFMLLEREELRDRFIRLVGYGDLHRTTEALQDAGARVGRYL
LMQLVVNITYGIPLAIGLSLLGIPNAVLWGMLAIVLRFVPYIGPVIAAAL
PLFLAFAAAPGWSLLVWTAALFIVLELLSNNVVEPWLYGSRTGLSPLAII
VAAIFWAWLWGPVGLVLSTPLTVCLVVLGRHVPQFEFLEILLGNEPVLDP
KERLYQRLLAGDPDEATDNAEDMLQEKYLVEFYDTVAIPALLLAERDRAR
GALTNTQAAQIAQSANTLIANLEEIAGEEEGEEETSTEAQESDDDNDDAE
EYDLPPGDGKSVLCVGGRSDLDDVTASMLAQTLWIQGADAAHATHEVLKA
GNIKALQLEGRNAVVLSVLDQDFMRHAKFTVRRLKRIAPAARVGIVLWKE
DGRPGTTERDQLIESLQADFVVFGMGDAVREALSDELPRSLKLAHPKIAP
GYAMRRSKRTDTESTVKAD
>SMa1007 Copper protein, putative
MQLQEWLKREWLTFPAMPMESALPNFFTINGKSYPATDTIRMKVGRTLKV
RFVGSHTAAIHPMHIHGGPFEVAAVDGVTLRQSARSLADTVNVGPGQRFD
VLWKAQRPGKWLIHCHISHHTTNNNVEIQGGGGLMLVIDVQS
>SMa1896 Putative methionine sulfoxide reductase
MTKRAVLAGGCFWGMQDLIRKLPGVIETRVGYTGGDVPNATYRNHGTHAE
GIEIIFDPERISYRRILELFFQIHDPTTKDRQGNDIGTSYRSAIYYVDDE
QKRIAQETIADVEASGLWPGKVVTEVEPVRDFWEAEPEHQNYLERYPNGY
TCHFPRPNWVLPRRSAAE
>SMa0523 hypothetical protein
MNAAVGKLGQDQPPVGFTSYSDRRDNEDEGWALQVVNDVVPSNGIVFPAL
LALTADTKNPAASRLAIDFLMGDDSETGGPGYAPFYVAGDWPTRSDIKGH
PDAIPLADFKAWRVDPAATATIRKSVGDLVLQLQ
>SMa1104 hypothetical protein fragment
MVFARTTCAQCHSIDRVGASPLSVAPPFRDLHKLYPVETLEEALAEGIRT
GHPSMPEFRLEPDQIGDLIAFLKSLE
>SMa0481 hypothetical protein
MKSVEETYGIDNLHLTVARGYVAKLLANTRITRWPSYHRQEYLGEFQKIA
EIEAIGPQPEAPEA
>SMa0734 hypothetical protein
MSGQETLVAINGEAPKTVGAGVVHQMMGSDPMTPDLKYVSSTCSGTQKHE
DAIEKQKEAAVMAGAGMFEQEKSTQESGEARRLRFARETANLMSVSQVSV
LHCSSVA
>SMa0364 hypothetical protein
MDHATKSEATRGSSALPYSDGRSTAAPFHRVRHGLFTAAFSIIPFSFIVL
FAASAGGQSAAVNGLAPQDAVEIHLPGWHTLFGDAAKAALPNGTFTIGSA
GALELPGIGRVPAAGLHASELAKLIADRLQARSGSHDSPVTIVEPRRPAL
EGQRVSPPAKQPAMVEREAMQALGGERSSVEALLRDLAAARKEAEAAREE
ERAAHQAARDASILHRRHLAAERQRAAKLTQELTAARVDLETMKTQLKQE
TNAAHDWKAAVAMIKAAREAAARERSERAALEEELRAARREIEAARNGAQ
MVASEREEPLRHDMVPATGALDTMGVAADGAGAQARKAADTMAERESALE
QQRQRAEGLARDLTVLRRDMDSLQAKVAGAIRSKAAALRARRAGEAALVD
AKRALVEERQKIGVYARDLALALQSAAALESRAKLAAAEQAAAAQARKIA
EAAAKRAGEALALELEAGKSLARELDTARRERDAAKEELTQVLAQHTSLK
GERAWANGRELSAARQQHDGKKARTERRVEDVDEPKTRAGNHASERAKTA
RATGTRSVRELGARKGRTLETRKPLKIALPNALLPKRWLAPGLW
>SMa1780 Hypothetical protein
MKMRTFPDRIRHALLFEAIGLMIITPVAAYLFNKPIMHMGVVGIGSATIA
TIWNFVFNLGFDHGMRRMFGDTNKTFKMRLVHTVLFEIGLLAILLPPIAW
YLNMGLFETLQLDLAIVAFYLVYNFVFNVAYDRVFPVPAGRRAEYAT
>SMa2019 Putative oxidoreductase
MVARGCGVVVHVTSIQGVMPLPESTTAYAAAKAALSTYGKSIAKEISSKG
IRVVRVSPGWIATEASVRLAERLAKQAGTDLEGGKKIIMDALGGIPLGRP
ANPEEVADLIAFLAPDRASSITGTEYTIDGGTVPTA
>SMa0896 conserved hypothetical protein
MRSCRPAKGYFSPHPLPAKLVRGMICGMLDPFDPDRFIVRAEVHILGIEP
KISRTLELPITLNLAQLHEVLQAAFGWTDSHLHQFNIGGLIYGAPEFDED
GLSDSRTFEATEVRIIDLQFPYDPEENPLTILYEYDFGDNWRHLLRLERV
ARQEGVKYPRCLAGKRSGPPEDVGGTSGYADFLDAWLDPDHEEHKAMRRW
VGRKFHREACNLDEINKAIGKALRASKGDYRFRRESHRD
>SMa1381 Putative
MTMREADVIVVGGGPAGVSAAIEAAKSGLSVMLCEQRPALGGAIHRQPAE
GATPVAVLPSLRGRWQALSAELSASGVDVRTRRAFVGVDSTGAVLIEDRA
AGKVEVRRPRALILSCGAVERVRPRRGWHLPGVAAAGGLQVMLKEGRVPG
GRILLAGSGPLLLALAAQMTAAGNPPVAVIEEGDPASRPLAGVRLLAHPS
ILPDMAALMMPVLFRRVVWRRGTRLTEITQSGDMLTACLIAPNGREERIE
VDRIGLHDGLRPNDFGLPANDAAAGLVILRAGDVREVLGAHAAEADGAEA
GREAAARLAGRPPRSGANGIRRLRSLQTSLSRLFAPVHGAPILDDCPGDT
VICRCENRTISHLKAQLSGPDTVSARELRLNGRFGMGACQGRFCSEWTLS
LMSELRPTASPSSIAEMGACRWPLRPVALSSLAKGGTNADTLTEPHMEEI
SA
>SMa1844 Putative aldehyde
MTLSFDPDTLPLPVGHFIDGRLVPAEGIIDMHRPSDGKPYAGCPLADEVL
VDRAVETAKKALKATNWSGVRPRERTVVLQRWADLIESEAETLAKLEALS
STRPVGHLVAGDIAVSAEQIRFFAEFADKEGSDLVPTDDSNLGMIMTEPY
GVVGAITPWNFPVSMAAWKLGPALAAGNAVVLKPSEMTPFSTVYLAELAI
RAGLPAGLINIVLGDGPVTGTAMTGHPDIAKVSFTGSTGAGTAIMTNVAC
TGIKPMTLELGGKSPQLVFADADLDKAAAAIAQSMLSNAGQACVAGSRLI
VEECIAAPLAEAIVTRMEAAKPGPTWDETSEYSPIISKRQLNRIHGIVEA
AVEAGGECLFGGAQMDAPGYFYQPTLMAVRDSSSPAITEEIFGPVLTMQT
FADEEEALALADHPTYGLAAGLFTRDLSRALGLTRRLQAGTVWVNRYSRS
RDHILPTGGYKRSGIGKDLGREAYHANRKSKSVLISL
>SMa2365 probable ABC transporter, ATP-binding protein
MSEAILNICSVSKRFGDNLANDDISLSLGKGEIVALLGENGAGKTTLMSI
LFGHYVPDSGKVLVEGRELPPGKPRAAIRAGIGMVHQHFSLAPNLTVLEN
VMAGTERLWHLRSGTSAARRKLHRICQRFGLTVEPDARVGDLSVGEQQRV
EILKALYNDAHILVLDEPTAVLTNLEAERLFSTLKDMAREGLSLIFISHK
LDEVMAAANRIVVLRGGRKVAERLAKETNKAELAELMVGRRVARPVREPS
TPGEVVLKVADVSVSIDGVERLKSIDFSLRAGEVLGIIGVSGNGQTTLAH
LLSGTLRRDKGDLLLFGEPIGDLTVDDAVRAGIGRIPEDRNKEGAIGEMA
IWENAVLERLPRFSRYGLVDRPSGQAFAGQIIDAFDVRGGRPTTRTRLLS
GGNMQKLILGRNLMDRPRILLAAQPARGLDEGAVAAVHERLLEARRAGTA
VLLISEDLEEVMALADRIQAIVNGRLSPPIAADSASATKLGLMMAGEWNE
EHEVPHAF
>SMa1169 hypothetical protein
MNTTLVGSTRTRAGRTNMPLKLLSGMMKASGDEALLIMERDVGGDEIVLV
TKEALLDIAEPPLCNECRLQQYIAVFSDIASTKFDGKELAPDGRVAVTAA
DVSVWKVNHPEAT
>SMa1514 Putative ABC transporter permease
MDLFIAIFTGTIIAATPLIFAALGELVVEKSGVLNLGLEGMMLMGAAFAF
WAVIAGLPMPVAIAAGALAGAATSLLFGVLALTFLTNQYAAGLALAIFGS
GVSAFLGRGFGSAPIDALKRVHIPFLSDIPVVGPMFFRFDPMVYLAIGMF
GLITWFLYRTKGGLILRTIGESPETSHAIGYPVIRIRYLAVLFGGLMAGL
AGAYLSVAYTPLWVENMTAGKGWISLALVVFATWRPLRVLIGAWLFGGMT
ILQLQGQALGIAVPSELLSALPYLATIIVLVIISQNRQLLTLHFPASLAK
PFRAAS
>SMa0314 hypothetical protein
MTSQTLSHPYVGMWETTDGRIRHQLLPNGRYDEARGSRESAYRGRYEVSG
NHIEYWDDTGFTADGDFVDDVLHHAGMVLYRKQ
>SMa0662 FixK-like regulatory protein
MNTSISASWGRYINAIPPRSGEAALAPLSSFVDGQCIYSFGERADQVYQV
EFGAVRVYRLAANGRRQILAFHFGGNWFGLQSRDRHSSNAEAIGVTGVRC
ISLQEEPLFRPALFSAALDNVSAAQEHQLVIGRQSAIERVAAFLLEMSER
SGYSRRFELSMSRVDVADYLALTVETVSRSLTKLKHRGFIELHGARGIEL
VGYRALQNLCL
>SMa0017 hypothetical protein
MAAADDGRNGGGLRQRKARRGLPQARRNHLSGTTRIGKRQAQILGSGGLG
PSNEPRGGCCMPLGREIPSEDGGENSRSKRSAWGRRQKTMSLTTAFIAEL
IRAANEVERLTPYEISRLLDRSVDTIRDMRRQTGIAASHRARDVVIDLQL
ASARARDLSAAETRDVLLDAADIIRTLKIVLDGKE
>SMa0947 hypothetical protein
MANLRLCRSSCPRSGMTTDEFDRIVCAWTSGAKHPNTGQLCMKMVYQPRL
ELLAYHQSQRFQDVHCVRWGIDSMRVFSEECIPPEQVIGSSERPTSKCAT
ERPC
>SMa1113 putative cytochrome C fragment
MGLIVRAIIFLVIVGAISSVPAVQAQDIQHGRQLALEVGAACHAVLAGQA
QSPIGEAPSFEWIAATPVMTAVALNVWFTAQDHPTIVLSQTEAQDVAAHI
TV
>SMa0372 putative LysR-type regulator
MNDYKALRTFLLAAEKRNFAQVARELDMTPAAVTRAIAALEAELGVQLFV
RTTRQVSLTTDGAIFAAQLQPAVKTIEDARREVMNAHKADEGRLRISAPT
WFGKAVLPPILSAFKERYPKMSFEISLSDGLVNIVDDDYDLAIRISSQPS
DKFTIWRKIRVVPRILVAAPGSRFVDMQHPNELTPDDCLAYSGDSRRENW
VLSDGGSSITISAGRAFSANNGEVLADIAADGAGVAMLPGFHIFEHLRTG
RLVHVFKGWAPPDLWLTLYYPPYQALPPRIASFSKFFEEQVPAHMVMLD
>SMa0710 putative ABC transporter, sugar permease protein
MSLRFLDLARPSRQHWLGYLLLLPAVALVALIIVYPLFVSLDLSFQKIGM
ATLSAPRKPFTLENYHKLFASPDFWNSCWVTIKLVVVVSAACFAVGLGTA
LLVNNRFKGRTLARLFVALPWAVPEVIAVVIFAWIFDSSFGLMNWLFIKL
GITSQMINWFSEPTAAFWVVAITMIWKGYPFVSIMTLAGLQSIPEDFYNA
AKVDGANAFQRFWYITIPVLMPVLGVTSVLVVLWVFRDFSIIKVLTDGGP
LKATQTLSIMTYDQAFGFFNMGYASAIGIVTLVLCVVASLLMLGRKSQAM
Y
>SMa0631 hypothetical protein
MNSVQPIRIGFLLLAVAPILFWMLLGVSWAQQAPVGVPPEKAQQFLDLLS
DPQVKTWLEGKIPSAAAEPPAGSPVETISSWEAAIRDRINGLMGAVPRIP
EELARGAAVVSRDVNSGRPGLVVSILAVLIAVGLGAEWLIRRVFARARKS
GANENAGQEILSEIACLADVCLGQRRFIPGIRMAALTA
>SMa1814 Putative Dioxygenase
MTDVTAPTSGFAPLRQKVFAVLWIATIVGNTGSFIRDVASSWLVTDLSAA
PAAVAMVQAAATLPIFLLAIPAGVLSDILDRRKFLIVIQLLLAAASICLM
LLSATGLQSVSSLIALTFVGGIGAALMAPTWQAIVPELVARQDVKSAVAL
NSLGINISRSIGPAVGGLLLAWFGAAFTYGVDVISYVFVIVALTWWRRAA
TPDDVLSERFFGAFRAGLRFAKASRELHVVLLRAAVFFAFASAVWALLPL
VARDLLDGDAGFYGILLGAVGAGAIGGALILPRLRTRFDADALLLGAAVV
TAAVMAILSVAPPRWGAIVALLALGAAWITALTTLNGAAQAILPNWVRGR
SLAVYLTVFNGAMTAGSLAWGAVAEALNIPLTLNISAIGLAAAGLLFHFV
KLPKGESDLIASNHWPEPLVAALVDNDRGPVLILIEYKVDKTERPDFLKA
LAKLSNERRRDGAYGWGVTEDAADPERIVEWFMVESWAEHLRQHRRVSKA
DADVQQEVRRFHKGAEAPVVSHLLSINRPQ
>SMa0841 conserved hypothetical protein
MATLVATGQPTPCKVRIPSCALGMVEDSLPRDANRQHRRRDLPRVIFSGR
THRRSRDLRKSRLLHAEGAGGRAKVAELCRKHGISEATFYNWKAKHGGME
VSEAKRLKALEEENAKLKKMLSGQMLGAAALRELLQCYGLPPGVKPSPI
>SMa1337 putative sugar ABC transporter, periplasmic solute-binding protein
MHKRGGKIMRKTVAGLLAGISFMIACGTSAQSQELTIFWAEWDPANYLQE
LVNEYEAETGVKVTVETTPWADFQTKAFTEFNAKGSAYDMVVGDSQWIGA
ASEAGHYVDLTEFFNQHKLNEVMAPATVKYYSEYPANSGKYWSIPAEGDA
VGWSYRKDWFEDPKEMEAFKAKYGYDLAPPKDWKQLRDIAEFFHRPDQKR
YGIAIYTDNSYDGLVMGVENAIFSFGGELGDYSTYKVDGIINSEKNVKAL
EAYRELYGFTPPGWAKSFFVENNQAITENLAAMSMNYFAFFPALVNEASN
PNAKVTGFFANPAGPEGDQYAALGGQGISIVSYSENKEEAIKFLEWFVKD
ETQKRWAELGGYTASAKVLESEEFQNATPYNKAFYETMFRVKDFWATPEY
AELLIQMNQRIYPYVTAGQGTAKEALDALAKDWNATFKKYGRQ
>SMa0114 putative sensory transduction regulatory protein
MTERRLRVLVVEDESMIAMLIEDTLCELGHEVAATASRMQEALDIARKGQ
FDIAIIDVNLDGEPSYPVADILAERNVPFIFATGYGSKGLDTRYSNIPLL
TKPFLDSELEAVLVQISKEV
>SMa0739 hypothetical protein
MTKTEDKTAEFREARVMRTLEDGVGTAIEVDLAVMHPRQRRTYAGPEQYR
GPVEVRGVQELAKELPNPADLCYLRCRDGVAVSGDRAIRYADALLRSRLS
CGGPPATATGGKQKARRYHFEPLKHVGASRHPNWEQAFEQNT
>SMa2111 hypothetical protein
MRIGSYIIPFGYHNFLFTLRSPIMTRIVGTSADDTLDGMAEGDRIWSLDG
NDVVDGGAGDDFVDGGAGDDALTSSSGFDEFTGGEGNDRLSFIGVGGAAR
GGTGVDTLVGDYAAISDAFLFDGMHGHAAFGDLSVKGNHLYFLDIERLSL
TTGIGDDRIIATGFSFVHVHTGAGDDRVETGIGDDQIYAGDGRDLLFGGG
GDDFIHTGQGDDYVNGGNDDDRLEGEDGNDSLVGGRGNDRLDGGSGDDDV
NGGDGNDSLTGGLGSDTVTGGAGDDYLSNSFAAGDILLGGDGNDTLSAGG
EDTAYGGWSDLYGGAGDDRLHIYTDGSLGALDGGEGFDRASIALDDVSAG
FVLDASRFGSIEEFNITVKSAYRGVHLSGGNGNDRLFCFDTYREGPSGND
VLNGRNGDDILVGGSGADSLLGGDGNDSLSGEYHSDRLLGGAGADLLTGG
SDADTFIWDEASVRNDRSVDRITDFRGGDGDVLLFRGFGGTEFRDFESFL
AASRDTPEGVYVSFDGDAHGILIQNTLLADLSAADVLFA
>SMa1002 hypothetical protein
MDKSTVGDSSRPCIMYPEVKSWCRRGSRPVVHDARREPLPLVSTIGRKSP
DCTSWRGRVVANDESRHSSWLAAMTAFHPLEKFSAHAAARIVGGVFQPRR
RSRHKAAAASISERSSRRQPAPAR
>SMa1862 Putative ABC Transporter, permease protein
MNNRVLSLVLSRLLIAVITLVIVSFAVFFATTLLPGDTASILLGQAATPE
AVEGLRKAMHLDEPAIFRFLRWIVGLLQGDLGTSYANEMPVKDLIGGRFV
NTLQLAGVTALFSVPIALTLGITSAMLRGSPYDRIVTVLTIGVISVPEFM
IATSAVLVFAVYLKWLPALSFANEVTSMTDLLRVYAMPVITLTFVISAQM
IRMTRAAVIETLNTPYVEMALLKGASRSRMVFRHALPNALGPIVNAVALS
LSYLLGGVIIVETIFNYPGIAKLMVDAVSTRDLPLIQSCAMVFCLGYLLL
ITTADIIAILSNPRLR
>SMa1370 Probable ABC transporter ATP-binding protein
MTVAPVLSVSGLAIDYELSSGGLFRKRRSVNAVSDVSFDLAPGETLGLVG
ESGSGKTTVGRAVLRRIPAAQGRIVFGGEDITHLGGEPLRRLRARMQIVL
QDPYTSLNPRMKVSSIVAEPLIVHGLAASAEEARAAVAELLERVGLPGDA
ADRYPHSFSGGQRQRIGIARALALKPALIVADEPVSALDVSVRAQVVNLM
QDLQRDLGISYLFIAHDLAIVRHISHRVAIMYAGRIVEIAPRDAIYQRPI
HPYTEALLSAVPVANPKLQRARRRIVPPGESVDIASPPTGCRFHPRCPLA
TDRCREEAPQLLRKTGDHFAACWNRHA
>SMa1289 Hypothetical Protein
MEFPRPLAAPLAVVPIPLIEAAVKLMFKSLLKRHPALFDRLGEHKSKRYV
FRPVDLPLVFVVEPSHAAVSVMRKPSDCAADAVVEGPLFLLLALLEGRCD
ADALFFSRALTVGGDIEAMLALRNALDGCEIDLPRDLGASAGPLSPLVGR
TAAAIRRRALAGEKATWN
>SMa0545 hypothetical protein
METLVADASIAIKWVVEEEGTDSAVELRSRFRFAAPELLIPECANILWKK
VQRGELSRDEAVLAAKLLERSGIDFVSMTGLLEEATNLSIVLSHPAYDCT
YLIAAQRTGSRFVTADMRLLRIVSERAPGEIARLCVSLPDARNDAH
>SMa1070 TRm23a transposase
MPAERLEMRRVREILRYRFEQGLGHKSIAVRVGAAPSTVRETLRRAAIAE
LSWPLGDDVSDAVLEAALYKAAGTKTGHRRSPEPDWAQVHRELKRKHMTL
QILWDEYISRYPDGYRYSRFCDLYRGWAMKLPVTMRQDHAAGDKLFVDYA
GDTVTVVVDRLSGKTRQAHLFVAVLGASSLSYAQARWSETLPDWIECHIL
ALEYFGGAPALLVPDNAKVAIIKACHFDPQVNRTYCGMAAHYGSAVLPTR
PRRPRDKAKVEAAVRIVERWLLGRLRHRIFYSLAEVNAAIGQLLHDLNDK
RVLRRVGATRRQLFEELDRPALRPLPVERYIFAEWRIRRAGLDYHVEIER
HYYSVPYRFAREQVEARITANTIEIFHKGERIAAHRRSSGNGKHTRSPII
CPLRIAALPTGRLNGFNAKPLRWGRMLRCCASAFLPTGLIPSRAFELASA
SSASTRASAATGSMPLAGVRWRLAHEPMARCDPSSTITLTGRLPQMERRR
MNRSITPTSADLAITTKENERCLPIQHWIN
>SMa1298 Hypothetical protein
MAFLQRGACEESRYKQQGGLRSKPDRYHHSRGVPEYEADQPSDEKQPLSG
IALDERDGRASRVAAHEGHEVSDGHESYGVGHARQERYCSDKRQTNRTPP
PFYDPPPPLHRHWFLPKLKRSKESKPRPLWQALQLRFDHVQRIRRKGASQ
AMDWATAYRLPTSPPPALTSTSISERVDQINSAA
>SMa1398 Putative
MSNPTAKPVALVTGSSRGIGLAAAEALAREGFSVAINGLTADDELAAAAA
RVSRHGAPVIAVAFDVAELAAHEAALTKIEAELGPLTTLVNNAGVGVLKR
GDLLDVTEESWDRCLTVNAKAMFFLSQVFARRLLARERSPLFHSIVNVTS
SNSVAVAVQRSEYCASKAAASMVSKALAVRLGRENVAVYDVQPGLIATEM
TAPVIDSYRERAEQGLTLFPRVGEPEEVGAVIASLASGRLPYTTGQTISA
DAGMLVPRF
>SMa0626 putative membrane protein
MSNAHITIFGDSKYDLRWRKSGRNASLCEGVARKLLMAGALVAATNVPCW
SQSVADPVKQYSKESMQQALQTREQYDVHGLSFDTGQSTLQPGAKPLLDD
IATALKNFPDWSLRIVGHTDASGSAESNERLSLERANTIKAALVERGIDA
GKLLAAGAGQSRPIASNETDEGKALNRRVELMRFTDSAEAKKLLKAMSDY
LAAQKAISFAYDANLQVVTNSDQKLGLASSGTVTLSRPDKVHTTRSGGFV
DVESLFDGKTLTLLGKNVNKYTQVEIPGTVDHLVDELKDKYGLPLPAADL
LLTNAYDELIQGVYDSKDLGSGVINGAECDSLAFRKDDVDFQIWVAQGAQ
PHPCRLVITSRLVKGGPEYSVQIRDWKSGDGVTLGDFSFKNPTDAEKIDV
NDLKGTLGELPENFVGGGGK
>SMa2225 putative opine oxidase subunit
MSLWDAIVIGAGPAGIGASGLLAENGAKVLVIDEAPGPGGQIWRGVEDVS
DARARILGSDYLAGRDEVRRLRASGAELSFETQAWRVEPEGTVWLKDSHG
IRRERGRRLLIATGAMERPCPLDGWTLPGVTTVGGLQILLKREGMLPGGP
LVLIGTGPLFYLFAAQCLAAGMRDLSLIDTAAAGAIVSALRHVPAALTGK
GPSYLIKGLKLLWMLRRAGVDIYNHSGDLRIKSAADGLEVHFRMREVEHR
LSASHVGLHEGVIPETHLPRALGCRMHWSEAGGAFHPHRDIHLQSSVAGV
YIAGDAGGIGGATVALLEGRLAAMGILASLGRPIDELLLRATRRDRAAHL
AARPLLDHLYQPSPAILTPADGVLACRCEEVTCGEIRAALRAGCAGPNQV
KAFLRCGMGPCQGRMCGMTLTSLAASTHDISMGDAGFLTIRPPLRPISLG
EVADLVEP
>SMa2195 probable ABC transporter, ATP-binding protein
MSHPMIVFDRVSKSYGSMTVLNDINLEIATGEVISLIGPSGSGKSTLLRC
VNHLERIERGSITVDGELIGYRRVGNLAHELPEKLVARQRAGIGMVFQNF
NLFGHKTALENVIEAPVHVRGLARRDAEAQATALLERVGLADKMQSYPRM
LSGGQQQRVAIARALAMKPKVLLFDEPTSALDPELVGEVLDVMRSLAAEG
LTMMVVTHEMSFAREVCNRIAFMQAGEIVECGAPSEIFDKPAFQRTRSFL
SKVH
>SMa2265 hypothetical protein
MRHENSRHGIPRLGLKAFQDKCEAALKVARQMNPQFAETIMTRLLYRETL
VLHDPAPVQLPLRTFIPCSHVGHPERSSRTKVE
>SMa1562 hypothetical protein
MRRVEGERERIRQRERARLEEKSRVSLRVEPKRLFQAIVDRFRLAKQAED
GEIVRKLSMAGYRGHAAVTAFLAFRLIAPVAIFAACLLYILLVVRPEAPL
LLVVAMAAAMGALGYFSPAIFVRNRITKRQQSIRRSWPEALDLLLITVEA
GMGIESAFRKVGEEIGTQSPETAEEILLTTAELSYLQDRRQAFENLGQRT
GVEGVRAVVTSLIQAEKYGTPLGQALRVMAQENRDMRMSEAEKRAAALPP
KLTVPMILFFLPVLFAVIITPAVIQIMST
>SMa0052 hypothetical protein
MEMKVARGNFGPVASLFKFSTEEAVFEMANNTEFGLASYFYSKDISKIFR
VPEALEYGMVGINAGLIH
>SMa0169 hypothetical protein
MALDRADFYDAELARHNRQLRVAADFGADDRVLDIGCGAGQTTREAARAA
PQGEAIGVDISAEMLEEARRRSAAEGLRNAMFEQGDAQFHGFPTGSFDLC
ISRFGVMFFADPAAAFANIGRAMRPGARLVWMVWQSRERNEWSRAIRQAL
APAIAVSAGAANPFSLGDPPVATDLLSAAGFTSIDFADVQEPVFYGSDVD
AAFDALTSLYLVQDALASTNEPPDKPLQRLRDLLEGHMTPEGVFFDSRAW
IITARRAGGGG
>SMa0349 conserved hypothetical protein
MTTYAIIGAGAIGSALAERFTAAQIPAIIANSRGPASLSSVTDRFGASVK
AVELKDALQADVVILAVPYDSIADIVTQVSDWGGQIVVDASNAIDFPAFK
PRDLGGRLSTEIVSELVPGAKVVKAFNTLPAAVLAADPDKGTGSRVLFLS
GNHSDANRQVAELISSLGFAPVDLGTLAASGPIQQFGRPLVALNLLKD
>SMa2169 hypothetical protein
MLPIPEAPKLVGEVTQLLPLFDLRRSAGPIAHHLPVGGCDLVICQTRGSE
RPISAFTCATASGLAAGHTINGTIRPVLLACLSADWTVPTARVYATCSAF
VRPGSHHVSTGTDRLLQVWWLFLNN
>SMa0792 hypothetical protein with local similarity
MQTVTSHTSGEGGASLPRKTTARGTAYFEAGNGETLILIHGVGMRLEAWA
PQIEAFAKTHRVFALDMPGHGASEKIPAGSTVRDYVAWFGCFLEDLSIAR
ASIAGHSMGALISGGAVATFSDRITRVAYLNGVYRRDAAAKAAVLARAEA
IRKNGVDAEGPLERWFGEDPESQRARELTRTWLEMVDPEGYAIAYAAFAG
GDEIYADCWPSVECPALFLTGSGDPNSTPEMAKQMASVTPKGWARIVDGH
RHMVNLTAPEIVNALMSEWLTSREKPR
>SMa1455 Hypothetical protein
MRFTGTCYRAHDPRWAFKPASGDGAAIKGARFNPKGVKTLYLALSIMTAV
KEANQGFAHRIDPCVLCSYDIDCADIADLTAERGRAEHGVSLEEMACSWA
GAFAEGRRPASWAIYNRLHPRGTAGILVPSFAPGTEAGDRNLVLWTWGPD
LPHRLDVYDPSGRLPKDQLSWG
>SMa0404 putative FMN-dependent
MALVNIDDFRDLARRRRPKIFFDYIDGGSFEEETMRANRSDFSRLTLRQN
VLVEPQPQDLATAYLGKRHPLPFMLGPVGFLGLYSGKGEVKAVRAAHAAG
IPFCLSTFSIASLADLRIVTDGPLHFQLYVLEDRSLCEEFLRAAEYAGVD
TLFVTVDTAITGIRERDVRNGFRSLTRVTPDLFARLALKPRWLAEVVLAG
MPSVRAVEHRPEFGRGALEQAANLSRRIDKTLSWKDIAWLRERWAGKLVI
KGVLTPADAVRARDLGCDGVVVSNHGGRQLDGAPSTIRALPSIRATVGTD
FCLMLDGGIRRGADVIKAIALGADGVMLGRAYAYGLSAAGQAGVAEVIAI
LEREISISLALMGIASVEQLKALGAEAVSTL
>SMa2171 conserved hypothetical protein
MVVRMMSNASQNLPDDPAFLKAMIAALQAENAKMSATLQAHDQLIGELRL
RIAKLKKQVFGKSSEKIEREIQQLELALEDLLIAAAENSTKPLDEVDAAV
PAAPVASRPEKTMRRRPRVSEKAARERKELDPGTCCPDCGGELRLVGEDA
SEILDMIAAQMKVIEVARLKKSCRCCEKMVQLPAPSRPIPGSMAGAGLLA
YILVSKFDDHLPLYRLNEIFARMGADIPDSTLVDWCGRAMQVLQPLIERI
EAVVMSSDLLHADDTPIRVLDRSLRDKGLGKGVKKGRIWTYVRDQRPWAG
AAPPGAVYYFAPDWKEEHVHHHLRQTSGILQADGYKGYGKLYEPGADGIG
RFREAACWAHWRRDFHDIWTSNKSEIAREALDRIGALYDIERDIAGQPAD
IRLAARQKHSTAKVEALRVWAEAQLTRIPGKGDLASAFRYGLSRWHSFCL
FLEDGRVAMDNNAAERALRPIGIGRKNWLFAGADTGAETLARAMTIIETA
KMNGIDPQAYLADVLDRIHDHKINRLDELLPWNWAPVAIICAEAA
>SMa0585 NrtA-type periplasmic nitrate transport binding protein, probable
MTKTLFGGLTRRSVLKTTATAAMVGAARTLLPSGAFAQGAGPETAKATLG
FIALTDSAPLIIAKEKGLFDKYGMTEVEVVKQASWGTTRDNLVLGSAGAG
IDGAHILTPMPYLISTGKVTQNNQPLPMAILARLNLDAQAISVGAAYADL
KVGIDASVLKDAFAKKKAGGEAAKVAMTFPGGTHDLWIRYWLAAAGIDPD
KDVETIVVPPPQMVANMKVGTMDCFCVGEPWNEQLVNQKIGYTAVNTAEI
WAEHPEKSFAMRADWVEKNPRAVKALVMAIEEAAQWCDDMANKDELAKIV
GKRSWFNVPPKDIVDRLKGEYDYGNGKIVENSPHFMKFWREHASYPFQSH
DAWFLTENIRWGKLASDTDIKGLIAKVNREDIWREAAKDLGISDIPASTS
RGPETFFDGKVFDPANPETYLKSLAISRIA
>SMa0063 putative GntR-family transcriptional regulator
MQFGIDNNRRQIYAQVNANLSDRLEVNLIELKAEEQSSSGRSGAEGLLGG
VMTAVKTHIRENGLQVGDSLPSEGTFAEKLGVSRSVVREAYRSLAALTLI
DIGNGRRPRVAAPKADVLALVTDHAVHTDQVTIQQIFDVRRTVERRTVVL
AAMRRTDKEAAEIVALAEAMQRDFDQPVRVMEHDIAFHEAIGRASRNPMF
ALIVGSFHVVTRHTWPIGWASRGSNETRQESIDGHMTIAQAIANGDPAKG
EEAMVEHFDLTVKALLAAGVY
>SMa0976 TRm3 transposase fragment
MTDTVLEAVGEWQNRPLELCYPLVFFDAIRVKIRDEGFVRNKAVYVALAV
LADGSKEILGLWIEQTEGAKFWLRVMNELKNRGCQDILIAVVDGLKGFPE
AITAVFPQTIVQTCIVHLIRHSLEFVSYKDRRTVVPALRAIYRARDAEAG
LKALEAFEEGYWGQKYPAIAQSWRRNWEHVVPFFAFPEGVRRIIYTTNAI
EALNSKLRRAVRSRGHFPGDEAAMKLLYLVLNNAAEQWKRAPREWVEAKT
QFAVIFGERFFN
>SMa1952 Putative Beta lactamase
MPALATVSTIARAELQHALEARLAELERRHGGRVGVAALNLSTGARVGHR
ADERFLMCSTFKALASAMVLARVDKGVEKLDRRIVFSKEVLVYFSPVTET
RVGGEGMSVAELCMATLTQSDNTAINLLLESFGGPPALTEFVRSFGDELT
RLDRFEPELNEHDGPDDLRDTTTPGAMMETLRKLIFGEVLSRSSRAQLAG
WMVMNKTGDSRLRAGMPESWMIADKTGGNGNQHANNNDIAVAWSPNRGAI
VVATYCEIPTISADERNAVVAEVGRLVAELA
>SMa2075 probable extracellular solute-binding protein, family 5
MTSRKTAILKPTRRAFLLSSVALATATAMPVLPVPSFAAEEPTRGGSVSI
NIGTEPPVLVLIAHSAGAAYYISGKATESLLSYDKDFNPQPLLATEWTVS
EDGLRYWFKLRQDVRWHDGRDFTAEAVAFSILALKENHPRGRATFAHVKE
ANVLNSHEVELTLAKPAPYLLTAFASFEAPIVPKHLYEGTKIAENPHNVA
PVGTGPYKFVEWVRGSHALFVRNEDYWGSPKPYLDQIIFRFIVDPAAAVA
AIETGEVQVSTANLPLTDIDRLKANPNLVVDTDPAPYSPSIARAEFNLEN
KYLADIKVRHAIAHAVDKDFIVNTVYLGYATRLDGPVSPDLAKFYSPDLP
KYEFDPAKSEKLLDEAGYARGADGFRFKLFIDPTQPSGPPKQTAEYIAQA
LAKVGIKVELRTQDFATFVKRVFTDRDFDIAIEGMSNLYDPTVGVQRLYW
SKNFKPGVPFTNGSKYSNPEVDRLLETAAVEIDPKKRLELFNEFQKLVVE
DLPTLDIVTPAVITVYDKRVKNLKLGVEHLWSNGADIYLDGQS
>SMa2065 hypothetical protein
MAPSVSPLRIFYLEDNPLIVFHIEAMIEDLGYVFAGSASSFADLVGRIET
IEVDGVLVDIDLADGRTGPAAAKWLRQRGIPSIFVTGQEAIAAEYPETAL
ATIGKPVSESELAEKLELFRLTSSTSKTI
>SMa0356 hypothetical protein
MAALWREPDSGQRASELLKSTHLCRRRRHLEGPDWAQSSRSLRPSLTTAR
NRREPFVTSPVHGLLRGCSRSGPRRSHNIDWNVRYRAASLMLLYRFPGST
PVRHAALFPISSVRLKLVCMFIQIPALCGNTWRGAERYPWLSVSALPL
>SMa0579 adenylate cyclase, putative
MLFAFFGISAFAVLATVGALYAFLELSQVLERVTERRAPSALASLELSRH
AERVAATAPAFLASTSRARHSEVSAAIGSEMARLEELLAALKGATLSSGV
VSEIEDAVVGLRRNLHALDDLVTVRLAAVARKEELLRRLSATTNASQRLV
APGILVMNSKVPRWRAATADAVTTPEAEAAATRDLARAIAAYIPQQTAQR
EIAAINDTLLQAAVAPTPGDLSLISFPLRRSIETLESVTPEFDEQLRKRF
QQLVDQFEALIDGQRSIPNARNEELAVVAEGEKLVVENDKLSRKLTLAVD
RLVAAAKGDIAEAGSEAATVRRYGTGVVLGSALLSLLSSVLIVWLYVDRN
LLARLTGLSHSMLAIAAGDLRVPLPQTRGDEIGRMAKALRVFRDTAIEVE
EKNLRTVAEARQRLIDAIESISEGFAFYDSEDRLLVCNSRYRDILYPGMD
DTVVSGTHFEAIIRAAAERGLIEDAIGREQEWLAERLEAHRNPTGTLLQQ
RGPDRWIQISERRISGGGTVAVYSDITELKRREQDLSEKSVALEALSAKL
AKYLAPQVYNSIFSGKQDVRIESRRKKLTICFSDIAAFTETTDKMESEEL
TQLLNQYLTEMSKIALSFGATIDKYVGDAILMFFGDPETRGIREDAIACV
SMALAMQERMGELGETWRSVGIEMPLRCRIGIHTDYCTVGNFGSEDRMDY
TIIGGAVNLAARLEEEAAPGSVLISYETFAQVKDLIHCEETGRVQIRGIA
YPVATYRVVDFKANLTKSCNAIRTELPHLRLEAEPELMSTGEREVAITAL
RETLDRLRR
>SMa0087 hypothetical protein
MIVRQALFEGTIRPGREQAFRGYVEEKLVPLWRAFPGVREVRVLHAVDRD
EGAPAFAMILSTTYDDREALARALASPVRYESRELTKGLLEMFEGHIHHH
VFDLDARSTA
>SMa2293 probable beta lactamase transcriptional activator
MELPKLPLNALRAFEASARHCSFTRAGLELRVSQTAISHQVKSLEDLLGV
KLFRRLPRGLALTDEGSALAPVMSDVFKRMCATLSRFEEGNFHEVVTVGV
VGTFAIGWLMERLPDFHDAHPSLDLRILSNNNRVDLAGDGLDFAIRFGDG
SWHGTDAMHLSAAQLSPVCSPQIAARLKEPVDLARIELLRSYRSDEWAQW
FRAAGAQPPILRGMVFDTSLALAEAAARGSGAALLPITMFEHYLESGRLV
QPFDITISAGDYWLTWLKSRQFTRGMIAFKSWLKQQLPDPSGVL
>SMa1010 hypothetical protein
MKKLLPVLAVLLSPTFAAAEGEHVHSSPYAGQEQSLSADDIAALETGAGL
GLAKAAELNGMPGPSHVLTMKTELHLNGEQEDRTRNIFQRMRREAIAEGK
RLVAGEQALESAFRERSINHGGLREHLRRIEASRAKLRYIHLAAHLAMLH
VLSVGQIDRYNELRGYSR
>SMa2002 Conserved hypothetical protein
MEETVARTVLCFGDSNTHGQVPGRGPLDRYRREQRWGGVLQGLLGPNWQV
IEEGLSGRTTVHDDPIEGSLKNGRIYLRPCLQSHAPLDLIIIMLGTNDLK
RRFNMPPSEVAMGIGCLVHDIRELSPGRTGNDPEIMIVAPPPMLEDLKEW
ESIFSGAQEKSRKLALEFEIMADSLEAHFFDAGTVCQCSPADGFHIDEDA
HRLLGEALAQEVLAIGWPDA
>SMa1725 putative AraC-family transcriptional regulator
MLGETGLLAGRTVTTHWSYEEHLVSRFPDINVNTDRLIIDDGDILTAGGV
MAWIDLSLILIERFLGPTIMVETARAFLVDPPGREQSYYSAFSPRLNHGD
DAILKVQHWLQATGGKDMGLAVLAEHAGLEPRTFMRRFQKATGHTAGEYV
QRLRINKARDLLQFTRDPVDAIAWKVHYSDPSSFRKIFTRIIGLTPAEYR
QRFTSHQLSPGT
>SMa1478 Putative methyltransferase
MSGRRGGGRRSRAIRGGDGISQLPWQEVVNTYAPMQLLDEERIEALHRNS
MRILSEIGIRVMSEKAMALFEKAGAIVDRENLMIRIDESIVEAALATTPS
SFTLTSRNPEKRLTIGGNTIAFGLVAGPPNVHDRVNGRRQGNLADYQNFT
RLAHHFNAIHILGNQVCAPMELPANSRHLDTYKANLALSDLCFHCTAIGR
GRAVDGINMMAIARGISVEEMSQSPGVITIISVNSPRLFDDAMADGLIAM
AEHGQPVTITPFTLMGAMTPVTLAAALCQQNAEALFGVTLAQLVNPGTPV
MYGAFTSNVDMRSGAPAFGTPENAKANIIAGQLARRYKLPYRTSNANASN
AVDLQAAYETEMATWGAVLGGANLIYHAAGWLEGGLTASYEKLVLDVEIL
QNMMGFLEPMPFSEDDLGFDAIRSVPAGGHFFGSEHTMARYETAFYQPML
SNWQNYGSWQEAGGRDALERATDIWQQALREYEEPVLDPAIREELDAYTI
KRRAEIGAGEP
>SMa2071 hypothetical protein
MLADPDQVGTKAPSRSFERLWRRKSRPPNRETPMGSAKDKVAGKANELAG
KAKKAAGDATDNNSLRAKGAAQEAKGGAQQAKGKLKDAVKGAVDKT
>SMa0271 putative ABC transporter-permease
MTVATATLAPTNEGARIRWADLAPFIALAVIVLFGALVNPNFVSSANLIN
VITRSAFIAIIAVGATFVISSGGLDLSVGSMMAFVTGIMIMAMNHLAPAF
GAWAIPMGAGVALLVGALCGLFNGLIVTVGRIEPFIVTLGTMGIFRAFIT
FMTDGGSLPIDRSLREAYRPVYFGSFFGIPYPVLITFVVVMAGAFLLYKT
KYGRRLKSAGSNVEVARFSGVNVAGVRTGAYVIQGFCVAVAAICYVPRLG
AATPTTGQLWELQVITAVVIGGTLLRGGRGRIGGTVAGALILEVIANVMV
LSDLVSEYLVAAVQGAIIIIAMLAHRFTANR
>SMa0095 putative D-aminopeptidase
MKTARELGLIPQGRLEPGPGNAITDVAGVSVGHRSLRGEGLFTGVTAILP
HAGDVFRVKPRAAVEVINGFGKSAGLMQVAEIGTIETPIVLTNTFGVAAC
TEALVRRAISANPAIGRKTSTVNALVCECNDGSINDIQALAVTPADAEAA
LDAARTGPVEQGAVGAGSGMTAFGFKAGIGTASRRMRVGKRDFTLGTLVL
ANFGAAGDLVLPDGRRPDPRVPAGPERGSVIVVMATDLPLADRQLQRVAR
RAGAGLARLGAFWGHGSGDVALCFTTADPVEHEPATAFTTQERLADGHID
IAFRAAADTTQEAVLNALCMAPAMPARNGRIYPCLADWLMENPSP
>SMa1046 Probable adenylate cyclase
MTENHNLTTTSLVVALILLNHAGLQLERRLVLIFSGIVLIAWVAMLAITA
VRHHTADAMSLLASFFNQDLGLTVAVGFTAFAIYLLARDHDRTRKEALKA
DRRRHNLTRFFSPLIVSELQERGQALGLERRNAAIMFVDLRDFTSFAETA
TARELAFVLAEYRQLVSQTIFDHGGTVDKFIGDGVMAVFGQPRPTDDDAD
RALACALDLVDTLSDWRSHGVRIGYPALDAAIGLHYGTVVGGVLDSGCHS
EFTVIGDAVNVAQRLETLAKSLDASLVISSNLVARLQSPVPPAAWMSVTS
AALPGRRLPIDVWYLLRATDSARGHNTPGYDTGVSQTALQRSAFQSLPAP
DMGTM
>SMa1666 hypothetical protein
MISFFLGIRYIMILASIGVLGGALLMFLEGALLLRNAFTFVRTEPELSVT
AAVLRATDKFLFGIVLTIFGYAITFGFVIDVSDEIRKRVPRWMILNTVAE
MKILFIEVIILYLVVHFATVVAETEGMLDWNGLVLPGAALLLAAAMKLVA
SSTHDLGPK
>SMa0246 putative GntR-family transcriptional regulator
MDFGQISRSEHLPARIAAKIGREITEGRIAPGEKLPTEHLLATTFGVSRS
VVREAIAQLRNEGLVETRQGVGAFATEIERRQSLRIEQGDLANRGSFRDL
FQLRIPLEVEAARLAAIHHTPQDLGKIDEALQQMTGAEKWTEQGIVADLA
FHRAIAAATHNEYFLLFIGFIAERISLAINAARAAAILEEIVEVTIAEHV
SIRNGVSARDPVQAEEAMRHHLNGAAARLDLTI
>SMa2283 hypothetical protein
MRIDQVVSRGRTVPGACGRRARNPAADLRRHAVGKVLLAPNRMPQVMGNE
FRGQRFRRRPGEGAPVASRHQAWIAELRERIFAHALTGEVLEGRQVVVGQ
QRGELVAPVERQYGVERRAAAGNGCRWHDHRGSPSSS
>SMa2107 probable GstR transcriptional regulator
MRDNMSRLEDLEAFIHIAESGSLTRAATRLNRSLQAVSRSLTSLEVDVGL
QLVHRTTRHSALSEAGQAFYRRVKPAVLEIREAQLEAAGRRTGPSGILRV
GAPMLFGPDFLVPIVAEYIESYPQAEVDLSLTDAFVDLASKGIDVVVRIA
DLPDSNLQSKRLGALRRVVFGAPSYFARNGRPEHPAELRQHACIVRTIGN
RPGQWAFQIDGKRRMVGVRGSFRSNAMAAIYSAVRSGLGLGYSPLWQIRH
LVDAGQVEIVLQEFEPKPVPIHALWQESTRPPAKVRAFVDLLAMRLRLDD
L
>SMa1009 Conserved hypothetical protein
MTCGHCVSAVEKAVKSVDPNAKVVVNLEAKTASIDSQAASEAFVAAIEDA
GYKASFAKSCCSHVA
>SMa0453 Yle homolog, A. tumefaciens
MYLVDTNIVSEARRGTPQAVSWLRSVDPLSIHLSALSLGEIMRGIALKQR
SDPKTAAHLTEWLRKLRHDHGDRILPITDQIAVEWGRIAAIRPRGDIDGL
IAATAIVHDLILVTRNVKDFEDTDASVINPWETSA
>SMa0983 hypothetical protein
MQAASTKGSSPSRSKIRSGSDVPMIMPNSTFPRATARRPSNGGPSQHSMQ
RPRQRRLNLRGLSDQALEIQPMGMYMHMYIQRRARCPNSDTTNHHRQGKP
SSSEIIEVRLYVSPLILSFPAIG
>SMa1593 putative oxidoreductase
MQSQDAAWMVNLPGNRGSCWVATADRTGYPQLDSSVHAETVVVGGGIVGL
TTALRLLEAGRSVILVEALEIGQQVTGRSTAKITTQHALIYRHLVDTCGL
PTARNYTEANSAGVELIKDWIREYGIACDLECKNAYSYATNSKGREEIEA
EAEAARQVGLQAEVLDRAPLPFETVGALCFADQAQFNPAKYLVGLAHTVA
NRGGRLFEHSRAILIGEASRWRVVTAYGTVHAENMVVATNMTVKSPVGMA
NRTQPRCHTAMAFRVDDPLVVDGMSLMIPCTRYEPAGMTEVLCLSCWAPS
STPVRMGTSRHDSSTWKSGRERTCPSAMLPGVGAMRITTRRIECLLSANP
IPTTLRASTSPLASMPGASAMGPQLA
>SMa1341 probable ABC transporter, permease protein
MAAVQTRSERALNRVAIAAVLVITLLFLAPIYWITSTAFKPRNLATTIPP
TVIFEPEISPFVKLFTKRSQLRSAPEPEDYAAAPWWERLVFDGGEKVVRS
GRGQVQLSGYPNRFMNSLIVAITSTVLAVAMGTFTAYGFSRFKVKGEADL
LFFILSTRMLPPVVVAIPMFLMYRAVGLNDTHWGLIILYTAFNLSFSVWL
MKGFIDEIPKEYEEAALVDGYTRLEAFFKIVLPEAATGIAATAVFCFITA
WNEYAFALIMTNRRAQTAPPFIPSQVGSGLPDWTVIAAGTFLFLLPVAIF
TFLLRNHLLRGMSFGAIRK
>SMa1619 putative integrase/recombinase
MLSQLLADHAALHQALGFKFRTPGVLLRNFVAFAEHRGEHVITTATVHEW
ALQAPSPEQRRNRLLTVRRFALSLHAEDPRHEVPSADLFGRASRKRRTPY
IYSPEEIRRLIDAARRLGPDDSIRSLTYATMFGLIAATGMRVSEAIAIRL
HDVTDDGLIIAQTKFKKSRLLPLHTTTRRALDGYLATRLKAASQSDALFI
SHQGTPLAYPTVITVFLQIIRSIGLRGAPGSRGPRIHDLRHTFAVRSLEH
CAHDSQAVARHITALSTYLGHAHVTDTYWYLQSTPELMAHMSEAGETLHR
GVTA
>SMa1456 Hypothetical protein
MAQAQKIQSDGGPLILSYMDKGGKIAVQQVAVGFGMSKTQLAETAGLARE
TLYRPERSRGTKTQNRLREMLEIISRVTDWAGGKEQAMAWYRAQPLPAFG
GRTAEALVKDGKAAAVRDYLDHMALGGFA
>SMa1831 Hypothetical protein
MTISPYQTLIKAPGPNAPIVFAFHGMGGDEYQFAELTRQILPDAGVISPR
GDVSEFGALRFFRRTSEGSYDMEDLILRTEKMARFIAAHKAANPGRPIYG
LGYSNGANILASVLFKNAGLFDRAALLHPLIPWTPPNSEQLKDRPILITA
GQRDPICPLPLSERLADYFAAQKARVEACYHSGGHEIRPEELEALHAFLT
>SMa0249 conserved hypothetical protein
MQKAIDTFYKLLELLLIVLLAGMAVMVFLNVVLRYGFNSGINVSDEMSRY
FFVWLTFIGAVVTFRENSHVGVETLVSLFGRKGRIICMILSNIVVIAVSA
IFFWGTWKQSPINASMAAPVTGISMLWVYGIGYFTGAGVVLIALERLVRL
LTGRVTEEEIAAFAGENMTLEQLAERT
>SMa1171 Hypothetical protein
MKHPWTNGQVERMNRTIKEATVKRFHYDDHDQLRQHLQDFIKAYNFGRRL
KTLNGLTPYEYICKCWTNKPERFRLDPIDQMPGLNN
>SMa2175 putative integrase/recombinase
MTQLAQHLTAFLREHLPRERRASVHTCDAYAYSFQLLVTFAARRLSKRPC
LLQIEDIDVPMILAFLEHIEETRGNKARSRNARLAAVKSFFRYLEHRVPA
VLDQALRVHAMPMKKIDEALVASLSRTEVQALLNAPDRRSLSGIRDRAML
HLAFAGGLRVSELVGLTLDQFDGRSPASIHIIGKGRRERVLPLWQETAAA
IRAWIAVRPKNGDTALFLNNAGRMMTRSGFEYILEKHAAAAVSVAPTLAT
KSISPHVLRHSCAMHMLQATRDIRKVALWLGHASLQSTEIYLRADPTEKL
EMLDALAPLGIKPGKFRPPDKLIAMLATR
>SMa2209 putative ABC transporter, periplasmic solute-binding protein
MYIRHLIAGAAISASWAGMAVAEDCALRVTTWGGSYQATYQAVAQKFEEE
HNCRIEWVVGASPDHLIKARLGQVDVVTNTLLNSIAGEKEGLWQKLDPAK
IPNMANLYPNAVHSPYTVFANVGDYVLAYNKDTVTTVPATWDELWKPEYK
NRVVIYGIDHIPTLSLTVLQAEKNGGSIDNVEPGLDRMAELIKSGNLIGS
LDVESQMVSLFETGDAWLGMLATGRMKELLSKGVTNVSFVRPEEGTFPLI
TSVNIHKDAKNPAMAAAFVNYILSSEVQVAFATRNLYAPTVKNAEIPDDF
EFRDLLVLNDAFGRLYLPDQEKITANKAGWQQQLNQKAMR
>SMa0433 hypothetical protein
MTKVTVIAHSPGAPVSRRALLDAIRNQASWAGKTRLQLFAPAHIGAYLNK
VRKELGIASRLIASLTALTKFGVLSLDGLEPGSPFLTQLMEDSSKELANG
WDAQVKARQVIFGDGENVVMVQRFLEDPPHTVWPGHGHCSVCRCDKTAPA
IASYLA
>SMa1362 Putative inner-membrane permease
MMASMPAAEVSPSRRRLPAGALRRRQLAASAWVYLAFLAVATIMAGTFVL
AFIASLKVDPLERPFRIYFDQLNPAAWVAAARLGREGAGDALWGGLAPGA
NVHFSVTYAAPADARIVEPRAEIPRRRPGSGIAAALMRHYAADYASIALR
SSSSATKQARDKRGIRQGWQARTFVYEITYRSDRHNGPWIERTPVNLTAP
TSQTLIDSPISPSRSERRGRLLSWDNITPGALGLIFNNYRRVISETADLQ
TGKSLIGSWLGNSLAIATGRLVLTLVVASLAGYALARLKFRGSRLLFAAA
LFSMTIPAQVTFVSNYLIFRDLSLLNTPWSVIVVFVASANVLLMKQFFEA
FPREIEEAAIVDGANRFLVFYKIVLPNAKPALLVNAILAFQGAWNDFFWP
LVLITSPPEALTIQVGLLSLRRSFGGVQGDWGLVLAGAFISIVPVVILFV
LFQRHIVSTEFAKGTT
>SMa1650 putative ABC transporter, permease protein
MFHFTSKRLKHSLVLLFLASLVCFTLVVSAPGNVAVLIAELRGAVATPGY
VEQVAEEIGLNEPLLMRYVDWLGDAVRGNFGISYRTGQDVSADLAARMSV
TGILIAGGAVIATLLSLALGFLGALWPYRVPDRVSRGLALLGASTPTFFV
GALMIYGFSVHFQLLPSFGFNGPQSWILPWLTVSVLPAAVLSRVVRVGLE
ETMASPFVLTAKSKGLGRTAILFRDALPNITPIFINALGTQVGLMTATAI
IVEPLFAWQGLGDYFLAGVRFRDFMVVQACLLIFLTFFILVNLIVDLAVL
LTDPRIRRQWS
>SMa2014 Putative transcriptional regulator
MPFRSAGHSDMRWSALPSSSRSMGSHWPPDLFNHKCIRVRLPDGSLFRWR
FEKDGEQVQIDVRGPITLDEASLTRTAVLDSAGVGYIFEQDILPDIDAGR
VIRILEDWTPPYPGLCLHYPGRRNLSAGVRAFLELARELSRRATG
>SMa0791 hypothetical protein with local similarity
MPVQIRKTLLQMETTLIEGGKAAPRPLKLFSAVAVVKNPWAGHGFVEDLR
PEIHRAAPVLGELLTRMIIDAVGSAEAVEAYGKAAVVGIDGEIEHASALI
HTLRFGNHYRQAVGAKSYLAFCNTRGPANAPIMIPLMDKNDEGRRSHYLT
IQTAVPDAPAADEIVVALGASTGGRPHHRIGDRYEDLKELGQDVTNPAGV
>SMa0640 hypothetical protein
MRRRRNAPATAKRSSKSHQTTRAGDCLKHRNPKRWSQLRKGGLNITSILL
SSYLALPADSLLGRDLGEGERQLEFAGKFHRSPRQAPDSGRSLPPAGRQS
RLRISAVEKQAAEAATSMKAVRIRSTASILTHDPHPLVQ
>SMa2151 probable DNA-binding protein
MTTSSWLSGSLPNSIARKTQAELVRKSGPAPYWRFAMATKRKFKSDAFEA
IHSAVEDMYSAGTIDKETMKTFDETCLSIPQELTPSEIKALRENNHVSQP
VFARYLNTSESTVQKWETGAKRPSGPALKLLSIVRKHGLEMLS
>SMa0179 putative transcriptional activator
MHRPRLPPLSALRAFEAAARLASFKAAAEELLVTPTAISHQIKQLEAHMS
LRVLDRTPRAVTLTPRGKALYEATAAGFGEIERVVTRLLAETAPTTVTLT
STIAFLSHWLVPRMDALRQTIPNIDLRLHASNKVEELRSGGIETAIRYGR
GPFAGTASMQLCSDAMTPVCSPSLGLSQLGDLRRVTLIHIDGRSRPAPKP
DWNRWCEQAGITDLDTSAGPRFPDSMLAVQAAIAGQGVVIASRVLVADAL
ATGLLEAPFTQSLAGDAYHFACALGLEQRTDIAALRMWFQNCFSAAEPDP
AHRLP
>SMa1024 hypothetical protein
MTMPEACKSEMAASGTMQMPGGDMGQMAEHQKAMMEGMRETEPAMMQGMM
AKDPDVAFVCGMIAHHTAAINMSEVELKYGDDQQAKSMAEKIIEAQKKEI
EEMTNWVEEHSN
>SMa1016 Conserved hypothetical protein
MPKSLLAQPQIVTVNTTVASYSPYFPTGQSLSAGTIEKVQSRALGPDVLR
SLAILLVILVHLPVEATPPSLVGHAWLGVDVFFVLSGFLIGTQLFREVAR
TGRVDLKSFYLRRAFRIFPAFFVVLGLYAIFPVIWDASTMQSVWSFATFT
VNFDFDPRVGRAFSQAWSLCVEEHFYLVLPLLVLILHRRISMGSTLLIAG
AMVGGGMALRYTIWESQVGVLVAADKLGDAFAVYLRDVYYPTYTRLDGLI
FGVILAAARFFKPELCKRYAPPRIALPIGFALVAAALVLFSIRGPLAGTN
LFLVFQAQVGSVAGFPLISIGIALILGAMLDVEHILRRWPFPGAATVATL
SYSLYLTHKSVFHIDRLVFGEGNLQGGFGFAVYLATSFAAATMLWFCVER
TFLLLRDRVLSPKRPLAKEGAHRF
>SMa2201 hypothetical protein
MKNRIFAGVPSEEMAGYAKAVIVGTTVYVSGTTGRDPDTGNFPDNAAQQA
RNALASIDKALRQAGGSLANAVVSRVYVVDQACAADVTAVLGEVFRDIRP
TSTMLICEIPAPGAKVEIEITASLEC
>SMa1613 hypothetical protein
MRLCGAPHKRKGVEGLAALAQDVLRQKPTGGAVFAFRGRRGDRLKLLYFD
GQGFCLYYKILQRGRFPWPSAADGTARLTTAQLAMLWEGIDWRRPNWGAP
PARVG
>SMa1691 putative TrkH-like protein
MNASLFRHAMHIAAILGLYLSAAMLIPAMLDLYYGHYDWQVFAAAAFVTS
GLSATTFMATRGGPPPFSKKFGFLLVNVLWSAFALIGALPLWMSSLDLDF
AQALFESVSAITTTGSTVIVGLDDAPPGILIWRSLICWLGGVGIVVLGLF
IIPYLRVGGMSFFKMESSDTNEKPFARLATFSRAFFTVYVGITLLCAIGY
NLTGMNRFDAINHAMSTVATGGFSTHDASFAYFGSIPLLWTATFFMTLCS
LPFSILIVFVARGRLDALRDPQIFVFLAYLTVFAFSVAIYHRLTNGVEFH
IALAHAFFNFSSILSTAGYMSEDYQLWGPFVVMAAFIATFMGGCSGSTAG
GIKAYRFIVLFNAIHSGLRKLIYPAAVYPVRYGRNSLDADTLRAILLFFV
TYILLWIFGSLTMAALGYDFLTAVSAVITCLSNVGPGLGTLVGPAGNFST
LEDPELYLLSLMMMMGRLEVLTVLVILTPVFWKQ
>SMa1705 HYPOTHETICAL MUCR FAMILY TRANSCRIPTIONAL REGULATORY PROTEIN IN SYRB 5'REGION
MTETRSNGRRLELTSRIVSAYLSRNVIAPDELPYLIQQTYGSLNETSGPG
ETPPAVEEQRPAVPIKKSVTDDFIVCLEDGKKFKSLKRHLTTKYGMTPDQ
YREKWKLPSEYPMTARNYALQRSKLARAMGLGKSRALK
>SMa2103 possible sulfite oxidase
MEVPLPLFIVHHQHSPETCPARDPAKGTMLLNHLSRPSAARHGVLIKGEA
VAQGSHSLFFIAEAADEAILQAFLAPLRQAGNVEVTAAMTCAAMVSSGGC
EERPVDVSAEVLDPADACQDAVEAGLLIHSVNPLNGETSVPDLAGGAVMP
NGRFYLRNHFDIPNLGGDNYRLSIGGLVERPLKLSMRELHNLHAESQVVT
LECAGNGRSLFDPAVPGEAWGLGAVSTAEWTGVRLMEVLERAGLRAGATE
LTFRGADSGVVDGHDAPVRFERGLSLDQIRETDALLAYEMNGETLSPPHG
YPLRLIVPGWYAVASVKWLTEIVVTDQPCEAYYQAEKYWYHWVRNGHDER
AQVRLMNVRALISSPEEGENLPRGDTAIRGVAWSGAGNISRVDVSLNGSR
WREARLVGERRRSAWQWWELITRLEETGPLTVRARATDMTGRTQPEHAEW
NRLGYGNNSIHSVAARVI
>SMa0431 hypothetical protein
MQNLTSQILGYAEQMPEGSPLSAKSLLHLGNRAAVDQALSRLTERGELIR
AGRGIYMRPVKSRFGSRPPTVEQAVEAVAQQRGEIIVSSGAAAANALGLT
TQVPVRSVYLTSGRSRTMNLGKQSVELKHVPRWQLALADRPAGVAVRALA
WLGPEKAEEALSRIKRKLPPAEFGELVAAAPQFPTWLARSVGRAAHG
>SMa0850 hypothetical protein
MHLATMGTLSSCRFTPGRTVSLGRSQERYCVLDGADRPKYGWRAAERAPL
KRPVPPRAPHGFSIVKTLLRVQFPTPRRNAAAKHSSVSADRSPRWRRGAT
DGTPTFGAGSRIVIGSDERSI
>SMa0576 Leu or Leu/Val/Ile Transport Binding Protein
MRHLFTAAALAFALASQSEAEVLIGVAGPMSGKLAWTGTQLRRGAEMAVA
NINAAGGVLGQQVRLIVADDFCDPRQALAAAEKLVADGAVFVIGHYCSGA
SIPASKIYAAAGVLQISPSSTNPMLTEQGHANVFRVCSRDDAQGHKAGNY
LADHWGDSKIAILHDNTTYGKGLADETKKQLNMRGVTEAVYQSYTPGKDD
YSVEVAALQTAHIAVLYLGGYHTEAALMVRAARDRAYPVQLISGDDTATE
AFGLIAGPAAEGTLFTFVADPRRNAEAAEVVERFRAENFEPDSWTLHSYG
AAEIWAQAVTKANSLDLQAVIAALREDQFDTVLGRIDFDKKGDLTVQSWV
WYVWKSGEYVPVE
>SMa0280 hypothetical protein
MLDRTIPHNLARKAVLYRMVMPGHTCPYGLKTKDLLQRSGYEVDDHHLTT
REETDAFKAQHGVPTTPQVFIDGQRVGGYDDLRRFLGKPVADPKATTYRP
VIVLFTLTALTAMAASHAVNGTPFTLRAAEWFIAFSMVVLAMMKLQNVES
FATMFLNYDLLAMRWIPYSYIYPYAEGLAGVLMVAGALNWLSVPVAMFIG
TVGAVSVFKAVYIDKRELKCACVGGSSNVPLGFISLTENLMMIAMAVWMA
AGSIGLIATHAM
>SMa1087 Putative cation transport P-type ATPase
MGIDPNHHHSRTNVHAPPAQTEGSQVPTMEGVIYTCPMHPQVRQIGPGNC
PICGMALEPAVVTAETGPSAEFVDMRRRFWIGLVLTSPVLALEMGGHLTN
LHMLLGAQTSNWLQLVFATPVVLWAGAPFFERAWRSLVTRRLNMFTLIAM
GTGVAWVYSVIATVAPGLFPATFRSADGAVPIYFEAAAVITVLVLLGQVL
ELRAREQTGGAIRALLDLAPKTARRIRNDGTDEDLPLEAVAVGDRLRVRP
GEKVPVDGTLVEGRSSVDESMITGESMPVTKEVGAKLIGGTMNKTGGFVM
EAGKVGRDTMLSRIVQMVAEAQRSRAPIQRLADEVSGWFVPAVILIAIVA
FVAWMWLGPEPRFTHGLVAAVAVLIIACPCALGLATPMSIMVGVGQGARA
GVLIKNAEALERFEKVNTLVVDKTGTLTEGKSKVTSVVAVNGIAEDELLQ
VAATLERASEHPLAAAIVEAANVSRLGLGTAENFDSPVGKGVTGTVKGHR
LVIGSHQIMSEEKVDVAPLTEKAEALRGEGATVIFVAIDGRVGGLFAISD
PIKPTTPAAVAALMKDGVRVVMLTGDNRTTANAVARKLGITEVEAEILPE
HKSEIVRRLRNEGRVVAMAGDGVNDAPALAAADVGIAMGTGTDVAIESAG
VTLLKGDLQGIVRARQLSHATMRNIRQNLFFAFIYNAAGVPVAAGVLYPA
FGLLLSPIIAAAAMALSSVSVIGNSLRLRSTRI
>SMa0436 hypothetical protein
MSTTPTVHFEAASTLAVVVSSHEPLLFLSDDLKIIAASASFCRAFDIDPT
SVPGKKLGELGQGEWAMPKLESLLKATASRSADIQAYEIDLKRPNRGTRQ
LVVNAQTLDDGDVEHIRLLVAVTDMER
>SMa2341 putative LysR-family transcriptional regulator
MNLLPLFLAVAEEDNFRAAADRLGVTRSAVSQGIRRLEDAFGTTLVTRTT
RSVRLTEAGERLREALSQPLSDIGTALDRVAGEDEPRGLLRIAVTSIAEQ
FLSGPLIASFAQAHPKVAIDVTVTDEEFDIVAAGFDAGVRLGEVIEQDMI
AVPLTGDQRELAVAAPTYLAVRGTPAHPRELVHHRCIGWRRAPNVAPYRW
EFEENGVPFDVAVEPQITTNDLRFMLRSALCGAGITFATEETFRPFVETG
ELVPLLQDFLPPFPGFFLYFPQRRNMTPKMRALIEHVRRWR
>SMa2089 hypothetical protein
MTDPKRFAPVLAKAEELGKRFALTARHYDETGEFPFANFDALHEAGLLGL
VTATEHGGLGGGLTDALAVVSAIARGEPSTALVLSMHYNQHYSVRTSGKW
PPHLVERVTRANREVWR
>SMa1927 Conserved hypothetical protein
MIFPSLTRRAAMAAGFAFWSALALGAPAKAQDFPDRTVTLVVPFAAGGST
DVVARIIAQKMSEDLGQQVIVQNVAGAGGNLGAANVARAEPDGYTILMAT
VATHALNPLILKTKPYDPEKDFAPISLLVVVPNVLVVNPELPAKTVQELL
ALLKAEPEKYAYASSGNGTPLHLSGELFKKMAGVEMQHIPYKGSGPALND
VLGNQVPIMFDNLPSSSGHIKAGTLRALAVTTAERASSFPDIPTIAESGI
PGYETYTWNALFAPANTPQPVIARLNEAAKKAMADPAVQKRMEEFSAKIV
GSTPEELAAHVKAELAKWIPVVRDANIQMD
>SMa2020 putative transcription regulator
MTKPHNDPPPLHDQLCYAIYTAGIAIQRAYKPLLDELGLTYAQYLVLNVL
WSEDEQTVGAIANTLALESSTLTPLLKRLETSGLLRRTRNLSNERQVVIA
LTEKGRALQHRAGCLSDTLLAASTQTPPELAALNRDVRYLRNAIYSQIGG
WDTPA
>SMa0673 TRm3 transposase
MAIEKELLDQLLAGRDPSEVFGKDGLLDDLKKALSERILNAELDDHLDVE
RLEGGPANRRNGSSKKTVLTGTSKMTLTIPRDRAGTFDPKLIARYQRRFP
DFDDKIISMYARGMTVREIQGHLEELYGIDVSPDLISAVTDTVLEAVGEW
QNRPLELCYPLVFFDAIRVKIRDEGFVRNKAVYVALAVLADGSKEILGLW
IEQTEGAKFWLRVMNELKNRGCQDILIAVVDGLKGFPEAITAVFPQTIVQ
TCIVHLIRHSLEFVSYKDRRTVVPALRAIYRARDAEAGLKALEAFEEGYW
GQKYPAIAQSWRRNWEHVVPFFAFPEGVRRIIYTTNAIEALNSKLRRAVR
SRGHFPGDEAAMKLLYLVLNNAAEQWKRAPREWVEAKTQFAVIFGERFFN
>SMa1052 Conserved hypothetical protein
MNRNICQFGAALASGIVFGFGLSLSGMLNPARVQGFLEVFGTWDPSLAFV
LGGAVVVAFIGVQVMKLMRHPAFDDTFHVPTIRRIDAPLVIGSAVFGLGW
GIGGFCPGPAVASLALGLPQTVLFVVAMLVGMTLHDRLWSRGT
>SMa1747 putative ferrichrome-iron receptor
MNMDIDQTGPWPTRLPLRSRNAFLLGCTAFFALTPALSFAQDAVPEGDTT
VLETIVAHGAGGGSVLNTDEDSKSIIATETTGAGKMPTDILIAPASVSVI
TSKEIEERAADTIEQVVQYTAGVVTDFYGSDDRYDYFDIRGFTPYTYRDG
LAIGRTFAGVREEPYAFERIEVLKGASSSSFGAAEPGGSVNYVTKTPKSD
RFGEVYGTGGSFSHKELGFDFGDNLTADETLSYRLTGKFQRSDAEYDYSQ
DDENFVMGGVTWRPTDATSLTFIFDHLDKDGVPGSGGHPLGTDFDRDQFF
GEPDYYFSETNRNSYSVLFDHDFGNGLSFSSNARYSNLNDGFGSAYIGST
PTDGSTVAGRYFFGNEKSTDQFVIDAHLVYEASLDNVESRTLFGADYNKY
ESDSANFYAPAPSIDWEDPIYSGGPGAMAPYASTNNDQQTNAIYLQQDLT
FFDKLTVSFGLRNDWLDLSETNLLAGTRRAGNHREFTTRIGASYKVTEEL
APYISYAESAAPPAAGSDPTTGKQYEVGIKYRPDAFPAMFTASVYDLTKG
NITVFDQVTYLPQTVEKVRHRGFELEAKAEVTNNISVIAAYSYIDSKIEE
PGGANDGNRLMRVPKNMASVWGTYTLEGDGARGDMLFGLGARYTDAYYTS
ITNTTSSESAVVFDAAFTYKIQENTTFQLNVNNLFDEKHVASKDSGAVYY
NPGRSILATLRQSW
>SMa0969 putative response regulator of two-component system
MRQLDRMENQYTKPLVLICSGNANFYVLLAHILASEGFQTLLLGEEEVVD
QAALRPITAIILDSAQDPELTVRTCAAIKANEITSRIPTIALISSGNERY
YLALLKAGIDENFVRPVSPARLLAYLGSLPHNGADDKQPTDPAREETAAI
FGALRIEPGRRLVRYGDEGAQFGRIEFNLLRCLLEAPGRVRSRLELIEAA
WPSNRYVQPRTVDVHVARLRRELERLTGRPLIRTIRATGYAIDIDDGAD
>SMa1765 Hypothetical protein
MASSRNTMPSLAALLGLLAVAGYQNRDKLGQIFQDAKGGAQNSGGILGEL
GKMFGDPGEASLSKGLGELVDAFKNGGNREAADSWVSAEKPSQGISAEQV
ESSIGKDTLQELAAKTGLDYNELLKRLSTSIPDAVDKMTPNGHLPQNDEE
VLGRSVGPASA
>SMa1131 Conserved hypothetical protein
MMIEPTIRFDGAAGTVTGSCQLVEFGATKILVDCGLFQGSKTEKELNYRD
FPFDPKTIDAVILTHAHIDHSGLLPKLARLGYDGPIFATPATTDLCSVML
PDSAHIQESEVDQLNRRNLRRGRQTVTPIYTPRDAAVAITLFRQVRLGSW
ETVGEGIRFRFWNAGHLLGSASVEMEIGTGPSPLRLLFSGDIGPRHKLLQ
FDAKAPSGWDYVICESTYGNIEREEADDAARRHILRSEVLTAAHPNGALL
IPSFAVERTQELLTDLVHLMETGAVPKCPIIIDSPLATRASEIFRRHARE
LENGDALVRGIESKNVRFTETAEQSKAVDLIRGFHIVIAASGMCEAGRIR
HRLKNWLWRDEATVLLVGYQANGTLGRLLEEGATVVRIQGDDIRVRARIR
KLDIYSGHADGSELADWVRARQPISAGVFLVHGEDEALEGIRERLSTFLP
QDHLIRPRLDSAFRLGSHGAVQISEAVAPPRIDPVRVGRTDWHNDIQKLV
LDLQDELAKAADDKARGVVIRRLRRALQEEA
>SMa0674 hypothetical protein
MSEKVPGNGGSKHLFYGVLNPVDRSNEVLFGLIMVMTFTNSLSTTDAGRA
DVRSMLVGAVGCNLAWGIIDAVMYFMSSMAERKLSERTVHRVRTADDVKA
GTVRNSVRWAGFIS
>SMa1674 hypothetical protein
MASSSLRLLVALLCSEKRRARRQQSPTVPTHRSAGEQTVSMGLARAARRQ
IALSPLISAKDVAHDESYADDQKDYSCQADSKRAISVARRRDKETNNDTP
EAD
>SMa1871 Putative deaminase
MHPMNPKIERSCMKTLLLKKDEVRRLIGMAEVIGAVEEAYKAFSSDQVEQ
PDYIGIHHPSLRGEIDFKLGYYKANEIISMKAHSGGFTNNPAEHGVPNSI
GTILLFDARSCALICIMDGSLITGLRTGASGAVSVKALARKNARTVASIG
TGNQARMQIRAVNEIMKIEKIHAWSRTPESLSRYKTDIQREFGIPVIVAS
SKKEAVEQADILITTTRGKGSLVEADWVKPGTHIVAIGTDQRGKQELDPE
IFRNAKVVVDSVAQCTEKGETWHPLNKNIITKDDIHGEIGEVLLGRKPGR
ESDDEITIFDSTGMAIQDNTTASKIYQNAIANNVGTFFQFFE
>SMa1494 Hypothetical protein
MDDGSEGKMWIASHVGGASKLNWAADDLRSVFVAAAGVHPCYVFAVVCTQ
RFAECTISEESVSPACRKLRPACHALSQSATSRPVALSSWRMMPSGMTPD
LKSDCQLYDMSKGSHRPFAAIILRSSVDYKVAADCEPSMHGNFVSG
>SMa1409 Putative acyl-CoA transferase
MAKVMTANEAAGLVRDGAVLAVNSSSGLGCPDEVLKALGERFELEGHPRG
LTSVHPIAAGDFFGTKGVDHIARRGMLKKIIGGSYPSGPSNAEPPLIWQM
ILKEEVAAWNVPSGIVFDMLREGAAKRPGVLTKVGIDTFVDPEQEGCAMN
AAARSEPIVRRVRFADDDWLHFPPIQPDVAIIRATTADERGNLTFEHEGA
TLGAMEMALAARNSGGIVIAQVKRVAAEGTMRPHDVRVPGILVDVIVEAP
DQLQTTATPYDPAISGELFRPLHTFRTPALDPGKVIARRVAQELKAGWAV
NIGFGISANVPRVLIEEGHHGKITWVIEQGAVGGVPLLDFKFGCASNAEA
FVASPHQFCYFQAGGFDCSLLSFLEIDAEGSVNVSRLAATPHRTAGAGGF
VDITARARKIVFSGNFNAGAKMRIEDGRLVIDKEGRIAKIVPKVDQVSFS
GRRARVQGQDVTYVTERCVMKLEPDGLVVSEVAPGLDLRRDVLDQAATPL
RVADGLKEMDQALFLSTAMGLGL
>SMa2281 hypothetical protein
MPVAALERVVHLLAPNDSQFKYRLVPKATYERRKATLRLSPDEGMRVARV
ARVWNLALDVWQSEEEARDFLFRPHPMLEDKRPIDVVIQSEIGGELVLET
LASLKYGSAA
>SMa0903 hypothetical protein
MEATRDNAERLAMIVPGGREPIAPTHWSYGQLCSLAGAPASYMRQLPAPL
AAINLQHGLLNHKAELVKTLEMDDGRVELRAVTGPEYGRIWDKDLVAAVM
SIAGTGTGDTIWKVPGVLDWSTMTHNPFVDITKDTTTLYASDRDVFLFLV
DDTHPIEAGRLPNGEPDLYFRGFYAWNSEVGSKTLGIASFYLRAVCANRN
LWGTENFEEITIRHSKFAAQRFAHEAAPALTNFANSSPAPFIAGIRAARE
RVVARNDEDRSEFLRKRGFSKAETGKIIETVLSEEGRPPESIFDFVQGIT
ALARGKAHQDTRLELEGKARKLLESAA
>SMa0255 conserved hypothetical protein
MTSQLDMFARDVRPAAARAEASRARRQTSTSFSETEMLAALKATGRYRIL
RKLEARTVASEVRPGFPLRGVILDTETTGLDARKCEIIEIGLVSFTYNEE
GEIGDVVAVYGGLQQPTIAIPPDITRLTGITDAMVAGQSIDIAAVQAIVG
PADLIIAHNAGFDRPFCEAFSDVFVRKAWACSVSEIDWSGRGFEGTKLGY
LIGQSGYFHDGHRAVDDCFALLGVLEQSSGGAALPPFAELYKASQRSRVR
IFAENAPFDLKDVLKARGYRWSDGSDGRPKSWWIEVAEEELEAELGFLRK
EIYRWDEADPPTQRLTAFDRYRARR
>SMa1924 Conserved hypothetical protein
MPATKTLANESRNDSSNIALSMKLPRKADFEKALIRQYAKAVKLSRKAGH
QVSFRVVVDPLAGAQTISVVEEEPLCHQDTFPVEEVPEPDDELQAALAAA
RERGRRRAAEILAEGDMLSAEAFADLLGVSRVTVNTKRHNGQVLGLDGAK
RGFRFPSWQLDRDGRPYAALPKLHEVLGGAWAVYRFLVTPHGALNGRTGL
DALKRGQDEDVVAAAEGVARGDFR
>SMa1872 Putative threonine dehydratase
MRNYPTIQDIREARERLKPHVRHTPLLRAEKIEKAAGCQLYLKPETLQIT
GAFKIRGALNKALSLSREEIANGIIATSSGNHAQGLSYAAKMLGVKVILV
LPVTTPKIKIENTKALGAEVILFDGDNAARWKKVYEIAEGNKYAVIHGFE
DPVVMAGQGTIGCEILEDLDDVDTVIVPLGGGGLISGIATAIKETKPSVR
VIGAEPALTPKYFHSRVNKERTSLPLKNTIADGLRISVPGQNPYPIIEKY
VDEIVLVEDEHIIAGMRALAKDAKLIAEPAASIGVGALLAGIIDVKLDEK
VCAVLSGGNWDLRDLAAIYNVAG
>SMa0148 hypothetical protein
MLSFLKKDPQVIQFVCHPEDDGVIAPPLPAKSVLPDWFRRLPAVDKGHLS
ATNNGLTVKRCMPFLDAMTTGWILPIAATVRLDVKDDGRSVDAGWEFDRV
MVSNHGAHQVAGNPKEPSPPCKFHNYWSIRTPPGWSCLFLPPLNRPGQPF
ECVAGIVDTDSYSAHIHFPFFATAPDGLHVVEKGTPLVQVIPFRRADAAV
TAEIRAETRAEATEREAIHRNTLAGEGWYRKSARAAR
>SMa1697 conserved hypothetical protein
MKTPQRRFVVEFKSGRRQPKAQTNSIWGNTDLKALAREVEETTPHLFNSS
EAAGTLNSDETAPAYPIDAEPANPRADDVDVALAAMPFANGAEVEISKHH
GADHPPETVVQEEESQPASQARTTSTSTSRKRAKRAYAQTIAHNSEVGKG
DPKPVDERISLDELAVLEADNKRLRRLLAKQLHAQNLQLKKMLARFDVE
>SMa1954 Putative LysR-family transcriptional regulator
MDWRRIDLNLLKVLAVMLEERSVARCAERLFVSPSAVSHALAKLRQMFSD
PLFVRTGGGMLPTARAQSLERSLSLLKAVLDVELEAGRVSAPGATFEPGE
SVRDIRIVSPGALEISLLPVLAAILRSRAPHWSLTIEPFERRSYEIDLAT
GRVDFVLSVGGATPTGALVDAAVVWEDELVVLAGPRSALHAGSDRISTEL
YLAQQHIYPVPWPTSQNYLDVELARAGKHRKIAFSVPSYAALGHVLEATD
LIASMPDRSARALICRHPDLRLIRLDPPRRSQLSLLWGISALQEPAISWA
RSIIQDAAAKHGDAGGGTANRIGSGQD
>SMa2351 possible oxidoreductase, molybdopterin-binding subunit
MRAFSYERVSSIGAAARAAAATDGAKFIAGGTNLLDLMKLEIETPTHLVD
VNGLGLDRIEATREGGLRIGALVRNTDLAADERVRRDYALLSRALLTGAS
GQLRNRATTAGNLLQRTRCPYFYDTNQPCNKRQPGSGCSALAGFSRQLAV
VGVSDACIASHPSDMAVALRALDAVVETVRADGTTRSIPMGDLHRLPGDT
PHIEHVLERGELITAVVLPKPAGGKQIYRKVRDRASYAFALVSVGAVVQP
DGTGRVAVGGIAPKPWRDEAAEKELANGAKAVAAMLLAGARPTEQNAFKL
TLVERALSAVLVEAKG
>SMa1505 Putative GntR-family transcriptional regulator
MERPVMSSRSQAIADILTRAIIDHRLVPGCKLGERELAEIFEVSRIVIRQ
ALIRLADDGLAQIERNRGAFVARPSMQEAMEIYDALTLVEQGVAAQLCDR
LGPAGFAELRQHVERQRQAVASGNDALADVLGQEFHTLFVRLSRNKVMQE
IHAQLVRRTTLLRSLISADFDYCNLLDDHQRVIDLLEKGRLKPVMELIDT
HHRSVVRGYIMDRQVFPELTPREALQPYLDGKADEAPKPAPRRKTGESGG
TGAGRHVHATK
>SMa1960 Conserved hypothetical protein
MLIVRDAFDGVTRFGEFQKSLGIARNILTARLRTLVDRGILEAVPISEGG
ARQEYRLTQMGRDLFPVMVALRQFGERHLFAPNEKRSKLVERSTGRPIRL
DVLTEDGRPVNSEETVILKISEDLG
>SMa1176 Hypothetical protein
MFSNHLPFAFAGREKAQPKEITMGNWTADLMTRTGLKIHVRPVRTEDEPM
LAEFFTHVTKEDLSFRYLTGLNEVGKERIAALTDVDHVRTENYLAFGESG
DPLIATAMLACDPAFERGEVAISIRADYKNRGVGWELLGFLSRVAQAKGV
KVLESIERRENRAAIEIEQQMGFTTVTDPDDPTILLVRKELRAA
>SMa1168 hypothetical protein
MTRHIAIVQGHPDPARHHLLNAMADAYAEAATAAGHEVRRIEVARLEFPL
LRTQEDFETGALPPGLEQAREDMRWAEHWVFLFPLWHGTMPALLKGFLEH
IFRPGFAMEYKKGGFPKRLLAGRSARIIVTMGMPVLLYRWYFGAYGVRSF
ERSMLGFAGIKPIRENFYGLSFADEKKRSRWLDEMRDYGRRAR
>SMa1001 hypothetical protein
MRTGKAWVWQLRRAAERTRRPFAGDNSPTFTVEHTHRMYPCHVFCNPLIG
TIQMKACSRSERREGSKKMEAAKASKPDVSAVYAVDLSQRPVSQGDAARE
KAALLQLARRMHEAPGEMLRRFVELAMELTGGISAGISLLEETEPPPVFR
WHHLKGILSPFNGATTPRDFSPCGITLDRSAPTLTIHPERVYDWIPPGLF
LPEVLLVPLYIGRTEPLGTLWIVADRIEHFHCGHAATMQELAGFIGIALK
MVRSEEELQQALEQQELLTREMGHRLKNLFTILDSMVRISARSTDNKDDL
VALLSGRLHALAAAHSLVKPSFSDVQGAASSLAELLSIVLEPHEQPAISG
KRRLSLTGPTVLCGEQSVNGLALVFHELATNAAKYGSLCGDLGTVDVVWQ
IDGDDLGITWREDGGAQATSSSPASKGFGSTLVEATVIRQFGGTLSYDWR
STGLSVHIVIPLCRLAQ
>SMa0690 conserved hypothetical protein
MMRGMTLALAGAMTLMLSMQVNGSLAQEPTKAATGTVASTPEPLTDDELE
VLVARIALYPDELVALISAVSLYPLQIVEAERFLENHKKKPDLKPKESWD
GSVISLLNYPDVVKMMSDDLEWTQALGQAVAYQQKDVLIAIQQLRDEAVA
KDIIKSDDKMTVVQEGDNIIIQSANPETIYVPQYPPEMLYEPDYAPVPID
YYDTPYPSYYYPGAAFFAGAVTGAVFGAIVDWDDWGVWGGDWGGDIDVDC
DNCFNNVDIDGKVKWNDIDWKNVDRSKLKFDRDQLQKLDRTNLKNNIKAN
GDNNIRNRATEINRDRLKSGPGGGASQLKDVRKSTLEGLKAQPRRDAAAR
PTAKPGGGQAVAKAKSSGAKAGVNRPKGKKSSANRPAGKKKMASKAQNRP
KKPSGLGNVNSGRREVSASRRGGHSMGGGQRGGGRPQMSRGGGRPPMGGG
GRGGGRGGGGRGGGRR
>SMa0670 probable regulatory protein
MDKHTDDRKKNNHWKAEDRKSAATEASETRSGGNYAKELARLQEEIAHLQ
AWVKKTGARIVIVFEGRDAAGKGGVIKRITERVSPRVFRVVALPAPTDRE
KTQIYMQRYIQQFPAAGEVVIFDRSWYNRPGVERVMGFCSEKKAKRFLEI
APRFEAAMIESGIVLLKYFLDVSEEEQDRRFRQRINDPLRQWKLSPMDVE
SYRRWWDYTRAYDEMIRMTDTDDAPWWIVPSDNKKQARVNCIAHILSSIP
YERVKFEDPDLGKRQKRPADFEGDTRRRTVPNLF
>SMa0149 hypothetical protein
MSVAFIKEESAETASETLLPDRPISPHPNLVTETGLKALELHLQQAREAF
DATSTIEDVNERRRQSAGPLRELRYFAERVRTAQLVPDPASFDVVAFGST
VTFSRDDGRVQKYRIVGEDEADPKAGSISYVSPVARVLMGKSVGDVASVG
DQELEILAIS
>SMa0559 4-carboxymuconolactone decarboxylase, putative
MKRLIAPLMGVMMMTTSNAGAQQPNVEGRRFSPDQVRSVAPALEQYTQQR
LYGDVWQRPGLNRRDRSLVTIAALIARGEAPALTYYADQALENGVKPSEI
SETITHLAYYSGWGKAMATVGPVSEAFAKRGIGQDQLAAVESTPLPLDEE
AEAQRATTVGNQFGSVAPGLVQYTTDYLFRDLWLRPDLAPRDRSLVTIAA
LISVGQVEQITFHLNKALDNGLSEEQAAEVITHLAFYAGWPNAMSALPVA
KAVFEKRRG
>SMa1641 probable NreB protein
MLQVLANRTYRRLFLAQVIALVGTGLATVALGLLAFDLAGADAGAVLGTA
LAIKMIAYVGVAPVAAAFAEQLPRRSMLVCLDLVRAAVAVFLPFVTEVWQ
VYVLIFVLQSASAAFTPTFQATIPEVLPDEKEYTRALSLSRLAYDLESVA
SPMLAAALLTVVSFHSLFAGTVVGFLASAALVVSVVLPSPKASERRGIYD
RTTRGLRIYLATPRLRGLLALNLAVSAAGSMVIVNTVVLVQAEFGLAQRD
TAMALAAFGVGSMIAALLLPRLLDNMPDRTAMLAGAAVLVAGLFIGVFVP
RFALLLPLWLAIGVGYSLTQTPSGRLLRRSAHPEDRPALFAAQFALSHAC
WLITYPLAGWLGAKVGLSTTFAMLGVIAATAILIATRIWPVHDPEEIEHV
HDALPVDHPHLVGATRVGNGHRHIHLFVIDSHHPDWPTEQ
>SMa0121 hypothetical protein
MIRFERKPDPQPSLKEAENKRTKQISEAAKEENKKSGTTGAPRRAPQAAD
DDRLI
>SMa2357 putative adenylate cyclase
MPLRGPASIPFRAFGIRPSRMNPNTCTICELMFTRVMKARKITVDVSVLF
ADLRGYTALSQSLSADTVSSLLDDFYDECAAAIWEFDGLLNKTVGDAIMA
IFNFPIPHQDHAERAVLAAREIQRRCQLRRELRLAEGVGLDGGELGVGIG
IDTGEASFGEFGRSHRDLTAIGTVVNTAARAQSAAEAGRILVTRAVCERA
QSQTAASEGREYRLKGFEKPIELYAI
>SMa1676 conserved hypothetical protein
MLLMNCGDPLRGRCMPAARIKIIYITQILIATLLLLAMLSMASSIFAPVA
FALFIIALVWPTQCRLQAMLPRYLALIISFLLVVLAIVAFGGLIAWAFGH
VGRWIIADAARFQQLYDQVRLWLEEHGVAVGVLWSENFGVGWVLHTVQAV
SGRLNSTFSFWLIALVYVLLGLMEMDDFGRRIEALRNRTASALLLRGSQQ
TAMKIRRYMMIRTVMSVVTGLLVWIFTRAVGLSLAEEWGFIAFALNYIPF
LGPLLATLFPTLFALIQFGTVETVLIVFTGLNLIQFVVSSYIEPRASGSA
LSMSPVMVLFSVFLWGYLWGIFGAFIGVPITIALLTFCNQHPSSKWLSEL
FGLEMAADQVASTPSG
>SMa2343 putative oxidoreductase
MDKVILITGASGGIGEGIARELGVAGAKILLGARRQARIEAIATEIRDAG
GTALAQVLDVTDRHSVAAFAQAAVDTWGRIDVLVNNAGVMPLSPLAAVKV
DEWERMIDVNIKGVLWGIGAVLPIMEAQRSGQIINIGSIGALSVVPTAAV
YCATKFAVRAISDGLRQESTNIRVTCVNPGVVESELAGTITHEETMAAMD
TYRAIALQPADIARAVRQVIEAPQSVDTTEITIRPTASGN
>SMa0036 putative ABC transporter ATP-binding protein
MIRIENISKSNSHRILYIEASAALNRGEKIGLVGPNGAGKTTLFRMITGQ
ELPDEGQVAVEKGMTIGYFDQDVGEMAGRSAVAEVMEGAGPISAVAAELH
ELETAMSDPDRMDEMDAIVERYGEVQARYEELDGYALEGRAREVLAGLSF
SQEMMDGDVAKLSGGWKMRVALARILLMRPDVMLLDEPSNHLDLESLIWL
ENFLKGYDGALLMTSHDREFMNRIVTKIIEIDGGALTTYSGDYGFYDEQR
ALNARQQQAQFERQQAMLAKEIKFIERFKARASHASQVQSRVKKLEKIDR
VEPPRRRQTVAFEFLPAPRSGEDVVNLKSVHKTYGSRTIYDGLDFMVRRR
ERWCIMGINGAGKSTLLKLVTGTTNPDKGSVSLGASVKLGYFAQHSMDLL
DGESTILQWLEERFPKAGQAPLRALAGCFGFSGDDVEKRCRVLSGGEKAR
LVMAAMLFDPPNFLVLDEPTNHLDLDTKEMLIKALSAYQGTMLFVSHDRR
FLSALSNRVLELTPDGINQYGGGYSEYVERTGQEAPGLRG
>SMa0753 hypothetical protein with localized conservation
MKADVFDPRALREAFGAFPTAVTVITASDPAGRPVGFTANSFTSVSLDPP
LLLVCVAKTARDYSTMTAAEHFAINILSEAQKDVSIKFARPLEDRFAAVD
WARAPNGCPIFAQVAAWFECSMHDVIEAGDHVMMVGRVTAFKSSGLNGLG
YARGGYFAPSVAAKANSSAAGGEIGAVAVLERHAALFPLGDQNLSLPRYS
AAGGDPAKTLASQLERSGLSVHDWLSLLDL
>SMa1447 Putative transmembrane transport protein
MLIAAARQLEKGRMTTITVDDALDRAGTGTYQRRLMAIFGLVWAADAMQV
LAVGFTAASIAATFGLTVPQALQTGTLFFLGMLFGAAGFGRLADRIGRRR
VLIATVACDAVFGLLSVFAQDFTVLLLLRFLTGAAVGGTLPVDYAMMAEF
LPARNRGRWLVMLEGFWAVGTLIVALAAWAASLAGVADAWRYIFAVTAIP
ALIGVGLRFLVPESPLYLLRLGKTSEAKAIVDEILVVNGKMRLGAGASLV
PPPPTASAGIFSADLRKRSLMILAIWFLVSISYYGVFTWMPPRLAGEGFG
FVRGYGFLVVLALAQIPGYALAAYGVEKWGRRPTLIGFCLLSALGCLLFV
AAGTAMLIGVSLLIMSFALLGTWGALYAYTPELYPTASRATGMGAAGAMA
RLGGLLAPSLMGLVVAQSFGLAIGIFAGLLLVAAVAAFLIDAETRRVSLA
>SMa1550 putative response regulator of two-component system
MNASGVALIADDDEFFRIALGFILKSKLRFTEIIETGSLDEAIERLSERE
DVSLALFDLAMPGMQSVASLAAVRDVHPDLKVAVVSASSRRSDILSALTA
GINGYVPKGLGATDLAEAVRAILNGSVYVPPSIAGRASPSEAETPAPGGG
VQNERHRTIEFLTPRQREVLQLLVQGFSNKEIARKLKLGEGTVKIHMAAL
FRSLRVRNRQEAAAAGARLLPMTENRQ
>SMa0606 hypothetical protein
MSMKRTSHRYISAAAMIAALVTVDGWTARLSGHGAVFIGNAQARVGRPLT
PASVAGVARRTTRRTIRRSAIYVAALPAACVKTSVSGTVLWQCGATYYQP
YGGRYVVVYVD
>SMa2269 hypothetical protein
MNVPDDAERLILATGRYIGEGFDDARLDMLFLTMPIAWKRTLAQYIGRLH
RQHDDKKDVVVVDYVATPFRFWPAWLRSDARVIGRWAI
>SMa0252 conserved hypothetical protein
MRKLLLATTAIAFGLSAAAPAFAEFNDRNIRVSNGINEDHPVGNGIKAMQ
ACLDQKSGGKLKLTAFWGGALGGDLQATQALRSGVQEAVVTSSSPLVGII
PALGVFDLPFLFANAQEAYTVLDGDFGDMMNEKLEAAGLVNLAYWENGFR
NLSNSVRPVTKWEDFEGMKVRVMQNNIFLDTFQNLGANATPMAFGEVFSA
LETKAIDAQENPYVTIDTSKFFEVQKYVTETNHAYTPFLFLFSKPIFDSY
TPEEQAALRECAVVGRDEERKVIQDLNKKSLEKIKEAGLEVNTLSAEEQA
RIREKSMVVYEKHKAEIGAEVVDAILAKLEEIRK
>SMa1440 5-dehydro-4-deoxyglucarate dehydratase
MSPEEIKSRVGSGLLSFPVTHFTSDYKLNLESYRRHVEWLSGFEAAALFA
AGGTGEFFSLSPNEVGQVTRAAKDVSGEVPIIAGCGYGTSLAVETAKIVE
EAGADGILLLPHYLTEAPQEGIYAHVKAVCDSTGLGVILYNRANSVANAD
TVARLAEACPNLIGFKDGTGKVDLVRHVTAKLGDRLCYIGGMPTHELFAE
GFNGVGVTTYSSAVFNFVPELAQRFYRAMRAGDRAVMEGILHTFFFPFAA
LRDRKAGYPVSIIKAGVELAGFAPGPVRPPLVDLTGEEREILQGLIEASR
R
>SMa0802 putative ABC transporter, permease protein
MLLNFDRLGWWKLVLIGITLLTTAFLLLPILFIAVLSFGSSQWLIFPPPG
WTLRWYQELLEDPRWLDSAWTSFRIAIIVTVLSVLLGLVTSFGLNRGRFL
FREALKALFLTPMILPVVVLAVALYAFFLQIGLNGTLTGFVISHLVLALP
FSILSITNALEGFDKSIEDAAVLCGASPFEAKIRVTLPAISHGVFSAAIF
SFLTSWDEVVVAIFMASPTLQTLPVKVWSTLRQDLTPVVAAASTLLILLT
VLLMVLVAIVRKGLKS
>SMa0883 hypothetical protein
MWGCCSCYVLTMTSATVFQSDAPLDERRVIIRRHDGDVEMVELPWGLRPR
DGDARAVNVVRSKGRTFPTHRCLVPASEFRHRTFSFSLANGDWFYFAGIW
RPAINDWPESYAILTIEANDDLAPFHDRQMVVLRREQRMAWFDGLVPEGE
ILRRLPAGTFRVTRHSTSPVQPMLAV
>SMa0320 putative
MTSLNGKIALVTGASSGIGAATAAKLAEAGAKVGIAARRTDKLEDLKKKI
EAKGGEALVIEMDVVDTTSVEAGVKKLVDAYGSIDILVNNAGLMPLSDID
QFKVDEWQRMVDVNVKGLLNTTAAVLPQMIKQHSGHVFNMSSIAGRKVFK
GLSVYCATKHAVTAFSDGLRMEVGQKHGIRVTCIQPGAVATELYDHITDP
GYRQQMDELATQMTFLQGEDIGDTIVFAAQAPAHVDVAELFVLPVEQGW
>SMa0357 hypothetical protein
MVDARPVGDYPNLRQLAFLHVSHDWRGKKLALRLYQLCKDTVVGSGAEGF
YISSTPTRRTVEFYLRQGAKLMARPDTTLVSIEPDDIHLAHWF
>SMa1769 Hypothetical protein
MMPEPDSWRHMASAPKDGSRILVTIRPSEQGAAEVDLVFWSNGDQFGADG
WRASDSSPGRIIEYAEPELKCWMPMPSANLNRTSMPSPWEGEDERELDGS
GI
>SMa2205 putative ABC transporter, permease
MPEMTRRILPLLSPALLALLLFVAFPMVWVFRTSFNELAEGAYIVEAMTL
ENYTRFLTEPWYLVNTLWFSVRIALLATAISVICAYPVALYIAKTSGLQR
NVLMALTMAPLLIGLVTLVYGWIVIFRGGGLMNSLMIALRVYDQPVRYMW
DIKGVVILLVYIGTPYIVLSLLDSIERINPFLVEAARNVGANRWTAFWKI
VFPLSTPGLYAGLVIVFSLNFAAFAVPLMIGDSNTQMIGLVIYREALLNN
DLPFASALSVIMVSVNALLILGMSALAGRLILSRLEAKR
>SMa0520 conserved hypothetical protein
MLIAERVQTIADTLTPAERRLVKEIIAKPRDVALGTAGELARRTGVHEAT
ASRLARKLGFETYAGFRHAIRDEFIVKTDPALRVRRTLETSRGHGMLEML
VQQEIEALTRLSSYVDEERLAAAAAALSDRRRIFIFARGNAETLAVLMNR
RLRRMAFETVLVCGDSRDIAEQILSMGPDDALLVFAFRRQPRAYAPLIER
AGKVGAVSVVVSGTVGPSLSPKADHLLAAPRAGDADAFQTLTVPMAICNG
LILSMAQSDEVRSLSNLEQLGELIGELEGR
>SMa0187 putative oxidoreductase
MYDAPFYKGSDKLKDKVALITGGDSGIGRSVAVLFAREGADVAIVHLDES
QDADDTKAAVEKEGRKCLVIKGDVKDASFCRKAVEKTVMQLGRLDILINN
AAFQVHTRDIEDLTDEHFDETLKTNLYGYFYMAKAAIPHLKNGSAIINTG
SVTGLTGSKELLDYSMTKGGIHAFTRALSGHLVPKGIRVNAVAPGPVWTP
LNPSDKEAEDVEKFGSQTPMKRAAQPEEIAPAYVFLASPQMSSYITGEIL
PIVGGY
>SMa0583 NrtB, Nitrate transport permease protein
MSVTNLKLKPQATPQTAAQVIALGQTASRGIDGRLSRFVTQTVTNLLPLL
VTLTFFTLAWQLICSSPESSLPAPSRVLEESWELIAHPFYIGQGVDQGLF
WHVFASLQRVALGYAMAAAVGVALGTLVGQSALAMRGLDPIFQVLRTVPP
LAWLPLSLAAFQDGTPSAIFVIFITAIWPIIINTAVGIRNIPQDYQNVAK
VLRLNSFEYFGKIMLPAAAPYIFTGLRIGIGLSWLAIVAAEMLIGGVGIG
FFIWDAWNSSLISDIIVALIYVGIVGFLLDRLIALVGRAVTRGTANA
>SMa0132 hypothetical protein
MIELTPTQIRGLKLAKDGDVHPQAGKRWTHLNAQVTYAKQDRFKERPQKI
KFLTTATLNELRDHRLVKALNTDVPPEESAHGITMAGKMLLLKIK
>SMa1141 putative fnr/crp family transcriptional regulator
MNAQVVSMQDARTRATCSQAFPVADLSSLFARQPVERFAPAQAVFWQGDE
ATHIFEVTVGMLRALRLLGDGRRIIVGFLRPGDLLGVSLKERYLYTVEAV
SSVELRRFPRRRFEDEVARHSNLQQQLFSRLRDEMTAAQDQAVLLSRRSA
EEKLANFFLLMGQNQNCKQTSIVDLPMTRLDIADYLGMTIETVSRTITKL
ANSGVIATPERRSVTVLKMETLRSLADGDESDYWTPSPYRVRSMHPPVFE
GGGA
>SMa1159 hypothetical protein
MIAAALIFAAGGAALMTKTAPSADEPVEPRNPFELVPLLIFAALFAVTAT
TGAALMKGMGHSSLIGISAASGIFDVDIAVLTALRAGDGATPLQIVGDAV
LVAVLANAGGRVLVAIASGTLRYWTSLAAISLLAAGTGVAVSVLIAR
>SMa0166 hypothetical protein
MEMTTGASGRSVLGRSGGGRRPTELVMAIVATVILSSCQTSEVLSGAEFD
PTSALASSGDVSKSDLDQGKLQFMNGNYGLAEKHFRKAVELRQDNAEALM
GLAACYDRLGRFDLADRAYNQLLKVAGRQPRIVNNMGYSQYLRGEKAKAR
KLLLEARAASPGDETIEANLALLDRS
>SMa0591 conserved hypothetical protein, fragment
MVRSRWVYRKLRNFRAGIEAGISGLTRTYGLAHCTWRGLHHFETYVSSSV
VAYNLALFARLRPT
>SMa0959 probable
MSRGSTSTIAVVTGGTSGIGLATARHLLERGNRCAIFGQRPINVESAAEA
LSQDFGSERVFARSVDLAEPTQITSFFRELDERWGRAEILVCNAGISPKG
PDGPTPFQEITLEEWNAVLSVNLTGTMLCCQAALPGMVAQNFGRVVLVGS
IAGRALPKIAGTAYVASKAALAGFARSLIARYAGQGITVNVVAPGRIATE
MAGPRDSLVNRAAVARIPAGRMGEPEEVAAAIGFLTSDKAAFINGAIIDV
NGGEFVPL
>SMa0445 TRm1a transposase
MTSSNFKMEVLSGPERRRRWSTAEKLAIIHETYEADATVSIVARRHGIQP
NQLFAWRKLASQGALTATAAEEEVVPASEYRALQAQVKELQRLLGKKTME
SEILKEALEIAGSPKKHLLRSLSLPRGILG
>SMa2011 Hypothetical protein
MNRADGTRTGRAGTAPLENSAGEDSLSHRLLQPGDGRHYGLGGRVWLRPL
LVSRPAAKSISNRLTPMSSAHAHRRQKSAVVTSRFLWSANAQHSNEHPLL
PPKRPCCK
>SMa1686 putative two-component response regulator
MNDLEPIVHIVDDDQSFRTAVGRLLAASGFRVALYESGDQFLAQFADGEP
GCVLLDLGLPGLSGLELQDRLAEKAPLLPIVFLTGRGDIRATVQAMKAGA
EDFLEKPTPKEVLLETIGRALRRYALRRLEQDRKHALRRRLANLTPREFE
VFGLIVRGKLNKQIAQALGTSERTVKAHRHNLMEKLGTRSLAETVSIAER
LGLVDSAADQLQR
>SMa1223 Conserved hypothetical protein (ORF151)
MFVRVMSREECQGVVAAGDLARLACCRDDQPYIVPITYAHSGNRLYCFSM
PGQKIDWMRSNPKVSLQIAEFASNRQWKSVVVTGRYQELPATQGCHHERI
HAWSLLEKKPNWWEPGGLKPVPQEISGASAHIFFCVEMDEMTGRAACAGE
L
>SMa1757 Putative Short Chain
MYQASKESAPMTEECRYMLLTGASRGIGHATVKLFQSKGWRILTVSRQPF
AEECAWPSARESHIQADLADLTQIDRLAATVRERLPNGRLHALVNNAGIS
PKGPGKNRLGVLDTDADVWTQVLNVNLVSTALIARALMSELEAAKGSIVN
VTSIAGSRVHPFAGVAYAASKAALASLTREMAHEFGQRGVRANAIAPGEI
ETSILSPGTDELVAAEVPMRRLGEPREVAETIFFLCTEPSSYINGAEIHI
NGGQHV
>SMa2383 probable oxidoreductase
MKAVVMKEVGGTDVMEFVDRPEPVARPAHVVVEVAAAGVNFMDIGVRQGM
AWTDIPNPKVLGVEGAGRVLAVGDGTGEFAVGDRVAWVYAPGSYAQRQSI
PAASLVKIPDTVDDRTAASTMMQGLTASHFATDFYPVQPGDIALVHAAAG
GVGLLLTQIIRLRGGRVIGRVSSEDKVAIARKAGAEHVIVDTDGRFADEV
LRLTGGEGVNVVYDGSGPKTFKGSIEALRRSGTFCWYGPVLGGPGPLEIM
NLPKSIKIGYATFMDHIHTRELLLDRTKQLFDWIEDGSITVTIGETYRLA
DAAGAHAAMASRATTGKLLLIP
>SMa1751 putative ABC transporter, permease protein
MPMKLAKFAFASLTILFLVAPLIAILPLAFTSSVILTYPIPSWSLRWFEE
LFTADAWRRAIFNSLIIGTGTTLLATILGTAASLGLRNRLIFFRGSMRTL
FLLPMVVPAVVLGVGMQVLLAGFGLTNSYAGVIIAHTVVAVPFVVVSVIG
ALDGIDERVELAAQSLGASPATVFHRVTLPLALPGVLSGAVLAFATSLDE
VVLTLFVAGPNQRTLARQMFSSIRENISPAIAAAAFLFIAATIFFGLVVV
LSRFMLAKRR
>SMa2219 probable decarboxylase
MSSRRLIVGITGASGAAYGIRALELAQAAGVETHLVVSRSALLTLNQELG
LQKADLAGQADMIYPVADIGASIASGSFPTIGMLIAPCSVRTMSEIATGV
TSTLMSRAADVALKERRRLVLMLRETPLHLGHIETMGALTRMGAIIMPPV
PAFYAKPQSLDEMITHSTARALDLFGIDTGAVKRWSGLKDALAEEAT
>SMa0471 conserved hypothetical protein
MASNALVQTRIDAEVKERATAVLENMGLTVSDAVRILLTRTANEGALPLE
LFSHSEAHDAWFRAKVLRALEDTRPDVDDADADAHFRERRAAALRKAAAG
DR
>SMa0846 hypothetical protein
MALHRTGKADVRHGYVERCNGRMRDELLNESLFFGLSHASRAISKWSTTA
IRSGRTRRSDTAPRQPILGSSPQPSQLVYRKRSRL
>SMa0625 hypothetical protein
MMARERTGLSPALALVAAFMLFLQSVVHAFAGQPSDILPFDAFGNSLCVT
GTP
>SMa1430 Conserved hypothetical protein
MISSPIARRPSPRKGVRLPHEITIPAKQTFRLDRYVDLPLNSGEIRGRTV
CTLVSPGTELGWANGDVFPIRPGYSAVFEVEEVADGVDGIRPGELRFAMG
MHRSTQTHTARDTVPLPAGLRPEIAVLSRLMGVSLTSLMTTKARPGDHVV
ITGAGPVGLLAAQLFKISGYRVTVVDPDPLRRAQLASCGISDCRERVPLQ
SSLQGQVALVLDCSGHEGAVLDACRIVRRLGEVVLVGVPWRKLTEISAHE
LLNAVFFNLVTLRSGWEWELPVHARAFEWEELLGGYNNARHSVFGGFARA
LDWLAEGRIELGGLLRRVPPTDPASLYAEIAARRNEEPFIVLDWTDFDGP
DLKSS
>SMa1649 putative ABC transporter, permease protein
MELIMKILTSFSARLNKMPRAALITIGLFVALGTFAPWLAPQDPSAQNLL
EAGVGPNGAHWLGTDHLGRDTFSRLIIAANTSLVSVGSVLAIAMTIGIAM
GTIAGYHRGWVDDVLMRVVDVGLSIPSLIIALAVIGIVGPGYWTMVMALA
LAWWPMSGRISRAVAVSIMSKPHIEALRVLGASPWRIYFNHLLPGTIGAV
MVYATADAGVAALAVATLSFLGLGIQPPTPEWGQMLVDALPYLESDPRQV
ILPGLALTAAVIGFNTLGESIALNRIPKPLTRRMLAARRIEVAGWAKESI
DAK
>SMa0072 putative ABC transporter ATP-binding protein
MTTPIIECRNLQKWYSGVHALKNVDLTIHPGETVGLVGDNGAGKSTLIKI
LSGVHHQDSGDVLIEGRRVALRSPKDAMRYGLETIYQYNSMVPTMSIARN
LFIGREPMKWSVFGVGIMDQKRMRVESIQAIADVDLHLRSPDALVGELSG
GQRQGVAIARAMHFKSKVLILDEPTNHLSVKETDKVIGFVRGLKAQGLTG
IFISHNMQHVFQSCDRIVAMARGEIVFDKPTADTSIDEVHALL
>SMa0793 hypothetical protein with local similarity
MADFQGETEMTAEVFDPRALRDAFGAFATGVTVVTASDAAGKPIGFTANS
FTSVSLDPPLLLVCLAKSSRNYESMTSAGRFAINVLSETQKDVSNTFARP
VEDRFAAVDWRLGRDGCPIFSDVAAWFECSMQDIIEAGDHVIIIGRVTAF
ENSGLNGLGYARGGYFTPRLAGKAVSAAVEGEIRLGAVLEQQGAVFLAGN
ETLSLPNCTVEGGDPARTLAAYLEQLTGLNVTIGFLYSVYEDKSDGRQNI
VYHALASDGAPRQGRFLRPAELAAAKFSSSATADIINRFVLESSIGNFGI
YFGDETGGTVHPIANKDAHS
>SMa1760 Hypothetical protein
MFQCCEIRHGTGPENGILRYLMHLLLRCFNYHVRQTVSAAMQPAPFANRR
YATIEDKTENCANGTSMTEPTPATKKTDSMSATWNVTSLADAAAAVMRVY
GAGGTVRRLSSERDETFLFTRSDGRDFILKIANPAEDAAALEFQDGALLH
LEAAAPVVPVPRLVRTKSGEQSHTLSTADGPRVMRLLTFLRGELQYRTPA
SEAQSRNVGRALAALGLGLEDYRGRPPAGKLMWDISHTLDLTAVVDHVAP
ERRAQAEAVLAEFERALPAITGLKRRQIIHNDFNPHNVLLDPSSPTTVVG
IIDFGDMVHAPLINDLAVALSYHLGTENWAARTGSFLEGFHSVRALEPGE
IEVLPVLTRARLAMSLIIAEWRSARFPENRDYIMRNHATAWRGLQNISDL
TPAGLKKLVPNLYEV
>SMa1851 Putative dehalogenase
MSIFRPKYITFDCYGTLTNFQMAEAARDLYSEQLDEARMAEFIKNFAAYR
LDEILGDWKPYAEVVHNSLERTCKRNGIEFREEAARMVYERVPTWGPHAD
VPAGLARVAKEIPLVILSNAMNSQIMSNVEKLGAPFHAVYTAEQAQAYKP
RFKAFEYMLDMLGCGPEDILHCSSSFRYDLMSAHDLGIKNKVWVNRGHEP
ANSYYGYVEIADISGLPGVVGL
>SMa0675 cation (Ca) exchange protein, possible
MHSETKSQALQGRRTIRMRTFEVKGSTKQSRSAIETVLRELRRSPLLALI
FFVPVVILLERMAPKAHTLLFFVAIAAIVPLAALLSRATESVAAKTGDAV
GGLLNATLGNLTELIITLAALQAGQYLLVKASIAGAIVTNSLFMFGGAFL
IGGLKHHLQEFNRVNARFQAILLFLATIAILVPSLTSGLDRTPAAEFSQQ
LSLGLAILLIVVYGLGMVFSLKTHKELFASVGQAEAGEEPWPLSIGIVVL
IVVTLLVAMVSEIFVGSVQAAAQHLGLTPAFVGFIVIALVGGAAEMTTAF
SAARANRLDLSVGIALGSAAQIALFVAPVLVLVSYFIGPQPMSLEFWPGA
VAMMLVATMAATLLSNGGHAAWYAGVMALAVYAIFALTLFLLPPGVTQ
>SMa1933 Putative transcriptional activator
MRIFVAIAETGSFRAAAARLSRVQSALSHAVANLEAELGVSLFDRSGHRP
VLTPAGRSLLSDARAILLKTDTMRARARGLGGGVELGLTIALDPQFPPGL
AGAALEEMHRDYPSVAIRLLTASLGEAVHALRERRCAVAISGIDLLDPHI
ERRALALVPRAAVVAASHPLAVLAAAGNPVTAADLADHAGRCRGSISADT
RPRL
>SMa0046 hypothetical protein
MCPISCPTTVRVTTFWKAMRKMTLFFEGYASRRFRETALLPEGFLRLVIL
RRTQAADDALRGDNGLHHPASVWSSAICAFSSVSAVRWSLKCCDISNPKV
WLRTCRIRGRSFRGWI
>SMa2041 probable oxidoreductase
MRYIRPSSIEDAVGLLAEASGKAAVLAGGSDLLVRMKGGFIEPELIIDIK
AIDALRHITESEDGFVIGAAVPCAVLGENAALRRAWPGVVEGANLIGSKQ
VQGRCTIVGNLCNASPAADSVPALLAAGAEALIRGPAGSRTIAVEMVPVG
PGKTSLAPGEIIEAILLDKRQPRSGDAYQRFTPRTEMDIAVVSAAVKLTL
DDQGVVQSARVALGAAAPTVLLVEEAAEILTGSRVDDKTLDRLAAACSGA
CRPIDDKRGTVEFRRKVAGVLAKRVALAAYQRAGLE
>SMa0118 hypothetical protein
MFRLGIGFACFATSAFGLQPLGAAADDPAIARGEYLVTMGGCNDCHTPGY
FFGKPDSSRFLGGSDVGFEIPGEGVFIGRNITPDKETGIGSWTREQIVTA
IQTGQRPDGRMLAPIMPWHAFAQLTKEDAGAIAAFLQSLKPVSHQVPGPF
KPGEKVSTFMFRILPPGETAAAAPK
>SMa1139 Truncated response regulator receiver domain
MGLLMANAPTLPEHILVVDDDSRIRQMLSRYFEEEGYRVSLAGDGQEMRE
RLDKQPVDVILLDLMLPGEDGLHLRATYARARMYRSSC
>SMa1310 Hypothetical Protein
MTYPLPKCDGYSRRPLNRSMWQWEDNSNFKLKQSDARPAASQSVATAYAG
EGREFPAFAHLDIDASYRPCEG
>SMa1963 Putative polyketide synthase protein
MQPKQVRLTGATETLLITLQAKAAESAMPDSLLRDRFAADALHRLDPDGR
HLEIGHDMTIGIALRAYMLDRWTEAFLQRCTEATVLHLGCGLDSRIFRID
PGPRVRWFELDVPDVISLRKRIYPGRAGCTTIACSIVERGWIERLPADKP
TMIVAEGVLPYLEDHEVSQVLRRIAGHFPEGEIAFDAYSSTAIRLLRFNP
AIRATGASLQWAVDDPAELERQVPGLQLIEDRSDWDVGQVARMSPAAQVA
LQLFSTIPYPMGRLIRYGF
>SMa1156 probable alcohol
MPKMKAAIFVAPGRIVLDEKQIPDVGPLDALMRITTTTICGTDVHILRGE
YPVARGLTIGHEPVGVIEKLGSAVRGYSEGQRVIAGAICTSGHSNAALCG
CHAQDGPGTKHGWKGMGGWKFGNTIDGSQAEYVLVPDAMANLAPVPDGLA
DEQVLMCPDIMSTGFSGAESGAVRVGDAVAVFAQGPIGLCATAGARLIGA
TTIIAVESVPARMEMARRMGADDVVDFTVSDPTAEIMRLTDGRGVDVAIE
ALGRQETFEGALRVLRPGGTLSSLGVYSGDLRIPLDGFLAGLGDHTIRTT
LCPGGKERMRRLMEVIASGRVDTRPLVTHRFKLDQIEEAYDLFANQRDGV
LKVAINP
>SMa0306 putative histidine ammonia-lyase
MRIDSLFHSVVGACLLFSSPVMADVLLNGRNATPEMIVRVAKGEAVTVDG
ESLDKVERAYAVLLQGAKEGQEIYGLTVGVGWNKDRKMVDATGELTPELM
EASREFNEGLLRAHSVGVGPDAAVPVVRAAMAVRLNNILTGGPGVQPHVA
EMLLAFLNKGITPTMPSRGSVGQADMTLLSHIGLAMLGEGDVDYQGRRMS
AAEALKAAGIEPLKPFGKDGLAILSSNAYAAGLAALAIHDAEQLLSTSRL
VYALSLEGLNGNVSPLLEDVAALRPFPSHLSSTTELRALLEGSYLWQADD
KRILQDPLSFRTAPYLLGSFADSLARTKALVQIQINSSDDNPGIVVGVEP
KSDLFQARRGYVEGGAVLPTANFEPLPWIIAFEELGIVLAHHTTASSERV
LKLNNPTHTKLARYLGTENTHHAFLVVEAPLMALATENRALAQPTSFDSR
PIAGGVEDVGTNAPLVVERIRAQIDNSFTILSMEMLHAAQAADLRLKGHP
ERKLSTATTSFHDAFRQRVPFLDRDRSMTPDIAAGTAFLKEYALTN
>SMa1375 probable ABC transporter, periplasmic solute-binding protein
MLLASTLAASAAWAQSITIAIGSEPSTLDPQLRDDGGERQVNDNIYETLM
ARTPTGELVPGLAAQPPKQVDATTWQFKLRDGVKFHNGEPFNADAVVASV
SRVIDPANNSEQMAYFGTIKTAEKVDDLTVNLVTTGPDPILPSRMYWMKM
IAPGYAKDGDLAGAPVGTGPYKFDSWNRGTDLKLVANADYWGGEPQIDDV
TYRFVTEPGTRLSGLLSGEFDVITNLLPEFTTNVPKFAAVPGLETSVFVL
GTDNEVTKDPKVREALNLAIDRKAMAEGLFMGYATLAKGSHINPAAFGFN
EKLEHYPHDIEKARALIKEAGAEGKPLVVVGESGRWLKDREQIEAVAGYW
AETGLNVTTDIQEFSQYLDSLMGDGPRPDAIFIANSNELLDADREMSFIY
HKDGAAASNSDAEMATMIEAARVETDAAKRKALYDDIQKKGHDLNYTVPL
FNLQDIYGMSERMEWQPRVDAKLMVSEMKVTE
>SMa0776 TRm23b IS ATP-binding protein
MLAHPTLDKLNAMGLAGMAKAFGELVANGEAEHLSHAEWLGLLLEREWSS
RYDRKLAARLRFAKLRHQATPEDVDYRADRGLDRALFMKLLGGDWINAHD
NLAICGPSGVGKSWLACALGHKACRDDRSVLYQRVPRLFAQLALARGDGR
YARLQRTLGHVQLLILDDWGLEPLNEQARHDLLEILEDRYGRRSTIITSQ
LPVSAWHSVIGDPTYADAILDRLVHNAHRVELSGDSLRRNLPRKA
>SMa1554 hypothetical protein
MKTPSRPAARPLDWTAVRSRMAAAIEQTEALLEAAQQDSEAQYEQRSPRV
AVDVSAGQSEEQAVGLVIFVLADRQFALEIRYVCEIVSRARVSPLPGMPP
HACGVYDLRGQLLPVFDLRGPLDLPQEARIADDWAIVCGQERPEFLILSE
AAPEISTLPLEQIRAAEPAPGKPWHWATTKIGAVILDGRRLLDDRRFFLE
DEQITASDETERTEFHEPPTSE
>SMa2127 probable ABC transporter, ATP-binding protein
MENLLSVQNLTKRFGAVTANDSVDLDVRKGEIHCLFGENGAGKSTLSACL
YGYYRADSGVIRFKGQVAELNSPADALRLGIGMVHQHFVLVENFTVLENI
IVGSPDVGMLLSKSTARQKVEDLCLRCGIELDLDREIWQLSVGEQQWVEI
LKALYFGAELLILDEPTAVLTPQQSDQLFVILDGMRRQGLSIILISHKLR
EVMQSDRVTILRKGKVVATVETATTTAESITALMVGHQVTKRVSDRSVAP
GREVLVVDHAVAIGEWGEEVLCDINFTIAENEILGLAGVAGNGQKELFEV
LMGVRTLSSGRFHLNGEAIVAPTSREMLDRGVGLVPDDRFREGLISEFGT
AENLVLGWQRKPEYRRGPFLDRGKINDLAQRKLEEFRIVAASTDLPVERL
SGGNAQRVILAREFLNAKCLLLANQPTRGLDVAASEFVYEKILEKRAEGF
AVFLASEELDDLLRLCDRIAVIFKGKIVGTVRPEETTLLELGMMMAGNAS
NLGGQVNDFGSKALRQ
>SMa2109 conserved hypothetical protein
MKFAKDTLMVSNQPKILVVGATGKFARWVIPELMRRDAVVHALVRNDARA
AVARSLGVAEVFIGDLRNTDSLAEATRGMDGVFHIGPAFTPDEAAMGIAI
VEAAERNGVKKFAFSSVIQPTNTRLKNHASKIPVEEALYSSRLEYTILQP
ANFMQNIGIAWPSIILHGRFGEPFPKDIKIARVDYRDVAEVAAIALTEDK
LAFATLELAAGMFSRNDVVAAISAELGRPIEAFEPSFSEWVQSARLPYSE
QQMHLLSKIHEHYRNYGLGGNSLSLTAALGREPRSLRDYIRELARQNDPL
TPHLAKDS
>SMa0551 hydrolase, putative
MGRRVGMALIGPGFIDLDALSDIDTGVLGLDHQPGWKKGRVWPRDYVEEG
PVEMLTPEELAFQKRYAFAHLIRNGITTALPIASLFYRAWNETPEEFASA
AESAADLGLRVYLGPAFRAGHSVIEADGTLTVEIDAARGRAGLDAAIAFC
AAHDNTHGGLVRAMLAPDRVEYWTADLLKRTAGVARDLGVPVRLHCCQST
FEVETIRRSFGTGSAEWRHDIGFLSERALLPHGTHTDREGLRIIADSGAT
VVHCPLVMARHGAALNHFGDLRRAGLRLGMGTDTWPPDMILNMQIGLMLG
RVMGGELDSPSSADLYDVATLGGADARGRPDLGRLQAGAAADIVVIDLAA
HHLGQVRDPIAGLVASANGRDVRTVFIAGRRVMSEGTIPGFDFAEAHARA
GAQFERLVAQYPRRTWRHPAVQSIFPPSYEVTRT
>SMa1008 Hypothetical protein
MSAGHFGYIQVIHLGLLVGFDTIQCGVSNDGKVNTPVEKIASIHFSAWRA
KMRRREFLFSVAAAGSIGLIGPARAANEKMIVYKDPNCGCCRAWAEAMKA
AGFSVSTEEAVDLAALKGRYAIPAEMHGCHTAIVADYYVEGHVPLDAVTR
LLAERPDIAGLAVPGMPEGSLGMGDHPRASYDVFAVNGDGSSTVYQTVRP
KS
>SMa2105 conserved hypothetical protein
MLIKGDKLSQTGKCMRNETAGRLGRFVETAVGGERVKAFVPPPLPPNPPL
EITGLLTRLSAAERALGRLDGVSILLPNKELFLYMYVRKEAVLSSQIEGT
QSTLSDLLRFETEAISGEPVDDIREVSNYVDAMMFGLERMRQLPLSLRLI
REMHQRLLDSGRGGRRSPGEFRTSQNWIGGTRPGNAMFVPPPANEVMTCL
GDWERFIHEETPSIPPLIKAGLIHVQFETIHPFLDGNGRLGRLLITLFLC
ATGVLQQPLLYLSLYFKSRRPDYYRLLQEVREYGTWEAWLEFFLDGVAET
ADQAFETANRIARLFHDDRERIVRESERTGSVLQIHEIMRTSPYLTAASA
AKRSGLTVPTVNAALDQLQRLGVVEEATGRRRGRVFVYRAYMDILSDGAG
TNATRS
>SMa0974 hypothetical protein
MYEHLAECCVFVAGKRNIIESVFLRVAEFCVSTVDNGTEKIFRFIDGEAV
VRAVDVGILMRVGATDVISYYGIRTALEGSILEVAVLPPEGLAWHPVGQR
PSTATDPINSPLSSGSNLQ
>SMa0319 putative AraC-type regulator
MANIEASRQNVSPPIPAVAGNPIYPAVLVPELNEVIARPDRTSPLGLEQY
IAGRTLVSGDTPAWSDMFVQVYSRLNKQEPFLVPAVAEPLIVWVMSGEAV
VEERDLDGDWVANTVTVGDFFLTRSPTPYEMRWRSVDAAPFQVMHLYLSV
PLFERVAYDVLGCAAPPALRDISGGRDTQLSHLLALVHQELTAEGKGSQL
FIQGLAQSLAVHLIRNYAANEADDRQNALTGFKLRRAVAHLEEHLAEPFN
LAQLAETVGMSEFHFSRLFKKATGLSPSRYFIRQRVARAQLLLQETDTSI
IEIGMSVGYSSPSHFAQVFRRETGLPPSHYRRG
>SMa1030 TRm1a transposase
MEVLSGPERRRRWSTAEKLAIIHETYEADATVSIVARRHGIQPNQLFAWR
KLASQGALTATAAEEEVVPASEYRALQAQVKELQRLLGKKTMESEILKEA
LEIAGSPKKHLLRSLSLPRGILG
>SMa1037 hypothetical protein
MMRATVKRAAALALPLVLGGGCVSASEYAAKNAGFSSVEAKTAEAVGKQT
VWIQSQQHARVVSDRVKTLMAKKAIDVETAVQVALLNNKGLQAAYADLGD
SAADAWQSTMLVNPTVSVGLTGIGTPGLEAFKSVEGMIANNILALATRDR
NIAIADTGFRRAQLNAALRTLQLASDTRRAWINAVAAWETVAQLNQAQAA
ADAASELAQELGKSGALTKEGQAREHVFSAELAGQTAKARLEARLAKEEL
TRLMGLWGSGIDYQVPNRLPQLPKGIMKRDLIEAEALQRRVDLQMAKLDL
EATAKSYKLTEATRYVTDLELLTGFETERELEEGDIKRETTGQAELEFVI
PIFDSGRARMRKAELAYMRAANLLAEKAVNVRSEARSAYQAYRANYDIAR
HYRNSVVPLRTKIEEESLLTYNAMITNTFELLADSREKVNANLLAVNAKR
DFWLAEANLAPAIYGGGAGAAAGETEVAAAAEGGGGGH
>SMa0302 putative ABC transporter, periplasmic solute-binding protein, family 5
MKRHLLKGTALAVTMAACSFGATVAFAQEGCVRVLGYESDGEKQTMDPAA
LIGTDSVYHIRAVYEPLVDRSNTMQPVPALAESWESNADATEWTFHLRKG
VKFHDGSDFDAKDVVYSYRRLLDPAVSPGGFSTLAFLDADGITAIDDHTV
RFKVKEPVVELPMLIATKYALMVPEGAKSEDLRVKGNGTGPFMQETFTIN
GAVRVMRRNPTYWRAGLPKSECLEITTSLEPTSRLSALLSGTVDLSLTVD
PASLITLKDNPAVELAATPGATSLYIAVWTDTPPFDSVKVREAMKLVMDR
QKILDTVLLGYGEVGADTPIPPSSPFGLGTPAKTADIEKATALLAEAGHP
DGIDFDLYTSDSYPGMMLLAQVYAQMAAPAGIRVNVITSPAEGYWDTIWL
KKPAVISYTSARPPAEALTLSLNSKSEWNETHWTRDDFDALVVKAGQTAD
EKQRNDLYRDAQRIVADEGGMILPVFSSVVAGLRKGCSGYEPNVDVNRID
YAELTCAD
>SMa0134 hypothetical protein
MRTVTGLFDDYGDAREAVSDLEAAGVPSDDISIVANNIGDRYSTDGSNAA
EGAGTGAGLGAAGGGVVGLLTGLGLMAIPGVGPVVAAGWLASTAAGAAAG
AIAGGAAGGLIGALTESGVDEEDAHVYAEGVRRGGTLVTARVEDSVAPRA
EAILKQRKIVDPAARRSIYAQEGWSRFDENADPYTLDQVDRERERYRSMM
P
>SMa0093 putative D-alanine aminotransferase
MGRIVYVHGQFVPEEEARIGLFDRGFLFGDAVYEVTAVIGGRMIDNDLHL
GRLERSLRELAIPLGLSRKEIAGVQAELIARNALLDGTVYLQVSRGEADR
DFLYSDALAPRLVGFTQAKTLTGTKAQQDGISVDLADDPRWHRRDIKTAM
LLGQVMAKQAARARGFDDVWLVENGLVTEGASSTAHVITGDGRILTRAAS
RATLPGCTQRALALLCAAEDLAIEERAFTPNEAQAAAEAFQTSASSLVMP
VVRIGERVVGNGKPGPMTRKLQALYLEAAGVPV
>SMa1256 Hypothetical Protein
MTIETQNPAPREFDVRPILRSGGEPFQAIMEAVNGLRPGQALRLLAPFRP
QPLFKVMEGRGFSHEAQEIQGGDWEVLFKPNAAGAPVEVSADADNAASWP
DPVENLDLTELDPPEPMVRILAAVERLQPGEVLFALLSREPIFLFPELSK
RGHQWAGNFDETRTTFRIFVRVGDKG
>SMa1334 conserved hypothetical protein
MKRIYVVGTADTKGEELVYLASCVEAAGGRPVLVDVGTRRPTVLVDISAE
TVAAVHPGGAAAVLSGNDRGTAIAAMGEAFARFLPARDDVAGVVGMGGGG
GTSIITAGMRRLPLGLPKVMVSTLASGDVGPYVDVSDIIMMPSVTDMAGL
NRVSRVILKNAAEAITAMANRPAEETASKPAIGLTMFGVTTPCVTAIVER
LKADHDCLVFHATGTGGRAMEKLADSGLLSGVLDITTTEVCDLVFGGVLP
ATEDRFGAIARTDLPYVGSVGALDMVNFWAPETVPERYSGRLLYRHNPNV
TLMRTTPEECAAIGRWIGAKLNLCSGPLRFLIPERGVSALDIEGGAFFDP
AADAALFEALETTVNRSDRRRIERLPLHINDPQFAEAAVAAYRDIANP
>SMa0263 putative alcohol
MNLFGTLRAPRELLFGAGQRHALGGIAAKLGHRALIVTDTRLAVDADLLA
LVRRLEEAGLEVMVDSSTLPDVPVESAIVSAAAASGFAPDLVIGIGGGSC
LDMAKCVTLLLTHGGRPQDYYGEYAVPGPVMPLIAIPTTAGTGSEVTPVA
VLSDAERSLKVGISSPHLIPAVSICDPELTLSCPPGLTAIAGADALTHAI
EAFTAIRREPVPGIAQQRVFVGKNELSDHFALSAITLLWQGLERACKDGA
DAGARETVMLGATLAGLAFGVAGTAAAHAIQYPVGALTHTAHGLGVACLM
PYVMTWNAPLIRDELAQIAHAAGLGGPDEVIPALVSLFERIGIPATLRDL
GLEEDRIDWVAEQSSGIARLIQNNPRPLNPHEMRNLVAAAHCGDRSRLN
>SMa0250 conserved hypothetical dedA-like protein
MTLVVFIVSLLGAMAIGVPVAFSLMFCGVVLMWYMGMFNTQIIAQNMIAG
ADTFTLLAIPFFILAGELMNAGGLSRRIIDFAIACVGHIRGGLGIVAIMA
AVIMASISGSAAADTAALAAILIPMMAKAGYNVPRSAGLIAAGGVIAPVI
PPSMAFIVFGVAANVSITQLFMAGIVPGLIMGIALVATWLLVVRKDDIQP
LPRTPMKERVGATGRALWALGMPVIILGGIKAGVVTPTEAAVVAAVYALF
VGMVIYRELKPRDLPGVILQAAKTTAVIMFLVCAALVSSWLITAANIPSE
ITGFISPLIDRPTLLMFVIMLVVLVVGTALDLTPTILILTPVLMPIIKQA
GIDPVYFGVLFIMNTCIGLLTPPVGVVLNVVSGVGRVPLGKVIVGVTPFL
VAQILVLFLLVLFPDIVIVPARWLH
>SMa0689 hypothetical protein
MIKLLHTLLVGSAASLAPLLGAIDAALAQAEGPASVYDYAAAEEPPVFDD
PAKAVEAFKSVLAANDFDDLARLLGLNAAKLKAGEGSMETFGLIREGAAR
NIVVRDLDGRKIIVIGDRLWPLPFPIVKDEAGKWAFDTYVGLEEIVNRRV
GENELEAIETARAYVEAQRDYVSQDRDADGVLEYAQKLISSPGQTDGLYW
PSDQGDGESPVGDAISEAALEKARTGEGYFGYRFRILTSQGDNIAGGKYD
YTINGNMIAGFALVSWPVAYAETGVKTFVINQQGIVYERDLGPSTEEIVP
FIDRFDPDEKWSVVTD
>SMa0204 putative two-component sensor histidine kinase
MLHSAPAAMFDHYLGISRLLAGQLDFRSAIRSVAAEVAHIIPHDHLDVCV
LLEDGNYHTAYETGIETAWGDLAGAPVVNSPIRSLLWGEVDFLLADDAMT
DPRFHFDGAFKRPIVEQSLRSRLHVPMKVQGAIIAALSCSSQEAGAYTME
DVERARIIADLLTPYFFALRAAEQAQRSAIVEAEARAREEGLRLGALKLT
EALEQERQRIGMDLHDQTLADLTRLARRIDRLSRNGEVAPEALEPVSRSL
QHCMQDLRQIIEQAKPSVLQLFGLTQAIEHHLDRSTRDTGSIIEWGLADE
TNGALERLEPTVIVALFRIAQEAINNAVRHAAPLAVKVRLDADDDRLSIE
ISDDGTGLAKTRGRIGEGIDNMKTRARLISARFTIGPGHNNRGTVVRVVL
PLAPHDSGSIEERAE
>SMa0728 hypothetical protein
MQVKQWFGFVPDFIITLDAEYCRACGAPVFTIRGHNVEEFVGVVRRYGAN
AAGVRAIVDAAPARQR
>SMa1197 hypothetical protein
MLQRKGPSVRRSSMEDFTSRMSRDRARNDGGYRADSCPCDPRSRPLEPDA
IHLAQTCRPRAVPSATACPEIASSRVRDILPRPPGLAIELADGAVLISRF
SRRSLHCCAGTSWALLFWRW
>SMa2243 hypothetical protein
MYHRQYEEIMVAEVGPWAKEKLDILARYLDFYTKVLKNQPWRTIYIDAFA
GGGSAKVRIKNEPAATFDLLEPTDSQDGEQEEFLHGSPRVALDIANPFSR
YVFCEPAAKRAVELNELEAEFSNSRQIKLLKVPAAEGIAWVTSQAISKKT
HRGVAFLDPFGARLEWSSVQSLADTGLFEVVVNFALNMAINRMLPNDGDV
PVAWADTLDRYFGSHEWFEEVYSSDAHGLFASTEIRKRDDYSERLLELYR
RNLKNAFGFVSTPRLIRNTRGAPLYYLLWAGPHRKGLEGADYILRMGDKL
PKIRKGST
>SMa0557 LysR family transcriptional regulator, probable
MQREELGDLLAFLAVAEEESFTKAAARLGTSQSSLSLIIKRLEARLGVRL
LTRTTRSVAPTEAGEQLFSTLAPAFGTIEAQLSALSEFRDKPAGNFRITA
GQHSIDTILWPKLSAFLLAYPDIKVELVAESALTDIVAERFDAGVRLGDQ
VEKDMIAVRIGPPARMIVVGAPSCLRDRPPPKTPQELTTHRCINLRLPSY
GGFYAWEFERDGHEVRVRVDGQVAFNGVPQIVKAALDGFGLTYVHEDVVR
EYLKNGRLVQILDDWTPPFPGYHLYYPSRRHPSPAFTLLVEALKE
>SMa0794 conserved hypothetical protein
MKFSLFVHMERLDASQDHKTLYEEFIKLCEIADKGGMHAIWTGEHHGMEF
TIAPNPFVTIADLARRTKTARLGTGTVIAPFWHPIKLAGEAAMTDLICEG
RLDIGIARGAYSFEYERLLPGLDAWSAGQRMRELIPAVKGIWAGDYAHDG
EFFKFPATTSSPKPLQKPHPPIWVAARDPNSHEFAVANGCNVQVTPLWQD
DEEVRSLMARFNDACAKDPEVPRPKIMLLRHTYVGSDEADIAQAAHEMSV
YYNYFFAWFKNERPIRQGLIDRIPEEEIAANAMLSGEAMRRNNVVGAADE
VIARIKSYEAMGYDEYSFWIDTGMTFERKKASLERFIADVMPAFAE
>SMa1823 Hypothetical protein
MALHEALNAAGRVPGPIVKSIAAGDINVTRTYSDRFDLDVAPAIPCTDAF
GVIVQLRDFDTHRLWRRGELVYEGGHAKASLAITDLRDQWQCHHLSPFDN
IRFHIPFSRMRAFAEEVGRSEYMALACVQGRIDPVMHGLAQALLPSLDDP
SDANPLFLEQINLAMLAHLSQTYGGLHFPVDKKGTLSPWQERLATDFLAS
HFNKPFSIGDLASRCGLSRSYFNKAFKESFGRTPSKWLTEYRVARVKEML
LLDLSIAEVSINCGFADQSHMTRVFTSLTGDTPARFRRKNRTFGVALDPL
MV
>SMa2121 hypothetical protein
MREFRCVDRPLLNDWHVVADRSALTLNSVFTTRLMGHDLQVTLDGRYNLQ
VVALDTGKEVCSDSRYGFIWACLGRPERDIIYLPETNEADRHLLGGGSIA
VRVSGLRAVENFLDMAHFPFVHAGWLSDEPHTEVMPYNVTITAADELLAT
DCKFHQPIASPTAQTVMVVDYVYKVFRPYTVALRKSSPLDPNRKDLIVLF
IQPVDEENCIVHSYLCYLKQGTEAADVRRFMQLIFAQDKPILENQCPRRL
PLDPRAETPIRADAVSVHYRRWLRDRSVTYGAIAYPV
>SMa1142 FixL-related histidine kinase
MEKRVEMVEETVSDIADLKSAQADVAAREAHLRSILDTVPEAMVVIDSKG
VISSFSAAAERLFGYTESEVVGLNVKVLMPSPHREAHDQYLNNYLRTGER
RIIGIGRVVTGLRKDGTTFPMELSVGEATSDGRRIFTGFIRDLTSRQRIE
NELRQAQKMEAVGQLTGGLAHDFNNLLTVITGNLEMIEARLQDEKLMPLL
QEAQAAADDGAKLTAQLLAFGRRQPLNPKLVDVGELVTNFSKLLTRTLGE
TIELSTVVKGSANLALIDVSQLQNTVLNLGLNARDAMPNGGRLTIEVSTT
VLDRDYALMLPEVRMGHYVLVAVTDTGVGMTEEVKRHAFEPFYTTKGFGA
GTGLGLSMVYGFVKQSGGHIQLYSEPDRGASVRLFLPAAEGENQLAEIGA
AEEAAPQPMPRGHETILVVEDDARVRRVVVARLRDAGYSVIEAEAGTKAL
QLLAEHPEVSLVFTDMIMPGGIDGGELAEHVRALRPDVKMLFTSGYAEPS
AAGRAVGSWLQKPYTARALALRLRELLD
>SMa0123 hypothetical protein
MQNSKTSITQQFGSYKPSKAILVWACVATAAATMIVGFNWGGWVTGGTSR
TAAAAAADIARGELASAICVERFNAAPDAAAKLIEFKAITDGYKKRQFVE
AGGWATMPGETAPDSRSVQGCATALAI
>SMa1335 Hypothetical protein
MHAHHLNWSNQFLCGRECARSCRGLRIPDRRLDQGCCRPFCNGLRAFGRE
APNILEHDQHLDTENKRERLSQLLAGLSRHLALEWHGDLSKEFDAVPSRE
LWFVAERTCLALRPLVERELQRSLERRLELIDARHGGRKQIANASERIGR
FGSPVEDRAERLVHHRMNQRRLARKVSVGGGARNFSGFRDLTHRGGYTGL
HQPRGGIQHEFACAGSGTAFGGCGGFAMLT
>SMa2289 hypothetical protein
MFLEQVVTKPDPYAPFLLSQAIKANGFVFVSGQAAIGDNGEIVGEGDFDR
QAGQAFGNLDRALKAAGSGLDKVVKVTIFLRSMENFAKIVELRRKWFSAP
YPADSIIEVSSLYSPKAMIEIEAIALDGSH
>SMa1097 hypothetical protein
MPLELHCTHWSEERQSVSQNRAGGPECAANAHRLRRAALLERFGSGLITG
AADADPSGIATYSQAGAQLRAKMRRRGMGVAQNMRRLFPSRAGQLEPRPF
RDLSATVKRAVLSRLVDKFSPANASRCSTSEKHDEASGHRLHFYQETVLK
LCRIIWRPSAAWVTPSTTAKT
>SMa1821 Conserved hypothetical protein
MGSRPRHGVGHQQPTEEATTMTATPTPGKLLVSPKDHALIMIDFQSQMSF
ATKSIDAVQLRNNAALVASAAAGFGVPTILTTVAEKSFSGPMFAEITDAF
PEQPLLDRTSMNTWEDAAVIEEVNRIGKKRLVFCGLWTSVCIVGPTLSAL
DQGFEAYVIADACGDVSDEAHERAMDRMVQAGVRPMTSLQYMLELQRDWA
RTDTYDMTTGIAKKFGGAYGLGIIYAKTMFGASEGH
>SMa0403 hypothetical protein
MSAAATRSAAAICTAASDRRSYALKDPSGKNAMSQPTSRTAEIRKVAKTG
DGKAEQLLAELLRDLFRIEARNVAINHDQYSLNSLNGFFETEDGAFFFKF
HQEEGEEAMSGEYYRADILARAGLPVDQPLLMSVIPGEQILVYRRRTDPR
FSDVLRALDLKDDAAARGRAVEAERRLSEAVLKVYLATLHTVGAREVAAE
PIHRLFYERLVDRSAASYPGGRMASFYVGKDFAFPNLTLDWETLSRCRFV
VNGIEYSDSIGALFDAAHERLNPARLADAGGVVAHGDAHNANVWYEERDG
GAHLSFFDPAFAGENIPALLAEVKTTFHNILAHPLWLYDPAMAAGRYKAS
AVLDGALLRVTTDFAPSPVRRALLDVKAEALWRPLLSELKARGMLPADWR
RVIRLGLFLCPTLVMNLRAGATTHNAISSLIGFSVAVMAGSEPVAGDDLI
SRFIDTIDPDNG
>SMa1495 putative serine-pyruvate transaminase
MNAFSPPPRLLMGPGPSNVSPEVLAAQARPTIGHLDPSFVGLMDRIKDQL
RLAFRTDNRVTFPLSAPASLAMEMALVTLLEPGDTAIIAQNGVFGGRMAE
IAQRAGAEVRLVSVEWGKPVDPEAVRASILEAPQAKLLAFVHAETSTGVR
SDAASLCALAREAGLLSVVDTVTGLGGIPVSVDEWQADAVYAGTQKCLSA
PPGLAPITFSDRAVSAVKARKTPIQSWFLDLGLMLGYWEGEGARSYHHTA
PVNALYGLHESLSRLLGEGLETAWARHRAAHDRLVERLQGLGIAFVVDKE
HRLPQLNTVWLPEGVKDVPERRRLLDEFGIEIGGGLGPLAGRIWRIGLMG
ETCRIENVDRLAEAIAAVLP
>SMa1734 conserved hypothetical protein
MGRENILFIVDSDCHNYWCSATVLEPYMDGFFKDMFVRGEKTGPRGAFPH
GHRPWFHPEGFSRHDVNPVEEDDNYAIMKEKHLDKYNIDVAILTGDEPIE
ASTLANAHYANALCRAYNDYMIDYWLPKDSRFWGSIIVAPQDPKLAAEEI
RRLGSHPRIVQVLVSHGAQRPYGDPFYHPIYEACAEMGLPFAMHLGGQGG
VNSTPIGAGPSTFFWETHAILPQSAMTHMASLIAQGVFEKWPSLKVVIIE
CGVAWVPSVLWRLDANYKALRKETPWLKRLPSEYFKTNIRMSTQPLEQPE
NVQHLWATLEAMDGENTLLFASDYPHWDYDDVTKLHIPPAWREKVLGLNA
LDVYRRIPRPAAIAAE
>SMa1291 Conserved hypothetical protein
MELICPAGTPAAFREAVDAGADAVYCGFRDETNARNFPGLNFSRAELGEA
IAYARRKGTQTFVALNTFMRAGHESLWYQAAADAVRLGADALILADFGLM
AHVAETYPEQRLHVSVQASASNPDAVNFLVGAFGARRVVLPRTLTISDIA
RLARQIRCEIEVFVFGGLCVMAEGRCSLSSYATGKSPNMNGVCSPASHVR
YRQDRGDLVSELGAYTINRFPRDEAAGYPTLCKGRFDIADARGYAFEDPV
SLDVMDQIDALREAGVSALKIEGRQRGKAYVAEVVSTLRKAVAASPEERR
TLLARLRLLSEGQRTTSGAYEKRWR
>SMa0056 putative dehydratase
MKRPRITDIRATTVTVPLEAPLRHSNGAHWGRFVRTIVEVETDVGIVGLG
EMGGGGESAEAAFRALKPYLLGHDTFELENLRFMICNPTASLYNNRTQMH
AAIEFACLDIMGKFLGVPVCDLLGGKMRDAVPFASYMFFRLPNKDTGEGE
TRTADQLIEQTLALKKKCGFTSHKLKSGVFPPDYELEVFRAWAKALGPDS
VRYDPNAAFSVEEAIRFAKGIEDLNNDYYEDPTWGLNGMRRVRENTTMPL
ATNTVVVNFEQLATNILNPAVDVILLDTTFWGGSALREGGGRLRDLPTRH
CGTFVGRTRHPARHHASPRRGSPEPRLPRGCALSPTHGRYHRRRPDALRE
RHYQGADGAGSRGGARSRQARAVRRPP
>SMa1838 Putative
MSGDGRGARRPLHSVRRRAMAFLFNSDAKRGAIFAETFARELPDIPFAVD
PAAVDPDAVRYLITWTVPDNLARYRSLEILFSIGAGVDQFRIDAVPPHVR
VVRMVEDGIVRMMQEYVTLAVLAHHRNLPAYLEQQQGENWQAIAPVQAVE
RRVGVLGLGMLGTAVLDRLKPFGFPLSGWSRSPHEIEGVRCLSGRNGLDT
LLGSTDILVCLLPLTDETRGFLNAQLFARLPAGAALVHVGRGPQLDHDAL
VEGLDKGHLSGAMVDVTDPEPLPSGHRFWTHPKILLTPHIASVTQPETAA
RAVIENIKRHRQGLEPIGLVDRRRGY
>SMa1521 conserved hypothetical protein
MHICQILGSKSPEIFSVTPDQTMVEVLRLFRDKNIGFVVVGRSPGECLGT
LSERDCCYAVAEYGTEAPLMRVGEIMNRTVATCSTEDFLPFVMSIMTERR
TRHVLVMDGNDAVGVVSIGDVVKHRLEEALQAERDMHDYICGANYR
>SMa1480 Probable LysR-type activator
MPKKNHVIARLRLASAGAGRLASHKRELQMKPPPPLNYIRSFECSARHLS
FTEAASELGYTQAAISNHVRALEQYLGRQLFIRYPRSLKLTEMGEAFLPT
LRQALNQIDFATEAVLASARNKTVVISCPTSLAENWLARCVAGFSRKHPD
VEILLHGTIWEDSTEQIADILISIRRFDNMPASGMQLWNDRLVLLCAPQL
VHGQNAIRTPADVLKSNWIAVHGRQEFWQEMADALGIDASANDKRLSTNS
TNIALEMAASGAGCIVTTQSLARTYVDRGLLVEPFQIRTRSAWNYYLAEG
QASKGTTVSKVRDWIIAEAQKVIG
>SMa1726 putative transcriptional regulator
MNSALDLSITIQEHVMALTKISDIRRRELRRAAVEVMKREGAAGTTLEKV
AQEAGASKGIVLHYFRNKQELFEEAMREANAGLRDDVVRRLKKARSPIER
IWAIVEANLGEDVFKPPHGHAWLSLCAEAPREPQLARLQSIFHARMHSNL
MSGFRSLVPHSQAERLSLSMSALIDGLWVRLGIGDRTMTSSTATSLARDL
LTNALPTIAIPDVTKCEQ
>SMa1989 Hypothetical protein
MKARNDAVFFKLNAGTVPLNDVRSGSHKQILYSGPFNRSRGRVLEYGREG
LSLLAVHASKYRDAC
>SMa2165 probable short chain
MTRFTGKNVLITGGTSGIGLAGGRRIIAEGGMVILTGMNEDRLEATRKEF
GDKAVVVRNDAADPAASADLSEIVKSAGGIDGLWLNAAFAALGPPEEIHA
WDFDRMMATNVRGPMLQLAKLSPLLRPGTSIVLTSSSSTYEGASATSLYA
ATKGAVLAMSRSWASAMAPRGIRVNVLVPGPIESNLRSFLPDEARHGFER
FVLNQVPLGRVGTADEAAAVALFLLSDDSSYVTGSQYAVDGGLIMH
>SMa0604 conserved hypothetical protein
MRESDMKQQNVGANLVSGTRRRAGGLAVAAMAVALSNAPSRAQSVINPDA
DSVLRAMTDQLQALQEFSVEYDTDHEVVQLDGEKIQYSASGRIAMSRSAG
FRMTRQGPYTDTEISFDGKVVSLYGKRLNVYARIDSPGPSIDEAVAEIQA
ATGFDAAGADFLSADPYAALAEGVLSGSLVGTAFVGGMLCDHLAFRNDDV
DWELWISKGEQNLPLKYVITTKWVTGSPQYTLRFRNWATGGVSSKSFEFK
PPTDARKVDVVHTDVVGDLLLEAQQ
>SMa1853 Hypothetical protein
MKDIPMQTNEIELTAFGPEYLEAAIRLSRQAGWPHRLEDWQMAFALSEGI
VAVEDGRVVGTVLVTPYKRDCATINMVIVDEAVRGRGLGRKLMDAAFRIA
GDRPLRLVATAEGLPLYDKLGFGESDAVLQHQGVVGEIAAPAEPEAASTA
DVEAIAKLDRLAFGADRGALIAYLAKVGEFAVLRRDGRVTGFAALRAFGR
GEVIGPVVAADLDNAKALVAHFIAARPGRFLRVDTTAGTGLSVWLAEQGL
AHVGGGIAMMKPPIRRAADPIANTFALANQALG
>SMa1465 Putative ABC transporter permease
MSALVSPAAPQQRRRIETAPLIVLIVLAGTVALLWWSGMAEEILAYSDDI
SYLTVQHLELVAWAGGLAILVAVPVGIVLSRPAFRLVSEAVMQVFNIGST
VPTLAILALSMTLLGIGTVPAVFGLWAATLLPIVRNTYAGLRAVPPHLVE
AATGMGMTPRQVLWRVELPNALFVIFGGIRTALAICVGSAPLVFLIGAGG
LGELIFTGISLDELPMMMAGAIPTAMLAVLVDFIVGQIQYYLVPRGINPL
R
>SMa0222 putative GntR-family transcriptional regulator
MVVQIGCQRMTAAKDMNLESLKIDTGETAAAQVERDLRESIIRLELAPGM
RLSEQEIATRMGVSRQPVREALIALGKSKLVDIRPNRGTVVVRISARQMM
EARFVREAIEVAVARRASETFDSWTRRKIDTILARQKAANEAHDHNAFRR
EDEQFHIAIAEGAGCGLAWNAVSDIKAHMDRVCNLQLRHPDSMKKLIAEH
EAIITAIDARDADAAAAAMRSHLNGILADLPQIEADNPDLFE
>SMa1638 hypothetical protein
MLFYAYCAPATHSPLQHSAQTKMVAVDSHSHEHGDHSHDDLDFLGNTSTA
PDHHHADHTHEKTELVAAGAFATPLPSTRSFPRRPLD
>SMa1325 Hypothetical protein
MVREDILQRFGLLAFRLKRTNYLFGDTFGVADRYPFILTGGAQELGFPLS
ACYRDYVARIEARPAVREAERREALSEASSSQL
>SMa0232 hypothetical protein
MPNRSRPAQGVPVSSARAKADCDGRGRSCPILAREGRSMTRRKLERCAMS
REAFCQLTAGDLNILMSMLDETGHTESFTILLREKLNYASVFFREDIPEN
VVTLDTQVGYTVNGVRTGPHLLVRNAAGRPANSAISVRTMRGLALLGLAV
GERTEILGEDGWPETLAVERIVFQPESEARLKQTSTEPVQLIDHAPQVVN
FRPRPRKAAIPHDDDPGPSAA
>SMa1825 Hypothetical protein
MSQRTGTEEGVGAHFGLPYAPCLPARPVRDAGFSVTRLEWRLNGEANRLV
SLPPDSAYFLMLYLKDAYHCDVAPDGTESETLRFRQGSVCLVDLAHGACI
RLFSDLDSLAFQLPRELIREVSEFSAAPRATTLRCRRGEDDDVLRNLGAA
LLPLFERQGNFHTAVLQHIAIAICAHLLHAYGDHGGQGGPRSTQFTVWQE
RAAKNFMIDHFADQFPMAAAASAAGVSIRRFIESFKRVTGQTPKQWLLGY
RTARAKQYLGERSLTLAEIATGCGFTDEDHFTKVFRRVAGTTPAAWRARW
LH
>SMa1990 Conserved hypothetical protein
MNTMTYNGYHARIEFDAEDEVFFGKIAGISDVIGFHGDSVAELKKAFHEA
VDDYLETCRKIGKEPQRPYSGKMMFRVAPEVHRRAALAAELSGKSLNQWA
EEVLEEAADHFAEARLSA
>SMa0229 hypothetical protein
MSSIMVSSNLPLPFIGATDYRLLSRLAYQALSRDFDIAAELIEKLERPCV
LPDDQVPPDVVKIGSIVTCEVEDGPCRTFSLVYPDDVDAEKGRISVLTPV
GVALLGLRPGQAVEWFSRDGQRNHLIVVRVEDDVKDELAL
>SMa1122 Conserved hypothetical protein
MSSLDRKLIRELWRLKAQVLAIALVIASGTALLIMALTTIEALEETTAAY
YERTRFADVFAQAKRAPEHLGRDIANIPGVRLAETRIVEGAIVDMPGFAE
PVVAQLLSLPERGPELLNALVIRSGRLTDPTRPNEVVVSEPFAEAHDLKP
GDSFSAVLRGRKRTLQVVGTALSPEFVYAIAPGGLMPDDERFGVLWMGHD
ALAAAFDLKASFNSVTLDLLPGADEKDVIRRLDGLLAPFGGIGAYGRADQ
TSNWFLESEIAQQKNMSRIMPTIFLAVAAFLNNMVIARLIETERHEIGLL
KAFGYSNLAIGWHYAKMVLTIGTIGVLIGSLLGAWLGHWNTELYTKFYRF
PFLLYRPGPAGFVIAGAISLGAALAGSLAAVRRAVRLPPAEAMFPPSPPI
YRRSWASRTALAGALDEPSRMILRRIIRWPVRAFLASLGLAMSIAVLIMA
LQWVDAIDALAETAFERGQHQDATIAFNDLLPIHTAGDYEHLPGVLAAEP
YRHASARISHGHLVERQGIIGVPSGAILSPVFDVERGRIEVPPGGLVLSR
KLAELLHVSAGDTVGVELLEGRQARLSLPVAQVFETYLGTPAYMDMEALN
RISGDGRVVSGLHILVDAPSRTRLLAKLKEIPNVAAVLFRQAAIDTFYKT
MGETIFIFVGFFVAFSMTLSVGVTYNSIRIALSERARELATLRVLGFSRW
EISYILLGEVGILTWIAIPFGATVGYGLAWYMTSAFETELYRVPLVLRDA
TYGKAALIALAATLVCAALVRRRLDRLDLIAVLKTRE
>SMa2229 hypothetical protein
MTLRTSTVPDLWSFMLKLYAMPGVAQACLELQDRFELDVPLFLALLHGAG
RGYRIDSETIRALDKACGKWRAEVVRPLRAVRVQLKANPWKENHEPVVAF
REKIKALELEAEKLEVSVLEKAIIALARTENHVDKPARIAAVAHMVLSHF
AADNLTGELPQASLIVDAVRSLLRR
>SMa0370 hypothetical protein
MVTDQATVAVTWADQNRTDRRVFNFGCKDPENGPTAQVLSDAPDILPVGK
FIEDRRLEDGAWDRHQFTDQ
>SMa1773 Hypothetical protein
MPLAGSERTEGQEGDEQKALAHTIASGSREPAMMPAIIRRTISGVATSTT
RATAIHTAAGH
>SMa1884 Probable cation efflux protein
MSLGRAEDPSFTIKTLTVTTVWPGATAREMQDLVAEPLEKRIQELTWYDR
VETTTRPGYAFLTVTLKDSTPPTAVEEEFYQARKKLGDEARNLPSGVFGP
FVNDEYSDVSFALYALKAKGMPMRELVRQAEVIRQDLLHVPGVKKINILG
ERPEQIFVEFSYAKLATLGISAQDIAAALQRQNTVTPAGSIDTRGPQVFI
RFDGAYNSVQAIAATPIVAAGRTLKLSDFAEVRRGYEDPATYIIRHEGEP
AIMLGAVMQQGWNGLELGKALEERSAAIARTLPLGMTLAKVSDQAVNIDA
AVGEFMLKFAMALGVVLLVSLLSLGWRVGIVVALAVPLTLAVVFLIMLET
GRFFDRITLGALILALGLLVDDAIIAIEVMVVKMEEGMDRIKAAAYAWSH
TAAPMLSGTLVTIIGLMPVGFARSTAGEYAGNIFWVVGFALIVSWVVAVI
FTPYLGVKMLPDIKPVEGGHHAIYDTPNYRRLRGIIEFAVRHKYVTCAVV
GIVMALSVVGMGGVKHQFFPTSDRPEVLVEVRMPEGASIETTIATVEKLE
RWLQEQPEADILTSYIGQGAPRFFFAMAPELPDPAFAKIVVLTPDSHARE
ALKLRLRAAVSDGLVPEGYVRVTQLVFGPYTPFPVEFRIMGPDPAQLYQI
SEKALEIMKGVPDVRQANRDWGNRTPVLRFVPDQDRLNLIGLSPAEAAQQ
MQLLLSGIPVTQVRENIRNVPVVARSAGESRLDPARLADFSLMSRDGRQV
PLDQIGHSEIRFEEPILKRRDRTPVITIRSDINEATQPPEVSQQIMTALQ
PLIASLPVGYRIEMGGNIEESLKANVALVKIFPAMIAAMLIVIILQVRSL
STMTMVMLTAPLGLAGVVPVLLLFNQPFGFNAILGLIGLAGILMRNTLIL
TEQIKENKAAGLHDYHAVIEATVQRTRPVILTALAAVLAFVPLTHSVFWG
SMAYTLIGGTAVGTVMILLFLPALYAAWFRIKPTADDTHEEPTEGPELRI
AMAAE
>SMa0789 hypothetical protein
MEGIRQSYAAKDMPAVLNHTSAFYQTLFTKVDRHVAWGVVSLLTVRINHL
RSMTIKTRNRDVEGPAQMEKIVEAIRKGDGEAAYKAALDHVARASVIAEA
VLSAQQTGD
>SMa0210 hypothetical protein
MHSWNHFSATAIAHSPRRWIQGVTMDVQKAGMARLLLAAPDVRDSAWMIH
DLTFLKLCEVYEHACLRRDVLRCAASIDDAALLKSEEECKSLEAAAIAYI
RERQKFSGLGSH
>SMa1548 conserved hypothetical protein
MDGIAEPVIIKDDASRFVFINDAACDLLGKDRNQLIGRTDHDILPGEQAD
RIVSLDRIVLSTSEGHELEEQITTPDGTQRTLLTKKRCVSIPAGSTQEKF
VVVTIVDITNLRRTEETLRASEEHYRSLVDLHPQVPWTADTAGEVLEVGP
RWSELTGLGEKETLGSGWAKAVHPQDAGALQEEWRRSLASGAPFDCEYRL
LTKTGDYRWFRARAAAKRDADGKIVRWYGVLEDIDERRRAADDLRESEAR
FRAIADDAPVMIWVADPTGDTSFFNRLWLETTGQTEAEALGFGWVDVIHP
DDRQAVQETFFRATAGKEPVRSEYRLRRADGSWAWVIDVGQPRFSADGTF
LGYVGSVLDITERRAAEIAQQEAQAFIRSIFDSSPDCVRVLDMEGRPLLM
NEAGRRIFGLNEGAPVTGQTWDSIGRASDADKVEAAWESVRRGKTARFEI
SVRDAGGEERCMDVISAPITDHHGKPFRILSIWRDITDAKRASDEISRAQ
RHAEAAADQLSSVLESTMDSVMLLDADWRVRYLNENARKLLQVGDEALGR
VFWKLFPEEEEGSFAKHCREVMDRRVRSFFEDHLSSLGRWVEANASPTRD
GISIFCRDITERRRAEEDTLLAQKQMAHMARHDMLTGLANRMFFRECFEE
ALNESNHARMAVLCLDLDGFKAVNDTLGHPAGDALLRQVSTRLIQAVRTT
ETVARLGGDEFAIIQPLTESRDEAFRLAQRLIDTLSEPFSIEGAAANVGT
SIGIAFAPEDGTSADELIKAADIALYSAKSSGRGTYKLFDVAMHAQLQAH
QQMKITMRDALAKGEFELHYQPLVSLESRCVSCCEALLRWRHPERGMIAP
SEFIPIAEETGLIVPIGEWILGEACRQAARWPERVSVAVNLSPVQFKHRN
LVRAVAKALSATRLDPARLQLEITETVLLDESEHNLELLQDLRRLGVKIA
MDDFGTGYSSLGYLRSFPFDKIKVDQAFVRDLPHGKESLAIVKAVAGLGQ
SLGMTTSVEGVETEDQLAVVDSEGFNEVQGYLFSRPLPAAEISKLIAAGS
L
>SMa0783 conserved hypothetical protein
MSESGKDVYEALRADVICGTACARRMGAIVFHGLWRGLAVLIAPHQSAVA
RQQSAPRFKTTSMVAHDRQLVHMLANMVLAAETGGSHVY
>SMa0110 putative ABC transporter ATP-binding protein
MTDPLLEVDDLHVRFSVSGGGLLGTGRRMLHAVNGIGFSLSKGECLSIVG
ESGCGKSTTALSVLGLQEPTEGTIRYRGQPLTGPGAPGRMQRAKAVQMVF
QDPYASLNPRQSVRTSLAAPLRLHGITAASEIADRIEVMLANVGLTPEQA
NRYPHEFSGGQRQRIGIARALILEPEIVVLDEPVSALDVSIRAQIINLLL
DLQEKLGLGYLMISHDLSVVEHMSDRVLVMFFGQVVEEGGWRDIFERPAH
PYTRRLIAAIPDPDAALNPGTKDHFADVPLPDGRSFAVDGSTAPDVFSAP
PPSELVEIAPGHRMRLVPAV
>SMa0220 putative aldehyde
MLSNFIAPDSNDPRLRIKSRYQMLVDGKSVDAASGSTIDRVSPGHAGEVV
GTWPEASADDVRKAVAAARKAFDAGPWPRMSGAERSRLMFKVADLILARQ
EELALIESLEVGKPIAQARGEIGFCADLWSYAAGQARALEGQTHNNIGDD
RLGLVLREPVGVVGIITPWNFPFIIASERVPWAIGSGCTVVLKPSEFTSG
TSIRLAELAREAGIPDGVFNVVTGYGDPAGQVLAEDPNVDMVAFTGSVRV
GTKLGEIAARTVKRVGLELGGKGPQIVFADADLDAAADGIAYGVYHNAGQ
CCISGSRLLVQEGIRDALMERLLDISRKVAFGDPLNERTKIGAMISEAHA
EKVHSYVTAGITSGAELLLGGERIGREAGLYYAPTVFAGVTPDMSIAREE
IFGPVLSTLTFKTADEAVALANATEFGLSASVWSTNLETALQTIRRIRAG
RCWINSVIDGTPELPIGGYKKSGLGRELGRYGFDEYSQFKGVHVTLGRPA
PWFT
>SMa0216 putative ABC transporter, ATP-binding protein
MMSTPNLLELHNISKSFGALTALRNLSFHIGEGEVVGLLGDNGAGKSTTV
NLISGIHKPTDGYLSVDGKKTTFSCRSDSADAGIETIYQHTALVDSLSIT
RNIFMGRELTDRFGFLRQREMRDIAMEVLQNAVHISGIDSPDTLVGNLSG
GQKQAVAIARAVYFKKRVLLLDEPTSALSVRETEALLNQVLKLKAENVSS
VLVTHNLYHAYQVCDRFVIMSHGTKVFDVQKADTTISQLTEYVVLT
>SMa1124 Hypothetical protein
MGVVKRRLAIWGSLLALLAAGIAYALRPQPIQVDLAVAEIGLLRVTLDEE
GETRVRDVYTLHAPLRGQLQRITAEVGDVVKAGETQLAQIEPAPPAFLDV
RTEAELQAAVEAARAAHNLAAAELNKAKADLTFAEGERARARQLIERRTI
SQRTLEDAERSYHVAQANLATAEAALKVREHELHQARSRLLSRQEIRSLR
EDCECMPVTAPVSGVVLQVMRRSEGVVEAGTPLLDIGDPTDLEIVVNFLS
EDAVRIRPGQRAIITDWGGEDLNAVVRRIEPFGQTQVSALGIEEQRVDVI
LDFADPTESWRSLGHGYRVDVQVILFEGEVLKLPLGALFRQGEEWAVFVA
AEGRARLRPVAVSQRNSLAVEIREGLVPGERVILYPSDRIKDGAAIVER
>SMa0824 hypothetical protein
MLLDDDIVDVLKKTGVDIARRTVAKYRGAMNIHPLSKAAARSVHCRGPPD
SEGCRQPASTLEQAGPVLLERDCDISRPVFWAALFCSPAAGRHSRSQQLS
ICGRNSVVPDREEGPKRSRYPEQPVQRHRLSSGHSACRRPVTRASTHGGN
SLSDF
>SMa0333 hypothetical protein
MSYSDTIAQRVETLSHSALINGVAPLAGRVLLAAIFLLSGISKISDPAGT
IGYINMVGLPFPPLSYGAPY
>SMa0353 putative LysR-type regulator
MSDPGQPTLDQLRVFIAVVETGSFAAAARKLNRATSVISYTIANLEAQLG
VTLFDRLSTKKPQLTLEGRTVLAEARSVSNGIDNLRAKVKSMLRGIEPEV
HLALDVMLPASRVMDALKAFRKEFPSVSLRLYVEALGAVTQIVLNRTATV
GISGPLDVDVLGLERIGVGFVQLVPVAAPDHPLAGGSHAPGAARSHIQLV
LTDRSPLTQGHEFAVVGTHTWRLADLGAKHMLLKEGIGWGNMPEPMVRDD
LADGRLVQLDLPDCKGGPYRLQAIYRTDTPPGPAGRFLIEHFQAQDAKTP
TTAW
>SMa1610 conserved hypothetical protein
MDNNAAERALRPIGIGRKNWLFAGADTGAETLARAMTIIETAKMNGIDPQ
AYLADVLDRIHDHKINRLDELLPWNWAPVAIICAEAA
>SMa1126 Conserved hypothetical protein
MGWSLKLGTIAGTEIRIHMTFVLLLVWIWFTHYQIGGAPAAWEGVAFILS
VFVCVVLHEFGHIAAARRFGIKTPDITLLPIGGVARLERNPSEPREELLI
AVAGPLVNVVIAALLIAVIGGVAGLEQLVRPQDPQIDFFVRLAGVNIFLV
LFNMIPAFPMDGGRVLRAILAWRWSLERATRVAATIGQGTAFVMGVAGLF
YSPLLILIAIFVYLAAESEAQSSELQAISVTVGDVMLTEFGVLQSDARLS
EAAELLLATSQNEFPVVDGEGQFAGLLTRDGIIGAMKEGGPNALVGTVMR
TDIPWVYEETALGDSLRVMQTTGAPAAAVVSRSQHPIGIMNYETIGEMLM
LRAAVHDFRFGMLRRSRAGSHG
>SMa0992 hypothetical protein
MRSNAIRSLATPEQRRHRSRSHQAQRRHAEAVRCFASGRKAAVPSRSSPA
RRKAARCWRRPTIARALEQRLGSADPRSFKEELDRLRVKDTAAIDRINEV
ASLVDRTHRAELSRTYELTRSLKKGLGLSI
>SMa1806 Hypothetical protein
MHPALERMQRGKRRNLATALAADPVNFTGSELGKFATTEMQQCDQRTSQI
ESEPPTVAQRLIPFRAHLPTNRCGATVASRLRRRAADRLSRTERRRRTRW
DARRGSRASSAWSAAPRPAYRCFIAVARSASGPCNMLLS
>SMa1882 Putative transcriptional activator
MASLTTCRSQMLHGGPFRPGADSENMRARIQEVAEEHFRRIGHHKTSVAD
IASELSMSRANICRFFPSRDAIKEFICRRVLNGTAELALTIARRSTPPSE
KLKELLTAVHHQNKTKLLHDRQMRDLIVAAMQENRPVIKVHAEQTMTILE
AIIREGIETGQFEVEDPAEAARAVKTAFIPFFHPVLIEHRLRHGEDTEAG
LPEQIRLILKALERSGYAR
>SMa1509 Probable ATP transporter, ATP-binding protein
MIPRLELRSITKCYPGTVANDAVSLSILPGEIHAVLGENGAGKSTLMKII
YGAAQADSGEIYCDGRRIEAHNPAISRSLGIEMVYQHFALFESVSVVENI
ALAVKGTFDLDRLAAEIKTLSARYGMPIDPHRRVHDLSVGERQRVEIVRC
LLQSPKLLILDEPTSVLTPQAVVKLFETLRQLASEGCSIVYISHKLDEVQ
ELCDTATVLRNGKVTGTAKPKESTSLELARMMVGSQLPQMHVSPSAPSAK
PLLEVRGLSAPARDKYGTELTDVSLEVHGGEIVGLAGVSGNGQAELIALL
SGERTHPRAETILIGGRPSGHLNAGERRKLGMAFVPEERLGRGAVPPHAL
WENAVLTAHRAGLVRNALVDRRRAGEFARHIIERFKVKANGPQASAQSLS
GGNLQKFIVGRELTLEPKILLVSQPTWGVDVGAAAFIRQTLVDLSRGGAA
VLVVSEELDELFEICDRLLVISNGRVSPPLIRKQTNREEIGLLMTRVGHG
ETRRSEVALED
>SMa0558 conserved hypothetical protein
MILDISTISYWRLIETNGEVYISLARSSGVFANQTIWSCVMGLKRLLIIA
ASIIAPLPVQQVVAQEAKGPVIRIAELEIDPAQMAAYSAAVKEEMEESMR
VEPGVLALYAVSIKGQSHHLRFFEMYADQAAYESHRESPHFRKYVETTKD
MITSRKLLETDNFQLSAKLR
>SMa1454 Putative transcription factor
MFNEKGPSHMERRMIAPCFMDDTLECLRRHGVDAGPLLAQAGLPSIVTGP
VSANQYGAFWHAVAQAMDDEFFGEGARPMRAGSFALLCHAILSTATLEHA
LRRALRFLRVVLENPHGELVVENGLAQIVLKDAGATRSAFAYRTFWIILH
GVNCWLIGRRLPIRWVDFRCSAPPAGTDYRLFFGAPVHFDQRRTRLVFDA
EYLKLPPIRDERALKHFLRHAPANILVRYRHDAGLSAAIRQRLHATAPSA
WPGFEAIAARMRIAAPTLRRRLRQEGQTYRSIKEDLRRALAMEALADGRT
NVAQLAVELGFSEPSAFHRAFRKWTGKSPAQFRRNATQTDFAESSSPRKR
TQPQES
>SMa1672 conserved hypothetical protein
MPARIVKVGFQGARQVAAERKLAVNVTHRELRNSSKWELVMSSLIAIIYP
SEDKAEEVRKRLIELQNEYLLTLGDAVIATKTDAGKVKLNQLMNLTAAGA
ASGSFWGLLVGVLFLNPLIGVALGAASGAIGGALSDVGINDNFMKELARG
LQPGNAALFVLVKEMTEDKVLKDISPFGGTILRTSLDESKEQLLRDALQK
ASAP
>SMa1678 hypothetical protein
MKMRGVEQFGDYGIMLGFAMTTRPGHQTQVRRRAKALIKDAFKKHGIHFA
SPTAQVAGNEAQSSMAAAATTRGTIAKKNAALAAQEGGEAAAE
>SMa1487 Probable c-type cytochrome
MNKVVIAALVSLAVSGHATPALSQEAAPGQKLFQQRCGACHQLETPRNGV
GPHLLGVVGRTAGSVDGFRYSAALKGSGIAWTAETLETFLSNPAAMVRGT
RMAQRFNNADERRAIIAFLRAQ
>SMa1735 putative oxidoreductase
MLMEASFPFSLSAANQQPCYRGGPATLVCKAVIDETHDSKTFVFEDSQSR
SFDFKPGQYISFKFEIEGKLCPRAYSICSTPTRPHNVQITVKRVPGGLVS
NWLNDHMRPRMSVEIADIAGRFNYFDIPSRKPLLLSGGSGVTPVMSMLQY
ITDVVDQVDVEFVHFARTPKDIIFRDQLEFIARRFSNIKVHMVVGETGEE
TCFRGRMGTISASLMQSLVPDLPQREIFMCGPEGFMKAARAMAAEVPIRA
VYEESFGERIPIEEPDKLGGEVYFSLSGKHGTCAPGETILEAALNSGIWI
ESSCHQGVCGSCKVKLTQGMVDMQDLGGLPACERSEGFVLACCSRPMGSV
SIDA
>SMa2259 hypothetical protein
MSFSQELYRDFAILIGQFLKEEFQVGGIVEISASDRDLIAVLRRYFAAQA
ELESLKAQLEAARQAAGEAIGVFYDPRQNAEHAAELQRSHRLREEMASLM
QRAEAWGRAAFGADEHDRSAAEAEPEEWGSFENQADALFGA
>SMa1515 hypothetical protein
MLPCSRPFSYNHFQRQRSPAMSEDAFNMSIRKFLKEVGVTSQREIEETVR
KGQIDGNKLKVRMTLTAEGTDLNHVVAGEIELP
>SMa0150 probable long chain fatty acid CoA ligase
MSNHLFDAIRRAARPDSAFILTADGRVWTYGDMLEHSGRIASVLDALGVR
PGDRVAVQVEKSPEALMLYLACLRTGAVYLPLNTAYTLAELDYFFGDAEP
RLIVCAPGAKEGIAKHAADCGAEVETLDEKGGGSLIDLARGKAPDFPDAD
RGPDDLAAILYTSGTTGRSKGAMLTHDNLLSNATTLREYWRFTADDRLIH
ALPIFHTHGLFVASNVILLAGASMFFLPKFDANEVLRLMPQSTSMMGVPT
FYVRLVQNPGLTHEATAGMRLFVSGSAPLLAETHRTFAQMTGHAILERYG
MTETNMNTSNPYDGERIAGTVGFPLPGVSLRVADPESGRPLPKGETGMIE
VKGPNVFKGYWRMPEKTQGEFRADGFFITGDLGRIDERGYVHIVGRGKDL
VISGGYNIYPKEVETEIDQMPGVVETAVIGLPHPDFGEGVTAVVVRKPGA
AIDERAILDGLEGRLARYKQPKRVIFVDDLPRNTMGKVQKNVLRETYARL
YAGAEARV
>SMa1709 hypothetical protein
MRSRSLPRFCNYLYRGRTPHNSWHRSSAAAPERHRLPLYLNACVKLIDYR
TVLDGKGDMSAIAHGRWFSINRNLNAESETCRFHSLVHLVLCHRHKADDG
QKRHRRSASNAQHHSSLPRHGGSQLPALQLCP
>SMa2247 hypothetical protein
MRPGAFASSIAASHRASTASMAANVGYSFPSPGRFDIGPVAVVRYGDVDT
V
>SMa0083 putative ABC transporter ATP-binding protein
MNTEREYASPPDSREALDPAVRMEGVNKWYDAFHALKNIDLTVGRGERIV
ICGPSGSGKSTLIRCINQLETIHSGRIVVDGHDLTAGGRNVDLVRQETGM
VFQQFNLFPHMTVLENCTLAPMKVRGLAKAEAEETAMKYLKRVRIPEQAV
KYPAQLSGGQQQRVAIARALCMNPKIMLFDEPTSALDPEMVKEVLDTMVD
LANEGMTMLCVTHEMGFARSVADRVVFMDRGEVLEIAPPDAFFGAPQHER
TRFFLGQIS
>SMa0254 hypothetical protein
METSLYLPVKGFLEKAGYVVKGEVDGCDLVGLSDDDPPVVVICELKLRFN
LELILQAVDRAAVADEVWIAARVSAKGKGREADKRYRDLCRRLGVGMLGI
SDAGDVSVIVGSVTPMPRTNPKRRSKLMREHRRRRGDPAIGGSTRAPVMT
AYRQQALGCALALTSGPLRVREIRSSVPDAGKILLANVYGWFERLDRGVY
GLTAAGREALQRWPQQDMQAKTAVPA
>SMa0667 hypothetical protein
MEDSLHPAAVAHLPPFITAPGQTDVLFNVMIVFVLLMVFVVGILYLRLHA
LPEHMAHGASKVQLQIVGVLALIALFTHNHLFWIAALLLAMVEFPNFSSP
VESIARSLAKMADRHDGTDEPGTSAQPAAMPPRANESHASPPQWVPLEIE
PVAGDNKAERRG
>SMa0715 putative UDP-glucose 4-epimerase
MKVLVTGSAGRVGAFVVRRLIAGGHQVRGFDLRSAGIEDGGFDEVIGAFD
DREAAIRACEGTDAVLHLGAFMSWLASDRDKLFRANVEGTRIVLEAAAAA
KVGRFVFASSGEVYPENKPEFQPITEDHPKKPLSPYGLTKLLGEELVTFQ
GRVSSMETVILRFSHTQNASELLDPESFFSGPRFFLRPKIEQQENFGNKA
AADLLRAADPGRPALVLTRNEEGRPFRMHITDTRDMAQGVLLALTHQKAA
GGIFNLGATEPVDFAQILPVMAQMTGLPLLTVDLPGAGVWYHTSNQRIRE
TLGFEPDWPIMRMLDEAVAEWTIRQA
>SMa1060 hypothetical protein
MDVSGCGGSKMKRRILLQGTTALMVLSALPAFAQDDIRAVAREAYIYTYP
MVKNYLTMYQYALDPGGSQYKGPLNTLVSIARVYTPEDTAIITPNSDTPY
SFIVFDLRAEPVVVTMPPIEKDRYYSLQLIDLYTNNVDYPGTRVDGNGGG
DFLITGPGWKGDVPKGIKRVIEMPTTLALGIIRTQLFSPDDLEKVKQIQA
RYKAAVLSAYAGAPAPAAPPSIDWLLISDELMVTDYWSIAAFLLQFAPPY
SGDEAQRENLAKLGIRDRGVWPGTDLSPETVALMKEVAVATEKEIRDEAA
RLTDSSKIFGTPEFMKGRFMVRAAAAQGGIYGNSVQEALYVIYAFDAQKA
PLDGKTGRYKLTFTPRTLPPVDAFWSLTMYDRQNQFLVDNPIDRYLINSP
MLAGLKKSNKGEIVLYLQRESPGAELESNWLPAPSEIFYVVMRLYLPRAE
ALDGRWAPPPIEALS
>SMa0900 possible anti-restriction protein
MERGLSGLNPVASLESTSMSTSVIAPQSATIVPEASRPEFLPTLFGRSLL
IVAENAVYSLMERLSPLDYGGGFWTFYEHEGKPLFLAPQSKSRFRITGEI
TGFQGEVPAEAAGIIATLFAFSHLSFQYQSEHLSEGYGRLYAYSADHPEA
VEIFQAID
>SMa2119 hypothetical protein
MNMKESPMAKTRCLDPVVLNLWHPLGALIELPVDTVVDTVLLEERLSLAV
GLDGAVAVWQSCPDFAAGDKIDVAAVSKSLPAKVAYGYLWASLGSPPDEL
FHIPEYDEADRRRLNAATFGVNVSAPRAIENFLDMGHFPYVHTDILGVEP
HTEVKEYDVDISVERDEILATRCRFFQPLASSASETGAEVEYIYRVPHPY
CSVLYKSSPVDDARYDVIAVFMQPLSQESVRAHMMLCILDEDNEDKVIKR
FQQTIFGQDKPILENQFPKRLPLDPRAETPIRADKSAIAYRRWLSQKDVR
YGVIPASN
>SMa0684 probable transport protein
MSAPSSGKSLGLAACTAIVVGNMVGSGFYLSPAAVAPYGNLAIVIWIVMG
AGAICLGLTFARLAKLSPAVGGPYAYTRLAYGDFPGFLIAWGYWISIWAS
LPVIAVAFAGVVIDFFPFLRGRGTATLLTLSVIWLVVLVNLRGVHAAGLF
SEITTYAKMIPFGAVALLGLFYIDFSHFADFNPSGQPLLQASAALAPLTM
FAYLGLESATVPAGDVRDAERTIPRSTVLGIAIAVTLYVLGTIVVMGLVP
REELVHSVAPFSEAARRMWGPAGELAISLAVVLSSIGALNGWTLLMGQVP
MAAARDGLFPPLFSRLSARHVPATGIVVSATLATILVLVQAAGSEGFSSI
YRLFVGLSTMTAVIPYAFCALASSLVSARVSGGTLIPRVTLIELVGFTFA
IFTLYGCGAEPVLYGLVLLLLSIPVYIWQRRRSFVPGDFGQ
>SMa0384 TRm3 transposase
MAIEKELLDQLLAGRDPSEVFGKDGLLDDLKKALSERILNAELDDHLDVE
RLEGGPANRRNGSSKKTVLTGTSKMTLTIPRDRAGTFDPKLIARYQRRFP
DFDDKIISMYARGMTVREIQGHLEELYGIDVSPDLISAVTDTVLEAVGEW
QNRPLELCYPLVFFDAIRVKIRDEGFVRNKAVYVALAVLADGSKEILGLW
IEQTEGAKFWLRVMNELKNRGCQDILIAVVDGLKGFPEAITAVFPQTIVQ
TCIVHLIRHSLEFVSYKDRRTVVPALRAIYRARDAEAGLKALEAFEEGYW
GQKYPAIAQSWRRNWEHVVPFFAFPEGVRRIIYTTNAIEALNSKLRRAVR
SRGHFPGDEAAMKLLYLVLNNAAEQWKRAPREWVEAKTQFAVIFGERFFN
>SMa0496 conserved hypothetical protein
MKKKDQELGKGIGPELSPFVPVCDAIANLFAPFAEVVLHDMASASVVYIA
GNFSKREFGDPSNLEEIDFKPADVLIGPYEKTNWDGRRIKSISSVLRTAS
GKEVGVLCINVDVSVFENILLTLQTFVSLPATAGKLDSLFRDDWFERINS
YIRHWTTSRGLNISDLSRAQKKELVQALADDGAFSGKNAAGYICRLLGMG
RATVYNYLNGDAAKVAGGNSKQ
>SMa1355 Conserved hypothetical protein
MPSRFRFDVLAKAGYTARGVVFILVAGLALFSGVTGGQPEMKSALLTLLG
QPFGRVWVGLIGLGLLGFIAWRLAQSLADSDGHGSGRKGLTVRSALFGSA
AIYLGLAVYALGHALFFAGGDQESGEKGLAEWIMSKPFGSYLAIAVGIGF
IVGGVVTAYKGLTRKFERYLRIPDRNRVLTLICIYGLVARGAVFVIIGIL
FAYAGFRVDPEQAGSISDALEWLRQLPFGSILYIAVAAGIAAFGIYNLVE
ARYRVVRGPDLTAVKQPISRRKSDPTQPARPN
>SMa0355 putative LysR-type regulator
MIRERWSHLIISLLQFRIRAIKSGPGRVKRSCDSGVCMKGPGTPTFDQLK
VFLTVVDVGSFAGAARKLNRATSVVSYTIANLEAQLGFALFDRDSTRRPC
LTEAGRIVLSETRSVANGVNRLRAKAQGLLQGLEPSLSIALDAMLPASRV
LDALKGFRSEFPTIPLQIRTEQFRALSMLVRQGDIVIRHEYGRRCNGGPL
A
>SMa0293 hypothetical protein
MVLAVTSGKKAVEHLKAAESSIAGIVTDIRFAESPSGWDVTRIAREIDPE
MPVVYISGDSAPDWASQGVPKSIMIEKPFVMSQMIVAISQLLNDRRAGAA
ALE
>SMa0273 putative ABC transporter, periplasmic solute-binding protein
MKSIIRKAGLLLSTATLVLAQPLGTAQAQEKRIVAVSIPAATHGWTGGVV
YHAEQAEKEVEAAFPNVDVVLSTASSATAQVSALEDLSATRKLDALVILP
FTSEELTGPVEQIKKNGTFISVVDRGLTDPTIQDLYVAGDNIAVGANTAR
WLSDKLGGQGEIVVLRGIPTVIDDERIKGFSDVIDKTNIKILDIQYANWN
QDEAFKLMQDYLAKYPKIDAVWANDDDMLLGVIEAVDRAGRKDIKYALGG
NGMKQVIEMVKEGNERTPVSTPYPPSMIKSAIYMTAAQFAGQAPMRGSFL
LGAPLITPENADQFYFPDSPF
>SMa0244 hypothetical protein
MTSAGTIEDAKGPAELRLALRQMLGEDIVLSETEEMLRFCRDWHGDVTTG
TVAVIRPRSTQQVAAAVKACRELGLSIVPQGGNTGLVLGAIPDAPERQVV
LSLSRMNRIRKIDPADFSAVVESGCILSELKDAIAKMGMFFPLALGAQGS
CQIGGNVSTNAGGVNVLRYGMTRELVLGLEVVLPDGSILEGLSTLRKDNR
GIDLKQLFIGAEGTLGIITAVSITLTPYPDHVATALLGLASLEDAIRLYR
RARRDCCDLMSAFEFMPPLAFTLAQEAMPDLPIPISAEYPAYVLMEISGS
GLVDVDDLMQRFLEGAMEEGLVLDGTIAASQTQARNLWLIREGMNEGQAK
RGTHMRTDISVPLSQLASFVEEAEKAVSEALPGAVSVSYGHVGDGNVHLN
VLPPAGSTPEERIQLIYKAKTVVNEVLDRYTGSISAEHGIGRLKRPDFDA
RLPATRRKLLTALKHAVDPEMIMNPGCQLRF
>SMa1595 hypothetical protein
MISLTANCTGTLARGQFEEVEPCCSLTIIQMRFCLPASICWMKFALAWLA
VLGARLRLRVDLEKASLARLATACLPSFTIIFEVERSPDLCDPGSINVQT
VFSCNIKHRHRNISSPSPIQKVLRAIQERCAAAERILPSSLAEAIRN
>SMa0541 hypothetical protein
MELSIASWRFRVPEHPQRVGTMRRILIALAATTTIVGAAAAQTAETTTTE
TFVTAKPTDVLSYNLINLNVTNTANESIGEIKDLVLSEGQLAGYILSVGG
VLGMGERYVVVSPKALKITYVENDKKWTAVMDATKDQLKAAPEFKYEGRW
KR
>SMa1062 hypothetical protein
MFDLDAGPVTINVPDTGTRFVSLLVIDEDHYAHGVYYGPGSYTLTRKVIG
TRYVVAAFRFFADPNNTEDMKRVHALQDAVTVNQPAAGTFEIPEWDGKSR
DKTRRALLDLGSLLPNTARMFGPRDEIDPVRHLIGAALGWGGNPDKEALY
LNVTPEKNDGKSIYRLTVKDVPVKEFWSVTVYNKEGFFTPNELDAYSLND
VTAQKSQDGSITIQFGGCDGKIANCLPTPEGWNYMVRLYRPKSQILDGRW
TFPVAEEVK
>SMa0199 putative ABC transporter ATP binding protein
MMASSMDDVAKAIIAVDGAKVSFGAVKALDGVTLRVMPGECVGLVGHNGA
GKSTIVSVINGGLTPHHGIVTSDGERQERYGINAARDRGVRCVFQELSLC
PNLSIVENTRLVHRTLGGFGWRLRAAKIIEKSLDAVFPGHGIDSGRTVGD
LSIAERQMVEIALAFSDAGTPARLVILDEPTSSLDASLARQMLDHVRRFI
AAGGSVIFISHILHEILETADRIVVMKDGRVVAERPAHRFDHHGLVEAMG
SVAKAETRQRPVCDQPAAPVILSHEAAGIPFTARTGEIIGLAGLAGHGQT
ELLLALHAARSGNWLPQGNPLVTFVAGDRRLNGVFELWSILRNFSIASLG
DMSRRGLVLEAGEKTRGTAWKERIEIRTPDLNNRILSLSGGNQQKVLFAR
ALATRAPIVLMDDPMRGVDVGTKQEVYAIIREEAAAGRTFIWYSTEMDEV
CLCDRVYVFREGRITAELGGDAVDEANIISASFEGTA
>SMa1732 hypothetical protein
MASKQFACKASEVPADAAKIIKLGNLSLGIFRVGDGYHALLNVCPHKGAA
LCQGPVCGTTKQTDKAEFVYERAGELVRCAWHGWEFDIRTGEFLVDPRVK
ARTFPVSVESEDIFVHV
>SMa0508 putative ABC transporter, ATP-binding protein
MSVVVAKNIRKSFGGLQVLKGVSLTVEGGEVVALIGGSGSGKSTFLRCLN
GLESVDSGEIEVAGHRMSRKPAELRRLRRDVGIVFQSYNLFPHLTAGENI
MLAPVQVKGIGKDAARSEAKRCLSLVGLGDRFEAYPDMLSGGQQQRVAIA
RSLAMQPKVLLFDEVTSALDPELTGEVLAVIEKLARDGMTMILVTHEMGF
ARRVANRTIFMRDGIIHEEGPSAEFFSSPKTPELRAFLHAEVG
>SMa2301 putative response regulator
MQLASSSAGDLASRRCSPRWARMELQESAVAASRMSASPQVRYQPRSPAV
MAEVERLLAGRTRDIRLRGELGRLFQERSWSRTAKIIRAWMIWVTLLDVL
TLGLNAILLPKAVALSMLPPACLLPPAALATAFIWRKPRGVWLQRVSLLT
GLFLILLSVALVGVSAGGEFYERHLNIMLFVAITAIIIFAIPLAWTMTVA
SFALGLYLIFQLQNPGLERGSAVAGTLFFASGIIATVVARRTITILAQKT
FLLELRDMRRVAELADANARLERLAKTDPLTGIANRRWMMETLNRLWSSG
AERRPGTAMLMCDIDDFKSLNDRLGHAEGDRCLVKVAGIIQSSVRRNRDH
VARYGGEEFLVVLPGANEEAAVATAERIRASVEAASLPNPASRVAPYVTL
SIGVAAQAPGEEIVAPEKLQNQADAALYLAKQAGRNRVVLYQPDLPTV
>SMa0680 amino acid (ornithine, lysine, arginine) decarboxylase, probable
MKAPTVPIPFHKLLKVVAFVDESNLETKRLLAHIVAENFEVELRGSYNAD
VSEDASVGAYIGTVEGGRLEDARKFVRAVRDIGFRTPLWALADSHGIADI
AAIKMAGEVDGFVYLGQQTPAFYAKQIISSLVNYGKTLLPPFFGGLMAYD
GEANIAFDCPGHQGGQFYRKSPAGQLFFNYFGESIFRADLCNADVDLGDL
LIHEGPAAEAQKNAARIFGADRTYFILNGTSTSNKVVTNAVLRAGDLVLF
DRNNHKSLHQGALVQAGAIPVYLPTSRNSFGMIGAVDWDAWDEASLRRQI
ERHPLVEDKARASAERPFRLACIQLATYDGTIYNVRKVLEQIGHLCDYVL
WDEAWIGYNAFHPLFEDHSPMRIDTLDAEMPGLFSTQSVHKQGAGFSQAS
QIHKRDEHIRDQRRYVEHKRFNESLLMHVSTSPFYPLFASLDVNAKIHEG
KAGEMLWDRCIELGIEARKKLREFTRYYESSGAGPQEQWFFDPFVPDVVT
ISGSKHTEDVVESRWEALPTEVIKREQQCWRFRPGASWHGYSGYSDGYAM
VDPNKLTLLTPGIDRATGEYRDFGIPATMVANYLREQRIVPEKCDLNSIL
FLLTPAEDESKLNTLIAKLVKFKNLWDRDAPLAEVLPTVFAANRERYAGY
TLRQVCAEMHDFYRQAGVKELQRLCFRAESMPEPAMTPKAAYEALVANEV
DYVALDEAFGRISATLALIYPPGIGVIVPGERWDERARPMHNYFLAFQES
FNRFPGFNYEVQGVFQERVDGQIKFYTYAVRE
>SMa0034 possible protease
MPAATTYATTGNAYIDGLLGDWKWGIKDFTFSFPTSASFYSAGYGNGEPL
KGFAALSGAQQAATRAALDQFSSVANVTFTEITESATKHADLRLASSDAP
STAWAYFPSTAAEGGDAWFNKSSGHYSRPVKGNYAYVTFLHETGHALGLE
HAHEGNVMPVNRDSMEYTVMSYRSYVGASTTTGYTNETWGYAQSLMMYDI
AALQHMYGADFTTHSENTTYRWSPTSGEMFVNGMGQGAPGGNKILLTVWD
GGGTDTYDFSNYTTALKVDLRPGEWTTTSAAQLAKLHYDGSKVAIGNIAN
ALQYQGDTRSLIENAKGGAGNDAITGNAAANALWGNGGNDRLIGGDGNDN
LAGGAGADRLDGGNGTDLANYSNATAGMVADLYSPGSNTGEAAGDTYVSI
ERLYGSAFNDTLRGDNRANLLNGLAGNDMLNGRDGNDTLIGGNGADRLIG
GGGADTFVFQTTAQSAPAFRDVIDDFASGVDRMDLRSIDASSKAIGDQAF
LFIGSSAFHGKAGELNFRSGIVSGDVNGDGLADFQIRVMNLSALSGSDFL
L
>SMa1606 conserved hypothetical protein
MTGDFKLTQSPHPSDNEVLADTANSSAAAKKKVLVVGATGFLGTKILRNL
AHDASVAVVAMSRKGAPSNESADVEWVRGDMMDPGSLDRALQGVDVVVTS
ANSYMKGSLDTDFQGNRNLIEAAARANVGRFVFLSIVSCEAASPVPHFHA
KKVAEDLIQASGVPYVFVRAPTFLDQSTDFIAKGAQAGRFLAMGDKTTRW
SYVLTDDLASYLAKAATFPGSEINNQTIDVGWRDGPKSQQEIADLVSEIA
KKSLKVRVVPWLVLRLLVHPVKPFSELGYDLIQMLLFFKKGVYVSNISKQ
EHFFGPAPTSRDAITRWAKSQQLIS
>SMa0708 putative isomerase
MSDRVKKIESFTLTLPRETPYLGKPRPGEEPNGRGYLVRKANRTVYPTFD
RSVLVRIETENGAVGWGETYGLVAPRATMEIIDDLLADFTIGRDPFDAAA
IHDDLYDLMRVRGYTGGFYVDALAAIDIALWDLAGKLAGLPVCKLLGGQR
RDRIAAYISGLPEDTRAKRAELAAAWQAKGFSSFKFASPVADDGVAKEME
ILRERLGPAVRIACDMHWAHTASEAVALIKAMEPHGLWFAEAPVRTEDID
GLARVAASVSTAIAVGEEWRTVHDMVPRVARRALAIVQPEMGHKGITQFM
RIGAYAHVHHIKVIPHATIGAGIFLAASLQASAALANVDCHEFQHSIFEP
NRRLLVGDMDCLNGEYVVPTGPGLGVEPSKEAQGLLKKH
>SMa1863 Putative ABC Transporter, permease protein
MTMTHSETKSARISGTRLGYRFNIVGMVGLSIILCWALVAIFAPLIIPYP
VGEIVDLDYFGPMSRDFWLGSDYLGRDMLSRILMGARYTVGISLAAVTIA
CFSGVVLGMIAAVAGGWLDTLLSRLLDALNSIPSKLFGLVVVAAVGSSIP
VLILTLSVIYIPGAYRFARALAVNINAMDFITVARIRGESTLYLIRSEIL
PNIIGPVLADLGIRFVFIVLLLSGLSFLGLGVQPPYADWGALVRENIGGL
PFGAPAVIFPSFAIASLTISVNLLIDNLPQKIRDRSE
>SMa0209 hypothetical protein
MLKSIDLMYQAMLAELRQRSLDAAWSADFPSDGRFTPLTVKQRRYWYFDR
PDGKGGRTRDYVGPASDPEIAKRVEEFKAQKDDLQARRRMVSTLTREGGM
VAPDRMSGDVIEALASGGLFRLRGILIGTIAFHTYAGVLSVRLPGHSIMT
GDADVAQDFAVSREVGDSMPPILQLLQSVDPTFRPVPHRSGQAAFSSFQN
KNSYKVEFLTNRSSDDYMDQPARMPALGGASADPLRFLDFLIRDPVRTVV
LHKSGIPVTVPDPSRYAVHNLIVASRRHNDGQSAVKRDKDIRQAGILFEA
ILQTRRSSDLALVYNEAWQRGDAWRAGIRAGAAMLPDEGRKHLKACLRTG
AVEIDEDVSLPF
>SMa0421 hypothetical protein
MLPVKDQQTALEEFRSLPPIAPRELVGLWKGHGIPTGHPFDGVLENLSWF
GKRFTPDMRVDALLFRSGDRRLVAIDPKWIPLRLALRFHEVGRMRVARNL
FSYLQRGLRAKGPVASSKTMVFGGVESAAMVYDHQPIVDHFRRIHADRIM
GAMTIRNDERIYFFELQRVDGP
>SMa2131 hypothetical protein
MVAFLNLMHFSRAPPSQGKGHGEHPWDCDMENLSHIDPIARDEWHVVASI
EELPSTGMFSTVLLGQKISIGRHGDRLGAWCEPQAAAEPSGSQVFGAELP
VIQRFGYLWATLGDPPRDLFEIPEFDEPDRSRIVQGVTRVGVSAPRAVEN
FLDMGHFPFVHTGILGEEPHTEVKPYKVDVYSDPPEILVTDCEFYQPQAN
AGSETGADTEYVYRIPHPFCAVLYKTGSTRPDRRDVIALFGQPVGEDQVL
VHIVICLLDEVTDVAAMRSFHQTILGQDKPILENQMPKRLPLDSRSEVPI
RADAASAAYRRWLRERGLRYGVVTGSA
>SMa1662 putative drug resistance protein
MNLSAHFIDRPRLATVIAVVMAIAGALALFQIPIAQFPQITPPEVQVTAS
YPGANASVLEESVGAPIEDQVNGVEDMLYMSSSSTNNGTYSLTVTFAVGT
DPALAQVNVQNRVALATPRLPASVTQTGVSVRARSSSMLMGVAIYSPEGT
RDEIFISNYAANNIRDAIARVAGVGEAGIFGPSYSMRIWMNPDRMQALGL
TATDLTSAIQAQNAQASAGQLGSPPATSGQQLQLTIMAQGRLATEEDFSN
IIVRTNTEGALVRLRDVARVELGAQSYDTASTFNGQPSATVVVYQSAEAN
ALAVSRAVLSELDRLSRQFPEDVAYAIVFDTTAFITETIKEIAITLAITF
ALVVAVTYFFLQDWRATVIPTLTIPVSLIGGFAVLYLLDYSANTITLFAV
ILAISLVVDDAIIVVENVKRLMAEERLNVHDATRRTMSQVTGPIVATTLV
LAALFVPIAFVAGITGQLYRQFSVTILITITFSTINALTLSPALCVLMLR
SPREQRSGIFGTFNRGLDFSRNWYVAMLDRMSRRLWIASVILLAILGGVY
GLFRALPTGFVPSEDQGYLFINVQLPNAASLERTQQALDTVSRILQRTPG
VANSVGIAGNSMVGGGGSNAGMVITALKPWGERRSAEESIDAIMNRLRAD
FGRIPTASVVPFNPPAIPGLGTTGGFDLRLQARSGQSQQEIAEVMRGLIV
KANQTPGLASVFSTFSADVPQVFLNVDRRRAELFGVSTATIFNAMQSHLG
SSYVDDFNIFSRVYQVRIQDEPQFRSRIEDIQRLRVRSRNGELVPLQSLL
SISTSYGPTAINRYNLFPSASINGQAATGTSTGQALATMASLAEQNLPEG
FGFEWTGLALQEEQAGNQTALILLMGLIFTYLFLVGQYESWSVPLAVMLS
VAVAVLGALVGLMLASIDINIYAQIGLVLLIGLAAKNAILIVEFAKERRD
KGMATPEAAAAGTAQRFRPVLMTAMASILGVIPLVIATGAGAGSRRAIGM
TVFGGLLVGTVVGLLLIPVFYVLVQTVREQAKERFFRTRAGRKA
>SMa0678 putrescine/ornithine antiport transport protein, probable
MTEHSLNVAMEAASKKKMNLVQLTFIVAVNMMGSGIIMLPANMAQVGAIS
LLSWLVTAVGSMAIAYGFAQAGLFNQRPGGMSAYAEDAYGKPGYFLVFLL
YFLSLAVGNVAIGISAVGYLAGFFPWLTSTPIATCVSLIILLWLTTVANF
GGPRVTGRIGSITVWGVILPVGLLCIIGWAWFSSEVFAAAWNPNGLTLVQ
GMGSSISLTLWAFLGMESAAQNSDAVENPKRDVPLACLFGTLGAAIIYIL
STTVIQGIVPNAELAASTGPFALAFATMFNPAIGSVVMALAVLACVGSLL
GWQFTIAQTARAAADERMFPSLFSRVNEMGAPVTGMIVMGVVQSLLALMT
ISPTLNEQFAALVNLAVVTNVLPYIISLSALFVMMRAAGVSESKFRLNSS
IAIVGMLYSVFAIYASGKDAVLGGMLVTGIAFIIYGLIAPRFTPRPGIVR
TA
>SMa1684 putative two-component sensor kinase
MSLERHWTTRSRQLGRYVVPCCVILGTSAAAPVAALKTGDHVPRVLILYP
YDERLAATTAAGEAIRTRLLQATKARIDLFSEFLDLSRFPEAEHIGRMAR
YLSAKYADRRPDVVIALGEVSARFISANRRELAASAKIIVAGFSSSTAEE
MDLPNDVVGAFSEFDIVKTAEMARGLQPEARHLFIIGGSSEFDRSWLSRA
RADLAAFSKDYETTYLEDLTIEEFTKVAAEVPRDSIILALTILKDRDGRN
FMPREAVKQIAETAGAPIYGPYLTYIDYGVVGGSVVTFESLGKTVADLAL
EAVAGKPISNLESRQTYVADARQLERWGLSEKNLPAGAIQMFKEPTFWEQ
YWLAAVFALAVITFQGTVIAGLLIERRRRQAAETESRHRLLEVVHLNQSA
TAGALSASIAHELNQPLGAIRSNAEAAAVILRSERPDLELIGQILVDIQD
DDQRAHDVISRIRGLLKKRSEIDWQEFDLNDVTTSAIRILRGEAERGSVV
VSSSQTSRELPVRADRVHVQQVILNLATNAMDAMLEVITTEKRLLLETRL
TAESKVELSISDTGRDIPDERFASVFEPFYTTKPTGTGLGLSIARAIVET
YARSRPPTVLEAERSFVLCCLSHEMGE
>SMa1993 Conserved hypothetical protein
MTINSHSWRTLMVEKRSVLCFGDSLTWGWIPVKESSPTLRYPYEQRWTGA
MAARLGDGYHIIEEGLSARTTSLDDPNDARLNGSTYLPMALASHLPLDLV
IIMLGTNDTKSYFHRTPYEIANGMGKLVGQVLTCAGGVGTPYPAPKVLVV
APPPLAPMPDPWFEGMFGGGYEKSKELSGLYKALADFMKVEFFAAGDCIS
TDGIDGIHLSAETNIRLGHAIADKVAALF
>SMa1675 hypothetical protein
MASYDTMPVNAGGDPSGPPPSVRVVLGGVMLLAGLVVLTDVAFASVVTPV
FVGTAAILVGVFEIVYAFWARRWGGLSWQTLLGSLYIALGLMLTDVAGSS
LMEVLTNIVARSVRTQELLQTYTIGLLFILSGVVRILLSVSHWREAGWPM
MLSGAFGAAAGLVVLAEFPKMGLWLLALLLGVDFLAHGLAWLRFAFFPGR
TKVETIRHGHPPDFRD
>SMa1161 hypothetical protein
MEPLILRLGLALAIGLLVGLERGWREREAPAGSRTAGIRTYGISGLLGGI
VAALSDAQRSDLIFAAGFMSFALSFTWFKLHEARHDEDFASPA
>SMa1093 hypothetical protein
MPTTSRQIFEGRSLGDPAKDPKRSRAHARSRSNKPSIRTTLASRVGALME
KNMPFKTILAVLGVPQTDRDLQTAADISIQVRAHLSVLIVGFAPQPTSRY
ATLASAWFEQRDWNLKALGETAKTVRDKLSKREISFEVDSIYAEVAGATY
DIGDAASILISS
>SMa0104 putative ABC transporter, periplasmic solute-binding protein
MLKRLTLAAMLSLGVAAGALAAGERHGGTLVFTAPYGSSFATLDVQSSPN
TQEEFITQAIHRALYSWDSNQNKPVLELATSEEVSEDGKVHTYHLRKNAV
FHNGKPLTADDIIYSYKRIANPENAFPGASFIAVIKGAEDYIAGKADEIS
GLKKIDDHTLEITYTGTINPGFPLMQNTTVIYPSNVEGESTFGKNPVGLG
AFVFKEHVPGSQVVVEKFDKYYEEGRPYLDRINIVLMAEDAARDVAFRNK
EIDVSILGPTQYQAYQGEEGLKDHLLEVAEVYTRNIGFNPAFEPFKDKRV
RQAINHAINAPLIIERLVKNKAYPASGWLPISSPAFDKDKAPYAYDPEKA
KALLAEAGYADGFEFEVTASPNESWGVPIVEAILPMLKKVGITVKPKPVE
SSTLGEAVTTNNFQAFIWSNLSGPDPLNALRCYYSKTAQSACNYTSYASP
EFDKLYEAAQQERDSAKQNDLLRQANNIVQDDAPVWFFNYNKAVMAYQPW
IHGLVPNATELAIQPYDEIWIDETAPSSRQ
>SMa0164 hypothetical protein
MIRRSSRPRGFPGLLFASSILLSGCQNHELVRSETIALSAGDAIAANSVM
QMVDPWPPRVKQTSLATPADLEQYKPQQPNAEQNGGNGETYPNDTTTQ
>SMa0190 hypothetical protein
MADKQGGPTKADWVAAGLSALTAGGIEAVRVERLAVILGVSKGPFYWRFK
NRGELLEAIIEFWKRDFTADLIEQTSHFDTPRERLEALAELAVVSTSGAL
DVAKTECALRAWAAQDPLPRAAVREVDAMRTKHLTEEFKLLGAPHPLAEQ
LAKAIYLALLGLYTVRQYTPELADEQSYLTAVRIALDAAQIQSHSTEASA
AKSRLEPDI
>SMa0725 hypothetical protein
MAKVPAGSGLMSDLAETVGSSQSAQAFVNLLRLNFEECAALTADADEYEE
GNAGSSSH
>SMa1424 Putative ABC transporter permease protein
MPIVTRPISRETLIKWAPLLVLIALVIFFTVLNPTFFSARNFARIAIASA
PALMVAVGVTFIIIMGSIDLSMEGVISLTAVLFSFAFIKLGGTLLGTAWL
ALPLILIIGGLIGFLNGLVHVKLRVPSFMASLAMGFVGTGAAILITGGDI
VRVSDPAFRGLLTVRWLGFPLMVYVAAVFLIVAWFIQEHTRLGRNFYAVG
GGEELAHASGLDVSRVRIAGFALAGVFYAIAAVLVVARIGQAESVTGSNF
MFVSITSVVVGGVALWGGIGGVWNALIGVLIVGVINNGMVVIGLPDFLQD
GVLGLLVILAVVLSTDRKLVSFVK
>SMa2093 hypothetical protein
MSSLSSVWYTRCPAPTPLSIAHQLGWVDKQFQSAGVAVRSIRDSKDPSVR
QSHFTHALDYSFRQGGNIPPIWARSGGRETRVVGITTTDEFQAIVALPGS
GILAASDLKGRRIGVPRKPNSEVVDFQRATALKGIVSALSLGGLHQGDVE
LVSLDTDEGNLIERGNAAFLGLKRRYPYGDELLALASGKIDAFFVKGAEG
IVLANQIGAVVVSEFGFHPDAKIRINNGTPRPLTVDARFLDEHFDLVVDL
VATVARVGEWAVSNPDTAVRFIANEIGVGEDAVWAANGPNVHRHLALTLD
DEQIAAFDHFKRFLLDWGFIPKDFDVFGWIDEKPLQAALRRTAA
>SMa1136 hypothetical protein
MESKMSVVYTVGPIKRHQVDKAYRLIEAAGCHFDLQAWREFCAGKVAGER
PASEIERIVTAENPLGYIAGICIMHPVQNAKYGRMLDVPVFIVTSAGDTR
GAANALLEYFMAVAGENSCGFIRVAALDPADWRRSITTSPREDRGILIPV
QYP
>SMa1782 Putative LysR-family transcriptional regulator
MTVSMEQLEAFVAAAEHGSFSAAGRALRKAQSAVSTQVSNLEEDLGLELF
SRQGRNPTLTAAGERLLSEARLILDRREHLIGVAASFEAHVEKRLVVAID
ELYPEHALGELFAEFAVHFPHVELELLFPMMEDVSRLVLDGKADIGVMWR
QEDLPAELGFHTIGWVQLKLVCGRNHPLANAVVGWEDLKRHRQIMVTVRT
EGMERHRLRVAAEVWWVESHWVILQILKQGIGWALIPAHILARSPVAQDL
AIPALQFDDGAHPVALELVWHKQRPSGPAATWLRKRFATTKIDMVAE
>SMa1973 Hypothetical protein
MTMTKAKIGIVGTGFIATILAPQIQSSKKARLDAVSSRTLAKAESFVANY
PGAIAVEGADQLIARDDVDAVYIATPTSAKEDVASRALTAGKHVLIEKPL
HSAASFKRLSALARQKASC
>SMa0610 hypothetical protein
MPNVEKVQAGYKAQPLSAFLKQPAPTAAPEIAFVPATTAGIKDNFFQYLD
AALQFVPETLRDNAIRAKLAKIGIGPGKTFEFKELSLEHKAEMLISVKQG
DDKINKWLASGNKPINRWNVSSLLGDEVFFNGDWLRRAGAAKAGLYGNDA
VEAMYPFTRTDATGEPLDGSKHKYTLTFPPGQLPPVNAFWSVTMYDGKSQ
FLVKNPINRYLINSPMLPGMKTEPDGSVTLYIQEDNPGAGKEANWLPAPD
GPIYLVMRLYWPKTTPPSILPAGKGTWQPPGVKRVS
>SMa1195 conserved hypothetical protein
MPFSCTNISVQGRGFCSEWLQRVLSQGHLWQATPDKEQIMTGIRLDNTVA
AIAAELPGAAELFRGHDISFCCGGNVQLSEAAVKAGVAPSALLAELQALV
VAARRDAPAETSDLIGHILDRYHQTHRAELAWLIPLAQKVERVHGDHPSA
PIGLSQVLERLRDDLESHMMKEEQVLFPIMRRGGSAVIAHPITQMRDEHE
EEAEHLRTVEHVTHGLSLPPGACGSWTALYTGLRKFTDDLVTHMHLENAV
LFPRFETQAQAAV
>SMa2355 putative DNA damage-inducible protein
MNARRRRARTMTRAMGHLPPSPGGRVRKIIHIDMDAFYASVEQRDNPELR
GKPVAVGYPEARGVVAAASYEARKFGVHSAMPSVTAKRKCPELIFVPHRF
DVYRAVSRQIQAIFAEYTPLVEPLSLDEAYLDVTENFRGLKLATEIAEEI
RGRIRAETHLTASAGVSYNKFLAKMASDQRKPDGLFVITPKHGPDFVQAL
PVKKFHGVGPATAEKMKRLGIETGADLKSRDLAFLQQHFGKSGPYFYWIA
RGIDERKVKPDRIRKSIGAEDTFREDVHDLETARAGLKPLIDKVWHYCEA
SGIRGKTMTLKVKWADFTQITRSKTIVAPIASVAEMSEIAELLLSPIFPA
PKGIRLLGVTLSSLDTVDDRSEPQLALAL
>SMa1292 Conserved hypothetical protein
MNANKPTLTLGPVLYLWEGEKWRDFYFRIADEAPVSHVVIGETVCSKRLH
FTDPYFAPVIERLVAAGKRIVLSTLALVTLERESQYVRSLIADSPYPVEA
NDLSALALLEREPHWIGPLVNVYNAATARVLARRGARAICLPPELPASSI
DEIVAHTPGVDFEVLAFGRMPLAISARCAHARAKGHIKDNCQFVCKDDPD
GLPVNTLDRQSFLALNGVQTVSFTCQALLAELSGLVGRGVSRFRLSPQDC
DMVSVARLYDGVLHGRLDAEDGLARLRQIYPTAPFSNGFHHGQEGAAWIA
RARNTAHGANA
>SMa1720 putative LysR-family transcriptional regulator
MDLSVIDCFVKVADARSISAASRLHRLPKSTLSHRIRQLEDQFGVELFVR
EGHQLHLTDAGSELLRHARKIRASCDDAVTAMAEMHKEVAGTLRVGSTGE
FGTTVTSELLYAFQKTYPQVILDVVFLSARQPFTDVSDMALDGIFHWGEP
SDVDYVSRRLATASYGLYASPDYLAQHDQPANEDELALHRGLIFRSTTRL
QPWYLSRPGGSETEILLTAALTANDYWTLKYFAVAGQGIAYLPGFFVETE
CQSGLLVPVLSEWRSRETAINLIYSRRRHVSRRFKAFVRFCMEFYRRRER
DQIPRYFVEKIARE
>SMa0185 possible transmembrane-transport protein
MADNIHTPEQASRREWVGLCVLSIACLIYSMDLSVLFLAVPAIVADLDPS
ASQLLWINDIYGFMVAGFLVTMGTLGDRIGRRRVLLMGAFAFGVASAFAA
FSNTPGQLILARALLGIAGATIAPSTLSLIVNLFKNEAERNRAISIWGTA
FALGGLVGPLIGGILLQYFHWGSVFLINIPVMLLLLAVAPFLLPEYKNND
AGRLDLLSVVLSLATVLPIIYGFKHMAADGFQLAQIVYIGLGLLVGLLFV
RRQRRLSDPLVDLALFRVPAFTASLMVNLAGVFFVFGVFLFQNLFLQLVL
GLSPLEAALWSAPSALVFAVMSFQAYRFTNRFGPVRTVLGGLLVNAAGAA
AMAIAAYAESLIGILGSSMIIGFGFVPVVLTTTGLIVGTAPPERAGSASA
ISETSAEFGGALGIAVLGSLATLIYRMAMNRADLSSLNPVQAEAVSATLA
GAVETARSMPGSTSAVWLETAKSGFSLGFAICCVVATVTLLLLAIVARRV
YATAHIDESTLAPH
>SMa0235 putative dihydroxy-acid dehydratase
MTDKPQRRLRSQDWFDNPDHIDLTALYLERFMNYGVTPEELRCGKPIIGI
AQSGSDLTPCNRVHMDLAKRVRDGIRDAGGIPIEFPTHPIFENCKRPTAA
LDRNLAYLGLVEILYGYPLDGVVLTTGCDKTTPSALMAASTVDIPAIVLS
GGPMLDGWHEGDLVGSGTVIWRMRRKLAAGEIDREEFMQAALDSAPSVGH
CNTMGTASTMNAMAEALGMSLTGCGAIPAAYRERGQMAYRTGRRAVELVF
EDLKPSDILTREAFLNAIRVNSAIGGSTNAQPHLAAMAKHAGVELYPDDW
QVHGFDIPLLANIQPAGAYLGERYHRAGGTPAIMWELLKAGKLDGGCRTV
TGRTVAENLEGREPTDREVIRPFDEPLKEKAGFLVLKGNLFDFAIMKMSV
VSDDFRKRYLQEPGREGVFEGKAVVFDGSEDYHKRINDPELDIDENTILV
IRGAGPLGWPGSAEVVNMQPPDHLLKRGIRSLPTIGDGRQSGTADSPSIL
NASPESAAGGGLAWLRSGDVIRIDFNLGRCDMLVSDEDIERRKADGIPAV
PADATPWQRIYRKSVTQLSDGAVLEGAADFRQIAKNMPRHNH
>SMa0469 putative ABC transporter, permease
MVICAVLGERIAPDSPFLQRLGVGDTPPSQDHIAGTDLLGRDVLSRVIYG
ARTALAGPVVVAAGAFAISTLLGLLSGYLGGLVDSAIMRWVDFMFALPGP
LVAIVVVGVVGGGYWTAVLVLVVLFTAPDTRIVRSAVLEQRPLAYIDAAR
TLGISKTRILFVHILPNIAPIILAYVVLDFAFALVNLAGLSFLGLGVEPG
TPDWGRMLFENRTILFSNPAALLLPAGMIILTAVSMNLVGDWLFERFSK
>SMa1761 putative aminotransferase
MSMVNAFSREDFERLDKAERALIARREKVLGPAYRLFYEKPLHLVRGEGV
FLYDAAGERYLDAYNNVSSVGHCHPRVVEAITRQTAVLNTHTRYLHEGIV
DYAEALTATFPEALSQAMFTCTGSEANDLAVRIARFVTGGTGIIATELAY
HGLTSAVAEFSPSLGESVTLGPHVRTVSAPDSYRHSPEEIMEKFGRDVRA
AIADLKRHGIKPAMLITDTIFSSDGIFDGPRGFLKPAVDAIHEAGGLFVA
DEVQPGFGRTGETMWGFERHGVAPDIVTIGKPMGNGYPMAGIVLRPEVIA
EFGPRARYFNTFGGNPVAAAAGKAVLDTIRTEGLQQNALVVGRHIMERVK
SLSAIHPAIGNVRGSGLFIGVEIVADSTIKRPDAALTTRIVNGLRERRIL
ISASGPNANVLKIRPPLIFSRENADMLVDALGNVLKTL
>SMa0709 putative ABC sugar transport system sugar binding protein
MKTIGKYAVAATVTAMLACTASAVPASAEVLKFVSWQKDEKGIGDWWGSV
IKEWEAKHPGNTIEWTKVERSAYSDTMTTLFAGGTPPDIVHLASFEFQTF
ANNGWLEDLGPWVEKTGLDLDGWSGQDICKFQDTTVCIMMLYYGTIFGYN
EEMLKQAGVDVPTNYEEFLAAARATTKDLNGDGIVDQFGTGHETKGGGGQ
YIAEIASYLFDAGARFTSAEGEVTIDTPEMVEGLTRWKTVVKENLTPRDL
SAGEVRKLFADGKIALKVDGPWIYSIMQQGAAKDKLKLASVPFDPPLGGS
SNILAMPSEISEEKKQLVWDFIAIATSDKFQTSFATLAASTPPSPRADLT
EAKAQIPHFHLMAQSQKAAAEHKIDRIPIGLEIQFNEFSKMIQEEAQRMI
IEDLDPAAVAKTMHEKAEALQ
>SMa0995 TRm5 transposase
MTKTEGKTASAAVKDILLSNPDGLREVIRTVMQEVLEAEMDEALGAAKGE
RTPERLGYRSGHYGRTLITRVGKLELRVPQDRSGHFSTELFERYQRSERA
LVATLAEMYVQGVSTRKVKAITEELCGHAFSASSISAINKRLDESLKAFA
ERSLEEPFAYLILDARYEKVREAGVVMSQAVLIAVGIDWDGRRQILSVEM
AGRESRSAWKDFLVRLKGRGLKGVELVVSDDHAGLVAAIGEVIPEAAWQR
CYVHFLRNALDHLPRKHGDDCLQELRWLYDRRDLDEAKADLAAWLGKWSV
RYPRLTSWVEETIEQTLTFFRLPRQHHKHLKSTNMLERLNEEIRRRTYVV
RIFPNTESCPTPRPRARRRNPRKLDGGQSLHQHGRPARAQETRTPSSRMT
STHDRPICRT
>SMa0206 putative two-component response regulator
MKVLIVEDDPLHRSYLHEAVNAALPECDTVIEAENGTVGEKLAREHKSAH
IVMDLQMASRNGIEAARTIWKERPETRILFWSNYSDEAYVRGVSRIVPDG
AAYGYVLKSASDERLKLALRSIFIESQCVIDREVRGLQQKSLGQTNGFND
SEYEILVDIALGLTDRAIARRRGLSLRSVQNRLQQLYDKLDVYQAASDDN
DDGRFNLRARAITIAFLRKLLNYSALERAEAELQEWLDGK
>SMa0599 hypothetical protein
MSLLAFPIRNLPDATGAPLCFRLVLCVGRGRLIDGIPHLAGLTLNMGKQV
TGHNRCTIHSGLFRPLQSIVVCGMLVVVAGCGGHPKGVLTPVADSMPATS
RVEMLITTTRGRSEVPGEMFTGERARAPAFANITVSIPPVRKVGEVAWPK
KLPSNPATDFATLKADDLTRDGAKTWLNTTVSKSPDRSVLVFVHGFNNRF
EDSVYRFAQIVHDSGIKSAPVLVTWPSRGSLLAYGYDRESTNYTRNALES
LFQYLAEDGEVKEVSILAHSMGNWLTLEALRQMAIRNDGLPAKFKNVMLA
APDVDVDVFRSQIEDMGSQHPRFTLFVSRDDRALAFSRRVWGDIPRLGSI
DPEADPYKQELAENEITVIDLTKVKAGDGMHHGKFAESPEVVRLIGARIS
EGQPLTDSRMGLGDHLIAGTTGAAAAAGSAAGLILAAPVAVIDPHTRDNY
ANHVGAAMGQSDGKQKIAVTDCVSRQSAERDPACAPQN
>SMa0702 hypothetical protein
MEAWVLFRLAIACGLIAATVAVQASFMSAGLGVFRRLEEDRRDFLLRYPT
VTTVIWIVYLIVPIALDVVLWATFYYLSRALPDFEDALYFSTVTFTTVGY
GDIVLGRDWRQVATFEAVNGWIIFGWATALMMAVIQRLHFRGND
>SMa2091 hypothetical protein
MALINAAQVEPRVGSPSHGSLPETIARRVGDQWRITGHKTYATGIPLLRW
VAVLAVTDEPEPRLGSFLVPTSADGIRVEKTWNATGMRATNSDDLILDDV
AIPLEDVLEIAPASEGLKRDERMGAWYFSLVPAIYDGAARGARDWLIDFT
TSRAPASLGAPLSTVPRIQDGLGEIEVLLTVNRRLLRSIAEDFDSGRAFG
ADAAAVKHTVIANAITVTTLALELGGNPGISRDNPLERHHRDALSGKAHA
PQNNLIRMMLAKAAFSRHAATHAVALDPVPVRSHQQPRLAVVGRS
>SMa2099 conserved hypothetical protein
MTAVTHSAALEPAERLTALKADIADPEEVARIVVGHDAVISAYSPGLRRH
SAEDAAVLIEKAHASLFEGVKRAGVRRILIVGGVGSLQASLGVDVVDSDF
YPADHRAHTLRNREILRSLRRGEHDLDWTYVSPPLSIKAGERTGRFRLGE
DALLRDEAGESRISAADFAIAIVDELDKGQFIRRRFTAVY
>SMa1414 putative L-sorbose dehydrogenase, FAD-dependent
MMEGFDYVIVGGGSSGCVLAARLSENPSVRVCLIEAGGRDRHPLIHMPVG
FAKMTAGPMTWGLTTAPQKHANNREIPYAQARVLGGGSSINAEVYTRGHP
RDYDRWVEEGADGWSFQEVKPYFLRSEGNTILSGEWHGTDGPLGVSNLPD
PQPMTRAFVQSCQELGIPYNPDFNGPVQEGAGVYQTTIRNSRRCSAAVGY
LRPALARKNLMLITGALVLRIVFQGRRAVGVEYSTGGAAKIARAESEVLV
TSGAIGTPKLMMLSGVGPAASLRSHGIDVVQDMAGVGQNLHDHFGVDIVA
ELKGHDSLDKYNKFHWMLLAGIEYALFKSGPVASNVVEGGAFWYGDRASP
YPDLQFHFLAGAGAEAGVPSVPKGSSGVTLNSYTVRPKSRGSVTLRSADP
RALPIVDPNFLDDPDDLRISVEGIRISREIFGQPSLQKYIKTIRFPDESV
RTQADFEAYARQYGRTSYHPTCTCKMGRDDMSVVDPQLRVHGLDGIRICD
SSVMPSLVGSNTNAATIMIGEKAADLIRGNI
>SMa1968 Conserved hypothetical protein
MRSNSSRFQEQIMTSATQDLKPVLFVLTSHSVKGETGEYTGFYLGEVTHP
LAVLDAAGIPVEFASIAGGEPPVDGLDLHDAVNARYWNSEGFRHAIRNTS
RLSDVDPKDYSAIFFAGGHGAMWDFPTSPAVNSVARDIYEAGGVVAAVCH
GPAALVNITLSSGAHLVAGKNVAAFTDDEERAVKLDKTVPFLLASTLSAR
GAHHHPAADWAAKIVVDGRLVTGQNPQSATGVGEALRDLLTA
>SMa1364 putative ABC transporter, periplasmic solute-binding protein
MLFRPGALVAATMLFGLPASAEEIVRISGWGGSEVAIVNGLLTNVLAEKL
AEEGIRVKYEPVDGDYSQFIINGLSAGTAPDLFYVDTFWARSVFSAGQAA
PVTNDVSGFAANLLTAFTYDGKLYSIPKDFNTLAIHFNKDIFDDAGVGYP
SDDDTWTTLQEKLVAVNKSLPEVDGLCVVPEYARFGAFALSTGWSPFDAR
GKTVLDKRFRRAFDFYTGLVKSGAAVMAADAGHSWTGGCLASERAAVAIE
GAWILSALRDSAPNMNLGTVRMPKDPESSKRGNIVFSVGWTVNAASKVSK
AAAKVAELLTTEEAQQWVLEQGLALPSRTSLNDNPWLRGGQPEQIASRVV
LEGLSDDHVMPYFFGDVGGSWMQPINAALNSVILGEEDANTALPSAQSAF
DRMLAK
>SMa2277 hypothetical protein
MTKYGREHRSASQTTAILPTPLRRPIYRARQHPMLPCPRLQHAASATPSL
SLEAMRALQVGGVLSVLTRSVEYAIASPALSCRVQLCFQALQLGFGDKYC
LITATSSLSLALTSTIPPPWNSSLGN
>SMa0941 hypothetical protein
MLNACGLRTEGYSSAEAFLSHATPAKLGCVVLDMHLVDMSGMELRRRLKT
AHSRLPVIFITAIDDDALELEARRVGCLAYLQKPFAAASLIAAVKEALAG
DAMD
>SMa0687 hypothetical protein
MNRRRARAGRRTSYCPPADGRPARCPARVGKRFHTRHQKQPGRTGRAETR
RSNVMNVTSKTRIVAATAVSLLCSGTMVQAQDMKKYGTAAGWDIVVRGDM
GPGCLIAKKLGNDMQIQMGIDETTGRRGYMALYTKLAANVGSGEKRAVIF
DVDGQKFSGEATGQQLEGFDGAYVWVNNPDFIYDLAKMKTLTITPDGRKP
FALSLAGTDAAMQAMRACQEAN
>SMa0025 hypothetical protein
MVVILSERAFSRRDQTGAEVMLNRRRFLTNAATAAAATLWVSKGGMQARA
EEAGNGSPWRRFEIITKVDLQPAEGPAQLWLPVVRSAGDYQKADAPLWVS
NSTDIRMEKDPSSGADILRVMWEDEPRRVIEVTQHVATRDRRVTSKIAAT
EAELQHELRGTPSMPVDGIVKTTAMRIVEGRDRAEDRARAIYDWVIDNTF
RDANIDGCGVGNARDMLETGYFGGKCADISSLFVSLARAAGLPARDVFGI
RVADSADFKSLGRSGDITKAQHCRAEVYLDAHGWVPVDPADVRKVVLEEN
LPLDNPAVRAFREKAYGNWEMNWVGYNTARDLVLPGGELRQGFLMYPAAV
TSRGELDCLNPQTFAYSITSREITA
>SMa1344 putative ABC transporter, ATP-binding protein
MTQIELRGIEKHFGAVQVIKDLNLTIADNEFIVLLGQSGCGKTTTLRAIA
GLETIDEGDILIDSQPVQHIKAAHRDIAFVFQSFSLYPHMTVFENIAFPL
RATRGNRADIEREVQAVAKTLQITHLLVKKPSALSGGDMQRVAIGRALVR
RPKAMLMDEPIGALDAKLREEMRAEIKRLHIKQGSTTIYVTHDQVEAMSL
ADRIVVMHEGVLQQVGSPHEVYARPANMFVAQFVGSPVMNMSDVTVSEDA
GHARVVVRGAPTSFDFPSNLTAQLAVAGAQNGNLTLGVRPEGVLVSREAR
EGFVPVEAHIIEPLGSHDIIDLKVGDQMLRARTKSGFVPRPGEAVWARID
PAQAHFFDSSTGTSFGIRL
>SMa0426 conserved hypothetical protein
MALRPYWKGYLKLSLVTCPVQMMPATSESEKVRFHTLNRQTQNRVVSHYV
DSVTGKEVKEEDEVKGYQRGEHEYVILEDEELENVALESTKTIDISTFTP
RDSIEWIWLDTPYCLSPNDPVGQEAFSVIRDAMEAQNMVGISRLVISRRE
RAVMLEPRGKGIVLWTLRYGDEVRDEDSYFEAIGDEQADSEMMPLVQQLI
KKQTKDWTPKMVADPVQERLLEIIAAKKKALKPQKAQGKTAPSSSPSNVV
SIMDALRKSVAAEKRATK
>SMa0396 putative ABC transporter, permease
MTGSKSKSFVLWAFVTTALLMLSAPTVVVLGASFTAGNIITFPPDGLSLK
WYGAIAQASDLRQAFVRSLIVATICTLVSIPVGTLAGIALAKYRVRFARS
IQIYLLLPFTIPLIGSGIGMMLVLGNMGVLGKLWPVGIACAVINLPFMIW
AVTASASNLSPDLELAAANCGAPPLQRFLYITLPAVLPGVITGSLLMFIL
ALNEFLVSLLLVDARSVTLPVQIYNSIRSIITPDLAAISVVFIACAGLAI
ALLDRLVGLDIFLKSK
>SMa2217 putative decarboxylase
MRRAPTASSPSPPQPAQAANARQKEPFMRDFVRKLQERGDLLVVEREIDP
AHELAAVTHLAQKKWAKPVMFTNVKGTRFPVVTNVYSTRERLGEVIGIDA
GDFCRQWSRLSSLGSAEMREPLVPANQPPGYDEVKLSDLPLITYSDRDGA
PYFTSAMFIARDPDTGVANLSYHRSMFISDNELRCRLAPRHHLTIYHEKA
EKMGKPLEAAMLIGPPAHAFLTAAAPLAYDVDELEVAARLRGKPIEMRRC
NHIDLEVPSETEVVIEGRFLPNERRPEGPFGEFMGYYVPVGPNAVFEVLG
VTVRKDAIFHSILCGSPEEVLTLELSVSANIYQRLSAALPGIVNVTCQPF
VNHAVVQIEPQFEGHARQVMLATIGAEPIWAKQITVIDTDVDIYSMDDVQ
WAILTRCRPDKDTMIIPETPSFYRDEAKDHWGRLLVDATKPWGREEEFER
KRLRMVNDIRLSDWFAGA
>SMa1704 HYPOTHETICAL 50.8 KDA PROTEIN IN SYRB 5'REGION (ORF3)
MRLVSSSLRPFERVSVNDLLRSLLPMKSSDAECRYFGRNEGTMSRESLYI
SACHPYEGSMAENHTHWLRKENTDQTKASFPELFFDLVFVFALIQLSESL
SDDFSLGIAAEAVLFIFALWWVWIHTTWVMDLLDTEIEPVRLLLFTLMFF
GIVMAIALPEAFKGMGLLFAVAYSAMQVSRSLFALYAFRRGDPASFMTFF
RITAWLTISSTFWITGGLSEPHLRVVLWIVALVVEYTGPTVRYWVPLIGA
SPRETLDIDGEHLAERSALFVIIALGETILTIGKHTFSNLETEGTPWVLC
FSFLTTVLMWWIYFHDGQQRAADKAEDTSKPQTTAQYLFTYGHLPIVGGI
IFTAVGEDFSLAHPYQLGTYNFALAQLGGPILFLAGTMWMKRVSSRVLPY
SHVFGISLLTASFTLVPFVANFAIQALTGVILLVVAVWEYVALKHLRRSA
A
>SMa1746 putative iron uptake protein
MGVNLNVEWSVAMRLFRFAVLAVIALHWADRHGTVFAAESTSYPITIKHA
FGNTIIAKKPERVATVAWANHEVPLALGIVPVGMARANFGDDDDDGILPW
VDARLGELKAEKPMLFDEGDGIDFEAVAATRPDVILAAYSGLSQADYDTL
SDIAPVIAYPQAPWSTDWRETIRLNSAGLGMAAEGEGLIASIEAEIDLAL
DGHPELKGKSAMFITHLSSWDLSVVNFYTTNDTRVRFFGDLGLMSPKSVV
QASQPGRFSGSVSAEQIDAFDDVDILVTYGDGMLFDALKANALMMHMPAV
ARESIVMLGNNAVGNAANPTPLSIRWVLKDYVKLLSEAAKKSQ
>SMa2235 hypothetical protein
MRPPTPVRFSKNPMDMYPKGTKALAALMKRLSDTDDVRLLVMIWAMHTLQ
GKRAAIARNYIRFPREAYNAQIGSLYFAPKWEMETLVTLTLNTPKYVFPA
HVRDPIDTEEFSHFAHLLNLVKKVEDDESRQHVNLDTIMREFHKLGHRQF
PWQTGWDNIASVYRHIFIYGDGPCAKHFEDTYGLSVSDFVGCAFALYVQM
GLSPFNPALEAVPFAVADGAIAKTLAMVSQDLYRARAESRALYAKFRNSL
GPIPLAYHPSYLRMKPIIRYTGAKNHYIAPFPELVLMRATVGLYYDLIGA
STSVMNHARTRFEVYAREVIKAYCPEFDPEPAIKYKHKNSDAETPDVLLK
RAGQVVSVFECKATKLSFQAQYGDDPAEDATDQYKQIANAVFQLWRFFSH
VRRGIINVPLAEEVNAVVLTMEPWTQTSKELRKKMIEEAEKIAVVKEPEM
TEADKRTPLFVSINELEYVLGHSDGDQMLATFKTATEDRYAGWGAREVRR
DLGNMPEKVKRYPFTPGAFLPWWDHAETRARDRLAVAKETAEEE
>SMa0663 hypothetical protein
MNGEPDWKHCMAAALRRFPARSLQIRELSMRDEDFRGMCEDFAAAENALA
AVDQLPLHIREERRAEFKGLIESLAAEIAIALGPADGSI
>SMa0335 putative
MSTSNILDGKVVIVTGASSGIGRAIAIRAAEHGAKAVIVSDVVEAPREGG
EPTASEIRKLGAESVFVKADVSRKVDNDALVAAAEEFGGVDVMVANAGIT
LKTDGAEVPEDDYRRLMSVNLDGPLFGAQAAARQMKALNKQGSIVLMASM
GGISGAGITVAYSTSKGGVVLMAKSLADALGPDGIRVNAVAPGTIDTELL
RTSPGIAQASEGFRQRTPLRRLGKPAEVGDAVAFLGSDLSSYVSGTALLV
DGGLLAVI
>SMa1921 Hypothetical protein
MPPRKGTKDYQNEIIRVYYGINNQTLGVQSVWETRGGKLASMSAGHHFRH
QCVAGNIAHSHAVSVAFDLTGVFTLPVDVENHPLVNELEEKASIMRAQRT
HDAFREVLSIVGAK
>SMa0561 conserved hypothetical protein
MLPMNIWPTDELRQIAESDDLHIAPFRENGRTYGTLTWIWSVVVDGELYV
RGYNGQQSRWYQAAIRQKAGRITAAGMTKEVSFEPVEGAINDQIDEGYRR
KYATSRYLAPMIGERARAATVKITPKD
>SMa1809 Probable NON-HEME HALOPEROXIDASE
MLYFLGKGYRVIAHDRRGHGRSTQVGDGHDMAHYAADVAALSTELDLRDA
IHIGHSTGGGEALAYVARHGAGRVAKLVMVGAVPPIMLKTEAYPGGLPIE
VFDGLRVQLAANRAQFFLDLPSGPFYGFNRPGAQVSTGVIQNWWRQGMMG
SAKAHYDGIKAFSETDFTEDLKRVEVPVLVMHGDDDQIVPIDSSARLAVK
LLKNGTLKVYKGYPHGMLTTHADVINADLLEFIKA
>SMa2251 conserved hypothetical protein
MIVSDDAGQFRVANHALCWVHAERLLQKLMPATPKQERLITATRDLVWRF
YKALKIWKQQPSPHLIPGFRRRFDKIFARRTGYEALDKLLLRLHRRRDEL
LKVLEHPFIPLHTNASENDIRSFVTRRKISGGTISLNGRIARDVMLGLMK
TCQKLGISFYHFLGDRLGLGSPGRPIPPLSQLVMMAS
>SMa2385 probable ABC transporter, ATP-binding protein
MPSITLSALSWSKPDGEHVFSDLDLAFGPERTGLVGRNGIGKTSVLNIIA
GTLRPSSGTVAIQGRVALARQILRAGADETIADVFGATQAVAVLRRAEKG
DASVEELETADWTVEERIVSALARLGLEARADTLLNQLSGGQRTRAVLAA
AIFSEPDFLLLDEPTNNLDRDGRRAVIGLLSGWRSGAIVVSHDRELLEEM
DAIIELTSLGTKRYGGGWSAYQAARAVELEAAQQSLTLARKTADEVDRKA
RALAERLDKRDASGTRKAAKGDMPRILVGRRKSNAEESRGKSVELAERRR
AGALDAVTAAKARIEVLQPFSIRLPRTELPAGRQVLAFDGVTAGYDPARP
IIRDLSFSLVGPRRVSVTGPNGSGKTSLLKVVTGELPPFKGTVSVNVPFT
LLDQSVSILERGETILENFKRLNPGASDNACRAALASFRFRADAALQRVE
ALSGGQVLRAGLACALGGSDPPSLLILDEPTNHLDIDSIEAVEAGLLSYD
GALVVVSHDETFLANIGIGTRVELSTSRG
>SMa2367 putative ABC transporter, permease
MRFERREHRSFALVIATPVLAILCALALAGLLIAIAGAPVMEAYWRILVG
AFGSRLSATETLTRASPLILTGLAAAVAFRAKLWNIGAEGQFYLGAIAVA
AASSHLFGGLPPPLQVPLLLVAGAAAGIVLLLVPLWLRLRFSVDEVVTTL
LLNFVAVLFVSMLIDGPLKDPMGFGWPQSQPVADAAVLPKLFARSRLHVG
LMIALVFAVAVHLVQSRTVFGMQSRAAGLNPAGAVFAGVPLGRTLVTVAC
ISGGLAGLAGAIEVMGVQGYVTTDLSPGYGYSGIVVAMLANLHPLGVVLA
ALFTAVMFVGADGMSRSMGIPSYIADVTVALSLISMLTGVFFTQYRIRR
>SMa0300 ABC transporter, permease
MSVLRLLASRILFSVLTLLLVSVLIFLILEALPGDVATRILGRDATAKAL
ELLRSQLDLDQPALVRYFQWLGDFLRGDLGVSIATGRPITDVLGPRIVNT
LLLSLFAFAIYLVLALVPAIIQATNRGKAVDNIISVITLVLLSLPDFLLA
TILLFLFAVAVPILPALATISEASTWQETLRAMVLPAVTLAIMMAVYAIR
VLRDSLIEVLRSDYVRLAELKGLRPTAVLFRHALPNAIVPALNVTALNFG
FLIGGVVIVERVFSYPGFGTLLIDALQLRDIPLIKATVMISAVVYVAANL
VADVLAVLLQPRLRTSR
>SMa0682 decarboxylase, probable (lysine, ornithine, arginine)
MRSLAEAIEKEGYRVVAGLTYEDARRLVNVFNTESCWLISVDGTESSTTR
WEILAELLAAKRSRNNLLPIFLFGDDTTAEMVPAPVLRHANAFMRLFEDS
PEFMARAIVRAAQNYLERLPPPMFKALMEYTLHGAYSWHTPGHGGGVAFR
KSPVGQLFYTFFGENTLRSDISVSVGSVGSLLDHVGPIGEGERNAARIFG
ADETLFVVGGTSTANKIVWQGMVTRNDLVLCDRNCHKSILHSLIMTGATP
IYLTPSRNGLGIIGPIAKEQFTPEAIAHKIVASPFASETNGKVRLMVVTN
STYDGLCYNVDGIKSALGDAVEVLHFDEAWFAYANFHEFYDGYHAISSTK
PARSQEAITFATQSTHKLLAAFSQASMLHVQHAVAKQLDITRFNEAFMMH
TSTSPQYGIIASCDVAAAMMEQPAGRALVQETIDEAMSFRRAVNAVRTQM
QDSWWFEVWEPPIADRAPSDATSDWLLKPGDAWHGFEDLAENHVMVDPIK
VTILSPGLNAGGAMLEHGIPAAVVTKFLSSRRIEIEKTGLYSFLVLFSMG
ITRGKWSTLITELLNFKDLYDANAPLSRALPALAAAHPDVYRAMGLRDLC
EKIHDVYRSDGVPNAQREMYTVLPEMALRPADAYNRLVKGCVESVDIDEL
IGRTLAVMIVPYPPGIPLIMPGERITAATRSIQDYLVYARSFDRKFPGFE
TDIHGLRFVANPSGRRYLVDCIVEEGQDDTA
>SMa1902 Hypothetical protein
MGPLFRNSGFPLYHEAMLFVRMRPQQRQASFEATFTAARSALSSGLQALA
YYERAGLVKRVTPLSKEAPHYASRMLVSSVRNARVMPELLGAPGNEAEGW
SASALNLSAAMDERTPGGAASLLEMASAADVRQLQLALAGDPSVASVSRV
PLRYLTAKKP
>SMa1783 Hypothetical protein
MSRRIIRGNDVEIATGAFDGRAHPLVLLVMRAPMPWWPERFCSRIPDKTA
MAELVRSY
>SMa0257 probable methylamine
MSRDSRFDILFEPVKIGPVTARNRFYQVPHCSGMGYRYPNAEAHLRGMKA
EGGWAVVSTQEAEIHPTSDLTPANEARLWDDGDLPALSAVTERVHAHGSL
AAIQLVHNGLHVANRFSRMIPLAPSHAVSDSLDPVQARAMDKADITDMRR
WYRNAALRAKKAGFDIVYLYAGHDMSVLQHFLSRRHNDRSDEYGGSFENR
LRLFREILDDVREAIGDTCALAVRLAVDELMGPSGITCEGEGKDIISALG
ELPDLWDVNLSDWSNDSQTARFSEEGYQEPYIRFVKSVTTKPVVGVGRYT
SPDSMVRVVKQGILDFIGAARPSIADPFLPKKIEEGRIDDIRECIGCNIC
TSGDNTNVPMRCTQNPTVGEEWRKGWHPETIARSEAPEPALIIGGGPAGL
EAARALAQRGVDVMLAEGGGEWGGRVARECRLPGLATWGRVRDWRIGQLS
TRVNAELYLHSPLSAADILQYGIPHVAIATGASWRTDGVGRTHRMALDFL
SEGILVSPDAILSEGAEAVPSDGPVVVFDDDCFYMGSVLAELLARRGRTV
TFVTPESQVSPWSRNTLEQARIQKRLIGLGVEIVTAMALAGRTKDQLELS
CVYSGRSRPVDCATLVPVTARLPDETLWLELKAREAEWADAGIKTITRLG
DCLAPGLIAAAVYSGHQYARTYQEQVDKDRVPFMREDIARLYGLRSG
>SMa1111 hypothetical protein
MREHELHQAQSRLLSRQEIRSLRGDCECRSEGVVEAGTPLLHIGDPADLE
IFLNFLSEDAVRTKPGQRKLQRSRSSNRALRADPGVGARNRGQRVDVILH
FADPTESWRSLGHGYRGRCRGSMRLPPPRQPTDTSLELPAERLWPQSVQR
RKGPGVATERIAVDGQPRPTEANQGLECTAPFANHPVADVIKLRTFRLAH
PKNAALRDFHLWPAGTCGDFLNRFAIDRST
>SMa1828 Putative transcriptional activator
MGDLSDIRVFLAVAQQRSFTAAARQLSMTPPTVTRSVSALEQSLGVQLLL
RTTRQVSLTSAGAVYAARVEPLIRQFDQVRDDLREEQGDVAGLIRINAPL
SLGQQLLPDIVSGFRAEHPQVSVSLSLTDRFIDIVSERFDLALRVSEAPR
DRSTIWRKVCRVRRILVASPGYLAIFGTPETPDDLSRHGCIAYDEEAISE
TWELTNGGRTRKALAGKVLAANNGELIAKLVEDGQGLALLPHFIVGDALA
SARRAEAGPCRVGASGALADIVLSALRETADARCEILGLLRKICYPYAPA
VKKRRTQVLFSRRKASFPAKGEHMLQEFDDALRHASFESDLNGLYAALLL
SRSFGSSESRIRSLGRSLPDLSQQKSRFVWRRE
>SMa0325 conserved hypothetical protein
MADLTNVAYFTAKPGRSGELGDELLQLVTPSRNEEGCLRYEIHQSNDVPD
VWMVLEDWRHASDFKLHMNTPYVQAFMAKVPDLCVEDVEICGYQQRSPQV
>SMa1032 Partial conserved hypothetical hypothetical protein inactivated by IS Rm1
MLWSIGGSPPGNADDIFKLGRGQMLALSAYLGLFAVAFGAATLLPFQSEP
LLVGLLLSGEFSTLGLVAVASFGNVLGAVCN
>SMa2073 hypothetical protein
MTITKALTESRPTRAELLARIPAFVSEVAKGAAERDLSRKLPFEAFELFR
ELELGTLRVPVSLGGPGGSVADYIEMIAAIGAADSNVAHALRAHFNYVEN
VILSEPRERDGGAIELILAGKLFGGAHTEQGTARPGQVTTKIVRQGETYR
LNGRKWYATGTAFADFASFSALDEEGPAVGVLLPVDRQGITILDDWDGMG
QRLTASGSVILDNVEVFPHELSRRTLDSLVGRHCSTLRQLHLAASAAGAV
RNVLSDGLAYVRKQARSAAHSSAETASEDSFVQQVIGEIAANSFAIDAAV
ATAAEALDRSAAALGVGTDVEIEDALIASALSTARTQLVLGQLGLRSAGR
MFELGGGSATSRNNNFDRHWRNIRTILNHNPLLHKSRVLGDYLLNGTTTH
LREGKVF
>SMa0070 putative ABC transporter permease
MKRFLKIYLDKPELAGLALLFVLVLVFQAKSNGILLSFENMRGVMGLLPE
MALVAIGVTLLMICGEFDLSVGSVFALMPMSMAVLLNSGVPFTVAVLLGL
MICGAIGFINGYVTLQFSIPSFITTLGMLFIARSLTIVISGGFPPLLPPD
LPTWLFTDYIWQGSPFRMSFLWFVVIAALVAALLSLTNFGNWIRATGGFN
EAAASMGIPVKRVKIVCFMLCSVLAGFAGLLQVLRLGSPLPSIGEGLELQ
AVAAAVIGGTALAGGIGTVFGAIIGTLLIRTIDNGLVLSRVDANWFKFAI
GVLTMFAVIANAWMGKMSRKIKVEAHK
>SMa1759 putative GntR-family transcriptional regulator
MPDFATLEHENLNSAVYGALCDALMQGRFQPGDRLKIRDLAEQFGTSVTP
IRDAILRLANDEAITFRSPRDIRIPGLSETRYREIRAIRVRLEGLAAETA
AQVATSKDIEALARILRENELAIEAGDRLKGTQLNQAFHFMLPQIAGLPV
LNGILRRLWLQMGPHISDVYIEGGRAMIDHHYPVVEALKRHDSAAASMAI
VDDILLGGKPILARIERATERQARPA
>SMa1652 probable hydantoin racemase
MLIKLINPNTTSAMTELMGDTARTVAAAGSDIVLATSRSGAASIEGHYDE
ALSILGVIDEIARGKPADAYIIGCFGDPGLLAAREITASPVLGVAESAMH
AATFVATSFSIITTLERTRIISERLVRSYGMQNHCRSVRATDVPVLELER
SSSTADAAILAECEKALIEDRAQAIVLGCAGMSNLVERLQHRLGVPVIDG
VAAAVKFAEGIVGMGLRTSKVGDLAYPLPKAYAGKLAEYAPASLKLRNND
EPSALATAIGS
>SMa0633 hypothetical protein
MGTPVLDQVGFEMIVYVAEISGRGIAAFDAANDIEAQAQLANRGLLRDLI
VLQNEGRALWDGVADIHLRTATPEETEIWQTSRTAAVQSGEDSDDEGRHV
FLVPVVDPSHDNFDDDDNPHHDDDRDGD
>SMa1541 putative oxidoreductase
MDPVISTRPLLAVAVAGLAALAILFLNKREKLRDAVSPIAAVAMFAIVVS
MAPTVLAGGTVELRLFEVLPGIDVALRADALGMVFATVSSLLWIVAAVYS
IGYMRHLHEHAQTRFFACFATSLAAAVGGAFAANLFTLVIFYEVLSLVTY
PLVYHHEDEEGWRGSRKYLVYLMGASKSVLLAALALTYHIAGSLDFVRGG
LLTGANASATLLTVVYFCYLFGFAKAAVMPMHAWLPAAMVAPTPVSALLH
AVAVVKMGVFCVLRVIFHVFGVSTVGELGLGVATAYLVSFTILMASVYAL
TRDDLKARLAYSTVSQLSYIVLGAVLLSPLAMVGGIIHIAAHAFSKITLF
FCAGSIYCASGKRNISDMAGIGRRLPWTMGAFFVASLSMIGIPPTAGFVS
KWYLAEGSVEAGQMAFLAVLLASSVLNAAYFLPVSYTAFFEAETKESRAP
VREIPLVAIPLVATAILSVLMGIFPYYFVTLADGVIR
>SMa0763 hypothetical protein
MRAQPSPRVAAADRAWWEQGKKRCSYGRFLVLNVTDSRRWRSGAIGLLQG
RPALHRADHVRALGQLHLLFFYVRPENRLDAQQSKMSLQIAEFASNRQWK
SVVVTGRYQELPATQGCHHERIHAWSLREKKPNWWKPGGHKPVAKPSAHI
FVCVVMDEIAGRAASAAVAGVLRAELASRPQRARVTGPAKGPGCRVSDVA
VTPVSEARAATLPLS
>SMa1138 Truncated response regulator
MLTGRSDVVDRVVGLEVGADDYIAKPFHLREVLARVRGVLRRRQPRRSPE
VGDQQAEVYSFEGLRLDVGSRQLLSDDEREVPLTTGEFDMLCVLVKHAGR
VLQREFLMDLTRGRNLEAFDRSIDAQIARLRRKIERDHTRPALIKSVRGV
GYVFTARTTRQRS
>SMa0359 hypothetical protein
MTMGTIIRPLQRAEVELVWQIERREVVQEIYEVADGRLHLRPQFYDTREW
PDGEPEIYTPILFDCFDHDGVFLGALLEKNL
>SMa1754 Putative ABC Transporter, ATP-binding protein
MESSVAEFIQFDRITKFYGPLCVVENLVLGIGQGEFVSLLGPSGSGKTTL
LMMLAGFEQPTSGNILLDGTAINDVPTHKRDMGVVFQSYALFPHMSVGEN
VAFPLQMRGLGKAEIAECVARALDMVQLSAFADRRPSQLSGGQQQRVALS
RALVFEPRVVLMDEPLGALDKQLREQMQFDIRDLHRRLGLTIVFVTHDQS
EALTMSDRVAVFNRGKIEQIGTPRQVYDEPATRFVAEFIGETNLVEGVVE
TVQGQEAIVRLPSGAHIVSAGSGSLVSGQSVFLSIRPERVDLSETRGDAR
NCLETEVTDSVYQGDHLRVQLQSAAHPLIAKLGRRSREFPPGTKVYAAFS
ANDCRVIAP
>SMa2033 Hypothetical protein
MAFFAKEQLIAAAGSLAFCLLTCVAEAASCGKSAAGFQQWKVEFAETAKA
AGVRGKGLAALLGAKYAVGTIRADRSVNKAFSGSVDDFMRRRGGSAIISK
GRSLKTANAALFGNIERKYDVPAGVLLAIWGMETGFGASMGNQNTVSAIV
TLAYDCRRPQFFAPHAIAALKLVDSGVLSARSVGAMHGEIGQTGFLPGNV
LKYGVGSRNMRDTSTALMSTANFLKAHGWRAGGGYQGNMGAIAGWNSASV
YQKAIARIGEAIDGR
>SMa0175 conserved hypothetical protein
MERGDKEQENLSRLEQAAQQTADPKTMQQKATKGPEEKLRFSLRKKDVWW
TVGLAAAALLLFGIQVLINWRLDWLDVPLRLRVMNYVKGGLLIFIMLTVA
NVIEVFLIGRIPNRVSRFNLKRIFRLVVVVAIVFVAISVLFVNWYAAFVS
LGLISLILGFALQMPISSFIAWIYILARAPYRVGDRIRIGDAHGDVIDVS
YLDTTLWEFGGEYLWTDHPSGRIIKFPNSTVFDTPVFNYSWPLFPYVWNE
IKFQLAYESDLEFVARTMREVVEEQVGDIMSQKVKIYKHILSNTPVDELE
VKEHPVVHFRVSENTWLEAIVRYLVPPKEAGRTKTRLIKEMLARMNAEPD
RVLFPKSNLR
>SMa0352 conserved hypothetical protein
MSHISTADIEQVILPSSRDLGGFSVRRALPAPMRQMVGPFIFLDSFGPVR
FGEGEGIDTRPHPHIGLSTLTYLLEGELTHRDSERYVQAIRPGEVNVMVA
GAGIVHSERTPEHQRATGGKLAGLQSWIALPKKSEETAPLFQHLDAGSLP
TVSGEGIGMKLLAGNLHGRQSPATVFSDLFAAEVHLEAGARYRIDGEHVE
RAIFVVAGALEIVGQDGNFGQDRLLVFKPGSEIVVKATGPARFLAFGGEP
LPEKRFIRWNFVATDQERIRHAADLWRERGFPGVPDDDEFIPLPENFR
>SMa2133 hypothetical protein
MRLPPIDLSRENTFGEISYESGGQYGPIRGDYLAFIIVHLGTLKIEADET
VLQLEAGHCALSLTRDSYLTTIAPETLTHISWCDGRPLNAAWLYKERHKY
APFINASTQVQTLMQLGVDLGIGDRDGRGGLREALTNTLLNAYIYEAEVV
AEERPIPQTILQARRFLDENFAADVSIERVAQQVRVSPQYLVSAFKKHLG
MTPARYLWRCRLDYGAHLLQRSGLSVSEIAYQVGYKNPYHFSRQIKLAFD
CSPTELREKMLNG
>SMa1604 putative oxidoreductase
MEIPMNPMFTSVRVGRYMLPNRLVMAPMTRSRAAFDGTPGELATEYYVQR
ARLGLIVTEGTQPSDDGQGYLTTPGIYTPAHIAGWRKITSAVHDKGGHIF
IQLMHAGRMSHPDNTPHHRQGVAPSAIAPGAGMFTATGMQDIPTPRALTT
EEVRRTVAEFSHAARSAIEAGADGIEIHGANAYLVQQFFAPSANTRTDEY
GGSIENRARFAIEVATAIAEEVGADRTAIRLSPGTALWGIDEGAEGPDLY
RYLVAELDKLGLAYVHILHQSNEPLLADIRKLWRQPLILNRPGRPRGQIG
ADVASGLADLEAFGQMVLANPDFVARLKADAAMNESDPKTFYGGAAKGYI
DYPVLSARTDTSS
>SMa0307 putative LysR-type regulator
MSRSAIKSISIMQVIDHFNLRSFDLNLLVAFDAMMEEMSVTRAAQRLKIQ
QPAMSHNISTLRTLFQDELFIRVGQVMKPTARALNLSGPVRQALRQAQAA
VLMADVFDPATEQRTFRLGLSSEVELLLLPDLTARLRDIAPGIRILARGG
DAAEVDAMLDAGVIDLSVGCSYLPDSRHHCEPLYQSSVLCCFNPQLLEIS
NPVSLDAYMAAQHAVISQTDSLHGCVKDALEHAGAELEVVAAAPDFMSIL
ATARSSAVIATVSSRIAARYGPLLGLQVSPVPLALSFPPVAMVWPLHMDS
DLGCAWLRQQIREAMLRTTETNVADIAA
>SMa1966 Putative LysR-family transcriptional regulator
MPFGLAIRGGVWELTRNLVSTRQGEGLKHFIDAKIEDKAVPKKLTYSLGV
KMGRRFDHLGDVEAFITVAEKGSMTEGAVTLSTTPSVLSRAITRLEARLG
AQLMRRTTRRLRLTDEGRAYLEQARAAFSMIDDAERAIQGPTGASLTGHV
RISVPTTYGHYRLPAMLDRFTQIHPEVQVELSITNRNVDLVAEGYDLAIR
LGPFPDSGLVARKLEDAPLRLVASPHYLERAGVPRSVEDLATHQCLPFVM
PSTGRCAPWLFRVEGRDVDWTPPGRIRVFDDVLGVVSLAENGLGICQTYD
FIVRGRIEQGRLVDVLEHARGRSRPFSLIFAPHRRLSAATRTLIDFLASD
GLGTCLALSGPGRRTSVKV
>SMa1647 putative ABC transporter, ATP-binding protein
MQNEPLLSVSNLTVDLLTAKSAVRPVDEVSYSVRQGECLAIVGESGSGKT
VMNFAPLGLMPSGVATQLSGSVRFQGQELIGLPEREIRKYRGKSIGFIFQ
DPMSALNPVRRIGRQIAEMAELHLNMSPRAAEERALDLVKLVGISDPAAR
LSQFPHELSGGLRQRIVIAIALAGEPKLLIADEPTTALDVTVQAQILRLL
KDLQHRLNMAMVLITHDMGVVAGAADNVLVMYAARGAERGPVDKVLVNPR
HPYTKGLINAIPRREDPVGSEFRGLPGVPPTLGAPINGCAFAPRCEFAVA
ECARSRPPMTATADSSVFVACPIVNQGKAAA
>SMa1038 Putative copper-containing oxidase
MFNRRQLLGASAALVSTAAWAKTSNMGLPDAAVMETAETQGPLKPTSGPD
YNPVVTLNGWTLPHRMNNGVKEFHLVAEPVEREMAEDMTAYLWGYNGQSP
GPTIEAVEGDRVRIFVTNKLPEHTTIHWHGMILPSGMDGVGGLSQPHIPV
GKTFVYEFDLVKSGTFMYHPHSDEMVQMAMGMMGFFVIHPKDPKFMPVDR
DFVFLLNAYDIDPGSYVPRIMEMTDFNMWCWNSRIFPDISPLVVSKNDRV
RVRVGNLTMTNHPIHMHGYDFEVTCTDGGWVRPEARWPEVSIDIPVGAMR
AYEFDAKYAGDWAIHCHKSHHTMNAMGHDIPTFIGVDKSKVAEKIKKLRP
EYMPMGTKGMADMGEMEMEIPENTIPMMTGWGPHGPIEMGGMFSVVKVRE
GISAGDYADPGWYENPPGTQAWEWTGELPDWTKAKDAKTQITPKHKNHG
>SMa1592 putative oxidoreductase
MHSLRTGRDDRGPLLVVLGPKFNTGQDGNVAARFVDLEEWARKNLPVGDV
AWRWCNEDYDTADRVPFVGEPDPDNAPGFHIATGFNAWGISNGTAAGMMI
ADTIQGRSSPWQRLYNPTRTYPKDYHQNGESQSIVGRADDIPPGEGGVIL
RGDDKIAVCRDIEGSLHAVSATCTHKGCTVTWNNADGTWVGHSWSGAQAA
RPREAMKQPQRAGERIRTTSCGPFCLGACGLETSGVC
>SMa1134 hypothetical protein
MAKKLRTAVVLLGIALLPMSAHAHCDAADGPVATAAVRALDTGQVNLILS
FAPAGAEPELRAVFDQALNVRKRGPDAKALADRYFMEIAVRLHRAGEGAP
YTGLKPAGTDFGPAIPAAEEALETGKPDAVTALMTEQVGHGIAQKYRETT
ALRSASNEPTTQAEVAKARDRVSAELAFIGYVEGIYLAAKGGMHVEAAST
QEHHHGTE
>SMa2137 probable glycerate
MRRDRIVKPKVIVTRRWPTEVEDRLTAEFDTRLNETDQPYDRRELRAALE
EADAVLPTVTDKISADMLEGGIRAKILGNFGVGFNHIDTAAATKVGLVVT
NTPGVLTDATADLAMTLLLMCARRAGEGERELRAGKWTGWRPTHLCGSHV
TGKTVGIIGMGRIGQAVARRCHFGFGMDVVFFDSHSIAGLDVPARQLPSV
DDVLATADFVSLHCPGGGENYHLIDDDRLACMKWSAFLINTARGDVVDEH
ALVRALETRRIAGAGLDVFEGEPRVPGRLAERQDVVLLPHLGSATKETRV
AMGMRVIENLKAFFSGRSPPDAVC
>SMa0281 putative regulator, MerR family
MVQSQDLQELTIGKLAAAGGVGVETIRFYQRKGLLATPKRLEGVRRYGGE
DVRRLRFIKQAQAAGFTLEEIGQLLALDAGHNRSAARELAKKKLEQLDAR
IGELNRAREALRKLVSECAEDKTGPCPILASFGV
>SMa0997 putative fragment of transposase protein
MTDDMMNLRALVEKSPDADLLREMIGFAAERLMELEVGAATGDGYGEKNP
LRTAQRNGYRERDWETRAGTVELRIPKLEARALTFRASWSRGAWPRRR
>SMa0972 conserved hypothetical protein
MEGKALRQGSIAGKGAPRNTASLIQYAQDHFDAAHFQGLMAIYGRPLADR
AVALTHDRGRRVVDFVRCSYLGLDNHPQIVAGAVEAMKEYGTLHWSCART
RLNFSILGDLEAALSELFDARVITYTTVLAANMGALPLIASGHLTGGVKP
LMVFDRLAHATLAFHKGTIAAETRVETIAHNDLEALEMLCRTNGSVAFVC
DGVYSMGGSADLARLRRLQERYGLFLYIDDAHGVSIFGKHGEGFARSQMS
GPLGERTIVAASLGKGFGASGGLIMLGTARQEELFRRFAVAHAFSASLNV
AAIGAARASQQLHLTEELTELQQRLRSRTALFDSLVPTEQQGSPLPIRTV
EIGDEMTAIGAARALLDRGFYTSAIFFPTVARGRAGLRLCPTAGHSEDDI
RGLGIAIQDVLKEISGR
>SMa2273 hypothetical protein
MFVWTLEYSEDAERDFELIFDHLFDAYVELGDSPDEAVERTAERIRKLRV
EIDRLVDTPYIGTLRPDIHSGIRFLRRDKAAIWFLRAEHSRTIIVAAIFY
GAQDHIRNMLARMLAG
>SMa1365 Putative ATP-binding protein
MTTGRPSDRTTDGLQAQILELLRDLQREPGMALIMITHDLKVAAAMADDR
IIVMNGGKVVESGKAEDVFTNPSHAYTRRLMSAVPHADAPKAPRNAAQGE
VLLACVQCWICPNRTRRRFPHDEA
>SMa0575 hypothetical protein
MLAVERRSAMGSFEPLIAPIADDHLRAKGAVRHEPCARLVARMFTFWAKK
IAYCIAWNSGLLVELRRVVGAVPKAPTTLITEPPPLSHRSTLIGQEPDPA
TRFA
>SMa1945 Hypothetical protein
MSARILCQQMFVWHTARMNKNAADIPMTPFLPALAIAGTVITWSFSFAAI
GYALREVEPLPLAAIRFALAAVFAIAWIAWRRPRWFLPRDFVVLAISGLL
GIAAYNVLLNLGQAAVSAGAAGFIVNTQPLFMVLLAVLFLKERFGRWNWV
GTIVGFSGVALIASGQPGGLSFGTGSTLIVLAAACAAAYSILQRPLFARA
EPLDVTGARHRCRRSRPHALATGRRLPIDARASGHLADDHVPGRRSGHYR
SKLLDLRTQEFRCRAGRSISLFGSTVLCWTGVAPAR
>SMa0565 conserved hypothetical protein
MTPTGIQTTSPELISRFSPPSCCTHPLPEVTISIWPAGCVCQAVRAPGAK
VTWPPVPCVNSLAGNSEATTTCPLNSLASPSADGADAFGVISIVCARAIA
VDRMTSDAAAISFFISFPFWSLIVHDGQTPRDCRSLRLSAAAVSSARIV
>SMa1067 Putative transcriptional regulator
MPWLRFCDRDRSVHMLVAMLDLTREKAEQVMRRGPWLASMTEPFRTELLR
HAHLQKFAPDQVVYRHGDAVGGMYGLVAGSLTINSAPPDAASRLIHLGMP
GAWTGEGPFLTGQPRRVELRALGGAWMMHVPLDALEQMMARDPGVSRAIA
MNTVFTVDVLLRIIHDLQKRAVGRRIASVLQRASWVGDMPIPLSQTDLGI
MANASRQQVNTAMQRFARAGWVSYTYRSVTVSNSQALRRFSEGDGSEW
>SMa0848 hypothetical protein
MTVMRKRDFSCLAECHYKNVRYNRRAEVGSEINSDRLLCNESMICDQGLR
CVEAVFFSKFQAGPLWRDHFVACSLALKYDVSRQFLESILDQRVFDIHQA
RPARPGGRHSQVLLINGKSRRFPFVGGPGKSLARTPIL
>SMa1591 putative adenylate cyclase
MRSFLLGGVRPRDFWSMLKNLALRVDGEGAMDIGAWLRDQGLGQYEGTFR
QNDIDPEVLRHLTAEDLIGVGVASVGHRRKLLAAIAALREVAEQPSGAAG
FGATPVINPEAERRQLTVMFVDLVESTRLSSRLDPEEMGELLRGYQRAVA
GAIARFEGHVAKYMGDGVLAYFGYPRAHEDEAERAVRAGLAAIDAVRKLQ
PPHGETLEARVGIATGLVVVGELIGEGAAREETVIGETPNLAARLQSVAE
PGAVVVASATRQLIGGLFDLAELGFHPLKGFAASIPAWRVLGESSAESRF
EAFHGASLTPLVGREHDIGLLLKRWEAVKKGKGRVVLLAGEPGIGKSRLV
RALRRRLEGEPHTTVSHYCSPYHQTSPLYPVIRLLERAAGFAAEDPPEVK
LSKLEALLTQSIEEVADAAPLLAALLSVPADDRYQPLELSPHRQKKRMLE
VLVDQLIGLAARQPILAVYEDVHWADPTSLELLDLVVDRVQDSPVLVLIT
FRTEFLPPWTRYPHVTVLTLSRLSRRQGAEMVDRLTGRRALPTEVLDQIV
AKTDGVPLFVEELTRAILETSLLKAEGDHYALARPLSAISIPATLHESLL
ARLDRLAPAREVMQVAAAIGREFSHELLVTAAPLQASEVEEALEDLIASG
LVFRHGTPPQLTYSFKHALVRDAAYATLVRTKRQRLHAAIATAIEQRFPE
MVQTQPELLAQHYAEAGRLEPAVNYWLRAGQAEIARSATTEAISHLTRGL
ELLEGLPDDAARLRKELELQVALSVALMTAKGWAAPEVGRANARARNLCE
RLGDKSRLFPVLYGDWVFHVVRAELEAGRKAGEELLRRAQEEREVSAEIV
GNRIAGTGAFLRGEIANAREYLERSLALYDPQQHRALAFLFAQDPRVAGL
SVLSLTLFALGYPEQAQARSNEALADARELSHSNTLGFALLYGCILSQLR
GDWREARDRAGSLITLARAQGSPHFLGAGKILQGWTLGQTGELPAASTKV
QEGLASWQMTGARFLVPYFLSLLARVETQSAGAKRALDLLTDALQRARET
GERWFEAELHRLTGELMLQLPAFDRAEGEARLQHAVELARGQGADLWELR
AATSLARLRIGQNRFGDVHHLLAPLCGKFAEGFATTDLQSAQRLLREAAG
FDGAIKSSD
>SMa0060 putative regucalcin
MSATVSLLLDAKDIVGESILWCGDEKALYWVDIVGKRIHRLEPENGRHDT
WPTPDFVTSIGMRKDGGFIVGLSRNVCLWTPDGPFEEFAMPEPDLPENRL
NEGRVAPDGSFWVATMQSNLDAGGSPKDMDRQSGAVYRIDPTGHVSQLTP
NEYGITNTMGWTRDNRFFFADTLANEIYMFDCDLAARRIDNRRTIVAGFA
RGLPDGSCLDADDRLWNCRVAGGAAVAGFDGAGRLMHLIELPASWPTSCT
FGGPVLSTLYVTSARFTMTGDHLDMHPLEGGLFAVEGVGHGVEEPKFGQA
PDKFPSVSEIA
>SMa0592 conserved hypothetical protein
MALRSANILFKDETAGTLVETANGGTRFAYHSDWNEGNIACCFPSTQREH
EWKVGLHPFFQHLGPEGWLREQQARSAHIVEEDDLGLLLRYGADCIGAVS
IRPPDDAAQLPEITEATVSPGRTVSGVQKKLLVTKDDENRFVPASATGSA
LYIAKFNSDRIDNLVRNELLSLRWTAAVLGEREVTGFTASLTAVVDETAL
IVTRFDRRPNGEKLRLEDCAQILSKPKGQDYAGKYDAAYEDIAAIIRQHS
SRAPIDLLRFFNRLIVFTLIGNCDAHLKNFSLLETPTGLRLSPAYDVVNT
AFYDGFDQTLALSIGGEKIHLEAANQAIFRAFGKEIGLPDRAIDQTFKQL
KRQVEKAASIIRPPDAEPADGFVHRFKEIVDNSCLRILET
>SMa1680 hypothetical protein
MKPPRVSGRSLAEILGRAFWLSARDLPGWWMRAVRSCVCNSRAAVLWTVV
FCSLAALTGSVAASENSKTSMADPLIAVHDGFVDEKTCSSCHADEAAVFA
KSHHAKAMTVADDKSVLGNFNNIQFDRDGVAASFFRRDDRFFVRTEGSDG
KQADYEVKYTFAYEPLQQYLVDLGGGRLQALDIAWDTQKREWFWLGEGSA
AKPGSTFHWTGPFYRWNRTCIDCHSTDPRTNFKPQSNEYNSSYVATSIGC
QSCHGGGAKHVDWARTKAANASTAAADPGLAKVDSNTCFACHARRTRLVD
RYQPGGHFLDQFSPALLRSDLYFPDGQILDEVFEYGSFQQSKMAMAGVTC
FDCHRPHEGTVKAEGNGLCTQCHAETAPERFAGNDPSGAFDTQAHTHHPQ
GSPGALCANCHMPERTYMKVDPRRDHSFVTPRPDLSALYGTPNACISCHT
GQTNAWASEHLDRWYGKAWRERPTIAHAFARAAQNDVAAIENLRRFVTDR
EQPGIVRGSAIGEMTRLDGAATAADVRVAAGDPDPIVRLGAAEAAANLSA
DRRLDAIGFLLADETRAVRVAAARVLGATPSLDLLGARRGAFDAALDDLG
AYAEANADVAETQSTYGSILFGQGRTDEAEKALRQAIILDPTLSGAHINL
AEFYRASGDNEKSEQAYAAAIAANPDRADLRYGHGLSLVRLKALPDAIEE
LTAAMRLDPGNSHYRTTAAIALDSMGRTDDAFALFGPTIAGGATEANLLG
TAIQLGLKLGRYAETLKFAEALARLQPNDPQLEELVGQLQDAVQHGR
>SMa2189 putative integrase/recombinase
MDANSRNRWFLDPGPLSSWIDQFADDLAAQRYTPLTIEGYTASARHFAAW
LGCAGISIDLIDDDVVRRFAEHRCRCPGRRQWLRISPKYSRRARRFVVFL
QKEGVARPPLKVASPYPLLDDYQSWLRVHRGLAERTIARHLRHLHKLLPE
LGTPTLDYDAALIRNVVREWRERTGPADLRTITSALRSYLRFLAGVGLCR
PNLDHAIPPVLQWRLSSLPRYLAAADVERVIASCDQLTRGRLRDRAILLL
LARLGLRAGDVAGLRLSDIEWTSGMLRLSGKARRQVRLPLPQDVGDALLA
YIEQERPRMHQEAVFLTMIAPYRSFAQSSHVSTIVALALKRAGISDPPST
GACLLRHSAATSMLRSGATLEAVGTVLRHRSLDMTAHYAKVDAAMLEQVA
QPWPGELPC
>SMa1431 Hypothetical protein
MRVFGMQHIGFTVPHLEEAVRFFEAAFGAVTCLETGRIEADDAFMQRRLG
VPAGCRIENIKVLRIGNGTNLELFQYSGEEDGEEPLKRNSQSGGFHLAFE
VDDCHSAADRLRQAGVDVLEGPTFVDAGAMQGLTWLYLRAPWGQFLELVS
MNGALGYERAGGPKQWSPVTGE
>SMa1155 Cation transport P-type ATPase, hypothetical
MNSATFVDTEPHVQDAAMDDDLSTGLGQKEAEVRLTQFGPNVLPEPQASS
LFATFLRQFRSPLIYILLAATLVSLALGDVRDALFIGIVLVANGTIGCMQ
EHSAGKAALALRKLEQPKANVARDGHVQEIDARLLVPGDLVLIEAGGRVP
ADLRLLSATDLVCDESLLTGESAPVHKSLTAVDTTPEVNARLMAFAGTLV
TRGRGRGSISATGAATEIGKIAAEIGKASVSKPPLMIRMERFSQFIAWVV
AAALVLLILVGIARSMSPSDLFMMSVGLAVSAIPEGLPIAISVALAISMR
RMAKAHVIVRRMPAVEALGSCTMIATDKTGTLTLNELTVTDIRLPDGTDI
VCDTGFDLDACTIRGDGTPPEEARERALALLMAASLPNEGSLTRQDNGWT
AVGDTVDVALLAAAYKGGLPRDVIEDDYPLVARIPYEPDLKYAASFHRHG
DSIRIFVKGAAETLIDMADRMDMDGRAEPIDREALLRQKEEMAARGLRVL
AFAEGETAVESDGGFGRHLLVDLVFLGLAGMQDPVRPEVPQAIRDCHSAG
LDVAMVTGDDPKTAAAIASQAGLIFTEDQVVTGEAVRRAEENGQESLDTL
TRHGRIYARVAPSQKLALVLSLARNGHFVAVTGDGVNDAPALKHAHIGVA
MGRKGTEVAKESADIIITDDNFASIVSGIREGRVAYANIRKVIFMLMSTG
AAELLLFLLAIPLGLPMPLLPVQLLWLNLVTNGIQDIALAGESPEGDELS
RAPRRPSEPIFDRLMIRRIWQSTLVMGAGGFAMFYVLLEQGYGESEARNL
LLLLFVLFENFQTLASRSERKSVLQLGFLANPLLLLSIAAAQGLHIAAMY
TPILSETLQVSPISFSEWALLLVAASSALLVVEIDKWRARHTARGRSRGQ
>SMa1120 ABC transporter, ATP binding protein, hypothetical
MAIDATEVHTELVLPPHEVSAGARASQRQISNDLLKFTFLRSRGASRNRP
ELRESTARRRSEELVVENEVSAIALEARELTKIYRMGAVEVHALRGVEVD
FREGELIVLLGPSGSGKSTLLNILGGLDTPTSGTVMFRGQPLAMDSERAL
NLYRRNHVGFVFQFYNLIPSLTARENVSLVTEIARNPMSPVEALSIVGLG
DRLDHFPAQLSGGEQQRVAIARAIAKRPDLLLCDEPTGALDSKTGILVLE
AILKINRELGTTTALITHNAVIAEIANRVLYFADGRIVETRQNGVQRAPG
ELAW
>SMa0074 putative
MTELSGKTILITGALGTLGRAQAERLGRAGAGLLLLLDRPGAEAGEGFAA
SLAAAHETMAIYVGEDLNNLASAEKRAATLSSEHGGIDILINNAALIINK
PFEEFSLEEYEDQVRVNSSAAFALARAVTPGMKQKRYGKIVNFCSLTLNG
RWDGYVPYVASKGAMLGLTKALARELGPHGVRVNAVSPGAVVSEAEERVF
ADRLQQYNDWIVENQSLKARIQPSDVADLVHFLVSPASDMISGQNIAIDG
GW
>SMa1987 Putative LysR-family transcriptional regulator
METGSLSAAARRMGLSQPTVRARIEGLEAALGTALFTRSVHGLVPTPTAE
AMATPARAMAHASEAMLRAASADSATAAGRVRLSVSEFVGIEVLPPMLRS
LRDKHPQLTVEFELSNARADLLDQQVDVAVRMHPPEQSALVAKKVPLIPL
GLFAHRDYLAVYGRPATKAEVRDHLFIGPDRNRGDLAVAAQIAGSVPVSW
IARTDSHPAQLALARAGLGIAVAQIPAAARYPELERVLPDVELPQLPTWI
VTHENLRRLPRVAALFDHLVEAFESYGRR
>SMa0464 putative adenylate cyclase
MAVIRRPSRSLTPTEEIALARPEPPAEDVRKTITILIADIVDSSRLSLTL
DPEALRELFARYFDEMTSTIQRHGGVVDRYVGDEILAVFGVPTLHEDDAL
RAVSAAVDMRDTLARLNHEFETGWGVQLAHRIGLNTGEVFTGIDRWGHRF
LTGEAVRVAKRLQEAAAANEILMGEATHKLVRHAVVVESSSPRAVKHGET
FPAIIVLTVIARTTGFQRRFDTPFVGRKRQLAMISTLLGDFVSNRTCHLL
TVLGEAGVGKSRLVSEVAGNLAREMTVAHGRCLPYGDGITYWLLADIVRE
IFRAGGGDSGKLSVAAIAEVLAGVDKAKLIAERIAGLLGFGAGDPGTREE
TFWAVRRLFEVFARERPVVIVVEDLHWADPTLLDFIEHLVDFSHGFPIMI
VATARPELLDTRPGWGGGTPNATTIALEPLSEAESRDMVLNLLHRLPLSP
AVELMITRAVGGNPLFAEELVAMLVDEELLRRNEDCWVAREDLSELPVPS
TIIALLAARLEGLTSQERAILTAAAVEGAVFHRSAVDELARPAPKALGDG
LLSLVRRDLIRPEAPSFVGEETYRFRHDMIREAAYRSLPKNARADLHERF
ASWLEITAKERLREFEEVVGYHLEQAFQYRIALGPRGARAASLAARACER
LEAAGRRALVRSDLSAAISLLERVSRLLLADDPRRIALLAELSGALIESG
RLDDAGRVLEEAGGLAGAAMDRRLAAHVRVQRQFLRLLHGEEGGLEKAAQ
AAAEVIPVFEGFGDDLGLCRARRLEAWLFFNGARGEAAAAAWERAAAHAR
RAGNLHEYYEILTWIASSLWFGPTPAAEGIRRCEAMRAEVGESLESEAAI
LRQLACLNAVVGRFAIARELIAASNATYADLGLTLYVASSEHEAVVELLA
GNPAAAERSARAAYRALEEMGERAFRSTMAASLAVVILEQGRDEEAEDFA
KLSAQLAASGDLVTQVRWRRVRARVLARRAEISAAEALAREALEIAEKTD
FINDRADALVDLSHVLEASRRRDEAVAAATGAVHLYELKGNVVAAAATRL
RLGKLVAM
>SMa2315 putative aminoglycoside adenylyltransferase
MRADRQHIDQALAATETIRSILREAVLAVYLHGSAVSGGLRPQSDVDLLA
IVDCPVADEQRRDLLAALLRISGRHPRAVGTPRCIELMVFLRADIATPKF
PVRAEFIYGEWLREAFESEELPVPISDPENTLVLAQARQEAVPLFGPDAK
ELLPSIPPEQVRRAMRDALPLLIDSLQGD
>SMa0499 hypothetical protein
MHKTISREHILEDLPEIAEIQSDDLREKVVDAWVFALERSSFDRVVDIPG
EGSPNVFALKRGTQDAHLRGVTRLALAIYDEFARTYPEARVDRDIILAGG
LCHDIGKTWEFDPINLKRWRERGDRYGEPSFRHSAYGTHVCLSVGLPEEI
GHICMGHSLEGAHIGHSTECYIIRQADHAWWHVAAALDLCHPETIGFAGP
NLRVRPIGMQ
>SMa1283 Probable NnrU protein
MAEFLLALFVFLTLHSIPAIPAIRERLLFLLGRAGYFSLYSFASILALAW
VLYAALDVDHIPLWQPSAWQAWLTMIAAPVGVFLVLAGLFSVNPLSVSIR
QGQKPGSIVSVTRHPVLWGFAIWALGHLVANGDVRSLILFGGFALFALGT
IPMIEKRARRRLGDQWQRQSAKTSILPFAALFTGRTRLSGDTPIATATVA
TAVLTLWLLAGGHAALFYADPVLLATAQ
>SMa0475 TRm17a putative transposase
MRQERTVQGSIFDLFAEHEIGRELEAMSQWLDAHRDLLNLVTSDLRRQGV
TETGRQGLPSEAVLRCALLKQYRQLSYEELAFHLEDSASFRAFARLPWGW
SPKKSVLHKTISAIRADTWEAVNKMLLASARQERLESGRVVRVDSTVTAA
LIHEPSDSSLLWDCVRVMVRLLQQADSLGSTIPWHDHCRAAKKRARVIEY
TRGRPKRVQHYRALLRIARNTLDYLQQAAAQLPLAAGPAGKLWQAQVRHY
QPLITQIIAQTERRVLAGEAVPAGEKLVSLFEPHADIIVKGSRERRLRT
>SMa1644 hypothetical protein
MPQAIVDIQRDIYLALAEHIKTIANGGGWTDFLTFLPMGIIFGAVHAMTP
GHSKAVLATYLTGASAGMRRGLVVSLALSATHVTMAVVIALFSLPLVSLM
LGSAGSAPLLEDVSRGLLGLIGAWMLWSVCFRPPHVHGEGEGVAVGFMAG
LIPCPLTLFVMTFAISRGVPGAGIMFALVMMTGVAITLSSVALVTVFFRT
RMEKLLATRHALLVKISKFVEAFTGLILVVIAMREIFIR
>SMa0417 conserved hypothetical protein
MKPELVAEIEFAGWTADGLVRQAAFKGLREDKPAKEVQAEKPSPPAKTDT
PEPGPSAKTRPVRRKGAKAEVMGVLISSPDKPLWPDAGDGEPVTKEDLAR
YHEAVGTLFDLDPGPDVPFATVVAAAREMRDRLDELGLVSFCKTTGGKGL
HVVTPFAVNKRKPLSWAEAKGFAHDVCEQMARDNPDLYLIKMSKSLRGGR
IFLDYLRNDRMATAVAPLSPRARPGATVSMPLNWTQVKSDLYPKRFTVRT
VPALLAKTTAWQDYCDGERPLEQAIKRLGKSRRAA
>SMa0285 conserved hypothetical protein
MAIEKGHGETSHIVWMKMGGLKDWELKPNILEGDWTFVTKNSVDFQGPNT
SPAPRDNIPKRCSSR
>SMa2004 putative ROK-family transcriptional regulator
MRFAPPNPLRIADRASGLNSLSVRSYNERLVLSLLLQNEATTRLEIGEKT
GLSAQTISVIVRSLEQEGLVVRGEAQKGRVGPPTTPLMLNSEGAYSVGVS
VGYHNTHIVVVDFIGNVQHHRVMPHKAADTFDAPEALALEIRLAVAGLSE
SRQARIAGVGLALPTPPIRSEPDHDVLHRYLEAEFGLPVFVQNDITAAAG
GESLFGTARQLQDFLFFYLGARLHCRLVLNHQIYNGNSPLSYDVGVLALE
RRLPADSAMAEKLWGQDLSWPPLEATLADWQQACAENLIQLTQSLLQFIE
LRTVVLSSFVPAAICEALCRLVRQEIPSVNAVVGRTSAAPKAVGAASLPF
SSRFMVN
>SMa1729 putative periplasmic binding protein
MFGKLLAPACFAVGLTSLTQGAFAEECGDVVIANMNWQSIDVLANIDKII
LENGYGCSAEITIGDTVSIMTSMIEKGEPDIAPEAWLNALPEIVSRGIAE
GKLISAGNALSDGGIQGWYIPKYIADAHPEIKTVSDALKHPELFPAPEDS
SKGAVVNGPSGWGASTTTSQLYKAYDGDAKGFVLVDPGSAAGLDGSIVKN
YERKIGWFGYYWSPTSLLGKYPMVKLDFDAPHNAAEWKRCNTVVDCEDPT
VNDWPTDKVETLVTREFKERAGAAMDYLKTRRWDNATLNSLLAWMTDNQA
TGEDAARYFLEKNPEIWTKWLSPEIAEKVRSSL
>SMa0085 putative
MEFRNVKPDLLLVEPMMPFVMDELQRNYSVHRLYQAADRPALEAALPSIR
AVATGGGAGLSNEWMEKLPSLGIIAINGVGTDKVDLARARRRNIDVTTTP
GVLADDVADLGIALMLAVLRRVGDGDRLVREGRWAAGEQLPLGHSPKGKR
IGVLGLGQIGRALASRAEAFGMSVRYWNRSTLSGVDWIAHQSPVDLARDS
DVLAVCVAASAATQNIVDASLLQALGPEGIVVNVARGNVVDEDALIEALK
SGTIAGAGLDVFVNEPAIRSEFHTTPNTVLMPHQGSATVETRMAMGKLVL
ANLAAHFAGEKAPNTVN
>SMa1063 hypothetical protein
MSALPAAADPILVNSDNFVRAESDLYFSGIVANGGFGKFDHTREMAPLDK
HGYSPEPGHSLFFCRVRS
>SMa0105 putative ABC transporter permease protein
MLRFTLRRVLQIIPTVVVVALLIFVIFSVVPGTFAASLFADGRRAADPQM
IARLNEEFGLNKPLMERFVTYVTDLAQFDLGTSFRTRQPVIDLINDRMWA
SLQLAIAAMIFALVISVPLGFVAALRPGSVLDTVTMIGAVSGLSMPQFWL
GLLMMYLFALQLNWLPSFGYGDGSFRNLILPAVTLGVTPLALLARTTRAG
VLDVLNADFIRTAHSKGMSEAKVVRWHVARNALVLIVTTVGLQFGSLIGQ
AVVIEKLFAWPGIGSLLVDSVASRDIPVVQGTILVIVLWFLVINTAVDLI
YAAIDPRIKQE
>SMa1803 TRm2011-2a transposase
MARPFSNDLRERVVDAVTGEGLSCRAAAKRFGIGISTAIDWVRRFRETGS
AAPGQMGGHKPRKLSGPHRAWLLCRCRERDFTLHGLVAELSERGLKVDYR
AVWTFVHEEGLSYKKRRWSPANGSGPTSPATGHDG
>SMa1096 Conserved hypothetical protein
MLTIKSLAIAGLLGRTMLENFEISLPVLAMTGGIILFLVALRTVLHQSSS
LPDQTTEPGQPSDLRLALTPLAFPTILAVIVFATLAGGRQAEGRTVAAIV
LLILAMDWAAMIFAESILRWIGTSLQVLAVVLGVTQAALGLQIILHSLST
VSW
>SMa0707 dihydrodipicolinate synthase, putative
MVHNIEIKSHNMDRLMRLTGILPVLPTPFTETGVDLDAMRRIVRFALDAG
VAGVVFPGFASEVEALTGEERQALLKVVATEVGDRVPIVAGASAPTVDEV
VAYGRQALGLGITRLMIQPPKSIGAGAGAVTAFLAAAAAQLPEVEIILQN
APAPRGSDLAPEAVLEVVRRVAAVRYVKEETLPAGPAITAILANAPEHML
GVIGGGGARYILDEYDRGACAAMPALEIADLHAALDRSYRDGRGSEARAL
YVKSLPLLVLQAVYRMRLTKHVLGLRGVLDNPIVRAPTPDLDALAVRDIE
RCFQECGLGSVGADKWRRTK
>SMa1467 Probable inner-membrane permease
MEIVRFIIDNLDIIGTRTIEHISIVFLAVGIAIATAVPIGVAITQSKSTA
DTVLYLASMMITVPSIALFGLMIPLLSPIGHGIGYVPAVVAVILYSQLPI
IRNTYTAITNVDPALREAAKGMGMSTWQRLRQVEIPLAIPVIMAGVRTAV
VMNIGVTAIAAYIGAGGLGTFISRGISQSDPRQLVTGALAVSLLAIAADL
FLALVQRLLTSRGIQGEVTP
>SMa1285 Probable decarboxylase
MSKQRVVVGVSGASGAALALRVVERLAEIPSVETHLIVSDSGRRTLLHEV
GPHALHQLLTIADRSYQVRDIGAATASGSFGTSGMIVVPCSMRTLAAIAA
GLADNLIVRAADVHLKERRRLVLMTRETPLHLGHLRNMTAVTEMGAVVMP
PVPAFYHRPQSVEEIVDHLAARAIDLVGLLGGPLATEWDPQSHRQHAATR
>SMa0951 AttB-like ABC transporter, permease protein
MSIDHAQLPGRVPALMDIFASPTIRAYSRAMGIIAIGVGIGCILFFLLLP
TLIVLPMSLSETDYIEFPPQGLTLKWYSAYFNDPDWMTATWFSLKIALAT
TATATVVGTMAALAIVRGSLPFRSTLQALALGPMIVPHIILGVALYLSFT
PLQLTGSFYGFLVAHTVLAVPYVIITVTASLQRFDPTLELAALNCGANRL
QAFFLVVLPNIVPGVAAAAVFAFLASFDEATVAFFISDIGGKSIGRKMFE
DIDFNLTPVIAAVSTVLVATSLLLMGTLHLMNRKSRS
>SMa1688 putative two-component response regulator
MLCPSAPERRQLESPARIVAVVDDDPSMRRSVERLLKVNGFVAEGHSSAE
AFLNSADVSQIGCVVLDIHLGGMSGIALWHRLRDTGTNLSIIFITAVEDE
ALEREALKAGCVAYLHKPFPADLLIGAVNRALEGPPGN
>SMa1799 TRm2011-2b transposase
MAPLRGWAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGE
RFRIYVQQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKY
SPDLNPIEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFK
EAGYERA
>SMa1706 hypothetical protein
MSLAAASGHCFLKADCRLFTTRYSLAFMLNGNLCCVQHTEASMRAQISSA
AENEAKVALTRNKLVLDQARAVGLLGTAKNTRLSGRVPSELIEAAKKRAH
VTSDTELLELALARLALEDDFGARLVGRKGSIPADVDLGL
>SMa0312 hypothetical protein
MAFGICAAPVAAQDSSCPREGADAPLALHADWIMKGWERHEGDGKFEFSQ
KLNRYYDLENTKGVFYDNFAPGSTQLFDNSARYGANWEALQNAARSVRHG
LTGGHDVIVSDTVASTTLGFVGRIDRLDGAVVAFDGRSQLGWTCTGGEWK
IRHELNYAWVVEPETIQSFLGKTEPAQ
>SMa2045 TRm2011-2b transposase
MAPLRGWAPRGERLVGYAPFGHWNTMTFVAALRADRVSAPFILDGPINGE
RFRIYVQQVLVPELKAGDIVILDNLGSHKGQEIRAAIRKAGARLFFLPKY
SPDLNPIEKLFAKIKHWLREAQARSRDAIHDELRHILQAVTPQECAAYFK
EAGYERA
>SMa0301 hypothetical protein
MTVFALTTFLNSFADALDRDARTVSRSHAARLRRLADQLPPMNDVPQGRV
PIQAACEAELAAHDQSAIARALATLLPFVHITRSKSYLANPPSSDFGENY
GYGVICGPSSGPPALIKDPDIALGLMFLGPKTHYPLHHHPADELYYTVTG
PSFWRAGEADWTRRGIDEIIHHPPWLPHATLSAEGPLVLLYIWEGDLETD
AAFIPDTVTADAALSSMGLDAT
>SMa2059 hypothetical protein
MLTRLTVHPTKQHKSSGSIFVTAESRLYKASKRFTAFAEMLGIDRGGMLI
NPRWTGSDGRQSQPGSAARNCRKIKRGGRGRLSDRPRLQKLFGVRLQSYS
NLVVDSYFTRNAKRGSRIIILPLVLVLLVIAIYGFLGISTADLTASQSRF
TPQASSQVMANIATIPRRIMMPATLTGSIRVSCSTLQATVCSA
>SMa1103 Probable adenylate cyclase
MPLAVSTDDGKFCRRILPILRVWKTRPSASLVSMEMRGQPKLLVTGAYRL
GFEVSPFGELRDCACSSISTIRQRSRASSLGPLFGQIYACRRVNRIANSL
LCRGIMSSNPHRLRPRGEPCSEGEVCSMGIFLFNAHRYDTGRSALGRTSV
EKLFRESESEAEYTTGWVRITLGLAMFVSGILVTSGTTELTDMNNSNQLR
ASALFTVFAFLALGIISLLLVVSGRFRPWMAFVLVTCDAAILGVSLYFAL
HGIGLGGNWVAAIPTIWAVPLLLAVGALRYRPLVQIWATTATMIALVGVA
SVLGFPLSPSGPETRATLGTLGESVGRLFSLPSYLMRAVMLTLIGLTTAL
AMARSRRLLIREVSETARRANLARFLPAEIAPLVGEDDLATWRQGRRQQG
TILFIDIRGFTAYAEKLDPARLSIFISSFRRRVVRATEAFGGVVDKFIGD
GALVVFGIPEPQSDDCTRAIACAHRLLALIDRWNIKRGFDPPVRVGIGIH
SGELYYGLVGDDHRLEFTVLGDTVNVAAKIEQATKRFDTALLASETVVLR
ARQQPSWQAVCREPLGGRGEHVAVFIYTGTKPAG
>SMa2009 Hypothetical protein
MSTSISSLTPYFIVKNARVAIDFYGRAFGAEELFRMTDPRDGRIGHAELK
IGESTVMIADEYPDFGALSPDTIGGSPVTFHLATLAVDADLARAVDAGAV
ALRAAADQGYGERVAMVVDPFGHRWMLSQKIEDVALEEMQRRWNEQTGA
>SMa1976 Putative adolase/adducin
MRNVDLGGEEIWQARVDLAACLRAAARYGLEEGICNHFSALVPGHADVFL
VNRLGWAFEEATASSLLICDFDGNVLSGDGEPEATAFFIHARLHKAAPRI
GAAFHTHMPNATALSMIEGEPLEWAGQTALKFYGRTVVDEDYNGLALDDR
EGDRLASVLGDKDILFMKNHGVMVCAPNIAEAWDDLYYLERAAEVQLKAM
STGRPLLPVDPEIAAATARQMREGDPESARLHLESLKRRLDVVAPEYRL
>SMa0945 hypothetical protein
MKGRLMTIEKARPTATIAERAKRELKEYALLSVYLYVCFGALVLYKMAIL
GSQGVHVSAFGVPIIKALILGKFILLGHAMKLGERYGRLRLVSVIAYKAG
LYLLLLIVLSIVEEAIVGLSNGRTIAATLSEVGGAKLPELLATSMVMLLI
LIPYLASREFAVALGEGRLWGLLLEYRDSLDHGGPAHVAKE
>SMa1485 hypothetical protein
MSEGEAGQAISISGLFAVLTSLFIAGFTRKIDRKFVLSSFSLMLIVSGLV
VSFAPNYTALMVGRALLGVAIGGFWSMSTAVVMRLLPESAVPEGLALLNA
GNAIAATISAPLGSFLGDYIGWRGAFFFVVPLALIALIGQWTNMPSLPPR
NRRATGNVFRLLARRQVALGMTAILLLFMGQFALFTYLRPFLESAAGFSV
SGLSLVLLLMGLAGVAGTWCISRLLVTRLYSIVIAIPLVMAAIALTLMAV
GSKLPVAALLIAWGFFGTAAPVGWGTWLSRVLHDDAEAGGGLQVAVIQFA
IAIGAAAGGLLFDWAGWWSSFAFAAALLLGSSVAAWAASFEWEHSKSNGD
PPRSRRGSRSAAPSPGPYCRAWYGLKRRLT
>SMa1345 Probable ABC transporter ATP-binding protein
MAHIELKGITKTFGTHTALRDLSFEIADGEFFVLLGETGAGKTTTLRLIA
GLEKPTGGQIFIDGEDVADWGAAERDVALVLQQYSLYPRYTVRENLEFPL
KSRIRRVEPVEINERVSRVARTLRIEHLLDRKTDRLSGGEMQRVSIGRAI
VRKPRLFLMDEPLSALDAKLREALRTELKNLQMNLGATFLFVTHDQIEAM
SMGDKIGVLNNGQLVQTGTPQEIYRYPVNTFVARAVGSPPMNLISGKLGA
SEAIADEGYRLPFDTALGAGLNGRPLTFGIRPEDLFLESGAPGEARVHDV
ENHGVEKIVTLRTGNHFLQATVPAQTDLEIEKSVRFSWNPEKVVLFDGGS
GMSLRHAG
>SMa2279 hypothetical protein
MSAQILDRTLSAFRIGDPKGTYPIFDATGSTIAPGRWNTPASPIIYSSEH
YSTAVLEKLVHGSGRLPPNQHYVEITIPRGFTYEVFSPPDIPGWDTMPPT
VSRGFGEQWCLERRSAILLVPSVVARLDKIILINPAHHEFPEIQVSLHQP
VYWDRRLFGS
>SMa1929 Hypothetical protein
MGIKNYLIEGGSGTGKTSVATELERRGYHVVHGDRVLAYVGDPETGQALA
GPPKGADRIVWGYAHWIWPVDKVRVIAADTTYPVTFFCGGSRNFHKFLDL
FDKVFVLDTDVETLNRRLDGRPNEPGFEPAERALVLRYHDTREYLPAGIN
IDTAGTVQSVVDGILAQLT
>SMa1881 Hypothetical protein
MAMERFTRLVLLASALESLPGIVLAGEPFVPSWIKNDPTAKTVAIEIVAD
WNQVERFRRGNIRTDIIDFNGYWGGNLTIIVPAAWTVQLEFINGSSSFRH
SLMVTRVYAQSEMPVKLTAEDAIWGAYTDPPEGIKLNERRQLNFVAKDVG
NYFLACARQTHLMDGHWIYFEVRDDLKQAVAIVDESKFPQEQPPGRP
>SMa2087 hypothetical protein
MTRKNAISHIWYTRCPVPTPVGLATQLGLLDTAFAAEGIKLNSIIDSKDR
SIRSSHFDHHLDYSFRHGGNVPPIRARSEGNPTRLVGITWTDEFQAIITL
SETGIKTTRDLVGRRFGIARRPPGIVDFMAATALKGLVSALSLDGLAPSD
VEIVDIPLSESVLDGREGPQLYGLRNRQAYGPEIAALLRGEVDAIYVKGT
PGIAVANLFAAHTVAEFGFHPDPKIRINSGSPRVLTVDERLAEHRPDLVA
KLIATLNQAGAWAEEHPDEVRRFVAREVGASEEVVAAANGPDLHKHLGIG
LEPGLVAAIGHYKDFLHEWGFLECNFDINAWVDHRPWAELDVRTVA
>SMa0322 conserved hypothetical protein
MKELPASQAYRVLEPGPIVMVSTSDNGKPNVMTMGFHTMIQHDPPLIGCV
IGPWDHSYQALRKTGECVIAVPGLDLAETVVDVGNCSGDRVDKLQRYGLM
TQPARDVSAPLLRDCLANIECRVVDTRLLDPYNLFILEATRIWINENRKE
RRMMHHRGDGTFTVDGGTLDLKDRMVRWRHLP
>SMa0543 hypothetical protein
MVAVALLMPGTDEDATETATPPATTTEPTTPQPSTSAPPPTTEPAPTTPR
RNHQPLSRPTRSPRSRLTHGSDRRSLLLDRRNVRDASRDRRARRPSRFGC
QRRNNNVSSRACGSQMATQDKRGGNGRNAGNGSYLRRTSSTLSIESASTL
TSRSISPARESIRVTNGTAPYLRGRFLSISPHQAAFDAVEDFPHNL
>SMa1693 hypothetical protein
MTWVAIKTVLIVKAGIWLVALSSGTFGGVLTPLLIVGGAFGALFGRVLPG
EGTLGASRHGVDDGRYHAGAADGHLFAVELTVDVCLLVPLFAATVASYGI
TVLLLMIDPNRKDRAPRTARHPRICPHAAIRTLAIGAETVGWPLSHYGER
SLLTAPRQGHSSAEAYIRSPEDNYSLDTPSFPGRDQMNFTVTP
>SMa1207 FixK-like regulatory protein
MFIGTRAMETLQLTSSDRTTFLRSRFFVRLPRSTVDSILKDTRLSTYEEH
DVLFHQGDGIDDVFFVLSGLIRLYRVGKDGREADVAVFSKGEMFAENAMY
LGRATASAEAAEASIVARIDGAKLRQLAAADGDVAQAFIEHLCHRGKMTE
DLLAQDRLLTAPQRVASYLLGHCPNGTATSFSFRLPFQKSVLAGKLGLAP
EALSRAFSTLRQSGVTVKGRMIEIHDRHALERF
>SMa0241 conserved hypothetical protein
MHILIIGAAGMVGRKLTQRLVKDGTLGANSVEKLTLVDVVAPERPQGFAG
TVVAREGDLSASGEAEKLVEGRPDVIFHLAAIVSGEAELDFDKGYRINLD
GTRYLFDAIRLAHDQDGYKPRLVFTSSIAVVGAPLPFPIPDDYHLTPLTS
YGTQKAICELLLSDYSRRGFFDGIGIRLPTICIRPGKPNKAASGFFSNIL
REPLVGQEAVLPVSEDVRHWHTSPRSAVGFLIHGATINLEKVGPRRNLSM
PGLSATVGEQIEALRRVAGEKAVQLIRREPDEMIMKMVAGWAPGFEAKRA
TELGFTAEKSFDEIIRVHIEDELGGKL
>SMa1255 Conserved hypothetical protein
MIRAEAMSEPENATVCDSIRDALRMIIDPELGRNIVDLGLIYDVSVEDGG
IAHVTMTTTTKGCPASEYLKEAVRNCVWYVPGVEYAEVRLTYEPAWTPDM
MAG
>SMa0407 hypothetical protein
MELRVGGAKPSDNSPAASKGFSDPLDSHGVPGRSKQYLIHSAFTGTIGGV
AMAYVDWAIKGPKIASCSCDYGCPCEFNGKPTEGLCEGLECMLIEEGWFG
DLRLDGLKVAAVYRWPGPVHEGGGVVRGFFDANADQAQIDALFTILGGKE
QEPTTVFNIYGSTIAQELEPIFAPIEFHSDIEKRTGGFRIDGHLELELEP
IRNPVTGAPHRARIVLPEGFEFRQAEIASGTFNANGEIAMGRQKRYGALW
RAAYGPYGIIEE
>SMa2363 putative amidase
MAGDATSLGQAIQMGRLTASEAMEASLAAACQLAETGAIVHVDPTLGRTA
AENADARLRHLPGGGRIPPFLGVPSLAKDLGGPFAGLSVAAGSNMLERRA
AVAEDSDLAERLRGTGLCFFGLTTVPEMGLSLASEPAAGPICRNPLDAGR
TPGGSSGGAAAAVAAGIVAIAHATDAGGSMRVPAACCGLFGLKASRGAIA
AGPSFGNHLGGIASELALCRSVRDLAVIFNAAAGRARGPFADPWFAPSPP
GPLRVGLLLETGDQYPTEPARSEAVEEAARALEADGHSVVPMYWDAFAAS
VATSGRALRDIIAVNLANFVVSAGLDAGRSERLTQAFINHGSQLEATALW
ATLGDAVHASHAVWSLFDRVDCILTPMLASAPLPIGSFPFDHDDIDLQIR
RMTAFAPLAALANATGFPALTLPFGADDAGLPLPVQILAPMGGDRLLIEL
AARLEREGRWQHRFAVAGIPE
>SMa2147 hypothetical protein
MNEECKGDDGEDGRIHFAFDRDIQTIAISAIVPLKSLPEGARESRKFAQV
LSSIKAIGLVEAPVVIADTRSAGTWYLLDGHLRLEALKELGFAEVECLVA
IDDDTYTTTSGQPAGPIQEHRMIARAIERGVSSADIADALGLQSPSFAGS
VCWKASARGSRDAERHALLDEGVRCPAADERRSATRSGRFDDRSAQFYAY
VRASDTRCHASQPVGRSQKDDRRCCIHAARPADCSHGKGTGSTSDSSEKR
RGNLRHRQSASYRRTRIRCQAARQHSHHPVAFVSPPRISWGVSENR
>SMa0291 hypothetical protein
MVLFCSDPHISPETNPENAVSEFISYLFAILVVGPLQAEISERLRGVPST
EIVQAGRACVAAEAPRLLQRAQEDWGWAAANAVGVSVGLIDPASLLAGQN
ADCDRLVQVLSQSSGGNGDDEA
>SMa1160 hypothetical protein membrane domain
MIAGLAVFALGGLAVAGDYRIAAAGAAALAGVLASRELMHNLLKTLSWIE
LRSALVLAAMTAIGLPLLPNHAIDPWGGFNPREVWLFTLLSATISFMGYV
AVRVLGSSRGLLVGGLIGAVVSSTAVTASFGQKARSGEEPLPLAGAASIA
AVVALLRVLTVTLFFSPPVFPEVCLAHDRSSFDLCRGWCRSDD
>SMa1651 putative ABC transporter, periplasmic solute-binding protein
MSVKRRTFLQGAAGAIGLAMAQGALSKIVYAQGAAGTLRVAIAKPAGNLD
PQSHYAIWAIQDLMFEPLVKYGKGGQIEPCLATDWKIEDGGKTLHLTLRE
GVKFQDGTKFDAAACKWNLERWMGIDQFSWMNCSKHFQSLEVVDDYHITV
HFKEPVLALMQELSYTRPTRFLSPKSVDADGKFKEPVGTGPWVQISADDT
QSVFEHYDGYWGDKPTYERLEAKVIPDARSRVAALRAGEIDLVGGFWIAP
LTPEEGKQLEAAGFNVVVDPGNVTLVMAFNPDRAEPLKDPQVRKAVSIGI
DRAAISKVLYHGYAKPAGNMFSAALPYAGKQFDAPVRDAAAASALLEKAG
WTGSPIRSKDGKPLTLELVVSPDAVPGSRVIAEVIQSEMKEVGIDLVIRS
VDHASKHTDMLEQKYDLGFFLTYGAPYDPFGSLVALCLSTFKNDVEGKLV
TDPVNLDPLINAATAATGDQIEPTIQKVYDWLRDNDAIAPLVYVPSIWAH
SKRVQGFTSPVTEYDMPYENIVLAE
>SMa1421 Probable ABC transporter ATP binding-protein
MYELKSLDKKFPGVHALKAIDFHIKRGEIVGLVGENGAGKSTLMKVIYGA
YQPDGGQILINGDAVRFANPRQAMEKGIGMVFQEQSLIPNLTVMENIFLG
YEQQFVRLGVINWKEMAKAAKAQLAKVKLDIDPSTVTSKLSFAQRQLVEL
AKVLTLEERVDGDLVILLDEPTSVLSKDEVELLFKLVRELVTRASFIFVS
HRMDEVMELSDRIYVMKDGQVVDVVERGGGGADAESIQHKMVGRNVDKQY
YREQLQKPYDPSRVLVEMSGIDLPGRTKDISLRLHAGEVLCLVGTEGSGR
EAILRAIYGMLSPTGGRLKIKGREVRTYSPRQSVGLGVGYVPRERKIEGI
VAGMNVYENMTLSQLKKHSTASVLQVGKERALAREWIRKLSIKAHSELAD
CGNLSGGNQQKVVLAKWRSAGADIMLLDHPTRGLDIGAKEDVYEMIRAMS
DAGVGIVLVADTLEEAIGLSHTIVVVKDGRIQKRFDCVPGAKPSLYDLLH
YMI
>SMa0157 conserved hypothetical protein
MSSFRRKLTTSAVAAICSLVASTAGAQTVLKASHQFPGGKGDIRDEMVQL
IAREVAAANVGLEIQVFPGSSLYKPNDQWNAVTRGLLDMTSFPLDYASGR
HPEFSATLMPGLVGNFDRAMRLNDSEFMGDIKKVIENAGALVIADAWLSG
AFASKKNCITSPDTIKGQVIRAAGPAFEEMLVEAGASISSMPSSEIYTGM
QTGVLDAANTSSASFVSYRLFEQAKCLTAPGENALWFMYEPVLVSKRVFD
GLTEEQQKAILAAGEKAEVYFNEEVRKGDQVMIDTYKKAGVEVVEMSKED
YDAWLELAKASSYKNFAANVPGGDKLIEKALAVK
>SMa0731 hypothetical protein
MAKAKLSDEVKTYIVQALACFDSPSVVAAAVKKEFGVDVSRQLVQSHEPN
KKAESGRAPPQNHAVPALARPIWRADDGAGRGRRAVPCCYAGNSIINWDR
DFFVMDESGKKARRV
>SMa0506 putative ABC transporter, periplasmic solute-binding protein, family 3
MTMRLKSLLLPLVGLLAITVGAATASASSLDDIIARKKVMIGVDLSVPPF
GITNEEMQPDGLDVDVAKLLAKDLGVELELVPVTGQSRIPSLQTGKVDFV
VASFGIYTERALSVAFSNPYGGHRSIIIAPKEATIKSLADLAGKRVGVPR
GTAHEKILSAANVEGMELVRFDDDSTTLNALVSGQVDAIGTVNYIAAQLQ
ERYPDRGFEEKTTYLQSFYGVGLRRNDPDLLHWLNTVLFVHKQSGELGAI
YEKWMKTPIPELPSF
>SMa0412 hypothetical protein
MSKSLQPKSGCALRVGELEGPLGRSERRSVRSPTMIKMSVYYPADGGSKF
DHDYYRTRHMPLIQERLGDACLRYEIDKGLAGREPGSAPEFVAACHVYSP
SLATFQEALGPHRSEIAADVANYTDIAPIVQISEVVEG
>SMa1630 hypothetical protein
MGRFMRRNTMDLPDIVNMYFDADSCNDTDALSETFAPDAVVEDEGARHQG
VVAILRWWVAAKKAASYVAEPLESTVDGDKALVRAKVSGRFPGSPVTLTY
SFTIKDGRIARLEIQ
>SMa1682 NapD-like protein
MFAVHDFLKSAQTGAVAVAAALLLVTSAKAQSDPLPSWNDTAPKAAIVSF
VEKVTKEGSPDFVPEPERIAVFDNDGTLWVEHPMYTQLAFALDRVKAEAA
AHPEWKDKQPFKAVLEGDMKALAAAGEKGLVELIMETHADMTHDEFQKVV
SEWIATARDPKFNKPYTELVYQPMLELLAYLRANGFKTFIVSGGGIEFVR
PWAEQVYGVPPEQVVGSSIKTQFQMRDGTPTLFRLPQVNFIDDKAGKPVG
INAHIGRRPIAAFGNSDGDLEMLQWTTMTGGSVRFGMLIHHTDADREYAY
DRKSEFGRLDKALDAAAINNWTVVDMKADWKEIFLEK
>SMa1857 Hypothetical protein
MPNWSRGKGPDPSKFDPFTKSGKITRRQEQYNLLHEAGAPSEPENWGIDV
RKSLPEFKYMTFDVVGTLIDFEGGLKDCLAGIAAEAGATIDGEEALSLYR
AARYSKDADLFPDDLVRVYLEIAPKLGLPAEPKYGERFRDSTKNWKGFAD
SAEALARLAKSCRLVAMTNARRWAFDLFAQQLGNPFYAAFTADDTGTEKP
DPVFFEKVFDFVGSEGNSKDDILHVAQSQYHDIGISRKLGLANCWIERRH
AQKGYGGTIEPAEFTAPDYHFTSMAALADAVVVARG
>SMa2199 putative ABC transporter, periplasmic solute-binding protein
MSAVKANRYQTAVAATAWVAVGVVLTTGAVHAQTAADILPQIYRDAGVIK
LVTDAKYPPFQSVNDAGEIVGFEVDLWNAIADRLNVKVDVTSVSFDSLIP
GVQAGRWDIAMEGITDNAERQKVVSFVDYGYTTSSAYVLEQKGSEIKDHL
GLCGLKGSAQSGTEWVGMIAKEIGDACVAAGKDKPSVSEFGTSEATLLSL
YSGRSDFVLTSAALAGEIQKVAPHPVKVIPMQILPRMPSGIAFRKDETDL
GDALLLALKEIRANGDYEKIYAKWTVSPMAMEHEPGINLATVPQTK
>SMa1602 putative LysR-family transcriptional regulator
MKSVFHLWRSHMRHLNDMALFVEVARARSFRKAAEALGMPNSTLSRRVSE
LEKAIGLRLLHRTTRKIELTEAGQLYYERSKRIVDEARLAHEQLGAMLNQ
PSGVLRVSLPVNLATFYLTPIIGEFARHYPGITFEFDLTPRRVDLVAEPF
DVAIRIGKLEDSGLITRLIGRHSRHLYASPGYIEASGEPATPAELAEHEC
IGMLRSPIWSLHQGINKVDVAVRGRFTLNSVGMIRALAVNHQGIALLPEK
IVAEDVAFGRLRRILPEWQGSAVSIFAVTETRLIPAKTQRFIEFLSSRLM
EA
>SMa2079 probable ABC transporter, ATP-binding protein
MPLISASNLKTHYKTRDGLLRSVDGVDLVVERGETVGLVGESGCGKSTLG
KTLLRLVDPTAGRIEFKGEDITALDQGRLRNVRKSIQMIFQDPFASLNPR
HTIGEILEAPLIVHQAGSPPERRSTVASIVAKVGLPADAINRYPHEFSGG
QRQRIGIARALLLNPELIVCDEPVSALDLSIQAQILNLLVEMKKEFGLSY
LFISHDLSVVRYFCDRVLVMYLGRVVESADNETLWSDPRHPYTRALMAAV
PDPSRPRQAAPLGGELPSPSNIPPGCRFHTRCPLATELCRVAEPEFRSIK
PGHRVACHVADHIN
>SMa0796 probable
MQRFQCYINGEFADGEARFESIDPTTGRAWAEMPEAREADVNRAVEAARI
ALHDQPWSTLTATQRGKLLYKLADLVAENAGRLAELETRDTGKIIRETSS
QIAYVADYYRYYAGIADKIEGSYLPIDKPDMDVWLRREPIGVVAMVVPWN
SQLFLSAVKIGPALAAGCTMVVKASEDGPAPLLEFARLVHAAGFPAGVVN
IVTGFGPSCGAALSRHPQVDHIAFTGGPETARHIVRNSAENLASTSLELG
GKSPFIVFADADLESAANAQIAGIFAATGQSCVAGSRLIVEKSVKDRFLQ
ILKAKAETIRIGSPLEMSTEVGPLATERQSNHVKTLVARSLAAGAKLVTG
GTAPEGAGFYYRPTILDCDGSASPSLENEFFGPVLSVLSFETEAEALHLA
NDSRFGLAAGVFTQNLTRAHRLMKGIRAGIVWVNTYRAVSPVAPFGGFGL
SGHGREGGLEAALDYTRSKTVWLRTSDDPIPDPFVMR
>SMa0089 conserved hypothetical protein
MKRNEQMTRRDAVFPANRHALYEAHGYSAAIRSGDLLFVSGQVGSRSDGS
PEPDFGRQVQLAFDNLRATLNAAGCTFDDIVDVTTFHTDPENQFETIMAV
KNQVFGTPPYPNWTALGVNWLAGFDFEIKVIARIPEAA
>SMa0713 putative ABC sugar transport ATP binding protein, amino terminus
MAGISIQNVYKDYGALNVLKEFSLEIADGEFVVLVGPSGCGKSTMLKILA
GLEPASGGKIMIGDRDVTDLAPGDRDIAMVFQNYALYPHLTVGQNMGFGL
KMRGMPKAEIDRRVRAAAKILAVDHLLDRRPKALSGGQRQRVALGRAIVR
EPRAFLMDEPLPTSTPSCASIRAPRSARFTSASASRRSMSRTIRSRR
>SMa1700 hypothetical protein
MMKSAQTSSFDPIGPDELETIGKAFLDELQRRALPRKSEEAEALAAKLIN
AYQSGIRDDLGLSIVAGLS
>SMa0367 hypothetical protein
MVMRKYLTVALLITVAIGPASALDIGVGVSVGGTGVSAGVRTGKNGTSAG
VSTSVGGIGGAKAGTAAGKSGGSSIGASGNLGGIGVGAGVGVGKNGTSAG
IGAGIGGAKAGASVGTSGGSSIRASGNVGRASGGVSTGSVPGAGLSGTGP
GNAPSGLAARSGSPATGTASRILGAAPEKGVRPSIALPRILWPLKSRRRH
ERGEWSYPLRFPAPIAAITGTPRAVVRVCRQAIAQAASALGGVRVRAVSA
GPLHRGRRGTLTAPLDVRIDYAGQGGREVRQARIRCRLDTSGRVIAVI
>SMa1154 Hypothetical protein
MHGTNSHPTSAFGLGWTSLPAPFEDPNELFSLGRLQTDALSAVLRYQIEA
LSFLKSRREQDLRYLQEIWSPAHVNDSFDLWCSFWQDAFLDYSKEAGRMA
DIGSSIAAKAAKRVHREEKVFADNLAAQTLV
>SMa1519 putative oxidoreductase
MSRALDKAGRWIGWAFFADLANGLALTFGYMFSRPVTMQYPDKEKWLPYS
RYRGHHFLKRDDEGEIKCVACELCARICPCDCIEVVPYEDEKGNRRPAKF
EIDTARCLFCGLCEDACPADAIALGQQYEFSSFSSRDLVIGRDDLLAKPG
KAMTGGGVVAARLNTERDVLVEASEPRGYNWWRNIRRK
>SMa0101 conserved hypothetical protein
MAFGIKADLILINGRIWRGREEGISEALAVWQGKILATGSDTDILGLKGP
RTEVIDLEGRFATPGLIDNHLHLIATGMAMGWVDATPASAPTLAALMGRI
SDRAATTPKGGWVRARGYDQVKLDTGRHPTRDDLDRVAPDHPVLLTRACG
HVSIANSRALELAGITEATAVPEGGVIGVTEGRLNGFLAENAQNLVKAAM
PSATTEDLIDGIERAGRYLLSFGITSCMDAAVGQVSGFAEIQAYEMAKLS
GRLPVRVWLTLLGDPGVSIVEDCWRAGLLSGAGDDMLRVGGVKVFLDGSA
GGRTAWMTRPYRGEPDNIGVQMLPDAEVEAVVKACHDRGYQMVCHAIGDG
AIEQLITAYEKALAANPDPDRRHRVEHCGFSTPDQNARMKAAGILPAPQM
AFIHDFGDSYISVLGEERGRLSYPIGTWMRMGLKPSTGSDSPVCSPDPFP
NLHAMLTRQTGKGTVMEASERLSRQEALQTYTEYGAYSQKAEGVKGRLVP
GQWADIAVFDNDLLAAPPETILSDTSCVLTLLAGRVVHDAR
>SMa0031 hypothetical protein
MSEWIDFERWPDCKRMERPGIVFEVTNGDQTLLTGCVVPLPLPSDWVAHP
LRFRAVPQPRPRHSSPLPKPAGPQQ
>SMa1961 Putative POLYHYDROXYALKANOATE DEPOLYMERASE
MIAWRRQRGGDMRSLSDTLERLARFRKDKSGHAATARSRLSRLQRFGSNP
GALQAWYHVPVGLKESPALVVVLHGCTQNAAGYDHASGWSKIAEDFGFAV
LYPEQVPANNPNVCFNWFTPSDIRRGQGEVHSIRQMVETIIVEYGIDRRR
VYITGLSAGGAMANAALCAYPEIFTGGAIIAGLPFAAATTVPEAFDRMRG
HGIPDVESLRSRLSGASPHAGPWPTISVWHGTNDRTVAEANAKAIIAQWS
GVHGVPSNPSSVETVDGHKRLAWRDRSGRDAIELYLIEGMGHGTPLKVAS
GYGHTAPYMLDVGISSTLHIARSWGLTPLSRRQPEKAGSVQPAPPHQAAH
RSQWDRRADIQAVIERALRSAGLMR
>SMa0510 putative D-isomer specific 2-hydroxyacid
MPKIELLQVGPYPSWDEERLNANFTMHRYFEAADKAAFLAEHGAAIRGIA
TRGELGANWAMIEALPRLEIISVYGVGYDAVDLAAARERGIRVTNTPDVL
TKDVADLGVAMMLAHARGMIGGETWVKSGDWAKKGLYPLKRRVHGKRAGV
LGLGRIGFEVAKRLAGFDMEIAYSDTGAKDFARDWSFIADPVELAARSDF
LFVTLAASAETRHIVGRRVIEALGPDGMLINISRASNIDEEALLDALESK
VLGAAALDVFEGEPNLNPRFLALDNVLLQPHMASGTAETRKAMGQLVFDN
LSAHFGGRPLPTPVL
>SMa0638 hypothetical protein
MTAAPVTEKSRYRSRPPSLYAPWWKSKEVITPSVIGGGASLLTVIGGIPW
QNLLLILVAFGAIAGFLYWRKNADRRAVARVEGMA
>SMa0424 putative DNA ligase
MSRRAKPLLQDDSVAKSKPARPRDPAQPNLPFDPMAERIEPCLALLKPVP
PKGPDWVFEVKWDGYRLAIHIEPKGVRIITRGGHDWTHRFPAIAEATSKL
GVGTAILDGEAVVLDEEGRSDFGALQRSLGGRGGKRSSTESVFFAFDLLY
FDGHDLSGTELSVRRHLLEGFLDGPTGAIQSSEEVFGDGALLEKACSMGL
KGIIAKHRDWPYRSGRTGDWLKIKCLQSESFMIVGYEQSLTARGGLGSLL
LAGRKGHDWIYVGSVGTGFNTKDAEYLRKTLDRLKTSKPAVPLKGKNLVF
AQPTLIAEIEFRGWTHDGSLRHASYKGLREVQDNAAVFDMSERAIL
>SMa1910 Hypothetical protein
MRNRGTLIMTLIFPNSSRSFDEKRNAVRFLGHEGMFEVRFFVEADALVVA
DAELGRSKVSESKLLSAFDALRSSVYDVARKAYSGGRRDCYTLTAAHFR
>SMa1594 hypothetical protein
MFRTELYRRAQPLQTLSLKINYHKWPSTFGARRCEHGDTTWASTPISSCR
SSVIFRCAMTTLSRATDRRGGGRVVEAEIGSGLNLPFYRPAVREVLPLES
APKRLAMARRVPDPGMPVSFIEGTAVSIFLDDQSVDTVVIAWTLRTIPGG
RGDCRNAARTQIWWQAAVRRTWIGTRCRCALMAGPAHTDLASHCVETGYM
ARPKPMMFRYEGSARTR
>SMa1766 Hypothetical protein
MGIIWTIIIGFVAGIVAKLIVPGENEPKGFVLTTILGIVGAFVASYLGQA
LGWYNANEGAGFIGAIVGAVIVLLLWGAVARRA
>SMa1953 Putative beta-lactamase
MPKIVSLTAALAVFASLVPTQLIASGLSDRDRLRSELTALANAHPGRVGI
CVRDEASPAICVNGEQRFSLQSVMKVVVAAAVMQAVDDRRIALGDRLTIR
RGDLSVNIQPIADIVAERGSFETSIGDLVSRAVVESDSAATDVLISHLGG
TKAVQAFLDEAGLQGIRIDRTERELQTETDGLTWTPEFVFPERLEQARKE
VADARRQAAFEAYLKDPRDTATPIEMVGFLHRLATGQLLSASSTAHLLEV
MNRTVTFPDRLRAGVPSGWTIGHKTGTSQTRNGINGVTNDVGILTAPDGT
HVAVAAFVAESRAGKDERAATIAAAARAITAAYK
>SMa0461 TRm3 transposase
MAIEKELLDQLLAGRDPSEVFGKDGLLDDLKKALSERILNAELDDHLDVE
RLEGGPANRRNGSSKKTVLTGTSKMTLTIPRDRAGTFDPKLIARYQRRFP
DFDDKIISMYARGMTVREIQGHLEELYGIDVSPDLISAVTDTVLEAVGEW
QNRPLELCYPLVFFDAIRVKIRDEGFVRNKAVYVALAVLADGSKEILGLW
IEQTEGAKFWLRVMNELKNRGCQDILIAVVDGLKGFPEAITAVFPQTIVQ
TCIVHLIRHSLEFVSYKDRRTVVPALRAIYRARDAEAGLKALEAFEEGYW
GQKYPAIAQSWRRNWEHVVPFFAFPEGVRRIIYTTNAIEALNSKLRRAVR
SRGHFPGDEAAMKLLYLVLNNAAEQWKRAPREWVEAKTQFAVIFGERFFN
>SMa0594 hypothetical protein
MPENTGNIAPLAWQRYVEEALRRRKAEGLTQKHHSALAGVSHPTMAAFER
GETTLTLAKALDILRVVGLVDEPTEGDTQARFVRDAFERWRNLVAPLPQD
SPARFPNGWYRFDYWLEGDLKMSELTAFERILEKAVVRKTGWPPFWLPTR
EAIQPREVDGLIECWLAPQGEEVERGFNDPAHCDFWRAAPSGRMFLIRGY
QEDGAETFPAGTILDTTLPLWRMGEVLLHAEKLASLLRKDADTAVTVHFR
AMFTGLRGRVLRSWANPLSDLLVEGHGARSDEAVLEAKFSANDIESRLAE
CMLPLLTSLYERFGVAGLSLNRVEAEVQRLLNSPISKERRPRR
>SMa0171 hypothetical protein
MNATKLLLILPLLAAFAYISLVSLTYLSQRALLYPGASATPAPERASWGQ
NASIQTPDGETLHGLYSRGEPGQPSVLFFLGNADRVSNYGFFAQALAARG
IGLLALSYRGYPGSSGTPNEHGLLIDGIAAFDWLAARSGNEIVVLGQSLG
SGVAVDTAGKRPAVAVILVSAYLSVLSLAQTYYPFFPVALLTKDPFRSDL
KIAGVRQPEAVYPRPARHHHPIVFGRSSVSDRSRAQADAHLRCRPQRSVG
CPHG
>SMa0639 conserved hypothetical protein
MKAPTLAFAVLASAIGMSPAKAQTCIGICNGGTGGTTNHNLFIEREYRDF
LQQRYPNYGSRYRGMRPDISIGPGATVGGPRVGVRQQFRLRQRMVIDANK
HLRWCQERYLSYRLADDTFQPFEGARRPCNSPYN
>SMa2101 probable nitrilotriacetate monooxygenase component A
MPRKLNLNVGINTTGYLPAAWKYRSGNRHDIYDPGYYKRLAELAHRGLFD
AMFFSDHPALMTDPNSRPFHTIDPLILCTALAAQVPDIGFVATMSSTYNS
PYNLARRTQSTDIVSGGRLIINIVSSFNPSVAANFGSAPLPPRSERYAKA
SEFLDVAKKLWASWDPAREGHVPDERFWDAGSAHAIDHEGDHFTVKGPLN
VPRGPQGHPVIAQAGASEGGIELAARHGEIIYCNILSRPAGQAFGKRVRD
RAAGLGRDPKGIRIVPGVVVILGETKEEALRKHELFSGAGSEDGLIARFI
KENGIDPDGFNPDAVLDAERFIPDPNRLQAVGMGLGLSDLLTHEKLTARQ
VVRRSEGHHRLLLGTAEEVADALIDLWDDGTVDGYTLQPPRAPDDIEEFV
DKVVPILQDRGVYRSRYEERTVRERYGLPFPAD
>SMa0267 putative GntR-family regulator
MVVAIAAERASRGAGIRPTSVVEGVYDSIYHRLMSLDIAPGARIPIDVLA
RELGVSQTPIREALSRLEREGLVRKEHLIGYSAAPQWTRKQFEDLYAFRL
LIEPEAARLAAANMTPEALQQLENSAADMGHGEAPVDRNTRYSRFARADA
QFHDEILKIAGNDVIRSTLSNQHVHLHIFRLMFHIRVTQEALEEHESLLA
AFRARDPQAAYDAMRVHIERSRDRLLSAFE
>SMa1770 Hypothetical protein
MKLVWARYALDDRDAIFSYIERENPRAAVHVDEEVVSAGRPLDFPESRRP
GRIAGTP
>SMa0977 TRm24 transposase
MSQCYLQLTLPDRRRVHQLLERKVPIAEIARQLGRHRSTIYRELKRNTFH
DAEFPEYSGYYSGIANDISKERRRRLRKLSRHPQLRELVIEQLKALWSPE
QIAGRLLADGVSAVRVCTETIYRFIYSKEDYALELYQHLPEGRRKRRPRR
SRKPRDGSIPLDCRISQRPDFIADRSQFGHWEGDLLIFRRDLGEANVTSL
VERKSRYTVMIKNGSRHSRPLIDKIIDAFSPLPAFARQSFTFDRGTEFRG
FKALEDGLGARSWFCDPNSPWQKGAVENTNKRIRRFVPSDTDLSAVSQPQ
LVALAHHLNSLPRKCLGYRTPAEVFMAHLRDCG
>SMa1191 putative flavohemoglobin like protein
MLTQKTKDIVKATAPVLAQHGYAIIQHFYKRMFQAHPELKNIFNMAHQER
GEQQQALARAVYAYAANIENPESLSAVLKDIAHKHASLGVRPEQYPIVGE
HLLASIKEVLGDAATDEIISAWAQAYGNLADILAGMESELYERSEERAGG
WAGWRRFIVREKNPESDVITSFVLEPADGGPVADFEPGQYTSVAVQVPKL
GYQQIRQYSLSDSPNGRSYRISVKREDGGLGTPGYVSSLLHDEINVGDEL
KLAAPYGNFYIDVSATTPIVLISGGVGLTPMVSMLKKALQTPPRKVVFVH
GARNSAVHAMRDRLKEASRTYPDFKLFIFYDEPLPTDIEGRDYDFAGLVD
VEKVKDSILLDDADYYICGPVPFMRMQHDKLLGLGITEARIHYEVFGPDL
FAE
>SMa2299 hypothetical protein
MQRWRRRPPLWQSITTIIPFRAPSQQRPPVRQEGQHPVPQVPRAPLPRRA
PPALRSQHHLHHPRQCRPRLVPRERPAPRRHRSSRLRRRRLPCRPRGRSF
RRLRRPLPCRSREMAEEAVELARREIAYCETLLAKGGLNVLSETFRDTAE
IKAWDHYPTGDVFDPTSGAQWFYHCHPAEEGAEEHGHFHCFLRPQGPQGP
IHHLAAVGVDAHGRLLRLFTVNQWVVGDDWLGAEGTIALLPRFDVQMPRP
SYLVNRWLTAIFTAYEQQITELIRERDRALLAHRPPEGVEARQDRALEVT
SELKLSDR
>SMa1903 Putative protease
MWNLRKINWQEARDLNGFNDASEIKVGVLDTGIDAGHPDLKDQVAGYIYE
HPDLPGASSDQDLIGHGTHVAGTIAATINNDVGINGISRARIHAWKIFDD
RPDLLTHPDGTAEFAYFVDPVMYLRALLDCVDQGIDVINLSIGGGGAPDP
TESAAFEALLANGSTIVAAMGNERRDGSPISYPAAMPGVIAVGATNLQDR
ITNFSNRGNHITIAAPGDAIWSTLPTYPGQIGWRAERGPDGHWWQGKAAI
RETDYDAWPGTSMAAPHVAAAAALYIANGGKRDPAAIRSALTASADKVPA
MGEQDFTPDFGYGRLNLERLIAGIGTND
>SMa0647 conserved domain of hypothetical protein
MLRQAHDLPPVLQAIVALDAWNVLSVLQHAPWLGRVLAASILRQAGVTAS
AHLATVNLGLETMPVERRRHRDRNFRLLALPMG
>SMa0787 putative transposase
MPAERLEMRRVREILRYRFEQGLGHKSIAVRVGAAPSTVRETLRRAAIAE
LSWPLGDDVSDAVLEAALYKAAGTKTGHRRSPEPDWTQVHRELKRKHMTL
QILWDEYISRYPEGYRYSRFCDLYRGWAMKLPVTMRQDHAAGDKLFVDYA
GDTVTVVVDRLSGKTRQAHLFVAVLGASSLSYAQARWSETLPDWIECHIL
ALEFFGGAPALLVPDNAKVAIIKACHFDPQVNRTYCGMAAHYGSAVLPTR
PRRPRDKEQTSRCTLLDWLSVN
>SMa0108 putative ABC transporter, ATP-binding protein
MTEPLLDIRDLHLGIAVGRSVNRPLLKGVSFQIMPGEAYGLVGESGSGKS
VTSLAVMGLLKKPLAVSGGEILFKGQNLLELPKREMRRLRGNRIAMIFQE
PMTALNPLSTIGRQIAEMFVLHQGKSWDEAQKLAIEALASVRVPNPDRRA
RNYPHQMSGGLRQRVMIAMALACNPDLLIADEPTTALDVTVQAEVLRLIK
ELCAERGTAVLFISHDLGVIASICQRVGVMYAGCLVEENETRALFAAPRH
EYTRGLLGALPRFGSRSLHGRQRLVDIDSIIADRSKLIETRFIAPRGAEE
GGQP
>SMa0937 conserved hypothetical protein
MRPPAWVSRPSITGAKDWKQMSAVFKHGDRGLAAIVLCLQLVFGLAATVA
AQEPATATAPPAKVQQLIELLDDPEVRQWLTAKQAAAPAAATTPAGLASQ
WIAEIRRHLGGIRNAVPRVVPEWMAARERIAAEMHQGTMPILRGFAFLLL
AGYGAEFLLRYLLRRSASRSLAQFGPGLDALLRIAPLVVFAIAAIAAFVL
AGWPRRLEVAVAPLLIAWIAARLLVAIAAAVFKPAEDGRALEGGVSLTPG
AAHFWHRRFVLFACSVAFLWAVIDVMQALAFPADVRDLTAAALGLVVLGL
AIDTVLRRPVAEMSAGRRIMRNALLIGFLVLLWLVWVAGMKVLFWLGIYV
LGLPPLLRFTSATTRTMLDAEAADNVRVMRNVLIDRGARFAIIGLAAAWL
AVVFRINGSAMMQDDVFNRIFRGLLAGVVILLAADLIWQLAKGFIDLHLR
RASVNGAADSAQLARSMRLRTLLPILRNFLAVFIAVVAGMMVLSGLGVHV
GPLIAGAGVFGVAIGFGSQTLVKDILSGVFYMMDDAFRVGEYIQSGSYKG
TVESFSIRSVKLRHHRGPVFTVPFGSLGAVQNMSRDWVIDKFMINVSYDA
DVAKVKKVVKGIGAALLDDPELGPLIIETVKMKGVEQFGDYGITLSFAMT
TKPGHQTQIRRRAQAMIKDAFAANSIHFASPTVQVAGDEAQATAAAAATR
DAIAKRNAAAAQKGETAAE
>SMa1917 Hypothetical protein
MKNLDCNCLLVKEQAPIDLDNRLEGIALTFEQGLERTKPYTIGLVVGGFA
APIVGFNAGWITTTTASAQAAETPRVDALTGICSSAAGRMATARSTGLAT
LEGYDNRAKRDERVAVIMTDIQVPADILDKVSTSCSRSLS
>SMa1833 Probable
MKAMPFKDTNRQALRIACTSGLGGHMGLSDIFGRVVALIGAAAILLRRQD
SDSLEPAYGSTPKIPKAKPQGIPTLKMPTAKGWAAEQMPTAAAGLKVNAF
AAGLKHPRWIHVLPNGDVLVAEALSEPGGIKSAFDYAMFSTMKRAAAVGA
SPNRITLLRDADGDGVAEVRPVFLDGLRQPFGMALLGDTLYVCNTDGVVA
FPYRTGDTRISTTGRKLADLKAGGHWTRSLLASRDGQKLFIGVGSLSNIG
ERGMAVEEGRAAIHELGLKSGEHRIFASGLRNPVGMAWEPTTGALWTVVN
ERDGLGDETPPDYLTSVRDGGFYGWPYCYWGQIVDDRVPQDPAMVATAIT
PDYALGGHTASLGLCWLPAGILPGFSEGMVIGQHGSWNRSTLSGYKVVFV
PFVNGRPAGPARDILTGFLSPDERASYGRPVGVAIGPNGSLLVADDVGDV
IWRVTDEAERG
>SMa0298 putative ABC transporter
MKNLIEVRNLNIAYGGPSGWTNVVQDVSFEIAPGEAFGLVGESGCGKSTV
AYRLLGYGTINSLVQTGEVLFDGTDLLKLDAASLMRLRGNRIAFVPQNPT
TSLSPGMRVGSQICEMIATHKALPDGMTMERRIVELFTLVGLPDVGHRYP
HELSGGQQQRVTIAMAVACNPDLLVLDEPTTGLDVTTQRQIIQLLADLRS
RIGMAMLYVTHDLALLAQIADRVGVMYAGQLVEVAPCDKLLSAPAHPYSR
GLIASIPTNDGTDRQARSLRGMLRRDEMSTGCKFEPRCDFATGACRATPQ
LLELIEDARSVACMRWREATAPLAPSVTAKAVARTAVRSESLLSVTELSL
SYQQPGLFNRLLGRTSPAVVREINLNLAAGEVVALVGGSGSGKSTIARAI
SARLPPRAGIIRLDGTALAPSLKDRSVEELRQIQYIFQNPDASLNPRGLR
LFERTQELDDPCLYRNVERREDLVADQKLRIDEKCAGCYSKPPLDSKRTI
TSHARPPFRGRRPYRASAVPSPAAVRARSAIHAPKTGHPSRSRLPASSAA
ISLPKHWYATPPRQITTARRSRSDQCRDLARDRAQASRRSALRRSAADRT
ANSC
>SMa1835 Hypothetical protein
MHVATIESANLEQLHALSVSVGWPLRSEDLQFLRDCGRGYVAHDDIGRLT
GSAMWFPHADDFATIGMVITSPRLQSNGTARWLMEHVLWDCCGRNLRLNA
TRASRRLYHSLDFQPMRTIYQCQGIVRQADSTATTEQPPIRRLEGEDLAA
VAELDAGAFGVSRTALIGKLFAQSVGYGLFRGGRLEAFALCRPFGRGHVI
GPVVADSDADALAVIRRHVAAHENQFLRLDTPVETGPFATFLSQSGLAVF
DTVLAMSRRGKGCADVVQGSNLYGLASHALG
>SMa0574 conserved hypothetical protein
MSTGAWIERDEKQTRDCISAWRQLPDYASVNISERDAPAVIALLHRMGVG
VEAGLATVADAERFVTLPDCHRAFRILIEIEEQDLGKADAIADGIAQVLE
RANILRPVLLHGLDATAWHFVNRAHQRRWSTRVGLEDGCQLTNGEIAGGN
ADLVADALQIFRS
>SMa0376 putative hydrolase
MSKLEVLTPANSQLIFIDQQPQMAFGVQSIDRQTLKNNVVGLAKAAKIFN
VPTTITTVETQSFSGNTFPELLAVFPENDLLERTSMNSWDDQNVRDALAK
NAANGRKKIVVSGLWTEVCNTTFALSALHDVPEYEIYMVADASGGTSSDA
HKYAMDRMVQAGVIPVTWQQVLLEWQRDWARKETYDAVTTLVKEHSGAYG
MGIDYAYTMVHGAEERVKHGKRIGPNPAK
>SMa1437 Probable ABC transporter permease
MLGYIGRRAYHSVISVIGLLTLVFFLTRLTGDPSALYLPLDSSLEARQAF
ARLNGLDQPIYVQFFQYLQNLMSLDFGQSLRQNRSAIEVALEAFPATLKL
ALVAMSLATVLAIVVGALAAARPGSLFDRIAGLVSLAGASTPDFWIAIVG
ILVFAVGLGILPTSGTGTALHWIMPIVVLMLRPFGLLVQVVRGTMISALA
SPFIKTAHAKGMKRRKIIFGHALRNSLLPVITVAGDLATGLINGAVVVET
IFGWPGIGKLMIDSIIQRDFAVVQSTILVTAIAIFIVNIAIDLLYAVLDP
RIRY
>SMa0062 putative GntR-family transcriptional regulator
MTRDIKISAAEHCYRTLSRKIIGLELKPNEPIGEHALAGLLGVSRTPVRE
ALSRLSAEGLVDLRSRAGVVVAPIRMDAVRTAQFVREQLELAIVAEAAQQ
SNRRVLLGIRQAIEEQELAILEDNPDLFFECDERMHALYCSLAGRDGVWA
FISDAKKHMDRVRRLSIQAGQLDQLVEDHRRVLKAVGEGDATKAQEIMRL
HLRRAVVDLGELSQRFSSYFALDAAGDAR
>SMa1723 hypothetical protein
MLARGNIRYRRETSSGLACLSGEPVSSGFSRCSGERTATPRARYSELLVS
VVAFRKWCLRWDAARAHLGYAGLTQAPGVLRVDPRAFSVSDRDLFRMGRA
EAESRRRCEYRH
>SMa0259 conserved hypothetical protein
MTAQDLYRIRDFVPDFDTIAAEFAERSWAVSARADVRADIRYGSGVREVI
DLILPERVQAGAPLHVFVHGGYWRSGEKINYRFVAAPVLAAGGIAALVEY
DLMPGKRLDVLVDQVRRSVLWLQAHAGDFGADPARLTVSGHSAGAHLASF
LAATGPEEAYPPSLPTLQGLLLLSGIYDLSGIPDSFLRHEAEMTPMEAAA
WSPLTSSQLPCPLRIIAYGADETAPFQNQAAGLCELLRAQDKSAELLPVP
DLNHMSIVLDLADTDGVLGRQLHDLVAQPTR
>SMa0775 hypothetical protein
MLLSEVLLVRSEHVVSSAHDVFERIVESLDPLAGGRTPSFAEEKMPNETQ
SASVPGNTTLSGLIISVPETAGGPFPPLPLENKGEFVDVDSCAVERTVAT
QTCFSLLR
>SMa1507 Conserved hypothetical protein
MFNLTRRQFLRYSGATGVALGTGSLASFAGAEEPLKIGVVYVSPIAEIGW
TKQHSLGVDAIKKEFGDKVAITVIDNIFMPQDAERVFRELAASGNQLIFG
TSFSHGTPMQKVAPRFPKTAFEHCSGIVHRANLGTFEAKYYEGTFVAGAA
GGHMSKSGKIGFIGGFPIPDIVGPANALLLGAQSVNPEVTCNAIFLNSWF
DPGKEKEAANTLLSQGCDVICSMTDTATGVQVAGEGGAWSIGYASDMAKF
GSGKQLTAFTLDWSSEYLRAAKGVAEGTWKAEARWDGLAAGVVKMAPYNE
AIPADIQAKLKQLEADIAAGKIHPYAGELKDQDGNVKVAAGSVLSDTDIR
GMNWFVRGMIGKLS
>SMa2097 hypothetical protein
MSGLEDIFHTGKRFERYERLADGTVLAHFADGSAIRANLLVGADGAGSTV
RRQLLPHLKSMDTGVRRLAGKITLASAARHGISPLLTEFNTNIRPRDGRG
LMITSHRVDASAYARHDLIGSEDPDHADIPGFHFNNSTSYTWWNTAYDTD
ELGPDAVLETLDGAALLETLLRHIGHWDERILKLIRHSDPSTVAFLKVKS
STPGAVWQSGPVTLLGDAIHAMTYFRALGGNTALYDTGLLVRELVAARRN
GKPPLAAVNDYENAMREHGYEAVRSSLSAMQRNVGANRPLKAIPHL
>SMa1749 putative transcriptional regulator
MGFWHSMSWKTEGIRVTAPVQWRQYDGMVSVLWEAESQAGASGYYLADDP
RIMFFFNDVSSSVRISNQDNDLARNSRPMARAVYVPAGVPLWTGSRKTHS
FSHLNLHLHKDRVLRFLAPSVGNSAAQTALRCPVELQNVAAIDALAKLLA
DEIKSPTKHAVYAESLIGSIIAGLLDIPTEREERAGGRLTQAQINKLVSH
IDALGDYRMSVADMAAVVGLSESWFASVFKQTMGITPLQWQLAKRVELTK
KLLGESDLPVASIAAQLGFADQAHLTKVFRQIVGETPAAWRRMRQFRQS
>SMa2255 hypothetical protein
MPMRIFIDVEEAAERLEELIDLACGQDEVYVCRAGWPIAQLSFFSGGDDS
PSDEIAETLPDTVHRPGSKLVEDGKVSSVDAVWMLAAEGKPRREHDMTSA
HDDLYDEDRLPR
>SMa1582 conserved hypothetical protein
MDFMQMVRPLEIWLRTFVLSEWTFYQLGIIAAGYVFASFVASRTEPAMES
RARRIKGNPDLLRVIIAFMRRLKWLFLTLWLWLASVILKQGTWPSQRWLI
ATALSLAAAWFIISVLTKIIRNPTLSRLVAMVSWGYLAVYATGLDGPILS
ALDAAAVNLGVMRLSLLIVLKAVVLTVALIWVAVFVGNVLAHWVQRSGDL
SPSFKVLISKVIKIALIMIAGAIALSATGIDLTALTVFSGAVGVGVGFGL
QKVVSNFISGIIILLDKSIKPGDTITLGETFGSIRDLRARFVSVITRDGK
EYLIPNEDFISQQVVNWSFSSDYVRIDVDFGTSYDSDPHEVVRIAVETAS
AVPRVVNDYNAPVCWMTAFGASSLDFRLRFWISDPANGLTNVRGQVLMAL
WDSFKEAGVSIPFPHREIIMKTPVEIQRPPRG
>SMa2287 putative LysR-family transcriptional regulator
MRGEVMRKLPPLGALRVFEAAARRLSFKDAAEELNVSATAVSHQIRQLEE
MLNVKLFERATRQVHLTAAGKTLFPVLREGLDRFEQAIADVHRQQAGQVA
RLTSTVAFVAKRLAPLAGSFREMYPDWTLRLDASNRAVDLEADADAAIRF
GGGNYPGLVTEPLFADRFAPVCAPSLAQTSAADLRLATLIHFDWGPARRD
DPRAPVWRQWLARAGVEDIDASAGISFTDEIHAVQAVVAGQGIGLLSLTL
VAEELASGILVQPFELSLEGDRYDLVYSPRMADRPATRVLRDWVIAQFGG
TVHQLGPRPASQPRA
>SMa0985 putative LysR-family transcriptional regulator
MLALERAIGRALFVRARTGYELAPDGHVLLERVKAMYEAAQDIHNWQESV
HSLPMVRLLSDSTLSCFTAASFNHLWSPSDSFRVCFKTSEAILDLTYREA
DIGLAAERPETGNVAARRSVRIAYAPYCAQGFDQQRNNNWVSLGTDVANQ
RWKRWTFEHRGQFITNWVNAPRSMFDLVKAGAGVGVLPCFIGDRDPGFLR
AGHVIDELSHHLWIVLHDDERDRESVRTVADRLSALLAANAPLFCGFTGQ
DPL
>SMa0905 hypothetical protein
MTNIEASIICHPPIPLAHITPLETLVLTNTLECREIEASLVPFTDFGAMH
PIRVRLHELIEAFRASAPHVDSALNIFIASRIIALLPARSGNADTAASVD
IDLSEFPWPFVVQGHRGAIIEPPRSGGEATAIF
>SMa1898 Hypothetical protein
MAFNTQRLQFSGHSGATLSARLDLPNGPLRAYALFAHCFTCSRDLAAARQ
IGAELAREGIAVLRFDFTGLGSSEGEFASTNFSSNVADLLSAADYLRHHY
QAPAVLIGHSLGGAAVLAVAGEIPEVRAVATIGAPADVGHVLKNFGASLE
EIDKNGEADVDLAGRTFLIRKQFVEDTRAHRIKDAVGRLKKPILILHAPL
DHTVGIENATEIFVAARHPKSFISLDKADHLLTDPEDAAFAGRIISEWLT
RYLAADTPQGAGPIEHVHVRETGEGKFQNAIQAGGHRLFADEPESVGGLD
AGPSPYDFLAIALGACTSMTLRLYAGHKQLKLGRIGVDVSHTKIHAKDCE
ECTETERGDSRKIDRFERVISIEGEVSEELREKIVEIAGKCPVHRTLETV
AKIETVVK
>SMa1808 Hypothetical protein
METSPAPQSRRCRAFARTRRPKTSGLRSWFREGVHKLVDKTQTLGGQSLV
VGQTAQIASVSDSASAKATRSGPDLRSSAIASRRPTPISHRRYAASTRYP
NARPSSASCPWPCCTSLTRKEEHGSALAPIGQVT
>SMa0409 hypothetical protein
MTRHHLSYGDAEAVPFGCDNSVLSAPCRRPLPGFPGHLWIHQSICPCPRI
VLWLCRRLCFTPSILRNSRSTDVPFIDAERKVASGGSNAGVIPFGALQRA
LAHSSMMPYGP
>SMa1736 putative LysR-family transcriptional regulator
MNLDLIDTFLDLLETGNFNRTAERLETTQSTISGRVKALEQAVGAKLFQR
GRAGAIPTPSGLRFEQHARSLKAGWAHARRDIGGLERYEGSLRVSGQFSL
LRTLFLDWIGELHATNKRVALHLEADYSTQIISDLANGAIDIGVVFAPKF
LPDLSIEEIGAQRLVMVSTEFANVGNVTADRYIRASYTAYIERAHAELLP
HLAHPALTVGYEELAVGIMKRFGGTTYLPEHALDYLASSGVSAKVVEDAP
DIHQPVYVATQRRRKHDPLVHRAILALKSVADRHFKISKVKDA
>SMa2197 putative ABC transporter, permease
MSNDLTLAPTASPDRPMLTYVPASTWKTRFAATLLILLVLYCATMIASNP
NFGWDVVADYLFDSRILWGLSLTVWLTVVTMVIGVVLGTIFAIMAMADNV
VISTVANAYIWFFRGTPVLVQLIFWYNLGALFPQLSVGVPFTSLWVSVPT
NTLISPVTAAVLGLGLNEGAYMSEIIRSGLMSVDPGQRQAAKSLGMTNGK
TLWRILLPQAMPVIIPPTGNQTIGMLKTSSLVSVISLADLLYSAQTIYSR
NFQTIPLLIVACIWYLAATTILSAVQVRIERHFARSSQRANITQPRRLFR
QKAR
>SMa1959 Hypothetical protein
MTTNRRSRRAIGRSDNSLRCDSCVGPCMLVQFPKSVAILNFPRYPSNSLM
LQLSKGLMSICEIESTAIAPACDTTSGKRSKPGLSRSLTLLFAAASGLAV
ANAYFAHPLLDVIADDLSLPRATIGFVVGATQLGYGLGLVLLVPVGDLVD
RRNLVIIQSLLSVLALLCVGFAPTEEVLFPALIAMGFFAVATQAFVAYAA
SLARPEERGAVVGTVTSGIVLGILLARTVAGAVVDIAGWRAVYLLSAAFT
LAITAILARVVPAQPKSGPAVSYPKLIGSLFTLFLQEPVLRVRAILAFLI
FADVTTLLTPLVLPLSAPPYSLSHAAIGLFGLAGAAGALGASRAGRWTDE
GFGQRVTGVALTLMLCSWILIGLLPYSILFLVTGVLLLDFGLQAVHVASQ
GLIYRVRPEAQSRLTAAYMVFYSTGSALGSSISTLVYARWAWTGVSMLGA
GIAAAALLFWAITLPKRMA
>SMa1100 hypothetical protein
MAAKLCRNLGGRLTILHVIMHGLRAEEASRLAEEEYLVRRVSAVTLPDLQ
PIPETMVNLFRASHGDLGEMVSILGDRIVEEAAESARSIGADRVDARVEP
GDYAETILGVADEVDADLIVVGSRGLGGLRGLLVGSVSQKVVQHSDCSVL
VVR
>SMa0961 putative response regulator of two-component system
MHAGVLIYTCNAELFLLLEYILETEGFPVRHCSDVSELVGTIGARKPLAV
LVDCSDRQLEAYGLCRRIKATRDRPPIAVLTNAPSSDLSSLGIDIVICSP
YDPRHLLAFLKGIQTSLPYGSRPDVSREQIFRHADIEMNVTRIRVMRNGH
AVSLSALQFRLLLQLLSMPDVVHSRDDLIAAGWPPEAEVEPRTVDIHIGH
IRRALNQFGPDVIRTVRSIGYSLDGLAAPGGKSGALHSAG
>SMa1379 putative ABC transporter, periplasmic solute-binding protein
MNTRILGLAAGLAVLFASTSWAQSITVAIGSEPSTLDPQLRDDGGERQVN
DNIYEALMARTATGDLVPGLAADAPTQVDARTWQFKLREGIKFHNGEPFN
ADAVVASVLRVIDPANNSEQMAYLGPLSGAEKVDDLAVRIITSAPDPILP
ARMYWLKMVPAAYSKNATAIAEKPVGTGPYKFGTWDRGNSITMTANTDYW
GGEPQIDDVTYRFVSEAGTRLSGLVAGEFDVVVNLLPEFADSVPQAKSVS
GLETSVIILGVDNPAVKDVRVRRALNMAIDRHALAESLFAGNAKVTTGQL
VLPGAFGYNESLENWPYDPQGAKKLIAEAGAAGTTIDLVGEAGRWLKDRE
LIEAVAAYWTEVGLKVNVEILEFSVYLDRIFDMGNRPDSYFVLNSNELFD
ADREMAFAYEPGQGGASNSDKALGEAIRAARSEVDGEKRKAAYAVITKKL
HDEAYDVPLLNHQDIYGMSEKMEWQPRVDSKIIVREMKVNE
>SMa1327 Putative hydrolase
MLIIKGNDVEIATEAFGDSAHPPVVLVMGGMASMLWWPERFCRRVAEHGR
FVIRYDHRDTGLSTKYPPGQPGYAFDNAVADVVRVLDGYRISAAHVVGMS
LGGMIGQATALKHPERVLSLTAISSSPVGMNTTHLPASGTAWMDHMNMEV
DWSDRAEAVAYMLEDARLVASTVHPFDEAETRAFIERDFDRSGGYLSATN
HSVLFEISDAWQDRLPEMKVPLLVIHGTADPVFPVEHGAAVATAVDGARL
VEIEGGGHELHPADWDKIISAIIKRTNTRPNE
>SMa1373 Probable ABC transporter permease
MPHMLEFILRRLFQGVLVVFGVTATVFVVTRLVGDPVALMLPLSATEAQR
AAFAQQIGLDQPIATQFLRFVGDIATLDFGNSLWQRRPAIEVVFERLPNT
LLLIAAGLGAAVMLSIPLGAVSALRPGGLVDRLTMSVGLLGLAMPQFWLG
LVAIMIFAVTLRWLPTSGMGTAAHLVLPALTLALTPLARFTMMVRASMID
ELNKPYVKTARAKGLGLTRILRVHTLRNILVPFLSISGWELIATLSGYTV
VVETVFAWPGLGLTAVQAIQRGDLFLMQAIVFVIAFLIVLIGIALDILSK
AVDPRMELN
>SMa0106 putative ABC transporter, permease
MRLGFNFWLGAGLTTLVILAGILAPWIAPFDPVLDADLMNSELPPDATFW
FGTDGQGRDVYTRILYGAQISLTVGIVSQVINSIIGVTLGMTAGYWGGWW
DDLVNGFTNVMLAIPSLIFALAVMAVLGPGLPSLLIALGLTNWSWTCRIA
RSSTLSLKSLGYVQAAQTLGYGDLRIMFTQILPNMMGPILVMATLGMGSA
VLSEAALSFLGLGIQPPFPSWGSMLTDARQLIQLAPWVAIFPGLAIFLSV
LGFNLLGDGLRDSLDPHMRTRNP
>SMa0518 hypothetical protein
MITRYALFEGKVKDGHTEAFRKAVIERILPKWKQFPHATDVRISFAESRD
EGAPELSMILAINYPDLEAVEEALASPVRAEARAATEAVLAEFFEGRIHH
HVMSASEFKL
>SMa0310 putative LysR-type transcriptional regulator
MRLTEAGNRLHETLSQPMAEVRSAFENVAGDARPSGLLRIAVTSIAEQFL
SGPLIASFAETHPGITIDVTVTDDVFDIVAAGFDAGVRLGEVIEQDMVAV
PLTKEEREVVVATPRYFELHGTPRHPRELVRHRCVGWRPSPSAAPYRWEF
EEDGIPFDVAVEPQITTNDLHLMIRTALAGGGITFALEETFRPYIVRGEL
ITALDDYLPPFPGFFLYFPNRRNMAPKLRALIDHVRAYRPTPS
>SMa2303 putative ROK family transcriptional regulator
MHSRENEAIFNLQNKIMSDVRTKGDQSTTRAMNRRLILNLLRREGAKSRA
EIAAATGLSPAAVTFVVSDLIEEGLLIEGQSVAGAQGRRPIPVAIDYAGG
VALGFKLMAGSVECVVTDLETTPLASLRLPLPAHDPDTIAEALAAAAPKL
VALANRPAARLAGIGIAMPGVIDNQRAVCIRSNRYGWDNVPLGDLVASRI
GVPVWLEDDTNAYAIAQQLFGLGRHYKTMGVLAIGVGVACSLVLDGKLYR
GAHGAAGKFGHFPHMEGGRPCECGKRGCLMSYFSEPAMLQTWHERSGRPE
TDGRAEMVAAIAAGDKAAHSVMREAGETLGRHLAGLMNVIDPEVIVVGGE
AVAYGDALFGPLRATLERFAFRQAPPVLLDWEDDSWARGAAALVTQKLFD
FETTAGNA
>SMa0450 hypothetical protein
MVTEMSSEKMRKLIQDYQRDIAAYLAPESGISERQLLQQLASRLDGQQAQ
DALGNGWQGWLPDDEDPAAADDGSPAPTRWWTEPEFFGTMWKLRS
>SMa1639 probable NreA protein
MIASGRPCLDIAQQLHAVEKAIAQAKRTLIQDHLDHCLEETIGALPRDRR
QSIDEFKSITKYL
>SMa0833 conserved hypothetical protein
MMELHDLAEDLPSKWTEIMAVAEKAFRAFAELDAVKRELAESENAQ
>SMa0922 hypothetical protein
MRHREWSRATEGDRTMATTIATLTQKTDGILEGVFATIRVNAPIAIIPNA
SKSSEEAPDYRVIHRKTGFEIGAGWNRIARQTGEEYLSVKLEAPEIGVIF
GNLAPAPGGDPSKKVILWNNPD
>SMa1374 Probable ABC transporter permease
MAASASAASPTPVGTRVPRGRISELWHDKTAAIGLALILLIVFLALFAPL
IAPYDPAAQSIMARLKPPVWMERGTWEHLLGTDNLGRDVLSRIIWGARAT
LTIGAVTCLLAATLGTVIGLWAGFMGGRTDSILMRLVDIQVSFPGILLIL
LVVAVLGPGVWTLVAVLSVTNWMVYARLVRGIVSSTRQTPYVEAAEVIGC
RPARVIFRHILPNIVSPLMTLAILEFTNIVLAEAAVSFLGFGVQPPATSW
GLDVASGRDYLFIAWWLVTFPGLAIVATVLSINLFANWLRVTTDPEEREK
RFARAEAARRRRGRRRAEA
>SMa1893 Putative LysR-family transcriptional activator
MDINQVRYFLNLAETLNFTEAARRSGVSQPSLTRAIQRLEEDLGSPLIYR
DGKDSRLTALGRDVQAEFMRIELALRNVREHSESTVLGRRRILDIAVAPT
IGPAAFAAFFDDALGELPSVKINMHQLLAGEGANEVLSGKYHACILPRAP
RSNPKLNVVPLFREPFLLACAESHPLAGKDVVSTEAIAAYPYVDRLACEF
HTEITEHLMDHDAVMQPRFSADREDWVQQVVAHGRAICIMPERSIVVQGI
VTRPVEGISLARELVFVTVSGSGTPLEIRKIAQLAARRNWS
>SMa0126 putative cold shock protein
MTTGTVKWFNSTKGFGFIAPDDGSADVFVHISAVERAGMNSIVEGQKLGF
ELERDNKSGKMSAGQLRAA
>SMa1084 Conserved hypothetical protein
MTNTLSPELLERMDAYWRAANYLSVGQIYLRDNPLLKQKLTLADVKPRLL
GHWGTTPGLNFLYVHLNRLIQTHDLNMIYVTGPGHGGPGLVANTYLEGTY
SELYPEVSQDEAGIKRLFTQFSYPGGIPSHVAAEVPGSINEGGELGYCLM
HAYGAVFDNPDLIAACVVGDGEAETGALATSWHSNKFLNPARDGAVLPIL
HLNGYKIANPTVLARISHDELEALFMGYGYEPLFVEGADPRQMHQLMASA
LDKAHGKIAEIQRQARSRGFSDRPAWPMIIFRSPKGWTGPREVDGKKTEG
TWRSHQVPLAKLAEVPAHLGILEEWLKSYRPWELFDESGSLRPELRELAP
KGERRMGANPHANGGLLLKDLNLPDFRQYAVEIGIPGTVTAESTRTAGLY
LRDVMKLNAQERNFRIFGPDETESNRLSPVFQETDRVFTGDILASDMQLS
PDGRVMEVLSEQLCQGWLEGYLLTGRHGFFSCYEAFIHIVDSMFNQHAKW
LKVCREVPWRKPIASLTYLLTSHVWRQDHNGFSHQDPGFIDHVANKKADI
IRVYLPPDANTLLSVVDHCARSRDYINVIVAGKQPQLQWLDMGAAVAHCR
AGLGVWEWASNDEGDPDAVVACAGDVPTMEALAAVMIVREAFPSLRLRVV
NVVDLMALQAPSQHPHGIADDAFDRMFTTDKPVIFAYHGYPGLIHRLTYR
RTNHANFHVHGYQEEGTTTTPFDMAVLNKLDRFHLAKAVVERVPSLASPR
EGFAHFVESKLAEHDAYIRENGEDLPEIRNWRWLASAVDAQ
>SMa1576 CpaB2, probable CpaB2 pilus assembly protein
MIRIVILLLALASGGAASWLALGTGDQRAVEVAEVQETPSQEVLVAAAEL
KRGAVIEENQLRWQPWEGEIPPVFISRSSRPDATTALKGSLALSGFVAGE
PIRDDKLAQGGTGYLSSLLPSGKRAIAVRVTAESTAGGFILPNDHVDVIH
TVARPDASGDAGKVVSRAILSNIRVLAVDQTVSQASDGASVIGKTATLEV
DPEQISAVAAAEASGTVSLALRAITDNHEASVVEAEGTRPGVVRFFNGGR
MSMVEVPSRSGGS
>SMa1573 CpaE2, probable CpaE2 pilus assembly protein
MEIPNMKQLDSETREAPQPMTAAPLLPIPKVDIAVFCQSEEVREAVGTAA
IDRRMARATVTVKAGGMKEATALYGGVTSPNLVVVESDDGEARLMATLET
LAMECVTGTKVIVIGRSNDVGLYKKLLDAGVSDYLVKPLEPMDFVAAVHR
CFRDSTEEKLGRIVAFVGAKGGTGSSTLAHNVAYAMSKRVDADVLLADLD
LQSGTLGLNFDIEAKHGMVDVLQSPDRLDDVLLRRLAVSYTDRLHLLPAT
TDLDKFINLREGDVDHLLDVARSSSWHVVVDLPHILTQWTRKILLEADEI
VVTATPDLAGMRNAKNLIDFLKKARPNDPPPRLVLNKVGTPKLQEIKPKD
FVAAVGLEEGVSLAFEPSLFGAAANNGRLVIESAPDSKAGKAIVSLAWRV
GGTRERRTRQKGVKALLQKVFKRGKPKAPSTLRKKAGELKTSEAGASAVE
FALVAPVLALGLVATADLGLAIHERMTIDHVLRAGAQAALADPGAAQVQK
VLVSTLAESPRLASAVLPAVKRYCACPENADVAPEAAPQCGTVTCANAKP
QFVYYRLAAAKSYRPMSLPAVLPVFELGSSMQVQVQ
>SMa1013 actP, ActP copper transport ATPase
MTAFTQIEKSAAVPAPTDFGIEGMTCASCVRRVEKAISAVPGVASATVNL
ATERASVQFTGAPDTGGVLLAIEKAGYEPKVIIQEFGIEGMTCASCVSRV
EKALRTVPGVADASVNLATEKGTVRFVSGVDVAAIEAAVRDAGYDVRKAK
ASGATAEPEDRRELETRTLKRLVILSAVLTLPLFLVEMGSHFMPGVHEWI
MENIGMRHNLYIQFALATAVLFGPGLRFFRKGVPNLLRWTPDMNSLVVLG
TTAAWGYSVVATFASGLLPSGTANVYYEAAAVIVTLILLGRYLEARAKGR
TSQAIKRLLGLQPKTAFVAHGDEFVEIQISDVVVGDVIRIRPGEKIPVDG
TVLDGNSYVDESMITGEPVPVQKAAGAEVVGGTINKNGSFTFRATKVGGD
TLLAQIIKMVETAQGSKLPIQALVDKVTAWFVPAVILVAVLTFAAWYVFG
PSPALTFALVNAVAVLIIACPCAMGLATPTSIMVGTGRAAELGILFRKGE
ALQSLREADVIALDKTGTLTKGRPELTDIVPADGFEADEVLSFVASLEAL
SEHPIAEAIVSAAKSRGIALVPATDFEATPGFGVRGAVSGLPVQVGADRA
FSGVGIDVSPFVVEAERLGNSGKSPLYAAIDGRLAAIIAVSDPIKDTTPQ
AIKALHDLGLKVAMITGDNRRTADAIARQLGIDEVVAEVLPDGKVDAVKR
LREGGRKVAFIGDGINDAPALTEADVGIAVGTGTDIAIESADVVLMSGDL
IGVPKAIALSKATIRNIKQNLFWAFAYNVSLVPVAAGVLYPLNGTLLSPI
LAAAAMAMSSVFVLGNALRLRSVNPA
>SMa1715 adeC3, putative AdeC3 adenine deaminase
MLQPWSEIAPRLVDVAMGRKPADLVVRNGRWVNVYSGEIVPGADIAIVGG
RFAYVGPDAGHTIGEGTKIVDAAGRYLVPGLCDGHMHVESGLVTVTEFAR
AVIPHGTTTMFVDPHEIANVLGIAGVKLMNDEAQTLPVNIFVQVPSCVPS
APGLENAGATLSAADVREALAWPNIIGLGEMMNFPGVAANDSKMVAEIAA
TRAAGLTVGGHYASPDLGRAFHAYAAGGPADDHEGTTVEDAIARVRQGMR
SMLRLGSAWFDVAAQVKAITERGIDPRNFVLCTDDSHSGTLVSDGHMNRV
VRHAISQGLKPITAIQMATLNTAQHFGLERDLGSIAPGRRADLIVTSDLT
ALPIEIVFVRGRLLAEKGVLVADIPAYDYPASAKNTVKLGKRLAPTDFDI
CAAGSSEVEVRVIGVIENQAPTKALQRRLPVECGVVQMDRASDVCQIALV
ERHRATGGVINAFVSGFGYDTHCAMASTVAHDSHHMIVVGTNKADMAQAA
NRLQEVGGGIVLIAGGRELALVELPVAGLMSDQRAEIVAEKASRLVEAMR
ACGCKLNNAYMQHSLLALVVIPELRISDVGLIDVTRFESTEVIVR
>SMa1718 adeC4, putative AdeC4 adenine deaminase
MSDVREFIRAASGGESKATVAVCGGRLVNVVSEEIYQADVAIYRDRIIAV
GDISEYIGPQTEIIDAADRYLTPGMIDGHLHVECSKLSLTSFAKAVLPLG
TTSIVSGLDQIIVVGGPDAAREFLDEVRQTPLKVFWGAPCKTPYTMPRST
VGHYFSPKDHRDTHHWPECVGIWETVREFIQEEDEDVLQAIEIGQANRLP
VLGCCPMTRGARLNGYMQSGVRADHESYTPEEMLEKLRAGMHVVVRESSI
SHFLSDNLRIVTEMGVKALRRISFCTDDVVASDILSRGHLDNMVRMAMAM
GISPMAAIQMATINGAEALRIDHKVGSISPGRTADILIVNDLRDFRIEAV
VANGTVAARDGRMVVKLVPPQRSAGLLRSVKTTPVTAADIAVPFTGTTPF
AEVLAIAVTPEKVFVRTRRDVRLPVVDGKILADASQNVQYVTVVERYGKT
LNRPVAFVSGFNLKSGAIASSTAPDDNNIICIGADPQDMAIAINHLVANN
GGQVVVDKGEVVEFLHLPIGGIVSDIDPAEMAAFELRLDEAARRLGCDLP
WPFMYMFVLQITAIPDYAMTDLGVVDCVNLRIISPLAPDGPAKANTLAAE
>SMa1296 adhA1, AdhA1 alcohol
MTMTAAVVREFGKPLVIEEVPVPQPGPGQVLIKYEATGVCHTDLHAAKGD
WPVRPNPPFIPGHEGVGYVAKLGAEVTRLKEGDRVGVPWLHTACGCCTPC
RTGWETLCGSQQNTGYSVDGTFAQYGLADPDFVGRLPARLEFGPAAPVLC
AGVTVYKGLKETEVRPGEWVLVSGIGGLGHMAVQYAKAMGMHVAAADIFP
DKLALAEKLGADLVVDARAPDAVEEVQRRTGGLHGALVTAVSPKAMEQAY
SMLRSKGTMALVGLPPGQICLPVFDTVLKRITVRGSIVGTRQDLEEALEF
AGEGKVAAHFSWDKIENINAIFERMEEGKIDGRIVLDLNG
>SMa2113 adhC2, probable AdhC2 glutathione-dependent dehydrogenase
MDARAAVAIQAGKPLEVMTVQLEGPRAGEVLVEVKATGICHTDDFTLSGA
DPEGLFPAILGHEGAGIVIDVGPGVTSVKKGDHVIPLYTPECRACPSCLS
RKTNLCTAIRATQGQGVMPDGTSRFSLNGDKIHHYMGCSTFSNFTVLPEI
ALAKVNPDAPFDKICYIGCGVTTGIGAVINTAKVEIGATAIVFGLGGIGL
NVIQGLRLAGADMIIGVDLNNDKKPWGEKFGMTHFVNPKEVGDDIVPYLV
NLTKRGADQIGGADYTFDCTGNTRVMRQALEASHRGWGKSVIIGVAGAGQ
EIATRPFQLVTGRTWMGTAFGGARGRTDVPKIVDWYMEGKIAIDPMITHT
MPLDDINKGFELMHSGESIRSVVLF
>SMa0627 aqpZ2, probable AqpZ2 aquaporin
MFKKLCAEFLGTCWLVLGGCGSAVLASAFPQVGIGLLGVSFAFGLTVLTM
AYTVGGISGGHFNPAVSLGLAVAGRVPAASLVSYVIAQVAGAIIAAAVLY
VIATGKADFQLGSFAANGYGEHSPGGYSLTAALVTEVVMTFFFLIIILGS
THRRVPAGFAPIAIGLALTLIHLVSIPVTNTSVNPARSTGQALFVGGWAL
SQLWLFWIAPLFGAAIAGIVWKSVGEEFRPVD
>SMa0693 arcA1, ArcA1 arginine deiminase
MRTVGVHSEVGKLRTVMVCRPSLAHQRLTPGNCHDLLFDDVIWVHEAQKD
HYDFVLKMEERGVEVLELHDLLSDTLIDAEARKFVLDRRVAPNVMGSQIA
ELMRPWMEEMDSRRLAAFLIGGISIADLPEGQGKALMASAFHSTQFVLPP
IPNTLFQRDPSCWIYNGVTCNPMFWPARRAETLIQRAVYKFHPSFKGAAF
DIWWGDSDEQFANATMEGGDVMPIGDGILLVGMGERTTYQAVGQVAKALF
KAGAATRVIGCLMPKSRAAMHLDTVFTFCDRDVVTLFADVVDQIRCYSLF
PLDDEGNFEVRQEDRPMLEVVAEALGVDKLRTIATGGNTYEAEREQWDDG
NNVVALEPGVVVAYDRNTYTNTLLRKAGIEVITIRGSELGRGRGGGHCMT
CPIWREPTD
>SMa1670 arcA2, probable ArcA2 arginine deiminase
MSSKSSTQHTFGVHSEVGQLRKVMVCAPGRAHQRLTPSNCDALLFDDVLW
VDNARRDHFDFMTKMRDRGVEVVEMHNLLAQTVAIPEARKWILDNQVVPN
QVGLELLDEIRSYLEGLPDRELAETLIGGLSTHEFPETHGGEMLELIRDA
AGVAEYLLPPLPNTLYTRDTTCWIYGGVTLNPLYWPARHEETILATAIYK
FHPDFVGKVNVWWGEPTTDWGLATLEGGDVMPIGKGNVLIGMSERTSRQA
ISQLAATLFEKGAAQRVIVAAMPKLRAAMHLDTVFTFADRDCVLIYPDIV
NEIEAFSYRPGEKPGSLELHKDRGSFVETVRDALGLKEMRVVETGGNAYV
RERTQWDSGANLVCLSPGVVLAYDRNTYTNTLLRKAGVEVITITGAELGR
GRGGGHCMTCPIIRDAVDY
>SMa0695 arcB, ArcB catabolic ornithine carbamoyl transferase
MSFNLRNRSLLTVQDYTPREFRYLVDLARDLKRAKYARTEQEHLKGKEIC
LIFEKTSTRTRCAFEVACSDQGANVTYLDPAGSQIGHKESFKDTARVLGR
MYDAIEYRGASQAGVETLAKYAGVPVYNGLTDEYHPTQMIADVMTMREHS
DKPISEIKYAYIGDTRSNMGHSLLIVGCLMGMDVRICGPRSLWPSEEYQT
IAKRLKAQSGARLMITDNPREAVEGVDFIHTDVWVSMGEPKEVWKERIQL
LTPYQVNAELMAASGNPQTKFMHCLPAYHDTETTIGKQISDDYGMSDGLE
VTDEVFESQANIAFEQAENRMHTIKALLVATLGD
>SMa0697 arcC, ArcC carbamate kinase
MRVVIALGGNALLKRGEPMTAEVQRQNIKIAAEAIAPIAAEHQIVVTHGN
GPQVGLLALQGSAYKPEEAYPLDILGAETEGMIGYMLEQELGNVLPFEVP
LATILTMVEVDGNDPGFQNPTKFVGPVYDASEAGELHQQKGWVFKQDGNK
WRRVVASPIPRRIFELRPIQWLLDKGAVVICAGGGGIPTMYERGKERTLI
GVEAVIDKDLCSALLARDIEADLLILATDAEAVFTGWGTPERKAIFKTNP
RRLGEFSFPAGSMGPKVEAACHFVNATGRVAAIGALADIPAMVRAERGTI
ISSSFSDITWHVEVPIPGPASRPV
>SMa1667 arcD1, probable ArcD1 arginine/ornithine antiporter
MTSTAQKLSLASLAALVVGSMVGAGIFSLPRTFGDATGPFGAIVAWCIAG
AGIFTLAHVFRVLAERKSDLDAGVYAYANAGFGDYAGFLSVLGYWLVGCI
ADVSYWVLIKATLGAFFPIFGDGNTIAAVLVSSVALWGFHFMILRGIKEA
AAINTVVTVAKIVPILIFIVILLGAFETDLFRSNFWGGADMPEASLFEQI
RATMLVTVFVFIGVEGASVYSRYARKRSDVGVATTLGFVVVLGLMVLVTL
LPYGALERPEIAAMRQPSMASVLESIVGPWGSVFVSAGLIVSVLGAYLAW
SLICVEVLFCAAKNGDMPSVLARENSNSVPAAALWLSNGVIQLFLISTLF
SEDAFRLMVNLTSAMVLIPYLLVAAYGFLVAKRGETYNIRPKERFRDLIL
AGAATVYTAFMIYAGGLKFLLLSAILYALGTALFFYARREQKKPLFSPRE
WLVFIAVVAGCLVGIYGLVTGSITI
>SMa1668 arcD2, probable ArcD2 arginine/ornithine antiporter
MAQKLSLFALTGMVVGSMVGAGIFSLPRTFGVATGPFGAIIAWCIAGGGM
YMLARVFQSLAERKPDLDAGVFAYAKEGFGDYPGFLSAFGYWIGSCIGNV
SYWVLIKSTLGNFFPVFGDGNTVVAILFASVGIWLFHFMILRGIQQAAFV
NTVVTVAKVIPIIAFIIILFFFFKLDLFRLNFWGGEGMPEATLLQQIQAT
MLATVFVFIGIEGASNYSRYAQARSDIGTATIMGFIGVSALMVLVTLLPY
AALTRPEIAAMSQPSMAGVLAAVVGPWGAVFISIGVIVSVLGAYLAWSLV
CAEVLYVAARTDDMPRLFGTENQNKVPAAALWLTNIVVQLFVISTYWSQD
AFALMLNLTSSMSLIPYMFVAAFGFMLAQRAETYEVRPRERTRDLIIASI
AAVYTFFMIVAGGIKFVLLSALLYAPGTILYFWARRERGKRVFNTSIDWL
IFATAVIGCFAAIIGLSTGYLTI
>SMa1836 argE, putative ArgE acetylornithine deacetylase
MQAAEILGKLVGFRSVVGLPNNDVVSWIRGYLESHGIAVDVLPGPEGDRS
NIFATIGPKEARGYIISGHMDVVPAAETGWTSDPFRLRVEADRLYGRGTT
DMKGFLAAVLAAVPKLAAMPLRRPLHLALSYDEEAGCRGVPHMIARLPEL
CRQPLGAIIGEPTGMRAIRAHKGKAAARLTVRGRSGHSSRPDQGLNAIHG
VAGVLTQAVAEADRLVGGPFEHVFEPPYSSLQIGTVKGGQAVNIIPDSCE
VEFEARAISGVDPAELLAPVRKTAEALTTLGFEVEWQELSAYPALSLEPD
APLAALLEELTGREALPAVSYGTEAGLFQRAGIDAIICGPGDIGRAHKPD
EYILIDELMACRAMVEALGARCTA
>SMa1711 argI2, probable ArgI2 arginase
MPTLPYAILEAPSTLGLATHGVERLPDQLLHLGLAERIHARHAERLAVPP
KEPTPDPETGILNARAIAAWSPKLADAVEAVLTAGEFPVVLGGDCTIVLG
SMLAFRRRGRYGLLFIDGNADFFQPEAEPNGEGASMDLALVTGYGPSHLT
DIEGRGPLVRPEDAVAFAYRDHKDQEEYGSQALPNELKAIDLPAVRAVGI
EAAAREAVDHLTREELDGFFIHLDADCLDDAIMPAVDFRMPGGLSWDELG
TALRVALASGKAVGIEITIYNPRLDESGSAGRDLADVLATALGTAAS
>SMa0955 atrA, AtrA transcriptional regulator
MADNSKESPDLERAPERLGDAAYREMKERIIRGVYRPGHKLTVRAIAQDL
EVSTTPARDAINRLTSEGALFYAGPKTVVVPVLDASALREITLTRLALEG
LASEQAAQHGTPAEVEKLKSLQKLINSALDEKRYAEALWHNKEFHFTVYR
LAGLPQLVSMIESLWLRIGPSLHNLYPEFAEEKYGVRNHEIAMEALAERD
AASLRAAMENDIRDGYRRLKRANPERNAGQ
>SMa0956 atrB, probable AtrB glutamate-1-semialdehyde 2,1-aminomutase
MLVQATNRLRQASARSRSKHLFDRAKGVFTDGTTRASVERDPFPIYAQRG
EGAYLVDVDGNRLLDLNNNFTTLIHGHGFAHVSEAVVDLLRLGTCFANPT
EHEIALAELLTARIPAMERVRFVNSGTEAVMFAIKAARAFTARPAIARIE
GAYHGAYDWAEAGQGVSPGKDGWDPIPIPTPTYRGTPSSVADEVHLLRFN
DVEGLERRLSAASDRIACVLIDPMPSRAGLLHPEPTFIEALSETAHKYGI
LIVADEVLNLRQGYAGASPRYGLKPDLVTAGKIIGGGFPIGAIGGREEVM
RVFGTENARPLLPQGGTFSANPVSMAAGLAAMEAMTPDAFDRLEAMGERL
RAGLRASIASRDARFSVTGAASLFRIHPKRVAPLEYRDAHLSAEEAWIMR
TMSRYFLEAGILLPYGAAACLSTPMVHSDIDRILSAFDEFLEAKIGPEKE
RAK
>SMa0958 atrC, probable AtrC acetolactate syntase
MTQQNPNVTVAQRIANILRRHGVEFIFGQSLPSAVILAAEAIGIRQIAYR
QENMGGAMADGYARVSGKVGVVAAQNGPAATLLVPPLAEALKASVPIVAL
VQDVERDQTDRNAFQDLDQIALFQSCTKWVRRVTVPERIDDYVDAAFTAA
ASGRAGPAALLLPADLLRAEAKSPAVVRSKQLGHWPLDRVRPSDDALAEV
ASLIAAAHAPIIIAGGGVHCGGATHELAALQQEACLPVFTTNMGKGAVDE
YHPLSAGVLGSLVGPRSLGRYSYGLVEDADLVILIGTRTNQNGTDTWRQI
PSSARVVHIDVDPVEIGRNYEAIRLVGDARESLAALRAALTRVDLTRRHG
DRARLEECIAQYWKGFELDRHDVVTSRSRPIRPERVMAELQDLLTGDVTV
VADASYSSMWVLGQLRARASGMRFITPRGLAGLGWGVPLAIGAKVARPGK
PVIAVVGDGGFAHSWAELETMVRMKLPVTIVVLNNGILGFQRDAETVKFG
TYTTACHFAEVDHAKLAEACGCPAVRVEDPGELAFHLHRGMDQGPLLIEV
MTDPAAHPPLSLFAKMDEAA
>SMa1243 azu1, Azu1 pseudoazurin (blue copper protein)
MRIIAKGMAVAAVLAAFTGSAFAADFEVRMLNKGSEGVMVFEPAFVKVNP
GDSVTFVPTDKGHNVETIKDMIPDGASAFKSKMNETYKVTFDVPGVYGVK
CTPHVGMGMVAAVVVGDAPANVEKVKAVKLPKKARERLDAALAVALQ
>SMa1731 betB2, putative BetB2 betaine aldehyde
MQSYQIREMATPLCQPAASHFIDGTFIEDRTGPEILSVNPVDGEIIAKLH
GATSCIIEKAIASAKRAQKEWARKEPAERGRVLSRAADIMRARNRELSVL
ETRDTGKPISETLVADAASGADCLEYFGAIAATLSGDSIQFGEDWVYTRR
EPLGVCLGIGAWNYPIQIAAWKAAPALACGNAMIFKPSEVTPLSALKLAE
ILTEAGLPPGVFNIVQGAGDVGAELATHPAIAKVSLTGSVKTGARVASAA
MAGIRPVTMELGGKSALIVFDDADVEAAVSGAILGNFYSAGQICSNGTRV
FLQRGIREAFLARLLARVAALKIGDPMDEETDIGPLVSAAHRNRVATYVA
RAEVEGAYQMAPPRKLPPGDAWHEPVVFTNVTDWMTLAREEVFGPVMAVL
DFDDEQDVVARANATDFGLAAGIFTRDLVRAHRLAAELEAGTVWINAYNL
TPAGMAFGGIKRSGIGRENGRVAIDHYTQLKSVFVSMQT
>SMa0045 cah, Probable carbonic anhydrase, Cah
MERRDFLRGLALLAACPLCVKTAYAAEGVHWRYEGEEGPEHWGSLAKENS
ACSAGSQQSPIDIRGAVKADIPELTADWKSGGTILNNGHTIQVNAAGGTL
RRGDKSYDLVQYHFHSPSEHFVDGKSFPMEAHFVHKNAETGTLGVLGVFL
VPGAANSTFASLAEKFPRNPGEESPLITIDPKGLLPSSLSYWTYEGSLTT
PPCSEIVDWMVAMEAVEVDPGDIKKFTALYSMNARPALAGNRRYVLSSS
>SMa1561 cheB2, probable CheB2 chemotaxis methylesterase
MVRILLATSTVELEDLVKRAIEGDASAELVLIARSGREAVRMTGELLPDI
VAVELCPSGDDSAETVREIMIAAPTPVVMLSHRDGSQLGTISARALEAGA
LAVIPAPAAHGMQLEQPAIEKFLSTIKAMSQVKVVRQWRQKVRGDRAAKD
QPPTARTPIGIVGIAASTGGPAAIRAILKDISADLPVPILIVQHMSNGFI
DGVAASLNATVPLTVKVARNGELLKPGTVYLAPDNCQLGVSGRSRLRVSD
DAPVNGFKPSGSYLFGSIARAFKGESLAVVLTGMGDDGTEGLRALRMAGG
KAIAQDEKSSVVFGMPKSAIGAGLVDLVLPLESIAENITAIARGRSEPEG
ETRT
>SMa2371 codA1, putative CodA1 cytosine deaminase
MFDLIIRNANLPDGRQGFDIGLAGGKIAAIEKSITASPGEEIDAAGRLVS
PPFCDPHFHMDATLSLGLPRMNISGTLLEGISLWGELRPLLTKEALVERA
LRYCDLAVTQGLLYIRSHVDTSDPRLVTAEALLEVKEQVAPYIELQLVAF
PQDGYFRAPGGVASLERALDMGIGIVGGIPHFERTMEDGARSVEALCRLA
ADRGLPVDMHCDETDDPMSRHIETLAAETVRFGLKGRVAGSHLTSMHSMD
NYYVSKLISLMAEAEINVIPNPLINIMLQGRHDTYPKRRGMTRVRELMAA
DLNVSFGHDCVMDPWYSMGSGDMLEVAHMAIHVAQMAGIEDKCKIFDAIT
VNSAKTMGLEGYGLDIGCKADLVVLQAADVTEALRLKPNRLFVIKAGKVI
ARTAPRVGELFLSGRPASIDMGRDYVPPVLQR
>SMa1578 cpaA2, probable CpaA2 pilus assembly protein
MISTSAAWFAFLLFAGAMTYAGIRDVATMTISNRVVVFLVIAFAILAPAA
GLNLATVMSSVVVASAVLACTFVLFAAGWIGGGDAKLLPVAVLWLGADLA
LPFILYTSVIGAALTVGLLQLRRVPLPLALKKNAWAKRLLDRETGIPYGA
AMAPAALLLLPESHWCSVLL
>SMa1568 cpaF2, probable CpaF2 pilus assembly protein
MPSAFAFNELKLLNLGNRATTMFGKKSISEERTAEAAVQPVVENELQAAH
APVQVRKPPQVPATAKPENAEQYYSLKKEIFSALIATIDVAALSNMDGEQ
ARNEIGAIINDIVAAKKAGISMAEQNDLLSDICNDILGYGPLEPLLARDD
IADIMVNGANQVFIEVNGRVQETGIRFRDNEQLLNICQRIVSQVGRRVDE
SSPICDARLGDGSRVNVIAPPLAIDGPTLTIRKFKKEKLTLDQLVRFGSI
SPEGAEVLKIIGRVRCNVLISGGTGSGKTTLLNCLTGYIDHGERVITCED
AAELQLQQPHVVRLETRPPNIEGQGEITMRNLVKNCLRMRPERIIVGEVR
GPEAFDLLQAMNTGHDGSMGTLHANSPREAMARIEAMITMGGSSLPAKTI
REMLVSSVDVIVQAARLRDGSRRITHITEVLGMEGEVVTTQDLFVYDILG
EDEKGNIIGRHRSTGIGRPAFWDRARYYGEEARLAAALDAAELKAAA
>SMa0181 cspA5, probable CspA5 cold shock protein transcriptional regulator
MPKGTVKFFNDDKGFGFITPEDGGTDVFVHVSALQHGGSLKEGDKVSYDV
GQDRKTGKSKAENVSVL
>SMa0738 cspA6, probable CspA6 cold shock protein transcriptional regulator
MATGTVKFFAQDKGFGFITPDNGGPDVFVHISALGFGGSLQDGQKVSYEL
GQDRKTGKSKAENVSIL
>SMa0570 cyaF4, putative CyaF4 adenylate cyclase
MERKLCAILAADVVGYSALMERDEAGTFERLHAGRKELFEPEIARHHGKV
FKLMGDGLLAEFGSVVDAVECAVSLQRGLTERNAAVPHDQRIRVRIGINL
GEVIVEREDRYGEGVNVAARLQQLAEPGGICVSGKVAREVEKKLAFGFEP
MGEQKVKNIIEPVQAFRIIIEWQARRRPVIRFQRYWVGTGTAVLALLLVL
AGAAWQFWPTATVSGKPSVAVLPFDNYGGDEASGRLADGLTEDIITDLAR
FPEFKVIARNSTETYKGKAIDVREIGKALDVGFVVEGSIAREADRVRVTA
QLIDSKQGRHLWSQRWDRPDKEVFVIQAEIAEQIANRLGGGAGLVQESGR
IAAHRKVPGNLNAYELYLLGTERLEQLDQANLEAALSLLTQAVQSDPGLA
RGWVELFHTHDLLAGLGIEPERNRALADAAAERALTLDPSDPEAHAVYGS
SLGMRGDFARAEAEYEAALRMAPNAAEILIFYIGWASTFGKPERGADLVE
RAIQLDPNYPGWANRPFGLAYFMAGRYPEAVTMFERLGIERHNRWSWAAH
AGALAAAERRTEAAALVARAMAAHPDLSIELIANEPGWSDAERRRFINTM
RLAGFPACTKPEVLAKIEKPLRLPECASL
>SMa1583 cyaF5, CyaF5 adenylate cyclase
MAKESIRRRLAAILAADAVGYSRLMERDEKSTHTLLMARWKEVLEPLVGI
HQGRVFKRTGDGVLVEFGSAVNAVECAAALQQAMAAANRDLPEDRAIVLR
VGVNLGDIMVEDSDLFGDGVNVAARIEALADPGGVAISDGIHEYVHGRTD
IDFVDSGYHEVKNIERPVHIWTWSPKDRAREPPNIAAEPPPQLPAKPSIA
VLPFDNMSGDPEQGYFADGITEDIITDLSKVSGLFVIARNSSFAYKGKTP
DIRKVSRELGVRYVLEGSVRRAANRIRINAQMIDGTTGGHLWAERYDRGL
EDIFAVQDEVTRTIVNALRVKLTAGEEERRESRGKVDPEAYDLLVRSRQA
ILQFNALSSMEARRMLHRVLEIDPGMAAAHASLSIIALTDFINQWNGATP
DNLTQALGLAQEAIDTDGSEPQGHYTLALALSWMRRLDEAEHAAERAIEL
DPNSANAYTALGTIRDFQGRHEEALALYTRAHRLDPQFDLSLHFQGRALL
NLGRFDEAEVAFKRRLLLAPRSDMTRFYLACLYGRTGRHEEARGYWREVL
GVNPSFSVDHLRRSLPYQDPHLMDRLVEGLREAGVSI
>SMa1099 cycB1, Putative CycB1 cytochrome c-552 precursor
MRVTIASIIAGSNAIVPAVHAQDIHEGRQLALEVCAVCHAVLTGQAQSPF
AEAPSFEAVAAIPGMTAAALNVWLTAQDHPTMPNIVLSQTDVQDVSAYIL
SLRK
>SMa1170 cycB2, putative CycB2 cytochrome c552
MKRICGLQCAGTVALLLTVTGEGAAADRLHGRAIAQRWCSECHVVAPGQV
RGSDTVPTFAQIGESERFDERSLAAFLATPHHSRMPNLSLTRAEIADLVA
YIKAK
>SMa1128 degP4, DegP4 protease like protein
MSGFDARQCVGGQIRLKSSINPGSATMLKISKLLVLAALFDPFGGAAEAT
GIDAGVSAMLERVQPAVVSIRTISSGIFRNEMLEDPNVRKVLGLPDEVFI
VQTGATATGSGVIIDREGGYIITSRHLVADADDVSVTLADGRVFHADRVG
EDAPTDLAVIKIDASELAALQWGNSSELKVGEFVAAIGSPFGLAQTATFG
IVSGLGRAGLGADEYQDYIQTDAAINPGNSGGALVNVSGKLIGIARGIAA
PDQTSTGIGFAIPSNIVAKIVRELISHGEYKRGWLGLSVTAAVDAQDGSA
TQPTGLIVNELACNSPAERQGVRLGDVITGLDDRAFSTEQAFRNAISLLP
ANARITLDVRRGETSQKLGMTLSDSVDRPDVSDQGSVIVKFPSESGSPAC
VPAGAVLVDVAPGSAAYSIGLRTGDYLTAINGKPPVSADQIGSVLESVEG
SASLDILRAGTAYRIDVQ
>SMa0705 dgoK2, putative DgoK2 2-dehydro-3-deoxygalactonokinase
MSIIELRRRSTLSNSAFEPVTVVLDWGTTGFRAFLVRSDGSLVDQKEGER
GIQSIAKGEHGRVVSEALASWRAGYGPLDIVAAGMIGSRNGWIEMPYVPT
PASAADVAAAARTEGLPEGNRITFLPGLTDPTGFPFPDVMRGEEAQLVGF
GLDRDIIVVLPGTHAKWAEIRGGHIERFRTFVTGEIYATLADHSFLSKVA
TAERDHAADAFAEGVALAQEESTRAGGLLTRLFAVRTGWLAGAIAPDEMK
SRLSGLIIGWEFVEARTGGWFKEGDTIAVVGDDDLVEVYGRVAENFGVKL
APAPADAAIRGALTIWRRHRLAAK
>SMa0892 dnaE3, DnaE3 putative DNA polymerase III alpha chain
MRYAELQVTTHFSFLRAASSAEELFATARLMGIEALGVVDRNSLAGIVRA
LEASRATGLRLVVGCRLDLQDGMSILVYPTDRAAYSRLTRLLTLGKGRGG
KANCIIHFDDVALYAEGLIGILVPDLADEVCAVQLRKIAEVFGDRAYVSL
CLRRRPNDQLQLHELTNLAVKHRVKTIVTNDVLFHEHGRRQLQDVVTCIR
TGMTIDDVGFERERHADRYLKPPEEMARLFPAYPEALARTMEIVERCRFS
LEELVYQYPEEALILGMTAQQSLQHYTWEGVRARYPEGLPTHVEKTIRHE
LALIETMKYAPYFLTVFSIVRYARSQGILCQGRGSAANSAVCYVLGITSI
DPETNDLLFERFVSQERDEPPDIDVDFEHERREEVIQWIYKTYGHDKAAL
CSTVTRYRAKGAIRDVGKALDLPEDLIRTLSSGIWSWSETVGERQVRELG
LNPDDRRLTLTLRLAQQLMGAPRNLSQHPGGFVLTHDRLDDLVPIEPATM
ADRQVIEWDKDDIEALKFLKVDVLALGMLTCMAKAFALISEHKHEDIDLA
TIPQEDPATYAMIRKADTLGTFQIESRAQMSMLPRMKPRTFYDLVIQVAI
VRPGPIQGDMVHPYLRRREGKEKVEYPTPELEAVLHKTLGVPLFQESAMR
VAMVCAGFTGGEADQLRKSMATFKFTGGVSRFKDKLVNGMIRNGYTKEFA
EKTFSQLEGFGSYGFPESHAASFALIAYASNYIKCYFPDVFCAALLNSQP
MGFYAPAQIVRDAREHGVEVRPICINRSRWDCMLEPIDGSGGHAVRLGMR
LVRGLATADAARIVAARADEPFTSVDDMWRRSGVPVASLVELAEADAFLP
SLSLERRDALWAIKALRDEPLPLFTAAADREARAIAEQEEPEVELRQMTD
GQNVVEDYSHTGLTLREHPLRFMRDDLAKRRIVTCAQAMTAHDGQWLMAA
GLVLVRQRPGSAKGVMFITIEDETGIANIVVWPKLFERSRRVVLGASMMA
INGRIQREGEVVHLVAQQLFDLSADLSSLAERDGAFRPPTGRGDEFAHGS
PGSADSRGKAPPGVRARDILVPDLHIDTLKIKSRNFQ
>SMa1587 eglC, EglC ENDO-1,3-1,4-BETA-GLYCANASE
MSRTVTNALGEPLSYGGSSTAWFSASGSGPLLYGTAGNDSMWADSSVDVT
MIGDSGDDIYYLYSGVNRASEAPSAGVDTINTWMSYSLPENFENLTVTGV
EGFGFGNSASNIISGGSGSQTINGGAGNDVLTGAGGADTFAFKRGNGSDL
ISDFGSDDVVRLEGYGFTSFDHILANVAQEGLDLKLSLADGEYLVFANTS
ADQLHANQFSLALDRSVLTQTFSDDFNTLQLSDGTSGVWDPKYWWAPEKG
ATLTGNDELQWYVNPTYQPTASANPFSVTDGVLTITAKPASQAIQAETNG
YDYTSGMLTTYSSFAQTYGYFEMRADMPDDQGAWPAFWLLPGDGTWPPEL
DVVEMHGQDPNTVIATVHSNETGSQTSIASAARVTDTSGFHKYGVLWTEE
EIVWYFDDAAIARADTPSDMHDPMYMLVNLAIGGMAGPPTDGLMGGAEMK
VDYVKAYSLDADWHI
>SMa1389 etfA2, probable EtfA2 electron-transport flavoprotein, alpha-subunit
MAILLLADHDNSHLSDQTAKALTAAAKIGGDVHVLVAGQNVKGIAEQASK
LSGVAKVLAAEDASLANNLAEPLAALIVSLAGNYDTVVAAATSVGKNVMP
RVAALLDVAQVSEIIEVVSSDTYRRPIYAGNAIQTVQTSEPKKVITVRTA
SFATAQEGGSAPVETVAAAANPGLSAHVSDALSSSDRPELTSAKIIISGG
RALGSSEKFKEVILPVADKLGAAVGASRAAVDAGYAPNDWQVGQTGKVVA
PDLYIACGISGAIQHLAGMKDSKVIVAINKDEEAPIFQVADYGLVADLFE
VLPELEKAL
>SMa1391 etfB2, probable EtfB2 electron transport flavoprotein, beta subunit
MKILVTVKRVVDYNVKIRVKADGSGVELANVKMSMNPFDEISVEEALRLK
EAGKASEVVVVSIGPGKAEETLRTALAMGADRAILVETEDQVEPLAVAKI
VKGVAEAEQPGLIIVGKQAIDDDSNQTGQMLSALLGWAQGTFASKVEIGD
GKVNVTREVDGGLQTVELKLPAVVTTDLRLNEPRYASLPNIMKAKKKPLD
KKTPADFGVDTSPRLKVLKTEEPSGRKAGVKVKTVAELVEKLKTEAGVL
>SMa0009 fdhE, probable FdhE formate formation
MSVSPVQPDPSVIGGVPKAPFVLKPNLARLFNDRASRFEALAQGSHLAPY
LNFLAGITRIQSELVSALPPPEPVPADRVERARANAMPPIDRAAMGGSPD
CREVLQQFFEKAEALEKPAAAAEALAQVRTADEEMLTWMIGNVMADDLPV
ESLAHHLYVAAAMQIQAARLAAGLDGSRLVPIRVGVCPACGGRPVASMVI
GFHGAEGARYASCSCCATMWNEVRVKCLACGSTKGIGYQAVETGDEEATV
KAEVCDTCNSWMKILYQNKNPSLDVVADDVASLGLDLLMKDTEYKRAGFD
PFLMGY
>SMa0002 fdoG, probable FdoG formate dehydrogenase-O alpha subunit
MEAVPMNVDLSRRSFLKLAGAGAAATSLGAMGFGEAEAAVVAHVRPHKLT
TTTETRNTCPYCSVACGVIIYSKGDLRKGEAADIIHIEGDADHPTNRGTL
CPKGAALKDFVKSPTRLQYPMHRKPGSDKFERISWEDAFDRIARLMKDDR
DANFIAANAAGVPVNRWTTVGMLAASATTNETAWATFKFAKALGIVGFDN
QARVUHGPTVSSLGPTFGRGAMTNSWTDIKNTDLVVVMGGNAAEAHPCGF
KWVTEAKATRGAKLIVVDPRYTRTASVSDYYAPIRQGTDIAFLNGVMKYC
IDNDKVQWDYMKAFTNASYLVKDGFGYQDGLFTGYDAEKRDYDKSTWDYV
LGDDGFVVTDPALQHPRCVWNLLKAHLAPYTPEMVERICGTPKDKFLKVA
EMISECSSPTKTMTSMYALGWTQHSSGSQNIRAMAMLQLILGNIGVRGGG
MNALRGHSNIQGLTDLGLMSHLLTGYLTMPTEKDVDFTTYMSTRQFKPLR
PGQTSYWQNYRKFMVSFQKAMWGDAARIDNDWAFNYLSKLDVPAYDVLRV
FELMYAGKVNGYICQGFNPLLAFPNRDKNTKALSNLKWLVTMDPLDTETA
RFWENHGDFNPVDTASIQTEVFQLPTTCFAEEEGSLTNSGRWLQWHWAGG
TPPGEAKHDTYIVAQIFLRMKEMYRNEGGAFPDPILNLSWDYADPNEPTP
EELAKEINGRALTDLMDPANPMKVQVAAGKQILNFSQLRDDGSTMCGCWI
YSGNFNEQGNNMARRDNHDPDDTGAYLGWSFAWPLNRRTLYNRASADLQG
KPWDPSRKLLEWDGTKWAGYDVPDIAPTAKPDEIGPFIMNQEGTARLFSR
GLMRDGPFPAHMEPFESPVANVFNPKMRGNPVSRVFQTDVAQMGLSDEFP
YAATSYRLTEHFHYWTKHNRVNSALQPEFFVEISEELAEEKNIENGGWVR
VWSKRGSVKAKAVVTKRIRPLMCDGKPVHVVGIPLHWGFTGSAKKGLGPN
SLAPFVGDANIETPEYKAFLVNIEPSTAPEEATV
>SMa0005 fdoH, probable FdoH formate dehydrogenase-O, beta subunit
MMDSPRTAVSNPPVQPMESNLTERDLVRRSATTELPPPERQLTPVAKLID
VSKCIGCKACQSACVEWNDTHPGIGENVGYYTNPHDLTEDMFTLMRFTEW
VNPETDNLEWLIRKDGCMHCADPGCLKACPAPGAIVQYTNGIVDFVHENC
IGCGYCIKGCPFNIPRISKVDHRAYKCTLCSDRVAVGQGPACAKACPTQA
IVFGTKEDMKKHAEHRIADLKSRGYTNAGLYDPPGVGGTHVMYVLHHSDK
PHIYSDLPDDPKISAVVQAWKGVTKYTGLAAMGLVAAGAILHGVFGRANR
VQPEDEESAERLVDSAGAAGRTPDDKGQRS
>SMa0007 fdoI, probable FdoI formate dehydrogenase-O,gamma subunit
MTKASDLEPEDAIHRGPPVTVDRYGPGKRVNHWITASSLILLALSGLAMF
HPSLFFLTGLFGGGQNTRMLHPWIGVVLFFSFYIFFFQLWKANLFTRADM
GWFTGIRDVIGGHEDRLPEMGKYNAGQKVIFWAMALLIVALIITGVIIWD
QYFYSYTSIETKRFAVLAHAVAAVLIICVFIVHVYAAFWTRGTFRAMTKG
SVTGGWAWRHHRKWLKELAGRGRIDPAE
>SMa0834 fdxB, FdxB ferredoxin III
MISSFVTRDGSRWMPKYLSAIDGATCIGCGRCFKVCSREVMHLHGIDDVG
EILGPFDGEEDDFGGELNRMIMVVDSRGRCIGCGACARVCPRDCQTHVAA
DILAA
>SMa0811 fdxN, FdxN ferredoxin
MAFKIIASQCTQCGACEFECPRGAVNFKGEKYVIDPTKCNECKGGFDTQQ
CASVCPVSNTCVPA
>SMa0822 fixA, FixA electron transfer flavoprotein beta chain
MHLVVCIKQVPDSAQIRVHPVTNTIMRQGVPTIINPHDLAALEEALKLCD
TYGGEVTVVTMGPKMAEDALRKALTFGAHRAVLLTDRHFAGSDTLATSFA
LAQAIAEIGETFGTPDVVFTGKQTIDGDTAQVGPGIAKRLDLQQLTYVAK
ILSIDAASREITVERRAEGGSQILRTGLPCLVTMLDGADAIRRGRLDDAL
RAARTKVVKWSAADAGIAEPANCGLRGSPTVVKRVFAPTSREQKARQIDT
TNKPLREIADGLIAAIFADRPALKHDLGSTGQQGAPDVDRES
>SMa0819 fixB, FixB electron transfer flavoprotein alpha chain
MKKGLPKQFQDYRNVWVFIELEHGQVHPVSIELLGEGRKLADKLGVHLAG
VVIGPPGGQGTANAIADAFAYGADLSYLVESPLLAHYRNEPFTKALTDLV
LANKPEILLLGATTLGRDLAGSVATTLKTGLTADCTELNVDSDGSLAATR
PTFGGSLLCTIYTLKCRPQMATVRPSVMATPQRVNRPTGSIIRHDLKMLE
EEIATKVLAFFSDCDSTIANLAYADVVVAGGLGLGAVQNLQLLKDLARTL
GGDFGCSRPLVQKGWMPFDRQIGQTGNTIRPKLYIAAGISGAVQHRVGVE
GSDLIVAINTDPNAPIFDFAHLGVVADAISFLPALTEVFTKRLEPRNLEK
FVQ
>SMa0817 fixC, FixC oxidoreductase
MTKEKFDAIVVGAGMSGNAAAYAMASRGLKVLQLERGEYPGSKNVQGAIM
YANMLEAIIPDFRNDAPLERHLVEQRFWIMDDTSHTGMHYRSDDFNEVTP
NRYTIIRAQFDKWLSRKVCEAGGTVLCETTATGLEWDSAGKAIGVRTDRA
GDVVLADVVVLAEGVNGLLGTRAGLREMPKSKNVALAVKELHFLPEEVIA
ERFGLTGDEGCVIEAGGTISRGMAGLGFLYTNKESISLGIGCLISNFAET
MERPYALLDAFKRHPSIQPLIAGSEVKEYAAHLIPEGGFNAIPRLCGNGW
VVVGDAAQLNNAVHREGSNLAMASGRMAGEAISIIKSRGGVMDKASLSLY
KTMLDKSFVVEDLSKQKDMPSLLHTNSPNFFTTYPQLISHAAQNFVRVDG
TPKIEREIATTAAFLRARSRWGLVSDAVRLASAWR
>SMa1211 fixG, FixG Iron sulfur membrane protein
MLHQPKTKATVGRLDAETVNAARVRGPLYEKRRKIFPKRAEGRFRRFKWL
VMLVTLGIYYLTPWIRWDRGAHAPDQAVLIDLASRRFYFFFIEIWPQEFF
FVAGLLVMAGFGLFLVTSAVGRAWCGYACPQTVWVDLFLVVERFIEGDRN
ARMRLDAGPWSLDKIRKRVAKHAIWVAIGVATGGAWIFYFADAPSLMSSL
VALDAPPVAYTTIGILTATTYVFGGLMREQVCTYMCPWPRIQAAMLDENS
LVVTYNDWRGEPRSRHAKKAAAAGEVVGDCVDCNACVAVCPMGIDIRDGQ
QLECITCALCIDACDGVMDKLGRERGLISYATLSDYAANMALATSGGTAA
IDPSRVRNAHGAFRDKVRHLNWRIVFRPRVLVYFGVWATVGFGLLFGLLA
RDRLELNVLHDRNPQFVVESDGSVRNGYMVKLLNMIPEQRTISLTIEGMP
AATMRVAGQATGDGRRVTIGVEPDKVTPLKVFVTLPKGRFAEAEEGFSLI
AEDPSSHERDVYQANFNLPGAAGR
>SMa1210 fixH, FixH nitrogen fixation protein
MSTATKQRSPKRGFTGWHMVAVMSLFFGTVISVNLVMAWNASRSWSGLVV
ENTYVASQQFNGKVAEGRAFQASGIKGRLTTEPGAIRYVLTRNGEPEQKI
DKVIAVLKRPVEEHEDLRVELHPRGEGAFVLAEELKPGQWIAAMMAMAGD
AVVHRQTIRFIAEGRDK
>SMa1209 fixI1, FixI1 copper transport ATPase
MSCCASSAAIMVAEGGQASPASEELWLASRDLGGGLRQTELSVPNAYCGT
CIATIEGALRAKPEVERARVNLSSRRVSIVWKEEVGGRRTNPCDFLHAIA
ERGYQTHLFSPGEEEGDDLLKQLILAVAVSGFAATNIMLLSVSVWSGADA
ATRDLFHWISALIAGPALIYAGRFFYKSAWNAIRHGRTNMDVPIALAVSL
SYGMSLHETIGHGEHAWFDASVTLLFFLLIGRTLDHMMRGRARTAISGLA
RLSPRGATVVHPDGSREYRAVDEINPGDRLIVAAGERVPVDGRVLSGTSD
LDRSVVNGESSPTVVTTGDTVQAGTLNLTGPLTLEATAAARDSFIAEIIG
LMEAAEGGRARYRRIADRAARYYSPAVHLLALLTFVGWMLVEGDVRHAML
VAVAVLIITCPCALGLAVPVVQVVAAGRLFQGGVMVKDGSAMERLAEIDT
VLLDKTGTLTIGKPRLVNAHEISPGRLATAAAIAVHSRHPIAVAIQNSAG
AASPIAGDIREIPGAGIEVKTEDGVYRLGSRDFAVGGSGPDGRQSEAILS
LDFRELACFRFEDQPRPASRESIEALGRLGIATGILSGDRAPVVAALASS
LGISNWYAELSPREKVQVCAAAAEAGHKALVVGDGINDAPVLRAAHVSMA
PATAADVGRQAADFVFMHERLSAVPFAIETSRHAGQLIRQNFALAIGYNV
IAVPIAILGYATPLVAAVAMSSSSLVVVFNALRLKRSLAAGRGATPGTLI
HSGAVTS
>SMa0621 fixI2, FixI2 E1-E2 type cation ATPase
MSCCSGIAVPSAGAASRTSISPEEIRLASRDLGDGLRQTAMTLPDAHCAA
CIAAVEGALRKISGVELARVNLSARRVTINWRGNDDESPDFAAALAKIGY
ASHLASIEEETQDPVLASLLKALAVAGFSAMNIMILSVSVWSGADPATRH
AFHLVSAALALPAIVYSGRFFYRSAWAALRHGRTNMDVPISVGVLLAFAL
SVYDTLHNAAFAYFDASTSLLFVLLAGRTLDHLMRGRARSAVGALARLSP
RGASVVQADEAIDYVPLSEIQPGMRLLVAAGERVPVDGVVVKGASELDAS
IVSGESEWRRAAPGSALQAGVMNLANPLTLLATASVDGSFLAEMTRMMEA
AESGRSTYRRIADRAASLYAPVVHGVALLSMVAWLFGTGDLHKSVTIAIA
VLVITCPCALGLAVPMVQVVAVRRLFERGIMARDGSAFERLNEIDTVLFD
KTGTLTLGEMRLVNAGDIQPRLLSLAAAMARVSRHPASVAIALADPRRPV
APVEFDSLEEVHGCGIEGRAGDAVYRLGRPSWASTAKQVDLGTSSTTVLS
KDAETIAVFAFEETVRPGARELVQTLRSAGLSVRILSGDRSAAVSSIARQ
LDIEAFSAELLPGEKVEAIRALAATGRKVLMIGDGLNDAPALAAAHVSIA
PSSATDVGRSASDFVFLGQSLLAVRDIIQTAARADVLIRQNFAMAIAYNV
VSVPFAIGGVVTPLAAALAMSLSSIVVVGNALRQGAKVKKGPRTFGATKP
ATAKI
>SMa1227 fixJ, FixJ Transcriptional activator
MTDYTVHIVDDEEPVRKSLAFMLTMNGFAVKMHQSAEAFLAFAPDVRNGV
LVTDLRMPDMSGVELLRNLGDLKINIPSIVITGHGDVPMAVEAMKAGAVD
FIEKPFEDTVIIEAIERASEHLVAAEADVDDANDIRARLQTLSERERQVL
SAVVAGLPNKSIAYDLDISPRTVEVHRANVMAKMKAKSLPHLVRMALAGG
FGPS
>SMa1225 fixK1, FixK1 Transcriptional activator
MYAAAQAKPQSIEVEHLGPAPMSGPRLVATYKPGREIYAQGDLNDKCYQV
STGAVRIYRLLSDGRRQVVSFHLPGEMFGFEAGSNHSFFAEAITETTLAI
FGRRNMQERSRELLALALTGMARAQQHLLVIGRQCAVERIAAFLVDLCER
QGGGRQLRLPMSRQDIADYLGLTIETVSRVVTKLKERSLIALRDARTIDI
MKPEALRSLCN
>SMa0762 fixK2, FixK2 transcription regulator
MYAAAQAKPQSIEVEHLGPAPMSGPHLVATYKPGREIYAQGDLNDKCYQV
STGAVRVYRLLSDGRRQVVSFHLPGEMFGFEAGSNHSFFAEAITETTLAI
FGRRNMQERSRELLALALTGMARAQQHLLVIGRQCAVERIAAFLVDLCER
QGGGRQLRLPMSRQDIADYLGLTIETVSRVVTKLKERSLIALRDARTIDI
VRLEALRSLCS
>SMa1229 fixL, FixL Oxygen regulated histidine kinase
MLSKSGIERTQWGRRVVRWRGDGVAAYIVAAIVTSSVLAIRMIRAEPIGE
GLLLFSFIPAILVVALIGGRNPILFAAGLSLVAAVSHQQISSADGPSVVE
LLVFGSAVLLIVALGEVLEAARRAIDRTEDVVRARDAHLRSILDTVPDAT
VVSATDGTIVSFNAAAVRQFGYAEEEVIGQNLRILMPEPYRHEHDGYLQR
YMATGEKRIIGIDRVVSGQRKDGSTFPMKLAVGEMRSGGERFFTGFIRDL
TEREESAARLEQIQAELARLARLNEMGEMASTLAHELNQPLSAIANYSHG
CTRLLRDMDDAVATRIREALEEVASQSLRAGQIIKHLREFVTKGETEKAP
EDIRKLVEESAALALVGSREQGVRTVFEYLPGAEMVLVDRIQVQQVLINL
MRNAIEAMRHVDRRELTIRTMPADPGEVAVVVEDTGGGIPEEVAGQLFKP
FVTTKASGMGIGLSISKRIVEAHGGEMTVSKNEAGGATFRFTLPAYLDER
IVAND
>SMa1220 fixN1, FixN1 Heme b / copper cytochrome c oxidase subunit
MKHTVEMVVLSVGAFLALVGAGLAQDRLFGAHMWVLFFALLAGTLVLMRR
VDFRPAVAGHPGRRREYFDEVVKYGVVATVFWGVVGFLVGVVVALQLAFP
ELNVEPWFNFGRVRPLHTSAVIFAFGGNALIATSFYVVQRTSRARLFGGD
LGWFVFWGYQLFIVLAASGYLLGITQSREYAEPEWYVDLWLTIVWVAYLV
AFLGTIMKRKEPHIYVANWFYLAFIVTIAMLHVVNNLAVPVSFLGSKSYS
AFSGVQDALTQWWYGHNAVGFFLTAGFLAMMYYFIPKQVNRPVYSYRLSI
IHFWAIIFMYIWAGPHHLHYTALPDWAQTLGMVFSIMLWMPSWGGMINGL
MTLSGAWDKIRTDPVVRMMVMAVAFYGMATFEGPMMSIKTVNSLSHYTDW
TIGHVHSGALGWNGLITFGAIYYLVPKLWNRERLYSVRMVNWHFWLATLG
IVVYAAVMWVAGIQQGLMWREYDDQGFLVYSFAETVAAMFPYYVMRAAGG
ALFLAGALLMAFNVTMTILGRVRDEEPIFGAAPLPAPAE
>SMa0765 fixN2, FixN2 cytochrome c oxidase polypeptide I
MKHTVEMVVLAVGAFLALVGAGLAQDRLFGAHMWVLFFVLLGGTLVLMRR
VDFRPAAAGRRAGETEYFDEVVKYGVIATVFWGVVGFLVGVVVALQLAFP
DLNVEPWFNFGRVRPLHTSAVIFAFGGNALIATSFYVVQRTSRARLFGGD
LGWFVFWGYQLFIVLAATGYLLGITQSREYAEPEWYVDLWLTIVWVAYLA
VFLGTVLMRKEPHIYVANWFYLAFIVTIAMLHIVNNLAVPVSFMGSKSYS
AFAGVQDALTQWWYGHNAVGFFLTAGFLAMMYYFIPKQVNRPVYSYRLSI
IHFWALIFMYIWAGPHHLHYTALPDWAQTLGMVFSIMLWMPSWGGMINGL
MTLSGAWDKIRTDPVVRMMVMAVAFYGMATFEGPMMSIKTVNSLSHYTDW
TIGHVHSGALGWNGLITFGAVYYLVPKLWNRERLYSLQMVNWHFWLATLG
IVVYAATMWVAGIQQGLMWREYDDQGFLVYSFAESVAAMFPYYVMRAAGG
ALFLAGALVMAFNVTMTILGRVRDEAAALDAAPLPAPAE
>SMa0612 fixN3, FixN3 cytochrome c oxidase subunit 1
MGQLTTRERDLAAAILLVLAIVGIAMAAAGRFDPLGVHGAVVLLYSLALL
YLIMSSSFGPPPDPSRISRYYDDPIKAGVWFTLFWAIFGMFIGVWAAAQL
AWPSLNFDTAWASFGRIRPAHTTGVIFGFGGNALIATSFHVVQRTSRARL
ADQLSPWFVLFGYNLFCILAVTGYFMGVTQSKEYAEAEWYADLWLVIVWV
TYFILYIRTLARRREPHIYVANWYYMAFIVVVAILHIINNLTVPVSLGHA
KSYTIWSGVQDSMVQWWYGHNAVAFFLTAGFLAMLYYYLPKRAERPIFSY
RLSILSFWGITFFYMWAGSHHLHYTALPHWVQNLGMTFSVMLLVPSWASA
GNALLTLNGAWHKVRDDATLRFIMMAAFFYGLSTFEGSFLAVRPVNSLSH
YTDWTVGHVHAGALGWVALITYGSLYTLVPAIWKRERMYSAALVEVHFWL
AFAGTVIYVFAMWNSGIIQGLMWRTYTGDGTLAYSFVDSLVAMYPYYIAR
AFGGLLFLIGAVVATYNIWMTVRGVPALAERHGDVPVAAPLPEGAATGPA
E
>SMa1216 fixO1, FixO1 c-type cytochrome
MSILDKHAILERNATLLLIGSLLVVSIGGIVEIAPLFYLENTIEKVEGMR
PYSPLELAGRDIYIREGCYVCHSQMIRPFRDEVERYGHYSLAAESMYDHP
FQWGSKRTGPDLARVGDRYSNEWHVQHMIEPRSVVPESVMPSYAFLKETP
LEVKNVAMSLEANRAVGVPYTDEMIGNAAADLKAQADPNADGSGVEARYP
KAKLGDFDGDPQRLTEMDALVAYLQMLGTLVDFSTYDDAAGYR
>SMa0766 fixO2, FixO2 cytochrome c oxidase
MSILDKHALLERNATLLLVGSLLVVSIGGIVEIAPLFYLENTIEKVEGMR
PYSPLELAGRDIYVREGCYVCHSQMIRPFRDEVERYGHYSLAAESMYDHP
FQWGSKRTGPDLARVGDRYSNEWHVQHMIEPRSVVPESVMPSYAFLKDTP
LEVTNIAMNLEANRAVGVPYTDEMIDNATADLKAQADPDADASGVEARYP
KAKLGDFDGDPQRLSEMDALIAYLQMLGTLVDFSTYDDTTGYR
>SMa0615 fixO3, FixO3 cytochrome-c oxidase subunit
MRELIHRKLERTAIGFVLAIILAASVGGIVEIAPLFTIDETVEDVEDMRL
YTPLELAGRNIYIREGCYACHSQMIRTLRDEVERYGPFSLAVESKYDHPM
LWGSKRTGPDLARVGGKYSDFWHVAHLTNPRDVVPESNMPAYAWLARTPL
RLDDLGSHLEAQRSVGVPYTDEMIENAARDAFGQAVPDSEQASGVTERYG
DETQVSAFDGVATRVTEMDALVAYLQVLGRLTKAAYQNTAAPEQVPDPTN
>SMa1213 fixP1, FixP1 Di-heme cytochrome c
MADKHKHVDEVSGVETTGHEWDGIRELNNPMPRWWVYSFYATIIWAIGYA
IAYPSWPMLTEATKGMLGYSSRAEVSVELAAAKAAQAGNLEQIASSSVEE
IIANPQLQQFAVSAGASAFKVNCAQCHGSGAAGGQGFPNLNDDDWLWGGK
PQEIYQTIAHGVRHAPDGETRVSEMPPFGDMLTPELMQQTAAYVVSLTQA
PSQPHLVQQGKQVFADNCASCHGADAKGNREMGAPNLADAIWLKGEGEQA
VITQMKTPKHGVMPAWLPRLGDDTVKQLAVFVHSLGGGE
>SMa0769 fixP2, FixP2 cytochrome c oxidase
MTDKHIDEISGVETTGHEWDGIRELNNPLPRWWVYSFYATIIWAIGYAVA
YPSWPMLTESTKGVLGYSSRAEVSAELAEAKAAQAGNLERIASSAVEDIM
ANPELKQFAITIGASAFKVNCAQCHGSGTAGGKGFTNLNDDEWLWGGKPE
EIYQTIAHGIRYSGDGETRVSEMPAFTDTLAPREVRATAAYVASLTGTPS
NPALVEPGKQLFAENCASCHGADAKGTREFGAPNLADAIWLNGEGEQAII
DQMKSPKHGVMPAWLQRIGDPVVKELAVFVHSLGGGE
>SMa0617 fixP3, FixP3 cytochrome c oxidase membrane anchored subunit
MDVEEVDPISGRRTTGHEWNGIKELDTPVPRGVLLFLVVTHVFALLWWVL
LPTWPLGTTYTKGLLGIDERNVVEEKLAAAAAARAVWEKRIDTLSYEQIR
ADEQLMATVRSTGHQLFGDNCAVCHGIDGKGRSNYPDLTDDDWLWGGGPE
DIEQTLRVGINTRHPESRVAQMPSFGREQMLERNQVRDVAAYVYSLTNPG
YSTPENIGRIEAGREVFLTSCAACHGENARGSREVGAPNLTDAYWIYGGT
MQTIIESVHGGRQGHMPTWDERLTSAEIKILALYINSLGVEKP
>SMa1214 fixQ1, FixQ1 cbb3-type cytochrome oxidase
METYTAMRHFADSWGLLAMTLFFLGVVFFIFRPGAKNAAAQASVIPLKED
>SMa0767 fixQ2, FixQ2 cbb3-type cytochrome oxidase
METYTAMRHFADSWGLLAMTLFFLGVVLFIFRPGAKKSAAQASAIPLNED
>SMa0616 fixQ3, FixQ3 nitrogen fixation protein
MEVTHETLVEAAKTWGLFYLIGFSICVIVYAFWPANRERFDRAKRGILEE
DDQPWT
>SMa1208 fixS1, FixS1 nitrogen fixation protein
MNTLIYLIPVALSLGGLGLVAFLWALKSGQYEDLDGASWRILDDGDGEGE
SSQTL
>SMa0622 fixS2, FixS2 nitrogen fixation protein
MNYLALLVPVALAMGFVGLLAFFWSLRSGQYDDLDGAAERILLDDEEDEG
PLPDPVRDGTASRRQRSPKEDGESKLYSGVSNAKHG
>SMa1226 fixT1, FixT1 Inhibitor of FixL autophosphorylation
MLDGKTIIVVAADQGLRRSVAFALEVEGYYTESYDSVQKSEASCREALCA
IVDDDILRTEPQAAAQFLSNRGGRAILLVDGLSALQPPVDYATLTKPFTG
ADLLGVINSLVVAAK
>SMa0760 fixT2, FixT2 transcription regulator
MLDGKTIIVVAADQGLRRSVAFALEVEGYSTESYDTVQKAEASSGEALCT
ILDDEILKSETVAATQLLKNLGRRAILLVDGLSAPQPPADYTTLTKPFSG
ADLLGVINSPTEAAK
>SMa0810 fixU, FixU nitrogen fixation protein
MKVTIRITGDALSAYIPKKDLEEPIISVANEDLWGGSILLRNGWRLALPH
LPQDTRLPVTVEANIRRH
>SMa0816 fixX, FixX ferredoxin-like protein
MKTAIAERIEDKLYQNRYLVDAGRPHITVRPHRSPSLNLLALTRVCPAKC
YELNETGQVEVTADGCMECGTCRVLCEANGDVEWSYPRGGFGVLFKFG
>SMa0260 gabD3, GabD3 succinate-semialdehyde dehdyrogenase
MTFHAKFSGYADPALHAAGLYIGGKWQSGSGITVLDPSTGNLLAEVADAS
IEDAQRAVDAADAAAAGWRATPARQRSEILRRWYQLMTQHAEELATLIAL
ENGKALADARGEVAYAAEFFRWYAEEATRIPGEFRHTPSGSHNILVDHEP
IGIAVLITPWNFPAAMATRKIGPALAAGCTVILKPASETPLTAYAMARLG
EEAGVPPGVVNVLTTSNPGGITNAMLADPRVRKLSFTGSTGVGRVLLAEA
AKSVVSCSMELGGNAPFIVFDDADLEVALDGAMIAKMRNAGEACTAANRF
YVQAGIHDAFVAGLTARMKSLKLGPGYDPETQCGPMITQNAVRKIDRLVS
EALAAGARATTGGKPLTENGYFYPPTVLENVPVNASIAREEIFGPVAPVY
KFESDDEAIRLANNTEYGLAAYIYSRDLKRAMKVGKRIETGMLGINRGLM
SDPAAPFGGVKQSGLGREGGVTGILEFMEPKYFAVDY
>SMa0805 gabD4, GabD4 succinate-semialdehyde dehdyrogenase
MTISETLLVKLKDPSLAVDKGLIGAEWLDRSDSGKTFDVSNPATGEVIAI
LPDMSRSETARAIDAAHAAQRAWAEKTGKERAAVLRNLYDLVVANADDLA
TILTMEMGKPLTEAKGEILYGASYVEWFGEEAKRVYGDTIPGHQPDKRII
VLKQPIGVVAAITPWNFPNAMLARKLAPAAAAGCAVVSKPAAETPLSALA
LALLAERAGLPAGVFNVILSTDSAEVGKEMCANDKVRKLTFTGSTNVGKI
LMRQGADQIMKLGLELGGNAPFIVFDDADLDAAVEGAMVAKYRNNGQTCV
CANRIFVQAGIYDAFAARLTAKVSEMTIGDGFEPDVDAGPLISEKALAKV
EEHIRDAVTKGADLVLGGNARGGLFFEPTVLTGATMDMKIAGEETFGPVA
PLFKFETENEVVSMANKTEFGLASYFYSKDVSKVFRVAEALEYGMVGINT
GLISTEVAPFGGVKQSGQGREGSKYGIDDYVETKYLCLSI
>SMa1848 gabD5, GabD5 succinate semialdehyde dehdyrogenase
MRLKDRELFRQLGLIGGEWIAGASGVVVDVIDPANQAVLGTVPDMGTAET
RAAIEAANAAFGPWKKKTHAERAAVLERWHALMIENLEDLAVLVTMEQGK
PLEEARGEIRYGAAFVKWFAEESRRIGGHTIPSPTSDRRIVVLKEAVGVC
AIVTPWNFPNAMITRKVAPALAAGCTVVIKPSEFTPFSALALGVLAERAG
IPAGVVNIVTGMPTAIGNEFMTNETVRKISFTGSTRVGSLLMRGAADSVK
RLSLELGGNAPFIVFDDANLDLAVEGAIASKFRNGGQTCVCANRILVQAG
VYDAFAEKLGARVNAMKVGPGTEPGIAIGPMINEAAIDKIDRHVEDAIAK
GAKLAARGRSVPEGRQYTAPIVLTGATTDMLLASEETFGPVAPLFRFETE
DEAIAIANGTPFGLAAYFYTEGLKRSWRVAEALEFGMIGLNTGAISTEVA
PFGGVKQSGLGREGAQVGIEEYLEMKSFHIGGLD
>SMa0228 gdhA, probable GdhA NADP-specific glutamate
MNVDEKLEPILAEVLRRNGGEHEFHQAVREVLESLGRVIAKHPRYAENAL
IERICEPERQIIFRVPWVDDKGQVQINRGFRVQFNSALGPYKGGIRFHPS
VNIGIIKFLGFEQTFKNALTGMPIGGGKGGSDFNPRGRSDGEIMRFCQSL
MTELHRHLGEYTDVPAGDIGVGGREIGYMFGQYKRLTNRYEAGVLTGKAL
FYGGSRARKEATGYGATYFVQRMIATKGLDFEGKRVTVSGSGNVAIYTME
KVIEFGGKIVACSDSNGYVVDEDGIDLELVKEIKEVRRERISEYARLKGA
GTHYIEAGSVWDVPCDVAMPSATQNELTGKDARTLVKNGVLAVGEGANMP
CTPEAVRIFQEAGVLFAPGKAANAGGVATSALEMQQNASRDSWTFEQTEA
RLATIMQAIHDRCAETAEEYGTPGDYVLGANIAGFVRVAEAMDALGVI
>SMa2135 glyA2, probable GlyA2 serine hydroxymethyltransferase, SHMT
MPGLFERQLKHDSVIAGAIAREMGRQRSEIELIASENIVSPAVLAAQGSV
MTNKYAEGYPGHRYYGGCQYVDLVEAAAIERAGMLFDASFVNVQPHSGAQ
ANGAVMLALLKPGDTFMGLSLAAGGHLTHGARPTMSGKWFNAVQYGVRES
DCLIDYDELEVKAIATRPKLIITGGSAYPRLIDFKRIRAIADSVGAAMMV
DMAHFAGLVAGGVHPNPVEIADIVTTTTHKTLRGPRGGMILTNNQDVAKK
VNSAVFPGLQGGPLMHVIAAKAVALGEALEDNFRQYARQMVANARALASA
LTERGYDIVSGGTDTHLILVDLRSKGVSGKDAEEALGRAGLTCNKNGIPF
DPAPPAVTSGIRLGTPAATSRGFREAEFNEVGALIANVLDALGTEQSGEQ
ERRARMSVHDLCAAFPIYSARH
>SMa0744 groEL2, groEL2 chaperonin
MAAKEVKFGRSAREKMLRGVDILADAVKVTLGPKGRNVVIDKSFGAPRIT
KDGVTVAKEIELEDKFENMGAQMVREVASKTNDIAGDGTTTATVLAQAIV
REGAKAVAAGMNPMDLKRGIDLAVAEVVKDLLAKAKKINTSDEVAQVGTI
SANGEKQIGLDIAEAMQKVGNEGVITVEEAKTAETELEVVEGMQFDRGYL
SPYFVTNPEKMVADLEDAFILLHEKKLSNLQAMLPVLEAVVQTGKPLLII
AEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKAMLEDIAILTGGTV
ISEDLGIKLESVTLDMLGRAKKVSITKENTTIVDGAGQKSDIEGRVAQIK
AQIEETTSDYDREKLQERLAKLAGGVAVIRVGGATEVEVKEKKDRIDDAL
NATRAAVQEGIVPGGGVALLRSSVKITVKGENDDQDAGVNIVRRALQSPA
RQIVENAGDEASIVVGKILEKNTDDFGYNAQTGEYGDMIAMGIIDPVKVV
RTALQDAASVASLLITTEAMIAELPKKDAPAMPGGMGGMGGMDMM
>SMa0124 groEL3, GroEL3 chaperonin
MSAKQIVFSTDARDRLLRGVELLNNAVKVTLGPKGRNVVIDKSYGAPRIT
KDGVSVAKEIELEDKFENMGAQMVRAVASKTNDLAGDGTTTATVLAASIF
REGAKLVSVGMNPMDLKRGIDLGVAAVLAEIKARATKVISSSEIAQVGTI
AANGDAGVGEMIARAMEKVGNEGVITVEEARTADTELDVVEGMQFDRGYL
SPYFVTNAEKMRVELEDPYILIHEKKLGSLQAMLPILEAAVQSGKPLLII
SEDVEGEVLATLVVNRLRGGLKIAAVKTPGFGDRRKAMLEDIAVLTAGQM
ISEDLGIKLENVTLDMLGRARRVLIEKDTTTIIDGSGDKASIQARVSQIK
AQIEETASDYDKEKLQERLAKLAGGVAVIRVGGATELEVKEKKDRIDDAL
NATRAAVEEGIVPGGGVALLRAKSALVGLTDDNADVTAGISIVRRALEAP
IRQIADNAGVEGSIVVGKLVDGRDHNQGFDAQTETYVDMIKAGIVDPAKV
VRTALRDAGSIASLLITAEAMIADIPERGSPQSTGNGAVDSMGY
>SMa0745 groES2, groES2 chaperonin
MASTDFRPLHDRVVVRRVESEEKTKGGVIIPDTAKEKPQEGEIVAVGSGA
RDESGKVVPLDVKAGDRILFGKWSGTEVKINGEDLLIMKEADIMGVIG
>SMa0125 groES3, GroES3 chaperonin
MTFRPLLDRVVIRRAEGNTQSKGGIIIPDTAKEKPQEGEVIAVGPGSRDE
SGKLIPLDVKIGDTILFGKWSGTEVKIDGEDLLIMKESDIMGIVANTVPT
AANAA
>SMa1497 gst12, putative Gst12 glutathione-S-transferase
MITLYDYELSGNCYKLRLLMSMLGIEYKTVPVDFYPGREHKSDWFLKLNP
LGQLPVIEDDGLVLRDAQAILVYIGSKYDPSGTWYPRDNPALLGEISQWL
AFADGITGTASAARLHDGLFYELDVEAARAGAHRLFRILDEHLWFGEQEG
RDWICSAPHPTVADIACFPYIMLSEEGGIPRQDYPAIRRWCDRLKRIKGF
TVMSGIFPAGPARAA
>SMa2115 gst13, Gst13 glutathione S-transferase
MIKLYDYELSGNCYKLRLLMSILGVDYKTVPVDFYPGREHKSEWFLKFNP
LGQLPVIDDDGLVLRDAQAILVYLAAKYDTAGSWYPRDNPELLGEISQWL
AFADSITSTASAARLHDALFYDFDIETARAGAHRLFRILDEHLWFGERQE
RQWICSAKHPTIADIACFPYIMLSEEGGISRQDYPAIRRWCDRVKRINGF
IVMSGVFPAGPAKAA
>SMa2319 gst14, probable gst14 glutatione S-transferase
MARDMRVRWALEEVGQPYGVRLLSFKAMKEPAHLALHPFGQIPTYQEGDL
ALFESGAIVFHIAERHAGLLPDDANARARAITWMFAALNTVEPPIVDREV
AEYLEDETWYEQRLPFIDERIRKRLGELSGRLGNADWLDGAFSAGDLLMV
TVLRRLEGSGILEEYPNLSAYVARGKARPAYKRAFVAQLAVFTAAPTA
>SMa1266 hemN, HemN coproporphyrinogen III oxidase
MQPALVAKYGEARLPRYTSYPTAPRFSPAIDANTYGDWLADIAPKQPASL
YLHIPFCRSMCWYCGCHTTITQRDQPILDYLDMLREEVHLVSAKTRAPLS
IDHVHFGGGTPTIMQPEEFRALVALLRERFEFASMTEIAVEIDPRTLEPD
MATALGEAGVRRASLGVQSFDPVVQKAINRIQSEEQTMEAVSRLRQSGVD
SINFDLIYGLPHQTVESCIATAEAAIRMGPERFAVFGYAHIPSFKKHQKL
IDEQALADAEGRVAQAEAIAATLAAAGYRRIGLDHFALPDDSLAIAQASG
RLHRNFQGYTTDACETLIGLGASAIGRTNDGYVQNEVPPGLYAQHIASGR
LATVKGYRMTPEDRLRAGIIERLMCDFGVDVPALATAHGFDPEMLLRGNT
RLAMLESDGILDIADGVIRLREGRRFLIRAAAAAFDAYIEQSGRTHSKAA
>SMa0387 hisC3, putative HisC3 histidinol-phosphate aminotransferase
MAIPSRPIREEIRSIAPYNAGLTLEEVRAKYHVDEVAKLSSNENPLGPSP
ALRRLFPDIGELARLYPDPQGRALCARLAASFDVENNQVILGNGSEDLIA
VICRSVVRAGDTVVTLYPSFPLHEDYTTLMGGKVDRVTVTPDLSVDMDAL
LAAIARKPRMLMFSNPMNPVGSWLTPLQLAKVVAALDPETLIVVDEAYAE
YAAGDDYPSAAEVLKVTGLNWVVLRTFSKAYGLAGLRIGYGIVSDGSLCD
FFNRARTPFNTNAIAQVSALAAFDDTYHLNRSVELALVERERMKKELATM
GYRIAPSKCNFLFFDARTEATPVAEALLRRGVIVKPWKQPRFETYIRVSI
GSPVENDHFIHALKEVEAVG
>SMa0398 hisD2, probable HisD2 histidinol
MTLRPQLARYKDKTMTSVSFYEYSKLNAEEKAALLRRSETDISGFIEKVA
PILEAVRTEGDKALARFGRELDKADVTEANLKVTAAEFDAAFKLVDASVL
ESVQFGIDNIRKFHEEQKPEAMWLKEIRPGAFAGDRFTPIQSVALYVPRG
KGSFPSVTMMTSVPAVVAGVPNLAIVTPPAPDGSVDAATLVAARLAGVET
VYKAGGAQAVAAVAYGTETVKPALKIVGPGSPWVVAAKRSLSGVIDTGLP
AGPSEVMILADDTVHGGLAALDLLIEAEHGPDSSAYLVTHSGRVAEEALA
ALPEHWARMTEQRTAFSKTVLSGKTGGIVLTSSIEESYEFVNAYAPEHLE
LLSEQPFIHLGHITEASEILMGTHTPVSIANFSLGPNAVLPTSRWARTFG
PLSVTDFVKRSSIGYVTAPAYPEFARHSHNLAIYEGFSSHALAVSPVRDA
YLKKGA
>SMa1014 hmrR, HmrR heavy metal dependent transcriptional regulator
MSGVSSKMIRYYEQIGLIRPALRTASSYRVYGDNDIHTLQFVRRARDLGF
SVEQIKDLLALWRDRSRNSANVKAVALEHIAELERKIAAIEEMTTTLKHL
ASHCHGDDRPECPIIEEIANAADGKKPRANARFGLSAL
>SMa1118 hspC2, probable HspC2 heat shock protein
MAEPATKLPIKSEEKGVERRAESWLPFESLRSEIDRLFDDFAPSLWHRPL
ASALMRRVPRLSELDVAPAVDLAETEKTYEITCELPGMEEKDIEVAISNG
TLTIRGEKQEEKKENKKEYVLSERRYGSFQRTFRMPDGVDTEKIAAKFSK
SVLSITLPKTPEAQQNERKIQIKVT
>SMa0512 idnD, IdnD L-idonate 5-dehydrogenase
MKAIVIHTAKDLRVEECAVEKPGPGEVEIRLAAGGICGSDLHYYNHGGFG
TVRLKEPMILGHEVSGHVAALGEGVSDLAIGDLVAVSPSRPCGACDYCLK
GLPNHCFHMRFYGSAMPFPHIQGAFRERLVAKASQCVKAEGLSAGEAAMA
EPLSVTLHATRRAGEMLGKRVLVTGCGPIGTLSILAARRAGAAEIVAADL
SERALGFARAVGADRTVNLSEDRDGLVPFSENKGTFDVLYECSGAQPALV
AGIQALRPRGVIVQLGLGGDMALPMMAITAKELDLRGSFRFHEEFATAVK
LMQGGLIDVKPLITHTLPLGEALKAFEIASDKGQSMKTQIAFA
>SMa0514 idnK, gluconate kinase IdnK
MAPHSARRGIVLMGVAGCGKSAIGAALATRLGATYVDGDDLHPPENIARM
SRGEPLTDDDRWPWLTLVGRRLAAPDGVLIIGCSALKRRYRDHIRNEAGA
PVTFVHLSGTKVLITARMGARAGHFMPVSLIESQFAALEPPTTDENVITV
DIDQPLEVLVDEIAVKLEETPS
>SMa0513 idnO1, IdnO1 gluconate 5-dehydrogenase
MSTELFDLTGKRALVTGSSQGIGYALAKGLAATGAEIILNGRDAAKLAAA
ARDLGAGHTLAFDATDHQAVRKAVDAFEADVGAIDILVNNAGMQHRTPLE
DFPADAFERLLKTNVSSVFNVGQAVARHMIKRGAGKIINIASVQTALARP
GIAPYTATKGAVGNLTKGMATDWARYGLQCNAIAPGYFDTPLNAALVADP
SFSDWLERRTPAGRWGKVEELVGACIFLSSDASSFVNGHVLYVDGGITAS
L
>SMa2211 ilvB2, probable IlvB2 acetolactate synthase large subunit
MRKGAEILADELAVNGAEIVYHVPGESFLTALDSLTTRHPRIRSVSCRHE
NGAAQMAEAYGKLTGKPGIAFATRSPGATNAVNGVHTAFQDSSPMILIVG
QVKRSILEREAFMSYDFRTMFAPMAKWVGQIDDPARIPEFVQRAWATALS
GRRGPVVLVVPEDVLEEKCDVAPSARPAVASHAGAPREEDLDRIFAMLAA
AAKPLVVVGGSGWSETAREDVQRFATNLGLPVVTTFRRRDIIDHRLHCYV
GEIGIGSNPTLLAHIREADFILMLNDALSDVNTIGAGYMEGFTLFSIPRP
RQRVVHVMPDHGDLNRVFQVDLALAADNDATARALAGRSATPRAEHADWT
STLRATLMKECEPQPCPGAIDLPGVMGWLRERLPEDAIVTNGVGAYATWS
QRYFPHYRLHTQLGPISGSMGYSLPAAISAKIAHPDRVVVEFVGDGCFQM
SSEELATAVQYGVNIIIVLFNNGLYGTIRIHEETRLEGRVNGTDLTNPDF
LLLAKAYGAHGERVTCTKEFAPAFERCLAAGKPALIELVVDPEAIHCRYS
LSDLRARRAARVPHEE
>SMa2333 kdpA, probable KdpA potassium-transporting ATPase A chain
MSVIGWLQISLLFLAVLVVVKPLGLYMAGVLSGEPNVLSPILGPVERGLY
GAAGIDPTREQGWLAYTLSMLAFSIAGFLLLFAILRLQAWMPLNPQGFGN
VPPDLAFNTAVSFVTNTNWQNYGGETTLSHFSQMMGLTVQNFVSAATGIS
IALAVTRSFARSAAPTIGNFWVDLTRSTLYVLLPLAVLVAVAFVAMGLPQ
TFRASVEATTLEGVKQTIALGPVASQEAIKQLGTNGGGFFNVNAAHPFEN
PTALSNYLNIFAMLSVSAALLYTFGQLVGNRRQGWAFIAVTYAFLIVGVG
IVYWAEAQGNPILSQLGLDPALGNMEGKEVRFGQAMTALYATVTTGLSDG
GVNGMHGSFTGLGGLVPMFLIQLGEVLPGGIGSGLYGMIVFAILAVFVAG
LMVGRTPELLGKKIEAREMKYAMLAVLILPLSILGFTAISAVMPSAVASV
GTAGPHGLSEILYAYTSAGGNNGSAFGGLSGNTLWYNTTLGFAMLLGRFA
YAVPVLAIAGSIAAKTRASTSKGTFPTDTPLFAGLLIGIILILGGLQFLP
ALALGPIVEHFAMLSGQTF
>SMa2331 kdpB, probable KdpB potassium-transporting ATPase B chain
MSNKPTTPNLVDPKILFPAAKAAFVKLDPRQLVRNPVIFVTEAMAALVTL
FFVLDVATGGGSRLFSGQIAAWLWFTVLFATFAEAVAEGRGKAQADFLRH
TKSELSARKLVAPEGRETKEIPATMLKVGDLVLVQAGELIPGDGEVVEGV
ASVNESAITGESAPVIREAGGDRSAVTGGTEVLSDWVKVRITTAPGSTFV
DRMIALIEGAQRQKTPNEIALSILLSGLTLIFLIAVVTLWGLASYSATVL
SVTVLSALLVTLIPTTIGGLLSAIGIAGMDRLVRFNVIATSGRAVEAAGD
VDTLLLDKTGTITFGNRMASDFLPVPGVTVEELADAALLASLADETPEGR
SIVALATGEFGRGASQTGIDAVVPFTAETRLSGVDHRGRRLRKGAVDSVL
RFAGLSDSKIPQEFRQAVDKVARTGGTPLAVADGNRLLGVVHLKDVVKPG
IKERFSELRAMGIRTVMVTGDNPITAAAIASEAGVDDFLAEATPEDKLAY
IRKEQNGGRLIAMCGDGTNDAPALAQADVGVAMQTGTQAAREAANMVDLD
SSPTKLIEIVEIGKQLLMTRGSLTTFSIANDVAKYFAIIPALFVTTYPAL
GVLNIMGLASPQSAILSAVIFNALIIVALIPLALKGVRYRPVGAAALLRG
NLLVYGLGGLVLPFAGIKLIDLAVSNLNLV
>SMa2329 kdpC, probable KdpC potassium-transporting ATPase C chain
MLNQLRPALVLTFALTLITGLGYPLLITGVAQALMPAEANGSLVRKGSVL
IGSQLIGQNFASEKYFWPRPSATGPEPYNAVASSGSNLGTTSDKLKERVA
ADIERLRAAGIGGEIPADAGMASGSGLDPHISPEFARVQIARVAKARGLP
EAGVDALVDRATQGRLFGLIGEPRVNVLELNLALDAPRT
>SMa0214 kduI, putative KduI DKI isomerase
MSCSPDAHPRKQEADAVTISVSVRQVVGPEDAARRNTQGLRDGFVIEALF
QPGRANLTYSHLDRMIVGGVVPAADRLVIDRVAETGTQRFLDRREAAIIN
IGGSGTVSVGDKDHVLGFQEALYVGMGGGALGFASDDANAPALFYVLSAP
AHRSCPTVHITRDMAKKLSLGSAEESNARTINQYVHPDVCESCQLLVGLT
MFEPGSVWNTMPAHVHDRRMEVYLYFGMQEATRIFHFMGEPGETRHVVLK
NHEAVLSPGWSIHSGAGTGRYAFIWAMAGDNMSFTDMDKVPMEALR
>SMa1798 kup2, Kup2 Potassium uptake protein
MADSLDHAPAQANNLPQFLALTIGAIGVVYGDIGTSPLYAFREALRPFGP
GGVGRDEVIGLVSLVLWTLTAIVTIKYVLFLLRADNDGEGGTLSLLALLL
KKGTKYPVLMFFAGVLGAALFIGDAMITPALSVLSAVEGLKLVAPALHDY
VLPISVVIILLLFAVQSRGTGAVSVFFGPITLVWFLMMAAAGVAHIGDDL
AILSAFNPLNAIGFLWNAGLIGFIVLGAIFLTVTGAEALYADLGHFGRHS
IQAAWFAVVFPALALNYLGQGALVLSHPDAISNPFFLMFPNWALLPMVIL
ATAATIIASQSVITGAFSLIRQAIHLGFLPRFEICYTSETQTGQIYLPLV
NTILLTGVLALMLMFGSSEALAPAYGVSITGAMVIDTILAFEFVRRQWGW
PALTAIAVLLPLFNLELVFLGANLLKVHHGGYVPILIAGTLITMMWTWRK
GVSVLREKTARQDIPLSQFMAIVERKSEHAPVQVPGTAIFLTATPDTTPA
VLLHNIKHNHVLHQHNVIMTIKTARVPYVPEKDRYTITKLSDRFSLLELR
FGFMDDQNVSRALARCRKEGFKFEIMSTSFYLGRRKLIADPQSGLPQWQD
KLFIAMADSAIDPTEYFHLPANRVVELGEQVII
>SMa2294 mrcA2, probable MrcA2 penicillin-binding protein
MKLIGYIFGITSAALVGIFGIGAVYLWEVANDLPDHRKLAEWEPALMTRF
YAYDGTPIAEYARERRLYLPIAAIPERVKAAFLSAEDKSFYSHSGIDVLS
VVKAAWSNAVNLSSGQRLIGASTITQQLAKNFLLSSDRTLRRKIKEAVLS
IRIERSLSKDRILELYLNDIYLGLGAYGIAAGALTYFEKSVSELTVAEAA
YLAALPKGPNNYNPHRHPERAVSRRNWVIGQMQRNGFVSAAQASNMMARP
LGVKPQAKNPVMAEANYFVEEVRREVAGQYGEAALYDGGLSVRTTLDPTL
QAAALMALRAGLVDYDQARGFRGPVAHIDISGDWATALAKHETFTDMKEW
QLAVVLSCSDAGVGIGLRRAKLGPEHSPTSPETGLIGRNTMRWALKLAAG
GKIRQVDSIDKVLSPGDVIYVEKVGNGYLLRQIPEVQGAIVSMDPNTGRV
LASVGGFSFAQSRFNRATQASRQPGSAFKPFVYAAALDTGYTPATLVMDA
PFSMPDGAGRLWKPKNYDGKFGGPSTLRTGLEMSRNLMTVRLARHLGMDL
VAEYGRAFGLYDKLAPYLPMSLGAGETTLTRLVSAYAVLANGGHAVQPSM
IDRIQDRHGRTIFRHDDRACDRCNATRWLHQEEPVLRDTRKQVLDPMTAY
QVTSMLRGVVERGTAHRVSQLGAPVAGKTGTTNDEKDVWFVGYTPTLVTG
IFMGYDAPKPMGYAKRAADLRPRSSLSS
>SMa1905 mrcB, putative MrcB penicillin binding protein B
MPRPSKTQCRSASGSDKGSGVAEATPNFAGDHAGAESVSSSEGEASAFIQ
ADQPQQPPRPFEQSRLAAHKLLQALRHDLHAISISTKVSAKAAASSLKAK
FNKSSRQADLPTIHNISQWSATAVANTGRGIRYLTRPLKFSRYAWQNTSG
WKVTVGLTFVTCIVLIGAVMVWALKDVPWNEIRNGTLKPVVVLETADGEP
LVRQGPYQGPYAQYNQFPPHLIDAVLSIEDRRFMDHFGIDVRGIGRALVR
NFEAGSVIEGGSTITQQLIKLQYLDSDRTIKRKIQEFVIALWLELKLGKK
EILTRYLNSAYLGAGATGMPAAARVYFNKNIDVLNLPESAMLAGLLRAPS
QWNPIDNLEGARQRTAVVLDTMVANGKITAPDAERVKESFATLNPTMPTP
RSGSWFADWISPRASEIAGPSPGTTTVRTTLVPHLQQIAERVVRNALDTE
GRAVGASQAALVAMTPDGAVVAMVGGRDYKQSQFNRAVTAMRQPGSTFKL
FVYYAALKAGFTPTDRVLDAPIDVNGWSPENSSGRYRGWVSIAEAFARSL
NAPTVALAQEVGLDNVIAAARELGIDAPLVSTPSLALGTSEVSLLELTSA
YASVRFGKAPVEPWGIVHFQAAGQPRAFRVGSHTTPAIDLSAYQSDLVGL
LQLAVERGTGREADPGAFAAGKTGTSQNNRDAWFVGFTEPLIAAVWVGND
DDTPMKGVTGGDLPAHIWRDFMREAMAAPAPIPERPEGTIVDSHGVPQSC
NITACSRSYRSFRPSDCTYQPYSGRRRLCEK
>SMa1236 napA, NapA periplasmic nitrate reductase
MTGELTRREMLKAHAAGIAAATAGIALPAAAQPVPGGVEALQITWSKAPC
RFCGTGCGVMVGVKEGQVVATHGDMQAEVNRGLNCIKGYFLSKIMYGTDR
LKTPLLRKRNGAFAKDGEFEPVSWDEAFDVMAEQAKKVLKDKGPTAVGMF
GSGQWTIFEGYAATKLMRAGFRSNNLDPNARHCMASAAYAFMRTFGMDEP
MGCYDDFEHADAFVLWGSNMAEMHPILWTRLADRRLGHEHVKVAVLSTFT
HRSMDLADIPIVFKPGTDLAILNYIANHIIQTGRVNEDFVRKHTTFMVGA
TDIGYGLRPDNPLEVKAVNAKDAAKMTPSDFESFKSFVSEYTLDKVVELT
GVEAGFLEQLADLYADPKRKVMSLWTMGFNQHVRGVWVNQMVYNLHLLTG
KISEPGNSPFSLTGQPSACGTAREVGTFAHRLPADMTVTNPEHRKHAEEI
WRIPHGIIPEKPGYHAVEQDRMLKDGKLNFYWVQVNNNVQAAPNTQNETY
QGYRNPDNFIVVSDVYPTITAMSADLILPAAMWVEKEGAYGNAERRTHVW
HQLVDAPGEARSDLWQMVEFSKRFTTDEVWSTDILDANPGYRGKTLYDVL
FKNGNVDSFPASEINKEYANREAEAFGFYIQKGLFEEYASFGRGHGHDLA
PYDRYHDERGLRWPVVDGKETLWRYREGYDPYVKPGEGVKFYGRPDGKAV
ILAVPYEPPAESPDDEYNVWLVTGRVLEHWHSGSMTMRVPELYKAFPGAV
CFMNAGDARDRGINQGAEVRIVSRRGEIRARVETRGRNRMPPGVIFVPWF
DASRLINKVTLDATDPISKQTDFKKCAVKIVSVA
>SMa1233 napB, NapB periplasmic nitrate reductase
MRGQNRLCRMMRSPGSMLGALLAILFVATGAIAQMADKRVPELSGPPQEM
GEVEAHPIPKWVVDDVRKERAYPDQPPVIPHSIEGYQLSVNTNRCLSCHK
RELTQESGAPMISVTHYMTREGQMLADVSPRRYFCTACHVPQADVRPLVG
NTFRDMSEMGVKQAGSE
>SMa1232 napC, NapC membrane protein, e-donor to periplasmic nitrate reductase
MAGIKRLLLWVWKILTTPAATLSLAFLTLGGFVGGVIFWGAFNTALELTN
TEEFCVSCHEMRANVYEELTRTIHFSNRSGVRASCPDCHVPHEWTDKIAR
KMQASKEVWGKIFGTINTREKFLDHRLELAKHEWARLKANDSLECRNCHS
SAAMDLSKQTQRAAEIHTRYLLPGRATCIDCHKGIAHELPNMQGVEPGWK
LPPELEGETLPSASAIDELKRVMDEAHSAALAN
>SMa1239 napD, NapD component of periplasmic nitrate reductase
MSDRSAPYHISSAVIVTMPHMRERVVGSLLEMPNVEVYAHEAGKIVVVIE
GTSTGMLGESLSRISTLEGVVAANMVFEHVETQGEVGHDRRTDAA
>SMa1241 napE, NapE component of periplasmic nitrate reductase
MPDTDQPAASLPPSRRRNEIITFLVLAFGIWPIVAVGVVGAYGFLVWMFQ
IIFGPPGPPAH
>SMa1240 napF, NapF Ferredoxin component of periplasmic nitrate reductase
MGEGIEISRRSFLRGRHKRGSGRVSPPGATAEGLEACTGCGRCADACPTH
IIRVIDDRPALDFFIAECTFCGQCAELCPEPVFTGRSQQFPHVAMIGESC
LARNRTDCQACRDACPTEAIRFRPRAGGPFLPELNEELCTGCGACLSVCP
VAAIGIREVEWERAHV
>SMa1077 nex18, Nex18 Symbiotically induced conserved protein
MKLNSLLFAAAITLGSVSAFAADKDVVDTAMEAGQFKTLGAALEAAGLIA
TLKETGPFTVFAPTDEAFAKLPAGTVENLLKPENKQKLTEILTYHVVAGR
VMAADVAGIDEAKSVNGKMIDIEVEGSTVKVNDAAVTAADIAASNGVIHV
IDKVIMPPEG
>SMa0815 nifA, NifA transcriptional activator
MAPTRLETTLNNFVNTLSLILRMRRGGLEIPASEGETKITAATRNSGSPS
AADYTVPKAAIDQVMTAGRLVVPDVCNSELFKDQIKWRGIGPTAFIAAAV
EVDHETGGMLWFECAEESDYDYEEEVHFLSMAANLAGRAIRLHRTISRRE
RTFAEEQQEQQNSRDEQSQSSARQRLLKNDGIIGESTALMTAVDTAKVMA
ETNSIVLLRGETGTGKECFAKLIHQHSTRQKKPFIKFNCPALSESLLESE
LFGHEKGAFTGAIAQRVGRFESANGGTLLLDEIGEIPPAFQAKLLRVIQE
GEFERVGGTKTLKVDVRLIFATNKDLEMAVQNGEFREDLYYRISGVPLIL
PPLRHRDGDIPLLARAFLQRFNEENGRDLHFAPSALDHLSKCKFPGNVRE
LENCVRRTATLARSKTITSSDFACQTDQCFSSRLWKGVHCSHGHIEIDAP
AGTTPLLGAPANDVPPKEPGSAGVASNLIERDRLISALEEAGWNQAKAAR
ILEKTPRQVGYALRRHGVDVRKL
>SMa0814 nifB, NifB FeMo cofactor biosynthesis protein
MSTPMILRESRTSTTFSDQLLENAKSVGCSPPSTAPGDIDPGTWDKIKNH
PCFSEEAHHYFARMHVAVAPACNIQCNYCNRKYDCANESRPGVASEKLTP
DQAVRKVIAVANEVPQLSVLGIAGPGDACYDWKKTRATFERVAREIPDIR
LCISTNGLSLPDHVDELAEMNVDHVTITINMVDPRVGVKIYPWIYYGQRR
HTGIDAARILHERQMLGLEMLAERGILTKVNSVMIPGVNDEHLIEVNKVV
KGRGALLHNVMPLISNRIHGTYYGLTGQRGPEAFELQALQDRLEGTKLMR
HCRHCRADAIGLLGDDRGHEFTLAEIPDEITYDASKRQAYRQLVARERGD
HLVAKNEANRTVMSVEYGGSLLIAVATKGGGRINEHFGHAKEFHVYTVSQ
RGIKLAGRRRVEQYCLGGWGEVATLDHIVVALEGIDILLCVKIGDYPRKQ
LTQAGLRATEAYGHDYIESALGALYAAEFGIEPPVKTATA
>SMa0827 nifD, NifD nitrogenase Fe-Mo alpha chain
MSLDYENDNALHEKLIEEVLSHYPDKAAKRRKKHLSVAKNKQETAEEGQV
VSECDVKSNIKSIPGVMTIRGCAYAGSKGVVWGPIKDMVHISHGPVGCGQ
YSWSQRRNYYVGTTGIDAFVTMQFTSDFQEKDIVFGGDKKLEKIIDEIEE
LFPLNNGVTVQSECPIGLIGDDIEAVSRKKAEEYKTTIVPVRCEGFRGVS
QSLGHHIANDAIRDWVFDTTEVAYEAGRYDVNVIGDYNIGGDAWASRILL
EEIGLHVVGNWSGDATLAEIERAPTAKLNLIHCYRSMNYICRHMEEKYGV
PWMEYNFFGPSQIEASLRQIAKHFGPEIEERAERVIAKYSGLTDAVIDKY
WPRLHGKRVMLYVGGLRPRHVITAYEDLGMEIVGTGYEFAHNDDYQRTGH
YVKEGTLIYDDVTGYELEKFIERIRPDLVGSGIKEKYSVQKMGIPFRQMH
SWDYSGPYHGYDGFAIFARDMDLAVNNPVWDLYDAPWQKVTMPAASGAAE
>SMa0830 nifE, NifE oxidoreductase
MPSLSAKNQAFFNEPACERNRSKDFKVRKKGCSQPPMPGAAAGGCAFDGA
KVALQPITNVAHLIHAPLACEGNSWDNRGTASSSHMLWRTSFTTDVTEFD
VVMGHSERKLFKAIREINEAYAPAAVFVYATCVTALIGDDIDAVCRRAAE
KFGLPVVPVNAPGFVGSKNLGNKLAGEALLDHVIGTVEPDDARSSDINIL
GEFNLSGEFWQVRPLLDKLGVRVRACIPGDSRYLDIATAHRARAAMMVCS
TALINLARKMLERWDIPFFEGSFYGITDTSEALRQIAGLLVKQGAGPDLI
SRTEALIVEEEARAWRRLEVYRPRLQGKRVLLNTGGVKSWSVAHALMEIG
LEIVGTSIKKSTDNDKERLKQMLTNDSRMSGATTPRELYSALSDHKADIM
LSGGRTQFIALKAKMPWLDINQERQHSYAGYHGVVELARQIDLSIHNPIW
AQVREAAPWEMAPARGEEMSEEEALL
>SMa0825 nifH, NifH nitrogenase Fe protein
MAALRQIAFYGKGGIGKSTTSQNTLAALVDLGQKILIVGCDPKADSTRLI
LNAKAQDTVLHLAATEGSVEDLELEDVLKVGYRGIKCVESGGPEPGVGCA
GRGVITSINFLEENGAYNDVDYVSYDVLGDVVCGGFAMPIRENKAQEIYI
VMSGEMMALYAANNIAKGILKYAHAGGVRLGGLICNERQTDRELDLAEAL
AARLNSKLIHFVPRDNIVQHAELRKMTVIQYAPNSKQAGEYRALAEKIHA
NSGRGTVPTPITMEELEDMLLDFGIMKSDEQMLAELHAKEAKVIAPH
>SMa0829 nifK, NifK nitrogenase Fe-Mo beta chain
MPQSAEKVLDHAPLFREPEYRKMLAEKKRNFERPYPDRTVTDQREFTKTW
HYREINLAREALVVNPAKACQPLGAVFAAAGFERTMSFVHGSQGCVAYYR
SHLSRHFKEPSSAVSSSMTEDAAVFGGLKNMVDGLANTYKLYDPKMIAVS
TTCMAEVIGDDLHGFIENAKDEGAVPHDFDVPFAHTPAFVGSHVDGYDSM
VKGVLENFWKGEQRTVNPGSINIIPGFDGFCVGNNRELKRLLNLMGVSYT
FIQDASDQFDTPSDGEFRMYDGGTKIEDVRAALNAEATVSLQQYNTRRTL
EYCKAAGQSTVSFHYPLGVKATDEFLVKVSEISGREIPEAIRLERGRLVD
AMADSQSWLHGKKYAIYGDPDFVYAVARFIMETGGEPTHCLATNGTPAWE
AEMKMLLASSPLGNDAQVWASKDLWAMRSLLFTEPVDLLIGNSYGKYLER
DTGTPLIRLMFPIFDRHHHHRFPLMGYQGGLRVLTTILDKIFDRLDRETM
QVGVTDYSYDLTR
>SMa0873 nifN, NifN Nitrogenase Fe-Mo cofactor biosynthesis protein
MVRILSQTKWATINPLKSSQPLGGALAFLGVGGAIPLFHGSQGCTSFALV
LLVRHFKEAIPLQTTAMDDVAIVLGGAGHLEQAILNLKIRAKPKLIGICT
TALVETRGEDLAGDLASIKLERAEELTGTDVVLANTPDFDGAMEEGWAKA
VTAMIKAITRIGEQERQSRTIAILPGWNLTIADIEQLRDIVESFGLKPII
LPDLSGSLDGIVPDDRWVPTTYGGISVEEIRELGTAAQCIAIGEHMRGPA
EEMKTLTGVPYVLFQSLTGLNAVDRFVSLLSSISGRPAPAKVRRRRAQLQ
DALLDGHFHSAGKKIAIAAEPDQLYQLATFFICLGAEIVAAVTTKGASKI
LHKVPVEIIQVGDLGDLESLATHADLLVTHSHGQHASARLGTPLMRVGFP
VFDQLGSQHKLTILYHGTRDLIFEVSNIFQSHSLAPTHRGT
>SMa0831 nifX, NifX nitrogen fixation protein
MISIRRLSLVSDQSQREISDRPVGALRIAIATEDMKGLNAHFGSAKRFAI
YDVTAHKSQFMEAIEFDDASDESGRHRTEGDGRIRSRVSALKGCQLLFCL
AIGGPSAAKVISAKIHPIKAQQAVSMSQVLSSVETMLQTAPPPWLRKMLA
DAGAAKKRADFEDETE
>SMa1250 nirK, putative NirK Cu-nitrite reductase
MSEQFQMTRRSMLAGAAIAGAVTPLIGAVSAHAEEAVAKTAHINVASLPR
VKVDLVKPPFVHAHTQKAEGGPKVVEFTLTIEEKKIVIDEQGTELHAMTF
NGSVPGPLMVVHQDDYVELTLINPDTNTLQHNIDFHSATGALGGGALTVV
NPGDTTVLRFKASKAGVFVYHCAPPGMVPWHVTSGMNGAIMVLPREGLTD
GKGNSITYDKVYYVGEQDFYVPRDANGKFKKYESVGEAYADTLEVMRTLT
PSHIVFNGAVGALTGDSALKAAVGEKVLIVHSQANRDTRPHLIGGHGDYV
WATGKFRNAPDVDQETWFIPGGTAGAAFYTFEQPGIYAYVNHNLIEAFEL
GAAAHFAVTGDWNDDLMTSVRAPSGT
>SMa1247 nirV, NirV periplasmic nitrate reductase
MAALSKSREVAPVALAIPSALILILAGLLALETGLLGSAPSGSAVEEPPV
VTVAPRDFRYRVAGEFFKNGYAVDGPVETVHMSAPLTIMKYQVTAADYAR
CVAEDACLPAEPEHVPVDPARMPATGVSFDDAQAYAAWLSRRTGAIWVLP
TDEQLAFAAGSRFPDDALGVEDDASNPALRWLADYYRETARKASRESEPQ
RLGHFGESETGLSDFAGNVWEWTTTCVRRVTLDRRGKVISDASSCGIYIA
TGRHRAALSSFVRNPKGGGCAVGAPPDNVGFRLVKDGRWYAPLLRTLREK
GLDV
>SMa0869 nodA, NodA N-acyltransferase
MSLKVQWKLCWENQLERADHQELSEFFRKSYGPTGAFHAKPFEGGRSWAG
ARPERRAIAYDSVGIASHMGVLRRFIKVGETDLLVAELGLYAVRPDLERM
GIAHSVGALTPTLRELGVPFAFGTVRHAMRNHVERYCQNGMASILTGVRV
RSSIAEVNADLPSTRTEDPLVVIFPVGRPLNEWPPGTLIERNGSEL
>SMa0868 nodB, NodB chitooligosaccharide deacetylase
MKHLDYIHEVPSNCDYGTEDRSIYLTFDDGPNPHCTPEILDVLAEYGVPA
TFFVIGTYAKSQPELIRRIVAEGHEVANHTMTHPDLSTCGPHEVEREIVE
ASEAIIAACPQAAVRHIRAPYGVWSEEALTRSASAGLTAIHWSADPRDWS
RPGANAIVDAVLDSVRPGAIVLLHDGCPPDESGALTGLRDQTLMALSRIV
PALHERGFAIRPLPPHH
>SMa0866 nodC, NodC N-ACETYLGLUCOSAMINYLTRANSFERASE
MYLLDTTSTAAISIYALLLTAYRSMQVLYARPIDGPAVAAEPVETRPLPA
VDVIVPSFNEDPGILSACLASIADQDYPGELRVYVVDDGSRNREAIVRVR
AFYSRDPRFSFILLPENVGKRKAQIAAIGQSSGDLVLNVDSDSTIAFDVV
SKLASKMRDPEVGAVMGQLTASNSGDTWLTKLIDMEYWLACNEERAAQSR
FGAVMCCCGPCAMYRRSALASLLDQYETQLFRGKPSDFGEDRHLTILMLK
AGFRTEYVPDAIVATVVPDTLKPYLRQQLRWARSTFRDTFLALPLLRGLS
PFLAFDAVGQNIGQLLLALSVVTGLAHLIMTATVPWWTILIIACMTIIRC
SVVALHARQLRFLGFVLHTPINLFLILPLKAYALCTLSNSDWLSRYSAPE
VPVSGGKQTPIQTSGRVTPDCTCSGE
>SMa0870 nodD1, NodD1 transcription regulator
MRFRGLDLNLLVALDALMTERKLTAAARRINLSQPAMSAAIARLRTYFGD
ELFSMQGRELIPTPRAEALAPAVRDALLHIQLSVIAWDPLNPAQSDRRFR
IILSDFMILVFFARIVERVAREAPGVSFELLPLDDDPHELLRRGDVDFLI
FPDVFMSSAHPKAKLFDEALVCVGCPTNKKLLGNISFETYMSMGHVAAQF
GREMKPSVEQWLLLEHGFNRRIELVVPGFTLIPRLLSGTNRIATLPLRLV
KYFEQTIPLRIVTSPLPPLFFTEAIQWPALHNTDPGNIWLREILLQEASR
IDPQSDTC
>SMa0757 nodD2, NodD2 nod box-dependent transcription activator
MRFRGLDLNLLVALDALMTERKLTAAARRVKLSQPAMSAAIARLRTYFGD
ELFSMQGRELIPTPRAEALAPAVRDALLHIQLSVIAWDPINPAQSDRRFR
IILSDFMTLVFFERVVERLAREAPGVSFELLPLDDDPYELLRRGDVDFLV
LPDLFMSSAHPKAKLFAEALVCVGCPTNEQLLGELSFEKYMSMGHVAAQF
GRALKPSFEQWLLLEHGFKRRVELVVPGFTLIPPLLPHTNRIAIIPLRLV
KYFEQTIPLRIVKHPLPPLWFTEAVQWPALHNKDPGNIWMREILLQEASR
SEFQGETSLE
>SMa0840 nodD3, NodD3 transcriptional regulator
MRFKGLDLNLLVALDALMTKRSVTAAARSINLSQPAMSSAIARLRSYFQD
ELFRMQGRELITTPRAEALAPAIRDALLHIQFSIISWDMFNPAQSDRCFR
IILSDFMTLVFFEKVVERVAREAPGVSFELLPPDDNPDELLRRGEVDFLI
FPDVFMSSVHPKAKLFDQTLVSVGCLTNEQLLGDLSFERYMSMGHVAAQF
GRALKPSVEQWLLLEHGYKRRIELVVPGFNLIPPLLSGTKRIAIIPLRLA
NHFAKSIPLRIVKHPLPLLSFTEAVQWPALHNKDQASIWMREILLDEAAR
IAAPRETAGCLGR
>SMa0853 nodE, NodE beta ketoacyl ACP synthase
MDRRVVITGMGGLCGLGTDTTSIWKWMREGRSAIGPLLNTELHGLKGIVG
AEVKALPDHNIDRKQLVSMDRISVLAVIAAHEAMRQAGLSCNEGNALRFG
ATVGVGLGGWDATEKAYRTLLVDGGTRTEIFTGVKAMPSAAACQVSMSLG
LRGPVFGVTSACSSANHAIASAVDQIKCGRADVMLAGGSDAPLVWIVLKA
WEAMRALAPDTCRPFSAGRKGVVLGEGAGMAVLESYEHATARGATILAEV
AGVGLSADAFHITAPAVHGPESAMRACLADAGLNAEDVDYLNAHGTGTKA
NDQNETTAIKRVFGDHAYSMSISSTKSTHAHCIGAASALEMIACVMAIQE
GVVPPTANYREPDPDCDLDVTPNVPRERKVRVAMSNAFAMGGTNAVLAFK
QV
>SMa0852 nodF, NodF acyl carrier protein
MVDQLESEIIGIIKNRVESEGGDGETALIVGDLTAATELTALGVDSLGLA
DIIWDVEQAYGIRIEMNTAEAWSDLQNVGDIVGAIRGLLTKGA
>SMa0854 nodG, NodG 3-oxoacyl-(acyl carrier protein) reductase
MFELTGRKALVTGASGAIGGAIARVLHAQGAIVGLHGTQIEKLETLATEL
GDRVKLFPANLANRDEVKALGQRAEADLEGVDILVNNAGITKDGLFLHMA
DPDWDIVLEVNLTAMFRLTREITQQMIRRRNGRIINVTSVAGAIGNPGQT
NYCASKAGMIGFSKSLAQEIATRNITVNCVAPGFIESAMTDKLNHKQKEK
IMVAIPIHRMGTGTEVASAVAYLASDHAAYVTGQTIHVNGGMAMI
>SMa0851 nodH, NodH sulfotransferase
MTHSTLPPQPFAILAMPRTGTHYLEELVNEHPNVLSNGELLNTYDTNWPD
KERLLLSDRELLERAFLRYPPHSDKKVTHVGCKINEPQFQERPSFFAELT
AWPGLKVILVIRRNTLESLRSFVQARQTRQWLKFKSDSSAPPPPVMLPFA
TCEAYFKAADDFHARVVYAFDSSRIRLIEYERLLRDPVPCVATVLDFLGA
PALQLADRGILRRQETRPLDQTVRNFHELRVHFANGPYARFFELAND
>SMa0864 nodI, NodI membrane transport protein
MTGNGRVLRQEAENQLSDREMAQEAPRWLEPSPFEWKDQTGLAVKTAIPG
AKPTVAIDVASVTKSYGDKPVINGLSFTVAAGECFGLLGPNGAGKSTITR
MILGMTTPGTGEITVLGVPVPSRARLARMRIGVVPQFDNLDLEFTVRENL
LVFGRYFRMSTREIEAVIPSLLEFARLENKADARVSDLSGGMKRRLTLAR
ALINDPQLLILDEPTTGLDPHARHLIWERLRSLLARGKTILLTTHIMEEA
ERLCDRLCVLEAGHKIAEGRPHMLIDEKIGCQVIEIYGGDPHELSALVSP
HARHIEVSGETVFCYASDPEQVRVQLDGRAGVRFLQRPPNLEDVFLRLTG
RELKD
>SMa0863 nodJ, NodJ membrane transport protein
MWKLYVAALPANGWNWIAVWRRNYLAWKKVALASILGNLADPLIYLFGLG
AGLGMMVGRVDGVSYIAFLSAGMVATSAMTASTFETIYATFARMRAQRTW
EAILHTQVTIGDIVLGELAWAATKASLAGTGIGVVAATLGYTEWVSLLYA
LPVIALTGLAFASLAMIVTALAPSYEYFIFYQTLVITPMLFLSGAVFPVN
QLPGAFQHVTRILPLAHSIDVIRPIMLGSPLVHVGLHIGALCCYAVVPFF
LSTALLRRRLMP
>SMa0772 nodL, NodL Nod factor acetyltransferase
MTRTQKEKMLAGEMYNAADPEIQADLLAAGAWLKRYNSTLGDSAEQWHLF
LREGLGEVGPGAVIRPPFHCDYGFNISIGAHAYMNFNCVILDVAKVTIGD
GTAIGPAVQIYTADHPDDPEQRQAGLQLGRPVRIGKHVWIGGGAIILPGV
TIGDHAVVGAGSVVTRDVPPGAKVMGSPARVRG
>SMa0878 nodM, NodM Glutamine aminotransferase
MCGIVGIVGNQPVSERLVEALKRLEYRGYDSAGVATIDAGTLQRRRAEGK
LVNLESRLREEPLAGTIGIAHTRWATHGAPTERNAHPHFTEGVAVVHNGI
IENFAELKDELAAGGAEFQTETDTEVVAHLLTKYRRDGLGRREAMHAMLK
RVKGAYALAVLFEDDPSTIMAARNGPPLAIGHGSGEMFLGSDAIALAPFT
NEITYLIDGDWAVIGKTGVHIFDFDGNVVERPRQISTAAAFLVDKGNHRH
FMEKEIYEQPEVIAHALGHYVNFIENRVVPISDAIDFGKVPSLAISACGT
AYLAGLIGKYWFERYARLPVEIDVASEFRYREIPLSPQSAALFISQSGET
ADTLASLRYCKEHGLKIGAVVNARESTIARESDAVFPILAGPEIGVASTK
AFTCQLAVLAALAVGAGKARGTISGEEEQALVKSLAEMPRIMGQVLNSIQ
PKIESLSRELSKCHDVLYLGRGTSFPLAMEGALKLKEISYIHAEGYAAGE
LKHGPIALIDENMPVIVIAPHDRFFDKTVSNMQEVAARGGRIILITDEKG
AAASKLDTMHTIVLPEVDEIIAPMIFSLPLQLLAYHTAVFMGTDVDQPRN
LAKSVTVE
>SMa0874 nodN, NodN putative dehydratase
MHEISLSDVSSLVGQELGTSKWITIDQAMINLFADATHDHQFIHVDPNRA
AAESPFGGAIAHGFLTLALLSVMNFSGMPKFREQTMGINYGFDRVRFISP
VRTGSRVHGRFVLSDCRLRRASILMTAYNVTVEIENENKPALTANWIAIA
QFNPKDRPKAG
>SMa0855 nodP1, NodP1 ATP-SULFURYLASE SMALL SUBUNIT
MSLPHLRRLEAEAIHVIREVVATFSNPVVLYSIGKDSSVLLHLAMKAFYP
AKPPFPFLHVDTKWKFREMIEFRDRMARELGFDLLVHVNQDGVEQGIGPF
THGSNVHTHVMKTMGLRQALEKYGFDAALAGARRDEEKSRAKERIFSIRS
AQHGWDPQRQRPEMWKTYNTRVGQGETMRVFPLSNWTEFDIWQYILREEI
PIVPLYFAARRPVVKREGMLIMVDDDRMPIQPEEEVTEQLVRFRTLGCYP
LTGAVESDAVTVPEILREMLTVRTSERQSRLIDTDEVGAMEKKKREGYF
>SMa0857 nodQ1, NodQ1 ATP-SULFURYLASE LARGE SUBUNIT)-APS KINASE
MSYVQSIPPHDIEAHLAEHDNKSILRFITCGSVDDGKSTLIGRLLYDAKL
VFEDQLANLGRVGSPGAANGKEIDLALLLDGLEAEREQGITIDVAYRYFA
TSKRKFIVADTPGHEEYTRNMVTGASTADLAIILIDSRQGILQQTRRHSY
IASLLGIRHVVLAVNKIDLVDFKQQVYEEIVADYMAFAKELGFASIRPIP
ISARDGDNVISASANTPWYRGAALLEYLETVELDPTDQAKPFRFPVQMVM
RPNADFRGYAGQISCGRISVGDPVVVAKTGQRTSVKAIVTYDGELATAGE
GEAVTLVLSDEVDASRGNMLVAPGARPFVADQFQAHVIWFDANPMMPGRS
YILRTETDSVSATVTTLKHQVNINSFIREAAKSLQMNEVGVCNISTQAPI
AFDAYNDNRATGNFIIVDRVTNATVGAGLIDFPLRRADNVHWHALEVNKS
ARSAMKNQLPAVLWFTGLSGSGKSTIANELDRILHAQGKHTYLLDGDNVR
HGLNRDLGFTEEDRVENIRRVAEVAKLMADAGLIVLVSFISPFRDERRMA
RELMEEGEFIEIFVDTPLDECARRDPKGLYEKALAGKIANFTGVSSCYEA
PENPELHIRTVGHQPNDLALAIEEFLDRRIGGQMTPLQRPT
>SMa0773 noeA, NoeA host specific nodulation protein
MARMADSKLVAAAPRPGRVAGSFRDPSGQVFHFQDRILRTMDSAAAIEFA
SAERVMRQLVDEGRLVDFSDAEPSLHQLFQGSIARVLQHPLLEQITYPYE
WSFAGLKAAALFHLQLQLDLLDQGFCLSDATAYNVQFEGSRPTFIDHLSI
KPYRDGQLWYGHKQFCEQFLVPLLLRSVFDITHHSWYRGNLEGVPSADFV
KLLSTRHWFSHKLFMHIILPAKLQSSRTSQTKVDLGDSRARRLPKDAFRA
MLAQLYSWISGLKVDVGKQSVWQGYAANNTYTATQRSDKGQYVAEFVAQH
KPRTIIDLGCNTGDFSYVALENGAEKAIGFDFDPHALDAAFDRSVQTSKN
FLPLYLDARNPSPSQGWGERERQGFSSRFSADAVLALAFEHHLAIAHNVP
LAEVVAWVTQVAPKGIIEFVPKEDETVRRMLAGREDIFSDYNEEAFASAL
SQKARVVNKHLIPGSKRTLYTFERSE
>SMa0774 noeB, NoeB host specific nodulation protein
MKRIVLLLSPLVILLSPIIDAFQGVYIDPRSDAGYAVIACVAIIGLALGM
IATFCYKRGAVGNVVTAGTLAVTLFLFGDLSYGVFWRLADHIGMEGAAAL
AFAGLVVLILILFKLMAAVPRMMAAFAVALLGSTVLDRGLITEASAEEPA
PAVIYIVTDEMIGISGIDTRLPHGAEAKAALSRVFQKHGFRLHSKAFSRH
ILTQVSVPAALNMDYSYNFPGDRSHYAYPGEVTKFKVLSLFNLWHRQGLS
VNVFQSAHLDFCKTEALVDCHTFASFDGSRFIQRQRTEGGPYRDVPPASA
SLADVKALVEGNRQSLAMAAVGFVASNLLETKETALQEWHPRSYDQLAFP
YWLGQFQNMILDKGRGNAFFAHFLFPHSPSVYNKDCKPTNRWVERSYLTE
VRGLAGQELDEARVKEYGFYFAQTLCLAKQLDKFFNAVLSDKRFQDATIV
VHGDHGSRISAGRNVETMSPRDYVDNYSALYAVRKPGVATGTDYKLHSVQ
WLNASLFRGNVSEVAPTVVGSIQGKSDDPAVYAPITDPDHVAIMPMHDFD
ASH
>SMa0876 nolF, NolF secretion protein
MTISAQCNLQKLAFATTLAVTIVLSQGRAIGQVKHGSPIELAKADVSTAV
RQDMANEVRIVGSLTPIRRSTLTSRVSSTIIELPVQIGDVVNAGDLLVRF
ERGALESAVTGRKAEADALSAQTELAEAVLERNTRLGERGAASEATRLAA
LADVLDLRAQLRSKQAEVSDAERSLSHAEVRAEFGGVIAARSVEEGQTVP
LNTQLMTIVELNRLEVDAGVPTSRIPLIRLKQSVELTVEGFPGRTFSGEV
ARISPTADAGSRAVRVFIAVDNEEGLLRGGMFTIGDLRVDDQKDVIALPA
ASIRHDADGFFVLKVEAGVLQRRPVGLGRSWSDRDLVQVSGVSEGDVIVT
APLPDLVVNTPVIIEGI
>SMa0875 nolG, NolG efflux transporter
MFLTRISINHPVFATMMMVMILVLGLFSYGRLGVDHYPETDLPVVVVATT
YTGASPESVESEISRPIEAALNTIGGIDTITSESYEGRSIVVVQFEVDVD
SQDAAQEVRDRVARLETKFPDGVATPQVTRYKPEGQAILSVAVSSTSRTL
PEITTLATRVINNRLSVISGVGQVSLIGSSERQVLVVVDPDRLGAYGLAV
STVIEAIRGENQDRAAGTLISGINQRIVTVEGRIANTSGFNRIIVAQRNG
YPVYLSEVATILDTGAEVTSLANYQGQTTLGLHIVKVQGANTVEVASAVR
REVSALNAELTKDNVQLTITRDNSRPIASQVSQVQRTLVEGGVLSVLIVF
IFLNSWRSTVITGLTLPISVIGTFAAIYALGFTLNIMTLMALSLSIGILI
DDAIVVRENITRHLQMGKDPVRAALDGTNEIGLAVLSTTLCIVAVFLPVA
FMGGLIGRFFLQFGVTVAVAVVISLFVSFTLDPMLSSVWCDPQSQKTAKR
GFFGQLIERFDQWFEGLASRYRSVIYFTFDYRKTTIAIVLGMFVVSLLLV
PRIGTEFLPPPDQGEVSISLEANEGASLDYMAAKVGQIERALREFNYVSS
TYSTINSGEMRGFNKALVAVQLVHSSQRRLKTAETLGPIRRRLSRIAGLE
ISVGQRSEVVGSIKPLQLSILGDGDEELRRISDHITSVLAAIPGATEIES
SIEKLRPTLAVRVRREAASDLGVSIATIGDTLRSLVAGDAISVWNSPDGE
THDVVVRLPAAGRENAAQLRNLPIATARMDDNGKPIMVLLDQVADVVEST
APAQITRKDLSRDIRISSNIEGRTLGDVVADLKAAMTKMDIPVGFRISFG
GDAENLTESTAYALQSLAMAVIFIYIILASQFGSFIQPIAIIMTMPLSLM
GVLLGLLFTGSTLNMFSMIGIMMLMGLVTKNAILLVDYSNLGVREGKSLR
QSLADAGAVRLRPIVMTTLAMIFGMLPTALGLGEGGAQRAPMAHAIIGGL
ISSTLLSLVFVPVVLTYLDAFAGRVRRWVPSPTGSNASAQHDGSDKTKTP
ACALTSQDQLGTTIM
>SMa1273 norB, NorB nitric oxide reductase
MKYQTQKVAMLYFYGALGLFLAQILFGVLAGTIYVLPNTLSELLPFNIVR
MVHTNALIVWLLMGFMGATYYLLPEEAETELYSPKIAIAQFWIFLIAAAV
AVVGYLLHIHEGREFLEQPFAIKVGIVVVALMFLFNITLTVLKGRKTTVT
NILIFGLWGVAIFFLFAFYNPVNLALDKMYWWYVVHLWVEGVWELIMASV
LAFLMIKLNGIDREVVEKWLYVIVGLALFSGILGTGHHYYWIGAPGYWQW
IGSLFSTLEVAPFFTMVVFTFVMTWKAGRKHPNKAALLWSIGCSVMAFFG
AGVWGFLHTLSSVNYYTHGTQVTAAHGHLAFFGAYVMLNLAVMAYAIPEI
RGRAPYNQWLSIASFWAMCSAMSVMTFALTFAGVVQVHLQRVLGESFMAV
QEQLALFYWIRLGSGVVVLISALMFFWAVLVPGKRASPALNQAVQPAE
>SMa1276 norC, NorC nitric oxide reductase
MAERLTKTGARNVFYGGSIFFFAIFVGLTAHSHYYMKTESTDETTLTDSV
ARGKHVWEKNSCINCHTLLGEGAYFAPELGNVWKRWGGESDPEGARETLK
SWMAAQPSGAEGRRQMPQFKLTEQELNDLADFLEWTSRIKTQNWPPNEAG
>SMa1269 norD, NorD protein required for nitric oxide reductase (Nor) activity
MLDFLELEETVGRAWHRLIGNTRTWPRFPDEAVRLEDVQPVLAVYFRGLG
GERTVQIAPARGRTSAHRLRLRQRMGLGEEKLVQPARDHATLMLPAEFDL
FPLRRLNRDLYFWLAAMMAVMPLKPVGAADPLSRDLALLARAGETVKSVL
ATYPAMRQRYRRLCRAVLNVRQRRSLPSVEQRVENQILSMLRAGAGLNDD
VPPIIFPRCGPPGYLPMLPVPLWPDALLREETERRNGEDEPARGGDRAEG
SETTRHMATREAQRQSERSPFILNRFEKILAMAEMVNVGRPADDSNDHDA
SAADELDDMTLGERQGRPAARFRFDLDLPPEALDRTPLTAELTYPEWDYR
RGSYLKDHCRVLAAPVSAEGAAGETDPATKSLIRRVRRRFEVLRPRHEML
RAQIDGADLDLDAVVRARTDLRAGGQGSDRIHMMSRPQAHDLAVTILVDV
SLSTDAWFDDLRVLDVEKQALQVLAHGLSACGDAHEILTFTSRRRDWVRI
ETVKAFDEAMSATIEARIAALKPGYYTRIGAAIRHAAAGLVERPNRRKLL
IVLTDGKPNDVDHYEGRFALEDSRRAVGEARRSGISVFGVTVDREAKSYL
PVIFGQNGYAVVSNIGRLPAALPAIYRGLVS
>SMa1279 norE, NorE protein involved in nitric oxide reduction
MRRCGMENAMTAGAADEQEADTFILWVLIWSELAAFGILIAAFLVASVLT
ADDFAVARLHLKPAIAASNTLVLLTSGWQAAVAAGKGASVARRRRALVLA
ALLGFAFVAIKIYEYGTEIRFAGEAAFHSFFELYFLLTGFHLAHVGFVAI
VLLVVAWRPRPANVGLVTTLWHVIDLVWIVMFPILYLV
>SMa1272 norQ, NorQ protein required for nitric oxide reductase activity
MNIVLKASPIPDSAIPAYSPSGRECELFESAWTRQLPLLLKGPTGCGKTR
FVTHMAAKLGLPLSTVSCHDDLAAADLTGRFLLKGGDTVWVDGPLTRAVR
EGGICYLDEIVEARKDVAVVLHPLTDDRRILPLERTGELLEAPPGFMLVV
SYNPGYQNLLKTLKPSTRQRFVAIEFDFLPRVSEIAVVSEESGLDESRVA
PLVDLAHRLRSLKGHDLEEGVSTRLLVYCASLVDNGVSVRDAVLATMIEP
LTDEPDVKAALIEIADAVVRQG
>SMa1183 nosD, NosD periplasmic copper-binding precursor
MSRPNISAFGMAALAAVILACPVSAATIRKSADGLPLQPVLDRASPGDVI
VLQGEHQGPVTIDKTLTLEGEPGALVMGNGKGSVITVKAPQSIVRGLEVR
GSGKDLYGMDSGIFVAQTASGARVEKNTIIGNLVGIYLHGARDSWALGNR
IIGLREGRISEAGDGISVWNAPGARVVDNDVSYGRDGIFSKTSKRNVFRG
NRFRELRFAVHYMYTNDSEISDNVSTGNAVGYAIMYSDRLKIKGNRSDGD
RDHGLLLNYANNSRITGNIVVGRLQPADRWLKARSSGHGVPKTDEENQTA
GADRRLGPEKCVFIYNANKNRFRDNVFEGCAIGIHFTAGSEGNLISSNSF
INNRNQVKYVGTRHLDWSSEGQGNYWSDNPAFDLDGDGIGDNPYRPNDLI
DKVLWTSPQAKLLTTSPAVQVIRWAQAQFPAILPGGVVDSRPLMVPAGRV
AVQ
>SMa1184 nosF, NosF copper ABC transporter
MSGTVEIAGVSKCYGDSTVVRDISFGLGAAETVALVGHNGAGKTTLIKLM
LGLIRPTKGLVRVLGENPATGDFAVRQRLGYLPESVSFNMALTGRETLRF
YARLKQVDGAATGDLFERVGLAQEAVDRPVRTYSKGMRQRLGLAQALLGM
PRILLLDEPTSGLDPALRRNFYELITELRAKGTTVLLSSHALTELEGRAD
RVIIVNKGVKIADGTLEQLRRIARLPTRISLKLSQAGATPAWLNGGMKWC
RGPDGAVDAEVSSDRKIALLHDITSDAALLSGLTITEPTLDDLYAHFLNG
GVTK
>SMa1186 nosL, NosL protein required for nitrous oxide reduction
MKLTVTAILAATLFLAGCQKEEDTTMPSPYSLTADAMGRYCGMNVLEHPG
PKGQIILQDIPEPIWFSSARDTVAFTMLPEEPRDVAAIYVSDMGAAPSWQ
EPGAENWIDAKKAFYVIGSKVRGGMGAEEAVPFSSERAARDFAAKNGGRV
TGFAEIPKGYVLGTGTAEQGAAIETESHGEAQIHG
>SMa1179 nosR, NosR Regulatory protein for N2O reductase
MYQAAIHKNLFKRLLRGLVALCIAMLLAGPSMAAGQLSNYLQKVQPQQIF
PGATRFGEVTGDPPIAPVFRGESLLGYAYLNSDVTSSVGYSGKPIHIVVG
IDPKGVVRGLKLVDHKEPIVLIGIPEAKVVASVNALIGKDLGRVSAGVEG
PPQVDIVSGATVTVLVMGDSIVRSALKLIRGNRFGADAVAPEQSSEIRKV
DLLRTGTSDWETLVGDGSVRSLRLTVGEVTEAFRQAGQPAAADRPETHSP
GDRFIDLYIAPVSVPVIGRSLLGDSRYEQMRAKLKPGEEAIVVAGDGAYS
FKGSGYVRGGIFDRIELIQDGQGLRFRDRYHTRLPTLAASGAPRLREIAL
FVVPADFGFDIAAPWELQLLVQRSAVGRDKAVLPYNLSYVLPDAHVTVEA
AAAPSPVEPSIAEQATPAEPIPEGEALWVKMWEMNRVSVAVTMAAVLVLT
LIFFFQDWLVKRPALFAWVRRVYLLFTLVWLGWYANAQLSVVNVLTFFNS
LVTGFHWEFFLSAPLVFLLWGSVAAALLFWGRGPFCGWLCPFGALQELTN
NVARWLKVPQVRVPWGLHERLWPIKYIIFLGLFGLSLYSLALAEMFAEVE
PFKTAIILKFAREWPFVVFALTVLAAGLFIERFYCRYLCPLGAALAIPGR
IRMFEWLKRWPECGSPCQRCAKECPVQSIHPEGAINVNECIYCMHCQELY
HDDQRCPHMIQVRLKREKFMALSTPASRGEAPAKTVVTHKGAPIRKADAA
PENPV
>SMa1188 nosX, NosX protein required for nitrous oxide reduction
MASPITRRRAICIMAAAAGLPLLDLRGRAEGAVAAVTWRGRALGAPATLI
LNLESQADAAGLVDRVVAEVARLERVFSLYQRDSALAELNRTGAIAAPPP
DLVNLLEASRDFWETTGGAFDPSVQPLWTLYAEHFAAGDADPAGPPEAAK
RRALSRVGFDKLEFNRDRVVFARPGMALTLNGIAQGYITDRIVGLLKDAG
IANSLVSMGETRAIGSQHDGRPWRVGLATREDASTPDSVLNLVNRAVATS
SPDGFRFDDSGRFGHILDPLSGRAPRLRRRVSVVAPTATAADAFSTAFSL
MGSSAVRIACEHHSELTVDMISTSGAHERFGRAA
>SMa1185 nosY, NosY nitrous oxide metabolic protein
MSNILTIAGKEIQEGMRNRWVLATTLLLTALALTLSFLGSVPTGSVGVDK
LDVVIVSLSSLTIFLVPLIALLLSHDAIVGEMERGTMLLLLSYPIGRREV
VCGKFLGHLAILAFATLFGYGAAAAALVATGSAVGPDSWQAFGSMIASSI
LLGAVFAAIGYLISSVARERATAGGIAIGIWLFFVLIYDMALLGGLVAAQ
GLAIPTGLLNLLLLANPTDVYRVLNLGSGGASALSALGGVADHTGLSSPV
LLAALGLWTLAPLGFATLIFSRREL
>SMa1182 nosZ, NosZ N2O reductase
MSNEETKMRLNRRQMLGTTAFMAAAGAVGAGGALTLSGGTATPARAQETS
GSSYEVKPGELDEYYVFFSSGQSGEIRIVGAPSMREMMRIPVFNRCSATG
WGQTNESRKVMTEGLLPETVEFLKDQGGLYLNGDLHHPHPSFTDGTYDGR
YLYANDKSNSRVCRIRLDVMKCDKIIQLPNQHTVHGLRVQKYPKTGYVFC
NGEDAVPVPNDGKTMGDKNSYQAIFTAVDGETMEVAWQVMVDGNLDNVDA
DYQGKYCFATCYNSEEGFTLADMMASEQDWVVIFNLKRIEEAVAKGDYKE
IGGVPVLDGRKGSPYTRYVPVPNSPHGINTAPDGIHVVANGKLSPTVTVF
DVRKFDDLFDDKIQARDTVVAEPELGLGPLHTAYDGKGNAYTTLFIDSQV
CKWNIEDAKRAYAGEKVDPIRHKLDVHYQPGHNHTSMGQTKEADGKWLIS
LNKFSKDRYLNVGPLKPENDQLIDISGDEMVLVHDNPTFAEPHDATIVHA
SKINPVHVWNRDDPFFADAVAQAKADNIDLMVDSEVIRDGNKVRVYMTSA
APAFGLDDFTVKQGDEVTVYVTNIDEVEDLTHGFCIVNYGINMEVAPQAT
ASVTFKASRPGVYWYYCTWFCHAMHMEMKGRMLVEAQGA
>SMa0981 ntrR2, probable NtrR2 transcription regulator
MSRLYMLDTNIVSELARNPQGAVTKRIAEVGPEAVCVSIITAAELRYGCA
KKGSPKLLAQIEAILGSMQVLALDVPADAEYGNIRAELETAGKPIGPNDL
FIAAHACVLGAVLVTVNSSEFTRVRDLKVENWLDFTSSG
>SMa1533 nuoA2, NuoA2 NADH I CHAIN A
MTAMEFLPVLFMVTGIVLVAAATLFVSSLLRPSNPYPEKNAPYECGMEAA
GEAAGGRFRVPFFILAILLVVFDVEAMFLFPWAVVLKEIGFVGYIEMFVF
MLLLLVGFAYAWLKGALEWQE
>SMa1532 nuoB2, NuoB2 NADH I CHAIN B
MAGVNDAIRDSVLFTTADSIISWSRRSALWPETFGIACCAIEMISAGCAR
YDLDRFGVVFRPSPRQSDVMIIAGTVTRKFAPVVRRLYDQMPEPRWVIAM
GTCAISGGVYNTYAVVQGSETFVPVDVHVPGCPPRPEALMHGFLLLQEKI
KKSRALTGTPLGRVIAS
>SMa1531 nuoC2, NuoC2 NADH I CHAIN C
MTGERPVHLTAIIGSFGGAVENLGAAHGIYAFAVPPEQIVEFCRFLKEHP
ALEFDFLSDICGVDHYPETPRFETVYHLYSLKNKWRVRIKCRLGEPPHVP
TVTGVWRTANWHEREAWDMYGIRFEGHPDLRRIYMWEGFEGFPQRKDFPL
RGYKDKLNPFGAEGPPPTQPDLATNDIPQGGR
>SMa1529 nuoD2, NuoD2 NADH I CHAIN D
MTEVTELMRPEGEALNTKEVLLNLGPQHPSTHGVLRLVLQLDGEYVERVD
PHIGYLHRGTEKLAESFTYTQIFPLTDRLDYLCPPSNNLAFALAVEKLLG
IEAPIRAQYIRVMMAELARISGHLLITGALPMDLGAMTALLYAMREREMI
MDLLEMITGARMHTSYCRVGGVREDLPDGFLPKIREFCEIFPNRIRDYER
LIENNRVFLSRTQGVGVISATDAIDLGLSGPNLRASGVDWDIRRDEPYEI
YDRLDFDVITREEGDCYSRWLCRVDEMRESIRLIEQCMEQMPEGPFQVDI
PTIAFPVDKERVHCSMEALIQHFDLSAYGFDVPAGEVYSVIEAPKGELGF
YIISDGSPKPFRMKVRAPSFVNLQALFGVTNARYLADMIAVLGSLDPVMA
EVDK
>SMa1526 nuoE2, NuoE2 NADH I CHAIN E
MTMREEIEAAAARYPDRRSAIMPALMIAQKEHGHLPGPVLEEVAQILGVE
RVWVYELATFYTLFHTEPIGRFHLQLCDNVSCMLCGSEALLTHLETTLGI
RKGETTPDGAFTLSTVECLGACEMAPVMQVGDDYHGNLDAARLDALLESF
RAAERVTSVERAAAAPGE
>SMa1525 nuoF2, NuoF2 NADH I CHAIN F
MFEPVLLRNVDVPDGHLLSTYEAGGGYRALRKALGEYTPDEIVELVKESN
LRGRGGAGFPTGMKWSFVPKAAGKPKYLCCNADEGEPGTFKDRIIMERDP
HQLIEGLAVSAYAIGAETAYVYIRGEYVTAIRRMEQAIAEAHENGYLGIG
ILGSGFNFMVHIHRGAGAYICGEETAMLESLEGKRAQPRLKPPFPAVAGL
YASPTVINNVETLACVPHIVMRGSAWFRGIGPDRSPGPKLYCLSGQVRKP
GLYELPMGISLRELVEEHAGGPLPGRKVKAVIPGGVSAPVIPEGELEVGM
DFDSLTAAGSMLGSAGVVVIDDSTCMVKLATRIIEFFHHESCGKCTPCRE
GLDWTVKVLRRIEAGEGETGDLEQLEMLCKGIFGNTFCALGDGAAMGLRA
ALKHFRAEFVAHIEERRCPFH
>SMa1523 nuoG2, NuoG2 NADH I CHAIN G 2
MIKVTIDEQSLEVEAGSTVLAAAERLGIEIPTFCYWKRLPPLASCRMCLV
EIEGLRRLQPACATVAADGMVVRTNTPLIEETRSSMLDMLLANHPLDCPI
CDKGGECELQDMVMAYGPGESRFRDPKRVFHSKDIRLSPVIIMNVNRCIQ
CQRCVRMCEEVVGAVALGTVEKGMDTAVTGFEGSLASCDQCGNCVEVCPV
GALMSFPYRYKARPWDLAETDTICPHCGTGCQLTVGARKGEFMRVRSDWE
HGVNRETLCVRGRFGLDFIESRDRIKRPMIRRDGTLTPVSWEEAGDFLRQ
RLGVAEGKAAGGLISPRLPNEVLYQFQKLMRTVLRTNNVDCSSRWSAPLD
ILVPIVASFYSRDPLEQVIGKDCVLIIGGNVTEENPVTEYLLRDAARRRH
TRLLMLSARPSRLDADARAVLRAHPGGEGQSLAAVVAALVAVTDEGLPDD
IFAKTSGTTASSGANDALDRLVSTLKEGRSVTLLVSVDLLRSPLARKTLE
QLGNLLQLLRLLGKEPSLQFLFDRANQMGAWDMGVLPGVLPGLSPIADEA
TRTRFERSWGAEIPREPGADVDAMLELCEKGGMGVLYVVGSDPLISYPDR
EFVERALGAANLLIVQDAFLTDTAGLADVVLPAAGYGEESGTFTNNEGRT
QALRKFREPAFDARSNLAIFGFIAALRERPLQPSTETVIFEEMTRLVPAY
EGLTWEGLGADGAFTTSAPKPWTSGFFAPLSAPAVTDVLQLITGNCLFHN
GYVSEHSETLNSVADDPFIEMSAQDAAGLSLSDGDQVLVRSARGELTAKL
KVNRRFPHGLVFVPENYRALRLNSLMRRGEYPCPVEIRECAKRAASALDE
ERV
>SMa1516 nuoH2, NuoH2 NADH chain H
MELIVALGVIVFKVALVIAILLLLPLPLTWLERKIAGHMQQRMGPMRVGW
HGLLQPVADGIKLLTKEDHIPAEADRFLFKLAPILALAPPFVVFAAIPFG
ESVSVLGNEITLYISNLNVALLFVFAVIGLEVYGVIFGGWAANSKYAVLG
SLRTCAQMISYEIPMGFAVIGVVMLAQSMSLLDIVRAQTDVWNVVYQPIG
FFVFFVAGLAEAQRIPFDLAEAEGDLGAGFHTEYSGIRFALFMVSEYVVM
VLVSVLTVILFFGGWNGVLIPLPPLLWFLLKVAFFVYLFMWFRFTFPRYR
YDQLMAIGWKVLLPLSLANIIISGVVFS
>SMa1544 nuoK2, NuoK2 NADH I chain K
MVPLWWSILLGVALFVIGAGGVLLRRNILIVLMSLELLLNSVNINFIAFG
QYYDDFRGQIFAIFVIAITAAEVAVALGILVALVRNKSTLKVDDVTIMKG
>SMa1536 nuoM2, NuoM2 NADH-Ubiquinone/plastoquinone (complex I) oxidoreductase
MSIPLLSLIVFVPVAGAAVLMFMRSDDAVRWTALGFGIVDLALCVVMLAG
FDTTTHEMQFTESRPWVPALGITYALGVDGISALFLFLTALLSLISVLAS
WVAIDRKVKEFMVSLLVMQALMLGVFCALDLFLFYVFWEAMLIPMYLIIG
VWGGEGRVYAAFKFFLYTLAGSILFLIGVIVLYFQGGETFDILALTGQDL
PFGVQSWLFFAFLVAFAVKVPMVPVHTWLPDAHVQAPTAGSIILAGVLLK
MGAYGFLRFSLPMLPEASLYYSTLMLALSALAIVYGGLLALAQDDLKKLV
AYSSISHMGFVTLGIFALNLRGLEGSILQMFNHGITTGALFLFVGLIYER
THTRSIANYGGLMKAAPVYTAFLALFTLSSMALPGTNAFVGELLVLSGGF
AANLAAGAAAVVGALLSAAYLLGMYGKVALGPPSVSARYEIHDVNGREMA
AILPLAIFVLWVGLYPRPFLEIIDASVRNLLVQVHGEGSGR
>SMa1535 nuoN2, NuoN2 NADH I CHAIN N
MTAAALLQWALASVPEIIVVTGACVLLIVGELVRKGRDDLLLWASVAIVL
LAAVATLMLAGEMRPAYAGMFISDRFAVFFKLVFYLATILTFFLSRKYAE
IEGIGRSEYYVLLLFALVGMMIMASAIDLLSIYVGLELMVLCTYVLTGFL
RKERRSNEAALKYVILGAVSTAIFLYGVSLIYGLTGTTQLDRMAEAVSGG
PLDPALLLAVVFIVAGLVFKIGAVPFHMWVPDVYEGAPTTITAFMSVAPK
AAGFAVILRVFLNPLVEASNAWIIVAAIAVATMALGSFVALVQDNFKRLL
AYSSIAHAGFAIFGVVAGGQDGIASVMLYLLIYTFMNLGIFGAVIMMRNG
DFSGEVIEDYAGFAKFHPGLALLMLLYLFSLAGIPPTAGFFAKFYVLVAL
VERGFVALAVIAVLLSVVSAYFYIRIVMVIYMREPERAFEPALTPLVSAT
LAFTAAGTIGIGLFPAWFLRLAQQSAFGG
>SMa0872 orf 110, hypothetical protein ORF 110
MSRLGETAFCGRRRFGEFFSQSTHRLTACRPKGSHRGNALDRKCNGVDQN
GGEATASKRQANPSHHRRRKVLVGLDVDRHERRWHDGQEIGANPELGHQA
DHEGRETEES
>SMa0835 orf10.5, hypothetical 10.5 kDa ORF
MTVMTSHSSTASPLTRAQDFCGSWKRGGPAELMSQLRRDAALTSPRRRRI
DSAVETQYLYAEGGPPSQGAREPSVNPPIAISRRTRANATYAQTTV
>SMa0233 otsA, probable OtsA trehalose-6-phosphate synthase
MSRLVIVSNRVPVPDKGGIAPAGGLAVALKVALEEHGGIWMGWSGRSSGE
NEPEPLAQLHQGNITYALTDLTDTDVGEYYHGFANRVLWPICHYRLDLAE
YGRKEMAGYFRVNRFFAHRLAPLVRPDDVIWVHDYHLIPLAAELRQMGLK
NRIGFFLHIPWPPADVLFTMPVHEEIMRGLSHYDVVGFQTDHDLENFAGC
LRREGIGDELGGGRFSAYGRVFKGGIYAIGIETAAFAEFAKKALTNKTVR
KARESIEHRSLIIGVDRLDYSKGITQRIDAFERFILANPAQQGRVTYLQI
TPKSRSEVPEYEAMQRTVAEQAGRVNGALGAVDWVPIRYINRSVGRHILA
GLYRLGKVGLVTPLRDGMNLVAKEYVAAQDPDDPGVLVLSRFAGAARDLK
GALLVNPYDIEGTANAMARALSMPLEERKDRWKTMMDHLLEHDVSRWCRD
FLNDLATSPDPSG
>SMa1570 pilA2, probable PilA2 pilus assembly protein
MKNLLARFARNESGATAIEYGLIAGLISVVLITVMGTIGTGLTTRFTAIG
TALTGG
>SMa0163 pilQ2, probable PilQ pilus assembly protein
MQRFDEIVIGDPEIATVTPLTDRSFYILGNELGSTSVTIFDAEKNPVGII
DIEVTLDTKLLSSTIRQSVPGSSVKVTSANGRIVLSGSATDAVAATQAEQ
IASRFAGDEEVINSIKITSSQQVQLNVRFVEINRDVGKELGTQISAAYAW
SNGSVEFNSSPRATSNTPAGSLIGSLIGEGYSVDVAIDALEDRGMARRLA
EPNLIARSGETSSFLAGGEFPIPISEQDGTITVSYKKFGVGLDFTPTVLS
DGLIALDIEPEVSAIDNTASYRVGNIAIPGFSVRRARTSVDLKSGQSFMI
AGLLQSENNLITQRVPGLGQLPILGALFSSKAYQRRETDLVIIVTPHLVK
PIDPLKKVASPSDRTKRPTEAEFFLGNIDEVEVNGRGRSASRQARVVRAP
SSGHFLELQ
>SMa2395 repA2, RepA2 replication protein
MTRHLSSLAFMENFSSKLEMALNNLSMAQFPPNARRTMRKFTSTEVATLL
GVTEAYIRQVAAKEQGPEPEIANGRRFYTLEQVLELRMALAANGRKKWMN
PRRTGNEQCQIVAVTNFKGGSSKTSTTIHLGHYLALKGYRVLAVDLDPQA
SLTALHGSLPEFDYRGGDTLFSAIRFDDPVPTKSIIHKTHIVGFDVICAG
LELTEFETAVALEMRRSAGTGFLLRVSQALEQVADDYDFVLMDSAPSLNF
LTLSSLTAATGVIIPVPAHMLDVDSTAKFLELAGSYMQILNEVGTAAQWD
FAKFLITKFEPNDHPQANMSALMRQVFGEDLLLNSVIKSTAVADALTWKQ
SLYEVQRSRFSAPKTYDRAIESINAANAELEGLFWKAWGRE
>SMa2393 repB2, RepB2 replication protein
MGERMSRKDSKGLFANVLGQLENSAEKGGMQRSTSPHLLKVAAGVRQMQE
RSELAERLLKDGGQIVEIDPDEIMESAIRDRFDSGYSEAGIADLLESMRE
HGQSTPGLVRPVRGAARPFQIVFGRRRLAAAKLLGIKFKAIARELSDEDA
IVLQGEENSNRKDLSFIERCLFAQSQEAAGYRRDVICKSLSTGRSHISEM
IRIAAALPREILMQIGPAPEIGRRRWIEFEVRWAAHREPAKVAQLVLEQE
QIQASSSDLRFTAVFEALTKVDVRAAASSTSDLISHGLVLGQIQRGKSAA
KLTFNKSVPSGFVDFIAGQIESLHDQFMQKQSSKQGD
>SMa2391 repC2, RepC2 replication protein
MEIGSVTTPFGRRSMTLALLAGQFISRDIEPGKSVDKWKLFRSLCAAKRR
LGISDRTLAVLNALLSFYPENTLSGETSLVVFPSNVQLSLRAHGMAEATL
RRHLAALVDAGLVARKDSPNGKRYARKTRDGAIGTAFGFSLAPLLARSGE
IERLAAEAEADRMELQRLRERLTLCRRDIAKLIDVVLVEVPGHWTSMQED
FRSLVQAIPRTPDIADLTPLVVAFETLREQVANQLQKHIKPHEMSGNPDQ
TERHKQESKPESIIELERCSRNERLDGQAIEPANKAKRAFPLGMVLKACP
EIMNYGPDGAVRSWRDLMIAAVTVRSMLGVYPSAYQEACEVMGPENAGVA
VACILERAGHINSAGGYLRDLTAKARQGKFSLGPMLMALLRANGFDGRGV
S
>SMa2400 rhbA, RhbA diaminobutyrate-pyruvate aminotransferase
MPADLAARTSSKIFNGVDLMDASARADNAFYLDRQERRESNARSYPRRFP
VALKSASGCIVTDVDGRSYLDCLAGAGTLALGHNHPEVIETLQQVLGSGL
PLHTLDLTTPVKDRFVSDIFGTLPAGLRDEAKIQFCSPSGTDAVEAAIKL
AKTATGRTDLVSFRGAYHGMSQGSLSLMGSLGPKASVGQLVPGAHFFPYP
YAYRCPFGRGGNETATLAAEYFERALRDPEGGINRPAAVILEAVQGEGGV
IPAPVEWLRAVRRVTRDLGIPLIVDEVQSGVGRTGSFYAFQKAGIIPDVV
VLSKAIGGGLPLAVVIYREDLDLWKPGAHAGTFRGNQLAMAAGSKTLEII
ERERLVERAAIAGRRLRANLERIAAQTPYIGEVRGEGLMLGVEVVDPEGL
PDALGHPPHGQEIARMIQHEMFRAGIILETGGRFGSVLRLLPPLVISDAE
IDQVSGALAAAFERLGRKAA
>SMa2402 rhbB, RhsB L-2,4-diaminobutyrate decarboxylase
MNINVAAFRTPPTKQTDHADQILGTDSESRRVFRNAMLQAIDMVVDQTAA
ASSLYSGTSFQGLRGLIDDLDPLPEVGTGIAAALAEIGRPALEHAMVVGH
PAAMAHLHCPVAVPALAAEVLISATNQSLDSWDQSPFATLVEERVLACLT
QLAELPASASGNFTSGGTQSNMTALYLAAVRCGPDARKAGVVLTSAHAHF
SIRKSAAILGFAEDAVIAIAADADGRMSVPALKAELLRVAGEGRIPVAVV
ATAGTTDLGAIDPLVEIADLAAAQNVWMHVDAAYGGGLLFSRHRSRLEGL
EHAHSITLDFHKMLFQPISCGVLLLRDRADFAPLASKADYLNPEDAVFAD
APNLVERSMQTTRRADALKILMTMRAIGRDGLDALICQTLQNTHAAAEAV
KTREYLSLAGPPSLSTVLFRYVSARGPKFADAITLKTRAALFNAGIAALA
TTVLDGRVHFKLTLLNPRSTPDVVHRILDAIGETARELETHHARP
>SMa2404 rhbC, RhbC rhizobactin siderophore biosynthesis protein
MHAHNAATLTLRSLLNCVAREFPDHVRWLETQGHLRFMLTFPEGGGSLGL
PAHYRSATGHHLFGEPVMLTDENGAKTVDAVEAISAVIERLEPSIAAKDG
RVDLLNRTHSSRLLIEAALHARNGDLAALAGDEVSFVAAEQGLIAGHGIH
PCPKSREGMTEAESRRYSPEFAAGFPLRWFAVESELFHTGHSQGSPSAEE
WLKEAMGSDIDALKAPLPAGDFSLLPVHPWQADQMLKDPTVAALVAAGRM
IDCGEAGKPWFPTSSVRTLYRPDASFMLKMSLGVGITNSVRVNLARELLR
GDDMYRFRRHELWQDFSRSYPGLTLIPDPAFMGVKIDGALIDGLSVSMRE
NPFTGANADRNVSLLAAVCEHLPDRGSRLGALIRNRAHLERRPLDIVARD
WFERFLTIFVRPIFGLYLRHGIAMEAHQQNIVVEIEHGYPIGLFYRDNQG
FFHHERAHGALVEALPGFGEPSESVFGEEPVDERLLYYAFINSVLGMVGA
LGREGLVSETALLAMLRRELLRLEALEGANSGIVRKMLAPTLQCKANLKT
RLARMDELVGPLETQSVYLQITNPLFETEKVLAHA
>SMa2406 rhbD, RhbD Rhizobactin siderophore biosynthesis protein
MLERSEPDTALAYSRYDPDIRRTISFRLLEKQRDLKLLWRWMNQAHVVPQ
WKMAKPIEDIAAYIDINLADPHQDPYIGLIDGTPMSYWEAYWAKDDVLGR
YYPAEKKDRGWHMLVGEPSFFGRGIAPAVIRAFTRFLFLDDPGTQKVVGE
PSVAARRLLRYAPACAFEEQGEIDLPDKRAKLMFCYRERFIQQFGL
>SMa2408 rhbE, RhbE Rhizobactin siderophore biosynthesis protein
MIMTDFDLAGIGIGPFNLGLAALLSSHENLSNVFLERKPAFRWHEGLILP
GTTLQVPFMADLVTMADPTHRLSFLNYLAVHDRLYKFYFYENFMIPRQEY
DHYCRWASQQLSACRFGEEVVDVAHESASDSFIVESRSASGGKQQYRSRN
IAIGVGTAPFLPKWAQIKTLAPLMHSSEFGRRRLELSKRRRVTVIGSGQS
AAECVLALLNDLTPEMVAAGASIQWITRSAGFFPMEYSKLGLEYFTPDYM
RHFHRIAPVRRREIVADQGLLYKGISFSTIGEIFDLMYERSVGGRDPGLA
LFSNCAVETLESAGGSGSFRIGINHNHLDEKATVETDAIVAATGYRHAWP
EWLGSLKGSVLDTCEWGDLVVGGDFRARRSDGGKGHVFVQNAETFHHGVG
APDLGLGAFRNAVIVNQLLGREHYRVNASASFQKFGLPSSQTAPSSISGD
FYAHAS
>SMa2410 rhbF, RhbF Rhizobactin siderophore biosynthesis protein RhsF
MLMHHDPLLPGAFSEELWQQVSKRLLAKVIEEFAYERVFGVVEEKPGHYR
IDIDELQYRFKAKRYVFDNLSVDPASLFKRHGNSDAPLHDPLAFCAEVLP
KLGVKPMTVAHFIKELGNTLVSDAHIAARASKTGAELAELDDICMEGETT
GHPWVTVSKGRIGLGYSDYLAFTPENRTPTEVLWLGVSKERASFIAEETL
TNEGLVREAVGVSRFESFCAKLAARGGSVDTHYMMPVHPWQWDHMIVPHF
AADIAAGHIVFLGKGDDLYLPQQSVRTLSNISHPEKSTLKLCMTILNTAV
YRGIPGKRALTAAPLTTWLDQLLARDQFLSEECGLVLLGERAGMHYVHPQ
FSTIEGAPYQFNEMLGCMWRDSLSAHLKSGETGLPLAALLHAGTDGKPVV
QALAEKSGMTVSEWTARLFDVVIPPVFHLLAKHGLAFSAHGQNATLILKN
GRPERLALRDFIDDVIVCDQAFPESATLPEEVRAVLLCLPADFLIHFIQT
TLFICVFRYMSVLLDQRSGLPEHAFWGLARSSILAYQKRFPEMASRFATF
DLFGDEYPRLCLNRVRLFTHGYADDDERPVPDFQGMVDNPLVAFDKRSNA
A
>SMa2412 rhrA, RhrA transcriptional activator
METIRPLKFGTLSLPDRESRLVCRSILLDMLGEATIAPDEGDLTGVTGLF
WKYVSLSLATVYFPRTMLRVNASGMGDSGVVILRAMDSPLVIRHRRIKVE
AARADVIFLPSDASSEITLPEGGRFDCAHLPAYALASKRDLLKPIMMQPL
AAECLPLQLLTNYAGYLLRQEYQSEEHAGMMVAHFYDLLPVLAQDIGNVS
PRETPHNRMASIKMRVEQNLANGSFSITDVAEAERITPRAIQKFFSREGT
TFSRYVLGRRLSLAKSLILAEGEATSISQIAYNVGFNDLSYFNRTFRSRY
GVRPSDLRRLAAAA
>SMa2414 rhtA, RhtA Rhizobactin receptor precursor
MGNNENGGISFCVFVVVIGFGTGAVAQEPANQSEAVTSLEEIVVTGGRSA
QQISEIARTIYVVDSDQIQAEARSGKTLQQILGETIPSFDPASDGARTSF
GQNLRGRPPLILVDGVSMNSARSLSRQFDAIDPFNIERVEVLSGATAIYG
GNATGGIINIITKKGKDAEPGLHAEVTGGMGSGFAGSQDFDRNAAGAVTY
NSENWDARLSIAGNRTGAFYDGSGTLLIPDITQTSTAFNERIDLMGSIGY
QIDDDRRVEFSGQYFDSKQDSDYGLYYGPFFAALADPSLFETRSGYESDF
NPQTRRSMLNVTYTDNDVFGQQLLLQGSYRTERIKFHPFPASGNSETGPY
FYGSSQDTDYYGIRAALVAEPTDALKITYGIDADMDSFTARQNIFDMVAA
GQSGGLDFNTIGKTGLYPSIDVSTVAGFAEASYEATDRLTLNGGVRYQFV
NTEVSDFIGAAQQVAILQGRATSADTIPGGEVNYDAALFSAGATYQLTNT
QQVYANFSQGFELPDPAKYYGIGNYSFSGGHYTLVNSVNVGDSALEAIKT
NSFEIGYRLDDGTFNLETAAYYSLSDRSINLNRSSLAVEIIDRERRVYGI
EGKAGVKLDHGFDVGVLGHWVRTEVKGADGWEKDSVGSASVSKLGGYVGW
TNDALSLRFSGQHIFELTDAQNFTIDDYTLFDLTGGYRFENTDTTLNFGI
HNVFDTDYTTVWGSRAKALYGGLADESVFDYKGRGRTFAVSLTKVF
>SMa0143 rpoE6, putative RpoE6 RNA polymerase sigma factor
MSERLSRRSIPSSSQIAEFREMSNAVKDVGERLMAFLPNLRRFAISLCGS
RDVADDLVQSACERALASAERFEPGTRFDAWIFRILRNLWIDQVRRQKTA
GVQDDITERHDIAGSSGERETEARLTLKTVAEAITELPDEQREVVLLVCV
EELSYREAADVMGIPIGTVMSRLMRARRSLAEAAGITQATGRSQSMKGAN
E
>SMa0011 selA, SelA selenocysteine synthase
MSGPVDLRALPSVDQMLNAAAVSPLVEQHGRAVVTDELRKVLGEVRLAVR
SGGALPGKDGIVAALLSRLDDRSRSNLRPLFNLTGTVLHTNLGRALLAQE
AVDAAVDAMREAAALEFDLDSGGRGERDSHLRELLCELTGAEDATVVNNN
AAAVLIALNSVGAGRQAIVSRGELIEIGGAFRMPDIMERAGVDLVEVGTT
NRTHAKDYVKAIGPETALILKVHTSNYRIEGFTAEVPGAELAAIAHERGV
VLLNDLGSGSLVDLSRYGLGREPTVREAVAEGADLVTFSGDKLLGGPQAG
FIVGRRDLIAEINRNPLKRALRVDKIRIAATAATLKLYRDPDRLASRLPT
LFMLSRVQAEVRAQAERLAPQVGAMLAPSGYAVEVCSCSSQIGSGALPVD
TIPSAGLRIVGSSGSALEALAALFRSLSRPILGRLRDGALVLDLRCLSDE
AEFLKTLSEGSGDAVA
>SMa0015 selB, SelB selenocysteine-specific elongation factor
MIVGTAGHIDHGKTTLVKALTGVDTDRLKEEKARGITIDLGFAYARFAKD
AVTGFVDVPGHERFIHTMLAGAGGIDYAMLVVAADDGIKPQTLEHLAILD
LLGVSRGLVAITKADLADPARLENLTDEIGAVLSSTSLRDAEILPVSVAA
GQGIELLKERLAAAECATAASAVGGRFRLAVDRSFTLSGAGTVVTGTVLS
GSVGVGDQVTVSPAGRSARVRSIHAQNQRAERGFAGQRCALNLAGEGISK
DAITRGDMVVDPHLHAPSDRLDADLSVLESETKPIGEWFSARFHHASAET
GIRIVPLEGPLLPGERRRVQLVLDRPIAAAVGDRFILRDVSARRTIGGGR
LLDLRAPARKRRSPERLSYLQAASLSHAGEALAALLDVPPFLVDLDVFAR
DRALSEAELQNAIRSASAEVIEGSAVRHALSKRQRAAFSDEVQRVLSAFH
VENPDLQGIGRERLRLQVTPRLPPPAFLVALRAEQTAGRLVLEGAFVRLP
GHEVRLSEKEEELYARILPHLEGEERFRPPRVRDFAEALGVDEREIRRIL
KLCARLGRVDQIRHDHFFTRQTTAEMVAIIRQVAANAERGEFSAGLFRDR
VNNGRKVAIEILEFFDRQGVTIRHGDVRRVNPHRLDLYEGPVPEADEGRG
SSPVERPDFKFYRAGS
>SMa0028 selD, selenide, water dikinase (selenophosphate synthetase), SelD
MNTLAPRLTDLAHGGGCGCKLAPSVLQQLLANQPAARPFAQLLVGNDMAD
DAAVWQVDDNTCVIATTDFFMPIVDDPRDFGRIAATNAISDVYAMGGKPI
LALAIVGMPINKLDSTTIAKILEGGASICAEAGIPVAGGHSIDSVEPIYG
LAVIGLCHPSEVRRNGGVKAGDALILTKGIGVGIYSAAIKKNALPPGCYE
EMIGSTTLLNRIGADLAADPDVHALTDVTGFGVLGHGLEMARASQLSLTL
RLSAIPFLSQAYMLAEQGFITGASGRNWASYGADVVLPDDLPDWQRLLLA
DPQTSGGLLVACAPEKAGALVEKVRLAGYPLAGIVGTAAAGAAQIKVIS
>SMa2139 sgaA, probable SgaA serine-glyoxylate aminotransferase (SGAT)
MRNGTSHLFVPGPTNIPDAVRRAMNVPMQDMRAPDFPDLVLPLFADLKGV
FRTDNGSIFLFPGSGTGAWEAAISNTLNRGDRVLMSRFGQFSHLWADMAG
RLGLDVECLDVEWGEGVPVEEYRRRLDADKNRRIKAVFVTHNETATGVTS
DVAAVRAALDDTGHKALLFVDGVSSIASIEFRMDDWGVDLAVTGSQKGLM
LPAGLGILAVSPKALEAHASSTIERCYFSFEDMKAPSETGYFPYTPPTQL
LLGLRASLDLIFAEGLDAVIARHHRLAEGVRRGVHAWGLNLCATEKKWWS
DTVSAIVVPEDVDARQVIANGYSKYRTSFGAGLSKVAGRVFRIGHLGDLN
EVMCLSALAAAEMSLRDAGAKIEAGSGVAAAQEWYRSQIGLAAPNLQERA
A
>SMa0838 syrA, SyrA protein involved in EPS production
MSFRFVCLILWLLLCASSLAIYFALQPCPGFIVTTLACLLLFQLAYFGSV
LLLVCLAAIAQLSARLRIFGIFSENRNHSSK
>SMa1698 syrB, SyrB regulatory protein
MADESNTGPVAAAVAADAEVKVPTAKKLRSPRPQKAAAEPAQPKAPAAKP
RRYSEQERNDKLKLIETQVSEGNTLKNAIQSAGISEQTYYHWKGAAKPVG
KKDAKSTKPLPAGDEFADLVKLEEENQKLRKRLAEKLRTENAELRKRLGL
D
>SMa1586 syrB2, SyrB2 transcriptional regulator
MADESNTGSIAAAVAPNADVKAPAAKKKRSPRRQKAVAEPRRAVSETPAA
KPRRYSEQQRKEKLKLIETQVTEGKVTLKDAIKSAGISEQTYYQWKRTVK
PVEQKAEKRLPTGEELADLVRLEEENQRLRKLLAEKLRAENADLRKRLGL
D
>SMa0849 syrM, SyrM transcriptional regulator
MDQPTWKRPHRAKFAGVSDAAQQRQMPNLASIDLNLLVDLEALLQYRHIT
QAAQHVGRSQPAMSRALSRLRGMLKDDLLVAGSRGLVLTPLAECLTQMLP
SVLDAIRQMMNLSLAPAQRRWKVTMAMPDHQAVVLLPHLLPRLHERAPHL
DIVTDPLLGGALGLLEQGEIDVVVGQMGAAPLGYLRRRLYADSFTCVLRH
NHPALAQEWTIEAFAALRHVAIASEPDELFGQIYDRLTKLGLQRGDPMVV
STVLTAAVLIAATDSVLVVPSRVATRVAAMLSLAVIPPPVELRPYEVALI
WHERCHRDPEHRWLRGEIAAAASTAG
>SMa0485 thrC2, probable ThrC2 threonine synthase
MIKYVSTTGGIEPVGFDEAVLQGFAADGGLFVPDRIPVIDQEQLQAFSTL
SFQDLAFELVSLYIDASIIPRQDLRRLIDNSYREFQRPDIVNLVPIRGNR
DTYVLELFHGPTQSFKDMAMGFLMQVVDYLLGQRRERLNIVLATTGDTGP
AAAWAAAGKQRIDCWPLYPRGMISREQERQMTTLRADNVHPVGVENCPDG
GDDLDLVVAELFSDEKLKRTLALSSVNSINWCRVMTQTVHYFYSYYRAVE
RVGDPVVFSVPSGAFGNLFAGYLARSMGLPVARFVCANNVNNALHTAFSR
GVLPRHDLVQTPSSAIDIVAPYNFWRLLYFATNRDTARIRQWMKDFAARR
EVVLDVETTKTIQGGFISASISDEETLATVRSVYEAEGHYLIDPHTAVAV
AAVEAVRDSLPAAAAIVCFATAHPAKFPDVIKRALDVEELPAAGHHPVID
EARDACEHLRICQLENLRSNLIGEMTRQVARRRG
>SMa0934 traA1, probable TraA1 conjugal transfer protein
MAIMFVRAQVISRGAGRSIVSAAAYRHRARMIDEQAGTSFSYRGGASELV
HEELALPDDIPAWLRAAIDGRSVAKASEALWNAVEAHETRADAQLARELI
IALPEELTRAENIALVREFVRDNLTSKGMVADWVYHDKDGNPHIHLMTAL
RPLTEQGFGPKKVPVLGEDGEPLRVITPDRPNGKIVYKLWAGDKETIKAW
KIAWAETANRHLALAGHEIRLDGRSYAEQGLDGIAQKHLGPEKAALARKG
IAMYFAPADLARRQEMADRLLAEPGLLLKQLGNERSTFDERDIAKALHRY
VDDPVDFANIRARLMASDELVLLKPQQIDAETGKAKQPAVFTTREMLRLE
YAMARSAEVLSRRKGFGVSNARAAAAVRSIETADTEKPFRLDPEQVDAVR
HVTRDNAIAAVVGLAGAGKSTLLAAARAAWEGEGRRVIGAALAGKAAEGL
EDSSGIRSRTLASWELAWENGREQLNRGDVLVIDEAGMVSSQQMARVLKA
VEDAGAKAVLVGDAMQLQPIEAGAAFRAISERIGFAELAGVRRQRDAWAR
DASRLFARGKVEEGLDAYAQQGRIVETETRAEIVDRIVADWADARRDLLQ
KSADGEHPGRLRGDELLVLAHTNDDVRKLNEALRNVMIGEGALAGAREFQ
TARGLREFAAGDRIIFLENARFVEPRARRLGPQYVKNGMLGTIVSTGDRR
GDTLLSVRLDSGRDVVISQDSYRNVDHGYAATIHKSQGSTVDRTFVLATG
MMDQHLTYVAMTRHRDRADLYAAKEDFEAKPEWGRKPRVDHAAGVTGELV
KEGMAKFRPNDEDADESPYADIRTDDGTVQRLWGVSLPKALKDAGVAEGD
TITLRKDGVERVKVQVPIVDAQTGEKRFEERQVDRNVWSASQLETAAARQ
ERIERESHRPQLFKQLVERLSRSGAKTTTLDFEDEAGYQAQARDFARRRG
LYHLSLVAAGMEAEVLRRWAGIAEKREQVAKLWERASVALGFAIERERRV
AYNEERTETLSTGIPSDGKYLVPPTTTFSRSVAEDARLAQLSSQRWKERE
AIVHPVLAKIYRDPDGALAALNALASDAAIEPRKLAEDLGKAPDRLGRLR
GSELVVDGRAARDERTAATVALSELLPLARAHATEFRRNAERFGIREQQR
RAHMALSVPALSKTAMARLVEIEAVREQGGDDAYRTAFTYAVEDRLLVQE
VKAVNEALTARFGWSAFTAKADVIAERNIAERMPEDLAPERREKLTRLFA
VIRRFAEEQHLAERQDRSKIVAGASVELGKETFAVLPMLAAVTEFKTTVD
EEARERALAAPHYAHHRAALVETATRVWRDPADAIGKIEDLIVKGFAGER
IAAAVSNDPAAYGALRGSDRIMDKLLAVGRERKGALQAVPEAASRIRSLG
ASYASALDAETRGITEERRRMAVAIPGLSPAAEDALKRLAAQIKNKDGKL
DVAAGSLDPHIAREFAKVSRALDQRFGRNAILRGETDVINRVSPAQRRAF
EAMRDRLTILQQAVRVQSSQEIISERQRRVIDRARSVTR
>SMa0933 traC, probable TraC conjugal transfer protein
MAAKSSIADIDAQIEKLRARRRSLIVKSAERFARAATKSGLAEMEITEEE
LDRIFEEMAARFRKGEKKGVDHATASPHRPAAGATGTAAEVSHDG
>SMa0930 traD, probable TraD conjugal transfer protein
MTADRKNEAREKFRLGAIVVRAGLTKADRAFLLGGFIELARVTPGSAEHR
RLRDIGEEAFKAPALDGGSPGTGETAEWH
>SMa0929 traG, TraG conjugal transfer protein
MALKAKPHPSLLVILFPVAVTAAAVYVVGWRWPGLAAGMSGKTAYWFLRA
APVPALLFGPLAGLLAVWALPLHRRRPVAMASLACFLTVAGFYALREFGR
LSPSVESGALSWDRALSYLDMVAVVGAVVGFMAVAVSARISTVVPEPVKR
AKRGTFGDADWLPMAAAGKLFPPDGEIVIGERYRVDKDIVHELPFEPNDP
ATWGQGGKAPLLTYRQDFDSTHMLFFAGSGGYKTTSNVVPTALRYTGPLI
CLDPSTEVAPMVVEHRTRVLGREVMVLDPTNPIMGFNVLDGIEHSRQKEE
DIVGIAHMLLSESVRFESSTGSYFQNQAHNLLTGLLAHVMLSPEYAGRRT
LRSLRQIVSEPEPSVLAMLRDIQERSASTFIRETLGVFTNMTEQTFSGVY
STASKDTQWLSLDSYAALVCGNAFKSSDIVSGKKDVFLNIPASILRSYPG
IGRVIIGSLINAMIQADGSFKRRALFMLDEVDLLGYMRLLEEARDRGRKY
GISMMLLYQSLGQLERHFGRDGAVSWIDGCAFASYAAVKALDTARNISAQ
CGEMTVEVKGSSRNIGWDTKNSASRKSENVNYQRRPLIMPHEITQSMRKD
EQIIIVQGHSPIRCGRAIYFRRKDMNEAAKANRFVKAIP
>SMa1079 tspO, TspO Tryptophan rich sensory protein homologue
MEGLACLAGGSLHLLQMDMHSLLVLVAFEVASFAAAATGVIFRPGDWYKQ
LNKPRWRPPDWLFAPVWAVLYASIGLSGWLVWQEAGIAGAALPLGTYAVQ
LLLNAAWTPIFFGLHRPGLAAVEIMVLWAAILATTVMFHPVNAAAALLLV
PYLAWVSFAAALNLSIWRRNRSKTLSQSTR
>SMa1406 ttuD3, putative TtuD3 hydroxypyruvate reductase
MSPEDRDFLSELFEAAVGAADPKLALRARLPQRPRGRTVVVGAGKGAAQL
AAAFESLWGGPLEGVVVTRYGYAVHCDRIRVIEAAHPVPDCNGLIASHAL
FEAVRGLTPDDLVVALFCGGGSALLPCPPEELALEDEIALNRALLASGAP
ISVMNAIRKQVSRIKGGRLAAACHPAKVISFIVSDVPGDDPAQVASGPTV
PDATDRAAARAMRDAWRIELPERLVDWLKGENGTAPSPNDPVFAGHEVQV
IASARLSLEAAAARADALGIPAIILSDAIEGEARDVGKVHAAIAREVVLR
NRPFERPVVLLSGGETTVTLRGHGRGGRNTEFLLSLAIAAEGLSFASLAA
DTDGIDGSESNAGAFADGSSATRLRALGRDPVALLSGNDAWTAFNCLEDL
FVPGPTGTNVNDFRAILVR
>SMa2321 uvrD2, putative UvrD2 DNA helicase
MATATHLEKLNERQRCAVEYGIGAEEGAQAGPLLIIAGAGSGKTNTLAHR
VAHLIVNGADPRRILLMTFSRRAAAEMSRRVERICAQVLGRSSGMMTDAL
SWAGTFHGIGARLLRIYAEQIGLNAEFTIHDREDSADLINLIRHELGFSK
TESRFPTKGTCLAIYSRTVNAEMRLNEVLRSWYPWVAGWEQQLQELFAGY
VEAKQAQNVLDYDDLLLYWAQMVSDASLADDIGNRFDYVLVDEYQDTNKL
QSSVLLALKPDGRGLTVVGDDAQSIYSFRAATVRNILDFPKQFAPAAEIV
TLDRNYRSTQPILAAANGVIDLARERFTKNLWTDRESAERPKLLTVKDEA
DQANCIVEQVLANRESGMLLKQQAVLFRTSSHSGPLEVELTRRNIPFVKF
GGLKFLDSAHVKDMLAVLRFAQNPRDRVAGFRLLQMLPGVGPQTAGKILD
TIAADPEPLMALGEVPAPPRSGTDWMSLVELLQALRKPGSAWPTEIEMAR
MWYEPHLERIHEDAETRRADLVQLEQIAAGYPSRERFLTELTLDPPDATS
DQAGVPLLDEDYLILSTIHSAKGQEWRSVFMLNVVDGCIPSDLATGTSHE
LEEERRLLYVGMTRAKDQLALMVPQRFFTGGQHAQGDRHVYASRTRFIPA
TLLQFFESANWPVASPNVSERSAKQIRIDVAARMRTMWR
>SMa1321 virB1, virB1 type IV secretion protein
MPAAFLDLAQTCAPIVAAETLAGVVSLESRFEPFAIRINSGVPLSEQPAT
KTEAIAMATSLAAERQDIQLGLGGIGMGELRRLKLSISDAFDPCLNLHAT
ATLLDGYYRLAMKAGADPDHAEQVMLQSYYGRDDPSVGAMVQYDQQVRQE
VKRLGKSLAALMIGDGGQARGITEESPVDVAAEKPPGDRPVDERASVPSW
DVFSSRRRSSVLVFQNSQMEQSE
>SMa1303 virB10, VirB10 transmembrane type IV secretion protein
MAQQDENRIPGERAETVSGRKIDNNPMLKRGAVALAVVAFVGFALWSMGG
EGKRQDNAQPERVVIRQTTNFEPAKEKLEPVQPVPEVKLPTPVVTEEVKE
EDPLLDSARRAPVIAYSSGQKNATSHRDSENPPISADSNFIPLDGDTMGQ
NTANADEQRFNGLLRPTRLEGSRAGTLGNRNFIVAMGTSIPCVLETAMAS
DQPGFTSCVVNRDVLSDNGRVVLMEKGTQVVGEYRGGLQRGQKRLFVLWY
RAKTPNGVIVTLASPATDALGRAGVDGYVDTHWWERFGSALLLSIVGDAT
SYASSRLQDSDVDAQNTTSAGQQAAAVAVEQSINIPPTLNKHQGELVSIF
VARDLDFSGVYGLRVTGSKNKVLDRAVLGDFRPQSTLVTK
>SMa1302 virB11, VirB11 type IV secretion protein
MTEGADATVVRELLSPFAPFLGDRSLYEVIVNRPGQVLTEGAGGWRTYDL
PELSFEKLMRLARAVASFSHQSIDETRPILSATLPGDERIQIVIPPATTR
NTVSITIRKPSSVTFTLNDLKEREFFSETRSANDGASTRDDGLLALYRAG
RFKEFLRHAVISRKNIIISGATGSGKTTLSKALIKHIPEHERIISIEDTP
ELIIPQPNHVRLFYSKGAQGLSGAGPKELLESCLRMRPDRILLQELRDGT
AFYYVRNVNSGHPGSITTVHADSAKLAFQQLTLLVKESAGGRNLDRDDID
KLLKVSIDVIVQCKRIDGRFRATEIYVRA
>SMa1319 virB2, VirB2 type IV secretion protein
MTFSSRIRPIAASTVMATAIMVTMVEPAFAQAAGIETVLQNIVDMLTGNI
AKLLAVIAVIVICIAWMFGYMDLRRAGFWIIGIGGIFGATELVNTIVGS
>SMa1318 virB3, VirB3 type IV secretion protein
MIAGVTMEAMGMNIMLTTILYIVAGSVAYALVGVVFHLVFRALVKHDHNM
FRILLAWIETRGRSRNSAFWGGATLSPMKLARKYDERDLGFA
>SMa1315 virB4, VirB4 type IV secretion protein
MPSLTTLRSRELGPETFIPYVRHVDESTIALDSRALMVMIALEGVSFETA
DILDLNALHRDLNTLYRNIADERLALWTHLIRRRDNSYPEGTFATPFSAA
LNDKYRERMVGEDLFRNDLYLSILWSPARDPADKAAKLLSRLRRARRVGT
ELDEGALKHLRDKVIDVTAALRRFEPRVLTLYEHDGLMFSEPSEVLHQLV
GGRREPIPLTEGHIASAIYSDRVIVGRETIEIRHEADSRYAGMLSFKEYP
ARTRTGMLDAVLTSPFELILAQSFSFVSKADARMIMGRKQNQMVSSGDKA
ASQIEELDGAMDDLESNRFVFGEHHLTLSVFAPSVKELTDNLAKARASMT
SGGAVVAREDLGLEAAWWAQLPGNFRYRARSGAITSRNFAALSPFHSYPL
GQKDGNEWGPAVALLKTASGSPYYFNFHYGDLGNTFVCGPSGAGKTVLLN
FMLSQLEKHDPHVVFFDKDRGADLYVRAAGGTYLPLKNGIPTGCTPLKAL
ELTPENKVFLTRWVGKLVGSATRELSVTELRDISSAIDGLADLPVERRTI
GALRTFLDNTNPEGIAARLRRWERGGPLGWVFDNVIEDIGFGEFGGGKLV
GYDMTDFLDNEEIRAPLMAYLFHRVEQLIDGRRIIIVIDEFWKALQDEGF
RDLAQNKLKTIRKQNGLMLFATQSPRDAIVSPIAHTIIEQCPTQIFLPNS
RGNHGDYVDGFKLTEREFELVARELSIESRRFVLKQGHNSVVAELDLKGL
DDELAILSGRTANVELADAIRAEVGSNAKDWLPVFQQRRSAT
>SMa1313 virB5, VirB5 type IV secretion protein
MIDQTAIAKQIESIAQLKAQLDALNQQIEQAQQLHGSLNKLTDMSDVASV
LNDPAIRKALPADFSAIEGLFKGNGTGVFADSASKFLDGNTTYQTNAADD
FYAQELSRIQNKNAGQMSLGQQIYDAATKRIDGIDQLREKISTAGDAKDI
ADLQARLQAEQAFLQTDVLRMEGLRMVQQAQEQVDEQRKAEDWRQRMDAI
KAALQ
>SMa1311 virB6, VirB6 type IV secretion protein
MPMYEVFAFVDEQFKTPLENFISTGTSNISEWVSGPLTAAVTLYIVLYGY
LVLRGSVQEPILDFAFRAIKLAIIVMLVKNAGDYQTYVTNIFFDVLPREV
SQALNTGTAPSASTFDSLLDKGQASATDIWSRASWPVDIVTGVGGMMVIG
ASFIVAAIGYIVSLYARLALAIVLAIGPIFVALAMFQATRRFTEAWIGQL
ANFVILQVLVVAVGSLLITCIDTTFAAIDGYSDVLMRPIALCAICLAALY
VFYQLPNIASALAAGGASLTYGYGAARDAHESTLAWAASHTVRAAGRGVR
AVGRTFTSKGSGS
>SMa1308 virB8, VirB8 type IV secretion protein
MVSADELKTYFEKARRFDQDRVIQVERSARIAWSVAIVAGILAGASIFAV
AALTPLKTIEPFVVRVDNSTGIVDVVSALTSTAGTYDEAVTKYFAAKYVR
AREGYVWSEAEENFRTVALLSTQPEQARFSAIYRGSNPDSPQNTYGRSAT
ARISIASISLINPNVVSVRYMRTITRGEEVRPTHWVATLTFSYVNSPMSS
TDRLVNPLGFAVSEYRADPEAIN
>SMa1306 virB9, VirB9 type IV secretion protein
MRTTFIATLLLTAAAPTALALEIPRGATQDSRVRFVDYQPYNITRIIGSL
RSSVQVEFAPDEEIAHVALGNSVAWEVAPAGNILFLKPRENQPVTNISVV
TTRRDGSTRSYQMELTVRDGKVEVGQNTYFYVKYRYPADEAERRRQAAAA
RAIAAQAKEADNVLAIHEAYGPRNWRYSAQGAQALEPQSVYDNGKVTTFA
FVGNQEMPAIYIENSDGSESLVPKSVDGNLVLVHAISRKFILRRGGDVLC
VFNEAYDRVGINPDTSTTSPSVERIVRIDAGAVQ
>SMa0340 wrbA2, probable WrbA2 Trp-repressor binding protein
MTRVLVLYYSSYGHIETMAGAVAEGARSTGAEVTIKRVPETVPIEVADKA
HFKLNQAAPVATVAELADYDAIIVGTGTRFGRMSSQMAVFLDQAGGLWAR
GALNGKVGGAFVSTGTQHGGQETTLFSIITNLMHFGMVIVGLPYSHQGQM
SVDEIVGGAPYGATTVAGGDGSRQPSQIDLAGAFHQGEIVARTAAALVAA
RN
>SMa1935 wrbA3, probable WrbA3 Trp repressor binding protein
MTKMLVLYYSSYGHIEAMAKAVANGAKQAGATVALKRVPELVPEAVARSS
GYRLGQEAPIATVAELADYDAIVIGTPTRFGNMASQMKNFLDQTGGLWAE
NKLVGKVGSVFTSTGSQHGGQESTILSTHVVMLHLGMVIVGLPYSFKGQM
RMDEITGGSPYGASTLAEDENHRDRSPSANELDGARFQGRHVAEVAAAMQ
LGRSHLQPELVR