GenColors

Gene list

Applied filters:

COG category: General function prediction only
Gene type: CDS
Genomic element: pSymA

Number of genes found: 197

# Sinorhizobium meliloti, 1021

>SMa1921 Hypothetical protein
MPPRKGTKDYQNEIIRVYYGINNQTLGVQSVWETRGGKLASMSAGHHFRH
QCVAGNIAHSHAVSVAFDLTGVFTLPVDVENHPLVNELEEKASIMRAQRT
HDAFREVLSIVGAK
>SMa0185 possible transmembrane-transport protein
MADNIHTPEQASRREWVGLCVLSIACLIYSMDLSVLFLAVPAIVADLDPS
ASQLLWINDIYGFMVAGFLVTMGTLGDRIGRRRVLLMGAFAFGVASAFAA
FSNTPGQLILARALLGIAGATIAPSTLSLIVNLFKNEAERNRAISIWGTA
FALGGLVGPLIGGILLQYFHWGSVFLINIPVMLLLLAVAPFLLPEYKNND
AGRLDLLSVVLSLATVLPIIYGFKHMAADGFQLAQIVYIGLGLLVGLLFV
RRQRRLSDPLVDLALFRVPAFTASLMVNLAGVFFVFGVFLFQNLFLQLVL
GLSPLEAALWSAPSALVFAVMSFQAYRFTNRFGPVRTVLGGLLVNAAGAA
AMAIAAYAESLIGILGSSMIIGFGFVPVVLTTTGLIVGTAPPERAGSASA
ISETSAEFGGALGIAVLGSLATLIYRMAMNRADLSSLNPVQAEAVSATLA
GAVETARSMPGSTSAVWLETAKSGFSLGFAICCVVATVTLLLLAIVARRV
YATAHIDESTLAPH
>SMa1447 Putative transmembrane transport protein
MLIAAARQLEKGRMTTITVDDALDRAGTGTYQRRLMAIFGLVWAADAMQV
LAVGFTAASIAATFGLTVPQALQTGTLFFLGMLFGAAGFGRLADRIGRRR
VLIATVACDAVFGLLSVFAQDFTVLLLLRFLTGAAVGGTLPVDYAMMAEF
LPARNRGRWLVMLEGFWAVGTLIVALAAWAASLAGVADAWRYIFAVTAIP
ALIGVGLRFLVPESPLYLLRLGKTSEAKAIVDEILVVNGKMRLGAGASLV
PPPPTASAGIFSADLRKRSLMILAIWFLVSISYYGVFTWMPPRLAGEGFG
FVRGYGFLVVLALAQIPGYALAAYGVEKWGRRPTLIGFCLLSALGCLLFV
AAGTAMLIGVSLLIMSFALLGTWGALYAYTPELYPTASRATGMGAAGAMA
RLGGLLAPSLMGLVVAQSFGLAIGIFAGLLLVAAVAAFLIDAETRRVSLA
>SMa0322 conserved hypothetical protein
MKELPASQAYRVLEPGPIVMVSTSDNGKPNVMTMGFHTMIQHDPPLIGCV
IGPWDHSYQALRKTGECVIAVPGLDLAETVVDVGNCSGDRVDKLQRYGLM
TQPARDVSAPLLRDCLANIECRVVDTRLLDPYNLFILEATRIWINENRKE
RRMMHHRGDGTFTVDGGTLDLKDRMVRWRHLP
>SMa0510 putative D-isomer specific 2-hydroxyacid
MPKIELLQVGPYPSWDEERLNANFTMHRYFEAADKAAFLAEHGAAIRGIA
TRGELGANWAMIEALPRLEIISVYGVGYDAVDLAAARERGIRVTNTPDVL
TKDVADLGVAMMLAHARGMIGGETWVKSGDWAKKGLYPLKRRVHGKRAGV
LGLGRIGFEVAKRLAGFDMEIAYSDTGAKDFARDWSFIADPVELAARSDF
LFVTLAASAETRHIVGRRVIEALGPDGMLINISRASNIDEEALLDALESK
VLGAAALDVFEGEPNLNPRFLALDNVLLQPHMASGTAETRKAMGQLVFDN
LSAHFGGRPLPTPVL
>SMa2137 probable glycerate
MRRDRIVKPKVIVTRRWPTEVEDRLTAEFDTRLNETDQPYDRRELRAALE
EADAVLPTVTDKISADMLEGGIRAKILGNFGVGFNHIDTAAATKVGLVVT
NTPGVLTDATADLAMTLLLMCARRAGEGERELRAGKWTGWRPTHLCGSHV
TGKTVGIIGMGRIGQAVARRCHFGFGMDVVFFDSHSIAGLDVPARQLPSV
DDVLATADFVSLHCPGGGENYHLIDDDRLACMKWSAFLINTARGDVVDEH
ALVRALETRRIAGAGLDVFEGEPRVPGRLAERQDVVLLPHLGSATKETRV
AMGMRVIENLKAFFSGRSPPDAVC
>SMa0453 Yle homolog, A. tumefaciens
MYLVDTNIVSEARRGTPQAVSWLRSVDPLSIHLSALSLGEIMRGIALKQR
SDPKTAAHLTEWLRKLRHDHGDRILPITDQIAVEWGRIAAIRPRGDIDGL
IAATAIVHDLILVTRNVKDFEDTDASVINPWETSA
>SMa0708 putative isomerase
MSDRVKKIESFTLTLPRETPYLGKPRPGEEPNGRGYLVRKANRTVYPTFD
RSVLVRIETENGAVGWGETYGLVAPRATMEIIDDLLADFTIGRDPFDAAA
IHDDLYDLMRVRGYTGGFYVDALAAIDIALWDLAGKLAGLPVCKLLGGQR
RDRIAAYISGLPEDTRAKRAELAAAWQAKGFSSFKFASPVADDGVAKEME
ILRERLGPAVRIACDMHWAHTASEAVALIKAMEPHGLWFAEAPVRTEDID
GLARVAASVSTAIAVGEEWRTVHDMVPRVARRALAIVQPEMGHKGITQFM
RIGAYAHVHHIKVIPHATIGAGIFLAASLQASAALANVDCHEFQHSIFEP
NRRLLVGDMDCLNGEYVVPTGPGLGVEPSKEAQGLLKKH
>SMa1898 Hypothetical protein
MAFNTQRLQFSGHSGATLSARLDLPNGPLRAYALFAHCFTCSRDLAAARQ
IGAELAREGIAVLRFDFTGLGSSEGEFASTNFSSNVADLLSAADYLRHHY
QAPAVLIGHSLGGAAVLAVAGEIPEVRAVATIGAPADVGHVLKNFGASLE
EIDKNGEADVDLAGRTFLIRKQFVEDTRAHRIKDAVGRLKKPILILHAPL
DHTVGIENATEIFVAARHPKSFISLDKADHLLTDPEDAAFAGRIISEWLT
RYLAADTPQGAGPIEHVHVRETGEGKFQNAIQAGGHRLFADEPESVGGLD
AGPSPYDFLAIALGACTSMTLRLYAGHKQLKLGRIGVDVSHTKIHAKDCE
ECTETERGDSRKIDRFERVISIEGEVSEELREKIVEIAGKCPVHRTLETV
AKIETVVK
>SMa1644 hypothetical protein
MPQAIVDIQRDIYLALAEHIKTIANGGGWTDFLTFLPMGIIFGAVHAMTP
GHSKAVLATYLTGASAGMRRGLVVSLALSATHVTMAVVIALFSLPLVSLM
LGSAGSAPLLEDVSRGLLGLIGAWMLWSVCFRPPHVHGEGEGVAVGFMAG
LIPCPLTLFVMTFAISRGVPGAGIMFALVMMTGVAITLSSVALVTVFFRT
RMEKLLATRHALLVKISKFVEAFTGLILVVIAMREIFIR
>SMa1853 Hypothetical protein
MKDIPMQTNEIELTAFGPEYLEAAIRLSRQAGWPHRLEDWQMAFALSEGI
VAVEDGRVVGTVLVTPYKRDCATINMVIVDEAVRGRGLGRKLMDAAFRIA
GDRPLRLVATAEGLPLYDKLGFGESDAVLQHQGVVGEIAAPAEPEAASTA
DVEAIAKLDRLAFGADRGALIAYLAKVGEFAVLRRDGRVTGFAALRAFGR
GEVIGPVVAADLDNAKALVAHFIAARPGRFLRVDTTAGTGLSVWLAEQGL
AHVGGGIAMMKPPIRRAADPIANTFALANQALG
>SMa2117 putative oxidoreductase
MLEDARTRWHPIAATDDLPLRHVFHGQLLGREFAVWRADDGYVNVWENRC
LHRGVRLSIGINDGRELKCQYHGWRYSNRTAGCTYIPAHPADAPARTITN
RSFQAVERYGLVWSSEDPRGDLPVVEGIGEDDLLVLRAIPVNASADLVVE
RLQSYRFQPNSEIAGAGANVELVAATQAAVALRSYQGRAETLGVFFVQPV
DSGRSVIRGVVSGRPNDAQLTVVLRCHNEALSTLRADLEREAAALPAPTP
IEPIFERVSEELASLPELDSRGRKAAIRVQVARKWQTADGIMAFQLRPVR
GLLPTFQPGAHIDVHLPNGLVRQYSLTNGPGETDCFTIGVKLDPASRGGS
QCLHDSVREGDVLAISEPRNNFPLRRDALKTIFVAGGIGVTPLLAMAQTL
NNQSLDYELHYFAQNEQQLAFSECRQALGDAVKPHLGLSPGDTVKELRRL
LSAYLPDTQLYVCGPGPMLESTRSLAAEAGWPEAAVHFEYFKNTNVIDDS
SSFEVALARSCLTIKVAAGQSILEAMREAGVDLPSSCEQGACGTCLATVI
EGEPDHQDVYLSPSERASGTKIMTCVSRSKSARLVLDA
>SMa2343 putative oxidoreductase
MDKVILITGASGGIGEGIARELGVAGAKILLGARRQARIEAIATEIRDAG
GTALAQVLDVTDRHSVAAFAQAAVDTWGRIDVLVNNAGVMPLSPLAAVKV
DEWERMIDVNIKGVLWGIGAVLPIMEAQRSGQIINIGSIGALSVVPTAAV
YCATKFAVRAISDGLRQESTNIRVTCVNPGVVESELAGTITHEETMAAMD
TYRAIALQPADIARAVRQVIEAPQSVDTTEITIRPTASGN
>SMa1050 Hypothetical protein
MSTSILAAVGSGGIVGFMLGLLGGGGSILATPLLLYVVGVTQPHVAIGTG
ALAVSVNAFANFASHAIKGHVWWRCAAVFSALGVLGALGGSSLGKAMDGD
RLIFLFGILMVVVIGGGIVGGVLGMLLATRLSAYKNILHRLFAALIFVVA
AYILYQSARQAGAHQSLLDPHVVFDGAHAPS
>SMa0592 conserved hypothetical protein
MALRSANILFKDETAGTLVETANGGTRFAYHSDWNEGNIACCFPSTQREH
EWKVGLHPFFQHLGPEGWLREQQARSAHIVEEDDLGLLLRYGADCIGAVS
IRPPDDAAQLPEITEATVSPGRTVSGVQKKLLVTKDDENRFVPASATGSA
LYIAKFNSDRIDNLVRNELLSLRWTAAVLGEREVTGFTASLTAVVDETAL
IVTRFDRRPNGEKLRLEDCAQILSKPKGQDYAGKYDAAYEDIAAIIRQHS
SRAPIDLLRFFNRLIVFTLIGNCDAHLKNFSLLETPTGLRLSPAYDVVNT
AFYDGFDQTLALSIGGEKIHLEAANQAIFRAFGKEIGLPDRAIDQTFKQL
KRQVEKAASIIRPPDAEPADGFVHRFKEIVDNSCLRILET
>SMa0074 putative
MTELSGKTILITGALGTLGRAQAERLGRAGAGLLLLLDRPGAEAGEGFAA
SLAAAHETMAIYVGEDLNNLASAEKRAATLSSEHGGIDILINNAALIINK
PFEEFSLEEYEDQVRVNSSAAFALARAVTPGMKQKRYGKIVNFCSLTLNG
RWDGYVPYVASKGAMLGLTKALARELGPHGVRVNAVSPGAVVSEAEERVF
ADRLQQYNDWIVENQSLKARIQPSDVADLVHFLVSPASDMISGQNIAIDG
GW
>SMa1594 hypothetical protein
MFRTELYRRAQPLQTLSLKINYHKWPSTFGARRCEHGDTTWASTPISSCR
SSVIFRCAMTTLSRATDRRGGGRVVEAEIGSGLNLPFYRPAVREVLPLES
APKRLAMARRVPDPGMPVSFIEGTAVSIFLDDQSVDTVVIAWTLRTIPGG
RGDCRNAARTQIWWQAAVRRTWIGTRCRCALMAGPAHTDLASHCVETGYM
ARPKPMMFRYEGSARTR
>SMa1680 hypothetical protein
MKPPRVSGRSLAEILGRAFWLSARDLPGWWMRAVRSCVCNSRAAVLWTVV
FCSLAALTGSVAASENSKTSMADPLIAVHDGFVDEKTCSSCHADEAAVFA
KSHHAKAMTVADDKSVLGNFNNIQFDRDGVAASFFRRDDRFFVRTEGSDG
KQADYEVKYTFAYEPLQQYLVDLGGGRLQALDIAWDTQKREWFWLGEGSA
AKPGSTFHWTGPFYRWNRTCIDCHSTDPRTNFKPQSNEYNSSYVATSIGC
QSCHGGGAKHVDWARTKAANASTAAADPGLAKVDSNTCFACHARRTRLVD
RYQPGGHFLDQFSPALLRSDLYFPDGQILDEVFEYGSFQQSKMAMAGVTC
FDCHRPHEGTVKAEGNGLCTQCHAETAPERFAGNDPSGAFDTQAHTHHPQ
GSPGALCANCHMPERTYMKVDPRRDHSFVTPRPDLSALYGTPNACISCHT
GQTNAWASEHLDRWYGKAWRERPTIAHAFARAAQNDVAAIENLRRFVTDR
EQPGIVRGSAIGEMTRLDGAATAADVRVAAGDPDPIVRLGAAEAAANLSA
DRRLDAIGFLLADETRAVRVAAARVLGATPSLDLLGARRGAFDAALDDLG
AYAEANADVAETQSTYGSILFGQGRTDEAEKALRQAIILDPTLSGAHINL
AEFYRASGDNEKSEQAYAAAIAANPDRADLRYGHGLSLVRLKALPDAIEE
LTAAMRLDPGNSHYRTTAAIALDSMGRTDDAFALFGPTIAGGATEANLLG
TAIQLGLKLGRYAETLKFAEALARLQPNDPQLEELVGQLQDAVQHGR
>SMa0793 hypothetical protein with local similarity
MADFQGETEMTAEVFDPRALRDAFGAFATGVTVVTASDAAGKPIGFTANS
FTSVSLDPPLLLVCLAKSSRNYESMTSAGRFAINVLSETQKDVSNTFARP
VEDRFAAVDWRLGRDGCPIFSDVAAWFECSMQDIIEAGDHVIIIGRVTAF
ENSGLNGLGYARGGYFTPRLAGKAVSAAVEGEIRLGAVLEQQGAVFLAGN
ETLSLPNCTVEGGDPARTLAAYLEQLTGLNVTIGFLYSVYEDKSDGRQNI
VYHALASDGAPRQGRFLRPAELAAAKFSSSATADIINRFVLESSIGNFGI
YFGDETGGTVHPIANKDAHS
>SMa1770 Hypothetical protein
MKLVWARYALDDRDAIFSYIERENPRAAVHVDEEVVSAGRPLDFPESRRP
GRIAGTP
>SMa1959 Hypothetical protein
MTTNRRSRRAIGRSDNSLRCDSCVGPCMLVQFPKSVAILNFPRYPSNSLM
LQLSKGLMSICEIESTAIAPACDTTSGKRSKPGLSRSLTLLFAAASGLAV
ANAYFAHPLLDVIADDLSLPRATIGFVVGATQLGYGLGLVLLVPVGDLVD
RRNLVIIQSLLSVLALLCVGFAPTEEVLFPALIAMGFFAVATQAFVAYAA
SLARPEERGAVVGTVTSGIVLGILLARTVAGAVVDIAGWRAVYLLSAAFT
LAITAILARVVPAQPKSGPAVSYPKLIGSLFTLFLQEPVLRVRAILAFLI
FADVTTLLTPLVLPLSAPPYSLSHAAIGLFGLAGAAGALGASRAGRWTDE
GFGQRVTGVALTLMLCSWILIGLLPYSILFLVTGVLLLDFGLQAVHVASQ
GLIYRVRPEAQSRLTAAYMVFYSTGSALGSSISTLVYARWAWTGVSMLGA
GIAAAALLFWAITLPKRMA
>SMa1327 Putative hydrolase
MLIIKGNDVEIATEAFGDSAHPPVVLVMGGMASMLWWPERFCRRVAEHGR
FVIRYDHRDTGLSTKYPPGQPGYAFDNAVADVVRVLDGYRISAAHVVGMS
LGGMIGQATALKHPERVLSLTAISSSPVGMNTTHLPASGTAWMDHMNMEV
DWSDRAEAVAYMLEDARLVASTVHPFDEAETRAFIERDFDRSGGYLSATN
HSVLFEISDAWQDRLPEMKVPLLVIHGTADPVFPVEHGAAVATAVDGARL
VEIEGGGHELHPADWDKIISAIIKRTNTRPNE
>SMa1086 Conserved hypothetical protein
MLARDIMKKRVLSISPDHSVSHAARAMLENQISGLPVCDDRGRLVGMLSE
GDLLRRAELGLVSRRDIAGVRAKPEAFIKGHSWRVGDVMTQPVVTVDEDM
PVGRVAELMAAKGIKRIPVMRAEEMVGIISRSDILRAVTASLPDVIANGD
EAVRRAVLARLCSDLGLEKGAIDVTIENGTVSLSGQVESEALREAARVAA
ETISGAGGVRNRLRIVANGGASDG
>SMa2023 Conservative hypothetical protein
MPANAADTIPESQAVKNVVLVHGAFADGSGWKGVYDNLTKRGYRVTIVQN
PLTSLEDDVAATRRALERQDGPVILVGHSWGGTVITETGIDPKVAGLVYV
SALSPDAGETTAQQYEGFAPAAEFVIETTKDGFGYVSPAKFKAGFAHDVS
DADVAFMRDAQVPINMSAFATKLENAAWRTKPSWAVIATEDKAFDQAMLI
HMAERIKAKITKVSASHALFMTQPAAIADTIDQAAKTVSAKKQ
>SMa0636 conserved hypothetical protein
MTTKVVKLSPDDSVRQAAKLMFDHHVSGVPVVDDDGHLLGVISEGDLIRR
AELCSEASVLMADMAIDPDDRANAFIRRCSWRVGDVMTANPVTIEEEAPL
ARVAGLMQERGIKRIPVVRDGELVGIVSRADLLQAIFSTKPDETAAGDEA
IRRSILVRLGENTSLEELDVTVTVTEGIVHFWGQVETAGCRRAARIMAES
VHGVRGIVEHFPDPYTQ
>SMa1331 Hypothetical protein
MAKEEYFLYPTDVESFGFDWGRLALTVAPEVNGAERFSGGVVDLPSGEGH
TRHNHPGAEEIIFVVSGEGEQMVEDENGDPVTQRVGPGCTIYVPESRFHS
TRNTGPGPMQLFVVYSPAGPERALRDLPDFRLIPPGT
>SMa2315 putative aminoglycoside adenylyltransferase
MRADRQHIDQALAATETIRSILREAVLAVYLHGSAVSGGLRPQSDVDLLA
IVDCPVADEQRRDLLAALLRISGRHPRAVGTPRCIELMVFLRADIATPKF
PVRAEFIYGEWLREAFESEELPVPISDPENTLVLAQARQEAVPLFGPDAK
ELLPSIPPEQVRRAMRDALPLLIDSLQGD
>SMa2103 possible sulfite oxidase
MEVPLPLFIVHHQHSPETCPARDPAKGTMLLNHLSRPSAARHGVLIKGEA
VAQGSHSLFFIAEAADEAILQAFLAPLRQAGNVEVTAAMTCAAMVSSGGC
EERPVDVSAEVLDPADACQDAVEAGLLIHSVNPLNGETSVPDLAGGAVMP
NGRFYLRNHFDIPNLGGDNYRLSIGGLVERPLKLSMRELHNLHAESQVVT
LECAGNGRSLFDPAVPGEAWGLGAVSTAEWTGVRLMEVLERAGLRAGATE
LTFRGADSGVVDGHDAPVRFERGLSLDQIRETDALLAYEMNGETLSPPHG
YPLRLIVPGWYAVASVKWLTEIVVTDQPCEAYYQAEKYWYHWVRNGHDER
AQVRLMNVRALISSPEEGENLPRGDTAIRGVAWSGAGNISRVDVSLNGSR
WREARLVGERRRSAWQWWELITRLEETGPLTVRARATDMTGRTQPEHAEW
NRLGYGNNSIHSVAARVI
>SMa1430 Conserved hypothetical protein
MISSPIARRPSPRKGVRLPHEITIPAKQTFRLDRYVDLPLNSGEIRGRTV
CTLVSPGTELGWANGDVFPIRPGYSAVFEVEEVADGVDGIRPGELRFAMG
MHRSTQTHTARDTVPLPAGLRPEIAVLSRLMGVSLTSLMTTKARPGDHVV
ITGAGPVGLLAAQLFKISGYRVTVVDPDPLRRAQLASCGISDCRERVPLQ
SSLQGQVALVLDCSGHEGAVLDACRIVRRLGEVVLVGVPWRKLTEISAHE
LLNAVFFNLVTLRSGWEWELPVHARAFEWEELLGGYNNARHSVFGGFARA
LDWLAEGRIELGGLLRRVPPTDPASLYAEIAARRNEEPFIVLDWTDFDGP
DLKSS
>SMa0447 conserved hypothetical protein
MSMVTQGKLIGQLLAFGALGLLAIGCTLVLWPFLSAILWAAVICFSTWPA
YRLFERAVGGYRALAAAAMTVLVVVVIVAPLALLATTLADNISSLVAGVT
HVLEQGPPAPPDWVRGLPIAGEGLATYWEGLAHNAPAFTIELKKVIGPFA
DVALIGGTLFGAGLLELALSIFIGFFLFLHGRRMTALTRQIAERVAGARA
RRLLSVVGVTVTGVVYGLIGTALAQGLLAGVGFWIAGVPQALLLGCLTFV
LSFVPAGPPFVWGPVALWLFMQESVWWGIFVAIWGLLLVSSIDNFLRPYL
LGRNTNLPVLLGLFGLIGGVLAFGLIGLFLGPTLLAVAHSLFREWIAAEL
EERRQPPSSSTGRDQRSGPRQGG
>SMa0359 hypothetical protein
MTMGTIIRPLQRAEVELVWQIERREVVQEIYEVADGRLHLRPQFYDTREW
PDGEPEIYTPILFDCFDHDGVFLGALLEKNL
>SMa0572 hypothetical protein
MKTIIPRQLARSDVEAAIDYYAREAGTEVTHGFIGALQAAYASIASHPEA
GSLRYAYELGLPDLRSVSLKRYPYLIFYRDQPDHVDVWRVLHAKRDNPQW
MQEPNNH
>SMa1198 possible Copper export protein
MNKPRLDGRRWDAPARMLGISVLACSAWLWLAATAFAHASLVETIPADNA
VLAESPATFSMTFSEAVSPLSLKLVGPDGSSVSLERYEPRDRTLEVEPPS
SLVRGTHVLVWRVISEDGHPIGGSVIFSIGPPGATPRAAAVKIDGEVGTA
IWLAKVALYLGLFLGIGGSFALSWLGRVERSGTVTVHIILGIGLFGALLS
VGFQGLDALAAPLRRLADSATWQAGMSTSFGRTSVVAVLASAMAIFALVA
KGGWGRLLSLAALIGTGLALALSGHASAAEPQWVTRPMVFLHGVGIAFWT
GALIPLGLALARRTPESGYMLRRFSNTIPLVLALLIIAGMVLAVVQVRNL
SALVETAYGAVLLAKLALFVLLFALAVFNRLRLTEPAERRDAPAARRLAR
SIAIETVVAVLIFGVAAVWRFTPPPRALEIAAAQPATVHLHAPRAMANVR
LSPGRAGQVAASIEVFSKDAKVLTPKEVTLVLSNPASGIEAIRRPAQRAG
EANWRVDGFVVPLPGTWHVRLDLLVSDFELVKLEGEVDIRR
>SMa0013 hypothetical protein
MAEALARAQAGDYGSALRIWEPLARAGVARAQNNIGACFAEGLGVPENRE
LACKWLRLAAEAGDPVGQRNYAALHMQGLAGTDADYGIAAEYYRRAAEKG
DAPAQDMLSWLLLEGEIMTADPLEARRWAECAAEAGIASSMTRLGMLHHN
ALGVERDAQKAVYWWLKAAERGDADAQAMLGAACHMGAGTIRNGVTALVW
LIRATEGGSTLAKPFMGPVRDSLSPAEIQEAERRAQEPLSRRAP
>SMa1945 Hypothetical protein
MSARILCQQMFVWHTARMNKNAADIPMTPFLPALAIAGTVITWSFSFAAI
GYALREVEPLPLAAIRFALAAVFAIAWIAWRRPRWFLPRDFVVLAISGLL
GIAAYNVLLNLGQAAVSAGAAGFIVNTQPLFMVLLAVLFLKERFGRWNWV
GTIVGFSGVALIASGQPGGLSFGTGSTLIVLAAACAAAYSILQRPLFARA
EPLDVTGARHRCRRSRPHALATGRRLPIDARASGHLADDHVPGRRSGHYR
SKLLDLRTQEFRCRAGRSISLFGSTVLCWTGVAPAR
>SMa2099 conserved hypothetical protein
MTAVTHSAALEPAERLTALKADIADPEEVARIVVGHDAVISAYSPGLRRH
SAEDAAVLIEKAHASLFEGVKRAGVRRILIVGGVGSLQASLGVDVVDSDF
YPADHRAHTLRNREILRSLRRGEHDLDWTYVSPPLSIKAGERTGRFRLGE
DALLRDEAGESRISAADFAIAIVDELDKGQFIRRRFTAVY
>SMa0545 hypothetical protein
METLVADASIAIKWVVEEEGTDSAVELRSRFRFAAPELLIPECANILWKK
VQRGELSRDEAVLAAKLLERSGIDFVSMTGLLEEATNLSIVLSHPAYDCT
YLIAAQRTGSRFVTADMRLLRIVSERAPGEIARLCVSLPDARNDAH
>SMa0320 putative
MTSLNGKIALVTGASSGIGAATAAKLAEAGAKVGIAARRTDKLEDLKKKI
EAKGGEALVIEMDVVDTTSVEAGVKKLVDAYGSIDILVNNAGLMPLSDID
QFKVDEWQRMVDVNVKGLLNTTAAVLPQMIKQHSGHVFNMSSIAGRKVFK
GLSVYCATKHAVTAFSDGLRMEVGQKHGIRVTCIQPGAVATELYDHITDP
GYRQQMDELATQMTFLQGEDIGDTIVFAAQAPAHVDVAELFVLPVEQGW
>SMa0400 putative zinc-binding
MKAVRLYDIRDLRVEEVAELAAPPPGFVNLEVRAAGICGSDLHNYRTGQW
ISRRPSTAGHEFCGRVTAIGEGVSHLVRGDVVSADSRMWCGTCPACASGR
SNVCETLGFLGEVCDGGFAEAVQLPSRLVFRHDPKLSPHVAAMAEPLAVA
LHAVRRLAVPDGAPVLVMGCGTIGGLSALLLSRLHQGPLLLTDLNADKAA
LVAEVTGGVVVALDGAAIEEALPGTRLRHALDATGSIQAIARALDILSGG
GALALVGIGHGKLDLDPNILVEREISLVGCHAFAGELPEAIELLADLAPA
LQRFIEVLPTLDDVPEAYERLLRGESNALKTIIEVAG
>SMa0383 conserved hypothetical protein
MSAAKSSGGTFAPLAQPVFAVLWTATVLGNTGSFMRDVASAWLMTDLSAS
PAAVAMVQAAGTLPIFLLAIPAGVLTDILDRRKFLIAVQLLLASVSISLM
VLSQTGMLSVSSLIGLTFLGGIGAALMGPTWQAIVPELVKREDVKSAVAL
NSLGINIARSIGPAAGGLLLAAFGAGITYGADVASYIVVIAALVWWPRAK
NANDALQENFFGAFRAGLRYTRSSTPLHVVLLRAAIFFAFASAVWALLPL
VARQLLGGDAGFYGILLGAVGAGAIGGALVMPKLRERLSSDGLLLGAALI
TAAVMGVLALAPPKVVAIIVLLFLGGAWITALTTLNGTAQSVLPNWVRGR
GLAVYLTVFNGAMTAGSLGWGAVGEAVGIQSTLIIGAIGLLIAGLIMHRV
KLPAGDADLVPSNHWPEPLVAEPIAHDRGPVLILIEYKVEKQHRTAFLHA
IDHLSRERRRDGAYGWGVTEDSADPEKIVEWFMVESWAEHLRQHKRVSNA
DADLQGKVLAYHVGPDKPVVRHFMTINRPGAA
>SMa1332 Conserved hypothetical protein
MPLIPRKTILEKFHGMISAGKPIIGGGAGTGISAKAEEAGGIDLIIIYNS
GRYRMAGRGSAAGLLAYGNANEIVKEMALEVLPVVKATPVLAGVNGTDPF
ILMPQFLAELKAMGFSGVQNFPTIGLFDGRMRRGFEETGMGYGLEVDMVA
EAHRLDLLTTPYVFNEEEAIAMTKAGADIVVAHMGVTTGGAIGATSAISL
DDCVSEIDAIAAAARSVRKDVIVLCHGGPISMPEDARYILDRCPGCNGFY
GASSMERLPAEVAIRRQTEEFKALAISTVV
>SMa1653 conserved hypothetical protein
MEFDFAELPEKDRYRLLCAFVGPRPIALVTTIDEQGCKNAAPMSFFNVFS
HDPPLLILGMQTRPDGNSKDTVANIRRSGEFVVHMVDMAIAKEMIITGIN
FPSDVDEIQVSGLTSVSSVKVAPPRIQESPCAMECRVSQILNYGRRSIVI
GEVLQMYVRDECLDASGRYVLPEVYQPIARLHANNYIVADNQFVLTKPDE
FAHHDNAAGYGGSVHEAKGGSTIRIASAADGQKLDETVEQP
>SMa1811 Hypothetical protein
MTTRRSVLKGTLSLMLAPTAMTTLAPPGAAAKAQQVKIQAPGYYRMMLGD
FEITALSDGTAKFPAETLYAGAKDQVAALLAKAFLDSPVELSVNAFLVNT
NERLVLIDAGAGGFFGPALGKLVPNLVAAGYQPEQIDDIILTHAHVDHLG
GLVAGEKIVFLNATVHLNQRDADFWLSSANRDAAPEAKKEFFSMAVQALS
PYRDSGRLKTFADEAEPVPGFKTVLRAGHTPGHSAVAMESKGQKLVFWGD
ITHGDVVQFEEPDVTIGFDENPAAAASARDAAFAEAVKEGYLIAGAHTRF
PGIGHVGTDSDKFDWVPLNYRATL
>SMa1507 Conserved hypothetical protein
MFNLTRRQFLRYSGATGVALGTGSLASFAGAEEPLKIGVVYVSPIAEIGW
TKQHSLGVDAIKKEFGDKVAITVIDNIFMPQDAERVFRELAASGNQLIFG
TSFSHGTPMQKVAPRFPKTAFEHCSGIVHRANLGTFEAKYYEGTFVAGAA
GGHMSKSGKIGFIGGFPIPDIVGPANALLLGAQSVNPEVTCNAIFLNSWF
DPGKEKEAANTLLSQGCDVICSMTDTATGVQVAGEGGAWSIGYASDMAKF
GSGKQLTAFTLDWSSEYLRAAKGVAEGTWKAEARWDGLAAGVVKMAPYNE
AIPADIQAKLKQLEADIAAGKIHPYAGELKDQDGNVKVAAGSVLSDTDIR
GMNWFVRGMIGKLS
>SMa2337 putative transmembrane transport protein
MLAAVVQGSDPIMTIAQTSPAVREGSTAAGAGRLYAVLGGLYLAQGIPTY
LLLVALPPLMRESGASRTAIGLFSLLMLPLVLKFAVAPLVDRWAPWPGLG
HRRGWVVPTQLLVSAGIASMALVEPDRAGTLFAIGICITLLSSVQDIATD
GYAVRHLNGRTLAIGNAVQAGSIALGVIVGGTLTLVLFHKIGWRPTILLV
ACLSLLPLVAAIWMKDRAVASPEAPLRRRASLFGFFRRPNAWMILAFALT
YRASEGLVRGMEGSYLVDSKVPTEWIGYMSGAAAATAGLLGALIAALIIR
KAGLTATLILLGGLRSLCFLAFALNAFGIWPGIAVAMSASAFQTLIRYME
LVAIYSFFMASSSDDQPGTDFTILSCAELVVYLIGTSIAGYVADRFGYAT
LFSSATVISVLGIGLSVWMLERLKARPSRSR
>SMa1151 Conserved hypothetical protein
MFVKEMSRHECNSVIQAGHVARLACCKEGMPYIVPINYAFTGQCLYGFSM
PGQKTDWMRENPHVCLEIEEISGERQWKSVLVFGRYQELPPEGQWHNECM
HAWSLLQSRPNWWEPGGLKPGKPEIAAASPHVFFCVDIDEITGRAAFEGD
E
>SMa1398 Putative
MSNPTAKPVALVTGSSRGIGLAAAEALAREGFSVAINGLTADDELAAAAA
RVSRHGAPVIAVAFDVAELAAHEAALTKIEAELGPLTTLVNNAGVGVLKR
GDLLDVTEESWDRCLTVNAKAMFFLSQVFARRLLARERSPLFHSIVNVTS
SNSVAVAVQRSEYCASKAAASMVSKALAVRLGRENVAVYDVQPGLIATEM
TAPVIDSYRERAEQGLTLFPRVGEPEEVGAVIASLASGRLPYTTGQTISA
DAGMLVPRF
>SMa2225 putative opine oxidase subunit
MSLWDAIVIGAGPAGIGASGLLAENGAKVLVIDEAPGPGGQIWRGVEDVS
DARARILGSDYLAGRDEVRRLRASGAELSFETQAWRVEPEGTVWLKDSHG
IRRERGRRLLIATGAMERPCPLDGWTLPGVTTVGGLQILLKREGMLPGGP
LVLIGTGPLFYLFAAQCLAAGMRDLSLIDTAAAGAIVSALRHVPAALTGK
GPSYLIKGLKLLWMLRRAGVDIYNHSGDLRIKSAADGLEVHFRMREVEHR
LSASHVGLHEGVIPETHLPRALGCRMHWSEAGGAFHPHRDIHLQSSVAGV
YIAGDAGGIGGATVALLEGRLAAMGILASLGRPIDELLLRATRRDRAAHL
AARPLLDHLYQPSPAILTPADGVLACRCEEVTCGEIRAALRAGCAGPNQV
KAFLRCGMGPCQGRMCGMTLTSLAASTHDISMGDAGFLTIRPPLRPISLG
EVADLVEP
>SMa0136 hypothetical protein
MVAALISAAATMIMFFMWLPLQKTAIDPPSQKHAGRRLVPPRCYGYHAGG
RLGEKATAAVPCNGASAGRRPTLPYLLANKFLACWSQPGRPLGDVVEGAV
LKFSRGIDQDRAATAGGLQQIAFARGSAAAAQPSRLPTLVATAAAVAVLY
FARDVFLPLAIAILLTFALAPLVSRLRRVGCPRSVAVIGTVTTAFLFLSA
FGVVIAMQVSEVAQNLPTYQYNIVEKVRTLKETGSESQILERIGRVIERI
STEISRPEPEVRASPEPTPETKPLLVEIFSPQRPIETLKNIINPLLGPLA
TTGLVIVVVIFMLLEREELRDRFIRLVGYGDLHRTTEALQDAGARVGRYL
LMQLVVNITYGIPLAIGLSLLGIPNAVLWGMLAIVLRFVPYIGPVIAAAL
PLFLAFAAAPGWSLLVWTAALFIVLELLSNNVVEPWLYGSRTGLSPLAII
VAAIFWAWLWGPVGLVLSTPLTVCLVVLGRHVPQFEFLEILLGNEPVLDP
KERLYQRLLAGDPDEATDNAEDMLQEKYLVEFYDTVAIPALLLAERDRAR
GALTNTQAAQIAQSANTLIANLEEIAGEEEGEEETSTEAQESDDDNDDAE
EYDLPPGDGKSVLCVGGRSDLDDVTASMLAQTLWIQGADAAHATHEVLKA
GNIKALQLEGRNAVVLSVLDQDFMRHAKFTVRRLKRIAPAARVGIVLWKE
DGRPGTTERDQLIESLQADFVVFGMGDAVREALSDELPRSLKLAHPKIAP
GYAMRRSKRTDTESTVKAD
>SMa1978 Probable haloacid dehalogenase-like hydrolase
MSILLPVKQESDMIRSGQRPEWLTFDCYGTLIQWDEGLLAAMERILAGKN
RSIERDAFISVYDRYEHRLERERPHRSFKNVSATALALAMGEFSLDVSPD
DADILTSSISRMPPFPEVVATLTRLKAAGFKLAIISNTDDAIIAGNVAQL
GGSVDRVITAEQAGAYKPARQIFQHAWRELGIEKEQLVHICASPHLDLAA
ARELGFRTIWVDRGTGRKPLADYVPNETVARLDEVCGLLSAAGWME
>SMa1053 Conserved hypothetical protein
MSAYLPSLAGGMLIGASAVMLLLLNGRIAGISGIVGRLLQGVGMTTNLAF
VLGLLLGPLAYLLMFGSWPAVQITAGWPLIIIAGLLVGFGSRMGSGCTSG
HGVLGLARVSPRSMVAVATFLTAGVAAVALLRGLAL
>SMa0959 probable
MSRGSTSTIAVVTGGTSGIGLATARHLLERGNRCAIFGQRPINVESAAEA
LSQDFGSERVFARSVDLAEPTQITSFFRELDERWGRAEILVCNAGISPKG
PDGPTPFQEITLEEWNAVLSVNLTGTMLCCQAALPGMVAQNFGRVVLVGS
IAGRALPKIAGTAYVASKAALAGFARSLIARYAGQGITVNVVAPGRIATE
MAGPRDSLVNRAAVARIPAGRMGEPEEVAAAIGFLTSDKAAFINGAIIDV
NGGEFVPL
>SMa1500 Putative oxidoreductase/oxygenase
MLDTAKTRWHPVAASYDLPFRHIFHAQLLGREFAVWRADDGYVNIWENRC
LHRGVRLSIGINDGRELKCQYHGWRYSNRTAGCTYIPAHPADAPARTITN
RTFASVERYGLVWTAEEPQGDVPEVTGLAEGDLLTLRGIPVNAPADVVVA
ALTGYRFQPNGRLEGRAADMSLKASDGFSVALTAREEGAETLAVFFVQPV
DSNRSVIRGVLDSSPRGAERLTVLHHHNERLSKLREIVEREAQAAPQPAP
LEPVIERVSPELAELPEMTAHGRKATIRVTVARKWMAADGIAAFELRPIK
GLLPTFQPGAHIDVHMPNGLIRQYSITNGPGESDSYVIGVKLERESKGGS
RCMHETLRAGDVLAISEPRNNFPLRRDAEKTIFVAGGIGATPLIAMAQAL
KNQSLDFAFHYFAQNQAQLAFPEKTALLGEALKPQLGLDPEGTEAKLKDI
LSGYRPGMHVYLCGPGPMLEAARRIAAEVGWPETAVHFEYFKNTNTIDDS
SSFEVALARSCVTFKVPAGRTILDVMREVGIDMPSSCEQGACGTCLATVI
EGEPDHQDVYLNDAERKSGTKIMTCVSRAKSARLVLDL
>SMa0719 putative
MTGSSRGIGASIAQAYAAYGARVVLHGQRPGATAEIEKAIRAAGGDAVSI
HRELSPPSAGRDLIAAAEGAAGPLDILVINASAQINGALHDVTPEDFATQ
IDVNLRSTVEMLQAALPAMAERGWGRVVNIGSINQLRPKSIVSIYAATKA
AQHNLIQSLARDYASRGVLLNTLAPGLIDTDRNAARRDGDPEAWSNYVRT
LNWIGRAGRPDEMVGAALFLASDACSFMTGEAVVLSGGF
>SMa0180 conserved hypothetical protein
MVELIAGLIDVFADVPLTGNPLAVVQDADGLTDDQMRRIAGEFNQAETTF
LMRSTRADWKLRSFTASGAEVFGAGHNALGAWLWLAENGDLGSLTAARTF
QQEIGRDVLPIELESVGGRIHGRMRQVPLRLSDPLDDVAPLADALGLDPR
DILPEPPARPADTGATHLMVRVLNVDSVDRALPVADKLLAVLEKTPAEGC
YIYALDADAPDTAYARFFNPSVGLWEDAATGTAAGPLAAYLAATGNLTNN
ELVIEQGTKMGRRSILRIRLAPLPELSGAGIVVVKGVIRL
>SMa0168 methyltransferase-like protein
MAMASDVLARSFGKEAFGLDPQNYHTARPAYPELVWDALRNRAGLRRGIS
ILEIGAGTGLATERLLEDRPHRLLAVEPDRRLARFLRGRLDKEELEVVET
PFEKLKVPEKSFDLVVSATAFHWIDAAPALRRIHRLLRAGGTVALFWNVF
GDGVRPDPFHRATAHLFSGHRTSPSGGGTTKTPYGLNVGARLGELAEAGF
TADEPELIDWTLALDPPAVRRLYATYSNVTALPADERERLLSGLEKIAET
EFAGVVTRNMTTSVYTGRRE
>SMa1760 Hypothetical protein
MFQCCEIRHGTGPENGILRYLMHLLLRCFNYHVRQTVSAAMQPAPFANRR
YATIEDKTENCANGTSMTEPTPATKKTDSMSATWNVTSLADAAAAVMRVY
GAGGTVRRLSSERDETFLFTRSDGRDFILKIANPAEDAAALEFQDGALLH
LEAAAPVVPVPRLVRTKSGEQSHTLSTADGPRVMRLLTFLRGELQYRTPA
SEAQSRNVGRALAALGLGLEDYRGRPPAGKLMWDISHTLDLTAVVDHVAP
ERRAQAEAVLAEFERALPAITGLKRRQIIHNDFNPHNVLLDPSSPTTVVG
IIDFGDMVHAPLINDLAVALSYHLGTENWAARTGSFLEGFHSVRALEPGE
IEVLPVLTRARLAMSLIIAEWRSARFPENRDYIMRNHATAWRGLQNISDL
TPAGLKKLVPNLYEV
>SMa1819 Conserved hypothetical protein
MPTRRGFLGAASALALPNLFSPARAADPVSTSGDKPMSADLILHHGLVTT
LDRTNPNATAIAIRDSKFLAVGDDRDIMALAGPETKVIDLKGKRVLPGLI
DNHTHVVRGGLNYNIELRWDGVRSLADAMDMLKRQVAITPPPQWVRAVGG
FTEHQFVEKRLPTIDEINAVAPDTPVFLLHLYDRALLNGAALRAVGYTKD
TPNPPGGEIIRDASGNPTGMLLAKPNAAILYSTLAKGPKLPFEYQVNSTR
HFMRELNRLGVTGVIDAGGGYQNYPDDYAVIQKLADDGQMTVRLAYNLFT
QKPKEEKQDFLNWTSSVKYKQGDDYFRHNGAGEMLVFSAADFEDFRQPRP
DMPPEMEGDLEEVVRVLAENRWPWRMHATYDETISRALDVFEKVNQDMPL
EGLNWFFDHAETISERSIDRIAALGGGIATQHRMAYQGEYFVERYGHGAA
EATPPIAKMLEKGVHVSAGTDATRVASYNPWVSLSWMITGKTLGGMQLYP
RANCLDRETALRMWTENVTWFSNEEGKKGRIEKGQFADLIVPDKDFFACP
EDEISFITSELTMVGGKIVYGTGTFADFNENDVPPAMPDWSPVRMFGGYA
AWGEPEGAGKRSLRRTAMATCGCASNCNVHGHDHAGAWTSKLPISDVKGF
FGALGCSCWAV
>SMa0171 hypothetical protein
MNATKLLLILPLLAAFAYISLVSLTYLSQRALLYPGASATPAPERASWGQ
NASIQTPDGETLHGLYSRGEPGQPSVLFFLGNADRVSNYGFFAQALAARG
IGLLALSYRGYPGSSGTPNEHGLLIDGIAAFDWLAARSGNEIVVLGQSLG
SGVAVDTAGKRPAVAVILVSAYLSVLSLAQTYYPFFPVALLTKDPFRSDL
KIAGVRQPEAVYPRPARHHHPIVFGRSSVSDRSRAQADAHLRCRPQRSVG
CPHG
>SMa2365 probable ABC transporter, ATP-binding protein
MSEAILNICSVSKRFGDNLANDDISLSLGKGEIVALLGENGAGKTTLMSI
LFGHYVPDSGKVLVEGRELPPGKPRAAIRAGIGMVHQHFSLAPNLTVLEN
VMAGTERLWHLRSGTSAARRKLHRICQRFGLTVEPDARVGDLSVGEQQRV
EILKALYNDAHILVLDEPTAVLTNLEAERLFSTLKDMAREGLSLIFISHK
LDEVMAAANRIVVLRGGRKVAERLAKETNKAELAELMVGRRVARPVREPS
TPGEVVLKVADVSVSIDGVERLKSIDFSLRAGEVLGIIGVSGNGQTTLAH
LLSGTLRRDKGDLLLFGEPIGDLTVDDAVRAGIGRIPEDRNKEGAIGEMA
IWENAVLERLPRFSRYGLVDRPSGQAFAGQIIDAFDVRGGRPTTRTRLLS
GGNMQKLILGRNLMDRPRILLAAQPARGLDEGAVAAVHERLLEARRAGTA
VLLISEDLEEVMALADRIQAIVNGRLSPPIAADSASATKLGLMMAGEWNE
EHEVPHAF
>SMa0036 putative ABC transporter ATP-binding protein
MIRIENISKSNSHRILYIEASAALNRGEKIGLVGPNGAGKTTLFRMITGQ
ELPDEGQVAVEKGMTIGYFDQDVGEMAGRSAVAEVMEGAGPISAVAAELH
ELETAMSDPDRMDEMDAIVERYGEVQARYEELDGYALEGRAREVLAGLSF
SQEMMDGDVAKLSGGWKMRVALARILLMRPDVMLLDEPSNHLDLESLIWL
ENFLKGYDGALLMTSHDREFMNRIVTKIIEIDGGALTTYSGDYGFYDEQR
ALNARQQQAQFERQQAMLAKEIKFIERFKARASHASQVQSRVKKLEKIDR
VEPPRRRQTVAFEFLPAPRSGEDVVNLKSVHKTYGSRTIYDGLDFMVRRR
ERWCIMGINGAGKSTLLKLVTGTTNPDKGSVSLGASVKLGYFAQHSMDLL
DGESTILQWLEERFPKAGQAPLRALAGCFGFSGDDVEKRCRVLSGGEKAR
LVMAAMLFDPPNFLVLDEPTNHLDLDTKEMLIKALSAYQGTMLFVSHDRR
FLSALSNRVLELTPDGINQYGGGYSEYVERTGQEAPGLRG
>SMa0601 conserved hypothetical protein
MRPPAGGKAGVFSFTYGGAVIHAAASEAEKPQSTDSRCLAQLARIRQSAE
FDATGREHRFLQYVVEETLAGRGSRIKAYTVAVEVFGRDSTFDPQNDPIV
RIAASHLRRSLERYYLTAGKSDPIVIGIPKGGYLPTFSERGSPEDANAAE
SSMPTMQAQAAGPSDASPVAARPPPGPGPGPGPDDVRAQFERIVSSKEFH
GGGRGDALLRYIIEETLAGRAERIKGYSIAIEVFKRDKSFTQDDPVVRIE
AARLRRALERYYLVAGQNDPLRIEVPKGGYGPTFSWKEAVRAESDRTAVP
DASGPIVSARRRGRVLLTVGVVAVAAAAILGYWTIDRPGSVSSLRAGSVS
VPDGPTLVIAPFANLGEGPNAELYTDGVTEELLTALPRFKEIKVFGRETS
KSLPPDVDVSQVRDELGARYLLAGGVRVSGSRIRVTARLVDASDGAILWS
EDYDNDLQSRDLFAIQSDVASKVATAVAQPYGIIAQTDAANPPPDDLGAY
SCTLSFYDYRAELSAERHAKVSACLESAVARYPGYATAWAMLSIAHLDEE
RFKFNPKSGAPMAMERALQAARRAVQLDPGNTRGLQALMTALFFNGQYAE
AMRTGEQALAMNSNDTELMGELGTRVAMGGQWQRGAALLDRAIALNPGGA
GYYHGTRALAAHMLGDHPAAVAGIRQADLQKFPLFHAVASVIYAEAGMLH
EARRAGETFMRRRPDFVPNLQAEFMMRNLQPKDQLRLVSGLRKAGFSIPD
GVEASIAAAEAADAKSR
>SMa1368 Hypothetical protein
MELLISPQGIAAAVALILAGAVQGSTGFGFNMLAAPMLAIIDPAFVPGPM
LAMAIAVSAGGTVREWSDVNRQDLAFSLTGRLLAAGAAAFCLQLLSPDAF
AAVFGFGVLFAVALSLAGLRIDTTRSSLFLAGVLSGFMGTLTSIGAPPMA
MVYQNTGGARMRATLNAFFVVGGIISIGALFVAGSFGLSDLLLAATMLPF
AFLGFLLSGWGRRLVDRGHVKVIVLIVSAASALVLLLRAFS
>SMa0187 putative oxidoreductase
MYDAPFYKGSDKLKDKVALITGGDSGIGRSVAVLFAREGADVAIVHLDES
QDADDTKAAVEKEGRKCLVIKGDVKDASFCRKAVEKTVMQLGRLDILINN
AAFQVHTRDIEDLTDEHFDETLKTNLYGYFYMAKAAIPHLKNGSAIINTG
SVTGLTGSKELLDYSMTKGGIHAFTRALSGHLVPKGIRVNAVAPGPVWTP
LNPSDKEAEDVEKFGSQTPMKRAAQPEEIAPAYVFLASPQMSSYITGEIL
PIVGGY
>SMa2361 conserved hypothetical protein
MTRDLMMSRRNVLASGLVLGVSALAPAVRASAPIKVAGVHASPVENAWNS
VLHKALQDAAAEGVIEYVFSEGISGTDYPRAMREYAEQGAKLIIGEAYAV
EKQAREVAADYPETAFVLGSSGKESGDNFGVFGTWNHDGAYLAGMLAGKM
TKSNVVGSVGAMPIPEVNMLINAFAAGVKAVNPDAKHLVTFIGTFFDPPK
AREAGLAQIDAGADILFGERIGTADAAKERGLKSVGSLIDYTPRYPDTVF
ANALWGFRPILNAAIADVSAGKPVGKDYTAFGLLKEGGSDVAYVKGVAPA
DAEAAMEAKRAEIKSGAFEVPRITDEPK
>SMa0664 conserved hypothetical protein
MTIAGRGSVRAGSPTALPTNTEILVQLDRIRLSAEFDVPDRARKFLAYIV
GEAIAGRADRIKAYSIATEVFGRDSSFDAQTDPVVRIEAGRIRRALERYY
FVAGSNDPIVIKIPKGGYAPAFEKRGGAPYQLSSGQAANVQSRSMSLEQT
ALWVSVATVGLLTCGLLANAFFGSAATTIESLTKPGGTRPNIPKLMVMPF
EDLSQTPQSAMITRGLTDEVISNIAKFKEIVVVAGPAAPNPHSAEREYPA
FALEGRVRLDGDKLRLGIRLVQHSDGSVVWANTYDEVLQPRKIIELQQNA
AAAVASAIAQPYGIVFQANATHFMRSVPDDWQAYACTLAYYGYRGDLNPQ
THASVQECLQHATTQFPDYATAWALLSLTYVDELRFRYRLNRSTTVSLSH
AIEAAARAVELDPQNVRALQAEMLTLFFRGEVNAALTVGARAYAINPNDT
ELSGEYGFRLALSGQWRSGCDLVSKTVASNPGPVGYFEAALAVCCYIEHD
YVAAERWARSADLHANPVYHVILLAILGKLGKMDLARAEREWLEINVPGF
LENARNEVALRIHRPEDQKHFIEGLRQAGVPIPGK
>SMa1840 Hypothetical protein
MSLLKTIETNPSFAPRESSPLPERLISGNPAFKTWAQDVARGEMIQTGVW
EASPGETRSVKGETFEFCHILSGVVELTPENGKPVVYKSGDSFVMKPGFV
GIWKTIETVRKIYVTVM
>SMa1809 Probable NON-HEME HALOPEROXIDASE
MLYFLGKGYRVIAHDRRGHGRSTQVGDGHDMAHYAADVAALSTELDLRDA
IHIGHSTGGGEALAYVARHGAGRVAKLVMVGAVPPIMLKTEAYPGGLPIE
VFDGLRVQLAANRAQFFLDLPSGPFYGFNRPGAQVSTGVIQNWWRQGMMG
SAKAHYDGIKAFSETDFTEDLKRVEVPVLVMHGDDDQIVPIDSSARLAVK
LLKNGTLKVYKGYPHGMLTTHADVINADLLEFIKA
>SMa0247 hypothetical protein
MSGGFLVSFEQALSAESIQPADASSAMLVGRVWSKTAGGPCPVLISEGEV
FDLTPLAATISALLEIDGLVDALRDPSRFASLGSLDAFLRGEAGDLLAPA
DLQAVKAAGVTFADSMLERVIEEQAKGDPLRAQEIRGRLAPVLGDNLKGL
VAGSDKAAEVKKLLQELGLWSQYLEVGIGPDAEIFTKAQPMSSVGCGAYI
GIHPKSDWNNPEPEVVLAVTSKGKIVGATLGNDVNLRDFEGRSALLLSKA
KDNNASCSIGPFIRLFDGAFTIEDVKQAEVSLVVDGKEGFKMTGISPMSA
ISRSPEDLVSQLLNDNHQYPDGVVFFLGTMFAPVKDRRGTGLGFTHEIGD
RVEISTPRLGRLVNWVDHSDRCPKWSFGLGALMKNLAERGLLQAKREG
>SMa0335 putative
MSTSNILDGKVVIVTGASSGIGRAIAIRAAEHGAKAVIVSDVVEAPREGG
EPTASEIRKLGAESVFVKADVSRKVDNDALVAAAEEFGGVDVMVANAGIT
LKTDGAEVPEDDYRRLMSVNLDGPLFGAQAAARQMKALNKQGSIVLMASM
GGISGAGITVAYSTSKGGVVLMAKSLADALGPDGIRVNAVAPGTIDTELL
RTSPGIAQASEGFRQRTPLRRLGKPAEVGDAVAFLGSDLSSYVSGTALLV
DGGLLAVI
>SMa1503 Hypothetical protein
MSRTALCSDRVLLNDWHVVADLTNLSSTAPFHTRLLGVDLTIRCGGRYRR
QVVRSDGGEPVNSDSRYGFLWACLGKPERDIVFVPEANEADRYLVTGGSI
AVNVSGLRAVENFLDMGHFPFIHTGWLGEEPHTEVAPYKVELTDADEVVA
TECKFYQPVASPTAKEGFVVDYIYKVIRPYTVALYKSNPVHKARLDVITL
FVQPVDEERCIAHPFLCYLKEGVSEASIRSFMQLIFAQDKPILENQLPKR
LPLDPRAETPIRADAVSVYYRRWLRDRAVTFGAIPARM
>SMa2369 putative ABC transporter, permease
MNAVTDIFASAGLWAAVLRIATPLILGTLGALLCERSGVLNLGIEGIMTF
GAMIGWLAVYNGADLWTGILVAGLSGGIFGLLHAGLTVTLGLSQHVSGLG
VTLFASSFSYYVFRLLVPVAGTPPTIEPFQPIDVPALSSLPFLGPALFTQ
TPPTYVAILLALVLGYVLFRTPLGLAIRMTGENPHAAEAQGINPMAIRFG
SVIVGSALMGIGGAFLTLSAFNSFFPTMVQGRGWICIALVVFASWRPGRA
LVGALLFALFDGFQLRLQTRLSGVVPYQIFLMIPYLLSIAALALMARRAR
VPQALMQPYRRGER
>SMa1864 Putative ABC Transporter, ATP-binding protein
MGNLVEIRDLKVEATTDTGRRVEIIKGVSLDVAEGEIVALIGESGSGKTT
IALTLMGYARPGCRISGGSVLVAGNDLVTLTEKQRAKVRGTEVTYVPQSA
AAAFNPAATIMDQVIEVTRIHGLMAAAEARARAVELFRALSLPEPETIGS
RYPHQVSGGQLQRLSAAMALISDPKLVIFDEPTTALDVTTQIEVLRAFKS
VMKKGGIAGVYVSHDLAVVAQIADHIVVLKGGEVQEVGTTEEILSSAKHP
YTRELLSAFEPKPREAADAAERAPAPLLKIENLVAGYGASKTDGLPLVRA
VEDVSLKVEKGRNLGIIGESGCGKSTLARAIAGILPAAVGKIVFDGKELG
RSARERTRDQLREMQIVFQYADTALNPAKSVEDILDRPLVFYHGMNARAR
SLRIDELLDMVRLPRNLRHRRPGELSGGQKQRVNFARALAADPKLILCDE
ITSALDTVVAAAVIELLKELQRELGLSYIFISHDLSLVEAICDEIVVMYG
GKKVEDITPAKINAPHHPYSQLLFSSVPKLDPSWLDGLEQDPELVRAYCR
R
>SMa0341 hypothetical protein
MAKGLLGGGEMKFYPQDALTQAGGRFSQSDAPFSAHVVTDRELITGQNPA
SAPAVAQELLKRLK
>SMa0059 putative
MSDFNGKSIVVTGGSLGMGLACAHRFAAGGGKVTIVANDKASVDEAVTSI
GDNAAGFVGDVRSKADMNAAVQAAVSRHGGVDILACCAGIQRYGTVVDTA
DEVWDDVLDINLKGIFLASKFAIPEMRKRGGGAIVAISSVQAYASQTGVA
AYTASKGAINALVRAMALDHAGDNITVNAVCPASIDTPMLRWAADLWKGE
GTVEATLETWGKGHPLGRVGKPSEVAELVAFLASEKARFITGADIKIDGG
VLSKLGIVIPD
>SMa0357 hypothetical protein
MVDARPVGDYPNLRQLAFLHVSHDWRGKKLALRLYQLCKDTVVGSGAEGF
YISSTPTRRTVEFYLRQGAKLMARPDTTLVSIEPDDIHLAHWF
>SMa2129 conserved hypothetical protein
MKQEEYLMSTSITRRTAILGSMGAFLLSALDRTFAAGPTSLKIALVLESR
TDIGWTRTLLDALEQVKQARPDGLDISWEYTDPLWNDDAENAMRFYAEGG
EYDIIWAHGRYSDQVKKLSAEFPDIMFVVTGSGNLPLGANQYWLYKRLHE
PSYLLGMLAGRTTKSGVVGLVGTFAADDVNDQINAFLDGARSVRPDVRHR
VSFIGSWSDNALAAEHANVQIASGADVVFMLTDNFKPCQEHRIICFANIN
DQSKLAPDAIASSAIIDWQPDIKWIISEWLKHKAGAPYDGNTEPKWFSMS
QGGVDIAPYHDFDAKLPVAVKEELAATRQKIISGEFVVPLNTAEVK
>SMa1452 Probable
MQLKSRVFIVTGASSGLGAAVTRMLAQEGATVLGLDLKPPAGEEPAAELG
AAVRFRNADVTNEADATAALAFAKQEFGHVHGLVNCAGTAPGEKILGRSG
PHALDSFARTVAVNLIGTFNMIRLAAEVMSQGEPDADGERGVIVNTASIA
AFDGQIGQAAYAASKGGVAALTLPAARELARFGIRVVTIAPGIFDTPMMA
GMPQDVQDALAASVPFPPRLGRAEEYAALVKHICENTMLNGEVIRLDGAL
RMAPR
>SMa1717 putative integral membrane transporter
MMFKMGRNTMNYQSKVAGDPASPEKPGPGGAIDRLFEVTRSGSTIRTEII
AALTTFLAASYVIVVNPAILQNAGIPFSGGVTATVLVSFIGSCAMGLYAR
SPILVAPGMGINALFAYTMVMGAKVPLEIALGCVFWAGVLFTILAILNLR
TAVIEAVPKDLRYGIACGIGLFIALIGLENAKFIVASPDTIVALTQFTPV
TLTFIAGFIITAALVVRRIPGAMMTGMIITTVLAIPIGRLWGDGSAFAGG
TPDVQTLVNWSGLFAAPDFSFVGRIDLLGALQVAYAPFIFVFLFTNFVEA
LSTFLGLAEAANLKDESGMPRNIKESMHVDAVAALISAPLGTSPATVYLE
SGAGIAQGGRTGLVAFIAGLLFLPFLFLSPLLSLVPTIATAPVLILTGLF
MSAPMGQINWADMEDAIPAFLAIVLIPLTFSITLGLSLAIIAFVMMKLAL
GKVSEVKPVMWFVAVLAAMLVMQVQ
>SMa1126 Conserved hypothetical protein
MGWSLKLGTIAGTEIRIHMTFVLLLVWIWFTHYQIGGAPAAWEGVAFILS
VFVCVVLHEFGHIAAARRFGIKTPDITLLPIGGVARLERNPSEPREELLI
AVAGPLVNVVIAALLIAVIGGVAGLEQLVRPQDPQIDFFVRLAGVNIFLV
LFNMIPAFPMDGGRVLRAILAWRWSLERATRVAATIGQGTAFVMGVAGLF
YSPLLILIAIFVYLAAESEAQSSELQAISVTVGDVMLTEFGVLQSDARLS
EAAELLLATSQNEFPVVDGEGQFAGLLTRDGIIGAMKEGGPNALVGTVMR
TDIPWVYEETALGDSLRVMQTTGAPAAAVVSRSQHPIGIMNYETIGEMLM
LRAAVHDFRFGMLRRSRAGSHG
>SMa0478 probable NAD-dependent formate dehydrogenase
MEMAKVACVLYDDPVDGYPTAYARDGLPTLERYPGGQTLPTPKAIDFEPG
ALLGSVSGELGLRKFLEGQGHTLVVTSDKDGPDSVFERELVDAEIVISQP
FWPAYLTAERIVKAARLKLAITAGIGSDHVDLQAAIDRGITVAEVTYCNS
ISVSEHVVMMILSLARNYIPSYQWVVKGGWNVADCVARSYDIEGMDIGTV
GAGRIGTAVLRRLKPFDVKLHYTDRHRLPDEVAKELGVTFHQTAAEMVPV
CDVVTINAPLHPETENLFNEAMIGKMKRGAYLVNTARGKICNRDAVARAL
ESGQLAGYAGDVWFPQPAPKDHPWRSMPHHGMTPHISGSSLSAQARYAAG
TREILECWFEGRPIREEYLIVSGGKLAGAGAHSYSAGDATRGSEEAAHFK
T
>SMa2317 putative aminoglycoside adenylyltransferase (C-term)
MLTLARMWRTSTTGDFITKDAAATWAANQMPDQEAGTLIHAREAYLGKVR
DDWGNRQSASERTATFLRQRVLELL
>SMa0085 putative
MEFRNVKPDLLLVEPMMPFVMDELQRNYSVHRLYQAADRPALEAALPSIR
AVATGGGAGLSNEWMEKLPSLGIIAINGVGTDKVDLARARRRNIDVTTTP
GVLADDVADLGIALMLAVLRRVGDGDRLVREGRWAAGEQLPLGHSPKGKR
IGVLGLGQIGRALASRAEAFGMSVRYWNRSTLSGVDWIAHQSPVDLARDS
DVLAVCVAASAATQNIVDASLLQALGPEGIVVNVARGNVVDEDALIEALK
SGTIAGAGLDVFVNEPAIRSEFHTTPNTVLMPHQGSATVETRMAMGKLVL
ANLAAHFAGEKAPNTVN
>SMa0308 conserved hypothetical protein
MPSRRSILKSALTGLIVAPFVAPSVTFAKAPFAVVQAPGYYRLKIGSVEV
TALSDGTVALPLAKLYTNTSEHDAQNALKDAFLPDMVPTSVNAFLVNTGE
RLVLIDAGTGGYLGASLGKLVSNIEASGYKVGDIDDVILTHIHTDHSGGL
MSNGKRTFPNATLRVNEREAKFWLSSANAMTATGTVKQHFGEADQCVTPY
VKAGKFETFADNAAPVPGLGSILYAGHTPGHSAITLESESQKIVFWGDIT
HGDILQFDEPGVAIEFDIDQKAAVAARNTAFKQAVEGKYLVAGAHIAFPG
IGHVREDSKNYDWLPINYA
>SMa1779 Hypothetical protein
MRSDSRPHTYALGHHGFSGEWASLGFVVHDPYGSERMDLALTGVDGRSDR
TSSRTSDGVDRVAADIVEAGGAAIGIVCEVGELDQITAAVDQVVAAYGRI
DILVNNAAGRSAVLSTILDLSIEQLQRNFDTGAIAYIRFMQSGGLRLL
>SMa0041 probable quinone oxidoreductase
MPEPGPHEVRIRVKALGLNRAEALLRSGAYIETATFPSGLGLEAAGFVEK
VGPGVQGFIPGDPVSVLPPKSMIRWPAYGELAIFPAALLVRHPPSLSFEE
AAAVWMQYLTAYGGLVDIGGLRRGDFVAITAASSSVGLAAIQIANMVGAI
PVAVTRTSAKRQGLLEAGAAHVIASMEEDLEAQLKRVSGQHGIRVVFDPV
GGPIFEPLAAAMAWGGILVEYGGLSPEKTPFPLFAVLSKSLTLRGYLVHE
LLADPGRLERAKAFILDGLVSGALRPIIARAFPFDQIVEAHRFLESNEQF
GKIVVTV
>SMa0326 putative
MSTQNPKVWLVTGCSTGFGRYIAEHLLEVGEKVVVTARKADKIADLEQKG
DALILPLDVIDRDQCQKVVDAAEAHFGRIDVLINNAGIGFFGAIEETDES
NARRLFDVNFFGTANTIHSVLPHMRARRSGTIVNLTSIGGLVGYTGVGYY
CATKFAVEGLSDTLRNEVAPLGINVMTVEPSAFRTEWAGSSNEVSASIED
YEATAGEARRAYHTSVGKQAGDPARAAKAIREAVLAQQPPHHLPLGNDAA
DAALKKAEDLKANVLAWEALSRSADFPAN
>SMa1168 hypothetical protein
MTRHIAIVQGHPDPARHHLLNAMADAYAEAATAAGHEVRRIEVARLEFPL
LRTQEDFETGALPPGLEQAREDMRWAEHWVFLFPLWHGTMPALLKGFLEH
IFRPGFAMEYKKGGFPKRLLAGRSARIIVTMGMPVLLYRWYFGAYGVRSF
ERSMLGFAGIKPIRENFYGLSFADEKKRSRWLDEMRDYGRRAR
>SMa1641 probable NreB protein
MLQVLANRTYRRLFLAQVIALVGTGLATVALGLLAFDLAGADAGAVLGTA
LAIKMIAYVGVAPVAAAFAEQLPRRSMLVCLDLVRAAVAVFLPFVTEVWQ
VYVLIFVLQSASAAFTPTFQATIPEVLPDEKEYTRALSLSRLAYDLESVA
SPMLAAALLTVVSFHSLFAGTVVGFLASAALVVSVVLPSPKASERRGIYD
RTTRGLRIYLATPRLRGLLALNLAVSAAGSMVIVNTVVLVQAEFGLAQRD
TAMALAAFGVGSMIAALLLPRLLDNMPDRTAMLAGAAVLVAGLFIGVFVP
RFALLLPLWLAIGVGYSLTQTPSGRLLRRSAHPEDRPALFAAQFALSHAC
WLITYPLAGWLGAKVGLSTTFAMLGVIAATAILIATRIWPVHDPEEIEHV
HDALPVDHPHLVGATRVGNGHRHIHLFVIDSHHPDWPTEQ
>SMa1732 hypothetical protein
MASKQFACKASEVPADAAKIIKLGNLSLGIFRVGDGYHALLNVCPHKGAA
LCQGPVCGTTKQTDKAEFVYERAGELVRCAWHGWEFDIRTGEFLVDPRVK
ARTFPVSVESEDIFVHV
>SMa0551 hydrolase, putative
MGRRVGMALIGPGFIDLDALSDIDTGVLGLDHQPGWKKGRVWPRDYVEEG
PVEMLTPEELAFQKRYAFAHLIRNGITTALPIASLFYRAWNETPEEFASA
AESAADLGLRVYLGPAFRAGHSVIEADGTLTVEIDAARGRAGLDAAIAFC
AAHDNTHGGLVRAMLAPDRVEYWTADLLKRTAGVARDLGVPVRLHCCQST
FEVETIRRSFGTGSAEWRHDIGFLSERALLPHGTHTDREGLRIIADSGAT
VVHCPLVMARHGAALNHFGDLRRAGLRLGMGTDTWPPDMILNMQIGLMLG
RVMGGELDSPSSADLYDVATLGGADARGRPDLGRLQAGAAADIVVIDLAA
HHLGQVRDPIAGLVASANGRDVRTVFIAGRRVMSEGTIPGFDFAEAHARA
GAQFERLVAQYPRRTWRHPAVQSIFPPSYEVTRT
>SMa1727 putative hydrolase
MTTTILKLALAVGLLFAAGPGFATEIPVPTQTEWAAAKKTVDLPNGIKLA
YSEMGNVEGKPLLLIHGYTDNSRSWSLVAPYLKNHHIYAIDLRGHGKSSA
PECCYTYLDFANDAFLFLEAMKIEQADVVGHSLGSLAVQMLAAQHPEKVR
KVVLISSTLNTGGGPGTWLWDNIKPLQPPIDPNGKFMTDWYWNPNPVDER
FIKPEREESAAVPIHVWKGVLWGTTTGDLGKISSLIKAPVMIFWGDQDQL
MNAPQQAKLKAAFPKARFETFPGAGHNMFWERPEKAAELINSFLSE
>SMa1501 Hypothetical protein
MNDMSPTKCQDQVVLDLWHPLAALEEMPARTVQDTVLLEERISYVSDGEG
KAAAWHSRPELPAGSRVDIDTLDGGLPVKMAYGYIWTSLGTPPAELFAIP
EYAESDRRRLNAASIGVNVSAPRAIENFLDMGHFPYVHTDILGAEPHTEV
KEYDVELSVERDEIVATRCRFFQPMASTASTGGADVEYIYRVPHPYCSVL
YKSSPVDESRLDVIAVFLQPVDQEHVRAHMMLCVLDEENEDKVIKRFQQT
IFGQDKPILENQFPKRLPLDPRAETPIRADKSAIAYRRWLSQKGVTYGVI
PAAT
>SMa0380 conserved hypothetical protein
MATRRSFLGGASSLAFANFFSPAKAADPNQTGVDTMHPDLILHNGRVTTL
DRTNPNATAIAIKDGLFLEVGSDSEVMALAGSGTKIVDLKGKGVLPGLID
NHTHVVRGGLNYNMELRWDGVRSLADAMDMLKRQVAITPAPQWVRVVGGF
TEHQFAEKRLPTIEEINAVAPDTPVFLLHLYDRALLNGAALRAVGYTRDT
PNPPGGEITRDANGNPTGLLLAKPNAGILYSTLAKGPKLPLDYQVNSTRH
FMRELNRLGVTGVIDAGGGFQNYPDDYEVIQKLSDENQMTVRLAYNLFTQ
KPKEEKQDFLNWTQSVKYKQGNDYFRHNGAGEMLVFSAADFEDFRQPRPE
MAPEMEGELEEVLRVLAENRWPWRLHATYDETISRALDVFEKVNKDIPLE
GLNWFFDHAETISDRSIDRIAALGGGIATQHRMAYQGEYFAERYGHGVAE
ATPPIRRMLDKGVNVSAGTDATRVASYNPWVSLSWMVTGKTVGGMQLYPR
ANCLDRETALRMWTEKVTWFSNEEGKKGRIEKGQFADLVVPDKDFFSCAE
DEISFIVSELTMVGGKIVYGAGDFKTLDENEIPPAMPDWSPVRKFGGYAA
WGEPERAGARSLRRTAISTCGCASDCGVHGHDHAGAWTSKLPIADLKGFF
GALGCSCWAV
>SMa1831 Hypothetical protein
MTISPYQTLIKAPGPNAPIVFAFHGMGGDEYQFAELTRQILPDAGVISPR
GDVSEFGALRFFRRTSEGSYDMEDLILRTEKMARFIAAHKAANPGRPIYG
LGYSNGANILASVLFKNAGLFDRAALLHPLIPWTPPNSEQLKDRPILITA
GQRDPICPLPLSERLADYFAAQKARVEACYHSGGHEIRPEELEALHAFLT
>SMa0101 conserved hypothetical protein
MAFGIKADLILINGRIWRGREEGISEALAVWQGKILATGSDTDILGLKGP
RTEVIDLEGRFATPGLIDNHLHLIATGMAMGWVDATPASAPTLAALMGRI
SDRAATTPKGGWVRARGYDQVKLDTGRHPTRDDLDRVAPDHPVLLTRACG
HVSIANSRALELAGITEATAVPEGGVIGVTEGRLNGFLAENAQNLVKAAM
PSATTEDLIDGIERAGRYLLSFGITSCMDAAVGQVSGFAEIQAYEMAKLS
GRLPVRVWLTLLGDPGVSIVEDCWRAGLLSGAGDDMLRVGGVKVFLDGSA
GGRTAWMTRPYRGEPDNIGVQMLPDAEVEAVVKACHDRGYQMVCHAIGDG
AIEQLITAYEKALAANPDPDRRHRVEHCGFSTPDQNARMKAAGILPAPQM
AFIHDFGDSYISVLGEERGRLSYPIGTWMRMGLKPSTGSDSPVCSPDPFP
NLHAMLTRQTGKGTVMEASERLSRQEALQTYTEYGAYSQKAEGVKGRLVP
GQWADIAVFDNDLLAAPPETILSDTSCVLTLLAGRVVHDAR
>SMa1734 conserved hypothetical protein
MGRENILFIVDSDCHNYWCSATVLEPYMDGFFKDMFVRGEKTGPRGAFPH
GHRPWFHPEGFSRHDVNPVEEDDNYAIMKEKHLDKYNIDVAILTGDEPIE
ASTLANAHYANALCRAYNDYMIDYWLPKDSRFWGSIIVAPQDPKLAAEEI
RRLGSHPRIVQVLVSHGAQRPYGDPFYHPIYEACAEMGLPFAMHLGGQGG
VNSTPIGAGPSTFFWETHAILPQSAMTHMASLIAQGVFEKWPSLKVVIIE
CGVAWVPSVLWRLDANYKALRKETPWLKRLPSEYFKTNIRMSTQPLEQPE
NVQHLWATLEAMDGENTLLFASDYPHWDYDDVTKLHIPPAWREKVLGLNA
LDVYRRIPRPAAIAAE
>SMa1367 Putative dehydrogenase
MRLFLHTGTNAEGLRTVAKEVSARGAEVATELGDLSDPTVPGHLVQAARA
AFGGLDQIVSNAGRAQRSSFGQLTDADLQTAFDMMPMAFFRLVDAALPDL
RTSMQGRVVAVSSFVAHGFGTNGMHFPASGAAKAALEALAKSLAAQLAPV
GVTVNCVAPGFTRKDTGGHAATSSAAMESARAVTPNGRLGEPIDVAELVA
FLLSPGARHITGQVMHVDGGLLLA
>SMa1255 Conserved hypothetical protein
MIRAEAMSEPENATVCDSIRDALRMIIDPELGRNIVDLGLIYDVSVEDGG
IAHVTMTTTTKGCPASEYLKEAVRNCVWYVPGVEYAEVRLTYEPAWTPDM
MAG
>SMa0389 putative
MAEAGAHVAVTARTVEGLAETRALIEKTGRRAVALAQDVRDVEACASVTR
AAAEGLGGLDILVNNAGFENVRPSFDVDEALWDTIVSTNLKGAFFCAQAA
GRIMADANGGAIVNLCSLTSYVGIPTAVPYGASKSGLLGVTRALATEWAA
HNIRVNAIAPGYFRTAMTAGFYEDEDWQSRMLEKIPQRRFGKESDIGGVA
VFLCSDAAAYITGHCIPADGGYLASI
>SMa1301 putative transmembrane transport protein
MAAACGLIAANLYYTQPLAGPIAVDIGLPAEATGLIVTLTQIGYGLGLLL
LVPLGDLVENRRLIVTMIGVVTLALIAAGLSTTPGPFLTASLAIGVGSVA
VQMIVPFAANLAPDAARGRVVGNIMSGLMVGIMMARPISGLIAGLSSWHA
VFYISAIVMVGLGTLLWVQLPIRMPTARLSYGQLLKSMAQLLAAQPVLQR
RAAYQAFQFAAFSLFWTVTPLYLAGPRFGLGHNGIALFALAGVAGAIASP
IAGRLADKGLVRPATAFGLLSVGVAFLVTQIASEGSAIALTLLTLAAILL
DFGVTMTLVTGQRSIYELGAELRSRLNGLFMAIFFTGGAIGSALGAWAFA
SGGWWFASMIGFALPATAVAIFLTEKHGQERSLQH
>SMa1156 probable alcohol
MPKMKAAIFVAPGRIVLDEKQIPDVGPLDALMRITTTTICGTDVHILRGE
YPVARGLTIGHEPVGVIEKLGSAVRGYSEGQRVIAGAICTSGHSNAALCG
CHAQDGPGTKHGWKGMGGWKFGNTIDGSQAEYVLVPDAMANLAPVPDGLA
DEQVLMCPDIMSTGFSGAESGAVRVGDAVAVFAQGPIGLCATAGARLIGA
TTIIAVESVPARMEMARRMGADDVVDFTVSDPTAEIMRLTDGRGVDVAIE
ALGRQETFEGALRVLRPGGTLSSLGVYSGDLRIPLDGFLAGLGDHTIRTT
LCPGGKERMRRLMEVIASGRVDTRPLVTHRFKLDQIEEAYDLFANQRDGV
LKVAINP
>SMa1351 Conserved hypothetical protein
MPCGLRAARLGGEMRIKTVQAWWVRIPIEANRQHQSDFGRLTTFDAAILR
IETDDGIVGWGEGKNAAGSAGSYGTLVHMLNYEVGPRLVGRDAADISAVW
EMLYNGVRHERAAMSGHAMPELSRRGLSIAAISAVDIALWDILGKSLGVP
VWKLLGGRKADRLPAYASGGWESAEKIGGQLQSYLASGGFKAVKMRVGAM
DGAPYVSAARVRAARKALGPSVDIMVDAHGTYTVADAKRFIQLVRDCDLA
WFEEPVIADDKAGMAEVRAAGNVPIATGESEATRFAFRDLAVLRSADIFQ
PDPAFCGGITEAMRIGAIASAFNLRLAPHLWAGAPCFFSGLHICAASPAS
FVVEYSVGANPMIHDLVEETVAVKDGMLEIPDKPGLGFTINERVLETHAQ
RL
>SMa2385 probable ABC transporter, ATP-binding protein
MPSITLSALSWSKPDGEHVFSDLDLAFGPERTGLVGRNGIGKTSVLNIIA
GTLRPSSGTVAIQGRVALARQILRAGADETIADVFGATQAVAVLRRAEKG
DASVEELETADWTVEERIVSALARLGLEARADTLLNQLSGGQRTRAVLAA
AIFSEPDFLLLDEPTNNLDRDGRRAVIGLLSGWRSGAIVVSHDRELLEEM
DAIIELTSLGTKRYGGGWSAYQAARAVELEAAQQSLTLARKTADEVDRKA
RALAERLDKRDASGTRKAAKGDMPRILVGRRKSNAEESRGKSVELAERRR
AGALDAVTAAKARIEVLQPFSIRLPRTELPAGRQVLAFDGVTAGYDPARP
IIRDLSFSLVGPRRVSVTGPNGSGKTSLLKVVTGELPPFKGTVSVNVPFT
LLDQSVSILERGETILENFKRLNPGASDNACRAALASFRFRADAALQRVE
ALSGGQVLRAGLACALGGSDPPSLLILDEPTNHLDIDSIEAVEAGLLSYD
GALVVVSHDETFLANIGIGTRVELSTSRG
>SMa1757 Putative Short Chain
MYQASKESAPMTEECRYMLLTGASRGIGHATVKLFQSKGWRILTVSRQPF
AEECAWPSARESHIQADLADLTQIDRLAATVRERLPNGRLHALVNNAGIS
PKGPGKNRLGVLDTDADVWTQVLNVNLVSTALIARALMSELEAAKGSIVN
VTSIAGSRVHPFAGVAYAASKAALASLTREMAHEFGQRGVRANAIAPGEI
ETSILSPGTDELVAAEVPMRRLGEPREVAETIFFLCTEPSSYINGAEIHI
NGGQHV
>SMa2273 hypothetical protein
MFVWTLEYSEDAERDFELIFDHLFDAYVELGDSPDEAVERTAERIRKLRV
EIDRLVDTPYIGTLRPDIHSGIRFLRRDKAAIWFLRAEHSRTIIVAAIFY
GAQDHIRNMLARMLAG
>SMa0499 hypothetical protein
MHKTISREHILEDLPEIAEIQSDDLREKVVDAWVFALERSSFDRVVDIPG
EGSPNVFALKRGTQDAHLRGVTRLALAIYDEFARTYPEARVDRDIILAGG
LCHDIGKTWEFDPINLKRWRERGDRYGEPSFRHSAYGTHVCLSVGLPEEI
GHICMGHSLEGAHIGHSTECYIIRQADHAWWHVAAALDLCHPETIGFAGP
NLRVRPIGMQ
>SMa2053 MocE-like protein
MTWVSACKLDDIEQEGAIRFDHGGRTYAIYRGPDDSVYCTAGLCTHEAIH
LADGLVMDFEVECPKHSGAFDYRTGEAIRLPACENLKTYPAEVVDGEVRV
ALA
>SMa2032 Putative non-heme chloroperoxidase
MAAGHDGEAPRRTTMASRPSRKPTKTDDLKAITVPTPVLHGEDDQVVPIA
ASALKAVKLLPNGSLKTYPGFSHGMLTVNADVLNADLLEFITNRSDCGA
>SMa0473 conserved hypothetical protein
MRLVWARYALDDRDTIFSYIERENPRAAVHVDEEIVSAVRRLLDFPESGR
PGRIAGTRELVIPRTPYIAAYMVMEDRIRILRVLHGAQKWPSELDDG
>SMa2031 Putative non-heme chloroperoxidase
MAKAVLVAAIPPLMLKNDDDPEGTPMEVFDGFRTALAGNRAQFFRDVPSG
PFSGFNREGAAVHEGVLQNWWRQGMMGKRQGALRWRQGLLGNRPRPMTSR
RSPCQRRCCTVRTTRSFRSPPRL
>SMa1223 Conserved hypothetical protein (ORF151)
MFVRVMSREECQGVVAAGDLARLACCRDDQPYIVPITYAHSGNRLYCFSM
PGQKIDWMRSNPKVSLQIAEFASNRQWKSVVVTGRYQELPATQGCHHERI
HAWSLLEKKPNWWEPGGLKPVPQEISGASAHIFFCVEMDEMTGRAACAGE
L
>SMa1937 putative transmembrane transport protein
MIVSQFTSGGSAWPRISLVVGAGVVSAFQVGKAPAALAAVQGDLALSLAA
ASWLISAFAILGALAGAPIGLAVDRIGARTMASLGLLLQAAGSALGALSP
GFTALLATRVIEGLGFLCLVVAAPALIAGLAPIQIRDRAMALWASFMPVG
LTVIMLAAPLLSIVTWRGFWFLNASILVSYAMLLRWGLHPPPNHPRPYRK
IHQDIGEALVSPGPWVLGGLFTAFSAIFFAVFGLLPPLLSQRLGISNETA
SMLSALAIAASGVGNLVCGQLLARGFQPARLLNFSFGIMALCGIGIFSHT
LWAIASYALCVVFSFAGGLIPVVIFDSAPRQAPGAELVGVTIGFAMQGNN
LGLIIGPAAASGLAGAFGWPMVSVAVVGIAFVAALLVLPFNRRQLTEAAS
GQSPLTRAGGRF
>SMa0548 conserved hypothetical protein
MPGNLHVRNLDDDLISKLKIRAARHGRSAEAEHREILRQALASEGGPDFE
ELAADLRKLTASRKQTPSEVLLRESRDER
>SMa0751 putative aromatic-ring hydroxylating dioxygenase, alpha-subunit
MTANPTSIHQRLDRRLSGFSLEQPFYTSPEVYALDLQHIFYKQWLYAVPV
CQLAKAGSYTTLRVGAYEVVIVRSRDGEVRAFHNSCRHRGSLICKARQGQ
VAKLVCPYHQWTYELDGKLIWANDMGPDFDASKYGLKPVNLRNLDGLIYI
CLSDTPPDFQTFAQLARPYLEVHDLKDAKVAFTSTIIEKGNWKLVWENNR
ECYHCSSNHPALCRSFPLDPEVAGVQADGGVSKKLQAHFDRCEAAGTPAQ
FVLAGDGQYRLARMPLQEKALSYTMDGKAAVSRHLGRVAPPDAGTLLMFH
YPSTWNHFLPDHSLTFRVMPISPTETEVTTTWLVHKDAVEGVDYDLKRLT
EVWIATNDEDREIVETNQQGILSPAYVPGPYSPGQESGVMQFVDWYAASL
ERALAPRQVAAE
>SMa1589 hypothetical protein
MNESCPVTKARHDSLASVGTNVAFPVRVPPIQFYELAFQRKETVLIRQLA
DIYRAYLDCLNRQAWDELGHFVDNEIQHNGRLLRISGYREMLVKDFEDIP
NLQFNIQLLVCEPPRLAARLSFNCSPKGEFLGLSVNGQQVSFTENVFYEF
VGSKIVSVWSVIDKSAIEAQLS
>SMa1851 Putative dehalogenase
MSIFRPKYITFDCYGTLTNFQMAEAARDLYSEQLDEARMAEFIKNFAAYR
LDEILGDWKPYAEVVHNSLERTCKRNGIEFREEAARMVYERVPTWGPHAD
VPAGLARVAKEIPLVILSNAMNSQIMSNVEKLGAPFHAVYTAEQAQAYKP
RFKAFEYMLDMLGCGPEDILHCSSSFRYDLMSAHDLGIKNKVWVNRGHEP
ANSYYGYVEIADISGLPGVVGL
>SMa2367 putative ABC transporter, permease
MRFERREHRSFALVIATPVLAILCALALAGLLIAIAGAPVMEAYWRILVG
AFGSRLSATETLTRASPLILTGLAAAVAFRAKLWNIGAEGQFYLGAIAVA
AASSHLFGGLPPPLQVPLLLVAGAAAGIVLLLVPLWLRLRFSVDEVVTTL
LLNFVAVLFVSMLIDGPLKDPMGFGWPQSQPVADAAVLPKLFARSRLHVG
LMIALVFAVAVHLVQSRTVFGMQSRAAGLNPAGAVFAGVPLGRTLVTVAC
ISGGLAGLAGAIEVMGVQGYVTTDLSPGYGYSGIVVAMLANLHPLGVVLA
ALFTAVMFVGADGMSRSMGIPSYIADVTVALSLISMLTGVFFTQYRIRR
>SMa0665 conserved hypothetical protein
MALCDRIERTSLKNQADELLFPALQARCAMDDNPSSQRNPDPVNPAARPS
IEPRITEIARIGLVGLFAYWSFTLIAPFAIILIWAAILAVALYPAYAALS
AILGQRPRVAALVITMLGLLVIVAPLAAIAFSFAEGLQVVLARLNDRSLL
ISAPPDSIRSFPLIGERIYSVWSMASDNLEAVLQQIKPALLQAGSKALGK
IASIGADLLSFVVSVLVAGFLFGSGARLANSAQGFASRMGGDRGVGFLQL
AAATIRNVARGVIGVALLQAFLCILILSLFKVPAPGAIAFVVLILCIIQI
GPALALLPVIVWAWTSMEFGMAALFTILLIPLLIIDNVMKPILVARGLST
PTLVILLGVLGGTLSYGLIGLFLGPIVLSVFHSLLLIWMNTDTVGSEGLR
LGNPDYRIRREKT
>SMa0189 hypothetical protein
MRTDLVNRPQDGSAGRSALLVASAVNAVAAIYHIIGGTPEVMYPVYSANL
PPSSAGVLDILWYQMAALIVGSAVATLVAAFRSDWRWPVAWIIGGHFLVV
SGICLFFTFVWFGNPWGLIQWAIFGPVGLIIFWAAARPAERAGAPTL
>SMa0352 conserved hypothetical protein
MSHISTADIEQVILPSSRDLGGFSVRRALPAPMRQMVGPFIFLDSFGPVR
FGEGEGIDTRPHPHIGLSTLTYLLEGELTHRDSERYVQAIRPGEVNVMVA
GAGIVHSERTPEHQRATGGKLAGLQSWIALPKKSEETAPLFQHLDAGSLP
TVSGEGIGMKLLAGNLHGRQSPATVFSDLFAAEVHLEAGARYRIDGEHVE
RAIFVVAGALEIVGQDGNFGQDRLLVFKPGSEIVVKATGPARFLAFGGEP
LPEKRFIRWNFVATDQERIRHAADLWRERGFPGVPDDDEFIPLPENFR
>SMa1835 Hypothetical protein
MHVATIESANLEQLHALSVSVGWPLRSEDLQFLRDCGRGYVAHDDIGRLT
GSAMWFPHADDFATIGMVITSPRLQSNGTARWLMEHVLWDCCGRNLRLNA
TRASRRLYHSLDFQPMRTIYQCQGIVRQADSTATTEQPPIRRLEGEDLAA
VAELDAGAFGVSRTALIGKLFAQSVGYGLFRGGRLEAFALCRPFGRGHVI
GPVVADSDADALAVIRRHVAAHENQFLRLDTPVETGPFATFLSQSGLAVF
DTVLAMSRRGKGCADVVQGSNLYGLASHALG
>SMa0349 conserved hypothetical protein
MTTYAIIGAGAIGSALAERFTAAQIPAIIANSRGPASLSSVTDRFGASVK
AVELKDALQADVVILAVPYDSIADIVTQVSDWGGQIVVDASNAIDFPAFK
PRDLGGRLSTEIVSELVPGAKVVKAFNTLPAAVLAADPDKGTGSRVLFLS
GNHSDANRQVAELISSLGFAPVDLGTLAASGPIQQFGRPLVALNLLKD
>SMa2165 probable short chain
MTRFTGKNVLITGGTSGIGLAGGRRIIAEGGMVILTGMNEDRLEATRKEF
GDKAVVVRNDAADPAASADLSEIVKSAGGIDGLWLNAAFAALGPPEEIHA
WDFDRMMATNVRGPMLQLAKLSPLLRPGTSIVLTSSSSTYEGASATSLYA
ATKGAVLAMSRSWASAMAPRGIRVNVLVPGPIESNLRSFLPDEARHGFER
FVLNQVPLGRVGTADEAAAVALFLLSDDSSYVTGSQYAVDGGLIMH
>SMa1417 Putative
MKAAVLVEPRRFEVREVGIPEIGPADVLIRVTRAGICGTDLHIFNGHYAA
DRLPIVPGHEFCGTIAEIGASVTHLKTGMRVVADINIGCGNCYWCRRNEV
LNCGEVEQIGIGRDGAFAQYVALPGRLVLPVPDGVPEAVLALVEPVACVV
RAARKAGAAFGRSGVVLGAGPIGNLHVQMMRLVGMAPIIVADLSSERCRM
AVEAGADAAVSEPATLRSKVLEMTGGRGADFVVESVGSSKLYRQAFDLVR
KGGHVAFFGITPPGETIPIEILRTVLEENSLKGSVAGMGEDMHDALTLLS
HGRFRTAAFTAAHYPLERIQEAFETIPARTGHLKTQILLDA
>SMa1967 Putative short chain alcohol
MDGAVSLISAASPSASPSELLMRPQRGLVSFTRTWALELAQTGITVNAVA
PGPTEPNCEGEYQYLTGVPMHRLGRPDEIAAAIQFLLSEDAGFITGQTLF
VYCGASIGKALL
>SMa1153 Hypothetical protein
MDNDVRLRQDILDELEYEPTIAANIGVAVEDGIVTLTGHVRSYAEKHAAE
RIAERVKGVRAIAEEIDVRLPEHKKTADDEIAARVLKILAWGAAISDPED
INVKVEKGFVTLNGTVDWHFQRSAAENSVRVLTGVTGIDNQLRIRPRMNV
VDVRHGIREALKRNAETEAENIDVEVSGSHVILHGKVQSLRARAMAERAA
WSAPGVTAVEDRLRIEDARVALGA
>SMa1629 putative
MIEFLNLRGKRALITAGTKGAGAATVSLFLELGAQVLTTARARPEGLPEE
LFVEADLTTKEGCAIVAEATRQRLGGVDVIVHMLGGSSAAGGGFSALSDD
DWYNELSLNLFAAVRLDRQLVPDMVARGSGVVVHVTSIQRVLPLPESTTA
YAAAKAALSTYSKAMSKEVSPKGVRVVRVSPGWIETEASVRLAERLAKQA
GTDLEGGKKIIMDGLGGIPLGRPAKPEEVANLIAFLASDRAASITGAEYT
IDGGTVPTA
>SMa1176 Hypothetical protein
MFSNHLPFAFAGREKAQPKEITMGNWTADLMTRTGLKIHVRPVRTEDEPM
LAEFFTHVTKEDLSFRYLTGLNEVGKERIAALTDVDHVRTENYLAFGESG
DPLIATAMLACDPAFERGEVAISIRADYKNRGVGWELLGFLSRVAQAKGV
KVLESIERRENRAAIEIEQQMGFTTVTDPDDPTILLVRKELRAA
>SMa0279 conserved hypothetical protein
MGIYRDVILPRLCDLSMRNERLRPYRERVIGAAQGRVLEIGVGSGLNLPF
YGPVVGEVLALEPSAGLVAMAREAPRSDLPVSFIDASAEAIPLDDKSVDT
VVTTWTLCTIPDAAAALTEMRRVLRPGGKLLFVEHGLAPDRGVRWWQDTL
TPVWRRISGGCHLNRPIRSMIECGGFRMERVETGYMQGPKPMTFMYEGSA
RPE
>SMa2019 Putative oxidoreductase
MVARGCGVVVHVTSIQGVMPLPESTTAYAAAKAALSTYGKSIAKEISSKG
IRVVRVSPGWIATEASVRLAERLAKQAGTDLEGGKKIIMDALGGIPLGRP
ANPEEVADLIAFLAPDRASSITGTEYTIDGGTVPTA
>SMa0064 conserved hypothetical protein
METYIHEVVAPKLVGRDPLEIDRISKDLTGYLGFRSTGAEMRGNSAVDIG
LWDLFGKATNLPIAQLLGGFSRREIRTYNTCAGNTYMRDAKGQQTANYGI
GGPRRDYDDLNGFLERADELAEDLLSEGITAMKIWPFDIAAEKSGGQYIS
GPDLRKALEPFEKIRKRVGDRIDIMVEFHSMWQLTPAIQIARALEPYATF
WHEDPIKMDSLSSLKRYAAASRAPLCASETLATRWAFRDLMETDAAGVVM
LDLSWCGGISEAKKISTMAEAWHLPVAPHDCTGPVVLAASTHLSLNAPNA
LVQESVRAYYKTWYADLTTQLPTVTNGMITIPPGAGHGVDLAPDLDRKFE
VSRRSSQIED
>SMa0339 putative
MSSLFSNKVVTVTGAGSGIGRAIALGLARDGATVHLADRDADGLTQTAEL
IRAEDGRAFTTELDVASELQVVGWIEQIGSTSGRLDAAFNNAGITGPAKR
IEDYPLEDFQRVIAVNLQSVFLGMKYQIPLIKRNGGGSIVNTASIAALTG
PGGMSAYAASKHGVQGLTRVVAMENAAHGIRVNAIAPGWTETPMVAANSQ
QNPAFAALAQNAIPAKRGGKPEEIAAAAIWLASDAASYVTGHMLTVDGGM
TIGGFEL
>SMa2347 conserved hypothetical protein
MTKWASCNIVCYMSESDRRMRFPSFEGPAFTAAHASSYVEGTSRKVPGLA
ALHRMTSMLVAERAPVQARVLVLGAGGGMELKALADENSDWSFCGIDPSA
DMLRVAEQTVGPHLLRVHLQQGYIGAAPEGPFDAAVCLLTLHFVGRAQRL
DTLEQIRRRLVPGAPFVVAHISFPQSEPERSTWIARHVAFGGTASGEAES
ARQAIATKLSVLSPEEDEAVLRKAGFSDVRLFYAAMTFRGWVGYA
>SMa0470 putative ABC transporter, ATP-binding protein
MPITLPLEIVVERGETIAIVGESGSGKSLTARAIVGILPPGINAKGAVTL
DGVPLMRLAERELRTIRGSRVSMLMQDPFTMLNPLMRSGDHIDEMLRDRP
EFASRAVRADEVKRRLAEVGIVDEDVARRMPFQLSGGMCQRVALAAALAR
DPELLIADEPSTALDVTTQAEIIKLLRRIQRERNMAVILITHNLRLAFST
CQRIYVLYAGSMLEVGDAAAVERQPFHPYTLGLLLSEPPVDIRVPRLVAI
RGSVPRAADVIDSCGFADRCEWAKQICRAGKPSLAARDASRFTACIRQDE
IQGELDALRSATLSATPETPRRGGTAGALVHVDALVKTFAGRRGRPICAI
RDVSLHIMAGESVGLVGESGSGKTTIGRCLVGLETPTDGDIRINGIAAAD
FGAMAKADRDRVRRTIQMIFQDPYSTLNPKHSVGQALREALGASAGAPSP
APQERIASLLAEVGLSAAYATRRPASLSGGERQRVAIARALAVKPAILVC
DEPVSALDVSVQAQVLNLFRRLQVEHELSYLFITHDLAVVRQIAERIYVL
YLGEIVEEGPTERVISNPQHPYTRRLIESIVRSAIQRAP
>SMa0298 putative ABC transporter
MKNLIEVRNLNIAYGGPSGWTNVVQDVSFEIAPGEAFGLVGESGCGKSTV
AYRLLGYGTINSLVQTGEVLFDGTDLLKLDAASLMRLRGNRIAFVPQNPT
TSLSPGMRVGSQICEMIATHKALPDGMTMERRIVELFTLVGLPDVGHRYP
HELSGGQQQRVTIAMAVACNPDLLVLDEPTTGLDVTTQRQIIQLLADLRS
RIGMAMLYVTHDLALLAQIADRVGVMYAGQLVEVAPCDKLLSAPAHPYSR
GLIASIPTNDGTDRQARSLRGMLRRDEMSTGCKFEPRCDFATGACRATPQ
LLELIEDARSVACMRWREATAPLAPSVTAKAVARTAVRSESLLSVTELSL
SYQQPGLFNRLLGRTSPAVVREINLNLAAGEVVALVGGSGSGKSTIARAI
SARLPPRAGIIRLDGTALAPSLKDRSVEELRQIQYIFQNPDASLNPRGLR
LFERTQELDDPCLYRNVERREDLVADQKLRIDEKCAGCYSKPPLDSKRTI
TSHARPPFRGRRPYRASAVPSPAAVRARSAIHAPKTGHPSRSRLPASSAA
ISLPKHWYATPPRQITTARRSRSDQCRDLARDRAQASRRSALRRSAADRT
ANSC
>SMa0620 hypothetical protein
MSGTGKLQRWVGEVSGIEETWNPKWQRHLPAQAPFEWLAKGWRDLITYPM
LSLSYGVAVFVVSFLIIWLLFATGRDYFLFPAVAGFMIIAPLLATGLYLK
SSRLERSEPVSLGSMLRVRPVAGAQVFFTGLLLCMLMLLWMRAAVLVYAL
FFGVRPFPGLGHITQLLLTTPTGWAMLAVGIFIGALFAGFSFAISVFSIP
MLLDQRIDAFTAMGVSVALVWNNLRPMLVWGAIVLGMFLVSVATAMIGLV
VIFPLLGHATWHAYRAVR
>SMa1008 Hypothetical protein
MSAGHFGYIQVIHLGLLVGFDTIQCGVSNDGKVNTPVEKIASIHFSAWRA
KMRRREFLFSVAAAGSIGLIGPARAANEKMIVYKDPNCGCCRAWAEAMKA
AGFSVSTEEAVDLAALKGRYAIPAEMHGCHTAIVADYYVEGHVPLDAVTR
LLAERPDIAGLAVPGMPEGSLGMGDHPRASYDVFAVNGDGSSTVYQTVRP
KS
>SMa1410 Putative oxidoreductase
MGGPMSVYFEKSYQRGFGTYPLKGEPLKAAVREAITVGYRAFDTAQMYGN
EAETGEALAESGLARDELCITTKVHPDNYSEEAFLPSVEASLKALRVDQA
DVLMLHWPEINGENARSLRLLQKAFDIGLARNIGVSNYTAPMMREAQSIV
EAPLVTNQVEFHPLIDQSRLLDAAEETKIALSSYCSVARGEVFKHPVFAE
IGARYGKTAAQTVLRWILQKGVSMNTMSTKPENIRANFEILDFALSPHDM
KRIDAMNATNYRILKAGMLPWVPDWDR
>SMa1814 Putative Dioxygenase
MTDVTAPTSGFAPLRQKVFAVLWIATIVGNTGSFIRDVASSWLVTDLSAA
PAAVAMVQAAATLPIFLLAIPAGVLSDILDRRKFLIVIQLLLAAASICLM
LLSATGLQSVSSLIALTFVGGIGAALMAPTWQAIVPELVARQDVKSAVAL
NSLGINISRSIGPAVGGLLLAWFGAAFTYGVDVISYVFVIVALTWWRRAA
TPDDVLSERFFGAFRAGLRFAKASRELHVVLLRAAVFFAFASAVWALLPL
VARDLLDGDAGFYGILLGAVGAGAIGGALILPRLRTRFDADALLLGAAVV
TAAVMAILSVAPPRWGAIVALLALGAAWITALTTLNGAAQAILPNWVRGR
SLAVYLTVFNGAMTAGSLAWGAVAEALNIPLTLNISAIGLAAAGLLFHFV
KLPKGESDLIASNHWPEPLVAALVDNDRGPVLILIEYKVDKTERPDFLKA
LAKLSNERRRDGAYGWGVTEDAADPERIVEWFMVESWAEHLRQHRRVSKA
DADVQQEVRRFHKGAEAPVVSHLLSINRPQ
>SMa1513 Putative ABC transporter permease
MRIAAPVIATALTFLAGSVLFAALGYDPLATLHAFFVAPINSTNGLSEWL
LKASPLILIACGLAVGFRANIWNIGAEGQLIIGAIAACGVGLFYPDPESP
LLIPLMFLAGAGAGMAWAAIPAFLRARMNTNEILVTLMLTYIATLLLSFL
VHGPWRDPAGFNYPQTALLPAAAMFEPFDYSYRLNPSIFITAVAVVMMWL
FTDRSFLGYKMSVSGAAPLAARYAGFRESSAVWTGLLAGGAAAGIAGMAE
VAGPLGQLSPQISPGYGFAAIIVAFIGRLNAFGIVLGGLLMSLLFLGGET
VQMTLGLPAALTRIFQGILLFFLLAADFFIYYRLRLPEHA
>SMa1521 conserved hypothetical protein
MHICQILGSKSPEIFSVTPDQTMVEVLRLFRDKNIGFVVVGRSPGECLGT
LSERDCCYAVAEYGTEAPLMRVGEIMNRTVATCSTEDFLPFVMSIMTERR
TRHVLVMDGNDAVGVVSIGDVVKHRLEEALQAERDMHDYICGANYR
>SMa1973 Hypothetical protein
MTMTKAKIGIVGTGFIATILAPQIQSSKKARLDAVSSRTLAKAESFVANY
PGAIAVEGADQLIARDDVDAVYIATPTSAKEDVASRALTAGKHVLIEKPL
HSAASFKRLSALARQKASC
>SMa2231 conserved hypothetical protein
MILADTSIWIDHFRHTDAELRRIIEDDRLLCHPAVIGELALGSLRERSSV
IAFLMAQREALVATHQEVMMMIDRHAIFSMGIGYTDAHLLASVLLDQRMA
LWTRDKRLQAAAEKAGASLHTPAHTRN
>SMa1660 putative acetyltransferase
MKSPTVKVMAAAEEDLAVETVMLAFAADPMARWTWPHAHQYLAAMPRMIR
AFGSRAFSNGSAFCTDDYAGTALWLSPGVHSDEEGLGAVLESTVARSLAP
ETAAIFEQMAAYHPTEPHWYLPLIGVDPAHQGKGHGDALMAYALERCDRD
HAPAYLESSNPRNIPFYRRYGFEPLGAIQFGSSPTLVPMLRRPR
>SMa0224 putative transmembrane-transport protein
MPSHAVRRGFVPMTDTSLPRTVDRTAWLGLIAILPLVLLVAMDGSILYLA
MPHVTSALMPTADQALWILDIYGFVVGSLLIAFGNIGDRYGRLKLIITGA
AVFGAGSLGAAYSQTPEQLIASRALMGLGGATLLPSGLAIVSALFPDPRL
RAQAIGIFAATFAAGFAIGPLIGGMLLRQFAWGAVFLINVPVVIGFMIGA
PILLREVRSTVGGSIDLASLVLSFAGILLFTYSLKNAAAYGFTPTQIVAG
AAGIFALALFARRQTKLEYPLLDLGLFRDRIFSIAILTGLLSLVVWSAAG
YLSGVYLQSVLGIDVFAAALLTLPGAIVLTATCVATARIVERIGRKTALV
ATHLLIGAGVFLLLFTTTETGIAVFIASTMIAGIGYGLSFSLVADIAVSA
VPANRAGAAGSIAETSNELGNALGISLLGSLATLSFRLFGPGVAGTLDET
LDQPGLAHQSLIQAQEAFLTGMHVAIGTGGLLTLAVGMVAWLWLPSKLPE
>SMa2377 putative transmembrane transport protein
MTVNVSPTVMFLLISFGTVLGIAGTDLVLPAIPAMPTALGGTAALAQMVL
AAYAGGTLVGLLTFGELGARYSRRKLLVWSLGLFAVTSLLSAYAPTLEWL
VILRFAQGAFGSGPAVFAPGFIHGLYPGDKAPSMFGRLGSIESLTPALAP
IAGAYLMTVGGWQTSFLMLAGLAILCAVGSWAYRQSLPDRLEALEVHQSY
MSIIRNGDFLRHGLSQALSLGSILIFVFGAPAVMTGALGMTIGDFILLQV
FGIALFILASNASNALARRFGIERMIMIGTGSLVLGFLLILLYTSLGGRS
LTVLVPLWMTANGAFGIRGPIGFHQAIVASRGDHSRGAALVVAAILGITA
GGTAAAAPFINVGWWPLALASSLAALLALLCLKLIGSTA
>SMa1461 Putative muconate cycloisomerase
MVKISNVRVRPLVLPLKQPYHWSYGIRESFAVNLIEIEADDGTVGIGECT
VAPDQTGTAAILYRLAKHLVGHSPHDVAPLIARIFHQEYLGHGANIMRAA
NQIFSGIDMAMWDLQGKLAGLPVHQLLGGAHRKAVGYFYFLQGETAEELA
RDAAVGHAQGERVFYLKVGRGEKLDLEITAAVRGEIGDARLRLDANEGWS
VHDAINMCRKLEKYDIEFIEQPTVSWSIPAMAHVREKVGIPIVADQAAFT
LYDVYEICRQRAADMICIGPREIGGIQPMMKAAAVAEAAGLKICIHSSFT
TGITTCAEHHIGLAIPNLDDGNQIMWQLVQEDIVSSPDLTPKNGWLDAFR
KPGLGFQLAEDLVAEGEGRYAASR
>SMa0758 hypothetical protein with local conservation
MAGARGGNDMKADVFDARALREAFGAFPTAVTAITASDPAGRPVGFTANS
FTSVSLDPPLLLVCVAKTARDYSTMTAAEHFAINILSEAQKDVSIKFARP
LEDRFAAVDWARAPNGCPIFAQVAAWFECSMHDVIEAGDHVMMVGRVTAF
KSSGLNGLGYARGGYFAPSVAAKANSSAAGGEIGAVAVLERHAALFPLGD
>SMa0275 conserved hypothetical protein
MSGKKILMLTGEFTEEYEIYVFQKGMEAVGHTVHVVCPDKKAGDRIKTSL
HDFEGDQTYTEKLGHYADINKTFSEVRPEEYDAVYAAGGRGPEYIRTDKR
VQDMVRHFHDTGKPIFTICHGVQILMAVPGVLKGRKVAGLGACEPEVTAV
GGTYIDVEPTGAYVDGNMVSAKGWTGLAAFMRECLNVLGTKITHT
>SMa0753 hypothetical protein with localized conservation
MKADVFDPRALREAFGAFPTAVTVITASDPAGRPVGFTANSFTSVSLDPP
LLLVCVAKTARDYSTMTAAEHFAINILSEAQKDVSIKFARPLEDRFAAVD
WARAPNGCPIFAQVAAWFECSMHDVIEAGDHVMMVGRVTAFKSSGLNGLG
YARGGYFAPSVAAKANSSAAGGEIGAVAVLERHAALFPLGDQNLSLPRYS
AAGGDPAKTLASQLERSGLSVHDWLSLLDL
>SMa0329 putative
MSKRFDGKVAIVTGGGSGIGAAIANRLLEEGASVMMSGRTEKRLSDVASK
MPADRSGIFVANVSSRPDCDALVAATVERFGRIDTVVNAAGMNFVGTIQE
TSDQDWDECIASDLSGVFYMSRAAVPHLKETKGSIVNIGSVSSLGGGWSH
AAYNAAKGGVANLTRSAACDLGKFGVRANTVAPGLTVTGMVEAIMDDDAL
LEKAWDRIPLRRAGQPASAVAFLASDEAAWITGIVLPVDGGQTCTDGGPE
WGK
>SMa2383 probable oxidoreductase
MKAVVMKEVGGTDVMEFVDRPEPVARPAHVVVEVAAAGVNFMDIGVRQGM
AWTDIPNPKVLGVEGAGRVLAVGDGTGEFAVGDRVAWVYAPGSYAQRQSI
PAASLVKIPDTVDDRTAASTMMQGLTASHFATDFYPVQPGDIALVHAAAG
GVGLLLTQIIRLRGGRVIGRVSSEDKVAIARKAGAEHVIVDTDGRFADEV
LRLTGGEGVNVVYDGSGPKTFKGSIEALRRSGTFCWYGPVLGGPGPLEIM
NLPKSIKIGYATFMDHIHTRELLLDRTKQLFDWIEDGSITVTIGETYRLA
DAAGAHAAMASRATTGKLLLIP
>SMa0346 conserved hypothetical protein
MRRNLLLVATQETTVTKTLILLFHPDLKRSKANAALAGAAAKLDGVEVAD
MQAAYPDSMDMFRDGEREARRLLAADRIVLQFPIQWYSTPPLMKAWQDGV
LTRMFYVTYETEGRALEGTPLMLAATAGNVPESYRPGGRNMFTMEALLAP
LRATAHRCGLSCTAPFIIYQADKLEAEELEAAASNYAATLKNWIAGPLVT
RQEAV
>SMa1968 Conserved hypothetical protein
MRSNSSRFQEQIMTSATQDLKPVLFVLTSHSVKGETGEYTGFYLGEVTHP
LAVLDAAGIPVEFASIAGGEPPVDGLDLHDAVNARYWNSEGFRHAIRNTS
RLSDVDPKDYSAIFFAGGHGAMWDFPTSPAVNSVARDIYEAGGVVAAVCH
GPAALVNITLSSGAHLVAGKNVAAFTDDEERAVKLDKTVPFLLASTLSAR
GAHHHPAADWAAKIVVDGRLVTGQNPQSATGVGEALRDLLTA
>SMa0056 putative dehydratase
MKRPRITDIRATTVTVPLEAPLRHSNGAHWGRFVRTIVEVETDVGIVGLG
EMGGGGESAEAAFRALKPYLLGHDTFELENLRFMICNPTASLYNNRTQMH
AAIEFACLDIMGKFLGVPVCDLLGGKMRDAVPFASYMFFRLPNKDTGEGE
TRTADQLIEQTLALKKKCGFTSHKLKSGVFPPDYELEVFRAWAKALGPDS
VRYDPNAAFSVEEAIRFAKGIEDLNNDYYEDPTWGLNGMRRVRENTTMPL
ATNTVVVNFEQLATNILNPAVDVILLDTTFWGGSALREGGGRLRDLPTRH
CGTFVGRTRHPARHHASPRRGSPEPRLPRGCALSPTHGRYHRRRPDALRE
RHYQGADGAGSRGGARSRQARAVRRPP
>SMa1857 Hypothetical protein
MPNWSRGKGPDPSKFDPFTKSGKITRRQEQYNLLHEAGAPSEPENWGIDV
RKSLPEFKYMTFDVVGTLIDFEGGLKDCLAGIAAEAGATIDGEEALSLYR
AARYSKDADLFPDDLVRVYLEIAPKLGLPAEPKYGERFRDSTKNWKGFAD
SAEALARLAKSCRLVAMTNARRWAFDLFAQQLGNPFYAAFTADDTGTEKP
DPVFFEKVFDFVGSEGNSKDDILHVAQSQYHDIGISRKLGLANCWIERRH
AQKGYGGTIEPAEFTAPDYHFTSMAALADAVVVARG
>SMa0792 hypothetical protein with local similarity
MQTVTSHTSGEGGASLPRKTTARGTAYFEAGNGETLILIHGVGMRLEAWA
PQIEAFAKTHRVFALDMPGHGASEKIPAGSTVRDYVAWFGCFLEDLSIAR
ASIAGHSMGALISGGAVATFSDRITRVAYLNGVYRRDAAAKAAVLARAEA
IRKNGVDAEGPLERWFGEDPESQRARELTRTWLEMVDPEGYAIAYAAFAG
GDEIYADCWPSVECPALFLTGSGDPNSTPEMAKQMASVTPKGWARIVDGH
RHMVNLTAPEIVNALMSEWLTSREKPR
>SMa1166 Putative hydrolase protein
MAAMGRIGWTLSAISIAIAVAGGMVFLSYSNDIDRARSAVANGARVANTA
AGPIEYAERGEGTPLLSIHGAGGGWDQGLTNVADLVGRGFRVIAPSRFGY
LGTPIPADASPSAQADAHVALLSKLEINKTVVVGVSAGARSAIELALRHP
DKVSALVLIVPGTYAPESPVMLEGSRGSAFAFWLVNAGADFAWCATEKIA
PSVLIRFLGVPPELVEAAPAQDRNRVMAIIRGVEPLSRRFPGINMDSAPD
LHRLPLEKIAAPTLVVSAQDDLFNTLPAAIFAARSSPGAKLVVYDTGGHL
LVGQGGKVKKVVSDFLAQTGTMQPFGSGAGTSVRPKAPAPTVSLTRS
>SMa1052 Conserved hypothetical protein
MNRNICQFGAALASGIVFGFGLSLSGMLNPARVQGFLEVFGTWDPSLAFV
LGGAVVVAFIGVQVMKLMRHPAFDDTFHVPTIRRIDAPLVIGSAVFGLGW
GIGGFCPGPAVASLALGLPQTVLFVVAMLVGMTLHDRLWSRGT
>SMa2157 probable oxidoreductase
MGAALPVRPAGAQQASRTPIRRPIPKSGEMIPAIGLGTFETFDILPGEPR
DDLRDVIRLFHENGGRVIDTSPLYGTAEVCVGDFIMDLGIADDIFITNKT
WTTGDYLSDNSHSERQLRQSRERLWRERIDVLQVHSLENHDQVRHWLAHK
KAEGSIRYIGITQWSPEYYDTMERLVNTGTLDFVQIAYTIVTRAAEQRLL
DACSANGVAVQVNTPFEKARLFTPVAGQPVPDFARELGVETWAQYFLKWI
ISHPAVTNVIPATSQPEHVVDNMGALYGDLPDQAMRKRMTDHYTGLTGVA
DALKQPPYPGKQYGGVVKWPFPQPKRT
>SMa0967 hypothetical protein
MLHEEFADALKEGTADFKKLGRILKIILFGSYARGTWVDEPHTKKGYKSD
YDLLIVVNNRKLTDFSSYWQKAQDRLMHLPEIRTPVSLIVHSRREVNTAL
YDGEYFFVEIRRDGILLYELDDEPLAEPRPRGPADALRIAKDYFEDRLPH
AKTFVEGTQFFVSRGRRKEAAFLLHQSIEQTYAALLLVLTSYSPASHNLR
HLRSLAEERDQRLAEVWPRDQHQYVAWFNILNEAYVKSRYSKHYEISEDA
LAWLLERAHQLIADVEAICVEHLDRLRKQAEDDVD
>SMa1676 conserved hypothetical protein
MLLMNCGDPLRGRCMPAARIKIIYITQILIATLLLLAMLSMASSIFAPVA
FALFIIALVWPTQCRLQAMLPRYLALIISFLLVVLAIVAFGGLIAWAFGH
VGRWIIADAARFQQLYDQVRLWLEEHGVAVGVLWSENFGVGWVLHTVQAV
SGRLNSTFSFWLIALVYVLLGLMEMDDFGRRIEALRNRTASALLLRGSQQ
TAMKIRRYMMIRTVMSVVTGLLVWIFTRAVGLSLAEEWGFIAFALNYIPF
LGPLLATLFPTLFALIQFGTVETVLIVFTGLNLIQFVVSSYIEPRASGSA
LSMSPVMVLFSVFLWGYLWGIFGAFIGVPITIALLTFCNQHPSSKWLSEL
FGLEMAADQVASTPSG
>SMa1514 Putative ABC transporter permease
MDLFIAIFTGTIIAATPLIFAALGELVVEKSGVLNLGLEGMMLMGAAFAF
WAVIAGLPMPVAIAAGALAGAATSLLFGVLALTFLTNQYAAGLALAIFGS
GVSAFLGRGFGSAPIDALKRVHIPFLSDIPVVGPMFFRFDPMVYLAIGMF
GLITWFLYRTKGGLILRTIGESPETSHAIGYPVIRIRYLAVLFGGLMAGL
AGAYLSVAYTPLWVENMTAGKGWISLALVVFATWRPLRVLIGAWLFGGMT
ILQLQGQALGIAVPSELLSALPYLATIIVLVIISQNRQLLTLHFPASLAK
PFRAAS
>SMa1381 Putative
MTMREADVIVVGGGPAGVSAAIEAAKSGLSVMLCEQRPALGGAIHRQPAE
GATPVAVLPSLRGRWQALSAELSASGVDVRTRRAFVGVDSTGAVLIEDRA
AGKVEVRRPRALILSCGAVERVRPRRGWHLPGVAAAGGLQVMLKEGRVPG
GRILLAGSGPLLLALAAQMTAAGNPPVAVIEEGDPASRPLAGVRLLAHPS
ILPDMAALMMPVLFRRVVWRRGTRLTEITQSGDMLTACLIAPNGREERIE
VDRIGLHDGLRPNDFGLPANDAAAGLVILRAGDVREVLGAHAAEADGAEA
GREAAARLAGRPPRSGANGIRRLRSLQTSLSRLFAPVHGAPILDDCPGDT
VICRCENRTISHLKAQLSGPDTVSARELRLNGRFGMGACQGRFCSEWTLS
LMSELRPTASPSSIAEMGACRWPLRPVALSSLAKGGTNADTLTEPHMEEI
SA
>SMa0257 probable methylamine
MSRDSRFDILFEPVKIGPVTARNRFYQVPHCSGMGYRYPNAEAHLRGMKA
EGGWAVVSTQEAEIHPTSDLTPANEARLWDDGDLPALSAVTERVHAHGSL
AAIQLVHNGLHVANRFSRMIPLAPSHAVSDSLDPVQARAMDKADITDMRR
WYRNAALRAKKAGFDIVYLYAGHDMSVLQHFLSRRHNDRSDEYGGSFENR
LRLFREILDDVREAIGDTCALAVRLAVDELMGPSGITCEGEGKDIISALG
ELPDLWDVNLSDWSNDSQTARFSEEGYQEPYIRFVKSVTTKPVVGVGRYT
SPDSMVRVVKQGILDFIGAARPSIADPFLPKKIEEGRIDDIRECIGCNIC
TSGDNTNVPMRCTQNPTVGEEWRKGWHPETIARSEAPEPALIIGGGPAGL
EAARALAQRGVDVMLAEGGGEWGGRVARECRLPGLATWGRVRDWRIGQLS
TRVNAELYLHSPLSAADILQYGIPHVAIATGASWRTDGVGRTHRMALDFL
SEGILVSPDAILSEGAEAVPSDGPVVVFDDDCFYMGSVLAELLARRGRTV
TFVTPESQVSPWSRNTLEQARIQKRLIGLGVEIVTAMALAGRTKDQLELS
CVYSGRSRPVDCATLVPVTARLPDETLWLELKAREAEWADAGIKTITRLG
DCLAPGLIAAAVYSGHQYARTYQEQVDKDRVPFMREDIARLYGLRSG
>SMa2131 hypothetical protein
MVAFLNLMHFSRAPPSQGKGHGEHPWDCDMENLSHIDPIARDEWHVVASI
EELPSTGMFSTVLLGQKISIGRHGDRLGAWCEPQAAAEPSGSQVFGAELP
VIQRFGYLWATLGDPPRDLFEIPEFDEPDRSRIVQGVTRVGVSAPRAVEN
FLDMGHFPFVHTGILGEEPHTEVKPYKVDVYSDPPEILVTDCEFYQPQAN
AGSETGADTEYVYRIPHPFCAVLYKTGSTRPDRRDVIALFGQPVGEDQVL
VHIVICLLDEVTDVAAMRSFHQTILGQDKPILENQMPKRLPLDSRSEVPI
RADAASAAYRRWLRERGLRYGVVTGSA
>SMa1509 Probable ATP transporter, ATP-binding protein
MIPRLELRSITKCYPGTVANDAVSLSILPGEIHAVLGENGAGKSTLMKII
YGAAQADSGEIYCDGRRIEAHNPAISRSLGIEMVYQHFALFESVSVVENI
ALAVKGTFDLDRLAAEIKTLSARYGMPIDPHRRVHDLSVGERQRVEIVRC
LLQSPKLLILDEPTSVLTPQAVVKLFETLRQLASEGCSIVYISHKLDEVQ
ELCDTATVLRNGKVTGTAKPKESTSLELARMMVGSQLPQMHVSPSAPSAK
PLLEVRGLSAPARDKYGTELTDVSLEVHGGEIVGLAGVSGNGQAELIALL
SGERTHPRAETILIGGRPSGHLNAGERRKLGMAFVPEERLGRGAVPPHAL
WENAVLTAHRAGLVRNALVDRRRAGEFARHIIERFKVKANGPQASAQSLS
GGNLQKFIVGRELTLEPKILLVSQPTWGVDVGAAAFIRQTLVDLSRGGAA
VLVVSEELDELFEICDRLLVISNGRVSPPLIRKQTNREEIGLLMTRVGHG
ETRRSEVALED
>SMa2359 conserved hypothetical protein
MEDFILFALVGFLAQVIDGALGMAYGVICSTVLLAFGVPPAQASASVHAA
ELFTTAASGSAHLYHRNIDWKLFWRLIPFGIAGGMLGAFVVTSFDGDQVK
PFVTAYLAVIGAWLLYRSFHRIPTNPVKLRIVAPLGATAGFLDAAGGGGW
GPVATTGLLGAGGQPRFVIGTVNASEFLIALSVSLSFLATVLTGHWEQAG
DFRDHLTSIGGLITGGVVAAPFAGWVVKALKEKTLLRLVGSLITLLAGYQ
TLELTGFL
>SMa2123 probable ABC transporter, permease protein
MLELFQGPLFISILAAMVRIATPLLFSAMGELVTQRAGIWNISVEGTMLL
GAVVAYVIASSTGSPWLALFVAVLACALLSVILSFVTIVLKSEQFIAGLA
LNLLASGLTLFWFQTYIIGRDPPKFAGFEAVEIQYLSDIPVLGTVLFSQR
VLTYVSFLLPLAVWFFLYRTRYGLEVRCVGENPKALDVKGLSVGSRRCLA
IMFGSVMSGFGGAFLMLGYSDRFVPDLIAGRGWLVVVAIIAGNWMPFRVV
GAIFIFALLEAVGIHAQVVGVSVPHHVFLVLPYVASLVLLAGLRSRTHQP
AALGIPYRRE
>SMa2125 probable ABC transporter, permease protein
MISVQRRSGNSLSWNFACYAAAMLCALVSAAALLHLSGGDVAKAFSSLIV
GAFGSQKALLGSLAKATPLLLVGLGTVIAFRAKIWNIGQEGQVLAGAMCA
YWASLWIGPLPYWIAFTVLVLAGLAGGGALGVLAGVLKTRFGTSEIISTV
MLNYIVIFLLAYLLDGGPWMETGVTVAYHQSPPVNAMLEWPTLLGQGAHK
LHFGFLLALVATVLCAVLLERTPLGYEIRAFGSNPTALRFRGTDISRLLL
VVMLVSGALAGLAGAGELFGTSHRLRAETLLGIGSSGIVVAMVGGLRPSG
AMLAALFFGALKSGAIYMRLQSGTPAGLVSAMEGLVLLFFLCAAVATRIH
ITVRSEAHA
>SMa1434 Probable ABC transporter, ATP-binding protein
MNAPDKPILRIDKLTVDFLSEGDPVRAVDDVSFDVCPGETLVILGESGSG
KSVSTGTVMGLIDCPPGDIVSGSLVFDGTDLSRLDDEGRRELNGRRIAMI
FQDPLAYLNPVYTVGRQIAEVFESHGEGEGGAVRDKVVRLLERVGIPEAD
ERIDYYPHQFSGGQRQRVMIAMAIALKPDILIADEPTTALDVSVQAQILE
LLRDLQRETGMALIMITHDLEVAAAMADRIIVMNGGKVVESGKAEDVFTN
PSHAYTRRLMSAVPHADAPKAPRNAAQGEVLLQVAHLSKHYKLGSGPFSP
KREFRAVDDVSFTLRRGETVGIVGESGSGKSSIARMLLRLNEPTSGAALF
AGEDIFELKGKALDGFRRRVQMVFQDPFGSMNPRMNVRSIISEPWAIHRD
ILPRERWNERVVELLELVGLKAEHAARHPHQFSGGQRQRIAIARALASEP
ELIVCDEAVSALDVSIQMQVIELLADLRQRLGLSYVFITHDLPIVRQFAD
RILVMQRGRIVEEGETEALFVSPQHEYTQALLRAVPQPKWLRSDPAPIAG
>SMa1262 Conserved hypothetical protein
MGGFVVRVNMPGWLKPRPTDHGHGPLAMIVESSLDPGRPIAMHEHRNDEI
ISWVPFGVMRHDDKTTGRLVTDSKHLLVMNAGRSFWHSEETLSSDPPLRM
LQIFVRPRAVDLDPRIQHGPIPLRRPNTWRHLVGPEGGDAPFHIRNTIDL
FDIRLEPGARLVFPHMRGRDLYFYVYSGLLFAAGQTFAEGAQGLLLSDRE
LSVESKTQSTVVAFLIDPHAPITRKGTVGDHRKIPPVILIRMLRKWRQLW
KWRRSY
>SMa1969 Conserved hypothetical protein
MKPIRTFALALVLGTSAFTAQAANVLVVLSDSDHLDLKDGKVFETGFYLN
ELMQPVKALTEAGHDITFATPKGTAPTLDKSSVDNMYFGGDEAAMQESIA
MLDKLKLTSDSSSPVLSLARVEQIGYDHFDAVYVPGGHAPMQDLLVSPEL
GKLLADFHAKGKTTALACHGPIALLSTLPDASAFTTKLETSGSAKAEGWI
YAGYKMTVISNQEEEIAKGLLNGGKMKFYPQTALEAAGGDFVSNEAPWAS
NVVTDRELITGQNPASAPAVATELLKRLK
>SMa1328 Probable MtbA protein
MAVQVPTDFRRVIVAASVGNIIEWYDFYIFGSLAAVLSVKFFEQSHPVAA
LLSTIALFTAGFLIRPLGAFLFGWMGDRVGRKYTFLITLTGMGLGTGAIG
LIPTYESIGLTAAFLLFSLRMIQGLCLGGEYGGAITYVAEHVPDERRGYY
TGWLQTSPTLGIVVSLAVIIAARTYFGSEAFDAWAWRVPFLVSFLLVGIA
IYIRLQLQETPIFQEIKAKGQMTQNPWREAFLSSNIKYVGIATIVLIGQG
VVWYSGQFWALYFLQQVSKVDPLNSAYIVGAALLLATPSLILFGWLSDII
GRKPVILGGMLLAALTYYPLYLWLGAVTQPDNINYPIAIFIIFILVCYVG
MVYGPVGAFLAEYFPGRIRYTSVSVPYHIGNGWGGGLVPFITSAAFAATG
SIGYALIYPIAVPAVCFVLAIFLMPETRRISIWQPIEPRT
>SMa2119 hypothetical protein
MNMKESPMAKTRCLDPVVLNLWHPLGALIELPVDTVVDTVLLEERLSLAV
GLDGAVAVWQSCPDFAAGDKIDVAAVSKSLPAKVAYGYLWASLGSPPDEL
FHIPEYDEADRRRLNAATFGVNVSAPRAIENFLDMGHFPYVHTDILGVEP
HTEVKEYDVDISVERDEILATRCRFFQPLASSASETGAEVEYIYRVPHPY
CSVLYKSSPVDDARYDVIAVFMQPLSQESVRAHMMLCILDEDNEDKVIKR
FQQTIFGQDKPILENQFPKRLPLDPRAETPIRADKSAIAYRRWLSQKDVR
YGVIPASN
>SMa2127 probable ABC transporter, ATP-binding protein
MENLLSVQNLTKRFGAVTANDSVDLDVRKGEIHCLFGENGAGKSTLSACL
YGYYRADSGVIRFKGQVAELNSPADALRLGIGMVHQHFVLVENFTVLENI
IVGSPDVGMLLSKSTARQKVEDLCLRCGIELDLDREIWQLSVGEQQWVEI
LKALYFGAELLILDEPTAVLTPQQSDQLFVILDGMRRQGLSIILISHKLR
EVMQSDRVTILRKGKVVATVETATTTAESITALMVGHQVTKRVSDRSVAP
GREVLVVDHAVAIGEWGEEVLCDINFTIAENEILGLAGVAGNGQKELFEV
LMGVRTLSSGRFHLNGEAIVAPTSREMLDRGVGLVPDDRFREGLISEFGT
AENLVLGWQRKPEYRRGPFLDRGKINDLAQRKLEEFRIVAASTDLPVERL
SGGNAQRVILAREFLNAKCLLLANQPTRGLDVAASEFVYEKILEKRAEGF
AVFLASEELDDLLRLCDRIAVIFKGKIVGTVRPEETTLLELGMMMAGNAS
NLGGQVNDFGSKALRQ
>SMa2313 putative oxidoreductase
MNEPTRIRWGILGPGNIAKDFFAGALQSANGKVVAIGARNPAKSGLAEDF
PGARIVDGYDALIDDPGIDAVYIATVHPLHAEWAIKAAEKGKHVLCEKPM
GLSTAEADAMFEAARKAGTFMAEAYMYRLHPLTARIVELVKNGMVGDVRK
IQSSFGFAKLPFDEGHRLFSNEMAGGGILDVGGYTTSMARLIAGIGTSSG
VMEPAEVTALGHLGRTGVDEWTSALLSFPNGIIAELSCSVSLEQENVLRI
LGTKGRIEVDQFWFAGGKPGGTSIIRIVHADGRQEEVPLVEPRHLYSFEV
EAAGDAIRAGRTEFAYPGMSRADTLGNLRVMDKWRAAIGLEYEGEKHTTR
TRTVRGDKLARKTSLVRSGRIDGLQKEISHAALGLMEFSTFSSAAIVLDA
FFEAGGNLVDTAFLYGNGVQDRLVGEWMRSRGVRQETVVIAKGAHSPLCY
PDVIGKQLTTSLERMGTDYVDIYFMHRDNPDIPVGEFVDAMDAEVAAGRI
RGPIGGSNWTRERFEEAIAYAERAGKTKPSVLSNNFSLAEMVQPVWAGCI
SSSDDAWMRWLEENDVTNFAWSSQARGFFTDRAGRGKLDDLELARSWYSE
GNFARRDRAIALGRKLGKDPIQIALAYVLAQKGRVIPLIGPRLLAELNHS
LDAFAVTLSPGDVQWLRDGDPGESSAA
>SMa1057 Conserved hypothetical protein
MPITAISDKLSVSPQLSVEDIPSLRDKGFKTLINNRPHKEDTFQPNTQAE
RQEVKHCGLTYAFIPVTADTITEADVRAFQRAVDESDGPVLAHCQTGGRS
LNLYLIGEVLDGRMSADEADAFGRSRGFDTSVAAAWLKQHAARRPQVKGF
FDKRTWSVQYVVSDPETGKCAIIDPVLDFDERAGATATINADAILDYVRD
NGLTVEWILDTHPHADHFSAAQYLKEKTGAQTAIGERVVDVQKLWQKIYN
WPELATDGSQWDRLFADGEDFKIGSIDAKVLFSPGHTLASITYVIGDAAF
VHDTLFLPDSGTARADFPGGDARVLWNSIQEILALPDETRIFTGHDYQPD
GRAPRWESTVAEQKKSNPHLAGVSEKEFVALRTKRDKTLPMPKLILHALQ
VNICGGRLPEPEANGKRYLKFPLDALQGAAWE
>SMa0169 hypothetical protein
MALDRADFYDAELARHNRQLRVAADFGADDRVLDIGCGAGQTTREAARAA
PQGEAIGVDISAEMLEEARRRSAAEGLRNAMFEQGDAQFHGFPTGSFDLC
ISRFGVMFFADPAAAFANIGRAMRPGARLVWMVWQSRERNEWSRAIRQAL
APAIAVSAGAANPFSLGDPPVATDLLSAAGFTSIDFADVQEPVFYGSDVD
AAFDALTSLYLVQDALASTNEPPDKPLQRLRDLLEGHMTPEGVFFDSRAW
IITARRAGGGG
>SMa2121 hypothetical protein
MREFRCVDRPLLNDWHVVADRSALTLNSVFTTRLMGHDLQVTLDGRYNLQ
VVALDTGKEVCSDSRYGFIWACLGRPERDIIYLPETNEADRHLLGGGSIA
VRVSGLRAVENFLDMAHFPFVHAGWLSDEPHTEVMPYNVTITAADELLAT
DCKFHQPIASPTAQTVMVVDYVYKVFRPYTVALRKSSPLDPNRKDLIVLF
IQPVDEENCIVHSYLCYLKQGTEAADVRRFMQLIFAQDKPILENQCPRRL
PLDPRAETPIRADAVSVHYRRWLRDRSVTYGAIAYPV
>SMa1296 adhA1, AdhA1 alcohol
MTMTAAVVREFGKPLVIEEVPVPQPGPGQVLIKYEATGVCHTDLHAAKGD
WPVRPNPPFIPGHEGVGYVAKLGAEVTRLKEGDRVGVPWLHTACGCCTPC
RTGWETLCGSQQNTGYSVDGTFAQYGLADPDFVGRLPARLEFGPAAPVLC
AGVTVYKGLKETEVRPGEWVLVSGIGGLGHMAVQYAKAMGMHVAAADIFP
DKLALAEKLGADLVVDARAPDAVEEVQRRTGGLHGALVTAVSPKAMEQAY
SMLRSKGTMALVGLPPGQICLPVFDTVLKRITVRGSIVGTRQDLEEALEF
AGEGKVAAHFSWDKIENINAIFERMEEGKIDGRIVLDLNG
>SMa2371 codA1, putative CodA1 cytosine deaminase
MFDLIIRNANLPDGRQGFDIGLAGGKIAAIEKSITASPGEEIDAAGRLVS
PPFCDPHFHMDATLSLGLPRMNISGTLLEGISLWGELRPLLTKEALVERA
LRYCDLAVTQGLLYIRSHVDTSDPRLVTAEALLEVKEQVAPYIELQLVAF
PQDGYFRAPGGVASLERALDMGIGIVGGIPHFERTMEDGARSVEALCRLA
ADRGLPVDMHCDETDDPMSRHIETLAAETVRFGLKGRVAGSHLTSMHSMD
NYYVSKLISLMAEAEINVIPNPLINIMLQGRHDTYPKRRGMTRVRELMAA
DLNVSFGHDCVMDPWYSMGSGDMLEVAHMAIHVAQMAGIEDKCKIFDAIT
VNSAKTMGLEGYGLDIGCKADLVVLQAADVTEALRLKPNRLFVIKAGKVI
ARTAPRVGELFLSGRPASIDMGRDYVPPVLQR
>SMa0512 idnD, IdnD L-idonate 5-dehydrogenase
MKAIVIHTAKDLRVEECAVEKPGPGEVEIRLAAGGICGSDLHYYNHGGFG
TVRLKEPMILGHEVSGHVAALGEGVSDLAIGDLVAVSPSRPCGACDYCLK
GLPNHCFHMRFYGSAMPFPHIQGAFRERLVAKASQCVKAEGLSAGEAAMA
EPLSVTLHATRRAGEMLGKRVLVTGCGPIGTLSILAARRAGAAEIVAADL
SERALGFARAVGADRTVNLSEDRDGLVPFSENKGTFDVLYECSGAQPALV
AGIQALRPRGVIVQLGLGGDMALPMMAITAKELDLRGSFRFHEEFATAVK
LMQGGLIDVKPLITHTLPLGEALKAFEIASDKGQSMKTQIAFA
>SMa0513 idnO1, IdnO1 gluconate 5-dehydrogenase
MSTELFDLTGKRALVTGSSQGIGYALAKGLAATGAEIILNGRDAAKLAAA
ARDLGAGHTLAFDATDHQAVRKAVDAFEADVGAIDILVNNAGMQHRTPLE
DFPADAFERLLKTNVSSVFNVGQAVARHMIKRGAGKIINIASVQTALARP
GIAPYTATKGAVGNLTKGMATDWARYGLQCNAIAPGYFDTPLNAALVADP
SFSDWLERRTPAGRWGKVEELVGACIFLSSDASSFVNGHVLYVDGGITAS
L
>SMa0814 nifB, NifB FeMo cofactor biosynthesis protein
MSTPMILRESRTSTTFSDQLLENAKSVGCSPPSTAPGDIDPGTWDKIKNH
PCFSEEAHHYFARMHVAVAPACNIQCNYCNRKYDCANESRPGVASEKLTP
DQAVRKVIAVANEVPQLSVLGIAGPGDACYDWKKTRATFERVAREIPDIR
LCISTNGLSLPDHVDELAEMNVDHVTITINMVDPRVGVKIYPWIYYGQRR
HTGIDAARILHERQMLGLEMLAERGILTKVNSVMIPGVNDEHLIEVNKVV
KGRGALLHNVMPLISNRIHGTYYGLTGQRGPEAFELQALQDRLEGTKLMR
HCRHCRADAIGLLGDDRGHEFTLAEIPDEITYDASKRQAYRQLVARERGD
HLVAKNEANRTVMSVEYGGSLLIAVATKGGGRINEHFGHAKEFHVYTVSQ
RGIKLAGRRRVEQYCLGGWGEVATLDHIVVALEGIDILLCVKIGDYPRKQ
LTQAGLRATEAYGHDYIESALGALYAAEFGIEPPVKTATA
>SMa0831 nifX, NifX nitrogen fixation protein
MISIRRLSLVSDQSQREISDRPVGALRIAIATEDMKGLNAHFGSAKRFAI
YDVTAHKSQFMEAIEFDDASDESGRHRTEGDGRIRSRVSALKGCQLLFCL
AIGGPSAAKVISAKIHPIKAQQAVSMSQVLSSVETMLQTAPPPWLRKMLA
DAGAAKKRADFEDETE
>SMa0854 nodG, NodG 3-oxoacyl-(acyl carrier protein) reductase
MFELTGRKALVTGASGAIGGAIARVLHAQGAIVGLHGTQIEKLETLATEL
GDRVKLFPANLANRDEVKALGQRAEADLEGVDILVNNAGITKDGLFLHMA
DPDWDIVLEVNLTAMFRLTREITQQMIRRRNGRIINVTSVAGAIGNPGQT
NYCASKAGMIGFSKSLAQEIATRNITVNCVAPGFIESAMTDKLNHKQKEK
IMVAIPIHRMGTGTEVASAVAYLASDHAAYVTGQTIHVNGGMAMI
>SMa0851 nodH, NodH sulfotransferase
MTHSTLPPQPFAILAMPRTGTHYLEELVNEHPNVLSNGELLNTYDTNWPD
KERLLLSDRELLERAFLRYPPHSDKKVTHVGCKINEPQFQERPSFFAELT
AWPGLKVILVIRRNTLESLRSFVQARQTRQWLKFKSDSSAPPPPVMLPFA
TCEAYFKAADDFHARVVYAFDSSRIRLIEYERLLRDPVPCVATVLDFLGA
PALQLADRGILRRQETRPLDQTVRNFHELRVHFANGPYARFFELAND
>SMa0772 nodL, NodL Nod factor acetyltransferase
MTRTQKEKMLAGEMYNAADPEIQADLLAAGAWLKRYNSTLGDSAEQWHLF
LREGLGEVGPGAVIRPPFHCDYGFNISIGAHAYMNFNCVILDVAKVTIGD
GTAIGPAVQIYTADHPDDPEQRQAGLQLGRPVRIGKHVWIGGGAIILPGV
TIGDHAVVGAGSVVTRDVPPGAKVMGSPARVRG
>SMa0773 noeA, NoeA host specific nodulation protein
MARMADSKLVAAAPRPGRVAGSFRDPSGQVFHFQDRILRTMDSAAAIEFA
SAERVMRQLVDEGRLVDFSDAEPSLHQLFQGSIARVLQHPLLEQITYPYE
WSFAGLKAAALFHLQLQLDLLDQGFCLSDATAYNVQFEGSRPTFIDHLSI
KPYRDGQLWYGHKQFCEQFLVPLLLRSVFDITHHSWYRGNLEGVPSADFV
KLLSTRHWFSHKLFMHIILPAKLQSSRTSQTKVDLGDSRARRLPKDAFRA
MLAQLYSWISGLKVDVGKQSVWQGYAANNTYTATQRSDKGQYVAEFVAQH
KPRTIIDLGCNTGDFSYVALENGAEKAIGFDFDPHALDAAFDRSVQTSKN
FLPLYLDARNPSPSQGWGERERQGFSSRFSADAVLALAFEHHLAIAHNVP
LAEVVAWVTQVAPKGIIEFVPKEDETVRRMLAGREDIFSDYNEEAFASAL
SQKARVVNKHLIPGSKRTLYTFERSE
>SMa1272 norQ, NorQ protein required for nitric oxide reductase activity
MNIVLKASPIPDSAIPAYSPSGRECELFESAWTRQLPLLLKGPTGCGKTR
FVTHMAAKLGLPLSTVSCHDDLAAADLTGRFLLKGGDTVWVDGPLTRAVR
EGGICYLDEIVEARKDVAVVLHPLTDDRRILPLERTGELLEAPPGFMLVV
SYNPGYQNLLKTLKPSTRQRFVAIEFDFLPRVSEIAVVSEESGLDESRVA
PLVDLAHRLRSLKGHDLEEGVSTRLLVYCASLVDNGVSVRDAVLATMIEP
LTDEPDVKAALIEIADAVVRQG
>SMa1185 nosY, NosY nitrous oxide metabolic protein
MSNILTIAGKEIQEGMRNRWVLATTLLLTALALTLSFLGSVPTGSVGVDK
LDVVIVSLSSLTIFLVPLIALLLSHDAIVGEMERGTMLLLLSYPIGRREV
VCGKFLGHLAILAFATLFGYGAAAAALVATGSAVGPDSWQAFGSMIASSI
LLGAVFAAIGYLISSVARERATAGGIAIGIWLFFVLIYDMALLGGLVAAQ
GLAIPTGLLNLLLLANPTDVYRVLNLGSGGASALSALGGVADHTGLSSPV
LLAALGLWTLAPLGFATLIFSRREL
>SMa0981 ntrR2, probable NtrR2 transcription regulator
MSRLYMLDTNIVSELARNPQGAVTKRIAEVGPEAVCVSIITAAELRYGCA
KKGSPKLLAQIEAILGSMQVLALDVPADAEYGNIRAELETAGKPIGPNDL
FIAAHACVLGAVLVTVNSSEFTRVRDLKVENWLDFTSSG
>SMa1523 nuoG2, NuoG2 NADH I CHAIN G 2
MIKVTIDEQSLEVEAGSTVLAAAERLGIEIPTFCYWKRLPPLASCRMCLV
EIEGLRRLQPACATVAADGMVVRTNTPLIEETRSSMLDMLLANHPLDCPI
CDKGGECELQDMVMAYGPGESRFRDPKRVFHSKDIRLSPVIIMNVNRCIQ
CQRCVRMCEEVVGAVALGTVEKGMDTAVTGFEGSLASCDQCGNCVEVCPV
GALMSFPYRYKARPWDLAETDTICPHCGTGCQLTVGARKGEFMRVRSDWE
HGVNRETLCVRGRFGLDFIESRDRIKRPMIRRDGTLTPVSWEEAGDFLRQ
RLGVAEGKAAGGLISPRLPNEVLYQFQKLMRTVLRTNNVDCSSRWSAPLD
ILVPIVASFYSRDPLEQVIGKDCVLIIGGNVTEENPVTEYLLRDAARRRH
TRLLMLSARPSRLDADARAVLRAHPGGEGQSLAAVVAALVAVTDEGLPDD
IFAKTSGTTASSGANDALDRLVSTLKEGRSVTLLVSVDLLRSPLARKTLE
QLGNLLQLLRLLGKEPSLQFLFDRANQMGAWDMGVLPGVLPGLSPIADEA
TRTRFERSWGAEIPREPGADVDAMLELCEKGGMGVLYVVGSDPLISYPDR
EFVERALGAANLLIVQDAFLTDTAGLADVVLPAAGYGEESGTFTNNEGRT
QALRKFREPAFDARSNLAIFGFIAALRERPLQPSTETVIFEEMTRLVPAY
EGLTWEGLGADGAFTTSAPKPWTSGFFAPLSAPAVTDVLQLITGNCLFHN
GYVSEHSETLNSVADDPFIEMSAQDAAGLSLSDGDQVLVRSARGELTAKL
KVNRRFPHGLVFVPENYRALRLNSLMRRGEYPCPVEIRECAKRAASALDE
ERV
>SMa0340 wrbA2, probable WrbA2 Trp-repressor binding protein
MTRVLVLYYSSYGHIETMAGAVAEGARSTGAEVTIKRVPETVPIEVADKA
HFKLNQAAPVATVAELADYDAIIVGTGTRFGRMSSQMAVFLDQAGGLWAR
GALNGKVGGAFVSTGTQHGGQETTLFSIITNLMHFGMVIVGLPYSHQGQM
SVDEIVGGAPYGATTVAGGDGSRQPSQIDLAGAFHQGEIVARTAAALVAA
RN
>SMa1935 wrbA3, probable WrbA3 Trp repressor binding protein
MTKMLVLYYSSYGHIEAMAKAVANGAKQAGATVALKRVPELVPEAVARSS
GYRLGQEAPIATVAELADYDAIVIGTPTRFGNMASQMKNFLDQTGGLWAE
NKLVGKVGSVFTSTGSQHGGQESTILSTHVVMLHLGMVIVGLPYSFKGQM
RMDEITGGSPYGASTLAEDENHRDRSPSANELDGARFQGRHVAEVAAAMQ
LGRSHLQPELVR