TitleGenColors Logo

Gene list

Applied filters:

COG category: Unclassified
Gene type: CDS
Genomic element: chromosome

Number of genes found: 553

Free access
Sort by:

 



# Methylococcus capsulatus str. Bath, Bath

>MCA0766 hypothetical protein
MCPCSRRAGLALPGPGWVKCLPCRNRQIPSTTLSRGHPMDQDQAFRRTLA
GLTATAALLAGCGSETPEPAKTSAPAATTSSTPSTPPHIPVTQSEHQIFE
KLYVEKCIKGQQNDPDSLVKDDQELGRVCECMAKEISQRISKADAVHFNQ
KGEFPFDLVIMSNQAANHCLGSS
>MCA0929 hypothetical protein
MFCRNPRLHRGWRRNIRDPTRRRVIGGALSECIAERRTEPGRTQDGAGRR
RQGRRQFFAPEGDGDVPSGFAKNGDAPADRKKPMNTRASGGGTLPSRRLS
TQIRSRVI
>MCA1779 conserved hypothetical protein
MRFLFYFLLIANVSFFFWSLGHRDTKRHVDVKSVEASGRLLLRSEVADTR
GEIQSPPTESPPPARIENELAAGEAGCYRMGPFAETAEARRVLALVKEWI
DRANLTSESSESLDGYWLLYPKAENMDEAKGNRKMLMDKGIKDVWLIDKG
EMAGTISLGIFKTREEAETEVVRLSKLNVRAQIRPRITQAETVWIQFVWD
KSPLELDEIMIQLRGENPEVSVPPLSPCPSASPQRPA
>MCA2763 putative lipoprotein
MVRRISLMALVCLSVAVGGCGADRSRKLLGGVDVRNEQLGAVRPGEGPRT
VYVADFTLDAENIQSDPGVKGLLPGQSQSQRPGLLGGLGQRLRGTLGAGS
PGDRAREIVDAMAAALVKSLSDKGVPARRIASTTGALPRDGWLVQGMFTE
VDEGSRLKRAVIGFGAGATSMDVQVGVSDLAGKDPRQPFIMFGTVKDPSK
MPGAVVTMNPYVAAAKFVMEKNATGKDIEKTAEQIVDEILKYRQKFEDEA
RANRTAP
>MCA2406 hypothetical protein
MSPSIRISRSWIASSGAAALLAVVSALRAESLAHDAVASATTPLTTPTPQ
PQAQPLVTVAKRKQDGMALTVKRTWWSAKKDQIEVSLTVKIPNETLKLMT
AGDAKSRQGLLVEFRHADGAAFAECELDLGQAKTRQVVFDARELYRRGML
QIKSGICDPDISTEIAEQTIPDVKPGDVAVLKETSAADLLSATFVKTR
>MCA2636 hypothetical protein
MPCVAILFGGSVPLSLTVAFPGAVMKNIELASPLEMSPGARAGEITTILA
AAIVRTLGSSGLEQSAARLGFLPDQRVHTTPSQQEKL
>MCA1584 hypothetical protein
MPQAHAGRRTLRLACRVEFAGCPAISGNTRDLSPVDVSLQSAALLTPGPR
RPRPGDRGVLTLTIRGPGLPNALLKIPCRVAFVNGNILNLQITTAGLDTR
QQETFDLLFQR
>MCA2502 hypothetical protein
MTSLLARNPGGGRGKSGFASFGAKTRPQLVGSGAALALAAGEPLTVPRFG
PFLRYAA
>MCA0276 putative killer suppression protein HigA
MEIRFRDKKVRLLCESQATAVKKLGDACARKLRARLDDLEHAASVAELVA
GNPHPLTGDRSGEYAVNLAGGWRLTFSPANDPVPRHADGGINWRAVTIVC
IEFIGDYHD
>MCA2675 conserved hypothetical protein
MTVAITVEHNEARLAGTLAFLDAGPEPARLRIYGGTRPPTPATVPSSEML
VEIRLTKPAGTISGGLLTLTQQEDGLITATGVATWARLVNGNDVTALDLD
CSGTDGNGDVKLASTALYLGGDARMVSAILG
>MCA2161 hypothetical protein
MRPLTSPTALILTLLFLAEKAVTAAAAPPAASQQRCVAALQKAGAGVAGA
AARAALHCVKASVGGKLAADGIGACLGARNPAIERARGKTLAAAARECSE
VPAFGPANPEDVNAAFSEASDMTALFGAELADVLDSGGDHATARCQLTLA
RSFAAFVRTELAEYHRCVQKRLKRNQASTAADLEDCLGADSPRLRDAERK
AESSIARHCGGIAGRTAFPGLCASLPGGEPGACLIAQAHCSACAAINRAD
RLSADCRKRGNYLAAPYCDSRPQRHWSVARQWNEEILDAIRLDNPRPGVH
ARNLFHLSLAMYDAWTAYGDESKPYLTAEHPASADIEHDRRIAISYAAYR
ILSRRYSEKLALGSATSQARFDSRMARLGLDPGFTDTAGSTPAALGNRIA
AAVLAFGDRDGAGEGANYQDPTYTPVNKPLIVKLPGIDLTDPSDTGYHLD
PNRWQPLALDKTVSQNGIPLPGKVQTFVGSQWGGVIPFALPDTATAEALY
DPGPPPQLGGEGDDEYKAGALRVIELASQLAADDGATMDISPASLGNNPL
GSNEGTGHRVNPATGQPYPPQVVPRGDFGRVVAEFWADGPHSETPPGHWN
TIANAVADSPGFPHRIGGAGPELDRLEWDVKTYFALNGALHDAAIACWGT
KRKYDGVRPITMIRYMAKKGQSSDPFLPSYSPQGLPLKPGLVELITPETA
APGGRHAELVSAETGGRIGDIAVLSWPGSPTDPQTQVSGVHWVLGTAWLP
YQRRTFVTPAFPGYISGHSTFSRAAAEVLAALTGSPYFPGGLMEFAARKD
NYLIHERGPSVDLRLQWASYFDAADQAGQSRLWGGIHIEADDFAGRRIGQ
QVGRKAFAKAMEYFDGAANR
>MCA2649 conserved hypothetical protein
MTTTILALDLGTTTGWALRGGDGHITSGSESFRPQRFEGGGMRFLRFKRW
LTELKGHVDGIGALYFEEVRRHASTDAAHAYGGFLATLTAWCEHHQIPYQ
GVPVGTIKKHATGKGNAGKEGVIASVRARGHAPSDDNEADALALLLWAIQ
NHDDGQEV
>MCA1564 conserved hypothetical protein
MTHLDARAVLGLGAAAVLAVIVALTIASGRQPVSEPKGGGYALPELRDHL
NEIRSITIMGPENKTLVSLLKEEQGWKVQEKDGYPADGGKLREMLITLAD
AELLEPKTVNVERYASLGVEDVQAKEARGVELRLEGLNQPARLIVGKVDD
DNGATFIRRPEEKQSWLAKGVLHIERDPLRWLDKALTDIPSSRIMEIKLS
RPGGKSLRLFKSKPEDSTYNVADLPPGREADSTISGLASTLAGLNLVDVF
PISAVAIPPESNLLKAQYRTFDRLTIEVKAWNQDGRHYAQFSAKLDPQPA
QSRLPSEGDTPSQAPAAGKTPGSPAQATPAPDPGQRLEALDKEVERLNRR
FAGRVFAIPPDKYGNMDKSLTDVLQPAATGTRQKKK
>MCA0616 hypothetical protein
MNVASHNYEQFVCIEQWLEMLESNGVLAGRTVIVESLERFFPSRFGNIGT
CRKSKSRDKSVVKPFEAFTHGEIPTSRRTGGEIPTDVVYLLKVLKNAVLE
ALLGSDATIHGVAVNAPLTSRDLFSNRRSDRILYYYEDDAEGSWSDEQIG
KAVYSISAIQRRIEAAEGRFIFVLVPNKSSVYARFIVGRPDADRYTTVIR
RLGDAGVATVDMLAPFRGHAEKEVDFYLPDDTHVSTKGYRVLGEVISGEM
KKMAAR
>MCA0409 putative lipoprotein
MNRNTGFEKRLGITVLALLLACAELPRIAGAHGGFGGGFGGGHFGGGFGG
GGFGSHVGGGGFGQRGGGDRGEEGPRYGGTGIGGGSFGGGGLGSVRASDG
ADGAGHDWRQNANRDFGNGVQGFQAPEDRQAATPAQRQEDRDDEVNTAQQ
NREKEIDTVQQNAYKQQDAMAQYHYQQAEQLQKNRYDYYGGGNYYGPVFI
GGTWGGYYSGMAMMGMMEGLMIGEMVASVPRNSQPVVVENNTYYYSNGAY
YLPQGSGYVVTPPPLGATVTSLPPSCVTVYSGQQPYSDCGGAFYTPSGGG
YTVVMPPPGLTVNSLPGGAATHTVNGVRYYEFGGVWYRPFYSGSDVVYQV
VNSPNA
>MCA1606 hypothetical protein
MTPPTHRSLWRADFAAVPPVERRLARYALQLFRHDPQRALNLSDHEVLGA
LWRLTAPLLDPRAARELMGCSEEEVPWAAEALESETLSREEVVRDLEAAF
AQDRNEPARPAPVAERRQRCRAVFETLPQQIARRLAEADGVAPTSSTVAT
LGQALGLSATEVLILDYLEYRESSRPLRVLLRAEPGARYATRAVGRLNRG
YWVAPRRRGRWESICCFTARPAPARRSWRARWSRLPECAPTRSGVAMTTA
TG
>MCA1287 hypothetical protein
MRFAGKLGDSVETKTTTKNAKKARLWRAFLCPPGRENEATSKSRPGGGEQ
KPRNNRHITGWTAPHRPFELG
>MCA1641 conserved hypothetical protein
MRSLCRTLLHGPADAGSLRTAETFFQSVLAGRNGPVPDFGHPWRAPALPP
PDDPTERRNILLALAPVMLLDQICLARAARPATAHRPAECHLFDLYCRSI
GLDDPAVSAPIRFRARLALAGVAAPSLNDPGFFQTPGIPDFAWNLPAVQL
CLFHRPRRYFPELLGWTLAHNLREPAWWDGSESADPNDMARNRALAHAAL
EAFGARDEERIRAGWDLYRRLFAELLNQAAATADRKLPADEALALIIEAR
RSHAVGHHGRILLDGRSLDEWLTDPDPGPLLRALRESAWVDRTCPPSSRL
IRAMDFGGPMFGVFSRAEQRICLDWIQDDKHPLPLAPQAPPFVPRPAETA
PPRGHGLRSKRALYTALLGAESTPDLPGRAAGTVSRVLRLTRLILPLQRG
HRRFFPYTESGFQDRIEAIHAHEIGRYRPSDQPPQIGEAFCRWVILQLAP
TILVDGCWLAGIGTAAECLDTAGRHLLKIYADELGNGRPERNHPNVYRHL
LGRLGIELPPFDSEAFASDPRFLDDAFDLPVYLLAVGQHPQRWFPELLGL
NLAIELSGLGAGYMRMIDLLRSHGIDSAIVQLHLSIDNLASGHAALAREA
IMIYLDEMERKGGRKVSCQLWKRIWTGYLSLYTAASRLAWRLHRRSARSG
N
>MCA0489 hypothetical protein
MSRILIVLFWLALMPLPAAVGDEAADIAKVEAATCAGATVGQRLQEEIQS
HSRRDLGWRVFVEDDHRDLERSLRISKAMEARYRWRIDAAGKIEPVSDAA
RQLCAPR
>MCA1289 hypothetical protein
MLKTIDILLGISVVMLIASMAVTVLTQLITDVANTRGKHLARGLGDLLRQ
IDPSLSREMAEQVSRALLTHPMISHVGQRLGATIHREEFTKLLLDIASGQ
TPRDDKNRLSDSARQILAKMLEKNGIEDPATTLENVRAYALQLELSHPEL
ATNVRHNIALLQEANSRLVAKINGWFDQTIDRVSERFTATTRLITVTCST
VVVVAVQLDTIDIINRLSVDEGLRNALVQKAFALDQRPRDTGAPEKASAV
SRPKTPPPSIDAPLHMKASDGTTAMPGRSDEPKGKPAAAPGVLAAAAAPP
LASISTADKGRSDVGTDLEYLQDLGLINLLGSSKGWLKRWHEVNPIGLIL
SVFLLSLGAPFWYAALQNLLKLRGVLAVKDDAQRRERQSPQTSAERPAGG
TPAGLPVTGERGILG
>MCA1524 hypothetical protein
MPHILRLCPAPGVPPSRSAPGDRTEIQTSIRKIPGRAKHMKINLILAYVF
KFVTFVFFSFIILVYAGLLILLPLDIMFQVIRIFAGIGFPTVIAALVGMG
ALGYVGYKVYLAPALYKLVFDVGYQLIEFTRLQLRRFDDLIAAEQGGKG
>MCA0543 hypothetical protein
MVPGSRRVGMGESVAAPLAWWEWDLRPIVPSVGIAERQSQPFRTGVGFHA
HGFFQGAQPVFHDVGFMGLAQHVEHIGQTGAIVAVDPAGAHVAVDRGHFL
ANRLERGIHRRFPAEPVIQGLDVLEVALAGLFPFLVRPPLLPPVGHALSA
HEIPLIGCWQARVAIKLQRP
>MCA0190 hypothetical protein
MDMEDSQIFATRCQESASVTACEGINGQTDPEPLLSASLPASAPSCRRQS
TRPHSQR
>MCA0655 CRISPR-associated protein, CT1133 family
MILQALYEYYGRKPELPALGFEEKEIPFLIVVDEQGRFVQLEDTREGEGK
KKVPKRFFVPQAVKRTAGIAANVLWDNSSYVLGMDAKGKPERLKEQTRAF
EQAIRDLGLEDDAGVSAVLAFLDSKEEKAKALEAASSWPEILEKGANLAF
RLQSRSELVCQHEMVRTALTRRLTEANADPTVCLVSGEEDAPERLHPPIK
GVWGGQPTGANIVSFNLDAFNSWGKSQGDNAPVGKRAAFAYTTALNHLLA
KDSKQRLQVGDASTVFWAAQANPLEDLLADLFSEPPKDDPDRNTRAVESL
YKAPHTGASVFDDDGTQFYVLGLAPNAARIAVRYWHVATVRELARHIKQH
FRDLEIIHAPSEKPYLSLFRLLVCTATQGKSENIPPNLAGEFMKSILAGI
PYPHSLLQAAIRRIRADREITYPRAALVKACLNRQTRYSNPDNVKELTVA
LDDTQTNIGYRLGRLFAVLEKIQEEANPGINATIRDRYYGAASSTPVAVF
QTLMRLKNHHLAKLENRGRAVNLERLIGEIVDGIDAFPPILSLADQGHFT
IGYYHQRQALFTKTDRS
>MCA1573 hypothetical protein
MIASSDERDGPPPSPERPNPCADRRPGTRSPSSDIFHSLTPHPDAIDRNS
LFLCRSGDLCPHSPGHPRNRLRDSLSHPGGEHSPSPRRP
>MCA0292 conserved hypothetical protein
MWDFNLLSATRTLEKSMAFVLHRWLIYLGVGLAYIFGAIIGAGTTIGIGS
LSSNPTAFAGTGAWIGMGVVGWILYKFRNGFLTGVQARNAALLGAEALRE
PVPKGKAQIDYARRRVAERLPPPPALSPLIAAIRSVASALPGWKEPAPQT
LPQRWLLQAKGLLVTTETLTLLGYHFAQPANSFARSARDGLLLLAAHLPT
ILRNRLYLSGFGWLGCFAAFPVLLAAIQGILAGLPLDPGIWPYVFAFLLA
WTIKAAFLDAIAMAAMLDLFLKLPAPENGSELSDALARDFPAFAGICAQT
DI
>MCA2135 hypothetical protein
MIDAEDPARTSSSCPWQGIFQYPARRLRKETRRQG
>MCA1244 hypothetical protein
MGIHVPATDSKSGGADYLIMPFRPSRTLDNVLASGCAVVNYSDDVRIFAG
CLTGRRDWPLVPADRVAGQRLGGALAHAELELVRREDDPERPRLYFRVVH
EANHAPFRGFNRAQFAVLEAAILVSRLHFLPWSKIERELDYLRIGLDKTA
GPHELEAWSWLMEAIEAYQRQREEIE
>MCA1916 hypothetical protein
MVRPRSNINMTESPNPGRKSTKRRNRPAALPPPPRPTRPEKSEPSLENGF
AILDALAIERKTEQRIRRMPHDVAWLLITAGVVGLVTPGVLGLPFLAMGG
LVLWPGSSTRLERWLNGQPPRLLKGSMRQIGRFLDDLERRYPPSAR
>MCA2466 conserved hypothetical protein
MPFPTRRLPVLLLLLAPLAWGIDSPEGGDLQPVPDPPDIPSPVQSGEELE
PDVTIMRKGEDLYEEYRINGRLYMIKVKPKIGPPYVMMDKDGDGNMDVRT
TDMARSMDIPQWVLFSW
>MCA2723 hypothetical protein
MWFNSMQREEPYLALTCLESCRDAGVPSGAKTQVLHGCRQLVS
>MCA1753 putative lipoprotein
MGERCSFSGDVKNGVAVLGFSLLLGCAEQAQKSEIDIQAANALKERETRV
FLSSHPCPATGKISENGVCDGFTIGMVVPERCGGKMKASNLYWQNEELAR
EAELQEKANCGRPAADEEEDALIRTQMRRTLDQSIYNSMPR
>MCA0956 hydroxylamine oxydoreductase
MTSFGFVPILCTDNDQDGLPKEHTPMIKMTTTWLLAALLMFVQLTASATA
GETDFSGLKDKYEKDHPGKGKFSQYWEPIPIQKYWNPRNFYQPPTAVSGE
VSRDQCVACHQSLTPGAFHAWENSTHAKLDAIRNLSNGQDARFYKKEKLA
EIERNLVKQGVLKEGEPLKEVGCIDCHGKVGAQSIRHDKDLVMPDRTQCG
SCHLQEFAEAESEKDQQWPQGQWGKGHPSHAVDWEANVETAIWAGMAERE
IAQGCDMCHYQQNKCDGCHTRHSFSAAEARQPEACATCHNGVDHNEWENY
TLSKHGTVYQTHKSTWNFDVPLKDALTKGGYTAPTCQYCHFEFNGEFSHN
LVRKVRWGFNPTPAIADNLKHPWFEGRKENWNTTCAHCHSPSFARSYLEA
ADKGTLAGLKVEQEAKQVVEGLFRDGLLPGQKTNRPAPPAPEKDAPGGFF
QLFWAKGNNPSHVERVHADMWEHDLIKLYKGLVHGNPGGFTYTEGWSELM
RDYAVIMDENTRLREKSGNAPGAAAANPPAGKDDSNVRNVLGGLALLAGI
AVLLYRRKH
>MCA2875 hypothetical protein
MKLVLQIAIGVFLGTLTSQSLVEVWRLHQERAARAAAEKIEAEKAKTRHE
LSERIRTLLMRGRQEHGDRPPLPPSFVPDDTATGPAGEN
>MCA2461 conserved hypothetical protein
MRLSCFPRAKSRMDGLSHNFVFIRRTQLQTSLFAYGTLQLPEVMAAVTGR
TFAAVPAWLFDHARYRLRHRIYPGLRRETGAVTAGTLFLGLDPQALARLD
RFEDGFYTRTGVMVSTVELGCQPAQVYLIPPCSEHLLVYRGWSLDDFIRT
HASAYVRRCRRQFR
>MCA1698 hypothetical protein
MLHSGRCLNRWWVSRSRAPRSWFKNRPDAELCRLRPIGPRSVRAQHYPC
>MCA1464 hypothetical protein
MVGMPRHQRGLTFIGFVLLMAVAGFFLLLLFRLGPVYLNHYKVNSSLKSL
KSDPQLLEKTREDILKTLEKRWDINMVDSVTTKDVRISRSEGTLRVQVVY
DVVRPIVGNVAALIHFDDQIEVEHR
>MCA1366 hypothetical protein
MPLIRLFIEICLFRKGPQDVPRSLLLLWLTAAAYLLVGFILLGLEVDWLP
AVVESLAELAMLLGFVWLLLALFKKTPRWQKTVIAMLGSDVVISAPAIPL
VGWTLAVPDAAGIHLMLFGMMLWHVAVVAHIFRHALSQPWATGLLLAVAY
VAGSYSVMMTLFPPTSV
>MCA0595 hypothetical protein
MKRIVALLALGLVPAVSHAAPSHCTAEEKTVFSCSVGEKTVSVCASKELS
PTAGSIQYRFGPEGHPEIRLPEPAAHPAAAAKGATLMYSGGGGAFLRFSK
GQYDYVVYTGSGRGWKKDGVVVDKGGKLIANFRCKGVPVSELGPEFFQGS
GVSEDERAFEIP
>MCA2923 conserved hypothetical protein
MNLADLFTVTTLTAAVNKLPYATYKIRDLGIFAEAGVRTTTVAIEEDQGR
LHLVPNRSRNDAPEVVRRKRRTRRVFETFHLAEAGVILPEDIQNIAPFGE
DMTGSSLEPQARVINDKLQIMRDSIEITREWQRVGALGGQILDADGTVVY
DLYNEFGVTKKTVDIAFGTNNLDVRAKIVEGKRHAEQKLTGATVTGFACA
ASKEFMDLLTDHPKVQAAYANWQAAQDRLGGDMRNDFTFGGVRFFELDVT
VSGQRFIPAGKARLFPLGAGVFQMHNAPANYNETVNTQGQPYYSKGEPRK
FNKGWDLEVQANPLALCLFPEALVEFGAV
>MCA2310 conserved hypothetical protein
MGFSPLATGPHGFYQQVARPGAAPGSSTARPRNVRRSGPGDLRPFKNSFS
VSSRTEARRQRRADTGHPEITTLANMEMEGRCRPKSPERIRHGRAASALQ
PGEVMASPGGFEPPSPP
>MCA0315 hypothetical protein
MNNVVTNTLKLGLFLAMTFGTAARADDFDQLKATTPSERAAAQTEYMKNH
LGLTDAQMPKIAAINQKYAEQMEPVIKGADGGLGKMLKAQAIQENKDAEL
RQILTPDQFERFASSKDEMREAVKRVLRKE
>MCA1295 hypothetical protein
MLSLLVALGIGAWFFHRAMVRGRNPATAAITGIAAYYSIYMFVDWSVRLI
YGGTVHPYERHADGTVEAVAIVAGAVAAIALGLVVVQAEET
>MCA2574 conserved hypothetical protein
MSRTGIILALIVCGILPPAHAAEDEDYARSRRRLIEANLPLTPEEAARFW
PIYDQYWKELSALTARREAYYVELGNNFDSMTDEHAKRIVLEHISLEEER
YRLLRTFFPKFATAILAKKAARYYQIEAKIQAAVNAEIAERIPLIK
>MCA0918 putative lipoprotein
MLSCAATAAHAGPWGIVQLMEALAAKHSGRAEFVEKKYVSVLTGPVESSG
ELSFAAPDRLEKRTVQPLDERLVLEGDRLSLERGGKRHSVGLDEYPEIAA
LVGSIRSTLMGDRAALEKIYRLSLNGDAHRWTLLLVPSTPAVADLVAYIR
IGGSEGLVDRVEIHQPGGDYSVMSIREMPQR
>MCA2150 hypothetical protein
MADNANSKNLCSLLIVSAGLLAAQVAQADAPCVVPLKLIDINAGSGKPPE
YKLGIHVGLGGGAPQLYEFDTGGPGFWAAYTPTPPKNKEQWWGNYETVQT
DALSIAYTSGNEYTANLVDTVVALYRPQGSGFAKQCESAAPVGVAQITAF
ADKKKPKKVKAWYKALATGKPPLFGHFYGDFGAALFPIMTADKSAGVYSA
LPQLPQTGLTNGFIVHVGPLGKSKPTLTIGITDEELATFTTQLPMNPTCV
DAGGAPEAPSSACPPYPNFPVGDMPTWSEQITYANLSWEYGKSSSGNGQA
FSNVGLTLDTGAPATTIWQNDDLYVDSRFLRNPEGSTVPYTGDFKSKVRL
TISAATTVPGGHDLDFYLLTGQKATVNQVSASVRGNDGGPTWSGYMNTGL
MLYTYYDVMFDVEHGVVGFRPVKR
>MCA0935 CRISPR-associated protein, CT1972 family
MNLLTDPLFRVETPDGIERLSLPQLLEALGQDRVESLLGLQRHQEDAFHI
FLCYLAGAVLAREARSEPRQPEDFWREGIRKLTGRDDDWAWTLIVDDVTQ
PAFMQAPVPDKKDFGAFKLKARTADALDILPTAKNHDVKASRSGATSPDG
WVYALVSLQTMSGFFGQGNYGIARMNGGFGSRPAVAVYHAERMGMRWHCD
VTRLVGIREELLAGPWGYRERGIVLVWEQPWDLESSLSLNVLDPFYIEIA
RAVRLMGDGKNVRAFGASTKAARLAAGDAGGVLGDPWTPVNVADKKKGQS
AMTVSASGLSPELIRNVLFEDGFRAARMQCLLEENEGQSCLFSATVLVRG
QGTTDGFHHVAIPVPARAHRLFRRSSERDRLASISKTALNDAKEIQNRVL
KPSVIALLEAGPDKINFDRREVNLWLNEATQRFSAAWSEDYFPWLWRQAE
QDDADAARLEWLRALRDKAHKVLEEAIARYPSREGRRYRARVKAEGLFHG
SLFKTFPQLKEGSHDAPRPG
>MCA0410 hypothetical protein
MPEPIIFRRTVVRISNEEKGSGGMPSRDVSGRSRALPGAVAVALLCGCSA
TKQAREVQPSGFLGSYSSLRKGNGDGPLLVYANPAADCRK
>MCA1733 hypothetical protein
MLHFTQSPSDMAQISIILASHSSFYVWFFRGNELASWRVRQPPSLPFAHS
MITRTPMLRRSKFHSLVFLAAATLAALGLRQASAFEYNGEGKTLVNKRLK
GSYGFAAQGYFGADFDAATRLTTGAIAIRVGVFKFDGDGHCSIHSMANKA
GLQAAVTQDTSECTYEVYLDGTGKVDAVLGGKSFQTFFVLVNNGKEFMFT
RREGTGTPADQGGATLVFAIAKKQ
>MCA1216 hypothetical protein
MNKLAKAFAIAGTICSLGGMTQTAVAGPYDCGIERVTPFQIFNQSWVAIR
GENLGFMSPPPHIEICDVVTPSPFNIGWFMINPAGNWIAFVRVDNELNGP
FGTHPSACEISIWQKKTANPFNSLMYTCDSEGLLPAVLLLESGIPLSGNP
SVDDVKNALAAELGDGTKVSQFPQ
>MCA3094 hypothetical protein
MSAEPAAVSAAMAADTTAHGACETGETVRSYLRRLRFQDPALIERLAAEC
VERARRRVSRHDPAELSRRAIEEAQRRLDQALARALGVNAGREPALIAAA
RAALLLADAGVTADEVYLTASANPALASRLSRHLPQAQPPEAPASMQPQK
LRFFLFKSV
>MCA2916 conserved hypothetical protein
MNKIKLHEPLKTGDGRTLTELTMRAPKVKDLKAAQRFGGSDADVEVALIA
SLVGVVPEDLDELGLADYRRLQDSFRRFSDPDGGAVEGDGAAGPVVPVPA
Q
>MCA0587 hypothetical protein
MAQTNSNQTTASPVSSGAIDRRSKSVGTALFSTLLALSQATWGADRQTEP
PSASPTEWAGPADTKITLEAFDRARGEFADWFGNPLVKGKPLPKDYNYSF
VGNKFQLGLRMTGQPFEAFAQFQDTYIGGLPTNGVGIGAAYYASTPLSTQ
NGAFLRQGWLRLKDMFGVDGLYLNGGRQLFYDGQQGPARHKNLRWIQDYR
IAQRLIGPFEYTHAGRSFDGGSIGYLTDDLEAVGFGFMPTFGGFESNGMP
TIGKVNVAGASLSLRDSENVGNTIGRLSWYYYSDDRDILFVDNRPLAARE
KAVGRGSAIHTIGGHLAHVEEIGPGVADGTVYAFGQVGNWQGQNQRAWAF
GVEAGYQFKEVWAAPWLRAGINSGSGDDNPNDDTHGTFFQMLPTAWLYAQ
FPFYNMMNNQDVFVQGMLTPDPKLSLRLDFHALRVNAPQDFVYSGSGATN
DTVFGYVGTPTGGASNLAYLTHMMINIKPIDHLAFNLFYGHAFGQSIINN
QYDGKQGNYGFLEAIVSF
>MCA2631 hypothetical protein
MASGSYHGPQRKTENREALSGRERRSDGVRVRKAPPGNRANIGVAGEKKG
QPRTVGLGVLVEAAGIEPASANPLPSVLHAYPLY
>MCA2637 conserved hypothetical protein
MGKKTLTNSHCLLELTEKAQAGVLKAFSGLPECQALARGFDWSRDDGALP
AALVERIKHLRKEQRDPAEREALRVLRLASPRGAAILATVAEQLNDSDLI
ALFLSQDGCEIGRSVWMRTHSDESARLFDVAESILNTGDLRGNKRLHDAF
DVPCDDAPPFIWSDSVKKELETHLTTAMRLAEPCEVIHVPLADEARDGET
KTVHYLVVRFAGEQVTAVQVINRNRRSFCYFPARDATLIYAPHRKMVEVY
AHTLSTRAPLANVLSKHGFKMPLSNRPLDRSRYDLSRFARPLKDEKPRID
GAKVEHLYLIEAKALLGHATDAVTLHIDSGAELHEVIDERWGNHPFAQPG
ALLGVTLVADLVFEGETSVTPLSIVLAEPGRCSLSGEKDQRLRRVGMQLL
EALGVRKPLHPGSGVDDPNLIAQVARLLECATSPLDGFALAKLGIDIDRL
EDEGILTEGERITETVVQLDDGAPFTVKLERCADLNQVRYRDPLTGMDVV
LPAKLARRWKVQLNWLREEIITALGSALKGVRGRHLDDEPVFLGEMDIDG
HAVALYFAAKMSSERQYARVDAALRLRPRAVPGIVLTTASIPFPFAGTNV
VIPIEDVLSDGGDGSAIDTTRLKVAYRHGQLAAMGGTTVSLKVSADGYSA
TLYLPGKAPWKVTNKAKIAVLQRLVDAYTAGTPHVNTKKLMEDTGCASPA
NLFSKNSPWRDYLVKVKGAHAWQLNLPTLDAPVDDDTEVETEEAAMAG
>MCA0482 hypothetical protein
MPFRTSSVPHDFLLDEQSLIDGKKWKGIRFGGCRSGSARCLIVEPCYLAM
SMRLYQTSMLACTFLAGCAFGNKQDYRQAQPSLAFSGVQKIAVGVQDQRP
YVVNGNKDEDFVGLQRGGLGNPFDVRTQSKRALAEEMTNLLADALKHSGA
NVTPVRLPPSLSRDQVLSTLLAEKPDKSLLVTLYEWKSDTYVGSEIQYNV
DMQAFGRSGKLLAEKRIEGIDQLGSAFWYPQGVAQDGAPAAFKKKMEEMY
SGGIAKAITQTGESQIGLSNDQSPEDKYTRLAKLRKLLDDGTLTQQEFEA
EKRRILEPD
>MCA0803 hypothetical protein
MFGTVSEELDHGSCTRRKDDGGRGDGRGQGDGARNDGADGGYGGHDESRC
DARRPPPGPVVAQRRHGGQGCGGGRGSDGGDAHRPLAVQAGVHPSAGAVR
PGHGRRLLHSQIPQGNHRLGHERSRRRQGFHPSAAGKSGRPAGRTPGGVR
AG
>MCA1495 conserved hypothetical protein
MVTPTDGMLAVLYLRQAELPAVLDLIGERGLGAVAFGAAPPATLPPSLPR
LGIAMPQRLPAEPLVEIWLGVSRATRGRTGAIDWACDGLHLFGSLQTEAG
ADIRGATGRAYDALFQLLERHGYPHLVRVWNYFPGINREESGLERYRQFN
IGRQEAFLRANRPVSGALPAACALGTHAGPLVVYFVAARIPAFPIENPRQ
LSAYRYPPAYGPRSPSFSRATLLRWANRTAVFISGTASILGHESVHPGDA
AAQTRETLMNLTALVETLNRESGGAGFGWKDFRYKVYVRPGCPLEPIQCE
LAAMLPVADDVPFVEAEICRRELLVEIEAFGSRTP
>MCA2756 conserved hypothetical protein
MFFNYRGFLKALRLALFQRPFRLRRWFYVLLFSALYLAFVGFVALGRLLD
HVFFPGFRKTSVERPVFVIAPPRSGTSFLQRVLCADEQRFVHWKMYQTIF
PSICFQAVFNGLAWIDAKCGGVIRCLMQGCERKWFGGWDEMHRMRLDQPE
EDQALFLYAFASEAIFMLFPFVEPLWEVGFPDALPPASRRKLMAYYRSCM
QRHVYANGGGRTLLVKSTHASGAIESIAEEFPDARFITIVRHPDEAIPSH
VSLFVPVWQTHSPEIERDGAESKAYAALAVEWYRHLDRFRARVEPANYYC
IDYRDLRADPGRTVAALYRHFGWNVTGSYRAVLQDFTERQRTFQSTHRYS
LEEFGLSKQWIRQELGPVIESHGLDGEETSSQKPQTESLGQTLVPGGHRR
SSAFGQQAVIRGMQEPQEDQGEPVGHHHRHRLRAGEALDGDDDGQGDVAV
GCAECDHSPAL
>MCA2663 conserved hypothetical protein
MQNIFENPAFSMSALTAAINLLPNNYDRLGAMGLFVDKPQRFRSVIVEEQ
NGVLTLLPTMPPGSPGTVGVRGKRKVRSFTIPHIPHDDVILPEEVQGIRA
FGSETELQTVAGVMAQHLQTMRNKHAITLEHLRFGALKGQILDADGSVIY
DLYNEFEITPKTFTFDIADPGNGFDVKKACLDVIRYVEDNLQGERMTGLH
AFVGEDFFDALTGHDEVKAAYDRWQDGQALRTDMRAGFTFAGITFEEHRG
RAVAPGNAVRRFIEPDEGHAFPLGTMDTFATYYAPADFNETANTVALPLY
AKQEPRKFDRGTDLHTQSNPLPLCHRPALLVKLVMGGV
>MCA2026 hypothetical protein
MTSPPAVQPCPDPPRFGHLVCASIVALGLASCAGMTPTQQRMTSGTLIGA
GSGAVIGALAGNAAMGAGIGAAAGLAGGYLYQKYKENQAATYAEGYRQGQ
ASTQSKKKQKKPQPQ
>MCA2832 conserved hypothetical protein
MKFLPQWVRRSWCKAFAVVLALTGLYALLGFLILPKLVEQFAPELAARYL
KRQLSIAEVGFNPFRFSLELRGFVLAEVSGEPLLSFGRLLLDFEPASLLE
DTWTFSEVLIEQPSADLVVDEEGRLNLARLGDELPKSPEAEKGEGQPPAL
SFKHLSMVGGKVRFADRSRTAPFDETFESIDIEVADLSTLPEVQGTQHLT
AKFRERAVLDWQGRVSINPPYSEGRLRVGNFPLAALWPFVKERLALAEPG
GELGGVAHYRLAEGPAGLALSVSDLSAQIEGLKLTPEGGSEPILALGRLE
AREADFDPANRRVRMPKFEIREGRLRAAVDEAGEIDWLKLFRPKAAAAAS
AMSNDQPEPPWRIEVGAFGAAKIGVDYADASRHSPIRLGIGALDLAFAAA
IEVGTEDWKAAISRLALRLDNIALSTAGASGQGDDLATLESLSVEDANLD
LAKREAAIGRIVLKGGGTRIVRDTTGGIMPAAALAPKSAGSRTPRPSDRG
AWRYAVQQFVLEDFGLALADRTFAPAIAYDLENLRLSLGPIASDEGRPMS
FDAAFVVGQGGEFKASGSLSQSLDQAEGTVRLGRIDLKSLHPLVARFSGL
ELERGDLSAQLEFGYRRDAGTALKAKGDLSVGGLLLKETATGKRFLSWKN
LSAEALDFGLAERRLAVKQVRVAEPGWNIEIFENRSTNIQRVVAGGRPAG
AAPKPVGGRRRSEKPEPWLVTVEAVRVEKGDIDFSDGSLVLPFATHIHDF
FGTAAGLSTRSEARALLQFHGQVDRYGAVDVYGQLSPLAPKHFGDVSVAF
RNVDMPSLSPYSATFAGREITSGKLNLDIRYQIDDGRLRNDNRIVLERFA
LGERIESPKAVSLPLDLAIALLTDGDGRIDVSVPVEGDLGNPRFDYGKVI
WEAFVNVVTRTVTAPFRALGRLFGGDGEEIGGVLFEPGSADVGPPEREKL
NKVMVALAQRPRLSLEVHGGVDPELDGRALKSSQVRRSVAAKLGFRWSPE
EEPGPVSFDSAATQSALEELAGEQGGPNAVEEVQSAFARKEGRKAERIGR
ISALMGRPSPDSAFYRALFDHLVETAPLSAQDLATLAGRRARAVVGVLTQ
RGALDPGRIRIGETTSGDVRDRRIVTRLAVKPGGS
>MCA1568 hypothetical protein
MSGRQKPSRRRYTAEFEEQPVKRMLWVASHGDGGVGVGGWRRADSQRAGA
PSSAIRAWGVGRGAGGSECATRATVGQREEDRLKKTI
>MCA2666 hypothetical protein
MTALVLTRSHTHAGKPYAPGDRIEVDVATADWLIAHDIAKPGPAAPVANP
DTTPEPKPLQRKEPKP
>MCA0564 hypothetical protein
MAIAIPVVDNWYEERDSGRQFRIVALDDTGDTIEIQYLEGDLSELDRASW
DQAAIIEIEAPEDWSAPFDDVETDDLGYSDPDLHLPDTDDLTLDDVLEDE
DKEPY
>MCA2240 hypothetical protein
MDDRCFIPAEMGVVMVPVETVGMAVVVFVAVTMVMLVTMTMVVPPGQSAM
AFDEVSPEEGGRDNRLFHGGTSDEGCSL
>MCA1628 hypothetical protein
MGCLQDALFQAAIADQVNLDRGGFADLGDVRPPVAIGDRVPGVGDDDGLA
GHAGAAALHVADEAVEDVGDAVPVSAVKPVMVAKFALPFIAPGVAPETAP
GEPLQGDEVRHIVAGVGRFAGVAVDLAIAAGQDFPEPVHVQFATGGCRGD
GADGGGMFAGVFFVACFHHVSS
>MCA0925 beta-ketoacyl synthase domain protein
MTLLQCHIEGLGAIGPGFADAEEFRRLVEAGRVDPSAPTPVPPPVCLPAA
ERRRAGTSIKLALAAGLQALEGSGRDPATLPTVFASSGGDGDNCHIICEA
LASNDRQISPTRFHNSVHNAPSGYWGIALGATAPSTSLCAFDASFGAGLL
EAMVQVAQSREPCLLIAYDVPYPFPLGRVRAIGGAMGLALALAPDATVRS
HARLTVGLGEGPCTELGDPGLEAMRRSIPAGRGLPLLRALALSEPCRVTV
DYLDPLRLELEVAPCR
>MCA2498 putative cysteine rich repeat domain protein
MRDSSHHPINKEHLMYSARYLFAGLLLAAFMTQPACAAKKAETKASAPLE
QAQVDDPVSTFKTGCKAELDKFCKDVMPGDGRQLACLYAYQDKLSTRCEY
AIYDAAAQLQREVNALSYVAAECDDDLDKYCANVDPGEGRLLACLEKNEA
KVSSRCEQAIQDTGLNKVK
>MCA1324 hypothetical protein
MRRNISLLLLVLFIVGLYVLQNKGVTPFVMKVLESDLFDMKEEEEEQLGK
VKTPRTDFAYLHCKAAMLEDHVVPENSEFLDDKYEAWALGGRNYILRSEV
LVDSPQGRIAQKFVCKLRMTGDDQANPDDWSILGTELNPADGGE
>MCA2973 hypothetical protein
MIAITTDSKPRIGKNQVGIAVPTSDGKARTVIRFFPSVSLIFLHSQAGHV
VSRRRSPLDREPKSLNRRIYKGKNDTPTVA
>MCA2657 conserved hypothetical protein
MGISIRAYARHRGVTDTAVHKAIRAGRITPEADGTIDPEKADREWARNSG
PPNTGTRTKVPKVAVPDAPGMGSEGPAALPTGGASLLQARTVNEVVKAQT
NKVRLARLKGELVDRNQAIAHVFKLARAERDAWLNWPARISAQMAARLGV
DPHTMHVALEAAVREHLQELGELRPRVD
>MCA1084 hypothetical protein
MRKGKRELVGYPQDMCDPFKLSGCCDGFMTPQRDLSIPCLTGKTTRTTVR
LGLYFVPFFFSAMSEIEPPRYVLVKMSFGAMKTSLSSLRCAGLVGLAAAN
LSGCLAPTALDHAVLAYNEAATDIIDKQILLNIARAARHQPIHFTALSSI
AATYNVTFNAGATPALTGNSGGLMMPVFGGSVAENPTVSIVPVEGEAFTK
RMLAPLQENKLTLLLRQGMDVDMALRLMAQEFRTFREGEEVAYTNKPSDR
TGYSFFRRLVLHLSSIQDRNRLYVEPMFFTQSWTLPASAITPEGFAALEQ
GYEIRFEPATNSYLLSKKVTGRLMITNYDPDNLPEAERIRLNREAENSPP
NEIAVDIRKDYPGGEFPIHGTFRLRSFQGILDFIGRSITEEPEYDIPPDP
RTPPVRENPASTVDIVETDSRPSDAGLAVEFAGRYYSLRPETGHQWNLEA
FRLLYQLYQMTVTDLPTFGTPSITIAK
>MCA0525 hypothetical protein
MRFETIIACLPLPLVPVMASGAALSHHGGLILDCTPPLFFEESPANDSSV
EIFDRFSFTASDNTDGATLKIWVDGKPVAPTITPQRSGRLTVEGRPDAPI
GQPGSRVRILVSGFSKDGCERSHAYYVHVRPKG
>MCA2617 conserved hypothetical protein
MHLIMELETEMSEKDLLETLEQLRAQVAALEADGSSKARLESLIGTLEQR
LQGEGSEAHHGPLVAELKEAIAHFEVEHPRLTGILNDLMVTLSNMGI
>MCA1261 hypothetical protein
MKQPMSYLRLAVPALCLLSLAVPPAEAKKADDKYNRYITFQNDFPFTVYP
VIQVPADICDGAAATGDRRIIINYDAKTEGLPAGKKITVWIPRDQKQVNV
GGKTVDKNCWYQSGRVYIFPVALKTYEANIVKLDPLQTKVITKFDDPTHP
SEVVDCYKGDNTVEGEAVKGNCMTGVSGASYSVDAPAQLAEFTFDADNPT
DQNMDTGDPMADIDVSMVDELYIPVAGSVANHGATGYMGGSAATINANGV
NDLAQFKKRIDGFLTDPDGRGGRKNVWPVFAAYTSQYFNSQNKEINTFSD
LLPKELGKNAGNTIIPHFPGGYNSIDMTLTRGPSTNYYTGTNYVTDKGTL
ITGVRYDEKMKKPGNPLVQPYIDRWMYWVKQRDNPGFCADKTNLEQLKWP
DHVCKYDSKKAVQDCTFRHQYFCEKFQESVKVVWDHFTNDPVDGFNPNQA
KVWEKCGFPKAPYPTDENTKNFCIIQQIVGYDSKVLGGDLPGRVQALLRG
VAFEANDPYDPNHPSVPKADVQQWQFDPFLTFAAPYDSQFNLNPYTRLVH
DSGKDGLGSVSYSFSIDDKYGNFRDAASGFTVDAGGVTALENKRPYDPYQ
QYTVTWGYNRDQFSLAWLNTGTSIEAIRAKLEQVAESHGKRPFLLREGDK
LAVLGHDANGAWQLTNPLTTKTDLEALAAQEKENSKGKNHTYQDLIDRVF
LKKPADVFPAQALNVDAVSEQAIGVLNFDVDTGWSAGEARLYDQIALKKA
DVPQANNWVSLTTCGHQVPVRGPGAQTLPLVYDTAKGYLPCEIVAKDKFG
DELKLSMAPEKKTGIIDIYTGSKVTDWTTLWGFPTGKKHTGAPPVTSDLN
AKDLQYCIDNSSPYFSVSGFCNNVNVSAVWAGAPLARDVVYMGLDYVNMP
RVGVSIAQPPKNGPDPDAVFWPNGAKVTGELMPDGVTVHVTWPKAIITSG
KPLNYSLTFGGQFQASCNTQHNPDINPNTYCDAKITKGDKTPKKMDVVAI
NYTGTPVTQSVLLSGTFTSGVRP
>MCA0125 hypothetical protein
MVGQEIRKASKVYCVNLRLYCTNCGIVKNQVQPSARGGAGRR
>MCA2659 conserved hypothetical protein
MTYTTTQLDALKRALGSGERRVTFGDKTVEYRSVEELQAAIRTVEAELAR
NAGESPTRQIRVTTSKGF
>MCA0934 CRISPR-associated protein, CT1973 family
MTHPAPADPPETPSTSLASLIGHLAATIAAEHFPTGDRAALRRLNPDAPP
NLAFYRFAFRHLPQNWENRRTAWTALVAGIALMCPKPHRPDRSVGLTLAE
TGYSEKRLERLLAAEGDTLHTLLLRAARFLAAKNESCNWTDFAHLLLDRN
PEKARLKIARDYYRNLKDHD
>MCA2320 conserved hypothetical protein
MPAHRYAMNAGAPRETPAVRTAVVSDDIADLACIYQPDVNLCLIRREPEP
AIERFVGELLKRGEAIESSQALSFEHFDFSSLLPEYRALEGHDPWWRDVA
RLTVAFCDLFETGSVGLRLRTLNQAMCPRFHVDFVPCRLVCTYGGLGTEW
LADDEVDRSKLGLAGSKGLADEDSGLILGRVRAMPAHAVALMKGATWAGR
DYPGVVHRSPRPTSEQPRRLLLTLDLVL
>MCA0761 hypothetical protein
MTYEMNWLEITVVMLVVLSAYGVLRLCFGQVFRRARMERQEG
>MCA2307 histidine decarboxylase, pyruvoyl type
MLLAVLDGLTPVFSGVPARNDPPAGLPRVPPALSLPEVVKGAVGPFPAHS
DGYGNPGASGLGYITLITLHTGQTRRELAITGQTGEGLDGTLAFDRAEAN
GAYLGQINLIVASSFVGVNGAIWGYDVAKAAEIGERKLFEIGEPKTGGIP
VYPADPLLDAAARLLGTRDAPRFPLLPGAQVIAAHKEITAPGPATVWCGV
AIAIAAHRATDASAIMELCGKHRGRSGRNPPGEHYFRLIRRNLAKSVLRV
GANQGVRYREIFVAVKHESVPPGSIGHAMATAPYIVLANNAVPPGGPEKL
LEMGIYDWERTVQERP
>MCA0785 MxaA protein
MVWSLLPVAALVSVPLHGATSLSFDTPRAFGYVIGDLIRHEVRVETDAGQ
GIEAASLPKEGWINRWLLLRRVEVRREGRHRILTLEYQTFYAPLEVKNLT
IPGFELQLAGSGERLAVPDWTFTTAPIRELSVLRAEGPSMRPDAAPAPLP
TLGPAAASVGSGLAATGALAWWAYLSAWLPFVSRGRHFAEARRVLRDLRG
LGDSREALRRGFSCLHQAFNRTSGEPLFIEGLDEFFRSHPAYDLLRDEIQ
DFFLASYEVFFGEGAPAPSFDLARMEALARSCQLAERRRP
>MCA0788 MxaL protein
MSIWRQRVADPVFAGLIVALLLAVAACFPLRLVLERLVFSHIVVVDITRS
MNVEDYRRGARAVSRLEFVRQSLIGAVADLPCGSAVGVGVFTEREPALLF
EPIETCAGFSAISAAIEQLDWRMAWAADSLIAAGLHNTLDLLGRGDADVI
FVTDGHEAPPLNPRYCPDFSDLRGKVRGLIVGVGGLSLSPIPKYDESGRR
SGVYGEDEVPQRSSFGLSELPPEQIEGYHARNAPFGSERAGGTEHLSQLK
EGYLRQLAEAAGLGYHRLESPEGLGRALTAPALARRQRIATDVRWIPAAL
ALAVLMAVYLRVLLPRPGFSTSN
>MCA2163 hypothetical protein
MTTSAAGFAGAKIPAGHPPLNPTGASAQMVPAELAQKATVIDVINVAQYT
YLEVKQDKQSQWIAAPSVEVKKGDVVRFDGGVEMKDFHSKTLDRTFPSIV
FVNRVVVGEK
>MCA1707 hypothetical protein
MTPRSVLNKAESAAYPDLSTEAFDKNVNVKPFPFKNLRAIRYDMRDLDER
PDRLKCRERQES
>MCA2667 conserved hypothetical protein
MSTYASFQGRVFLGKRDIAGLPIEVRSPGNVAELKLSLKTDVLEHYESQT
GQRSLDHRMVKQKSATVNLTIEEFTKENLALALYGNHVTGSGGTVTGEPV
GGAAPVVGDRYFLAHPKVSSLVVMDSAGTPATLTLGTHYTADTDFGALQF
LEITGFTPPFKASYAYGVATEIGIFTQALPERYLRLEGVNTAQGNAKVLV
ELYRVAFDPLKEISFISDEYNKFELEGSLLADSTKPFDALLGQFGRIVQL
>MCA1639 fatty acid desaturase domain protein
MFPLTVLGFLASGPHRPLVALLWTLPMLLLLLAEYFGPAETRVVPEPVPA
SLFDALLYLLSVLQLLNLFALGRMVSQLGWSGMAEVGASLADLLVVRTMV
SADFVSAGICPAHELIHRRSAGQRHLGRLLLATLGYDHFYLAHKLGHHAR
LGSADDPSTAFRGESFEAFYRRGLRQQWRIAWSARRGAVLAGMAFELGWL
GVYTMLFGPLAMLVLLHQARGAIRTLESVNYFQHYGLTEDSGPGRVLAWR
NDSAVSLFMFLALTRHADHHRRPGVSFWALRPAAEGPQMPYGYMVMALWV
QRRNESFRTWAERQLDLGRGGVEPRLPKADTASGAIRTAFPGSGKRDRRN
GAGRMPAG
>MCA0279 DNA binding domain, excisionase family
MPDEEVMKEDAGEIFTLDEVAAYLKVGKRTVYRLAAAKKIPAFKVGGTWR
FRRQEIDQWITEQTEKGWQGREDYHASPGPVSSEQKKTV
>MCA0347 hypothetical protein
MIDVVGRGVEVPRFRRGTTCRQRIAGRITHHGHGRSGTVGIAQGGDREAR
GRLQQPGLDLLLAAVLDCRHVIHRLIDAGVRGKERRLPAGLDPERETGVL
GTGKVAVLPRHCPSAIAGRMNGAVTGFDLAEPRSHRGKRGTPVPHHGVLD
DLDGLARNRLGDAVQSMPLRRHYDDLHDREVLLAFAAVMGNDRRRIDGLA
FDVLHRGAVRESLEAGMAEHLAGAQGDSCAQQGQRQKFVRIGHCCFSLLC
VLEKAKPTE
>MCA0245 hypothetical protein
MDRDGNPRLDGKGRPRLIESKYGASTTWMGKLPYRVNAGRHGGCPVLAVG
ITPELMTDREEYDREHA
>MCA0421 cytochrome c5530 family protein
MVIADYQFGRPSPTRAPRTTIKNPGGTMNTPRPATETRRSRRLLLGVPLL
PLVLALAPQAVPAAAKIKVAKTVWSDKAGTLTVAGKAKGGSGAIDIYDIN
GRWLGSGQGDGFALTLSRSDLAGVPCAVRVRSGDAEVIKAVKGAPKSCAG
APTCSIVNPTEGKAVQAGAETAFEATASTKDPAAQPLKYEWDFAGGAMGE
LIAGSNPPAYKRPDTLATTVAFVRNDSHYRVRFIATDAKGRRCEDSVEVT
VGNPPSGLPGKVAEQPAPKLGGELDGIRGDVVVMPFEEWTYQNLSDMRYG
RNGWGSATPVVNNVRAYAFRKDRLPVFLSGSDVELRYSAASNPSDPVGGD
SINSTSRNWPLGASLVDAALQKTDVWELPDRPASQKSANYYACSWAMEGA
WGAMWDCASALGKPEADEGYFIAKKDAAGNIVSDDRNTDLQHGAYMPGKD
QPFESNTPQPFSQYVAATQSDGKDKAASWFAANMLPFTDVDDRGRVNPYP
LLRVEAVAKGGGSVLARTDGVVSASRDFHCRECHAKGGIAANPNAPRTKA
AYGSSAWGKTALQIGREKGPSHRYYLPDDKIPSQPELFSVSDIGGDPNNP
FDAEYAAALNYSSMHQFYDAYPFLYKMLYGVEKLNLSDSDKTDPTNIAHD
NPMPCYGCHLSPLAYVPYKMDMAWYDEEGFDINDPAYAPNYSISMHRFHG
ELQWNDSRTDIVRDDKGAFVRFDWKTKGQHNSTRTGSLFPIFDDNGKQLP
MEQNCLRCHAGHREQQYRDRMATAGVTCYDCHGDMLAVGEAFPKNYLANR
DKLGSIDRDDYRVPWFDQPDCGSCHIGDGNQGADKSGGFFSAGVMKRAFD
DADFSATPRAVDRSDPNSRRFSAAPLETYKAVFPTTYAYGYDPVGKMFLE
QSTETRIDVPLFREGKDRHGNVACAACHGAAHSIWPNRDPSANDNVTALQ
LQGHTGTILECNVCHTADSFARKEDLDGGQYSGDAKAGILGGPHDMHPVN
DPYWWKGAQGDGANSDGTTYGGWHNDYAKLPGMKGEDQCAACHGNDHKGT
RLSKTPVDRVFDFRGFDGKKLKKAGFKTRVVKVAAGTPIGCDTCHSLQTS
FIGSPGH
>MCA1951 conserved domain protein
MHYYTAATFSTRTPVMKLRSLSLAIAIASSYPAFADRAEPARYRPVAPLN
TAGVPQSGFLGSGSRPFELEVPTVVADLTPENPGEFKTGTVYSLPIPVTP
SLSYWESVAGGHASRIRISTANAVGLRLHLVFGSVPPSLELRLQGSEDVT
LPAPIASTEIHGNELWLPVTRGSDADLEIFVGADVSPEALDLRLDKINLI
LVDTSGASTSGFSAQSAGLAQYKEYDLACWSNDPAYPALQTAAAATANIH
FLRKGSSYLCSGTLLRDKGDTQTPWFATANHCLPDQTVADTASFEWFWQA
IDCNAYTTDPRYGQTFGGAQLLWTEFNREPSFLRLRNPPPADVYLSGWDT
TIHVGDPVWGVHHPRGDHTMVSKGKVTALQKTFTDSGQGGTHLLDEVQYT
YGGTEGGSSGSGLFAVSGGNAYWKGSLFGGSENDYQDSVYSHFAGYYEQI
KPWLTSCALPWGGSIPGGQSVTAYQKPTATQCAAVAEVRTCTDGQLSGSH
TYETCTYVAGASCTLPWGGSLEDGRSVTAYQIDKAVNCASVAEVRRCSDG
SLSGSYEHQTCTNSADSASMTVVHPAGGETFTAGEVVPVQWQLTGYGSKA
KVNIALSKNGGAKWSLLKSGARNSGSWNWKVRKSQATSQALIRVCLPMTR
KTPAICDVSDAVFTVQK
>MCA2570 conserved hypothetical protein
MLPTHSHADRRELNDEVHARAYENLFAPEQVSQLIMLVTPEDRANEAAHI
AKLCRAWRVSAPEAGGQIYFDNGEVRVQIERHQEFTRYRFSRRVDGVGSF
ENSLCNALPNDWLGKLPGALLVAAHVELVPHQDESRFRSEEDLSRVFEGN
RVIGAKIAGGAGRAYTDFRIHADGYSRFLVVDESLSPGQAGRTVQRLLEI
ETYRMLALMAFPEARKLIPRLRAADHELLRLTACLARDGGETDEVIQAEL
SQLAAHVENLLSASYSRFDASRAYHAIVNTRLEQLREQRIPGFPTFTEYL
TRRLAPAMQTIVAVAHGLEQLSRRIANANQLLTTRIDVKLAQQNQSLLQS
MNRRAKLQLRLQQTVEGLSAVAITYYGVSLANELFKGLKAAGLHFINVEF
MTGLSVPWVGYMVYRGLCRFRETIEQDELSEPK
>MCA2924 conserved hypothetical protein
MPSHTDPKVISDVLLYEVGEQWSRDRVTIAAGADLALGSVLGRISASGKY
KLHDPAAADGSQNAVAVLLANAAAAGADVAAPVIARGAVLDANGLVWKSG
ITDPQKATARAALLALGLKVIDSV
>MCA2676 conserved hypothetical protein
MAAPESQQTDLLFDRPAATDANLLFGADFAPPRNDLRVLATLPVPAVAIK
FIPPARAELLAELPRPTVRSLVLRPSVPLTLASSLPGIVFTGEVRYYSRM
QRPTVGEIRHPWQGTRATEEVATQPQQHAHATPAGWSGLWEGASGAPQGI
AHRLPHVLQTAHQQRRAGHQDASRLQDATWFAHQDGSPLGLVRFAAFERA
TGVRRATWFRHQDGSVTCRAGRASGWQDARVLVQHQGSDHQSATPCPKGW
RGRYQNTRRPPPGISLLVIPRPPQPRPCYTPSPHLLFAGLAVSQGDLLFV
CENHIDPPPPDGEPVVVPVRRVYFVVNNVTLHRLPDGVPVPVFNLSLSLD
VASWAWGFEAQLPAKAESLVAPGNASGPVELVASINGTEFRVFAENISRE
RIFGEASIRVSGRGHNAVLAAPYAPVMTFRNAEARTARQLMDDVLTLNGI
PLGWTVDWGLTDWNVPAGVFARQGTWIEALTAIASAAGGYLVPHPSDQRI
RVRHRYPVAPWEWHTVTPDFVLPVDAVARESLRWIEKPAYNRVFVSGQDV
GVLGQVTRAGTAGDVLAPMVVDALITEAAAARQRGLAVLADTGHQIEVSL
RLPVLAETGIIEPGAFVEYQDGSVTRLGLVRSTQIEAGIPEVWQTLGVQG
HA
>MCA3048 hypothetical protein
MKKPFSTHILSAALLLAAPLAGAADYPPDFKPSVIYRDPSLTGKPAAPEA
APPAQPQAAPAAPARPAPAAPAKTEAAPEPQAAPSSPAKPAPDSGSDYYL
FGGVVAALIGFVLWSSRRPATAHPAATAAAAAPAAAPAPAATGVAKYLQA
QGLATGPETGVAKYLKALPEPVRTPETGVARYLKNLPLPEVAAAAETGVA
RYIKNLPKPAVVATGETGVTKYLKSLNG
>MCA0725 conserved hypothetical protein
MIHHVSLPARDPLHVAGILAEMLGGRAFRFPGPLADAAMAVNGDPHGTMI
EVYPDTVIMMPGEGDAPVSYAQSPADRQFVPFHVMLSTPLEHAEIEAIGI
RAGWRTKLFGRAAPGLPPAFNVIEVWVENRTLIEVVPGSMIGVYEDYMQL
ERLDALGLAL
>MCA0236 conserved hypothetical protein
MAEVPASPMESAFIKELVKQWRAQDSYGTWEGKSDEQLLDPYILDKEKRA
SIPIIGDPDPETLWRLELFYNAVGLSIERATKVMVTPMMKMSHEGFGRMV
LIAGRLIVVNKQLRDVHRFGFPSMEKLAEEGEKLVNAGVEMIRRYPDVAN
YS
>MCA1395 hypothetical protein
MRHVFWRTEQVRREMILEQTAHGGSLTFFQILGGSLDVVAQGAFGQQRGE
RQDGRRDHEVLQADSAGMQSDCQCRGVGGDTGGPETVEHRRMFPQTVVQC
MRGLFDFWPELARIGRDGLRETVEHGGLPVADLASQRQPPGPGFRGSLCD
LGHLLAQREHDVEGDLLQYRTFAAQIGGIGPPPPSELPDRPGHDPPHLRQ
HASHTGNFQQGQQFGECALEGLVMVGVVEIVTAVEPACRTDRPADQGSPL
LLGLLFDADRREPAFPVRPHESHQSQALARAVRLPEQGPGGFDQRIHAGR
LGSVPSQIGQGDDGRHGVTPNLAPLSAQRAYHRLCGCRRRLVQADEAGDA
ERLDHGQQLPHRLVGIFVLAEHANQKIRKRDGPEQAAAVAFRIDRIEIRR
VPDGDGKAQPGKAGRHRFETEKPEILGGQVGDASCLQGIEVAVQCGHGLS
LAAVGESLRMPGESCQGACHGGILAQEPVDHRALAGLGGADHGDLRQHRA
RSEPTQPEVRQLERVQVPAMIRGPRRPGLRQMVVLAHVASFQAGEKLPKR
RPAGTVAGIHQQGITACALLRHQRGLPAAAIREAGPTVSGFR
>MCA0210 putative nifZ protein
MIDQRAPIYQWGQKVSTEVDIFNDGSYPDHEPEALLVAAGTEGEIVQIGH
HEEANIPVYLVEFPGGYVIGCFEEELRSERKSVQVAGLL
>MCA1520 hypothetical protein
MGKRRHRADSCVTRTGSHAVRSARSGNGRGGMLGIADGAVTPAGAQIRPE
RKDAVRSAGGVPDTA
>MCA0239 putative nitrogen fixation protein nifQ
MLARNDVFSHDPLLAYAENPSDPLTLAFASAIELARSSSRLAGRGCFGLD
PTEFCTLLDRYFPGARGVYVKPAGGAGRAPADEFDSLLGLLLEHCRGDAV
ESRWLAHAIAACCLGDDHLWQDMGLADRKTLSDLLARHFPALYARNDGGM
RWKKFFYKQLCEREGAHVCRSPTCAECDEYANCFGPEEDDAWQPVKR
>MCA1668 hypothetical protein
MRAIPPQTSMDNKGSKASEDRLRPDFGGSMPARPILLEPRGSSWLRVGGM
LFLWLTLVIMALILWRGEALIKALNMSPGERIALQNPDLQQYQNELDSLQ
GRVAGLMKDSVENKLRTLERSVQNGRFGADEAELLEALKYELRFLQSYAT
QGGQLKSAGEHERYRMSSEPAEDPAAARTVREWVEFEKLLYLGAGALGAG
TLMLGGAWLSTRSVQYLPSRQDVFLTGPAKDDSAS
>MCA2462 hypothetical protein
MAIVAIPELGDGLPHLFEVAEEAAMNGLFLQRPVEAFRHSNRTIGTASSR
ATGVEVRGEKYAHREELSRAIDGRMVIWVNGIPERRDVHEGLPTTHVDIL
CSGGGYKRSSSSSGEPIVRSWA
>MCA2952 hypothetical protein
MTEERRERALKLLEAAVAAHGSFAAVAKLLGLNRATISTVARRCYPGDDA
KVLARVLEQFDQIACPYLRRPMAPEECRSVWSGATPSHDPALLAHRRACR
SCPHKGDGHAHS
>MCA1038 hypothetical protein
MAFDYKKMLKRELNVGLKDRQYRMMGGAAALLLSVFLGNIFLLLIGIVLV
ATAFMRWCPVYSGLSKSTVDPNEPAPDSGSHSGTESH
>MCA2796 conserved domain protein
MSRTIHLALILAIALLGLLPAKPSSADPDFSKIGDILSGKRRLFPVDDLI
VTQMSPTGLQVQTIVQTKNGGLGSQQSYSNLPNPGSYATGIGRMFKLPRD
VLVTVVQDSVVIQDQNPNGTVTRTFPLNVSGTPNLNQFPRADFTGDGYAD
FAYLVGNNIYILTAKNVEKIDDGLFYSEAGAAPFDLTNKWAVLAAGDFDG
DGVPEVALAAAQNNNVTVTVTHDSQGRLVSISLAPSGSTAFTIPSSTNLE
MAMVAGVYSGAVNPQTALPLSDLVLMYKYSLGDTDHVDLRSLQVEVASTN
PTVYSIAVADTESWGATSNGIWKIAMVSDNLDFFGDSEQIVATTTETPGE
YAHVAVFTLDGNLNIYKPREQIFSLLEHDQEGNPVYSAGIYSVAVGNFDQ
EIEQNQPIQLEIALLWYKGTYNPPPVHQPVYQAQLWLFHVDPSNDYALSP
VPGGNVQFTGPQGGLSHLMAGDTQGRSALLGNPTKLTSTFTQPLTILSMP
PQHVDWITPADGTQPEVLNLSGHKGFSSTYQMEQSQSNQSSSESTTSFSH
SITESISGSYSWGVPDVSTISITVKAGSTQQWQDSVAKKYNKYSKYAFDL
SVQTTIDDDIWIKDETQNIYVYPVIGQYVCPPKPGSPTPLCSPSELVQLN
VMFSGPSDVSYAHLDGAKVEWYQPVHEPFNVFSYPWSLQQLQKLEPNIDL
LTSNSPQLFATDSSQQTQKTNWSAGQGSTVTVGSAHNYSWFLDLSISRNA
KVGAGGSFNFSYNGSKSISNLNTSTISVGASTGIGIIKPGTFVDPQNYQY
MVGSYIFGTKPDPDTQQLDLGTDVQTNGILRAEYTADPTNPNSGSWWQTN
PYSLPDVALNHPARWTIVQGNKLALNCLLVQPGLPGTWCANFNAPDWDPD
DLWTSQFLWMKGLLISPGEANGEGPQIAKARAGEKVRLQARVYNYSTTDV
PPGSSVQVRFYGQPFDETTKIPAGDAFLIDQVSLAPIPGFNSQTNSAEEQ
QPNWVLASTDKLDTTAHADQYLAFWVLVWMQDAQGNLIGEMPGHGLYQLP
GTFTSITDAAGLVEAYSNNVGFYPSLFYVAPPNAPGASHAGNGSGDVRLD
RIQLSKRIALLGDKVTVRANVHAENDHDHLTALFYDGNPEKEGKAFEYET
VSHIRGGSFYHLKTIYDADDCGIHDIYLDLWPEGIRTHTKLYVTLDPRPI
ISDMLGYLPQPLSASSIDPPLPLRPVTSMRLLHRLVPSQAFPFEEHSGDA
VTHLQAAKKFFGERDNASALASLQKFEAHLRSQSVKSLSAQQIEILLGQT
QRILDCVKPWPL
>MCA2714 conserved hypothetical protein
MSRIFEFGGADWEASFSAKERADALNEIEQGKILYFPRLGFAIEEHEKAF
LSPETVGKSKNVSFDAATGDLHGIGPAVADVEALRSMMARFAGRAAGLVA
RLLPEYSAALVQGRTSFRPVEAAGRASSWRKDDTRLHVDAFPSSPVQGRR
LLRVFSNVNPEGRPRCWRLGEPFEPMARRFLPSIPAPFPGSAPLLQWLGL
TKSRRSNYDHYMLQLHDRMKADMAYQAEVDQVSFDFPPSTTWLVFTDQAS
HAVMSGQYLLEQTFYVPVEALRDPATSPLRILERLTGRALV
>MCA2197 conserved domain protein
MNETLARSGGKPVGIRNKALWLAVVVAGAGSVSADASAAATYGGSCQGCH
GAITASGTAGGYGPALTGSKRNASATKAAIAKVGAMNSLSNLTNADLNSI
ALEIGGAADLPVATPTPTPTPAPTATPAPTPTPAPTATPAPTATPAPTPT
PCGDESEPELDTIPSPWDANVGKELRFTVSALDCDDDSLVIKAKGLPTGA
KMTQGFDVNTRKQVATITWTPGPEAAGRIYHVTFTAVESEVSAKRSWKGE
KEHEDENHSSQSRSTDIRVWPANTTPEAGAVEAVVIQQAQWQPYREAGKL
AVNGSIKFSKLLSTAERAALLAHPVIIRADSTQEIIGQVTASSSGKWSAN
LPLANTAVPCAVDVEFLSDTASRPVKRAPSQCK
>MCA0286 hypothetical protein
MTGSIVPASRAKPRVLLELMLAPLRFTNALALVIHGLGHALALFAVTRDP
SAFSATTILEGISFRHLGRSLLPFGPIPQLSSHSPRIPASSCEGWRCRAV
AAAGTVANLLALAAASRCLESFDGIVAVLGWTCFCVSSILAMLSVPDVLA
VVRGSTPYWACGPAFAVRCNLQGEENDPLLISDRLREMARILAREASTRG
GQSAGFSVLVDKGGAQSVIFDKVVKGKRDDIVGVLSGRLDGLLEKAGREG
YRRPPDFEAILLHLRYATGGATHWHNAQPHWYEYYDSMMHHRVEGQTLVS
QPGEVFNMIAHNGDMDGVYLEFTVDGEKRRHFFTQQEARGVFLNMMPRTS
SRGDSDSRSVAEWVDFIYTQGLSYKALRYAYFTAALDYNRDIAGGNFNLD
LLLRWAEATDLALLNLRKELGQRALAAPARSLADLSAESRQRLREVLTRE
VGTALEADVLPGFLATFEEAFCHHDLTWVMRTASRDLVGEFALMVCSTLE
PRMGVFSLTQAFSLGHNRTRGEIFGSAEPLGVTSALHQGEPDDDALQIYL
EDGQYATIEYRAASPSEFIRIYDRAKVGDDFLQAPRPSPKALQPGAEQRA
AGVRSNWFAVNDNAKISRIHQLPAPGNAVEKDLREIPFVLKRVLDSFEPG
GENHATMNHFGELLFQNLLNPNRDPRMHDLVLYGVDFNQDLLNEFAIALH
SVLPGLRIRAENSGNVLKEMKRTQREGIGCYGPSTVFLGVSNSAQTQSTL
AVVRKARDLVGAERCFVLSQSFLNSMTEALGQGYQPDDPILPNTFVNLSH
LSPDGTSGRRRAEAATIVVVATQAVLTEILIHLARKAIEAYSTLSREVLT
GQSGDFELRSDLQMSDIKAFREFQAAVYGVDIPNRVGCNAAGERIDSPDT
EALNREAAARAENQVEFVRSYAIFAAYIVIATVFGVPVFGLLFSPFDFLA
GVGLAAHVLDAALFLTALWLIHLGVRRWQGRPVFERIGARAEVYIDRKYI
ARIVERYNATLFSNMPAFLTPFFYWADTVRDALHRYGIRAHRGVVTIHRL
PDERMGIEEANNAAEENMVYAQLGGIRFNGGQPQSRDKVRQGSCYVSASR
PYQTVLSDSLAGLRAKYDGKLSPEVFRLVNRRLIDLCDGLITEFVIGFRR
KEIVNRALWDVIRWIPGASLVYEVFLRNGLDLTNLAGEADTANQAQIQST
KHPVSPLDIHTRTMVPRSTFDALRTEEQAPDDSFAVLVFSEHHLALHLNR
HAMLEQPQGRVQEVILKPGRGSERGRLISETGDAAAGEFVGTLDRIDGEP
CLVIRNKTADLRMAVLLGSLNPEQRHYLLHHYHLGSPSHLEAAA
>MCA2811 hypothetical protein
MSIYTYSLASTHPRGKWRWRLYPPQKIQDGYAVLPKNDGD
>MCA1304 conserved hypothetical protein
MSAQAARAAAAESASEFSATADTPSPHPYPSEGKPMEIDLNLIPAKLDAV
HVGLAAVVVVLLLVQIILLSVAVIALLRRRPEPAVTFQSSPAVSAAEPVK
ALEPVMKKETVVLKETTPDAALQLLALLQKEARFIDFVQENIAHYSDAEI
GAAARVVHEGCRKVIGQVFDLAPVRSETEGSRLTLPKGFDAASVRLSGNI
VGEPPFTGTLVHRGWRVENIRLPKVAEGHDVRILAQAEVEL
>MCA2501 hypothetical protein
MWCRPRTVWTAPIRTRTLWLRLEMKKCETCDPSGIGENRVVIGKNHGQIP
YATRILGVFLIYLPILTLPFVMISAYISYFHLKLMGAENIKTLKDFLPDR
ASFRYKLSNQIVMRPGYRLSPTQSRLFWVLNCTWYCPFSVGLFEWHAYLV
KLVENWWCPFFHGRKEGHYQEGAIDRSFWHIFPEDVVKLDPEDRDNPIWN
EDVKGVPEH
>MCA0955 hypothetical protein
MSGETGQPKRSRLASLAGLVLLLSGGALLLYDRFRPAVTLEEVPATEAER
TALEPGGAFPITALTRYALRRGNASVTLTVADYRDARNQSHRVTLFPADR
LRHDTWKAVGEAIIRHTDGNALFLAWWDDAQRIHFLSGRETWLDRPAAAA
FPDDEERSVWGKVAGGFAAVPDASRRLARWLTMEADAALEAMKAELPAER
PVYLLVCLDDLARLGEIEALAGVRLPFEAARFPGGDLHGQIAAVQQWARQ
DVDQASYLVQQTPGAGVQAWRITTPQGRDTLLARLLPFTTSLARPLPQAQ
TVYQSGWGSHITIYALNFSEGKM
>MCA0958 hypothetical protein
MGDLFARGFLKCGHPVHPILRGSRPESVAQAIPEPDLVLVAVGETELHPV
LASMPSAWRTRLGLLQNELLPCDWQRHGITDPTVIVVWFDKKKGRPFVPV
LPTPVAGPRAGLVVQSLEAIEVPCHTIPDEELLYELVRKNLYILTINIAG
LRTGGTVSELWDRHRELAEKTADEILDIQEWLTQTRLPRGRLMAGMLEGF
AGDPNHICTGRTAPARLRRALDHAKAAGIATPTLTAIAEGLAAAMPAS
>MCA2901 hypothetical protein
MGIQVSATGTASSTGASLSITRPAGIATGDLLLLAVALDAGTFGSGAAIA
TPSGFTPVPAGSVGFGSNGRSQLAAFYKVASDGEPASYTLNFSAGGFPSY
AANAVICAFGGVNTEAPVDASAGVSGGSSSSLAAPSVTPSEGEGDDLLVG
LWGGTASVGASTWPAGMGDTLVANAGGGSLLMADQQLVSAGATGTRTLTA
SAATTWGAVSLLLLPDRSAALPLWENF
>MCA2693 hypothetical protein
MVKRLNIRVLVTLWQSGAMAPRSRPGPPDRRPELGVPSRHPIRRCAPPRR
KVMGELNPHPRHPVENESTPEFLGIDAQALTPGSRAAVQRAGCRRRDKGL
PLRADGSLFFGAVSLDRPSHRCLSRARRPS
>MCA1688 hypothetical protein
MATRLRGCNEPSILIVPPRSGTHRLGQHRLSRRAQQEQCVADDGQESNNG
ENQERSHDIPLSPTDSIDDRPPRFPFCPDEDDMNTSVWRIGQTLNSVQST
FVGLLRIWSIPLVLLLSLASGYTTYYGLSYFITDWIALIITVAVQSVIVI
CSLELAGCHWRANRVRFLTVGVTLAVALLVSVSFSYFKFYELSQQDNTLL
ARYRALDQNLNDYLDRVNKLKSALMAHQQKRAEAAAKEATQAYLGTLQGI
RDESRKRVGKGPMWSHYNELQRAEENRLRQMETSFGELDQRITAARGTMQ
KFSSDLKNPALYAELMEQVRQVQSKADNLASVHGATPVLAPPWGSHAEFV
RGITPSFAMWEDLSLFALACAAMVDFFTLVLSYRLEFSAPGPLTEEEKVL
AFHGLREFSQFAINDNDELEFVFEKSELERAKRYPDWNRMFAVAFLLNRG
FLRKVSERSVEFAPNLYPIVAECMRKELPQAGSAFHDGNVRQFVERKAHE
RRV
>MCA1547 hypothetical protein
MAPASLSNAAQNQPHYTVHRQDENDRADDASHDVGLGRAVLEIPEPVLFA
LYAESAGKRADDHLQQVGFVEVFGHGCVVGFAAERPCSDRRSGCGRGSFF
PEQIVLDPESAGDHQQHQGHGDFVTADHVFVHVQAIGAHIGAQEVEGLNR
GDGQRERDDELVGDRVLRKSHFVEEELGDQITRDEHFQDGDHESFHPVER
RCRGEHRQQDDDDGAHPGDTDPKFLVYFTEFQKVADFHCFSPSVS
>MCA0614 hypothetical protein
MQAEEPVLGDDEQEVRLGHGNALSEYSMKMPGTAGRTPRMKAGPRFYTAK
PPLWDASPNVPLSPV
>MCA2179 putative lipoprotein
MQRAPRHLLHPFESTMKPYQKIAVILLGTAFLSACGKEGPFGNTPGPKKP
EWMKSKNETPEETDASPPNK
>MCA2958 conserved hypothetical protein
MSNSRTNTDTTQPRKRAMSNTNQAAGAQANPFCAFIEAQGGTITPDQFRS
WLALQGITMADWARERGFRPREVSLVLNGQIKARYGRSFSIAVAMGLKPD
PNRAQKQHAA
>MCA1626 conserved domain protein
MKIKLLAGAALGISLAAAADDPARCYNIRDPDLRNACLADTRGDKSRCYS
IQDPDAKQLCLARLTGDKSRCYNVRDKDLRAACLAGMGR
>MCA0623 conserved domain protein
MAFHKNLTRVERARLAVTKLICPITRDALRVIPLTDFIFGIYISSRYRYV
YIDNPKTGCTSLKSAMAELELRGRKSDLDPLNLEVIHYDSSPLKSFVPIF
PKPTLSNLIKNNYRFVTFVRNPYQRLLSCYLNKFDNSTAKDNPQARRMPR
GAAPGSFSEFIDAIIGQTDHDMDPHWRSQSINIHYSRVPYAYIGRFENYA
TDYVTTFHKLGIPADEIPTLRHLNKTGAGRASLHEFYDKKSQDAVYARYQ
DDFVNFGYPYELPD
>MCA1136 hypothetical protein
MIASLGRKAGNWIELDFSGVESASAAFTDELFGFVERELPDIWLVPTHYN
ETIRPLLNRHLSLLQQRRDRAWCLASIATGFEGCAALAGHPRTRR
>MCA2939 putative prophage MuMc02, lipoprotein
MRCVSGFPGAAILALPLTLAGCVQVGPKELVPKPILTPAFAPESCALPDL
PPLASTVFVDIAPGRPPKADAGGKALVVGYGRAREAIQACKGGAEGH
>MCA0780 MxaJ protein
MLTSSPYYRSGYVFVYRKDTGLSIQDWNSAALKTVKRIAFMPDTPAETMI
RTIGRYNDMFNYMHSLVGFKSRRNQYVRYDPAKLVAEVADGNAEVAVLWG
PAAARYVRGAGLAMTVIPDDNRRSDGEKVPHHYSTSVGVRKGEEALLKQI
DQVLARFGKEVNAVLEAEGIPLLPMDEKPARTASHDRRKG
>MCA0175 hypothetical protein
MNPSRWKCIFVLLLLTLLGVGPMPITSIIGIYVVVARPRWFRDLVRRVYD
E
>MCA1531 conserved hypothetical protein
MSLAKLLIVVGLGIAALGALLHLAPGLFGWFGRLPGDIRLGDENRYVFIP
ITSMIVISLIISLILNLFFRR
>MCA2545 putative membrane protein
MPIHALTRIVAVSITVLIASRTSGILWSSLFWSIAFVHYGLALYYSRHRI
LAVAKDSSSLVPALIVLSGGAALYVSQFSLLLYFGVHHVFNEVYLLDRKL
PAKEAERRRGLRVAAIFLNAAIYAVLLRHYSELAWIGPEFLFVVLLFFYA
LFFYRLGQLRPVLGVRGMIDGSIFEVFGLLLVLSSFFVRFDLNHIVLYHF
IFWSLYPIEKFKRLGDGALWRYAAISLVMVVVFMLFSPAGIAGGAVSESF
YYQQFIFWSYAHITLSFVLSDAHPAWITRWFRSDRRQWAVS
>MCA0505 hypothetical protein
MLEYPWDLPPVDAAPTADGRFYPNFLCQLPGSADRPGVVLAVGHKGADRW
NGAEDDRSIGRLWANLSQGRCRFVMVKDER
>MCA0759 conserved hypothetical protein
MQHVVIVWLKDHGSQAARQQYIENSRRLGQLPMVLSYQVGTVLTADRAVV
DGSYDVGIVATFENEQALQDYLAHPEHRKVIEESLKPLVEKAVVYDFKES
N
>MCA2944 hypothetical protein
MSHTKRSFIQACLIRALPALDRVDAGIAHAEALWERLTAKGYGAPRQTGP
RESVDWYARLVEPSRGWFDQFWTAYGLKRDRNGAAMRWYQLGDLTEHEAR
RIIDAAKQDNRQWRETAQPGQVRKMAQGWLHEKRWMDYAPTPQPPLSGGY
SAGLAGDAQLRELKQQLASLQRLNAAAPSKELQRQINELVQEIGNFQRPG
HG
>MCA0426 cytochrome c family protein
MKGELGRRLRSAAWCLSLFGFLPSAVAAGAVNAFDRTLMHPAIPLVDEDG
RHVLESGRPYSVRMSCGNGNGGGCHDYDGMNHAYHFEQGRDETRDDYGAR
RGLPQLVGPGYFGGFNCMQGNAAAALAKKANTSEAEFGDYGAAGYVKACS
TCHLGGGWEESDRGGVRYDQMPAGSVPAWDGDYYDRNGDGQVVPWDWRKS
GVREADCLTCHADFSLLKKFSASGLGGNGDGTAGAQDHWGLLQDGKFVAQ
GFFRYANSAMFEFLDVRPDLSGGAQLLTVARAVKPGTAKPDYDLVLGDSG
EPVLHWNRDAFDDSGKVRIPMRRFPGNDNCMLCHLAGAGINRINAKVKSS
RRGFYGFGEEAAQTLGEDGKPVDDYKDDVHKGKSWTDDTGETRIIDNCNA
CHAKQYYKPAYANIDLDADHDFPKGNGDADIRNDLDYQPGPADCEYCHST
AKAPALPSGQPTLLDAHRERWKSSGFMRGYATNSYNRVVQVHFDDLACQA
CHNHKAVYNGKTLPMHYRYRARADGALRVIPYVPNARFFVQDRSSGRVLY
RYERQSVFRLKNGGQAAIVDPASGQETGSVTVTAGQFDLPATYNDFKALK
QAYDNLLKGKGYSSADVRFVYAESNEYIFTHQTRPAEQAVACEECHTRRA
DGSINGAVAANGLMGANRVIEVARIPDARLVDEGVVELASGYHKVQADGR
ITETVAEVLEATDKAPDMSILKAATGRAVGGPLRKLPASEAAALAALDQD
ATGKLTAGWDSGLSLTFSARVGHPSVQGAALFIPGSPLNQLLLEGVRVEL
SSRESTAAERRKVGKLGTGTLAPDVYGLSLTRSGGSGVKNFGGGEGWIKL
PYWGAATRIKGVKVVYSEDGKTWRTLARHRIVAFKAASGGAAGYVLLRLK
RPVAYLAFAGKAGSKS
>MCA0019 conserved hypothetical protein
MILLALVLHVLSAVLWVGGMFFAHQVLRPVAAGLLQAPERLRLWNAVFAR
FFPWVKGAIVLLFLTGFGLIHAYGGFGQVGWPVHLMLLIALSMTVIFGLI
YARPYQALKAAVAAEDWAVGAEALGVIRRLVGINLVLGLITAAVASGGSF
LPI
>MCA3110 conserved hypothetical protein
MTEVRFNPFSGRLEFDQMALATLDGQSALTAEHVFFDIAVAASLRSGYLV
VEGMADQVGLRICLDAKGVSNWTKLMGGTGESPPPRVPFGVERFSIRNAG
VEFIDEGSKVHLGASEVNADLYGLGPDLSEPARLEMKASIAGRAKVSGNS
ELTLIPFHVNPVVYLDDFDLTSIAAYLEGMGVVIKQGRMKGNLAFEYGSD
EAGQVFRLGKSNLIWHDVQWGMKNGPDADWEVDDIRLDGLEYDGATATLR
LAKSHIDTLAVASRNGVKVRIRDADLGKLLLDLYRFSAQAGSIAIRTVDL
TQDRGSGSMAAAVFNLWSNGLGWSETRLAARGLGWEGMRLWDSSATLGEP
PEIRFRKMSADDVAADFVKRSFSMAKFDSADAEISAWISPKREFEIPGFL
GDWAGVIRPGSSNEGWAFSLGEGVIRNYRLNLADHGVDPPARIGLDDLEI
RLQGVDTRQGKFALRLESVVDRKGRISVQGSGSFDPPEAELRLQVENLGL
RPFRSYLDDFARIDLAKGRLNLEGGLAYRPVGNDVRFSGTAEIAGLVTVD
RKDGRDFIHWRSLRAEGLTLETSANRLSIRQLVADRPYARIVVSRQRTLN
LIENLFQPRSKPAGPSGQASRPFAVTVGSLLVRDGSADFSDLSLQPSVSV
DIRGLTGVVQSLSSRPDAEAEVSIKGSISDTSPVTISGRINPFLFGTFAD
LSIRFKNVDLTELSPYSARFAGYRIDKGKADLDLHYRLRDRKLLADNNLV
FDHLTLGERVDSPEAISLPVKLAVSLMRGLDGKINIDLPISGNLDDPKFS
ITGLLTKAAVGVITKVVSSPFSAIGMLFDGGSDDAGSIDFRPGSFELEGA
EKSRLDGLATALSQRPGLSLEIRGTARSGRDASALAEQQLRRQLENAKAI
ELRLAGGDRDRAPAGSSVLSGEDYRRLFSHFYRLRYPGAAEWAALPRGER
VLGGELFESARGKVLKDWSISEIDLRRLAQARAAAVRSYLVQKGIEPTRI
YLLDVELTPGDGDTIALLSLS
>MCA0044 conserved hypothetical protein
MFLKKKQNGHLVEILGLGDLFNPMHKMVPGRLHYGEELQDPEKFAKSDLM
FPSGEDLPRCWVDPHYRDSELKK
>MCA0488 hypothetical protein
MNHIDTLIRLINQGFSAQVMLKDGSMIRGFRPHDIIARGGVSAGVVVARG
LVGGEDRIVPLCDVSGVFSERPRRVLTTDAGFFQRNPVPRGWTRLHPLAV
SDGN
>MCA0893 hypothetical protein
MRDPGIGEVVDGASTSPLLGFAMRVSPLRRQAPTARDVAASILYEYQLGE
LGRQFATVGLAPVVVKGQAIVDLAFPKEEIRLAGDVDLLVGDEAEAVVDV
LTGLGYEEIPPRSTHFRSEDRSFVQQGKRLPKLVELHQCLDKVLLRPIPY
DEILARAKPSGRPGFRYPEIEDLFLLVVLHASADIFFDQARVERDLWFLL
NHGRPDMAAVWSRAQQWELSRALRRLLKGHYPAAKNKPPGPFVYLANQAL
WHDSFLTVLRGVAKYSYARLLDRLYP
>MCA3084 conserved hypothetical protein
MEQQLDLRDIHLPPPVGGWPPAPGWYLAPLLVLALILTLWIAWRRLIRNP
FRAALLRELAVIEASDELEPAEKIARVSILLRRACLTLYPREDVAGLTGP
AWLERLDAVLGGRQFRDGPGRCLIDAPYRPGGAVQLDVLFALCRSWARKL
PRSRRYPP
>MCA2010 calx-beta domain family
MRFHLLLSRFPLNWLGVLIPLCLNPANAAPPDPFEGARHVLTVKFNVNRP
VTDTDDQFRAVLQLMDAIDGETAGVAEHHPAKDATCDQCHVKDFISVDHY
GLMINEANPAKSCEKAGCHSLTVAGEMPFYVKPFASLDETRNAGLAKQRL
AAARTLLRYAGSFKGNMAARLSGLPQIGSFSIDVRAQDKNINNLTHDVPL
VFFEPKNAFDVKKNADLLPQYAAALSNEVLSLVEQDMSVPTRLEATVEAR
MARAADLHKHLALADQDMSIQPLLTNSQIALRYLATLPAEAADFMSSRLA
EDFPAAVLKTDATTGEAATGLRRTVTWEIPAVANDGVNAATIVLQRNEEG
SRTQPSTQLQFQRLAGSMRDTAVRMLESFQERYSAFAADHARFTLTYQAS
READKSLLTTAIGNSREAAGAGELRRTLTHNENTATGTLPAPGQVSAAIG
AAKEIAAANPAAIRRLRSTLRFKPDLTTVSQIAGEVLLGSLPRVWVSAAR
TEIAEGSGERLVFTVSRETPMDRSLTVNLTLGGTARPGKDYIKPKLRVTL
PKGVASVDVSVKLRDDRLIEGRETVTVAIAGRREYTRLPAAEATVAIADD
D
>MCA1943 hypothetical protein
MCLLSQADAMHGMMGTLPFTPPDRARFSAERTESPFDLARRVDADRPRSG
IGRGGAVGGTPAAEDAVAGDHALQRTEGVPHLPEVLCHVFLQVSAAAQAQ
HHNVSMAQ
>MCA0160 hypothetical protein
MSTLAQYSVTKTIRGVTMNDAKEEKDYFWFYIAGLVIAVVVTLAMVKSSE
HGKYVTAAKAIEEDAANSAYKHAP
>MCA2177 hypothetical protein
MNNHGLRRVLLASFLGLATGVSFAGGRYADPPPNLVEVFNASKAALEAAK
QSNKDLCLENAKKARKLAIDSYKEKSTMPMQLSSSRLKSVINSLEAGQVT
EAVAPLEEVVKEVGDEVEYYKKEGKL
>MCA0605 putative lipoprotein
MCSKFRSPQKKESDMMKATKFAVVLMAAGLTVGCASKSDITNLQTQIDGL
KAEQASIKSTADEALSAAQAAESKAGAAEAAARRAAAAAEETNSKLDRMF
KKSMMK
>MCA0157 hypothetical protein
MVRHIAFWVLLACAVPAWSGPPNPPDVPEALKPWIPWALHGHETDACPHL
LEQPGEHRCAWPGELDLELHERGGSFRSRWRSFTDVWVPLPGDPQSWPLA
VEMDGKPATVSARNAVPGVRLPPGAHTIAGRFAWATLPDTLRLPGETGLV
RLSVDGRATAFPFVDEQGQIWLRKPPEGSAPETSPLRLEVVRLLRDAVPM
EISTRLELDVSGEAREVLLEGSVLPGFVPLSLDSPLPARIDPDGRLRLQL
RPGHWKVGLTSRATRDLREIALPVFAAPWPGEELWSFAADRSIRVVEVSG
APPVDPRLTQLPEEYRTLPTYAMVPGTALRLDQVRRGDPEPEPDALHIRR
TLWLDFSGQGLTILDDISGRVSRSWRLDAAPTMVPGRVTVDGQPQTITRL
EAGPGGVEIRRGELNLQAESRWEAGMSSLPATGWNADFVSAAAELHLPPG
WRLFAAPGADHAPDSWVGRWSLLDLFLVLITALAAARLWGWPAGTVTLLT
LALTWHEPDAPQSVWLWLLGVTALSRVIPPGSFASALRVVRMIVLAALAI
ASLPFAVQQLRLSLYPQLEPHALSFDYGDRAAEPEAGAPAAPAAALQQKS
GSLAEPHTPPPAGLTRTIDPEALAQTGPGLPQWDWNLIALNWNGPVLAEQ
ELRLLLIPPWANRGLNVSRVALLAVTVLLLLGWRTKPGVSGLILLPLALF
AAPDGHAAEFPPPALLDELRMRILAAPECGTECAQIQRMKLRLGRGELAM
TLEAHAEDRVAIPLPVRSAQWIPSSIRVDGAPPTGPYADPAGTLWLMLDP
GIHRVDLSGPLPNLPSIDLPLPLRPYRVDVDSEGWEIVGVDPNGRPESQL
QLVRKSSGTSPEPRGDDAPLPVFLTVERTLRLGLDWRVATRVARLSPSDA
PVALTIPLLPGEAVISPGIATENGSVRIHLPPGIVDASWESTLDRTGSLE
LRAPQTDDWTEIWRLDTSPVWHVETSGIPVVHHQDAGGNWLPEWRPWPGE
TVTLQISRPPPLPGNALTIESSTLETRPGGRATDSELALSIRSAKGGQHS
VTLPEGAELQSAEIDGTAQPIRQNGRAVAVPLHPGEQTVRLAWRIPAGIG
LRTVAPAVDLGSASTNSRTGIELGQDRWILLVGGPPWGPSVLFWGTLGVI
VVAAFALGTWLNRLPLEIRHWLLLLIGLSQVPLLASLVVAGWIVALNWRR
SANMPLKPEHFNLVQTGLAILTVLALSILLSAVHRGLLGLPDMGITGYDS
NAYRLNWYTDRSGPMLPRPWVISVPLFAYRLLMLAWALWLAYALLDWLRW
GWGCFATGGLWRQSPKRETASAEPPRTADATGSTDPWQA
>MCA2112 conserved hypothetical protein
MTQTAKLRPLVFGILATLSMGGEAKDFAETTGLLDWATDGANVQPFEEWG
VKWGGWIDTGVSTNFTNSKWNGPVSFGDRSAELQMNQLYLYLERAVATSG
DDWDFGGRFDFMYGTDAGFTQTYGAPQGNWDLHLNASNIKYYRTALPQAY
ASVYAPIGNGLTLKLGHFYTIIGYEVVTAPDNFFYSHAYTMQYGEPFTHT
GLLGSYPIDSNWTLTGGVVTGSVTGGWDGGFNQGLGAWSGIGGIGWVSDD
QGTKVNISGTTGPTSETNSNQWSIYSLVIQHDITEDLHYVFQHDHGFANG
ASMGANGKPQDATWYGINNYLFYDIQDDLSIGLRGEWFRDDDALVNFGSS
GANFGRVYSIGRQVGGVGLGSGLPASSYYEFTLGLNWKPSQWLIVRPNLR
YDWADNAKPFNNGGASVGYAGDRRDQILFSTDVVISF
>MCA2764 putative lipoprotein
MMKIPASLRRGLLSSLVASVAGCTTIVDRFSGRSESCEILRDGSPATARI
VGLADTGITINQNPVAEFVLEVQPDRGPPFEAKTEALIPRLEVPLVQPGR
TVPVKYDPQRHDRVALDLWECD
>MCA0427 hypothetical protein
MVWLAVGGLVPFPGRCDNWHGSVRSKLQVDNRYTLQNAHVFGELWGQGFY
DDSQDDLHGAMEFVTRTGYQPDGGGVAGLYQAFVEKGFDSLNTRVKLGRF
QRTDNLGLYLVDGGAVAYAADDNGWGFDAYAGHPSRYDHVISVEGKFVGG
LEGRAQWTPDWGWGSDGPSLGRIDVRGGYQYFWRDLSQRAFGYSPYTGDG
GNGSGTVNPGMLIGDTGLQGAVAGSYSAYGGSKPWGSSLQRLYFATTGAG
RLGLWRNSDYELGVLGTYRADRDAFENIRLNGQLDLTNDVRVRGSYEYFQ
PRDPILTFREKFYSAYALGEQTLARTRIQHEPVKGFNYYVGGLASSRKGY
DGYGGELGANYVFNPNFGLIGEFDYMSLGPESASSFYVSSTHTVNSRLQM
RVNTALRFEDKILYGFNRAVGAEIEAFYMLRNNLVLNVAGSYVWYTRILD
EYLGAVQVIYYFDNFKPKGM
>MCA3075 hypothetical protein
MKSRMLLNLSLLAVLAVLGAVAYFEPGKKEPAETPLTGVETDKVDTLTLQ
RGDKTIVLAKKGGHWWVSAPFSAPANEFRVRQLLEIAKTPSDASYPLKPD
ELAKFELDKPLASLTLGDVVLVFGGADPINMRRYVRIGDTLHLVRDDFSR
HLTAQATDYVDKKLLPEDARIQEIALPEWKARLGQDGKWVFETPQEGGET
LIGELLTQWQAARAIDVKHLDKAAEGQKLHIGLANGESVDFVIVQREPEL
LLVRPDYGLQYEVVGEPAKRLLSLNPKPAETRTEEASPSASPVPQAPTDE
GGAEEIEAD
>MCA0922 putative membrane protein
MATTFEKSPISPRQDTSRVRLAVALMLLALTFAIHGDALWMGFGRDDGGG
LVQAARLSPADYFFDPPTSAAISGGSISPWHALVYDVDILLFVLDSRWHH
FHLVLVLWLTSLATYFLLRSWLTPDWSLLGTALFLTGAPTLYVSHEEMSS
HYLYGLIFCILSIQCFLSSKEPHGGKYGAASALFYLLSVASKEIYVPLPL
VLLLYPAENSRRRLQKTLPHWIILACHAIWRYLAFAAAWGMKFPSVPTWS
RRPEIRPGCARVPASP
>MCA0524 cytochrome P460
MRKLAIALLFPAAAVLAEPAAAPNGISLPAGYKDWKMIGVSSRIEQNNLR
AILGNDIAVKAAREGRTHPWPDGAILVKLSWKKSTHELFPSAEVPGDFTQ
ADFMVKDAAKYASTGGWGYARWLGMEQKPYGANADFAQECMGCHSGAKAA
DYVFTHPAKLP
>MCA2672 conserved hypothetical protein
MPIQTGDVKLLKSAVMADVPEGGGAPTGNTIADGVSNAIFPDISELDRAG
GRVNLRKTFVSVQTDDTDTYFGANVIVAEPPKDARVSVTLFSTERTFDNR
EQAQLRIEAYLNKGPEWAGYLFENHIAGQRVIQLFQRPTDTVPNVGQTLV
LVESEGQATQKEQYVRATAVSVVERTFTYDGDKDYKAAVVTVDISDALRY
DFTGSPASRTFTRAANGTKVRDTVVADAGTYVGVVPLTQAAAVGDFTIKG
ASIYTQLVPSAQTETPISFVPPYAAAGLPVPGASSVSYTANHAWTPGLRF
NLPGGCLPGSLTLQTDGITIFDDAGLLKTASGTIGTIDYANGILALNAGS
MSNAKAVTYTPAAQILRAPQSSEIPVTPESRSQSYVGTVTPVPQPGTLSI
SYMAQGRWYVLSDAGNGTLKGLDASYGAGTVNRNTGAFVVTLGALPDVGS
SLILTWNVPTQETQQPQVTLKASQTLVLSPPEGKAVQPGSLTVSWEYTGT
KTATAATNGVLSGAATGVLRIASNRLEFAPNVLPSVGTQLTVSYVAGPKQ
EDAFAHPSRNGSGLVPVTATLGSIEPGSLEVEWNTLTDTSVLGAYTLAQL
LEMGVQAAWRDPTQIARDDGNGHVVLNGSSIGTVDYATGQVTFNPDVTVL
IPRPLYTSTAINGTGRWRLNYRGLAYVEAPSLYPNDESGYVKLRYNSAGS
TSSVTETFQFTPSFKLVPGVNAQIVPGTVVLTLPGAQPWGDNGQGTLREF
TSSGWVTRGTINYLSGDVTLTSWTAGTTNAFTRASCVTTVGENISSEYVF
RTGAAPLRPGSLSIQYARAVGGTQTVTAGIDGKIQATGITGSVDYENGLV
RVRFGSFVTAAGNESEPWYSADRVGSDGKIFRPEPVAASTVRYSAVAYSY
LPLDADLLGIDPVRLPSDGRVPIFRAGGFAVVGHTGKITAPVANGQTINC
GRVRLSRVRVVGHDGTVIHSGYTTDLEAGTVSFNNVTGYSQPVTIEHRIE
DMAVVRDVQINGEISFTRALTHEYPVATAGDPASGSYVSSALIAGDLFAR
VSLVFDQATWNGAWVDAMSGSAATATFNNTQYPIRVTNRGAVTERWVVRF
TNSTAFEVIGENVGVIATGNTSADCAPNNPATGVPYFHLPALGWGSGWAT
GNVLRFNTIGAQFPVWVVRTVQQGPESVPNDNFTLLIRGDVDTP
>MCA3053 conserved hypothetical protein
MPSGLHPDADHLHPTDDLPDHHSLAAQIRAIFETGFGQLLQDLFSRIDDE
LFKLADRADNSGLQTLYFDAMRQIRRDQAGMRSRCLREILEAEERFWNPA
DTPSAQPEDSGNPGLTLLENDVLEIELAVTHAAQKANLLFQDTLGPFELR
LAALKGLQGPPLPPHPFTPEALAHAFAASLTGQPVDIRVTLLILKLFDRH
VLCQMGTTYRQMNQLLADHGVLPDASLVFKAKTSTMPEQHRPSSAAHVRE
DDSAEQLSDLLHLLDLWKRQAGNDASQTDGQHFDPGEVVGALSLLQQSAS
ALASGEWRPGAPIPRLKQSLVEQLTGFCPDGELRRLGRFEEDVIDMVSMI
FDFILEDRNLPDPVKALIARLQIPVVKVAIIERSFLAKKTHPLRALLNAL
AQAGVGLEPDDRKDRIVLKKIEEVVYRILTEFDEDTRIFAELLEDFSAFI
EGERKGVKAMEERTRQAAISQERLALAKRAAAVEITARIEKRPLPPALVS
LLLNAWKDVLVIAHLRREKDSTDWDEALAIMDRLIDWADAHDAEPRRADA
GTGEPEAIRAARERLENIAFDPGQLHAFFKDLELWHLSRGNLGPTPAPAN
ATVEPCEIEDILIEGMEDAQPAAQSAPAIDESFVEQARNLCAGDWVEFCD
GPALPFRAKLLLKSRFTSRYVFVNRRGMKLREIGLHELASGLAANTIRLI
AGANEPLLDRALNAVVDTLACSTEQSDASATS
>MCA2342 hypothetical protein
MAKKVVPITSKPRGPVLSLVSAAEERTAPSPSPSTNTIDDLGKSGLVAKA
VYGSVYCLSFSVVLGALVLGKLIPGRRVIAKGLQDGAAAARQSLTWLETP
RSSPAPGFGPSTLKA
>MCA0552 hypothetical protein
MIESHHASTRLLVSPNRPTHPTRSVHVSQNQSIALTRSRPGPARLLVLVQ
LGKLGQDFRQSLHLVGFDVRIQLVEFEVVAEQEGALREKRHRIHHRIRPI
RQQRRRGVLCPDQPAGTGIWHHRLGRRQGHLCRHRQGTGRRQTGPATVRG
AAPATRECRSDPYAVHRRGLPLIRLAVFALLVLPATLRAAEAHSARIDYL
YLDANEGQAAGGHAALRFGDDVYHFEYVEGGLIRAARQAFEPFRRLYTET
ENRTLHISRIAVTSVVYTALRERFESLLRRQNAQFEVPAALHRDAELLAL
LARPAPQPGLPVKGLGLFEPAGTDRPEDPALAGLRTRALARYGADGLRTR
MASVRQAISALRPGPPDDPLPSDANHSPPWRYRFADRYHDFHALLEAYRL
LERAAPLRRDACHVPAGREFALRAGETETLRNFAGRLEDWVLAALDRPRP
DQGPGLLVAFARLEALGKTLRYGRLVFLDTLSEPEPAVLPWSSAPPVFTT
ADADFRLARRELTGGRELDEVGYSLLENAANRRLEARRALAGKAPFRLRI
PPGLMPARPGTPTEIVRPALDAEAMQDAAAAFLAAESAQRERLAQIYRYH
LVERNCVTELFRTLNDGLAGLNESDTRALGGHIEPTAADAIPFVSAQSVE
RRYRITESLELASRRKLFLESLDAEAYLAELSPLSSAIYPHNDEDSLFLF
FTDDATWPRPLFGAANLAVGAAEALAGILTLPFDAGRTLHRGLRGMLVSL
PELAFFNIRKGTFR
>MCA1843 hypothetical protein
MITIRCELVNDPGRGPGHGLIRIGGLEQTAGRLEFCLERNQGSAPFLGPD
GIWQAQEFWHGVGSEAGPDSAVAGTHPQPFSQGEKGADPSSFSTVREMAA
EARSSHPFPSGTKPPGAVLVEQSSPEGFRAGWPENEKWPEGPDEGEKPHG
ADTPSMGKGTLQSIPRQIPVGPEIVDPIVSQPPSVVFRLTLSADGARYPC
VLRIQRPLLGSGAYFEAKPAPAPQAEPEPVPEPVPVPEPEPIPAPAPSPE
PVPPPVPRPSRGLWLGFALLLALAGVGAWAWWDCRIPALPGPQCQTAAPK
KEQNAEAQPVSPPQHCTGLSAEDCLAAAQKRLAARQPDAARQLFQEAAEL
GAPKAYIELGRMYDPDTWSAETSPAANADWETAAYWYDEAARTGDRDGRL
GAGRLYCRNTQDPAFLSHALELLRKAVAEAPGDAAAEAVLKECEEKAK
>MCA1839 putative lipoprotein
MTSILRSSTSFAVVMIFLSACASQPLTHSEKETRRAEIRSMANQMLAQLY
QSYPEAKARVRNSAGYAVFSDFGMKILFGGGSHGEGVAINNANQKATYMK
MLELSPGLGFGAQKFRAVFLFDTRESFDKFVNSGWEFGGNTEAALQTTTQ
GAGGRLGVTVSPGVTMYHISEAGAIVGISLAGAKYYKNDELNN
>MCA0077 conserved hypothetical protein
MLITFESKVGRVTMFGDVALHLLRMMGNTGTVPGALLARDIPVALEKLEQ
GLAAEAAPVSTSSEDEEEERQEKPNLRVRAFPLIGLLKDSLKHGCDLLWE
QEGKAPLKF
>MCA0132 conserved hypothetical protein
MNPLTAMALTPTQRRNQRTIIIIALICIVPFAAAWYFARHPQWVMGDLGN
VGHLITPAIPLDYGELLAAPVTSSESMAQLRGRWIVLSVADGPCAEVCRQ
NLYKTRQMRLMLNKEIPRVRRLLLLTDPGAADALADWLRQDEFLAVAGLA
PTLREKLEKAVGGPLQPGQILLLDPLGNLMMYYDADFDPYGVLKDLKHLL
RASQIG
>MCA0500 hypothetical protein
MIDRPRSRPGFERLPRLKAVSSPQRLSFPDRHRRLYVSLTQRQLPLRRSE
IRSQR
>MCA0487 hypothetical protein
MFFEGFSVWACAPSAGATASNPGTPCIAGKTVQRAASARDVATLGKFLST
LAVSV
>MCA2554 hypothetical protein
MNEKQPSLARSADHRKTFGTKPGLPEGLRRISLRGTRATEFRPTGSEDRP
MDSIELESPDTGG
>MCA0265 hypothetical protein
MSQADGYHLFARLRPNRKLDVDGAAQPAQGHRAEGGFFGLVPDQAQQGFF
AVEQFVLLQFGHGKLAGQRVRLVAGEHLARRRDAVGNLVFRREGSVLALD
GDDERHGSLQSNLRKCSGISVCISLGK
>MCA1087 hypothetical protein
MLKAEIVHCLPGRLRLRVAEKRGDEAALRELANGLAAHPEVASVEVRPVT
GGLLVVHNLGEAWQPITRYAADRNLFAVTAPAASVSPPSIADATVAGFKV
IDRRLRRATGGALDFQSAMFLTFVGFAVHQARQGHMLGPVSTLLLNAYGF
LKEKGR
>MCA1460 conserved hypothetical protein
MVPGHQYRNMVRRCVMKRLYYLADNLDSVERISDALHKEGISDWNFHVIS
KDQAGLYRRHIHSANFIQKSDAVRYAERGAMVGFLFSVLGSVWVATEQPF
GPDMGGMVYLAIFGFVTLFGVWVGGLMGMATENQKISAYHDQIEAGKHLI
LIDTRPQEEDRVRELMARTYPEAHLLRVGSTLINPFKFAHPAV
>MCA0765 hypothetical protein
MRAAGRSARLSQEQCEVVMNDGLRQVRRPSGISRIGPRSGVVLLLLASVA
EPALAKRVYCPTNLQSELDAIKSGGTLEISGTCVGHFVVWKNVELRGKGT
APTLFGAGGGTVLTIMGAANRVKVKNLTITGGGGVGDGGGILNYGDLTLI
GSTVVGNNAVNHGGGIYNYGTTTLKQSQVTGNIAGLDGGGIYNNSSASGW
GTLSLYGSEVTGNKAGGGGGGIYSFGSAYLKNSVLNGNQAGGDGGGLKSA
QGSITTVTGSVITGNIAASNGGVSGTVTFKGKQSTISGNLPDGP
>MCA1235 hypothetical protein
MLMCPIATPRSSAAETVDLFSFRPPRAWPFGTRHHASPSFTPVCGRVRSA
QVQKLEKRMQGTTPHVCTPNRD
>MCA1398 hypothetical protein
MSNENNSEGPAKANILDTLKSNPKILYVIGGVAVVILLALAMGGGGSGEV
QVKTAVQVGQTVVVSNPNVGDSQLTAAPGLMNVASGEEENDEQKVCVVKS
GTRATVEEESIVGALPFVKLKILDGACEGKSGWTSKVNVSAN
>MCA1172 hypothetical protein
MLPQGNGSHRITASRTENKPFSNEAATSRLHSVRKKLAGSSRSSEFALYP
SEIKGSGSC
>MCA1582 hypothetical protein
MRQDRRRPKATHPDPDRSRLASRRHLYLACPSLDPTEGYGSHRLMESSAA
AAAVPLSREMNIAAMAGIIDLSRHPP
>MCA1503 putative membrane protein
MREVMRDFAGLIMRGEGLAFLVASLLGALSLYFLPAASFCAAAIGLVGLR
LGAVKGVVLALAVAVPVTVCGYLFPPRPGLELPLLVLIGPLVLGAAALLR
RDGRQGPPLLFIGAVCMAGAIAIELISGDAAAFWKGWLQHAVKGVKGATV
QGFEENGTLVLMNGLIPFLVGASAFLTLLLARWWQSLLYNPDQFGPEFQR
LRLPRTALAAVVALLLLVHFLRPALEMHILLVGIMIYAFQGLAVVHGVVA
QRGIGWTGWLPPYLALVFVPQFGAVGLAVLGAADALANFRRLPAAESRAA
>MCA1806 putative fatty acid cis/trans isomerase
MLAIAGCIEAYRTGHDARHGPRVPWERRLSEEDYLARLAAGSVSYSRDVQ
PILDSRCVVCHGCYDAPCQLKLESFDGLERGASKTPVYDTTRLQATPPTR
LFIDAENVQGWRSKGFFPVLNERGDSPEANLRDSLLFRMLELKKAHPLPV
SGALPDTFDFRLDRTLNCPTVEEFDDFEDDHPEWGMPYGFPGLSEREHDI
VVKWLREGGFAPPPAPVSAEAASAVAQWEAFLNGSSPKERLFARYVYEHL
FLGHLHFRGLAPREFFRLVRSRTPPGEPIREIATTRPYDSPGPGEFYYRL
RPVQQSIVAKTHMVYELDDARMRRYRELFLDPEYAVDRLPGYAAADAANP
FKTFVAIPPRSRYRFLLDDAHFFFSGFMKGSVCRGQAALNVIQDRFWVAF
THPDADPVSNDAQFLADQAERLRLPAEKENSPGIGDIWYTYRALEEDYQA
AKAAWLRAHGERGGTTLADLWDGDGDNRDALLTVFRHFDSASVERGLVGD
TPLTGWVVDYPLLERIHYLLVAGFDVFGNVGHQLATRLYMDFMRMEAENN
FLRFLPSSIRQAERARWYKGIGARINDLWQSPRWGMGGETLIDYRTGNPK
QEFFDRVRAGFGKAAGRPDPFTGCGVEVEGCGEGGEPALPPVQASVEREL
RRLAFARGGGVAFLPELSYLRIRTGSGGAEDGWIYSLVKNTALENVSMLF
LEEIRRTPADDTVTLVRGIVGSYPNFFFDLDAAEVGEFVDAILSVDGSKA
FGRRLVERFGVRRSNPAFWRISDWFNRRYLEASPVTAGWLDLSRYDNP
>MCA0446 hypothetical protein
MVSLRRLLLFHVDGQLVIDVVFGVGVLGIAAHRDRQIILGAALEPELPHD
RGEILRGGRVGAVVAELIALDEGVTLQQPAPEFRHPLEFARKPVHLRSQC
LVTAVDGEGRDHVVVRGITAGVAFDADGDPDGAQPLALELAQVDQIPVGV
HTPAVAAIVLVAGRGDGRVAPGHRLGLRDHRRVAVVVRRVEGLSGVQGGG
GRAGLGVLLVEDQLGGILDAVARIDVAGQGVASGSGEAGGRHRSQQLAEG
HRFLMQGMIGKYYPVLMDAICGPKAGRRNSGVSGGQAAPWVCEITHLVRL
ITQLVGIAAGCHCPSTRDDETSRMKDGFPRMARLMLRSARRARSRRA
>MCA1901 hypothetical protein
MARLSRRIACAILASPFSIAPAFADTTLQYQVAGEKAPQSLFVKDGQVLI
KSAGGDGELDILFNRSRNAAFLIDHRKQAYMPVTEQRVAELVSQVQDVQP
LLRGLGEQLKKVSPEQRAKWSDILGGVDLDRITAKSEDSRPIVLAQVGTA
KTQGGYACSKVELRQGGDKKGDICLASAETLKLPAGDYETLRAMLLFSSR
IAEKAKGVAARYVDLGPLPTVNLTDYPGIPVELHDASSQQAATLTLSRVN
SDVFPPGVMAIPESYAAKKLKFW
>MCA0311 putative lipoprotein
MKQRLRGMTRTLLPAAIGLILAACENKVTRENYDKLAIGMEYSKVVELLG
EPENCQSVVSVKSCVWGKTPKTISVQFVGEKILFYSNTGL
>MCA1570 hypothetical protein
MLASSELKTFRGLRPSRARAVQIPLRAEEGGLNVN
>MCA2829 hypothetical protein
MSDQLLRDIETLAPENLDDLLLVIARNVEESLKKGGARPGIDYSILDLYQ
LAQPFALEIFKKNIDIMNYAVRW
>MCA2399 putative cytochrome c oxidase, CbaD subunit
MSVEHEHDSAHEEAPRGTWVLLCLVALGMTAAWLFLYFGVFLPRGQLGQ
>MCA2766 conserved hypothetical protein
MNASLYERDFFGWTQQQSELLKAGRFSELDTEHLCEEIEAMGRSARLQLT
RRLEVLLTHLLKWRYQPELHGQRWESAILDQRRRLAKLIDANPSLKPALH
VCFLETYDNARFSAMMETGLTLEAFPAQPPFDLVEVLDPDYLPG
>MCA1029 hypothetical protein
MAMQNTTNALHLMKAGALSLVLTGLITGCGDNGSSIPAASSSEAHISGNV
SDQTGPINDGRLQVTDRNGAVITAFDLKDTNHFELTIPAGTVYPIVITVT
PTNPNAASTAPVKAVVTSDIADRQDVTAVSTIVVDSAIALGGLTAENIAK
ASGGAIGLRQSQGVSAGAGGGGAGPGQSGGGTGRGGHGGHGASEGGHGAH
SAATPAPAVQH
>MCA1278 conserved hypothetical protein
MERLTLAYLGQLAEAAAVRPYFEPVVTDRGVQGARLGLVDDNTTGRFTRQ
EFALAETWLAPGDDAEAAFKRLVADGRRFILVDLKPEVLRRLAALPQAHD
VLLLDVSSRDDALRGEDCASNVLHLLPSRAMRADALSQYLAKKKWTRWFL
AVGPAPEDRLFADALKRSAKRFGMKIVAEKTWTHSFDDRKTPESEVPVFT
QGSDYDLVLVADEAGSFGDILSYRTWLPRPVAGTQGLVPSAWHHTLEAWG
AIQLQNRFRQQAGRWMTEADYGAWLAVRAIGEAATRTRSLEFEAIRSYLL
SDGFALAGFKGVPLSFRRWDGQLRQPVLLVQERSLVAVAPVEGFAHPKNE
LDSLGYDEPETSCHAKP
>MCA0536 hypothetical protein
MMTERLPVLSRIRLTLSLLLMLGPASAGAGLTGPANYWECILHGMEDVKN
DPVAREVMKICLSDFPDGFSVEEPLEGTPTECILKHGEDVSSALAAQQIH
IACNVLYSRP
>MCA3073 hypothetical protein
MDSDAYLKTVSTGIFALLAALATFCYLVDPFWYNRKVSIPGFNAVKPEFK
RYERHVKPQIVRRERPAALVFGNSYAEIGFDPLHPALTRSDRHERGYNFG
LAGADWERVYCSVRFALEYAAPKRMVIGFQPGHPFPAVDCRVLMAEMEHL
PVAALLLSGKAIKASLRTVKGQRRMPTHTAEGLFYYTRFEVDEVERRFRA
DFARYLANLRPDAPCLLKPRAAVVPPDFDPSLGTTIPADLAGLQDLVDRL
ARAGVETRLVAYPVHALRAEADIACGFADLRWEALLSVARTVRAADPEGR
WAEVWDFQGYDPDLLEPIRDNQTRLWQDVGHFNFEFGNRLLDRMFGLGEE
GFGSRVEPAAVPALRAAFFRDRATFLESHPGFMREFADLIASCRKTS
>MCA2874 hypothetical protein
MGREFRGNRYHPINPTAGQRVVNIFSHASAMLDGVVVFPQNNRR
>MCA2285 hypothetical protein
MLRFLSSLFTPTAAGDAGPDRALIDAAIERAVDATDRRVRALGDYRQRLR
EPVARAVDHVIALVDSMPPPTEISAQSFGTDPRIRAFFSSVDHLHDVLGK
LKDVREYRRHCAEIPADEIFGLLVMQKEERTVLGMELEGDTVRKDVVQVA
VSFSHHRYIAPAPSEEGARRELKKRAFDFLIERALERLARETRKRVELEH
QRRLLRRKLDALRAGGWGISSALADDGLDGGGIEAAEAEIESIDAELGQV
GTNALGLDQSLDCIAEVLGQPGKWLDIRPVVLRLDYRGIKLADFEPSLAG
ELQLTELFSATGARRIILPGRIPKGEIPDRPDVVKELSRYLG
>MCA2650 conserved hypothetical protein
MKIPTPRYRCPLGRLQPDTTDLDAIKQRGWRDQHILVVNAFDERLDFIER
EIVRRIGERLYGQGGTNHG
>MCA2169 BNR repeat domain protein
MLWMSTTIAASSFPQTRRASGSRGHRFENTEDWQVSSSNSDMRTSSLNAR
LLTALVAAWAGFLFEPASLAQQPYAVTYPDIAGLTEIQGLDVAVDGRSIH
ALLVGKPADGGRSRVVHVHSEDGGRTWTPPNFLDHPDDPPVIARPGNDAR
IAVKGKDLVAAWQTQGELPGTGPMHIVGSRDGGKTWTPGGNPATGDALKN
QAYLSLAADPQGAFHLAWLDDREETGDTQGLRYASSKDGGLHWSPETTLD
PAVCTCCWSRLAVLPDGALSVLYRDAEPRDMKLFLRRDPAQGWLSAGPVG
AFGWHFPGCPHCGGGLTGSQAGNGRVTLHSVVWTGKEDSAGLFYLKSADA
GRSWSPPRKVGDSQSRDGDIATLGQGMLAIAFTRNTAAGAAVQLIQSGDD
GARWTEPAALSAETAKARHPRVLATPFGFRVFWTETRADGQKTWAIAAPK
TETQARRDS
>MCA1171 hypothetical protein
MKVAFTRLTTIFALSIPLLMPSTGTAAPYKASLSDADRQLIQDARQSVKD
QAASRIEALRAELDAGKISKREYKSRVWEIQSTQNQLLNSLSSKANQREL
AGLLGDLKAHPESAEDILSARGSNNHGNGWNYKWFASWISWLREYISNCR
IL
>MCA1738 hypothetical protein
MNQIKKSGGFTTLLTASIAAALAPSVSMAAPKAHASSAQASLERRLQVME
QEMQALRAELEASRSKAEAEAAEAKAEAKDTAQQVQAQQAKVNQGLAELA
KHEEKKDDMVFFRGGWAAMNHARTSELLVNNNLLSSNNFGSDKAGWYVGA
GLDHRLSDDTFGISDDLALDAEIMFDYKNYGSVNNSFVSSVTGTRMQAQV
TMFSLYASPKLKYTGIEGFRPWIVPFGLSVNVISPPSSGVTVLNPGLMLG
TGLEYNIFKNLWVGADFRYNFTGGDLNYSVRTNNGKTILNSTDTDNYTAG
AYVGIGF
>MCA1720 hypothetical protein
MKRSILSSARNGLAAACALIASAGVAHAQSACYTDSLFPDERFVIDAETQ
GVLVSSWYDLAALLFGGKQTAYSVHGKYVYAYQEEGDPWVVHMAAATGTI
DVGTRYFGKNVNPGAVHTQTGARLGLTVHIVEGTGGEALFLPVTLDCRST
ETSVLPSEWSCESYNAWGDYFGVSTLTRVPYQADDERCNLFEVVPPTMPT
VSLSQGGHHKSRAFRQ
>MCA2478 hypothetical protein
MNSPPSAYDTPRGLQGKSPGQPGVRRRDSDRRSGFEKRRTGRSLPQAHLR
MPGLGSPGCRAEGRKQSFESQ
>MCA2510 putative lipoprotein
MRSTAGFPLGASLLMAACQAAGAGGIEGMAGAWTSLSLLGSLGSVSPDWR
KFKWYVRDQVRLRDDNPPNAWRMYEDLLWVGVGYQVDPRFYLGIGYAHTW
LHPVDQPAYQENRPYLEAVLTHEAVGGKLVSRTRLEERVLQQGGEVGIRF
REAVTWSHPVRFVAEGAEFYMGDELMVCSNYSLFGPAGFCQNRILSGISY
RLSRNLGVDFGYLGQYMAGTPGTADVWTHNIQFDLHYVFLDD
>MCA2685 hypothetical protein
MGLDANILNSLTKESAAEAWRVIIDSAVEIASKPLAAGNAPSEVVKRDRA
IGALDHFLATSGWDLWSSFGDVVERTSDRPVRWWKEPYSAKAVLILDGLS
LRELPWLMQGAKERGFALHEVGANASELPGETNEFARALGFTSRSQLQNN
GGGLAHKLQPAHTECVDMPWKDCEGLINSTPNWVFWHHWPDSKVHDGAGA
GQGLEILTRDAAQQLSSDDFWAFVERLATGRRLVITSDHGYAASGHFPDA
DGEVGQFLKKTFSSGRSASGSGDTGPFVPPVALQINSPHGAHLMALGRWK
WRSQGGYPTLTHGGLSLLEVLSPFVEVTK
>MCA1828 dinitrogenase reductase ADP-ribosyltransferase-like protein
MAGPFPPSETHHSPNVALAGHSTNLVGIPTELLASPDFNQHPLPLRIHGT
REAHSGLFARLGRCENLREAGGVFQDYMSVVFGFEEEQRLGEDRQGRRRF
RGSYLRLLQDWGFDSNNPQGAVLKGWVESRFGLFPTFHKQPLTGFATPAW
MDYVDEKMASRFHNNCIYLQLDLLFEFCQWAIARFETPARRHVRLYRGVD
SLGEFCPLGAGNGREILVRLNNIVSFSADRTQAGQFGAYILETEVPVVKL
LFFNDLLPSHPLRGEAEYLVIGGGYRVKIVV
>MCA0209 nifZ protein
MGDISRDSDSMELNDPPKFNFGEKVRSKKVVRNDGTFTGAEIGEVLVKKG
EVGYVKSINTFLQQFYIYAVDFPDRGYAVGMKGRELESVDNPPPSKQDAE
PRGETA
>MCA2460 hypothetical protein
MPNKPLPALLALLCLGAAVPSAQADDYGTTTTLKLGSGLSNLTLGWLEIP
KNMINTSNQINVLFGISGGLLKGLLHTAGRTLTGAVDFLTFPVPTQPIAH
PEFVWQKFSDETSYGPAFTSGTFKNPKPAPAPATSPYSKM
>MCA1402 conserved hypothetical protein
MTPEELIARIRQGDRIAFTDTVEAIDSAYHFTPTEFRNGRGDDVVINPAG
TNSGSCKIFGFARLHRLSVTETLALFGDYYWEDVLGRPDGDDHRNIRTFM
KYGWEGISFPKSPLAPR
>MCA0432 putative lipoprotein
MKLKDKMKLGPAGLVLSLSCWANGGSSFAGEPLFVQRVESAAPTREAIGK
AEEQVREHKDVKLREKMAVPAFHKRIEPPLHEGETYCQGCHRPQPHSKKL
RTRSFLNMHSRYVACETCHFRPEDVRLEYRWFDYAARRPAAADGSRFRTG
RNLDNSVPIDGKFKIAPFYRGEPAFVLPGTAFAERVGREWQEGDLAARAQ
LKARLHAPLSKEGPACAKCHTEDSPLLDLSALGADSRQASAIRRHVIPQF
FGRYQSDDERLKIIDILR
>MCA2599 hypothetical protein
MAVGGRSGPGKSAVDGQGHDFRTPPSPVVPIFQVGQRALEFTLQAREYLE
RLGRVVHHLEGHRSVQAAPTAGTERLPGHLDVAGTGAVDEHDLEVVGHRQ
IEHVANGDEQWVGAPAAEQYPAEVVGCGAEQACLASEFARTMEGFLHVDG
VACWVPV
>MCA2906 hypothetical protein
MTQQRVKIPNGEDWLAGWAVPSAGGQLGFSESVGVADPDTGMAAAVRDGA
PAASDPGLVVRPVPVPLSGGDAAATVVEHSVHTTTLLPANPGRRGALIFN
ESVERLYVKCGGFASASAFTVCLGPFEVWQVPAGYGGRLTGQWSMLSGLA
DLGRARITEY
>MCA2092 hypothetical protein
MTMSPSTDRPVRRSRGHIPIPQSDRFPELEESNRL
>MCA2855 ammonia monooxygenase/methane monooxygenase, subunit C family protein
MAATTIGGAAAAEAPLLDKKWLTFALAIYTVFYLWVRWYEGVYGWSAGLD
SFAPEFETYWMNFLYTEIVLEIVTASILWGYLWKTRDRNLAALTPREELR
RNFTHLVWLVAYAWAIYWGASYFTEQDGTWHQTIVRDTDFTPSHIIEFYL
SYPIYIITGFAAFIYAKTRLPFFAKGISLPYLVLVVGPFMILPNVGLNEW
GHTFWFMEELFVAPLHYGFVIFGWLALAVMGTLTQTFYSFAQGGLGQSLC
EAVDEGLIAK
>MCA0617 hypothetical protein
MNPDHGGFTRAGLSECRKPQPDRPPSSPPKHQKRHRQAPCPHRNLETADI
LDPAITSPRNRWGGSPPLRARKTASHTFVSRESNPAGQ
>MCA2925 conserved hypothetical protein
MTNRIHCFKPGRHLPMGGSQPIEFSERDLRDAAACYDPALHRAPLVIGHP
TRLDPAYGHVKALSYGPDGLEAEPEEVEPAFAELVNKQAFPNVSVGWYAP
DHPRNPVPGKWYVREVSFLGAVPPAVRGLRRPTLNPAFAADEDGIIRFSG
ERDDSVNAGLWRRFREWLLVRFGTEDADQVVPSADLQYLEDAAREEIRDE
AGDPATASQPVFSQSTPEDNTVDQAQAAALTAENERLKAELAAARQRESA
AAEARRRDDAVQFAEELTRPDGNGRVRLAPKHKNLLVELLMLAGKGADEG
GLPQFAGEDGSTRPLVDAVKAMFAEGGAVVRFGQFATADRADRAPLKNPL
VADAERRAGNA
>MCA2835 hypothetical protein
MSTFIKSTQDGRKVEVIGLAVCLDGHKEATRLVPVSEHPNREAILAAVPA
ATHMAGRLPLTAEEAAAAQAALDAAREAYARSPKGLSERIRAVQNRALAN
RDG
>MCA0069 hypothetical protein
MRGMRSGEKRSGFYRLAVVAGMLAVCSHALAVPGKIVILRHGEKQDDCAS
CDVGRERSPALGVRYLGGNTADSVLPSEDPAALPAITLHTLETVSPVANI
QGKPVVTCSAVPLPDQSDGEKAPLLNKLARAAARDVLNDPPSNYRGKKPQ
VSLRQRVRGMDAVVPAMPPPRSPRTA
>MCA2644 conserved hypothetical protein
MTAWNDFNDAEQQQSFDLIPKGTVARVRMTIKPGGYDDPAQGWTGGYATQ
SFDTGSIYLACEFVVLEGEYAKRKLWSNVGLHSPKGQTWASMGRSFIRAA
LNSARNVLPQDNSPQAAAARRIQGFHELDGIEFVARIDIEKDARGELRNV
VKLAVEPDQPDYARVMGLPPKTPGGSSGAPAAAIPSRVMPAPAAAQRPAV
PGKPAWAQ
>MCA0445 hypothetical protein
MALYRRCRTDRPAGSLPAGGREWPGSGRLVREVAIPGLFLPVWRRRRAAS
QEGYVRVSRKLTVSVALALAQILRQVAQIHVHAVAHRRMPAGLADDAAVG
IDHLAAAATTAGLLDVHLELEIDVVFLAGIAVLGIAAHGHGQPVFLPALE
CQRAADAGHGLRTDTVGRHVARVRFAGPRGEQPAPERRCGLELARIPVQV
RPLVLVAAPDGEGGDEVALGRIAADIAFGVDGDVELAQTLRFQTAQVDEE
PFRMHGPAVAGSGYGAGRRIGRRVAPGHGLGGRDAGGRAAVAGGVDGLAG
IDRERRGAGLGVLFLEHKFRGVGDAVAGDDFDGFRRGRGADEPQGQGGDN
SFHRES
>MCA0546 hypothetical protein
MVMFRKFRGLPRLPTWMLFVLPVLAAAAADRPVTEVLGIADLGGLDQGEI
VSYDVREPTDKTLAAGLAMYVEAPPERLIAAVRNGALLMGDSDVIAAGIF
LPAGGPEALNGFEFGPSDAEEAEALSEVEPGSEFNFSAAEMQGFRALART
LAHVSETEKLRLLSRRYREVLWQRLDGYRRYGADAIEPYARSGGATVDVA
AELRTFTEQSAVLRHYAPELQAALLRFPAPSDAASTLQWVERKVEGRPTV
ILVHQMVQPKAGGALVAVRDFYVGHSFNSSQMVLGILPYRQGSLVFYSHC
TSTDQVAGVGAGLKHAIGRERLRAVMFGRMERLKASVR
>MCA1161 hypothetical protein
MDNPRKRSRSLEEALSGLSANVEWREPAFLKAISLLLGGCLAIAYLNLNF
TGRKTDFEDAAALGAATTTIRASVAGYRIHCDDSRDAAECLAGVEARHAV
HSVLWLGNSQVHAVNQLRAGETNATPLLFDQLKNHGLDLITFSQPNANLQ
EHYVLFEYLRQRLPLRFLILPVVFDDTREDGLRKGVADFLGDPPTALALS
RTEIGQKILQAARTVPPSAESDTAGISHTLQERVEKSINAWLGAHSSLWA
SRPEMRGRVLLNLYLLRNALLGIQPTSKRKVIKGRYLDNLAALEAILTAA
ASQGTRVVMYVAPLRNDVDTPYVDSEYRHFKAEVQALAQRHDASFANLEN
LVPAELWGSKAGTSVGKGQEMDFMHFQAGGHKLLAACLAELVSDAWVRRE
VRR
>MCA2410 hypothetical protein
MAATDRLAAFSAAALALPGMVAAQGLDLSPEAPAIDSGYSNYQESNGRMS
VQAFQNDASLRLGEDVNFRVNSTLDFIGGGSNPMNLSQIGGASPTFYWGR
NNNYGRKYGELELSQATQGPQRGRQGGIHDQRLAIDGALSVALDDLTLGV
AGGNSTEWDYISNFFNLDARWDINRKATTLAAGFGYASDTVWADGGNQGR
IHQQYTDGTVIGGDKNTYQGLLGLTQVLDKNSLLQVNLTVANSSGFLSDP
YKSAWVNDVPAVRSPSRRSASARAAPDMSTHSGASGGSPEPSPSPVPAPD
QPAEVPATPVNTFCRQNVPFSFQINLCADTRPGSRIQSAVLLRYVRNIPE
LDAAALHLDYRFYSDDWNVDSHTFEAGWLQPLPYETLLTFRLRYYTQRSA
YFYQTVYDHPTADGLYSSDYRLASFGAVGGGLQVNKTFFGMLQIGGGVDL
YQRTQGMGFMGGTGSHVDNFSFALYSVNLNLKF
>MCA1342 hypothetical protein
MRSKSAGRLRDAVESLGWDASRQDSRRGRKVTKWWF
>MCA1947 conserved hypothetical protein
MKTIFPARTIRPVALTLLLAGLHACEPQPVQPTVQTSAVAEQPAATPPST
GDFPTLTRVEYVLQCMQEHGGQNYDNMYHCACAVDVVAKAMTADEYDQAV
TFTNLFGMGGERGAVFRDPPQSEQLRKKLKDAKAAANDTCFPKMPGSAGG
QRGG
>MCA0406 putative lipoprotein
MDRNTSWRRPSAAGLLVAALLAACATIRVDTGFDKNADFSRYRTYFWLNG
PASGDASVDRRIVELVDAALRSKGWRRAAEGKGDAAVDVEVLTEEEERND
TYYDGWVLNQNLGPAQTVVTTFREGTLIVNVLDGRSMRPVWRGVAHEMLS
GNPAENEKLAKEAIARMFAAFPPGPAPR
>MCA0795 conserved hypothetical protein
MDTRLRMKTPRLILLCLIFTTAQNAAAMGPNTLKGPMSFIEVLNEVLVMR
PVGLVATIVGTTLFLATSPFTGIASAAEPHDAFRKAGDALVVGPAAFTFS
RPFGVYGYNPKGVYPERRPD
>MCA3022 hypothetical protein
MSSLCASTARGKLPTIQGRVQPSRRHRISRWQKVPMTSSDTLKFLESFLA
KDFEAEYHRRRPRRRAEPLVITLSRDFGAGGEALAADLAHFLDLPIYDKE
ILDRVADKTRIDKSHFEHHDEDSAERISDLLYNLMFGTAATRYDYRRALI
EVVTELARSDCIIVGRGAHLILAGRKVFRLRVVGSRAVCGERIAAELGIG
QAEAERLVFETNNKRHKSVQTLFSDLYDACSLEHAINFDLVLNTDHLPAG
NALPTVLMAVRQFGFEIFDLGQREAS
>MCA1913 hypothetical protein
MIRSFQDGDPLYVLSPEQSRDIVKRHPQFIGKDPVLIVRDMRRSGPRDGG
EYHGFHLNLGLIQKPVRALHALQEFLRFLSLHHDSPNIEKDIKDRLHKDG
FLGTIEVIREGAKDAMGA
>MCA2302 hypothetical protein
MAPHPRCCQRRAFTSSGWVRIIFDIAGKSTSMNAQTSSHSLPVHVGARAA
AIAAFALGGLLVLMVGFSPLPAVHDATHDTRHSAGFPCH
>MCA2328 hypothetical protein
MPLLASIASVAFIVVQRASATPCPCGVFTAANPTVSTANDAGGLEVGMKF
KANTNGYVTGVRFYKYNQSLMGGNHTGSLWDANGNRITSVLFTNETATGW
QEATFTSPVSITANTIYTV
>MCA1241 conserved domain protein
MIEVADVDAAHRALAIHAPPPITTSWGSRTFSLRDPDGTAVCYLQWVAG
>MCA0900 hypothetical protein
MYRKKALTLAVTTLTSLNAPSLNAKTQHDRVDVLEARVHDLESKLEKALH
ALEAASAKQTTAAAAAPVPDIKKIDQKVKLIERKLEVDKEVSDGKWAKLP
NVEVGTQGLNVESKDGDFKLNFRGLVQADGYYFVDDEDPNNVNGDGLVNR
FIMRRVRPIFSGTLWKDIDWRIMPDFGMGAARLFDAYADLRYFRSASLAA
GKFKAPISLERLQSASALTFLERGFPTQLAPNRDIGAMLHGEFDGPWETE
STRSYNLYQFPEFLAYQVGIFDGTVNNGNIDSPTTDGKQVEARIFAHPFK
GAGLDAVEGLGVGIGGSWGHPNDSATPTYATAGQQTMFKYASAARIDGTQ
YRIYPQAYWYWGPFGLIGEYAFSQAGVANQSTVNTGTGPHTVTKYSTIEN
DYAWNVTASYVLTGEDNTFQGVRPRHAFNPFEGKWGAWQAAARWTEIAFD
SDVFRNVAKPGSSTPIYAFADPRQAVRNATNWSIGINWWLNQNVKIMADY
NQTSFQGGGGVYDSKGTLTNDVVDRATEKVFDTRIQVSF
>MCA1699 hypothetical protein
MALFACDPGKALNVVEMSLYINLMNPDCPVSPHPYPQILWTFSDPGNRVS
ACEPGFPYKWSGYCFILVDA
>MCA3042 hypothetical protein
MKKPFSTHILSAALLLAAPLAGAADYPPDFKPSVIYRDPSLTGKPAAPEA
APPAQPQAAPAAPARPAPAAPAKTEAAPEPQAAPSSPAKPAPDSGSDYYL
FGGVVAALIGFVLWSSRRPATAHPAATAAAAAPAAAPAPAATGVAKYLQA
QGLATGPETGVAKYLKALPEPVRTPETGVARYLKNLPLPEVAAAAETGVA
RYIKNLPKPAVVATGETGVTKYLKSLNG
>MCA0656 CRISPR-associated protein, CT1134 family
MKDICLKVFGDYACFTRPEMKVERVSYDIITPSAARAIFEAILWKPAIRW
RVTKIEVLKPIRWISVRRNEVGAVASVRTAQTAMQAGRGDLGLYIEEERQ
QRAGLFLREVAYRLHARCEPTDRKGEQESPDKFLAMFHRRATNGQCFNQP
YLGCREFAAHFEPVTNLDAAMADEPPITETRDLGWMLHDIDFANGIQPKF
FRAHMQQGVIRVPEWNSEGVKG
>MCA0424 cytochrome c family protein
MWIHRLQICPWLWAVCFIAGILPSYGGEAPADNGFDRAVLHPAIPLLDES
GRHVLDSGLPYSPKNSCGNGSGSGCHDYARITRGYHFEQGRDETRDGFGN
KLGLPQLTGPGYFGGYNCMSGNAPGWLARKSNGSAAEFGDFGAPDLVRYC
GACHSGGGWGEFDRNGGRYDEQSAETVKAFDGDYFSRQFQEPGKTGQYGG
SGPSEVVAWDWRRSGVREADCMLCHADFSRLKIFPPSGLGTGGSESAALQ
FARLRDEKFIAGGFFRHAASAIWEFLDVRPDTEGGAALLAVERTPATGTA
TPDYRLVLDDQGNPKLHWNRDAFDESGKIQVPMLRFPASDNCMYCHKTGN
SRRGFYGFGPEVRVRMAGDGTTITDFRTDVHKGAVWTEDNGQARVIDNCN
ACHARQYYKSPAANVDLDADHNFPKGNGDNDVRNDLDNAPPPASCEHCHD
QAAKPALPSGHKNVLEAHREIWKANGDMRGYPENTLDRITQTHLNVVACQ
TCHISRLADNGKEFPMRYRYRVGYGGRLKIFPYKPAYRYFVQDRTSGRVL
NRYERFSVIEERTGSDGGNYGAILEPASGKELGRVVMNGDEFGEPPTFAD
YKALKQAYDALLGMKGYAMPNVRFVYIESNEYALSHATRPSPQAVQCEDC
HARKQSGAFSALISAEGLLGEANVAEVAKLPDRRLVDAGIVELGMPYYKV
QDDGRIVENVADVLYASRLDPSMSILRSETARTVENEFKTLSRAEALAFA
DLDEAAGQKLAADLPSGEALLFGSKVGHSSLRGFALIQTRGTRTLAYGDV
LKGRVESRPAKAKDRTRIFGQGFGNLVADIYSLAVMDASGRTLPGLVEGT
ALVRLPYRGKAKARGGVNVLVSNDGKVWQRVGGKNLLVFRPRGDVDGYVV
VRIRRSALYLTLADKVG
>MCA2630 hypothetical protein
MRKRRRWRLFGLQMDLRGHLHLGMHLRFRDPRRSHVAPGLTSQTARLNES
FPDGRSEPIIPEKPWTWAEGQAPPRVASYFVEHLLCVADDALFFLLARLG
QIVEDVPCLGIVHFRRCFLVQGHEGVVEGAGDLHHFVHAHAGTGFGLCQQ
LFDMVAGGELQPGIEKGTAGVFLETLQHFLLFVFVPAQIVESHLQPDFRL
IRLFFQGFVIGSYLIVEFFAMIQNIPDLFEIFHDAILRAVGRLGIFCFGA
PKISG
>MCA1811 hypothetical protein
MTSGDSPSVGSRGDGHGNASAETIDGLPEPSGYIRTPKPRQPITGDSPVR
LPRWRSDSTRRASAKTGAVQSPQIGSLAPSAPPEVQFPPHRRRPPSAPSS
DRPRVVSSSSAGTQTGRHGRGAEPPRIDHAPDRGGCRPVAGRPDPGGLQD
PEHALSARVVDDDSHRLPGAADGTGYGGLRSLISRLDDHCRHGAGDLTNP
AFQPLETGQEHQNARTGQMRELFLEADVRKSPYEPAASRWVWNFFMLSVG
GFCRPAAGCQQPGPPLPEEAQAKLEELDAQGGYRPERIYATNEVRETWKH
LHTELDISFLAPSGSGRFPLVVYLPGLGEDATAGAFWRRSWAAAGYAVFT
VQPAALGKVFWASDKLDAGELRATGRAYFSSASLESRLSHLAWALGELRQ
RAATPGNPYAVADPAKAAVAGFDLGAQTASALAGEAVKAAFPANAEFRFA
GAVVLSPHVTLAEGKLDERAAGMAMPLLAVTGTDDDDPYGMSATSLRPAL
WQSMPAGGKYLLVLQGGTHDTLAGVEPGQKWKGPPSQPVPGEGSEGGGWL
PWSGDDLTLQVGQRSGTAGARNGSASGPSGGPMRSVERPLRKRGPDVRHW
AEVAGVSTAFLDLVLKGRERAREWLDRDAGRWMGASAELRHK
>MCA1908 hypothetical protein
MDEADSEHQVDSDRHHAFEPGRGMVAQGFRATFTDRMMPTSSSRVNSSPY
SCLRDGRR
>MCA1866 hypothetical protein
MFTWDDPRRIALSLQRSAESSGRRKAGPFRSAMSMLNFYINRAGSQLSES
RRACLEAAKDELRALYGRPRRRLPP
>MCA1255 hypothetical protein
MDPMTTPKKRYRLTPLAERDTQVNDLFSVFEAAAATTAHRHP
>MCA0341 hypothetical protein
MRRRGAPQVRIGGSPGINAFFSRVRKRPVDLGLNEERAERARRLTGDLSG
VVEPCLPNSSNASCNAGPNNLDASKRPLPRGMTSRKNTVLLPTSIRRSDG
PVRCPPKYRPPARGRRPVFALRRLSAQGRGAAAGASVAVARLGGRAGSSS
AIGDQIIGATDPMPSRAWR
>MCA0130 hypothetical protein
MPHFPTVAGAPHPTSRNPNMLPKVLIVLVLVVIVASLGSALFYLVKDGDR
RSPRTVRALTVRIGLSIALFLALLLGYALGFLRPHGLRPNAPAQTPAPAA
ANPR
>MCA1186 putative membrane protein
MDIDTLYALRSSSGLPSHPAVFLVLLVLTWALHMIAVHVMLGSTALALTG
AFSSNANWRRLTGPMLDTAKVAVSLAIVIGVAPLLFVQVIYDPFWYVSNV
LSARWAIAFIVILLTAYWAMYHHYFVGKEGGSATSRWSLAVSLALLLVAG
FIMHALTSQALRPELWMAWYAPEGHLETAGSGIHEFNLGRYLFFIVLAAP
VTGAWLLGCRRYLDRREPRDAPYLDWIAALGNRLLTAGGFAALICYAAWM
AALPESAAGFPVSFWSIAANASILLLIVIPALFRSASGGYGPFVTAAGVI
LVLATVREALRHRILFGTHGYELSTYAVNLDWYSNALFCLTFAGVGGFAI
AYSVAIAWEAGKTPGIYTASPRVNRLGGFALAALIVWVAQYFFFGFLVIT
R
>MCA0556 hypothetical protein
MVWFFRLVHNCRTIDAVFPQQDTIRRIVHTSIHNYIRRTDGPFSHRDRRR
SAGLCALCRLFIARRREKGTGRRGGNRHAGRTADPGPNRRTTGGNHRRGT
RTGRRRCRRRHTVAGDGNGRGDPRGLTGHTAAGRNRRPPGGLNVPGWGAA
RG
>MCA2591 aquaporin Z domain protein
MRGRCGGKDVIPCWTGQIIGAVPAAARVSGR
>MCA0425 hypothetical protein
MHAKSHRSQLCCSSSPPVAEDAAALPRAPGMPGHPLVQKVLPWLAGAGMG
LGHAAVASNCTVPQQGRCSSCGSCIVVVGSLVAWALSRQRGRGAFYEEGR
R
>MCA0457 hypothetical protein
MHGPGLPARAGRRLQRSGLGRMGRRSGQPHPGAVLPGGTLGRRLQSGAGR
ERHRAQQPAGRAHLQQRGIRLQGTLHRLRPRLRLLRRHLRRHVRTTVRQT
VIPSGTGSKPAVAASAGPSKAGASRDAPVRQRGD
>MCA2283 conserved hypothetical protein
MLADEPTLASTRMFGDKVSAGVDDAPALLFEDHAEITLYAQPGADLGLQY
RALMLAGDEDMVLIGGPRSEDFETYCRHTLGLGSVTVLGVPPASAIGHAT
LPERCAATPSALEPIVETARKHRRLNLIPYIGTRQAWRFAGLIAERTGAR
VKVAAPPPHLTQRVNDKLWFARCVKDLLGPAALPPSYCTFGPVGLAGRLG
ALARRFERIVIKIPDSAGGAGNLIFLSELIRRLPPALLSRRVSRLLARRG
WNGGFPLLVGAWDCDVLASPSVQIWIPARDDSLPVVEGIFDQAIGDEEGT
FVGAMPCELPESLRNRLAGEALQLAHLFQQMGYFGRCSFDAVIAGNDAAT
ALPHWIECNGRWGGTSIPMTLANRLLGDWKRRALLIVQRTDSKNRPCSLP
AALARLGPLAFRDRGEEGIVILTPAGLETGTGMHLMSIAGSTDAARRQAL
AAESLLRAEPTDGRAGSGPR
>MCA2244 hypothetical protein
MGAILDLVEKMRSPTGIVVTIAIVAIAFFFVRWVFKDEDGSNQ
>MCA0218 conserved hypothetical protein
MTATIPTEIQAGLAGGQVIPYLGPGVLDLDGCSAVPSAPETLVELITAKV
TVPHKIRRNLTAAGQYIENFKHRKTLVGLLKEAFAAAPAPNGLHRYLAGQ
PLPLIVDAWYDASMAAALAGRSDFGQVQGVSRAEHFGEWVHYFHADGSPA
AEADAANWNLLLYKPLGGIAPAANFILSDSDYVEVLTEIDIQTPIPEPVK
QIRSGRHFLFLGCRFRTQLERTYARQIMKRSSDLHWAVLPDEPTRNEERF
LAEQRIRRIDLPLGDFVRRLAGAAAARQAA
>MCA3097 conserved hypothetical protein
MVAAAGSRLDKLSLDTVKLIFKRKSMVNAYGDRWIPVNLPPAHPLRRLFS
LAVFDALPEDMEEYWNEQYFQGINPPEVLSSEEAVLRFVAATPAAIGYVR
QAQADARVKILLRIPSTGDLGEPP
>MCA2646 conserved hypothetical protein
MDFNSSASLSGQVTALVDLGMQLARSLQGTRQYLGASRLGASCERVLQYE
YAQAPVDPGRETEGRMLRIFERGHVIEDCMVTWLRDAGFDLRTRKADGEQ
FGFIALDGRLQGHVDGVIVGGPEGFGYPALWENKCLGAKSWRELEKNRLA
SARPIYAAQVALYQAYLELHEHPALFTAVNADTMEIYAELVPFDAELAQR
MSDRAVKVISATEAGELLPRSFSEPTHFECRMCPWQDRCWRV
>MCA1689 hypothetical protein
MSGESDAERWDPGFVGDHYLVTARLGPFRKLFLKPREFTPRFYHRLTELT
IEDWNLPVPDLRLGEAVRISVDLSVRFQPTLDYARSHPDSLDQLSHTIKL
HHQRVLLDMVLEELRVLEDPFWLGEGYAGIERRVETLINETLAGQGIRCR
TRCVLSPEFQALDEAELENLPPWSPYRPLYQALLQRQHRIAAEAERRRLE
LEAEEEAVRMERERARLLLEQRETELRKAKYALEAERLKAELAAEETLQA
ERRAAEARQKEEQVRHEQRLREMEIEAELQSKARRTEAMEDADARLRREI
ELLALERQRLLLEEEVRDIKLAKAKGWIINASRRFQLGQDAEFDDPEPPG
LPPELPET
>MCA1904 putative lipoprotein
MLFRRLLWVTAALAVAACAPAPVVDQSNFSDVSGLEPSLGQQAQAPAGGR
MFTVYNYWHRKGAVFTESSRQVVGEGAVFVDAGDPVFPATVQSDPVYCSD
KLMYIDPLIGAYKRACFSDETGDGLFDHVRVAPESSWIKQPLSPRLPYKT
QDIVVPHGGAFRYELIYQGFANNTLSLRFMEYKGGDFDRPMVTLDMSYAA
DTFPTTITFRNLKAEVLAADNHKIVYRVLSGF
>MCA2309 hypothetical protein
MNHRYSGAILGLAGVLFGIVVHARGFGGAVQQSSRAAQQERQQSAEQMQQ
SRQQNLQGMQESAQQNRNANQQSRQQYSAEKTYQRQLTGSQVHGQRARAY
DDVQENIQQTEKRRGAAKLLGGALRAKQMQQQQAQPQQ
>MCA1972 hypothetical protein
MPATRQVAGSFAPSVRLESALANLSETYDFHHAVTWPALAYGATNPNRGE
>MCA1480 hypothetical protein
MTSSRVKFSNTIIIRRTGLRAERTGRVFARYGSGRATTKDPAERRFKVGI
ALPTVRRMGPVQEPLVPPQPIFNTQHRDTHHPNDRFDIANLKKSIRMTEE
TEQEGMWAKSCGTTGSGGSPARTPKESIDEKNVDQRHAARGIAGRAGRWT
KAL
>MCA1108 conserved hypothetical protein
MKLKTANVLVSASVLALSAGVTAGEDLTEQEVPKAVLNAFKTVYPAATDV
EYEKKVKHGETVYEIEFKDKGVEREIVYSADGKVLKAELED
>MCA2463 hypothetical protein
MCSSPIAAIFLPVGIGQTEEGPGGDAAPGDRRVTAGGSGGRALTAAPASS
RPRLLPRRSARRGTVPDERRCRGARVPERIGGVDAAWNRVAADGTPSNCI
IRRRMNGRRVRPTDLASLFIPRKVEGLESSWIRRFQVKAMEHNEIGPLHQ
IEQISITLEDSRKPGYAASTPTRMTSSRLRERPGHRIPSRNRDQYISVPS
AADWKRPSVWHETVGLPGMTGSPVSPGADGDRSDP
>MCA0624 conserved hypothetical protein
MRLPRVLFTRLPTNTRYPFEGQLPLSDSMPRPARASLRNIETNVSLTENT
GNMLIGESLSRILELNRPRSCHLNLTRLLALGWSAERIRDEIHKHFDLVV
FLMANAIRKDFDLGNLADAVSALETDFMVFGIGMQDSLPPTLSTLPEGSQ
RLLRMFDQKALIFGVRGKETENWLHQVGLNRAKALGCPCLYVYPQNLLAV
SPPSGGSNAMAIASGHLTNPSIRSQKLISLFRDADSHYIMQDEFLTIART
CKDDERLYNDATGRVRTELCRPIFERILGSRIPFKNFWYFQNLDAWRVFC
AQADFYLGDRFHGGIVAMQAGIPSIFIWNDQRARELTDFFALPNISVADI
GDAKAHDLVDSLLTKQAFTEFRDTYRQRLDVFVRTLGEHGIRLDIDSQYY
TSKREDMRQIVYKTYRKFFRGFN
>MCA2455 hypothetical protein
MAKSESFLPGGGRGGVMDEHDRLVAALKDIIVLFSLMAAVGSVYHGNWLA
GGGYAGLALLFFNEKQLRRWWRGRHGRR
>MCA1897 hypothetical protein
MKRPLLILAAAPGLALAIPATPVMTLYQFNGPLDIPYYDADAFLRNGPAS
PAGTLSQGSSVIPCLVLKNGQPLTDATGTPYVGFKLVVDSRTATPASAEK
FKQAVAERKTLVVANHHCDASVRHVIDVRKLYPMEKAPFFDPPRELARRP
VRPDQGELDRIVKAFHDSPQCESANGSLTGRRSALARAWDQFARANPGHW
PARALEQAKQLDYVMRTALFEGHLERGCNAYGACERNVIALSIRNRGKEG
CSSGQGCGAPGDFQGVASRPSQYNIWDEYLTQVTGLTACFLRQDLSQTER
YAKLQAMYAQTLPDVQRILFGDDADLREIFPFVPLTDLKSLKHYYHAPAM
GKCFPGHERAEYIAGAVARKGRDFALIANTRIQVEDRADGGYFFRDFLVT
AKDDRDEIRIVDNYPGFVIDARKVDLKPTARCLPYGIPAGCEFGEPGRYR
TTPVWLNTGRPLELRCHLEDRGENCQAPPVAKTVGVGGRCDTQMRPVAGV
K
>MCA1174 conserved hypothetical protein
MDFPALSSFVGDAGEGGGAGKAPSSGTKALLIRTALLALRLLAGFCALYP
ASTWSQGGNGVANPLDGSSGYSNFSPGNTYDPYNRTPQYVRMDQPLGTEG
RLDDEFDFRLPIVGRMRVEGFYDDNVLNTGILKKGSFGTFIQPWFFLPFH
YRSLTAGLGYSFRGNVYENVPQNNYVDNYVNAFSDVRFDHRNRLSLNGSL
SYAHDPLGTQFVQGDASLLLKNPNEWQSQSFGALYRYGAEGAKGLLEFRF
DYMNRYYQNNPEFTRGRSLNQYNVGGAFYYRVLPKTSLLFEINESFSDYL
NPLPNGISQTNNQLRVLGGVTWRATAKTTGILKAGVLVKEFDEPAQGLGA
NGQPRPTFDAAVLWQPRSIDTVRVGAVMITNESNYLGGAVNNQTYSVMWN
HQWLPKLSSNIYANAMENSYTGSDRTDKGYMLGVGVYYPLQRWLTSGLQY
TYTDRSSTLEQYSFTRNLVILNLQMNF
>MCA2951 hypothetical protein
MAEISTKRCTRCGQTLPKDRFYRQGVRLEATCKACVNAARTAKRRAAGIA
PRPPAKPRQPDYSPEVWDRELRAVNRAFNHTLKTIFGSLHRCNPHPTLSR
REMALTQGEHA
>MCA0751 conserved hypothetical protein
MTSSHHGPYGQGYTRATMAGTEGSQAARWSQSQKAGLSPDCSLQLDCMKS
ESLVIADQHAAVNTFPGLVHTARHTMGVGCTRSR
>MCA2632 conserved hypothetical protein
MSTKHRGRFEGAPVTQRLPTPAGGVKLETFVPWRLVRRGIRRAVIAPPGA
PRAVEAHSTGAGPPRPEAKDTALMRALGLAHHWQRLLHEGRVASAAEIAQ
AEGLDVSTVHRLLRLTLLAPEVIERLLGSPDLAIEKVLGRPWPYGWREQV
RLLD
>MCA2920 conserved hypothetical protein
MTRYRYDGPVSSVTLPGGLDVNLHPGAEVELPEDNDYVGVLVAKGFLTAI
ASNRKPKGADAPPQE
>MCA2645 conserved hypothetical protein
MSGKCWVCKRQARGFGHTDNRHGVGDPRRYPIDWVFCSRRCQEAFHALYG
NWLRVKEGRADIEEVAMIDPSDVELAAMKKCLKAFGEAAGEIGFGKPLGD
YSEAEALQVIDAIVTCYTEAMVEHHEATKYPPVRGLTNPVSDPFADMKDD
LPWEVKP
>MCA0646 hypothetical protein
MFNYAVIVTAALAGIVLVDDDPTSVSLEEWVLFAVMIYAASSFMRLYRR
>MCA0217 hypothetical protein
MTTPEKIESLIHVRFAPNGTVMEIGERPPGVEAQTWFNYLSQNTSNCYQA
LAGGRGMFRLPRPQVDALKAALSSAG
>MCA1910 hypothetical protein
MRDAVQNATEDAAKVKEKLAGMDVGRSLSRFAYTSSYMVSYGIVYGTVFI
ARAIPQDNPIVEGFIDGGRAALDALNEAKETPVTAPTT
>MCA2722 conserved hypothetical protein
MTSSHHGPYGQGYTRATMAGTEGSQAARWSQSQKAGLSPDCSLQLDCMKS
ESLVIADQHAAVNTFPGLVHTARHTMGVGCTRSR
>MCA0863 conserved hypothetical protein
MSDETFERPAITGIGRRLVFMVFFMVVLGAVRFVFWAIVAFQFLSHLLTG
GVNPNAVQSGARLAEYIYRIMLFLTYETEAMPFPFGERTGRPDPRGR
>MCA2953 hypothetical protein
MARKPVTIELAGGKSPRQRIWEMIRKLRIFTVPELRGHLPGPVPLATVRT
YVESLHAAGILEKTPGLYELIDDRGIEAPRVNKAGQPVTLGQGNERMWGA
MEALGAFNCRVLARMADVPLATVKTYCAYLQRAGFLTVERAGKGRGAGGV
PTTYRLLHSRITGPRPPMITRLKTVYDPNVGKIVWQQDPQEQLDD
>MCA0951 hypothetical protein
MRSVGRPLKRVGQVWEVAFAWLLVLPAAAPVQGVEPGEQFSPSGSSPEHS
AGYATIYREHVHSTEPDRPRRPGSTPAPRGVAPAPRAVTVREPGPDLANF
PNSSYTLPEGGIYLEMSPFTLQGASSDSVKQYNWEFLFRYGLTDDVELRL
FTQGLTVQGPPQSATGFSPLTFDTKIHLYKDDFEYFNYSVGLEAYVQSTW
GSPFFDNGTQGSLTINADHVLPWGIAFNWNFGFASLKDFRGAEVILPSLQ
WAFQRDVVEDFALFIQGYVNAAALPRTLHAGAVGPVKTLQEHAVGAGFQW
TLNDRIAVFGSYSGALGAYVPSYLGTMGLAVAF
>MCA1489 hypothetical protein
MKRVSAAVVGLLFSANVFAIAQHYVRQDGGHVQHMKINKVGDEIGVEVDV
DFEPVGSVDEGKRPCSHTVSGSAEKVADNELVLKKQVEGEARYCELKIHL
KGDAAVIDQSKDCDYFLGTYCKFASEGKPLLLVK
>MCA0581 site-specific recombinase, phage integrase family domain protein
MKALVPAVFNRANQEGITDRPNPAVALKCKLIQSRERFPEPAESPQFFAA
VADSPLADFFLLAILIGARRSNVQEMRRQDIDAAGGVKAGAQIVPLKKKA
KA
>MCA2905 prophage MuMc02, tail fiber domain protein
MLAALLGQHQAKPMAELLQPSLRDARGLALDALIERISRLDQSALLVYLV
DQVHPSALPHLADQFHVMGLEGWDTVSTDTERRALIKSAIAYHRLKGTLA
GLTWAGSRVGLSILRAITPPAKTYLAPALTRAERDAFLSGYPQLRLYRYR
TRGVRIGSGWYCSAAFVKGRHYPVVTDAILRLGWRAFLWHRSGNDSKSSG
ADSKSNGVETPVTTLVREIQRAGAEAELSLAIRKPGAAGLGTYPGRLVPR
SYLVRQDGGGRLFNVRLSTPYLDFDESVHTHAVRPGLDPIDIRYTTIAEQ
GIEHGAMLGRFVAGHLADNGARDRLYRRFHLFDPAVPVPRRGRSTHVGAM
KLGMPAYTAELAVSMPARRPPRLIGRFATGYWAPASVSRLERGREAMALA
SAFRDRVWLDTHTRKQVRAGYAVRSGEILSGQFITV
>MCA2695 hypothetical protein
MDLETFIEEALLQIYRGVENANEALNAVRSKKGSTEFPKMFLLSPGADRE
KGNGVFF
>MCA0469 conserved hypothetical protein
MNIRGTLLGAVLAMTAACADLDIKVHTDYDPAADFSKYKTYSWIKTPQTG
NPLMEERMVQTVDAQLSAKGWRKIAAGPSDVALGVQVTAQEQERVDTFYS
GWGGYGHMGMAQSEVIKYTVGTIIVDMFDTRTKQPIWRGTATDTVSDDPQ
KNTALMQEAMEKLFRGFPPKPAGS
>MCA0933 CRISPR-associated protein, CT1975 family
MFLQIHSLTSYHATLLNRDDAGLAKRIPFGDAVRLRVSSQCLKRHWRESL
KQTIPLPTGLRTRHVFEREIYPRLKQEGVEDSLAKQLTLSLMGLLLQKSD
KTAKPEKAKKGKNGHEEQAEFDFEEGAGTEESSAGDLRVKQPILFGRPEV
DYLISLLKACAEEGSGAEKALQAKLKGDKANFKAMLKAAGHGDLYAGLEG
ALFGRFVTSDVLSRSDAAVHVAHSFTVHGLDTEVDYFTVVDDLNREEETG
AAHAGDMELGAGLFYGYVAVDIPLLVSNLTGCDTTRWAEQEPADVRKVLT
GLIRAIATVSPGAKLGATAPYAFSEFVLLETGKQQPRALSNAYLQALPMR
GDPLQAAIDALAKYLRALDAMYGRTSDSRSVASTRAFDADLAPTNSLDAS
IGAALDAIFPPAKA
>MCA2668 conserved hypothetical protein
MSDLETLIPQSVELVIDGEPLAIKPLKVGQMPGFLRAITPVMQHLTGGEI
DWLALFSERGDDLLSAIAIAVGKPRAWVDELAADEAILLAAKVLEVNADF
FTRTVIPRIDGLFARVTAVQAKAAGSTPSNT
>MCA2452 hypothetical protein
MFEGGLLLLAFGFGGLAGVDPVAALRFDLDGLVYGLAGTLPLYLVFQWSY
NTHVASLREIKRVLVDRLGPLLAGCGMADLLFLGFLAGFTEEILFRGSFS
PGSRPTGVGWAAWFSATWCSPWCIGFLLYTPCWRA
>MCA2012 hypothetical protein
MDIPDARWSERPRHESGLAFPAYASCEPLLPQTAGRMCSEGNEYVKMSDR
RAERRALRGLRFRRNRRRANRSSVTPMQTLNGKMIKKASFLVFIALLGAI
PQAGAESDKEIQERALKARELGQHGGMDHSAHAGDETTGRFRGVFYGYLP
CQEENCDGLKMTLSLNDKDRYLLVIQSAKPQNRESFEKGKYQWDDKNGIV
VLTPNKEAPPRRLAIKDEGTLIYLASDGKPLPGDPDRYRLERSDKAGNRE
MHIH
>MCA0459 hypothetical protein
MEPDYEALGRYYASLETFNELSSKRHRLSGELCRMLSHSMERNHDVLTLF
DHKAARMLLEDIIALNAHLIQVTEHLNRHAAECGKPEIRTLYRKG
>MCA1149 conserved hypothetical protein, truncation
MKTDTLFHELFRQWPAVVLDLAGLDPGRAAHYRFRSEELKQAAFRIDGVL
APIEDGDDPHVFIEVQFQPDDTLYRRLFAEIFLYLQRLAHPRGYANLLNL
PEVRRVDLGVLSGQDSETPGWDLLRLIVDDTDVALARAAGMRSFSRGDRG
AHQRRIASKTKPVL
>MCA1122 general secretion pathway protein M
MGGLLKRLLNLKIQVSKSRAAALALLALTVLALYGLIMAPLWALSSGYDE
SIESLLFRLKRFQTVAAERSYWEGRLQEIRNEGEQAKNLFSQATPALAAA
DLQSRIKDTVTSAGGELISTQVIPERKEDQFTRIAVKVRLNGSTEVLRQV
LYEIESEQPVLFVENLNLRPIRVPPRPGQKQAAMPDKLSIDFDAVGYMSQ
KAP
>MCA0881 hypothetical protein
MDLRKKNLILTGVIVLIVIGLYLFSIYKVISSSSPS
>MCA2759 hypothetical protein
MGRPGENLDSDVRQRSRARRKPGSRRTAGGQPLIRTGTHHTAFSAVPSLR
KSARLIRTPLADRLRHRPTARKARRGRDKYRDSARNRAARRTPGPGRSGT
PYRCWSRRRSRRRRRACLRDRRPQRRKDSSRAPCRSRGTRPQHLHGRRPA
HQHDAELRRRSPPHPLRPRRKHHQQPDLRYLHPRRPRRRGIG
>MCA0151 putative membrane protein
MLAYRPWEIGLVTAVAVFAGWLASISQPIVLAVFVGGFVSVFLLSRIDFA
VWLLLAGTLVVNGAVLLMFPRMTKISWALSMLGFFLLAAGLFPLFLGRWR
ERVRLPGFLTLFLALVALSAGTSLLGQGPALEILAGVKRSYQMLGLALVL
ATAPVTAVWRRHFHGWGGFLLSAAVLQLPVALYERIVLVPLRVGMGGGIV
PIDIVAGTFEPNFEGGGENGTMVIFLVACLAYVLTAWRERVLSTLWAAAF
AVELGVPLFLGETKIALVLLPMMFLLVFSREIRRNPAVAAAALTTGVLLT
GVLAWLYFSVFSIQGKSPEQMVQNTIDYNFGSAGYYGNKASLNRTTVLTF
WFHEHGLHNPAETLFGHGMGASYAGAGTLVEGHLNRKYPGMAINLNTAST
LLWDSGAIGAAMFLGVLITAWRASGRLLSRCDDGAERGRLVALRVCLAAN
VFSLWYSNSLMNSPSHELLFAFTLGYLAWLVRANPLPQNFPQRHEHQTAV
SHA
>MCA0645 hypothetical protein
MNAEQFLLALGLVEEGGVWRRKTDGGMVGENTEAALIAMKGDILRMAFEP
GIDEPLSMKQAMEALRLLSALEKR
>MCA1397 hypothetical protein
MAHDEKTSAIGRFRMSHKHPVEAVESLVMADRYGRPAGLRRGFEHPGHEA
GGRAPGDFDAARPDRLARLLRAAAAYAAGGGHPMQASPERGDAEIPESIR
RLRQP
>MCA2413 hypothetical protein
MTHRIHIVGCSPRSGTTLLNELMVTCFEIGGFAEHEQSIYKPYDYRDEIL
LTKYPLQTTVVAPFLAVDPDLWVIYLLRDPRDAISSRSHRKDTQRYWSNL
GLWRELHGAATKLMSHPRFITIRYEDLVTAPDAVQRELMARLPFLELRHP
FSEFHRHARPSRDSLDALGSLRPIDPGSIGNWRRHKPHLVAQMAKYGDLS
KLLIELGYEKDDSWLAELEGVSPDDSGSLVLPERPWYWHLGKRMDLYWRA
AVYWLTRQPALTPTLYQLRTRRRQRKSAR
>MCA1719 hypothetical protein
MKESIFSLGSSRLLSGGGRTGHAHHDRTSAGGILARHAVERKPNLAFQEP
ETVCPLIQADAIHGMTGTSPFTPPDRARFSAERTGSPLDLDRRVDADRPR
SGIGRGGAVGGTPAEARRDPNVPIRDPSRPLSGAAASGRFRLQRRDPPPR
QVRLRRPAPDRPARSRRRIRCGPSAGQAPPGRDRVAGTAAAGSLIASQGA
AADSRFWPNLPARRGVSCRSGGSPDRGSLSLKKPRAQKPAFRNGAGDPVV
P
>MCA2833 hypothetical protein
MVAEVRVSPEHDADAPVRFDGHLLDRLCKDTLASPGDAVLLDRLRQAYPG
YPLHIARLGHQWYRLGGIVKPNGARVAADIGEWAERTYIECGQNFNTLLA
HCEEGGFLATHHTGVTLYLVAQTGPRAEDFVQIEVDRTQETADRYLVAPE
NPPEDLEGLIDPIDPVGVQPFAVGAARYTYRRKTEVALFMRELGRHRADR
HPAQRFMDDWNASSAGQHKAFCEDWSLRLFQHIGRHGEQVMNVEIVLNRT
REVPRLEGPDGKKGKALAVLLNRFDAQAGHPFAWYFYMLKGLVSPHVGEA
VQRDLAKDYAYLPDRDTVVLKQWATAPYCL
>MCA2245 hypothetical protein
MKKVLCALGVVMMSLVVVGCGNSPDGDKQKISSTIASPSL
>MCA2228 hypothetical protein
MAQRGLLRRRRLRLEAMTVHRERRCAALAGFAAEKSNTSPGFVPDARTRL
G
>MCA1444 hypothetical protein
MHQGSFQQCGLYKPHEQGHCFPLRGGRIHKYRLNHILTINLLVPTLAIVS
LRNYECLREGEANRAVHWDGIRPHVRADWHTTKKEENNEMNEETRTGKDY
FWIYVALATAALVGVVVMAKLSENEKYDPIRAQQNEEVARMNIRVLN
>MCA0839 hypothetical protein
MKIRRWRNAGGGRQTHPRSRSARSRSVEGFGAEIETLESPNDSRFAVAAI
LHACAGVSHFKRGVFPKAVYKSRGCGGFAEKAPRGSSSTAATGRSSKAPV
RQEFRPSNLTTEEYSMTEDRMQLSELLEKAVSCFNRWNNPLVKTTPEPDR
FDLATCRIPFGVTGGGNQCDGKRLHSVAPRHGRRRLDSSTER
>MCA2160 cytochrome c5530 family protein
MNNSRRPQSFTSSRNPFLACLSICFALALAPESVLAAAKLKIAKASWSEK
TGILTVKGSLKNSAGPVEIYDINGRRLVAIGDSRQGGAFGASISRDALAA
VPCAVRVQSGDGEAIKVVKGAPKSCSGVPTCSIVSPVDGTALQMGIETHF
EANATARDPAALPLKYEWDFGGGAMGFPAGTLVQSGSVGEHATFVRDHGI
YRVRFVVTDALGRRCEDSIAVSVGQAPTAPPAVASLAAASVQAAPKFGSE
LEGKAGDVVVLPFAHSAAMGQGVVNLDAGNPWPGFFHLNAIAYTKARQPL
NVDADAYELFYSAAVNPADPVGADSINSTSRNYPVGSSFSQAQIRKTDMW
EAQEGWETTAGYEPFWTVDENTPTVSPLWKDPPTITKILNAAFPDMWGQG
ADWTWYLARNSVPDEGYLTGKFETRMTGNPGDKDNFKGGLMPGKDNPYQA
NTPQAFLTREADTQAFLALGIPLTDMDDQGRVNPFPVMRVEVRSKSTGQT
AAAADVALNTAKDVRCSECHVYGGIGADPTVKRYLRPQKKDASGNPVFDG
YGNKVLLAGEQNPRVEYFPDYMNSNPKSLEEKENEAFWNQYAIHAFMELE
DRTGALLDDWDPDADGVAGEGNSHPWENGLRYIKQFKVPEPCQWCHQSAY
LNECGYGTTNWDGLEYGPSEHRFHGRIQVDGQGKVIRDAQGRPLMWDDQG
TGKTNPNSLFPVVDKQGNKVAMEQNCLKCHVGASQKGYHDPMYSAGIQCA
DCHGDMLAVGAVFAKKAQGDQPRNAKGELVDPTKLDEQGNPTPVHRVDYL
DQPNCGSCHTGNGSEPVRKLAYDAADPAATPLLPENPRFAVNSAKIAFNY
LDWDMNRVDKSYDLPLFRKSLDTHGKVPCAACHGSTHAIWPIKEPGANDN
VTALQLQGHTGTILECNVCHTADSFARKEDLDGGQYSGDAQTGILGGPHD
MHPVNDPYWWKGAQGDGANSDGTAYGGWHNDYAKLPGMKGEDQCAACHGN
DHKGTRLSKTPVDRVFDFRGFDGKKLKKAGFKTKVVKVAAGTPIGCDTCH
SLQTSFKGAPGH
>MCA0411 hypothetical protein
MMVNPVTLWAKTGDSWLQRLEPKERDMLRKLGTDTLLEVMNKARFEVVTR
QIRAQGAKSADLPHSCGPARRGGVTASSDSPGVRRALCVACMLLRRMA
>MCA2071 conserved domain protein
MTESARATVSLGFTAELFPPGTHMCYIYNDERERLEIMSEFVRSGIESGE
KIAYFVEEMSPQALRDYLAGLGIVPTEEAQLEIAPSVDVYCPEGRFSPDR
MLGRLRSAYEDGIREGFNGVRLTGEMHWALRGLPGSERLAEYESRINLLV
VDCPLTVICQYDANHFDGATLFKILNAHPMMIVRGQVVHNPYYLSPEEYF
ARYPVPNV
>MCA2664 conserved hypothetical protein
MSLVEQIYDAAAHAGLLKECLWRPADGSPPQSHPVGFAAPDGTVLDGLAL
STEYAISYPASVFTGLASREAVEIDGATFLVRDIRAVGDGSEMRATLTRA
>MCA1382 hypothetical protein
MAPHLDGRQHQNRRQHRQQPGHRNHTEQQAAQPDPGERSQGHEGDETALR
GDHREAPVATVAGKARHHGGQTYGEGQAAGEFEIQSEQQHQGGYEQFASR
NTQNRRDHSDADTGQRTGQDQGDALQPGGRSGLHIMPPKQGRCDSHQQYC
YYPVENPRLEPRRPTSADPGPAETSGQQREDDVPMRQYAGKGDGAGAERQ
GGCHYDEAHRLVQYDRLQRTETEQTDQQREPKLCSADADQPPQHTDRGTA
PERGERRTDILGSGKHVLIAPGLRA
>MCA1144 hypothetical protein
MKPKWLDPRQGRFVKDRRMAAMSGAAQCCVPVVASVFFLLAGCAGREEKV
ESVEELLSASGFTTFYAQNPQQQANLEKIPQRQLVAHRTGKAVTYLYADD
ELCDCVYVGDQQAEDRLKQLAARKIQADKRLMASELKLDMTENPQMWMMG
SGFGY
>MCA1608 conserved hypothetical protein
MAHRQCPKRLWLQVHRPELTEWADASSIAIQHGYGIGEVARTLYPDGWLI
DGANLADALADTRLALASKLNRTLFEATFEHEGVLIRADVLIPEEGEHRL
VEVKASTRVKDYHLSDCAIQAWVCRQAGLSLNRVELAHVDSGFVYPGGGD
YRGLLRHVDLTGTIEPLISQVPGWIEAARATLERGEPAIAPGAQCHTPYD
CPFLTHCADAAGLPEESEFPLRLLPYPGKLKAELEAEGYRDLRDVPAERL
AKPKHRRIHRVALSGEAELDPAAAAALSKHEYPRYYLDFESIQFAVPRWA
GTRPYQQLVFQWSCHVEDTPGELRHLEFLDGSGDDPRRPFAEALIEALGE
HGPVFVYNIGFERTRTNELASDFLDLSAPLLAIGERMVDLLPLVRNHYYH
PGMDGSWSIKSVLPTLAPDLDYTALQVRHGAMAQEAYAELIDPGTVPQRR
GELRRNLLDYCRLDTLAMVRLAWYREGTAYLPSPPGTNLPGADLH
>MCA0640 conserved hypothetical protein
MVPGVGIEPTQCFHYQILSLARLPVSPSRQSGCQV
>MCA1701 hypothetical protein
MNASVTYNASRPADNFNGPVSFNDRAGEPQMNQFYLYLERAIAVGPNEWD
VGGRFDFMYGTDTIFTQAYGIPYVDPRTGRPLNRGHWDLHLSSWSDRFYG
IALPQAYAEFNLPMGNGIVVKAGHFYTPLGYEMVTAPDNFFITRPYTFQF
NPFTHTGILSRYAIDANWAVSAGAVTGSATGGGDGTWDEQLGNWDFLGGG
VWTSDDKSDSLSLTAMAGGRSERFGDLWAIYSLVGKTSFLDDRLHYVIEH
THGFADQVQTANYTRNGGKLENAQWYGIAQWLTYALEEHWSVGLRAEWWR
DNNGFRVSGPPRCSGSMNVNGAGEARPYACNPDFSTVYPFQGSGYYALTV
GLNWKPLNWVILRPNARYDWSDAIKIFDAGKRSDQFLFSADVTVTF
>MCA0580 conserved hypothetical protein
MNARLHDEDFYTWTHTQAGLLREGRLAELDTEHLIEELEAMGARERRELV
NRLTVLLAHLLKWAHQPDRRGNSWRRTIQIQRSDVADVLDDNPGLRPELG
AIFGKAYAKARLLAANETGLDEHAFPADPPFSVDQTMSMDYWPE
>MCA2638 conserved hypothetical protein
MSSHFTVVYDACVLYPAPLRDLLMHLALSDLYRARWSDMIHDEWTRNVLV
SRPDLTQDQLNRTRQLMNAHVRDCLVTGFEYLIPSIDLPDPDDRHVVAAA
IHSGASLIVTFNLKDFPPEALRPYNLAAQHPDDFIVDLLDLHPAGVLEAA
ASHRRSLKNPPKTADEYLDTLLAQGLTQSVAVMRQWIVAM
>MCA0189 hypothetical protein
MNGLHGVDRPAMRAAPGFDGPFEFRAGQVFDDDARAVLLETDGYRTPQGP
QFVGKGLEARDHDIGQIGLACSLMLGRCGARGGLFERPASEEAVKLLAPV
HYHRDLRRQPVAFHGSAGRFQNENLYCRKERGDR
>MCA2762 hypothetical protein
MPRYDYFCEANDSVVEVSHPMNDRLTTWGEVCERAGLAVGDTPLDTPVRK
LITGGGIVRSGALKNPEAPPCQSGAPCCGAGACGLD
>MCA0208 leucine rich repeat domain protein
MGTTRSEIEEARDWQGNLLDCSRCEQREGLTEGRCHLGHACVHDRYARRI
DRFFRWNPDLAKSYLEHPYFETRAVAAKHVELFYLPRLVADPDETVRQSV
AQRLPVRSKPFEILRHDPHRDVRVRVAERLEPRDLATMIGDTDYFVRQIV
ARRLPPGLLVKMIHDPDPEVRKVVAQRIAPEWLPTLAADTEMPVRLAAVR
RMDNKQLRQFVKDPDWRLRYEVAGRLELADLEPLRADPEPAVRELVAQRL
NRPRGEGVWPEAC
>MCA1873 conserved domain protein
MKRIPNLCRASLRPVTYGLLLLLPVAQTPAHSRQTQCQVKLYRLVGEYLH
CLMNAEAALARGQDPARNEHIRKACDDAYKKSYRAALKLSAEGCPAVGDT
ESGAQSPLEQEVWQTVAEVRGMVAGTPPEPVPGQLILYNNCTQPMKIMSP
TSSTINGTTLQPYDSISYPTAAGGGLGQNTPNTFMFAPITTDAQCAQVQC
GKWTDIQAAGQRMGYMWMDNTPGNDNLVYAAYCQPTNAAAGQCTTTSATP
CCGSQMNYDKTFGTTFEITPNGGTKNNQDFVDLSTNFGSGPTSPPTLCSA
SGANPDDCVTATANIFFNVPIGVTMSTGAGCTFPQGGPGLTCTDVSCPDA
YQYPVDNKQVACPAGTGYLVTLCPGTSKLPALGQTGATPNKITVQNNLSK
SSPCPNGNTVTIFTSGGRQQVVQPGGGSVTFQGDYSAYPGLGLQVNNWYW
TSENLPVQKGPPQNPDNSGAQFMISDQCVLSQNPPVYGKGIETYEISTVT
ARKTGTNECLITVNENQPYTDAVTPACCAPPLQNMGSVCTGPWGVTNNQQ
PWPPQ
>MCA2229 hypothetical protein
MKTDYLIPAVLMSAGLLAAGNVQAHEARCLPSATHNDPCYYLVLVGFTHE
PALANEMNGLDLWIKKNVGAPTTLDLTKTIPDFDDDTNNGGAKYVPVDVM
KDDTIMNVMIHVLRLKKQTHVTAESDKNVISATMLKPYYGYTMMGPLVQS
MDPQFNYKYSARFMTGPTGPRTGDSADVQGVGAYGFHITAMINDETYDEF
FVCGQGTTNPTGDAFSCVGVPQKIK
>MCA1530 putative MxaL protein
MKARVGRPFAGLAAAAVLTLLAWADPRIPLERPVFRYLFVLDITQSMNAR
DYHLEGLPADRLGFAKAAIRRAITDLPCGSQAGLGLFTTQNVELLFEPLE
VCRHAGVIEDVLEHVDWRMAWAADSFIAQGLNAALRIVKKRDPAPRLVFL
SDGQQTPEDPVRPELKIKPGEVQGLIVGVGDVKPVPIPRLDRENRPLGFW
EKADILAPVTTTAYLDASADTSHRRGNDGSLYLSWLHETELRDFADTAGL
GYLRLDTPERLSLALRDPALGELRVVPTGIGWTLALIAWLLVIWPHLVPD
PRARHVSG
>MCA0161 putative hydrogenase expression/formation protein
MQDKFAFIATGNTPLFRLHDDPECTCPPMPEAMATFTPPPLHALSRDETG
EGCAVLQKVIERLGGAPGIVDLAPCDPAARRFVDEVLGEGEVLIRIVTPR
DRVTIRESVFAGVWRVQHRIDGELRRDDLETGPVPEAVYGWAERLTVDGP
PHRPQSFPEGLMNAPALLTEIFDHSADCPNDRPHVINLSLLPLTPEDSKF
LIDTLGTAGLSVLSRGYGDCRVTLTRLPNVWWVQYFNSPGQLILNTLEIT
RLPAVVAAAPEDLEDSRERIAEALAQLK
>MCA1323 hypothetical protein
MMLLSFCSAYVLIARNPGVPWGDATENDQLCGTAVEMYTGAAGTFAIFRP
>MCA2036 putative membrane protein
MKERWQAVLVIAGLVCLSFMMPLVGLLSSAALGLVVLRQGLPAAATVLAL
SAAAVAVFGGVVLGSVAAPLIYALLLWFPTAVAAWVLRISRRIEWALASV
VVPALMAVLAVYGLIGDPSEFWNEKLIRLVQPLLDQAPDGLDADGARFGL
RVAAHYATGFVSAGSALSVFLTLMLARWWQSLLYNPGGFRAEFVELRPAP
AFAYAALACIGGAMSLPAVAELLWNLGVVFFVLYLMVGVAVIHVLLSRRS
AGKFWLAGFYLLLFVIPQVALPVALMGFTDVWMDWRHRRAAGV
>MCA2555 hypothetical protein
MSMNFKKTAIAAGMFTALAATSMSAHAVRETEAGEANLVPFVLWSSSVFD
NNPTIFGINTVIKLTVPMSVGNDVIPNFYTAIHTSPTNGTINKPGKQKPA
DPDLVPSNTVHWYFMDQTSVHRLNGTIDVTPEDVAVLDWGAFVRKNGKQG
EFDGFPGYMVLVTEAGAGGDDADFSFFAEAWMFAGVRAGANPDTGGVIGI
VDAKIPVMPMSDGADNGSKKPTVENSVIEAGVSQSVIASPLVAGIRTNWS
DGNGSDVTVVDLTLGNRNVPIGNANLINALQVPTLLVVWNDRNAGGKWSG
LGVDIYNDKEEKCSDSIDLPYQLNLVWAQTDVTAGKNAQIPFAWPVPKFI
YGKNPYTGKPFGLDKIFCVPPYQATPVTDPDGAIALEQLLQGGFMKLYLP
ELIDTGIGAPESAAVAWSVPLQYFVTLETDPATGDWVPTDISLIPFETAL
GHDRGLFSQAP
>MCA0111 conserved hypothetical protein
MPAYPRAELEEMVARWLAANRDAERAGDWIRSLGAMYTEDAEYGWNMGPD
QEFLARGRQQIMEWALGFHMEGFEQWRYPYDHVVIDDAQGEVIGFWRQVS
PARRPDGSHYEVAGLGGSWFRYGGDYQWSWQRDFFDLGNVKALFMELAAN
GQLDAPVRRKIARLAKGQAMPGVHPLHPAPGLLAKLKGYYAMARIVLSGG
>MCA1043 hypothetical protein
MGVAAFVDKIPSFRQNTAACLGDFIEIKSWHTQWRVSWC
>MCA1290 hypothetical protein
MIKQFFLSVLAILVLFEEWLWDLLTVFGQWISRLLHLERFDAWLSQAPPK
SALVALVLPLALVTPLNVGAVVLMVHGAVTAGILLELAAKLLGTLLVARV
FRLTRPALMSFAWFALLYEWIMRLLRWAHALVRESRLYRAVLAFKETVKH
FFREFRGG
>MCA0340 putative membrane protein
MQKALFNLVLRGLEKQVPATGLGLFRLAFGLVAFQEICFLYYFRQLIFDP
VPYLDIASPSVHLFLVLWAIAALCLALGLYTRLAAIANYLFWLVFTVFTP
MWKDFDGGFDQLMLGSSLLLIFLPSERAWSLDRLRLAWRHSTVDRCYALP
RTVPVLCYFLPLAVSLGFIYFDSVIHKLFAEFWRNGLGPWLPSSLPYYMS
PLDMGWLLEIEPLQRAIGYTIIAFQFAFLFLLYFRRFRVPLMLVGLSLHA
GIIVSLNIYPFGFGMLVHYFLMVPFRWWRTLGRTLRPAEPALQVFYDERC
PLCLKTVLAIEHFDVFRAVEFRGLQTHAATAPALEDIPERDLLGDLYAVD
REGRRYSGVATYARILVAMRYPALAGLAMRLPGLATIADRVYRRIADNRV
RLGCDASCAPAPGRTEPDLAQRIGRWVGGSLQQRANRISRMLVVVLILQL
NCTLHYAILYRLGVDTKANEAGQVLTMLSNALISASHTFLGITPHPLYLH
DHFQGYEHILGIVHLDADGKERWLPFVDEEGRIVSPNWGRVHSMWANVAV
TRHMDPRRLDKFVRKVTAFWGTRLGLDLNRTTFVLKLKTVKAPMDWEPGL
RRYNLAQPWEDVGRAVWRDGEMRLELDRDLEALSAD
>MCA0804 hypothetical protein
MSLKKQIILRYNGVGHVRFELPAPLCEVSARTQIEAALRALDGVYRVSLS
PGSRKLSVRFDIAVCDLKVIARRLAQLVDAGVGQSGRVPGRVGIVGGRPF
GWLREKAQEAGETLAAMKIVAGRTLKKSPRLLTPARERGFIEFFNDVLVL
YLIKLHWHLITQHWLRQPLRYRYEWMAVFYLIFLLVRSRRPRNV
>MCA0455 hypothetical protein
MMIVVSHDFHRLGQFKRDEIADTQVSEAARPGADRYGDPLSIGEQEYHLA
DGWVNGLDRSFEGPRTDEGRFRRGLSSASGKDSGGVIPGRCGGQGRGVLA
GGGFWGGFGASDDSKAGAQETGNRTATLFREHCSDLCQNWYWIWRRL
>MCA1912 hypothetical protein
MTRRARWRPRSPAPFASPARLETAQERIAHVRTKHPHIPWPQKQNRARRP
GKRRRAAGSEVSHIHRGYRLDLSRRAGDPGQFRDDPLFPGW
>MCA0932 CRISPR-associated protein, CT1976 family
MDILLLRFDAPLMSFGGVMVDQHGPTEQFPGLSMLAGLLGNAMGYRHGDA
DALEALQARIEYAARWDVEPEALLDYHTVDLGQEKMREPGWTTRGEPEHR
AGGPAAAFGTHQRYRHYWANGVMTLAITLKDAGTPNLDDLAAAIVQPARP
LFLGRKTCLPAAPLLAGRVEAENVLEALRRTPRVVGRVRPERREERNPTE
DRKVNACWPAYLGETEQSRQTGIYDRRDWRNQIHAGRRIRIEGLLEIAP
>MCA0649 hypothetical protein
MHHAKRATAGLAEHREQIEVFHIPRNRPGSIPMDARAGTSKPNRDHLTPP
PALVRKSADAAIPCGPSSRKRE
>MCA0180 hypothetical protein
MKCSVAFSRPLRPFGRRLAASADKLLCCHRPGAVCGDAGGLTGSGEGTMN
LKEEMEKFRDGLLQQRDEIVVQLDLARMNIRDEWEKAEEQLEELKARVAR
AADEAKDASEDVWAGVQVLGEEIRNAYERIKNKL
>MCA1546 hypothetical protein
MKRFSTLAGIALLAGLSAGTAAADVYLKDASEIVGTWQLETVSSSLKGPR
IEENRTWEFRPDGVIVTSGYNRHLKMNDVHEWKYQIVDGKISADDPGRPG
KTIDYAVQEKTADSMILRGGIEGFYFFKKR
>MCA1823 hypothetical protein
MTTGLDKNDTHDDRQKKTDPLRTSRHFPDLGRFCRRTDARGADRIRRPPA
RTFPACPDHLLFEGVRTRSSGSPAQGGLDQHDHVHHRADLGHPETGAESR
PQNRRRQDLVSRSARCAPRLVARPGLVQENRVRGRRRGGAGRPRDDPLMR
PNRSVRTGPASRGN
>MCA1577 conserved domain protein
MKPTFAMGAMLLAACAGPGLKIGFDENYHFGGTKTWSYAEMPGSDKAAAD
RLRLDTLMKDTLEPLLAAKGYTRRGTGADFLVGWSFGEWKIDRRSRSGSE
WGAVGLFYPGMHAVPAPKAPEGRALPPSIDPYGSSHEKAKLDLVVVDGSS
RRVVWHASVEDDGDFGYDPDVQRTEIREAVERAFQAFPPGR
>MCA2902 hypothetical protein
MPTRFDRYRFKDGVTPLSEDTFNAILQDIDLRIAALEEVRISWQAAVTLL
TDQGLLRINEALAPAIETLQYQIDHIVELASQVQVDRILDAPDQVTDVHI
GNRTADPALVPVNNTGTLTQWLGRLANRLKAITGAANWYDAPATTLAAVA
ATLASQAAQLATAAAHASNVSNPHNTTAAQVGALPVGGGNLLGALGLSGY
PISGVKTLGFQAEYDNGNSGTAKTLSLVNGQKQKLKLTASTTLTVSSTGA
PVGNYVIRLIQDATGGRAVTWAGLSGSRWLGRATAPSVNAAANGESLLSI
YWDGASMIQSLAKVGAA
>MCA2503 hypothetical protein
MSAALLGASEAALATPNMYWDHLRSSLSQADCVSRGESLMSSLTTGRVSK
DVDSVRSWTDKTLAVVECIRMGDHMTVMVLVGSDDAVAGSKLLDALKKGM
Q
>MCA2950 hypothetical protein
MIKIQRYPRLGAYGWKALVTVGGKAREVLTVGTWFDIRRLARRAEEEMAE
KPRRSPCVDAARQRQLDVRRGLQGAAP
>MCA0716 hypothetical protein
MFSRLWCLVVVVVSVAGIPGEAPAAEVPGVPRCESTDSGASIRVSPRIPV
PGEPLKIMAVSADVPIQNLSVRAPDGTSLPLEPERRGGPPWSLAAEIGRA
EPGNYRMEARGAEGTVACLDLNVDGGTAASRPGRWDRATEAFFSAWVETL
FDYPLEDSVNLPSLKPVLTDPSRNFLYDHFGAGEDRHIPTDPDCADLPYY
LRAYFAWKTGLPMAYRACSRGTASQPPRCGAATVADGFARGPVSAGMFTA
VVRKVMDTVHSGSARTGLRDGATDFYPVPLKREYLWPGTVYADPYGHVLM
IAQWVPQSADRSGMLLAVDAQPDNSISRKRFWEGTFLFADDVAGAGPGFK
AFRPLLSAGAAVRIPGNAELTHSALIAPFSDQQGDLSPEDFYADMGRLIN
PQGLDPRQAYEAMFEALMEQLETRVASVENGEAYFRKHPRAVIPMPAGGA
VFETIGPWEDYSTPSRDMRLLIALKVMSQLPDQVVRYPELFVLHGQSPSE
VKTQLERLDAERLRDASIVYHRSDGSLWRLSLAELFARRKALEVAYNPND
CVEVRWGAERGTEEYSTCTRHAPADQAARMEQYRVWFREARRPPR
>MCA1405 hypothetical protein
MCYRIEILLLNIRRGTDGNGTSLDMENRRQPGSSPQEGRQSCHSGRTFFN
HEALLTRTSPKSIMCAFTPGALVGVSIPTVA
>MCA1594 hypothetical protein
MDDPPASGGKWPIMPPVQRAAIIRTPQWSITSRIIDERRDEFLSFIPLSL
LSDTAR
>MCA2938 conserved domain protein
MLTPQEKCVGGGALLMLAFCIGVLAGCDCLACEPTGYAKFADTLSVLHGE
RPIEFGAGLQCEFTERTDAGSR
>MCA1026 hypothetical protein
MAKRPSSRTSQDTGVESPPTTPTEATAPETSMESASGTEATKSTAITAVA
VAVEGLKEGASQARKAAGEFVPALARTLSKVVYTGSYGITYGLVFGGLMI
GSLIPKDSALAKGMCDGADSAVKDFTRRENERAALAASEEGMATT
>MCA2846 conserved hypothetical protein
MNRFAGWLALFGLLAPLAGWNAEPGFTVRRAELFQHEDRSWGLDADIDYH
FNETAVEAMQNGVPLTLVLRLRVKRARPWWWDETVISENHRRTIRYHPLA
RAYQLTAAESGVTENFATLRALLEELGRIRNLPVQAPRPFDSGDYHAALS
VTLDIESLPLPLRPTAYLSPQWHLNSPWYRWSFAN
>MCA0506 hypothetical protein
MREAGAGFGEPDELGQLRECHVAADLAVGVEQRFGGQAAELETAAQSFDG
FLSSPACGRGGITTGLPFVTRRSGLRRRSFPGTSDEARAKNRPRCLSCTR
QGHRRAAGRGPVAKSSGGRAFQKRGTMSRPGTSLAAGVDATCHLPKPCQY
GVLTNPEKSLRKVGLGEGGEAIFFHPAKIRPERSAYHSRPKMTIPVISAA
MSVVPMRKGKSKSRRCRRIYRLILRLSAWSSTIRWSPSSLGRMQEP
>MCA0192 hypothetical protein
MISPGVFGGTENGGLARRRFRIGVDHAAPGNRSGKRLRKNRYLASIVGRR
RRCSGPG
>MCA2777 conserved hypothetical protein
MKRIYFLVPSVELAHAIVDELLLARVEERHIHILAKVGTPLGDLPEASLF
QKSDFIPAMERGLAIGGITGTMAGLVAVALPTGMVLGGGAVLAIALAGAG
IGSWLAGMIGLDVENSRLKNYQSAVEKGELLLMVDVPRERVEEIHQRVQK
HHPEAEFEGTEPTIPAFP
>MCA0280 hypothetical protein
MIDPASQAQLKEAIADCIGTDQGVLDALREEIRPLKGATRRIQPRATTSI
SLVGTDGGNNQLQFDPFLIQ
>MCA2747 cellulose-binding domain protein
MNTRFLAPLSGLVLVATLAGGTPAQAATLPAPVPNAIVSASSSWGTASDS
WAGYTGVLQIWVPDAVSGGWTLTFQSAGLGRQAQVSSFWNANAVFDPVTN
TFTLTSPSWGGDVAANSVLDVGFNANGAFDTSVDLANCKFNGQPCVISAM
TSQSAQQTLANLKAGYQGGGSATPTPAPSATPSPSATPVPVASPKLEVLF
SISSSWDGGYSGNVAVKNLSSKTLKAGANGWQAPLKFPDAATAQDVFKSG
PWNFSVNIAGDGTATLKPKSWAAALAPGDVAASGFNGGSPANLQKAAAAD
STVTVLFAPSVPNSNPTPTPNPTATPSPTATPAPTPVASATPSPSPTPVP
TTPPTGGAGSLLFSPYKDVTISMNWNSNVMSTAVTGTPSPLLSVLPAKVP
AVTWAFATGECGKENWAGIQPDALVQANLQAFVDTGIDYVVSTGGAAGAF
TCSSETGMRAFIDRYASSRLVGIDFDIEAGQSQATIASLVRQVAAVQSDY
PNLRFSFTVATLGSSNGTVTSTPYGDLSVTGYNVVKAVQQYGVANYTINL
MVMDYGTANAGNCVVVNGKCDMGQTAIQAAKNLKARYGIPYERIELTPMI
GVNDVTDELFSLQDTGTMVQWALANGIAGIHFWSVDRDTPCSQTSASPIC
SSVPSVPAWGYTNRFIGDLGL
>MCA1445 hypothetical protein
MLYVWSSRERAGYSSAASIRTAPLLRNRVFPETAAVRRLVSLISERQAGP
SALAWNQPDRTGFDPLALYDHVFHMAAEAAVVAAFAAFLADQLARTFALF
DMPAIALGFTEYPGVLFEVVVVGCGGFGFRSRRAFLGRERIAHQVAIDVF
RGEDAALKQRDQHQEKTGIEFQACVDHK
>MCA0366 hypothetical protein
MNGELDYLLGFVGKNRLDPHRFDAARIGRLLDFVASPKDEARLYRAGERN
GAASAYYESDLRRSLDHLLRLTYSNDIPSVFSAPSTVRTERWTTVDAPGQ
RLPELWAQPTAPDKPVVVTGMEHLVNSPDSHSGAYYEYDLYRTLLLTRVG
GRKLLISLSSQTGESRVGRKGIIVGPDDNWDYLYTDQTGLNRTGLGWAST
YMYGSQSVAFYLETETSPPKTRFGAFKWIRAGWAGINVVNSQHIFSGLRR
FGDVFKRIAEHPRTDDTAILARNFRAVARLPQSRLKRLTQDYLAGLRKRC
EEEGLLADGEVQALFRDGHYLDSLSREDMESILAIEYAKQILGKPHYVEL
ADEFAQME
>MCA0266 hypothetical protein
MIFTVTSPRRLLEAAGLTPQSPEAGASHGPIFELAEKHGWDTDLYRADDL
LGEGVYDPEPVSEFEACVATLAEGLHACRVYLSDESATRALYGDPFYQGR
EGFPAHMALREQLIALEVISDDL
>MCA3087 conserved hypothetical protein
MAARGLRAWGPLILGLLLWLGWSGAVLATEIIVTADREPVSVNESFNLTF
SADESPDGEPDFTPLSRDFRVLSQSQNSQISMVNGKVSRTFEWTVTVVAK
QAGTLTVPPIAFGDDRSKPLTVTVTDRPAPGPAGSAGDDALLIEVDVSPK
NPYVQAQVIYTVRVLYRTRLGGARLSPLEIPDALVQQLGDKRNYSTERNG
ATYSATEIRYAIFPQKSGPLLIPPLTLDAEVQASGRGGFNPFFGRPMKTV
SLRSKAIELQVRPAPAAFSGKHWLPAQNVTLEETLSPDTGRIETGQPLTR
TLTLRAQGATVGVLPELGLAGLPEDIRRYPDQPALDEQRQGTGLLSTRQE
KTALVPDRPGRYLLPAIEVPWWNTAAERMEVARLPEREIEVAAGAAAPIQ
PVPEPTPASAQETAGTPPVKGGEPESGRADSPIWFWLCLFLAGGWLITAG
LWWRTARWSQEKTVMKPSESPQLSGSRLMRELKEACAANDPGRARRALLD
WAAQRWPDCDASLEGIAGHSQGELKHAVQALGRALYGYPRTEWNGAALWA
AIVTTDLGKARAENRGPVALAPLHPPAQASPQSGARRPNALSMTQ
>MCA1940 hypothetical protein
MDSKDFPDWRTAGGHLERRRRVRFGAFSDQDMPVDFSWISVSRYSRLS
>MCA2826 hypothetical protein
MTGPGKRLDPAHQPRRDSMFEEHVQDPYRARGKWPEPTVCPECGAVFQQG
RWRWGAAVPDAERHLCPACQRTRDRLPAGELTLSGPFFGAHRPEILNLVR
NTEAAARAEHPLERIMTIEEEDDRTVITFTDKHLTHGTAEALRHAYQGEL
ESRYTDEEALLRVSWRR
>MCA1710 hypothetical protein
MVSPLGFDFRIMRLLLRCLLLATMLTLFGCVWLRLLEVKNQLEDFDDHFK
VVSTDHFLLEFLHPVLYADDFRELTRMEPTRIETLPKGSRWHVLFEKIDM
NGKRDPDQRDLEFVLGFNAKSRLEVWDFPPPFLAAAPPQFLEASLRSLGK
GEIFQDQRQLKVDPKDLPQIGAELPNQARVKEGLGQPALELDHEDGRLWV
YRYRLNTPHIEDDEGSRRIAVARLWFDPATQTLRKFAGKFIGLKLSIDYR
KFLSAKHAAHTDDQ
>MCA2929 hypothetical protein
MSNKIQHKRNATWGNIPAPEQLEDGELAINTFEGKLFLKRSASDNKVVEV
GTLAPRVVTLLYPTGVDVVPLFYTPTYFPVGYLKAVLVGSGTPSVTFTIK
YGTSLASGTEMVVGGVTCTNTTTGMTLYSTSFDNDTVPAGNWVWLETTAI
AGSVIALQVTLVAG
>MCA0923 hypothetical protein
MKHPSYQFITASNEHRHIICHNFLYLSISNRSGISKQNWNGYDI
>MCA2282 hypothetical protein
MRGIEADAGELHDRQGWPRRSGGGALRRRGHRGKDSVSAGLGQAGPARMP
FAPDRQLPVQRGLVLLGKSLHRREHLPAFRDVGLHRAGQFHQPGALVFRE
RRAGFALQQGADGLEPLAVLRVQAAYFRRAQHVRQRGIEVLFLEGDVGAG
RGVDALGGLARAGGISAPQALRKLVKAPPEHGVVLAECVEQVMLWWRLTV
FHRI
>MCA1322 5'-nucleotidase
MNQPSSLVVAISSRALFDLDESHRIFEQEGREAYCRYQIAHEDEILAPGV
AFPLVRKLLAMNDRLGTPLVEVILLSRNSADTGLRIFNSIQRHGLDITRA
AFTGGRSPFKYVAAFGAHLFLSADAGDVADALNDGQAAATIITAPARSSD
TDQLRIAFDGDAVLFSDESERIYRSAGLEAFAENERVAAREPLPGGPFRN
FLAALHHIQTRFDADGSPLRTALVTARGAPAHERVVRTLRAWNIRIDEAL
FLGGRDKSQFLAAFGADIFFDDQKSNCEAASRHVATGHVPHGVANAPDDA
DDRPPGMEAGSRSP
>MCA2141 hypothetical protein
MVGGSNPLAPTNSQNQESGLGDFTCRFFYCPRDQIVRSHRARSPPHGHFL
E
>MCA1842 conserved hypothetical protein
MKLRLRVFGFVTVLFMAPAFGAEPLLMPGKKTLYQRVLTRPGARIVLNPG
KTEGKPAAALSRYYVYERETLDGREWLQVGGDSRGRIDGWLDAAQSVPWN
QQLTLAFTNPANRSRALLFEKKEGVLEVLKSADPGAMAASILKTVESGKP
DPRLISVEPQEYVDPAKKFYLLPILQAEELTSPRNVRVLEVASVTAKRGE
SSPAETRREVPDAPSVLRNFSAAVVFVIDSTISMGPYIDRTREAVRRVYT
RIEKAGLADQVRFGLVAFRSSTQAVPGLEYVSKVYADPSEVKTGKDFLAK
VASLSPAKVSSSTFDEDSYAGIMTALQKIKWSGFGGRYVVLITDAGAIDG
NDPLSQTKLGADQVRIEAEQLGVALFGLHLKTPAGKADHAKATAQYKVLT
QNQVAGRPLYYDVESGDVGRFGKIVDSLADTMAKLVEGASQGRMVAGSVR
TAAGAKPKDESERITSDALLLGHAMQLAYLGREKGTQAPDLFQAWLCDRD
YAHPELAATEVRVLLTKNELSNLSQTVKLILDQGEKSQETTGTAAFFDLL
RSSAAHLVRDPAKLADPKARKLAELGLLGEYLEGLPYKSAVMNLTSDQWE
QWSQRQQEDFLDGLRRKLRHYQIYHDDADRWVMLDGSNDPSEAVYPVPIG
ALP
>MCA0752 hypothetical protein
MKNFRNATIYKHGHTIVVCMLVTTLVLAVITVRSGILDRTDFKQQDQLIP
VDASDDQAIPVNLGVYVENIYNFSPNQKTFDAEGWVWLTWPQAAQDIFAV
NGIPSSQMLDFVNSVNGWDFAMTPEYSEPIRLPNGSYYQNFRYSGHFYAN
ELNFRQFPFQTLRLAQTFELNSEDEALNAKHVRLIPDTAESGTGEYIDIM
GYITHGSEIKTFIHNYGTNFGLADSDGEPTKKISQIRFEVIYKKSITSSI
LELFLPLVTVMALVMFAPMLSSSLWDVRLGLPPMVLLTLIFLQQGYKTEL
PDLPYVTFLDTIYNLCYLTTLILFCLFMWGSNKLDEASDMERAKVIAQIN
AMDLRFQIGLTIALIGLGTINWFVVGTQTP
>MCA1881 conserved domain protein
MKGKFSLVLVLALLAACAGKEQSDLREALLAKLQDDSDLKDYNLDPGEIA
DCVVNDLTDDLPGFPGDPRRKQYLTAYARFYSVKGSGDFEKVAEEYKDLF
GSVKAAHQAALRMTDYIMTCMGQAIERSGPTER
>MCA1422 hypothetical protein
MVTRQPPALSLRWVVDGMGGRWLDGIYAAALCLGLTACDEPDAPTPDSGF
GIGVNIAGLAYYGTEIPFVDLFKLSGPWLTQCRTGRDPGCNGQGWPPGAS
AWNTLEQDALALDAAGYPKSLSDPARNDASGSRFTSVATLVPTDLNPSRP
AGRFIVRYQGQGTLGYARGAVRNAVLSRPGRDVVDVTGTGGEYWFELAIL
ATDPGGNGDHLRNIRVVPEGGTCEGRPTAFCTDTQTCGTQRACRPFEEAD
PIFDPRFLRNLAPFRAIRFMAYQNTNDSTAVGWTDRTVPDSRTWASEGGD
GGPAELIPALGNRLQADVWVNMPARADDGYVRQFAMLVRRELDKDRKVYV
EYGNEAWNDAFSAGRWMEVQALAKWPGAGESAYGKRLQWYGMRTAQICDI
WEEVWAEEGDRVICVMGSQAANPWTARQALDCRLWRAERGGAPCYRHHVR
ALAIAPYFGHYLGLPEHAAVVKEWSTDPDGGLHRLFTEIFYGGELPGGPA
GGALEEAKRQMQENKRLASEYGLNLVAYEGGQHLVGVGEAGTDPAITALF
VAANRDERMGEAYLRHLEDWRAAGGGLYNLWNSAGPYTRWGSWGLLEYRD
QPGAPKYDAVRAAMPRE
>MCA1869 conserved hypothetical protein
MPKSDNLYSAIPAELPEELFDTLEQTDGFRLERIVSRGHTTPEGRWYDQP
QAEWVILLQGEALLRFEQEAAPRRLVPGDWLRIPPRCRHRVEWTSSRPEA
IWLALHYPEPVRTEALESE
>MCA0193 hypothetical protein
MFNHRKHILEIFVHCSFPPHGIDSQPVILKLVRGREVRRMNTVQAIPLFS
QAFQDVSSYIASIRAPYTLQDIQGFNTAYKRAYPSLSREEKRRIEAFVDT
MIERVAQKELASKIFGVV
>MCA2054 hypothetical protein
MEILSHTAELLLEAMEMLLDTVFEAVLGLTPRGAQVLTAWLAVGVLTWLG
SMLVGKIARGWDERQRRIEDYWRDTVGKAKTWYMRNRLKIILIGACVGLL
TLLALF
>MCA0155 hypothetical protein
MGGKFDSLFSGSAASPQRRIFRRIAGVLAFIVLASSLQAATAAKVKSSKV
RATPTPKPTATPTPAPTPTPVSSPTPTPTPAPGATITGSVFVAMEPYDII
VGNQVQHISLSGKSFTFDFGGSVPFVNPNIGIADFDTINYWDSAAGKAKV
INGWTITDAGISGRVFKTNGYTAIGYKAGDGVVEGALRTQLNSYYVPSRR
RFVWDLCVRFGGADLTKPWTFMPRDSHPGLIWQIKPDGSPPSIGMVVDTD
PTNSQRLAIHIDGSIGPELKHERLGMITGLLPQQDINIVIDAYLDDRPIA
SGGQGYFKAWINGTLVANAVGATLVPNASSPHYWSMAMYLYNDTTPLQFD
YFGYWKRARMIVPN
>MCA2903 hypothetical protein
MAKQTTVNAIDPTLIDDDGQYRVVLAERIVVDGVVLYPGWDIILKGDVVK
SNREAIEHADPI
>MCA2752 hypothetical protein
MCCAAATAPAEDAVFFFNPESSIDSYATLKTAFDTYLAPLGPYVFQPFNV
RATFEQALAQNSRGVYMLSGWHYGEIKGRQSLDAVLVGVSKGAFLQRKIL
SAKDVADVSALKGVTVAGTGSEDYLRTLLKQTLGPERHALVDSFKVLSVP
KDIDALMAVSFGMAQAAISSEASLQKLAAINPKQHSQLKPLAQSEKAFLL
IATVPRSFRQEGAPLVEILEDMDKKPEGVTNLKLLGLDGWKRVETLEAPY
STQLRAP
>MCA2225 hypothetical protein
MPPAAGRRGRARPRQPPRSEPVRQVVVDHRHAAVVQRQAGRMAVGQDRIV
GIELAIRRRGVARRGHREITALAEGMRVGGVVLGGQAGAAGIGCDLLDDL
QLVAGRTVVGGFAPLIGIFRPIHVVAACRVGERGCNAVDGDETVVVFEVG
IAFGALYTDGPGLDSGRIGFAVVGALIEIIQAGVDIPRARVQDHDRSCQR
IGGGSGTQCAVGAGCIEEIGIGGGLGTRSVDPAADIDLVVRDRSVGVLGQ
VAFRLAVLVDDFDFAPFADVAGEQAGDLILSQAGHAALGVLPFAVRVQGT
HVDLGALQGVAQRFGHRRVENQGAAGDHVQALADLRGIHRSHAIRRVHAP
EHRGIDVGDAGVGHLLREVAGLGDPAQPVTARRILLAVAVVDHHGVDRLF
VEHRAHTLRRTAGGALVEQYFGIVDGKPRHFGRIDLAEDSRVCGTGVGGV
RVVEIDIERSSHAEFFEQLPDIAILGRCAGHEDFDVGRYAGFQCPALSAE
SQRACRDRCSDFLDLHELTPVCLRVIG
>MCA0323 conserved domain protein
MRTRQKTRSHLAWAAARLMAEQGIDDPQQALNKAAARLGQTDRRQLPEPD
EVDAALIEYNRLFRPARQSAELDRQRVLALEAMEFLAEFDPRLTGAALDG
TAGRHSSITLQVFADAPEEVMRKLLDAHVPFRETSCRCRLRGENTTLPAL
SFYVDETPMDLCIFPATAAGRAAAAGNREGGASIRELRTLLGRAAGTENG
ETD
>MCA0294 hypothetical protein
MALFVPVSPPPPSGGIGHPPILVCITTDSLLRSRFDALRRSPARLHANTR
RRLSMPPDNRTGMEKQEKKRAALRWGKTAIGGRWGLVAWGISGASAMAVA
VWGLQTVAIPLIMLKTASWGIWGFGVFRESRAHLKQAGHDGKSVPPRD
>MCA2935 conserved hypothetical protein
MNLENFETEQRRLVILRRLIRAPAYTLDQTVLAKALALEGLAVSRDRLKT
DLAWLAEQDLIVGQQPGGVWVATLTHRGLDVAKGLTVMPGVARPEPGE
>MCA2949 hypothetical protein
MKAIVLACLAFIYLIGLGTGLLLGAVLDINLVGRAERKAQARQSDTDDIN
LGI
>MCA0203 conserved hypothetical protein, internal deletion
MVHKSAVAGRPCPERDKRDPLAVEEAGGDRERAVPRLDDTSPANVGKISR
HRLEPAATDQAPPPAHDPDFNKLRETP
>MCA0429 hypothetical protein
MRSRDAQQVFSLALSFVELMAGQAQLAVLGRQPQQYFLVDPPVPQLESSS
ALFAQSLQPRLLVFPHHRIPGVLAVADTGFQIGVVHPAGGGIERIDDARL
MRGVAGRAHDLALPAEGQNDPVALLRVLHVELHPGVGLRFLMFQRIEVRR
LQMAAVAKIFRRQPQLVVRRAQRPVPLPVNLLVTIGAVFLLELAALVQHD
VDVGTLEPVFLVALVTEGIQLVVRPAPQEIRQAAVDEAERHPVDVLDAMT
GRTAQLAVLAQRQAFRRLHLLGQHDAGDVVQGGQLLRGRIAVLGARSRDA
GAEQPRHHSRQPPARSRPRAPAFPHNPPFTGSPWVWHWLQSEVTSCTPDR
PLPAS
>MCA0822 hypothetical protein
MLPKGFMRIRHFGWLANRCRAGRRSTPHHEKRLPRIGKRLRPSTTAIPPA
NEGVQK
>MCA2575 conserved domain protein
MTATLAAGCTSLGKPSPKTRTGAATVATEAPRTETATVVAADATFASSYA
AFRTFAKQHVAAMEARRRNADPGTKTGRGNPAPAEAEAPAAFARPPSYEP
FETYGSQRAEAVTEAFFKGRYNRSRARLPENVEKWRSQPDIDNPGPDLAN
FPNSAFTLPAGHAYVEFVPFTYYGTSRSNPAQYNTEFLLRYGLTDEVELR
LFGNGVAWQGGGNSAWGFAPIAFDTKINVWLEKPDYFLPALGIEAYLQTQ
WLGSAPFDSGTQPSISFNFDQSLPFDIDLEYNLGVTRTQQTPGQDEWEFV
FQWALQRDLFDKDFAVFIHGYYNAMTLPRLPSSQAAVTDDQTQDAVGAGL
IWTVDSRFAMWAQSAAGTTRNSPSLISSVGLALAF
>MCA1877 hypothetical protein
MCATSGMTPPPSAAQDRSHESQEIALVPHPGDVPDVHGVAVAVGDGLVAR
QPRFEKVRKHPAHEFVGRHGLDRGVTLFLQENRQMFVQEGLPVIQRLGVT
RGPFDQHLFQFRAGADVEYQRVFRQCLQRFRIVQFAHAAAGIVDEMLVTV
NLAHFLDEFRHVPHRRDLGLPVQRLPPLEFLQRQATLRRVAELAQEGRDG
REIVHRDVATDIDRIRHQEVAQERHLHRLALDVVQDRLIEIAGTDPVVAG
IMEPGPFRQLVGQGGLTRPRHAEQGDLLAIPVQKLLRGQTHRG
>MCA0943 hypothetical protein
MTEPPRLFARCGITGTPRVSRERVYDYILGFGETRCKSGMNEDRKDRLEP
DSAAALRRLLEKAAARGVADYIESRKAKIPGFVAQHFSFRGAWEMHKKTL
GRDYYRIPVNLLWSLPAFLSHSTAAIAQKLGAKDLACRLRKVPSGLPTAL
QSELNWLIHVELLELPYSDGSRRSTKDALLETILSDPELSARLAEYLGAI
RSRAGSGEFRASLAARLQEYGKIRQTATELAASIATLAGGYAAFGRMTPG
AASAGSAAAAAIAQQIAIANFWLGPTLGAWYYTMFPADVSFTLIAASTGA
VMAALGILGALSAVVVDPLLAMTGFHRKRLERFVESLEPVLAGREDEGYR
VHDHYLARLFDLFDLLRLAAQP
>MCA1554 hypothetical protein
MLMTALELVLFTALFATFAGSVGYCWNHYRSLGK
>MCA1292 hypothetical protein
MMHLVVAMFGLLIASFGAFGLIRPPDFVKLARNFWATPKGVHYAAALRLA
LGAALLLAAADSAYPRALAAFGYLSLAGAVIVVLLGHVRLAKIIEWWGRQ
PDIVIRLWGLIALAFGIFLVSATLTGPIGLR
>MCA0630 hypothetical protein
MKGCATSSVATSTSSPAQFAKIAEKFCRLVEGFARYEPQDWLALIARELG
PLETAVRELAGRAGIGDYSMLADIEQRHRMYVELKTFLDGMDDYWTEADL
EAGDGVMTGSLSDNVTEIYFALKRGLAHWNKGPAEAGAAVGEWLSGFEVN
WGYHLANLRSQLKHKATLH
>MCA2175 conserved hypothetical protein
MLYYAVKLAVSLLALVAITEIAKRSTVFAALVASLPLTSLLAFVWLHYDG
APAAQIAELSSRIFWLVLPSLLLFLLLPLLLKHGLGFWTSLTLSSAITAL
AYLGLMMLLRFSGIRL
>MCA0876 hypothetical protein
MKGSFSRWRGVVFAAAMTASSGSGAGPVQFTLLAQGAQSGIEDERNVVIL
DEAALRSLWTSHGAGANPVPPPPQVDFSSEMVIAAFAGTRNSRGYRLTIA
GIEEMDRRLQVDLLLERPGAGCMTAQVLTQPHVWAKLRRSALPVEFRMST
VDIPCAAGG
>MCA0179 hypothetical protein
MGRGSFAAGVVDRLPPTVQVWVRGRRADAGGVGTIMAGPPFGSEGWPPPY
TPSGWSSCSRMMLTAQIKVPESGAFPWHCLERQADGPGLGQKAAERSRCI
LPQRPLACQGGLMPCPFGGRDATSSTIGQDAPAPRRSGFPDGARPRSGSI
GGCRLFRGPVLQPLGPRQPHLRQRGENHHRRGQTVADAADHGQPGEDGEE
LGQPPRQDVDDEEGEGRGDEDQLPFPREFGQAENPAHEHEDEGDAKQMDV
RDAGQ
>MCA0929.1 CRISPR-associated protein Cas2
MSMTVMVTRNVSLRMRGFLASSMLELAPGVYSAPRLSTAVRERIWAVVED
WFTAEQDASVVMVWVDPAMPGGQNVRTLGLPPVKFVELDGIVLTRRP
>MCA0356 putative lipoprotein
MSRLFPPKSMTASVFACLLGACAQHVDISYQPANLSKGIGEIRVETFEYL
PAAKGDVEPNEAQQDKSGLASTFFQENVDALFTGALKKELGSSGYALSDG
GSRSISGEIRRFSFDWVGMTGMKFDIVVHFTVATNGKVVYSNVLEATREA
RQEDHKLDAPSEPIELAMADCIGRFIRDAQKKGAL
>MCA2748 hypothetical protein
MSQGERTKIRSPFHLRRHDPSRTPCSPSPALKFRRAKTAERRSRFPAAAR
MPPLNEAGRQVSKGM
>MCA0384 hypothetical protein
MGESETIEIWLENFRHRRRESPSIRHSAHPVEAALAKRPIGPGRGKCLEN
SRAAAVFGAAQARREAAAGERPNPVIIPVLLPREGSSPHRGEFASRHVPG
RRPFAP
>MCA0845 hypothetical protein
MKTPLRFACLALAILPLQPSFPAFGADVPEESPSIEDIGEKMKETARAIG
DYSSRKGNEAADALKYGFSELKSRTGEQWQRVEETARDTSKEALDQTATQ
AERFKSGSQDAWSHIRKGFSDAFSSFRRAWDKDRQDGGSNE
>MCA1934 hypothetical protein
MSLNGWPYVPPHSPVVELRIAGRSVFFLWTIT
>MCA1007 hypothetical protein
MTRPFAFVRPRGCIGMPRLRILRIPAPFTGSAFQDRAVGHRKPSSSLSNG
LPHEKQGQGHGTSYLDRPAHRSDRLPPRGSPSIRQHPAVRRSRCDRRYPA
RNLAQVFRSRPGIHGARTVFRLSAAGRGLGILHRGVFEAARRLGVCARGR
VFRDGIRRTPDDSRTHRAGHRRRLADPLPAGHGFCGHRLAFGLHTTAGSA
AAAVMDRRRHRLDRLPTVRGRRMGMGCGE
>MCA1726 hypothetical protein
MGFMGVSETEGKGSLRFFAGAEQPIYRPQGQQSRGQGTDGCGTDIRIEND
ELARQRDQGDQYHRLDLNDAAAAQGRPQDGMLEFHRDQQGHDHAEQFLKH
RMIERFENPAGDQHGDGANELIERDDDDHRDHVRQNDCNHLLETLIEIHQ
TPFFGKGRTSFHTGDFRVIHHRFFSCFDTDYSMPRTDAPACGNRDNRPFA
PVPMAGRRASVAMRRARPILRQSVSPARRKRDRSLSVSSSRSRMRAATSR
LVSKSRLRRSRRRHRWISAAAKRQAVASAPCGVNTPSSTSSAIQSIPAPQ
AKASSSRDNTPSSPIQTPTPATMILLLMA
>MCA2679 conserved hypothetical protein
MIETLLGGFLGGAFRLAPEILKWLDRKGERGHELAMQDKALEFEKLRGAQ
RMAEIGASADAAWNSGAIEALRDAVRTQGEKTGVRWADALSSSVRPVITY
WFMALYCAAKTAAFAAAVTAGAGWGVAILHAWTEADQALWAGVLNFWFLG
RVFDRGRP
>MCA1091 hypothetical protein
MPDRPQRGQGGHRLGHGRVVLVELQLVNEAPSGIGIIRIGILRLVGFLGE
QQGPPGRPGQAVQYLPVGAGRQQPGRAGFQVNHRKRTPGGPVQMGVLARI
GFMGAVEVAACLRAGFVRDDKGRPVVELVDVQIGFIRDDALGIAVRNEPG
YGVEDRLVGRGFGVGRRGNRVEVHCFGFASADVAAYCRSRYGGKDVSHQI
SPLLSFLAAFRCSQNEKRATVKWVG
>MCA1612 hypothetical protein
MGLFSALDEFLFGPPGEVGGLIGTPTMAVFDAIESSVELISENPGKAALV
AAGTLATGGIALAAAPAIASAAGAAGMLVPQARGRQSAPCLERLLPMPHL
LQSEAGRSLPAAAEWPLAPQSLLALARPRAPE
>MCA2198 hypothetical protein
MPGRWEYRTASSLVVSTNKQEVRVMTKKLGIFRSANTTAEGPWSRHALSL
AVVMVATGSVPGESFAALTYAGNCQMCHGAYDSGSGATAGGMAPALKGAK
KNASATRAAIAAGGTMGTMGASWSDADLNAVALEIGGAADLGAPTPAPTA
TPTPAPTATPTPAPSATPAPTTTPAPTATPAPTGTPAPTPKPSPTPAPTA
TPRPSPTPAPCADEARPEIDTIPSPWDANAGKELRFTVSALDCDDDSLTI
KAKGLPSGATLTQEFDPDLRKQIATVSWTPGPEAVGTVHTVVFVAVDKDG
NGHKKSVSIPRWTAIRVWPANTTPEAGAIEVVAIQRAQWSAGQSQLLLTG
RIKFSRILTKSERQSLVADPVVITDATTHEVIGQANASVSGKWSARIPLD
GGKVPCSVAVDFHGEMGSRPVKRAPSQCN
>MCA1926 ChrB domain protein
MIFAMHESDWLLLIYKVPAEPSKRRLALWRKLKGLGAVYLQNGVCLLPKS
DHHRRRFKIVQNEIGEMGGESFLLESSGFDRRQQELIAGRFNEERNAEYR
EFLGRCADYLAEIERETAAQNFTYAELQENDEDFKKLKSWLEKIGRLDFY
GAELAAEAARQLLLREERLEEFSQAVFAAEHERLKSRPFPPSEENP
>MCA0407 hypothetical protein
MEAKSAPYAASGLEGTSSVIAGIGLKCAFLGSVCSNAVARVSASDRQDLV
DWPGLKRREKIGEKMRLADRPGSAVLKLGVAMTDAENATPVLRSVSMIVP
QTLFNLQISGDGHLPLRRRRSR
>MCA2921 conserved hypothetical protein
MSVAPNTIEALEAAIVARLGAALPDVEVAPYPADPANYQFLHPVGALLVR
YHGSHYGALMDTDAVVQERLLAVEITFLFRALNGQDGLYAYLERARRVLT
GFKPAGFGKVYPLRDAFLEEHGGEWRYAVDFCAPTLAIEDGCEEDGPLLK
HVTTLDGYERAETVRQPDGSTTYEEYAQ
>MCA0211 conserved hypothetical protein
MGSIQAIANTQPNELDLKRIERAIQARKRYRYVTPEVHPTEDGYLIQSSC
CSRNIDPNGGVIDIARLEFKPSRGCWWLYHKDHAIGHWIMHGEYRSLQQI
LALLNEDPNRRFWQ
>MCA2799 conserved hypothetical protein
MGMSGWLAAAGEHEIWLLTWLSAAMAALCLWQGFRHLGHGRAITDRPTSR
IRSAAQGYVELEGRARMMAGAPIIAPLSGKRCVWYRYTLERKDRGSGDSS
WQTVDAGTSTAIFEIEDETGRCVVDPEDAEVLPPIRLSWRGLYPQPGGPP
RGRRSVWEVLLPAGPYRYTESRIPEGEWLFVSGQFAGIGGGDCSPEEETR
DLLAAWKRDKTALLRRFDENKDGDIDLAEWEKARESARDEVFRRRGTAAH
SVELNVLRKPRDGSRFLISALSQDHLARRHLWLGLAWLTGFLVAASLGGA
VSARLWNG
>MCA2928 hypothetical protein
MPTVILTSGTTWTVPHGVAMLDSVECWGGGAPGNNGNNIGSGGGGAYSKT
INLAVIPNQTCYLQIPGTTAISGGSPADCWFSKTSNSAPAATSDGCLAKS
GAKSTGTVGGQGGQASGCVGDVKYSGGNGGYSSGFGRGGSGGGASASPFG
AGGDGNTPTSGSTGGNGGNSPSAGSGGTAGSAGDSIADGGGGGGGGNGTT
FGSPASGAAGGAPGGGGGQSGANVGGTHAASGGPGARGQIRITYRYATRA
IVR
>MCA0423 cytochrome c5530
MNSKHLQQRGGPLGVATGVAALVLALAGGGAQAASVSGSAKLDAGLGKVS
VKGKTAGLAPGSWVSIYDADSNMLLYTAKTDGKQKFKVTLQNGGVPCRLR
LETGDGSKVLVPVGGADASCKKAAACSIQNPASDVEIAEGQSVEFAAWAK
AKKKVTLNYLWTVSDGSDPYQSPSFSHQFNHAGQYRVMLQVADSTGNRCA
DDVVVRVAPPNANPYPKVSERPAPTVAEALNAADGSYVVMPFEETGMQGG
SQVTLPFNPLIPYNTLNAQVFKKVKQKPALIDPSELDVFYSAASNPKDPV
GGDSINSTSQNRFSTGESGANWDPAQTTASQTVLIGGRDFAEATLRKTEM
WDKIDQPNSNLSGKPKNGKSMAEQQSTYTPAKPMMQLDEGIRGNADEGAG
ARRMPGAANPYRANDPQQISAYDSASKTFLAQFIPASDVDDKGRVNPYPL
FRVEARDKGGNLVAKTDAVFSTASETRCRECHAKGKVAADDTVWRTPVQE
TELVNADGSPGPATGAGSFSAGNTGYNGIWPPAIHNVFNVNAYDPLPPNK
PAPYLLAGAVPADASGLRTDRVAESRVGADGKLQIRLKFKDASEYGDPDD
WVAQEKAALFNTLVLHDYMVKYSPTGTISSQIADLVEDKLSGSRGAAMYF
CSSHHTSQLKFDTGVAARSYPSNRSDYSRAFHAFHGKMQVYAQDVSASES
ADGQEHKKGELIRDRRGHPLMFGGRGWDSQHNDNNGVPVKSDGSGTSYAW
DTGKNDWAPEQFPMHPKGELLYQFGEGVAAEENCAKCHTGHTEKAYRDIH
YASGLKCDNCHGDMLAVGNAYSSPRYDANLSGAGAYVGDSVHFRRPWLDE
PDCGSCHVGDGNRKDPNQGFFSAGVKKTAWQANDPAKASVFPDDARFAVM
PVVETRKEKSTDANGNTVYVDKQVSVALFRKSKDVHGTGSGGAIACSTCH
GGSHAIWPNADPDANDNVTARQLQGYDGNIVECSVCHVKDDFKEGLVATD
GGASNKGVAQGVRDGKVVDATSGRAYLAGPHGMHPVNDPYWWKEAAASAP
NGSGTRKGGWHNDFAKKPGPFGEDQCAACHGSDHKGTRLSKTLTAREFVN
EKGKVVKVAANTPIGCNLCHSLQKSFTGVPTGQPLAYPPPTPSPIHGGGG
GGGGGHGH
>MCA1551 hypothetical protein
MAIGGLLRTLPRRKLRRARTLPCQIKRILFSPEEHALFAVLKSAIGDEFE
VFAKIRASDILSPHRGSGRREAGELYQSMAGRRFTFVLCHKPDLSVAGIV
ELAEHGAGRKAQDADEPVALLCHAAGLPLIRIPASPYYDMGEIARQIRDE
IRREPVFIHGDGEGGRIEPRISNLEDLKF
>MCA2647 conserved hypothetical protein
MKNNDERTLTAVSEMVGEPMIDAKQAAAALRLPYYWFADHAMRSKYRIPH
YLLGGLVRYRLSELSAWAARSAAVQGREAREEDTPAGEAE
>MCA2882 hypothetical protein
MVMIGHLRLPCAPHSVNPAADCPINDRGCPCRKAPAPDRCAPEPQNRPPG
KKLSSRHPDRLN
>MCA2188 hypothetical protein
MQRACRHPRLNFTIRWSMKHFEDWSRLARCGVLSFLAWTAGCASQPGGGT
SPAAATVPAAELRLSEFFKLPVGPRGLEPTDKLRGLAGKRVRVHGYLVQE
EEPLPGLVMLTPVPVTLAELADGPADYLPPATLFAHVSGDNADRTLAYRP
GLWTLTGTLELGSREEPNGRVSYARLNLDGIDTIRAPDGSAPVFVEPAVV
THGHHH
>MCA0945 conserved hypothetical protein
MQAGCSLGAIALALSGGAHADMTTDDWGSWGQVVAEGSLGFIDPGLEKGR
LWLEGQMRWDGDWRHWYQGVARAALGYSLSDRATVWIGYSFVPTQNAGKP
YLAQQDVWPGFRYVLPTEFGTFTFRTMLESNFIRGDQVRYQPRQMFRYLR
PFDFEPRLSLIAWDEVFVRLNSTPWGGPAGFGQNRAFLGGGWSFDPALRV
ELGYMNQYIDSRPNQVMHNLIMGSLFINF
>MCA2683 hypothetical protein
MRQTLDTGWRRRGTSWIWDEEARNIVCTAPEVWSLRQFLQAVGHWSDDLP
SNNGDTLVVAGLDGSLDLLSPKDAEVWLADVIKPAVLSFQSYYEGQAALS
FWMPSARERMQTNPATDAVTWLCEAPHRGSQLDFGRILWGEANEYPQEIL
LREGAKPAGLFHLRIT
>MCA2956 hypothetical protein
MMSLLEVTHARAFSVAAAMKESHLLELKSRIDLLVATGASFREAETTLRP
LCAAIQKDREESV
>MCA2940 prophage MuMc02, structural protein P5
MSFQKPRGLRNNNPGNLVYSPRNAWEGQVGHDGRFARFEQMEDGVRALGV
TLLNYQRKRQLHTVRQIITRWAPPNENDTATYIRRVAAALDVGADDRIIL
SDRKTLTLLVWAISTHENGAMACERWLKHEDVEAGVDRALR
>MCA0085 hypothetical protein
MNEAAANLSGFPKEYAFLLLGTGLGFFLGWLVSRIGGPGRTAPGRTAGGA
GIPSDAQPVDVVVNGKMVRLPSDTAAEIRRLIRDGNPGGAVNRLREVTGL
RPAEAKAVIDSLARIAD
>MCA1617 hypothetical protein
MTKEEFESYLDDIASKLRDEARKTPFAAAKQFEQRVREITKETIQAPGIE
IDFNPHPQAFPDIEIGQFGIEVKFTTNDEWRSVANSVLETNRIESVQHVY
IMFGKMGGNPDVRWGEYEKCVMHVRTSHVPRFEVQIDATRSLFEIMGISY
DQFRVLEMHEKMQYIRKYARSRLKNGERLWWLEDSPGEAHTLPMQARLFT
ELEQSEKIRLRAEAILLCPQIVQSGRARHKYDDVALFMLTYHGVICHQTR
DMFSAGSVGNPENDDNGGLYIARMLKLMEAELEKAAARMDAALFEEYWGV
AVPPEERIAEWLRRADKFASGIWKPSEELFDGRYAQPRGA
>MCA2981 putative membrane protein
MPEENTKTSGYFDLQLRFDSANRVVSIKVNDLTLPLPPSVEDPCCEDSDG
TSFTGMIIPDTSTPATETTPAGPSLGPLLRLGLNVVVVIGAVLALQGLVY
WNYFADDPQFLDRYGWWMVYLDLSVVPLIATLGYLRSYIYAHASHMMGMV
IGMTIGMQVGTMIGGVLGATNGFFVGAMVGMSLGTLYGVLTAWCCGPMAV
IHGLMAGVMGGTMGAMVVVMMIPDHVLIFMPVFTTANLLILIWFTYLFYK
EGVAAGKCQLRGPLTLAQLSSFSLVTIGLLAALMVLGPKGPMVWKGHKRA
AMDADMTENPFQPREPGKDGPSTDRREMEMACGARMMEGGNHR
>MCA1086 hypothetical protein
MPPIKEFFDQLPPVNEVLNNDIVKGVAIGAGVVALAGLALPAAARAARPM
ARSTVKAAILLSEKGREILAEAAENLEDIVAEVKAELAAGAAAGAAAGET
AAEAKGVEAAD
>MCA2553 conserved domain protein
MFQAGERQSLVGFHPGDLLRLSGARPRYLVAAEGQNKVLVLEPGPGGSLR
QTGELKVPVPRHVASFRWPGWGDGVALTPFGQDFLVLLKNFNAARIQEAE
QVQVALSEKQHSIRRAEWVRPVDVDGDGVDELLFASNTTQEVLMLKYPGP
EGAIKPKLVAKFRSGAPWDVLSADLNGDGAIDLLVPNQSKPFVIHVLLND
GHGNFQETAALPFPTDMGLRRFAWSKDKDGFGYLFGVGYGAVTLYRFPDR
WDGKAPVPMRSAPVGSSEGSQDIVLEDLDGDGWLDGVVGRGKGPPGAWIV
YGPLWEHFEELSAARFVLN
>MCA0892 conserved hypothetical protein
MTEETGLSNYRTIAIPEPVVFEKVGEETVLLNLDTGFYFGLNPVGSRMWE
LLVATKNPASVLASMREEYDVDDDVLERDLANLIRDLAEKKLIELE
>MCA2551 hypothetical protein
MVRRWVVWTAGVWLAVVTSVGVAAGAEGEAALRERVKAYWDARKINDLQT
LYSMETATVEGRLRPDQMSRTQFSTLRIVGYSITDVRITGDKAEIKLDTE
VMHPAMQGKALVGPAIVDYWTYVDGNWYHGERSKSAKGGASPSPRP
>MCA0947 hypothetical protein
MDKKTKQIYAEPGPGRVRLGTFEQVEEKSASSLEQKLDEKKVELEALEKR
LDKKAAAIKESEKKVASAPAPSGEKKWFDKISIKGYNQVRYSQLLNGDEN
TRDNLAMPNDNSIGQNKDFLLRRTRLALTADVSDHLLFYLQTDFAANTPA
VDQNNSTSQYQFAQIRDAYSDVFIDSEREYRFRVGQSKIPFGWENLQSSQ
NRIAFERNLATDANAIRDERDLGVFAMWSPDVAQERFKYLQKSGLRGSGD
YGVVAFGVYNGQGANRFELNDNLHIDGRVTYPFELPGEQILEVGASGYNG
KYVVNMQNFNGPADPVTGKVTQYIIGKNFFNTAAGGRNMTQKGTITGTSV
NGINDGQGQQDLRGAIHAVLYPKPFGLQTEWTWGIAPTLQIDPSKQLAWI
ESKNVWGGYVQAHYKIDHFYGGWMPYARFETYNGGSKFDNNSPNISERLW
EFGVEYQPWPEVEVTAAYDLINTTNFRTTTAGSTAGWNSKELASGYGQAD
GNVLRLQLQANY
>MCA2688 hypothetical protein
MPPPPTNQPVSVGPRWPRLWHKNDTNLPQADEADRLTAGYGTHECRRFRR
GIFDDLSPLAVDLALRYAKTAERFSFRQANLELLEIHERLVIHDLNLSSD
TQDLQEAAKGNPPGIRGDQK
>MCA1635 hypothetical protein
MSELTKLIYAFVGIMIVGFGIVYFSKETEQDKVGQALLMSGNMLNTYARE
SCTQAGEAKAGTHLYMPSESQSDGNSYVSLTWNYTNNGDHVLTCRYERDK
GITEMTLDGAPVGNVSVDRGADAPSRAAGAANGKHDAGH
>MCA0492 conserved domain protein
MKAGVIHSSRRPEHITSRFKNGSGAGAPSCNSCRQLVSTVIIMAALTVCG
GREFWTGAHFAYALTVAVLGGFIAPVARDLAAAMESLRNRPR
>MCA2239 hypothetical protein
MQSRRTASGKAPAEGETARHCGGSASALRRTFVPLRAWNETYRPEPRRG
>MCA0255 nitrogen fixation protein NifW
MNPVLQRLQSFSAAEEFLDFFGVEYEPSVVHVNRLHILKRFNQYLNRSPV
PDDMDEVTAMATCKALLKQAHDDFVKSTAAQEKVFKVFQDQDGKSISLDS
LKASLATRGQRA
>MCA0375 hypothetical protein
MKSRTGWPLLLLAVAAGLDGQAIAAEAAFSRQELLEKGFEPAAPRIDVRV
ARVVLPVGYKTPLHTHEGPGPRYVVRGRVRVEEGGQSNTYEPGQVFWESG
QWMSIENVGENEAEIILVELAKPK
>MCA0214 conserved hypothetical protein
MALTTEALERIRSQLGTAATAAEQVSALRRAFPGLSVTRCDAGDIDTETP
VLETTRFNLYLLDAADHCARVTADLSRATGLILAEKPKGAAP
>MCA0224 hypothetical protein
MTYQVIKSALKLCQGTRKSSGEPCNGTLYTCKTCGATGCKQSRDDLCSDQ
AFNVLEHCLKCGAIGQAEAIAPGDYRPHQAWAS
>MCA0714 conserved hypothetical protein
MARPCPSPTHSSTGTPAMTILGITLITLVTGLASMGIHGLIQRWVDHRRL
RNLNDVAAAVYTNFGVLFSLILGILIGQGEERRGDISSAAVHEAAILMDL
VNVAQAYDDEVGRAVHTAAIGYASAVIDEEWPLLAEHQGGRIKRAPLEQL
WRISKAIDPNGPRGQALYDTTVSMLENLAEARYERISVANDNLSDLLRFV
LWVGAAFTIAFLWFFGAENQKVQLLLTGIVTSMLSLVLILVLSMNNPLAG
ELGVRPEPFVKLLRDLSSAS
>MCA2393 hypothetical protein
MPERTCPPHTLRWPSPRNGEAARSARMAANRSASETSGFSPRRTGCSGPH
LWMLPRTAGRSSSNAGTPHPANPFQTLAFAGALRGRQAHRDDLLQPKGRR
ASLSLRRISTSFTSLPMRRSASSNRASRVPRDRHRCRPAHD
>MCA2323 hypothetical protein
MPDAGTNPVIPASLPTHYIISDPPESHVKRPMPRNFPILALAMASLTGCS
HDIGLMRQDETLTGYESAIRWGQWEKAASYQIPRPRPEQRLDRLKGYKVT
GYQVRYRHSEEPSALLFQTVEIRYLRPDELTERSMVEEETWRYDGDKERW
MLETDLPHLR
>MCA0689 hypothetical protein
MKVGEGYRKSVPTVANSARSPPAAPTTTPPAPSRPGTSPAGSTSSSRRTP
RRRRGLFHGGTAFVGAGQDRLTRTAFVQRLPTGRRSIELIAETIQQHSTS
PCSGSPA
>MCA1758 conserved hypothetical protein
MRTKRHGLTSQVCAVLVRCEYAFNRSFGPWQGFDLRSGNLCLRHRAGTPG
VPAVPAGGRSAGRPPHPADPEPPVSNGRRFSGAMTPLARLLPCMLWLCSA
CAGPGTPSPAPVPGEPHASIYIVNEGKHAGLILRKADIPAGFIPESSDFP
EADYLELGWGDWDYYQSGDPGLWLTLKAAFWPTASVLHVAGIEGDIAGRY
AGYEWIRLDLAPRPFAGLAGYVDRSFARNGSPRASPVGPEHHEDGWFYPA
NAKFHIFHTCNGWLAGALAAAGYPMGWFEPVTAEQLMTKIRPYAAPSLET
HPPIRPSTGLRPARRNSKAPADASSDYPAAFLQSTHNHASQRR
>MCA2031 hypothetical protein
MGGEVGDDRFEGAGALPPRFGRLQWEKGCGGPEIFLKFDPGQLLFFIVL
>MCA2662 conserved hypothetical protein
MPVFVESLNLGDLLKYEAPNLYSRDRVTVAAGQNLPLGAVVGMITATGKV
KQIDPSATDGSQVAAGVLMQRCDAALAERDDGLMVARHAIVSDHALSWPT
GITTAEQQAAIAQLKALGILVRQGV
>MCA2108 hypothetical protein
MSKLIPYPRILPAVAMALTTDLTHADISPVIQIFTGGLDGQGELTTSLHV
NATPQGITRGSYVGEIMNGGGVRITPELSYGLLESLEGSLYLPVVTDEHG
QWNVAGGIGRLTWVPIEAPDEGGWFLGTTWAFSDYNWKYSQATLNLNGGF
TAGYDDDDWLFAINAFFNWGLNDRYQSSDAPGLEPAFAAKHYFGENLTLG
LEYYGAIGPITHPEPIHLQSHTIFATVDWDEEDWSINFGIGRGMTSETDD
WTIKTILFVPIGNY
>MCA0653 conserved hypothetical protein
MTEIHCVFEESPERGSTICAVEMAIFTEEDDTQDLRWRVGDALHCHFDGA
NTPRLTHSHFTRDEPIAAVKRLIDKWLG
>MCA1190 hypothetical protein
MATGKPKAGSAEHRELFCRSFIDSHLRFEPERLAWPELDPDMLQRLREVP
FWQEVLYTEMRAIRIIEAFAPTIPDPLVREAMELMAEEERRHERLVRHLI
ARYGIVIEARDNPPLPSDIERTFIDFGYGECVDSFLGFGFFRLARESGFL
PPEMFDVLEILMGEEVRHVLFFVNWMAWHQARRGRGAAAWRGPASAWYYA
RAVAALVGVALRNARQQGNGREFSATQAAGLMPGFTVGGLLDACLEENRR
RLDGYGRDLLRPLFLPALARFALRLTGRRIPEDMREEGSLMRRVWTVLRG
>MCA0171 putative lipoprotein
MRRLSWTLMLSLATSACSLLPERPPQPALHDFGAAGAAAPAPWSSVDVDA
PEWLRNDRLQYRLLYAKPTELRSYTLDRWIAPPSELLEQRLKAGRSANGY
RLHIELQAFEQVFERPGSSHVMIRLRAETPADTETFQFDQPTASPDAAGA
VQAFAQAVDRAVAQLRAWRPAN
>MCA0721 hypothetical protein
MTLLAPVCSDRWISLVDKNPPGHIAHGRPGASSGARLAEAVRFELTNGFP
LPVFKTGAFNHSATLP
>MCA0034 hypothetical protein
MSKPSPKAVLKTIKGYELDKWTATFWLVKRSMANREARYTVLRVNTDAKL
QKRLKGYITSQLQGKDFHLAEYDYSNADGDDTLFTIAADATDFPQVEKEI
NAGFNNAIAKDYAELLNSWAYVIQIEQGNEKLFAWRKISTLTQPKKVESR
RATFFVEHKLIDVEDKEVFLIDPRFDFFVHDGTVFIANKREFETSMNFRE
GMKAKAAEVIQNFTDSGHFRHVDLIQKYVGDNLHHLRKMASILKAGYYQQ
PDYIQRLIEVSKEEGWALKVENGQIVVEEDSIELLLKLLNNDRLRSPINN
ETFDAAAKAPVKKSS
>MCA1921 hypothetical protein
MIESQPGRTEPARHLGRAFRCLPQSHDNGPASLVRVRGPFSGPMPNRDAS
GFWNGHERSARTRRTTDPGQVGETGLRLQKEIEEGEVCLTQAIHHQTLGV
VGSVGTVAGMIRLLEWFLAHPPSP
>MCA1802 hypothetical protein
MPAGDTMRGSLWSVELHPMSSDPFKRVVQAIDAANAQDPNREDWQGTSYP
KELLYGLRMSECLERLCPTASELLRLAARGQHIRRWEIPRSDYPATREGY
LRWRTRLYGFHGDQLGLIMAGAGYDEDSIRRVKRLLSKRDLRSDPESQAL
EDTACLVFLEYYFAPFAAGQDESKLIDIVRKTWRKMSDVARLRAVELTLP
GPFGAVVAKALDAGMS
>MCA0888 conserved hypothetical protein
MEDLSWGVEMRCTKRVQRFLSLSAMERSLFIQALFLILLFRVALPVVRFR
RAWACLAGMASIRPEDMKRRIVIQPEKTGAALVRASRYVPGATCLMQALA
GVLMLRRQGHIAELCIGVAKPNGEFGAHAWVECGGRQVVGTKADFTTLIR
IEMTRMA
>MCA2909 hypothetical protein
MASLKQFIANAKAMLPAVTGKPGWLDAPSMHLEQVTGQIKALLGKADQFA
APDIDLHQADAALKAVLGEAASTAPPAISLHDTAAALKAVTGKASATAAP
DITLAAAKAHVDAANNPHATTAAQVGAMAITHPANAISGFGNFVYGLGPA
QSAGSASTVARSDHVHPFPNAVQIGAIPDTHFASVLEGFGVSASPLAAAA
SPGVSYKIAREDHVHPFPTAAQVGAMATTHPANAITGFGASAQALAASGS
AGSASTVARSDHVHPFPTAAQVGAMATTHPANAITGFGSSAQALAATQSA
GAATTVARSDHRHPYPTAAQVGAAATYGDAANQFNADFFQATQDVGYRFA
NSPGSYAMTSTDGGVRVPVGGGFHVRNTSLAYAPCYASAFTVSSNRRLKR
VLGEVRHALERVRALQPIRYRLEADGPQGRIELGLIAEDAREVLPEVVYP
VTDGANGPDGASLSIDYGRLAVLALAAIRELEARVEALEAAR
>MCA1215 hypothetical protein
MVCGGGGGIKQCSRGVLGAGLIERGIPIGCGALTRKWSVNSEPLSFRHFW
MRNGAASIMVFKDALFLRGIWSLWDVTSTSNVEEMAWRRG
>MCA1827 conserved hypothetical protein
MNMKMIFTTAALAIFLLLAGIDAHALKEPAPSSAPPARPAEQDELALFRA
SIRMEKREFVADAMDLDEVQARKFWSIYHQYEADLMKLNEARYALINDYA
VNFDSISETKADELVRAALEFRKSRTALLENYYGKLAKALSKKIGARFLQ
VESVLQGAEDVEIGASLPLMPKSR
>MCA2904 conserved hypothetical protein
MEKTVIFRDRQEFQAADPNALQAYARDSLDHVVADGISAQKHYTGFGVGA
VSATEVEVQPGRYYNGGAVYVAEQPVSINLFQYIPLVAKKIVAVVLWGQE
VDTSVEPRDFLVDLQTGATEPQAVAMQRIRKCEVNPLSGQESADPQPPVI
QTGTLAVAHVYLTPAGIERIEMQTQSLLPNGYDQERRLDGIEIWKAAAEP
RIASIATDLAALAKKSSDKADRAMFVEIAQDLASVKERLQLPSSYSSYQS
DAFADTAKSNAGHVGYAARIDHGLLFPFAASIQANLALFNPIDAGVSVSG
QNVVLPAYTHAARIQTQGYAGDLSISQYQVQTHTVREQKIVSWQKKYGWH
GNFLTRWYARNVWAKLGARYQWTLPWHGYFEQHETTQYVDEVTTTGYNGA
MVAQSFLVANALWLTRLGLFFTQIGANGDVQVIVCETDGGKPNLEKTVAS
ITLPRASLKAYPSETVIDVPHVLLEGGKRYAIVLITQGDHRVATVSGNAY
TQGTLFYGSDGDYFVGDLTRDLMFTVYGAKFARVRTEVQLQSVSLSGGLT
DLAIAAQHVVPEGTELRYEIQPSGSGAWYPLGDPSLVLSTGPNLVNLRAV
LLGTSDLAPAFVLTNNAIQASRPDTSFVHWSTARSVASTTSVTLKLLVAH
WDAANHTLTPRIVTGGSNEVAPSTTVITDEDGAKRFSYTFTIPASTAYEI
KLSGTRNAASQPFAVVERIDVAA
>MCA2881 conserved domain protein
MGALRRRRSQPSRLRVALAPEDWAERLQIWLSLVLQALILGVFIGALYET
QWLVAFTSLAVLALTFLPAMIARQFQVQLPVEFTFVTCLFLYASFGLGEV
AMFYERFWWWDLLLHSVSALVMGLLGFLLIYAFHSTHRVKMAPFYVAITA
FCYSMSVGALWEIFEFLMDQGFGFNMQKSAHDTMTDLIVDAAGAFLAAWM
GYHYVKNGDSLIADRLVRRFVARNPRLFPPRRQSGAS
>MCA2669 conserved hypothetical protein
MRGSCRSLRLVRAVSPASSTRPSTPSPTVRISVRIDSAAAQAQLRRWGGE
FRDKVKKAVARAIASEATELKQDVRGHVAGQMAVIKKSFLKGFTAKVLDK
DPNRLPALYVGSRIPWSGMHERGGLIAGRLLIPLHGRVGRKRFKAQIAEL
MRGGNAYFIKNAKGNIVLMAENIKEHDRVLSGFKRRYRKAEGIKRLKRGA
DIPIAVLVPKVVLKKRLDVERLVGGRIPRLSAAIERQIRTVD
>MCA2917 conserved hypothetical protein
MRLDWNLARSVLEAAEALGDEKDRVAPGAFPGVAEEVAIEHFRLLCEAGL
ADGYPQGRSPLFLTRLTWSGHQFLATLRSRTLWSRVKAEAKDRGLALSFE
VIKALAAKLVGQLVD
>MCA2569 hypothetical protein
MSFFRGAVSRPCCSKDDGRRKRSVNSVSVIRRALSAAIGVLASGCVGMGQ
DGIQSDRYVLYYCDGGRKFRVWFQRDRATAVVDLGDRSVSLPRLTASESG
DYYSDGVTGLSVKDDVAVVEENEVVTYGACVSG
>MCA2876 hypothetical protein
MNQPRDGILELRFALEGDLAALSLPPPAPRRWTRGLWEHTCFEAFLRPED
GTRYLELNFSPSGEWAAFVFSGYRVGEAFGEGLQPAVTRRQTGNRLELGA
TLDAGLLAGLGVYTGLRIALCAVVEDRRSGISYWALHHPAETPDFHHPDG
FALTLAHRMDTGKPMKNQDIA
>MCA2635 conserved hypothetical protein
MNEKQASVAARIAELSHLPMAELWVLWDRYFERRPQFPNRTHVESRIAYK
MQEEVFGGLAPETRQRLEAIGAKHSKIKLRAKPRVFNFAPGTVLLREWGE
REHRVTVTAEGRFEYEGRSFKSLTAVARHITGQHWSGPLFFGLKGGA
>MCA2912 conserved hypothetical protein
MTTYLSYITQEGDRWDGLAYRFYGDPFRYEPLVVANPHVPIVPVLPSGLT
LAVPVLAKSDSTPSVESLPPWKRGVPAGGVA
>MCA0994 conserved hypothetical protein
MGRGLPYAIGLWGDLPYSTAQAAGVNNLIADMNAQFLAFTVHDGDLKSGG
SECTDAVYTTALGYFNALKAPAMFTPGDNDWTDCDRNPNYNSLAQLDKER
KLFFSTPFSLGQRRMRQEVQTSPLCLGADGSYVACVENRRWSVGRVIYAT
LNVQGSCNNLCDVKPDPVEYAARNAANIAWMKETFELAKTRKAAAVMFIS
QANPGWDLNDPERAPLRDPKTLAETDGYPDGFQEFLSALRDEVIAFRRPV
AYVHGDSHYFRVDRPFLDAQGRRLENFTRVETFGDNQGNGTNDVQWLKVL
VDPGSREVFSYQPQIVPANRVAVPAP
>MCA2482 conserved hypothetical protein
MDLATKVMILRIVVVFAAALFLIGVIVGLIKPKWVLFWAKNPDRLTAAMV
VSAIAMLLFMAGWTGIAKLTLKPKEPQQRHEEDRRSRDEQNMLQLNR
>MCA1070 conserved hypothetical protein
MSKLHVRLLAAVLTAVGLALCYYKVEILGLPLLPTEQAEVWTVEARIEFK
ARRDTAVKVQFAIPRAPRGYTVLDENFVASDYGLTTDDNGLRRQARWAIR
ETHGWQVLYYRIHLAKDPAAEASPPPTGVVMPVEPQSLPEPLQAAARGLL
NEVRSKSADNVTFAQALLQRLNAADADSNVSLLRQDITGPDDWARRIVAI
LREADVSARPVYLAILKDNASRGSLTPWVEVFDGDSWVDLSPSSGETGLP
GDAIVWTVGSEPLLEITGGKNGRVEYSVRRHSQEMTQIAEQRARKMGSRI
MEFSLFSLPVKTQNVYRVLLLVPIGAFLIVFLRNVVGLQTFGTFMPILIA
AAFRETQLLWGVAMFCLLVALGLLLRFYLESLKLLLVPRLAAVLIIVILL
MAMVSVMTYKLGLDRGVSVALFPMVIMAMTIERMSMVWEEFGPVEALKQG
FGSLAVAVLGYLSMSSDYLGHLVFVFPELLLVVLALTLLLGRYTGYRLLE
LWRFRGALKG
>MCA2941 hypothetical protein
MTAAAKAWPILWRYIAPMLADGMRQQWEILLPVAERMVRAVEDSLSWAPK
SGEAKKSAAIALILKELKEREIAWADRIPDRMMNRAIEMAVGLLPEKQSA
ETAP
>MCA2677 conserved hypothetical protein
MHNLYVQFRQLIPDPPLQAGTVVEIGSGVVTVQLPGGGRIKARGTGAIGQ
NVFVRDDTVEGIAPTLTLELIEI
>MCA1329 hypothetical protein
MPRAEIRERSFAPFPGGSQDHGQGEQGARAGKDNGVRFDGVVLQMPADRR
RCHYVDEN
>MCA0267 hypothetical protein
MTVEQLIRALLEMPREAVVLYEGDAGYARVGGIDLQRNGNGVPDEVILSP
DMSE
>MCA1389 formate dehydrogenase, delta subunit
MHSENLVKMANNIGAFFQAEPDHAVAVQGVVDHLHKFWEPRMRRQIIAHL
QAGGEGLSPLAREAVAVLMNEQQKNAA
>MCA1189 conserved hypothetical protein
MLSTRSVFQEIRRDPRAFQLLLSIAAKGETQGGWENERIAALTPDPVLAR
KVRRHGADETKHGLMFAKLLRKAGLDEVPVPADADYCMLLERRGIGLPHG
RLGRDEPLGLEEILQYLVHSKVTEERAFDEVNRLLRVFGGDAELAPSLRV
IAEDETNHLAYAHEELLRLSGQGHGDRIARMLKSYALAEIRVYRDVGLAF
VRHMAALLGWSGPKQWLLKLGVSAAYILERTLVWRRLAALRPPLRSNAMG
S
>MCA2171 hypothetical protein
MDLRLREDPFPKAGAADGSAVPDLLQLRDENGHIFAGRMESLAKASPVQG
SMDVPFYGAKIHFPVRGNPDTLTLVPTLAPGQKLGIVVSHRGVPLADLGE
LEAPVRLRLDWNDPWRSRFDDPGRLRRHSAPRSFLYVEPYEVRHEVLLRL
EDLAPYLRFKPGDPQNLSTSERESLKRAAGDFLRAHNPLRINGAAVEAQL
DRVEFLRFGRQGMQNANDAETLHPASALVGTILVYLTEAPTETVELQWEL
FGPDLERRPVTLYFGRESFEFEATPRDPVFRWSSEEALGLAPEIERPASA
AEDGRGVPVIGLRTAAAAVALLTAAWFARRRHPVLACTGLAAAGAAMLFP
QLLDPVGQPEARTMAFPPPQVRTRLEALLHNVYRAYALRGEEASYDRLAM
SLDGPLLEQVYLRQRKAVLARNQGLGGEGRVSRVEMLDDGLVIRKLGASS
FEVTGRWIAHGAVSHWGHSHERHTLYAARMVLSIAGDGTWKITAMDLLEG
QPSPSARS
>MCA2505 hypothetical protein
MRAASRNEAAGTLGDAQRPFLLRAHTYLRRLDHGRAHPVPGRTMDTKSSC
GKWSAGLDEAERRPTGRSPRHSRPALDSKRLVGSGQRKWPCRNEKRLAIA
RKPLNFGSGGRI
>MCA1036 hypothetical protein
MAQVRVGMECRGLREKTSSFPGFDLPYAERGLVHGFQVRKIGGGHDWHQN
CWIHGA
>MCA0371 hypothetical protein
MFFNAPSPGAGLRIYRYLAFRMDLVLTALAFFNVTMQGFLPRGRFRRRMG
SLRSWRFSREIQVRYFPNPRFFEAPLSAWNLDGCSRSEASFEAPALHISG
EPPDAKPGKPHLRPCAVSSR
>MCA2381 MxaD protein
MKKILLVATGLLVVWTASAFAHGPTRQKVTETIEINAAPAVVWGIIKDFD
KGEWMPQVASTIGTGGNEKGATRELKLKSGGIIKEELKSYDAEKMSYSYK
ITEVDPKDLPVANYSSTISVTPAGAGSVVEWNGAFYRSFMNNNPPPEEND
EAALKAVTSVYKEGLANLKVLAEKK
>MCA2957 hypothetical protein
MNAAQQDLFGGGAVHLPQPIPIAPGRFALGDRVAVAMSRAIDASGVDRPD
LASRITALTGKRLPLSVLNAQTAMSRPDHTPSLLQAMAFDAVTGKWALLE
LYAEAAGGKVIYGDDIAAFEVGRAVMVKKLAGQREREALRSVGVGR
>MCA1773 hypothetical protein
MARFFTPFGTAGSGVRIGITTPVYSATNPAQ
>MCA0470 hypothetical protein
MNIKAIAAAVIVAGSIGGTPAGAVTADDFIIRDAKDLADLCATQPADPAH
VAALQFCYGYLQGAYDYYLAERRGPDAGHFVCLPKPEPSREAVVRLFLAW
LEAHPGYGRDAAIEVLFRFGADQWPCPEKDDAAISAPKS
>MCA1914 hypothetical protein
MPWAPDWTARSPRIFTPLNPEDRRMEQVTNALTTALAAIFLSAMIIEVRR
RQKQLQELYNVLDEQDRLIVMELDRMVELGEIKPYAD
>MCA0944 conserved domain protein
MQCARQHPFGGFASAPDRIAVIGLLPFFALLSGCLSDAELLAENSRIALQ
TAEMRGAGELDCPRVKAAIVTEKEVPGQPLGELYSEYGVRVDGCGRTVFY
DIECRDEKICAVREKPLPK
>MCA1507 hypothetical protein
MGMARVDARKSPREVRDGMRRQALRMQSFWKEIARAEAMNTDLYAGFSER
LIGDSGPDNLKVSHAKRVTAGLAGRRERIEVFRSSVHTIFLIL
>MCA2255 hypothetical protein
MISFRRNEASGYRFRNRPGLRRAVVDDRQFGGPLPQIDRTPAPGLAQNAG
EGFQGGPHDIAEIGLALPALFRFGRRGTLDLQSQIPETKVYTPMPAHPRF
PSAPKAGYLAHRDSA
>MCA0336 hypothetical protein
MSSDRNFKHLLRVRVKEFNIPPISDGVVLGREAPIGCAAFRKALELLVVA
PFEHIQLDDDVISDILVRRAFLRRVPREYLVQFVLQRIKPLMGSEEIMQL
DLNAEILIEDEAL
>MCA1988 hypothetical protein
MSLLIQADAIHGMTATFPLPPHDRSPPPAERTESPLDLDRLADADRPRSI
IRLGRPRFFARPHGGFPISAEPARSPQCFLPFMRISRPRAALPEEAPRSR
NRRSAPAMNTDIR
>MCA0931 CRISPR-associated protein, CT1974 family
MTASLYMIELRPDPAALIRFAQDQGLNTHQDQDLGYATHAWLKALCGELA
PKPFRLLQDGRNLRPPRLLGFSAHDGTRLTEHARAFASPLAAQVCSLADG
IAFKPMPESWPNGRKLGFEVMACPISRLGRNEDDVYRRHLRDCDARAQSP
DSREMVYRRWLTRQFGSAATLDDFSLDGFRYLRLLRKARGTRSGFLAPQA
LFRGTLSVRDGAGFGALLARGIGRHRAFGFGMLLLRPAP
>MCA0060 putative general secretion pathway protein B
MSYILDALRKSERERKLSQAASLDSVIFSPEPVPGRPWLPWVLGVVVVAN
VAALGYFLGLTSGSPPAPDAPASQSPPSLTREIPSMTAPVAATQPSPPPA
ATVPPFARRLGGADAPPRGLHPHPAAPPAAVRPVGGKGPETETGEAAEED
AGDETMAGEIDDATEAEEEDPETEAPVRKAPPPAPATPAPRRDTVPLLSE
MPPSFQSRVPQLKINLFAYGPHPDDRFAVINMKRHSAGDTVAEGVRLESV
DESSLTLVFEGQRFRLERP
>MCA2266 hypothetical protein
MNPHSSTLTEPQISTDILIGLLRSLLMQYARTPSPVIAGNIANCLDRLLS
HPRFDEPPRERCTYLYMRTYWRLVESLG
>MCA0123 hypothetical protein
MKRACWILYLSLLEGCAHLPNGEIDYKGSFEKSMEYLGMAIWAPIVVPSL
LIAYTKTTGPVNTNCWKSGGRLICNSQGFINGSYYSGTRTYRFDR
>MCA2548 hypothetical protein
MFREESSSQAGSVWWVGQAPRRQRARRAGVRHFSPAEFFECAPTSSESER
EDRPSCLWIDLQGWSAPPLKILSHAHSLLRKGGSLILELGAEKDGSPARC
REKARYLQALAAHSGFSGHVALPRGDGPAQTVLAKTTAARWRLSESTPED
LPGCLSLFKEAFAAEMSPELWQWKYGEGRGRAVIARRNGRIVAHYGATSR
RIAINGRIVEGLQVCDVMVVPGERGLMTKKGVFFEVASAFLETHFGYYDE
HELAFGFPNHRAMRVAERLGLYGEVGRISEVRWTPYPSFRVRYAATLSDV
AELDDGLLARLWGQMAADLPDAVLVLRDPAYVRYRYQHHPVHRYEALVVR
DRLTGRPQGVAILRCDNNECKLMDMLAPLNSIPLLVDAARTAAARRSLSV
VTAWISSSYVRCLVHGGGVATDTDIRIPTNVWTDGRSVERLQDRWWLMMG
DTDFL
>MCA3088 hypothetical protein
MTTRDIFHRLRVGLALLAGFLVGKLLGGHFGHHASEFFIGGFMLGFLLTH
ALYWVIDRAFGRRAPL
>MCA1846 conserved hypothetical protein
MAGVLLRSGRLKHFLPLGATGQPVYRAASQLRAAIRRQLGQEAADYFAVP
VQDEKGDTLDWYSAFDGDVVPWTSATPEERGPARASLLAARERLLEKSRA
LQASEDGEQQVFGKLLALATRIPGEDHVYLVGGRPVMTFWGFAQREEVSE
RDVLASLDVSGISASAARFEAGPGKIPPMPGRRRWWHWLLLPLLLLVGLL
LFGLKNCGPDGLSGFVRQEPTPSPLPLEREGDVIPDVSPSPDAKEESGEK
SRSDVAAFGERIDRSGRQDRDTVSVKRSVEGSRTVVDSTQVDQSPEGATD
TENWDTSAAMPPESAADAAESTSVSEAASAHTDVAGTGSPGDDKPQPDPG
SGGQTAEGQPADEAAQSAEDKSAAKTEPPAEAPSEKAISKAGANARTSGA
PLTIPKEAARKGSTDFLNGQWQSVTGLQDSSGNPIRLEYDFKGKDGMARL
KRSVGGKEMECTAPVTSSFSGSRLVIDQAGDIRCPDGSSIQRSKVECTTD
PQGHAVCKGRNPDGSDYKVRITKK
>MCA2196 hypothetical protein
MNVKLNKAVTKPSGYPFRTLSLAVIIAASGGFSSETFAAATYAGSCQGCH
GAYDSAKGATAGGYGPALSGSKKNASATKAAIASVSAMNSLSNLSDADLN
AIATEIGGAADLPVATPTPTPTPTPAPTVTPAPTATPAPTATPAPTATPA
PTATPAPTATPAPTSSPAPTPSPAPTASPAPTSTPAPTSSPAPFPIGCLL
QSRPELDTIPTGWNVQVGKILKIILSALDCDDDTATIKGVKLPKGAKLTQ
AFDAERRKQVATLTWVPKAEDAGKTFDLGFQAVTLDHAGHESDASDPRWT
NVTVQPAPSDVPDVESIRALIIQKAQWLADQSELILSGRIKFDRQLSKSE
REALVADPITLVDGATDAVIGEVLADVHGKWLAKIPLAEGAVPCTVKVQF
QDQTAVRSVKRVPQCK
>MCA2453 hypothetical protein
MTTLLFLAGFVLLALGVLVVVGMRLPKTHRAASRIRLPATPERVWEIITD
FEGFPRWRPGLAAVDRAPDVDGFPSWDEVCAMGAKVRFRVLEAVPPRRLV
TCLAGEHLPLRGVWVYDLQADGDAGTVLTITERDSIFHPAFRFFVRYVLS
YHGVMDVFLLALARHLDSPATPEHLSLRVEAPDSGGAEA
>MCA2499 hypothetical protein
MPASSGPSRPFRWRRTSGIVARRPGKTYPARNRVRPNGLGQPFCKMKKAG
GK
>MCA2409 hypothetical protein
MSGPSSPHGYECKDLVRYRHHAVTKRVIVLCCHLALLPGCVQTVAPWERG
ALAKPQMALVPDVQHSALMLHTYASKETSSDGYGVGGGGCGCN
>MCA1787 hypothetical protein
MPLNLSNLGIPVKLLFSGYLIVVAIGYGLAMVQILFTHGMADGKFGLSIE
DIVYSYYGNRSGSVLEAKLNGSMKANAPDEARFQIIQWVRDGADESTYTR
QIRPLFEAHCTGCHNGDSGLPDFTRYESIKTRAQSDEGATFSSLTRVSHI
HMFGIAFIFMFVGMIFALSSGVPCKLKCSAILMPYIFLLLDIASWWLTKL
DRHFALLVVMGGGGLAMAFVFMWVVSMYEMWILPRQLPCVDRRDALRRC
>MCA0680 hypothetical protein
MAEASPYGSMETAPGVGSGSCRLRPWMRVT
>MCA3012 conserved domain protein
MTAPKVFVSHASEDKARFVVDFARRLRENGVDAWLDQWEMKPGDSLVDKI
FEEGLREARAVVIVLSATSVRKPWVREELNASVVNRISRGTKLIPVVIDD
CDVPEALRSTVWQKIDNLDDYGESLQRILSVIFDVTDKPALGAPPARFAG
PAPLIAGLSRVDDLVIRVIARRQIDEGAGLVEWDRLKAEPILRDVPQQEL
LDSLEILKRHDFIRVREVIGALHVVLTDLGFLKFAEAYVDDYQGTVNQIA
ALLVNENVRQNRELAARVNKPVAFIDFVLNLMESNGYIKVSKYIGGQSHV
REVSASLRRALQSA
>MCA2565 hypothetical protein
MTASVLVGTAVGLVVKYVLDKKYIFRFKVNGLGHDTRTFALYTLMGLATT
GVFWGFEFGFDYLFRTKTLRYAGGIIGLTLGYLIKYHMDKRFVFRQRGV
>MCA1048 MJ0042 family finger-like domain protein
MFTRCPSCGTAYGISVRQLREGRGVMLCDHCGRVFNALPGLEESVPDPYS
RPPRTGAGRWFRGGRGREDRPARPGFFLRLAWGIGSIALAIALLGQLAYF
GSTRFAQDEALRPWLVMLCDAVGCQVPPYSDVESIQIVERTLRPVPATGG
YEFRLVMANQSTAAQVFPSIVLRVVDRQGKPAAGRVFSPAEYLPKGSGLS
MMPVGKSLEIRLDLAKPSREISGFTFELI
>MCA2633 prophage LambdaMc01, site-specific recombinase, resolvase family, truncation
MKDGVRETFVPLTLRRRGVRRLVQHQAEDRDTHDSTLIEGMARAFHWQRL
LDSGAMPSGSAIARAEGLHHSVVNELLRLTLLAPDIVEMLMAGRQPRRMS
LIWFQRHPLPVDWVAQREIVRRFEEGA
>MCA0965 hypothetical protein
MTELLFVFTVVFVGYVLYEVFKTVSRPASTAETHRACAPDAAVEKHAEIP
PEPVAEEPAAETKAPVAEKAETGGEEKAVSLRDPVTGEIAAVPTNYRFAK
KWIKEALVTEGLLDRVYKNAELEGETARKVKDALERFKGLAKYQG
>MCA2398 hypothetical protein
MIPRTTRRRDIAPCRSAKPARQRRMRGKGYANPFSRQAMVPRKNGRAGRR
NVPAGDR
>MCA2936 hypothetical protein
MRPSCNCMGEPTDYNKWRLVFEMFLFVWNLGVSGWLWLGRSHKATLKRID
EVEADFHDRISTYEQRCLVKHERITRLESQIGKMPNHDDLGRIYDRINAV
SGDVREMKGTLEGTAKTVDRLHRYLLEHDRGSKS
>MCA2130 conserved hypothetical protein
MIRTKFLKCGLLAAILGLGALIPATLWAHGGGSGVDVDSCRIPVGGFWVH
FTAYQPQLTGTTEYCDKIPETGSTTLVFDYEGKALRNMTVEFEITKEPEG
SRVFYQAPAAYPTGTVNATINFTEPSDYLAHVTLVNEGQKIDAHIPFKVG
VNQTSVSGSTWIIILVIVIALGYILYLSNAGFKKVVDSLLLIKKKT
>MCA2673 conserved hypothetical protein
MPDLSVKYFNSGMAGAPQISNNWGDLVSMLDACLVNGFALKAIDTLTFAN
GIATATITSGHAYQRDQVVQVAGAEQPEYNGQFRVLTTTATTFTYAVTGT
PASPATTATSLSAKVAPLGWEKPFAGTNKAAYRSKNPASPQNLLLIDDSL
KTPGYTTSWAKWANVGIVEDLADIDTIVGAQAPYDPNNPTQNWKQVTANQ
WGWHKWYHARQTGYDTYGDSGGGNRNWVLVGDDRLFFLFCTFAPGFNWYG
RSCYCFGDITSFKPGDNYATVLAAHDLYWSNNNQYMSYPGEYGGASLIAS
LDFSGHVLLRNHTQLGNPVRWAATSLNTNNGQQICGRGPMPFPNGADYSL
WLLPTYVRQEDGHMRGLMPGMYWMPQDRPYSDQTIVDNVVGQTGKRFLLV
RTQYSSETEGAQVAFDITGPWR
>MCA1135 hypothetical protein
MAVAPREGSDGFIASAKGDRWFLSKRETRFVPVRSEAKPLNHLKTVGIPS
GRWWKSTLGIWGFQPAIELPRVGLLSVDLPSRNIPECRHLPEPFRQSHPK
ETRGRT
>MCA2259 cytochrome c5530 family protein
MEFPPGSITAMAHRRSPRSPSAGQQSINYGGTVTTPSHAPSSLLRSGPLV
AGLLLTLAATLAPEAAAAAAKLKIKAAWSDKTGTLTIKGSAKGNSGPVDV
YDINGRRLGSGQGASFALTLGRQDLANVPCAVRVEAGGTEAVKPVKGAPK
SCTGVPTCSITSPADGASLQIGVQSHFEATANSSDPAAQPLRYEWDFAGG
AWGHPTDLMAMATFIRDNSRYRVRFSATDAKGRRCEDAVEVVVGTPPAGL
PGKVSERPAPRFGGELDGTPDDLAVLPFEDWTMQHTTDAKTMPNDYVSFN
PLISTMNAYVYKKARLPVPLAGDAVTVDYSAASNPSDPVGSNSINSTSQN
WPVGASLMEAAVQKTDLWETFTRTDFADVDLSKYINGSWIMDPWKKGGQP
QPDEGYFKYKAVTNRDGGKVYGSVPDNDPNNSDHGRYMPGIAQPYQANDP
QAFSKFLDNEQRFAAITIPMTDIDDQGRVNPFPLLRVEAKQNGQTLASTD
AVLSNSRDHHCRECHAKGGIAAPENPPQTKAACRSSPFGAISDERRKRMG
RAAYPPCEEKPVYYSVSDFGGDPNSVFDQEYAAALNYSSIHEFYDGLYFL
DQMLHGVKDHMSGKVIADRPSPCYGCHATAMLFVPFKQDWWDVDGFKVND
PVYEPNYSIAMHRFHGELQWNDSKTDIVRDDKGMYVRWDWKTKGRNTRTL
FPVFDANGNQLPMEENCLKCHAGHREQLYRDRMYTAGVTCYDCHGDMLAV
GEAFPKNYLANKDKVGSVERDDYRVPWFDETDCGSCHVGNGNLGKDGSGG
FFSAGVMKRAFDDADLSATTRPVDRTHPDSVRFAVAPKENYKATFPTLFY
YTVDPTSMVFQTKMVDTKIDAPVYRHGKDRHGNVACAACHGAAHSIWPNR
DPSANDNVTALQLQGHTGTILECNVCHTADSFARKEDLDGGQYSGDAKAG
ILGGPHDMHPVNDPYWWKGAQGDGANSDGTTYGGWHNDYAKLPGMKGEDQ
CAACHGNDHKGTRLSKTPVDRVFDFRGFDGKKLKKAGFKTRVVKMAAGTP
IGCDTCHSLQTSFIGSPGH
>MCA2656 conserved hypothetical protein
MTTIQLTPAQHAIVAYAIEHTDGKIVWFPDNVKGGARKKVLDGLFRRALI
TSIGADWFVAAEGYDAMGCPRPAPAPVEPDEELEAAVSAAEATWAQRRSD
AKPRTRENERSEVSRGEAERVRSIRQNSKQAEVIRMLKRPEGATVRQICE
ATGWQAHTVRGTFAGAFKKKLGLVLVSEKLDGGERVYRIVAEDVAA
>MCA1948 hypothetical protein
MTARGFRRRADRFGRWPVVSLILLCLAGCAGKPPVASAPQPVPAKPARPV
VQKSPEPLATDRREVSPVPGVPLWRWDSWDQHGKMHLVSKPEKYTLQLLA
NGWLKFSAECLKGEGIYETHGDRIVIAVTRSDAGRCRPGPTAEHFIQSLE
AASHYHRSGGRLFLDLGRNGGGMSFSLLPD
>MCA0658 hypothetical protein
MGIWFPVSWFILACSLTSADHKHPTVSKPEPPHQKNASDTASMLPQPFHD
GPAKPLMPHHFSLQPVFGDVQEGFSGGVAPVALHPAASFRSGSSTQCGPC
AGTRVRRAGQPKV
>MCA0776 hypothetical protein
MDETAHYSQPFTELLANERLNSLVPGAPNLGMRPRLDALRIETAFEPRTV
RDPDMARCCLAAVWLYHDFFDESHRISQGIATTSGSYWHGILHRREPDAW
NAKYWFKRVGSHPVFPEVTVAARELAQAEPPSHETEFLLRQTAWDPFAFI
DLCDTARLGHTPAEPLCRRIQLAEWRILFDYCYRRAVGD
>MCA1629 hypothetical protein
MAQKTLDQLLEGLQKNAEDRGNGGATSSLREQAHKLSPAIMAWLAAGGEN
LTLCKKTSER
>MCA2640 conserved hypothetical protein
MQNQVPSVQSGRKPSRPRSDGATRIALDENELAARWGLSVKTLRRWRQEQ
LGPVFCKLGARVTYLIAEVEAFERRVSRYSTFARAYQ
>MCA1374 hypothetical protein
MGGAGEGGAVPAAPVRSRCRPVSSPGSWACVLRSHRNTLRRWSSSPPMKT
YRKRHAARLPGRTPKTRSETA
>MCA0522 hypothetical protein
MDSKTVVDAALTRLDTLLAEDYGAKGGSLAERIRGLAAVLPADTRKRLLD
LTARGESLARTGAGERALAEFVFDCGVAYEQVDFLRKAQLAEDLGLVEVG
GLAPAELERGDLDAMARFIAARDRLVAKVAGFTLKALAVMGALFIIGLVF
GVV
>MCA2651 conserved hypothetical protein
MAEWTIDDVAARFEEAATTGRRLPPVRVQGYFNTWPAFVRKEWEAFAADE
KVYRPFPPSPEAIDRMLETMRWVQWLEVEQRHLVWMRAKRYGWRDITIRF
ACDRTTAWRRWQKALEIVAARLNEKGLPNFAKAATMKP
>MCA2697 hypothetical protein
MASQCVAYCEGFTSWHDAMIPSPIAKSHERSVGEPLNEPDSAGKNTSRRQ
SGLENG
>MCA2717 hypothetical protein
MNRHTRLLTLCVLVLIQVWAPLVHAHPADGEPWGKFHVPGLEFLERTPGE
GWHVPDAGAETSVVVAMQGGILEPSRVSSPRWTHEQTPDHPPCHHDTAPV
TLERAAEFPRDPFIPLPYGFPVVQPHSARAPPADA
>MCA1831 hypothetical protein
MTGYGFIDQNHCKSAIPGESPQEQETVAKDPEEYPFYIIPAPPVPGRTSW
SVVPW
>MCA0704 conserved hypothetical protein
MRLWTLHPCYLDAKGLVALWREALLAQAVLRGQTRGYTHHPQLSRFRAMP
DPAAAIAHYLRAVHAEAERRGYRFDAGKIAPCPVPTPMTATDGQLTYEWA
HLKAKLRIRAPAWLEPMETLKRPEPHPSFRIVPGPVADWEVLPQQPNTSA
ADPPAARR
>MCA0756 hypothetical protein
MGLRRILSVSGRQDCAIPTAGIRWRRECTFTVRRHMPLMKPYHGRNAGRT
ERRSRAFGPLAQESRSSSGVRGFCPVPVRRHGSRYLLLDRTVIPVPAAGS
RWSWRGGHFRYRVSMIVAASPKGSIFGRIAATAPPKFRRGLAHGGRPGAY
SPVADKDLAAWKEAEETTAAPAMACRLESVQPGDQLASGGGASMIHRGRF
P
>MCA2113 conserved domain protein
MSIRTRTPEGGDVTMNEREFATYVEHVFRHHNRVLDELITSEQTALANGD
VDADELEEAEADMIRTCDPLNEVIAAEAEQHHASFSTLMKLADVVPECES
MTRELEALLHPAPATLGAPRVNSGGAGTSGTGEGPASDTAP
>MCA2030 hypothetical protein
MENQEKDNFWLWIGALIVGTVVLVVLFKARENEVSGWQSGAAQEAAKEVD
AQIAKQKTK
>MCA3098 hypothetical protein
MAVGDTGIRFGGYASVEAGAPSPGPWEFSVSDLSLFTTWTDHAHWRFFSE
TEVGEALTAGGNQGFSTRNAHFELERLYIDRLFDDLFSLRFGKFLTPVGR
WNLLHASPLTWTTTRPVATYDLFSKHATGLMLHGGFPVLDRHMDYAFYAD
LTEDLDPYRSADPFDNALGSHWLYSLADNLELGLSYVNYQLHDDPSHRFN
LVGVEGAWFYRRFELNTEWVYRNGGLQGLYQGFVQGVAPLVSHWYAIGRY
EYFDQVQQPAGQLGVFGLAFRPVPPLVWKVEYRIGSHNEISAPDGLYGSF
AVLF
>MCA0035 hypothetical protein
MSTPAVADPSLRLRPWASLVLFLSAYSPLMLILIIKDYDAVNPGWLPQNP
VFSGTLLLVAVFSSLAVLRSVKEIDGGLTVTVTKASNKSGDMFGYTIPYM
LSFMKVDLGDWQTIVSLVLFLGILFIMAYRTQTVFVNPILALAGYMLIDC
TFRRGDKEIQAMVVTRQPLAVGDTCRLERLSHYLYVAARDSSTPEEKRGK
A
>MCA3082 hypothetical protein
MRGQPAGAAVAVRDSGESRPSGGEFDAEVLRPRQRELQALPVRRWHSVPS
RFESGH
>MCA1085 conserved hypothetical protein
MRISHQFAFSFSQTFDMRSAKSKRVLPFSLTPDDRRDLQRAFDHLEYPSF
AARLSGVVGTPIEMAVKLLPRSWYMAWHRGVDAAVAKALSMAIGSLSASA
APLADRRYWMMGAVSGAVGGFFGGPALLLEIPLTTVLMLRAIADIARQEG
EDVESPEGQLACLEVFALGGRSSEDDAADTGYYGLRLALELPLSSAARHI
ARRGLDGRSAPALVNLVGHVSERFGVALSERAAAKLIPVVGALGGALINN
AFIQHFQDTARSHFTIRRLERKYSRSLIEAEYKKLKRRPGKASVRYTALA
A
>MCA1887 conserved hypothetical protein
MANKASSSNTRHVVPNPKGGWDNKRGGASRAGSHHDTKQEAIDAARRMSQ
REGSELKIHNRDGKISQSDSHGNDPKNIPG
>MCA2116 leucine rich repeat domain protein
MPEPVTDAATTAGTGAVSALAPGCAGCEFQKDLLASSRCDPGDACIQVQN
SRRIDGFLKRNPHLAERYLFDSRWEQRAVAARYASQSALKPLLFDDDEAV
RRTVAYRVPVDWLDALMHDVDREVRITVADRLPAERLEAMVVDPDYLVRV
YVARRLPVGRLFRLIADPDEQVRKEVARRLPPQSLALMAYDENLQVRQTV
AERMDPAGATLLLRDREWLIRYLAAAKAPPESLRGLLDDPVEEVRELVRA
RLAGG
>MCA0962 putative membrane protein
MPAMSCPPYLIPRHESGARLAAAALMTGFFVISLTVGDVINKDGVLYLDA
AAAFLDQGVKAAVAVYGWPGYSLLIAAASRLTGWPLETSAHALGAFCFLV
IADCFVRLHFVLRGAGETPSGWTPVLLILAFPALGDRLNIVRDWGFLAAS
LWGLLHLVQFRFEPRGKLGHALLWQAGMAGAFVFRIEALVLILSVPLYFL
VEPLPWRARAHNFLLPISGIVPVLILGAALLASGRLAPGKLWEITAYGYH
DPTLIWKFFSLHADRLAAALPNDYGAEYAKLILGTGLVATLLWLIGANLG
PVLLAIFAYAVHRFGGRLPPRFGLILWAAAMASGTLLVFLAVQLVPDKRY
ALLPSALLLLAGTVYLERIAAGTGSPWARRLALGLVAVLCIKSAVVTPDY
RLYLRDVGGWVRENIPAGASLVTNDPRIDYYAGRPMNQEQTRRLMPLDRL
ADWLTQPDLPDYLALRLTSPRDLKKAEKALNREALKTFSPSPREHVLIYR
LNDGSSRSR
>MCA0264 hypothetical protein
MQVTTILVDGVPRAVVRPNDRKDLARFLRNGRAYFTADTPDAELSHRPAD
EAEAARWRSAYRLHLAWGGSEEWFFGVPL
>MCA1585 putative fatty acid cis/trans isomerase
MRPFIALFLLALVGGCAVYGQYRLDQRYGAPDPRRFDRPRSTAALPVEYR
RDVKPILDSRCVVCHACYDAPCQLQLGSYEGVTRGANGEQVYDAARLLAA
DPTRLFLDARSTAEWRSKGFHPVLNERNSTTEANREAGVMANLLALKRRF
EFPSYGLLPRDRYDFSLDRSQQCPAIEEIGDFEAEHPEWGMPYGLPPLSE
RENRTLMAWLESGAPAAPEPALSAAERDQISRWEAFLNGADPKMRLMSRY
LYEHWFLAHLYFDELPPGGYFELVRSRTAPGQPIDAIATRRPYDDPGVAR
VYYRLRPHRPALVSKTHMPYALNTARMARIKTWFLDEPYTVKALPSYEPK
VSANPFITFEQIPVRSRYRLLLDEAQFTIMNFIKGPVCRGQVALNVIDDH
FWIGFIHPDNPVFNETSAFLADVLKEVGLPTEQQSNALLTKWLLYSRGQT
DYLRRKSELVNRRFTGRNSPSLSMLWDGDGYNPNAALTVFRHFDSASVVK
GLLGDRPQTALIMGYTLLERIHYLLVAGFDVYGNTAHQLEARLYMDFLRM
EGEMNFLALLPAGDRNAVRDFWYRGAGDEVKTYLNGSKAYFTQRTGVEYR
TGDPLTELFGLWKAHLAPVLDHRHEYAQSTQDDPNAGLLKTLSGLRGRML
SILPESSFLTVQGEDGVDRHYTLIRNSAHSNISELFFEEDRRLPDEDTLL
VARGFIGAYPNAFYRLRSNRLAAFIDQLGRLRSEQDYAEFAARFAVRRTD
PRFWAHSDAVHEAFRKADPVEASLFDFNRLENR
>MCA0458 putative cysteine rich repeat domain protein
MRTIASDTGIGRLKHVALSTFLAIAGTGPANAAPHPLPCSDEIARYCKNL
GLGGGRVQNCLREHDAQLSPACRAGLESALARHREIQAACGADVERLCKD
VRPGKGRLAACLKEHHREVSSLCKDTLAAARKAGSRTGPTRRTQGIHP
>MCA0435 conserved hypothetical protein
MKFLDSTRADTARRFPLRSLFASVLLVTGAFFGQAAVAPAADRTPSADRA
GKEGPPEAPARPASAPLPFESAVPALLQAAVARLRQEIENGPPFSLTERK
RRIDGLEKLLAEPGVDIAEKYRRVLEAYRAELEYGKTVEAYRDLLRTEEG
ERLVDFLSIGRLALYYQTLDGRESGMWDDRDRQWHRLAPRYDRAIALGLR
AARKLEPPQLLPLPLPGPRSPSP
>MCA2674 conserved domain protein
MSYPLYDTFASAPAAGYTTVLGGMSASHNAAQQALDLSASSTQSILRFNE
AANGDFWFEADIELLTDPSGRKHVGLWMTTGNAAEGYRFAHLDSAWSVSR
WSSGFGDGSAVTGSVNDGARPMAGIAATAPTFNVGQRRVLRCEVITGAPD
ANGVPWSRLLQFSAGGVVLFQVADATYRGKLVPGVFLYGATARVHAIAGD
TPSGLPAFPATVAVNAADDLLPLAGGPTSVLPDPASPIGVAADCDLMRRN
SPASDLWNRPGGYDHDFHPIQSGRKDIHFSGHGVIAGTVKEKGQPDQPLV
RRVQIISENANVLVAETWSDAAGNYRFEFIDPAQRYTVVSYDYKHLYRAV
IADNLKPELLP
>MCA2712 conserved hypothetical protein
MLLSLRYNFLFVHIAKTGGTSVRDSLWRYKWTDPARIPQFLCSKLSALTG
HEIGAKFPRHAKAIAAMEMLPRELYQKLFKFAFVRNPWDLQVSSYHHIRR
ERPDLLEGRDDFEAFLCWKLDPERAPQYHADMSIELQTDYLVDLHGNLIV
DFVGRYERLAEDFEEACRRIGIACPKLLHSRKATDRNKDYRRYYNDTTAE
LIEAHYRPDIERFGYRFDEPFP
>MCA2737 hypothetical protein
MNSEADPIVGNWYRHLDKGQAFTVVAIDDDASTVEIQHFDGNLEEIDLDT
WSLLPIEPIEEPENWSGPMDVAELDDLGGTEITDTLPEDWSEPLEEIIKQ
EERPAEEISEEAEEEDALFEEEAPETIPLETEPSAEASEKTEEEERAKGG
GTAYSEDE
>MCA3014 conserved hypothetical protein
MRYATSTSSAMRDKALEFEKRRGAQRMAEIGAGGDAAWNTGAIETLREAV
AAQGQRSDVRWADALSVSVRPMITCWFRALYCAAKTATSVGAISGGAD
>MCA0454 hypothetical protein
MKPLHMLFSLLGLSACAMDTFTTGVPVDELEKVSGEKPAHHARNVDGVDI
WTTGAPDRKYKILGMIHDVRRNVPWRTQTYLSDLAQATKKAGGDAAIIII
IADSKLVYKMGMGCDASVETTEKCIHAQGASLSGAAPGEEATIYSENIES
TSVPLEYKDSRVLVIKYLDTK
>MCA0188 conserved hypothetical protein
MQGRSQHSADHLHKQSTPCLAVGGPCMHKCARLVLICSAVLAGCSANPLL
PEARAVRIVTSEPAGCEYLGEVTGNQGNFFTGDFTSNANLETGARNGMKN
EAARLGANTVLLLTSRAGETGSMRISNGSGSGHVAETNVTYSGIAYKCPG
R
>MCA0709 hypothetical protein
MSYPKALLAARQHLVVILGLLSAAALVSWLEMPSCPVPPPLPAGEIPGAL
HGALSEEETRWAQAAWRYFQRNSQEETGLVNAADGRSVTTLWDTGSHLLA
MIAAHRLGVIDRSEFDQRLGRVLNALARIPLFQGELPNRFYLTETLGMAD
AEGLPSDRGVGWSAVDIARVLVPLDVIVWNYPAHAAEASAVVRRWRLDRL
VREGVLTGAVVDAQGQVEYSQEGRGAYEAYAAKALALMGIAAPAVAVSHG
YAKVQGVEVPFDPRPPERFSTRNYTVGEPYVIEGLEFGWDRGGREEEAWR
VFRAQENRFKATGRLTAASSGPVDEAPYFVYNAVYAGTRSWQTLTTDGRE
VSDFRFLNAGDAVGWHVLYDTEYGRQLAEKAAGLFDPERGWYAGWYETKD
RVNRAMSAGTNAMVLEAMHFRQFGALVAVQPPLELAEPAPSTRPQAAQPP
PDGRGAAARAKPAAKSGPVLRARTGMAGSRKGKGASAAPKTSTGHGAKTR
KTDARTERKHR
>MCA0145 hypothetical protein
MVPDPAGAFRLLLRCCLVCATGPAWAVADRTDWFQPFVSNTFTYDSNVFR
LPSSLTAAEANALIDNVSKGGSKADFVNRISAGAKVNWYSGPHRVNMNLG
VDDNIFVRNTALQNLSTLNEASWIWDVSRAWSTQAGAEYQQYLNAFGNYV
IPRKDMLTNMAYFGKVKYQFSPDLRAGIGVRWDEISHSVRNRRQNDVQEL
SETVELVYQNATRNSAGIEYRHGAGFYPQRNPAKFSFDQYEVNSLKFVGG
YMFSVKTRFDGYVGYLAQTFAQAPWFDYSGCIWRMGVEWLPTDKTSVKIS
GWQNLEAYQDVASSYFVGNGVKLLSKWEASRKVEVAGEFSWESRNYVGGG
GANSSGTPVSRRVDDFFAAQAGFTYRPVDYVDIFLTYRYEKRKVNRVFFN
YEYDGIALGVALRF
>MCA1386 hypothetical protein
MTARFSPARLPGPEPTPTGERNLSAAEGPADPILFALLGLLRLHADAARW
AVDVFRSHRIDIDTDHPRDFPPRGFRFRTHP
>MCA0194 hypothetical protein
MSGSRIVNTFPRSSSRQKKAPAANDAAGVPRGAFLSQTHHKEKRRSGQET
RPELAEHRD
>MCA2734 hypothetical protein
MIGAIAVLFVAYWFYRTAVRVKLPPLPWAAGGVIVYYAGFVVWLYLVLKP
LLGDSFRHHSFLLGLGMDLSAVLVGTLLAALFRAKVMLKQGDTPYEKHF
>MCA1154 hypothetical protein
MLRSGSTLQYNIAAMVMEMRGSLQRSGFMGDFSKPETRAKLDKLKAAEGW
SIVKTHEAPLPREFYNERVRVLFSYRDVRDIAASIRKKWDPPFEKILSQI
DAMIEIKAAFAEIPGVLMQPYDLLFHDIPSATRQIAHHLGAEVTDAEVCA
IANALSVNALNNTGPEDVGFFSRFVSRIVRRDYDNKTLLHADHISATGGR
DGDWANQFSGDEIARLEAHYSEWLKAHDYRASHCS
>MCA0750 hypothetical protein
MWFNSMQREEPYLALTCLESCRDAGVPSGAKTQVLHGCRQLVS
>MCA2809 hypothetical protein
MRRNVIRYCAHRWCNIDLCTIMMHSTATLRPVQEERPMLTRMTTTLGIVL
TMGLLAGQADAKPDIVTENDVKNCQFLKTVRGSSGYGKKFGSWQPEAKAS
AEREATQIGASHIVWSDIKPSGAFNGIATAKAYDCGR
>MCA2394 ccp, cytochrome c'
MNKPSFLLVGLLVVSGVLGAAETKVKYPDGFRSWYHVKSMVIQPGHPLEN
PFGGIHHVYANAEAIQGLRGGNYPDGAVLVFDLFDYQEDNHALVEGKRKL
IGVMERDAKRFSATGGWGYEGFGEGKPDKRLVTDGGQGCFGCHAAQKESQ
YVFSRLRD
>MCA2218 kdpF, K+-transporting ATPase, F subunit
MSPVRRPSACEFARTGFFMPPLRTLPSSLGDLYPEIAKLSRPKFKLQGIW
SYGRFLSRGINWRVHRDDRFGSWPGMPEGERMSWIHLLSALLALGVSVYL
VFALLYPEKF
>MCA1196 mmoB, methane monooxygenase, B subunit
MSVNSNAYDAGIMGLKGKDFADQFFADENQVVHESDTVVLVLKKSDEINT
FIEEILLTDYKKNVNPTVNVEDRAGYWWIKANGKIEVDCDEISELLGRQF
NVYDFLVDVSSTIGRAYTLGNKFTITSELMGLDRKLEDYHA
>MCA1199 mmoD, methane monooxygenase, D subunit
MVESAFQPFSGDADEWFEEPRPQAGFFPSADWHLLKRDETYAAYAKDLDF
MWRWVIVREERIVQEGCSISLESSIRAVTHVLNYFGMTEQRAPAEDRTGG
VQH
>MCA1194 mmoX, methane monooxygenase, A subunit, alpha chain
MALSTATKAATDALAANRAPTSVNAQEVHRWLQSFNWDFKNNRTKYATKY
KMANETKEQFKLIAKEYARMEAVKDERQFGSLQDALTRLNAGVRVHPKWN
ETMKVVSNFLEVGEYNAIAATGMLWDSAQAAEQKNGYLAQVLDEIRHTHQ
CAYVNYYFAKNGQDPAGHNDARRTRTIGPLWKGMKRVFSDGFISGDAVEC
SLNLQLVGEACFTNPLIVAVTEWAAANGDEITPTVFLSIETDELRHMANG
YQTVVSIANDPASAKYLNTDLNNAFWTQQKYFTPVLGMLFEYGSKFKVEP
WVKTWNRWVYEDWGGIWIGRLGKYGVESPRSLKDAKQDAYWAHHDLYLLA
YALWPTGFFRLALPDQEEMEWFEANYPGWYDHYGKIYEEWRARGCEDPSS
GFIPLMWFIENNHPIYIDRVSQVPFCPSLAKGASTLRVHEYNGQMHTFSD
QWGERMWLAEPERYECQNIFEQYEGRELSEVIAELHGLRSDGKTLIAQPH
VRGDKLWTLDDIKRLNCVFKNPVKAFN
>MCA1195 mmoY, methane monooxygenase, A subunit, beta chain
MSMLGERRRGLTDPEMAAVILKALPEAPLDGNNKMGYFVTPRWKRLTEYE
ALTVYAQPNADWIAGGLDWGDWTQKFHGGRPSWGNETTELRTVDWFKHRD
PLRRWHAPYVKDKAEEWRYTDRFLQGYSADGQIRAMNPTWRDEFINRYWG
AFLFNEYGLFNAHSQGAREALSDVTRVSLAFWGFDKIDIAQMIQLERGFL
AKIVPGFDESTAVPKAEWTNGEVYKSARLAVEGLWQEVFDWNESAFSVHA
VYDALFGQFVRREFFQRLAPRFGDNLTPFFINQAQTYFQIAKQGVQDLYY
NCLGDDPEFSDYNRTVMRNWTGKWLEPTIAALRDFMGLFAKLPAGTTDKE
EITASLYRVVDDWIEDYASRIDFKADRDQIVKAVLAGLK
>MCA1198 mmoZ, methane monooxygenase, A subunit, gamma chain
MAKLGIHSNDTRDAWVNKIAQLNTLEKAAEMLKQFRMDHTTPFRNSYELD
NDYLWIEAKLEEKVAVLKARAFNEVDFRHKTAFGEDAKSVLDGTVAKMNA
AKDKWEAEKIHIGFRQAYKPPIMPVNYFLDGERQLGTRLMELRNLNYYDT
PLEELRKQRGVRVVHLQSPH
>MCA2589 mopE, surface-associated protein precursor
MRDTMNEKHCYSLLAAGLIAAVPQLAAAHGGTHDVTAVAHLSYSEAYSEK
LKKGQEVGTELLVLDGRFEFNEHVGMGDITPDTTWSAVVQGQTLATGTLG
DATKKKFGAKGGMAVINVPGGGTLKFTWNAKAIMLKLKWTGEPALARLYK
DQNTSINLPQFPVDIAIGSLHGYFNVPVTGQAKATTKNGTMLSKIALKGT
ANSAGLDTLDRDGDGSTADADCNDFAPTIHPGAAEATLDGVDSNCDGRDS
GVAEVVETFKNPGTYSSPVINFKIASPPGPGTPIYGPPRDFSGYNKSYSL
AIGKTSYYDPTTGTKWNDDTITPVSDGQDIWRGWTHTGKWSFFNGKAGDK
ITLSVQRDAQEASLKGAHPGFILFWRPEGGPLFWAGTQDLDEGQTALPAD
SDTVIGHVIVQHADWTLQGLPPKADHTAPAGVDTELYPMKPDSYTMYYVD
SGYDADKYVASKKLIMHPTAFKGLALNDGTAGAFTKSITLPKTGYYMLYV
ANVLEVDDWSVDADGKLTTTGEVWEVPAKGCWVNITISKP
>MCA0782 mxaI, methanol dehydrogenase, small subunit
MMQKTSFVAAAMAVSFAAGVQAYDGTHCKAPGNCWEPKPGYPDKVAGSKY
DPKHDPNELNKQAESIKAMEARNQKRVENYAKTGKFVYKVEDIK
>MCA1796 pmoB, methane monooxygenase, B subunit
MKTIKDRIAKWSAIGLLSAVAATAFYAPSASAHGEKSQAAFMRMRTIHWY
DLSWSKEKVKINETVEIKGKFHVFEGWPETVDEPDVAFLNVGMPGPVFIR
KESYIGGQLVPRSVRLEIGKTYDFRVVLKARRPGDWHVHTMMNVQGGGPI
IGPGKWITVEGSMSEFRNPVTTLTGQTVDLENYNEGNTYFWHAFWFAIGV
AWIGYWSRRPIFIPRLLMVDAGRADELVSATDRKVAMGFLAATILIVVMA
MSSANSKYPITIPLQAGTMRGMKPLELPAPTVSVKVEDATYRVPGRAMRM
KLTITNHGNSPIRLGEFYTASVRFLDSDVYKDTTGYPEDLLAEDGLSVSD
NSPLAPGETRTVDVTASDAAWEVYRLSDIIYDPDSRFAGLLFFFDATGNR
QVVQIDAPLIPSFM
>MCA2853 pmoB2, methane monooxygenase, B subunit
MKTIKDRIAKWSAIGLLSAVAATAFYAPSASAHGEKSQAAFMRMRTIHWY
DLSWSKEKVKINETVEIKGKFHVFEGWPETVDEPDVAFLNVGMPGPVFIR
KESYIGGQLVPRSVRLEIGKTYDFRVVLKARRPGDWHVHTMMNVQGGGPI
IGPGKWITVEGSMSEFRNPVTTLTGQTVDLENYNEGNTYFWHAFWFAIGV
AWIGYWSRRPIFIPRLLMVDAGRADELVSATDRKVAMGFLAATILIVVMA
MSSANSKYPITIPLQAGTMRGMKPLELPAPTVSVKVEDATYRVPGRAMRM
KLTITNHGNSPIRLGEFYTASVRFLDSDVYKDTTGYPEDLLAEDGLSVSD
NSPLAPGETRTVDVTASDAAWEVYRLSDIIYDPDSRFAGLLFFFDATGNR
QVVQIDAPLIPSFM
>MCA1798 pmoC1, methane monooxygenase, C subunit
MAATTIGGAAAAEAPLLDKKWLTFALAIYTVFYLWVRWYEGVYGWSAGLD
SFAPEFETYWMNFLYTEIVLEIVTASILWGYLWKTRDRNLAALTPREELR
RNFTHLVWLVAYAWAIYWGASYFTEQDGTWHQTIVRDTDFTPSHIIEFYL
SYPIYIITGFAAFIYAKTRLPFFAKGISLPYLVLVVGPFMILPNVGLNEW
GHTFWFMEELFVAPLHYGFVIFGWLALAVMGTLTQTFYSFAQGGLGQSLC
EAVDEGLIAK
>MCA0295 pmoC3, methane monooxygenase, C subunit
MATTTAGGIAAIDRPLLDKKWLVFAIGIYTVFYLWVRWYEGVYGWSAGLD
SFAPEFETYWMNFLYTEIVLEIVTASILWGYLWKTRDRNLAALTPREELR
RNFTHLVWLVAYAWAIYWGASYFTEQDGTWHQTIVRDTDFTPSHIIEFYL
SYPIYIITGFAAFIYAKTRLPFFAKGISLPYLVLVVGPFMILPNVGLNEW
GHTFWFMEELFVAPLHYGFVIFGWLALAVMGTLTQTFYSFSHLFERDLCP
DIR
>MCA1445.1 pqqA, coenzyme PQQ biosynthesis protein A
MRWEKPSYNDMRFGFEVTMYIYNR
>MCA1448 pqqD, coenzyme PQQ synthesis protein D
MSLQPDTLLELSPLLRMQWEEAQQRYVILYPEGMIELNETAAAILELCDG
QHNLTSIVDKLERKYDASGIEPDVREMLESALNNGWIREIIAY
>MCA1989 soxZ, sulfur compound-chelating protein SoxZ
MKIRARTADGITTVRLLITHPMETGRRRDETTGMVVPAHYIEELTLEHDG
KPVVRCRLTTAVSKDPYFSFQFRGGRPGERIRVNWTDNLGNTESQEGVIE
>MCA1107 ybgT, cyd operon protein YbgT
MWYFAWILGVGFACAFGVINAMWLESVCDIDDPAAGGAEN