TitleGenColors Logo

Gene list

Applied filters:

COG category: Unclassified
Organism: Mycobacterium tuberculosis CDC1551, CDC1551
Gene type: CDS

Number of genes found: 1296

Free access
Sort by:

 



# Mycobacterium tuberculosis CDC1551, CDC1551

>MT0585 hypothetical protein
MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAW
LAKISCERLSRGVDGDAYKSATFLQRNLPRGTTQGQAFQFLGAAIDHYCP
EHVGVLQRAGTR
>MT2708 hypothetical protein
MTDSEHVGKTCQIDVLIEEHDERTRAKARLSWAGRQMVGVGLARLDPADE
PVAQIGDELAIARALSDLANQLFALTSSDIEASTHQPVTGLHH
>MT3247 PPE family protein
MSSASRAGEGAGLMNYSVLPPEINSLRMFTGAGSAPMLAASVAWDGLAAE
LAVAASSFGSVTSGLAGQSWQGAAAAAMAAAAAPYAGWLAAAAARAAGAS
AQAKAVASAFEAARAATVHPMLVAANRNAFVQLVLSNLFGQNAPAIAAAE
AMYEQMWAADVAAMVGYHGGASAAAAQLSSWSIGLQQALPAAPSALAAAI
GLGNIGVGNLGGGNTGDYNLGSGNSGNANVGSGNSGNANVGSGNDGATNL
GSGNIGNTNLGSGNVGNVNLGSGNRGFGNLGNGNFGSGNLGSGNTGSTNF
GGGNLGSFNLGSGNIGSSNIGFGNNGDNNLGLGNNGNNNIGFGLTGDNLV
GIGALNSGIGNLGFGNSGNNNIGFFNSGNNNVGFFNSGNNNFGFGNAGDI
NTGFGNAGDTNTGFGNAGFFNMGIGNAGNEDMGVGNGGSFNVGVGNAGNQ
SVGFGNAGTLNVGFANAGSINTGFANSGSINTGGFDSGDRNTGFGSSVDQ
SVSSSGFGNTGMNSSGFFNTGNVSAGYGNNGDVQSGINNTNSGGFNVGFY
NSGAGTVGIANSGLQTTGIANSGTLNTGVANTGDHSSGGFNQGSDQSGFF
GQP
>MT2593.1 hypothetical protein
MPLLLTHRKCWGDDVNSAIIKIAKWAQSQQWTVEDDASGYTRFYNPQGVY
IARFPATPSNEYRRMRDLLGALKKAGLTWPPPSKKERRAQHRKEGAQ
>MT0404 hypothetical protein
MAVGRCAIPRFDQAASGSAINGGQVHLSDGSTSPARQLPAPWPGDAGAAA
EGRAGVCCRGNRLPHVSDVGVSHRFDHRPAGVGAGGCRAGAAGAGLAVDD
PGQLAAAIDRIVAVADPDAVRQVRERARDREVSIWNSADGMGEVYAQLYA
TDAQALDARLNALVATVCAGDPRSTDQRRADALGALAAGADRLACRCDNP
DCAAEGRPVSAVVIHVVAEQASVKGHGQAPAALLGGDGLIPAELVAELAK
TAGLQPIPVPAGTEPGYRPSVKLAAFVRARDLTCRAPGCDRPATQCDLDH
TIAFADGGATHAANLKCLCRLHHLLATFCGWRAQQLPDGTVIWTLPGNQT
YVTTPGSALLFPALCTPTGDPPAPEPARADRRGQRTAMMPRRASTRTQNR
AHCIAAERHRNHQARRIAQAAVIATETHGPPPDPDDDPPPF
>MT2488.1 hypothetical protein
MVSDPRAARRLRSPHAMDGDRPRGSALADRVAEYTLDRGEPSPDGRRSPV
LVVRGCWRGLSHLDPSPKRLVHSLGLIHSRAARRRRARRFWRSPQLRSAH
ANRTARRATAKAARCRPGYRLARRICTLRPGATRSNGRRSRPRRATRRSE
LAAAALASRHFPWARLGGQDTRRSGPCRRRRIGGDRRPRGAGDGIHLDPR
PD
>MT3573.11 hypothetical protein
MRDTAFRTLDSCVRDGLMSSRAAETAETLCRTGPPQSTSWAQRWLAATGS
RDYLGAFVKRVSNPVAGHTVWTDREAAAWREAAAVAAEQRAMGLVDTQGG
FLIPAALDPAILLSGDGSTNPIRQVARVVQTTSEIWRGVTSEAPKLVGTP
KPRRCPTIRQRWPSRRCRTTVEAAGFRSPSSWRVTRRASLARSARFSRTA
LSNCRPRRSSTAPATASPPGSSAR
>MT3612.1 PE_PGRS family protein
MPRWPRRPVGSSPMSFVLIAPEFVTAAAGDLTNLGSSISAANASAASATT
QVLAAGADEVSARIAALFGGFGLEYQAISAQVAAYHQRFVQALSTGAGAY
ASAEAAAAEQIVLGVINAPTQALLGRPLIGDGANATTPGGAGGAGGLLFG
NGGAGAAGAPGQAGGPGGPAGLWGNGGPGGAGGSGGGTGGAGGAGGWLFG
VGGAGGVSGAGGGTGGAGGPGGLIWGGGGAGGVGGAGGGTGGAGGRAELL
FGAGGAGGAGTDGGPGATGGTGGHGGVGGDGGWLAPGGAGGAGGQGGAGG
AGSDGGALGGTGGTGGTGGAGGAGGRGALLLGAGGQGGLGGAGGQGGTGG
AGGDGVLGGVGGTGGKGGVGGVAGLGGAGGAAGQLFSAGGAAGAVGVGGT
GGQGGAGGAGAAGADAPASTGLTGGTGFAGGAGGVGGQGGNAIAGGINGS
GGAGGTGGQGGAGGMGGSGADNASGIGADGGAGGTGGNAGAGGAGGAAGT
GGTGGVVGAAGKAGIGGTGGQGGAGGAGSAGTDATATGATGGTGFSGGAG
GAGGAGGNTGVGGTNGSGGQGGTGGAGGAGGAGGVGADNPTGIGGTGGTG
GKGGAGGAGGAGADATATGATGGTGFAGGAGGAGGQGGSSGAGGTNGSGG
AGGTGGQGGAGGAGGAGADNPTGIGGAGGTGGTGGAAGAGGAGGAIGTGG
TGGAVGSVGNAGIGGTGGTGGVGGAGGAGAAAAAGSSATGGAGFAGGAGG
EGGAGGNSGVGGTNGSGGAGGAGGKGGTGGAGGSGADNPTGAGFAGGAGG
TGGAAGAGGAGGATGTGGTGGVVGATGSAGIGGAGGRGGDGGDGASGLGL
GLSGFDGGQGGQGGAGGSAGAGGINGAGGNGGDGGDGATGAAGLGDNGGV
GGDGGAGGAAGNGGNAGVGLTAKAGDGGAAGNGGNGGAGGAGGAGDNNFN
GGQGGAGGQGGQGGLGGASTTSINANGGAGGNGGTGGKGGAGGAGTLGVG
GSGGTGGDGGDAGAGGGGGFGGAAGKAGGGGNGGVGGDGGEGASGLGLGL
SGFDGGQGGQGGAGGNAGAGGINGAGGAGGTGGAGGDGAPATLIGGPDGG
DGGQGGIGGDGGNAGFGAGVPGDGGDGGNAGFGAGVPGDGGIGGTGGAGG
AGGAGADGDPSIDGGQGGAGGHGGQGGKGGLNSTGLASAASGDGGNGGAG
GAGGNGGDGDGFIGGSGGTGGTGGDAGVGGLANTGGTAGNAGIGGAGGRG
GDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTGGAG
GDGAPATLIGGPDGGDGGQGGIGGAGGNAGFGAGVPGDGGIGGTGGAGGA
GGAGADGDPSIDGGQGGAGGHGGQGGKGGLNSTGLASAASGDGGNGGAGG
AGGNGGDGDGFIGGSGGTGGTGGDAGVGGLANTGGTAGNAGIGGAGGRGG
DGGAGDSGALSQDGNGFAGGQGGQGGAGGNAGAGGINGAGGTGGTGGAGG
DGQNGTTGVASEGGAGGQGGDGGDGGEGGGAGFGSGVAGAAGAGGNGGKG
GDGGTGGTGGTNFAGGQGGAGGRGGAGGNGANGVGDNAAGGDGGNGGAGG
LGGGGGTGGTNGNGGLGGGGGNGGAGGAGGTPTGSGTEGTGGDGGDAGAG
GNGGSATGVGNGGNGGDGGNGGDGGNGAPGGFGGGAGAGGLGGSGAGGGT
DGDDGNGGSPGTDGS
>MT1409 hypothetical protein
MAETTEPPSDAGTSQADAMALAAEAEAAEAEALAAAARARARAARLKREA
LAMAPAEDENVPEEYADWEDAEDS
>MT1233 PE family protein
MRVLGPFEDGVHVSFVMAYPEMLAAAADTLQSIGATTVASNAAAAAPTTG
VVPPAADEVSALTAAHFAAHAAMYQSVSARAAAIHDQFVATLASSASSYA
ATEVANAAAAS
>MT0193 conserved hypothetical protein
MTNDKMLARIAALLRQAEGTDNPHEADAFMSTAQRLATAASIDLAVARSH
AGNRSPAQAPTQRTITIGAAGTRGLRTYVQLFVLIAAANDVRCDVASNST
FVYAYGFAEDIDTSHALYASLVVQMVRASDAYLASGAHRPTPTITARLNF
QLAFGARVGQRLADAREQTRQEATKDRDRPPGTAIALRDKDIELHEYYRR
SSKARGAWRASRATAGYSSAARRAGDRAGRQARLGNNPELPGARAALGR
>MT1095 conserved hypothetical protein
MVMPLVTPTTAVPSPGPTRLRVADLLRATDQAADDVLGGRCDHLLPDGGV
PQTQRWYTRIHGDEELDIWLISWVPGQPTELHDHGGSLGALTVLSGSLNE
YRWDGRRLRRRRLDAGDQAGFPLGWVHDVVWAPRPIGGPDAAGMAVAPTL
SVHAYSPPLTAMSYYEITERNTLRRQRTELTDQPEGSG
>MT0099.1 hypothetical protein
MRGLRVVSKVTSAFFALFTPARRARKAARVNLDQVAQCRRTDEGPTLCQH
CQPGSARALPTAAWSRQSQRVPATHCRPCCAPGAAASALTCALCAEAWSV
VEVRPAPRAATFSLQSRAARESR
>MT0691 DNA-binding protein, CopG family
MAAALFLPNTRAYRRYNRSVWAVRGSTRPQWQPPPKFQHAKCMSMRLAHR
LQILLDDECHRRITAVARERGVPVATVVREAIDRGLVSPAGRRKSAGRRL
LDAADMSVPEPRELKQELEALRARRG
>MT0982 hypothetical protein
MNRVSASADDRAAGARPARDLVRVAFGPGVVALGIIAAVTLLQLLIANSD
MTGAWGAIASMWLGVHLVPISIGGRALGVMPLLPVLLMVWATARSTARAT
SPQSSGLVVRWVVASALGGPLLMAAIALAVIHDASSVVTELQTPSALRAF
TSVLVVHSVGAATGVWSRVGRRALAATALPDWLHDSMRAAAAGVLALLGL
SGVVTAGSLVVHWATMQELYGITDSIFGQFSLTVLSVLYAPNVIVGTSAI
AVGSSAHIGFATFSSFAVLGGDIPALPILAAAPTPPLGPAWVALLIVGAS
SGVAVGQQCARRALPFVAAMAKLLVAAVAGALVMAVLGYGGGGRLGNFGD
VGVDEGALVLGVLFWFTFVGWVTVVIAGGISRRPKRLRPAPPVELDADES
SPPVDMFDGAASEQPPASVAEDVPPSHDDIANGLKAPTADDEALPLSDEP
PPRAD
>MT0160 PE family protein
MAPFGFTPKARHNRGVALRSTYRLDGWVMGPVDKEGWGLSYVFAQPSVLA
AAATDLAGIGSAINQATAAVAAPTTGLAAAAADEVSTALATLFGAYGQQF
QAISAQVAAFHNEFTQRLAAATNAFVNAEATNTSALVQEATAGLFKPTSP
PVLPPMFNQNTAIIMGGTGSPIPTPSYVNAITTLFIDPVVSNPVVKALVT
PEELYPITGVKSLPFQTSVQLGLQILDGAIWEQINAGNHVTVFGYSQSAV
IASLEMQHLISLGPNAPSPSQLNFILIGNEMNPNGGILARIPGLNVTTLG
LPFYGATPDNPYPTTTYTLEYDGFADFPRYPLNVLSDINAVFGILTVHTT
YADLTPAQIASATQLPTQGTTSNTYYIIETEHLPLLAPLRAIPVIGPPLA
ALVEPNLEVIVNLGYGDPRFGYSTSPANVPTPFGLFPDVPASVVADALVA
GTQQGVNDFMVELPAALNTLPQTPMPAFPPYVPTLLPPPPPPQPATLINI
ADTFASVVSTGYSILLPTADLGLAFVTILPAYDLTLFVNQLAAGNLRAAI
ELPLAATIGLAALGGMIEFIAVVVTLADITQQLQSFSI
>MT1279 hypothetical protein
MTCVCPGIMMTCMRTTLTLDDDVVRLVEDAVHRERRPMKQVINDALRRAL
APPVKRQEQYRLEPHESAVRSGLDLAGFNKLADELEDEALLDATRRAR
>MT2367.1 hypothetical protein
MMRLLEILSCPKSRSMRRHCLRSGRCRPTTARWSPFGHHLEVFTAVSWTF
WAAMLSDAMVSTAPGANSITAVLLLRTDNHQGPSGAFSYWCSMAGHRAAS
RVSCAHAD
>MT1116 hypothetical protein
MAPVHREIRDAVSACRVAIFTLATALGVPGQNTTNHIAM
>MT0968.1 hypothetical protein
MLRRTARSSKDRVSAAKADGILKLLAGQEFEELAMKPDDRMGPECRFMAS
RRFGTMAMVALRPKLRCAPDAGARDARKWSR
>MT1172.1 hypothetical protein
MPAGGPGFTGPGVGPARGGVISRHPALPVLTQVYEPAFLVALLTTIRTAS
NLARRRYGPWNGPAMAVPLARI
>MT3630 hypothetical protein
MAFREVSMNKIREVLRVWLGVAGLPAPGCRTIAAHCGMDRKTVRRYVEAR
AGTRSAPRRRCQRYR
>MT3718.1 hypothetical protein
MADVNSFFHAKQIVSFEFVKTACSLVVDYIVCAHLPSCNCAATMRQPPCR
RRDPKAVSPTSMPALRLRSV
>MT2626 hypothetical protein
MGFQRGDPLFEVLGQHERILPRFNATQPCPGRTPALGARRISRAERGSAL
LGSYLDKSGRAEVTIEGIRILDAFLSHHREARGIDERVLSLVVAYKPFPC
LLFQVGCYVLDANDGAQADCSGGNRRAVTAAPVEQRPGLAQDMVGGHHHG
RFAGPQPLRGAVPSVAGVAKCSPEGRVDEDHSCFP
>MT2042.1 hypothetical protein
MLDDPRSAGSVVAPYDDGELLRLAELRASSGLKLPDCCVPDVAIHHQASL
ATFDDTLAAAARTRSVPASTNGAANPIRPASLDIMSLIGL
>MT3313 hypothetical protein
MDYLSTVRTAVGCAGRTTLITPGGPADRRRTEQQYQHAKRDDSHRDPDRR
LMRTHNQIPRHRQQQQLGEHRQPATPKLANRQPASGGEHCSADQYEPTAV
PRPLTMCWSAPASPRTSNAAATTRAAPPCTATISPAPRTAAGARTGTASA
>MT2475 lipoprotein, putative
MTNRWRWVVPLFAVFLAAGCTTTTTGKAGLAPNAVPRPLMGSLIQRVPLD
GAALSTLLNQPFQALPPFPPVFGGSDSLGDSDVSARPADCVGVGYLTQRN
VYRSVEVKSVARVSWRHDGSSVKVDDVDEGVVALPSAAAADDLFARFSAQ
WKECDGTTLTVPASAFGQRSITDVRVADSVVAATVSLRRGTHSILASVPQ
ARAVGVRGNCVVEVAVTFFGITHPSDQGSADISTSAVDIAHAMMDRISEL
S
>MT2262 conserved hypothetical protein
MKLLGHRKSHGHQRADASPDAGSKDGCRPDSGRTSGSDTSRGSQTTGPKG
RPTPKRNQSRRHTKKGPVAPAPMTAAQARARRKSLAGPKLSREERRAEKA
ANRARMTERRERMMAGEEAYLLPRDRGPVRRYVRDVVDSRRNLLGLFMPS
ALTLLFVMFAVPQVQFYLSPAMLILLALMTIDAIILGRKVGRLVDTKFPS
NTESRWRLGLYAAGRASQIRRLRAPRPQVERGGDVG
>MT2969 conserved hypothetical protein
MSAEDLEKYETEMELSLYREYKDIVGQFSYVVETERRFYLANSVEMVPRN
TDGEVYFELRLADAWVWDMYRPARFVKQVRVVTFKDVNIEEVEKPELRLP
E
>MT3118 hypothetical protein
MAHSIVRTLLASGAATALIAIPTACSFSIGTSHSHSVSKAEVARQITAKM
TDAAGNKPESVTCPSDLPAEVGAELNCEMKIKDRTFNVNVTVTSVDGSDV
KFDMVETVDKNQVANIISDKLFQRVGARPDSVTCPDNLKGVEGAKLRCRL
TDGSKTYGISVIVTSVDAGDVNFDFKVDDHPE
>MT2774 conserved hypothetical protein
MVAQITEGTAFDKHGRPFRRRNPRPAIVVVAFLVVVTCVMWTLALTRPPD
VREAAVCNPPPQPAGSAPTNLGEQVSRTDMTDVAPAKLSDTKVHVLNASG
RGGQAADIAGALQDLGFAQPTAANDPIYAGTRLDCQGQIRFGTAGQATAA
ALWLVAPCTELYHDSRADDSVDLALGTDFTTLAHNDDIDAVLANLRPGAT
EPSDPALLAKIHANSC
>MT1312 hypothetical protein
MGEVAVRRKVRRLTLAVSALVALFPAVAGCSDSGDNKPGATIPSTPANAE
GRHGPFFPQCGGVSDQTVTELTRVTGLVNTAKNSVGCQWLAGGGILGPHF
SFSWYRGSPIGRERKTEELSRASVEDINIDGHSGFIAIGNEPSLGDSLCE
VGIQFSDDFIEWSVSFSQKPFPLPCDIAKELTRQSIANSK
>MT2174 hypothetical protein
MLTPVTSMTRFTLMSAPERVTGLSGQRYGEVLLVTPGEAGPQATVYNSFP
LNDCPAELWSALDPQALATEHKAATALLNGPRYWLMNAIEKAPQGPPVTK
TFGGIEMLQQATVLLSSMNPAPYTVSQVSRNTVFVFNAGEEVYELQDPKG
QRWVMQTWSQVVDPNLSRADLPKLGERLNLPAGWSYHTRVLTSELRVDTT
NREARVLQDDLTNSYSLVTA
>MT0085.1 hypothetical protein
MSVAVSVAAQKLRLALDMYEVGEQMQRMRLGRERPNADVVEIEAAIDAWR
MTRPGAEEGDSAGPTSTRFT
>MT0303 hypothetical protein
MSGTVMQIVRVAILADSRLTEMALPAELPLREILPAVQRLVVPSAQNGDG
GQADSGAAVQLSLAPVGGQPFSLDASLDTVGVVDGDLLVLQPVPAGPAAP
GIVEDIADAAMIFSTSRLKPWGIAHIQRGALAAVIAVALLATGLTVTYRV
ATGVLAGLLAVAGIAVASALAGLLITIRSPRSGIALSIAALVPIGAALAL
AVPGKFGPAQVLLGAAGVAAWSLIALMIPSAERERVVAFFTAAAVVGASV
ALAAGAQLLWQLPLLSIGCGLIVAALLVTIQAAQLSALWARFPLPVIPAP
GDPTPSAPPLRLLEDLPRRVRVSDAHQSGFIAAAVLLSVLGSVAIAVRPE
ALSVVGWYLVAATAAAATLRARVWDSAACKAWLLAQPYLVAGVLLVFYTA
TGRYVAAFGAVLVLAVLMLAWVVVALNPGIASPESYSLPLRRLLGLVAAG
LDVSLIPVMAYLVGLFAWVLNR
>MT0299 PPE family protein
MAAPIWMASPPEVHSALLSNGPGPGSLVAAATAWSQLSAEYASTAAELSG
LLGAVPGWAWQGPSAEWYVAAHLPYVAWLTQASADAAGAAAQHEAAAAAY
TTALAAMPTLAELAANHVIHTVLVATNFFGINTIPITLNEADYVRMWLQA
AAVMGLYQAASGAALASAPRTVPAPTVMNPGGGAASTVGAVNPWQWLLAL
LQQLWNAYTGFYGWMLQLIWQFLQDPIGNSIKIIIAFLTNPIQALITYGP
LLFALGYQIFFNLVGWPTWGMILSSPFLLPAGLGLGLAAIAFLPIVLAPA
VIPPASTPLAAAAVAAGSVWPAVSMAVTGAGTAGAATPAAGAAPSAGAAP
APAAPATASFAYAVGGSGDWGPSLGPTVGGRGGIKAPAATVPAAAAAAAT
RGQSRARRRRRSELRDYGDEFLDMDSDSGFGPSTGDHGAQASERGAGTLG
FAGTATKERRVRAVGLTALAGDEFGNGPRMPMVPGTWEQGSNEPEAPDGS
GRGGGDGLPHDSK
>MT2404 PE family protein
MSHVTAAPNVLAASAGELAAIGSTMRAANAAAAAPTAGVLAAGGDDVSAG
IAALFGARAQAYQAISAQAALFHDRFVQILQEGAAAYAMAEAANALPLQK
AQGVVSELAQDRTGGTGTGQSRGAGGFGGVGQAGGKGWDGGPIGNGQVGE
QHGAGQLGSTDGNPGVAGAAHGSGVSASHGSGATGAAGVADPGGSGAGVG
SAAGNGTGAGSADAVGGAGTGRDIVGSVRGDGGVGMASGDGGLSTGAAGA
SAEGGLMPGFGGAPWVGGHWGLGGEGHSGAIGGVGEQVAPAVATAPAVSP
ATTSAVAAESGSTPATKAQAMHATTNPGNAAHQGNPADPGNSARRADGGR
DEQLLLLPLTSLRGLRHTLKKLSGLRARNGLLTASGDNASGSGRPWDRDQ
LLRALGLRPPGHE
>MT1843 conserved hypothetical protein
MDQQSTRTDITVNVDGFWMLQALLDIRHVAPELRCRPYVSTDSNDWLNEH
PGMAVMREQGIVVNDAVNEQVAARMKVLAAPDLEVVALLSRGKLLYGVID
DENQPPGSRDIPDNEFRVVLARRGQHWVSAVRVGNDITVDDVTVSDSASI
AALVMDGLESIHHADPAAINAVNVPMEEMLEATKSWQESGFNVFSGGDLR
RMGISAATVAALGQALSDPAAEVAVYARQYRDDAKGPSASVLSLKDGSGG
RIALYQQARTAGSGEAWLAICPATPQLVQVGVKTVLDTLPYGEWKTHSRV
>MT0772.1 PE_PGRS family protein
MSFVLAMPEVLGSAATDLAALGSVLGAADAAAAATTTGIVAAAQDEVSAA
IAALFSAHGRAYQVASAQAAAVHAQFVEALSAGAGAYASAEAAGAAVLAN
PAQSVQQDLLAAVNAQSVALTGRPLIGNGANGAPGTGANGAPGGWLLGNG
GAGGSAAAGSGLPGGAGGAAGLFGTGGAGGAGGSSTVGDGGAGGAGGSGG
WLLGTGGVGGVGGLGAGAGGAGGVGGAGGLLGAGGHGGAGGLGAVTGGVG
GAGGAGGLLAGLMAGPGGAGGTGGRGFLNDGGVGGAGGNAGLLFGAGGTG
GSGGAGLGGDGGAGGAGGNAGVLFGNAGSGGTGGFGDTDGGAGGAGGDAG
WLGSGGVGGAGGFGETGDGGVGGAGGKAGLLIGNGGAGGAGGQGAVTGGT
GGAGGDGVLIGNGGNAGIGGTGPTAGDTGAGGISGLLLGADGFNAPASAS
PLHTLKQQALAAINAPTQTLTGRPLIGNGTPGAVGSGATGAPGGWLLGDG
GAGGSGAAGSGAPGGAGGAAGLWGTGGAGGAGGSSAGGGGAGGAGGAGGW
LLGDGGAGGIGGASTVLGGTGGGGGVGGLWGAGGAGGAGGTGLVGGDGGA
GGAGGTGGLLAGLIGAGGGHGGTGGLSTNGDGGVGGAGGNAGMLAGPGGA
GGAGGDGENLDTGGDGGAGGSAGLLFGSGGAGGAGGFGFLGGDGGAGGNA
GLLLSSGGAGGFGGFGTAGGVGGAGGNAGWLGFGGAGGVGGSAGLIGTGG
NGGNGGTGANAGSPGTGGAGGLLLGQNGLNGLP
>MT0906 conserved hypothetical protein
MRELKVVGLDADGKNIICQGAIPSEQFKLPVDDRLRAALRDDSVQPEQAQ
LDIEVTNVLSPKEIQARIRAGASVEQVAAASGSDIARIRRFAHPVLLERS
RAAELATAAHPVLADGPAVLTMQETVAAALVARGXNPDSLTWDAWRNEDS
RWTVQLAWKAGRSDNLAHFRFTPGAHGGTATAIDDTAHELINPTFNRPLR
PLAPVAHLDFDEPEPAQPTLTVPSAQPVSNRRGKPAIPAWEDVLLGVRSG
GRR
>MT0785 steroid isomerase, putative
MTQTTQSPALIASQSSWRCVQAHDREGWLALMADDVVIEDPIGKSVTNPD
GSGIKGKEAVGAFFDTHIAANRLTVTCEETFPSSSPDEIAHILVLHSEFD
GGFTSEVRGVFTYRVNKAGLITNMRGYWNLDMMTFGNQE
>MT3106 PPE family protein
MTAPVWLASPPEVHSALLSAGPGPGSLQAAAAGWSALSAEYAAVAQELSV
VVAAVGAGVWQGPSAELFVAAYVPYVAWLVQASADSAAAAGEHEAAAAGY
VCALAEMPTLPELAANHLTHAVLVATNFFGINTIPIALNEADYVRMWVQA
ATVMSAYEAVVGAALVATPHTGPAPVIVKPGANEASNAVAAATITPFPFG
ELAKFLEMAAQAFTEVGELIMKSAEAWAVGFVELITGLVNFEPWLVLTGM
IDMFFATVGFALGVFVLVPLLEFAVVLELAILSIGWIISNIFGAIPVLAG
PLLGALAAAVVPGVAGVTGLAGLAAVPAVGAAAGAPAALVGSVAPVSGGV
VSPQARLVSAVEPAPASTSVSVLASDRGAGALGFVGTAGKESVGQPAGLT
VLADEFGDGAPVPMLPGSWGPDLVGVAGDGGLVSV
>MT1797 hypothetical protein
MYRYQVRVQQRRSEMNRWVATRSRRHTYQWITDHKSPRDHYRHISELRTS
IATSSPGRCDMSPIPRIVSVSLAWAAAIGLMVPIGLAPPAMAAPCSGDAA
NAPPPPSAIVTDPGATALGPVRPGHGPIPTGRKPRGANDRAPLPKLGPLI
SALLNPGARNAAPLQQQALVPRANPGPNPAPNPPATGPQPPNATQLTPNP
APAPDPAPAAAPDPGATLAGATTSLAEWVTGPDSPNKTLERFGISGTDLG
IPWDNGDPANRQVLMIFGDTFGYCAVDGHQWRYNTLFRSQDRDLGNGVHV
TSGDASNRYSGSPVRQPGFSKQLINSIKWARDETGIIPTAGIAVGKTQYV
NFMSIRNWGRDGEWTTNYSGIAVSKDNGQTWGVFPGTIRASGPDSGGKAR
FVPGNENFQMGAYLKSNDGYLYSFGTPPGRGGSAYLARVPQRFVPDLTKY
QYWNGDSNSWVPNKPDAATPVIPGPVGEMSVQYNTYLKQYLALYTNGMND
VVARTAPAPQGPWSAEQMLVSSWQMPGGIYAPMMHPWSTGKDVYFNLSLW
SAYNVMLMHTVLP
>MT1975 hypothetical protein
MRNRAPLESEPNHPRHSRPLGVEAGTLGAVMDPADVINPTSTRDAALARV
LAYRQRVRARPLLIRATLAVVGGGLFVVSLPMIVLLPELGIPALLVAFRL
LAVEAQWAVRAYAWTDWRFTQLREWFHRQVLVTRAAILVGLFLAAVALVW
LLVYEF
>MT3882 conserved hypothetical protein
MPPESRPGPDSPPTDELACAEAALQVLQQVLHTIGRQDKAKQTPCPGYDV
KKLTEHLLNSIMVLGGMVGAEFSLRADIDSVERLVSGAARSALDAWHRHG
LEGDVSLGPGSMSAKVAVSVFSVEFLVHAWDYAVAVGSELKAADSLAEYV
LELARKLIKPEERSVAGFNEPVDVPEDGGALERLIAFTGRNPAR
>MT0335 hypothetical protein
MGRHELARDRRKSSAVLAAVLAPAAVFFATGGDVSTLAARADANPVLGDD
APCCVQIVPVAPLAFSSQISGGEIGTGLAASQFASASRWRIVSRYLPVGV
APEQGLQVKTVLTARSISAAFPEIREIGGVRPDALRWHPNGLALDVMVPN
PGTAEGIALGNEIVAFVLKNATRFGMQDVIWRGAYYTPNGARTTGAGHYD
HIHITTVGGGYPTGEELYIR
>MT3304 hypothetical protein
MPMEGATVEVKIGITDSPRELVFSSAQTPSEVEELVSNALRDDSGLLTLT
DERGRRFLIHTARIAYVEIGVADARRVGFGVGVDAAAGSAGKVATSG
>MT2790 conserved hypothetical protein
MTRDLAPALQALSPLLGSWAGRGAGKYPTIRPFEYLEEVVFAHVGKPFLT
YTQQTRAVADGKPLHSETGYLRVCRPGCVELVLAHPSGITEIEVGTYSVT
GDVIELELSTRADGSIGLAPTAKEVTALDRSYRIDGDELSYSLQMRAVGQ
PLQDHLAAVLHRQR
>MT0025 hypothetical protein
MATAGLADRASSGRLWANLCDRQVHHDPGHRGDLESFVAGQQSRIVRELP
GIDLAAGNPTQHVVGCRELFALSVLAAGKLDAGPYAVLHPMLDRVLSTPA
LL
>MT0434 hypothetical protein
MRLHDASAAAPESRMHIARHGEAVNRRQMFIGITGLLLAVIGLMALWFPV
YLDQYDAYGIKVTCGSGWRSNLTQALYADGNDNTQALVTRCDTALLVRRA
WAIPSVALGWLLVTGFLVMWVHNDQHQGQSYPGYRA
>MT3270.1 hypothetical protein
MRQRWAALALVDALFNQLTPSSRAATYPRPCDASTPPAPRITQRTGQIVG
FQNLHDLLGRLHVTPFRELNNRRGAVGRGCPGAAVPKPDMANVKRSDHGS
SWAGATRHPAAWAPVCRRRSQTATTRPGSPTANALAR
>MT0270 hypothetical protein
MTLQTLSSGRATTTLLGLNVAARDATTPPPAVSRR
>MT3867 conserved hypothetical protein
MTSNPSSSADQPLSGTTVPGSVPGKAPEEPPVKFTRAAAVWSALIVGFLI
LILLLIFIAQNTASAQFAFFGWRWSLPLGVAILLAAVGGGLITVFAGTAR
ILQLRRAAKKTHAAALR
>MT2782 conserved hypothetical protein
MWDSRVMKHGLRLGFNGQFDDFDDFDDKGRPVLITAAAPSYEVEHRTRVR
KYLTLMAFRVPALILAAIAYGAWHNGLISLLIVAASVPLPWMAVLIANDR
PPRRADEPRRFDVARRRIPLFPTAERPALEPRRQPAERSAPRGFADHG
>MT1147 hypothetical protein
MSTERFHLCKHGAAIRMCSRMADEPRLEAGAHPFEEGRDKAPELRATQMD
HVRFTEGRRERNRDRLERSQQFRQPGR
>MT1126 acyl-(acyl-carrier-protein) desaturase, putative
MAQKPVADALTLELEPVVEANMTRHLDTEDIWFAHDYVPFDQGENFAFLG
GRDWDPSQSTLPRTITDACEILLILKDNLAGHHRELVEHFILEDWWGRWL
GRWTAEEHLHAIALREYLVVTREVDPVANEDVRVQHVMKGYRAEKYTQVE
TLVYMAFYERCGAVFCRNLAAQIEEPILAGLIDRIARDEVRHEEFFANLV
THCLDYTRDETIAAIAARAADLDVLGADIEAYRDKLQNVADAGIFGKPQL
RQLISDRITAWGLAGEPSLKQFVTG
>MT0492 hypothetical protein
MSGPADPVTTCLLCCTSSTESLLASAADRLDQPAVDLTQMTVLQV
>MT2109 hypothetical protein
MSAGMTRRARPQPCGWCGRDVTDVGMGRRRRYCRQSCRQRAYEQRAMLTR
GEVRALPADAVVLSADDAADLSDRVYQVRCAAEDVVTALDEGAAATELRD
LCDELIRAARAADGWRRAGA
>MT1745 PPE family protein
MDFGALPPEVNSGRMYCGPGSAPMVAAASAWNGLAAELSVAAVGYERVIT
TLQTEEWLGPASTLMVEAVAPYVAWMRATAIQAEQAASQARAAAAAYETA
FAAIVPPPLIAANRARLTSLVTHNVFGQNTASIAATEAQYAEMWAQDAMA
MYGYAGSSATATKVTPFAPPPNTTSPSAAATQLSAVAKAAGTSAGAAQSA
IAELIAHLPNTLLGLTSPLSSALTAAATPGWLEWFINWYLPISQLFYNTV
GLPYFAIGIGNSLITSWRALGWIGPEAAEAAAAAPAAVGAAVGGTGPVSA
GLGNAATIGKLSLPPNWAGASPSLAPTVGSASAPLVSDIVEQPEAGAAGN
LLGGMPLAGSGTGMGGAGPRYGFRVTVMSRPPFAG
>MT3921 hypothetical protein
MRGYDAVHCASAEQLDDDEVVAAAADQRLLTAWLELGMATYDTNQRATPR
>MT2165 hypothetical protein
MSTARGRRVGLPRMTFTTVVLNPPLATLEVAVLDADRLRRAFRRIAGAAL
GKRLRELDRKDAKGHKGVPRAPKTPSPTANRRISDGPARRLTRCCRQS
>MT3463 conserved hypothetical protein
MTVRAVFRRTVGAQWPILLVGSIFAVGFVLAGANFWRRGALLIGIGVGVA
AVLRLVLSEERAGLLVVRSKGIDFVTTVTVAAAMVYIASTIDPLGTG
>MT1760 hypothetical protein
MLDTAYRDHLERFVRKPPEPPALPAFSAINPPPKEDQPTQ
>MT3331 hypothetical protein
MVTRLSASDASFYQLENTATPMYVGLLLILRRPRAGLSYEALLETVEQRL
PQIPRYRQKVQEVKLGLARPVWIDDRDFDITYHVRRSALPSPGSDEQLHE
LIARLAARPLDKSRPLWEMYLVEGLEKNRIALYTKSHQALINGVTALAIG
HVIADRTRRPPAFPEDIWVPERDPGTTRLLLRAVGDWLVRPGAQLQAVGS
AVAGLVTNSGQLVETGRKVLDIARTVARGTAPSSPLNATVSRNRRFTVAR
ASLDDYRTVRARYDCDSTTWC
>MT2601 conserved hypothetical protein
MSVSRRDVLKFAAATPGVLGLGVVASSLRAAPASAGSLGTLLDYAAGVIP
ASQIRAAGAVGAIRYVSDRRPGGAWMLGKPIQLSEARDLSGNGLKIVSCY
QYGKGSTADWLGGASAGVQHARRGSELHAAAGGPTSAPIYASIDDNPSYE
QYKNQIVPYLRSWESVIGHQRTGVYANSKTIDWAVNDGLGSYFWQHNWGS
PKGYTHPAAHLHQVEIDKRKVGGVGVDVNQILKPQFGQWA
>MT1484 hypothetical protein
MQMSASNAFVEGFADFWKAPSPDRLTDHLHPDVVLVRPLSPPRHGLGAAQ
REFTRILGLLPDLHGEVDRWSQAGDVVFIEFRLIARLGSEVVEWPVVDRF
LLRGDKAVERVSYFDSLPLLIKVVKHPSAWRGWLTTMRSRA
>MT1555.1 hypothetical protein
MQSGQNILAKVCNLIEQSRLSSTRCLQFRITNTSRPRQLRWSEFKRFCDI
FNMVLGKARMGRDPGRPVRDERRIVSCEIIASDHIGLAAARLLAKRYRGR
SVSGFVLMIKSASVHEIDSWSSPSVAMSIGVALCSYPHYAAARTSPPNRD
WGEDTTRSRPVTGLLAG
>MT1866 PE_PGRS family protein
MTRRSVAAPRRGHRVLTRCQMSFVVTIPEALAAVATDLAGIGSTIGTANA
AAAVPTTTVLAAAADEVSAAMAALFSGHAQAYQALSAQAALFHEQFVRAL
TAGAGSYAAAEAASAAPLEGVLDVINAPALALLGRPLIGNGANGAPGTGA
NGGDGGILIGNGGAGGSGAAGMPGGNGGAAGLFGNGGAGGAGGNVASGTA
GFGGAGGAGGNGGLLFGAGGAGGVGGLAADAGDGGAGGDGGLFFGVGGAG
GAGGTGTNVTGGAGGAGGNGGLLFGAGGVGGVGGDGVAFLGTAPGGPGGA
GGAGGLFGVGGAGGAGGIGLVGNGGAGGSGGSALLWGDGGAGGAGGVGST
TGGAGGAGGNAGLLVGAGGAGGAGALGGGATGVGGAGGNGGTAGLLFGAG
GAGGAGGFGFGGAGGAGGLGGKAGLIGDGGDGGAGGNGTGAKGGDGGAGG
GAILVGNGGNGGNAGSGTPNGSAGTGGAGGLLGKNGMNGLP
>MT0364 hypothetical protein
MVMILQHPCALRHGVDLHPRLLVAPVRPDSLRSNWARAPFGTMPLPKLID
GQDHSADFINLELIDSPTLPTCERIAVLSQSGVNLVMQRWVYHSTRLAVP
THTYSDSTVGPFDEADLIEEWVTDRVDDGADPQAAEHECASWLDERISGR
TRRALLSDRQHASSIRREARSHRKSVKLAD
>MT0853 hypothetical protein
MGLAPVMLEMAEQSGLSRAIEEQMDPPFDPGGIRSDQPGRQSRSSPRWPT
GAGRHR
>MT1689 PE_PGRS family protein
MSFLLVEPDLVTAAAANLAGIRSALSEAAAAASTPTTALASAGADEVSAA
VSRLFGAYGQQFQALNARAATFHAEFVSLLNGGAAAYTGAEAASVSSMQA
LLDAVNAPTQTLLGRPLIGNGADGVAGTGSNAGGNGGPGGILYGNGGNGG
AGGNGGAAGLIGNGGAGGAGGAGGAGGAGGAGGTGGLLYGNGGAGGNGGS
AAAAGGAGGNALLFGNGGNGGSGASGGAAGHAGTIFGNGGNAGAGSGLAG
ADGGLFGNGGDGGSSTSKAGGAGGNALFGNGGDGGSSTVAAGGAGGNTLV
GNGGAGGAGGTSGLTGSGVAGGAGGSVGLWGSGGAGGDGGAATSLLGVGM
NAGAGGAGGNAGLLYGNGGAGGAGGNGGDTTVPLFDSGVGGAGGAGGNAS
LFGNGGTGGVGGKGGTSSDLASATSGAGGAGGAGGVGGLLYGNGGNGGAG
GIGGAAINILANAGAGGAGGAAGSSFIGNGGNGGAGGAGGAAALFSSGVG
GAGGSGGTALLLGSGGAGGNGGTGGANSGSLFASPGGTGGAGGHGGAGGL
IWGNGGAGGNGGNGGTTADGALEGGTGGIGGTGGSAIAFGNGGQGGAGGT
GGDHSGGNGIGGKGGASGNGGNAGQVLGDGGTGGTGGAGGAGSGTKAGGT
GSDGGHGGNATLIGNGGDGGAGGAGGAGSPAGAPGNGGTGGTGGVLFGQS
GSSGPPGAAALAFPSLSSSVPILGPYEDLIANTVANLASIGNTWLADPAP
FLQQYLANQFGYGQLTLTALTDATRDFAIGLAGIPPSLQSALQALAAGDV
SGAVTDVLGAVVKVFVSGVDASDLSNILLLGPVGDLFPILSIPGAMSQNF
TNVVMTVTDTTIAFSIDTTNLTGVMTFGLPLAMTLNAVGSPITTAIAFAE
STTAFVSAVQAGNLQAAAAALVGAPANVANGFLNGEARLPLALPTSATGG
IPVTVEVPVGGILAPLQPFQATAVIPVIGPVTVTLEGTPAGGIVPALVNY
APTQLAQAIAP
>MT1260 hypothetical protein
MASAQFTSTQFAELLNGRRARRFGYRMVANRSRILGFYRLAMVFRHRAAL
HSRYRWLARWRKEVVMADPGSVGHVFRRAFSWLPAQFTSQSDAPVGAPRQ
FRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQSRARA
ALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSGGSSQGPPDGAAAGFG
DRFADGDGGNRGRQSRVRR
>MT0600 hypothetical protein
MAADPQCTRCKQTIEPGWLYITAHRRGQAGIVDDGAVLIHVPGECPHPGE
HVPRS
>MT1813 hypothetical protein
MGPGICLLGSGSALGWIIDHATTDETHHHTLFTDPMPPTAKNAACPLGPH
>MT1166 hypothetical protein
MVRAYPESHPHDDRGGPDEDIRRRAILPPAGPCRPMSPE
>MT2113 hypothetical protein
MTGIWLTPLIRPRRWRFCGYPRGLKGQRLDIYDVDDLAINKVRVGAALAE
GPGHVGRRIGRQWHVPQETAHGDRGQPGSSERPDRRHQQQQPHRRGDQTR
NKHQDRGNGDQRAVTQCASWFRQAGSQPQEPSAELPATERRQQTEPEDER
RQQHQQSPAKPDRRRQRENHRELDDGVAEQQPRHHVTPTSAG
>MT1821.1 hypothetical protein
MPLSGVVQLVERGLCGAPPKSDEDTPGGVENAARVRAARDCSGPSIRRHG
LATSC
>MT3988 conserved hypothetical protein
MAEMKTDAATLAQEAGNFERISGDLKTQIDQVESTAGSLQGQWRGAAGTA
AQAAVVRFQEAANKQKQELDEISTNIRQAGVQYSRADEEQQQALSSQMGF
>MT0264 conserved hypothetical protein
MTALKARFGDNPATRRIVIDADRILTDIELLDTDVSELDLERAAVPQPSE
KIAIPDTEYDREFWRDVDDEGVGGHRY
>MT1848 hypothetical protein
MSVKSKNGRLAARVLVALAALFAMIALTGSACLAEGPPLGRNPQGAPAPV
GGTVIVAPMHSGV
>MT2589.1 hypothetical protein
MNEGSATATNLDRQEWRTEASLDVGRPENAPRNETLVRTKILRSSGSTRE
HVVRPVGMGAGKVLQPGDGRRRVQTWLLSAG
>MT1352 conserved hypothetical protein
MSAPMIGMVVLVVVLGLAVLALSYRLWKLRQGGTAGIMRDIPAVGGHGWR
HGVIRYRGGEAAFYRLSSLRLWPDRRLSRRGVEIISRRAPRGDEFDIMTD
EIVVVELCDSTQDRRVGYEIALDRGALTAFLSWLESRPSPRARRRSM
>MT0030.1 hypothetical protein
MHSVSLSEAAMETDAETLAEAILLTADVSCLKALLEVRNEIVAAGHTPSA
QVPTTDDLNVAIEKLLAHQLRRRNR
>MT2166 PE family protein
MSFVNVDPFGMLAAAATLESLGSHMAVSNAAVASVTTKVPPPAADYVSKK
LSLFFSSHGQQYQVQAARGTAFHRKLVRTLANGALAYEEVEIANNEGF
>MT1170 hypothetical protein
MGPDQPGRQGGFGRRGRPHLAVRVTVNASLSVQASKRIACGVDDGVVVDE
GTPHPARDGFPDEIAGPRAFAEKQEPVWRARCIVSAPWMGLAGVPSASTV
ALPTFVGLIPMPGRSVALRWAQRASFPGEPMWRM
>MT0543 hypothetical protein
MREAAQALGFEVLDQRDLVRNLRTHYSRVFEELEARRLELEGKSSQEYLD
KMRVGLKNWVEAADNGHSRVGHPTFPRTRLTPICQLPTAAIDSTAGRRRY
R
>MT3276 hypothetical protein
MAVTLDRAVEASEIVDALKPFGVTQVDVAAVIQVSDRAVRGWRTGDIRPE
RYDRLAQLRDLVLLLSDSLTPRGVGQWLHAKNRLLDGQRPVDLLAKDRYE
DVRSAAESFIDGAYV
>MT0087 conserved hypothetical protein
MVELDRDEAMRLLASVDHGRVVFTRAALPAIRPVNHLVVDGRVIGRTRLT
AKVSVAVRSSADAGVVVAYEADDLDPRRRTGWSVVVTGLATEVSDPEQVA
RYQRLLHPWVNMAMDTVVAIEPEIVTGIRIVADSRTP
>MT0116.1 hypothetical protein
MAYQRHVYRLAHTSPGEVAGEGSPDPFGLRAMNADAVSDLA
>MT3753 conserved hypothetical protein
MTHDWLLVETLGDEPAVVARGRELKKLVPITTFLRRSPYLAAVRTAIAET
LQTGQSLTSITPKHDRVIRTEPVIMTDGRMHGVQVWSGPTDAEPPDRPIP
GPLKWDLTRGVATDTPESLTNSGKNPEVEITYGRAFAEDLPARELNPNET
QVLAMAVKAKPGKTLCSIWDLTDWQGTPIRIGFVARSALEPGPNGRDHLV
ARAMNWRAETKAPAVPVDDLAQRILIGLAQAGVHRALVDLKTWTLLKWLD
QPCSFYDWRRSAADGPRLHPDDQHVIDAMTRDLANGSASHVLRLPGHDVD
WVPVHVTVNRIELEPDTFAGLVALRLPTDEELADAGLPKATDVTT
>MT4004 conserved hypothetical protein
MLTTTVDGLWVLQAVTGVEQTCPELGLRPLLPRLDTAERALRHPVAAELM
AVGALDQAGNADPMVREWLTVLLRRDLGLLVTIGVPGGEPTRAAICRFAT
WWVVLERHGNLVRLYPAGTASDEAGAGELVVGQVERLCGVAEAAPLRPVT
VDADELLHAVRDAGTLRSYLLSQRLDVDQLQMVTMAADPTRSAHATLVAL
QAGVGPEKSARILVGDSTVAIVDTAAGRICVESVTSGQRRYQVLSPGSRS
DIGGAVQRLIRRLPAGDEWYSYRRVV
>MT1266 hypothetical protein
MYSGAMKSISVGELRQNPAPMIADLERGEPYALTRHNHRIGTIIPAVSSA
TLIPRKA
>MT2015.2 hypothetical protein
MMRLSAAKAAEHIAPRYRILARYIIANICQFALLFTDQLSKQSRIGYGHQ
RSLPRTLTNVTDRRPVRGASVRHLTQRPSIESSWCAAPW
>MT3540 hypothetical protein
MADASVVARLRSWALAVWHFVSNAPLTYAWLVVLVITTIIQNNLTGSQLH
FVLLHRSTNIAELGRDPLEVLFSSLLWIDGRNLEPYLLLFTLFLAPAEHW
LGHLRWLTVGLTAHIGATYLSEGLLYLAIQHRDASERMVHARDIGVSYFL
VGVMAVLTYHIAKPWRWGYLGVLLVIFGFPLIAMDKAELDFTAVGHFASI
LIGLLFYPMARERDGRLWNPARIKSLLHRRGTRGRRA
>MT2593 hypothetical protein
MTADWVVTFTFDADPSMETMDAWETQLEGFDALVSRVPGHGIDVTVYAPG
DWSVFDALAKMAGEVMPVVQAKSPIAVQIISEPEHRLRAEAFTTPELMSA
AEIADELGVSRQRVHQLRSTAGFPAPLADLRGGAVWDAAAVRRFAETWER
KPGRPHTGTAKFAYSWAVGPAVGRSGKAPNVRWRVENPDKIRFVLRNIGD
DIAEDVEIDLSRIDAITRNVPKKTVIRPGEGLNMVLIAAWGHPLPNQLYV
RWAGQDEWAAVPLHPAH
>MT0305 hypothetical protein
MNPIPSWPGRGRVTLVLLAVVPVALAYPWQSTRDYVLLGVAAAVVIGLFG
FWRGLYFTTIARRGLAILRRRRRIAEPATCTRTTVLVWVGPPASDTNVLP
LTLIARYLDRYGIRADTIRITSRVTASGDCRTWVGLTVVADDNLAALQAR
SARIPLQETAQVAARRLADHLREIGWEAGTAAPDEIPALVAADSRETWRG
MRHTDSDYVAAYRVSADAELPDTLPAIRSRPAQETWIALEIAYAAGSSTR
YTVAAACALRTDWRPGGTAPVAGLLPQHGNHVPALTALDPRSTRRLDGHT
DAPADLLTRLHWPTPTAGAHRAPLTNAVSRT
>MT2315 hypothetical protein
MRYRDLETVAAPTINVLRVWPEIVGAIVLLVIAAMGIGHGLRPSPEPVPA
PQKQLGCVRFALIFGLTAINPATFVYFTAVAVTLARALRATTAIAVVVGV
ALASLLWQLLLVSAGAFLRSRATARVRRMTVLAGNAVIAAFGAVLVVHAF
A
>MT3709 conserved hypothetical protein
MTVLSRGARVRRGGRRRGWVLLTALLVLAIGASSALVFTDRVELLKLAVL
LALWAAVAGAFVSVLYRRQSDVDQARVRDLKLVYDLQLDREISARREYEL
TLESQLRRELASELRAPAADEVAALRAELAALRTSLEILFDADLEHRPAL
GTVEKEARAARALDGESPPADWVSSDRVMAVRGGDGASRTDEASIIDVPE
VGVPPVSGGPRHYEAPPPPQPEPLFEPRHRPPPLPPQQERPVWQPVTSHG
QWLPAETPGSQWASVEPETTPAAPPPGRRRRARHASPADQAYNPPAYVEL
AAQYGESGRRSRHSAEHRDHDIGGSGAGTGERPPSPPMAPPPPAEPTRRH
RTADTPPDDSGGLHARDPLTGGQSVADLMARLQVESTGGGRRRRRGE
>MT3149.1 hypothetical protein
MGPTRDSRQPAERATKRRYRFARGLMAGLLGTMALTSGGGVAREDPLEPD
PLAPIIDDSR
>MT2865 hypothetical protein
MMRWPTAWLLALVCVMATGCGPSGHGTRAGEEGPLSPEKVAELENPLRAK
PPLEDAKDQYRAAVTQLANAITALVPGLTWRTDMDTWTGCGGEYEWTRAK
AAYFMIVFSGPIPDDKWLQAVQIVKDGVEQFGATGFGVMKNKPADHDVYF
AGHGGVEFKCSTQKAAVLTAQSDCRISRTDTPKPSPTP
>MT2455 hypothetical protein
MFVIRLADGEEVHGECDELTINPATGVLTVCRVDGFEETTTHYSPSAWRS
VTHRKRGVGVRPSLVSTAQ
>MT3808 hypothetical protein
MRIAAAVVSIGLAVIAGFAVPVADAHPSEPGVVSYAVLGKGSVGNIVGAP
MGWEAVFTRPFQAFWVELPACNNWVDIGLPEVYDDPDLASFNGATTQTSA
TDQTHLVKQAVGVFASNDAADRAFHRVVDRTVGCSGQTTAIHLDDGTTQV
WSFAGGPSTGTDEAWTKQEAGTDRRCFVQTRLRENVLLQAKVCQSGNAGP
AVNVLAGAMQNTLG
>MT2554.2 hypothetical protein
MLAPLAGTLALMGIEFLSCPWTRTAPRSESAWTLATPPRVWRCTPIARVT
RPLTQFTGVPFAVLAAALTALHLTAWRTAFGFSERWENGQPSGSWRLRNA
TPPTRS
>MT1856.1 PPE family protein
MLAAAAAWDALAAELYSAAASYGSTIEGLTVAPWMGPSSITMAAAVAPYV
AWISVTAGQAEQAGAQAKIAAGVYETAFAATVPPPVIEANRALLMSLVAT
NIFGQNTPAIAATEAHYAEMWAQDAAAMYGYAGSSATASQLAPFSEPPQT
TNPSATAAQSAVVAQAAGAAASSDITAQLSQLISLLPSTLQSLATTATAT
SASAGWDTVLQSITTILANLTGPYSIIGLGAIPGGWWLTFGQILGLAQNA
PGVAALLGPKAAAGALSPLAPLRGGYIGDITPLGGGATGGIARAIYVGSL
SVPQGWAEAAPVMRAVASVLPGTGAAPALAAEAPGALFGEMALSSLAGRA
LAGTAVRSGAGAARVAGGSVTEDVASTTTIIVIPAD
>MT4025 hypothetical protein
MEYCIAGDDGSAGIWNRPFDVDLDGDGRLDAIGLDLDGDGLRDDALADFD
GDDVADHAVFDVDNDGTPESYFIDDGSGTWAVAVDRGGQLRWYGLDGVEH
TGGPLVDFDGFGGLDDRLLDTDGDGLADRVLCAGEQRVTGYVDTDGDGRW
DVRLTDTDGDGTADGASSL
>MT2297 hypothetical protein
MLRCRRGAGYGSVVVVGERPGFQSDSAARQTAPPVRPMTSDQLPATKADL
YAAVDAMRADMRELLEQISTLIREATQK
>MT1025.1 hypothetical protein
MVETASTALGAILAGRNAVVPPRCPRRGAGLVVPVSAVATAIWREA
>MT3058.2 hypothetical protein
MPIASAALAPGAAGRHHRAKCRSQARRLRRARRVGTIGLSADRKRGASAG
RGGSAPSG
>MT1069 PE family protein
MSFLKTVPEELTAAAAQLGTIGAAMAAQNAAAAAPTTAIAPAALDEVSAL
QAALFTAYGTFYQQVSAEAQAMHDMFVNTLGISAGTYGVTESLNSSAAAS
PLSGITGEASAIIQATTGLFPPELSGGIGNILNIGAGNWASATSTLIGLA
GGGLLPAEEAAEAASALGGEAALGELGALGAAEAALGEAGIAAGLGSASA
IGMLSVPPAWAGQATLVSTTSTLPGAGWTAAAPQAAAGTFIPGMPGVASA
ARNSAGFGAPRYGVKPIVMPKPATV
>MT3958 hypothetical protein
MVFDKPTVSCLSVSHFQRLFRVAQHNPMPVEIRRDYTHTQHLDHRDSGRR
RLTSSFAPPAPAATTQRHGSS
>MT3033 hypothetical protein
MSQFKTLKYRNDSPERFDSIEHARTWPGHMGAAMANAYSANPNPFGVSPQ
PPKPATAAWINPPTPDPK
>MT3532.1 hypothetical protein
MSFGFPTFSQNRFTEQYSGLCPIAPGRGAGLQPCRRDCPVARWLVADHPV
FGSDCRCRMMVGVNRVRIGRHELTGA
>MT2122 DNA-binding protein, CopG family
MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAERE
ASHAETTTQAVRDEDREWEGTVGDGLG
>MT3817 conserved hypothetical protein
MTWNSPTSSPWAERSPAAACSPDSPTSDFRRDQRSAGRYSHLACKMLISR
MSVRSASMSVMGDVFIGSEAITAGRLTRHELQRWYQPMFRGVYVSRRSVP
TLWDRTVGAWLATRRHGVIAGNAASALHGAQWVDVDVAIELISPTTRPQH
GLVIRRETLCDDEITRVVGLPVTTLARTAYDLGRHLSRGEAVARLDALMR
ATPFSRDDVLLLAKRHAGARGVRRLRDVLPLVDGGAASPKETWLRLLLID
AGLPVPTTQIPVVHRWRNVGVLDMGWEKYMVAAEYDGDQHRSDRGRYVKD
QRRLRKLAELGWIVIRVIAEDNPDDVVNRVRAALLARGWRP
>MT2627 DNA-binding protein, CopG family
MLVAYICHVKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQP
GPDPVDAFVGSFVGEADLSASVDDVVYGKHE
>MT0210 conserved hypothetical protein
MRNAWRLVVFDVLAPLATIAALAAIGVLLGWPLWWVSTCSVLVLLVVEGV
AINFWLLRRDSVTVGTDDDAPGLRLAVVFLCAAAISAAVVTGYLRWTTPD
RDFNRDSREVVHLATGMAETVASFSPSAPAAAVDRAAAMMVPEHAGGFKE
QYAKSSADLARRGVTAQAATLAAGVEAIGPSAASVAVILRVSQSIPGQPT
SQAARALRVTLTKRGSGWLVLDVTPINAR
>MT0472.1 hypothetical protein
MLQQLQRQMESERIVEFDQLGRGDVAQRRIQPAGPAPSNRRPGHRSLPAT
TPGMGTFSVLLVTGTTGTTPRSRRIAAALALSLLTITAGRRIFAALPRAG
SRSTCQISPRSIYAVRCKPPTATAGPLSWHASNAATSSVDKLTLGFMPQS
YPCSNR
>MT1625 conserved hypothetical protein
MVEIVAGKQRAPVAAGVYNVYTGELADTATPTAARMGLEPPRFCAQCGRR
MVVQVRPDGWWARCSRHGQVDSADLATQR
>MT0065 hypothetical protein
MITRYKPESGFVARSGGPDRKRPHDWIVWHFTHADNLPGIITAGRLLADS
AVTPTTEVAYNPVKELRRHKVVAPDSRYPASMASDHVPFYIAARSPMLYV
VCKGHSGYSGGAGPLVHLGVALGDIIDADLTWCASDGNAAASYTKFSRQV
DTLGTFVDFDLLCQRQWHNTDDDPNRQSRRAAEILVYGHVPFELVSYVCC
YNTETMTRVRTLLDPVGGVRKYVIKPGMYY
>MT2510 hypothetical protein
MRDEDDDEDPYAAHSSIGRWWLFGKRDDRGLGTLVCCVIGAS
>MT0640 hypothetical protein
MDDELRGLLARYARGELSADDARRAILRYPKWRVAEIDGELETVALDDGT
PMLIAESSASDGREYSGLELVRDIAPLVGGLSFDPDEPWGSAFRPGALPE
LQNWARTVELEDAVAKPGPGQRDLLYEGPWWVAVSPGTGRPAVHRADGLD
VITIMTAPDAAATFRRTERHRGLDVVRLGPALWGDLAKRSDFDGVRLNPL
RPLAQLWPPHVPAMLVAGCDPRPNAEPLPARTVAEIHLWLDQHGARQEKR
ELSNRATPVGEVTVARAWWNYDRREIAFTRVAPASDTEGLGSVPSRILCA
GKLRQSIQSKLAGLPRLTWRADAWHRQRAALAVGWALELEKLVCGERVPF
AALRTPEGAHLWHLEPQAFTARAIRKLRDRAASFR
>MT0159 hypothetical protein
MPATIGLPDGLVPAATGIIENRLFMIEGRQRLEQRQAGVIT
>MT2417 hypothetical protein
MSLGIGHSPTVKNYSNPAFLTKLDMTDLTGRLRRLSH
>MT1502 conserved hypothetical protein
MTSIACMWLSCPGMKLARPDVFHPRVVLAGWPQQPAGDGDDAGLVAALRH
RGLHAGWLSWDDPEIVHADLVILRATRDYPARLDEFLAWTTRVANLLNSR
PVVAWNVERRYLRDLMDRGVPTVPGEVYVPGEPVRLPRKGQVFVGPTIGT
GTRRCSARFAAEFVAQLHAAGQAVLVQPGGSGDETVLVFLGGEPSHAFTK
QADTWRQTEPDFEIWDVGAAAVAGAAAQVGVDPXELLYARAHITGGSRDP
RLLELQLVDPSLGWQWLDPDIRNLAQRDFALCVQSALERLGLGPFSHRRP
>MT0608 hypothetical protein
MAAVIVEPIFAKSVAAVVSISGVAITNDISLPRSGQPGVARRGKELCGYP
GDLPCGANLRDPNSNDPLLTLSSKGQNPKLYANELSTVHPRFPSTRANGQ
TSTVLAPRRYCRSSSRSRPGARTPLPSRDPAGEDKEWSAMWMSGHTPSST
SSWSCRRAV
>MT2123 hypothetical protein
MVVSVDELLTGIDDELVVVVPVSSSRSRTPLRPPVAPSEGVAADSVAVCR
GVRAVARARLVERLGALKPATMRAIENALTLILGLPTGPERGEAATHSPV
RWTGGRDP
>MT2830 hypothetical protein
MSLNIKSQRTVALVRELAARTGTNQTAAVEDAVARRLSELDREDRARAEA
RRAAAEQTLRDLDKLLSDDDKRLIRRHEVDLYDDSGLPR
>MT0394 hypothetical protein
MSVSNLRWSGDYVLIDVDASPTDPHAPHAKPEDIRFGLYGALAHPMESAA
LGSCGDAMAHVRDVVSPLSAPAGRLTGTVCLGPLKERSAVRGVYTYSPRD
RIPGTAAAYPAAFPVGMLPTNQNDAGLVVKTTSVSAWRADGMQLGKPQLG
DPVAFTGNGYMLLGLEVDAVPDRYRDDSAARGGPMMLLAAPTLPGRGLSP
ACATYGSSVLILPDALLDAVHISASLCTQGEINEALLYATVATAGTHAAL
WTSR
>MT0661 hypothetical protein
MSLWYKRMVDSMGWVLSSWHEVTGVDSGTWLAWAAWAALGLGVVALVVTK
RQIQRNRRLAAEQTRPYVAMFMEPHVADWHVIELVVRNFGRTAAYDVRFS
FPNPPTVAQYENAANGYADVVELRLPQELPMLAPGQEWRMVWDSALDRAE
IGRGIESRFPGTVTYYDRPEQPRRWRFWRRGRRPLETKVVLDWDALPPVA
RIELMTTHDLAKREKQKLELLRSLLTYFHYASKETRPDVFRSEIDRINRA
AAETQDRWRARQVEVPTEVSQRSEGQGPQPTRIPAG
>MT2738 hypothetical protein
MKHKTDIDEWLDTIEPNPADAHDASHLRRIIAAKEAVQTAESELRAAVNA
ARAAGDTWAAIGVALGITRQAAFQRFGPHSTASP
>MT2334.1 hypothetical protein
MRSVIRVPARTDNVYRPIGPAYWVDLVSTKRGIQLLPLWRRVHSHILRIS
LDRSCG
>MT0132 PE_PGRS family protein
MFAGGGAGGLGRCVMSFVSVAPEIVVAAATDLAGIGSAISAANAAAAAPT
TAVLAAGADEVSAAIAALFSGHAQAYQALSAQAAAFHQQFVQTLAGGAGA
YAAAEAQVEQQLLAAINAPTQALLGRPLIGNGADGAPGTGQAGGAGGILY
GNGGNGGSGAAGQAGGAGGPAGLIGHGGSGGAGGSGAAGGAGGHGGWLWG
NGGVGGSGGAGVGAGVAGGHGGAGGAAGLWGAGGGGGNGGNGADANIVSG
GDGGLGGAGGGGGWLYGDGGAGGHGGQGAGGGAGGAGGDGGQGGAGRGLW
GTGGAGGHGGQGGGTGGPPLPGQAGMGAAGGAGGLIGNGGAGGDGGVGAS
GGVAGVXGAGGNAMLIGHGGAGGAGGDSSFANGAAGGAGGAGGHLFGNGG
SGGHGGAVTAGNTGIGGAGGVGGDARLIGHGGAGGAGGDRAGALVGRDGG
PGGNGGAGGQLYGNGGDGGPGGQGGQAFGANNIGGTGGAGGNGGPAILSG
NGGNGGAGGAGGAGGAGGGAGGVGGAGGAPGTGGTLQAAVSGLVTALFGA
PGQPGDTGQPG
>MT2241 conserved hypothetical protein
MNSIQIADETYVAADAARVSAAVADRCSWRRWWPDLRLQVTEDRADKGIR
WTVTGALTGTMEIWLEPSMDGVLLHYFLHAEPTGVAAWQLARMNLARMTH
HRRVAGKKMAFEVKTVLERSRPIGVSPVT
>MT1844 hypothetical protein
MTAVADAPQADIEGVASPQAVVVGVMAGEGVQIGVLLDANAPVSVMTDPL
LKVVNSRLRELGEAPLEATGRGRWALCLVDGAPLRATQSLTEQDVYDGDR
LWIRFIADTERRSQVIEHISTAVASDLSKRFARIDPIVAVQVGASMVATG
VVLATGVLGWWRWHHNTWLTTIYTAVIGVLVLAVAMLLLMRAKTDADRRV
ADIMLMSAIMPVTVAAAAAPPGPVGSPQAVLGFGVLTVAAALALRFTGRR
LGIYTTIVIIGALTMLAALARMVAATSAVTLLSSLLLICVVAYHAAPALS
RRLAGIRLPVFPSATSRWVFEARPDLPTTVVVSGGSAPVLEGPSSVRDVL
LQAERARSFLSGLLTGLGVMVVVCMTSLCDPHTGQRWLPLILAGFTSGFL
LLRGRSYVDRWQSITLAGTAVIIAAAVCVRYALELSSPLAVSIVAAILVL
LPAAGMAAAAHVPHTIYSPLFRKFVEWIEYLCLMPIFPLALWLMNVYAAI
RYR
>MT1769 conserved hypothetical protein
MAMSVNGLPGAHNAGLQPIDSKGCHTRRTRHTKVLFVSKGVLANGRGRWL
AIAASLVVSAAILYAQGAEHTCCRETPAAIPTGPDSAPANAPRIASPTEA
DLLAASAPVAAQQFQFALPAGVASEEGLQVKTIWVARAVSVLFPQITNIF
GYRQDPLKWHPNGLAIDVMIPNHHSDEGIQLGNQVAGLALANAKRWGVLH
VIWRQGYYPGIGAPSWTADYGSETLNHYDHVHIATDGGGYPTGRETYYVG
SMSPTPPE
>MT1837 PE family protein
MSFVTTQPEALAAAAGSLQGIGSALNAQNAAAATPTTGVVPAAADEVSAL
TAAQFAAHAQIYQAVSAQAAAIHEMFVNTLQMSSGSYAATEAANAAAAG
>MT2548 hypothetical protein
MPWPSAAASGVVGWRTTATASQRYHRPMSDTPFAEPYPEQRPPWGVPPPG
WDGSSRPAPSTTPRSPGRWSLVAALALAVVSLGVGIVGWFHRQPHDKPSP
APSAPTFTSQQISDAKENVCAAHRIVRQAAVLNTNQANPVPGDPTGDLAV
AANARLALYSGGDYLLRRLTAEPATPAELRDAVRSLANALQELAVNYLAG
APDSVVTPLRLALERDTRAVDPLCV
>MT2425 PPE family protein
MVNFSVLPPEINSGRMFFGAGSGPMLAAAAAWDGLAAELGLAAESFGLVT
SGLAGGSGQAWQGAAAAAMVVAAAPYAGWLAAAAARAGGAAVQAKAVAGA
FEAARAAMVDPVVVAANRSAFVQLVLSNVFGQNAPAIAAAEATYEQMWAA
DVAAMVGYHGGASAAAAALAPWQQAVPGLSGLLGGAANAPAAAAQGAAQG
LAELTLNLGVGNIGSLNLGSGNIGGTNVGSGNVGGTNLGSGNYGSLNWGS
GNTGTGNAGSGNTGDYNPGSGNFGSGNFGSGNIGSLNVGSGNFGTLNLAN
GNNGDVNFGGGNTGDFNFGGGNNGTLNFGFGNTGSGNFGFGNTGNNNIGI
GLTGDGQIGIGGLNSGTGNIGFGNSGNNNIGFFNSGDGNIGFFNSGDGNT
GFGNAGNINTGFWNAGNLNTGFGSAGNGNVGIFDGGNSNSGSFNVGFQNT
GFGNSGAGNTGFFNAGDSNTGFANAGNVNTGFFNGGDINTGGFNGGNVNT
GFGSALTQAGANSGFGNLGTGNSGWGNSDPSGTGNSGFFNTGNGNSGFSN
AGPAMLPGFNSGFANIGSFNAGIANSGNNLAGISNSGDDSSGAVNSGSQN
SGAFNAGVGLSGFFR
>MT3572 conserved hypothetical protein
MGSGSRERIVEVFDALDAELDRLDEVSFEVLTTPERLRSLERLECLVRRL
PAVGHALINQLDAQASEEELGGTLCCALANRLRITKPDAALRIADAADLG
PRRALTGEPLAPQLTATATAQRQGLIGEAHVKVIRALFRPPARRGGCVHP
PGRRSRPGRQSRSISSRRAGPLRPAGHGLATPRRRPHRHRTRPQTRHHPE
QPALRRYVTAKWLPDPPSAGHL
>MT1321 hypothetical protein
MPNHRVALGQGSEPGDGVPPARRLMGSPVIRMVPLGPILMRENWVTGC
>MT0595 conserved hypothetical protein
MTSWGIRMGSPSGSRGDFWPPALVAYCPPAVAPGACTIEVPKEGTLMKAK
VGDWLVIKGATIDQPDHRGLIIEVRSSDGSPPYVVRWLETDHVATVIPGP
DAVVVTAEEQNAADERAQHRFGAVQSAILHARGT
>MT3987 PPE family protein
MITMLWHAMPPELNTARLMAGAGPAPMLAAAAGWQTLSAALDAQAVELTA
RLNSLGEAWTGGGSDKALAAATPMVVWLQTASTQAKTRAMQATAQAAAYT
QAMATTPSLPEIAANHITQAVLTATNFFGINTIPIALTEMDYFIRMWNQA
ALAMEVYQAETAVNTLFEKLEPMASILDPGASQSTTNPIFGMPSPGSSTP
VGQLPPAATQTLGQLGEMSGPMQQLTQPLQQVTSLFSQXGGTGGGNPADE
EAAQMGLLGTSPLSNHPLAGGSGPSAGAGLLRAESLPGAGGSLTRTPLMS
QLIEKPVAPSVMPAAAAGSSATGGAAPVGAGAMGQGAQSGGSTRPGLVAP
APLAQEREEDDEDDWDEEDDW
>MT2944 hypothetical protein
MCWLSDPAASSAPQVARQDEQHDCGKRDQEPVLDVADVVAPQHRQQEAED
AQADDVPGAVIDPAPPPRCRWNLGGVDLMLRAGEVLHLGTCHGDSSGWTD
VELPLPEHSGTCSVALGRVGGLRGRLRRRH
>MT3279 hypothetical protein
MITVLDMNGFKDARPDRLPLSASVWDIAQRYNKGGPTVTEALYEALKELE
AQVIALQRSEGKGLLSRLS
>MT1055 hypothetical protein
MVSEALRWEIGRDRVYRLGRGRYGPGYIPRSTEYRIHQRVLALRASANVS
LRGGQSVHPLPAETPVADVI
>MT0392 hypothetical protein
MILPALSTAMSCSMTRATRMSSTLSRARWTASAAACSQDVGLVPMMSITL
YTLMGVKRIASSIRESWRVDSSSPPSPRPLSVVSEPGDAVIFGSAGCAGL
AHCAAFSAVAGVAGVAVYGPACSSIATFATGAGGPAATAVAAGPAAADQP
RGATVTVRIPGLPAARHHHTLSRAGSRFNGQQAPKAPAHDAQ
>MT0662.1 hypothetical protein
MGSDCGCGGYLWSMLKRVEIEVDDDLIQKVIRRYRVKGAREAVNLALRTL
LGEADTAEHGHDDEYDEFSDPNAWVPRRSRDTG
>MT1249 hypothetical protein
MAVQCRVWLEIQWRGMLGADQARAGGPARIWREHSMAAMKPRTGDGPLEA
TKEGRGIVMRVPLEGGGRLVVELTPDEAAALGDELKGVTS
>MT2878 hypothetical protein
MRGCAGGLGPGSVAAVTRPGWSGGVVPASVAVHRVRGDACVVAGERVAAP
RRHGGGDRVGAGGEGHQPGRVPPDRHGCGSPGGDGAGLLRRFAERVEAVR
SVFTVWLCAVDADPVMPDAGGGGFVDAVVAIGALAAAIGRRFSLPTVSLA
ETAVAVSGGRLLAPGWPGEWVQHESTLP
>MT1775 hypothetical protein
MICPKCWAIERVCTPGLTYRPKTGLPPSGRSSSCYCGAPRQLVDRQEEQR
AAMVIAAAVVSGQRSQHGRHDVDRAVDGANSLCRLAETDQRDLRWVDHPE
DRVNPLVTDVGHGDREVR
>MT0740 hypothetical protein
MAGSDPPTGGPASQAGSDAGASPEHKHMSRRKHLVLDVCIILGVLIAYVF
SLLGYDWLAHTPGPLPQPDVGTTDDTVVLIRFEELHTVANRLDVKVLVLP
DDSMIDHRLQVLTTDTSVRLYPENELGDLQYPVGKLPAQVATTIEAHGNP
GAWPFDTYTTDTVQADVLVGAGDNRQYVPARVEVTGSLEGWDISAVRVGE
SSQTSDRPDNVIITLKRAKGPLVFDLGICLVLITLPTLALFVAIQMITGR
RKFQPPFGTWYAAMLFAVVPLRTILPGSPPAGAWIDRAVVIWVLIALAAA
MVVYIVAWYRESD
>MT0513 hypothetical protein
MGESTTQPAGGAAVDDETRSAALPRWRGAAGRLEVWYATLSDPLTRTGLW
VHCETVAPTTGGPYAHGWVTWFPPDAPPGTERFGPQPAQPAAGPAWFDIA
GVRMAPAELTGRTRSLAWELSWKDTAAPLWTFPRVAWERELLPGAQVVIA
PTAVFAGSLAVGETTHRVDSWRGSVAHIYGHGNAKRWGWIHADLGDGDVL
EVVTAVSHKPGLRRLAPLAFVRFRIDGKDWPASPLPSLRMRTTLGVRHWQ
LEGRIGGREALIRVDQPPERCVSLGYTDPDGAKAVCTNTEQADIHIELGG
RHWSVLGTGHAEVGLRGTAAPAIKEGTPA
>MT0856 hypothetical protein
MSISRSKRQSPVRWRQRRRRSGFPLGQRRSPSMTDRGRQSAEAGLSTMRP
KSSTRQRTCGARWPVWRSVVCCSTAAKSAVIVCCAAIATTACSFQATSTQ
PSTAPPTSRVDSLIVSIEDVRRIANYEELAAHFQTDLREPPEADTNVPGP
CRVVGSSDRTFGTDWSEFRSAGYHGVTDDLRPGGPVMVETVSQAIALYPD
PSTARGVFHRLESSLAECAGLHDPYFDFILDRPDASTVRIGAAGWSHVYR
LKSSVFISVGVLGIEPAEPIANVILQTISDRIQ
>MT2093 hypothetical protein
MLDRYGTDVLAAGGRRRPRSVEHPVELGMVVEDAETGYVGAVVRVEYGRI
DLEDRYGKTRGFPLGPGYLLDGLPVILTAPRCAAAAGPRRTASGSVAVPG
ARARVARASRIYVEGRHDAELIAAVWGADLRIEGVVVEHLGGVDDLVEIV
AKFRPGPRRRLGVLVDHLVAGSKEARIAEVVRRGPGGSDTLVVGHPYVDI
WQAVKPQRVGLAAWPRVPRHIEWKHGVCDALGWPHADQADIAAAWRRIRS
QVRDWTDLEPALIGRVEELIDFVTQPAGDE
>MT0105 PPE family protein
MAIPPEVHSGLLSAGCGPGSLLVAAQQWQELSDQYALACAELGQLLGEVQ
ASSWQGTAATQYVAAHGPYLAWLEQTAINSAVTAAQHVAAAAAYCSALAA
MPTPAELAANHAIHGVLIATNFFGINTVPIALNEADYVRMWLQAADTMAA
YQAVADAATVAVPSTQPAPPIRAPGGDAADTRLDVLSSIGQLIRDILDFI
ANPYKYFLEFFEQFGFSPAVTVVLALVALQLYDFLWYPYYASYGLLLLPF
FTPTLSALTALSALIHLLNLPPAGLLPIAAALGPGDQWGANLAVAVTPAT
AAVPGGSPPTSNPAPAAPSSNSVGSASAAPGISYAVPGLAPPGVSSGPKA
GTKSPDTAADTLATAGAARPGLARAHRRKRSESGVGIRGYRDEFLDATAT
VDAATDVPAPANAAGSQGAGTLGFAGTAPTTSGAAAGMVQLSSHSTSTTV
PLLPTTWTTDAEQ
>MT3839 hypothetical protein
MDLMMPNDSMFLFIESREHPMHVGGLSLFEPPQGAGPEFVREFTERLVAN
DEFQPMFRKHPATIGGGIARVAWAYDDDIDIDYHVRRSALPSPGRVRDLL
ELTSRLHTSLLDRHRPLWELHVVEGLNDGRFAMYTKMHHALIDGVSAMKL
AQRTLSADPDDAEVRAIWNLPPRPRTRPPSDGSSLLDALFKMAGSVVGLA
PSTLKLARAALLEQQLTLPFAAPHSMFNVKVGGARRCAAQSWSLDRIKSV
KQAAGVTVNDAVLAMCAGALRYYLIERNALPDRPLIAMVPVSLRSKEDAD
AGGNLVGSVLCNLATHVDDPAQRIQTISASMDGNKKVLSELPQLQVLALS
ALNMAPLTLAGVPGFLSAVPPPFNIVISNVPGPVDPLYYGTARLDGSYPL
SNIPDGQALNITLVNNAGNLDFGLVGCRRSVPHLQRLLAHLESSLKDLEQ
AVGI
>MT2081 conserved hypothetical protein
MQYGPTWRIGSSIMTLIQTVTTDDLVIQVADRRLSRPDGSVFDDDYTKLV
CWNTSFTVGFTGLARIDPAQKKSTSEWLAETLCDYASFEDGVDALRYWAS
GQIGQLPTGKGWEDKRLGIIIAGFDRRRIPLVAEISNFDPEAPIPANQNE
FKCYRIRRAPGHSASFRITGAALTEKMYANILLRRVPRILKQQDGITRAA
RLMVALQRRISEDNPGVGRHAMAVAIPRERTMPAVLSNLDAPSLNTMNSN
FCYFDDAGFNYKQLGPHMAGGGWAWADFVAEADPSNPDMQKVGGRVLKCP
QPPPQAESTGC
>MT2885 hypothetical protein
MLRDGSVRAVDQESRMILFSPIGTADPITALGDGPMLHIVRHYRPIVVVL
FLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLFVPVFR
NHLVELSAEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPA
RALSKPGDRESPDAYDLELMWDANDDNQPGAPNRCFEATSAALGALLERA
NLKQLIVSYDYSAAVTIAADSRLPDQVSNLIRGAMHRSRLEHLVAPKFFK
DTAFTYDPANKVAEYISALALLAKREQWAEFARSATPAITIVLRAAVAKH
LPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSPNAEWYLYTKDWLALL
RQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLK
ILARETGADLTLYDRLNDEIIRQIDMAPLG
>MT0408 hypothetical protein
MGVIARVVGVAACGLSLAVLAAAPTAGAEPTGALPPMTSSGSGPVIGDGD
AALRQRISQQLFSFGDPTVQEVDGSDAAQFITAAAAVADRDVASVFLPLQ
RVLGCQQNTAGSGAGFGARAYRRTDGQWGGAMLVVAKSTVSDVDALKACV
KSGWRKATAGTPTSMCNNGWTYPPFADTRRGEEGYFVLLAGTASDFCSAP
NANYRTTASSWPG
>MT3756.3 hypothetical protein
MEYAIGTIAAAAFGAILYTVVTGDSIVSALNRIIGRALSTKV
>MT2695 conserved hypothetical protein
MSAGPAIEVAVAFVWLGMVVAISFLEAPLKFRAAGVTLQIGLGIGRLVFR
ALNTVEVGFALVILAIVVVGSTPARIAAAFSVALAALAVQLIAVRPRLTR
RSNQVLAGLQAPRSRGHHIYVGLEIVKVVALLVAGILLLNG
>MT2440 PE family protein
MWRVRDYWDERLPTAAAAGRAPAESSTFSVQEVSMSLVSVAPELVVTAVP
DVARIGSSIGAPDTAAAARPTTSVLAAGADEVSADVVALFGWVAR
>MT2959 PPE family protein
MDFGVLPPEINSGRMYAGPGSGPMMAAAAAWDSLAAELGLAAGGYRLAIS
ELTGAYWAGPAAASMVAAVTPYVAWLSATAGQAEQAGMQARAAAAAYELA
FAMTVPPPVVVANRALLVALVATNFFGQNTPAIAATEAQYAEMWAQDAAA
MYAYAGSAAIATELTPFTAAPVTTSPAALAGQAAATVSSTVPPLATTAAV
PQLLQQLSSTSLIPWYSALQQWLAENLLGLTPDNRMTIVRLLGISYFDEG
LLQFEASLAQQAIPGTPGGAGDSGSSVLDSWGPTIFAGPRASPSVAGGGA
VGGVQTPQPYWYWALDRESIGGSVSAALGKGSSAGSLSVPPDWAARARWA
NPAAWRLPGDDVTALRGTAENALLRGFPMASAGQSTGGGFVHKYGFRLAV
MQRPPFAG
>MT2331 hypothetical protein
MRLPGRHVLYALSAVTMLAACSSNGARGGIASTNMNPTNPPATAETATVS
PTPAPQSARTETWINLQVGDCLADLPPADLSRITVTIVDCATAHSAEVYL
RAPVAVDAAVVSMANRDCAAGFAPYTGQSVDTSPYSVAYLIDSHQDRTGA
DPTPSTVICLLQPANGQLLTGSARR
>MT1232 hypothetical protein
MSQSAICTYRRRRRVNLPNQRPLDVQNAPAAQFTMWFDATTTTSWVTMPG
RWRISQIAVCRKLFNTVDSSSRPTP
>MT1468 conserved hypothetical protein
MVLGDQFVVDPHATRSVGMESVIADPGLPAIASNDLRTRRRQLGVVGEPV
QAHPQVHGEHQLQRRTIVAPRLVVTRSGSGLARTQQQRENPQQHWENELL
GWRSPVRREAIPPRRCLGRSLHTLTITWSRDRRSWPRRRAGDYGHTMKRL
SSVDAAFWSAETAGWHMHVGALAICDPSDAPEYSFQRLRELIIERLPEIP
QLRWRVTGAPLGLDRPWFVEDEELDIDFHIRRIGVPAPGGRRELEELVGR
LMSYKLDRSRPLWELWVIEGVEGGRIATLTKMHHAIVDGVSGAGLGEILL
DITPEPRPPQQETVGFVGFQIPGLERRAIGALINVGIMTPFRIVRLLEQT
VRQQIAALGVAGKPARYFEAPKTRFNAPVSPHRRVTGTRVELARAKAVKD
AFGVKLNDVVLALVAGAARQYLQKRDELPAKPLIAQIPVSTRSEETKADV
GNQVSSMTASLATHIEDPAKRLAAIHESTLSAKEMAKAPSAHQIMGLTET
TPPGLLQLAARAYTASGLSHNLAPINLVVSNVPGPPFPLYMAGARLDSLV
PLGPPVMDVALNITCFSYQDYLDFGLVTTPEVANDIDEMADAIEPALAEL
ERAAE
>MT2874 conserved hypothetical protein
MVSTTGMGRSTARRMLTGPGLPEPAEQVDGRRLRARGFSDDARALLEHVW
ALMGMPCGKYLVVMLELWLPLVAAAGDLDKPFATEAAVAELKAMSAATVD
RYLKPARERMRIKGISTTKPSPLLRNSITIHTCSDEAPKVPGVIEADTVA
HCGPSLIGEFARTLTMTDLVTGWTENASIRNNAAKWILEGIKECQQRFPF
PMTVFDSDCGGEFINHDVAGWLQARDIAQTRSRPYQKNDQAHVESKNNHV
VRKHAFYWRYDTGEELELLNRLWPLVSLRCNFFTPTKKPVGYTSTVNGRR
KRIYDKPATPWQRLQASGVLDAQQLSTVAARIEGFNPADLTRQINAIQMQ
LLDLAKTKTEALATARHIDLQSLQPSINRLAKAK
>MT2345.1 hypothetical protein
MSRRRPLIEPATVQVLAIAFTDSFSVSLHWPQREQGCRTAILAPMRRWCD
GDVDGRKLLPPARRTGTQQRRIRPAAPRVYTTGDILRDRKGIAPWQEQRE
PGWAPFGWLHEPSGARCPKADGQSV
>MT3105 PE family protein
MEGIVMSLLDAHIPQLIASHTAFAAKAGLMRHTIGQAEQQAMSAQAFHQG
ESAAAFQGAHARFVAAAAKVNTLLDIAQANLGEAAGTYVAADAAAASSYT
GF
>MT1280.1 PE_PGRS family protein
MHRQATGHASCRPVGQSRRERGPAMEYLIAAQDVLVAAAADLEGIGSALA
AANRAAEAPTTGLLAAGADEVSAAIASLFSGNAQAYQALSAQAAAFHQQF
VRALSSAAGSYAAAEAANASPMQAVLDVVNGPTQLLLGRPLIGDGANGGP
GQNGGDGGLLYGNGGNGGSSSTPGQPGGRGGAAGLIGNGGAGGAGGPGAN
GGAGGNGGWLYGNGGLGGNGGAATQIGGNGGNGGHGGNAGLWGNGGAGGA
GAAGAAGANGQNPVSHQVTHATDGADGTTGPDGNGTDAGSGSNAVNPGVG
GGAGGIGGDGTNLGQTDVSGGAGGDGGDGANFASGGAGGNGGAAQSGFGD
AVGGNGGAGGNGGAGGGGGLGGAGGSANVANAGNSIGGNGGAGGNGGIGA
PGGAGGAGGNANQDNPPGGNSTGGNGGAGGDGGVGASADVGGAGGFGGSG
GRGGLLLGTGGAGGDGGVGGDGGIGAQGGSGGNGGNGGIGADGMANQDGD
GGDGGNGGDGGAGGAGGVGGNGGTGGAGGLFGQSGSPGSGAAGGLGGAGG
NGGAGGGGGTGFNPGAPGDPGTQGATGANGQHGLNG
>MT3954.1 hypothetical protein
MVTGRDRRVDSDQRPPAPSWILGSHGLALLCMEVLPFIVNHVVLMKSSRI
PGPTQPTVVRELVAVGEPSYTALPAGLPHHPRPQRGGSSRAAPVLVDDSM
PHPGTRRCCALPQPGHQFPKPGQFRRQSWSRRSLSRAQTLSRKRRALQPL
PRPPYLTTTPSKRHYKRYKPPKYAHSGPRLEDQPVQCLLTLFAFDRKQVP
>MT3854 PE family protein
MGPEPAQSRERHMQSMSFDPAVADIGSQVVNNAFQGLQAGAVAWVSLSSL
LPAGAEEVSAWAVTAFTTAATGLLALNQAAQEELRKAGEVFTAIARMYSD
ADVRAAACLLEAIPRPGQTLARE
>MT3855 hypothetical protein
MANPTSAATAASGSSNSMWCASSPVGGQTSTSTRRSGSSVSAWVNTTSRN
WADRSGSTANRYSTPPETLSLLSTTEAASARKAPTITLPNLPLGWVSARS
AESSCVVGMLLVHGTTFLQSPGGRRSRPARQKPIVAGMSLTSTGSDIAPV
PPVTTSTQRPLTGRRSWISKNANPGNSATAASGASKFRRIGSSSVGGLIS
TVMSRSELSGSACVNTTNTNRASRSGPTANLDSTPPCTLSLLSTAAAASA
KNAPVSITPLNLPRRRWTTCPGVARGGSALGDKLFSATAYLVFYFVSADY
ILLLLSLPTTEFLA
>MT3322 hypothetical protein
MLAMIALIKVIRSGGATVTAQCRLPAPQYPVPAQGRHIDHGPQPLALTER
GDAADHVAGGLFGGSGFSHGRFGHP
>MT0724.1 hypothetical protein
MFSDFRRVGRPQRSLPNVHFGQALSRGPNSVAPHRTAVVRSTMMALTSEV
GRGPGFAVWRPAKKWLF
>MT3216 hypothetical protein
MNHLTTLDAGFLKAEDVDRHVSLAIGALAVIEGPAPDQEAFLSSLAQRLR
PCTRFGQRLRLRPFDLGAPKWVDDPDFDLGRHVWRIALPRPGNEDQLFEL
IADLMARRLDRGRPLWEVWVIEGLADSKWAILTKLHHCMADGIAATHLLA
GLSDESMSDSFASNIHTTMQSQSASVRRGGFRVNPSEALTASTAVMAGIV
RAAKGASEIAAGVLSPAASSLNGPISDLRRYSAAKVPLADVEQVCRKFDV
TINDVALAAITESYRNVFIQRGERPRFDSLRTLVPVSTRSNSALSKTDNR
VSLMLPNLPVDQENPLQRLRIVHSRLTRAKAGGQRQFGNTLMAIANRLPF
PMTAWAVGLLMRLPQRGVVTVATNVPGPRRPLQIMGRRVLDLYPVSPIAM
QLRTSVAMLSYADDLYFGILADYDVVADAGQLARGIEDAVARLVAISKRR
KVTRRRGALSLVV
>MT3767.2 hypothetical protein
MLPLGLPPSWPSGAPTRSAAPIKLTGANERRHIAHVAHASRSPLPATVTV
STTRHNASRTLQGRPTCDV
>MT1414.1 hypothetical protein
MIWGTCCERGGMLVTMVSLLVNQGVGRQSPRPATMDGAGFEAPCMPYD
>MT3535 hypothetical protein
MAALSARRGPKPGKAGANAADAEIAWLRAELDTAREVIRVQGELSALLER
L
>MT0989 hypothetical protein
MRVPSQWMISSRVTVAWNIVGYLVYAALAFVGGFAVWFSLFFAMATDGCH
DSACDASYHVFPAMVTMWIGVGAVLLLTLVVMVRNSSRGNVVIGWPFVGL
LALGLVYVAADAVLH
>MT0069 hypothetical protein
MIMELSVSVIAGLVIALLAAITPAAGERPESRRQALANAAEAGEHPATSP
LRR
>MT0609 hypothetical protein
MVLPVAPYILLRHQLVGVAANPDRIYGEGMTDQSYAVDIAHPPAALLRLV
NPILRSLLHTPLAGPLRTQLMVVSFTGRKTGRHFSIPLSAHVIDNDLYAL
TEAGWKHNFSDGAAAQVVYDGKTTAMRGELIRDRAVVSELFLRAAQAYGV
KRGQRMLGLSFRDRRIPTLEEFAEAVDRLKLVAIRLTPADNS
>MT0545 hypothetical protein
MVGMQLPQWLARFNRYVTNPIQRLWAGWLPAFAILEHVGRRSGKPYRTPL
NVFSADVDGRAGVAILLTYGPNRDWLKNITAAGGGRMRRYGKTFGVANPR
RLTKAEAAPYVSSRWRPVFARLPFDEAVLLTKAD
>MT3097 secreted antigen, putative
MVSQSMYSYPAMTANVGDMAGYTGTTQSLGADIASERTAPSRACQGDLGM
SHQDWQAQWNQAMEALARAYRRCRRALRQIGVLERPVGDSSDCGTIRVGS
FRGRWLDPRHAGPATAADAGD
>MT3387 hypothetical protein
MGQIPPQPVRRVLPLMVVPGNGQKWRNRTETEEAMGDTYRDPVDHLRTTR
PLAGESLIDVVHWPGYLLIVAGVVGGVGALAAFGTGHHAEGMTFGVVAIV
VTVVGLAWLAFEHRRIRKIADRWYTEHPEVRRQRLAG
>MT1645 conserved hypothetical protein
MAANAGSVRPNRRARPMIGIAQLLLVVAAGALWMAARLPWVVIGSFDELG
PPKEVTLTGASWSTALLPLALLMLAAAVAALAVRGWPLRALAVLLAAASF
AVGYLGISLWVVPDVAARGADLAHVPVVTLVGSARHYWGAVAAVLAAVCA
LLAAVFLMSSAAIRGSAGEDMARYAAPRARRSIARRQHSNAAGRAAPQDD
GPDMGPRMSERMIWEALDEGRDPTDREQESDTEGR
>MT2159 PE family-related protein
MSFVIASPEALLAAATDLAAIRSTIRAANAAAAVPTTGALAPAADEVSAG
IAALFGAQAQSYQAVSAQAAAFHDRFVQLLNAGGGSYASAEIANAQQNLL
NAVNAPTQTLLGRPLVGDGADGASGPVGQPGGDGGILWGNGGNGGDSTSP
GVAGGAGGSAGLIGNGGRGGNGAPGGAGGNGGLGGLLLGNGGAGGVGGTG
DNGVGDLGAGGGGGDGGLGGRAGLIGHGGAGGNGGDGGHGGSGKAGGSGG
SGGFGQFGGAGGLLYGNGGAAGSGGNGGDAGTGVSSDGFAGLGGSGGRGG
DAGLIGVGGGGGNGGDPGLGARLFQVGSRGGDGGVGGWLYGDGGGGGDGG
NGGLPFIGSTNAGNGGSARLIGNGGAGGSGGSGAPGSVSSGGVGGAGNPG
GSGGNGGVWYGNGGAGGAAGQGGPGMNTTSPGGPGGVGGHGGTAILFGDG
GAGGAGAAGGPGTPDGAAGPGGSGGTGGLLFGVPGPSGPDG
>MT3952 hypothetical protein
MDNSARQCVLIDSLAGTRAKWIVALSIHICPGCT
>MT1626 conserved hypothetical protein
MLGLSATGVLVGGLWAWIAPPIHAVVAITRAGERVHEYLGSESQNFFIAP
FMLLGLLSVLAVVASALMWQWREHRGPQMVAGLSIGLTTAAAIAAGVGAL
VVRLRYGALDFDTVPLSRGDHALTYVTQAPPVFFARRPLQIALTLMWPAG
IASLVYALLAAGTARDDLGGYPAVDPSSNARTEALETPQAPVS
>MT0566 hypothetical protein
MRIGRREGLAVAIGFVLVGAAFVLPRLNLGIKPRSDIGLERFATRAGAAP
IFGYWDAHVGWGTAPAVLTAVAVVAWGPVVAHRLPWRVLTLSTWATAAAW
AFSLAMIDGWQRGFAGRLTTRDEYLWQVPGIADIPATLRTFTSRILDFQP
NSWVTHVSGHPPGALLTFVWLDRIGLRGGGWAGLVCLLVGSSAAAAVLIA
VRVLASEQMARRTAPFVAVAPTAIWIAVSADGYFAGVAAWGIALLAVAVH
GATRFPALVAAGAGLLLGWGVFLQLRARADRAAGDGGVGRRRLAARPAGT
GAGRAGGAGGRGELRGCRILLVRRLYPCPATLLAGDRQRSAVRLLVLGKL
GVRGLRYRVRQRRRSQPGIRPGRDQSSIRLPSAAAGGAGRHRLGRPEHAE
QSRDRTNLAALHHLADRGARTAAAPLAPTLAGGQRRRGPTAEQHHLHQLV
SKCAGAASESSRAGSLREPTPRSAAIAYQRDVVHDVEVAQTLH
>MT2444 hypothetical protein
MIFKGVREGKPYPEHGLSYRDWSQIPPQQIRLDELVTTTTVLALDRLLSE
DSTFYGDLFPHAVKWRGTTYLEDGLHRAVRAALRNRTVLHARVFDMDASP
GGRRS
>MT0063 hypothetical protein
MPVVTAVGRRRGFAMPWVSTARSGAVMLANYSAGVCGRVSSPGLNVRKMC
LKANTPGAVTWLDTPKRFLSTQTASRCMAVNSSDVVTGRIDPQVLHTPLN
TDVDGYAHAMHSSINSGPLEYLPATFSVFPALGDVGDLGGGVGAATYALD
RLSNMRSGACVGGGESPWRSLMT
>MT2375 hypothetical protein
MPRRLGAAAEVTRVGVLDQAEDVGVDHRRRYLIHERAA
>MT1774 conserved hypothetical protein
MITFRLRLPCRTILRVFSRNSLVRGTDRLEAVVMLLAVTVSLLTIPFAAA
AGTAVHDSRSHVYAHQAQTRHPATATVIDHEGVIDSNTTATSAPPRTKIT
VPARWVVNGIERSGEVNAKPGTKSGDRVGIWVDSAGQLVDEPAPPARAIA
DAALAALGLWLSVAAVAGALLALTRAILIRVRNASWQHDIDSLFCTQR
>MT1970 hypothetical protein
MSFRESRMSGRKFSFEVTKTSSAPAATLFRLVTDGGNWATWAKPIVAQSS
WARRGDPAPGGIGAIRKLGMWPVFVQEETVEYEQDRRHVYKLVGARTPVQ
DYFGEVVLTPNASGGTDLRWSGSFTEKVRGTGPVMRAALGGAVRFFAGQL
VKAAEREAVRR
>MT3132 hypothetical protein
MGGPFDADAEAHFDEVAEAFAKLTNVDRDVGVDLEKELCMTVEADDRSDA
LVTRRLLPRVPRCIPLAARLAPGTIGCPSFWNPIATGGASRQAL
>MT2800 conserved hypothetical protein
MLSAIGIVPSAPVLVPELAGAAAAELADLGAAVIAAASLLPKSWIAVGTG
RADDVVRPTDVGTFAGFGADVRVGLAPQDGDGVAVPVELPLCALLTAWVR
GQARPEARAQVHVYASDHGSDAAVARGRQLRADIDREPDPIGVLVVADGL
NTLTPRAPGGYDPDGAGMQRALDDALASGDLAVLTRLPAQVLGRVAFQVL
AGLAEPGPRSAKEFYRGAPHGVGYFAGVWQP
>MT3875 hypothetical protein
MGSTPPRTPQEVFAHHGQALAAGDLDEIVADYADDSFVITPAGIARGKEG
IRQLFVKLLDDIPNALWDLKTQIFEGDILFLEWTANSAVSRVDDGVDTFV
FRDGTIWAHTVRYTPHPKT
>MT3873 hypothetical protein
MRSAFDSGRLTFGIVYTYARPNWWANANTVRSMIDAAGGLHPRVALMLDV
ESGGNPPGDGSSWINRLYWNLADYAGSPVRIIGYANAYDFFNMWRVRPAG
LRVIGAGYGSNPNLPGQVAHQYTDGSGYSPNLPQGAPPFGRCDMNSANGL
TPQQFAAACGVTTTGGPLMALTDEEQTELLTKAREIWDQLRGPNGAGWPQ
LGQNEQGQDLTPVDAIAVIKNDVAAMLAE
>MT2401.2 hypothetical protein
MAVRDMGIHGTEPLRRPVRFNLHMSGAAAVVRYVEHRWRSGAGRLVFSCD
ASAPPRQPPRAKQVLMPQC
>MT3595 hypothetical protein
MNIRCGLAAGAVICSAVALGIALHSGDPARALGPPPDGSYSFNQAGVSGV
TWTITALCDQPSGTRNMNDYSDPIVWAFNCALNVVSTTPQQITRTDRLQN
FSGRARMSSMLWTFQVNQADGVACPDGSTAPSSETYAFSDETLTGTHTTV
HGAVCGLQPKLSKQPFSLQLIGPPPSPVQRYPLYCNNIAMCY
>MT1312.1 hypothetical protein
MRHRQRTDPPIDCEFEMRRVLVGAAALITALLVLTGCTKSISGTAVKAGG
AGVPRNNNSQERYPNLLKECEVLTTDILAKTVGADPLDIQSTFVGAICRW
QAANPAGLIDITRFWFEQGSLSNERKVAEGLKYQVETRAIQGVDSIVMRT
GDPNGACGVASDAAGVVGWWVNPQAPGIDACGQAIKLMELTLATNA
>MT0555 hypothetical protein
MASRYPKNSERGAPGSRHCAQVGWAFSTVFALIVAIRLGRDWRLLTLAAP
GVGCGKVCDV
>MT0993 hypothetical protein
MRRPVSSMRVNRPQCARVPYSAESLVRVEASWYGRTLRAIPEVLSQVGYQ
QADHGESLLTSHHCCLGAAEGARPGWVGSSAGALSGLLDSWAEASTAHAA
RIGDHSYGMHLAAVGFAEMEEHNAAALAAVYPTGGGSARCDGVDVS
>MT2143 hypothetical protein
MTVAMFANAGLSPFVAIWTARAASLYTSHNFWCAAAVSAAVYVGSAVVPA
AVAGPLFVGRVSATIKAAAPSTTAAIATLATAANGQLRERGGAGGWVGVH
CPVVGGGGVGHPRKAIAAAVSVHSTCMPAAFGGHLGLGDRSRSVSLSGTP
>MT2844 hypothetical protein
MFGVGTAVEVGWRDPCGLAVGELRCAPAVSDQPVVGCAGCPLVDMVDFAP
VTGCVAVGSTMGAVPALLRVRFPWPPFEPDVRLSPYLALHGICRWGGSDS
CDRTTVQVFHLHSINKRLTAHAGFGAAAVVGLEDGPV
>MT3596 hypothetical protein
MRRLISVAYALMVATIVGLSAAGGWFYWDRVQTGGEASARALLPKLAMQE
IPQVFGYDYQTVERSLTAVYPLLTPDYRQEFQKSANAQIIPEAKKREVVV
QANVVGVGVMDAKRDCASVMVYLNRTVTDKTRQPLYDGSRLRVDFQRIDG
KWLIAYITPI
>MT3449 PE_PGRS family protein
MVMSLMVAPELVAAAAADLTGIGQAISAANAAAAGPTTQVLAAAGDEVSA
AIAALFGTHAQEYQALSARVATFHEQFVRSLTAAGSAYATAEAANASPLQ
ALEQQVLGAINAPTQLWLGRPLIGDGVHGAPGTGQPGGAGGLLWGNGGNG
GSGAAGQVGGPGGAAGLFGNGGSGGSGGAGAAGGVGGSGGWLNGNGGAGG
AGGTGANGGAGGNAWLFGAGGSGGAGTNGGVGGSGGFVYGNGGAGGIGGI
GGIGGNGGDAGLFGNGGAGGAGAAGLPGAAGLNGGDGSDGGNGGTGGNGG
RGGLLVGNGGAGGAGGVGGDGGKGGAGDPSFAVNNGAGGNGGHGGNPGVG
GAGGAGGLLAGAHGAAGATPTSGGNGGDGGIGATANSPLQAGGAGGNGGH
GGLVGNGGTGGAGGAGHAGSTGATGTALQPTGGNGTNGGAGGHGGNGGNG
GAQHGDGGVGGKGGAGGSGGAGGNGFDAATLGSPGADGGMGGNGGKGGDG
GGGGAGGTAVAGTAGKAGDGGAGGDGGKAGDGGAGAAGDVTLAVNQGAGG
DGGNGGEVGVGGKGGAGGVSANPALNGSAGANGTAPTSGGNGGNGGAGAT
PTVAGENGGAGGNGGHGGSVGNGGAGGAGGNGVAGTGLALNGGNGGNGGI
GGNGGSAAGTGGDGGKGGNGGAGANGQDFSASANGANGGQGGNGGNGGIG
GKGGDAFATFAKAGNGGAGGNGGNVGVAGQGGAGGKGAIPAMKGATGADG
TAPTSGGDGGNGGNGASPTVAGGNGGDGGKGGSGGNVGNGGNGGAGGNGA
AGQAGTPGPTSGDSGTSGTDGGAGGTGGAGGAGGTLAGHGGNGGKGGNGG
QGGIGGAGERGADGAGPNANGANGENGGSGGNGGDGGAGGNGGAGRKAQA
AGYTDGATGTGGDGGNGGDGGKAGDGGAGENGLNSGAMLPGGGTVGNPGT
GGNGGNGGNAGVGGTGGKAGTGSLTGLDGTDGITPNGGNGGAGMINGGLG
GFGGAGGGGAVDVAATTGGAGGNGGAGGFASTGLGGPGGAGGPGGAGDFA
SGVGGVGGAGGDGGAGGVGGFGGQGGIGGEGRTGGNGGSGGDGGGGISLG
GNGGLGGNGGVSETGFGGAGGNGGYGGPGGPEGNGGLGGNGGAGGNGGVS
TTGGDGGAGGKGGNGGDGGNVGLGGDAGSGGAGGNGGIGTDAGGAGGAGG
AGGNGGSSKSTTTGNAGSGGAGGNGGTGLNGAGGAGGAGGNAGVAGVSFG
NAVGGDGGNGGNGGHGGDGTTGGAGGKGGNGSSGAASGSGVVNVTAGHGG
NGGNGGNGGNGSAGAGGQGGAGGSAGHGGNGGGATGGDGGNGGNGGNSGN
STGVAGLAGGAAGAGGNGGGTSSAAGHGGSGGNGGSGGSGGSGTTGGAGA
AGGNGGAGAGGGSLSTGQSGGPRRQRWCRWQRRRWLGRQRRRRWCRWQRR
CRRQRWRWRCRQRRLRRQWRQGRRRCRPWLHRRRGRQGRRWRQRRFQQRQ
RSRWQRR
>MT1936 hypothetical protein
MPRTWIATAGNLYRPLSVNCWPTMDYQCRRHRPESCLQPKAITARRSGVR
SARRWLRPCAADERQWAWLVDRADAHVKEPSIMQPDAYPVRVRGDLDPAL
SRWQWLVKWFLAIPHYIVLFFLHVAAVVVTVIAFFAILFTGRYPRTLFDF
NVGVMRWRWRVAFYALSALGTDRYPPFSLQTKAEYPADLEVDYPERLSRG
LVLIKWWLLAIPHYLILAVFLSSGWRVFLIDPHDRVGIMWPSLLVILLLV
AVVALLFTGRYPIGLYNLVIGVNRWALRVRAYTTLMRDEYPPLRLDMGPR
EQVSQPATAASDYSAGGAESP
>MT0253 hypothetical protein
MIRTQVQLPDELYRDAKRVAHEHEMTLAEVVRRGLEHMVRIYPRRDAASD
TWQPPTPRRLGPFRASEETWRELANEA
>MT2341 hypothetical protein
MTWMSRGAPMWLPYEATPPYERWPARDVDPKGPAACSIGGLRIVLASLRL
VAVAGSLSVMPIFEAPQSTLAWGHFSSMGRRH
>MT3191 hypothetical protein
MAPPTTTSTRAATIAHPAPASHHMTPNAASTGDSAKNTITGCCLITARAL
VARTRSISLPGMPFRMPADYHNASSDEPTNRHPWPAPARCCRHEWRTMRR
TNACDRRRFGLSLTIHEDACRIISVVPVVLEVRRAEPAHPATPYPEPLAR
CSRSPGLNESSHMSGRIPP
>MT0553 hypothetical protein
MLYLLLVLILATLIYLGWRAARAQMNRPKTRVIGPDDDPEFLRRLGHGDN
NRS
>MT3581 PE family protein
MSFTAQPEMLAAAAGELRSLGATLKASNAAAAVPTTGVVPPAADEVSLLL
ATQFRTHAATYQTASAKAAVIHEQFVTTLATSASSYADTEAANAVVTG
>MT3878 hypothetical protein
MGSTPWCPNPCQCTLRTPVEVLELAVALRPENPDRTAGAIQRILRAQLAG
DRIALRGRGS
>MT0406 hypothetical protein
MPEHELGPVRALGWLREDRKPLLNAKLLVLGHLALNVYDPDNGYGEEVLD
FEPRTVWWGSANWTVRAGSHLEVGFACDDPTLVEEATAFVADVIXXSEXI
DTTCAGPEXNLVQVEFDDAAMAEAMEEMAEPDDDGEDW
>MT1025 hypothetical protein
MNYRCVIALGALTRPRWPTMGPRGDRNRREQVIMPSIPQSLLWISLVVLW
LFVLVPMLISKRDAVRRTSDVALATRVLNGGAGARLLKRGGPAAGHRWGY
LPPEGQGDDPDWKPEEDWRDDPVEDGFADVEHDIDEDQEADDARRRGAVV
MKVAAPQTAGADEPDYLDVDVVEEDSEALPVGAGAAVGESADEADAEAAD
GVAGHADPEADPVEYEYEYEYVEDTCGLELEEDDQEAPPTVASGTSRRRR
FDTKTAAAVSARKYTFRKRALIVMAVILVGSAAAAFELTPVAWWICGSAT
GVTVLYLAYLRRQTRIEEKVRRRRMQRIARARLGVENTRDREYDVVPSRL
RRPGAVVLEIDDEDPIFTHLESAAPIRNYGWPRDLPRAVGQ
>MT0438 hypothetical protein
MMAEKNTRRATSQREAVAKIREAETIVMNLPICGQVKIPRPEHLAYYGGL
AALAALELIDWPVALVIATGHILANNHHNRVLEELGEAMEEA
>MT1533 conserved hypothetical protein
MWCPSVSLSIWANAWLAGKAAPDDVLDALSLWAPTQSVAAYDAVAAGHTG
LPWPDVHDAGTVSLLQTLRAAVGRRRLRGTINVVLPVPGDVRGLAAGTQF
EHDALAAGEAVIVANPEDPGSAVGLVPEFSYGDVDEAAQSEPLTPELCAL
SWMVYSLPGAPVLEHYELGDAEYALRSAVRSAAEALSTIGLGSSDVANPR
GLVEQLLESSRQHRVPDHAPSRALRVLENAAHVDAIIAVSAGLSRLPIGT
QSLSDAQRATDALRPLTAVVRSARMSAVTAILHSAWPD
>MT0639 hypothetical protein
MRRRPLFLTAAVSSVAISLRRAAGTAFEIEVDMPTSLAGNGVDLGAAIEV
AGTFALETQRRVPDDGPFDELVAPASHALLVGHDASPSLRLARLRPALDA
QEIHGAAASFT
>MT0124 hypothetical protein
MTTMIMTFVVPQRVTRATKGRARSLLRVSRRLTDTFRAPLAWTPQERADR
YVARMPIAVIAD
>MT2803.1 hypothetical protein
MADHGFAAHRLEEVGHAHEVVASSPTLWLACL
>MT3671.2 hypothetical protein
MSHVRYVAWLGQLSSRMANMQTLTSRSSIVADQFGEGFLVVQVDTGLQAS
GAVGRQ
>MT2875 hypothetical protein
MEKSVSNVLDAISTEHRPVIEQELENRNPALFDELRRTEKPTNEQSDAVI
DVLSDALMKTFGPDWVPNDYGLKIERAIDAYLETWPIYR
>MT1123 PE_PGRS family protein
MSFVIAAPEALVAVASDLAGIGSALAEANAAALAPTTALLAAGADEVSAA
IAALFGAHGQAYQTVSAQASAFHAQFVQALTGGGGAYAAAEAANVSAAQS
TDQRLLDLINGPTQALLGRPLIGDGANGGPGQDGGPGGLLYGNGGNGGTS
TTAGVAGGNGGAAGLIGNGGAGGGGGAGAAGGNGGAGGWLYGNGGAGGAG
GTSVIPGVAGGNGGAGGSAGLWGTGGAGGDGGNGRSGPVNVAGSAGGNGG
AGGAAGLFGDAGAGGNGGKGGAGGAAFSINFTAGDGGAGGAGGSGGHALL
WGAGGAGGNGGSGGTGGAGGSTAGAGGNGGAGGGGGTGGLLFGNGGAGGQ
GATAGAGGAGANGVSSTNGGGTGGNGGIGGTGGSGGAGGNAGLLGVGGAG
GHGASGGAGDRGGAGGTGFISSDGGAGGDGGDGGNGGAGGTGGLLFGAGG
NGGPGGSGGAADIGGNGGAGNGGGTDGNGGNGGSGGGAGSGGDGGGAGGN
GAWLFGNGGAGGGGGKGGNGAGGGLGGGSFGLPGLNGSGGDGGDGGNGAP
GGVLYGNGGAGGQGSSGGIGGPGATGGAGGKGGDGGDAQLIGDGGNGGNG
GAGGTGGTPGPGGPGGSGGLGGLLFGQTGTAGVSP
>MT1684 PE family protein
MSFLTVAPDMVTAAAGNLESVGSALNEAAAAAAPATVGLAAPAADRVSAV
VAAMLGAYARDFQGISAQIAGFHNQFVGALRGGAAAYASAEAANVQQTVV
NAVNAPAQALLGHPLIGPETVGSSAAAVSFGFGPLLLAGSDPLLAVPFSY
PASLPTPFGPVTMTLNGSFDPLTQQVVFDSGSLTAPAPFVYGLGAVGPAL
TTMTALQNSGTAFSGAVQSGNLLGAAGALLQAPGNAVTGFLFGQTAISQS
IPGPSNLGYESVGISVPVGGLLAPLQPVTVTLTPTSGMPTAIQLSGTQFG
GLLPALLNGF
>MT2812 PE_PGRS family protein
MSFVIAAPEFLTAAAMDLASIGSTVSAASAAASAPTVAILAAGADEVSIA
VAALFGMHGQAYQALSVQASAFHQQFVQALTAGAYSYASAEAAAVTPLQQ
LVDVINAPFRSALGRPLIGNGANGKPGTGQDGGAGGLLYGSGGNGGSGLA
GSGQKGGNGGAAGLFGNGGAGGAGASNQAGNGGAGGNGGAGGLIWGTAGT
GGNGGFTTFLDAAGGAGGAGGAGGLFGAGGAGGVGGAALGGGAQAAGGNG
GAGGVGGLFGAGGAGGAGGTVFGSGGAGGAGGVATVAGHGGHGGNAGLLY
GTGGAGGAGGFGGFGGDGGDGGIGGLVGSGGAGGSGGTGTLSGGRGGAGG
NAGTFYGSGGAGGAGGESDNGDGGNGGVGGKAGLVGEGGNGGDGGATIAG
KGGSGGNGGNAWLTGQGGNGGNAAFGKAGTGSVGVGGAGGLLEGQNGENG
LLPS
>MT2392 hypothetical protein
MRRQRSAVPILALLALLALLALIVGLGASGCAWKPPTTRPSPPNTCKDSD
GPTADTVRQAIAAVPIVVPGSKWVEITRGHTRNCRLHWVQIIPTIASQST
PQQLLFFDRNIPLGSPTRNPKPYITVLPAGDDTVTVQYQWQIGSDQECCP
TGIGTVRFHIGSDGKLEALGSIPHQ
>MT2726 hypothetical protein
MARSVNDYRRELERWVRQQQRVQDQFRMGVQRAIEREIRRRYEAWRRQAE
QARKAAEDELRKRGRERRPLDKLPPGPIPGTGGQPLQPFKPSR
>MT2343 hypothetical protein
MKLLSPLDQMFARMEAPRTPMHIGAFAVFDLPKGAPRRFIRDLYEAISQL
AFLPFPFDSVIAGGASMAYWRQVQPDPSYHVRLSALPYPGTGRDLGALVE
RLHSTPLDMAKPLWELHLIEGLTGRQFAMYFKAHHCAVDGLGGVNLIKSW
LTTDPEAPPGSGKPEPFGDDYDLASVLAAATTKRAVEGVSAVSELAGRLS
SMVLGANSSVRAALTTPRTPFNTRVNRHRRLAVQVLKLPRLKAVAHATDC
TVNDVILASVGGACRRYLQELGDLPTNTLTASVPVGFERDADTVNAASGF
VAPLGTSIEDPVARLTTISASTTRGKAELLAMSPNALQHYSVFGLLPIAV
GQKTGALGVIPPLFNFTVSNVVLSKDPLYLSGAKLDVIVPMSFLCDGYGL
NVTLVGYTDKVVLGFLGCRDTLPHLQRLAQYTGAAFEELETAALP
>MT1828 hypothetical protein
MGKDRGGIGTREFDRSGGDMRVSLFLSDAAQADAQSGKVHALGLGWRQCQ
TPTPPFALVLFLDIDWDETNKQHQLKCQLLTADGDPVVVPGPHGPQRILF
EAAAEAGRAPGAIHGTSVRMPLTLNIPAGIPLEPGIYEWRVEVEGYERAT
AVEAFIVAGGGHPPASCG
>MT2246 hypothetical protein
MSQAERNDPMARSGPDTSGGTAPGPGAPVVIDCDDCAARGSGCRDCVVSV
LIGVPESLSHDERAALEVLADVGLAPRLRLVPVRRQRGSGVA
>MT0376 conserved hypothetical protein
MTSMGDLLGPEPILLPGDSDAEAELLANESPSIVAAAHPSASVAWAVLAE
GALADDKTVTAYAYARTGYHRGLDQLRRHGWKGFGPVPYSHQPNRGFLRC
VAALARAAAAIGETDEYGRCLDLLDDCDPAARPALGL
>MT3978 conserved hypothetical protein
MASGSGLCKTTSNFIWGQLLLLGEGIPDPGDIFNTGSSLFKQISDKMGLA
IPGTNWIGQAAEAYLNQNIAQQLRAQVMGDLDKLTGNMISNQAKYVSDTR
DVLRAMKKMIDGVYKVCKGLEKIPLLGHLWSWELAIPMSGIAMAVVGGAL
LYLTIMTLMNATNLRGILGRLIEMLTTLPKFPGLPGLPSLPDIIDGLWPP
KLPDIPIPGLPDIPGLPDFKWPPTPGSPLFPDLPSFPGFPGFPEFPAIPG
FPALPGLPSIPNLFPGLPGLGDLLPGVGDLGKLPTWTELAALPDFLGGFA
GLPSLGFGNLLSFASLPTVGQVTATMGQLQQLVAAGGGPSQLASMGSQQA
QLISSQAQQGGQQHATLVSDKKEDEEGVAEAERAPIDAGTAASQRGQEGT
VL
>MT3413 hypothetical protein
MLARSLSYRHRMYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDY
YWCPGQPFDPAWGPNWDPYTCHDDFHRDSDGPDHSRDYPGPILEGPVLDD
PGAAPPPPAAGGGA
>MT3977 hypothetical protein
MAGERKVCPPSRLVPANKGSTQMSKAGSTVGPAPLVACSGGTSDVIEPRR
GVAIIGHSCRVGTQIDDSRISQTHLRAVSDDGRWRIVGNIPRGMFVGGRR
GSSVTVSDKTLIRFGDPPGGKALTFEVVRPSDSAAQHGRVQPSADLSDDP
AHNAAPVAPDPGVVRAGAAAAARRRELDISQRSLAADGIINAGALIAFEK
GRSWPRERTRAKLEEVLQWPAGTIARIRRGEPTEPATNPDASPGLRPADG
PASLIAQAVTAAVDGCSLAIAALPATEDPEFTERAAPILADLRQLEAIAV
QATRISRITPELIKALGAVRRHHDELMRLGATAPGATLAQRLYAARRRAN
LSTLETAQAAGVAEEMIVGAEAEEELPAEATEAIEALIRQIN
>MT1160 conserved hypothetical protein
MCSTREEITEAFASLATALSRVLGLTFDALTTPERLALLEHCETARRQLP
SVEHTLINQIGEQSTEEELGGKLGLTLADRLRITRSEAKRRVAEAADLGQ
RRALTGEPLPPLLTATAKAQRHGLIGDGHVEVIRAFVHRLPSWVDLKTLE
KAERDLAKQATQYRPDQLAKLAARIMDCLNPDGDYTDEDRARRRGLTLGK
QDVDGMSRLSGYVTPELRATIEAVWAKLAAPGMCNPEQKAPCVNGAPSKE
QARRDTRSCPQRNHDALNAGLRSLLTSGNLGQHNGLPASIIVTTTLKDLE
AAAGAGLTGGGTILPISDVIRLARHANHYLAIFDRGKALALYHTKRLASP
AQRIMLYAKDSGCSAPGCDVPGYYCEVHHVTPYAQCRNTDVNDLTLGCGG
HHPLAERGWTTRKNAHGDTE
>MT2934 hypothetical protein
MPPHWEHRNPEVLFRSFRSRPADRSQSWLPSDGRAPVGELLSAVDVRAVI
DAQDHYRCVLVVNPVQQAVRSATRAERAGQLAPKGLAHPQGLARQIAERE
FDHCREDSRWQLVEVSTRGCGEPHGVRHRSVGASRDAEFGADLVFAVGAA
GGNVGVGFSDRLPDSGLRQPVQRLLQRFPLVGADQNGCGCTVLGDGDLVL
CRRDRVDELIELALDRRNRQYPHTAILGRYTGPAQP
>MT3207 hypothetical protein
MHHILAMLVRFVPAPVRGAFGIVGVGRLPTSAEADGERGRVVDIWR
>MT0488 hypothetical protein
MPDAGAGSRLRSWAYALRTTNPPADGPTDTVTRWLVVTRAAVLPMTLVSG
LVAGLLAIGEPGLDWRWLVLWWESHAPHIANNLMNDLYDTDVGTDSATYA
RARYAQHPAATGANRAAYTTPRRTTSCGSPERALEPTTPRWARAVGRSCW
RRSPTGCCAPRC
>MT1993 hypothetical protein
MKTARLQVTLRCAVDLINSSSDQCFARIEHVASDQADPRPGVWHSSGMNR
IRLSTTVDAALLTSARDMRAGITDAALIDEALAALLARHRSAEVDASYAA
YDKHPVDEPDEWGDLASWRRAAGDS
>MT3984 hypothetical protein
MTRAGDDAKRSDEEERRQRPAPATMQSAAMRRSGAHDC
>MT2477 conserved hypothetical protein
MTTAVPSPPAEIAAGRPVTSTSCPTAARARRLVYAPDLDGRADPGEIVWT
WVAYEQDPTRGKDRPVLVVGRDRSVLLGLLVSSQERHAADRDWVGIGSGA
WDYEGRESWVRLDRVLDVPEESIRREGAILEREVFDVVAARLRADYAWR
>MT1854.1 hypothetical protein
MASASGQTRTNAMGLLLTDDRTRRPMTASVVATSRERHSHKAAKQRACEI
TDFEPEGRFRVRKRRRGRIGTKRSSISDTDYRRDSFRSHLLTAGAHGDAD
AQHKGMTAQQTTELGTPLVRALAPHGVSGRSSRKPLGLNP
>MT3573.10 hypothetical protein
MVPGAPTGGDDATRRRATRCGTPRFAHWILVCETA
>MT1399 hypothetical protein
MLIAGYLTDWRIMTTAQLRPIAPQKLHFSENLSVWVSDAQCRLVVSQPAL
DPTLWNTYLQGALRAYSKHGVECTLDLDAISDGSDTQLFFAAIDIGGDVV
GGARVIGPLRSADDSHAVVEWAGNPGLSAVRKMINDRAPFGVVEVKSGWV
NSDAQRSDAIAAALARALPLSMSLLGVQFVMGTAAAHALDRWRSSGGVIA
ARIPAAAYPDERYRTKMIWWDRRTLANHAEPKQLSRMLVESRKLLRDVEA
LSATTAATAGAEQ
>MT0298 PE family protein
MTLRVVPEGLAAASAAVEALTARLAAAHASAAPVITAVVPPAADPVSLQT
AAGFSAQGVEHAVVTAEGVEELGRAGVGVGESGASYLAGDAAAAATYGVV
GG
>MT0740.1 hypothetical protein
MTPDQLSDDSGGDLDDRGRYRQSGIRVDHDRHGLICPGGADSH
>MT3095 vitamin-B12 independent methionine synthase family protein
MSVFATATGIGSWPGTAAREAAQVVVGELAGALAYLTELPARGVGADMLG
RAGGLLVDVAIDTVPRGYRIAARPGAVTRRAASLLDEDMDALEEAWETAG
LRGCGRAVKVQAPGPVTLVAGLELANGHRAITDPGAVRDLAASLAEGVAA
HRAALARRLDTPVVVQFDEPSLPAALGGRLTGVTALSPVAPLDETVAEAL
LDTCIAAVDADVALHSCSPDLPWDLLQRSRISAVSVDASTLQAADLDAVA
AFVESGRTVVLGLVPVTAPERAPSMEEVAAAAVAVTDRLGVPRSALRDRL
GVSPACGLANATGQWARTAVGLARDVAEAFARDPEAI
>MT2950 hypothetical protein
MVGAGDDAERSDEEERRRWLAPATMQSAAMRRSGADD
>MT0269 PPE family protein
MTAPIWMASPPEVHSALLSSGPGPGPLLVSAEGWHSLSIAYAETADELAA
LLAAVQAGTWDGPTAAVYVAAHTPYLAWLVQASANSAAMATRQETAATAY
GTALAAMPTLAELGANHALHGVLMATNFFGINTIPIALNESDYARMWIQA
ATTMASYQAVSTAAVAAAPQTTPAPQIVKANAPTAASDEPNQVQEWLQWL
QKIGYTDFYNNVIQPFINWLTNLPFLQAMFSGFDPWLPSLGNPLTFLSPA
NIAFALGYPMDIGSYVAFLSQTFAFIGADLAAAFASGNPATIAFTLMFTT
VEAIGTIITDTIALVKTLLEQTLALLPAALPLLAAPLAPLTLAPASAAGG
FAGLSGLAGLVGIPPSAPPVIPPVAAIAPSIPTPTPTPAPAPAPTAVTAP
TPPLGPPPPPVTAPPPVTGAGIQSFGYLVGDLNSAAQARKAVGTGVRKKT
PEPDSAEAPASAAAPEEQVQPQRRRRPKIKQLGRGYEYLDLDPETGHDPT
GSPQGAGTLGFAGTTHKASPGQVAGLITLPNDAFGGSPRTPMMPGTWDTD
SATRVE
>MT2634 hypothetical protein
MTGGATGALPRTMKEGWIVYARSTTIQAQSECIDTGIAHVRDVVMPALQG
MDGCIGVSLLVDRQSGRCIATSAWETAEAMHASREQVTPIRDRCAEMFGG
TPAVEEWEIAAMHRDHRSAEGACVRATWVKVPADQVDQGIEYYKSSVLPQ
IEGLDGFCSASLLVDRTSGRAVSSATFDSFDAMERNRDQSNALKATSLRE
AGGEELDECEFELALAHLRVPELV
>MT0910.1 hypothetical protein
MLGQSGARTGSRILTPGAMENHRGAVIQSPDTWPPPSGVASKPNLNRRCV
Q
>MT3248 PPE family protein
MAEGGVLLTGTRIFTKSPLFVAPFSYSLFYEYRDVHRCLSHWPGRPVGEG
AGLMNYSVLPPEINSLRMFTGAGSAPMLAASVAWDGLAAELAVAASSFGS
VTSGLAGQSWQGAAAAAMAAAAAPYAGWLAAAAARAAGASAQAKAVASAF
EAARAATVHPMLVAANRNAFVQLVLSNLFGQNAPAIAAAEAMYEQMWAAD
VAAMVGYHGGASAAAAALPSWQQALRGLPGLGQVASAISGGAASMFAAPA
AATAAVTPPALNTGLGNIGSWNLGGGNVGLLNLGSGNFGSLNLGGGNTGN
ANLGGGNWGFANLGSGNIGNTNFGNGNQGNLNFGSGNLLGNGNFGFGNAF
GDGNLGSGNVGSTNLGSGNFGSFNVGSGNMGMSNIGFGNLGNNNLGFGNN
GNNNIGFGLTGDNLVGIGALNSGIGNMGFGNSGNNNIGFFNSGNGNVGFF
NSGDGNTGFGNAGDVNTGFWNGGPFNTGFGNGGNTNFGFGNAGFQNMGHG
NAGGVNVGSGNAGLANTGDFNSGGVVSGIGGNTGSFNSGNLNTGFGNAGD
LNTGLFNSGDVNTGIGSTVDQPGSVSGFGNTGTSVSGFNNSGNLTSGFGN
MNSNVFDSTSGFQNIGDANVGFFNSGNSNEGFFNTGMFNNGIYNSGVAST
GIANSGNASSGVANSGDNSSGAFNQGDNQAGFFGQP
>MT3341 hypothetical protein
MRRRLLMSPRVPRLRWDDPFRALDMLASLWSSTGMSLVSAGAAQAVAAPY
RTLFTTLQQLLIGKEVTVRIGDHDVVLTVTELDSALEPQGLAVGQLGEVR
VAARGISWDQHHLHSAVAVLRNVHIRPGVPPLVIAAPVELSSALPTEIFD
DVLRQATPQLRGELSESGAARLRWARRPDWGGLEVDVDVAGTTSQTTLWL
RPRTVITGQRRWTLPARTPAYRVPLPELPHGLRITDVSLAADCLQLSALL
PEWRTELPLRYLESVITQLSQGALSFVWPPLRSGAD
>MT2941 hypothetical protein
MTVTPRPAQADPRSMPAEVASRELRNNTAGLLRRVQAGEDITITANGKPV
ALLTAGSPHGADG
>MT0296 conserved hypothetical protein
MTNQQHDHDFDHDRRSFASRTPVNNNPDKVVYRRGFVTRHQVTGWRFVMR
RIAAGIALHDTRMLVDPLRTQSRAVLMGVLIVITGLIGSFVFSLIRPNGQ
AGSNAVLADRSTAALYVRVGEQLHPVLNLTSARLIVGRPVSPTTVKSTEL
DQFPRGNLIGIPGAPERMVQNTSTDANWTVCDGLNAPSRGGADGVGVTVI
AGPLEDTGARAAALGPGQAVLVDSGAGTWLLWDGKRSPIDLADHAVTSGL
GLGADVPAPRIIASGLFNAIPEAPPLTAPIIPDAGNPASFGVPAPIGAVV
SSYALKDSGKTISDTVQYYAVLPDGLQQISPVLAAILRNNNSYGLQQPPR
LGADEVAKLPVSRVLDTRRYPSEPVSLVDVTRDPVTCAYWSKPVGAATSS
LTLLAGSALPVPDAVHTVELVGAGNGGVATRVALAAGTGYFTQTVGGGPD
APGAGSLFWVSDTGVRYGIDNEPQGVAGGGKAVEALGLNPPPVPIPWSVL
SLFVPGPTLSRADALLAHDTLVPDSRPARPVSAEGGYR
>MT3690 conserved hypothetical protein
MHRRIWDAPENGVPGLPRRSGRRRGPHRYRRRHLTTTVHSRAAGTMPRQG
RAGGIRVNRCNIRLRLAGMTTWVASIALLAAALSGCGAGQISQTANQKPA
VNGNRLTINNVLLRDIRIQAVQTSDFIQPGKAVDLVLVAVNQSPDVSDRL
VGITSDIGSVTVAGDARLPASGMLFVGTPDGQIVAPGPLPSNQAAKATVN
LTKPIANGLTYNFTFKFEKAGQGSVMVPISAGLATPHE
>MT0933 hypothetical protein
MGILDKVKNLLSQNADKVETVINKAGEFVDEQTQGNYSDAIHKLHDAASN
VVGMSDQQS
>MT2115 hypothetical protein
MPMTRIRQAGELTEAMLAKPMSARRTRQATKRAPRAFFAS
>MT2593.2 hypothetical protein
MKPSRSAIMRPAGASSREAVIIGAPTHWLHRSNIRMSPAKTFTMSRLAIT
LDTSQVLGRRCEQRNHQDREVGAIATMDG
>MT0598 hypothetical protein
MKRAGPAAIGPSIRVRTAEVSHHRSAAEFASADASPGFAE
>MT0855 PE_PGRS family protein
MSFVIAAPDLVAMATEDLAGIGASLTAANAAAAVPTSGLLAAAGDEVSAA
IAALFSSHGQQYQAMSAQAAAFHARFVQALAGAMGAYAAAEAANASPLQT
LEQGLLGAINAPAAALSGRPFIGNGTNGAPGTGEAGGPGGWLLGNGGNGG
SGAPGQTGGAGGAAGLLGHGGTGGAGGTGASGGKGGTGGWLWGSGGAGGA
GGSGGGSGGAGGNALMFGIGGNGGAGGAASGVGNGGVGGAGGAGGALVAI
GGAGGAGGAATTGTGGAGGAGSNALGLFLGLGGSGGQGGDSAMGSGGAGG
AGGSGGAASPFGIDIGIGGAGGHGGAGTNGGAGGAGGAGGSSGTVFALDL
SWGGAGGNGGAATTGTGGAGGTGGFAVAPDFIGFGAAYGGAGGLGGAATG
AGGTGGTGGVGAGGFAALGVGVGGAGGAGGAATETGGIGGAGGLGVGLLG
GAGGAGGPGGAASAGSGGHGGTGGDALGLIGAGIGGVGGVGGAATDTGGN
GGAGGSGTGLLGGVGGAGGHGGGASVGTGGSGGAGGDGFGFVGAGGNGGN
AGTGVGVNGANGGNGGSATGALAAVGGAGAAGGDATSGTGGFGGAGGSAR
GLIFALGGAGAAGGDASTGVGGPGGPGGTGTASSPFGIAIAIGGAGAQGG
AGTSGATGGAGGDGVFEGIAVLGLGFGGAAGAGGAATGDGATGGAGGFGG
AGAGIANFLGFSVLHGGAGGAGGTATGTGGNGGAGGGGGLSSPVILGIGI
GGAGGDGGGALGVLGGMGGDGGEAVAVGIAVGGAGGAGGAAPTGNGGAGG
GGGDALGLVGVGGNGGNAGTGFGANTGGNGGDTTIVVNGMLAPSTLGYGG
NGGNGVNGGAGGTGGKAGVFGAPGQNGLP
>MT3951 hypothetical protein
MIQVCSQCGTGWNVRERQRVWCPRCRGMLLAPLADMPAEARWRTPARPQV
PTASDTRRTPPRLPPGFRWIAVRPGAAPPPRHGPRLRGPTPRYAGIPRWG
LTDHVDQAPVPASAKAGPSPAAVRTTLLVSLLVFSIAVVVFVVRYVLLVI
NRNTLLNSVVASASVWLGVLVSLAAIAAAGTTIVLLVRWLVARRAAAFMH
QGLPERRSARELWAGCLLPMVNLLWAPLYVIELALVEDRYTRLRRPIVVW
WIVWIVSNAISMFAFATSWVTDAQGIANNTTMMVLAYLCAAAAVAAAARV
FEGFEQKPVERPAHRWVVVNTDGRSAPASSVAVELDGQEPAA
>MT1627 hypothetical protein
MRSAHTGANSDLAHNLVTPDLNQFDDLPLESKR
>MT0350 hypothetical protein
MPSPEAIAHFDERFECHAPRTTRVSAAFIDRICSATRAENRAAAAQLVAL
GELFAYRWSRCGGREEWVMDTMAAVAAEVAAALRISQGLAASRLRYARAM
RERLPKTAEVFSAGDIGYLMFATIVYRTDLIVDPDVLAAVDAQLAANVAR
WPSMTKARLAGQVDKIVARADADAVRRRKEYQAQRQFWVGESQDGVCQIG
GSLLAVDAHALDARLSALAGTVCEHDPRSREQRRADALGALAGGADRLGC
GCGRADCAAGKRPAAPPVVIHLIAEAATINGTGSAPASQMNADGLITAEL
VAELAKTATLVPLVHPGDAPPEPGYAPSKALADFVRCRDLTCRWPGCDEP
ATNCDLDHTIPYAAGGPTHASNLKCYCRTHHLVKTFWGWRDQQLPDGTLI
LTSPSGHTYVSTPGSALLFPSLCHFSGGIPAPEADPPYDHCDQRTAMMPK
RRRTRAQDRAYRIATERRQNHAARQRAQVLTQTAAATDTHGPPPDPNDDP
PPF
>MT2702 hypothetical protein
MASSASDGTHERSAFRLSPPVLSGAMGPFMHTGLYVAQSWRDYLGQQPDK
LPIARPTIALAAQAFRDEIVLLGLKARRPVSNHRVFERISQEVAAGLEFY
GNRGWLEKPSGFFAQPPPLTEVAVRKVKDRRRSFYRIFFDSGFTPHPGEP
GSQRWLSYTANNREYALLLRHPEPRPWLVCVHGTEMGRAPLDLAVFRAWK
LHDELGLNIVMPVLPMHGPRGQGLPKGAVFPGEDVLDDVHGTAQAVWDIR
RLLSWIRSQEEESLIGLNGLSLGGYIASLVASLEEGLACAILGVPVADLI
ELLGRHCGLRHKDPRRHTVKMAEPIGRMISPLSLTPLVPMPGRFIYAGIA
DRLVHPREQVTRLWEHWGKPEIVWYPGGHTGFFQSRPVRRFVQAALEQSG
LLDAPRTQRDRSA
>MT2909 conserved hypothetical protein
MLRAAPVINRLTNRPISRRGVLAGGAALAALGVVSACGESAPKAPAVEEL
RSPLDQARHDGALAAAAATAIGIPPQVAAALTVVATQRTSHARALATEIA
RAAGKLVSATSETSSSSPSPTDPAAPPPAVSDVIDSLRTSAGEASRLVAT
TSGYRAGLLASIAASCTASYTVALVPSGPSI
>MT3762 hypothetical protein
MIQKTKSGRRPEELESPPTNTVLGYQACTPGVAWVASAILRRF
>MT1650 hypothetical protein
MTGNFVGASFRLRQRPPDRRSIESSTGDQAVPSWVVIVDAVEAATNGGVS
S
>MT1884 conserved hypothetical protein
MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSE
GHYSAVGGYSASGSEDYPDIPPRPDWEPTGAEPIAAAPPPLFRFGHRGPG
DWQAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTA
AARCVGGKDTVAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAG
SDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGSQAISDSRSLV
ISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMP
SSGNGDAAYLAGEAVAAASAPAGAPATAGIGAVRTLMGARPKLADDSLTA
AMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAA
VADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKP
PSSPVTSFPALPSTLSVGDDSMRATLADTMVTASAGVAATIMLDQSMPND
EGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPV
NGQPRPAALTAALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLV
ITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGADPDRATWEAVA
QLSGGSYQNLETSASPDLATAVNIFLS
>MT1330.1 hypothetical protein
MTLVEGVADGVRVLDGARALGGVIPSACSCARNASSVELALAVPAKPSAA
RPKVAAVKVVPTMEAAKRRVNIDDLLVLMSSEVALVCVPSESAPGLDRFS
INPWQYRYRFDGVPTVQGERSAMAVPERVRRMRQHGTCTRRTGSPATRHA
GRPGSYPDEFESETTKPTMRPPSRVAPITLL
>MT1143.1 hypothetical protein
MRTTVTVDDALLAKAAELTGVKEKSTLLREGLQTLVRVESARRLAALGGT
DPQATAAPRRRTSPR
>MT2138.2 hypothetical protein
MARFLAHRDQRLPVSVQVSHLLCVGRADRVVKSLGDRKCTEHPRQQGSPR
QLISTTVGVLTGVVRLVCPRMQKSNIDKHRYSLRTWCKATVVDLTIKSPC
GRLEGITGDHGHAPHRAPPRWRHWIQCFDDRGERSGAAVLMMRTTGP
>MT3743.1 hypothetical protein
MFVQATELQKVKRRFRNVRATRRNTELEGTRSTAATRADQNDYARGKITA
AELGERVRRRYNIQ
>MT0034 hypothetical protein
MVSGSDSRSEPSQLSDRDLVESVLRDLSEAADKWEALVTQAETVTYSVDL
GDVRAVANSDGRLLELTLHPGVMTGYAHGELADRVNLAITALRDEVEAEN
RARYGGRLQ
>MT3330 hypothetical protein
MVLTVIAGALGNWLMSRGEAVAPTATVRAMAPLSVYADDQLDSTGPGQAI
SQVTPFLVDLPVGEGNAVVRLSQIAHATESNPTAASLVDARTIVTLSGLA
PATLHAMGVRVATSFSARLFNLLITNAPGTQSQMYIAGTKLLETYSVPPL
LHNQALAISVTSYNGMLYFGINADRDAMSDVDLLPGLLSQALDELLEASR
>MT0441 hypothetical protein
MAGLHFESAEQKSRQRCECRVRVNECRTVAKPFVNECSLGQGVIMSVVGG
TVRTVGRTVSGAATATTAAAGAVGGAAVSGIVGGVTGAAKGIQKGLSSGS
KSTAAAALAIGAIGVAGLVDWPILLAVGGGALLLRKLNRTPEVAAPPVKA
KLAPVPDKPAAAKEAPAKASKTTARKTSGRRAGTAELRSTN
>MT0725 hypothetical protein
MAQRCPNAVRRVDPFHVVAWATEALEAERRRA
>MT1164 hypothetical protein
MRARAAGVTGHLLVLWAMGFLQPRLPDIDLAEWSQGSRSQKIRPMAQHWA
EVGFGTPVLLHLFYVAKILLYVLVGWLIVLTTKGIDGFTDAAAWYAEPIV
FEKVVLYTMLFEVIGLGCGFGPLNNRFFPPMGSILYWMRFGTIRLPPWPD
RVPWTRGTKRKPVDVALYALLVMMLLSALFTDGAGPIPELGTTVGLLPAW
QIVLILLLLGVLGLRDKVIFLAARGEVYATLTVTFLFGRLNGIDMIVAAK
LVFLVIWIGAATSKLNRHFPFVISTMMSNNPLFRPRFIKRMFFKKFPGDL
RPGLLSRIVAHVSTVIEMCVPVVLFVAHGGWPTVVAATIMVCFHLGILTA
IPMGVPLEWNVFMIFGVLSLFVGHACLGLADVKNPVPLAILIAVVAGIVI
AGNVFPRKISFLAAMRYYAGNWDTTLWCIKPSAEDKINRGIVAIASMPAA
QLERFYGKDRAQIPMYLGYAFRAMNSHGRALFTLAHRAMAGHDEDDYVIT
DGERVCSTAVGWNFGDGHLHNEQLIAAMQQRCGFQPGEVRVVLLDAQPIH
RQTQEYRLVDAATGEFERGYVRVADMVNRQPWDDDVPVHVLPG
>MT0405 hypothetical protein
MDWMPLGDYETFRHWSGKPRAWGPQESGWRAWFGGKIVDGLCEVLDEHLA
VRRRGVPAAIGCVPWLSSEAVAETLLALSAFCVVIDKGTSFPSRLRNPDK
GFPNVALLRLRDMAPSEHGSRCSSARGRLCLSMS
>MT3258 hypothetical protein
MGMRVHAHRCAASALRRDREARRCGVAVLSRRLDEASSWRGALMPQMLGP
LDEYPLHQLPQPIAWPGSSDRNFYDRSYFNAHDRTGNIFLITGIGYYPNL
GVKDAFVLIRRADIQTAVHLSDAIDSDRLHQHVNGYRVEVVEPLRKLRIV
LDETEGVAADLTWEGLFDVVQEQPHVLRSGNRVTLDAQRFAQLGTWSGRI
VVDGERIAVDPATWLGSRDRSWGIRPVGEPEPAGRPADPPFEGMWWLYVP
LAFDDFAVVLIIQEEPDGFRSLNDCTRIWRDGHVEQLGWPRVRIHYRSGT
RIPTGATIEASTPDGAPVHFDVESKLAVPTHVGGGYGGDSDWSHGMWKGE
KFVERRTYDMTDPTIIARAGFGVIDHVGRALCRDGDGNPVQGWGLFEHGA
LGRHDPSGFADWSTLAP
>MT2960 hypothetical protein
MDWLVRRPGPLSAISDLRLAIQDQFVTASRTTAYSGGARNHAVIPAG
>MT1157.1 hypothetical protein
MRTLLRRRSQLAGPTVTVTVLAAVSTGLLGLLGGDVDTLGAEVPMAKPGV
PRSYNHFGNVVVGLYPRLEPDERVRRIATDLANARRRFEHPAMLSADRAF
AAVPAALLRWGVSQFDAEVRPVRVAGNTVVSSVYRGAADLSFGDAPVVLT
AGYPALSPAMGLTHGVHGIGDTVAISVHAAESAVSDIDAYMRLLDAALQ
>MT2080 conserved hypothetical protein
MVQRYPFRMVQRTPAMTSVAQLEHYLEEHLTKELAWLLRAATEWHAQHCM
NLGIDGYSMQVYALDSTVLHARTLFEFFTQNTSVGQNANYYNCTVYKVPL
IGSILYQFHWRRPIHSHMMHAQDRRPVTQLPTYDDHAQTKPLNEMPVDFA
KEIVRLWRVFVKDLNNHTNLQFRPIGATAQTALASEINAAKRVRTNDVTQ
RQIAVGKETSRLEPNFSIPQIEWPA
>MT1935 hypothetical protein
MRCENLDTVLGLSITPTTLGWVLAEGHGADGAILDRNELELHSGRNAQAI
HTAEQLAAEVLLAHEVAAAGDHRLRVIGVTWNAEASAQAALLVESLTGAG
FDNVVPVRRLRAIETLAQAIAPVIGYEQIAVCVLEHESATVVMVDTHDGK
TQIAVKHVCRGLSGLTSWLTGMFGRDAWRPAGVVVVGSDSEVSEFSWQLE
RVLPVPVFAQTMAQVTVARGAALAAAQSTEFTDAQLVADSVSQPTVAPRR
SRHYAGAAAALAAAAVTFVASLSLAVGIQLAPHNDTGTAKHGAHKPTPRI
AKAVAPAVPPPPTVTPPVPARAPRPAAQHEPPARVTSGEALTEPNPPEEQ
PNASAPQQDRNDSQPITRVLEHIPGAYGDSAPPAE
>MT3161 hypothetical protein
MVLDGVVSDTRRSRTIAARQQTIWDVLADFGSLSSWVEGVDHSCVLNHGP
DGGALGSTRRVQVGRNTLVERVIEFDPPTTLAYRIEGLPARLRKVTNRWT
LRPADPVGAVTVVTLTSTIEIGGNPLARLAELVVGRAMAKRSNTMLAGLA
QRLEDKHG
>MT1762 hypothetical protein
MSAMVQIRNVPDELLHELKARAAAQRMSLSDFLLARLAEIAEEPALDDVL
DRLAALPRRDLGASAAELVDEARSE
>MT2370.1 hypothetical protein
MTAVPATANIGILVSTSRRSSVVRCHRANGYCAVNGKDFAVGPGTAKGIG
T
>MT2808 conserved hypothetical protein
MLAGVRLTEFHERVALHFGAAYGSSVLLDHVLTGFDGRSAAQAIEDGVEP
RDVWRALCADFDVPHDRW
>MT0879 conserved hypothetical protein
MSSTRLPEVVMEALADVGVLASWSPLHKQVEVIDYYPDGRPHHVRATVKI
LGLVDKEVLEYHWGPDWVCWDADQTFQQHGQHIEYTVKPEGVDRARVRFD
ITVEPAGPIPGFIVKRASEHVLDAAAKGLQKLIAGAGDQGNAKS
>MT3078 hypothetical protein
MQRMEGTRMAGAKHAGRIVAITTAAAVILAACSSGSKGGAGSGHAGKARS
AVTTTDADWKPVADALGRSGKLGDNNTAYRINLPRNDLHITSYGVDIKPG
LSLGGYAAFARYDNNETLLMGDLVITEEELPKVTDALQAHGIAQTALHKH
LLQQDPPVWWTHIHGMGDAARLAQGLKAALDATTIGPPTPPPARQPPVDI
DVAGVDQALGRKGTQDGGLLKYSIPRKDTIIEDGHVLPAVSLNLTTVINF
QPVGRGRAAINGDFILIAPEVQEVIRAMRAGNITIVELHNHGLTEEPRLF
YMHYWAVDDAVTLARALRPAMDATNLQSS
>MT1065 hypothetical protein
MPHPTTLMKLTTRCGSAAIDGLNEALLAKAAEAKLLGTNRIRADTTVARA
NVSYPTDLGLLAKAMRRIAATGKRIQAAGGAVRTRVGDRSRAAGRRAHAV
AAKLRSRAELGRDEARAAVLRFTGELAELAQAAAQEAQQLLDNAKQAVLR
AKAKAAALAARGERDAVAGRRCGGLVRAVNDLTELLNATRQIVAQTRQRV
AGITSDGASRRVSLHDGDARPDHQGSAR
>MT1618 conserved hypothetical protein
MVTMTSWPSRLFAFTDNVCPPDACPLVPFGVNYYIYPVMWGGIGAAIATA
VIGPFVSMLKGWYMSFWPIISIAVITVTSIAGYAIAGFSERYWH
>MT2083 hypothetical protein
MLGDELAAQRGVLPSKEGAHETREASLCEGHLNLELAG
>MT2420 conserved hypothetical protein
MAMTINYQFGDVDAHGAMIRAQAAALEAEHQAIVRDVLAAGDFWGGAGSV
ACQEFITALGRNFAVIYQQANAHGQKIQAAGSNMAQTDSAVGSSWA
>MT3559 serine esterase, cutinase family
MRFIGVIPRPQPHSGRWRAGAARRLTSLVAAAFAAATLLLTPALAPPASA
GCPDAEVVFARGTGEPPGLGRVGQAFVSSLRQQTNKSIGTYGVNYPANGD
FLAAADGANDASDHIQQMASACRATRLVLGGYSQGAAVIDIVTAAPLPGL
GFTQPLPPAADDHIAAIALFGNPSGRAGGLMSALTPQFGSKTINLCNNGD
PICSDGNRWRAHLGYVPGMTNQAARFVASRI
>MT3273 hypothetical protein
MVTGHLPSKLHPKVLQRKVFAVRAGPSAQLAFVVSCMATAAPRW
>MT1924.1 hypothetical protein
MLMYVCLCVGVTNQTVCDAVARGASTSKEVAAVCGAGGDCGRCRRTLRAI
IAAARLNPTQLDPAGRTSDAVC
>MT3068 hypothetical protein
MCVTWAEMPKIAALIRHIEDLHARHGRSYILRAGISSLFRYIEGVHGERP
WGTVLDAGTGVKSLQWIQTLPTERWTAVTAARSLADKTRAALGSAMRPQD
RLLVGNWVDDSLLAGETFDTILVDYLVGAIEGFAPYWQDRVFERLRPHLA
DHGRLYLVGLEPYVQFEPETESGKIIWEIGRVRDACLLLAGERPYREFPL
DWMLGRLGLAGFRILEARRFPIRYRARYVNGQLNMCLARIERFSSNGLGM
AMRAYVEELRARALQLNERQDGLWHGNDYVIAVEPM
>MT3174.2 hypothetical protein
MRLSSIAFGAEYWLAPSGFTRAPQNSVDRFLFNVPGAVDALCVTQSRRRE
EGLHGEQLGPSADPGDGPAHHAIFGEHTLVSMEGTAPTGATPARPSAERR
>MT1556.1 hypothetical protein
MRGRSRLVRWPERPNTDTYPGGPAHQRLRIDQRLVADRDMVQDYES
>MT1941 conserved hypothetical protein
MIRELVTTAAITGAAIGGAPVAGADPQRYDGDVPGMNYDASLGAPCSSWE
RFIFGRGPSGQAEACHFPPPNQFPPAETGYWVISYPLYGVQQVGAPCPKP
QAAAQSPDGLPMLCLGARGWQPGWFTGAGFFPPEP
>MT3967 hypothetical protein
MPDPQDRPDSEPSDASTPPAKKLPAKKAAKKAPARKTPAKKAPAKKTPAK
GAKSAPPKPAEAPVSLQQRIETNGQLAAAAKDAAAQAKSTVEGANDALAR
NASVPAPSHSPVPLIVAVTLSLLALLLIRQLRRR
>MT2600.1 hypothetical protein
MSWVTPGSTAGAYVLTKRPACRWLRWSSYNCADQGSFVIKSLCWEFALRR
ARADAPTQRPRHWRRWPGVGNAVQRANECLL
>MT3205 hypothetical protein
MYSGCWINNQNGETRVGEDSLEDLEQRRARLYDQLAATGDFRRGSISENY
RRCGKPNCVCAQEGHPGHGPRYLWTRTVAGRGTKGRQLSVEEVDKVRAEL
ANYHRFAQVSEQIVAVNEAICEARPPNPAATAPPAGTTGHKKGGSATRSR
RSSPPR
>MT2072 hypothetical protein
MSADTGIFSGMFSEPYPTDGEVMTELGDKFLAALVGTIRDTRFDIADMRN
WRPGWFPTMHSRCLSNLIHDRIWAHLVTLIASNPGTSIKDKGATREIVVG
AHLRLRIKRHHAGDEISTYPTRTAIEFWQQGSQLAFPGLEEVRIAVGYRW
DPDTREIGAPLLSLRDGKDHVIWVVELDEPAAGVKITWTPIEPTLPSIDF
GDLGEDSGASGER
>MT3573.14 hypothetical protein
MGPMGYKPESERHSTKTDTAIGAALGISAGTYRRLKRIDNATHSDDKEIR
RFAEKQMAPLVAGSPSWNARKPRSANARVVASVHRSPMPALVPWNQSRLS
ATLTRR
>MT2595 PE family protein
MWTAGKRLVVSRLIVAPDWLASAAAEVQSIGSALSAANAAAAAPTTLLVA
AAEDEVSAAAAALFANYGREYQTLSVRFASLDQQFAQALNSAAASYQTAE
ATGASLVQTATQGVLGVINAPTEFMFGRSLIGDGADGTAASPIGEPGGIL
YGDGGNGYSQTTPGAVGGAGGSAGFIGNGGAGGAGGPGAGGGTGGLGGWL
WGNNGAAGTGDPVNVAVPLRVENNFPLVNLLVNRGPTVPILLDTGSSSLV
IPFWKIGWQNLGLPTGFDVVHYGNGVSIVYADVPTTVDFGGGAATTPTSV
HVGILPYPRNLDSLVLIASGGAFGPNGNGILGIGPNVGSYAVSGPGNVVT
TDLPGQLNEGTLIDIPGGYMQFGPNTGTPITSVTGAPITVLNVQIGGYDP
NGGYWSLPSIFDSGGNHGTLPAVILGTGQTTGYAPPGTVISISIHDNQTL
LYQYTTTASNSPVVTADPRLNTGLTPFLLGPVYISNNPSGVGTVVFNYPP
P
>MT3857 excisionase, putative
MTSLLEVLGAPEVSVCGNAGQPMTLPEPVRDALYNVVLALSQGKGISLVP
RHLKLTTQEAADLLNISRPTLVRLLEDGRIPFEKPGRHRRVSLDALLEYQ
QETRSNRRAALGELSRDALGELQAALAEKK
>MT2601.2 hypothetical protein
MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRR
RIVDHLAHAGTHVDADVLLSEQAWR
>MT0834 conserved hypothetical protein
MSSGAGSDATGAGGVHAAGSGDRAVAAAVERAKATAARNIPAFDDLPVPA
DTANLREGADLNNALLALLPLVGVWRGEGEGRGPDGDYRFGQQIVVSHDG
GDYLNWESRSWRLTATGDYQEPGLREAGFWRFVADPYDPSESQAIELLLA
HSAGYVELFYGRPRTQSSWELVTDALARSRSGVLVGGAKRLYGIVEGGDL
AYVEERVDADGGLVPHLSARLSRFVG
>MT2848 hypothetical protein
MPDPDGPSVTVTVEIDANPDLVYGLITDLPTLASLAEEVVAMQLRKGDDV
RKGAVFVGRNENGGRRWTTTCTVTDADPGRVFAFDVRSGIIPISRWQYGI
VATEHGCRVTESTWDRRPSWFRAVARMATGVKDRASVNTEHIRRTLQRLK
DRAEAG
>MT3342 hypothetical protein
MERLMRLTILLFLGAVLAGCASVPSTSAPQAIGTVERPVPSNLPKPSPGM
DPDVLLREFLKATADPANRHLAARQFLTESASNAWDDAGSALLIDHVVFV
ETRSAEKVSVTMRADILGSLSDVGVFETAEGQLPDPGPIELVKTSDGWRI
DRLPNGVFLDWQQFQETYKRNTLYFADPTGKTVVPDPRYVAVSDRDQLAT
ELVSKLLAGPRPEMARTVRNLLAPPLRLRGPVTRADGGKSGIGRGYGGAR
VDMEKLSTTDPHSRQLLAAQIIWTLARADIRGPYVINADGAPLEDRFAEG
WTTSDVAATDPGVADGAAAGLHALVNGSLVAMDAQRVTPVPGAFGRMPEQ
TAAAVSRSGRQVASVVTLGRGAPDEAASLWVGDLGGEAVQSADGHSLSRP
SWSLDDAVWVVVDTNVVLRAIQDPASGQPARIPVDSTAVASRFPGAINDL
QLSRDGTRAAMVIGGQVILAGVEQTQAGQFALTYPRRLGFGLGSSVVSLS
WRTGDDIVVTRTDAAHPVSYVNLDGVNSDAPSRGLQTPLTAIAANPSTVY
VAGPQGVLMYSASVESRPGWADVPGLMVPGAAPVLPG
>MT3456.1 transposase, degenerate
MSDATTVLFGLPGARVERVERRSDGTRVVDVITDEPTAAACPSCGGGLDI
SEGIRGYLTERSTLWRRPHHGALEQNSLAMPRRLLQAGAVHRGHHPGTCP
RPQHAAAASADGQGDRGCGPLGGPRSPRLTPCRGRRHIGRLLPTPRRVLT
EPLPTPVLGVDQTRRGKPRWERCAKTGRWVRVDPWDTGFVDLAGDQGFMG
QHEGRGGAAVLAWLQARTPQFRESIQYGGHRPRRCLRLGDPHARAAAQRQ
ARRRPLPCDHAGQRRADRGAPPGDLGVPRPARPQDRPAVGQPTSLADRPG
TLVGQKLRQNAESDQRRRPPRADSLGLDRQRGAAHPAVDRAHRRGPPPGA
PSPTPLPAWRIDSQIPELLTLATTID
>MT3593 hypothetical protein
MSTKSDHGEIGDVEPLADSTASQARRVVAAYANDADECRIFLSMLGIGPA
KLES
>MT0203 hypothetical protein
MSSLGQTATTQALPDNSDGIQLTKFAADDILPLEYAPPIGPELVSQDQLP
AAWAYKRFRDLDDKESYRRKLLQELTDALAAQGSEAAEIATAALRDLIDQ
MAEQGAVVLADIVESDDFLELVKRYDELMAREGSRSFIHRFLDLRRSPGM
LTDPAVNGALVHPLMIALISYAVGGPIRMIDARGKDAEPLSVLAQDNMLH
IDNTPFNDEYKILITWRRGTAQGPAGQNFTFLPGTHKLARTCFVNEDGVP
WSSENASIFTTPDSIRKVFDAQRQLGGQDHPTVIEVTDSERPLSGVFAAG
SLVHHRFRTASGSARSCIILVFHRVADNPGRMVSDVEDSSDVSLSELLTR
GVPDESYQQRFIATLCAAADEIAELLLKWKKTPQRPVSLPLQTKQIDGAR
FEEWISAATEAPEVREIRNRELTIPYGEVLSAEEFFDLIWRLMRFDKHGP
LDLILYHDNREEPRKWARNLIREMSADRLYERLLGWLADIQQPRPADCLR
PLQIHALISEVLKTLPLDEDQDPPADWHFDLLGMSHAEAARSVKHLLEDV
AEALLRCEDMAAYLSTSLFAFWAVDAAYSLDGRRNLVVKDCARRLLRHYT
MLSLTCFQ
>MT2668.1 PE_PGRS family protein
MLAAGADEVSQAIARLFSDYATHYQSLNAQAAAFHHSFVQTLNAAGGAYS
SAEAANASAQALEQNLLAVINAPAQALFGRPLIGNGANGTAASPNGGDGG
ILYGNGGNGFSQTTAGVAGGAGGSAGLIGNGGNGGAGGAGAAGGAGGAGG
WLLGNGGAGGPGGPTDVPAGTGGAGGAGGDAPLIGWGGNGGPGGFAAFGN
GGAGGNGGASGSLFGVGGAGGVGGSSEDVGGTGGAGGAGRGLFLGLGGDG
GAGGTSNNNGGDGGAGGTAGGRLFSLGGDGGNGGAGTAIGSNAGDGGAGG
DSSALIGYAQGGSGGLGGFGESTGGDGGLGGAGAVLIGTGVGGFGGLGGG
SNGTGGAGGAGGTGATLIGLGAGGGGGIGGFAVNVGNGVGGLGGQGGQGA
ALIGLGAGGAGGAGGATVVGLGGNGGDGGDGGGLFSIGVGGDGGNAGNGA
MPANGGNGGNAGVIANGSFAPSFVGFGGNGGNGVNGGTGGSGGILFGANG
ANGPS
>MT0839 conserved hypothetical protein
MPMRKVLVGVTGAAIVVAVLIVGAVGADFGASIYAEYRLSTTVRKAANLR
SDPFVAILRFPFIPQAMREHYAELEIKAFAVEHAGSGTATLEATMHSIDL
SYASWLIRPDAKLPVGELESRIIIDSMHLGRYLGISDLMVAAPRQESNDA
TGGTTESGISGSRGLVFSGTPISANFAHRVSVLVDLSVASDDRATLVITP
TAVVTGPDTADQPVPDDKRDAVLHAFASKLPNQKLPFGVVPNTVGARGSD
VIIEGITRGVTISLDEFKQS
>MT3961 hypothetical protein
MGTGSGGPIGVSPFHSRGALKGFVISGRWPDSTKEWAQLLMVAVRVASLP
GLLSTTTVFGAREELPDEPEPGTVGLVLAEGTVFGESAIQPGYFADHQPP
ALLMLHPPSETTPSLPECTGAASGCVLLPGLPYLGLEHRAAWVEAEADGT
ITSMVSRVGVDPISHPDTAILAMLLAA
>MT0772.5 PE_PGRS family protein
MSWVMVSPELVVAAAADLAGIGSAISSANAAAAVNTTGLLTAGADEVSTA
IAALFGAQGQAYQAASAQAAAFYAQFVQALSAGGGAYAAAEAAAVSPLLA
PINAQFVAATGRPLIGNGANGAPGTGANGGPGGWLIGNGGAGGSGAPGAG
AGGNGGAGGLFGSGGAGGASTDVAGGAGGAGGAGGNASMLFGAAGVGGVG
GFSNGGATGGAGGAGGAGGLFGAGGEGGSGGSGNLTGGAGGAGGNAGTLA
TGDGGAGGTGGASRSGGFGGAGGAGGDAGMFFGSGGSGGAGGISRSVGDG
AAGGAGGAPGLIGNGGNGGNGGASTGGGDGGPGGAGGIGVLIGNGGNGGX
GGTGATLGKAGIGGTGGVLLGLDGFTPPASTSPLHTLQQDVINMVNDPFQ
TLTGRPLIGNGANGTPGTGADGGAGGWLFGNGGNGGQGTIGGVNGGAGGA
GGAGGILFGTGGTGGSGGPGATGLGGIGGAGGAALLFGSGGAGGSGGAGA
VGGNGGAGGNAGALLGAAGAGGAGGAGAVGGNGGAGGNGGLFANGGAGGP
GGFGSPAGAGGIGGAGGNGGLFGAGGAGGNGGLFGAGGTGGAGSHSTAAG
VSGGAGGAGGDAGLLSLGASGGAGGSGGSSLTAAGVVGGIGGAGGLLFGS
GGAGGSGGFSNSGNGGAGGAGGDAGLLVGSGGAGGAGASATGAATGGDGG
AGGKSGAFGLGGDGGAGGATGLSGAFHIGGKGGVGGSAVLIGNGGNGGNG
GNSGNAGKSGGAPGPSGAGGAGGLLLGENGLNGLM
>MT2725 hypothetical protein
MHVCHTIADVVDRAKAERSENTLRKDFTPSELLAAGRRIAELERPKAKQR
QREGGDHGRQARYSGLGSMEPKPESERDAHKADTAISEALGISRGHYQRL
KRIDNATRSEAGYRDGLNGWSG
>MT3554 hypothetical protein
MPTSDPGLRRVTVHAGAQAVDLTLPAAVPVATLIPSIVDILGDRGASPAT
AARYQLSALGAPALPNATTLAQCGIRDGAVLVLHKSSAQPPTPRCDDVAE
AVAAALDTTARPQCQRTTRLSGALAASCITAGGGLMLVRNALGTNVTRYS
DATAGVVAAAGLAALLFAVIACRTYRDPIAGLTLSVIATIFGAVAGLLAV
PGVPGVHSVLVAAMAAAATSVLAMRITGCGGITLTAVACCAVVVAAATLV
GAITAAPVPAIGSLATLASFGLLEVSARMAVLLAGLSPRLPPALNPDDAD
ALPTTDRLTTRANRADAWLTSLLAAFAASATIGAIGTAVATHGIHRSSMG
GIALAAVTGALLLLRARSADTRRSLVFAICGITTVATAFTVAADRALEHG
PWIAALTAMLAAVAMFLGFVAPALSLSPVTYRTIELLECLALIAMVPLTA
WLCGAYSAVRHLDLTWT
>MT1838 PPE family protein
MLLPRPPTRPRPARGVTAMDFGALPPEVNSVRMYAGPGSAPMVAAASAWN
GLAAELSSAATGYETVITQLSSEGWLGPASAAMAEAVAPYVAWMSAAAAQ
AEQAATQARAAAAAFEAAFAATVPPPLIAANRASLMQLISTNVFGQNTSA
IAAAEAQYGEMWAQDSAAMYAYAGSSASASAVTPFSTPPQIANPTAQGTQ
AAAVATAAGTAQSTLTEMITGLPNALQSLTSPLLQSSNGPLSWLWQILFG
TPNFPTSISALLTDLQPYASFFYNTEGLPYFSIGMGNNFIQSAKTLGLIG
SAAPAAVAAAGDAAKGLPGLGGMLGGGPVAAGLGNAASVGKLSVPPVWSG
PLPGSVTPGAAPLPVSTVSAAPEAAPGSLLGGLPLAGAGGAGAGPRYGFR
PTVMARPPFAG
>MT0992 hypothetical protein
MLQRELTRLQNGWLSRDGVWHTDTDKLADLRALRDTLAAHPGTSLILLDT
ASDPRKVLAAVGVGDVDNAERVGVTMGGLNTRVSSSVGDMVKEAGIQRAK
AAELRERAGWPNYDAVASIAWLGYDAPDGLKDVMHDWSARDAAGPLNRFD
KGLAATTNVSDQHITAFGHSYGSLVTSLALQQGAPVSDVVLYGSPGTELT
HASQLGVEPGHAFYMIGVNDHVANTIPEFGAFGSAPQDVPGMTQLSVNTG
LAPGPLLGDGQLHERA
>MT0415 conserved hypothetical protein
MFGVAKRFWIPMVIVIVVAVAAVTVSRLHSVFGSHQHAPDTGNLDPIIAF
YPKHVLYEVFGPPGTVASINYLDADAQPHEVVNAAVPWSFTIVTTLTAVV
ANVVARGDGASLGCRITVNEVNRPGMSGDSSSWKGWGHVRWFIEEVPAGA
A
>MT0768.1 hypothetical protein
MDAMCSRLPPVFGMGQLPAFQSSCPRYPFVDVGPAGPWRARWRVGS
>MT3756.2 hypothetical protein
MEAALAIATLVLVLVLCLAGVTAVSMQVRCIDAAREAARLAARGDVRSAT
DVARSIAPRAALVQVHRDGEFVVATVTAHSNLLPTLDIAARAISVAEPG
>MT0632 hypothetical protein
MIRRRGARMAALLAAAALALTACAGSDDKGEPDDGGDRGASLATTSDADW
KPVADILGRTGKLNDGSVYKIGFARSDLSVQTKGVTVAPALSLGSWVAFA
RTPDGQTMLMGDLVVTEDELASVTDAVQAGGLQQTALHKHLLEQSPPIWW
THIAGHGDAADLARAVRSALDATDTPPPASATSGQTSLDLDTAAIDEALG
RSGTIAGGVYKFFIARRDPVTMSGMLIPPSMGLATALNFQPTGNGRAAIN
GDFVMTAAEVQDVVQALRGGGIDIVAIHNHGFDEQPRLFYMHFWAENDAV
ALARTLRAAVDATAAR
>MT1035 hypothetical protein
MVLRSRKSTLGVVVCLALVLGGPLNGCSSSASHRGPLNAMGSPAIPSTAQ
EIPNPLRGQYEDLMEPLFPQGNPAQQRYPPWPASYDASLRVSWRQLQPTD
PRTLPPDAPDDRKYDFSVIDNALTRLADRGMRLTLRVYAYSSCCKASYPD
GTNIAIPDWERAIASTNTSYPGPATDPSTGVVQVVPNFNDSTYLNDFAQL
LAALGRRYDGDERLSVFEFSGYGDFSENHVAYLRDTLGAPGPGPDESVAT
LGYYSQFRDQNITTASIKQLIAANVSAFPHTQLVTSPANPEIVRELFADE
VTNKLAAPVGVRSDCLGVDAPLPAWAESSTSHYVQTKDPVVAALRQRLAT
APVITEWCELPTGSSPRAYYEKGLRDVIRYHVSMTSSVNFPDQTATSPMD
PALYLVWAQANAAAGYRYSVEAQPGSQALAGKVATISVTWTNYGAAAATE
KWVPGYRLVDSTGQVVRTLPAAVDLKTLVSDQRGDRSSDQPTPASVAETV
RVDLSGLPAGHYTLRAAIDWQQHKPNGSHVVNYPPMLLSRDGRDDSGFYP
VATLDIPRDAQTAVNAS
>MT3573.1 hypothetical protein
MAMSRHHNIVIVCDHGRKGDGRIEHERCDLVAPIIWVDETQGWLPQAPAV
ATLLDDDNQPRAVIGLPPNESRLRPEMRRDGWVRLHWEFACLRYGAAGVR
TCEQRPVRVRNGDLQTLCENVPRLLTGLAGNPDTHRVLRCSRTRWSSPCG
CGARSAKATRRTNYAPPQRVVAARLRRSRLRLRGFGVPKESHVSTIYHHR
GRVAALSRSRASDDPEFIAAKTDLVAANIADYLIRTLAAAPPLTDEQRTR
LAELLRPVRRSGGAR
>MT0854.1 PE_PGRS family protein
MIGNGGAGGSGAPGAIGGAGGPAGLIGVGGAGGAGGDSAVAGVIGGAGGA
GGAALLFGAGGAGGAGGSGGSGAAGGAGGAGGAGGLFASGGSGGFGGFAS
TGTGGAGGTGGAGGLFASGGVGGTGGGAGSGGTGGVGGTGGAGGLFASGG
AGGAGGSGGTGGAGGTGGAGGLFGAGGAGGLGGQGNHTGGHGGAGGSAGL
LALGDGGAGGAGGAATTGTGGAGGAGGKAGLLFGSGGAGGSGGAAGTFGD
TGNSGGAGGAGGKAGLLFGSGGAGGSGGAGGFANGSTGGAGGAGGGAGLI
GNGGNGGSGGTSVATGGAGNGGAGGAGGGAGLIGNGGNGGSGGMGDAPGG
TGVGGIGGLLLGLDGANAPASTNPLHTAQQQALAAVNAPIQAVTGRPLIG
NGANGAPGSGAPGGHGGWLFGGGGTGGSGVSGGAGGDGGAGGILFGAGGA
GGAGGAVTGTGATGGSGGAGGGALLFGAGGAGGAGGSSGIGGFAAGGAGG
PGGAGGLFNGGGAAGAGGSGVSGGAGGEGGAGGAGGLFAGGGAGGAGGSG
NNVGGAGGAGGVGGLFGAGGAGGSGGGGSVAGDGGAGGNAGLLAPGLAGG
AGGGGGQGFDTGGAGGPGGDAGLLVGSGGVGGAGGFGLTTGGPGAAGGDA
GLLFGSGGAGGAGGSGRTDLGGAGGAGGKAGLIGNGGNGGAGGAGGNGGG
DGGPGGAAFGLGNGGNGGNGGTGTSAGSPGAGGAGGSLIGAEGLPGLLP
>MT1167 hypothetical protein
MAAYQKFGQEHAAAIRGGAVLHPTATATTVRVTGARGGDVVTGDGPYEAA
DLDEQGPFPMETVYLWEDGPNGTTRMTL
>MT3760 conserved hypothetical protein
MPEQEGELVRELAEAAESARDDGICGAVVAVIGGRGGAGASLFAVALAQA
AADALLVDLDPWAGGIDLLVGGETAPGLRWPDLALQGGRLNWSAVRAALP
RPRGISVLSGTRRGYELDAGPVDAVIDAGRRGGVTVVCDLPRRLTDATQA
ALDAADLVVLVSPCDVRACAAAATMAPVLTAINPNLGLVVRGPSPGGLRA
AEVADVAGVPLLASMRAQPRLAEQLEHGGLRLRRRSVLASAARRVLGVLP
RAGSGRHGRAA
>MT3626 hypothetical protein
MLLGGHVVGNFSRFQVLLYLGIPELVGALGVDLDLEALHHLWFYVGYVDI
KLFIPAAQLIHGAVLFNQQRVVDAGLVLPDLDILQEALADALGEHPRQLV
GHLFAHTLGLFDDDAPLQDERVLGHRVVAVDQDRLGLVVAVAVVQPVDHE
RRPEVRRLRIQMRLTVRRPQIVDIGPAHVVQILRGDVALEDVLEVRRQTE
VDVEEVRHIGDVVDDVAAVGPFDEDAVPPPVGPLVAGRLGNLGDPDRGVG
WIALMVVPDKQQPAAHIGRPRPSARHSGCAPGIRHQFAAAVATPAPVVER
ASDLVAFDGALGQVAAHVPAVAVEDFQVPVGISEHNQLGAERLYPVWLPF
QIVLRDAQAMPATRIPGRQGAGIDLPNTDPTRVGTHLSPPDRYR
>MT1790 hypothetical protein
MWVANAVLPASGKLDSITAEPVGRALRGRRA
>MT1999 hypothetical protein
MRQASGLAREGAGTIGAAQRRVIYAVQDAHNAGFNVEEDLSVTDTRTSRT
FAEQAARQAQAQALAGDIRQRATQLIGVEHEVAAKIATATAPLNTVGFHE
PPIAPSLPTPVPHNEKPQIHAVDRSWKQDPPSPMPGDPKDMTAVQARAAW
DAVNADIARYNARCGRTFVLPNEQAAYDACIADKGSLFERQAAIRARLGE
LGVPVEGEPPPAPDPAGPQPNEGLPPPGVSPPAESNLTVGPPSRPIQQAR
GGESLWDENGGEWRYFPGDNYRYPHWDYNPHDSPTARWQNIPIGDLPTHK
>MT3732 hypothetical protein
MAVGAAAVTEVGDTASPVGSSGASGGAIASGSVARVGTATAVTALCGYAV
IYLAARNLAPNGFSVFGVFWGAFGLVTGAANGLLQETTREVRSLGYLDVS
ADGRRTHPLRVSGMVGLGSLVVIAGSSPLWSGRVFAEARWLSVALLSIGL
AGFCLHATLLGMLAGTNRWTQYGALMVADAVIRVVVAAATFVIGWQLVGF
IWATVAGSVAWLIMLMTSPPTRAAARLMTPGATATFLRGAAHSIIAAGAS
AILVMGFPVLLKLTSNELGAQGGVVILAVTLTRAPLLVPLTAMQGNLIAH
FVDERTERIRALIAPAALIGGVGAVGMLAAGVVGPWIMRVAFGSEYQSSS
ALLAWLTAAAVAIAMLTLTGAAAVAAALHRAYSLGWVGATVGSGLLLLLP
LSLETRTVVALLCGPLVGIGVHLVALARTDE
>MT1978 hypothetical protein
MRFGSLALVAYDSAIKHSWPRPSSVRRLRM
>MT2045 hypothetical protein
MGIQFRLGPADHKPVEDFLSRDHAGTTAITLDTNATRHQHDAAAAAVDAG
LDVYWEPAAERLAAHPASGSTSSLCETGSPTTRMP
>MT3744 hypothetical protein
MTGRKAGGAPYTVTIPEFGAAALREQRALVIPFDPVFPARRGTR
>MT2993 hypothetical protein
MRHGFFLPGGLCVEIDGSAPARESIALTSISMRQHPPYFRGRGWHRGQ
>MT2308 conserved hypothetical protein
MTRQQLDVQVKNGGLVRVWYGVYAAQEPDLLGRLAALDVFMGGHAVACLG
TAAALYGFDTENTVAIHMLDPGVRMRPTVGLMVHQRVGARLQRVSGRLAT
APAWTAVEVARQLRRPRALATLDAALRSMRCARSEIENAVAEQRGRRGIV
AARELLPFADGRAESAMESEARLVMIDHGLPLPELQYPIHGHGGEMWRVD
FAWPDMRLAAEYESIEWHAGPAEMLRDKTRWAKLQELGWTIVPIVVDDVR
REPGRLAARIARHLDRARMAG
>MT2491 hypothetical protein
MSSRRGRRPALLVFADSLAYYGPTGGLPADDPRIWPNIVASQLDWDLELI
GRIGWTCRDVWWAATQDPRAWAALPRAGAVIFATGGMDSLPSVLPTALRE
LIRYVRPSWLRRWVRDGYAWVQPRLSPVARAALPPHLTAEYLEKTRGAID
FNRPGIPIIASLPSVHIAETYGKAHHGRAGTVAAITEWAQHHDIPLVDLK
AAVAEQILSGYGNRDGIHWNFEAHQAVAELMLKALAEAGVPNEKSRG
>MT0168 PE family protein
MSYVIAAPEMLATAAADVDGIGSAIRAASASAAGPTTGLLAAAADEVSSA
AAALFSEYARECQEVLKQAAAFHGEFTRALAAAGAAYAQAEASNTAAMSG
TAGSSGALGSVGMLSGNPLTALMMGGTGEPILSDRVLAIIDSAYIRPIFG
PNNPVAQYTPEQWWPFIGNLSLDQSIAQGVTLLNNGINAELQNGHDVVVF
GYSQSAAVATNEIRALMALPPGQAPDPSRLAFTLIGNINNPNGGVLERYV
GLYLPFLDMSFNGATPPDSPYQTYMYTGQYDGYAHNPQYPLNILSDLNAF
MGIRWVHNAYPFTAAEVANAVPLPTSPGYTGNTHYYMFLTQDLPLLQPIR
AIPFVGTPIAELIQPDLWVLVDLGYGYGYADVPTPASLFAPINPIAVASA
LATGTVQGPQAALVSIGLLPQSALPNTYPYLPSANPGLMFNFGQSSVTEL
SVLSGALGSVARLIPPIA
>MT2813 hypothetical protein
MNVGVALAGVLVDELGVKIVHAQHVPAPYLVQRMREIHERDENRQRHAQV
DVQRRRDQPERGQHQHRRNRDADHHPDGRTLAGQIVAHPVSHRVRQPRPV
AIADVLPRVGPRADCVVAHSLQGSPRRRERRRGQTAHQRLGRRSGNAIAC
PLYLENAAGPEPDTKRAEGRRFGAFGGGDLRWMADRVPRQGSGRRGLGSR
SGAGVPQGADARGWRHTADGVPRVGQPAIRRGVPGFWCWLDHVLTGFGGR
NAICAIEDGVEPRVAWWALCTDFDVPRSMGRRTPGG
>MT3808.1 hypothetical protein
MTETPQPAAPPPSAATTSPPPSPQQEKPPRLYRAAAWVVIVAGIVFTVAV
IFFSGALVLGQGKCPYHRYYHHGMFRPVGPVAPGPGMGWVFGFPGGPPPP
GMGPGFPGGPGGPAVGPTGPGPTTAPARP
>MT2731 hypothetical protein
MSGHALAARTLLAAADELVGGPPVEASAAALAGDAAGAWRTAAVELARAL
VRAVAESHGVAAVLFAATAAAAAAVDRGDPP
>MT3084 hypothetical protein
MASRRVLPYGCCSTGQIRWPERCHRYGLGTRPELASVRCGSVAKVTAGNR
SSVPSLSVARAQDPRWNRSPSMSSQRTARLPTSVRTVTPRSSASVRSRMI
DADNATGMTNSGALTGQVSTGTSRPRVRNPTAKCAIGDTLITTGDVVATP
ASSHRLVGVEFASVTPPRKFAWDFAVPSRSRIKTDTADLTAELCEGYRRT
LWTRPESPGCS
>MT1209 PE family protein
MSFVFAAPEALAAAAADMAGIGSTLNAANVVAAVPTTGVLAAAADEVSTQ
VAALLSAHAQGYQQLSRQMMTAFHDQFVQALRASADAYATAEASAAQTMV
NAVNAPARALLGHPLISADASTGGGSNALSRVQSMFLGTGGSSALGGSAA
ANAAASGALQLQPTGGASGLSAVGALLPRAGAAAAAALPALAAESIGNAI
KNLYNAVEPWVQYGFNLTAWAVGWLPYIGILAPQINFFYYLGEPIVQAVL
FNAIDFVDGTVTFSQALTNIETATAASINQFINTEINWIRGFLPPLPPIS
PPGFPSLP
>MT1931.1 hypothetical protein
MCLDQVMEGSATVHMAAPPDKIWTLIADVRNTGRFSPETFEAEWLDGATG
PALGARFRGHVRRNGIGPVYWTVCEVTACEPGREFGFAVLLGDRPVNNWH
YRLTPTADGTEVTESFRLPPSVLTTVYYRVFGGWLRQRRNIRDMTKTLQR
IKDLVEAG
>MT2736.1 hypothetical protein
MSGGWLAEHLGLSTNRLRHELADRLDAHYGPPAQNRELARPSLRIINEGT
DG
>MT1490 hypothetical protein
MSSAMQRWQIVNAGENLFRPLLMRVTSEGVGKGISPRSCNHCPDNQIQHG
LVIDRILGLSDSSITVLTRAQVEAMVAALPRSY
>MT1086 hypothetical protein
MRPCGSARGQRPASCGMRECLAMVGQVFTAGDIDYRMFQTIVCRSDLTVD
GEVLAPWPPSSLSGRLAGRR
>MT3488 hypothetical protein
MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDD
YQQAALRQSIRILKMLFEHGIETVISPIFSDDLLDRGDRYIVQALEGMAL
LANDEEILSFYKEHEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSS
NTEHRLCFGVFGNDAAESVAQFSISWNETHGKPPTRREIIEGYYGEYVDK
ADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTLRRILYDHIYL
RHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG
>MT0645.1 hypothetical protein
MQVLRQVDGSAHRLILTSLHRDARADAHRYSNGTDHAGRAADEPAETAHE
PCWVAARGLASQASRAMSATYRPSSFI
>MT0650 hypothetical protein
MGRRSPPQLRPIADATRTGSAPMTVSRSSSAPSLARRARRCTGSDDAAMS
FCVYCGAELADPTRCGACGAYKIGSTWHRTTTPTVGAATTATGWRPDPTG
RHEGRYFVAGQPTDLVREGDAEAVDPLGQQQLDQSGAVGVSPSAVSGWVR
SGHRRLWWALAGVVAFLGLVGAGVVGTLFLNRDRESIDDKYLAALRRSGL
TGEFNSDANAIARGKQVCRQLQDGGEQQGMPVDQVAVQYYCPQFSDGFHI
LETITVTGSFTLKDESPNVYAPAITVSGSGCSGSAGYADIDRGTQVTVKN
GQGDILATAFLQAGQGGRFLCTFPFSFEITEGEDRYVVSVSRRGEMSYSF
ADLKANGLSLVLG
>MT0355 hypothetical protein
MANSLLDFVISLVRDPEAAARYAANPERSIAEAHLTDVTRADVNSLIPVV
SDSLSMSEPIGAAGGAHAGDRGNVWASGAATAALDAFAPHADAGVVQQHG
AVGSVLNQPTPPGPGVTPTDPRPFRAGPHETSALLTSAEIPDTTSEDGGL
PTDHPAVWNHPVVDPHTVEPDHHGYDIHG
>MT2473 conserved hypothetical protein
MNGFLSWWDGVELWLSGLPFALQALAVMPVVLALAYFTAALLDALLGRVI
QLIRRARRPDQAPR
>MT2673.1 hypothetical protein
MDVSGALLGLVLNAPAPRPLATHRLAHTDGSALQLGVLGASHVVTVEGRF
CEEVSCVARSRGGDLPESTHAPGYHLQSHTETHDEAAFRRLARHLRERCT
RATGWLGGVFPGDDAALTALAAEPDGTGWRWRTWHLYPSASGGTVVHTTS
RWRP
>MT1548 hypothetical protein
MIAHRVPSHSFTRSYRPAAHQNSPVPSGEPSTAGHFEHLPRGSFGRILSV
LNAAADHHPRELLVVGIATFDQKRPAVGVDEHDPGGAATPAVVINYESRS
SAGGTIGHSTTSQVACCLYQQPKRPALRPTKAAATTAATTWIERVQNRRG
RHSALV
>MT0940 PPE family protein
MLAMDFGLLPPEVNSSRMYSGPGPESMLAAAAAWDGVAAELTSAAVSYGS
VVSTLIVEPWMGPAAAAMAAAATPYVGWLAATAALAKETATQARAAAEAF
GTAFAMTVPPSLVAANRSRLMSLVAANILGQNSAAIAATQAEYAEMWAQD
AAVMYSYEGASAAASALPPFTPPVQGTGPAGPAAAAAATQAAGAGAVADA
QATLAQLPPGILSDILSALAANADPLTSGLLGIASTLNPQVGSAQPIVIP
TPIGELDVIALYIASIATGSIALAITNTARPWHIGLYGNAGGLGPTQGHP
LSSATDEPEPHWGPFGGAAPVSAGVGHAALVGALSVPHSWTTAAPEIQLA
VQATPTFSSSAGADPTALNGMPAGLLSGMALASLAARGTTGGGGTRSGTS
TDGQEDGRKPPVVVIREQPPPGNPPR
>MT2195 conserved hypothetical protein
MRNMKSTSHESESGKLLSISSCRPREMVLQRYSLGMTVTADRHLADKREE
FAVEDISTGIFASGYGQVGDGRSFSFHIEHRSLVVEIYRPRVAGPVPQAE
DVVAMAVRGLVDIDLTDERSLAAAVRDSVASAAPVSR
>MT3573.6 hypothetical protein
MTPINRPLTNDERQLMHELAVQVVCSQTGCSPDAAVEALESFAKDGTLIL
RGDTENAYLEAGGNVLVHADRDWLAFHASYPGNDPLRDARPIEQDDDQGA
GSPS
>MT2110 conserved hypothetical protein
MADRVLRGSRLGAVSYETDRNHDLAPRQIARYRTDNGEEFEVPFADDAEI
PGTWLCRNGMEGTLIEGDLPEPKKVKPPRTHWDMLLERRSIEELEELLKE
RLELIRSRRRG
>MT1957.1 hypothetical protein
MGLFDPHMRLRAHGRRATDAVSGNDAVNRGRRQRFHRVTALSASTPPRPH
VSEDVRKPLATSVSSNHRCHCIAVTKSLSGNTFRALASG
>MT2467.1 PE_PGRS family protein
MLARRCSGRHGLAILTFWRCRMSFLIASPEALAATATYLTGIGSAISAAN
AVAAAPTTEILAAGTDEVSTAISALFGAHAQAYQALSAHVAAFHDQFVHT
LTAGAGSYMAAEAAAASPLQALQLELLNAINAPTLALLGRPLIGDGTDAA
PGSGGAGGAGGILIGNGGTGGASDLAGTGRGGVGGAGGAGGLFGIGGAGG
GCGSAVAIGGDGGAGGAGGVFSGGGAGGAGDAIGGSGGAGGTGGLLGGGG
GAGGAGGAGGNGGGASNSASIGGDGGSGGAGGMLYGAGGVGGNGGAAVAI
GGDGGAGGRAGAIGNGGDGGNGGTSNTPGGSGGDGGNGGNAGLIGNGGNG
GNAEIVISGGSVAGTGGNGGLLLGFNGTNGLP
>MT0036 hypothetical protein
MVDLKAHPPPKVAVGTESRGAKSNRLSAAADRHCDNAPTRSPNTAPFDVI
AGPPAPAPQCTKRNIARALGRIAPLRVGRAAVNYQ
>MT2200 hypothetical protein
MTRRLRVHNGVEDDLFEAFSYYADAAPDQIDRLYNLFVDAVTKRIPQAPN
AFAPLFKHYRHIYLRPFRYYVAYRTTDEAIDILAVRHGMENPNAVEAEIS
GRTFE
>MT2458 conserved hypothetical protein
MTPGLLTTAGAGRPRDRCARIVCTVFIETAVVATMFVALLGLSTISSKAD
DIDWDAIAQCESGGNWAANTGNGLYGGLQISQATWDSNGGVGSPAAASPQ
QQIEVADNIMKTQGPGAWPKCSSCSQGDAPLGSLTHILTFLAAETGGCSG
SRDD
>MT2365.1 hypothetical protein
MGHRACGGQKAAFPTRMNSGVEKMYKNSIAIAIGTLTMAVEFSMVSANAE
PAPPPGQDPHMPNSAMGYCPGGGFGGITGWGYCDGIRYPDGSYWHQVRVP
APFVGTTLTLSCVIDDGSPVPPLAAPGSCGGGA
>MT2518.1 hypothetical protein
MLAGKSVTGARSKGAVIGGSSRSTSDAIRHDVSRSHQRSNPDDVTTRAPE
CGSVEIDATGALSCAGVDLAEAPLV
>MT3627 hypothetical protein
MPDDQPAVPDVDRLARSMLLLHGDHHDHNDSPEQHRTCGSWSKSRDFADD
PQRAAAVREASRAERDRYLTSGLQPVDCRFCHVTVTVKRLGPGHTAVQWN
TEASRRCAYFTELRARGGDSARTRSCPRLTDSIEHAVAEGYLEHHDPNR
>MT3909 hypothetical protein
MAKNSRRKRHRILAWIAAGAMASVVALVIVAVVIMLRGAESPPSAVPPGV
LPPGPTPAHPHKPRPAFQDASCPDVQMISVPGTWESSPQQNPLNPVQFPK
ALLLKVTGPIAQQFAPARVQTYTVAYTAQFHNPLTTDNQMSYNDSRAEGT
RAMVAAMTDMNNRCPLTSYVLIGFSQGAVIAGDVASDIGNGRGPVDEDLV
LGVTLIADGRRQQGVGNQVPPSPRGEGAEITLHEVPVLSGLGLTMTGPRP
GGFGALDGRTNEICAQGDLICAAPAQAFSPANLPTTLNTLAGGAGQPVHA
MYATPEFWNSDGEPATEWTLNWAHQLIENAPHPKHR
>MT0905 hypothetical protein
MNDQRDQAVPWATGLAVAGFVAAVIAVAVVVLSLGLIRVHPLLAVGLNIV
AVSGLAPTLWGWRRTPVLRWFVLGAAVGVAGAWLALLALTLGDG
>MT3723 PPE family protein
MLDFAQLPPEVNSALMYAGPGSGPMLAAAAAWEALAAELQTTASTYDALI
TGLADGPWQGSSAASMVAAATPQVAWLRSTAGQAEQAGSQAVAAASAYEA
AFFATVPPPEIAANRALLMALLATNFLGQNTAAIAATEAQYAEMWAQDAA
AMYGYAGASAAATQLSPFNPAAQTINPAGLASQAASVGQAVSGAANAQAL
TDIPKALFGLSGIFTNEPPWLTDLGKALGLTGHTWSSDGSGLIVGGVLGD
FVQGVTGSAELDASVAMDTFGKWVSPARLMVTQFKDYFGLAHDLPKWASE
GAKAAGEAAKALPAAVPAIPSAGLSGVAGAVGQAASVGGLKVPAVWTATT
PAASPAVLAASNGLGAAAAAEGSTHAFGGMPLMGSGAGRAFNNFAAPRYG
FKPTVIAQPPAGG
>MT3302 conserved hypothetical protein, truncation
MTSPWPAGSTSRVPVLRDEWREPLRALRDPLAATDRRVRARRDRKRQWRK
QTWLGRFVSTYGWRAYALPVLMVLTTVVVYQTVTGTSTPRPAAAQTVRDS
PAIGVVGTAILDAPPRGLAVFDANLPAGTLPDGGPFTEAGDKTWRVVPGT
TPQVGQGTVKVFRYTVEIENGLDPTMYGGDNAFAQMVDQTLTNPKGWTHN
PQFAFVRIDSGKPDFRISLVSPTTVRGGCGYEFRLETSCYNPSFGGMDRQ
SRVFINEARWVRGAVPFEGDVGSYRQYVINHEVGHAIGYLRHEPCDQQGG
LAPVMMQQTFSTSNDDAAKFDPDFVKADGKTCRFNPWPYPIP
>MT1071 hypothetical protein
MCSTATGTIPGNVGTTSRRSSRRLLGSQHLPTKRSYPCSALSTYLVFDAL
IAHGDRHDRNWAVHVPPLETKYVEALCPSFDHAASLGFTLTDQTPAQHLH
DG
>MT1499 PE_PGRS family protein
MMSLVIVTPETVAAAASDVARIGSSIGVANSAAAGSTTSVLAAGADEVSA
AIATLFGSHAREYQAISTQVAAFHDRFAQTLSAAVGSYVSAEATNAAPLA
TLEHNVLNALNAPTQALLGRPLIGDGAAGAPGTGQAGGAGGILWGNGGAG
GSGAPGQVGGAGGAAGLFGTGGAGGAGGAGAAGGAGGSGGWLLGNGGVGG
AGGQSLLGGATGGAGGNAGLFGVGGTGGPGGPGGPGGVGGTGGAGGLGGT
LYGAGGHGGAGGPGPIGGVGGHGGVGGAAGLLGVGGHGGAGGHGAEGVAG
AAGEDLSPHGTSGGVGGDAGDGGAGGRGGWLAGAGGAGGDGGIGGAGGAG
GGGHSLVIATGQAGGAGGSGGAGGVGGVGGAGGLISLLGGQGAGGAGGTG
GAGGVGGDRGAGANGNQAFNAGAGGHGGNGGNPGTGGAGGTGGAGSITGA
QGAIGATPTSGGNGGAGGNGANATTAGTNGANGGPGGHGGLVGNGGAGGN
GANGAAGTNASDSGAVGGKGNSGGNGGQGGAGGDGGTLAGNGGAGGTGGR
GADGGLGGSGAEGANATTAGERGQDGGKGGNGGVGGTGGNAVAPGANGGH
GGNGGNPGFSGAGGLGGLSGDGVTRAAQGATPDFADTGGKGGNGGNGANA
VAPGGTGASGGAGGNAGAGGKGGENIIGDGGGGNGGAGGKGGAGTLLGLT
VFGDNGGAGVLGDSTDPDGSGGAGGAGGAGGAGGDPTI
>MT0770 hypothetical protein
MFFGFTSQHWDMETLLKTSEAAQILGVSRQHVVNMCDRGEMVCVHVGSHR
RVPSSEVERVTSRRLTREEERSLWLHRALLSPLLTEPDTVVSAARENLRR
WSGMHRRDGMAGWYFTKWQRVLNDGLDAVMHVLTSPSEDAREMRQNSPFA
GILPEATRVAVLRSFKDHWDREHERAMTE
>MT3481 hypothetical protein
MAQLTALDAGFLKSRDPERHPGLAIGAVAVVNGAAPSYDQLKTVLTERIK
SIPRCTQVLATEWIDYPGFDLTQHVRRVALPRPGDEAELFRAIALALERP
LDPDRPLWECWIIEGLNGNRWAILIKIHHCMAGAMSAAHLLARLCDDADG
SAFANNVDIKQIPPYGDARSWAETLWRMSVSIAGAVCTAAARAVSWPAVT
SPAGPVTTRRRYQAVRVPRDAVDAVCHKFGVTANDVALAAITEGFRTVLL
HRGQQPRADSLRTLEKTDGSSAMLPYLPVEYDDPVRRLRTVHNRSQQSGR
RQPDSLSDYTPLMLCAKMIHALARLPQQGIVTLATSAPRPRHQLRLMGQK
MDQVLPIPPTALQLSTGIAVLSYGDELVFGITADYDAASEMQQLVNGIEL
GMARLVALSDDSVLLFTKDRRKASSRALPSAARRGRPSVPTARARH
>MT0937 hypothetical protein
MGYELHRAGKSAAKLRVAACRSAASAKRSSGGQRKAPPNNALARGIGVNR
PRPSNLEETVRFYRDLVGMLDQTFAESYGSNGAIFGLPSSSLTLEIVETD
HHEQLCRYCSDKRGQQAALTGLQEVATQPVEQRPL
>MT2501 hypothetical protein
MAGHRRAVRDRRAGGDTDSADRGRRRDHAKPAGTRPIRRPCPARRIGLVF
SSFGGREKSYQRLAGIIGKLIRGDRQVRLIA
>MT2254 hypothetical protein
MSGPNPPGREPDEPESEPVSDTGDERASGNHLPPVAGGGDKLPSDQTGET
DAYSRAYSAPESEHVTGGPYVPADLRLYDYDDYEESSDLDDELAAPRWPW
VVGVAAIIAAVALVVSVSLLVTRPHTSKLATGDTTSSAPPVQDEITTTKP
APPPPPPAPPPTTEIPTATETQTVTVTPPPPPPPATTTAPPPATTTTAAA
PPPTTTTPTGPRQVTYSVTGTKAPGDIISVTYVDAAGRRRTQHNVYIPWS
MTVTPISQSDVGSVEASSLFRVSKLNCSITTSDGTVLSSNSNDGPQTSC
>MT1083 hypothetical protein
MDCCEERGVARHKGLSQVGTPGCPRWSQAVSCRCSAYREAAVTAVQMPLT
PGYGETPLPHDELAALLPEVVEVLDKPITRADVYDLEQGLQDQVFDLLMP
TAVEGSLSLDELLSDHFVRDLHARMFGPV
>MT3721 conserved hypothetical protein
MTINYQFGDVDAHGAMIRALAGLLEAEHQAIISDVLTASDFWGGAGSAAC
QGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA
>MT3754 hypothetical protein
MLTLDIATRVWSTSRGTRPPSRPGALRRLSERAIVTTLLMPAPLDVDDEI
GGVVRRALPIPYVGGSRIQPSSVQSA
>MT3176 hypothetical protein
MPIPFADGMLSRLGRRGAALDLIEEFEDESGEPPASLSPADLLAAEPALL
LQKMENRLVRHHLANPDVLSGEQLRKLRYILNFARLADFEPGAAGPGGSR
GRGDISVGGQVAPWRSRVVDALYAPLREEPDPVTALEGAKDVLATLVDDQ
DDQRRVLIERHGSDFSATELDAEVGYKKLVTVLGGGGGAGFVYIGGMQRL
LAAGQVPDYMIGSSFGSIIGSLVARELPVPIDEYAEWAKTVSYRAILGPE
RRRSRHGLAGMFTLRFDQFAHTLLSRADGERMRMSDLAIPFDVVVAGVRR
QPYAALPSRFRHRERSTLTLRSLPFLPIGIGPWVAARMWQVAAFIDLRVV
KPIVISADGATRDVNVVDAASFSSAIPGVLHHETSDPRMLPILDELCADQ
DVAAMVDGGAASNVPVELAWERVRDGRLGTRNACYLAFDCFHPHWDPRHL
WLVPITQAVQLQMVRNLPYADHLVRFEPTLSPVNLAPSAAAIDRACRWGR
DSVEPAIAVTSALLEPTWWEGDRPPAAEPKERTKSAASSMSAVMAAIQAP
TGRFRRWRSRHLT
>MT2300 hypothetical protein
MAGEIGGQRTTPVGGGLPLACCLDGRPPIVPHRRRRRIAALRSVLRMRDT
PRPARSRCDQVTSHAVLIGWRAVPRRHGGELPRRGALALGCIALLLMGIV
GCTTVTDGTAMPDTNVAPAYRSSVSASVSASAATSSIRESQRQQSLTTKA
IRTSCDALAATSKDAIDKVNAYVAAFNQGRNTGPTEGPAIDALNNSASTV
SGSLSAALSAQLGDALNAYVDAARAVANAIGAHASTAEFNRRVDRLNDTK
TKALTMCVAAF
>MT3756 PE_PGRS family protein
MHGQTYQALSARAAAFHERFVQALATGGGAYAAAEAASVSPLQSALDLLN
APTQALLGRPLVGNGANGAPGTGANGGDGGILFGSGGAGGSGAAGMAGGN
GGAAGLFGNGGAGGAGGSATAGAAGAGGNGGAGGLLFGTAGAGGNGGLSL
GLGVAGGAGGAGGSGGSDTAGHGGTGGAGGLLFGAGGAGGAGEDGTTPGG
NGGAGGVAGLFGDGGNGGNAGVGTPAGNVGAGGTGGLLLGQDGMTGLT
>MT1957 conserved hypothetical protein
MMRLKPAPSPAAAFAVAGLILAGWAGSVGLAGADPEPAPTPKTAIDSDGT
YAVGIDIAPGTYSSAGPVGDGTCYWKRMGNPDGALIDNALSKKPQVVTIE
PTDKAFKTHGCQPWQNTGSEGAAPAGVPGPEAGAQLQNQLGILNGLLGPT
GGRVPQP
>MT1609 conserved hypothetical protein
MPLSGEYAPSPLDWSREQADTYMKSGGTEGTQLQGKPVILLTTVGAKTGK
LRKTPLMRVEHDGQYAIVASLGGAPKNPVWYHNVVKNPRVELQDGTVTGD
YDAREVFGDEKAIWWQRAVAVWPDYASYQTKTDRQIPVFVLTPVRAGG
>MT3278 hypothetical protein
MEYVQLFSKGRLNDLAGSLAGFLGKARQATAQRLQSWDADDLLNTPVDDV
VEQLVELGSVECPDLRVDDAFMLPATEVDQQYRDWGEQRTRRVTRLVLVV
PFEGHKDIFNLRPDQFTTMPPQVLRLQGHEIHLAIDNPSNDAAAINAAFH
KQIANIEKYLGWSRRQIDLHNQGLRNELPGMVARRREQLLATRNLQAEIG
FPVRRRKDADTYAAPISRKSVRPRPHRPAGARAAFKPEPAMQDEDYQSAL
RVLRNQRNALERTPSVAAKLDGEEIRDMLLVGLNAQFEGDAGGELFNGAG
KTDILIRVDDRNIFIGECKVWSGPRTMDDVLKQLFGYLVWRDTKAAILLF
IRNKDVTAVIDNAIAKIKEHPNHKRCPAHRAGADQYEFTMHADGDPEREI
HLTLIPFALRPTAEVPTTTIP
>MT2445 conserved hypothetical protein
MKMVKSIAAGLTAAAAIGAAAAGVTSIMAGGPVVYQMQPVVFGAPLPLDP
ASAPDVPTAAQLTSLLNSLADPNVSFANKGSLVEGGIGGTEARIADHKLK
KAAEHGDLPLSFSVTNIQPAAAGSATADVSVSGPKLSSPVTRNVTFVNQG
GWMLSRASAMELLQAAGN
>MT3939 hypothetical protein
MVSLLVHAALGVVVIGWIVSSNPKVFTRPAGGSWFSLPECVYYVVGIASI
ALGWYFNIRFVQQYAHGAANPLWGPGSWAEYVRLMFTNPAASSAGQDYTI
ANVILLPLFSTTDGYRRGLRRPWLYFVSSLFTSFAFAFAFYFATIERQHR
HERSRATVGA
>MT3953 hypothetical protein
MATHSYGPGSTTPPNSGSPEWNHAYGIAALRAALIALALLAILAVIALV
>MT0467 conserved hypothetical protein
MLMRTWIPLVILVVVIVGGFTVHRIRGFFGSENRPSYSDTNLENSKPFNP
KHLTYEIFGPPGTVADISYFDVNSEPQRVDGAVLPWSLHITTNDAAVMGN
IVAQGNSDSIGCRITVDGKVRAERVSNEVNAYTYCLVKSA
>MT1707.1 hypothetical protein
MEPAVATTLIGISAWWANGSVKQYAGDLTDRVATMTVCRRTPAPRVHYRQ
>MT0497 hypothetical protein
MTNPQGPPNDPSPWARPGDQGPLARPPASSEASTGRLRPGEPAGHIQEPV
SPPTQPEQQPQTEHLAASHAHTRRSGRQAAHQAWDPTGLLAAQEEEPAAV
KTKRRARRDPLTVFLVLIIVFSLVLAGLIGGELYARHVANSKVAQAVACV
VKDQATASFGVAPLLLWQVATRHFTNISVETAGNQIRDAKGMQIKLTIQN
VRLKNTPNSRGTIGALDATITWSSEGIKESVQNAIPILGAFVTSSVVTHP
ADGTVELKGLLNNITAKPIVAGKGLELQIINFNTLGFSLPKETVQSTLNE
FTSSLTKNYPLGIHADSVQVTSTGVVSRFSTRDAAIPTGIQNPCFSHI
>MT1066 conserved hypothetical protein
MTINYQFGDVDAHGAMIRAQAGSLEAEHQAIISDVLTASDFWGGAGSAAC
QGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA
>MT3821 conserved hypothetical protein
MGQVSAASTILINAEPTATLDALADYETVRPKILSPHYSEYQVLEGGKGR
GTVAKWRLQATQSRVRDVQVNVDVAGHTVIEKDMNSSMVTNWTVAPAGPG
SSVTVKTTWTGAGGVKGFFEKTFAPLGLKKIQAEVLSNLKTELEGDA
>MT4011 conserved hypothetical protein
MPLSLSNRDQNSGHLFYNRRLRAATTRFSVRMKHDDRKQTAALALSMVLV
AIAAGWMMLLNVLKPTGIVGDSAIIGDRDSGALYARIDGRLYPALNLTSA
RLATGTAGQPTWVKPAEIAKYPTGPLVGIPGAPAAMPVNRGAVSAWAVCD
TAGRPRSADKPVVTSIAGPITGGGRATHLRDDAGLLVTFDGSTYVIWGGK
RSQIDPTNRAVTLSLGLDPGVTSPIQISRALFDGLPATEPLRVPAVPEAG
TPSTWVPGARVGSVLQAQTAGGGSQFYVLLPDGVQKISSFVADLLRSANS
YGAAAPRVVTPDVLVHTPQVTSLPVEYYPAGRLNFVDTAADPTTCVSWEK
ASTDPQARVAVYNGRGLPVPPSMDSRIVRLVRDDRAPASVVATQVLVLPG
AANFVTSTSGVITAESRESLFWVSGNGVRFGIANDEATLRALGLDPGAAV
QAPWPLLRTFAAGPALSRDAALLARDTVPTLGQVAIVTTTAKAGA
>MT3502 hypothetical protein
MTLTDSKDGCLLTSNIFSNRRLVMTAAFASDQRLENGAEQLESLRRQMAL
LSEKVSGGPSRSGDLVPAGPVSLPPGTVGVLSGARSLLLSMVASVTAAGG
NAAIVGQPDIGLLAAVEMGADLSRLAVIPDPGTDPVEVAAVLIDGMDLVV
LGLGGRRVTRARARAVVARARQKGCTLLVTDGDWQGVSTRLAARVCGYEI
TPALRGVPTPGLGRISGVRLQINGRGR
>MT3282 hypothetical protein
MLRNGSSTVVRRAPTANRDAAGGGIKMRRMPLYNGPTTPKPTGTRYGPLI
HVTLNG
>MT2138.1 hypothetical protein
MRANGVFGLLAAAACGVPIPVIDNRAEEMTGRHATTATSFSITDQSCAS
>MT0030 hypothetical protein
MTDRIHVQPAHLRQAAAHHQQTADYLRTVPSSHDAIRESLDSLGPIFSEL
RDTGRELLELRKQCYQQQADNHADIAQNLRTSAAMWEQHERAASRSLGNI
IDGSR
>MT2370.2 hypothetical protein
MREITGVPVSTLHGWAAKRERGIDAPGPHYVRLGGRDRRWTRRDMYDWLE
SARV
>MT3573.12 hypothetical protein
MAADVYALQSALPPRFQASAAFAANLSTINTLRQAETSNGALKFPSLHDS
PPMLAGKSVLEVSHMDTVDSAVTATNHPLVLGDWKQFLIGDRVGSMVELV
PHLFGPNRRPTGQRGFFAWFRVGSDVLVRNAFRVLKVETTA
>MT1942 hypothetical protein
MIMCEGRPTESPIPRWLRFVLTSDRAGSAWYIGAGFFFAPVLAVLSPWPT
ITAVLWWIIGLAGLWLGLLGIAMAVGLARVLRSGAEIPEAYWRTLVDYRS
ANE
>MT0603 hypothetical protein
MGPKVLKTAPKGHIAGDKPCLPGGEVPVGGGHQV
>MT0831 hypothetical protein
MGRGRAKAKQTKVARELKYSSPQTDFQRLQRELSGTGTDRLDGDGPSDDD
SWNDEDDWRR
>MT1151 hypothetical protein
MGRSVRHGDDLFGRNVAMTARVAGQAVGGQILVGEPVHDAVSDCADIRFG
SYRLFSLDAAPGPDLD
>MT0827 hypothetical protein
MSCKSYRLDRTILYGPRASAVNLRRPAICEILSVT
>MT1810 hypothetical protein
MSDFDTERVSRAVAAALVGPGGVALVVKVFAGLPGVIHTPARRGFFRSNP
ERIQIGDWRYEVAHDGRLLAAHMVNGIVIAEDALIAEAVGPHLARALGQI
VSRYGATVIPNINAAIEVLGTGTDYRF
>MT2653 hypothetical protein
MRYRRRPALHAMTVARHPGKPNCVSRTAISSRKLSLASGFALWRRSLV
>MT1585.1 hypothetical protein
MSPGGFRLTGHVVWCRSSDERGADFRQLPASLYSCDALYRADNRTCRGRE
EVRDA
>MT2317 conserved hypothetical protein
MEPKEQQMRASNQFADVTSGVVYIHASPAAVCPHVEWALSSTLQAKANLV
WTPQPALPPQLRAVTNWVGPVGTGARLANALRSWSVLRFEVTEDPSPGVD
GQRFSHTPQLGLWSGAMSANGDIMVGEMRLRAMMAQGADTLAAELDSVLG
TAWDQALEVYRDGGDAGEVTWLSRGVG
>MT0044 conserved hypothetical protein
MFLAGVLCMCAAAASALFGSWSLCHTPTADPTALALRAMAPTQLAAAVML
AAGGVVAVAAPGHTALMVVIVCIAGAVGTLAAGSWQSAQYALRRETASPT
ANCVGSCAVCTQACH
>MT3378 hypothetical protein
MRMAVQNDLLASGQDILRIAHARDLNLLGTPAPRELTQMHHVAR
>MT3847 hypothetical protein
MLRVALGLTKASPVYLPGVKGRILLAGQLEHRSSWK
>MT2801.1 hypothetical protein
MMMNWRQTNITTKRCAQTRASSSASEFCGIFAAPGLMRNCHHGGSAPSAV
GGSAVQLTVAYGPQRFHGRCASNSSVRPLTTGGSWTPTSISSTDGGKAQG
HDTHDRQISRRTVCQAASILASILLETVAGPGEGIGPTTSVPLRAADARH
TREGLQGR
>MT2712 PE_PGRS family protein
MRVVRSRRSQMSFVIAVPEALTMAASDLANIGSTINAANAAAALPTTGVV
AAAADEVSAALAALFGSYAQSYHAFGAQLSAFHAQFVQSLTNGARSYVVA
EATSAAPLQDLLGVVNAPAQALLGRPLIGNGANGADGTGAPGGPGGLLLG
NGGNGGSGAPGQPGGAGGDAGLIGNGGTGGKGGDGLVGSGAAGGVGGRGG
WLLGNGGTGGAGGAAGATLVGGTGGVGGATGLIGSGGFGGAGGAAAGVGT
TGGVGGSGGVGGVFGNGGFGGAGGLGAAGGVGGTASYFGTGGGGGVGGDG
APGGDGGAGPLLIGNGGVGGLGGAGAAGGNGGAGGMLLGDGGAGGQGGPA
VAGVLGGMPGAGGNGGNANWFGSGGAGGQGGTGLAGTNGVNPGSIANPNT
GANGTDNSGNGNQTGGNGGPGPAGGVGEAGGVGGQGGLGESLDGNDGTGG
KGGAGGTAGTDGGAGGAGGAGGIGETDGSAGGVATGGEGGDGATGGVDGG
VGGAGGKGGQGHNTGVGDAFGGDGGIGGDGNGALGAAGGNGGTGGAGGNG
GRGGMLIGNGGAGGAGGTGGTGGGGAAGFAGGVGGAGGEGLTDGAGTAEG
GTGGLGGLGGVGGTGGMGGSGGVGGNGGAAGSLIGLGGGGGAGGVGGTGG
IGGIGGAGGNGGAGGAGTTTGGGATIGGGGGTGGVGGAGGTGGTGGAGGT
TGGSGGAGGLIGWAGAAGGTGAGGTGGQGGLGGQGGNGGNGGTGATGGQG
GDFALGGNGGAGGAGGSPGGSSGIQGNMGPPGTQGADG
>MT3876 hypothetical protein
MTPSGTTRNHGVSRIHRPQITVRCRAQKIAQPFGESLVEGAEQKVLDLGL
SLFQRPHPRIGVVGLGRELADFGRHAVQSSGVFPDGRRDFSLVGARRVDG
REDGSIVGLVRFQSGDPCSKLLQRRHGLILSGRSAGGSSAKCGCNLPTSC
AARNAPPGGIRPRQTTVPRSDR
>MT2742 hypothetical protein
MPAQPMPAEASDPSPSRRRVCNHAKVPHREPRRHRCAAGLIVLATLLVAA
AGVAAANDVPRAWAGDAPIGHIGDTLRVDTGTYVADVTVSSVVPVDPPPG
FGYTRSGVPVKSFPDSSVTRADVTVRAVRVPNSFILATNFSFTGVTPFAD
AYKPRPCDASDWLDAALGNAPQGSIVRGGVYWDAYRDPVSVVVLLDEKTG
QHLAQWNL
>MT0118 PE_PGRS family protein
MRVGQPNPRNWRNVPRKPRYIWLLATRQLARKRQRSLMSLLITSPATVAA
AATHLAGIGSALSTANAAAAAPTTALSVAGADEVSVLIAALFEAYAQEYQ
ALSAQALAFHDQFVQALNMGAVCYAAAETANATPLQALQTVQQNVLTVVN
APTQALLGRPIIGNGANGLPNTGQDGGPGGLLFGNGGNGGSGGVDQAGGN
GGAAGLIGNGGSGGVGGPGIAGSAGGAGGAGGLLFGNGGPGGAGGIGTTG
DGGPGGAGGNAIGLFGSGGTGGMGGVGGMGGVGNGGNAGNGGTAGLFGHG
GAGGAGGIGSADGGLGGGGGNGRFMGNGGVGGAGGYGASGDGGNAGNGGL
GGVFGDGGAGGTGGLGDVNGGLAGIGGNAGFVGNGGAGGNGQLGSGAVSS
AGGMGGNGGLVFGNGGPGGLGGPGTSAGNGGMGGNAVGLFGQGGAGGAGG
SGFGAGIPGGRGGDGGSGGLIGDGGTGGGAGAGDAAASAGGNGGNARLIG
NGGDGGPGMFGGPGGAGGSGGTIFGFAGTPGPS
>MT3297 hypothetical protein
MATMAAVVGGGPQDEIPEADAVEQGRAVDFDDEAGLDTAYLSGGAGDRDA
SEADVVDQAFVVPVADDEEIDR
>MT1578.1 hypothetical protein
MIDSSVNIKVAAHNYNTPVQRTIAEAPDASRSDVAFPCLVVNI
>MT2867 hypothetical protein
MDARKQRVFQISPEQWMHSAAQVTTQGEGLTVGHLSSDYRMQAAQFGWQG
ASAMALNAKMDDWLDASRALLTRIGDHAFGLQEAAIQHAAAEAERAQALA
QVGVSADVVAGPRGV
>MT2541 hypothetical protein
MCPTHCRAPARAPADSEFGDKNPGVFRRRIHYRRLVELRRLGAGPLELKK
VRVGSEGNRGIPELAGFLAARNLREHRPEERHTIDMDRRRTDVLAHRVHP
GVVAFAQGVVVSVAGSGLGQLRGQTGVRQRLGDDLVEFVVALVVNPVAHR
GVQRVQDFRPMGLLGGDRHPYRSPCPRHAFAVLLGQVVTVFVEYCQAHDV
EVHLDVADLCHLEDPARRDPAPRAQRIEPEIGDRLLGGLLEHGAVLSLES
AAVSTPPSTTTAAPHLFPPTRLSWTPWPFQTSRGTKPSNAPP
>MT1713 hypothetical protein
MPPLHSSIQMFVYSIDCLIMSFRHAAAVSPPGRRTATRCRLRPHPRPRVR
DRALRRALGVGGMSSAVSTGAFLTTVCLAHLVLGALMGVLVHEFGADMLS
LWPVGPALCH
>MT3077 hypothetical protein
MARIADELTFSLERVLQPLQKVVECCSKIPDFVSSQWQWQTGMWVGGADA
RGADAHAFHGSKSRGGQQIRAQGVDEDVWPCPDRCDRASSWFHNHPIDHG
>MT0996 hypothetical protein
MTSAGLRCFSGQRHTKGASAMVWHGFLAKAVPTVVTGAVGVAAYEALRKM
VVKAPLRAATVSVAAWGIRLAREAERKAGESAEQARLMFADVLAEASERA
GEEVPPLAVAGSDDGHDH
>MT1717 hypothetical protein
MIDATPVGLALAHAEMPGLRDELGLAPRVILPS
>MT0762.1 hypothetical protein
MRAGRRVSAQCRAEMGAYASARARLGAQPATFHDQFVRALTRARARMRPP
RPAGLLLGNGGARRYPAAGS
>MT0130 hypothetical protein
MASSSSLPLLRHCVSDDQVTVVGFDGDDLGKTARRIAALVVQRAIFLNDR
NTAVAHSGDDAVLGHAVLPGVPRDPDPLHASSMYSILGMCQSVNGRPFDA
IALVSVRLCHVQTDPTDSCGGRDRPGQLPCAPLDYHRHH
>MT1830 conserved hypothetical protein
MQNHDYVTYEEFGRRFFEVAVTPDRVAAAFADIAGSEFAMEPISQGPGGI
AKVSANVKIREPRVTRKLGDLITFVIHIPLSIDLLLDLRLDKQRFMVAGD
IALRATARAAEPLLLIVDVAKPRPSDITVNVSSKSIRGEVLRILAGVDGE
IRRFIAQYVSAEIDSPKSQAAQVINVAEQLDSTWSGP
>MT2590 hypothetical protein
MLYSFDTSAILNGRRDLFRPAVFRSLWGRVEDAISAGQIRSVDEVQRELA
RRDDDAKRWADGQTGLFCPLDEQIQQAARHILRLHPNMVRQGGRRSAADP
FVIALAMVNNATVVTQETASGNIEKPRIPDVCDALGVPWLTLMGYIEAQG
WTF
>MT1343 conserved hypothetical protein
MTTPAQDAPLVFPSVAFRPVRLFFINVGLAAVAMLVAGVFGHLTVGMFLG
LGLLLGLLNALLVRRSAESITAKEHPLKRSMALNSASRLAIITILGLIIA
YIFRPAGLGVVFGLAFFQVLLVATTALPVLKKLRTATEEPVATYSSNGQT
GGSEGRSASDD
>MT2158 conserved hypothetical protein
MQRRIMGIETEFGVTCTFHGHRRLSPDEVARYLFRRVVSWGRSSNVFLRN
GARLYLDVGSHPEYATAECDSLVQLVTHDRAGEWVLEDLLVDAEQRLADE
GIGGDIYLFKNNTDSAGNSYGCHENYLIVRAGEFSRISDVLLPFLVTRQL
ICGAGKVLQTPKAATYCLSQRAEHIWEGVSSATTRSRPIINTRDEPHADA
EKYRRLHVIVGDSNMSETTTMLKVGTAALVLEMIESGVAFRDFSLDNPIR
AIREVSHDVTGRRPVRLAGGRQASALDIQREYYTRAVEHLQTREPNAQIE
QVVDLWGRQLDAVESQDFAKVDTEIDWVIKRKLFQRYQDRYDMELSHPKI
AQLDLAYHDIKRGRGIFDLLQRKGLAARVTTDEEIAEAVDQPPQTTRARL
RGEFISAAQEAGRDFTVDWVHLKLNDQAQRTVLCKDPFRAVDERVKRLIA
SM
>MT2737 hypothetical protein
MEVRASARKHGINDDAMLHAYRNALRYVELEYHGEVQLLVIGPDQTGRLL
ELVIPADEPPRIIHANVLRPKFYDYLR
>MT2334 hypothetical protein
MNRHSTAASDRGLQAERTTLAWTRTAFALLVNGVLLTLKDTQGADGPAGL
IPAGLAGAAASCCYVIALQRQRALSHRPLPARITPRGQVHILATAVLVLM
VVTAFAQLL
>MT1426 hypothetical protein
MNSGTLAGSLIFAAVLVMLIAVLARLMMRGWRRRSERQAELLGDLPDVPE
HVSSATVTTRGLYVGATLSPAWNERVTVGDLGYRSKAVLTRYPSGIMVER
ARAQPIWIPTESIAAIRMERGVAGKVVAGIGILAIRWRLPSGTEIDVGFR
ADNRDEYQEWLEEPV
>MT3319 hypothetical protein
MSSPVSSRRLANLVKESLQGSVLGGVVSDAVLPAVSDDVKPGAGEDAYRV
PVVVAAGSGAVVQVGGLEVGSAAVAGEVADTVAELFVCRPTEPDVGDFVG
LAGGAGDAGQAGQQFGLGVGVRGESFGARRRLALSTVGASGATAGLRKTH
DGHHGCQARGALTQRRLYIGNPSEITDTRMVHQ
>MT2235 conserved hypothetical protein
MVALYGACICSQGGRSFLEVFHWLQHDIVDRGRLPLLCCLVAFVLTFLVT
RSFVRFIHRRAADGRPARWWQPRNVHIGSVHIHHVAFGVVLVMISGLTLV
TLSVDGREPEFTIAASIFGVGAALVLDEYALILHLSDVYWEEDGRTSVDA
VFAAVAVAGLLIMGLHPLIFFLPVRQGANWVVLQTTLIAGLVLTLPLAVV
VLLKGKVWTGLLGMFVVVLLVVGAVRLSRPHAPWARWRYTRHPEKMRRAL
QRERTWRRPVVRIKLWLQYVIAGTPRMPDERAVDAQLDQDVRPAPPPERT
APILISGSVWSD
>MT0291.3 hypothetical protein
MFGNGGDGGAGGFGAGTGGNGGVGGNAVLIGNGGNGGNGGKAGGTPGAGG
TSGLIIGENGLNGL
>MT1853 PE_PGRS family protein
MDSPCDDNGQEVWTSQMIVAPAFVDAAAKDLATIGSAISRANAEALVPIT
ALLPAGADDVSAAIAALFATHGQAYQELSAHAVAFHEQFVQLMSAGAAQY
ASAEAANSSPLQIVGQTALDAINSPVQTLTGRPLIGNGANGVAGTGQNGG
DGGWLYGNGGNGGSGGTGQNGGNGGSAGLWGSGGNGGQGGAGANGAAGQP
GKAGGSGGNGGAGGWIYGHGGHGGAGGNGGNATAPGGASAGFDGGAGGNG
GSGGRGGLLFGNGGNGSVGGMGGQGTNDTAGDSAGSGGLGGNGGNGAQGG
WLIGNGGQGGDSGAGGGTDSTQTGVMNGASGGSAGIAGNGGDAGLVGNGG
AGGNGGNGAAGSALGTTIFGGSGGVGGSGGDGGNGGWLFGSGASGGNGGQ
GGDAGTNGFAGFGGSAGGGGWVGAVNFGPISVQGFGLFGHGGDGGNGGDV
GAGSLSIQFGASGGDGGQGGVLYGNGGNGGNAGSGGGTGFEXSAGXGGAA
ILIGNXGAGGNGATGGTGVGNIIQEAGCDSSDGGAGGSGGLLFGSGGAGS
IGGAGSVGRSGNDGSNGSDGGQGGASGLGIGNRGLGGSGGTGGAGGTGGS
AGTGGAGGDGGNAALLIGTGGDGGDGVPPAPGGQGGKGGLIGLPGQNGQP
>MT2002 hypothetical protein
MEQIVIRNLPEGTKAALRVRAARHHHSVEAEARAILTAGLLGEEVPMPVL
LAADSGHDIDFEPERLGLIARTPQL
>MT0717 hypothetical protein
MLGWTVKPGRVADGWQAPGVHLMARCSGPQPASERRADMDGGDIDAAVAR
VRAAGALAEPSRQPDDMSAECADDQGARCHLGQL
>MT2068 hypothetical protein
MSCARRSDIPADRDVSLGRPDYRDDPSPAHQPGSWSRLFTGATFPGASKA
VADLREAAVSETHDTKDVLAALAARKSPVRPF
>MT2872 hypothetical protein
MHDHQVLAARHAHQGPHVLQQRPGFVAEAPRPKATPVDLLGRARQPRAGQ
HLPRRRAAHPRGGHHRIQNLAVAPPHHRRQQQRGHSRRSIGSTSPSDDSA
SYSQRPRDVADPPVEASTLEGQEAVVTVELGGAVVDGVDDQGAGAVVPGT
GHGSDEGIEEKIATETGALLLPVERQASEDEHWDRVGSGWPRPGRDGTRI
RSMLPMASA
>MT2619 lipoprotein, putative
MIAPQPIPRTLPRWQRIVALTMIGISTALIGGCTMDQSPDTSRRLTDEQK
IQLIDSMRNKGSYEAARERLTATARIIADRVSAAIPGQTWKFTEDPAGRK
ADREGLSCKELTGDIARRPIADAVIFGTAFSAEDFKVVTNIVREEAAKYG
ATTESSLFNESAKRDYDVQGNGYEFRLLQIKFATLNITGDCFLLQKVLDL
PAGQLPPEPPIWPTTSTPH
>MT0347 hypothetical protein
MPENRCDVVLDHTERRGAGPGDARIFAAGDLLFDDHDSVVHVHAAKFCGC
GERAGPPAQRIVALYSPVRVTDVDVVSRQRHEGLNVGGIEGVVPGQHGSD
LRCGHGLKVTVCGGYSGVRSSHFCHTPASPNSSMPVSATESRRTIASSSS
VAPPRSATAPFPCRCDQVTPRSSRRTVHSPASRSPGSVACRCKVSSSSSG
NGVPPLPAWTAMRSRNSLIPSAATFGSSVNSPPTVIATSARCTATSCSRR
RTHQPAGRGPRKVHTGVSTPVCSTASTSSRAPPYSQEIASSGSGGGLPPS
TLRGSRKWSSRSRTICAAQRSPRPTWRNS
>MT1765 hypothetical protein
MVGNEENELQDLRNLRRPCFSRAEAPIGVYNGEQAIIVYDLRPVPHWPKY
WIQALAKHFQRQLKPSPKIDISLLDDRIRFSVFVSTDVSAKDLCKLDDAV
YNAVRNAGRAIENEQAALDHKLAEVRKRRMDTWDESYFR
>MT3536 hypothetical protein
MKYKLAILDEYDRADRTERGAILRRENLYSSLLTE
>MT3573.16 hypothetical protein
MCGVVACGFDCWGSIGFVVVVLVVEESFGVVEGVGAGLAVDC
>MT1479 hypothetical protein
MRKYLRANRGELTLMAIVNRFNIKVIAGAGLFAAAIALSPDAAADPLMTG
GYACIQGMAGDAPVAAGDPVAAGGPAAAGACSAALTDMAGVPFVAPGPVP
AAAPVPIGAPVPIPGAPVPIPGAPVPIPGAPVPIPGGPVPIPGAPVPVPA
VPAPVIPVGTPLIALGPVLAGAPGDGVVSAPIIGMSGVKDALTDPAPAGG
PVPGQPVLPGPSASAPAGAR
>MT1132 hypothetical protein
MGCILLAGLVGMCSFQLGGTKRGPIPSYDAAQALRADAKTLGFPIRLPQL
PGGWTPNSGGRGGIENGRADPATGQRRNAATSIVGFISPTGRYLSLTQSN
ADEDKLVGSIHPSMYPTGTVDVGGTRWVVYEGSDENGAVEPVWTTRLTGP
GGATQLAITGAGSIDQFRTLASATQSQPPLPAR
>MT2622 hypothetical protein
MMVFCVDTSAWHHAARPEVARRWLAALSADQIGICDHVRLEILYSANSAT
DYDALADELDGLARIPVGAETFTRACQVQRELAHVAGLHHRSVKIADLVI
AAAAELSGTIVWHYDENYDRVAAITGQPTEWIVPRGTL
>MT3929 conserved hypothetical protein
MWSTVLVLALSVICEPVRIGLVVLMLNRRRPLLHLLTFLCGGYTMAGGVA
MVTLVVLGATPLAGHFSVAEVQIGTGLIALLIAFALTTNVIGKHVRRATH
ARVGDDGGRVLRESVPPSGAHKLAVRARCFLQGDSLYVAGVSGLGAALPS
ANYMGAMAAILASGATPATQALAVVTFNVVAFTVAEVPLVSYLAAPRKTR
AFMAALQSWLRSRSRRDAALLVAAGGCLMLTLGLSNL
>MT0556 PE_PGRS family protein
MSNLLVTPELVAAAAADLAGIGSAIGAANAAAGAPTMALLAAGADEVSAA
VAAVFSSYAQQYQALSAAAAAFHDQFVRALAAGAGAYAGAEAANVEQQLL
NAINAPTLALLGRPLIGNGADGAAGTGQAGGAGGLLYGNGGNGGSGAAGQ
AGGAGGAAGLIGHGGTGGVGGTGAAGGAGGTGGWLFGNGGAGGTGGAVTG
VSTTGGPGGHGGDAGLYGFGGAGGAGGFGQSGAAGGAGGAGGWLYGDGGD
GGAGGNGGNESGTGVSGVGGVGGAGGAGGLLFGNGGDGGVGGDGGDGSST
QDSGGDGGAGGAGGAGGWLLGNGGAGGAGGAASIKVATGGLGGDGGDAGL
FGFGGDGGWGGRGVDARFGAAGGAAGAGGAGGWLYGDGGAGGVGGVGGAV
FSLSSGDVGAGGAGGGGGWLFGNGGDGGAGGGGGGRFGSGSGAGGDGAVG
GAGGAGAWFGNGGAGGVGGGGGRGTTAIGGDGGAGGAGGAGGWLYGDGGA
GGAGGGGGRGGTGNDGGDGGDGGRGGDAQLLGNGGDGGAGGAGGPAGLAL
PPGPARPAGAAVPAVRCSAAPARPARTADPWLAPIFARSTLRHSHHLGGI
AQTGAVADQQGQIAGLGRAGRQ
>MT3315 WhiB-related protein
MDWRHKAVCRDEDPELFFPVGNSGPALAQIADAKLVCNRCPVTTECLSWA
LNTGQDSGVWGGMSEDERRALKRRNARTKARTGV
>MT3141 transcriptional regulator, TetR family
MVTVTPSGATLAASSAIWRLNDAARRLPTKAKILMSDLVSDTATFLSACW
STSLHEPTGPWSDLFRSRYSAKVSGAERLGDLPVFARQEPVPERGDAARN
RALLLEAARRLIARSGADAITMDDVAAAAGVGKGTLFRRFGSRAGLMMVL
LDEDERASQQAFLFGPPPLGPDAPPLDRLIAFGRERMRFVHAHHQLLSEA
NRDPQTRHSAALSVLRTHLRVLLASAPTTGDLDAQTDALLALLDVDYVEH
QLNAGGHTLQTLGDAWESLARKLCGR
>MT0726.1 hypothetical protein
MRRILRRCLPVVGPPAPRSGVSPAHSLVAINFKAEIVALQGWDRTRPCGR
WSYARPDASLAQGNLNSMPQITASAKSPLARVQRRAAGCYRHLTRH
>MT3701 PE_PGRS family protein
MSFVIAVPEFLSAAATDLANLGSTISAANAAASIPTTGVLAAGADDVSAA
IAALFGAHAQAYQTISAQAATFHAQFVQTLSAGAGAYANAEAANVQQSLL
NAINAPTQALLGRPLIGDGADGTAPGQNGGAGGLLYGNGGNGAAGVNAGI
AGGSGGAAGLIGNGGSGGAGGAGAAGGSGGQGGLLYGNGGAGGNGGAATI
PGGNGGAGGAGGNAWLFGNGGAGGLGAAGAAGAAGVNPLTVPAGQGSMGN
NGEPGGPGQPGTEFGQTGGTGGTGGTGLSVGGTGGTGGTGGTGGAGGSGG
RGGLLVGDGGAGGIGGTGGEGGIGARGGTGGQGGMGGAGQPGVGGDAGDG
GNGGIGGDGGAGGDGGAGGAGGLFGVSGSSGLGGAAGSGGNGGGGGEPGV
AGSPGVGPAGRGGDGNLGQFGPEGAPGQPGQPGQPG
>MT2364.1 hypothetical protein
MPDRCVLNEYDPSLQIAVAPGGAPEPGGTCRGAGMH
>MT1965 hypothetical protein
MVLSRTSTGRVILVPTQLRFDRWFLPLAVPLGLGPKNSELWVGAGSLHVK
MGWAFAADIPLTSITKAEATNARVYAAGVHFGFGRWLVNGSRKGLVALTI
DPPEQAKMWKKSMTVRELWVSVTDPDALVTACTAK
>MT1307 hypothetical protein
MLVETAAPPAFGKDSPMTTMITLRRRFAVAVAGVATAAATTVTLAPAPAN
AADVYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTSFTAC
GAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWACN
>MT0573.1 hypothetical protein
MIEYAVTLHRKPPGRQLTTTCDQGPGHHTVRLTPSGHDYCGRHG
>MT3290.2 hypothetical protein
MIDNNAFGVGAAAAVPTAGTPHSKNRVAAATDESVERGST
>MT1777 hypothetical protein
MRDTTFGPVVTRLCGWTYALSVVLLWVTCTAYAVLIAVSTTRIVIFRKEF
ADDLADPRRGFGMFTFVAASDVLGTRLVGQ
>MT1809 hypothetical protein
MPPASGHPRLSRRCHPLRRPPQWHGHRHVMPRGCAGARFACNACLNFLAG
LGISEPISPGWAAMERLSGLDAFFLYMETPSQPLNVCCVLELDTSTMPGG
YTYGRFHAALEKYVKAAPEFRMKLADTELNLDHPVWVDDDNFQIRHHLRR
VAMPAPGGRRELAEICGYIAGLPLDRDRPLWEMWVIEGGARSDTVAVMLK
VHHAVVDGVAGANLLSHLCSLQPDAPAPQPVRGTGGGNVLQIAASGLVGF
ASRPVRLATVVPATVLTLVRTLLRAREGRTMAAPFSAPPTPFNGPLGRLR
NIAYTQLDMRDVKRVKDRFGVTINDVVVALCAGALRRFLLEHGVLPEAPL
VATVPVSVHDKSDRPGRNQATWMFCRVPSQISDPAQRIRTIAAGNTVAKD
HAAAIGPTLLHDWIQFGGSTMFGAAMRILPHISITHSPAYNLILSNVPGP
QAQLYFLGCRMDSMFPLGPLLGNAGLNITVMSLNGELGVGIVSCPDLLPD
LWGVADGFPEALKELLECSDDQPEGSNHQDS
>MT2092 hypothetical protein
MSAQPRDTVAIAGSAHRHIIRPIQLRCLDAVPAAPATGDP
>MT1534 hypothetical protein
MSGLTSPKTYAVLAALQAGDAVACAIPLPPIARLLDDLDVPVSVRPVLPV
VKAASAVGLLSVTRFPALARLTTAMLTLYFILAVGAHVRVRDRVVNAIPA
ASFLTLFALMTAKGPERT
>MT0054 hypothetical protein
MAKWLGAPLARGVSTATRAKDSDRQDACRILDDALRDGELSMEEHRERVS
AATKAVTLGDLQRLVADLQVESAPAQMPALKSRAKRTELGLLAAAFVASV
LLGVGIGWGVYGNTRSPLDFTSDPGAKPDGIAPVVLTPPRQLHSLGGLTG
LLEQTRKRFGDTMGYRLVIYPEYASLDRVDPADDRRVLAYTYRGGWGDAT
SSAKSIADVSVVDLSKFDAKTAVGIMRGAPETLGLKQSDVKSMYLIVDPA
KDPTTPAALSLSLYVSSDYGGGYLVFAGDGTIKHVSYPS
>MT1114 conserved hypothetical protein
MNQILLSVIAEGGPGNTGPDFGKASPVGLLVIVLLVIATLFLVRSMNQQL
KKVPKSFDRDHPELDQAADEGTDRDGPARPPGPPHESG
>MT3098 PPE family protein
MPGRFMAAAVVPGVAGLAGVAGLAALPAVGAAAGAPAALVGSVAPVSGGV
VSPQARLVSAVEPAPASTSVSVLASDRGAGALGFVGTAGKESVGQPAGLT
VLADEFGDGAPVPMLPGSWGPDLVGVAGDGGLVSV
>MT0517 conserved hypothetical protein
MTGPHPETESSGNRQISVAELLARQGVTGAPARRRRRRRGDSDAITVAEL
TGEIPIIRDDHHHAGPDAHASQSPAANGRVQVGEAAPQSPAEPVAEQVAE
EPTRTVYWSQPEPRWPKSPPQDRRESGPELSEYPRPLRHTHSDRAPAGPP
SGAEHMSPDPVEHYPDLWVDVLDTEVGEAEAETEVREAQPGRGERHAAAA
AAGTDVEGDGAAEARVARRALDVVPTLWRGALVVLQSILAVAFGAGLFIA
FDQLWRWNSIVALVLSVMVILGLVVSVRAVRKTEDIASTLIAVAVGALIT
LGPLALLQSG
>MT0396 hypothetical protein
MVPLWFTLSALCFVGAVVLLYVDIDRRRGRSRRRKSWARSHGFDYEREST
EILKRWTRGVMSTVGDVAAHNVVLGQIRGEAVYIFDLEEVATVIALHRKV
GTNVVVDLRLKGLKEPRESDIWLLGAIGPRMVYSTNLDAARRACDRRMVT
FAHTAPDCAEIMWNEQNWTLVSMPIASTRAQWDEGLRTVRQFNDLLRVLP
PLPQEMPQQTGVGPRGAAPGRPVAPGGPAELPPRRAQPDPATTVLPDPAR
RAPEPIRRDEGRSEGVRRPPPAGRNGQQATNYQH
>MT2638 hypothetical protein
MAQLLESVIDAAKGMKLAKLEGDAAFFWAPGGNTSVLVCDRPPQMRQRFR
TRREQIKKDHPCDCKSCEQRDNLSIKFVAHEGEVAEQKVKRNVELAGVDV
ILVHRMLKNEVPVSEYLFMTDVVAQCLDESVRKLATPLTHDFEGIGETST
HYIDLATSDMPPAVPDHSFFGLLWADVKFEWHALPYLLGFKKACAGFRSL
GRGATEEPAEMG
>MT2547.2 hypothetical protein
MVRQPTTPLAAALGQGMSSSLATLLEVGATVTRCEVRPGQFAGKGTGLV
>MT0943 hypothetical protein
MMCYRPAGSPLPGPEPATSGKRAPLDESPRHEKLDGGAGIVAHDVMLQGA
GQPMAFGVPLTGSISAAGDHATVASIAQVARRATA
>MT0773.1 hypothetical protein
MVRKHAFHWRYDSTEELELLNQLWQLVSLRLNFFTPTKKALGFRP
>MT2723 hypothetical protein
MTTTPRQPLFCAHADTNGDPGRCACGQQLADVGPATPPPPWCEPGTEPIW
EQLTERYGGVTICQWTRYFPAGDPVAADVWIAADDRVVDGRVLRTQPAIH
YTEPPVLGIGPAAARRLAAELLNAADTLDDGRRQLDDLGEHRR
>MT1460 hypothetical protein
MTAAPNDWDVVLRPHWTPLFAYAAAFLIAVAHVAGGLLLKVGSSGVVFQT
ADQVAMGALGLVLAGAVLLFARPRLRVGSAGLSVRNLLGDRIVGWSEVIG
VSFPGGSRWARIDLADDEYIPVMAIQAVDKDRAVAAMDTVRSLLARYRPD
LCAR
>MT0313 hypothetical protein
MVVAVEQPNGTLLPELVQWLHVAALGAPLGNAGVAALREAASVVTALLC
>MT1216 hypothetical protein
MRSRQPLGVAAPRIPASALTPPVPGQFSPVRLARGAVAAVSVVGASTATA
VASANLGMLAGAGTAGAIVAAGVGLVATAAAAESRRLDHAPNALEQLAAV
VADALYAAGGAQRGSAALRLASDPEGWIRCQLDGVPTEQSLRFTAALDEL
LAPLAEPRYLIGRKILTPPARPVARRLFAVRAVVGLSLPGTVAWHAVPRW
FARNKDRRQHLAQAWRKHIGPPRQLPADSPQGQAILDLFRGDNPLSVTTQ
LRTTWR
>MT2652 hypothetical protein
MPAGVGNASGSVLDMTSVRTVPSAVALVTFAGAALSGVIPAIARADPVGH
QVTYTVTTTSDLMANIRYMSADPPSMAAFNADSSKYMITLHTPIAGGQPL
VYTATLANPSQWAIVTASGGLRVNPEFHCEIVVDGQVVVSQDGGSGVQCS
TRPW
>MT1718 hypothetical protein
MTMARVRRGTELLLSPQSPPATGGLIVLTGLRLLAGLIWLYNVVWKVPPD
FGERGRRDLYHFTHLAVEHPVFTPFSWVIEHAVLPYFTAFGWGVLFAESA
LAVLLLTGTAVRLAALIGIGQSVAIGLSVAESPGEWPWAYAMLLGIHVVL
LFTCSTRYAAVDAVRAAATGSAARTAAQRLLAGWGIVLGLIGLVAVWRGL
GDDRPAYVGIRALEFSLGEYNLRGALALIAIALAMLAAAKRGWRTVALVA
AVVAVAAAAAIYLQVGRTAVWLGGTNTTAAVFVCAAVVSLATEFRIGRVE
GA
>MT1529 conserved hypothetical protein
MTFVGVTQHTSTMTDPFLGSEALAAGVLTPYELRSRYVALHKDVYVPQGV
ELTAQLRAKALWLRSRRRGVLAGYSASAFHGAKWIDADLPAAIIDTNRRR
APGLQVWEERIEPDEICVIEGMRVTTPERTALDLTSRFPLDPAVAAVDAL
IQATDLKVADVEPLIERYRGRRGMKAARAALDLVDGGAQSPKETWLRLLL
IRAGFPRPQTQIAVRNEWGWAEAHLDMGWQDIKVAAEYDGDHHLTSRYHY
RKDILRHEKVQHRYGWIVVRVVAEDHPADIIRRVGEARAFRA
>MT3215 conserved hypothetical protein
MVQGRTVLFRTAEGAKLFSAVAKCAVAFEADDHNVAEGWSVIVKVRAQVL
TTDAGVREAERAQLLPWTATLKRHCVRVIPWEITGRHFRFGPEPDRSQTF
ACEASSHNQR
>MT1628 conserved hypothetical protein
MPDGHEGSLMVEPGNLAGATGAEWIGRPPHEELQRKVRPLLPSDDPFYFP
PAGYQHAVPGTVLRSRDVELAFMGLIPQPVTATQLLYRTTNMYGNPEATV
TTVIVPAELAPGQTCPLLSYQCAIDAMSSRCFPSYALRRRAKALGSLTQM
ELLMISAALAEGWAVSVPDHEGPKGLWGSPYEPGYRVLDGIRAALNSERV
GLSPATPIGLWGYSGGGLASAWAAEACGEYAPDLDIVGAVLGSPVGDLGH
TFRRLNGTLLAGLPALVVAALQHSYPGLARVIKEHANDEGRQLLEQLTEM
TTVDAVIRMAGRDMGDFLDEPLEDILSTPEVSHVFGDTKLGSAVPTPPVL
IVQAVHDYLIDVSDIDALADSYTAGGANVTYHRDLFSEHVSLHPLSAPMT
LRWLTDRFAGKPLTDHRVRTTWPTIFNPMTYAGMARLAVIAAKVITGRKL
SRRPL
>MT0846 fatty acid desaturase
MSAKLTDLQLLHELEPVVEKYLNRHLSMHKPWNPHDYIPWSDGKNYYALG
GQDWDPDQSKLSDVAQVAMVQNLVTEDNLPSYHREIAMNMGMDGAWGQWV
NRWTAEENRHGIXLRDYLVVTRSVDPVELEKLRLEVVNRGFSPGQNHQGH
YFAESLTDSVLYVSFQELATRISHRNTGKACNDPVADQLMAKISADENLH
MIFYRDVSEAAFDLVPNQAMKSLHLILSHFQMPGFQVPEFRRKAVVIAVG
GVYDPRIHLDEVVMPVLKKWRIFEREDFTGEGAKLRDELALVIKDLELAC
DKFEVSKQRQLDREARTGKKVSAHELHKTAGKLAMSRR
>MT1455 lipoprotein, 27 kDa
MQGMRTPRRHCRRIAVLAAVSIAATVVAGCSSGSKPSGGPLPDAKPLVEE
ATAQTKALKSAHMVLTVNGKIPGLSLKTLSGDLTTNPTAATGNVKLTLGG
SDIDADFVVFDGILYATLTPNQWSDFGPAADIYDPAQVLNPDTGLANVLA
NFADAKAEGRDTINGQNTIRISGKVSAQAVNQIAPPFNATQPVPATVWIQ
ETGDHQLAQAQLDRGSGNSVQMTLSKWGEKVQVTKPPVS
>MT2234.1 conserved hypothetical protein
MRYFYDTEFIEDGHTIELISIGVVAEDGREYYAVSTEFDPERAGSWVRTH
VLPKLPPPASQLWRSRQQIRLDLEEFLRIDGTDSIELWAWVGAYDHVALC
QLWGPMTALPPTVPRFTRELRQLWEDRGCPRMPPRPRDVHDALVDARDQL
RRFRLITSTDDAGRGAAR
>MT3966 hypothetical protein
MTAIGMSHPPRVHRRVGGQRTALTAGIGLLLAALVLTTIANPPAAFAHTA
QLSTATPAPAVAATDANDVPTWPFVVGTVAAVAVAALWAVRRGR
>MT3449.1 hypothetical protein
MAFAREPPRAMGSLGLPPAVKARASRLYAPLVLEEWSCR
>MT2196 lipoprotein, putative
MLTGNKPAVQRRFIGLLMLSVLVAGCSSNPLANFAPGYPPTIEPAQPAVS
PPTSQDPAGAVRPLSGHPRAALFDNGTRQLVALRPGADSAAPASIMVFDD
VHVAPRVIFLPGPAAALTSDDHGTAFLAARGGYFVADLSSGHTARVNVAD
AAHTDFTAIARRSDGKLVLGSADGAVYTLAKNPAVDPASGAATVASRTKI
FARVDALVTQGNTTVVLDRGQTSVTTIGADGHAQQALRAGQGATTMAADP
LGRVLIADTRGGQLLVYGVDPLILRQAYPVRQAPYGLAGSRELAWVSQTA
SNTVIGYDLTTGIPVEKVRYPTVQQPNSLAFDETSDTLYVVSGSGAGVQV
IEHAAGTR
>MT3959 hypothetical protein
MWRRSMGWVGKKKSTAGQLAGTANELTKEVLERAVHRESPVIRPDVVVGI
PAVDRRPKQ
>MT0411 conserved hypothetical protein
MRPRRALAGLAADVVAVLVFCAVGRRSHAEGLSVTGLAATAWPFLTGTGI
GWVLARGWRRPTALAPTGVIVWLCTIVVGMVLRKVSSAGVAASFVVVASA
VTAVLLLGWRAAVALMAPHRADG
>MT2191 conserved hypothetical protein
MLADGELTVLGRIRSASNATFLCESTLGLRSLHCVYKPVSGERPLWDFPD
GTLAGRELSAYLVSTQLGWNLVPHTIIRDGPAGIGMLQLWVQQPGDAVDS
DPLPGPDLVDLFPAHRPRPGYLPVLRAYDYAGDEVVLMHADDIRLRRMAV
FDVLINNADRKGGHILCGIDGQVYGVDHGLCLHVENKLRTVLWGWAGKPI
DDQILQAVAGLADALGGPLAEALAGRIAAAEIGALRRRAQSLLDQPVMPG
PNGHRPIPWPAF
>MT2505 PPE family protein
MRPPRPTTSRPLVNESLQSSRADGTTNKQMHFEAYPPEVNSANIYAGPGP
DSMLAAARAWRSLDVEMTAVQRSFNRTLLSLMDAWAGPVVMQLMEAAKPF
VRWLTDLCVQLSEVERQIHEIVRAYEWAHHDMVPLAQIYNNRAERQILID
NNALGQFTAQIADLDQEYDDFWDEDGEVMRDYRLRVSDALSKLTPWKAPP
PIAHSTVLVAPVSPSTASSRTDT
>MT3503 hypothetical protein
MRGCLVQSRKTTSVLAAALLFCGLLGPGTAPPATGGGPACRPAELFATDN
TTDGFELPAVATIALTGTVVTGSTLVDGVFWSNERQQIGYERSREFHLCV
VDAPTLHNAAEALHRQFNQEAVLTFDYLPQNAPEADAILITVPDIGIARF
RDAFASDLAAHHRLRGGSVTTADHTLILVAGNGDLDVARRLVEEAGGDWN
ATTIAHGRREFVN
>MT3186.1 hypothetical protein
MRGPRWQKVRSSSVKLSNQKRHWPGYLFGRIRTSTLVLIAAFLAVWWIYE
TYRPQAPGPGDSPPTQVVPPGFVPDPDYTWVPRTRVQPPTVKATPTTTSS
TPPVSPPETTTDSAVPPPFELPPPFGPGTTTPTPPAPLPQPGPGPTAGTY
PKSEPPTR
>MT3222 conserved hypothetical protein
MGWEFGVLLILIAVLAVFLAPRLIPRGPRGDLASGTLLVTGVSPRPDAGG
QQYVTIAGIITGPTVNEYAVYQRMAVDVDQWPTVGQILPVVYSPKNPDNW
TFTPNGPPVG
>MT2370.3 hypothetical protein
MATSSDDITINRHPPLNCAVNRHDESRRSPLRRGLLANGLRERQAGALFE
RYESQFDSFGYIEKVRYRGSGYRVEDVYARADSGPSAGAELPVGP
>MT2015.1 hypothetical protein
MLCCLGGRQAHHRSSRWLESRWRGTTIRHQQRARRAVIGCRILTASIRAR
RPEPSRPARLPPWSSKQLAANEPVVLPAPTRALSAGFQAQRRLMSLRTAN
SSTASWGARSAICLLLICTLRRVIRINCNVIAYTGAMMYFVSPILRIEQG
>MT2867.1 hypothetical protein
MYTPGKGPPRAGGVVFTRVRLIGGLGALTAAVVVVGTVGWQGIPPAPTGG
DAVQLRSTAAPMSTTMKSPIVATTDPSPFDPCRDIPFDVIQRLGLAYTPP
EAEEGLRCHFDAGNYQMAVEPIIWRTYAQTLPPDAIETTIAGHRAAQYWV
RKPTYHNSFWYSSCMVTFKTSYGVIQQSLFYSTVYSEPDVDCPSTNLQRA
NDLVPYYRF
>MT2190 DNA-binding protein, CopG family
MMHNSGMRTTVSLADDVAAAVQRLRKERSIGLSEAVNELIRAGLTKRQVA
NRFQQQTYDMGEGIDYSNIGDAIETLDGPASG
>MT2527.1 hypothetical protein
MAFRDILVLFSMKTLLTLAMAAASSTALTTVGVSGARLITYCVGVEDI
>MT0636 hypothetical protein
MPRKTVGAWQTADTMGIFQALPDVWGGWRTECWEDRFEEQLIRCNGALRL
PELDLAAGMDSAREWLRDRIFQRFSDSPAGQILKLSELLADVGPGLVVSD
DAVTNGGARPNNEEWARFVAACDLVRGLTPNRPDFGDSGTITLVEGY
>MT2983 hypothetical protein
MRGHPNPLQRRYLVSDPSVLDRAARIGARVYRHPRLPRRSDPPIAAGIQV
AHVRGIGWIAPCRIGNARKVFQVDQRGDQRGAVLEHQGDGVVGEAGAVLD
AVDAGVDQAGQRVLAENVRGDPGALSVCRVDGGFEHVIGPQRGKIADLTV
DPVTDQLDPAVTAAGLLGYRCRQLGFVFELDREAGDVTLGSGQVPSGADD
AGQVFVVVKAAGVGRRAAVPQQQRADVTFGLGLSDRLVEFDVAVFPKPDM
AVRVDQPGQDPAAVKDGVGSCHRFGANAAVDDPQLDRRLVGQAQTSHVQA
HGAATGPDGVDPLHPAPPHSRSPLLLAWELQLGQVEVGQAGRQLVETFGH
L
>MT0932 hypothetical protein
MVVHIVGLSIETTAPTDSAITPIMVREINIGEIPLGLRLGSDTTLLDAAL
AGG
>MT0973 hypothetical protein
MPRSWWPTAPRIWAYSGNGTPSAELSGNRRQALQSKYQAAQRRFNFPDGE
THTWAYWRAPLQAMLPDLQRVLGDGQRPRRGPRPRERRPASPSL
>MT3979 conserved hypothetical protein
MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSK
FNDTLQEFETTRSSTGTGLQGVTSGLANNLLAAAGAYLKADDGLAGVIDK
IFG
>MT2405 hypothetical protein
MSDESRGGFGYVDVWARTALCADTKLVPSWPRLVSACHSPDDAGTPPTST
LQCEAQ
>MT3476 PE_PGRS family protein
MRSMNLPQTPRPLAHIALASGCCDLAGCVLAAWGWVVMSFVVAVPEALAA
AASDVANIGSALSAANAAAAAGTTGLLAAGADEVSAALASLFSGHAVSYQ
QVAAQATALHDQFVQALTGAGGSYALTEAANVQQNLLNAINAPTQALLGR
PLIGDGAVGTASSPDGQDGGLLFGNGGAGYNSAATPGMAGGNGGNAGLIG
NGGTGGSGGAGAAGGAGGSGGWLYGNGGNGGIGGNAIVAGGAGGNGGAGG
AAGLWGSGGSGGQGGNGLTGNDGVNPAPVTNPALNGAAGDSNIEPQTSVL
IGTQGGDGTPGGAGVNGGNGGAGGDANGNPANTSIANAGAGGNGAAGGDG
GANGGAGGAGGQAASAGSSVGGDGGNGGAGGTGTNGHAGGAGGAGGAGGR
GGWLVGNGGNGGNGGNGAAGGNGAIGGTGGAGGVPANQGGNSALGTQPVG
GDGGDGGNGGTGGTGGRGGDGGSGGAGGASGWLMGNGGNGGNGGTGGSGG
VGGNGGIGGDGAGGGNATSTSSIPFDAHGGNGGAGGDAGHGGTGGDGGDG
GHAGTGGRGGLLAGQHANSGNGGGGGTGGAGGTHGTPGSGNAGGTGTGNA
DSTNGGPGSDGLGGDAFNGSRGTDGNPG
>MT3096 hypothetical protein
MVGLTRPLLLCGATLLIAACTRVVGGTASATFGGDRQGMLDVATILLDQS
RMQAITGSGDDLTIIPTMDTTYPVDVDDFAQPIPRECRFIYAETAVFGSE
IEAFHKTTFQDRPDGSLISEAAAAYRDAGTARRAFDTLAVTVHDCAASPA
GWLFVSRWTAGGNSLHIRAGDCGRDYRVLSAALLEVTFCGFPESVSDIVM
TNIAANVPG
>MT2924 hypothetical protein
MRDIRVHVGVQKGHQVDLVEQHYIPRAERRRVHQRLVLALGDRGDHRLGA
YRRRTPSGAPARARLTIGASRWHSAQKPCWLFSSDTWVPRPLAASRSVVM
SPSITPIANWSACSSSAAPSTVVFPDPGELIRSSARIPLFSNAARLTFAR
TSFSAKIFSNTSTARLATVSIPAAPP
>MT1401 hypothetical protein
MTDFYALSLTIFASQDSRIYCDGSGQDDFGGVLRVATAVGEVPTGL
>MT3174.1 hypothetical protein
MQEEIDERRRMNGQLKHTQRQRITGRCAGGPDPAR
>MT2620 lipoprotein, putative
MIAPQPIPRTLPRWQRIVALTMIGISTALIGGCTMGQNPDKSPHLTGEQK
IQLIDSMRHKGSYEAARERLTATAQIIADRVSAAIPGQTWKFNDDSYGQD
FYRNGSLCKELSADIARRPMAKPVDFGSTFSAEDFKIAANIVREEAAKYG
VTTESSLFNESAKRDYDVQGNGYEFNLGQIKFATLNITGDCFLLQKVLDL
PAGQLPPEPPIWPTTSTPTP
>MT0644 hypothetical protein
MGHVRPGFSPRLGSHRTLRPRWPPYAAASRGLTSGTSRWGWPRLGFGVVT
APTRWTLADGRELLFFSLPGPRTSGTAAERVARHAQAQTFAGDIRQRAIQ
LVVSEQEVASKITAATAGIATTTFPETPSIDDTIIGNDNRDTGVRLVDVK
QDGGTSPPPPFAPWDTPDGTPPPGTGLSPTLQQMILGGDPANLTGQGLAD
NVQRFVQSLPANDPNTAWLRGQVADLQAHVADIEYARTHCSTNDWIDRTA
QFASGAIVFSIGVLTAETGAGVVAAAAGGVGAATAGVSLLQCLVGSK
>MT1367.1 hypothetical protein
MFAKSEAALTSVSGAAITNDMLSSDQLCVEYPARVSTG
>MT2010 hypothetical protein
MFLPTNAQYQLLVVGVGPWDTPSPSGRISWGSAWPHQARRAQTCQRVRRH
WMIDTTEAAYRLTYQPDGTSITVRENLVDILARELLGPIRGPQEVLPFSP
RSQYLVGHLAPVKLTGAALIDDNAVQARANAEALAEGGGVPAYAADETTP
TPTTTPKTAHPSRA
>MT0356 hypothetical protein
MKMTSLIDYILSLFRSEDAARSFVAAPGRAMTSAGLIDIAPHQISSVAAN
VVPGLNLGAGDPMSGLRQAVAARHGFAQDVANVGFAGDAGAGVASVITTD
VGAGLASGLGAGFLGQGGLALAASSGGFGGQVGLAAQVGLGFTAVIEAEV
GAQVGAGLGIGTGLGAQAGMGFGGGVGLGLGGQAGGVIGGSAAGAIGAGV
GGRVGGNGQIGVAGQGAVGAGVGAGVGGQAGIASQIGVSAGGGLGGVGNV
SGLTGVSSNACXASNASGQAGLIASEGAALNGAAMPHLSGPLAGVGVGGQ
AGAAGGAGLGFGAVGHPTPQPAALGAAGVVAKTEAAAGVVGGVGGATAAG
VGGAHGDILGHEGAALGSVDTVNAGVTPVEHGLVLPSGPLIHGGTGGYGG
MNPPVTDAPAPQVPARAQPMTTAAEHTPAVTQPQHTPVEPPVHDKPPSHS
VFDVGHEPPVTHTPPAPIELPSYGLFGLPGF
>MT0400 PPE family protein
MDFGALPPEINSARIYSGPGSRPLMQAAAAWQRLANELTATAASYSSVIS
GLTGDDWLGPSALSMAAAAVPYVAWMRATAASAEQAAAQAVAAANAYESA
YAATVPPTVIAANRRTMLSLVKTNVFGQNTPAIATSEAQYGEMWAQDIVA
MEGYAGASAAASQLPPFTPPPATTSGAGSLSDAAATAAQAVVPAAAATDV
SLLPTLQSFLPPPFDAIPNPIEDLDVLVAAAVAVAAGSLGVSAAQLGEIY
RHDVVDEAQKAPHCPAESDQTPAGAAGDGDLPEVGGRVTSPPQPPVAALT
GYSANIGGLSVPHSWNLPPAVRQVAAMFPGATPMYMTGSSDGSYAGLAAA
GLAGTGLAGLAARGGSAPTPAAAAPAGAGGAGPAATRPAAQQTPAVPAAA
AGSAIPGLPPGLPPGVVANLAATLAAIPGATIIVVPPSPNANQ
>MT1908 fibronectin attachment protein
MHQVDPNLTRRKGRLAALAIAAMASASLVTVAVPATANADPEPAPPVPTT
AASPPSTAAAPPAPATPVAPPPPAAANTPNAQPGDPNAAPPPADPNAPPP
PVIAPNAPQPVRIDNPVGGFSFALPAGWVESDAAHLDYGSALLSKTTGDP
PFPGQPPPVANDTRIVLGRLDQKLYASAEATDSKAAARLGSDMGEFYMPY
PGTRINQETVSLDANGVSGSASYYEVKFSDPSKPNGQIWTGVIGSPAANA
PDAGPPQRWFVVWLGTANNPVDKGAAKALAESIRPLVAPPPAPAPAPAEP
APAPAPAGEVAPTPTTPTPQRTLPA
>MT0320 hypothetical protein
MAVIVRKWFGLGRLPADLRCQVEAEGLIYLAEYVAVTRRFTGVIPGLRAS
HSIASYVGALAFTEQRVLGTLSMVPKLAGRVVDARWDGPQAGAATAEISP
TGLQLDLDVADVDPKFSGQLALHFKATIGEDVLSRLPRRSLAFDVPAEYV
NLAVGVTYSP
>MT0220 hypothetical protein
MDELVAAIAPGLAGLGLPVINRREVVLVTGPWLAGVSGVRAALAERLPQR
RFVETAELGPGDAPVAVVFVVSAATALTESDCVLLDTAAEHTDAVVAVVS
KIDVHRGWRDVLTSNRDRLAARASRYARVPWVGAAAAPELGEPYLDDLVA
AIQKQLADPAVARRNMLRAWESRLLMVARRFDGDAQSAGRRARVDALRQQ
RRTVLRQGRQSKSEHTIALRAQIQHARVKLSYFARNRCSLLRVELQEHVA
GLSRKDIARFAAYTRGRVQEVVAEVGEGAVAHLADVAQLLGVPVQPPVLE
NLPAVLPTVVAPPLTSRRLEIRLTTLLGAGFGLGIALTLSRLVAGLTPGL
AASGMVAGVAIGLAVTAWVVNARALLHDRVVVDRWTGEVTASLRSVVEQL
VATRVVAVETLLSTAISERDDAENARVADQVSIIDGELLEHAVAAARAAA
LRDREMPAVRAALEAVRAELGEPGAPTTGLF
>MT1579 hypothetical protein
MTQLPQPTWRWWQQRETEQVQSSHIDGEIVGALIPDLAVLHSEDASRAAV
GREKHRCSLDPLGGGFRSRRASMPAGALLLSAVIAIQLDRMNARVFGDGW
IGAQACMWVNKFHEESTVTALSPSSPIAQGSIARHPETMQSAYVRIAEGG
SRDVAPAAQLQRRRP
>MT1998 hypothetical protein
MDRYNDQASGRALIEIRLCNERATPMPIPIGLWMFQTKLHVNAGGADVFL
PVCDVLEQDLAERDEEVRQLNLQYRNRLEYAIGRTCSAAWSVNGSRRPSA
VWTTWLPVAETPHTRARSVENALLSMDSRGGVT
>MT3156 hypothetical protein
MVLVEVPPDNPEVVAGWVSALADKWMFASRVG
>MT0359 lipoprotein, putative
MRLSLIARGMAALLAATALVAGCNTTIDGRPVASPGSGPTEPTFPTPRPT
TAPPGTTAPTLPTTPVSPTAPAGAIPLPPDSNGYVFIETKSGMTRCQINR
DSVGCEAPFTNSPLRDGEHANGIHITAGGSVQWVLGNLGAIPTVSIDYRT
YEAQGWTIDATTDGTRFTNNRTGHGMFVSIEKVDTF
>MT0198 conserved hypothetical protein
MSTVHSSIDQHPDLLALRASFDRAAESTIAHFTFGLALLAGLYVAASPWI
VGFSATRGLPTCDLIVGIAVAYLAYGFASALDRTHGMTWTLPVLGVWVIF
SPWVLPGVAVTAGMMWSHIIAGAVVAVLGFYFGMRTRAAANQG
>MT0012 hypothetical protein
MRLPLPVTPVAAKGERTWREGVRLNGPNGVSVYRHVPWRVHKVYSSDEPT
>MT3755 hypothetical protein
MTSDTSDHRAMPIEGPPGHRRTICWSAAYRYPGSMCVGFPKVLAAFRPHA
GDLVFLGDRRRLEDVRNTMALRWRPGAHQAR
>MT0324 hypothetical protein
MSQSRYAGLSRSELAVLLPELLLIGQLIDRSGMAWCIQAFGRQEMLQIAI
EEWAGASPIYTKRMQKALNFEGDDVPTIFKGLQLDIGAPPQFMDFRFTLH
DRWHGEFHLDHCGALLDVEPMGDDYVVGMCHTIEDPTFDATAIATNPRAQ
VRPIHRPPRKPADRHPHCAWTVIIDESYPEAEGIPALDAVRETKAATWEL
DNVDASDDGLVDYSGPLVSDLDFGAFSHSALVRMADEVCLQMHLLNLSFA
IAVRKRAKADAQLAISVNTRQLIGVAGLGAERIHRAMALPGGIEGALGVL
ELHPLLNPAGYVLAETSPDRLVVHNSPAHADGAWISLCTPASVQPLQAIA
TAVDPHLKVRISGTDTDWTAELIEADAPASELPEVLVAKVSRGSVFQFEP
RRSLPLTVK
>MT1479.1 hypothetical protein
MTIAISVNSPLFARRYFRNQFGSAEPHSRIEFLFDHRLNCQHPMGNMSPA
APGRFQMVTSQRCHKPAQCPR
>MT0554 conserved hypothetical protein
MSEAPNDKTTRGVVDILVYATARLLLVVAVSAAIFGVARLIGLTEFPVVV
ATLFGLIIAMPLGIWVFSPLRRRATAALAVAGERRRAERERLRARLRGES
LPEEQ
>MT1842 conserved hypothetical protein
MTINYQFGDVDAHGAMIRAQAASLEAEHQAIVRDVLAAGDFWGGAGSVAC
QEFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA
>MT2838 PPE family protein
MVPRGRSFPGCRGWPRRRAAARASVRHATAPNPSSCPSRRYRGRPRRGCP
HLRSSEFRSIEINRGIGKMDFGALPPEINSTRMYAGAGAAPLMAAGATWN
GLAVELSTTASSVESVIMQLTTEQWLGPASMSMVVAAQPYLAWLTYTAES
AAHAAAQAMASAAAFEAAFAMTVPPAEVAANRALLAALVATNVLGQNTPA
IMATEAHYGEMWAQDALAMYGYAASSAAAGRLNPLITPSQTANMAGLAGQ
AAAVSHAAAASTVQQVGLGSLISNLPNAVMGFASPLTSAADAAGLGGIIQ
DIEELLGITFVQNAINGAVNTTAWFVMATIPNAVFLGHAFAALNPATVTA
AADAVPAAAAAAGLAHTVTPVGVGGASLTASLGEASSVGGLSVPAGWSTA
APAMTSGTTALEGSGWAVPEEAGPVAAMPGMAGISGAAKXXGAYAGPRYG
FKPIVMPKQVVV
>MT0169 PE family protein
MSHLVTAPDMLATAAAHVDEIASTLRAANAAAAGPTCNLLAAAGDEVSAA
TAALFSAYGREYQAVVKQAAAFHSEFTRTLEAAGNAYAHAEAANAARVSH
ALDTINAPIRTLLGRAPLSPNGSSGAGGLPAIAQLAAESPITALIMGGTN
NPLPDPEYVTDINKAFIQTLFPGAVSQGLFTPEQFWPVTPDLGNLTFNQS
VTEGVALLNTAVNNQLALDNKVVAFGYSQSATIINNYINSLMAMGSPNPD
DISFVMIGSGNNPVGGLLARFPGFYIPFLDVPFNGATPANSPYPTHIYTA
QYDGIAHAPQFPLRILSDINAFMGYFYVHNTYPELMATQVDNAVPLPTSP
GYTGNTQYYMFLTQDLPLLQPIRDIPYAGPPIADLFQPQLRVLVDLGYAD
YGPGGNYADIPTPAGLFSIPNPFAVTYYLIKGSLQAPYGAIVEIGVEAGL
IGPEWFPDSYPWVPSINPGLNFYFGQPQVTLLSLMSGGLGNILHLIPPPV
FT
>MT0066.1 hypothetical protein
MESAESIQRLTEFEMKLKFARLSTAILGCAAALVFPASVASADPPDPHQP
DMTKGYCPGGRWGFGDLAVCDGEKYPDGSFWHQWMQTWFTGPQFYFDCVS
GGEPLPGPPPPGGCGGAIPSEQPNAP
>MT0117 hypothetical protein
MPVETLHSGDPITDVNGGGQRYIVLESKTVGDSCVVLELESRVNHQLQVI
EKSFPAGYHVGRAHHRIL
>MT3380 conserved hypothetical protein
MSRVSGTNEVSDGNETNNPAEVSDGNETNNPAEVSDGNETNNPAPVSRVS
GTNEVSDGNETNNPAPVTEKPLHPHEPHIEILRGQPTDQELAALIAVLGS
ISGSTPPAQPEPTRWGLPVDQLRYPVFSWQRITLQEMTHMRR
>MT0029 hypothetical protein
MAFDAAMSTHEDLLATIRYVRDRTGDPNAWQTGLTPTEVTAVVTSTTRSE
QLDAILRKIRQRHSNLYYPAPPDREQGDAARAIADAEAALAHQNSATAQL
DLQVVSAILNAHLKTVEGGESLHELQQEIEAAVRIRSDLDTPAGARDFQR
FLIGKLKDIREVVATASLDAASKSALMAAWTSLYDASKGDRGDADDRGPA
SVGSGGAPARGAGQQPELPTRAEPDCLLDSLLLEDPGLLADDLQVPGGTS
AAIPSASSTPSLPNLGGATMPGGGATPALVPGVSAPGGLPLSGLLRGVGD
EPELTDFDERGQEVRDPADYEHANEPDERRADDREGADEDAGLGKSESPP
QAPTTVTLPNGETVTAASPQLAAAIKAAASGTPIADAFQQQGIAIPLPGT
AVANPVDSARISAGDVGVFTATPLPLALAKLFWTARFNTSQPCEGQTF
>MT1478 hypothetical protein
MRASPAERVDVDGAYAGAGPHTQSVLEEDQRQRAPAGAEAEGPGRTG
>MT2611 hypothetical protein
MPMTNWMLRGLAFAAAMVVLRLFQGALINAWQMLSGLISLVLLLLFAIGG
VVWGVMDGRADAKASPDPDRRQDLAMTWLLAGLVAGALSGAVAWLISLFY
KAIYTGGPINELTTFAAFTALIVFLVGIVGVAVGRWLVDRQLAKAPVRHH
GLAAEHERAADTDVFSAVRADDSPTGEMQVAQPEAQTAAVATVEREAPTE
VIRTTESDTPTEVIRTDTEADQTKPGDEPKKD
>MT0870 hypothetical protein
MGHVESGHVVWMRSAIVAVALGVTVAAVAAACWLPQLHRHVAHPNHPLTT
SVGSEFVINTDHGHLVDNSMPPCPERLATAVLPRSATPVLLPDVVAAAPG
MTAALTDPVAPAARGPPAAQGSVRTGQDLLTRFCLARR
>MT0035 hypothetical protein
MLARHFGAGRKAHSRAVATLKADIQAWHPAGIQTPKPRCESDVFARIGHT
SHPSTRKSRVGPGASEAPLA
>MT2160 hypothetical protein
MMAGALFEPSFAAAHPAGLLRRPVTRTVVLSVAATSIAHMFEISLPDPTE
LCRSDDGALVAAIEDCARVEAAASARRLSAIAELTGRRTGADQRADWACD
FWDCAAAEVAAALTISHGKASGQMHLSLALNRLPQVAALFLAGHLGARLF
SIIAWRTYLVRDPHALSLLDAALAEHAGAWGPLSAPKLEKAIDSWIDRYD
PGALRRSRISARTRDLCIGDPDEDAGTAALWGRLYATDAAMLDRRLTEMA
HGVCEDDPRTLAQRRADALGALAAGADHLACGCGKPDCPSGAGNDERAAG
VVIHVVADASALDAQPDPHLSGDEPPSRPLTPETTLFEALTPDPEPDPPA
THAPAELITTGGGVVPAPLLAELIRGGATISQVRHPGDLAAEPHYRPSAK
LAEFVRMRDLTCRFPGCDVPAEFCDIDHSAPWPLGPTHPSNLKCACRKHH
LLKTFWTGWRDVQLPDGTVIWTAPNGHTYTTHPGSRIFFPTWHTTTAELP
QTSTAAVNVDARGLMMPRRRRTRAAELAHRINAERALNDAYMAERNKPPS
F
>MT0638.1 hypothetical protein
MIDVSLARRCEAHGYDYFRSDDPVAAAGFVVSAVWSCGRGPGNATGSGRL
PKPLRHS
>MT0885 conserved hypothetical protein
MTEHTPDIPLGSWLAALPDERLTQLLELRPDLAQPPPGSIAALAARAQAR
QSVKAATDELDFLRLAVFDALLVLQADTAPVPIVRLLAVIGDRAAQADVL
GALADLKQRALAWGETAVRVATDAGTALPWHPGQVTLEGSSRSGDQLADL
IAGLDPAQRDVLDKLLQGSPVGRTRDAAPGAPSDRPVPRLLAMGLLRRID
AETVILPRHVGQVLRGEQPGPMELTAPDPVVSTTTPDDADAAAAGAVIDL
LREVDVLLENLGATPVAELRSGGLGVREFKRLAKATGIDEPRLGLILEIA
AAAGLIASGMPDPEPPHSDGPFWAPTVAADRFATMSPAERWHLLASAWLD
LPGRPALIGTRGPDAKPYGALSDSLFSTAAPLDRRLLLGMLAELPAGAGV
DASRASATLIWRRPRWARRLQPAPIADLLTEGHALGLVGRGAISTPARAL
LDEALEPATAPAAAVGVMARALPKPIDHFLVQADLTVVVPGPLQRELADD
LTTVATVESAGTAMVYRVSEQSIRHALDVGKSRDSLQEFFANRSKTPVPQ
GLTYLIDDVARRHGQLRIGMAASFVRCEDPTLLAQVVAAPEADGLALRAL
APTVAVSPAPISEVLVTLRGAGFAPAAEDSTGAVVDVRTRGARVPTPQRR
RPYRPPPRPNSEALKAVVAVLREVTAAPFANVRVDPAVTMSLLQRAAKDQ
ATLVISYLDAAGVATQRVVAPITLRGGQLVAFDSSSGRLRDFAIHRITLV
VSAHDR
>MT1129 hypothetical protein
MTVPPAGPYGNYPYGPNTYGQDPYWGGQPQGGSYPPAYPPQQYPPGWPAG
PYPPGPPPPGPGSKTPWLILAGLAVLGVILLVVILVIGLRGDNKSTTATS
PATSAPTSQPFSQQTATGCTPNVSGGVQPIGDSISAGKLSFPTSAAPGWS
AFSDDQNPNLIDAVGVGHEVAGADQWMMQAEVAITNFVTTMDVAAQASKL
MQCVADGPGYAGSSPTLGPTKTSSITVDGVRAARVDADITIADSSRNVKG
DSVTIIAVDTKPVTVFLGATPIGDATSRATVERVIEALKVNKS
>MT0611 lipoprotein, MK35
MKHFTAAVATVALSLALAGCSFNIKTDSAPTTSPTTTSPTTSTTTTSATT
SAQAAGPNYTIADYIRDNHIQETPVHHGDPGSPTIDLPVPDDWRLLPESS
RAPYGGIVYTQPADPNDPPTIVAILSKLTGDIDPAKVLQFAPGELKNLPG
FQGSGDGSAATLGGFSAWQLGGSYSKNGKLRTVAQKTVVIPSQGAVFVLQ
LNADALDDETMTLMDAANVIDEQTTITP
>MT2164 conserved hypothetical protein
MRTTVTLDDDVEQLVRRRMAERQVSFKKALNDAIRDGASGRPAPSHFSTR
TADLGVPAVNLDRALQLAADLEDEELVRRQRRGS
>MT0836 conserved hypothetical protein
MCSGPKQGLTLPASVDLEKETVITGRVVDGDGQAVGGAFVRLLDSSDEFT
AEVVASATGDFRFFAAPGSWTLRALSAAGNGDAVVQPSGAGIHEVDVKIT
>MT1395 conserved hypothetical protein
MARTLALRASAGLVAGMAMAAITLAPGARAETGEQFPGDGVFLVGTDIAP
GTYRTEGPSNPLILVFGRVSELSTCSWSTHSAPEVSNENIVDTNTSMGPM
SVVIPPTVAAFQTHNCKLWMRIS
>MT0987 hypothetical protein
MKTLYLRNVPDDVVERLERLAELAKTSVSAVAVRELTEASRRADNPALLG
DLPDIGIDTTELIGGIDAERAGR
>MT3899.1 hypothetical protein
MPSRRKSPQFGHEMGAFTSARAREVLVALGQLAAAVVVAVGVAVVSLLAI
ARVEWPAFPSSNQLHALTTVGQVGCLAGLVGIGWLWRHGRFRRLARLGGL
VLVSAFTVVTLGMPLGATKLYLFGISVDQQFRTEYLTRLTDTAALRDMTY
IGLPPFYPPGWFWIGGRAAALTGTPAWEMFKPWAITSMAIAVAVALVLWW
RMIRFEYALLVTVATAAVMLAYSSPEPYAAMITVLLPPMLVLTWSGLGAR
DRQGWAAVVGAGVFLGFAATWYTLLVAYGAFTVVLMALLLAGSRLQSGIK
AAVDPLCRLAVVGAIAAAIGSTTWLPYLLRAARDPVSDTGSAQHYLPADG
AALTFPMLQFSLLGAICLLGTLWLVMRARSSAPAGALAIGVLAVYLWSLL
SMLATLARTTLLSFRLQPTLSVLLVAAGAFGFVEAVQALGKRGRGVIPMA
AAIGLAGAIAFSQDIPDVLRPDLTIAYTDTDGYGQRGDRRPPGSEKYYPA
IDAAIRRVTGKRRDRTVVLTADYSFLSYYPYWGFQGLTPHYANPLAQFDK
RATQIDSWSGLSTADEFIAALDKLPWQPPTVFLMRHGAHNSYTLRLAQDV
YPNQPNVRRYTVDYGPPSSPTRVSSSRTLARSCWPSASRRRARDGYRSRP
TPYRRPATIYLRARRGSKLPDRPVRRCGGGSARRCAGHRHPTAAXQPDHR
AIELAPKRHVRQCRGTADWLRGHRLEHHRPLPGRRRTGRIAEHRQDGVVV
NGAQAGA
>MT2739 hypothetical protein
MIVVRTAEAAEQALTEGQLVCPRRGCGDTLRRWRYGRRRHVRSLGSQVID
VRPQRVRCRRCESTHVLLPAALQPRLGRGGGGQLRPGVWCTGR
>MT1569 hypothetical protein
MRCGCLACDGVLCANGPGRPRRPALTCTAVATRTLHSLATNAELVESADL
TVTEDICSRIVSLPVHDHMAIADVARVVAPFGEGLARGG
>MT0852 hypothetical protein
MGAQTGARSRAHRRFPPDARSGDLSKATWLCLLSMLPETNQDEVQPNAPV
ALVTVEIRHPTTDSLTESANRELKHLLINDLPIERQAQDVSWGMTAPGGA
PTPVADRFVRYVNRDNTTAASLKNQAIVVETTAYRSFEAFTDVVMRVVDA
RAQVSSIVGLERIGLRFVLEIRVPAGVDGRITWSNWIDEQLLGPQRFTPG
GLVLTEWQGAAVYRELQPGKSLIVRYGPGMGQALDPNYHLRRITPAQTGP
FFLLDIDSFWTPSGGSIPEYNRDALVSTFQDLYGPAQVVFQEMITSRLKD
ELLRQ
>MT1746.1 hypothetical protein
MWTLKARKEHTGISGKPTARTDRHGSTRSGDSELQASARRFSRLPDRCGA
QGVT
>MT0536 hypothetical protein
MDHFRCCLIGPWSVSAAAGSIESMFDQVRGRMPSPEAIAHFDERFECHAP
RTTRVSAAFIDRICSATRAENRAAAAQLVALGELFAYRWSRCGGREEWVM
DTMAAVAAEVAAALRISQGLAASRLRYARAMRERLPKTAEVFSAGDIGYL
MFATIVYRTDLIVDPDVLAAVDAQLAANVARWPSMTKARLAGQVDKIVAR
ADADAVRRRKEYQAQRQFWVGESQDGVCQIGGSLLAVDAHALDARLSALA
GTVCEHDPRSREQRRADALGALAGGADRLGCGCGRADCAAGKRPAAPPVV
IHLIAEAATINGTGSAPASQMNADGLITAELVAELAKTATLVPLVHPGDA
PPEPGYAPSKALADFVRCRDLTCRWPGCDEPATNCDLDHTIPYAAGGPTH
ASNLKCYCRTHHLVKTFWGWRDQQLPDGTLILTSPSGHTYVSTPGSALLF
PSLCHFSGGIPAPEADPPYDHCDQRTAMMPKRRRTRAQDRAYRIATERRQ
NHAARQRAQVLTQTAAATDTHGPPPDHNDDPPPF
>MT1849.1 hypothetical protein
MITITSLLAWAAREIYSPTLALLWPAASVSALCDQLAARQRRRLPIWRHV
FTLDGSTRASSSPRTMPSGWPPDDRAGSCLDHWRSCIGGRAGRPTRERNI
PQHNALKFTQNTIRGQWPRCHWQSFGPR
>MT3724 PE family protein
MSIMHAEPEMLAATAGELQSINAVARAGNAAVAGPTTGVVPAAADLVSLL
TASQFAAHAQLYQAISAEAMAVQEQLATTLGISAGSYAATEAANAATIA
>MT0129 hypothetical protein
MGEFDPKLRFAQSPVARLATSTPDGTPHLVPVVFALGARRPAEATGADVI
YTAVDAKRKTTQRLRRLANLEHNPRASVLVDSYADDWTQLWWVRADGVAA
IHRDGEVMRAAYRLLRAKYTQYQSVPLNGPVIAIAVQRWASRHA
>MT2873 hypothetical protein
MKTNPRYGPAFYSVMTVLFLALFVLNVCTHGSTLGLISTGGLAVLMGYIG
YRGWSGKRHINRQ
>MT3042 hypothetical protein
MSATAGKPTFSHNATERYPMFTARIRALAGMSLLASAIGLAAFGAATGTA
NAAPTHQPEWGTYTCYDYATQTFYECFDPS
>MT2480 hypothetical protein
MPPRCCISQPPAAAFSDRVGFDDRRWPAAPWRAGASGLTGLLGKTVWRPR
VSTDPALVRAIYFLPYRSPWCRGWLVIAGVLAPRVHLDSV
>MT1067 conserved hypothetical protein
MTTSRQGDSSMASRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNI
SGAGWSGMAEATSLDTMTQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQ
ASQQILSS
>MT0369 PPE family protein
MRVSVCVIYIPFKGCVKHVSVTIPITTEHLGPYEIDASTINPDQPIDTAF
TQTLDFAGSGTVGAFPFGFGWQQSPGFFNSTTTPSSGFFNSGSGGASGFL
NDAAAAVSGLGNVFTETSGFFNAGGVGNSGFQNFGNLLSGWANLGNTVSG
FYNTSMLDLATQALISGFGNHGARLSGILNNGSGP
>MT3920 PE_PGRS family protein
MSFVVTVPEAVAAAAGDLAAIGSTLREATAAAAGPTTGLAAAAADDVSIA
VSQLFGRYGQEFQTVSNQLAAFHTEFVRTLNRGAAAYLNTESANGGQLFG
QIEAGQRAVSAAAAAAPGGAYGQLVANTATNLESLYGAWSANPFPFLRQI
IANQQVYWQQIAAALANAVQNFPALVANLPAAIDAAVQQFLAFNAAYYIQ
QIISSQIGFAQLFATTVGQGVTSVIAGWPNLAAELQLAFQQLLVGDYNAA
VANLGKAMTNLLVTGFDTSDVTIGTMGTTISVTAKPKLLGPLGDLFTIMT
IPAQEAQYFTNLMPPSILRDMSQNFTNVLTTLSNPNIQAVASFDIATTAG
TLSTFFGVPLVLTYATLGAPFASLNAIATSAETIEQALLAGNYLGAVGAL
IDAPAHALDGFLNSATVLDTPILVPTGLPSPLPPTVGITLHLPFDGILVP
PHPVTATISFPGAPVPIPGFPTTVTVFGTPFMGMAPLLINYIPQQLALAI
KPAA
>MT3750 conserved hypothetical protein
MSQLSFFAAESVPPAVADLSGVLAGPGQIVLVGCGARLSVVVAESWRASA
LAEMIQEAGLVPEVARTDENTPLVRTAVDPLLCGIAAEWTRGAVKTVPPR
WLPGPRELRAWTLAAGSPEADRYLLGLDPHAPDTHSPLASALMRVGIAPT
LIGTRGTRPALRISGRRRLSRLVENVGEPPDGAEAWVQWPRT
>MT1236 conserved hypothetical protein
MTINYQFGDVDAHGAMIRAQAGLLEAEHQAIVRDVLAAGDFWGGAGSAAC
QGFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA
>MT3254 hypothetical protein
MKRLIALGIFLIVGIELLALILHDRRLVLAGSGLALALVLLNVRRMLGNR
DELTAAPDSDDLGEGLRRWLSNTETTIRWSESTRADWDRHLRPMLARRFE
IATGHRQAKDPVAFAATGRMLFGDELWEWVNPNNVTHTGDRQPGPGRAAL
EEILQKLEQV
>MT2244 hypothetical protein
MRDGPAAPAQVVAPADGFVALRVADDRTVRLLSLGGAATDRLLSRIAAGI
DAAVDEVVAFWGTDWSHDIFVVAAGSDEQFHAAAGGGLASQWADIAAITV
VDRVDPARRTVVGQRIVFAPGAAHMSPAALRIVLGHELFHYAARADTALD
APRWLAEGVADFVARPKTPPPADAVSVALSLPSDTDLDTPGPQRSLAYDR
AWWFARFVAAAYGTAKLRELYLATCGVGHFDLATAAHDVLGIDAAGLLAR
WQRWLMG
>MT0107 hypothetical protein
MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATE
DSVLAYGNFTIGESAYIRSTGHFNAVELILCFNQLAYSAFAPAVLNEEIR
VLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKFSARLLCRDLQVIER
TWRYLKVPCVIEFWDENGGAASGEIELAALNIP
>MT3080.1 hypothetical protein
MCHGAGPPRCVTGVAAFIGASSGAVSVAFVTTTTTRRDEAMRVLVTKPDG
TQVEVHLDQGFRFLGTETVDND
>MT1433 mIHF
MRDGGIVALPQLTDEQRAAALEKAAAARRARAELKDRLKRGGTNLTQVLK
DAESDEVLGKMKVSALLEALPKVGKVKAQEIMTELEIAPTRRLRGLGDRQ
RKALLEKFGSA
>MT2330.1 hypothetical protein
MRRMPTQSAQFWAFRQQVGLTSAMVAGPVSAGRKRLPITAPLVGAITSLA
LGRFACPNSG
>MT0768 PE_PGRS family protein
MIAAPEAIAAAATDLASIGSTIGAANAAAAANTTAVLAAGADQVSVAIAA
AFGAHGQAYQALSAQAATFHIQFVQALTAGAGSYAAAEAASAASITSPLL
DAINAPFLAALGRPLIGNGADGAPGTGAAGGAGGLLFGNGGAGGSGAPGG
AGGLLFGNGGAGGPGASGGALG
>MT0886 hypothetical protein
MCSVIADQRRPDQPCGVGGCKTCQNGFVADIAEGKARKTRYVDHGWPTTD
PDDHAVSELVTDRTGALSPFGELTFPVPSDDLPYIHPVTVINR
>MT3542.1 hypothetical protein
MVGRAVPSPNRRYRRVWPPRTKGQHLSNPYAQHQLKLIRHTGALILWQQR
TYVVSGTREQCEAAYKSAQTYNLLVGWWSLVSLLAMNWIALISNFNAIRR
VRAAADGASVPHGPHAIAHPAVPRGPIPAGWYPDPSGAGLRYWDGATWTH
WTHPPRHR
>MT2650 hypothetical protein
MYPCERVGLSFTETAPYLFRNTVDLAITPEQLFEVLADPQAWPRWATVIT
KVTWTSPEPFGAGTTRIVEMRGGIVGDEEFISWEPFTRMAFRFNECSTRA
VGAFAEDYRVQAIPGGCRLTWTMAQKLAGPARPALFVFRPLLNLALRRFL
RNLRRYTDARFAAAQQS
>MT3848 hypothetical protein
MSPIDALFLSAESREHPLHVGALQLFEPPAGAGRGFVRETYQAMLQCREI
APLFRKRPTSLHGALINLGWSTDADVDLGYHARRSALPAPGRVRELLELT
SRLHSNLLDRHRPLWETHVIEGLRDGRFAIYSKMHHALVDGVSGLTLMRQ
PMTTDPIEGKLRTAWSPATQHTAIKRRRGRLQQLGGMLGSVAGLAPSTLR
LARSALIEQQLTLPFGAPHTMLNVAVGGARRCAAQSWPLDRVKAVKDAAG
VSLNDVVLAMCAGALREYLDDNDALPDTPLVAMVPVSLRTDRDSVGGNMV
GAVLCNLATHLDDPADRLNAIHASMRGNKNVLSQLPRAQALAVSLLLLSP
AALNTLPGLAKATPPPFNVCISNVPGAREPLYFNGARMVGNYPMSLVLDG
QALNITLTSTADSLDFGVVGCRRSVPHVQRVLSHLETSLKELERAVGL
>MT2182.1 PPE family protein
MTFPMWFAVPPEVPSAWLSTGMGPGPLLAAARAWHALAAQYTEIATELAS
VLAAVQASSWQGPSADRFVVAHQPFRYWLTHAATVATAAAAAHETAAAGY
TSALGGMPTLAELAVDFTSRPDMPTTSALLSSATSRMVAIGCLMPMLTTS
>MT3058 hypothetical protein
MDLRTPRPTCGKLSHCRRERTAPVYESKEVAAQVTGESDGPPRAVLIAAA
ALAAAVIGVILVVAANRQPPERPVVIPAVPAPQATGPGCKALLAALPQRL
GEYRRAPVAEPTTAGATAWRTGPNSTPVILRCGLDRPAEFVVGSAIQVVD
RVQWFQVAAQNPDEPGRSTWYTVDRPVYVALTLPSGSGPTAIQELSDVID
HTIPAVPIDPAPAR
>MT3290.1 conserved hypothetical protein
MKQDVSVLTVPRQTPRQRLPVLPCHVGDPDLWFADTPAGLEVAKTLCVSC
PIRRQCLAAALQRAEPWGVWGGEIFDQGSIVSHKRPRGRPRKDAVA
>MT4013 hypothetical protein
MMQQAVSGITGALGGAVGGVMGPLTQLPQQAMQAGQGAMQPLMSALQQTY
GAEGLDVADGARLVDSIEGEPGLGGEPGAGDVGAGGGGGGTTPTGYLGPP
PVPTSSPPTTPAGAPAKSVTPDPVSGTPRASGPAGMTGMPMVPPGALGAG
AEGANKDKPVEKRVTAPAVPNGQPVKGRLTVPPSVPVKSADDKPVVTKST
RRILVVPNDDKVKE
>MT2134 conserved hypothetical protein
MAMVNTTTRLSDDALAFLSERHLAMLTTLRADNSPHVVAVGFTFDPKTHI
ARVITTGGSQKAVNADRSGLAVLSQVDGARWLSLEGRAAVNSDIDAVRDA
ELRYAQRYRTPRPNPRRVVIEVQIERVLGSADLLDRA
>MT0458 PPE family protein
MTSPHFAWLPPEINSALMFAGPGSGPLIAAATAWGELAEELLASIASLGS
VTSELTSGAWLGPSAAAMMAVATQYLAWLSTAAAQAEQAAAQAMAIATAF
EAALAATVQPAVVAANRGLMQLLAATNWFGQNAPALMDVEAAYEQMWALD
VAAMAGYHFDASAAVAQLAPWQQVLRNLGIDIGKNGQINLGFGNTGSGNI
GNNNIGNNNIGSGNTGTGNIGSGNTGSGNLGLGNLGDGNIGFGNTGSGNI
GFGITGDHQMGFGGFNSGSGNIGFGNSGTGNVGLFNSGSGNIGIGNSGSL
NSGIGTSGTINAGLGSAGSLNTSFWNAGMQNAALGSAAGSEAALVSSAGY
ATGGMSTAALSSGILASALGSTGGLQHGLANVLNSGLTNTPVAAPASAPV
GGLDSGNPNPGSGSAAAGSGANPGLRSPGTSYPSFVNSGSNDSGLRNTAV
REPSTPGSGIPKSNFYPSPDRESAYASPRIGQPVGSE
>MT3877 hypothetical protein
MLSGIQQNTLMDNDPLAHGYYVADLLVALAVVVLMLRARRTRPELARMLL
LGTLIGLVWELPVFGLSAWTNTPIIEWATPLPLPTVVFLLAHSVWDGPLL
TMGWLLARALTGEPAGALGLTVQVLWGQLTALAVELSAILAGTWSYVDDL
WFNPVMFWFRGHPVTAAMQLTWLLAPLCFAALVRRLALTAR
>MT0620 hypothetical protein
MLHSSFGHLEGIQQPLIDELAELDHVLGKLPDAYRIIGRAGGIYGDFFNF
YLCDISLKVNGLQPGGPVRTVKLFGQPTGRCTPQ
>MT1026 hypothetical protein
MTRPWPTVVQGGAGQRRRRDVEPDRKTPVRWMSGQRLSEITWPTTDIEHS
VGAAEVQRHRGAVPLGSGGDAAGKVEGGRTPQPFVQP
>MT0772.2 hypothetical protein
MGAAAPGISAKIPGARALCQRPVAGRRDDGYPGGAGFAARPVPGTAARLS
ATPPRPVSRPAAGACETCGSAACGAGSRYR
>MT2146 hypothetical protein
MPVSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPR
PLVIVHGPLFQAVKAARAQVYGRLIQLRHARCEVLDERWQLRPTGQRDVR
ALLIDVLNVLLAAITAAGVERAYACAERRAMAAAVVAKNYRDALGVELQC
NSVCRAAAEAIHALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVV
AAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRRLVQLWQARRA
VTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGTAGDGGGGRRQDSRR
RNGPRRPARRGTSRGRRCAPRVAIGWHTPIGDPLAVEGVEEIGASLPGRE
STPSDDGGSLHPSGRPRRVHRRRWCGLGLC
>MT3557 serine esterase, cutinase family
MGAGALITAVVLLIALGAVWTLVAFADGCPDAEVTFARGTGEPPGIGRVG
QAFVDSLRQQTGMEIGVYPVNYAASRLQLHGGDGANDAISHIKSMASSCP
NTKLVLGGYSQGATVIDIVAGVPLGSISFGSPLPAAYADNVAAVAVFGNP
SNRAGGSLSSLSPLFGSKAIDLCNPTDPICHVGPGNEFSGHIDDYIPTYT
TQAASFVVQRLRAGSVPHLPGSVPQLPGSVLQMPGTAAPAPESLHGR
>MT2028 conserved hypothetical protein
MRWIVDGMNVIGSRPDGWWRDRHRAMVMLVERLEGWAITKARGDDVTVVF
ERPPSTAIPSSVVEVAHAPKAAANSADDEIVRLVRSGAQPQEIRVVTSDK
ALTDRVRDLGAAVYPAERFRDLIDPRGSNAARRTQ
>MT3183 conserved hypothetical protein
MTTPGRPLTTLDKSDVLAGLFAVWHSLDALLDGLLETDWQATSPLPGWDV
KAVVSHIIGTESFLLGIAAPEPDTDVSALAHVRNPIGVMNECWVRHLGTE
SGVGLLERFRAVTSQRRKVLASLSDDEWNAPTTTPSGPDSYGRFMRIRIF
DCWMHEQDIRAAVQRPSSDDELGGPASPLVLDEIAATMGFVVGKLAKAPD
GSRVLLELTGPLSRSIRVSVDGRARVVDDFGGPAPTATIRLDGLQFTRLA
GGRPMSPARSQDVELGGDKELAGHILERLNFVI
>MT2768 conserved hypothetical protein
MAPGNQQVYGGRWHRWNTPAVARNFHNVGDVSGEAMGAQGYLRRLTRRLT
EDLEQRDVEELSDEVLNAGAQRAIDCQRGQEVTVVGTLRSVETNGKGCSG
GVRAELFDGSDTVTLVWLGQRRIPGIDTGRTLRVRGRLGKLENGTKAIYN
PHYEIQR
>MT2228 conserved hypothetical protein
MRDAARPTGGPIGGHGSTRLTKRSRVTLNTIALELVPPNLEGGKERAIED
ARKVVQYSAASGLDGRIRHVMMPGMIAEDDDRPIPMQPKLDVLDFWSIIK
PELAGVHGLCTQVTAFMDEPSLHRRLVDLSDAGMEGIVFVGVPRTMQDGE
GSGVAPTDALSLYRQLVANRGVIVIPTRDGEQGRLNFKCSRGATYGMTQL
LYSDAIVGFLREFARTTEHRPEILLSFGFVPKVETRIGLINWLIQDPGNA
AVADEQAFVQKLAGSEPARRRRLMVDLYKRVLDGVADLGFPLSIHLEATY
GVSAAAFETFAEMLAYWSPAEPGKPD
>MT0903 hypothetical protein
MLGTTAFGHGLSSTRASIRSAALRRMSRSLRFSVASRSASHSCRARRAAL
TNSAPDSDTETSTCRPSMGCGARSTKPMSANEAITLVIDGGRTRSRIANA
PGVIAPSLASVVNAESWDSETGDDGFRNRSWRESRMTANDKSLASRASLS
STAEVTTEVQTRASETANDDFGSTHCRPTTHQQVVVPPAGAEATHPGPFA
DASSPRYVDSG
>MT3041.1 hypothetical protein
MTARVAVNFLVPMMVTASTGRFGAVAQRELRAENRLARISPPVHARRRLA
SPGQRELSGWALSRSEPTWGRGRPAKNAPARPAHERVRKLKGHPLCTVEE
PRRPGLRQVRSKERSVDRTGGSAQRGPELGQFSHLAHPISLSKNVHASPT
GPVRMVQRAFRRRKRRPPPSPR
>MT0981 antigen 34 kDa
MTYSXGNPGYPQAQPAGSYGGVTPSFAHADEGASKLPMYLNIAVAVLGLA
AYFASFGPMFTLSTELGGGDGAVSGDTGLPVGVALLAALLAGVALVPKAK
SHVTVVAVLGVLGVFLMVSATFNKPSAYSTGWALWVVLAFIVFQAVAAVL
ALLVETGAITAPAPRPKFDPYGQYGRYGQYGQYGVQPGGYYGQQGAQQAA
GLQSPGPQQSPQPPGYGSQYGGYSSSPSQSGSGYTAQPPAQPPAQSGSQQ
SHQGPSTPPTGFPSFSPPPPVSAGTGSQAGSAPVNYSNPSGGEQSSSPGG
APV
>MT2842 conserved hypothetical protein
MTRRTLYVQLIIAFMCVAMVAYLVMLGRVAVAMIGSGRAAAAGLGLALLI
LPVIGLWAMIATLRAGFAYQRLARLIAEDGLDIDASALPRRASGRIQRDA
ADALFAAVRTELEDDADDWRRWYRLARAYDYAGDRRRAREAMKTALQLEG
RARPGAR
>MT1623 conserved hypothetical protein
MLANSREELVEVFDALDAELDRLDEVSFEVLTTPERLRSLERLECLVRRL
PAVGHTLINQLDTQASEEELGGTLCCALANRLRITKPDAALRIADAADLG
PRRALTGEPLAPQLTATATAQRQGLIGEAHIKVIRALFRPPARRGGCVHP
PGRRSRPGRQSRSISSRRAGPLRPAGHGLATPRRRPHRHRTRPQTRHHPE
QPAIRRHVTAKWLPDPPSAGHL
>MT1070.1 hypothetical protein
MPDSWPVVCVDDWSVAGLETQGQHPHDWLKHSSQKRTWLFKPARPERDRL
LGEDVAEKLASELARLRDVSTTRGEAHPSCKC
>MT0009 hypothetical protein
MLVAYIDESGNTGDPANGGSMTFALGCVLVDADNRPTAFDGLLSF
>MT3388 hypothetical protein
MHEVGGPSRGDRLGRDDSEVHSAIRFAVVAAVVGVGFLIMGALLVSTCSG
VDTAACGPPQRILLALGGPLILCAAGLWAFLRTYRVWRAEGTWWGWHGAG
WFLLTLMVLTLCIGVPPIAGPVMAP
>MT2358 serine esterase, cutinase family
MGAAAAMLAAVLLLTPITVPAGYPGAVAPATAACPDAEVVFARGRFEPPG
IGTVGNAFVSALRSKVNKNVGVYAVKYPADNQIDVGANDMSAHIQSMANS
CPNTRLVPGGYSLGAAVTDVVLAVPTQMWGFTNPLPPGSDEHIAAVALFG
NGSQWVGPITNFSPAYNDRTIELCHGDDPVCHPADPNTWEANWPQHLAGA
YVSSGMVNQAADFVAGKLQ
>MT0706.1 hypothetical protein
MSVNDGVDQMGAEPDIMEFVEQMGGYFESRSLTRLAGRLLGWLLVCDPER
QSSEELATALAASSGGISTNARMLIQFGFIERLAVAGDRRTYFRLRPNAF
AAGERERIRAMAELQDLADVGLRALGDAPPQRSRRLREMRDLLAYMENVV
SDALGRYSQRTGEDD
>MT3629 hypothetical protein
MMLDRLRQGGYWLVRGKINLIDRAFTSCRIESFADLGAVWGVEGAYTFRA
LDKYPVKEAVLVDGRITPTVAARANSYPQLRVIEGNFGDQEIADKVGNVD
ALFLFDVLLHQVSPDWDTILDMYAKNVRCLLIYNQQWIGSTTTVRLLDLG
EKHYFRNVPHSKLNKAYRDLFQKLDKKHPDHDKPWRDIPDIWQWGITDAD
LESKASELGFKLLYKEDCRGFGWLPNIQNRAFLFARQ
>MT0764 hypothetical protein
MVLTRRAREVALTQHIGVSAETDRAVVPKLRQAYDSLVCGRRRLGAIGAE
IENAVAHQRALGLDTPAGARNFSRFLATKAHDITRVLAATAAESQAGAAR
LRSLASSYQAVGFGPKPQEPPPDPVPFPPYQPKVWAACRARGQDPDKVVR
TFHHAPMSARFRSLPAGDSVLYCGNDKYGLLHIQAKHGRQWHDIADARWP
SAGNWRYLADYAIGATLAYPERVEYNQDNDTFAVYRRMSLPDGRYVFTTR
VIISARDGKIITAFPQTT
>MT3716 conserved hypothetical protein
MDLPGNDFDSNDFDAVDLWGADGAEGWTADPIIGVGSAATPDTGPDLDNA
HGQAETDTEQEIALFTVTNPPRTVSVSTLMDGRIDHVELSARVAWMSESQ
LASEILVIADLARQKAQSAQYAFILDRMSQQVDADEHRVALLRKTVGETW
GLPSPEEAAAAEAEVFATRYSDDCPAPDDESDPW
>MT3024 hypothetical protein
MGLTESSAAASALAAAPFAKHTVGLASLSLMSTVFDVDLLIVPERIRQRR
AIRSIRSRRVAPAERCQTPQVTVQRPQPTDPAGTPCPHQHESRQARCGEA
EPSNR
>MT0014 hypothetical protein
MPKSKVRKKNDFTVSAVSRTPMKVKVGPSSVWFVSLFIGLMLIGLIWLMV
FQLAAIGSQAPTALNWMAQLGPWNYAIAFAFMITGLLLTMRWH
>MT3741 hypothetical protein
MAGLFTPPASGAATLQRAARDAAPDARWLLAVSDRNGIVSTSATTCNYPP
AAKDSAQDGFRHALAAAIAADIDEALRHGYGDLLELAYPLMSWPRRGVFG
GPTPAPRGLATRQCPPRTVHVDRVRPNGAERALRARFRPILRPQFTLGDG
ANGLPLAACTKTGAYVPHLPYSPIAVDPQPSAGQQGPS
>MT2837 hypothetical protein
MACGPSTGPAGYVPHSDPPGVAGAATDGGAATDAARTVNQAAGAMASQPP
PPGSLAAKWLVTRVPVAGLAEKCQNPPQLGHVQAGYRSESFGIMRL
>MT0161 PE family protein
MHWPRRPAMRCRPPSRNRSAHTARNTRPCSLKSRRFTVRFHQTLAAAANS
YADAEAAIASTRQNQLAVPAAAPTPAAAAMIPPFPANLTTLFFGPTGIPL
PPPSMLTPPIRCRSVRRALQAVFTPEELYPLTGVRSLVLNTSVEEGLTIL
HDAIMVELATTGNAVTVFGWSQSAIIASLEMQRFTAMGGAAPSASDLNFV
LVGNEMNPNGGMLARFPDLTLPTLDLTFYGATPSDTIYPTAIYTLEYDGF
ADFSRYPLNFISDLNAVAGITFVHTKYLDLTPAQVEGATKLPTSPGYTGV
TDYYIIRTENRPLLQPLRAVPVIGDPLADLIQPNLKVIVNLGYGDPNYGY
STSYADVRTPFGLWPNVPPQVIADALAAGTQEGILDFTADLQALSAQPLT
LPQIQLPQPADLVAAVAAAPTPAEVVNTLARIISTNYAVLLPTVDIALAL
VTTLPLYTTQLFVRQLAAGNLINAIGYPLAATVGLGTIDSGRRGIAHPPR
GGLGHRSKHRGPRHLTDSRRHRRPPTTVYRPRQ
>MT3483 hypothetical protein
MRRGAAAGDTLRPSTRRTAMWFALVNPEMLAAAATDLGGIRSGISAAYAR
PLR
>MT3520 hypothetical protein
MVAHPSTLRKHAVITPEIEFSLRVGLTTRWRSLVAV
>MT3651 conserved hypothetical protein
MPKSPPRFLNSPLSDFFIKWMSRINTWMYRRNDGEGLGGTFQKIPVALLT
TTGRKTGQPRVNPLYFLRDGGRVIVAASKGGAEKNPMWYLNLKANPKVQV
QIKKEVLDLTARDATDEERAEYWPQLVTMYPSYQDYQSWTDRTIPIVVCE
P
>MT3543 conserved hypothetical protein
MPRIRKLVAALHRRGPHRVLRGDLAFAGLPGVVYTPEAGLHLPGVAFGHD
WLTGTSRYSGLLEHLASWGIVAAAPDSERGLAPSVLNLAFDLGVALDIVA
GVRLGPGKISVHPAKLGLVGHGFGGSAAVFAAAGLTGTHVKSVAAIFPTV
TNPAAEQPAATLDVPGLILTAPGDPKTLTSNALGLSRAWDKATLRIVSKA
RAGGLVEGRRLTKVLGLPGPHRRTQRSVRALLTGYLLYTLGGDKTYRRFA
DPDLQLPKTDPIDPEAPPITPGEKIVTLLK
>MT2171 conserved hypothetical protein
MAQEQTKRGGGGGDDDDIAGSTAAGQERREKLTEETDDLLDEIDDVLEEN
AEDFVRAYVQKGGQ
>MT2973 lipoprotein, putative
MRARPLTLLTALAAVTLVVVAGCEARVEAEAYSAADRISSRPQARPQPQP
VELLLRAITPPRAPAASPNVGFGELPTRVRQATDEAAAMGATLSVAVLDR
ATGQLVSNGNTQIIATASVAKLFIADDLLLAEAEGKVTLSPEDHHALDVM
LQSSDDGAAERFWSQDGGNAVVTQVARRYGLRSTAPPSDGRWWNTISSAP
DLIRYYDMLLDGSGGLPLDRAAVIIADLAQSTPTGIDGYPQRFGIPDGLY
AEPVAVKQGWMCCIGSSWMHLSTGVIGPERRYIMVIESLQPADDATARAT
ITQAVRTMFPNGRI
>MT3454 hypothetical protein
MLMKSSRIPGPTQPTVVRELVAVGEPSYTALPAGLPHHPRPQRGGSSRAA
PVLVDDSMPHPGTRRCCALPQPGHQFPKPGQFRRQSWSRRSLSRAQTLSR
KRRALQPLPRPPYLTTTPSKRHYKRYKPIWALMRRKCRHVCTRSGYCRLV
CRSARAHHVVVRR
>MT0194 conserved hypothetical protein
MIGADVPRDSQRARVYAAEAFVRTLFDRVTAHGSPTVEFFGTQLTLPPEG
RFGSVASVQRYVDDVLALPAVGQNWPTVSPVRVRARRAATAAHYENHGGT
GTIAVPDRHTAGWAMRELVVLHEVAHHLCQVPPPHGPEFVATVCTLTELV
MGPEVGHVFRVVYAQEGVR
>MT0890 conserved hypothetical protein
MSGRHRKPTTSNVSVAKIAFTGAVLGGGGIAMAAQATAATDGEWDQVARC
ESGGNWSINTGNGYLGGLQFTQSTWAAHGGGEFAPSAQLASREQQIAVGE
RVLATQGRGAWPVCGRGLSNATPREVLPASAAMDAPLDAAAVNGEPAPLA
PPPADPAPPVELAANDLPAPLGEPLPAAPADPAPPADLAPPAPADVAPPV
ELAVNDLPAPLGEPLPAAPADPAPPADLAPPAPADLAPPAPADLAPPAPA
DLAPPVELAVNDLPAPLGEPLPAAPAELAPPADLAPASADLAPPAPADLA
PPAPAELAPPAPADLAPPAAVNEQTAPGDQPATAPGGPVGLATDLELPEP
DPQPADAPPPGDVTEAPAETPQVSNIAYTKKLWQAIRAQDVCGNDALDSL
AQPYVIG
>MT1367 PE_PGRS family protein
MSAFHAQFVQTFTAGAGAYASAEAAAAAPLEGLLNIVNTPTQLLLGRPLI
GNGANGAPGTGQAVGADGLLYGNGGAGGSGAPGQAGGPGGAAGLFGNGGA
GGAGGDGPGNGAAGGAGGAGGLLFGSGGAGGPGGVGNTGTGGLGGDGGAA
GLFGAGGIGGAGGPGFNGGAGGAGGRSGLFEVLAAGGAGGTGGLSVNGGT
GGTGGTGGGGGLFSNGGAGGAGGFGVSGSAGGNGGTGGDGGIFTGNGGTG
GTGGTGTGNQLVGGEGGADGAGGNAGILFGAGGIGGTGGTGLGAPDPGGT
GGKGGVGGIGGAGALFGPGGAGGTGGFGASSADQMAGGIGGSGGSGGAAK
LIGDGGAGGTGGDSVRGAAGSGGTGGTGGLIGDGGAGGAGGTGIEFGSVG
GAGGAGGNAAGLSGAGGAGGAGGFGETAGDGGAGGNAGLFNGDGGAGGAG
GLGIAGDGGNGGKGGKAGMVGNGGDGGAGGASVVANGGVGGSGGNATLIG
NGGNGGNGGVGSAPGKGGAGGTAGLLGLNGSPGLS
>MT1520 hypothetical protein
MRKSKKTRDQLLRELRNAYEGGASIRNLAATTGRSYGSIHSMLRESGTTM
RGRGGPNRRSRPR
>MT1305.1 hypothetical protein
MGHQPRLHGRHHLNRARMSITTAGSHSLQEHRVGCPPIACLDEHQQDSRG
TFTIIFTGPAFHPHGNSRCRCQGPPTRYIARRAEILGSYRIGETWYHNDR
>MT2729 hypothetical protein
MGRRGPAPAPAQLKLLGGRSPGRDSGGRRVTPPAAFERVAPECPDWLPPG
AKDMWGRVVPELAALNLLKESDLGVLTSFCVAWDQLMQAVTAYREQGFIA
TNARSRRVTVHPAVAAARAATRDVLVLARELGCTPSAEANLAAVLAAAGD
PDDDEFNPFAPDR
>MT1798 hypothetical protein
MPNIDDPINLRPLSPGQVNKVWLWQSLPGPWIGSARNTVYLTGFEFLEP
>MT2220 PE_PGRS family protein
MSFVIAAPEVMAAAATDLANIGSSISAASAAAAGPTMGILAAGADEVSVA
ISALFGSHAQGYQTLSAQLAAYHNQFVRALNAGAGSYASAEAANVQQTLL
NAINAPTQTLLGRPLIGNGADGGPGQNGGPGGLLYGNGGNGGAGDTANPN
GGNGGSAGLIGNGGAGGAGAATGAGGAGGNGGWLYGNGGPGGAAGLGTAG
GVSPAGGAGGAAGLWGHGGAGGAGGSASGAPGAGGAGGDGGRGGLLYGDG
GAGGAGGNGSNGVTGVHGGNGGAGGAAGLIGNGGAGGDGGNGGLSNTGAS
GGAGGAGGAALIGNGGDGGHGGNGGHGNSGGAGGAGGAGGAGGAGGHVGL
IGNGGNGGAGGNGGNDNSSTLADAGSGGAGAAGGNGGLFYGNGGVGGRGG
NGGFSSAGTSGGDGGIGGAGGIGGLIGSGGGGGDGGNGGQAPTPGNAGDG
GAGGNARLIGDGGRGGNGGEGGDGPPGVKGDGGNGGNGGNAVVIGNGGNG
GAGGFGIPVGSGGAGGSRGVLFGTPGANGADG
>MT2839 PE family protein
MSFLTTQPEELAAAAGKLETIGSAMVAQNAAAAAPTTTGVIPAAADEISV
RQAPLFTAYGTLYQQVSAEAAAVYDLFVKTLGVSAGTYAATEAANSSAAA
SPLSGIASILGSTPGKVPSWISDIANIFNIGAGNWASAASDLLGLASGGL
LPAAEEAALEEGLEGAGLSELGAAEAAVGEAPIAAGLGAAPLAAGLSRAS
SIGALSVPPSWAGQANLVSSTSTLQGAGWTTAAPHGAAGTVIPGMPGLAS
ATRSSAGFGAPRYGAKPIVMPKPAV
>MT2362 hypothetical protein
MTQTLRLTALDEMFITDDIDIVPSVQIEARVSGRFDLDRLAAALRAAVAK
HALARARLGRASLTARTLYWEVPDRADHLAVEITDEPVGEVRSRFYARAP
ELHRSPVFAVAVVRETVGDRLLLNFHHAAFDGMGGLRLLLSLARAYAGEP
DEVGGPPIEEARNLKGVAGSRDLFDVLIRARGLAKPAIDRKRTTRVAPDG
GSPDGPRFVFAPLTIESDEMATAVARRPEGATVNDLAMAALALTILQWNR
THDVPAADSVSVNMPVNFRPTAWSTEVISNFASYLAIVLRVDEVTDLEKA
TAIVAGITGPLKQSGAAGWVVDLLEGGKVLPAMLKRQLQLLLPLVEDRFV
ESVCLSNLGRVDVPAFGGEAGDTTEVWFSPTAAMSVMPIGVGLVGFGGTL
RAMFRGDGRTIGGEALGRFAALYRDTLLT
>MT1622.1 conserved hypothetical protein
MLAKLAAPGATNPDDHTPVIDTTPDAAAIDRDTRSQAQRNHDGLLAGLRA
LIASGKLGQHNGLPVSIVVTTTLTDLQTGAGKGFTGGGTLLPMADVIRMT
SHAHHYSPASGRYPQAIFDHGTPLALYHTKRLASPAQRIMLFANDRGCTK
PGCDAPAYHSQAHHVTAWTSTGRTDITDLTLACDPDNRLAEKGWTTHKNT
HGHTEWLPPPHLDHGQPRTNTFHHHEKLLRHNDEDNHDDP
>MT1854 hypothetical protein
MHRKRSAIPGESLSMRVVSTLLSIPLMIGLAVPAHAGPSGDDAVFLASLE
RAGITYSHPDQAIASGKAVCALVESGESGLQVVNELRTRNPGFSMDGCCK
FAAISAHVYCPHQITKTSVSAK
>MT3572.1 conserved hypothetical protein, interruption
MLAKLAAPGATNPDDHTPVIDTTPDAAAIDRDTRSQAQRNHDGLLAGLHA
LIASGKLGQHNGLPVSIVVTTTLTDLQTGAGKGFTGGGTLLPMADVIRMT
SHAHHYSPASGRYPQAIFDHGTPLALYHTKRLASPAQRIMLFANDRGCTK
PGCDAPAYHSQAHHVTAWTSTGRTDITELTLACGPDNRLAEKGWTTHKNT
HGHTEWLPPPHLDHGQPWTCEIHYTCACCCLPPNLRRPLRRTARRGPPTR
GLPKAVRAAKMGARRVPRQRRQRINRQAPPRLRADVGRHHRRQDRRRGGL
GPGPAPSPSHRAGSLHVISRREAAGPGHRRRRR
>MT2514 hypothetical protein
MAAEQSHDDPLVHLGYPLGRNAQLAAMLIEIIWGTPHLGAIW
>MT0880 conserved hypothetical protein
MIANLVAVAIRASREVVIEAPPEVIVEALADMDAVPSWSSVHKRVEVVDT
YSDGRPHHVKVTIKVAGIVDTELLEYHWGPDWVVWDAAKTAQQHGQHGEY
NLRREDNDKTRVRFTLTVEPSAPLPAFWVNIARKKILHAATEGLRKQVVG
RRRFTSG
>MT0328.1 hypothetical protein
MPDTVRLVQWPVITTVLDMPLTLTMLSLQAIVRFTLTPETLSWRRPAEEW
WPVAQSGLEQPGRQTGWARERRSCLGKAAVR
>MT1363.1 hypothetical protein
MARRRKPLHRQRPEPPSWALRRVEAGPDGHEYEVRPVAAARAVKTYRCPG
CDHEIRSGTAHVVVWPTDLPQAGVDDRRHWHTPCWANRATRGPTRKWT
>MT2589 hypothetical protein
MDDIAAFKLDSLPDITFTVTRAISSGGENPAGFLNFAARREQPEILGGGG
RPGPVGPEAVDTPRIRGGKVPFVFRTLPGYTFYASQIEPRVGDPEGPTLL
AGFGNIPETSQRSPGWIRITCKGPDDDEELEFFGFSGPES
>MT1475 hypothetical protein
MGFLKPDLPDVDHDTWLTQPRRTRLQVVTRDWVEHGFGTLYAVYLLYLTK
IAVYVAAGAAIISLNPGLGGLSRIGDWWTQPIVYQKVIVFTLLFEVLGFG
CGSGPLTGRFWPPIGGFLYWLRPNTIRLPAWPDKVPFTQGDTRTVVDVAL
YAIVLIGGVWALLSPGSPGPGGTPVTAAGDVGLINPVLVVPTIVALGVLG
LRDKTIFLAARGEHYWLKLFVFFFPFTDQIAAFKIIMLCLWWGAATSKLN
HHFPYVVAVMTSNNALLRSRVFNPIKHLLYRDHANDLRPSWLPKLMAHGG
GTTAEFLVPGILVLVADGHPWRWFLIGFMVLFHLNILSNLPMGVPLEWNV
FFIFSLCYLFGHYGAITATDLRSPLLLAIVIAVVAVVIMGNLLPEKISFL
PAMRYYAGNWATSIWCFRGDAEATMETSVVKSSALVVNQLAKLYDGATAE
IMTDKVAAFRAMHTHGRALNGLLPRALDDEAHYRIREGEIVAGPLVGWNF
GEGHLHNEQLVAAVQRRCNFADGDLRVIILEGQPIHVQKQWYRIVDAKTG
LFEAGYVTVEDMLSRQPWPEPGDEFPVHVTTQRGTPSKP
>MT2596 conserved hypothetical protein
MVDRDPNTIKQEIDQTRDQLAATIDSLAERANPRRLADDAKTRVIAFLRK
PIVTVSLVGIGSVVVVVVIHKIRNR
>MT3210 hypothetical protein
MQCTNASGRRRFGLSLTIHEDACRIISVVLVVLEVRRAEPSVRHDVSIHS
VTIV
>MT0908 conserved hypothetical protein
MDRTRIVRRWRRNMDVADDAEYVEMLATLSEGSVRRNFNPYTDIDWESPE
FAVTDNDPRWILPATDPLGRHPWYQAQSRERQIEIGMWRQANVAKVGLHF
ESILIRGLMNYTFWMPNGSPEYRYCLHESVEECNHTMMFQEMVNRVGADV
PGLPRRLRWVSPLVPLVAGPLPVAFFIGVLAGEEPIDHTQKNVLREGKSL
HPIMERVMSIHVAEEARHISFAHEYLRKRLPRLTRMQRFWIALYFPLTMR
SLCNAIVVPPKAFWEEFDIPREVKKELFFGSPESRKWLCDMFADARMLAH
DTGLMNPIARLVWRLCKIDGKPSRYRSEPQRQHLAAAPAA
>MT2419 PPE family protein
MILDFSWLPPEINSARIYAGAGSGPLFMAAAAWEGLAADLRASASSFDAV
IAGLAAGPWSGPASVAMAGAAAPYVGWLSAAAGQAELSAGQATAAATAFE
AALAATVHPAAVTANRVLLGALVATNILGQNTPAIAATEFDYVEMWAQDV
GAMVGYHAGAAAVAETLTPFSVPPLDLAGLASQAGAQLTGMATSVSAALS
PIAEGAVEGVPAVVAAAQSVAAGLPVDAALQVGQAAAYPASMLIGPMMQL
AQMGTTANTAGLAGAEAAGLAAADVPTFAGDIASGTGLGGAGGLGAGMSA
ELGKARLVGAMSVPPTWEGSVPARMASSAMAGLGAMPAEVPAAGGPMGMM
PMPMGMGGAGAGMPAGMMGRGGANPHVVQARPSVVPRVGIG
>MT0204.1 hypothetical protein
MGGELTGRRIPTVVVDLPNQRLLATNMCPRLASAIPTQSVTIETVPADN
>MT2359 conserved hypothetical protein
MHAKVGDYLVVKGTTTERHDQHAEIIEVRSADGSPPYVVRWLVNGHETTV
YPGSDAVVVTATEHAEAEKRAAARAGHAAT
>MT2978 hypothetical protein
MCAVLDRSMLSVAEISDRLEIQQLLVDYSSAIDQRRFDDLDRVFTPDAYI
DYRALGGIDGRYPKIKQWLSQVLGNFPVYAHMLGNFSVRVDGDTASSRVI
CFNPMVFAGDRQQVLFCGLWYDDDFVRTPDGWRIIRRVETKCFQKMM
>MT4026.1 hypothetical protein
MAVEVGRDGRWRNRQPSLPHPSHWPVSGRGVVVPISARQTAGDNIKNPWV
RSRLVARRWANRAPPTQRLL
>MT0645 hypothetical protein
MPGGEQVMDVLAAGIAAGALTLAAWGAWRPHYRAASYLVAGAVELALIGL
LVVTGQTLMAISVAFLVALGGPLVVVNHRRAERSRG
>MT0187 hypothetical protein
MEDQQSASGDLTQKSVANGESTDTASAATEGHRGEIDAAGEPDERGAAVA
DSQADEDDSAATAARGGKTRARRSRGRRLAITVGVAAALFVGSAAFAGAT
VEPYLSERAVVATKLMVARTAANAITTLWTYTPENMDTLADRAANYLSGD
FAAQYRRFVDQIAAANKQAKITNDTEVTGAAVESLSGRDAVAIVYTNTTT
TSPVTKNIPALKYLSYRLFMKRYDARWLVTRMTTITSLDLTPQV
>MT3135 hypothetical protein
MSAAAVDIAAAINNTRHIRYPGYWVFAPARVTEAPNSPLNRKTDPL
>MT1920 hypothetical protein
MNAAMNLKREFVHRVQRFVVNPIGRQLPMTMLETIGRKTGQPRRTAVGGR
VVDNQFWMVSEHGEHSDYVYNIKANPAVRVRIGGRWRSGTAYLLPDDDPR
QRLRGLPRLNSAGVRAMGTDLLTIRVDLD
>MT1309 hypothetical protein
MTMLSPLSPRIIAAFTTAVGAAAIGLAVATAGTAGANTKDEAFIAQMESI
GVTFSSPQVATQQAQLVCKKLASGETGTEIAEEVLSQTNLTTKQAAYFVV
DATKAYCPQYASQLT
>MT3221 PPE family protein
MDFALLPPEVNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEAYGSVLS
GLAALHWRGPAAESMAVTAAPYIGWLYTTAEKTQQTAIQARAAALAFEQA
YAMTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQDAAA
MYGYATASAAAALLTPFSPPRQTTNPAGLTAQAAAVSQATDPLSLLIETV
TQALQALTIPSFIPEDFTFLDAIFAGYATVGVTQDVESFVAGTIGAESNL
GLLNVGDENPAEVTPGDFGIGELVSATSPGGGVSASGAGGAASVGNTVLA
SVGRANSIGQLSVPPSWAAPSTRPVSALSPAGLTTLPGTDVAEHGMPGVP
GVPVAAGRASGVLPRYGVRLTVMAHPPAAG
>MT0270.1 hypothetical protein
MTRVSWLPDRCLPRLPACGRGLRGSLPGDSGGTAPDSHRLPASSSPDGKN
IGMQSVDLHVERHLPSRGRSHRTVATVTCVTALGDIRSAQLSATGAWPAV
LFPSWSWLCGIGGGVDLQKPSCRA
>MT1094 hypothetical protein
MRPSRYAPLLCAMVLALAWLSAVAGCSRGGSSKAGRSSSVAGTLPAGVVG
VSPAGVTTRVDAPAESTEEEYYQACHAARLWMDAQPGSGESLIEPYLAVV
QASPSGVAGSWHIRWAALTPARQAAVIVAARAAANAECG
>MT1075.1 hypothetical protein
MRHFLRLTFAGRFEGSRDDRPLLGYDTPTGLTCPYTTPLDVTADIIEARR
HVPAKHQVSPSAPDSLKVQARVGWNRRQLSAVGGRGQQLFANAPGHIPST
SHRRGTGDINRKIDESLAGAARPQANANYGATSDPPLTHQPKPGSPTQVG
PRSPSPPGLRGLVKQLPEVHQSSLHLDTVASLPSSRPSPHHTPLALRSRS
GHFSPDEIRNRRSRKRSQSHMPPRTPPRGRCLRAPESARLGRRSAAHRHS
IARNARAIPFVV
>MT4035.1 hypothetical protein
MAEIKACTGTLCNYVTVTTRRFRRLSQPAAGSGNQRRRGVRQADSRLDGA
MDRTIVTGRTRCGASRLEWHREYWPDSDRLSYSRRVAGAHWPAEV
>MT3223 hypothetical protein
MSRYYAERELSARTHAKNRTGYTLGERMVHQ
>MT1181 hypothetical protein
MTTSQYAAVAAAHSVDPDRWQAEFSAVLDRIAPRFARHQPLRHAGELMAG
MVSGLDRKNCWTIAEHRGDTTPMGCSICWHGPAGTPTMSVTICVTIAIDR
WRRTRPPVYR
>MT2854 hypothetical protein
MRAWLAAATTALFVVATGCSSATNVAELKVGDCVKLAGTPDRPQATKAEC
GSPASNFKVVAVVQEDHAECPADVDSTYSMRNAFNGSTNTICLDIDWVIG
GCMSVDPTHNTDPFRVDCDDASVPHRQRATQILKDLDSPVSVDQCASGVG
YVYTQRRFAVCVEDVTGGPRS
>MT2828 hypothetical protein
MHRGYALVVCSPGVTRTMIDIDDDLLARAAKELGTTTKKDTVHAALRAAL
RASAARSLMNRMAENATGTQDEALVNAMWRDGHPENTA
>MT0312 DNA-binding protein, CopG family
MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRVALRDYT
AKTVPALDIDAYAQRVYQANRAAGS
>MT1832 conserved hypothetical protein
MAEESRGQRGSGYGLGLSTRTQVTGYQFLARRTAMALTRWRVRMEIEPGR
RQTLAVVASVSAALVICLGALLWSFISPSGQLNESPIIADRDSGALYVRV
GDRLYPALNLASARLITGRPDNPHLVRSSQIATMPRGPLVGIPGAPSSFS
PKSPPASSWLVCDTVATSSSIGSLQGVTVTVIDGTPDLTGHRQILSGSDA
VVLRYGGDAWVIREGRRSRIEPTNRAVLLPLGLTPEQVSQARPMSRALFD
ALPVGPELLVPEVPNAGGPATFPGAPGPIGTVIVTPQISGPQQYSLVLGD
GVQTLPPLVAQILQNAGSAGNTKPLTVEPSTLAKMPVVNRLDLSAYPDNP
LEVVDIREHPSTCWWWERTAGENRARVRVVSGPTIPVAATEMNKVVSLVK
ADTSGRQADQVYFGPDHANFVAVTGNNPGAQTSESLWWVTDAGARFGVED
SKEARDALGLTLTPSLAPWVALRLLPQGPTLSRADALVEHDTLPMDMTPA
ELVVPK
>MT1235 conserved hypothetical protein
MATRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAE
ATSLDTMAQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS
>MT3251 hypothetical protein
MTSFAHPGTRGLSTVFGLMMVGSAAVGSHGLAVVVGLAAVIAVGVAAVFR
LAATLAVVLSVVMIVVSGPTHVLAALSGFCAAVYLVCRYGAGVVAGSWPT
TVAAVGFTFAGLAATSFPLQVPWLPLAAPLAVLATYVLATRPFSR
>MT4022 hypothetical protein
MAPLAVDPAALDSAGGAVVAAGAGLGAVISSLTAALAGCAGMAGDDPAGA
VFGRSYDGSAAALVQAMSVARNGLCNLGDGVRMSAHNYSLAEAMSDVAGR
AAPLPAPPPSGCVGVGAPPSAVGGGGGAPKGWGWVAPYIGMIWPNGDSTK
LRAAAVAWRSAGTQFALTEIQSTAGPMGVIRAQQLPEAGLIESAFADAYA
STTAVVGQCHQLAAQLDAYAARIDAVHAAVLDLLARICDPLTGIKEVWEF
LTDQDEDEIQRIAHDIAVVVDQFSGEVDALAAEITAVVSHAEAVITAMAD
HAGKQWDRFLHSNPVGVVIDGTGQQLKGFGEEAFGMAKDSWDLGPLRASI
DPFGWYRSWEEMLTGMAPLAGLGGENAPGVVESWKQFGKSLIHWDEWTTN
PNEALGKTVFDAATLALPGGPLSKLGSKGRDILAGVRGLKERLEPTTPHL
EPPATPPRPGPQPPRIEPPESGHPAPAPAAKPAPVPANGPLPHSPTESKP
PPVDRPAEPVAPSSASAGQPRVSAATTPGTHVPHGLPQPGEHVPAQAPPA
TTLLGGPPVESAPATAHQPQWATTPAAPAAAPHSTPGGVHSTESGPHGRS
LSAHGSEPTHDGASHGSGHGSGSEPPGLHAPHREQQLAMHSNEPAGEGWH
RLSDEAVDPQYGEPLSRHWDFTDNPADRSRINPVVAQLMEDPNAPFGRDP
QGQPYTQERYQERFNSVGPWGQQYSNFPPNNGAVPGTRIAYTNLEKFLSD
YGPQLDRIGGDQGKYLAIMEHGRPASWEQRALHVTSLRDPYHAYTIDWLP
EGWFIEVSEVAPGCGQPGGSIQVRIFDHQNEMRKVEELIRRGVLRQ
>MT1541 hypothetical protein
MPFLVALSGIISGVHDHSMTVRLDQQTRQRLQDIVKGGYRSANAAIVDAI
NKRWEALHDEQLDAAYAAAIHDNPAYPYESEAERSAARARRNARQQRSAQ
>MT0576 hypothetical protein
MRRLNKAFGGFFRPPQTAKPAVKVGYPEHRRHICTASAASSAPASPGRRP
G
>MT3200 conserved hypothetical protein
MCSGPKQGLTLPASVDLEKETVITGRVVDGDGQAVGGAFVRLLDSSDEFT
AEVVASATGDFRFFAAPGSWTLRALSAAGNGDAVVQPSGAGIHEVDVKIT
>MT0213 hypothetical protein
MKTGTATTRRRLLAVLIALALPGAAVALLAEPSATGASDPCAASEVARTV
GSVAKSMGDYLDSHPETNQVMTAVLQQQVGPGSVASLKAHFEANPKVASD
LHALSQPLTDLSTRCSLPISGLQAIGLMQAVQGARR
>MT3573.4 hypothetical protein
MRYLPGRSRPPPGGLYRQTSKHRRCRMTAVAITPASGGRHSVRFAYDSAI
VSLIKSTIPAYARSWSAHTRCWFIDADWTPLLAAELRYHGHTVTGPADPA
QQQCTDWAKALFRAVGPQRTPAVYRALSKVLHPDAPTGCPILQQQLNAAR
TALTNPA
>MT3912 hypothetical protein
MVGRVGRPVFPYEPMVRVSLWLSVTAVAVLFGWGSWQRRWIADDGLIVLR
TVRNLLAGNGPVFNQGERVEANTSTAWTYLLYVGGWVGGPMRLEYVALAL
AMVLSLLGMVLLMLGTGRLYAPSLRGRRAIMLPAGALVYIAVPPARDFAT
SGLESGLVLAYLGLLWWMMVCWSQPLRARPDSQMFLGALAFVAGCSVLVR
PEFALIGGLALIMMLIAARTWRRRVLIVLAGGFLPVAYQIFRMGYYGLLV
PSTALAKDAAGDKWSQGMIYVSNFNRPYALWVPLVLSVPLGLLLMTARRR
PSFLRPVLAPDYGRVARAVQSPPAVVAFIVGSGVLQALYWIRQGGDFMHG
RVLLAPLFCLLAPVGVIPILLPDGKDFSRETGRWLVGALSGLWLGIAGWS
LWAANSPGMGDDATRVTYSGIVDERRFYAQATGHAHPLTAADYLDYPRMA
AVLTALNNTPEGALLLPSGNYNQWDLVPMIRPSSGTAPGGKPAPKPQHAV
FFTNMGMLGMNVGLDVRVIDQIGLVNPLAAHTERLKHARIGHDKNLFPDW
VIADGPWVKWYPGIPGYIDQQWVTQAEAALQCPATRAVLNSVRAPITLHR
FLSNVLHSYEFTRYRIDRVPRYELVRCGLDVPDGPGPPPRE
>MT0765 hypothetical protein
MTGPPRSYTGRRDLIAEKLEPYFQISAMLPKNTRPTSETAEEFWDNSLWC
SWGDRETGYTRTVTVSICQVADGEREAEGFGT
>MT0708 hypothetical protein
MKWNTVAASLAAGVITIAVALAAPPPAAHAKNGDTHVTGQGIERTLDCNE
STLLVNGTQNIVTALGTCWAVTVMGSSNTVVADTIINDITVYGWDETVFF
RNGDPFIWDRGRELGMVNRLQRVG
>MT1997 hypothetical protein
MIRGSAVSGLLMPSVNGGTAGSVACVQCLFLPKVAVDLINLSGIQCFARI
EHVAHAQAHPFVVLVGKPAQHGARIGAVAGAILTGDVIVSHDGELYRAVT
ALRQNGPRPHASRRLHAPALCSARSRRGHLRPSCWLPPPRFAGRQSLVAR
>MT2423.1 hypothetical protein
MSLYRATPDVPGSQRQLPDRRFCSRVGRGCAGLEAGVDLQKPSCRAKVGT
ATIADGEGQCCGRDERAVRTAGAALGSIESSGAPPWFGGAPVGRPGVQVL
LSASSAQFSTRHQANSQVNGLY
>MT1025.3 hypothetical protein
MPEPFRIAREKVRVLRPAGRITIHTSYAGQCPPMRYAPNAAAGNIGLTMF
DRNAFVNLSRSAGLVDVEQ
>MT2627.1 hypothetical protein
MDRRRRGGVAACLLVTGVSCRSPRAGLERERSCMGTMANSAADYDARKGA
DTRFAPVQKSALSSLWPVA
>MT2564 PE_PGRS family protein
MGRCEMSYVIATPEMMATAAFDLARIGSQVSAASAVAAMPTTEVVAAGAD
EVSAGIAALFSAHAQEYQALSAQAAAFHDQFVHTLTAAARWYTATEIANA
AAMRVVLGAVNAPTQTLLGRPLIGDGAHGTAPGQPGGAGGLLFGNGGNGA
AGAVGQVGGAGGAAGLFGIGGAGGAGGAGAPGGTGGTGGWLAGGGGVGGM
GGAGGGAGGAGGNAGLFGNGGAGGAGGAGGGAGGAGGNAGWFGHGGAGGV
GGVGAAGANGATPGQDGAAGVAGSDDGAGGDGLAGSDGGDGGAGGVGGNG
GRGGWLLGNGGAGGVGGVGGAGGAGAAGGAGGAGATGINGPAGISAAGGD
GGAGGNGGAGGNGGVGGAGGAGGSAGLLGYVGRAGDGGAGGGGGLGGAPG
DGGAGGNGGSWLAAGDGGAGGHGGDPGLGGAGGAGGASGGAGARAGANGL
AAGNDGPVSGGNGGKGGNGAHAPVAGGHGGNGGAGGNGGLVGDGGAGGHG
GDGAAGAGYADMTAIFLGSSGTPGEDGGNGGAGGAGGAGGAHAGDGGAGG
AGGNGGAGGAGGNGAHGFNAVLVSDGGNGGDGGAGGRGGDGGAGGAGGDA
PAGRAGSQGVGGDGGAGGAGGAPGNGGSGGRGDMAFKDGDGGAGGDGGDP
GAGGKGGAGGAGATEGVTGATGATVHSGGNGGKGGNGADATVAGANGGKG
GAGGNGGLVGDGGAGGDGGSGAAGANGANVGEDGADGTLSGQPGEGSEAN
GGQGGVGGGGAGGAGGDGGAGSSALGSGGNGGRGDAGQAGGAGGAGGAGG
AGGSVSGDGGPGGKGGAGGAGGAGASGGGGGKGASGADSAEAVGGAGGKG
GDGGVGGVGGDGGPGGDGGAGGAAPAGQVGSHGVGGVGGDGGLGGAGGNG
GDGGHGSDGGDGGDGGDPGAGGLGGLGGDSGNGTRAASGVDASDHGPGSG
GNGGNGGNGAQASVAGGAGGNGGDGGNAGRVGDGGAGGNGGDGAAGANGA
NSGAPGSDALALGQPGGNGGQGDAGQAGGAGGAGGAGGAGGSVSGDGGAG
GNGGAGGNGGVGASGGAGARGANGIDSIGGTGGAGGGGGDGGAGGVGGHG
GDGGVGGAAPSGTVGSHGTGGVGGDGGLGGAGGVGGAGGNGGIGITVGGA
GGAGGNGGDPGAGGRGGLGGDSGNGTSAANGVDASKHGPLTGGDGGVGGN
GAKAAAAGGDGGQGGDGGNAGLFGDGGAGGDGADGTAAEALGGDGGAGGA
GGKGGDAGDIGDGGDGGKGGDGAHGALGGLTVAGGNGGAGGAGGAGGAGG
AFLGDGGNGGAGGQGGAGRGGSPGGGGGVGGHGGAGGDAGMNGGGGTGGQ
GGNGAAGGAGWSPDSDLKGFDGFDGGSGGAGGDGGAGGAGGTQTGDGGDG
GAGGLGGAGGVGGNGVDGFDINETTGRDGGDGGDGGYGGWGGAGGNGGAG
GSAPAGEVGNRGVGGDGGDGGSGGDAGNGGLGGDGFTYLADFDGEPGGDG
GDGGDGGWGRPGGQGGFGSTSGAHGKAGFGAPGGDGGDGGNGGHGGDGNG
SFADAGDGGPGGNGGNGGLGGAGRDGGAPGGDGGDGGTGGSGGFGAPPPR
SIGGGDGGDGGRGGDGGRGAGGLTSGGVGSSGESGGSGNGRGDPGSGGSG
GEGGEGGPSISVNVT
>MT3809 hypothetical protein
MRHMSETSETPTPPPHQTPKVFKAAAWVAIAAGTVFIVAVIFFTGYILGK
HAGHGGFHHRQHHQHPAMMLRPGSPHGGPAAVRPGPGPGGPGQVPSSVSP
PATPAP
>MT1597 conserved hypothetical protein
MASVELSADVPISPQDTWDHVSELSELGEWLVIHEGWRSELPDQLGEGVQ
IVGVARAMGMRNRVTWRVTKWDPPHEVAMTGSGKGGTKYGVTLTVRPTKG
GSALGLRLELGGRALFGPLGSAAARAVKGDVEKSLKQFAELYG
>MT3495 PE_PGRS family protein
MQPRLFQVLVISGRRFGSTGALRRNTRRDAVIGDTLRPIEGRTAMSFVIA
NPEMLAAAATDLAGIRSAISAATAAAAAPTIQVAAAGADEVSLAISALFG
QHAQAYQALSAQATIFHDQFVQALTSGGNLYAAAESHTVEQMVLNAINAP
TQTLFGRPLIGDGANGTAENPDGQNGGLLFGNGGNGFTQTTAGVAGGNGG
SAGLIGNGGAGGGGGAGAAGGLGGNGGWLYGNGGAGGIGGAGTGTGGHGG
AGGAGGRAWLWGTGGAGGAGGDGGWLFGDGGAGGTGGNGGSGFNSLTSSV
GGAGGAGGHAGLFGAGGTGGTGGIGGQNTETGPAASNGGAGGAGGGGGYL
VGDGGAGGTGGAGGKNSSGGATLTGGTGGTXGAGGAAGWLYGSGGAGGAG
GAGGLNNAGGATGGTGGTGGAGGSGAWLYGNGGAAGAGGNGGNNTSAGTG
GVGASGGTGGNAGLIGAGGHGGAGGAGGNQTGGVGNGGAGGNGGAGGAGG
QLYGNGGDGGNGGAGGANIAGGNGSDGGAAGHGGAGGSARLIGAGGRGGD
GGAGGNTAGRRADAIAGTGGDGGNGGNGGLLSGNAGAGGHGGAGGSSTAT
TTTGTPPTGATGGNGGNGGAGGTAGFTGSGGIGGNGGAGGTGGNAGVALS
VGSTGGLGGNGGSGGLGGGGGSLFGNGGAGGVGATGGNGGSGIGPASVGG
NGGKGGVGAAGGLAGQIGNGGSGGSGGAGGNGGTGDTAGNGGNGGAGAVG
GNAQLIGNGGNGGGGGNGGTGADGT
>MT2418 hypothetical protein
MLAFWLRGIATSVALAVDVLFGQADFTLSSVHSAELASANSTSGHLQIAM
VVLALLIAGLTAGGAFRMASGLGHA
>MT1107 hypothetical protein
MLRDPHAFIVSGACRPGTRAPSASRYPVASRMYSPI
>MT3014 hypothetical protein
MTVAGDGDGLVLGLSAMCDVASFMGAVAQRELRAENRRRISPSVHARHRF
NFTRST
>MT3615.3 PE_PGRS family protein
MSFVLIAPEFVTAAAGDLTNLGSSISAANASAASATTQVLAAGADEVSAR
IAALFGGFGLEYQAISAQVAAYHQRFVQALSTGAGAYASAEAAAAEQIVL
GVINAPTQALLGRPLIGDGANATTPGGAGGAGGLLFGNGGAGAAGAPGQA
GGPGGPAGLWGNGGPGGAGGSGGGTGGAGGAGGWLFGVGGAGGVGGAGGG
TGGAGGPGGLIWGGGGAGGVGGAGGGTGGAGGRAELLFGAGGAGGAGTDG
GPGATGGTGGHGGVGGDGGWLAPGGAGGAGGQGGAGGAGSDGGALGGTGG
TGGTGGAGGAGGRGALLLGAGGQGGLGGAGGQGGMGGAGGAGADNPTGIG
GTGGDGGTGGSAGEGGAGGAAGQLFSASGAAGNAGVGGAGGQGGDGGAGG
AGADADQPGATGGTGFAGGAGGAGGAGGSSGAGGTNGSGGAGGTNGSGGA
GGTGGQGGAGGAGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGT
GGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAG
GAGGSSGAGGTNGSGGAGGTGGQGGGAGGSSGAGGTNGSGGAGGTGGQGG
AGGAGGAGADNPTGIGGTGGDGGTGGAAGAGGAGGAAGTGGTGGMIGTTG
NAGVGGAGGQGGDGGAGGAGADADQPGATGGTGFAGGAGGAGGAGGAGGN
SGAGGTNGSGGAGGTGGQGGAGGAGGAGADNPTGIGGTGGDGGTGGAAGA
GGAGGAAGTGGTGGMIGTTGNAGVGGAGGQGGDGGAGGAGADADQPGATG
GTGFAGGAGGAGGAGGAGGSSGAGGTNGSGGAGGTGGQGGAGGAGISFSN
GSNGGTGGTGGVGGTGGDGGNAGTGAGDPGKGGTGGTGGTGGSGGAGGSG
GANFNGGTGGTGGTGGTGGKGGLNTDGLSSATSGTGGTGGTGGKGGTGGA
GDDSAGGTGGTGGAGGNAGAGGLANTGGTAGNAGIGGDGGQGGNGGQGDS
GSGLGGQPGFAGGPGGKGGAGGNAGTGGTNGSGAGGAGGQGGAGGAGISF
SNGSNGGTGGTGGVGGTGGDGGNAGTGAGDPGKGGTGGTGGTGGSGGAGG
SGGANFNGGTGGNGGSGGTGGTGGPGGSGGAPTGSGTGGKGGAGGDGGDG
ADGGAATGVGDGGDGGNGGNGGNGGTGVGSPGGLGGAGGTGGLGGAGAGG
GADGDDGDDGQPGNNGS
>MT3615.4 hypothetical protein
MMGAAATGSGHPARKVDMGRLSCLSGAEPTSMWMEVRHGCSRASRNSCGG
FAEAGPHLIDGHRAALPGVAIG
>MT1135 hypothetical protein
MSTQIAVRLPDEIVAFIDDEVRGQHARSRAAVVLRALERERRRRLAERDA
EILATNTSATGDLDTLAGHCARTALDID
>MT1958 hypothetical protein
MIGPARRSTTTRRSTPRADRLAGCWCLPGAICQTPRAWWSQARRDGDDET
GMRRKGAEMCWMCDHPEATAEEYLDEVYGIMLMHGWAVQHVECERRPFAY
TVGLTRRGLPELVVTGLSPRRGQRLLNIAARRALVGDLLTPGMQTTLPAG
PLVETVQVTHPDAHLYCAIAIFGDKVTALQLVWADRRGRWPWAADFDEGR
GTQPVLGMRATRRSA
>MT2238 conserved hypothetical protein
MADVSGAHTDVRPELRKLAQAILDGIDPAVRVAAAMASGGGPGTGKCQQV
WCPLCALAALVTGEQHPLLTVIADHSLALLEVIRAIVDDIDRSAKPPPEG
PPGGGQTGASGGENTNGEGSMKSHYQAIPVTIEE
>MT4002 hypothetical protein
MVELFDADLKRKGFDGVALPAGSYELHKINGVRLDINKSLDELGVQDGNT
LVLVPRVAGESFEPQYESLSTGLAAMGKWLGRDGGDRMFAPVTSLTAAHT
AMAIIAMAVGVVLALTLRTRTITDSPVPAAMAGGIGVLLVIGALVVWWGW
RERRDLFSGFGWLAVVLLAVAAACAPPGALGAAHALIGLVVVVLGAITIG
VATRKRWQTAVVTAVVTVCGILAAVAAVRMFRPVSMQVLAICVLVGLLVL
IRMTPTVALWVARVRPPHFGSITGRDLFARRAGMPVDTVAPVSEADADDE
DNELTDITARGTAIAASARLVNAVQVGMCVGVSLVLPAAVWGVLTPRQPW
AWLALLVAGLTVGLFITQGRGFAAKYQAVALVCGASAAVCAGVLKYALDT
PKGVQTGLLWPAIFVAAFAALGLAVALVVPATRFRPIIRLTVEWLEVLAM
IALLPAAAALGGLFAWLRH
>MT0450 hypothetical protein
MSSRFRTRMRLPAVTGPYMVLGAVNVVQAQWVRRTA
>MT3757 hypothetical protein
MVPMAYSTVDMPDSSTASAVIRATSTRNMVITNIFLSHTRPNCKTSPASP
TTSGTMPTHRNAGKKHSPSGPAISTPARSAAAAAACAASCRTWTASSTMP
SASAAPEAAERRANRSTASVCASWVPGGKSGGLDQAMLGSAPNASRSAAR
RNTRATAAARPQRWRPRPSTPPSPQPDTRPARQGWLRRLAGPRHPSALAA
APGVADARGAASRLAPVPPTGRHRATTPWPATPPPHSQHWPIGDPVRPQQ
PGAGQCQPDHQQPSAHASRQQKAQRPGADQLTKQHPEQRQDCQYGRTGPG
TGHPR
>MT2919 PE_PGRS family protein
MTSASWASNPVQRGAANPRQLPRSPQCASLAAIESALSAQSGSSWEVTML
YVVASPDLMTAAATNLAEIGSAISTANGAAALPTVEVVAAAADEVSTQIA
ALFGAHARSYQTLSTQAAAFHSRFVQALTTAAASYASVEAANASPLQVAL
DVINAPAQTLLGRPLIGNGADGSTPGQAGGPGGLLYGNGGNGAAGGPNQA
GGAGGNAGLIGNGGAGGAGGVGAVGGKGGTGGLLFGNGGAGGQGGLGLAG
INGGSGGQGGHGGNAILFGQGGAGGPGGTGAMGVAGTNPTPIGTAAPGSD
GVNQIGNGGNTDLTGGAGGDGNAGSTTVNGGNGGTGGAARNSSGGTGNSF
GGAGGAGGDGANGGDGGAGGEALTEGGATAVSGAGGKGGNAXASGGAGGN
XGKGGFAQATTSVTGGNGGNGGNGHDSNAPGGAGGSGGVGGDGGRGGLLA
GNGGTGGAGGNGGTGGAGAPGGAGGAGGKADIANSLGDNATVTGGNGGTG
GDGGSALGTGGAGGAGGLGGHGGAGGLLIGNGGAGGAGGLGGAGGAGGAG
GEGGAGGAGGEAIPGGASTNSAGGDGGAGGTGGNGGDGGAGGAPGLGGAG
GAGGWLIGQSGSTGGGGAGGAGGAGGAGGAGGSGGAGGHGDTTSGKNGSS
GTAGFDGNPGQPG
>MT0416 hypothetical protein
MGVAIPYPVTILATRPPSCLGIEFPVGPLMFPLLRFN
>MT0246 hypothetical protein
MGWFSAPEYWLGRLALERGTAIIYLIAFVAAAQQFRPLIGEHGMLPVPRY
LAGQSFWRTPSIFHFRYSDRVFAGVCWLGAVLSAAVVAGAASFVPLWATM
LIWLTLWVLYLSIVNVGQAWYSFGWESLLLETGFLMIFLGNERTAPPILT
LLLARWLLFRVEFGAGLIKMRGDSCWRSLTCLYYHHETQPMPGPLSWFFH
HLPKPLHRIEVAGNHFAQLVVPFGLFTPQPAASIAAAIIVVTQLWLVASG
NFSWLNWLTILLACSAIDTSSAAALLPMPAQPALSAPPQWFAGLVVVFTA
AVLLLSYWPARNLLSSHQRMNMSFNPFHLVNTYGAFGSICRTRREVVIEG
TDESPITEQTVWKAYEFKGKPGDPRRLPRQWAPYHLRLDWLMWFAAISPG
YALPWMTPFLNRLLRNDPATLKLLRHNPFPQSPPRYVRAQLYQYRFTTVA
ELRRDRAWWHRTLIGRYVPPMSLRKVASPPAD
>MT2102 hypothetical protein
MAPPNRDELLAAVERSPQAAAAHDRAGWVGLFTGDARVEDPVGSQPQVGH
EAIGRFYDTFIGPRDITFHRDLDIVSGTVVLRDLELEVAMDSAVTVFIPA
FLRYDLRPVTGEWQIAALRAYWELPAMMLQFLRTGSGATRPALQLSRALL
GNQGLGGTAGFLTGFRRAGRRHKKLVETFLNAASRADKSAAYHALSRTAT
MTLGEDELLDIVELFEQLRGASWTKVTGAGSTVAVSLASDHRRGIMFADV
PWRGNRINRIRYFPA
>MT1122.1 hypothetical protein
MHRPCRQGAQIVSLTRPPSTRFPMDRMHTIPLAAYALCMTQSMRLSKWL
>MT3289 hypothetical protein
MQEGGPQETMSARSTQHDAADALFRAIIETLDKHRNERTLTEDVLDTLAR
AYASISTNVPEQGRLG
>MT1850 PPE family protein
MDFGLLPPEINSGRMYTGPGPGPMLAAATAWDGLAVELHATAAGYASELS
ALTGAWSGPSSTSMASAAAPYVAWMSATAVHAELAGAQARLAIAAYEAAF
AATVPPPVIAANRAQLMVLIATNIFGQNTPAIMMTEAQYMEMWAQDAAAM
YGYAGSSATASRMTAFTEPPQTTNHGQLGAQSSAVAQTAATAAGGNLQSA
FPQLLSAVPRALQGLALPTASQSASATPQWVTDLGNLSTFLGGAVTGPYT
FPGVLPPSGVPYLLGIQSVLVTQNGQGVSALLGKIGGKPITGALAPLAEF
ALHTPILGSEGLGGGSVSAGIGRAGLVGKLSVPQGWTVAAPEIPSPAAAL
QATRLAAAPIAATDGAGALLGGMALSGLAGRAAAGSTGHPIGSAAAPAVG
AAAAAVEDLATEANIFVIPAMDD
>MT4008 PE family protein
MVWSVQPEAVLASAAAESAISAETEAAAAGAAPALLSTTPMGGDPDSAMF
SAALNACGASYLGVVAEHASQRGLFAG
>MT1291 hypothetical protein
MPGVWSPPCPTTPRVGVVAALVAATLTGCGSGDSTVAKTPEATPSLSTAH
PAPPSSEPSPPSATAAPPSNHSAAPVDPCAVNLASPTIAKVVSELPRDPR
SEQPWNPEPLAGNYNECAQLSAVVIKANTNAGNPTTRAVMFHLGKYIPQG
VPDTYGFTGIDTSQCTGDTVALTYASGIGLNNVVKFRWNGGGVELIGNTT
GG
>MT1784 hypothetical protein
MSALLDGVLDAHGGLQRWRAAETVHGRVRTGGLLLRTRVPGNRFADYRIT
VHVQQARTVLDPFPRDGYRGVFESGQVRIESHDGAVISSRAHPRAAFFGR
SGLRRNIRWDPLDSVYFAGYAMWNYLTTPYLLTREGVAVEEGAPWQQEGE
TWRRLIVSFPPDIDTHSPRQTFYVDASGLLRRHDYVPEVVGHWARAAHYC
ADPVDVDGFVFPTCRWVHPIGPGNRSLPFPTLVSILLTDIRVETD
>MT0082 hypothetical protein
MPAVTTPSNHWGDERRKLSHQPPVRGQILGRRQARRLSQHFARVGVEAPP
KRLQEMLLGAPAADEEWTDVKFALIVTQLNHEKRVAKFHRLQRRATHSLI
CLGLVLVALNFLICLAYIFFSLTQHAAAL
>MT3992 hypothetical protein
MAEPLAVDPTGLSAAAAKLAGLVFPQXPAPIAVSGTDSVVAAINETMPSI
ESLVSDGLPGVKAALTRTASNMNAAADVYAKTDQSLGTSLSQYAFGSSGE
GLAGVASVGGQPSQATQLLSTPVSQVTTQLGETAAELAPRVVAKVPQLVQ
LAPHAVQMSQNASPIAQTISQTAQQAAQSAQGGSGPMPAQLASAEKPATE
QAEPVHEVTNDDQGDQGDVQPAEVVAAARDEGAGASPGQQPGGGVPAQAM
DTGAGARPAASPLAAPVDPSTPAPSTTTTL
>MT1488 hypothetical protein
MAHVVLTRAATRADTSRAHLRRVVRSIAAFRAQQVDVRR
>MT1768 conserved hypothetical protein
MDLYSNLVEAEQRLVALVSSIEADSYSSPTPCDRWDVRALLSHALASIDA
FAAAVDGAPGPDMAQVFSGADIVGDDPLGATQRITRRSQAAWSTVRDLNA
ELSTFIGVMPAGQALAIITFSTVVHGWDLAVATGQAGELPEHLAEAAQQV
AAELVPVLRPRGLFAHDVDLAGEATPTQRLVALTGRKPR
>MT1732 hypothetical protein
MTIDPDQIRAEIDALLASLPDPADAENGPSLAELEGIARRLSEAHEVLLA
ALESAEKG
>MT1190 hypothetical protein
MGESKSPQESSSEGETKRKFREALDRKMAQSSSGSDHKDGGGKQSRAHGP
VASRREFRRKSG
>MT2152 conserved hypothetical protein
MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYT
PQYQQPADPAYPQQYPQPTPGYAQPEQFGAQPTQLGVPGQYGQYQQPGQY
GQPGQYGQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLF
IGAVLILGFWAPGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKC
NNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEVGRPQ
>MT0772 hypothetical protein
MGPPHRSRPPLPSPGPTCQVLPTTAVIHTVTAEALGRIGIDAPRIPGSLD
VAAHAAIGLLPLVAGCDRRHRRPVRGARAGRAAQVSLCMTAIRVEPVSSN
AVCTGPAAQVGDQSRSPQRDYAHQALQPDVPRRRARRHRPRRCSAKTGSS
SSTMRCTCHQNQCLWSSGVSWALAR
>MT3982 conserved hypothetical protein
MGLRLTTKVQVSGWRFLLRRLEHAIVRRDTRMFDDPLQFYSRSIALGIVV
AVLILAGAALLAYFKPQGKLGGTSLFTDRATNQLYVLLSGQLHPVYNLTS
ARLVLGNPANPATVKSSELSKLPMGQTVGIPGAPYATPVSAGSTSIWTLC
DTVARADSTSPVVQTAVIAMPLEIDASIDPLQSHEAVLVSYQGETWIVTT
KGRHAIDLTDRALTSSMGIPVTARPTPISEGMFNALPDMGPWQLPPIPAA
GAPNSLGLPDDLVIGSVFQIHTDKGPQYYVVLPDGIAQVNATTAAALRAT
QAHGLVAPPAMVPSLVVRIAERVYPSPLPDEPLKIVSRPQDPALCWSWQR
SAGDQSPQSTVLSGRHLPISPSAMNMGIKQIHGTATVYLDGGKFVALQSP
DPRYTESMYYIDPQGVRYGVPNAETAKSLGLSSPQNAPWEIVRLLVDGPV
LSKDAALLEHDTLPADPSPRKVPAGASGAP
>MT0582 hypothetical protein
MISPKPLLHILIHGRSDELPDTRGRIVLRWLRIAVLIVTGLVTLQSVLLV
AGAWRNDIAIQRNMGVAQAEVLSAGPRRSTIEFVTPDRITYRPQLGVLYP
SELSTGMRIYVEYNKRDPNLVRVQHRNAGLAIIPAGSIAVVAWLIAAAAL
VVLAVLDKRLERRENSASATG
>MT0910.4 hypothetical protein
MNVIAIPAVGEYLTLLWFSSGEGIKVSGTVVCLDALHDLGVVRPGILG
>MT2080.1 hypothetical protein
MRNVVPEVGNRDSAVPVVLAGPVSSVIDEIVREGAQQMLAGAQRRCAASK
LVRQRHETAILLGWFNAILLGWFNERQR
>MT3448 PE_PGRS family protein
MLGGKGGDGGNGDHGGPATNPGSGSRGGAGGSGGNGGAGGNATGSGGKGG
AGGNGGDGSFGATSGPASIGVTGAPGGNGGKGGAGGSNPNGSGGDGGKGG
NGGAGGNGGSIGANSGIVGGSGGAGGAGGAGGNGSLSSGEGGKGGDGGHG
GDGVGGNSSVTQGGSGGGGGAGGAGGSGFFGGKGGFGGDGGQGGPNGGGT
VGTVAGGGGNGGVGGRGGDGVFAGAGGQGGLGGQGGNGGGSTGGNGGLGG
AGGGGGNAPDGGFGGNGGKGGQGGIGGGTQSATGLGGDGGDGGDGGNGGN
SGAKAGGAGGKGQAGQPNSGTEPGFGGDGGLGGAGATP
>MT1075 hypothetical protein
MTKPYSSPPTNLRSLRDRLTQVAERQGVVFGRLQRHVAMIVVAQFAATLT
DDTGAPLLLVKGGSSLELRRGIPDSRTSKDFDTVARRDIELIHEQLADAG
ETGWEGFTAIFTAPEEIDVPGMPVKPRRFTAKLSYRGRAFATVPIEVSSV
EAGNADQFDTLTSDALGLVGVPAAVAVPCMTIPWQIAQKLHAVTAVLEEP
KVNDRAHDLVDLQLLEGLLLDADLMPTRSACIAIFEARAQHPWPPRVATL
PHWPLIYAGALEGLDHLELARTVDAAAQAVQRFVARIDRATKR
>MT1192 hypothetical protein
MKTHLTCPCGEAITGKDEDELVELTQAHLASVHPGLEYDRDAILFMAY
>MT1887 DNA-binding protein, CopG family
MSKRLQVLLDPDEWEELREIARRHRTTVSEWVRRTLREAREREPRGDLDM
KLRSVRAAARHEFPTADVEQMLEEIERGRGAEREGSR
>MT3722 conserved hypothetical protein
MTSRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAE
ATSLDTMTQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS
>MT1418.1 hypothetical protein
MVTSVADENVASRIASWGTGPAPDPRLDYAHAHLKGRRGRSPARPNAPIG
ARSFAVGRKICRVERFTLLEHGFVGHALHRVPCAGLVALVMSACSLAVCR
EVGNYAQRRVGRFAFFEQTFVRHALTPRCSRTDSKASYTQLNRICKFPPH
WV
>MT3582 PPE family protein
MVDFGALPPEINSARMYAGPGSASLVAAAKMWDSVASDLFSAASAFQSVV
WGLTTGSWIGSSAGLMVAAASPYVAWMSVTAGQAELTAAQVRVAAAAYET
AYGLTVPPPVIAENRAELMILIATNLLGQNTPAIAVNEAEYGEMWAQDAA
AMFGYAAATATATEALLPFEDAPLITNPGGLLEQAVAVEEAIDTAAANQL
MNNVPQALQQLAQPTKSIWPFDQLSELWKAISPHLSPLSNIVSMLNNHVS
MTNSGVSMASTLHSMLKGFAPAAAQAVETAAZNGVQAMSSLGSQLGSSLG
SSGLGAGVAANLGRAASVGSLSVPQAWAAANQAVTPAARALPLTSLTSAA
QTAPGHMLGGLPLGHSVNAGSGINNALRVPARAYAIPRTPAAG
>MT4014 hypothetical protein
MTGDQNPAPGPAPGVPIKVTPEILLQVLTTPPASGPAPFPAVPVDLPAPA
DIANGALFAAGNSGVPGDVESSGLEDLDRRAHAADAVQKFSANEADAAQQ
FQGVGAQAEA
>MT3767.3 hypothetical protein
MMCSLQDTGFLGYDCILVSDCTVTTLPSSTSAMPAVSSRG
>MT4031 hypothetical protein
MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQA
QQILRALNRVRRDVAAMGADPAWGPAARPAVVDSISAALRSARPNSSPGA
AHAARPHVHPVRMIAGAAGLCAVATAIGVGAVVDAPPPAPSAPTTAQHIT
VSKPAPVIPLSRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLSGLGYPAS
TPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSAADTGLLASTV
VPRA
>MT0291.2 hypothetical protein
MTKTPQVLSLEMGCFLLPIFAGDVPRVGMTERSCVLGGVRCRL
>MT0010 hypothetical protein
MSEQVETRLTPRERLTRGLAYSAVGPVDVTRGLLELGVGLGLQSARSTAA
GLRRRYREGRLAREVAAAQETLAQELTAAQDVVANLPQALQDARTQRRSK
HHLWIFAGIAAAILAGGAVAFSIVRRSSRPEPSPRPPSVEVQPRP
>MT0196 hypothetical protein
MRVIRMTNYEAGTLLTCSHEGCGCRVRIEVPCHCAGAGDAYRCTCGDELA
PVK
>MT2371 hypothetical protein
MPERTVDLAPVLSFLSAHERRRGRTLAPSYALVGATSTTASSCRARFIRR
>MT3172 hypothetical protein
MMRRLNGVDALMLYLDGGSAYNHTLKISVLDPSTDPDGWSWPKARQMFEE
RAHLLPVFRLRYLPTPLGLHHPIWVEDPEFDLDAHVRRVVCPAPGGMAEF
CALVEQIYAHPLDRDRPLWQTWVVEGLDGGRVALVTLLHHAYSDGVGVLD
MLAAFYNDTPDEAPVVAPPWEPPPLPSTRQRLGWALRDLPSRLGKIAPTV
RAVRDRVRIEREFAKDGDRRVPPTFDRSAPPGPFQRGLSRSRRFSCESFP
LAEVREVSKTLGVTINDVFLACVAGAVRRYLERCGSPPTDAMVATMPLAV
TPAAERAHPGNYSSVDYVWLRADIADPLERLHATHLAAEATKQHFAQTKD
ADVGAVVELLPERLISGLARANARTKGRFDTFKNVVVSNVPGPREPRYLG
RWRVDQWFSTGQISHGATLNMTVWSYCDQFNLCVMADAVAVRNTWELLGG
FRASHEELLAAARAQATPKEMAT
>MT2096 hypothetical protein
MAVIPTPLYRPALHPGAAMIAADDDTEKSMMDMARAERAELAAFLTTLTL
QQWETPSLCAGWSVKEVVAHMISYEDLGVFGLLKRFAKGRIVRANEVGVD
EFAGLSPQELADYVGRHLQPRGLTAGFGGMIALVDGMIHHQDIRRPLGQP
RTIPAQRLDRVLRLMPKNPRLRARPRIKGLRLRATDLDWTIGTGPEVTGP
GEALLMAMAGRPAAVSDLSGPGKPTLAGRLG
>MT2467 hypothetical protein
MPGLVPAMPLDALRPARQPTSGLGECATMRRPEAGNEKVAVIWESLDVVP
PESL
>MT1140 hypothetical protein
MATAPYGVRLLVGAATVAVEETMKLPRTILMYPMTLASQAAHVVMRFQQG
LAELVIKGDNTLETLFPPKDEKPEWATFDEDLPDALEGTSIPLLGLSDAS
EAKNDDRRSDGRFALYSVSDTPETTTASRSADRSTNPKTAKHPKSAAKPT
VPTPAVAAELDYPALTLAQLRARLHTLDVPELEALLAYEQATKARAPFQT
LLANRITRATAK
>MT3139.1 hypothetical protein
MSVAGDILGQTGMAAAEVICDTPPRCSSQAYDQGISKMLFRTSCISSPCH
PLGVVFSSTGRSQVHQVSPDPRGAMNFPASASPGRRTWSRYDGNLEPCRA
AQHIEDALLESLPDPVRFPISAKERYAHDRDRLH
>MT2424 hypothetical protein
MCPTLDAHQFEPTQVLRCLDAELARSSADPHPTTGI
>MT1442 transcriptional regulator, CopG family
MIWCMKRTNIYLDEEQTASLDKLAAQEGVSRAELIRLLLNRALTTAGDDL
ASDLQAINDSFGTLRHLDPPVRRSGGREQHLAQVWRATS
>MT1857 PPE family protein
MGHRPAKHHHYLGEPHWAVQHHRAGRYTWRLVADVRPDPRPSPKRPRCGR
PTGPESRRRRVVAIGAATGRVYRRYHASRWWGHRGHRPCDLRRVALGPAG
LGRGRTGDEGGRIGIAGHRRRPRPGRRGTRCLVRRDGPVESGRTRAGRNR
GALWCRSCSRRRRFRHRRRRQHDHHHRHTRGLTGLSRWHLNWVLAPTGEE
RRTVSSPLWPVAGGQPVSGRLRKGVAMDFGLQPPEITSGEMYLGPGAGPM
LAAAVAWDGLAAELQSMAASYASIVEGMASESWLGPSSAGMAAAAAPYVT
WMSGTSAQAKAAADQARAAVVAYETAFAAVVPPPQIAANRSQLISLVATN
IFGQNTAAIAATEAEYGEMWAQDTMAMFGYASSSATASRLTPFTAPPQTT
NPSGLAGQAAATGQATALASGTNAVTTALSSAAAQFPFDIIPTLLQGLAT
LSTQYTQLMGQLINAIFGPTGATTYQNLFVTAANVTKFSTWANDAMSAPN
LGMTEFKVFWQPPPAPEIPKSSLGAGLGLRSGLSAGLAHAASAGLGQANL
VGDLSVPPSWASATPAVRLVANTLPATSLAAAPATQIPANLLGQMALGSM
TGGALGAAAPAIYTGSGARARANGGTPSAEPVKLEAVIAQLQKQPDAVRH
WNVDKADLDGLLDRLSKQPGIHAVHVSNGDKPKVALPDTQLGSH
>MT2625 hypothetical protein
MLPENLEQRVTALESQVRELADRVRASEQDAAAARVLAGAADRDVTEFVG
EFRDFRRATIGSFNALREDFTALREEMTERFSHVEERFSRVDDGFTEMRG
KLDGAAAGQQRIVELIEQLIADQG
>MT0311 PE_PGRS family protein
MMAVGSLIESRRRRRRVVSWRPGVTSRREVPVSFVIAQPEMIAAAAGELA
SIRSAINAANAAAAAQTTGVMSAAADEVSTAVAALFSSHAQAYQAASAQA
AAFHAQVVRTLTVDAGAYASAEAANAGPNMLAAVNAPAQALLGRPLIGNG
ANGAPGTGQAGGDGGLLFGNGGNGGSGAPGQAGGAGGATGFFGNGGNGGD
GGAGANGGAGGTAGWFFGFGGNGGAGGIGVAGINGGLGGAGGDGGNAGFF
GNGGNGGMGGAGAAGVNAVNPGLATPVTPAANGGNGLNLVGVPGTAGGGA
DGANGSAIGQAGGAGGDGGNASTSGGIGIAQTGGAGGAGGAGGDGAPGGN
GGNGGSVEHTGATGSSASGGNGATGGNGGVGAPGGAGGNGGHVSGGSVNT
AGAGGKGGNGGTGGAGGPGGHGGSVLSGPVGDSGNGGAGGDGGAGVSATD
IAGTGGRGGNGGHGGLWIGNGGDGGVGGVGGVGGAGAAGAIGGHGGDGGS
VNTPIGGSEAGDGGKGGLGGDGGGRGIFGQFGAGGAGGAGGVGGAGGAGG
TGGGGGNGGAIFNAGTPGAAGTGGDGGVGGTGAAGGKGGAGGSGGVNGAT
GADGAKGLDGATGGKGNNGNPG
>MT1054.1 hypothetical protein
MAAGPWTARCRVSTYRSSIPCSCLVMAEHSHAGLTSAKVVATRRAEECPP
AQIRVWQM
>MT3036 hypothetical protein
MRIAVLHGVIQDDAIVVVDDLGLVPELDRPRPSAVGRNATAVVSLPVVAL
SPRAGQAGYLWQSITRGLRVTPICCYHPPCGGGVQKMLSRKLGRVCPAPS
PKDAARGAHNVGANAV
>MT0186 hypothetical protein
MSPRRKFEPGEGALLAPQSIEPSRRWGLPLALTASAVVMAAAISACALMR
ISHESHQRAAHKDIVMLSDVRSFMTMFTSPDPFHANEYAERVLSHATGDF
AKQYHERANDILIRISGVEPTTGTVLDAGVQRWNEDGSANVLVVTQITSK
SADGKRVVSNANRWLVTAKQEGNEWKISSLLPVI
>MT3746 hypothetical protein
MTVPDYTAALDEYSRPIRAFRPLKSNRPGDIPT
>MT3412 hypothetical protein
MNGDSASTIDIDKAVTRTPVRRIVRSALDRLWLRQSQPRRYRFLEDSCMA
RALNRL
>MT0271 hypothetical protein
MARSQEPSRGLLDPVAKMLRLPFGTPDFIEKIVTGSVNQVGRRTLYVLIT
TWDAAGGGPFAASAIATTGLAKTAEIVQSMFIGPVFNPLLKMLGADKIAI
RASLCAAQLVGLGIMRYGVRSEPLHSMSVEMLVDAIGPTMQRYLVGDIGR
G
>MT3756.1 hypothetical protein
MVAVLLCVTGAGAYLGSAVVARHRAQAAADLASLAAAARLPSGLAAACAR
ATLVARAMRSSTRSAGWWTSTVVVTVEVAVAFAGVATATARAGPAKVPTT
PG
>MT1247 hypothetical protein
MALVLVYLVVLVLVAIVLFAAASLLFGRGEQLPPLPRATTATTLPAFGVT
RADVDAVKFTQVLRGYKTSEVDWVLERLGRELEALRSQLGAIHASSEDAE
AESDASNPSRGETVVHYRSDPA
>MT1430 PE family protein
MTLRVVPESLAGASAAIEAVTARLAAAHAAAAPFIAAVIPPGSDSVSVCN
AVEFSVHGSQHVAMAAQGVEELGRSGVGVAESGASYAARDALAAASYLSG
GL
>MT2910 hypothetical protein
MTSSEPAHGATPKRSPSEGSADNAALCDALAVEHATIYGYGIVSALSPPG
VNFLVADALKQHRHRRDDVIVMLSARGVTAPIAAAGYQLPMQVSSAADAA
RLAVRMENDGATAWRAVVEHAETADDRVFASTALTESAVMATRWNRVLGA
WPITAAFPGGDE
>MT3522 hypothetical protein
MREFGNPLGDRPPLDELARTDLLLDALAEREEVDFADPRDDALAALLGQW
RDDLRWPPASALVSQDEAVAALRAGVAQRRRARRSLAAVGSVAAALLVLS
GFGAVVADARPGDLLYGLHAMMFNRSRVSDDQIVLSAKANLAKVEQMIAQ
GQWAEAQDELAEVSSTVQAVTDGSRRQDLINEVNLLNTKVETRDPNATLR
PGSPSNPAAPGSVGNSWTPLAPVVEPPTPPTPASAAEPSMSAGVSESPMP
NSTSTVAASPSTPSSKPEPGSIDPSLEPADEATNPAGQPAPETPVSPTH
>MT2011 hypothetical protein
MIPASMGLRFQVPPDLVSFTITASWITYETVESGR
>MT2138 hypothetical protein
MGSNELQVVLGQLEVAASQSQGLGAQFAASATPPESGQPFQATTVAVSGI
NAAICCAAAEFATRTQATATGVAAAAAAYAHQEATAASEMAAVTQVTVV
>MT0177 hypothetical protein
MLVGAVRLPVEHRGGDLRGQERHNHRQRVQRDDHHQPGHDPGGGQKRDRL
DAHHFQGIDLLADPHRAQLGRSTRTDRGRQGDTRDNWAGDAHVDQCGEEP
GERLDTDVAQRREALDCDQRAAGQRDKANDGDRAADDGHRAGTHADLGDQ
PQRLLAVVAQRVRDLPDAGQSEPGELAGLVQSTGRRTSILTKVGDRPREA
GAQHSSGRHVSAPSRIAHRWSSRSR
>MT3180 hypothetical protein
MHRRTALKLPLLLAAGTVLGQAPRAAAEEPGRWSADRAHRWYQAHGWLVG
ANYITSNAINQLEMFQPGTYDPRRIDNELGLARFHGFNTVRVFLHDLLWA
QDAPGFQTRLAQFVAIAARYHIKPLFVLFDSCWDPLPRPGRQRAPRAGVH
NSGWVQSPGAERLDDRRYASTLYNYVTGVLGQFRNDDRVLGWDLWNEPDN
PARVYRKVERKDKLERVAELLPQVFRWARTVDPVQPLTSGVWQGNWGDPG
RRSTISAIQLDNADVITFHSYAAPAEFEGRIAELAPLQRPILCTEYLARS
QGSTVEGILPIAKRHNVGAFNWGLVAGKTQTYLPWDSWDHPYRAPPKVWF
HDLLHPNGRPYRDGEVQTIRKLNGMPSQD
>MT3996 hypothetical protein
MTQSQTVTVDQQEILNRANEVEAPMADPPTDVPITPCELTAAKNAAQQLV
LSADNMREYLAAGAKERQRLATSLRNAAKAYGEVDEEAATALDNDGEGTV
QAESAGAVGGDSSAELTDTPRVATAGEPNFMDLKEAARKLETGDQGASLA
HFADGWNTFNLTLQGDVKRFRGFDNWEGDAATACEASLDQQRQWILHMAK
LSAAMAKQAQYVAQLHVWARREHPTYEDIVGLERLYAENPSARDQILPVY
AEYQQRSEKVLTEYNNKAALEPVNPPKPPPAIKIDPPPPPQEQGLIPGFL
MPPSDGSGVTPGTGMPAAPMVPPTGSPGGGLPADTAAQLTSAGREAAALS
GDVAVKAASLGGGGGGGVPSAPLGSAIGGAESVRPAGAGDIAGLGQGRAG
GGAALGGGGMGMPMGAAHQGQGGAKSKGSQQEDEALYTEDRAWTEAVIGN
RRRQDSKESK
>MT3103 hypothetical protein
MGLMGPHPNAVALLVDPVADVVGEELGVWLAASLPAVVRGQLRRRGCEES
AASSSTPG
>MT3573.5 hypothetical protein
MAETPDHAELRRRIADMAFNADVGMATCKRCGDAVPYIILPNLQTGEPVM
GVADNKWKRANCPVDVGKPCPFLIAEGVADSTDDTIEVDQ
>MT0781 hypothetical protein
MSLAAHSLSTTRCERTDRDDERRAGRRPGSGMRSAMLLSGLTTMVSAITA
SCSVAMAGSSASIPGVSRANVSATSASCSSGPSLAVILSISPPSASSRSA
SPSSESTRSISPSSASSWSASPPGCSARCMAPPSAGRRSSKPPGWVSRST
TPPGRSIRSIGPLGAMARPSGISSSNSLASRLARAIVSSIPSMLAIEVDP
LAPPASPNACLAMSTAAPAMFKPVSAAASPARAGQVAIPTRVWPRFILRV
YSRRAVD
>MT0769 hypothetical protein
MTRQQLAHLLRRACAVVGDVDVLVLGSQSILGSFDENELPPQATASQEAD
IAFVNDPARDKADHVDVAIGEMSDFHRSNGVYAEGVHIDTAILPNGWRDR
LVSWTVESSRPAKPRFLEPHDLAVAKLAAGREKDKAFVAALIRSGLLDVG
VIQARVLLLPEETDPRIGQRIAAWLNYYGAGNHSS
>MT0645.2 hypothetical protein
MRTTIDLPQDLHKQALAIARDTHRTLSETVADLMRRGLAANRPTALSSDP
RTGLPLVSVGTVVTSEDVRSLEDEQ
>MT0494 hypothetical protein
MHKRVHRCHQDGDQQYQHDGTHEVAHAFRLCGLHPTRRLRVAPWATRSRW
>MT3422 DNA-binding protein, CopG family
MRTTLSIDDDVLLAVKERARREKRTAGEILSDLARQALTNQNPQPAASQE
DAFHGFEPLPHRGGAVSNALIDRLRDEEAV
>MT1211 hypothetical protein
MRLSLTALSAGVGAVAMSLTVGAGVASADPVDAVINTTCNYGQVVAALNA
TDPGAAAQFNASPVAQSYLRNFLAAPPPQRAAMAAQLQAVPGAAQYIGLV
ESVAGSCNNY
>MT1759 hypothetical protein
MTEALCDKLVGAWDLVSYVERAAALALGYLAYGGR
>MT1497.2 hypothetical protein
MVDRVTVAAEPQGRFTRLRRHIGTARQPEPRSGQFSHQPDENRYLGLISV
TP
>MT2734 gene 36 protein, putative
MCAFPSPSLGWTVSHETERPGMADAPPLSRRYITISEAAEYLAVTDRTVR
QMIADGRLRGYRSGTRLVRLRRDEVDGAMHPFGGAA
>MT3645 hypothetical protein
MTVVGAVLPELKLYGDPTFIVSTALATRDFQDVHHDRDKAVAQGSKDIFV
NILTDTGLVQRYVTDWAGPSALIKSIGLRLGVPWYAYDTVTFSGEVTAVN
DGLITVKVVGRNTLGDHVTATVELSMRDS
>MT3954 hypothetical protein
MAIRVDLDGRKDASGRPIQDSPGRLVDAGIHGAVTVEAAELALKEFCLGA
CVAGIRAIPGLSDRAAQVNQ
>MT0568 hypothetical protein
MLGGHRDQCRVAATQHSLQGRQQQRGSRTGRDVQRTQDGYACSSRSRTSS
SGHPWAANRARTRSTSSGDGNSSVKWVITTPMSIWSKSPSRISSLATALT
SSWLSRRQSRASTAKESVGGMPSGYPARNHATIEVRNRFTLLYLPVGAVA
KPYVAAICDSTWVWRPGRPSRCDAR
>MT2586.1 hypothetical protein
MVSVVQLVEHQVVILGVAGSSPVTHPNRAAGCLWPWALCCPRRGLAPATF
VSMIWSRGEFDHSPRLHLAARSNTRRSNCHRTACSPSPGPWQVSVCPVYL
PAAQLSHLRRGLFGLTRCFVPWLMCRTFETPTS
>MT3800 DNA-binding protein, CopG family
MRTTIDLDDDILRALKRRQREERKTLGQLASELLAQALAAEPPPNVDIRW
STADLRPRVDLDDKDAVWAILDRG
>MT3710 conserved hypothetical protein
MGPTRKRDLTAAVVGAAAVGYLLVAVLYRWFPPITVWTGLSLLAVAVAEA
LWARYVRVKISDGEIGDGPGWRHPLVVARSLMVAKASAWVGALVTGWWIG
VLAYFLPRRSWLRAAAEDTTGTVVAAGSALALVVAALWLQHCCKSPQDPT
EHADGAES
>MT0300 PE family protein
MSLLDAHIPQLVASQSAFAAKAGLMRHTIGQAEQAAMSAQAFHQGESSAA
FQAAHARFVAAAAKVNTLLDVAQANLGEAAGTYVAADAAAASTYTGF
>MT1044 lipoprotein, putative
MAGRRCPQDSVRPLAVAVAVATLAMSAVACGPKSPDFQSILSTSPTTSAV
STTTEVPVPLWKYLESVGVTGEPVAPSSLTDLTVSIPTPPGWAPMKNPNI
TPNTEMIAKGESYPTAMLMVFKLHRDFDIAEALKHGTADARLSTNFTELD
SSTADFNGFPSSMIQGSYDLHGRRLHTWNRIVFPTGAPPAKQRYLVQLTI
TSLANEAVKHASDIEAIIAGFVVAAK
>MT1808 hypothetical protein
MVTRCVLHIGSLGTSATVTAVTRVPVGLVGHRRQRFLRALICRRWAGIAA
TAVSGLVGVSRQIGIDEFVIELPAKTT
>MT0294 hypothetical protein
MHKLTVEATATNRLGPAAKLLGTAGKIMTDTPFRW
>MT2218 hypothetical protein
MKFVNHIEPVAPRRAGGAVAEVYAEARREFGRLPEPLAMLSPDEGLLTAG
WATLRETLLVGQVPRGRKEAVAAAVAASLRCPWCVDAHTTMLYAAGQTDT
AAAILAGTAPAAGDPNAPYVAWAAGTGTPAGPPAPFGPDVAAEYLGTAVQ
FHFIARLVLVLLDETFLPGGPRAQQLMRRAGGLVFARKVRAEHRPGRSTR
RLEPRTLPDDLAWATPSEPIATAFAALSHHLDTAPHLPPPTRQVVRRVVG
SWHGEPMPMSSRWTNEHTAELPADLHAPTRLALLTGLAPHQVTDDDVAAA
RSLLDTDAALVGALAWAAFTAARRIGTWIGAAAEGQVSRQNPTG
>MT0814 hypothetical protein
MSRRAIHSGRAAPRRSGNSHLVLRNRVPSSKDSPRRRPHHEFMTESIGEP
LSTNLIERYLRARGRRYFRGHHDAEFFFVANAHLRLHVHLEISPAYRDVF
TIRVSPAYFFPATDHTRLAEIVNAWNLQNHEVTAIVHGSSDPHRIGVAAE
RSLIRDRIRFDDFATFVDNAVSAATELFGQLTAAGLPPTATPPLLRDAG
>MT1142 hypothetical protein
MSAQRARSAVQASHRSIHPHIPGVPWWAAILIAVTATAIGYAIDAGSGHK
ALTLVFTGCYIAGCVGAVLAVRQSDLFTALVQPPLILFCAVPGAYWLFHG
GTIGKFKDLLINCGYSLIERFPLMLGTAAGVLLIGLVRWYLGTALFDSIA
RKLSSLMTGDSDDDGGRRSAQRPARTRSRHARPPSEDNREPIAERRSRRR
PRPQNDPHPRGNAHERPAPRSSRFDSYRSYQPSEPSGPAEPVNRYERRGA
RYQPYARYEPTYEPQRRRARPSEPTNPTHHPISQVRYRGSATRDARRDNY
REEQRFDRRDRSRAPRRPPAESWEYDV
>MT1560.1 hypothetical protein
MPQVDKSRRLVSEHSVLGSDPSRRRCNLIWLKLVRTEGRAHRRVATVTCV
TAPGDTRCPGSQRQLSDLWFCSQVGRGCAGLEVAWGSRRMDSPPRFRAKR
PQAQWSPTWPRWSLASHFPGRPRQPGGRCRTGVPYPHLL
>MT2361.1 hypothetical protein
MRRSLGPPGLGIVARSAPSPTIRCRRLDPLLPTRSGLLLNRFYGRGESPR
VLPPRRSWR
>MT0766 hypothetical protein
MMRLECPAGLDLRTPNPEAYEITGQRPGEFVFVLGYLGHVRAIVGNCYIE
IMPMGTRVELSKLADVALDIGRSVGCSAYENDFTLPDIPTQWRNQPLGWY
TQGLAPYLPGLSDPKDAAEG
>MT2369 hypothetical protein
MSNGFVPRIVAVLTLTLVLGYCGLRFGEAAALRRKNVGTGS
>MT2527 hypothetical protein
MGRAVSVRHGSGALDLPGAAASRRLRVGQPIQPSPAPLARGSVDSIVEIS
CCPSAGPRGPYDNDLDSSSPANRDISSITSRSRRGGTIVVAGQKCGFGSA
VSLRPRRYREPNHANIVTPDTDLSPSWPWSGI
>MT0868 hypothetical protein
MAGTGGTIDPARLSTCAGVTSLPPAHRLTGNNGHSPFQW
>MT3585 hypothetical protein
MRGLLPVAGHWVSVLTGLVPLALVIALSPLSVIPAVLVVHSPQPRPSSLA
FLGGWLLGLAVVTAVFVAASGALGGLSTTSPAWASWLRVVLGSALIVFGV
LRWLTRHRHTEMPGWMRAFASFTPARAGLVGAVLVVVRPEVLIICAAAGL
AIGSGGHGAAGSWIYTAFFAMLAASTVAIPILAYVAAGDRLDDSLERLKD
WMEKNHAGMVAAILVVIGLLLLYNGVHAM
>MT2703 hypothetical protein
MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSA
TIYQVTDRSHDGRTARVPGDEITSTVSGWLSELGTQSPLADELARAVRIG
DWPAAYAIGEHLSVEIAVAV
>MT2041 chitinase-related protein
MAGLNIYVRRWRTALHATVSALIVAILGLAITPVASAATARATLSVTSTW
QTGFIARFTITNSSTAPLTDWKLEFDLPAGESVLHTWNSTVARSGTHYVL
SPANWNRIIAPGGSATGGLRGGLTGSYSPPSSCLLNGQYPCT
>MT1838.1 hypothetical protein
MVRCGWQLVGAEISVGYLRWDFFPKPGSAPDFLTVPRLNGTAPSASSGGV
DRGIHAGGDGAARLVVCAEAQGTWRLWLPGCSWPPNVPQRRGQRRSSTAI
>MT3117 hypothetical protein
MKPQDQGLHFPYRYDLRLAPMWLPFRWPGSQGVTVTEDGRFVARYGPFRV
EAPLSSVRDAHITGPYRWWTAVGPRLSMVDDGLTFGTNAAAGVCIHFEPR
IHRVIGLRDHSALTVTVADPEGLVAALSS
>MT3780 conserved hypothetical protein
MTQPTAWEYATVPLLTHATKQILDQWGADGWELVAVLPGPTGEQHVAYLK
RPK
>MT0260 hypothetical protein
MHAHLRSRGRRWSVAKTSHRVSSADGMSKRILRLIIAQSGFYSAALQLGN
VSIVLPFVVAELDAELWIAALIFPAFTAGGAIGNVVAPPAVAAVPRRHRL
FIIVSCLAVLAGVNALCATIGKGSVAGILLVVNVTLIGVVSAISFVAFAD
LVAAMPSGTARARILLTEVGVGAALTAVVAATLSFVPDQHPLSRNIHLLW
TAAVAMAISAAICRALPHRIVPRVHAAPGLHKLVYVGWTAIRTNGWYRRY
LLVQVLFGSVVLGSSFHSIRVAAVPGDQPDEVVAVVLFVCVGLLGGIALW
NRVRERFGLVGLFVGSALVSIAAAVLSIAFDLAGAWPNVVAIGLVIALVS
IANQSVFTAGQLWIARDAEPGLRTSLISFGQLVINAGLVGMGLALGLIAQ
DHDAVWPVMIVLLLNLTAAYSATRFAPAKSVDVRGLPQVSRTSRPKTGG
>MT2361 hypothetical protein
MSHDIATEEADDGALDRCVLCDLTGKRVDVKEATCTGRPATTFEQAFAVE
RDAGFDDFLHGPVGPRSTP
>MT0457 hypothetical protein
MGAKKVDLKRLAAALPDYPFAYLITVDDGHRVHTVAVEPVLRELPDGPDG
PRAVVDVGLIGGRTRQNLAHRSEVTLLWPPSDPSGYSLIVDGRAQASDAG
PDDDTARCGVVPIRALLHRDAAPDSPTAAKGCLHDCVVFSVP
>MT3377.1 hypothetical protein
MAIPELADHTFRGDQPAGDVPAWCADPQRDRHTASTDQQRGVPGPDLRAD
FSHRDVDYRVRVTRSARVLQHSALREVHALLYHEVFDTLGSDESPS
>MT2466 hypothetical protein
MTMTASVAKVTAARPEPSAAWAEARRRVRQRREDMLRHPAFLSKQLPAEP
ADDDGVAAVYDIAIARRRRPA
>MT0901 PPE family protein
MMNFMVLPPEVNSARIYAGAGPAPMLAAAVAWDGLAAELGMAAASFSLLI
SGLTAGPGSAWQGPAAAAMAAAAAPYLSWLNAATARAEGAAAGAKAAAAV
YEAARAATAHPALVAANRNQLLSLVLSNLFGQNLPAIAATEASYEQLWAQ
DVAAMVGYHGGASTVASQLTPWQQLLSVLPPVVTAAPAGAVGVPAALAIP
ALGVENIGVGNFLGIGNIGNNNVGSGNTGDYNFGIGNIGNANLGNGNIGN
ANLGSGNAGFFNFGNGNDGNTNFGSGNAGFLNIGSGNEGSGNLGFGNAGD
DNTGWGNSGDTNTGGFNSGDLNTGIGSPVTQGVANSGFGNTGTGHSGFFN
SGNSGSGFQNLGNGSSGFGNASDTSSGFQNAGTALTRASSTWADSPRAWP
IRAPSRLQVWRTRATTARECSIRVIISRVSSTGAPPQKKVGNSG
>MT0285 hypothetical protein
MGRKPKVALIAAHYQIDFSEHYLAEYMAIRGIGFLGWNTRFRGFESSFLL
DHALVDIGVGVRWLREVQGVETVVLLGNSGGGSLMAAYQSQAVDPNVTPL
DGMRPAAGVTELPAADAYVAAAAHPGRPDVLTAWMDAAVIDENDPVATDP
ELDLFDERNGPPYSPEFISRYRSAQVKRNHTITDWAESELKRVRAAGFSD
RPFSVMRTWADPRMVDPSIEPTKRRPNQCYAGTPVKANRSAHGIAAACTL
RGWLGMWSLRVAQTRAAPHLARITCPALVLNAEADTGIFPSDAQQIYDGL
ASSDKTQVSIDTDHYFTTPGARSEQADTIAKWIAKRWR
>MT1840 PE family protein
MGEEDSMSFVTTQPEALAAAAANLQGIGTTMNAQNAAAAAPTTGVVPAAA
DEVSALTAAQFAAHAQMYQTVSAQAAAIHEMFVNTLVASSGSYAATEAAN
AAAAG
>MT3745 hypothetical protein
MISLPLCPALLGRGFQPYRAFHRGDDGLADRVRDNPFEVLSADRPPVGAT
LASD
>MT2694 hypothetical protein
MLCSTQAMFGSRPRVSIGQGFQHGQHDHRILDGVQRMPCRRNRDVVAGPA
VPRVLTGGKAHMALQHLQRRLARAVMLGQVVAGKQCQHRLPKLVGVTTVD
GVGSPSAVCLLRLGQLFGGQAGQRNGFHRRVLSAVQ
>MT0103 conserved hypothetical protein
MDWLHPDGDLTDTERARKRGITLSNQQYDGMSRLSGYLTPQARATFEAVL
AKLAAPGATNPDDHTPVIDTTPDAAAIDRDTRSQAQRNHDGLLAGLRALI
ASGKLGQHNGLPVSIVVTTTLTDLQTGAGKGFTGGGTLLPMADVIRMTSH
AHHYSPASGRYPQAIFDHGTPLALYHTKRLASPAQRIMLFANDRGCTKPG
CDAPAYHSQAHHVTAWTSTGRTDITELTLACGPDNRLAEKGWTTHKNTHG
HTEWLPPPHLDHGQPRTNTFHHPERFLHNQDDDDKPD
>MT3586 hypothetical protein
MEHDVATSPPAGWYTDPDGSAGQRYWDGDRWTRHRRPNPSAPRSPLALRV
DGLRSRWLGMPAGLRLTVPVAAVLTMVGVAVYAWIRPLPDDWSQLPKRLS
CQLRPGPTPPATITVASVDVSHPRGAVLRLVVRFAEPLPPSPSGSFASGF
AGYLLTYTIANNGKEFAELGPQQDTDELAIRKPGESRGTEPNMRPDRNTN
ARRTAPDTVEINLETKRLGLDQAPVDPQLTFAAQFRTPSTVTVDFGSQFC
QGERLAGQRR
>MT0910.2 hypothetical protein
MSAPPAQAPVCGALAARPTAPGNASCTRPAKRDCRYGSRCETCLPFALAK
DCRQASSRLQAATEPDETTTTSVISMRSPGSLQYQPAT
>MT0032 hypothetical protein
MGDVGDQDRRAVRVNADRPVQPWLLTSHERQEALQSGFVILGGGNRPIRT
GLVSGLNYRDGRHESCRVQIRAAGAAHTRNFHGNAPRYLPKLAARSITLS
SSARRVESAIPARQMTISSSLAEPTIRCRSAFRCSASVAVSAVLDESSAG
QSPPITRVVQSSEELNVGIVRDSRVALRSRWRALHRPKIATG
>MT1861 hypothetical protein
MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGL
PIPPIIHYGAIAYAPSGASGKAWHQRTPARAEQVALEKCGDKTCKVVSRF
TRCGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGGRIVNWACN
>MT3362 hypothetical protein
MAPRLEILQEGRQDLGAVIDAAVGGALSVMLGNIPLVVPNANQL
>MT1506 conserved hypothetical protein
MAARHHTLSWSIASLHGDEQAVGAPLTTTELTALARTRLFGATGTVLMAI
GALGAGARPVVQDPTFGVRLLNLPSRIQTVSLTMTTTGAVMMALAWLMLG
RFTLGRRRMSRGELDRTLLLWMLPLLIAPPMYSKDVYSYLAQSEIGRDGL
DPYRVGPASGLGLGHVFTLSVPSLWRETPAPYGPLFLWIGRGISSLTGEN
IVAAVLCHRLVVLIGVTLIVWATPRLAQRCGVAEVSALWLGAANPLLIMH
LVAGIHNEALMLGLMLTGIEFALRGLDMANTPRPSPETWRLGPATIRASR
RPELGASPRAGASRAVKPRPEWGPLAMLLAGSILITLSSQVKLPSLLAMG
FVTTVLAYRWGGNLRALLLAAAVMASLTLAIMAILGWASGLGFGWINTLG
TANVVRSWMSPPTLLALGTGHVGILLGLGDHTTAVLSLTRAIGVLIITVM
VCWLLLAVLRGRLHPIGGLGVALAVTVLLFPVVQPWYLLWAIIPLAAWAT
RPGFRVAAILATLIVGIFGPTANGDRFALFQIVDATAASAIIVILLIALT
YTRLPWRPLAAEQVVTAAESASKTPATRRPTAAPDAYADST
>MT1303 hypothetical protein
MVLARPDAVFAPARNRCHVSLPVNAMSLKMKVCNHVIMRHHHMHGRRYGR
PGGWQQAQQPDASGAAEWFAGRLPEDWFDGDPTVIVDREEITVIGKLPGL
ESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGE
RILFTHIAVPVMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHT
EEWLAKLRTAMSAVDDLRAQGPDLPA
>MT2889 CRISPR-associated protein, TM1810 family
MSVIQDDYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSA
NPTLPRQLKEKVQYLRVRFVYQSGREDAVKTFVRNAKLLEALEGIGDSRD
GLLRFCRYMEALAAYKKYLDPKDK
>MT0066.2 hypothetical protein
MSKAGARAESRAIAAQSDASYTIATAAVPSRVSASSWYVDAGTASTEQDF
RLFSGMSAHYDSLPRLIRKR
>MT2422 PPE family protein
MWIVLPSARGLVCVWRVVGGAVEAAGWFGVAMILDFSWLPPEINSARIYA
GAGSGPLFMAAAAWEGLAADLRASASSFDAVIAGLAAGPWSGPASVAMAG
AAAPYVGWLSAAAGQAELSAGQATAAATAFEAALAATVHPAAVTANRVLL
GALVATNILGQNTPAIAATEFDYVEMWAQDVGAMVGYHAGAAAVAETLTP
FSVPPLDLAGLASQAGAQLTGMATSVSAALSPIAEGAVEGVPAVVAAAQS
VAAGLPVDAALQVGQAAAYPASMLIGPMMQLAQMGTTANTAGLAGAEAAG
LAAADVPTFAGDIASGTGLGGAGGLGAGMSAELGKARLVGAMSVPPTWEG
SVPARMASSAMAGLGAMPAEVPAAGGPMGMMPMPMGMPAGMMGRGGANPH
VVQARPSVVPRVGIG
>MT2554.1 hypothetical protein
MTVILRTMATYGPRQPAIEGATTMKTRNPRTLLTWLLGAIVTGLYVVFAT
GCQLQAPAPPTPEIGWSGPQAPLPAPDAAPTHLGV
>MT1839 PPE family protein
MSFATPQPEKGFGMDFGALPPEINSGRMYCGPGSGPMLAAAAAWDGVAVE
LGLAATGYASVIAELTGAPWVGAASLSMVAAATPYVAWLSQAAARAEQAG
MQAAAAAAAYEAAFVMTVPPPVITANRVLVMTLIATNFFGQNSAAIAVAE
AQYAEMWAQDAVAMYGYAAASASASRLIPFAAPPKTTNSAGVVAQAVASV
SWPNPNDWWLVRLLGSITPTERTTIVRLLGQSYLATGMARFLTSIAQQLT
FGPGGTTAGSGGAWYPTPQFAGLGAGPAVSASLARAEPVGRLSVPPSWAV
AAPAFAEKPEAGTPMSVIGEASSCGQGGLLRGIPLARAGRRTGAFAHRYG
FRHSVITRSPSAG
>MT1742 hypothetical protein
MSSTVGTEPKALPSSTTLANRFEVQPEAAGLRLMRKTGALRSLNCWITLG
RHRGRSTDTSGNAWPHRIDSV
>MT1586 hypothetical protein
MTAALHNDVVTVASAPKLRVVRDVPPAPASKKVARRLDAQPFGTGGDPLV
DGAARLLSIPLRHLYAALWRVGLLEVQA
>MT3284 hypothetical protein
MADGVWGEAGFEGTTTRIREPTSTREQTQKSPISGEIGDFCVCSPRAPRL
TTRRR
>MT3995 conserved hypothetical protein
MDELDPHVARALTLAARFQSALDGTLNQMNNGSFRATDEAETVEVTINGH
QWLTGLRIEDGLLKKLGAEAVAQRVNEALHNAQAAASAYNDAAGEQLTAA
LSAMSRAMNEGMA
>MT2692 hypothetical protein
MSIRPTTSPALADQLKDPAYSAYVLLRTLFTVAPILFGLDKFFNLLTHPQ
HWNMYLAGWINDLVPGTADQCMYLVGAIEIVAGVLVAVAPRIGAWVVAAW
LAGIILNLVTGPGFYDIALRDFGLLVGAIALARLAQGVHSGGIGRP
>MT3930 conserved hypothetical protein
MKCPGVSDCVATVRHDNVFAIAAGLRWSAAVPPLHKGDAVTKLLVGAIAG
GMLACAAILGDGIASADTALIVPGTAPSPYGPLRSLYHFNPAMQPQIGAN
YYNPTATRHVVSYPGSFWPVTGLNSPTVGNSVSAGTNNLDAAIRSTDGPI
FVAGLSQGTLVLDREQARLANDPTAPPPGQLTFIKAGDPNNLLWRAFRPG
THVPIIDYTVPAPAESQYDTINIVGQYDIFSDPPNRPGNLLADLNAIAAG
GYYGHSATAFSDPARVAPRDITTTTNSLGATTTTYFIRTDQLPLVRALVD
MAGLPPQAAGTVDAALRPIIDRAYQPGPAPAVNPRDLVQGIRGIPAIAPA
IAIPIGSTTGASAATSTAAATAAATNALRGANVGPGANKALSMVRGLLPK
GKKH
>MT1567 conserved hypothetical protein
MWTMVLLLGLGMAIDPARLGLAVVMLSRRRPMLNLFAFWVGGMVAGVGIA
LAVLVFMRDVALAAIQGVVSAANEFREAVGILAGGRLHIVIGVIMLLLAA
RMVARARAQVGVPVGPVGVADGGMSALALAQRPPGLVARLEVRTQQMLQG
DVVWPAFVVGVASSAPPFESVVALTVIMASGAEIGTQFGAFVVFTLLVLA
VIEIPLVAYLAIPQQTQQVMLRFQDWVRSNRRQISLTILIGVGFLFLYQG
VTSL
>MT1028 hypothetical protein
MRPPLAPQFAADLLVKTVSTLRSSGAALGRLTTMRKAVLAVGSVCWLVGC
SSGASSTTASTGDIAKVAEVKSGFGPEYTVTDVTPRAIDPGFFSARKLPD
GLSFDPANCAQVAAGPQLPTGLQGNMAAVSAEGNGNRFVVIAVETSQPLP
APSPGKDCSKVTFSGTQLRGGIEVVDVPHIDGTQTLGVHRVLQAVVGGSA
RTGELYDYSARFGDYQVIVIANPLVIPGRPVARVDTQRARDLLVQAVAAV
RG
>MT2722 hypothetical protein
MPFTTRDVQASLMRIPAPASNTTGLTETSRLTVDKVTTSPAPA
>MT1431 PPE family protein
MTEPWIAFPPEVHSAMLNYGAGVGPMLISATQNGELSAQYAEAASEVEEL
LGVVASEGWQGQAAEAFVAAYMPFLAWLIQASADCVEMAAQQHVVIEAYT
AAVELMPTQVELAANQIKLAVLVATNFFGINTIPIAINEAEYVEMWVRAA
TTMATYSTVSRSALSAMPHTSPPPLILKSDELLPDTGEDSDEDGHNHGGH
SHGGHARMIDNFFAEILRGVSAGRIVWDPVNGTLNGLDYDDYVYPGHAIW
WLARGLEFFQDGEQFGELLFTNPTGAFQFLLYVVVVDLPTHIAQIATWLG
QYPQLLSAALTGVIAHLGAITGLAGLSGLSAIPSAAIPAVVPELTPVAAA
PPMLAVAGVGPAVAAPGMLPASAPAPAAAAGATAAGPTPPATGFGGFPPY
LVGGGGPGIGFGSGQSAHAKAAASDSAAAESAAQASARAQARAARRGRSA
AKARGHRDEFVTMDMGFDAAAPAPEHQPGARASDCGAGPIGFAGTVRKEA
VVKAAGLTTLAGDDFGGGPTMPMMPGTWTHDQGVFDEHR
>MT3371 hypothetical protein
METTTEHRDESTLDSPVSVAREAEWQRNVRWARWLAWVSLAVLLTEGAVG
LWQGIAVGSVALTGWALGGGSEGLASAMVLWRFTGDRTWSATAEHRAQRG
VAVSFWLTAPYLVAESIRHLAGEHRAETSVIGIGLTAIALLLMPVLGWAN
HRVGERLGSGATAGEGTQNYLCAAQAAAVLLGLAITAVWSNGWWIDPAIG
LAIAGIAVWQGIRTWRGHGCGC
>MT2710 hypothetical protein
MLPLGANITLAELPEPELRQLFPHEELQIPVSCGGLGAGAGGRMDMRAVG
LVPVRRASRYCTGFFVLIHSYLTLMGRGARLRLSR
>MT1863 conserved hypothetical protein
MVRLVPRAFAATVALLAAGFSPATASADPVLVFPGMEIRQDNHVCTLGYV
DPALKIAFTAGHCRGGGAVTSRDYKVIGHLRAFRDNTPSGSTVATHELIA
DYEAIVLADDVTASNILPSGRALESRPGVVLHPGQAVCHFGVSTGETCGT
VESVNNGWFTMSHGVLSEKGDSGGPVYLAPDGGPAQIVGIFNSVWGGFPA
AVSWRSTSEQVHADLGVTPLA
>MT1634 hypothetical protein
MSAKDHPNNAPGVPMVFPLWLERLQVKYINRALKPIARYLPGTATIEHRG
RKSGKPYQTIVTAYRKDGVLAIALAHGKTDWVKNVLAAGEADVHFARGVV
HVINPRIVPAGSDGQGLPRMARLQLRRIGVFVGDIA
>MT1404 hypothetical protein
MMALMNEIDLYEHKTPLPISPTSEGTPMTETHFRQTLYECAVKLRELAYT
LPQGVGEHALLRMSEQMIVTAGQVAPRV
>MT3943 hypothetical protein
MLDAPEQDPVDPGDPASPPHGEAEQPLPGPRWPRALRASATRRALLLTAL
GGLLIAGLVTAIPAVGRAPERLAGYIASNPVPSTGAKINASFNRVASGDC
LMWPDGTPESAAIVSCADEHRFEVAESIDMRTFPGMEYGQNAAPPSPARI
QQISEEQCEAAVRRYLGTKFDPNSKFTISMLWPGDRAWRQAGERRMLCGL
QSPGPNNQQLAFKGKVADIDQSKVWPAGTCLGIDATTNQPIDVPVDCAAP
HAMEVSGTVNLAERFPDALPSEPEQDGFIKDACTRMTDAYLAPLKLRTTT
LTLIYPTLTLPSWSAGSRVVACSIGATLGNGGWATLVNSAKGALLINGQP
PVPPPDIPEERLNLPPIPLQLPTPRPAPPAQQLPSTPPGTQHLPAQQPVV
TPTRPPESHAPASAAPAETQPPPPDAGAPPATQSPEATPPGPAEPAPAG
>MT0610 hypothetical protein
MQHRIGCGLRNDGEAPTADRPQRREVPTVQGEDRARAVSVGQHDVGRIRD
PDPLILVFRDDAGGLLDFGGTAVGELPGPAGEFTQDGELGIDTYPGGDEI
VEFRYHIRGYDERVCGAIDALRHCGVVWLGGVEIGKQRTRVDDHRSPKPA
SSSSTRRAMGSEPAYRPPRGGGLAPPTAARMDSRITCASETPRCRAARFT
AALSSSGR
>MT0806 conserved hypothetical protein
MTASSSHSVEKPCTSDQVASSAPAGFARALGSRSAATPRASSAAAAVSAS
TSSESEPSSSRAHFTPPLRGHLIAIVSPGLLAPGLLVTITGALEAPGRRG
PVAARHPHGSGGMLAGADAGHVRSQCLRIPNGRSHRLCWRSAIEVVSLAA
AVTVAVGSS
>MT0934 conserved hypothetical protein
MAKLSGSIDVPLPPEEAWMHASDLTRYREWLTIHKVWRSKLPEVLEKGTV
VESYVEVKGMPNRIKWTIVRYKPPEGMTLNGDGVGGVKVKLIAKVAPKEH
GSVVSFDVHLGGPALLGPIGMIVAAALRADIRESLQNFVTVFAG
>MT3634 hypothetical protein
MYSDPLREAIAEAEQLVAAAPHIETEADLLEGLQYLAGCIAGCMHLAFDY
ERDHPFLQSGTGPFTKMGLDNPDTLYFGTRLQANRDYVVSGRRGTTTDLS
FQLLGGEYTDYNVPASQAAFDDRELDIAADGSFEWRLRPSAPGQLVIREV
YGDWSQQRGTLAIARLDTVGTAPPPLTRELMEKRYATAGSQLVNRVKTWL
QFPQWFYLNIPVNTMVAPRLTPGGLATQYSSAGHFELRPGQALVITVPVS
DAPYLGFQLGSMWYISLDYINHQTSLNASQAQADPDGKVRIVVAEQNPGV
TNWVETLGHRRGFLQFRWQRVSRELTEADGPTVELVDFDAIPAALPHYQH
NKISEDDWRARIALRQRQIATRMLG
>MT0290 hypothetical protein
MIEDALRRELAAARTGGARPTVPVFDAGTGPRPGIDLTSNTVLSEVLDEG
LELNSRK
>MT3620 hypothetical protein
MARIHRYAARASGLRARLGGARLRLGDHPYAKELASLGLPKRALLSQSAA
NVEMTSATVTRSEPQESEAISPI
>MT2049 hypothetical protein
MVTHELLVKAAGAVLTGLVGVSAYETLRKALGTAPIRRASVTVMEWGLRG
TRRAEAAAESARLTVADVVAEARGRIGEEAPLPAGARVDE
>MT3532.2 hypothetical protein
MPNPVTMLYGRKADLVILPHVLAEERPHPYSTPGRKRGAQIALTTGIDAL
ASFAPQIVNPRHGLSRVVQCLGGCENKRHAYFRSISKTPHIRARGVPSVC
AVRTVGVDGAKRPPKPIPVQ
>MT2222 conserved hypothetical protein
MRAKREAPKSRSSDRRRRADSPAAATRRTTTNSAPSRRIRSRAGKTSAPG
RQARVSRPGPQTSPMLSPFDRPAPAKNTSQAKARAKARKAKAPKLVRPTP
MERLAARLTSIDLRPRTLANKVPFVVLVIGSLGVGLGLTLWLSTDAAERS
YQLSNARERTRMLQQHKEALERDVREAASAPALAEAARRQGMIPTRDTAH
LVQDPDGNWVVVGTPKPADGVPPPPLNTKLPEDPPPPPKPAAVPLEVPVR
VTPGPDDPAPPARSGPEVLVRTPDGTATLGGATHLPTQAGPQLPGPVPIP
GAPGPMPAPPLGAVPSPAPAENPVPLQVGAAPPAGLPGPAPVAATPGLSG
GSQPMVAPPAPVPANGEQFGPVTAPVPTAPGAPR
>MT0910.3 hypothetical protein
MRLAGNVGNIPIPIDCTLRGLTQYSRTNNAEVVQSVETHHRPANFDALT
>MT4023 hypothetical protein
MDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQ
RHWAAGEAMMRQALAQLTAAGQSAHANYTGAMATNLGMWS
>MT0685 hypothetical protein
MAAATTTGTHRGLELRAAQRAVGSCEPQRAEFCRSARNADEFDQMSRMFG
DVYPDVPVPKSVWRWIDSAQHRLARAGAVGALSVVDLLICDTAAARGLVV
LHDDADYELAERHLPDIRVRRVVSADD
>MT1415 lipoprotein, putative
MNGLISQACGSHRPRRPSSLGAVAILIAATLFATVVAGCGKKPTTASSPS
PGSPSPEAQQILQDSSKATKGLHSVHVVVTVNNLSTLPFESVDADVTNQP
QGNGQAVGNAKVRMKPNTPVVATEFLVTNKTMYTKRGGDYVSVGPAEKIY
DPGIILDKDRGLGAVVGQVQNPTIQGRDAIDGLATVKVSGTIDAAVIDPI
VPQLGKGGGRLPITLWIVDTNASTPAPAANLVRMVIDKDQGNVDITLSNW
GAPVTIPNPAG
>MT2802.1 hypothetical protein
MRAKSSQGARALVVAILVFVLLGSFILPHTGSVRGWDVLFSSHGAGRAAV
ALPSRVFAWLALVFGVGFSMLALLTRRWALAWVALAGSAMASGTGLLAVW
SRQTVAAGHPGPGIGLIVAWITAIVLTFHWAQVVWSRTIVQLAAEERRRR
VVAQQQCKTLLDHVQTDSEAGTTPDRGTDR
>MT1995 conserved hypothetical protein
MSEIFCITDHSEPMTARFLSVVLRRIRGMRSDTREEISAALDAYHASLSR
VLDLKCDALTTPELLACLQRLEVERRRQGAAEHALINQLAGQACEEELGG
TLRTALANRLHITPGEASRRIAEAEDLGERRALTGEPLPAQLTATAAAQR
EGKIGREHIKEIQAFFKELSAAVDLGIREAAEAQLAELATSRRPDHLHGL
ATQLMDWLHPDGNFSDQERARKRGITMGKQEFDGMSRISGLLTPELRATI
EAVLAKLAAPGACNPDDQTPVVDDTPDADAVRRDTRSQAQRHHDGLLAGL
RGLLASGELGQHRGLPVTVVVSTTLKELEAATGKGVTGGGSRVPMSDLIR
MASNAHHYLALFDGAKPLALYHTKRLASPAQRIMLYAKDRGCSRPGCDAP
AYHSEVHHVTPWTTTHRTDINDLTLACGPDNRLVEKGWKTRKNAKGDTEW
LPPAHLDHGQPRINRYHHPEKILCEPDDDEPH
>MT1709 hypothetical protein
MNPRVGMAWRAGLWRGSANSGVMPTVGPADHAAGLDRRATPDQLPIWRIG
IISGLVGMLCCVGPTILALVGIISAATAFAWANDLYDNYAWWFRVSGLAV
LAILVWWALRHRNRCSVNAIRRLRWRLMAVLAIAVGTYGVLSAVTTWFGT
FV
>MT1194 hypothetical protein
MPTIWTFVRAAAVLVGSSAALLTGGIAHADPAPAPAPAPNIPQQLISSAA
NAPQILQNLATALGATPPLSAPKVAEPAPAAPGITATFPGLTPAHRRQRG
PRANSVHSRSERPDPGDNPGGTGAPRHRPGGSSDHSRSERPDPGDNRTGP
GGGRGAGLRSRRAVGEGRPTAAALPSAPSAATAFAASRPTGARIGGHSGS
AHRADAAGSRRARVTARAAVVARRAALTHL
>MT2332 hypothetical protein
MTTPPDKARRRFLRDAYKNAERVARTALLTIDQDQLEQLLDYVDERLGEQ
PCDHTARHAQRWAQSHRIEWETLAEGLQEFGGYCDCEIVMNVEPEAIFG
>MT1851 PPE family protein
MDFGVLPPEINSGRMYAGPGSGPMLAAAAAWDGLATELQSTAADYGSVIS
VLTGVWSGQSSGTMAAAAAPYVAWMSATAALAREAAAQASAAAAAYEAAF
AATVPPPVVAANRAELAVLAATNIFGQNTGAIAAAEARYAEMWAQDAAAM
YGYAGSSSVATQVTPFAAPPPTTNAAGLATQGVAVAQAVGASAGNARSLV
SEVLEFLATAGTNYNKTVASLMNAVTGVPYASSVYNSMLGLGFAESKMVL
PANDTVISTIFGMVQFQKFFNPVTPFNPDLIPKSALGAGLGLRSAISSGL
GSTAPAISAGASQAGSVGGMSVPPSWAAATPAIRTVAAVFSSTGLQAVPA
AAISEGSLLSQMALASVAGGALGGAAARATGGFLGGGRVTAVKKSLKDSD
LPDKLRRVVAHMMEKPESVQHWHTDEDGLDDLLAELKKKPGIHAVHMAGG
NKAEIAPTISESG
>MT0515 conserved hypothetical protein
MWRPAQGARWHVPAVLGYGGIPRRASWSNVESVANSRRRPVHPGQEVELD
FAREWVEFYDPDNPEHLIAADLTWLLSRWACVFGTPACQGTVAGRPNDGC
CSHGAFLSDDDDRTRLADAVHKLTDDDWQFRAKGLRRKGYLELDEHDGQP
QHRTRKHKGACIFLNRPGFAGGAGCALHSKALKLGVPPLTMKPDVCWQLP
IRRSQEWVTRPDGTEILKTTLTEYDRRGWGSGGADLHWYCTGDPAAHVGT
KQVWQSLADELTELLGEKAYGELAAMCKRRSQLGLIAVHPATRAAQ
>MT2779 hypothetical protein
MNATLTSPELTRADRCDRCGAAARVRAKLPSGAELLFCQHHANEHEAKLT
EMSAVLEVSGSE
>MT1855 PE family protein
MAFVLVCPDALAIAAGQLRHVGSVIAARNAVAAPATAELAPAAADEVSAL
TATQFNFHAAMYQAVGAQAIAMNEAFVAMLGASADSYAATEAANIIAVS
>MT1888 PE_PGRS family protein
MSFVVAAPEVVVAAASDLAGIGSAIGAANAAAAVPTMGVLAAGADEVSAA
VADLFGAHAQAYQALSAQAALFHEQFVHAMTAGAGAYAGAEAADAAALDV
LNGPFQALFGRPLIGDGANGAPGQPGGPGGLLYGNGGNGGNGGIGQPGGA
GGDAGLIGNGGNGGIGGPGATGLAGGAGGVGGLLFGDGGNGGAGGLGTGP
VGATGGIGGPGGAAVGLFGHGGAGGAGGLGKAGFAGGAGGTGGTGGLLYG
NGGNGGNVPSGAADGGAGGDARLIGNGGDGGSVGAAPTGIGNGGNGGNGG
WLYGDGGSGGSTLQGFSDGGTGGNAGMFGDGGNGGFSFFDGNGGDGGTGG
TLIGNGGDGGNSVQTDGFLRGHGGDGGNAVGLIGNGGAGGAGSAGTGVFA
PGGGSGGNGGNGALLVGNGGAGGSGGPTQIPSVAVPVTGAGGTGGNGGTA
GLIGNGGNGGAAGVSGDGTPGTGGNGGYAQLIGDGGDGGPGDSGGPGGSG
GTGGTLAGQNGSPGG
>MT2561 PE_PGRS family protein
MSLVIATPQLLATAALDLASIGSQVSAANAAAAMPTTEVVAAAADEVSAA
IAGLFGAHARQYQALSVQVAAFHEQFVQALTAAAGRYASTEAAVERSLLG
AVNAPTEALLGRPLIGNGADGTAPGQPGAAGGLLFGNGGNGAAGGFGQTG
GSGGAAGLIGNGGNGGAGGTGAAGGAGGNGGWLWGNGGNGGVGGTSVAAG
IGGAGGNGGNAGLFGHGGAGGTGGAGLAGANGVNPTPGPAASTGDSPADV
SGIGDQTGGDGGTGGHGTAGTPTGGTGGDGATATAGSGKATGGAGGDGGT
AAAGGGGGNGGDGGVAQGDIASAFGGDGGNGSDGVAAGSGGGSGGAGGGA
FVHIATATSTGGSGGFGGNGAASAASGADGGAGGAGGNGGAGGLLFGDGG
NGGAGGAGGIGGDGATGGPGGSGGNAGIARFDSPDPEAEPDVVGGKGGDG
GKGGSGLGVGGAGGTGGAGGNGGAGGLLFGNGGNGGNAGAGGDGGAGVAG
GVGGNGGGGGTATFHEDPVAGVWAVGGVGGDGGSGGSSLGVGGVGGAGGV
GGKGGASGMLIGNGGNGGSGGVGGAGGVGGAGGDGGNGGSGGNASTFGDE
NSIGGAGGTGGNGGNGANGGNGGAGGIAGGAGGSGGFLSGAAGVSGADGI
GGAGGAGGAGGAGGSGGEAGAGGLTNGPGSPGVSGTEGMAGAPG
>MT3121 immunogenic protein MPB64/MPT64 precursor
MRYLIATAVLVAVVLVGWPAAGAPPSCAGLGGTVQAGQICHVHASGPKYM
LDMTFPVDYPDQQALTDYITQNRDGFVNVAQGSPLRDQPYQMDATSEQHS
SGQPPQATRSVVLKFFQDLGGAHPSTWYKAFNYNLATSQPITFDTLFVPG
TTPLDSIYPIVQRELARQTGFGAAILPSTGLDPAHYQNFAITDDSLIFYF
AQGELLPSFVGACQAQVPRSAIPPLAI
>MT2140 hypothetical protein
MQLRHINIRALIAEAGGDPWAIEHSLHAGRPAQIAELAEAFHAAGRCTAE
ANAAFEEARRRFEASWNRENGEHPINDSAEVQRVTAALGVQSLQLPKIGV
DLENIAADLAEAQRAAAGRIATLESQLQRIDDQLDQALELEHDPRLAAAE
RSELDALITCLEQDAIDDTASALGQLQSIRAGYSDHLQQSLAMLRADGYD
GAGLQGLDAPQSPVKPEEPIQIPPPGTGAPEVHRWWTSLTSEERQRLIAE
HPEQIGNLNGVPVSARSDANIAVMTRDLNRVRDIATRYRTSVDDVLGDPA
KYGLSAGDITRYRNADETKKGLDHNARNDPRNPSPVYLFAYDPMAFGGKG
RAAIAIGNPDTAKHTAVIVPGTSSSVKGGWLHDNHDDALNLFNQAKAADP
NNPTAVIAWMGYDAPNDFTDPRIATPMLARIGGAALAEDVNGLWVTHLGV
GQNVTVLGHSYGSTTVADAFALGGMHANDAVLLGCPGTDLAHSAASFHLD
GGRVYVGAASTDPISMLGQLDSLSQYVNRGNLAGQLQGLAVGLGTDPAGD
GFGSVRFRAEVPNSDGINPHDHSYYYHRGSEALRSMADIASGHGDALASD
GMLAQPRHQPGVEIDIPGLGSVEIDIPGTPASIDPEWSRPPGSITDDHVF
DAPLHR
>MT0974 hypothetical protein
MGGSFDRAARARRQLDNLVNVVAAGSTHRLMVPSRSMHRLIKVEFQGGGP
HAWYLSDGILARDDYNGRDIHLPVFG
>MT2601.1 hypothetical protein
MPQGTTKTTTVTLVSVVTDASHWQNTCMRPYRHRCGLGQAASPCDHYYGV
IAYAPNGAMGKIVAPPHSRPGGYRRIRTLRRLSCKVLSNFTNYHGGVRRS
RPLAEPGRATS
>MT0407.1 hypothetical protein
MHALRLVGLAILTAIAPIAVLIGSSPAHADTDIGQPCSPEGAKLWGNPGP
IYCERTADGQLQWVSIPAWALCVAFCDRPGGP
>MT1148 hypothetical protein
MGALGTVRGLQDSNTAFVGALHSGNLLGATGAVLQAPGNAVNGFLFGQTS
ISQSIDVSPEYGYELVAVSDPVGGTAGSARAGHGYVHADLR
>MT2588 hypothetical protein
MLAAFRSHDAVLREFEKLGRYHQSTGHGCLCGKRNCATLSIIDSNQIYGH
IDRMNRRDELG
>MT2027 hypothetical protein
MIRAPAIEDASDALLELPVSHAVNLRCPAQIWQPRNSAVR
>MT3449.2 hypothetical protein
MDRLLALPHRGQPRQDRSAMAPPEQAKGTRF
>MT0859 hypothetical protein
MIGISYFVMTQLEAAVERRSRRSSRCGDRRPLDGANHHGLLGSSFLAPAR
PDLQAQRQALAQ
>MT3573.13 hypothetical protein
MGPMGPKPESERRSTKTDTAIGAALGISAGTYRRLKRIDNATRSELAAWA
ARHP
>MT0328.2 hypothetical protein
MIVVWEHLCMNPEDDPEARIRELERPLADVARASELGGSQSGGYTYPPGP
PPPPYSYGGPFGGPSPRSSSGNRAWWILAAVVVVGVLVLVGGIAAFSAQR
LSQGNFVVLSPTPSVSRAVPTPTAQPATTLPPAGAS
>MT0292 PPE family protein
MGRRRAFAGVRLGLALRRAVVWNSGNENHVHQSSVVFTAMTLWMASPPEV
HSALLSSGPGPGSVLSAAGVWSSLSAEYAAVADELIGLLGAVQTGAWQGP
SAAAYVAAHAPYLAWLMRASETSAEAAARHETVAAAYTTAVAAMPTLVEL
AANHTLHGVLVATNFFGINTIPIALNEADYARMWTQAASTMATYQAVAEA
AVASAPQTTPAPPILAAEAADDDHDHDHDHGGEPTPLDYLVAEILRIISG
GRLIWDPAEGTMNGIPFEDYTDAAQPIWWVVRAIEFSKDFETFVQELFVN
PVEAFQFYFELLLFDYPTHIVQIVEALSQSPQLLAVALGSVISNLGAVTG
FAGLSGLAGMQPAAIPALAPVAAAPPTLPAVAMAPTMAAPGAAVASAAAP
ASAPAASTVASATPAPPPAPGAAGFGYPYAIAPPGIGFGSGMSASASAQR
KAPQPDSAAAAAAAAAVRDQARARRRRRVTRRGYGDEFMDMNIDVDPDWG
PPPGEDPVTSTVASDRGAGHLGFAGTARREAVADAAGMTTLAGDDFGDGP
TTPMVPGSWDPDRDAPGSAEPGDRG
>MT2401.1 hypothetical protein
MCVSRGGMLERAKFPRRLFCRNEYILFSKVSSLHATRCDSAIDAKPGLRR
LIRAHLDTIHRVTPCLNRRSGLPAARGIKYIEMAQADERPVVGG
>MT1792 conserved hypothetical protein
MLRAVNEIRQHDGTLKLGKGVGMFTIVGVIVALIGAFVQSRRHRHRPAAD
IHMLWWMVLIVGVVSIIGAGYHVFDGERTAELIGYTRGDGGFQWENAMGD
LAIGVVGLMAYRFRGHFWLATIVVLTIQYVGDAAGHIYYWVVENNTNPYN
IGVPLWTDILLPIVMWALYAWSWHSNGDAVPKGQP
>MT2160.1 hypothetical protein
MPAAPSTREKDCMLVLHGFWSNSGGMRLWAEDSDLLVKSPSQALRSGAAT
PVRGAR
>MT0477 hypothetical protein
MVCWLRSRWRPVADNDYRSAPGTEPFVPDFDTGAHSQRFLSLAGQQDRAG
KSWPGSTPKPQEDPVGVAPSASVEVLGSEPAATLAHSVTVPGRYTYLKWW
KFVLVVLGVWIGAGEVGLSLFYWWYHTLDKTAAVFVVLVYVVACTVGGLI
LALVPGRPLITALSLGVMSGPFASVAAAAPLYGYYYCERMSHCLVGVIPY
>MT0302 hypothetical protein
MDATPNAVELTVDNAWFIAETIGAGTFPWVLAITMPYSDAAQRGAFVDRQ
RDELTRMGLLSPQGVINPAVADWIKVVCFPDRWLDLRYVGPASADGACEL
LRGIVALRTGTGKTSNKTGNGVVALRNAQLVTFTAMDIDDPRALVPILGV
GLAHRPPARFDEFSLPTRVGARADERLRSGVPLGEVVDYLGIPASARPVV
ESVFSGPRSYVEIVAGCNRDGRHTTTEVGLSIVDTSAGRVLVSPSRAFDG
EWVSTFSPGTPFAIAVAIQTLTACLPDGQWFPGQRVSRDFSTQSS
>MT2325 hypothetical protein
MWAPVIWIATLPQRPVGQQMVPMLATNGTSGAVRQPGSSNPSG
>MT1057.1 hypothetical protein
MAMTTVDNIVGLVIAVALMAFLFAALLFPEKF
>MT0856.1 hypothetical protein
MKAATSWMQQQVLATATVVRWRLAWLSKPETWRRRKRRQSQKIRRSVQKQ
RRSTVWDNQRKTFGVVISPARLTAWIAASALILLTGTELFGTALSWVKLS
PIWHVFSFAVDTRPIATLLDYASHPIDPRSYSSVAVTAVGAEATLLAFFF
ATVGVVASTALNTLRSSPPLLLSSPRLTTGCDR
>MT0998 hypothetical protein
MIHDLMLRWVVTGLFVLTAAECGLAIIAKRRPWTLIVNHGLHFAMAVAMA
VMAWPWGARVPTTGPAVFFLLAAVWFGATAVVAVRGTATRGLYGYHGLMM
LATAWMYAAMNPRLLPVRSCTEYATEPDGSMPAMDMTAMNMPPNSGSPIW
FSAVNWIGTVGFAVAAVFWACRFVMERRQEATQSRLPGSIGQAMMAAGMA
MLFFAMLFPV
>MT2236 conserved hypothetical protein
MSAWRAPEVGSRLGRRVLWCLLWLLAGVALGYVAWRLFGHTPYRIDIDIY
QMGARAWLDGRPLYGGGVLFHTPIGLNLPFTYPPLAAVLFSPFAWLQMPA
ASVAITVLTLVLLIASTAIVLTGLDAWPTSRLVPAPARLRRLWLAVLIVA
PATIWLEPISSNFAFGQINVVLMTLVIVDCFPRRTPWPRGLMLGLGIALK
LTPAVFLLYFLLRRDGRAALTALASFAVATLLGFVLAWRDSWEYWTHTLH
HTDRIGAAALNTDQNIAGALARLTIGDDERFALWVAGSLLVLAATIWAMR
RVLRAGEPTLAVICVALFGLVVSPVSWSHHWVWMLPAVLVIGLLGWRRRN
VALAMLSLAGVVLMRWTPIDLLPQHRETTAVWWRQLAGMSYVWWALAVIV
VAGLTVTARMTPQRSLTRGLTPAPTAS
>MT1462 hypothetical protein
MGELRLVGGVLRVLVVVGAVFDVAVLNAGAASADGPVQLKSRLGDVCLDA
PSGSWFSPLVINPCNGTDFQRWNLTDDRQVESVAFPGECVNIGNALWARL
QPCVNWISQHWTVQPDGLVKSDLDACLTVLGGPDPGTWVSTRWCDPNAPD
QQWDSVP
>MT3955 IS1608', transposase
MTAENPGRSRRTLVGIDAAITACHHIAIRDDVGARSIRFSVEPTLAGLRT
LTDKLSGYDDIDATVEPTSMTWLPLTIAVENAGDTMHMAGARHCARLRGA
IVGKSKSDVIDAEVLTRASEVFDLTPLTLPTPAQLALRRSVIRRAGAVID
ANRSWRRLMSLAR
>MT0655 hypothetical protein
MATESDPNDLHTGRTVEGGEVVVLGDDRQRSRCRDGRDPQVVDPHPATGL
GKMDPQSGPHSGGIVVDGQRFHIGNGFQGRQALSPDVGRRRGQHADAQFG
EGDDGGGDPVGDQRLVELPAAFGRDEHRRVEHSGGRRRAHRIGPRSSVVS
PARTARSLRSPGSA
>MT3211 hypothetical protein
MVIRFDQIGSLVLSMKSLASLSFQRCLRENSSLVAALDRLDAAVDELSAL
SFDALTTPERDRARRDRDHHPWSRSRSQLSPRMAHGAVHQCQWPKAVWAV
IDNP
>MT1083.1 hypothetical protein
MVRYRTGPGGGDDVNSTSVLHRSRSPSRYATRSTPSRTAGCTPMIGPVGN
WVLLFMQTLCESIRSPMEMGAPQGFSLIWCTRRFRIPPSCSMTGSSINCA
YVELLRGYDRDRDIAALAAFIGVRPIET
>MT0992.1 hypothetical protein
MLLDHHAGACEAVARAAEKAAEEVAAIKMRLQVIRDAAREHHLTIAYATG
TALPPPDLSSYSPADQQAILNTAIRRASNVCWPTPRPPMRIWPRRFDAPP
GTCRASRSMPNSAMRHPQCRRCRRRTATLRRSSGGGIR
>MT2734.1 hypothetical protein
MMMADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQW
GRHIEWKLECRACRKYAPISEMTAAAILDGFGAKLHELRTSTIPDADDPS
IAEARHVIPFSALCLRLSQLGG
>MT0921.1 hypothetical protein
MGKGRKPTDSETLAHIRDLVAEEKALRAQLRHGGISESEEQQQLRRIEIE
LDQCWDLLRQRRALRQTGGDPREAVVRPADQVEGYTG
>MT3267 conserved hypothetical protein
MPCSFSSREAISHDHQSHAQTSWIRRSSPPFRGIGRAHPWIYHRTGGRVG
CKLRLGAGFRKPVPTLLLEHRSRKSGKNFVAPLLYITDRNNVIVVASALG
QAENPQWYRNLPPNPDTHIQIGSDRRPVRAVVASSDERARLWPRPVDAYA
DFDSCQSWTERGIPVIILRPR
>MT3770 hypothetical protein
MARGKCGARLFDRRNGLPRSPDAPDFQETQEGQHGGDHDHYGEHEQESHP
APDPGIEQFGDEKEEKEGGVEPDHQRGDEENTAGQSLLDVPGDLGAGQLD
LGSDQRRHLRCRVFDQVTDRRLSRSGVRVGQRNRGQRAGHTVLAIDLAHR
RSSRFDCSARPRQRRYPTLGRHPARIVAELGSIARLLLGPSSGPHRRRAV
RLASRFMLPCGAVPYEASQHQGRRDRVSHVLTCAAV
>MT2481 PE family protein
MADATPRWQYVQRDRLIADLRRNRGDRRHAAGATPTGPRFPLLFGGESLT
PWTAPSRGCSRWCSRPDFTETQAVISEGNYSPCKAFPWRHTDSRLVLIAR
PDILCSRGPEAMRAKAADLDLAAAAKTVGVQPAADQVAAAIAAILLSHAQ
IYQDISTQMAAFHDQLVENRTADSTSYASAEANAQQSLLNAMDAPSWQQR
RETVGEVGLPADPAGSGTATAAVAAATTARAGSRSAAQATVAPIGGLKLR
RESALSQPGDLHHHVEVGDALPRVDPFQRGNVGVVAAYTHTDVLLGDLIV
IGGVVVPPSTGPGLNPGMAAPVYRLSHHGITLRV
>MT0291.4 hypothetical protein
MPPLLPPAPPVPPAPNSPPCPPAPPSAPSVPAPPVPPAPKSNPFPPVPPF
APNPPAPPAPPLANSPPVPPAPAVPPAPIKFWERAAWPPVPAAPRNKPAS
PPRPPAPPVSRPNPPLPPVPPEPISKAAPPVPPVPPWPVVPMPPDPPVPP
IPDRIPPAPPDPPSPPSAPVAPWPPLPPLPNNHPPAPPSAPVPGVPLAPL
PISGRPVRAWVGSLIALRICCCRVCSGVLAGALNPSRPSSCPPKPPAPAV
PAGAPVPPLPPLPPLPISTPLPPAPPLPPLPALPTSPGAPPAPPVPPAPA
KDPPPAPPAPPAPLSRPAFPPAPPAPPASKPSPPTPPAPPEPNNVPPSPP
IPPAPPPPSGLDPPLPPAPPAAPRLSMPASPPAPPFPPTLIMLVPPLPPV
PPAPNSPPEPPSPPAPPPKMPNPPGPPVPPAPNSPPFPPDPPAPPVPASV
APPAPPTPPSANSPPFPPAPPAPPVAPKAAANPPGPPTPAAPNSMPAAPP
APPAPPVPVLALPPAPPAPPLPMSPPAPPLPPAPPLTPAAPDPPAPPLPI
NQPPSPPLAPVPGAPLVPLPISGRPVFARKNSLIGASSGDTAAASAAA
>MT3986 hypothetical protein
MCTGYKYREGRSRQMEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVT
GLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQDVARTY
SQIDDGAAGVFA
>MT2411 conserved hypothetical protein
MTINYQFGDVDAHGAMIRAQAGLLEAEHQAIVRDVLAAGDFWGGAGSVAC
QEFITQLGRNFQVIYEQANAHGQKVQAAGNNMAQTDSAVGSSWA
>MT3810 conserved hypothetical protein
MLRIGPTAGTGTPTGDYGIGATDLCEFVEFPSQLLQVCGDSFAGQGVGFG
GWYAPVALHVDTESIDDPAGVRYTGVTGVGTPLLADPTPPGDSQLPAGVV
QINRRNYLMVTTTKDLQPQNSRLVRAEAARGGWQTVSGSRRNAAYQDGRQ
TQISGYYDPVPTPDSPTGWVYIVADSFTRGEPAVLYRATPESFTDRSRWQ
GWAGGPDGGWNKPPTPLWPDQLGEMSIRQIDGQTVLSYFNASTGNMEVRV
AHHPTSLGAAPVTTVVRHDEWPEPAESLPPPYDNRLAQPYGGYISPGSTI
DELRIFVSQWDTRARQNGPYRVIQFAVNPFKPWSDP
>MT0694 hypothetical protein
MHGGLIDSTASIWSYCPRPDSQPWQAATRAFSPEATKGAMRALWVSQRED
LTADAKRVNLLGSMRRMWPKEVEIAS
>MT3856 hypothetical protein
MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFAFGYNDL
IAAMNNHYKDRHVLAAAVRERAEVIVTTNLKHFPDDALKPYQIKALHPDD
FLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKAR
RLFPSGSPFGLGVLLPFDQ
>MT0719.1 hypothetical protein
MRHHIRPSISALDAILCPDRRIAVETCWRKAIQMDYETDTDTELVTETLV
EEVSIDGMCGVY
>MT0219 hypothetical protein
MRGQGHQIFVDELARFATSSADQRVVAIAQRAAEPLRVAVRGRPGVGCRT
VARALQGAGSSSGMTVTPQARAADSDVDLVVYVTVEVVKPEDREAIAATR
RPVVAVLNKADLAGPLSGAGPIVMAQARCAQFSTLLGVPMESMIGLLAVA
ALDDLDDTLRAVLRALAAHPDGFDALDRAVAGFLAAALPVPTEVRLRLLD
TLDLFGIALGMAAFRPGRPSRTPAQLRTLLRRVSGVDAVIDKVTAAGSEV
RYRRLLDAVAELEALAAQAKEIGGPIGEFLRDDDTVLARMAAAVDVALAV
GLDVGPLDDPAAHLPRAVRWHRYSLDNGDMHRTCGADIARGSLRLWSLAG
GMPLHRYRKSS
>MT0229 hypothetical protein
MFDIATRFKNSYGSGPLHLLAMVSGFALLGYIVATARPSALWNQATWWQS
IAVWFVAAVVAHDLLLYPLYALADRILARLVGRRDVSAPRRRPELPVRNY
IRIPALAAGLTLLVFLPGIIRQGAPTYLDATGQTQEPFLGRWLLLTAVAF
GISAAAYAIRLVVAHVRRRRAGCSRVDAIDEE
>MT3597 hypothetical protein
MAADTGVAGGQQSTTRRARRKASRPAGPAEGESSRPAQGAATVRAAARTE
SKPAKAAKPALRPVKPPPRRPAHRVLVGWLSLAAGLLAIAALAWGVTALV
MQNRDADARQARNQRFVDAATQTVVNMFSYTPDTIDESVNRFVNGTSGPL
RGMLNANNNVDNLKGLFRATNATSEAVVNGAALEGIDEISDNASVLVSVR
VTVADIDGVNKPSMPYRLRVIVHEDENGRMTGYDLKYPDGGN
>MT2015 hypothetical protein
MSHPVTGAFEVKNERGEAEAMLLAGFVIPVRKKGVRADGIRRPTR
>MT3693 hypothetical protein
MTWPYGVIVLDLEPRGPLPTEIYWRRRGLALGIAVVVVGIAVAIVIAFVD
SSAGAKPVSADKPASAQSHPGSPAPQAPQPAGQTEGNAAAAPPQGQNPET
PTPTAAVQPPPVLKEGDDCPDSTLAVKGLTNAPQYYVGDQPKFTMVVTNI
GLVSCKRDVGAAVLAAYVYSLDNKRLWSNLDCAPSNETLVKTFSPGEQVT
TAVTWTGMGSAPRCPLPRPAIGPGTYNLVVQLGNLRSLPVPFILNQPPPP
PGPVPAPGPAQAPPPESPAQGG
>MT2616 hypothetical protein
MKTDRLADIAAIQEALRVPDSHLIYVARPDDPADMIPAVTAVGDPFTADH
VSVTVPGVSGTTRQTIATMTQEARGLREEARVIAHSVGESENVATIAWVG
YQPPPVLASWNTVDDDLAQAGAPKLEAFLRDLQAGSHNPGHTTALFGHSY
GSLLSGIALKDGASSLVDNAVLYGSPGFDATSPAKLGMNDHNFFVMTTPD
DPIRYPARLAPLHGWGSDGADTIGTVGRQGTPARVGIRPQRDHRRIPGPL
PLHPSADRRGIHSAG
>MT0854 PE_PGRS family protein
MASRSCRCGRRCRVSYVSVLPATLATAATEVARIGSALSLASAVAAAQTS
AVQAAAADEVSAAIAALFSAHGRDFQALSARAAAFHHEFVQALAAGAGSY
AVAEIAAASPLQSLIDVFNAPIQAATGRPLIGNGANGQPGTGAPGGPAGG
>MT2785 hypothetical protein
MTKYRGQFELNRPATLIAALPAILGFVPEKSLVLVSLAAGELGSVMRADL
CDELADRVGHLAELVAAANPAAAIAVIVDANGAQCPRCNEEYRQLCAALA
AALSQRDIVLWAAHVVDRVAAGGRWHCVDGCGCSGVIDDPSASPLAMAAV
LDGRQLYPRRSDLQAVIAVDDPVRSAELAVALGHQAADREIAHRADSVGC
SRQDVENALAAAARVADGQSLSDTELARLGCALGDARVRDMLYALAVGEN
AGAAESLWALLARVLPEPWRVEALVLLAFSAYARGDGPLAGVSLQAALCC
EPGHRMAGMLDTALQSGLRPEHIRDIAVTGYQRAEQLGIRLPPRRAFGQR
AG
>MT3858 hypothetical protein
MTATASRSASKRSAYVSRVTFADLCPRIRCSAKTLTPADTAKLAAVCRKS
CGVICWTFARFTAPQIRPRVGFGRGRRAPVSSSDQLVRSAGLDVTVLGGV
>MT4000 hypothetical protein
MTVESDNVIDVVELAPLLRHPLDLELDSISVVTFGSRTGTVGDYPRVYDA
EIGTPPYAGRRETWLIMRLPVIGNTQALRWRTSVGAAAISVAQRVASSLR
CQGLRAKLATATDLAELDRRLGSDAVAGSAQRWKAIRGEAGWMTTYAYPA
EAISSRVLSQAWTLRADEVIQNVTVYPDATCTATITVRTPTPAPTPPSVI
LRRLNGEQAAAAAANMCGPRPHLRGQRRCPLPAQLVTEIGPSGVLIGKLS
NGDRLMIPVTDAGELSRVFVAADDTIAKRIVIRVVGAGERVCVHTRDQER
WASVRMPQLSIVGTPRPAPRTTVGVVEYVRRRKNGDDGKSEGSGVDVAIS
PTPRPASVITIARPGTSLSESDRHGFEVTIEQIDRATVKVGAAGQNWLVE
MEMFRAENRYVSLEPVTMSIGR
>MT2558 hypothetical protein
MAVPTRLPNTRFRRSWPRHRCCGWIMARIGFHHRVIATSVVLALWCVAVA
GLKQEV
>MT1791 hypothetical protein
MPGGVCSGRPWGRPWWHPGLVGLLIRLAELLVVMLPLIGVLYVGIKALSS
FTRRLGEASGDLASDSPAMPRPTTVENDAARWRAITRAVEAHERTDARWL
EYELDAAKLLDFPVMTDMRDPLTTAFHKAKLQADFHKPLRAEDLLDDPDA
AGHYLDAVRDYVTAFDTAEAEAMRRRRTGFSREEQQRLARAQSLLRVASD
AGATAQERERAYRLARTELDGLIVLPDRTRAGIERGIAGELDD
>MT3288 hypothetical protein
MSARSVAPSQVMRRAASALYSLNPAMPVLLRPDGAVQVGWDPRRAVLVRP
PRGLTATGLAALLRSMRSPIPITELQRQAAERGLVDGDAMANLVAQLVGA
GVATPLANPGNLDSRRRAASIRVHGRGPLSDLLVQALRCSGARIRHSSQP
HAAVTPAGVDLVVLSDYLVADPHMVRDLHTERVPHLPVRVRDGTGMVGPL
VVPGVTSCLGCADLHRSDRDAAWPAIAAQLRDTVGVADRATLLATAALAL
SQVNRVIAAVRGQEATPEPPSALNTTLEFDLNAGSIVARQWTRHPRCFC
>MT3885 hypothetical protein
MRRRTRALSTSQRGRCRPAILERMFEISLSDPVELRDADDAALLAAIEDC
ARAEVAAGARRLSAIAELTSRRTGNDQRADWACDGWDCAAAEVAAALTVS
HRKASGQMHLSLTLNRLPQVAALFLAGQLSARLVSIIAWRTYLVRDPEAL
SLLDAALAKHATAWGPLSAPKLEKAIDSWIDRYDPAALRRTRISARSRDL
CIGDPDEDAGTAALWGRLFATDAAMLDKRLTQLAHGVCDDDPRTIAQRRA
DALGALAAGADRLTCGCGNSDCPSSAGNHRQATGVVIHVVADAAALGAAP
DPRLSGPEPALAPEAPATPAVKPPAALISGGGVVPAPLLAELIRGGAALS
RVRHPGDLRSEPHYRPSAKLAEFVRIRDMTCRFPGCDQPTEFCDIDHTLP
YPLGPTHPSNLKCLCRKHHLLKTFWTGWRDVQLPDGTIIWTAPNGHTYTT
HPDSRIFLPSWHTTTAALPPAPSPPAIGPTHTLLMPRRRRTRAAELAHRI
KRERAHVTQRNKPPPSGGDTAVAEGFEPPDGVSRLSLSRRVH
>MT3358 WhiB-related protein
MSYEHLRGVMGGTPHTTTGSATASATAVLRPHLSLVPEAPAPFEEPLPPE
ATDQWQDRALCAQTDPEAFFPEKGGSTREAKKICMGCEVRHECLEYALAH
DERFGIWGGLSERERRRLKRGII
>MT3328 hypothetical protein
MTQVYIPATLAMLQRLVADGALWPVNGTAFAVTPTLRESYAEGDDEELAE
VALREAALASLRLLAADIGATADALPPRRAVLAAEVDDATYRPDLDDAVV
RLAGPITIDQVVAAYVDNAGAEPAVMAAIAVIDAADLGDEDAELVVGDAQ
DHDLAWYANQELPFLLDLL
>MT0431 hypothetical protein
MRAARYRSDCPGCGSLRIQCSLIHADLLAAADYNVVGLAAAVLSVWAYLA
>MT4016 hypothetical protein
MVTGQPAAAGAHSLSEGAMTAMQSGSVPPPQATPPITTPPVVSAPTMAAG
IEATHGPVDTPANTSGAPPASTGTTGPVAPTVVTAGPVAVPAAPVVGGSA
VPAGPLPAYGSDLRPPVVAAPAVPSVPTAPVSGAPVAPSASSAPSAGGAL
VSPVERAASKAVAGQAGASSSTMAGASALSATAGATAGAVSARAAEQQRL
QRIVDAVARQEPRISWAAGLRDDGTTTLWSTDLAGGWIPPHVRLPANVTL
LEPTARRRDADVIDLLGAVVAVAAHESNTYVAEPGPDAPALTGDRSARSA
IPKVDEFGPTLVEAVRRRDSLPRIAQAIALPAVRKTGVLENEAELLHGCI
TAVKESVLKAYPSHELTAVGDWMLLAAIEALIDEQDYLANYHLAWYAVTT
RRGGSRGFAA
>MT2807 hypothetical protein
MSSPATSGGHCAPTSTFPTIGGEVAHSSKRAIAALASGGYPVEQAAAATP
KGQIRAGEHAGSGQASGVEQIVGDPDQPRTQVGPHRRQGVRRADERGRDV
ATRGVSHLNRTGVRLLW
>MT1272 hypothetical protein
MTSPFQPRQVPGSTPAAAGAGRRGVPALPTPPKGWPVGSYPTYAEAQRAV
DYLSEQQFPVQQVTIVGVDLMQVERVTGRLTWPKVLGGGVLSGAWLGLFI
GLVLGFFSPNPWSALVTGLVAGVFFGLITSAVPYAMARGTRDFSSTMQLV
AGRYDVLCDPQNAEKARDLLARLAI
>MT1120 PE family protein
MTTASATASSTGVDGGIAATYAVASQWDGGYVANYTITQFGRDFDDRLAV
AIHFA
>MT1418 sulfotransferase, putative
MTDRVVYRSLMADNLRWDALQLRDGDIIISAPSKSGLTWTQRLVSLLVFD
GPDLPGPLSTVSPWLDQTIRPIEEVVATLDAQQHRRFIKTHTPLDGLVLD
DRVSYICVGRDPRDAAVSMLYQSANMNEDRMRILHEAVVPFHERIAPPFA
ELGHARSPTEEFRDWMEGPNPPPPGIGFTHLKGIGTLANILHQLGTVWVR
RHLPNVALFHYADYQADLAGELLRPARVLGIAATRDRARDLAQYATLDAM
RSRASEIAPNTTDGIWHSDERFFRRGGSGDWQQFFTEAEHLRYYHRINQL
APPDLLAWAHEGRRGYDPAN
>MT0599 hypothetical protein
MGEHAIKRHMRQRKPTKHPLAQKRGARILVLTDDPRRSVLIVPGCHLDSM
RREKNAYYFQDGNALVGMVVSGGTVEYDADDRTYVVQLTDGRHTTESSFE
HSSPSRSPQSDDL
>MT3182 hypothetical protein
MAHLQMRRPSQTKVSCHGVFHLGHLVTVQIVDRIAVHIDDADRPDLIHQE
PCLCSHDFQLWSENRRLGACRRRHHGYYTPRHRAGADDHGVAPPALLVAS
LRIAEVDPVDRSPNHHASGSVETSSSRSRSASVRACLIHTSRSSSCSARR
MTSLLRSPLRIAALMICSSFSVGRKPMVAVMSTTIADVAQSYSNCSTHSG
TPTPAFAASFLLDAINAPRVIAGRFASESVRFPAAAPHGSVPSRLPV
>MT1394 hypothetical protein
MTPRSLPRYGNSSRRKSFPMHRPSNVATATRKKSSIGWVLLACSVAGCKG
IDTTEFILGRAGAFELAVRAAQHRHRYLTMVNVGRAPPRRCRTVCMAATD
TPRNIRLNG
>MT0535 hypothetical protein
MIARYRAGAELFLACAALAGSAASWSRTRSTVAVAPVIDGQPVTLSVVYH
PQPLVLTLLLATIAGVLSVVGTARLRRARAGLNAHPDGLNQRPPGGWCH
>MT0479 conserved hypothetical protein
MTRRASTDTPQIIMGAIGGVVTGYILWLAAISVGDGLTTVSQWSRVVLLL
SVLVAVCGAAGGLRLRSRGKLAWSAFAFSLPIPPVVLTVAVLADIYL
>MT1183 conserved hypothetical protein
MHAAFGGGSRYGAAVFAVSETFCLTDHSEPMTARFLSVVLRRIRGMRSDT
REEISAALDAYHASLSRVLDLKCDALTTPELLACLQRLEVERRRQGAAEH
ALINQLAGQACEEELGGTLRTALANRLHITPGEASRRIAEAEDLGERRAL
TGEPLPAQLTATAAAQREGKIGREHIKEIQAFFKELSAAVDLGIREAAEA
QLAELATSRRPDHLHGLATQLMDWLHPDGNFSDQERARKRGITMGKQEFD
GMSRISGLLTPELRATIEAVLAKLAAPGACNPDDQTPLVADTPDADAVRR
DTRSQAQRNHDAFLAALRGLLASGELGQHKGLPVTIVVSTTLKELEAATG
KGVTGGGSRVPMSDLIRMASHANHYLALFDGAKPLALYHTKRLASPAQRI
MLYAKDRGCSRPGCDAPAYHSEVHHVTPWTTTHRTDINDLTLACGPDNRL
VEKGWKTRKNAHGDTEWLPPPHLDHGQPRINRYHHPAKILCEQDDDEPH
>MT1040.1 hypothetical protein
MPVVAVRALWNIFSVNLLKATSSVPVLSPRSGFSWSAYRMPHRDRRAHTV
GERLGLGSVMSPRPNTADYLKQLLSSCQTLPRFEGDVTAWSQPLPYSPR
>MT3305 hypothetical protein
MVALGAVATAVIINSGDSTSTKAIVGAPAPRTVISTSPRPTAPTSTSPHP
SPSTLRPQLPPETVTTVAPPGTGPTTVPTRTPTAAPPQTAVPPPAPLNPR
TVVYRVTGTKQLFDLVNVVYTDARGFPVTDFNVSLPWTKMVVLNPGVQTE
SVVATSLYSRLNCSIVNTGAQTVVASTNNAIIATCTR
>MT4018 hypothetical protein
MQAANRRSADTICGVTAPAPLPIPRTRSWPAIVVATIAAVVAVAALIVAL
TNARPAATPATTSVPTYTAAQTAAAQRQLCDTYKLVAHAVPVDTNGSDKA
LARITLTNAAAILDNAAADPALDAKHRDAARASDRLPHNDRNGEWWHSS
>MT0719.2 conserved hypothetical protein
MPAPAQARRADSSEFDPDRGWRLHPQVAVRPEPFGALLYHFGTRKLSFLK
NRTILAVVQTLADYPDIRSACRGAGVDDCDQDPYLHALSVLAGSNMLVPR
QTT
>MT4007 PPE family protein
MPDPGWAARTPEANDLLLKAGTGVGTHLANQTAWTTLGASHHASGVASAI
NTAATAASWLGVGSAASALNVTMLNATLHGLAGWVDVKPAVVSTAIAAFE
TANAAMRPAPECMVNRDEWGVDNAINPSVLWTLTPRIVSLDVEYFGVMWP
NNAAVGATYGGVLAALAESLAIPPPVATMGASPAAPAQAAAAVGQAAAEA
AAGCGMRSAYQGVQAGSTGAGQSTSAGENFGNQLSTFMQPMQAVMQAAPQ
ALQAPSGLMQAPMSAMQPLQSMVGMFANPGALGMGGAAPGASAASAAGGI
SAAATEVGAGGGGAALGGGGMPATSFTRPVSAFESGTSGRPVGLRPSGAL
GADVVRAPTTTVGGTPIGGMPVGHAAGGHRGSHGKSEQAATVRVVDDRR
>MT2032 immunogenic protein MPB64/MPT64
MSLVRHRRQQRDALCLSSTQISRQSNLPPAAGGAANYSRRNFDVRIKIFM
LVTAVVLLCCSGVATAAPKTYCEELKGTDTGQACQIQMSDPAYNINISLP
SYYPDQKSLENYIAQTRDKFLSAATSSTPREAPYELNITSATYQSAIPPR
GTQAVVLKVYQNAGGTHPTTTYKAFDWDQAYRKPITYDTLWQADTDPLPV
VFPIVQGELSKQTGQQVSIAPNAGLDPVNYQNFAVTNDGVIFFFNPGELL
PEAAGPTQVLVPRSAIDSMLA
>MT1077 hypothetical protein
MLLSTLASYVSAAGAHARIVVTVEGRDLEFDVSTFALVGPQQLPEVEPSQ
>MT0642 hypothetical protein
MLGPIRQPRLTVRPGRLPGMIAGVAAKRMNREQFFRAASGLDEDRLRKAL
WNLYWRGTANMRERIEAELASAGRARPARKIKPPADPDIVGWEVDEFVSL
ARSGAYLGGDRRVSPRERSRWRFTFKRLAAEAQDALRAEDAEPAASALEQ
LIDLAREADGYDYFRSDDPVAAAGFVVSDVAAAGHPHFREFAAEIGAAIP
P
>MT3521 conserved hypothetical protein
MRDHLPPGLPPDPFADDPCDPSAALEAVEPGQPLDQQERMAVEADLADLA
VYEALLAHKGIRGLVVCCDECQQDHYHDWDMLRSNLLQLLIDGTVRPHEP
AYDPEPDSYVTWDYCRGYADASLNEAAPDADRFRRR
>MT0689 hypothetical protein
MTMLSFRADDHDVDLADAWARRLHIGRSELLRDALRRHLAALAADQDVQA
YTERPLTDDENALAEIADWGPAEDWADWADAAR
>MT0900 conserved hypothetical protein
MTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGV
SYEDGNAATHRFIAHLPGYQGWQWAVVVASYSGADHATISEVVLVPGPTA
LLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDETAA
EIGLGRRWVMSAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPL
AGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGGSTPIYEPYDD
GVLDIIEKPAES
>MT2568 hypothetical protein
MRTTLDLDDDVIAAARELASSQRRSLGSVISELARRGLMPGRVEADDGLP
VIRVPAGTPPITPEMVRRALDED
>MT1112 hypothetical protein
MHPAGVNPGMTHTPIPRPDARYGRPRLSRRARRRVAIALGVLVAAAGIVI
AVIGYQRISTSAVTGSLVGYRLVDDETASVTISVTRSDPSRPVACIVRVR
ATNGSETGRRELLVPPSEATTVQVTTTVKSSQPPVMADVYGCGTEVPSYL
RLP
>MT2840 PPE family protein
MAASDGAPATTALDQKRNASVDFGALPPEVNSARMYGGAGAADLLAAAAA
WNGIAVEVSTAASSVGSVITRLSTEHWMGPASLSMAAAVQPYLVWLTCTA
ESSALAAAQAMASAAAFETAFALTVPPAEVVANRALLAELTATNILGQNV
SAIAATEARYGEMWAQDASAMYGYAAASAVAARLNPLTRPSHITNPAGLA
HQAAAVGQAGASAFARQVGLSHLISDVADAVLSFASPVMSAADTGLEAVR
QFLNLDVPLFVESAFHGLGGVADFATAAIGNMTLLADAMGTVGGAAPGGG
AAAAVAHAVAPAGVGGTALTADLGNASVVGRLSVPASWSTAAPATAAGAA
LDGTGWAVPEEDGPIAVMPPAPGMVVAANSVGADSGPRYGVKPIVMPKHG
LF
>MT3556 conserved hypothetical protein
MGRNQRATVGEPVPSPATTWLHVSGYRFLLRRIECALLFGDVCAATGALR
ARTTSLALGCVLAIVAAMGCAFVALLRPQSALGQAPIVMGRESGALYVRV
DDVWHPVLNLASARLIAATNANPQPVSESELGHTKRGPLLGIPGAPQLLD
QPLAGAESAWAICDSDNGGSTTVVVGPAEDSSAQVLTAEQMILVATESGS
PTYLLYGGRRAVVDLADPAVVWALRLQGRVPHVVAQSLLNAVPEAPRITA
PRIRGGGRASVGLPGFLVGGVVRITRASGDEYYVVLEDGVQRIGQVAADL
LRFGDSQGSVNVPTVAPDVIRVAPIVNTLPVSAFPDRPPTPVDGSPGRAV
TTLCVTWTPAQPGAARVAFLAGSGPPVPLGGVPVTLAQADGRGPALDAVY
LPPGRSAYVAARSLSGGGTGTRYLVTDTGVRFAIHDDDVAHDLGLPTAAI
PAPWPVLATLPSGPELSRANASVARDTVAPGP
>MT0247 conserved hypothetical protein
MAPLSRKWLPVVGAVALALTFAQSPGQVSPDTKLDLTANPLRFLARATNL
WNSDLPFGQAQNQAYGYLFPHGTFFVIGHLLGVPGWVTQRLWWAVLLTVG
FWGLLRVAEALGVGGPSSRVVGAVAFALSPRVLTTLGSISSETLPMMLAP
WVLLPTILALRGTSGRSVRALAAQAGLAVALMGAVNAIATLAGCLPAVIW
WACHRPNRLWWRYTAWWLLAMALATLWWVMALTQLHGVSPPFLDFIESSG
VTTQWSSLVEVLRGTDSWTPFVAPNATAGAPLVTGSAAILGTCLVAAAGL
AGLTSPAMPARGRLVTMLLVGVVLLAVGHRGGLASPVAHPVQAFLDAAGT
PLRNVHKVGPVIRLPLVLGLAQLLSRVPLPGSAPRPAWLRAFAHPERDKR
VAVAVVALTALMVSTSLAWTGRVAPPGTFGALPQYWQEAADWLRTHHAAT
PTPGRVLVVPGAPFATQVWGTSHDEPLQVLGDGPWGVRDSIPLTPPQTIR
ALDSVQRLFAAGRPSAGLADTLARQGISYVLVRNDLDPETSRSARPILLH
RSIAGSPGLAKLAEFGAPVGPDPLAGFVNDSGLRPRYPAIEIYRVSAPAN
PGAPYFAATDQLARVDGGPEVLLRLDERRRLQGQPPLGPVLMTADARAAG
LPVPQVAVTDTPVARETDYGRVDHHSSAIRAPGDARHTYNRVPDYPVPGA
EPVVGGWTGGRITVSSSSADATAMPDVAPASAPAAAVDGDPATAWVSNAL
QAAVGQWLQVDFDRPVTNAVVTLTPSATAVGAQVRRILIETVNGSTTLRF
DEAGKPLTAALPYGETPWVRFTAAATDDGSAGVQFGITDLAITQYDASGF
AHPVQLRHTVLVPGPPPGSAIAGWDLGSELLGRPGCAPGPDGVRCAASMA
LAPEEPANLSRTLTVPRPVSVTPMVWVRPRQGPKLADLIAAPSTTRASGD
SDLVDILGSAYAAADGDPATAWTAPQRVVQHKTPPTLTLTLPRPTVVTGL
RLAASRSMLPAHPTVVAINLGDGPQVRQLQVGELTTLWLHPRVTDTVSVS
LLDWDDVIDRNALGFDQLKPPGLAEVVVLGAGGAPIAPADAARNRARALT
VDCDHGPVVAVAGRFVHTSIRTTVGALLDGEPVAALPCEREPIALPAGQQ
ELLISPGAAFVVDGAQLSTPGAGLSSATVTSAETGAWGPTHREVRVPESA
TSRVLVVPESINSGWVARTSTGARLTPIAVNGWQQAWVVPAGNPGTITLT
FAPNSLYRASLAIGLALLPLLALLAFWRTGRRQLADRPTPPWRPGAWAAA
GVLAAGAVIASIAGVMVMGTALGVRYALRRRERLRDRVTVGLAAGGLILA
GAALSRHPWRSVDGYAGNWASVQLLALISVSVVAASVVATSESRGQDRMQ
>MT3231 PPE family protein
MSFVVLPPEINSLRMFIGAGTAPMLAAAAAWDGLAEELGTAAQSFASVTA
GLAGQAWQGPAALAMAAAAAPYAGWLTAAAAQSAGAAGQARAVASIFEAA
QAATVLPAAVAANRDAFVQLVMTNLFGQNAPLIAAAEGVYEEMWAADVAA
MSGYYSGASAIAAQVVPWASLLQRFPGLGAGATGATGGESVGTGATGGES
VGTGGGESVGTGGATASGGGVGYVGGGVASAGLAAGDPAHGSVGQGNFGG
GDVGAGDVVASSATSAHAGVVSPGFIGAPLALAALGQMARGGTNSAPGTA
TESARAPEPAASAPPEAVVEVPELEVPAMGVLPTVDPKVAAKAAPLSTTR
VGQSAGSGIPESTLRTAQGQQASETSAAEETAPSLRPEAAAGQLRPRVRK
DPKIQMRGG
>MT0263 hypothetical protein
MSAPTANRPAIGVFTPTRAQIPERTLRTDLWWLPPLLTNLGLLAFICYAT
TRAFWGSQYWVEKYHYLTPFYSPCVSASCQPGASHLGVWFGHFPGWIPLG
AMVLPFLLGFRLTCYYYRKAYYRSVWQSPTSCAVPEPRAHYTGETRLPLI
VQNTHRYFFYIAVVVSLINTYDAIAAFHSPSGFGFGLGNVILTINVVLLW
AYTISCHSCRHATGGRLKHFSKHPVRYWIWTQVSKLNTRHMQFAWITLGT
LALTDFYIMLVASGSITDLRFIG
>MT3102 hypothetical protein
MIPLAGDPVSSHRTVEFGVLGTYLVSGGSL
>MT3632 hypothetical protein
MTRRPDRKDVATVDELHASATKLVGLDDFGTDDDNYREALGVLLDAYQGE
AGLTVLGSKMNRFFLRGALVARLLSQSAWKQYPEHVDVAIKRPIFVTGLV
RTGTTALHRLLGADPAHQGLHMWLAEYPQPRPPRETWESNPLYRQLDAQF
TQHHAENPGYTGLHFMAAYELEECWQLLRQSLHSVSYEALAHVPSYADWL
SRQDWTPSYCRHRRNLQLIGLNDAEKRWVLKNPSHLFALDALMATYPDAL
VVQTHRPVETIMASMCSLAQHTTEGWSTKFVGAQIGADAMDTWSRGLERF
NAARAKYDSAQFYDVDYHDLIADPLGTVADIYRHFGLTLSDEARQAMTTV
HAESQSGARAPKHSYSLADYGLTVEMVKERFAGL
>MT2635 hypothetical protein
MPGSAGWRKVFGGTGGATGALPRHGRGSIVYARSTTIEAQPLSVDIGIAH
VRDVVMPALQEIDGCVGVSLLVDRQSGRCIATSAWETLEAMRASVERVAP
IRDRAALMFAGSARVEEWDIALLHRDHPSHEGACVRATWLKVVPDQLGRS
LEFYRTSVLPELESLDGFCSASLMVDHPACRRAVSCSTFDSMDAMARNRD
RASELRSRRVRELGAEVLDVAEFELAIAHLRVPELV
>MT2582 conserved hypothetical protein
MDMNDPRRPQRFGPPLSGYGPTGPQVPPNPPTADPAYADQSPYASTYGGY
VSPPWSPGGPPPRPPQWPPGPHEASPTQQLPQYWQYDQPPPGGFPPDGLT
PPPPQGPRTPRWLWFAAGSAVLLVVALVIALVIANGSVKKQTAIEPLPPM
PGPSPTRPTTTTPTPPSPSAAPAPTTTTGTPSETVAGAMQTVVYDVTGEG
RAISITYMDSGNVIQTEFNVALPWRKEVSLSKSSLHPASVTIVNIGHNVT
CSVTVAGVQVRQRTGAGLTICDAPS
>MT0779 PPE family protein
MMVGFAWLPPETNSLRMYLGAGSRPLLAAAGAWDGLAEELHAAASSFGSV
TSELAGGAWQGPASAAMANAAGPYASWLTAAGAQAELAARQARAAAGAFE
EALAGVVHPAVVQANRVRTWLLAVSNVFGQNAPAIAAMESTYEQMWAQDV
AVMAGYHAASSAAAAQLASWQPALPNINLGVGNIGNLNVGNGNTGDYNLG
NGNLGNANFGGGNGSAFHGQISSFNVGSGNIGNFNLGSGNGNVGIGPSSF
NVGSGNIGNANVGGGNSGDNNFGFGNFGNANIGIGNAGPNMSSPAVPTPG
NGNVGIGNGGNGNFGGGNTGNANIGLGNVGDGNVGFGNSGSYNFGFGNTG
NNNIGIGLTGSNQIGFGGLNSGSGNIGFGNSGTGNIGFFNSGSGNFGVGN
SGVTNTGVANSGNINTGFGNSGFINTGFGNALSVNTGFGNSGQANTGIGN
AGDFNTGNFNGGIINTGSFNSGAFNSGSFNGGDANSGFLNSGLTNTGFAN
SGNINTGGFNAGNLNTGFGNTTDGLGENSGFGNAGSGNSGFNNSGRGNSG
AQNVGNLQISGFANSGQSVTGYNNSVSVTSGFGNKGTGLFSGFMSGFGNT
GFLQSGFGNLEANPDNNSATSGFGNSGKQDSGGFNSIDFVSGFFHR
>MT2374 hypothetical protein
MMKEIELHLVDAAAPSGEIAIKDLAALATALQELTTRISRDPINTPGPGR
TKQFMEELSQLASAPGPDIDGGIDLTDDEFQAFLQAARS
>MT3037 hypothetical protein
MGECRRTVGGAELRALGAGSCCEAADLDEVVGEDAVPGPDPGSFGAVDAG
AVPTVVSFEGADAAFAAGAPLDCSSERWPLFFGAPGLGGSAFTRYRDRTY
TEVVQVIFDAFLAVAAVGGDGAGRASGACGDPFDRGR
>MT0898 conserved hypothetical protein
MKRGVATLPVILVILLSVAAGAGAWLLVRGHGPQQPEISAYSHGHLTRVG
PYLYCNVVDLDDCQTPQAQGELPVSERYPVQLSVPEVISRAPWRLLQVYQ
DPANTTSTLFRPDTRLAVTIPTVDPQRGRLTGIVVQLLTLVVDHSGELRD
VPHAEWSVRLIF
>MT2617 hypothetical protein
MVRSLLWAFAHRQIGPVEQDGRQVVAGHPLQRVLAVLRVPAGGGLVITQR
SGSRVGLQMGEAVAVRGSGDDLVGV
>MT2165.1 hypothetical protein
MRPPVAGGEIIPISPTRRCEMHTMSSAEYRGL
>MT2299 conserved hypothetical protein
MPIATVCTWPAETEGGSTVVAADHASNYARKLGIQRDQLIQEWGWDEDTD
DDIRAAIEEACGGELLDEDTDEVIDVVLLWWRDGDGDLVDTLMDAIGPLA
EDGVIWVVTPKTGQPGHVLPAEIAEAAPTAGLMPTSSVNLGNWSASRLVQ
PKSRAGKR
>MT3870 lipoprotein
MKRGLTVAVAGAAILVAGLSGCSSNKSTTGSGETTTAAGTTASPGAASGP
KVVIDGKDQNVTGSVVCTTAAGNVNIAIGGAATGIAAVLTDGNPPEVKSV
GLGNVNGVTLGYTSGTGQGNASATKDGSHYKITGTATGVDMANPMSPVNK
SFEIEVTSS
>MT0362 hypothetical protein
MPGARELTLRVERGALFRRRWAASAASSARAAIRRDPRRCALGTRPRWVS
FLVIVLVIMNVVTAHPKYPNDPLALVLIELRHPRTEPPVPSAISILKEEL
ARWTPILEQEEVRQVNLETGEHTAHSQKKLVARDRRTAITFRPDAMTLEV
TDYPGWEEFRSIVHAMVTARQDVAPVDGCIRIGLRYINEIRASLAEPSGW
AYWVAESLLGPGTQLADLKLTTTAQRHVIQCEGPEPGDSLTLRYAGARGA
VIQSTPFLQRLKEPPAEGDFFLIDIDSAWSDPCKGIPALDAHLVDEVAER
LHTPIGPLFESLITSELRTKVLQQPGQE
>MT1432 hypothetical protein
MDFGFPDLLNRSTTADDWWRQVASQLAVPGITRGWATYRACRAMVASTQP
CQFSPTSPRCNPTRIIVAASAPNAGAQHHRTLP
>MT3879 hypothetical protein
MRAERARAIGLFRYQLIREAADAAHSTKERGKMVRELASREHTDPFGRKV
RISRHTIDRWIRN
>MT3560 hypothetical protein
MTAADWHTAYHGLMLAPDPARQARRPSTQPNAHLSAECAAMNRNHFTGRE
VCGHANRTLGLTSTAIALRRVDAGRHNKQ
>MT1241 hypothetical protein
MLLAYVLITKGEFGAAASMLEPAAATLERTGYSWGPLSLMLLATAIAQQG
HIAESAKTLQRAEARHGTKSALFAPELGLARAWTRAAAQDMTGAIAAARE
AARTAERAGQAAVALCAWHNAVRLGDIRAVDPVTRLAAEIDCTVGNILVK
HARGLADGDAAELTAVAEELAGIGMAAAAADATKAAARLGPQQR
>MT0327 hypothetical protein
MGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFE
DLSRRSRPAPETAGEAGDGVWAIYTVDADGGARVEQVYATELDALRANKD
NTDPKRKVRFLPYGIAVSVLDDPVDEAQ
>MT3752 PE family protein
MSFVIAAPEALDSAATDLVVLGSTLGAATAAAAAQTTGIVAAAHDEVSAA
IAALFSAHGQAYQAASAQAAAFHTRFIRARSRHPQQETTCRRVR
>MT3628 hypothetical protein
MRESIEAHQSIVFAALAVRTGLNTKQVGASRNSFKHYVATAP
>MT0697 hypothetical protein
MFDSAAAITNPGHAWASAMERSGLLECVAGLDEQPFGEFTADKLNPDRGS
SRRVPRRQADGGIATHVERGGGQRQSGGQAGVVPQRMHGFPALAMQDRLI
HHGEQTQNRIAQAFRVRFCVCSPT
>MT1916 hypothetical protein
MRDHCFWNVTRAARRLSQGRGAVGGDGNLDAGDVLAHRCLHRCGIGDHGQ
AARPVGRAHRQERLLGLSVPVDWCGPRFDGADAFPLDQPISGGLPIDRGL
HVLPAEFRRRLGAVDPETVFGQVTAVGGDQTSAPGIGRQQFSHGGDGVRD
VIAPRPAGEGQAPGDGQRAGRIVGVQAEFDCGDLDSGGKARIQVDVFDVI
EPQPSQFQRPGTGDPNGRRPMQSVAVGDGRCVVGVGASLWINPAFGRYSQ
TCRTFDRRQQQRGTLVNHVVGVHKLGVGPADHPVLWAGLPDLVSRNRVAD
PRVRVVSSYGTEPRPQLADPLSVVLDRLAVGDAQRLLEYRVHVGRPVQVD
PQLSRTGHRNVVAGHVRQRNGFVLHSPLQFAALGAQARSGAPGFGASQQH
HAGPPALDVQACPVDQGLRHVAAHPAVTGGEGPRLDSFAEELPGIPVMRR
QHVHHTDRIHRLEHRRIRGFPGRGGHQVDGFDGLLLGVDVASVVDLSIAD
QHWRPGVYRHGLNLARRRCGSWNGPRRSRAIGPSAATPNRSSRPGSGTRR
GTAGLRVEKL
>MT2225 conserved hypothetical protein
MPLSDHEQRMLDQIESALYAEDPKFASSVRGGGFRAPTARRRLQGAALFI
IGLGMLVSGVAFKETMIGSFPILSVFGFVVMFGGVVYAITGPRLSGRMDR
GGSAAGASRQRRTKGAGGSFTSRMEDRFRRRFDE
>MT4020 hypothetical protein
MTIGVDLSTDLQDWIRLSGMNMIQGSETNDGRTILWNKGGEVRYFIDRLA
GWYVITSSDRMSREGYEFAAASMSVIEKYLYGYFGGSVRSERELPAIRAP
FQPEELMPEYSIGTMTFAGRQRDTLIDSSGTVVAITAADRLVELSHYLDV
SVNVIKDSFLDSEGKPLFTLWKDYKG
>MT1330 hypothetical protein
MAKATGLSAKGAKTFAVDAASAYCPQYVTSS
>MT2549 hypothetical protein
MVERGLWLPDPAHRADLATFVDHALRLDDAAVIRIRARSTGLLSAWVATG
FDVLASRVVAGKVRPDDLSVAARSLAHGLATTDASGYVDPGYSMDSAWRG
GLPPESGFTYLDDVPARVMLDLAHRGARLAKEHGSSAGPPVSLLDQEVIQ
VSSADVVVGLPMRCVFALTAMGFLPQSAETISADELIRVRISPAWLRLDA
RFGSVYRHRGHAALVLR
>MT1252 PE family protein
MLASAATDLAGIGSALSAANAAAAAPTTAMLAACADEVSAVVASLFARHA
QAYQALSLQATAFHQQFVQALTGAGGAYAAAEAVNAAVAQSVQQDVLNVI
NAPTQALFDR
>MT2142 hypothetical protein
MGDAGQCVAAAYRFDQGGSEFLDTWCDAGSPVAGRHVPHGLVITDIDVGT
VKAPDAFMAERVGW
>MT2139 hypothetical protein
MFVDVGLLHSGANESHYAGEHAHGGADQLSRGPLLSGMFGTFPVAQTFHD
AVGAAHAQQMRNLHAHRQALITVGEKARHAATGFTDMDDGNAAELKAVVC
SCAT
>MT3541 hypothetical protein
MAWRWHPLTEGSRGYNFRAGTHKWAGSELGRILRVVVGLVLVIAAYVTVI
ALYHSTGLGRPHEVAHGRPTADGTTVTLHVEQLQTIKGVLVANLAVSPGT
ELLDSQTQGLKDDLTVTVTSVVTPTKRTWSSGSLPGVFPVPLTISGDPAN
WPFDHYRSGPITVQLYRGAAHAPERVSVTFVDRLPGWNVDISGVGDANVP
APYRVGLHRSPSSVAFGTVIVGVLIALAGVGLFVVVQTARGRRQFQPPMT
TWYAAMLFAVIPLRNALPDAPPIGFWIDVTVVLWVVVALVTSMVLYILCW
WWHLKPDVDETM
>MT3013 hypothetical protein
MGSVETPIIQRPRTLPRATARRPLTRPTTGNPYQVWCLRWVIVVGQPSVA
WGLEQAVNAALAWSEATLAAP
>MT0236 conserved hypothetical protein
MRWFRPGYALVLVLLLAAPLLRPGYLLLRDAVSTPRSYVSANALGLTSAP
RATPQDFAVALASHLVDGGVVVKALLLLGLWLAGWGAARLVATALPAAGA
AGQFVAITLAIWNPYVAERLLQGHWSLLVGYGCLPWVATAMLTMRTTVGA
GWFGLFGLAFWVALAGLTPSGLLLAATVAVVCVAMPGAGRPRWQCGVAAL
GSALVGALPWLTASALGSSLTSHTAANQLGVTAFAPRAEPGLGTLGSLAS
LGGIWNGEAVPSSRTTLFAVASAVVLLAMVAIGLPTVARRPVAVPLLTLA
AVSVMVPAVLATGPGLHALRVVVDAAPGLGVLRDGQKWVALAVPGYTLSG
AGTVLTLRRWLRPATAAVVCCLALVLTLPDLAWGVWGKVAPVHYPSGWAA
VAAAINADPRTVAVLPAGTMRRFSWSGSAPVLDPLPRWVRADVLTTGDLV
ISGVTVPGEDAHARAVQELLLTGPHPSTLAAAGVGWLVVESDSAGDMGAA
ARTLGRLAAAHRDDELALYRVGGQTSGASSARLKATMLAHWAWLSMLLVG
GAGAAGYWVRRHLHHCEDTPASRAQD
>MT2167 PPE family protein
MPNFWALPPEINSTRIYLGPGSGPILAAAQGWNALASELEKTKVGLQSAL
DTLLESYRGQSSQALIQQTLPYVQWLTTTAEHAHKTAIQLTAAANAYEQA
RAAMVPPAMVRANRVQTTVLKAINWFGQFSTRIADKEADYEQMWFQDALV
MENYWEAVQEAIQSTSHFEDPPEMADDYDEAWMLNTVFDYHNENAKEEVI
HLVPDVNKERGPIELVTKVDKEGTIRLVYDGEPTFSYKEHPKF
>MT1555 conserved hypothetical protein
MKKVAIVQSNYIPWRGYFDLIAFVDEFIIYDDMQYTKRDWRNRNRIKTSQ
GLQWITVPVQVKGRFHQKIRETLIDGTDWAKAHWRALEFNYSAAAHFAEI
ADWLAPIYLEEQHTNLSLLNRRLLNAICSYLGISTRLANSWDYELADGKT
ERLANLCQQAAATEYVSGPSARSYVDERVFDELSIRVTWFDYDGYRDYKQ
LWGGFEPAVSILDLLFNVGAEAPDYLRYCRQ
>MT3544 hypothetical protein
MADRLNVAERLAEGRPAAEHTQSYVRACHLVGYQHPDLTAYPAQIHDWYG
SEDGLDLHALDADCAQLRAAASVLMEALRMERSQVAVLAAAWTGSGADAA
VHFVQRHCETGNSVVTEVRAAAQRCESLRDNLWQLVDSKVATAIAIDERA
LAQRPAWLAAAEALTTEGADRPTAVEVVRQQIQPYVDDDVRNDWLTTMRS
TTAGVAASYDAVTDQLASAPRAHFEIPDDLGPGRQPSPASVPAQPSATAA
ITPAAALPPPDPVPAVTSRPVTPSDFGSAPGDGSATPAGVGSAGGFGDAG
GTGGLGGFAGLAGLANRIVDAVDSLLGSVAEQLGDPLAADNPPGAVDPFA
EDAADNADDGDDAHPEEADEAAEPKEATEPDEADEVDDADESVPAERAQD
VAEEATLPPVAEPPPPAAPPVAEPPPPVAAPAPPGAPEPANGPSPEALSE
GATPCEIAADELPQAGP
>MT2459 hypothetical protein
MAIFGRGHGASEPGGTGEPAETPGRGRLTRSVIGWVGAVAVVVSLAGSGW
CGWVLFEKHQTDVAAGQALQAARSYVVKLATMDCERIDHNMRDILEGSTG
EFKDKYGKSSAHLRQLLADNRVATHGTVVAASVKSATTNKVVVLMFIDQS
VSNRNSPTPQIDRSRIKVIMDKVNGRWLASKVELL
>MT2673 hypothetical protein
MGNLLVVIAVALFIAAIVVLVVAIRRPKTPATPGGRRDPLAFDAMPQFGP
RQLGPGAIVSHGGIDYVVRGSVTFREGPFVWWEHLLEGGDTPTWLSVQED
DGRLELAMWVKRTDLGLQPGGQHVIDGVTFQETERGHAGYTTEGTTGLPA
GGEMDYVDCASAGQGADESMLLSFERWAPDMGWEIATGKSVLAGELTVYP
APPVSA
>MT3957 hypothetical protein
MGLSKTIPRCRFSLPLVGRQGRFSSLRCGDSTVTPCSPTTMGGKEPGMET
IASSDRCVCNRIPCRQWCSACRFRVLHGVSVGAVVWVGPGVDVVVGVDQF
FRAAGGDRVAVGVAQLVGAAGGDRHPGAFGQLAGYGDGGLGVAMPLGGHQ
PVVERRQVAGRCAGQQADVGPTPQHRCRGR
>MT0894 PE_PGRS family protein
MSYVLATPEMVAAAANNLAQIGSTLSAANAAALAPTTGVLAAGADEVSAA
VASLFSGHAQAYQTLGTQAAAFHERFIQALSTAAGAYGSAEAANASPLQQ
ALNVINAPTQTLLGRPLIGNGTNGAPGTGQAGGPGGLLYGNGGNGGSGGV
GQAGGAGGSAGLIGIGGTGGAGGAGAVGGVGGNGGWLYGNGGAGGLGGTG
VAGVNGGMGAAGGAGGNAYLFGSGGAGGQGGMGAAGADGVNPTPTGTADA
GSTGTDQTLGGNAIGGNGGPGDAGDAMTSGGAGGSGGNAVSTVNGDAVGG
EGGKGGEGAYGGAGGAGGSAASIGNAAIGGNGGAGGNAQAPGGVGGAGGE
GGDAQVGTNSPSNAEAGNGGSGGNGFDSFASGGTGGAGGTGGAGGRGGLL
IGDGGAGGAGGVGGTGGSGAPGGGGGAGGDGGAANTDSAGSSRKAFGGDG
GVGGDGASALGTGGEGGIGGQGGNGGAGGLLIGNGGAGGVGGTAGAGGTG
GSGGAGGAGGAGGGGTNSGPGAAFGGNGNTGGNGGNGGAPGALGGKGGSG
GLIGRAGSDGGVGAGGAGGAGGAGGTGGEGGTGGDGKTTDGNPGMGGSPG
SAGQPGQPG
>MT1327 hypothetical protein
MCVSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFR
TALRDSLDIYGVMATPGVDLEKTRFPVGVRSDLLPDKRPDIADVLYGIHR
WLHGHADESSVEFEVSPYVNASAALRIANDGKIQLPKSAILGLLAVAVFA
PENKGEVIPPDYQLSWYDHVFFISVWWGWQDHFREIVNVDRASLVALDFG
DLWNGWTPVG
>MT1205 PPE family protein
MDFTIFPPEFNSLNIQGSARPFLVAANAWKNLSNELSYAASRFESEINGL
ITSWRGPSSTIMAAAVAPFRAWIVTTASLAELVADHISVVAGAYEAAHAA
HVPLPVIETNRLTRLALATTNIFGIHTPAIFALDALYAQYWSQDGEAMNL
YATMAAAAARLTPFSPPAPIANPGALARLYELIGSVSETVGSFAAPATKN
LPSKLWTLLTKGTYPLTAARISSIPVEYVLAFVEGSNMGQMMGNLAMRSL
TPTLKGPLELLPNAVRPAVSATLGNADTIGGLSVPPSWVADKSITPLAKA
VPTSAPGGPSGTSWAQLGLASLAGGAVGAVAARTRSGVILRSPAAG
>MT0291 PE_PGRS family protein
MSTAIAALFGAHGQAYQALSAQAQAFHAQFVQALTSGGGAYAAAEAAAVS
PLLDPINEFFLANTGRPLIGNGANGAPGTGANGGDGGWLIGNGGAGGSGA
AGVNGGAGGNGGAGGLIGNGGAGGAGGVASSGIGGSGGAGGNAMLFGAGG
AGGAGGGVVALTGGAGGAGGAGGNAGLLFGAAGVGGAGGFTNGSALGGAG
GAGGAGGLFATGGVGGSGGAGSSGGAGGAGGAGGLFGAGGTGGHGGFADS
SFGGVGGAGGAGGLFGAGGEGGSGGHSLVAGGDGGAGGNAGMLALGAAGG
AGGIGGDGGTLTAGGIGGAGGAGGNAGLLFGSGGSGGAGGFGFADGGQGG
PGGNAGTVFGSGGAGGNGGVGQGFAGGIGGAGGTPGLIGNGGNGGNGGAS
AVTGGNGGIGGTGVLIGNGGNGGSGGIGAGKAGVGGVSGLLLGLDGFNAP
ASTSPLHTLQQNVLNVVNEPFQTLTGRPLIGNGANGTPGTGADGGAGGWL
FGNGANGTPGTGAAGGAGGWLFGNGGNGGHGATNTAATATGGAGGAGGIL
FGTGGNGGTGGIATGAGGIGGAGGAGGVSLLIGSGGTGGNGGNSIGVAGI
GGAGGRGGDAGLLFGAAGTGGHGAAGGVPAGVGGAGGNGGLFANGGAGGA
GGFNAAGGNGGNGGLFGTGGTGGAGTNFGAGGNGGNGGLFGAGGTGGAAG
SGGSGITTGGGGHGGNAGLLSLGASGGAGGSGGASSLAGGAGGTGGNGAL
LFGFGGAGGAGGHGGAALTSIQQGGAGGAGGNGGLLFGSAGAGGAGGSGA
NALGAGTGGTGGDGGHAGVFGNGGDGGCRRVWRRYRRQRWCRRQRRADRQ
RRQRRQRRQSRGHARCRRHRRAAARRERTQRLAIAGRPATTRGVEGISCS
PQMMP
>MT0209 conserved hypothetical protein
MFSTYGIASTLLGVLSVAAVVLGAMIWSAHRDDSGERTYLTRVMLTAAEW
TAVLINMNADNIDASLQRLHDGTVGQLNTDFDAVVQPYRQVVEKLRTHSS
GRIEAVAIDTVHRELDTQSGAARPVVTTKLPPFATRTDSVLLVATSVSEN
AGAKPQTVHWNLRLDVSDVDGKLMISRLESIR
>MT2618 lipoprotein, putative
MIAPQPISRTLPRWQRIVALTMIGISTALIGGCTMDHNPDTSRRLTGEQK
IQLIDSMRNKGSYEAARERLTATARIIADRVSAAIPGQTWKFDDDPNIQQ
SDRNGALCDKLTADIARRPIANSVMFGATFSAEDFKIAANIVREEAAKYG
ATTESSLFNESAKRDYDVQGNGYEFRLLQIKFATLNITGDCFLLQKVLDL
PAGQLPPEPPIWPTTSTPH
>MT0505 conserved hypothetical protein
MTSSLPTVQRVIQNALEVSQLKYSQHPRPGGAPPALIVELPGERKLKINT
ILSVGEHSVRVEAFVCRKPDENREDVYRFLLRRNRRLYGVAYTLDNVGDI
YLVGQMALSAVDADEVDRVLGQVLEVVDSDFNALLELGFRSSIQREWQWR
LSRGESLQNLQAFAHLRPTTMQSAQRDEKELGG
>MT0835 hypothetical protein
MVLFFEIMLVLATVVISWFALYTLYRLVTDES
>MT1786 hypothetical protein
MDHVRHSGPWPTTAGYRKPHMVINRSIASIDSIAVAGSAATTGAVAVAGS
VATAGSVAVAGSVATAGSVAIAGAAATAGSVGIIGSLLTVLCVAVRQCVA
CLACITCTRCVACIGCVRCTDCVGCLWCVNCSGLRNVVGARNLRVGNLGR
VSN
>MT2037 serine esterase, cutinase family
MTPRSLVRIVGVVVATTLALVSAPAGGRAAHADPCSDIAVVFARGTHQAS
GLGDVGEAFVDSLTSQVGGRSIGVYAVNYPASDDYRASASNGSDDASAHI
QRTVASCPNTRIVLGGYSQGATVIDLSTSAMPPAVADHVAAVALFGEPSS
GFSSMLWGGGSLPTIGPLYSSKTINLCAPDDPICTGGGNIMAHVSYVQSG
MTSQAATFAANRLDHAG
>MT2137 hypothetical protein
MLATLSQIRAWSTEHLIDAAGYWTETADRWEDVFLQMRNQAHAIAWNGAG
GDGLRQRTRADFSTVSGIADQLRRAATIARNGAGTIDAAQRRVMYAVEDA
QDAGFNVGEDLSVTDTKTTQPAAVQAARLAQAQALAGDIRLRVGQLVAAE
NEVSGQLAATTGDVGNVRFAGAPVVAHSAVQLVDFFKQDGPTPPPPGAPH
PSGGADGPYSDPITSMMLPPAGTEAPVSDATKRWVDNMVNELAARPPDDP
IAVEARRLAFQALHRPCNSAEWTAAVAGFAGSSAGVVGTALAIPAGPADW
ALLGAALLGVGGSGAAVVNCATK
>MT3612 PE_PGRS family protein
MSFVLVSPETVAAVATDLKRIGASLAHENASAAASTTAVVSAAADEVSTA
VAALFSQHAQGYQAAAAQVAAFHSRFVQALTAGAGAYAFAEAANASPLQS
AMGAVSASAQTLLSRPLIGNGANATTPGGNGGDGGWLFGSGGNGAPGAAG
QSGGNGGSAGLWGNGGAGGAGGSGGAAGGNGGNGGWLFGAGGTGGIGGTG
APGAMGGTGGNGGNGALLIGGGGLGGAGGMGGTGGGTGGTGGNGGNGALL
IGAGGVGGAGGIGGQGTGAGGAAGAGGTGGNGGAGGLFMNGGDGGAGGQG
GDGAAGDAAASAGGTGGKGGQGGDGGTGGAGGAGPVLFGHGGAGGMGGQG
GTGGMGGAGGDGTTVIAAGTGGEGGTGGAAGAGGAAGARGALTSGGLAGG
VGAGGTGGTGGTGGNGADAAAVVGFGANGDPGFAGGKGGNGGIGGAAVTG
GVAGDGGTGGKGGTGGAGGAGNDAGSTGNPGGKGGDGGIGGAGGAGGAAG
TGNGGHAGNTGDGGDGGTGGNGGNGTGGVNGADNTLNPDTPGGAGEPGGA
GGAGGAAGGPGGTGGTGGNGGNGGNGGNGGNGGNGGNGGNAGNNSTNAPV
GGEGGAGGDGGAGGAGGAANGGTAGSQGTGGVGGDGGAGGNGGGGKAGTG
NSGNFGVDGEAGFSGGAGGNGGVGGAAGANGGTGGSGGNGGDGGAGGIGG
AGGNGIPGTGTEPAGGTGAKGGDGGDGGAGGAGGNAGGAGGNGGAGGQGG
NAGQGGAGGAGGNAVIPGDGVGKAPHGDAGGSGGDGGKGGQGGSGGTGGS
GAPIGGGAGGTGGSGGHAGKGGAGGIGAQGTTITVPGNGGNAGDGGNAGA
GGNGGSGDFGGNTTSGASGSGGNGGNAGTAGSGGAGGTGGTGLSGGNGGN
GGNGGNGGDGGNGAHGTVGAQFVPATSLPTPNGGAGGNGGTGSNGGAPGP
AGAPGPTTGGNAGSQGIGGDGGNGGDGGKGGDGADAVNVVFMPTEPQAAT
GTAGSAGDPTGGNGGPGTPGSPMVAPPPPTPITQVQQGGDGGAGGTGSTN
ANDGTATGGKGGEGGVGSILGGPGGNGGTGGNASATGTNGVANAGNGGKG
GDGGQFGAGGNGGAGGSVTDGSAGSTAGNGGNGGNATNGTIAGQPAGGNG
SAGGKGGDGGNIAAGATGTAGNGGNGGNGNDGAVNAGTGGSGGNGGNAGG
GGANGGDGGAGGAGGAGGRGGKGIDGGFGGDGGNGGSNNGTGAGGNGGNG
GTGGVGSVGAAGGDGGNGGTGGFAGFGGTAGNGGSGGTGGAGGDGGTGGD
GGNGGTGVIAGGGGTGGNGGASGAGGAGGTGGFAGNGNAGGNGGTGGASE
DGDNGNAGSGATGGTGGNGGTGGDGGAAGLGGVA
>MT0425 conserved hypothetical protein
MTVELAHPSTEPLGSRSPAEPAHPRRWFISTTPGRIMTIGIVLAALGVAS
AFATSTTIEHRQQVLTAVLDHTEPLSFAAGRLYTTLSVADAAAATAFIAQ
AEPGGVRLRYEQAITDASVAVTRASSGLTDESLVQLLGRINAELAVYTGL
VEIARANNRAGNPVGSSYLSEASGLMQSTILPDAQRLYQATSARVDRETT
ASTQIPAPVILVVATTVVFGAFAHRWLARRTRRRINPGLVVGALGILVMV
VWVGTALTISTTASRSAKDTAAESLKTITNLAITAQQARADETLSLIRRG
DEEVRKQAFYQRIDAMQRQLNDYMARRHAVDKPDLQGADQLLVRWRQAND
RINSYISVGNYRAATQVALGKGEDDATPAFDKLDEALTKAMGQSRTQLRH
DILNAHRGLAGAQVGGVVLSLGAAIAVALGLWPRLKEYR
>MT2413 hypothetical protein
MLLPLGPPLPPDAVVAKRAESGMLGGLSVPLSWGVAVPPDDYDHWAPAPE
DGADVDVQAAEGADAEAAAMDEWDEWQAWNEWVAENAEPRFEVPRSSSSV
IPHSPAAG
>MT3583 hypothetical protein
MLWQSGLRDPTPPGGPHGIEGLSLAFEKPSPVTALTQELRFATTMTGGVS
LAIWMAGVTREINLLAQASQWRRLGGTFPTNSQLTNESAASLRLYAQLID
LLDMVVDVDILSGTSAGGINAALLASSRVTGSDLGGIRDLWLDLGALTEL
LRDPRDKKTPSLLYGDERIFAALAKRLPKLATGPFPPTTFPEAARTPSTT
LYITTTLLAGETSRFTDSFGTLVQDVDRRGLFTFTETDLARPDTAPALAL
AARSSASFPLAFEPSFLPFTKGTAKKGEVPARPAMAPFTSLTRPHWVSDG
GLLDNRPIGVLFKRIFDRPARRPVRRVLLFVVPSSGPAPDPMHEPPPDNV
DEPLGLIDGLLKGLAAVTTQSIAADLRAIRAHQDCMEARTDAKLRLAELA
ATLRNGTRLLTPSLLTDYRTREATKQAQTLTSALLRRLSTCPPESGPATE
SLPKSWSAELTVGGDADKVCRQQITATILLSWSQPTAQPLPQSPAELARF
GQPAYDLAKGCALTVIRAAFQLARSDADIAALAEVTEAIHRAWRPTASSD
LSVLVRTMCSRPAIRQGSLENAADQLAADYLQQSTVPGDAWERLGAALVN
AYPTLTQLAASASADSGAPTDSLLARDHVAAGQLETYLSYLGTYPGRADD
SRDAPTMAWKLFDLATTQRAMLPADAEIEQGLELVQVSADTRSLLAPDWQ
TAQQKLTGMRLHHFGAFYKRSWRANDWMWGRLDGAGWLVHVLLDPRRVRW
IVGERADTNGPQSGAQWFLGKLKELGAPDFPSPGYPLPAVGGGPAQHLTE
DMLLDELGFLDDPAKPLPASIPWTALWLSQAWQQRVLEEELDGLANTVLD
PQPGKLPDWSPTSSRTWATKVLAAHPGDAKYALLNENPIAGETFASDKGS
PLMAHTVAKAAATAAGAAGSVRQLPSVLKPPLITLRTLTLSGYRVVSLTK
GIARSTIIAGALLLVLGVAAAIQSVTVFGVTGLIAAGTGGLLVVLGTWQV
SGRLLFALLSFSVVGAVLALATPVVREWLFGTQQQPGWVGTHAYWLGAQW
WHPLVVVGLIALVAIMIAAATPGRR
>MT2988.1 hypothetical protein
MAAPSPQQLVRMVLDQVVSRKAGAVPTLDWR
>MT0033 hypothetical protein
MRLLAPARRAGRAPELVGITTCCKTYTPGDSLRRAVDSTAPTSSVQPRAL
PAIAGLSVELGIATQRHDGLPKIVHAMATAAGNGAAAEEVDLLRVHVDTA
LHHVLAQYPRVDPALLLNCMLLAATERSVTGDPIAANYHFAWFRELDSRR
>MT2316 hypothetical protein
MRLRPAEDSDDFLAWSSTDTTIDDAVHVTGPYDYLLHIRVCDTADLDRLL
RRLKTSAEAAQTQTRIALRSRR
>MT3275.1 hypothetical protein
MEARCCRSTKSLTDFGLGLLLIAVLGTDLGTEQAETNPYNRDAAEQAGRR
NRG
>MT2876 hypothetical protein
MTYAARDDTTLPKLLAQMRWVVLVDKRQLAVLLLENEGPVASATDTLDTR
GDSDYENQPVDAVERLCRRLADQAVRQWGFMQGLKQKLGPGVDVRMKLVE
WNR
>MT2520.1 hypothetical protein
MPLISVEPEPPSCELPALSPARGTRSCTTVLGGSSGNGPSNAASLSSPEA
DLRRRRRRRRRLPASSEPFSSSPLSPASVSSESRSSPSSVSAASALVNCW
ARGSDCWSTGSPSDPPCSPRPRPRPRRPRRRRRFAGRSSCPSSSASESSA
T
>MT2079 hypothetical protein
MRPQHSPAGKAFVVKKITHEQSRRNITRRGRRVAARHARAGRWAAQPRPM
LGSGAVRYEVGANIDATGFGGIAAVHRLVTRLGLVTRLGLVERVDAHSRF
SSSNLPKSSRRISGRVSLSGMSNSAAKVVASTSSSPWGQPLSVGLRRRWR
S
>MT3769 hypothetical protein
MQTAHRRFAAAFAAVLLAVVCLPANTAAADDKLPLGGGAGIVVNGDTMCT
LTTIGHDKNGDLIGFTSAHCGGPGAQIAAEGAENAGPVGIMVAGNDGLDY
AVIKFDPAKVTPVAVFNGFAINGIGPDPSFGQIACKQGRTTGNSCGVTWG
PGESPGTLVMQVCGGPGDSGAPVTVDNLLVGMIHGAFSDNLPSCITKYIP
LHTPAVVMSINADLADINAKNRPGAGFVPVPA
>MT2095 hypothetical protein
MTRPRTDAIHHHVVVNAPIERAFAVFTTRFGDFKPREHNLLAIPITETVF
ECHAGGHIYDRGVDGSVCKWARVLVYEPPSRVLFTWDIGPTWRPETDLAK
TSEVEVRFTAQSAETTRVDLEHRHLDRHGPGWESVADGVDSEAGWPLYLR
RYTDLLCIQVQP
>MT2314 hypothetical protein
MSGHRKKAMLALAAASLAATLAPNAVAAAEPSWNGQYLVTLSANAKTGTS
MAANRPEYPHKANYTFSSRCASDVCIATVVDAPPPKNEFIPRPIEYTWNG
TQWVREISWQWDCLLPDGTIEYAPAKSITAYTPGQYGILTGVFHTDIASG
TCKGNVDMPVSAKPIVG
>MT1265 hypothetical protein
MSRQWHWLAATLLLITTAACSRPGTEEPDCPTKITLPPGATPTTTLDPRC
IVRATTTGTADGDAASRWTGTVRIAGFYASICNAVWDGNVSLAGKDELTG
KATLILVETSCPGKVVAGELVLKGNVGSDSLAITWAHPELPQRAFDLGAG
QGTIRRSGDRAEGTFNSDMGGGTEFFLTWSLTMRN
>MT1084 hypothetical protein
MGDRVLTVRSSPTAVTGKGIVESTTKTKRDRHVPVPEPVWRRLHAELPTD
PNALVFPGRKGGFLPLGEYRWAFDNAGDQVGIEGWYRTVWGTPRPRWRSA
QALTSRSCNGSLDTQQRR
>MT4035 conserved hypothetical protein
MSARITALRLEAFEQLPKHARRCVFWEVDPAILGKDDHLADPEFEKEAWL
SMVMLEWGSCGQVATAVPDERSHAEPPCLGYVLYAPPSAVPRAQRFPTAP
VSADAVLLTSMGIERGQADDDLPHSLIARVIEELVRRGVRALEAFGRTPA
ATDLQNPGAVTPDVRPVLEALGDCCVEHCIIDANFLMDVGFVVVAPHPYF
PRLRLELDKGLGWKAEVEAALERLLENARLQEPIAAGSTAGNTS
>MT2767 conserved hypothetical protein
MNANRTSAQRLLAQAGGVSGLVYSSLPVVTFVVASSAAGLLPAIGFALSM
AGLILLWRLLRRESARPVVAGFCGVAVCALIAYLVGQSKGYFLLGIWMSL
LWAVVFTLSILIRRPIVGYLWSWLSGRDRAWRDVSRAVFAFDVATLGWTL
VFAARFIVQRHLYDADKTGWLGVARIGMGWPLTALAALATYAAIKAAQRA
ILASHDAAAVGGAAEFDADAGRE
>MT1342 hypothetical protein
MIPVRRLRYARRATDQSIGWFPLHQPGIEVPQ
>MT1122 cellulase-related protein
MGTNLPTEVGQILSAPTSIDYNYPTTGVWDASYDICLDSTPKTTGVNQQE
IMIWFNHQGSIQPVGSPVGNTTIEGKNFVVWDGSNGMNNAMAYVATEPIE
VWSFDVMSFVDHTATMEPITDSWYLTSIRAGLEPWSDGVGLGVDSFSAKV
N
>MT1096.2 hypothetical protein
MNSLSGIARAIRLRFRGEIWVGGDAPCFDDQFGDLKCQMC
>MT3737 conserved hypothetical protein
MPAPRMPRVALVAVLLITVQLVVRVVLAFGGYFYWDDLILVGRAGTGGLL
SPSYLFDDHDGHVMPGAFLVAGAIIRVAPLVWTGPAISLVVLQLLESLAL
LRALYVISSWRPVLLIPLTFALFTPLAVPGFAWWAAALNSLPMLAALAWV
CADAILLVRTGNHRYAVTGVLVYLGGLLFFEKAAVIPFVSFAVAALQCHV
RGDRSALATVWRAGVRLWTPSLALTVGWVALYLAVVDQRRWSSDLSMTWD
LLCRSVTHGIVPALAGGPWDWARWAPASPWATPPAVVMVLGWLVLIAVLA
LSLVRKRRIGPVWLTAAGYAVACQVPIFLMRSSPFTALELAQTLRYFPDL
VVVLALLAAVALQAPNRAGTRWLDASPARAVATVASAVLFLTSSLYSTAT
FLASWRDNPTEGYLKNAQASLAAAASGAPLLDQEVDPLVLQRVAWPENLA
SHMFALLRVRPEFATTTTQLRMFTSTGRLVDAKVTWVRTIIAGPVPQCGY
FVQPDRPERLILDGPLLPGDWTVELNYLANSDGSMALALSDGPERKVPVH
PGLNRVYARLPGAGDAITVRANTTALSLCIGAAPVGFLAPA
>MT0525 hypothetical protein
MSCVPLVVSPCDVIERTLLSPLDRCNRRHRCIGIGFVVGRFSQHGDDTGD
HRLSFGVGNFASDLGEHRAVGFDDGVKIGVEVQLVVGQDRPVEAELLVAM
KDPGDVDRDIELGEDLQLHAPAGDRQEGQRGYQRGVTGRCGIRLAVVGRV
VVFDRDRELADLLAPHQKVVRRPIMLADQCLGFFGNCHAAAALRANSC
>MT0726 hypothetical protein
MALAKLAQSADRIAARQRMQRSLAQLVATCELPAPTGTAPIERRISESFA
ASRSTRRSPIAQSMTRPSPPRPGSPTLPVDGEDPFAEAVLTASVSCRLYP
TDTGGTPADRCTPATGGEFPTSALIASRGAAYPATQPNGPSIRTILRALA
IHRRPSANSKLGAVTTEFDTSAPRILDQLHASDVIRLDASARCILSPQHR
IRIVFRSPMVPRFSG
>MT2721 hypothetical protein
MPRSVLRRYVRSRLGSDFPRVRRLGTSGSHLYWLETCRVAALACVERLVK
KLGPVRPYGFLLICPVRSLDKAAARDRVRRYRERLRQRGLRPIQIWVPDV
NAPEFVGEAHRPSALVAAREYEDDDQAFVDAVSVDWDDAT
>MT2895 hypothetical protein
MISGVTFLATLRRVGRSSAKRVLSLAVAPHRRQPVQGT
>MT0763 hypothetical protein
MDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQ
VGRWAASPIEPPARPDGLVAAHQAAAAVAHEIFAAPGGMSATFKLPLGEV
PGQVFIGLRTTDVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQF
RGPGKPFADEKPCPRERPPADQLAAFLGRTVR
>MT1025.2 hypothetical protein
MSTKYYLQKVPVEAVQPGFSLAIPHDGDYRLFQVDCTQMCQRSGQPVMIR
LMSESVDGGQPWVLEYEAGTAVIRLLGVCQAAS
>MT3997 conserved hypothetical protein
MRNPLGLRFSTGHALLASALAPPCIIAFLETRYWWAGIALASLGVIVATV
TFYGRRITGWVAAVYAWLRRRRRPPDSSSEPVVGATVKPGDHVAVRWQGE
FLVAVIELIPRPFTPTVIVDGQAHTDDMLDTGLVEELLSVHCPDLEADIV
SAGYRVGNTAAPDVVSLYQQVIGTDPAPANRRTWIVLRADPERTRKSAQR
RDEGVAGLARYLVASATRIADRLASHGVDAVCGRSFDDYDHATDIGFVRE
KWSMIKGRDAYTAAYAAPGGPDVWWSARADHTITRVRVAPGMAPQSTVLL
TTADKPKTPRGFARLFGGQRPALQGQHLVANRHCQLPIGSAGVLVGETVN
RCPVYMPFDDVDIALNLGDAQTFTQFVVRAAAAGAMVTVGPQFEEFARLI
GAHIGQEVKVAWPNATTYLGPHPGIDRVILRHNVIGTPRHRQLPIRRVSP
PEESRYQMALPK
>MT3050 hypothetical protein
MNRRTLLWLSAIAALALVVAYQTLGSSAGRHADEFAARAGVPTVQPGADV
LAGIAVLPKRIHRYDYRRSAFGHPWDDRNDAPGGHNGCDTRDDILDRDLV
DKTYVSIKRCPNAVATGTLRDPYTNTTVAFQRGASVGQSVQIDHIVPLSY
AWDMGAYRWPNSERMRFANDPANLLAVQGQANQDKGDSPPAQWMPPNKAF
ACQYAMQFIAVLRGYSLPVDQPSSDVLRQAAATCPTG
>MT2173 hypothetical protein
MSLSVRRPPAARAAAIVEAESWFLKRGLPSVLTMRGRCRRLWPRSAPMLA
AWAVVEGCLMAVFFVTDGGEVFISATPTTAQWVILALLAVALPLASLVGW
LVSQISSGRGQAAVATMAVAFAAASDVIESGPIQLLRTAVVVGLVLLQTG
CGVGSVLGWAVRMTLEHLATVGTLAVRALPIVLLTALVFFNTYVWLMAAN
INGERLTLAMVFLLAIAGAFVVSKTVERVRPLLRSTTVMPQGSQSLAGTP
FATMGDPSPGFPLTRAERLNVVFLLAASQLVEILVVASVGAAIYLVLGMI
ILTPPLLREWTHYDSMTTTVLGMTFPAPDSLIRMCLFLGALTFMYISARA
VDDAEYRAMFLDPLIDDLHTALLARNRYRNNVVTAPCAGVDAGHVDD
>MT3106.1 PE family protein
MSRQASRQVSIIRSAGDGNRSCGCVTPKEGVWVVTLRVVPEGLAAASAAV
EALTARLAAAHAGAAPAITAVVAPAADPVSLQSAVGFSALGSEHAAIAGE
GVEELGRSGVAVGESGIGYAAGDAVAAATYLVSGGSL
>MT0534 conserved hypothetical protein
MTPTGDTKPKLLFYEPGASWYWVLTGPLAAVSVLLLEISSGAGVGLITPA
IFLVMVSAFVALQVKAARIHTSVELTHDALRQGTETIRLAEIVKIYPEAD
GRETSGEEPAKWQSARTLGELVGVPRGRVGIGLKLTGGRTAQAWARRHQQ
LRAALTPLVQERLGPVDSDVADVNGDDAGPAR
>MT3277 hypothetical protein
MKLADAIATAPRRTLKGTYWHQGPTRHPVTSCADPARGPGRYHRTGEPGV
WYASNKEQGAWAELFRHFVDDGVDPFEVRRRVGRVAVTLQVLDLTDERTR
SHLGVDETDLLSDDYTTTQAIAAARDANFDAVLAPAAALPGCQTLAVFVH
ALPNIEPERSEVRQPPPRLANLLPLIRPHEHMPDSVRRLLATLTRAGAEA
IRRRRR
>MT1846 hypothetical protein
MKAQRSFGLALSWPRVTAVFLVDVLILAVASHCPDSWQADHHVAWWVGVG
VAAVVTLLSVVSYHGITVISGLATWVRDWSADPGTTLGAGCTPAIDHQRR
FGRDTVGVREYNGRLVSVIEVTCGESGPSGRHWHRKSPVPMLPVVAVADG
LRQFDIHLDGIDIVSVLVRGGVDAAKASASLQEWEPQGWKSEERAGDRTV
ADRRRTWLVLRMNPQRNVAAVACRDSLASTLVAATERLVQDLDGQSCAAR
PVTADELTEVDSAVLADLEPTWSRPGWRHLKHFNGYATSFWVTPSDITSE
TLDELCLPDSPEVGTTVVTVRLTTRVGSPALSAWVRYHSDTRLPKEVAAG
LNRLTGRQLAAVRASLPAPTHRPLLVIPSRNLRDHDELVLPVGQELEHAT
SSFVGQ
>MT2557 conserved hypothetical protein
MAESGESPRLSDELGPVDYLMHRGEANPRTRSGIMALELLDGTPDWDRFR
TRFENASRRVLRLRQKVVVPTLPTAAPRWVVDPDFNLDFHVRRVRVSGPA
TLREVLDLAEVILQSPLDISRPLWTATLVEGMADGRAAMLLHVSHAVTDG
VGGVEMFAQIYDLERDPPPRSTPPQPIPEDLSPNDLMRRGINHLPIAVVG
GVLDALSGAVSMAGRAVLEPVSTVSGILGYARSGIRVLNRAAEPSPLLRR
RSLTTRTEAIDIRLADLHKAAKAGGGSINDAYLAGLCGALRRYHEALGVP
ISTLPMAVPVNLRAEGDAAGGNQFTGVNLAAPVGTIDPVARMKKIRAQMT
QRRDEPAMNIIGSIAPVLSVLPTAVLEGITGSVIGSDVQASNVPVYPGDT
YLAGAKILRQYGIGPLPGVAMMVVLISRGGWCTVTVRYDRASVRNDELFA
QCLQAGFDEILALAGGPAPRVLPASFDTQGAGSVPRSVSGS
>MT2006 hypothetical protein
MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYD
LEFEPAVDADPATISAFVVRISCHLRIQNQAADNDVKEGDTKDETQDVAT
ADFEFAALFDYHLQEGEDDPTEEELTAYAATTGRFALYPYIREYVYDLTG
RLALPPLTLEILSRPMPVSPGAQWPATRGTP
>MT1096.1 PE_PGRS family protein
MSFVLVSPSQLMAAAADVAGIGSAISAANAAALAPTSVLAAAGADEVSAA
VAALFSAHAGQYQQLGARAALFHEQFVQALTGAASAYASAEATNVEQQVL
GLINAPTQALWGRPLIGNGADGTAANPNGGAGGLLYGNGGNGFSQTTAGL
TGGTGGSAGLIGNGGNGGAGGAGANGGAGGNGGWLYGSGGNGGAGGAGPA
GAIGAPGVAGGAGGAGGTAGLFGNGGVGGVGGDGGQGGNGAGAGASGTKG
GDAGAGGAGGAGGWIHGHGGAGGDGGAGGAGGQASPGAPGPPSQPGGAGG
AGGAGGRGGDGGSAGWLSGNGGDAGNGGGGGTAGGAGNGGQFGGDGGTGG
TGGTAGAGGNGGRGAVLFGHGGNAGHGGAGGNGAAAGAGGEHVVATAGKG
GTGGVGGDGGGGGAGGGGGLLYGNGGAGGAGNSGGDGGTGLNAALGGNGG
GGGVGGNAGAGGTGGSAGWLSGNGGAGGSGGSAGAGGAGGKGGDTPNGLA
INPGIGGNGGDTGNAGNGGNGGSAARLFGGGGAGGAGGTGSTAGSGGSGG
TNPPTGLQAAGGNGGSGHAGGHGGNGGGAGLLGGGGTGGNGGGGGQGGLG
AAAGGVDGNGGNGGNGGKGGDAQLVGDGGNGGNGGKGGAGLIAGLDGAGG
AGGTRGLIFGNAGTPGQ
>MT0828 conserved hypothetical protein
MSARDRVDPAKTRQVVLALADWLRDETLPAPDTDVLAAAVRLTARTLAAL
APGASVEVRIPPFAAVQCISGPRHTRGTPPNVVQTDPRTWLLVATGLSGV
AQARGSGALQLSGSRAGEIEAWLPLVDLG
>MT3220.2 hypothetical protein
MPHWSTSISRPAPERSPLGAARDHGGSHCRLPHAHTEVRIGLGAQSTLGA
CGVTRQPDRSGIRGAIVVETASASNTAGTPP
>MT0237 hypothetical protein
MLRFAACGAIGLGAALLIAALLLSTYTTSRIAEIPLDIDATLISDGTGTA
LDSASLATEHIVVNQDVPLVSQQQVTVESPANADVVTLQVGSSLRRTDKQ
KDSGLLLAIVDTVTLNRKTAMAVSDDTHTGGAVQKPRGLNDENPPTAIPL
RHDGLSYRFPFHTEKKTYPYFDPIAQKAFDANYEGEEDVNGLTTYRFTQN
VGYTPEGKLVAPLKYPSLYAGDEDGKVTTSAAMWGLPGDPNEQITMTRYY
AAQRTFWVDPVSGTIVKETERANHYFARDPLKPEVTFADYQVTSTEETVE
SQVNAARDERDRLALWSRVLPITFTAAGLVALVGGGLFASFSLRTEGALM
AASGDRDDHDYRRGGFEEPVPGAEAETEKLPTQRPDFPREPSGSDPPRLG
SAQPPPPPDAGHPDPGPPERR
>MT0677 hypothetical protein
MVGPGSGRLRGSVGLFAGGEGERDMSGRSRLPGSSSRRDAARIVAERVVA
TVAGVAVAVDEVDAAEARLRDGPRAAALPASGTSEGRQLRRWLTQLIVTE
RVVAAEAAARGLTAAGAPAEADLLPDATARLEIGSVAAAVLADPLARALF
AAVTARVAVTDDAVADYHARNPLRFAAPCPGQHGWRAPAAAAPPLDQVRR
AITEHLLGAARRRAFRVWLDARRNALVVLAPGYEHPGDPRQPDNTRRH
>MT2259 hypothetical protein
MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADD
AALPPAAYPGVPPQVSYPKRRHKRLLIGIVVALALVSAMTAAIIYGVRTN
GANTAGTFSEGPAKTAIQGYLNALENRDVDTIVRNALCGIHDGVRDKRSD
QALAKLSSDAFRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTPAAGGPP
RGQVQGIAQLLFQRGQVLVCSYVLRTAGSY
>MT1923 hypothetical protein
MEKVIAVLMRPEPDDDWCARQRAQVADALLGLGVAGLSINVRDSTVRDSL
MTLTTLYPPVAAVVSLWTQQCYGEQVAAALRLLAQECDELGAYLVTESVP
LTFPSLVESGSRTPGLANIALLRRPDGLDQATWLTRWQRDHTQVAIEAQA
TFGYTQNWVVRALTPEAPGIAGIVEELFPVAATTDLKAFFGAADDNDLRN
RISRMVASTSAFGANQNIDTVPSSRYVFRTPFKD
>MT3573.15 hypothetical protein
MTTTPARFNHLVTVTDLETGDRAVCDRDQVAETIRAWFPDAPLEVREALV
RLQAALNRHEHTGELEAFLRISVEHADAAGGDECGPAILAGRSGPEQAAI
NRQLGLAGDDEPDGDDTPPWSRMIGLGGGSPAEDER
>MT1033 hypothetical protein
MVCRVRLSSGHRRVSISCRVREGFVMRLAIVGTAAAAAIGGTLAVAPLTL
STPERVAGGTCSAGQQCDLLAAVLMPDTATPSGPAAAEHAVPAPFEPVAD
TIAPGLVPRPGVPAAAAVPRVGPPAVPGLPNIPGAAGPALPPPPALPNLA
APSVPGVGIPGIGIPGIGIPGIGIPGIGIPGVPDPITGVNTAAAVVNGVL
GVGGTAAGVVTASAVAVTYLVLAVNALESSGILPTARGTASTVASLLLPG
AQSAAAALPAVGLPALPGVTPASLLAMAAAAGLPGVGFPSLPGVSPTDLM
AMAAAAGLPTSLPGLAGMSPAELTALVAGGLPMLAAAGLPAGLAGVDPAT
LAAALPALAAGGLPPGLPALPGVDPAALAAALPALAAGLPALPAGLPPLP
AVPALPAPPPLPGPPPLPALPSRLCTPGFGPIGVCIP
>MT0041 hypothetical protein
MADPGPFVADLRAESDDLDALVAHLPADRWADPTPAPGWTIAHQIGHLLW
TDRVALTAVTDEAGFAELMTAAAANPAGFVDDAATELAAVSPAELLTDWR
VTRGRLHEELLAVPDGRKLAWFGPPMSAASMATARLMETWAHGLDVADAL
GVIRPATQRLRSIAHLGVRTRDYAFIVNNLTPPAEPFLVELRGPSGDTWS
WGPSDAAQRVTGSAEDFCFLVTQRRALSTLDVNAVGEDAQRWLTIAQAFA
GPPGRGR
>MT0471 hypothetical protein
MGACSETDYEPCRLPASRRPKSATVTTMSRLSSILRAGAAFLVLGIAAAT
FPQSAAADSTEDFPIPRRMIATTCDAEQYLAAVRDTSPVYYQRYMIDFNN
HANLQQATINKAHWFFSLSPAERRDYSEHFYNGDPLTFAWVNHMKIFFNN
KGVVAKGTEVCNGYPAGDMSVWNWA
>MT3268 hypothetical protein
MSSAAVTRPPGQRKRTFNVAWPKVFRPPVIAATVTTPVGVAGSSDLIRPL
YSMSAPSVA
>MT2231 conserved hypothetical protein
MQNGSLTRMPGRAPGSTLARVGSIPAGDDVLDPDEPTYDLPRVAELLGVP
VSKVAQQLREGHLVAVRRAGGVVIPQVFFTNSGQVVKSLPGLLTILHDGG
YRDTEIMRWLFTPDPSLTITRDGSRDAVSNARPVDALHAHQAREVVRRAQ
AMAY
>MT2203 hypothetical protein
MLIIALVLALIGLLALVFAVVTSNQLVAWVCIGASVLGVALLIVDALRER
QQGGADEADGAGETGVAEEADVDYPEEAPEESQAVDAGVIGSEEPSEEAS
EATEESAVSADRSDDSAK
>MT1121 hypothetical protein
MDSITVGYQAAQTGGYSPPTNLLINGQAVTIDQTPITSSPTTPPPTTPPE
IPTGGTVIST
>MT1150 hypothetical protein
MQSGPHLVGRVGTSFPLIARHQGATRDDAGDTGQPDPLPHVAHPDRLYPP
MVHGVDPSTLALDRALNETRTGDLWLFRGRSRPDRAIQTLTNAPVNHVGM
TVAIDDLPPLIWHAELGDKLLDVWTGTNHRGVQLNDARQVVQQWAGRYRQ
RCWLRQLTPHANRDQEDKLLRVIARMNGTPFPTTARLTGRWLRGRLPTLN
DWLRGIPVLDRKVREQTQRRKQQQRTMGLATAYCAETVAITYEEMGLLVT
DKDAHWFDPGKFWSGDSLPLAPGYRLGHEIAVDVGG
>MT1474 PE family protein
MSFVFAVPEMVAATASDLASLGAALSEATAAAAIPTTQVLAAAADEVSAA
IAELFGAHGQEFQALSAQASAFHDRFVRALSAAAGWYVDAEAANAALVDT
AATGASELGSGGRTALILGSTGTPRPPFDYMQQVYDRYIAPHYLGYAFSG
LYTPAQFQPWTGIPSLTYDQSVAEGAGYLHTAIMQQVAAGNDVVVLGFSQ
GASVATLEMRHLASLPAGVAPSPDQLSFVLLGNPNNPNGGILARFPGLYL
QSLGLTFNGATPDTDYATTIYTTQYDGFADFPKYPLNILADVNALLGIYY
SHSLYYGLTPEQVASGIVLPVSSPDTNTTYILLPNEDLPLLQPLRGIVPE
PLLDLIEPDLRAIIELGYDRTGYADVPTPAALFPVHIDPIAVPPQIGAAI
GGPLTALDGLLDTVINDQLNPVVTSGIYQAGAELSVAAAGYGAPAGVTNA
IFIGQQVLPILVEGPGALVTADTHYLVDAIQDLAAGDLSGFNQNLQLIPA
TNIALLVFAAGIPAVAAVAILTGQDFPV
>MT1408 hypothetical protein
MQTGKTPKTHDDYDDYEAADQEAARSASWRRRLRVRLPRLSTIAMAAAVV
IICGFTGLSGYIVWQHHEATERQQRAAAFAAGAKQGVINMTSLDFNKAKE
DVARVIDSSTGEFRDDFQQRAADFTKVVEQSKVVTEGTVNATAVESMNEH
SAVVLVAATSRVTNSAGAKDEPRAWRLKVTVTEEGGQYKMSKVEFVP
>MT0184 hypothetical protein
MKAADSAESDAGELGEDACPEQALVERRPSRLRRGWLVGIAATLLALAGG
LGAAGYFALRSHQESQSIAREDLAAIEAAKDCVAATQAPDAGAMSASMQK
IIECGTGDFGAQASLYTSMLVEAYQAASVHVQVTDMRAAVERNNNDGSVD
VLVALRVKVSNTDSDAHEVGYRLRVRMALDEGRYKIAKLDQVTK
>MT3436 hypothetical protein
MRYKPRGAGSPRQSDRVAGATVNVDRVLTVRAWAL
>MT2176 conserved hypothetical protein
MRRNIRVTLGAATIVAALGLSGCSHPEFKRSSPPAPSLPPVTSSPLEAAP
ITPLPAPEALIDVLSRLADPAVPGTNKVQLIEGATPENAAALDRFTTALR
DGSYLPMTFAANDIAWSDNKPSDVMATVVVTTAHPDNREFTFPMEFVSFK
GGWQLSRQTAEMLLAMGNSPDSTPSATSPAPAPSPTPPG
>MT1074 hypothetical protein
MCAKPYLIDTIAHMAIWDRLVEVAAEQHGYVTTRDARDIGVDPVQLRLLA
GRGRLERVGRGVYRVPVLPRGEHDDLAAAVSWTLGRGVISHESALALHAL
ADVNPSRIHLTVPRNNHPRAAGGELYRVHRRDLQAAHVTSVDGIPVTTVA
RTIKDCVKTGTDPYQLRAAIERAEAEGTLRRGSAAELRAALDETTAGLRA
RPKRASA
>MT3919 hypothetical protein
MTTNDTLFLSTGLRRPGDQPGKWSLMAQGVYAPAIPRRKSQNVKSAHT
>MT2367 hypothetical protein
MLEVDKVTHVVDENLLRLGVALSPSEKTRPGLAARPSTTCYRKASSTPTG
SPSSGVGWVVISNDRHLRTRPVEAELAVAHKLKVVHLHGRVGGLVRVGTA
DAAGCAVAGH
>MT1943 conserved hypothetical protein
MSFNPKDAVDAVRDIAANAVEKASDIVENAGHIIRGDIAGGASGIVKDSI
DIATHAVDRTKEVFTGKTDDEG
>MT3783 WhiB-related protein
MSGTRPAARRTNLTAAQNVVRSVDAEERIAWVSKALCRTTDPDELFVRGA
AQRKAAVICRHCPVMQECAADALDNKVEFGVWGGMTERQRRALLKQHPEV
VSWSDYLEKRKRRTGTAG
>MT1097 PE_PGRS family protein
MREMREMSYMIAVPDMLSSAAGDLASIGSSINASTRAAAAATTRLLPAAA
DEVSAHIAALFSGHGEGYQAIARQMAAFHDQFTLALTSSAGAYASAEATN
VEQQVLGLINAPTQALLGRPLIGNGADGTAANPNGGAGGLLYGNGGNGFS
QTTAGLTGGTGGSAGLIGNGGNGGAGGAGANGGAGGNGGWLYGSGGNGGA
GGAGPAGAIGAPGVAGGAGGAGGSAGLFGNGGAGGAGGAGGQGGAGIGGA
DGTKGGDAGAGGAGGAGGWIHGHGGVGGDGGTGGQGGDGVQGEPGDTGAA
GGAGGAGGRGGDGGSAGWLSGNGGDAGTGGGGGNAGNGGNGGSAGWLSGN
GGTGGGGGTAGAGGQGGNGNSGIDPGNGGQGADTGNAGNGGHGGSAAKLF
GDGGAGGAGGMGSTGGTGGGGGFGGGTGGNGGNGHAGGAGGSGGTAGLLG
SGGSGGTGGDGGNGGLGAGSGAKGNGGNGGDGGKGGDAQLIGNGGNGGNG
GKGGTGLMPGINGTGGAGGSRGQISGNPGTPGQ
>MT3717 conserved hypothetical protein
MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQ
FNDTLNVYLTAHNALGSSLHTAGVDLAKSLRIAAKIYSEADEAWRKAIDG
LFT
>MT2145 hypothetical protein
MTSIESHPEQYWAAAGRPGPVPLALGPVHPGGPTLIDLLMALFGLSTNAD
LGGANADIEGDDTDRRAHAADAARKFSANEANAAEQMQGVGAQGMAQMAS
GIGGALSGALGGVMGPLTQLPQQAMQAGQGAMQPLMSAMQQAQGADGLAA
VDGARLLDSIGGEPGLGSGAGGGDVGGGGAGGTTPTGYLGPPPVPTSSPP
TTPAGAPTKSATMPPPGGASPASAHMGAAGMPMVPPGAMGARGEGSGQEK
PVEKRVTAPAVPNGQPVKGRLTVPPSAPTTKPTDGKPVVRRRILLPEHKD
FGRIAPDEKTDAGE
>MT1193 hypothetical protein
MRRLTNTEHRENTTVASTWSVCKGLAAVVITFGGPFALCPNAAADPATPQ
PNPTQQLPGLPALAQLSPIIQQAAMNPAQATQLLMAAASAFAGNPAVPTE
SKNVASSVNQFVAEPTNPDSAALGVPAPHGVALPEAIPVPHVPPLGAEPG
VQAHLPTGIDPSHAAGPAPAVAPTVTPPVAAPPASAPAPAPDAAQPVAVP
GPPPAPRAPRAAAPAPASAAPAPAAAPAPASGFGADAPPTQDFMYPSIGP
NCVADGSNSIATALSVAGPAKIPLPGPGPGQTAYVFTAVGTPGPADVQRL
PLNVTWVNLTTGKSGSATLRPRSDINPDGPTTLTVIADTGSGSIMSTIFG
QVTTKDRQCQFMPTIGSTVVP
>MT1919 hypothetical protein
MPPRIAGMRLLVIKPEPLARRLLKLAGTTYAAEAGIRIRDKPMPLFQLLV
LCMLASKPIGAATAARAARELFCSGLRTPKAVLSAERQTMISAFGRAHYV
RYDESSATRLTAIAHRVRDEYSGDLRELAQRTRPDVSAAKRMLKTFNGIG
DTGADIFLREVQDVWIWVRPYFDDRATAAAKQLGLPTDPKKLASVAPSSN
ALLAAALVRVA
>MT2255 conserved hypothetical transmembrane protein
MHIEARLFEFVAAFFVVTAVLYGVLTSMFATGGVEWAGTTALALTGGMAL
IVATFFRFVARRLDSRPEDYEGAEISDGAGELGFFSPHSWWPIMVALSGS
VAAVGIALWLPWLIAAGVAFILASAAGLVFEYYVGPEKH
>MT1805 serine esterase, cutinase family
MPGRFREDFIDALRSKIGEKSMGVYGVDYPATTDFPTAMAGIYDAGTHVE
QTAANCPQSKLVLGGFSQGAAVMGFVTAAAIPDGAPLDAPRPMPPEVADH
VAAVTLFGMPSVAFMHSIGAPPIVIGPLYAEKTIQLCAPGDPVCSSGGNW
AAHNGYADDGMVEQAAVFAAGRLG
>MT3994 hypothetical protein
MTTRLIAQAVHHARGFNGLLRIPARGGSSGRAGVKCRRPTGR
>MT0941 PE family protein
MSFVTIQPVVLAAATGDLPTIGTAVSARNTAVCAPTTGVLPPAANDVSVL
TAARFTAHTKHYRVVSKPAALVHGMFVALPAATADAYATTEAVNVVATG
>MT3653.1 hypothetical protein
MTATMPGSGVVEVIGIALLRVRYLSKHLLGTLAQ
>MT1003.1 conserved hypothetical protein
MRIGNCSGFYGDRLSAMREMLTGGELDYLTGDYLAELTMLILGRDRMKNP
DRGYAKTFLAQLEDCLGLAHDRGVRIVTNAGGLNPAGLANAVRALAARLG
IPAQVAHVEGDDLQPRAAELGLGTPLTANAYLGAWGIVDCFERGADVVVT
GRVTDASVVVGAAAAHFGWGRTDYHRLAGAVVAGHVIECGVQATGGNYAF
FTEIGDLTHAGFPLAEIAADGSSVITKHHGTGGLVSVDTITAQLLYEITG
ARYANPDVTARMDSVELSPDGPDRVRISGVIGEPPPPTYKVSLNSIGGFR
NAMTFVLTGLDIDAKADLVRRQLEAALTVKPAELQWTLARTDHPDADTEE
TASALLTCVARDPDPANVGRQFSSAAVELALASYPGFTATAPPGDGQVYG
VFTPGYVDAGKVAHIAVHADGTRTEIPCATETLELAPAHPPALPDPLPAG
PTRRVPLGLIAGARSGDKGGSANVGVWVRTDEQWRWLAHTLTVELLKELL
PETAGLVVTRHVLPNLRALNFVIEAILGQGVAYQARFDPQAKGLGEWLRS
RHVEIPETLL
>MT3889 conserved hypothetical protein
MVIGLSTGSDDDDVEVIGGVDPRLIAVQENDSDESSLTDLVEQPAKVMRI
GTMIKQLLEEVRAAPLDEASRNRLRDIHATSIRELEDGLAPELREELDRL
TLPFNEDAVPSDAELRIAQAQLVGWLEGLFHGIQTALFAQQMAARAQLQQ
MRQGALPPGVGKSGQHGHGTGQYL
>MT2141 hypothetical protein
MFSMPHSTADRRLRLTRQALLAAAVVPLLAGCALVMHKPHSAGSSNHWDD
SAHPLTDDQAMAQVVEPAKQIVAAAXLQAVRAGFSFTSCNDQGDPPYQGT
VRMAFLLQGDHDAYFQHVRAAMLSHGWIDGPPPGQYFHGITLHKNGVTAN
MSLALDHSYGEMILDGECRNTTDHHHDDETTNITNQLVQP
>MT3876.1 hypothetical protein
MNRTMMGCADIGWRYGLEVADPVRARGRHPSDPVEDRALITWR
>MT3962 hypothetical protein
MQHKEIRRSLFGRRIAKYGWPNLSDILVHTKYDR
>MT0717.1 hypothetical protein
MAAWRDTSWLTTISRRHVPMPVTELPRTTELPTGIFAGSGCPTLVAARPS
QAWSPRARRSPQPAAPRDKQVNPSTEPSSMLPTTEARHSRS
>MT2406 lipoprotein, putative
MPVGGRQHVFEKLASILGLVAAPLMLLGLSACGRSAGKTSEPTCPTEPID
AADSSTTPDPSCVVRATEINGNGSRIQTWTGSYDAAATQSGGVCGGTCNF
HATVRFTVDEGQISGSVDQVYQAAMVAIATRPTSPSLAP
>MT2733 hypothetical protein
MTAVGGSPPTRRCPATEDRAPATVATPSSTDPTASRAVSWWSVHEYVAPT
LAAAVEWPMAGTPAWCDLDDTDPVKWAAICDAARHWALRVETCQAASAEA
SRDVSAAADWPAVSREIQRRRDAYIRRVVV
>MT1329 hypothetical protein
MASFRPLTPSDRCNTWPEVLALHGLSEGVSGSGGSGGRWGAGEVLEGARI
GVIADGVSCFPTKADCRRIRGVPVFDGYTRMVARLMGSLAVLRSVSIPKG
YRDFGFGSLRAVAPKNCPDVSG
>MT3715 hypothetical protein
MVAVLTYARQLGFCRSTPPTIPHSRNQLVNKTAGQAAVAESWADRVSPGA
VTHATGAMCPTLGAHQFEPNQVRCTACLTRTLSCRIFRRRRELPVVGLAS
GDPLHPALG
>MT3580.2 hypothetical protein
MLGSIWPDQSSPTLGVRARRAIRSLDYQSILSSRADARAK
>MT3801 hypothetical protein
MRTISPFLRCRHETCCISNVGEEVTRTTYSREHQREYRRKVRLCLDVFET
MLAQTRFEADRPLTGMEIECNLVDADYQPAMSNRYVLDAIADPAYQTELG
AYNIEFNVPPRPLPGRTCLELEDEVRASLNDAETKASCSGAHIVMIGILP
TLMPEHLTDGWMSASARYAALNESIFKARGEDIPINIAGPEPLSCHAGSI
APESACTSVQLHLQLAPADFPANWNAAQVLAGPQLALGANSPYFFGHQLW
SETRIELFTQSTDARPEELKSRGVRPRVWFGERWITSVLDLFQENIRYFP
TLLPEVSDEDPLAELSAGRIPHLSELRLHNGTVYRWNRPVYDVVDGRPHL
RLENRVLPAGPTVVDMLANHAFYYGALRGLSEADPPLWTQMNFAAAQANF
LAAARYGMDAQLDWPGLGEVTTRELVLGTLLPMAHEGLRRWGVDAEVRDR
FLGVIGGRAQTGRNGARWQVATVAALQDGGLTRPAALAEMLRRYCEHMHS
NEPVHTWDT
>MT0994 hypothetical protein
MGYPRSRTWQHLQFRQHLLKQILDVVGSWTMSNSAQRDARNSRDESARAS
DTDRIQIAQLLAYAAEQGRLQLTDYEDRLARAYAATTYQELDRLRADLPG
AAIGPRRGGECNPAPSTLLLALLGGFERRGRWNVPKKLTTFTLWGSGVLD
LRYADFTSTEVDIRAYSIMGAQTILLPPEVNVEIHGHRVMGGFDRKVVGE
GTRGAPTVRIRGFSLWGDVGIKRKPRKPRK
>MT2544 hypothetical protein
MSAMEIHLFFVGIPLLLVVVLSVLIWSRKGPHPATYKLSEPWTHPPILWA
ATDEVVGSAHGGHGHDASEFTVGGGASGTW
>MT2713 hypothetical protein
MVAADHRALGSNKSYPASQTAEAIWPPARTLRYDRQSPWLATGFDRRMSQ
TVTGVGVQNCAVSKRRCSAVDHSSRTPYRR
>MT0911 hypothetical protein
MLRTVHAVNLLGAQGESMDYAKRIGQVGALAVVLGVGAAVTTHAIGSAAP
TDPSSSSTDSPVDACSPLGGSASSLAAIPGASVPQVGVRQVDPGSIPDDL
LNALIDFLAAVRNGLVPIIENRTPVANPQQVSVPEGGTVGPVRFDACDPD
GNRMTFAVRERGAPGGPQHGIVTVDQRTASFIYTADPGFVGTDTFSVNVS
DDTSLHVHGLAGYLGPFHGHDDVATVTVFVGNTPTDTISGDFSMLTYNIA
GLPFPLSSAILPRFFYTKEIGKRLNAYYVANVQEDFAYHQFLIKKSKMPS
QTPPEPPTLLWPIGVPFSDGLNTLSEFKVQRLDRQTWYECTSDNCLTLKG
FTYSQMRLPGGDTVDVYNLHTNTGGGPTTNANLAQVANYIQQNSAGRAVI
VTGDFNARYSDDQSALLQFAQVNGLTDAWVQVEHGPTTPPFAPTCMVGNE
CELLDKIFYRSGQGVTLQAVSYGNEAPKFFNSKGEPLSDHSPAVVGFHYV
ADNVAVR
>MT3212 conserved hypothetical protein
MDIQVLKNAVLLACRAPSVHNSQPWRWVAESGSEHTTVHLFVNRHRTVPA
TDHSGRQAIISCGAVLDHLRIAMTAAHWQANITRFPQPNQPDQLATVEFS
PIDHVTAGQRNRAQAILQRRTDRLPFDSPMYWHLFEPALRDAVDKDVAML
DVVSDDQRTRLVVASQLSEVLRRDDPYYHAELEWWTSPFVLAHGVPPDTL
ASDAERLRVDLGRDFPVRSYQNRRAELADDRSKVLVLSTPSDTRADALRC
GEVLSTILLECTMAGMATCTLTHLIESSDSRDIVRGLTRQRGEPQALIRV
GIAPPLAAVPAPTPRRPLDSVLQIRQTPEKGRNASDRNARETGWFSPP
>MT2147 IS1556 protein
MSDMCDVVSFVGAAERVLRARFRPSPESGPPVHARRCGWSLGISAETLRR
WAGQAEVDSGVVAGVSASRSGSVKTSELEQTIEILKVATSFFARKCDPRH
R
>MT0131 DNA-binding protein, CopG family
MTKKPRNPADYVIGDDVEVSDVDLKQEEVYVDGERLTDERVEQMASESLR
LAREREANLIPGGKSLSGGSAHSPAVQVVVSKATHAKLKELARSRKMSVS
KLLRPVLDEFVQRETGRILPRR
>MT0085 hypothetical protein
MNAVESTLRRVAKDLTGLRQRWALVGGFAVSARSEPRFTRDVDIVVAVAN
DDAAESLVRQLLTQQYHLLASVEQDAARRLAAVRLGATADTAANVVVDLL
FASCGIEPEIAEAAEEIEILPDLVAPVATTAHLIAMKLLARDDDRRPQDR
SDLRALVDAASPQDIQDARKAIELITLRGFHRDRDLAAEWTRLAAKW
>MT1189 hypothetical protein
MLATIKHDGRPQLSNVQYHFDPRKLLIQVSIAEPRAKTRNLRRDPRASIL
VDADDGWSYAVAEGTAQLTPPAAAPDDDTVEALIALYRNIAGEHSDWDDY
RQAMVTDRRVLLTLPISHVYGLPPGMR
>MT3718.2 hypothetical protein
MVDLLLLSANSPEFELDLAGNRGAVTVSCSHASCTGSAIRTSREARLVRP
ADQ
>MT3356 hypothetical protein
MRVSGASAALVHDSLSVVNVPRRCCRPGCPHYAVATLTFVYSDSTAVIGP
LATAREPHSWDLCVGHAGRITAPRGWELVRHAGPLPSHPDEDDLVALADA
VREGGPSAGRRHHPGGNGAPLHGFDDFPAAATGAPTGGGVLAPPEPGAGR
RRGHLRVLPDPAD
>MT3058.1 hypothetical protein
MPIASAALAPGAAGRHHRAKCRSQARRXRRARRVGTIGLSADRKRGASAG
RGGSAPSG
>MT0374 conserved hypothetical protein
MYTAENAPGVAVLLSGDADVPGPLTGLPTHQDNLDTVIGRYSRLIVVGAD
ADLGAVLTRLLRTDRLDVEVGYVPRRRSPATRAYRLPAGRRAARRARCGV
ARRVPLIRDETGSVIVGRAQWLPAEEQALIHGEAVVDDTVLFDGDVAGVC
IEPTLTLPGLRAAVDGAGKWRRWIGGRAAQLGTTGAAVLRDGVAAPRPVR
RSTFYRNVEGWLLVR
>MT0445 hypothetical protein
MSRRRIHTRSMSRRAAIGSRVQASRPTRGQDYRPTVPGVVQYSPKKRLRR
PSHSYTLRIRRKGPGESDMDSAMARAIRSGDDAEVADGLTRREHDILAFE
RQWWKFAGVKEEAIKELFSMSATRYYQVLNALVDRPEALAADPMLVKRLR
RLRASRQKARAARRLGFEVT
>MT1822 hypothetical protein
MLPHTTAKLALRRRMRKKLTPLPPSRRLGIDEAPAWRASLDRQMH
>MT1497.1 PE_PGRS family protein
MMSLVIVAPETVAAAALDVARIGSSIGAANAAAAGSTTSVLAAGADEVSA
AIATLFGSHAREYQAISTQVAAFHDRFAQTLSAAVGSYVSAEATNAAPLA
TLEHNVLNALNAPTQALLGRPLIGDGAAGAPGTGQAGGAGGILWGNGGAG
GSGAPGQVGGAGGAAGLFGTGGAGGAGGAGAXGGAGGSGGWLLGNGGVGG
AGGQSLLGGATGGAGGNAGLFGVGGTXGPGGPGGPGGVGGTGGAGGLGGT
LYGAGGHGGAGGPGPIGGVGGHGGVGGAAGLLGVGGHGGAGGHGAEGVAG
AAGEDLSPHGTSGGVGGDAGDGGTGGRGGWLAGAGGAGGAGGVGGTGGAG
GAGFSRALIVAGDNGGDGGNGGMGGAGGAGGPGGAGGLISLLGGQGAGGA
GGTGGAGGVGGDGGAGGPGNQAFNAGAGGAGGHGGDPGAGGAGGTGGAGS
TIGAHGAAGASPTSGGNGGAGGNGAHFSSGGKAGGNGGAGGAGGLVGNGG
AGGAGGNGAPGAPPSGGDPNGGGGGAGGAGGKGGDGGAQAGDGGAGGAGG
KGGNGGNGATGATGLNGLGAGADGTDGGKGGNGGAGGGGGAGGQGGKALA
ATHQDGSMGAGGAGGNGGAGGMGGDGGNGAKGTFDNGGDGVGGNGGNGGS
RGIGGAGGIGGAGSTAGADGARGATPTSGGNGGTGGNGANATVAGGAGGA
SGKGGNGGLVGNGGAGGKGGDGMAGVAGSSPTTAGESGTSGQNGGAGGAG
GAGGRGGDFGGDGGTGGAGGNGADGGAGGNGANGANATTPGAKGGDGGHG
GPGAQGGNGGQGGPGGLAGNLFGQNGIQGVGGSGGKGGAGGLAGDGGNGA
NGNFAFGDGNGGHGGNGGNPGAGGQGGSGGAGSTPGAKGAHGFTPTSGGD
GGDGGNGGNSQVVGGNGGDGGNGGNGGSAGTGGNGGRGGDGAFGGMSANA
TNPGENGPNGNPGGNGGAGGAGGAGLNGGNGGAGGNGGLGGFGGNGAAGA
NGVAVGAPGQPGGAGGHGGAGGNGGAGGNGGQGVVSDGAGGAGGAGGDGG
APGDGANGGNGQGAGAFAGGGGGRGGDGGNAGNAGAGGPGGTGSTAGKAG
PAGSIXHDGGNGGHGGHGAASGGNGGPGGHGGNGGNGGTGANGGNGGIGG
TGGAGSTGAKGVLGTNEGDGGDGGRGGNGGRGGNGGQGLTGAGGNGGTGG
TPGNGGNGGNGASGDLVTSPGDGGGGGRGGDAGRGGDAGLGGSSGPGGTP
GDWGTGGTGGTGGTGGQGANGGLTGGRGGTGGNGGNGNTGGTGGAGGTGG
TGHNGSQPGMGGNGGAGGFGGNGFAGVGGRGGMGGSGGTGGTGDAGPFGT
GTGGTGGHGGQGGGGGFSILLGLGGLGGLGSPGSIATGTAGGAGGGGGFG
GLGGGEFV
>MT0495 hypothetical protein
MCSGWRCPPVRPACIWSTCGPSFSRFRASRANGMKALVAVSAVAVVALLG
VSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSLRVYPSQV
GRTASRRLGMAAADAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGK
PSWNLEPWRPVVDDSEMLASGCNPGSPEESF
>MT2434 conserved hypothetical protein
MLARAAMARAEAGAGAAVRDVDGRTYAAAPVALSALELTGLQAAVAAAVS
SGATGLQAAVLVAGSVDDPGIAAVRELAPTAAIIVTDRAGNPL
>MT3718 conserved hypothetical protein
MSRAFIIDPTISAIDGLYDLLGIGIPNQGGILYSSLEYFEKALEELAAAF
PGDGWLGSAADKYAGKNRNHVNFFQELADLDRQLISLIHDQANAVQTTRD
ILEGAKKGLEFVRPVAVDLTYIPVVGHALSAAFQAPFCAGAMAVVGGALA
YLVVKTLINATQLLKLLAKLAELVAAAIADIISDVADIIKGILGEVWEFI
TNALNGLKELWDKLTGWVTGLFSRGWSNLESFFAGVPGLTGATSGLSQVT
GLFGAAGLSASSGLAHADSLASSASLPALAGIGGGSGFGGLPSLAQVHAA
STRQALRPRADGPVGAAAEQVGGQSQLVSAQGSQGMGGPVGMGGMHPSSG
ASKGTTTKKYSEGAAAGTEDAERAPVEADAGGGQKVLVRNVV
>MT2001 hypothetical protein
MSQPASVLPQHELRSPGPAVVGSTCDELPPCSSLLNFPRMKAGELRVNIQ
QVAATASQWSGRSTELSVLAPPPLGQPFQPTTAAVGGAHAAVGLAVAAFT
ARTHATASAVEAAAAEYANNEAAAAAEMAAVPQTRLV
>MT2024 hypothetical protein
MSVAVDSDAEDDAVSEIAEAAGVSPAPAKPSMSAPRRMLLFGLVVVVALA
VLLCCWGFRVQRARHAQDQRGHFLQAARQCALNLTTIDWRNAEADVRRIL
DGATGEFYNDFAQRSQPFVEVLRHAKASTVGTITEAGLQTQTADTAQALV
AVSVQTSNAGEADPVPRAWRMRITVQRVGDRVKVSDVGFVP
>MT1407 hypothetical protein
MTDDVRDVNTETTDATEVAEIDSAAGEAGDSATEAFDTDSATESTAQKGQ
RHRDLWRMQVTLKPVPVILILLMLISGGATGWLYLEQYRPDQQTDSGAAR
AAVAAASDGTIALLSYSPDTLDQDFATARSHLAGDFLSYYDQFTQQIVAP
AAKQKSLKTTAKVVRAAVSELHPDSAVVLVFVDQSTTSKDSPNPSMAASS
VMVTLAKVDGNWLITKFTPV
>MT2354 hypothetical protein
MAMEMAMMGLLGTVVGASAMGIGGIAKSIAEAYVPGVAAAKDRRQQMNVD
LQARRYEAVRVWRSGLCSASNAYRQWEAGSRDTHAPNVVGDEWFEGLRPH
LPTTGEAAKFRTAYEVRCDNPTLMVLSLEIGRIEKEWMVEASGRTPKHRG
>MT0469 PPE family protein
MTSALIWMASPPEVHSALLSSGPGPGPVLAAATGWSSLGREYAAVAEELG
ALLAAVQAGVWQGPSAESFAAACLPYLSWLTQASADCAAAAARLEAVTAA
YAAALVAMPTLAELAANHATHGAMVATNFFGINTIPIAVNEADYVRMWLQ
AATTMATYQAVADSAVRSIPDSVPPPRILKSNAQSQHSSSNNSGGADPVD
DFIAEILKIITGGRVIWDPEAGTVNGLPYDAYTNPGTLMWWIARSLELLQ
DFQEFAKLLFTNPVKAFQFLVDLILFDWPTHMLQLATWLAENPQLLVAAL
TPAISGLGAVSGLAGLTGLVPQPPVVPAPAPDAVVPTVLPLAGTATPTTA
PASAPAAGAAPGPPAGTATATSASVPTSAGGFPPYLVGSGPGIDFDAGTP
AGSRRAQPAADNVTAVAAAQVSARHQARRCRRAAAKERGNADEFVDMDSG
PAIPPSGERDAWASNSGVGGLGFAGTASNETVAAPAGLTTLADDEFQCGP
RMPMLPGAWDLGTWDRGD
>MT0270.2 hypothetical protein
MEAALTCRNRVVALSWAQRPSPTAELGVGESLRSAGAARFGFDHVVVDQL
TRRTCNHGGSARRRVPATSTRSAQPVHTRHGV
>MT1650.1 hypothetical protein
MCPALSDSCLTGGFVPELVAAVRDWRRALTCRNRVVALSWAQRPSPTAKA
SVADVTKERYGQIFMWRSGIGGSAPASRARPGCEKIWNLPGLLAYWA
>MT2024.1 hypothetical protein
MSWSRVIAYGLLPGLALALTCGAGLLKWQDGAVRDAAVARAESVRAATDG
TTALLSYRPDTVQHDLESARSRLTGTFLDAYTQLTHDVVIPGAQQKQISA
VATVAAAASVSTSADRAVVLLFVNQTITVGKDAPTTAASSVRVTLDNING
RWLISQFEPI
>MT2732 conserved hypothetical protein
MADIPYGRDYPDPIWCDEDGQPMPPVGAELLDDIRAFLRRFVVYPSDHEL
IAHTLWIAHCWFMEAWDSTPRIAFLSPEPGSGKSRALEVTEPLVPRPVHA
INCTPAYLFRRVADPVGRPTVLYDECDTLFGPKAKEHEEIRGVINAGHRK
GAVAGRCVIRGKIVETEELPAYCAVALAGLDDLPDTIMSRSIVVRMRRRA
PTEPVEPWRPRVNGPEAEKLHDRLANWAAAINPLESGWPAMPDGVTDRRA
DVWESLVAVADTAGGHWPKTARATAETDATANRGAKPSIGVLLLRDIRRV
FSDRDRMRTSDILTGLNRMEEGPWGSIRRGDPLDARGLATRLGRYGIGPK
FQHSGGEPPYKGYSRTQFEDAWSRYLSADDETPEERDLSVSAVSAVSPPV
GDPGDATGATDATDLPEAGDLPYEPPAPNGHPNGDAPLCSGPGCPNKLLS
TEAKAAGKCRPCRGRAAASARDGAR
>MT2565 hypothetical protein
MPAGSMLAGMREIDPGADVAPLDCSKVSKDDVGNPVAAGSVALLLADRVG
STHLGGRRGKSEQGLSRR
>MT1514.1 PE_PGRS family protein
MRRCQMSFVVANTEFVSGAAGNLARLGSMISAANSAAAAQTTAVAAAGAD
EVSAAVAALFGAHGQTYQVLSAQAAAFHSQFVQALSGGAQAYAAAEATNF
GPLQPLFDVINAPTLALLNRPLIGNGADGTAANPNGQAGGLLIGNGGNGF
SPAAGPGGNGGAAGLLGHGGNGGVGALGANGGAGGTGGWLFGNGGAGGNS
GGGGGAGGIGGSAVLFGAGGAGGISPNGMGAGGSGGNGGLFFGNGGAGAS
SFLGGGGAGGRAFLFGDGGAGGAALSAGSAGRGGDAGFFYGNGGAGGSGA
GGASSAHGGAGGQAGLFGNGGEGGDGGALGGNGGNGGNAQLIGNGGDGGD
GGGAGAPGLGGRGGLLLGLPGANGT
>MT0641 hypothetical protein
MPDRPQHPTASRQSSMVSWNHGAAGWLHCVQCGSATNPTACLDWLPPIHA
RSGPMYAEHDVVVLTRDVPDKSLIAGDVGAVVGRYAAGGYEVDFTAANGC
TVAVVTLAGDDIRPRRRREIPHVREVA
>MT1182 hypothetical protein
MRTRRRRTTLTPNNEDVAGRITESLKQMRTDPSAQPAPEQPIAAATSFYP
STNMTPWSRVSTDLISSLLVGAVEAQHAADGRRVLAVLGLVERAARGRTS
>MT3261 hypothetical protein
MSVALLREMFDRMVVAKNAELIEHYYDPDFLMYSDGLSQSFAKFRDSHRK
LYATAISYAVEYDEHAWVEAQTRLPGGCGSPRRDLARSRPASRWYSLPPT
ATAEFTGSGRRRGRVGATWPPSTITETTTDRLAMRNQLRAGAATLLFCDP
MLQRFPATRK
>MT4028 conserved hypothetical protein
MPMTQHRVSTTVPGRGRDRTATRLGRFGARHLSDRRRGRPRDGAHGTVGG
TARARGEPSPTPFVQVRIDQVTPDVVTTSSEPHVTVSGTVTNTGDRPVRD
VMVRLEHAAAVTSSTALRTSLDGGTDQYQPAADFLTVAPELDRGQEAGFT
LSAPLRSLTRPSLAVNQPGIYPVLVNVNGTPDYGAPARLDNARFLLPVVG
VPPDQATDFGSAVAPETTAPVWITMLWPLADRPRLAPGAPGGTVPVRLVD
DDLANSLANGGRLDILLSAAEFATNREVDPDGAVGRALCLAIDPDLLITV
NAMTGGYVVSDSPDGAAQLPGTPTHPGTGQAAASSWLDRLRTLVHRTCVT
PLPFAQADLDALQRVNDPRLSAIATISPADIVDRILDVSSTRGATVLPDG
PLTGRAINLLSTHGNTVAVAAADFSPEEQQGSSQIGSALLPATAPRRLSP
RVVAAPFDPAVGAALAAAGTNPTVPTYLDPSLFVRIAHESITARRQDALG
AMLWRSLEPNAAPRTQILVPPASWSLASDDAQVILTALATAIRSGLAVPR
PLPAVIADAAARTEPPEPPGAYSAARGRFNDDITTQIGGQVARLWKLTSA
LTIDDRTGLTGVQYTAPLREDMLRALSQSLPPDTRNGLAQQRLAVVGKTI
DDLFGAVTIVNPGGSYTLATEHSPLPLALHNGLAVPIRVRLQVDAPPGMT
VADVGQIELPPGYLPLRVPIEVNFTQRVAVDVSLRTPDGVALGEPVRLSV
HSNAYGKVLFAITLSAAAVLVTLAGRRLWHRFRGQPDRADLDRPDLPTGK
HAPQRRAVASRDDEKHRV
>MT0031 hypothetical protein
MADSALQQQLDEVRALLTRARELFGPNPIEPPTDIAPDPDSTKTWLI
>MT3145.1 hypothetical protein
MGAGCALALSSAAPVDWRNPPKRAYLGRSTIGIPSIGIGMAL
>MT2672 hypothetical protein
MLVAGFATWHEGHEAAVRALNRGVHLIAHAAVETYSVLTRLPPPHRIAPV
AVHAYLADITSSNYLALDACSYRGLTDHLAEHDVTGGATYDALVGFTAKA
AGAKLLTRDLRAVETYERLRVEVELVT
>MT1571 hypothetical protein
MPSSGTALAHPDQSLEDIRTGELTDLKDGPQGYLMALSVVESAATGPIPH
APSIEARRAIYQDLGM
>MT2218.1 hypothetical protein
MGRIPGTRRAGGCFFAAAAADVDSQPGPVRDRIAATGRAGIAAITADVET
AQRRGEIRADIEVRQLAFELHAYAMEANWALLLLDDDGAGERARTAIDAA
LARVGTTQEGVES
>MT3696 PE_PGRS family protein
MSFVIVAPEALMSVASEVAGIGSALNAANAAAAAPTTGVLAAAADEVSAA
MAALFGAHAQEYQRLSAQAAGFHAQFVQALNAGVNSYASAEAANASPLQA
VEQQVLGLINGPAQTLLGRPLIGNGADGAPGTGQPGGPGGLLWGNGGNGG
SGVAGVGGPGGSGGAAGLFGHGGNGGAGGSNAAGAGGVGGAGGAGWLVGN
GGAGGFGGVGTTVSGNGGAGGAAGAFGNGGVGGAGGAAVIGGLPGNGGAG
GNAGLIGAGGDGGVGGVGAPGTNGMNPPPNQTSQAANGSPGANNGAGSGG
AGLPGNPGAVPGRAGGAGGLGGSGSDTSEGPVTGGNGGNGGDGGPGAPGG
NGAPGGIGVNTGTGWAYGGNGGNGGDGGAGARGGDGGNGGNGLALNGGNG
IGGNGGAGGRGGTGAAGGNGGIGGGATGTLTFFGSGGDGGPGGAGANTAG
TGGVGGVGGAGGQGGLLFGDGGNGGAGGAGGIGGTGASGGAGGKGGSGLV
GGDGGNGGAGGAGGNGGKGGAGGAGGGAGMFSQPGVHGAGGTGGQGGAGG
AGGAGGAAGAGTVVAGNPGDPGGFGAAGADGLPG
>MT1070 hypothetical protein
MVHWIGSKRYIRRQASTARLAEAADNPRSRVLAFASAGNLRKSRF
>MT2866 hypothetical protein
MPLTVADIDRWNAQAVREVFHAASARAEVTFEASRQLAALSIFANSGGKT
AEAAAHHNAGIRRDLDAHGNEALAVARAADRAADGIVKVQSELAALRHAA
AAAELTIDALINRVVPIPGLRSTEAQWARTLAKQTELQAELDAIMAEANA
VDEELASAVNMADGDAPIPADSGPPVGPEGLTPTQLASDANEERLREERA
RLQAHLERLQAEYDQLSVRAARDYHNGILDGDAVGRLAALTDELSAARGR
LGELDAVDEALSRAPETYLTQLQIPEDPNQQVLAAVAVGNPDTAANVSVT
VPGVGSTTRGALPGMVTEARDLRSEVIRQLNAAGKPASVATIAWMGYHPP
PNPLDTGSAGDLWQTMTDGQAHAGAADLSRYLQQVRANNPSGHLTVLGHS
YGSLTASLALQDLDAQSAHPVNDVVFYGSPGLELYSPAQLGLDHGHAYVM
QAPHDLITNLVAPLAPLHGWGLDPYLTPGFTELSSQAGFDPGGIWRDGVY
AHGDYPRSFLDAAGQPQLRISGYNLAAIAAGLPDNTVGPPLLPPILGGGM
PAAPGPALRGGR
>MT2802 hypothetical protein
MTADEPRSDDSSGSAPQPAATPVPRPGPRPGPRPVPRPTSYPVGAHPPSD
PHRFGRIDDDGTVWLVSASGERIVGSWQAGDPEAAFAHFGRRFDDLSTEI
MLMDERLASGTGDARKIKAHAIALAETLPTACVLGDVDALADRLTSIRDR
AEVIAAADRSRREEHRAAQTARKEALAAEAEELAANATQWKVAGDRLRAI
LDEWKTISGVDRKVDDALWKRYSTARDTFNRRRGSHFAELDRERSGVRQS
KERLCERAEELSESTDWTATSAEFRKLLADWKAAGRASKDVDDALWRRFK
AAQDSFFTARNAATAEKEAELRANADAKEALLAEAERLDTTNHEAARAAL
RSIAEKWDAIGKVSRERAAELERRLRAVEKKVREAGEADWSDPQARARAE
QFRARAEQFEHQAEKAAAAGRTKEADEAKANAEQWRQWAEAAADALTRRP
>MT2230 conserved hypothetical protein
MTTPSHAPAVDLATAKDAVVQHLSRLFEFTTGPQGGPARLGFAGAVLITA
GGLGAGSVRQHDPLLESIHMSWLRFGHGLVLSSILLWTGVGVMLLAWLGL
GRRVLAGEATEFTMRATTVIWLAPLLLSVPVFSRDTYSYLAQGALLRDGL
DPYAVGPVGNPNALLDDVSPIWTITTAPYGPAFILVAKFVTVIVGNNVVA
GTMLLRLCMLPGLALLVWATPRLASHLGTHGPTALWICVLNPLVLIHLMG
GVHNEMLMVGLMTAGIALTVQGRNVAGIILITVAIAVKATAGIALPFLVW
VWLRHLRERRGHRPVQAFLAAAAISLLIFVAVFAVLSAVAGVGLGWLTAL
AGSVKIINWLTVPTGAANVIHALGRGLFTVDFYTLLRITRLIGIVIIAVS
LPLLWWRFRRDDRAALTGVAWSMLIVVLFVPAALPWYYSWPLAVAAPLAQ
ARRAIAAIAGLSTWVMVIFKPDGSHGMYSWLHFWIATACALTAWYVLYRS
PDRRGVQAATPVVNTP
>MT4019 hypothetical protein
MPLAPDHTNPDTPRRMRYVTGDHAVQVFQLTSTVIDLTTKRKHTTVVYAA
TSMSGTPPLHR
>MT1977 hypothetical protein
MKLTTMIKTAVAVVAMAAIATFAAPVALAAYPITGKLGSELTMTDTVGQV
VLGWKVSDLKSSTAVIPGYPVAGQVWEATATVNAIRGSVTPAVSQFNART
ADGINYRVLWQAAGPDTISGATIPQGEQSTGKIYFDVTGPSPTIVAMNNG
MEDLLIWEP
>MT3076 hypothetical protein
MRTEGISLGTHGCVSQGLGDVCFLKLRKFGENLGGCHLVRDHRDHSRHWD
AQPRMHGMPPITSGSMVTRVTRMSIRLAGDSTLGRFSTSRLGLSSAKSKP
EGDFGTACGAVSGGDAGVVALAEGVDDGQSKPGAAGGARGVGGFRESRAD
CGEQFGVASWTPQGEFEFGGQEAKGVRSSWPASLTN
>MT0506 hypothetical protein
MEVKTPAGDGLVALTPFRTQKFAITICAFKSLACM
>MT1771.1 hypothetical protein
MAADLNSFVGGSASSTNAESMALAFRGRVHMSVNIAGLT
>MT3788 hypothetical protein
MPACRRACHAPHFDGVEMVSGSCQCPDGILEELFAGGCRRPVWVTLIGNL
LSLGVVMVYTGSDAGDHASAPQPSGSGSVPASVNVPGLVVAAVWAVGLVA
GLVALTIGHLAVAAAALVVAVMAPWCRVAYIAHGQHRVCGETLRGTPAGE
TASFPTGWRGLRFSTR
>MT1839.1 hypothetical protein
MASTPAVRIDSAVCGVYRPARSIAICSISGGRADRIESYPADGDRVITLW
RNPYR
>MT0158 hypothetical protein
MLTLPDDRAPTGLPDPGIEALAHTKIASTISTVVADGYAVVLSTADIANS
LLANAIGYPIAASVALVTPAAGANSSCWPADPSQHHRIAESRACA
>MT0377 conserved hypothetical protein
MSNAPEPDRSAGESGSEPAGERSADPGEERTESYPLVPHDAETETVVITT
SDNDAAVTQPEAQRERRFTAPGFDAKETQVIVTAHEAATEVFQTNQAPTT
PPRMPTGMPPKTAVPQSIPPRTEATSVRQRTWGWALAVVVIVLALAAIAI
LGTVLLTRGKHSKMSQEDQVRQAIQSLDIAIQTGDLTALRSLTCGSTRDG
YVDYDERDWAETYRRVSAAKQYPVIASIDQVVVNGAHAEANVTTFMAFDP
QVRSTRSLDLQFRDDQWKICQSSSN
>MT2526 conserved hypothetical protein
MPVGWLWRARTAKGTTLKNARTTLIAAAIAGTLVTTSPAGIANADDAGLD
PNAAAGPDAVGFDPNLPPAPDAAPVDTPPAPEDAGFDPNLPPPLAPDFLS
PPAEEAPPVPVAYSVNWDAIAQCESGGNWSINTGNGYYGGLQFTAGTWRA
NGGSGSAANASREEQIRVAENVLRSQGIRAWPVCGRRG
>MT0325 hypothetical protein
MTPPHRPHTNEVVTEPYTAHPPLWLTLATDPY
>MT3826 conserved hypothetical protein
MGRKVAVLWHASFSIGAGVLYFYFVLPRWPELMGDTGHSLGTGLRIATGA
LVGLAALPVVFTLLRTRKPELGTPQLALSMRIWSIMAHVLAGALIVGTAI
SEVWLSLDAAGQWLFGIYGAAAAIAVLGFFGFYLSFVAELPPPPPKPLKP
KKPKQRRLRRKKTAKGDEAEPEAAEEAENTELAAQEDEEAVEAPPESIES
PGGEPESATREAPAAETATAEEPRGGLRNRRPTGKTSHRRRRTRSGVQVA
KVDE
>MT1945 hypothetical protein
MANSPGSGRVSARTPAKLTRRSKVTFTRAWPASNNDGCPTEQKYWAMVCT
AVGYHRDDPAAELLLRNEGLAAAVQTGHLRLPATETSRQGPCGRQSVRQP
RPGDHSVPPRDRSPNRDRAADLLPDQPSIGPGAMDLDPLGPRR
>MT0291.1 hypothetical protein
MAVDTSSAPAAISAVVLAAAAAFAALMLDPRLAKSVAAAAITSGAAITND
I
>MT3846 hypothetical protein
MVGEMTEGVRGVTQGVVPDGLAAAALSPAVDSVSGGSL
>MT0250 hypothetical protein
MVVGDHGLGVRSISHADHQQLGQLVTGGQQRRDPRRLGAAPGHGNGYRSC
GRRAGGRRRSGGCAAMAAGHHQRCCRSEYRQCAWKSHPDMLSRWRIRSGL
GGQAHASAKLPLASQAAPTAVEEPPMNRIVAPAAASVVVGLLLGAAAIFG
VTLMVQQDKKPPLPGGDPSSSVLNRVEYGNRS
>MT0146 hypothetical protein
MSASEFSRAELAAAFEKFEKTVARAAATRDWDCWVQHYTPDVEYIEHAAG
IMRGRQRVRAWIQETMTTFPGSHMVAFPSLWSVIDESTGRIICELDNPML
DPGDGSVISATNISIITYAGNGQWCRQEDIYNPLRFLRAAMKWCRKAQEL
GTLDEDAARWMRRHGGP
>MT3671.1 hypothetical protein
MSGADPPTRRAFGQMARAATGWVSVSGQFAVAADTCRCEGTLFAVDPETH
VANHNRCDIVGRLRDERPNTLRSVRRGDEVRMATWHWI
>MT3631 hypothetical protein
MIGAVADAVRPARPDGHGAAWEQLLGVANCSLPPGNQVVPSPWQATARAI
HLVIKQFRAAGELSR
>MT3587 hypothetical protein
MSDEIDPDWPAPAYQPSDDVDTTPPAPGGSWPTAWLVALVVLACVAAAVV
AYAGMHRVRPGANQAAPATTSAPARPTSPASQVGPCGPDEATAVRAALAQ
LAPDSKTGRPWNSTPEDSNYDPCADLSAVLVTVQDATNSSPDQALMFHRG
TFVGTATPRAYPFTNLIGPASTNDIVVLSYRTRQSCDGCQDGILTIVGFA
WRGDHVQILDSLPELFDAPP
>MT2407 hypothetical protein
MDATAPLVGGTALIGYVAVLGLGYVLGAKAGRRRYEQIASTYRALTGSPV
ARSMIEGGRRKIANRISPDAGFVTLAEIDNQTAVVQRGVERQPKTAR
>MT2365 hypothetical protein
MAFVDLRYPWCRGDGWISPPVVAVALGWAMRRKPFSRFNEYVGSASNTCW
FARALELRTLLIR
>MT3972 hypothetical protein
MNCALGFDTKPILLASYVTHGARRATANQFERPAKGAGVLMALLILGEMA
GFAVVVTGVVFGQLV
>MT1461 hypothetical protein
MACLGRPGCRGWAGASLVLVVVLALAACTESVAGRAMRATDRSSGLPTSA
KPARARDLLLQDGDRAPFGQVTQSRVGDSYFTSAVPPECSAALLFKGSPL
RPDGSSDHAEAAYNVTGPLPYAESVDVYTNVLNVHDVVWNGFRDVSHCRG
DAVGVSRAGRSTPMRLRYFATLSDGVLVWTMSNPRWTCDYGLAVVPHAVL
VLSACGFKPGFPMAEWASKRRAQLDSQV
>MT0527 conserved hypothetical protein
MISVSGAVKRMWLLLAIVVVAVVGGLGIYRLHSIFGVHEQPTVMVKPDFD
VPLFNPKRVTYEVFGPAKTAKIAYLDPDARVHRLDSVSLPWSVTVETTLP
AVSVNLMAQSNADVISCRIIVNGAVKDERSETSPRALTSCQVSSG
>MT1825 conserved hypothetical protein
MASDLYLGYRNDDADTPFGKFFKPEMAPLPQHVVVALQHGPQAGMALLAF
DDAASIVDEGYQQTENGYGILGDGSMQVSVRTDMPGVTPAMWAWWFGWHG
SDTRRYKLWHPRAHLSARWKDGDQDSGAGRRGAQRYVGRWSMISEYIGST
KLGAAIQFVEPAAMGLPDDSDDTVSICARLGSADAPVDAGWFVHQVRSTP
GGSEMRSRFWMGGPHIAVRKAPEVASKAVRPIASKLIGVSESTARNLLVY
CAQEMNHLAGFLADLWESFGDE
>MT1686 hypothetical protein
MIYRVACLLARIRFTVGYVAALASVSTTILMHGPQVHAQVIRHASTNLHN
LAHGHLGTLWNSAFVIDEGPLYFWLPCLACLLAVAELQLRSLRLTVAFVV
GHIGATLLVAAVLAGAIEIGWLPWSISRVSDVGMSYGALAALGALTAAIP
GRWRPAWIGWWVSLGLATATIGGGFTDAGHTVALLLGMLVTACFTRPARW
TLGRCALLAVASGFCLVLLAHSWWSLVSGSALGLLGALGAAGFARWTRAR
ATSLPPGALAIPQPALSR
>MT3573.9 hypothetical protein
MTEFDDIKNLSLPETRDAAKQLLDSVAGDLTGEAAQRFQALTRHAEELRA
EQRRRGREAEEAAPLPGR
>MT2747 conserved hypothetical transmembrane protein
MYGALVTAADSIRTGLGASLLAGFRPRTGAPSTATILRSALWPAAVLSVL
HRSIVLTTNGNITDDFKPVYRAVLNFRRGWDIYNEHFDYVDPHYLYPPGG
TLLMAPFGYLPFAPSRYLFISINTAAILVAAYLLLRMFNFTLTSVAAPAL
ILAMFATETVTNTLVFTNINGCILLLEVLFLRWLLDGRASRQWCGGLAIG
LTLVLKPLLGPLLLLPLLNRQWRALVAAVVVPVVVNVAALPLVSDPMSFF
TRTLPYILGTRDYFNSSILGNGVYFGLPTWLILFLRILFTAITFGALWLL
YRYYRTGDPLFWFTTSSGVLLLWSWLVMSLAQGYYSMMLFPFLMTVVLPN
SVIRNWPAWLGVYGFMTLDRWLLFNWMRWGRALEYLKITYGWSLLLIVTF
TVLYFRYLDAKADNRLDGGIDPAWLTPEREGQR
>MT3927 hypothetical protein
MQFYDDGVVQLDRAALTLRRYHFPSGTAKVIPLDQIRGYQAESLGFLMAR
FNIWGRPDLRRWLPLDVYRPLKSTLVTLDVPGMRPKPACTPTRPKEFIAL
LDELLALHRT
>MT0666 conserved hypothetical protein
MALKTDIRGMIWRYPDYFIVGREQCREFARAVKCDHPAFFSEEAAADLGY
DALVAPLTFVTILAKYVQLDFFRHVDVGMETMQIVQVDQRFVFHKPVLAG
DKLWARMDIHSVDERFGADIVVTRNLCTNDDGELVMEAYTTLMGQQGDGS
ARLKWDKESGQVIRTA
>MT0628 hypothetical protein
MKPPLAVDTSVAIPLLVRTHTAHAAVVAWWAHREAALCGHALAETYSVLT
RLPRDLRLAPMDAARLLTERFAAPLLLSSRTTEHLPRVLAQFEITGGAVY
DALVALAAAEHRAELATRDARAKDTYEKIGVHVVVAA
>MT3643 PPE family protein
MADFLTLSPEVNSARMYAGGGPGSLSAAAAAWDELAAELWLAAASFESVC
SGLADRWWQGPSSRMMAAQAARHTGWLAAAATQAEGAASQAQTMALAYEA
AFAATVHPALVAANRALVAWLAGSNVFGQNTPAIAAAEAIYEQMWAQDVV
AMLNYHAVASAVGARLRPWQQLLHELPRRLGGEHSDSTNTELANPSSTTT
RITVPGASPVHAATLLPFIGRLLAARYAELNTAIGTNWFPGTTPEVVSYP
ATIGVLSGSLGAVDANQSIAIGQQMLHNEILAATASGQPVTVAGLSMGSM
VIDRELAYLAIDPNAPPSSALTFVELAGPERGLAQTYLPVGTTIPIAGYT
VGNAPESQYNTSVVYSQYDIWADPPDRPWNLLAGANALMGAAYFHDLTAY
AAPQQGIEIAAVTSSLGGTTTTYMIPSPTLPLLLPLKQIGVPDWIVGGLN
NVLKPLVDAGYSQYAPTAGPYFSHGNLVW
>MT3455 IS1608', transposase
MTAENPGRSRRTLVGIDAAITACHHIAIRDDVGARSIRFSVEPTLAGLRT
LTDKLSGYDDIDATVEPTSMTWLPLTIAVENAGDTMHMAGARHCARLRGA
IVGKSKSDVIDAEVLTRASEVFDLTPLTLPTPAQLALRRSVIRRAGAVID
ANRSWRRLMSLAR
>MT1356 hypothetical protein
MTTKLATNTLLGPEATLGLVPGVVPPPWWWGVVFENWIVVASINGYAAG
>MT1413 hypothetical protein
MRPSRQGEVGEVAGYVVEYNRRTHVRRITEFATPQEAMEHRLKLEAERTD
SNIEIVALVSKSLGTLKQTHSRYFTGEELNVGNGAR
>MT3131.1 hypothetical protein
MRVRERFRHPTSLKVSTSLAPVWCGAIQTIHEAFTTIASPWTHEPVTFCS
VV
>MT0607 PE_PGRS family protein
MSFVIATPEMLTTAATDLAKIGSTITAANTAAAAVAKVLPASADEVSVAV
AALFGTHAQEYQTVSAQVATFHDRFVQTLSAAASSYVAAEAVNVEQSLLA
AVNAPTQALFGRPLIGNGADGSPGTGQAGGPGGILYGNGGNGGSGAPGQR
GGAGGAAGLIGNGGNGGAGGVGTTGGAGGHGGAGGWLYGNGGAGGFGGAG
AVGGNGGAGGTAGLFGVGGAGGAGGNGIAGVTGTSASTPGGSGTAGGAGG
IGGNGGAGGAGGVLMGNGGNGGAGGEGGPGGAGGAGASGAHATNLGADGQ
AGGNGGNGGAGGTGGVGGPGGGHGLLGLGGSHGAGGAGGSGGDGGAPGDG
GNGATGTWGHNLRAGGTGGNGGNPGAGGAGGAGGASVGGSAHGANGAPGT
TSTSGGNGGDGGKGADAISSGQTGANGGRGGDGGQVGNGGAGGAGGRGGA
GGLGFGSEAPGRPGGAGGTGGAGGNGGTQAGDGGTGGAGGAGGDGGSGGA
GSIGFNASAPGAAGSPGGNGGNGGPGGAGGEGGAGGLALAASGQNGSQGA
GGDGGAGGNGGTPGNGGHGAAGALGVNGGVGGAGGHGGDPGVGGAGGQGG
SGSTPGANGAPGNTPTSGGNGGNGGRGADATGFGQTGASGGRGGDGGLVG
NGGAGGAGGNGSKGLPGLGRLGNPGLDGGTGGNGGAGGSGGAWAGNGGTG
GAGGTGGVGGTGGSGSDGVNGSSAGADGHPGGTGGVGGTGGKGGDGGDGG
AAPNGVAGSQGPGGAGGDGGTGGVGGNGGRGIDGADGATAGARGQDGGAG
GAGGKGGRGGTGGPGGAGPAGTTGSQGAGGNGGSGGTGGDPGDGGNGANG
SVFTNNGIGGNGGNGGNAGPSGAGGSGGAGSTFGATGSSSSIHVNGGNGG
NGGNGDHALSGNGAAGGNGGNGGNGSLRGSGGAGGHGGNGGNASRGMGGD
GGTGGAGGNAGQIGNGGAGGNGGDGGTGSDGNPGAITGSGGRGGDGGVGG
QGGSVAGDGADGGRGGAGGTGGTGLRGTTGATGATGTFDAGADGHGGNGG
TGGVGGTGGAGGGGGNGGAGGKALSPTGNNGSQGAGGDGGAGGAGGTGGT
GGDGGRGAHGTLFSSLAGTGGTGGNGGTGGTGGTGGAGGAGGTGSTLGAT
GATGAAGRAGNGGVGGSGGLGSAFGPGGTGGMGGAGGTSTVSAGGDGGRG
GFGGDGLDASSGGNGGDGGHGGDGFRTAGAGGRGGDGGKGADPGGLFPIP
GAGGKGGTGGTGGTAHLGPLAIIGQSGQPGQFGSPGADGRGGAGGAGGGG
GAGGSF
>MT2637.1 hypothetical protein
MAAAGSVDCIDHASDQISELSNYVHIGRNLGSIPILEVIYVNLWESHCPN
AFQAPDANSGERVTNPAGQDGQPASAGADYASWGPGGQPTGGAGRTASTT
WPPDSFR
>MT1234 PPE family protein
MVDFGALPPEINSARMYAGPGSASLVAAAQMWDSVASDLFSAASAFQSVV
WGLTVGSWIGSSAGLMVAAASPYVAWMSVTAGQAELTAAQVRVAAAAYET
AYGLTVPPPVIAENRAELMILIATNLLGQNTPAIAVNEAEYGEMWAQDAA
AMFGYAAATATATATLLPFEEAPEMTSAGGLLEQAAAVEEASDTAAANQL
MNNVPQALQQLAQPTQGTTPSSKLGGLWKTVSPHRSPISNMVSMANNHMS
MTNSGVSMTNTLSSMLKGFAPAAAAQAVQTAAQNGVRAMSSLGSSLGSSG
LGGGVAANLGRAASVGSLSVPQAWAAANQAVTPAARALPLTSLTSAAERG
PGQMLGGLPVGQMGARAGGGLSGVLRVPPRPYVMPHSPAAG
>MT3947 hypothetical protein
MPPLTSLAPTTAERIRSACARAGGALLVVEREDPVPVPIHHLLYDGSFAV
AVPVDRGEVSGSQALLELTDYAPLPVREPVRSLVWIRGCLHQIPPAELVE
TLDLIATDNPNPALLQVETPRPGPADAAETRYTMQRLEIESVVVTDATGA
EPVTVADLLAARPDPFCEIESTLLWHLATAHDDVVARLVSRLPAPLRRGQ
IRPLGLDRYGVRFRIEARDGDRDIRLPFHKPVDDMTGLSQAIRVLMGCPF
RNGLRARR
>MT1082 hypothetical protein
MSVPRRTPNSHAPAGPVPDFLRHRYETSTHRVICLRSVPKTSGIGAVEAL
TPSTAGRAGLAQRALPRRAGRPLGPHRPRHLPTRRRVDRRLGSDRGRHAP
PRRYDLPGLRTHAPRPDRRDPRRAGHRHPPLVEDTGQHRRDCVAPLRPGH
ILDRTRRDHDPGIGSDNRNLTHPSARSPIHSGSATKSVTNWRVTRCESGC
AEAANPPG
>MT1206 PE family protein
MEGDARVSFVTTRPDSIGETAANLHEIGVTMSAHDDGVTPLITNVESPAH
DLVSIVTSMLFSMHGELYKAIARQAHVIHESFVQTLQTSKTSYWLTELAN
RAGTST
>MT3491.1 hypothetical protein
MGGECYAMVVRFPREHNAKLKQLHAETEPKGRPAGKAATGEL
>MT0895 hypothetical protein
MLCHRRLLADESRRLPTTTYLSEYYVAPTADGPLPKSRSPVLAANDGCDA
EAGSGPPFFHVKK
>MT3220.1 hypothetical protein
MRQLVPRPVYNWAGRLYGSRLTADRRLIDAVPAASRCALTARLCPNILWV
KRYAADGRPLVRQLSATVSHFWDFAVKSWAMVVDLDEPPVR
>MT1924 conserved hypothetical protein
MTTLNEAAALAAAERGLAVVSTVRADGTVQASLVNVGLLPHPVSGEPSLG
FTTYGKVKLGNLRARPQLAVTFRNGWQWATVEGRAQLVGPDDPRPWLVDG
ERLRLLLREVFTAAGGTHDDWDEYDRVMAQEQRAVVLITPTRIYSNG
>MT2554 hypothetical protein
MTDHGQWRFEHLLIPFLVALAQLRWHRVIPRTAHAYAECYSPRPHRHLPG
>MT4041.2 hypothetical protein
MSGNITIRPLTAFQLDWPAGQSEVVFAAGRRGPWTNPGRSRIADFRATVR
GYLRAFAWSNLAHPATASGHPAR
>MT2135 hypothetical protein
MPRARWLQSAALMGALAVVLITAAPVAADAYQVPAPPSPTASCDVISPVA
IPCVALGKFADAVAAECRRVGVPDARCVLPLAHRVTQAARDAYLQSWVHR
TARFQDALQDPVPLRETQWLGTHNSFNSLSDSFTVSHADSNQQLSLAQQL
DIDVRALELDLHYLPRLEGHGAPGVTVCHGLGPKNANLGCTVEPLLATVL
PQIANWLNAPGHTEEVILLYLEDQLKNASAYESVVATLDQVLRRADGTSL
IYRPNPARRATNGCVPLPLDVSREEIRASGARAVLVGSCAPGWSAAVFDW
SGVELESGSNSGYRPYPACDATYGRGVYAWRLVRYYEDSTLATALANPTR
PPANPQALTPPKVPAMTDCGVNLFGFDQLLPEDGRIQASLWSWAPDEPRA
GAGACALQGADGRWVAASCGDPHPAACRDAAGRWTVTPAPVVFAGAALAC
TAIGADFTLPRTGNQNARLHAVAGPAGGAWVHYLLPP
>MT1098 hypothetical protein
MQSVRQIFPQDIFIGRTMGVVVERYGSTDLSSRGMFVPFHDVDCVQ
>MT2772 conserved hypothetical protein
MSGTRLAPHSVRYRERLWVPWWWWPLAFALAALIAFEVNLGVAALPDWVP
FATLFTVAAGTLLWLGRVEIRVTAGSADGAGVKLWAGPAHLPVAVIARSA
EIPATAKSAALGRQLDPAAYVLHRAWVGPMVLVVLDDPNDPTPYWLVSCR
HPERVLSALRS
>MT3525 WhiB-related protein
MPQPEQLPGPNADIWNWQLQGLCRGMDSSMFFHPDGERGRARTQREQRAK
EMCRRCPVIEACRSHALEVGEPYGVWGGLSESERDLLLKGTMGRTRGIRR
TA
>MT0772.6 hypothetical protein
MCIMRTTVSISDEILAAAKRRARERGQSLGAVIEDALRREFAAAHVGGAR
PTVPVFDGGTGPRRGIDLTSNRALSEVLDEGLELNSRK
>MT1836 PPE family protein
MLYSAPFRCRARVSLATFGGNAHQGSSGSPTRSCVEREDWLDFGALPPEI
NSGRMYCGPGSGPMLAAAAAWDGVAVELGLAATGYASVIAELTGAPWVGA
ASLSMVAAATPYVAWLSQAAARAEQAGMQAAAAAAAYEAAFVMTVPPPVI
TANRVLVMTLIATNFFGQNSAAIAVAEAQYAEMWAQDAVAMYGYAAASAS
ASRLIPFAAPPKTTNSAGVVAQVAAVAAMPGLLQRLSSAASVSWSNPNDW
WLVRLLGSITPTERTTIVRLLGQSYFATGMAQFFASIAQQLTFGPGGTTA
GSGGAWYPTPQFAGLGASRAVSVSLARANKIGALSVPPSWVKTTALTESP
VAHAVSANPTVGSSHGPHGLLRGLPLGSRITRRSGAFAHRYGFRHSVVAR
PPSAG
>MT3793 hypothetical protein
MGAGPVIPTRLATVRRRRPWRGVLLTLAAVAVVASIGTYLTAPRPGGAMA
PASTSSTGGHALATLLGNHGVEVVVADSIADVEAAARPDSLLLVAQTQYL
VDNALLDRLAKAPGDLLLVAPTSRTRTALTPQLRIAAASPFNSQPNCTLR
EANRAGSVQWGPSDTYQATGDLVLTSCYGGALVRFRAEGRTITVVGSSNF
MTNGGLLPAGNAALAMNLAGNRPRLVWYAPDHIEGEMSSPSSLSDLIPEN
VHWTIWQLWLVVLLVALWKGRRIGPLVAEELPVVIRASETVEGRGRLYRS
RRARDRAADALRTATLQRLRPRLGVGAGAPAPAVVTTIAQRSKADPPFVA
YHLFGPAPATDNDLLQLARALDDIERQVTHS
>MT1288 hypothetical protein
MLAVALGHGRHGMHQGNPYRHQCPRRIIPLRAVNRAATETHARIVEPALA
>MT2000 hypothetical protein
MLPTLSHIHAWDTEHLIEAAYYWTKVADQWEDVFLEMRNRSHFIAWEGAG
GDGCDSEPALTYR
>MT1068 PPE family protein
MDFGALPPEINSARMYAGAGAGPMMAAGAAWNGLAAELGTTAASYESVIT
RLTTESWMGPASMAMVAAAQPYLAWLTYTAEAAAHAGSQAMASAAAYEAA
YAMTVPPEVVAANRALLAALVATNVLGINTPAIMATEALYAEMWAQDALA
MYGYAAASGAAGMLQPLSPPSQTTNPGGLAAQSAAVGSAAATAAVNQVSV
ADLISSLPNAVSGLASPVTSVLDSTGLSGIIADIDALLATPFVANIINSA
VNTAAWYVNAAIPTAIFLANALNSGAPVAIAEGAIEAAEGAASAAAAGLA
DSVTPAGLGASLGEATLVGRLSVPAAWSTAAPATTAGATALEGSGWTVAA
EEAGPVTGMMPGMASAAKGTGAYAGPRYGFKPTVMPKQVVV
>MT1746 PPE family protein
MTLDVPVNQGHVPPGSVACCLVGVTAVADGIAGHSLSNFGALPPEINSGR
MYSGPGSGPLMAAAAAWDGLAAELSSAATGYGAAISELTNMRWWSGPASD
SMVAAVLPFVGWLSTTATLAEQAAMQARAAAAAFEAAFAMTVPPPAIAAN
RTLLMTLVDTNWFGQNTPAIATTESQYAEMWAQDAAAMYGYASAAAPATV
LTPFAPPPQTTNATGLVGHATAVAALRGQHSWAAAIPWSDIQKYWMMFLG
ALATAEGFIYDSGGLTLNALQFVGGMLWSTALAEAGAAEAAAGAGGAAGW
SAWSQLGAGPVAASATLAAKIGPMSVPPGWSAPPATPQAQTVARSIPGIR
SAAEAAETSVLLRGAPTPGRSRAAHMGRRYGRRLTVMADRPNVG
>MT1812 hypothetical protein
MDVPGLGQIALNSAPIFGGAMLAIAAGQFKGPDFRALIRQDMDLLDRLPA
DATKRRANLQRTIDARIDDLIDAADKSHALRKAAMSYRGNWRDIVLLVCV
LLFTIIWWNVNHGRANWLPTFVLLILLAAVTAVYALRGALRAATSLMRGR
RGADR
>MT2370 hypothetical protein
MTATSVVRLGTKAAAEYLGGLPESTLRYWRYWAPVRAAIALADTRFAISP
TWMRG
>MT1652 conserved hypothetical protein
MEASGRQRRYAAAGSVVLLAGALGYIGLVDPHNSNSLYPPCLFKLLTGWN
CPACGGLRMIHDLLHGELAASINDNVFLLVGVPVLASWVLLRRRHGDLAL
PIPVMIAVAVAVIAWTVLRNLPGFPLVPTISG
>MT1416 desaturase-related protein
MAWPVRYSSAGGGRLRRNPLRIRQGRRPSMGSARQPRRLATVPARERPGG
IPVTNDLPDVRERDGGPRPAPPAGGPRLSDVWVYNGRAYDLSEWISKHPG
GAFFIGRTKNRDITAIVKSYHRDPAIVERILQRRYALGRDATPRDIHPKH
NAPAFLFKDDFNSWRDTPKYRFDDPNDLLHRVKARLAEPALAARIKRMDT
LFNAIVAVLAVGYFAVQGVRLVEPSWMPLWAFVIAMVLLRSSLAGFGHYA
LHRAQRGLNRVFNNAFDLNYVALSLVTADGHTLLHHPYTQSEVDIKKNVF
TMMMRLPWLYRVPVHTIHKFGHMLSGMAIRIVDVFRITRKVGVEESYGSW
RAALPHFLGSAGVRLLLVSELVVFAIAGDFWPWALQFVATLWVSTFLVVA
SHEFEDDTQGGAVNGEDWGIDQLEHANDLTVIGNRYVDCFLSAGLSSHRV
HHVLPFQRSGFANIVTEDVLREEAAKFGVEWLPAKGFITDRLPRLCRKYL
LTPSRQAKERHWGFVREHCSPAALKASASYVVAGFVGIGSV
>MT2472 hypothetical protein
MCRQRDALAMQLASESVKESIGDLPVRDFGQRSRSGGKAIAEHCRTHELH
IRPRTGGESATTVQVGRSAANERADIAPRKTRCCVHVAKPNRIRLADQLA
RSSMGEKPGHDHQRNQRDQNQRDVRPRHPGYLGA
>MT2438 hypothetical protein
MGTVGMLPRTGRHSWQAREWLPFVGIERADTDRPHRG
>MT2621 hypothetical protein
MSTTIVAGVIQGHLPVILPTRRRARDLGHTTALFRAQTLQCIYLSIEYLY
VCSMSRRTTIDIDDILLARAQAALGTTGLKDRVDAALRAAVR
>MT2495 hypothetical protein
MPASVSTVLVDTSVAVAPVVADHDHHEDTFQALRGRTLGLAGHAAFERRT
LATVAKLLAHTFPATRFLGAGAAMSLLPELAPAEIAGGAV
>MT3308 hypothetical protein
MVKPERRTKTDIAAAATIAVVVAVAASLIWWTSDARATISRPAAVAVPTP
APAREVPTSLKQLWTAASPATRVPVVVGGTVATGDGRQVDGRDPATGESL
WSYARDTDLCGVTWVYHYAVAVYRYDRGCGQVSTIDGSTGRRGAARSGYA
DPRVRLFSDGTTVLSAGDTRLELWRSDMVRMLAYGEIDARVKPSNRGLQS
GCTLESAAASSAAVSVLEACTNQADLRLVLLRPGKEDDEPIQRIVPEPGV
RPGSGARVLVVSQNNTAVYLPARSGAQPRVDVIDETGATVSSTLLAKPPS
TSAVASRTGNLVTWWTGDALLVFDAGNLTQRYTIAAGETTAPVGPGVMMA
GQLLVPVTGGIGVYDPVSGANNRYIPVTRPPSTSAVIPAVSGSRVIEQRG
DTLVALG
>MT2192 conserved hypothetical protein
MARAIHVFRTPDRFVAGTVGQPGNRTFYLQAVHDSRVVSVVLEKQQVAVL
AERIGALLFEVNRRFGTPVPPEPTEIDDLSPLIMPVDAEFRVGTMGLGWD
SEAQSVVVELLAVTDAEFDASVVLDDTEEGPDAVRVFLTPESARQFATRS
YRVISAGRPPCPLCDEPLDPEGHICARTNGYRRDVLLGSGDDPAG
>MT3993 hypothetical protein
MSITRPTGSYARQMLDPGGWVEADEDTFYDRAQEYSQVLQRVTDVLDTCR
QQKGHVFEGGLWSGGAANAANGALGANINQLMTLQDYLATVITWHRHIAG
LIEQAKSDIGNNVDGAQREIDILENDPSLDADERHTAINSLVTATHGANV
SLVAETAERVLESKNWKPPKNALEDLLQQKSPPPPDVPTLVVPSPGTPGT
PGTPITPGTPITPIPGAPVTPITPTPGTPVTPVTPGKPVTPVTPVKPGTP
GEPTPITPVTPPVAPATPATPATPVTPAPAPHPQPAPAPAPSPGPQPVTP
ATPGPSGPATPGTPGGEPAPHVKPAALAEQPGVPGQHAGGGTQSGPAHAD
ESAASVTPAAASGVPGARAAAAAPSGTAVGAGARSSVGTAAASGAGSHAA
TGRAPVATSDKAAAPSTRAASARTAPPARPPSTDHIDKPDRSESADDGTP
VSMIPVSAARAARDAATAAASARQRGRGDALRLARRIAAALNASDNNAGD
YGFFWITAVTTDGSIVVANSYGLAYIPDGMELPNKVYLASADHAIPVDEI
ARCATYPVLAVQAWAAFHDMTLRAVIGTAEQLASSDPGVAKIVLEPDDIP
ESGKMTGRSRLEVVDPSAAAQLADTTDQRLLDLLPPAPVDVNPPGDERHM
LWFELMKPMTSTATGREAAHLRAFRAYAAHSQEIALHQAHTATDAAVQRV
AVADWLYWQYVTGLLDRALAAAS
>MT3173 conserved hypothetical protein
MTRINPIDLSFLLLERANRPNHMAAYTIFEKPKGQKSSFGPRLFDAYRHS
QAAKPFNHKLKWLGTDVAAWETVEPDMGYHIRHLALPAPGSMQQFHETVS
FLNTGLLDRGHPMWECYIIDGIERGRIAILLKVHHALIDGEGGLRAMRNF
LSDSPDDTTLAGPWMSAQGADRPRRTPATVSRRAQLQGQLQGMIKGLTKL
PSGLFGVSADAADLGAQALSLKARKASLPFTARRTLFNNTAKSAARAYGN
VELPLADVKALAKATGTSVNDVVMTVIDDALHHYLAEHQASTDRPLVAFM
PMSLREKSGEGGGNRVSAELVPMGAPKASPVERLKEINAATTRAKDKGRG
MQTTSRQAYALLLLGSLTVADALPLLGKLPSANVVISNMKGPTEQLYLAG
APLVAFSGLPIVPPGAGLNVTFASINTALCIAIGAAPEAVHEPSRLAELM
QRAFTELQTEAGTTSPTTSKSRTP
>MT3677 hypothetical protein
MTRLIPGCTLVGLMLTLLPAPTSAAGSNTATTLFPVDEVTQLETHTFLDC
HPNGSCDFVAGANLRTPDGPTGFPPGLWARQTTEIRSTNRLAYLDAHATS
QFERVMKAGGSDVITTVYFGEGPPDKYQTTGVIDSTNWSTGQPMTDVNVI
VCTHMQVVYPGVNLTSPSTCAQANFS
>MT2522 hypothetical protein
MTDRSREPADPWKGFSAVMAATLILEAIVVLLAIPVVDAVGGGLRPASLG
YLVGLAVLLILLTGLQRRPWAIWVNLGAQPVLVAGFAVYPGVGFIGVLFA
ALWVLIAYLRAEVRRRRDYRVSQ
>MT3734 conserved hypothetical protein
MNWIQVLLIASIIGLLFYLLRSRRSARSRAWVKVGYVLFVLAGIYAVLRP
DDTTVVANWFGVRRGTDLMLYALVMAFSFTTLSTYMRFKDLELRYARIAR
ALALEGAQAPEQCR
>MT3551 hypothetical protein
MSPHRAVIEAGPGAIRRLCCGADVVADTAVSAAALAAIDDQVALLDERPV
AVDSLWFDALRSVAVDHRDGPVVVHPSWWSAARVEVVTAAARTLTRDVVV
HPRSWLLRQASSGVSAATVVVEIAERLVLVAGAEVAAVARRTDAESVAGQ
VGSVIARMTRGITAVVLIDVPSTVAGAAALAAAIAGAVRGTGSSVVEIDG
VRLARLARAALPPSDEPADPAARPATRSRVPTLARVAAACVALALLAPAA
VVRHGATTLQRPPTTLLVEGRVALTIPADWSTQRVVSGPGSARVQVTSPA
DPEVALHVTQSPVPGETLPGTAQRLKRAIDASPAGVFVDFNPSDIRAGRP
AVTYREVRAGHQVRWTILLDGAVRISVGCQSGPGHEDLLREVCAQAVRSV
HAVG
>MT3814.1 hypothetical protein
MRRVARVGFRAIASSALVAGCGGSPALDSGRSQARRLSPGAAGRPRWIQG
DRKLGACRRVRRVARVRVLRADTMRSDQVMSLGRVILVL
>MT2326 hypothetical protein
MGRLLPTAELISQHGSKPVGIVATDDIEALA
>MT3972.1 hypothetical protein
MRTNPVPRSRMGRIAWSWFRHPVKSHEFPAKNERPITTEELLRFAV
>MT1285 hypothetical protein
MTVPGYRDALVCQLNFGDVLREAIADFAQWDNSHNRDCS
>MT1729 hypothetical protein
MTAHTHDGTRTWRTGRQATTLLALLAGVFGGAASCAAPIQADMMGNAFLT
ALTNAGIAYDQPATTVALGRSVCPMVVAPGGTFESITSRMAEINGMSRDM
ASTFTIVAIGTYCPAVIAPLMPNRLQA
>MT3524 conserved hypothetical protein
MNETPHAPVVEQVLVAAAFGNQPGSWPLPTAITPHHLWLRAVAAGGQGRY
AHAYGDLSVLRRLVPAGPLASLAHSTQGSLLRQLGWHTLARGWDGRALAL
AGADREAGADALIGLAADALGVGRFAAAGALLDRADPLVVSPLVADRLAV
RRRWVAAELAMATGDGATAVRHAEEAVELTQAMAVASARHRVKSDVVLAA
ALCSAGAVARARAVGEEALDATARFGLLPLRWALACLLIDIGTVTFSAQQ
LRELTKIRNICAGQVRRAGGCWRTA
>MT2075 hypothetical protein
MQPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQS
VSDHKWIAMTAECGWIGFHKDANIRRNAVERRTVLDTGARLFCVPRADIL
AEQVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL
>MT1178 hypothetical protein
MAAFRLAVMWIDDSSADVIKADFEALYHGDMLVEGDTSEQLEDLHPLPTA
S
>MT2781 conserved hypothetical protein
MSGMQTQTIERTDADERVDDGTGSDTPKYFHYVKKDKIAESAVMGSHVVA
LCGEVFPVTRAPKPGSPVCPDCKRIYDTLKKG
>MT3101 PPE family protein
MTAPVWLASPPEVHSALLSAGPGPGSLQAAAAGWSALSAEYAAVAQELSV
VVAAVGAGVWQGPSAELFVAAYVPYVAWLVQASADSAAAAGEHEAAAAGY
VCALAEMPTLPELAANHLTHAVLVATNFFGINTIPIALNEADYVRMWVQA
ATVMSAYEAVVGAALVATPHTGPAPVIVKPGANEASNAVAAATITPFPWH
EIVQFLEETFAAYDQYLSALLSELPAVAWVWFQLFVDILGFNIIGFIITL
ASNAQLLTEFAINASYVAVGLLYAIAGVIDIVVEWVIGNLFGVVPLLGGP
LLGALAEPPR
>MT2547.1 hypothetical protein
MRIAVRLPGEVITFVDSEVSQIRIPSRRAAVVLRASNASDAAILTATEPN
HHLDALAGQAAKLAPTSIDAAHPARPARRDPCLYPRTGQALPRTG
>MT2396 hypothetical protein
MAICDQTLGCASDGVSRFGRFEPNGRRLRNRPQNWGKRFTAGVSQLGEPI
PLAHGL
>MT2227 conserved hypothetical protein
MRASYAPPSSQGSRVARTRRRGMLAIAMLLMLVPLATGCLRVRASITISP
DDLVSGEIIAAAKPKNSKDTGPALDGDVPFSQKVAVSNYDSDGYVGSQAV
FSDLTFAELPQLANMNSDAAGVNLSLRRNGNIVILEGRADLTSVSDPDAD
VELTVAFPAAVTSTNGDRIEPEVVQWKLKPGVVSTMSAQARYTDPNTRSF
TGAGIWLGIAAFAAAGVVAVLAWIDRDRSPRLTASGDPPTS
>MT3209 PPE family protein
MVLGFSWLPPEINSARMFAGAGSGPLFAAASAWEGLAADLWASASSFESV
LAALTTGPWTGPASMSMAAAASPYVGWLSTVASQAQLAAIQARAAATAFE
AALAATVHPTAVTANRVSLASLIAANVLGQNTPAIAATEFDYLEMWAQDV
AAMVGYHAGAKSVAATLAPFSLPPVSLAGLAAQVGTQVAGMATTASAAVT
PVVEGAMASVPTVMSGMQSLVSQLPLQHASMLFLPVRILTSPITTLASMA
RESATRLGPPAGGLAAANTPNPSGAAIPAFKPLGGRELGAGMSAGLGQAQ
LVGSMSVPPTWQGSIPISMASSAMSGLGVPPNPVALTQAAGAAGGGMPMM
LMPMSISGAGAGMPGGLMDRDGAGWHVTQARLTVIPRTGVG
>MT2683 PPE family protein
MNFAVLPPEVNSARIFAGAGLGPMLAAASAWDGLAEELHAAAGSFASVTT
GLTGDAWHGPASLAMTRAASPYVGWLNTAAGQAAQAAGQARLAASAFEAT
LAATVSPAMVAANRTRLASLVAANLLGQNAPAIAAAEAEYEQIWAQDVAA
MFGYHSAASAVATQLAPIQEGLQQQLQNVLAQLASGNLGSGNVGVGNIGN
DNIGNANIGFGNRGDANIGIGNIGDRNLGIGNTGNWNIGIGITGNGQIGF
GKPANPDVLVVGNGGPGVTALVMGGTDSLLPLPNIPLLEYAARFITPVHP
GYTATFLETPSQFFPFTGLNSLTYDVSVAQGVTNLHTAIMAQLAAGNEVV
VFGTSQSATIATFEMRYLQSLPAHLRPGLDELSFTLTGNPNRPDGGILTR
FGFSIPQLGFTLSGATPADAYPTVDYAFQYDGVNDFPKYPLNVFATANAI
AGILFLHSGLIALPPDLASGVVQPVSSPDVLTTYILLPSQDLPLLVPLRA
IPLLGNPLADLIQPDLRVLVELGYDRTAHQDVPSPFGLFPDVDWAEVAAD
LQQGAVQGVNDALSGLGLPPPWQPALPRLF
>MT1856 PPE family protein
MTAALDFATLPPEINSARMYSGAGSAPMLAAASAWHGLSAELRASALSYS
SVLSTLTGEEWHGPASASMTAAAAPYVAWMSVTAVRAEQAGAQAEAAAAA
YEAAFAATVPPPVIEANRAQLMALIATNVLGQNAPAIAATEAQYAEMWSQ
DAMAMYGYAGASAAATQLTPFTEPVQTTNASGLAAQSAAIAHATGASAGA
QQTTLSQLIAAIPSVLQGLSSSTAATSASGPSGLLGILGSGSSWLDKLWA
LLDPNSNFWNTIASSGLFLPSNTIAPFLGLLGGVAAADAAGDVLGEATSG
GLGGALVAPLGSAGGLGGTVAAGLGNAATVGTLSVPPSWTAAAPLASPLG
SALGGTPMVAPPPAVAAGMPGMPFGTMGGQGFGRAVPQYGFRPNFVARPP
AAG
>MT0188 hypothetical protein
MWIRAERVAVLTPTASLRRLTACYAALAVCAALACTTGQPAARAADGREM
LAQAIATTRGSYLVYNFGGGHPMPLLNAGGHWYEMNNGGHLMIIKNASQR
LSPHLLVDTHTGDQARCEHNPGARTGEGLWQASEIYPPLKAWQRMGRPTI
AVNANFFDVRGQKGGSWRSTGCSSPLGAYVDNTRGQGRANQAVTGTVAYA
GKQGLSGGNELWSSLTTMILPVGGAPYVLRPKSRQDYDLATPVIEDLLNK
NARFVAVAGIGLLSPGNTGQLHDGGPSAARTALAYAKQKDEMYIFQGGNY
TPDNIQDLFRGLGSDTAILLDGGGSSAIVLRRDTGGMWAGAGSPKGSCDT
RQVLCDSHERALPSWLAFN
>MT0470 hypothetical protein
MGPSAAVRRIDSGDQLRNSHIHHPRNSTTYRVPLGLRPCPTSLLVAQPPP
ATRRDTMPLRGRGKHRGPHGLDTGARRMSQAPGNEIPHTLAALDWGGITC
QSGAGCTNRASYVVHLHAVDECNDPDLDPFGNTVEILCIACLWHVEAKVL
LQVGRLTRSPGAFCLTCGAPVREPTDIMRDVAAL
>MT2606 hypothetical protein
MRTTLQIDDDVLEDARSIARSEGKSVGAVISELARRSLRPVGIVEVDGFP
VFDVPPDAPTVTSEDVVRALEDDV
>MT3427.1 hypothetical protein
MNVNPGKRANEKEACGASVHAKRVAMVFAGAIAAASPR
>MT1909 hypothetical protein
MWRDQTCVAPHGVFVGAFLLFSGAFRWSSTVSAFRTVH
>MT0991 hypothetical protein
MSGYNLAAVLAGLPDDLIKPPVLPPPTMPSGPGPFGLPIPNPNYHP
>MT2283 hypothetical protein
MHGIGLPSPGGRAELSEICGHIASLPATAVGDGFVEGVDGIGDRPRAMAK
VHHAGVDGVPAPT
>MT3166 hypothetical protein
MRINVGHRRSLSSAGRVSPPVSSPVTPHYRQAAASRLDTHRTQKLRSQTN
GGKDRHQLTYEQFARMLTLMGPSDLWTVERAARHWGVSASRARAILSSRH
IHRVSGYPAQAIKAVTLRQGARTDLKTANHLVPAAQAFTMAETGAAIGET
EDERARLRIFFEFLRGADETGTSALDLIVDEPALIGEHRFDALLAAAAEY
ISARWGRPGPLWSVSIERFLDTAWWVSDLPSARAFAAVWTPAPFRRRGIY
LDRHDLTSDGVCVMPEPVFNRTELQRAFTALAAKLERRGVVGQVHVVGGA
AMLLAYNSRVTTRDIDALFSTDGPMLEAIREVADEMGWPRTWLNNQASGY
VSRTPGEGAPVFDHPFLHVVATPAQHLLAMKVVAARGVRDGEDIRLLLDR
LRITSAAGVWEIVARYFPAETITDRSRLLVEDLLNQ
>MT2365.2 hypothetical protein
MWRHLWLMQPQRRYPRGSGTTRTARRDAGVAPLYGVSRVTVLASTTATTA
PPVKSFPDLL
>MT2007 hypothetical protein
MQRWTVSPAARVEILGRYWWRIRRRATEGAKAKSKGKARRGSQFKVLEHG
>MT0569 hypothetical protein
MSAWFNYTATLKILIFSLLAGALLPGLFAVGVRLQAAGDGADATARRRPL
LVAVSWAIFALVLAVVIIGVLYIARDFIAHHTGWAFLGATPK
>MT0902 conserved hypothetical protein
MSVENSQIREPPPLPPVLLEVWPVIAVGALAWLVAAVAAFVVPGLASWRP
VTVAGLATGLLGTTIFVWQLAAARRGARGAQAGLETYLDPK
>MT1007 hypothetical protein
MSAGETRADENSCVKQHKSLLTLTGQRTEPAPCRRPVRRPLNSSASTRPL
PPWSPIPVPQPHHHRGDNRELTVTPGPNPGPELGRAVFNRYPPRHCRRDC
RRAHLRAEAHTKRGRPTMAVPKRRKSRSNTRSRRSQWKAAKTELVGVTVA
GHAHKVPRRLLKAARLGLIDFDKR
>MT2507 hypothetical protein
MGLRDADERWDTVGQAIGLFLRGHTLRTAAPTALIVGTVLCAVNQGATLA
EGAATIGTWVRMVINYLVPFLVASVGYLGARRGVRRASGRSDPSAQ
>MT3131 conserved hypothetical protein
MTKTFSHPHFFRSVLRWLQVGYPEGVPGPDRVALLSLLRSTPLTEEQIGE
VVRHFTENGSPAVADRVIDRDEIAEFISEVTHHDAGPENIQRVAGILAAA
GWPLAGVDVGESESGSDRAPASQG
>MT0563 conserved hypothetical protein
MDVALGVAVTDRVARLALVDSAAPGTVIDQFVLDVAEHPVEVLTETVVGT
DRSLAGENHRLVATRLCWPDQAKADELQHALQDSGVHDVAVISEAQAATA
LVGAAHAGSAVLLVGDETATLSVVGDPDAPPTMVAVAPVAGADATSTVDT
LMARLGDQALAPGDVFLVGRSAEHTTVLADQLRAASTMRVQTPDDPTFAL
ARGAAMAAGAATMAHPALVADATTSLPPAEAGQSGSEGEQLAYSQASDYE
LLPVDEYEEHDEYGAAADRSAPLSRRSLLIGNAVVAFAVIGFASLAVAVA
VTIRPTAASKPVEGHQNAQPGKFMPLLPTQQQAPVPPPPPDDPTAGFQGG
TIPAVQNVVPRPGTSPGVGGTPASPAPEAPAVPGVVPAPVPIPVPIIIPP
FPGWQPGMPTIPTAPPTTPVTTSATTPPTTPPTTPVTTPPTTPPTTPVTT
PPTTPPTTPVTTPPTTVAPTTVAPTTVAPTTVAPTTVAPATATPTTVAPQ
PTQQPTQQPTQQMPTQQQTVAPQTVAPAPQPPSGGRNGSGGGDLFGGF
>MT0055 conserved hypothetical protein
MDYTLRRRSLLAEVYSGRTGVSEVCDANPYLLRAAKFHGKPSRVICPICR
KEQLTLVSWVFGEHLGAVSGSARTAEELILLATRFSEFAVHVVEVCRTCS
WNHLVKSYVLGAARPARPPRGSGGTRTARNGARTASE
>MT1078 hypothetical protein
MQASDRTWQSNFIRRWYFTETVEYRPLVKYDASMSWDERTVSALEGAFRS
EVRARRVNGPHRDVIVSLDGAEFLVRWLTTGWPRQVAEALHATSRPDILA
APTMSPGARKAAHDAGVGWVDESGAADIHYRNTSTGTTLVIETKGAPPAP
LDARIGWRRATLAVCEALLANIAGPTVASVVEATGLSMGSSAQALKFLEK
NGHLASATARGPKSARLIVDRDALLDAYAEAADKLRSPISISTGVLWRDP
TAGVVKAGQLWDAAGIEWAATSALSASLLAPMQTEIAPMEIYVPGRSWSD
LRRAAMAAGLQEIAGGRLILRFFPTPACARLTEQNLQGFRSMLWPRVYAD
LRTAGVRGEDAAEHLREAMTK
>MT2998.1 hypothetical protein
MIELSYAPDVAGRRSNWPKGSGVNTWTAIRWTFAEDSPYVGTGLERMASD
THGGGGGRPVTPPPPGMHHLGCSRGVLLISSQRDAGHKTCDPAAGGTLTS
VLT
>MT0936 conserved hypothetical protein
MLSSTLLVVGAMAGRGLSVKLSSIAALVVSLLIVALTVWYYKLNVNPPVS
AEYGLYFGAAGGVCAVGCSLWAAVSAASPGRRRHREVVR
>MT1622 conserved hypothetical protein
MVHSIELVFDSDTEAAIRRIWAGLAAAGIPSQAPASRPHVSLAVAERIAP
EVDEPLGAVARRLPLDCVIGAPVLFGRANVVFTRLVVPTSELLALHAEVH
RLCGPHLAPAPMANSLPGQWTAHVTLARRVGGHQLGRALRIAGRPSRIDG
RFAGLRRWDGNTRAEYLLG
>MT3450 conserved hypothetical protein
MTVRAVLRRTVGAQWPILAGVNFWRRGALLIGIGVGVAAVLRLVLSEERA
GLLVVRSKGIDFVTTVTVAAAMVYIASTIDPLGTG
>MT1467 hypothetical protein
MITQNHRAYYCLKYLVRVGYCYPAVTTPGKPPSVLLYAPSACDESLPSPR
VATALVPGTRSANREFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPA
IPPASSQLVCVAPK
>MT1081 hypothetical protein
MRADVTAEHLTQVVRDIAVIDIDDGVAFNLDTSSVQEIRERADYPGLRVR
VAMSVGPWQGIAAWDVSTGEPIAPWPTRVTIDRILGEPITLLGYAPETII
AEKGVTILERGITSTRWRDYVDIVQLDRRGIDDDELLRSARAVAQYRGAT
LEPVAPHLAGYGAVAQAKWATEHGRCQHCWRHWKPAHVGRRNMDLLDAKQ
VSEMIGVPVGTLRHWRHSDIGPASFTLGRRVVYRRDEVSRWISKRESATR
R
>MT0923 hypothetical protein
MDFVIQWSCYLLAFLGGSAVAWVVVTLSIKRASRDEGAAEAPSAAETGAQ
>MT2690 PE_PGRS family protein
MGRPHLNIVAVRRISMSFVNVAPQLVSTAAADAARIGSAINTANTAAAAT
TQVLAAAQDEVSTAIAALFGSHGQHYQAISAQVAAYQQRFVLALSQAGST
YAVAEAASATPLQNVLDAINAPVQSLTGRPLIGDGANGIDGTGQAGGNGG
WLWGNGGNGGSGAPGQAGGAGGAAGLIGNGGAGGAGGQGLPFEAGANGGA
GGAGGWLFGNGGAGGNGGIGGAGTNLAIGGHGGNGGNAGLIGAGGTGGAG
GTGGGEPSAGASGGNGGNGGNGGLLIGNSGDGGAAGNGAGISQNGPASGF
GGNGGHAGTTGLIGNGGNGGAGGAGGDVSADFGGVGFGGQGGNGGAGGLL
YGNGGAGGNGGAAGSPGSVTAFGGNGGSGGSGGNGGNALIGNAGAGGSAG
AGGNGASAGTAGGSGGDGGKGGNGGSVGLIGNGGNGGNGGAGSLFNGAPG
FGGPGGSGGASLLGPPGLAGTNGADG
>MT0540 hypothetical protein
MSPMLLAFLTAAAPLAGLGLLQLQARLERWDYERHAED
>MT2736 hypothetical protein
MIAGVDQALAATGQASQRAAGASGGVTVGVGVGTEQRNLSVVAPSQFTFS
SRSPDFVDETAGQSWCAILGLNQFH
>MT0407 hypothetical protein
MKYNNIRTPCLYPIPVFWAGAYRLEGRRVVVGIGWWAVSLGGGCGGLGDH
VGNPAGLVVAVAFGGDVVGAVLRSGAGPARHHRGALPAAVGAGRVGSRWV
TGGGAQRREQQRAARGGDIGLVTRQRPNHGAVGQLLGAPAPEGGQQMMAP
ITRRTF
>MT3888 hypothetical protein
MGLWFGTLIALILLIAPGAMVARIAQLRWPVAIAVGPALTYGVVALAIIP
YGALGIPWNGWTALAALAVTCAVATGLQLLLARFRDLDAEALAVSRWPAV
TVAAGVLLGALLIGWAAYRGIPHWQSIPSTWDAVWHANTVRFILDTGQAS
STHMGELRNVETHAPLYYPSVFHGLVAVFCQLTGAAPTTGYTLSSLAASV
WLFPVSAAVLTWRAVRSHPGALWSASCASAEWRAAGAAGTAAALSASFTA
VPYVEFDTAAMPNLAAYGIAVPTMVLITSTLRHRDRIPVAVLALVGVFSL
HITGGIVVALLVSAWWLFEALRHPVRSRLADLLTLAGVAAMAGLVMLPQF
LSVRQQEDIIAGHAFPTYLSKKRGLFDAVFQHSRHLNDFPVQYALIVLAA
IGGLILLVKKIWWPLAVWLLLIVMNVDAGTPLGGPIGGVAGALGEFFYHD
PRRIAAATTLLLMLMAGVALFATVMLLVAAAKRLTDRFRPQPVSVWASAT
ATLLIGATLVSAWHYFPRHRFLFGDKYDSVMIDQKDLDAMAYLASLPGAR
DTLIGNANTDGTAWMYAVAGLHPLWTHYDYPLQQGPGYHRFIFWXYGRNG
ESDPRVLEAIQVLRIRYILTSTPTVRGFAVPDGLVSLETSRSWAKIYDNG
EARIYEWRGTAAATHS
>MT2794.1 hypothetical protein
MTSLPVHQASQPMPCLARQPVDLPPWAGPRCGPYCPRARITLLQRTTIAK
SNRKYYENGYPADVKLMPGHAAVVSNRAAARAGFALPCRKRQPD
>MT3229 hypothetical protein
MVLVSIGRLGCNPARDFGGCVRDIWAQQGQSEYMTEQEMTEQWLEGCAVQ
RIMFRDGLVLNFDDYNELVISVPLQLTLPAIETSPAEVVAIDPNDPADHE
RPLFDFAGATCTAFVWYDTGDLHLEFSDGHQIDVHPDDRVTAWELYGKYH
GYAACLAPGKLRVVRHDVADANGDQ
>MT0383 hypothetical protein
MSSSWRSRRGRQHDRGAVWPVLDDVARVVQGDRGDALRGEHVAVVGSRQA
LLDSCGDLGVEHHALLQRQVRHRQMPRERSFHAAPRRRVLVRHRPTDAGP
VVELLLGRTALTSTLGRRAVHQIRGDARDAVDGFGHDISPRV
>MT2276 conserved hypothetical protein
MAKPRNAAESKAAKAQANAARKAAARQRRAQLWQAFTLQRKEDKRLLPYM
IGAFLLIVGASVGVGVWAGGFTMFTMIPLGVLLGALVAFVIFGRRAQRTV
YRKAEGQTGAAAWALDNLRGKWRVTPGVAATGNLDAVHRVIGRPGVIFVG
EGSAARVKPLLAQEKKRTARLVGDVPIYDIIVGNGDGEVPLAKLERHLTR
LPANITVKQMDTVESRLAALGSRAGAGVMPKGPLPTTAKMRSVQRTVRRK
>MT4006 hypothetical protein
MADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTG
VVASHMTATEITNELNKVLTGGTRLAEGLVQAAALMEGHEADSQTAFQAL
FGASHGS
>MT1757 hypothetical protein
MPTRSPISAWLRPPGCACSTRSSSAARAITWMPPLRVSCSSADTSELLSG
RAGQLQRRLGFFDRGDRQVQGFGETYRARYQRQIARRQLALAQVQGVLQA
DSGVAAQRQAHGGQFHVGLTDRDDLPDRPGGQPAHHREQIIGGGGHSAPN
PEHDAELQGFGQQVFRPQSQARLQMPGVIDFQFRFDVQFPHACRQPTNRL
RRVAEFARAEAHRARIQRGHPRAQLDQLLALLEGHRQADPGGQLDQDRTP
LADEVHGPARDVRIRCGPFVFVAQVDVGHRGTGMVGLLDGCRDFLGARRQ
RRVVGLGGDGAGGCDGDDDAHSHRSSASLRSASSPVQLITPPRQPGARRS
PAAAPDDPCGCVSIAVRLWRGARCRASPTRFRRRRR
>MT3206 hypothetical protein
MRWDPRCRPGRSGVGDPHCDDPAGLLAAGAAAGRRHRAPGPAHRLRARAL
RVVRRLPRQEPRYRAGPGPVAPRLLPLPHLRAWDGAPWIWNLATAILPEA
TPIVDLYHARQHVHDLAGQLAPALGEHHSDWLTARLVDLDSGDIETLVQQ
PIGQHTGHT
>MT3403 hypothetical protein
MRRSRAVQHSSATMHAAQRREEEPSSQAQLDDDPHRSAARWVPPEPPPSK
LSPVPLYAAYGSNMHPEQMLERAPHSPMAGTGWLPGWRLTFGGEDIGWEG
ALATVVEDPDSKVFVVLYDMTPADEKNLDRWEGSEFGIHQKIRCRVERIS
SDTTTDPVLAWLYVLDAWEGGLPSARYLGVMADAAEIAGAPSDYVHDLRT
RPARNIGPGTIA
>MT2460 hypothetical protein
MTAAKNPRPDLRIALVARRHIDLKRVCSCGCRP
>MT2810 hypothetical protein
MAGLLRCLRHVQGPLARPGGAGGAIAEGNAVGRPSGSSPTRCRTSGSATA
SRLPGLIATLRGGPAKMRSVGETANASPRPLYDPHAQS
>MT3573.2 hypothetical protein
MTAGAGGSPPTRRCPATEDRAPATVATPSSADPTASRAVSWWSVHEHVAP
VLDAAGSWPMAGTPAWRQLDDADPRKWAAICDAARHWALRVETCQEAMAQ
ASRDVSAAADWPGIAREIVRRRGVYIPRAGVA
>MT2412 conserved hypothetical protein
MATRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAE
ATSLDTMAQMNQAFRNIVNMLHGVRDGLVRDANNYEQQEQASQQILSS
>MT3510.1 hypothetical protein
MRRTFLICPTRTDPDRLREAAISLALWRRDE
>MT3354 hypothetical protein
MNVARAIDLEDTEGLIAADRGALLRAASMAGAQVRAIAAAADEGELDLLR
GSDRPRSVIWVTGRGTAETAGTILASTLGAGAAEPIVLASAAPPWVGPLD
VLIVAGDDPGDPALVGAAAIGVRRGARVVVVAPYEGPLRDSTAGRVAVLE
PRLRVPDEFGLSRYLAAGLAALQTVDPKLRIDLASLADELDAEALRNSAG
REVFTNPAKALAARVSGCQLALAGDNAATLALARHGSSVMLRIANQVVAA
TRLSDAVVALRAGTPPDALFHDEEIDGPAPQRLRVLALALAGERTVVAAR
VAGLDDAYLVAAEDVPELLDAPVGSGGAVLAVRLEMAAVYLRLVRG
>MT1083.2 hypothetical protein
MATRHQRAGIDDRWHKRVKGPDGNRRTVRSAVCGRVSRWRVRWVDGGGEE
HSKSFQRKPDAQVPDPCRTVDARQGHGPVRNAHLGARLLRLTAVYGSVRL
LPCGASMWGIAC
>MT2168 hypothetical protein
MITWITMWVIAVDSFGTEHIATTRGTNLVHKSAPAAMRLRRS
>MT1287 hypothetical protein
MSARRIRSWKRFDNRSANAAEPDPQLAGTGGRPKVSTRALAQVIERSSRI
QGPAAQAYVARLRRAHPGASPAKIVAKLEKRFLSVVTASGAAVGAAATLP
GIGTLAAWFAAAGEVVVFLEATALFVLALASVHAIPLDHRERRRALVLAV
LVGDNTTAVADLLGPGRTSGGWVSETMASLPLPAISSLNSRMLKYVVKRF
ALKRGALMFGKLVPMGIGAIIGAIGNRLVGKKLVRNARSAFGTPPARWPV
TLHVLPTVRDAS
>MT0614 hypothetical protein
MDRHECQQQRPPPVPPARGQPGTRHGEHRREHRDPSRVIEKLGQQCGEPV
GKVKVPSGCRDAATADRQRENGHKSGGRIRAQQLPLPGNDQANQDHERQR
QNRQAVPQVHQIGLRRGQHPDDLRDGFLQRHPLRAGDQRTRDHRHEVDRR
QHRPDDVVGAPGQWLQQVTGNADVASVNSHVLTIFRIRARGVWCRFAAPR
TT
>MT3462 hypothetical protein
MNLRRHQTLTLRLLAASAGILSAAAFAAPAQANPVDDAFIAALNNAGVNY
GDPVDAKALGQSVCPILAEPGGSFNTAVASVVARAQGMSQDMAQTFTSIA
ISMYCPSVMADVASGNLPALPDMPGLPGS
>MT1406 PPE family protein
MVDFGALPPEINSARMYAGPGSASLVAAAKMWDSVASDLFSAASAFQSVV
WGLTTGSWIGSSAGLMVAAASPYVAWMSVTAGQAELTAAQVRVAAAAYET
AYGLTVPPPVIAENRAELMILIATNLLGQNTPAIAVNEAEYGEMWAQDAA
AMFGYAAATATATEALLPFEDAPLITNPGGLLEQAVAVEEAIDTAAANQL
MNNVPQALQQLAQPTKSIWPFDQLSELWKAISPHLSPLSNIVSMLNNHVS
MTNSGVSMASTLHSMLKGFAPAAAQAVETAAQNGVQAMSSLGSQLGSSLG
SSGLGAGVAANLGRAASVGSLSVPQAWAAANQAVTPAARALPLTSLTSAA
QTAPGHMLGGLPLGQLTNSGGGFGGVSNALRMPPRAYVMPRVPAAG
>MT2787 conserved hypothetical protein
MARDQGADEAREYEPGQPGMYELEFPAPQLSSSDGRGPVLVHALEGFSDA
GHAIRLAAAHLKAALDTELVASFAIDELLDYRSRRPLMTFKTDHFTHSDD
PELSLYALRDSIGTPFLLLAGLEPDLKWERFITAVRLLAERLGVRQTIGL
GTVPMAVPHTRPITMTAHSNNRELISDFQPSISEIQVPGSASNLLEYRMA
QHGHEVVGFTVHVPHYLTQTDYPAAAQALLEQVAKTGSLQLPLAVLAEAA
AEVQAKIDEQVQASAEVAQVVAALERQYDAFIDAQENRSLLTRDEDLPSG
DELGAEFERFLAQQAEKKSDDDPT
>MT2709 hypothetical protein
MNAYDVLKRHHTVLKGLGRKVGEAPVNSEERHVLFDEMLIELDIHFRIED
DLYYPALSAAGKPITGTHAEHRQVVDQLATLLRTPQRAPGYEEEWNVFRT
VLEAHADVEERDMIPAPTPVHITDAELEELGDKMAARIEQLRGSPLYTLR
TKGKADLLKAI
>MT2958.1 hypothetical protein
MWTSAGPADRHSWRPKPGPTSPPSADRRRPGTRGARPPARRSRWAPTPAD
RPGPGRLPARSWCVAHPGGPVKPPAVPRAARRIRPSRCRPRKPADRGARP
GRHRGAPDSNWAMRRRTAGLPPAAAAAASPAGSRRHGPAPHQAATSKGSV
RCGSEPSVPLQCSAARAIWPIAAARFVDNVVAVDSAAGKITSWVGVNYSA
QLASAG
>MT2172 conserved hypothetical protein
MQRIIGTEVEYGISSPSDPTANPILTSTQAVLAYAAAAGIQRAKRTRWDY
EVESPLRDARGFDLSRSAGPPPVVDADEVGAANMILTNGARLYVDHAHPE
YSAPECTDPLDAVIWDKAGERVMEAAARHVASVPGAAKLQLYKNNVDGKG
ASYGSHENYLMSRQTPFSAIITGLTPFLVSRQVVTGSGRVGIGPSGDEPG
FQLSQRSDYIEVEVGLETTLKRGIINTRDEPHADADRYRRLHVIIGDANL
AETSTYLKLGTTALVLDLIEEGPAHAIDLTDLALARPVHAVHAISRDPSL
RATVALADGRELTGLALQRIYLDRVAKLVDSRDPDPRAADIVETWAHVLD
QLERDPMDCAELLDWPAKLRLLDGFRQRENLSWSAPRLHLVDLQYSDVRL
DKGLYNRLVARGSMKRLVTEHQVLSAVENPPTDTRAYFRGECLRRFGADI
AAASWDSVIFDLGGDSLVRIPTLEPLRGSKAHVGALLDSVDSAVELVEQL
TAEPR
>MT3280 hypothetical protein
MDHPTEREWASIAEHTRASNFTGDLLRMPPYPLILTLRTLVGSAEVVTAS
HTLFLSAATEY
>MT1858 hypothetical protein
MTRKVCRALAADLPQDAMQLQRTMGQCRPMRMLVALLLSAATMIGLAAPG
KADPTGDDAAFLAALDQAGITYADPGHAITAAKAMCGLCANGVTGLQLVA
DLRDYNPGLTMDSAAKFAAIASGAYCPEHLEHHPS
>MT3767.1 hypothetical protein
MTVSYHATQSAGEPVGRCFGNAHTVVRDPVLATHRDCASQGKTSPDRALL
ACGILEEDRWCRRCGEEGSPRDTVTRRLTHWCGLHPGVSVDHSCLSRCCR
LDCRRRGPAVPQRDPPRR
>MT3437 hypothetical protein
MVLIGAAILHDGPAAADPNQDDRFLALLEKKEIPAVANVPRVIDAAHKVC
RKLDGGMPVNDIVDGLRNDAYNIDPVMRLYPVRLTTTMTRFISAAVEIYC
PNHHSKMAFAMANFEPGSNEPTHRVAASTRSAVNSGSDLRASVSDMTIMS
PGWREPTGAMLASVLGAVRAGDPLIPNPPPIPVPPPAAQTLIPPPPIVAP
PPPRPAPPQQPPPPPPEVEPPAGVPQSGGAAGSGGAGSGGGGGGDGPVEP
SPARPMPPGFIRLAP
>MT2930 conserved hypothetical protein
MTETGGDMVALRVSDADRNGTMRRLHNAVALGLINIDEFEQRSSRVSFAC
TRSELDGLVGDLPRPGAIVTSAADRVELRGWAGSLKRHGEWIVPTRLALV
RRLGSIELDLVKARFAGPVVVIELDMMFGSLEVRLPNGASASIDDVEVYV
GSASDRRKDAPAEGTPHVVLTGRMVCGSVVIKGPRRALLRRHRG
>MT3580.1 hypothetical protein
MVSSMRFALSRRLTVTIVTDVSTRRAGLERGRF
>MT3410 conserved hypothetical protein
MGRLAAVAILCRMVADLVPIRLSLSAGDRYTLWAPRWRDAGDEWEAFLGK
DDDLYGFESVSDLVAFVRTDTENDLVDHPAWQDLTGAHAHNLNPAEDNQF
DLVVVEELLAEKPTAESVAALAASLAIVSAIGSVCELAAVSKFFNGNPIL
GTVSGGLEHFTGKAGNKRWNSIAEVIGRSWDDVLAAIDEIISTPEVDAEL
SEKVAEELAEEPEGAEEVAAEVEATQDTQEAAESDDEEADAPGDSVVLGG
DRDFWLQVGIDPIQIMTGTATFYLRCYLDDRPIFLGRNGRISVFGSERAL
ARYLADEHDHDLSDLSTYDDIRTAATDGSLAVAVTDDNVYVLSGLVDDFA
DGPDAVDREQLDLAVELLRDIGDYSEDSAVDKALETTRPLGQLVAYVLDP
HSVGKPTAPYAAAVREWEKLERFVESRLRRE
>MT1596.1 hypothetical protein
MPNGVLGLGNPSRLAALYGLQLAHESQCCQMHNLPSAARQVTVACREEVG
ITTILAGRDECGVCDKTAGLDGAAP
>MT2335 conserved hypothetical protein
MSYVAAEPGVLISPTDDLQSPRSAPAAHDENADGITGGTRDDSAPNSRFQ
LGRRIPEATAQEGFLVRPFTQQCQIIHTEGDHAVIGVSPGNSYFSRQRLR
DLGLWGLTNFDRVDFVYTDVHVAESYEALGDSAIEARRKAVKNIRGVRAK
ITTTVNELDPAGARLCVRPMSEFQSNEAYRELHADLLTRLKDDEDLRAVC
QDLVRRFLSTKVGPRQGATATQEQVCMDYICAEAPLFLDTPAILGVPSSL
NCYHQSLPLAEMLYARGSGLRASRNQGHAIVTPDGSPAE
>MT3255 hypothetical protein
MRMPGTKPGSDKPTGRVVVVIVLLMLAGAALRGHLPADDGAPLAAAGGSR
AALMFIVAALAATLALIALAIITRLRHPLPVAPSAGELSAMLGGAAGRPN
WRVLLLGLGTILAWLLIAILLARLFVPDDVGPAAPIPDSTATPDASSTTP
SRPQPPQDNNDDVLGILFASTIGLFLMVVAGSLITSRRQRKSAPARISGD
RIESPAPSARSESLARAAEIGLAEMADLRREPREAIIACYVAMERELSHV
PGVAPQDFDTPTEVLARAVEHRALHGASAAALVSLFAEARFSPHVMNEEH
REVAMRLLRLVLDELSTRTAI
>MT1119 PE family protein
MGKGLRMSYMIATPAALTAAATDIDGIGSAVSVANAAAVAATTGVLAAGG
DEVLAAIARLFNANAEEYHALSAQVAAFQTLFVRTLTGGCGVFRRRRGRQ
CVTAAEHRAAGAGRRQRRRRSGDGQWRLRQQRHFGCGGQPEFRQHSEHRR
>MT3578 hypothetical protein
MRPVDEQWIEILRIQALCARYCLTIDTQDGEGWAGCFTEDGAFEFDGWVI
RGRPALREYADAHARVVRGRHLTTDLLYEVDGDVATGRSASVVTLATAAG
YKILGSGEYQDRLIKQDGQWRIAYRRLRNDRLVSDPSVAVNVADADVAAV
VGHLLAAARRLGTQMSDT
>MT2804 hypothetical protein
MAREWSYWTRNKLEILAGYLPAFNRASQTSRERIYLDLMAGQPENIDRDM
GEKFDGSSLIAMKADPPFTRLRFCELNPLASELDVALRTRFPGDGRYRVV
AGDSNVTIDETLAELGPWRWAPTFAFIDQQAAEVHWETINKVAAFRQNPR
NLKTELWMLMSPTMIARGVKGTNAELFIEQVTRMYGDADWKRIQAARWRH
HLTAPAYRAEMVNLMRVKLEYELGYKYSHRIPMQMHNKVTIFDMVFATDH
WAGDAIMCHLYNRAAQKEPEMMRQAKSAKQQKESEDRGEMGLFSVGELAV
QDSNAGQILWAPSPTWDPRARGWWSEDPGF
>MT2668 hypothetical protein
MDAAAVAFAALSDVPIFATFCAAVASISGAAVTNDISHPPNPSCDTCHAP
>MT0446 AT103, tuberculin related peptide
MLVTVGSMNERVPDSSGLPLRAMVMVLLFLGVVFLLLVWQALGSSPNSED
DSSAISTMTTTTAAPTSTSVKPAAPRAEVRVYNISGTEGAAARTADRLKA
AGFTVTDVGNLSLPDVAATTVYYTEVEGERATADAVGRTLGAAVELRLPE
LSDQPPGVIVVVTG
>MT1930 MI22, lipoprotein antigen, 22 kDa
MSRATMLRPEWRWLVCNRLVTVTGVAMVVAAGLSACGQAQTVPRKAARLT
IDGVTHTTRPATCSQEHSYRTIDVRNHDSTVQAVVLLSGDRVIPQWVKIR
NVDGFNGSFWHGGVGNARADRARNTYTVAGSAYGISSKKPNTVVSTDFNI
LAEC
>MT0323 baiE, baiE protein
MCCNGVVTPGDPADIAAIKQLKYRYLRALDTKHWDDFTDTLAEDVTGDYG
SSVGTELHFTNRADLVDYLRQALGPGVITEHRVTHPEITVTGDTATGIWY
LQDRVIVAEFNFMLIGAAFYHDQYRRTTDGWRISATGYDRTYEATMSLAG
LNFNIRPGRALAD
>MT0826 cpsY, cpsY protein
MPKISSRDGGRPAQRTVNPIIVTRRGKIARLESGLTPQEAQIEDLVFLRK
VLNRADIPYLLIRNHKNRPVLAINIELRAGLERALAAACATEPMYAKTID
EPGLSPVLVATDGLSQLVDPRVVRLYRRRIAPGGFRYGPAFGVELQFWVY
EETVIRCPVENSLSRKVLPRNEITPTNVKLYGYKWPTLDGMFAPHASDVV
FDIDMVFSWVDGSDPEFRARRMAQMSQYVVGEGDDAEARIRQIDELKYAL
RSVNMFAPWIRRIFIATDSTPPPWLAEHPKITIVRAEDHFSDRSALPTYN
SHAVESQLHHIPGLSEHFLYSNDDMFFGRPLKASMFFSPGGVTRFIEAKT
RIGLGANNPARSGFENAARVNRQLLFDRFGQVITRHLEHTAVPLRKSVLI
EMEREFPEEFARTAASPFRSDTDISVTNSFYHYYALMTGRAVPQEKAKVL
YVDTTSYAGLRLLPKLRKHRGYDFFCLNDGSFPEVPAAQRAERVVSFLER
YFPIPAPWEKIAADVSRRDFAVPRTSAPSEGA
>MT3901 embA, arabinosyl transferase
MPHDGNERSHRIARLAAVVSGIAGLLLCGIVPLLPVNQTTATIFWPQGST
ADGNITQITAPLVSGAPRALDISIPCSAIATLPANGGLVLSTLPAGGVDT
GKAGLFVRANQDTVVVAFRDSVAAVAARSTIAAGGCSALHIWADTGGAGA
DFMGIPGGAGTLPPEKKPQVGGIFTDLKVGAQPGLSARVDIDTRFITTPG
ALKKAVMLLGVLAVLVAMVGLAALDRLSRGRTLRDWLTRYRPRVRVGFAS
RLADAAVIATLLLWHVIGATSSDDGYLLTVARVAPKAGYVANYYRYFGTT
EAPFDWYTSVLAQLAAVSTAGVWMRLPATLAGIACWLIVSRFVLRRLGPG
PGGLASNRVAVFTAGAVFLSAWLPFNNGLRPEPLIALGVLVTWVLVERSI
ALGRLAPAAVAIIVATLTATLAPQGLIALAPLLTGARAIAQRIRRRRATD
GLLAPLAVLAAALSLITVVVFRDQTLATVAESARIKYKVGPTIAWYQDFL
RYYFLTVESNVEGSMSRRFAVLVLLFCLFGVLFVLLRRGRVAGLASGPAW
RLIGTTAVGLLLLTFTPTKWAVQFGAFAGLAGVLGAVTAFTFARIGLHSR
RNLTLYVTALLFVLAWATSGINGWFYVGNYGVPWYDIQPVIASHPVTSMF
LTLSILTGLLAAWYHFRMDYAGHTEVKDNRRNRILASTPLLVVAVIMVAG
EVGSMAKAAVFRYPLYTTAKANLTALSTGLSSCAMADDVLAEPDPNAGML
QPVPGQAFGPDGPLGGISPVGFKPEGVGEDLKSDPVVSKPGLVNSDASPN
KPNAAITDSAGTAGGKGPVGINGSHAALPFGLDPARTPVMGSYGENNLAA
TATSAWYQLPPRSPDRPLVVVSAAGAIWSYKEDGDFIYGQSLKLQWGVTG
PDGRIQPLGQVFPIDIGPQPAWRNLRFPLAWAPPEADVARIVAYDPNLSP
EQWFAFTPPRVPVLESLQRLIGSATPVLMDIATAANFPCQRPFSEHLGIA
ELPQYRILPDHKQTAASSNLWQSSSTGGPFLFTQALLRTSTIATYLRGDW
YRDWGSVEQYHRLVPADQAPDAVVEEGVITVPGWGRPGPIRALP
>MT3902 embB, arabinosyl transferase
MTQCASRRKSTPNRAILGAFASARGTRWVATIAGLIGFVLSVATPLLPVV
QTTAMLDWPQRGQLGSVTAPLISLTPVDFTATVPCDVVRAMPPAGGVVLG
TAPKQGKDANLQALFVVVSAQRVDVTDRNVVILSVPREQVTSPQCQRIEV
TSTHAGTFANFVGLKDPSGAPLRSGFPDPNLRPQIVGVFTDLTGPAPPGL
AVSATIDTRFSTRPTTLKLLAIIGAIVATVVALIALWRLDQLDGRGSIAQ
LLLRPFRPASSPGGMRRLIPASWRTFTLTDAVVIFGFLLWHVIGANSSDD
GYILGMARVADHAGYMSNYFRWFGSPEDPFGWYYNLLALMTHVSDASLWM
RLPDLAAGLVCWLLLSREVLPRLGPAVEASKPAYWAAAMVLLTAWMPFNN
GLRPEGIIALGSLVTYVLIERSMRYSRLTPAALAVVTAAFTLGVQPTGLI
AVAALVAGGRPMLRILVRRHRLVGTLPLVSPMLAAGTVILTVVFADQTLS
TVLEATRVRAKIGPSQAWYTENLRYYYLILPTVDGSLSRRFGFLITALCL
FTAVFIMLRRKRIPSVARGPAWRLMGVIFGTMFFLMFTPTKWVHHFGLFA
AVGAAMAALTTVLVSPSVLRWSRNRMAFLAALFFLLALCWATTNGWWYVS
SYGVPFNSAMPKIDGITVSTIFFALFAIAAGYAAWLHFAPRGAGEGRLIR
ALTTAPVPIVAGFMAAVFVASMVAGIVRQYPTYSNGWSNVRAFVGGCGLA
DDVLVEPDTNAGFMKPLDGDSGSWGPLGPLGGVNPVGFTPNGVPEHTVAE
AIVMKPNQPGTDYDWDAPTKLTSPGINGSTVPLPYGLDPARVPLAGTYTT
GAQQQSTLVSAWYLLPKPDDGHPLVVVTAAGKIAGNSVLHGYTPGQTVVL
EYAMPGPGALVPAGRMVPDDLYGEQPKAWRNLRFARAKMPADAVAVRVVA
EDLSLTPEDWIAVTPPRVPDLRSLQEYVGSTQPVLLDWAVGLAFPCQQPM
LHANGIAEIPKFRITPDYSAKKLDTDTWEDGTNGGLLGITDLLLRAHVMA
TYLSRDWARDWGSLRKFDTLVDAPPAQLELGTATRSGLWSPGKIRIGP
>MT3917 erp, exported repetitive protein
MPNRRRRKLSTAMSAVAALAVASPCAYFLVYESTETTERPEHHEFKQAAV
LTDLPGELMSALSQGLSQFGINIPPVPSLTGSGDASTGLTGPGLTSPGLT
SPGLTSPGLTDPALTSPGLTPTLPGSLAAPGTTLAPTPGVGANPALTNPA
LTSPTGATPGLTSPTGLDPALGGANEIPITTPVGLDPGADGTYPILGDPT
LGTIPSSPATTSTGGGGLVNDVMQVANELGASQAIDLLKGVLMPSIMQAV
QNGGAAAPAASPPVPPIPAAAAVPPTDPITVPVA
>MT3704 lsr2, lsr2 protein
MAKKVTVTLVDDFDGSGAADETVEFGLDGVTYEIDLSTKNATKLRGDLKQ
WVAAGRRVGGRRRGRSGSGRGRGAIDREQSAAIREWARRNGHNVSTRGRI
PADVIDAYHAAT
>MT0046 mtc28, proline rich 28 kDa antigen
MIQIARTWRVFAGGMATGFIGVVLVTAGKASADPLLPPPPIPAPVSAPAT
VPPVQNLTALPGGSSNRFSPAPAPAPIASPIPVGAPGSTAVPPLPPPVTP
AISGTLRDHLREKGVKLEAQRPHGFKALDITLPMPPRWTQVPDPNVPDAF
VVIADRLGNSVYTSNAQLVVYRLIGDFDPAEAITHGYIDSQKLLAWQTTN
ASMANFDGFPSSIIEGTYRENDMTLNTSRRHVIATSGADKYLVSLSVTTA
LSQAVTDGPATDAIVNGFQVVAHAAPAQAPAPAPGSAPVGLPGQAPGYPP
AGTLTPVPPR