TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Mycoplasma mycoides subsp. mycoides SC str. PG1, PG1
Gene type: CDS

Number of genes found: 159

Free access
Sort by:

 



# Mycoplasma mycoides subsp. mycoides SC str. PG1, PG1

>MSC_0572 Hypothetical protein
MDVGTNKVLDIYFKSQEESLNSYQKIFENVFQKYGYPMRIITDNRTNFKK
RWNKQP
>MSC_0576 Hypothetical protein
MKLNDKLKNFFNNIKSYFTTKEKIIVKNKPKPIETKTENNNNNLDNNSQF
YHDISNNKEYIDKRATLDSQNEFILKVISNKAELLEQLVDIKNTFKHCED
CLNIYKKNLDDMKLKILRLKKHIDNNYGFLGDEKEYQNYVFIDDIKTYSQ
TDESAGLKLVHKLEDHFNKYSNYDIDYFVPCDNHKNLIDKYKIVSIKIKD
LDKIISN
>MSC_0370 conserved hypothetical protein
MAIIDQLFDWQKELIIKNKHKNQLGLFLDMGSGKTILSLGFCEINQVDAI
IVIASRSKTLETIEQNASFTNYLQNDLNFKVYLKTDLKNADFKNNQKEAI
IINYHSFLLNINKENFFNYLAKFISVHKNQKIAIIIDESQNIKNSRALIS
ISIKNFVDLVKKLAFKTYLYLLSGTPQTTGAIDMFNQLRFLGLKMTRYQF
EDQFCIKTKFNNAYRRVYKINGYKNIKQLEQLINQFALTSKNIPPIYLPK
QQFIGIKLPISSEFELFTNKKITEYDLLTFCNKNNIKSNYKPNELNLIKN
PYYKNLSYPDDKWIAGDDATFWLRTRQIPIGFQGNSKDYYWFDYSRLNKL
EQLLTDKIDNYIIFYNYDPEFDEIYKICKKLDYKIDIWNGNKKNITNYLE
FLNLDQNNKNSSKKRVIIANYFSASTGNNWQEYDKTILFSLATYGYHIQA
LKRNHRIGTKNTITYYLFLQDNYLDKSMWGSIQKRQIYNTEIFIKTTKEF
NKLKSQYKNLTFNF
>MSC_0963 Conserved hypothetical protein
MPKINFKKYVWHKYAISASLIIAATTAILGSAYKYSNTSVEKLGRKNFSD
EESFKNAFIDPNALPVTQFSEIHNNKIKIDYDPLKDVVKVNGKELSVDKY
LDQYYKKYHALPYLNIRYGSFNFYNQYIEAVSPQEFFKFTKWFMKNVSWG
PEIITLKSFSIVKGVEINGNSITLGAHSNKNKEETTIKFYPDAFFGTLPI
YSSLSGRGNAHESLTYKLNQQVLTKKDLESFLTNISNYNSLANLSQQTID
KSYFRGITNVSFLKGQKVFAYKQKDWYNNFSNSFSELGKKRLEDKSDYLF
VIPASNLNEAKNKLEIKLSEYKKNDKYNLFKNIDSKNVVLEEKTIESAEV
KQDQAKLDGNNNVIDKKLLITFNDNTKYFIHNAFGDVLTEQINNNKKTFV
ALKNYVQFSKAVEEFKSKLKNIETQIKELKKSKIDKNVQFEELEKEVKSI
SDKSNKLSDLKNGLEQSKILKEVSKKLFDLKTKILQTLNQSTKNRQFKEQ
NYYNVDFAYTLPELISIESLVENRNKLKLNWRDFYNIDEFISDRENINKG
ITDFFVYSKHASIFNKSSEKISYQQLLDENDLIVAKTKIELVDILTKRKL
LKPNADQNEINKLVFKADITNVKKENKNLLITIKEVLKDKTADKQKEFGS
KVVNLSISSDTNHGVIRAINDFFSVLGYKKMVSPSVLKEESDLRNPITGE
IKKTFDVYTDAYEGLIDELLEKVPYAAQWLEGPHIKKVIDKNGVMQYKLE
NGKYLGFTKDDRIGLWAILKMSDKNFKGISTDFLKFVGAHEYGHHITLNG
AHDLGNKGSDPIFISALTPGATPNINNYYSREAVDLFLKARTHIELETKR
LLDQFGAIKDYGEYATFNFAKKDKNGSISFDTKSNKKGLEDDSDIWGVNI
EDPNIRTAFSNKKRRFLQDFAGLLEAVRERQKDNGLTSEQDKKWLSSFDL
WVANAIDFYSGTLNPTVNSSNKENEVVKYMYKDEKGELKFRPASLKMLNG
ILKDGKGNPVQFQINTNENNETVEPIVVKGEKDSEGNYTKITEVLIYNRD
GSPIINVPLGVDLKNEGANAAQTIEFANEKIKTVEKTIKSLVVDKYTING
WNKTDSRLSLDSKIDLNYPALKNIFGAGLPSEANEMFKKIYGNYVSNRDV
DKGSYKDTNKDENKIKYYNLDGTENKISDKYKDQQSNLIYANPKNSKTVN
GNGQISFASLLATMFTAGYSQDNPTNGGSAQVLWIDKNKMYMPNVKLQDA
YTDLYFLSKLPKEYGKIFEAKKILKWMSAYTPTFIASKFNKDGIWNMINK
SEQVVSLDSSKDNTILDVVKLKLNYGDASKTLDYNVLNGLFGSFNNFRNN
TLEFSDTQKWLDFVNVDLTKAKYSASDETVNWDKDYVKGKIDINKFKNNF
KTYVLDKLDSFTSMNENQKEAFRKFYEKANKDKTDQIWANEIMRRFSSSY
FAMYNSALSIDEIKNNKELGWIFDSVHGYGDFKKTEFKIKDPDKNKWEIT
TEDMLNAYDKFAKELKVEIKQLNLVDALVFDNKIQLYTDQTKIHIANKKF
DLLSIFSSLATAQFHTSPSQDVLEYFHNKTERKFNELFSDYTYSFAEVIN
RDNLQITYSPSNHNFGNMPSFLSNISEATTGLEYVVDGTTTAKWKKRAIK
INDRDGRNGIVNTILDYEKLVDNEAKNKAEALNLKYHNSKLSQKSNLSDD
SNYNNSYFGEFQSINNGWFKDRWYRDFLDFKLYDDYGKPIEDKTIRIKDL
ENKVVTSRVNAYWQYYIQSQGVGKRNISGIWRDATKDAVAMFGYLSSDVA
NKANYLAFKNQETGEVKTVKINKQFSSNMFYYKTQNIENEAKYEAAKTDE
ERNAIRHTLAHEKYDYKDTNGHHKGTGFVSWVSDYAIMSKYQNALLIPGQ
KYSVYFSSDKKGTKDIIKTDLGDFQSIAENGKTFSQAPIRMEKSKNPVKV
DANGFKHYENTLYVYDQFNGVK
>MSC_0410 DNA-binding protein HU (HB)
MTKKELIEEIIINENISKVDAEKVVNRIFQTISKHLIEGKEVSVAGFGKF
VISERSSREGVNPSTGEKIIIPASRSARFKPAKQLKESLM
>MSC_0316 Conserved hypothetical transmembrane protein
MKKSIKKAKPWLISLIVMFAFISGFLFFIRSYVDSSKQNYLNKIQSYIDA
SSYLTKSKILKSVEDLNEDYVNKKLSEQYLEQEFGKDFIWKPDKIDANLK
NKVSDIYRTYFGKSIDVIEKDLKLQYRDNNKNLVDITDLKNGEFIPKNID
KITALTRSSEDFLNGFSPSLASLGLSFFQSQALENEDDLNKIKNNNILKS
VVDFINNNQSLIKLLSKIFSTKNVDKSFYKDLTIKQVFNKNINLITSSFT
KKTYSDNNNDYFADDISDLVISDVKNIVLQEWNKDKKNIEILDKFKTIFN
KVKDYFLKFDYLKILDNLFRYVQSELYFSMYYVINEQYSNPSQLLNQKIE
DKRFDPLINNKLDFSLLINGFSKVLEDKKYTTRLLDFLFKRTDKTKIYFD
HKTIPQNIGTSSLILDVINLIQKTILKTNQLIKEAIEGVENYIKENMYEL
REKIRNTILSVLKNKVNSFSHSVHFGYIHTNEDYNKLTVQIYTNLWLTKF
YYVDANIYIFGKNGLVTKIFDLFKQIGNSISTTSSDAFNFLKSVLYRDLD
SNIGFKDNFEEISKLINIGHNLFSDQKMLKIDLSLGSLKQIASIKDLYGI
LNLPNELDTLKNLLNTFGLSSFISKVEKGVEPLQKILDLLKDFGFISDTK
KFIQQFDTYIKKIIKYIPDKYQSVNMQLNYDLISNLYPNTKTNTNFISDF
TTRLAEFLFPKNNKNEDDELLLPIIRLIRNDKDKKITSVEQIKKDLTNYS
EKILSKDSLFKNIIDIRDLKIQLPDNLLKHLKIENIKDLTVLELLQITSK
YVNQFLKNNPDKSLTFDLSSIGFILKALSASVNITYIDNKKITNKNLFKA
LYDDLDSFNHKNDENNKGRKEESFYNWSSIVLNLGDGIKRTIDLKQIKND
FSYSPLHLLLGIDLNKVQYIKNTIGYALATLIGGINITDPNYELANQNRK
SVITIFAILNFVLQNQESILKNLEYQKAGIYYKKDSWTTKLINSNDKEIK
YYLIRNLTSESEINKRVGNCFEVTLTNDNNSSYWKISNIVALDYKN
>MSC_0186 DNA methylase
MKKLQKKGKIMIKLNQVYNIDCLDGLKQLKDNSIDLVYLDPPFFTQKAHF
LVDKTNKKYFFNDIWKDLKEYQEFLKIRLIEIKRVLKSTGSVFVHCDKTA
NHIIRVLLDEIFGSINFRSEIIWVYKRWSNSKKGLLDSHQNIYHYSKTND
FKFNVIYTDYSLTTNIDQILQLRVKDKNNKIVYKKDKNNNVVFSDLKKGV
PLSDVWNIPFLNPKAKERTSYPTQKPIELLERIISLVTNENDVVLDPFVG
SGTSVVASKLLNRNFIGFDINIDAIDITNQRLKNPIKSESNLLKNGIDKY
DTKTKQQKQILSRYDCDIVWRNKGLDAILKQKINNKTVGIKIQKDSETLK
QSQKLLFNCMKSKNLESEIRQTIRDNNNSFNRISSKLTDTKSIPEHIWNT
TINVNLKIDDSNKNSVLSSTSINNVSIGVYDIDEITTRKNIWIDIDELKK
LVHNSNSEQQKEKLFHYLKKNNPTITDESVFNRDSYNYDFSNTELKEVVT
STESGPITLSGKQTIHHYYFSGMNYKQSWYQTKLKINKNNSNKNFVDWNN
KDTIELTFYVFIPVPTKVGFENRDYLRPFKIGDLDGAFIIIDKDGSERRV
IRKTLLVKDTEDDWNEEYEYKKWKKQGWKVEEIINWNHILKRHKLPYTIK
RFPDTELNDYPF
>MSC_0810 Putative variable surface protein
MKKLLTILTTSSAIFTLPAITLLITRSNTQFEFKTYKNKFNSREHKIDKN
GRVTEIGYTVLPNGVIKIKRFDYKVKIIAAKLPEEITSLNNAFLLNPHNI
KWEVDWDTKNITDMSYAFYNTIWINSEKISKWNTSKVTNMEGMFGLTKSF
DQDISNWDVSNVKNFKNMFDRAKKFNNKNKPLNWNSKLKSAKNMQGMFKS
TDLFNQDISDWDLSNVTNISQMFSESKSFNKNISKWDVSNVKDMSKLFEN
AYAFNNGEKPLDWGHKLKSIKNMSSMFNGASKFTHNLSSWLMNDIVKNDN
FGLNKEKQPKWKVEEKKPVNDSLTQPQPNSSSDNSLPRENSESSSISNTE
AESTLPKVDKTKKQSEAKNKIPVEKGELPKDENQTTKTSNAIKDKENSSI
KSDSLYKIPSKPNTIISKPSSANAGIIAGVVLGSFTILGTTAGLSYYYRK
NLKNLYLKSADKIKPSLLKSKDNIKDFYVKSIDKTKNLYFKSKNKIKDKI
AKIRSKK
>MSC_0455 putative ATPase, AAA family
MNQPLSFLLRPKTTKDIIGQTEILKPKGLINKMILNNYCTSLIFYGPSGV
GKTSFAISLANDLKIDYEIFNASYDKKEKLTNIIQTALKQKRFILIIDEI
HRLNKDKQDILLEYMEKGNIFVFSTTTENPFFVINPALRSRSLLIELKPV
NKEELISYAKKVINQYQLKINISDEALDFLAQISVGDIRTFLNYLELIDR
LYSNEFIDIKEIKTIITVSKNPTAQNSDDFHDLKSALQKSIRGSDVDAAL
YYFSRLIETGDYESLMRRMIIIGYEDIGLANPAISSRVVQACSAFRQIGF
PEGIIPLGLAIVEMSLSLKSNSAYLATNSALDFVRNGNIYSVPKHLKDNH
YKSAIKLGIGIDYKYPHDYENDWVEQQYLPDEIKNKKFYNHKNNQYESKV
YELYNRMKNKK
>MSC_0658 Conserved hypothetical protein
MNSNLELFKTKDKEINNIAKDRIIMTLDKKIKESLIFENTNPQYSIIKRE
SAKKRLKMSFKTIFENALEFILNIKPCLMLSPLTVSYLFKDIDYKFDTVI
FDEASQIKPETAISSLFRAKQVIIVGDKEQMPPTNFFNTLENDEVVQKTN
IEEDISSGYESLLSLAEGNLDSIRLKWHYRSKFEDLIYISNKFIYNDLIT
FPNSKLPKDYEGLRFIYSNPNQQTDEYHTILKALQTLKQIILTYQNKYSF
GIVVFNTEIEAKANEYLDKFLEQNPDLLPFFNEDVKEPFFIKNIETVQGD
ERDFIIFIINGKINKNNRVGVVFGEINKQNGYKRLNVAISRAKRGMIVVS
NFKHNDVDWFKSEQDGIKMLEKFIKNAELGVDNLKHLDLDSNFKSAFEQE
VYEQLVLKGWKVKKQVGSSEFKIDLVVVDPRNQNKFLLAIECDGSAFNLA
KSTRDRDRLYQQVLESRGWAFHRIWSTDWFKNPSLQIDLIEEKLNKLLNI
NKENKIENKIISNQTNTDKFIKKTNQPTEIFQTYPSINKLIKQVGYHSLA
NQTGFFIKKVSDLENATSYILNQIGPLLLTSVYKIVRDITKQQRVSDTVK
LAVSRMISSIGTLDKDQFLIPYGWEFKFRQSTNNDNKRSISEIHDKEIKH
FILTIFKTNNMAISTTDFTKELTLKTYNKSISQNSIDKIQHVLDQMVKDK
IIVEEIKNVYRLI
>MSC_0219 DNA METHYLASE
MIGVIMHIISGKYKKMKLQTLDSSITRPTLTRIKEDMFNIISNYFIFENK
TSLDLFGGSGSLSIEGLSRGIKFAIINDLNKDANKIISFNLKKIPTSDYV
LYQKDYLDLLNLLKVQHQKVDLVYLDPPFKQIEYYYVVFAFLINNNLLND
WAIIISETNQKLDLTKIKDLSLLKFKDYNKKYLYIFRLEK
>MSC_0197 Conserved hypothetical protein
MNWIRIFWKKGEPNQLGAGKWTSFVKFQFENNKLKILTKIRLVSESYNQK
EVIFVAGKTLADKIEIQPNFKYWDNLMQEDKTYHLNKFTFNLETSDIIND
EKSYKVGTGEKTKSNKELVLDKFKEELNSFKSKNYSSLTETFFNRAFPNF
DEMLDDRYLESHLKFEKNKVILPINSSAKWPFTERPFNIPTQFELPVEIN
FNPTSFISEINNKFNQLSSDINIMADYSTDITDRTVKNEVINKDEITGQT
YGSNLYTNYTLVHKIIEDRIQKLFVEPIKNNFSEEDLNNLFEHTITFDTI
KNKINVNFISNSKYKKDILDIENNKVKTIYINCNVSGSNEYNKKINLGNG
LTLESGKYFDTKTNKFIKDIPEIKETQTNSKPELNTKLKTYIYHSDVTLE
WKSDDPSDVLVVNGEIKELSDNNTFRETFNIPLINKWNDETQKEWKIHIT
SKNPERKQKIDYQFNIKIEPIVKENLEIKNLGWKPDEKADKNTKEYNQWL
ITQEFLFDKDGNRQPNPKFVPNLNPKTGFISNFIFYKTQYNWSNPKLRI
>MSC_0282 conserved hypothetical transmembrane protein
MKKALKVFSIYSLDPVYKLKQINKKIDNKKLSFKKILKIVSSLGLVLFST
TFLVVLNFDKLNFSNNKNLEQQLESYKTELNNKISSLKISYNDQIDQIKE
FKKNHNLLKTISLKELEDKVSNLDDQNKKALKQIEHNDKLLKANQDKLNY
LTNLKNQYSKQITTLEQNKNENLKQINQINLQTKSLETKIINLNQQIKQT
NELLINKTNQNNQLKTQITTSSNQIIDNNALIRNTNQLISDIQAEKNKLL
EQLDQLVNKLNEAKNSTKQKSQ
>MSC_0255 conserved hypothetical protein
MTQSIIALDIGSKTIGLAYSSGVIASSLDTIRFEEYNFDQGLRQLESYLK
KYNPTIIVVGYPKNMNNTIGERAEMVDYVIEMFLDMYKNFNKDQIIRIDE
RRTTKIAKNILIQANLTREKQKKYKDSLAAQLILELYLESRKL
>MSC_0947 Deoxyribonuclease, TatD related
MIDDNKIFDNHIHFNDESKYKDVNIKQLIEESNEHVTGWLCSSSDLISSK
KAVEFSKEFENIFASIAIHPNDVQNFDNSVFDELEKLITNKKVVCIGETG
LDYFYSKEHIKKQKEFFKKHISLAIKYNKVLQMHIREQKDQFLAYDDVIE
IIKDLNQITKVVHCFSANAIYAQKFLDLDCFINIGGAVTFKNAKDLQEAV
KIIPLEKMLLETDAPYLAPHPYRGQVNHPKYIYLTALKIAELKNVDVKEV
IRITTLNSKKIFNLN
>MSC_0033 Predicted permease
MKNLYLMLKQGVKWILKFKLQLVVIVVLTFIASSILTISFTTNKRLSSAY
DQVVNNSKSPKFDSTYQITVGSKAKPEKGDPLFIPIFDFVNKQYTGFKNE
GYDNFNLTFNDIYGEKNLLTITTSSQEFKDAWAKKKDIFIYKDNQNDIKQ
LAKEQQDFDFAINDAFFNTMAELLSKNDPAIRNTVIGRYTLSNPNWYKHF
YNKEKNIKSNWAEFIKDKSRIETLKNTNPDDLKTYFYSYYAFESLSQYFF
KTIQTFLQNKDSELAQQSNINKDNSHKYFYEFLFGKYFENNKASYKEEYI
ANNNNLYTLTFDSTVSSSEFEKMNFLISSENKEQNSQDQKFFNELVKKGF
KGILRPLQITYQNFGDQVDIKNVVQYTETQELRGFVSNSNIYSQNVKELP
EIFKNNSFVDILAMNADPFANIGEKSVNFYTSKTNDLETIVASDFPITAA
FLTHHKLTALANGYDLYIRPETIFNDPITKKTFRIVDITNKDFTNYIILD
GHAPSSASEITISKQFAKANKIQIGDRLTLGNAKGLIVTGYAVDTYSFFP
TSDPNVPLPKSDSGGLIYADFATINQILGDGNSATANDQTSTFNFFLIKK
NNSLNIKNVYFDHFSVANKIRDNILAKQKGTEIQTFYQEREFSNSWYSLN
WTLYQKISFWYSLATFLMASLITLVSALAVFVGVIKSIQANSKQIGILKA
NGASSATISWSYVSYAVILVFIAIPLGWMAGTMLQVPFVAIFKDYFSFKT
NVLIYDWLAPLISIIIFGVLIGVFSFLVALFHIKKPVLDIIKSSKKWSKP
KITDWLHKHIFKKPRFATLLMLKLTESGKKPFSLLLVLVFIGTLFVSAGV
AIPSVTKYAKDNYFKKVNYDNQYEIYNSLSNSPLGKDVFNFWNGHEQIDN
TYKEIKDPSGTINYYENPNSYTLSNQNSSVLPQLIYKINTNKDNDSNNAE
ILTPYKSIIKEYLKTGVSNLYKNLLDWASYQISISNGKSISIGTIEQLYA
YILNDADLNERFKNDIDKVKETNNVTQPLTQFVGELLKTIFKDKVQTTGE
WKEKILNLILGYSPSFIKSYLTSKSRRAQFSFGWQKQTIIPQKDQLATIF
KPKSNNVETNYSILGLDKNQQTYKLSDKQKNQLFLSNNQVQKLYQIINNP
YDKNQNNDIYLNNIKVYDHKTNTLTIPTIVNKNLNYKLNKFGDNIISNLS
ANNIQLSYKTRNNDFNVLPKQAWIYDDSDYLKTEYVNKHTKWEDQPIQII
NNKNNSSSYGYEVVENNNEKYYYLNPYNLDVNKFTQRQVIDIWSNNSNSS
LVAKQHENIVDESPLFGDFVINNNGQITKSFIRPYYQLRNLLLFVPITDQ
VSWEDFVLYASGWSKSTEHGLDIKRVISDLDKTDDHTRNYKYPAIKKLNA
SLVPQSVKNGWQSVIKDLKSDTAYLAIRPYDFSIQQEKWANNHYEYFILD
NNTKKILGVNPPSADKSIPNILLNSVPHFYRRAVGKRKSVPAILKLQDKN
VSYVNKDLKIKLQKVDDIDIYGKAYALVDSDLANMLYGFDISRSTNYDYR
PFDTSKIIKKGELFNTYKTTNWLKVNNKDPWKQAFISQKDTFSYSPHYYY
NTIFSNSSEPLIITSSVSLISEQRLGIGILDLMNLSDYKAGIVDVDFTFE
TKQLLNQIVKTAIYIAIIIITAIMLCASLLIMLITDIYISQYKSFMIMLR
SMGYTNTQVMFYTLGIATIFSLLISFITTIIVFSSTSIIDKVFSANGFSI
PINVYWVSVVFCILLILVSFFTSLWVSTKRVRNAEPSTMLSEVDE
>MSC_0631 Conserved hypothetical protein
MFNSWTEEQVKNYVNTQLHQLELNFKDNTREIEKQLRETPKNREGIIKTL
NEQKTKIEIKYREDLDKFNKLDKKGLIEWQQKEIEGYNKKKGQQTLRSSE
SGTMWIMDYIDETNPTKFYFGTNSHVAKAIKDNLVSFSLTRLNSDISVGQ
TFNLNGFDKNFTKFVFEKNENGTKLKDAVSAIFHATDFIKDESNPLKMLE
DKQKEKYKDAGIFADFAVIEVDFSKLLDNSKYKYSIWSESNEVSHKYDKD
QNKLISLITNDYAKSNKQIQFESESLLDDKKYKTFDRKLDFDPKKQNEVD
EYTKLDSLYIVGYPIAVEDYYLDKYEDYKQLQNREYDYSLWVNSESKYYK
NLVKKEGTPPSFKEYETDKGNFFSYQIGYRSFIDKPGLTDAFLAVHRIGK
KLYTLDNNGKPKKYFNYGLEILPRFYAPSGGASGSSVRTKDNKLLAVFHA
ANNSAKTGLAAVFRSRGKTIITYLVIIN
>MSC_0575 Hypothetical prolipoprotein
MKKLLSLLACSFVITTSASFAISCKTIYKQFKEFENLINQSENKTMILYL
GARGNKSAKSFEQGLEELTKTNSLDQAIKNINETSTNDATSFIYKFKSNL
SWNSTNNHTKVLNDVEVKKNKNSKTKKERWIIDQKASPSSKQIFKNMTND
VVIKNFKYDSDDKIWTKGLTSKILNEYLVKNWAKAFYGETSSSFNKNDNT
VTDKVEKLQDKVKNLKGPLFLILRDKMFYGIVSGFETFSKQDQKNATKTI
DNYPNGSDIRKNVYDQWIGYLKQAIEMYDVVKLLQDTDPMITPKTEWKYQ
GTNKVEDKKDDKKSDKDQKEKPKEEKPAPAPQPAPAPTPAK
>MSC_0003 Primase-related protein
MNKIKQIIIVEGKTDTDKLKSIYGNDLKTIQTKGLSLNKKTLEMIEEFNN
KMGVIIFTDPDGAGKKIRQTIIDYLDNKVLNAFIKKDDINKTSKKIGIAE
ASDDAIKKALDNLIIYDKNNVSLSWTDYINNDFYLKSNRIVICKYFNFDN
NISSKTLFKWLNWMNVSIDDIKKIIGE
>MSC_0279 Conserved hypothetical transmembrane protein
MCLGKANMWFELMLIITKLSETKAINIVFLTIFLLAFFCSLFTIFKLYVY
RNTLKKLHFTFLNIEKTLKHPLANRLVRMQFIVTNSNNQNLSKALEIWKI
KYNQIYNVELDILIKQTKEHFDLNSYSKKILFRVLSIKNFYRTRKLYKTS
KVIYQKINLMYSETQQVTNIEFLLRDYRIILQKHINDLFDIVFKEQENNQ
LNIDKKIINNYQESIFKKMIVCEYYIKIGNFKEAFSKLNLLSNNVIEYIK
FLDDHYKITKFLEFNGILDSKLQEIKNKVQLAIDQKNNQLIKYKINLLEQ
QFIDQKQAVEKLLFQGKNNQAFLIIETLIKNIQNLDVILKYDQQILSLVK
TNVKNIRTILLSFNTELLKTEELINFNNNLNNDISDIKIQFDQLKTSFNN
ITTEFDKEYQKISSNFIRFNSLIVDYLNYIRNVLIDIKKHYTQLIDIKTL
LKNKSLVLRDLETKYDNIKILLFLSQAIIKKYEKVIDWSVYKELINNKFL
IINFIYKNLELEANTFTNDYDALLVLNNQLDNQIEQVEQLHLNIEQVVVI
YKIAQQIIIYIAKNLAYIPNNNAFEEILTKFKEKNHKKVINLAIHLIRKN
QL
>MSC_0949 Conserved hypothetical protein
MKVSEVLNSSNIKQLFFVFFNLNIEKNSQFLIKITYKNSQFSTLELNDKI
YTENNILMSEYTSINMSFKERIHYVLIENDLNIDREVFLHNIFKRIQQLR
LHVEQNQFEKALYLGLFAFRGSADLSLNFYSVDLLNYNNFYSHWEDIISL
LISSPAIKQLNLNFRELQPDYINNNKKRNTQIRINLKWFYDNLVDDISKI
NVYKSKILKDNKSIISNLQTVKTFKNSFIQRLNYYKNKILNFKDINEQPT
KEQIQKMRKELNLAIEENSTTARNSQVVIFAKVMLPDFCFACKDIYELNS
RTFIHRQTINLI
>MSC_0418 Conserved putative prolipoprotein
MISTTGFLVVACNNTNNNPVKSEVIINNKQAVIDLWNKEFKNKLDSAKNT
SLIVEMPKEKLPKKISDSIDLNGSLYENTVEKLTKDENNKLNQKVKILVN
KDVIGLEVGSVKYGQQSIKYKTKDGKIVEKNKDEWEKMVWKDPSKKRQKQ
EISDIDEVIQMGYYDDEGDNYLKPFNKNIHYVQAFEMPTNISKISALLPR
EVNSLSKVFNQITSTNVGGIEKWDVSRVCRMSFLFDGAKNFNYDISNWDV
KILRDANSMFADNAKFNQNLGKWKPYNLAFARKMFARAKVFNQDLSSWQT
SHVYNMEQMFDGAEKFNSNLDKWDTSHVQTMKSMFSGAKMFNGNISSWNT
SKVEDMTSMFRKTGEFNQDISKWKTNSVKKMNYMFSEAKMFNQDISKWDV
SNVDEMNFMFNKASSFDQNLSEWKVKEKVKHSSFATGSKIESQSSKLPKF
KDSSSSTPTPTSTPASTPASAKSKMK
>MSC_0236 Hypothetical protein
MKLKNWTFYKAKQFVKLNESNEILKDIAVLVLRPDINKEKTLLAIGLDKK
VVNSLIIDLQNKVFEENELFEIFKENIGFVLTEEISEIDAKGLNLSNPIH
PDNIKSIIKIYNLFLNVEPIEFDTKDYQDLETIQNQEDVFTNVDFENIPL
PALLQTLNVGMENYKQRVEEIFELDGKESINKKLELVNIQSNLIAFFDQA
LRKMDEIITKLSEQNAELIKKLESQEK
>MSC_0207 Hypothetical protein
MIYLSNQKKGLIKMNLVNIVGQIEGDATVAYTSKDGAKKFYKFIVKVPKP
YKSKEVDSFDYINIKTWSNAVDDEFLLHDQAVVGIEGRIESFTSNNDLTN
IRNEIFANRILYLN
>MSC_0776 Conserved hypothetical prolipoprotein
MINKKYKKSVLVVLCSLPILSTSIVIGCKTTQNQQGIYKIVDFEKENQIN
ILSEINQFFEKHDFNEQLVQFVNKDSHNYITLDSLMKNNYAAKYVKFDKD
KFKQIIKKEFNLSDAYLNKLEIEVDYTNIDRDYSNNFDIVFPIRIKRQLE
NHKKASYQPGLFTEQIIKFRLKNVKSSPSEAFFAEELKDVFNKLKELKYD
NFTARLKTNISNELKKQIDQWNINELDSTQLSNIFEINISEFDQLKTNNP
NFVFKSTIFGVDFSDKNLALNEGYLKVRFAVKEGFDSKDKTKQINLINKE
INELIVKKENLEKTNNSDSNKTEIDKLIQIIKQKSAQLTKIKQKALPAEA
GITKLIKFKFDWNDQFWKNIKLNEVIKIDTIKYGISNTDFLSLTKDNLIV
KILNKDVRNVDIKKIEKTNDFRNAKLVLDVLLKDNKKLELNKKIGVGKYS
LLYENDFIKNNIQAPYFTTERLTQENLQSVNKDFFRQFDSELFSGGYASS
RGFYAPKITTPIFMHIGEDYIANDFQAVLMPYDGEIIAAYELSTNVPFAG
VGTVVVVKIKVSDLDWTPKEKEIYLNDNKDHIYMSFLHLDASRTLNNQKL
GWSAEKVVLNNNRTIQVVKSLTPEKPQKVAKNTIIGYLGNNASNGGWMSH
AHVNLYTNRPSYLSENYFSTKSNQGLSEDRIKQYHQNINGKETWRQFGNI
GLHQSPQRPPYTINEVDQITGVEKLDENKKKIVVKNEQALFLPNLSMSLF
EKRLGYANPNLVYRLRDNKTVSFSVKEVNKLT
>MSC_0402 Conserved hypothetical protein
MLHVKKHQMIKHQIKIIVVIKIITLDNKQKNELIQKFNNEVRNVYNGTVR
PFIISRTAFLSKDSDDKNFFSRESLYKLDNEKSLKRNSDGYYSFYENLVQ
ERKNEFEKTLRTVLDVNSAVNKFKEKILSLGVDKYRTILGGFNNNKWIKG
IKFRLTDSKIKFFEEKDSFDSTIQVGLDFQYQYKDSADQIKIETISDDFV
INISSEEAVIKLVNDIKKSWVDLLLLDSNNLLKVDFERLKKFLGENQVLT
AQDLLTTATNNYKSVISKYNESLAEDVKEEISRHFIKNSENKVVKNLLLS
FKDQNQKTDVEKNNTELNIGSLERSYYNAKGKKTGTLEKGSYDLLHLYFG
EVKNGQEINLLNTKTGDEKLAKDLINPWVEKMANYKTKFKQTLANIASGF
VDEDKLDEFNNKLEKDNVLKNLFKSSTSIESFDLKGLQLKLENGYVHDLG
NISFSYFVELDKKDKELNFENLENLTSEGYKKSAVFDAYYKGIEVMLDQF
HKFYGIQKAYPDYESSEPSYPLKRLLFNMTGKPSSLKDSENSDFNIWDEW
EKYLSQTENNRVDDWTSFYSLDFNADVKKVKEEYLISNMLGIKNFKVFQD
QEQQHRYEQEIYYKNESYQRDYEQSKLHVTFPKGIVTEGRRNTHLEFGLL
TDLLNFKLKVAEGFTYYTLGAVTIIGKLDDQNGNQDSTKPDGEKDKTKPD
GSTEPKQPENQK
>MSC_0813 putative variable surface protein
MKKLLTILTTSSAIFTLPAITLLITRSNTQFEFKTYKNKFNSREHKIDKN
GRVTEIGYTVLPNGVIKIKRFDYKVKIIAAKLPEEITSLNNAFLLNPHNI
KWEVDWDTKNITDMSYAFYNTIWINSEKISKWNTSKVTNMEGMFGLTKSF
DQDISNWDVSNVKNFKNMFDRAKKFNNKNKPLNWNSKLKSAKNMQGMFKS
TDLFNQDISDWDLSNVTNISQMFSESKSFNKNISKWDVSNVKDMSKLFEN
AYAFNNGEKPLDWGHKLKSIKNMSSMFNGASKFTHNLSSWLMNDIVKNDN
FGLNKEKQPKWKVEEKKPVNDSLTQPQPNSSSDNSLPRENSESSSISNTE
AESTLPKVDKTKKQSEAKNKIPVEKGELPKDENQTTKTSNAIKDKENSSI
KSDSLYKIPSKPNTIISKPSSANAGIIAGVVLGSFTILGTTAGLSYYYRK
NLKNLYLKSADKIKPSLLKSKDNIKDFYVKSIDKTKNLYFKSKNKIKDKI
AKIRSKK
>MSC_0945 conserved hypothetical protein
MILMYFFYSEDIFLLNNQIKKTIKELQQKDQYDVLSFSLIEDDFNTIYDN
VTNLNFFSSKSIIVISDAYFVTEIKTNFNKNYSLNKLEIMLKNFNPNNII
IFVLNSNKFSKKLKIAKYIESSFNVKYLSLWDEKQTIKYIIDYLKSKNKI
IDINLASQIYNLLPNDLQIITNEINKLANLKSELNIDIIKTNLNKYHNED
IFKLVDAFINNNIDKFIKLYHDYILLNDDIIGLISLIDTNLSFYRDVVIL
KKQFKSEEQISTILKSHIYRVKLAVNNSYDINTLNDKIKIVYKIYKGIIN
WNINKKTLVEYMLIKNMKG
>MSC_0656 Conserved hypothetical protein
MDQDKKEILIKNINNWKHKLLDLGMKNKSLNFNIKTTKTISSKIQIIYPN
LLDFLKSLDTSSKEIYEISNYKMLGLYKTDHLKNQVLYSHNLEEINKNNS
FYKTKIYTQFGYFESKDILDLTLKKIYKISKTWKDEYSIDVLYLAFGFLK
WYKTNDSNEIRYAPLLLLPIELKKNLNTWSIHIKKAENFIQNEALVKKIK
NDFDIDAELDLNKDDLINIYKSYSQKILNQVIDQRWEIIDDIYLANFDFS
RINIYKDIEANINSIIESDFFKKIVDQTNNLDNDISTINEQNIDTKINIL
EQYKILDADSSQEIAIQNAILGKSFVLQGPPGTGKSQTITNIITELISRN
KKILFVAEKNAALQVVYNNLKKIGLEKYAIPIHDSKINKSEILNELINSA
NNSQLFLLDDQKLESFISNYQKIKEIFSNYKDVLLKKRSIEFDNVYGYIN
KYYLYKNYLDLNFEINNINKITNQNFEQYLQLINRFYESYKLIGFNYKNN
LWYGFNKTNLDFLTKTQIFENLTKFNTEIENIKNYINKANVLANTNTINL
EFILKITYLKEITDLYKHIKPLYKNQILEIKTLNEQLQKVNQLVEFYYIK
NLLIQTLNNNWTNLDFINDNQIKQTIDYIKKTINKPLKVLSLK
>MSC_0205 hypothetical protein
MDLQNDLKNEEILEQIKKARVKFEQQIYATSPNKVEKQFVQQLSNLDNES
FFKILEIPLTSSFERIEQAYKVLIFKKMTLNEQQIDFSNNQEIKVYDLPF
NSTHNEVIELPLKTNDFKEIEVETNKVKPEYESVEQEINNDFVGEVETND
SQQNEQENPIDSNNQFKLPFKKTKSKRQKIIKKYERINKYEFN
>MSC_0639 hypothetical protein
MLDEKIAFASKELPEIFKQMSSLQNQLNELLKIEKDLSIRLSKDDTFKEL
DKIITQLNEKHYEKGEYETKLSQINDSEKIIFKLKQKISLLSNDLYSKEF
KNKLDERLNTFNKYFQKLSDTLYNERYFLTYSNRKWKK
>MSC_0522 Hypothetical surface located membrane protein
MKYKGKTRGEDTYSKENEFDASGWGDTNFVVYELGYYDDGKGQIQAIKLP
KGVVLVPKDLPKEITSLKELFKDAESFNDQNVKNWDTSNVEIMESMFEGA
KKFNQDISKWNTSNVVNMSQMFKDAKSFNQNINTKYSRDKNKYNMSWDVS
KVKNMQSMFEGAEKFNSDLYNWDTRKVSAMRDMFNGATSFSKDISNFDVK
GVVDFTDMFKNATSFNKNLSNWALYDNEPTDLDSGATSWISYYKPRSKRD
TSNIIKKSDEGYLNTTNSLYDEEITKLIDKHKIEVEKQKAEAEVRKQKEI
PNKIKDIWNKDFKGKLNSAQKFKDIFKEFQEKIKEDIDLKSVSIKLANSS
LENKRFRFDSTNTTIDTRIKVEPQSLDILLDNQTIKLEPGSVKRAWKAVQ
FKGKTNGERASSKEDEWDISYWDDKDFEVYELGYYDDGEQIQAIKLPKDV
KIIPDHLPKEITSTKELFASSNQFNDENIKKWDMSNIEDTSGMFLGAAKF
NQDLSDWNTSNVKKHE
>MSC_0005 Hypothetical purine NTPase
MIRDFNNQEVTLDDLEQKNIETNKNKPKVQFLMRFSLVFSNIFTHIFLLV
LIVITSLFFGLRYTYYNYKVDLISNAHKIKPSIPKLKEVYKEALEVVEEI
KRETDKNSNDSLINKIDEIKAIVKEVTKFANEFNDRSKKVEPKVKEVIED
GKQVTTNLNKITKEIEGLWKIGDSLTNRVRRSLNVFSALDSLANTANNDF
RSVSESVTKITELAKKLSVEGEKITTNVEIIKKEVDYFSKKSEIPLKNIE
KLKEIYRQKLPIFERNNKKLQEIWDKLMGIFNQFTVEKTESNYYNHLIYI
LLFLIIDSLALLVITYMSMISKTMKKILLFYIFGILSFNPFVWVSVVISF
LSRPIKNRKRKFS
>MSC_0446 Hypothetical protein
MLLFLVKKTTINQISDNNNNSSTNKQDKNKQDHSNNEKMGENTKNDSDKI
NTEKTLDNDRMNNQSDQPREESTPRNNDSKENVWSRGIKKRILESLNSTN
LDYLKTLSNSLIQEKEKTLISNNIDKKTLEYKTKLTKFSSELKFDEIKKE
LISSLEESIKKNKNNQHQHKLLLHQFKDRQLEKQHISEITKLIIDIYRSN
LLNELYKELDEKIQKENREFEEIFKRKNKNEIKNKLFDLVDKIVDLQEAL
KNMSV
>MSC_0923 Conserved hypothetical transmembrane protein
MNKNKKTLINISIAFAILPFLVLPSLFFINKKTNNLNLVRPPGFDVYDNL
NNDKEAKTVKLLQSLVDNVFKDSKLDQQKFIKSQKEKNDQLISKAKELNL
EYLKNSNEDNLKKLKNFYSENWLFVFQNLSKFEMKFIDFWKLKANNGSEL
HSKQFLDDIKKREKPKDNYYFSNNNIDLIKIGKENEDLPDLTVYYLRKDR
FIIRTLLTNNDKKMTIDKFILFNDSIISRINIDTISDAIHLGVFHNQKDA
FTSTFEKHIIKEYGYPWDGILLWKGK
>MSC_0190 Conserved hypothetical protein
MNNILSFIKTKKEHLLNKKSDDIKNQQDFKDKHNFLDKKVKELQAELELK
ETRIIDLTSQNQILDSTIIKLDDRILFLENDIDQLKKDNQNLEQQLYSNK
ILLNNSKVHINQLVKDNLNLVEIINNSKQIIDEAYQLSLKSTNLSKQNSE
FLTNNLSNSKDFSENKQENIERI
>MSC_0951 dam, Adenine-specific DNA-methyltransferase
MKWKEPSPTIDTRFDTPSNGTNSHPVLNRTITPREAARIQSFDDNFCFLG
NKTEICKQIGNAVPPLLAKSIGLSIIEQIKKINEIYINENIKIYNADSYK
IVEQFINNSTKVNHIITDPPYNISQSNNFHTLRSANRQGLNFGKWDYDFD
LISWIKPYSKLLDKNGSMIIFCSYKYISFIIEELESNMLEIKDVIKWVKT
NPMPRNVNRRYVQDTEYAIWAVKKNQSECLINHKIRFIYVRFFRLQL
>MSC_0216 dcm, Cytosine-specific DNA-methyltransferase Sau96I
MSNSYKSIELFAGAGGLALGLEQAGFEHVGLVEFDKQAVETLKFNSPNWN
IVFEDVQKVSQRDLKKEFNLKERELDLLSGGAPCQSFSYAGKRLGLDDIR
GTMFYHYATFLNKLKPKMFLFENVKGLLTHNKGQTFQTICDIFSQQGYEI
TYKVLNALDYMVAQKRERLIVIGIRNDLTNLIKFEFPKKHQKKLVLKDIL
KNVPKSECAKYSKEKQEIFKLVPPGGCWKDIDQNIAKKYMKSCWNMEGGR
TGILRRLSLDEPGLTVLTTPQMKQTERCHPLEIRPFSIRENARIQSFPDD
WVFKGTIASQYKQIGNAVPCNLAKEIGKSIIKSLQGIDVNE
>MSC_0950 dcm, Cytosine-specific DNA-methyltransferase
MLAVDFNKSALETFKHNMPWSDIICGDITNESIRQEIIKRATKLKVNMII
GGPPCQGFSNKGKKLGLNDKRNFLFKEYLEIVGKLQPEIFIIENVKTMLT
TANGYFLVKFKTQQNN
>MSC_0469 deaD, ATP-dependent RNA helicase
MKFTDFGFKKYINDTLDQIEFIAPTSIQQKVIPLLKKHKNVIALAHTGTG
KTHSFLLPILNNLKLEENNNYAQAVIISPTRELSLQIYQNTKLFFKNNPL
INCNLFIGGEDISKNIEQLEKKQPHIVIGTPTRLKELYDLNKLRLTTTSY
FIIDECDMIFDLGFIEDVDYLISKINQDVTIGIFSATISQQLSVFCKKYI
KNAHFIDDSQNKISTSNVKHILIDTKNKELEQSLIQIINSINPFLCIIFV
NQKDEISKIVEILHKNNIKQVAELHGNLQPRLRLSMLKKIQNNEFKYLVA
TDVASRGVDVKGVSHIISINLPNDLTYYIHRSGRTGRNNSTGYSYIIYNL
KNKIQIEELIKKGIEFETKKLIDNQLVDIKTNYKKVKVFKELDAESKQVI
NKYKNKKVKPNYKKKRKQELDKIKQKIRRKHIKENIEKIKKAKYQKRRAE
LFD
>MSC_0549 dinP, DNA polymerase IV
MFNKTIIHIDMDAFFASCMQLKHPELKNKPIVISNSFDKSIISTASYEAR
KYNIKAAMPLFKAKKLYPQIISVKPDMIFINNISYQIWDFIKNNYTNKIE
VASIDEAYLDVSDLVKNTSVLILAKNIQKDIYDQFNLTCSIGIGFNRFSA
KMSTSLDKPNGITLTTTSNFKDNIWPISINKMYGFGQSAAKLLNNTKIKT
IKDLALLSDVEVYQLLNKKGLALKNEALGLGSDYINYLSNDYKSISKETT
LNTPIYQYDEIETIILNLSKFISYKLHKNQLLCKTIEIKIRYKIDEKLFD
KQKHLTSRHKQITLKNYTNDFEKIYNSALDCFYSLYDQNKGILLIGVGVS
KLIHKNQNWVQLDIENQTKVDKNQIDNALKIENMIFDINKKFKKPVIFKA
KD
>MSC_0001 dnaA, Chromosomal replication initiator protein DnaA
MLIYLCYKDFFHVFHIFNKCLTIIFLETNMNVNDILKELKLNLMANKNID
ESVYNDYIKTINIHKKGFSDYIVVVKSQFGLLAIKQFRQTIKNEIKNILK
EPVNISFTYEQEYKKQLEKDELIKKDHSDIITKKVKKINENTFENFVIGA
SNEQAFIAVQTVSKNPGISYNPLFIYGESGMGKTHLLKAAKNYIESNFSD
LKVSYMSGDEFARKAVDILQKTHKEIEQFKNKICQNDVLIIDDVQFLSYK
EKTNEIFFTIFNNFIENDKQLFFSSDKSPELLNGFDNRLITRFNMGLSIA
IQKLDNKTATAIIKKEIKNQNIKSEVTSEAINFISNYYSDDVRKIKGSVS
RLNFWSQQNPEEKIITIEIVSDLFRDIPTSKLGILNVKKIKEVVSEKYGI
SVNAIDGKARSKSIVTARHIAMFLTKEILNHTLAQIGEEFGGRDHTTVIN
AERKIETMLKKDKQLKKTVDILKNKILTK
>MSC_0681 dnaB, Chromosome replication initiation/membrane attachment protein
MLSKNFSYSVSLNFELDQEQYKSLTCLYQPLISAQAISLYLTLIQEVRIS
NILKEEALESKRLLNITNFSYKELIKTLDLLNAFKLIKVYVKKSDYSLIK
FEILAPLKSDEFFNHTYLNSLLLNKLESNDYEITKFMLVNETKINTNQYK
QIVVDLTDIYDQSLIADINIFDTEVNSFNTYLKKLNQLINVDYVLNSLKD
KDIDLDFIQQSTLKSLYDLLTIKKLSEDQIIYLITNSYDFINKNIDLNIF
KKLLINLITKKETNFDNKELLDLINQTTWSEYSKKKYDIDLSSYTTVFEN
IKQNYCLSNGIINCLIDFSYKKNNGQIIVKYIAKIAKTLFDKNINTTLKV
MQFLKNIQSKSINYNYETMFDSNDFNLQTEAIFEFSEEELKCLV
>MSC_0956 dnaC, replicative DNA helicase DnaC
MKQELTVAELLYAERFVLGVAMSFSNALADIVSVLKVDDFSIPANKYIYQ
AIIDLNNKNKSISPISVINRLEAINKLEQVGGDVVVYEIAAENYTDQGLE
EYIDIIHKAGVIRKLDIVIKELEIKRNNSNTDVDELLKVAQTKLLDIDLS
IKRFEIEPIGEVANRVVEKIKELEMKAEIISGVPTGYNYLDLVTSGWQES
DFIILAARPSVGKTAFSLNLAFNAAMQKYPVAFFSLEMPAEQLTQRLFTR
LTSVDSTNLRTGKGLSKQNWEKIQIAKEKLEEIPIYIDASPGISTQEIRS
KLYKMKRDHNIKLCVIDYLQLIVGSQNKDRQNEVSEISRQLKQIARETSI
PIICLSQLSRRAETREDKRPMLSDLRDSGAIEQDADIVTFLYRDDYYKKD
LTDLDKEKTELILAKHRNGATGTVLLRFIKDFGVFRDW
>MSC_0684 dnaE, DNA polymerase III alpha chain
MNYISLLTIKNQYDFLESLITIDQYIEFIKKNKLSYAFYSETHTMYGVAE
FFKKATDNNIKPIIGLTIEFEDSTKLIIYAKNKKGYQILNFVSSFLNDGF
NHYDYEIKEYILELVNNNVVVIGLISDLDFKTHLIDKLNDDFYDVKELNL
YFNQISYLDINDQKTYNILNAIKTNKTIDQIQNTNNYFYPDNDYLIKNYS
LENIKKVINEINFKVDFNLFDSNKKHLVKYKNINNLSSFEYLRQVCLLSL
KKYQQKIKPNLDLKLYISRLNYELEVIKQMGFSDYFLIVSDYVNFAKKND
ILVGPGRGSAAGSLISYLLRITDIDPLEYDLLFERFLNPDRSNLPDIDLD
FQDNRREEVLEYLFEKYGKYHVGMITTYQTIGYKMAWRDVCRVFNIDLLI
VNKISKVLDQYTNSDFLEFIKENKLLNDYFQNNVFKEIFITMHKIVGLPR
QTSTHAAGIVLTDCDLRELVPIKIGFNGINQTQFDMNYLDDLGLIKMDIL
GLRNLTTIQEIKHLIYLNQNLKISLNKIDLNDKKAFELLKNKQTSGIFQL
ESKGMTDLISKMQVDSIELISIASALYRPGPQEMISIYLENKKTNKFKII
DQSVFEILKPTYGIIIYQEQVMQMLNKVVNFSYAKADIIRRAMSKKNNKV
MQSMKLEFINSAVKNNFSYNKANLIWNWIEKFSNYGFNKSHSISYSYISY
WLAYFKAHFTTEFYTSLLDQNIGNEIKTQQYIKELYDYKIKVNKPSVINA
NFNYQIINKQIYMPLTCIKSIGYEVVKKINLAKSENENMYLDIHNFILAM
IKQKISVNVLQTLIKAGALDIFNYNKKTMIENLDLLISQANAYKQVNNIL
DDEKINLIIYDEYEDEILASFEKELYGFFIEQNPILKLKTSNFDLNLIDI
SKLEYNKVQVILGYILKIKEIKDKNNNKMAFVTIFDNTSELELTIFSSDY
KDIYQDLVINKAYVFKVLKTKTNNKTSIKFVSLIKAI
>MSC_0465 dnaG, DNA primase
MVYISNEKISEIISKANIVDIISSYLHLIKKGRNHLAVCPFHNDSNPSLT
ISPEKRIYTCFSCKATGNVINFVKDFEHVDFVTALKIVCDKTNISLDELK
NYNQPIKDLESETIFKLNSEANNIFKTTLFSNLGIDALEYLKSRNISIEQ
IKKFEIGFASDKTNLVQKLLDKNYNSLDIEKANLGIITNSYTKDYFTNRI
IFPIKDENDQVIGFSGRSFTKDNDPKYLNTKENKVFKKSHLAYNIASTLK
ISKSLKKIIVLEGFMDVISLSKIDINNTIALMGTSLSNYHLNLFKRHNLD
VLLFLDGDDPGIQANIKISHQLLKEKINVLVIDNQTNNDPDELVNNNVEY
LKQIINQPIHPVNYLIDKLWNKVDNNDPNQIENFIKKVLNFIFDLNNEIL
VEATINKLVELTKISEQTIKNNLIELKKQLKLNNSFNKNSTTQVFKTNTQ
TNKVNKQQFKTKPNDFIKKEYINAEKRIFISLLISDQFLDKIAANVEKMI
HPDIKHATINLINLYNKKIYQGNDINKAFDLLKEYNLTGFDKKQEEIINN
SLLTSIKIRESEIDDAFSKLDSYHNDTEISNLKKLLAESKNKTERFQIWN
GIDTLKNKKKKR
>MSC_0680 dnaI, PRIMOSOMAL PROTEIN
MKLSDYKNNVKIKKLIEDSQNSNDIITDKVLLENQNILDEFLLNYKECNL
DTKCEQVVKNYQVDLVFKDHQFYLKNVLCVHGKQTEKLFIIKKNYWFCDF
DLNLFHLTIDEYFNTQLNNSLFTLLDQNEKNIRKTILKTIIKQIQKGYKK
GFYLYGNSGVGKTYIFKVLANTLASKNKTVIFSTLRSLIDKLKESFNSSE
INSLTLIKKIKTVDFLFLDDIGGENLSLWARDDFLFEVLNYRMENQKPTF
FTSNFSIDLLEKNLQFTKQYNNFLTTQDVFKLEKIKIDRLISRIKTLAKE
INLIGKNKRQTN
>MSC_0002 dnaN, DNA POLYMERASE III, BETA CHAIN
MNFSINRIVLLDNLSKAAKVIDYKNVNPSLAGIYLNVLSDQVNIIATSGI
LSFKSILNNQNSDLEVKQEGKVLLKPKYVLEMLRRLDDEFVTFSMVEDNE
LIIKTDNSDFSIGVLNSEDYPLIGFREKGIEFNLNPKEVKKTIYQVSVSM
NENNKKLILTGLNLKLTNNQAIFSTTDSFRISQKILEIQSNNNEDIDITI
PFKTALELPKVLDNAENLKIIIVEGYITFIIDNVIFQSNLIDGKFPNVQI
AFPTKFQTIITVKQKSILKVLSRFDLVADDGLPAIVNIKVNEDKIEFKSF
ISEVGKYEEDFDDFVIEGNKSLSISFNTRFLIDAIKTLNEDRIELKLINS
TKPIVINNVYDEYLKQVILPTFLSN
>MSC_0048 dnaX, DNA polymerase III gamma-tau subunits
MEGKMNTNKESLYRTYRPKDFNSVAGHNNIKEILEKQIKDNRINHALLFS
GQRGTGKTSVARIFAKTINCLNLTNSTACEQCNNCKLANQNQLIDIIEID
AASNNGVDEIREIKNSVSTLPLNSKYKVYIIDEVHMLTKQAFNALLKTLE
EPPVYAIFILATTEFNKIPQTILSRCQIFNFTKIDKNSLKNRLQYIANQE
NYQIEKEVLDEIFYLSEGSLRDAINILEQLMLATDDLITIKSLKSIFLIA
TKQEQLQVIHQSLNNNTSFIISYFQKANDQGMNWDVFALGLIEILKEIIE
YKLTNNTEFLNILEKNEVEQFNSINVNNLFILADNLAEAYFKTKAANISF
NYLLLSLLKTINSNNNNLQAVSKTINTKQIEQNQEILKPNDITPKILDKP
IIDEPVIQQPIIDDLLLTKDLDDQTLIKNTIENDKPLDSTNLDDQINEFD
FYNQKEQAIDEICKTLSELKIKFNIHISQAIDSKVKMLFNEDLISILIET
KNYKNQIHNIEQLLEDLFLQNDDQLVNAQIASELFMLLDSKIISLTNDVI
VLKTQTKAQANLINDSMLDNHVLQQIYNWFKKPYLIFAIDKMKWDEIKTI
FIDLKNKNKLSEYSEINLKQLKEKYLTINDEIDQDLINKAKDLFNDDFMI
GD
>MSC_0682 fpg, formamidopyrimidine-DNA glycosylase
MPELPEVVTVTNTIKPKIINRTILNSQIFTNKIISSTNVDQFINLTKNQK
IYDVYNLAKYIVIELKEHVIISHLRMTGKWVIENSDQYAYKKSWLRAELL
LDNNLVFRFYDMRGFGTLNLYNKQTFLKDSHLDKLGPIPLNNQTSADYLF
NKLQKSNKAIKTVLLDQHVISGLGNIYVNEVLFLSKINPLVSANLITKDQ
TKEIIKNCETVLSQAILLKGTTISDFESLPGITGGYQTKLLVHMNNKNCK
ICDTKISKIKVNGRGTYYCSKCQN
>MSC_0007 gyrA, DNA Gyrase Subunit A
MNNENNNNDSLNENQDHYHGKISPIDISTEVRKDFLEYAMSVIVSRALPD
LKDGLKPVHRRIIYAMNDLGITSDKPHKKSARIVGEVIGKYHPHGDSAVY
ETMVRMAQEFSYRYPLIDGHGNFGSIDGDGAAAMRYTEARLAKISNYLIK
DIDMDTVPFIDNYDASEHEPAYLTGYLPNLLVNGTMGIAVGMATSIPPHN
LKEVVSAINAYIDNNDITIDEILNDHILGPDFPTGALMTNGSKMREGYKT
GRGSVIIRAKIDFEENKKHDRFVVTEIPYQTNKAKIIEKIAELVKDKTIE
GIFDIRDESNYEGIRIIIELKKDANPDVVLSKLYKYTALQSSFSINLLTL
NNNLPVLLDLKTIIKNYVEFQVSVIIKRSIFEKNKLTKRYHILEALHIAL
DNVDDVINIIKNSKTSEEAKVELTNKYNFDEEQNKAILDMRLQRLVGLER
DKITLEMTNIKERLTYLDVLINTKEEQDNVLKNQLNEIADKFGDNRRTEL
IDEELINIEDEELIPDLKWMILLSQEGYIRRINPDEFRIQKRGGRGVSVN
AEPSDPIDIATMGKAKDWVLFFTNSGKVYRTKLYNIRSYSRTARGLPIVN
FLNDLTSEDKITAILPLRNNKEKFNYLTFVTQKGMIKRTKISEFENINRN
GKKAINLRENDQLVSVFATTGQDTVFIANESGKVIRIKESVVNPQSRVGG
GVRALKLEDDDVVVGAISSFKLTHITTVSNKGLFKKTPIDDYRISGRNGK
GIKVMNLNQRTGKFKAIIGARETDLIMIISSDGNLIKTKVSNIPSLSRNA
SGVKAIRLTDNQEINAITLEYRKHGLENEDFEED
>MSC_0006 gyrB, DNA Gyrase Subunit B
MSQEYSAESIKVLKGLEAVRTRPGMYIGSTSKTGLHHLVWEILDNSIDEA
MAGYADLINVTITKENEIIVQDNGRGIPVGINSDTKKSALSLVFTQLHAG
GKFDSESYKISGGLHGVGASVVNALSLYVEVEVYRNNIHYHQLFSEGGTK
ESELQQLGHTDLRGTKVKFKPDPEIFKETVVFDYEVIKNKVKQLAFLNKG
LKITLTDERIEKTVEYLFLNGILDYIKEKNETKNKINPNIFYVDSKYEDI
EVEMALQYNSDYQENIITFVNNINTHEGGTHEDGLKQVLIRDINRYADTV
IKNNKTPSKFSWDDIKEGMMCILSVRHTDPQYEGQTKTKLSNPDAKEAVN
IIIGNAFEEFLLKSPEDAKAILDKNVNAQKARIAAQKAREETRRKSALDS
FSLPGKLADCETKDSSIAELYLVEGDSAGGSAKTGRNRKFQAILPLRGKV
LNVERVTEARAFSNNEIKSIITAIGTGIKEELDLSKLRYKKIVIMTDADV
DGAHIRTLLLTFFYRYMKPLVANGHIYIAQPPLYKIEAGKKIAYAYTDSQ
LDELKNNEFNNLKYTIQRYKGLGEMDPLQLWETTMDPQQRTMLQISLEDA
TLANEVFSDLMGEDPELRKIYIQDNAKFVENIDF
>MSC_0045 holB, DNA polymerase III delta subunit
MKKEQVISRLKKLIDNNSLFSNIILNCKDEQTSWDVIYQIIYYAFNKNVK
DLDFNKLKDQIQNNTHVDILTIGNNINITNQEILDLINKMSLSATATQNI
KFFIIKNAQNLKLSAANSLLKFLEEPPINTYGILLTNNYSEIINTIWSRC
QLINIDNQTQLDNKLNRFEELLISKNKDEILLFNKEMKTMNKNELVKLID
DAYNRTIIYQFANLISCTLEILDDLKFLPLTNIAIDNYLIRIVEQI
>MSC_0764 ligA, DNA ligase
MSKDKALLRINQLKEQLNLWSKQYYVDDNPSVDDTEYDLALKELISLETL
YPELITSDSPSQKVGGMVSEKFLKITHKTPMLSLGNVFSFDEFLDFNTQI
SKISNTLDNQYVAELKIDGLSISLVYENGSLVSAATRGNGVVGEDVTINA
RTIKSIPLKISKKERVEVRGEIYLSKAEFEKINQKRLLNNEDLFINPRNA
AAGTLRQLDSKIVASRNLDAFLYYYISDDSNNLTQYQSILKLNELGFKTN
KETMLCKNLDEIKAYIDKYTNLKNDLDYQIDGIVFKINDKNLQNSLGFTS
KIPKWAIAYKFPAEIKQTKLLDIFATVGRTGKITYNAKLEPVFLMGAKIS
AATLNNAEYIKTKDLRINSIVKIKKAGDVIPEVIEAIKDEDFYKLEKFKP
ALYCPNCHSLLEKNENEVDQFCINSSCSMKILRSLQHFSSREAMNIVSLG
DRSLEILFNLKIIQNISDIYKLEEYKDQILAIDNFGLKSYLNLIDSINMS
KNNSLEKVLFGLGIRHIGSKTAKILARKYQNIDNLMSASYDELIQINSIG
ESLALSIIDWFKIEDNLKLIDELKSFNINFNYLGAKINSDSIIANKSFVI
TGTLTRPREEFKTLIENNAGKVIGSISKQTDYLLAGNNVGSKLEKAKKLG
VKIIDEQQFFDLLKSEKG
>MSC_0617 lpp, Prolipoprotein
MKKLIAILSSVMMISTASLPVIACHKKEYKFETNNSLTNTKTAVSLFAKD
FILADQLQLNFQEIRNLNENKNLELLTKQNNLSLDKDELSLDSLKSTNQF
INKYFDQNSYKNVLDKNIKLDSNKSLNNFVLDEIFKLIGLGTSDIDKLSA
DILKVLELTTNLNPMFLLSDFDSVNSTLKSFFKKVKPYLKDGLEKLSNPG
AKFEDEVKAFQEKIDVNNKFKDLKVEDLDNAFYVSLSNAIGLSTVGSSYT
PIELKTGEASKSLKQASEALTKALNGPAKSKNGQEWNIVAYILQSLQFLQ
IKLSLFEEARDYTPNSYTNLFSASKKNEEFIKSIYNSKTIKEITKDKQSS
INIKYIFSFIKKTVDELKDETKKDGFELQKLLGILFLTSNKVEYSEDSSK
DDSKEYYDNSKAHPSLTILADLAKQALNEKLKPLLALISKETTDDQIQKV
VDTLIQQLYKWISFTLNSLLTGENNLNKCLTTLFAKGLPVIIESIQKNTK
LIPPEVSAQLGFLPFLLIPLINKILAIAFPILYSDSEKPAANSFKDLYSG
QVFLVNKVNEMFKVFRKTLIDLLTQVKADTKSIPYATIRGFLTTLENGYD
KIFKNVKEFNLKGLLTTPLNKINETWWKDQPISKSLQEQSVTDILDSLLN
DLDVKGNEKIDQVKDSNINLTSLTEVAKLVDNYTYKVKDINTSGKIHLLE
ILKNNPEKTLEILGWTSDKNNPIGKDSLIYALLTKVFNVNLDKKDDKSQN
AINQISKIISTVNKSIEIKTNWNSVEISFEFKDQKKNKFDQLLSETLIAK
VKNKKDNSQSTYSFSYSRDKQDKFKFTKITKN
>MSC_0419 lpp, prolipoprotein
MKKILIFLNSLLVISLTSLVISCYQLDTQTLEMLSTKIKDELNEVDLKIT
NLQKENNSLESQIQELKNIEKISYKFEPFTRLYDKANGYRYKASKINLPI
KNQYFKTKIQLNKFKNNQQVKVLLNDIQIKENQILNQEKINYQLKQNIED
TKEDLSKFWGYGIDPVYNKNQLVKIGYFLTRDFEIQIEKIKPETNQVPDH
LPKEITSLKLAFQQNINSKIKGIENWKTDNIKNMSEMFEKAKNFNQDISN
WNTSNVIKFNSIFNEAEQFNQNLSSWKTNNAINMDSMFVDAKNFNNNNQK
LVWNTKNVTNMRSMFLGASMFNQDISKWDVSNVTDMSNMFFRATSFNQNI
SNWNVLNVNNMSKMFFNASSFDQDLSKWKVNKNVVFDDFAFRSKIDNDNK
KLPNFNN
>MSC_0540 lpp, Prolipoprotein
MKKLLTILSSFGLIATTGASVVACKNDQSISLQPKKSENESLGSATKEEK
KEEKTDNNQPSSLKSTEDQNTSLTSTPDNKELGSTGSIQNKEEEVTKIKG
QLEKLKESEQKAKVLLKQIEEGNNKAKEAAEQEKIRNELEKLNAQKPKIE
EALKQIEETKKQLEAKLQSLQTNTTESSN
>MSC_0625 lpp, Prolipoprotein
MKRFNKLLMYASSSTLLLPITLLVACTPSKVVSKPIDDNEFSKLINSIKT
EDDLLKYADIKFKDQRGSEISKGNILPSQLKKENISIIFKGKYIGQISTE
VLNVNVINQDSSLGNKVNIFVQFTNKKTGTKIPTSFIISGLNENGNFDFS
GTRIVNDLDYFGGLSGFNDYSNKTQEQRFDYDNSRYITGLKNHLSSGTGT
VDLKKLRGLDTKEEQIRTFDKLAKEVKFDSYYNAALKGFTLPVYDNSGQF
IGLSVNDEQEIGKIASHVDSLGRTEKAKTNGLARTIPNDTYRTAAIQTYQ
VNFTIYKDYAKEIEEAEDNIQLFNKWDEKQIQSYISAQLNQLRLNFEDEV
SQIEKQLSQPLDGRTTIVENLNKRKSEITTEYEKKLKEISSLNRDNLVEW
QKKEIEEYKKKREEKIFRTSESGTMWIMDYIDINNPTKFYFGTNSHVAKG
IRDDMISFSLTRLNSNIKVGQTFGLNSADKNFTKFTFEPAKKEKKLNEAV
TAIFHATDFIKKESSPLSLLKDDQKTKYSEAGLFADFAVVEIDFEKLLDT
NNFSRTIWSDSSDISSKYQDDKNELIKKITNDYAKSDKKISFASESILDD
KNYKKFDRKLDFNPENSDELKAYRDLDSLYIVGYPTSNEDYYLDQYEDEK
QLKNKKYDFSLWINNEYKYYKKLVNGEGTSSSFKDYELEKGNFFSYQIGY
RSFIDKPGLTDAFLAANKVGKKLYSLDEQNKGKAKKYFNYGLEILPRFYA
PAGGASGSSVRTKDNKLLAVYHAANGWAKTGLAAVFRSNGYDYKNLFGPY
KLGQYDLIYGGGKDQEKGKSYREALLKKYNNDIKSALLPNGFSDDKVPQQ
FKFDNGKQK
>MSC_0775 lpp, Prolipoprotein
MFKSKKLIIPLLTTLAVVPSLVVVSCKNPLFNQSLSEKIYLNYNLQTEKD
KQEFENYNQINMLSEINQYFTKHDHNKDLVKFTTDGASGDTVEFNNIMKN
NYASKYIKFDQDKFKEIIKKEFNLSDSFLKRLEFEVDYNNISRDYGNNFD
VIFPIRVKLPLVSHNNFKYQQGLFIEQTFKFRIKNVKASGSEKIDVSKIK
DIYNELVKLKDKNNFTASVKTVTEETKKLVDEWGIHELNSTQLSSIFDIK
TEEFDNLIKDKKEVEHKVTITDVDLSDPSLAINEGLLKLRLGVKIKGKET
ETGVNVWIKFNFDQKDTFWKELKISESIKVNTVKFSETNTDFTKLMNDNL
IIKSKSKFIKNIKLSSIDKTTDYRNSGVLLEVLTNESKDNVIKLHKKPGV
GKYTDLYSADFTKNNIHAPNFATEKLTQENLKSINKDFFRQFDSELFSGG
YARSRGFYSEKVKSPKFMHIGEDYIANDFQAVLMPYDGEIIAAYELSTNV
PFAGVGTVLVAKVPITSLPWSPKQKEIELNDNKTHIYISFLHLDAQRTLN
NDKLGWVAETAKLKKDKTVKVVKSVTPSTPKKVSKGTVIGYLGDHSSNGG
WMSHAHINLYTNRPNYLSENYFSSKTIRAQLDDKRAKGYKSSVSNNDFSA
IGNIGVERKIDTKIYQVDPKTGIEDKQKAISDEIPLYFNGLSMLGFEKTK
GYANPNLMYKLRDERTVSFSVKEVNKL
>MSC_1021 lppQ, Prolipoprotein Q
MKNKHISLLAKLEVLFSVTSLPLVVVSCKTSNFNNNKPNNNQQKKEQVSK
IDISSFKDKIEPKNEWQKQDVLQALLKIKGLDKLTQNDFNFNIKKANLLR
NGQLIIKSKDDSKIIKGELSLEIKKLNRVKKVETKYNDTRTEVLVIGYDE
NGKISGFAQTVKKVPEKLPEEIISLERAFLKNNSDKIENLDKWDTSNIVS
MSSMFQQARNFNQVLSNWNTENVTDMNYMFDGATKFNSDLSSWKTANVKT
MRSMFSDTKQFNQDISSWNVSNVKNMKNMFYRAEKFNKSLSDWNLKNIQE
LDHMFFGASEFNSDIFKLKNNLVTDMRYMFFQAKKFNKSLDWDVSKVVNM
DSMFNGAHDFNQNITNWNVSNVKTMRSMFSDTKQFNQDIKNWRVDNVTDM
DRMFQNALSFNKDISSWNVKNVKSYDAFGWHIKKEFKPLFEKNNK
>MSC_1046 lppQ, Prolipoprotein Q
MKNKHISLLAKLEVLFSVTSLPLVVVSCKTSNFNNNKPNNNQQKKEQVSK
IDISSFKDKIEPKNEWQKQDVLQALLKIKGLDKLTQNDFNFNIKKANLLR
NGQLIIKSKDDSKIIKGELSLEIKKLNRVKKVETKYNDTRTEVLVIGYDE
NGKISGFAQTVKKVPEKLPEEIISLERAFLKNNSDKIENLDKWDTSNIVS
MSSMFQQARNFNQVLSNWNTENVTDMNYMFDGATKFNSDLSSWKTANVKT
MRSMFSDTKQFNQDISSWNVSNVKNMKNMFYRAEKFNKSLSDWNLKNIQE
LDHMFFGASEFNSDIFKLKNNLVTDMRYMFFQAKKFNKSLDWDVSKVVNM
DSMFNGAHDFNQNITNWNVSNVKTMRSMFSDTKQFNQDIKNWRVDNVTDM
DRMFQNALSFNKDISSWNVKNVKSYDAFGWHIKKEFKPLFEKNNK
>MSC_0200 mod, adenine-specific DNA-methyltransferase
MKNIKPVISYYGLKYRMLKNIFSVLNLNSSDIFLDLFAGSGIVGVNAKHL
FNCQTIINDYDNVLPLNLTYALKNILSFEGNLKNYTKKRLDYFIKRLDNG
WIDKLNQYNKILQTIDITLFDYKQILKNIINNKNKVTKLYADPPYFNKTG
MYKNSFTIQDHIDLYNLLSELKIQTKIVISYNDEPFIKELYKDWNIIEIN
KTNVCGINKNSSKVKELLITNI
>MSC_0105 nfo, endodeoxyribonuclease IV
MDKILLGCHVSMNKQNNYLVGSVNEAISYKANTFMIFTGPPQSTLRTNTN
HLYINQMHELMNSYKIDAKDLVVHAPYIINIANSVDQNKWKFTVDFLIQE
IKRCEEIKIPTLVLHPGSYTTGNYKDSLNQIIKALNIVSNYQVNVKIALE
TMSGKGTEVCSRLEDFKYILDNVKNKDKVGVCLDTCHLHDAGYDLSKWTE
FKEQMKQNFSLDKVLCIHLNDSKNMISSHKDRHANIGYGYVGFDTLVNVV
FDKDFSNISKILETPYIDKTPPYKIEIEDLLNKTFTNRL
>MSC_0511 parC, TOPOISOMERASE IV SUBUNIT A
MSDKSEIILYPLEELLGNRFSRYAKYIIQERALPDVRDGLKPVQRRILYA
MNQLNLTFDKPYKKSARVVGEVIGKYHPHGDSSIYDAMVRMSQWWKVNIP
LVDMQGNNGSIDGDSAAAMRYTEARLTKISNLLLEDLEKNTVIFSPNFDD
SETEPTVLPSYFPNILVNGATGIAAGYATNMPPHNLSEIIDATINIIKNP
NITIDQILKIVKGPDFPTGAIIQNKQGIREAFLTGKGKVIISSKWHQEKN
NIVIDEIPYEVVKQDLVKKIGDVIDNNSNLGIKEIRDETDRKGLRIVIEL
NEKANLETVRKFLFKSTWLSVSYNYNNIIIVDKQPKQLGLIDIIKAYISH
YKEVFIKRTEFNLNKANLRLEIVNGLIKALSILDEVIKVIRKSENRLDAI
NNLVITFKFTTNQATAIVDMRLYRLTSTDVNKLLLEKTELIDKIKKYQEI
LNNSLVLDNEIISRLEEAKKQFGIKRKSQVEDLVEDLDVDQKEVIIEKEI
NLWISKDGYIKVIDNNILNKNELSSFGKKPNDMWISQGVCSNLDHLILIS
DQANYYSIPLYKISTSKWKEQGVHINSVATTQPNETIINALVIKEFINST
QHLLLVSKNGLIKRTQISDLETKIFNKSFKIMKISDDDSLVYADLISSKT
SYCCIITKNGYAVRYNIEDIPVQSTISKGVKAANLKDDYIISALSLQNNK
DVLIFTNKNNYKRLDQNLIPIYIRPKKGIRILVEKKKNKEQIIFGFAIND
QMSISILDTNDQITDINVSDLKHTNLEQNSISTNIDEISYISIKQLIRSI
AFDQPALANNFNDQDTNDQIETKPKFNKLVSERIVVSKDTKNKAISNANQ
VHGSDLSSYLDDISSLLSKVSQNKKDKKTKQLDFEDYFSDDDQNQDDN
>MSC_0510 parE, DNA TOPOISOMERASE IV SUBUNIT B
MAENGKYDESAIQVLEGLDAVRKRPGMYIGSTDNRGLHHLVWEIVDNAID
EALAGYCTQIDVILEKDNSITISDNGRGIPTGMHKTGKSTPEVIFSILHA
GGKFDSTAYKSSGGLHGVGSSVTNALSKRFKATIYRDKKIHEIEFKNGGK
LEKPLTFIANTYKTGTTINFLPDDSIFSNTKFNFSLISERLKESALLNSG
LKITLSDLISNRYVEYQFQYGLVEFIKELVDDKKVITDIITINNESKNII
AEIALQYTEDDNEIILGFANNVKTSDGGTHLVGFKSGLIRAINDYAKEQK
ILKDKAKLDSNDLREGLVAIVTVKIPENLIEYEGQTKSKLGTSDAKTVVE
QIVYEFMSYWLIENKVLANKIIENAFNAQKARIAAKQARQAIKSVKGKKN
INKLMLGKLTPAQGKKREINELYLVEGDSAGGSAKTGRNRKFQAILPLRG
KVINSEKAKLVDLLKNEEIQSIINAIGAGVGKDFDISDINYGKIIIMTDA
DTDGAHIQTLLLTFFYRHMKDLIVHKKVYIALPPLYKITFNDKSFIYLWD
EEELNEFNKTNTKKYEIQRYKGLGEMNADQLWQTTMDPKNRKIIQVTITD
GLLAERMFKTLMGDDVEKRKLWIQENVKFTLEDDQIKIIEMEK
>MSC_0769 pcrA, ATP-dependent DNA helicase
MSVDHLLDLLNSQQLAAVLNTDKPVRIIAGAGSGKTRVITTKIAYLIEKK
DIDPTRILAVTFTNKAAKEMKERVLQITKNQKKSPFISTFHAWCSKVLRI
DGKHVGLKDKFLIIDSDDQKRIIKNALKESNIELSENDKKTFDKKILYKI
KEWKEELVDPDEAILNANSTYDRNSAIIYKLYQETLLKNNSIDFDDLQIY
VYLLFKNHQEILNKWKNSYDYVLVDEFQDTNDIQFNLIKFLTINTNHLTA
VGDPDQTIYSWRGTKLDIILNFNKTYSNAISIVLNQNYRSTKQILDISNS
FIKNNKFREHKEIYTNNKTGKKVVLKECNSKTSEASYVSFKIKELIKQGY
HYKDIFILYRMNAWSQEFEKELINRKIPFQLIGGIKFRERKVIKDAMAFL
KMISIKDDLSSQRVLSLIPKIGNITIEKIINTANLNHISIFDLITNKDKT
LLQSITKNLDELIEVFKTAHQLYLDNTNIEEILKYLLIQSGYGNKLKVKK
EQDDLENINALYDQLKRFDEDFDPKYYSEENKLIAFLQEEALTSDIDEAQ
QIDKVSLLTVHAAKGLENKVVFITGLNQGIFPTRLSENNQKELEEERRAL
YVALTRAKEELFLTYVKGDYSHIIQSELKPSKFIHELDKDLYEFESQFLN
SQIYDKNQHKTPSFYVSPKQHNLYNVGDHVEHKLFGKGIITKVINDQLQI
SFTNSSYGVMIIAANNSALTKI
>MSC_0307 pkn, serine/threonine protein kinase
MPKTKKDLRINKEELLNQVVNNRYKLIKYLNSGASAVVFKALDLDASVLE
KKDVFIAVKIILKAKNKNIETIKKRLFLETNIFAKLSFSKNIVKMKDVFS
WQNYYVIVMELIEGVDLNKKFNAYNNVLSNKEFIYYFLEITKGLKEIHDN
NIIHKDVKPANILITNDSKVRISDFGISIIKSIILDDHHNHISPGTPRYT
APEQFINFESRKDALYFESDIYSTGVIMYEFLTGSMLYLNYGSNHTSSKE
KELTNFQQHILKDITRPREINPNISQALENIIMKCLAKDYKNRYHTFDQI
IKDLEQAKQQPDVNIDFPNMWWEDENYLNIKNNNTLKYKYFFKNTNFKYF
LFWISIVISLFIIFLIVLILK
>MSC_0683 polA, DNA POLYMERASE I
MKTKILVVDGNSLIFRAFYATAYSPNTSLLKTKSGVLTNAVYSFINMLLS
VIHQRGPYDHILIAFDKSKKTFRHDLLSDYKANRIKTPNELVEQFSIVRE
FLTKANIQWFEQENIEADDIVGSICKYAEKQFDDLQAEILSSDKDMYQLI
TNKVICLNPVQGVNELEEVDTNKLFEKWQILPNQVPDYKAIVGDSSDNLK
GVNGIGQKGAIKLIQQYESLENIYNSLEQLKGAIKTKLEQDKKMAFLCKD
LATIKTDVVLENFSFNKLDFNVDNIYEFLNKYEMYSLKKRFTNILNLDFN
PYQNKKQNLDVKIINSWSKDYEDSINYLYVESLEEDYHKDKIIGIGISNN
KGNFYLDFKNKAQQLSFFEDTTLSSTDSLFEEFLNNSNLKKYTYDIKKTT
YLLKNHKYNVLASNFDFDFMVACYSLNANVISDLSNQIKLVDNMIELETI
DEIFGKGVKKNPDIDLDIKSKYISKKAYLLKKYSDQLIEQLKQTNTYDLY
LKIDHPLIEVLYDIEVQGILIDKEQLKLQTQQILKKINHIEGQMKILVAE
EIDNNFNFSSPKQIQELLFDKLKLPNLEKGTTSKEVLEKLITFHPIINLL
LEHRKYTKLYITYLKGFEKFIFDDNKVHTIFNHTLTNTGRLSSSYPNIQN
ISIRDNEQKEVRKIFITNNNKTFLSYDYSQIELRVLAQMSKESNLIKAFN
QDADIHLQAAKLIFNLSDDQITSEQRRIAKVFNFGILYGLTDFGLANDLN
MNVNQAKQMIKDYYSAFPSLLDFKEKQVEIATNQGYITTLSNRRRYIDEL
NSTNHNIRQFGKRIAVNTPIQGTASDILKVAMISIYKKLKEQNLDARIVC
QIHDEIILEVNDNQLEQTKKIVVSELENALEKLFLDLNIKEQVVVKLKVG
ESVGKTWFDLK
>MSC_0091 polA, DNA POLYMERASE I, 5'-3' exonuclease
MITNETKPILLIDGYHLLHKGYYGTLKRTIVSKNKDGIVINAIYSFVANI
LKFVQSDRYHSVIVAFDFDENCWRKELYSEYKAKRKPTPIDLIPQLQIAR
DFLTSANISWYEKYNYEGDDVIGSICRIANKLGYDVCILTNDKDIYQLVN
NKTSIITNISKKEKIKIIKSEQVYEHFLCQPNQVADIKAILGDQSDNIKG
VKYIKRKQAESLINKYENVENILDHINELNEPLKTIISENKQLIIDNKKI
TKILTNVKLGRINFKPTKITYYRLIRFLKEQEMYAFIRPIRKYLERTNKK
TVNK
>MSC_0355 polC, DNA polymerase III alpha chain
MQTKILGIFKKIGIELDQTDYIYFKDAILVETPRISQIKNKGYLHVEIKD
FLPIDVLKKIEDKLKNNQYFNFKLIIDVKNQQFNKDLLIQYLEFIKMHKS
LFNNRASWKLLDIYNFELINNQLVFLVNSQTIKNEISLELDYCLAKLNQF
GFKDLSYLINVNEISLDTLDTKQQINKTYDKPEYEQQIIKPIEKKPSVNN
SYKNKRPSLDKPSYNSLLDVEDDAQNIVIQGMVINKEFKLSKTGRKIFYI
DITDFQSSIRCMYFAKSDALCEFDDLTEDELKSKEIEQIKENKIQINDWI
SVKGKTSLSVYDQEQIFYIDDFKKIKKQVGLRIDDAKIKRVELHAHTKMS
VMDGVSDPIDYLELISSWNHKAIAFTDHTNVQAFPDIYKALNSVNKKRSD
QDKIKAIYGLEMNMLNNDLWYVKNPKNQKLKDARMVFFDLETTGLSPELD
EIIEFGAIEYNLKTGERKKIDILIKPKTKLKAFTQKLTNITEKMLEDKPS
IEQAFKQINEIIKDAILVAHNANFDFTFLSYWSEKLGYSKLENTIIDTLT
ISRIIYPDLKSHRLGSLAKRVNISYDPSIAHRGDYDADILADIYERMLDE
TRKKIKIITDSDWNKIDPINYADNLNYYKNKGFHTNILVKNQAGLKELYK
LVTKSHTTNFYSSPKIFKDDLIQIKQNNNLLFGASCVNSEIFELARTSTL
ENLKQAISFYDYIEVQPISVYKNLLQNDSLDLDQLKEIITNIINIAKQEN
KLVVASSDCHYTNPELKQIREVYINAKGLGGIRHPLFDFNNRVKDYPDQH
LRTTKEMLKEFEWLNDDDLINEIVITNSNKIADMIDSNVIPIKDGLFTPK
IANVNEKLKDKCYQTAKQMYGEMLPEIVEKRLEKELGSITKHGFAIVYWI
SHLLVKQSLDDGYLVGSRGSVGSSFVATMAQITEVNPLKAHYRCLNCKYS
DFNTDPAYKCGYDLPEKNCPNCNQKLIGDGHDIPFETFLGFNGDKVPDID
LNFSGEYQNQAHNFTKKMFGENNVFRAGTISTVAEKTAFGYVKTYFEETK
RDASLPRKTEINRLAKLAQGVKRTTGQHPGGIIILPNEYEIEDFTPVNYP
ADDLNSTWKTTHFDFHSIHDNLLKMDILGHDDPTALKMLRDLTNIDPITI
PTDDKNVYSLFSSLQALNLTSDKINDEITGAIGIPEFGTGFVRNMLKETQ
PKTFADLVQISGLSHGTDVWLGNARDLIKDKKADISTVIGCRDDIMVYLI
NMGLESSLAFMIMESVRKGKGLKKEWIDVMKQYNVPDWYIDSCLKIKYMF
PKAHATAYVLMAYRIAWYKIYYPTEYYATYLTTRADVFDLKTVLGGYDAV
LLKLKSQQQRVKNGEKLSKKEEDLEVVYEVLLEMFARGIKFSNIDFEKSE
ATKFKVDILQDNSKIIIPPFNVIDSLGEAVALSIINARNTKPITSVNDLK
NRTQTTQTQIKIFEEFNILDSLSVDEQLAFDF
>MSC_0421 recA, recombination protein recA
MSTEIQKIEDNNLKESQMWNSKELKEAIKEIEKMFGKGSIMVLGQSDNLN
VETFSSGSLLLDNALGIGGYPKGRIIEIYGPESSGKTTLSLHAICEIQKL
GGIAAFIDAEHSLDPKYCHNLGIDTNKLLVSQPDNGEQALDILEMLINSN
SIDLIIVDSVAALVPKTELEGEMSDQSIGLQARMMSKALRKLNGLIAKSN
TTVIFINQLREKIGVIFGNPETTTGGKALKFFSSIRLEVRKSENILSNSE
IIGNKIKIKIVKNKTAIPFKTATISLLYNKGIDKIGELVDLLVSYEIVEK
SGVWYSYQNEKIGQGKTSVIQWLNADEKRTNELIQQVKKIIEQKE
>MSC_0649 recD, EXODEOXYRIBONUCLEASE V ALPHA CHAIN
MVDNNLFSQFISAIGQAKKIVFVGDINQLPSVEIGNVFEDIIKSKKITTV
ELKSTHRQANNSKIVELAYMIKDNNFDLKKLYENQTTENQSKTDLQTIFV
DNQEQCLKEIINNYNLDNKTGFHEPYKIQIISPFKDDILGVKDINKFIQN
NRFKQEKLDENTELCIQINNSEYYQKDKVMYLRNENDLSNGDIGVIQKIH
KDSNELKINFNDKIFSTINNASLKNLTLCYACTVHKTQGSEFEKVILVLD
PKRSSSFIDKKLIYTAITRARKQLVIIANKNSFINGINKNPETRDTTLVK
HIKLMYKKRKKYHYSYIF
>MSC_0512 recD, Exodeoxyribonuclease V alpha subunit
MEHIMEEQIKIRGYLSKFLYKSDSWALAIFTSENNSNKTIKIKGEISDLK
PKILYELIGKQTSHIKYGTSFEVSSFTLASINTEQQIINFLKSDVFPGIG
NLTASKIAKLYSKNFIQEILNNKEQFFQIKDISKDKLELIYNKIKQINEQ
NFLRIEFINHNLNLKILDKLKKFTDDENEIKQILTNNWFEFSFKHELGLM
QDIDKIFLHFNNNDINNLTRISYWHLYACNEILFNLGDSYTYKNYLFKKV
SDLIKINDLDLLNNGLDYAIKNELLVLKNEKIYTYESFNDELVISEVLVN
NHLKTNDIDDNLLTSYINQIQLNTQIKDFKYDKDQVLALKNFIKHKISII
TGGPGTGKTTIIKAIVELFKKVYHSSNYAICTPTGRAAAKIRENFTESHA
TTIHKLLGYEKDNKFSVNKYHPLDYDLLILDESSMIDARLFSQFLSSINN
AKKIVLIGDVDQLASVGYGNVFYDIINSKIICTTNLINIHRQSFNNGIID
LAYMIKNDNLDLNKLQNLDNVEFFFNKDKQACLEYLKTVYSKTINQTKSI
QVISPVYADILGIENLNNFIQYNFNQNILDTNNVYDRLRFRYAVNDKVMY
LKNDSELNLSNGDIGYICQINKNNNKFNSAIINFNNNLFEFTSEQFDEIN
LSYCCSVHKTQGSEYDQVILVLQDTLFSSFLSKKLLYTAITRAKKQLIVI
GDFDLFIKASKKNARIRKTTLVEHILNKLNN
>MSC_0463 recO, DNA repair protein recO
MEKTLKGIVLNSFDFQDYDKIITIYSNLYGKISLVCLGVNKIKSKNKYGI
NYLSYSNFEIFKSKNKFNLSKLKRSELINSFNHISTDFNLYLYANIITSL
VLSLDEQIKNYNLYKTLKLSISIINNKSDFGLKVCVLFMFYFLKIIGNQI
DLSKCGFCNSKINPIIAISFTNYCSSCKFCYFDDCILIDNQLSNFINSIF
KNDFITNLSQEISTNNLNILTRFILNYYQDIVGTYTTSMYLLSTLIRFN
>MSC_0047 recR, DNA REPAIR PROTEIN RecR
MIKMLETTFDEIIESIRTNQGLTKKTSERLLVDLLINKDKLNQFIDQLNK
TKQLISTCKICGYLSENDKCLVCSLENRNQNIICIVSTILDAKNIENANK
YKGVYHILNGEINLNKTLHLIS
>MSC_0430 rnhB, ribonuclease H II
MISRKSFDDQIKIVHNITYLSGSDEAGRGCLAGPLVVASVILKPDYFNPL
IKDSKKLNPKTRQVLYDEIIKNCLDYQIIIISSKQVDELNPLQASLLGFK
TTINNLKITPDLALIDGNQNIELTNIKTLQIIKGDDKSFSIACSSILAKV
TRDKILDEYDQIYPNYDFKSHKGYCTKKHLLAIQKYGILDIHRKSYKPIK
KISKETS
>MSC_0542 ruvA, holliday junction DNA helicase RuvA
MNDYINGLLHKIDDKYLYIELNNSGYRFLYLKTDLKDLKLNQNNQVYVAI
NVIDNVFKYYGFKNQLIRDLFELLININTIGEKTAFLILENYNYNELIDI
FKNGRTDKILQLKGIGNYTARLIINSVQKELFNNKISDKKNKVITSLEKL
GYKTKDIYKIIINIDEDMNIEDLTKYVLEQLSYLHN
>MSC_0543 ruvB, Holliday junction DNA helicase RuvB
MFRPEKWEEYIGQTNILDNLKVFIKSAKKQNKVVDHIFIYGPSGYGKTSL
AYLLAKQLKTNIRILNGPNLQKASDIISILTSIKEKDVLFIDEIHAVNKE
VLEIIYPVLEENKLNIIIGKDYNSKVVNIDLPKFTIICATTEINKIPLPL
LNRFPINFTMQEYNLEDMSKIIELYCNKFQIKLDQDIYLYLASFCRKTPR
IAINLIKRIKDHIICDNPKIIDVNYIKKVLDKLEIYELGLTITEINYLKM
ITKNKRIGLDNIAHLLNTTPLIIANNIEPFLIRESLIIRTIKEREITYKG
LKYIQNF
>MSC_0026 ssb, Single-strand binding protein
MNRVNLVGRITRDLELRVAKNGSKFVFFTVAVSEFSTKEEKTNYIPCSAF
DRTAENMVKYLSKGSLISVEGRITTRNNQTPDGKFETIVNVLAERVNFLE
PAKNRNNMTTEQNDNFTPNQPTQQNSDSSFDDLVVSDDDELSILWE
>MSC_0699 tnp IS1138-like, transposase (IS1138-like)
MNKLGLFCHVRTTCKKKREPKNTNVYYPNIANRDYNGQLNDIYASDVSYI
PAPIDVDGRHVYLSILIHHRTKKIVSYNLSIHNDTNLIMNHFRKTKFPKN
FIIHTDHGATYSSIEYLEYIKQQGGIVSMSRIGNSLDNREAEYFFSILKS
EMFINFEKKVKEITFSQLKKSIKNFIEWYNSERILRKFKWKTPQELLNFY
FINYKLLFNKFSST
>MSC_0551 tnp IS1634aa, Transposase IS1634AA
MKKSRNDVKGQWRTSIARVKKGEYLSIGVPRSDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADSNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0538 tnp IS1634ab, Transposase IS1634AB
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKLATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KDAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0245 tnp IS1634ac, Transposase IS1634AC
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADSNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLKLHHLHSKFKIRTDWK
KQNHWA
>MSC_0520 tnp IS1634ad, Transposase IS1634AD
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLPWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQEIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0363 tnp IS1634ae, Transposase IS1634AE
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGVATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDIDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KDAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0249 tnp IS1634ag, Transposase IS1634AG
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSCNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0859 tnp IS1634al, Transposase IS1634AL
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRGILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0785 tnp IS1634am, Transposase IS1634AM
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNVKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADSNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0098 tnp IS1634ap, Transposase IS1634AP
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRSDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YKMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0672 tnp IS1634as, Transposase IS1634AS
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKLATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0797 tnp IS1634au, TRANSPOSASE IS1634AU
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKKSFYRSLDYIAKNKDKILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0981 tnp IS1634av, Transposase IS1634AV
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KDAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0332 tnp IS1634aw, Transposase IS1634AW
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0862 tnp IS1634ax, Transposase IS1634AX
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKKSFYRSLDYIAKNKDKILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0802 tnp IS1634ay, Transposase IS1634AY
MKKSRNDVKGQWRTSIARVKKGEYLSIGVPRSDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFLGN
VADSNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKLATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KDAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0876 tnp IS1634az, Transposase IS1634AZ
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMCVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_1003 tnp IS1634bg, Transposase IS1634BG
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRSDNKGFVYRLGYGYLHELK
QYHDGPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNTKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADSNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNKDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQENWQTYQILLEL
LTKEKVT
>MSC_0605 tnp IS1634bh, Transposase IS1634BH
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKKSFYRSLDYIAKNKDKILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0060 tnp IS1634bk, Transposase IS1634BK
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KDAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0685 tnp IS1634bl, Transposase IS1634BL
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKLATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0697 tnp IS1634bm, Transposase IS1634BM
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKS
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKLATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSGTVPNVFINLKPYCWLHLFMFHFISVFKLHHLHSKFKIRTDW
KKQNHWA
>MSC_0976 tnp IS1634bn, Transposase IS1634BN
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFIIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0097 tnp IS1634bo, Transposase IS1634BO
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRSDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YKMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0172 tnp IS1634bp, Transposase IS1634BP
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKLATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0969 tnp IS1634bq, Transposase IS1634BQ
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKVIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0342 tnp IS1634br, Transposase IS1634BR
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRSDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGVFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0038 tnp IS1634bt, Transposase IS1634BT
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISSSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0074 tnp IS1634bv, Transposase IS1634BV
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVDSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_1011 tnp IS1634bx, Transposase IS1634BX
MKKSRNDVKGQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNVKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADSNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0934 tnp IS1634by, Transposase IS1634BY
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRSDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRGILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0168 tnp IS1634bz, Transposase IS1634BZ
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRSDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYKVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIVSSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHNVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEAFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0602 tnp IS1634ca, Transposase IS1634CA
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKKSFYRSLDYIAKNKDKILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIVSSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0989 tnp IS1634cb, Transposase IS1634CB
MKKSRNDVKGQWRTSIARVKKGEYLSIGVPRSDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNVKICANTNRKIDVLWFDT
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFSGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0921 tnp IS1634cd, Transposase IS1634CD
MPRPDNKGFVYRLGYGYLHELKQYHDDPLAIIKAIIANFPLSWTKEQART
KLDEIFKEKKETKKEVLERFKGYEVVEKLFDYFNIFNDCSPTKSTTLKDV
VLQLIYQRIKNPISVFNTYKTAKKEKIDTHSKNSFYRSLDYIAKNKDEIL
RNLNAKICANTNRKIDVLWFDATTTYFETFSREGYKKPGYSKDGKFKEDQ
IVIGMATDENGIPLHYKIFPGNVADPNTLIPFMLEIADIYEVNSVTIIAD
EGMSVNRNIRFLESKNWKYIISYRMKAGSKQFKEYILDEKDYINDGGLIY
KTRDIVSSYNKKRINGHFRRQIISFSQKRATKDKNDRDILIQNFTKKMNK
DNLVSCDDLAGSKKYRFFKPINKGAFYELDIEKIQEDQKYDGYYVYETNR
TDLSVKEVINLYSKQWQIESNFKTLKGKLSLRPMYLSTWNHIVGYICLCF
ISLVFLNYIIYILNSKLGLTGKSKITEHKVINVIKEVKEIEVFVNKQKIE
TIQVYNDELQESWQTYQILLELLTKEKVT
>MSC_0871 tnp IS1634ce, Transposase IS1634CE
MKKSRNDVKGQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKKSFYRSLDYIAKNKDKILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0814 tnp IS1634chbz, Transposase IS1634CHBZ
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
NKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_0822 tnp IS1634ci, TRANSPOSASE IS1634CI
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDGPLAIIKVIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNAKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_1036 tnp IS1634cm, Transposase IS1634CM
MKKSRNDVKGQWRTSIARVKKGEYLSIGVPRPDNKGFVYRLGYGYLHELK
QYHDDPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNVKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADSNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYVLDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNNRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSLRPMYLSTWNHIVGYICLCFISLVFLNYIIYILNSKLGLTGK
SKITEHKVINVIKEVKEIEVFVNKQKIETIQVYNDELQESWQTYQILLEL
LTKEKVT
>MSC_1031 tnp ISMmy1a, Transposase ISMmy1A
MAIPKDILKISRPSSTRVKTTSKEGIYNVIQRTSIRKNGKIIPVEKGVIG
KIINGVFQSIEKQTYEVDIKSYGLFALNEKLNNHIFRELLNFYDFEDARK
LYVIASLRTMFSDIKNEHLKHEYDTNFISEIYPKCALSSNTISSFLEKIG
KSSSKMEDFMNKRLEEFSNHSIVIDGMLKNNTSETNIFSEMSRKSRTKDS
QNLNLIYAYDINAQEPVASSVYPGNMLDYTAFRDFLRTYEIKNGFLILDK
GFDDKECKNLMREKNIKYLIPIKINHTFKKFNLKSGFNFTFTYDDDTIRV
KKIIINNKYYFCYKSTLTEMVEKKNFISRAHKKGAYDEIKLLERENLFGL
IIFECNYDLDLKDIYVAYKKRWEIELLFKQFKNVLEQNEANVQGNYRLLA
TEFINFLSSIMLCRIKNHLLNSSVLDNRTISETFRYLSKIIKKRKSRKRE
EWDDVETLKYIKEMKSILKI
>MSC_0230 tnp ISMmy1b, Transposase ISMmy1B
MVIPKDILKIPRPSSTRVKTTSKEGIYNVIQRTSIRKNGKIIPVEKGVIG
KIINGVFQSIEKQTYEVDIKSYGLFALNEKLNNHIFRELLNFYDFEDARK
LYVIASLRTMFSDIKNEHLKHEYDTNFISEIYPKCALSSNTISSFLEKIG
KSSSKMEDFMNKRLEEFSNHSIVIDGMLKNNTSETNIFSEMSRKSRTKGS
QNLNLIYAYDINAQEPVASSVYPGNMLDYTAFRDFLRTYEIKNGFLILDK
GFDDKECKNLMREKNIKYLIPIKINHTFKKFNLKSGFNFTFTYDDDTIRA
KKIIINNKYYFCYKSTLTEMVEKKNFISRAHKKGAYDEIKLLERENLFGL
IIFECNYDLDLKDIYVAYKKRWEIELLFKQFKNVLEQNEANVQGNYRLLA
TEFINFLSLIMLCRIKNHLLNSGVLDNRTISETFRYLSKIIKKRKSRKRE
EWDDVETLKYIKEMKSILKI
>MSC_0808 tnp ISMmy1c, TRANSPOSASE ISMmy1C
MVIPKDILKIPRPSSTRVKTTSKEGIYNVIQRTSIRKNGKIIPVEKGVIG
KIINGVFQSIEKQTYEVDIKSYGLFALNEKLNNHIFRELLNFYDFEDARK
LYVIASLRTMFSDIKNEHLKHEYDTNFISEIYPKCALSSNTISSFLEKIG
KSSSKMEDFMNKRLEEFSNHSIVIDGMLKNNTSETNIFSEMSRKSRTKGS
QNLNLIYAYDINAQEPVASSVYPGNMLDYTAFRDFLRTYEIKNGFLILDK
GFDDKEYKNLMREKNIKYLIPIKINHTFKKFNLKSGFNFTFTYDDDTIRA
KKIIINNKYYFCYKSTLTEMVEKKNFISRAHKKGAYDEIKLLERENLFGL
IIFECNYDLDLKDIYVAYKKRWEIELLFKQFKNVLEQNETNVQGNYRLLA
TEFINFLSSIMLCRIKNHLLNSGVLDNRTISETFRYLSKIIKKRKSRKRE
EWDDVETLKYIKEMKSILKI
>MSC_0781 tnp ISMmy1d, Transposase ISMmy1D
MFSDIKNEHLKHEYDTNFISEIYPKCALSSNTISSFLEKIGKSSSKMEDF
MNKRLEEFSNHSIVIDGMLKNNASETNIFSEMSRKSRTKGSQNLNLIYAY
DINAQEPVASSVYPGNMLDYTAFRDFLRTYEIKNGFLILDKGFDDKKCKN
LMREKNIKYLIPIKINHTFKKFNLKSGFNFTFTYDDDTIRAKKIIINNKY
YFCYKSTLTEMVEKKNFISRAHKKGAYDEIKLLERENLFGLIIFECNYDL
DLKDIYVAYKKRWEIELLFKQFKNVLEQNEANVQGNYRLLATEFINFLSS
IMLCRIKNHLLNSGVLDNRTISETFRYLSKIIKKRKSRKREEWDDVETLK
YIKEMKSILKI
>MSC_0123 tnp ISMmy1e, transposase ISMmy1E
MAIPKDILKIPRPSSTRVKTTSKEGIYNVIQRTSIRKNGKIIPVEKGVIG
KIINGVFQSIEKQTYEVDIKSYGLFALNEKLNNHIFRELLNFYDFEDARK
LYVIASLRTMFSDIKNEHLKHEYDTNFISEIYPKCALSSNTISSFLEKIG
KSSSKMEDFMNKRLEEFSNHSIVIDGMLKNNTSETNIFSEMSRKSRTKGS
QNLNLIYAYDINAQEPVASSVYPGNMLDYTAFRDFLRTYEIKNGFLILDK
GFDDKECKNLMREKNIKYLIPIKINHTFKKFNLKSGFNFTFTYDDDTIRA
KKIIINNKYYFCYKSTLTEMVEKKNFISRAHKKGAYDEIKLLERENLFGL
IIFECNYDLDLKDIYVAYKKRWEIELLFKQFKNVLEQNEANVQGNYRLLA
TKFINFLSSIMLCRIKNHLLNSSVLDNRTISETFRYLSKIIKKRKSRKRE
EWDDVETLKYIKEMKSILKI
>MSC_0571 tnp ISMmy1f, Transposase ISMmy1F
MAIPKDILKIPRPSSTRVKTTSKEGIYNVIQRTSIRKNGKIIPVEKGVIG
KIINGVFQSIEKQTYEVDIKSYGLFALNEKLNNHIFRELLNFYDFEDARK
LYVIASLRTMFSDIKNEHLKHEYDTNFISEIYPKCALSSNTISSFLEKIG
KSSSKMEDFMNKRLEEFSNHSIVIDGMLKNNTSETNIFSEMSRKSRTKGS
QNLNLIYAYDINAQEPVASSVYPGNMLDYTAFRDFLRTYEIKNGFLILDK
GFDDKECKNLMREKNIKYLIPIKINHTFKKFNLKSGFNFTFTYDDDTIRA
KKIIINNKYYFCYKSTLTEMVEKKNFISRAHKKGAYDEIKLLERENLFGL
IIFECNYDLDLKDIYVAYKKRWEIELLFKQFKNVLEQNEANVQGNYRLLA
TEFINFLSSIMLCRIKNHLLNSGVLDNRTISETFRYLSKIIKKRKSRKRE
EWDDVETLKYIKEMKSILKI
>MSC_0796 tnp ISMmy1g, TRANSPOSASE ISMmy1G
MAIPKDILKIPRLSSTRVKTTSKEGIYNVIQRTSIRKNGKIIPVEKGVIG
KIIPVEKGVIGKIINGVFQSIEKQTYEVDIKSYGLFALNEKLNNHIFREL
LNFYDFEDARKLYVIASLRTMFSDIKNEHLKHEYDTNFISEIYPKCALSS
NTISSFLEKIGKSSSKMEDFMNKRLEEFSNHSIVIDGMLKNNASETNIFS
EMSRKSRTKGSQNLNLIYAYDINAQEPVASSVYPGNMLDYTAFRDFLRTY
EIKNGFLILDKGFDDKECKNLMREKNIKYLIPIKINHTFKKFNLKSGFNF
TFTYDDDTIRAKKIIINNKYYFCYKSTLTEMVEKKNFISRAHKKGAYDEI
KLLERENLFGLIIFECNYDLDLKDIYVAYKKRWEIELLFKQFKNVLEQNE
VNVQGNYRLLATEFINFLSSIMLCRIKNHLLNSGVLDNRTISETFRYLSK
IIKKRKSRKREEWDDVETLKYIKEMKSILKI
>MSC_1056 tnp ISMmy1i, Transposase ISMmy1I
MAIPKDILKISRPSSTRVKTTSKEGIYNVIQRTSIRKNGKIIPVEKGVIG
KIINGVFQSIEKQTYEVDIKSYGLFALNEKLNNHIFRELLNFYDFEDARK
LYVIASLRTMFSDIKNEHLKHEYDTNFISEIYPKCALSSNTISSFLEKIG
KSSSKMEDFMNKRLEEFSNHSIVIDGMLKNNTSETNIFSEMSRKSRTKDS
QNLNLIYAYDINAQEPVASSVYPGNMLDYTAFRDFLRTYEIKNGFLILDK
GFDDKECKNLMREKNIKYLIPIKINHTFKKFNLKSGFNFTFTYDDDTIRV
KKIIINNKYYFCYKSTLTEMVEKKNFISRAHKKGAYDEIKLLERENLFGL
IIFECNYDLDLKDIYVAYKKRWEIELLFKQFKNVLEQNEANVQGNYRLLA
TEFINFLSSIMLCRIKNHLLNSSVLDNRTISETFRYLSKIIKKRKSRKRE
EWDDVETLKYIKEMKSILKI
>MSC_0070 tnp is1634ba, Transposase IS1634BA
MKKSRNDVKRQWRTSIARVKKGEYLSIGVPRSDNKGFVYRLGYGYLHELK
QYHDGPLAIIKAIIANFPLSWTKEQARTKLDEIFKEKKETKKEVLERFKG
YEVVEKLFDYFNIFNDCSPTKSTTLKDVVLQLIYQRIKNPISVFNTYKTA
KKEKIDTHSKNSFYRSLDYIAKNKDEILRNLNTKICANTNRKIDVLWFDA
TTTYFETFSREGYKKPGYSKDGKFKEDQIVIGMATDENGIPLHYKIFPGN
VADPNTLIPFMLEIADIYEVNSVTIIADKGMSVNRNIRFLESKNWKYIIS
YRMKAGSKQFKEYILDEKDYINDGGLIYKTRDIASSYNKKRINGHFRRQI
ISFSQKRATKDKNDRDILIQNFTKKMNKDNLVSCDDLAGSKKYRFFKPIN
KGAFYELDIEKIQEDQKYDGYYVYETNRTDLSVKEVINLYSKQWQIESNF
KTLKGKLSGTVPNVFINLKPYCWLHLFMFHFISVFKLHHLHSKFKIRTDW
KKQNHWA
>MSC_0662 tnpA IS1296ab_b, Transposase IS1296AB_B (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEAFGI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDKVAKKFNIIYSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEQQIKELKE
QVEYYKLEAEFWKKFHTLLTKEKSTRKKQK
>MSC_0898 tnpA IS1296ac_r, Transposase IS1296AC_R (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEAFGI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEQQIKELKE
QVEYYKLEAEFWKKFHTLLTKEKSTRKKQK
>MSC_0905 tnpA IS1296ds, Transposase IS1296DS (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEAFGI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPTKIKKSQVKINNEQQIKELKE
QVEYYKLETEFWKKFHTLLTKEKSTRKKQK
>MSC_0141 tnpA IS1296eh, Transposase IS1296EH (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEVFGI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEQQIKELKE
QVEYYKLEAEFWKKFHTLLTKEKSTRKKQK
>MSC_0238 tnpA IS1296fj, Transposase IS1296FJ (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEAFGI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEKQIKELKE
QVEYYKLEAEF
>MSC_0851 tnpA IS1296gz, Transposase IS1296GZ (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEAFGI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEQQIKELKE
QVEYYKLEAEFWKKFHTLLTKEKSTRKKQK
>MSC_0056 tnpA IS1296ie, Transposase IS1296IE (ORFA)
MGFLYVQIRFRKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFE
AFGIEGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTI
AGWVKKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEQQIK
ELKEQVEYYKLEAEFWKKFHTLLTKEKSTRKKQK
>MSC_0630 tnpA IS1296ll, Transposase IS1296LL (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVKSLVNRFEAFGI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEQQIKELKE
QVEYYKLEAEFWKKFHTLLTKEKSTRKKQK
>MSC_0138 tnpA IS1296mp, Transposase IS1296MP (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEAFRI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDKVAKKFNIIFSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEQQIKELKE
QVEYYKLKAEFWKKFHTLLTKEKSTRKKQK
>MSC_0232 tnpA IS1296px, Transposase IS1296PX (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVKSLVNRFEAFGI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEQQIKELKE
QVEYYKLEAEFWKKFHTLLTKEKSTRKKQK
>MSC_0674 tnpA IS1296qt, Transposase IS1296QT (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEAFGI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEQQIKELKE
QVEYYKLEAEFWKKFHTLLTKEKSTRKKQK
>MSC_0016 tnpA IS1296sq, Transposase IS1296SQ (ORFA)
MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEAFRI
EGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWV
KKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQVKINNEQQIKELKE
QVEYYKLEAEFWKKFHTLLTKEKSTRKKQK
>MSC_0173 tnpA IS1296uk, Transposase IS1296UK (ORFA; extended into IS1634BP
MRSKIYISMFSCFMLKLGNVGKKLKIVKEAKKLNIKKSTYLANKYDISVD
TIESLVNRFEAFGIEGLINKEKKPYYSAKLKLKIVLYKLETNHSYDEVTK
KFNIIYSSTIAGWVKKYREYGFLGLNNNIGRPKKIMKNPNKKPAKIKKSQ
VKINNEQQIKELKEQVEYYKLEAEFWKKFHTLLTKEKSTRKKQK
>MSC_0663 tnpB IS1296ab_b, Transposase IS1296AB_B (ORFB)
MLKTHKKVKIPILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGLLIHSDQGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNNIRIKEKLKYEKFCIKELETHIL
>MSC_0899 tnpB IS1296ac_r, Transposase IS1296AC_R (ORFB)
MLKTHKKVKIPILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGLLIHSDQGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNNIRIKEKLKGLSPVQFRKQSCYNIEKF
>MSC_0904 tnpB IS1296ds, Transposase IS1296DS (ORFB)
MLKTHKKVKIPILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGLLIHSDPGFNYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNNIRIKEKLKGLSPVQFRKQSCYNIEKF
>MSC_0239 tnpB IS1296fj, Transposase IS1296FJ (ORFB)
MLKTHKKVKISILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGLLIHSDQGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNNIRIKEKLKGLSPVQFRKQSCYNIEKF
>MSC_0844 tnpB IS1296hv, Transposase IS1296HV (ORFB)
MLKTHKKVKIPILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVAEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGLLIHSDPGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNNIRIKEKLKGLSPVQFRKQSCYNIEKF
>MSC_0055 tnpB IS1296ie, transposase IS1296IE (ORFB)
MLKTHKKVKIPILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGLLIHSDQGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFDLLKQEIYYGEKYNSVEELTKR
IHKCIYWYNNIRIKEKLKGLSPVQFRKQSCYNIEKF
>MSC_0211 tnpB IS1296ji, Transposase IS1296JI (ORFB)
MLKTHKKVKIPILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGLLIHSDQGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNDIRIKEKLKGLSPVQFRKQSCYNIEKF
>MSC_0629 tnpB IS1296ll, Transposase IS1296LL (ORFB; extended)
MLKTHKKVKIPILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGFLIHSDQGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNNIRIKEKLKGLSPVQFRKQSCYNIENFSPCLW
>MSC_0137 tnpB IS1296mp, Transposase IS1296MP (ORFB)
MLKTHKKVKIPILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGLLIHSDQGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNNIRIKEKLKGLSPVQFRKQSCYNIEKF
>MSC_0246 tnpB IS1296od, Transposase IS1296OD (ORFB)
MLKTHKKVKIPILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKIALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGLLIHSDQGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNNIRIKEKLKGLSPVQFRKQSCYNIEKF
>MSC_0015 tnpB IS1296sq, transposase IS1296SQ (ORFB)
MLKTHKKVKIPILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGHRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKNVKNTNGLLIHSDQGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNNIRIKEKLKGLSPVQFRKQSCYNIEKF
>MSC_0174 tnpB IS1296uk, Transposase IS1296UK (ORFB)
MLKTHKKVKISILLKIAKLPKSSFYEWKHKLENTIDKDKELKEMIVDIFS
KSFETYGYRRLKMALKSKGYIVNHKKILRLTKELGVQCIKFRTKNGRYSS
YKGTVGKIADNVLKRNFHSLQANKLWCTDVTEFKVNGQKLYLSPIIDLYN
DEIISYSIQTNPNLNLTNSMLDKALKKVKNTNGLLIHSDQGFHYQHISWA
KKLEENNITQSMSRKGNCLDNAIIENFFGLLKQEIYYGEKYNSVEELTKR
IHKYIYWYNNIRIKEKLKGLSPVQFRKQSCYNIEKF
>MSC_0931 topA, DNA topoisomerase I
MKVLVLLESPSKIEKIKHYLEEGFPENQFVVLASGGHINSIADKGAWGLG
IDLETMQPNFVIESSRKKIISQIKKEGKTADLIILASDPDREGEAIAYHL
ANLFKDHTNIKRITFNEITSEAITNAFNNLRDVNMNLVNAQISRQILDKI
IGYLVSKSLQKSTGLMSAGRVQTPALNILTTRDMLIKNFKEVLYKKIFVI
ESKRAINLNLNKDKNNVLVNTEKTYYIDEKQAKIIVDELGEVYRCTDYKS
TAYETRSFKPYSTAGLLQDGFTKLKLSTSQITLAAQKLYELGYITYIRTD
SVKYSSQFINEVKDYISKNYSSDLFKDPIVGKKDQNSQEAHESIRPTNIW
LTPEKASLEIEDNLLKRVYNLIWWNSIKSLMKGPSGFNHRWTFNNNGYEF
KQSWQEVKDLGYQAIKHSSSDENIELTNDGEEIVQSKNDKPEYQFNDDFE
INISKKFIKIEDAKTNPPKMFNQASLIKELKNLGIGRPSTYNPILTKLKD
REYVEYPKSKPIVVTNKGYSANEYLYDHYLDFFNLNYTAEMEEKLDEITK
GDFDYVNWLKDIYTALNFKVKKEIGEAKTEAICPRCGANLVYIKSRFNRG
RGCSNFTITKCGYREYEQPDGTWKEYVKEEKTTENNTKKENKE
>MSC_0496 ung, uracil-DNA glycosylase
MTNLHPSWNKLFTDLNLSDQINNLINKAYSSNDIVFPKQKDVLNLFKLSD
LNNIKVVIIGQDPYHDFNQANGIAFSSNALKTPPSLKNIFNELYSDLNID
HFNNNDLTNWVKQGVFLINTCWTVIAHKPNSHNNLGWQKITKKILEQIIL
HNQNVIFCLWGNFAIKMYDSLLVKSNFVIKSAHPSPLSYKGFKNTKPFSQ
INNLLINLNLDPINWSL
>MSC_0943 uvrA, excinuclease ABC subunit A
MSTDKIIIKGAREHNLKNIDLELPKNKLIIFTGLSGSGKSSLAFSAIYQE
GRRRYIESLSAYARQFLGGNEKPDVDSIEGLSPAISIDQKTTSHNPRSTV
GTVTEIYDYLRLLYARIGQPYCINNHGQIKAVSIKEIVENIKQSTSDGEQ
IHILSPVIRDKKGTHIDILEKLRNDGFIRVIVDDQLRMLDDQINLEKNQR
HNIDIVVDRIIYHNNDEINSRIFTAVEMGLKYSNNLIKIAFPNSNKQEKL
FSTSFSCKVCDFVVPELEPRLFSFNAPLGACELCNGLGVSLEPDINLILP
DLKLSINQGGVVYYKNFMHTKNIEWQKFRILCDYYYIDLNTPLKDLTQKQ
RDIILWGSDREIDIKIVTENNNKYEKYDFIEGNAALIKRRYFESKSEEAR
KWYAKFMSSKICKQCKGSRLNDIALSVKINEKSIFDYTNMSISEQLDFLL
NIDLTATQAMIAKLVLDEIISRTNFLNEVGLGYLNLSRTATTLSGGESQR
IRLAKQIGSQLTGILYVLDEPSIGLHQKDNDKLIKTLKHLRDLGNTLIVV
EHDEDTMKSSDWIVDIGPRAGEYGGEITFSGTYQDILKSDTITGRYLSRK
EGIAVPKTRRGGNGKKIEIIGASENNLKNINVTIPLNKFITITGVSGSGK
STLLEDIVYKGIHNNLSKEYLPIGKVKEIKGIENINKAIYISQEPIGKTP
RSNPATYTSVFDDIRDLFTNLPEAKIRGYKKGRFSFNVHGGRCEHCQGDG
VITISMQFMPSVEVVCEICDGKRYNDETLTVKYKNKSIADVLNMSVSEAY
VFFENIPQIKQKLETILEVGLGYIKLGQNATTLSGGESQRIKLSTYLLKK
QTGNTMFLLDEPTTGLHVDDVKRLIGVLNKLVDLGNTVLCIEHNLDFIKV
SDHIIDLGPDGGEYGGQVIVTGTPEQVIHHQTSYTAKYLKDYIIND
>MSC_0944 uvrB, Excinuclease ABC subunit B
MLWEKVMFIANNKYKLVTKYKPSGDQNQAIEKLNKAIIENKKHQVLLGAT
GTGKTFTIANIIAKHNKQALVIAHNKTLAMQLYYELKEMFPENRVEYFVS
NFDFFQPEAYIPSKDLYIDKDSRQNMELDMMRLSACNALLTRNDTIVVAS
VAALFALQNPLEYSSAFIELKVGQKIKRNELLTWLVRSGYTRNDIENQLG
SFSAKGDVVKIVPGWVNNIMFRISLFDDEIESIHTLNTITNSILDNITTV
TIHPAQSYITPQDKLKTICNNIRNELVQRLAELQSENKLLEAQRLEQRTK
YDLESLEEFGFCSGIENYSSHLDFRSKGQRPYVLLDYFNNDFITIVDESH
ITLPQIRGMYNTDRSRKLTLVEYGFRLPSALDNRPLNFDEFNSLIKQVIY
TSATPGDYELDLVNHQVVQQIIRPTGLLDPQIEIRKTTNQIDDIINEIHL
RKLQNERVFITTLTIRMSEDLTAFLQEKNIKVAYLHSELKTLERSEILND
LRKGVYDVVVGVNLLREGLDLPEVSLVCILDADKQGFLRNYRSLIQTIGR
VARNVNGKAIMYADTVSQAMDEAIKETNRRRKIQEEFNKKHNIVPKTISK
AISESILSEQTKKTLAKAKKIKDKKQKLQTIQQTIDNLRQEMLQAAKELD
FERAAILRDTIIELENEKNTN
>MSC_0297 uvrC, excinuclease ABC subunit C
MSLKQQVDLLPNKPGCYLFFNKDNDVIYVGKAKNLKKRVSTYFNKAYNIK
TTRLVREITDLKYFIVDNEKESLLLEKNLIKKYHPKYNVLLNDDKTYPYI
IITNQKDPMYKYVRKYDKKALKNYGPLPIGSNARSILLTLQRLFPLRMCQ
GNLNKPCLYYHLNQCSGACFKQVDPSYYEYQIKQVDKFFKGEINQVKQTL
VKQMQKASDNLQFEQAQRIKDQITSLDFITAKQNVDIVTNKNIDVVNYEI
NQDKICFVILFYRLGQLTYKDEYIQNYEGQNLSELFNSYLQQIYQKNIYP
DVLLIPNEIELLDLDQNLLEFSSYSLNKQDDVFIKLAKQNAIDSLNKSVI
SHNVNSGDEIEILEQLKQISNASKYLKRIEVFDISNIYSQFITGACIVYI
NAKPIRNEFRKYNIDPSYTSDFSRMKFMLEKRFLKQIKEKEQLPDLVIVD
GGIIQIHAAKEVLNKLNLKIDVIGLSKDDHHKTRYLIDIFEQTIDIKNFK
KLYNFLTSLQIRVDEYAKSGFRKKYHNQLNDQILLIKGVGKKTNLKLYKH
FKTIDNIKNASFDELNKVINNKKITNLIISNLNK
>MSC_0101 xseA, exodeoxyribonuclease VII large subunit
MEKILTVQELNEALKTLIENKQEFKDIYVQGELSNLTFNKSGHIYFSIKE
QDAAINCMMWKTNAYKIQSLNLEDGMQIICYGRLTYYIPTGRVSFEVRDI
QIHGIGDLQKIFEQRYKELEQKGWFDPNLKKSIPEFVKNVGIITADSGAA
IYDLIRTVHRRLPLINIYLFPAQVQGDKAEIDITNKIKQANNFKIDLDVL
IVGRGGGSYEDLWAFNELEVLQAIKNSHIPIISAVGHEPDWVLSDYVADI
RAATPTAAGELVSKSIIEIKNQLKHYYQNYKTLILNKLDFFNEKINNYKK
DQTKYIKDNFSFKYLQLKQLSIDNTKWTKNKIDSVIYKLEDYKHSINNSI
IHIINSQNKALKNYLIADEQKILNYLKKQISEFNYTISSFKGHINQILKY
EELSFDTLENKLNSLDPLKPLQNGYSIVTNLNHQKIRSYKQVKLNEDLKV
ILTDSKLTVTIKEVKTNEQ
>MSC_0100 xseB, Exodeoxyribonuclease VII small subunit
MNNKNKSYDELISEIKEDTKKLSSNEISVEQAMEIFEQNIEKIKLAKEKL
TQYKGQIYKVMQDDELEEFKD