Gene list
Applied filters:
COG category: Function unknown
Organism: Moorella thermoacetica ATCC 39073
Gene type: CDS
Number of genes found: 203
# Moorella thermoacetica ATCC 39073, ATCC 39073 >Moth_0215 Polysaccharide pyruvyl transferase MARVVISGYYGFQNAGDEAVLYSIVKALRSLEPDIEITVLSRRPEQTAAC LKVRAVDRWHPVRVAGAIRRADLVISGGGSLFQDVTGPKSLLYYLGIVLL ARLLRKPVIVYAQGLGPLKRHWSRWLTGRVLNRVQLISLRDSESRRLLEE LGVTRPPVYVTADPVLGLEPENMDLRPGQDKWEQLELSGPVIGISVRSWP GYEECWPSLARVADELVAGGWQVLFLPFHFPADVDACRQVARLMHSPAVV LRENLDLPALMGLMGRLQFLIGMRLHALILASLMGVPFLALPYDPKVTAL ARMMEQPVAGFLASVSYTGLEAAVKQALAEREENARRVQAAVAELRPLAL DTARLVIEYLRKGARG >Moth_2061 Protein of unknown function DUF820 MGVSLAELAVGRERYTYEDYCRLPEGSPYQLIGGELVMTPSPTPYHQMVS MKLELKMAGFVLDKGLGIVLHAPLDVYLDDTETYQPDIIFISNEKLPIID EKRINGAPDLVAEILSPSTGYYDLRSKYKVYEKKGVREYWIVDPQHKSVQ VFCRQEGKFVLDQEAEQQGTVKSRVIVGFEVQVESIF >Moth_1866 Protein of unknown function DUF169 MADLDKSLMVKRLTEVLHLDTGIVGIKLYRDRKDLPRRPYNWKVNICQLV SAARYQGKASSGTPDLMVCAIGAACTGLIKTPERFTSGKAALGRYVADVE AGRRFMANTYKLGDNGKVYDAIYIAPLESYKTEDPDAVVIYANPAQMMRL IHCCLHETGEPVKADTVAEAALCSSIGFAVKEQKPIIGFPCGGDRTFGGT QKDELVFVTPYNMLGTLVENMESLLMSTGQLYPVAPFNNFTPVMVQSYTM QPEDLEEK >Moth_1377 Protein of unknown function DUF711 MPSLFSFTPEEILETIRMIQVENLDIRTITMGISLRDCATASLEETCRRV YDKITRLASDLVAVGEEVAATYGIPIVNKRIAVTPIAQVGEPSGEDDLTP LARALDRAAEAVGVNFIGGFSALVHKGFTHGDRALFNSLPEALATTERVC ASVNVATTKAGMNMDAITWLGHLIKETARLTASRGGLGCAKLVTFCNAPE DNPFMAGAFHGPGEPECVINVGISGPGAVLAAIRQYPEADLGQLATIIKN TAFKVTRMGELVGREVSKRLGVPFGIVDLSLAPTNAQGDSVAEILEAIGL ERCGAHGTTAALALLNDAVKKGGAMASSYVGGLSGAFIPVSEDNGMIRAV ESGALSLEKLEAMTAVCSVGLDMFAVPGDTPAEVIAAIIADEAAIGMINN KTTAVRVIPAPGKKPGETVEFGGLLGRAPVMEVNTYSPAALVRRGGRIPA PLQALGN >Moth_2247 YbbR-like MLEHFRQNWGYRLMAVILAIILWMYVTGEQNPTGETVVRVPLETENLSSG LVVADRPAEVQVRVEGRKAAVANLLPRDVHAYADLRDAKVGDNVLPVRVD VPEGINVIHVNPAQVTIRVEKIEDIQLPVQVSLLGSPASGYRALEPVLKP SQVIISGPAAALKEIGRVYVEAKIDQASGNFLAQLPVKIADREGRPMQTW LTVNPDTVETFIPVVQDMPSKMLPVRPRLTGEPAKGYAIQRVILQPEVVE AFAPYSQLAALDYLNTAPINIAGAKKNVTVETNLEIPSGVQLSSFPRVRV VVEIGPAVAGAAGSGP >Moth_2133 MOSC MGRIVAVCTSANKGERKKNIGRGMLIANYGLKGDAHAGPWHRQVSLLAME SIAKMQAKGLKVGPGDFAENLTTEGIDLVSLPVGTRLKIGPSVLAEVTQI 
GKQCHSRCAIYQQAGDCVMPREGIFVAILTGGPVQVGDHIEVVAS >Moth_1739 conserved hypothetical protein MVAPIEPLYNKHYRCLFCDREFTNKKLRLSRIRQVKRDSDLCAYFEGENP YFYEVAVCPHCGYAFTTGFGTVKKERREVITREYINKITHKDYTGPRTLG DALKVYKLALLCGDLNQERKSVLAGICLHIAWFYRYSQEEEAEKKYLRNA YDLYQEAYQKENGTGREGNPNLILYLIGELEGRLGNYAEACRWLGRLLNV RNLEPYLHDLLLERWEVYRERLKETAPAGAP >Moth_0057 Protein of unknown function DUF1021 MNGKEVLATIREDLEARVGQKVKLRANRGRKKILERTGVLEKTYPNIFVI RLEEQKSPERRISFSYTDVLTNTVELMVEGDYGDKKLGAKP >Moth_1233 transporter MITAKIARGAVIAALYAVVTIILKPISFGYLQVRVAEALTLLPILYPEAV PGLFIGCLISNIYGGLGPIDIFLGSLTTLAAAWLTYVWRRSWIAYLPPIL LNGVIVGAYLSYLLHVHILLAMGSVATGEAIAVLALGIPLLKQIKKINVG Q >Moth_2497 Protein of unknown function DUF77 MAILEVVIAPLGTGSTSISPYVADVHKVLKETSGIKYQLTPMGTIIEGEP DVLFPLLQRLHEVPFARGARRSMTIIRIDDRRDKELTMEGKLKSVEEKLA LASS >Moth_1576 HEPN MNSREREQEALKWQDRARRDLRVAKMLFYDKEPEFDLACYLSQQCAEKSL KALLIRLGIRFAYKHDLDYLVGLLPPEDQEKFYNTRLEWLSGWVTEGRYP GDAAGATREDAQRAIEIAEAIYNNVSKALRSE >Moth_1854 Glycoside hydrolase, family 57 MVTRSGDNLGYLSLVLHAHLPFVHNREPYVSLEEKWLFEALTESYLPLIL SWEELAGEGLDFHLTLSLSPPLISMLMEPALGERYGRYLDNLRELAGREI ERTRGDPTFAPLAEFYHRRLTLVDRAFKETYRGNLLAPIKRLREQGRLEL ITTAATHGYLPLMLTDEARRAQVRAALDLFGMTMGFVPDGLWLPECGYTP GIEKILRSEGIKYFIVASHGMLNATPVVKSAVYAPVRVGGVAVFGRDWET SHQVWSRTEGYPGDPVYREFYRDIGYDLDFNYLAPYLVGGIRGDTGFKYY RITGKTGVKEPYDYRAARERAREHARDFIANREKQLAYWAGRTQDKPVVV APYDAELFGHWWFEGPDWLADVLRLAGESRVSLTSLSAYLEQYPPRQEVT MGPSSWGEGGYNHVWLNQANDWLYLHLHRAERAMIKLAAANPRPGSLQER ALNQAARELLLAQSSDWSFILTTGTTVDYARRRLREHLGAFFKLCQDYER DRLDEDFLARLEAADNIFPGLDFRLYRPAGRGVACRPEVHNKTRPGILML SWEFPPRHVGGLGIHVRDWARPWPARGWMSTS >Moth_1217 Protein of unknown function DUF205 MLSWTVILTGIFMAYAVGSLAGGHFLSKILYNADVRQVGSGNAGTMNVLR NLGIAAGIMTFIWDTAKGFLVVTLGLKGGGAELGVLMALAAVAGHNWPLY WRFQGGKGLATSLGVALAVYPAAVPPGAALMGLLTFLTRNTDLATLLTFS ALPIYFWWREGPGCYLAFGLGLAAIMLLRHGPLVISLFYNLKERR >Moth_2522 Protein of unknown function DUF37 MIDNWVILAIRFYQRYISPLLGRHCRFYPTCSQYALEAITKYGLLRGGLL ATRRLLHCHPWDAGGYDPVP >Moth_0719 DedA 
MRGGTGVSSLLAPLFEFITSAIASFGYPGIAVAMALESACIPLPSEVILP FGGYLVSTGSLGFWGTVLAGTIGGTIGSIVAYFVGLRGGRPFLRKYGHYV FFSEKEFAVAEKWFNRYGEATVFFTRLMPVIRTFISLPAGIAAMPFGRFV VYTFLGSLPWSILLVFVGRQLGANWEALSPIFHRFDLIIVAGLILLVFFY WRRHRSRR >Moth_2306 Protein of unknown function DUF162 MKSQELITSFTSQAEAMGARVIQAARPGEIGAKLVEVLRPLGSKIALVDS PLVKTAGVEAALAGAGFNVEKDGPEFARQADTGIVEFEYGIAETGTLAMD ATDLKTRLAAMLPLTCVALLAAERVRANLTEVIDAYLERGPWPGYFTLVT GPSRTADIERSLTVGVHGPERLFIILVGENGGASRGR >Moth_0483 Protein of unknown function UPF0150 MASHCFLVVIEQDEDGKYIASVPSLPGCHTQADNLAELEERVQEAIKLYL NENSDFIPAPDKFIGIHQVEVQA >Moth_0597 Protein of unknown function DUF502 MRRLRRFFLTGIIVTMPAAATIYALWLVFSFLDQLAGQAVGLFLGRRVPG LGLALTLAVVLIAGFLATNFIGRFFLNLWDEVMYRIPLVNSIYRTVKQLV EAIWRDDKKAFQHVVMVEYPRRGIYSLGFLTGPAPAEASMRAASDLVNVF VPTTPNPTSGFLLLVPREEVIPLEMPVEDGLKLIISAGVVGPAGRTASGG AGWGRLLQGLTANGTINNPGWRRSRNTAE >Moth_0555 conserved hypothetical protein MRLRIKYSKTGQMAFLGHLEMLRLWQRAMRRAGLPVAMSQGFNPHPRLAF GPALALGLESLAEYLDVELAADREPDRVQVELQAQLPPGLEILLVRTIPD QAPALTAVIDVAAYRVTWLKEVDPGLLQQRVESLLARQEVLVRRTGRDGR PRVKDIRPGILKLVLDPSPDLVMLLQCSQAGSVRPEEVLKALALETPARM VRTGLFARRGDDLLAPEDISG >Moth_1682 hypothetical protein MIDANIVYFSGVAVMQIFSLLAIFFALLVAIFAVQNAGPVEINFLAWQFS NISLVLVILGSAAFGALVVFLLGAVRQVRQAREIRELKSQHKRLQETIAR LELVAAGKGAGQQERKQEA >Moth_0077 conserved hypothetical protein MFYTSKKLMGMPVVSLADGHQLGRIKRLLIDHSKMAIAAFTVDRKGWFKE QPVVPYSHVKSVGSHAVTVDEASAVVKLSSLPELEALAKHPLPLLGARVI TEEGTVLGTVEDFRFDPQDGKIHYLDIKSGLLHGARSLETDQIITCGRDA LIARAGAEEALQKSGGLLSVKLQDACKSAGKTFDSAGTITRKVGETVNRY WQRLPFSQKKNDGPPPGENS >Moth_0526 Protein of unknown function DUF190 MTGTRAKRLTVYLGEGMKWQGMALYHALVLELKAAGLAGVTVVRGIEGYG KRKQLYASRLLELSADLPVLVEAVDSPEKINAVLPRVREMVQQGLITLAD VEIISSAGTASNKENSPGPGRRA >Moth_2295 Extradiol ring-cleavage dioxygenase, class III enzyme, subunit B MGRLLDVGFMPHPPIMVPEVGRGEVARIKATVAAARELAARVAAHQPEVI IIISPHGPVFRDAVGIWATPELAGDLAAFRAGEVRFKYSLDLDLSRAIAA KAREEGIAVAWLDARASSSYGLTPELDHGMMVPLYFLRQAGLEVPLVAMG MAFMEREKLYAFGAALARAVKDSPRRALLVASGDMSHRLLPGAPAGYDPR 
GKVFDARIRNLLAALDVEGILAIPEDLAEGAGECGLRSFIMGLGALDGYR VKGEVLSYEGPFGVGYLVAHLEPGEEAPERSLLARETAAAREESLPVRLA RQSLEHYLRTGKVLPVPAPLPPELAGRAGVFVSLKKNGQLRGCIGTISPT RENLAGEIIYNALAAGLEDPRFPPVTVDELPELQYSVDVLSEPEPATVAD LDPKVYGVIVSCGHRRGLLLPDLEGVDTVAEQVAIARQKGGIGPDEPYRL ERFKVTRYH >Moth_0733 Protein of unknown function UPF0029 MKKEGARRGLAAKSGTSIENGSSYLTVAREAIAELKIERSLFIGHACEVD SDAAARDFIARIQAEHRQATHNCFAYRLGIGKKEITYYSDAGEPGGTAGR PILGAITGLGLTNVAVVVTRYFGGKKLGVRGLIEAYGQAARRVLEEAGSI RRVVTRELELTCSYAELDRLLYQVRSRGGKVIETEYGSEVRLKVAVPLPA WEEMKESHRPSP >Moth_0437 pyridine nucleotide-disulphide oxidoreductase family protein MAEDKMAIICFSGDLDKALATFNLATGAAASGMEVTIFFTFWGINLLKKP RGRRGASSLLGRMFDWMMPSGPDRLPLSKFNMAGLGPFFMKKQMRSKKVQ AVSEFLALAREMGVKFVACIMSMQVMEIPEEDLIDGVEFGGVAAFLQEAA NAKISLFI >Moth_0169 Protein of unknown function DUF433 MNPLERITIDPSVCHGKACIKGTRIPVSVILDNLAEGISQEEILKSYPSL SLEDIKAAIAYGAMLAKERHIAL >Moth_1214 GatB/Yqey MAENMKEKLTRDMKEALKAHDKIRLQTIRMVLASIKNVEIDKQHPLSEEE IAGVIQKEIKMRQDSLEQFSRGGREDLVEQTRTELKVLEDYLPRQLTEAE LKEIIQQTIQETGATSKREMGKVMAALMPKIRGRADGRKANELVKEILAS >Moth_1704 Protein of unknown function DUF28 MSGHSKWANIKNRKAKVDEKRGRLFTKIGREIIIAARMGGGDPEGNMRLK AAIAKAKAANMPNENIQRAIMRGTGELEGAAYEEMTYEGYGPGGVAMLLN IATDNRNRTASEIRYIFSRGGGNLGESGCVAWMFNPKGVITVEVPAGDKR EEVILQAIEAGAEDVDDEDDEVLEIKTAPGDLEAVREALEASGVTITHAE VEMVPQTTVTIDDPETAGKVMRLIERLEDHDDVQAVYTNADIPAAIMDQL DI >Moth_1489 Ku MRPLWKGAISFGLVNVPVKLYPATESNDLKFNYLHTRCKTPIQYRKYCPY CQVEVPPEEIARGYEYEKGKYVILREEDLEAIPAEKTRSINIMDFVDLEE IDPIYFSRSYYLAPADMGQKPYLLLKKAMEETGKVAVARVTIRSRESLAT VRVYGPALVMSTMFYPREVRPVTGMPELDFQVNLRENEVKMAVTLIKSMA TSFQPEKYTDTYRQALLQVIEAKIAGEEVEVPARPEAGKVVDLMEALKAS IELARQEKEKVAADVEDRKPRRRRKTS >Moth_1003 flagellar biosynthesis MKERERIRKAVALRYNSEEDKAPRVVASGRGAIADKIITAAREAGVPIHR DTHLAEMLAGLDVGSEIPVELYQMVAEILVFIYSLDQSRGKSGK >Moth_0118 (2R)-phospho-3-sulfolactate synthase, ComA MQYKDQSAWRDVLEFPIGGRQQKPRRTGKTMVIDKGLGLTEFKDLLEVAA PYIDFIKLGFGTSVFYPADILREKIRLARSHAVDIFPGGTFFEVAVLQGR LSLYLQTARELGYTFIEISDGTIDLSRTLRYAALRQARAAGFGVITEVGK 
KDPRDALSDTHILSQIAMDLEAGADYVIVEGRESGQGVVIYDSRGVVKED TLAYLIEGIGDLDRIIWEAPQKQQQQVLIINLGANVNLGNVQPGDVLALE ALRVGLRGDTLRTTLVREGVS >Moth_0626 Protein of unknown function DUF34 MAAKCGEIIAIMEALAPPELAAGWDNVGLMLGSPEAEVRRVLVCLDVTPS VAAEAAARAVNLIISHHPLFFRPVKNLRFDEPVGELVRRLLQDNIMVYSA HTNMDSADLGVSYHLASRLELEDIRVLVPTHREKYYKLVTFVPEDHEKVV REALTRAGAGWIGNYSDCTFRVAGTGTFMPLAGTRPYTGEEGKLAEVKEY RLETIIPTGRLPEVLRALLKAHPYEEVAYDVYPLANEGPAQGIGRTGVLP QAVTLEEFALRVKESLGAGRVNLVGDRERKVKRVAVCGGAGSDVMAAARD AGAEVLVTGDLKYHEARTAQAMGLAVVDAGHFATERLIVPALVTYLQEQL QEREVMVLASQQEQEPWYAL >Moth_0877 pyruvate formate-lyase activating enzyme MLNRPLLAIDIGGGTQDILLYRPDQPLENCVQLILPSPTVICGRQVEAAT AARQDVFLRGHLMGGGALVGALRRHLAAGCRVYATPEAARTVYDDLERVR QLGIVITDQPPEDAVTIKTGDVDLATLAGSLAPYGVKLPAEVAIAVLDHG EAPQGMSDRVFRFQHWQRFVAGGGRLEDLLYREPPSYLTRMKAVREQAPG AWLMDTGAAALWGALEDARVAARSEEGLVIINCGNQHTIGVLLKGQRVLG LFEHHTSCLSGHKLAAFIEKLRAGKLTNEEIFNDGGHGCYIDPSYQPGDG FRFVAVTGPQRRLTIHQDYYWAAPYGDMMLSGCFGLIAAVAGVKINP >Moth_1476 Propeptide, PepSY amd peptidase M4 MNKKLLASLTTGVMLLGAATGIGVFANQPAKAAAASIPAIAQSAATVNPA PANGQQVQASQQQPAYNASIKVANSQNDNGTEVNETTEKQNEAAESQALQ ARAKITADAAKSAALKAVPGTVKKVALDNENGNLVYSVEIQAASGSIDVK VDAGNGQVLAQDQDSGQDNEKGAKGIDNDNIQVEQ >Moth_1358 conserved hypothetical protein MPEHPIEALMKTAMESIKDMVDVNTVVGDPIETQDGQVIVPVSRVTAGFA AGGSEYESSTADGGGGGKESLPFGGGSGAGVSVQPVGFLVVGKEQVRLLP VDGNAVVDRLIDVAPQVMEKIQELLGKGKSAKGNGKVNGSSTPTTVRTKM VLRPQDAEE >Moth_0992 conserved hypothetical protein MARGTKKIVFHKIPVRTHVVTDKDDAISLAKKYSAGIAAPGDVICLAESV VAITQRRAILPEEVRPGRLARFLCRFPGKDGSLATPPAMQLALDEVGPVR LLAGCAAAAFGRLIRKRGLFYIVAGRQLALIDDIAGTMYPYERHIVMGPK NPGKLVREIKKATGAEAVIADVNDKKCVDILGITDRRYIQAVTEALRDNP FGNEDEQTPIVILKRQ >Moth_1420 conserved hypothetical protein MVMAGVSYQPGTILRETIHSIRNILGDSLDDLTVERVVIGVFYTGVKLSN GQGGLCFTPIKAIPGAVCCPSSARAMPASGELRGRKATAFLEGMFADQAL RRALGIAVLNALSATCWQVRPPMNYTLKTGANALDQVIIPGEGQVVVVGA LAPFLKVLKRQDCRFTILELDPATLKKDELPFYRPPEDAPEVIPWADLLI ITGTTLINDTLEGLLSVVKPGAQVVVVGPTASMLPDAFFRRGVNLLGGTL VTKPDELLDVLAEAGSGYHFYGRAAEMMVLRLSDHGGTI 
>Moth_2223 transcriptional regulator, PadR-like family MNYTDREYWNGIIKMCLSKFFILRVLYTQPMHGYEIARTVAQVTRGCCTP TEGTIYPVLREFEEGGYVTSSLEIAGGRERKVYTLTPKGQEAFRVAVEAW KEVTGYILEAVKLEDYASTRRRCIMAGAKGILSNFSGLREGNCYSVRNEE IPIGQCRTAASSCSCGSSLPGSFDKEEGGEIGISPDTQASQRSLDIEFLY LDLDSCTRCGGTARNLEEALNEVAGVLQATGIQVNLHKIHVQSEEQALAL GFVSSPTIRINGRDIQLDVRESLCESCGEICGEDVDCRVWVYQGKEYTEA PKGMIIEAILKHVYGGGNESPAERETLQELPDNLKRFFAARRKKEEGGAG RKPAPEPQDSCCGISPLSKCCK >Moth_0170 hypothetical protein MPCELRGIKMLRFKIDENLPIEIADLLREAGYEAETVWSEQIQGFSDIEL LGICRNEKRVLVTLDMDFSDIRRYRPEDYQGIILLRIASQGKQSVINLFK KVIPHLRYQNLIGYLWIVQKDRIRIRGPER >Moth_1931 conserved hypothetical protein MTELLNNQDYRKEALKEIIRELHRGKSVEEVKARFNELIKDVAPAEISLM EQALINEGLPVEEVQRLCDVHAAVFKESLERAPQPETIPGHPVHTFKEEN RALEDLMIREIQPLLAELRRANPDVEKDLAIKLAEKLNLLQDVNKHYSRK ENLLFPYLEKYQIVGPPKVMWGVDDEIRDLLKEARDLAVNYVPDKKEELI TRTEAALAKIKEMIFKEERILFPMALETLTEDEWYRIMLDSASIGYCLIE PREDWRPAQVKLDQKETVASEETRGYIKFATGILTPREISLIFDHLPVDI TFVDKDNVVKYFSNTRERIFTRSRAVIGRRVENCHPPASVQVVEKLIADF KSGRKDREAFWLHLGDKYVFIQYFAVRDEKGDFAGTLEVTMDLKPLQAIS GEKRIMD >Moth_2349 hypothetical protein MEPKLRKILNRYLGEIEARLEGISCAARKEFITEIRSHLVEKWESSGEKT EESLLQVINDFGDPREIAEDYLTKVGGEARVRRTYPSTWLVLTLTALIWP VGIILAWVSPAWKFRDKIIATLIPIVIFLLLLTSSLPAVREVHKLTTQVQ LQPIETEVQR >Moth_1425 Protein of unknown function DUF364 MWEVYDELIAAVPPDLEVEDCIVGLNWILVRSRATGLVMTPLEGHPRIKL AGEIKGMPVRDLAEYIKSWNNFEAALGLAAINSVLNTPEQVESLCGRPLS SQPQMNAFTCFQEQVRGKKVTVIGHFPDLDPLARNCKLTILERRPREGDL PDPACEYILPEQDYVFITATTLINKTFPRLMELSRRARVILVGPSTPMTP VLFHYGIDTLAGTVVVEPKLIRRVVQEGGCLEIFKRGGRMVQVSRDEGLA VALSRQSREQAVQVMMGRRFSAIGAGQENFA >Moth_0167 Protein of unknown function UPF0150 MGRGAFTTFIQAAMKEAVIEQLEDGTFFAEIPPYPGVWADGKDEKECLAT LQEVLEEWLLFKLRDGDNDIPVLGGEDLNHKWQVKL >Moth_1898 Protein of unknown function DUF488 MFKLKRIYEEAAADDGLRVLVDRLWPRGMSKEKAEVDLWLKDIAPSPGLR QWFAHDPARWEEFRRRYEGELYLKGDLLAILRQKAKEETVTLLYAARDEN YNHVVVLKKFLENANKKPGKPGSR >Moth_2483 Ribonuclease III MDFVANLSPLALAYVGDAVYELLVRAHLVGRGPAKPEQLHREALKYVRAT 
AQARVVPALEEYLTPAEKDILRRGRNARPGHLPRTAAPAEYHSSTALESL LGYLYLQGDWARVEELAHLIFKLAEQD >Moth_2145 conserved hypothetical protein MFWVCRYRLFIVPLLVFFLVSGFGLAWGADETSTMQSPEVEWEKTLGKGI GYSVQQTSDGGYIIVGSTQFRGAGDVYLIKTDANGNKLWEKTFGGSGSDE GYSVQQTTDGGYIIAGSTHSYGGGDDDVYLIKTDANGNKLWEKVFKGEEL IEVKGGIARIKTLKGELIKEIDITKDWEKYWPETLGAELINEDTTNNYWL WKKTLGGKGRSVQQTADGGYIIAGYTNTYNVYLIKTDTNGDTLWERIFGS NYTEVYSVQQTTDGGYIIAGYIDPGSVGKGNVYLIKTDAKGNMVWEKTFG GSNWDKGYSVRQTTDGGYIIAGFTRSYGVGNDDVYLIKTDANGNKLWEKN LGGNYWEGGYSVQQTTDGGYIVAGVGDYSQIKTDGDGNLLWKKTLRGEGR SVQQTTDGGYIIAGYTFSRSTDSDVYLIKLKPETPPANQPPVVSLKDMQG HWAADAVDRLVETGVVSGYPDGTFRPDLEVTRAEIAAILVRALKLTPTNN QELKFKDDATIPTWAKDAVSIAVKEGLVKGYLQPDGTMTFEADRPVTRAE MAVLVARVLRKKLGEVTPMELKFTDAVMIPAWAKSDVGVAVAEGIVVGYP DNTFRAENHVTRAEAAVMILRLLRVLGRI >Moth_2411 conserved hypothetical protein MKKDVLVKVRGTQTNDLGEQDSIELITEGRFFIRDQHYYILYNETCLSGM EGTTTSLKVEPRRVTLNRMGTAEQKTTFETGILNYSFYVTPYGTMRISVL PSKVEVDLTERGGSINLEYELQVGQEKISNNQLEITIQHLENPV >Moth_1536 conserved hypothetical protein MATVHLVQVETANQPPGQAPRRDGDPLAGIEELLPPNIKAAVESLPAGIR DNLEEIRLRRERPLQVRWSGGEGWVAASGGLAAGPDGAYKVTAADLGRTI EALTRSSLYALEEELRSGYITISGGHRVGLVGEAVVLQGEIRTLKNFAGL NLRLARDIPGCARSLIPYLLEGGRPLHTLILSPPRCGKTTLLRDLIRLLS TGVPELKFSGVNVGVVDERSEIAGCWLGVPQLEVGPRTDVLDRCPKAAGM LMLLRSMGPEVIATDEIGRPEELAALQDVLHAGVTMLASVHAGSLEELQH RPGWGPLLKQGFWQRLVLLGRTLGPGTIEGVFSGDHRTLKRGPWRGEARP >Moth_1493 Protein of unknown function DUF1458 MHVKVVELVGESPNNWKDAVQKAVSEASRDISNISGVEVYNLTANVKDGK LSEFKANVKIAYADHSADL >Moth_0697 AIR synthase related protein-like MIGKVDDAFFRQAILPHTGAGDPEVVVGPRMGVDAAVLKIGEEYLAVAED PIFPGPTTSPDDFGWITVHIGASDVAVMGIKPRFMTYSLLLPPGTPEDYI AGLVRSISTYARELGITIVGGHTGFYGAVTIPTIGGITVWGRGREVVTPA GARVGDAVIITKGAAIEAAALVACELGEKLLAAGISPDLVARAKKRLREM SVVAEAGIAVEVGGVHAMHDATEGGLARGLWEVAEASGVGLRIERARVPV PADIRAVCDYIGLNPYEVISEGTLVLTCAPEKADAMLAAFKEAGIEAAVI GRVVPAGAGRAWLEDDGREEQLLPPAVDRFWEVFFNALALKNDTRTPAEV ALCRELGQAVRELEEANVAALIPEIGANLAYCLPEAKELRDIAAIPGRLL RFKGRVATLGEPEMGCSHHMGGTILVVREFFPQARCVINLRNNARVRQAC 
ADLGYKVVSMPVPPDYRQTDDDFYTDLRRTMAACRELPDVIEIPDRINLE RLILVLGRNPGEIVSKVTSLATRVAELE >Moth_1878 conserved hypothetical protein MSTRNPTMTEGNQGLGLIQGVGLTVILTLVARQLAMLPILKIMGSMVLAI LLGVAWRSLMDIPATAEVGINFASKKILRYGIILMGLRLDIPKIIAAGPQ VILLDILAILVSMVVIIFLGQRMGLNKKLAALIAAGTGICGAAAIAAIAP IVRSRDDETAVAVAIVALLGTLFTILYTLLYPVLNLTSFQYGLLSGSSLH ELAHVIAAAQAGGSASADIAILVKLGRVAFLVPVALVLGLIFARQNETGA GWHWRQLQVPWFILGFLVFSGINTMAILSTPLIAFLIQVGVFLLTVAMAG LGLNVSLEMIKKVGSRGLVTGLLGSVVLSLTIFLVIASLIN >Moth_2093 hypothetical protein MEEKDLIRSLNWFYSLEIEQVDLYKSQARAATDIYLRQVLTRVAAMEQEH VINLEAEISRRGATPTRLGAIIAPLLGVATGTILNWTNTRTLLWANITLE EKAMADYKRLILKVAEKTLFNLLWSHLIDEDLHAAWFSNKLKELDRLALH >Moth_1084 Stage V sporulation protein S MEVLKVAARSNPNSVAGALAGVLREKGGVEIQAVGAAALNQAIKAVAIAR GFVSPNGLDIVCIPAFADIQIEGQERTAMKLIVEPRKTQ >Moth_0749 Protein of unknown function DUF180 MQVMTSRFGTLEINPSDLLHFPQGIPAFEHLKEFFFYPIPENPAFTWLQA AADPEVAFLLVDPFLFFPGYAVDLPARLQEELAIKDPADALVYAVVTIPD GDIRRATANLVGPIIINPTVRLGMQLILEGTKYTTRHQLFKESFSDDESI PSSGGNEG >Moth_1742 conserved hypothetical protein MSYRVGIPRGLSYYYLFPFMQGFLTALGVEVVVSPPTDAVTLAAMNACPT DEPCVSVKLYFAHAKRLVESGVDYLFIPVLSSVEKDNYCCPKLIGAAAMI RNGLGLAPEQVLAPEWNEREKPGAWRENLLEIAARLGAGPERAAAAIRAG LEKQAACERMARQGLTLPLVYHRLCGTEKPRRQQFDPEARYDEGETIAVL GHPYLLYDGAGHQIVERLAEYARVITPEMVPPEDYRPEVATIFEGTRMWA YEARILGAGFSLLRKGQVTKMVLVEAFECGPASVIESYLEAEAERFGVPF LLLTVDEHTGEAGLVTRLEAFVDTSGSRPVNPAGKKQAFFFPASRGGELK VGAPGMGLLDIALEAVLQECRVEMVPTPPVTKRTVELGKELAPEFICYPL VTTLGQIREVLEQGANTIVMVGGKGRCRLGWYAQVQELLLKRLGRDFHLV IIDSPLPWRERWPAFRQALKEITNNAPLWRIIQGIYLGYHKMAALDQGEA MVRRKRAYESQRGAADKAWRRFVGRVKTAAGVRSVKRVFNEFREELAAIP EVPASPLRVKIIGEIYTVLEGFVNQEIEQFLASREDLRVEVVREITATQW FNLNVLHRRYEVERHRAIVTAAAPYLDVSVGGHGQESVGEAVLAAREGYD GVIQLLPFTCMPEIVAQNILVPLSEKLDLPFLSLIINEQNGTAGWETRLE AFLEVLAERREKLAAPGGEKHGVLFGY >Moth_0268 Protein of unknown function DUF72 MTVFNPFLASASANKPSTSNFRPPTAAIYIGTSGYSYRDWQGYFYPRGLA AKDMLPYYAREFNFTEINSSYYALPRPQNLEQMARKVPEGFIFAVKAYRT LTHDRGESVTDDSKQFRQALQPLVDQGRLGAVLLQFPYSFHNNQANREYL 
ARLRELLPDLPLVVEFRHAGWVHDAVRDFLARNELAYTCVDEPDLPGLPG PVVYCTAPVAYVRFHGRNAAKWWRHEEAYERYDYLYTEAELKEWLPGIAY LAGQARQVFVAFNNHYHSQAVTNARMLKELLAAGQL >Moth_0834 Protein of unknown function UPF0040 MFMGEYHHTIDDKGRLIIPARFREELGVKFVITKGLDNCLFVYPMQGWAE MEQKLRSLPFTRADARAFVRFFFSGATECELDRQGRILLPGNLREYARLD KEVVVVGVSTRVEIWSRSRWEEYCRETSDQYEALAEKMVDFDI >Moth_2416 Protein of unknown function DUF163 MLHLTIVAVGRVREKYLVAGIEEYLKRLRPYARVRILEAPEEKVPDKPSP AGVEQILASEGRGIGRLIPPSSFTVALDREGVMLSSEELAGRLADLALAG KNEVALIIGGTLGLATFILQQADLRLSFSRFTFPHQLMRLILLEQLYRAF KIQRGETYHR >Moth_1842 hypothetical protein MNCSQCRELISPYLDGVLSETIQRALENHLNSCPACREELEAMGQTIEII RAWSEEELDLPPGFEERLRSRLEECRQPWYRRLSRNWLSLAAAAATIMVV AITARADYLHLGSSRQIAVPHEKQVQELAMTRGDQQVTPLKALPPVTSTD APQQSAPKVKVKAATTSVRSAVRNLESSHPDPEQQQRKIVPGGTFNLNSR GRAERAAPEQQTGGQSGKGQPDQDKDKGKEKGPGQSRTVLEAGKKEVTPR AGEGVAGGTSTIAGDGPGTVKTPAGDGKEVPPLPPAGGKATLQDLTPGVG RQNSAASPDSDLQNRTLTQPPPAPVAPATIPKPPSP >Moth_0856 Protein of unknown function DUF152 MATFTGEEREGIFFLRVTFLESRAPVKAVYTSRRGGVSNAPYDGLNLGLH VGDNPRAVLANRNLLAGVLDLPLGSWVIGEQVHGNEVARVGREQAGSGVQ ELTTALKGIDALVTNEPDVTLVAFFADCVPIYIVDPVNRALGLAHAGWRG TVLQVGARMVARMATEFGSRPGDLLAAIGPAVGPCCYQVDARVVDKVQEH LPFAGELLAADGPGHWRLDLPRANYLSLVAAGLQPDRIAVAGICTHCQAE TFFSHRASGGITGRQAAMLALGEQV >Moth_2269 Protein of unknown function UPF0261 MTAKDIAIIATVDTKEAEARFLQEFITSHGWQAPVLDVSTHRPHNFQATY SREEICRRAGVEYKDLGTLRRDAMMATMGRGAARVLMELYDRGELAGVLG IGGNQGTAIAAMAMRSLPVGLPKLIVSTVASGNVRPYVEYKDITMMFSVA DLLGGPNTVSRTILSNAAGAVIGMAAWGQPLKAGERPVIAITALGNTDPA VAAARGRLVELGYEVIAFHASGTCGSAMEELIEAGLINGVLDLTPHELIG EVHGADIYTPLRPRLEAAGRRGIPQVVSLGGLDYFCFGPADTIPQRFQGR KTHYHNPYNTNVRATGGELAQVGEVMAAKLNAARGPVVVMVPLKGWSENG RAGGPLYDQEADAALVASLEANLNPGIKLMKLNAHINDPIFAASAVAVLH QLMEVSRPVDGTFPREAVEKGTLPPKNPKWRRSLTPESAIVKQAPR >Moth_1797 Protein of unknown function UPF0150 MDKFLVVIEKADGNYSAYSPDLPGCVATGKTPQETRENMAEAIKMHLQGL KEDGRPITVPTAKADYIGVNLQAL >Moth_1977 conserved hypothetical protein MVLRGLFAIIEAYLIYWPLVYFIARKYFKFTPEWAVPLASGISICGVSAA 
IATGGAIKARPMIPVIVSSLVVIFAVVELIILPFAAQAWLYREPMVAGAW MGLAVKTDGAAAASGAMVDALIRSKALSALGVKWQEGWMLMATTTVKVFI DIFIAIWAFILAIIWSWKIDRREGEQVNAGEIWLRLPKFVLGYAGLFVLV ILLGVALKGTPGIKLLNAGIGQAGIKPIYLIQIGGLRMA >Moth_0891 Protein of unknown function DUF370 MDIKLINIGFGNIVSARRIVAIVSPESAPIKRIIQEARDRGMLIDATYGR RTRAVIITDSDHIILSAVQPETVAHRLSSREPLASAEEPVD >Moth_0206 conserved hypothetical protein MLELNRPAEVVAMPVYEYRCAKCGVFEKEQRITAPPLTECPTCGGPVHRI ISRNIGVIYKAGGFYTTENRSQEYKNKAKEESKTSEVSKAS >Moth_0179 Protein of unknown function DUF86 MEKLSECLTKLEPLKTKSFDDFEQDPYLRDIVERNLEVAAQCCIDIANRV ISIEDLEKPEDYYSAFITLGQAGILPLKFARSFAGIAGFRNILVHEYIQI DWHEVYRNLHRLDDFYHFADCIKAWMKK >Moth_0210 Protein of unknown function DUF917 MGSRIILDNEVVEAAVLGGAVLGGGGGGSMEMGRQAARLAVELGSPELIT LDSLPEDAVLLTVSAVGAPAARTVYVKPVHYIRTVELFQKYTGQEIRGLI TNECGGLAAVNGWLQAAALGIPVVDAPCNGRAHPTGVMGSMGLHRLSGYV SRQVAVGGNPQTNSYVEVFASGSLETAAALVRQASVQAGGMVAVARNPVT AGYARENAAPGAIGRCIAVGRTIIENRSRGPLPVIEGVAGVLQGEIAFTG RVAAVDLETTGGFDVGRVVVRDGDRLAELTFWNEYMTLEIGSVRKGTFPD LLATMDLTTGLPLSSAEIKAGQEIAILHVHRDRLILGRGMKAPELFQVVE KATGKEVIKYIFS >Moth_0456 DedA MTALHAYLALFGLLAIEGTGLPGVPFEPAFLAAGYLIERGEMSFWGAVLV GTAGNLLGNLIGYWLGARPGRGLIERLLHQGWGEGGMTTARYWLARYGAA VIILARWFGPIRTPTILAAGVVGMETGIYALYSTLGAFTWTLAWQYASWK GTHYLLGWWQVYRRYATWWMDALLILAGLAITTISVYYCWRRWRRDKEPA >Moth_1572 hypothetical protein MWWPEVGEKNPFVPVHIQVSELDILAKLGHEIVYLTSRPVLAIDLTRAWL AAHGFPRGSIVFLPRGHKKFFALYYSIDLVFEDDPAEVLQLQKAVGRVLV PAWPYNLNTKGRGIKAFTSWREIVGYIDCFRQLRMVL >Moth_1298 Protein of unknown function DUF1614 MLWPLLFLFLFIPMLMASLFLNLAIFSFARLGLSPGGAMLLLSASVIGGL INIPVSRRRLYIEEPRFGSFPFFFYYPPQVSYQLLCINVGGAVIPVLFSL HLLATRAPLLPALTATLIVTVVAKLLARIVPGVGISIPTFIPPVVAALAA IIVSPHNAAPVAYIAGAIGTLLGADILNLGAIRRLQSQVVSIGGAGVFDG IFLVALAAALLS >Moth_2249 Protein of unknown function DUF881 MRRIYLSLLIISIFSGLLVAWQWRSHLATAAQTNQDPGLIDIIHALEKED ASLENSIADLRQQIDALQKKHSQGAGRLTETQKEIDSLRLTAGLVAVTGP GITVTLDDNSAGAEAAQKSSPATYKPDDYIIHDKNVLYMVNELKAAGAEA IAVNGQRIVANSDIRCVGTVIMVNSTRLAPPYIIQAIGNPDKLEAAALRS 
EEFVFLKSRDFPVKVGKNDSLTLPAYNGGFPLDHVRPLPTGGQQ >Moth_1355 Nucleoside recognition MTEAVLAVSRWAIPLVIFLIPAYGYLRGVAVYEAFVAGAEDGFKVAIKII PFLVGMLVAISIFRASGAMDLFARALNPVLHLVGIPGEVLPLAVMRPLSG GGALGVAAELIGNYGPDSFIGRLASVMQGTTDTTFYVLTVYFGSVGVRRY RYALALGLIADISSLIAAVFICHLMFG >Moth_2386 conserved hypothetical protein MAAKKRNYWDYARYTNMAFSFGITLTAGVLLGFYGGSWLDRRLGTSPWLM LAGVLLGIGTGFHSIFSELRALEKDLKNRETDAQDKGKPH >Moth_1009 Protein of unknown function DUF441 MESTLIILAVLVVAVLGRANTVALAASLLLVLKLLQVDQYIFPFIEKGGT FWGLVLLIAAILVPLARGTVTLRDLGHVFLSWVGLSAFILSLITTYMSGQ GLQYLTVQGHSEVMPALILGAVIAAAFLGGVPVGPFITSGVLALLVKLIA KL >Moth_0414 sulfonate/nitrate transport system ATP-binding protein MVVGLLEMLEDARGREDIFKLAGSLSMELDDIGPVIEAARVLGFIETTNG DITLTRLGSKLLNADINERKDIIAARLQELPAFKEVLQLIKSGRGRQVRR EQVVRRFARRMSDEDAEVLFKTVVDWGRFAEIIGYDTKGEVLYLDEGA >Moth_0587 conserved hypothetical protein MAHHFFLPIVVAPGETVLLEGENAHHAIRVLRLRQGESITLADSNGQGYR AEIVAITEGRVAAAIKDPLDSPEPRVRVTLYQGWPKGDKMDLIIEKCTEL GVDRIVVLATERSIPRPDQQTCARRRERWQQKAHAAARQSRRHRIPVVDG PLGLAEALVKLRPDTLLLVPWEEERTRDLKSILATVPADRELALLIGPEG GLSRAEVDLACRFGGLPLTLGPRILRTETAGLACLAAIMYAMGELG >Moth_1915 Glycoside hydrolase, family 57 MPRGYVALVLHAHLPYVRDTEDDFSLAEKWYHEAVTETYIPLINICQRLN RDRVPYRITISLSPPLVTMMADPLVQEHYRRYLERLRELAAREVWRTRND PRFHLVARMYQDLFENTARTYQTYGGNLINAFRELQDSGKVELITCAATH GYLPLIGLQREVVRAQVEVAVNNHRRLFGRPPAGLWLPECAYNPGDDAIL RDYGLKYFFVDAHGLLYATPRPRYSIFAPVYTPAGVAAFGRDLESSEQVW SAQEGYPGDFDYREFYRDIGYDLDFEYIKPYIHPSGLRLDTGLKYYRITG KSGYKEPYVPEWASFKAHTHAGNFLFNREQQINYLATYMDRPPLIICPYD AELFGHWWFEGPQWLESLFRQVAGLAPQPFSFITPSEYLERFPVNQPATP CMSSWGNNGYNEVWLEDSNHWIYRHLHHAAAEMIRLANQHPTAGGILLRA LNQAARELLVAQSSDWAFIMKTGTMVEYAVSRTKKHLLNFWELTRGINKN DLDPAKVQALEEANNIFPDINYRIFASR >Moth_0051 conserved hypothetical protein MSYTWKKVTLEVDGQRIATRTFAPTVGEFLSQQHITLGAEDAVTPALDAP VTRDIVITIKRAVPVKINADGREKEILTPPDTVANVLNKAGVTLNPADRV IPDLNATIAAGDTIKVIRVTVKTETVSKEINYRVERRPEPQLEKGITRLL QEGVKGLQEETYRVILEDGQEVKRELVSTKTLKEPVPEIVAVGAMDTASR GGQSFRFERVFWATATAYTHSGAPTATGAYPRVGTIAVDPAVVPLGTRLY 
VEGYGYGIAQDIGSAIKGDRIDVFLDTEADTRRWGVRRVKVYVLR >Moth_0446 Radical SAM MPVELRRQLAEVYHLPLKELLPAAWRLRLKNFPPVLGLATPGSKHYDSGD HRNHRRLFVTISITGQGCRLQCEHCRGELLKSMYPATTPAALLELGRQLK DGGCRGVLVSGGADLGGRVPLLPYLEALAGLKGLGLKVIVHTGLADPATA RGLKNAGVDQVLLDIIGDRETARRVYHLEMDPADYGAALENLLSAGLKVV PHIVAGLNFGRLAGELEALYQVLSRDCRDLVLVVLTPLPGTPMAGITPPP PAAVGRLLATARLAGPRLNILLGCARPAGRHRLWTELYALRAGVNGMAYP HEATVARARRLGLKPFFSDLCCSLL >Moth_0453 LmbE-like protein MGVSAVIPAYNEETTVGRIIDTLKQVAAVTEIIVVSDGSEDDTAAVARHH GARVLELAVNSGKGAAMTAGAREAREDILLFLDADLEGLLPDHVQALIEP LLAGRAEMSVGIFSRGRSMTDLAQVVAPHLSGQRAIRKDLFLAIGADRSR FEVEVQLTSEARARNWRVEKVPLVNMTHIMKEEKRGLYRGVVARMGMYKD IAGFFWRLTRKKLKARPVAVLLLLLSLGVTFNYDTQRVASAEAGRMPDLN LPAAGQRLLVVSPHPDDETLGAGGLIAKARARGDTVKVVFMTNGDGFRRG VETTRGILPTSAGDFLTYGERRQQEAITALGNLGVGPADIIFMGYPDGGL AAIWSNYWQEDKPYRSACTRKEAVPYRLAFKPGEPYAAPALLADLEEILR EYRPTDIYVTDTNDSHPDHWATGAFTLAAVGELKGEDPTFNPRIYTFVIH TGMWQMLPVFDRDHKPLLPPGYFLARGTPWYKLPLAPAILELKKQAIAAY RTQEMVMPTFLANFERPNEVFSRLPDQEVITTATGMSVDGWVKEWPRDAV IALDPAGDLVTKKVERGGDLKAAYLLQSGRTTYLRLDTWGRVGFPVNYTL SIYLLPASPGAGSQRFTWSWAPGEKQVRWLTRPAGYDPNAIRVASGGDSL EMALPDLIPPGEHYLMFTAVTSIGRLPLDRIPWRLVKIKGSDL >Moth_2248 Protein of unknown function DUF147 MTGLAALWRYLLSLNFSDLLLVALDISVVAFVIYKFMMLIKGTRAVQLIK GLVVLVVASVIAERLHLTTINWLLSQLRLVIVVALPVVFQPELRRALEQL GRGKFFARPLTTLGAEDMEKLINELVRAMQVLAKNRTGALVVVERETGLN DYIETGIRVDGVVSAELLINIFVPLTPFHDGAAIIRGDRVVAAGCFLPLS ESPYLSKQLGTRHRAALGISEISDAVVLIVSEETGVISVAEGGKLTRFLD EKNLRELLQNLMLPQDNHTTFLWPWRS >Moth_2507 transporter MALLMELAVAGALMGTMIGAGFASGQELVQFFLTLGTGAPAAVVLMTGLL MTSSFLVRHLALKWRTASYRDLMIMLLGNWYRPADVAITAFLFGGLAIML AGAGAVARQYFGWPPLAGILACSGLALLGSLGRGRGVLILNSLLVPVMLG IIVLIVGLNWAPTVGPLTTVPAGPLVGSNWVLNACLYVTYNMVGLMVLLA SLPNSRRGTAGAALGGLLLGILVFVLVQALGRLPKEILVTELPLLSLVKS RHPELQGAYALSLWLAMVTTAASNLYGLAERLGSSSRLPLPAAVPVLVLA VPMASFGFANLVGLIYPFFGYLGLLLLILVMGRRLLLFLRF >Moth_2060 conserved hypothetical protein MKIRRSRKIALVSHCILNTNAKIEGTASYPAALQQVVGLLLAHHYGIIQL PCPELRAMGLRRWGQVVEQYANPFFTEAVRSMLVPTLREIQEYVRNGYRV 
DLLIGIDKSPSCGVNLTCSGDWGGEIGIKKLTESMVVKGEGVMIRVLRQE MEKLGIPLTMVGVDEDNLDLSIEAIKQAMLRAGS >Moth_0567 Iojap-related protein MDAGRVAQLAAQAVLEKKGIDPVILDLQGITLIADYFVIASGTSTVQVQA IAGRVEEILDAAGVELLHREGLEAARWVLLDYGAVVVHIFLEEERRFYDL ERLWGDARRVAIESP >Moth_1564 Protein of unknown function DUF103 MSIDLCRWRLEKAERTFKEGEQLLDVGFYNGAINRFYYAAFHAVRALLAL KKLDSAKHSGVISLFNREYVKTGVISKEASKTLSTIFAMRSEADYDDFKS FSLQEAADARKAVRSLIDEVSAYLAGIS >Moth_0131 membrane protein-like MAEGRVTALAEGALMAALTVVLVLTGYFIPPLQLLTNIIWTVPIVVLIVR QNLRLGVMATFIAGVVIALFTGPLNATLLFVQFAALGLVYGYLFKIKAGA GRTIVIGALVALLSLLLTLALTFKLTGLPVGGLIQEFEGTVNYAMEFYRR AGILDHLAGQGLTPDQIQASLQGMINLLKLLLPGILMTASLLAAFANYLV AEKVLQRLGLKAAGLPPFRYWQLPWYAVWGVIAGLALWQLGDYYHLALAS RVGVNILYVYLPLLAGNGLAAVTFIFYHLRLAPFFKAVLVLVALMYFPVA LVSLVTLGLFEPFFGFRRHLRPPADKGE >Moth_0395 Protein of unknown function UPF0016 MKAFLLSLGLIFIAELGDKTQLVALTLATRFNARVVLAGIFTATLLVHVI SVALGEFVGVLIPTAWTHFLAGLAFIGFGLWTLRGDSLDDERDNAHRIAS PFLLVVVTFFLAEFGDKTMLSTVTLATTYSIIPVWLGSTLGMVLSDGLAI WIGQAMGSRLPERVIRLGAAFIFFVFGLFSTFQGGLNLSPPVWGLGVAIL AILVFIFFRKPVQKGN >Moth_0045 hypothetical protein MSTNLPEDTGGMLTVALAGLEGQLNEVLKETRRLRARVAALEEENQRLRA MILAAGEGESGRQQLVRLYNEGYHVCPPHFARVRGNEGCLFCQSFLEKKG LLPDG >Moth_0859 Protein of unknown function DUF552 MAFWQGLINWLGYGHEGEEPVMEKPVLEAETTPVTPAKGKLVGLPTARNA MRLVIARPQSFEQAAGLAENLKNYRPLIVNVEGIPVEEARRIIDFLSGAA YALGGRVRKVTSGIFLFTTSNVDLSGDLEDQIPGGLNWLEAAGRGR >Moth_0050 conserved hypothetical protein MFLLRQPSAGRGRKLVLLACLVAGLQSLFLVGWTSKKVDIYADGQARQVT VNQWLVGDFQSRSQENLRIGDWILPLPGGWFWPGLKLFMARGTPVAAEVA GQNIWPREPASVASELLNREGITLGPGDRVETNLGSEDPHQYIRVIRVED SIEVQQQPVDPPVVRRPDRYLPPGQEKVVQEGQPGVRYYKYQVRKENGVE VERRLVDTWVEIQPSPKIVAYSSRAYPEVTARAGDTLMVIATAYTHTGNR TATGIWPYRGVVAVDPRVIPLGTRLYVEGYGYAVAQDTGGLIKGKRIDLF MDSAGEAMRWGRRQVTVRILGD >Moth_1242 Branched-chain amino acid transport MDRQILILILGTALVTYLPRMLPLVVLSRARLPEVFLRWLGFIPAAVLAA LLAPGLLLPQGKLALTGNPYILAAIPAGLVAVKTRSMALTIVSGMAAMVL FQYWL >Moth_1263 Protein of unknown function DUF302 
MEQPDFSYTVTTARDFEAAVKAVEEATAAQGMKVQHVHDVQATLRSKGYD SDPLKIIEICNARYAHEVLAKDILISLMMPCKINVYVRDGKTYISALRPT MLAQFFPHARLEEVAREVDTKIRTIVEAAR >Moth_1083 conserved hypothetical protein MRILMIGDVVGRPGRKAVREVLPALLQEHRPDLVIANGENAAGGNGITPD TAGELFASGIDILTMGNHVWDKREALTLLEEEERIIRPANYPPGTPGRGY NLFEVKEGLKVGVINLSGRVFLPPLECPFRLGKQLAEELRAETRVIIVDF HAEATSEKVALGWHLDGLVSAVVGTHTHIQTADARVLPGGTAYITDVGMT GPRDSVLGVKTEIIVHKFLTQLPARFEIAGGVIQLEGVILDIDPSTGRAA GIQRVQHYCNP >Moth_1060 hypothetical protein MRAGLEQGLITAIKGDREEKPVDRLEHIFELQERFDRDLARRRQLPDYSP AEWIQKEVLAIVAELGELLDEVNFKWWKNPHPLDHEAIKGELVDILHFLV SMCLKAGITAEDFYQAYLAKNQVNFRRQQGLTDQKEYAAGWADKEE >Moth_2394 Protein of unknown function DUF204 MEPLGLILVAVALGTDAFSLATGLALGGFRGRQAWLFAGTVGLFHIFMPL AGLYLGLLLGRLLGKVAAIIGALVLATMGTLMLWEAYNNRRQGGSMVGQV LRVIPGRGGVLGGVMAILFMAGSVSLDALSVGFGLGAISVNVPLTVLTMG FIAATMTALGLLAGRRLGSFFGNRAELAGGLILVAIGLKMLVGV >Moth_2326 hypothetical protein MKFRYDPDADALYIRFNESSICETEEISQGVLLDIDEEGKLVGLEILNAS KKLGKPPLTVEVELSKAATI >Moth_0259 conserved hypothetical protein MDGLKWLYPGLKIKRWLLLAVLGLLLLVSGLTVILGITLLASAEKGVTWF ILHTLGGLGSPLLAGLLAMALGAVFIGVAVRNLARSVIQVLLPGHTANPW QVFYRRQYLARGPHLVAIGGGTGLAVLLRGLKNYTRNLTAIVTVADDGGS SGRLRQELSIPPPGDIRNCLVALADTESLMEDLFSYRFRQGEGLAGHSLG NLLLAAMTDMAGDFDRAIQELARVLAVGGRVIPSTTTHVVMGAELADGST VLGESNIPLAGKPIKRVFLKPADCRPPAAALEAIARADAVIIGPGSLYTS VLPNLLVPGIVEALRDTPAPVFYVCNIMTQPGETDGYTVADHLRALIDHC GQGIIDTVIAHSGPISRAARRRYGEKGARPVLINSPAIARMGVELRRGWL VDETHVVRHHPERLASLVMEEVYRHQARGRRRFFYLVRERFRTLAR >Moth_0160 UvrB/UvrC protein MLCERCQERPASVHVTRIINGEKTELYLCQECARELQPQLNFSIPQFLAG LLDYDPELEVKAPPAVERCPECGLTYEQFHETGRLGCPECYHHLAPRLDP LIRRIQGSSQHRGKVPRRAGGNLRLRREIENLRARLQQLVQQEEFEKAAQ VRDRIRDLEGRLEKGESSQ >Moth_0855 PRC-barrel MIRVAELRQREVINVIDGRRLGTIKDIDLDLEEGRVRALIVPGQGSKFLF FFGREEDLVIPWENVVKIGVDVILVESYSSTAPVHREKA >Moth_2213 Putative Fe-S cluster MFLEMIDVTKVETCLADAEKIRLQAVLSQDIEELLPYLNTVIKNAVYNHY TKNLTFLKEFRLITLYPRKLTMAKAVNMTDALQVLDWLKDLINDTHRRQK EIQPSFAKKDRPTALKIYKWLPGTNCRRCGQLTCLAFAARLLSGENTLAD CPPLAEEENSERFNALQGMLG 
>Moth_0249 Protein of unknown function UPF0150 MEIKRLTAAITQEGKWYVARCLEVEVTSQGHSIEEAISNLKEALELYFED EEHPRINASPIIAPVEVNMAI >Moth_1046 Protein of unknown function DUF150 MSKLTALIQELVEPLLTPMGYELVDLQYGREGGRYILRLFIDRPEGIGLD DCEQVSRVVSALLDEKDPIPHSYYLEVSSPGLERPLKKEADFNRFAGRKV KLRTFAPINGQRHFQGRLLGYQDGQVRMHLEEGWDLAIPLEQVATARLVF DLADDEED >Moth_2327 hypothetical protein MKLRFTAHAEKQLIERKIVKKLVNETVCNPEQVIPQGQDVLIYQKIYKEV GKEYLLRVAIKLSGDTYVVLTAYKTSKIKKYGGDK >Moth_0938 Nucleoside recognition MRQPVFIISRGITPFLTAVAVVILALAIVLFPQPVFQAALRGLRAWWEIV VPALLPFFIISQLFMGLGIVHFLGVLLEPVMRPLFNVPGSGAFVMAMGYT SGAPISAILTSQLRQQQLVTRVEGERLICFTNNASPLFMLGAVAVGMLHN PALGPALAGAHYGANLFLGVLFRFYGRRAPASPPGNHPLLSLPRRAWRAM IQAQQRDGRSLGQLLGDAVSHSFQTLITIGGFITLFSVIIQVAGMLGILD LLARLLLYAGHPLGLTPATAGALASGIFEMTMGTKFASEAPVPLGEQLTA ISIIMGWAGLSVLGQVAAMTSKTDLRLGPFILARLLHGFLAAFMVQLFRG PARPVLGWLTGSHFLSPPVSWLSLGVHYTGFTLTLAALLLFLTVLGLFAR LTLYRRF >Moth_1111 Protein of unknown function DUF151 MLIPVKVKQIVLDQTLNPVVLLGEPEGNQVLPIWVGPFEAQAIALAMQGI LTPRPLTHDLLRSLCENLGVEVNKVLVQDIRDGTYYAELYLRQGDREVVV DARPSDAIALALRTNAPLYITEKVAAYTLNVEDLVSEDQAEELQQMLTDI KPEVDKKHLH >Moth_1745 Pseudouridylate synthase MKLKVIPEDFVVRELARLPIREKGPYRLYLFEKKGWNTIDLLIRLAKAHR LPYRLFAYGGLKDRHAHTFQYVTVKHPADLTTEAENFSLQSIGYMDRPMG PDLLEGNEFAITIRALGAAEVCRISRRVDEVRGFGYPNYYDNQRFGSMDR QMGLMAERLLKKHYNGSLQIYLTGIYPEEKKEARERKLFFREHWGDWSTC LARAKTTMESRIFSLLVEKPKAYIEALQMIPREELSLLFSAYQSFLFNEL LRRILQEFGLDLTAVPGTAGPYLFYRRLERKELGYLRALSLPLAASRMEF PDAMSERLFAAILEERGIKRSSFNLRKVRQAFFKSTPREAIVFPGNFRIQ PAEPDDLYPGRQKIRLFFKLPRGSYGTMLIKRLTMP >Moth_0102 Protein of unknown function DUF606 MWLALLIALVSGIAMAFQGSLNSALAKITGLLQATLVVHLTATLAVGVLL FFPLSDGHLGRIWQCPWYLWLGGLIGVVITYGVVASIPRVGVALATTAII VGQVTTALIIDHLGLFGLDKIPFTWWKAAGLILLATGARLMLN >Moth_0347 hypothetical protein MRIGVDLCNTIANINAMLVMKFTRLSLTRYPDPEIPEGFFHTTEGLELLS KAQPFPCAAGTLRFLASAGHEIIYLTSRPVLAAGLTREWLAVNSFPRGTL MFLPRGFKALFARYYGIEWFFEDDPLEALRLNDVVSRVFVKIWPYNLGVQ GPGIVRFVNWREVLFLVSGRKRAGAGVVHERHDYGARIGARG >Moth_1062 conserved hypothetical protein 
MVHMRFTELMGKEIINLYDGSRLGNFADADLVLDADEGRVAAIILPPRGG WRSLFGSRQELLIPWEAVRKVGNELVVVDMDPTYSRRQKD >Moth_1151 conserved hypothetical protein MSLHIGFPTALIYYSHFPFWQAYFNRLGVEVVTSPTTTKAILDDGAREAV ADACIPIKLYHGHVLALKDKVDALFIPRMVRLNRRTTFCPKFLGLPDMVR ATLDKLPPIIDLQVDAGRNLWGLWPTCRGVAELLGFSRRLAWTAYLEGSH HQQAFEGLLLKGYLPLEAMAKLRGEPIEPAPLRPGSLNLAVLGYPYQVYD GYISLNIIAKLRKMGVNILTLEMVPQARLYRLSRRLPRRLFWHFSSLVVG ATYNYLQQGDLDGIIHVTAFGCGPDAMVDKIMELDIRNYSQGKMPFMSIC IDEQTGDAGVSTRLEAFVDMLLQRRAAR >Moth_0120 Protein of unknown function DUF441 MSAATVILILLMLLGILGRSNVIAAAAAFLLLLQFTSLQRFYPILERRAL EAGLIFLVVSVLVPFASGRVAPRDMLQSFVSLPGLIAIASGIIATHMNCQ GLELLQRFPQMMIGMVIGSIIGVAFFGGIPVGPLMAGGIAALLVHLMAWL R >Moth_0952 Stage V sporulation protein S MEVLKVSAQSNPKAVAGALTAVFRQHGKAEVQAVGAGAVNQAVKAIAIAR GFIAPNGIDLVMIPAFAEINIDGEERTAIKFIVEPR >Moth_1321 Protein of unknown function DUF205 MWLLALVVAYLIGSIPTAYVVGRYLYGFDIRRRGSGNVGATNTLRTMGTI PGLVVLGVDALKGVLAVLLGQALGGPVLVILAALMAIVGHNWSIFLEFQG GRGVATTAGALLAMAPLALFWAFLIWLAVVIFSRYISLGSIVAAAVAPFL VIYFHRPWPYVLFTFVAAALVIYRHRPNIKRLLAGTEHKLGERS >Moth_0821 Protein of unknown function DUF964 MQVYDRAHELARELSRSSEYNDFRLAKAKLESNATNVDMLRDFRRRQLAL EMAVLSGKEPDPADKKALEESYRIISLNPTITAYLEAEQRLARLLADIQK ILIDALPEWGKDIIDEVDKK >Moth_0965 Putative helix-turn-helix protein, YlxM/p13-like MLDDLARVARLYDFYGPLLTPKQRHWLELHYHHDLSLGEIAGEEGISRQA VYDGLQRAVKALEEYESRLGYLRRDMALREQLAAAIRHLENYRRGGGEGE LVETSRILQRLLELPEGSMGKK >Moth_2038 Trp repressor MVSDKLRDPQVDDLFRAILALEDIDECYRFFEDLCTTAEIKAMAQRLAVA RLLRRGVTYTAIAEATGASTATISRVKRFLNYGADGYKLILERLEGNGK >Moth_0443 Biotin/lipoate A/B protein ligase MRQWRLLDTGSRTAAENMALDEVLLTARSQGQAPDTLRFLQFNPPCVLVG FHQVVEQEVRLEYCRREGIEINRRITGGGALFWDTNQLGWEVITTLDYPG VARRLEGLYAQLCGAVVRALKRLGVPAAYRPRNDIEVGGRKISGTGGTEL GGAFLYQGTLLIDFDVETMLRALRIPTEKLKAKEIASLKERVTCLKWELG RVPLLETIKQVIAEEFCREFAMELIPAGLTPAEEALLAHQLPYFQSEEWI NAVQGPEGRTELRSSRRTRGGFLRSSLVLGPGNSRIESLYLTGDFFAHPR RSIYDLEARLKGLPADPVLISRQVEEFFRESGARLPGIKAAEVAAAINDA LVKKDYPRQGIPAAAVNDVFTVVKPLEEITAAPVVLLPYCAKLPTCRFRG 
RQGCSECGRCDIGTAYALARQYGLEPLTIQNYEMLARVLRRLQREGAPGF LGSCCEAFLAKHRRDLERIGLPGILLDIDSSTCYELGQERAAHAGRFENQ TTLKLDLLELLMARVAPGKARRQVAVAAHA >Moth_0690 conserved hypothetical protein MRVGVGFSSANDPSAAGQVASEQAVRQSGSPVITLVLTTDNYDQERVLSA VKRVIGNSRLVGACVPGVIVNARLYKRGVGICTVSGEGVEAVTHLQRNIS QHSYRKGEKAGEALLEKGGETPGTVLLFPDGFAANISGLLRGLYNVMGPA FEYIGGGSGDNLRFYRTYQFTEEGISSDAVAAAVIRGINFQMCLSHGWRP VGEPLMVTKAKGRKVYEIDGLPALERYSALVGAYDKNDFSCYSMKYPLGL PCAGGEFIIRDPLKAEEDGGILFVTEIPENTIATLMEGDTASLLAAAEEV SKKALNTPAAPKTFMVFDCVSRYLLMGEDFSREMEAIAKNIKAEIPVIGM LSFGEISSISGTPLFYNKTIVAAAGW >Moth_1327 Protein of unknown function DUF1614 MTGYTIGVLLLVLVFGLVYFGLAHRVLDRLYLNDRTALALIVAMIVGSFI NIPLTSGRVVTSINVGGALIPLGLAIYILYRAGSVREVGRALLGAVITAV VLFGITYITRGREAWNVYALNLLDPLYYYPLVAGFVAYLVGRSRRAAFVG AILGVLFLDIIDLIFYLRLGLRGTVAFGGAGIIDATFMAAVVAVLLAELI GEVRERMQGGPELAGHSRSLLAGLKGPSLKRTPADIDINRDGLRNEGGEG NG >Moth_2072 conserved hypothetical protein MQPKSGNRYDITIKDLFADETQELINYFGHLEARVTGDLKIEFPQVETRV SDLVMKAESQQGPLAIHLEFQSRNDDEMPYRMLRYALEIHKTYHLPVYQI VIYFGQWQMNMTSQLEYRLGDQNLLDYRYHLIDVGNITYEELKNSPHQRL LSLLPVVDREKRQKGGKEFLRRCAEDIINSDLDLETKKTVLLRAEIFAGL VFDKKAIDLVFREVEQMLSIEESAGYQRIFEKGMEKGIEKGMEKGMEKGQ QESLLNVTIRLLSKKFRKIPREYMARIKKQDVYVLQQIIDNIFDINDLKE LEDYLQ >Moth_2360 GtrA-like protein MFNNKPGAAQFLRFCAVGVGNTVVDLTTFFGLTLVGMPYLLAQVLSYSAG VANSFFFNRKWTFRVTHKASALEVIKFITVNGLSLLLSSGLLFILHDVTH LQLWLSKFSATGGGIVVNFLGSRLWVFTES >Moth_1914 conserved hypothetical protein MDILSDEFKQSFSIIPPESGVYGMVTVVSVLILALLAGVFLWYWRRKGNS PVAKPLPAEAPPARDLPAKYGKDEIVLMVKDPYWLYAYWELSDAKKEELR RQFGPAAWENSRPLLRVYDLTDHPYDFLRAPFFEIAITDLADNWYIHTGK PCSTFCVDLGRLIPNYGFITLVRSNIVTTPADKPSSVIDPLWPPIEACWT AVERYEKGTGPSSLQLVKTQK >Moth_1023 conserved hypothetical protein MPEFKAMELVAELMAIAARTAPKAGGKDFIELKILQGDSLEQLAIAMTRY GQEKGKKNFDRDGENVRRSDAVLLVGLKKAAKAGLDCGACGAARCADLEG PHEGPEFAGPICAWRLIDLGIALGSAAKTAGILNVDNRVMYRIGVVARKT GLMDAEVIAGIPISATGKNIYFDR >Moth_0762 Protein of unknown function DUF115 MSPSTNLVYRKNARVLQRYSPELFRDLEATALPLDRQLAPAQNGEPTLIA 
ITGGKEIALHSRYDPRREAVTWARGVDENADMVVVLGMGLGYHLEALKDL YPHKAVLVLEPELAAVKLAFAARDMTHLLKSGQFYLLAVADPEDAAAQLS NILAENAGKRIALHTLPAYEQLYAGYWQRVCQGVTDRLRQRRVNWATTEK FMMQWLCNFRDNFLPYIKAPGVIHLFDAFSGKPALIVAAGPSLEKNIHLL PSLKGRVLIMAAGSAIRILEKNGIKPDLLVSFDPGDANYQHFAGFDGRGV PLVYAPVIFPRIVQEYQGPTFSCELNVSPFIEWFDEKLGEKKGVLISGPS VANVCLDLAVKMGANPIILIGQDLAFTNNKTHADGARHQQRIDPSQGNYI WVEDIYGDRVPTTTAFYSMLVWYEQYLGNLKGKRLVIDATEGGARIRSTE IMSLQEVRDKYLRETFSPGEIIAAKHDVYAVPDGEQLRRLEEAFSELSSR REDLRACFEEGIEVARQLLEKCHKKTVKLTNYERARRKFMGLDRRITGNI LYRLFLEQGLAARIDAINRILGERVNDEQELPARGEKLASLYLSFFTEVE RYAEFTTEILKEIEEKIRRESASTSCSKA >Moth_0209 conserved hypothetical protein MEKHPRAFEPAVLILNIILSVLGSIIGLQILTTLGVTPNTAIIGVLVALA LSRIPGGWMAKYRSIHRQNLVQSTISGATFGAANSLLLPIGIPYLFGRPD LVVPMLIGATMGMFIDWAMLYWFFDSRIFPGQAAWPPGVAAAEAIYAGDE GGKRAWLLVWGTIIGIIGSYFKVSMSAFGVAFIGNVWALTMFGLGLLLRG YSVKLFGFDIDKLYIPHGMMIGAGLVAGIQILLILLKGRKETTASGDAPA AANYTRSEKQVAKGLARGFGLYIVAALVLAMLGGLYTSMPAWQLLFWAVF AAVSCILAEFIVGLSAMHAGWFPAFATALIFLVIGMALGFPAPALALLVG FVASGGPAFADAGYDFKAGWILRGEGRDRGFELDGRWQQFLAGASGLVVA WAMVTLTHGIYFRQGLFPPVDKVYAATIKAGVDAAIIKNLVLWAIPGALI QALGGSEKQLGIMLATGLLILNPLAGYAVLAGILIRTLVLKFKGREAETP MTILAAGFIAGDALYGFFN >Moth_1412 Dinitrogenase iron-molybdenum cofactor biosynthesis MKVAITARGNDPKAEADPRFGRCQYFVIADVEKGTFEAIANANQNAGGGA GVAAAQALVNAGVEVVLTGNVGPNALRVLQEAGIKVYSTAAPTVQAALQQ WQAGGVSPLSQATVGSHFGMGRGGNRW >Moth_0511 Protein of unknown function DUF444 MPVEYNLSREDWSLHRKGYLDQQRHQEKVREAIKKNLPHIIAEESIIMGR GKKVVRVPIRSLEEYHFRFNYNQGQHAGQGSGGTRKGTVIGREVIEGAGG GAGAGDEPGMDYYEAEVTLEEVQEMLFRDLELPNLREKKKPVMASPAYEF RDVRRKGLMGNLDKKRTLLENLKRNAMKGKLAIGGITPEDLRFKTWEEKI RYETSAVVLAMMDTSGSMGTYEKYIARTFFFWMVRFLRSRYQQVELVFIA HHTQAREVTEEEFFAKGESGGTRCSSAYRLALEIIDRRYPPADYNIYPFH FTDGDNLPSDNEACLEAVQELLPRVNLLGYGEIVNPYYRTSTLMNVLKRI KDDRLVTVAVKDKSEVYQALRQFFAGSKGGEAGGTRI >Moth_1372 conserved hypothetical protein MVVKVKQWRLGVPRALFYYYYAPWWEAFLQALGAQVVVSPPTTREIMDLG ISLAVAEACLPVKVYYGHAAWLAPRVEALFVPRLVSVEQKSFICPKLMGL PDMLRAAIKDCPPVIDVTVDMSRRPEEGLKAAIRDTARAIGSRGREVYRA 
GEIARARYRDWLAGQQEQGREFSPGKKEGITPGGQPPLAVGLVGHNYLLH DRYLGMDIAGKVTRLGGRVILPENFAPELGEAACRRLPKRLYWTLGRKIM GAALHLMEQEEVAGLIHLTAFGCGPDSLVGDLAERYAHRHGKPFLLLTLD EHTGEAGVETRLEAFMDMLARRRPA >Moth_0890 conserved hypothetical protein MLNSMTGYGRGEASGAGKTVSVEIRAVNQRFLDVVVRLPRAYGALEEKIR QELKKSLNRGRVEVVLTIKEDNAEKRPVNVDTGLAMAYYNALKELAQKLS ISADITAAGLLSLPEVITVAEPEWDEATLWPVVARALAAALAGLLEMRRA EGQRLQADLEARAAFVRRQVEAIRERAPEVPREYAARLRERVDELTGGMA LDPGRLEMEVALMAERADITEEVVRLTSHLEQMQAAMAGAEPAGRRLDFI LQEMWREINTIGSKAGDLTISHLVVAVKGELEKMREQVQNIE >Moth_0713 conserved hypothetical protein MMWGYYNWGMGVWMLVWWAILIGIIVLAVYGLVSLFNRRDGQSPMRPDPL GIIKERYARGEITAEEYHRMRDELKE >Moth_0186 Protein of unknown function DUF86 MALKFDALKAGKLLAAFRQAQKRLQDLARLPKEEFLTDPDKIGSAKYHFI VAIEAAIDLSNHIIARNNLRIPEDYADTFRILGEAGIFPPEFVTTLIKMT RFRNRLVHIYWDVDSDTLYNILVNGLKDLDAYLKAMGKFFTGNKEEPYC >Moth_1674 Protein of unknown function DUF156 MSRRPGEESSYAPQREDLLLRLRKIEGQVKGIHRMIEEDKYCVDILIQIA AVRAALKKVGSMIFEAHVRGCVRTAIVNQADKEIISELIDVLNRFIS >Moth_0044 Signal peptidase II MIVETARGLELGEVVIAPRQVEETEVVQPLRPVLRPATQADLEQVAVNRQ KEKDAFAICQQKIQEHGLPMKLIDVEFTFDVSKIIFYFTAEGRVDFRELV KDLAAIFRTRIELRQIGVRDEAKMLGGLGCCGRPICCATFMGDFDPVSIR MAKEQNLSLNPTKISGLCGRLMCCLRYENDAYEDARHRLPRCGAMVRTPA GCGRVTGINILKERVTVEIPEQGSMEFPLEAIEGECHEH >Moth_1876 conserved hypothetical protein MAKQYSTWQIAATYIGTVVGAGFASGQEVLQFFGYFGLRGILGLILATAL FIFFGYTVLRLGFQLKAESHLEVMHRAGGAFIGRAVDAVTTFFLFGALAV MAAGSGAIFRQEFHLPVLLGSSLLIAITLVTVLAGIEKVIDSISLVAPVL IASVLGISLATVAKNLPALVANLSWEETYRAAVSSWPLAALLYASYNLVL SIAVLGPLGALARQERLLPGAFLGGLGLGLGAIAITLALITTAPAVTALE VPMLYIAGSFSPVLRIFYSAVLLAEIYTTAVSSLYGFAARLAGPGGNNFR RLAIGASAVALAAGQAGFSRLVATLFPLVGYAGFLLLGGLAYYVLKEILA LRPAFPGRLVPAPARRPILGAVLERRGKAGEKERP >Moth_1428 Protein of unknown function DUF364 MSLIEAIIASLSGDEVVKEVRIGPFWTGVWSRYCGLASTTFTHEHENRFP VGEAGFLTGKSARELCRYATSTSLLEASIGLAAINSLLEVDMEQCQDINA GELLIERGAGKRVAVVGHFPFVPGLRRVARELWVLERRPQSGDLPADEAA NVIPGADVVAITGTALINGSMESLLKLCRKDSLVMVLGPTTPLTPLWFDY GVDLVSGTRVVEPEVVLKFVSEGVVFKQLHGRGARLLTMAKKGLK >Moth_0004 
RNA-binding S4 MDQFLKWNGVAATGGQAKELITSGLVRVNGQVERRRSHELVPGDEVEVKG ACFKLTTAPAD >Moth_2262 conserved hypothetical protein MEGDEVMPTYDFKCNECGHLFSQFAAIKERDKVRCPECGGKVSQRFTGFL YTRKGKPGAAYSSSSCSGGSCSTCSGCH >Moth_0956 conserved hypothetical protein MGKSKLISFILSFFPGLGHFYLGLMNRGIAFMATFFGWMALVILASVITG FNGFIALLILLPLIWLYSLFDAVQLCGRLQSGETVSDSSPLAELSESLVN GYKSRLWALLFSIIPGAGHMYLGWQQRGLGFMSTFFLAMFLMEWLRLSLF FFMLPVIWLYSFFDVMQLVSNDISVPASEGSFSTWLLERQRWVGLVLIGL GILIIFDRMVVPYLSYELINFIKTGFIAAIFIGGGLRLALGSKIDLPATE EKTGVSSDNELMEGERCNHRENP >Moth_0126 Protein of unknown function DUF951 MSFYVLQRGNRRFSEENCSMDFHIGDIVQTKKTHPCGSDRWEILRVGMDF RLRCLGCGRLILIPRVKAEKSIKKVLPKPS >Moth_2085 Protein of unknown function DUF1648 MENKYRVSKEMITRDWPALMVLVAMLVAGILVYPHLPDLVPSHWNFRGEV DNYFNRFWGAFALPLMTGGIYLLLLFVPYLDPKRENYPRFDRAYQVVRLG MVFFMGGIYATTLVVALGGPANLVGRVVPLAIGLLFILIGNYLPQARLNY FFGIRTPWTLANEEVWRRTHRFSGYTFILAGFMFIVAAFLPPPANFILGM AGPAIAVVSTTVYSYLAFRRVSR >Moth_1001 conserved hypothetical protein MGNCKLDLIILKEELAVCRLQQDAPVPGWALKGDFVSITRTPDELSLVCP AAGVPSGVKCEKGWRCLMVEGPLAFSLTGILAALVVPLANQGISIFAIST FDTDYLLVKEKDLERAIQVLSREGHRIRP >Moth_0145 conserved hypothetical protein MRIRAHHLLCALQFRGYGYSEAFVRRVSRVIALWRHRPGLILTITRSHDA WCRACPNRDTSNCRQAAARDARVLACLGLAPGARLEVAAAQDLVHRKIDS RAATYICTGCRWLDGGYCRW >Moth_0053 Protein of unknown function DUF458 MEPVYFTSPTRGRLTEDEMFADMMQFVDANPEAQYKIIIGSDSQARVRTC FVTAVIVHQVGRGARYYYRKKYQRKITSLRQKIFYETALSLETASFIAQR LAANGHADLNVEIHLDIGPNGETRDLIREIVGMVVGSGFAAKIKPYSCGA SKVADKYTKSG >Moth_2201 hypothetical protein MIACQLSLYPLGTPAYTPVIKEAMAVLEQCGVEIEVNAMGTIIRGEEEAV WRAARQLFQVAAGRGEAVLVMTVSNRCGCKVANKKAPARGS >Moth_1073 transcriptional regulator, XRE family MGQVGEILRSTRQEKGISLREAEEATKIRLKYLEALENGTYDEIPGRVYA LGFLRNYARFLGLDPAELTALFKEEYPPKEESYQVEEPPGITTPRLTTRG WGRWLLILGVILVLWGVNRLYNYYRPSPEQSPAPPPVTEPAPATPAPVTP VQPASPPSQVQGVEVKIRATGNCWVGAVVDGKADFSGTLKPGDEKVFQGK DKVSVTLGSAGAVEVTLNGQVQPPLGKAGSVVTFEADKGANQLRIIKKQ >Moth_1926 conserved hypothetical protein MRNREVPKYKQLALKIPYGQGYLVGELPEGIKVKEVVPSEIAGVPDPSAD 
VRRALEQPIGNHGVEELRGVRQVVIVVSDLTRPVPNDVILPVLLEKLNAI GIKDEQVTILVGTGLHRPAPPEEFPLIVGPEVAARVKVISHDAYAPGILQ QIGVSSRGTPIWINRHYLEADGRILIGMIDPHQFVGYTAGSKSLVIGCGG EATIRANHAHLVEPEATLGRIEGNPAREDIDEIGGLVGINLIINVILNSH KGIVRTVAGHYLAAHRAGVAVARQVSEVPVPALADVAIVSPGGFPKDINL YQAQKGLAHGTRLVKEGGVIILCAECREGAGEKGFIDTMQAGNTPEEVIK VFKQGEFQMGRHKAFLWCRSLVRARVILVSDGVDDNLARIMMVRKAPDLQ AAINMALKELPDAEVVTVMPKANSTIPVF >Moth_1586 conserved hypothetical protein MRICITFNNNITVPAELNDSETARKIQDALPLAGRVNTWGHEIYFSIPVK AGLEKGATEVMEIGDIAYWPPGHALCLFFGPTPASVDDKPRAASPVNKIG RFEADSHILKQVPDGARVEIKEA >Moth_1361 chromosome segregation and condensation protein ScpA MLGNVRLDVFQGPLDLLLTLIERQEIDVQAIPVAEVTAQYLEYLQEVEEL DLEWASEFLVLGAELLALKARLLLQRPAAGEQEEEGEDPARALADRLQAY RCYKEAAQHLAELAATGALIFTRPRDQEAVERALAGINPLAGVTPLDLAR AMARVLARKEQVQEPEEPRLIPRLAFTLAGQVRHILRSLYRAKTLSLDRL LSTRPTRMEVALTFLALLELARRGRVALAQEVNFGNIEVRLLPHPGAR >Moth_2512 Protein of unknown function DUF111 MKIAYFDCFSGISGDMCLGALIACGLSQDELTSGLKGLGLEGWELRVREV KQHSIAATDVAVQVTGSQPHRHLADILGLINNSSLPAPVKEKSAAVFKNL ARAEGQVHGIDASQVHFHEVGAVDAIIDIVGSILGLHLLGIEKVISSPLP AGSGWVDCRHGKLPVPAPATLYLLQGYPVYGTEDKAELVTPTGAALITTL ADSFGPFPAMNLTRVGFGAGKTELPHPNLLRLALGEINSGQLEGEESSLV IETTIDDMNPEFFPALLEETMAAGAVDAFFTPVQMKKGRPGILFTALCPE NKLAAVAAAIFTHSSTLGLRFRRDQRLVCQRRMAEVVTPYGTVPVKLGLY RDPTGQVITNIAPEYESCRQIAKSAGAPLKEVYAAALAAARALKAF >Moth_0028 conserved hypothetical protein MGMGNMNKMMKQMQKMQAQVARLQEELGERTVEASAGGGVVKVTANGRQE LVNIKIDPAAVDPEDVEMLQDLILAAVNEALHQSQEMVTREMAKITGNIR LPGF >Moth_1096 Cobalamin (vitamin B12) biosynthesis CbiX protein MATGIILLGHGSRIPEANEHLKVLADQVREILGGVRVEPCYMMRTHPNLA EGIATLVKEGRRKIVVVPMFFSNGLHVQRDIPEQLAAARERYPDVEFIYG ANLGADRRIAEVIVERIQEVAPGGFSV >Moth_2250 Protein of unknown function DUF881 MMLKFKNWPLSLAVVFLVLGLLLSLQFRTQRLLASSLEAQKTTDLITMWK NLSTKRNQLQGEIAQLQQQLFTLETNSSQSSETETSMEKELARLQMNTGL AAVKGPGITVTITGDAPLLYYDLVDLVNELWASGAEAIAVNDHRISAYTT ISDQQDGPRNYITIDGQRLLYPIVIKAIGDPQTLDKGLTFTGGLIDNLNN LYKIYPIIKKEQDLQLPATSLPSWHYAKPAPPSAPAGDQNGK >Moth_2391 conserved hypothetical protein 
MAEWQDVTATVQAAAEELLNVAGLQPGQILVVGCSTSEITGRSIGTASSL EIGQAVVAGLLAATNRAQVYLAAQCCEHLNRALVIEAGAASLYNLPVVTV VPAPKAGGSLATAAYASLHRPVVVASLLAQAHAGLDIGSTLIGMHLRPVA VPVRLAIKTIGAAPVTAARTRPPLIGGQRAVYK >Moth_2217 hypothetical protein MPVNSSRKVYVIPCSGIGKMYGLLGREAVLKTVKELRPDKAATMCLALLV YGDNEARKEIAGARCITVDGCPKLCAAKNVEHAGGRVVEMIRAVDAFRNH RGVDAGTAAHLTAAGWQIADELAADLAGKVDRWYDAGEEQ >Moth_0140 NADP oxidoreductase, coenzyme F420-dependent MMRVGIIGAGAVGTGMGLLLSRRGYTIAGVSSRTMASAERAAARLNCPAF ADPETVARRSEIVFITTTDRAIGPVATAIAGRGGFCPGQTVIHMSGSLTS AVLDPARQAGALALALHPLQSCADADMAVANLPGSVFSLEGDREALPLGE RLVNDLEGEYFIISPEAKPLYHAAACVASNYLVSIVDLSYRLMQAAGMAP DMVARALAPLIEGTWGNIKEKGVPRALTGPITRGDVATIASHLQAMAARA PELEEIYRAVGRYTIGVAGRKGSLNARRAALLGQLLANSRGKSPVRVPGR SPNKKRS >Moth_1881 Protein of unknown function DUF503 MTIMVIGVGTATLRLAGARSLKDKRRVLKSILARLHNRFNVAAAEVDYQD SHQKAEVGIACVSTSGSHASQVLAAVMGFLEAEDAIELLAYHTELL >Moth_0220 Protein of unknown function DUF421 MKFVEVFLQTLLAFFAILIYTRILGKQQIGQLTFFEYINGITFGSIAAVL ATDTAPNQTWMHFLGLTLFAFFTWLAGYAVLVSRPARKLISGEPTVVVHN GKILEENMKKMRYNFDELAMQLRQKNVFDIADVEYAIMEPDGDLSVLLKS QKRPLTPSDLKLSTKYEGVPTELIEDGEILFQNLRQNHLDEKWLIQQLQA QGIQDISQVDYAVLRSNGTLYVNTKEDDIINPVDITDAPESPVKTEKEEQ DRP >Moth_2074 hypothetical protein MQPKSGNRYDITIKDLFADETQELINYFGHFEARVTGDLKIEFPQVETRV SDLVMKAESQQGPLAIHLEFQSRNDDEMPYRMLRYALEIHKTYHLPVYQI VIYFGQWQMNMTSQLEYRLGDQNLLDYRYHLIDVGNITYEELKNSPHQRL LSLLPVVDREKRQKGGKEFLRRCAEDIINSDLDLETKKTVLLRAEIFAGL VFDKKAIDLVFREVEQMLSIEESAGYQRIFEKGMEKGIEKGMEKGMEKGI EKGQQESLLDVTIRLLRKKFRKIPREYLARIKEQDVYVLQQIIDSIFDIN DLKELEDYLQ >Moth_1166 Protein of unknown function DUF820 MSLTLAELAAGRQRYTYEDYCRLPEGSPYQLIGGELVMTPSPTPYHQMVS MKLELQMAGFVLEKGLGIILYAPVDVYLDEEETYQPDIIFIASSRLDIIE EKRIKGAPDLVVEILSPGTGYYDLRSKYKVYEKSGVREYWIVDPQQKSVQ VFCLRDGKFVLDQEAEQQGTVRSRVIAGLEVQVESIF >Moth_1749 Protein of unknown function DUF710 MPQAEGQVAAQDRTTVNINGQDYVVKGEAPEYIQMLAAYVDKKMRQVNQK FPHYSPVKVAVLAALNIADELYKVQQDYDTLVKLIQEEKQG >Moth_1855 hypothetical protein MLDLAHGYHQNALICLPQDLHTLYVYWDFTPARIRILVDFFHHVRPEMEL 
TLRLCRQDCPLPEQQLTLESLEPGWRYFSNLDDRAAYHLELGAQSPEGEF VLFSRTPVFQIQPGRTVEPAGARQLPPGLNPDWTIPPEGQQSGSNFSWS >Moth_1639 Protein of unknown function DUF1292 MGYDNPILEKRGEVMADQENTIILTDDEGHEHEFIVVDVLNLEDDEYAIL LPAENADGNDEAVVLKIGLDEDGNEILYEIDDEEEWQRVARAWEDAVAEE DGEE >Moth_0829 hypothetical protein MDNTKIITVKPMGVKVMPTYDFRCQECGERFTVKMSWKDKDKATCPACGS KKLQQLFTGITILGGNSGGGGCAAPAGSSFS >Moth_0729 conserved hypothetical protein MSCRLSLPYGTHKLTFHIPEEKIKAVLSPVAMKEIPSTEVEIQRALENPI GCQSLGTMVSPTSRVLLLCDDNTRPTPANIIVPAILRELESGGVRKENIK ILMALGTHRPMTLEELQQKLGAEVLAQVEVINHDFRNPMALHDFGLTANG TPVKVNRVVLEADVVIGIGSIVPHHIPGYSGGAKIVQPGICGEDTTAATH LLSVRTRRNMLGIVENKVREEMEAIADRAGVKYIFNTVLDPQGRVVKAFF GDIRQAFRAGVEISRQVYGIPAPGRTPIVLASSHPCDIEFWQAHKTLYPC DMLVEEGGTIIIVTPCPEGVAVTHPEMLEFAGQKPEDIDAQIEEGRIKDK VAGALALAWAKVRQHAEVCLVSDGINAEVAAKLGFKHADTIEEALEMTWA RLGKEARVIVLTHGADTLPLLPEDNLE >Moth_0848 Protein of unknown function DUF1290 MWIPLIGLIVGVVAGLLLPVKIPVVYSKYMSVAVLAALDSVFGGLRASME DNFDNAIFLTGFFSNTLLAAFLAYIGDQLGVELYLAAVLVFGVRLFQNLA IIRRHLLKR >Moth_1371 conserved hypothetical protein MKVTFPHMGHLWLVLKAALTGIGLEVVVPPPCTRRTLELGVRHAPESACL PLKVNLGNYLEAKELGADTIVMAGGVGPCRIGYYSQVQREILRDLGCAYE MVVFEPPDVHFNEVWDKIKYLNRRPWQDGVHGVVMAWLKACAVDALEQEV QRLRPREAQTGLADAVFRRALQELDAAGSRKEVNRVVKEIKGELAALPLK PDVRVIRVGIVGEIYTVLEPLVNLDIEKRLGALGVEVVRGLFLSRWINDN LFKGLLPLPGHHPEKTAPPFLNHFVGGHGWESVGDTVTFARRGCDGVIQL APLTCMPEIVAHSVMPAVQQATGIPVMTIYLDEQTGEAGLQTRLEAFVDM LRRQKGVRAG >Moth_0861 Protein of unknown function YGGT MQTLAVLVRVAFEVLNWLIIARILISWFPHDPNHPIMRFIYEITEPVLAP FRRIMPRTTMPIDFSPIIAVLVLQLVEHLLINFIMRLG >Moth_0763 Protein of unknown function DUF820 MAAIEIARRRFTVDEYYQMARAGILGEDDRVELIEGEIIEMVPIGTQHAA CVRRLLHIFSTKIGDNALVDTQNPLRLGQNSEPQPDLMLLKPRDDYYATF HPRPEDVLLLVEVADTSLAYDREVKVSLYAKGRVNEVWLVNLQTQQVTAY RQPSPSGYREVKEYGHGDHISPLAFPGLNIPVQYILPGD >Moth_2168 conserved hypothetical protein MYLVTAAEMGQLDRLASSEYMIPSIVLMENAGLRVVESIERHFQGQVANR RILIFCGKGNNGGDGLVVARHLLNRGAEVKVFLLARPEDIRGDARTNLEI YQKMGGKLLLLLGESHLQRADIALLYADLVVDAIFGTGFKGAAMGLPAAV 
INMINKAHRETVAVDLPSGLEADTGRCFGPCIQATWTVTFALPKLGLVVE PGASLTGRLEVADIGIPQKLVATQHFNRRLLTAAWCRSQLPRREASGHKG LYGRVLAVGGSPGLTGAITLAATAALKAGAGLVTAAVPRGVQGILAMKTT EIMTMSLPETPAGALSRDALDPLLERLAEVDVLAIGPGLSRDPATVDLVK ELLPRVQVPAVVDADALNALATDTRVLTGDHGPLVLTPHPGEMARLLGTT AAKIQEDRLEIAAKYAREWQAVLLLKGARTVIAWPDGQVYINPTGNPGMA TAGSGDVLTGIIAGLAGQGLKPGVAAALGAYLHGAAGDEAARQRGQRAMM AGDLLDFLPYVLRNLEEEVETIVAAGLGRD >Moth_0260 Protein of unknown function DUF199 MLMPLSFSLQTKEELARVKARHPCCRQAELVAFLRLGNLDGGQPGEETVL FTTPYPALARKVYSLAREFLACPVKVRNSRRQGKGRPVFRVVARARLKEI QDWLAGRAGVPEYPCCQAAYMRGAFLVTGSVNKPSGTHHLELIFPDAAMA GQMQGLMQQQELEPRLSRRQRGYVLYLKDSEQIIRALSLMGAYSAVLAYE NVLIFKDMRNRVNRLVNCETANLTKTVETGLRQAENIRYLIATVGWDYLP PALREIAAVRLQHPEASLKELGEMLHPPVGKSGVNHRLRRLELIARQVRG QGREGYAPDDDLSRPRA >Moth_0136 Protein of unknown function DUF763 MRTGTASLPLHGGHCPPWLFERMQRLGPAILEVIVQEYGPQEVLRRLSDP HWFQAFGCVLGFDWHSSGLTTTLCGALKEGLRGREKDLGLVIAGGKGRTS RQTPHEIETAVDRLALTSLEPEDLVYASRMAAKVDNTALQDGYQLYHHVF IFTFDGQWAVVQQGMNETSRLARRYHWLGEGMQDFACEPHAAVCCDARET ALNMVARESEASRQVVTELVRQQPAKVVAEFSRILEKDLPNLALPWRHDV PRAGYLNKALLKVYDVQPRDFAGVLGIEGVGPKTIRALAMVAEVAYGAPA SFRDPVRYSFSHGGKDGHPYPVDRQVYDRTINVLEQALAAAKIGRTDKIQ ALKRLSRLANGS >Moth_1523 Protein of unknown function DUF322 MEVGRVDQILEQEAKTDLGTIKIANEVVAIIAGLAATEIEGVAGMSGGIA GGITELLGRKNLAKGVKVEVGEKEAAVDLYIVVNFGVRIPDVAIKVQENV KKAIEGMTGLQVVEVNVHVQGVVFPQETRDEETSRVR >Moth_1393 conserved hypothetical protein MKEEIKQLILYRMERAKEAIAEAELLFSEGHIRTSVNRLYYACFYAVSAI LLAKGYSSAKHSGIRSLFHQKIVKAGLVNPSAGTLYNRLFDARQKADYAD LVKFEADIVAPWFDEVKSLVHQIETLVVKEIRSPG >Moth_1705 Allergen V5/Tpx-1 related MKHKWLTAALKILLAAVVAAAPLSAARAATPASNSTRGIYSYNYSWYTAP YWSWHYRWHVSPNAGSGQVTRQSPTPAPAPAPQPASKPAPAPVTSPGNPA VPAPQPAPQPAPAGNYQLSAYEQQVVNLVNAERAKVGLKPLAADPQLARV ARLKAEDMRDKNYFSHESPTYGSFANMLKQFGISYRIAGENIAAGYPTPE AVVAAWMNSPGHRSNILNANFTAIGVGYASGGSYGHYWVQEFIGQ >Moth_1414 Protein of unknown function DUF1063 MNLLDQQTLINLLHQEADVAIGCTEPVMVALAAAKTRDMLGTLPRLVDIS VSSAVWKNARRVGLPGTGEKGLAMAAAMGLLAPVEAGQRLLAALTPVQVE 
QAKILVREGVVKVGVVAAKEGLYARAVARSNQHEAIVELNGSHKNFSALW LDGRMAGGAGENLNLKLEALLAQDYQSLLKQVLSLSPEELYFLYQGAEDI LTFAREIHQGGRNPLSAMASFFRRTESGGESLEVLIRNLTGIAVAERMAG ATYPVLTCAGSGNQGILAAVSLLLAGQELRAGPESVTRALAIAHFTNMYL KAYTGKLSPLCGAVTGGAGVAAAICWLLEGSCQQIINAMQIVLGNLCCVI CDGAKESCALKISTAAVEAVRAGYMACQGINLEAGTGIVGKKLEDTMELV RKVYQGGLGEIDYYLGKVDYLLSTN >Moth_0912 serine/threonine protein kinase MIGKVLEGRYEIVSELGGGGMARVYRGQDRLLNRNVTIKILREQYASDKE FLARFQREAQAVASLSHPNVVSIYDVGQEDDLHYLIMEYVEGRSLKDLIS ERAPLPPLEAIDISLQICDALEHAHENGVIHRDIKPHNILITRNGRVKVT DFGIAQAVSEVTMSQSGTMIGSVHYLAPEQARGGVIGATADIYSLGIVLY EMLTGDLPFHGETPVAVALKHLQENPRPVRELNPNVPPALERVVMRTLEK DPARRYPSAAALRSDLLAVRNALADATFATQVLPAIETPDPPSTLPKPRR RPRVWAWVLMALLFLGLAAAGLWAGFRYYLAVGETLVPSVVGLPEGQALE QLAAAGLRGQVIARQYDASVPAGQVMAQDPGPNQRVRRGRVVALTVSQGA RLVRVPSVIGETERNARLILENANLKVAADTLKVYHPSIPAGSVVDQNPP ANTQQPEGTEVRLIISKGPEPQFTTAPSVVGLSLAEAQQKLLEAKLKQGT LTYQRSDNQFPGYIIAQDPREGSNVLQGSAINLVVSQGPGPVQKQVGVTI DPAPDDKDHEVRIVVTDAKGTNEVLKKKQKMGQQIQATINYFGKGKLQVF RDGNVIYEQDLQ >Moth_1643 Protein of unknown function DUF965 MWPPRCVRKPQGGKKMPGDMQETMMFKVEKEEKVRVRDVLTEVQAALTEK GYDPINQLIGYLLSGDPAYITSHKGARNLIRRVERDEILAELLKSYLA >Moth_1503 required for dissolution of the septal cell wall; RBL05740 MERALTRHLKDNLGLYLLVGFFFLAGIITGTIAVNFLEPQQVSQLGAYLD KVLSQFKGEGPGFNQAAYQALLGALRETGLIWFLGLTVIGIPLIIGLIFL KGLILGFTVGFLVQQKALQGMAFSFLALLPPNIIQIPALFIAAILGISFS IGLMRGRGQAEAAILPRFLTYSFLMLLVTLVLVGGGLVEGYLSPFFARIV LAYF >Moth_0235 Protein of unknown function DUF161 MHRKAWADYLGITAGTLVTALGLVLFLVPNRIAAGGVSGLATVLHYVFGW PVGLTMLVLNIPLFLAGLKVLGLEFGLKTLYGTIILSVFTDTLALWLHAP TSNTLLASLYGGLLSGVGMGIVFRSGGSTGGTDLAALLFRHYLHISAGMG LLMVDALVISLADLVFNVELALYALVALFLTSRAIDAIQEGGGYARAALI ISDKAEEIARRVLVELDRGVTGLAGRGYYTRQEREVLLVVVQRAEVSRLK DLVASIDPEAFVIVSNVHEVLGEGFGYFNRL >Moth_2316 Protein of unknown function DUF86 MSLDEFLHNHQIYSTVERDLELAITCIMDIGNHIISAMDLPEPETYADIP ISQELARKLSGAVRFRNILAHEYMDIDRRLVYKHLQTGLGDLVEFIYGIG KFLGI >Moth_1760 conserved hypothetical protein 
MYIYGRWIVIPLIGAFIGWVTNVLAIRLLFRPHRPFQFLFWTFQGLIPKR RAEIAANVARVVDKDLLPLGEVLEHLRTPALEEKITELVVEVARRRLTER LPSFIPAGLKEAASKTIEETLRREIPPALEELEGELTSSISSFSLGDLVA EKINKLDLHQVENLIVEVAGRELRYIELLGGVLGFFIGLVQAFVAGR >Moth_2119 Protein of unknown function DUF606 MQLNITRLKVKVMLVFFFLALGIGAIWALQPVINAGLARSTGPLMASTIS FLIGSLLLIISVVITQWLTKDPLDFAALSRVNPVYYMGGAIGAAVVLGMT TIIPKLGAGGVLSAAITGQLIMAAIIDHFGLLGAPRIALNSLRLIGIILL LIGVNLVIYK >Moth_1895 Alkylhydroperoxidase AhpD core MPLPPFIEALAERDPEFYRAVKAVAETAMAPGALDAKTKTLITLALDAAH GASEGVAVLAKQARELGASEEEIREALRLAYFVAGNGVLAAGGAAYR >Moth_2514 conserved hypothetical protein MTTYWQQKGKSNTEATVKLALKRARELGLKDVVIPSVSGYTANLCLGIND LNIVCVTHQAGFKKPGEIEMDTAIRQRLEDGGIKVLTTTHLLAGLDRALR FSFQGIYPAEIIANTLRLFGQGTKVAVEVACMALDAGLIPAGVDIVSLGG SSEGVDTALVLRPAHSQDFFATKIKEIICKPREF >Moth_1494 Protein of unknown function UPF0047 MLFTFDLQTQAKEAMIDITHLAAKTVKEAGIKEGFCLVYVPHTTAGVTIN ENADPDVVTDILAALARIVSAGGYRHGEGNSPAHIKASLMGSNQTVVIHE GRLVLGTWQGIYFCEFDGPRRRKVHVKVWEG >Moth_0299 Protein of unknown function UPF0150 MLPQKDLETLSKLPPERLRMVLNFARANLINQKITRRYNVKLDWNEPTDE DGVAGYTVTVPSLPPVVTEGDTREEALENAREAIACYLEYLIITGQPVPE SDTEGENMVEVII >Moth_0899 conserved hypothetical protein MRIKKRLFIGLLALSLMFITAVLAGSWYLLINHSSLFNRVLLALGFFTLA VLFLLIALGIISLVLMLWQGRSRPLFQHLGLMAVNILFPVALALGKRLGV EAATIKASFIEMNNQLVRLQRLQVAPREILILAPHCLQWSGCPHKITIDV NNCRRCGRCPIDALHALAARYGVRLAVATGGTLARHFVKQYRPRAVVAIA CERDLTSGIQDTQPLPVLGVLNLRPHGPCLNTQVNLNQVEQAVQFFLTGR TVPQVQACSEGWVEVTHGS >Moth_2019 Cupin region MQDKEQAGSRPLDARAAILSELVDYQEGSIVSRTIIDKKAGTVTLFAFAA GQGLSEHTAPYDALVHVLDGEVEITIAGKPLHVKTGEAVIMPANQPHALR ALTNFKMILTMIRS >Moth_0606 Protein of unknown function DUF299 MGNRAGAHFCLASGLRLPSKGGVTIGVIYVISDSLGETAEYVARAAASQF DGGGLDIRRVPYVTDLDHLEEVVNEAAQEQGIIAFTLVLPDLKKKLLELA AARGLEAVDLLGPLMDAITRVTGGRPRLEPGLIRRTDEDYFRKMEAIDFA VKYDNGKDIRGLSHADLVLIGVSRTSKTPVCMYLAHKRLRAANLALVPEV PLPGELLNLPPEKIIGLTIDAGLLYQIRQERLKTLGLPGPAGYATRERIE EELAYARRVMDQLGCPVIDVTNKAVEETAGKILQIYYRRERNGK >Moth_1692 metal dependent phosphohydrolase 
MSLVTLEEVKKDPEVEALITRGNEHLGAMGFTEHSHRHLNLVASISRNVL ERLGYDKRTAELAAVAGYLHDIGNVVSRQDHGQSGALLAYNILRRLGMPA DEAATIMGAIGNHEEEYGQAVNPVGAALILADKSDVHRSRVRNNDISTFD IHDRVNYAVQHSFLRVDAGKRAITLELTIDLAISTPMEYFEIFLTRMMMC RRAARFLHCHFGLVVNDARLL >Moth_1978 conserved hypothetical protein MPERPLTDYRCQAEISTREQRVLEEVLKALRQVRYGSITIIIQDGRIVQI DYTEKVRLGKE >Moth_0726 Type III effector Hrp-dependent outers MEQISIIADDLTGANDTGVQFCQHGFRTMVIIDAANVERVGQDKDVWAIN TDTRHLAAPEAYQRVYEITLKLKKAAISRVYKKIDSTLRGHPGAELEAVM DAWQADLALVVPAYPANRRLVVDGHLLISEGMETAAASVSLTPGDARAAL CHIPTVLQGEMGRRVGQINLATVRQGVKELVAALEAARTNSQVLVLDAAD EEDLRNIARAISRFQRDVIVAGAAGMAAHLPLAWNLKPVPNNPLNKKGAI LLVAGSRNPVTAAQVQRLAEVSACQAVKVETEAILTGEPAVEIERVLQEV TTQDAGAGLIIIAVDSLFQTIDRDRVSNSGSKAIALALGTITSRLLNMRR ISALVVTGGDTAVHVCRALEARGINLAADLLPGIPLGYLEGGRGDGLPIV TKAGGFGSPDSLIKVNEFLQQRMKSEMELV >Moth_1752 RepA / Rep+ protein KID MPEESLREIAATNEPPHTLASPEFFFLLQRIDRLDEKLTREIRDGDQKSE ELITAVEQKLTQRIDAVEQKLTQRIDVVEQKLTRHIEEVEQKLTQRIDAV EQKLTEHIEEVEQKLTQRIDAVEKTLTQGIDAVDQKLTRRMDALEEKLGL RIDKQDDKLNSLKFWAIGAVITISVGFIGTIATLLYK >Moth_1940 hypothetical protein MPQIIRTDSFLEQFQELSKEAQKHVLKTILFLAQNPSHPSLKVHRIKGTP FWEAYASISIRVIFERNGDTLVLLACGYHDILKKY >Moth_0616 Protein of unknown function DUF523 MGKILVSACLAGVRCKYSGGHNLVPVIAELVRQGKAVPVCPESLGGLTIP RPPAEIKGGDGYDVLAGRARVMDKEGRDVTAAFIQGARAALARAREVGPE IIVLKERSPSCGSKLIYDGNFSGATRPGPGVTTALLREYGYKVISEEEFK GEGNPSP >Moth_1411 Dinitrogenase iron-molybdenum cofactor biosynthesis MVKIAVATEGQMVAEHFGHCSQYSLFDIEGGKVIGREVITNPGHQPGFLP GFLADLGVNCVIVGGIGARAVELFGVRGIEVITGARGPVEEAVSAYLGGT LKSTGSTCSHDHEGHEGCEH >Moth_1649 conserved hypothetical protein MPKGRELVGLPVISQDRGEELGRIQDLFYDETSGSLRACLLADGGWLRQP RVVDFTALQARGPGAFTVSGAGAVSHEPPPGTRRWQELKGLRLLNRDGRE LGIVEDLVVELPSGQVKALEISTGLVNDLLEGRKEITLEGQVNWGTDTVI IG >Moth_0191 conserved hypothetical protein MAKPELSSPAVKASRWTARRLATLAMLIALSTVGANLKIPSITGTPAFDS FPGFLGALILGPADGALIAALGHLLTAFTAGFPLTPPLHLVIAAGMAAVV ALFAIFYRFSPWLGIAAGIALNGLLLPALFIPLPGFGKAFFLAMVVPLLI ASALNIVLAATAFTSLRRVFPASYAAGRGKGEGK >Moth_1150 
conserved hypothetical protein MKKVSFAHMGYSYLGFKQLVEDMGFEAIVPANPSPATLDLGVQYAPEFAC IPFKTVLGTYLEVLNRGAEMIITSGGVGPCRAGLYGLLHEKILRNLGYNF ELFIFDPPLTGLGPFFWKLRRVLKEARLSWLAFIDVVRRAWAKLKLLDEL EQMATVTRPYEIKRGATTRAFNQCLEIIDRARSSKEIAAAREECRQLLQS VPRDEERRPLRIGIVGEIYVLLEPFMNLDIEKTLGEMGVITKRSIYLTNY TTTDVLAHGTEDIRQIAHPYLNQFVGGHGQSSVGETILYARNGFDGVIQL APFTCIPEIVAKSILPRVSRDFNIPVLSLTIDEQTGRAGVETRLEAFVDL LRQRREQMEARSNAALLPGY >Moth_2318 conserved hypothetical protein MIPEKYLLLEARIRKEVANLERLERELARYNLFPRIQADSLGGFSLTDEA SLRIIGSILHDYYTAIEKIFRIIARDIDCSVPTGEQQHKELLDQMTLEVP GLRPALLDNETARKLDELRAFRHVFRNIYGFSLDADKIRQLLEGLPELAS DCKKDLHLFTLRMRRILGLNSSSEV >Moth_1139 Protein of unknown function UPF0182 MKLNRLWFCLLIIIPGFLVAAYLGSHFLTDWYWFAEVGYRQVFLTRLLSE VGIRLGTIAFFFLFFYLNLLFTRKSLHLSPPEGRENWTLKEYLIDRFITS RRLGILYLLLSLAGALIFSPLAAGKWLVVQEYLRATPFGLADPLFGRDVS FYIFKLPLYHFLYKLLITAVVGAVLVTGFFYFIFNPRELLGLRRGHFSRP LVHFSTLVALLFLIQAWGFRLQALDLVRSSRGVAFGASYTDIHALLPGYN ILGWVAVACGLIIVLNAFRRNLKLVSAGILSFMAAYFLLVIAVPLAVQKF QVEPNEFAREEPYLRYNINFTRRAYGLDRITIQEFPALDNLTPASLREEG ATLDNIRLWDYRPLEQTYSQLQEIRSYYSFKDIDVDRYTLDGRERQVMLA ARELDQNKLPDRARTWINEKMRYTHGYGLAMNPANTVTAGGQPEFIAGDL PFHSSAGLQVNEPRIYYGELTGDYVITGGTAAEFDYPVTGEDNFVETRYQ GRGGVPINTPWRRLVFAFRFHDYRLLMSNGLTPQSKILYYRNIQERVRKI MPYLRYDADPYLVVAGGRLYWFLDAYTITNMYPYSEPNSGGFNYIRNSVK VVIDAYNGSVDYYLVDPGDPLAQTLARIFPGLFKPREDMPAGLQQHLRYP PDLLSIQAQMLTNYHMENTMLFYNKEDAWSIAEEMVGDKRQAMDPYYTLM RLPGETQAEYILMLPFTPARKVNMIAWLAARNDGPHYGQLLLYQFPKNRS IYGPMQVEARIDQEPRISQQLTLWDQHGSQVIRGNLLVIPIKGSLLYVEP IFLQAQESKLPELRQVVVAYEEKIAMADTLAGALQVIFGTQTPAPAASPQ PPSQAATGSPGNLSELIKEANRLYSEAQDRLKQGDWAGYGENLKKLEQVL QEMGQKVAE >Moth_1522 Protein of unknown function DUF322 MSVLDRALLALFALITAILAGIFLAVIAGWSVPLDLFELSLLNNDYRLLA GVVALIFFLLAMRFLLGSLRLAQAGEGRAVIKAADLGLVSISLPALEHLV TRAARQVKGVREIRPRLNYGPEGLAIRLNITVNPDRNLPEMAAELQEKVR EYVIATAGLEVPEVQVRINGIFQEGQRRVE >Moth_0512 SpoVR MEQEFKTIAQAIETIHDQARKFGLDFFPVYFELCPADVLYAFGAYGMPTR FAHWTFGKHFYKMKLQYDFNLSRIYELVINSNPCYAFLLEGNDLIQNKLV 
IAHVFAHSDFFKNNIYFTATSRQMVETMAVHAAKIREYEFKYGHREVEIF LDAVLAIQEHIEPPGPFGYKEEENEENEDTRPHRRETPYDDLWILDGRPK EPPPERNRKIPPRPTKDMVGFIMANSPELEDWQREVMAMIREEMQYFWPQ METKIINEGWAAYWHARIIRELDLTPAETVDFARLHASVLQPGYRQINPY LVGSKIFEDIEKRWENPSQEERERYGRTGGEGRSKIFEVRSCENDISFLR NYLTRELVEELDLYLYQKVGSEWVVVEKDWEKVRDGLVSRLINCGYPYIV VEDADYQRRGELYLKHRYEGLELDVSYLEKTLPHVYLLWGRPVHLETIID GKTTVFSYDGKKNCRR