TitleGenColors Logo

Gene list

Applied filters:

COG category: Function unknown
Organism: Moorella thermoacetica ATCC 39073, ATCC 39073
Gene type: CDS

Number of genes found: 203

Free access
Sort by:

 



# Moorella thermoacetica ATCC 39073, ATCC 39073

>Moth_0215 Polysaccharide pyruvyl transferase
MARVVISGYYGFQNAGDEAVLYSIVKALRSLEPDIEITVLSRRPEQTAAC
LKVRAVDRWHPVRVAGAIRRADLVISGGGSLFQDVTGPKSLLYYLGIVLL
ARLLRKPVIVYAQGLGPLKRHWSRWLTGRVLNRVQLISLRDSESRRLLEE
LGVTRPPVYVTADPVLGLEPENMDLRPGQDKWEQLELSGPVIGISVRSWP
GYEECWPSLARVADELVAGGWQVLFLPFHFPADVDACRQVARLMHSPAVV
LRENLDLPALMGLMGRLQFLIGMRLHALILASLMGVPFLALPYDPKVTAL
ARMMEQPVAGFLASVSYTGLEAAVKQALAEREENARRVQAAVAELRPLAL
DTARLVIEYLRKGARG
>Moth_2061 Protein of unknown function DUF820
MGVSLAELAVGRERYTYEDYCRLPEGSPYQLIGGELVMTPSPTPYHQMVS
MKLELKMAGFVLDKGLGIVLHAPLDVYLDDTETYQPDIIFISNEKLPIID
EKRINGAPDLVAEILSPSTGYYDLRSKYKVYEKKGVREYWIVDPQHKSVQ
VFCRQEGKFVLDQEAEQQGTVKSRVIVGFEVQVESIF
>Moth_1866 Protein of unknown function DUF169
MADLDKSLMVKRLTEVLHLDTGIVGIKLYRDRKDLPRRPYNWKVNICQLV
SAARYQGKASSGTPDLMVCAIGAACTGLIKTPERFTSGKAALGRYVADVE
AGRRFMANTYKLGDNGKVYDAIYIAPLESYKTEDPDAVVIYANPAQMMRL
IHCCLHETGEPVKADTVAEAALCSSIGFAVKEQKPIIGFPCGGDRTFGGT
QKDELVFVTPYNMLGTLVENMESLLMSTGQLYPVAPFNNFTPVMVQSYTM
QPEDLEEK
>Moth_1377 Protein of unknown function DUF711
MPSLFSFTPEEILETIRMIQVENLDIRTITMGISLRDCATASLEETCRRV
YDKITRLASDLVAVGEEVAATYGIPIVNKRIAVTPIAQVGEPSGEDDLTP
LARALDRAAEAVGVNFIGGFSALVHKGFTHGDRALFNSLPEALATTERVC
ASVNVATTKAGMNMDAITWLGHLIKETARLTASRGGLGCAKLVTFCNAPE
DNPFMAGAFHGPGEPECVINVGISGPGAVLAAIRQYPEADLGQLATIIKN
TAFKVTRMGELVGREVSKRLGVPFGIVDLSLAPTNAQGDSVAEILEAIGL
ERCGAHGTTAALALLNDAVKKGGAMASSYVGGLSGAFIPVSEDNGMIRAV
ESGALSLEKLEAMTAVCSVGLDMFAVPGDTPAEVIAAIIADEAAIGMINN
KTTAVRVIPAPGKKPGETVEFGGLLGRAPVMEVNTYSPAALVRRGGRIPA
PLQALGN
>Moth_2247 YbbR-like
MLEHFRQNWGYRLMAVILAIILWMYVTGEQNPTGETVVRVPLETENLSSG
LVVADRPAEVQVRVEGRKAAVANLLPRDVHAYADLRDAKVGDNVLPVRVD
VPEGINVIHVNPAQVTIRVEKIEDIQLPVQVSLLGSPASGYRALEPVLKP
SQVIISGPAAALKEIGRVYVEAKIDQASGNFLAQLPVKIADREGRPMQTW
LTVNPDTVETFIPVVQDMPSKMLPVRPRLTGEPAKGYAIQRVILQPEVVE
AFAPYSQLAALDYLNTAPINIAGAKKNVTVETNLEIPSGVQLSSFPRVRV
VVEIGPAVAGAAGSGP
>Moth_2133 MOSC
MGRIVAVCTSANKGERKKNIGRGMLIANYGLKGDAHAGPWHRQVSLLAME
SIAKMQAKGLKVGPGDFAENLTTEGIDLVSLPVGTRLKIGPSVLAEVTQI
GKQCHSRCAIYQQAGDCVMPREGIFVAILTGGPVQVGDHIEVVAS
>Moth_1739 conserved hypothetical protein
MVAPIEPLYNKHYRCLFCDREFTNKKLRLSRIRQVKRDSDLCAYFEGENP
YFYEVAVCPHCGYAFTTGFGTVKKERREVITREYINKITHKDYTGPRTLG
DALKVYKLALLCGDLNQERKSVLAGICLHIAWFYRYSQEEEAEKKYLRNA
YDLYQEAYQKENGTGREGNPNLILYLIGELEGRLGNYAEACRWLGRLLNV
RNLEPYLHDLLLERWEVYRERLKETAPAGAP
>Moth_0057 Protein of unknown function DUF1021
MNGKEVLATIREDLEARVGQKVKLRANRGRKKILERTGVLEKTYPNIFVI
RLEEQKSPERRISFSYTDVLTNTVELMVEGDYGDKKLGAKP
>Moth_1233 transporter
MITAKIARGAVIAALYAVVTIILKPISFGYLQVRVAEALTLLPILYPEAV
PGLFIGCLISNIYGGLGPIDIFLGSLTTLAAAWLTYVWRRSWIAYLPPIL
LNGVIVGAYLSYLLHVHILLAMGSVATGEAIAVLALGIPLLKQIKKINVG
Q
>Moth_2497 Protein of unknown function DUF77
MAILEVVIAPLGTGSTSISPYVADVHKVLKETSGIKYQLTPMGTIIEGEP
DVLFPLLQRLHEVPFARGARRSMTIIRIDDRRDKELTMEGKLKSVEEKLA
LASS
>Moth_1576 HEPN
MNSREREQEALKWQDRARRDLRVAKMLFYDKEPEFDLACYLSQQCAEKSL
KALLIRLGIRFAYKHDLDYLVGLLPPEDQEKFYNTRLEWLSGWVTEGRYP
GDAAGATREDAQRAIEIAEAIYNNVSKALRSE
>Moth_1854 Glycoside hydrolase, family 57
MVTRSGDNLGYLSLVLHAHLPFVHNREPYVSLEEKWLFEALTESYLPLIL
SWEELAGEGLDFHLTLSLSPPLISMLMEPALGERYGRYLDNLRELAGREI
ERTRGDPTFAPLAEFYHRRLTLVDRAFKETYRGNLLAPIKRLREQGRLEL
ITTAATHGYLPLMLTDEARRAQVRAALDLFGMTMGFVPDGLWLPECGYTP
GIEKILRSEGIKYFIVASHGMLNATPVVKSAVYAPVRVGGVAVFGRDWET
SHQVWSRTEGYPGDPVYREFYRDIGYDLDFNYLAPYLVGGIRGDTGFKYY
RITGKTGVKEPYDYRAARERAREHARDFIANREKQLAYWAGRTQDKPVVV
APYDAELFGHWWFEGPDWLADVLRLAGESRVSLTSLSAYLEQYPPRQEVT
MGPSSWGEGGYNHVWLNQANDWLYLHLHRAERAMIKLAAANPRPGSLQER
ALNQAARELLLAQSSDWSFILTTGTTVDYARRRLREHLGAFFKLCQDYER
DRLDEDFLARLEAADNIFPGLDFRLYRPAGRGVACRPEVHNKTRPGILML
SWEFPPRHVGGLGIHVRDWARPWPARGWMSTS
>Moth_1217 Protein of unknown function DUF205
MLSWTVILTGIFMAYAVGSLAGGHFLSKILYNADVRQVGSGNAGTMNVLR
NLGIAAGIMTFIWDTAKGFLVVTLGLKGGGAELGVLMALAAVAGHNWPLY
WRFQGGKGLATSLGVALAVYPAAVPPGAALMGLLTFLTRNTDLATLLTFS
ALPIYFWWREGPGCYLAFGLGLAAIMLLRHGPLVISLFYNLKERR
>Moth_2522 Protein of unknown function DUF37
MIDNWVILAIRFYQRYISPLLGRHCRFYPTCSQYALEAITKYGLLRGGLL
ATRRLLHCHPWDAGGYDPVP
>Moth_0719 DedA
MRGGTGVSSLLAPLFEFITSAIASFGYPGIAVAMALESACIPLPSEVILP
FGGYLVSTGSLGFWGTVLAGTIGGTIGSIVAYFVGLRGGRPFLRKYGHYV
FFSEKEFAVAEKWFNRYGEATVFFTRLMPVIRTFISLPAGIAAMPFGRFV
VYTFLGSLPWSILLVFVGRQLGANWEALSPIFHRFDLIIVAGLILLVFFY
WRRHRSRR
>Moth_2306 Protein of unknown function DUF162
MKSQELITSFTSQAEAMGARVIQAARPGEIGAKLVEVLRPLGSKIALVDS
PLVKTAGVEAALAGAGFNVEKDGPEFARQADTGIVEFEYGIAETGTLAMD
ATDLKTRLAAMLPLTCVALLAAERVRANLTEVIDAYLERGPWPGYFTLVT
GPSRTADIERSLTVGVHGPERLFIILVGENGGASRGR
>Moth_0483 Protein of unknown function UPF0150
MASHCFLVVIEQDEDGKYIASVPSLPGCHTQADNLAELEERVQEAIKLYL
NENSDFIPAPDKFIGIHQVEVQA
>Moth_0597 Protein of unknown function DUF502
MRRLRRFFLTGIIVTMPAAATIYALWLVFSFLDQLAGQAVGLFLGRRVPG
LGLALTLAVVLIAGFLATNFIGRFFLNLWDEVMYRIPLVNSIYRTVKQLV
EAIWRDDKKAFQHVVMVEYPRRGIYSLGFLTGPAPAEASMRAASDLVNVF
VPTTPNPTSGFLLLVPREEVIPLEMPVEDGLKLIISAGVVGPAGRTASGG
AGWGRLLQGLTANGTINNPGWRRSRNTAE
>Moth_0555 conserved hypothetical protein
MRLRIKYSKTGQMAFLGHLEMLRLWQRAMRRAGLPVAMSQGFNPHPRLAF
GPALALGLESLAEYLDVELAADREPDRVQVELQAQLPPGLEILLVRTIPD
QAPALTAVIDVAAYRVTWLKEVDPGLLQQRVESLLARQEVLVRRTGRDGR
PRVKDIRPGILKLVLDPSPDLVMLLQCSQAGSVRPEEVLKALALETPARM
VRTGLFARRGDDLLAPEDISG
>Moth_1682 hypothetical protein
MIDANIVYFSGVAVMQIFSLLAIFFALLVAIFAVQNAGPVEINFLAWQFS
NISLVLVILGSAAFGALVVFLLGAVRQVRQAREIRELKSQHKRLQETIAR
LELVAAGKGAGQQERKQEA
>Moth_0077 conserved hypothetical protein
MFYTSKKLMGMPVVSLADGHQLGRIKRLLIDHSKMAIAAFTVDRKGWFKE
QPVVPYSHVKSVGSHAVTVDEASAVVKLSSLPELEALAKHPLPLLGARVI
TEEGTVLGTVEDFRFDPQDGKIHYLDIKSGLLHGARSLETDQIITCGRDA
LIARAGAEEALQKSGGLLSVKLQDACKSAGKTFDSAGTITRKVGETVNRY
WQRLPFSQKKNDGPPPGENS
>Moth_0526 Protein of unknown function DUF190
MTGTRAKRLTVYLGEGMKWQGMALYHALVLELKAAGLAGVTVVRGIEGYG
KRKQLYASRLLELSADLPVLVEAVDSPEKINAVLPRVREMVQQGLITLAD
VEIISSAGTASNKENSPGPGRRA
>Moth_2295 Extradiol ring-cleavage dioxygenase, class III enzyme, subunit B
MGRLLDVGFMPHPPIMVPEVGRGEVARIKATVAAARELAARVAAHQPEVI
IIISPHGPVFRDAVGIWATPELAGDLAAFRAGEVRFKYSLDLDLSRAIAA
KAREEGIAVAWLDARASSSYGLTPELDHGMMVPLYFLRQAGLEVPLVAMG
MAFMEREKLYAFGAALARAVKDSPRRALLVASGDMSHRLLPGAPAGYDPR
GKVFDARIRNLLAALDVEGILAIPEDLAEGAGECGLRSFIMGLGALDGYR
VKGEVLSYEGPFGVGYLVAHLEPGEEAPERSLLARETAAAREESLPVRLA
RQSLEHYLRTGKVLPVPAPLPPELAGRAGVFVSLKKNGQLRGCIGTISPT
RENLAGEIIYNALAAGLEDPRFPPVTVDELPELQYSVDVLSEPEPATVAD
LDPKVYGVIVSCGHRRGLLLPDLEGVDTVAEQVAIARQKGGIGPDEPYRL
ERFKVTRYH
>Moth_0733 Protein of unknown function UPF0029
MKKEGARRGLAAKSGTSIENGSSYLTVAREAIAELKIERSLFIGHACEVD
SDAAARDFIARIQAEHRQATHNCFAYRLGIGKKEITYYSDAGEPGGTAGR
PILGAITGLGLTNVAVVVTRYFGGKKLGVRGLIEAYGQAARRVLEEAGSI
RRVVTRELELTCSYAELDRLLYQVRSRGGKVIETEYGSEVRLKVAVPLPA
WEEMKESHRPSP
>Moth_0437 pyridine nucleotide-disulphide oxidoreductase family protein
MAEDKMAIICFSGDLDKALATFNLATGAAASGMEVTIFFTFWGINLLKKP
RGRRGASSLLGRMFDWMMPSGPDRLPLSKFNMAGLGPFFMKKQMRSKKVQ
AVSEFLALAREMGVKFVACIMSMQVMEIPEEDLIDGVEFGGVAAFLQEAA
NAKISLFI
>Moth_0169 Protein of unknown function DUF433
MNPLERITIDPSVCHGKACIKGTRIPVSVILDNLAEGISQEEILKSYPSL
SLEDIKAAIAYGAMLAKERHIAL
>Moth_1214 GatB/Yqey
MAENMKEKLTRDMKEALKAHDKIRLQTIRMVLASIKNVEIDKQHPLSEEE
IAGVIQKEIKMRQDSLEQFSRGGREDLVEQTRTELKVLEDYLPRQLTEAE
LKEIIQQTIQETGATSKREMGKVMAALMPKIRGRADGRKANELVKEILAS
>Moth_1704 Protein of unknown function DUF28
MSGHSKWANIKNRKAKVDEKRGRLFTKIGREIIIAARMGGGDPEGNMRLK
AAIAKAKAANMPNENIQRAIMRGTGELEGAAYEEMTYEGYGPGGVAMLLN
IATDNRNRTASEIRYIFSRGGGNLGESGCVAWMFNPKGVITVEVPAGDKR
EEVILQAIEAGAEDVDDEDDEVLEIKTAPGDLEAVREALEASGVTITHAE
VEMVPQTTVTIDDPETAGKVMRLIERLEDHDDVQAVYTNADIPAAIMDQL
DI
>Moth_1489 Ku
MRPLWKGAISFGLVNVPVKLYPATESNDLKFNYLHTRCKTPIQYRKYCPY
CQVEVPPEEIARGYEYEKGKYVILREEDLEAIPAEKTRSINIMDFVDLEE
IDPIYFSRSYYLAPADMGQKPYLLLKKAMEETGKVAVARVTIRSRESLAT
VRVYGPALVMSTMFYPREVRPVTGMPELDFQVNLRENEVKMAVTLIKSMA
TSFQPEKYTDTYRQALLQVIEAKIAGEEVEVPARPEAGKVVDLMEALKAS
IELARQEKEKVAADVEDRKPRRRRKTS
>Moth_1003 flagellar biosynthesis
MKERERIRKAVALRYNSEEDKAPRVVASGRGAIADKIITAAREAGVPIHR
DTHLAEMLAGLDVGSEIPVELYQMVAEILVFIYSLDQSRGKSGK
>Moth_0118 (2R)-phospho-3-sulfolactate synthase, ComA
MQYKDQSAWRDVLEFPIGGRQQKPRRTGKTMVIDKGLGLTEFKDLLEVAA
PYIDFIKLGFGTSVFYPADILREKIRLARSHAVDIFPGGTFFEVAVLQGR
LSLYLQTARELGYTFIEISDGTIDLSRTLRYAALRQARAAGFGVITEVGK
KDPRDALSDTHILSQIAMDLEAGADYVIVEGRESGQGVVIYDSRGVVKED
TLAYLIEGIGDLDRIIWEAPQKQQQQVLIINLGANVNLGNVQPGDVLALE
ALRVGLRGDTLRTTLVREGVS
>Moth_0626 Protein of unknown function DUF34
MAAKCGEIIAIMEALAPPELAAGWDNVGLMLGSPEAEVRRVLVCLDVTPS
VAAEAAARAVNLIISHHPLFFRPVKNLRFDEPVGELVRRLLQDNIMVYSA
HTNMDSADLGVSYHLASRLELEDIRVLVPTHREKYYKLVTFVPEDHEKVV
REALTRAGAGWIGNYSDCTFRVAGTGTFMPLAGTRPYTGEEGKLAEVKEY
RLETIIPTGRLPEVLRALLKAHPYEEVAYDVYPLANEGPAQGIGRTGVLP
QAVTLEEFALRVKESLGAGRVNLVGDRERKVKRVAVCGGAGSDVMAAARD
AGAEVLVTGDLKYHEARTAQAMGLAVVDAGHFATERLIVPALVTYLQEQL
QEREVMVLASQQEQEPWYAL
>Moth_0877 pyruvate formate-lyase activating enzyme
MLNRPLLAIDIGGGTQDILLYRPDQPLENCVQLILPSPTVICGRQVEAAT
AARQDVFLRGHLMGGGALVGALRRHLAAGCRVYATPEAARTVYDDLERVR
QLGIVITDQPPEDAVTIKTGDVDLATLAGSLAPYGVKLPAEVAIAVLDHG
EAPQGMSDRVFRFQHWQRFVAGGGRLEDLLYREPPSYLTRMKAVREQAPG
AWLMDTGAAALWGALEDARVAARSEEGLVIINCGNQHTIGVLLKGQRVLG
LFEHHTSCLSGHKLAAFIEKLRAGKLTNEEIFNDGGHGCYIDPSYQPGDG
FRFVAVTGPQRRLTIHQDYYWAAPYGDMMLSGCFGLIAAVAGVKINP
>Moth_1476 Propeptide, PepSY amd peptidase M4
MNKKLLASLTTGVMLLGAATGIGVFANQPAKAAAASIPAIAQSAATVNPA
PANGQQVQASQQQPAYNASIKVANSQNDNGTEVNETTEKQNEAAESQALQ
ARAKITADAAKSAALKAVPGTVKKVALDNENGNLVYSVEIQAASGSIDVK
VDAGNGQVLAQDQDSGQDNEKGAKGIDNDNIQVEQ
>Moth_1358 conserved hypothetical protein
MPEHPIEALMKTAMESIKDMVDVNTVVGDPIETQDGQVIVPVSRVTAGFA
AGGSEYESSTADGGGGGKESLPFGGGSGAGVSVQPVGFLVVGKEQVRLLP
VDGNAVVDRLIDVAPQVMEKIQELLGKGKSAKGNGKVNGSSTPTTVRTKM
VLRPQDAEE
>Moth_0992 conserved hypothetical protein
MARGTKKIVFHKIPVRTHVVTDKDDAISLAKKYSAGIAAPGDVICLAESV
VAITQRRAILPEEVRPGRLARFLCRFPGKDGSLATPPAMQLALDEVGPVR
LLAGCAAAAFGRLIRKRGLFYIVAGRQLALIDDIAGTMYPYERHIVMGPK
NPGKLVREIKKATGAEAVIADVNDKKCVDILGITDRRYIQAVTEALRDNP
FGNEDEQTPIVILKRQ
>Moth_1420 conserved hypothetical protein
MVMAGVSYQPGTILRETIHSIRNILGDSLDDLTVERVVIGVFYTGVKLSN
GQGGLCFTPIKAIPGAVCCPSSARAMPASGELRGRKATAFLEGMFADQAL
RRALGIAVLNALSATCWQVRPPMNYTLKTGANALDQVIIPGEGQVVVVGA
LAPFLKVLKRQDCRFTILELDPATLKKDELPFYRPPEDAPEVIPWADLLI
ITGTTLINDTLEGLLSVVKPGAQVVVVGPTASMLPDAFFRRGVNLLGGTL
VTKPDELLDVLAEAGSGYHFYGRAAEMMVLRLSDHGGTI
>Moth_2223 transcriptional regulator, PadR-like family
MNYTDREYWNGIIKMCLSKFFILRVLYTQPMHGYEIARTVAQVTRGCCTP
TEGTIYPVLREFEEGGYVTSSLEIAGGRERKVYTLTPKGQEAFRVAVEAW
KEVTGYILEAVKLEDYASTRRRCIMAGAKGILSNFSGLREGNCYSVRNEE
IPIGQCRTAASSCSCGSSLPGSFDKEEGGEIGISPDTQASQRSLDIEFLY
LDLDSCTRCGGTARNLEEALNEVAGVLQATGIQVNLHKIHVQSEEQALAL
GFVSSPTIRINGRDIQLDVRESLCESCGEICGEDVDCRVWVYQGKEYTEA
PKGMIIEAILKHVYGGGNESPAERETLQELPDNLKRFFAARRKKEEGGAG
RKPAPEPQDSCCGISPLSKCCK
>Moth_0170 hypothetical protein
MPCELRGIKMLRFKIDENLPIEIADLLREAGYEAETVWSEQIQGFSDIEL
LGICRNEKRVLVTLDMDFSDIRRYRPEDYQGIILLRIASQGKQSVINLFK
KVIPHLRYQNLIGYLWIVQKDRIRIRGPER
>Moth_1931 conserved hypothetical protein
MTELLNNQDYRKEALKEIIRELHRGKSVEEVKARFNELIKDVAPAEISLM
EQALINEGLPVEEVQRLCDVHAAVFKESLERAPQPETIPGHPVHTFKEEN
RALEDLMIREIQPLLAELRRANPDVEKDLAIKLAEKLNLLQDVNKHYSRK
ENLLFPYLEKYQIVGPPKVMWGVDDEIRDLLKEARDLAVNYVPDKKEELI
TRTEAALAKIKEMIFKEERILFPMALETLTEDEWYRIMLDSASIGYCLIE
PREDWRPAQVKLDQKETVASEETRGYIKFATGILTPREISLIFDHLPVDI
TFVDKDNVVKYFSNTRERIFTRSRAVIGRRVENCHPPASVQVVEKLIADF
KSGRKDREAFWLHLGDKYVFIQYFAVRDEKGDFAGTLEVTMDLKPLQAIS
GEKRIMD
>Moth_2349 hypothetical protein
MEPKLRKILNRYLGEIEARLEGISCAARKEFITEIRSHLVEKWESSGEKT
EESLLQVINDFGDPREIAEDYLTKVGGEARVRRTYPSTWLVLTLTALIWP
VGIILAWVSPAWKFRDKIIATLIPIVIFLLLLTSSLPAVREVHKLTTQVQ
LQPIETEVQR
>Moth_1425 Protein of unknown function DUF364
MWEVYDELIAAVPPDLEVEDCIVGLNWILVRSRATGLVMTPLEGHPRIKL
AGEIKGMPVRDLAEYIKSWNNFEAALGLAAINSVLNTPEQVESLCGRPLS
SQPQMNAFTCFQEQVRGKKVTVIGHFPDLDPLARNCKLTILERRPREGDL
PDPACEYILPEQDYVFITATTLINKTFPRLMELSRRARVILVGPSTPMTP
VLFHYGIDTLAGTVVVEPKLIRRVVQEGGCLEIFKRGGRMVQVSRDEGLA
VALSRQSREQAVQVMMGRRFSAIGAGQENFA
>Moth_0167 Protein of unknown function UPF0150
MGRGAFTTFIQAAMKEAVIEQLEDGTFFAEIPPYPGVWADGKDEKECLAT
LQEVLEEWLLFKLRDGDNDIPVLGGEDLNHKWQVKL
>Moth_1898 Protein of unknown function DUF488
MFKLKRIYEEAAADDGLRVLVDRLWPRGMSKEKAEVDLWLKDIAPSPGLR
QWFAHDPARWEEFRRRYEGELYLKGDLLAILRQKAKEETVTLLYAARDEN
YNHVVVLKKFLENANKKPGKPGSR
>Moth_2483 Ribonuclease III
MDFVANLSPLALAYVGDAVYELLVRAHLVGRGPAKPEQLHREALKYVRAT
AQARVVPALEEYLTPAEKDILRRGRNARPGHLPRTAAPAEYHSSTALESL
LGYLYLQGDWARVEELAHLIFKLAEQD
>Moth_2145 conserved hypothetical protein
MFWVCRYRLFIVPLLVFFLVSGFGLAWGADETSTMQSPEVEWEKTLGKGI
GYSVQQTSDGGYIIVGSTQFRGAGDVYLIKTDANGNKLWEKTFGGSGSDE
GYSVQQTTDGGYIIAGSTHSYGGGDDDVYLIKTDANGNKLWEKVFKGEEL
IEVKGGIARIKTLKGELIKEIDITKDWEKYWPETLGAELINEDTTNNYWL
WKKTLGGKGRSVQQTADGGYIIAGYTNTYNVYLIKTDTNGDTLWERIFGS
NYTEVYSVQQTTDGGYIIAGYIDPGSVGKGNVYLIKTDAKGNMVWEKTFG
GSNWDKGYSVRQTTDGGYIIAGFTRSYGVGNDDVYLIKTDANGNKLWEKN
LGGNYWEGGYSVQQTTDGGYIVAGVGDYSQIKTDGDGNLLWKKTLRGEGR
SVQQTTDGGYIIAGYTFSRSTDSDVYLIKLKPETPPANQPPVVSLKDMQG
HWAADAVDRLVETGVVSGYPDGTFRPDLEVTRAEIAAILVRALKLTPTNN
QELKFKDDATIPTWAKDAVSIAVKEGLVKGYLQPDGTMTFEADRPVTRAE
MAVLVARVLRKKLGEVTPMELKFTDAVMIPAWAKSDVGVAVAEGIVVGYP
DNTFRAENHVTRAEAAVMILRLLRVLGRI
>Moth_2411 conserved hypothetical protein
MKKDVLVKVRGTQTNDLGEQDSIELITEGRFFIRDQHYYILYNETCLSGM
EGTTTSLKVEPRRVTLNRMGTAEQKTTFETGILNYSFYVTPYGTMRISVL
PSKVEVDLTERGGSINLEYELQVGQEKISNNQLEITIQHLENPV
>Moth_1536 conserved hypothetical protein
MATVHLVQVETANQPPGQAPRRDGDPLAGIEELLPPNIKAAVESLPAGIR
DNLEEIRLRRERPLQVRWSGGEGWVAASGGLAAGPDGAYKVTAADLGRTI
EALTRSSLYALEEELRSGYITISGGHRVGLVGEAVVLQGEIRTLKNFAGL
NLRLARDIPGCARSLIPYLLEGGRPLHTLILSPPRCGKTTLLRDLIRLLS
TGVPELKFSGVNVGVVDERSEIAGCWLGVPQLEVGPRTDVLDRCPKAAGM
LMLLRSMGPEVIATDEIGRPEELAALQDVLHAGVTMLASVHAGSLEELQH
RPGWGPLLKQGFWQRLVLLGRTLGPGTIEGVFSGDHRTLKRGPWRGEARP
>Moth_1493 Protein of unknown function DUF1458
MHVKVVELVGESPNNWKDAVQKAVSEASRDISNISGVEVYNLTANVKDGK
LSEFKANVKIAYADHSADL
>Moth_0697 AIR synthase related protein-like
MIGKVDDAFFRQAILPHTGAGDPEVVVGPRMGVDAAVLKIGEEYLAVAED
PIFPGPTTSPDDFGWITVHIGASDVAVMGIKPRFMTYSLLLPPGTPEDYI
AGLVRSISTYARELGITIVGGHTGFYGAVTIPTIGGITVWGRGREVVTPA
GARVGDAVIITKGAAIEAAALVACELGEKLLAAGISPDLVARAKKRLREM
SVVAEAGIAVEVGGVHAMHDATEGGLARGLWEVAEASGVGLRIERARVPV
PADIRAVCDYIGLNPYEVISEGTLVLTCAPEKADAMLAAFKEAGIEAAVI
GRVVPAGAGRAWLEDDGREEQLLPPAVDRFWEVFFNALALKNDTRTPAEV
ALCRELGQAVRELEEANVAALIPEIGANLAYCLPEAKELRDIAAIPGRLL
RFKGRVATLGEPEMGCSHHMGGTILVVREFFPQARCVINLRNNARVRQAC
ADLGYKVVSMPVPPDYRQTDDDFYTDLRRTMAACRELPDVIEIPDRINLE
RLILVLGRNPGEIVSKVTSLATRVAELE
>Moth_1878 conserved hypothetical protein
MSTRNPTMTEGNQGLGLIQGVGLTVILTLVARQLAMLPILKIMGSMVLAI
LLGVAWRSLMDIPATAEVGINFASKKILRYGIILMGLRLDIPKIIAAGPQ
VILLDILAILVSMVVIIFLGQRMGLNKKLAALIAAGTGICGAAAIAAIAP
IVRSRDDETAVAVAIVALLGTLFTILYTLLYPVLNLTSFQYGLLSGSSLH
ELAHVIAAAQAGGSASADIAILVKLGRVAFLVPVALVLGLIFARQNETGA
GWHWRQLQVPWFILGFLVFSGINTMAILSTPLIAFLIQVGVFLLTVAMAG
LGLNVSLEMIKKVGSRGLVTGLLGSVVLSLTIFLVIASLIN
>Moth_2093 hypothetical protein
MEEKDLIRSLNWFYSLEIEQVDLYKSQARAATDIYLRQVLTRVAAMEQEH
VINLEAEISRRGATPTRLGAIIAPLLGVATGTILNWTNTRTLLWANITLE
EKAMADYKRLILKVAEKTLFNLLWSHLIDEDLHAAWFSNKLKELDRLALH
>Moth_1084 Stage V sporulation protein S
MEVLKVAARSNPNSVAGALAGVLREKGGVEIQAVGAAALNQAIKAVAIAR
GFVSPNGLDIVCIPAFADIQIEGQERTAMKLIVEPRKTQ
>Moth_0749 Protein of unknown function DUF180
MQVMTSRFGTLEINPSDLLHFPQGIPAFEHLKEFFFYPIPENPAFTWLQA
AADPEVAFLLVDPFLFFPGYAVDLPARLQEELAIKDPADALVYAVVTIPD
GDIRRATANLVGPIIINPTVRLGMQLILEGTKYTTRHQLFKESFSDDESI
PSSGGNEG
>Moth_1742 conserved hypothetical protein
MSYRVGIPRGLSYYYLFPFMQGFLTALGVEVVVSPPTDAVTLAAMNACPT
DEPCVSVKLYFAHAKRLVESGVDYLFIPVLSSVEKDNYCCPKLIGAAAMI
RNGLGLAPEQVLAPEWNEREKPGAWRENLLEIAARLGAGPERAAAAIRAG
LEKQAACERMARQGLTLPLVYHRLCGTEKPRRQQFDPEARYDEGETIAVL
GHPYLLYDGAGHQIVERLAEYARVITPEMVPPEDYRPEVATIFEGTRMWA
YEARILGAGFSLLRKGQVTKMVLVEAFECGPASVIESYLEAEAERFGVPF
LLLTVDEHTGEAGLVTRLEAFVDTSGSRPVNPAGKKQAFFFPASRGGELK
VGAPGMGLLDIALEAVLQECRVEMVPTPPVTKRTVELGKELAPEFICYPL
VTTLGQIREVLEQGANTIVMVGGKGRCRLGWYAQVQELLLKRLGRDFHLV
IIDSPLPWRERWPAFRQALKEITNNAPLWRIIQGIYLGYHKMAALDQGEA
MVRRKRAYESQRGAADKAWRRFVGRVKTAAGVRSVKRVFNEFREELAAIP
EVPASPLRVKIIGEIYTVLEGFVNQEIEQFLASREDLRVEVVREITATQW
FNLNVLHRRYEVERHRAIVTAAAPYLDVSVGGHGQESVGEAVLAAREGYD
GVIQLLPFTCMPEIVAQNILVPLSEKLDLPFLSLIINEQNGTAGWETRLE
AFLEVLAERREKLAAPGGEKHGVLFGY
>Moth_0268 Protein of unknown function DUF72
MTVFNPFLASASANKPSTSNFRPPTAAIYIGTSGYSYRDWQGYFYPRGLA
AKDMLPYYAREFNFTEINSSYYALPRPQNLEQMARKVPEGFIFAVKAYRT
LTHDRGESVTDDSKQFRQALQPLVDQGRLGAVLLQFPYSFHNNQANREYL
ARLRELLPDLPLVVEFRHAGWVHDAVRDFLARNELAYTCVDEPDLPGLPG
PVVYCTAPVAYVRFHGRNAAKWWRHEEAYERYDYLYTEAELKEWLPGIAY
LAGQARQVFVAFNNHYHSQAVTNARMLKELLAAGQL
>Moth_0834 Protein of unknown function UPF0040
MFMGEYHHTIDDKGRLIIPARFREELGVKFVITKGLDNCLFVYPMQGWAE
MEQKLRSLPFTRADARAFVRFFFSGATECELDRQGRILLPGNLREYARLD
KEVVVVGVSTRVEIWSRSRWEEYCRETSDQYEALAEKMVDFDI
>Moth_2416 Protein of unknown function DUF163
MLHLTIVAVGRVREKYLVAGIEEYLKRLRPYARVRILEAPEEKVPDKPSP
AGVEQILASEGRGIGRLIPPSSFTVALDREGVMLSSEELAGRLADLALAG
KNEVALIIGGTLGLATFILQQADLRLSFSRFTFPHQLMRLILLEQLYRAF
KIQRGETYHR
>Moth_1842 hypothetical protein
MNCSQCRELISPYLDGVLSETIQRALENHLNSCPACREELEAMGQTIEII
RAWSEEELDLPPGFEERLRSRLEECRQPWYRRLSRNWLSLAAAAATIMVV
AITARADYLHLGSSRQIAVPHEKQVQELAMTRGDQQVTPLKALPPVTSTD
APQQSAPKVKVKAATTSVRSAVRNLESSHPDPEQQQRKIVPGGTFNLNSR
GRAERAAPEQQTGGQSGKGQPDQDKDKGKEKGPGQSRTVLEAGKKEVTPR
AGEGVAGGTSTIAGDGPGTVKTPAGDGKEVPPLPPAGGKATLQDLTPGVG
RQNSAASPDSDLQNRTLTQPPPAPVAPATIPKPPSP
>Moth_0856 Protein of unknown function DUF152
MATFTGEEREGIFFLRVTFLESRAPVKAVYTSRRGGVSNAPYDGLNLGLH
VGDNPRAVLANRNLLAGVLDLPLGSWVIGEQVHGNEVARVGREQAGSGVQ
ELTTALKGIDALVTNEPDVTLVAFFADCVPIYIVDPVNRALGLAHAGWRG
TVLQVGARMVARMATEFGSRPGDLLAAIGPAVGPCCYQVDARVVDKVQEH
LPFAGELLAADGPGHWRLDLPRANYLSLVAAGLQPDRIAVAGICTHCQAE
TFFSHRASGGITGRQAAMLALGEQV
>Moth_2269 Protein of unknown function UPF0261
MTAKDIAIIATVDTKEAEARFLQEFITSHGWQAPVLDVSTHRPHNFQATY
SREEICRRAGVEYKDLGTLRRDAMMATMGRGAARVLMELYDRGELAGVLG
IGGNQGTAIAAMAMRSLPVGLPKLIVSTVASGNVRPYVEYKDITMMFSVA
DLLGGPNTVSRTILSNAAGAVIGMAAWGQPLKAGERPVIAITALGNTDPA
VAAARGRLVELGYEVIAFHASGTCGSAMEELIEAGLINGVLDLTPHELIG
EVHGADIYTPLRPRLEAAGRRGIPQVVSLGGLDYFCFGPADTIPQRFQGR
KTHYHNPYNTNVRATGGELAQVGEVMAAKLNAARGPVVVMVPLKGWSENG
RAGGPLYDQEADAALVASLEANLNPGIKLMKLNAHINDPIFAASAVAVLH
QLMEVSRPVDGTFPREAVEKGTLPPKNPKWRRSLTPESAIVKQAPR
>Moth_1797 Protein of unknown function UPF0150
MDKFLVVIEKADGNYSAYSPDLPGCVATGKTPQETRENMAEAIKMHLQGL
KEDGRPITVPTAKADYIGVNLQAL
>Moth_1977 conserved hypothetical protein
MVLRGLFAIIEAYLIYWPLVYFIARKYFKFTPEWAVPLASGISICGVSAA
IATGGAIKARPMIPVIVSSLVVIFAVVELIILPFAAQAWLYREPMVAGAW
MGLAVKTDGAAAASGAMVDALIRSKALSALGVKWQEGWMLMATTTVKVFI
DIFIAIWAFILAIIWSWKIDRREGEQVNAGEIWLRLPKFVLGYAGLFVLV
ILLGVALKGTPGIKLLNAGIGQAGIKPIYLIQIGGLRMA
>Moth_0891 Protein of unknown function DUF370
MDIKLINIGFGNIVSARRIVAIVSPESAPIKRIIQEARDRGMLIDATYGR
RTRAVIITDSDHIILSAVQPETVAHRLSSREPLASAEEPVD
>Moth_0206 conserved hypothetical protein
MLELNRPAEVVAMPVYEYRCAKCGVFEKEQRITAPPLTECPTCGGPVHRI
ISRNIGVIYKAGGFYTTENRSQEYKNKAKEESKTSEVSKAS
>Moth_0179 Protein of unknown function DUF86
MEKLSECLTKLEPLKTKSFDDFEQDPYLRDIVERNLEVAAQCCIDIANRV
ISIEDLEKPEDYYSAFITLGQAGILPLKFARSFAGIAGFRNILVHEYIQI
DWHEVYRNLHRLDDFYHFADCIKAWMKK
>Moth_0210 Protein of unknown function DUF917
MGSRIILDNEVVEAAVLGGAVLGGGGGGSMEMGRQAARLAVELGSPELIT
LDSLPEDAVLLTVSAVGAPAARTVYVKPVHYIRTVELFQKYTGQEIRGLI
TNECGGLAAVNGWLQAAALGIPVVDAPCNGRAHPTGVMGSMGLHRLSGYV
SRQVAVGGNPQTNSYVEVFASGSLETAAALVRQASVQAGGMVAVARNPVT
AGYARENAAPGAIGRCIAVGRTIIENRSRGPLPVIEGVAGVLQGEIAFTG
RVAAVDLETTGGFDVGRVVVRDGDRLAELTFWNEYMTLEIGSVRKGTFPD
LLATMDLTTGLPLSSAEIKAGQEIAILHVHRDRLILGRGMKAPELFQVVE
KATGKEVIKYIFS
>Moth_0456 DedA
MTALHAYLALFGLLAIEGTGLPGVPFEPAFLAAGYLIERGEMSFWGAVLV
GTAGNLLGNLIGYWLGARPGRGLIERLLHQGWGEGGMTTARYWLARYGAA
VIILARWFGPIRTPTILAAGVVGMETGIYALYSTLGAFTWTLAWQYASWK
GTHYLLGWWQVYRRYATWWMDALLILAGLAITTISVYYCWRRWRRDKEPA
>Moth_1572 hypothetical protein
MWWPEVGEKNPFVPVHIQVSELDILAKLGHEIVYLTSRPVLAIDLTRAWL
AAHGFPRGSIVFLPRGHKKFFALYYSIDLVFEDDPAEVLQLQKAVGRVLV
PAWPYNLNTKGRGIKAFTSWREIVGYIDCFRQLRMVL
>Moth_1298 Protein of unknown function DUF1614
MLWPLLFLFLFIPMLMASLFLNLAIFSFARLGLSPGGAMLLLSASVIGGL
INIPVSRRRLYIEEPRFGSFPFFFYYPPQVSYQLLCINVGGAVIPVLFSL
HLLATRAPLLPALTATLIVTVVAKLLARIVPGVGISIPTFIPPVVAALAA
IIVSPHNAAPVAYIAGAIGTLLGADILNLGAIRRLQSQVVSIGGAGVFDG
IFLVALAAALLS
>Moth_2249 Protein of unknown function DUF881
MRRIYLSLLIISIFSGLLVAWQWRSHLATAAQTNQDPGLIDIIHALEKED
ASLENSIADLRQQIDALQKKHSQGAGRLTETQKEIDSLRLTAGLVAVTGP
GITVTLDDNSAGAEAAQKSSPATYKPDDYIIHDKNVLYMVNELKAAGAEA
IAVNGQRIVANSDIRCVGTVIMVNSTRLAPPYIIQAIGNPDKLEAAALRS
EEFVFLKSRDFPVKVGKNDSLTLPAYNGGFPLDHVRPLPTGGQQ
>Moth_1355 Nucleoside recognition
MTEAVLAVSRWAIPLVIFLIPAYGYLRGVAVYEAFVAGAEDGFKVAIKII
PFLVGMLVAISIFRASGAMDLFARALNPVLHLVGIPGEVLPLAVMRPLSG
GGALGVAAELIGNYGPDSFIGRLASVMQGTTDTTFYVLTVYFGSVGVRRY
RYALALGLIADISSLIAAVFICHLMFG
>Moth_2386 conserved hypothetical protein
MAAKKRNYWDYARYTNMAFSFGITLTAGVLLGFYGGSWLDRRLGTSPWLM
LAGVLLGIGTGFHSIFSELRALEKDLKNRETDAQDKGKPH
>Moth_1009 Protein of unknown function DUF441
MESTLIILAVLVVAVLGRANTVALAASLLLVLKLLQVDQYIFPFIEKGGT
FWGLVLLIAAILVPLARGTVTLRDLGHVFLSWVGLSAFILSLITTYMSGQ
GLQYLTVQGHSEVMPALILGAVIAAAFLGGVPVGPFITSGVLALLVKLIA
KL
>Moth_0414 sulfonate/nitrate transport system ATP-binding protein
MVVGLLEMLEDARGREDIFKLAGSLSMELDDIGPVIEAARVLGFIETTNG
DITLTRLGSKLLNADINERKDIIAARLQELPAFKEVLQLIKSGRGRQVRR
EQVVRRFARRMSDEDAEVLFKTVVDWGRFAEIIGYDTKGEVLYLDEGA
>Moth_0587 conserved hypothetical protein
MAHHFFLPIVVAPGETVLLEGENAHHAIRVLRLRQGESITLADSNGQGYR
AEIVAITEGRVAAAIKDPLDSPEPRVRVTLYQGWPKGDKMDLIIEKCTEL
GVDRIVVLATERSIPRPDQQTCARRRERWQQKAHAAARQSRRHRIPVVDG
PLGLAEALVKLRPDTLLLVPWEEERTRDLKSILATVPADRELALLIGPEG
GLSRAEVDLACRFGGLPLTLGPRILRTETAGLACLAAIMYAMGELG
>Moth_1915 Glycoside hydrolase, family 57
MPRGYVALVLHAHLPYVRDTEDDFSLAEKWYHEAVTETYIPLINICQRLN
RDRVPYRITISLSPPLVTMMADPLVQEHYRRYLERLRELAAREVWRTRND
PRFHLVARMYQDLFENTARTYQTYGGNLINAFRELQDSGKVELITCAATH
GYLPLIGLQREVVRAQVEVAVNNHRRLFGRPPAGLWLPECAYNPGDDAIL
RDYGLKYFFVDAHGLLYATPRPRYSIFAPVYTPAGVAAFGRDLESSEQVW
SAQEGYPGDFDYREFYRDIGYDLDFEYIKPYIHPSGLRLDTGLKYYRITG
KSGYKEPYVPEWASFKAHTHAGNFLFNREQQINYLATYMDRPPLIICPYD
AELFGHWWFEGPQWLESLFRQVAGLAPQPFSFITPSEYLERFPVNQPATP
CMSSWGNNGYNEVWLEDSNHWIYRHLHHAAAEMIRLANQHPTAGGILLRA
LNQAARELLVAQSSDWAFIMKTGTMVEYAVSRTKKHLLNFWELTRGINKN
DLDPAKVQALEEANNIFPDINYRIFASR
>Moth_0051 conserved hypothetical protein
MSYTWKKVTLEVDGQRIATRTFAPTVGEFLSQQHITLGAEDAVTPALDAP
VTRDIVITIKRAVPVKINADGREKEILTPPDTVANVLNKAGVTLNPADRV
IPDLNATIAAGDTIKVIRVTVKTETVSKEINYRVERRPEPQLEKGITRLL
QEGVKGLQEETYRVILEDGQEVKRELVSTKTLKEPVPEIVAVGAMDTASR
GGQSFRFERVFWATATAYTHSGAPTATGAYPRVGTIAVDPAVVPLGTRLY
VEGYGYGIAQDIGSAIKGDRIDVFLDTEADTRRWGVRRVKVYVLR
>Moth_0446 Radical SAM
MPVELRRQLAEVYHLPLKELLPAAWRLRLKNFPPVLGLATPGSKHYDSGD
HRNHRRLFVTISITGQGCRLQCEHCRGELLKSMYPATTPAALLELGRQLK
DGGCRGVLVSGGADLGGRVPLLPYLEALAGLKGLGLKVIVHTGLADPATA
RGLKNAGVDQVLLDIIGDRETARRVYHLEMDPADYGAALENLLSAGLKVV
PHIVAGLNFGRLAGELEALYQVLSRDCRDLVLVVLTPLPGTPMAGITPPP
PAAVGRLLATARLAGPRLNILLGCARPAGRHRLWTELYALRAGVNGMAYP
HEATVARARRLGLKPFFSDLCCSLL
>Moth_0453 LmbE-like protein
MGVSAVIPAYNEETTVGRIIDTLKQVAAVTEIIVVSDGSEDDTAAVARHH
GARVLELAVNSGKGAAMTAGAREAREDILLFLDADLEGLLPDHVQALIEP
LLAGRAEMSVGIFSRGRSMTDLAQVVAPHLSGQRAIRKDLFLAIGADRSR
FEVEVQLTSEARARNWRVEKVPLVNMTHIMKEEKRGLYRGVVARMGMYKD
IAGFFWRLTRKKLKARPVAVLLLLLSLGVTFNYDTQRVASAEAGRMPDLN
LPAAGQRLLVVSPHPDDETLGAGGLIAKARARGDTVKVVFMTNGDGFRRG
VETTRGILPTSAGDFLTYGERRQQEAITALGNLGVGPADIIFMGYPDGGL
AAIWSNYWQEDKPYRSACTRKEAVPYRLAFKPGEPYAAPALLADLEEILR
EYRPTDIYVTDTNDSHPDHWATGAFTLAAVGELKGEDPTFNPRIYTFVIH
TGMWQMLPVFDRDHKPLLPPGYFLARGTPWYKLPLAPAILELKKQAIAAY
RTQEMVMPTFLANFERPNEVFSRLPDQEVITTATGMSVDGWVKEWPRDAV
IALDPAGDLVTKKVERGGDLKAAYLLQSGRTTYLRLDTWGRVGFPVNYTL
SIYLLPASPGAGSQRFTWSWAPGEKQVRWLTRPAGYDPNAIRVASGGDSL
EMALPDLIPPGEHYLMFTAVTSIGRLPLDRIPWRLVKIKGSDL
>Moth_2248 Protein of unknown function DUF147
MTGLAALWRYLLSLNFSDLLLVALDISVVAFVIYKFMMLIKGTRAVQLIK
GLVVLVVASVIAERLHLTTINWLLSQLRLVIVVALPVVFQPELRRALEQL
GRGKFFARPLTTLGAEDMEKLINELVRAMQVLAKNRTGALVVVERETGLN
DYIETGIRVDGVVSAELLINIFVPLTPFHDGAAIIRGDRVVAAGCFLPLS
ESPYLSKQLGTRHRAALGISEISDAVVLIVSEETGVISVAEGGKLTRFLD
EKNLRELLQNLMLPQDNHTTFLWPWRS
>Moth_2507 transporter
MALLMELAVAGALMGTMIGAGFASGQELVQFFLTLGTGAPAAVVLMTGLL
MTSSFLVRHLALKWRTASYRDLMIMLLGNWYRPADVAITAFLFGGLAIML
AGAGAVARQYFGWPPLAGILACSGLALLGSLGRGRGVLILNSLLVPVMLG
IIVLIVGLNWAPTVGPLTTVPAGPLVGSNWVLNACLYVTYNMVGLMVLLA
SLPNSRRGTAGAALGGLLLGILVFVLVQALGRLPKEILVTELPLLSLVKS
RHPELQGAYALSLWLAMVTTAASNLYGLAERLGSSSRLPLPAAVPVLVLA
VPMASFGFANLVGLIYPFFGYLGLLLLILVMGRRLLLFLRF
>Moth_2060 conserved hypothetical protein
MKIRRSRKIALVSHCILNTNAKIEGTASYPAALQQVVGLLLAHHYGIIQL
PCPELRAMGLRRWGQVVEQYANPFFTEAVRSMLVPTLREIQEYVRNGYRV
DLLIGIDKSPSCGVNLTCSGDWGGEIGIKKLTESMVVKGEGVMIRVLRQE
MEKLGIPLTMVGVDEDNLDLSIEAIKQAMLRAGS
>Moth_0567 Iojap-related protein
MDAGRVAQLAAQAVLEKKGIDPVILDLQGITLIADYFVIASGTSTVQVQA
IAGRVEEILDAAGVELLHREGLEAARWVLLDYGAVVVHIFLEEERRFYDL
ERLWGDARRVAIESP
>Moth_1564 Protein of unknown function DUF103
MSIDLCRWRLEKAERTFKEGEQLLDVGFYNGAINRFYYAAFHAVRALLAL
KKLDSAKHSGVISLFNREYVKTGVISKEASKTLSTIFAMRSEADYDDFKS
FSLQEAADARKAVRSLIDEVSAYLAGIS
>Moth_0131 membrane protein-like
MAEGRVTALAEGALMAALTVVLVLTGYFIPPLQLLTNIIWTVPIVVLIVR
QNLRLGVMATFIAGVVIALFTGPLNATLLFVQFAALGLVYGYLFKIKAGA
GRTIVIGALVALLSLLLTLALTFKLTGLPVGGLIQEFEGTVNYAMEFYRR
AGILDHLAGQGLTPDQIQASLQGMINLLKLLLPGILMTASLLAAFANYLV
AEKVLQRLGLKAAGLPPFRYWQLPWYAVWGVIAGLALWQLGDYYHLALAS
RVGVNILYVYLPLLAGNGLAAVTFIFYHLRLAPFFKAVLVLVALMYFPVA
LVSLVTLGLFEPFFGFRRHLRPPADKGE
>Moth_0395 Protein of unknown function UPF0016
MKAFLLSLGLIFIAELGDKTQLVALTLATRFNARVVLAGIFTATLLVHVI
SVALGEFVGVLIPTAWTHFLAGLAFIGFGLWTLRGDSLDDERDNAHRIAS
PFLLVVVTFFLAEFGDKTMLSTVTLATTYSIIPVWLGSTLGMVLSDGLAI
WIGQAMGSRLPERVIRLGAAFIFFVFGLFSTFQGGLNLSPPVWGLGVAIL
AILVFIFFRKPVQKGN
>Moth_0045 hypothetical protein
MSTNLPEDTGGMLTVALAGLEGQLNEVLKETRRLRARVAALEEENQRLRA
MILAAGEGESGRQQLVRLYNEGYHVCPPHFARVRGNEGCLFCQSFLEKKG
LLPDG
>Moth_0859 Protein of unknown function DUF552
MAFWQGLINWLGYGHEGEEPVMEKPVLEAETTPVTPAKGKLVGLPTARNA
MRLVIARPQSFEQAAGLAENLKNYRPLIVNVEGIPVEEARRIIDFLSGAA
YALGGRVRKVTSGIFLFTTSNVDLSGDLEDQIPGGLNWLEAAGRGR
>Moth_0050 conserved hypothetical protein
MFLLRQPSAGRGRKLVLLACLVAGLQSLFLVGWTSKKVDIYADGQARQVT
VNQWLVGDFQSRSQENLRIGDWILPLPGGWFWPGLKLFMARGTPVAAEVA
GQNIWPREPASVASELLNREGITLGPGDRVETNLGSEDPHQYIRVIRVED
SIEVQQQPVDPPVVRRPDRYLPPGQEKVVQEGQPGVRYYKYQVRKENGVE
VERRLVDTWVEIQPSPKIVAYSSRAYPEVTARAGDTLMVIATAYTHTGNR
TATGIWPYRGVVAVDPRVIPLGTRLYVEGYGYAVAQDTGGLIKGKRIDLF
MDSAGEAMRWGRRQVTVRILGD
>Moth_1242 Branched-chain amino acid transport
MDRQILILILGTALVTYLPRMLPLVVLSRARLPEVFLRWLGFIPAAVLAA
LLAPGLLLPQGKLALTGNPYILAAIPAGLVAVKTRSMALTIVSGMAAMVL
FQYWL
>Moth_1263 Protein of unknown function DUF302
MEQPDFSYTVTTARDFEAAVKAVEEATAAQGMKVQHVHDVQATLRSKGYD
SDPLKIIEICNARYAHEVLAKDILISLMMPCKINVYVRDGKTYISALRPT
MLAQFFPHARLEEVAREVDTKIRTIVEAAR
>Moth_1083 conserved hypothetical protein
MRILMIGDVVGRPGRKAVREVLPALLQEHRPDLVIANGENAAGGNGITPD
TAGELFASGIDILTMGNHVWDKREALTLLEEEERIIRPANYPPGTPGRGY
NLFEVKEGLKVGVINLSGRVFLPPLECPFRLGKQLAEELRAETRVIIVDF
HAEATSEKVALGWHLDGLVSAVVGTHTHIQTADARVLPGGTAYITDVGMT
GPRDSVLGVKTEIIVHKFLTQLPARFEIAGGVIQLEGVILDIDPSTGRAA
GIQRVQHYCNP
>Moth_1060 hypothetical protein
MRAGLEQGLITAIKGDREEKPVDRLEHIFELQERFDRDLARRRQLPDYSP
AEWIQKEVLAIVAELGELLDEVNFKWWKNPHPLDHEAIKGELVDILHFLV
SMCLKAGITAEDFYQAYLAKNQVNFRRQQGLTDQKEYAAGWADKEE
>Moth_2394 Protein of unknown function DUF204
MEPLGLILVAVALGTDAFSLATGLALGGFRGRQAWLFAGTVGLFHIFMPL
AGLYLGLLLGRLLGKVAAIIGALVLATMGTLMLWEAYNNRRQGGSMVGQV
LRVIPGRGGVLGGVMAILFMAGSVSLDALSVGFGLGAISVNVPLTVLTMG
FIAATMTALGLLAGRRLGSFFGNRAELAGGLILVAIGLKMLVGV
>Moth_2326 hypothetical protein
MKFRYDPDADALYIRFNESSICETEEISQGVLLDIDEEGKLVGLEILNAS
KKLGKPPLTVEVELSKAATI
>Moth_0259 conserved hypothetical protein
MDGLKWLYPGLKIKRWLLLAVLGLLLLVSGLTVILGITLLASAEKGVTWF
ILHTLGGLGSPLLAGLLAMALGAVFIGVAVRNLARSVIQVLLPGHTANPW
QVFYRRQYLARGPHLVAIGGGTGLAVLLRGLKNYTRNLTAIVTVADDGGS
SGRLRQELSIPPPGDIRNCLVALADTESLMEDLFSYRFRQGEGLAGHSLG
NLLLAAMTDMAGDFDRAIQELARVLAVGGRVIPSTTTHVVMGAELADGST
VLGESNIPLAGKPIKRVFLKPADCRPPAAALEAIARADAVIIGPGSLYTS
VLPNLLVPGIVEALRDTPAPVFYVCNIMTQPGETDGYTVADHLRALIDHC
GQGIIDTVIAHSGPISRAARRRYGEKGARPVLINSPAIARMGVELRRGWL
VDETHVVRHHPERLASLVMEEVYRHQARGRRRFFYLVRERFRTLAR
>Moth_0160 UvrB/UvrC protein
MLCERCQERPASVHVTRIINGEKTELYLCQECARELQPQLNFSIPQFLAG
LLDYDPELEVKAPPAVERCPECGLTYEQFHETGRLGCPECYHHLAPRLDP
LIRRIQGSSQHRGKVPRRAGGNLRLRREIENLRARLQQLVQQEEFEKAAQ
VRDRIRDLEGRLEKGESSQ
>Moth_0855 PRC-barrel
MIRVAELRQREVINVIDGRRLGTIKDIDLDLEEGRVRALIVPGQGSKFLF
FFGREEDLVIPWENVVKIGVDVILVESYSSTAPVHREKA
>Moth_2213 Putative Fe-S cluster
MFLEMIDVTKVETCLADAEKIRLQAVLSQDIEELLPYLNTVIKNAVYNHY
TKNLTFLKEFRLITLYPRKLTMAKAVNMTDALQVLDWLKDLINDTHRRQK
EIQPSFAKKDRPTALKIYKWLPGTNCRRCGQLTCLAFAARLLSGENTLAD
CPPLAEEENSERFNALQGMLG
>Moth_0249 Protein of unknown function UPF0150
MEIKRLTAAITQEGKWYVARCLEVEVTSQGHSIEEAISNLKEALELYFED
EEHPRINASPIIAPVEVNMAI
>Moth_1046 Protein of unknown function DUF150
MSKLTALIQELVEPLLTPMGYELVDLQYGREGGRYILRLFIDRPEGIGLD
DCEQVSRVVSALLDEKDPIPHSYYLEVSSPGLERPLKKEADFNRFAGRKV
KLRTFAPINGQRHFQGRLLGYQDGQVRMHLEEGWDLAIPLEQVATARLVF
DLADDEED
>Moth_2327 hypothetical protein
MKLRFTAHAEKQLIERKIVKKLVNETVCNPEQVIPQGQDVLIYQKIYKEV
GKEYLLRVAIKLSGDTYVVLTAYKTSKIKKYGGDK
>Moth_0938 Nucleoside recognition
MRQPVFIISRGITPFLTAVAVVILALAIVLFPQPVFQAALRGLRAWWEIV
VPALLPFFIISQLFMGLGIVHFLGVLLEPVMRPLFNVPGSGAFVMAMGYT
SGAPISAILTSQLRQQQLVTRVEGERLICFTNNASPLFMLGAVAVGMLHN
PALGPALAGAHYGANLFLGVLFRFYGRRAPASPPGNHPLLSLPRRAWRAM
IQAQQRDGRSLGQLLGDAVSHSFQTLITIGGFITLFSVIIQVAGMLGILD
LLARLLLYAGHPLGLTPATAGALASGIFEMTMGTKFASEAPVPLGEQLTA
ISIIMGWAGLSVLGQVAAMTSKTDLRLGPFILARLLHGFLAAFMVQLFRG
PARPVLGWLTGSHFLSPPVSWLSLGVHYTGFTLTLAALLLFLTVLGLFAR
LTLYRRF
>Moth_1111 Protein of unknown function DUF151
MLIPVKVKQIVLDQTLNPVVLLGEPEGNQVLPIWVGPFEAQAIALAMQGI
LTPRPLTHDLLRSLCENLGVEVNKVLVQDIRDGTYYAELYLRQGDREVVV
DARPSDAIALALRTNAPLYITEKVAAYTLNVEDLVSEDQAEELQQMLTDI
KPEVDKKHLH
>Moth_1745 Pseudouridylate synthase
MKLKVIPEDFVVRELARLPIREKGPYRLYLFEKKGWNTIDLLIRLAKAHR
LPYRLFAYGGLKDRHAHTFQYVTVKHPADLTTEAENFSLQSIGYMDRPMG
PDLLEGNEFAITIRALGAAEVCRISRRVDEVRGFGYPNYYDNQRFGSMDR
QMGLMAERLLKKHYNGSLQIYLTGIYPEEKKEARERKLFFREHWGDWSTC
LARAKTTMESRIFSLLVEKPKAYIEALQMIPREELSLLFSAYQSFLFNEL
LRRILQEFGLDLTAVPGTAGPYLFYRRLERKELGYLRALSLPLAASRMEF
PDAMSERLFAAILEERGIKRSSFNLRKVRQAFFKSTPREAIVFPGNFRIQ
PAEPDDLYPGRQKIRLFFKLPRGSYGTMLIKRLTMP
>Moth_0102 Protein of unknown function DUF606
MWLALLIALVSGIAMAFQGSLNSALAKITGLLQATLVVHLTATLAVGVLL
FFPLSDGHLGRIWQCPWYLWLGGLIGVVITYGVVASIPRVGVALATTAII
VGQVTTALIIDHLGLFGLDKIPFTWWKAAGLILLATGARLMLN
>Moth_0347 hypothetical protein
MRIGVDLCNTIANINAMLVMKFTRLSLTRYPDPEIPEGFFHTTEGLELLS
KAQPFPCAAGTLRFLASAGHEIIYLTSRPVLAAGLTREWLAVNSFPRGTL
MFLPRGFKALFARYYGIEWFFEDDPLEALRLNDVVSRVFVKIWPYNLGVQ
GPGIVRFVNWREVLFLVSGRKRAGAGVVHERHDYGARIGARG
>Moth_1062 conserved hypothetical protein
MVHMRFTELMGKEIINLYDGSRLGNFADADLVLDADEGRVAAIILPPRGG
WRSLFGSRQELLIPWEAVRKVGNELVVVDMDPTYSRRQKD
>Moth_1151 conserved hypothetical protein
MSLHIGFPTALIYYSHFPFWQAYFNRLGVEVVTSPTTTKAILDDGAREAV
ADACIPIKLYHGHVLALKDKVDALFIPRMVRLNRRTTFCPKFLGLPDMVR
ATLDKLPPIIDLQVDAGRNLWGLWPTCRGVAELLGFSRRLAWTAYLEGSH
HQQAFEGLLLKGYLPLEAMAKLRGEPIEPAPLRPGSLNLAVLGYPYQVYD
GYISLNIIAKLRKMGVNILTLEMVPQARLYRLSRRLPRRLFWHFSSLVVG
ATYNYLQQGDLDGIIHVTAFGCGPDAMVDKIMELDIRNYSQGKMPFMSIC
IDEQTGDAGVSTRLEAFVDMLLQRRAAR
>Moth_0120 Protein of unknown function DUF441
MSAATVILILLMLLGILGRSNVIAAAAAFLLLLQFTSLQRFYPILERRAL
EAGLIFLVVSVLVPFASGRVAPRDMLQSFVSLPGLIAIASGIIATHMNCQ
GLELLQRFPQMMIGMVIGSIIGVAFFGGIPVGPLMAGGIAALLVHLMAWL
R
>Moth_0952 Stage V sporulation protein S
MEVLKVSAQSNPKAVAGALTAVFRQHGKAEVQAVGAGAVNQAVKAIAIAR
GFIAPNGIDLVMIPAFAEINIDGEERTAIKFIVEPR
>Moth_1321 Protein of unknown function DUF205
MWLLALVVAYLIGSIPTAYVVGRYLYGFDIRRRGSGNVGATNTLRTMGTI
PGLVVLGVDALKGVLAVLLGQALGGPVLVILAALMAIVGHNWSIFLEFQG
GRGVATTAGALLAMAPLALFWAFLIWLAVVIFSRYISLGSIVAAAVAPFL
VIYFHRPWPYVLFTFVAAALVIYRHRPNIKRLLAGTEHKLGERS
>Moth_0821 Protein of unknown function DUF964
MQVYDRAHELARELSRSSEYNDFRLAKAKLESNATNVDMLRDFRRRQLAL
EMAVLSGKEPDPADKKALEESYRIISLNPTITAYLEAEQRLARLLADIQK
ILIDALPEWGKDIIDEVDKK
>Moth_0965 Putative helix-turn-helix protein, YlxM/p13-like
MLDDLARVARLYDFYGPLLTPKQRHWLELHYHHDLSLGEIAGEEGISRQA
VYDGLQRAVKALEEYESRLGYLRRDMALREQLAAAIRHLENYRRGGGEGE
LVETSRILQRLLELPEGSMGKK
>Moth_2038 Trp repressor
MVSDKLRDPQVDDLFRAILALEDIDECYRFFEDLCTTAEIKAMAQRLAVA
RLLRRGVTYTAIAEATGASTATISRVKRFLNYGADGYKLILERLEGNGK
>Moth_0443 Biotin/lipoate A/B protein ligase
MRQWRLLDTGSRTAAENMALDEVLLTARSQGQAPDTLRFLQFNPPCVLVG
FHQVVEQEVRLEYCRREGIEINRRITGGGALFWDTNQLGWEVITTLDYPG
VARRLEGLYAQLCGAVVRALKRLGVPAAYRPRNDIEVGGRKISGTGGTEL
GGAFLYQGTLLIDFDVETMLRALRIPTEKLKAKEIASLKERVTCLKWELG
RVPLLETIKQVIAEEFCREFAMELIPAGLTPAEEALLAHQLPYFQSEEWI
NAVQGPEGRTELRSSRRTRGGFLRSSLVLGPGNSRIESLYLTGDFFAHPR
RSIYDLEARLKGLPADPVLISRQVEEFFRESGARLPGIKAAEVAAAINDA
LVKKDYPRQGIPAAAVNDVFTVVKPLEEITAAPVVLLPYCAKLPTCRFRG
RQGCSECGRCDIGTAYALARQYGLEPLTIQNYEMLARVLRRLQREGAPGF
LGSCCEAFLAKHRRDLERIGLPGILLDIDSSTCYELGQERAAHAGRFENQ
TTLKLDLLELLMARVAPGKARRQVAVAAHA
>Moth_0690 conserved hypothetical protein
MRVGVGFSSANDPSAAGQVASEQAVRQSGSPVITLVLTTDNYDQERVLSA
VKRVIGNSRLVGACVPGVIVNARLYKRGVGICTVSGEGVEAVTHLQRNIS
QHSYRKGEKAGEALLEKGGETPGTVLLFPDGFAANISGLLRGLYNVMGPA
FEYIGGGSGDNLRFYRTYQFTEEGISSDAVAAAVIRGINFQMCLSHGWRP
VGEPLMVTKAKGRKVYEIDGLPALERYSALVGAYDKNDFSCYSMKYPLGL
PCAGGEFIIRDPLKAEEDGGILFVTEIPENTIATLMEGDTASLLAAAEEV
SKKALNTPAAPKTFMVFDCVSRYLLMGEDFSREMEAIAKNIKAEIPVIGM
LSFGEISSISGTPLFYNKTIVAAAGW
>Moth_1327 Protein of unknown function DUF1614
MTGYTIGVLLLVLVFGLVYFGLAHRVLDRLYLNDRTALALIVAMIVGSFI
NIPLTSGRVVTSINVGGALIPLGLAIYILYRAGSVREVGRALLGAVITAV
VLFGITYITRGREAWNVYALNLLDPLYYYPLVAGFVAYLVGRSRRAAFVG
AILGVLFLDIIDLIFYLRLGLRGTVAFGGAGIIDATFMAAVVAVLLAELI
GEVRERMQGGPELAGHSRSLLAGLKGPSLKRTPADIDINRDGLRNEGGEG
NG
>Moth_2072 conserved hypothetical protein
MQPKSGNRYDITIKDLFADETQELINYFGHLEARVTGDLKIEFPQVETRV
SDLVMKAESQQGPLAIHLEFQSRNDDEMPYRMLRYALEIHKTYHLPVYQI
VIYFGQWQMNMTSQLEYRLGDQNLLDYRYHLIDVGNITYEELKNSPHQRL
LSLLPVVDREKRQKGGKEFLRRCAEDIINSDLDLETKKTVLLRAEIFAGL
VFDKKAIDLVFREVEQMLSIEESAGYQRIFEKGMEKGIEKGMEKGMEKGQ
QESLLNVTIRLLSKKFRKIPREYMARIKKQDVYVLQQIIDNIFDINDLKE
LEDYLQ
>Moth_2360 GtrA-like protein
MFNNKPGAAQFLRFCAVGVGNTVVDLTTFFGLTLVGMPYLLAQVLSYSAG
VANSFFFNRKWTFRVTHKASALEVIKFITVNGLSLLLSSGLLFILHDVTH
LQLWLSKFSATGGGIVVNFLGSRLWVFTES
>Moth_1914 conserved hypothetical protein
MDILSDEFKQSFSIIPPESGVYGMVTVVSVLILALLAGVFLWYWRRKGNS
PVAKPLPAEAPPARDLPAKYGKDEIVLMVKDPYWLYAYWELSDAKKEELR
RQFGPAAWENSRPLLRVYDLTDHPYDFLRAPFFEIAITDLADNWYIHTGK
PCSTFCVDLGRLIPNYGFITLVRSNIVTTPADKPSSVIDPLWPPIEACWT
AVERYEKGTGPSSLQLVKTQK
>Moth_1023 conserved hypothetical protein
MPEFKAMELVAELMAIAARTAPKAGGKDFIELKILQGDSLEQLAIAMTRY
GQEKGKKNFDRDGENVRRSDAVLLVGLKKAAKAGLDCGACGAARCADLEG
PHEGPEFAGPICAWRLIDLGIALGSAAKTAGILNVDNRVMYRIGVVARKT
GLMDAEVIAGIPISATGKNIYFDR
>Moth_0762 Protein of unknown function DUF115
MSPSTNLVYRKNARVLQRYSPELFRDLEATALPLDRQLAPAQNGEPTLIA
ITGGKEIALHSRYDPRREAVTWARGVDENADMVVVLGMGLGYHLEALKDL
YPHKAVLVLEPELAAVKLAFAARDMTHLLKSGQFYLLAVADPEDAAAQLS
NILAENAGKRIALHTLPAYEQLYAGYWQRVCQGVTDRLRQRRVNWATTEK
FMMQWLCNFRDNFLPYIKAPGVIHLFDAFSGKPALIVAAGPSLEKNIHLL
PSLKGRVLIMAAGSAIRILEKNGIKPDLLVSFDPGDANYQHFAGFDGRGV
PLVYAPVIFPRIVQEYQGPTFSCELNVSPFIEWFDEKLGEKKGVLISGPS
VANVCLDLAVKMGANPIILIGQDLAFTNNKTHADGARHQQRIDPSQGNYI
WVEDIYGDRVPTTTAFYSMLVWYEQYLGNLKGKRLVIDATEGGARIRSTE
IMSLQEVRDKYLRETFSPGEIIAAKHDVYAVPDGEQLRRLEEAFSELSSR
REDLRACFEEGIEVARQLLEKCHKKTVKLTNYERARRKFMGLDRRITGNI
LYRLFLEQGLAARIDAINRILGERVNDEQELPARGEKLASLYLSFFTEVE
RYAEFTTEILKEIEEKIRRESASTSCSKA
>Moth_0209 conserved hypothetical protein
MEKHPRAFEPAVLILNIILSVLGSIIGLQILTTLGVTPNTAIIGVLVALA
LSRIPGGWMAKYRSIHRQNLVQSTISGATFGAANSLLLPIGIPYLFGRPD
LVVPMLIGATMGMFIDWAMLYWFFDSRIFPGQAAWPPGVAAAEAIYAGDE
GGKRAWLLVWGTIIGIIGSYFKVSMSAFGVAFIGNVWALTMFGLGLLLRG
YSVKLFGFDIDKLYIPHGMMIGAGLVAGIQILLILLKGRKETTASGDAPA
AANYTRSEKQVAKGLARGFGLYIVAALVLAMLGGLYTSMPAWQLLFWAVF
AAVSCILAEFIVGLSAMHAGWFPAFATALIFLVIGMALGFPAPALALLVG
FVASGGPAFADAGYDFKAGWILRGEGRDRGFELDGRWQQFLAGASGLVVA
WAMVTLTHGIYFRQGLFPPVDKVYAATIKAGVDAAIIKNLVLWAIPGALI
QALGGSEKQLGIMLATGLLILNPLAGYAVLAGILIRTLVLKFKGREAETP
MTILAAGFIAGDALYGFFN
>Moth_1412 Dinitrogenase iron-molybdenum cofactor biosynthesis
MKVAITARGNDPKAEADPRFGRCQYFVIADVEKGTFEAIANANQNAGGGA
GVAAAQALVNAGVEVVLTGNVGPNALRVLQEAGIKVYSTAAPTVQAALQQ
WQAGGVSPLSQATVGSHFGMGRGGNRW
>Moth_0511 Protein of unknown function DUF444
MPVEYNLSREDWSLHRKGYLDQQRHQEKVREAIKKNLPHIIAEESIIMGR
GKKVVRVPIRSLEEYHFRFNYNQGQHAGQGSGGTRKGTVIGREVIEGAGG
GAGAGDEPGMDYYEAEVTLEEVQEMLFRDLELPNLREKKKPVMASPAYEF
RDVRRKGLMGNLDKKRTLLENLKRNAMKGKLAIGGITPEDLRFKTWEEKI
RYETSAVVLAMMDTSGSMGTYEKYIARTFFFWMVRFLRSRYQQVELVFIA
HHTQAREVTEEEFFAKGESGGTRCSSAYRLALEIIDRRYPPADYNIYPFH
FTDGDNLPSDNEACLEAVQELLPRVNLLGYGEIVNPYYRTSTLMNVLKRI
KDDRLVTVAVKDKSEVYQALRQFFAGSKGGEAGGTRI
>Moth_1372 conserved hypothetical protein
MVVKVKQWRLGVPRALFYYYYAPWWEAFLQALGAQVVVSPPTTREIMDLG
ISLAVAEACLPVKVYYGHAAWLAPRVEALFVPRLVSVEQKSFICPKLMGL
PDMLRAAIKDCPPVIDVTVDMSRRPEEGLKAAIRDTARAIGSRGREVYRA
GEIARARYRDWLAGQQEQGREFSPGKKEGITPGGQPPLAVGLVGHNYLLH
DRYLGMDIAGKVTRLGGRVILPENFAPELGEAACRRLPKRLYWTLGRKIM
GAALHLMEQEEVAGLIHLTAFGCGPDSLVGDLAERYAHRHGKPFLLLTLD
EHTGEAGVETRLEAFMDMLARRRPA
>Moth_0890 conserved hypothetical protein
MLNSMTGYGRGEASGAGKTVSVEIRAVNQRFLDVVVRLPRAYGALEEKIR
QELKKSLNRGRVEVVLTIKEDNAEKRPVNVDTGLAMAYYNALKELAQKLS
ISADITAAGLLSLPEVITVAEPEWDEATLWPVVARALAAALAGLLEMRRA
EGQRLQADLEARAAFVRRQVEAIRERAPEVPREYAARLRERVDELTGGMA
LDPGRLEMEVALMAERADITEEVVRLTSHLEQMQAAMAGAEPAGRRLDFI
LQEMWREINTIGSKAGDLTISHLVVAVKGELEKMREQVQNIE
>Moth_0713 conserved hypothetical protein
MMWGYYNWGMGVWMLVWWAILIGIIVLAVYGLVSLFNRRDGQSPMRPDPL
GIIKERYARGEITAEEYHRMRDELKE
>Moth_0186 Protein of unknown function DUF86
MALKFDALKAGKLLAAFRQAQKRLQDLARLPKEEFLTDPDKIGSAKYHFI
VAIEAAIDLSNHIIARNNLRIPEDYADTFRILGEAGIFPPEFVTTLIKMT
RFRNRLVHIYWDVDSDTLYNILVNGLKDLDAYLKAMGKFFTGNKEEPYC
>Moth_1674 Protein of unknown function DUF156
MSRRPGEESSYAPQREDLLLRLRKIEGQVKGIHRMIEEDKYCVDILIQIA
AVRAALKKVGSMIFEAHVRGCVRTAIVNQADKEIISELIDVLNRFIS
>Moth_0044 Signal peptidase II
MIVETARGLELGEVVIAPRQVEETEVVQPLRPVLRPATQADLEQVAVNRQ
KEKDAFAICQQKIQEHGLPMKLIDVEFTFDVSKIIFYFTAEGRVDFRELV
KDLAAIFRTRIELRQIGVRDEAKMLGGLGCCGRPICCATFMGDFDPVSIR
MAKEQNLSLNPTKISGLCGRLMCCLRYENDAYEDARHRLPRCGAMVRTPA
GCGRVTGINILKERVTVEIPEQGSMEFPLEAIEGECHEH
>Moth_1876 conserved hypothetical protein
MAKQYSTWQIAATYIGTVVGAGFASGQEVLQFFGYFGLRGILGLILATAL
FIFFGYTVLRLGFQLKAESHLEVMHRAGGAFIGRAVDAVTTFFLFGALAV
MAAGSGAIFRQEFHLPVLLGSSLLIAITLVTVLAGIEKVIDSISLVAPVL
IASVLGISLATVAKNLPALVANLSWEETYRAAVSSWPLAALLYASYNLVL
SIAVLGPLGALARQERLLPGAFLGGLGLGLGAIAITLALITTAPAVTALE
VPMLYIAGSFSPVLRIFYSAVLLAEIYTTAVSSLYGFAARLAGPGGNNFR
RLAIGASAVALAAGQAGFSRLVATLFPLVGYAGFLLLGGLAYYVLKEILA
LRPAFPGRLVPAPARRPILGAVLERRGKAGEKERP
>Moth_1428 Protein of unknown function DUF364
MSLIEAIIASLSGDEVVKEVRIGPFWTGVWSRYCGLASTTFTHEHENRFP
VGEAGFLTGKSARELCRYATSTSLLEASIGLAAINSLLEVDMEQCQDINA
GELLIERGAGKRVAVVGHFPFVPGLRRVARELWVLERRPQSGDLPADEAA
NVIPGADVVAITGTALINGSMESLLKLCRKDSLVMVLGPTTPLTPLWFDY
GVDLVSGTRVVEPEVVLKFVSEGVVFKQLHGRGARLLTMAKKGLK
>Moth_0004 RNA-binding S4
MDQFLKWNGVAATGGQAKELITSGLVRVNGQVERRRSHELVPGDEVEVKG
ACFKLTTAPAD
>Moth_2262 conserved hypothetical protein
MEGDEVMPTYDFKCNECGHLFSQFAAIKERDKVRCPECGGKVSQRFTGFL
YTRKGKPGAAYSSSSCSGGSCSTCSGCH
>Moth_0956 conserved hypothetical protein
MGKSKLISFILSFFPGLGHFYLGLMNRGIAFMATFFGWMALVILASVITG
FNGFIALLILLPLIWLYSLFDAVQLCGRLQSGETVSDSSPLAELSESLVN
GYKSRLWALLFSIIPGAGHMYLGWQQRGLGFMSTFFLAMFLMEWLRLSLF
FFMLPVIWLYSFFDVMQLVSNDISVPASEGSFSTWLLERQRWVGLVLIGL
GILIIFDRMVVPYLSYELINFIKTGFIAAIFIGGGLRLALGSKIDLPATE
EKTGVSSDNELMEGERCNHRENP
>Moth_0126 Protein of unknown function DUF951
MSFYVLQRGNRRFSEENCSMDFHIGDIVQTKKTHPCGSDRWEILRVGMDF
RLRCLGCGRLILIPRVKAEKSIKKVLPKPS
>Moth_2085 Protein of unknown function DUF1648
MENKYRVSKEMITRDWPALMVLVAMLVAGILVYPHLPDLVPSHWNFRGEV
DNYFNRFWGAFALPLMTGGIYLLLLFVPYLDPKRENYPRFDRAYQVVRLG
MVFFMGGIYATTLVVALGGPANLVGRVVPLAIGLLFILIGNYLPQARLNY
FFGIRTPWTLANEEVWRRTHRFSGYTFILAGFMFIVAAFLPPPANFILGM
AGPAIAVVSTTVYSYLAFRRVSR
>Moth_1001 conserved hypothetical protein
MGNCKLDLIILKEELAVCRLQQDAPVPGWALKGDFVSITRTPDELSLVCP
AAGVPSGVKCEKGWRCLMVEGPLAFSLTGILAALVVPLANQGISIFAIST
FDTDYLLVKEKDLERAIQVLSREGHRIRP
>Moth_0145 conserved hypothetical protein
MRIRAHHLLCALQFRGYGYSEAFVRRVSRVIALWRHRPGLILTITRSHDA
WCRACPNRDTSNCRQAAARDARVLACLGLAPGARLEVAAAQDLVHRKIDS
RAATYICTGCRWLDGGYCRW
>Moth_0053 Protein of unknown function DUF458
MEPVYFTSPTRGRLTEDEMFADMMQFVDANPEAQYKIIIGSDSQARVRTC
FVTAVIVHQVGRGARYYYRKKYQRKITSLRQKIFYETALSLETASFIAQR
LAANGHADLNVEIHLDIGPNGETRDLIREIVGMVVGSGFAAKIKPYSCGA
SKVADKYTKSG
>Moth_2201 hypothetical protein
MIACQLSLYPLGTPAYTPVIKEAMAVLEQCGVEIEVNAMGTIIRGEEEAV
WRAARQLFQVAAGRGEAVLVMTVSNRCGCKVANKKAPARGS
>Moth_1073 transcriptional regulator, XRE family
MGQVGEILRSTRQEKGISLREAEEATKIRLKYLEALENGTYDEIPGRVYA
LGFLRNYARFLGLDPAELTALFKEEYPPKEESYQVEEPPGITTPRLTTRG
WGRWLLILGVILVLWGVNRLYNYYRPSPEQSPAPPPVTEPAPATPAPVTP
VQPASPPSQVQGVEVKIRATGNCWVGAVVDGKADFSGTLKPGDEKVFQGK
DKVSVTLGSAGAVEVTLNGQVQPPLGKAGSVVTFEADKGANQLRIIKKQ
>Moth_1926 conserved hypothetical protein
MRNREVPKYKQLALKIPYGQGYLVGELPEGIKVKEVVPSEIAGVPDPSAD
VRRALEQPIGNHGVEELRGVRQVVIVVSDLTRPVPNDVILPVLLEKLNAI
GIKDEQVTILVGTGLHRPAPPEEFPLIVGPEVAARVKVISHDAYAPGILQ
QIGVSSRGTPIWINRHYLEADGRILIGMIDPHQFVGYTAGSKSLVIGCGG
EATIRANHAHLVEPEATLGRIEGNPAREDIDEIGGLVGINLIINVILNSH
KGIVRTVAGHYLAAHRAGVAVARQVSEVPVPALADVAIVSPGGFPKDINL
YQAQKGLAHGTRLVKEGGVIILCAECREGAGEKGFIDTMQAGNTPEEVIK
VFKQGEFQMGRHKAFLWCRSLVRARVILVSDGVDDNLARIMMVRKAPDLQ
AAINMALKELPDAEVVTVMPKANSTIPVF
>Moth_1586 conserved hypothetical protein
MRICITFNNNITVPAELNDSETARKIQDALPLAGRVNTWGHEIYFSIPVK
AGLEKGATEVMEIGDIAYWPPGHALCLFFGPTPASVDDKPRAASPVNKIG
RFEADSHILKQVPDGARVEIKEA
>Moth_1361 chromosome segregation and condensation protein ScpA
MLGNVRLDVFQGPLDLLLTLIERQEIDVQAIPVAEVTAQYLEYLQEVEEL
DLEWASEFLVLGAELLALKARLLLQRPAAGEQEEEGEDPARALADRLQAY
RCYKEAAQHLAELAATGALIFTRPRDQEAVERALAGINPLAGVTPLDLAR
AMARVLARKEQVQEPEEPRLIPRLAFTLAGQVRHILRSLYRAKTLSLDRL
LSTRPTRMEVALTFLALLELARRGRVALAQEVNFGNIEVRLLPHPGAR
>Moth_2512 Protein of unknown function DUF111
MKIAYFDCFSGISGDMCLGALIACGLSQDELTSGLKGLGLEGWELRVREV
KQHSIAATDVAVQVTGSQPHRHLADILGLINNSSLPAPVKEKSAAVFKNL
ARAEGQVHGIDASQVHFHEVGAVDAIIDIVGSILGLHLLGIEKVISSPLP
AGSGWVDCRHGKLPVPAPATLYLLQGYPVYGTEDKAELVTPTGAALITTL
ADSFGPFPAMNLTRVGFGAGKTELPHPNLLRLALGEINSGQLEGEESSLV
IETTIDDMNPEFFPALLEETMAAGAVDAFFTPVQMKKGRPGILFTALCPE
NKLAAVAAAIFTHSSTLGLRFRRDQRLVCQRRMAEVVTPYGTVPVKLGLY
RDPTGQVITNIAPEYESCRQIAKSAGAPLKEVYAAALAAARALKAF
>Moth_0028 conserved hypothetical protein
MGMGNMNKMMKQMQKMQAQVARLQEELGERTVEASAGGGVVKVTANGRQE
LVNIKIDPAAVDPEDVEMLQDLILAAVNEALHQSQEMVTREMAKITGNIR
LPGF
>Moth_1096 Cobalamin (vitamin B12) biosynthesis CbiX protein
MATGIILLGHGSRIPEANEHLKVLADQVREILGGVRVEPCYMMRTHPNLA
EGIATLVKEGRRKIVVVPMFFSNGLHVQRDIPEQLAAARERYPDVEFIYG
ANLGADRRIAEVIVERIQEVAPGGFSV
>Moth_2250 Protein of unknown function DUF881
MMLKFKNWPLSLAVVFLVLGLLLSLQFRTQRLLASSLEAQKTTDLITMWK
NLSTKRNQLQGEIAQLQQQLFTLETNSSQSSETETSMEKELARLQMNTGL
AAVKGPGITVTITGDAPLLYYDLVDLVNELWASGAEAIAVNDHRISAYTT
ISDQQDGPRNYITIDGQRLLYPIVIKAIGDPQTLDKGLTFTGGLIDNLNN
LYKIYPIIKKEQDLQLPATSLPSWHYAKPAPPSAPAGDQNGK
>Moth_2391 conserved hypothetical protein
MAEWQDVTATVQAAAEELLNVAGLQPGQILVVGCSTSEITGRSIGTASSL
EIGQAVVAGLLAATNRAQVYLAAQCCEHLNRALVIEAGAASLYNLPVVTV
VPAPKAGGSLATAAYASLHRPVVVASLLAQAHAGLDIGSTLIGMHLRPVA
VPVRLAIKTIGAAPVTAARTRPPLIGGQRAVYK
>Moth_2217 hypothetical protein
MPVNSSRKVYVIPCSGIGKMYGLLGREAVLKTVKELRPDKAATMCLALLV
YGDNEARKEIAGARCITVDGCPKLCAAKNVEHAGGRVVEMIRAVDAFRNH
RGVDAGTAAHLTAAGWQIADELAADLAGKVDRWYDAGEEQ
>Moth_0140 NADP oxidoreductase, coenzyme F420-dependent
MMRVGIIGAGAVGTGMGLLLSRRGYTIAGVSSRTMASAERAAARLNCPAF
ADPETVARRSEIVFITTTDRAIGPVATAIAGRGGFCPGQTVIHMSGSLTS
AVLDPARQAGALALALHPLQSCADADMAVANLPGSVFSLEGDREALPLGE
RLVNDLEGEYFIISPEAKPLYHAAACVASNYLVSIVDLSYRLMQAAGMAP
DMVARALAPLIEGTWGNIKEKGVPRALTGPITRGDVATIASHLQAMAARA
PELEEIYRAVGRYTIGVAGRKGSLNARRAALLGQLLANSRGKSPVRVPGR
SPNKKRS
>Moth_1881 Protein of unknown function DUF503
MTIMVIGVGTATLRLAGARSLKDKRRVLKSILARLHNRFNVAAAEVDYQD
SHQKAEVGIACVSTSGSHASQVLAAVMGFLEAEDAIELLAYHTELL
>Moth_0220 Protein of unknown function DUF421
MKFVEVFLQTLLAFFAILIYTRILGKQQIGQLTFFEYINGITFGSIAAVL
ATDTAPNQTWMHFLGLTLFAFFTWLAGYAVLVSRPARKLISGEPTVVVHN
GKILEENMKKMRYNFDELAMQLRQKNVFDIADVEYAIMEPDGDLSVLLKS
QKRPLTPSDLKLSTKYEGVPTELIEDGEILFQNLRQNHLDEKWLIQQLQA
QGIQDISQVDYAVLRSNGTLYVNTKEDDIINPVDITDAPESPVKTEKEEQ
DRP
>Moth_2074 hypothetical protein
MQPKSGNRYDITIKDLFADETQELINYFGHFEARVTGDLKIEFPQVETRV
SDLVMKAESQQGPLAIHLEFQSRNDDEMPYRMLRYALEIHKTYHLPVYQI
VIYFGQWQMNMTSQLEYRLGDQNLLDYRYHLIDVGNITYEELKNSPHQRL
LSLLPVVDREKRQKGGKEFLRRCAEDIINSDLDLETKKTVLLRAEIFAGL
VFDKKAIDLVFREVEQMLSIEESAGYQRIFEKGMEKGIEKGMEKGMEKGI
EKGQQESLLDVTIRLLRKKFRKIPREYLARIKEQDVYVLQQIIDSIFDIN
DLKELEDYLQ
>Moth_1166 Protein of unknown function DUF820
MSLTLAELAAGRQRYTYEDYCRLPEGSPYQLIGGELVMTPSPTPYHQMVS
MKLELQMAGFVLEKGLGIILYAPVDVYLDEEETYQPDIIFIASSRLDIIE
EKRIKGAPDLVVEILSPGTGYYDLRSKYKVYEKSGVREYWIVDPQQKSVQ
VFCLRDGKFVLDQEAEQQGTVRSRVIAGLEVQVESIF
>Moth_1749 Protein of unknown function DUF710
MPQAEGQVAAQDRTTVNINGQDYVVKGEAPEYIQMLAAYVDKKMRQVNQK
FPHYSPVKVAVLAALNIADELYKVQQDYDTLVKLIQEEKQG
>Moth_1855 hypothetical protein
MLDLAHGYHQNALICLPQDLHTLYVYWDFTPARIRILVDFFHHVRPEMEL
TLRLCRQDCPLPEQQLTLESLEPGWRYFSNLDDRAAYHLELGAQSPEGEF
VLFSRTPVFQIQPGRTVEPAGARQLPPGLNPDWTIPPEGQQSGSNFSWS
>Moth_1639 Protein of unknown function DUF1292
MGYDNPILEKRGEVMADQENTIILTDDEGHEHEFIVVDVLNLEDDEYAIL
LPAENADGNDEAVVLKIGLDEDGNEILYEIDDEEEWQRVARAWEDAVAEE
DGEE
>Moth_0829 hypothetical protein
MDNTKIITVKPMGVKVMPTYDFRCQECGERFTVKMSWKDKDKATCPACGS
KKLQQLFTGITILGGNSGGGGCAAPAGSSFS
>Moth_0729 conserved hypothetical protein
MSCRLSLPYGTHKLTFHIPEEKIKAVLSPVAMKEIPSTEVEIQRALENPI
GCQSLGTMVSPTSRVLLLCDDNTRPTPANIIVPAILRELESGGVRKENIK
ILMALGTHRPMTLEELQQKLGAEVLAQVEVINHDFRNPMALHDFGLTANG
TPVKVNRVVLEADVVIGIGSIVPHHIPGYSGGAKIVQPGICGEDTTAATH
LLSVRTRRNMLGIVENKVREEMEAIADRAGVKYIFNTVLDPQGRVVKAFF
GDIRQAFRAGVEISRQVYGIPAPGRTPIVLASSHPCDIEFWQAHKTLYPC
DMLVEEGGTIIIVTPCPEGVAVTHPEMLEFAGQKPEDIDAQIEEGRIKDK
VAGALALAWAKVRQHAEVCLVSDGINAEVAAKLGFKHADTIEEALEMTWA
RLGKEARVIVLTHGADTLPLLPEDNLE
>Moth_0848 Protein of unknown function DUF1290
MWIPLIGLIVGVVAGLLLPVKIPVVYSKYMSVAVLAALDSVFGGLRASME
DNFDNAIFLTGFFSNTLLAAFLAYIGDQLGVELYLAAVLVFGVRLFQNLA
IIRRHLLKR
>Moth_1371 conserved hypothetical protein
MKVTFPHMGHLWLVLKAALTGIGLEVVVPPPCTRRTLELGVRHAPESACL
PLKVNLGNYLEAKELGADTIVMAGGVGPCRIGYYSQVQREILRDLGCAYE
MVVFEPPDVHFNEVWDKIKYLNRRPWQDGVHGVVMAWLKACAVDALEQEV
QRLRPREAQTGLADAVFRRALQELDAAGSRKEVNRVVKEIKGELAALPLK
PDVRVIRVGIVGEIYTVLEPLVNLDIEKRLGALGVEVVRGLFLSRWINDN
LFKGLLPLPGHHPEKTAPPFLNHFVGGHGWESVGDTVTFARRGCDGVIQL
APLTCMPEIVAHSVMPAVQQATGIPVMTIYLDEQTGEAGLQTRLEAFVDM
LRRQKGVRAG
>Moth_0861 Protein of unknown function YGGT
MQTLAVLVRVAFEVLNWLIIARILISWFPHDPNHPIMRFIYEITEPVLAP
FRRIMPRTTMPIDFSPIIAVLVLQLVEHLLINFIMRLG
>Moth_0763 Protein of unknown function DUF820
MAAIEIARRRFTVDEYYQMARAGILGEDDRVELIEGEIIEMVPIGTQHAA
CVRRLLHIFSTKIGDNALVDTQNPLRLGQNSEPQPDLMLLKPRDDYYATF
HPRPEDVLLLVEVADTSLAYDREVKVSLYAKGRVNEVWLVNLQTQQVTAY
RQPSPSGYREVKEYGHGDHISPLAFPGLNIPVQYILPGD
>Moth_2168 conserved hypothetical protein
MYLVTAAEMGQLDRLASSEYMIPSIVLMENAGLRVVESIERHFQGQVANR
RILIFCGKGNNGGDGLVVARHLLNRGAEVKVFLLARPEDIRGDARTNLEI
YQKMGGKLLLLLGESHLQRADIALLYADLVVDAIFGTGFKGAAMGLPAAV
INMINKAHRETVAVDLPSGLEADTGRCFGPCIQATWTVTFALPKLGLVVE
PGASLTGRLEVADIGIPQKLVATQHFNRRLLTAAWCRSQLPRREASGHKG
LYGRVLAVGGSPGLTGAITLAATAALKAGAGLVTAAVPRGVQGILAMKTT
EIMTMSLPETPAGALSRDALDPLLERLAEVDVLAIGPGLSRDPATVDLVK
ELLPRVQVPAVVDADALNALATDTRVLTGDHGPLVLTPHPGEMARLLGTT
AAKIQEDRLEIAAKYAREWQAVLLLKGARTVIAWPDGQVYINPTGNPGMA
TAGSGDVLTGIIAGLAGQGLKPGVAAALGAYLHGAAGDEAARQRGQRAMM
AGDLLDFLPYVLRNLEEEVETIVAAGLGRD
>Moth_0260 Protein of unknown function DUF199
MLMPLSFSLQTKEELARVKARHPCCRQAELVAFLRLGNLDGGQPGEETVL
FTTPYPALARKVYSLAREFLACPVKVRNSRRQGKGRPVFRVVARARLKEI
QDWLAGRAGVPEYPCCQAAYMRGAFLVTGSVNKPSGTHHLELIFPDAAMA
GQMQGLMQQQELEPRLSRRQRGYVLYLKDSEQIIRALSLMGAYSAVLAYE
NVLIFKDMRNRVNRLVNCETANLTKTVETGLRQAENIRYLIATVGWDYLP
PALREIAAVRLQHPEASLKELGEMLHPPVGKSGVNHRLRRLELIARQVRG
QGREGYAPDDDLSRPRA
>Moth_0136 Protein of unknown function DUF763
MRTGTASLPLHGGHCPPWLFERMQRLGPAILEVIVQEYGPQEVLRRLSDP
HWFQAFGCVLGFDWHSSGLTTTLCGALKEGLRGREKDLGLVIAGGKGRTS
RQTPHEIETAVDRLALTSLEPEDLVYASRMAAKVDNTALQDGYQLYHHVF
IFTFDGQWAVVQQGMNETSRLARRYHWLGEGMQDFACEPHAAVCCDARET
ALNMVARESEASRQVVTELVRQQPAKVVAEFSRILEKDLPNLALPWRHDV
PRAGYLNKALLKVYDVQPRDFAGVLGIEGVGPKTIRALAMVAEVAYGAPA
SFRDPVRYSFSHGGKDGHPYPVDRQVYDRTINVLEQALAAAKIGRTDKIQ
ALKRLSRLANGS
>Moth_1523 Protein of unknown function DUF322
MEVGRVDQILEQEAKTDLGTIKIANEVVAIIAGLAATEIEGVAGMSGGIA
GGITELLGRKNLAKGVKVEVGEKEAAVDLYIVVNFGVRIPDVAIKVQENV
KKAIEGMTGLQVVEVNVHVQGVVFPQETRDEETSRVR
>Moth_1393 conserved hypothetical protein
MKEEIKQLILYRMERAKEAIAEAELLFSEGHIRTSVNRLYYACFYAVSAI
LLAKGYSSAKHSGIRSLFHQKIVKAGLVNPSAGTLYNRLFDARQKADYAD
LVKFEADIVAPWFDEVKSLVHQIETLVVKEIRSPG
>Moth_1705 Allergen V5/Tpx-1 related
MKHKWLTAALKILLAAVVAAAPLSAARAATPASNSTRGIYSYNYSWYTAP
YWSWHYRWHVSPNAGSGQVTRQSPTPAPAPAPQPASKPAPAPVTSPGNPA
VPAPQPAPQPAPAGNYQLSAYEQQVVNLVNAERAKVGLKPLAADPQLARV
ARLKAEDMRDKNYFSHESPTYGSFANMLKQFGISYRIAGENIAAGYPTPE
AVVAAWMNSPGHRSNILNANFTAIGVGYASGGSYGHYWVQEFIGQ
>Moth_1414 Protein of unknown function DUF1063
MNLLDQQTLINLLHQEADVAIGCTEPVMVALAAAKTRDMLGTLPRLVDIS
VSSAVWKNARRVGLPGTGEKGLAMAAAMGLLAPVEAGQRLLAALTPVQVE
QAKILVREGVVKVGVVAAKEGLYARAVARSNQHEAIVELNGSHKNFSALW
LDGRMAGGAGENLNLKLEALLAQDYQSLLKQVLSLSPEELYFLYQGAEDI
LTFAREIHQGGRNPLSAMASFFRRTESGGESLEVLIRNLTGIAVAERMAG
ATYPVLTCAGSGNQGILAAVSLLLAGQELRAGPESVTRALAIAHFTNMYL
KAYTGKLSPLCGAVTGGAGVAAAICWLLEGSCQQIINAMQIVLGNLCCVI
CDGAKESCALKISTAAVEAVRAGYMACQGINLEAGTGIVGKKLEDTMELV
RKVYQGGLGEIDYYLGKVDYLLSTN
>Moth_0912 serine/threonine protein kinase
MIGKVLEGRYEIVSELGGGGMARVYRGQDRLLNRNVTIKILREQYASDKE
FLARFQREAQAVASLSHPNVVSIYDVGQEDDLHYLIMEYVEGRSLKDLIS
ERAPLPPLEAIDISLQICDALEHAHENGVIHRDIKPHNILITRNGRVKVT
DFGIAQAVSEVTMSQSGTMIGSVHYLAPEQARGGVIGATADIYSLGIVLY
EMLTGDLPFHGETPVAVALKHLQENPRPVRELNPNVPPALERVVMRTLEK
DPARRYPSAAALRSDLLAVRNALADATFATQVLPAIETPDPPSTLPKPRR
RPRVWAWVLMALLFLGLAAAGLWAGFRYYLAVGETLVPSVVGLPEGQALE
QLAAAGLRGQVIARQYDASVPAGQVMAQDPGPNQRVRRGRVVALTVSQGA
RLVRVPSVIGETERNARLILENANLKVAADTLKVYHPSIPAGSVVDQNPP
ANTQQPEGTEVRLIISKGPEPQFTTAPSVVGLSLAEAQQKLLEAKLKQGT
LTYQRSDNQFPGYIIAQDPREGSNVLQGSAINLVVSQGPGPVQKQVGVTI
DPAPDDKDHEVRIVVTDAKGTNEVLKKKQKMGQQIQATINYFGKGKLQVF
RDGNVIYEQDLQ
>Moth_1643 Protein of unknown function DUF965
MWPPRCVRKPQGGKKMPGDMQETMMFKVEKEEKVRVRDVLTEVQAALTEK
GYDPINQLIGYLLSGDPAYITSHKGARNLIRRVERDEILAELLKSYLA
>Moth_1503 required for dissolution of the septal cell wall; RBL05740
MERALTRHLKDNLGLYLLVGFFFLAGIITGTIAVNFLEPQQVSQLGAYLD
KVLSQFKGEGPGFNQAAYQALLGALRETGLIWFLGLTVIGIPLIIGLIFL
KGLILGFTVGFLVQQKALQGMAFSFLALLPPNIIQIPALFIAAILGISFS
IGLMRGRGQAEAAILPRFLTYSFLMLLVTLVLVGGGLVEGYLSPFFARIV
LAYF
>Moth_0235 Protein of unknown function DUF161
MHRKAWADYLGITAGTLVTALGLVLFLVPNRIAAGGVSGLATVLHYVFGW
PVGLTMLVLNIPLFLAGLKVLGLEFGLKTLYGTIILSVFTDTLALWLHAP
TSNTLLASLYGGLLSGVGMGIVFRSGGSTGGTDLAALLFRHYLHISAGMG
LLMVDALVISLADLVFNVELALYALVALFLTSRAIDAIQEGGGYARAALI
ISDKAEEIARRVLVELDRGVTGLAGRGYYTRQEREVLLVVVQRAEVSRLK
DLVASIDPEAFVIVSNVHEVLGEGFGYFNRL
>Moth_2316 Protein of unknown function DUF86
MSLDEFLHNHQIYSTVERDLELAITCIMDIGNHIISAMDLPEPETYADIP
ISQELARKLSGAVRFRNILAHEYMDIDRRLVYKHLQTGLGDLVEFIYGIG
KFLGI
>Moth_1760 conserved hypothetical protein
MYIYGRWIVIPLIGAFIGWVTNVLAIRLLFRPHRPFQFLFWTFQGLIPKR
RAEIAANVARVVDKDLLPLGEVLEHLRTPALEEKITELVVEVARRRLTER
LPSFIPAGLKEAASKTIEETLRREIPPALEELEGELTSSISSFSLGDLVA
EKINKLDLHQVENLIVEVAGRELRYIELLGGVLGFFIGLVQAFVAGR
>Moth_2119 Protein of unknown function DUF606
MQLNITRLKVKVMLVFFFLALGIGAIWALQPVINAGLARSTGPLMASTIS
FLIGSLLLIISVVITQWLTKDPLDFAALSRVNPVYYMGGAIGAAVVLGMT
TIIPKLGAGGVLSAAITGQLIMAAIIDHFGLLGAPRIALNSLRLIGIILL
LIGVNLVIYK
>Moth_1895 Alkylhydroperoxidase AhpD core
MPLPPFIEALAERDPEFYRAVKAVAETAMAPGALDAKTKTLITLALDAAH
GASEGVAVLAKQARELGASEEEIREALRLAYFVAGNGVLAAGGAAYR
>Moth_2514 conserved hypothetical protein
MTTYWQQKGKSNTEATVKLALKRARELGLKDVVIPSVSGYTANLCLGIND
LNIVCVTHQAGFKKPGEIEMDTAIRQRLEDGGIKVLTTTHLLAGLDRALR
FSFQGIYPAEIIANTLRLFGQGTKVAVEVACMALDAGLIPAGVDIVSLGG
SSEGVDTALVLRPAHSQDFFATKIKEIICKPREF
>Moth_1494 Protein of unknown function UPF0047
MLFTFDLQTQAKEAMIDITHLAAKTVKEAGIKEGFCLVYVPHTTAGVTIN
ENADPDVVTDILAALARIVSAGGYRHGEGNSPAHIKASLMGSNQTVVIHE
GRLVLGTWQGIYFCEFDGPRRRKVHVKVWEG
>Moth_0299 Protein of unknown function UPF0150
MLPQKDLETLSKLPPERLRMVLNFARANLINQKITRRYNVKLDWNEPTDE
DGVAGYTVTVPSLPPVVTEGDTREEALENAREAIACYLEYLIITGQPVPE
SDTEGENMVEVII
>Moth_0899 conserved hypothetical protein
MRIKKRLFIGLLALSLMFITAVLAGSWYLLINHSSLFNRVLLALGFFTLA
VLFLLIALGIISLVLMLWQGRSRPLFQHLGLMAVNILFPVALALGKRLGV
EAATIKASFIEMNNQLVRLQRLQVAPREILILAPHCLQWSGCPHKITIDV
NNCRRCGRCPIDALHALAARYGVRLAVATGGTLARHFVKQYRPRAVVAIA
CERDLTSGIQDTQPLPVLGVLNLRPHGPCLNTQVNLNQVEQAVQFFLTGR
TVPQVQACSEGWVEVTHGS
>Moth_2019 Cupin region
MQDKEQAGSRPLDARAAILSELVDYQEGSIVSRTIIDKKAGTVTLFAFAA
GQGLSEHTAPYDALVHVLDGEVEITIAGKPLHVKTGEAVIMPANQPHALR
ALTNFKMILTMIRS
>Moth_0606 Protein of unknown function DUF299
MGNRAGAHFCLASGLRLPSKGGVTIGVIYVISDSLGETAEYVARAAASQF
DGGGLDIRRVPYVTDLDHLEEVVNEAAQEQGIIAFTLVLPDLKKKLLELA
AARGLEAVDLLGPLMDAITRVTGGRPRLEPGLIRRTDEDYFRKMEAIDFA
VKYDNGKDIRGLSHADLVLIGVSRTSKTPVCMYLAHKRLRAANLALVPEV
PLPGELLNLPPEKIIGLTIDAGLLYQIRQERLKTLGLPGPAGYATRERIE
EELAYARRVMDQLGCPVIDVTNKAVEETAGKILQIYYRRERNGK
>Moth_1692 metal dependent phosphohydrolase
MSLVTLEEVKKDPEVEALITRGNEHLGAMGFTEHSHRHLNLVASISRNVL
ERLGYDKRTAELAAVAGYLHDIGNVVSRQDHGQSGALLAYNILRRLGMPA
DEAATIMGAIGNHEEEYGQAVNPVGAALILADKSDVHRSRVRNNDISTFD
IHDRVNYAVQHSFLRVDAGKRAITLELTIDLAISTPMEYFEIFLTRMMMC
RRAARFLHCHFGLVVNDARLL
>Moth_1978 conserved hypothetical protein
MPERPLTDYRCQAEISTREQRVLEEVLKALRQVRYGSITIIIQDGRIVQI
DYTEKVRLGKE
>Moth_0726 Type III effector Hrp-dependent outers
MEQISIIADDLTGANDTGVQFCQHGFRTMVIIDAANVERVGQDKDVWAIN
TDTRHLAAPEAYQRVYEITLKLKKAAISRVYKKIDSTLRGHPGAELEAVM
DAWQADLALVVPAYPANRRLVVDGHLLISEGMETAAASVSLTPGDARAAL
CHIPTVLQGEMGRRVGQINLATVRQGVKELVAALEAARTNSQVLVLDAAD
EEDLRNIARAISRFQRDVIVAGAAGMAAHLPLAWNLKPVPNNPLNKKGAI
LLVAGSRNPVTAAQVQRLAEVSACQAVKVETEAILTGEPAVEIERVLQEV
TTQDAGAGLIIIAVDSLFQTIDRDRVSNSGSKAIALALGTITSRLLNMRR
ISALVVTGGDTAVHVCRALEARGINLAADLLPGIPLGYLEGGRGDGLPIV
TKAGGFGSPDSLIKVNEFLQQRMKSEMELV
>Moth_1752 RepA / Rep+ protein KID
MPEESLREIAATNEPPHTLASPEFFFLLQRIDRLDEKLTREIRDGDQKSE
ELITAVEQKLTQRIDAVEQKLTQRIDVVEQKLTRHIEEVEQKLTQRIDAV
EQKLTEHIEEVEQKLTQRIDAVEKTLTQGIDAVDQKLTRRMDALEEKLGL
RIDKQDDKLNSLKFWAIGAVITISVGFIGTIATLLYK
>Moth_1940 hypothetical protein
MPQIIRTDSFLEQFQELSKEAQKHVLKTILFLAQNPSHPSLKVHRIKGTP
FWEAYASISIRVIFERNGDTLVLLACGYHDILKKY
>Moth_0616 Protein of unknown function DUF523
MGKILVSACLAGVRCKYSGGHNLVPVIAELVRQGKAVPVCPESLGGLTIP
RPPAEIKGGDGYDVLAGRARVMDKEGRDVTAAFIQGARAALARAREVGPE
IIVLKERSPSCGSKLIYDGNFSGATRPGPGVTTALLREYGYKVISEEEFK
GEGNPSP
>Moth_1411 Dinitrogenase iron-molybdenum cofactor biosynthesis
MVKIAVATEGQMVAEHFGHCSQYSLFDIEGGKVIGREVITNPGHQPGFLP
GFLADLGVNCVIVGGIGARAVELFGVRGIEVITGARGPVEEAVSAYLGGT
LKSTGSTCSHDHEGHEGCEH
>Moth_1649 conserved hypothetical protein
MPKGRELVGLPVISQDRGEELGRIQDLFYDETSGSLRACLLADGGWLRQP
RVVDFTALQARGPGAFTVSGAGAVSHEPPPGTRRWQELKGLRLLNRDGRE
LGIVEDLVVELPSGQVKALEISTGLVNDLLEGRKEITLEGQVNWGTDTVI
IG
>Moth_0191 conserved hypothetical protein
MAKPELSSPAVKASRWTARRLATLAMLIALSTVGANLKIPSITGTPAFDS
FPGFLGALILGPADGALIAALGHLLTAFTAGFPLTPPLHLVIAAGMAAVV
ALFAIFYRFSPWLGIAAGIALNGLLLPALFIPLPGFGKAFFLAMVVPLLI
ASALNIVLAATAFTSLRRVFPASYAAGRGKGEGK
>Moth_1150 conserved hypothetical protein
MKKVSFAHMGYSYLGFKQLVEDMGFEAIVPANPSPATLDLGVQYAPEFAC
IPFKTVLGTYLEVLNRGAEMIITSGGVGPCRAGLYGLLHEKILRNLGYNF
ELFIFDPPLTGLGPFFWKLRRVLKEARLSWLAFIDVVRRAWAKLKLLDEL
EQMATVTRPYEIKRGATTRAFNQCLEIIDRARSSKEIAAAREECRQLLQS
VPRDEERRPLRIGIVGEIYVLLEPFMNLDIEKTLGEMGVITKRSIYLTNY
TTTDVLAHGTEDIRQIAHPYLNQFVGGHGQSSVGETILYARNGFDGVIQL
APFTCIPEIVAKSILPRVSRDFNIPVLSLTIDEQTGRAGVETRLEAFVDL
LRQRREQMEARSNAALLPGY
>Moth_2318 conserved hypothetical protein
MIPEKYLLLEARIRKEVANLERLERELARYNLFPRIQADSLGGFSLTDEA
SLRIIGSILHDYYTAIEKIFRIIARDIDCSVPTGEQQHKELLDQMTLEVP
GLRPALLDNETARKLDELRAFRHVFRNIYGFSLDADKIRQLLEGLPELAS
DCKKDLHLFTLRMRRILGLNSSSEV
>Moth_1139 Protein of unknown function UPF0182
MKLNRLWFCLLIIIPGFLVAAYLGSHFLTDWYWFAEVGYRQVFLTRLLSE
VGIRLGTIAFFFLFFYLNLLFTRKSLHLSPPEGRENWTLKEYLIDRFITS
RRLGILYLLLSLAGALIFSPLAAGKWLVVQEYLRATPFGLADPLFGRDVS
FYIFKLPLYHFLYKLLITAVVGAVLVTGFFYFIFNPRELLGLRRGHFSRP
LVHFSTLVALLFLIQAWGFRLQALDLVRSSRGVAFGASYTDIHALLPGYN
ILGWVAVACGLIIVLNAFRRNLKLVSAGILSFMAAYFLLVIAVPLAVQKF
QVEPNEFAREEPYLRYNINFTRRAYGLDRITIQEFPALDNLTPASLREEG
ATLDNIRLWDYRPLEQTYSQLQEIRSYYSFKDIDVDRYTLDGRERQVMLA
ARELDQNKLPDRARTWINEKMRYTHGYGLAMNPANTVTAGGQPEFIAGDL
PFHSSAGLQVNEPRIYYGELTGDYVITGGTAAEFDYPVTGEDNFVETRYQ
GRGGVPINTPWRRLVFAFRFHDYRLLMSNGLTPQSKILYYRNIQERVRKI
MPYLRYDADPYLVVAGGRLYWFLDAYTITNMYPYSEPNSGGFNYIRNSVK
VVIDAYNGSVDYYLVDPGDPLAQTLARIFPGLFKPREDMPAGLQQHLRYP
PDLLSIQAQMLTNYHMENTMLFYNKEDAWSIAEEMVGDKRQAMDPYYTLM
RLPGETQAEYILMLPFTPARKVNMIAWLAARNDGPHYGQLLLYQFPKNRS
IYGPMQVEARIDQEPRISQQLTLWDQHGSQVIRGNLLVIPIKGSLLYVEP
IFLQAQESKLPELRQVVVAYEEKIAMADTLAGALQVIFGTQTPAPAASPQ
PPSQAATGSPGNLSELIKEANRLYSEAQDRLKQGDWAGYGENLKKLEQVL
QEMGQKVAE
>Moth_1522 Protein of unknown function DUF322
MSVLDRALLALFALITAILAGIFLAVIAGWSVPLDLFELSLLNNDYRLLA
GVVALIFFLLAMRFLLGSLRLAQAGEGRAVIKAADLGLVSISLPALEHLV
TRAARQVKGVREIRPRLNYGPEGLAIRLNITVNPDRNLPEMAAELQEKVR
EYVIATAGLEVPEVQVRINGIFQEGQRRVE
>Moth_0512 SpoVR
MEQEFKTIAQAIETIHDQARKFGLDFFPVYFELCPADVLYAFGAYGMPTR
FAHWTFGKHFYKMKLQYDFNLSRIYELVINSNPCYAFLLEGNDLIQNKLV
IAHVFAHSDFFKNNIYFTATSRQMVETMAVHAAKIREYEFKYGHREVEIF
LDAVLAIQEHIEPPGPFGYKEEENEENEDTRPHRRETPYDDLWILDGRPK
EPPPERNRKIPPRPTKDMVGFIMANSPELEDWQREVMAMIREEMQYFWPQ
METKIINEGWAAYWHARIIRELDLTPAETVDFARLHASVLQPGYRQINPY
LVGSKIFEDIEKRWENPSQEERERYGRTGGEGRSKIFEVRSCENDISFLR
NYLTRELVEELDLYLYQKVGSEWVVVEKDWEKVRDGLVSRLINCGYPYIV
VEDADYQRRGELYLKHRYEGLELDVSYLEKTLPHVYLLWGRPVHLETIID
GKTTVFSYDGKKNCRR