GenColors

Gene list

Applied filters:

COG category: Function unknown
Organism: Nitrosococcus oceani ATCC 19707, ATCC 19707
Gene type: CDS

Number of genes found: 236

# Nitrosococcus oceani ATCC 19707, ATCC 19707

>Noc_2028 conserved hypothetical protein
MNSVLSSQKRGEPAFHDYRPAQARFREDVLSGLTKSQRRIPPKYFYDEKG
SRLFDAICQLPEYYPTRTEIQLLKHYGAEMAEYIGEGSVLVEFGSGSSLK
IRVLLDALEPAAYLPIDISREHLFHAAKRLAGDYPGVAIHAVCADYSQPL
GLPGGFAEKPKAGFFPGSSIGNFEPEEARQFLTRIAALLGPGSGLLIGVD
LKKEAALLNAAYNDNQGITAAFNLNLLQRINRELEGNFDLTGFEHHALYN
SAEGRVEMHLVSLKDQAVTVNNQTFQFQAGETLHTENSYKYSIEEFQQLA
SSAGFQPQQVWTDREDLFSVHYLSL
>Noc_0147 TPR repeat protein
MMVSWCRGRWLLAGGVLWAQLSVASGAEEVEVIDPGALEQRLQQIERIIE
GQGLSELLSQMAELEANVRRLQGESEELRYEMDNLKRQQRELYVDLDRRL
QKLSTSGESLSPLLPSEATEEVAEEVLVDSGEQTYQAALELLKEGRYEEA
IAAFDQFPQQYPDSRYRPNAQYWLGEARYMLGDFNAAADTFQALVEQYPE
SAKVPDAMLKQGLAYYELAQWEQAKAQFQAVMTRYPASTASRLAEERFEK
MKREGHP
>Noc_0484 Protein of unknown function DUF1328
MFGWAVTFLIVALVAALLGFTGIAGIATEIAWILFVVGIILFVVFLVLGR
RGRPPL
>Noc_3011 Protein of unknown function DUF494
MKENVLDVLMYLFQNYMDSEVESQPTKESLEVELTEAGFHHAEIGKAFDW
LEGLALLQQSEPRRFPVTNSSIRIFTLHEKEKLDTECRGFLLFLEQVGVL
DSMTRELIIDRVMALEAEDFDLEHLKWVILMVLSHQPGQEAAYAWMEDLV
FDEVSDLLH
>Noc_2389 hypothetical protein
MRNANLAGSSLKSCDLRGAMLSGANLVSANLAYAVGNDKEIKSLMLEGVS
IVYTAERMWLDDENLPIDAWCKSLAAILNLSLN
>Noc_0099 Glycoside hydrolase, family 57
MASGYLSLILHAHLPYVRHPEQEEALEERWLFEAMTECYLPLLTTFERLT
NEGIPFYLTLSLSPTLLSMLQDPLLLQRYGLHMEKLISLAEKEIRYTRGN
TDLNRLARLYRRWFLQTLSDFEERYQRQLVPAFARLQQEGVLEIITCAAT
HGFLPLLQPEPTAVYAQLQVAADYYRQCFGIAPKGIWLPECAYYPGLEKV
LKAVGFRYFFIETEALLHASTRPRYDHFAPVACPNGVAAFGREPALSRQV
WSAEEGYPGDGDYREFYRDVGFERELSYLQPYLPDGRIRVDTGMKYYRVT
DKTEYKAPYQPAKAQARVACHAGHFYHHCLQQITGANRMDRPPLLVAPYD
AELFGHWWFEGPQWLEQLLRRIGTGEGAIQTITPSQYLTQHPVLQQATPN
LSSWGDRGYYDFWLNEKTDWIYPLLHRAARRMKELTIAYGHESKGTLAGR
ALGQAARSLLLAQASDWPFILQNGTTVEYATRQLQDHLSRFHYLEMVLER
KSFDERRLQALEALDNIFPELDYRVYKHPYRER
>Noc_0162 hypothetical protein
MSTILYEANPSMLRMNPFITALSILLIPVGIGIIILLWMYIKTKMDKLTI
KTDEIVWMHGLLNKSYTEINMSSVRTVKVNQSVLQRMLNAGDVAIYTAGD
NPEVVIRGLPEPDKVRDYIKGQNEERG
>Noc_3042 conserved hypothetical protein
MLFQFNNKVSWPKLRFTVAFLLLIIILASLGLWQLQRASEKHTIETVLKE
RSISDSLRVGEERLQLPDSEYRRGIAQGWFDNEHIIFLDNQIHKGRAGYH
VLIPLRFNDGDGSGILVNFGWLPMELDRQELPRIESPDLRVSAHGVLRQP
YQAPFFLGGEESRESSGWPKLVQYVSIEQLQSQLGYFLQPLILQLAPEES
YGFVRQWPEPPTSVQQHIAYAVQWFALALIGLIISVILYRRNF
>Noc_2961 membrane protein-like
MDHLIGESGELGWVSLKALLLYLTAVIGFRLGERRTLIEMSPFDFVAAVA
VGAIVGRVPNASTTSYLAGAVTLVTLLLAHRIITQLRYFPSIADLVDPPP
RVLIAHGQVLKGELRQCGLTLNDLHGLLRQRGVEDLREVQFVICEQRGQI
SIIRQKDRTDTDSSLIRDIVARISSPP
>Noc_2537 conserved hypothetical protein
MLYPVYIHPGDNKHAHGVTFPDFPGCFSAADSWEELPTKIQEAVELYFEG
ETMEIPPPTPLEELAQRPEFEDGAWMMVDINCSRIRPKATRINISLPEAL
VRRIDEYAKAHQMSRSGFLAKVAEEAMRKKHN
>Noc_0821 Pyrrolo-quinoline quinone
MTGCELLQEYLPEPDTSEPPMALVEFVPEVGVEVLWSAKVGKGTRDRYLR
LPLSIIGDKVVAADYGGQVSAFDAQTGEKAWEIKLDLPITGGPGGSDDLV
VLGTEEAEVIALDAADGSPVWRTSVSSEILSAPSVADGVVVIRSGDDQVY
GLDARDGSRLWAYQHNVPILTLRGVAAPIVVDRKAIIGLAGGKLVALSLE
DGQLLWERAIVVPRGRTELDRLADIDSKPAVYGGYLYTVTYNGRIAALWL
ADGDILWTREMSSYAGIGTDGESLYVTDMEGSIWALESRTGASLWRQSKL
LRRQPTAPTSYQNYVVVGDSAGYVHWMAKEDGHFVARIEVDKKGIVNAPL
VLDDILYVNSQGGILKAIKIAG
>Noc_0867 Cobalamin adenosyltransferase
MGYRLSKIYTRTGDQGTTGLGPKQRVPKDHPRIEAIGTIDELNALLGLLL
AHSIPHSLHQSLTPIQHELFDLGGELSIPPTLIIQASQVQALERLLDNLN
EELPPLKEFILPGGTIPAGICHLARTTCRRAERRLVSLAKEESVNAESLK
YLNRLSDLLFVMARELTRTAGAKEILWEKHR
>Noc_3061 Pentapeptide repeat protein
MREAKAMTEPQIKSDLLYLLLRNGEIEEFNCRVASGESCDLTYCDFLKAD
LRGLKAEGLDFSGSYFRQADLRGIDFSASRLEGASIHGAHIAGVYFPLEL
KPEEISLSLCYGTRMRYLKVD
>Noc_0379 conserved hypothetical protein
MAGQIDTLIEHLQQPGIYHHAVENLTMIETHISWVVLTGPYAYKIKKPLD
LGFLDFSTLDKRRHYCDEELRINRRLAPEIYLEVVPITGSTAQPHLGGTG
IPIEYAVKMIQFPQQTRLDYCLQRGDLSPKQVEDLAGKVAAFHQNVAIAP
QDSPYGAPKTIGQPALENFQQMEIFLKEAEDQKKLARLRHWTEKKWQQLQ
AEFTGRKKAGFVRECHGDLHLGNIALREGKFIIFDGIEFNENLRWIDVMS
ELAFLIMDLENRGRPDLAHRCLNSYLEHSGDYPGLAVLAYYQVYRALVRA
KVTAIRLGQTPPETQAIKERCRDYLNLALHYIQPARSFLLITHGLSGSGK
TTLSQPLIERFGTIRLRSDIERKRRHGLKPRERLNKGIGIGMYSAESSHK
TYQHLQQLAQTILKAGYPLIVDAAFLKQQQRQIFQDLANKLNIPFAILDF
HCDPQQLQQRIRERQHKNQDASDADLAVLEHQQATQEPLTKAEQAITLAI
DTSQTQAMETVIQQLQVLTGEKAPRVK
>Noc_1336 conserved hypothetical protein
MTEITSKTFTKERAPEIERVFGDLGKHWGWLLALGMLFIVLGTIALGMSV
TLTVVTVLFFGVLLSIGGIFQIVEAFKCKGWRSVLLHILIALLYITAGVM
LITEPIAGSLALTALLGGAFVATGVLRIVMGFHLKGTGVRWGWVVFAGVV
SLVLGGMILFQWPVSALWVIGLLVAIEMLFHGWSYVMIALAAKSIR
>Noc_0984 conserved hypothetical protein
MEYETSRRIFYPPIKYCFIILFLLSLLTGCASTPPASTANLCVIFEKQPE
WYDYAKESEKKWGTPTPILMAFIHRESSFRGDARPPRRWFLGFIPLPRSS
SAYGYGQIQDPAWEDYLGANGGWFKSRADMEDVLDFIGWYNHISARRLGI
SKRNPEHLYLAYHEGHRGYQRGAWRRKPHLRRVAKRVARQARMYGSQLKS
CEDRFKCRKFYQFWPFCR
>Noc_1607 Sulfate transporter/antisigma-factor antagonist STAS
MAENKILHAVHDGVHVLRLLGDMRYPLSPSLDHFLQRLFSEVTPQGFVVD
LTDTRSIDSTNLGLLVRIAKLMQRRGQPRVTIISNREEINEVLTCLGFDK
VFNIVAHGGGEPLAGKALALEEANDVAMLDTLLKAHQTLMNLNEQNKDLF
RDVVAMLEKEQADKSSAPNA
>Noc_2578 Protein of unknown function UPF0149
MADIYSKLLEALRNLDSYADLPEIHGSLCGFLCARKNQTAKTWAEEIAPQ
ASNNDKKKLEALFEFTEQQINSPDLDIRLLLPDDEQSLNQRTEALASWSR
GLLYGLGIGGLRKETVLDGDTQEFLKDAAKIAKMVHFPSNDSQEDEAAYN
EIVEYLRMGLLLLYESLHPADSGPSL
>Noc_1372 conserved hypothetical protein
MESITSLALDRWLHFLAGITWIGLLYYFNFVQAPAAAEAAKDQGGPGPAA
IGKYITPRALLWFRWSALVTWLSGAYYLMAAPHYSLHGAFTLGIGANNIA
LTTIGMGAWLGTIMLFNVWVLIWPNQKKVLGLVVATEEEKAKAKRVAFLA
SRSNTLLSVPMLLFMGSASHGLPF
>Noc_1870 Peptidoglycan-binding LysM
MALAKAVITIEHTGEEIPVMFNPEEYTLNKDNNFASQTIPGLRSPLLQFV
NGNLRTLEMELLFDTYLPHTTEPKPPSDVREETQKIVKLLDIDSNLHAPP
VLTISWASLQFRCVLARVSQRFIMFLPDGRPVRARLTVTFNEFIDVEREA
KEVNLQTADFTKVHVVVRGETLSGIAGQFYEDPQLWRPIAIANGIADPRA
IVVGQSLQIPSLPFIDPDTREVVV
>Noc_2379 Invasion gene expression up-regulator, SirB
MMTVYLTLKYIHLTSVVTTLVLFVGRGLWMLSSSHYLQRRWVRLVPPIVD
TVLLASAIGLTLVLHQYPFVNDWLTAKILALIGYIVFGSIALKYGRTRSV
RIGAGCIAMILFTYMVWVARSKTPFPWLL
>Noc_0961 Protein of unknown function UPF0182
MSRWKRRFLILGLSIVVLAVFLLIAGFEGIPFLVDLWWFSAQDYGFYFWQ
RALYQLVVFIQVSIVFFLIFFVNFWVASHYLGSERPADIPPGAPRKKFKN
LFFRFQTGSLWIYTPLSLVLSIIIAWPLFQQWESFLLYLVAPDMGIQDPA
YGKNISYYLFSFPIYVLILQRLLISFLLLLASLTLFYWIENRLLSQHGKR
LPTGAKWHLSILVLMVFFIEIWDFFLQRNGLVYSEAHQPLFSGPGFVEMR
VILPFIWLSMFFLLGIAFFLLLFIHKRKGLKTLAVFSLLFILSLGARHFH
FLHWATQEYIVKPNELSIQKPFIENSIKATLDAYKLSNVEVRKFERKKAL
KNIFNAKVQDLLHNVPVWDKELVGQVYRQLQQLRTYYGFPAENVDRYMIN
GKKQQVFLGARELNYDKLPSGAQKWVNEHLLYTHGYGLAMSSAGQGQGEA
SMDWFIRGIPPESDEGFNIKQPGIYYGRGSYRYAIAPNENREFDYPKGNA
NVMTDYQGKGGVSLSSLFEKYIFSVYFQDKDIFFTTQTYDKSKILFRRNI
IERIETLTPYLLLDAAPYLVATTEGLYWIQDAYTASTWYPNADVLGLMRF
EFPIPPKGSEKINYIRNSVKIVVDAYHGTVNYYIFDSSDPIIQAYSRIYP
GLFKPKEQMPEDIRAHIRYPKDLFEIQMGIYAKYHQTDLEIFYQQEDMWT
FAQKYNRESETRLRPYYLTLDLIEENRLDFLLFLPMIVKGQDNMRAMLVA
GSDDPYYGKLIAYSFPKGELVFGPAQINAVINEDPKVSAQFTLWNQDDSK
VVLGDMIIMPVDQVILYIQPVFLTIKDDVRIPKLERIIISDGRSVVMEPT
LMEAYAKLKERIETEGAEGESH
>Noc_2392 conserved hypothetical protein
MNNGTVIAIMFSFTFAGGAMAAGDSQQALAIQSAGQIYSVPQAAYQPDRK
STYKVVFDLTQAAVKPNQLSPALERVARTVNLYVNAGVPLQHLKFVAVAH
GAATSIALDEAHYREQYGTANPNLPVIEELRKAGVDIAVCGQAVAEHHYA
YEWIDSRVTLALSALTTIMDLQQQGYALTPL
>Noc_1035 Protein of unknown function DUF190
MRTKDVMMVRIYLTEAEGHLDTLLKRLHDWSQVQGVTVFHGIAGFGPSGP
MHPISPARISGDLPVVVEFFDEPAKAEVILETLSHIIKPGHVVCWPARII
VEDK
>Noc_2733 conserved hypothetical protein
MMPRLIPAGALALWILVPPSTAWSQEEIHTTHTDQKKVAITIYNNDLALI
KDQRQVALKKGHNTLAFMDVSTGIQSETALLRNLTHPQGFSIAEQNYEFD
LLTAEALLQKHVGQQVGVISRHPTTGEETIEQAILLAANPGVILRFKNRI
ETGISPDRLVYQQIPQDLRERPSLILQLQTEVADKQALELSYLTSGLNWK
ADYVAQLNQTEDRFDLNGWVTLTNQSGISYQNAHLQLVAGSVHQVTPQVH
YAKRPAQEARTFADVPPMEEEGLLDYHLYTLDHTTTLKNNQTKQVALLSA
HNVIAQKQYELRSHTPRLYYGHSQSFPVLKPPVMVYLHFDNTQQAGLKLP
LPAGIIRVYKQDSRGNTQFVGEDRINHVSKNASAKLRLGQAFDITAKKQR
TEYRKLDSDSFEATFKIELRNGKKESVLVKVAEFIPGDWYIITESHPHQR
ETGNTALWQLSIPAEGQVTLTVSFQTRT
>Noc_2766 hypothetical protein
MARPGVTYREVAEVATTLTHQGQPPTVDGVRAVLGTGSKSTIGPLLKQWK
AQQAERVSEAETGLPAALVAVVKGLYESLQQQAQSQIDTVKEETQAQVEA
ARQAQAAAEAHQQALEKEVNRQQAAYQRLLEEKETLNIALEETQRAQLTL
QAEAEGLRERLEDRTQVAQDLKQQLTQAQTNLEHYREAARQQREQERADF
ERQNRRLEQILKENRAERHLLQASCTQLEKDQAQLQAERDQLRAENASLH
EQGGQLQQKLEQTQRQWAEGEGRYQVLAQTHQEVITKLDAQEKAFLEAEK
QKARVQEKVAALSAALQNAQDKIEGLQNENRVLAQEKAAIEGQFKQRQQT
>Noc_0857 hypothetical protein
MSRLFFLLVLSVFPLLAGASEKIGVLVVAQDRGAVGNQELAAATADLDPD
YPVRLLLIGADHQGIENGYAAYIKKARSFFHEHGVNKIIAIPLFISEDDY
LLGLFHGKIEAAMAPANFTWTPALGESYLLREIVLDRIDSARKTSKVEQL
VLWLSGASDTTSAANLQLLGERLLNDIRPHISSSKTAVAVTYSKTAAGPG
AEAAETEAAAIVNRFAKIGQTLIIPLAIGPKFTPRMSLETSLARRYAHSG
AHISESIMPHPAVRTWLQRMINTHVPVTEKTLGIIIMPHGSTMPYNDGIV
AAMPSIVKRYPTAFAFGMASPFTMGQAVNELEAAGVRHGAFLRLYALPRH
LHEASDYILGLRPMPPAHSHGGVPERVRTPIGFVTTGGYQTDPLISEILK
DRILEVSQNPLQESVLLLSHGSRSDETNAADLALITEHNIAAIKATLPSP
FRDIRAMSLREDWPDKRAKAVREIRAFIEQANKGEGRAIVISNRLYGSGP
YATYLDGLEYVMNGQGLIPHPNFTRWVEKSLRKSIATLKAGAQGQATEVA
AGHPRLSH
>Noc_1906 Smr protein/MutS2-like
MSRKKKPVSTHDRELFREAVKDVLPLNQDKIMPFQRYLAPIPQQRKRDEV
RVLQDMMSDTFEAVELETGEELLYLRPGVQKRLLRQLRQGKFSIGAELDL
HGMNVPMARQALAGFLKECEKNGIRCIRIIHGKGRGSYHKEPILKGKVNT
WLQQKDEILAFCSARSTDGGTGAIYVLLKRRR
>Noc_2133 putative transmembrane phospholipase protein
MLYLVSKILIGSKHQIPTNNSNRSIRQRKLAFLGIFIILLLGLGLAWCWT
PLSEWLEPEQLAVWAQGLAASPLGGVGVMMGFVLGSLIVFPLSAMIVATA
LIFGPVTGFIYALMGALSAALATYAVGYAVGKNIVRQWVGWRIHWLSEKL
SQQGILAVIFFRVVPLAPFTIINFVAGASHIKIRDYFIGSLLGMAPGIFI
MVFFVEGLIAALHKPGILSFVFVILLGFLLLAITTFGRYWLRREKDERKE
KT
>Noc_0037 Protein of unknown function DUF185
MRRVQKDFFPHPEPIALAHSQKLENVIQTTIEQAGGQIPFARFMELALYT
PGLGYYMAGLHKLGTFGDFITAPELSPLFARCISRQCQQIFELLGTGDIL
EFGAGSGRLAADLLSELNLSGNLPERYFILELSADLRHRQQETLYQRVPL
LASRVSWLDRLPDRIDGFILANEVCDAMPTHCFQLENGYDWERYVGYEKG
KFVWKKGPLSHPLLKDRIAKIRLLLKHVNSYESEINLAMEGWTTEIAHRL
RKGMLLIIDYGFPRHEYYHPERMMGTLMCHYRHQAHPNPLIMAGLQDITT
HVDFTALAEAGHSSGLRVAGYCTQADFLLACGLDKLAATEIAAGEKQALE
TSQQIKRLVLPSEMGELFKALALTREINQPLLGFNLRDRRARL
>Noc_3001 Protein of unknown function YGGT
MPNDYLTIPLIFLLNTLFSLYILAVMLRLLLQWVRANFRNPVAQFLITIT
QPLLRPLRRFLPPMGNIDTASMFLLLILTMVKLTIISSLALSVPPLPVLL
LASIGDLVSLTFDVFKIAIFIQVILSWIAPTTYNPVTILLYDLTEPLLRP
ARNLVPPIGGLDLSPLVVLIALQVASMLIEPWFPRII
>Noc_0579 Protein of unknown function DUF152
MENTVHPLTIIPHWPAPANVRAYTTTRSGGVSLPPYHSFNLAEHVGDTPE
AVKKNRFHLYQFLSLPCEPSWLKQVHGARIVSANYGPGQQGDASIAYGPG
PVCAILTADCLPLLLCDQKGTRVAAVHAGWRGLAADIIGATINALDIPGE
HLLAWLGPAIGPQAFEVGPEVKQTFLDQNDDHALAFNANRPGHWLADIYQ
LARLSLTKRGVRSIYGGQYCTVADPDRFYSYRREGITGRMATLIWLAAA
>Noc_2411 conserved hypothetical protein, DoxX-like
MSTQMSTTTANSGPTFFNRLAPLSYPLVRVTTGLLLMPHGAQKLFGWFGG
HGLTATGQYFASNLGMEPGFLFALLAGLIEFFGGLALVLGLLTRPAALII
AIFMAVIIPVHLPNGFFWSNGGFEYPLMWGLLALAIFFRGGERYSLDRRL
GLRI
>Noc_1100 Addiction module toxin, Txe/YoeB
MRSLVFEGNTWTTYENLRERDKKLHKALRSVLKEMLRGDPTTGMGKPEQL
KHSLSGLWSRRLSRKDRIVYKFDDKYIYIFAIGGHYDQF
>Noc_0462 Protein of unknown function DUF202
MPEKSKQQLATDRTELAFHRNLLAEQRTFSAWMRTGIAAIALGFADIKLL
AEAEPKWAVYAAGVILIVIGMAIHILSFWGYYVTFRALKEEGLPGLPIWS
VVLITLSLFIAGLLILILLLAGLIDSP
>Noc_2243 conserved hypothetical protein
MNYNTSSALLHLNKSTVKDAARGQWRTILSPLGVPLPNHGRHAPCPTCGG
KDRFRFTDKHGNGEYYCNHCRGGDGLQLLQNYHGWRFQEAIQNVAELLNL
EEETCQPFEFGKFDADKIAPSGNPQTPAPRHQEPAKPKRPPSNILQQWQM
TRPASPDHTYLKIKRVQLCPGIRLSRNYRRLVVPMRCTENQLWGIQNISP
NGKKTATKGSDKKGHFFPIKGQDDRALFIVEGLATGLSVHEATGFPVVVA
FDANNLLPVAENIRAACPDLPLVIAADNDAWTDGNPGITAAMEAAEAVDG
AIVWPEFEPGAETVAGRPTDFNDLQRLCGAEAVRDRIRELLPRGFVPGWI
PIRGERVTEAASLAGGAA
>Noc_2853 conserved hypothetical protein
MRVTPSINKVLKTTEGNLQRLLTRAKTLQRLTFEVRQCLPQALSSHCLVA
NIRDNRLIMHTDTPAQANLLRYYMPNIIKHLQQRQEFRNLRRVVVKVRPP
TPLLPSSQKQRPALSHENAVLLRNIARGMKNSHLKAAFLRLSLRSKE
>Noc_2743 conserved hypothetical protein
MKKHIEYPFEFRPLADEEGGGWLVSYPDLPGCMSDGETPEEAMANGKDAL
AAWLKAAEESGREIPNPSDLPSGKFIARVPRSLHARLTARAKQEGVSMNA
LVSSFLAESLGRREGTKR
>Noc_1108 Protein of unknown function UPF0150
MTNEFTAIFECDGDWYIAYCPEIPGANAQGRTKDEARESLAEAISLILAD
RREDMLRGLPDDRIGSRRRYARASLSRKSAGNSP
>Noc_0976 Protein of unknown function DUF1499
MEKDEMQSPPNNEFPPCPTNPNCVCSDSAIKDEKHYIEPYHLKVSPPEGW
GVLKDIISALPRTTIISASSHYLHAIAKSRIFRFVDDLEFQLRPRQKMIA
LRSAARLGYYDFGVNRNRIEKIRNQLRLQGVIR
>Noc_0168 Protein of unknown function UPF0150
MTMHKYEVIIYWSQEDKTFIAGVPELPGCMVHGPTQMAALESVNQAIELW
LDTAREFNDPIPEPKGRRLLYA
>Noc_2939 conserved hypothetical protein
MIEFEWGIAKAQSNVKKHGILFEEAKSVFYDEYARQFFDEDHSESEERFI
LLGLSDHSRVLDQSLINLYLRDCMAAGREPDLSWK
>Noc_2576 Protein of unknown function DUF710
MNDEAQPVVLHILGKEYRVVCPSGEEEALLTAARYLNKKMEEIKAGGKII
GVERVAVMSALNIAHELLREQSKKEGERELNKRIQTLRHKVEAVLEECRQ
VDP
>Noc_2656 Conserved hypothetical protein 2099
MAFIRLLPRYFYLSLGILACCIVLIFGALWTLWTVVEDEPATISRWLSTA
LGREVQVSEIALSWRGGEPRLHLTGVRIIDPQTERQLLSFREIYLDAAPS
RTSAEGEWMLRHLTIIGGELTVERKPAGCIRVYGLQDLESPCPHPPRLPA
WVLNLTQLTLVDMAIRWLDPVLERKPLELQVRELQLRNEEDFHWVEGKAT
LAEKFGSEIAVKAKFTGGGIRDPRSQGVLYLKGEGLRLASWQASDFLAGL
HFIHGRGDLELGLRWGKAALHQVLAKFDWENLKITGNTQTAEAPGSISFQ
RLAGTAYYHPLAQGWSLKVPRLVVIRDERAWPATALQVKMEREPVSKLQG
RLTFLQLQDVVPLLQFSNQLDPQLRLFLAKTKPVGELRNIKAVLPLGGEE
AMSWKFSATAAQFGTRPWRHWPGYQGLSFELQLMNRGGQVRINSRNAQIL
SGYLEKPVAVDALGGRLSWVRKGKDWQVRVDDIKLQNQDLALRGGLRLEA
SSKVDTPHLDLSLMLTKIDLAALPQYLPSPAMKPGLVRWLKQAFPQGGAA
EGSLVFQGPWSHFPSSRDGSQLTVELDIRGAKLNYARGWPVLERVAAHLH
SDGQYMTIHANEGQMFSARVAEATVQITDLGAAIVPLTITVRAQGGAEDA
WRFIQGSPLRKKYGEFLEGIEVTGDTDLKLNLAMLLGNKKKRPKVEGELI
FTNAYLRQQNNPFLELAAIEGSLHFTPAGLKSENLVARLLERPIKVELLS
LPRQRGGTEVQLSLEGHLEARQLAEKWFPSLSTWVHGAADFIADVTFKKA
RPDEDKVTIVRVNSGLKGVKVALPPPLAKPAEQRRKLLWEFRGTDRDKKQ
VTLVYGDWLQGVFEMTGRGTSSRLLRGEVRVGNNSDPPVLSRPGVWLAGA
LPKLSIGAWWETLKRAGQGDSSQLLRLNHIALQVDQMELFGGYFEDVMIE
ANRMPSRWQAQVTGPSLTGWIRIPGEGSEEPLAVDLEHLHLHSIASDSSL
SSVNPQTWPAFDLICRQCRYKGYDLGVVKVYASPHADGFRLEELEIVSPT
LQLEANGDWTIRGETQWSHVDIKAYSPSLSQLLTEAGHQANVLGGETEME
LIVGWPGPPTLFSLERLEGSMRLTMDKGRLLDVEPGAGRLFGLLSITTLP
RRLALDFSDIFGKGFAFDRVKGAFSIREGNAYSKDLVIEGPTARVQASGR
IGLATQDYDQIIAVTPQVSSSLPLAGAMAGGPVGLGVGTAIMLADELAGK
IFNTGMEQLLTYYYAISGPWKKPVVTRAKNFFSSKERE
>Noc_1592 hypothetical protein
MKHTEYPYQEKSYEDKSPAEIEAEIEQTRADMNNTLHALERRFSPGQLMD
QTLSYFRGAGEETNEFATNLGRNIKDNPIPVALLGIGLGWLMMGGSERSD
HRYSRIYRHSTDRPARRPIVTPSGEPATATATSQSAGSYRDQARQTAHEA
RERAGGMAHAAKEKISETAQRTREQTKHAADEAREKMGDVTEAARYQAQR
AKRGFTYLLQEHPLVLGSIGIALGAALGAGLPPTRKEDEWMGRERDELLA
RAEATGREQLHKVEQVAETAQAAAREEAKRQNLTPEAGKEELEKVKHKAE
RVAEAARGAAKKEVKRQDLGSSPHSQS
>Noc_2955 Protein of unknown function UPF0060
MPELKTVGLFLITALAEIAGCYLAYLWLREDKTIWLLVPCALSLVAFVWL
LSLHPTAAGRVYAAYGGVYIVMAILWLWVVNGIRPTTWDLVGSAIALLGM
AIIMFAPRTT
>Noc_1589 conserved hypothetical protein
MFFDDWSGIGRVLMATLVAYGALIFLLRLSGKRTLAKMTAFDFVVTIAIG
SVLANIALSKSTPFMEGMTALVVLIGAQYLISSMCIKSKKIEHLVKPQPA
LLLYQGQFLQETMRSERVTEADILLALRQRGHSSVKDVEAVILEPDGSFN
AVINETLPASESALRNVHNYRPQDHEVTSESFSQY
>Noc_1683 Conserved hypothetical protein 698
MSGVNRRWYSGMATTEDWWAVWLGLIMFFAGLASIWGWNLVGWMTKTGTW
VWGDFAWSKALKVSSYQGWHPLLSLLVTYLVFTALTCLGAAAMKLDLKRF
FLGWTFLFMLTWVVWIIGHEAHFKASVNQFDQYGLSWGLSLGGGFSYMLA
LAVGLIIGNFFKGFAEFLKEAAKPEWFIKTAIVYLGIKIGLMSIEAAGFT
FELAITGIAATFVAYLLVWPIVYALSRRVFRLSREAAAVLSSGISICGVS
AAIATAGAIRARPVVPVMVSMLIVIFAMIELVVLPGFYTAVAPNQPIVNG
AAMGMTVKTDGADAAAGAILDELMRANAEVNLGVVWQEGWILTSSIITKI
WIDMFIGVWAFLLALVWVYKVERQPGQSKVGVMEIWHRFPKFVLGYLVAW
FVYMAIATLGPDLNEAATSGAEAVEGPMRKMMFMLTFVSIGVITDFSKLR
GMGKLALLYAIALFAIIAPLAYGVAWIFHHGMMPPTA
>Noc_0463 Protein of unknown function DUF1362
MSTPIGKTCWAIAEGYIPPSSTGPAPQMTSHETACILNATDQEAQIAITV
YFSDREPIGPYQFTIPARRTRHVRFNELNDPEPIPKDTDYASIIQSNVPI
VVQHTRLDSRQAELALLSTMAYASE
>Noc_2915 conserved hypothetical protein
MAQSIQVKRVTVTKEAKKVIDQLREKHGDLMFHQSGGCCDGSSPMCYEDG
DFKVGGSDIKLGEVYGCPFYMARDQFEYWKHTQLTLDVKSGRGSSFSIEI
PMGVRFIIQSRMFTEEELQALEKQDPLSA
>Noc_2136 conserved hypothetical protein
MYQKVFSLVKIGLEKQVPALGLGVFRIFLGLVILQEIVFLYYFRHLIFDT
IPFIDVASPSIYFFLILWGINTLFLTTGYHTRLAAIVNYFFWVIFTAFTP
MWRDFDGGFDQFMIGSSFLLIFLPTERAFSLDNLRVRLKFLKSELHHDPV
STVSVLSYYLPLAISLGLIYFDSAVHKLFAEHWRNGLGAWLPLTMPYYIS
AIDMTWFLNQEFLQKFIGYLIIVFEFIFIFTFYLRSFRVPLMITGISLHS
GIILSLNIYPFGFGMLVYYFLMVPFSWWQGLKKTLQFKSPQLVVFYDQQC
PLCNRTRIIIEHFDIFKAINFEGLQKRAKKYPELNNISEEQLLKDIYALD
QKGHLYVGIDAYLQILLKMKYPALAGIFIRIPGVYHFGKKIYRRIADQRA
RLTCDESCFVSSENSLQEAYSFKRSYEYYAGTKKQRSNRITKFLVLIMLL
QLNSTIHYGIFYRLNEDGAESEIGQILSPISNAVLFLSHAFLGITPHALY
MHDHFHGYNHILALTYKNSQGQEQWLPFVNEEGRLVAPNWGRVQSMWANV
AVTPHIEQRRLYKFIKKMTAFWGKKIDLDLQDTEFIIKMKRIDVPVHWER
NLRNKNINRPWVNIGRVIWHKGLARIEIQDINLESL
>Noc_1543 conserved hypothetical protein
MRLSLGKRKAQPRQEKSSPHLTKAAKGIINRTNTFRQEKGRQKLTVSPKL
KEASRYFAEFMARTDQYGHNADGNQPAERASRYGYRYCIVSENIAFQFDT
AGFTTEELIQGFFQGWKSSPGHRKNMLDPGVTEVGVAVARSKQSGYYYAV
QMFGRPKSLRIEFQVVNNTATAIQYELGSQLFPLPPRAIRTHQLCRSENL
RFHWPDQQENTFVQPNNGNRYTIRREGQKFRVRKE
>Noc_0304 conserved hypothetical protein
MVVSKTAAPSAASSIPANEARRYEVTRADLPLSCPMPSMALWNSHPRVYL
PIEETGWERCPYCGAVYVLKEE
>Noc_2112 Protein of unknown function DUF455
MDIMPLVWGKATLGEAALHCLKVCDPEEKTKVTCDVAQLWKAGCLQIGFP
PKSCLPDIPGRPVWPSLVAPRELPRRKLTTVLGRAALIHAIAHIEFNAIN
LAWDAVCRFHDLPGEFYDDWVQVALEEAYHFCLLQDHLHSMNHEYGDFPA
HDGLWEMAQKTAHDPLVRMALVPRVLEARGLDVTPGMIERLQQAGDLRAV
LILEIILRDEVGHVGIGSHWFRYLCESRELNAETTFRDLVNGYFKGKTRG
PLHREARLRAGFSEAELRMLDKELNDSRLC
>Noc_0121 conserved hypothetical protein
MNQQTFRANIDVRSIRPQERHPLIFRTFYELEPGEALLLTNDHDPKPLYY
QFQQEQGENFNWQYLEEGPEIWQVQISKLG
>Noc_1708 hypothetical protein
MKPGLTRSLNGKGGGWLPKIALIRKILASRLGVLALLSLALMACATLSNY
ERPRVYLTDFQFREASMLAQNFLLRLRIDNPNDAALSINGMEVSLNLNGH
PLAQGLSNQSFSVPRFGSAEVEMQVTTTFINLVQQMLSLQGQRVLNYEIA
GQLHLARALGFGRRVFPFQEKGVLDLKTMGNRDISFNYPPGRR
>Noc_1007 Protein of unknown function DUF475
MRHFKYSFLVTVAGLVAAFLWGGPAGLFIAAILGVLEISLSFDNAVVNAS
VLKDMDPKWQARFLTWGILIAVFGMRLVFPVAIVAIVADIGILEVTQMAL
NDPDAYSAHLLASHVDISAFGGMFLLMVFLSFLLDETKELHWWGLVEEKL
AGIGKLESIEIVIALGVLWALQSFLPPEEKLDAMLAGISGVMLFVIVGSA
SGLFEVEETGEAVTHAAKRSGVMGFLYLEVLDASFSFDGVIGAFAISKDV
IIIMLGLAIGAMFVRSITVYLVRKGTLSEYVFLEHGAHYAIGALALIMLA
STKVHIPEVVTGLIGAAFIGLSLLSSIRYRNKH
>Noc_1251 conserved hypothetical protein
MSAHHAEQAESPLDNREKLTTAVDKLELVLPCAAPLYDFMHLNTMQGYHH
ISFAEAMAAHFELTGIRGYLPEEDFRKHYARGRIDDADLNESLANNTHGR
NREIVLKVSGRPINKQDIWRISLIQDINPLSPSRFRWQIKEYDALERFQN
GVPKSARDTLLNATQETDQNRRTESQAIRDLWEACLRVFQLENPNLHSEE
LGELVDLEEFSRSQAEGKQSKFRTSQPTLIPQEEMLAEARKDLHHLVDKV
GEELSLRGLLQALTGEDLLDQVRPILIRFCASHLDEGFTAWSLPERGQGL
YAAWRKCPFAELGLDLARLPDWQSFHAELPEHSVDAVIACLERLKIPESR
WEGYLKRIAVELPGWSGLINWRHHRPKYKPNRKAPTSLMDYLAIRLFLDV
IHIEQVTQNTWGIAGNLEELKTYFENYLWEFSGRYALFSNTLPEYLAIRA
QELIALPRTAQKDRENWRTVSNMIHNWKHNPSAERTKRQTVHSHVWRLFC
LAQHLGLPGNEVGKLSSSEAEQLLAILDELTTSERGYIWLCAYEYHYRED
YFAALTQNHGRGRWANRNERPEAQLIFCFDDREEGIRRHLEEVNPNLETL
GAPGFFGVPIQWRGLDYPDTTPHCPVVVTPVNELHEEPRPEAKKRYGLHK
RLYNFKQFLLRVLHNKTRRDLLTSKVLIDVLFPGMLAVLAGKVFFPFQQA
SLKRKATAALVPPVPTQLKLTVPDDGTEATPDNLRVGFTDAEQAERLAAF
LRAIGFTSGFAPLVVLSGHGSMSENNPQLASYDCGASGGRHGGPNARAFA
AMANRPEIRARLAEQGIHIPDDTWFIGTEHDTCSESFPWFDLDKVPANFA
PALKKLKAEVDQALLLSAHERCRRMASAPRKPSLQQARRHVAERGTDFSQ
ARPELGHATVASALIGRRSVTRGIFLDRRCFVLSYDPTIDDAEGTILEGV
LKNAGPVGVGINLDYYFSAANNQGFGSGSKVAQNVTGLFGVMQGIDDDLR
TWCSYQMVDVHEPMRVLTVVEATTETLTAIYKRQPSSQEPAGGSWLLPPL
RQLIDGGWLLLAAIHPKTGKISVFDPKQGFIPWKSYREPSPLPVVERSMD
WYDGYSDPRPPALVEPKQTETHHAA
>Noc_1650 HesB/YadR/YfhF
MAITVTASALKQIKKVLSQQENVEGLRIGVKKSGCSGYAYVLDFAKQVKS
EDTIFDHDGVKILVDKKSLNFIDGTELDYRREGLNESFKFQNPNVAGTCG
CGESFSI
>Noc_1454 Protein of unknown function DUF883, ElaB
MADNSPKKEVDELKDELSQLRKDMGSVVSAVKNLGQSTARTTKTKAEQEL
DEMLEKLNRAYLSAREGGERAVTSTYSEIERHPLASLGVAFVAGLIAGKL
FSQK
>Noc_0497 membrane protein-like
MLFWHKGDSAILEINDKRYQGCRRNQEREARIPNSFWPVGFRATGNEPGW
IMEIIKKGEIHLLMDYGKTRIPLPEMNPTTSGESTIYETKTGTHRLRILI
TPGTCIDSMSGEQFQAQVILKLADKTYRGCGHFLE
>Noc_1385 Protein of unknown function DUF885
MRKNFSPFLGSAVLLFLTLLLLFPGIPETRAAENNTHWDAFVHNFVEKYF
AANPDFAVRAGRHEFDGKLPDWSPEALAKEVARLRSERQRALAFEVASLT
ASQRFERDYLVAWIDKDLFWLETAEWPYRNPAFYTQELDPNVYLSRPYAP
LEERMRAYIAYAEAIPAAAKQIRHNLRTPLPRTYVDIGEKVFGGLAAYYE
RDAPAIFSTVENERLQRKFRAANRHAIRAMKELQQWLQTQRTNATSDFAL
GAPLFRALLREAEGVKISLERLEQIGRQDLKRNLVALQKACGNYAPSKTV
SECIEKARAVKPEKGPVEEARRQLQKLKEFVIAKDLVTIPSAEQAQVAAS
PPYMQWNFAYIDIPGPFDKGLPAIYYVAPPDPAWSKAEREDYLADKADLL
FVSVHEVWPGHFLQFLHSNRVASPLGKLFVGYGFAEGWAHYVEEMMWKAG
LGHGDPEIHIGQLLNALLRNVRYLSAIGLHTQRMTLEESERMFQEFAHQD
VGTARQQAARGTFDPAYITYTLGKLMIKKLREEWTATRGEREGWRVFHDK
FLSYGGPPIPLIRKEMLGENAGPAL
>Noc_2413 Extradiol ring-cleavage dioxygenase, class III enzyme, subunit B
MSENRLPSLFIPHGAGPCFFMDWNPPGTWERMEAWLRGLTGQVGATPKAL
LVVSAHWEAAVFTVNAQAQPGLLYDYNGFPAHTYRLTWPAPGAPALAAEV
GALLAAEGFTVAEDHERGLDHGVFIPMKLAFPEAEVPVVQLSLRSGLDPT
EHLKAGQALAPLRRQGVLIVGSGMSYHNMAQLRHGGPAIDPASQHFDAWL
AETVALPPGQREPRLTEWSRAPGARDSHPREEHLLPLHVVAGAADGGPGL
KVFEDQVLGSVQSAFLFGDSSPPP
>Noc_2224 Protein of unknown function DUF344
MPRSINTDTFTFTGKKAVKLNEHETKIKNIYQNKADYRKLIKASQKKIND
LQRMMYAHDRYSMLLVFQAMDAAGKDGTIRAIMSGVNAHGITVHAFKEPS
AEELDHDFLWRTTVRLPQRGRIGIFNRSYYEEVLVVKVHPEIVQSMQRLP
ANRTDNLEELWRQRYTSIRDFEKHLWHNGTRVLKFFLHLGRDEQRQRFLD
RIDEPDKNWKFSEGDVKERKFWDDYQQAYQDAINATATKNAPWFVVPADD
KKNMRLIVAQIILEQLKSLDMEYPEVTPERRDELQQFRKKLLQA
>Noc_1899 conserved hypothetical membrane protein
MKLATIISIVSVLTILPTMVLAGDKPPKDSKPLSEIVKSLEEQGFKQISE
IEFDDDKWEVDVYKDNQKLELEVDPSSGKILSNKVDD
>Noc_0451 Glutathione-dependent formaldehyde-activating, GFA
MTISGRCLCGTVSYECSSDPVLQFNCHCHDCQKSTGAAYAPVMFFKREDL
HINGPLSYFESFGGSGKKIRRGFCPKCGAQVIGDAEIAKPLISIRAGTLD
DTSLYNPRADIYCSQAANWDCMSKDLPKYPAMIESQNV
>Noc_0123 Conserved hypothetical protein 46
MRVPRVYFPLSLSIGSSVSLDERALQYVIRVLRLRLGAQLRLFDGRGAEY
QAVLETIEKRAVKVRILERIEHHVESPLHIILGQGISRGERMDYALQKAV
ELGVSRIIPLLTERSAVNLSAERAEKRLRHWQGVIISACEQCGRNYIPPV
DTPRPLADFLRDDHRGLAVLLDPRSRRPLKALPLPLDNRLIVLIGPEGGL
NKGEAKQAQQADFIGVCLGPRILRTETATVAALTALQLLWGDL
>Noc_0058 hypothetical protein
MLGENILCVGRRWMWPCRQPCMDFNRLVAAIPQAHDELAKQACQAVYGEK
VLSELTARLSDIPPIGILLCTDKNHALAEYARAGMDNSLFVSKYLLELPK
KEEMQAFIKRQVKEIGDVG
>Noc_1842 conserved hypothetical protein, DedA-like
MNSLFQKIHHISRKKPLITFSVGIILIFIGLYFFWPSLHDSINRFFNILF
SGDREKLHDFVEQFGFWGPIIIILSMVAQMFLVVIPSVVIFVVSILAYGS
FWGGTVALIAVLVASTVGYFVGRWLGPITVDKLIGLKTRHKIEGYVERYG
FWTVIVIRFSPFLSNDAISFVGGLLRMNYWRFMAATFIGILPLIVLLAYL
GETNERLRTGMFWVSIISLIVFIAYIIYDHRRGDDSSTRVKQRNSVQKR
>Noc_2963 conserved hypothetical protein
MNDKIAPASKETKLHTKLSAARTRLIVERPFLGALVLHLPLREASPEWCG
STATDARAIYYNPAYVEWLSFEQLQFILAHEALHCALCHFARRGRRHLGR
WNAACDYAVNQLLVREGLEPPPGVLLKQDYRALSAEEIYPLLPVGAKLQT
VDQHVYDHEDSPSNAPADLNPASQDWSIHGPERHNPPVYDGRLLPSSQVS
ASGSPPPLTEMEREQLGRLWQQRTVSLAQQALQNGKLSGPLRRLIGNLGR
PQLPWRQLLAQYLSAAAQNDYSFARPSRREGPAILPRLASQQIDLAIVLD
ISGSIHDEQLQSFLTEVSALKGQLCARVTLHACDADLCELGPWIYESWEG
LTLPENLPGGGDTDFRPPFAWQAREGLYPDVLLYFTDARGPFPQAEPPYP
VIWLVKGKAQVPWGRRIQLN
>Noc_2090 Protein of unknown function DUF488
MSKIALKRAYEQADKDDGCRILVDRLWPRGIKKEDAAIDEWLKEIGPSNE
LRKWFGHDSEKWPEFRKRYFQELEKNPEAVEPLTEIAKNQRQVTLIFSAK
DSEHNNAVVLKEYLEEKLSS
>Noc_0791 conserved hypothetical protein
MARLFTLLYDKTMGWSRHRHATRWLVLVSFTESSFFVVPPDVMLAPMALA
SPNRAWYYAMLTTVASVFGGLLGYGIGALAFSLIEPLLQQWDYWDSYLKA
RLWFEEWGGWAVLLAGFSPIPYKVFTIAAGVAAMPLALFIMASLAGRGAR
FFLVAALMRWGGARMEAGLRKYVDLMGWIAVLVAFAGYLLLSR
>Noc_2790 Protein of unknown function DUF1239
MVALMLIATLTAWELLHEESAYIPRSDLNKRTTDYFMEKFTSTLMDQQGL
PLYRLAGTHMAHYLDNDTIEITAPDAVFYQQATARWKVVAERGLTNSQGD
EIDLLGEVIIRQLGADSKTSNMKILTQNVRVKPRIKYAETQQPVTLLNSF
GKTHSIGARVYLKDGRIELLSQVRGNYDLAPEP
>Noc_0085 Protein of unknown function DUF497
MGYPHSMNESHFEWDEAKNTANQRKHGVSFYEAQYAFLDPKRVIAEDLSH
SQNEKRYYCFGTNREGTGIITVRFTYRSGRIRIIGAGYWRKGKKVYEQAN
SV
>Noc_0322 PepSY-associated TM helix
MANLLKVTTSRFRWHSLWRKVHLYLALTVGFFFVLLGLTGSVNVFHWELE
ELSLPALETRESPKAGLPLNAVMANLHQAHPQRQGRWLLFMPGYEREYVW
AIYPHPEETRDVFFAPLRVQLDPGSGKLVAEHFWGETLGTLIYSLHASLL
TGIIWDRDMGLIGFQTVTFLGLFLLISAAIGLYLWWPRTGTFLKAMRFQR
QGRVTRTHFELHRLVGFYGSVILLVLAFTGFSFGYYDYLKPLVAAFSPVE
AKHFKDPEGLKSTPVPGTQPITIEQAVAIANQVFPNAELRWLATPEGPEG
VYAIEKRQPGEANQRRPRSKVWVEQYSGEVLAVEDPNKFTAGETFFNLMW
PLHNGQAFGFPGRLLWCLVGFVPLTLYITGLTLWLRKRRVRRLARHKGMA
ATVGGLWL
>Noc_1358 Uncharacterized conserved protein UCP006173
MTEPEFWKHKPLQAMSSEEWEALCDGCGKCCLHKLEDEETGEVFYTSVAC
RLLDLHYCRCTDYQNRVRKVPDCLSLREELTEALKWLPSTCAYRLINEGK
ELFPWHPLVSGDPETVHQAGISVRGVAIPESQAGDLEDHLVDWSVKD
>Noc_1188 probable signal peptide protein
MLFRTPPLRLTLLVPGLAQALEARAIEGEGARLPFLECIIGQADVEALTT
PLYETLLFALFGISQSGTMDVPLAPLMYSWDKGGGSPEPGWWLRADPVCL
HPDRDRLVLFGPSHLQLSRTESQSLAKRVAPLFTEYGWQFQALEPDRWYL
RLPQPEQVTFTALTALEGKYIEPGLPSGPNSSRWRTLLNEIQMLLHDCPI
NLEREKQGLPLANSVWFWGAGEAPSYPIPPLWRQIGWDHNPLLQALAAYC
EIPGRPVPEGATVWLAQNSTIPGDYLVGLDSLLHTPDSFPCAGALQALEE
NWFSILYAALRNRQLASLTFYPMNGYRYHLTWQRSWRLWRRPRSLIKSVG
G
>Noc_0538 conserved hypothetical protein
MTLLDPLFTTSFTFALLLGLLSGLHCVAMCGAIIGTLTLSLPREIRSDKK
RLLTFVFAYNLGRVASYITMGFIMGLLVGLLGSLSHPLVFGVSGHDILQA
VSSLIMLGTGLYLANWFPRFAVVERIGVPLWRRLEPLGRRLIPVRTRTQA
FVFGTIWGWLPCGLVYNAVAVAATTGSGTQSALTMLAFGMGTLPTVMSTG
IFTAWMAHLASLNRLRLVAGLVIIAMALVNLLAVGFYRG
>Noc_0684 Protein of unknown function DUF891
MFDRFCRDECEQLGPAIPLVTTAVHRMEQGNFSNVKGIGAGVNEYRIGFG
PGYRIYFGKDGDRLVILLAGGTKKRQDADVAAAKGHWRDYKRRKRKEVT
>Noc_1819 PepSY-associated TM helix
MYRLLQKTHLWIGLMAGALFCISGLSGSVLVFDDELDAYFNSELWQVEPQ
PGPIRLNEATTKIRSAFPSSVLLYARLPREPNHSIEYWVKDEALQRVYID
PWRLDILGVRGEHAGLLGFLHDLHVHLLAGAQGLLANGILGLILLLMVVT
GLWLAWPGWRRLLKTLRLPRTEARVARWFALHRSVGLISLLLLFSSALTG
AAMVFHEQANAALIALFGGPSLPEPPPIKASPLQSVSKSPAELLNIAESA
VGDERATWLQFPAQPSVPFIVRLRHPDNPHPNGTSYVALDATTGEILMAH
DETRSGPGQQIADLKYPLHIGTALGLPGRLVILVAGMMPTLLFVTGVYTW
WRKRQKRRARLGKSPSPSNA
>Noc_0339 conserved hypothetical protein
MAILPLNLYTAAQVRELDRCTIEEFGISGAMLMERAGKASLEQLRKHWPQ
AQRLVIICGVGNNGGDGYVLARLARKAEMYVSIYQLGDNSKLSTDAQAAR
QMLLDSGMEILPFQPQVLRAADVVIDAIFGTGLSRGVTGQWADAIEAINT
CGQPVFAMDIPSGLHADTGNILGIAIKAQVTATFGGLKQGMFTHLGPDYC
GEIAFDSLSIPPEAYHGVTPSARRITLEDHITKLPSRAKAGHKGDYGHVV
IIGGERGMPGAARMAGEAAYRVGAGLVSIATREKHASLLNLARPELMCYG
VESAEELKPLLNRATTLVIGPGLGQDLWGQTMLAEALNHSHPLVVDADAL
NLLASQPRQHNRWIITPHPGEASRLLNINIEEIQADRFAAVQALQQRYGG
VAVLKGNGSLVCSTNHPLGLCTAGNPGMASGGMGDVLSGTIAGLLAQGLT
LNNAAHLGVTIHAMAGDRAAREGGERGLLASDLMEHLRQLANLQQIGP
>Noc_1353 Propeptide, PepSY and peptidase M4
MVRLSVISKTTKRPWTVLLVILTLALGMSGTLKARDLGHEEALRLRQTGT
ILPFEQILARALKAYPGAQLLESELEREDGLYVYELEILTPAGVVHEIEI
NANNGQFIEDEIED
>Noc_0956 Conserved hypothetical protein 255
MIYSMTAFARQEAQSDVGTFTWELRSVNHRYLDISVRLPEELRFIESQLR
TQVGCRLKRGKVDCTLRYLPPSEQAPKFSLNEQVTRQLVQLCEEIEALSH
NPAPLNSLEVLRWPGVLQTPPVDGEQLKIGALSALEEALNQMLETRAAEG
RRLAAFITQRCEEIEAIIVRVRAHLPQAMLHFRERLLARLEVVQADLEQG
RIEQELVLFAQKSDITEELDRLQSHMAEVREVFQRKEPIGRRLDFLMQEL
NREANTLAAKSADVEISQDAVELKVLIEQVREQIQNIE
>Noc_0875 hypothetical protein
MKNQKESIRKMVSYLNNDEKDGGYWLPNIQRPFVWSEDQIERLFDSIMRE
YPISTLLVWRTKAEIRHRKFVDNYKQGLRLTDFYVPENN
>Noc_2712 hypothetical protein
MVEREKALLYYFAYGSNMAVERLRARVPSARRIVTARLMGYALRFHKKGM
DGSGKCDVVSTGNPADCVHGIVFEIATAEKPALDRVEGPRYIQQNVQLAL
ADGNWVDAFLYVVADKRRFTDAAMKPFCWYKYHVLYGARSNGLPGDYIAR
IEAELEQPDGDDQRRAHEMRIYAKGAACGP
>Noc_2791 OstA-like protein
MIWRLSLKLTAVGFIISSNLSIIETALALSSDSEQPIHIEANRGELDDRE
HVAVYSGNVHLTQGTLRIDSDTLTIYYTPDKKLEKAVAEGQPAWYRQRPD
NSNEDIRAKALRMEYHADTATIHLLQKAQVWQGTNEFTGDRIVYDTERDI
VRGEGSETGVGRIHVTIHPADDSPSTGESTPENTTSTTSPPENKKNDRQE
LTDGRTTTWLKLRTGPGTDYPKVALLPPRTQVAILGRQKKWLHIATLVKG
ESVEGWSHMDFIRLSPKREDNRIVP
>Noc_2508 Protein of unknown function DUF444
MTTVFRAYSQSDALRSDRSAKDRLRHRQKVRKAIRDNVADIVAEESIIGQ
SRDRIIKVPIRGIREYRFVYGQNTPGVGTGQGDSEPGQTVGQVPQGDGGP
GHAGDRPGMDYYETEITLEELIEIMLEDLELPDMERKRFREVLSERTSKR
KGFRRVGVRVHMDKRRTAKSRIRRRLASDKDAEDNETKHRFPFHRDDMRY
HRLREDMRPQSNAVVFCIMDTSGSMDTLKKYLARSFFFLLYQFVRSRYVN
VDVVFIAHHTKAREVTEEEFFHKGEAGGTFISSGYSKALEIIQNRYHPSL
WNIYAFHCSDGDNFDSDNAATLKAAEVLCQVCNLFGYGEIKPRPSGFYEG
TMLDLFRSVRMDNFQSVLIQRKEDIWPSFRQLLSRESESSKYDE
>Noc_2660 Iojap-related protein
MSTVELKEFVVSALEALKGHDIQVLDVRKLTTVADYMVIASGTSNRQVKA
LANEVIERCEAAGYRTLGVEGESYGEWVLVDMGDVVAHIMLPQVRAFYNL
EKLWNIPSAQGVKEQ
>Noc_2659 Protein of unknown function DUF163
MNIYVIAVGQRLPRWIGEGYQEYAQRLPRQCSLSLVEIAPARRGKSGHPT
QWRQDECRRLLAAVPPNSKVIACDERGQSWSTEEVARRLQIWMNEGQDVV
LLIGGPDGLAPLCLEQATGLWSLSSLTLPHGLVRVILAEQIYRAFSLLNR
HPYHRAS
>Noc_0831 hypothetical protein
MTPVRIRSFFYSTAAIFLLSSFYSLAIAAQDREKPSLASLLEETVTLPSQ
EATIMVHRTQFPVGFKTPEHTHKGPGPRYVIKGKVKITEGGETHTYQAGQ
AFWESGLPMTLENIGSGEAEVVGFELIPIE
>Noc_2719 Ribonuclease BN
MRDLSRRVQFILAQPLCFLSRVISGFRANQGLLLAGAIAYYALLSVVPMF
ALILVGLSQIIDPEQLLEATHDYLALLAPGQAEILTQQIAEFLENWKLIG
VVGFLVLLFFSSMAFTVLENAMSVIFFHRVAIHRRHFLVSAILPFCYLFL
LGLGLFVVTLVSSALHTLDDRTVPLLGNIFGSYYAIGIYIVGLLGEVLLL
TSLYLVMPVGRIVFRHAFIGGMVATLLWELTRQALVWYFSTLSFVNIIYG
SFAAVVMILLSLEAAAIIVLFGAQVIAEYERLDTQGERNSSFQT
>Noc_2056 conserved hypothetical protein, DedA
MEHLQWLVPYLEHYGYGILFIGNLLEGVLIPMPGQLLLIGASLMAARGDM
QIYLVLFAAWSGAIAGNLLGYGLGYYVGRQGILRYGERLKIVNPTRLSRI
EHYFDHYGSGLVVIAPFFELLRQLNGFLAGTMGMPIWRFILCLILGVTLW
VGLWGIGAYILGEHVQEAFFLIKKAEPYVIGLGVSILLATMLYLLRRRRC
RQG
>Noc_2546 Protein of unknown function DUF820
MGLGADSRIKFRYEDYKSLPESEIRRYELLDGELVMVPSPSEYHQRLSRN
LGFLLWEYVQERDLGQIYSAPLDVVLGQGSGREIVQPDIFFIAKARASMI
AETEIRGAPDLIVEILSPATARRDRTYKSTLYARYGVKEYWLVDPESRVI
ELLTLGPRGFERVACYGEREVLRSPLLPGLRLALTEVF
>Noc_0317 Protein of unknown function DUF484
MRKKAGEKETRKTPLEEVQEREQSIAKYLQTHPDFFARHPDLLSELTIPH
PSGAAISLVERQLALLRGQNRELKRQLRDLIENAMVNDDLSKKVHKLALG
VLTAATPQAAIEVLFSSLQTGFDVDVIALRLFFDGKSLPPPAPDHLNVMR
VSRNAPELEVFASVLKSPQPICGRLTVEQRDYLFGEEAERAVSCALIPLG
EEQRRGMLAMGSQEPNRFRADLGTIFLDYLGVIVERALHRHWPY
>Noc_0404 hypothetical protein
MQLPDFIKDVVLLLLGGVGTAVWFFWRRRTEQAPVFENIHKAEKLLSLRK
ELDNTNYTFEDLKSLEDALMERADVAKVLSISYEEEAKQVREIEMSTAMT
QAEMNIVAGNAYQSAERKLDAILEQIKEFLSPEESALLDEANQAWRTYQK
SHADFSASQYEGGSIQPLIYASTMETVTVARIVELEAELKFMKDTRVPNA
EQDAL
>Noc_1343 Protein of unknown function UPF0016
MIRKNRAISSAWFFLLGSINKWRITTLEYKVLLTVFVAVFIAELGDKTQL
ATMLFAADKEVSKLAVFIGASLALIVASGMGVLAGGIISQYISEKHLHYI
AGVGFIGIGIWMLLKA
>Noc_2138 Protein of unknown function DUF77
MRATAELQMIPIGSEISVRPEISRVIEILQEHDFILETHASGTNIEGELE
DILSVIRKIHETLHQEGAVRLISYLKVETRTDKIPTIAGKRL
>Noc_1060 Pentapeptide repeat
MSIPPNEETLHLHKNELRDILTAHQQWLQSGGKEGQQANLDGIILRGADL
KGVNLQRANLTLACLEQVNLENANLQECTLILASLKGANLINAKLRGANL
DSSKLQAADLRGADLSAANLEWTDLSHANLYKTNLRGAKLGNANLKGTKG
LADKSLG
>Noc_1447 conserved hypothetical protein
MANSNISQNNGVKSTASIAGHPIHPMLIPFPIAFLVGALLTDIAYLATAD
FFWARASLWLIGGGFITGALAAVFGLTDFLTIPKIRAHKAAWIHFLGNAT
VLLLALANWLMRLTEMTTPILPWGLSLSALTAAILLVTGWYGGELSYRHK
IGVTGHTIK
>Noc_1110 HicB-related protein
MNVLEVDGFKAKIEFDPDLDLFRGEILGLNGSADFYGKSPASLRKEFKNS
LKVFLEVCEEKGIEPTKEFSGKFNLRIPPRLHSEISAKAAASNKSINQWV
VEVLEESVNE
>Noc_0083 Protein of unknown function DUF497
MDEFEFDEAKSQANLDKHGINFVAAQGLWKDPCLLEIRAKSEGEPRFLLI
GKIGEKHWSAVVTYREACIRLISVRRSRKKEIELYES
>Noc_0098 conserved hypothetical protein
MKKTRLPLQRKAQLHPSKEEKTPLSQQELLRISQETRDCFLPPASPDKTQ
LGLMVVNPYRVYAYWSILNQDLIRAHKKLGIKKSAQWALRFYDFTAGKLA
ISSSFEVAIPRKAHHWYIDLWADNKHYMAELGLADSEDQFVALARSNMIE
TPRAGSADPTAPVIFTGHDLQPFHPAKTFPLEFSVSPEERAVRSQELTAS
YFSFLAESSQIQPKENMLSKPKSLSQWSASLPKRRY
>Noc_1689 conserved hypothetical protein
MSYVIIEKATIYHERRSGERRVGSFPQFNYWGHQGRRGYIRRREDLANSY
LDKYPASLWWLIVAVLILCFMDAIFTLSLLQHGAKEINPLMAELINFSIP
LFAGVKMAVTGVGLVGLIIHHNFIVFRIFRVQQFIYGFLLLYSGLVVYES
FLLLPEFLLY
>Noc_0874 hypothetical protein
MCLKIIKAKMLILDGQQRLQSLFIGLKGSYEKKELYFDILSGESAAPEDV
RYRFRFLSNYKAVFPWICFKEIVFYNGMPNRLADEVIAKADRELSDNDIA
RTTENVWKAIQMFVQHEALSYQELDSVDNPDAYTENDIVEIFIRANSGGT
KLGKSDLLFSLLTSSWEDSDERMEELLDELNRPGYHFTRDFVLKTCLTLL
GKGASYDIAKFRDGRTREAIIERWDEISAAISDVKDFLQGKTFIRTDRAL
PSYLGLIPIIYFRFKFPGKWGNISNLDSYILQTLLTGSFGGRPDTLIDKC
TRRIDEISDFDVTELFGVIRADGRSLEVTRDTILNTRYSSKDIHLLFNLW
YRDFNYQPAYENNLPQVDHIFPQSLLKTVKDENPETGRRNILRYKTGFRD
QIANCMLLRTQENGAGGKSDALPMQWFRDKDDEYLDLHLIPKDKTLWELD
RFEDFIEARKRLILEKFQYLLQ
>Noc_2242 Protein of unknown function UPF0150
MKYLIEVFWSDEDSGYIALVPDLPGCSAWGATPEEATREIQDAMTAWLEA
CQQSGESIPKPAAKARYVA
>Noc_0568 Protein of unknown function DUF891
MPSGSMVYAISKPELVLVRVERLAAGNSGDVKPVGEGVSELRIDYGPGYR
VYYKKQGKKVIILLAGGDKRSQASDIKTALRLARNL
>Noc_3013 Peptidoglycan-binding LysM
MNLKKLISIMFLYAMTSSVWGEPVTLNPDAPQRYEVVRDDTLWDIAGRFL
RDPWRWPDIWQVNQQINNPHLIYPGDIIFLSYRENGDPVLELQRGLPSYK
MSPAVRTSKLEKAIPTIPIDAIQQFLLHPRVVSKETLDNAPYVVAGTEER
LILGAGDEIYVRGLSKSAIIRYGIYRGGEVYRDPDHPSKILGYEALFIAD
ANIQRFGDPAILSIQNSTREILVGDRLLPTSDLEFEQHFLPHASPTPVEG
KIIAVIDGVSQIGQYQAVALNLGVEDGMEKGTVLAVYQAGRVVRDPISRL
PEDKVRLPDERAGVIMVFRAFNHISYGLMMGAERAIHVYDVVRNP
>Noc_2426 ATP-dependent Clp protease adaptor protein ClpS
MGKINLDEDNEGGFAVDTAKPKLKRPPLYKVVMLNDDYTPMEFVVKVLQA
FFAMDREKATRVMWQVHTEGKGICGVFTYEIAETKASQVNEYSMNHQHPL
KCVLERA
>Noc_2298 Protein of unknown function DUF62
MIGLFTDFGILGPYLGQVRIVLQQQAPNIPVVNLMADAPRFSPHASAHLL
AALGQHLPPGTVVFAVVDPGVGSSDREPVIVCADDRWYVGPGNGLFDVVM
GRSLTAALWRITWRPPSLSATFHGRDLFAPVVAKLARVENPRGEEFLGQR
WDEAVVTKDPGDLEEIIYIDHYGNAMTGICGTKFRRGHVQLPNGDLISAA
QTFSDVDMGDAFWYVNSLGLIEIAVNQGCAAELFKLDVGVPIKFI
>Noc_2039 Protein of unknown function DUF198
MNPIVPAVERPIEDVQAKADERQIAINKVGIKDIRHPVRVSDRNGGEQHT
VANFNMYVDLPHHFKGTHMSRFVEILNQHEHEITVRSFREMLREMNHRLQ
AESGHIEMVFPYFVTKQAPVSKVQSLMDYQVTFIGEIKGDKPQITVKVVV
PVTSLCPCSKQISDYGAHNQRSHVTVAVRIEGFIWLEDIIDLVEEEASCE
IYGLLKRPDEKYVTERAYDNPKFVEDMVRDVAARLNQDDRVSAYRVESEN
FESIHNHSAYAMIEREK
>Noc_2122 Protein of unknown function DUF150
MWGDRRISELVEPVVAALGYELVGVERLSSVGKGALLRIYIDTPSGITID
DCERASHQISALLDVEELMASAYTLEISSPGLNRPLFTEEHFKRFTGVEA
SITLSKPLNGRREFKGLLRGIRGDRVVILVAGEEFELPLEGIKKARLVPE
C
>Noc_1566 Protein of unknown function DUF820
MVKMSWAEICQDPTLRDLPYKIQTDKWGNIVMSPATNEHGIYQAKIVALL
SKLMNEGTIISECSVQTSEGIKVADVAWASEKFMQNNRGKSPFDEAPEIC
VEILSPSNTKMEMEEKKELYFARGAKEFWMCDKKGSISFYKNTGPLEHSN
IIEGFPGTLSV
>Noc_1868 conserved hypothetical protein
MPITLYESGEERKTNKEKTSVVTGTVINNCDLIKQGKILVRIPSLDQELW
ARLTAPGASSGAGLFYVPRPDDEVLVVLSGDEPVDAYIIGGLWNTQDSPP
VSNPLEASTKRVIKTGLAGGVGHEVEFDDGPGQSISITTSTQQKIIIDPF
KIELRNTAGTLKITLDNKTQTISIAAAASLELAAVGNIKLKAANIEIGDI
VKTAKTSINGKMVSIN
>Noc_0939 Peptidoglycan-binding LysM
MADFKRGDSNYVRSAYRRPGSERPEYGRSKNRGQIWSVIIVLLLIVVLAG
GAVWMYLNGKGEEEAVASKESEVASPQMAESNTTELEPSQQGGTEEEFAS
TAPSEEDEFASSTPSEEEDEFASILEELGVGSEPNEEEASTPSTVEPQAE
GSAVEGNTVIGTAPEEPQFFDEEEDQFSTEEFTIPEQEQPEEAVGEQAEQ
ATGQFRGTERQVEETQKRGEANNKMKEAEEEDFAAVTPSFEEAPTRSQAE
GSFPQTVTVQSGDSLSVIADRVYGDAGKWRLIYEANQDQLENPDQLLVGM
KLTVPDPND
>Noc_1867 hypothetical protein
MPTAARIGDMTSHGTPLTPLVPGVMGSLNVFIGGQPAWRAITDVHVCPVS
NGPQPHVGGTVLKGSTSVFINAFPAARQGDEIVEGGGGLTKSPWDFPLFR
LEDKRDE
>Noc_1921 Protein of unknown function DUF1232
METSLGTQEYSDNTFWEKVNIYAKSAGIDIIETALKLYYALQDDDTPKWA
KTVIYGALIYFISPADALPDLLPGGYIDDWGTLLSAAAAISIHMKAEHSE
KANTKIKQWFRK
>Noc_1445 Protein of unknown function UPF0057
MDLVRIIFAILLPPLGVFLQVGLGGQFWLNILLTLLGYIPGIIHAVWIIA
RR
>Noc_1202 Protein of unknown function DUF86
MTQSWQPYAKHILDAIAKVRRIEARGDLTQDEVLYDAVLRNLQTLSEATQ
LLPEEKKASCPEIPWREISGFRNILVHNYLGEIDPLTVKTVITQHLPPLE
ACVRTLLINAGETEL
>Noc_1370 LmbE-like protein
MEIGMGGTAAKKAASGSHITSVILTDGRRSPNPFGWPEETLVEIRKQEAT
RAARVLGIKEVIFFALPDLKNTSHYHSAKERLSELIKQLRPEEVYSLHDH
WDRHPTHRLAGQLTRKCIEETPLFIDALWAYEVWGLFSRWDRLEYIDDQI
GKKMQAIGEHKSQLASIPYGEGILGLNRWRAVFADPQQTTPQGVFAEVFF
TMKL
>Noc_1506 Protein of unknown function UPF0150
MAAKGLHGNANCSKASELVWNKRTRLSRKLQEEALNRAHYEMIEDDEPYY
GEIKELRGIWATGKTLEECRRNLKDAIEGWLLLSIRRGLPVPKLGDYEIK
EGEDVMA
>Noc_0698 conserved hypothetical protein
MVISGRKVDGAALSTTQGLPQVRVFNKVFIEDNVFTSHGSVAPIFFSGEE
NIEKQKQVEKLKGDLDQSEKEDREKDTEKHRSEKALDDFKKERAKSIKDL
LSSSGGNNPYNNYDKRSYQVKYDELLKLSATEQKAKILSESDFAVQKQKK
EVGPAG
>Noc_1293 Protein of unknown function UPF0150
MNRKLTAIIEREDDGYVALCPEVDVASQGDTIGEARNNLKEALELFFEVA
SPEEVTSRLHEEVYVTYVDIAVG
>Noc_1809 conserved hypothetical protein
MNDTEQQFLKSLDNKLWKAADKLRANLDAANYKHVVLGLIFLKYVSDAFE
ERQEQLLALFKDESNDIYYLSPEDYDGDADYQQALRDELEILDYYREANV
FWVPKAARWNTVKEKAVLPVGTVLWQDDAGNDVKLRSVSWLMDNALEAIE
KSNAKLRGILNRISQYQLENEKLLGLINTFSDTSFTKPVYGGEKLHLHSK
DILGHVYEYFLGQFALAEGKQGGQYYTPKSIVTLIVEMLEPYSGRVYDPA
MGSGGFFVSSDKFIEEHAKEQHYDPAEQKKHISVYGQESNPTTWKLAAMN
MAIRGIDFNFGKKNADTFLDDQHPDLRADFVMANPPFNMKDWWSESLADD
ARWQYGTPPKGNANFAWMQHMIHHLAPTGSMALLLANGSMSAHTNNEGKI
RQRLIEEDLVECMVALPGQLFTNTQIPACIWFLTKDKAGGKHPSPSSRGE
PAKHSFPSSRGESAKHPSPSSGRESAKYPSPSGRGAGGEGKEGKESKRDR
RREFLFIDARNLGYMRDRVLRDFTLDDIAKIADTFHAWQRIPPLPLGESR
GKGQTPPKLLRFARELRKNQTDGENLLWQLLRNRQMANAKFRRQQPIEDY
IADFYCHEHRLVVELDGSQHLTPEGRQRDARRTQRLQEIGIQVLRFNNRQ
VLTETEGVLESIYNTLTLSLTQTLSQRERASTPRVTLPRGDDRGSEAIPA
GENQHILPAGEGQQNTPLPLTGESQQNTPLPLTGESQRNTPLPLAGEGQQ
NTPLPLGEGPGVRAKSAKRATTYQDIPGFCKSVSLDDIKKHDFVLTPGRY
VGAPEQEDDGEPFAEKMMRLTEQLREQFAESDRLEAEIKRNLGRLGYEL
>Noc_1482 conserved hypothetical protein
MTDISVAIALHVLGVVWWIGGLALVTTVVLPQLRSDPVNALERFHTIERR
FAPQVRIAVLIVGASGGWLLYRLQLYHVLNEPAFWWLPAMMALWTLFFLM
LFILGPTGVLRHIMSGPLNADLTRRLARMHLLHTVLLIVALVIIAGAVAG
NHG
>Noc_1357 conserved hypothetical protein
MQNRVITPFFLLLLFIVFFSVWVRPLTAATVVSGQAHGKANELEAQQALE
MDASTKAVNLRQLLDLEAIIPKLARHQVIFVGEQHPRFDHHLNQLAIIRG
LHGIHPKLVIGVEFFQQPFQQYLEQFVADQLTVEEFLKKTEYYDRWRYDF
RLYAPILEFARKNSIPILALNVPTELIQKVGREGLEGLSEKERAQLPSEI
DRSSVAYRERLQEVFENHPQHFGKFETFYEAQLVWDEAMAESASRYLKDH
SDSHMIVLAGNGHLAYGVGIPERLNRRLDTTASVAIVLNDWEGLVEPDIA
DYLLLSEKKELPKAGFLGVMLKQSSGKLEVNAFSEISAAKTAGIEEKDEL
LSLNGRLVSDMSDVKEVMWDKKPGEEVLVKVRRGAFMGKDEELEFEIKLK
>Noc_1098 Protein of unknown function DUF497
MDMTFELNGTTFVWDSKKAHSNFAKHEVSFEAAATAFFDPFFRLEDASDN
DEVRDALIGFDGYHRLLYVVNIQIEESGIRIISARKATREERQRYDQ
>Noc_2344 Protein of unknown function DUF1243
MYLPSILLAPIEASVNASLRLDPDTLAQVAAISGQCIAVELRGLDLQIFI
EPTAEGILLATSSDSPPVATLSGTPLNLLRMAITPSDSSPLLTGEVQIHG
DIELGRKLRTLLQGFDLDLEELLSHYTGDLLAHQIGNRMRGFKAWCQRAA
GTLGQDLAEYLREESQLLPNHAKVTTFLDEIDRLRADSSRLEARVQRLQR
LL
>Noc_0450 Protein of unknown function DUF891
MIEVVKSETFNCWLKKLRDRRAAVRISARIDRLAFGNPGNVQPIGEGLSE
MRIDYGPGYRVYYMQQGKVLVLILCGGDKRTQQEDIAKAKRIAEEWKG
>Noc_1131 conserved hypothetical protein
MLHIMRRIQDLKSFTLHAKDGELGRLREVFFDDSSWRVRYLVVETGHWLV
GRQVLITPEALGSIHEEKWTLQIELTREQIETSPLMDTKQPLLSRQQEEE
YHRHYNWLPYWRSGLLGYLGRSYENKDKGEETEDPHLRSSDEVSGYQVEA
RDGEVGYVEDFVIDEKDWVIRYLEISTHRGWFGKKILLAPTWIEQVDWED
RKVRVDLLREVIRSAPEYEEGQLIGRDYEINLYSHYGRNQYWE
>Noc_1196 Protein of unknown function UPF0125
MASVKMMTVEVAYARPDKQVILKASVPENTTLEAAIQASGILEQFPEIDL
DKNKVGIFGKLSKRDAPLCFGNRIEIYRPLIADPKQARRERATKARQAK
>Noc_1299 SEC-C domain protein
MEQLTNCICGSGIPYTECCGLYHSGKKNAPTAEILMRTRFTAFAMENEAY
ILETWEPAKRPLRVNFPKKGTQWKRLEIVEKKKGGSQDTKGIVEFKAYYL
LEGEEYAVNEISRFRKGQGRWYYLDGAVKSIAKVGQQTNRGKNAPCPCGS
GKKYKRCCGKSK
>Noc_2059 conserved hypothetical protein
MTGLRLGGAPVHPMLVHFPIVAWTTALLADGVFLITGQPSAWIVAYWALA
AGALTGLAAMVPGWVDFLLLDRTHTALPAVQRHMLYMSTAWGIFLLDLLV
RTRVPPETMPVWLAGLSLAGFVLLAIGSHRGARLVYYHGVNVYRDPV
>Noc_1105 Protein of unknown function DUF497
MRFEWDLSKAENNEKKHGVSFSEATTVFGDPLELTISDPDHSEDEYRFLL
SIGRSSVGRVLIVSYTEREENSIRIISARRATKPEQKQYESQH
>Noc_1073 Ribonuclease BN
MPTSKQIVDRRGRKARHPGEIPTKGWRDIAYRIKDSLDDDNISIVAAGVA
FYALLAIFPALVAMVSIYGIIADPADVQRQFDALSGILPTEAQVLLSEQL
RRITSQASTALSVGVGVGVILALWSATRGTKAFIIALNIVYGEKEKRGFL
KLNAIALMLTLGAIVLAILALGMIVVLPILLSYLDLPEIFQVLASLLPWL
LLAFTFILGLAVLYRYGPSRSEARWRWVSWGAVAATVLWIVGSALFSFYV
ANFAQYNKTYGSVGALIILLMWFFVTAYIILLAAEFNAEMEHQTKVDTTR
GKPQPMGERGAYVADTVGKPYGDETGAP
>Noc_0410 Protein of unknown function DUF497
MYTFEFDERKSNGNRRKHGIDFVEAQALWSDPYLMEIPALTSDEERFLVI
GKIEGKHWSAVITPRNGNIRIISVRRSRVEEVAIYEG
>Noc_0762 Protein of unknown function DUF820
MQVHKRHAFTAWEYHQMAAVGILREDDRVELWDGEILEMSPIGSRHAATV
DRLTAWLSRELGGRAIVRVQSPIGLSRYSEPQPDLVLLKPRPDFYASAHP
GPGDVWLLLEVAESSLEYDRDFKLPRYAEAGIVESWIVDLPGQRLWVYQQ
PVGRNYQSMREYEPSDSLSPQAFPSASLALREILECG
>Noc_2677 Protein of unknown function DUF343
MDKKLLEILACPVCKSSLIYKKADQELICKACRLAYPIRDDIPVMLEEQA
RQFDPEEEI
>Noc_2974 conserved hypothetical protein, DoxX
MSMNWGMAADGLGKLILRLTLGILVLFHGINKITYGISGIEGMLQGIGLP
ASIAYGVYIGEILGPILLLLGWYARLGAGLIAINMLFALFLAHRPELLNL
TPQGGWALELQGMFLFTALALILTGPGRFSLNNR
>Noc_2507 Stage V sporulation-like protein, SpoVR
MASHWTIEDLKYWDDKIREKAEEFGLSCFPQEFEICDHTQMLGYMAYSGM
PAHYPHWSYGKAYEKLQTLYEHGMSGLPYEMVINSNPALAYLVQENSLCL
QILTIAHVYGHNDFFKNNFTFRDTQPELTLSNFKLRADRVRGYIEDPSIG
LHKTERVLDAAHALSLQCSRNQAIRKLSASEQKEQVVAAAHPTHDPYQRL
HKQPEYVEPDLNRFPLFPEEDILLFMRDNNLYLADWERDLLTIVHEQARY
FIPQIETKIMNEGWASYWHHKLMNSIDLPQDLYLEFLVHHNQVVRPHPGD
INPYYLGFKLWHDIFRHCQVSAEEEHVGKPRKGGREGIFQVREVDRDASF
LRRFLTEELMREMDMFEYQPKGEALVISKVSDEEHWREIKTTLLKNVGMG
SIPVIRIEDADYGRNRQLYLKHDHDGRDLQAEYAEKTLSHLYQLWGRDVW
LETRMNDRKVCMGYGEDGFVQKTLGRFR
>Noc_1861 Phage tail protein
MFGWALRVSYESRSQCSFSVLNRDNEWPDFQWEGLELLQDGTLRLYSIPL
LKEKLPEEIEAIAPSRAPAGITVDLDGTVYFSDPAAHRLLKIDGCNSELK
TVPCIGSKNGKPTQLNGPRGLLIPPHRRSLLVVDSGNHRIQIFDIASLQL
VAIWGQQDPFSLPQPSDAPGYFNTPWTLAADTKGNVYVVDYGNQRVQKFN
FLGEVIPDFWETLQAANLQQPSDIAAGAIGEELYFYIVAQDAKGAWKIFV
VDNNGHPVLDTSGQSIAFGEEYLEQPMGIAVDKDTIYVGDNNRRRVLTFK
KKSDTFEFAGEALGYEGPVAALALDGKEGLLIHSGIALAPLRLTLDSGYR
NKGMLWSRVIKSAESKVQWHRLHTIVDSLESGAHIQFFVHTSDQEDDPPL
VDPSSPNPFSDAKWRALPLNVSDFFIGDTPACCLWIGAVFSGDGSASPII
SQMRVEFDQETYLKHLPAIYINSVHSREFLVRFLPLFESFFNEVEGTIAH
LPALFDPNAIPKEMLSWLAGWLAMELDEDWDGAMQRQVIVEAFENYAWQG
TAEGLRRSLRLFAGVHAIIEEPNLNSAWWVLPIREEMEDKISDPSYLSWG
NEENSILGFTTRLASAEPQGAVVGTTTILDQSHLITSKEFGAPLFEDVAY
QFSVLLYRGELRCADTLLRVRAVIEREKPAHTSYQVCIIEPRMRVGYQAR
VGVDTVIAGPPSVSRLGENSAGGVTLGGEPAGRIGERNQVGLATRVG
>Noc_2437 Protein of unknown function DUF37
MKNILLSLIIFYRYALSPFMGNHCRYYPSCSVYTQEAIQRYGGFRGGWLG
LRRLLRCHPFCPGGIDQVPEIAKWRSKS
>Noc_2153 Pentapeptide repeat protein
MKVKHDKSKAEFFPGLLKQDEGGYLWDDRRSGYDRRKFKRRRPPFEQRGK
KDRRSPELPEIIRYRIARTQLRTQLEACQTSHSFKYRFIGAIILVIVVFL
GVIVFLSPVRVVKNCDMLPRPGVNWSNCLLSGKDLAAVNLSGANLHNSSF
AGADLRRISLASATLAYANMASANLAYADLSNAVLRSANLQSANLSHAKL
DYADLSYANLLGANLEGASLRGVKLDHAVWPDQRECAPGSVGFCR
>Noc_0452 Protein of unknown function UPF0150
MKYAVVIEQGESSYGAYVPDLPGCVAAGETREEAIKLIQEAVEFHTQGLK
EDGEDIPASTSSIEIIEVAA
>Noc_1157 conserved hypothetical protein
MLYPVYIHAGDENHYHGVTFPDFPGCFSGAGTWEDLPVNIQEAVEVHFEG
KDMEIPEPTPLEKLIANPEYQEGAWMMADIDLGRIRSKFVRINISLPDYL
VRRIDEYTKGQHLSRSGFIAKAVELLMQDSKR
>Noc_0490 Peptidoglycan-binding LysM
MKVYLSLISTLTDLKNTIGISMAHSRPNIHGHYRRSTPKSFVYGPAKSHG
GILSIVIVLLLIVVLATGAVWVYLGRQDEEVKVSNKDNDALVPQVSATKS
TEPGSTELAADDQKYKSVPKEKKTTPEDEKDEFAAILEELNREEEKAEFE
KSRESSPQLAREDKMPAQAPPEQADTPNKEDTFEQTSKEPYERGKVPTAE
ELGNTDSSRSKSTKLPPIQSEQPKPSGKGQEGEGINSKLITVQQGDSLST
IAARVYDDANKWRLIYKANKDKIKNPNQLSVGTKLTIPASK
>Noc_2721 Conserved hypothetical protein 155
MDFSAQATLAVCHHCDWVMTLPRLRAGETACCPRCEHKLPGQQHTSIQSQ
LAWASAALIMLAAAIAFPFVSFEVQGIKHTIIVADTALALFDYDFPFLGL
VVLTTTILLPTAYLLVLLYLHGVLASGRRPMGAQTLARLLTSIKPWVMSD
VFVVGVLVSMIKVLSLASLQLGPAFPAFCAYAVLLLKSISSFDPGTLWTA
ISGPVDPPVDLAPGSPAATQGAAGCTRCNAIVNTASQTRCPRCGYHPIAP
NPRRLQATWALLIAAGILYIPAMAYPIMITTELGRTSPQTVVGGARLLLE
TGSWPIALIIFTASIVVPIGKVLALGWLCLQAQAGTGRSAYDRLRLYRLV
EAIGRWSFLDVFVVALLTALIQAGELMRVQPSPGVVIFAIVVILTMLAAM
AFDPRLIWRVHEK
>Noc_1457 40-residue YVTN beta-propeller repeat
MKKIDGFMVALLAGGAGLMLSATDSVLAAESSGSGQDNQGLQAKLYVTLE
EPDALGIVDPKSKKGLGTVAVGGKPHDVICAPDGATAYVTNPETHNLSVV
DTATDKVKKTVEFGQGTTPWHVEISPDGSQVFAALQDQSAVAIIATADNH
LATKVSVTSGPWGVAAPKNGPWAVAAPRNDVVYVTLNGSITKGTANAARS
EDIAVFDPTAAVPTVKYVTLAADTANGPHGIVSAPDGSAVYVAAEASHEV
WKIQVDGNQAKRVVEIPDPNPAGTPLNPGFPTDLGISPDGNTLIAVNHDL
DSITVINLKTHKIIETVSTGEGSAPWGALISPDGQTAYISTNGADSLAFF
SMKELTDGTEGARKHTIADLPTSDGLAWCNLAQ
>Noc_0868 Hemimethylated DNA-binding region
MEQAKAKFTIGQIVRHKLFHYRGVVVDADPSFQGSPEWYEHMACSQPPKD
RPWYHVLVNDADYETYVAERNLDLDGSGQPINHPAVEMFFDELHEGVYRC
QRHIN
>Noc_2011 CBS domain containing protein
MPLYWYRRPRLVVLSSKSSVLEAARAMENNSIGAIVVQDHGRIVGIVTDR
DLAVRALGHKLDPENTAITEVMTPSPLMLTLADSREEAIALMQQGNVRRI
PLSENNRVVGMVTLDDLLLDEAAPLEELAAIVQAQIGEGGPTESPRSPAR
RRSLARAEATLSRLINAVQTETGLEDRQQARIAVETILALLVRRVTPGEA
KDLIAQLPSLLQSPLRALQPGPDKSITRETAITELSQQLGLDTARAEEVI
GTIAKAISPGQVEDLRRQLPEDLRSLLVLPDPIDTA
>Noc_0837 Protein of unknown function DUF427
MKAVWKGATLAESEKTAVVENNHYFPPDSINKEYFRGSDKHTICPWKGKA
SYYHIIVGDQLNADAAWYYPAPKKAAAHIKNYLAFWHGVEVRE
>Noc_1887 hypothetical protein
MISNSPLQFGPCSMPVSNTRFNRRAQRMPQNVAPPHSHRLGCFCIPPKLF
GVTQPCVSDLMRGKIELFSLASLANMATVAGLHVELHGHEPETAP
>Noc_0432 hypothetical protein
MKDNTITPDPTYKTYLLSNEWHSVIIGFFIGIFMITIYFWMASGFINLLI
HLYHSYPDNWTHEAENMIKDTVVILASFELIRVFQSYLLIGRVKVTFILD
VALVVLIGELISLWYVENDANEVLLNIFVIASLIAFRIITTKFSPD
>Noc_0182 Protein of unknown function DUF598
MKFSLDERTDVYTVSAYGSGYVEFRIPVSSEKEPERLQGEGLKENRGRQK
ICRNVVVSPGRLQEWSPASFSELEKAHFQAFLEMEPEVVVVGTGEQSHFL
SPRLIEPLLRHQIGVEFMDTAAACRTYNILVGEGRRVVAALFIIRQP
>Noc_1595 conserved hypothetical protein
MVPPFGSIPVIIPSLILILFPMFFKPFRIIFFATGIILCFYPVIAGSGEG
NTIVIKVTGIEPVEGQVQIALYNAPERWLEESFAHATIEVKGREAEWRVE
GIPDGAYAVATFHDRNGNGKADRNWLGIPKEAYGFSNNVKAVFKPPQWNR
VKFIVARPVTAISIELGYWN
>Noc_0587 conserved hypothetical protein
MNTMPQRAIRIVIFAKAPLPGLAKTRLIPALGAEGSAHLAVKLLEHAVKQ
AVLADTGPVELCVSPTRLHSIWKELTLPVSLEWSEQGEGDLGERLARATR
RITNNGQAIILIGTDCPSLNAKKLREAAQALERHDACMVPVSDGGYALLG
LNRHLPEVFSDMPWSSTAVARLTRQRMLAECWTFKQLAPLHDIDEPQDLQ
YLPENWAPRFTAIS
>Noc_2945 conserved hypothetical protein
MHVISKQPFSEAVKKYPNDRQSIMDTYNVLRRGSFSSPSELRIVFPSLDN
FKHKKRWWAIDIGGNNLRLIAAIQFVHQHVYVKHIVTHAEYDRLNSRYLK
GEL
>Noc_2594 Conserved hypothetical protein 103
MKGGLGNLMKQAQQLQSNMEKAQEELANMEVTGQAGGGMVSIVMTGRYDC
RRISIHSELWQEDKEMVEDLVAAAINDAVRQVETQSKEKMSSMTSGMLPP
GFKLPL
>Noc_0468 conserved hypothetical protein
MNKLTYLWEALRESLWFIPMCIVIGAVLLALGLIELEAAVEREQLAEHWP
TLFGVGADGSRALLSAIASSMITVAGVTFSITVVALSLASSQYTSRILRN
FMRNRSNQAVLGVFVGVFAYCLVVLRTIRGGDQGVFVPGLAVLGALLLAF
VAISFLIFFIHHIATSIQATSIIKSAATETLEAIDRLFPTEIGEATAEHV
GNAPGARVDLAAQAWITIPAQQSGYIQGIDADALLHVACAQDIIVRMEKE
IGEFVIEASPLVSVTGKSPGDETIWELNAAYTIDWRRAVEQDATYGIQQI
VDVALKALSPGINDTTTAIICVDFLGMILARLIARHIETPYHSDNGQLRL
ITRGPTFSNLLSQAFDQIRRNAEGNVAVLIRLLQSLETLTKLTVNVQRHQ
ALRQQAALIIETAERTVPMAYDRMPIQAIRDRIFPLPADESR
>Noc_1152 Protein of unknown function UPF0047
MKSYRKELWFNVPNRRGFVNMTPQVEECLGESGVREGLVLVNAMHITASV
FINDDEPGLHHDYEEWLEKLAPHEPIRQYRHNDTGEDNADGHMKRQIMGR
EVVVAVTEGRLDFGPWEQIFYGEFDGRRRKRVLVKIIGE
>Noc_1373 conserved hypothetical protein
MYLNRRMLCFAFFISALCSLSTVAFAIEEKLAIIKDNVNRRGLTLQVYTP
PLAPENRLEAKLFQDVQIFGNVASLAAGNVSGAAGDEFIFLTHGRFGSNG
LYLYTVKPEDGALFFRLLADDRSLERNVQFATLCDCDSDPEKELAVIKRL
QNGSHQLIIYDLPTRRRGNALAIAGAANIGNNIIGLSAGDMAGDSKSELV
IAQKNGNGTVRVEIYSPPSSLSDSLGPPLLSYSDLGRDIIPDGLAVGDFD
NDSEDEIALVRSLKNGTYSLDILKAPTAFEDEAPVFIASDVNIGQNVAKI
TAFKIKAAGPSFNQPPQAVINANPQKGPLPLTVTLDGSNSQDADGFIQHY
RWEFDNGQIITGPNLEYTYDTPGAHVVTLTVIDNENSKDSTQITIEVTEA
ANNNDTTDSQNLLPAEQELIKLINQERQKHNLASLKIHSALVAAAKGHSK
DMAQNNFISHRGSNGSSPFTRMADAGYRFRTAGENVAAGYSSPQAVLTGW
MNSPGHRRNILNASYCELGVGYAYQGRSTYRHYWTLTLGCR
>Noc_1737 Protein of unknown function DUF159
MCGRYTLYTSPAKIAAHFHLHQVQGLIPRFNIAPSQTVPVVRGESSYREL
TLLRWGLIPHWSKEEKSPYNLINARAETVATKPAFRGAFRQRRCLIPADG
FYEWKAEADGKQPYYIRHHDGEVFAFAGLWEHWEGETGQYIDSCTIIVTA
ANKLIQPIHDRMPVILEPVDYETWLNPNNNQATSVLTALLKSYPPEKMKA
YPVSKKVNRPTNDDSACITPLP
>Noc_1185 conserved hypothetical protein
MKQVALAIVIFYVLPLEVQAIGAVDLYEAQVPVSNQTPEEQARAVKEAFQ
KVLLKVMGNRRTLARAPLALLLEKSSSLVQKFRYNASDEENGAATFWVRF
DPLGVEQLLRQKALPVWGQVRPILLLWVAIEEGRHRYLVDADGNLPAAEI
LEEQAGVRGMPVILPLWDLEDRSQLSFSDIWGNFPEPILAASKRYPASVQ
LVGRLSRQSEDDWQARWTLYGVDKARDWRVNGEFEQVLRAGIDKSVDTIA
AQIVPATGNNSLSSVQVRVTGVTSFMDYARLFSYLSSLSQVITMEPVQLS
RAEAKFRLELRGKAEGLATSIRFGRVLVRATEGMGTNIEPNHMELNYRLL
P
>Noc_3048 conserved hypothetical protein
MVCKRYGPSEDTLRFILRPNSSLSWRGMKAVFFAMTLVLVIIAGGFSLLG
LWLIFPFAGLELLILGIAFYLSARRAQHCEIITIGQEEIEIFRGRETTGE
TWKFHRYWARVRIELPPYAWHMSRLIIGSHGHEVEIGVFLSEEERLRLAK
ELQAVCGA
>Noc_0301 conserved hypothetical protein
MPIYEYRCEACGHKMELLQKIAEEPLLQCPSCGKEQLRKLVSAVGFRLKG
GGWYETDFKSGNKRNLAASNDASKESSNNKEGPGKAKDKQEKTPGANSKP
TGDAAATG
>Noc_2186 conserved hypothetical protein
MKPEPYIRVHFLEQRLILRRERKVLFEAPVSLAKKGLGEQMGSECTPRGW
HQIRAKIGTGCPVNTVFRGRRPTGEIYTPELGQAYPDRDWILTRILWLSG
LEKGKNRFGRVDTFRRYIYIHGAPDAAIMGLPGSHGCIRIHNGPLLELFK
QVKPGMRVFIS
>Noc_0119 hypothetical protein
MDIAKKFLPLPWRAQFLAFAALTVFALPLHAESGEAPSLTPAQMQEFHKL
QQKMRTVGQQLDEIRQETLKTTPKLQEQQEEYQSLLFKTMKEQGSDPDPA
LARMREIEGQVQNEDLPEDERKQFIMEYQQKDAQLQQASRDAMQDEKVRK
MAESLSQDTVAAMREQDPKTEELLREMEQLREEMQGIVAKIKPKPQAGSG
SSSE
>Noc_0564 hypothetical protein
MGHFLFEEKTQKFNVIDGQQRLTTIVIFLSVLFAQLKSIRDLSEEEEICY
EDMVKRRSAIRFWTVDYDNQLFIDYVIDQSKSEHNGLETESSRRIVSAFD
FFKNHLSDKPEEYLTKMLLIVAGAACTTHPVQNESEAIQMFIFQNSRGKR
PSNLEIVKAQFMHHAHLHGGAKDEVDSLIGEIKGRFEKIYKSISSIEYRI
NEDDVLLYTLRVHFNSLWETNSLDKINKMLADGNPLDFIKSFSRSLSTSF
EHLSQFFGKHERENFAIHSLISLGGIAVALPFVIKAYRYGLSLDQIGKLC
WSLESLVLRHRLIGTRANIISRINDVFEKFTESNKDISPVIDHIEWMKTT
DDWWWAYWNNDRLKESIQGGINHSTAKYLLWKYEVHLEQQGKAGYSPIRF
DNIISPELEHIAPTTEPESKPHGYDDYDDEFRNQYLNTLGNYLLLSKSHN
CAVGNILLSQKLATYTHNEQQREVRSLVPKNGIWSKEIIQQRKNKIIGVI
MIIC
>Noc_1922 conserved hypothetical protein
MVNGDNKTIKVYYNSACPICNAGIKHQKSRVPGHQIDWNDVHTQSGVHNE
VSPNLELVRKKLHVMDRKGNIKVGIEAFEVIWRNSPNEHWKAKIVSFPVI
KQISIFFYNVFAEALYRWNRWKRHW
>Noc_0482 Protein of unknown function DUF1328
MFGWAVTFLIIALIAALFGFTGLAGVATHIAWILFVVGLILFVVFLLLGR
RGRPPL
>Noc_2020 hypothetical protein
MYIWFGILLVVSLPLSSTISIQNCLSGGETSEVRHEYVASQLAVMSGASR
ARNFTSPNIHTRLCECYRLPRKILTGVRNEWPIGTWEVDVHEDSETAALI
ILPGSGITSGCPPIRELPVTITPSIR
>Noc_0717 conserved hypothetical protein
MVKSKIVRLIITNFYTSIIASAILFSTPALAVTFALPSLGETVVGRNLVV
PAKASETLLDIARRYDVGYSEIKAANPDVDLWLPKEGSLVVVPTRYVLPQ
APRKGVVINLAEMRLYYFPESPTAQPSTVVTHPIGIGREGWSTPLGRTSV
ISKKKNPTWVPPESIRAEHAADGDPLPKIVPPGPDNPLGKFAMRLGMPGY
LIHGTNRPWGVGMRVSHGCIRLYPEDILSLFNQVKVGTPVNIVYQPFKAG
LKDGILYLEAHAPLPELENSEQGGLTSMVAAIVAVTEKPVPEINWELAKR
TTADRTGIATQVSGVGDGNSLSRADRDAGNEW
>Noc_1039 chromosome segregation and condensation protein ScpA
MSESSGQVPSSEASALAVVRGQPLIELPTDLYIPPDALEVFLEAFEGPLD
LLLYLIRRQNLDVLDIPIAVITHQYMEYIEILGELRLELAAEYLVMAAWL
AEIKSRMLLPRPQESAEEEGDPRMELVRRLQEYERYKKAAEDLDSLPRVG
RDIFETGVHVPPGQVTKPQPTLELTDLLFALKGVLARADLFSHHHVAQET
LSVRERMGEVLSRVEADSFFEFTALLHPEEGRLGVVVTFLAILELIKESL
LEVVQAESYGPLHVRAVAA
>Noc_0096 conserved hypothetical protein
MKVIYFEDTDTLYIKVRGSDIAESKDLDENTIFDMEANGNVRTITFEHAS
QRTNVSRLIVEGIAA
>Noc_1691 Protein of unknown function UPF0047
MVVQERLEVTTSGRGTVEITDQLQRIAAGSDIKTGLCHIFLHHTSASLML
CENADPAVRYDLEAYFSRLVPDGDPLFTHQQEGADDMAAHVRTVLTHSEL
NLPVTQGRCALGTWQGVYLWEHRFSGYRRQVTVTVYGE
>Noc_1715 Protein of unknown function DUF482
MEFKVTESIETVASGQWNALEGTDNPFLRYEFLSALERYGCVGAHVGWLP
RYLLAEDRPGSLIGAVPLYLKYNSFGEFVFDWSWADAWEQAGQHYYPKLV
VAVPFSPVTGPRLLLKPGADEAMVDELIQMTIRLAKKGGISSLHWLFPHQ
KDRKRLANCGFLLRQGYQFHWHNRGYRDFNDFLETLTSKRRKEVRRERRQ
AQTRGTEIKVIHGSEVTDLEWRAMYRFYQATFHAKGNYPALTLPFFKSLG
RTMGQAVVLALAVRTGIPIAGALYLRSRDTLYGRYWGDSEYLAGMHFELC
YYRGIEYCIEHHLQYFEPGAQGEHKINRGFLPVLTWSAHWLGDSGFYSAI
ADFLAQESRMVRSAIMELEAGGPYKRP
>Noc_0691 Protein of unknown function UPF0061
MTRTPQSRLKVWRPMPSSTAMKHQDIGWHFDNTYAQLPGHFYTKLHPVPV
HEPRLVIVNNALAEELGLNFKASSEDELAQLFSGNQLPEGAEPLAQAYAG
HQFGHFTYLGDGRAHLIGEHLTPDGKRVDIQFKGSGQTPYARRGDGRAAL
GPMLREYIISEAMHALGIPTTRSLAIATTGESVYRETVLQGAILTRVASS
HLRVGTFEYLAAQEDKAGLKQLTDYAIQRHYPEIIDSDTPYLELLKAVMA
CQIKLITEWLRVGFIHGVMNTDNMAVSCQTIDYGPCAFMDNYDPNTVFSS
IDHMGRYAYANQPRIAQWNLARFAEAILPLLHENIEKAAAMAEEAIQSFK
ALFQQEWLAMMRRKLGLFGEEKEDMEFITGLLQWMQRSHADYTNTFRDLM
DEHFPEEPHYQDQEFKHWYDRWQQRLEHNTKPFPSSLCLMGATNPVVIPR
NHRVEAALNAVEQNADFSKLHELLDVLSEPYKDKEKYTEFKNPPAPKERV
SQTFCGT
>Noc_1651 Conserved hypothetical protein 374
MRRYWILALLAMGLGLAIPFSYGGLAVFERLGQVPLWLPLLTLAMIVLGW
NFNAAKLRILVSAVDTKLSHRSALGTVMAWEFAFSATPAGSGGIVSYIYL
LNRYGVKTAHAAAIFTMEMGIDLLFFITASIIVVIKLATNVAYDIHLGFV
LVVVLLTGGMGVMWILARQYQKLLRVFGYLLKLFKVSTPRRKRAARWMLR
LRHGLMLLLGLPRRHLYAAYLFCVAHWLLRFSVLYVLLLGLGENVPWPYL
FITQVLILTLGHFTFLPGGSGGVELGFGVMLGPFLDSATLATALVLWRFA
TFYWYLIAGAPVFVAVAGPVIFQTLAQATTKKSL
>Noc_2723 Protein of unknown function DUF330
MKRSPIILVAVAFILSACATTPHQPWLYSLTTPPAGTTPPSEARVARKAL
IVNQIYAAEFLQHSGIVYKTAPNRLSVARQHRWAAPLTVQLHQYIYNRLT
TQLPSIAVYPNTSSAPLKAWQLTVEFNSFHGRFDGKAVVAGSWRLRNGNG
DVVAQSSFKQDTPLAEDGYEALVAALSKGLARVTDSIASTIHELTLEQAD
ETAQGVNIDAP
>Noc_2009 Protein of unknown function UPF0027
MKHDGTIRTNGKAIIETDNLPIKLWLEEDQMEEGALEQARNLANLPFAFK
HIAIMPDTHQGYGMPIGAILATKGAIIPNAVGVDIGCGMCSLRTNLEHIE
TPKLKEIMGIIRKTVPVGFEHHKTRQDEAWMPERKGELPIVEQEYESALY
QIGTLGGGNHFIEIQKGSDGYIWIMIHSGSRNIGFTVANHYEGVAKKMNQ
DAGEDVSQELAYIPETSEYFKLYWNEMNYCLEFALANRKLMMERARSAFT
EILPEVEFADFINKPHNFAAEEKHFGEWVIVHRKGATRARKGEWGMIPGS
QGTRSFLVKGKGEAQSFESCAHGAGRIMSRTKARKTLDLKEEVKALKDRG
ILHAIRHRKDLDEAPGSYKDIDEVMANQVDLVDVQIELQPLAVIKG
>Noc_2534 conserved hypothetical protein
MNFPANSDLWTLFGSSFLAATLLPGGSEVVFALLASQDVYSPSVLIGVAT
LGNTLGGMVSLGMGWLVARRYPLRTLARPSHQRAYRWLRRYGAVGLLGAW
MPVIGDPLCFVAGWLRLNVFLGLFFMAVGKGARYAALWNVVG
>Noc_1908 conserved hypothetical protein, RDD
MKATNHHLPSTPGLFRRLGAIFYDSLLLIATWFLFTIFALPLTAGEAIPA
GNILYRLYLLLIALAFFGGFWTHGGQTLGMRAWRIKVQQPNGQLITWRQV
LLRFGSAFLSWLPLGAGFWWVFIDKEGQAWHDRLSITTLVLVPKKTKVNK
SGAA
>Noc_0020 Protein of unknown function DUF556
MNRWLASARNLEEASYLLAEGPDIIDFKEPKRGVLGALPPETVRQAVALI
GGRCQTSATIGDFFVESSQISQGVLEIAATGVDYVKIALFSNTQQAADCL
ISLQPLAARGVSMIGVIFADKQPDFSWIHLIKQAGFKGIMLDTAVKDGRG
LLSHLSLPELDNFIKVARNANLVSGLAGSLSIQDIPKLLSLKANYLGFRS
ALCTAGNRCYRLDPKAVLSIKRAIRENPRIVEN
>Noc_2079 Protein of unknown function DUF423
MAALKILLGRTTIIVAKFLISTGAFIAALGIAFGAFGAHGLRAKLEPRML
EVWQTGVEYHMYHALGLILIGMISHWLGSSALIKWSGGLMLAGILLFSGS
LYVLALTGAGWLGAIAPLGGSSFIIAWILLAVALLRTS
>Noc_0137 Protein of unknown function DUF28
MAGHSKWANIQYRKGAQDAKRGKLFTRLIREITVAARLEGGDPATSPRLR
TAIDKAFAANMPKDNIERAIKRGTGDLEGVAYEEVRYEGYGPYGVAVMVD
CMTDNRNRTVAEIRHVFSKWGGNLGTDGSVAYLFSKKGIISYSKESNEDA
IMEVAIEGGAEDVVTNEDSSIDVFTEPEEFSTVKKALDEAGLKAEQAEIT
QHASTSVSLALEEARKMLDFLDALEELDDVQQVYSNADFSEEILAQLG
>Noc_0486 conserved hypothetical protein
MNTPITTPGSDQPRVIGGEDNTSGPGPYLMTANSLENNEVFNLEGEELGK
IKDIMIDVPKGRVAYAVLSFGGILGVGDKLFAVPWSALTLDADEKCFVLR
VNKEVLENDPGFDKDHWPSMADQRWASDVHSRYGARPYWVPKDYWPTKRV
RQGRQTRM
>Noc_1714 ErfK/YbiS/YcfS/YnhG
MPNLLLKLLVILALVLTNPFAGATSSTELSYELIIIKSKRLLIVKKGGQI
IKKYHAALGQGGGGGKRIEGDKHTPEGEYEIVGFWPSHKFHYFIHLDYPS
RGDALAGYKEGLISWDELVHIYRAQQEKNGIPPQQTRLGGFIGIHGIGEE
TRKKLMIHRNFNWTRGCVALTNREIDELRQFIQLGTPVIILNEFNDKSLL
AGMKVESGVSLTN
>Noc_2469 Protein of unknown function DUF339
MSKHSLHRLRWRCRRGLLELDLLLGNFLEISYESIPDEEKRAFETLLYHS
DPDLWRYCLGEEPHPDPVTANVIAKIRRAALS
>Noc_2199 hypothetical protein
MSSIEKLDVSLAGLGTVLSSRRVAVPPYQRLYAWTDEQTSALFKDLNDAI
RNKESEYFLGTVVIAKEPTGGQWVIDGQQRLTTTSILLASIRDYFLKNGD
SERADIIQQDHLSKKDLETLEDTPYIKLNDTDHDFYLNRILARPGKDRDA
LSPASSLTESRCPAKGRPRNRS
>Noc_0359 Protein of unknown function DUF533
MSFGNIVGQLLQGMSSQSHSRLERTASPEGLGGLVDIAQKFLSNKQAGGM
TGGQVGGLGALAGALLGGGGSAAKGALGGSAMAILGTLAVSALQDKQNTP
TSSGNLTSGAQPLPQEQIEALAAPQTEQLMVRAMITAAKADGQLDQKEMD
NIFGKISADGVTDEERQLVMDELSRPMDLQALVSAVPNKAVAAEVYAASL
FAIDIDTEAERAYLQQLAQALGLDRATVSRLHELTGSPIA
>Noc_0371 Protein of unknown function DUF490
MKLLRSIGAFILLLLVLLVGGLAYLLLTEVGTRQLLAQVAQVIPGELETQ
QVEGTLGEALTLTGLRYRTPDFTLEVGYFHFAWRPAALLGATFWVEQLHL
GEVSWRQKRPGESTASQEPIVLPEIQIPLKAKAEDVRLQNISLTPLGSSP
VVINTIVFKGNFDGQALQVGELGVSAPQGEVQVSGEMAFQEAYPMAFALA
WGAPVPELGKVTGIGRVKGDLRQLTLHQTVQAPFHLQFRSRLFELLEQPR
WVAALEVPGVELQHLSAQWPAVRFGLDLKGAGSLEQFKVRASYQIQEAQT
GEVRGSLSVEQLAMGHWLLDRLTLRQVEGPARLALRGEVMMAEDQPRMSL
AGQWQDMAWPLRGTAQVSSNRGQLTLEGTPAAYRLQLNSALAGQDIPTSE
WHLMGTGDTTQFELEKLRGQLLDGVLSGSGNFRWTPALAWDVRVDGEALN
LSKEWPEWPGVLSFSSDTNGVLEEDAQDITLDLHALSGSLRGYPVAAQGR
VQRQDNTWRIADLKLRSGDSRLSLGGTVNERLALKWHLSSPDLSQLLPEA
QGDLLVKGHAKGPLTGPELTFRLQGKALAYQDYQVESVMANVDVDLQGKQ
SSQVRIDASDFTLADQTLRSVAIEGGGTPLHHKLSLAVKAPERSLDLGFQ
GSWKEEVWQGEITKTELTDSLMGHWEAVSATSLTLSRSNIDLAPWCWRQQ
SAQLCLGGSWQEESFWRGSFKLEDFPLAMLGPLLPEKTALEGVIGGEVQA
QGEAHQLVQARMQLAASGVQLTQVTPEGQSLRFPYQDMQARLNLEDRGGK
AGFELLSADPGTAPVRASLRLPSAPLDLTALGQLPLDGQISMAFRDLAFL
ETLIPELEAVQGQLRADLTLGGQVAAPQLLGEVVLQEGSAQVVPLGLKLI
KIRLRAEASEQDRIVFTGGVHSGEGELAVNGQVRLEPEAGWPAKVTVTGE
RFEAMGTSDIRVLISPQLQITKAEEAIRVEGEVVIPEATLVIKDIESRGG
VPVSQDVVIISQEKETEKKAVPIYARVRIILGDDISVRAFGFKGGITGSL
LVTETPGKATRGSGELQIVKGEYKAYGQQLNIRQGQVVFAGPIDDPRLSV
EAVREVDNGNIVVGARIRGAASEPVLTLFSEPSMDESNILAYLILGRPLA
GASGGDGELLTKAATSLGLSGGTLLAKRLGKIFGLEDVGIESADNGNGNG
DTQSEMLMLGKQLSPSLYIGYGIGLFERFSSFRMRYILSKNWSVQAETGL
ETGADLFYSLER
>Noc_2254 Protein of unknown function DUF520
MPTFDVVSEVDKHELQNAIDQVNREIGTRFDFRGTEAHIEKSEEELILVA
ESEFQLQQMRTILDTKLAKRGVDVDCLEAKEPEIIGKRARQSIQVRQGID
KDTARKIIKIIKESKLKVQAAIQGEQVRISGKKRDDLQQVIALLREADLD
LPLQYINFRD
>Noc_0308 Protein of unknown function DUF329
MNKDKKRHVNCPTCGQEALWSEENPWRPFCSERCRLIDLSDWATESHRIP
GEEEHPIPSSEE
>Noc_0106 Protein of unknown function DUF924
MLLNSMHHQQIIDFWFKEIKPESWWKKIPHFDQLIKERFKAHHRAAVQGE
LYEWRREPFGRLAEIIILDQFSRNIYRNHPLSFAYDTAALILSQEAIDKN
VGKMLNSEHKIFLYMPFMHSESLKIHQIGIKLFAEPGLESQYDCEIKHQA
IITRFGRYPHRNQILERPSTPAEIAFLKKADSSF
>Noc_2513 membrane protein
MALLEQLGLALALGLLVGLERGWQERHAEEGARIAGLRTFGLISLLGGLW
ALLSEQFGAVLLGFAFLTFVLLVLVSHFDSVKRSGARGLTTVVATLVTFT
LGALTVAGYEVVASAAAVVVTVLLGLKPVLHAWLARLRQEELYAVYKLLL
ISVVLLPVLPNQGFGPWQILNPYEIWWMVVLIAGVSFIGYFAVRIAGTGR
GILLTALSAGLVSSTALTLHFSRIARANPRGQRLLSAGIAFAAATMFPRL
LLVVGIIHPSMALYLLFPAGLMGIIVYGGGYWLWRGAKRQSTPHLVLDNP
CDLGMAVKFGGLLALVILAAHALKEWFGEIGLYVAAGISGISDVDAISLT
LARMAQEDSQVLAVAAIGIFLAAMVNTLVKGGLAWGIGGSRFGLPLLGIF
LLSIVVGGGSLLMWSGNFPSLELV
>Noc_1707 Protein of unknown function DUF1458
MSDHVYKKIELTGSSAKSMEDAVQQAVSKASKTLNHLRWFEVVETRGEID
DNGRIKHWQVSLKVGFRIDG
>Noc_1323 hypothetical protein
MTAIRKHRRNPNGGEPERSVGGAGADKRVPLKKLALTILLLGLVVLAWLG
TLSQYGEDYIDRVFTQALTTFAVARALNGVISVAQGTKLAVEPAGVGANF
ALGEVLDPINDLIERFSWVMLASTTALGLQKLLLEMAGWWGIRIFLLGSA
LLFMACLWWPARIYWQFSPWVKQLLLMALLLNFAVPTVAFLSGQVFEDFI
REKQQVSMRALERESQEFKRIEESLPSEEDRIKEQTWQEKLSQLMGGMRK
SLDFEAEIQRLKVKSEEIAEHVIALIVVFVLQTIILPLLLIWLLLRAIRI
IQ
>Noc_0856 Pseudouridylate synthase
MEADEAGQQVLAYGGDPPLATALLRCRPEDFQVVEELPFALSGEGEHVWL
LLCKRNTNTVWLARQLARIAGVRLVDVGYAGLKDRHGLTTQWFSVNLSGK
KEPAWATALESATVQVLKVIRHSRKLQRGALKGNRFLLTLRHFQGDREVV
CDRLTQIKVAGTPNYFGPQRFGRGGQNLDQVHRWFSGGKPPRGRYLRGML
LSAARAFLFNRVLSERVQAANWWQPLPGEALILDGSHGFFVAETIDEALQ
ARVRRFDCHPSGPLWGRGESPAKRMSRALEEEVLADYALWREGLEQAGLK
QERRSLRLMVADLEWSFPPAMDSLQLHFRLPAGAYATTVLREVVRTQEAV
GQPFLLDE
>Noc_1044 HesB/YadR/YfhF
MNVTNAMPNPLIFTDIAAGKVKELIEEEGNDKLMLRVFITGGGCSGFQYG
FTFDETSHEGDTRVKNGGVTLLIDPTSYQYLVGAEIDYTEGLEGAQFVIR
NPNAETTCGCGSSFSP
>Noc_2342 Protein of unknown function DUF971
MKTCHAISSREFSTLTMTIQHPHPQPTEIKLRRNSRVLEISFEDSAHFAL
PCEFLRVYSPSAEVRGHGPGQGILLIGKETVSIKEIEPVGHYAIKIRFDD
GHDTGLYSWEYLYELGEKQTQLWQKYLDRLQQVGHSRKEPEIKD
>Noc_1801 conserved hypothetical protein
MYMMSSRENFWAGERLSNKDQIKDVVLDDPDPAVDSPLIEEAIFFDMMKD
KRLLLLIHGYNNEADDVCRAYSIIQDNVHRYLKGNYDDIVGYTWPGGSEP
LNYYKARRRAGAVAPRVARWLAELTKVTQAIDVMDHSMGTRVILSALRMD
TTARLHNLYSMASAVEDESIQYSREYFSSTKACDSVYVFHSRHDKVLRFA
FKGAEWDRALGYSGPEDPALIMTHSPNVKVINCRQVIDSHGGYKYSESVY
RFIARELKSSEAPQYSILE
>Noc_1857 conserved hypothetical protein
MRLRIDQQSDALYLDLNDKEIDSSEEVSDGIILDYDKDGNLVGIEVLDAS
KKAGDLKTLHQLSLDVPHIAV
>Noc_2029 Protein of unknown function DUF323
MIPSQDAVELKEIAPDTREEGFTFMRRYRQVRQLSETLCQPLVDEDYVIQ
TMPDVSPPKWHLAHSSWFFENFILIPKFKGYQPFHPAYSYLFNSYYETVG
QFWPRPQRGLLSRPTVAEVYAYRHHVDKNMVRLAENLEAEKWPSVASLIE
LGLNHEQQHQELLLTDLKHIFATNPLRPAYQEGVVPQFKGARKNGSLEWY
DYKGGLHALGYSGEGFAYDNESPNHLVYLRDFRLASRLVTNREYLAFMAA
GGYREPRYWLSEGWHTVRQEGWQAPLYWEQQGEGWWQMTLHGMQPVQKEA
PVCHLSYYEADAYARWAGYRLPTEAEWEIVARTLPCRGNFLESGALQPLP
APPAAPTPVQMFGDVWEWTGSPYAPYPGYQPSEGAIGEYNGKFMCNQMVL
RGGSCISSSEHLRASYRNFFPPHARWQFTGLRLADDV
>Noc_1577 Protein of unknown function DUF1130
MLPFYTIGHSARTLDAFLDLLRAAQVTVVADIRSVPKSRTNPQYNKDILP
NTLTAFQIGYEHIAELGGLRGKAKSVEPTVNSFWENRSFQNYADYALSVS
FRAGLDRLIALGHARRCVMMCSEAVWWRCHRRIVADHLIARGESVIHLIG
RDRAEPAKLTAGACVHKNGRVTYPAGSIEG
>Noc_0532 conserved hypothetical protein
MSETVSNILIEAINDEYKARATYRHVIRKFGEIRPFINIVEAESRHIEAL
LPLFHKYGVAVPGDNWASRIEAPLSVLEACQIGVEAEIENANMYDRLLES
TADYPDVRRVLVQLQRASIDNHLPAFRRCVERGGSPGQGWQHGHRHDA
>Noc_2252 UDP-2,3-diacylglucosamine hydrolase
MATFFISDLHLGTGKTEIQLRAIEFLSQEAPYGDALYILGDLFDYWIGDD
APTAEGLAIITALRRLADVGVTLHFLSGNRDFLVGQVFSQASGCQILSDP
AIIDLYGVPTLLMHGDTLCTDDVAYQRARARLRRPAILRTYLALPKSWRR
AVAQRLRRQSQAHVQHQPLTIMDVNQTAVETALRIHGVKQLIHGHTHRPA
VHHFTVDGHPRQRIVLGDWDRGKSALSCTPEGFHFSDSRILEPRFGNLQG
>Noc_1113 Protein of unknown function DUF454
MWLYRIIGFFFIGIAIIGAILPLLPTTPFLLLAAGCFAKSSPKFHQMLLE
NAIFGPIIKNWHEHKTISLRTKIIAISSLLFFGGYSVIFAIENTTLKVVG
ILAITIGLFSVIRIKTHPIHPPSVE
>Noc_2004 Protein of unknown function DUF526
MIESKLLDELARKLAEAVPPGLQDFQRDVEKNFRAVLSSTFAKLDLVTRE
EFDLQQAVLARTRMKLEHLATQVASLEEQIGLSKTPQESQKAPEEKLIE
>Noc_0441 putative plasmid protein
MEHTKLFKSNRSQAVRLPKAVAFPADVTDVDIVAIGRARLITPAGDAWDV
WFDGPAVSPDFMDDRDQPDEQEREAF
>Noc_1832 LemA
MGMTGIIALGVIVAMILYVIVIYNRLVAFKNRYKNAFAQIEVQLKRRYDL
IPNLVETAKGYLQHERKTLEAVIAARNQAVTDLRQAASDPGSGDAINRLS
HAEQTLTGALGKLNIVVEDYPDLKANQNMMQLSEELTSTENRVAFARQSF
NDAVMHYNTYKQSFPQNMLAGMFGHGPDSALLQFEDSQAIQAVPSISF
>Noc_0394 Protein of unknown function DUF523
MDPLSARGEIPPSLSSPEIPRLRLAISSCLLGERVRFDGGHKRDTYITEA
LNAYFDFVPVCPEVAIGMGTPREPIRIVKTSQGLRARGVRHSELDVTDPL
RQFGEVMAEELGDIDGYILKKDSPSCGMERVRVHGTGGAPSRTGTGLYAQ
VFMARRPWLPVEEEGRLNDPVLRENFFERVFVHYRWRQIGAAGITPAALV
EFHSRHKFMVMAHSQAAYRRLGRLVAKAGSESLETLAPTYEAELMSALRR
RATRKGHTNVLMHLLGFLRAQLDKADRAELLESMESYRLGLVPLIVPMTL
LKHHFRRHPNPYVNQQYYLNPHPPELMLRNLI
>Noc_0230 conserved hypothetical protein
MRSNKTYLSLLFLLSLSLLFLTYSASAQWQQLHPMPTHRSEMAAAYLDGK
IYVPGGLGGQHQFEVYDVTTDSWEQLAPLPAPRHHLMATAHQGKIYVFGG
GDQDWSPTVTAWVYDPPSNQWQTLTPLPEPRYAGDAVSMGDFIYVVGGKG
PSGRLLRYDPQQDSWDFLKGMHQRREHIRSVVFEDRIVVLGGRYQGAGEL
GSVEIYDPATDTWREGPSLNTARGGHGAAVYQGKIMVFGGEIIMTGRTTL
ASSEILEKLSGKWQPGPPLPMALHGMPAISTGSHLYILGGSEQAAASINR
GRVYRLLETPGPPPALTLGLFLTTLPPEPISSFQLAPAP
>Noc_0228 Conserved hypothetical protein 701
MLWIKAFHIIFVVTWFAGLFYLPRLFVYHAQCQDKPGRERFKVMERKLYR
GIMHPSAILAVGLGVWLIYLTPAWMGAGWLHVKLSLVLLLIAYHLYCGRL
LIAFREERNRHSHVYYRWFNEFPVLILIGAVILVVVKPF
>Noc_0573 conserved hypothetical protein
MKVSYFEDTDTLYIEFRDNDIAESKDLDENTILDVDANGNVCAITFEHAS
QRTDVSRLIVEGIAA
>Noc_1351 Propeptide, PepSY and peptidase M4
MNARKLMLIMSILGLAGTGIAHADDIGPDQAIKLMQEGKIQSFEKLNEAA
LAKHPGATVEETELEKERGGYVYEVELRDTQGVEWKVELDAVSGKILRDD
KDD
>Noc_0043 conserved hypothetical protein
MVADGSPLKARLQEEVKIAMRAKDKARLGILRMVMAALKQFEVDTRKPLD
DAQVIALLDKMLKQRRESLAQYEAAGRTDLAEKERFEEEVIQSYMPTPLS
EVEVEQLIEEAISQCQATTARDMGKVMALLKPNLQGRADMAVASAKVKAR
LEAISS
>Noc_0820 conserved hypothetical protein
MALDIYASDQEKSEAIRQWWRENGRAVLVGLIIGLLALLGLRSWTNYEQS
RTSEASSLYQQILAAKDQDANAEIYNSAEHLLQEYSDTPYGLFSVLILAK
EDQARGDLETATERLKGALEYARHPSLQKVIYLRLARLLLAMGSPQEVLT
MLAEVKPGSFSSAYAELRGDAYVALGQPSEAQLAYQEALLGLGPSEQYRQ
ILQMKLDNLAQP
>Noc_2393 Adenylate cyclase
MPIESEIKLRLPPGQFSKFSEHPWVKEHGKQGPVRKHLQSIYFDTPDLRL
LDQGVGLRVRRMDHRWIQTLKGDNSGEAGLQRRQEWESEVENNAPDLEWL
VAEAKVGVFKESDLAARLIPVFETDFYRMIWILQVRRGSAIELALDEGHI
HAHGRTESLSELELELKQGDEKDLYSAALQLAEIFSLAVEHASKAERGYR
LYTQQFSPLSAPGFVSAGTVSDVQSLSYWLARLQYGERLLLERRDPQGYS
HLRAAAEELYGGVMTAEQKMGISLQEDLHWLVRGLQGNKDFTEVCHLVAA
PRYSQLLLRLGRYLAELKEKQ
>Noc_3000 Conserved hypothetical protein 251
MAVRWYCWQQEALIIQIRLQPRAKGDEVIGPHGDRLKIRITAPPVEGKAN
THLLRFLAKTFQVSRNQVYLLSGATSRDKRVRIEKPTKLLPGITPPIRKT
VR
>Noc_0274 CsbD-like protein
MPINWDQIEGNWKQFKGHARQKWGKLTDDEIDEAAGNKQILAGKIQERYG
IEREEAEKQVEEFRNSLK
>Noc_2386 conserved hypothetical protein, YCII-related
MLYAIIGQDIDHSLERRRQVRSEHLARIRKLQENGHLVLAGPFPALDTQD
PGEAGFTGSLIVAEFSSLEAAQQWAEADPYVEAGVYAQVTVKPFKQVLP
>Noc_0536 Pheromone shutdown protein
MNSTEQQQPLSIIKFADRQITLLGTAHVSRASAEHVKALLATGDYDAVAV
ELCPSRYQALINPNALSRMDLLEVLRKGKAAMVTASLALGAYQQRIAEQF
GIEPGAEMRAAVDSAHSAKLPVLLIDREVGTTLKRIYRNVPWWQRFNLIG
GLFASLISRDTVSEEEVERLKEGDVLETTFSQFAMEAENLYLPLIDERDR
YMAARLREEINQNEYRHLLGVIGAGHLRGVTRYLEQDDATPAKKTITELD
QIPPPSRWPKIIPWLIVVLVLVGFIFGFYRSPELGWQLIIDWILINGGLA
ALGALIAGAHPLTIVGAFAAAPLTSLNPTIGAGMVTAAIETFLRRPTVGD
FSRLRQETVHIKGWWRNRVTRILLVFLLSTLGSAVGTYVAGFKIIDRLAG
A
>Noc_2228 conserved hypothetical protein
MKKDPLNSQQHSPQQPLDWVALLGITIASLSALAAVSSGFGHRLGAWHFT
TGFLILRWASFGALAAIVLSLWGAFRTRPQRKRRGFWHALLGLALSLALV
SIPLYWLLMARSVPPIHDITTDTDNPPAFSALLPVRAEAPNPSYYGGPSI
AVQQQKAYPDIKPLRLPLNPKTTLDEALTIARSLGWKVIAVESRETAKGG
IRLEATDTTLWFGFSDDVVVRITPFDDGSRVDIRSVSRVGRSDVGTNARR
IRAFLAALKERAA
>Noc_2671 conserved hypothetical protein
MPRRIIKRYIPQPHQLQEHKHLRHLGEWFFASDLWHLSRRSTAGAVGVGL
FIAFLPLPGQMLIAAVAAAWARVNLPVAILMVWVTNPLTMGPMFFFAYKV
GTWLLGSPVYGPEFEFTWQQWLQTRLAATWEPFLLGCLVVGVVVGLTGGL
LTLVLWRLEVSRRWRNRKRRKVISQMEK
>Noc_0395 conserved hypothetical protein
MRNRLITIGTLIVVLSGCMANQPKVSGTPTKKIQSQNQIEPAQPEIETKR
TYNDDGSVKNIQYSKNGKEIEKFTFSYYPSGAIKIKLRTVNDILNGQSEA
YFEDGTVSENANYVNGLHYGKFENFHENGKLRKSGTYKNDKLNGTVKTYN
KLGNILSRVIYTDGVEGISIYYGYKDNNLEWEKKYKNKTLVQSKKYYLNG
NVELKAGIKNGEINGFLEEYYNNGNIKKKTTYKDGKKTGLELGYFKNGEL
SWETNLIEGRIDGIYKEYYENGALKLIQNYTKDEPNGLKKRYFDNGKPEY
EAVIKNGKVNGVYKEYFESGGIKFTKNYKNGEMVGEGKGYYQNGNLNWLV
TYKNGKLFTDKSYTKTGKISADLTYKDGKKTGIERDYFEKGNLSWETWFK
NGKRTKAKQYYQGGNLAKEIDFKDGKAIKGFQYTYEGKKTEMTNAHFHNL
GYEY
>Noc_2870 Protein of unknown function UPF0040
MFRGITTLNLDAKGRLSIPAKYRKSLGICCDGKVIITVDLLEPCLQLYPL
PEWEIVERKLVALPSHNRQARYIKRRLIGHAEECELDGHGRILLPLELRS
RTELGKNISLVGQGNKFELWDSMVWERQMAKEEASAKEELTRELALLAL
>Noc_1947 Conserved hypothetical protein 374
MASTELKSLQGWRYRVFLLSVAFSAAGYLGVSLWGGWREVLQTIGAVGFW
GTVIVLGLSLVNYGLRFVRWQHYLQLLGHPLSLEPSLRIYLAGFALTTTP
GKAGEMLRSVFLKHHGVPYSKSLAAFFAERLCDLIAVLILIAAGAWRHER
AQPIILGLGIALVMGLILLHIPRWLRGIETWVERFSRPRLRSFLISIIEM
VLHFRRCFAFSTMVYAISLGFFAWGAEGLGLYYVANWMGGEISLIEAVFI
YAFSMLVGALSFLPGGLGSAEVTMIGLLLLNGMGEAQAVACTLFIRLATL
WFAVLLGLLALPRRD
>Noc_0040 Protein of unknown function DUF205
MLTILVLVIGGYFVGSLSSAIIVARLAGLGDPRTQGSGNPGATNILRLGS
KRLAGIVLLGDALKGFLPVWLAQEVGANAWAVAGVGLAAFWGHLYPVFFG
FRGGKGVATGLGVLLGFSWSLALAVLGIWLSIFWCWRISSLSALCAAILA
PFLAWWLVPDSAIRFAVVLLAIFLLWRHQDNIRRLLSGQEDRASKQN
>Noc_1481 hypothetical protein
MKQRYPLPVFYTVALTLSLIVPMLFAPRAQGEILAMLNYEAKPEQRIQKE
GLAIIDVDPNSPNFGKMLMDIPLPPGLVAHHLYYNQDHSKIYITALEKSI
LHVLDMTQFPYRMKMVEIPQCKVLEDMAFSKDKQTWYLTCMGSSNVIVGN
ASTDKPIKSIKTPPSDSAFIRYPHGIALHDDLDRLLVTSTVRHSDLGDPG
ETITAIEASSGKVLSTHKVSMKPSPSGAAPVEVHFLPGAEPPLAYITNMY
EGNLWTAVWDSDKKVFDFQQVADFAPHGRGVPLALEFNRKGDRFFVTTAQ
PGHLNIFDISDPQAPELLKAIPTAPGAHHIVLSPDERYVFVQNSFLNLPE
MSDGSITVVDLAKGEAIAQIDTLKNQGFNPNCIVLLPEWASGGHSH
>Noc_0692 conserved hypothetical protein
MSKHITRMKLGEGLEQSQTDWKRLDAMKDEDIDCSDIPELDARFFENAKV
VMPPGKKQLTLRIDADVLDWMKAQGKGYQSRINAVLRAYYEAHRDEGR
>Noc_0422 Protein of unknown function DUF980
MLIISLARTLKAYWRRRIINRHPIPLAHWEAAVACLPLLQGLSLAELSHL
RDLSTLFLHHKSIYGAQGFKVNEKMRLVVAIQASLLILCLDWDDYRHWRT
VLLYPGAFVAKREERDESGIVHAVQHPLIGESWDRGPVILSWDDVAHTMA
PSESPSNVVIHEFAHLLDMGNGAANGMPPLHRTMNRRTWTSVFFQAYKNL
NERWAAGEPIPLDSYALESPAEFFAVASEVFFVSPQKLQTALPQIYHQLQ
LYYRQDPASRTLLF
>Noc_1380 Endonuclease/exonuclease/phosphatase
MKMILIILALVLIIATLTPLIRRHEWWIRVFDFPRLQILIMGLLVFAVFF
FLWDKQNIFEGITLVVLAAALLFQGYKILPHTFFAAKQVRGTSADTPGNT
ISLLIANVLMGNRNVQRFLAIVREADADVVLMLEPDQWWEEQVRELEVKY
AYTVKKPLDNRYGMLLYSKFELVAPEIKFLLKEGIPSIHTEVKLPSGQLV
RMHCLHPEPPSPTEADTSVNRDAELLIVGREVKNSEQPVIVAGDLNDVAW
SYTTTLFQKTSGLLDPRIGRGMFNSYNAKHPLMRWPLDHLFHSEDFLLVR
IARMPAFGSDHFPIYIKLSLNSNAENLQESPEEPDAEDKEFTADTLQKVE
QKEEI
>Noc_2030 conserved hypothetical protein
MARKRVYKIKFISQGKLYELFAREVGQSTMYGFVELGEIIFDEKSAVLVD
PSEEQLKAEFAGVKRTYIPLHTILRIDEVDKEGINKITEATNADTNVAFF
PSSLHTPGKDKK
>Noc_0473 Peptidoglycan-binding LysM
MIFSSAALLADEESQQQRADNQTTHQSQESDQSLRITGSYPNIYKATGII
GKEVKNKQNKKLGKISDLVIDKSGQVRYAVLSHGETLGVGGKKLAISWDL
IQVSPEEESYTLVMDATPEELANAPSFNKDNWPANAQVTDTSSLKEQQQS
STAGSQSTDQQSTNERSQFQQQDNGSSTPKTVTVQEGDTLADIAHHAYGD
ANKWRLIYNANKDKIKDPRDLLVGTKLTIPSSNE
>Noc_2137 conserved hypothetical protein
MSVTSSIAHKSKEGARRLAWLYRPRTWKEKGLLWTIGLALATYMVIVVIL
GILWSFEPEVFDGRTHTQEVAAVFAKDIADEMDLPQSKRKELPAGFIATS
TAIHVARTLLEKPGGYLSNDVFPPGVYLDNIPNWEFGVLVQLRDFVRNLR
NDFSRAQTQSLEDKDLQIADPQFNFNSESWILPTTESQYRKGNKALLSYL
KRLSDDKKNDGHFFVRSDNLRSYLEVVEKRLGGLTQRLIAAVGEVQFNVN
LAGERQGRSAKPEPREVRVKTSWWEIDDVFYEARGSAWALVHFLHALRIE
FEHVLQDKNAEVSLAQVIRSLENSQKTLWSPMILNGDGFGTLANHSLVMA
SYLAAANAALIDLRNLLKQG
>Noc_2284 Protein of unknown function DUF190
MKQRSEAELLRVFIGESDKYHGRPLYEVVVEEARRYGLAGATVLRGTLGF
GANSRIHTAKILRLSEDLPMVVEIVDQPERIAAFLPELDALIEEGLVTLE
RVQVIAYRHSHTE
>Noc_0087 Protein of unknown function DUF541
MTESNKSNAIVLGVCLIIALSSLGYLLGNAAIRFKEYERTVTVKGLSERE
YSADIVIWPIQFSTASNDLEEMYRSIDASTEKIKSFLESAGVNADEISYS
LPAITDKSAQQYGNQAKSEFRYTASQTVTVYSKNINVVRDVMGNMSDLGK
RGIVFTGGDYQSQTEYIFTRLNQVKPEMIEEATRKAREVAEKFAADSKST
LGKIKSASQGQFSISARDKNNPHIKKVRVVSTVAYYLSD