Gene list
Applied filters:
COG category: Function unknown
Gene type: CDS
Genomic element: chromosome
Number of genes found: 173
# Nitrosomonas europaea ATCC 19718, ATCC 19718 >NE1283 conserved hypothetical protein MFKTDMWDTFLQAIALMLILEGIFPFSFPGAWRETFLKLMQLEDSQIRFV GLSTMLVGLVILFLVN >NE0264 conserved hypothetical protein MSTTPIDAELDLMLKRELAVPVNLVWRGLTEPELLKKWFVPKPWSISDCR VDLRPGGEFYTVMQDPEGNKFPNSGCFLEVTDEKRLIWTSALVKNYRPAV PATTSDKECAHIVMTAVIELQPTSSGTRYTACAMHNTPGQRKLHEEMGFH EGWGTTITQLEELLKQEKAY >NE0713 conserved hypothetical protein MKLVFSEQAWEDYLYWQKTDRKTVQRIDTLVKEITRTPHEGTGKPEPLKH ALSGYWSRRINNEHRIVYKIADDSLFIAQLRYHY >NE0227 DUF186 MNLRQKITEDMKSAMRAGDVKRRDALRLLQAALKQKEVDERIELDDAAVV AVIEKMLKQRRDSIAQYEAAQRQDLADIEKFEAGVLQAYMPEALSDAELD AMISEVIASVGENGSPKIGEVMALLKPKLAGRADMAKVSLLVKVKITG >NE1345 conserved hypothetical protein MRAIRFVPDAWEAYLYWQDQDKKTLRRLNSLITAASRDPFVGIGKPEPLR GELSGYWSRRIDETNRLVYRVTDVELVIIACRFHYE >NE1928 Smr domain MGSDDKSSSEEVAGDEDAALFREAMRDVRPLTRTKKIIRAHSKHQALPQP GASDLPDIQDVLADDGWQWDMAESEEWSFARPGLQRYTLRKLRRGNWPVQ DELDLHGLNRDEARRILVIFLNQGVLRGLRCVRVIHGRGLSSRNRKPVLK ILTGNWLMQHGDVLAFCQALPEQGGSGAVLVLLRNADK >NE0619 hypothetical protein MIWYLEKWRRRWILGRKPIRESVWQSAVDQLPFLHGLTQDEFRQLREWTS IFLHDKKINGVQGLVVTEAMRVMIAVQACLLILKLDPEYYDDWVEVIVYP GKFILDHAYTDETGIVHTRRMVASGESWQAGPVILSWEDVVHVHHESGYN VVIHEFAHKLDMLNGSANGYPPLHSDMDPRIWAVVFSRAYEIFCRQVERG EVTIMDPYAAEDPAEFFAVSSEVFFTRPYRIKQYFPEMYQQLARYYRQDP AARQECETDS >NE0717 Cold-shock DNA-binding domain MRYQGRITTWKDDKGFGFVTPNGGGEQIFVHINSFSSRQRRPEGNELVTY ELTVDSKGRSQAKAVAFVGEQPTPPKAPSRSSLPPLFAVCFLIFVVGVVV AGRLPSPVLAFYTIASIVAFFAYAFDKSAALRNQWRIQESTLHLFALLGG WPGALAAQRLLRHKSAKASFQTTFWVTVILNCGALGWLLSPSGTRTLNSL LGTA >NE1164 probable transmembrane protein MQQDHNKISSARIPAIAYFLFLSRWLQLPLYLGLVLAQCVYVYHFWVELV DLIGSVFGNQSALQHTLEMVTVKGAESSGKLTETTIMLVVLGLIDVVMIS NLLIMVIIGGYETFVSRMNLEGHPDQPEWLSHVNASVLKVKLATAIIGIS SIHLLKTFINATAYDEKTLIAQTVIHLAFLLSALAIAYCDRIISRTIHHA DEHE >NE0064 putative signal peptide protein MRKCNLQAAHGYSGLFNMAWPNRAIFSSFLSLFLIFFSSIVLAERADRDK PIHLEADHATVEDYKRKGEFRTSIFTGNVVLTQGTLVLRADKVIMKEDAA GYRYATAYGDLVSYRQKRDGVNEYVEAWSKRAEYDDKTDKIELFGSARLK 
RGADEVEGDYISYDIASDFFQVSGRQQSENDKNSDHRVRAVIQPKAKQSD TETGK >NE1104 conserved hypothetical protein MRDATINLRALPEQRDLIDQAASLLGKNRSDFMLEAACDRAQAVVLDQVF FSLDTDKFKQFTTILDAPPGPNPGLERLMAVKAPWNTDVV >NE1328 possible transmembrane protein MPHSLLRHCTVCLGWAGRGFLLVAILACLLLLALRYWILPDIGKYRSDIA AAMTRVAGQPVRIDSIRANWDGLRPHLSMQGVSVSDRQDEPVLVFPEIEG TVSWRSLLRGELNFHEIIIDRPALVTRRDVKGVLHIAGVTLGENQQESGF LDWLLRQRRLIIRQAVIYWQDELRRRPVHYFESVNLHLQNKRGGRRHQFG LQAHSSDPLFSRVDLRGDVTGDSVQTLSAWQGRLFIQLQDVDLGSWQEWM TLPADLVLKKGKGSMRAWADIRAGILARWVSDISLRSTAVHVARQLPVLE IDRLHGRWGWHEIEDAKGVNRQWFARGMSIELNGLPLTGPIDASWRVLNP KDGGLPIHSLQAGGLRMDVLTRLAASLPAEEALHESLSRLSARGMVKHVN LEWRGDWTKKPPFRINAGFNDLAVQSFDDYPAFSGISGIVDATEAGGSLF LSSKNVEISKSQSPGEKFRFDSLIGRVDWKTAADHAVTRIKFSDIAFESD AGSGTLQGNCAFDREMPARIDLTGGLSHADMHLLGEYATWMADETWKDKL DKATLSGKLSDAKFHLRGVLNGRSADKKGAFSIHAETAISNAGIKISDDW PEVSDMAGRVSIQDGALDLSLSSASVAGIRLQKFMLQSDDVYADRPEIRI KGMAEGESGEMATLLRRVDVGPHVGELLGQAEFSGKGRLQAEVELSVGQE KFSVTRMQGRYQFVDNRIDLDRYIPDFHQVNGSLIFTESGVVLEGIRAQV MGGAVEFSSASLPEGGVRITARGRADFDHFRPDANLVKRVDLSQLWTQFI RGGSAWQAAVEVESGKVGIVVESSLEGVELMFPAPFSKTAAEVIPVRLEK VFTLPRDDHVHFHYGDILTAEFQRIREKAHYYHPVRGIISFGGHGTLPQD RVTRVEGSVSRLEWDQLRELFKRHAGMDASLDHTARGLDNILTRSVQFDL HIGQFEFLSSYFNDTHFSIDRQGESWRVDVSSQEVDGRIDWQAASPQKVV ARLSRLKIPEDAPENILTPYKHDPPGDWPAVDIEADELFVKGGLLGQMKL SAVQRQDGWLVENLDIRHPNSRLQANGLWENHKPPYRMYSHIRLQSSNIG KLFKRHGYPGRIARGKGVLEGNLDWAGKPFSVDFATLYGSLQLDAQHGQF TELKPGIGRLLGVFDLKSLPRRLTLNFYDVFGKGFGFDELNGHISIKSGV ASVDDLYIAGSAAELILKGEWDLVNETQTLNLKVFPSFGLATPIAGIAAM IAKRALQDPFDRVLLNEYAITGSWSDPVVVRLDEERDKVE >NE2444 putative periplasmic protein MLDTAVIKLRSTDQLLLNFNKMNIRSVVFFVCVLLCAIANAQTQADLNDD ACGAYQEADKKLNAIYQQLLEQHKDDANFTTRLRKAQRAWLAFWDAEMEA IYPADNKREEYGSIYPMCSCLEQAALVNHRIEQLSGWLTAEEGDVCRGSR >NE2170 possible (AF025396) ORF15x3 [Listonella anguillarum] MNADSGILSIRKTFRQLFSYALIGILTNVLGYAFYLLLTYLWDAPKITMT ALYFVGASIGFFANRRYTFRHDGHIGVTGVRYLLAQVAGYLLNLVLLLLF VDWLGFPHQIVQAIAIIVVAIFLFVMSRFFVFAQSAAESEGVPS >NE1173 conserved hypothetical protein 
MSRIFLALGAVNAFLCVMLGALGSHGLKSILAPDILTTFQIGVQYHFYHA IGLILVGLAMDRLPQARALKFSGILMMTGIVLFSGTLYVVSLTGWRGLGM VAPLGGTSYMAGWLLFAWATWKNKSA >NE1509 Domain of unknown function DUF34 MHQNMLENYLNDLLGIQQFRDYCPNGLQVEGRTSIQTLVSGVTASQALIQ AAVGLRADAIIVHHGYFWRGEDACLRGMKRHRIATLIKHDINLYAYHLPL DAHTELGNNTQLGKKLGIVEAGRFGEQNIAAYGHFPESVSLDELSNLLNS VLGRKPLIIGDPLKPVRRVAWCTGAAQSYFEEAIRLGVDIFITGEISEQN VHTARESGTAFISAGHHATERYGIQALGEHLSQKFSIAHHFIEVENPV >NE0018 PDZ domain (also known as DHR or GLGF) MNVSDVSAQTGSGKKAVSASEQTTRNACVPAASWVIPGKGETTLSSVMAS VKDKSVVLLGETHINPEHHRWQLQMLATLYAARPDMVIGFEMFPRRVQKI LDQWVAGELTESEFLSRSEWQSVWSTDAGLYLPLFHFARMNRIPMRALNI DIHLRRAVTAKGFDGVPEKDREGVTRPAPPAPEYLEFLLPIYMQHGRAPQ KTAEKNQKPNHYDPDFLRFVVSQQLWDRAMAQEIHAVLSSYDKHKKPLIV GIMGSGHILKGFGVPHQLKNMGVKKIVSLLPWDTNRPCKLLTDRYADAVF GLAPFTPESGSPLQQRLGIGFEFSKKTTGAHVLQVEQQSIAESAGLQAGD VILEMAGSTLKESNDVIDAVKRQAPGTWLPLKVMREGMITEIIAKFPPLA K >NE0792 conserved hypothetical protein MFFVRVFIIILALSSSGWVSAKDQVPYATQINDLMRYLRVTPNIATSGAL TKDGIQELVKHSFQTVIDLRSESEGTPSEKKAVEAVGITYINIPVTGEGV NESQLTAFKQALEQAAPPVLIHCATGNRAGAMWTAYRLSEGIAPEIAFKE GRAAGMNAGMEEKIRKIWCDGNKDSCQ >NE1500 hypothetical protein MDDFVLARVLHVLGVVLWIGGVAMVTSTLLPVIARMPPVFDRMDIFHRIE KRFARQARFTTLLVGLTGFYMIHVLDAWHRFTEIRFWWMHAMVLVWGIFT LFLFVLEPRVMHKKVSENAQQDPEATLARMQRMHWLLLSLSLITAAGAVA GSHGWSFF >NE1198 conserved hypothetical protein MISENLIKTLIALAIFVLTMAAIGYLFEEELEAGTNWIVDQIGFLGLCLI LLVTDTLITPFPPDVLLLVIAKSSLSEHWFMYVSILGVVSCVAGMLGWGI GRWLGHFGFIKRILGELEENQREFIHRYGFWAIAIGSITPFPFSVTCWAA GMMALRGTTVLAAVLVFRIPRFFLYYWLIIAASRWF >NE2432 conserved hypothetical protein MSLQSWAWIHKWSSIVCTLFMLLLCLTGLPLIFHEEIEHLSGVVEAPPMP KGTPDASMDRIAQVVLARRPGEVIRFMFWDQEEHPDLTYVSMASRVDAPP EESNSVVIDSRTAKVLDEPKTNEGFMYVMLRLHVDMFADLPGTLFLGFMG LLMVVAIVSGVVLYYPFMQRLKFGVLRMQRSARIKWLDIHNLLGIVTVVW LTVVGLTGSINTLDRIILSLWQMDQMAEMTAPFKDLPPVTQPAHLEASLR AARAAAPDMEVRLVAFPGTMFSSPHHYTFFLRGNTPITSRLLKPALVDAS NGELTDSRDMPWYVKTLFLSQPLHFGDYGGLPLKILWAVLDLIAIVVLAS GLYLWLRKPKTTAASVSKADDAVLPLDDDHTAPNNAMVYINENKDR >NE1231 conserved hypothetical 
protein MSFANIIGQFLQQGISGQSRSRLDQALGSLELDGAGGKLEQFLGNLLNDN DNGSGRDSASTSGGLLNLIRGFFGSKQTGKLTGSQLTGIGALAGALLSGG VKASKGALGGSAMAILGSLALNALKGHLAAGNTSASAADIDRYAMEAIDD PDTQRLIVRAMIAAAKADGIIDEQENARILGKVGEDGVTEAERQLVDEEL RRPADMAALVAEVPNQVVAAEVYSASLLAINVDTEKEENYLRELAKLLGL DAAAVARLHQLTGAPEIS >NE0621 DUF176 MYVTIVYASVKTDKTEAFKEATRMNHEQSIREPGNMRFDILQSADDPTRF VLYEAYKTRKDAAAHKETAHYLTWRDTVADWMAEPRKGVIYGGLYPTGDD >NE0457 conserved hypothetical protein MLRTLCKTLLPLILATASVKGFAMHIEEKTTQLQDQQQVSITIYNENLAL VRDLRHVPLEKGINKLAWCDVSAQIRPETALLRTPEKTSSIRMVEQNFDF DLLTPEKLLEKYLGRSVNVIYVNPATGAETVEAAIVLSISGGVVLKFKDR IETGTPGRIAFPDVPGSLHDHPTLSLVLDGATPGKHELELSYLTSGLSWQ ADYVARLDANDGRLDLSGLVTLANHSGIAYPDAHLQLVSGEVNQVTPEPP QARKMMAMVADAAEYQAVREESLSEYHLYSLPVPTTLAENQSKQITLMSV TDIPVSQEFLLRGTNAYYFSRYSNLDDKLKPVVLIQFKNEGEGLGVPLPR GTIRVYQNDSRDNLQFLGEDHIDHTAKNEEVRVKLGKATDITAMRTQTDF QQLDTPSRRFTETATDITAMRTQTDFQQLDTPSRRFTETAHQIEIHNARQ EAVTIRVQEPIPGDWMMISESQPHTKSSANLVEWLVKVPTNQKTILSYRV RIKH >NE1867 hypothetical protein MTNHTYVIRQQDRNAAISFIDNQLNQNHAWLNETESQRAIAGQEYLQART DPASFNAWCQKWLNESQWAEIKQAICIAKDRQETRLRYAAEPHKTISVTH RAWKILSEIALQEQLTLSEVIVNRLSGNDATACPPAKSRYNLTS >NE2282 conserved hypothetical protein MGESGTKSRFPIRVQLEANKQYYWCRCGLSQSQPFCDGAHRGTGMNPVVF VVRETSPVWLCVCKETKNIPFCDCFRTD >NE0537 conserved hypothetical protein MVPTRSFWVWLHRWAGLIMAAFLIIVGATGSLLAFYPELERLINPHIYPR QVLEKKLDMATLAELAEQRVPAGRVNGVLMEANQEATLISMDARPDNADP PNKLGFDQMIVDPYTGEELARRQFGEISEGMINFMPFIYKLHYALALGKF GVWVLGICALIWTIDCFVAFYLTLPQRRRSVTAPASGNHGNWWRRWQPAW KIRWHSGSYKLNFDLHRASGLWLWPVLLIFAWSSVYMNLWDTVYTWTTRA VMEYKAPWTEFSKREIPLAEPGVDWRQAQRIGEQLMSEQANKHGFAVEQV IALRLDRGNGTYQYIVRSSKEIQDRRGRTSVFFDADTGELKLALLASGQY SGNTVTNWLFALHMANVFGLPYRIFVCVLGLVIVMLSVTGIIIWIRKRAA RLSPKNHQYDSRYLRPSVHRDS >NE2409 conserved hypothetical protein MINNQIKHVWQRFAQSFRLMVGVPEYRVYVEHMRSAHPEQAIMSQEEFFR ERLEARYGSKGRLNRCC >NE2471 conserved hypothetical protein MRILTFLAIALFAFGLTTLDVEARRMGGGGSVGKQRQSINLNRQQQAAPQ APQSGKAASPASQAAPAAGGSKWLGPLAGLAAGGLLASLLMGGAFDGINM 
MDILVLVGIAAAIFFIIRMMRGSAGGRQTQRPMQYSGAGAGGMGGVPFPN RDTSAPAGGASSGYDRQSTSTDAPDIPADFDVESFLKQAKRSFIALQLAH DAGDLEEIRAYTTPDLFAEISKQVAERGNMAQQTDVIFVDAALLDVTNQG NHAIASVRFTGELRDTPNAQPEPFDEIWHVEKDLAASDSGWLLAGIQQAD DLKH >NE0599 conserved hypothetical protein MLYSSNAILLTVICYELPLSERIRMLLRLEDLFDKIDFFSARDTSFEHHA VLVALFEILDVTSRSDLKSDLLQELDKQRTMLEGLRSNPEVSEKALDHIL QDIRAAFRGLLDIPGRIGGHLRDDEWLMSVKQRMSIPGAACEFDLPAYHY WLNLAPEIRREDLKDWITPFTPIRSGINIVLNMLRNSGKNCCYTAVQGLF QQAGSEHQAHLLRLHISSEFPCVPEISANKYALNIRFVPWRSDHKTEVYE EDIPFELTFCSL >NE0432 putative transmembrane protein MEGSVPGFWRRMLALAYESLLLLGVWFIAAFLFHLLFRDPTAEYFRPLFQ FYLLIVGGIYFIWFWTHSGQTLAMQTWKLRLVSANNGKVTTQQAMVRYLM AVIGISFLGFGLLWALFDRNRQFLHDRVAGTRIIRLG >NE1145 conserved hypothetical protein MAGLNPQTPVPTELKLHRKSGVLSVTFNDGKAFNFPCEFLRVYSPSAEVR GHEPGQEILQTGKKNVTITHIEPIGRYAVRLDFSDGHNTGLYSWDLLYDY GLNQETMWQDYLQRLQAASGSRETG >NE1868 conserved hypothetical protein MITQSSLLTYPLAGCLNRLANKSSQVKQRLQTHANETVCFRIDSLASLSV TITQDGHFTTAAKDTQAAVILDIAAELIPRIASGEMAAFDKIIICGDPEL ADTLLYMGKIFQAGIEENLSSVMGDVLARRATLTGQELVRWHLGSIRNLS RALGEFLSEEQPVTASRNRFHQLASEVESLQRRILRLEKRITALVPSFSS IAGNLPRTGR >NE1403 putative uroporphyrin-III C-methyltransferase MDIQIKRAYEPADPADGCRILVDRLWPRGLTKQQVACDLWLKEIAPTADL RKWFNHDPAKWAEFQRRYRDELSVNPCVKELLDRAAKGRLTLLYGARDAE CNQAVVLRDYLLERN >NE0388 Domain of unknown function DUF37 MKQLIIDLIKLYRYSIGLLIPPSCRFYPTCSNYMHEALVKHGLIKGLWLG MKRILRCHPWNQGGYDPVP >NE0428 conserved hypothetical protein MVPASRKESITIEVAFALPQRQFLRRLQVPMGTSMYQAIKLSGVENFYPG SDSTALKTGIHGKITDPETILHQNDRIEIYRPLVIDPKEKRRLKSDRLTK GQK >NE1012 SURF1 family MLMFKQQFKPALWSTAVTILAIALFLKLGFWQLSRAEEKEASFALLERYA QQPPVTVPEPLIELDDYLYRRVEVHGYFEAEHTIFLDNKTHQGVVGYHVL TPLRQVNSTTYVLVNRGWVSGGNDRSLLPDIYTPDGLVYLTGVVVSPSIR TLSLSDKQFTGKVWQSFSLDSYQNLTELTFQPFLLLQQNETQGDGLIRQW EKPDSGSSKNIGYAFQWFSLAVMTLIIYIVLNVKRKSIA >NE0092 conserved hypothetical protein MLCALKHLFQGEPQQQAVSGLISTSYTDNDGETRISLKLEPRYENAGEVI TKLVELDADPQVVRVGIQSSSIASFGSLENLVNAYGTLYRYLKDNYDSPA 
KLKKYWGYLANDVVFIQISTDVSSALKIFETINERGVGLNPMDLLKNLLF TQVGQAQFTQLKDEWKKITRPLEKGKEKPLFDHFVMQLENFLFYYIFTKT PTRDMERSFSQWANELRAIADAAECRCHNQWQGIDS >NE2472 conserved hypothetical protein MLASMTGYAAVSREIPQGSLALELRSVNNRYLDLQFRLPDELRALESEMR DFLGSGLTRGKVDCRLTFSSYSNSYRQQRLNRDLLQDLQIMSDEIKSIFP AAGDFSVAEILRWPGVLDSDHISIDDLRTPCMALLQTALQELVSARKREG EKLHNLLLERVQRMRQLVLDLLPGLPAILASFQERLLNSLKAAGLDEKDE RIRQEFTLFANRVDVDEELSRLQGHLNEFEHILLAGGVVGKRLDFLTQEL NREANTLASKSVAREVTRIAVEMKVLIEQLREQIQNIE >NE1309 hypothetical protein MMPKSHPSKNNRSTSAHSSNNRNENNVSTRVSAEIQSATFLGPIPHPAIL EGYEKIVPGAAERILIMAESSMKHQQQYDNALLEASKNRQHEVRFLVF >NE0497 putative NtrP protein MFRNRSNTMLSERYVRLFRNGKNQAVRIPREFELNAQEVIMRREGNRLII EPVPPKGLLVVLAELAPLEENFSDIDTRLAPLDDIDL >NE2505 conserved hypothetical protein MNSGEYKYIWQASDWPNWRFDLAALAEPMAEVSRAQGLLMGRLADVGMAL RDQASLAALTEDVVKTSEIEGEQLNVESVRSSIARRLGVDIGALAPVDRH VEGVVEMVLDATANCQALVSRERLFGWHAALFPTGYSGLSKINVGGWRDD ATGPMQVVSGPIGRQRVHFEAPPADRLETETSRFLDWLNGTLNEPPLLKA GLGHLWFVTLHPFDDGNGRIARAIGDLLLARADGSPQRFYSLSAQIQRER KAYYDILERTQKRSMDVTEWLAWFLDTLHRAVDQAQHTLDAVLTKARFWQ RWATTPLNERQVKLLNKLLDGFEGKLTSSKWAAIAKCSPDTALRDINDLL TRGVLRKSDAGGRSTSYELNDLPE >NE1419 conserved hypothetical protein MLYLIYGEDVPDSLAQRVASRPAHLARIRELQEQGRLLLAGPCPAIDSID PGPAGFTGSLIVAEFASLDAAREWADADPYLLSGVYAKVTVKPFRKVLPE >NE0723 hypothetical protein MKFSTILTFLSGTTFVYPAAFAQSIIASGNILPGIPAPPLALWQPANLRV GVNAAGTLSITDGGMVAGPGQALLGSAVGSSGTVMVSGNDSLFSTVLQMH VGSSGTGTLKIEDRGTANIGTFLYIGRFIGSDGLVTVSGAGSRLTNGNMM QVGSEGTGALIIEDGATVNSTNVTRIGWSSTGIGTAIVQGSGSSWTTNNS MSVGFGGSGRLLIVDGGAVSNVEGFVGRETGSTGEVTVSGAGSSWSNSAA LEIGSFGMGELMVEDGGALSNTDGRIGREAGAIGTVTIKDAGSTWSNTGT LYIGDLGKGTLTVADGGKANIVASFVIGRQAGAEGLVTLSGAGSSLINTS STQVGGAGKGTLIVENGGVGQSNNLSVGVSSGSTGSVAVRGADSRWIAGS ILTIGASGHGTLTIEDGGSVTSTLTTIGSNTSGLGEATVSGADSTWTNSG ALIVGALGNGTLTVSDGGMVSNATAGIGVGTGRQGVALVSGAGARWINSG DLTVGTNGSATLTVADGGHVSVGGGNGIVHVAEAATANGVVNIGAAAGSA PVGAGTLGAAEVRFGDGTGRLNFNHTDSAYLFSPVITGNGALNHYSGTTI LTGDNTYSGSTMIAGGVLQLGNGGTSGSVTSDIQIDSTGTLRIDRSNDWT 
YAGILSGTGVFDQLGTGTTMLTGNSAAFNGTTTVTNGRLIVGMGGAGTLG GMVNVLDGATLGGSGTVGSAGADVTILGGGVHAPGNSVGVQTIAGNYVNY GTLRIDGTPAGTDMLIVQGGVDITGATLDLQLSPPVASGWNIINGPFTII DKQSAGAVAGSFGAVNNNLLFLDPYVNYAGGDGNDITLDFVRNDVAFASV ALTPNQIATGRGIGTLPYGHPIWNTIALMSDEVAVRRSFDFLSGEIHATA SSVLLEESRFPRRAVNDRLRSAFDTNTDTAFWAHGYGAWAGWQSDGNAAS LKRDTGGLLLGLDGQLGNWRTGVMTGRSWTGVTVADRASTAEAGTWYAGL YGGTQWGDLGLRLGLLHGEHDIDTRRTVAVPGLSGTLRSTHGASTTQAFG ELAHTLHFGIVRYEPFANLAHIHTRSNRFTETGGSMALAGRRSTMSATVM TLGQRIEAAHVFRGTGIRTVGMIGWQHVWGDVIPRSTHRFSMGDPFTIAG TPLARNNLLVEGGFELSLGRRAAIGASYTGRFAHNGHDHAATAVLRIGF >NE0478 conserved hypothetical protein MYTGTVFENNRTQAVRLPVDVRFADDVKKVWVRKLGKERILTPVDHTWDS FFLAEQGVSEDFLSERASQEQQEREVF >NE0220 TPR repeat MFLRAFLPLLLLSCNVAQAALFGDSETREQLEALRTKVLEMEARMQRTEE VLMGQALIELHTQAENLKEEMGKLRGQIEVLEDENRSLRKQQKDFYLDLD NRLRQLEPGSAGTAASDSRISSPSSEQLAADTKDIKSPASGKTAAVLQLP DTAQRNRYDAAYASIKSGDYSGAVTGFESFLAQYPQSALAPSAAYWVGNA YYALRDFDKAITAQQRLIEIYPGSPKVADGLLNMASSQAEMGQKAAARKT LEKLIASYPGTEAATKAKQRLGTLK >NE0358 Domain of unknown function DUF143:Iojap-related protein MKSPEKLLETAIMALEDLKASNIHVMDVSKLTSLCTTMIVASADSTRQTR ALASHVQEKVKATGSMVYGIEGEQTGEWLLVDLGDIIVHIMQPAIRSYYD LEGLWSEQAWRSPQENSAVYG >NE2056 conserved hypothetical protein MKYWLYTSLFLALIFSFRSYANSDIWILIDTLEQRLSVMRGDKAQLAFNN IAIGRYGASSSRMKGDNQTPLGSFQISWIKQHHRYYRFFGVDFPNQEAAD LALAEKRISRQAWLSITKAIESSRLPPQDTPLGGYIGIHGIGRGDRTVHA RFNWTNGCVALTNAQIDELSSWIKIGTKVVIR >NE0009 MgtC family MNPARHTMDDLFLLEQERGYLAQFATSLAIGLLIGLERERSPAAKAGLRT FALVAMFGTLTAMLSHKAQTPWLLISGLLLVGIMVIASYRDKRDLPEDPG TTTVTAVLICYGLGAMVWYEESTLAVMLAIITTILLYFKTELQGITQNLT RKDLISILQFAVLSFIILPILPDRNYGPFDAFNPHQIWLMIVLISGVSLV GYIALRFIGQRYGAVLLGVLGGLVSSTATTLVFTRRNGDRPDITNLAVVV ILLANLVVLVRLALITEVISPVIFPYLLPVLGGGLFLGLITSLFWWREFS QQQIIPMPDTKNPAELTTAMGFGLLYAAVLFLSGWLSDIAGSSGLYAVAI ISGLTNVDAITLSSLRLHGLGKLEIVEVVTAITLGVIANLIFKLGLIFFT GNNVLARRCFLGTVAIIIGLAGALSTASYLSYF >NE2401 Generic methyltransferase MSSKPELFQRTPWLNGEDIEAKRTELRAYFHTTLDKYEQLFETLRGDEAY YDRPISLRHPLIFYLGHTATFFVNKLVLAGVLAERINPRLESIFAVGVDE 
MSWDDLNTAHYDWPTVEEVMAYRRAMRDKVDALISSLPLTLPISWESPWW AIVMGVEHERIHLETSSVLIRQHKLKYVQSHPAWQPCRNSGNAPENEMIL VPAGKVVLGKDKADPVYGWDNEYGHHEAELSAFQASRYLVSNQEFLGFVE ARGYETNDYWEEEGLAWRQFSGAACPTFWIREGDQWRLRLMTEEVPMPWN WPVEVNYHEAKAFCNWKQKTTGQPVRLLTEDEWYRLYDVAGLTEVPHTEP AKGNIHLDHYASSCPVDEFQQGEFFDIVGNVWQWTETPTYPFTGFEVHPL YDDFTTPTFDDRHNLLKGGSWVACGNESIWVSRYAFRRHFFQHAGFRYVI SDMPATHHSSHYETDKLLSEYAEFHYGDTYFGVPNFSVALAELAIAAMSG RPARRALDLGCASGRATFELARHFDHVTGVDFSARFINQGVALAEHGILR YTLVDEGELVSYRERTLAGLNLESVRHKVEFFQGDACNLKPILTGYDLIL AVNLIDRLYEPAKFLTMVHERLNPGGMLLIASPYTWLEEHTKREHWIGGF KRDGENFTTLDGLKEILGKHFRLIGAPCEVPFVIRETRHKFQHTLSEVTI WERVV >NE1943 conserved hypothetical protein MNRKRKWQIPPPSEPVTDEYLTSVTIGERKPLNDRIQLAEYDLRWPSMFS VAADKIRSALSEKALLVEHVGSTSVPGLAAKLIIDMLLGVTDSADEKSYV LPLEQQGFVLQAREPGWYEHRFFRLKSGDMEWHLHVFSAECEEIDRMLAF RNWLRVHDDDRQRYENVKRTLAARTWKHMQNYADAKSDIVREILGRAQDD HVVGITSEAVSAAFRFR >NE1375 Helix-turn-helix motif MLRDRISRELVSIETKKQGFIAFTPTYQADLGFNPEQSAIYTLRAELMSN LRKTIRERKWTQEEAAKVLNIGQSRVSDLMRGKWEKFSLDMLITLAIRVG KRIGITVV >NE0665 conserved hypothetical protein MVSPRLVTKIEAEEPTLRLAVLIDADNAQAAVIEGLLAEIARFGEATVKR IYGDFTAPASASWKKVLQKYAIKPVQQFAYTTGKNATDSTLIIDAMDLLY TRKFDGFCLITSDSDFTGLAMRLREEGLTVLGFGEKKTPEAFRNACHKFV FTEVLRPDTATESAAQRTKKTENDQKSSSPQPAAQAPETKQAFPRKFVLA ALEQSSDDAGWANLGNFGNYLNKLQPDFDSRLYGYKKLSDLVKARTDLFV TEERQVPGSTQKALYLRAK >NE1969 LysM motif MRTLTIISVILFTSLFSGLSFSAGEIGLRDDIPDRYQVVPGDTLWGIASR FLKEPWRWPEIWQMNRGQIRNPHRIYPGDVIIIENTRYGKRLRMASEKGV VRLSPRIRAEESAMRAIPVIPAAKIEPFLNQPLVIEKGKLDRAPVILGAS DDRVILSTGDKVYVGDLPADQGVIWQVFRNGKALTDPDYDNRILGYEAVY LGTVEITDFAVISTAKISRSVQEILKGDRLLPLSSARIDDYLPHAPDFPV TARIISVYGGVNEIGENMIVTLNQGSHDRIEPGHVLAVYRKNDIRSHEGK PVSLPDERIGLALVFRVFDTVSYALIMQSTQAIKVMDAVKTP >NE2360 conserved hypothetical protein MKLHLSDSSGLNVFSGYGEGYVAVNQVRYTDNMIVLPNRIIEHWQASSIS QLGMEHFDALLAMQPEIILLGTGTSLQFPDASLMRMILSRDIGFEVMDTQ ATCRTYNILSSEGRRVAAAILVRSTDG >NE1728 DUF173 MDNPTLIDPVAKVQGVPLLEVPQDLYIPPEALQIFLETFEGPLDLLLYLI RKHNLDILDIPMAELTRQYIAYVETMRADQFELAAEYLLMTALLIDIKSR 
MLLPRPVTPREDDADPRAELVRRLLEYERIKQAAIHIHSLPVAGRDFMPA CIWVDFVTEKQLPGINPQDLYDTWLALLERLRLNRNHTIRYETISVRVCM SEILRNLQSRGNVPFTELFSDITNVHKLVASFLALLELAREALVDIVQPD RFGMIYVHAIHTDQAD >NE2439 conserved hypothetical protein MKHTCSTSDLGYLKFNKRREMMGWNTVERNWKELKGKLKETWGDMTDDEL DVIAGKREQLVGKIQTKYEIAREEAERQVNAFAHDCDAAKEPLKNVGEAV SSRQKSVKKRSLYT >NE1702 hypothetical protein MKNNNNRLIYTGAFLAATLTFGQVVQADSLVLSRGNQTISRDTVVDSVVV GHTLDDSIAGSGHLIVNNGAQLVNNGTGHYLGMIDGVNFNAGNGYLGLNT DSMGVATITGASSLWQNEKDLYVGRYGTGSLNIEKGGKVTNAEGVIGDRA GSIGTVTVTDAGSLWQASRGIIVGVVGQGTLDIRNGGRVSNEATVIGISL DGHNGNVTVTGVGSLLESTVSTQIEYGSLNIENGGKVINGHDGFIGANVS SIGTATVTGTGSRWDNLNKLYVGGWKDGSVLGNGSLIINDGGAVSATNGV TIWSTGSLSGNGGTIEGDVINYGLISPGNSPGVLTITGDLTLESDSVLLM EIFGPTAYDQLIIGGNFVAGGILELDFGGYMPEFDVPYDLFQVAGGMSGD FSEIKFLNPAAGFDAGLLSLSFASGENGGMFQLIMANNGDPGNNTVPEPA SILLIGLGGLVMLMLRRRSPRISLA >NE2308 conserved hypothetical protein MSKKVTAIPPDYIEWLSDIKSRVTAARQRTVLAANAELIQLYWQIGRDIL QRQQSANWGDKVLDRLASDLRAAFPEMKGFSSRNLKYMRYFAEHCPRQAF GQQPAAQLPWFHVVTLLTKLADTGAREWYAQRAIAEGWSRTSLELSIRNR LHERQGQAVTNFGVRLPAPHSELAHEALKDPYLFDFLGLGDKAHEREIEN ALVRHITQFLLELGNGFAFVGRQFRLEVSDKEFFIDLLFYHTRLKCYVVV ELKATEFKPEHAGQLNFYLTAVDRQVKAPDDNPTIGLLLCKTQDRLVAEY ALHGIDKPIGVAEYELVRALPERLVTSLPTVEELENELLIAQETKT >NE1984 putative transmembrane protein MIRADIIKLYKTVHTWTGITCGFALFIAFYAGALTMFQEPIARWASPPAV GVAAVPLDDAPRLLELVAAAHPEAREEGIKLHLRGYENEPARVTWEEDES HEAGQPHGHVHWWATLKPDGSLLAKQEEPSELAEFINTIHMRIGIPEPWG SYFMGVVSLLYGVALIAGVIVLLPSLVKDFLALRVGRNLKRMWLDAHNVV GITALPFHIAMAITATALSLSHELWSLQEAVIFGGKQAILDERDNEPFRA PKPIGEAGAMMAPSQLLQRLKAQVPDFEPKVMIFKNIGDKAASVRVAGAE RGYVGDTHLGGVMLSAVTGELLKDSTRPSRQDPDRRASETFYGLHSGQYE GATITWAYFFLGFSGAWLFYSGNLLWIETRRKKACKGGELPEQRRDTYWL GAGTVGVCLGCVAGLSLTIVAAKWLYGRVDDLNHWHEYIYYAVFFGSVVW AFAWGAARSAVHLLWLCAAATMAIPLTTLLAVLFPALGMWAHGSAATIGV DVVAFIGALCFAWMAVATTRRIRNGPTDSIWSIRKADGDTVPHKPAATPA E >NE1822 possible ORF H1620 MIPQRNLSLISNTQVFAGGRRIPEAVIERDYVLAWFLTGLAGHPLRDVLA FKGGTALRRCWFVDYRFSEDLDFTLIRPITLDEILAGLNDIFAVIENACG 
LRIAFDREDRHGHQNSHTFYLRYQGPLPAANDVKVDITIDEVLCFPLVDR PIHRAYDGFDDLPEGPTVKVYALEEIVIEKLLALSDRARNEPRDLYDLWY LLNAADLRIAELRTELDAKLAHRQRTAAGIEQAIAAKEERLRRLWTTRLA HQMSALPPFDDVFRNVLRILRAAGLPRAND >NE0233 Toprim domain MATLLSKKQPTAHANYNNNLTAFHQEIINAGLTPPAGIIDDGKLHRFSSN GKRGDDSGWYVFHPDDPVAGAFGCWRLGLTQSWCSKSPDTMTPAERELHR KRVEAMRLEREADTARRHKEAAAKAQKLWANAEPADSEHPYLINKNVQSH GLRCVNDSLIVPLYSGSEIASLQFIMPDGEKRFLTGGKIKGSYSPIGTVQ VGQRIFICEGWATGATIHQSTGCAVFCSMNAGNLLDVGRYIRREFPHKEL VIAGDDDRLTKDNPGRTAALLAASTLDCGVVFPPFPDDAPMELSDFNDLQ NWRDMQ >NE0282 conserved hypothetical protein MNLAISFYILLAVTVALLVIGVMIYNSLVDIKHTVSQAWSNIDVLLKQRH DELPRLVETCRHYMKFEQETLSRVMEARTSIAHARETGNLPALGAAESIL RAGLGQLYAVAEAYPELRASEKFQHLQTRITSLENGIADRREYYNEAVRN NNIRIEQFPEILIARWFGFSARSLLKFTTINKHDHDLGKLFD >NE1356 conserved hypothetical protein MVYSLGYKTMYIVKRLDEFDKWLDGLKDRPTRIRLIRRLDKARQGLLGDV KSVGEGVFEMREFFGSGWRMYYIQQGGTIILMLGGGDKSTQSKDIQKAIQ LANDLGENSYE >NE0492 putative transposase MNHDHGYKLLFSHAEMVADLLRGFVKEGWVNELDFSTLEKINGSYISDDL RERQDDIIWRLRRKQGKQDEWLYVYLLLEFQSTVDWFMAVRIMTYIGLLY QDLIRSESIKTGEQLPPVLPMVLYNGDHRWQAPVNMGELILPAPGGLDRY RPQLNYLLLDEGRYHEHELVALRNLTAALFRLENSRTPEDVQQVLQALIT WLHTPEQDSLRRAFTVWLKRVFLPGRMPKTSFDEIHDLQEVYSMLSERVK DWTKDWKQQGIEEGKQIGIQEGKRIGIQEGKQIGIREGRQEGRQEGRQEG RLEGEVEFFLRLLERKFGPADEITQTRIKSADSQTLLRWGERILVAQTIE EVFEE >NE0686 hypothetical protein MKTLSTLFLLLAIVAQLTACNTIQGFGKDVQRGGEAIEKTAK >NE0878 conserved hypothetical protein MQTGNLFTGFSTPRKGETFETLLSRRNLVIERIVSSAELSPQKYQQVQDE WIALLKGTAILEVDGKTIELNTGDYLFLPANTPHTVKWTSHGALWLAVHV Y >NE0357 DUF163 MKFHILAVGNKMPDWVRKGYTEYCQRMPKEAELLLVEIKPEKRVGSKNTR QLLQAESERIRTVLPPGCHIVVLDETGKQATTMKLAEMMDRWMGSGQDVA FIIGGADGLHQDIKQMAHEKLALSAMTLPHGLARVLLAEQLYRAFSINRN HPYHRA >NE2104 conserved hypothetical protein MIAFEFDEAKSQANLLKHGISFVDAQALWNDPRLLEIPAKTEDEPRYLMI GLINGKHWSAVITYRGTNIRLISVRCSRTEEVTLYES >NE2572 conserved hypothetical protein MSKLIITDNLEMLTEQGATRRRLLQAGLGACALLAMPAANAAYSRVYEKR VSLLNLHTGERVRTAYWERGKYIPEALRMIEKVLRDHRSGDIHRIDPRLL 
DLMQHLHHKTGNSKEFQVVSGYRSPATNAALSVQSHGVAKNSLHMQGKAI DIRLPGVPLHVLRRAAMSMHAGGVGYYPKSNFIHIDTGNVRYW >NE0697 possible transmembrane protein MIKNVSEEEILLRKRARRRFIGAVTFVILSVVFLPMILDDAPQQEQQQID IQIPSEELTAETYPWMTPENAPAIDEAEIASDPDKPLPFSDSEYSGKIES GRSTSGIPVPAKKPPFVKSTPASVPAPVVQEKVPASNNVKVQEGAFVIQL GAFSDVSKAKQQQQNLVANGIRAYTETIKVGNNEMTRVRIGPFATRDAAE AEHERLKKTGLSGVVTTK >NE2194 conserved hypothetical protein MTVSDHPQTVSQPDPESESRPSKTRLKQEMHALQALGERLVELEPARIAE LDLPEKLAEALLEARKITSHGARRRHLQFIGKLMRAVDPLPVQEKLDAWQ HTGMRHTAWLHQLERWRDRLISDETAVTEFVQTYPHTDVRQLRTLLRNIE KEKLAGKPPHNFRALFQLLRQIIPEIPG >NE2251 Type I antifreeze protein MPIYEYACHACGLEKEHLQKMSDAPIANCPACGSSDYVKKVSAAGFQLKG TGWYVTDFRNKNTRSDSKPKEESGKEAADTDKAAATTTTDSTTATATTAS TSTTAPTVSSVD >NE2493 Uncharacterized protein family UPF0016 MESFLVSTGVVALAEIGDKTQLLAFLLAARFKKPLPIMLGILVATLINHG LAGFLGAWITATVSPDILRWILGLSFIGMAIWTMIPDEIEQEETLIAGKF GIFGATLITFFLAETGDKTQIATITMAAHYGTPFMVVMGTTLGMLIADIP AVFAGEKLATRIPMKLVHSIAAAVFALLGVATLLGAGSKLGF >NE1268 conserved hypothetical protein MLLTAVLLPATEGGFTALNPETGTTSQGETVEEALANLREATELYVEEFP LTIASRPLVTTFELPAHA >NE2418 conserved hypothetical protein MNQVTTYLWETHLPGGSHWSGVVRRGTTLRLTDVSGGANAAVLFYNLEEK LERYNMADTLKTQHTFCLTKGHACHSDMGRIFCCITEDTAGWHDTVCGLS DAELIQQKYGTGRYQELRNDMYRNGLDGMLVELGKWGLGRRDVVSNINFF SKVTANSVGELQFHVSHSKAGDHVDLAFAMDTLVVLSAAPHPLDPAATYR PGAVNLAVFPSTPAVTGACAHIAENARGLENTRRLYAYGGVK >NE1546 putative membrane protein MGQTESIKTIIHFKETKMEKISQFVARLFLGQIFLLAGISKISSYAGTQG YMDAMGVPGTLLPLVIALEIGGGLAIIAGWQTRLTSIALAVFTLAAAAIF HNNLADQTQMIMFMKNIAIAGGFMLLAVHGAGGYSLDSRRARP >NE2500 conserved hypothetical protein MLTHPVQTRSRLAHAMFALLNPIPFGFFVAALIFDAIYACNANVFWVKSA AWLNVIGLIFAIIPRLINLVHVWMPARRSSRVEKLDFWLNLVAIITAIVN AFVHTRDAYGVMPEGVWLSAVTVALIAIGLIVSALQQATTRGGRHE >NE0910 hypothetical protein MKKIGSATVFAAVMLLFSFNNAFADSEGLPADRVITAIQTAVAANPGLIH EVEVDQEHGKLIVEIKIIDAKGQKTKVKIDPEKNEVIR >NE2358 conserved hypothetical protein MLQRSVFFLSDRTGITAETLGHSLLTQFDGIEWKKHYASFLDSAAKAQAV IEQINTIAEQEGQPALVFSTLLDPVMLASIRRADCVLFDFFETCLGTLEA 
VLQQPPARIPGRSHVLRQDASYFRRIAAIQYALNSDDGANAKILADADVI VVGVSRTGKTPVCVYLALQYGVLAANYPFTPEDMGAIQLPPLLQPLRKKL FGLTLNTSRLQSIREERYPGSHYASFAECQRELQWQNELYRQFDIPSINT TDVSIEEISASIVNRAHLERRQHGT >NE2367 conserved hypothetical protein MKLHLSDSSGLNVFSGYGEGYVAVNQVRYTDNMIVLPNRIIEHWQASSIS QLGMEHFDALLAMQPEIILLGTGTSLQFPDASLMRMILSRDIGFEVMDTQ ATCRTYNILSSEGRRVAAAILVRSTDG >NE2225 Protein of unknown function DUF86 MADDVLINKAATIERCVARAHEEYAANPENFATDYTRQDAAILNIQRACE AALDMGQHLIRREKLGIPQSTRDVFSLLARGGWINIELADSLKRMVSFQN IAVHDYIALQLPITVRIIENHLDEFLQYSQTLLLHDAALGGNQRG >NE1428 Hypothetical hesB/yadR/yfhF family MGTTIHEETSAAQPPLNFTDGAASKVKELIEEEDNQALKLRVFVSGGGCS GFQYGFTFDEIVNEDDFVMEKQGVKLLVDSMSFQYLVGAEIDYQESAQGA QFVIKNPSAASTCGCGSSFSV >NE0759 DUF150 MSLFELLEPTIAGMGYELVDIEQSAPGRLLRVFIDKKEGSITLDDCVAIS NHLSQLLAVENIDYNRLEVSSPGLDRPLKKKADFVRYMGESARIRLRIAL QGQRNFVGTLVEVNDDVLTLNADGKLLQIELRNLEKARLIPKL >NE0167 conserved hypothetical protein MANDGYFEPTQELSDETRDMHRAIISLREELEAVDLYNQRVNACKDKELK AILAHNRDEEKEHAAMLLEWIRRCDPAFDKELKDYLFTNKPIAHE >NE0864 conserved hypothetical protein MNLHLFVPDIFWPDGSQTDIYQHLKLPALETILAKSNRFEVGGEVLESWL CKIFNVNKQQDWPIASILLQREKNRIEVGESYWLRADPVHLRVENNHILL GDNQILNISLKEATSFADSINEFFSDEGVTLLPLHSDRWYVKCDETPELQ TFLLSQVVGKNINDLLSRGKEGAIWNSRINEIQMFLHEHPLNRNREIQGE LPVNSIWIWGGGVRPSEVRTSYTGIWGNHVLVHALAEMGKVMCHDLPENA DEVLNHSGNSGEQLIVLDNLQKYACYRDAYNWRNELMKMERDWFDPLFQA LKKRQIQQLKLTIINESSTKDFVLTPGSLWKFWAAVRPLGTYS >NE1756 conserved hypothetical protein MLWIKSLHIISMVTWFAGLFYLPRLFVYHAMCTDQAGIDRFKVMERKLYY GIMTPGGLLTIIFGTWLWLGYGFSGGWLHTKLALVVLLVAYHLYCGKLLI DFRNDRNRHGHVYYRWFNELPVLALFAIIILVVVKPY >NE1703 DUF190 MNGYQITFFTQQDRHHAGKPLADWLMHLAAEMGLRGATLIPGSEGMGHDK RFHSVHFFELSDQLLEVVMVVSEEEADQLFARLRAEGVRLFYVKVAAEFG IMD >NE1882 conserved hypothetical protein MFTSLRLTHFKAWQDTGMITLKPVTVLLGTNSSGKSSLIQSLLLLKQTVQ SPDRSIHLNLGGDEISDLFDFGHFDEVIKHGTSSPREFSITFTFRTAGQS RINSGEFSCGYRQTATGVTAIQELVLSAAEHRFRAIRREKGSYAIFTDDE TRPRAKGEQLAPERSVALPAAAIVALGSAGALAEDISLAIRRELENICYL GPLRRKPERDYVWNKTTPGQIESDGHRAMDILLSSVLIKNDKQNEILDEV 
SFWLNRMKVAERLEIRQVGRSARYEIVVHQGGTITNLRDVGMGVALVLPV LVAGYFAPAGSTVILEEPEVHLHPLAQAVLAEFFVALSRSRRIQFIVETH SEHLFRRMQTLIARKNTSVEQIALLFVESAAGNAALRRLEVDEFGRVSNW PQYFFGDALGETREQARLMHTRHQEQHRK >NE2365 conserved hypothetical protein MLQRSVFFLSDRTGITAETLGHSLLTQFDGIEWKKHYASFLDSAAKAQAV IEQINTIAEQEGQPALVFSTLLDPVMLASIRRADCVLFDFFETCLGTLEA VLQQPPARIPGRSHVLRQDASYFRRIAAIQYALNSDDGANAKILADADVI VVGVSRTGKTPVCVYLALQYGVLAANYPFTPEDMGAIQLPPLLQPLRKKL FGLTLNTSRLQSIREERYPGSHYASFAECQRELQWQNELYRQFDIPSINT TDVSIEEISASIVNRAHLERRQHGT >NE0020 possible sugar kinase MNTRPIYTTAEIRKIESLVLSVPHSPPLMEKAGLAAAKVAHTRLLTDDKQ RILILAGPGNNGGDALVAARHLREWGRQVTLVLTGEAERLPQDARQALEQ WQSAGGTVIPELPDGGQWDAAIDGLFGIGLNETRLLAESYRQLIRQINQL NLPVLALDIPSGLLSDSGRVPDVAVKAAITTTFIALKPGLLTHDGCDYCG EIVVCDLELDVAALIPPQNWLLDRAGIVQRLPSPRRANSHKGTYGRLGIL GGATGMIGAALLTGRAALKLGAGRVYLGLLAQDDVPVVDPVQPELMLRSP SDFFNPDFLEGLVIGPGFGSEIAACICLERALQTCLPLVLDADALNLIAQ HTELSSALQARKAPAILTPHPAEAARLLNTSVTEIQRNRLEAARNLARKF NCAVVLKGAGSICAFPNGHCHFNTSGNPGLSSAGTGDVLSGFLGALLVQG LLPENALLLAVYLHGAAADVLLKQQNGPLGMVASEIIPAARNLLNCWIEE ENPGW >NE1410 hypothetical protein MSRWQQALLIMSIMVAAMPDTSARRHDAASGHDRAGKPAGGISEQRAIAI AQQHFSGRVLAISQTDRVYRIKILSDQGTVHTILIDALNGAVVSAR >NE1889 conserved hypothetical protein MRYWLMKSEPSEVSIDDLAARPGQTVPWDGVRNYQARNFMRNQMQPGDLV FFYHSSCPEPGIAGVVEVSRLAYPDETQFDSASKYFDPKSTRENPRWFNV EVRFLRKTRLLSLRELRSYPELAGMRILQKGNRLSITPVDPSEWKFIEAK LQS >NE0395 DUF167 MSWYSFGNDRSLLILKLYVQPGARQTEAVGICGEELKIKLAALPVDGKAN RALTEFLAKRFNVPRKNITLKRGEQSRHKVVEVCQSSNGPEVLFSEMRAE >NE0004 conserved hypothetical protein MSIEPDGTVNRPEENIHLLQLHDSPVMRWLYLSIGMTALFMGILGIFLPI LPTTPFILLAAGCFARSSERFHSYLLNHRIAGPIIREWCEYRSVARHVKR WAYLVMALSFGSSILIVSSWWLKGMLALLAMILFTFIWRLPVRDQSR >NE0131 hypothetical protein MFIFNLNKKESKMNEDRIKGQWKQLAGKIKERYGIAHDEARKQVKKFHDS L >NE2372 conserved hypothetical protein MNHQARMRWRCRRGMLELDIVLQRFIDNHYEQLDEHQLELFEMLLSLSDH DLWNIIIGNTKEPNNQFQPVLKLLQEN >NE0339 conserved hypothetical protein MDLSMTTPEEITQYLQDHPEFFEEHPDLLESLRFPHPYEGRVISINERQV 
AMLREKNKLLQNRLQELIDVGENNDAISEKMHRLTVALLGFGSLPELLHE LQYHLCEDFSIPHVVLRLWQIDEFGTEADLPSPEFDPISNNVRILAQGML RPYCGPEVDDEIRQWFAQDAEYLKSFAVIPLKKQSNFGLLVMASPEAERF YPDMGTLYLERLGDMVSSSIMRLIQQTPGAANRESAS >NE1732 DUF174 MAADYRNPESIIEKGRARLNPPPLYKVILINDDFTPMDFVVKVLRHFFLM NEEMATKVMLKIHIEGAGICGIYPSDIAVTKVQQVNDFSRQNQHPLMCVM EKE >NE1499 putative membrane protein MMIDYLILKHLHVTCVAISYTLFVLRGIWMLNASSRLRQRWVKIAPHIND TVLLLSAVALAVLTHRNPLVETWLAAKIIGLLLYIMLGLVAFRLGKIRRA KVMAWILAQIVFAYIVSVALTKNPLLF >NE2305 conserved hypothetical protein MNPPFYTIGHSTRTLEEFIGLLHAAEVEQVVDVRTVPRSRTNPQYNLETF PDSLAAFQISYEHIPQLGGLRARSKTVSSNVNGFWENQSFHNFADYALSD TFHEGLAKLIALGRKRRCAIMCSEAVWWRCHRRIISDYLLMHGETVFHLM GHDKIEIARLTESACPQPSGAVTYPSRNSLVN >NE1534 conserved hypothetical protein MIIATRNRLIHGYLGIDNDTIWSIIQDEIPKLLRQLTAMLESIR >NE0210 Domain of unknown function DUF28 MAGHSKWANIKHKKAAQDAKRGKIFTRLIKEITVAARLGGGDPNSNPRLR LAMDKAFGHNMPKDNVERAIKRGCGELEGVNYEEIRYEGYGISGAAVMVD CMTDNRTRTVAAVRHAFTKHGGNLGTDGSVAYLFKHCGQLLFAPGVGEAQ LLEAALEAGAEDVISNDDGSLEVITGPDTFVSVRDTLEKAGFKAELAEVT WKPENEVLLQGDDAVKMQKLLDALEDIDDVQDVYTSAVLDT >NE0316 conserved hypothetical protein MQYHEPVWIPAHLTTGEPDSFAAFTLEERLPAILDNLANHADARTDQALQ QLAVEIREGEITPLPPVILGLFDQVIKPYTGRRWTDMPFLTVELYFYARI LLAFGHTATTLVDPFHPIKNTVSLQAIESLTTMTGYCDSDCDITGLLRWS MTGNTADLSQQVVTDAGQISLLVDESYVAGGLFDSGLNRIDFVFDNAGMD VLTDLLLILRISRHCSRIVAHVRPWPMFVSDVTMTDMKYLIRKLVTSSIP AAKKLGEDITQLLHQNRLIFRSSSALGLPVCFCEEEALTRETFENTELVI FKGDLNYRYFVGDRRWPHTMEKRYFFERFSQPAICLRTLKSEVLVGLPAD IATRTLHLQPDWLTSGRYGIIQVFAH >NE1588 conserved hypothetical protein MRVIALSTLKVFWENDSSRADAIQPTLAWHRHALKADWSLPAEVKADFKN ASILKDGRAVFNIAGNKYRLVVWINYPYKIVYIRFIGTHAQYDRIDAQTI >NE1166 DUF185 MALPLPDPSEQAYSDTLKTMLHERIAHSGGWISFADYMETVLYTPETGYY SGGAAKFGTAGDFVTAPEISPLFGQALARQIAPILSAVNQGSILEFGAGS GKLAVDLLCALEELNNLPQHYYILDLSADLQQRQRAMIEQHIPHLASRVS WLSALPEQFEGLILANEVLDAMPVHLVAWQNGNIAERGVIWKDQGPVWQD QPLAAGELLDVARQLPPADQFSYPLYISEISLTNRHFICSLAMLLQRGAI LLVDYGFGQNEYYHPQRHQGTLMCHYRHHAHDDPFFLPGLQDITSHVDFS 
TIARTALDSGLQLAGYTTQAHFLINCGITDLLARTPADQPGSYLPLVSQV QRLVSPAEMGELFKVMVLSRDIDIDAASCGFTRGDLRRLL >NE0192 conserved hypothetical protein MLDKNVLNEIGSKVNEILASSPARDVEKNMRAMLTGAFARLDLVTREEFD VQQEVIKRTRIKLAELEEKVRKLEQQLQQPAEVVSSNETGDACCETQP >NE1276 conserved hypothetical protein MNLPFQNDALGKLTLRLTVGVLILLHGVHKIFNPGSLDYISTLLANVNLP QILAYGVYLGEVIAPLMIILGIFSRIGGLLVFGNTVFAIGLAHRSELFAF TDHGGYALELQAFFLLTGLAVFFLGSGRFAIKPD >NE1163 DUF198 MNKQIDLPIADVQGSLDTRHIAIDRVGIKAIRHPVVVADKGGGSQHTVAQ FNMYVNLPHNFKGTHMSRFVEILNSHEREISVESFEEILRSMVSRLESDS GHIEMAFPYFINKSAPVSGVKSLLDYEVTFIGEIKHGNQYSFTMKVIVPV TSLCPCSKKISDYGAHNQRSHVTISVRTNSFIWIEDIIRIAEEQASCELY GLLKRPDEKYVTERAYNNPKFVEDIVRDVAEVLNHDDRIDAYIVESENFE SIHNHSAYALIERDKRIR >NE1487 conserved hypothetical protein MTEESLIEYPCDFPIKIMGKSQQGFTQSVLSIVKTYAPDFDDTTLEVRSS RNGAYLSLTCTIQATSRTQLDSLYQALHDHPMVTMLL >NE2248 conserved hypothetical protein MPSFDIVSEVDKQEIRNAVDQLNKEVSTRFDFKGSDARAEQTDYELYLYA DDEFKLGQVMDILMTKFTKREIDVRCLEKGQTEKISGNKVKQKVTVKTGV ESDLAKKIIKLVKDSKLKVQASIQGEVVRVTGAKRDILQEAIQLVKGSIT ELPLQFRNFRD >NE0147 conserved hypothetical protein MKDNELDERDGLARDQKSDLIRDAEPVETSDQAVSLQAEPVGTMPYSPAQ AGDPAEAATTGGTAASGFGQILRDARIRRGMNVGEVAHRLRLSEQQVEAI EAQDFSRLPAAVFLRGYIRNYANLLQLDDVPLLMEAVPQARPVDTVFASK RNAQRFKAIEPVYRSGRSSRGGWLYIAVILAALAAYGIYRDEVPEQLASF SAGDTDQVMSSISADGNDQVAIDLALPLSSSSSGLPLVTPAPAGASVPSV ATTLPELPAPSVPAVEVPKASDDGKKSLHFSFSRDSWVKIKDSGGRVILE KTHSRGTEQTIEGKPPLYLVIGNAAGVSLTYNGRKVDLAPYTRGNDDVAR FSLE >NE1074 conserved hypothetical protein MSESRLLDYLDHIQQAATDACSFVEGLAKEDFLENKQTQQAVIMSLIIIG EAVTKVMDGYAEFAQAHAQVPWRNMRGMRNRIAHGYFDINLDVVWDTVQA ALPELLKQLPAVRQDADNEARDKPC >NE2091 conserved hypothetical protein MAEYFFDLNKRDQREALEYGRAETGRPIHLLEKDIWVVWALRALFVSPLS ADLTFKGGTSLSKVYKLIDRFSEDIDLTCDIRKLIPDLVGKGDELPASRS QAGKWTQTVRHRLPDWIMQNVQPVVQAALVREQLNARLELGGSDNDKLFL HYPALAQGTGYVAPVVTLEFGGRATGEPHQVFPITCDIAAHLADVSFPIA SPLVMSVARTFWEKATAAHVFCAQGRIRSERYARHWHDLAAIARSPHFAA VIADRTVAIAVARHKSYFFIEKDADGQAIDYIAATTGHLKIVPEGEAKAA LARDYAAMLADEVMVGNALSFDALLKACADLEARVNRAAL >NE0929 putative 
signal peptide protein MDNHTENENPTPQPPAEPPRRLRKWLAGIATVVVLAVTSGFYWLIFTSAG LHWLLMIASQATGGALTFSGVNGSLHHTVHAGMIAYQSDELTARVEDMTF RWQPRQLLTGKLHIETLMIRSVEVHSAASEQEEEPVTLPENLSLPVDAVI EKLGIRSLKRYTLGNDQPDIVMTDLALRLDSDGKRHHLQTLALGLEWGKI AGNLEIGIHPPFDLASQLVFYNWANVANTSSEPAGYAAVRLGGNLQQIQA ALTIADRKLAGKGNFTIHPFDALPLLEANLVVSGLDLNVFSSDLPKADLS FSSQLAQKQAEQLTGHITISNAMARPLDQDGIPLKNAHMLLDLTHDKIQL SDIALRLSEREEVPGSLTGEANWQISTKTGQADIHVRRLNPTDLQSSLRP AKLSGNLHFDGNQESQQGSIKLRDEALRLNMDMALVRTASAITIEKLDLV RGDSSMSGHGTLQLDESQPFTFEGLLRQFDVSAFADVPPSNLNATFNLAG QLALQPAANIDFMFEPSRFANQLVSGQGSLVLQQPAHIRSDTHLRLGDNQ LEIKGALGNPGDRLAVVLSAPKLAQIGFGLQGDIHSRVELGGAIDHPDIT FEIDSNHVGFREEHRLAHLKASGNLRGTALQLDLQTGEYQQGSQTHLQKL SLGLSGTQAQHQLSLTSQIDQATEVQFLARGSGDTSRKQWEGVIEKLSLT GSVPLDLVTQPSIKISPETVALGHTRIMAAAGEIDIQDARWTPKQWSTQG NFTGIALKSGDLSSEHTEPLKLRGNWQLAADRQLAGHLRIQREKGDFILP TETPFALGLQTLLLDLQAENNGLNGQLTIRGKHVGETTARVSVPLQSTGS TWEIRKNAPLKGDLQLNLPDLAWIGPALNNNLRSEGRVTAQASLAGTLDQ PELQGRMTGDELTIALLDQGLQLKEGRLAVDFNQDRLRLETLNFTAPLEK PSKDRLLKNIKLSRKSGQLNAHGSLDIHNQQSHLTVELDHLPIAQQADRW IVVSGNSVIGFKEQALDITGKIMTDVGFIKQPAAGRPELADDVVISGKTE AEPESPAMQVNLDAILDLGERFFLRASGLEGRLAGQLHLLSKPDQLLSAI GTISTRDTRFEAYGQRLQVRRGIVNFDGPLDNPGLNILAVRTSQESDFDT GSSDVIDQPSQDSSVNALAVRGGMRVEAGVEITGTVRHPKIKLVSQPEVP DSEKLSWIVLGRPADKSGLDNALLLNAAGSIFGGTDESVLENITQGMGID DFSIRQQAGGSLTNQVGTIGKRLSSRAYLSYERGLTSASAGIAKLTYSLF PRVSVVTRAGDDSSVDLFYNFQFD >NE0581 possible predicted diverged CheY-domain MTVLIVGGDYIASLKQRITAHGYSRIEHWNGRKKGFNKRALPGRTKLVVI IYDYVSHNLANSVKDQASRIGIPMIFCRHAMHEIDTIFDEKKAEESCCNF V >NE0779 Conserved hypothetical protein 46 MTARFFHTPPIRDAEWITLNSDTSHHAAQVLRLKPGDAVTLFDGTGGEFS GRLEQISKSGCQVRIECHLPVERESPLMIELAQAVCANEKMDWIIQKAIE QGATRIQPLITRRTLIRLTGERADKREQHWQKIIIAACEQCGRNQIPLLL PLMPLSHWLEQKLAEKYKNDNPAGHDIMLSPAAHQRLVELSPPRTGECLT LLTGPEGGFTGEETDAAHLAGFIPVRLGNRILRTESAALAAIAAAQTLWG DY >NE0632 conserved hypothetical protein MADRLLTRILDDEDSDDPILSVVNIIDVFLVIIAGLLIAILENPLNPFAA QDMVVIRNPDTPQMEMIVRKGEEMKHYKSTGEIGQGEGVRAGVAYRLKDG SMIYIPEETDKATGTSSK >NE2107 
conserved hypothetical protein MFVWRISRKEFALDRTGYGASIKGQRWNSASIPAIYAGLSLGIAAMEKLV HTGSNLPLNLVVVRMTLPDDNSLYKIPPPDALPDGWSALPGSPTAATYGD SFLLNGKYLGLIVPSAVIPEARNIVINPNHPMMKEVTIEIIRNFTFDSRL RS >NE1371 phage-related protein MPSDFKPMLAVGPGAYEIRIHIMGEWRVIYVAKMQDTIYVLHTFQKKTQK TSKHDRYRQIIKEITNGKNN >NE0151 conserved hypothetical protein MSAYNFEEQEKIEGLKSWWAANGTTMLFAIAVFAATVAGNRLWNHYKAEQ AQQAADLYAVLQQQVEKGSELVKITDAAHLLTEGFPGSGYASRAALIAAR AAGQAGNDQVARDMLQWALDHAEEPEIKDMARLRLASVLVDESKYDQALK LLDAQHAASFTGLYADLRGDALAAAGKTDEARAAYQKALDSLNAQGAYRN VVQMKLDVLREQRQ >NE0829 conserved hypothetical protein MRKLFKKYLPSHESIRQNRFVNFFGTLLYHHNLWHLHRRSVAGGVAAGLF AGLIPGSNPVQFFFATLFSVIFKVNLPIAAFVTLYSNPFTIVPLYLAAYT LGGWVTGGTRNSGSLPPLELGLLDKNLSEWIPVLTDYLVTFGKPLITGLF LLASLLSITGYFTVRILWRYYVVHAWHKRAKRHHPDK >NE0738 conserved hypothetical protein MEYTRLATLVFSVNGCFLMMSRVSQALAPVLFLPHGGGPLPVLGDKEHEK MVSFLREIATELGEPPAILIISAHWEEEQATITSNSQPGIIYDYYGFPAA AYEIQYAAPGHPGLANEIYTLLTANGIPARLDEQRGFDHGMFVPLKLMFP QARIPCVQLSLLNNLNPRMHIALGKAITALRSRNILIVGSGMSFHNLKAF FSSTVDGRGENEAFDNWLIETCTHPAIAPEMREQRLIEWEKAPFARFCHP REEHLLPLHVCYGVACVDTPTARTVFNGEIMGRKVTSFLWQ >NE1352 conserved hypothetical protein MWELRYTHQAQKDAKKLASSGLKDKAEELLAVVRNNPYQTPPPYEKLVGD LAGACSRRINIQHRLVYQVLERERIVKVLRMWTHYV >NE1496 conserved hypothetical protein MTEIERKFLVATFPDGELHAVPLRQGYLTTPTDSIELRLRQQGTEYFMTL KSEGGLSRQEYEIQIDVTQFEMLWPATEGRRVEKTRYSGKLPDGQLFELD VFAGHLSPLMLVEVEFLSEDAAQAFIPPPWFGEEVTEDKRYKNKALALSI P >NE2110 Bacterial regulatory proteins, AsnC family MQEPRILITQELLKLIAEIDEFKGKWEVLKNLSPERLRQLRKVATIESIG SSTRIEGAKLTDMQVETLLSNLSSMSFKTRDEQEVAGYAEAMDIVFQAYE DMTITENHIRQLHQTLLRHSNKDERHRGEYKKIDNHVVAIDEHGKEIGVV FETATPFDTPRKMEELVRWVNKAITENSFHPLLIVAVFVVVFLAIHPFQD GNGRLSRILTTLMLLRAGYSYVPYASLESVVEDNKDLYYKALRRTQTTLK TDSPDWEPWLGFFLRCLKKQKANLAAKIEKEKAADDTILPALAVQILELL KKHERLSIAEMVEHTGANRNTLKVRLRELVSTGRIQRHGKARATWYVLNR K >NE1372 Helix-turn-helix motif MEKITDSSGNIFTDLGFNPEQSAIYTLRAELMSNLRKTIRERKWTQEEAA KVLNIGQSRVSDLMRGKWEKFSLDMLITLAIRVGKRIGITVV >NE0849 similar to nodulin 21 
MTHDNTHYSHRTGWLRAAVLGANDGIVSTASLIIGVASAHAAADDILLAG VAGVVAGAMSMAAGEYVSVSSQSDTEKADVALEQYHLDRDIDFELQELTD IYMKRGLQPELAAQVARELMAHDALDAHLRDELGLHERVNAKPVQAAFTS AGMFILGASMPLAATIAAPATTHIIPVVAISSLLSLTALGTFAAYLGKAN MLTGAARVAFWGALAMAFTALTGTLFGIIA >NE1885 DUF202 MSDLKDPRVLFAAERTLLAWNRTSLSLIAFGFVVERAGKLVRAISPGIIG PDQLAAMFWLGLAFIVLGAITSVYSARQYAVILKTLTPDEFPPGYAAKWG LLVNLAVAILGLILALALWWWRLH >NE0394 YGGT family MPNQIMIFLLDTLLSLFSLALLLRFYVQWSRVPYYHPFTRFLVAVTDFIV RPAGRVIPSWRGLDLSTFVLAWLAQFIILVGVNLLGGFGAGSSMFAFALL ALVKLASMTLNILLISIIVQAVLSWINPHTPLAPVLESFTGPVLGPIRRY IPPIANFDLSPIFAFILLQVLMMVVENLQRQIIQMF >NE1890 conserved hypothetical protein MNKDTVLNVTVMGREFRIHCPGEEREELLLAVSCLNRKMQEIKSAGKIAG TEQIAIAAAISMTHELLSIRSQRGFDMNEFKRRIELLECRVSDALTDQGI YKNS >NE1032 Domain of unknown function DUF74 MLLTTTPVIEGKRITHYYGIVAGEAVLGANVLKDLFAGIRDFVGGRSGTY EKELQHAREIALEELQENAHRLGANAVIGIDIDYEVLGKENGMLMVSVSG TAVFVE >NE1451 Hypothetical hesB/yadR/yfhF family MAITLTERAARQIRQQLERRGKGVALRLGVKKSGCSGFAYSFDYADEVQE DDQLFESHDAQVVVQRDQLSFIDGSEIDFIQEGLNSSFKFRNPNIDNTCG CGESFSLKT >NE2151 conserved hypothetical protein MRHPIHPMLVHFPVATLFLATLGDIASLFMDEQVSRVAGVLLVIGTITTL LAMVAGLMELGKIDQQSPAMKVANQHMMLMMASWSFYAVSLFLRLDGTRL GQPGMVAVAMSVAGLIVLCIGGWLGGKLVYEYGVGTRSSQP >NE0446 conserved hypothetical protein MTIPAKNTVCLWYDDCAEDAARFYADTFPDSFVGAVHRAPGDFPSGKQGN VLTVEFTVMGIPCIGLNGGPVFTHNEAFSFQVATTDQAETDRYWNAIIGN GGQESECGWCKDKWGLSWQITPIVLINAITHPDPAAAKRAFDAMMQMRKI DIATIEAALRG >NE0021 conserved hypothetical protein MKHEANKRAGRIDFAGWAKRGMLIGMAAIVTSYSLPLLAEDIGSVSTRFK FLGANDKIVVEAFDDPEVAGATCYLSRAKTGGISGTVGVAEDKSDASIAC RQTGPIVLSEKIKNGKSDGDEVFKKSTSLLFKTLQVVRFYDAKRNVLIYL TYSDRVIEGSPQNSISVIPVMPWH >NE2102 conserved hypothetical protein MLPEIWKHDSPYFRPEGDTRGEAVLPIGMPMRQQYDFSSSRKNPYAAKLK KPVTIRLDEESISYFKSMSEETGIPYQSLINLYLKECAASGKRLNLSWK >NE1659 conserved hypothetical protein MRHKDKQKLGISLLLLFTTLPAVVHVYAGSWVRPPDDIDIFGQMQTVTAS REETLLDVARHYGIGQDEMVLANPNTNRWLPEDGAEVVLPLRFIIPQAER IGLVINLPEMRLYYFPKPAKGQKPEIITHPVSIGRMDWNTPLGRTTIVRK 
QKDPTWTPPQSLKAEAIAEGKPPLSDVVPPGPDNPLGRYALYLGLPGYLI HSTNKPFGVGMRVTHGCMRLYPEDIEELFNLVPTGTPVQIVNQPVKLGWQ ENLLFIELHPPLEEDDTTPYDYEQKVHSAIAEFLEKTAKDPDGRMTRNTR ISPEALESAIRARNGIPTLISENLEN >NE0550 conserved hypothetical protein MSSESAHQHAAGVDEERGTLPRRSWRQLWLSIHLYLGLFIGALLVILGLT GSIAVFWAEIDEWLNPELLTVTVPEQKNLAPGAPAYQSLDEIIRVARQAA APDSRITTVYGARNSEAVYAVYASQPSSAWQRIFVDPYRAQVTGVRSYGA NEWIPNYFMDVIFQLHFSLLLGMNGQTLMAVCALLLLVSLITGLIVWWPT SGQWRKALTIKRGAGPVRFNFDLHKTLSLYLFPVLGAVLLSGVFMNLNEP FVWVTQLFSPATRQPQHTLTSIPITGIPSIGAEHAWAIATEHYPDGKFGG MFMPGNAEGVYIVTQKHVPKLSAFWSERQIAIDQYSGEILDVRAPDARRS AGETFLEWQWPLHSGQAFGWPGRILIFLCGLACPVIYATGVIRWLQKRRV KVRSPRRPIR >NE2076 LysM motif MISLNFSVVAKLLLQALQGLIKGLFFPIQWKHIFILNNQIFFSSIRYYGI CLFFQYDGLSIRSSYQVCDKTIYWLLPVYALFLLIGSDGVRAEDWLYTIR SGDNLWNVAERHLVSMKYVPRLQQLNRIHDPHHIPPGKVIRIPVEWATRR PGDAEIVDCYGTATLRRAASAEILPVTEKLRVAIGDEISTGPDSQVTLEF RDRSRLRVESESKIRLKQAEVLGQDGVVMTEVELESGRTVNIAPHGSEPA TRFRIRTPAAVSSVRGTRFRVGADKQDGTTRSEVLEGLLEVSAQGRNVQV REGYGTVTRPDQSPVSPVELLPGPDFSATPDLYERVPLIISLKPLAGARS YRVQIALDPEFRQISTDLVAMNLPLRGRELADGDYWLRVRGQDVSGLEGK DGVKKIRVHARPEPPFIMAPQSGARVGDNRPVFEWATRPDIKRYLIEVDR TADFRESRKYESDSPEGYFMLSEPLTPGEYWWRIAAESEIIGVGPYSDSV SFKVPVPGPELDSVEIEREEIRFAWPAGETGEQFQFQMARDSGFDNIISD VLVSEPRVVIPNSGGGRYYLRIKPIRADGEEGVFGPVQSFEIPYRFPFFL PFLGFM >NE1354 Helix-turn-helix motif MKMRSQLLIVLQEHLRNSGLTQFKAAELLGVTQPRVSDLMRGKIDLFSLE SLIDMITSIGLKVEINIKDAA >NE0914 conserved hypothetical protein MRKLLNFLTNFLAPASLLVLVSMLGGCGYNTLQSTNEQVQSSWSEVLNQY QRRADLIPNLVNTVKGFAAQEKEVLLGVTEARSRAGSIQVSPELIDDPEA FARFQNAQGELTGALSRLLAIAENYPQLKSDANFRDLQAQLEGTENRIAV ARNRYIKAVQEYNVTVRSFPGNLTAMMFGFKVKPNFTVENEKALSTPPAV DFGAPATAQ >NE0916 conserved hypothetical protein MDPVRILRHLITGQSAIKRAFPPATLTVIEQAIAHSETLHGGEIVFAVEA SLDLPLLLQNQIIRERAIDVFSLLRVWDTEHNNGVLIYLLMADHDVEIVA DRGIHTKVDQTIWETACDTMKTTFRHGQFEQGVLAGIDLSTRVLQQFFPA STGKRKSELPDRPVVL >NE0265 hypothetical protein MTKREISGGMVHELPEDLKKALIAHPEALETWEDITPLARNEWICWVESA KKIATRNKRINWGCESLSEGKRRPCCWPGCPHR >NE0224 DUF205 
MITVVLIFSAYLLGSISFAVVASWLFKLPDPRSYGSRNPGATNVLRTGKK AAAAVTLLGDAGKGWVAVAAAKYGGEVWELGDEVIAGAALAVFLGHLFPI FLAFKGGKGVATSAGILLGLNPWLGVLTISTWMVVALVSRISSLSALLSA LLAPLYAYFLLEKGILIMAVSIISVLLILKHRLNIANLMAGKEARIGKSS >NE0434 DUF149:Conserved hypothetical protein 103 MKANLGNLMKQAQQMQENMKAMQEKLAAIEVEGQAGAGMVKVTMTCRYDV KRVNIDSSLIGDDKEMLEDLVAAAVNDAVRRVETVTQEKMASVAGGLGLP AGMKFPF >NE0251 conserved hypothetical protein MATPADKLAESLAVLKTLRDQGRKALRSEDMGRTHRERLMRNGFIKEVMK GWYIPSRPDEPAGESTSWYASFWVFCASYLESRFGDEWCVSPEQSIHLHT GNWSVPKQLLIRSPKGGNKPTGLLHGTSILDVRLELPPASDTEIKEGMRI YNLPAALVGCSQTQFSAHPTEMRTALTMMQDASELLGRLLAGGHSKIAGW LAGACRSIGRKQIADDILGAMRAAGYTVNENNPFKDQAQIIFSPRETSPY VNRMRMNWASMREDVLHSFPAAPGLPADSAKYLKQVEAVYVNDAYNSLSI EGYKVSAALIEKVRSGNWNPDSNKDDQDHRNALAARGYWQAFQKVRESIG KVLSQENAGTVAESDHAQWYRELFGPSVTAGILKAADLAGYRNSPVYIRR SMHTPPGKEAVRELMPVLFELLQQENEAAVRVVLGHFMFVYIHPYIDGNG RIGRFLMNLMSASGGYSWIVIPLEQRNDYMAALESASVEGDIKPFSTLLA NLVSTG >NE2417 conserved hypothetical protein MSAQNLVESTLNPDHAVINEICDAGEPWVKEIKKGQIFRIVDLEGNQAVD TLFYNAHDAMERYSATDTVRRQHRLYLTTGSKLYSNFGNVMLVITADTCG RHDTVGGACAAESNTTRYALDKYPMHSCRDSFLYALAHDPVCERLGMSKR DVPANINFFMNVPVTEAGKLEFADGVSAPGKYVEMRAEMDVVVLISNCPQ LNNPCNAYNPTPVRLLVWN >NE2119 conserved hypothetical protein MTRTTGIYSISTNLGESVRAFVPHSLPPSDPDLSPKMFTDLNQQAELALA RLAGVSGLAPSVDWLLYSAIRKEALLTSQIEGTQATLTDLFDEEAGFKVS NTDDVEEVTNYLRAFRWTQEQLRDPKGLPISVRLLCEAHRRLLDGARGAG KQPGELRRSQNWIGGTRPGNAVFVPPPPGHVPALLADMERFIHSTATDLP PMVKVALIHAQFETIHPFLDGNGRIGRLLIAALFEHWELLTEPLMYLSGY LKQHQAEYYRRLSAIRTDGDWESWVTFFLEGVATATGDAEKNIIEVASLI ATDRKRMLQSTKAGPASYRLFEMLPMMPRFTIERVRRQLDTSFPTATAAV RVLEDLGIVTEMTGQKKNRSYSYQAYVELLSR >NE0982 Domain of unknown function UPF0040 MFRGSTQLSLDSKGRLAIPAKYRDELFASCGGNIVVTADPSRCLLIYPQP VWEPIEKKLNSFPSLSPQIRSLQRLIIGNASDVEMDSSGRILISAPLRQF AGLQKEVVLAGQGEKFELWDMAKWDLEIDTATTYKDGDIPPELEGFSL >NE0523 conserved hypothetical protein MRSQMKISTILATHDKTDLVILLLRITTGGVFMAHGAQKLFSWFGVNGLE ATGQWMNSIGPNPGYLMALLAGSGEFFDGLALLSPESESGGISQWDS >NE0065 conserved hypothetical protein 
MTRSLFLKPSVWLTVLLLLTLWLDKNLQRPDSQQDSGTQQEIDYIIENLD GIQINHELKVNRFFSADKLTHYPVGDITQLEHIGLVSIEPDKPLLRVTSG RAELAGGDNDIFLTRNVAIIRGEDKDKDKVTMLTDFLHLIPDTDIAKTDQ PVTVTRMNSVINAIGLFMNNQTGEILLQSRVTAHDDRTPRTAR >NE1461 DUF192 MLRSCLAVLLMVSSLSNTAEAEDQSLPVVKLSILDHVITVELADTTAART TGLMYRTYLPEDSGMLFVFPVAGIHCMWMKDTVIPLSVAFLDETGKILNL AGMIPETLTPHCSASAARYALEMDAAWFEARKIKAGDRVMQLPDTGR >NE0152 PQQ enzyme repeat MIGSISLSAPDFTFHNGYCRALRVCILALLILLGGCANLSDITGAHFTDL FSGDEDEVEIDEAELAELQTLAPIKLLWQVKLSESKTAVFLPVYDNGALY AADEDGRLIKLDPATGREIWRVDTKSQLSGGVGTGGGMILLGTYKGEVLA FDEAGNALWQSQVPSEILSPPQTDNGIVVVRTGDSRLYGLNAADGKQIWS YQGVTPPLTVRSFVGVSITRGAIFAGFPGGKLIALDLFTGNVGWEATVSQ PHGVTELERMTDISSLPIVDENQVCAVAYRGRAACFEISNGNQIWARDAS SSAGMVMDNSHVYISEEHGTVAAYDKSSGAAVWKRGKLGSRKLSALMVAR GTRLIVGDDQGYVTLISRQDGSLLSRAPTDGSAISSRAEYLPDGFVVQTH KGGLFAFSLQ >NE1692 conserved hypothetical protein MREIGLTIGLLSISNVFMTFAWYAHLKDLSTKPWLVAALISWGIAFFEYM LQVPANRIGFNVLSLGQLKILQEVITLSIFVPFSLFYMKEPLKLDYLWAG LCILGAVYFIFRGNMADAGS >NE0266 conserved hypothetical protein MQKIATCLWFDHGEAGKAAEFYAATFPDSRVERVNTAPGDFPGGQEGNEL TVEFTVLGQSFIGLNGGPNFTPDQAVSFMVLTNDQEETDRYWNAIIENGG SENNCGWCQDRWGFFWQITPKRLMELTTGSDRDKAKRAFEAMMTMNKINI AALEAAVRE >NE1084 conserved hypothetical protein MKSNADSLTDTATESDPPVRGFRQRMTWLHTWGGLWAGWVLFAIFLTGTL GVFDDAITRWMKPERPLVAEVAPGSAEQRAQAVRLAQTYLQQAMPRGEFW SIGLPGESDPAIRLFWRENEDAKFQQTRLDPVTGTELDKAVDRETEGGHH FVHMHFEFHAGEAGIWMVGFFAMIMLVALVSGVITHKRIFKDFFTFRPKK GQRSWLDAHNVASVLTLPFQFMIVYTGLAIFYSLYMPAGIFAHYPNKDTY FSQLLSRPAPREETHIDAQVASLDKLLLTAETELGRRASFVSVNHPGDSS ASVTVFGLFDEEENEKYLLPPGSGNVIFDGITGETLDIQMSGDHRGGEAQ AVQRVMGTLHFARFGGDTIKWLYFISGLAGAIMMATGSILFMVKRRQKAL NEFGSHTRRVYRLIETLNVAVIAGLCIACIAYLWSNRLIPVGIEDRSHWE ITTFFTVWLMALLHASIRPVASAWVEQLSLAALLCLALPLLNWLTTGQQV LTYGLQGDWERVGVELTVIGLGLLLATMAQKARSMAPVLPPRSAVVATQQ KTITASVPYRNSILMRVLAATLGGYAVASGLAILLPMVLPIARAEAVLAS TLLSFAAYTGVIIWVFSARAPKRAWQGVFFLAIGCALTILFNATFGGM >NE1654 DUF152 MIDWIVPDWPAPANVDAIFTTRNIGAAENRGIYAGLNLASHVDDDPLIVQ QNRNQLRQYLPDHPRWLTQVHGSQPVWVDSSNETLELEADAAMSRRPGVV 
CAVLVADCLPVLLCDMAGSVVGVAHAGWRGLAGGVIENTVRELRRFSSSD RIIAWLGPAISSRHFEVGDEVREVFTQYDHRAACAFLPGKEAGKWYANLF DLARQRLSHAGVNQVYGGDLCTFSNPEQFYSYRRDGKTGRMAGLLWMTQP AGMQ >NE0267 conserved hypothetical protein MTYVDGFVLPVPEGKIDAYRQMAESAGKIWMEHGALQYKECVLEDAKPEM PEDAPETCKITPFGKLAGTKDGETVIFAFIVYKSREHRDEVNKKVMADPR MQEACDENNMPFDPSRMAYGGFKALVDL >NE0755 conserved hypothetical protein MAERRSLSILEDEDGDDPILSVVNIIDVFLVIIAVLLIAVMENPLNPFTL QDAVIIKDPGKPSMEMIIKQGEELKHYKSTGQIGEGEGTKAGTAFRLKDG SMIYVPEQEQ >NE1881 DUF208 MEREKLELPENGKKLLLHSCCAPCSGEVMEAIVASGIDFSIFFYNPNIHP RKEYDLRKDENIRFAEKHGIPFIDADYDMDDWFKRAKGMEMEPERGIRCT MCFDMRFERTALYAYENGFDVISSSLGISRWKNMDQINDSGVRAAAQYPG ITYWTYNWRKKGGSARMLEISKREKFYMQEYCGCAYSLRDTNKWRVEHGR EKIRIGEKYYGDVTE >NE0603 possible transmembrane protein MIHKLPKWVEIGGFLLSFNAGYVNAIGLLGFEHQAVSHLTGISTFLSLEL ANHNMQAVVHLLLVMAGFIIGAAYSGFIIGNVALKLGRNYSLALITESFL LGISMLLLNYGSPVGHYFASAACGLQNAMTSTYSGAVVRTTHVSGLFTDL GVALGLRVRGQPADTRRIVLYLILIIGFISGGVAGAVCFGQYRFSAILVP CIVTTLIGAGYWLFTHQTYWRLK >NE0898 conserved hypothetical protein MRWDIFCHVVDNYGDIGICWRLARQLVTEFDISVRMLVDDLGAMQRICPA IDPRLAVQNIRGVEILHWVEPFADLVPADVVIEAFGCELPPRYIAAMAAA SPRFSEPEGKTDRIWINLEYLSAEQWVEGCHGLASPHPSLPLIKYFFFPG FTAATGGLLREAELFTLRDASRTDPAGLWRELGITNPAADEATVSLFCYD SAPISDLLEAWAGSISPVRCLLPEGTASASAASWAGISRLAAGDSIQRGN LTLHVIPFMSQENYDHLLWACDCNFVRGEDSFVRAQWAVRPIVWQIYPQQ ENVHLIKLEAFLDLYCQELIEPAADAVRAFHRSWNNNEQPDWNCFWKYRD VLQQHAMAWAERLAQIPNLASSLVNFCRNR >NE1894 conserved hypothetical protein MKLPMNTSSTPAYPQEIEISPDDLPLYCPNPLMDARSWHPRVFLEIEATG SAMCPYCSTQYILKGTPNPDHHHS >NE1758 dedA, DedA family MFLVDFIIHIDSHLQELVSEYGVWVNGILFLIVFCETGLVVFPFLPGDSL LFAAGSLASLQGSQLDPHFLFVGLTLAGILGDSVNYWVGKKFGITIFTSG KFRFLKQEHLDKTHAFYLKYGGKTIIIARFIPIIRTFAPFVAGIGTMPYR KFIAYNVIGAVLWVGIFVYAGFYFGQLPLIQKNFKLVILAIIILSITPPL IEYLRHRFGKNRPGVS >NE2125 fiu, iron-regulated outer membrane protein MLLTIDDVLTQEELAIARSMLARSAWVSGLVTAGTQAAQVKNNQQVQEND PQIVNLRRLVLGALNRNALFFTATLPEKIVPPFFNRYSGETNHYGFHVDN AMRLLPDGSGYVRTDVSATLFLSDPQEYDGGELVINDTFGQHGVKLQAGS 
MVIYPSSSIHQVTPVTRGERLACFMFIQSMVRNPDQRRLLYEMDMALLQL RQNIGETPAVVSLTGTYHNLLRQWADS >NE2164 lpxK, Tetraacyldisaccharide-1-P 4'-kinase MNWYELYWQRITPLHLFLWPVSQLLILFQSVRRFLYRRAILTSIHLPVPI IIIDSITTDSPVKTSLIIQIANILKAAGLRPGIISRGYPDNHRPPTRVTI SSHPHLTGEKSLLLTYHLRETCPVWIGYDRIETAKALLNAHKECNVLICD DGLQDLRLQRDFEAVIVDTSVINSGNGLIMPAGPLRDSFARLKHTDAVIL AGHQRRIPDITDEIRTIHTRPQKEHFFNLSWPELTADAAGLAGKRIHAIV CDPDTQNFLDNLEFLKLTVTPRVFPENHHFIATDFQSDEAEIILIPEEDA VKCLSLHDDRIWVLQQEYRVDPGLREIILKKLREKFMDPKLLDILVCPLC KGSLIYKKDRLELICKADRLAYPIRDGIPVMLEDEARKLPDEEEIK >NE0971 phnB, conserved hypothetical protein MTTPYVQPYLFFGGRCEEALEFYRTTLGAQVDMLMRHKESPESPPPGMLA PGFENKIMHASFHIGATTLMASDGCGEDSHFDGFSLSLTVPTEAEADRTF AALAEGGQVRMPLTKTFWSSRFGMLTDRFGISWMITVQE >NE1967 smg, conserved hypothetical protein MFDILVYLFENYFDAGNCPDSATLTRKLTMAGFDDEEITLALDWLSDLSR HDEEGYLAGLAESDSMRHFTEEEMEIIDTEGRGFIFFLEQAGVINPLQRE LLIDRVIRMDGDTASIEKIKLVVLFDLWIQNQLTDRSIVEGLFVVSDSHQ RH >NE2573 ycbB, putative periplasmic protein MTPLHQKSRKAFLSFDQGVSEFRLNYLAIAQYLVFFCFALTVLSAAAENR PVAVSADAIREQLAKNAWEGVNEAELIKLRHFYAQRDYQPVWALKEDSGV LLDTALTFIGRADEEGLASSDYHIETLRRWRAESPDQVSLPLELGTTRSL LALVHDLSNGRLTATLADPDWYVPQRRLDPVNFLQQSIASVDSAEQLEQV LASLPPNMPQYHTLKRLLVRLRILVAAGTVWTRIPDDIPSIHPRTRHAAI PLIRQRIREAYSVFEKPEYDIASDDSELYDDQLETAIKAFQYQYGLNTDG VVGKNTRRAMNMTAVEHIQQLRITLERLRWLPREFSNRYILVNIAGFNLA AIRNNVRVLNMRIVVGRDYRSTPSFNSRISHLVLNPYWNVPASIASKDLL PKQKHNPDYFASEGIRVFSDYHYELELDPDAIDWHAFSRSFPYVLRQDPG KRNALGTIKFMFPNPFSIYLHDTPSKSLFQRDIRTFSSGCIRLEKPMQLA EFVLGPSFEKANILEKIDSGKTQTVHLPEPIPVYLLYLTAWNDGQGEVHF SADVYGRDKRALAYARWLQPEPHLSQSF >NE1335 ykvP, possible spore protein [UI:20467420] MIGRAIRKLGSLVYKYPEIPDAVERPGPFGRLKIAMVTDYFTADCLSAEC RVKALTPGNFRMVIGEWKPDLIFVESAFHGSRGSWRYELAKQPKWLRLSK PTAIYQLVEFARSRGIPTIFWNKDDDVFFDAFIDVAKAFDYVFTTDNECI ESYRQQLPAHVPVNPLIMPYQPAFHNFTGFEFTRNEACFTGSYYQRILNE RKLFLDMVFDACERTDLSLNIFDRNHDRLSRHFEFRFPENSRLHLHGRVP HRETAKIYKSHAISLNVNSITRSETMYSRRLLEILACGGIVVTNPSQAVD RYFRDYCHVVSSSDEAQELFSRLRYGPSPDDMARAEAGAAYVRQNHTWVH RLEEICTVVKI >NE2078 ylqH, possible 
similar to flagellar biosynthetic protein MKRPEPTGQENQGEPTDSPAPPEHTPIPGAVALAYHSGMTAPQVVAKGRG LIAEEIIRRAQEAGVYVHESAELVALLMQVDLDDHIPPELYIAVAELLAW LYRLENDLAMPADSPDPSQ >NE1910 yqaA, putative inner membrane protein MICGHLRTTLRFTYCYQKIDTIPMNENASLLALFTSSFLAATLLPGGSEA VLAGVLVAYPDLFWPALNIATLGNTLGGMSSYVIGRLLPDEQALSEKIGR HVHGLEWIRRHGAPVLFLSWLPLIGDILCVAAGWLRIHWASAALFIAAGK FARYWVIALAIS