GenColors

Gene list

Applied filters:

COG category: General function prediction only
Organism: Nitrosomonas europaea ATCC 19718
Gene type: CDS

Number of genes found: 268

# Nitrosomonas europaea ATCC 19718

>NE2227 CBS domain
MFFEWVVDPAAWAGLATLVILEIVLGIDNLVFIAILTDKLPPHQRGKARI
VGLSLALIMRLILLASISWVITLTTPLLTLFDVELSWRNLILLFGGIFLL
FKGTMELHSRLEGQDGQKEGKGVHAVFWQVIVQIIVLDAVFSLDSMITAV
GMVEHLSVMMIAVIIAIIVMMVSSGPLMIFVSRHPTVVILCLGFLMMIGF
SLVIEGFNFHVPKGYLYAAISFSILIETFNQIARHNKEKLITTGDLRDRT
AEAVLRLLGGRHGEVGLGETSEVIAQQVAENDLFAREEKEMIEGVLTLAG
RPAMSIMTPRTDIDWLDLGDTSEMIRAKIIDSGHSRFLLAHGNVDEFVGA
AFAKDLLRDMLEEGKINLEKSLRHPIVVLERVPVIKLMEQLRNQTLQLAV
IVDEYGSVEGIVTPADILEAIAGEFLDAGEEKVVAEQQADGTWLMDGWIS
IRKASNLLEHDLVDEAERYSTLGGYLLWQFGYIPAAGEQITVDGLIFEIV
SVNKHNIGKVRVHRTQPENE
>NE2550 putative ABC-2 type transport system permease protein
MMMLTIAGKELKLLFASSLVWIFLAGMQLVLAWVFLGRLNTFLEIQPQLA
QLANPPGVTEVIISPVFSVASIVLLAVTPVLSMRLFAEERRNHTLAMLIS
APVSTSAIVLGKFMALMIFFCLIPLLIVTLAISLLTGGTLDFGLLGSNVI
GLILLAGCFAALGLYISSLTSHPTVAALGCLGVLLCLWVMDIVAIESESA
AHHFSLFRHFESFNIGLIDSFSLVFFLLFTITFLVLTIRHLEGERLNG
>NE1349 conserved hypothetical protein
MIGLDTNVLLRYLAQDDAIQSPKSTLLIESLSVEEPGFVPLIVIVESVWV
LSSAYGSTREEITEVLHNLLRTRELRIEQAETVAAALHLYQRGKADFADC
LIERTAMRAGCKAVMTFDKTAVKSCGMQLID
>NE1067 putative plasmid stability-like protein
MLRTWLEEHVLSSFADRILPVDTVVARRSAALHVPNPRPYRDSLIAATAL
VYGMTLITRNVADFEPMGVTLLNPWTPCWPRISGLVP
>NE2333 Uncharacterized protein family UPF0054
MPTRNMPLTKNPVESPDHEQELVLTVQYVADKTDIPNRRLFRKWVKAALS
KPAEVVIRIVDRQEGEILNRDFRGKSSATNVLTFVYDDDVPLLGDIVLCA
PVICNEAQQQGKDLTAHYAHLTIHGILHLQGYDHIRDEDAVVMESLETEI
ITRLGYPDPYVIQH
>NE1683 hypothetical protein
MRAPTSHIQGMFGVTLDDLCGRWGFPYPNYIKIDVDGIEIPILKAATSVL
KHPNLQSVIVELGTDAEQQAASDIMQQAGLKLKTKTTRNWGETCCLFERN
PAA
>NE0320 GCN5-related N-acetyltransferase
MMSSEIRVRVCSSGDEEALALIGQATFLETFAGVLAGSNILAHCARAHSV
ECYRSWLADSRYKLWLAEISPGNAPIGYMVVSPAELPLADISLRDLELKR
IYVFSRFQGNGLGRRLLQEAIAEARIRKAERLLLGVYVRNDAAIGFYTRM
GFCKLGYRKFNMGGQKCDDYVMGLTL
>NE2114 putative plasmid stabilization protein ParE
MKHYLLSPEAKTDITNIRQYTTQQWGKTQADKYILRLRERMRWLADNPML
GRARDEIKEGYRSFSEGDHVIFYRMAGSAIEVIGIPHQNMDIEQNLSSGN
LLLPDIADYEPEDG
>NE2342 Metallo-beta-lactamase superfamily
MTETCQLMEQLPAIIDYPNGISVIDARYHRPGRAAIHLITEGDKAALVDT
GTRFSVPGTMAALAHKQITPEQIDYIFLTHIHLDHAGGASEFMKWLPNAR
LVVHPRGASHMANPIKLIAGVMAVYGESEFRRIYGEIHPIAAERIIEAPD
NTLVELNGRPLRLLDTPGHARHHYCIHDARSKSIFTGDTFGVSYREFDMD
GLEFVFPTTSPVQFDPEAAHASIDRLMALHPEQAFLTHYGRIRNLSYHAS
QMHALIDAFVSIVQQAAEIQDRQPIITKALQDLLLERLAAHRCTLTRDAA
LSLLQTDIKLNAQGLEIWLEQTIRQSTQAAGSS
>NE0552 conserved hypothetical protein
MMKLFWTPEALQDRDAIYDYIEVDNPRAALALDELFSEKAQRLPDHPALG
HPGRVAGTRELIAHQNYIIIYDVTGELVRVLRVLHAARQWPPSEND
>NE1582 putative plasmid stability protein
MATLTIRNVDDVTKRLLRIRAAQHGVSMEEEVRRILRQELSRAGSSQFPF
GQHLLSRFAESTSKEFALPARQVPRTPPSWDEPI
>NE2508 hypothetical protein
MTKLPVLALLCSSLLLPLAAHAGDGCDGLPNWQQLKQALANARKDANGGF
NLDMCGTVVATDGTVCAVAHTGAKVGDQWLGSRVISAQKANTANAFSLPG
LALSTANLYSATQPGGSLFGLQFSNPVDTQVAYAGSAADFGTAKDPLVGA
RIGGVNVFGGGLALYDAKGTRVGALGVSGDSSCADHNIAWKTRHALALDY
VPAGVSPAKDDNIVYLDKAEQANGFKHPMCGGTEDKVTLPPVRKR
>NE1849 conserved hypothetical protein
MQVIIISGLSGSGKSIALKVLEDSGYYCVDNLPASLLVVLINHLQTQQHA
YVAVAIDMRSGENITVLPWQLKMIDKSIQIKFIFLEARTETLMQRFSETR
RRHPLSDKNITLEEAIRREREALATLTGLGHHIDTSSLRPNVLRAFIKDF
IADSRSPSQLTLLFQSFGYKHGIPLDADLVFDIRCLPNPFYDPQLKELTG
HDPEVIRFMESQPDASKMLRDISSFLGTWLPAYIRDNRAYLTVAIGCTGG
QHRSVYFAEKLALHFHDSAHVLVRHRGLAEYKPHYARR
>NE0908 probable hydrolase oxidoreductase protein
MAGTMTVKLETRNQHRCFDGVQSYYQHHSEIIGLPMRFSVYEPPQVQEAG
GQRLPVLFFLAGLTCTEETFMIKAGAQRYAAELGLILVSMDTSPRNTGIP
GEADDWEFGTGAGFYLDATESPWSRYFHMESYVTRELYDIILDRFPVDPE
RVGIFGHSMGGHGALTLALRHPGHYRSVSAFAPIAAPTQCLWGQKAFSRY
LGESPANWRKHDATALIESGCRLPIPLIDQGLNDPFLKDQLHPGYFEAAC
QQAGQSVILRRHAGYDHSYFFISTFIEDHLRHHYTWLTSNGQ
>NE1515 Ankyrin-repeat
MNRSLWTQWSIASVIVLFCAFCLYSPAHAGVDEDLVRAVEDNKTHRVRDL
LTKGASPDARDLQSETALMLAARNKNPEMGGLLLEAGANPDLRNKYGETA
TMLACYYGQLDLVKRLYAKGAKIDHDGWNPLIYAASKGYKEIVEFLLNYG
VRIDAATDNGTTALMMAVRGNHYDTVELLLKHGANALIRNEADGTALGWA
RKQGHTSIVQLLTRNGTAD
>NE1682 possible transmembrane protein
MIYWGMGMLVIAWLADGGISRLSDLIKEPLALGVLLFCGVWVLGLLWSDS
AVIFQGKWRKYFILLTFIPLFSLLSRERLPWVTGALLCSYLGMVILGSYQ
WAMQEMQGISLLGMSYLHYSAALGIGVILAVFLGWETLSREKRWLSVLSW
LIAILLLFLQFNQSARGILLATLMALLLMIVLRYRAEWRMLTGGLTAIMA
MVILFATSSDIFHDRLQQAGTDLRSFQQGDYQTSVGYRLAMWDVGLHGIA
EHPLLGHGSGMAKKYFDDSIITYKQGIYRNLPEFQETAHFHNELIEIGMH
LGLLGILAFVFLLWCWFQIFRQNRMALLGSAIVCFICISGLTDTFLLYSG
TPPFLLTVTAIAVCWRKYREDPDRIGQKYTSGERNGNQNPQMFLMSA
>NE1575 hypothetical protein
MKKQVIKGMLAASAAALMLGVSAAHANVIDELVGSAKLGNSGDGTELAFI
RSITGDNTLTLDFKINDNDSSFNVMSNGLDSWFIDVAPDTPGYFMLKLGV
PGNSTLHSHYVFKNIGELDKLVWSNDQVNYLTGGNCGLNGSPNSCNIGRL
SHYVGTQGIGGEDPEVPGEIPEPASMLLFGAGLLGLGLSRRRKLV
>NE1075 DNA polymerase beta-like domain
MRPSVVLDMKRSAVREAVGRFRTTNPRVFGSVLHGTDHDGSDLDLLVDAL
PGATLFDLGGLQVELESLLGLHVDLLTPGDLPPKFRAKVLAEARPI
>NE1018 Uncharacterized protein family UPF0005
MQFNPKYATPVSNTVNSGVRNKVLKNTYLMLSLTMIPTIVGSVIGTGTNF
SFLAQSPIVGSLVMLAVMIGLMFAVSATRNSMWGIILLFLFTFVAGWWLG
PLLQYALHFKNGSQLIGLAAAGTGIIFFTLAGIATTTRKDFSFLGNFLLA
GIILVILASLVNLFLAIPAISLAISAVAVLVFSGFILFDVNRIVNGGETN
YVMATLGIYLSLYNLFISLIQLLLAFLGEKD
>NE0672 putative
MSEQIRLENLTIFMQRRAVKHVYLRVHPPDGRVTLVAPTGMRLEVVRAFV
TAKLDWIHKQQAKLQAQVREVPRQFIGGESHTVWGRHYVLQVVEKSAGPC
VLMDQQTITLQVRPGSDLFKRAAVMHAWHKSLLHAVVPDLIGKWQDRLGV
SVSAYFLQQMKTRWGSCNTRRRHIRLNTDLVRKPQDLLEYVVVHELVHLI
EPSHNKRFVGLMDQHYPAWREATAELNQIPLSAIRPRG
>NE2295 prolyl aminopeptidase
MPLHNHNLCFLYNKPVQHTMTHTGLFPPVEPHDHGMLPLDDTHTMYWEQS
GNQNGIPVLFLHGGPGAGATPAHRRFFDPARYRIVIFDQRGAGRSLPLGE
TRDNTTPLLIEDIETLRQHLGIERWLIFGGSWGSTLALAYGEAHPDRCLG
FILRGIFLCRPGEINWFLYGLRNFFPEVWREFVARLSPIEQCDILSSYYR
LLMDPDPAVHMPAAKAWGRYEGSCSTLLPNPDTVDYFTSDTVALGLAKIE
AHYFRNNIFLPENSLLENVHKIHHLPGVIVQGRYDAVCPIVSAHDLHLAW
PQADYIVVNDAGHSAWEPGILIELVKATEKLKLIL
>NE1469 conserved hypothetical protein
MSLQAKLSDALTLEKPWSARESWRVFGIVAEFVEGTERLECIQPAISIFG
SARTPPDHPHYKLTEAIARQLSDAGFSVISGGGPGIMEAANKGAFYGKSP
SVGLNIQLPHEQHRNVYQDISQTFRHFFARKYMFIKFATAYVVMPGGFGT
LDELMEALTLVQTGKTRKMPIILVCSDFWTGVIDWFRQVLVQHDFISSED
MDLIQIVDEPSQVVDAIFRYYETSGFEPSAAERNIQLNL
>NE2034 conserved hypothetical protein
MSLAEFRAQYLVRFWSPIPALLALGVASAYYFAITGTFWAVTGEFTRWGG
HIAALLGFSPQQWSYFQLIGLNGSPLERIDGVMIIGMFAGALCAALWAGN
VQLRWPTSRRRLAQGLIGGIIAGFGARLAMGCNLAAFFTGIPMFSLHAWA
FMLTTVIGAWIGVKLCLLPFLRTPLRLDTAPSSLFADTASLARRARLQNR
LGLLIAVLVLGFAAWRFETSLVLGLAVLFGVFFGAVIERGQICFTSAARD
LWTTGRTRIAYGILLGMVVACLGTFGAIALGATPKIFWMGPNAALGGLLF
GIGIVLAGGCETGWMYRAMEGQVHFWIVGIGNVIGGTLVAIFWDELGGTL
ALPYPKINLLEYLGAGTGLLLSLAGLMLAMLLVYLNARRFAVREGLAR
>NE1570 Type III antifreeze protein:CBS domain:NeuB family
MRIEKKIKDHLVFSGDSILDALKRINDNQSRIVFVVQDNGVLIGAVSDGD
VRRWMTQATEFNLNLPVDHVMNRNFIARPVTESQHQIADYFDHKRDIIPL
IDEQGRFVALARKSATGLQIGDFLIADQNPAFIIAEVGNNHNGDIGLAKE
LVNLAVEAGADCVKFQMRDLSSLYSNQGRNAEAGYDLGSQYTLDLLNKFQ
LNHDELCQVFDYCRQQDILPLCTPWDLVSAHVLDEYGLEAFKVASADFTN
YEMLETLAKTGKPLLCSTGMSSEAEIKGSVDLLRRLGAPFALLHCNSTYP
APFKDVNLNYLPHLKQLGGTVVGYSGHERGFSVPLAAVALGARIVEKHFT
VDRSMEGNDHKVSLLPEEFAEMVRQIRNIEEALGQGGERSLTQGEMINRE
NLAKSLVINCDLSQGQLIRRSMITVKSPGQGLQPNRIDELAGKVAQRDFK
AGDFFFETDITPKSVKKQHYVFSRPYGIPARYHDYRALIEGMKIDFVEFH
LSYHDLDVKLSDYFSDPLSIGYAVHSPELFAGDHILDLASHDADYQAHSI
AELKRTVAVAAELRQYFPATPKPVLVLNAGGWTPQNFLPVEARTKLYDKV
AKALDEIDLSTVQLAIQTMPPFPWHFGGQSHHNLFVDPDEIAAFCDKTGH
RICLDISHSMMACNYYQWDFNTFLQKVLPYTIHLHIVDAKGVDGEGVQIG
HGDVDFTLLRDQLNQFARGVQFIPEIWQGHKNKGEGFWSALAFLEKTSL
>NE0961 hypothetical protein
MTALTTDRRAQTRWSGRLLPSLGILLVTGGLVLLTWYTWLVLTPDTAPYR
YQQVTTGNASEYPELELDTWPDLTISQYDIHVEGTEQPVAQAWFGQRANQ
PQVLLNWKNQTREPLLALDQKASELSALAAAIDKHASRDALLLGWWDTSR
QLALLTGRDVLFHTPLHEPLIIPPEWQPHEQAIRAYENQQAGTPADPQEQ
ELFMRFAQSLVNPPANGLDDLRQLAGTRDTYLIVHVSDLYKLGLMYPDKF
GIAYKHYRMTGNLHGMISHLKTEMRTRGYYTYTLQSLSDELIRAFFLIDE
ASYDTLLAKLLPFTSQPSPVERTSPRLIYQQGGYWVYHLTAKAPAHNTLQ
SGKDSNETTDSTVSVDQVQ
>NE2256 GCN5-related N-acetyltransferase
MSSYEMFADKNIVAGELANLMASAGWGTEGDYDATAIEKSLSAYPMIAYC
RDSDGLLVGYISAFTDGAFSTFVGELVVRPTYQQRGIGSALLAMIVEKCR
GVPVYATPFQGTEKFFLDRGFRVPERPMSVVSMRNVT
>NE1514 Glycine cleavage T-protein (aminomethyl transferase)
MNSDWFTFLTHRNAHIEQNRVLHFGQPAAELAQAASGPVLIDLSHFGLIR
FSGEDAQNFLQGQLSCDVRSVDSTQASHGGYCTPKGRLLGSFLLWQDSDN
SYLMQLPAERVETITRRLKMFVLRAKVSIQDNTDDLIRIGIAGKNALLSL
QNMLPDTTISPAPLAVTSIPDGQIICHSENRFEIMTTSIQAPSLWEQLNK
QAHCAGAAIWDWLEIREGIPAIFNATQEQFIPQMINLDIIGGVSFKKGCY
PGQEIVARTEYLGKVKRRMYLAHLDADSCQNIAAGDSLFGTDTGDQACGM
IANAAPVPAGGVDVLAVIQTSSMEAGSIHWKTPNGPQLTILPLPYAIT
>NE0862 conserved hypothetical protein
MTSHCIQCGSSRIHKSRFRPGERTPANLLLSPYRCRDCKARFWGRNNDAC
LAAAAGVGGIFLLGTFIWVGFSLNDPMERSLSSQPQTALTGLSWLDSQSS
PHVSSTNLTLAEAIERGEKIDLQSIKSEDTSSLPDPSDNRFYTINLFLEK
ARKGNADAQYQLGILYLAGKGTLQDFSEASKWFILAAEQNHPLAQYELGL
LYQVGQGVEMDNEKSYMWFNLAAAAGIEQAIAARDKAMRSLSRTQLSSAQ
KAAREWLDSRNKLGK
>NE1151 possible sec-independent protein translocase protein TatC
MIPRPIPEPSPDLYTEEEVSKLVHEFYAKARKDAALGPIFEEHVIDWDAH
FVQMTNFWSAQLRGTSRFRGAPMPKHIALPELNETLFKRWLQLFRQTTLE
LGNPLLKQHADTVAEFIAGRLWMGYQMSHFPHREPADLNTSEV
>NE1622 Protein of unknown function DUF132
MPGLSVVLDTSVLVLGLAYPASIPGHIINAWRQSALNVVLSHYILDEMIR
VLPRLSRIQMTPAEIRDLANSFMFLADVVEPQGSQDSNLRDSGDQPVLLT
LITAQTDYLVTGDKDLLALARDYPIVTPAEFWSRHGE
>NE1600 TPR repeat
MKVILTIQLALLLSMAGCTQLPRTGDAIAHGSAASAGAAPATELTADSLF
DFLMGETALQRNMPDVAVESFIRLARETRNPRIAEHATDIALRTRHFGEA
KEAIDLWVALEPDSMHARQAAVALFVANGQLDNVRPHVEQLLKLEPETVD
KAFMQINKLLSHHSDREAVLKLVQQLAASYPDLPEAHFAVSQAAWSANEF
KLAAKAMNQALELRPEWEMAAVHQGQILQKIDKDKALSFYDQYLDRFPRA
NDIRIAYIRMLMEEREFDRGREQFQKLEQVNPSNPDIALAIGLLSAELDD
LGSAEKYFKRALQLGFEDTNTIHFNLGRIHEIAQHNAEAMDAYLRVTGGE
RYIAARVRYAFLLAKRDGIAAARRYLKTVQVENEQQRTQLLISEAQLLRD
SGEFRGAYDLLDAYLRKYPDQVELLYDRALMADKIGKLDVLEQDLRKLIE
LRPDNAHAYNALGYSLAERGLQLPEALALIQKAIELSPDDPYIMDSLGWV
YYRMGDLKKGVNYLKLAFDTRSDPEIAAHYGELLWMNGAKEDAEKIWQSA
LEEHPENELLLDTVKRFMK
>NE1983 conserved hypothetical protein
MKTIAVLLTALLLATGCGNKESTGETASDEPEIVMMDESTSVGELPSILK
EGLSNKEQDELKKQIMPILDDGLTPEQRFLNLRKDAEAGNAEAQNSLGSM
YFSGEAISRDAQGKVKDKDPETAAGWFFRAAEQGHAGAQFNLGLLYFSGE
GVTRDTAKAVELFTKSAEQGNIDAQNNLGVIYLMGEGVKQNTDKAIEWFE
KAAEQGNEEAIKNLEAVRASQQDSKEAADKQK
>NE2043 hypothetical protein
MTALTTDRRAQTRWSGRLLPSLGILLVTGGLVLLTWYTWLVLTPDTAPYR
YQQVTTGNASEYPELELDTWPDLTISQYDIHVEGTEQPVAQAWFGQRANQ
PQVLLNWKNQTREPLLALDQKASELSALAAAIDKHASRDALLLGWWDTSR
QLALLTGRDVLFHTPLHEPLIIPPEWQPHEQAIRAYENQQAGTPADPQEQ
ELFMRFAQSLVNPPANGLDDLRQLAGTRDTYLIVHVSDLYKLGLMYPDKF
GIAYKHYRMTGNLHGMISHLKTEMRTRGYYTYTLQSLSDELIRAFFLIDE
ASYDTLLAKLLPFTSQPSPVERTSPRLIYQQGGYWVYHLTAKAPAHNTLQ
SGKDSNETTDSTVSVDQVQ
>NE2214 AAA ATPase superfamily
MNDPESIFQRLDRLLARIEDTLPVRPQHADPAEYIALRWRKHGDTGYLQA
INHPHTITLNELLNIEEQKQTLDRNTLQFVSGLPANNVLLTGARGTGKSS
LIKALLNRYADRGLRMIEVDKLGLTDLPDIIEFIGQRPERFILYCDDLSF
EADEPGYKALKVVLDGSISTASDNVLIYATSNRRHLIPEFMHENLATRHV
DGEIHPGEATEEKISLSERFGLWLSFYPFDQEQYLEIVRHWLSQHGISRL
SGPARQEALRWALARGSRNGRVARQFARDWAGQQKLAKTEPVRVDKE
>NE2431 PIN (PilT N terminus) domain
MNVVDSSAWLSYFAGDANAPVFTGPVEQISQLLVPSITITEVFKNVLHQR
GEEAALVVVAHMEQGRVVPLDSELAMDAAKFGVLYKLPLADSIIFATAHK
YGATLWTQDNDFEGLLNVKYVPKSGI
>NE0283 possible S-adenosylmethionine-dependent methyltransferase
MSSHPPIRSYVLRQGYFSNAQRHAYESLLPRYGIPLTEEPVDLDSIFGRT
APGILEIGSGMGETTAEIARQHPEKDFIAIEVHAPGIGSLLGQIEKHRLT
NLRIIPHDAKLVLQQMFTSESLDGIHIFFPDPWPKARHHKRRLIQPDFVS
LLCDRLKPGGYLHIATDWEDYATHILHVLRSEERFVNTAVDYAARPAYRP
LTKFEQRGMKLGHTIRDIIFTRTA
>NE0199 hypothetical protein
MFPIKSRPLKIIIFWQLAFAILAAVLCGLLSGVNAAISGFLGAIISVIAG
AVYAILVSRHSGYSASGTLRTALRAETVKIFIIIMSLWAVFAIFEGLQPV
MFIGSFVVAVLISSMAVFVPEKLNK
>NE2246 hypothetical protein
MGTSFRQYFKIVPALTEELKREAYRVRHSVYCEDLQFESSRSDGFEIDEY
DAHSLHLLIRSINNDTFIGCTRIIRPSSNSNDRRLPFEKTCAQTLDRSII
DPSRLPADKIGEVSRLAVIAAFRRRKGEKNHPINISEEDFSTGPMMRFPY
IPLSLYIGTIELARIHDIRVLFMLTEERLASHFSRLGAQLEPIGAPVEHR
GLRFPSMVEINSIISNMRPIFRPLYQAIAEDIKAELEKKNH
>NE2458 putative GTP-binding protein
MTHPLFRHAEFYTTVNRLQDLPQTAGVEVAFAGRSNAGKSSAINTLVGRE
RFAFVSKTPGRTQHINFFQLGEERFMVDLPGYGYAQVPLAIRQHWGHLLS
SYLQTRQSLYGMILIMDIRHPLTKLDLQMLDWFRQTKKPVHVLLTKADKL
SKSRALVALNEVRQFLTVNYPHCTVQTFSSLKVAGVEEASQLLQNWFDTG
HASVQQENGEISEQKKTPAKGD
>NE1535 DNA polymerase beta-like domain
MRLKYCVRDAMMSRPDAEENPRGGVGSAGFRRKHMNRDRVLAVLRHSKPM
LASRYGVRRLALFGSTARDDSDVDVLMVFDGVASAARYFGVQFHLEDALG
CSVDLVSEKALRPELRPFIEKEAVYV
>NE0477 ATPase components of ABC transporters with duplicated ATPase domains
MIRISELTLQRGPLRLLENADLTLHPGHKVGLIGANGAGKSSLFALLRGE
LHPDAGDCVLPMNWRIAHMRQEIDAPDCNAIDYVLDGDGYLRDIQQQLVK
AEQRQDGVELGRLYSELDNADAYTSDARARKLLAGLGFLEEQMGNQVNSF
SGGWRMRLNLAQALMCPSDLLLLDEPTNHLDLDAILWLESWLQSYPGTLL
LISHDRDFLDAVVGHIVHIDQQKLTLYCGGYTAFERARAARIMQQQQAHE
KQRAQRAHMEDFIRRFKAKASKARQAQSRVKALERMEELAPAHFDSPFDF
IFRVADKVSTPLLNLNEAELGYNVGQPILSNVKLQLVPGARIALLGPNGA
GKSTLIKSIVDDLPLLAGQLTRSENLAIGYFAQHQLDSLDPKASPLLHFQ
RIAQDEREQVLRNFLGGFNFRGKRCDEPVLNFSGGEKARLALALIAWGKP
NLLLLDEPTNHLDLEMRQALSMALQDFSGALLLVSHDRHLIKSTMDELYL
VADGRVREFAGDLEDYSKWLNDYRLRQRPDNDATSPDKVDRRAQRQATAA
LRKQLAPLRKHTDTLEKQLDNIQKELQALETILADNNLYEQPQKDQLKQY
LSQQTNLLQKQVKLEETWLNNLEELELLQSELGEDV
>NE1590 hypothetical protein
MTIQWNHYGSISIGKLRNAMPTTLTLKNIPDDVYERLKVAAEMHRRSLNS
EIIVCLETVLMPTRISPGERLERARQLRAGLNSEKFQACDIDVMKRQGRP
>NE0506 TPR repeat
MLAACGILSEKTVDHSKWSASKFYVEAKNELNEGNYAAAVKLFEALEARY
PYGRYAQQAQLEIAYAYYKDQEHASAIAAAERFIQLYPHHQNIDYAYYIK
GLASFNDDQGLMGYITHKIIKQDMSERDAKASRESFESLKQLVTRYPDSK
YTPDALQRMAYLVNALARGEIHVAQYYMKRKAYVAAIKRAQFILEEYPQT
PATEDALYIMAVAYGELGMTDLREDVEKVIRKNFPESIYLTDSGAVKGKS
WWEIW
>NE2159 TPR repeat
MSMKILKASVFVIIVFFLMPLPVVAAKKGIYCGELKGSHYGPFDYMDRFN
HSEQLKIVEDFHFTSDVEDLIRGSTSSTPAKDLNYTLHAWPNHHRALVSL
FKYSIKEKSTRIKGLKYPVECYFDRAIRMNMKDVQVRSIYSAFLSHRGRN
KEALEQLEVAANLEPDNATILYNLGLLYFKQKNYEKASHYAEKAYALDYP
LPGLRNKLIQAGKWRGSASGRSGK
>NE1263 conserved hypothetical protein
MTEPLLIAKSGNTELAILPSMANRHGLIAGATGTGKTITLQSLAEQFSRA
GVPVFMADVKGDLSGMCQPGGGNARVEARVAELGLKGFGYAACPVVFWDV
FGQKGHPVRATVSDMGPLLLARMLNLNDTQSGVLTAVFHIADDNGWLLLD
LKDLRAMLQHAAENASSYSVEYGNISTASVGAIQRRLLQLEHEGGDQLFG
EPALNFDDLMQIAEDGRGIINILAAETLYNSPRAYATLLLWLLSELFENL
PEVGDVEKPRLVFFFDEAHLLFNDAPAALLSRIEQVVRLIRSKGVGVYFV
SQNPLDIPDAVLGQLGNRVQHALRAFTPRDQKAVRAAAETFRPNPQIDVE
AAITEMGVGEALISLLDEKGRPHPVERGLIVPPGSRVGPATEAERQQVIR
GSLLYGHYEQLVDRESAYEILKARAATSPEQAPAEADRGFNWGELLGGST
GPRGGRREGIVETITRSTARTIGSQLGRQVIRGVLGSIFGGGRRR
>NE0823 TPR repeat
MKTRAFNSLVVGLLSAVVSVSCSTIQSDENAAGHHEKHEVSTVEHRSREL
SALADQPLSGDQLASRLQNLGTHSFPVSTQHEWAQLFINQGLSLAYGFNH
AEAGRAFQEAAQLDPGLAMAYWGQALVLGPNVNALMDPADEPRALELVKQ
AESLMVSASPREQALIRALKKRYSGADEDRKANDKAYADAMREVYRSFPD
DADIAVLYVESMMDLRPWDYWMRDGHPHEGTDEIVAVTEDVLRRHPVHPG
ALHMYIHLMEPTNTPERAEHAADTLMTLMPGAGHMVHMSSHIFQRVGRYA
DSVKSNQLAIAADEHYMGQSHAPGLYPMAYYPHNIHFLWFAATASGQRAL
ALESAQKAASKVDDALLREMPFTAIFRVVPYWALARFGQWQAILAEPAPP
AFNAFLKGSWHYVRGLAYVATKQSQRAERELQQLRRIVKNRAALDNPLLS
RNTAYDILRIGPEVLAGEIAAARGRYESAVAHLERAVRYEDALVYTEPAE
WHYPPRLALGAILLKAGRPDEAETVYWEDLKRNRDNGWALFGLQQALIAQ
KKEAEAKVIEARFKKAWEHADITLTASRFGR
>NE0932 putative isomerase
MPYVNIRLAGTLTREQKQQIATEITDTLERIAHKPKSYTYIAFDELPHES
WAIGGKLLGDDK
>NE1373 conserved hypothetical protein
MKDHSAFLDTNILLYLLSEDETKSVRAENTIAAGGFISVQVLNEFASVAR
RKLNMSFAEIQEFLSHIRMICSVVPVTVEVHDQGLRIAEHYGFSIYDALI
IAAALSADCTILYSEDMQNSQIIDDRLLIQNPFA
>NE0145 Conserved hypothetical protein 48
MGEKPYRARQLLRWVHQSGKTDFMEMSDLAKGFRHKLMECAVVQLPEIVS
DHTAGDGTRKWLLSTGAGNAVEMVFIPEPSRGTLCVSSQVGCALACSFCS
TGRQGFNRNLSVAEIIGQLWWANRLLEAGSHDPFPLDTTRVQTDKPETRR
PVTNVVMMGMGEPLANFENLVTALDLMLSDDAYGLSRRRVTVSTSGLVPA
LDRLRERCPVALAVSLHAPNDALRDQLVPINKKYPIRDLLAACERYLPAA
PRDFITFEYVMLKGVNDSVALARELVQLVRNVPCKLNLIPFNAFSGSGYE
RSGAEAIGNFRDVLMQAGIVTTVRKTRGDDIAAACGQLAGQVRDKTRRTS
GCGTGQPAVAR
>NE2380 STAS domain
MHRCGESSVAGTDARIRLEGNRLFVGGPVTYDNVVEVIRTGDAAIKADDM
LIDLAGVTWVDSSAVSMLLEWMRTAQTYDRRIEFINLPSNLADLIELYDV
GSLIPTDKPAESV
>NE2294 Patatin
MASQGSQPSRLGLVLTGGGARAAYQIGVLRAIAEMLPPHARSPFPVVCGT
SAGAINAAGLAMAATHFSTGIKRLEAVWGNLHTGQIYRSDLIGVLHNALR
YFGSIVSSRMAGKPVSLLDNTPLRQLLACHLPFRGIRRSIHAGALHALGI
TAWGYASGQSVTFYQASPSVIPWKRAQRIGVPSRIGIQHLVASASIPFIF
PAIRVNREYFGDGSMRQLAPLSPALHLGADRLLVIGVNEKKETDCERTKV
TGYPPLAQIAGHVMNSIFVDSLDVDLERLQRINQTLSLIPGMQAGNGPAL
RMVDCMVITPSERIDELTQAYANSLPRPIRYFFRAAGAMGPKGSAMLSYV
LFEAPFCQALIDLGYRDTLRRQDELFAFISERRE
>NE0191 Domain of unknown function DUF227
MDRLQLLEYWLKALYPDQPCTLSPASADASFRRYFRASLPGKTLIVMDAP
PQQEDCRPFLHAASIFSRASVHVPAIVAQDLNQGFLLLSDLGTTTYLQAL
TAAPENANRLYQDAIDALIKIQCASQKNIFPEYDRILLSRELELFPDWYM
TRHLHAPPDDDQKNTLKTVFNLILANNLAQPQVFVHRDYHSRNLMVSTPN
PGIIDFQDAVLGPITYDLVSLFKDAYIQWEEAQILDWMIRYWEKARHAGL
PVATDFSIFYRDFEWMGVQRHLKVLGIFARLCYRDNKPTYLQDMPAIMQY
LRQTCERYSELHPLSRLLDRLEDRQAETGYTF
>NE2095 conserved hypothetical protein
MPIFNFRDEEALGKVASVDTTNVIVDVENVAHLKRLQVNHLAVLQSSRPG
QHLIGLITQVTRKRGIENISDDGIVDQNSELNLCRIALIGTMLDRDGSKE
NVFRRTIESVPEIDANCFSLEGENLTGFMRTLSSVSAEGNALTLGKYTLD
ENAVAYLNGNKFFQRHAFIGGSTGSGKSWTTAKIIEQMSGLSTANAIVFD
LHGEYSPLTGPGIQHFKVAGPADVEAKRTISDDVLYLPYWLLSYEALVSM
FVDRSDQNAPNQAMIIAREINQAKRKYLEDNGQQALLKHFTVDSPVPFDL
DFLMERLNSINVEMVPGAKAGTEKQGDFFGKLARMISRLENKISDRRLGF
MFNGGGDILDFAWLEKFANAALGSTGENGKAGIKIINFSEVPSDVLPLIV
SLVARVTFSVQQWTPSELRHPIALLCDEAHLYMPQRNMADSADDISLDIF
ERIAKEGRKYGVSLVVISQRPSEVNKTLLSQCSNFVSMRLTNAEDQGVIK
RLLPDSLGGFSDILPTLDTGEALVVGDASLLPSRIRIDEPQNKPNSGTVN
FWDEWQKPVKDNRLLIAVDNWRKQNIQ
>NE1361 putative similar to abortive phage resistance protein
MSFRDPVTFSMIASRERQHGDRLPKLGKYNIRILPIAAIYGGNASGKTNF
FKALNFAKMLVVKGTQPGSPLPVEGFRLDNTSIDKPSRFAFELLIDETVY
EFSFSVNRKTIVDEKLVVVTSTSERELYVRSGGQIKFNEALKKDQFLQFA
FKGTRDNQLFLTNSVSQKVDNFQPVYDWFKDSLDLVAPDSRFELFEQFLD
DGHPLYATMNEMLPQLDTGIAHLGGEEIPFENIPLPEPMKMLLQEDVKEG
MTIRLMSDKNERFVFTRKNGELVAKKLVTYHPKADGTEAKFEIRQESDGS
QRVIDLLPAFLELAALGSKKVFVIDEVDRSLHTLLTHRLLEAYLASCSAN
TRSQLLLTTHDVMLMDQQLLRRDEMWVAERKPAGVSTLISFSEFKDVRYD
KDIRKSYLQGRLGGTPRILLSSNFAEGDEARATEEVQ
>NE0078 putative ATP-binding protein
MSVGADRKKTGTSLTGKVVAAYGRHFEVEVAGGTIYSCVVRGKKKGVVCG
DEVEILPATGDQGIIETTLPRTSLFYRSEIFREKLIAANATQLVFVLAVV
PSCNLELLDRCLVAAESQGIRPLILLNKIDLIGQDEQRQAVAHHLMFYRE
LGYPVLEISAKISVQPLIPLLSGQTSLLAGQSGVGKSTLLNALVPRAQQA
TAEISDALDSGRHTTTHVRLFHFDADSSIIDSPGFQEFGLQQLDEASLAR
GFIEFRPFLGQCKFRDCRHIAEPGCKLLLAAQEGMLNSRRIACYHKLVKG
LKKSHPWMETNKRV
>NE2546 Haloacid dehalogenase/epoxide hydrolase family
MIEAVLFDFDGTLADTAPDLGRALNRQRTARNQPPLPIELIRTEASAGAR
GLLSLGFDLKPGDPEYQAMREEFLSFYTEQLCQDTCLFPGITELLEQLDS
RAIPWGIVTNKPAKFTGPLMHLLGLHHRAACIISGDDTPYSKPHPEPLLT
ACRQINMAPDHCIYLGDDIRDVQASLAAGVKPIVALYGYLGNCAPPETWG
AAELIDHPRDLLHHI
>NE0230 hypothetical protein
MGLRLRRNSAIESLTIRVKAVISESVKGFGFFAIGGHESVAWLARPLPQI
LSAVWRPVGLQLHIFNPPKIYPAVTTITTSGSVAVITKSALGSVTGCSSL
SLT
>NE2470 NUDIX hydrolase:Conserved hypothetical protein 52
MDFDVLEKTVCFQGFFRLERYRLRHRKFNGEWGRPITRELFERGHAAAVL
PYDPQTDEVLLIEQFRAGAISAPGGPWLLEIVAGVIEANETPEQVVARES
MEEANCQIGSLIPLYDYLVSPGGTTERIVLFCGRVDMQTIEAGAVYGNHG
EDEDIKVHVMPLNEAIRLLSTGRINSASAIIALQWLALNRDSVRRRWLPE
>NE1726 putative integral membrane transmembrane protein
MDNIIQGIAIYALPVIFAITLHEAAHGYVAKYFGDLTAEMAGRITLNPLR
HIDPVGTILLPLMMFVTSKLLMGSGLLFGWAKPVPVNFGRLRQPKKDMLW
VAAAGPGANLLMAFFWAAIIKLGMNMPDSIYLKPMVLMGIAGIEINVVLM
VLNLLPLPPLDGGRIAVSLLPSRLAWKFAQIEPYGFIILLVLFISGVLSV
VLWPLIIFTKQMIVTLFGLYI
>NE0240 hypothetical protein
MTGAVRRNVRKRLNKNQKQLRQNWEGYHLLEFAKTIRPVKQDTKRRKELD
DHLVSNGIGGFETLQKLPADNKTAFTTEKSDGVFESLAEFQDIPTFVCFL
NFLEFCRNSIPRRPLDVFDGNDTLSIEIDGKRKAVTLDYAIDDFKRHAKT
VEERLKLESSQSSTGQADPIQPRLTIPQARENNPNYFANAVLDVIGRDEQ
KSRLKAFLECDKNVAWFQLAGVAGQGKSRLAFDLIKVAEELGFRAGFLTE
NDIKFFKDQWKDWQPDKPYLLIFDYVIGREEQIKPIFQTLISNQDGYCHN
IRILLVERQRWDQGNVIETQDQINKDAPRQLIAISNKAPWFLKLCEEGDS
EGERLAPFRFDNGVEELKELGQDNLASIVKQLLSGKTLTLSDDVLEETLE
RIDNTGRPLYAYLLAQQLSESEEGFRSWTKIDLLNGQLKRDKRRWEQAFN
DKAPTWGDSHEAMKLAVLATIVRQINFEDEIIKSNFGHIDSSLGKEALAI
TNSYLVNNDNRPHKIHALEPDLLGEWFVLYCFYQGLNFEELLNIAWEYSP
NDTAMFLIRVMQDFIDLTKTYNDWNLTEKLLAHKPPHENHYLVLARVAVF
ISYELGRRNLTIPHNIIIALEHAANLSNVIAMDYLGFFYQQGLGVVRNPE
KAIYWFQMAVNKKSDTAMVNLGICYQKGEGVKQNLNAAFKLFQRAVKLDN
STAMFYLGLCYQRSEGVKEDLNEAFALYQQAADKGNSTATAYLGLCYQYE
VGVKQDLDKAISQYQRAVDEGNSLAMVFLGRCYQYGEGVNQNINKAIALY
QKATDKGDSTAMTCLALCYQDGKGVDQDWNKAINLYQQAVKKNDCTAMYY
LGACYENGYGVKQNRSSAIELYRMAANQGNSNAMVNLGFYYRNGIGVKQN
RKEAVKLFQRAAKVGDYRAMCNLGVCYENGEGVDQDWNKAISLYQQATKA
GEIRAISNIQNILLRNFLGEGNYKSRKTNGCTRNKLSHLVEKLAFDPPIL
GGDWKPLLTEEIKACIDKVIVPFEILDVVVEI
>NE2454 MFS family transporter
MSRVEIRASASLAGVYALRMFGLFIILPVFTFFAKELPGGDNYTLIGIAL
GAYGLTQAILQIPFGWLSDRIGRKPVIYLGLVLFALGSFVAAGATDIYWV
ILGRIIQGAGAISAAVMALAADLTREEHRTKAMAMIGMSIGMVFALSLVI
APLLDQWVGVPGIFVITGCLAILAIGVVHKVVPDPVVSRFHSDTEVTAAR
FSSVIRDVQLLRLNYGIFALHAVLMALWLSIPLSLRETGLAAADHWKVYF
PVMVGSILLIVPAIIYAEKKAQLKRVFIAAIALLLLAQILLAIFNASFWG
LVFALLLFFAAFNLLEASLPSLISKIAPVGAKGTAIGIYSSTQFLGAFVG
AGLGGYLFQHFGFYALAAMCSGLLILWLVLAVTMQAPAAVRSRMYQVRKM
DSDEANGLSRELAALPGVYEALVLANEQVAYLKVDLKGFDEEKVVQLLEG
NI
>NE2451 putative death on curing protein
MIEPIWIDEQVALAIHERLISLHSGASGVRDKELLKSALARPLNLLAYDQ
QADVIHLAAAYTAGILQNHPFVDRNKRTGFVVGVLFLELNGYRFTAAEED
SAQAVIALAAGSLDEARFKLFLADNSIPV
>NE1090 conserved plasmid protein
MAEADLDNIIDYIAQDNPTRTEEFGQELRDKILPLTQNPKMGRTGRPGSS
AFVRELVAHRNYIVFYRVLDEACTVEILRVKHAAQQSS
>NE2428 hypothetical protein
MSLHLALLMRGGDPHCILDFGAGNGELCKLIALQFCYEPTPSLMAEAKEN
LADLPQISFCSDLEKISDGSVELIFCLEVFEHLPEKETKDALGQFDRLLT
DNGNAVIGVPVEIGIPALYKGIFRMSRRFNTFDASIKNVLLAALSFPPKD
RPVSEITPGFAFHHEHMGFDYRKLQALLHAQFGLQQVTTSPFSIFGPWLN
PEVNFLIQKANPAVNADAAR
>NE1513 hypothetical protein
MNELSIQLQPSSRLAVLLSLAHCTAAGVFWPLALPVVVKLIITLLLAGSL
YYYLRRYAWLTSPRSIVALHLTGRNSCRMKTHADEYIDTVVDTSTFVASY
MTVLYLQKERTRRYYTVVILPDSIDANSFRRLRVWLRWKWQDSSSDGRKR
G
>NE0804 DAG-kinase catalytic domain (presumed)
MDLIPARIGLIYNPLGGWFRKHTARMQSLLATLPEIRQIQATDQIEFERA
VTVVVEAKIGWLIVVGGDGTLQGVMSCLFECLPPDRWPEITIVPAGTTNM
TALDLGMNGQAEQILSRIRQHLQRPGDMKQVRRPVLRIEQTGMRNVYGMM
LGLGLIARGVKFSRSQIKQLGMTGNIFTVVIVLRSLIGMFLGRPQAEWAP
VRVAQIDETGVLSEKVYLFALISALEKLLLGIRPYWGQEPAPLHATFIGQ
HSRRFWRAIWPLIAGRGHHLQKEDGYTSYNTASIELWLDDEYIVDGELYY
ASSRNGPLKITADGPILFRIL
>NE2495 ABC transporter
MKLLRLKITDSAGFRSLPCGFEHHFRTEWSLQEELAQPEGFGPFVCAGPN
GSGKSNLLEALAAIFFQLEVQRVRRNFLPDIFQYDPDDNPEGIQEHEGHP
NAYELEYLIKLPKEHRSSGSPEFAHVVVIKERDKSPWLRWENNEAFPVEG
FAFSTLTDEERDLLLPQYVLGYSSGENEILSLPFFKMRFIQFDEYSNALA
RQLPYPGRPETRLAYLDSSFSQAILLCNLLFQDAATLQPFREDVGIEALQ
EFRIVLRRSVPVTHQQVAAFTSGEYVLPTETQDGRFTDTNVVYLDPETGD
YRLNLLQGLEANERTERTAIVEKLKRCATLHFHDEATDTLVLDYRVNEAT
KQAFRANFDDPAGPALALFQALQVLLTLNLYSVSDTIKADLYRSTSHYVS
ETVPTLASDQRIMRFKNFYFTKQGVKKPMLLKDLSDGEYQMLHSLGLCLL
FRKTNSLFLLDEPETHFNPHWRASFITRLRQCLPDVEGVGQEMLVTTHTP
FLISDSKPDKVLVFAKDKTSGEVSISKPNYNTLGASINKITMNTFGKRET
IGGYAQVLLDDLRKRFEEGHEDRETLITEINQQLGDSVEKLLLIKTILDG
NQPADEEAQD
>NE0893 conserved hypothetical protein
MSITLITAAPGAGKTIFAVWNIIKPAVEADRVVYTAGIPELKLPAISLSY
SQVKRWADRELVEVENPSGIPIPDDEKPSRLQNITEGALIVIDEVQYLWP
ASGSREPGEDIKYLTKHRHHGLEFVLITQAPQLIHKNVLAVVDKHIHMLS
DWHGRKRYEWPEYCATVRATSSKLKAVSQRYELPKEAYGLYRSASMHITQ
KRRKPLMAYVVPIAFFALIYTGFTFKDRFLDSKSSESPKVVKDEQKTDDP
HLSQQKLTSAPVTTVTTVARPVTLALVSDQIDWSKVGACVATQAKCICYG
KSAERLVVPPETCRKAVSSGWPGQETKV
>NE2346 possible hydrolase
MMERITSRDGTPIACWRQGSGPPLLLVHGTTGDHSTWGSVLAGLQQHHTV
WTLDRRGRGHSGDAANYSLEHESEDIAAVIDAIGSSVNLLDHSFGGLCAL
EASLLTANINKVVLYEPAISLAGSDWSATFEARMQALLEKNAREETLLLF
FRDLLNTPNPELVALQAGSNWAIRLAAAHTILRELQGIDRYQFTPQRFQT
LKSPVLLLVGGNSHPRRFMTAERLQQGLPDCRVGIIAGQQHSAMRTAPDL
FVHHVLEFLQSAD
>NE0067 conserved hypothetical protein
MSISRAPADFDRSISPPENRGEHETGRTLHIATYNIHKGLSFFNQRLILH
ELRDQLHGLDVDVVFLQEVVGEHALHATRFRDWPRNTQYEFLADSMWPDF
AYGKNAVYGHGHHGNAILSRFSIVNWENEDISAHRFESRGLLHCELAIPG
WKDTLHCICVHLGLFRRGRSQQLEAIEKRIRQLVSPDAPLVVAGDFNDWR
GAANPLLASRLNLVEVFQHTHGKAARSFPSVLPLLRLDRIYIRGFQVKNA
QILHNRPWSRISDHAVLSANIMRT
>NE2566 Domain of unknown function DUF81
MEIQWLAILPGVFTGLVLGLTGSGGAIIAVPLLVFSLHTTIAEAAPVALL
SIAVSAAVATCNAFMQGIVRYRAAALIASTGMLVAPAGIWIARQLPDLLL
TVVFSAVLAFAAGYMYRQGRRSAQPAPPEEAVYPPCQLSLESGRLIWTIP
CAKGLLFSGVATGFLSGLLGVGGGFITIPALRKVSNLPMQSLTATALAIT
TLISITGVVSATSMGFMNWPLALPFTVGTVIGTLTGRRYAHRFDEAKLQY
GFAILAWCISLGMIVKVVYSIDFSALS
>NE1064 PIN (PilT N terminus) domain
MIVLDTNVLSEILRPVPDTQVLVWLAAQPRSVLFTTTVTRAELFYGVRLL
PDGQRQTALLDAIQSIFDQDLAGHVLNFDSTAADTYAKIAASRKAVGKPI
SQFDAMIAAMAKSKGASLATRNLKDFVDCGIDLVNPWSTSYLK
>NE0821 conserved hypothetical protein
MLVVFEMRSLFAQLTLTDVHQDIARNIVSLRQSQNLFDDLTDDPAGWLLA
QKVEAEIKPPPYRSYTPIIDRPFEDAEWFNAIIWPFKYWQSSRFSDGTHG
IWYGSESVETTVYESAYHWYRGLLSDAGYEHEAVVAERKVYSVACSAALL
DFRKITEEYTDLLHPSDYTLTQSVGARIHREGHPGLLIQSVRRSSGENMA
IFNPGVLSNPRHNCQLTYWLEGNQIKVEKHPGTVWITMDIATFG
>NE0129 putative lipoprotein
MMISRAIAIVVIASASMVMVSSCSVARHQESVGEYVDGSIITTEVKAKLA
NDPGTSAANINVKTIEGGEVQLSGFTKSQAEKNRAGELARTVKGVTRVHN
NLVVKP
>NE1526 hypothetical protein
MKLFLNSFFFILLTAIISCTPLSTSNPKASVINSVVFVSDSGQVVRAIYR
DDDTVTLTFPNNRIEMLNLAVSASGARYVAGMNEWWEHQGEATYSVNDER
VFTGRLQRQPAN
>NE1525 putative plasmid stabilization protein ParE
MAEYRLSPAAQRDLDGIFNYTFQQWGAAQAVRYIDILEAACTELVETSSQ
GQDCSYIRPGYRRRHVERHITTE
>NE2518 Patatin
MMESKNEPGASALLRVLTLDGGGAKGFYTLGVLKEIEAMVGCPLHQKFDL
VFGTSTGAIIASLIALGHSVDSILELYRKHVPTVMSQKTAPARSQALKKL
ASEVFGDATFSDVKTGIGIVTAKWLTERPMIFKGSVAQAHGQVGTFVPGF
GVSIADAVKASCSAYPFFERTVVRTSMGEDIELIDGGYCANNPTLYAIAD
AVQALRSDRKDIRLVSVGVGIYPDPKPSLLMWLAKKYLVSVQLLQKTLEI
NTQSMDQLRQILFPDLLTIRINDSYVTPEMATDLLEHDLKKLGILFQRGR
ESFASREKQLREYLI
>NE1265 GCN5-related N-acetyltransferase
MLIRSAKPEDAEFIGSIRVAAWQAAYRGFMPDTYLASLDPGANLDELRAA
LRAENPPFTLRIAETEGQPIAFSILGKPRYNADQSIVELWALNVHPTHWR
KGAGQQLVRQVLLDAKEQKFVSVELWCIQGNLAAQRLYEICGFVPNSQVR
TTSSLTGYPLHELAYTYAL
>NE0587 conserved hypothetical protein
MHTDMKSVGTVSMQEAPLLEIAVVGHTNTGKTSLIRTLLRSTSFGRVDDA
AGTTRHVERATIFAGSEAVLNLHDTPGIEDVYALQDKLHLIATRNKRSTQ
SELLEKFVAATPLNDPLEQEAKVIRQVLRSDVLLYVIDVREPVLEKYLIE
IEILGKAMKPMIPIFNFTAAHRAELDLWRKKLAAFNIYASLELDTVAFAF
EAEKRLYQKIQSLLEVHYTRLQRLIDHRARVWNQLCMSAARRIAGLIITT
ACYREHTGDERSSAGDTSSAAIRLQDFIRQAEQHCLVDLLKMFNFTDKDI
ELQKIPVQNGYWQLDLFAPGVLKAYGLDIGSAALKGAAAGAGIDLMTGGL
SLGVASMLGALAGTGWSTFRRYGKEIQAKIRGTRWLCVDDSTLQLLYLRQ
RQLLDKLMNRGHAACHTDQVSQQPERGELPDGWQQIIDMLRQNPAWGRSP
GLHTDDTRQYSSIEKRLIDALLKNPAL
>NE2285 conserved hypothetical protein
MTTQPSKQLETFENPVQTRDYRIHMEIPEFTCLCPKTGQPDFARLTLDYI
PDKKCIELKSLKLYIWSYRDEGAFHEAVTNRILDDLVAAMKPRFIRLTSK
FYVRGGIFTNVVAEHRKKGWQPQPPVLLEVFEQQFNTHG
>NE0222 ExsB protein
MKKAVVLLSGGMDSATTLAIARQSGFACYALSIDYGQRHVAELAAAARIG
QSLQVSDHQFLKLDLAVLASSVLTDISATVPLHGTSTGIPVTYVPARNTI
MLALALAWAEVLGSHDIFIGVTAVDYSGYPDCRRDYIDAFEKMANLATKA
GREGMVLTVHAPLIDLPKREIIQCGMELGIDYGLTVSCYQADEAGYACGQ
CDACHIRRAGFEAADIPDPTCYRNKQIS
>NE1049 Ankyrin-repeat
MKLTENMSFPVRLKQSCLAGVTALFFLLQIPFAHADADKDADFLKAALTG
DTSGVENMLNEGIQTDLQSPEGFTALSVAAQTGHKEVVKLLLNRKATVDL
ANVQGGTPLLLASKNGHQEIVDLLLAKGANPNLQDKNGLAPLMLAAAKGN
TGIVRSLLEHQAQPDLQNNAKATALHMAATNDYADIIDMLLAKGASVDLQ
DANGASALILASLSGHLSIVRKLLAHGAQPDLKATNDFTALILSAQNGQN
PVIEVLLEKGVHIDFQNKDGMTALMSAVLNENIDTVKLLLEKGADTKLKN
TSGKTALDIAKLPAIIELLKAAKS
>NE0657 Uncharacterised P-loop hydrolase UPF0079
MHSSHVVKLDSEAATLALGEQLATLFHPGLTVFLYGDLGAGKTTLARGIL
KGLGHHGKVRSPTYNLVEIYKLSRLYLYHFDFYRFNDSLEWEEAGFREYF
NQDSICLVEWPEKAGEFLHAADLEIRISYSGTRRIAEFSAATEAGEQCLS
HWQKRVSD
>NE0063 ATPase component ABC-type (unclassified) transport system
MSELRAENLKKSYQSRTVVTDVSFSVRSGEVVGLLGPNGAGKTTCFYMVV
GLVPLNGGEIFLDEHNLSRLPIHQRARLGLSYLPQEASVFRRLSVEENVL
AVLELQQLQKDEIQRYLDELLHDLHISHLRESSGMSLSGGERRRVEIARA
LASRPRFILLDEPFAGVDPIAVMDIQRVISFLKSRGIGVLITDHNVRETL
RICDRAYIISGGTVLANGAPAEIITDERVREVYLGENFRL
>NE0456 Esterase/lipase/thioesterase family active site
MKNSAAIAWKFLSVPAQPGKMKSGRGWRQLAYTDWGNPKNEHVVVCAHGL
TRNCRDFDFLAAALEQDFRVICVDMAGRGRSDWLKEAEDYNSAATYVSDM
EHVLEHVYRQNDSDSFRIYWVGVSMGGLIGMLLAARQRPAVSYRFRTLVM
SDIGPHVSSGILSLFATTIGKDPRFRSLSELESHMRATALPYSPLTDTQW
HHLALYSAREYEDGTIGYRYDPAISSGFRPDRIKDIDLWAYWNRLDLPVL
VLRGEKSGVLTPETAGEMQLHRSNVQITELAGIGHAPMLMDADQINLVRD
YLLKIRNRTE
>NE0029 Short-chain dehydrogenase/reductase (SDR) superfamily
MLADRVILVTGAGQGIGRAAALAYAEQGATVILHGRKTEKLEQVYDEIEA
LGRASAIILPFDYEQATEAGITELVEAIASQLGRLDGILHNAAWTYGPMP
LAFHTSAHWQTIIQVNLLIPAMLTRACFPLLNASPDASVIMTGDTHGQTP
AAYWGAFAVAKAGVEVLVKIQAEEWEIYPNLRINTLIPGAVDTPQRTKTH
PGSNNRILPKPTDLMETYLFLMGPNSAGITGKTFDCQKEQSA
>NE1109 conserved hypothetical protein
MIKTFATKETAALFANEKIRRLPPEILRVARRKMAQLHRVSSIEELRIPP
GNRLEKLSGNRNEQWSIRINDQWRICFRFEAGDVFDVEITDYH
>NE2144 DUF209
MKKILGIYDAPPLHWVGDGFPVRSLFSYSNHGKLLSPFLLLDYAGPVDFA
PAERPRGVGQHPHRGFETVTIVYHGEVAHRDSTGQGGVIGPGDVQWMTAG
AGILHEEFHSESFTRSGGQLEMVQLWVNLPAKDKMIAPHYQAILSADIPV
VALPDDAGSIRVIAGCYQDHTGPARTYTPMNVWDVRLKRGKVTELPLPEG
WNTALAVLHGKISVNGSPLVQAAQLVSLDRAGDTVSLDVREDATVLLLSG
EPIDEPVVGYGPFVMNSQTEIDQAIADFNSGHFGQLSR
>NE1724 PHP domain N-terminal region:PHP domain C-terminal region
MMRSHVITHFSMPNIDLHSHSTISDGMLSPSRLLAHAAVRGVNVLALTDH
DDIAGLSEASRSAQQENITLIRGVEISVTWHGRTLHILGLGINPEHPPLT
EGLKKIRDGRMDRARAIAAQLDKFGIHGSFEGASAQAGISRLIGRTHFAR
FLVSQGYAKNVKSVFKKYLVKGKPGHVSHVWVSLDEAIGWIRGSGGQAVI
AHPARYKLSNDLLEQLLCEFRELGGAGIEVVSSSHTPEQTRQFAALATRM
NLYASCGSDYHGPGESYFDLGRLPALPPECTPIWNEWEIPDYAETGATTL
NEQSEQTGLQSPGKSV
>NE0499 Glycosyl transferase, family 2
MITVSIVSHGQSTLVEQLLADLVRLDMSMVTEVLVTLNIPEDISSKPGDY
PYPVRILRNTAARGFGANHNAAFRQAEGEWFCVMNPDIRLINNPFPILIE
EGAYDSAGVIAPMVVTPSGMIEDSVRCFPVLTSLAAKLFGHGDGRYLFAA
GDEAFAADWVAGMFMLFRTEDFRAVGGFDEGFFLYYEDVDICARLWKSGR
SVLACPKASVIHDARRSSRRNLRYMKWHALSLIRYFWKHWGRLPQTPEQ
>NE0553 PIN (PilT N terminus) domain
MSYLIDTNIIAEVRKGKRCDPHVAKWWGQTSDDDLFLSVLVTGEIRKGIE
LARLRAPTKAARLEQWLDALIAGFSGRILPVDQAVADQWGCLNSPNPRPT
VDSLLAATAQVYRLTLVTRNVADMPKIGISILDPFTFE
>NE2162 Esterase/lipase/thioesterase family active site
MENPPAEPFFLDASPGKRFCLYHSPAGDTPLNQVFIYLHPFAEEMNKSRR
MAALQAKAFAAMGFGVLQIDLYGCGDSAGDFGEATWEIWKNDVEFAYQWL
IQQGFTSVHFWGLRLGALLALDYAGKAETGSAKFILWQPVINGKSFLTQF
LRLRLVNKLLSDDSDKAQNVHLREELRAGKSLEIAGYTLSPAMAAAIDEL
KLGQLVVGNSEIYWFEITPEAGRGLPPAGAAVVEAWNQSGVYPEVTLIPG
LPFWATQEISECPALLAATAKLFAGIQP
>NE1651 DUF175
MPKSTRFLKLTSFVLIVILVGSTLFSVWFYRLATTPLNLPAVPSEFSIEP
GSGLHRIAGQLAEAGILSNEWSFILLAHITGYNASLKAGDYQLTEKLSPL
DLLKYLTRGKVRQYAITFLEGWTFSQFRKALDEHPALRHDSDKLNDSELL
RAIGAKESHAEGLFFPDTYFFTRNSSDLTILKRAYQAMQQHLETVWLARQ
EFLPLKDQYDGLILASIIEKETGADNERTQIAGVFINRLRHNMKLQTDPT
VIYGMGNKFDGNLRKIDLQTDHEYNTYTRFGLPPTPIAMPGLASIRAAFN
PAITDELYFVARGDGTSHFSSTLEEHNRAVLKYQKSSIKHSVH
>NE0586 conserved hypothetical protein
MALKNHDFNDLVRLEQLRHIETGQPQALSYAGIASMEAYPASGFSYAAFL
ERLLDRAHHLVHDNHLDEVLQQPEKLFIRASRIILLLAAVLGGLAAVNAA
SESSTLNIYWLLVVLLGFNFLSMLLWCAGILLSVQGLSSGIAAQLACWLP
FQLKKRESDSTGTFAARAWWETCLSGRVGRWRISMLTHQFWLVYLLAGMG
VLILLMLAKQYDFVWGTTLLPENSLPELTRLLGVPMQHIGLAVPDGQQIA
ASRIGAGVQDSVIRSAWAEFLVGALIVYGLLPRLILMLLAFFMLKLSEYR
YKLDLYLPYYVTLRQSLIAKEFVTSVIDRDPGMAKESLEPITRAKHSRRF
PENALVIGVELDSHAIWPEGLVCQENVADQKTFARVSEMLKKSKGALVIG
VAAHRLPDRGVQRMVRELATLVSGQIWLILLQSSPAIPVAESRRQAWYRL
AQTCAIPAEHILS
>NE0139 Generic methyl-transferase
MFDSVLQQSMQQWLETSLGQYVLEQEQRYFDRVVTDIFGYNATQIGFSGF
DFLRNNRMPFKFAFGVRDGASVYAHPHFLPIKSSSIDLVLLPHTLEFNSN
PHQILREAHRVLIPEGKVIISGFNPFSLWGMRQRMAKSKTDFPWCGRFIA
LPRMQDWLELHNFEIVAGQFGCYVPPCTREKWLSRLRFMEAAGDRWWPIA
GGVYFLQAVKHECGLHIITPRWEDSPAKRGTVVVPQIERGHRMNNLEVST
VPWREKAGGGLIENRKYGVQYSVNNRDV
>NE1856 hypothetical protein
MSTTNILKHLANLEKHLAEEHPDNPVLTKAVHSFRKLDGVAQALGLLELN
ESYATYVTWWPMIAVLGTFSSGKSTFINSYLNMTLQRTGNQAVDDKFTVI
CYSRHDEIKTLPGVALDADPRFPFYQISHSIDEIAEAGPQRIDAYIQLKT
CPSEQIRGKILIDSPGFDADSQRTSTLRLTQHIIDLSDLVLVFFDARHPE
PGAMKDTLEYLVAATINRPDANKFLYILNQIDVTAKEDNPEEVISAWQRS
LAQAGLLAGRFYRIYDKDAATPIENPQIRERFEQKREEDLADISTRMQQI
EIERSYRIAAMLEQTANTIENQVIGKLETALDQWRSQVLTLDGILVGLLA
ALTGIALTVTDSWHLLKECIDGISAGGILPIITIMLLLGVIGYIHFAVRK
SVARSILKQLPQEFGHDHNAAKQFSRAFSKSTAGYRPMFLKKPAGWNGTN
RRILAEVKEDANDYIQMLNDQFTDPSGHLEQEQNQPATEQA
>NE0801 conserved hypothetical protein
MKIIERYITRELLFPFIVVTVILIGLFVSFSVARFLTSAVTETLGTAAMF
KLVGLKTIIALEVLVPIAFYVAVIYGLSRMNRDQEINVLRTAGYGDNRII
RTVFVIALPIAILSGILSVYARPWAYAESYIMDAQAEAELNTNRFQPGRF
YGSEKSGRVIFIRGKDDLEKRMEGVFHYIRTTEDREIIISREGYQQPMTA
EQWPHIELREGQIYRLSFDTVKDSAIRFEKMIYFNENDQAQNYRRKAAST
RSLWDSEEPREIAELQWRLSRPVATVLLALIAVSFTRTAPRKDKTDRTFL
VAALVFAVYYNLSGLAKTWVEQGVVAAMPGVWWVYGLLIIIVMLRLPELR
SLLPAGKQ
>NE2492 General substrate transporters
MTTENNQRTKIPAGIWVLGCVSMLMDISSEMIHSLLPLFMVGTLGASAFV
VGLIEGLAESTALIVKVFSGVLSDWFGKRKGLAVFGYALGALTKPLFAIA
PGIGVVLTARLLDRIGKGVRGAPRDALVADIAPPEIRGAAFGLRQSLDTV
GAFLGPLLAVGLMLLWADDFRAVFWVAVVPGLLAVALLLFGVHEPDRHVG
EKRINPIRPENLKRLSSAYWWVVGVGAVFTLARFSEAFLVLRAQQSGIAV
ALVPLVMVVMNVVYSASAYPFGKLSDRMNHKLLLALGLVVLIAADLILAL
DDHWITVLAGVALWGAHMGMTQGLLATMVADAAPADLRGTAFGFFNLVSG
IVMLIASAVAGLLWDQLGASFTFYTGAVFSGVALLGLLKRF
>NE1878 conserved hypothetical protein
MNIQIILSLIFSMILMACTTSSPQKDLSRICDSSGCSDRSGNYVSNHSSA
ASSDEEARIRVLEDVARQDPRAAYDLGLRYFRGDGVRQDSYQALQWMREA
AERGDLNAQKALGRLYLTGLEEMGADYREAEKWLRIAASRGDKESEQLLV
EAAAFASEERRSGEIYHRWVNRWQPVFYQRWYYGYPYLGTWRGNYWYY
>NE0184 NUDIX hydrolase
MTWKPNVTVAAVIEQDDKYLLVEEIPRGTAIKLNQPAGHLEPGESIIQAC
SREVLEETGHSFLPEVLTGIYHWTCASNGTTYLRFTFSGQVVSFDPDRKL
DTGIVRAAWFSIDEIRAKQAMHRTPLVMQCIEDYHAGKRYPLDILQYYD
>NE0138 Metallo-beta-lactamase superfamily
MINVFPVRAFRDNYIWIVHNQQFALIIDPGDATPVLTWLRQQKLQPIAIL
CTHHHHDHTGGISLLVQKFEIPVYGPASEKIPGMTHPLAEGDTLVFPELS
LELSILDVPGHTAGHIACHGQNRLFCGDTLFACGCGRIFEGNAQQMFDSL
QKLTDLPDETQVYCAHEYTLDNIRFARAIDPDNPELIELESNVEEKREQN
MPTLPSSLAAEKATNPFLRCNQPAIIQSASRYAGRQLTDPVSVFAAIRDW
KNNFRGNTDLPM
>NE0329 MoaA / nifB / pqqE family
MSVPFLQQYRVGSYILKQKLAGNKRYPLVLMLEPLFQCNLACAGCGKIDY
PEETLRRRLSVDECLHAVDECGAPVVSIAGGEPLIHKEMPQIVQGIIQRK
KFVYLCTNALLLDTRMDDYQPSPYLTFSIHLDGNRERHDASVCREGVYDK
VIPVIEQALQRGFRVTVNCTLFQSETAEEVAEFFDTATKLGVEGINVSPG
FSYEHAPRQDVFLQRSVSKRLFRSIFEIGKKRKLPWKFNHSSLYLDFLAG
NQSYKCTPWGNPTRNLFGWQRPCYLLVDEGYATSFRELMEETDWDRYGVG
NNPKCANCMAHCGFEPTAINDTFAHPLKALRVSMRGPRVEGPMALDPLQT
SSESQHNTDKRKPFPIPVTVEHKTVAPSSDHSSGPDN
>NE2570 probable transmembrane protein
MIDWQSFTPASAFTGGMIIGLATALLLLITGRIAGISGIIGGLVELRRGD
FAWRAAFVSGLLLAPWLWQWLGELPPVHIETSHTVLALAGLAVGIGTRYG
SGCTSGHGVCGLSRLSPRSMVATVLFMIMGMMVVYVVRHSLS
>NE1110 Helix-turn-helix motif
MTIHIEELENMDFSDVAEGGKLHPIHPGEILREEFLMPLKITPHALSLAL
QIPATRINDIVRERRAITTDTALRLARYFGNTAEFWMGLQIDYDMTITRD
SLRGALNRIQRFEPTHIS
>NE0820 Zinc-containing alcohol dehydrogenase superfamily
MTIKAYGARAGDLPLEPMNITRRTPSAHDVQIDITHCGVCHSDLHQVRSE
WAGTLYPCVPGHEIVGRITAVGAQVSGYKPGDLVGVGCIVDSCQHCADCN
DGLENYCDHMVLTYNGPTSDAPGHTLGGYSQQIVVHERYVLRIRHSEAQL
AAVAPLLCAGITTWSPLRHWKVGPGQKVGVVGIGGLGHMGIKLAHAMGAY
VVAFTTSESKREAARALGAHETVVSRNPDEMARHVGSLDFILNTVAASHD
LDAFFALLKRDGTMALVGAPATPHPSPNVFNLIMKRRSLAGSLIGGISET
QEMLDFCAKHNIVADVEMIRIDEINEAYERMLKGDVKYRFVIDCASLTA
>NE0658 Short-chain dehydrogenase/reductase (SDR) superfamily
MNTVLITGANRGIGLEFARQYAADGWQVVACCRQPQQAEALNRLADQYKD
RFSIHRLDVRELAEIDQLSHKLQDLSIDILINNAGVYPHAQNGEFGRISY
DDWMEAFRVNTFAPLKMVEALIEQIACSQLKIVATITSKMGSIADNQRGG
SYIYRSSKAAVNTVVKSLAIDLQPRGIIAVLLHPGWVQTDMGGRGALIST
KQSVTGMKSILDRVTHSDTGKFIAYDGQHIPW
>NE2257 SAM (and some other nucleotide) binding motif
MRLILSQCLANLSVRFLMNRNSYNKIAHLWNVARNGFFGREREYLDAILS
VAPIGSTILDLGCGTGRPMAEYIVSRGRCVLGVDQSEEMLRLARQKLPHE
QWVLSSIESYEPVEGYHGALLWDSLFHIRRTEHELIVSKVVRGLPSGGRL
MLTVGGSAHPEFTDFMYGEEFYYDSNTPQETETFLQRLSCRMVIGEYMNL
PDGGRDKGRYAIVAEKI
>NE2005 norQ protein
MTPIPFYVPVGNECELFETAWQRRLPLLLKGPTGCGKTRFVTHMAARLQR
PLFTVSCHDDLTAADLTGRFLIKGGETVWVDGPLTRAIREGGICYLDEIV
EARKDVTVVLHPLTDDRRMLPLERTNEILHAPDTFMLVVSYNPGYQNILK
SLKPSTRQRFVALSFNFPPPEIELEIIASESGLARDRCTALVNLATRLRL
LKDVDLEESVSTRLLVYCATLMAAGLDPYQAAQAALVEPLSDETEVQQGL
LELIHATFG
>NE0481 Helix-turn-helix motif
MTRPVNRMRAVHPGEVLREDFLIPAGISVNALAIALSVPATRIHEIVKER
RAVTADTAERLAHYFGGDAASWLALQASYDLKTLPTRDEIERRVQRREEH
V
>NE1564 Domain of unknown function DUF71
MLRRRTFLSWSSGKDSAWALHVLRQDPHVDVIGLFCTVNKVFDRVVMHGV
RVALLQQQAESAGLPLHIIEIPYPCSNDEYASAMSAFVDSARKENIECFA
FGDLLLEDVRQYREDRLNGTGITPIFPLWGIPTKTLSREMVAGGLKAVIT
CIDPKRIPESFAGREYNESFLDDIPGSVDPCGEYGEFHTFSFDGPMFQNP
IDVVLGETVHRDGFVFTDLLSLTSSTEPTH
>NE0966 Uncharacterized pyridoxal-5'-phosphate dependent enzyme family UPF0001
MTTIASRLQNVKNRIIEAAKKAGRDPESVQLLAASKTNTPDKLREAWEAG
QTVFGENYLQEGLVKIRALSDLPIEWHFIGPIQSNKTKLIAENFSWVHGI
DREKIATRLSAARPESLPPLQVCVQVNVSGEITKSGVDPEKAAELAAFVS
EQPRLQLRGIMAVPELTAVTALQREQFQMMREVYEQLQQQGFNLDTLSMG
MSEDLENAIAEGATMVRIGTAIFGPRRYAIPEELGSRQ
>NE1681 possible methyltransferase
MNRIVLMIVDPIIEHYMHTLARRSDHPVLDEMEAFALEKSFPIVGRLVGI
SLEIYAKMIGARRVFEFGSGYGYSAYWFGRAVGPGGQVVCTDSNPLNREQ
AEQYLAAAGLWERVRFCTGYAQDIFGQTDGNFDICYNDADKGGYPDIWLM
ARERIRSGGLYIADNVLWHGWVAVEDSADAKPDWTKAIREHNRLILTDPE
FDAFINPTRDGVIVARRKMA
>NE0100 General substrate transporters
MVSPFRQHRRILVASLVGTTIEFYDFYIYATAAALVFGPLFFPAESPSAQ
LMLSFLSFSLAFIARPFGAVLFGHFGDRIGRKSTLVASLLLMGISTLLIA
FLPTYSTAGWIAPLLLCILRFGQGLGLGGEWGGAALLAVENAPPGWRGRF
GMVPQLGAPIGFLAANGLFLLIGLQLSDADFAAWGWRIPFLASSILIVLG
LWVRLKINETPEFTQALAQNPPVSIPFWELIRKHALITFAGTFTVVACFA
IFYLSTAFALAHGTTTLGYDREQFLITQLAAIAFLAAGIIIAGIRADKAS
ANQVLSWGCAATIGLGLTFGPALGAGSLWLVWGMLSLALFIMGFVYGPLG
VWLPSLFPPRIRYTGVSVAFNAGGILGGALAPIIAQALTDAGGTSLVGLY
LTMAGIFSLAGLKLVGKLMPDKTE
>NE1503 Rieske iron-sulfur protein 2Fe-2S subunit
MSSFVQTPDTAVAQPQLSVDWYLDPRIFELEKQLLFDQGPGYVGHELMVP
EMGDYYVPATQNNARILVRNENGIELLSNICRHRQATILEGRGSSRNIVC
PIHRWTYAMDGKLLGAPHFSKNPCLNLGKTILQRWNGMIFAGKRDINRDL
AGMGSRNELDFSGYVLDRVQIDHYQCNWKTFIEVYLEDYHVDPFHPGLGH
FVNTRQLEWEFGDWYSVQTVGVNPDFSHAGSEVYQKWHAQVLQQNNQQIP
RHGAIWLLYYPNVMLEWYPNTLVVSTLLPTGIEQCMNIVEFYYPEEIALF
EREFVEREQAAYRETAREDDEICRRITAGRRALYEQGVNETGPYQQPMEA
GMEHFHRFLRREIESHLY
>NE1577 conserved hypothetical protein
MKVSISNSAFNDLETMISYYTAEGVPDVGFKFAQEIIEHIQILADHPDMG
RIVPEFQLPHIREIIFAPFRVVYLREKGAIKVIRVWRSERPLVLPTET
>NE1165 Short-chain dehydrogenase/reductase (SDR) superfamily
MKDKVILVTGGARRVGAAICRWLHRKGARLVVHYRDSSADAQRLKQELEQ
GHPDSVALLQADLLDTGGIPALVDQAARQFGRLDALVNNASSFFPTPVGD
CTEQAWHDLVGSNLKAPLFLSQAVAPYLKKNRGCIVNIIDIHTEQPLKRY
VIYNAAKGGLAALTRSLAMELAPEVRVNGISPGPILWPETGEWQDETARR
HIIDRTLLKRMGEPDDIARTVSFLIEDAPFITGQIIAVDGGRSINL
>NE1103 GCN5-related N-acetyltransferase
MSLQLNPPELLVATHLLDDFECGVNSLDEWLKRRALANQHSGASRTFVVA
DHDSRVYGYYAMAAGAVSHQAATSGVRRNMPDPVPVMVLARLAVDQRAQG
IKLGAALLQDAVNRVVNVSHNVGVRALLVHALDDRAKQFYAHYGFKESPQ
HPMTLMLRLNTTKA
>NE2231 conserved hypothetical protein
MQSNDNPSLTQQLTTLLSHHPVIKLAILFGSRADPARTKHFGSDIDLAIM
TGEPISSHFKMELMQAISTELDCPVDIVVVNDAPEPILGEILKGQRLLGD
NNTYAQLLARHLLNTADFLPLRQRILKERRERWIQSY
>NE1406 putative AttH
MRYLWILLGWLAVQNMLFSAPPVLAPVVPGKALEFPQDFGAHNDFRIEWW
YVTGWLETPTGKPLGFQITFFRTATEIDRDNPSHFAPDQLIIAHVALSDP
AIGKLQHDQKIARAGFDLAYARTGNTDVKLDDWIFVRETDGRYRTRIEAE
DFTLTFILTPSQPLMLQGENGFSRKGPGAPQASYYYSEPHLQVSGIINRQ
GEDIPVTGTAWLDREWSSEYLDPNAAGWDWISANLDDGSALMAFQIRGKD
DSKIWAYAALRDASGHTRLFTPDQVSFHPIRTWRSARTQAVYPVATRVLT
GETEWQITPLMDDQELDSRASAGAVYWEGAVTFTRDGQPAGRGYMELTGY
VRPLSM
>NE0091 Universal stress protein (Usp):ABC1 family
MKNPLEGSVIRRVMVGTDRSETADQTVQWAAGLADRYDAELFIVQVIVPK
YPSATEFGESEQTSAVAANNDLAHFARQIAGERGHALVVINADPALAIVH
AAEQEAIDVLVVGNLGMAGRKEFLLGNIPNRISHNAHCTVIIVNAAHSAD
ERAPHSVRASLSNDEIPSFKPRLVARATHIAAVMAKHGLTELFSQSDPDI
SIRRQQARRLRGALEELGPIFSKLGQVLSTRPDLLPIEYIEELVLLQSRV
PPMTESEVVRVSEQELGVPWEDVFKSFDPNPLAAGTIAQVHRATLETGDR
VVVKVQRPTARADIEQDLALFEMFAEKVGKRPALNQVINMEDVFKHLSTS
LHRELDFRQEANNIERMRTVLADYDRLAVPSIHWDLSTSRLLVMEEIQGI
PIKQAPAGPERIQAARQLLESYYKQIIVDGFFHADPHPGNLMWWKDCIYF
LDFGMVGAVGADLREHLLLMLMALWQEDAGFLTDVTLMMTNAVNSNDFDV
AQFQSEIGEVMAKYRAASLAEMQIGPLLQEMSTVSLRHGVPLPASLTLAT
KALAQVQLATAELDPTLDPYDVAGKYLMRLMVKRIGAALNPKTFVYQSQK
LKVRTLRVIEALENLVGVRPGGPKLVVNFKANSLENIVRHTGRRLALGLT
AAASILTSGLTTMSTAVAEWVPVTFGAVAGLLTLGLVIDLLRGR
>NE1304 Bacterial regulatory protein, LacI family:Helix-turn-helix motif
MARMHNPPYPGETLREDVLPALGLTVTQAAKELGINRVTLSRVLNGKAGI
SVDLALRLEAWLDGPSAESWLKGQLAYDLWQAEQRGCAKVVVRHINREQI
>NE1004 conserved hypothetical protein
MKLHIITCSTRPGRIGPYVAKWFSEVAVQHGKFEVVPVDLAEFNLPVYDE
PEHPIRQQYQHVHTKNWAASVSAADAYVFVTPEYNFGPPPSLLNALNYVY
KEWNYKPAGIVSYGGISGGLRSAQILKQTLTTLKIMPMMEAVAIQNVSTL
ISEHKQFMPSEHHTSSAVTMLNELYKWAQALKTLR
>NE1416 Insulinase family (Peptidase family M16)
MKISFHFSYFATLLCITLAWPLPATANSHEYLLDNGLKLVVKEDHRSPVV
IQQVWYKAGSMDEVNGTTGVAHALEHMMFKGTDSVLAGEFSRKIAAIGGK
ENAFTSRDYTAYYQQLHQRHLPMAMELESDRMHNLQLTEEAFAKEIQVVM
EERRLRTDDQAHSLLYEKMMATAFQTHPYRRPVIGWMNDLENMQVNDARD
WYQRWYAPNNAVLVVVGDVDPENVFVLAKKYYGRFSAARVPALSERKPQI
EPPQTGIKRLVVKASAQLPYLIMGYKVPVLKDPKNEWEPYALTILAEVLD
GNASARLNKTLVRETRVAISADASYNAIERGPGTFFIDGAPSEDKTVDDL
EQSIRTEIGKIIQSGVTQEELARVKAQVVANHIYQLDSTFAQAMQIGRLE
SVGLSHRDADIILEGLQAVTAEQIRKVAEKYLIDDSLTIAVLDPQPLPET
THPRNSNIELKH
>NE2303 Uncharacterized NAD(FAD)-dependent dehydrogenases
MTSYLSEEAGPDLTQGISLSDFGNQPLLRGHVGDEPVILARIGDEITAVG
ATCTHYGAPLTEGLVVGETVRCPWHHACFSLRSGEALGAPAFDPLPCWQV
ERDGDRIMVRDKITPKPRSIPVAAANQPANVVIIGGGAAGFACAEMLRRR
GYQGQLTMLSEDSDAPCDRPNLSKDYLAGNAPEEWIPLKSDDFYVRNRID
LQLHTTVTKINTTGHTVTTADGRIFPFDRLLLATGAEPVRLPIPGANQSH
VFTLRTLADSRAIIERAKHAKAAVILGSGFIGLEAAAALRARELDVHVVS
LDKHPLEKILGSEPGDFIRSLHEQHGVQFHMGTSLAHIEPHKVVLSNGKE
LTADLVIIGVGVRPCVSLAEAAGITVDNGILVNEYLETSVPGIFAAGDVA
RWRDEASGKTQRIEHWVLAERHGQIAAENMLGANTAFQDVPFFWSAHYDI
SIRYVGYAGPWDTLEIEGDMAAYDCLISYKTGGKTVAVAAIGRDKQALEY
RALIAQQQH
>NE0214 4-hydroxybenzoyl-CoA thioesterase family active site
MEVVFVHPVRIYYQDTDAGGVVYHASYLNFLERARYEWLRELGFTVDTMI
RSHKMIFLIRSLGIEYFKPAVLDDLLDITVQVVDIGRSRITLQQQILREQ
GTLASATVHAVCVGAETLKPISIPAPLRQKIEKQSS
>NE1128 hypothetical protein
MNDIEAAFHQYFEIIDADTPELLKTVFDLRYRILCVHNVIPGFDTNNYPN
ELESDQYDSHSIHFLLRHRPTNTFIGTTRLILPNPLDPMDKFPTELNTHF
YPGFVLDSSSRKHTTEVSRFAILSDFFKRKGERNMLSQSTEIGCKAQERR
RFPHPMLGLVVGLIQLCARNNIYHLISAMEPALNKLLGFYSLQMNPIGPP
ADYHGLRTPYYLYLPDLLDRMYQDHRSLWELITDHGRIWPMNLACIHQKT
LKTAYTDNVYISE
>NE1545 DUF209
MNTSMNTVRIIPAMAVPEGAGVIVHRTIGTPVLRNYDPFLLLDHIGSDNP
DDYIAGFPPHPHRGFITFTYMLDGHMQHQDSMGNTGDLGPGSAQWMKAAS
GVIHSEMPKQENGLLRGFQLWINLPAINKMDHPEYQEFPAAAFPVVETAD
YRLKVLIGRFGDTVAPIRDDLTQVTYFDVQLQPGRHFQHRLPAQNTSFIY
LFEGNGQFNGQDIGLHSLIAAGTDGGTFDFVAGKKGARFIVVSGKPLHES
VVQHGPFVMNTREQIDQALKDFQSSQFVRDRAWVKRNQ
>NE2113 hypothetical protein
MIVVDSNVLAYFYLPGEYTATAEALFEHDPDWVAPVLWWSEFRNILAGYL
KRGNLTFLQAYNLQCEAEDLLASAEYEVNSPSILELVRDSECSAHDCEFV
ALAMKLGAKLVTMDGKLLRAFPGIAFALSMS
>NE0905 conserved hypothetical protein
MPSTLSEELVGKYLAAQYQVWIDTSVVTLQIGCQSAPLAALLQATGNRSA
VYVTACNPASEVATSQENQSAMARLYERLACYSNHIYRGTGIDPSGEWPA
EESLLALGIDLSIAKKIGDEFGQNAIVWIDSAAIPHLVLLC
>NE1690 Haloacid dehalogenase/epoxide hydrolase family
MNIATRPADYWQAIIFDFDGVIVESGDIKAQAFAELYRHHGETIAQAAVT
YHRANGGMSRYLKFHYFQQNLLNYPPLTKEEEQELDRRFSELVMNAVISC
QPVAGAEALLHRMVDQIPLFVASGTPESELRIIVEQRDLSRYFTEVRGSP
RLKETLVADILSAYPLVPERVLMIGDALVDYESAHQNGIAFLGRVRPGDD
NPFPESVEIVPDLCPVAI
>NE2408 DNA internalization-related competence protein ComEC/Rec2
MLIGIYSLAFVFGALWLQQQSVLPEFYWAIGLIPVALGVLVLLRFQTRFS
ILAGRGLLLAVMLGAGFFWAALWAQVRLADDLPSAWEGQDIAVIGVVTEM
PQLTRQSMRFRFEVERVLTPDAAVPAHVQLSWYRDGRRDEGNLPRITAGE
RWQLTVRLKRPHGSANPHVPDYEARLFERNVRATGYVRAGESNKRLEVQE
IHPVYLFERKRDEIRSQFQHYLAGYPYAGVLIALTVGDQQAIPSEQWETF
TRTGTSHLMAISGSHITLLAGMVFLLAYRAWRYAGLALWLPARKFALVCG
LIVAIGYALLAGFAVPVRRALFMMAIIVVAFWRNQRVRTLPVLGWVLLLV
VVLDPWSVIAPGFWLSFAAVALICLVISGRIGRPGVVAGWMRIQWAITLG
LFPLLLILFQQISLISPIANAVAIPVISFIIVPLALLATIPGLEFLLLIA
HPVLQITMGVLQWLGELPLATWQQQAPPLWAVIAATLGVVWLLLPGGPGL
GVSAGFPARWLGILAGVPLFLISPEKPAEGELWLTVLDVGQGLSVVVRTR
NHTLLYDTGPGYGENDSGKYVVLPFLRGEGVQALDMMIVSHVDSDHSGGA
LSVLKRIKTDVLLTSIEDNHPIRQAVPDNRHCLAGDAWWWDGVYFEILHP
VKPDTLIRKRKTNESSCVLKVTTSHGSVLLPGDIGRVTEEDLLQRYAREL
ASSVLIVPHHGSRSSSSEVFVRQVDPDHAIFTVGYRSRFGHPHAEIVERY
LEHGSRLYRSDHDGAVLLRFVSGNITADTWRRLNRRFWHDEWPSADRED
>NE1643 conserved hypothetical protein
MFDQLVIDALDFVRSGKSLQGNVPLLNLERLRDYLTNSAGELAYLVTGLL
DERDRPLLKMSVNGIIDLSCQRCLEKIEYTLDVKTALLLARNEDELSRYD
EDMFVDAIYASNELDILALIEDEVILSLPVSPRHEDTAGCHPSTGTGIHE
AAVKEHPFTVLASLKQSH
>NE2301 conserved hypothetical protein
MSISSSPILNRRLNIAQLFSHKHCVLCQAPNHQDICNACLQDLPGLPPVH
CPSCLLPMTSPEICGTCLRNPPAWSHIRAALRYTFPADALVQALKYRSDL
PLAPILAGLLLGRFRDDPLPDYLIPVPLHPARLRERGFNQALEISRHLCR
QTGVELLSAACTRIRSTPSQTELPWKNRPQNVRNAFTCNRNFSGKRVAIV
DDVMTSGATLNELAKVIRRHGATDVRAWVIARAFPGAPAAKRATDLPGND
ETKRPIKP
>NE0299 conserved hypothetical protein
MDPVTHTLSGALLMRAVTSSHTQYAQRLPLRERIVAGSVAAAFPDSDVVL
RLIDTLTYLNWHQGPTHSLVMLPFWAFLLAHLFSRFTGGHYPWRSFFVPA
CLGIMIHILGDLVTAYGLMLLAPFSTWRFSLPLVFVIDPWFTAIILAGLV
LSAIFPTKRVYAVASLIGIVAYVSFLWMLHEEAMQAGKVHAAEKMLDQQT
VSVLPQPLSPFNQMVIIRDDTELHVARINLRRSTLLKCTDTSNLLCNMTA
AYRPLAMANWRSYRLHDSRSPEYAALSHEAWQQPVLAPFRQFAQFPVLDG
IDHPAQNICVWFIDLRFQFPGLPPSFRYGVCREEENSLWYLQRQHGAFWI
D
>NE2398 CBS domain
MKTVKHLLQEKGHTVVAIGPDDSVFNAMQKMAADNIGALLVMKDEKLVGI
LTERDFSRKSYLLDKPVKDTQVKEIMTRQVAYVDLNNTNEDCMALITEMR
VRHLPVLDDGKVIGLLSIGDLVKDAISQHQFVINQLERYIYDTREI
>NE2259 ThiJ/PfpI family
MSKKILVVLTSVEKYPEMDRATGLWLGEAVHFVRKVEAAGYEVDYVSPQG
GYTPIDPHSLAMAEPIDWEWYQKKEFMNRLGKTMKASEVNPDDYIAIYYA
GGHGVIWDFPDNEELQSISRKIYENGGIVSSVCHGAAGLLNIKLSNGSLL
VKGKELTGFSNEEEKLAELDKFVPFLTETELLARGAIYKKADEPWVSFAV
EDNRLITGQNPASGGAVADLLIKALKNELRHLLR
>NE1027 Esterase/lipase/thioesterase family active site
MHNRVIRWILLQLVTALLIISAVTIILGEIVTGSAPTAVETLLPDFPVET
VQIPVNDEYAVHGWLAHGMSGHGAVLLVHSMRSNRLEMLGRARFLNNQGY
HVLMIDLQAHGETPGDRITFGARESADVAAAVGYLRSTFPHDRIAAIGAT
LGAAAIVLANPPLKLDAMILESLHPTFAEAVANRLKLHLGNTGEYLQFLL
LPYFSFLLDLPVNQLNPVDRIGNIAIPVLFIAGTLDRHTTQSEVKRLYDA
ALPPKELWIVEGAGHYNMHTFAGKSYEMQIADFLSTYLQRQ
>NE0381 Integral membrane protein, DUF6
MIEYRIFRFPMYPLLARFFSSHQQALGVLFALLSAIGFSAKAIFIKLAYV
EPVDAVTLLALRMVFSVPFFVFAMLHGRAQATPMARHDWLAVLLLGLVGY
YLASFLDFLGLQYISAGLERMILFLYPTMVVLISALVFRAAIGRRVWFAL
LLSYVGIGLVFVHDFHITSDGLFMGSSLVFASALAYAIYLIGAGHTIARI
GSMRFTAYAMTVACLACIAQFLLTHPLDDLQQSTRVYGLSIGMALFSTVM
PAFALAAAMRRIGSMQTSMIGALGPVATIYLAYVFLAEQLSLTQLAGSGL
VLIGVMMISMRKME
>NE1420 probable rubredoxin reductase
MNQPSVVVVGSGLAGYTVVRELRKLDAAVPITLLSADHGSFYSKPMLSNA
LATGKTPDSILSAGTMQMSGQLDITVRPYTSVNAIDVAAGSVSFEEGGQL
TYDRLVLALGADPIRLPIPGEGVDEILSVNNLDDYRKFREALESKRHIAI
LGAGLIGCEFANDLAAKGYQVSVFDLSPQPLGRLLPPEAGRFFRDKLTAA
GVNFLLGTTVERVSKENGYYQLFYEGGKVVQADMILSAVGLRPRTRLAAV
AGIQVNRGIAVNRYLQTSIQNIYALGDCAEVEGKVLPFILPIAHAGRALA
ATIAGNPTLLHYPAMPVMVKTPACPTVVSPPDPAVQGEWEVVAIENGMKA
LYHDEAGNLHGFALLGTATVERNTLASRLPPVLA
>NE1266 putative death on curing protein
MTTPVWINEQDVLAIHERLIFLHGGASGIRDRNLLKSALARPLNFSVYDQ
QSDIFLLAATYTSGILQNHPFVDGNKRTGFVIGVLFLELNGYKFIANEED
SAQAIISLAEGSLDELGFRLFIEHNSIAT
>NE1365 Helix-turn-helix motif
MTMNANIEVRLKSPAHPGGFIKHEIIEPLALSVSNAAEVLGVSRAALSAL
LNERAHLPPEMALRIEKAFGVSMDTLMRMQNSYDIAQTRKRAEEIKVAPF
SGKPIESNSVV
>NE0491 conserved hypothetical protein
MSFDANIATQYYILMTIKTFRCADPETLFKLGRVARFVNIERPALRKLKQ
LDLARCIEDIRVPPANRPEILKGDRAGQHSIRINDQWRVCFRWTGTDAED
VEIVDYH
>NE1399 GCN5-related N-acetyltransferase
MATTGIPAFQAEIREMHPDDLEQVIRIEHEIFLFPWSIVNFSDSIKAGYH
CRVLVQPNSDLVMGYGILMTGPGEAHVLTLGVGAAWQSQGLGRKMLRYLI
ELSRKHQAEFVLLDVRESNTGAINLYQRLGFQQIAVRKGYYPAMCGREDA
LVMKLEL
>NE0482 conserved hypothetical protein
MGCMIQSFRCKSTQAMFEGECPQRFSAIQAVAERKLAQLEAAQTLDFLRS
PPGNRLEKLAGDREGQWSIRINAQWRICFTWSDLDPADVEIVDYH
>NE1207 Bacterial transferase hexapeptide repeat
MIRKNPRGDQPIVHESAFVDPTAILCGRIVVHENVFIGPYAVIRADEVDE
TGHMEPITVGAHSNIQDGVVIHSKSGAAVTIGERTSIAHRAIVHGPCTVG
PDVFIGFNSVLFNCTIGEGCVVRHNAVVDGCDLPPGFYVPSTQRIGPRTD
LSTIPKVSPKASEFSEDVARTNNTLVQGYKHIQNEL
>NE2169 Generic methyltransferase
MKVCLQCENFFSSADWACPSCGYQPERLNGIEAHASEFAHGGGGFKPEYF
SELSRLEAGNFWFRARNELILWALRTYKPNAGAFLEVGCGTGFVLSGIAR
ACPEIALNGSEIFLAGLFHAAKRVPSTHFMQMDARRVPFVEEFDAIGAFD
VLEHIEEDETVLAQLHNAIKPSGVLLLTVPQHPRLWSASDDYACHVRRYT
RVEIEQKVLTAGFELLRSSSFVTSLLPVMMLSRVLQKRKTKDFDPAGELK
INAALNKVFYGLMMLELAGIRLGMNYPVGGSRLVVAKKQSA
>NE2092 Glycosyl transferases group 1:TPR repeat
MRDHPADRLQSITVTAFYQEPNVPTHSLSDYTSWIERTWQGQVSFTELVT
YAETLNSHPALCAALYRTWLQRNTGVFNSVAWFNLGVILFAENNLIDSIE
AFQKALALSPAFPQARINLGLALERQGNAEAAIEQWQAVVENAITPEADQ
NTGPNQADQIKNLTMALNNIGRLQETRRQYQAATQALEKSLQLDPDQPDA
IHHLIFQRQKQCQWPVYAPVGKVTEAVLHEHTSALAMLNISDAPEAQLTA
ALNYSRRKIPADLPRLSPANGYRHDKIRVAYCSSDFCTHPVAMLTVELFE
HHDKNRFETYAFCWSPDDGSTLRQRILSAVDHYIPVHGKSDDEVAQLIRQ
HEIDILIDLQGQTSGAKTRMLAMRPAPMQITYLGLPATTGLPGIDYVIAD
RYLIPEEYARFYSEKPLYMPDVYQVSDRKREHSPAPTRKDCGLPARKFVF
CSFNNNHKYTLEVFTTWMNILRRVPNSVLWLLADNPWARENLQKQAKAQG
IDPKRLVFAERTMPADYLARYLVADLFLDTFPFNAGTTANDALWMGLPVL
TMSGRSFASRMAGALLTAADLPELITHDLQTYEDKAVALAADAKARKTMR
QKLALAKESGPLFDSLRFTRNLEQQYIALVSELQNPSQHINISAQPEPTK
LGEPAQPNPIIATVQEAEALQARGDTQGAIQLYRQWLEHAHSGDEWIAQF
NLGVLLRDGGDITGAQQAFQAVLKQKPDFVQARAALGKLPAPVTTESQKH
GAIIGPLQTNSPISTKSKEQITQIFSETPKIKLLVEGWRGINHSFALVNQ
YHLYEWMRSSQLHIYHRDMPLLFSHWETNKNRGNTGLPGTYSQRLAQVQH
WSGQTYDACFRIYSPVTLAPDDKHPVSTFLVTELGLDETQIAHFRPNLKA
YFNMGGNIVTPSHWSKERIIEAGIPAEHIHLISHGVASNIFHPMHSDERA
IHRQRLGFDREAVIFLNVAAPIWNKGLDLLIQAFVQCFHQNPHTRLLIKD
QQAVYGISTKDTVLREITLLGESKNESLLNAIRVIPDLNLLQLRELYCIA
DYYVSPYRAEGFNLPVIEALVCGTPVIVTEGGATQDFCSEKNALFIEAVP
YRNVKINDRHVNAYQAPILDSLITHLNVCAQDKPFSESQRLQNAASIAKN
FTWAKAADTISRLFTQSNDCNTSTSQITGALHEEPQYCN
>NE1589 hypothetical protein
MIVVDSNVLAYFYLPGEYTAAAEALFEHDPDWVAPVLWRSEFRNILAGYL
RRGSLTFLQAYNLQCEAEDLLAGAEYEVNSFSILELVRDSECSAHDCEFV
ALAIKLGAKLVTMDGKLLRMFPDIAFALSASQRSS
>NE1179 putative transmembrane protein
MRRIYFLVPEIVTTRKIVDDLLLAKIEERHIHVIAKRGTPLEDLPEANLL
QKTDFVPAVQQGIALGGATGLLAGLVAVALPPASAVIAGGILLATTLAGA
GVGSWVSGMIGMTIGNRRIKEFEEEIEAGKLLVLADVPVNRVDEIEDRVK
QHLPQIEVMRTEPKVPAFP
>NE1897 Peptidase family M48
MMELDLPATIKRDMKFRYLLIIVPLLFPAHIFGQELPDLGDVSQASVTPH
QERQIGIQIMREIRADSSYLDDPEIADYLTRLGNRLIAASGQTNPDNPFE
FFAINDSSINAFALPGAFVGFHSGLITAAQNESELAGVMAHEIAHVTQKH
LARMISGTSYLGLLGSIAALAIAILASRSNPNAGQAVLATAQASAIQSQL
NFTRKHEKEADRVGFNMLIKAGFDPHGMSSFFERMQHASRYYENGMPSYL
MTHPVTHERIADIQNRTQELGYRQVPDSLEFHLVRAKIRASRGNPASVIN
EFKARLQDKRYINEIAEQYGLIQALLRARQFKQADEELNTLYRTIQSDSS
AQSLKNHRLGKSIQVDGDYLQSAAMIETLAARVKFASGQTDDAFRLYQSA
LQSFPHYRALVYGYADALLQHKDAQAALDFINGQSQFIHDDIRLYRLTAQ
CHAALGNALLQHQAEAEALIREGNLRAAIEQLQIALRHKHDNFYQLSSVE
ARLRQIKEFVAAEKEKK
>NE2059 putative similar to copper export proteins
MKRESFVELFIEKIVTKVKISSFVMGLISLVLLSSSNVVFAHAALTKAEP
ARRAVLTASPKQVRLWFNEEIEADYASLSLHDANGKALTEKKPLVHPDDA
KSIYLELPELIGGQYTVKFRVLSVDGHVVDSEYKFTVKNK
>NE0182 conserved hypothetical protein
MCHTCDGKDFSPELAWTGVPENTKSLVLIVDDPDAPDPQAPKMTWVHWIL
YNIPPATRKLPERVTVAELPSGTLEGVNDWERTGYGGPCPPIGTHRYFHK
LYALDTLLPDLNQPTKAILEKAMQGHIIARTELIGLYHRSDNV
>NE1298 TPR repeat
MIVCYQRRNDKFMKNAQCTGWIPATLKLLYRALPGLCFFMTTGAVSIAFA
EEVCNAPVARMVSLQGVVEYHRPGDSGWHMAASNSTFCAGDRVRVRANSR
AALRLSNESMLRLNQRTAITISGPDVEQNTLLDLMNGVMHIITRTPKPFK
IRTPVVNASVDGTEFLVDAGGEDDSSPSVTIAVYEGRVKAGSDQDNLILA
NQEAAVFQENQPARKTVMVHPLDAVQWALHYPMLIDLYSRSGHENQQSPA
VHHVIEQYRQGKLAEALAELDHLLAEELTVDALILRSELLLTTGRVKEAL
SDLQRTEQLEPGNSDALALHAMIFVVQNRKQEALALAGQAVRNNPASSAA
KLALSYAQQANFQIETALASAEEAVRLDSQNALAWARLAELQMSAGKSDH
ALQSAERAVSLDPDLSRTQTVLGFAHLLQIDTHRAQVAFARATVLDQADP
MPRLGIGIARIRENKLEAGRIDIETAASLDPANSLIRSYLGKAYFEEKRY
PLAGTQFDLAKARDPNDPTPWLYDAIQKQTQNRPVEALRDVQKSIELNDN
RAVYRSQLLLDQDQAARGSSLARIYDNLGFEKRALMETAKSLSFDPASHS
AHRFLSDAYANVPRHEIARVSELLQAQLLQPVNVNPVQPRMAVADLNLIT
GTGPSAPGFNEFAPLMERNKAQLVASGVVGNHGALGDEVVVSGVYDRASV
SVGQFHYQTDGFRPNNDQTHNIYNAFVQYAVTPDLNVQAELRRREKKHGD
LLMDFDPKKFSEVARLNLEEDTARIGAMYRISPRQNFLFSTIYTHQDADV
IEDLGSDFPFYGDQRSHGYQVEAQHILRKDRVNVITGGGIYRTNLTNDFR
KNTEPLICMMMGCEKSKADKEQNVAYLYSNLNILKNVMATLGFSYQAYSN
DAGGINRKVSEFDPKIGLQVDFHKNVRLRMAWFEALKRDLIGQATIEPTQ
IAGFNQFYDEMTGTKSRHKAVGLDIHFANAVYGGVEVSERDLDVPVIPEL
GVSDRDYYWDKQKEQLLRGYVYGTFRPNWVVAIEPEYEKFDRKERYADLP
TNIHTLRAPISVSYFDQNGFWAKLTGTYVMQDVKWARFDEDTWLGWIEKK
DSNFFLLDMVAGYRLPKRKGLLSFEVRNLLNKHFYYRNQYLYLSEPALPR
YIPERTLFARITLNF
>NE2347 General substrate transporters
MSVDYRQLPRSALTDVLLQAVLVFAMTLPMLVLYTTSTLGPLLSRDLGFE
PVAIGYLIMSSFGLAAILSLRAGAIVDYIGVRTALIVLFCAVAAAFALIA
ITQAFFSLIMATAICGIAQALANPATNLLIAHQIRPEQRAWIVGLKQSGV
QLAALFAGLVLPAIAFQYGWRIAFGVIVPVAVLFSLAAWLVTPAQHVRKN
RQLIFTRPNMLLSWLMAIQFSVGLALSAFVTFLPTFATLQEMPLAQAGTL
IAVFGITGMLSRIILTPLGNKFSDESHLLFALIAIAAGAIMLTMQAGPGS
HGYLWAGAIGMGMSAVATNAIAMSMLIRDSAFGRVTETSGYVSFAFFSGF
ALGPPLYGQLFSQTGSVSLAWSLLTGVLCLACIMTLRLTAVRKRRSHVSV
>NE2196 Integral membrane protein, DUF6
MTNTRNENLGYVYGLIAVTAFALTLPAMRAALSALDPVFVALGRGAGAAV
LAAVFLWFTRQRLPTREEAKGLIIVAAGAVIGFPLLAAWAMLYVDASHGG
VVLGILPLATAVAAALFSNERPSMKFWLFALIGAGLVVGYSLSRAGGTLH
PADLALFGSIVCAGVSYAEGARLSKSLGGPQVISWALVFSFPILIIPAIH
YAPVSLNLPLESWLGFIYLTVISQYLGFFPWYHGLALGGIAKVGQTQLLQ
PFLTIIASVLLLGEHADLMTWLVATLVVAVVAVGKHAQVKHNDPESATIA
DTSPHS
>NE1286 GTP-binding protein HflX
MSHDVATGTVADNTAILVDIDFGEGDKESLEELRELARSDRLSVVAVVEG
TRKQPDPATFIGKGKAEEISQILAQTHAAMVIFNHELSPVQQRNLSMVLA
CRIIDRTSLILDIFAQRAKSHEGKLQVELAQLEYLSTRLVRGWTHLERQK
GGIGLRGPGETQLETDRRLLAKRVKLLKEKLTKLKRQREVRRRARKRAEI
LSVSIVGYTNAGKSTLFNRLVRTDTYAADKLFATLDTTTRRLTLPGRGTI
VISDTVGFIRELPHTLVAAFRATLEETIQADLLLHVVDASSSNRDAQISE
VNKLLREIGADTIPQILILNKIDLLEQYPSGNYMRDEYGRIKSIHLSART
GAGFSYLYDALAEVFDQNLKRLEQHSATPESMNDNVRATFINNEKD
>NE1008 conserved hypothetical protein
MQNTFIKLILGIQILFLLTGCVPMVLTGVGVGAGTGALMVEDRRSSGMYI
EDERIELKTSRRIGERLGDKVHVNVTSFNRNVLLTGETPDESTRKEVEKL
AMSVENVLNVSNEIIVAPKSSLASRSNDTLITSKVKARFINNRVFQVNHI
KVVTENGVVYLLGLVKRNEGEKATHIASTTESVTKVVKVFEYLD
>NE1875 Esterase/lipase/thioesterase family active site
MPNEQKRFVTGPAGRLETVVTLPEGAPRGLAIVAHPHPLYQGSMDNKIVY
ILSRAFIEQQYITVKFNFRGVGASEGSYAEGKGEIEDVMAVTQAMREQYD
TGPEPLPLTLAGFSFGGAVQAHVAQQLKPSRLVLVAPSVERLQAPPVVDH
ARHILVIQGDQDTIVPLQSILNWAAPQTLPVTVIPGAEHFFHGKLHVLKN
VILQSCSISQAATSLYP
>NE1568 possible Oxidoreductase
MKISALKAVVVGLGSIGVRHLNNLHALGIRELGAVRTRNLPPPAQIIPKD
VQLFQSLDQALKQNFDLVVVANPTSLHLKTLIEALKAGCHVYVEKPVAHE
KRHLSELMRCVDPHGPRVLVGCQLRMHPGLRKIEEWIQQGRLGKIYSVQV
DLGEYLPDWHPWEDYRQSYAARADQGGGVILTLIHELDYLHWLFGKPRSV
FAIGGHRTSLEVTAEDTALISFETEQGICVQLRMDYWRKPPVRHMNIVAE
KAIVDWDYPARLTTLQQNGHLLEEVILAPSWDRNELFLSMMKEFIEGIPG
GSIPRVTLQEGIDVLNTALAAKQSLQTGRQVRL
>NE0479 PIN (PilT N terminus) domain
MLKYMLDTNIAIYVIKRRPIEVLVTFNRYADMMCVSAVTEAELLHGAEKS
RQREHNLRQVADFLSRLEVLSYTSKAAGHYGDIRADLERKEKPIGVNDLH
IAAHARSEGFILVSNNLREFERVDGLRLENWIT
>NE2154 possible (AF124349) unknown [Zymomonas mobilis]
MHPSRYNRTGTSLFLIFSLLAATSASAEETGLINTLTGYNINEITPAPSI
KLKFRGWVEAGFTGNPGDPHNRSNFPVAFNDGANQFNLHQVYAYIEKEID
PGRNSWDIGMRADLLYGTDAKFVKTSSFDSTILGDNPKHQLVFPQLYVNL
YAPIGNGVSMSIGHFYTIIGYESPMSPNNFFFSHAYTMRYAEPFTHMGIM
LSYPVNDNLTIKSGVVTRWDAFSRHSPDYLGGLNYITDDRKTMLSASLIT
GDVKTGPLNHDHNRTMYSIELERSITDKLHYVVQHDFGIEAGTPNSSSAT
WYGINQYLLYDISNQLGAGLRFEWFHDQNGTRVMGDGNDEDFIGVTAGLN
YKPIAGITLRSEVRYDLAVHHDIFRDGTDNDQILLSGSAILHF
>NE0260 putative antirestriction protein
MCTEIRIYVADLAAYNNGKLHGVWINATDDLEAIREQVNQMLTDSPEDFA
EEYAIHDHEGFGGFILSEYAGLETAHEVACFITEYPDFGSELLDHLSGDL
EEARTAAEENYCGCYQSLADFAEELTEDTTQIPVNLVYYIDYERMARDME
LNGDVFTLETGWEEVHIFWNH
>NE1364 Appr-1-p processing enzyme family
MIEYTSGDILRCEADALVNTVNCVGVMGRGIALQFKNMYPANFKAYEAAC
KREEVQPGRMFVFETKQLTPPRLIINFPTKRHWRGKSRIEDIEAGLVDLV
NVIRDKNIRSIAIPPLGAGLGGLDWKEVRPRIEHALGELEGVQVIVYEPN
GAPASDKMAHVREVPKMTSGRAALVELMQRYLSGLLDPFVTLLEVHKLMY
FMQEAGEPLRLDYIKHHYGPYAKNLRHVLNAIEGHLIAGYADGGDAPDKP
LSLVPGAVAEAKSFLDQHEISRARFERVTRLVEGFESPYGLELLATVHWV
IHREGATQSDSVKRQIYQWNDRKRQFTQRQLVIAEERLRSQGWLSPETTF
TY
>NE1581 PIN (PilT N terminus) domain
MILLDTNVLSEFMRLQPATQVVVWLDRQAPNEIWTNAVSRAEIELGLALM
PESKRQKSLSQAARTMFDEDFAGRCLPFDEIAASYYGRIVSTRTRMGRPI
SVEDAQIAAIALAYRMFLSTRNTVDFEDIAGLNVINPWETEA
>NE0089 Domain of unknown function DUF20
MNDKYSPDSRLFWYLTVIGVVSALIYLLSPILTPFLLAAVIAYICNPLVT
WLEARKIPRTLSTIFVMLMTMGIFIAMALILFPLFEKEVSRLVERIPSFL
DLVKSQFIPWLEDNFNVELQIDIASLKQMLTEHWKSAGGVAAQMLPSLKS
GGLILLTFLMNLVLVPVVLFYLLRDWNNLIRQVGELIPPVWQKQIFTLAR
ETDDVLAEFMRGETAVITIMSIYYVTGLWLVKLEFALPIGLISGILVFVP
YLGTITGLALATFAAITQFQEWSGVIAVWVVVGSGQLLESMLITPRLVGE
RIGLHPVAVIFALLAFGQLFGFIGILLALPVSAVLLVLLRHLHTQYMETM
RE
>NE0490 Helix-turn-helix motif
MNNKLTPVSPGEMLAEEFLIPLGMSNYRLAKEIGVSAQRIGEIVTGKRAI
TVDTDLRLCRFFGLSDGWWLRLQVDYDIEMARGALEETLAKIRPWANTQE
HGTPA
>NE0970 Insulinase family (Peptidase family M16)
MRFLQFLIMFWVGLYAQWALAFLPIQHWQTANGAQVYFVENHDLPILDLS
IEFPAGSSTDTAETSGRAGLVQRLMSMGAGDLSEDRIAETLADVGARLGG
TFDLDRAGLSLRTLSHQQERVRALDVLAQIVQRPEFLEKILERERARIIA
ALKEADTKPEVIADRTLMKLLYGKHPYGLRESGEPDALAALRRQDLVDFY
RAHYTAGNAIIAMIGDIKRDEAARIAEMLTRNLPTGKTYKTLPPVEKPVP
IIQKIAHPATQSHIQIAYPGLSRKDPDYFPLLVGNYILGGGGFVSRLMNE
IRETRGLAYSVYSTFAPYQEKGPFEIGLQTKKEQAEQALQLTQKTLRDFV
EQGPTEEELQAARQNIVGGFPLRIDSNQKILGYLGVIGFYDLPLTYLEDY
VKAVEKVTVAQIRDAFKRRIDPAGMVTVVVGAAD
>NE0702 conserved hypothetical protein
MAIRQYEVLFTRGAEQDLELIYDYIVESDCKANADSVLDRLLEVVENLAT
FPSRGTWPKELVAVGIREYRQAIFKPYRVIYRVIEQKVYIYLIADGRRDM
QSLLMHRLLGK
>NE0044 hypothetical protein
MLYSTETCQHSSSESLLKKTLVVLLGSILATYLLGTTLPWPITLLHAVWL
QGLIAALASYYLLHMPVWWAAIHLLFFPALLSATLVLNLPAYWYLAGFIT
LLVFGRIHRTRVPLFLSSGEAVDALARLLPQDRQFKLIDLGSGCGGLVCK
LARMLPHGSYHGIETAVLPCWISKLRALLSRQDCQFKWESIWQHDLSGYD
VVYAYLSPVPMPRLWEKARREMRPGSLFISNTFTVPGIKPDRCIRLDDFS
STVLYIWRIA
>NE2571 Metallo-beta-lactamase superfamily
MQPNIQAFFDPVTWTVSYVVFDKPGGHCAIIDPVLDYDPKSGRTKHHSAD
VLIKFVHSKELTVDWILETHAHADHLSSAHYLQQELGGKVAIGSRISGVQ
QVFKKLFNMGPDFQPDGSQFDHLFDDGDTFEVGELKGRAIFVPGHTPADM
AYQFGDAIFIGDTMFMPDVGTARADFPGGDARQLYRSIRKLLDQYPPETR
LFMCHDYPPGDRPIQWESTVAEQRAHNIHVHDGINEDGFVAMRTARDATL
EMPVLILPSVQVNVRAGQMPPAEDNGRVYLKIPINVL
>NE1123 Penicillin amidase
MLRKSLFFLSALVMVILLTAWSLLKGSLPVYEGEQSVPGLTDIVTVERDA
LGTVTLNGGNRLDLARGLGFIHAQERFLEMDLMRRKAAGELAELFGTVAL
PADRKARVHRMRARAQTMLKILPQDQLRLLEAYRDGVNTGLDSLRIRQFG
YLLTQTMPRAWQSEDSLLIVLAMYMTLQGNNFDRELGLSMMHASLPESAY
RFLTASGGEWDAPLDGSYFEWPPYPSATDFDLRSTSKQALADNGFQESPS
VGSNGFAVGGPLTSGSALVANDMHLTLRVPGLWFRTRLIYPDARHANQKI
DVIGVSLPGTPAIVAGSNRHIAWGFTNSYGDFADWVRVNPDPENPARYIS
QGEWKSVKIWRETLHVRGAPDETLEIRETEWGPILAQDFDGTPLALVWTA
HQPGAVNFDIVELEQADNLEKAAAIAWNMGIPAQNFIAGSKNGDIAWTIA
GRIPQRTGNYDPGLPADWSKQSTGWNGWLAPADYPLVINPPDMRLWNGNS
RMIDGALLSKLGDGGYELGARSRQIRDELYAHDHFSPSDLLAIQLDDRAL
LLARWKQLLDEILQKTPSTVWRNEMQQVLLDWNGHASVQSVAYRIVRSFR
LEVMKQVLSGLTAKVKSDYPEFEIPRLSQAEHAVWKLIEQRPLHLLPADD
SDWDSMLAACARRIAEQMQAQPGGIVARNWGEENTADIRHPFSRALPSWI
AAWLNMPADQLPGDHHMPRVQAPDFGASLRFVVAPGEEEQGYFEMPGGQS
GHPLSPYYGSGHSDWVAGRRVSFMPGAAQQILYLHPAEFRADH
>NE1684 hypothetical protein
MGNLFSIGFLKNLVVAGYVVKGGTKTVRVRGYPIRLTVDHWRVLRRIRTY
SIKEPDTLDWLDNIEPGSCYFDIGANIGQYSLYPAIKLGHDIRIFAFEPQ
SNNYYALNKNIYLNDLKDLITAYCVAIGGTNGFDKLYVPKFIPGGTVRNS
DRNP
>NE2177 hypothetical protein
MDEPEIQNNKYRLIQILTISCLVIFLLFIWQGNKGFNLWDEGYLWYGVQR
VLLGEIPIRDFMSYDPGRYYWVAALLSVAGDNGIMSVRIAVAVFQCLGLF
VGLLLIAQSTKSRDKADILFWIISAAILVLWMFPRHKLFDISISIFLIGI
LTYLVSNPIPKRYLIAGICVGLIAVFGRNHGVYGAVGSLGVIAWLNIRNR
SDTGFLKGFVLWSVGVTIGFLPIIFMALLIPGFAVAFWESVRFLFEQKAT
NLPLPVPWPWTINFAASSIGDAARGVLIGIFFIGTLIFGGLSVIWVVYRG
LKEKPLPPVLVASAFLALPYAHYAFSRADVGHLAQGIFPLLIGILAIASS
ASSKTKWVLAAGLFMTSFWVMHVFHPGWQCLASKQCVNVDISGKYLQVDP
NTASDIALLHQLTDQFAPDGRSFIATPFWPGAYALLERRSPMWEIYALFP
RTGAFEKKEIERIKASDPGFAFIFDLPLDGRDELRFKNTHPLIYQFILNN
FELVPNSHNPAYQIYKTRNAGQ
>NE0850 Phospholipase/Carboxylesterase
MPDNSFQLSAIDITTGSNPEYTILWMHGLGADGNDFVPVVQALDLPEIPI
RFLFPHAPQQPVTINSGYIMRAWYDIQHTDFVEQEDETGIRRSQHAIVEL
IEREDRRGIPPDHLILAGFSQGAAMALHTGLRHPDRLAGIIALSGYLPLA
HKIEREAHITNRITPIFMAHGNDDPIVPIELAHASLQQLREYYYPVTWHE
YPMEHTVCDQELVDISRWLKTILK
>NE1459 putative transmembrane protein
MPSILTYLSASLLYGIAGWYFWRAMRTDTSAAGAVPNVRLQQYVMLLPLL
VHGLVLYHAVFMDNVLSFGVGNAISAIVWLTAVIYWVSGFFSSLRGLQNL
IAPLSAIAAVAVLIPLLLPSIHPLAHAGMTAFKAHLLAAMLAYSLFTIAA
LHAVLMTLLERRLHHSEVSPIFSQLPPLLVMEKMLFRLVWVGFILLSLTL
LSGIVFSEELFGQSVPFTHKSLFGFISWGIFAALLAGRHLYGWRGRTAIR
WMLAGFVALVLSYIGSKFVLEIILNR
>NE1444 TPR repeat
MSGSGKTITQMKSTAILAPVIAVILILSACGKPMDAQALVAEAKQYQQQG
NDKAAIIQLKNALQQSPNDPEIRYLLGTLYNREGDIQSAEKELNKALDLG
MDPVKVLPGLSRAWLGMGKFQQVLDETGKLSDKGNFAELLALRGNASLAL
GKFEEAKVLFEQALQDKPGFSDALTGLARYSLARNDIESAMNFSEEAVKL
NPENSDAVLFRGDLLRAQNKIDEALADYDKAIKLNPESEAAYINRATISI
STKKFEAAQADLDAVRKIAPGSLLAAYTQALLDFSQGKHAVALETLQRIL
SSAPGHLPSVLLAGATQFALGSFPQAGQYVEQYLKAIPNNLYAIKLMASI
QLKNNQVKQAITTLTPALKSVQQDPQLFALAGEAYMRSKDFTKASEYFEK
AGELAPDNASLYTALAMSKMGQGDSKSAIADLEQAAQLDDQSGRAGVMLV
LTHLQLREFDKALKAVESQLAEQPDNPLLHNLKGGIYLGKKDLAKARSSF
NQALSIQSDYFPAISNLARIDMQENHPEAAQQRFEDVLKRDKKNVQAMNA
LAGIALARGNKEEATGWLEKASRENPDELQPALQLGAHYLAVNDPGKSLA
LAKKLQGIHPDNLSIVELLARSYLATGDKDAALENFQKLAARLPDSAPAQ
LQLAQIYSSMQNNKAAAGSLKKALTIKPDLWEAKLMQAQLAVAADRVEDA
FNISHDLQKQHEKLPVGFELEGDLQMRQKNAAAAATAYEKALSRQKNSQL
LIKLHTALSQSGKEKQADQRLNQWLKENPTDAVTRTYLAGVYLASKKYDP
AIKEYQTILKQHPDHAATLNNLAWVYQQKKDPVALEYAEKAYKQAPDSPA
ILDTLGWILIEKGDAERGTSLLQKAVTAVPEAAEIRYHYAVGLFKSGNKA
EARQELEKLLGGDKPFPQRDDAKKLLESL
>NE1956 putative lipoprotein
MTKFSAVFLAVLLTGCTLTPKTLAPVSIYDLGPATSVTVTDSSRLSQAII
QVMDVTAPVWLDTQSIHYRLAYHDPARIYAYAGSRWAAPPAKLLTERFRQ
YFASHAIDSQKDDKNKESHVPAHYLLKIELGEFTQIFHAQNDSRIIIRLR
ASLYEPNTRLPVAQRSFTGERPAQTADAAGAVAAFILVSDNLLDELVQWL
FSIHS
>NE2567 conserved hypothetical protein
MSVASETNWKKLDEKIAISGQISVDDVAAIAAAGYKSIICNRPDGEGGEH
QPGSTELEEAAKAAGLQFAYLPVEIGQVSDEKCSAFHQLMATLPGPVLAF
CNSGNRARALYSRDVGTTTTPAETISAACDWEHEAAAVTEAESEAAGAAA
VSAASSRDEAAGKSIPVTPACNWDNAFDIVVVGGGSAGLGVTASLLRRRS
SLRIAIIEPNDKHYYQPAWTLVGGGAYAVDQTVRNTADVIPHGAEWIKAE
VSGFSPNDNLVHLADGRTIGYQQLIVCPGIRLAWEKIEGIQETLGKNGVT
SNYLFDLAPYTWSLTQQLKGGKALFTQPPMPIKCAGAPQKAMYLSCDYWQ
RQGVLDKIEVEFDSAGAVLFGVADFVPPLMEYVRKYHANLVFNSNLVKVN
GPEKIATFEIKNEAGEVTRVDKPFDMLHVTPPQAAPDFLRDSPLADASGY
CEVNPKTLQHTRFANIFSLGDACSSPNTKTAAAVRKQIVIVAENLLAAKD
GREFHAVYDGYGACPLTVENGKVVLAEFGFGGKLLPTFPLNPAVARKLYW
WFKVKLFPWLYWEGMLKGREWLTRSTETK
>NE1215 Short-chain dehydrogenase/reductase (SDR) superfamily
MKKSILITGCSSGIGYYTAHGLHARGYRVFATARRQESVEMLLAEGLESF
RLDLNDSDSIRWAVEETLRRSGGELYALFNNGGYGQPGAVEDLSREALRA
QFETNLFGWVELTNLILPAMRRQGYGRIIQNSSVLGFTAMPFRGAYNASK
YAIEGWSDTLRLELRGSRIFVSLIEPGPIITQFRANAMKAFERYIDVERS
VHREKYLAIHNRLNKPGPAVPFTLPPEAVLKKVIYALEADTPKARYYVTF
PTHLFGFLKRILPVSVLDKILAKAGNDHQ
>NE1888 Domain of unknown function DUF81
MEAWPVYLLTGSAVGFFAGLLGIGGGLLMVPILASVFMSLGFPADHILHI
ALGTTTAIITLTAISSLRAHHAHGAVNWWIVRYITPGIIAGALAGSTLAG
QLSSRILGIIFVLFIYFAATQMWLNLKPGTGHVLPGKAGMFAAGSVIGAL
SSLVAIGGGLLTVPFLTACQIRLHHAIGTAAAVGFPVALASAAGYAINGL
LLTQPLPDYALGYIYLPALITVGLASTVTAPLGARAAHVLPAALLRKIFA
GLLYLLGTKLLLDLWN
>NE2352 Mov34 family
MLTIHTKLISAMITQSLKDHPIETCGIIAGLAGSNLPLRLIPMRNVAQSE
NFFMFDPQQQLQVWKEMSARHEEPVVIYHSHTGSEAYPSRSDVELAAEPQ
AHYVIIPTCSPHKEEIRSFRIVDQMVIEERVQIVRQYQPELEFQMMVA
>NE1572 Uroporphyrin-III C/tetrapyrrole (Corrin/Porphyrin) methyltransferase
MPPDATLHGTLYLIPTPLGEGDLARILPAEVRQQVSLLERFIVEHPKTAR
HFLKQINPLRAIQTLKLEVLDEHTPAGEVEALLAPLLAGEDVGLLSEAGC
PAIADPGGALVRMAHQKKIRVVPFVGPSSILLALMASGLNGQRFHFHGYL
PVASDIRNKEIARLEQTSITADETQIFIETPYRNQKLLEALVQQCHTETD
LCVACNLTQADEYVSTKSIGEWRAGNWPDLQKKPTVFLLHGQKQSRKF
>NE0153 GTP-binding protein (HSR1-related):AAA ATPase superfamily
MKPTLVLVGRPNVGKSTLFNRLTRSRDAIVADIPGLTRDRHYGHGRLGLK
PYLVVDTGGFEPVVKSGILHAMAKQTLQAVDEADIVLFIVDGRQGLAAQD
KIIAEQLRKTGQKIILVVNKTEGMPYSSVTAEFHELGLGTPCAVSALHGD
HLGELIDFALEGYPYEEETAAEPGQEKCPVIAIAGRPNVGKSTLINTLLG
EERVIAFDQPGTTRDSIYVDFEYGQRSYTLIDTAGLRRSGKVWETVEKFS
VVKTLQSIEAANVVILVLDAHHEISDQDAHIAGFILETGRSLVVAINKWD
GLDDYQREIIKREFERKLGFLSFANLHYISALYGNGVKGLMPSVDAAYAA
ARAHIPTPKLTRAMLAAVAKQQPPRGGMSRPKLRYAHQGGENPPLIIVHG
SMLEHVPQTYRRYLENTFREVFKLKGTPLRVEFRTGHNPYAGKKTPLTEE
EARRAHSRRRRNRKKYG
>NE2161 Esterase/lipase/thioesterase family active site
MNVQEQAVRFCCHHDWLYGVLHLPQQPVTRGVLIVVGGPQYRVGSHRQFV
LLARYLAERGIAVMRFDFRGMGDSDGEIRTFEHVGEDLRSAADFFFSECP
FLEDIVIWGLCDAASAALFHAHQDSRVSGLVLLNPWVRTEQGIAKAYLKH
YYLERLFDPEFWKKLLGGKFNPLASIRSLYEFGRNSLRGGKSPAVSEKSA
GSACDLTVPLPERMLDGLKRFQGKILIITSGNDLTAREFLDLVDSSADWQ
ATLRTKQTELCHMESANHTFSTREWRDQVTELTANRVLSW
>NE2093 conserved hypothetical protein
MKNPNTVISSDYGPIIINLNDNAIGRQISQYGYWATDDINIINTLVNVQL
DKFGQIMFYDVGANIGTHSLAIAKTHPDTVAIRAFEAQRQVFNMLCGTMA
INGLSNVHCHHNAISEKIGDFIDIPIPDYNSANNFGSLELIPPKNSDNQG
IIHSGKMESVKTLSIDSFNEKVDFIKMDIEGMEDKALLGAINTIEHHRPI
LFLEILKTDVNFVMTFLRERGYLGFQKSFDLIAIPIEYQLQVNGTNRVF
>NE2338 hypothetical protein
MTALTTDRRAQTRWSGRLLPSLGILLVTGGLVLLTWYTWLVLTPDTAPYR
YQQVTTGNASEYPELELDTWPDLTISQYDIHVEGTEQPVAQAWFGQRANQ
PQVLLNWKNQTREPLLALDQKASELSALAAAIDKHASRDALLLGWWDTSR
QLALLTGRDVLFHTPLHEPLIIPPEWQPHEQAIRAYENQQAGTPADPQEQ
ELFMRFAQSLVNPPANGLDDLRQLAGTRDTYLIVHVSDLYKLGLMYPDKF
GIAYKHYRMTGNLHGMISHLKTEMRTRGYYTYTLQSLSDELIRAFFLIDE
ASYDTLLAKLLPFTSQPSPVERTSPRLIYQQGGYWVYHLTAKAPAHNTLQ
SGKDSNETTDSTVSVDQVQ
>NE1591 conserved hypothetical protein
MTCEVRLRPEAEQDLADAAAWYEEQRQGLGHKFLDEVTTTLSNIAETPLA
YPNVHRGTRRAVIRRFPFGIYFQVKKATIIVVAVMHGSRNPHQWKSRT
>NE2569 probable transmembrane protein
MLIPFTALLSGLVFGLGLILSGMTDPAKVLSFLDVAGLWDPSLMFVMLGA
ISIGFFAFRAAKRRGRTLLSTPVHLPGTRTVDLRLILGSLLFGMGWGLVG
ICPGPGLVLAASGHTGGIVFMVAMLLGMFIFDRLEKHRQSDSNVNYRQPG
NIERK
>NE1273 HI0933-like protein
MTHSFDVIIIGAGAAGLMCAIEAGKRERKVWLIDHSVKIAEKIRISGGGR
CNFTNLHTQPQCYLSQNPHFCKSALRRYTPQDFIALVRKHGITFHEKKLG
QLFCDDSAKQIIAMLLAEAREAGVKLENPVQVSTIETIAGGYRLNTSQGE
RTCTSLVIATGGLSIPKIGATPFGYQVAKQFGLNIIPPRPALVPLVFDGA
LLACCQALSGLSVEADVRFGKQVFAEGLLFTHRGLSGPSILQISSYWEEG
EPIVINLAPGTDVLAFLKEQKQSQPKLHINNALAEILPRRLAQSICDEHN
GSGPLASLSDTRLAALASAVNAWHVKPVGTEGYRTAEVTRGGVDTRELSS
QTMEANKQPRLFFIGEVVDVTGHLGGFNFQWAWSSGYVAGQYV
>NE1316 NUDIX hydrolase
MIDRNGYRANVGIILLNSQNQVFWGKRARQDSWQFPQGGIKSGETPTEAM
YRELAEETGLQPVHVEILGRTREWLRYDVPACWTRRDWRKNYRGQKQIWF
LLRLLGRDSDVSLETCAHPEFDAWRWNQYWVELESVVEFKRQVYRQALTE
LSRLLDHEAGLGNDRAYREPLEPVEKNRKKSSDTRQS
>NE0261 putative plasmid stabilization element ParE
MGSFILRQKAMDDLLSIGRYTRKEWGKTQQIRYLTQLDRAFHELADKPGL
GRACDDIREGYFKYGVGKHVIFYRHTGKDQIEIIRILHGRMDIEQHL
>NE2464 Integral membrane protein, DUF6
MTKQKKLLPVASLLLGAAIWGVAWYPYRLLEQAGMRGELSTTLAYSIALL
IGLILFRRQLRISEILNPAAGILFWISLSAGWTGIAYVLGIIHGEVMRVL
LLFYLAPLWTILFSRILLQERLSRQGYAIILLSLAGALLLLWQPGSKLPL
LASYGDWMGLSGGLAFALTNVLIRKDQQHGIQLKLLAVLSGTALTGFAAT
LLMESISDITHLHTHAWLILAGIGGLVFFLCILLQYGMTHIPANQAIVIM
LFELVVAAIAAHFLTNEYLTGRDWAGGLMIASASLFSARVNRD
>NE0123 conserved hypothetical protein
MSLLQSSCRVAFAALLHDLGKFHERTGQPVNGDLAALTTLYRYSHAAHTG
GMWDVVEKYAPDLLRGDVAPFSGRTSGADITDSMANAAAAHHKPGTLLQW
IIATADRAASGFERTKFDEYNADAEGETPQHKNRFQARMISLFEQIKINA
QAPVGQFKHAYPLRALGPEAIFPDKRAVIEPNENKTAQAEYAALWEQFLQ
ALESIPKSHRSNWPLWLDHFDSAWLTFTQAIPSATNRGVVPDVSLYDHSK
AVAALAAALWRWHEETGNTGADALAKLSDSERPDWDEQKLLLIQGDFFGI
QNFIFAQGAQTQKHAHKLLRGRSFQVALLAEMAALKLLETLQLPSTSQII
NAAGKFLIVAPNTPSAREAVETCRRSFDQWCLQHTYGEISVGIASTSASC
NDLRSDRFRTLTQKLFGALDVAKHRRFDLCGNTAAVRDVSFADGPCDYHG
RYPADCAAEGDKSASCALSRDQIVIGEALTKHARLLVLNTADSFKKPLDL
DYFGYRLIFVNEADASGHYGKLAEQGELVRAWDFDLPDKNGTCFHGYARR
FVNSYVPIWDENEKKDDPAYKRLSQEDLGDTSPGKLKTLHSIAAGSNNEI
ALVTLKGDIDNLGALFQSGLAEPTFARWASLSRQVNAFFALWLPWYCAHG
ENRRFRNTYTVFAGGDDFFLIGPWESTLELAGAMRKAFARYVVRDDITFS
LGAVMTQPKIPARQLAVAAESGLNAAKQHYGKNAVSLWGVTVGWAEWRTL
MKERRDALERLISEAGGLSTGFIYNLLLLSDQAERDDPKRDDRRPEDALW
RSRLAYRCARLPKNQQMVGKALARECGEALKQYRGAYRLPVSVLLYRQRQ
>NE1065 putative plasmid stability protein
MATMTIRNIDEQLKARLRVRAAMHGRSMEDEVQDILRTALSAEPVQTVSL
VEVIRSRIEPLGGIELNLPEREAIRDPLEPGA
>NE0797 conserved hypothetical protein
MENKEKNTSELLQLIVVQNTNAVISVDEIKNSLHERGFAVLLAIATLPIC
LPVPAPPGYTTVFAIPLFIFSIQMICGMKAPWIPEWLTKKTIKRGTLDKL
ITKAAPWLRKIESHMHPRLTYISVHAWERIIGLFSFIFSISIALPVPLIN
FLPGLGILIMSLGLLSKDGLTIIAGMIVGTTGVGIALIVVALLWMGVPIP
FVQSTD
>NE1327 Carbon-nitrogen hydrolase
MSTDVGIDQAARSKAGDNTRVRVAAVQMASGPSVAANLEEAFRLIEEAAA
KQAKLVVLPEYFCIMGMKDTDKLAVRENPGEGEIQNFLSETAKRFGIWLA
GGSVPLISPVSDKVYNSCLVYDEHGQQVARYDKIHLFGLSLGNENFAEER
TIDAGNRVVALDSPFGRMGLSICYDLRFPELYRMMGKVDVILAPAAFTAI
TGKAHWETLIRARAIENQAYLIAPAQGGFHVNGRETNGDSMIVDPWGVII
DRLPRGPGVVVAEIDRAYQSSVRASLPALEHRCLLAC
>NE0604 possible ubiE; ubiquinone/menaquinone biosynthesis methyltransferase
MAGYVGDVAIAYDRDLGHVLFEQYASDIARRTAGKPVRDVLEVASGTGIV
TRQLRNVLPGDAQLTAIDISDSMMEVARTKFLPHEQVTFQVANAVALPFD
DRAFDTVVCQFGVMFFDKDKAFQETHRVLRQGGRYLFSVWDSRDYNPYAS
LTFEVMKQFFPSDPPRFLESTVSSFEIDPVKERLIRAGFEQISISVQRRI
YDIPDIRAFARGLIFSPIINEIRERGEVDPDDIVEALVKIFIGEYGSNPT
RFPMQAILFETEKP
>NE0524 CBS domain
MTKVRDLMTPMPKTIGFDISVEKALVMMKECACHHLPVLDGGKLVGVLSD
RDLSMAWHGSGNTKDEHLVRDLMTDTPVVIDPSAEINMAIRIMLDNKINS
LIVRAEENQPWGILTSTDLLRYVMNKA
>NE0474 conserved hypothetical protein
MMTLFQRPFVAQLAHRLDGMQPLIQVLTGPRQVGKTTGVRQLMAQCSYPQ
HYANADDVLVSDRSWLLEQWQQALLLGEGALLVVDEIQKVVNWPETIKAL
WDAQPGRLRVLLLGSSALQIQSGVTESLAGRFELLRVHHWTFAELHAAFG
YDLPRYLAFGGYPGAVVLEYDPDRWYAYMKDAIVEAVIGKDILQSRKVAN
PALFRQAFEILCAYPAQEISYTKLLGQLQDKGNTDLVKYYIELYGGAFLL
HALQKYSPKTWLARSSSPKMLPACPALYSMVAGVDVMRSTEQRGRAFELV
VGAELMQLPGQVFYWRERNDEVDFVYQYRERLYAIEVKSGRKKSARGLDA
FCAQEPKALRVIVTPENFAQFSAEPRDFLQQVAI
>NE1291 GTP1/OBG family
MKFIDEVKIQISAGDGGNGVASFRREKFIPRGGPDGGDGGHGGSIYALAD
HNLNTLIDYRFTPVFRAKRGENGRGSDCYGKGAEDIVLRMPVGTIITNDL
TGELVADLEHDQQKVLLAKGGRGGLGNLHFKSSTNRAPRQFTHGEAGEQF
ELRLELRVLADVGLLGLPNAGKSTLIRAVSAARPKVADYPFTTLYPNLGV
VRVDAGHSFVMADIPGLIEGAAEGAGLGHRFLKHLGRTRLLLHVIDVAPF
DENVDIVHSARALVDELRKFDETLYRKPRWLVFNKVDMLPEDEQQAVCTR
LLQAMNWQERWFAISALTGRGCQALIYAIMGHLQQLQSDSEET
>NE1353 conserved hypothetical protein
MSFHVRFTLEAKADIERLYRFLAEHDFDVAERTLETIDSAWSLLEQFPFS
CRKIDDANPFLREFIISFGNSGYVVLFEIEDSNTVTVLAVRHQLEDDYY
>NE0467 Domain of unknown function DUF81
MIDWSFTLAGALTGFVIGLTGVGGGALMTPILLLVFGVQPVTAVATDLWF
AAITKIAGARVHHTNGNVDWQVVKRLWSGSLPMALLVVLLVSMGTHITKV
DWLTKGIGIVVLITAIGLLTAPRMVALARKKRTGQPERFETMQSILTVIA
GAILGVCVALTSVGAGVLGSVMLLYLYPLRMTPHRLIATDIVHAIPLAVV
AGLGYLFAGKVDWWMLISLLLGSVPTVVAGSMLASKITGRWIQIALACVL
AAAGLKVLI
>NE1542 conserved hypothetical protein
MKFFQRTVLEPAYQGAQLHPGKLSGIYQNEILCNSLFHHYKQRTLKLGRE
LESRLKHLYHLVRMALPGTVSANTTRRSPANSIHTEVQIDSRTDTHLPQV
SDDEDLRWIRTGLPHQYHAYLAIDRLKSSHHLAGQSVLCIGGRAALYPNY
HQLIEAAGGHFMVFRGGAQDNSECLLALLARVDSIICPVDCINHEDFFTV
RRYCQRTGKNCVMLERSDLVTFGKAVETLARGDCHNSETDFLNRSAA
>NE2112 PIN (PilT N terminus) domain
MYLLDTNVVSELRKPRPHGAVLAWINSVDDASLHLATVTLGEIQAGIELT
REQDPAKAAEIESWLDLVSDSYNVLVMDGPAFRCWAKLTHKKSNTLIEDA
MIASIAKIHGLTVVTRNVSDFSSFGVRIFNPFEFNANA
>NE0789 conserved hypothetical protein
MKLPLTSFAVFLITQTVWAGEVFALDMIGVAGGANFNIQGKSFAERRFST
VYRQQFDFSCGSAALASLLTFHYDDSVDEQSVFVDMFQHGDQEKIRREGF
SMLDMKRYLERRGYGSDGFKINLDQLYSTGSPAITIINHNGYMHFVIIKG
VDEDRVLVGDPAQGVKSMDRTEFERMWGNRIVFLIHNHVDPEVSYRKIHQ
EWSGRLAPLGEAVDRTSLGVFNVLRPGPWDF
>NE2226 SLT domain
MRISFKWQWAVLSFLVLHTAVAHSGRDIQAVRYAGVKGGGDILVAQARQY
EHGEGVLQDREKAVELYCQAARQGSAEGQYALGWMYANGRGVERNDGIAA
RLFEMAAARKHADAQKLLRFMPLPDKRKVQLPNCLSRNVYVRTNITPNKY
YVDQSISALVEKLAPQYEIDPGLVLAVIAVESGFNTQAVSPKNAQGLMQL
IPATAERFQVRDVFDPEENIRGGMAYLRWLLAFFKGDVALVAAAYNAGEG
AVEKYRGIPPYPETVKYVDKIMSRYNKTSHPYQPGVVNRTSFIFASAASG
Q
>NE0915 conserved hypothetical protein
MIRCINLLSLVLLLALANGALAEIAIPLLKSHVTDLTETLSSMEISRLEQ
QLTDFEAKKGSQIALLIIPTTQPETIEQYSIQVAEVWKLGRKGIDDGVLL
LVAKNDRTLRIETGYGLEGVLPDALARRIIDEIIVPKFRQGHFFGGLQAG
VEQIISIIEGETLPESEPAGGASLAVENIIPFLFIALVLGRTLQSMFGRM
AGATITGSIAGALTWLISSSIAVALLIAIAIFVISLFEQTGRIIHRGGPG
YRNWPGGGFSGGGFRGGGGGFGGGGASGRW
>NE0802 putative membrane protein
MKICHRYIAWQVLTGMVIATAILLPLFSFFDLLDQLDDVGKGTYTTWDAF
LYTVMLMPRRFIQIAPFIALLGTVGALGALAVNLELVAMRVAGLSPLMIG
LAPVGIGGLLIASTIALEYFVAPQFQQQATILRAVALEQGAELGKGLGIW
TRNERNILRIGEMLHKGRATDIEVIHFDGEGSMTAHVHAWYADIFDESLW
KLHDVTIRTFSPDRITSRKAHILQWHSFLGPDDIATLTKSPESLTPIELV
KHAEFLRATGQKADAYVMALWRKVGGAIMTIAMILVAIPFIFGSVREGLG
GKLILAALMGISIYLFDQIIANIGLVFQLNPIVVSVVPGMVLIAVAAHWL
LRRTF
>NE1806 ATPase components of ABC transporters with duplicated ATPase domains
MAQYVLIMNRVGKIVPPKRVILKDISLSFFPGAKIGLLGLNGSGKSTLLK
IMAGVDKDFEGECTPMPDLKIGYLPQEPQLDPKLTVRETVQEGLGDVFNA
QQQLEAVYAAYAEPDADFEVLAAEQSRLEAILTTQNGDNLSQQMEIAADA
LRLPEWDAHIEHLSGGEKRRVALCRLLLSKPDMLLLDEPTNHLDAESVEW
LEQFLARFPGTVVAVTHDRYFLDNAAEWILELDRGHGIPWKGNYSSWLEQ
KETRLKQEESAESARQKSLKQELEWVRQNPKGRQAKSKARLARFEELSSQ
EYQKRNETQEIFIPVADRLGNEVIEFINVSKGFGDRLLIDNLNFRIPPGA
IVGIIGPNGAGKSTLFRMITGKEQPDTGEIKIGETVKIAHVDQSRDALSD
SQTVFQAISGGNDMLIVGKYEVPARAYLGRFNFKGPDQQKITGTLSGGER
GRLHLAKTLIAGGNVLLLDEPSNDLDVETLRALENALLEFAGCVLVISHD
RWFLDRIATHILAFEGNSQVTFFTGNYQEYEADKRQRLGEEAAKPKRIRY
KPITR
>NE1492 conserved hypothetical protein
MKIQFNRFLTRSLLTAALTGAASMAAAATMEVYKSPTCGCCAKWVDHMRD
NGFTVNIHDIGNDEARAEAGILPELGSCHTALVNGYAIEGHVPADDIKQL
LKERPRAVGLSAPGMPHGSPGMETGRVDSYNVLLIRKPGDKRSATEIYNR
YGPGKSGAAEKTSENSTTDSVLRLK
>NE0525 CBS domain
MSTKLELPIQEFTTPYPVTAREDSSIEELLDLIKNLKVRHIPIMSDGKVT
GIVSERDLKIISALSTREKFLVRAADLMTPDPIIFRGSTSIEDVILKMSE
KKIGSVLVSDEQGNLQGIFTVTDALDILVEILRGKK
>NE0176 conserved hypothetical protein
MSVEIFNGTMQKFGAPDTVICQFQYWSVLLRPAQLTLGALVLVAHEPVQS
FSALSSTSFAELQIVTGKIDTALKKAFQYDKLNYLMLMMVDPDVHFHVIP
RYAQAREFAGKTFLDAGWPGVPDFSRINETDKEMNQQIIEHLISCWECS
>NE1339 H-NS histone family
MKNLTYIEIQEEIKKLQKQANEIRAKEIADIIADIKVKIQLYGITEKDLG
FGEKQKKTIFPPLYKKGNRTWSGRGRQPGWIKEHLEAGGNLEELLM
>NE0600 probable transmembrane protein
MGAQFFSSLADNALLVVAIALLIDLHAPAYLTPMLKFVFVLFYVLLAPLV
GAFADSMAKGRVMFISNSIKIVGCILLFFAAHQFSALGAYAVVGLGAAAY
SPAKYGILTELLPPEKLVIANGWMEGLTVASIVLGTVIGGLLITPSVAAV
LLSLDLPLIKTAVDASIIIIMLFYGIAAVTNLFIPDTGIDHRILKKNPIF
LFHDFVHCVKLLWFDKLGQISLAVTTLFWGAGATLQFIVLKWAEAALGYA
LNQAALLQGVVAVGIALGAVLAAKLVSLRRSLDVIPLGIIMGIVVILMIT
ARDLWISVVLLISIGGLAGFFVVPMNALLQHRGHILMGAGHSIAVQNFNE
NLGILTMLSLYALLIWFDVHIYTVIILFGLFVSITMIVVRKWHLNNQSKQ
DSLHLIGMQKRQF
>NE0834 DEAD/DEAH box helicase:HD domain
MRQNIGKLVSRKQPRRSEKVNAENNTTYLAHVRQLPNGRWIEHFLEEHLL
AVAVLAAEFASVFNSQDWARLSGLWHDIGKFREKFQKYIKSVSGYDAEAH
IEGAPGRVDHSTAGAIHAIEELGPPGRIIAYLIAGHHAGLPDWNGEPASL
FQRIEDGKQKGYRQEALQNAPTTGLFNQPCPTSSPPQDGSFALWIRMLFS
CLVDADFLDTEAFMDERRKDLRAGYPALNELLSAFDQYMNDKTANATDSP
VNRIRTEVLRQCCEKATLPPGLFSLTVPTGGGKTLSSTAFALNHAMHHGK
QRVIYVIPYTSILEQTAEIFRKIFGDENVIEHHSNLDPDKEDSRSRLATE
NWDAPIIVTTNVQFFESLFAARTSRCRKLHNIVNSVVVLDEAQLLPPEFL
APILHVMQDLSQNYKVSFVLSTATQPAFSPRPKFSGLRGVQELMDDPDGL
YADLKRVEAELPRDFNAPRTWESIAEELQQYDSVLCIVNSRTDCRALHAL
MPRDTIHLSALMCGQHRSEVIADIKQRLKDGIPTRVISTQLVEAGVDIDF
PVVYRALAGLDAVAQAAGRCNREGMLPGMGKVVVFVPPKPAVPGLLRKAQ
QSGQEIMRLTEGDPLTRERFEAYFRHYYASLNSLDEENIIGLLDMHNRVE
ARRAEFSFRTAADKFQLIKGSSQKTENKAR
>NE1305 proteic killer suppression protein
MIRHFKHKGLQLFFETGDKSGIRPDHASRLARQLRQLNDAVNPREMNIPG
WKLHPLSGDLSGYWSVMVNGNWRMIFVFDGEDVILVDYRDDH
>NE0682 SCO1/SenC
MRTFLATLFVAISGTLILWISTDGGRAFTAEEARRLEIRENPRSVPDWQL
QNQDAETFTFQNWHGHLIVVDFIYTSCPSVCLILSGNLKNLQKDFSDEGK
SDKLRFLSITFDPEKDTPQRLKEHLSHFSADFKNWVAARPTSPSQKEAIL
DFFKVIVIPDEYGGYTHSAGYHIINPDGKLVAIFGMEQMDELRAYLNQAL
EGKDNASEN
>NE1377 Protein of unknown function DUF132
MSKAVSRVVLDTNLVLSALVFQSSRLTPLRNLWQTGRIHPLISRETAAEL
IRALTYPKFKLTASEQEELLADYLPYCLTAIIPNPPPVTPPCRDQADIPF
LQLALAGKADVLVTGDKDLLVLAGMFDCRILAADVFLTEFAGC
>NE1225 conserved hypothetical protein
MATIVHDLSKYDGSAGFLVDTNIWIDCMDTDSRWHDWSVDQLQICSEQAP
LHINLMIYTELLIPGPDIDALDTMLDIYDTLRSPLPWSCAGLAAKAYLNY
RRRGGTRLVPLPDFYIGTHAAVANLSVLSRDVKPYHNYFRRLRCVGPDET
AEQHTDG
>NE2534 Glycoside hydrolase family 24
MSQIPQAAIALAKRFEGFHKVPKSDPLRRARPYICLAGYWTIGYGRLCKP
DHPPIDEEEGEAYLYQDLRKALAATLRYCPVLATEPESRLAAIVDFTFNL
GAGRLQTSTMRRRINQRDWLSAGQELRRWVHGGGKVLPGLVARREAEVLL
LVPG
>NE1289 Esterase/lipase/thioesterase family active site
MTDPFFLSDSGLLESYRAPKWLPGGNAQTIFPYFINLSPIISYRRERWEM
DDGDFIDIDWLDGESDKPLVIMLHGLEGSSQSHYALSLMNLLQMLRWRGA
VVHFRGCSGYSNRLPRAYHAGDSMEIDRMLRHIAHRNDSHEWNTPCYVVG
VSLGGNALLKWLGEQGAQAARQIAGVVAVSVPLDLAAAGKVLDSGFNRVY
THHFLTTLKRKALEKNRQFPGLLNARAVAACRSLYEFDNLVTAPLHGFRD
TDDYWRQSSSKPWLGSVQVPTLLINARNDPFLPESVLPQKSEVSSFVSLE
FPQQGGHVGFIQGTFPGKLDWLPQRIIEFFSSLCGLDAIPG
>NE1502 Integral membrane protein, DUF6
MFACMGVLVKLAAAFFSNTELVFYRSLVGVITTFLVMRAYGMPLVTEHWK
SHCWRGLSGLGGVLLFFYCILQLPLATAISLNNTWPLFLAFLAMILLKEE
FSWLLAGALVVGFIGVIFLLRPTLAEGQWYLALIGIGSGLFAGIAHFHVR
QLSELGESDWLTVFYFTLVCTVATGLWLTFTAFSAVSLQSLTLALGIGVT
ATLAQLAISRAHRGSNILIVGVLSYSSVLFAGLMDLFFWDARLPVSAWVG
MGLIILGGLLSIRGIPVRNSSIVTLDD
>NE0509 SCO1,SCOD1, SCO1/SenC
MIDEKTDNMKLRISRSYCFQVITTLFLIFVQADIRAATITLSRPVSLQAE
TITHLKQADTTNTNQWKLVVFGFTHCKDVCPMSLANLSMLVKAAVSEQIE
LNGVFVTVDPDRDTEEILSGYTKGFGPGITYLRFEGEELEHFKNAFQVEV
VFYTKNAGNQTHYQVDHSTTAFLIDPTGKIRVIFDALKDAVDVARIFKDN
KGLFKS
>NE0777 TGL2, Esterase/lipase/thioesterase family active site
MSDPNNGAIIFVHGLLGFSSFSIFGKKVHYFRNLRSSLRNSTRQVLFPEL
PATGYIEDRARVLANFLAHISADRIDLIAHSMGGLDCRYLIHHLDPMHRV
RSLTTVATPHHGSPLAKWTIEGSDMCFRLMHSISTPAVNDLTPESCARFN
IEISNRKDVRYCSYASMRCPTDMSFILRSWGNKIAANSGDNDGMVPVASA
QWGEFRDVLQADHFELTGWSFAWPDARKARPFNHLQFYLNLVRELTENHS
>NE1869 aarF, ABC1 family
MRFFRLLKIILIAFRFGLDEILFTQVRLRILKVFSALLPFRSRLQLPRAV
RLRLALETLGPIFIKFGQMLSTRRDLLAQDFAEELALLQDRVPPFPSEQA
VQILETVYGRPVHEVFLEFDIKPVASASVAQVHYAVLHDGTRAAVKILRP
TIAPVIAHDVALMETGAWLLESIWPDGKRLKLREVVAEFARHLGDELDLI
REAANCSQLRRNFLDSPLLLVPEVYWDYCHTEVMVMERVVGTPISHVASL
RTQGIDIPQLARMGVEIFFTQVFRDGYFHADMHPGNIFVGSDGRYIAVDF
GIMGSLSDQDKNYLAQNFLAFFRRDYRRVAQTHIEAGWAPRNTRVDDFES
AIRAVCEPIFDRPLKEIYFGRVLLRLFQASRQFNVEIQPQLVLLQKTLLN
IEGLGRDLDPDLDLWKTAKPFLENWMAEQVGLRGLVTHLQKEATNWAVIL
PQFPRLLHYNLSQERAQNLEDRLAQLVAQEKRQSRLLMLLALLLAGLLLA
QIYL
>NE0734 abcZ, abcZ; ABC transporter ATP-binding protein
MPLLTLDNACLAFGHHALLDHAALQLDPGERIGLIGRNGAGKSSLLRVLA
GEIKLDDGQLWVAPGMNVAYIPQEPTLDESASVFAEVARGLGTLAQTLLD
YHEVSHALGEEGADTKALLDRMQHLQGVLEAQNGWSLHHKVETVINRLEL
PEDAIAGTLSGGARKRVALARALVVSPNVLLLDEPTNHLDFSSIEWLEET
LQNFPGSVIFITHDRRFLDNVATRIIELDRGELKSFAGNFSAYQQKKAEL
MEVESVHNRKFDKVLDQEEVWIRKGIQARRTRNEGRVRRLEALRLERAAR
RERIGNVNFRVDAGQHSGQLVAELEHVTKSFGDKTIIQDFSCRIMRGDRI
GLLGPNGAGKSTLLKLILGELQPDSGMVRLGTRLSVAYFDQLREQLNEDM
TLVDSISQGSEFIEIDGKRRHVISYLEDFLFPPQRARSPVKSLSGGERNR
LLLARLFTRPANVLILDEPTNDLDIETLELLETLLQDYTGTLFLVSHDRA
FIDNVVTQAIVFEGNGHLREYAGGYQEWLQSRSATKAIRKENSDSPVPVA
GLGQIRKDKSSLPAGLSYQETNELAALPGKIDVLEQEQIVVTRKLSDPAL
YKNNHNEAMELQARAAALEKELSLYYTRWEALEHKQTMAESARKKN
>NE2191 ampG, putative transport transmembrane protein
MKPGACVGWLHALRIYTHPRVLGMLLLGFSAGLPMLLILGTLSFWLREAG
IDRATIGHLSWVGLAYGFKWAWAPLVDRMPLPLLTRWLGRRRAWLLLSQL
AISMALIGMARTDPAEDLVRMTVCAIIVAFASATQDIALDAYRIEAVALR
LQGAMAATYQAGYRLAMILASAGVLWIAAALDASPGEYSAASWQVAYTIM
ASCMLIGMMTTLIIREPEVPVAQLTVNSGSRGATASARLLAWLDSAVIAP
FRDFIMRYGYHALLILALIAIYRISDVVMGIMSNPFYVDMGYTKDEVATI
SKVYGVIMTILGAAMGGVLVARIGVIRTLFLGAVLSAATNLLFVWLAGRG
HDVGGLVFTISADNLAAGIASSAFVAYLSGLTHAAYSATQYALFSSIMLL
LPKFIAGFSGEFVDAYGYATFFTGTALLGVPVLVLVWKVGRIDFAGSVRN
NSQGE
>NE2297 bioC, SAM (and some other nucleotide) binding motif
MHYDHVLDKRMLRRSFEQAAAGYDQSAVLQREICDRMLSRLEYIKYVPAR
ILDAGSGTGYGTRKLIERYPAAEIMPMDIALTMHRCARMAISEQIPGWQR
WLPFRRHWPRDYICADIEQLPLGEASIGMIWSNLAIQWCNDLRQTFAEAY
RVLENGGLLMFSTFGPDTLKELRQAFKSADSFSHVNRFTDMHDIGDMLVN
CGFSLPVMDMEYITLTYEDVRGVMQDLKAIGARNVTQGRRRGLTGKAAWQ
QVIERYEALRQDGRLPATYEVVYGHAWKPESRQVQLKPETRRKLGLEP
>NE2298 bioH, possible BioH, catalyzes some early step in biotin biosynthesis
MIWPSVDMASIHIETTGNGPDLVMLHGWAMHSGVWDGVVESLSQRFRLHQ
VDLPGHGASRDCALDSLDQMTEVIADRLPGRYSVCGWSLGGQVAIRLALQ
APERVQQLVLVASTPCFVRRADWPWGMEDSTLTLFMENLARDYTQTLNRF
LTLQVSGSEDQARVLAWLRKSILRGQPPTPATLQAGLKILQTSDLRAELN
QVSQPVLLIHGRNDVITPAGAADWMQQHLPRARLVLFPHCGHAPFLSFPE
QFVSCFDAL
>NE1919 cbbQ, nitric oxide reductase NorQ protein
MSDVIEQYFVKNEPYYRPVADEVKLYEAAYSVRMPMMLKGPTGCGKTRFV
EYMAWKLGKPLITVACNEDMTASDLVGRFLLDAQGTRWQDGPLTTAARYG
AICYLDEVVEARQDTTVVIHPLTDNRRVLPLEKKGELVGAHPDFQLVISY
NPGYQSLMKDLKQSTKQRFGALDFNYPQHDIETEIVSHETGIDSAIADKL
VSIAERARNLKGHGLDEGISTRMLIYAGNLIAKGVDVHAACRMALVRPIT
DDPDMRDALDAAVTTFF
>NE0947 cbbY, hydrolase family
MALSAVLFDVDGTLADTERDGHRIAFNQAFNEFQLDWEWDVDLYGVLLQI
TGGKERIRFYIENYAPSLLSKNNLDEWIAQIHKTKTNYFLNLLKEGKIPL
RPGIKRLLDELRKNNIKIAIATTTTYENVSTLLQCTLGDSALEWFDVIGA
GDIVSKKKPAPDIYEWVLNQLNLPAEACIAIEDSENGLKSATAAGIKTII
TISEYTREQNFSYAALVLEDLESTDHTHQIAAQSFDKPLSVQTLSDLLN
>NE2149 cbbZ, pgp, phosphoglycolate phosphatase
MTQLNTQPSDSGLFPLPLKAIIIDLDGTLLDTAQDLALAANEMLRELRMA
ELPSSTIQTFIGKGVPKLVKRTLTNSPDGEPDPELFEQALPIYERCYAEN
LHVHTRPYPGVVEGLEQLKQSGFRLVCITNKTEIFTLPLLHKTGLLDYFE
LVLSGDSLPKRKPDPLPLLHACKHFDILPKAALLIGDSSNDAIAARAAGC
HIFCVPYGYNEGHDVRELDCDAVVDTIVDATRLITYQHDT
>NE0698 cvpA, Colicin V production protein
MTVFDYIVIGIISFSALLSITRGLVHEIVSLLAWIIAFFAASRYSINVAP
LLAGMVENESIRMLVAFSATFFIVLLITMLASKLLSALVRGVGLGLIDRM
LGALFGMIRGLVIVLFLITAAGFTPLPQQPFWKQAVLSEPLEVMTADIIP
WLPQDFRNLIGFDRNS
>NE2480 dppF, ATPase component ABC-type dipeptide/oligopeptide/nickel transport system
MTALLEVTDLRVLLHTGRQPVRAVDGLSLAIHPGETFALLGESGSGKSIT
ALSIMRLLPDAGEIVHGSVRLNGDELLTLPESAMRKVRGNRIGMIFQEPM
LSLNPVMTTGAQIGEVLLQHSGLRGAALQIRILELMRQVGIPDPARRMAE
YPFQFSGGMKQRVMIAMALAGKPELLIADEPTTALDVTIQAQVLDLMRGL
QQQENMAVLLITHDLGVVAEMAHRVAVMYAGQIIETADRERFFQSPAHPY
SHKLFAALPTRKKRDQGLIVIPGNVPALSKVFTGCRFADRCDRAWEKCHQ
IIPPWVETAPQHHVRCHLYSDDSTERSPQSRLQSLSRTARSALDDLPLSS
HTSDSTQSKPLLRVDDLKIHFPVHKGLFKRVAGHVKAVDGVSLQIDGGRT
LALVGESGCGKTTVGKGIMQLIPVTSGSVRLQDKELRDLDRKQLLQKRSA
FQIIFQDPYSSLNPRMRIVEIIEEGIRALGRNSDKIAASEKNQHDVDTLL
MQVGLPAEAKWRYPHEFSGGQRQRIAIARALAVDPQLLICDEPTSALDVS
VQAQILNLLKTLQQEHKLAYLLITHNIAVVDYLADEVAVMYLGRIVESGR
TEEVLDNPKHPYTQALLSAVPTYEPGSQREIIRLQGEPPSPANVPPGCHF
HPRCPHVMPICREVYPAVSRFSASHTTYCHLYHSVSQEPQDLQ
>NE2323 era, Type 2 KH domain
MNAPGYKTGYISIVGRPNVGKSTLLNHLIKQKISITSRKAQTTRHRIHGI
LTDAQSQFIFVDTPGFQTRHRSQLNQVMNRVVLQSMQDVDVVVFVVEAGR
FGREDEQVLEQLPRNLPVVLVINKIDLLPDKLQLLPFMQKMADVFEFSAI
VPVSALQNRQLSALIEAIRQHLPGNPFLFAEDEITDRSERFLAAELLREK
VFRQIGEEVPYSVSVVIEQFTVEGNLRRIHACILVERENQKAIIIGKQGK
KLKDMATQARKDMEMLFGSKVYLEVWVKVKSGWADDITALKSLGYE
>NE2022 exoT, Polysaccharide biosynthesis protein
MTRSTPRHTTVRQALILSSRDLGSRVARGAGFTLLGIVLRTTLTIGSMAI
LARLLTPADFGYLAMATVVTEFAGLLGSFGFANILIQRRVITRLQLDTMF
WATLALGCTIAAVIFALSFLTHWLFGDEATGPLLRVMCLTFIFGSLSTVH
QAILSRLMRFGTEFIIQIGTIGLRSAAAIILAYLGFGVWSLVYGSLAGSI
IGTLLMVSAIRYRPRLRFHRQYLLSTWKTSSSYMGNTVLYYLNMNSDLLL
IGRQFGASALGYYQNARSLTDEVRGRIAMPLQRVLFPAFSSLQADQVRLQ
HSVLRSGRLLAAIICPIGIGLSAVATEIVPVLYGEQWLPMIPILSLLGIS
AALRGSTAIGSSLFNSQNRVPLAFRYNIIYTALLLCSILFAMPYGLNVVA
LAIAANSLFSVFVFRVALGLIGLGTSHLLHILARPFIAALLMWAAIAFLR
NLPILTALHPGMHLGALIACGAISYASVLHLLSRQYLQDFTELAARFTKR
R
>NE1648 fabG1, Short-chain dehydrogenase/reductase (SDR) superfamily
MLLEHKIALVTGASRGIGKAIALELGKHGATVIGTATSETGAGHISQYLS
EARISGMGLIMDVSNIEHIKSGIETIQQSLGDVAILINNAGITRDNLLAR
MKDDEWDNVIQTDLKSVFCLSRAVLRTMMKARSGRIINISSVVGATGNPG
QTNYAAAKAGMIGFSKSLAKEIGSRNITVNCVAPGFIDTDMTRSLSPDQQ
QSLIQHIPLGRFGRPEDVAAAVVFLASPAADYITGATLHVNGGMYME
>NE0095 guaB, guaB; inosine-5'-monophosphate dehydrogenase oxidoreductase protein
MRLIQKALTFDDILLVPAYSEVLPKDVDLATQLTRTLRIKIPIVSAAMDT
VTEARLAIAIAQEGGIGIIHKNMPIKAQAAQVAQVKRFESGVVTDPIIVS
PDMTVRKVLELIRQHNISGLPVVKSKKVVGIVTNRDLRFETNLDQPVKNI
MTPKKHLVTVREGVSKEDALALLHKHRLEKALIVSENFELRGMITVKDIT
RTTEHPYASKDNQERLYVGAAIGVGEGSDERAAALVEAGADVIVVDTAHG
HSQGVLDRVRWVKKKFPEIQVIAGNVATATAAKALVDHGADAVKVGIGPG
SICTTRVVAGVGVPQISAIDNVATALLGTGVPLIADGGIRYSGDIAKALA
AGASSVMLGGLLAGTEESPGEIELLKGRSYKSYRGMGSLSAMQQGSSDRY
FQEAERHEADKLVPEGVEGRVPYKGYLANVIHQLTGGVRSSMGYLGCRTI
SDMHTKAEFIEITSSGIRESHVHDVQITKEAPNYHVE
>NE1333 hetN, Short-chain dehydrogenase/reductase (SDR) superfamily
MSRLPDKSGRSILITGATGAIGAALAEIYAQHGVTLHLQGRNAVKLAEVA
ERCRLKGAHVLMQCLDLRDSVALQDGLKVLEPLDLVIVNAGMNTHVGSAG
ESELLDEVEALLDVNLKAAMVIVHAVLPSMRMRGSGQIAFVSSLAAYFGL
PVTPAYCASKAGLKAYGEALRGWLAREGIKINVIMPGYVKSPMCDDMPGS
KPFLWPPDRAAKVIKRGLERDQARISFPFPLNWGAWWLAVLPASVSILIV
RLLGYGG
>NE1287 hfq, putative host factor-i protein
MGVKGQLLQDPFLNILRKERIPVSIYLVNGIKLQGQIDSFDQYVVLLKNS
VTQMVYKHAISTIVPAKAISIPIPADTQTEQDEP
>NE1310 hipA, possible HipA protein
MARELEVWLFAERIGTLALIEDRLNFRYSPDWLSRPDAATLSSSLHLQAE
SFDDHHTRPFFGGLLPEGQLRRLIAQQFQVSSQNDFALLDHIGGECAGAV
TLLEPGQSLSSPGQGDDVQWLSEEEIVAILDELPHRPMLAGKDGVRLSLA
GTQDKLPVVSDGARIGLPRNGSPSSHILKPAIRTLMDTVTNEGFCLALAE
AMQLKPAKSQVHSVLGRQFLLIERYDRVVDAQGQRQRLHQEDFCQALGVV
PEMKYQNEGGPDLVQCFDLVRRITRPSAPQILRLFDYVIFNALIGNHDAH
AKNFSLLYAGKSAILAPFYDVLSTAIYPTLTPKMAMKIGSKYKFSEVQTR
HWDQFSEAVGLGKAQARKRILALAKSMPPTARELQSSREHGFAGHAVVEQ
IVILIEQRCALTVRRLSAPAADMEDETVL
>NE0640 hitA, HIT (Histidine triad) family
MEDCLFCKIVRGEIPATKIHEDEDTLVFLDIHPAAPVHLLVVPKQHIGSL
SEVDASHQQLLGKMLWLAPRLAASQGCTDGFRTIINTGRVGGQEVFHLHL
HVIGGKDRLPAMVHHD
>NE0101 hprA, D-isomer specific 2-hydroxyacid dehydrogenase
MLTVFLDFGSVTRGDIDRTVLEQVVSPWVYHDNTSREQVAERIREAEIVV
SNKTLLDRSALDAANKLKLICVAATGYNNVDLIAAAERNIPVCNVRNYAT
GSVAQHVFMFMLNFACRFVEYQQLIKRGGWQASSYFCPLDFGITELAGKT
LGIVGYGELGNAVANIAKAFGMKLLIAEHKSASTIRPGRTAFDEVIRQTD
FITLHCPLSEDTRHLISNRELNLMKPSAYLINTARSGLIDETDLLKSLYS
KHIAGAAIDVLKEEPPVSGNPLLDYPHPNLIITPHSAWASVESRQRMLNL
LADNIRNFLHNKPFNQIKDALA
>NE1567 kduD, Short-chain dehydrogenase/reductase (SDR) superfamily
MNQPYKGTPFDLSGRAVLISGATGLLGTEFALAAASAGADLILGDLDGNR
LELLKNEIIASHPDVHVLIQVLDVTRADSCQSIAQLCEDRFGRIDGVVHS
AAIDPKFEQGSDTSRFSKFTEFPLALWQTSLDVNLTGAFQLAQATCRIME
KSGRGSVVFLGSNYGLVGPDQRIYKKAGQEAQTYKPAVYSVCKAGLLGLT
KFLAAYYMNTSIRTNLLTPSGVWNKHDSEFTGHYSSRTILGRMSEKEEYR
GAILFLLSDASSYMTGANLVIDGGWTAL
>NE0899 ldhA, D-isomer specific 2-hydroxyacid dehydrogenase
MKITFFSTQPYDRESFLKHHIDTQFELVFLEEKLTEHTVSLASGSQAICV
FVNDNLNEAVIHQLSQLNVQLIALRCAGFNNVDIKAAHACNIRVVRVPAY
SPHAVAEHTLAMIMTLNRKTHKAYNRVREQNFSLNGLLGFDLHKKTVGVI
GTGHIGEVFCRIMHGLGCNILACDPVKKLEIEKMGIPYVPMNELFSRCDI
LSLHCPLNEETRYLIDSSVIAQMKTGVMLINTGRGGLIDTKAVIAGLKSG
KIGYLGIDVYEQEADLFFQNLSEQIILDDTIARLMTFPNVLITAHQGFFT
QEALDQIALTTFANIKRFVAGEIPANEVKI
>NE0840 merC, putative mercury transport protein
MGLVTRIADKTGALGSVVSAMGCAACFPALASLGAAIGLGFLSQYEGLFI
SRLLPLFAAVAFIANALGWLSHRQWHRSVLGMIGPAIVFAATVWLLGNWW
TANLMYVGLALMVGVSVWDFVSPANRRCGPDGCELPAKRG
>NE0625 metG1, Methionyl-tRNA synthetase
MTIRNILVTSALPYANGSIHLGHLVEYIQTDIWVRFQKMQGHTVYYVCAD
DTHGTPVMLRAEKEGISPEALIARVHAEHLRDFTGFHIAFDQYYSTHSDE
TRYYAEDIYRKLKEAGLIAVRAIEQLYDPIKNLFLPDRFVKGECPKCGAA
EQYGDSCEACGAAYTPTELKNPYSAVSGATPVRKTSEHFFFKLSDSRCAD
FLRRWTHEGNHLQAEAANKMAEWLGEAGENKLSDWDISRDAPYFGFEIPG
ETGKYFYVWLDAPIGYMGSFKKLCARKGIDFDAYWKKDSTTELYHFIGKD
ILYFHALFWPAMLENAGYRTPTQIFAHGFLTVNGEKMSKSRGTFITAESY
LEQGLNPEWLRYYYAAKLNGSMEDIDLNLDDFVARVNSDLVGKYINIASR
CAGFISKRFGGKLVSGEDYRLLQQMVDEHFAGWQPGVIEAAYEARDFSAA
VRHIMRRADEVNELIHELAPWEIARDETRERELHRACSLGIQMFYLLSCY
LKPILPRTAAQIEDFLNLGELSWQKQQAGQPLSDTLLPPGHVINPYQHLM
TRIDPKQITALITANQQTMQQTMNTETESHSPQRHGQAQQHPVAPIAETI
SIEDFVKIDLRIARIVDAQHVPGADKLLQLTLDIGSEQRTVFAGIKSAYD
PEQLKGRLTVMVANLAPRKMKFGLSEGMVLAASGENGGGPFLLAPDSGAQ
PGMRVK
>NE2187 metW, SAM (and some other nucleotide) binding motif
MLDLGCGDGTLLHYLRDKLDIHGYGVEIDAHNILACMKNGINVIQNDLEA
GLSEFEGESFDYVILSQTLQAMKNTEYIIREMLRVGKEGIVSFPNFGYWK
NRIQVAGGHMPVSPTLPYQWYDTPNVHLCTLHDFEQLCQQHRVNILERRV
MNNDKKVTFLPNLFGILAFYRFSHAA
>NE2405 mviN, Virulence factor MVIN-like
MNLLKALATVSSMTLVSRILGFVRDLIIARIFGAGVATDAFFVAFRIPNL
LRRLFAEGAFSQAFVPVLAEYKNNRTEEQTRELIDHVATLLGSALFIVTL
VGILAAPLIIYISAPGFAGVPDKFELTIALLRITFPYIFFISLVALAGGI
LNTYSHFSVPALTPVLLNLSFIGCALWLAPLMDPPVLALAWAVFIGGMLQ
LAFQIPFLLRLKRMPRLRFGFRDSGAWRVLKLMGPAVFGVSIGQISLLIN
TIFASLLITGSVSWLYYADRLMEFPAGMLGVALGTVILPSLSRHYTQNST
EEFSRLLDWGLRLTFLLTLPAAVALALLATPLITTLFYYGAFTVEDVWMT
REALIAYSVGLLGLILVKVLAPGFYARQNIKTPVKVAILTLAATQLMNLA
FIIPLKHAGLALAISLGACLNAGVLYSKLRSQGIYQPLPGWGIFIFKILV
ALIVMGAGLWLATGNSAEWFVLTATERAIKLGLVVILGGIGYFACLWMLG
FRLRDFARQ
>NE1896 nadE, Carbon-nitrogen hydrolase:NAD+ synthase
MKIALAQINCTPGDLRGNQLKILHACRQAREAGADLVITPEMSLCGYLAE
DWLLRREFVQACHQALTELTAQVYDVTLIVGHPHNMNGNLFNAVSAVRDG
RLLATHCKQHLFSDRLQDERRYFSAGNSLCTFECSGILFGLMTGSDYRHA
AHLQSLHAAGAQVLLAVDASPYSIDSQIDRYQILREGITQTGLPAVYINP
VGGQDELVFDGASFAMDHSGKLVCQLPAFQEALALIAIHGNQSIFGECST
LPDQAGSIYTALRLGLHDFITKNRLPGVLIGLSGGVDSALVLAIAVDALG
AERVRTVMMPSPYTADISIQDAQTMADNLGVRHAGIPITGLFDQFQQALQ
AELQACSDSGTSATVENLQARIRGTLLMALANQSGMLVLPTSNKSETAVG
YSTLYGDMAGGFSILKDVSKTLVYRLCHYRNQISPIIPQRILQRPPSAEL
RPGQIDQDSLPPYDVLDAIIEAYVENDLSAAEIIAMNYPEETVRRVLRMI
HSSEYKRRQAAPGIRITRRDFGRSWRFPLTSGFPD
>NE2253 ntpA, NUDIX hydrolase
MQRYKLPVSVLVVIYTADLQVLLLERADHPGYWQSVTGSQDPGETLLQTA
VREVREETGLNTDDYVLSDWQIQNRYEIFEEWNWRYPPGTTHNTEHVFGL
ELPKTIPAVVSSREHLGYVWLPWREAAEKVFSSSNACAIRMLASKRKSEN
SR
>NE0498 ntrR2, PIN (PilT N terminus) domain
MISPRYLLDTNILSDLVRYPQGVIARRIEEVGEAAVCTSIIVAAELRFGA
ARRNSLRLTRQVEAILAAIEVLPLDTPVDRAYAQLRWVLEQSGQVIGPND
MLITAQAMASQCVLITANLDKFSRVGELQVENWLVR
>NE0132 osmY, putative osmotically inducible protein Y
MKIKTLLSIITAMALLILGTEVATAQSTGVPTIDSGSPESNQPLNDTMIT
TKVKAELAIAEGIKSGDISVETVNGVVILTGTQPNEMLIKKAEEVAKSVK
DVKQVDISGLTVNTTTAD
>NE0953 pheT, pheT; phenylalanyl-tRNA synthetase beta chain protein
MKFSGNWLRKLVDLQYSDEELAHKLTMAGLEVESVAPVAPFFDKVVVAQV
VSVQKHANADRLNVCKVDVGTQSDGFLQIVCGAQNVKEGMKTVCALVGAR
LPELDIRQGKIRGVESLGMLCSAKELGLASDADGLLELPGDTPVGVDFRK
YYSLDDCIFTLKLTPNRADCLGMFGIAREVAAITASELDLPEIVFVNPVI
EDVLQIRVEEPESCPLYCGRVIKGVATDTAIPLWMSQRLERAGLRMINPV
VDIINYAMLETGQPMHAFDLNQINQEICVRFAGENEHLLLLNGERLTLQK
DMLVIADSSKPLALAGIMGGSESGVTNTTIDVFLESAFFSPVVISGKSFK
LGFTSDSAHRFERGVDFSMTRSVLERATALILEICGGKAGPVTEIGNDLP
RREAVRVRQKRIAKILGVDFSVELISEYFQRLQFSYTIAGEMFYVVPPAA
RFDLVIEEDFIEEIARVHGYDLIPARLPKAPVHMLAEPETGVSPVKRLRQ
ILTAKDYQEVINYTFVDTQWEADFSGNDIPIRLKNPIASHMDVMRSSLFG
GLIDNLQFNLNRKQSRVRIFELGSCFSKEGDEEKEVENLAALCSGSAYPE
QWGVPDRDIDFYDVKMDIESLFWPRSVYFELALHPALHPGKSARILIDKK
PVGWIGELHPRWQSKYHLSQSAILFELRTEALAAELLPAMYPLSKFPPVR
RDIAVVVESDVSVASLLETMHLEKDRSISEISLFDLYSGEKLVQGKKSLA
FRILLQDSEKTLTDQEIDRAVSQLVGILERKFGATLRS
>NE2195 pmbA, Putative modulator of DNA gyrase
MDNSNHHPEQPQSFSYSVETLQQIADDVLTLAHKGGADACEMNVSEGSGQ
NVTVRQGEVETIEYTRDKGLSITVHIGHKRGNASSSDFSPQAIRETVSAA
LSIARYTADDIYAGLADQDLLATSFPDLDLYHPWSLPVEEAIELARQCEA
AALATDKRITNSEGASVSAGASHFVYANSLGFCAGYPLSRHSISCAVIAG
EQNNMQRDYWYSVARAAGDLEAIEEIGKKAGMRSLARLGASKIATCEVPV
LFESTIASSLIGYFVQAISGGSLYRKSSFLLDSIGRQVFPATIQISELPH
LQKGLASCAFDDEGVATHPRKVVENGVVQGYFLGSYTARKLGMRSTGNAG
GNHNLIVENNVSLSFDALLKKMNKGLLVTELLGHGVNLVTGDYSQGAAGF
WVENGEITHPVEEITIAGNLKNMLSGIVAVGNDVIVRGSRQCGSLLIERM
TIAGH
>NE0298 pta, Phosphate acetyl/butaryl transferase:Phosphate acetyltransferase
MHTFFVTSTGFGVGLTSTSLGLVRALEYGGLKAGFYKPVAQQHPGNSKLE
YSTELISRTLGLAPPAPLPLATVEHLLGEGQIDDLMEDIVRRFKQASEGY
DVMVVEGMVPTRHVSYASRVNTRLASSLDADIILVSSAEDDALQAITDRI
EIQAQFFGGAQNPRLLGVILNKIRTDHSDDLFEQLKNHSTLFHQSSFQIL
GCIPWEDSLNAPRMADVVTQLQAQIVNAGDSEKRRVQDIVLFASAAPNSV
TLLRPGVLVVTPGDRDDIVMAASLAVLNGVPLAGLLLCSDFPPDPRVLEL
CKGALTKGLPVCTVTTNSYDTAANLHRMNREIPLDDHERAERITNFVANH
IRQELLVKRCGEPQEQRLSPPAFRYRLVKRAQEADCRIVLPEGYEPRTIQ
AATICQERGIARCVLLAKPDAVKAVASARGITLPEGLEMIDPEKVRRNYV
AAMVELRKHKGLNEPMALAQLEDNVVLGTMMLATGEVDGLVSGAINTTAN
TIRPALQLIKTAPGFKLVSSVFFMLLPEQVVVYGDCAVNPNPTAEELADI
ALQSAASAQALGIEPRVAMLSYSTGDSGSGQEVEKVREATRLARLARPDL
LIDGPLQYDAAAIASVGRQKAPGSPVAGRATVFIFPDLNTGNTTYKAVQR
SANVVSVGPMLQGLRKPVNDLSRGASVEDIVYTIALTAVQAASQR
>NE1931 recX, RecX regulatory protein
MSLYARALECLARREYSRHELEKKLSCHEQLPDELKSVLDRLEQQKLLSD
ERAVEQILHARSRKYGSKRIRYELQMKGIADHLIEAALGEFKQTEFSSAH
ALWCKKFGVAPSTPEERGKQIRYLAGKGFSSEVISKVLSDAREAEN
>NE2307 rhuM, putative cytoplasmic protein
MSKKKQDVSIVRSSAAEYLTFIAAMGDQSQSVEMRYEDENIWLTQKMMAS
LYDVTVPAINQHLKRIFDDGELLPEAVLKDYLITAADGKQYRTKHYNLQA
IISVGFKINNVRAVQFRKWAGQIVKDYTIQGWTMDVERLKKGHLFTDEYF
DRQLEYIREIRLSERKFYQKVTDLYATAFDYDKDALTTREFFALVQNKLH
WAVHRHTAAELIVSRADAGRTNMGLTHWAAAPQGKIIKSDVSIAKNYLNV
QEMEYLERIVSIYLDFAELQAMRKIPMSMQDWARRLDGFLEFNGNEILMG
PGKVSQEQAKLHAESEFEQYRIVQDRLFQSDFDRLLLQLESKNKEEGQ
>NE1213 sps, Glycosyl transferases group 1
MMTDQKLYILMMSVHGLVRGHDMELGRDADTGGQITYVVELARALGRNSH
IAQIDLLTRQIEDPNISPDYAAEIEELGPNARIVRLPCGPRKYLRKELLW
PHLDQMVDRCLHYLRQQGRLPDLIHTHYADAGYVGQHLSNLLGIPQIHTG
HSLGRPKRARLLASGRKEQAIERQFNLSRRIAAEEEVLVHASLIITSTSQ
EIEDQYGMYKNTDPRRCQVIPPGTDTSRFSPPGRKPLDPAIQAGIDRFLN
TPEKPVILTICRPDTRKNLHGLIQAYGSDPSLQDMANLVIIAGSREDIRA
MEESQRKIMNDVLLDIDRYDLWGKIAIPKHFMVEDVPEVYRLAVRRRGIF
VNSALTEPFGLTLIEAAASGLPIIAPEDGGPRDIITNCRNGLLVNTLNPS
DIASALKDALSDRKRWRNWSRNGIASVRRHYTWDAHVSKYLREADKLLYR
ERKRLRRQLAATLHAGRSPMPLARKVIISDIDNTLLGDEQGLAEFLQWLR
MHAGNISFGIATGRTVESAVRILKKWRVPMPDILITSVGSEINYWPSLRP
DKGWSNHIRHRWRREALAEALKEIPGLALQAPENQREFKLSYLVTPERMP
PLKQLYQHLHKQNLHAKLIYSHEAFLDVLPVRASKGLAVRYLAYKWGLPL
QSFLIAGDSGNDEEMLVGDTLGVVVGNHSPELESLRDREQIYFAKNTYAL
GILEGMKHYHFDQ
>NE0813 sspB, putative stringent starvation protein B
MSDVSSIKPYLIRAVHQWCTDNMNCPHVSVLESGCSGIPAELFKDGEIIL
NISYQATSDLLIDNETIQFVARFNGVSRKVEIMIGAVIAIFARESGQGLT
FTPEISKTAVADKQEGDVDHAVSQDSQVLSIEGKRGKPSLKIIK
>NE0950 surE, Survival protein SurE
MRILLSNDDGYFAPGIANLAKVLLEIADVTVVAPERDRSGASNSLTLDRP
LSLHKSHNGFYYVNGTPTDCVHLAVTGMLDELPDMVISGINDGANMGDDT
VYSGTVAAATEGFLLGLPSIAVSLVSMSRGNFPTAARIVVDLVKRFTENR
FHIPILLNVNVPDVPYDELQGVEVTRLGRRHKAESVIKYQTPRGETVYWV
GAAGAAQDAGEGTDFFALQNNRVSITPLQIDLTRYDQIGYVKNWLTL
>NE0386 thdF, GTP-binding protein (HSR1-related):tRNA modification GTPase TrmE
MTSNDTIAAIATPPGRGGIGIVRISGTNLESLARGILGKLPDPRHAGLFS
FLDQNSQIIDQGIALYFPSPHSYTGEEVLELQGHGGPAVMNLLLDRCLQL
GARLAEPGEFTLRAFLNDKLDLAQAEGVADLIAASTANAARCAVRSLHGE
FSSTIHQLVSALIDLRVLVEATLDFPEEEIDFLQSAHAAEQLATIRAKLE
QVLVASRQGNLLQEGIKVVLAGQPNVGKSSLLNRLAGDEVAIVTDIPGTT
RDTVRQSIEIEGIPLHLIDTAGLRETSDIVEQHGIARTYAAIEQADLVLL
LVDSRHGVTEEDRSVLTRLPERLPVLTVHNKIDLSAQPPRLEENTSGPTI
YLSAINGEGIELLRAALLKTAGWQANIAGEGAYMARQRHLQALIQAKELL
ERAAAWLHRADQLEILAEELRLAQQALSSITGEFTSDDLLGEIFSSFCIG
K
>NE1471 thrB, putative homoserine kinase protein
MSVFTPVTKEQLAVWLKNYSLGSLIDLQGISSGIENTNYLVTTTQDKFIL
TLFEKLTSTELPFYLNLMAHLSEQSIPCPRPVESQNHRLLGQLNGKPACI
VTFLPGRSMVQVAEKQCAQVGEMLARMHLAGRNYSGWNQNPRGLNWWQTT
AETVMPFLSSSEQNLLDEELQFQAAQMTANLPQSVIHADLFRDNVLFTSD
GIGGVIDFYFACNDTLLYDLAITANDWCTLTDGIMDKTRMHALVTAYHAV
RPLTADEHSAWPAMLRAGALRFWLSRLYDYYLPRPGELTHKKDPGHFKRI
LEHHLSNPGVLPSFQA
>NE1326 tldD, Putative modulator of DNA gyrase
MMDSFTIADQYLLAPYELNTGRLQDVFGHILTHQIDYADIYFQYSRSEGW
VLEEGIVKSGSFNIDQGVGVRAISGEKTAFAYSDDISSQALISAARATRA
IAAQGGGTHASITGLSGNDARQQALYYSSLDPIALCKDADKIGTLERLEG
FARTLDKRVIQVMASLAGEYEVVMVARSDGLLAADVRPLVRVSLQVIVEE
NGRREQGVAGGGGRFDYAYFTDAILQDYARKAVHQALTNLASQPAPAGSM
TVVLGSGWPGILLHEAIGHGLEADFNRKGSSAFSGRIGERVAAPGVTVVD
DGTIRDRRGSLNIDDEGNPTQCTTLIEDGILKGYLQDNLNARLMNQRVTG
NGRRESFAHIPMPRMTNTCMLNGNKEPEEIIASVKQGLYAANFGGGQVDI
TSGKFVFSAAEAYMIENGKITYPVKGATLIGNGPDVLTRVSMIGNDLALD
PGVGTCGKEGQSVPVGVGQPTLRIDGLTVGGTNG
>NE0287 vapC, PIN (PilT N terminus) domain
MLILDSNTISYYFRGDPQVVLRLQAQRPQDVAVPAIVEYELRYGLLRLPP
EMAAPRLAALTTLLLPMQKLPFDSECADHAARIRTTLEAAENPIGPHDTL
IAATALRHGATLITRNVREFSRVPGLQWINWHEG
>NE0732 ybbB, Rhodanese/cdc25 fold
MRNPDIGIDDLTALFIADTPLIDVRAPVEFTQGSLPGAVNLPILNDEERA
LVGTTYKQQGSEAAIKLGYEMVSGSVKQNRLQQWLDFIHQHPRAILYCFR
GGKRSQITQQWLRDTGIDSPLITGGYKRARQFLISTIDRFSEHRKLLVIT
GPTGSGKTRLIHDISNSHPVLDIEALARHRGSAFGGMSVPQPSQIDFENH
LAVNLLKLEQNNLSEPVIVEDESRHTGKVYLPDSFFHHLRNSEIIWVDEP
LATRVDNIFEDYILTTPIGQAQRIRQAIPPLASTVETREILRQQARQLFD
KYAGALQAISKKLGGDRFQEVSEDLENARSDFENKNEIQSNKIWIEKLVR
YYYDPLYLGSLQRRRVNPCFKGSGQAVMDYLQARK
>NE2561 ygaD, Competence-damaged protein
MSTYPTVPDDETLLELARRAGKLLEQNGLKLVSAESCTGGWIGQIITAIP
GSSAWYDRGFITYSNSSKQQMLHVQPSTLTQSGAVSEQTAREMALGALTL
SQAQVAVSVTGIAGPAGGSAEKPVGTVCFAWMLESASATSANSKICRFSG
NREAIRRQSVAIALQGMLELLENTTPLNLA
>NE0721 yraL, possible methyltransferases
MPAAAGVSKGTLYVVGTPIGNLRDITLRALEILSAVDCIAAEHIQHAQKL
LAGHALHTTSTRIMPLHQHNEGSAVEKIIELLGSGKSVALISDAGTPAIS
DPGALLVQQVLARSLPVVPIPGANAALCALSASGLIAPHFFFYGFLPAKS
GERQRKLAGLKTLYACILVFYEAPHRVLECVADMVAVLGTTREITFAREL
TKLFETIHTCALGEALDWLQADENRLRGEFVLLLAPAEEPGQEDISPQAV
HALAILQRELPLKQAVQLAAEITGEGRKKLYARALLERKTET