TitleGenColors Logo

Gene list

Applied filters:

Organism: Nitrosomonas europaea ATCC 19718, ATCC 19718
Gene type: CDS

Number of genes found: 2461

Free access
Sort by:

 



# Nitrosomonas europaea ATCC 19718, ATCC 19718

>NE0481 Helix-turn-helix motif
MTRPVNRMRAVHPGEVLREDFLIPAGISVNALAIALSVPATRIHEIVKER
RAVTADTAERLAHYFGGDAASWLALQASYDLKTLPTRDEIERRVQRREEH
V
>NE0293 hypothetical protein
MPTIHEVATLTSKGQITLPKSIRQALGADTGSKLAFELRGGEVIVTRADT
GHEDPAIAAFLALLARDIEAGRNIRGLPEDLIRTMLEHTGHNVVLGDDFD
EDVEI
>NE2126 glycolate oxidase, (S)-2-hydroxy-acid oxidase, peroxisomal
MGELLPKLATIPEGIQSAADYLPLARERLLPAVWHYLEEGSGNQVTLHTN
NCVFDSIRLIPRPMADVRDGHTRITLFGQTLAHPVILAPLAYQRLYHPHG
ESASAMAANAQGGQLCVSSLASQTLEEIITAAGQPLWFQLYWQEDRPRTL
KLLRRAVTAGYQAIVFTVDAPIKQATIQLPASISAVNLDTPAPFPALLPH
QSQVFNGWMAQAPRWEDLAWLRAQTSLPLLVKGILHPEDARKVINLGYDG
LVVSNHGGRVLDGAPASLACLPEIVSTVSGRGKVLFDSGIRNGRDIYKAL
ALGADAVLIGRPYIWGLATVGALGVAHVIRLLRDELEMTMALTGTASIRE
ITREKIISDRD
>NE1297 hypothetical protein
MGTHDREGLLTRYRISFLVNIILLVIILAYLFKLSQTGIPVKVVSYEKQD
LTVKNLETLTRNELDSERTEGVLVIKKKDGEQADLVFYGKGLAGMTAGAG
SSIKELMLQLGSGSTDAYAGDDNCRSVLVFRGQVYCIPW
>NE1985 hypothetical protein
MANEICLDWWGKASAGAILGLGLALSLVGLYAYLGPGGIDAPGGRYSLMR
YLEVFVWVAVFGFCFLFRSGRAAWAWLGAANLIAFTTLFACRFYFFV
>NE1008 conserved hypothetical protein
MQNTFIKLILGIQILFLLTGCVPMVLTGVGVGAGTGALMVEDRRSSGMYI
EDERIELKTSRRIGERLGDKVHVNVTSFNRNVLLTGETPDESTRKEVEKL
AMSVENVLNVSNEIIVAPKSSLASRSNDTLITSKVKARFINNRVFQVNHI
KVVTENGVVYLLGLVKRNEGEKATHIASTTESVTKVVKVFEYLD
>NE1404 ABC transporter, ATP-binding protein
MSLTPVLELSNVSKTYSSRQVLSGLSYSFAAGEYVAIMGDSGVGKSTLLN
LIAGLDSPDSGEIRIDGKILPVEDEAATNLRRRHLGFIFQAFHILPHLTL
LQNVSLPLLLNRLPLDRAPVMLTAVGLQGREQDFPHQLSGGELQRVAIAR
ALVHRPRLILADEPTGNLDQDTAHEVLQLIRTEIKANQASAIIVTHSRLA
AETADKILMLSKHGLTPVPLENQT
>NE1285 Band 7 protein
MGLNDPQWGKRRGNSGPPDLEDIMRNFNQKISEIFGKKGGGNDDEDSGGG
SPNLPSGRGFVAIVALLALAWIGSGFYIVDEGQRGVVLRFGKHVETTMPG
LRWHIPSPVEAVESVNIGQVRTVEIGYRNNVRSKVLKESLILTDDENIVD
IQFAVQYILNSPENFLFNNRDPESTVLQVAETAIRQVIGTSKMDFVLYEG
REEVTAKTTELMQEILDRYQIGISINRVTMQNAQPPEQVQAAFDDAVKAG
QDRERQRNEGQAYANDVIPRARGGAARLLEEAQGYKQRVVAAAEGDASRF
TQVQTEYAKAPEVTRERMYFDTIQQVLSSTSKILIDQEKGGSNLLYLPLD
KLIQADSSATRSSVAAARSQNESQEFSSDVGSRSRESFRSREREMR
>NE2354 hypothetical protein
MSGYMARIFCKKAMLLLLLTGTGIAGHIPVSDARQVAEAIGADTFTGYQE
IRDRRKSRPAAVLSSTEAGQDNSELKKPPYRRTRSTDAAAAFIPPQLLAP
GRCGIVDRIEFVMQPYYGDPGRVDTFIPGMAAIPYGQHGEKLFYADVAVA
GSSGYFPPHKNIVEPVYFKIGLLADDGHYHVMTQRTPPQFQTGETVRLGH
SGFLEKADCVMPEPDQHRPGR
>NE2169 Generic methyltransferase
MKVCLQCENFFSSADWACPSCGYQPERLNGIEAHASEFAHGGGGFKPEYF
SELSRLEAGNFWFRARNELILWALRTYKPNAGAFLEVGCGTGFVLSGIAR
ACPEIALNGSEIFLAGLFHAAKRVPSTHFMQMDARRVPFVEEFDAIGAFD
VLEHIEEDETVLAQLHNAIKPSGVLLLTVPQHPRLWSASDDYACHVRRYT
RVEIEQKVLTAGFELLRSSSFVTSLLPVMMLSRVLQKRKTKDFDPAGELK
INAALNKVFYGLMMLELAGIRLGMNYPVGGSRLVVAKKQSA
>NE0912 hypothetical protein
MKSFLKNKQCGQGIRKAAVLTAAIVSLSAASVSWAASITIDNRTFVNGSQ
SGNISVAGGVLAGQFNFNVTSPATLGNVTWNSTLQAFCIDINNSIKNPAT
YNIVAATDVGGLTNLQLSQVGWLFDNHGSSLGSSSQFDAAFQLTLWEIFF
ETGPTFNLNNGSFEAESGFSGARATANAWLSGVPTSDDYTSEKWEFYTLN
PTNPQNNQRLITWREKDPSQPPQEISEPGTLLLLSIGLGVAFFSIRRRNG
MQFASLA
>NE0816 TonB-dependent receptor protein
MPVQAQNTVAINIPSQPLSDALLQLGQQTSLQILYTPDLVRELNTSGLQG
NYTPENALRQLLKGTGIEYERQGNTVTLKRISAVTLPPLTVSTASLKQGT
AEEGYRVSNVSGVGLWDERSLRDTPYSMSVVTSDLIENINAQNMGQIFKM
NPLTQEPNIQNHNGVPIVTIRGFQSTNPIYDGIPMADSNGSAVSVIEIDR
VEILSGTSGFLYGGGRVGGAVNYVTKNSTNIPLRRLTLGNYGGTQFYGHA
DIGGKFDERGRFGYRVNLLYQGGDTAIGEPVNQKVANIVLDGHLTDTLYA
DVKYTYNDYRKEATQPTFNNVRERTGIDTSKRYAPKWASNDIESHKAYTS
LRWDPSKYFPLRNAFFYQNTRAKTNEITLNEQADRTFIPRISSGNPWQKY
ENYGGYIYLDTRFSTFSVDHKLTVGYSSTVLNQVFLDGSPSWSGTNLLLS
EIKNISRPDWEEPPPRSASIWYDRYNNVLIGDDITFNDQWSVMVGLNYAT
TANKATGIYGDRDYEKSALTPTVSLLYKPIEAVTTYVSYMEALEKGTIVG
RTYTNYGEVLPPLVSKQYEAGLKYDINPNLLLSTALFRIEKANQFSNLAT
PVPTYVRDGLQVHNGVELVLTGKITDNLTIMGGGTLMDLSVEKSNNPALE
GNKPANAASRMGKIYAEYALPWIPGLSLTGGIYYTGERYANAANTDKIPS
YTLYDIGARYSTRVLNKALTVRLNVINLTGKNYWQNATYLGVPRTVAFSV
STMF
>NE1227 conserved hypothetical protein
MKKQSCSHTTEDVPFPQANRRKFLQLAALGSSIGLFSVVSGISEARAEKK
AGTLLLSCMDYRLMDEIERYMLRRGLYHNYDHIVLAGASLGAVTDKYPAW
SRTFWDHLDLAVNLHEIHTVMVMDHRDCGAYKILLGEDYSKDTDRETAEH
TIKLTQLKDMINKKYPKIEVETLLMGLNGEVEAIPTAVAKQG
>NE1370 Glycosyl transferase, family 2
MNQPTNLLKVPDTVENFQSGGLKEADESLSFQDQGELEYRRSVDQKLVSV
VAENRRLHALVDAQRQLHHLETHNALNVRLGNILIQAVDSPSTLLSVPGK
LLAIWRQSVRKKPPAVLGGKGFGKVISTYQEEGFAAVEKLLTRPSISPVM
QANAWTALSRHLMQNKRDHAAEAARRAYALDPKPYRLKWLAFRLHEAGDV
IEAAAMLELLPADTPFSSSETRQAGRLQKEAKLAREPGRELEKLKQYPAE
KVIRVLRQCSWFDAAWYFTQYPEVRESGADPVDHYFYHGAAEGKNPGPFF
DTSRYLEQHPEILETDLNPLLHYVEIGFYEGLKADPCNRWLRLFDKQITD
LGKTGNTAGHEKYTVVSAVYNVAAYLDDFFYSMTCQTLDFRSHIYLIMVD
DGSTDHSADIIRRWQASYPENILYLHKENGGQASARNLGLQQAHTEWVTF
IDPDDFVAPDYFQKVDDFLAQQAAKKTAEKLALLSCNFIFYRENQDSFSD
THPLKYRFAKGDRIFAADDLEKHLQLTVNSAFFRLSLLREQSRLFDERIK
PNFEDGHFVGCYLQPLKSKQVAFLVAPRYYYRNFTEANSMLDISWQKEER
FDEVMIYSYQALFDAYIARGYNVPKYVQRTVLYDLVWYLRKIVNRSETVA
FLAPSQKGRFLAHLDKLFSVITTDTIVEFELAGCWFFHKVGILSAFKQLK
PPFQIVYVESFDWARQQVLLRYFHGEQPAEEFIVNQKIVAPAFEKITAHD
FLERTFVQDRRVWLPLSQKGKLEVRLDNVAARISMGGKQVGKTIDVQAIR
EHFSALQPVREGKYAGAWLFMDRDTQADDNAEHLYRYVCRAMPQLPVYFL
LREESHDWPRLQAEGFQLIAFGSAEHRLAMQECTRMVSSQAAPYVVDPFI
DGGIKRRQFIFLQHGVTKDDLSRWLNSRRIDLLVTASPIEQVSFAEDGNR
YKFTSKEVILTGFPRHDALLQGTGQKEKLILIMPTWRKYLLGELIDKSSE
RIMREGFMDTLYARSWGGLLSEPALMALAQQRGYRIVFFPHANIQSYLDQ
FNLPAGIEVLSHMDGSIQTLFQRAALLITDYSSVAFEMAYLKRPVVYWQF
DEEEFFSGAHSYQKGYFDYRRDGFGPVCTEKQEVLTAMADLLARDCQPSD
EYAKRMHNTFAWRDGQCCERVLQAILALDHVVEESENVGIFQRIRTGLRR
LITRRLPDAAGDTVSSQ
>NE1057 DUF214
MASYEFLIGLRYTRAKRRNHFISFISLISLLGMTLGMTALITVMSVMNGF
HKEVRARILGVASHAQISSYQGSLYDWEQIREAAIQNPQIAAAAPYIDAQ
GMLSHDNMVRGVMVRGILPADENRVADFAGKMVEGSLDRLAPGEFGIIIG
QELARNLNAFPGNKIVLISPQGQITPAGILPRLKQFTVTGIFDVGHFEYD
SGLALIHLADAQKLYRMAPNQVSGIRLKLHDMFKAPQVIQDLARVLPADL
YFTDWTQQHANYFRAIQIEKRMLSLILALIIAVAAFNIVSTLVMAVTDKQ
SDIAILRTLGASSGSIMKIFIIQGALIGILGTLLGLLGGVLLAYNVGDVI
AFIEHLLSTQFLSQEVYYISKIPSDPQLADIVTVAVVSLILTLLATLYPS
YRASKVNPAEALRYE
>NE1822 possible ORF H1620
MIPQRNLSLISNTQVFAGGRRIPEAVIERDYVLAWFLTGLAGHPLRDVLA
FKGGTALRRCWFVDYRFSEDLDFTLIRPITLDEILAGLNDIFAVIENACG
LRIAFDREDRHGHQNSHTFYLRYQGPLPAANDVKVDITIDEVLCFPLVDR
PIHRAYDGFDDLPEGPTVKVYALEEIVIEKLLALSDRARNEPRDLYDLWY
LLNAADLRIAELRTELDAKLAHRQRTAAGIEQAIAAKEERLRRLWTTRLA
HQMSALPPFDDVFRNVLRILRAAGLPRAND
>NE1289 Esterase/lipase/thioesterase family active site
MTDPFFLSDSGLLESYRAPKWLPGGNAQTIFPYFINLSPIISYRRERWEM
DDGDFIDIDWLDGESDKPLVIMLHGLEGSSQSHYALSLMNLLQMLRWRGA
VVHFRGCSGYSNRLPRAYHAGDSMEIDRMLRHIAHRNDSHEWNTPCYVVG
VSLGGNALLKWLGEQGAQAARQIAGVVAVSVPLDLAAAGKVLDSGFNRVY
THHFLTTLKRKALEKNRQFPGLLNARAVAACRSLYEFDNLVTAPLHGFRD
TDDYWRQSSSKPWLGSVQVPTLLINARNDPFLPESVLPQKSEVSSFVSLE
FPQQGGHVGFIQGTFPGKLDWLPQRIIEFFSSLCGLDAIPG
>NE1860 conserved hypothetical protein
MVNPQFASNFYFDQRLNTEAVKLLPGEYFVTTRNMVLSTVLGSCVSACIY
DGQSGVGGMNHFMLPRGEDDQGLQVECARFGGYAMDILIRELLRNGAHKE
NLVSKVFGGGNVQRAMVSSMIGSQNTRFVKKYLAERGIPILAADLEGNHP
RKIHFFPKNGRVLQKKLYVSRNDTIVEREQNYSRLLLSVAQSAIQPGTRE
N
>NE2570 probable transmembrane protein
MIDWQSFTPASAFTGGMIIGLATALLLLITGRIAGISGIIGGLVELRRGD
FAWRAAFVSGLLLAPWLWQWLGELPPVHIETSHTVLALAGLAVGIGTRYG
SGCTSGHGVCGLSRLSPRSMVATVLFMIMGMMVVYVVRHSLS
>NE0779 Conserved hypothetical protein 46
MTARFFHTPPIRDAEWITLNSDTSHHAAQVLRLKPGDAVTLFDGTGGEFS
GRLEQISKSGCQVRIECHLPVERESPLMIELAQAVCANEKMDWIIQKAIE
QGATRIQPLITRRTLIRLTGERADKREQHWQKIIIAACEQCGRNQIPLLL
PLMPLSHWLEQKLAEKYKNDNPAGHDIMLSPAAHQRLVELSPPRTGECLT
LLTGPEGGFTGEETDAAHLAGFIPVRLGNRILRTESAALAAIAAAQTLWG
DY
>NE0547 Sigma factor, ECF subfamily
MPAGLSLRQFVAFHPYGKHSIIMLFLAHYALLSMTTSCTNDELFLQLRND
LLNFLYRKVGRDEAADMVQEVYLRWRKQDVSSIENPRAFLFTVALNLIRD
SSRQQIRVDKHTTEIQNFFEIGNAADDPARRYDGQRQLASLLHALNQLPE
PVGHAFLLFRYDGMTHAQIAKQLGVSSKTVERYIQRAGEHCLAILAEYR
>NE0270 hypothetical protein
MPEQITRKASPFCLRLTPEERTLLEREAAGLPLGEYIRQQVFDENRVKRR
SRNKQPVKDHRLLSQLLGELGRSRLANNLNQLARAANCGLLNLTPEVKTS
LLNACADIRHIRETLMKSLGLNR
>NE1406 putative AttH
MRYLWILLGWLAVQNMLFSAPPVLAPVVPGKALEFPQDFGAHNDFRIEWW
YVTGWLETPTGKPLGFQITFFRTATEIDRDNPSHFAPDQLIIAHVALSDP
AIGKLQHDQKIARAGFDLAYARTGNTDVKLDDWIFVRETDGRYRTRIEAE
DFTLTFILTPSQPLMLQGENGFSRKGPGAPQASYYYSEPHLQVSGIINRQ
GEDIPVTGTAWLDREWSSEYLDPNAAGWDWISANLDDGSALMAFQIRGKD
DSKIWAYAALRDASGHTRLFTPDQVSFHPIRTWRSARTQAVYPVATRVLT
GETEWQITPLMDDQELDSRASAGAVYWEGAVTFTRDGQPAGRGYMELTGY
VRPLSM
>NE0775 tRNA/rRNA methyltransferase (SpoU)
MRHITSREHPLFKKLFKLQHSARQRHDEGMTLLDGIHLLQSCLASGKTPV
LLIVSESGSQHPEISQLLIHIDKDPQKTDCLMLSDTLFNQISPVKTPVGI
LALIDIPQHTLPADTYQNSFSILLEGIQDPGNLGSMLRSAAAAGVEGVFL
SSDCADAWSPKTLRAGMGAHFSLKIHENVDLIQVARQFAGNVVATTLHDA
SNLFHTDLRGAVVFVFGNEGGGISEELLETVHQRVMIPMPGCTESLNVAT
AAAICLFEKVRQDTCAGDSHENSMGSR
>NE1746 Fimbrial protein pilin
MSRTRTYTGNSGFTLIEVMVVVAIVGILAAIAYPSYQEHVRRANRAEARG
ILLEMAQLLERNYTEANNYENFVLPVSQSPRTGTARYTVQFSADTLTRNT
YTLEAVPEGSMAGDACGTLTLTQTGVRGSDGVVAECWQR
>NE0264 conserved hypothetical protein
MSTTPIDAELDLMLKRELAVPVNLVWRGLTEPELLKKWFVPKPWSISDCR
VDLRPGGEFYTVMQDPEGNKFPNSGCFLEVTDEKRLIWTSALVKNYRPAV
PATTSDKECAHIVMTAVIELQPTSSGTRYTACAMHNTPGQRKLHEEMGFH
EGWGTTITQLEELLKQEKAY
>NE1934 Phosphofructokinase
MAIKNAFYAQSGGVTAVINASAAGLLETARAHSDKIGKVYAGRNGIIGAL
TEDLIDTSAESTEAIRALRHTPSGAFGSCRYKLKSLEDNRREYERLIEVF
KAHDIGYFFYNGGGDSADTCLKVSQLAGTLGYPIQAIHVPKTVDNDLPIT
DCCPGFGSVAKYIAVSMLEASFDVASMAKTSTKVFILEVMGRHAGWIAAA
GGLASNEDHEVPIIILFPEVTFDQEKFLAKTDHYVKEYGYCTVVVSEGVK
GTDGKFLSDQGLRDAFGHAQLGGVAPVIANIVKNGLGLKYHWGVADYLQR
AARHIASGTDVEQAYAMGKAAVEYAIAGHNSVMPTIERISSSPYAWKTGM
VELSKVANVEKMMPADFISEDGFGITGKCREYLSPLIEGEDYPPYKNGLP
DYVRLKNVAVTKKLAEFKI
>NE1081 probable ATP-binding/permease fusion ABC transporter
MVSTDLPPLIELRGIRKRYGGGDKPEVEVLHGIDLDIRAGEFIAIVGSSG
SGKSTLMHLLGCLDRPSSGSYRFAGEDVSTFGSDELAWLRRKAFGFVFQG
YHLIPTESARENVEIPAIYAGLPPGERMQRAADLLGRLGLSDKLNNRPNQ
LSGGQQQRVSIARALMNGGHIILADEPTGALDSRSGAEVMELLRELAGAG
HTVILITHDRDVAAQAQRVVEIRDGRIVADSVTDRQPSEQPLLHHAGLSS
LEMTQAHEDTGTPFWQGLHETIRAAWRVMWIHRVRTSLTLLGIVIGVASV
IVMLAIGEGTKQRVIDQMGSMGTTIMYMSSDVPSTGGPVGVITEEDLDEV
ARLPEISRVMPVIGDPILVRHQNVDKQIYVFSSPYIMPMVHHWRVAQGRF
FTETEDRELAPVVVLGHKIYRSFFPHLSNPVGQYLLIGTSPFEVIGVMAE
RGAESGSQNYDDMVFIPYRAGRARVYQAQEQPDYIVMEAASMDQVQEAEE
AIRALLLERHGREDFRIGNAAARLKTQLETRDTMTRMLGLVAAVSLLVGG
IGVMNVMLMTVRERTREIGIRMATGAREYDILSQFLIEAMLVTITGGTVG
VILGLTVGALLVFWEVPVVFSFGVMIGAFACAVITGLIFGYMPARTAARL
DPVVALSSE
>NE0776 hypothetical protein
MSGEKSGRNFIFYFYPSLFIAANALFIGMVLFLFYASMQPPHLLEDTVNT
SPYRQIASWFDPIFFQYNLVLLVFAVGIIPLITLCYTSSMREEKKRRLQR
ELPPAIYSANSNYIQNYLSKISSIRSYLGSMMSLMFVVMFGCMIILLLKP
APLALPDLAYANGVDYSKGANFLMLGTYMKSYMVGNRDYINVLVYTLTAF
QFGFLGGYVYFIGLMVRSYFTLDMTPNIFINSSVRMITGSLLAMVLSYFL
IDPDKFSEPDAILIRSLPVWSFFIGHFPDRGLVFLENIATKALGLVRIHE
FASPLSDLPGINYNQEILLKREGYDNIENLANANALDIALRTGFSYQQLV
QWISQARLHGHLRNDYHAFVNCTGIVSLDDFVHFYRTVKLQNSAADPIEL
IIASLKNEKHDLDDKIRILRYLADPRDLATDIPHSMRDENSAADQETISN
KS
>NE2091 conserved hypothetical protein
MAEYFFDLNKRDQREALEYGRAETGRPIHLLEKDIWVVWALRALFVSPLS
ADLTFKGGTSLSKVYKLIDRFSEDIDLTCDIRKLIPDLVGKGDELPASRS
QAGKWTQTVRHRLPDWIMQNVQPVVQAALVREQLNARLELGGSDNDKLFL
HYPALAQGTGYVAPVVTLEFGGRATGEPHQVFPITCDIAAHLADVSFPIA
SPLVMSVARTFWEKATAAHVFCAQGRIRSERYARHWHDLAAIARSPHFAA
VIADRTVAIAVARHKSYFFIEKDADGQAIDYIAATTGHLKIVPEGEAKAA
LARDYAAMLADEVMVGNALSFDALLKACADLEARVNRAAL
>NE2468 Protein of unknown function DUF107
MQDYIQRGLSRATERRAQVIILQIDTPGGLDVSMRKIIQEIIASPVPVVS
FVAPGGARAASAGTYILYASHIAAMAPATNLGAATPVKMNTLASKPEVPD
SQPKGEQPDKSDSQDAMTVKVINDAVAYIRGLAQMRGRNADWAEQAVREG
VSLTAAEALDKNVIDLVADDVSDLLARIDGRTVDLSGEKVKLSTRSLALE
RIEQDWRTRLLAVVADPNIAYILMLIGFYGLVLEFTTPGTIVAGVVGAIS
LLLALFAFQVLPINYAGLALILLGITFMLAEALLPGVGALGIGGIAAFVI
GSIMLIDTEVPDYGISIPLIATFTLMSAGFFMLVFGMVLKSRKKPVVSGS
EELVHSTGEVLESFDQEGWVRVHGERWRARTNTPLSPGQRIKVTAMRGLL
LEVEPCQPDHTMNATEN
>NE1156 Bacterial regulatory proteins, MerR family
MDYDILHGIVIEESEALSLSELCQICNVEVEWIMALVNEGIFEPAGTRPE
DWFFSGVALRRVLVVRHLQRDLDVNLSGAALVLELLEERNALLAKINLY
>NE1680 conserved hypothetical protein
MQIHVYDTYVKAKDGHVMHFDVFTDVRDDKKAIEFAKQWLSSIGEEGATV
TSEECRFCHSQKAPDEVIEAIKQNGYFIYKMEGCN
>NE0117 possible A. fulgidus predicted coding region AF1859
MSTHHMAEFSSPTLSLGRYRLDWQVTRSIRLPDYAGSMLRGAFGHALRSI
GCITREKDCTTCPLRRDCPYTILFEPVPPEHHPLQDFSRIPVPYIIEPPE
WGTRVLHPGDTLSFHFTLIGCALQELPIAILAWRRALARGIGPDDGTAEL
TSIVLEQPDSSIPVYTPENGQIEPHETRLACPPPPAATIRLHITTPLRLQ
NNGVPLKAQTVTERALLMALVRRFALISEFHGEAAWQPDFRHLGELTSSV
TGKRQLSWRDWQRYSSRQKQKMALGGLTGRWDLHGELAAFWPALWFGQWL
HAGKNASFGLGRYRIIAA
>NE0939 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1529 possible signal peptide protein
MMRRLLFIWLLTIAGVASATEGVRPFVLGSMTQIQEEHAGKPFVLFLWSL
TCTYCPTELKMLGEFKQQHPDLNLVLIAADTPDDEPEIVSHLADYGLNKV
ERWVFAEDMPERLRFEIDRRWYGEVPRTYFYDQSHQREAKTGLVGQEYVK
SWLARVGMDASSASK
>NE1693 putative transmembrane protein
MNGYETLIATLALTMGSSWASGINLYAALLILGLGGATGNIALPNELAVL
ENPFVIGAAAVMYLIQFFADKIPGVDSIWDAAHTFVRIPAGAMLAAGAVG
DVSPALEIAAGILGGGTAATSHATKTGTRLMINTSPEPVTNWTASISEDL
MVIAGLWTALNHPILFIILFIGFIGLAIWLLPKLWTLIRGLLMKMARFLR
ITSPPVSTGDSVGQEEK
>NE1503 Rieske iron-sulfur protein 2Fe-2S subunit
MSSFVQTPDTAVAQPQLSVDWYLDPRIFELEKQLLFDQGPGYVGHELMVP
EMGDYYVPATQNNARILVRNENGIELLSNICRHRQATILEGRGSSRNIVC
PIHRWTYAMDGKLLGAPHFSKNPCLNLGKTILQRWNGMIFAGKRDINRDL
AGMGSRNELDFSGYVLDRVQIDHYQCNWKTFIEVYLEDYHVDPFHPGLGH
FVNTRQLEWEFGDWYSVQTVGVNPDFSHAGSEVYQKWHAQVLQQNNQQIP
RHGAIWLLYYPNVMLEWYPNTLVVSTLLPTGIEQCMNIVEFYYPEEIALF
EREFVEREQAAYRETAREDDEICRRITAGRRALYEQGVNETGPYQQPMEA
GMEHFHRFLRREIESHLY
>NE1143 SecD/SecF/SecDF export membrane proteins
MNRYPLWKYIVIAVSILLGLLYTLPNFFGESPAVQISPLRAGIQTDMALL
QRAEQVLKQHGLPYNGTILESTGIKIRFSDTDTQIKARDILDAEVGSDYI
VALNLLPNSPQWMRHIGALPMYLGLDLRGGVHFMLQVDMAGALTKALDRY
NADLRSTLREQRIPSAGIDRDRSRIIIKFRDEQAREKAISELGKTFEDLA
FRSEDVNGERRLVASIKPEALIRMQNTAVQRNITTLRNRVNELGVAEPII
QQAGADRVIVQLPGVQDTAKAKDILGRTATLEVRMVDEDGDIDAALRGSV
PSGTELLSERGGGHLLVKKQVLLTGDRITDAQPGFDQDRQAAVHVTLDNN
GSRIFKQLTRNNVGKRMAILLVEKNVTEVVTAPVIREEIGGGRVQISGQM
NSIEARDVALLLRAGALAAPMEIIEERTVGPSLGAENIERGFNSTLYGFL
IVSVFIMIYYSAIGVISVVALATNLLFLIGLLSILQATLTLPGMAGMALT
VGMAIDANVLINERIRDELRHGATPQAAIHAGYERAWDAIIDTNLTTLLA
GLGLMMFGSGPIKGFAVVLCLGILTTLFSAVTVSRAMVNLFYGNRKLTHV
PIGAIKILKKSSSKR
>NE2567 conserved hypothetical protein
MSVASETNWKKLDEKIAISGQISVDDVAAIAAAGYKSIICNRPDGEGGEH
QPGSTELEEAAKAAGLQFAYLPVEIGQVSDEKCSAFHQLMATLPGPVLAF
CNSGNRARALYSRDVGTTTTPAETISAACDWEHEAAAVTEAESEAAGAAA
VSAASSRDEAAGKSIPVTPACNWDNAFDIVVVGGGSAGLGVTASLLRRRS
SLRIAIIEPNDKHYYQPAWTLVGGGAYAVDQTVRNTADVIPHGAEWIKAE
VSGFSPNDNLVHLADGRTIGYQQLIVCPGIRLAWEKIEGIQETLGKNGVT
SNYLFDLAPYTWSLTQQLKGGKALFTQPPMPIKCAGAPQKAMYLSCDYWQ
RQGVLDKIEVEFDSAGAVLFGVADFVPPLMEYVRKYHANLVFNSNLVKVN
GPEKIATFEIKNEAGEVTRVDKPFDMLHVTPPQAAPDFLRDSPLADASGY
CEVNPKTLQHTRFANIFSLGDACSSPNTKTAAAVRKQIVIVAENLLAAKD
GREFHAVYDGYGACPLTVENGKVVLAEFGFGGKLLPTFPLNPAVARKLYW
WFKVKLFPWLYWEGMLKGREWLTRSTETK
>NE1736 putative transport system permease
MQSHSTDRPDRQITLISRFVRLILQSNLLWFCLAWLLIILLWELGSYMEW
LNPQILPPPSETIPYAFSGNISIGFGQQRTGLFEATAITLARVGIGMLAG
LLVSSCLAVMVIELPLLRRLVLPIVQSLAPIAPVAWIPFTIAVIGIGGQA
AVFIVFMAVLGTMSLSLIAALDGIPAEYLKIARNLGTSRMRMWWHVRLPA
ITPGAMTAIRMSFFGAWMAVLAGEMAGINSGLGYMIIMAQQMYNMKLVMI
GILAIGFIGFAIDRLLLLANARLLQWVQ
>NE0588 Phage integrase:Chain length determinant protein
MKQESPRLENTTPVEEKMTRSLLDHLTRLKPTRLVIFTSVFLSCLLLSLG
FVFLQPATYQSYATLLTVAKTAIDQASREADIQHVAIQKQVLLGSELLTE
TADRLRKGDDAEAGINLTHAAIRDMLDVRPVDDTNLVEMVAEGNEPGLLP
RLINTWIDVYLDARSAEIARSKGSTVEMMQEELVALTEKIQQKRLELEQF
RQHNEIVSTERQDNEALARLKGLTDSLNKASEEEVKAKARLDTIRKSIER
GQIVVPNEDTRTLSALERRAQELREQLEELDRQYTREYMALSPDLKVIPD
KLRALEDEIQHIRRKGQTVVVSDAEQEYAAARQATRSIQEQLDQHRKKAA
EFTARFTEHDALKSDLEELEQLYRDTQARLAQIEAQHTGKYPYVDVIERA
FLPHHPIGPDYLWNAMIATLASLLLGLTAIWILEFLMHKEQEKLAIHLSG
IHLHNQDNLPHSAFDVLPSTSVNLPRQPAQALEHTSMNGLTPQQIATLFQ
TANIREKQLLLLLLSGLTPDEITHLQQDDIDLEHNSLTVRSVPARTIPLH
PVLAALYGKYGHCLTDPAGNRLNQEELEALLICLQIDADLSADNQISAEI
LRQTYILYLVRQGIRLGDLESVTGYISPTELSGYGTYAPAGTRYPVESVN
LFYPIDYKQDV
>NE2311 possible helicase (Snf2/Rad54 family)
MVQQLLTPHQSQYIAWQLTRRAAKDSVESLASTLVDSQVDLNPHQVDAAL
FACRNPLSRGVILADEVGLGKTIEAGLVISQHWAERRRKMLIIVPANLRK
QWHQELQDKFNLQGLVLEAKNYNAMRKEGVTQPFLHAGGPIICSYQFAKA
KADDLRRIHWDLVVMDEAHRLRNVYKNGNVIARTIRDALEHVDAKVLLTA
TPLQNTLLELYGLVSMIDERVFGDLDSFRTQFSGVRTEQSNRALRERLTP
LCKRTLRRQVQQYVPYTARIAIVEEFTPSQEEQQLSALVADYLRRPNLKA
LPEGQRQLISLVLWKLLASSSHAIAGALETMANRLQGQLDELPDVPDLTE
SLDDDYEGLDETADEWNGATANDADASANERAAIADEAAELRRFKELATS
IRQNAKGQALLTALDKAFAELERLGASKKAIIFTESKRTQNYLLSLLAET
PYGIVLFNGTNTDARAQAIYKDWLQRHEGSDRITGSKTADTRAALVEHFK
ERGTIMIATEAGAEGINLQFCSLVINYDLPWNPQRIEQRIGRCHRYGQKH
DVVVVNFVDRSNEADARVYQLLSQKFKLFEGVFGASDEVLGAIGSGVDFE
RRIAAIYQNCREPEEIRSRFEDLQRELSSEIDEAMLRTRQLLLENFDEEV
QEKLRIHSQDSQAVLNKYERLLMDLSRTELRDHARFDTAEEVNGFVLHSL
PDGLGLATGSREQAVMAGRYELPRRSGDAHLYRMGHPLAEWAIERAKARD
LQAPARLAFDYAAYGKRLVSLEKWRGQCGWLSVTLLSVETLNDQEQHLVV
SACTQAGEALPEDDPEKLLRLPAQVEGDAHLQVCAELVANVESRKSVLLR
GINQRNLGYFEQEVQKLDTWADDLKLGLEQEIKAIDGEIKEVRRTAAASP
TLEEKLAHQKRQRELETRRSKLRRDLFARQDEVEEQRNKLIGELEEQLKQ
QVAERMLFTVEWELT
>NE0599 conserved hypothetical protein
MLYSSNAILLTVICYELPLSERIRMLLRLEDLFDKIDFFSARDTSFEHHA
VLVALFEILDVTSRSDLKSDLLQELDKQRTMLEGLRSNPEVSEKALDHIL
QDIRAAFRGLLDIPGRIGGHLRDDEWLMSVKQRMSIPGAACEFDLPAYHY
WLNLAPEIRREDLKDWITPFTPIRSGINIVLNMLRNSGKNCCYTAVQGLF
QQAGSEHQAHLLRLHISSEFPCVPEISANKYALNIRFVPWRSDHKTEVYE
EDIPFELTFCSL
>NE0720 ABC transporter, fused permease and ATPase domains
MSNHSSAPLIRLLRYGHAFRGRIWLATLFSILNKLFDLAPPVLIGMAVDI
VVNQQSSVFARLGFPEVSTQLWILATITLIVWGFESIWEYLLKLCWRNLA
QSIQHTLRLDAYDHVQKLELAYFEARSTGGMMAILNDDINQLERFLDVGA
NDLIQLLTTVITVGGMFIYLTPGVAWMALLPMPFVVAGSYLFQKRIAPRY
MAVREQVSILNSYLSNNLGGIATIKSYTTEPYELRQIERESQIYQEKNQH
AIALSAAFVPIIRMVIALGFSAIMIAAGLMAAGGELNVGAYSILVFMTQR
LLWPLTRLGETLDLYQRAMASTHRVLDLLQAQPTITDGPQTLPAAQVSGE
IIFDDISFNYERRASTLNHLSLTVHAGQTIALVGATGSGKSTIVKLLLRF
YEPQQGRILLDGHDIRDLRLHDLRKSIGLVSQDVYLFHGTVYENIAYGSS
GAHMEDVIAAARTAEAHDFIMELPDDYQTIVGERGQKLSGGQRQRISIAR
AVLKNPPVLVLDEATSAVDNETEAAIQRSLERITVGRTTIVIAHRLSTIR
NAHCIYVLDKGEVVETGTHEALLARNGLYTNLWQVQTGEKMRAHG
>NE2451 putative death on curing protein
MIEPIWIDEQVALAIHERLISLHSGASGVRDKELLKSALARPLNLLAYDQ
QADVIHLAAAYTAGILQNHPFVDRNKRTGFVVGVLFLELNGYRFTAAEED
SAQAVIALAAGSLDEARFKLFLADNSIPV
>NE1128 hypothetical protein
MNDIEAAFHQYFEIIDADTPELLKTVFDLRYRILCVHNVIPGFDTNNYPN
ELESDQYDSHSIHFLLRHRPTNTFIGTTRLILPNPLDPMDKFPTELNTHF
YPGFVLDSSSRKHTTEVSRFAILSDFFKRKGERNMLSQSTEIGCKAQERR
RFPHPMLGLVVGLIQLCARNNIYHLISAMEPALNKLLGFYSLQMNPIGPP
ADYHGLRTPYYLYLPDLLDRMYQDHRSLWELITDHGRIWPMNLACIHQKT
LKTAYTDNVYISE
>NE1754 Purine and other phosphorylases family 2
MLAIIGGRSMNKLAGLEVTHRQVMRTPYGEPSGALIFGTIGTREIVFLSR
HGHGLTIPPHAVNYRANIWVLSTLKIKTIIAVASVGGIRKDMGPGKIVVP
DQIIDYTHSREATFHGRSNGTIIHTDFTQPYCSQTRTSLLQAARDAGESV
IDGGVYAATQGPRFETAAEINRLERDGADLVGMTGMPEAILARELGISYA
TIAAVANHAAGRGESIQAIPLQAAHLALENAMGSVRNILEHLVRNYDD
>NE1018 Uncharacterized protein family UPF0005
MQFNPKYATPVSNTVNSGVRNKVLKNTYLMLSLTMIPTIVGSVIGTGTNF
SFLAQSPIVGSLVMLAVMIGLMFAVSATRNSMWGIILLFLFTFVAGWWLG
PLLQYALHFKNGSQLIGLAAAGTGIIFFTLAGIATTTRKDFSFLGNFLLA
GIILVILASLVNLFLAIPAISLAISAVAVLVFSGFILFDVNRIVNGGETN
YVMATLGIYLSLYNLFISLIQLLLAFLGEKD
>NE2030 ADP-glucose pyrophosphorylase
MKVQPAVQTNDNPRFVSTLTRNTLALILAGGRGTRLKNLTDWRAKPAVPF
GGKFRIIDFTLSNCVNSGVRRIGVVTQYKAQSLIRHIQRGWSFLDGRFQE
FIELLPAQQRTEEGTWYQGTADAVFQNLDILRTHNPGYVLILGGDHIYKM
DYGRILAEHVERQADLTIACLEVPVEDASAFGVMAVDDSWRTTSFAEKPE
HPAPIPGKPGHALISMGIYVFNAKFLYEQLIQDHDMDQSSHDFGKDVIPR
LVASNARVYAHRFQNSCVNMASGVPYWRDVGTVDAYWKANIDLTTITPDL
NLYDEDWPIWTHQEQLPPAKFVFDDDDRRGQALDSMVSGGCIISGATVRR
SLLFSNVQIRGYSTIEDSVILPNVSIDRHAYLKRVVVEKECQIPEGLKVG
FNPDEDRKHFYVTDDGITLITPEMLGQGIHYIR
>NE0977 putative nitrate transport system permease protein
MKTQHASARLSHRFSAIFWDLVAVLLVLGMVVFLAQTSRDLMQPLAGPGE
TVISLEPSGLPEYVLRTSLRMLAAMVLSLIFTFTYATWAAKSRRAGQVLI
PLLDILQSVPILGFISVTVVFFMSFAPGRMLGAELVAIFAIFTSQAWNMA
FSFYQSLRTVPTELVEASRSFRLSPWMRFWKLEVPFAMPALVWNMMMSMS
AGWFFVVASEAISVGNTVVTLPGIGSYIALAIDQRDLGAVGRAIAVMLIV
ILIYDQLLFRPLVAWVDRFRFEQEADRNPPRSWVLMALRRSRWVAMAVVP
LVRMWRWTYRAGNDGASRELARTARNTMGSGVISWLDWLWYGAIAVICAV
SMWKITVFVLQGVTPDEVGTTVFLGCVTMVRVFVLIAVASVIWVPVGVWI
GLHPAAARIIQPVTQFMAAFPANLLFPLAVSGIVVLRLDPDIWLSPLMIL
GTQWYILFNVIAGASTIPGELRDIGINLGVRGWLWWRRIALPAVFPFYMT
GAVTASGGSWNASIVAELANWGDTTLKANGLGAYIAEATAAGDFHRIVLG
IAVMSAFVVLINRLFWRPLYILAERKFRLG
>NE2308 conserved hypothetical protein
MSKKVTAIPPDYIEWLSDIKSRVTAARQRTVLAANAELIQLYWQIGRDIL
QRQQSANWGDKVLDRLASDLRAAFPEMKGFSSRNLKYMRYFAEHCPRQAF
GQQPAAQLPWFHVVTLLTKLADTGAREWYAQRAIAEGWSRTSLELSIRNR
LHERQGQAVTNFGVRLPAPHSELAHEALKDPYLFDFLGLGDKAHEREIEN
ALVRHITQFLLELGNGFAFVGRQFRLEVSDKEFFIDLLFYHTRLKCYVVV
ELKATEFKPEHAGQLNFYLTAVDRQVKAPDDNPTIGLLLCKTQDRLVAEY
ALHGIDKPIGVAEYELVRALPERLVTSLPTVEELENELLIAQETKT
>NE2215 NUDIX hydrolase
MAGPSLVEVAAAVLIRPDDSFLLACRPDGKPYAGYWEFPGGKIETGESPL
QALARELDEELGITVRQATPWLTRTFSYPHATVRLRFYRVTDWHGELTPR
ERQQFAWQTAENITVSPLLPANTPILRSLALPSIYAVTRTVETDIEASLL
SIGQTLGNGVKLLQIRQKAMSHDQLEHFSRTALQLARSYQATVLINENIP
LAQTLQADGVHLTAAQLRSLHSRPAVNWCGASCHNEQELQRAERLGVDFV
TLSPVQPTLSHPGAAALGWKEFAALIRDYPLPVYALGGLLPADLETAREY
GAHGIAMMRSI
>NE0354 possible TonB protein
MTKIRHKPALTGLSLVSFLVVVATHAAVLYGLWHHRFSTTTPDTITLYAQ
FIAPPEKQEEAAKAPKVELKAEHAPPQVKPTPPVKKVQPQPKHQLVAKTP
AVTPQEYVAPPPVEEHEPEPEPEKKTVSESVIAAKPAQMPTGPVTLSSEL
SVSCPDLASPAYPALSRRLGEEGKLVLQVELDETGRIGKAKIVQSSGYSR
LDNAALSAVKTWRCRPATRNGHPVPAVALQPFNFVIEGS
>NE0488 hypothetical protein
MIRSRCDRWPSTKPGQLRRIALSATLPRKERRGIERITMQRQSEQEQLFS
IFIVFTRPLGRGDPENLNKVAYLNIAVTDC
>NE2431 PIN (PilT N terminus) domain
MNVVDSSAWLSYFAGDANAPVFTGPVEQISQLLVPSITITEVFKNVLHQR
GEEAALVVVAHMEQGRVVPLDSELAMDAAKFGVLYKLPLADSIIFATAHK
YGATLWTQDNDFEGLLNVKYVPKSGI
>NE2159 TPR repeat
MSMKILKASVFVIIVFFLMPLPVVAAKKGIYCGELKGSHYGPFDYMDRFN
HSEQLKIVEDFHFTSDVEDLIRGSTSSTPAKDLNYTLHAWPNHHRALVSL
FKYSIKEKSTRIKGLKYPVECYFDRAIRMNMKDVQVRSIYSAFLSHRGRN
KEALEQLEVAANLEPDNATILYNLGLLYFKQKNYEKASHYAEKAYALDYP
LPGLRNKLIQAGKWRGSASGRSGK
>NE0650 NAD binding site:D-amino acid oxidase
MASDFIVIGAGIIGLSTALRLLEEGASVTLLDRREAGCEASWAGGGILSP
LYPWNYSDSVTRLAAYSMSRYPEWTAALNTATGIDPEYQICGLVILPPCD
PEAAVNWCSAHAVRLTYLSSSHHGGLSWAHINDEQGNAQALFLPEVAQVR
NPRLLQALYKRIRQLGGKIIEHCEVQELEISGQKVHAIRTPSEKHSADQY
ILSAGAWSRKILGEHALNLAIKPIRGQMLLYRLPGNPLCSIVLQRDLYLI
PRRDGHLLVGSTIEDTGFDKQITLDAKNRLSSWAEEILPQLKNTPLLKHW
SGLRPATPDNIPIIGPHPFLENLYINSGHFRYGVTMAPGSAEILVNEILK
RTQPFDVTPYQRGWHPSE
>NE0166 conserved hypothetical protein
MKKLYTANHLLEAHIVRDLLENAYIPTRLFNEYAQGGMGEISFTHTYPEV
WVMRDLDFERGRKIIAAYEQAPQVTDIVFCLQCGEENPGNFQLCWQCGSG
LEVAREKS
>NE1207 Bacterial transferase hexapeptide repeat
MIRKNPRGDQPIVHESAFVDPTAILCGRIVVHENVFIGPYAVIRADEVDE
TGHMEPITVGAHSNIQDGVVIHSKSGAAVTIGERTSIAHRAIVHGPCTVG
PDVFIGFNSVLFNCTIGEGCVVRHNAVVDGCDLPPGFYVPSTQRIGPRTD
LSTIPKVSPKASEFSEDVARTNNTLVQGYKHIQNEL
>NE0495 hypothetical protein
MSLFSPEQLKKASHIKRLSPVLFINTYLAHCKISQRANRANPAEVQTASP
VELDCLNFHGLRESQRDMAIQSLYRLSLWIAALRTPKTQQIQPPQSLRTL
QG
>NE0147 conserved hypothetical protein
MKDNELDERDGLARDQKSDLIRDAEPVETSDQAVSLQAEPVGTMPYSPAQ
AGDPAEAATTGGTAASGFGQILRDARIRRGMNVGEVAHRLRLSEQQVEAI
EAQDFSRLPAAVFLRGYIRNYANLLQLDDVPLLMEAVPQARPVDTVFASK
RNAQRFKAIEPVYRSGRSSRGGWLYIAVILAALAAYGIYRDEVPEQLASF
SAGDTDQVMSSISADGNDQVAIDLALPLSSSSSGLPLVTPAPAGASVPSV
ATTLPELPAPSVPAVEVPKASDDGKKSLHFSFSRDSWVKIKDSGGRVILE
KTHSRGTEQTIEGKPPLYLVIGNAAGVSLTYNGRKVDLAPYTRGNDDVAR
FSLE
>NE1555 hypothetical protein
MDGTKAPPVSSTLEGIMNTAPELIIARKAIAKIAIRFPSLTMIEEPTVPV
ELSIRLPVQPGLNYEVWLALQNNDELHFSVGNFWLEWFPCTESSRVKEYI
SAVTGFLSSQYRVLEHYRGKHCVKAELQAPSGGDWKTVGTWSNLLSFLPL
RSSLREVSNTQPIIPPDLPQQAAPDR
>NE1652 possible dolichol monophosphate mannose synthase
MSFSLIIPTLNEAENIDPLLSGIFSLDTLGSQFEVIIVDDGSTDGTPDKV
RQWQNTHNVRLIERRAQPDLTASILDGTAVARYEVIAVMDADLSHPPDKL
AALIQPILDGTHDITIGSRYIAGGNTVGWPFYRLWLSRAGGWLARIICDV
NDATSGFFAFRRELIGNITKNARGYKILLELLMTGQGRLRVKEIPISFRD
RLHGNSKLSLTHHLVYLLQLITLAGGTVSFHTASRFAIVGFLGVFVDVLI
FQWMTGHGASLASAHISSFIAAASFNYTINSKWSFRLHHAGHLHWQQFIR
FLTVGVFALLLRGAILAWLVYTWGISAEWAIFPAILTTAMVNYLGSAFYV
FPRTEPPVSTDTRWRVASVGIVCFVILLRLAYMDVAQLIPDEAYYWQYAQ
HIDLSFFDHPPMVAWLIWLGTTLAGHNEFGVRIGAFLCGLIAMAYVFALA
RNLYDKPTAMRSVLLFAIVPFGFATGLAMTPDAPLIAAWAATLYYMERAL
LANQSQAWLGVGIAFGLGILSKYTLGLLGIAALIYAVLDPVARRWLLRPH
PYLAILIALILFSPVIIWNMENDWASFLFQTGRVGNTQHTFATHLLILQI
ATVLTPVGLIAAVFALRSISLSYCQAPRKYLFIGIFTGVPLLVFLAISTF
DPLRFHWTSPLWLALIPTMAWMIGQNDVPMGLSARLRAAWQPMIVLSLLT
YAFILHYVTLGIPGLPYQFLSEHYFWKEATAEVEKIADEIRQTTGEMPIV
IGMSKWSTASALAFYKDTPALDIRSRNLFGQHAAMYSFWNRVEPPSDQPI
LLIGMKQQALESSNATGNFENGLINLGKIHSRSIYRHDGVLLRTLYYRVA
AGYRGNFNPQPAPADARTRQYSPEIVRFNGYDQRTTLSAISY
>NE0278 Coproporphyrinogen III oxidase and related FeS oxidoreductases
MPVLSVPDSKSPHLAELPPLGLYIHLPWCVRKCPYCDFNSHAINSGGDLP
ESEYVTALKRDLELALPLIWGRPVTSVFFGGGTPSLFSATAIDAILSHVR
MLLPLAPGVEITLEANPGTLEAQKFTDFHSAGINRLSIGIQSFNSRHLQA
LGRIHDGAEALRAAELALQHFSNVNLDLMYALPGQTRDEALQDIETACNR
GVAHISAYHLTIEPNTPFYRYPPQLPDEDTSADMQVMIEHILAERGYRHY
ETSAFAQPGKPCLHNMNYWQYGDYLGIGAGAHSKISFRDRIIRQMRHKHP
RQYLEQAMAGNACLENLLATEQEVGPQDRIFEFMMNALRLTEGFDPGLFH
TRTGLTIASIQKSLAEAEQRGLIEWQHDCIRPTPEGRRFLNDLLEIFLS
>NE0129 putative lipoprotein
MMISRAIAIVVIASASMVMVSSCSVARHQESVGEYVDGSIITTEVKAKLA
NDPGTSAANINVKTIEGGEVQLSGFTKSQAEKNRAGELARTVKGVTRVHN
NLVVKP
>NE1426 Rubredoxin:Rubredoxin-type Fe(Cys)4 protein
MTTEDNQVYNRYMCMICGFIYDEAEGWPQDGIPAGTRWEDVPMNWTCPEC
GARKEDFEMVKV
>NE0880 probable ATP-dependent DNA helicase-related protein
MSDLNTVFSADGLLARNIPDYRPRTQQLEMAQAIAQAIESQEVLVTEAGT
GTGKTYAYLVPALLSGGKVILSTGTKTLQDQLFQRDIPTVRAALKIPVTI
ALLKGRANYICHYHLERTLNSDHIHFASRTEVKYLNLIERYAGTSSHGDK
SGLDKVPEQAAIWQHVTSTRENCLGSDCPHYRQCFVMEARKRALSADIVV
VNHHLFFADVMLRDEGLSELLPACNTVIFDEAHQLPEVASLFFGESVSTG
QIQVLVRDTDTEALLEAKDFAPLFDATAAVGKAVLDLHLTITEKHTRMSS
ASAARYPGFSEARQVLQEKLVLLAGLLETQAVRSQGLQNCWLRAQTLLNR
IRQWHEQSESREFICWVETYSQSLQFNTTPLSVAETFSKQLDASARAWIF
TSATLSVKKDFSHYNRMMGLFEAKTANWDSPFDFPNQALLYVPSQLPDPN
TPHYTESIVQAVLPVIKASQGRAFILCTSLRNMQQIHELLQVAFQREQLE
FPLLLQGQEARSALLNQFRQLGNAVLVGSQSFWEGIDVKGNALSLVIIDR
LPFASPDDPVLSARIEKFTREGRNAFMEYQLPHAIISLKQGAGRLIRDEK
DRGVLMICDPRLVSKPYGKQIWQSLPPMKRTRDPDEVLRFLENVDQ
>NE1162 hypothetical protein
MDLAQKSDAEILAVATPIMDNLMDASTAIDYERHTRDFTERARSVLSEES
LQSICEHYQSTKGFFAKREFVAAFRRPDSVAIVWRQQFTKQPGEFVAELI
LVQQGGKYLVDHVMVF
>NE1346 DUF172
MYLFYTCTIYCANEVAMKVVTYSHARNALKSILDDVIQDADVIVISRRDA
EGDAVVMSLDSYNSIMETLHLTSNPANAAALAKAIAQDKAGQAQDHPLLS
AD
>NE2472 conserved hypothetical protein
MLASMTGYAAVSREIPQGSLALELRSVNNRYLDLQFRLPDELRALESEMR
DFLGSGLTRGKVDCRLTFSSYSNSYRQQRLNRDLLQDLQIMSDEIKSIFP
AAGDFSVAEILRWPGVLDSDHISIDDLRTPCMALLQTALQELVSARKREG
EKLHNLLLERVQRMRQLVLDLLPGLPAILASFQERLLNSLKAAGLDEKDE
RIRQEFTLFANRVDVDEELSRLQGHLNEFEHILLAGGVVGKRLDFLTQEL
NREANTLASKSVAREVTRIAVEMKVLIEQLREQIQNIE
>NE0288 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNR
>NE0413 Ribosomal protein L5:Mitochondrial ribosomal protein L5
MARLLEYYRDTVTKELTQKFNYKTVMEVPKITKITLNMGVGEASTDKKII
ENAVSDLEKISAQKPVVTKARKSIATFKVRQGYPVGCKVTLRGVKMYEFL
DRLVNVAIPRIRDFRGIQVKGFDGRGNFNMGIKEQIIFPEIDYDKIDKVR
GLNITVTTTAKTDQEARALLSAFKFPFKN
>NE2449 hypothetical protein
MLHYTRQSGKLMFKVILLSFIVLIVIPFDSFAQTRCRPDSLQNTTCRDSN
GNTMRSRTDSLGNTTYRDSDGNTIHSRTDSLGNTTYRDNRGNTVRCHKDA
LGNTICR
>NE0717 Cold-shock DNA-binding domain
MRYQGRITTWKDDKGFGFVTPNGGGEQIFVHINSFSSRQRRPEGNELVTY
ELTVDSKGRSQAKAVAFVGEQPTPPKAPSRSSLPPLFAVCFLIFVVGVVV
AGRLPSPVLAFYTIASIVAFFAYAFDKSAALRNQWRIQESTLHLFALLGG
WPGALAAQRLLRHKSAKASFQTTFWVTVILNCGALGWLLSPSGTRTLNSL
LGTA
>NE0604 possible ubiE; ubiquinone/menaquinone biosynthesis methyltransferase
MAGYVGDVAIAYDRDLGHVLFEQYASDIARRTAGKPVRDVLEVASGTGIV
TRQLRNVLPGDAQLTAIDISDSMMEVARTKFLPHEQVTFQVANAVALPFD
DRAFDTVVCQFGVMFFDKDKAFQETHRVLRQGGRYLFSVWDSRDYNPYAS
LTFEVMKQFFPSDPPRFLESTVSSFEIDPVKERLIRAGFEQISISVQRRI
YDIPDIRAFARGLIFSPIINEIRERGEVDPDDIVEALVKIFIGEYGSNPT
RFPMQAILFETEKP
>NE0744 conserved hypothetical protein
MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP
VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA
ALIEGLTVRASARQCRIDKNTSFRWRHRFLTLPAAAKANHLEGIVEADET
FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML
DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV
RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL
SSPIVPLQAALGRENQFQLLTNT
>NE0794 Glycosyl transferases group 1
MRILHILDHSIPLHSGYTFRTLAILKEQRDLGWETFHVTSAKHTGSVALE
EEVDGWHFYRTPEHSRFASRLPILNQIAVINGLARRVADIVRIIKPDLLH
AHSPALNAIPALIAARKASIPVIYEVRAFWEDAAVDHGTSSEWGVRYRLT
RGLENFALRRVDAITTICEGLRNDMIGRGIAPEKITTIPNAVDIEKFAAS
RPANESLKEQFGLKSKRIIGFIGSFYAYEGLDILLRALPIMLLQHPDLRV
LLVGGGPQDGQLRQLASQLGIGDKVIFTGRVPHDKVQDYYDLIDVLVYPR
LSMRLTDLVTPLKPLEAMAQGRVLAASDVGGHQELIRDNHNGILFKSGDP
NSLAQKVGELINVPGRWEDLRRAGREFVETERNWRKSVAKYQFIYSNLLN
K
>NE1851 YER057c/YjgF/UK114 family
MSKFIIQTNDAPQAIGTYSQAVRVTGGETVYLSGQIGLDPVSMEMVAGVD
AQIEQVIANLKAVIAASGGSLGDVVKLNVYLTDLGNFSRVNEIMGKHFSQ
PYPARAAIGVAALPRDALVEMDAVVVLDK
>NE2205 Short-chain dehydrogenase/reductase (SDR) superfamily
MGFLAGRQILITGLLSNRSIAYGIAKAMQREGAKLAFTYQSDDLRERISK
LAAEFNSDLLFRCDVSQDEEISRCFSELADHWDGLDGIVHSIAFAPRTAL
AGDYLESVDRQAFHIAHDISSYSFAALAKAGLPLMQGRPAALLTLSYLGA
VRVMPSYNVMGLAKASLEANVRFMASSLGKQGIRVNAISAGPIKTLAAAG
IGNFGKLLGYSEKVSPLKRNVTTEEVGNVAAFLCSDLASGITGEITYVDA
GFNTVAFNLDAEGQ
>NE2320 Glycoside hydrolase family 3, N terminal
MSLGPLMLDIAGTELTETDRVRLSHPLVGGVILFARNYASPAQLAGLTAE
IHALRYPSLLIAVDQEGGRVQRFRDGFARLPPMRVLGEICDRNPDRAHHL
ASQAGYVLAAELKACGVDLSFTPVLDLDYGQSCVIGDRAFHREPQVVADL
ACALMNGLQSAGMAAVGKHFPGHGAIRADTHVETAIDTRSYTDIEKEDLI
PFRRMIDAGLSGIMAAHVIYPAIDQHSAGFSSRWLQRILRHDLGFEGCIF
SDDLGMQAARNYGSITRRAEQALQAGCDMVLVCNDADAADELLGSLHWEF
SAASLARVECMRGQHMIHSMAQLHEMERFLQAADEISKIDPAGISVSL
>NE1298 TPR repeat
MIVCYQRRNDKFMKNAQCTGWIPATLKLLYRALPGLCFFMTTGAVSIAFA
EEVCNAPVARMVSLQGVVEYHRPGDSGWHMAASNSTFCAGDRVRVRANSR
AALRLSNESMLRLNQRTAITISGPDVEQNTLLDLMNGVMHIITRTPKPFK
IRTPVVNASVDGTEFLVDAGGEDDSSPSVTIAVYEGRVKAGSDQDNLILA
NQEAAVFQENQPARKTVMVHPLDAVQWALHYPMLIDLYSRSGHENQQSPA
VHHVIEQYRQGKLAEALAELDHLLAEELTVDALILRSELLLTTGRVKEAL
SDLQRTEQLEPGNSDALALHAMIFVVQNRKQEALALAGQAVRNNPASSAA
KLALSYAQQANFQIETALASAEEAVRLDSQNALAWARLAELQMSAGKSDH
ALQSAERAVSLDPDLSRTQTVLGFAHLLQIDTHRAQVAFARATVLDQADP
MPRLGIGIARIRENKLEAGRIDIETAASLDPANSLIRSYLGKAYFEEKRY
PLAGTQFDLAKARDPNDPTPWLYDAIQKQTQNRPVEALRDVQKSIELNDN
RAVYRSQLLLDQDQAARGSSLARIYDNLGFEKRALMETAKSLSFDPASHS
AHRFLSDAYANVPRHEIARVSELLQAQLLQPVNVNPVQPRMAVADLNLIT
GTGPSAPGFNEFAPLMERNKAQLVASGVVGNHGALGDEVVVSGVYDRASV
SVGQFHYQTDGFRPNNDQTHNIYNAFVQYAVTPDLNVQAELRRREKKHGD
LLMDFDPKKFSEVARLNLEEDTARIGAMYRISPRQNFLFSTIYTHQDADV
IEDLGSDFPFYGDQRSHGYQVEAQHILRKDRVNVITGGGIYRTNLTNDFR
KNTEPLICMMMGCEKSKADKEQNVAYLYSNLNILKNVMATLGFSYQAYSN
DAGGINRKVSEFDPKIGLQVDFHKNVRLRMAWFEALKRDLIGQATIEPTQ
IAGFNQFYDEMTGTKSRHKAVGLDIHFANAVYGGVEVSERDLDVPVIPEL
GVSDRDYYWDKQKEQLLRGYVYGTFRPNWVVAIEPEYEKFDRKERYADLP
TNIHTLRAPISVSYFDQNGFWAKLTGTYVMQDVKWARFDEDTWLGWIEKK
DSNFFLLDMVAGYRLPKRKGLLSFEVRNLLNKHFYYRNQYLYLSEPALPR
YIPERTLFARITLNF
>NE1651 DUF175
MPKSTRFLKLTSFVLIVILVGSTLFSVWFYRLATTPLNLPAVPSEFSIEP
GSGLHRIAGQLAEAGILSNEWSFILLAHITGYNASLKAGDYQLTEKLSPL
DLLKYLTRGKVRQYAITFLEGWTFSQFRKALDEHPALRHDSDKLNDSELL
RAIGAKESHAEGLFFPDTYFFTRNSSDLTILKRAYQAMQQHLETVWLARQ
EFLPLKDQYDGLILASIIEKETGADNERTQIAGVFINRLRHNMKLQTDPT
VIYGMGNKFDGNLRKIDLQTDHEYNTYTRFGLPPTPIAMPGLASIRAAFN
PAITDELYFVARGDGTSHFSSTLEEHNRAVLKYQKSSIKHSVH
>NE1072 TonB-dependent receptor protein
MLSGICFIQPAQAQNNAAGPVETNTRMFDIPAGPLDSALDRFARTAGVNL
TYDPALISNKVTRGLNGRHDISAGLMLLLAGNSIEVIRQPGGGYTLHKVP
ATGNLTTDGKAVLPTLTVKADSLKQGAAEEGYLVSNVSGIGLWDERSLRD
TPYSMSIVPSDLIENINAQNMGQIFKMNPLTQEPPIQNFTGIPMVVMRGF
QSTNPVYDGIPLALGNGSAVSVIEIDRVEILGGTSGFLYGGGRVGGAINY
VTKSPTSVPLRRLTFGNYGGTQFYGHADISGKFDDRGRFGYRANLLYQGG
DTAIGEPVKQKVANIVLDGHLTDTLYADVKYTYNDYSKEATQPTFNSVIE
RTGVDTSKRYSPKWASNDLERHKAYTSLRWDPSKYFTLRSAFFYQHTNMR
TNQIYLNEQANRTFIPRMLNAPWEKHENYGGYIYLDTRFSTFAVDHKLTL
GYSSTYYEYFLRGNGFVLRSGTSLSLNEVKNLSEPDWGESFGKGTVSSPW
IDQYNNVLIGDDITFNDQWSAMIGFNYATTMNKASGFYGDGIKYEKSALT
PSVSVLYKPIKDVTTYVSYMEALERGTRVGNTYTNFGEVFPPLVSKQYEV
GLKYDVNPNLLLSTALFRIEKANQFSNQAMPIPTYVQDGLQVHNGVELIL
TGKVTDNLTIMGGGTLMDISVEKANDPALEGKKPVNAASRMGKIYAEYAL
PWITGLSLTGGIYYTSERYGNAANTDKIPSYTLYDIGARYATRVLDKSLI
VRLNVINLTGKNYWQDANYLGVPRTVAFSVSTMF
>NE1856 hypothetical protein
MSTTNILKHLANLEKHLAEEHPDNPVLTKAVHSFRKLDGVAQALGLLELN
ESYATYVTWWPMIAVLGTFSSGKSTFINSYLNMTLQRTGNQAVDDKFTVI
CYSRHDEIKTLPGVALDADPRFPFYQISHSIDEIAEAGPQRIDAYIQLKT
CPSEQIRGKILIDSPGFDADSQRTSTLRLTQHIIDLSDLVLVFFDARHPE
PGAMKDTLEYLVAATINRPDANKFLYILNQIDVTAKEDNPEEVISAWQRS
LAQAGLLAGRFYRIYDKDAATPIENPQIRERFEQKREEDLADISTRMQQI
EIERSYRIAAMLEQTANTIENQVIGKLETALDQWRSQVLTLDGILVGLLA
ALTGIALTVTDSWHLLKECIDGISAGGILPIITIMLLLGVIGYIHFAVRK
SVARSILKQLPQEFGHDHNAAKQFSRAFSKSTAGYRPMFLKKPAGWNGTN
RRILAEVKEDANDYIQMLNDQFTDPSGHLEQEQNQPATEQA
>NE0162 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1279 hypothetical protein
MKSGLQCESPGRYFSYLSYICLVQLLSLHFESMKGESFMRNETQTTHSHS
KHEQHCQHVYETGQLRRAKMHVARLTGNFDSRKILSPSLLQLLETSIILE
TTDSKVLASRLKRKPAAIRADLQKICHLLAEEPRL
>NE1112 RND multidrug efflux transporter
MARFFIDRPVFAWVISLLIILAGLLSIRGLPVAQYPDIAPPVINVSATYP
GASARIVEESVTAIIEREMNGAPRLMYTSATSSPGSANINLTFRQGTDPD
LAAVEVQNRLKMVEARLPEVVRRNGIFIEKSADNIQLVVSITSNDNRLTE
IELGELASANILQALRRVEGVGKVQSFSPEYAMRIWPDPAKMTSLDLTAS
DLASAIRSHNARVVIGYIGDKAVPEHAPISANVAADKALATPEDFKNITL
RTKADGGAILLRDVARVEYGGSDYSFFSRVNGKNAAGMAVKMAPGSNAVA
TVERIRNTMDELSRYFPPGVSYQIPYDTSAFVEVSIQKVVNTLVEAVVLV
FLVMLLFMQSIRVTLIPTLVVPIALLGTFAVMYGLGFSINVLTMFGMVLA
IGIVVDDAIVVVENAERIIVEEGLSPYEATVKAMQQISGAIIGITVVLTS
VFLPMAFFSGAVGNIYRQFSLSLATSIAFSAFLALSLTPALCATLLKPVD
KEHHENKRGFFGWFNRTFISLTNRYQHRVGQILKRPVRWLVVYGIIIAAV
ALLFARLPTAFLPEEDQGDFMIMVMLPQGATMHETMNTLGEVSQYLQENE
PVKYVYEVGGFSFYGTGASSGMIFATLQDWEERKHADQHVRSIVERVNMH
FYGRPNLTVFALNTPPLPELGSSSGFDLRLQDRGNVGYKTLAAARDRLLA
EGGQHPVLTNLMFAGLAEAPVINLDINRRKAQALGVTMDEINATLAVMFG
SDYIGDFMHGNQVRRVIVQADGKNRIELDDIRKLRVRNAAGKLVPLSSFV
SLEWGMGLPQLTRYNGFPSFTINGNAAAGKSSGEAMREIEHIVAKLPNGI
GFDWSGQSLEERLSGDQAPFLFGLSVLIVFLALAALYESWLIPQAVILVV
PLGVLGAVLGVTLRDMPNDIYFKVGLIAMIGLSAKNAILIVEVAKELYDS
GMSLIDATLEAARLRLRPIIMTSLAFGAGVLPLALATGAASGAQTAIGTS
VFGGNVMATALAIFLVPLFFAVIGRYIKRSRHS
>NE0179 hypothetical protein
MQNDNHSTAQPEKDVARLTEEVREAIAHGGDIENAIRNLTLKAMHSNGLD
IESLKQIATAVMKGVQEGAQQKMTHAAEQSHAAQSQITQAVVGLDTAFAQ
LAGASKLALEEAASKAKQFSDSELTKAQADLKDLESVFLDTLKHTATAAQ
GLIAETLRDMLSHAQHNGTAVGMQLKDTLAVFAHQMASTGRAQFEAGVKL
TQTTADLLYKISTGVLSGITSQTNRDDK
>NE1504 hypothetical protein
MKTDNKQTTIQLLITSLFALTLTACDSQQEATSNKKPVAQAQGPATGILA
DSAVEGVSYSASSGASGVTDVTGLYKFNHGDSIEFHIGKLNLGKIPGTGL
TTPIELAAGDRNKLLNLLVLFQSLDADNNLANGISIPKTAADALDASLDL
KADPGTFPSSPALATAREAAGIAGSIKTADEANAHFLSQAVNLLGGHLWV
NQDDTSLNFFRFSTDGSGEYLHGIATPDDSCDANRACGSKLVFTAGVEYG
TAKATEYDERGFKLVSTPEVDTDLQSGLSHPRPNWRVYTNGNELIISDIV
IVQREREQASLFGELFHISKPIELSSDDEVAETTVQEIRYHKMDNSQSIV
GAWTMDKDSIKSPVFLFFPDNRYMLVDPVGSATQSTPAACAKPGVELATY
AFDAASGTLKLSSFTYNTAGCAGLSEYSGKPITFKIDTGAQNATLSGERL
APITLQRLSN
>NE1699 conserved hypothetical protein
MLKFNLAVLISMLLISFFPQAYAVGWEKIEIPDNVSTILDDDGRYKDIKT
GCAFSHLPDEAGLPNKPFHFYYRKGTKAKTLIYFNGGGACWNGATCLTSL
TVPVTQTTRPAYNPSIENENNPEELGGILDFTRADNPLKDWNMVFIPSCT
GDAHLGSKNEVYVDPSGIINHGDAVLVQHRGFDNFMAVREWLKHRADRPG
TEQVLVAGSSAGAYGALMNFPRLHSIYPDKTKISLLSDAGTGVFTSNFLN
TVFEPDGPWGTEHTLATWIPGINRIGSYNALNFFTSLATGIERHFVNSKF
AYVTTAWDDVQMLFLNIMRKTGQGVNDPNQWFNLTPVTAVEWSLRMLTTL
HANALINRNSKYYISAGTYHIGLVDAFAPGVFYTEKSAGGIYLKDWVNRL
VTDDRNYPLINLMCSGTCGAPFPP
>NE1201 Universal stress protein (Usp)
MIRRTLIPLDGSELAERAIPHLLRFVEPDQTELLLMAALSSTSSSLFEDT
ARLLMSGQTALSKGHEAGKRVDIVAQQLRQTGFNVANRLLSGAPAASILH
LAEEAYVDLIAMSTHGRTGLGQVLLGSVADEVVRNARPPVFLVPAAVVVK
PDSLPRTILLPLDGTPLAETAIPVASQFAQNTGATIYLIRVVEPGDSGDE
LQDAQILAGSDYAGEQPIIRQAISYLERIRLRLQLAGVSSCSRIAAGDPA
DVIARIVHAEDADLIVMSTHGRSGMERMVHGSVVSRIIGNTICPLLLMRG
RVPVEAYERTGNEVSAVSFC
>NE1617 Sigma factor, ECF subfamily
MHSRDHQEDVAPAANPAMQQVHFLYSDHHNWLYGWLRFRLGDAADLAHDT
FVRLIARPRCFDSSQEARAYLRTIANGLCIDLWRHREIERAWLETLATQP
EAYAPSAEEQTAVLQALHEIDGMLRHLPLKAARAFVLAMACGMTHQEVAR
ELGVSSRMVGHYITRAMLHCMQLEARNLGQAGAIAP
>NE0888 hypothetical protein
METSPVFIDYVTIRQEYFGGGLPVLNDGKVLKVDADGEIEYSTDVRCIIE
GSYDSRVQVRCDGNTVEFTGNISRYGRRDNLFGYDWPTTIARINDLLSLL
GLPPFTSGKLYKFADTGWSWSGARVSRIDLTCNYSTGSKESMHAVLCHMA
GQHVGRQKGSLSPDSGTVEYGRGSKYVYGKLYAKYQELEKHRSKKSGSHV
SDDVIDYCKTEGILREEFTLKSRFLLQNNLAYLGAITQDNLNQIYADRTQ
LQRLEDMKYENFNDLPKHLRSTYASWKLGLPLDISRATRYRHRTELLAYG
VDISIPNNVHHLPSRVRVVELKPLTAPDWYIQNYG
>NE1165 Short-chain dehydrogenase/reductase (SDR) superfamily
MKDKVILVTGGARRVGAAICRWLHRKGARLVVHYRDSSADAQRLKQELEQ
GHPDSVALLQADLLDTGGIPALVDQAARQFGRLDALVNNASSFFPTPVGD
CTEQAWHDLVGSNLKAPLFLSQAVAPYLKKNRGCIVNIIDIHTEQPLKRY
VIYNAAKGGLAALTRSLAMELAPEVRVNGISPGPILWPETGEWQDETARR
HIIDRTLLKRMGEPDDIARTVSFLIEDAPFITGQIIAVDGGRSINL
>NE1242 hypothetical protein
MTHHTEVFEGGTIDIEDDTSLTINGKEISYVHDAVKNKWSSRYLPYTQYD
SLLDLARAIIRDTVEFSGVKE
>NE2237 FAD linked oxidase, N-terminal
MEYQSLPLPAALIAELQAQLGKDKVVLEQDALEVTTRTCIPYRELPGAIV
YPESVEEVQAIVRLAAKFNVPVWPVSTGKNWGYGEKTACYPGGITVVLER
MNRIHEVDEALGYAVIEPGVTYKQLNEFLKANYPDLWSDSAGTTQFASVL
GNALDKGRGLTPYADHFGSLCGMDVVLPDGRLLETGGGPQGNNHAKYVYK
WGVGPYLDGLFTQSNFGIVVKAGIWLMPKPECFDWAAFEYTASDEKFGDF
IDELRQLVSQGVLRSRPHLANDFAMMCIISQYPYELLDGKKYLSEAALES
WRRQHGVARWTFGCGLYGSQAEVRFQKQAIKRVLGRYGRVQFLGAAVEDN
WYGRLLRQVAPVVNRLMGKSDAFMEALIPGINLFRGIPTDYFVRQVYFKS
HPQKPADDVDPARDQCGLLWIGPIVPFTAKHILHALTLVKEVYTRHEFDF
FVELIIESPRTIILLVGVFYERNNANEIARAQAWYKDVRELFLQNGYPPY
RTTTMSMSGSLDQNPIARDFLKMIKGVIDPQDLIAPGRYGVSGLSGADSS
LVALKQADNDPY
>NE0047 Cytidine and deoxycytidylate deaminase zinc-binding region
MNDALHIGLPPFLVQANNEPRVLAAPEARMGYVLELVRANIAADGGPFAA
AVFERDSGLLIAAGTNRVVPGRCSAAHAEILALSLAQAKLDTHDLSADGL
PACELVTSAEPCVMCFGAVIWSGVRSLVCAARSDDVEAIGFDEGPRPENW
MGGLEARGITVTTGLLRDAACALLREYNACNGVIYNARCGVHK
>NE0709 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE1266 putative death on curing protein
MTTPVWINEQDVLAIHERLIFLHGGASGIRDRNLLKSALARPLNFSVYDQ
QSDIFLLAATYTSGILQNHPFVDGNKRTGFVIGVLFLELNGYKFIANEED
SAQAIISLAEGSLDELGFRLFIEHNSIAT
>NE1205 TonB-dependent receptor protein
MKQYSGTQPGSRLFFIPWRTACSGSMFMLVFFMASQICYAQSATPGATLL
KPIVITAPREEDSFSSRTKLESLPGGTSYIAAEEMPGAANLTASKAFFGT
PGLIVQDFFGGNDQPRIQIRGSGLQQNPVERGILVLQNGLPINRADGSYI
VGFINPRQAEAIEVYRGYMANRLSANVLGGALNFISPSGITSPNTRMMIS
GGSFGQLNLSGQAGATSGDFDGLIQADFNRRDGYRDYNDSRRFSVSGNAG
WHLSEDVTTRLFASYTDLGFDVAGPLSKDQLKSSPRSVSAGPVVIPAGAI
YPGPNVIRDQPERTASQFLVGSRTTLQFDRHILDLALGFTHTDDTFRFPV
SSGIRETRGSDLTGVIRYTIESTDKNALPIFESAAYYTAGWADRKNYLNR
SGTKGEHFGSNKLDATTLSLHANLNIPIHRNLTLTPAISYSRATRDNKDT
YDSPTRPTLAYRPNDPTQLLPGGAVPYDSTRYKRSFDSWNPSLAMTFRPA
AGQTLFTAISQSFEPPTHDDLLATVNGTPNSSAGRPNPANSALPAAVFVT
PDLKAQTAMTLEGGWRGSHGIFTWDAVAYYAWIDNELLSLRDESGVSLGS
VNADKTTHLGIELGLKAQLTEHLLARIAYTYQDFRFDNDPVRNNNQLAGA
PPHFIQTQLQARIIGNLVMQAAVKWVPARIPVDNMNTLYVDPYAILDLRA
EYRFNRNFTVFGETTNLFNKTYASSSLVVDQARSDQAVFLPGDGRAFFAG
IKATF
>NE0807 conserved hypothetical protein
MPILNLNNFLAALGNTPDYGRLFARAEHLAQVQKHLLNAVPFQLRNQCTV
GQYTADGLLVIYAGNGTTATRLRHLAPSIRQKMNNAGVKVENIRFSIQPQ
LHSLESGNVREITRSLSQTAIEHLDKLSDSLPAGSPLQSSLAALLANSRR
K
>NE0670 possible membrane fusion protein MtrC
MSMQSRSSPMTMVLPVIFFLAACSGSQPEAMPDKPAKVEPIGHETNLLKL
TLSTQAIQRLGIETAPASDSKNTNNIMLHGEVIIPPAGGGVPVTSASDLA
TLASNQARADGDVMRTRAELDIALKNAARAEALVREEAGSVRLRDEALAT
VAVARANLRVAQTQRAQYGPPITAMNRTSQVWIRVPVPAGDVSRIIRSAP
AQIAALGDGSTRRSARPVDGPPSANAAAATVDLYYALGNAGTSFRIGQRI
AVELPAQGQTSGLTIPSSSILRDIYGGEWVYVSTGKRSFERRRVEIASVQ
NGQALLARGLKPGMEVVTAGAAELFGTEFGIK
>NE2157 dihydrolipoamide dehydrogenase
MAISENKTMTQFIKVTLPDIGDFQEVPVVEILVSPGDEVEQETPLLVLET
DKASMEVPAPQAGIVREIHVKAGDRISQGSLIVTLETRETDTQVASPTPP
RDTMAVPDSQPTITSAAEATAGKVALASVADRGDIHAEVVVLGAGPGGYT
AAFRAADLGKQVVLIERYPALGGVCLNVGCIPSKALLHAAKTLTEAKEAS
LYGIRFGQPEIDVGKLRSWKESVVGKLTKGLSMLARQRKVTVIHGTGKFV
NPHLIEVETSDGIKTISFDHCVIAAGSSAARIPGLPADERIIDSTGALAL
AEIPERMLILGGGIIGLEMATVYHALGTRISIVERMAQLIPGADTDLIKP
LYKKLKTECEAIYLNTSVSRVEADKEGLQVFFEGEQAPEPQRYDRVLVAV
GRRPNGKLIDAGAAGINVDERGFIPVDKQMRTSVPHIFAIGDIAGDPMLA
HKASHEGKIAAEVIAGHKVTFDARTIPSVAYTDPEVAWMGLTETEAEKQG
IAYEKAVFPWAASGRAITMTRDEGMTKLLFDKVSKRILGAGMVGPHAGEL
IAETVLALEMGADMQDIGLTIHPHPTLSETILFAAELAEGTITDLYVPKK
>NE1951 Fumarate lyase:Adenylosuccinate lyase
MEISTLLALSPLDGRYQQKVAALRPFFSEYALIRNRIHVEIEWFKALGNT
PALTGASSLSTGAVVQLDQLVENFSVTDAEAVKSIETRTNHDVKAVEYWL
KEKLTGDQETAAISEFVHFACTSEDINNLSHGLMLKHSRDQVMLPALDEV
IKRLAVLARDLADLPMLARTHGQAATPTTMGKEMANFVYRLRGARERLAA
VVITGKINGAVGNYNAHMIAFPDFDWESFARSFVEKLGLSFNPYTTQIEP
HDTMAELFDAHVRINTILLDLNRDIWGYVSLGYFRQKTRADEVGSSTMPH
KVNPIDFENSEGNLGIANALLKHLSEKLPVSRWQRDLTDSTVLRNMGVAL
GHTLLAYDSCLKGLGKLQADPARLSEDLQNAWEVLAEPIQTVMRRYGVAN
PYEQLKALTRGKTGIDRTTLQQFIEQLDIPAAERDRLLQLTPWGYIGLAA
HLARDIGD
>NE0916 conserved hypothetical protein
MDPVRILRHLITGQSAIKRAFPPATLTVIEQAIAHSETLHGGEIVFAVEA
SLDLPLLLQNQIIRERAIDVFSLLRVWDTEHNNGVLIYLLMADHDVEIVA
DRGIHTKVDQTIWETACDTMKTTFRHGQFEQGVLAGIDLSTRVLQQFFPA
STGKRKSELPDRPVVL
>NE0722 AraC type helix-turn-helix
MIGRREPYIRPNCLFGFDDLVHSSGGDLHTLADQAGLPREAFSNPDILIS
WPRQGIFCELAAQELGRPNFGLDHAFSLPNTYPTAGAAIFLSQITHTFEE
WLDACERYARYYTNAWVPLLIRDGGPFAYVRLLKDPLAKSPRQQTEGLVA
TFYRMARTVIDATDEKAHLIRFRHARPCDTQPHEALFDCPVEFEAAHNEL
VFPHEYLNRPTNGRLAGLEPLLDLYLRYRIAKMPLYDQSVAATVAAGIPI
VLNTSLCNLEYFAVSLGLSPKKLQRLLQQENTSFSAILDKVRESLACEML
ETTDVPISNICGLLGYTALPAFNLAFKRWIGVAPSIYREFNRRKEKDCDS
>NE1041 possible sigma-70 factor, ECF subfamily
MMLLPTSCRKHTRLLALQQAGQHITESRALLYRIARNLVIDRYRRGQVQG
WTTDNHSLENFPAQRACEPDALVASSQVLQAVLTTIDALPLRCREAFILH
RFDGLSHSEVAEHMGISRKAVEQHIQHAMQVIRRCRMQMEGDTDTSGSCL
ATQKKRV
>NE0703 conserved hypothetical protein
MRYSTQIRPISYLKANAAEVLAYLTENREPLIITQNGEAKAVIQDIASFE
ETQETLALLKILALGNAEIEAGEVQPVHEVIAGLRTRQTIK
>NE0223 Dihydroneopterin aldolase
MDIIFLKEFRIKTLIGIYPWERKIPQTIELNIEIALPSRQAGMSDRIEDA
LDYSQVVHRINAILDQQHFSLLEALTENIAQMILIEFDSPWVKISAAKLD
VIPGVKQLGICIERSQS
>NE2101 hypothetical protein
MSSSPSHPFPSLQSRIVASFVSTSSTIIVARLSTLRPLRDLTMVGWSMST
RMKAKLVCDALQMAVWQRQP
>NE0187 hypothetical protein
MLHFYPTKVDLISNWNNNNRNNYMKVSLNLITTSMLIGLPMLAQAEVFTI
PQSSLIPSTTLWTDDLGISIGNTLVMTGGGSAANVGDPTGRNDDGFSGPI
SFGLDFSGLTLFGTTYSQFYANNNGNISFGNGISAFTPMGLQGATQPIIS
AFFADVDTRNAASGVMSFQTHTTAAGSEVIITWPSVGYYSQQATPLNTFQ
LVVREDDYLIPDGEGQIGFFWTTMGWEVGSASGGGSGGLCSGIGGIGTGC
VPAAVGFGDGLNNGYVLEGSTLNGIAGVLQNHRLWVNLSDGGVPVIDPGA
VPEPGTLALLSIGLAGLGIKRRYKAKIA
>NE0428 conserved hypothetical protein
MVPASRKESITIEVAFALPQRQFLRRLQVPMGTSMYQAIKLSGVENFYPG
SDSTALKTGIHGKITDPETILHQNDRIEIYRPLVIDPKEKRRLKSDRLTK
GQK
>NE1832 hypothetical protein
MKTHTLPLLLSCTLFLIGACTSMPPEIRNFSAVDIPYQPVSQNAETYKDA
PVRWGGTVIEVENETDFSLMQVLFHPLDRSGYPETRKPGEGRFAVEINEF
LDPAIYTKGVEVTVIGTVKGNIERTIGNKTIHIPLITAKTIHLWPQAYRE
DNLYRYGPYPGYYGYGYPFFYNGYYRPYRFWW
>NE0441 aminopeptidase A/I
MDFAIRSDNPEKYRGDCIVVGVFESRKLTEAARVLDEAGKGHLGRIVDQG
DMDGRANTTLLLHGISGIDSKRVLLIGLGKEEEFGEKVFLDVVRTTFKAL
QPTGAKDVGLYLTELTVKGRDVAWNVLQTVILAEESAYRFDRLKSKPEGR
QPSLAKVDIGITDTSTAAAVETALQQGLAIAHGMKVTKDLGNLAPNICTP
SYLAGQAEEMARTFNLKFSVLEEKDMEELGMGALLAVARGSHQPAKLIVL
EYHGGKDSEKPVALVGKGVTFDAGGISLKPAAEMDEMKYDMGGAASVFGT
LTAVAELKLPINVIGVIPTTENLPGGNATKPGDVVTSLSGQTIEILNTDA
EGRLILCDALAYTERYDPEVVVDIATLTGACVVALGHVVSGVMGNDEPLV
QELLQAGEQTYDRAWHLPLFDEYQEQLKSNFADTANIGSRWGGAITAACF
LSRFTKKFRWAHLDIAGTAWKSGKEKGATGRPVPLLTQFLISRANKH
>NE0158 putative hemolysin-type secretion transmembrane protein
MNLQDDAFSKDESRFTASSILWLIVASMAIFLIWSYFAVLDEVAIGEGKV
TPASKGQIIQSLEGGILSELVVHEGDVVENGQKLATLDPALSRSSVEEAA
QKIIALQARAARLRAEIENKSDVTFPPELADEKAVIERERQLFRTDVQAF
HENVSQLSRQLRLAQQEIDIAMPLLKTGATNEVEILRLKQKAAELSTKLA
ATKGQYSVALKSDYATTMADLGPLLKVREGRADQLRRTIITSPTRGIVKD
VRVNTIGGVIGPGGELMEIVPLDDQLLVEARLSPRDIAFIHPDQAATVKI
TAFEPSIYGTLAATVQNISPDTIEDQVDKRLYYYRVYLLTKHAYLETKDG
KRHAIMPGMVATAEIRTGQKTVLDYLIKPLNRAAEALRER
>NE0878 conserved hypothetical protein
MQTGNLFTGFSTPRKGETFETLLSRRNLVIERIVSSAELSPQKYQQVQDE
WIALLKGTAILEVDGKTIELNTGDYLFLPANTPHTVKWTSHGALWLAVHV
Y
>NE0253 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1219 UMUC family (DNA-repair)
MHGMNTKNRRIAHLDMDAFYASVELLRYPELRGLPVVIGGRSVHQPVIQP
DGKRSYVRLRDYTGRGVVTTSTYEARAYGVFSAMGIMRAAQLAPDAILLP
ADFDTYRHYSRLFKDAIARITPHIEDRGIDEIYIDLSEHPDETASLASSI
KQAVRDATGLSCSIGIAPNKLLAKISSDLEKPDGLTILTHTDIPNRIWPL
SVRKINGIGPKAEEKLVRLGIQKIGELAKAELSLLQAHFGRSNAIWLHDS
AHGRDSRPVVISSESKSISREATFERDLHVQEDREILSDIFTELCTRVAE
DLQRKGYVGRTIGIKLRYENFQTITRDLTVRNPTADASTIRKAARDCLRR
VPFEQKLRLLGVRISGLSKISALLKENYYFQEELF
>NE2171 Formyl transferase N-terminus
MQRIVPFVDHEIGYRLLQKLVTHSDASRFEIPVVVTTRENGKAWWPGVRE
ICMKANIPLLVYEAPFSVDRLIRQADWFLLLSWKHIIPIDLISLPKQGVL
NLHYSLLPSYRGVYPVNWAIINGERRTGFTYHFVNEEIDDGEIFMQVEVP
VHLSDTARTLQSRLDDIVCHHFDELIERLLTYDLGSRLRDTQKKKGARGD
YYSREKFEKTCLIDLNKSYQGVEFFNLLRGLAFFEDSKNAYVIDDESGQK
IYISLNLREEK
>NE2043 hypothetical protein
MTALTTDRRAQTRWSGRLLPSLGILLVTGGLVLLTWYTWLVLTPDTAPYR
YQQVTTGNASEYPELELDTWPDLTISQYDIHVEGTEQPVAQAWFGQRANQ
PQVLLNWKNQTREPLLALDQKASELSALAAAIDKHASRDALLLGWWDTSR
QLALLTGRDVLFHTPLHEPLIIPPEWQPHEQAIRAYENQQAGTPADPQEQ
ELFMRFAQSLVNPPANGLDDLRQLAGTRDTYLIVHVSDLYKLGLMYPDKF
GIAYKHYRMTGNLHGMISHLKTEMRTRGYYTYTLQSLSDELIRAFFLIDE
ASYDTLLAKLLPFTSQPSPVERTSPRLIYQQGGYWVYHLTAKAPAHNTLQ
SGKDSNETTDSTVSVDQVQ
>NE1541 hypothetical protein
MNSVDVKYHADPGLNNELVSPFAQFPIKLISSENLFGTGKEVLIQHRGEQ
YHLRLTRNDKLILTK
>NE1560 conserved hypothetical protein
MHCRSFPPIASPGSWVLILGTMPGKVSLREQQYYAHPQNLFWRITAEILG
FDATSAYPLRVSSLKDHGVALWDVLQSCTRESSLDADIVAHTIVPNDFGR
FFTACPDIRRVCFNGAKAAALYARHVKPFLQDAPTVEYVQLPSTSPANAA
IPRADKLRAWSVIKHNA
>NE2199 Flavin-containing monooxygenase (FMO)
MRIAIIGSGCSGLTAIKNLLDAGLKEIICFEKSDQIGGNWVYTAAPSHSS
VSEATHIISSKALSQFSDFPMPDDYPDYPSHQQILAYFQAYTRHFHLDHY
IRFNTAVLRAEKIEKERWCLHLDDGTQAEFDYLLVANGHHSVPRHPDWKE
CFTGKYLHAHEYKTNQGLEGKRILVVGAGNSGCDCAVEASRVAARVDISL
RSPQYIIPKFIMGRPTDTFAATFHWLPQSVQDGLQRISLRLQIGRYHDYA
LPEPDFSPTRAHPTINSALFDKIRHGKVHPRPGIQKVSGQTVYFADNATA
QYDVLIAATGYKISFPFFDRDFLDWEEAAHIPLFLRIFHPDHPSLFFVGL
IQPQGCIWPLAEIQARLIGQLLTNKIQLPLNWRKLALSEGRNRAQQFIAR
PRHSIEVHYYPYLKQLQRMIQGQIK
>NE0240 hypothetical protein
MTGAVRRNVRKRLNKNQKQLRQNWEGYHLLEFAKTIRPVKQDTKRRKELD
DHLVSNGIGGFETLQKLPADNKTAFTTEKSDGVFESLAEFQDIPTFVCFL
NFLEFCRNSIPRRPLDVFDGNDTLSIEIDGKRKAVTLDYAIDDFKRHAKT
VEERLKLESSQSSTGQADPIQPRLTIPQARENNPNYFANAVLDVIGRDEQ
KSRLKAFLECDKNVAWFQLAGVAGQGKSRLAFDLIKVAEELGFRAGFLTE
NDIKFFKDQWKDWQPDKPYLLIFDYVIGREEQIKPIFQTLISNQDGYCHN
IRILLVERQRWDQGNVIETQDQINKDAPRQLIAISNKAPWFLKLCEEGDS
EGERLAPFRFDNGVEELKELGQDNLASIVKQLLSGKTLTLSDDVLEETLE
RIDNTGRPLYAYLLAQQLSESEEGFRSWTKIDLLNGQLKRDKRRWEQAFN
DKAPTWGDSHEAMKLAVLATIVRQINFEDEIIKSNFGHIDSSLGKEALAI
TNSYLVNNDNRPHKIHALEPDLLGEWFVLYCFYQGLNFEELLNIAWEYSP
NDTAMFLIRVMQDFIDLTKTYNDWNLTEKLLAHKPPHENHYLVLARVAVF
ISYELGRRNLTIPHNIIIALEHAANLSNVIAMDYLGFFYQQGLGVVRNPE
KAIYWFQMAVNKKSDTAMVNLGICYQKGEGVKQNLNAAFKLFQRAVKLDN
STAMFYLGLCYQRSEGVKEDLNEAFALYQQAADKGNSTATAYLGLCYQYE
VGVKQDLDKAISQYQRAVDEGNSLAMVFLGRCYQYGEGVNQNINKAIALY
QKATDKGDSTAMTCLALCYQDGKGVDQDWNKAINLYQQAVKKNDCTAMYY
LGACYENGYGVKQNRSSAIELYRMAANQGNSNAMVNLGFYYRNGIGVKQN
RKEAVKLFQRAAKVGDYRAMCNLGVCYENGEGVDQDWNKAISLYQQATKA
GEIRAISNIQNILLRNFLGEGNYKSRKTNGCTRNKLSHLVEKLAFDPPIL
GGDWKPLLTEEIKACIDKVIVPFEILDVVVEI
>NE1656 Phosphopantetheine attachment site
MNCSGIDLMSEVTQEQITALLCQRLANFTEIETEVSPETNLITHLAIDSV
KLLNLVMEIEDQFDISVPLNTLVDVLTVQDLANLIYKIKSLSQ
>NE0585 conserved hypothetical protein
MKPVAILRFFPIEGPGYFATFLNHHHIPWQLICLDEGTPPPKNMNKYSGL
VLMGGPMSVHDDLPWIGSVLSLIRQATSTDIPVLGHCLGGQLLSTALGGV
VTNNPIKELGWGRVDVTSSSIAHDWFGTDLTEFDAFHWHGETFSIPANAV
RLLSSPYCPNQAFALGIHLGLQCHIEMTAEMVKIWCEANAAELVQSRESP
AVSSSEEIQANLDDRIAALQSIADRLYKKWIKALKN
>NE2416 Biotin / Lipoyl attachment:DUF183:DUF213
MEITMSEKTLDTNKIAAPTIDVLVPGIQTTIQDYPGRRGYWNIGVPPSGP
MDNLAFRLANRLAGNEEGQAAIEITFAGPTLRINCDTVIAVTGAPIEVNL
NGKPLAQWKSHEVAAGALLAFGDVLGQGSRVYLAIQGGIQVPDYLGSKAT
FMLGQFGGHEGRILQAGDVLNVVAGKHDGYTPRELPQVSIPHYTHHWEIA
VLYGPHGAPDFFTEEGVANFFAADWTVHHNSDRTGVRLIGPKPKWARTDG
GEAGLHPSNVHDVAYSIGAVDFTGDLPVILGPDGPSLGGFVCLVTLVHAE
LWKMGQLRPGDRITFRRISAGEAATLEAVQDALISDLRLPEAASVSTSAK
VAATTESPILHTIPESSGQVQVVYRQGGDRNLLVEYGPMMLDFNLRFRVH
VLMEWLQQAVESGDLPGIIDLTPGIRSLHIRFDAKLLPRKRLLDTLITAE
KQLPAIENVEVPTRIVHLPLSWNDSSVQLAMKKYMQSVRKDAPWNPDNIE
FIRRINGLDSIDEVRRIVFDASYMVMGLGDVYLGAPLGTPVDPRHRLVTT
KYNPARTWTPENAVGIGGAYMCVYGMESPGGYQLVGRTVQMWNTHVQTKD
FREGKPWLLRFFDQVRFYPVTEAELAEMRKDFPSGKFTLRIEEDRFSLKQ
YNDFLKENAASIDTFRTKQKAAYIAERERWKAEGQENYTTSEILEDGSAP
ADITISAGAHAIHTHVTGIVWKLLVSEGQRVNAGDDLAVLESMKMEFTVN
APISGVIQRVLSKAGSKVTARQVLFVIEGG
>NE0633 hypothetical protein
MLRLIPNKGLKKTINTLTVAEKARINHQDIEELLLRYQRSCPEDAARNIT
EVTNPNQEKHEGLRVFVIQTWMALMVANSYGLETYQTLKTFMDRQGYKTQ
PDSTYMATEQKD
>NE0792 conserved hypothetical protein
MFFVRVFIIILALSSSGWVSAKDQVPYATQINDLMRYLRVTPNIATSGAL
TKDGIQELVKHSFQTVIDLRSESEGTPSEKKAVEAVGITYINIPVTGEGV
NESQLTAFKQALEQAAPPVLIHCATGNRAGAMWTAYRLSEGIAPEIAFKE
GRAAGMNAGMEEKIRKIWCDGNKDSCQ
>NE0343 Sensory transduction histidine kinases
MKSGFSHPLGLRQRLLLWILLPLIAVFIGSILFDYRLAQETADTAYDHSL
NNAALDIAAYIQTVGLSPKLRLTPEAEAMLRHDAVDKIFFAVRDSNKQLL
IGDAQLPAYPGAIHKRPVFIDDVYQGEKIRATTQLIMVDANEITITVAET
MREREHASNRILAATIIPNLIIIAATLLVIYFGVRRGLAPLVHVENQIAS
RSPRDLRALDIGHAPREIRPMLTRLNELFESLRTAAAAQQRFLADAAHQL
RTPLAGLQTQIELVAAEGKYVRNDERLARIEEAAERIAHLVTQLLIYAQT
EPAASANQTFHPVALHELAESAASTFIDRALAKDIDLGFEIQPAVAEGIA
WMLREALGNLIDNALRYCPAGSTITVRSGVHSSRPCLAVEDNGPGIPPEE
HKKVFERFYRIPGSISGGCGLGLPIVQEIAHLHAATITFSRPANSGLTVT
LTFPENHRENT
>NE1797 conserved hypothetical protein
MNITVQSQRTAVGSRMINGRSVTALITLLVIAVILGIYHTTVESMVAIWN
RSDTYAHGYLILPFSVYMIWKKRAMLSTIQYRPDYMPLSVLVGLGAGWLL
ASAASVVVVEQYALVAMIPVIVWALLGLRAFSAILFPLAYLLFAVPFGEI
FIPPLIDFTADFTVGALQATGIPVYREGSFFSIPSGNWSVVEACSGVRYL
IASVTLGTLYAYLTYHSISRRLIFIAFSIVVPIVANGVRAYLIVMTGHLS
DMQLAVGVDHLVYGWIFFGLVMLLLFWIGSFWREDHLDETVSVENSSSAS
LESSSVPIKSTLGMAGLVVAVAMIWPVYLNYLNNKSDVRPISEISVADLS
GKWSTASASLTDWIPGYIGFPRQFIGHFYRENKHVGLYITYYRNQDQNNK
LVSSSNVLVSDRDSRWRNMDGSKRNISLGDAPFTVHQNQLHASKERLLIW
RWFWLIGYETADPYVAKMIQALNRVWGNGDDGAEIIIAAGYEHDPEEAAA
VLREFMADMKPVIAGKLQAVHSAQTD
>NE1952 putative glutathione S-transferase protein
MKLIGSLTSPYVRKVRIVLGEKSIDHEFTNDPPFSPDTRVAQVNPLGKVP
ALIMDDGYILFDSRVIVEYLDDLNGPESGRLIPTAGPQRLRVKRWEALAD
GIIDACVAIYLERKRPESQQSQEWIERQQKKIDQGLQAVAAELGDKPWCE
GESMTLADIALGCAFGYLDARFPAVKWRDTYPNLVRLADKLAERQSFKTT
IAPN
>NE0569 Cytosol aminopeptidase
MLATLIQNDNTFDESTRTSHLLIIFPKTDKIPGKYQLPAEQVLNDLLQRQ
GIPVNRLAETPLSASLQHGGLYAWVMVDPGRSLFEQQTSLRKAMQLLLNE
HPAEIHLAVYGDDAQRRHLAELAVYVAWVNGAILPVRKQRKPDDKPLERI
CLHGIVDSTGFAMQCAQAEGNFLARGLTILPPNELTPRTYREQIRKLAAS
ENWQYEEYELSRLRELGAGAFSAVAQGSLTDDAAIVHLRRTAGHGETGKT
IALVGKGICFDTGGHNLKPARYMYGMHEDMNGSAVALGVLLAATRADLPI
NIDCWLAIAQNHISPAAFKQNDVITALNGMTIEIVHTDAEGRLVLADTLT
LAAREKPDLIIDYATLTGSLITALGSRYSGVFTNRDYLAQNAVAVGRSSG
ERVCVLPVDSDYEADLDSKIADIKQCTIESEADHILAACFLRHFVNDIPW
LHMDLSACRHQDGLGAVGTEVTGFGVTWSMHFLQEILKILQKRMSKSRTD
>NE2503 TonB-dependent receptor protein
MSIRGFGARSSFGVRGVRLYVDGIPGTMPDGQGQTSNIDIASADRVEVLR
GPFSALYGNSSGGVVQVFTEEGSRPPQLVADISAGSYGTRRYGLRASGAT
ESALGLIDYNVSASHFTTGGYRDHSAARKNLGNVKLGVQLDDASRLTIVA
NGVDLTAQDPLGLTRSQFESNPRIAAIQAKTFNTRKTIQQTQGGLVYERD
IDANHSLRAMLYYGQRNTVQYQAIPVAAQQPPGSAGGMIDLARSYGGADL
RWTSRLSVADRPLTLIAGLSYDSMVEQRKGYENYTGSLNAPTLGERGNLR
RNETNRVYNLDPYLQMSLQASPQWAVDAGLRYSTVSFDSSDHYIVSGNPD
DSGNLRYRKALPVAALRYTPAKNTMLYASYGRGFETPTLNELSYRPDAQP
GMNFGLDPALSDNFELGAKKEVAGGLLTVAIFRSDTQDEIVPATSLGGRT
SYRNAGRTRRDGVELGWNARFAGDARVQLSYSWLDARYRESVSDAIRAGH
RIPGTARQAAYATVAWEPREGWQAGIEGRYLGSVAINDANSDAAPAYFVA
ALSAGYVWRAGSWQGRVYARADNLFDRRYAGSIIVNEGNGRYFEPAPGRN
WSAGIGATYTF
>NE2107 conserved hypothetical protein
MFVWRISRKEFALDRTGYGASIKGQRWNSASIPAIYAGLSLGIAAMEKLV
HTGSNLPLNLVVVRMTLPDDNSLYKIPPPDALPDGWSALPGSPTAATYGD
SFLLNGKYLGLIVPSAVIPEARNIVINPNHPMMKEVTIEIIRNFTFDSRL
RS
>NE0057 5-formyltetrahydrofolate cyclo-ligase
MNNAALRHWKQQQRARLRELRQQVPVTQRMLWSDAITAVLTQGFPFLENQ
RIGFYWPHQGEYDPIPAMTFLRTRGATLALPEVISKDEPLRFIEWWPEAP
MKKDIYGIPFPDNTRGITVDSIIIPLLGFDEQGYRLGYGSGYFDRTLAAM
SPRPLAIGVAFEILRLPTIHPQQHDIPMDYIVTEQSILHRTDTGLVPLTA
PIVQSETA
>NE2124 TonB-dependent receptor protein
MSQQQKTLLPLGAALMAGGLSISAFAAEGSESSEAAVLPTIKVQTSAEEQ
DGYRATATRVGKKLQDPHDIPQAVTTITNQFMEDYQASSLRDALRHVAGL
SFNAAEGGRSGDNMMLRGFYTFGDMYLDGIRDTAQYNREIFYMEQVDVLR
GAGAMMFGRGQAGGVINQVSKAPHIGDRGKITGSVGMYGYYEMTGDFNKQ
ITDTIAIRFNGMKRTEDSWRKNPAADDRASLDRDGFAISAGIGLNTANEF
ILSHIRTTTNDKPDFGVSFDATTRRPQERFPAKYYWGTNSNFDKSTVNIT
TASFTHRFSPDSEWRTQLRYGDYERSYWARTPSLTVAPNIGGTICTPNCN
GGPTRAMDYETLAIQTDYNKKFQLFGMNHEFLAGAEYLNEDGFRHSLQNF
GGTTAANPPLYRPYEINSAGNPVDFKSRTYAVYMQDTIEFIPKWKLTLGG
RRDEMKAIYSNTLSPRLRYGEWSTRGALSYHFSDQSHFYVAYSDSFSPTA
DLYQLNVAPLPPERSQVKEVGAKWLLFEGDLALRAALYRADKQWERNTDL
ESTASVLTRKRRTDGIELEMAGRITPNWEVFSGFTVMDSEVRKVAVNVNP
NTGALTSGNPAYEGERSRNTPPYTFNLWTTYKLPYGWRVGGGVEVKGKRY
AYNPSGAGPIPTLPGETEFHPNTAPSYMRWDAMVAYEQPKWAVRLNVQNV
FNKLYYDAVYDNGGFVIPGQRRRAIMTAEYRF
>NE1571 hypothetical protein
MAATATSASTLIIPVENQVRELDAKLLLACVAAERGFPVIIGSRAFVHFE
IASLPRGIYLAKSMRSLSNSMFRIIRMLGHEIVAWEEEALVHPPADTYYT
LRLSPTTIRNVSHIFAWGQENVDLLQHYPQFPENLPVHLTGNPRGDILRP
EMRAYFAAEVERLRNLYGDFILINTNFTDVNPFIPNIGLFIPAKDGDKKS
RRGQAGIGMSEEFAEGLWHHKKAILEDFRQLIPALEQVFPDVTIVVRPHP
SENFQVYHDIAARCQRVKVTNEGNVIPWLLASKTMVHNGCTTGLEAYALG
VPAISYLATFNEYYDYDFQGLPTRLSYQSFNFSELQDTLSRILNGDLGAP
GGEERKTLIDYYLAAQNDRLACERIVDVLEESGYSQSQPPARATPVYLAG
WALANLKATLTQLNMRRPGPNRLSYHDHRFPEIPVEQIEQKIARFGNLLN
RFDSIKVKQHSRHLFRINSSL
>NE0681 putative transmembrane protein
MRLKIKPEWLLLAAFIILTLPPMQRFLEQSMVTHMLVQLPALTVIGWIFG
RTLPESWSNKIAPWNRWGITGMALAIVIMTYWMLPRALDAAVSEWYVELG
KFITVPAMGIALGVSWPQLNPIAQGVLKLEFWATFMRLGWLYLDLPDRLC
ANYLLSDQRVLGQLLLLIGSAWAIAWTLRVMFGTKIINT
>NE2102 conserved hypothetical protein
MLPEIWKHDSPYFRPEGDTRGEAVLPIGMPMRQQYDFSSSRKNPYAAKLK
KPVTIRLDEESISYFKSMSEETGIPYQSLINLYLKECAASGKRLNLSWK
>NE2515 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE2318 hypothetical protein
MKLLFFCLLLVSMSVPAVAGNEKQIFELEAAIMQQQQEQQILFQRFQMLQ
ELRRHEITQIEQALPTGSDVIINGEAPKYEDVARQRKERAERVHRYTDEL
DELYMRYQETENERRALIEQLNGLKPGQDVSAEPKK
>NE1367 possible ISA0963-4, putative transposase
MLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQLYLALNDIEHSK
TKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEELQHDLDDWMAYY
NSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1303 hypothetical protein
MWQTICRGAIAGGVLLLTVQPVSAAEPDPAQVEKTVQAYIGKANQEAAQN
RESVEESQVVTADLNGDGRAEIILWSTRYGGTYSFNDVTIFTDSGRGYQV
AAGTEDVLGMVESIEVKNGLIHIHALWPGPNDPRCCPTVKKTAVYQWQGK
ALADVTSRVSGKK
>NE2368 Aspartate aminotransferase
MQPIKKSSKLDNVCYDIRGPVLDRARQMEDEGQRVIKLNIGNPASFGFEA
PDEILQDVIRNLSAASGYCDSKGLFAARKAIMHYTQEKNIANVQMDDIYI
GNGVSELIMLATQALLENGDEILVPSPDYPLWTAAISLAGGVARHYTCDE
QSSWLPDPENIKAQVSSRTRAIVIINPNNPTGALYPDDLLREIIEIARRN
NLIIFADEIYDKILYDSASHTSVASLADDVLFVTFNGLSKNYRAAGFRSG
WVVVSGNKTHARDYITGLNMLASMRLCANVPAQFGIQTALGGYQSIHDLT
MPTGRLKRQRDLAWKLLTDIPGVTCFKPQSALYLFPRLDPDYYPVKDDEQ
FALDLLLEEKVLVVQGTGFNWSQPDHFRIVFLPNSDDLTEAIGRIAKFLD
RYRTRHGS
>NE0319 hypothetical protein
MRAWIEYRGGYVMNGRFCIWRSPEFRYKLMSVLFPGMILSSSCFGESEIG
QAPLGNRQILQSQFQEFRISSSLQYFPHSYFPVQPILIYPGVQWAYPCFP
FVSCMELQQYRRYKRREKRQQPKPVFGQGASLMDESMEDWRAGLRPAVEP
FRTDEHQIVPALRGHSLIRPEYREAGSILPRFSNGTE
>NE1911 Glutaredoxin-related protein
MDVVDSIEKQVSSHPVVLYMKGTPQQPQCGFSANAIRILNACGVEDFFAV
NVLADPEIRQGIKDYSSWPTIPQLYVNGEFIGGSDIMSEMYQNGELQKLF
EK
>NE0706 hypothetical protein
MAKPMFNRQTTILLVLNTCVISAFVLIHWIVLDHPDDLSTKSRNVPSETT
PLSIPMETLPNALEQTLLFSSSRTRTVISSPESTVPLDTAPPRLVGIVEE
EGHKRFALLEDETATSRKLVAQGDTFETWMVVSVTSDAIHLRSRSDINDG
VHPSSDIELRLRPSVPPSQNFNP
>NE0153 GTP-binding protein (HSR1-related):AAA ATPase superfamily
MKPTLVLVGRPNVGKSTLFNRLTRSRDAIVADIPGLTRDRHYGHGRLGLK
PYLVVDTGGFEPVVKSGILHAMAKQTLQAVDEADIVLFIVDGRQGLAAQD
KIIAEQLRKTGQKIILVVNKTEGMPYSSVTAEFHELGLGTPCAVSALHGD
HLGELIDFALEGYPYEEETAAEPGQEKCPVIAIAGRPNVGKSTLINTLLG
EERVIAFDQPGTTRDSIYVDFEYGQRSYTLIDTAGLRRSGKVWETVEKFS
VVKTLQSIEAANVVILVLDAHHEISDQDAHIAGFILETGRSLVVAINKWD
GLDDYQREIIKREFERKLGFLSFANLHYISALYGNGVKGLMPSVDAAYAA
ARAHIPTPKLTRAMLAAVAKQQPPRGGMSRPKLRYAHQGGENPPLIIVHG
SMLEHVPQTYRRYLENTFREVFKLKGTPLRVEFRTGHNPYAGKKTPLTEE
EARRAHSRRRRNRKKYG
>NE1116 Type I antifreeze protein:HlyD family secretion protein
MNKRVKPVLTYLLAALVIITIIVAWRVLRPAGLPAGIVAGNGRIEATEID
IATRAPGRIIEILAREGDFVQADRVLARMDTQVLRAQQIEAEAQVRRAEH
AHQTAQSIVIQRQSEQTAARAVVAQRQAELDAARKRLQRTQMLAGEGASA
QQELDDDQARVLSAQAAVSAAQAQVAAANASIEAAKSQVIESQSVIVAAQ
ATVARLQADIDDSELRAPRDGRVQYRVAEPGEVLGVGGRVLSMVDLSDVY
MTFFLPAEAAGKVALGSDIHIVLDAAPGYVIPAKASFVASVAQFTPKTVE
TESERLKLMFRVRARIDPELLKQHLEQVKTGLPGMAYVKLDPAAVWPAEL
EVRLPQ
>NE0437 Cytidine and deoxycytidylate deaminase zinc-binding region
MKIQGVRVPVQTEDEYFMRQALDLARVAGAAGEVPVGAVMVRESRIVGCG
HNCPVTTVDPTAHAEIRALRDAASRVGNYRLPGCTLYVTLEPCVMCIGAM
FHARITRLVYAANDPKTGVCGSLLDLPADTRLNHHLMVSQGVLADEAGTL
LKQFFIAKRGIKERNTS
>NE1155 DnaJ N-terminal domain:DnaJ C terminal domain
MEYKDYYQTIGVPRDATQDDIKRAYRKLARKYHPDVSKEPEAEARFKEIG
EAYEVLKDPEKRTAYDRLGTRWKSGQEFQPPPGWDQGFEFHGGGFTEAAS
QFSSFFESLFGRNQQSHRYRDREFNLHGEDSHARISIALEDSFQGGIRSL
TLQHTESGQDGRPLIKERTLKVRIPKGIQQGQHIRLAGQGSPGTGGGKAG
DLYLEIVFKPHPLFRVEGKDIFLDLPVAPWEAALGATVAIPTPAGTVDLK
IPAGSRSGQKLRLKERGIPGTPSGDLYVVLQIVLPAADSEQARAFYREME
QKLAFNPRAGMSIR
>NE1300 hypothetical protein
MNKVIVAAFVSAFVLGSTATFASGNLESSLAPISAKDMLDYLACKDKKPT
DVVKSHTEVENGKIVRVKCGDIVALVQKAREQSGDAWQGGY
>NE1798 Glycosyl transferases group 1
MRGLLYLTHRIPYPPNKGDKIRSFHLLQHLSKRYRVYLGTFIDDEEDWQY
EDEVRKYCQDVCIVRLNPLSARIRSLFALFGNDPLTLAYYRDARLMQWIK
ALFSNESIQDIVVFSSAMAQYVEHLSGCRRIIDFVDVDSDKWRQYALSKQ
WPFNRIYHRESRYLLNYERKVAEEFSHSTFVSEKEAELFRQLAPDAASKT
TFFNNGVDTDFFSPQRSYLNPYPAAKKVLVFTGAMDYWANIDAVSWFANS
VFPAIRTKQPDVDFYIVGAKPARQVQMLADLPGIYVTGAVEDIRPYLIHA
AVAVAPLRIARGIQNKVLEAMAMQKPVVVSVQAMEGIHAVSEKELIVAGD
AGDFAEQILLLLDSDEQVTMGRLARERVLQDYTWPANLSRIDALLAAP
>NE1900 DUF214
MNLLTIIGEATRALQTNRLRTALTMLGMIIGVAAVVLMLSIGQGAQTSIN
SAIASMGSHLLIVMPGATSSGGLRWGSGSVKTLTVQDAHAIAELPMVDAT
APVISGTAQLNYGANNWSTVLTGVTPDYFSVNNWEMADGEIFTEGDLRSG
TRVAVLGQITANSLFGNEDPVGKIVRIANRPFTVVGVLVAKGQSLSGRDQ
DDNVFIPLTTAQRQIIGNQFPGSINHMTVRAKTADSMEAAEAEITRLLRQ
RHRISAHMENDFTVRNLTALASVASNTARVMAWMLGAVASVSLLVGGIGI
MNIMLVSVTERTREIGIRMAIGANQRMILTQFLLESLMICILGGITGIAL
GIGGAWLASRIAELEIVITPGMIALAFSFSSIIGIFFGLYPARKAAALKP
VEALRHE
>NE0715 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR
>NE1079 Sigma factor, ECF subfamily
MTIPVVKLVTWQQAHDLYNNHHGWLQKWLYRRLGNTFDAADIAQDVFMRV
LTRRQPSQIHEPRAYLSSIARGLVVDHWRRRELEQVWLEILAVLPEPETP
APESRLIFLETLVEIDRVLDSLKPAVRTAFLLAQIDGLTCSQIAKHLDVS
VATVERYIAKALRQCYALHFEA
>NE2534 Glycoside hydrolase family 24
MSQIPQAAIALAKRFEGFHKVPKSDPLRRARPYICLAGYWTIGYGRLCKP
DHPPIDEEEGEAYLYQDLRKALAATLRYCPVLATEPESRLAAIVDFTFNL
GAGRLQTSTMRRRINQRDWLSAGQELRRWVHGGGKVLPGLVARREAEVLL
LVPG
>NE2524 possible type I restriction-modification system methylation subunit
MKRISQQELESYLWGAAVLLRGLIDAGDYKQFIFPLLFYKRVSDVWDEEY
QAALANSGGDLSYAQFAENHRFQIPAGAHWNDVRQTPKNVGAAIQKAMRA
IESANPDLLDGIFGDAPWTNRERLPDETLKNLIEHFSTQTLSVANVPEDE
LGNAYEYLIKKFADDSGHTAAEFYTNRTVVHLMTQLLAPQAGESIYDPTC
GTGGMLISALDEVKRAGGEYRTLKLYGQERNLITSSIARMNLFLHGVEDF
EIIRGDTLAEPKHIEGDRLRQFDVILANPPYSIKQWNREAWSSDKWGRNS
LGTPPQGRADYAFQQHILTSLTAKGRCAVLWPHGVLFRNEEQSMRAKMVE
QDWVEAVIGLGPNLFYNSPMESCIVICNRKKAAARKGKVIFIDAVNEVAR
ERAQSFLKPEHQQRILTAYKTFADVPGFAKVATLAEIGANAGNLSIPLYV
KRIAAAIATDSNDDAVSLRSAWDQWQADGRAFWQQMDALVETLDGLVAQE
AADA
>NE1223 putative transposase
MMDHDHSYKALFSHAEMVADLLRGFVREEWVNELDFSTLEKVSGSYISDD
LREREDDIIWRIRWGKDWLYVYLLLEFQSTVDWFMAVRIMTYVGLLYQDL
IRSESIHKGEQLPPVLPVVLYNGDNRWQAPVDISELIIPIPGGLERYRPQ
LHYLLLDEGSYHDHELATLRNLTAALFRLENSRTPEDVQQVLQALIAWLQ
SPQQSGLRRTFTVWLKRVFLPGRMPKVRFDEIQDLQEVHSMLAERVKEWT
KDWKQQGIEEGLQKGLQQGLQQGRQEGRQEGREEGLQQGEAEFLLRLLER
RFGPINETIRTRIRAADSQTLLTWGEQILTAQTVEEVFEA
>NE0829 conserved hypothetical protein
MRKLFKKYLPSHESIRQNRFVNFFGTLLYHHNLWHLHRRSVAGGVAAGLF
AGLIPGSNPVQFFFATLFSVIFKVNLPIAAFVTLYSNPFTIVPLYLAAYT
LGGWVTGGTRNSGSLPPLELGLLDKNLSEWIPVLTDYLVTFGKPLITGLF
LLASLLSITGYFTVRILWRYYVVHAWHKRAKRHHPDK
>NE1937 conserved hypothetical protein
MSVKNIGNYVIVLALIAIVTSCNFGSSDWRTASRARTGLAPDPAETPEAV
IQVYAARAYSWRGIFGVHTWFAVKPSHAESFTVYEVAGWYARWGGSVVAI
HEQAPDKRWFGNAPMLLAEKRGEGVDELIKRIDKTVQTYPYSKEYTIWPG
PNSNTFTAWLSRAVPEIGLDLPPTAIGKDYLGNSMTATAPSGSGWQLSVL
GLFGIIVSDVEGFEINILGLTFGIKPDPLAIKLPLIGRIDLSV
>NE2382 mce related protein
MQRTMMDFWVGLFVMAGIGALLVLGLKVGNLTDFQPEGSYVLVGNFENIG
GLKVRAPVKSAGVVVGRVTDIQFSVQTYDAVVTMKVDTRYQFPKDTFASI
LTSGLLGEQYIGLLPGGDEEMLKDGEKIMKTNSAIVLEEMIGRFLFDKAS
EKKNSDPEI
>NE1196 putative ABC transporter, permease protein, cysTW family
MILLRSGLKSFLIYLWSGWGAIASLMLFLAVWEWAGLYYGELILPNPRIT
FQTLAGLIAEGSAWQELTITARRALIGFGAALAVGSTLGLVAGLSMTALM
MSRPIVTVLIGTPPIAWLVLALLWFGNGDGTPIFTVFVACMPIVFVGAML
GARTLDGQLGDVATAYRLPPLMRFTDIVLPHVVSYLFPAWITALGISWKV
VVMAELLASSDGVGAALAVSRSHLDTAATLAWITAVVGLLLAVEYLLLEP
IKREIERWRVAV
>NE2383 Domain of unknown function DUF140
MIGMISLAKRIRTIIESIGHRVADSIWRLGCATRFVLLVLLKSLPSFLRF
HLITREIYFAGVLSLIIILVSGLFVGMVLGLQGYETLQKFGSESAVGTLV
ALSLVRELGPVIAALLFASRAGSAMTAEIGLMKATEQLAAMGMMAVDPIA
RIVAPRFWGGVFSMPLLAAMFSVMGVFGGYLVTVVFIGVDEGSFWSQMQN
SVDFRYDIVNGVIKSCFFGVVVTAIAVFEGFDAPPTAEGVSGATTRTVVT
SSLAILGLDFVLTAFMFRGVS
>NE0665 conserved hypothetical protein
MVSPRLVTKIEAEEPTLRLAVLIDADNAQAAVIEGLLAEIARFGEATVKR
IYGDFTAPASASWKKVLQKYAIKPVQQFAYTTGKNATDSTLIIDAMDLLY
TRKFDGFCLITSDSDFTGLAMRLREEGLTVLGFGEKKTPEAFRNACHKFV
FTEVLRPDTATESAAQRTKKTENDQKSSSPQPAAQAPETKQAFPRKFVLA
ALEQSSDDAGWANLGNFGNYLNKLQPDFDSRLYGYKKLSDLVKARTDLFV
TEERQVPGSTQKALYLRAK
>NE0074 putative alpha-glucan phosphorylase, putative
MTQRTAFTLTVSPKIPARLSRLEELSNNLWYSWDRATRALFSRLDQQLWE
AVGHSPKAFLRRVDESLLLQAAEDLTFLSHYNSVLSAYDSYHSEPSRYQA
INGIEARDQVAYFCFEFGFHESLPVYSGGLGILAGDHCKAASDLCLPFVA
VGLLYRQGYFFQTIDGQGNQQPVFCDSDFEDLPVKPALHDDGSEVRISVR
FPGRVVTVKVWRAQIGHVGLYLLDTDLPDNSEYDRLITHQLYGGNKTMRI
EQEIILGVGGVAALRAMGIKPTIWHSNEGHAAFMMLTRALELVRQGLDFA
SATEAVAVNTIFTTHTAVPAGHDHFPEDMMHRYFEDFYHELKISREEFMA
LGHTPGNHDFNMTALAIRMSRMHNGVSRIHRDVSANICHDMWPQIEPEEN
PIDYVTNGVHVPTFLAQEWTDLFDRRLGSQWRSKQHAPEFWSRIHDIPDH
IFWSVRQSLKSQMFYGIRARLTEQNLRNHGSEAHLERLLRFVDPINPNIL
TLGFARRFATYKRATLLFQDLDWLRKLLLDAARPILLIFAGKAHPADLPG
QALIRHIHQVADMPEFRGRILLVEGYDLRLARRLVSGVDVWLNNPIYPLE
ASGTSGMKAGINGTLNLSILDGWWGEGYDGKNGWGIKPVSENMEDSLRDA
EESRAFYEILQDQVIPLYYDHSKFGYSADWVRMAKNSMVSLLPRYGTGRM
VGEYVQKFYAPASRQGTGYSQDNFAIARSIAAWKARINAAWDGVSVRRLD
TPGKQLEFGSTMHFKLAVKLNGLQPEDVVVELLLSRQYNKTRLSQFKHFR
LECTGPLGSHEHQFELNHVPELCGRQQYYLRIYPYHPLLTHPLEMGKMVW
L
>NE0583 hypothetical protein
MKHRTWIWLYLITLPSLTNMVHAKETLPDNQNGQTTAVDSGSAASKEVII
QTAANQQTESTKPTFIAGNAPSSFYQRARSYSTHPESDPPRYVRTLSKTG
IDAFKNLYWLDVGLDYRVRYEHRHNDIRRSRITTDDPVLLRTRAYLGIKE
ILDPLRFVVEFEDARRYNGKFPKDNRDWNEFELIQTYGELYFKDALGRDD
LGNYRPVRIRGGRMAWETLDRRLLGSNQWRNTTNNFEGFRVTLGQESNDW
EFDAWGVQPVIRLIDKFDRRDKGQWFYGAIGHFRQWSKIITIQPYYMGLI
QDDDGGTRVKREIHSPAIRAYGVVPNTEVDFDLGAIYQFGRDGGQKKSAH
AYLLEFGYTFQQAAWKPRVSAFYGYVSGDRDPNDRTNNRFERFFGFARPW
SADDYIIMENIQAPKIKVEFQPHPDLQIDGGYNGFWLASKTDRFNNLLNG
SGNNRDRSGNSGSFIGHSADIRARYKLTPHISTTLGYSHWFNGGFIKNQQ
LAELGETTAGTDFFYVEVAISAFK
>NE2436 hypothetical protein
MSEVITVGDKPILLKAIELVLAEPAAIRKEALQLKDKYVTRYGSDRSEDE
INAYAADKIISNYSYYTAFVGGTTALTGVIPGLGTVLAAFGGATANTALS
MKYQIEMTMAIATIYGRDITIEEEKRLCLMIAGLGAISEMTRVGGKEPGK
KASVKMMQQYLQDASLQTLRELFKKVGITFTKKAAEKAIPFGVGVIIGFS
ANKGLTWYVGTKARDFFSVTDSIV
>NE1263 conserved hypothetical protein
MTEPLLIAKSGNTELAILPSMANRHGLIAGATGTGKTITLQSLAEQFSRA
GVPVFMADVKGDLSGMCQPGGGNARVEARVAELGLKGFGYAACPVVFWDV
FGQKGHPVRATVSDMGPLLLARMLNLNDTQSGVLTAVFHIADDNGWLLLD
LKDLRAMLQHAAENASSYSVEYGNISTASVGAIQRRLLQLEHEGGDQLFG
EPALNFDDLMQIAEDGRGIINILAAETLYNSPRAYATLLLWLLSELFENL
PEVGDVEKPRLVFFFDEAHLLFNDAPAALLSRIEQVVRLIRSKGVGVYFV
SQNPLDIPDAVLGQLGNRVQHALRAFTPRDQKAVRAAAETFRPNPQIDVE
AAITEMGVGEALISLLDEKGRPHPVERGLIVPPGSRVGPATEAERQQVIR
GSLLYGHYEQLVDRESAYEILKARAATSPEQAPAEADRGFNWGELLGGST
GPRGGRREGIVETITRSTARTIGSQLGRQVIRGVLGSIFGGGRRR
>NE1442 hypothetical protein
MKPDRLWFHRGDFLDCVYRLFTRFWLFFLILVLCFPAIAATPRFDFTLEN
IRHPVFSIRSAGIKLIGAPSPTLEINLGEVAIGKQTWHGLRLRCNPVHID
RESMNCNTGTLQIGERFMTMIFRLSLQHKQFVLEIRPASNKSKEKWRLEV
NWQASKWQGVLQVVNGEGKFLADLLPQGDDRIQVHQAILNGNIRLSGNNA
SVSALSARLGISKLSFSDASGLHAGEGIDLQLDADAQQKRNDWQWRGKIT
WPEGEIFWQPFYFSGEGHQLTARGTVKDERINITQGEFNLAGTGKADFSA
VAGIADQSLQQAWLSARDLELSALFGSIIRPLAVDTALAETEAAGQMNID
WRYQGNDNQELIVGLQDVSLTDAHGRFAVERLNAHIPWNSNEKRDGSIRF
SNAQMVGIPLGETYIPIGTDGMRFSIPRAEIPVLDGKMLIENFVASMQAS
GWQWQFDGLLAPISMEKLTESLHIQPMFGTLSGTIPRMSYANSIMTMDGE
LVFGIFDGVAVARNLALSGPLSLTPHLTMDMAMYHIDLDLLTRAYSFGNM
QGRVDVEIDDLELINWEPVKFDAKLASSAGDYKRRISQAAIKNLIALGGG
LAVTAIQKSFLGLFEQFGYAEIGWSCKLRGSVCNMGGIGPATHDGGYMLI
KGSGIPAITITGYNRKVDWPELLERLRHAIESGNPIIH
>NE2067 Penicillin binding protein transpeptidase domain
MNNTVELRDHTREIRLFRIRLAICAVIVIILFGLLFSRFFYLQVSQGEHY
TTLAEANRITIQPLTPNRGLIFDRNGEILAQNYTAYTLEIVPDQVKNLES
AIDELATVVEITPEDRRRFNKLKRESTRFKSLPIRSRLTDIEVARFAENQ
YRFPGMDIKAHPLRHYPRGELVSHVIGYIGRINDKDLEQFEADKRLKNYR
GSTHTGRIGIEKSYENALHGQTGFEQVETDAVGRPVRTLSRVAPVAGNDL
TLSLDIGLQQAAVQAFGDYRGALVALDPSNGEILAFVSKPGFDPNLFVDG
IDHANWRLLNESIDRPLNNRALRGVYPPGSTFKPFMALAGLELGKRSIRY
AFNDPGFFSLPGSSHRFRDWKPGGHGWVDLRKSLVISCDTYYYMLAHDLG
ITNIHNFIGQFGFGREIGIDIPGEVRGLLPSPEWKKKRFKQNWLPGDTIS
VGIGQGFNLSTPFQLTFAMMLLANDGRAYKPHFVKQQVDHNKHLVEDNFV
EEMYRLNLKQEYLDYVKQALTDVTRPGGTASRAGANAAYTFAGKTGTSQV
INIKQGERYNASRINERHRDHALFTAFAPAEDPKIVLTVLVENGGSGGST
AAPIARKVLDYFFLGKMPEPAEKAG
>NE1028 Universal stress protein (Usp)
MSVYHHILLAVDFSSEDSQVVQKVRNLASQIGARLSLIHVLDNIPMPDTP
YGTAIPLDTETTYDAMLDVEKQKLSQIGNTLGIDPAHRWLVWGEPREEII
RIAEQENVDLIVVGSHGRHGLALLLGSTANSVLHYAKCDVLAVRLRDD
>NE1781 Peptidase family M23/M37
MFADLITSSKSTQHCLSRKANRLLRATFLLLSVNVFLPSVLYAIPQANRD
SQKQLQELRTRIDALQKDLVDKKSSKAKVADALRESERSINDLHAKLAKL
AQQQKNAQEKLNQLQNSSALLQDDINNRQKQLGKLIYHQHLAKYGNYLPL
LLKQQNPDKATRNFYYYSYIAQARSKNIDNLRNQLTALDALAHESHIQNE
SLKRIHNEQIHHKRQLELEKNRKSEILATLSKEVTRQQKKIDELKQDEQR
LSNLIEKLNRQLARKKSAPAKADGKSVLRNDKLPDSTEHRGSFAALKGKL
RLPVKGELANRFGSPRESGSIKWQGLFIRSAGGNEVKAIAGGEVIFADRL
RGFGNLMILDHGNHYMSLYGNNAAIHKRVGSKVKSGDTIATVGNSGGNAE
TGLYFELRYQGKPFDPLSWVKLE
>NE2090 conserved hypothetical protein
MYYFLHREISMNTLPETILQQARSLQEGGILSPREFLHLGSRSAVDQAFS
RLAKAGRLLRVARGTYAIPVSSRFGSRAPAPEKVIRALAEQSGEIVVPHG
ASAANVLGLTQQVPIREVYLTSGRTRKLKLGRSEVLIKHAPRWMLALGTR
PAGAAVRALAWIGPTHAGKSLASLRRILPLPEWQALISARATLPGWMAQA
IGEEAARG
>NE0512 conserved hypothetical protein
MNKKLIGIVLALAFPLAVTANPGEHKHDHEQYRAKKIERLDKELSLTDDQ
KSRIDTLFKQNGEKFKAIREETQSQLKTILTPEQYSKLQELKQRRQEQWH
KKHQEKLQERKQEQSGTTQ
>NE0231 hypothetical protein
MTMIKKIETELLAAKATLSEISGRFKEFSDTQARLSADGDLLGLARLNKE
HTGLEDSLLAADDTVRALESRLSVLRQAEYRPQFDKAHKTHLGAVQAETK
AAEKLLAAIDAVFSAATDMQNLSDEVAATYHAARDLHNRAGLDHELRWPA
PDGQIPIKISDRMNSLRDELVRTIRLYEDRLPQSQSLEGLRIIEQQQEEL
VRNSGRGFR
>NE0286 conserved hypothetical protein
MTLPDSKKDQTARHVAILQESYRHYTGRYLFDPKLGASEAIVWLEQAPFA
LVSHGTQPDPIFNYGNQTALQLFGMTWDEFTRMPSRLSAEPVDRAERTRL
LDHVSRDGYIDDYTGVRIAADGRRFLIRHATVWNLLDESGRFYGQAAMIP
EWEVLQPVFTAD
>NE0797 conserved hypothetical protein
MENKEKNTSELLQLIVVQNTNAVISVDEIKNSLHERGFAVLLAIATLPIC
LPVPAPPGYTTVFAIPLFIFSIQMICGMKAPWIPEWLTKKTIKRGTLDKL
ITKAAPWLRKIESHMHPRLTYISVHAWERIIGLFSFIFSISIALPVPLIN
FLPGLGILIMSLGLLSKDGLTIIAGMIVGTTGVGIALIVVALLWMGVPIP
FVQSTD
>NE0018 PDZ domain (also known as DHR or GLGF)
MNVSDVSAQTGSGKKAVSASEQTTRNACVPAASWVIPGKGETTLSSVMAS
VKDKSVVLLGETHINPEHHRWQLQMLATLYAARPDMVIGFEMFPRRVQKI
LDQWVAGELTESEFLSRSEWQSVWSTDAGLYLPLFHFARMNRIPMRALNI
DIHLRRAVTAKGFDGVPEKDREGVTRPAPPAPEYLEFLLPIYMQHGRAPQ
KTAEKNQKPNHYDPDFLRFVVSQQLWDRAMAQEIHAVLSSYDKHKKPLIV
GIMGSGHILKGFGVPHQLKNMGVKKIVSLLPWDTNRPCKLLTDRYADAVF
GLAPFTPESGSPLQQRLGIGFEFSKKTTGAHVLQVEQQSIAESAGLQAGD
VILEMAGSTLKESNDVIDAVKRQAPGTWLPLKVMREGMITEIIAKFPPLA
K
>NE0592 possible uroporphyrin-III C-methyltransferase
MNKHTQTSTSGSARRLALFILIILLLAVAVMLWNWAGKQGYIAAMEYSLT
KYLADADVFSNRSRKMLDELEAKKAEVEQQLNLLEENLQADPASSPAVVE
LPADEGNGTRSDAWILGTVERLVVSTDRQLRLTGDVRSALVSLKHAYELL
QSSDVLGASKLAVILAGEIERLETQPVVDVSEISRNIDKLAAQIETLPLA
MEAHLINVDLSEKYDDMPEPEWWQRYLREIGEDLSRLVKIEKTDQGIPLL
SPSQAHLLKENIELQLILARLALLTRDEESFGSAVKSASDWIQRYFDIEA
QSVRDVLKELERLAGIDIGSRLPDTGKLLQAVRYN
>NE2024 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE0083 Proline-rich region
MRHFSRLIVLSAAVSLVACVNIPLGPSVMVLPGVGKNFDQFRGDDYLCRQ
FANQQTNYETPKNSAVSSGMESAALGAALGAAAGAALGGGRGAAVGAGMG
LLGGGLAGSGTAQSSGSISQERYDIAYIQCMYANGHRVPVSAGLIEGSGG
NMGQGVTSNPSSSGRYIPPPPPGHPPPPPPY
>NE0712 DUF172
MAECNVQINVQLENLMDAITYSTARAKLADTMNRVCDNHEPIIITRNGEQ
SVVMMSLDDFKALEETSYLLRSPKNAKRLLESIAALESGRGETRSLAE
>NE2229 possible long-chain N-acyl amino acid synthase
MQIANHLQSGTAVKNIQPAHIPFSPYQPDSTSSDDTDPRFPRSQKYQISR
HPGSNSSHIAETDCLLQRNGYSIHLVNSLKQRIKASTLIKRMYASRGYQT
ESASVFSTSSNQYTFEARQSQQLIGTLTLTIDTGKGLLADTLYQPELDQF
RRQGRRLCEVSKLAFNPETSSKEIFASLFHMAYIFAHRIHGVDDSFIEIN
PRHATFYKRMLGFRQVGELRTCPRVNAPAVLLYLDLEYMKEQITTQAGQF
DQKTKSIYPHFLSQNREKEITQRIQIEHTHFVPPSSRKSTFNHHQDYFQP
A
>NE1136 conserved hypothetical protein
MKANIAWQEGVSFLGQVGSGHSVLMDGAPEAGGKNLGPRPMEMILLGLGG
CTAFDVVHILRKGRQVITGCQVEMDAQRATEDPKVFTHIHLHFIITGKNL
DPRHIERAISLSAEKYCSASMMLKATVDITHDYEIIVG
>NE1711 membrane-associated Zn-dependent proteases 1
MTVLATIFAFVIALGLLITFHELGHYLAARWCGVKVLRFSLGFGQPLFKK
RLGNDQTEWVVAAIPLGGYVKMLDEHEGQVPAGERHRAFNHQPVSRRFAI
VVAGPVANFLLAILLYWLLFILGVSGVKPILGEIEPATPAAVAGFRSGDT
ITGIGDQAITTWQEARLLLLDNAVDKNADVRITVTGESGISRQLRFDLSS
LGAEDIEKDFLGKLGLSAYQPIIAPVIDQVMAGGAAEHAGLETGDRIVAI
NGKGITTWEEVVTVIRSSPGRILLIEAIRDGQELDLSLQPEAVSEGSTEI
GKAGITPKIEHALLEGLLVKTSYPPAMALAKAVTKTWEMSYFTLRMLGKM
VTGDVSLKNISGPITIANYAGQSAQMGLAAYLGFLALISISLGVLNLLPI
PVLDGGHLMYYLIEMVRGAPLPERIMYIGHQIGVVLLVTLMIFAIHNDLL
RLVSE
>NE2342 Metallo-beta-lactamase superfamily
MTETCQLMEQLPAIIDYPNGISVIDARYHRPGRAAIHLITEGDKAALVDT
GTRFSVPGTMAALAHKQITPEQIDYIFLTHIHLDHAGGASEFMKWLPNAR
LVVHPRGASHMANPIKLIAGVMAVYGESEFRRIYGEIHPIAAERIIEAPD
NTLVELNGRPLRLLDTPGHARHHYCIHDARSKSIFTGDTFGVSYREFDMD
GLEFVFPTTSPVQFDPEAAHASIDRLMALHPEQAFLTHYGRIRNLSYHAS
QMHALIDAFVSIVQQAAEIQDRQPIITKALQDLLLERLAAHRCTLTRDAA
LSLLQTDIKLNAQGLEIWLEQTIRQSTQAAGSS
>NE1530 putative signal peptide protein
MSVKHFITAVSLAMVSTVIPLTVTAEQHAHDAKSAKPGHDMNKMWAEMRT
RAVGMAVSVAADEKGKLWLVRMQDGHIRVSHSEDGGKHFSEGVTVNPQPE
AILAENQNRPKIAVRNGVIAVTWVQALPKVFAGNIRFARSVDGGRTFSEP
VTVNDDQGEISHGFSALTLGDNGRVTLTWFDGRERDAADKGGQKYVGTTV
YYATSEDGGASFSANRKLADHACECCRIGMTLDSDGVPVVFWRHVFEGSM
RDFALARLDSQPKVLRASEDGWEINACPHHGGDIAVDEAGSRHLAWFTGN
PQNPGLFYRRADGENMTAPHAFGDLDFQPGYPAVFAYGKKVYLVWREFDG
NNYQLMASVSADRGDTWSAARAVATTGGAADLPVFVVGAQKPLVVWNSAR
DGVRFFNAEGDL
>NE2417 conserved hypothetical protein
MSAQNLVESTLNPDHAVINEICDAGEPWVKEIKKGQIFRIVDLEGNQAVD
TLFYNAHDAMERYSATDTVRRQHRLYLTTGSKLYSNFGNVMLVITADTCG
RHDTVGGACAAESNTTRYALDKYPMHSCRDSFLYALAHDPVCERLGMSKR
DVPANINFFMNVPVTEAGKLEFADGVSAPGKYVEMRAEMDVVVLISNCPQ
LNNPCNAYNPTPVRLLVWN
>NE0949 pimt: protein-L-isoaspartate O-methyltransferase
MSVRHSGIGMTSLRTRVRMVERLREQGIKDEVVLAAMGFIPRHLFVEAAL
ASRAYEDVALPINYGQTISSPWIVGRMTELLRNGGSNSLRKILEIGTGCG
YQTAVLAKIASEVYSIERIGPLLTRTRIRLRELGILNIHLKHADGMRGLP
EAGLFDGIMMTAVIPEIPETLLEQLAMGGRMVFPKGNRKQYLCVIDHTTE
GFVETILDEVMFVPILPGTINR
>NE0369 hypothetical protein
MNLKKWTDEELVSTRDQIEAWCAKYAQSVWDGRKGYLTGLLGVFGISTGV
VFLMFDGIEVVSFVPILLGVIVCFTWWKTKQQHKKNNGFLEEIKEEIARR
AKKMEKIEKNKPQSNHAVLP
>NE0755 conserved hypothetical protein
MAERRSLSILEDEDGDDPILSVVNIIDVFLVIIAVLLIAVMENPLNPFTL
QDAVIIKDPGKPSMEMIIKQGEELKHYKSTGQIGEGEGTKAGTAFRLKDG
SMIYVPEQEQ
>NE2492 General substrate transporters
MTTENNQRTKIPAGIWVLGCVSMLMDISSEMIHSLLPLFMVGTLGASAFV
VGLIEGLAESTALIVKVFSGVLSDWFGKRKGLAVFGYALGALTKPLFAIA
PGIGVVLTARLLDRIGKGVRGAPRDALVADIAPPEIRGAAFGLRQSLDTV
GAFLGPLLAVGLMLLWADDFRAVFWVAVVPGLLAVALLLFGVHEPDRHVG
EKRINPIRPENLKRLSSAYWWVVGVGAVFTLARFSEAFLVLRAQQSGIAV
ALVPLVMVVMNVVYSASAYPFGKLSDRMNHKLLLALGLVVLIAADLILAL
DDHWITVLAGVALWGAHMGMTQGLLATMVADAAPADLRGTAFGFFNLVSG
IVMLIASAVAGLLWDQLGASFTFYTGAVFSGVALLGLLKRF
>NE1012 SURF1 family
MLMFKQQFKPALWSTAVTILAIALFLKLGFWQLSRAEEKEASFALLERYA
QQPPVTVPEPLIELDDYLYRRVEVHGYFEAEHTIFLDNKTHQGVVGYHVL
TPLRQVNSTTYVLVNRGWVSGGNDRSLLPDIYTPDGLVYLTGVVVSPSIR
TLSLSDKQFTGKVWQSFSLDSYQNLTELTFQPFLLLQQNETQGDGLIRQW
EKPDSGSSKNIGYAFQWFSLAVMTLIIYIVLNVKRKSIA
>NE0781 myo-inositol-1(or 4)-monophosphatase
MAGRLNVLERVIKAVREVARDEIMSRYLQVERIYKADGSFYTEADLAAQV
SLLGRLHEIAPVAMIGEEMSESEQLAQWEAGQDGLWSIDPIDGTSNFLNG
LPYFAVSVALMRGGRSVLGVVYNPATDEMFYAEQGKGAFLNGKALPFSRR
KIPLNKAIANVDFKRLDRRLAVEIVSRPPYSSQRNYGTCTLEWCYTAAGY
FDLYLHGGQKPWDYAAGCLILEEAGGHRCTLSEDDYWSGNPWRRSVIGAL
DEDLFVLWRDWIRSRVSDIDNLR
>NE1478 Domain of unknown function DUF15
MDSALFKTIALIGKHKNPDIVIPLLSLAEYLTDRGISVVLDSLTAAHISN
SRYPILTLEEIGKQADLAIVLGGDGTMLNIARALVPFSVPLIGINQGRLG
FLTDLTADTMHETLNDMLAGQFVVENRMLLTVEVTRNGESVFKELAFNDV
VLHRGISSGMIELEVHINGEYVYSLRSDGLIIATPTGSTAYALSSGGPIL
HPGLNLMTLVPICPHTLSNRPIVIGADATIEIKVHFTTEIKIYTDSHSWF
DLSEHDRVFIQRCPETIKLLHPVHHSYYRMLREKLGWSGILQKNSR
>NE2114 putative plasmid stabilization protein ParE
MKHYLLSPEAKTDITNIRQYTTQQWGKTQADKYILRLRERMRWLADNPML
GRARDEIKEGYRSFSEGDHVIFYRMAGSAIEVIGIPHQNMDIEQNLSSGN
LLLPDIADYEPEDG
>NE0736 Cytochrome c, class IC:Cytochrome c, class I
MKSYLVAAVSGLIFFLSGQVSAADIEAGKKKAEEICSSCHGLDGNSPVPM
FPKIGGQPRTYIEKVLKDYKSGVRKDPVMAGMAAALSAEDIDNLAFYYSS
QSGLQVKR
>NE0224 DUF205
MITVVLIFSAYLLGSISFAVVASWLFKLPDPRSYGSRNPGATNVLRTGKK
AAAAVTLLGDAGKGWVAVAAAKYGGEVWELGDEVIAGAALAVFLGHLFPI
FLAFKGGKGVATSAGILLGLNPWLGVLTISTWMVVALVSRISSLSALLSA
LLAPLYAYFLLEKGILIMAVSIISVLLILKHRLNIANLMAGKEARIGKSS
>NE1667 conserved hypothetical protein
MPDMTSASSGTVLAFDFGKRRIGVAIGEHELRMAHPLTTIDQSMTRPRFE
KIAELIEAWQPVLLVVGLSVHADGTEHEITRLCRRFARRLEGRFRIPVAL
ADERYTTVIARSVLEEVGVTGKKQRPMLDQIAAQHILQTYFDLSHAAS
>NE1265 GCN5-related N-acetyltransferase
MLIRSAKPEDAEFIGSIRVAAWQAAYRGFMPDTYLASLDPGANLDELRAA
LRAENPPFTLRIAETEGQPIAFSILGKPRYNADQSIVELWALNVHPTHWR
KGAGQQLVRQVLLDAKEQKFVSVELWCIQGNLAAQRLYEICGFVPNSQVR
TTSSLTGYPLHELAYTYAL
>NE1319 Thioredoxin
MKRWFTIFLLVLIGYLPVFEAGAYGFRFEDSSGKTHTLSGYKGKWVLVNF
WATWCPPCLREIPDLSELSRERNDIVIIGIAMEYDDARDVMKFVEKMTIP
YPIVLGDRTAAAQLGDELSMLPSTYLFDPDGKPAARKIGLITRAEIEAFI
NTH
>NE1133 putative protease
MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV
LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF
EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARPGKTSNGLGLLDS
NSMPSSIPGVN
>NE1585 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE0526 Uncharacterized protein family UPF0044
MLELDMAQRRQLAASAHHLNPTVTIGKDGLTENVIGELDRSLLKHELIKV
KVLDGDRDQRASLLEQICLSLNAAPVEQIGRVLIIYRINPDKKIARAMTP
DTLKPGSKRKLKPRSAQSAKKTKVRK
>NE1334 Glycosyl transferase, family 2
MYCREDLKESFPMLLGLLSFFVQTVFFVVVAYFALYVLLELRILFISRRV
ERCKLTELTEAVQPSLRVGDDYKPSVSVLLPVHNESFVVERLIDAACRLR
YPADLLEILVLDDSSDDTSRLARARVEQYAARGVNIRHVCRNDRQGYKAG
NLAHGIHQASGEFFAIFDADFVPPPDFLLKTIPYFRDPQLGFLQTGIGYE
NKNKSFLTRFQAMEMGHQQYVTVGLSEEGDMASLSGSSCVWRKSCVEVLG
GWNTSMVTEDVDLGYRAQFGEWKYAYLRDVVSMSLLPESVSAFRVQRERW
GRGLIHSGFKHVRQMLHQRMPLMKRLHAISMMFSSVLLASIYVLVLLSLP
LNYLVDFDNATMQWVLLAFFGLVAVWGLDNAFGARKGARLDEKPGALRTL
WGIYLYIAMFLPMAWYYFAGGIRALLGVHGEFNRTPKGMDERRARMPRIN
MILLAGELFTFVYSMLAILAAVQKGNYFLIPLNFTVCVGFGMVLFWSWRG
IRGNEQAT
>NE2441 hypothetical protein
MKKILSILSSTFILSLFMLSSVNAQVEDVHLQEAIRQTEAVVLAVDVKTM
TQLVQEAERYAVEVKSTHPENEHLQEGLKHLNDVIKESQAGEPAAARKAA
IVALSKFNQIERK
>NE0620 Iron-containing alcohol dehydrogenase
MMPFSIARLPRIEFGAGVLKKLPDIIESYGNHILLVTGAHSFMNTPYWRL
LIDQFEHRAITWRHAMVAEEPSPALIDDFVIDYANEHFAAVIGIGGGSVL
DTAKAVAGLLQVQHSVMDYLEGVGPELSYQGPAVPFIAVPTTAGTGSEAT
KNAVLSVQGENGFKKSFRHDKLVAEYAIVDPDLLASCPPGVIAANGMDAL
TQLLESYVSLKANALTDALAISGLQAVRDALLPFYHQQGELAQHREKMAY
AALLSGITLAQTGLGSVHGLASPLGAFYPIPHGVVCGTLVAAATRINLHS
MHMREPDNPALAKYRHVAEILCKQHFNDPEIAFDALIDLLTQWTEELTLP
RLSHYGLQPAGLDKVIAHCRGSSMKTNPIVLTDDEIRQILLERL
>NE1098 putative transmembrane sensor
MNTRLKDTSGHSPIDPRILDEAAAWLMQLHASTVTDMERNAWEQWRLRSP
QHRQAWERAEQLMNKMGGLPPALSMAALTHPGGNNRRAAIKKLALLMTAL
PASWAAWRVTPWEAWMAEYRTMVSEQREIELADGSQVRLNTATSIDVRFN
AVQRLIVLHTGEILIQTAADNAVTPRPFQVLTAEGRLYALGTRFTVRQGE
ERSHVAVFDGAVAIHPKSGDSGVAQVLHAGNKTVFTAEAIEVSRPADDTD
IAWTQGMLLADKMHLTDFTDELSRYRQGIVRCDPVLADLRISGAFPIRDT
DRTLAMLQATYPVRIVYRTPYWVTLVPR
>NE1340 hypothetical protein
MVREEELSVIISDIKAKIEWYDITFAEDLYRVADRNAPKLKKLPVQVKHI
ELQAIANRVFCGRMLFHYSSLRHRILQGKN
>NE1683 hypothetical protein
MRAPTSHIQGMFGVTLDDLCGRWGFPYPNYIKIDVDGIEIPILKAATSVL
KHPNLQSVIVELGTDAEQQAASDIMQQAGLKLKTKTTRNWGETCCLFERN
PAA
>NE0080 Diguanylate cyclase/phosphodiesterase domain 2 (EAL)
MPYGKLRLLIAGHSKADVEQLSADFGRNGVQLIYQHVNSLSDIQSALNTS
SWDAIITEHSMAGFDTMQALDLMKACHQTIPFILYTAVTDEQAILPLLHN
GANGCVTKGHSMQLMLAISRELEYLDLKRRKRQADSHIYRMTYYDELTGL
PKHNLFREKAVSLLPDNAINGDMIAAVYFIDVNRLSRINSKYGYSISDNL
IQQLASRLSIYSDKTGILARIEGSNFVFLKTGLTGADQAQILANQLLKLA
TAPFIINNLEFYITLNIGISLYPRDGRDIETLLANAENALFSTKRSWPNT
YKFYTAEMGEALSQKTRIEQSLQHTLNDREFILHYQPIIDLKTDSIVGAE
ALVRWQHPELGLLSPDKFVSLATESGSIIKIGKRVLHEACRQAKCWQDSN
HGPAFVTVNISAIELDQLQLIKHVAEALQITGLDPARLELEISESVLMQD
IDGSIRILNKLKEMGIRVVMDNFGTGYSSLNYLRRLPVDTIKIGQPLVQD
IASQSDTSVIITAIVTLARNLGMQIRAGNVQSRTQLDFLQKANCHHVQGF
LFSPPVSAENLLPLIEQCKTGTFA
>NE0112 hypothetical protein
MLGTANLPSRNKYKAKATILHMNRNLYLVAYDICNPRRLRQVCRYLTGYK
VSGQKSVFEIWVTPTELHTIRTELDKLMDTQADRLHILSLDPRMKPRCYG
NASTFTVQHFCIV
>NE1095 putative transmembrane sensor
MPEKQPIPDDIAEQAADWIVRLTAGDATEREQAQSGFEVWKKQHPLHAEA
AASLQQVIGRVNEVRATTHGNPEPAHAALKAALAEGNRSRRRFRKTVATL
VVACALLLPAWQLLQIYPPAYLMADLRTTTGQWQTHDLPDGTRITLNSIS
AVNLHFDIERRILELIKGEILVDVAQDTDRPFLIETAEGSIQALGTRFVV
DRHEQTTTLTMLESKVSVQTAEQRAEHSHESTLVQAGERIHITAQGVSAI
ETIDTRTISDAWQYRHLVVQNRPLTEVLDELNRHRRGHILYDRAQLQGID
MAVVLPLDDTDRALQLLVDSLPMIRIRTLTPYLVLVDLAPASE
>NE2341 AMP-dependent synthetase and ligase
MATIESILHENRIFPPASEFVRNANLSGREAYETLRQEAEHDYTGFWAKL
AQQYIAWHKPFTRVLNDANPPFYKWFDDGELNISWNCLDRHLATQADKTA
IIFESDAGEVNHCSYRELHRQVCHFANGLKSLGIRQGDRVVIYMPMRIEA
VVAMQACARIGAIHSVVFGGFSAKSVYERIIDAGASAVITADEQIRGGRY
HPLKATVDEALAMGDTATVHSVIVFRHTGTGITWQPERDHWWHDLIAGQP
DECEPAWINAEHPLFTLYTSGSTGKPKGVQHSSAGYLLGAVTSMQWVFDY
HADDVFWCTADVGWVTGHSYVAYGPLAIGATQVIFEGTPTHPHAGRFWEI
IQKHRITTFYTAPTAIRSLIKLGSDLPAKYDLSSLRLLGSVGEPINPEAW
MWYYTVVGQSRCPVVDTWWQTETGCHMIAPAPGAISTKPGSCTLPLPGID
AAVVDETGHPVEQGKGGFLVIKRPFPSMLRTLWNDPERFRKTYFPTDIAG
GRYYLAGDSAHRDQDGYFWIMGRVDDVLNVSGHRLGTMEIESALVAHPLV
AEAAVVGKPHEIKGEVVVAFVTLREKLPDDQRAAEIAATLREWVASEIGA
IARPEEIRFGENLPKTRSGKIMRRLLRALARGETITQDVSTLENPVILEQ
LSQTV
>NE0725 hypothetical protein
MTTPNDADNETYSQSLENDMSRLLPDQSVLGEPEPWESWETSLCLWSIGI
GIAALVILGILVDWFLLPGQK
>NE0238 hypothetical protein
MDNYTGQSARLLPLVFYQDCSLVDIQLYNPSNNNKLIFSAIFNDEYALLL
NGKSDTLYSLNRHVLSLDDSKSAIDYLRFFCSYVQSEYGPFQIITHLDEI
PFKDEGMDQNIRDTIKASIHNPEYLEGSFERDGWQAFKACVLYGGALFSS
VMRVFSNGRVSLEEDRAIADNLLLLQRQYHGIFRTPL
>NE1373 conserved hypothetical protein
MKDHSAFLDTNILLYLLSEDETKSVRAENTIAAGGFISVQVLNEFASVAR
RKLNMSFAEIQEFLSHIRMICSVVPVTVEVHDQGLRIAEHYGFSIYDALI
IAAALSADCTILYSEDMQNSQIIDDRLLIQNPFA
>NE1250 two-component hybrid sensor and regulator
MNTATRINISSVMGIREAIEQIFTLIHRHFDSYATDTDDTRQIEECRYYM
HQLNGMLEMLELNGVAFVGHNVEALLTALQERRTEPDTEVLTVTRQAIRS
IYRYLDALIDGETDNPVRLFPVYRKIMQIQGLEEISESDLFFPDVRESLP
LQPVEPDLDDSSRQEAAKQARAQFQSGLLSWLKNAEDRHSLQKMVEAVRS
VEKLPAPIEQRKFWWISSGFLDSLLHQVQAADKSARRLCGKIEQEIRHLN
KESHTVADRLTRELLYRVARSQPVSERLQEIGRVYDWQIPLSTIDNREHL
DESILDEEATASDLQNMRVLLEDINDRWRRFNAEDQENLSSLLALVERFR
TQAARVNCPPLEKLAGVIHGALSYLRIRPQSMNEGIALDIASSLLLTENA
IDNFHRLSPEFPAQVEVLATRIRAVTTGKGNEDDLPDLPSPDQAGHLAQE
KKLLKQVAQEVLTNLAQVEDILDRFFFEPEIRNDLPVLPELFNQISGVLL
MLGLDRAGTLLDSCRDLVEKLTDPNRPLDQAEQILLADALSGLGFYIEAL
RNAQSDSEEILTATINQFNQKLAGQSLSTDIPAISAEPALTAPEMPSEAA
TDQLYDPELLAIFLDESTDVLASIAAHLEACYGDHLDLAALTAIRRDFHT
LKGSGRMVKLEALSEVAWRIEQLMNRWLSEQKPATGELLDLLAESHRQFC
NWCASLKKNGTAEIRAQHLLDRIREMMYGTEEQTAMDATIPVLPLSAVPE
QETSLEWTVEPVVSPSAAEAESVTETMVQIGKIEVPAELFSIFIKEASTH
VDTLKQALGLSEGNTILSISHESMLAAHTLTSTARALELDFIAQTGQVLE
RWFNRLLMTSGMPGPQILPHMTAAVNLLDDMVTAIRNHQYPSIETLQAGE
ETTNELARLLEEVSAQMPETEPAESECIKPESAELQASEILAEIAPVLLQ
ESAVTLPVSPVQSTETEAESTGVDRELLEVFLEEAQELQSEIGTNLRAWR
RQPEQVAARKAVLRALHTLKGSARIAGALQVSDQAHQMESDIESTYDQAV
PAALLDRLESRFDTILDDIEQLRSSLQPVPQPVSPIEETGIRPDLPVFPV
IAEDDQPVRKTVLRVDAELIDRLINESGESSLIRSRIEAQLYDLEQSLQD
LGESVDRMRGQLREIELLAETRIPSGALPSVTDEADFDPLELDHFTRFQE
LTRLMAESMDDVVTIHKSLREMRRAADMAVDQQAHLNRRLQQDLVHIRTV
PFRHYSERLYRVVRQAAKDTGKQASLTIQGEDIELDRSVIDKIGSPLEHL
LRNALVHGIETPQERLQAGKVETGQITLDLHQEGSEIVMVLRDDGIGLDA
GRIRDKAHQLGLDDPENAQNDEQWYPLIFTHGFSTLDNVTDMAGRGVGLD
IVRNELGEIGGNITVTSEKNRGVTFTLRLPVTLAVTQALMVRAADSTYAI
PAANVVHVLEMNSETLGMAYREHHLTWNDDRFPLIYLPHLLGVPHTFPEI
RRHNRIVLLQAGHERLAVHADALVGQCEVVVKNVGPQLSRAPGIEGATIT
GDGSVVLIIDPIRLLQREQVQELLSAGPVAPVNASDIHPSAIAPVVMIVD
DSLTVRKVTSRLLERQGYEILIAKDGVGALQLLRETTPAIMLVDIEMPHM
DGFELIRAVRDNPQLHTLPIIIISSRTADKHRKVAEELGVNEFMGKPYQE
EELLYHIERLIKG
>NE0758 TonB-dependent receptor protein
MKKTACIFLVSNLTLPLSPNALYAAHEAGISKKKDEPVVFNPMVISATLS
DKSIEEIPGTLHLFDKEKIRLRNVRRAGDILQETPGVYIWGASQGSQAPS
ANRGNRISIRGVSGVSRTLVLLDNQKMNDPAFGTFNWATLFPEDIERIEV
IPGAASALYGGNAFAGVIRVVSKKPTEKEISLTGGYQFNSPESGYGSAVY
RDVFDNGLAISIGYRYETSSGYRDSWSFSDPATNQAGPQGTRVTGAIPTH
TTNGTPTFQIGDRGNVPWNYHVGQIKAYYDFSPTTRISGGLRYLFSDVRS
ENPTSYLRDPGGNTVISGTVNVGGTILGRITPSLFLNAETHEDDLRSFLD
FEHEFANHSKFSLSFNHLSRGYWSTLVGQNSTFDGGPGTSAHRPQQLING
QGMYSFHLGDRHFLTTGFQVEHGTLKRREYTLSNWQDIDSRTALNLTGDS
ESLTTSFFVQDEIFLTDRLTAYFGTRLDWWTAEGKSRNLQTGVVKPTEKQ
SVVEISPKLALVYQAPWQGGVLRASAGRAFRPPTLQDMFADSRRGVVINY
ANASLKPETALSWEAGIEQHLAATGTRLKATFFENRLFDLITSKNLTFTT
TLREATDVNAGTGITRGIELSVDQVLTGWLKLHANYTWTPTARTLKNSAS
PDSEGKRMPNSPKHILTAGLETNWHDWSGSLTGRHVSKSFNTAENIDVFH
NVYGGNQAFWLLDTKLIYTWNKMAELSFAANNLLNERYFQSLISTGRNYV
AQVNLRF
>NE2226 SLT domain
MRISFKWQWAVLSFLVLHTAVAHSGRDIQAVRYAGVKGGGDILVAQARQY
EHGEGVLQDREKAVELYCQAARQGSAEGQYALGWMYANGRGVERNDGIAA
RLFEMAAARKHADAQKLLRFMPLPDKRKVQLPNCLSRNVYVRTNITPNKY
YVDQSISALVEKLAPQYEIDPGLVLAVIAVESGFNTQAVSPKNAQGLMQL
IPATAERFQVRDVFDPEENIRGGMAYLRWLLAFFKGDVALVAAAYNAGEG
AVEKYRGIPPYPETVKYVDKIMSRYNKTSHPYQPGVVNRTSFIFASAASG
Q
>NE0749 Transposase IS911 HTH and LZ region
MNKQNKQNKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTL
LEWVKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFA
QAELDRVLKK
>NE1601 putative transmembrane protein
MVTMSCRQERGYIYIWMLFAVMLAGVMLAAAGLIWQTEVKREKELELLFA
GDQFRRAIESYYNDSQVSGRAGEAGASRYPASLEQLLKDERSLVVKRHLR
RVYPDPMTNSYNWGLVRQQDGGITGVYSLSTGVPIKRANFPADYIAFEKA
GNYQGWKFVHAASTAGGQEKQQAEGRGNIQGDTSMPGLPGTGFNPLPQNQ
PAPNLSPPTGNDAF
>NE2483 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNS
>NE0159 ABC transporter, fused permease and ATPase domains
MNNTTRKQIMAPATYWAESLLLAARQLGLQTSPECVRGAASWVGCDDPNA
AVLDIAAQAGLSAKFVEISLTRLSPIMLPALLELDDGQIGLLIDIDAGRA
TILLTLEGKVLERLLPIDNHPRLLLLVHERTAPRKDRVNDYLVTTTDHWL
RDIFISHWKTLFELCAGSLFSNLLAICTALFAMQVWDRVIPARSINTLWV
LASGVAMALLLELMLRMMRVTIADHFGKNADIKLSSMFFARVLDIRNDAR
PRSPGSLIAQLRDLEQLRELLTSSALGVLIDLPFVFAFLLIIWVLGGWLV
LVPLAAIPLLVLPGVLAQYPLAKLSKEGMAEAALRNAILMESIYRIEDIK
TLQAEPRFRQLWDQTNFINADISLKQRYLSGLLMNLSQTVQQLAYAGVLI
EGVYAILDTDLSFGAVMACSILTSRTIAPLSQIPAVLSRLQNARVGKDGL
DKLLELPIDHPVDKNTYHKPVLLGRYEFENVSYSFDPQEKPVLTIPCLSI
RPGERIAILGRVGAGKSTLLRVLAGLAIPQQGRVLFNDTPISMIDVADVR
RDIGTVLQEASLFYGSLRENLLLANPLATDEELLAAMRLSCADQLLLKQP
HGLDLLLRENGFGLSGGQKQSLILARTFLRTPNVLLLDEPTASLDENTER
NIIENMRQWLGGRTLIVATHRYPLLSLVDRIIVIDSGRIVRDGPREEILA
AIRSLAANSQNVAHSKTHIQVHSQPQRQPQTTGTSP
>NE2443 conserved hypothetical protein
MKKLIFLAVAIFAGYSYLGNPYTSIPDRLQHPTFSESQSGTDATIADAFS
NRKSNLQISGEGVVTKLLPDDNDGSRHQKFIISLRSGQTLLIAHNIDIAP
RIGSLRKGDSIQFSGQYEWNEKGGIVHWTHQDPNGSHVAGWLKHNGQIYQ
>NE1210 putative ABC transporter permease protein
MKSASGQEKSIVVRRFDPARFVALVRKESLQAMRDPSTLLIALVLPLILL
FLFAYAVSLDVRHVRIGVVLESPGATARTLADAFSGTHYLDVTFAYDRRE
VAGKLIAGELRGYVVIPQNFERHFADTGSPLVQIITDGSQPNTANFVANY
AQGVVQTWMASLTETMPMPAVQLEPRYWFNPELASRRALVPGAIAIIMTI
IGTLLTALVVAREWERGTMEAILSTPASVLEILVGKLLPYFVLGMLSTLI
AAALAVFLFNVPLRGSLFALLVLSAVFMMPALGQGLLISSLARNQFLASQ
LALFSGYLPAFMLSGFLFEIDAMPALIRAITWLIPARYFVSSLKTIFLAG
DVWSVFLPDLLGMATIGLIFFLIAQRYTRKSLE
>NE1441 hypothetical protein
MTMTKSYYTPILLVAAGFVLTGCVTINIYFPAAAAEKVADKIIEEVWQTD
GNSGKNDRSGNKPGNDVSDKTDSETGKTKP
>NE0802 putative membrane protein
MKICHRYIAWQVLTGMVIATAILLPLFSFFDLLDQLDDVGKGTYTTWDAF
LYTVMLMPRRFIQIAPFIALLGTVGALGALAVNLELVAMRVAGLSPLMIG
LAPVGIGGLLIASTIALEYFVAPQFQQQATILRAVALEQGAELGKGLGIW
TRNERNILRIGEMLHKGRATDIEVIHFDGEGSMTAHVHAWYADIFDESLW
KLHDVTIRTFSPDRITSRKAHILQWHSFLGPDDIATLTKSPESLTPIELV
KHAEFLRATGQKADAYVMALWRKVGGAIMTIAMILVAIPFIFGSVREGLG
GKLILAALMGISIYLFDQIIANIGLVFQLNPIVVSVVPGMVLIAVAAHWL
LRRTF
>NE2146 hypothetical protein
MYDDFRKIIIRRIVYRGQTGPEGDPLKTPPANKWLFYLTLIPAVVVVVVL
GAFFFSIVLALFVAVAGVIGARFWWLRRKFRKSMSAAAEQKNSMIEDAEI
IEIRENDKSDRGHH
>NE0075 Exonuclease
MAQDSNNLIWIDMEMTGLNPGTDRIIEVALVVTDAQLNTLAEAPVLVVHQ
PDDILNGMDKWNQSTHGKSGLIDKVKTSILSEAEVESRMLAFLELHVPAG
TSPMCGNSICQDRRFLARSMPKLESYFHYRNLDVSTLKELAKRWKPEITQ
GFNKQGKHEALADIYDSIEELKYYRQHLFNI
>NE2367 conserved hypothetical protein
MKLHLSDSSGLNVFSGYGEGYVAVNQVRYTDNMIVLPNRIIEHWQASSIS
QLGMEHFDALLAMQPEIILLGTGTSLQFPDASLMRMILSRDIGFEVMDTQ
ATCRTYNILSSEGRRVAAAILVRSTDG
>NE0914 conserved hypothetical protein
MRKLLNFLTNFLAPASLLVLVSMLGGCGYNTLQSTNEQVQSSWSEVLNQY
QRRADLIPNLVNTVKGFAAQEKEVLLGVTEARSRAGSIQVSPELIDDPEA
FARFQNAQGELTGALSRLLAIAENYPQLKSDANFRDLQAQLEGTENRIAV
ARNRYIKAVQEYNVTVRSFPGNLTAMMFGFKVKPNFTVENEKALSTPPAV
DFGAPATAQ
>NE2541 Site-specific recombinase
MSEVLKRRMRCAVYTRKSTDEGLDQEYNSIDAQRDAGHAYIASQRAEGWI
PVADDYDDPAFSGGNMERPALQRMMADIEAGKIDVVVIYKIDRLTRSLAD
FSKMVEVFERYAVSFVSVTQQFNTTTSMGRLMLNILLSFAQFEREVTGER
IRDKISASKRKGMWMGGVPPLGYDVENRRLVPNEREAKLIRHIFQRFVEL
GSSTALVKELKLDGVTSKAWTTQDGKTRDGRLIDKGHIYKLLSNRTYLGE
LRHKDQWYQAEHPPIINRELWDSVHAILETNGRVRGNTTRAKVPYLLKGI
VFGNDGRALSPWHTTKKNGRRYRYYVPQRDAKEHAGASGLPRLPAAELES
AVLDQLRAILRAPNLLGEMLPQAIKLDPTLDEAKITVAMTRLDAIWDQLF
PAEQTRIVKLLVEKVIVSPNDLEVRLRANGIERLVLELRPEPVEQQEVAR
A
>NE2088 possible flagellar hook-length control protein
MPNLPALPDMGLPVVTSDLTATLLPGVASPIVPGQPIEQAFSSILISMMK
SDQAIEDDTENPAPVSIDALIAGMIAAPANPPQLVPSQPAPDALDTPNAP
DAPAEELLSMLATSMMPNTGQPVTDEPVTFPAGGLASATPPNLPSDSGRH
AADPLQTTSQAHAAPLPAELPAGNISTSTPFPAPVNSFNQAIDAANFADS
GKSLPPSMTSVPVTPVAAHTSSTPLTDIAADTTANDAPAIAAEFGQPDWP
EEFGRKITWLATQRMQAAELKLHPAHLGPIEISLQLSDDQRLTAQFISHH
PAVREAIEANLPRLREIMAENGITLADTSVSADTPGQQAENRQGSPFRRP
AAGDHSTYDSTSPQTPALQATRRSSLIDTFA
>NE0975 hypothetical protein
MLQTLRKAGGSLVMTVPKSFIEQNGLSEGSQVELHLHGKKMIVEAPARPR
YKLADLMAEMPKGLPRVEGWDEMSPVGLEDS
>NE2115 conserved hypothetical protein
MKAITAKDAKNKFGEMLDTAQREPLTIEKHGRAVAVIMSVQEYQQMKLER
LRAKLAAGEEQLDRGEGVEGETFFAELLNEK
>NE1236 hypothetical protein
MKTCRNIFYFTGISLVLLLAGCSDDLSLKIAAECIKGNHIQSGSVLDEAQ
CVNRSAESFPGADEDYFKDMDYGISQTPDKVVAALEPFVPGVSQNPDEAV
KSIVRGRNNWIVWTGGNDKFWDYMSVKSFGSFDLLKVLSNHPRLVKNPGY
SRDNRWHWFGLVNEPCFIKNTDAQGKLVGREDRFGLYLDVRDPTCPADPF
ENEEKYPGVKIGARGKTVPVGSYYGYATGIVGLRLFPNPDFNEAAKKRWD
AEKYYSDPNYYNNPKLVKPYRVGMACSFCHVGPNPTNPPEDPENPQWANL
NSNPGAQYFWFDRVFAYEADKTSFAYQHLHTNRPGALDTSLISTDYINNP
RTMNAVYSLPARMLNTLRWGEEKLNGGERDNKQFNHFDEIPADSPLRAFF
KDPDTVLTPRVLKDGADSVGALGALNRVFINIGLFSEEWLQHINPLVGGK
PFTPFPIKIAEQNSSYWRATEQQTLDGALFFLVATPPDHLKDAPGGKHYL
TDDEETLEHGKKVFAEHCAACHSSKLPDEANKFFPDKGCVGPDYLKCWNQ
YWHWTNSPEFKEKMTKLVMEEDFLKENFLSTELRIPVTLLETNICASIAT
NAIQGDTWDNFSSTSYKNLPSVGKALIHHPVTRKPADHVMPDGGRGYIRP
PSLSSIWSTAPFLLNNTVGKFYPSGSVEDRMKSFEISIEQLLWPEKRYCD
QKDLYYYGQDKEGKGSYAYDSEAAGSCEGKTYLTRSGKEVPGIIDRTTER
SELKVPKGYLPWYVRILPIGDGLELGPFPEGIPVNLVSNTNMEMNWGQRL
SLSWDLARAIGWDIFDLWKAQEHPESMTDEELRKILSGIVDPLLGVNKCP
DFVVNRGHYFGTDYLPAEEGRTALDDNDKRALIEFLKTM
>NE1397 CheW-like domain
MEPIAAKPGFGGFRIGGMELALPMQVLREVVPCGGLARLPCASPCVIGGI
DLRGIMVPVVDLRIVFDMPTPRVPFQNVMIMAHENRLLGLLTEAVTGVFT
GESEEINPISFAAADQPVFHGSIRRSDNGTLVNLLSPSVLATLRDVPLAD
HREMKKKAATRTDAASGQDKVMPVLLLRCNRIAFAIDAMEVYTTLSDPEI
RNSPLAMGSCRGVIEHAGSRIPVLDLLEICGLGKIEPDTPLQAFLIQLPK
GMAAFLVSKVIDVVRIRAGEIIDVPTYALPQPELFSGAIPKSSLPASRSQ
QEAILASQFLVLRGAGLKSNHEIMALADVSSPQQPFIHQDHSQTSVAGKK
NTSRKRDMVTYLLRSETATPFEQIREILPFNHSIPVFEQTGPLLGMIVIR
GRSIPVVCLHRLMTGQFTPASSSAYILVVEIDNEVMGFAIPALKSIETAE
WEPELPQFGGSADDDLKRAIHSNKLAQIGNGNAIRMLPMLDLEKIGRAFR
ARQLSAA
>NE1020 Heavy-metal-associated domain
MQAFNTLFSFLQGIFMPTTIVHIKGMFCGGCVSSIKTVLGKIPGVNSVEV
SLNDGQAVIQHDESLKEEVVNQAIEGAGFEVVKQ
>NE1696 conserved hypothetical protein
MTMKMNTAQSGNGSSRYLPGETRAYAHRWFFTAACIYAALIIPLTLLARH
GFGPAVLSVPTGHAFEMLFGLAPALIAGYLLGPMPARRLVWFLCFWLLAR
IAGLIAPFAWPALVFNALFVILFARQLLPRLWAAKKWRNRSLVPLLGLIC
AITIAVILAGRLDEYLLHRYLIDESVQLFALLMLFIGGRMLAPAAAGEFY
RQGMELGARVQPNIEASLIITIAAAYLLAPVAVPFSGMLLILCGVLAGIR
LFRWQLWHCLARPDLICLGLGYGWLALGLILLGWAKLDGGSHFSTAVHAI
TVGALGTLATNVIVRVTLLHVKQYPSRIPQIVIMTGFMSIAAVTRMAADF
SNYRETLLIIAASGWSIAFLFALHVLLTSRTAPQVKPTQNPSTNS
>NE2346 possible hydrolase
MMERITSRDGTPIACWRQGSGPPLLLVHGTTGDHSTWGSVLAGLQQHHTV
WTLDRRGRGHSGDAANYSLEHESEDIAAVIDAIGSSVNLLDHSFGGLCAL
EASLLTANINKVVLYEPAISLAGSDWSATFEARMQALLEKNAREETLLLF
FRDLLNTPNPELVALQAGSNWAIRLAAAHTILRELQGIDRYQFTPQRFQT
LKSPVLLLVGGNSHPRRFMTAERLQQGLPDCRVGIIAGQQHSAMRTAPDL
FVHHVLEFLQSAD
>NE0902 Periplasmic component of the Tol biopolymer transport system
MTNFTQSINFPGSLPLTRNLLAAACFFAMTQASAAGFISDVKQISDQEVR
SGEGYYRADGKWLVYQAEVAGENPFYQIYLKNIASGEVQQVSPGIGKTTC
SWVHPIQEKVLFSSTHEDPEAKAKMTTEIEQRKSGQRRSYAWDYDEFYDI
YETDFAGGNIRNLTHTKGYDAEASWSPDGRLIAFASNRRAYSEPLSEAET
VLFSNDPSSMIDIYIMDADGSNVKRLTHTNSYDGGPFFSPDGKRIVWRRF
SENGQEAEIFTMNTDGSDQKQITRLNKMSWAPFYHPSNQYIIFATNVHGH
RNFELYIVDIDGKKEPVRVTDEEGFDGLPVFTPDGQYITWTSDRTPDHKG
RLFHGKWDHKKALESLGLE
>NE0951 Bacterial regulatory proteins, MerR family
MENQAQINVLPPIPVKRYFTIGEVSELCGVKPHVLRYWEQEFTQLRSIRR
HGNRRYYQHQEVILIRRIRELLYDYGFTINGARNHLVLGLQNQVKNNVVG
GQSCQSGSDFSLAGLKREIEQVISFLQVSE
>NE0268 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1540 TonB-dependent receptor protein
MFSRFKCRSAGWKVYLATTVTFSLFIHPAHAETASADTEPAGKLITATNL
QEVTVTATRTSRKVSEIPESVSVVDTEQIRTRQAADIGDVLRYLPNIDLG
GGPRNLGISPTIRGMADDRILFLLDGARQDFNRGHNARIFTDPSLLKRVD
VMRGPASAIWGSGALGGVISFTTLDATDLLRPGERSGIKVRGGFQSMNEQ
YIPGTSIYGLVGEEFDYLLDFSYRNAADDIRHGDGSKLQNSQFQSYAGLA
KFNWTPGDHRFTFSTQTFDQTGKVPANSQTASTPDTLVDRDTEQRNYTLR
YRYENPDNTWLNPEVLVYHNTTHSLEKRLLDRRRDETDFSTTGINARNSS
RLDLSSFATQLITYGVDYFHNEAKGKRNGAIRPEFPRGNADVVGVFLQDE
ITLWDRLSLIPGVRWDYFRSEAEGMATSANKNDRVNFKIGGLFKVTEWFS
LIGNYSEAFRAPTLGELFTTGTHFTCGPGCANLFVPNPNLKPETAFNKEV
GARIQKSDLLYEDDRLAIRGAYFHNKVKDFVDLMVDFVFFPVPGNPGRGG
VTTSDNVRDAVLEGFEIEANYAAKYGYTGFSYSQTRGYNRTDGGVLSNVQ
PDRWIVMAGLNWPTYDLSLGWRSSIIEAQNRVPTGGTPTPGYTLHDLTLT
WLPQNGSLKGVRLDFGIDNLTDKDYRRHLNVLRDPGRNYKLAVSYQF
>NE1883 hypothetical protein
MKPRYVVDTNVLITASVADPVAPKDIDATPQDPALRFRVWQWLVEFESSP
ARLVLDSAGKIKEEYDRKLGFNDYGIQVVIHKWSMAAVDNVDVQYDTDGH
AVLPPPLDSVVHDLADRKMVAAALEAQKCHGESAIAFAGDTDWHDWEQTL
IQAGLSIEPIIEGWSRAKHAEKAHRKQNHD
>NE0621 DUF176
MYVTIVYASVKTDKTEAFKEATRMNHEQSIREPGNMRFDILQSADDPTRF
VLYEAYKTRKDAAAHKETAHYLTWRDTVADWMAEPRKGVIYGGLYPTGDD
>NE2123 Ferredoxin-dependent glutamate synthase
MPDSLIDLIIYSVGTILGLIVVGAIVTFFIRDVTQKEHAILRNYPIIGRL
RYFFEIQGKYFRQYFFSGDRDEMPFNRATRKWIYKNAKNRGGIIGFGSTY
DLREPGAFIFVNAAFPILEEDQLPTPALSIGEGYCDQPFLARSIINISAM
SYGAISTPAVRALSRGAAKAGCWLNTGEGGLAPCHLEGECDRIMQIGTAK
YGIRDIQGNFSATRAKEIAQSVKAFEIKLSQGAKPGKGGVLPASKVNPEI
AAIRGIPPYQDSISPNRHHDIGNIDELLDQVAFIRELTGRPVGVKTAIGG
WHFINELCDTILRRGLEYAPDFLTIDGGEGGSGAAPQTLLDHAGLPITEA
LPRVVDALIESGLKQRIRVVAAGKLVTSAQGAWALCVGADFINTARGFMF
ALGCIQAMRCHLNTCPTGITTHNPRLQRGLVVEEKYLRVANYALNVNHEI
NMIAHSCGLKHARELRREHIRIVEKAGISVALNILHPYPETKSTVRN
>NE0917 probable RNA polymerase sigma factor transcription regulator
MQDEHAALDIVQDSMMKLVEKYPHKSLTELPLLFQRILQNTIRDFYRRQK
SRSLWVTLFSSFMPDSNEKETGYSDIPESLPVAQDFGYKNNPGTAAEQSE
LIELISKALETLPPRQREAFLLRYWEEMDVAETAKIMDCSEGSVKTHCSR
AAHTLAALLESKGVKR
>NE0445 Uroporphyrinogen decarboxylase (URO-D)
MTKLENDTFIRALLRQPVDYTPVWMMRQAGRYLPEYNQTRARAGSFLSLC
KNPDFATEVTLQPLARFSLDAAILFSDILTIPDAMGLGLYFADGEGPRFE
RPLREEREIRSLIVPDPDTHLRYVTDAVRQIRTALNNRVPLIGFSGSPFT
LACYMVEGAGSSEFRQIKTMLYARPDLLHHILGVTAQAVTAYLNAQIEAG
AQAVMIFDSWGGALSHAAYQEFSLRYMNQILDGLKREHNGDRIPNILFTK
GGGLWLESIMASGCDAIGLDWTIDIGEARRRTQDKVALQGNLDPAVLFSS
PEVIAAEAGKILASYGHGHGHVFNLGHGISQFTPPENALALIEAVHAQSV
RYHTD
>NE2013 hypothetical protein
MCRCRLHWCSHTLESIEEHGCTAHVKGRGQQAKEKRRHPGAHARRWIIEV
SHGWFNCLRKLLMRYEKLARSFLGLNHLAAAIIAFRKVPLAVNIIYE
>NE1583 conserved hypothetical protein
MPNKILTEIAASISELKANPMKVVASGKGMPIAVLNHNEPAFYCVPAAAY
EAMMELLDDIELLKIVKERMDEPSVKVSLDDL
>NE1243 hypothetical protein
MKRWVVSACGLSGKDAMKQAAVILTGILVLLAGKSTTAISREPFHLENMT
CSITSASMNGCTGVPTLHSGILTLSKEGTFKLKARYEGCFMVENVTKSGD
FAASFFEKAMSLELLTTRTTGMKNALSDFPGTLGFAHLNYDGLDGYFIDI
SAVIRARGREMENVIVTANLQCTVIDDAARTAVKADRLSFIQKGVYK
>NE1763 putative membrane-bound dehydrogenase oxidoreductase protein
MKFLSFIFLVSLCAVTSQINAQAPELPLDKIKLPPGFSISVWANVPDART
MTLGDKGTVFVGSKSAGNVYAITENAGKRQIRIIASKLKMPNGVAFHDGA
LYVSSVNRILRFDNIENTLDQPLPPRIITADYPKETYHGWRFIAFGPDGW
LYVPVGSSCNVCETDLEHYALISRIRPDGSNYEVFAKGVRNTEGFDWHPV
TKALWFTDISREWMGDNLPPDELNHASNQGLDFGFPYCHGKEVSDPKLGA
KRDCNKFIPPAVELDPHVAPMGMRFYTGTMFPAEYHNGIFIAEHGSWNRS
TKIGYRLEWVQLEVNEVARKEIFAEGWLQENEAWGRPVDVLVMPDGALLV
SDDMAGVVYRISYDQP
>NE0666 Sensory transduction histidine kinases
MKLHIPRTTFGLAVMVSVVLTALAVLSGLVGRYFAHEELEQQLDHRITTE
SSLLIEEYDRGGLAALAAMVDSHHDRHENSGLGYLLVDHQGRRLAGSLQA
EAPSKAGWREHLYVGNKTTGDRRAHQALTVILPAGERLVVAADRMPVEEI
DAVIGWITAGMTSVMLMMGIGSAWILGLITRQRIDNISKVANAIAAGDLS
RRIDRNNRAGEFDRLAETLNRMLDRNAELLTNLKQISTDIAHDLLTPLGR
LQQQLELALTDCRNTQDYRHTIERAIATAVEIQQLFSSLLKISEIESLTL
RKTFTPVSLTEVAKRVTDAFRPDIEENGRKLIVRLDHDLHILGERHLLSQ
LLVNLIENAMRHTPVGTTISVILSRAADHTILTVEDDGPGIPVADRARVI
ERFVRLEASRSTKGHGLGLSLVKAIATAHEAELIMEDNAPGLCIRLSFTA
LPIRDRKEATKV
>NE0510 hypothetical protein
MILRSWARGVTLATICMFSLSSGVILAKNNRPTPLPTYQYPDVYNNYLQP
VATAIDYHSGMVYLFNYENDKLIIVDPTSLSGWPGNVPLQHTLVFPEGDK
IFITSDNTEEHAAYIIILKVNDINWDAGTVSLAVETVVAADNPGTPTEFP
FVEPVNNVQAIPNWLVGRGTTQIHGPTILPYSDFVYLTELTSDRIRVINR
KTNEFVSGVDPIAIPGYTEQTHGINFNRSGTIGLGTGYFFDNSVIDVYKP
NRETGELQTIGQIRLGDEKRHAAFTHFVYWLDERYAVTASMQFDKTSLTP
TTTKKIIPPSVWLLDTLEGTATKILDHTNHANGKGIFRSPSDIAVVNGKL
YIAEEDSLDYTFANDGYISVFDLTDRYKPRFLKRLKPGRELPTGYAVAHT
ISPTPDNRYLIVASWVSGYVLKIDTETDTVVKIWGPNDGLIKPHGIYTAG
GLR
>NE2398 CBS domain
MKTVKHLLQEKGHTVVAIGPDDSVFNAMQKMAADNIGALLVMKDEKLVGI
LTERDFSRKSYLLDKPVKDTQVKEIMTRQVAYVDLNNTNEDCMALITEMR
VRHLPVLDDGKVIGLLSIGDLVKDAISQHQFVINQLERYIYDTREI
>NE2357 NLP/P60
MSLKHIVILLLGTGMVVGCSSVPKDSNQEKVSRHWHKPAFVTPDKTTENN
PYSYFVRDRLYSQYEEWRGVRYRLGGSDHSGIDCSAFVKIIYEAEFGLSL
PRTALSQANLGSEINQNKLMPGDLVFFKTGRYSQHVGIYLDSRQFLHVST
RKGVTISQLDNTYWKSRYWKAVRI
>NE2409 conserved hypothetical protein
MINNQIKHVWQRFAQSFRLMVGVPEYRVYVEHMRSAHPEQAIMSQEEFFR
ERLEARYGSKGRLNRCC
>NE2269 Glycosyl transferases group 1
MHIAIYLPSLRGGGAERAMATLANGFADRGLKVDLVLVRAEGPYLSEVSP
GVRIVDLQSNRVLTSLPGLVRYLRRERPQAMLSALNHANVIAVVARMLAG
VPARLVVSERNNVSLSGSSSKNLRSRVVLHMMRWAYRKTDGVTAVSGGVA
DDLANAINLPRDRISVIFNPVVTPELIEKSRMPLEHPWLGEGKPPVILGV
GRLTPQKDFSTLIHAFAQVRTVRDCRLVILGEGELRAELEQLVASLGVQD
SVQLPGFADNPFAWMSRVRLFVLSSRWEGLPNVLIQAMACGTAVVSTDCP
SGPDEILEGGKWGRLVPVGDEEALAEAMVTLLGMPENQLSDVRQRAGGFR
PELAVDAYLKILRSMR
>NE1516 Uncharacterized protein family UPF0006
MFVDSHCHLDFPDLASSLDELLVNMQISQVTHALCVGVNLENFPRVLALA
ESHSNLFASVGVHPDYEDTAEPAVEQLLKLADHAKVVALGETGLDYFRLK
GDLEWQRERFRRHIRAARRCGKPLIIHTRAAAEDTLRIMEEEGAASVGGV
MHCFTESWEIARRALDLNFYISFSGIVTFKNAAIIKEVAKKVPADRMLIE
TDSPYLAPVPHRGETNQPAFVRHVAEEIARLRETTLAEIAAVTTNNFFNL
FKVV
>NE1481 conserved hypothetical protein
MLHNRQSISAPKLVVRPHVPWYRRLLMSFVGLLLIALLAYGMYVIGQSTA
QPAGNITVTADPVLEQILESNSCLEKYDTALCSQLAELVRQLQIGNATRA
DLVKQVKSLDEENERLREDLTLFQQMISGNEESSNVELIIHRFSLEAGQL
PGEYLYTLLLAQGGQRLKEFSGKLEFVVGLLQNGEEKFISLVDENASKEF
PINFRFYHRLEKSFQIPADTVVKSLQVFIYENGSSKAVLTKTIQLPLKES
EHVRKKT
>NE1969 LysM motif
MRTLTIISVILFTSLFSGLSFSAGEIGLRDDIPDRYQVVPGDTLWGIASR
FLKEPWRWPEIWQMNRGQIRNPHRIYPGDVIIIENTRYGKRLRMASEKGV
VRLSPRIRAEESAMRAIPVIPAAKIEPFLNQPLVIEKGKLDRAPVILGAS
DDRVILSTGDKVYVGDLPADQGVIWQVFRNGKALTDPDYDNRILGYEAVY
LGTVEITDFAVISTAKISRSVQEILKGDRLLPLSSARIDDYLPHAPDFPV
TARIISVYGGVNEIGENMIVTLNQGSHDRIEPGHVLAVYRKNDIRSHEGK
PVSLPDERIGLALVFRVFDTVSYALIMQSTQAIKVMDAVKTP
>NE1545 DUF209
MNTSMNTVRIIPAMAVPEGAGVIVHRTIGTPVLRNYDPFLLLDHIGSDNP
DDYIAGFPPHPHRGFITFTYMLDGHMQHQDSMGNTGDLGPGSAQWMKAAS
GVIHSEMPKQENGLLRGFQLWINLPAINKMDHPEYQEFPAAAFPVVETAD
YRLKVLIGRFGDTVAPIRDDLTQVTYFDVQLQPGRHFQHRLPAQNTSFIY
LFEGNGQFNGQDIGLHSLIAAGTDGGTFDFVAGKKGARFIVVSGKPLHES
VVQHGPFVMNTREQIDQALKDFQSSQFVRDRAWVKRNQ
>NE0874 Predicted TIM-barrel enzymes, possibly dehydrogenases, nifR3 family
MQIGSYTLKNKLIVAPMAGVTDRPFRQLCKRMGAAMAVSEMVSSNSLLYG
SEKTRRRACHDGEVKPVSVQIAGADPVMMAQAARHNADQGAEIIDINMGC
PAKKICNVMAGSALLKDENLVSRILDAVVQAVDIPVTLKIRTGWDTQHKN
AITVARIAESAGIQALAIHGRTRACAYRGQAEYDTIAAVKTSIRIPIIAN
GDITTPEKAWAVLEYTGADAVMIGRAAQGKPWIFRETDHYLTTGSFLPPP
EVAEIQRVLIDHLYELYHFYGEYSGVRIARKHISWYTRGLAGSAAFRHAM
NRLATVEEQLAATQQFFAELADQGRTLTYIDEEVIEA
>NE1294 putative glutamate--cysteine ligase
MPVPHLTHALNAPTLDLEKRILAAKPTIEHWFRKQWQHHTVPFYCSVDLR
NSGFKLAPVDTNLFPGGFNNLNPTFIPLCIQAMMIAVEKICPNAHNVLLI
PENHTRNVFYLQNIAMLQSIMQQAGMNIRIGSLLPDLDQPLPVLLPNGDS
LLLEPVQREGNRLFTRDFDPCAILLNNDLSGGIPEILQNLEQIIIPPLHA
GWATRRKSWHFSAYDNVAQDFADLIGIDPWLINPYFISCGKVDFHEKEGL
DCVAAGVNEILHLIREKYAQYGISEKPFVICKADSGTYGMGIMSIKDGAE
LYKLNRKQRNKMATIKEGLVVKNVLVQEGVHTFESIHQAVAEPVIYMIDR
HVVGGFYRVHTGRGIDENLNAPGMHFVPLAFDTTCMQTDPTARPDAAPNR
FYTYGVIARLALLASAIELERYLQTPLPVLAEV
>NE2011 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE1868 conserved hypothetical protein
MITQSSLLTYPLAGCLNRLANKSSQVKQRLQTHANETVCFRIDSLASLSV
TITQDGHFTTAAKDTQAAVILDIAAELIPRIASGEMAAFDKIIICGDPEL
ADTLLYMGKIFQAGIEENLSSVMGDVLARRATLTGQELVRWHLGSIRNLS
RALGEFLSEEQPVTASRNRFHQLASEVESLQRRILRLEKRITALVPSFSS
IAGNLPRTGR
>NE2139 putative transmembrane sensor
MPASGPPDEQLLKQALKWFFLLQSEHCTDKDRRKFNRWFCKNEAHQTAYA
HAERLWSETDRLKETPDIPGLREARQRRPKHQPAGILGWALFFLISSALM
SIGWLEYSAETVTYSTRLGEQRLISLADGSRINMNTATRLHVRISYLQRK
ITLDTGEAVFDVAHETWRPFVVHTDKLQIRDIGTRFNIHKQQDDISITVL
EGAVEIDGVRLDEGYQQHYSSRSDHAILRPVDTEQIAAWQHGRLIFRQTP
LAAVTAELERYHPVHFIFTDPAIAHETLSGTFGTRDLPLFLSSLEKALPV
RVKKLPDGQTLLIDRINKKM
>NE1725 SUA5/yciO/yrdC family:Sua5/YciO/YrdC/YwlC protein family
MAQFFSIHPQTPHTRLIRQAVDIVRKGGVIAYPTDSCYALGCQIGNKEAM
DRIRTIRKVDQNHNFTLVCRDLAEIATYAKVDNAQYRLLKAATPGSYTFI
LAATREVPRRLQHPKRHTIGLRIPDHPVVQALLAELDEPLLSSTLILPGD
ELPLNDAGEIRQRLEHQIELVMDAGSCGTDMTTVIDLTETTPKVARSGKG
DLSPFGIING
>NE0058 Flavoprotein
MVSSLGNMSEQKTITVALTGASGMPYGIRLLEILLKQGHRVYLLYSQAAQ
IVAQQEMALTLSPRPKETEAFLNGYFNVEPGLLKVFGREEWFAPVASGSN
PADAMVICPCTMGTLSAVAAGLGQKLIERAADVMLKEQRKLIIVARETPF
SAIHLENMLKLAHSGAVILPANPGFYHLPESIQDIVDFIVARILDQLGVT
HTLLPRWGCDA
>NE2330 hypothetical protein
MVTRGEIVRNKVSELMDGELDGTHAAKIINAVKTDNDLFSDWKIYHAIGD
SLRQSAVNIDISEQVRNQLADEPLLLSPYPHKTHQNRKQKLLGLSVAASV
AALSIGWLISQSMEQHETTLKEIYVAEKNNGKTAPVGGPRSLMTFQPVSA
YSSPSMPVNTHYNNDPLIYRDLTYERSVRYPATGISSPAEVVGEQSAASA
E
>NE1455 GDSL lipolytic enzyme
MKKLLFVILCLFSIFSLSSAWSGTGTSIIVLGDSLSAGYGLPPGTGWVNL
LERRLQNQSPDYRVINVSISGEITLGGRNRIEQALNDHVPDIVIVALGAN
DGLQGRSITSIYENLEAIILACKRYNATPLLVGMQLPPNYGISYTQKFRD
IYPRLARDQQLQLAPFLMEGFGDQPESFQADGIHPTIQAQEKMLDNVWPA
LTAVLKSVRAPVASEESADKTSTEAILP
>NE0901 PDZ domain (also known as DHR or GLGF)
MTKSARLFSLCCLIAVSLFHSINVSANDQTFHHRMEVILSPDNSELQVKD
QIQVPKNNLNQTAATRLEFSLHAGLSIMANKDVDIVAQTVNEADVHAGPV
PLKRYMVTLPAGQNTFTLTFSGRIHHAVQDPSLEYARSFSYSPGLISAEG
VFLANSTAWYPQFDEGMVSFNLDVRLPADWDVVSQGSLIQEHITETTRRV
TWEEKKPQDDIYLVAGRYQRYQQSTGPVAAYVYLRSPDKALAQKYLDATG
QYIAMYSKLIGPYPYPKFALVENFWETGYGMPSFTLLGPKVVRFPFILHT
SYPHEILHNYWGNGVFVDYSKGNWSEGLTAYLADHLVSEQGGKGEEYRRD
VLQKYTDFVNQEKDFPIARFTSRHSPSSEAVGYGKTMMMFHMLRQELEDK
DFISVLRSFYKQYQFKQATFEDLQTTFNTLTGRDFSPFFQQWIYQSGAPD
ISLESAQAEPWNRGYRLKAIIRQTQPGTPYQLTVPIAVHLEGETAAWQTQ
IKIDQPDNIIELDLPARPVRIDIDPQFDVFRRLDSREIPAALSQGFGAEK
PLLILPANTGNAQLSQAYQLLASTWQKTQSGSLKIIRDDQLEALPADRTV
WILGWQNKFSNQIIDTLSEHGVARKEGELQLGGQIYQQDKHAIVLTARQS
ANPASTLLWVASDQPAAITELARKLPHYRKYSYLVFEDSGKSEELNNINK
GQWPVTRSPLTQLVKQKDDASFTSSHTGTLAPRHALAELPPLFSENRMLE
DITLLASEVFKGRELGSPELDKAAEYIAQQFQQAGLQPGGESGSYFQIWQ
QDVGAPKGKITLRNVIGILPGTNPQLAEQSLVIGAHYDHLGLGWPDVRAA
NRGKIHYGADDNASGVAVMLELARQIATRWQPQRTIIFAAFTGEESGLLG
STHYLGNPSGYPTEKIITMLNLDTVGRLGNNPVTVFGTGTASELVHVFRG
AGFVTGIPVNTVANDFGSGDQTAFIKAGIPAVQFFGSAHEDYHAPGDTAD
KIDAAGLIKVAAILKEGAEYLANRLEPLTVTLDAGNRSSDMPQATAQTRD
KRKIIIGTVPDFSWQGEGVRVDDILPNSPAQQAQLQQGDILIRLAGQSIT
DLKSYADILRVLKAGEKIELQFRRGKDIKSIEIVPVER
>NE0544 putative ORF1 [Plasmid pTOM9]
METICMTEMKMSSPVEHGVLTCLCMKALFIAVIFGVTSDVTAHGVTEGDK
GYILESTGILPIPFIYLGAKHMMTGHDHLLFLLGVIFFLYRLKDICIYVT
LFAAGHVITLLSGVLFEVAVSPYLIDAIIGFSIVYKSLDNMGAFQRWFGF
QPDTKIATFIFGLFHGFGLATKILEYELPADGLLPNLIFFNIGVEIGQIL
ALAAILIAIAYWRKSGKFMNHAYNTNTFMMMLGFLLMGYQITGHFILFST
I
>NE0119 hypothetical protein
MKFLDTQELVVSTLSPVHIGCGEDYEPTEYVVDTSGVLHRFNAGILPDLS
DAGISSDILTILSNDEAHTEQLRAVHKVLSKYRDKIIPLASVHVSMCTGV
HAHYKSTQDKKNDFNRNGVERTSYQPFNQLPYLPGSSIKGAIRTAILNEH
IAGNNPCSTVLMRQIQDFNTMIEEYDPGNGKLLLRLKLQHTKWDYDRARK
NIEKAIADVSSALGTDLLGGKFETDPLRALKVSDAAPLDIEIEREIRFCL
NRSRSGRRSQAQVKNLYTRLEYILEHQPAAFSLSLTLQNLHEIAGRRNHR
NELISPSADKLLLWTGIVKACNSYYLNRLDDDLAMLGKLYPTSEWRKQTQ
SILDAGLRDQIKTGNCLLLRIGKHGGANSNTVSGRQIKIMLNEDKREANG
KEEKIRLYTFDDESRTIWYCGDDLDKPSDLLPHGWIVLSNPDQIWHADLP
GFERRCARQQAIAESARRQAEAAAAEQAKAAAQAAREAALAAMTENQRRI
EAFVSMCARRAEQLRGGKENPNAAIHTAARELVKAALEGADWTIDEKCAV
ADAIEEWLPKLVKVELKDERKKLKLSALRT
>NE2522 type I restriction-modification system methylation subunit
MSDQHITLSQLESHLWESANILRGPVDAADFKTYIFPLLFFKRICDVWDE
EYQEIVDETGDEQLAWFPESHRFQIPEDCHWNDVRTKASNVGTALQRAMR
EIEKANPDTLYGVFGDAQWSNKDRLSDALLKDLIEHFSKLPFGNKNVSSD
LLGDAYEYLIKKFADATNKKAGEFYTPRSVVRLMIDMLDPKEAETIYDPA
CGTGGMLLAAVQHVKEQHGDVKRLWGKLYGQEKNLTTSSIARMNLFLHGI
EDFQVVRGDTLRNPAFFEVDRLATFDCVIANPPFSLEKWGEDLWLNDPFG
RNFAGLPPSSSGDFAWVQHMVKSMADVIGRMAVVLPQGALFRKGVEGSIR
QKLLEMDLVEAVIGLAPNLFYGTGLAACIMVCAKRKPAKHKNKVLIADAS
RLFRRGRAQNHLEPEHATEILSWYRGFADVQDAVPWSVSTRSRPKTGR
>NE1610 Bacterial type II secretion system protein
MRYTVRALLAGKGAVLLELDAASEAEVRNRISAQGGVILSIRRNFSGFLP
RSGSRFPLSDFSQELLALLTAGLSLVEGIETLVEKEEDSVTRELLQQILA
RLREGVTFSRALEFYPQVFPSLYVATIRASEQTGDLGEALSRYLVYQSQM
DLVRKKLVGASIYPVLLLVLGGLVAIFLLVFVVPKFSAIYEDMGTDLPWM
SMLLIEWGHLVQKNGWMLAGMAALFLTAAFYTFSRPQIRAHLLRAVWRIP
TIGERMRVYQLARFYKTLGMLLRGGIPVTQALEMVSDLLQPHFRPRVKTA
AALIREGKAISSAMQSAELTTAVGSRMLRVGERTGMMGDMMERIGNFHDE
EIGRWVEWMTKLIEPLLMAVIGTVIGGIIILMYLPIFELAGSIN
>NE1050 hypothetical protein
MNYSKPQKVEFATFPESKRNLLRFPGSGEIPGNDKIKASPTLAPLSKHSA
NRFHSGVILFSARKMKNMATFYAPAIILKLKMYT
>NE2017 conserved hypothetical protein
MLIKTAFQLFSPASDRGKLSILIFHRALSAPDPLFSEESDAVRFNLIMSW
VVQQFDVLPLDEAIRLLQASKLPARAMAVTFNDDYADNYTEALLILQRHS
LPATFFIATNFLDGERMWNDTIIEAVRGTKVDFLDADIPGVEAAPMRNDA
EKHSFLGRLIQAIKRQLYSRRRSNALLKSAKRICQRSHAQHKTAARSAPD
RHADRRTHAQSPDSGQNRRGRGRA
>NE2093 conserved hypothetical protein
MKNPNTVISSDYGPIIINLNDNAIGRQISQYGYWATDDINIINTLVNVQL
DKFGQIMFYDVGANIGTHSLAIAKTHPDTVAIRAFEAQRQVFNMLCGTMA
INGLSNVHCHHNAISEKIGDFIDIPIPDYNSANNFGSLELIPPKNSDNQG
IIHSGKMESVKTLSIDSFNEKVDFIKMDIEGMEDKALLGAINTIEHHRPI
LFLEILKTDVNFVMTFLRERGYLGFQKSFDLIAIPIEYQLQVNGTNRVF
>NE0131 hypothetical protein
MFIFNLNKKESKMNEDRIKGQWKQLAGKIKERYGIAHDEARKQVKKFHDS
L
>NE0516 hypothetical protein
MKLAKLLLTLVLASFACLTVANPMWEPDLPRAKSAFEKETRVPLRAGHYI
EPTLFKGFYAVRTGQAGGPSAYFREDMGWLGNIKSPGWVIQSPAEDNPAY
KHTWLRQQLAQIPLEQLVLVKRSTPPVAVIWSAPDCPYCRKLEKTLEQEN
ASVYVVPVGVAENGFRQAARVYCADDPSQAWRKVMQGSEINGRAKTSCQY
PRDMLNDIGFFLGMGRMATPIVVFADGSTVIGWNDQNAWMLLRKKISEQI
FFND
>NE2194 conserved hypothetical protein
MTVSDHPQTVSQPDPESESRPSKTRLKQEMHALQALGERLVELEPARIAE
LDLPEKLAEALLEARKITSHGARRRHLQFIGKLMRAVDPLPVQEKLDAWQ
HTGMRHTAWLHQLERWRDRLISDETAVTEFVQTYPHTDVRQLRTLLRNIE
KEKLAGKPPHNFRALFQLLRQIIPEIPG
>NE1735 putative sulfonate binding protein precursor
MIKNIIGRLTKRNTTAPAVARTDNTLFIPTRRRFLCTCGCGMFYVTAAGA
GLLAAATRAEAAAGGIAIRVGHLPAGCVSHLLLAKVRGMFGKAGLNVVLT
QFNSPADSMQALVSNNLELIHSPWTMTAAAYTAGTTDLRIIGGSGQAGIE
LVARNGSVKSVEEFIDAAGKGLRVGTLRLDTLELVGYGTMSQHGKSYDDY
RMTFFPSMVGMGEAIANKALDVCTLAQPYAEHVVAEQGAVYLTDSNSVWG
PEAADCVVSTKLHTIDRQRDVLKTYLKVLQESARAFSEDYEAVLNDLQPI
YGVPRPILAVALKRQFPNPVISQAGIGGLRQGAKYLAELGYLKPDVIDTV
LDLKYQPA
>NE2493 Uncharacterized protein family UPF0016
MESFLVSTGVVALAEIGDKTQLLAFLLAARFKKPLPIMLGILVATLINHG
LAGFLGAWITATVSPDILRWILGLSFIGMAIWTMIPDEIEQEETLIAGKF
GIFGATLITFFLAETGDKTQIATITMAAHYGTPFMVVMGTTLGMLIADIP
AVFAGEKLATRIPMKLVHSIAAAVFALLGVATLLGAGSKLGF
>NE1899 ATPase component ABC-type (unclassified) transport system
MIHPGGSSEPLIEVEKLCKIYRLGNHAGTSLNLQVLFDVSFTIRRGEFVA
IMGHSGSGKSTLMNILGCLDTPTSGHYRLAGHDVSTLSGNQLASVRNRQI
GFVFQGFNLLRRMTALDNVAIPLLYAGYSRAESRRRAMSLLQQTGLEKFV
MHQPGQLSGGQQQRVAISRALINQPQLILADEPTGNLDTQTSHEIMQLFD
RLNREEGMTIVIVTHEEDIAAWTRRLIRLKDGRIIEDRPIAKPVVPLSKS
>NE0440 Domain of unknown function, DUF9
MKENLLEQLRFCTNLPSLPSIVLKIIDLAGNADTSLTQINHYISLDPALA
AKIIKTANSPLYKSRYPVSNISQATGILGTYGVTTIALSFSLADSLVKQS
GKKLGMAGSNIFWRRSITSALASRALGERLGMSRMLDDLFLAGLLQDIGI
LAFHALLPEDYPSVFSLADNHNALLAAERETFGVGHDELGAALLEYWKFP
GYVPAACRYSHTAPGKPNAGINECVATSGYVTDYFLSQDRKEKIGEVTRI
AEAYLGLDEHALIEVLEAMQNELQHVEDLFEITILNPAHLSSIVTEAREL
LIVHATIKVRELGDKIRHDGLTGAHNRAFFDETFQNEFQLSIQQGSPLSL
AMIDIDNYKTFNDTYGHVAGDGVLVTIARAITEQIRQTDILCRYGGDEFA
LILPGTTLSAARHTLTRIMESIAAIAYKPNEVNTIRITTSIGLAVNMDKE
KSFSTHREMLEAADRALYAAKHAGRNRIAEWNPSLTSLLKS
>NE0707 hypothetical protein
MARRALALAVGVLLLTSVWSIVVRLLYVLPTTAIERLDDTRFELQRLQTL
AAENSNLTTDDFTRIEQSISTLVFPSSNDNAAFVDAVNMLIHDSDVQLLE
LRTADPFNDGNLTRFALDVRINAPEEKLVHLLKSLERHRPLLIIDRAVVL
ATAAASDGTSPPLSVELRIWAFAAEY
>NE0438 putative malate oxidoreductase (malic enzyme)
MICSICLNGCQPVLRLKSRPDEVWREAGNRKSNIDINSLAGEYKMKPLRG
KALLNDPVQNKSTAFTREERERYGLQGLLPYSVTDIGKQQQRVLANLRSK
NSNIEKYGYLNDLLERNQRLFYRTLIDYIGEIMPLVYTPTVGEACMKFTQ
IFRKPQGFYITPEDRGKILPLFENWPENDVRIIVVTDGERILGLGDLGAN
GMGIPIGKMALYVACAGIRPEYCMPVMLDVGTSNQILREDPLYLGYPRPR
LTGQDYLSLVEEFVAAVQNKFPQALIQFEDFSSQNAFRLLDRYAGSVRCF
NDDIQGTAAVTLAGIYASCRITHKPFVDLKIMLLGTGSAATGVAELLLSA
FMMAGLSEAEARSRISFVDRQGLVVTVREEIKPRVRSFASEHAAMDFVGA
ITAIRPDVLIGATGTAGAFDEIAIRQMARCQEHPVIIALSNPTSHTECTA
DQAYRWSDGRAVFASGSPFAPVTLDGRTRYPAQGNNVYIYPGIGLGMVAS
HARLVVEHMFLATAEELANCVLPEEIERGSIYPEIARVRDVSQKIAVKVC
QVARQEGLAGVDLPDDLETYVYTLMYEPGY
>NE0679 NAD dependent epimerase/dehydratase family
MNVLVTGGAGYIGSHTCVELLTAGYEVVIFDNFCNSHPEALRRIEQITSK
KITIVTGDIRNQVAIEKALKDYGCEAVIHFAGLKAVGESVEKPLEYYDNN
VIGTHRLLAAMQNCGVYTLVFSSSATVYGEPQRLPLTEDHPLSATNPYGR
SKLIIEEMLRDVYRADPRFRIAILRYFNPVGAHDSGLIGEDPQGIPNNLM
PFVAQVAVGRREYLNVWGSDYPTHDGTGVRDYIHVVDLALGHLGALDYLT
VPQCMAINLGTGIGYSVLDVIKAFEEASSRQIDYRLASRRSGDVAACYAN
PALAEKLLRWKAQRDLAVMCRDHWRWQKNNPAGYV
>NE1374 conserved hypothetical protein
MQVSKWGNSLAVRLPASVVEILDLKEGDSIEIHVAGARDIEIMKTPEARE
ILERLRKYRGRLPKDFKFDRLEAHERS
>NE2508 hypothetical protein
MTKLPVLALLCSSLLLPLAAHAGDGCDGLPNWQQLKQALANARKDANGGF
NLDMCGTVVATDGTVCAVAHTGAKVGDQWLGSRVISAQKANTANAFSLPG
LALSTANLYSATQPGGSLFGLQFSNPVDTQVAYAGSAADFGTAKDPLVGA
RIGGVNVFGGGLALYDAKGTRVGALGVSGDSSCADHNIAWKTRHALALDY
VPAGVSPAKDDNIVYLDKAEQANGFKHPMCGGTEDKVTLPPVRKR
>NE2167 DegT/DnrJ/EryC1/StrS family
MKNTNHRIPFNWPHMTGKELYYIAEAHFNGSLAGDGPFTKRCHAWLEERT
GCNKGLLTHSCTAALEMAALLLDIQPGDEVIMPSYTFVSTANAFVLRGGV
PVFVDIREDTLNLDERLIESAITPRTRAIAPVHYAGVACEMDTIMAIAQK
HELKVVEDAAQGVMSTYKGRALGSIGDLGAYSFHETKNVISGEGGALLVN
TPDLSLRAEIIREKGTDRSQFFRGEVDKYTWQEVGSSFLPGELIAAFLWA
QLEEADRITNERLASWQRYHELLGPLEAKEILRRPIVPEECQHNAHMYYV
LLASKIDRQEVLDEFKRNDIWSVFHYVPLHSSPAGLRYGRIHGNLKITTQ
QSERLVRLPLWVGLSVEQQDRVVEVLCKAVS
>NE0120 hypothetical protein
MTPLRAILRLRSPLGTPLAGDTLFGQLCHAVREMLGEEKLEALLDGYTAG
SPWLVVSDGFPSGYLPRPTVAAALQANSEEDPKKRKEAKGKRWIPHSQIA
QPLRQLLSSAVSDEEVYGKQSRPIQAAAFHNTLNRLTGTTGTGEFAPYTQ
SQIFYQRDQRMDLWCVLDEDRLPRETLHQLLEYIGSVGYGRDASIGLGKF
AVEQIEEAALFKQTHPNANAYWTLAPCSPQGQGFKTSRSYWQVLTRFGRH
GGTLALGANPFKQPLLLAATGAIFAPTNNMAQIHFIGSGLAKVSLMQTAA
VHQGYAPVLGICMEAI
>NE0919 ABC transporter, fused permease and ATPase domains
MQSNISDLPHTVPPDFPSHWQTWIGPILNEHEQIHAWLETDLDGQLNFSS
GLVVITTHRIFSRMANDENWQVWPCRQDMKLTYQDHAGVCSIELSDTHAR
LACWHCTLGRHQAVDQLITHFSRLITAHDAATELTGSNVADNESASESSP
ASDQEATATPPSTWTLFRLWRFARPYKWQLLIGFILMLGSTAAALVPPYL
TMPLMDNVLIPYQNGHPIDVDLVTFYLSGLLAAAILAWILGWLRTYTLAR
VSERIGADLRTTTYEHLLSLSQAYFGGKRTGDLMARIGAETDRINLFLSL
NLLDFATDVLMITLTVIILVSINPWLALITLLPLPIIAWLIHLVRDRLRV
GFEKTNRVWAEVNNVLADTLPGIRVVKAFAQEEREIERFRTVNQRNVEIN
DRVNKVWALFTPTVTLLTECGLLVVWIFGIWQVSQDHITVGVLTAFLAYI
GRFYLRLDSMSRIVSATQKAAAGTKRIFDILDHVSSVPEPVKPVHLPVMK
GHIELRNVSFRYGTRRVIQNLDFTIEPGEMIGLVGHSGSGKSTLINLICR
FYDVTDGSILIDGVDIRALPIAEYRKHIGLVLQEPFLFFGTIAENIAYGK
PGASRKEIIAAARAAHAHEFILRLPYGYDSLVGERGQALSGGERQRISIA
RALLIDPRILILDEATSSVDTQTEQEIQGALDNLVSGRTTIAIAHRLSTL
QEASRLVVLDHGRIVETGNHEELMARQGHYYQLYQSQSRSSDDENRPDTS
EKIMDIRGNRV
>NE2061 possible (U92432) ORF4 [Nitrosospira sp. NpAV]
MRRESLLKKHEGVVKSTGGGVVLKQSLYSIVTAFVFVILCSASTVWGHGR
VSLEEDNCVRQVGENMVHLNTYQPQYDQAGHYCTEIPAAGDTYLVVDLID
PALRNMPVSMKVFRGEEKGGEAILQVKADYHPDGVINGIGKLDKGLYSVM
VTAEGVPPLNYYYQLRVEMVDYGKLVRTWAGPAVAILFLGWLMYKLVQSG
RLRSWFKSQDD
>NE1158 NUDIX hydrolase
MKFCSQCGGEVILRIPEGDTLPRYICPKCHTIHYQNPKVIVGCIPEWENK
VLLCKRAIAPYRGKWTLPAGFMENNETLVQGAARETLEEANARVEIRELY
AVYSLPHISQVYMLFRAKLLDLDFFPGIESLEVRLFGEQEIPWNDIAFRV
IHDPLKRYMEERHHGQPAFHLGIINKPQAGSNSNE
>NE2175 hypothetical protein
MERGMTMSTIERLYKLSSTLPPAALAELLDFAEFLHQKNMLPQPDEPFRL
IDMAGGLEHSACFAGEPLAVQEALRREWD
>NE1518 conserved hypothetical protein
MSQDDHSRDSAGLTYVYPVVSRRAGGVSIGINLNPNNACNWRCIYCQVPD
LKRGTAPVIDLVRLEQELRDFLHELVYGNFMQEKVPPETRVIKDIALSGN
GEPTSAREFEQIIEIIGRVKADFPLPEELKLVLITNGSLINRMYVQKGLH
LMAGLNGEVWFKLDSVTQTGRLRINNTRASLQRMRDNLQTAASLCPTWLQ
TCVFKLKGVHPDTHETDAYIGFIRTLLQEGTQIRGVLLYTLARPALQPEA
PDLSKADPDWIKTFADKIRALGIPVRTAF
>NE0879 Esterase/lipase/thioesterase family active site:Lipase (Class 3)
MKTAFSRLFLVMLTAILLSACTSNEIYRSNFSNCIVTAQESCESHAIQLH
DKGTEREYLLGFVEIDDQGQLRNRVQMQALLNELYTLASKESLLINVFVH
GWHHNAKQGDANVESFKLNLAELSKVESHLHQDRTPRKVVGIYVGWRGES
IDIPWINNVTFWDRKNTAHEVGYLGMAELLLRLEEIRNIKNTQEPPVKSR
LVILGHSFGGAAVYSAAAQILADRFINSAGDKNYVDNAEGFGDLVVLLNP
AFEALKYAPLYDLAQARCSYFQDQPPRLVILTSEADFATKYAFPAGRVFS
TFFETHSTIKRNDCNRPLSYSEGAADRQAVGHFEFLQSHELHPASKSMAA
VYHQAKAIWKNQQPGEAIQFGSTELRNLEHTVIHNPYLNVKVDKRLIENH
NDVFRPEIMEFIRMLIVLSTEE
>NE1447 Aminotransferase class-V
MKMVDALLTPSVRQTIASGMERLRADFPILDIKVGDKPLVYLDNAASSQM
PQAVIDRLVHYQKTQHANIHRAVHHLSDLATQEYEAARRKLQHFIGAREE
REVIFTSGTTDSINLVMHGYGRKFIQNGDEIILTALEHHSNIVPWQMLAE
EKGARIRVVPINDAGELLVDEYEKLFNSRTRFVGLSHVSNALGSINPIKR
MIATAHQHGVPVLIDGAQAAPHMKIDVQDLDCDFYAFSAHKMCGPTGVGI
LYGKAHLLESMQPFKGGGDMIASVSFEKTTYNDIPYKFEAGTPPIAAVIG
FGAAIDYLEQIGLDAIAAYEHELLVYASEQIRMIPGVRIVGDCPDKVAVI
SFVVDGIHPHDVGTLLNQDGIAVRTGHHCAQPIMQRFNVAATSRASFSFY
NTKEEIEVLVAGIRSVQKVFAQ
>NE0636 TonB-dependent receptor protein
MSENHASSVLTLWLGASLSALSWAEEISYLEKPVVVTASRTNEAAEDTLA
SVTVITRTDIERQQARSMQDLLHGIAGINIANTGGPGKLTTLLLRGTESD
HVLVLVDGIRMGSATSGFSEIQNYPVELIDRIEIVRGPRSSLYGPEAVGG
VIQIFTRKGGKKGLKPTLSFGGGSYGTINGSATLSGSSERAWFNLGISGS
DTNGFNACKGSFSAGCFTIEPDKDGYRNISGSARAGYRFDNGLDVEASFL
HTDGHYEYDGSFQNRTQLAQQVFGGTARYSLFEPWRITLTAGRSNEDSDN
FKEKLFTTRYHARRDTISWQNDITITPNQLLTIGTDYLYDRINSSEKFTT
TSRYNWGVFAQHQISLAAHKLQLSVRHDDNQQFGSQVTGGASWGYSLTER
VRLTAGFGSAFKAPTFNELYFPGFGNPDLKPEDARTVELGAAGRFDWGNW
SLNLYETWIDDLISYNAATFTPDNIDKARIQGLEAILNTEIKGWLIQTNL
TFLDPQNRSKGANKGNILPRRSKQAFRIDASRQLGDFVVGAMFLAETNRY
DDLANNQKLDDYVKVDLRAEYLINPQWRIQGRIENLFDKDYETAAFFNQP
GRNFFVTLRYQP
>NE1904 Protein of unknown function DUF68
MNRLFPYPLLIILLIITWLMLAGFSVGQFLLGTLTAIIATWGLAALHPAK
PRLRRWDLLPRLFAIVFYDIVRSNIAVAGIILHGGRKVGKSGFMTIPLDL
RDPMGLAILAIIITGTPGTAWIDYNSARNILILHVFDLVDETAWLNLIKN
RYEYLLQEIFE
>NE1273 HI0933-like protein
MTHSFDVIIIGAGAAGLMCAIEAGKRERKVWLIDHSVKIAEKIRISGGGR
CNFTNLHTQPQCYLSQNPHFCKSALRRYTPQDFIALVRKHGITFHEKKLG
QLFCDDSAKQIIAMLLAEAREAGVKLENPVQVSTIETIAGGYRLNTSQGE
RTCTSLVIATGGLSIPKIGATPFGYQVAKQFGLNIIPPRPALVPLVFDGA
LLACCQALSGLSVEADVRFGKQVFAEGLLFTHRGLSGPSILQISSYWEEG
EPIVINLAPGTDVLAFLKEQKQSQPKLHINNALAEILPRRLAQSICDEHN
GSGPLASLSDTRLAALASAVNAWHVKPVGTEGYRTAEVTRGGVDTRELSS
QTMEANKQPRLFFIGEVVDVTGHLGGFNFQWAWSSGYVAGQYV
>NE1147 Arginase family
MSDDDQIRIDGAIARKTPYGMLREPTFSGMLSFMRCNFRKDLTGAELVIS
GIPLDLAVSYRSGARFGPLGIRQASAEVASLKPYPWGFDPFEHLAVIDSG
DCFLDVHNPLTIHGAIVDHARHILASGARMLTFGGDHSISYPLLQAHVEK
TGAPLSLIHFDAHCDTWPDDSLASLNHGSMFYKAVREGLIDVKRSVQIGI
RTWNDDFMGINILDADRIHRQGTDAVVAEIERIVGTQPAYLTFDIDCLDP
AYAPGTGTPVCGGLSTAQALAIIRGLTAINLIGMDIVEVSPPYDHSQITA
LAAAHIACDLICLLAKRKADGLL
>NE0384 Restriction modification system, type I
MAGKWPYKRVEEIALKVAMGPFGSSIKVETFTDTGIPIISGQHLRDAELT
DSEFNFITEEHADSLKNANVQRGDVIFTHAGNIGQVAFIPNHSKYQRYVI
SQRQFYLRCDTSIILPEFVVYYFKSPEGQHKLLANANQVGVPSIARPSSY
LKTIEVPVPSIEEQQVVVRNIKALDVKIRANRRINQTLEAMAQAVFKSWF
VDFDPVKARIAAIEQGQDPLRAAMRAISGKTDLELDQMPREHHDQLAATA
ALFPDTMQESELGAIPKGWQVKRVGDLIELAYGKALKSTDRQEGAVPVYG
SGGITGCHNEALVPHGAIIVGRKGTVGSLYWEDDPFYPIDTTFYVKPKAV
PMTYCFYAMQTLGLNKMNTDAAVPGLNRENVYRLELVKPSTPVLNAFDGL
VAQIRKTMQANETTGQSLAELRDTLLPKLLSGELSVDNTSAEAATA
>NE1575 hypothetical protein
MKKQVIKGMLAASAAALMLGVSAAHANVIDELVGSAKLGNSGDGTELAFI
RSITGDNTLTLDFKINDNDSSFNVMSNGLDSWFIDVAPDTPGYFMLKLGV
PGNSTLHSHYVFKNIGELDKLVWSNDQVNYLTGGNCGLNGSPNSCNIGRL
SHYVGTQGIGGEDPEVPGEIPEPASMLLFGAGLLGLGLSRRRKLV
>NE0740 hypothetical protein
MDETRRDFIKGMFASGTFLALGVPGIARAASVGPLFDSTRNCRLLLGNTT
GAESFAKGVQSACFNHGSRHHGALPVFRFESELSTGFLHLVDLLMQSRNT
RWIAVMDHADAAIFTELIRNSAAHLLASGSHTFAAGDHAALPLRHVWAAA
SPAYSTGGLLASMLAREQYSFSIVERFLTQSAGESIENAALSLPEFLPYH
RADQPVTRLYCAGVPLPEAGRLLGWETSKNQESLFSRTIASTASRNETAG
STTVEYPQSGDWVEATGYAVAAAALGMKINRESCSERAFVYRSGQGHPDH
KGLSGVNFASFVIDV
>NE0920 conserved hypothetical protein
MNLSDCKLSRNAFGKLIVTTKDQIHEGVVPVRSFPITAINEGIALVDGHG
HEVTWIDSLAELSETERILIEQELASREFMPEIKCIDRVSSFATPSTWQV
QTDRGETCFILKGEEDIRRLSLATLLITDSYGIHFLIRDRSMLDRHSNKL
LDRFL
>NE0340 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNT
>NE0491 conserved hypothetical protein
MSFDANIATQYYILMTIKTFRCADPETLFKLGRVARFVNIERPALRKLKQ
LDLARCIEDIRVPPANRPEILKGDRAGQHSIRINDQWRVCFRWTGTDAED
VEIVDYH
>NE0610 Glycine cleavage system P-protein
MLIFEHSRKNRRNYSQAPATRPAKNNIPDHLKRKSTPLLPEVSEMDTVRH
YTRLSQKNFSIDTEFYPLGSCTMKYNPRACNSLAMLPQFLSRHPLAPEDT
GQGFLACMYELQEILKDITGMAAVSLTSMAGAQGELIGITMIRAYHEAHG
DTGRTEIIIPDAAHGTNPATAVMCGYKVIEIPTNRDGDVDMDALKAAVGP
KTAGLMLTNPSTLGVFEKKVAEMSRIVHAAGGLLYYDGANLNAVLGKVKP
GDMGFDVIHMNLHKTFSTPHGGGGPGAAPVGVAERLLPYLPVPIVAHEQG
VYRWLTEEDRPQTIGRLSAHMGNAGVLLRAYIYVRLLGAEGMHRISEYAT
LNANYLMAELRKLGFEIAYPNRRASHEFIVTMKEIKDRTGVTAMNLAKRL
LDKGFHAPTTYFPLLVPECLLIEPAETESKETLDRFVAAMKEILDEIATQ
PDMVKAAPHDMPLRKIDDVKAARELDLVWDPAG
>NE0801 conserved hypothetical protein
MKIIERYITRELLFPFIVVTVILIGLFVSFSVARFLTSAVTETLGTAAMF
KLVGLKTIIALEVLVPIAFYVAVIYGLSRMNRDQEINVLRTAGYGDNRII
RTVFVIALPIAILSGILSVYARPWAYAESYIMDAQAEAELNTNRFQPGRF
YGSEKSGRVIFIRGKDDLEKRMEGVFHYIRTTEDREIIISREGYQQPMTA
EQWPHIELREGQIYRLSFDTVKDSAIRFEKMIYFNENDQAQNYRRKAAST
RSLWDSEEPREIAELQWRLSRPVATVLLALIAVSFTRTAPRKDKTDRTFL
VAALVFAVYYNLSGLAKTWVEQGVVAAMPGVWWVYGLLIIIVMLRLPELR
SLLPAGKQ
>NE1867 hypothetical protein
MTNHTYVIRQQDRNAAISFIDNQLNQNHAWLNETESQRAIAGQEYLQART
DPASFNAWCQKWLNESQWAEIKQAICIAKDRQETRLRYAAEPHKTISVTH
RAWKILSEIALQEQLTLSEVIVNRLSGNDATACPPAKSRYNLTS
>NE0780 conserved hypothetical protein
MASLETPVCNFGWKAVDFDLPGVDGKRYNLASVMGENGLLVMFICNHCPY
VKAIRDRIIRDTREFAEHGIGAIAIMSNDPADYPEDSFENMAKVAREYAY
PFPYVQDETQEIAKVYGAVCTPDFFGFNNKLELQYRGRMDASRKEAAPAD
ARRELFEAMVQVARTGRGPEEQIPSIGCSIKWRED
>NE0322 conserved hypothetical protein
MVRSVKCIRLGCEAEGLDFPPYPGELGKRIFDNVSKEAWSQWIKHQTMLV
NEMRLNLADIKARKYLASQMEAYFFGEGADQPAGYIPPDK
>NE0556 TonB-dependent receptor protein
MQKNLWVIGLSIIMFSGFAQAQSNKVLSIPAQNLPDALHSLSSQTGIQIL
FTAEQLKGFQSPAINGSMNTEQALIRLLQGTSYTWLASGQNTYVIKSAAE
VATVMPEVRVTGAMNPDTPGNPSYTRTNASTATKSNLPIMKTPMSVQVIP
RAVIEDQQAVQVNDAVRNVSGVFPGFTFGGMSQAFMIRGFDTGFASFRDG
FRFPLALRFSLANIERVEVLKGAASNLYGRIEPGGMVNLITKRPQAERYY
SLNQQFGSYGQFQTLADATGALNETGTLLYRLNFEYLNQDSFRDFGFNKR
IFVAPSVTWRITPDTQLDVDFTYNNEDTREDHGIVAIGNRPAKLPRSRFL
GEPADKTTQNMYNTAVTLTHAFNSDWQARARFNYLRRDTVDPQTTGFDLN
EATGDMQRSFYGGTATGDVFMSTMDITGRFYTAGVEHNVLAGWEYYGQFG
SVKSISVDASPINIFRPIYSNVNLASQPYNFFIDQKNEWNGVYLQDQITL
FDKLHIMAGGRYDWASNDVGTAFGIDKFLSDAKAAGREINNNRFSPRAGI
VYQPWEWLSFYGNYVQSLGAANSAFDANGNILKPQIGEQFEGGFKTSFFD
GRLNSNVAFYYLTKQNLAFRVPGQPYSIPIGQARSQGVEVDISGQIIQGL
NLILTYAYTDAEVLEGANKGNRLWNVPKHAGSLWARYDVQYEPLRGLSIG
AGIYAQDKRPGDPANTYFLPAQARVDAMVRYRPPAMHSRLSLQLNIYNLA
DSTLYGGTLGDRFSVNVDVPRTFVGSIRYEM
>NE1748 hypothetical protein
MDANFSADFSSKEDQFFKPQRAAAPAARHVSLTVKIRFLHNDPGKSLQTM
PELAALAPRTKHHELRCSPLAAIENRVRRHFCTWVSRLLSPTLLVRHDLK
SLAMTSQLTKTSKLLLQRMHATGFVTQQSTIIASCAAAQRSGTFITPSTG
LLHTTGIRNDGYVIARQRGAVLVTGLIFLVILMLLGTTAMQGTLLEERMA
GNLRDEMLAFQAAEAALRSGERFLEQVTLPEFNGKNGLYHHACSDEAVDE
DIDESFLCTTTPDPVAEMKWNAGDSREIDVVAMDGVASQPRYFIEQLPSV
PLMGDGGSAQQSGTSLNANMFRIVARGTGGTETAAILLQSTYRR
>NE0533 putative sigma-70 factor, ECF subfamily
MCADCFRFYFSKAISMQESSAADTGTAVIDALYREHHRWLLMWLRKKLGC
PQNAADLSHDTFVRLLSLQEMPVLREPRAYLLVTANRLMINLNRKHKVEA
ETLHSIAALLSERTEQDVAHVAAVRQLLEATVLMLIDGLDERSRQAFLLA
RVEDMTYAEIAETMQVSISRVKQYLVKALAYCHVHLHQLKEDVCLK
>NE1552 Transposase IS4 family
MDAHGMPVRILVTQGTTADCTQAGRLIEGIDADHLLADRGYDSNAIVEQA
EKQGMEAVIPPKKNRKIQRPYDKELYKLRHLVENAFLHLKRWRGIATRYA
KNTSSFLAVVQIRCIALWADIL
>NE1022 conserved hypothetical protein
MADSKSISPTDITDCFFHNLRAQVRPGDRLTVALSGGVDSVVLLHLLTTF
SESMQLEVSAVHVEHGISTYSGEWSAFCQSLCDSLAIPLSIHRLKIRRRP
QESLEAIAREARYQIFKHIQADYVMLAQHQDDQVETLILQLLRGAGVKGL
SAMPTVRLLEPGKTIRLFRPLLNIPRSEILNYARLHGLSWVTDESNLDTS
YDRNFLRHQILPLLEQRSPAYRKTLFRSTQHLGEAAHLLDELAEIDAENT
LVTNRLSLQGLRKLEPARARNLLRYLLAQRLIRLPNSTKLEEILRQLNNI
QPDNHFRFIVDTLEIRCHRGLIEFLPADSLPEPIAPVVWQGEQHLVIESL
QGVLKFTRQNNMGIDPARLSGQIVTIRSRSGGERFQPDCKRPRRSLKKIL
QEAALAPWVRNTLPLLFCEDQLVWVAGIGIDCNFQISEGSTGLVVAWHPS
QINQVSTH
>NE2198 Integral membrane protein, DUF7
MAWMMLIFAGLLEVGWAFTMKLSNGFTNPVYSIITIAGVIASFVLLSLSM
KILPLGTAYTIWTGIGAIGAFMVGIFVLGESVTPLRLLAAALIISGLVLM
KLSS
>NE2501 L-sorbosone dehydrogenase
MNKTYIAQACAVALMMVLAGCEKATYEPAQQSGANPPLPKAQKFFIPPIQ
TPDRTGWAEGEKPTVADGLAIEKIASGLMHPRQLLTLPNGDVLVVEANSP
GTVPITSPKQMIASIVKDRSGKGGKGGNRITLLRKAAGQSGKWEKYVFIE
NLHSPFGVQLIGDTLYVANTGNIMKYRYTPGETRMSDPGVEFTDLPSTVN
HHWTKALLASPDGKKLYVGVGSNSNVAENGLDIEYRRATVLEVDVQSAAS
RIYASGIRNPTGLGWEPTTGKLWAIVNERDEIGADLVPDYLTSVQEGGFY
GWPYSYYGQYVDQRVQPQRPDLVARAIRPDYALGSHVAPLGLWFYQGRNL
PEKYHGGAFIGQHGSWNRIPLSGYDVVFVRFKDGKPEGVPEPVVTGFTSA
DQKTLKGAPVGLAEDAEGALLIADDVGNTIWRVTARGTDAPAPR
>NE1631 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDHLISVNRP
>NE1526 hypothetical protein
MKLFLNSFFFILLTAIISCTPLSTSNPKASVINSVVFVSDSGQVVRAIYR
DDDTVTLTFPNNRIEMLNLAVSASGARYVAGMNEWWEHQGEATYSVNDER
VFTGRLQRQPAN
>NE0394 YGGT family
MPNQIMIFLLDTLLSLFSLALLLRFYVQWSRVPYYHPFTRFLVAVTDFIV
RPAGRVIPSWRGLDLSTFVLAWLAQFIILVGVNLLGGFGAGSSMFAFALL
ALVKLASMTLNILLISIIVQAVLSWINPHTPLAPVLESFTGPVLGPIRRY
IPPIANFDLSPIFAFILLQVLMMVVENLQRQIIQMF
>NE2119 conserved hypothetical protein
MTRTTGIYSISTNLGESVRAFVPHSLPPSDPDLSPKMFTDLNQQAELALA
RLAGVSGLAPSVDWLLYSAIRKEALLTSQIEGTQATLTDLFDEEAGFKVS
NTDDVEEVTNYLRAFRWTQEQLRDPKGLPISVRLLCEAHRRLLDGARGAG
KQPGELRRSQNWIGGTRPGNAVFVPPPPGHVPALLADMERFIHSTATDLP
PMVKVALIHAQFETIHPFLDGNGRIGRLLIAALFEHWELLTEPLMYLSGY
LKQHQAEYYRRLSAIRTDGDWESWVTFFLEGVATATGDAEKNIIEVASLI
ATDRKRMLQSTKAGPASYRLFEMLPMMPRFTIERVRRQLDTSFPTATAAV
RVLEDLGIVTEMTGQKKNRSYSYQAYVELLSR
>NE0981 HhH-GPD
MALVFWEQAVNDLSARDPVMHRIIQCYSDSMPEERGNAFATLARAIVGQQ
ISVKAAASVWQKVTTLIPEITPEALIATEIDLLRTCGLSARKVDYLRDLS
RHFLEGTLVTVNWHDLDDETLIRKLVEVKGIGRWTAEMFLIFHLHRPDVL
PLDDIGLQRAVSLHYNASQPVAKQAIRTIAESWQPWRSVATWYLWRSLDP
IPVIY
>NE1075 DNA polymerase beta-like domain
MRPSVVLDMKRSAVREAVGRFRTTNPRVFGSVLHGTDHDGSDLDLLVDAL
PGATLFDLGGLQVELESLLGLHVDLLTPGDLPPKFRAKVLAEARPI
>NE2079 conserved hypothetical protein
MAILRKNPELAAPSSVSRDTNESDYIVHSLTEVCFFLNGIMQEKSLISLH
LARNSHSAILSSILAVDLQKKLLVLDYGINETLNQMALKSGMLRCVTSHN
RIRIEFDCNNLQHIRFEGRNAFSADLPTSLKRLQRRNFYRIATPVANPAT
CVIPPLRQHEEVFLTLNLLDISRGGMALIDRPDTDAPLEAGMTLEHCRIE
LPEFDTIETTVRIVSISMAILSNGNTCSRIGCEFVNLSEKSGTLIQRYII
WLEQQARKFDTKSHF
>NE1538 conserved hypothetical protein
MHAPAILRQSGIDALLAGDGSPEIMVTDSPSPPVKRSKIWELSRTYHCPV
IGLCLPASDLERFAKRFHFKASINNAHDLHVEAVCRVTSRNVVSETIHKY
LDRLYESHIKQFDAAKTDSEVLALWETCFTNERIAGPLWAALTHKRISSE
GREQIHDRVHMHMHEAVAELSATQHRLNQAEQMLKELAGQLEQIREKHVQ
TESRLRQQLQQAAAEIKQLQQSRRESETTRQRLETLESGQAILRMGQRLM
QLTVENEQLQIKAAQADDLKQSLKNACHNLTELYHERDTLRVERDTLENL
LLTLNSGSTTDTETADTTETKMAAYECILCVGGRSTLLPQYRTLARQLGL
KLSHHDGGQEDALSRLPEMIYGAAAVICPTDYVSHAAYYQVKRLCRLGHK
PCLLFKGTGVSSFAAALTKITSGQASINSSPVDTVETTT
>NE1247 DUF160
MNHAIKIMAPGHEALTQNTRFQVYTERQLDKIRPLERFTKEQRFEMHVVA
NVLPFRVNQYVIDELIDWNNIPADPVFQLTFPQRNMLEPEDFERMAEALR
RDAQRSEIQAIAVDIRSKLNPHPAGQQEMNVPVFHGEPLPGTQHKYRETV
LFFPSQGQVCHSYCTFCFRWAQFIGDKELRFASNEAGNLHKYLAGHKDVT
DLLMTGGDPMVMKTHHLKAYLEAMLRPALEHVQNIRIGTKSLTFWPQRYV
TDEDAHELLALLERLVKAGKHVALMAHFNHWREMDTPIVREAVRRIRAAG
VVIRAQAPIVRNINDDPAVWAKMWRTQVGMGIIPYYMFVERDTGAKRYFE
VPLERTYQIYREAIQQVSGLARTVRGPSMSAGPGKVEIQGIVELNGEKIF
ILRFIQGRNPDWVQRIFFAKYDPEATWFDDLKPAFGEEKFFFTDEYNAMQ
GV
>NE1067 putative plasmid stability-like protein
MLRTWLEEHVLSSFADRILPVDTVVARRSAALHVPNPRPYRDSLIAATAL
VYGMTLITRNVADFEPMGVTLLNPWTPCWPRISGLVP
>NE1563 Protein of unknown function DUF79
MTYKLKFLPSAKKEWDKLDSSIKTQFKNKLKKCLENPHIQPNKLRGFDNA
YKIKLRSAGYRLVYEINNQEVVVFVIAVGKRENNKIYDKAINRTKT
>NE1185 putative phosphatase protein
MKLIILDQNGVINYGSETYIRAPGEWKSIPGSLQAIARLTHAGYRVVTAT
NQSGIGRGLLDMTTFNSINDRMLRAVQQEGGRIDSLFFCPHTQHDKCSCR
KPGIGMFKEIRQRYGIELSHVLAVGDSLRDLQAAAKVGAIPVLVLTGKGQ
VTLAEGNLPAGTQIYPDLSAVADSLEVEAV
>NE2477 hypothetical protein
MNNIQTNTRDQDSLEYILQLELDNIRLFLELLEHERNLLAAGNLDDLALL
VADKDRLIDQFARLDMRRNRFLNAAGLPEGTQGMNAWVSGSDEESTVARD
WEELLGLADLAKQLNQTNAVITSSWLQYTRRTLNALHSAAGRPPLYNTKG
QTT
>NE0070 7,8-Dihydro-6-hydroxymethylpterin-pyrophosphokin ase
MILPLHSESVSQVFIALGSNLENPLSQVRRGMMLLAGLEHSRLVKRSSLY
RTAPVGHIDQPDFINAVVQMETNLTPHHLLDALLEIERACGRVRTFPNAP
RILDLDILLFENLQHNDTKLVLPHPRMHERAFVLQPLLEIVADCVIPGRG
SAVACLAACAAQPLECIASS
>NE0455 Prokaryotic dksA/traR C4-type zinc finger
MGKQLESRTIPPYEPKAGEEYMSPEQLAHFRKILEYQRAELSMEIDRAVH
VMQEESTIFADPADRATQESDMALELRSRDRERKLIKKIDEIFTKIDSGE
YGYCEKCGIEIGLKRLEARPTASLCIDCKTLDEIRERQMAK
>NE0471 hypothetical protein
MNEYFFPKLTAVEALAPYRLRTTWSTGEVLEVDVGDILRKIPDLAPILDP
EAFARVHIAEWEGSVEWFDTEFGRDNVYAWAKEQAGEVSHEMFGDWMHRN
NLSLTTAAEALGISRRMVSYYRTAHKIIPRTIWLACLGWEATRPETKTLP
RTLPAAYAKGVSASLS
>NE0145 Conserved hypothetical protein 48
MGEKPYRARQLLRWVHQSGKTDFMEMSDLAKGFRHKLMECAVVQLPEIVS
DHTAGDGTRKWLLSTGAGNAVEMVFIPEPSRGTLCVSSQVGCALACSFCS
TGRQGFNRNLSVAEIIGQLWWANRLLEAGSHDPFPLDTTRVQTDKPETRR
PVTNVVMMGMGEPLANFENLVTALDLMLSDDAYGLSRRRVTVSTSGLVPA
LDRLRERCPVALAVSLHAPNDALRDQLVPINKKYPIRDLLAACERYLPAA
PRDFITFEYVMLKGVNDSVALARELVQLVRNVPCKLNLIPFNAFSGSGYE
RSGAEAIGNFRDVLMQAGIVTTVRKTRGDDIAAACGQLAGQVRDKTRRTS
GCGTGQPAVAR
>NE1598 hypothetical protein
MNTINANDLKTRGIAAIEAQLEEQPEAIIAVRGKDRYVVMQLEHYYYLRE
CELTAALAETRADLAAGRCEQESPEAHLARLDTLK
>NE2218 hypothetical protein
MNRYVIGKRAVLSVTAGMLSISTLLSAPQVFADSTSTQALEAQIKLLEEQ
LQSIRGELDRVKTDTVRNEQKIRENEQSVASVGAQKSAGESGKHMVFFRG
GFAHSNHTRNGVSIQSDVAPVGAQDQAGKDAWYIGAGFDFNLTNDVWGFL
PGTSVFAELMFEYKQFSNSVSGNALANNPTQLAGGALNPRKVTVSQFTLT
AAPKIKFLEGNRFRPWIIPAGLALHVISPPSESITVLQPGVMFGVGADYN
IWKAIYVGADARYQLAGGKIDGVNVNGFTVGGYLGIGF
>NE2290 Bacterial type II secretion system protein E:GAF domain
MSISLASEAGSGIIPGNGIPSDSAGRLQSVIDRIQAAGNVDEIIFETSRD
ICEIFNADRLTVYTLGDDKTSLTSRVKTGLDSWQNLKLPINEHSIAGFVG
LHKQAVNFRDVYDRCELREQSEYLRFLQEVDKRTGYRTKQMLVVPILNAG
DNDLLGVVQLINNKADEPFSQRVVDELSEICQALAVAIRQRQMQQLLGRT
RYDQLIVNALISASEFELATRQARETGRDLEDVLIDEFGISVTQVGGALA
VFFGVRYERYRPDRIKPMDLLKNLKREFVESNSWLPIDDGTEGLMVLTLD
PERIRALRIVNQIFPKRKIIYAVCTRREFQAMLDLFYGGNETENLGGSID
EMLSNLGEGDGGEETRETDTDEASAAADNELVKLVNRIIMDAYKTGASDI
HVEPLPGKGKTSIRFRKDGSLMPYIEIPASYRNPLVTRIKIMCDLDISEK
RKPQDGKIKFRKFAPLDIELRVATIPSAGGVEDVVMRILAAGEPIPLEKM
GFTSFNLERLKSIISKPYGLFFVCGPTGSGKTTTLHSILKYLNRPETKIW
TAEDPVEITQKGLRQVQINPKAGLTFATIMKSFLRADPDIIMVGEMRDKE
TTSIGIEASLTGHLVFATLHTNSAPESIIRLLDMGMDPFNFADALLGILA
QRLAKRLCKCKKPHVASREEIRALLVEYCEELKNIGSFKADPDAACREIE
SRWVQQYGDENGQITLYEPVGCEHCTQTGYAGRVGLHELLTATDALKKNI
QEHARVAEMLVTALHDGMRTLKQDGIEKVLQGITDIHQVRAVCIK
>NE0115 possible M. jannaschii predicted coding region MJ1674
MTTLISFLGKGIADKTTGYRTATYRFDDDSKHTTPYFGLALAGYLRPERL
ILVGTAGSMWDVFFEQQDASDDDVLALIDAVRESRVDADMLSAQEKRLTK
RLGLPVICRLIPYARDAAEQTEVLLTLAKLVHRSEEVFLDVTHGFRHLPM
LALVAARYLAHVKDVKVRGLYYGALEMTSTNGETPVLQLDGMLQMLDWVE
SLATYNKDGDYGVFASLLQQDGLPEGKAKQLTRAAYFERSSNPVKARETL
GSVFSAIKTHNGPMGVLFRDALTERINWFKEPDRAAWELALADAYLERRD
YVRAVIYLYESFVTRAVLEHKLNPNDFSERDEAWKDARQDNKQVRKLEYL
RNALAHGIKSDDKEIIRMVNDENCLDDQLKKFRRSLFN
>NE1113 HlyD family secretion protein
MLRYPNRFYILMVLFSIYGLSGCNDRQSDTAANEVPPQVDVIITRVVSSP
VSIELPGRLEAFRQAEVRARVAGIVTERLYKEGQDVRKGTPLFLINPELL
KVARDEASGTLAKAEANYHDVADKLKRYKDLVSDHSISERDYQSAVAGEK
QAKAELLSARARLEKARLDLGYATVVSPIDGRARRALVTEGALVGQDAPT
PLTTIEQIDPIYVNFSQPATEVMAMQRAIKSGKIRGIEQKDIQVHLIFSD
GSEYGHTGKLFFSDLAVDTSTDTVAMRALFDNPEYELLPGAYVQVRLEQA
IINHAVPVPRDALVRAAGVSSVMIVDTEDTVQRVEVEANTLEGNHWVVTS
GLQGGEKVIISNPAMMISGARVRPVVTDDQTDHSQQ
>NE1975 NtrX; nitrogen assimilation regulatory protein
MANNTILVVDDELGIRELLSEILGDEGYNVALAENAEQARNYRSQTRPDL
VLLDIWMPDTDGITLLKDWANNGLLTMPVVMMSGHGTIDTAVEATRVGAV
GYLEKPIPLQKLLNTVGQVMRGGKQHKFSPTLSLSNLGHSGLITELRKKL
EQVENLKIPVLLTGEPGVGVEHCARFLHRPNTAWIVPESCSFLAESPLTQ
LEQAKGGLLFLGEVGDLSKLEQKGLLLLLGKLEQYGVRLVCATIRPLAEL
TAQGTYHPRLFEILSNLCITIPPLRAHREDVPEMASQLLSGWIESGEIPL
MHFSTAALNALRNYDWPGNLAQLTSIVHSLALTCTGDEISVDAVQQALTL
SSREAVSVVSDIPLNISLREARDIFEKNYFEKLIEQEGGNMTRVAERAGL
ERTHLYRKIKTLGIKPRSQNY
>NE0392 conserved hypothetical protein
MEQITRWLMIAGAALLVIGVVLHFAPWLFNWFGKLPGDIRIETRHSKIFI
PITSMLIVSIVLSVIINLFKK
>NE1946 Bacterial regulatory proteins, TetR family
MLTTMAEKISVEEKILQTARRLFCQVGIHATGIARIIEESGVSRRSLYTH
YGSKENLLKAVFEAEANIWFRWFDCDLPQMECSPTERILALFDLMRDWFD
SKNFYGCVFINAVAEHEKSNGWIQEVANSHREKITAKLQAMVAASGAQNP
EMVTEKLNLLIDGTIVTAMVTANSEIAHIGKLAAGDILRNAQ
>NE2034 conserved hypothetical protein
MSLAEFRAQYLVRFWSPIPALLALGVASAYYFAITGTFWAVTGEFTRWGG
HIAALLGFSPQQWSYFQLIGLNGSPLERIDGVMIIGMFAGALCAALWAGN
VQLRWPTSRRRLAQGLIGGIIAGFGARLAMGCNLAAFFTGIPMFSLHAWA
FMLTTVIGAWIGVKLCLLPFLRTPLRLDTAPSSLFADTASLARRARLQNR
LGLLIAVLVLGFAAWRFETSLVLGLAVLFGVFFGAVIERGQICFTSAARD
LWTTGRTRIAYGILLGMVVACLGTFGAIALGATPKIFWMGPNAALGGLLF
GIGIVLAGGCETGWMYRAMEGQVHFWIVGIGNVIGGTLVAIFWDELGGTL
ALPYPKINLLEYLGAGTGLLLSLAGLMLAMLLVYLNARRFAVREGLAR
>NE0496 hypothetical protein
MVFPAKSRRFIAPGLRYRQHATNLIGADGGFIAKFSRFGLMLSGRYRQWN
ARNIEALSGLRRN
>NE2334 PhoH-like protein
MTLDAATKSKPVELMFSPTDNERLANLCGALDENLKQVEYAFNVMISRRG
GHFKLYGSVEHTHLAIQALKKFYDDSYHHLSVEQIQLGLIEIRNNPSPPL
GDSDRAIAGSNATSLQLVTRRGNIQGRTARQTDYLFQIKKHDITFGIGPA
GTGKTYLAVACAVDALERELVSRVILVRPAVEAGERLGFLPGDMVQKVDP
YLRPLYDALYDLMGFEKVSKHFERGIIEIAPLAFMRGRTLNQSFIILDEA
QNTTAEQMKMFLTRIGFGSKAVITGDITQIDLPKHQKSGLVEAERVLKNV
NGIAFTRFRTEDVVRHPLVQRIVDAYEKYAPDKESG
>NE0972 hypothetical protein
MLSIKPAAEDLAARQPVWEALSDMFLDTDTSLSRQWRADQLARSPYSIDQ
LEFILINEVYPICKYNLLSVAGEWAGFDPEWLKEKILRHLGSRFRFLHTL
NLGCFTVHASVEWHATRHAILAARSIGTKNTT
>NE1594 conserved hypothetical protein
MNINAVTAELFQATAPVQPAMPAARTGEREMAPAAQPPAETSDIKPEQVK
EAVNQIQQFTQALTQNLKFSIDEDTGKTVVKIVDAQTQEVIRQMPSEEAI
KIASALGKIQGLLFNGQA
>NE1994 TPR repeat
MRRVPWLLYWRGTSWILFNPQEARSSLEQAYAGFQAEKDKAGLFLVCAAM
IDAYLYAEDNIKPIVAWGERLQKLLSLYGDFPSIEVEVRVYGSLLGLIFS
APHHPLLQILEARFESTLQSSVEPALRIAAACAIVFLPLWRGDARKARRI
MDETIPLFKGISISPLLRILWCNIEGGYAWAIAASAHIAEQKFHEALQIA
QESNISVLNAMLWTHGAYSALSAGNLVTAESCIEKLKLNIDAQRKHDLTE
FRYLRAGIEFLREDFSKALDDASAVLKSHEEMERPFLREINRLGLAQILI
EIGDVEPGRSHLKQTVEYAKIMRNPMLEYQCLLIEAHSWIKQGNADKALV
PLREGLRIGRENGFLLINYWLRPKVMAHLFSLALQFGIESVFVRNLIRHW
GMKAESQALEVWPWSVRIYTLGRFEVFLEDMPLHFSGKAQHKPLELLKCL
CAYGSLAVNQDRITDALWPDSGGNAAEQALRTTLHRLRKLLRHEKAIRLE
DKHLSLDPGYVWVDCMAFDRAIHHSGMADRNSLQQALSRYRGHFLEGETA
SWALTFRERLRAHYIKMSERLGGMLEQDGGWPEAIDCYRRAVEIEPLAEN
FYQHLMRCHAQLGQRAEVLSEYQRCRHQLLCRLGISPSQETQLLYQKLIN
T
>NE0570 possible lipase
MNDEVKQRLQSTETWIRVLFMLLFMFIQGSVKFLIVLLALFQLGSTVLTG
QANTRLLKLGRQLAMYDYQISLFLTFNSEQRPFPFSSWPSDTDNRTSDNA
DNRTPNENPEKTSWFQ
>NE0345 Acriflavin resistance protein:Heavy metal efflux pump CzcA
MLSRLISFSIKQRLFILVMVGALLIAGIRAFIELPIEAFPDVQDVQVQVI
TQVVGQAPEEVERSVTIPIEREMSGIPQMTQLRSVSITGLSVITMIFADG
TEDRLARAQVLEKLQTVDLPDGVMPTLGPLTTAVGEIYRYVVDAPPNLPL
YEVRAIQDWIIRPELRRVPGVADVVGFGGSVKEIQVDVDPNALRKFGLTL
DQVSQALRENNANTGGGIIRRGGEGLVLRAIGLFHTVEDVAATVIMSHEG
KAITVGDVATIEISGHTPLSGIVSLAQQGDNGKILSQDSIVEGIVLMIKG
SDPSKIIQTLKTRVDELNSQAKLPEGVHIRPIYERTQLVDHTVATVGENL
LLGALLVVAVLIVFLRDWRTSLIVACIIPLTLLFAFIMMNARGVSANLIS
LGAVDFGIIIDSAVVLVEALMVRLMLRQEPGSGQDTAKWRINALRQTAIN
LGKPILFSKAIIVLAFVPIFTFQRVEGKIFSPVAFTLSFALIGAIILTLT
LVPALLSYLLQRSSISELHLTWMERLKTGYRKLLDWTDTRPRIMAGITIA
SLAGALTLLPVIGTEFLPKLDEGNIWLTIALPPSIHLEQSKEVERAIRAK
LITYPEVKTVVAQVGRPDDGTDPKGANNLEILADLNPRSSWRFSSKEKLV
ADMSHSLKAIPGLPTNFSQVIQDNVEESLSGVKGEIAIKVFGPDLEILED
KAEQIVTLLNHIRGAVDVAAIKVGGQTEVTVTPDRQKLARYGISIAEINT
LFQTAFGGSAITRFYEGERRFNVNLRLAPDYRHTITDIANLQIAIPGSTG
AFLTLGELARIDVRQGASRIAREAGGRAVSVKANLMGRDQGSFVSEAQKK
VTEHVHLPPGYRITWGGQFENQQRAMKRLMIIVPASLLGIFVLLFWAFKS
LRSAFIILLIAPLTLIGGLTGLALSGLHLSISAAVGFIAVSGIAVQNGVI
MVEEIVGLLHKGRSFAEAVREGAVLRLRPILMTALMAGLGLLPAALSHGI
GSETQRPFAVVIVGGIISATFFTLMLLPLMFNRLARPKHTQP
>NE1592 hypothetical protein
MKVLQSDKAIMMNRKLTELPIDERIQLVEDLWDSIASDQKMLRLTTEQKA
ELDRRLNAYEVDKNPGRSALEAIAEIRRNL
>NE0713 conserved hypothetical protein
MKLVFSEQAWEDYLYWQKTDRKTVQRIDTLVKEITRTPHEGTGKPEPLKH
ALSGYWSRRINNEHRIVYKIADDSLFIAQLRYHY
>NE1512 Aromatic-ring hydroxylase (flavoprotein monooxygenase)
MKFDIVVIGGGLAGASLLAALKGSGLRLALIESRPPAPLPDDDSWDARVY
AISPGSVEFLQSSGIWQRMNAARITPVHEMRVHGDDNAARIDFSAYESGV
PELACIIENRQLQHAVWEELAGAENVQIFCPAQCDSLAWQDSHVELTLAD
GTVLQTALLIGADGINSRVREQAGIGVDRHSYYQTGVVANFETERVHHHI
AYQWFRRDGILALLPLPGKRVSMVWSANTALADELLNLSAEALCDRVARA
AEYELGSMRLVTAPLGFPLNFVHARSLIKPRLALIGDAAHGIHPLAGQGV
NLGLRDVRELARLQRQFGGIGDCGEFSLLRCYERNRKEDILAMGWVTDGL
QKLFGSEDAAIMRIRNVGLGITNRLPLLKNRLMRHALS
>NE2183 conserved hypothetical protein
MIGILIISHEQLGTSLIDCIIHILGERPPLLINHIVSSTEGPDSGSVRLQ
VTLEQLDQGSGVLILTDIFGATPANIARKQIRSGRIECLAGLNLPMLLRA
VQYRHQPLPQLIDKILAGGRESIFQISPENSNAN
>NE2445 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1936 hypothetical protein
MRAASCSMFSQILKLILRTGCDGLSRRLGQNLTVVRKAVKLCLVLDHNGY
LSAPASLSTGKVVEVKVDEIMVGCVKKPTKFGGLF
>NE1378 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1508 htra-like serine protease signal peptide protein
MQRLWLIFTQTVTVLLAVFFVVSTLRPELLPWTPRGKLATIREATRANVE
QALSAGGFHTAAEVAMPSVVNIFTSKEVRAPSHPFMDDPFFQRFFGDRFG
PRTERSLGSGVIVSPEGYILTNHHVVEAASEIQVALMDGRNAEARIIGSD
PESDLAVLKIDLGELPSITFGESEKARVGDIVLAIGNPFGVGQTMTMGII
GALGRSQVGINTFENFIQTDAAINPGNSGGALTDTSGNLIGINTAIYSRS
GGSLGIGFAIPVDAAKQIMQQIIETGGVVRGWLGVSMQDLTPELAESFGL
KKAGGALIAGVLKNGPADDAGIKPGDVLVAVNGKPIFNSSEMLNMVASLA
PGKSATLTILRHGGQQDIQVRIGKRPS
>NE2241 hypothetical protein
MIKILLLLTSVLVAMPVAAVDVAPRISDREIIESLAELKAGQKALEEKMD
LRFNAMQEQIDQRFTAIDQRFTAMQEQMDQRFTAVDQRFTAVDQHFTAMQ
KQIDQRFIAVDQRFEAIDRRLDFIQQLMLVTIAGIFGLIGFIIWDRYSTL
RPMDMRLQRLEEDLERDLELQSPEGSKLTRLIHALRELAKEDKKVEAILR
SFSLL
>NE2428 hypothetical protein
MSLHLALLMRGGDPHCILDFGAGNGELCKLIALQFCYEPTPSLMAEAKEN
LADLPQISFCSDLEKISDGSVELIFCLEVFEHLPEKETKDALGQFDRLLT
DNGNAVIGVPVEIGIPALYKGIFRMSRRFNTFDASIKNVLLAALSFPPKD
RPVSEITPGFAFHHEHMGFDYRKLQALLHAQFGLQQVTTSPFSIFGPWLN
PEVNFLIQKANPAVNADAAR
>NE1701 possible msrA, pms; peptide methionine sulfoxide reductase
MKIIIAALTGLIMVVTVFAMEKTVTPNSVNVKQTIDSKTDYIVLGMGCFW
GAEKRMGELSGVIDVESGYAGGDHADAGYQDILNFEHALRAGKTAGRNHA
EVVKVTFDPAQVALEHVLARFWESHNPTQGDRQGNDIGSNYRSAVYYHDE
NQKTLALTTRKIYQQALATAGYGQITTEILPLKNYITAEEYHQNYLQKNP
DGYCGLGGTGVKYPAPDQHIIRGDTSSVPASPSPQSLDGKNLNFDQQLVI
FESENCEFCAQFDRDILAHWQADIPMVATKNTNPPADWTLDKPLFAAPTI
VLFREGKEVVRYTGYTGEKENFWQWLGFQMLTPEQQKIAFQQGTERPFTA
SNLDEKRPGRFVDPITGATLFFSKTKFNSGTGWPSFFEPVEGSVTYHEDH
SGYMQRIEVRSASSGIHLGHVFDDGPPPSYKRYCINGNVLKFIPD
>NE2444 putative periplasmic protein
MLDTAVIKLRSTDQLLLNFNKMNIRSVVFFVCVLLCAIANAQTQADLNDD
ACGAYQEADKKLNAIYQQLLEQHKDDANFTTRLRKAQRAWLAFWDAEMEA
IYPADNKREEYGSIYPMCSCLEQAALVNHRIEQLSGWLTAEEGDVCRGSR
>NE0600 probable transmembrane protein
MGAQFFSSLADNALLVVAIALLIDLHAPAYLTPMLKFVFVLFYVLLAPLV
GAFADSMAKGRVMFISNSIKIVGCILLFFAAHQFSALGAYAVVGLGAAAY
SPAKYGILTELLPPEKLVIANGWMEGLTVASIVLGTVIGGLLITPSVAAV
LLSLDLPLIKTAVDASIIIIMLFYGIAAVTNLFIPDTGIDHRILKKNPIF
LFHDFVHCVKLLWFDKLGQISLAVTTLFWGAGATLQFIVLKWAEAALGYA
LNQAALLQGVVAVGIALGAVLAAKLVSLRRSLDVIPLGIIMGIVVILMIT
ARDLWISVVLLISIGGLAGFFVVPMNALLQHRGHILMGAGHSIAVQNFNE
NLGILTMLSLYALLIWFDVHIYTVIILFGLFVSITMIVVRKWHLNNQSKQ
DSLHLIGMQKRQF
>NE1103 GCN5-related N-acetyltransferase
MSLQLNPPELLVATHLLDDFECGVNSLDEWLKRRALANQHSGASRTFVVA
DHDSRVYGYYAMAAGAVSHQAATSGVRRNMPDPVPVMVLARLAVDQRAQG
IKLGAALLQDAVNRVVNVSHNVGVRALLVHALDDRAKQFYAHYGFKESPQ
HPMTLMLRLNTTKA
>NE2109 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE2358 conserved hypothetical protein
MLQRSVFFLSDRTGITAETLGHSLLTQFDGIEWKKHYASFLDSAAKAQAV
IEQINTIAEQEGQPALVFSTLLDPVMLASIRRADCVLFDFFETCLGTLEA
VLQQPPARIPGRSHVLRQDASYFRRIAAIQYALNSDDGANAKILADADVI
VVGVSRTGKTPVCVYLALQYGVLAANYPFTPEDMGAIQLPPLLQPLRKKL
FGLTLNTSRLQSIREERYPGSHYASFAECQRELQWQNELYRQFDIPSINT
TDVSIEEISASIVNRAHLERRQHGT
>NE1235 hypothetical protein
MNNTVRAWLGKSREELDEIYRHATPGNIPAGDTRGTAILAGSFFSKTVAA
FARLFAWQGKVFDLFCPGGQAGVLVNKITPFGLTFIVAKVYRDKSWLDGQ
DTIVIDYSKTSFVAKVIRDEIREVEPGVYLGKVWWGKTRVLDFALTQSDT
Q
>NE1003 Aldose 1-epimerase
MNIEQLNKNYKIGEQVVFAEGEGGLPFIHVRNDKASALISVYAGQVLAYR
PQSIQDDLLFLSKRAYYQAGKAIKGGVPICWPWFGPDPEGRGRPAHGFMR
NCMWDIVEVSATPEGNTRIVLGSTDTEETQSIWSRSFVLRLEITIGDTLN
LELVTRNSGSQACTVTQAFHTYFSVGDIRQTHVSGLENTRYIDKVDDNLE
KNQTGSVTIDAEVDRIYQSVGSNLVIHDTARKRRIHITSKGNRTAVVWNP
WAKISAEMADLEDDDYLRFICVETTNAATDRVHIQPGSEFRLIANYRIEQ
D
>NE2251 Type I antifreeze protein
MPIYEYACHACGLEKEHLQKMSDAPIANCPACGSSDYVKKVSAAGFQLKG
TGWYVTDFRNKNTRSDSKPKEESGKEAADTDKAAATTTTDSTTATATTAS
TSTTAPTVSSVD
>NE0627 TonB-dependent receptor protein
MKVWRGCDKNTSLDWLIHRIYPARISAWKWFSCLMFWLPALPLFAQQLSP
EQPAQFDTISVTATREARTTKEVPQSISVVDEKRIKDVRMFNIKDAIQGQ
VPGLRIESNNNAYDAKISIRGAGLKAQFGVREINLLRDGVPILDPDSFGR
LDFVDPDDVERIEVTRGPGDLYSAGTAGGTIHIISRSAFDDQHNVIRGGV
GNWGTHNLHTRFGKVFNEHALAFTFSRRHTDNDWRRHNRFSSTNAGLKHG
WQLGESSLLETEITYSDVKLNLPGSLSREQYAQYRNTGKALETQDPWKNS
ARDSQILFMNSRLEHRAGDWLFKPRIYYNQWKQFHPVTGQINVTDGWERN
FGFDLEGIYTHSLFGIPGSMVIGGTWRRNWNDGALQYAYADVLTVPGSGR
IISTLSDRKGQINSRSKSTNDLWGVYFLESLSPLDRLTVDLQMRFDQVSF
DIERNEFQRYDFASGRYTPGRGLIQVREKYDLFAPKAAVTYRATDLINVY
ASVAHANQVPADSEVQNAVEFGRTLKASGHLNYEAGFKGRGRNWSFDLTG
YHTDVSNEIISFVQNLQTLYTNAGKTSKNGFEFFGNYAFTSGFEVGASYT
YTDLKFNRLTEPLTIIDPATNQRTTINANRSGNQVPRFPEHMYFVYTTYR
HLSGWHGRIETRGQSDYFTDNANTERHGGYHFLTNLTVGYDRKHWGLTFN
VQNLFDKRYAVDVNKDASGSRVSFTPGMPRTFMGYVSYKF
>NE1215 Short-chain dehydrogenase/reductase (SDR) superfamily
MKKSILITGCSSGIGYYTAHGLHARGYRVFATARRQESVEMLLAEGLESF
RLDLNDSDSIRWAVEETLRRSGGELYALFNNGGYGQPGAVEDLSREALRA
QFETNLFGWVELTNLILPAMRRQGYGRIIQNSSVLGFTAMPFRGAYNASK
YAIEGWSDTLRLELRGSRIFVSLIEPGPIITQFRANAMKAFERYIDVERS
VHREKYLAIHNRLNKPGPAVPFTLPPEAVLKKVIYALEADTPKARYYVTF
PTHLFGFLKRILPVSVLDKILAKAGNDHQ
>NE1500 hypothetical protein
MDDFVLARVLHVLGVVLWIGGVAMVTSTLLPVIARMPPVFDRMDIFHRIE
KRFARQARFTTLLVGLTGFYMIHVLDAWHRFTEIRFWWMHAMVLVWGIFT
LFLFVLEPRVMHKKVSENAQQDPEATLARMQRMHWLLLSLSLITAAGAVA
GSHGWSFF
>NE2007 hypothetical protein
MDFTIKAIALLTIGHRLHQFVMYQPCCKIAHTQLTLERQGRQTDLGLTNQ
INYQEPDGQRQFGALENCSGKSMRSDADRPCIEKPCVNQIL
>NE0579 Domain of unknown function 2
MNRTTAKPFTVMDEFELIKAATDYSLEQDENGRISGNFYHCKLTSAFQVI
LDIHENNIIGHAAYIRSESNGEVSLWPWQVFALASKDEQLIDLDRLCRAI
HALNYFKKNFDTNTGQLFLSVHPRLLSSIQNEHGRTFKDFLDLTGISTSR
IVIEIPPALNHDWKLLQKLIINYRSYGYQIALNFSSTNGHWLLGTDDLHP
DILIVQAHELLHYQLNDYPEDNSTRDAGFRLHVRKIETQEQLTAAKQAGA
HYLQGNFLGKTV
>NE0076 Peptidase family M48
MIFQELNFSDSFMHTFTLVFILALILTTLAQWWLAARHIRHITAHRNQVP
DAFVSQIDLAAHQKAADYTCAKARLSFPGILLHAGLLLVLTLGGGLEWLS
GFWHTWFSDSLWHGMVLIFSVVALLSIVEIPFSYYRTFVIEQQYGFNKMT
RAMFFADLVRKYVLGTLLGAPLLLSVLWLMEKAGDSWWLYTWLIWIGFNL
FLLAVYPNWIAPLFNKFSPLENDSLKTRIENLLQKCGFESSGLFVMDGSR
RSSHGNAYFTGFGKTKRIVFFDTLLNRLEAEEIEAVLAHELGHFKRHHVI
KRIVLSFAVSLLFLWVLGYLMQQPWFYQGLGVQVTAVPSTAMALLLFFLV
MPVFTFLLHPLSSIYSRKHEFEADEYAAEQASAADMIRALVKLYQDNAAT
LTPDPLHSAFYDSHPPAAIRVAHLKKLITTGAEV
>NE0704 hypothetical protein
MMAVFYSAEELTISERRLRADLAHESAVEMVIYDVLVKGNRSIWIGDGIV
TRNVQISEQMFSVSVQQATGLIDAMTSDSRILSRLLAWLAIPRNSREPDF
LSARSTGTLRPATYTDLQAMLGLSHSAFACLYPHITFYSGRVEPDWRYAS
NDLVELVGLRSRSAGTHSVLNDDTSSHNVTGATLRVNVLPGNTSDEAAGL
SVEVTITGQIDPSHLIRSWKRITRMDNSKQCRNLNTQ
>NE2514 conserved hypothetical protein
MSNEQLADLTQPQRDRLAFIELRLRFIGEIRRQDLVSRFGIQAAAATRDI
GIYKDLAPGNLDYDTKNKVYGYADGFTPVFDFPAERVLAWLSQGFGDGEP
SALKSWITCEIPSRLTKPGLDTLACVTRAIHQERPLKVTYHSLSSGETER
EIVPFALIDNGLRWHVRAFDRRSKDFRDFVITRMQRSVMLKDSHVLPHEK
SDQDIQWTRIVELELVPHPDQPRPEIAEMDYGMTGGMVKMKLRAATAGYI
LRKWSVDCSADHSLRGPEYRLWLKDHLALYGVKNAVLAPGYAPLDAARAV
ADCD
>NE1360 putative similar to abortive phage resistance protein
MADRRRKLKRPMGERRYKKLFFIAAEGVKTEPIYFGIFTDETSIVHVSYL
KGKHDSSPPQVLKRMTDHLKNKELKSYDEAWLVVDKDQWTDEQLTQLYQW
SLQQENYGFALSNPKFEYWLLLHFEDGVGIKSSHDCTDRLKRWIQDYDKG
INMRKISQEQINDAISRAKKRDHPPCKDWPRTLGQTTIYRLIENILKSSK
GFVK
>NE0281 conserved hypothetical protein
METAATNLVRSLTPQEYGFTLIILILAALTGFYCFIRAWKRWHLIKDTPT
ARLRSAHQGRIELEGKGRSLPDQPVFAPLSNHECLWYHSRIERKETILEQ
KRTRTEWKILYRNTSNHPFLLDDGTGICQVDPEEAEIISNEKLVWYGNTE
WPVRTGILDNGSAIIGLASRYRYTEQLILPGQRLYITGHLQTRSPATERS
VRDIARDLLSDWKQDRRQLLERFDTNRDGEIDLAEWEIARETALSQAQTV
HRQLLHETEIHHVSTLKDGRYPFIISVRPQAELIRKYRRNALIALTGCFS
VAGCIIWLLHVHG
>NE2551 conserved hypothetical protein
MARFNKKLRLRSRWQSGIFTALFLILVGTLGYLAQEFRIQWDISQNNRNS
LSQASIDLLQTLDDPLHVTVYATQQDLQLGDIRGIISNFISVYQRIKPDL
TLNFIDPVEQPQQAQSAGVRLNGEIVISYKQRKERLTSINEQAFSNALVR
LARTEKKQLLILSGHGERKMDGITQRDLGNFGKKLQETGFESTLFSLTDP
PEIPDTGVLVIASPQIELLEGEVKKILDYLARGGNLLWLVDIGSLNGLLP
LAEKLGIVFTPGVIVDPQANRLRVPTTFALATRYGAHAVTENFDYITVFP
FARQLILEEGTDWHRTILAEAAPEGWVETGSTDSSEGEITFDEENDVSGP
ISVAVALSRVVEDREQRIIVAGSGHFLANGYLGNGGNLDLGMNMINWLAG
DEAFIPIQPRATLDSQLALSELQLTLIVTGFLIILPLVFLIGGISINWYR
KRR
>NE1524 conserved hypothetical protein
MYKLRFFADFCLELTASLTFFRGSLASISCFTADYRYGDMTMATVRKTIS
LTNQQDAWITAQVEAGRFTNDSELIRDLIRREQERMAEIDNIRAALIDGE
QSGEPQPFDFDQFKRHKLAQHKPG
>NE2350 CAIB/BAIF family
MIKSLKDCTVSGPFRTPPSPFAAVGFSLNYQSGRLGLKVVEADDYADQGT
FSFVFRAPQARLVNCVITGWSDCDQPGAITESMMQAACGLMSVHGRSTGA
IQPFGLQYVSTVTTALALQGGMAVALGNLRGLAVSDSHISMAAAALLSTS
QYIADATSTSSSENLSSADADHSPAPPFISADGIVFELETLNPDPWLRFW
TQLDISRTLAGKGWTAFLLRYAKAIAPIPGELTRAVSKLGYADISRLCAD
AGMAICRVRTLDERMDDEHFKSNWLKGPWVFDCAVPQPELPLTSASGLLP
LSGLTVIESCRRIQGPLAGHLLALLGARVIRIEPPGGDPLRGMPPVSEGC
SVRFDALNRLKVIREIDIKSPCGQEEITEMARHADVFLHNWAPGKALELD
LDYADLSRVNPALIYAHASGWFTDGGEISLRSPSLPGTDFMVQAYSGVAQ
KISRTCGTQGGTLFTALDVLGGVIAAQGITAALLNRQLNCAGAKVTSSLL
GAAILLCSDDFQNRHDFFDCVPPAQSVLNRVFETGKGKIAIECLDDDTLL
RLMQTLDLSAGDKSELWQRLGRSFLGKTAAEWLVVLQQAAVPAAIVIEDL
NDLHGDPRLKPYLEIGSYTRVTSPWRFQ
>NE0239 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1561 hypothetical protein
MRLLIVFLLLLPLPSLALPQCGSEAILQAQKLLSFHVDGDDRAHVDPKAI
ALPSIRNPANRKQKFLVLEVDGTVYKSKYRMRLIYYPLGSECVLMGQEIL
ELASL
>NE1106 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR
>NE0702 conserved hypothetical protein
MAIRQYEVLFTRGAEQDLELIYDYIVESDCKANADSVLDRLLEVVENLAT
FPSRGTWPKELVAVGIREYRQAIFKPYRVIYRVIEQKVYIYLIADGRRDM
QSLLMHRLLGK
>NE2295 prolyl aminopeptidase
MPLHNHNLCFLYNKPVQHTMTHTGLFPPVEPHDHGMLPLDDTHTMYWEQS
GNQNGIPVLFLHGGPGAGATPAHRRFFDPARYRIVIFDQRGAGRSLPLGE
TRDNTTPLLIEDIETLRQHLGIERWLIFGGSWGSTLALAYGEAHPDRCLG
FILRGIFLCRPGEINWFLYGLRNFFPEVWREFVARLSPIEQCDILSSYYR
LLMDPDPAVHMPAAKAWGRYEGSCSTLLPNPDTVDYFTSDTVALGLAKIE
AHYFRNNIFLPENSLLENVHKIHHLPGVIVQGRYDAVCPIVSAHDLHLAW
PQADYIVVNDAGHSAWEPGILIELVKATEKLKLIL
>NE2015 possible capK protein, putative
MARHALMRTELLPTFGMPDKKVDGFIARIRERHPKMIFGYPSAMSHIVPC
AEQRGVRFDDLGIKVVFCTSERLYNHQRETIVRLFGCPVANGYGGRDAGF
IAHECPNGNMHVITEDIVVEIIGEAGRVLPHRQSGKIVVTHLATRDYPFI
RYRTGDIASLSEETCSCGRGLPLLTDIQGRNTDFVVAAEGTVLHGLALIY
VVHDLNEVRTFKIVRKNREHTRILLAPEAGGDMTGIDTIIIDGFRCRLGA
KVTVKFVDAVLAEKSGKFCYVASHVVLH
>NE0127 putative transmembrane sensor
MHLDNEADHHSENKAEAPGIQPAVTRNVADISPQIAQCAVEWLVELQAAD
HDEATHEAFQRWLAAHPDHKLAWQHIETVNTRFNGLTSPLGSAVAKAVLT
PRRSLKRRQIIKTLVVILFAGSTGWWADEKIPWRAWTADERTHIGQRRNI
TLADGSRVVLNTDTAINIRFTANERRLQLVRGEILVTTNQDSASIVRPFI
VEIAQSELRPLGTRFTIHQQTTSNRLSVFEGAVEIRLRQDINYRHIVQAG
EQVDFSNQGIHEIHPADDTSTAWSNGMIIASSMRLADFLAELDRHRPGKL
NCDPAVADLRVSGTYPLTDPEDILDALQTTLPVKIQYFTRYWVTVRPAS
>NE2089 ice nucleation protein homolog
MEKIIMSIISSLTPTQISALTTTQIKDLTTGDMKALSDKQINAMTSTQIA
AIETQDLVILSGKQIGAFNPNQFTYGLTTTQIQAINKAQATGLTAAQLKD
MNSGDLSSLSADALGALSAKQIQGLNSTNIESLTTAQAAVLSYQQIAAIS
IDAVKGFETEDLAQISTAAIKGLTAAQLGALNSEQFSSLDSSQVQALSAK
QIQALGTDVIKNLTTSDMKEFSATQVAALSSAQLKELSSGQLSALSTDAL
GALSAKQIQGLDATNLSTDTISALTAKQIAALTTTQLSGMNSSQIGAIST
SGIASLTAAQIKGLSSANVEALTSDQAKILSAKQLAGLGTDAVKGFETAD
LAKIDAAAIKGLTAAQLGALGSAQFNSLDSSQVQALSAKQIQALGTDVIK
NLTTSDMKEFSATQVAALSSAQLETLTSDKLDALSTDALGALSAKQIQGL
DAANLSSDTISALSAKQVAALTTAQLNGMDSNQIGAISTSGIASLTAAQI
KGLSSANVEALTSDQAKILSAKQLAGLGTDAVKGFETGDLAQISTAAIKG
LTAAQLGALGSAQFNSLDSSQVQALSAKQIQSLGTDVIKNLTTSDMKEFS
ATQVATLTTAQLNAMDSDQIGAISTSGIASLTAAQIKGLSSASVEALTSD
QAAALSAKQLAGLSTDAVKGFETGDLAQISTAAIKGLTIGQIKELSTDQV
TALTSDQVQALSAKQIQAFTTDQIQHLNFGS
>NE1224 hypothetical protein
MQVTDKGQVTIPKRLRDAAGFLPGSQVTFSLEGGKIIISKTGMGTDDRRK
SLRAAAAKVRKSLDEPFKQMNSDDIMAFLRPDGDDCA
>NE0905 conserved hypothetical protein
MPSTLSEELVGKYLAAQYQVWIDTSVVTLQIGCQSAPLAALLQATGNRSA
VYVTACNPASEVATSQENQSAMARLYERLACYSNHIYRGTGIDPSGEWPA
EESLLALGIDLSIAKKIGDEFGQNAIVWIDSAAIPHLVLLC
>NE0329 MoaA / nifB / pqqE family
MSVPFLQQYRVGSYILKQKLAGNKRYPLVLMLEPLFQCNLACAGCGKIDY
PEETLRRRLSVDECLHAVDECGAPVVSIAGGEPLIHKEMPQIVQGIIQRK
KFVYLCTNALLLDTRMDDYQPSPYLTFSIHLDGNRERHDASVCREGVYDK
VIPVIEQALQRGFRVTVNCTLFQSETAEEVAEFFDTATKLGVEGINVSPG
FSYEHAPRQDVFLQRSVSKRLFRSIFEIGKKRKLPWKFNHSSLYLDFLAG
NQSYKCTPWGNPTRNLFGWQRPCYLLVDEGYATSFRELMEETDWDRYGVG
NNPKCANCMAHCGFEPTAINDTFAHPLKALRVSMRGPRVEGPMALDPLQT
SSESQHNTDKRKPFPIPVTVEHKTVAPSSDHSSGPDN
>NE2274 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE1698 conserved hypothetical protein
MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP
VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA
ALIEGLTVRASARQCRIDKNTSFRWRHRFLTFPAAAKANHLEGIVEADET
FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML
DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV
RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL
SSPIVPLQAALGRENQFQLLTNT
>NE1496 conserved hypothetical protein
MTEIERKFLVATFPDGELHAVPLRQGYLTTPTDSIELRLRQQGTEYFMTL
KSEGGLSRQEYEIQIDVTQFEMLWPATEGRRVEKTRYSGKLPDGQLFELD
VFAGHLSPLMLVEVEFLSEDAAQAFIPPPWFGEEVTEDKRYKNKALALSI
P
>NE1902 Protein of unknown function DUF67
MELVLSLGIGILVGSGVWLLLRPRTYQVIIGLGLVSYAVNLFIFSMGRLV
TDKPPVTQAGAQIDPTTFADPIPQALVLTAIVIGFATTALFLVVLLASRG
LTGTDHVDGKEPDR
>NE2144 DUF209
MKKILGIYDAPPLHWVGDGFPVRSLFSYSNHGKLLSPFLLLDYAGPVDFA
PAERPRGVGQHPHRGFETVTIVYHGEVAHRDSTGQGGVIGPGDVQWMTAG
AGILHEEFHSESFTRSGGQLEMVQLWVNLPAKDKMIAPHYQAILSADIPV
VALPDDAGSIRVIAGCYQDHTGPARTYTPMNVWDVRLKRGKVTELPLPEG
WNTALAVLHGKISVNGSPLVQAAQLVSLDRAGDTVSLDVREDATVLLLSG
EPIDEPVVGYGPFVMNSQTEIDQAIADFNSGHFGQLSR
>NE1260 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE2498 hypothetical protein
MPKMDVKGIAVTVYSENAMDFISLTDMLRAKDGDFFISDWLRNRNTVEFL
GIWEQVHNPNFNYGEFATIRSQAGLNSYKISVKEWVARTHAIGLVAKAGR
YGGTYAHKDIAFEFGMWISAEFKIYLIKEFQRLKEAEQQQLGWDIRRNLT
KINYRIHTDAIQTNLIPPALTQSQISLIYASEADLLNMALFGKTAKQWRE
ENPNNKGNIRDEANVSQLVCLANLETLNAHFIHQGLPQVERLKILNQTAI
HQMKLLLADRSLKQLDGN
>NE0611 PfkB family of carbohydrate kinases
MHTLICGSIAYDTIMVFEDRFKRHILPDKIHVLNVAFLVPEMRREFGGCA
ANIAYNLKMLEGKPLIMATVGDDFQPYTYRLEKLGLAQTHIRRIQDTFTA
QAFITTDLDDNQITAFHPGAMNFSHQNSVKDAIDVSLGIIAPDGRDGMLT
HAREFHEAGIPFIFDPGQGMPMFSGNELTDFIDMADYIAVNDYEAQLLQS
VTGYKLSELATRVKALIETKGAEGSVIHAQGKQFQIPAVRPQKVVDPTGC
GDAYRAGLLYGIAQDMDWQTTGQLASLMGTLKIAHRGGQNHQYQRDEIGQ
RYFEAFGCRIL
>NE2304 Isochorismatase hydrolase family
MSKPFKYSRLSKDDAALLLVDHQAGLISLVQDFGPSEFKNNVLAVGACGK
YFKLPTILTTSFEEGPNGPLVPELKEMFPNAPYIARPGNINAWDNEDFVN
AVKNTGRKQLIIAGVVTEVCVAFPALSALEQGYEVFVITDASGTFNEVTR
HTAWLRMQAAGVQLINWFAMACELHRDWRNDIEGLGELFSNHIPNYRNLM
TSYFTITGKK
>NE1684 hypothetical protein
MGNLFSIGFLKNLVVAGYVVKGGTKTVRVRGYPIRLTVDHWRVLRRIRTY
SIKEPDTLDWLDNIEPGSCYFDIGANIGQYSLYPAIKLGHDIRIFAFEPQ
SNNYYALNKNIYLNDLKDLITAYCVAIGGTNGFDKLYVPKFIPGGTVRNS
DRNP
>NE2259 ThiJ/PfpI family
MSKKILVVLTSVEKYPEMDRATGLWLGEAVHFVRKVEAAGYEVDYVSPQG
GYTPIDPHSLAMAEPIDWEWYQKKEFMNRLGKTMKASEVNPDDYIAIYYA
GGHGVIWDFPDNEELQSISRKIYENGGIVSSVCHGAAGLLNIKLSNGSLL
VKGKELTGFSNEEEKLAELDKFVPFLTETELLARGAIYKKADEPWVSFAV
EDNRLITGQNPASGGAVADLLIKALKNELRHLLR
>NE1046 Succinate dehydrogenase, cytochrome b subunit
MEARYPAKRRPKYLNLLKIRQPLPAIVSILHRISGVLLFLPGIPLFLYGL
QMLLQSPETFASLQSNLAHPLCKLFLLLATWFFIHHLLAGIRHLMLDLHY
GMQLEQARLSSKLVLVFGAILTALTGIWLW
>NE1539 hypothetical protein
MLFWIKTIQTAAWFYLLMFALLAGSAHAAELKPADQTGFLIVAADRGFVG
NEEIRDAFASFSANHPAALVFVTDERTRQTLQSGLASLHQQNIGRIVVLP
LFISAAEPRYQLIRTLVTEENQTIPVTFTKPYGESYFAVEALATRLRGMQ
HTAQQHLLVVGYGAQNDTHRRAMYDDWMRIVKQASQGVSFRSINSLILLE
AQKDEEPESYGNKTKQQLATALSSLGTATKNNKNQVIAFALGPKYDSMMS
LESRLERLLPENAALNHFEIEPQHLAMWMEREASRNLPLAEEDTGVILFA
HGSDFHWNENLRVAVEPLMKRYKIEFAFSMADPYTIERALHKLEQRGAKA
AIVVSAFASRSSYRNEIGYLAGLDIENQDDHIHDNNSGHGSHGGHGGHAK
SSTPVPRILTSLPVIWTGGYEDNPLFASALFDRVLALSKDPARETVILTA
HGTQDDRKNDEWLEKLNSIASQMHDQGGQKFKAFKVATWREDWPDKRAPW
VKKVRAMVTEASKQGDRVIVIPTRTTSVGPEKRFLAGLEFELGEGFAPHP
LFTQWVDEQIRQGINLHKEALGR
>NE0564 Multicopper oxidase type 1
MRSNTFIQFAATLVAFVAACLFPAWSMAATHNVVLSAETLPNGQPAYKLL
SHTSSNGTVPGYATEATIPGPTLFIKTGDKINVQLTNNTQSPVSFTAPGL
STGNSALAAPGQTKSYRLNARKAGTFAYHDEKAPMLGLFGAIVVDESNGH
VQSYVDGDGTIVSTKRSQLSKEFVLFMVGSTFWGAEISRNGNQQPLWANP
DLGAYQDDLLRFHVLAVGPGHTFHLHAHRWIEPGSSQKTPHIIDTHILDD
LNNSHVFTIKAGTGVGPGVWQLHCHLISHMQSGMNSRIHVVARDSGQSAD
SIVGASPSGAIFLNDSDQPGLVTFIVSDEPAGWFRSARGDALSPVTHTKS
LEIIPPGSSVHFVMSDTAGVHTITSLLWPSEAGHNDHGGDHFMIPFDETK
AYRGGAILKLNVPGLYVFTCKIHPFMFAAVIVDDPATDGLDLGNTIDLIS
GVKNLPTSSDLATRLLHTFFVTTVPKNWQDFTSPNPWKIAYPDVDVVVDI
GKVSLPAVLNARYGNDISLSPLPRPAVPGIGEVWVDTQFEKTAGKTKPGT
ITVVDTSNWTLKRKIALPQINMNNPHNMWPNRDHSIVYTTQWFDNKLTAI
KQKTGKLASNVQVGDSPSHVMTLPNTDDITVAINGENDIVIIPAGTTKVN
YALLTQSHGQAPGNPHGHWISADGKRIITPNINTHDAGFYDVEDGKIVAR
TATGEGQPGAHPIAIGMLPDSSKVYAANLLHHSVSVMDGDGNFIKNINLI
EHYDPISGAINGPIGILPIQSPVSPDGKVMVTASYGGQIVVIDTRTDSIV
KSLPCDPGCHGANFGAKKGGGYYAYITNKFSNRLIVVDPDPNGDGDLSDA
EIAGAVTLVADSRVPKDDKISSLAGFGGQGIVALPNVYNGWVQNLNSQWS
AGLTDQQRNPVQ
>NE2038 Myeloperoxidase, thyroid peroxidase, cyclooxygenase catalytic domain
MTWHGSNKSGGYNPPKSISYDQGKFGRMFPSLPPFAQDTRQIRDALKELG
RKGGIMDAKEDTDIAVNPNLARDLIIDPALSLINPNNPNLVAGMTFLGQF
LDHDITFDPVSNLERQSDPESIRNFRRPLFELDSMYGSGPSASPYLYDQS
ADGEGIKFYVEEISGAAAVSAGGFVRYDLPRNSQGTALLGDPRNDENLMV
SQLHLAMLRFHNAVVDYVKAQSSLTDPDEVFTEAQRLVRWHYQWIIIHEY
LVRTVGKPLVDNILINGRKFYKWHNQPFIPIEFSAAAYRFGHSQVRPSYR
SNFGPIPSDINSQIFRLIFNDNLADEPDPDDLRGGKRAPSRFIDWQTFFD
FGDGKVRPSKKIDTKLSTTLFDLPAVRGDIQSLAQLNLLRGLTFSLPSGQ
SVAKAMNLPILNTTDLADLVDFKLHQRTPLWFYILREAEVKENGERLGPV
GGRIIAEVFLGLLQGDSMSYLRQDPRWIPTLPSTVEGTFRMADLLRFAGV
VAPL
>NE0272 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFTWSRQMPEPR
>NE0827 ATPase component ABC-type (unclassified) transport system
MIQLSGITRIFHMGDQAIHALDHIDLTIESGEYVSIMGPSGSGKSTLLNV
IGLLDRPDSGHYLLDGRNVTDLSETEQAQVRREKIGFVFQSFHLIPRLTA
AENIELPLILTGMPPADRKIRIMETLQAFNLSDRAHHRPAELSGGQRQRV
AIARATILRPTALLADEPTGNLDHRIGSEVAALLEALNQTGTTLIVVTHD
RELGSRARRRIAMRDGQIDVDER
>NE2239 hypothetical protein
MKQTNLKKQLVAVAIGGVFALGVTAQATAAGIFQYDLDGQGGSGETVIAD
AIQGVANESLSLLADGKTLDGQGWVKFNTFLLSTVDQDYKYSEVLLYATF
KITTELVDGTIGASGSEYKVTSFTFDLYKDLGNDNTFTVADASSSTHASV
TAVGVDDYIASGELIVGSANIQAASGAAINVETTFNLQPGGGEYFFDPDP
FYNILKAGFNSTGGNWAFTNNMLAVGSATGVIDFNSTPTEVPEPATLALL
GIGLLGFGARRALVASKNA
>NE0934 Integrase, catalytic core
MNRTLKEATVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYI
IKCWQNEPERFIINPYHHKVGLNSYSVFRCLDTVRLSAAHVGFFTEMREI
SC
>NE0976 putative nitrate transport system ATP-binding protein
MLDIRGVRKSFRKPDGSELLVLDGIDFTLGEGEIIGLLGRSGSGKSTLLR
SIAGLTPPSAGSVIYHGNEVTAPPPGIAMVFQGFALFPWLTVLENVQLGL
EALGLSTGELHRRAIEAIDLIGLDGFESAYPRELSGGMRQRVGFARALVV
HPNILLMDEPFSALDVLTAETLRTDFLDLWSEGQLPIKGVILVTHNIEEA
VLMCDRILIFGSNPGRILNEIKVELPHPRARLDPAFRDLVERIYVEMTTR
SASVPAGRRTEYFPGTGIGTILPHVSSNSLSGLLEAVAAAPYNGEADLPV
IASSLHMEIDELFPVAETLQMLRFAEVAGGDIRLTTEGRQFADFDTDRRK
HLFARHLLTYVALAAHIRRVLDERPVHQAPWSRFADELEDYMPPAAAVQT
LRTIIALGRYAEIFSYDDEKQVFSLDNP
>NE0259 hypothetical protein
MVKPTDAELRTSGGLTSVFLNCDTCLSDEDFNRLRRMEFTQNEHAILYGQ
LGGSIAGMIELGPVASRTQSRQDEERKTEERRTAQFVQLVEQMRASIEQM
EADVKRLVASFEKRDGDAWREKLALNILEADEIPQQEADESITAYRKRLE
QHLINEMLNPDGTIKDKYKNDPKYGDYAEWAQTQFHLNSAKAAVAELDNS
DTSPQRKEHILDEMKQRGYIEEMVFTDRISGNLDAQKSVRDIRDSQHDEA
LSQVRPPEATLKFLS
>NE0523 conserved hypothetical protein
MRSQMKISTILATHDKTDLVILLLRITTGGVFMAHGAQKLFSWFGVNGLE
ATGQWMNSIGPNPGYLMALLAGSGEFFDGLALLSPESESGGISQWDS
>NE1974 Sensory transduction histidine kinases
MLYLLASAGANTEFFERHYRWLIVSITIFLLLLIGVVGFLLGRLRRRLKT
GEFGSKLALRLLMVFSLMAILPGVLIYSISVQFLEKSIESWFDVKVDRAL
EGGLTLGQTVLDNLLEELQKKAQVAALDLAEPASFPLTVLNQLLIQSQIQ
EATLFNQEGKVIAFTSLDDAVLFPEIPNTEAMRHIRMQKNYSAVETLADN
TLYLRVLVPVNVLSAEEDIRVLQVLQRVPPSIARNAEIVQAGFSDYQELM
LSRQGLTRLYSATLTLALLLALFSALACAFLISEKLSAPLGLLAEGTRAV
AQGDFSRRHQVHSTDEFGILTESFNLMTEQLAAARTIAQQHQQEVENARA
YLENILANLSSGVIVFDKALRIRAVNQSAEQILQIPLISFEGLTMEECAE
QESGLRLLAAEIRGGFDSEEAGEWQRQVLHFTADKEQVLLLRGSRLPQAS
GGGGVVVFDDITSLLQVQRTVAWGEVARRLAHEIKNPLTPIQLSAERLQH
KLVSKLDEPDAKILKRSTETIVSQVEALKRMVNEFREYARVPELELCQVD
VNRLVREVLALYQMSDNTENESPQPPITLELAGEISPVRGDPARLRQVIH
NLLQNAQDALTGMEDARITVQTRSVSNGIELSVIDNGKGFPEQVRAHAFE
PYVTTKPRGTGLGLPIVKKIVDEHSGTIKIQNIQPHGAQISITLPALPCS
TPQPSTKTA
>NE1200 conserved hypothetical protein
MTWILTKYFITAAVVVIVSEFAKRSDKLGALVAALPMVTILTLIWLHVEN
QPETKIANHAWYTFWYVVPTLPMFLIFPFLLQHFGFWLALTLSAFITIAC
FGVFALLVRRYGINLM
>NE1454 ABC transporter, ATPase subunit
MNNQPIIQASGLTREVDTGGTRLTILQDINLEICAGESIAVVGASGSGKS
TLLGLLAGLDVPTRGTVFLDGEDIFKLDEDERAGLRGRLLGFMFQSFQLL
PSLTALENVMLPLELSGAGNAREVAHSWLERVGLEKRIRHYPRQLSGGEQ
QRVAIARAFVTRPKMLFADEPTGNLDAATGAQIIDLMFAINREQGTTLIL
VTHDENLSARCSRVIKLVAGRAVD
>NE0551 putative yacA [Plasmid ColIb-P9]
MSESTFTFRVDEDLKTEFSAAAKDCDRSGAQLLRDYMREFVKTRREVAEH
DAWFRKQVQIGLDSANTGNLVPGDEVEAEFAARRAATRRRLKASE
>NE0241 hypothetical protein
MGKKKNKKTEVQQPDPMRKNWIMENMDSGVIYLLESWLKAKSQETGKEIS
DIFANAVEFNIVLKDWGKEKLEETNTEYQNQQRKLRKTYIEYYDREMK
>NE1507 conserved hypothetical protein
MDIWFAVTRLPVTVVRWIDVNLLALGREMRFSYLPPLMVYLAAGISSLTG
IVGTFYVKERLGLSAEFLAALGFWMALPWALKMPLGHLVDLMWRWKSLLV
YLGAGLIATSLLIMIGLLGHDEAMREFAPLAQWYVLASLLAPLGYVLQDV
VADAMTVEAVPRVEPDGRLIGLEERKQMHITMQTLGRVAIIGGGVLVAVA
NVVLMKDVASFSDADKTAAYLTVYQLALVIPFVSVCGIWLAGWLKWRELK
RHAARGLDRNEIDILLGVHNDAPPANWWILGGGLAFTVVSLGVGLVQVSY
AEEIIFCLSMAIVLFLIRQLTAELENDARNVLVGTALLVFVFRAMPAPGA
GSTWWMIDELGFDQQFMATLSLIGSVLTLAGMFIFRRFMAERSMAYVIGT
LTIAGTVLSLPVISMYFGLHEWTSAHTGGVVDARFIAVIDTALESPLGQI
SMIPMLAWIANSAPEKLKATFFAVMASFTNLALSASQLATKYINQVFTVT
REVKDTKTGEILVSADYSELGTLLITVSVLGFVLPMLIVIMIRYTRLRSA
>NE1835 Cation transport protein
MGHVLSHFLAVANILGRMVMMFGLVLLIPCGVAYWTLDGSLSVFLDALSV
TLGCGATIWILTYRFKRELQIRDGFLLVVLVWLSLPLFGMLPLVWYLPEL
GIAKAYFEAASGLTSTGATILTRLDELPYAINLWRCLMAWLGGMGLIVLA
VAILPMLGVGGRQLLSAEIPGPIKESRLTPRIAETAKRLWLIYVTLTLTC
MIAYRLAGMTPFDAIAHALSTLGLGGFSTHDSSYGYWNSPLIEAIAILFM
LIAGINFSSHFVAWRAKSFSPYRADPEAKLYILITLASCIGVAGFLWIKG
IYTDPLVALRYAAFNVVSTATTAGYSNTDYALWPIFAPLWMLFLSGFCTS
SGSTGGGIKMIRARILFQQFFREMIIIMHPRAVSLIKIGRSIITNEIIFA
ILGFFFVYLISIVLMTLALTLSGLDEITAFSAAVACLNNLGAGLGAVGPA
MTYASLNDFQIWLCSFGMFLGRLEFYTLMIVFTPAFWRR
>NE2065 putative rare lipoprotein A
MKINFDFCFAITHKNTLFVLLAVMVVLIVSACSNTPRLGQSSTHSHTRAG
GGYYLDDGPEQNPPSNLDSVPDAVPRIEPLKTTTMRPYVALGKTYTPMTA
LKPYKARGHASWYGKRYHGKRTASGEIYDMYSMTAAHPTLPIPSYVRVTN
LQNGNSAVVRINDRGPFLENRLIDLSYVAAYKLGVLANGSARVEVESIIP
GRFGSADPPAMTRKQTSGEPKVASSGDNGKFYLQLAAFGSAHNAQSYLMQ
LKSEFPSITQQTHINQANGLHKIFVGPFHDLEAANRTANIITTAGSPEPI
LVRHTY
>NE0580 Domain of unknown function 2
MIRTIERIITSTGFSLQHSDNGEITGLFYHCKLSSAYQPVFEAARNNIIG
QEARIRADLNGNGEIQLSPWHIFALAPDDEQLIKLDRLCRTIHALNFLDN
ITNKQMKLFVSVQPRLLESVKDDHGSAFEQILDIIGIRTSRIVIEIPVEA
NRDWRLLKRVIMNYRAHGYLISVNYSGTNNDRISELGKLYPNVIRIDARD
LLRRSVLDPLIVTAHNQGADVLVGNIETSYQLTDAIHAGADLLQGNFLAR
YSRKADNMLTPFSWQIPDERNGYRFDNESLQENQPYRSI
>NE0451 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1444 TPR repeat
MSGSGKTITQMKSTAILAPVIAVILILSACGKPMDAQALVAEAKQYQQQG
NDKAAIIQLKNALQQSPNDPEIRYLLGTLYNREGDIQSAEKELNKALDLG
MDPVKVLPGLSRAWLGMGKFQQVLDETGKLSDKGNFAELLALRGNASLAL
GKFEEAKVLFEQALQDKPGFSDALTGLARYSLARNDIESAMNFSEEAVKL
NPENSDAVLFRGDLLRAQNKIDEALADYDKAIKLNPESEAAYINRATISI
STKKFEAAQADLDAVRKIAPGSLLAAYTQALLDFSQGKHAVALETLQRIL
SSAPGHLPSVLLAGATQFALGSFPQAGQYVEQYLKAIPNNLYAIKLMASI
QLKNNQVKQAITTLTPALKSVQQDPQLFALAGEAYMRSKDFTKASEYFEK
AGELAPDNASLYTALAMSKMGQGDSKSAIADLEQAAQLDDQSGRAGVMLV
LTHLQLREFDKALKAVESQLAEQPDNPLLHNLKGGIYLGKKDLAKARSSF
NQALSIQSDYFPAISNLARIDMQENHPEAAQQRFEDVLKRDKKNVQAMNA
LAGIALARGNKEEATGWLEKASRENPDELQPALQLGAHYLAVNDPGKSLA
LAKKLQGIHPDNLSIVELLARSYLATGDKDAALENFQKLAARLPDSAPAQ
LQLAQIYSSMQNNKAAAGSLKKALTIKPDLWEAKLMQAQLAVAADRVEDA
FNISHDLQKQHEKLPVGFELEGDLQMRQKNAAAAATAYEKALSRQKNSQL
LIKLHTALSQSGKEKQADQRLNQWLKENPTDAVTRTYLAGVYLASKKYDP
AIKEYQTILKQHPDHAATLNNLAWVYQQKKDPVALEYAEKAYKQAPDSPA
ILDTLGWILIEKGDAERGTSLLQKAVTAVPEAAEIRYHYAVGLFKSGNKA
EARQELEKLLGGDKPFPQRDDAKKLLESL
>NE2536 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNI
>NE1179 putative transmembrane protein
MRRIYFLVPEIVTTRKIVDDLLLAKIEERHIHVIAKRGTPLEDLPEANLL
QKTDFVPAVQQGIALGGATGLLAGLVAVALPPASAVIAGGILLATTLAGA
GVGSWVSGMIGMTIGNRRIKEFEEEIEAGKLLVLADVPVNRVDEIEDRVK
QHLPQIEVMRTEPKVPAFP
>NE0139 Generic methyl-transferase
MFDSVLQQSMQQWLETSLGQYVLEQEQRYFDRVVTDIFGYNATQIGFSGF
DFLRNNRMPFKFAFGVRDGASVYAHPHFLPIKSSSIDLVLLPHTLEFNSN
PHQILREAHRVLIPEGKVIISGFNPFSLWGMRQRMAKSKTDFPWCGRFIA
LPRMQDWLELHNFEIVAGQFGCYVPPCTREKWLSRLRFMEAAGDRWWPIA
GGVYFLQAVKHECGLHIITPRWEDSPAKRGTVVVPQIERGHRMNNLEVST
VPWREKAGGGLIENRKYGVQYSVNNRDV
>NE1307 conserved hypothetical protein
MNSHRGQMMSKDATALLHVTCVFRHNVACNASGGLHMGTTHVNARVKKHR
DTLRMAGLRPVQIWVPDTRRPDFAEECRRQCLLIAQADKADTSMQQFMDE
ALADSDGWTE
>NE1486 Aminotransferases class-IV
MIYLNGKFLPMEQATVPVLDRGFIFGDGVYEVIPVYSRKPFRLGEHLSRL
QHSLDGIRLQNPHTEEQWAGLIERIIELNEGDDQYLYLHITRGVAKRDHA
FPREVTPTVFIMSNPLPAPPAKLLVSGVSAITARDNRWGRCDIKAISLLP
NILLRQLAVDAQAMETILLRDGLLTEGAASNIFIVKDDLLLTPPKDHRIL
PGITYDVVLELAETHGVPHATREISELELRTAREIMLTSSTKEILPITQL
DGQPIGNGTPGPVFQQLDRLYQAYKLEVMRGHAPRQ
>NE0261 putative plasmid stabilization element ParE
MGSFILRQKAMDDLLSIGRYTRKEWGKTQQIRYLTQLDRAFHELADKPGL
GRACDDIREGYFKYGVGKHVIFYRHTGKDQIEIIRILHGRMDIEQHL
>NE2096 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE1348 Integrase, catalytic core
MGRIYQQTFVDTYSKWAAAKLYTNKTPITSADMLNDRVLPFFAEQSMGII
RILTDRGTEYCGKPENHDYQLYLALNDIEHSKTKANHPQTNGICERFHKT
ILQEFYQVTFRRKIYQSIEELQHDLDDWMAYYNSVRTHQGKMCCGRTPMQ
TLIDAKEIWDDKITELNN
>NE0814 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1180 hypothetical protein
MINTITILTILCLSHNQPKNFVMILFFNNLATRRCVPADFLSGIHDHSYQ
IKHSFTLNIGITGQVGAILQQQCLDAIRIAN
>NE0851 Transglutaminase-like superfamily
MKRRSFFKLVGSSAAAAALFPAVARASLPPNQKWRSYQLTYQVTLPASGK
VAKLWLPLPDASDTVYQFTQGNNWSGKATASAFYMLPGTQSQVFFALWQK
GEERTVRVSSIVKTADRTVDLHSYRAPAKISIPQNIQRYLQSTKHIPLDD
VVKKMARSVTSPANAHTPLQQARAIYDWVINNVAYDINGRGQGSGDLRSL
LSRKENLSGKCGDIHALFVGLARASGIPARIQYGIRIGDSALNKNLGKSG
DISTSQHCQAEFYLSGLGWVPVNPSDVARVMAAESLSHNHDSISQLREKL
FGSWEMNWVAFNERENVDLGTTKVSAAISKLPFFGYPYAEIDGRLQDSLE
PESFSYKITSAVLVGTGAKL
>NE2523 hypothetical protein
MVSLDEIKAEDWTLNISRYVLPPLQEDIPPLPDAIAAFKDALTRCREAEE
RLAQVMTEGGWLDTVRPEPVEGREQA
>NE1222 Transposase IS4 family
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQARYLGKDIRCFDRRPGQSIYYDRQHDRAR
SSAGRLRKRGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDC
TQALDLISGFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRT
YDREIYKCRNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE2282 conserved hypothetical protein
MGESGTKSRFPIRVQLEANKQYYWCRCGLSQSQPFCDGAHRGTGMNPVVF
VVRETSPVWLCVCKETKNIPFCDCFRTD
>NE0330 Glycosyl transferase, family 2
MAVLVMYLSLAGMLCWWAILLAPWRAWSTREQLEPSSSPEAKQSFDDVTV
LIPARNEAQFIDQTLNRLQAQGEGLRIIVIDDQSEDHTASLAGRRGVEVI
SGTTPPKNWSGKLWALNQGLQQVRTRYTLLLDADISLSSGILGALLNKAR
QENLSLVSLMAELPVQRFSERLLVPAFIFFFKLLYPFKLSNESKSRMAAA
AGGCILVETGVLRSIDAFTSIHHALIDDCALAAQVKQAGFKTWIGLSHAV
QSHRGYDSLAAIWNTVARTAFTQLHYSSLLLLLCTLIMSSMFWCAPLSAI
IYTDTIFVVSSILAWLAMLIVYTPVLRFYRSSPFWVIVLPLIGTLYLMMT
WTSAFRYWRGVRAHWKNRRYETSEHN
>NE0970 Insulinase family (Peptidase family M16)
MRFLQFLIMFWVGLYAQWALAFLPIQHWQTANGAQVYFVENHDLPILDLS
IEFPAGSSTDTAETSGRAGLVQRLMSMGAGDLSEDRIAETLADVGARLGG
TFDLDRAGLSLRTLSHQQERVRALDVLAQIVQRPEFLEKILERERARIIA
ALKEADTKPEVIADRTLMKLLYGKHPYGLRESGEPDALAALRRQDLVDFY
RAHYTAGNAIIAMIGDIKRDEAARIAEMLTRNLPTGKTYKTLPPVEKPVP
IIQKIAHPATQSHIQIAYPGLSRKDPDYFPLLVGNYILGGGGFVSRLMNE
IRETRGLAYSVYSTFAPYQEKGPFEIGLQTKKEQAEQALQLTQKTLRDFV
EQGPTEEELQAARQNIVGGFPLRIDSNQKILGYLGVIGFYDLPLTYLEDY
VKAVEKVTVAQIRDAFKRRIDPAGMVTVVVGAAD
>NE1809 probable beta subunit of citrate lyase
MSHTLYETKTPRVQRCELAVPGSRPEMFEKALKSGVDFIFLDLEDAVAPD
DKIQARKNIIQAINDLDWKSHGVTLSVRINGLDTQYMVRDVVDLVEQAGH
KIDTLLIPKVGVYADVYMVEAMLSQLEMQQGLKNRIGVEALIETALGMAN
VEDIARRGTAGRLEALHFGVADYAASNRARTTNIGGLNPDYPGDQWHAAI
SRMTVACRAFGLRPIDGPFGDIQDPEGYKQAARRAAALGCEGKWAIHPTQ
IALANEVFTPPTAEVDKAKRILTALKEAAAQGKGAASLDGRLIDAASERM
ANNIVKMAEAIAAKSK
>NE0761 putative translation initiation factor protein
MLMTQMTVEQFAHDLGMLPGLLLEQLQAAGVDKRSAADFVTEQDKTRLLD
YLRKSHGSSGPRTRITLARKQTTEIKQSDSTGRPRTIEVKVKKTRVLTRQ
NEPEVIEKSVSEEPAPLKMVETPEPTIVKSVVDAEQMALRAEEARKRSEL
IARQAAELKEKQEKRRQQAAAQANVKKEPAPAEQESGPATAVTPGSVTEI
SSKLPETGAAATPATSTAPATTSTTAATKGHAPQKPVVKPEEKGEKKKKP
TKQDAWKDEPVKRREPKARGDLSGGQEWRMRKDKHGKYKSDELQSQHAFS
VPTEPVIHEVLIPETISVGALAQKMAVKAAEVIKVLMKMGSMVTINQMLD
QETAMVVVEEMGHIAKIAASDNPESFLEEVDVSSDEARMEPRAPVVTVMG
HVDHGKTSLLDYIRRTRVAGGEAGGITQHIGAYHVETSRGVITFLDTPGH
EAFTAMRARGAKITDIVILVVAADDGVMPQTIEAIHHAKAANIPIVVAVN
KMDKPEANFDRIKQELVNHGVVPEDWGGDAMFIGVSAKTGLGIDELLEAV
LLQAEVLELKAVREAPAKGVVIESRLDKGRGPVATVLVQSGTLRRGDAVL
TGAVFGKIRAMLNERGKSISEASTSIPVEIQGLSEVAVAGEVFIALDDER
KAREIALFRQGKFRDVRLDKLQVAKMEDVFGQHEDVSTLNLIIKADVQGS
CEALVYALKKLETDEVKINVVHSGVGAIIESDINLALASKAVVIGFNCRA
DLGARKLITSTGVDVRYYNIIYEAVDEVKKALSGMMMPDRKEKILGMVDI
REIYRISKVGVVAGCYVLEGLIKRDALVRLLRDGLVIHSGSLDSLKRFKE
DVREVKSGFECGLSLKNFNDIQQGDQIEVYEIVETARVL
>NE1653 ZIP Zinc transporter
MPVLAWIIFASLASGLLSIGLAALFALNVHSNKWISVLISYAIGALLGAA
FLNALPEALELTESPKQLTLILLLGILIFFILEKLLLWRHCHLSECEAHE
PAIQIKSSDVHDHGRSGMMIVLGDTFHNFVDGILIATAFMADIQLGIVTA
IAITAHEIPQEAGDFIILLNSGFTRARALWLNLLSGAATLFGGLLGYFML
YQLDHMIAPILAIAAASMIYVAMSDLIPSLHKRPEIRATIQQVTLIILGI
STIWLIGLFFGHNH
>NE0524 CBS domain
MTKVRDLMTPMPKTIGFDISVEKALVMMKECACHHLPVLDGGKLVGVLSD
RDLSMAWHGSGNTKDEHLVRDLMTDTPVVIDPSAEINMAIRIMLDNKINS
LIVRAEENQPWGILTSTDLLRYVMNKA
>NE0503 hypothetical protein
MKRTYCTIMLTGALSFGLSTACTASGIHKLVDERGRVIFTNDPAKNTRQI
QSSKSVSVVPSRRNGTSTEPITVAITGSNYPRVSKLQQDQRDSKRRQILS
QELANETRLLEDALKTIDLTQQKTDNYLPGRPYFTSDHFDILQLRNQAAA
HERNIEALKMELNNL
>NE1953 rRNA_methyl_1: RNA methyltransferase, TrmHfamily, group 1
MNPASPLDNVRIVLSHTSHPGNIGATARAMKTMGLSRLYLVNPKSFPDSE
ADARASNARDILESACVCTSLEEAMGDTFLAAALTARSRDLSHPTRNARQ
AAGELIRYAHHHPVALLFGRESAGLTSAEISHCQLTVQIPANSAYSSLNL
AAAVQIMAYELRMVLEEGVVADEPDPQLPEPASLQEIEGLHHHLEQVMVQ
SGFLDPTQPKRLMRRMRRLFARTRLEKEEVNILRGLLTAVNPHKPT
>NE1686 Cytidylyltransferase (CMP-NeuAc synthetase)
MKTIAIIPARMGSSRFPGKPLAQLLGYTMLEHVYQRVAMSKSLSATYIAT
CDETIRQAATAFGAPVIMTSDSHERASDRIAEAVAHTDADLIVMVQGDEP
MVHPAMIDTAVAPFHTDPELECVNLTRRIDDETDFRNPNTIKVVMDQQGN
ALYMTRQPIPTLAPGGFGATAVYKQVCIIPFTRTCLIEYSQLPPTPLEQL
ESIDMLRLMEHGHHVRMVETSHDTQAVDTEADLIRVAGLMATDPLLAEYR
KY
>NE1756 conserved hypothetical protein
MLWIKSLHIISMVTWFAGLFYLPRLFVYHAMCTDQAGIDRFKVMERKLYY
GIMTPGGLLTIIFGTWLWLGYGFSGGWLHTKLALVVLLVAYHLYCGKLLI
DFRNDRNRHGHVYYRWFNELPVLALFAIIILVVVKPY
>NE2381 putative signal peptide protein
MKIRIGNVLLLALLAVVLPVSAQNSSPDALIRNTVDEVVAVLKQDDGIRA
GNQDKVLSLIREKILPHFNFTRMTQLAMGKNWSLAGPEQKKSLVKEFRTL
LVRTYSNSLTTYRDEVIKVSPTNVGSQDTKATVKTLVIQGSGKQPVPIDY
AMEKSDSTWKVYDVTVAGVSLVTNYRGTFNSQVRDGGVDGLITALAEKNA
SMRGK
>NE1690 Haloacid dehalogenase/epoxide hydrolase family
MNIATRPADYWQAIIFDFDGVIVESGDIKAQAFAELYRHHGETIAQAAVT
YHRANGGMSRYLKFHYFQQNLLNYPPLTKEEEQELDRRFSELVMNAVISC
QPVAGAEALLHRMVDQIPLFVASGTPESELRIIVEQRDLSRYFTEVRGSP
RLKETLVADILSAYPLVPERVLMIGDALVDYESAHQNGIAFLGRVRPGDD
NPFPESVEIVPDLCPVAI
>NE0227 DUF186
MNLRQKITEDMKSAMRAGDVKRRDALRLLQAALKQKEVDERIELDDAAVV
AVIEKMLKQRRDSIAQYEAAQRQDLADIEKFEAGVLQAYMPEALSDAELD
AMISEVIASVGENGSPKIGEVMALLKPKLAGRADMAKVSLLVKVKITG
>NE0639 mttA/Hcf106 family
MGSFSIWHWLVVLAIVVLVFGTKKLRNLGSDLGGAVRGFKEGMKGAEEES
TPPPPAQQVTGHSIKSEIEEKDQTKV
>NE1256 Bacterial type II secretion system protein
MTRFTVRAITPEGGSRIFTREADDERQLTVQLIGENLTPIRIIEQDGSLN
DLLSRPVTLNRRIPIRDVALFCEQIGTMVGSGLNIEQALQVVSRQKGNRP
SARMARHLLPRIQAGGPLSDALDIEVGMPRYLPSMIRAAESGGRLAEGLE
TAGRYLQRQASTRAEIANALTYPVIVLVTVMIALLIVLTVVIPGFEPIFA
GEEHRLPGMTRFVLWLSDLAVNHAITSLLWMGAILAIGFLLYRRSVRFGD
WVAASIMRLPPVRIIDQLNVSRVLNVLGMLFQSGVEASEAVLLAAQAAGS
RRLRVALEGASRKLREGGTIGTTLGAIYVIPDETCALIEVGEHTGELGKT
TLRAAQLLERDTSEQLERMLALVNPLAIAFLGIVVGLVVGGVMLGILSIN
QLALRS
>NE1313 Response regulator receiver domain:Helix-turn-helix Fis-type
MDRPSLLIVDDDETFCEVLARAMVKRGFDTTCMHDIKTALEQAEVLMPEY
AIIDLKLGNESGLLLVEKIRELDPGTRIIVLTGYASIATAVEAIKLGATH
YLAKPVDADDIMAAFERTSGDTETPISPHPLSVERLEWEYIHRILMENDN
NVSVTAKVLNMHRRTLQRKLSKKPAQL
>NE2545 hypothetical protein
MGQKMKNAPVYFTIAQVRHNPVLRLGSYAPDIQDRMRKAGYPDFKKGIAM
AFTLAPQLGDAPQTQPPVVEQVERLMFFSTDSTRGFIVEQNALSFHTTEY
ETFEALADEFMRGLAIVHECVTLAHSERIGLRYLDAVVPPGGETGLAEYL
APGVLGLSSRLPEDVTVSHSFSETHIQTAKCAVLARTIIQSGPLGFPMDL
QPIGVKVADRFREINGVHAIVDTDASIEGRHPFNLELIKSQLQVLRDGVG
IAFDATVTPIAVSAWNS
>NE2193 hypothetical protein
MALPVTSSANQPTGTNNVAATKCKGCFCPGNPCQLCRLPPHTDDPIPENE
PETCRLIREAVPPASFQPGENEYFANLDKATIQCIRSGDVIPNTRRVPGY
PGRVYCKPGLPALGAH
>NE2486 Sigma factor, ECF subfamily
MPFLLSGHLSDQSCKAGTLPYKIRIVLILFLCPHMCLTNDEGSAVPATDL
AAHQIHILYTDHHGWLYNWLRHKLGNACDAADIAQDTFMRILTRQQPVHL
NEPRAYLSTIAHGLVIDHWRRRELERAWLETLASLPEPEAPAPETRLIFL
EALIEIDRLLDSLKPRVRTTFLLAQLDGLTCPQIAERLGVSLSTVERYIT
KALRSCYMLRFEP
>NE1237 Glucose-methanol-choline (GMC) oxidoreductase
MQPEYPPYFSVYRSRRSTRFLLQEISMIEPASPENNQIDSDSDEKFDYVV
IGSGAGGGPVACKLALAGFRVLLLEAGGDDEPCDYRVPAFHARASENEAL
RWDYFVHHYGDKVRQEYDCKFSKEKNGVLYPRSGTLGGCTAHNAMVLMYP
HNSDWDYIAKLMNDPSWHSRNMHKYFRRLERCEYVPRWKIWSRHGFDGWL
PTNPANRWMLLKDKVLTKLTIAAVKESRRMMHPLKRFFLGILKFIIRFDP
NDWRLIRRIPEGLCKIPMTIQHGKRAGTREYILRVREQCPDKLIVRTHSL
VQRILLDEHNRAYGVVYRVGAHQYRADPRHEESVVSEAKTVCAKREIIIA
AGAFNTPQLLMLSGIGPREELEKHEIEVKVELPGVGKNLQDRYEIGVVTR
LKENIPLIKGMKLRPPEPGEEPDPQYALWLQEKGPYTTNGATIALIKRSS
AKHPDDDPDLLIFGLLGNFSGYYPGYARDITRDNSRYPNALCNKNGDHSD
TCKENIFTWAILKAHASNTGYVTLRSSDPRDVPDINFNYFPENEDDSSSK
KDLDSVVEGVETVRNILEDCGDLVDEEILPGANVSSRDEIRQFIKDQAWG
HHASCTCKMGPRSDEMAVVDSRFRVYGTTGLRIVDASIFPRIPGFFIVSA
IYMISEKAGEVIIEDVRSKLGNHS
>NE0155 Integrase, catalytic core
MCGVFREGVAVRYARIEQLRQHHAVAAMCRILDVSESGYHAWRQRPPSAR
QQENLRLETEVKAAHQRTRETYGPRRLRSDLADHGIQTSLYRIKRIRRKL
GLRCKQKRKFKATTDSRHALPLAPNLLDRQFTVAAPDRAWVSDITYVATD
EGWLYLAGIKDLFNGELVGYAMSERMTTSLVSQALFRAVAAKRPARGLIH
HSDRGSQYCAHAYRKQLQQFGMQASMSRKGNCWDNAPMESFWGSLKNELV
HHRRFTTRTQARQEITEYIEIFYNRIRKQARLGYLSPAQFTQKYHAKQIA
A
>NE0482 conserved hypothetical protein
MGCMIQSFRCKSTQAMFEGECPQRFSAIQAVAERKLAQLEAAQTLDFLRS
PPGNRLEKLAGDREGQWSIRINAQWRICFTWSDLDPADVEIVDYH
>NE1377 Protein of unknown function DUF132
MSKAVSRVVLDTNLVLSALVFQSSRLTPLRNLWQTGRIHPLISRETAAEL
IRALTYPKFKLTASEQEELLADYLPYCLTAIIPNPPPVTPPCRDQADIPF
LQLALAGKADVLVTGDKDLLVLAGMFDCRILAADVFLTEFAGC
>NE1607 possible transmembrane protein
MTARIMRSLKLNFPYRRQQIPLVDYLLLFLGMVLLLAVMYTLKQTMSKIT
YWEAREARIVQQQKHTRQPRTPMARINKATQQELKQADDILRQLNLPWEA
LFDALELAASEQIALLSLQPSVTGQTIRITGEARDLAALVEYVQALELEP
VLKNAHLASYKARQDHLRRPIVFSIIATWHESL
>NE0613 bis(5'-nucleosyl)-tetraphosphatase
MATYAIGDLQGCHRHFLELLDLIGFNATRDRLWLVGDIVNRGPDSLSLLR
TLIELGDAVTMVLGNHDLHLLAVAAGSIRQQHGDTLQPVLEASDSSRLLD
WLRHQQLFHHEDEYVLVHAGLLPNWSIEQAQILAQEVETIIRGDRFQTFS
RSMYGNVPDHWHDRLQGEDRWRVIINAMTRMRVCSPEGRMNFSCKGELSS
VPDGLLPWFEIPWRASKDTTIVFGHWSALGLHLTPNLIALDTGCVWQGCL
TSVRLEDRKVFQVPCVRH
>NE0553 PIN (PilT N terminus) domain
MSYLIDTNIIAEVRKGKRCDPHVAKWWGQTSDDDLFLSVLVTGEIRKGIE
LARLRAPTKAARLEQWLDALIAGFSGRILPVDQAVADQWGCLNSPNPRPT
VDSLLAATAQVYRLTLVTRNVADMPKIGISILDPFTFE
>NE0191 Domain of unknown function DUF227
MDRLQLLEYWLKALYPDQPCTLSPASADASFRRYFRASLPGKTLIVMDAP
PQQEDCRPFLHAASIFSRASVHVPAIVAQDLNQGFLLLSDLGTTTYLQAL
TAAPENANRLYQDAIDALIKIQCASQKNIFPEYDRILLSRELELFPDWYM
TRHLHAPPDDDQKNTLKTVFNLILANNLAQPQVFVHRDYHSRNLMVSTPN
PGIIDFQDAVLGPITYDLVSLFKDAYIQWEEAQILDWMIRYWEKARHAGL
PVATDFSIFYRDFEWMGVQRHLKVLGIFARLCYRDNKPTYLQDMPAIMQY
LRQTCERYSELHPLSRLLDRLEDRQAETGYTF
>NE0513 Amino acid transporter
MIFWRVKSLDAILATAEKKSLHRSLGVWQLTLLGVGAIIGTGIFVLTAEA
AQKAGPGMMIAFVIAGFVCAVAALCYSELSSMVPVSGSAYTYSYAVLGEL
IAWMVGWALILEYSVAAGAVAVGWSGYFVGLLNNASLGFEIPYVLANGPF
AGGIINLPAVVICLLVVGLLIIGTRESAMFNAILVAVKIIALTAFIALAL
PATDMNHFDPFLPLGTSGVVAAAASIFFAYVGFDAVSTAAEETKDPQRNL
PIALISSLLLCTLFYLLVSAGVIGAVGAQPLVDAAGEGLAPGSREMAAQC
KSLAAAGQQPVVCSHEALAHVLREIGWEKIGNLLGLAAFLALPSVILMML
FGQTRIFFVMSRDGLLPEVLSRIHPRFKTPYIVTLVTGVGVMAAAAFLPV
GKLANISNSGTLFAFLVVALTVMILRVRDKNRPRPFRTPMIWLVGPLAIG
GCLILFLFLPMDAKLVFPVWTGIGLIFYFLYGYRRSHVLLGIDTPSGGEN
LIEPIRSLADGSDETEVKVK
>NE1107 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE2510 NADH:flavin oxidoreductase/NADH oxidase
MLFNPLQVGSLTLPNRILLAPLTRARADAGHMPNALMAEYYSQRATGGLL
ISECTMVAPGTSAFVNEPGIYNDAQIAAWRQVTDAVHAKGGRIFMQIWHA
GRAAYPGAADGAPIVSSSATAIEGEIHTPQGKVPHAVPRPLTVDEIPGIV
AAFAQGARNAIAAGFDGVEVHGANGYLIDQFLRDTPNQRTDAYGGSLENR
ARLLFEVLTAVTQAIGSERVGLRLSPLNSFNSMKDSDPLALIGFLADRLN
AFKLAYLHVMRADFFGVQKADVMPVAREKYKGVLVGNMGYSADEAEAAIA
EGRLDAVAFGTAFLANPDLPARIRAKAPLNAPDSNTFYAGGAKGYTDYPT
LQLA
>NE2532 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1891 hypothetical protein
MDDELKSLEDKIGLLIRLYQETREENAKLYRLLENAEAVNRQLAERMQTA
AERLKVLLNNLPEG
>NE0190 ADP-glucose pyrophosphorylase
MILAAGKGKRMQPLTDTCPKPLLQVGGKMLIEYHLEKLAQAGFTNVVINH
AYLGSMIEAALQDGKRYGLHIHYSPETLVLETAGGIANALPLLTTPDKNQ
PFAAINADIFCDMDFSILQPVLQSMQSHPTHTVAHLILVDNPPHHPEGDF
FLHDDTGKLVESARPDCQKLTFSGIGVYHPILFENVPSGRAEKLAPLLRQ
AIAAGKATGSYFPGLWLDIGTPERLHQLDTTLNRSDRHT
>NE1556 hypothetical protein
MDFDTFRPILSGLVGGLVVYLLTYSGRKPAATEGGRRLLIYGLGIRIFTA
ILIPSSLFIAYAAAHAHPDQAILAVCIAAAFFSYQVFFVSLAYDNDNIYY
RSPIGGNHVIPWPDVVEVGYSWLMQSYYLRTKQVRRIWCSNMLRGYNELE
EFIPKKADKLFHPELKSYSEAHIN
>NE0214 4-hydroxybenzoyl-CoA thioesterase family active site
MEVVFVHPVRIYYQDTDAGGVVYHASYLNFLERARYEWLRELGFTVDTMI
RSHKMIFLIRSLGIEYFKPAVLDDLLDITVQVVDIGRSRITLQQQILREQ
GTLASATVHAVCVGAETLKPISIPAPLRQKIEKQSS
>NE2037 putative cation-efflux system signal peptide protein
MLNRAGKLLSMALVLLLSACGKQEETVKVASPPSVSVITAEQISMEVREE
TIGSLEGMMDPTVAAEVSGRVLKVLVRQGQHVKKGQEIALLNPVDYQLQR
REALSEVARLEALLENQQRSVERNARLVERNFISQIVLDDVKTQEIALRR
QLEGAKSRLASIEHNQSKTRLLSPVDGTVESRLVAEGDYVKVGDPLFQII
SNQRLRAHLPFPESIASSIRPGLEVRLNTPTSSVEIVSRIRELKPAIGAG
NRAIDVLADVEDQPGWHTGASVSGVVVLGVRENVVVVPEQSVVLRPAGEV
VYAISQDQAQAHQRVVRTGLRQDGMVEIIDGIQAGEQIVLDGAAFLTDGA
RISVQKEMSVQADS
>NE1198 conserved hypothetical protein
MISENLIKTLIALAIFVLTMAAIGYLFEEELEAGTNWIVDQIGFLGLCLI
LLVTDTLITPFPPDVLLLVIAKSSLSEHWFMYVSILGVVSCVAGMLGWGI
GRWLGHFGFIKRILGELEENQREFIHRYGFWAIAIGSITPFPFSVTCWAA
GMMALRGTTVLAAVLVFRIPRFFLYYWLIIAASRWF
>NE1448 Uncharacterized protein family UPF0051
MSTAIRDDYLEKLIESWKKEASPVHSVSYLDQLREDALDRIESLRLPTIR
DEEWRFTDISSLRSMPFPRASAAPLPGLDKQAGTFLDETACRLVFVDGQY
MSGLSSLADDGSITVCSLSELIGTRASIAERYFGQLADFQENVFVALNTA
LMHDGVCIVIPAGVSVKVPVHILYVTSRKDVAVYPRCLLIAEPGANVTVV
EEYVTLYEGASLTNAVTEMYIRDSACVNHIRVQRESSQAFHIANGSVLVA
QSAHYDSVSLALGARISRFDQKIILAGGNAECEVDGLALIAGRQLADTHT
FIDHANPHCRSRQLHKCIVDESAHGVFSGKIMVRPHAQQTDARQLNRNLL
LSDKARMDTKPQLEIFADDVKCAHGATVGQLDNESLFYLRSRGLTEVAAR
NLLTYAFGGEVINRVIVPSLKQRLEEYILARTRIG
>NE1305 proteic killer suppression protein
MIRHFKHKGLQLFFETGDKSGIRPDHASRLARQLRQLNDAVNPREMNIPG
WKLHPLSGDLSGYWSVMVNGNWRMIFVFDGEDVILVDYRDDH
>NE2060 possible (AF047705) unknown [Nitrosococcus oceani]
MTSIKWLGGVLLGMVCSIQVQAHGGLSLAEDMCKLTIGPYTMHFTGYQPE
STQEKEFCEDIPNIGRTIVALDYIDEALRTMTTEVRIIRDTGAEPGSEGN
LDELTVFHSPPKVYMNGSVTFEHDFPAEGKFVGLVTIRDNGTEHISRFPF
AVGTGGKPDMLYILGALALAAGAGIFFFKKKQNP
>NE2401 Generic methyltransferase
MSSKPELFQRTPWLNGEDIEAKRTELRAYFHTTLDKYEQLFETLRGDEAY
YDRPISLRHPLIFYLGHTATFFVNKLVLAGVLAERINPRLESIFAVGVDE
MSWDDLNTAHYDWPTVEEVMAYRRAMRDKVDALISSLPLTLPISWESPWW
AIVMGVEHERIHLETSSVLIRQHKLKYVQSHPAWQPCRNSGNAPENEMIL
VPAGKVVLGKDKADPVYGWDNEYGHHEAELSAFQASRYLVSNQEFLGFVE
ARGYETNDYWEEEGLAWRQFSGAACPTFWIREGDQWRLRLMTEEVPMPWN
WPVEVNYHEAKAFCNWKQKTTGQPVRLLTEDEWYRLYDVAGLTEVPHTEP
AKGNIHLDHYASSCPVDEFQQGEFFDIVGNVWQWTETPTYPFTGFEVHPL
YDDFTTPTFDDRHNLLKGGSWVACGNESIWVSRYAFRRHFFQHAGFRYVI
SDMPATHHSSHYETDKLLSEYAEFHYGDTYFGVPNFSVALAELAIAAMSG
RPARRALDLGCASGRATFELARHFDHVTGVDFSARFINQGVALAEHGILR
YTLVDEGELVSYRERTLAGLNLESVRHKVEFFQGDACNLKPILTGYDLIL
AVNLIDRLYEPAKFLTMVHERLNPGGMLLIASPYTWLEEHTKREHWIGGF
KRDGENFTTLDGLKEILGKHFRLIGAPCEVPFVIRETRHKFQHTLSEVTI
WERVV
>NE1419 conserved hypothetical protein
MLYLIYGEDVPDSLAQRVASRPAHLARIRELQEQGRLLLAGPCPAIDSID
PGPAGFTGSLIVAEFASLDAAREWADADPYLLSGVYAKVTVKPFRKVLPE
>NE1070 putative transmembrane sensor
MTVLHSPEPDIDPAILLEAADWLVLLQSGEATERDRMALAHWRERNPAHE
AAWRRAENMLSTFDQVPPKLGHDTFSHLGNPGRRQMLRLGLLALAMPAAW
ITWRHSPWQEWVADLRTATGEQKLIDLNDGTHLVLNTTSAANVAFTTAER
RLILLAGEILVTTGRDPSPVSRPFIVQTAHGSLRALGTRFTVRHFDDRTQ
VAVLEGAVEVHPAVSIQTTVLRAGEQMTFTGLDMQPVRPVEASTTLWEHG
MLLARDMHLGELIAELNRYHRGVLRCDPSVASLAISGAFPVTDIPASLAL
LETTLPIRISSTTSWWITVQAR
>NE1658 AMP-dependent synthetase and ligase
MSSTIAKLGVPPVQKSNSLSSRIDVVVPTPTVNSSPQKRADFTTLVESLE
YAAQGETGYNFYDSRGNLKSVLAYKDLCANSKIIARRLSGLGLARNSRVA
LIADTTPEFIELFFACRFAGLVPFAMPVPVNLGSRAIYVQQLRGMLESSK
ASVAVANLDFVEFLNEAVGSLVSSELKWSGTPEQLDLLPEADVVFSPNTP
DETAYLQFTSGSTRLPRGVVITERALMTNLRGIVCNGLDVRLGDRCASWL
PFYHDMGLVGLVLAPLAAQLSVDYLATRDFAVRPLQWLKLISRNRCTIAF
SQPFGLKLCTLRARESDLADLDLSCWRAAGVGAEMIRMDTLKSFAAKFAA
AGFDERAFLPCYGLAESTLAVSFPDISRGARSLRVDARTLIEKKIAVRVQ
AEGRKYNEFVNCGRPLPGHEVRIVDDTDHEVPHMTVGSILVRGGSVMNGY
FNNAEETARAIRPDGWLDTGDIGFLFEDDLYITGRRTDVIIVNGRNIRAQ
DVEELAEQQPEVRAREASAFGISNEDGSTSVVLVVECRLASAVDRQSLIN
RLQQMVYMAFGVSCLVELVPPHTLPRTSSGKLSRFAARQGFLRRANLEQP
TAATTIHTAG
>NE1818 hypothetical protein
MAHRIRSSGKSATETGEVSQDVWFDNWEKGLDLWELARHYLRTDTTISLL
WFDSDDLPEVEVSRFGARIQDDGGLAELTGELPWPGRSRRR
>NE1550 hypothetical protein
MLALFKGDYATNEPDYRTSSVVYITLGFSMFRPPRCYRKNFIVPLYRLKK
ATPPVIWVVMFLLCCFMLPFGIAIKIMHPGFFATLRAGLEIAAENQLYGA
VFVTTFLVLGSFLMFIVNGAFVSLRVAAGYEVRRNGTILKWLINKVRRAY
KILNKPND
>NE0891 hypothetical protein
MDTVRKLIVCAILGVFYVPVVSAYEFHDPYPNVKVIDGEVHIQRPGGGTS
LAYPNAGISKDIPVNTSKGQFLVPVEKTFPVEPSKVGKAATRFLKTLPAI
GTAIALYDTVCDLTDICRNSQSGEIEYAPDMPAGYPVTTETGYWRHPFYV
SLTHVTADLLCKSFDYRAAVHFAPNNLTFLRVEVSGGTSYCIYSDKTNNP
PTETAPPNYSIVKVNSGNCFTGYTKVGNECVHNNAPVPVTETHWTDAETK
LNAQPQQTAEALYNSDAPVPVLASTQSAPVIQQIAQTSTQTKDAQGNITG
TQVATTSVKVEDTSTTNNVTYNVTEVTTITTYNENNEITNTQTSTSDNSP
PKTGTDETTVSFDDVPPAQLEEEQPEFNLQTPESWGEGTCPPDEILTVQG
VTFPVSWQPACDTAVQLRPIFVLFASVAAMFIVAGISRAGT
>NE0245 hypothetical protein
MALPQRQIVRAENVKIGISWQCALCDLDIYARPLPGAEVIYFGRMVTTHG
RYWKDYRNSPQPTNGYETISFDVPLDLRPVVIAINFYEGEAPQGVSGEIR
IAVDENTYAAPFHISATRGNRGQGVAKIIETGKASGNHSVIVDPLHIIRA
R
>NE0818 putative sigma-70 factor, ECF subfamily
MPRHRISGINWLAHYNELIRFWTRKTGSRGDVEDATHDVITKLLESNVST
ISSPRSYLHSSIRNRLTDSYRQNKILDLVPIHDLPEDSHPLQPNDPVTHT
CTMQLLTALKEALQELPLKCRQVFIWHRLEGYSQTEIAEKLDISVNMVEK
YMIRATRHLRDRLQDHIPD
>NE1549 AMP-dependent synthetase and ligase
MNPTTHTLPTHNLDMISVETARTLDGLFRERVRRSPHATAYRDYDVSEEI
WRDYTWDDMANAVKRWQMALSQENLSPGDRVAVMARNCPQWVMFEQAALG
LGLVVVPLYTEDRAENAAWCLDNAGTALLLLENMTRWEAMNEAMTGLQQL
KRIVILDASPEDVRQTGDKRAIALQDWLPETPGDIPSVQNDPHVLATIIY
TSGTTGRPKGVMLSHNNILSNAYSSAQVVTVRPDDVLLSFLPLSHTFERT
AGYYVPMLCGATVAYARSIRQLQDDLLIIRPTILISVPRIYERIYAGIQA
KLAEGPAISRLLFKLAVDIGYSRFERQQKRVGWRISHLLWPLLDKLVARK
VMEKLGGRLWQVMSGGAALSPEISRVFIGLGLPILQGYGLTETSPVVCAN
RLDDNLPSSVGRPAPGVEVRLGEQNALLIRGPNVMLGYWNNPEATHAILS
ADGWLNSGDTASIDAQSRVTITGRLKDIIVTSTGEKIPPADMEAAILRDP
IFEQVMIVGEGRSYLSALTVLSKRGWEVVATHCNAKNDPHALACDEYAEE
IILEKIARQISGFPGYAKVHRALLISEPWTVENEMLTPTLKLKRKKIYEH
YQGEIDQLYASR
>NE1506 Universal stress protein (Usp)
MYQKILVPVDGSITSNSALQEAVKLAKRLDACLELVYVYEDAIYLVDENY
FNYEELQKTIHSSSEKILAEAAAVVSASGIPVETRLVQSNNERIASLLVK
EAERWQAELIVIGTHGHSGFSRLLLGSVAEGVVRMASIPVLLIRGARD
>NE1709 putative transmembrane protein
MRYPLDQYWQGMFFRFVKAFVVTMMFVLPVHSSAGEIKIGVVNTEKVLRE
SMPAIEAQKKIEREFQARDARIRELSAQITALQQELEKNTGTVDEEERRL
KERELAGLSRQYQRAQQQMREDLSLRQNEEYGLILERINQVIRELAEKQS
YDLILQLQDSVYRSARIDITDQVIKVLNARESAARKP
>NE0320 GCN5-related N-acetyltransferase
MMSSEIRVRVCSSGDEEALALIGQATFLETFAGVLAGSNILAHCARAHSV
ECYRSWLADSRYKLWLAEISPGNAPIGYMVVSPAELPLADISLRDLELKR
IYVFSRFQGNGLGRRLLQEAIAEARIRKAERLLLGVYVRNDAAIGFYTRM
GFCKLGYRKFNMGGQKCDDYVMGLTL
>NE1901 NADH-ubiquinone oxidoreductase subunit 5 (chain L)
MNSEDLLYFLIALPFAGSALAAILPANARNTEAWLAGSVMIAAFIIAAGL
YPFISEGEALRNEIEWLPGSGLNFTLRMDGFAWMFSMLITGIGFLVVLYA
RYYMSPRDPVPRFFSFLLAFAGAMQGMVIAGNLILLAIFWELTSIFSFLL
IGYWHHNAAAREGARMSLIITGTGGLCLLAGLILVGHIVGSYDLDSVLAA
GDRIREHALYVPALVLILLGALTKSAQFPFHFWLPNAMTAPTPVSAYLHS
ATMVKAGVFLLTRLWPVLSGTDEWFFILGLAGMSTLLLSAYFAIFQQDLK
GLLAYSTISHLGLITTLLSLGSPLAAVAAIFHMMNHATFKASLFMAAGII
DHEAGTRDMRKLSGLFSYMPITATLAMVASAAMAGVPLLNGFLSKEMFFA
ETLGQHSGSVLDGMLPYVATIAGAFAVTYSVRFIHTVFFGEKARDLPREP
HEPPHWMRLPIEFLVLACLIVGIIPGMTVGPYLHTAVTSVLGEHTPQYSL
AIWHGINTPLIMSVTALIGGVLLYLALKNYLARCEDGPPGLSHLKGQHIF
DRMLVTLSWRWARWLEESFGTHRLQPQLFLVMLTALIAGSWPFFGAGIDF
SSPILTGIDPVFAILWGIGVACALGAAYQAKYHRLAALILMGGAGLVTCI
TFVWFSAPDLAVTQLLVEIVTTVLILLGLRWLPKRVAQIDNASTPVAWTR
RFRDLMLAVGCGGGMALIAYKAMTNAPLSDSIARFFLERAHSEGGGNNVV
NVILVDFRGFDTFGEITVLGIVALTVYALLRRFRPAPDSIDIPEQQRRQN
AFDDNHPDRSIGETVHDYLMVPSIVMQWMFPFIILLAVYFFLRGHDLPGG
GFAAGITLAIGILLQYLASGTRWVEDRLYIRPVRWIGSGLLLAVLTGVGS
WLFGFPFLTSHFDHLEIPLIGELPVATALFFDLGVLLLVVGATALMLIAI
AHQSLRSYRTRSADTEKTGEED
>NE2016 conserved hypothetical protein
MRDLLIVSIVAVMAVMALRRPWVGVMLWTWLSIMNPHRYSWGFAYSAPLA
AVAAGVALLGLAFTKERQSPFQGAPVAWFALLTIWITLSWLFGYDPAGDY
VQWSKVMKIYFMTFVALMLLSNKYHLMAFAWVTVGSLALLGAKGGFFTVL
HGGNYRVWGPPGSFIADNNHFALALIMTVPLLHFLQLQLQKPWQRHLMTV
TMLSCVASALGSHSRGALLAISAMGAVLWWRSNRKGLMSFGVLVVLLLLL
PMMPEEWWARMGTIQTYEEDASAMGRINGWLVAIEVAKHYLFGGGMSYQH
EFFFSMWGVYNTNVIAAHSIYFQILGNHGFIGLALYLGLWASTLWQAGWL
RRHARSIPEAKWTVDLGSMVQVSLIGFAVGGAFLSMPYFDLPYNLMVLVV
LARRWVETRGWERDPAIPFMEYAGLRVGKRERADVSSARPRRTGY
>NE0081 possible transmembrane protein
MSGLFSALSRASANLLNPRMLWLWSWPMLVSAIFWWLIGMFFWTPLSGWV
LTVIPADTLQNWLESSRLQVIADSVESIINVIIFVTLAITTSLVITALVT
MPALVNFVAKRYYPDLARMQGGTITGSLRNVISAITIFFILWIITIPLWF
TGIGLLAPLLAAAYLNQRLFFYDALSEHANSSELDKLSSIDRSMRWSLGF
LTGLLQFIPFLNFFAPTLTALAFTHFELGRLAKLRHTAAA
>NE1799 conserved hypothetical protein
MACPEATFFHRAGWKQVIEQAFGHKTWFLYIEEGGCIQGVLPLAEIDSLL
FGHSLSALPFCVYGGIAAISDDARIRLDQAAQMLATYLKVDYLEYRHLHP
FHHDWPTKDLYVTFRKTMDSDVEQNMLAIPRKQRAMVRKGMKYELQSYLD
QDTDHFFRAYSASVHRLGTPVFSRKYFRLLKSVFAEDCELLIITKDENVV
SAVMSFYFRDEVLPYYGGGTDAARDLAGNDFMYWELMRRACERGYRVFDF
GRSKRGTGSYSFKKNWGFEPQPLFYEYQLHQAKTIPEHNPLNPKYQLFIK
AWQKMPLALANMLGPHIVKNLG
>NE2098 Maturase; integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase); part of type II intron
MHRALNQDDDHNQDGQDLLEAVLARDNLARAWRRVKSNRGAPGIDGVTTA
EWPEHARAHWPATREQIEAGRYRPQPVRRVDIPKPDGGQRQLGIPTVTDR
VIQQAIAQVLIPIFDPGFSASSFGFRPGRNAHQAIRQVQAHVKAGYRWAV
DLDLARFFDNVNHDLLMSLLSRSIADKRLLALIGRYLRAGVLVGEHPQPS
EVGTPQGGPLSPLLANVLLHQFDLELERRGHRFARYADDVIILVKSRRAA
ERVMQSLTYFLQSTLKLTVNLAKSQVAPMSECSFLGFTLVGKKIRWTEKS
LANFKHRVRQLTGRSWGVSMEYRLEKLGQYLRGWFGYYGISQYYRPIPEL
DEWIRRRVRMCYWKQWRWARTKIRHLLDLGIPLKAAIQHGVSSLSYWRMA
RTPVTQQAMSNDWLRAQGLLSIKDLWCKAQSYGPDKG
>NE0396 hypothetical protein
MSDKNIQDRAAGAIMGAFIGDALGLGPHWYYDLSELRRDYDDWITGYTDP
KPDRYHAGLKAGQLSQSGFILKLMVRSLVGCGGYDTEDFCHRMDEELFPL
LDGTPVNGPGGYTSQSIRETWRRRVQQMLPWGQTGGHADTTEAVERTLAI
AVRYAYQPEQLAAAVSANTTLTQTDETIVSMTVAYGAVLGLLIQGHALDT
GLSGKLMDLVKSGALPFHAVTGDDLQPPRPGDPDPPRAGRFASPDALLTP
SYMAAAAVDPEIRIEPAWKVSIVYGMPCAIYHQLPAAYYLAARFRDDFKS
AVLHAVNGGGQNQARAMLTGALAGAQTGLSGIPQYLLDGLDNSAVLGKLA
VELAASVNPI
>NE2100 hypothetical protein
MLYTYLQRSMHRLRCILNVGRHTEPSEATMDENSQANKHKHLDFIQAAIN
RMAGNLFLLKGWSITLIAALFALAAKDSNKLYIVIAYFPLFIFWALDGYF
LSQERKFRALYDHVRTLDESQIDFSMDTRPFSSDIRNTWAGSASSKTLVV
YYAGLAVVMLILMYVVR
>NE0929 putative signal peptide protein
MDNHTENENPTPQPPAEPPRRLRKWLAGIATVVVLAVTSGFYWLIFTSAG
LHWLLMIASQATGGALTFSGVNGSLHHTVHAGMIAYQSDELTARVEDMTF
RWQPRQLLTGKLHIETLMIRSVEVHSAASEQEEEPVTLPENLSLPVDAVI
EKLGIRSLKRYTLGNDQPDIVMTDLALRLDSDGKRHHLQTLALGLEWGKI
AGNLEIGIHPPFDLASQLVFYNWANVANTSSEPAGYAAVRLGGNLQQIQA
ALTIADRKLAGKGNFTIHPFDALPLLEANLVVSGLDLNVFSSDLPKADLS
FSSQLAQKQAEQLTGHITISNAMARPLDQDGIPLKNAHMLLDLTHDKIQL
SDIALRLSEREEVPGSLTGEANWQISTKTGQADIHVRRLNPTDLQSSLRP
AKLSGNLHFDGNQESQQGSIKLRDEALRLNMDMALVRTASAITIEKLDLV
RGDSSMSGHGTLQLDESQPFTFEGLLRQFDVSAFADVPPSNLNATFNLAG
QLALQPAANIDFMFEPSRFANQLVSGQGSLVLQQPAHIRSDTHLRLGDNQ
LEIKGALGNPGDRLAVVLSAPKLAQIGFGLQGDIHSRVELGGAIDHPDIT
FEIDSNHVGFREEHRLAHLKASGNLRGTALQLDLQTGEYQQGSQTHLQKL
SLGLSGTQAQHQLSLTSQIDQATEVQFLARGSGDTSRKQWEGVIEKLSLT
GSVPLDLVTQPSIKISPETVALGHTRIMAAAGEIDIQDARWTPKQWSTQG
NFTGIALKSGDLSSEHTEPLKLRGNWQLAADRQLAGHLRIQREKGDFILP
TETPFALGLQTLLLDLQAENNGLNGQLTIRGKHVGETTARVSVPLQSTGS
TWEIRKNAPLKGDLQLNLPDLAWIGPALNNNLRSEGRVTAQASLAGTLDQ
PELQGRMTGDELTIALLDQGLQLKEGRLAVDFNQDRLRLETLNFTAPLEK
PSKDRLLKNIKLSRKSGQLNAHGSLDIHNQQSHLTVELDHLPIAQQADRW
IVVSGNSVIGFKEQALDITGKIMTDVGFIKQPAAGRPELADDVVISGKTE
AEPESPAMQVNLDAILDLGERFFLRASGLEGRLAGQLHLLSKPDQLLSAI
GTISTRDTRFEAYGQRLQVRRGIVNFDGPLDNPGLNILAVRTSQESDFDT
GSSDVIDQPSQDSSVNALAVRGGMRVEAGVEITGTVRHPKIKLVSQPEVP
DSEKLSWIVLGRPADKSGLDNALLLNAAGSIFGGTDESVLENITQGMGID
DFSIRQQAGGSLTNQVGTIGKRLSSRAYLSYERGLTSASAGIAKLTYSLF
PRVSVVTRAGDDSSVDLFYNFQFD
>NE0566 Domain of unknown function 2:Response regulator receiver domain
MKADIFTTRILILDDDPFILKLLMRMLANMGFSSVICCENGVDALKLLDD
VATRPDLILLDLNMPGMDGIEFVRHLVERKFSGSLILVSGEDERMLRTTE
KLVQAHRIPILGYLQKPVQPESLRMLLMKLDVPSSMRSGERNTRKIYDAD
TIRTAIDSGQLINYYQPKVRVDTGVIAGVEALVRWHHPVDGMVYPDQFIG
VAEENGLIGRLTHTVLSRAFIQAKVWQQSGYELKVAVNVSMDNLTSLDFQ
DAVVRLAAEAEILPGKIVLEITESQLMGDVRVSLEILTRLRLKRFHLSID
DFGTGHSSLTQLRDIPFDELKIDQGFVHRASGDETLRAIYDASLALAKQL
NMEAVAEGVEDREDWELLRQTGCDLAQGNFISRPMPAEALGEWMEKWHVR
IEAESLVSPKAGKAL
>NE2232 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1826 Phosphoribosyl transferase
MSYDSLMVFTGTANPKLAHDVVKYLNINLGRANVGRFSDGEVMVEILENV
RGNDVFVLQSTCTPTNDSLMEILVIVDALKRASASRVTAAIPYFGYARQD
RRTRSARVPITAKVVANMLTSVGVDRVLTMDLHSDQIQGFFDIPVDNVYG
MPILLGDIWKHDYQNLIVVSPDVGGVVRARHLAKRLECDLAIIDKRRPKP
NESKVMNIIGDVRGRTCVIIDDMVDTANTLCEAASALKREGAASVIAYST
HAVLSGKAVERVQTSDLDKLVVTDTIPLREDASKCNRIHQLSVASLLGES
MLRISNESSLSSLFIE
>NE0558 putative transcriptional regulator, putative
MLTSPHVFLLTDSYQMYSDDDRNHSGKEALDWFVRMQSDSVTEAERAAFI
AWRDAKPENAQAYADTELLSHKLAALKGHSAYHIPRLTPDQKPSLMTMQS
FNRIALFLLIIFSSAGLWWQTGIVSDEYVTSAGEQRHIPLEDGSKLELNT
ASHADVRYTRLERIVTLYSGEAYFDIADENYRRFVVLAGGLQVRDIGTRF
IVRHDNEQVTVTVLEGEVELTRPALSDRSGELPATLTLLAGQQSHYRNEV
ITSAIPIDSVAAAAWREGKLIFKATPLREVVQELARYHPLRFVIHDPALA
DETVTGSFHIGELSSFLRALEQVLQIQIDTRNPDIASIQRRKN
>NE1887 conserved hypothetical protein
MTDNLLKYGPLGLPEYSERRLLHTELNADVYELVNIPLQLSHLVLLSDRQ
WVNRERELIVQICEHFGIRMLNGAFDQLSVELGGFQLRWERHTEYSTYTF
YSEGPFEVPFAQPAIAHVPPEWLEKLPGEVLVATHIALEDRRRPSRSMSE
LSSLFSSNTVIGSKVSAGSASVWSDNQIHPDGFTRFLIHDDNLRSRQVGR
LVQRLLEIETYRMLAILPMTMTREIIPQLERYGDQLTELISTNIAPNSIE
DEQLLLVKLTALATEIERISAQSSHRFSASQTYHTIMQQRITELREERIE
GLQMLYEFRKQRVTSAMSTFDLVWSKLETLSLRVERATSMLRTRVDISME
SQIRDLLRSMDTRAYLQLRLQETVEGLSVVVLSYYLLGITGYGLKAAKAA
GLNIDIELMTGIAIPVIVTIVFFAIRRFRRIVSKSAFGENKGGE
>NE0161 Hemolysin-type calcium-binding region:RTX N-terminal domain
MASSTVLSQHINVPVEVNAEYAIAPDNPANFLLAVRKEDISNYSRVGNDL
AIQFKDGHALHIQGFFSLGKTINNLILGEDNSRFLVNFHNAMSDTGDGIV
ETSIIYEPIQDGTATIALLGLLGATGIGIRFAAAGGGNGDHPMMPPPAPT
LTITGNGQPVPDGQSTNDTLPTLSGTAKPGSAVTVYDNGQAIGTTPVDSN
GNWSFTPIQPLSDGNHSLYAVATDNTGTSEPSSSLTIEIDSIAPATPSID
SVMDDIGIITGAIMAGGSTDDTTPTLSGATEAGATVSIFDGDRLLGTTTA
DTSGNWIFTPPSLLSEGVHSLTVIATDATGNSSDPSASFTLTIDTTAPTF
VSAATSIDGSSLVLTYDEPLDTTHPPATDDFVVNVDGTPVTVSNVTVNGS
TVTLELATPVNNGETVTLSYTDPTAGDDLNAIQDLAGNDAASLLNSPVTN
TVPVTPDTTAPTFVSAATSADGSSLVLTYDEPLDTTHPPATDDFVVNVDG
TPVTVSNVTVNGSTVTLELATPVNNGETVTLSYTDPTAGDDLNAIQDLAG
NDAASLLNSPVTNTVPVTPDTTAPTFVSAATSADGSSLVLTYDEPLDTTH
PPATDDFVVNVDGTPVTVSNVTVNGSTVTLELATPVNNGETVTLSYTDPT
AGDDLNAIQDLAGNDAASLLNSPVTNTVPVTPDTTAPTFVSAATSADGSS
LVLTYDEPLDTTHPPATDDFVVNVDGTPVTVSNVTVNGSTVTLELATPVN
NGETVTLSYTDPTAGDDLNAIQDLAGNDAASLLNSPVTNTVPVTPDTTAP
TFVSAATSADGSSLVLTYDEPLDTTHPPATDDFVVNVDGTPVTVSNVTVN
GSTVTLELATPVNNGETVTLSYTDPTAGDDLNAIQDLAGNDAASLLNSPV
TNTVPDTIAPTLNITATDLVLAAGEAITVTFQFSEAVIDFVETDVVVTGG
SLSAFTQVDDDTWTATFTQTGSDAPSISVPNGTFTDLAGNPGTGDTLDGT
NGFMADLTPPALLSAATSANGSSLVLTYDEPLDTTHPPATDDFVVNVDGT
PVTVSNVTVNGSTVTLELATPVNNGETVTLSYTDPTAGDDLNAIQDLAGN
DAASLLNSPVTNTVPVTPDTTAPTFVSAATSADGSSLVLTYDEPLDTTHP
PATDDFVVNVDGTPITVSNVTVNGSTVTLELATPVNNGETVTLSYTDPTA
GDDLNAIQDLAGNDAASLLNSPVTNTVPDTIAPTLNITATDLVLAAGEAI
TVTFQFSEAVIDFVETDVVVTGGSLSAFTQVDDDTWTATFTQTGSDAPSI
SVPNGTFTDLAGNPGTGDTLDGTNGFMADLTPPALLSAATSADGSSLVLT
YDEPLDTTHPPAIADFVVNVDGTPVTVSNVTVNGSTVTLELATPVNNGET
VTLSYTDPTAGDDLNAIQDLAGNDAASLLNSPVTNTVPVTPDTTAPTFVS
AATSADGSSLVLTYDEPLDTTHPPATDDFVVNVDGTPITVSNVTVNGSTV
TLELATPVNNGETVTLSYTDPTAGDDLNAIQDLAGNDAASLLNSPVTNTV
PVTPDTTAPTFVSAATSADGSSLVLTYDEPLDTTHPPATDDFVVNVDGTP
ITVSNVTVNGSTVTLELATPVNNGETVTLSYTDPTAGDDLNAIQDLAGND
AASLLNSPVTNTVPVTPDTTAPTFVSAATSADGSSLVLTYDEPLDTTHPP
ATDDFVVNVDGTPVTVSNVTVNGSTVTLELATPVNNGETVTLSYTDPTAG
DDLNAIQDLAGNDAASLLNSPVTNTVPVTPDTTAPTFVSAATSADGSSLV
LTYDEPLDTTHPPATDDFVVNVDGTPVTVSNVTVNGSTVTLELATPVNNG
ETVTLSYTDPTAGDDLNAIQDLAGNDAASLLNSPVTNTVPDTIAPTLNIT
ATDLVLAAGEAITVTFQFSEAVIDFVETDVVVTGGSLSAFTQVDDDTWTA
TFTQTGSDAPSISVPNGTFTDLAGNPGTGDTLDGTNGFMADLTPPALLSA
ATSANGSSLVLTYDEPLDTTHPPATDDFVVNVDGTPVTVSNVTVNGSTVT
LELATPVNNGETVTLSYTDPTAGDDLNAIQDLAGNDAASLLNSPVTNTVP
VTPDTTAPTFVSAATSADGSSLVLTYDEPLDTTHPPATDDFVVNVDGTPI
TVSNVTVNGSTVTLELATPVNNGETVTLSYTDPTAGDDLNAIQDLAGNDA
ASLLNSPVTNTVPDTIAPTLNITATDLVLAAGEAITVTFQFSEAVIDFVE
TDVVVTGGSLSAFTQVDDDTWTATFTQTGSDAPSISVPNGTFTDLAGNPG
TGDTLDGTNGFMADLTPPALLSAATSADGSSLVLTYDEPLDTTHPPAIAD
FVVNVDGTPVTVSNVTVNGSTVTLELATPVNNGETVTLSYTDPTAGDDLN
AIQDLAGNDAASLLNSPVTNTVIAPLTAINNTEVAMVNINTTSAAVNEGS
ATTLLGMGVGLAGLDLQAELLGTEKVSFDIPSGHVEDLVFTFGSLLNLSL
LGDYRIVVQKWNASTGKWTTVDSLSDGATILTIGLLNDGSYGVQQLLGEG
QYRAFVSYGGVGLSVLNTLSVTGTEYAHTYVADTTSSSGNVITNDQNVTA
TTIISHINGEPLVAGDNTITGDWGSLVINRETGAYTYTPNTANSSSIGQM
DTFTYTLFDTANNATSTATLAINISSNDVAANDAGAAGIVFDNPPPEDRF
NDSATADTIPLNPIPNTYNSASFTINGNEAVSGTVKLSTLVAVITNTTLV
IQEETAPGVWTAVSNGTFDFSSLASIGTFGTINLATLDLDAGTYRIHLKS
LTGILTSINITTDVNVTHTDQYVVTGTTNVHGNVLANDVTNDAQPTFQVL
DATNTYVDVTDGMTVTGDHGTLTLYADGHYIYTPDMATDYFTSPLIDSFE
YQLTSGTVIDQARLDITNGSYGVERGSTANDTLIGNAGNDILIGGSGNDM
LTGGTGGDVFLWENLNPIDATGGNGTDIITDFTIGMIGTDANADIIDISR
LLVSFDPGTSSVENFLSLTSSGGNTVINIDRDGSNLAYGPTPLVTLNGVT
TDLATLLANNQIIV
>NE0485 conserved hypothetical protein
MKRVNQNLYQLLRRIYWRLPLPEETKELLVGFARRFLRGMKKALAEPVTT
ASSASVSREKILQEYANQILAIPRKSGNEYVEISSSSYQRKEGDAKILAY
YLPQFHPTKENDMWWGKGVTEWNNVSRAMPQYVGHYQPRLPGELGYYDLR
ILDNMRRQVELAQMYGIYGFCFYYYWFDGKRLLDKPLDMFLEAKTIDFPF
SLCWANESWTRRFDGSCGEILVKQSETVESYIAFINSVVPYMRDSRYIRM
NGKPIFTIYRPSFIPECASTITAWREHCVKAGIGDIYIIGIKEHTWDVNL
IELGFDAQSEFHPGTLFKHCVDISSQINYMQDFGGIVLDYRDIVEHKKYF
LYNHPKLHRAAMPMWDNSARRDNKGMIFEGASPDLYERWLTDILLEAKNR
EDLEDHYIFINAWNEWGEGAYLEPDKKYGYAYLNATRQAIEGVRS
>NE0557 possible sigma-70 factor, ECF subfamily
MMAVLHRSSDIIAATICRILIPQFYSDNTVPINGNYSCIFLLSYMFPLDS
PLLADLYKKYRSRLISRIARLTGCRETAADLAQDAYMRLLDQTGVACPSN
PAAYLFRIGRNLAIDHQRRPDFRQHDETSFEEELPSSFSGLDQQAVYRQE
CEQLWTAIMELPRPVSRALVLSVIEGVSQAAIAEELGVTERTVRRYISKG
MAHCQRRLRNPLPPARYDR
>NE1399 GCN5-related N-acetyltransferase
MATTGIPAFQAEIREMHPDDLEQVIRIEHEIFLFPWSIVNFSDSIKAGYH
CRVLVQPNSDLVMGYGILMTGPGEAHVLTLGVGAAWQSQGLGRKMLRYLI
ELSRKHQAEFVLLDVRESNTGAINLYQRLGFQQIAVRKGYYPAMCGREDA
LVMKLEL
>NE0966 Uncharacterized pyridoxal-5'-phosphate dependent enzyme family UPF0001
MTTIASRLQNVKNRIIEAAKKAGRDPESVQLLAASKTNTPDKLREAWEAG
QTVFGENYLQEGLVKIRALSDLPIEWHFIGPIQSNKTKLIAENFSWVHGI
DREKIATRLSAARPESLPPLQVCVQVNVSGEITKSGVDPEKAAELAAFVS
EQPRLQLRGIMAVPELTAVTALQREQFQMMREVYEQLQQQGFNLDTLSMG
MSEDLENAIAEGATMVRIGTAIFGPRRYAIPEELGSRQ
>NE2018 Glycosyl transferase, family 2
MRIRFSRCASPRGKSSTGVIVPVCSAIYSIRRFWRWNKMKISVVIPTFNR
CHLLAEAINSALDQASSELEVEAVIVDDESDDGTAAWLETAFAGEHRVRV
LVNGRGKRPAGARNTGILAARHPKVGVIFSRARYEQDETEVPYMDPNFEC
KLCYASIATEDKDAVVFDRSYFGHLLKYSCYFNLSAVVMTADAARQLMSE
SLHIAEDFKFWVRLSHEQIFACLKVPQIRYRSHSANIPFKADKDPAGNTP
SMLHTYRLMLGYPGLARSNREQLRKHMAKEYFDWAYRCSRKGRLAEALRL
RLALFDLLWKNVVACVKLPWIGLRYRADRVGSTEQ
>NE1882 conserved hypothetical protein
MFTSLRLTHFKAWQDTGMITLKPVTVLLGTNSSGKSSLIQSLLLLKQTVQ
SPDRSIHLNLGGDEISDLFDFGHFDEVIKHGTSSPREFSITFTFRTAGQS
RINSGEFSCGYRQTATGVTAIQELVLSAAEHRFRAIRREKGSYAIFTDDE
TRPRAKGEQLAPERSVALPAAAIVALGSAGALAEDISLAIRRELENICYL
GPLRRKPERDYVWNKTTPGQIESDGHRAMDILLSSVLIKNDKQNEILDEV
SFWLNRMKVAERLEIRQVGRSARYEIVVHQGGTITNLRDVGMGVALVLPV
LVAGYFAPAGSTVILEEPEVHLHPLAQAVLAEFFVALSRSRRIQFIVETH
SEHLFRRMQTLIARKNTSVEQIALLFVESAAGNAALRRLEVDEFGRVSNW
PQYFFGDALGETREQARLMHTRHQEQHRK
>NE1480 conserved hypothetical protein
MFGRKHKIVSDINTLIGEGTVITGDIHFSGGLRVDGRICGSIVSANGKQH
SLLVLSEQGMVEGKIKVPYIVINGTVRGPIHCDESLELQPAAKISGDIFY
KTIKIHQGAVIEGKMVCTETSQPELITYVASAEESKGQQESTE
>NE1977 putative transmembrane protein
MKILAILLLLSILYSLGSALYFMIKDKGDSTRMVKSLTIRVTLSLVLFML
MMLWVYIEYIHGN
>NE0099 hypothetical protein
MCARIQQLPVFFIFILYGSCATTKRNIYVLAINKLMEQQIRKYESFNDLP
ADCKKLFDSGEKDSFDLSRDWFLLLETTVIRQTKEICIFTLEIEGVTQGI
WPTLLQKKGKLSLRQISSFTCFYSSLYQPLISSSLTVDKLADCLRWILSD
TRTDVLRFDIMDPSQSSFNLHEQALKKIGFKTDRFFCWGNWYLPVNNQPF
SVYLQNLSSRVRNTLERRKKKFLAGGHGKLEILTTHDKLPIAIQAWEKIY
NASWKIPEPYPEFMPSLISLCAAKGWLRLGIAYYDEEPIAAQLWIVNQGR
AAIYKLAYDEKFAHLSPGTILTAHLMQHAIDVDKVHEVDYLTGDDAYKKD
WMSHRRERLGLVAYNLRSFWGLIGISKHIAGKIRKKILKSLK
>NE2516 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE2447 transposase
MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEAV
>NE2257 SAM (and some other nucleotide) binding motif
MRLILSQCLANLSVRFLMNRNSYNKIAHLWNVARNGFFGREREYLDAILS
VAPIGSTILDLGCGTGRPMAEYIVSRGRCVLGVDQSEEMLRLARQKLPHE
QWVLSSIESYEPVEGYHGALLWDSLFHIRRTEHELIVSKVVRGLPSGGRL
MLTVGGSAHPEFTDFMYGEEFYYDSNTPQETETFLQRLSCRMVIGEYMNL
PDGGRDKGRYAIVAEKI
>NE1710 Bacterial surface antigen (D15)
MKLRFLILFFSLYSLGCMANDSLVVRDIRVEGIQRTEAGTVFSYLPVKVG
DVLDSKKASAAIKALYATGFFSDVKLKSEGGLLIVQVQERPAIAQISING
AKEFDKDKLKEGLKQAGLSESRIFSRSLLEKAEQELKRQYISRGKYAVKI
TTTTTPLERNRIGINFDIKEGKTARIKQINIVGNHVFPEDDLVDLFSLKT
PGFMTWFTKDDQYSKQKLSADLETLRSYYLDRGYLEFNIESTQVSITPDM
KDIYITVNVTEGPQYTVSDIKLAGELLVPEEELRKLIKLEPGGIFVREKL
TESIKLISDRLGNDGYAFANVNASPELDKETRKTAFTFFIDPGRRVYVRR
INISGNERTRDEVIRREFRQMEGGWHSTEQINRSRQRVDRLDFFTGVNIE
TPPVADVPDQVDINVNVVEKPTGAIMFGAGYSDREGIILNGSIAQNNILG
TGNFLSLQVNTGSVNKVISASFNNPYYTINGVSLGLEAFKRDINTRSLSS
VGMFNTDTTGANIRFGIPVAENDIVSLGLGYEHTKIDLRDDSPQRFKDFV
DQFGKTSNNLPITLSWARDRRNSAIWTTSGMTQRLFGEFGLPFGDLNYYK
VSYEQRWFFPVTKMFTLMLNGEVGVGDGYSDKPLPFFKNFFAGGFNSVRG
YNINTLGPRDSDDRVLGGSKRIVGNIEVLFPVPFMKEDKSVRLSAFADGG
TIVNSFSGLGFDDFRYSAGLAATWISPMGPLKFSVAQPLNNQSGDKLQRF
QFQFGQTF
>NE2225 Protein of unknown function DUF86
MADDVLINKAATIERCVARAHEEYAANPENFATDYTRQDAAILNIQRACE
AALDMGQHLIRREKLGIPQSTRDVFSLLARGGWINIELADSLKRMVSFQN
IAVHDYIALQLPITVRIIENHLDEFLQYSQTLLLHDAALGGNQRG
>NE0935 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPGPMVKSNA
>NE1855 conserved hypothetical protein
MKQEFLHIASSLWATVAAQISRHTGQTFAPESLTPIGGGCINQTFCIRDH
ERQYFVKLNKAGNLAMFESEAAGLGEILDSASLRVPQPLCCGSHHDDAWL
VLEFIDLQNRGNAAALGIGLANMHRHTAETFGWIRDNTIGSTPQRNATAS
DWISFWRQHRLGYQLNLARKNGHTGSLQSLGERLLSEFQHFFTDTLPLPS
LLHGDLWGGNYAFDQDGQPVIFDPAVYYGDREADLAMTELFGGFPPDFQA
AYRDTWPVETGYTTRKQLYNLYHILNHLNLFGPQYLSQAEITMKKLLAEL
Y
>NE1434 putative coenzyme PQQ synthesis protein c, putative
MATNTFKQQVDSIIQSRHLLQHPFYIAWTEGKLTREQLRHYAEQYFYNVL
AEPTYLSAVHFNTPHFHNVENSGDISIRQEVLKNLIDEEHGEKNHPALWK
AFAFALGADDASLTQADALPETENLVATFRDICINEPFYAGLAALHAFES
QVPDIAAVKIDGLAKFYGMKDPDSYEFFSVHQTADIFHSQAEWAIIEKFA
DTPEKQAEVLAATRRACDALWKFLDGIHENYCANLICEEKTAATLH
>NE1475 conserved hypothetical protein
MSISKIKLLFSCLLVCILAACATSPTGRTQVAFMPDAEVNSMGLQAFDTL
KRENKISHDTASNRFVSCVASAITREVGGEWEVVVFEDESLNAFALPGGK
IGVHTGLLNLVDNQDQLAVVIGHEVGHVMARHSNERLSQQAGTNLGISLI
QAIAAPQSALGQTAVGLLGIGAQYGVIMPFSRLHESEADTIGLDLMAKAG
FNPAESIRLWQKMAQASQGAEPVEFLSTHPSHTSRIQNLQADLPRAQRLQ
QQANAAGKNPRCTK
>NE1679 conserved hypothetical protein
MRRIFAKVLMLSAVSFMTSNLVQAADPEVIGEFDDWIAYVYTEDSSKVCY
MVGKPKKEEGNYTKRGAVYALVTHRPAEKSKNVFSFVAGYPYKQSSEVTV
SIGNQRFKLFTQNETAWAPDSAIDNKLVAAIRGGSQMVVSGTSSQGTATT
DTFGLKGSTAAYTAISKECGIK
>NE1692 conserved hypothetical protein
MREIGLTIGLLSISNVFMTFAWYAHLKDLSTKPWLVAALISWGIAFFEYM
LQVPANRIGFNVLSLGQLKILQEVITLSIFVPFSLFYMKEPLKLDYLWAG
LCILGAVYFIFRGNMADAGS
>NE0024 Cytochrome c, class I
MIKSGQIAAIMMALVNGERKMNKMIGLLVVSAVIGLAGCAKTDTYTPAAD
ASGEDIFYANCTKCHKPETDGKVMILSSKMTTKEAIIEKVNKGGMTMSSF
PNITGEPAQRLAEFILANSKTK
>NE1204 TPR repeat
MRSVRKRTVLVSTLLALVLTQVARADDKGSHPAVIGDVHFKVECNATAQA
KFNVAVAYYHSFQWQRVIATADDVLKVDPTCGMAHWVKALAMLDNPFAWP
VTLSEKAIAEGPVLLDAARKAGLKTQRERDYVDALAIFFKDLNTTNYRER
AESFEKAMAQLAQQNADDSEATVLYALILSRNFDPTDKTYRNQLHAAELL
EPIFAREPNHPGVAHYLIHSYDYPPLAKRGIDAARKYAKIAPDTPHSLHM
PSHIFTLTGFWQESIDTNRRAAEMADDSITHDGHHASDYMVYAHLQLGQD
LAARKIMEQEQVRHGIDMIGVAYPYAAIPARIALERRAWREAADLPLYAR
DTYPWKKYPQAEAVNAFARGVGSAMSGEPAKANFEAKRLIKLRDAATAMK
LNYWADQIDIQAEVVRGLADFAEGKRDEGIAILHRAAEREDASAKNVVTP
GPVVPAREMLATILERDRKPADALAEFEKVLEQHPNRYRTIAGAAQNAKQ
AGNEQKADHYAELLLKLAEHADSPRPEIAEAKSMLGM
>NE2338 hypothetical protein
MTALTTDRRAQTRWSGRLLPSLGILLVTGGLVLLTWYTWLVLTPDTAPYR
YQQVTTGNASEYPELELDTWPDLTISQYDIHVEGTEQPVAQAWFGQRANQ
PQVLLNWKNQTREPLLALDQKASELSALAAAIDKHASRDALLLGWWDTSR
QLALLTGRDVLFHTPLHEPLIIPPEWQPHEQAIRAYENQQAGTPADPQEQ
ELFMRFAQSLVNPPANGLDDLRQLAGTRDTYLIVHVSDLYKLGLMYPDKF
GIAYKHYRMTGNLHGMISHLKTEMRTRGYYTYTLQSLSDELIRAFFLIDE
ASYDTLLAKLLPFTSQPSPVERTSPRLIYQQGGYWVYHLTAKAPAHNTLQ
SGKDSNETTDSTVSVDQVQ
>NE0351 putative tRNA/rRNA methyltransferase
MTHSQYIFGFHAITSRLRQHPDSIKEIYLDTNRHDQRARDLMSLAETTST
RLILCEQDRLCKMTGTTRHQGVAAHVTKLRRYATLEDLLEGLTEPALLLV
LDGVKDPHNLGACLRVADAFGVHAVVVPKDRAVGLSATVHKVASGAVDTV
PFFAVTNLARTLRELKEMGLWIVGTAADAPDTLDSVTLTRPLAWVLGAED
GGMRRLTREACDLLVSIPMSGSIESLNVSVSAGICLFETFRQHSQRTGSG
LNVPAPKNAV
>NE0862 conserved hypothetical protein
MTSHCIQCGSSRIHKSRFRPGERTPANLLLSPYRCRDCKARFWGRNNDAC
LAAAAGVGGIFLLGTFIWVGFSLNDPMERSLSSQPQTALTGLSWLDSQSS
PHVSSTNLTLAEAIERGEKIDLQSIKSEDTSSLPDPSDNRFYTINLFLEK
ARKGNADAQYQLGILYLAGKGTLQDFSEASKWFILAAEQNHPLAQYELGL
LYQVGQGVEMDNEKSYMWFNLAAAAGIEQAIAARDKAMRSLSRTQLSSAQ
KAAREWLDSRNKLGK
>NE0334 D-isomer specific 2-hydroxyacid dehydrogenase
MPDDAQRITRILTLNSIAQVGLKRFPPHLFQIGSDITGPDVILVRSHNLH
DMEIPESVIAIGRAGAGTNNIPVNQMSARGIPVFNTPGANANAVRELVLA
GMLMASRNLIPALRFVETLEGDDQSFNLQVEAGKKQFSGLELPGRTLGVI
GLGKIGRQVADIAIKLGMKVLGYDPKITIDSAWSLPAEVQKANQIEDLIR
RSQFISLHVPLNDSTRHLINDSLISCMQKNTILLNFSRDAIVDEDAVLTG
IKSGVIRYYVCDFPGRKLQQQQAVVTLPHLGASTREAEENCAMMIADQIM
DYVTNGNISYTVNFPDVVMERGTPYRVAVANANVPNLLGQISTCMADVGL
NIHNMVNKSRGEMAYTLVDTDKAVPQETIDAIVRITGVLMVRYLPISESS
ERA
>NE0623 dUTPase:dCTP Deaminase
MSIKSDKWIRRMAAEYNMIEPFEPNQIKQRNGESIVSYGTSSYGYDIRCS
DEFKLFTNLNSTIVDPKRFDSNSFVDVKGDICIIPPNSFALARTVEYFRI
PRNVLTICLGKSTYARCGIIVNVTPFEPEWEGYVTLEFSNTTPLPAKIYA
NEGVAQVIFFESDEVCETSYKDRNGKYQFQQGVTLPKI
>NE1548 Acyl-CoA dehydrogenase
MLEFLVVLILLIVAGVALFAVPFLRINIVSSLVLRIFRKLLPQISQTEQE
ALDAGTVWWEGALFSGKPDWNQLLAYPKPELTPEEKAFLNGPVEQLCAML
DEWHITHERHDLPPHVWQFIRENRFFSLIIPKEYGGLGFSALAHSEVVMK
ISSRSNTAAVTVMVPNSLGPAELLLHYGTEEQKDHYLPRLANGSEIPCFA
LTGPDAGSDAGSIPDTGIVCRGVFDGQPDVLGIRINWEKRYITLGPVATL
LGLAFRLYDPDGLLGNEKEIGITLALIPTSTLGIQIGRRHYPLNGVFQNG
PNSGKDVFIPMDWVIGGQDKVGQGWRMLMNSLSVGRSISLPATSVGAAKL
SAWSSGAYGRVRVQFKLPIGYFEGVEEALARIAGNTYMMDAARIMTAGAV
DLGEKPSVISAIVKYHLTERSRQIINDAMDIHGGKGICLGPSNYLARTYQ
QTPVGITVEGANILTRNMIIFGQGAIRAHPFTLREIDAAWDQDKKRGLRA
FDEALTGHIVFTFRNAFHSLIYGFSTRLVKDIPQNVTHETVNYYRQLTRF
SAAFALLTDIAMLKLGGALKRRERLSARLGDILSMMYLCSATLKRFEDDG
RPAADLPLLHWSVQDALYRTQQSFHEFLLNFPVSRITRGLLRFIIFPRGM
RCLPPADSLSHEVARLILAPGEVRDRLTAGIYLPEQDNEPLALLKKALAC
TIDCEPIEARLRQAVKDKAITPHEHEQINQALAQGIITSKEAASLEQMKT
LRRRVIMVDDFPADLRQTGNSHGETPL
>NE2505 conserved hypothetical protein
MNSGEYKYIWQASDWPNWRFDLAALAEPMAEVSRAQGLLMGRLADVGMAL
RDQASLAALTEDVVKTSEIEGEQLNVESVRSSIARRLGVDIGALAPVDRH
VEGVVEMVLDATANCQALVSRERLFGWHAALFPTGYSGLSKINVGGWRDD
ATGPMQVVSGPIGRQRVHFEAPPADRLETETSRFLDWLNGTLNEPPLLKA
GLGHLWFVTLHPFDDGNGRIARAIGDLLLARADGSPQRFYSLSAQIQRER
KAYYDILERTQKRSMDVTEWLAWFLDTLHRAVDQAQHTLDAVLTKARFWQ
RWATTPLNERQVKLLNKLLDGFEGKLTSSKWAAIAKCSPDTALRDINDLL
TRGVLRKSDAGGRSTSYELNDLPE
>NE1586 transposase
MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEAV
>NE2235 AMP-dependent synthetase and ligase
MIGKAGIGTILDLLRVRSAATPAATAFQVLVQNKSWQPTNWEQFVQAAGR
VGARLTDFGVRKGDHVGIMAATSLDWEYAQMGALFAGAVVAGIDPEYPAD
QLNHVIENLGLSVLFVQNRSVFAKIPLELRQQISVFIFFEGESQQKNEFS
MINLLVEQETDTSICLPDVLPQDEAVIVFSSGTTGMPKAIIFTHEQVIMA
VEAIMRVFNDLSSQTILLCWLPLSNLFQRIVNFCAIKIGASSYILSDPRD
LMRYVKQVNPDVLIGVPKVLERVHSGVVDHIEGRVWPLRMLARWAIRIGY
KRAAARSDRGDSFGLMNKLLWKQAENMFLRRLRMVFGSRVRYLVSGSAAM
PVWLLRWFDAIGLPVYEVYGISENVIPVAVNSSGSRKLGAVGKPLLPNEI
KLTMDGEILVRGPGVFKGYQDRKEESNLRFSPDGYWHTGDLGELDTAGFL
SIVGRKSDVFKTSAGKWIAPVRIEERLGRIAYVEQSVVFQYESGKIVAVI
VVDLEKCAQKTSAVRYSEVSLHMENMAKNKLHQILETDINAELEDLPLYQ
RPVYVAVTDKFFTVSGGELTVNMKVRRSIVAQRFSAYFENTVCKSDQGRA
ENGHVVMSGPTIFFI
>NE1356 conserved hypothetical protein
MVYSLGYKTMYIVKRLDEFDKWLDGLKDRPTRIRLIRRLDKARQGLLGDV
KSVGEGVFEMREFFGSGWRMYYIQQGGTIILMLGGGDKSTQSKDIQKAIQ
LANDLGENSYE
>NE0466 Glycosyltransferase family 35
MQIPRLLMSYLRDLPADLLPLVEQIADLAMDLRWTWSHGGDAMWKIMDPQ
LWEQSENPFVVLQNLSHERLQTLSQDQEFRQHLDRLVEARNRYCSCGGWF
GEVHDNAGLRRIAYFSMEYGLGKALPLYAGGLGVLAGDYLKAASDLAVPV
IAVGLLYQEGYFRQMLDTSGWQQEIYPYNDSSSLPLHPLRAHNGTWLSIP
IRLPGRTVQLHVWEARVGRVSLYLLDSNNPLNSALDRGITGKLYGGGEEI
RLVQEIVLGIGGWQLIEALALEIDVCHLNEGHAAFVTLERARCFAQKRQM
DFHEALWATRPGNLFTTHTPVAAGFDTFPRALMIKYGLPYARNLGMPLDE
LMAFGRSNGQDDDEPFNMAYLAARTCGMMNGVSRLHGQVSRSIFQKLYPQ
WPECEVPVTHVTNGVHVPSWDSPWADEIWTDSCGKGRWLGSEETLATKIH
KLDDAALWNLCGQERADLVRHARQRMRILQGQRGADTEAISQVDRILDPN
VLTLGFARRFTAYKRPNLLLHDRERLARLLTRTDRPVQLVVAGKAHPADN
EGKRFIQEWAQFSSRPDVRNHVVFLEDYDIALAQELVQGVDIWINTPRRP
WEASGTSGMKVLANGGLNLSELDGWWAEAYTPDAGWALGDGREHHEPGWD
AMEAEQLYRLLEEEIIPMFYDRDASGIPAAWVARMRTSMAHLAPQFSSNR
MVREYVEQLYLPAAAAFQQRTAPGSSTARELRQWDEQLKRSWHEIHWGNL
IVSQDEQGWLFEVPVYLGGIPPESVQVQLYADPIEGKVPVREAMQRGNEM
PGAVNGYLYTARLNTLRAHTDFTPRCIAHHPDACIPIENHLIHWWSGIVS
LKNL
>NE0619 hypothetical protein
MIWYLEKWRRRWILGRKPIRESVWQSAVDQLPFLHGLTQDEFRQLREWTS
IFLHDKKINGVQGLVVTEAMRVMIAVQACLLILKLDPEYYDDWVEVIVYP
GKFILDHAYTDETGIVHTRRMVASGESWQAGPVILSWEDVVHVHHESGYN
VVIHEFAHKLDMLNGSANGYPPLHSDMDPRIWAVVFSRAYEIFCRQVERG
EVTIMDPYAAEDPAEFFAVSSEVFFTRPYRIKQYFPEMYQQLARYYRQDP
AARQECETDS
>NE0109 hypothetical protein
MDILSLVAGFLAGALTIYLASYFMKPSENKTQKTVDAGTINLFDQLWQTH
ERLLNEMKQDVENPDFKFHREFYVLKKGWGWERWGFHRRGPCIAYFLEDH
SDLLPLLDSLTSYGLISQTGETGKNTARFQLSEKLVELLRGKNTNKS
>NE2419 Amino acid permease
MPVDEKKGPFSAINNRSAGEAEAGGEIIHETGQLRRSMSVWNCFTLGFSV
VSPVVGLYAIFGVQNLVTGGGWIAALILSLFMQLMVATVYAELSSQFPIA
GGAYKWSRHLIGPRTGQYAGAIYVSSTIAMLTTTAYTGGMWLSAVIVGGD
IQGASAVLWGVVFLLICTTVNIVNQKVYKVINLLGVSAEVLGSICVALVL
FFFFRIYPVSELFQHLGTGLAPTSLTAFLAALAIAGWAFIGFDSCSTIAE
EAHNPERMIPRAIFLSLLAVGSVVLFNSSALTLSISHDTLVEASATSDPI
SPIITSTIGAWAAKPFLVIVMTAFLSCGSSVVNYVSRIVFSMAREGNLPA
FFSQVTRKTQLPRNAIGGTVLMAGLGLLFGLNDGAVATIIAFGTGGLYAM
FSMTTGVGLYARLMGRWNPALGAFRLGRWGLVINTVAFVWALFELINIGW
PREYVAAPGAPWWQLWAVPLVLGTILALTTLHVQMGHKRRMKRALSGTVT
DDLQK
>NE1841 hypothetical protein
MFPAHPTIKEVDLKKIFTIVLGFVMTAAMADGVDQRQILPMNEMQRNHLL
SEMRMLLTGTGAILEALAQDDRAAAARHARSLGTDMPHKMEGHMDNILPE
QFMQLGMAMHQAFDRIAQDAESGKDTKHTLQQLSETLGRCTACHATYQIS
TGRQLAGQGSQKNHGEHGAHSHTY
>NE1157 hypothetical protein
MTVNSKPAIIMNQTQLKIRIHLDQSADHPKIEQNLPELPEESVFPPLPPV
YEYNWPRIIGAGLVLLFVLITSIWIAADWLSDDEKLETSSTEISLSAVSP
ASSETPSTEPVPLLPSVGFSSDQPSENNPAIGNVPDGDIGSDQAVQAPDP
QPQRAAPSASAPPVKPGIKPDITIPQAKTKNVSQGSNHSSGLIKAQLTSN
IRQRQPVDNINQISLGSKSSRPIFLFLHLNKFRGEKILINWYYRDQSIAR
IVLPVGNNDWRTYSSKVLNRNRLGPWHVTASDQAGNLLAEFKFRVTR
>NE0541 Sigma factor, ECF subfamily
MTITYSGDELFLQLRGDLLNYLCRKLSKDEAADILQEIYLRWRRQDISSI
ENPRAFLFTVALNLIRDQARRQAHVQTYANTMQHLFQDKPADYDLAQHCD
ERRQIEYLQQALDQLPENVRHAFLLFRLDGLTFAEIALQMGISSKSVARY
IQHASEHCLTALAQYR
>NE1274 Cytochrome c, class I
MNKCSHTTLPLILGVFLLIAFTNPAYAQVNVVEALIEFKNLDGLVKSLKP
EELNTLARQKTLKVFEVHEGRERQYKVYPAQSVFDQMFGPSWRIADEIIF
TCQDGYRPSIPVAKFLTYDAYLAVASADSKPFTLSNALQNHEVVALGPAY
LIWDNLKSPALLDEGASDMPYQIARIELASFAARFPGMFPPAGASAETKR
GFLHFRKYCMACHTINGQGGGKAPELNYPVSVIEYFKPGYLKQWIDNPAS
LRYNTTMPALTVAIPEREKVIEEIIAYLTAMRDVKRMRPAGNSKQ
>NE0739 Cytochrome c, class I
MKKLKLKAAVSGFLFTAILMNSAWSAQEEKPGTGFAWKDGAEIYAKICAY
CHEANVGPQIRNRELPAIYIRAIVRNGSRAMPAFRSSEIDDESLAKVADF
ISQKASQY
>NE2379 ATPase component ABC-type multidrug transport system
MTPAIEVRQLSKRYGKLVALDNIDLVVQPGEFFALLGPNGAGKTTLISIL
AGLTRASQGSARVMGYDVLSGYREARKRLGVVPQELVYDPFFTVREMLVI
QSGYFGIRCNDAWIDEILQHLDLADKAHTNVRALSGGMKRRLLIAQALVH
KPPVIILDEPTAGVDVDLRQSLWQFIRQLNLEGHTVVLTTHYLEEAEALC
NRVAVLKQGRIAALDSMHNLVRSTRDYTLELRLSTDTLPSGLGTWPGGHD
GCLHNLKIKNYTEIEKILSVLRQSGIEVLDMRLLQPDLETVFMKIMETDA
AGIS
>NE0093 recQ; ATP-dependent DNA helicase
MRHQFPEVGRLISELRDAPCEREDCQYCQTTHDPRHELKRYFGFPDFRYE
QPGESLQHDIVLAGMRGQHVLAILATGGGKSLCYQLPALNRFHRNGSLTV
IVSPLQSLMKDQVDGLLERNVQCAAALNGLLTMPERAEVLEKIQMGDVGI
LLVSPEQFRNKAFRRAIRQRQVGAWIFDEAHCLSKWGSDFRPDYLYVSRF
IREYTGDQPLAQIGCFTATAKPDVLADIQSHFRESLGIEFKVFPGGHERT
NLHFDVLPCTKAEKWSRTDRLLHEHLDSQEGGAVVFVSSRKSAEELSDFL
IGQGWPCKHFHAGLEPNTKTDIQDDFKAGQLRIIVATNAFGMGVDKADIR
LVIHADIPGSLENYLQEAGRAGRDQGDARCVLLYDPQDIETQFGISERSK
LSIRDIQQILRKLRNESNRRKGGKLVITAGEILLDDDVDTSFSADERDAE
TKVVTAVAWLERGDYLKREENHTQIFPARLDMSEKEAEKRLLKAKLPQRR
LEEFRAILRFLYGADADERVNTDQLMQLTSLESEEVASALKQLEEMGLLV
NDSQITLYVRHGVTGASSQRLQSSLELERALLQRLPELASDAGQGEWQDL
NLPALAAELKADTRQGDLLPLQVLRLLRSLADDHDANSQQRSSFELRQLN
RDYLKLRIKGGHSWRQIERFGEKRRALAGVLMEFLIGKLPPGSRNKDLLV
ETSFGELVKALESDLELPHLIAPDQRRKAVEHVLLYLHRQDILTLNHGMT
VMRRAMTIEVKKEDKRKTFLKEDYLRLDEHYREKRIQVHVIHEYAEVALK
EMAEALRLVLDYFTDSKQAFIKRHFAGREDVLKLATSEASWKSIIESLST
TQKLIVADDDDINRLVLAGPGSGKTRVIVHRIAYLLRVRRVPATSIVALT
FNRHAANEIRKRLLALVGADAYGVSVLTYHSMAMRLTGTRFERGDTVDER
ALKRVLSDAVELLEGKRNVEGEDNLREQLLRGYRYILVDEYQDIDDLQYR
LVSALAGRHAEEDGRLCIMAVGDDDQNIYAWRDTNNRYIERFREDYEASI
SFLVDNYRSSLRIIEAANQLIGQNSARLKEANPIRIDRARQELPAGGLWE
EQDKQRKGRVLRLLIDPSDRERGNLQAQAAMLELERLLVLEQGSWNGCAV
LARTHRYLWPIQAWCEQHDIPYFLAADKETALPITRQRSFVAAIDSLREI
ESALCAADAWLRLSGSNQLVEAEWKSFFQTAFEQMRGELGDCQLGSGALI
DWLYDYARELRQQPKEGLYLGTVHSAKGLEFRHVVLLDGGWSTQVDTLAD
ERRLYYVGMTRAEQTLALCEFADGNPFSRSLMKGVQQHAFQGQPLPELEL
RFQQLTLKEIDISFAGRQLPHARIHKAIEALREGDPLTLKEEAGRYQLLD
RQGNVVGRTAKSFQPQIGFAHCEVAVVIVRFAEDSEEQYRDLNKCERWEV
VVPRGRG
>NE1342 possible glycosyltransferase
MTRSVQGFKSPKLPSILFFNVNGSGMGHLNRCLAYAQQLKGRARPVFFSL
ASAIEMIEEMGFEADYFVSHFWSANASFHWNSELTFRFGLMLERVQPKVI
VFDGTWPFQGFLAACKAHGASALVWSNRGLLKEDVMHAPVDENLFDLIIQ
PGELGAVKSELPLEGGGKRVTVPPVCLLESEALLDKKQARKALNLPEEGR
FALFSLGPGNLKDVSGIGHGLIRLFTAAGFQVIWARAPISVRDVELPSGV
KPVSVYPLARYLRAFDVFIGAAGYNSCCELVQSGIPALLVPNDKLADDQI
RRAQMVAELIPAVVSPCETDTQRSEAVADLLKMLADTPAEKPEIQMNGAA
LAAEEILALLPARKR
>NE1296 Cyclic nucleotide-binding domain:cAMP-dependent protein kinase
MVSMSTYANDFYNQSREVTGTQLTVIREFRDFSASDLEMVAQLLQMQHFL
AGEEIVRYLDHSDNVFFVLEGGVRIHYFGYSGDEVILCDLSQGEFFGELT
AIDGQPRSATVVARSDSLLAVMSNQAFIRLVHESPVFCMAIMQRLAGQVR
RLTERVFDFSTLAVRNRIHAELLRLARQHMDSPNTAIIKPAPTHAELACF
VSTHREAVTRELSELARNRLIQRTGHELRVLDVERLRKMVDLARGPVGD
>NE1452 Uncharacterized protein family UPF0074
MRLTTKGRFAVTALLDIAIRHGNGPVALADISERQRISLSYLEQLFAKLR
QRKLVDSVRGPGGGYCLAKGLDEVSVADIILAVDEPIDSTQCGGKENCLG
DSKCMTHDLWAKLNEIIFDYLGHVTLKRLVEEQSQRELQRELQREQAVVK
LYDMREAAF
>NE2001 conserved hypothetical protein
MKLKSFLNYFSSREKGLAVSILLPTHRTFPDNKQDAIMLKNLVTEARNRL
QSWPDTQEAETIMEAIDKKISTHDHNYNLDGLGIFANREGVTLINFPFTV
KEQMIVDEAFAIRDLIREINGAVHYRVLVISRTDARLIEGFNSHLVHEFD
ARTELRTGSFPMENPFLFAAKGLDRAQIPNEAANLKEFFNRVDKSLQEIQ
NKPEQERLPVIIAGDARNAAFFREVCDQPADIIGEVTSIPDLRIPAEKII
VEVQGLVSDHRRQKAETALQYIAQARNNHQLLTDSSMIFRAIDAGNAARL
FVRQGYIQPGIIDFDQKIVTLQEDSATVDAAGGTVTDDVVGTLIELVIRQ
RGEVHFLSTDQLGKEAPLSLQTRY
>NE1151 possible sec-independent protein translocase protein TatC
MIPRPIPEPSPDLYTEEEVSKLVHEFYAKARKDAALGPIFEEHVIDWDAH
FVQMTNFWSAQLRGTSRFRGAPMPKHIALPELNETLFKRWLQLFRQTTLE
LGNPLLKQHADTVAEFIAGRLWMGYQMSHFPHREPADLNTSEV
>NE0233 Toprim domain
MATLLSKKQPTAHANYNNNLTAFHQEIINAGLTPPAGIIDDGKLHRFSSN
GKRGDDSGWYVFHPDDPVAGAFGCWRLGLTQSWCSKSPDTMTPAERELHR
KRVEAMRLEREADTARRHKEAAAKAQKLWANAEPADSEHPYLINKNVQSH
GLRCVNDSLIVPLYSGSEIASLQFIMPDGEKRFLTGGKIKGSYSPIGTVQ
VGQRIFICEGWATGATIHQSTGCAVFCSMNAGNLLDVGRYIRREFPHKEL
VIAGDDDRLTKDNPGRTAALLAASTLDCGVVFPPFPDDAPMELSDFNDLQ
NWRDMQ
>NE0705 General (type II) secretion pathway (GSP) D protein
MNRFQTFFIGTAMLAGLNACSTTRPPEPAPISTWKIMPAVPMQPSDAASS
TSTTPEHPTGISTEYFEAGWRGPSSSKVGKFAFADTGERFELNFNEADIR
GVIDVVLGEMLKLDYVVSPAVQGRITLRTSRPVSKASLLSALEAALGTVS
AVILKDEQTISVLPWSEAPQRVRPAQRFASAMPPTPGYAVEIIPLRYINA
NDMQAILDSFVPKGIVLQADTRHGHLVVAGSSQDRGAILHTIENFDVDWL
NGMTFALYKLSQSRLEAIVTELREIFQGEIDLFTTRIHLVPLERMQAVLG
VAQNKSDLELLREWIERLDVSKLGAQRIFVYNVQNGNAKDLVAPLRQVLT
GEMQATTASGTTVSPTSTPMSTPQMPAAAPSLSAEPSISGSQTNVGRLVA
IEENNSLMFYGTEDEFRVIQEALKQIDVLPRQVMIEAFLAEVTLNNNLRY
GVQWFFDSGENTVTLSATDVGSVASQFPGFSYIYAGKANARIVLNALQSK
TEVKILSAPKISVLNNQKASLQVGDQVPIVTQTSQGTVVSGAPVINTIQM
RDTGVILEVIPRVNDNGNVILDVTQEVSEVAQTTSSGIDSPTIQRRKIHS
VIATRDGFTVALGGLIRENGSAGNSGVPLIKDVPVIGNLFKSNTSDIRRT
ELVVLLVPHVMRNQMETQSVADALVDGLEAASRLAERARLPAPSRAQ
>NE1732 DUF174
MAADYRNPESIIEKGRARLNPPPLYKVILINDDFTPMDFVVKVLRHFFLM
NEEMATKVMLKIHIEGAGICGIYPSDIAVTKVQQVNDFSRQNQHPLMCVM
EKE
>NE0788 conserved hypothetical protein
MEIFMRTSIRKGLVALAIWVPVGMSYGASGSELLEGKDIYLPHAERATLE
ELDNATGREGVDITTLNRMNVRAFLADNSATNNVSGFNSIDNGSFVGASG
MFSVIQNSGNNVLIQDSTIVNVTILP
>NE1416 Insulinase family (Peptidase family M16)
MKISFHFSYFATLLCITLAWPLPATANSHEYLLDNGLKLVVKEDHRSPVV
IQQVWYKAGSMDEVNGTTGVAHALEHMMFKGTDSVLAGEFSRKIAAIGGK
ENAFTSRDYTAYYQQLHQRHLPMAMELESDRMHNLQLTEEAFAKEIQVVM
EERRLRTDDQAHSLLYEKMMATAFQTHPYRRPVIGWMNDLENMQVNDARD
WYQRWYAPNNAVLVVVGDVDPENVFVLAKKYYGRFSAARVPALSERKPQI
EPPQTGIKRLVVKASAQLPYLIMGYKVPVLKDPKNEWEPYALTILAEVLD
GNASARLNKTLVRETRVAISADASYNAIERGPGTFFIDGAPSEDKTVDDL
EQSIRTEIGKIIQSGVTQEELARVKAQVVANHIYQLDSTFAQAMQIGRLE
SVGLSHRDADIILEGLQAVTAEQIRKVAEKYLIDDSLTIAVLDPQPLPET
THPRNSNIELKH
>NE0318 conserved hypothetical protein
MNPNIRVWFSADELAKLSPHLVTRIPVSPRQCRERAKREKWLSREVAGKG
GPRGKKTEFLPPENILSEIHQFLSRNPDFFAEAGSSPENATQPGMYQARQ
PMSRNGKYGQSIVGNHTDLENSEVVYVAHTELCDTPGEVTEMIPDDQMIL
SIAVNPADWRRYAGLNPEHIKIIDVQGDGMKPTLQHGDQVLVDTACNRFV
DDAIYAIRQGDMLRIKRIRLRLDGSIEIRSDNSHNFGIETYSQKEAAAFN
VFGKILPFKFGKFDL
>NE2497 hsdM; type I restriction modification enzyme methylase subunit
MLQNNPELKSKIDQLWNKFWSGGISNPLTAIEQITYLLFMKRLDELDQKR
QADARDGWSDPYQSKFEGTWIPPEERNWPVAEQRPIDKRTLRWGEFKRMQ
AEEMLQHVQGKVFPFLKDLNGAESNFTHHMKNAVFIIPKPALLVEAVKTI
DEIFEVMEKDSRENGQSFQDIQGDVYEMLLAEIATAGKNGQFRTPRHIIK
LMAELVQPQLGHKIADPACGTGGFLLGAYQYIVTQLAINAGTQTLTPDED
GFTRTSVAAAFDEKRQAILASSLWGYDIDQTMVRLGLMNLMMHGIEEPHI
DYKDTLSKSYTEEAEYDIVLANPPFTGSIDKGDINENLQLSTTKTELLFV
ENIYRLLKKGGTACVIVPQGVLFGSGKAFKDLRQTLVEHCDLKAVITLPS
GVFKPYAGVSTAILLFTKVWGMKDKVAKPATEHVWFYEMAADGYSLDDKR
TKQEGYGDLQDIIAKYHARDATTDTNRTAKCFMVPRADIESENYDLSLSR
YKEDVFEEVQYDAPGVILDRLIQAEVGDVDEAELAKVQSGIVRELLELKG
MVE
>NE1669 conserved hypothetical protein
MLNIDKLIIGFDSALRTLLAPANTLRPVPGKDLPENELSEIEKRESAALM
RINHVGEVCAQALYQGQALTARNDRVRQALDQAAREETEHLAWTERRIAE
LGGRKSFLNPLWYGGSFTLGLVAGVLGDKWNLGFLAETERQVEAHLADHL
QRLPHQDVRSRAIVTQMKVDEACHATMAVSHGGGQLPAPVKVAMKFSSRI
MTRTAYWV
>NE2248 conserved hypothetical protein
MPSFDIVSEVDKQEIRNAVDQLNKEVSTRFDFKGSDARAEQTDYELYLYA
DDEFKLGQVMDILMTKFTKREIDVRCLEKGQTEKISGNKVKQKVTVKTGV
ESDLAKKIIKLVKDSKLKVQASIQGEVVRVTGAKRDILQEAIQLVKGSIT
ELPLQFRNFRD
>NE1990 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR
>NE0266 conserved hypothetical protein
MQKIATCLWFDHGEAGKAAEFYAATFPDSRVERVNTAPGDFPGGQEGNEL
TVEFTVLGQSFIGLNGGPNFTPDQAVSFMVLTNDQEETDRYWNAIIENGG
SENNCGWCQDRWGFFWQITPKRLMELTTGSDRDKAKRAFEAMMTMNKINI
AALEAAVRE
>NE1174 Uncharacterized protein family UPF0020
MTERFFAPCPRGLETVLAAELERLDATSIQASPGGVGFHGNWQTCYRANL
ESRIASRILWQIAKDQYRSEADIYDLTHSLPWQDWFEPRLSIKVNLAAIK
CPLRSLDFVTLRIKDAVCDKFRAIHGKRPDIDTVAPDMRIHGFLNAQEFT
LYLDTSGEALFKRGLRQTQGEAPLRENLAAGILALTGWQPGTPLLDPMCG
SGTLLLEAAQIACRIAPGSGRQFAFEQLKLFDARSWKKLKQTATERQHER
TFQSIYGSDLYGSALAHTRNNLAAAGLAECVTLKQANVLEISAPAETGIL
VSNPPYGVRIGDHQMLAEFYPRLGDVLKQRFSGWRAFLLTADPLLAKSIR
LTPSRRTPLFNGALECRLLEYRLVAGSMRREKQPSSESSTNQPIT
>NE0128 probable sigma-70 factor, ECF subfamily
MKYLLEPALSVNPIHSDSITELYQDHHGWLHGWLRKKLGCSHQAADLAHD
TFMRLLTREEPILIQEPRAFLTTVAQRVLANHWRRKQIEQAYFEMLVQHP
EASVLSAEECAILLETLLEIDHLLNRLPLPVKRAFLYAQLDGMSHAEIAA
ELNISITTVKRYLIKAGAQCYFALVID
>NE2138 Sigma factor, ECF subfamily
MSSSDSPVFTTFYKQYQSQLISRITRLVGCRETAADLAQEAYIRLLGHRD
LSNILSLSAYLFRIGHNLAMDHQRDPANKIEYLLLDETLPCPLLQPHEIV
SLRQQCRLLLDTIASMPQGCRDVFLLRKIDGLSYSEISIRLNISEKTVQR
RLVQAMLRCHRSMNQIVIR
>NE0035 putative ATP-dependent protease LA, putative
MPITRLEPCALGLTIDPDSLEFPDTSALIDQPLDWIGQERARQATYFGLE
MEHPDYNLFVLGETGSGRSSLLRQAMAEVAARKPAPRDLCYMYNFDVPER
PLALHLPAGKGSWLRQQLAQSVSHLLQEIPERLSGDDFKVETSHIEKRFK
LEETRYFEELNTFAESLCFALRRESGRLVFTLLDESGKPLTEEEILALPK
DRRAALDQDEQVLRSEIASYLQKIRVLEQSRDKELAALCWGWIEPLLGQV
LNVVLEGLDQSFSDHARLKNHLDNIKHDILENLDLFQLTDLSEEKNPEDL
KQLFERYRINVVVDNRDLLSAPVVIDENPSFKSLIGSIGYQSVEGVLVTD
FTRIRAGSLLRAHGGFLMLHLDDLLVDAFLWEKFCRLLRNHLLQIEEPWA
VNVSAPVVPIEPEAVKVDVRIVLIGSREQYYAIQEENPELARRFRVKVDF
AAAFTASIQAYRALSVFIAHICQQSCLPHFSREAVASVLRTCHRAAEDQK
RLSANFSHTEMLVVESARQCKTRGNLVVEARDVTTARRARILRHNYPDQC
AQEAIADGDVVITVHDERIGQINGLSLIEMGDHSFGIPSRITAHTFAGED
GLLNIEREVGMSGPIHDKGVFILQSFLSALFHHNAPLAFNASIVFEQEYY
GVEGDSASCAELYALLSALSGLPFRQGIAVTGAINQYGEMLPIGGINEKI
EGFFRVCATAGLDGTQGVLIPDRNCRHLMLDDEVVDAVSKGVFHIYAISH
MREGLELLSGLPAGISAGTDLQEIINYPQDTVLGYAQKTLRVYRMACQQP
SQRTKTLHKRSGEHIDK
>NE1749 conserved hypothetical protein
MQINHKSSQNQQGMTLVEVMVAMTIGLVLLGGVVTVLSSSQNTYRVNEAL
ARMQENARYAFQLLSRDIRMAGYRGCIGDGVAITNVLNGNTDFLWRFDQP
LEGFEAAGASSWAPALPSEVTSPLTGGNMGRDIVVLRGVESNYAKVISHA
SESADLVVEAGSDIAAGNTVLVSNCQGAAIFKATSVSDVSGQKNIAHADN
PGGSAGSNASTVLGRSFTGGEVMRISTRSYYIRENSAGIPSLYWKRGSQP
AEELVEGIENMQIQYGVDTDGDKTIDIYHKADEVTNWENVVSVRIDLLVQ
SVEDGITSQSQTYTFNGATVTPTDRRLRQTYSTVIALRNRAP
>NE0434 DUF149:Conserved hypothetical protein 103
MKANLGNLMKQAQQMQENMKAMQEKLAAIEVEGQAGAGMVKVTMTCRYDV
KRVNIDSSLIGDDKEMLEDLVAAAVNDAVRRVETVTQEKMASVAGGLGLP
AGMKFPF
>NE1461 DUF192
MLRSCLAVLLMVSSLSNTAEAEDQSLPVVKLSILDHVITVELADTTAART
TGLMYRTYLPEDSGMLFVFPVAGIHCMWMKDTVIPLSVAFLDETGKILNL
AGMIPETLTPHCSASAARYALEMDAAWFEARKIKAGDRVMQLPDTGR
>NE2512 hypothetical protein
MTKLALFVRLEAKPGQEAALADFLASALPLANAESGTTAWFALKFGPSTF
GVFDAFADEAGRQAHLNGQIAAALMANAATLLSSPPNIEKVELLAAKLPA
>NE0938 hypothetical protein
MINRSRLRRLIVISLVSYTVVLSLTVSLHGYLVNEYIEELIWESMLESEM
AYIKRKIAQDPEYDWSGLDRFHWYDEHRDSSIPPQFQALPAGMHDEVRID
GSEFAILIEDGPEGRKILALDITDLENRELMIAFAIVASTVLLITVLTLL
SFYSVDRLLRPLTRMADEISNLSPDGEGPKIPIGDKDAYV
>NE1719 putative anaerobic transcriptional regulatory protein
MHNCAIPGHPTAPQQRSSCTTCSLRELCLPVGLTDEEIEYVGNLTNHKLR
VRRGEYLLHAGADFRSLYAVKNGSFKTYTLRVDGRQQVTGFYMTGELLGL
DAISPEKHTCDTVALVDSEVCEVPFIKLEEIGRSIPSLMSQFHKIMSREI
VQDRGVMMLLGSMRAEERLAAFLLNLSTRLARRGYSSTDFILRATREDIG
SYLGLKLETVSRSFSKLQEEGLITVNNKHVTLHDLGKLNRMIGNSMD
>NE0084 Thioredoxin
MASKAIFYHAGCPVCVSAEQAVANAIDPSKYTVEIVHLGTDKARIAEAEK
AGVKSVPALVIDGAAFHINFGAGIDDLK
>NE1353 conserved hypothetical protein
MSFHVRFTLEAKADIERLYRFLAEHDFDVAERTLETIDSAWSLLEQFPFS
CRKIDDANPFLREFIISFGNSGYVVLFEIEDSNTVTVLAVRHQLEDDYY
>NE1234 conserved hypothetical protein
MTPQSAFMIAATVRVGQLQDLRTLLASMNTIPGHADPDNDLIPFGKLDRL
HFARFVIIEAKTLQEIKEFGVKPRPWRPMLAFLGDIDGDMHTFLAELVER
AESGLTKIFSHCDDFSTGNQNLLEWMKMRNVSPGANYVNWVGRTVRQIHE
EAALHHSLSDCLQKIVAEVGRENIHTLRQKLLSHVEMEKYKGRLVLSPPE
PTPSEWRTRNLLHKIGVPLVLLLFSPLLLVIAPFFALWLRKRERSDPELF
IRPAYSHIEALSEQEDWDVSNQYSVFGDVKPGLFRLLTFKFILLLTDWFA
RHVYNHGFLARIKTIHFARWVFMDNNHRVFFASNYDGSHESYMDDFINKV
GWGLNLTFSNAVGYPTTRWMIKEGAQREHAFKYTQRRHQIPTEVWYKAYP
GLTAVDLVRNSRIRQGVEIRQSDDAEIREWLSLI
>NE0742 Zinc-containing alcohol dehydrogenase superfamily
MRRIRAAVVYQKGGPFQIEDLRLEAPRTDEVLVRIVATGMCHTDMVARDQ
LYDVPLPVVLGHEGSGIVEQIGDGVRKVAVGNHVVLTYMWCGHCKPCLRG
DLTYCEHFYALNFGGVREDGSTSTCLANDTRRPIHDHFFGQSSFGTFALV
HERNVIKVPETAPLELLGPLGCGIQTGAGAVINTLKVSPGSSFVAFGGGA
VGLSAVMAAHAAGATTIISVDVVPSRLALAMELGATHTINSRETDPVEAV
HKIINGGADFTLESSGRPAVLRQAIDALAIRGTCGIVGAPALGTEASFDV
NGVMTTGKRIIGIVEGDSKPDLFIPALVELYQQGKFPFDRLVKFYSLDQI
NQAAEDSEKGITIKPIIRLQS
>NE0890 hypothetical protein
MKNFKQRLTNAAYASPLVLFAASARADLPEDVTTAITAAKADIAAAGALV
ITIVVGIKVWKWITRVF
>NE0452 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE1604 General (type II) secretion pathway (GSP) D protein
MNTTKRILILLLLALITGCAMHTTRNPSFLEGKRLIAEGRLEEGLGKLEQ
AAREEPDDREVGATLARERDTIIGQILFEADNARLAGNLDQAEQNYQRVL
NIRASSERAREGLDAVDLERRHIVLVNQAKEALAHGDVDGAEKIVRGVLA
DNPMQSEARQLIKALSEQIARAEMSELSLITEFSKPITMEFRDTSLKTMF
ELISRTAGINFVFDRNVQQEATASIFVRDNSIEDVLKLMLLTNQLAYKVL
NSNTLLIYPDTPAKQKDYQELVVRSFHVANTDVKQMVAMIRGLVKARDIY
VNEKLNLFVIRDTLEMIRLVERLVAINDFPDPEVMLDVVVLEVKRDSTLD
LGPNMPTSVTFSAVPGIGTPAQAAADSVVKMSMIKNTGFEGLRSFTISNQ
ARIDFGRTLTNADVLANPRIRVKSREKAKIHIGNKEPVFTVTNTANVGSA
ASATYIDVGLKFEVEPVVRTNNEVTLKALLEVSSNLGEKRSGSGENAATA
IVIGTRTAETVLELRDGETQVLAGLIQDDLSRVRSGIAGLTDIPVLGDIF
SKQVRKHNKTEIILLITPHIVRNVIQPNHFESEFYSGTASAAGKLPVTIR
KTAPRSLAMESSNSASDGGYMSLGRARSAGLFGAAPDLFEEAVTQAREAS
PSLTLRVPESVAGGREFTATVRLVTQDPMLSGELNLTYDPDTLELVDGGE
KSGMRSLKLGRDQSTGMTAVLRFKVISTNATSTEIALQNLQVRDETGQPV
EVNLPPAASIRIQ
>NE1578 conserved hypothetical protein
MEAVMTIKFSEDVIPLADLKVNPGRVVSRVKETRRPVLLTSRGRGVAVVQ
DLDEYEKSQEELAFVKAVAQGLMDIKEGNTMSLSEAKKRLGIE
>NE2002 hypothetical protein
MKKTAFILSVILILLVAVMALIALFSEIEGDNVRHGIMGAGISTLPLVAY
CISVVRVCRGWYLVAATLNGLFFALTVVSIVIILMDDPSTMKNLLAVLLV
LLVPLTLNIFALIHIRRTDSRLMPHPDIPGAAGGKNLEGLGGWLILVGSN
VVLSPFVIAARTYKSYAEMFASGVWDVLTSPDSMAYHALWAPLAIGEIIL
NSALILAWIYIAFLFFSKRRAFPFWFIAIHIATVCLIVIDAIVVHHILPD
APIFDANTLRELSRPIGAILIWAPYMLMSKRVKSTFLH
>NE2162 Esterase/lipase/thioesterase family active site
MENPPAEPFFLDASPGKRFCLYHSPAGDTPLNQVFIYLHPFAEEMNKSRR
MAALQAKAFAAMGFGVLQIDLYGCGDSAGDFGEATWEIWKNDVEFAYQWL
IQQGFTSVHFWGLRLGALLALDYAGKAETGSAKFILWQPVINGKSFLTQF
LRLRLVNKLLSDDSDKAQNVHLREELRAGKSLEIAGYTLSPAMAAAIDEL
KLGQLVVGNSEIYWFEITPEAGRGLPPAGAAVVEAWNQSGVYPEVTLIPG
LPFWATQEISECPALLAATAKLFAGIQP
>NE0192 conserved hypothetical protein
MLDKNVLNEIGSKVNEILASSPARDVEKNMRAMLTGAFARLDLVTREEFD
VQQEVIKRTRIKLAELEEKVRKLEQQLQQPAEVVSSNETGDACCETQP
>NE1629 hypothetical protein
MKNTQPYSWWLLPCLLTTLPGCLSPITLHHAVSAYDDAITSTISRQLLTN
IARARHHQPIHFTGVSNVAATFDFRFSAGATPALGGLAGTTLMPLFGGSV
AENPTISIVPIEGEEFTRRLLTPFQQNKFMLLLRQRFDIDLLLRLMAQEV
RIQESTSQTTYRNTPSDMTGYETFRKVVLHLSAIQDRDQLYAEPLNLEYD
WTLPAAAVSAEGFHTLAKEFVVHHDRQNDLFILHQKKQGPILITNYDPGI
LSEKERAQLSKEAEGWEPNDVAFDIRPGGMGGEWPMKGIFRLRSFHAIIS
ALGRSLSDEPEYHVEKDLRTLPVSRDENPVATMALLVTDTPSPNTDLSIR
SHGRHYAVDTQGQQARWNRDAFQMLYLLFQMTVTDLPRTGAPGITIAK
>NE0896 Helix-turn-helix motif
MLRIFMESLPSVFGDCCLKVGSRDSRILTTSGALRLLIFDHRVNRVFIVS
ASSFTRGMAMRKRHYKRKPYKPVNYDTIRETRLLAGLTRQQVADMLRVSL
RTVQYWERGLVHMPYAAFRLLRILTGYELPGKEWKNWMLLRNKLISPEGK
TFTPGDLYPLNWTIDKARMWDKEYRRRVGSGETALVPFALPGNAWRGWMC
HDGRLLSPDGMAFTPVHLNFLASMVDRICGEISSVPVKVIQGDTP
>NE1071 probable sigma-70 factor, ECF subfamily
MCMTDDEEGAVPATDLAAHRIHVLYTDHHGWLYGWLRCKLGNACDAADLA
HDTFLRVMASQRMPVHLGEEPRALLTHIAKGLVIDHWRRQDVERAYLEAL
AHLPETEIPSPETRLLILEALCRIETMLRGLPVRTRDIFLLAQLDGLTYR
QIADQTGFSLITIKRHMRTAFVACLMLD
>NE2566 Domain of unknown function DUF81
MEIQWLAILPGVFTGLVLGLTGSGGAIIAVPLLVFSLHTTIAEAAPVALL
SIAVSAAVATCNAFMQGIVRYRAAALIASTGMLVAPAGIWIARQLPDLLL
TVVFSAVLAFAAGYMYRQGRRSAQPAPPEEAVYPPCQLSLESGRLIWTIP
CAKGLLFSGVATGFLSGLLGVGGGFITIPALRKVSNLPMQSLTATALAIT
TLISITGVVSATSMGFMNWPLALPFTVGTVIGTLTGRRYAHRFDEAKLQY
GFAILAWCISLGMIVKVVYSIDFSALS
>NE2533 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE2526 Restriction modification system, type I
MIERDLKPGWRRVKFGDVVRQCKEKADPETSGLERYIAGDHMDTDDLRLR
RWGEIGSGYLGPAFHMRFKPGQVLYGSRRTYLRKVAVADFEGICANTTFV
LEPHNPNELLPEFLPFLMQTEAFNDFSVKNSKGSVNPYINFSDLAKFEFV
LPPIDEQQSAIALLSAATDQCHAVEAAHRAAGRMLQSFKDSLLLRKTSSL
ANSFLLGDLLLRSPESGCSAPPKDADTGYFVLGLAALSRDGYVSGDFKPV
EPTSKMVAAKLSMGDMLISRSNTVDRVGFVGIFSDNRDDVSFPDTMMRLQ
PNPALVHPHFLEALLQTTSAREFLMRIAAGTSASMKKINRANLLQMRLNV
PDLDVQEMALDELQQFKNAIATQKARWDAARQLTRLIAMRTIGGAA
>NE0246 TRAG protein
MQPRNQITNPFVLLVLSGLCGWGAWETRDWLHSDLLAVPVFLALGAVLTL
IKGVLYSLLYLIRLFFRKRAMKPTDRSGSAAWASSRDIEKMGLYNRRGGF
LAGVHENRPVFVDIESSGLVLSPAGGGKTVTFVIPALCHNPISMFVPDLK
GTLACMTAELRRKRYKHEILCVNPGGLYQDILGAGARYNPLQILIDSWNN
PDRHAQLLSDAQSIAKQLCQDPVQQGENQYFRNGSRKFLVFAFVYQVTNS
NHATLADALCLLSDKEALLKALQDAQYSAHLNGDLARSAKDLLAKFDSGD
QKQIESFREGAIQPLEIFSPSGKLAECTSTCDFRFADLKKKKMTVYLLAD
PTRMAVYAPWLGLLSWCALTELIRFQNGKSVCFLCDEITNIKINDLPSLL
TLAREFKIKLWLIVQELEQWAHVYGRESLETLLSQTEVKIIMGVRSYKTC
QLISDMLGESTIKALNHNLGRSLFDAPTRSIQDAPRRLMTADEVRRTSDT
ILFMGNHRPIHLSKIGYHEVKPWVKWVGVNPLFNKPFKGKTRLWL
>NE1088 TonB-dependent receptor protein
MLFAPLVHSIRLLSGRVRYAVLSLLLLIVAATTFPAQAAENTLDTPDAAA
RKHYDIPAGPLGRTLSHFATIAGIALSFDPALTEGLTSPALTGNFSVSEG
FNRLLANSDLELIRNADSSYTLRRAPIKAAQAGVTALPVVTVSTSTENAD
DTGYVAKRSSSGTKTDTPLIEVPQSISVVTRRELDMRSVQDMSEALRYTP
GVVVDQWGFEGRGYEYLLLRGFQGLTTSLFRDGLSMAAQGLYFGSFITDT
YGLERVDVLRGPTSVTFGRGDAGGIVNRITKRPSADPIREIELQYGNFDR
KRIAADLGFANNDRSLMFRLVTLGLDSDTQVRYPNTGGDRAGLRRFYVAP
SVTWHPTADTTVTLMGEVLNNRAEASPFFLTTPDGRHTDVVQTDSKFVKY
ATNQSSFSYQIEHNFNETFTARQNFRYMQQDGNFNDMFRAGFDADIPTLM
QRSAMTTKERLTQTVLDTHLQAGLDTGPLNHTALLGVDWNRTNASLRFFS
GEAPGIDVFNPIYGISVPVPDSLDIDAKQEIDQLGIYLQDQIRYGENWIL
TLSGRYDRVDTKDADFLSTTNTRNRDNAYTGRAGLTYLFSSGVAPYVSYS
QSFLPLPGLDSSNRPFEPTRGTQYEVGVKYQPAGGRSLYTVAFFDLTKTN
VPTPDPDDPFFRNRQTGEIGSRGVEVEARAELLRGLNFIGSFTYHDVTVT
KSTDIDKDKMPIQVPHLMTSAWLDYSLGNLGVNWLRGFGIGGGVRYVGRI
FNDLENTSTTPSYTLFDAMLRYDNGPWMFTINASNLFNNKYLSYNDPGWR
AYPGLERTVLGTLKFRF
>NE1757 hypothetical protein
MHKPSVHDFSQPDTGNQAVPVYLVAGYLFFVIYGSLVPFSWNGLAFDTAW
ENFQHIPLLELGVASRADLVANLLLYIPLGLLTCGLFTGRSRDPLVLATG
IVLSLLFSTVIALAVEFTQQFFSPRTVSLNDLYAEFAGALSGALLWPLIG
MRLQKITLDILHGGKNARHAALLTYTLVYLILSFFPYDLLLSYDEWQQKL
TSGQMGWLFAPSCGNSCTWKLIPEALLVAPLAIFLFQPLQRTSLILAATT
GIMLGVLIEGLQLTIVSGISQGASIISRMAGMMLGTALIQISPRVNWYWL
HPYIRPLLLLGLIPYLGVLVLINYRLSGNWLGLTEGIERLKEIHFLPFYY
HYFTTELVALMSLLLQAGAYLPVGIGFWLWHRTKLSFNSQHYCISHPGVT
AGILSCVIEAGKLFSPDTHPDPTNVLIAAVSASLACYLLNLLFPVKSEVS
PNNGKLNIPQTTDSDHKVISGSNGKAADQINDPEKITSQHNRQSPPPTFS
ATLSRTETSAFTGASEQQPPPSRRPGWPAIAGTIALLLAGIAAITSPLGA
LWVSFPLIIYSALLWWRPDLWLVWVLAFLPLLDLTPWSGRLYWTEYDTLL
LATAGIGYLRLCFHPYTQPILRRPAALALALFVISTAISLGIALFPLSPL
DQNAFTSYYSSYNGLRSVKGILFALLFIPLLTREWNAPETAARRLALGMS
LGFAAEILYVLWERITFPGLFNFETDYRITGSFHGMHTGGAYIEGFLVLA
APFTVLWAWQQRNTLITLLTIGLYCLGAYCVMVTFSRGGQIAFGLVTVLL
ATGFSGFVLRRRTRIFSSISITILITVLAGTIAWPILSGPYSQSRFATVE
KDVVTRADHWQDSIDIALVRGSPVFGMGSGSFPYAFFWYSSVPARPSTYT
FATEDKNTYLQLGSGETLYFEQPVAVEPGQRYMLTMNLRSQAPNAAITAP
VCEKALLYSFRCFWLSLRLDKVPPGKWGYYEAAISTSEFNTENSRVRRPA
KLSLYNGQPDTTVDVDNISLKDAEGNDLVLNGDFSDGMSHWFFSTDSHLP
WHIKNLFLHIFFEQGWFGLACFIVLTGYLLTRGLIRTWHNDVLHLVLCIS
LGAFLAIGIVDSLIDEPRLAFLFYLLMITGSISDTHPLPLLSGNRTRNTA
>NE0946 SUA5/yciO/yrdC family:Sua5/YciO/YrdC/YwlC protein family
MLQTDNLIPDFINKQITQAVMSLRSGNVVAFPTETVYGLGADISQPQAIQ
KIFEIKGRPSSHPLIVHFANLSELEFWAENIPATAWILAQHFWPGPLTLI
LTKSNHIPLSVTGGQETVGLRIPRHPIALKLLEQLGSCKALAAPSANRFG
FVSPTIADHVKNAFNDKIEVILDGGPCEVGLESTIVSCIDKDVTILRPGG
ISVLSIENLLQQKVSVKSQNKIRTPGSLSSHYATATPLEICQSSESLFSR
SFDLLQQGIRTAVIIRSIHLIPMLNSDDKFHYILMPNDPVAFGKDLYATL
RKLDDANFERILVEAPPDDLEWLAVADRLSRASYSFTSK
>NE1579 PemK-like protein
MAKILRGEIRWANLNPTVGREQSGERPILVLSQDIFNERSGTVIAMALTS
QEQRAGFPLTYEILKSSLPKRSWVKISQIRTLSTERIGKKIGAIAPEELA
QIVEGLNEIIGS
>NE1338 hypothetical protein
MSIFEAAQIVWQELVTRIARSEWNLLKHHDLLSNKLPVLFRNRAFLFIRS
PLVAARSIPFHITKYFWCSSTKYGVSVRP
>NE1233 possible peroxidase, putative
MKESGNFEFDDVQGLLRFGYGKLADTCFMLLKIADASAARQWLSTAPVSS
AIAIHPPPDTALQIAFSVQGLRALGIDESIIDGFSDEFVHGMTRNENCSR
RLGDIGHNAPKYWKWGEGADVPHILLLLYARKGGLDAWKATVEGEYFSQA
FQLLQYLPTSDMGRTDPFGFEDGISQPAIDWADRHDTDSHANDRYTNLLA
AGEMVLGYPNEYGQYTARPLIDLQKDRFAVNLPDAKDDPARKDFGRNGTY
LVLRQLEQNVPGFWQFLDKVSDSIPEKRERLAASMVGRERNGTPLIAEHI
PGILPKDHGNHFTYDLDPKGNHCPVSAHVRRANPRTGDLPTAVTGTGAIA
LINRLVKILGFGQKPEEDLIASSRFHRLLRRGRSYGPVLAPGDAVKPDAP
AAERGLQFICLVANIGRQFEFVQNAWIVNGKFGGLQEEPDPLLGNREPMI
NGNSTDHFNRPTSSGSMQKTCHLPQFTTVLGGGYFFMPGLRALKYISALP
ANGNGGSS
>NE2284 Chromosome segregation ATPases
MRLTEIKLAGFKTFVDPAVVPVPGNLVGIVGPNGCGKSNVIDAVRWVLGE
SRASALRGESLQDVIFNGSATRKPIGRASVELVFDNSSGKAAGQWKSYSE
IAIRRVIQRDGESSYYINNIHVRRRDITDMFLGTGVSGRGYAIIEQGMIS
RIIEARPQELRTFLEEAAGITRYHEKRHETGLRLADARGNLQRVDDILEE
LDKQQQHLEAQAECAVRYQDLHRQLTAAQHTLWTLHKQQAAESRHQAQVE
VERLVQEMETIRSGMQETGEKLDELRLYHRVVNDRQHQIQGALYAANGEI
ARIEQNIRHIRANREQLDRQLAEAELQLQNHEQQLNEVNENLTSWQDKLE
QAKNCHLSCKEEHIVEAGKLPQMESAAQADQVRLIGLREKLALARQNEKL
LQNQQAYAEKTLHQLVVRRERLLEEQSSQPEIDPVQLDELQMESAELAAM
LEQKQHSLSTLETQAYTVQQERDAILQIIQSLERDMARANARCDVLQRLQ
DQIEDNQELNAWMARLQLNLLPRLWQQISIESGWETALEAVLRERIQAVT
VDRLEQMLEWEAQRPSAKWAVCELVPDRPVSNDTDQRTWKPLSTLLSCRS
PAVQAVLGNWLYGVFVTDSLTAALADRSLLTSGEMLVTAEGHSITSCGVS
YFAPDSAVHGILQRQREIAQIREECEQIGQSVLSQQQTLAAVEQDDQQIA
ADIVQLRKVIEEIRTQQHDRQIQIVRLTQQIERVAQQQAQLEIELADLAV
QIEEESSQKQQAETELVVCEAERVELEAQVNQAESACQSSGRALASQRSR
VQRLSDRLHETAFGEQDCQNRVIDCERRIGMITRNSVILTENMQRLQHAR
ADLDESTLTADTVHWQIQRDQHEQALMAVRHELENMDDTLREMEQARMYA
EQRLQECGEAVGQARLREQEAEMTEARFADKLAESGETMLEMVPQFVNES
PAKLQTRINRLTGEVTALGPVNLAALQELEALKSRRIHLEEQSHDLREAI
TTLEQAIRQIDRETRERLQETFDQVNQNLAGLFASIFGGGVAELVLSGED
ILDAGVQLNAHPPGKRNSSIHLLSGGEKALTALALVFSLFRLNPAPFCLL
DEVDAPLDDSNSVRFCELVKRMSGETQFLFISHNKITMQMAQQLIGVTMR
EQGVSRVVTVDIGKIMAAGDLPASV
>NE1126 Orn/DAP/Arg decarboxylases family 2
MTHPKHAPMTQFPIRNHCLVIGDLPLTRLAQRIGTTPFYAYDRKLISARI
QFLRQHLPAGIHLHYAMKANPMPAVVQHLAGLVDGMDVASVGEMKVALDA
GITPENISFAGPGKSRQELICAIAAGIVINVESEYELETIAQLAEETGFQ
PKVAVRVNPDFELKSSGMKMGGGSKQFGIDAECVPAVLQRIAQLRFDFEG
LHIFSGSQNLRVEALREAHEKTLQLGLQLASTSPVPVRAINIGGGFGVPY
FPGESPLDLNAVGTNLQRLLAEIPSQFSAIRFVIELGRYIVAEAGIYVCR
VLEKKISRGQTFLVTNGGLHHHLAASGNFGQVIRKNYPVAIGNKMKIDET
EIVSVVGPLCTPLDLLADRMELSRAEPGDLVVVFQSGAYGLTASPTAFLS
HPASLEVLI
>NE1031 putative iron transport system ATP-binding protein
MSTLLELDRISHAYGAQVIVNELSFELEKGEIGCLLGPSGCGKTTVLRCI
AGFEPIVSGEIRLNEISVSRAGFVLPPEQRRVGMVFQEYALFPHLTVTAN
VGFGLHRSTKAERAHRVAELLQITGLMEVANRYPHELSGGQQQRVALVRA
LAPYPDLILLDEPFSNLDVSLREYLGQEIRELLKKLNITAILVTHDQAEA
FAVADKIGVMHAGKMMQWGSAHELYHHPANRFIADFIGQGTLIPGKVTHS
DKVETELGTLAGNIYHPGHSSREIAGSQVDVLIRPDDVIHDNDSSLRAVV
VHKAFRGAQFLYTLRLASGQSVLSLISSHHDHAIGEKIGIRPQTDHIVVF
DRVNSQ
>NE0561 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE1659 conserved hypothetical protein
MRHKDKQKLGISLLLLFTTLPAVVHVYAGSWVRPPDDIDIFGQMQTVTAS
REETLLDVARHYGIGQDEMVLANPNTNRWLPEDGAEVVLPLRFIIPQAER
IGLVINLPEMRLYYFPKPAKGQKPEIITHPVSIGRMDWNTPLGRTTIVRK
QKDPTWTPPQSLKAEAIAEGKPPLSDVVPPGPDNPLGRYALYLGLPGYLI
HSTNKPFGVGMRVTHGCMRLYPEDIEELFNLVPTGTPVQIVNQPVKLGWQ
ENLLFIELHPPLEEDDTTPYDYEQKVHSAIAEFLEKTAKDPDGRMTRNTR
ISPEALESAIRARNGIPTLISENLEN
>NE0844 Protein of unknown function DUF48
MTQARLCLSVLYFPFSETTPKDLDQVRGIEGDAAKTYFSALPYLVRKDIR
EFFTMDGRTRRPPRDRFNAMLSFIYSLVMNDCRSALESVGLDPQIGFLHA
VRPGRAALALDLMEEFRSFMADRLALTLINRGQITDQDLLVREGGAVHLE
DKARKTVVVAYQERKQEEITHPLLETKVPIGLLPQLQARFMARVIRGEMD
GYLPFLVR
>NE1509 Domain of unknown function DUF34
MHQNMLENYLNDLLGIQQFRDYCPNGLQVEGRTSIQTLVSGVTASQALIQ
AAVGLRADAIIVHHGYFWRGEDACLRGMKRHRIATLIKHDINLYAYHLPL
DAHTELGNNTQLGKKLGIVEAGRFGEQNIAAYGHFPESVSLDELSNLLNS
VLGRKPLIIGDPLKPVRRVAWCTGAAQSYFEEAIRLGVDIFITGEISEQN
VHTARESGTAFISAGHHATERYGIQALGEHLSQKFSIAHHFIEVENPV
>NE1283 conserved hypothetical protein
MFKTDMWDTFLQAIALMLILEGIFPFSFPGAWRETFLKLMQLEDSQIRFV
GLSTMLVGLVILFLVN
>NE1337 hypothetical protein
MIGRNRYAMLTVDTEALPKRAVQDHIKRLMWGEHDGGCAGIREMCAIGNE
CGVKQVFFVDMCGAYACLDQTLDVVRWLDQDGQDVQLHAHPEYLPEQFWK
EHGFKYRPRFLNQYGIEKATFTIKYFGKLISDLTGKPLRAFRAGSFRWNA
DTLRALQEAGVSLSFNNSMNARLKGQCTYSEPTNHSYLWSNGVIEVPVTE
RKFFPLFGKEWWGRLQFPVGDWLGSPPWRVLRPYTVGADPSFLVVLLHSW
SLLYWDKDGYAVYRDDKRIEDYRKLVRRLARDYDIITTADFLDLYACGKI
KTTHTADLSLAEFKAVKK
>NE2058 Cytochrome c, class IC:Cytochrome c, class I
MEIVAILARWSQLVANLVLLGSCLYLAIAGTRRELFDNAWVIRLEKLFPV
LTAIVLVGLIGVLATTTGKATGIESNTWKLDAWIEIVRHTNMGHMWALRA
ISALFLMAVSVFIYVKRERARWHYVMGACIASLPLVASAMVSHAGADENF
MVYVPIYAIHILMAGAWFGALPAFLLIVFDRHNCDAKGIQLVLNVDSLKR
FSVIALPIMVAIVLTGLILTDRMVEEDYHALVASTYGWALVTKIALLIAI
LFIAYQARSKWLPSFERVCDSIDRSSLDKEEKINQSFFSKWVHKTTQVEN
YGCAEEGAPDSGVARLRKWVRVEFVLALLLVLFATILSNSVPAKHAMVEN
WPYPFRFTIDGSIGAGAWNDPTVQFFLGLGVVLLVAAFGLFWLGKKKDWN
FRKRALVTSVLVVSGIAAILPQFAVVAYPETYRNTPVPFDAISISNGSVH
FSEFCTSCHGPQGKGNGILAKTLSMPPADLLTEPHTARHTAGDFFHWLGY
GIENTGMPGFINSLSEEDRWDTVNYIHAMSRGYQARLLGPAVIPDKPSMG
PPVFQFTAYDGTTGTLKDFRQTANVLLVFFSWPNSLDRLKELSESYEKLR
SLKTVVLAVPLDGYSEAAVAGFAADTPLEVVRDGWFEVKNSYILYRRTLK
YPDILGSGNIPDHMEFLVDRFGYLRARWIPSLDQSGWSDLNLLTSQLLQL
NQEKEILPPPRDHVH
>NE1557 putative transposase
MTYPISFRRKVLSVREKEGLTIAQTAARFCVGIASVTRWIKNPVPKESRN
KPATKIDMAALAHDVREFPDAYQAERARRLGVSEKGIGHALRRMHISYKK
NTAAPQSGRRQTAHLPGDD
>NE0724 possible A. fulgidus predicted coding region AF1619
MNAMNDKWYSGMARTEDWWAVWLGLIMFMASVSSLWGWELTGWMAKPDTW
VWEKFSIEGVLKSSGKPEWHPALSLLVTYLVFTALTCLAAWSMKFDLKQF
FIGWTILFIMTWVIWIIGNEAHFKASVYEMDKYGLSWGLSLGSGFSYLLA
LLVGLVIGNFFKNTARFLNEAAKPEWFIKTAIVFLGIKIGVMSIEAAGFI
TELVMTGVAATFVAYMLFWPIVYALGRRVFHLSRDAAAVLSSGISICGIS
AAIATAGAIRARPALPAFVSILVVIFAMFELIILPGFYTAIAPEQPIVNG
SALGLTVKTDGADAAAGAMLDELMRANAEANLGIVWKEGWILMASLTTKI
WIDMFIGVWAFVLALVWVYKVERKPGQSKIGLMEIWHRFPKFVLGYLLVW
FSYIMLASSGSEAAETLHKAAAAVEGPMRNMMFMLTFISIGIITDFSKLK
GMGKLALLYAIALFGIIAPIAYGVAWIFHRGMMPPVL
>NE0371 Glycerophosphoryl diester phosphodiesterase
MVHGVPCFVAHRGYPALYPENSLVGIRAAVQAGARYVEIDIQLSRQLTPY
LCHDDNLKRLTGRNAYLTRLKDDEIDTLTVSCPDVTCSSGTEPAPIPRLA
EFCRYLAQHPQVTTFVEIKSESIARFGLSKTMDAILPVIDVVHTQCVLIS
FDWELIALIKSHQMYKTGWIIEHWTDEQSAMAARLQPDYLFSSIRCLPEK
LDQLWPGPWQWVIYTVDDYATALGYAEAGITLIETDTISTLLAP
>NE0974 PemK-like protein
MTYLPNRGDIVHLDFDPSSGREIKGPHFGLILSGKLFNQRGLAMICPISQ
GAAAAARTYGTVVTLMGAGTDTQGAVHCHQLKSLDWQVRNVRFKESVPQH
ILDEVLARVEAILFE
>NE0317 possible response regulator
MVVQIHSFGPLAVEVAGESVIQPGRNQKKILELLATIIALGGRNVNGNLL
KEILWPDAEGDLADLSLGTSLHRLRKLIGKEAVLLNTGMVSLNDGCCWLD
LWIFETISCKLENVLKCSDQQPLATELVDQLMTLYRGTFLKNYDSGWILL
KQEQLLDRLIRLLNLSADCYEQHGENERTSQLLSKILELRPLSEANYRRL
MQHYIKLGWTDQALHIYRQCQRILSGGFNIPLSSEIHRLAKQLQTGT
>NE0819 conserved hypothetical protein
MPTVDQSFPSFFNDAPTVTLQDPLARFLGAAHDGIMEYQYVDAVRLAGHS
CPTVAGTWLMTVHGLRALYGDALPVRGEIEVYMADARDAGTTGVMATVAQ
LVTGAAPETGFQGIGGRFGRNDLLHFDQPMQGSIGLRRKDTGAAVQVELD
ASVVPWPDEMRVLLPKAVSGQASTAELQRFGELWQERVCKMLVDHADDVN
LVRVSNWAVD
>NE2380 STAS domain
MHRCGESSVAGTDARIRLEGNRLFVGGPVTYDNVVEVIRTGDAAIKADDM
LIDLAGVTWVDSSAVSMLLEWMRTAQTYDRRIEFINLPSNLADLIELYDV
GSLIPTDKPAESV
>NE1519 conserved hypothetical protein
MKLFKSIPNPKEIRQKLGLNQHEFWSKVGVTQSGGSRYESGREMPKAVRE
LLRLVHIEHIDLTKIKRDDLIVAAMLKAQYPDLYKNLKKSAKQLK
>NE0410 Ribosomal protein S17
MSSDNQSKTLIGQVVSDKRDKTVTVRIDRKVKHPLYGKIVTRSSKYHAHD
ELNQYKLGDIVAISETRPKSKTKAWQVVRVIKVN
>NE0016 hypothetical protein
MTYTFKLLTGLTLSVMLTGCMHTPVQPVDEPSEQAEIRIIQESGSDLSEL
MHYYDSLQNKSRVELWEEYKYANSHYRESTDMQQRLKLLILLLLPNTSFQ
SNRVALNLVEDLPEQAETTPDTTAFKNLLVLLLKRQRAANLQIQNLSEKL
RSAETEVKTLKNKINAIKIIEKDLMRNNTP
>NE0348 DUF206
MRVLPYIHSTSVVDATVSPRGKHLVTRAMPGLHLIGDLFGCRNGADLMTD
TAALEAFCKQAVADAGLTAVGSLFHSFGPGEGVTGTIMLAESHLALHTWP
EDNYVTLDVYVCSYSCDNSAKAECLFKTLMQAFQPADPHLHRVVRA
>NE0978 TonB-dependent receptor protein
MNPHHHRSTLFPVCKSCVLITRLVFFILGTALILSAIPAQAQSEAGRQTY
AIPAGTLKSVLVSFGQANGVMISYTESEIAGKRSSGIHGSFTVLEALNVL
LTGTGLEASKRVEGGFRLRPQTPVITGQTLPLMTVSASADTLKQGTVEEG
YRVSDISSIGPWGKRSLRDTPYSMSVVPSDLIENVNAQNMGQIFKMNPLT
QEPAIQNITGLPLVRIRGFETINPVFDGIPLAFRTGSAVSVVEIDRVEIL
SGTSGFLYGGGRVGGAVNYVTKNSTDTPLRRLTFGNYGGTQFYGHADIGG
KFDDQGRFGYRANLLYQGGDTAIGEPVNQKVANIVLDGHLTDTLYADIKY
TYNDYRKEATQPIFNNIVERIFIDTSKRYSPKWASNDLESHKAYTSLRWD
PSKYLTIRSAFSYQYANLRADQISLNEQVDRTFIPRTVNYPREKHENYGG
YIYLDTRFSTFSVDHKLTLGYSSTFYDYFRTVAGEESQTGTTSFSLSALK
HIPEPSWRSSGDTSMISVYVARYDNVLIGDNIIFNDQWSAMVGFNYATAI
NKSSYPGNNVIKYDKSKLTPTVSLLYKPIEEVTTYASYMEALEAGLIVGN
TYANRGTVLPSLVSKQYEVGLKYDINPNLLLSAALFRIEKANQYSDNATP
IPTYVQDGLQVHNGVELVLTGKVTDNLTIIGGGTLMDINVEKSNNPALEG
KKPVNAASRMGKIYAEYALPWIPGFSLTGGIYYTGEQYGNTTNTDKIPSY
TLYDVGARYATRVLNKSLVVRLNVINLTGKNYWQNANYLGVPRTVAFSVS
MMF
>NE2544 Helix-turn-helix motif
MYAAYQTNGPARPIARTAAFTKEDDLLVDMARQNNLRGRILTYCVAGTFG
LLSPNYLHASAATANWDINQIEISGLSESSVGAISESARDIAHIRAVTKI
SVSELSRVFGVSRQAVHEWIKGGTLSQRNAQRLSELARATDVFIEAEIEA
SPQVFRRKIAGGQSILDAVRENGNAVELAKTLAATLVRESQQRQRLAARL
AGRSATPGTSTEFGSPHLDEDA
>NE1470 conserved hypothetical protein
MRSTTLLFFCSNLVWISPLTWAQTSQQPDNLIPLPEIPESPSAGEENGLP
PELGLDPSLEPEITIHEGKDKTMIEEYRVNGELYVIKITPRIGKPYYLLN
RRSAVGMPHRGDMESGVSVPMWQIYRF
>NE0631 conserved hypothetical protein
MNALESFLYEISRFFLTPVLIILCIMFVLSLFALGMLLFDLILRSLHFSV
RQSLQHYRERHPQANTEAIELHLLKLLEPLRLTSRVAPMLGLVATMIPMG
PALVAVAAGNTQEIAENLVVAFSAVIVALVTASISYVVLSMRRRWLLTEL
NILLNGNQRSAFIADSNNAHVITPQEAIHG
>NE2222 3Fe-4S ferredoxin:4Fe-4S ferredoxin, iron-sulfur binding domain
MLDKNSLTEQIDAVLPQTQCGQCGFDGCKPYATAIVEGHADIDQCPPGGD
AGVAQIAAILNIAPKPLNTTYGHPKPPAIAVIDESQCIGCTFCLRACPVD
AIIGAAKHIHTVLTELCTGCERCIAPCPMDCISMVSVPAPATPEIRQQIA
DGARERYHLRLLRLARQQQRTRKPEQNKPKEATIPTAQQATDTKQAAIQA
AMERARAALARSNIPNKD
>NE1603 putative general secretion pathway gspG related transmembrane protein
MLQTASCRGFTLIELMIVVAVMGILASAAMPLGEMVVKREKERELRIALR
QIRTAIDAYKQAADEGLVEKKADETGYPHRLEDLDTGMDDVKDPDSKKIF
FMRRLPRDPMFPDTEVPAAETWGKRSYASPPDSPEEGDDIYDVYSLSEKA
GLNGIPYNQW
>NE1606 possible transmembrane protein
MARIALENWLVRARWQVTRLGTVGRAGAGLLVLTLVFFIAAVMPQKERLK
ELKSKVQVMQQAQPDSAGQTKLNNNQALQVFYDFLPRSDSSPYWISELDR
IAKDSGVELNSSDCRLKVEKESKLVRYEIQLPLRGTYPQIRAFIASALQA
VPALALADIIIRRETIQAGRVDARLNMHLYLNDY
>NE0007 hypothetical protein
MKTSITQVVFLILFCVLPQTTMAQRNMPQSYPVAASEKLVNGIANAVTGV
IELPKTVILTSRRDGPAYGLTVGLVTGIMHTIGRTVFGVLDAATFFIPTQ
PTVRPPYIWQDFDKETTYG
>NE0774 pyridine nucleotide-disulfide oxidoreductase, class I
MHKFDVVIIGAGSAGLSALREVKKHTDNFILINEGPWGTTCARVGCMPSK
LLIEAANAFYRRVSFNEFGITGADQLGVDHKRVLQRVRRLRDDFVTSTLG
ATRELGERAISGRAHILSPHQVMVNGEKFHTRKIIIATGSRPVVPEPWQS
LGKRLFTTDTLFEQEALPERIAVIGMGPVGLEMAQALSRLGVRVTGFGSG
RMIGGITDPLINQAAVTLLSKEFSLHLGTRADITVNVDSDVITVNSGDIR
IEVDSVLAALGRRPNIDDINLEALGVPLDERGLPPVNPDTLQIADLPVFL
AGDVNQRSPVLHEAADDGHIAGLNATREKLVCFKRRVPLTIVFSDPNIAV
VGKSFRSLEHEDIRAGEVDFSRQGRARSAQSNRGVLRVYAAANSGRLLGA
EMCAPAGEHFAHLLALAIDQSLSVWDLLRLPFYHPVLEEGLRTALRNLAS
KLPACSESDLAVCGPFNAEALD
>NE1375 Helix-turn-helix motif
MLRDRISRELVSIETKKQGFIAFTPTYQADLGFNPEQSAIYTLRAELMSN
LRKTIRERKWTQEEAAKVLNIGQSRVSDLMRGKWEKFSLDMLITLAIRVG
KRIGITVV
>NE0085 SAM (and some other nucleotide) binding motif:NOL1/NOP2/sun f...
MRLTPERLDLLIAAVRKILPLQAPADVLLRQFFQDHHKLGQNDRGTIAEA
VFGILRRRFFLEKLAGKATARELVLAYLTRFQGMNLRELAPLIRETEIKW
IQQIKAVKLEEQPLSIRAEFPEWLVEKLQKYHSDEEILRLGQTLQQPAPL
DIRVNSLLAKREEVLAALEQQEIEAQPTPFSPVGIRITGKPAINRNALFL
SGKIEVQDEGSQLLGFLLAPKRGEMVVDFCAGAGGKTLLLGALMHSRGRL
YAFDVSEKRLNNLKPRLKRSGLSNIHPQRIDSERDAKLKRLAGKIDRVMV
DAPCSGLGTLRRNPDLKWRQSPESIDELQKKQMAILTAAANLLKPGGRLV
YATCSLLPEENQQIVEHFLANHPQFSILHCDELLAQQKIHLPDTGKFLQL
SPLYHNTDSFFAAVLERSV
>NE1800 Polysaccharide deacetylase
MNLMIRNALTIDVEDYFQVSAFARYIPRSSWDSLPCRVERNIDRILVLLD
EHKIKATFFTLGWIAERYPSMVKRIVENGHELASHGYAHHRVTELSRGQF
YDDIARSKFLLEDIGGQSVWGYRAPSFSINKDNLWALDYLEEAGYRYSSS
IYPVEHDHYGMPDAPRFAFNPTGSRKILELPVTTVRLLDRNFPAGGGGYF
RLWPYAVSRWFLQRVNSVDRQPAIFYFHPWEIDPDQPRQVGISFKTRFRH
YLNLGRMEKRLDALIRHFDWGRMDQIFLRQSV
>NE1490 Uncharacterized protein family UPF0021
MVTETLSRKADYNANKLRKRLRRLVGTAIADFNMIEKGDRVMVCLSGGKD
SYALLDILRNLQAHAPLDFELIAVNLDQKQPGFPEHVLPGYLSEINMPFR
IVEQDTYSVVKRVIAEGKTTCSLCSRLRRGVLYRVATELGATKIALGHHR
DDILETFFLNMFYGGKLKAMPPKLVSDDGCHVVIRPLAYCKEKDLAAYAW
HAQFPIIPCNLCGSQPNLQRQVIKEMMQQWDKKYPGRLETMFTALQNIQL
SHLADTSRYDFVGLKPHGIAIEEGDKAFDEEPLSVIPVDMDHDDDSTFEP
ENEHDGGAVQGGVI
>NE1469 conserved hypothetical protein
MSLQAKLSDALTLEKPWSARESWRVFGIVAEFVEGTERLECIQPAISIFG
SARTPPDHPHYKLTEAIARQLSDAGFSVISGGGPGIMEAANKGAFYGKSP
SVGLNIQLPHEQHRNVYQDISQTFRHFFARKYMFIKFATAYVVMPGGFGT
LDELMEALTLVQTGKTRKMPIILVCSDFWTGVIDWFRQVLVQHDFISSED
MDLIQIVDEPSQVVDAIFRYYETSGFEPSAAERNIQLNL
>NE2434 putative transmembrane sensor
MSDSNTSSLPRAAQLEPDAAPAVAVLQQALQWQVIFWSGEATDRDRMRWQ
SWLAADPAHARAWAQVQRTDEQLQTLASPAAEVLRSADARRCKARRTVLG
MAGLLAGAGIMGCGLRQTAQWQMMMADDHTGYGERRNIELADGTQVTLGS
ATAIDVCYSAQARQLLLRTGEIFIVTAPDSADRPFLVQTTRGSVQALGTR
FNVRESGEQIQVAVQDGAVAIRPAGMNPGDVVRLDAGQQTCFDHTKVEAA
EALQISATAWTRGLLVAERMRLEDFLVELGRYRSGVLRCDPAVRDLIVSG
VYSLDDTDQSLRLLAQALPVQVQTVTRWWTIVGPRPMK
>NE1817 hypothetical protein
MDEADVRQRARAFVARVDVSNIREDLSPYVTEANAKVKKEELGEGESGYT
ITKPNGKHVITVNSLETEERQRFTICHEVAHIVLGLESSHEEVPSWSYAK
RHPNEIACDTFAAELLMPYQQWLSAVPKEEPSLDLIQHMADLFGTSFPAA
ASRFASLSDMPCAFVTMERGAVRYAARSTALRQAGAWIPVGNSSRLRGSP
NPLFREERY
>NE1845 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE0267 conserved hypothetical protein
MTYVDGFVLPVPEGKIDAYRQMAESAGKIWMEHGALQYKECVLEDAKPEM
PEDAPETCKITPFGKLAGTKDGETVIFAFIVYKSREHRDEVNKKVMADPR
MQEACDENNMPFDPSRMAYGGFKALVDL
>NE0858 conserved hypothetical protein
MIDSRSVFTTSSTVAVTALALEFGISFPELYERAGLIKLDQIFLNFLKEG
DSTLHDHLIQARQNPELLVRKDESALLIEVAPWLEDFIATLFGIKAEVSA
LTARHHELAPLYFCKRQFVQRRAKTKVSDEVLQTLNGSILEQKLVEEFGQ
PFSELVFASHVARWMEDEVNHAAQLELALHYAAWALRTSEGQAHTRQGIL
FKSPRKLDFQHLLSLTSDDRNGYTVHSLDHVRYREGFSLTDQGTDLTGAL
DEANYCIWCHEQGKDSCSKGFPQKSKEVPEVKAFKKNDLGVLLAGCPLEE
RISEFHKLKTQGVAIGSLAMIVLDNPMCAGTGHRICNDCMKSCIYQKQDP
VNIPQAETRTLKDVLELPWGFEIYSLLTRWNPLDLLRPYPKAATGKKVLV
VGMGPAGYTLSHHLMNDGHTVVGVDGLKIEPLSASLSGINIKGERSAFTP
IFDVEQLSEDLDERLPAGFGGVAEYGITVRWNKNFLKLIRLLLERRSEFA
LFGGTRFGGTLTVEDAWNLGFDHIALAAGAGRPTILEIPNGLARGVRAAS
DFLMALQLTGAAQSDSIANMQLRLPVVVIGGGLTAIDAATEALAYYPVQV
EKFLRRYEILAAVQGEAAIRANWDSEEREIAEEFMTHARAIRNEKAQAAR
EGREPRVAALLQSWGGAMIAYRKRLIDSPSYTLNHEEVEKALEEGIYFAE
GLTPLRIDIDQWEHVQAIRFAVQAINEEGSWYKVNEVTQPARTILVAAGT
QPNTVLAREDAHNFKLNGRYFASCDEQGQPLDAIRGNPKPETPAVLLSHS
DDGRFISFFGDLHPSYSGNVVKAMASAKQGYPTVSRVLAQAGPVSVQSAS
QFFTEIGGQLRATVHKVERLTSNIIEVVVHAPLAARHFHPGQFYRFQNYA
SLAGISEQTRLAMEGIALTGASVDVSNGLVSLIALEMGGSANLCALLQPG
EPVVLMGPTGTPTEIESHETVVLVGGGLGNAVLFSIGAAARAAGCKVLYF
AGYKKLADRYKVAEIEAAADTIVWCSDEAPGFQPSRLGDLSYVGNIVQAM
VAYASGELGSAPISMQAADRIIAIGSDRMMAAVAAARHQQLAPYLKVEHF
AVGSINSPMQCMMKEICAQCLQPQKDPVTGDISYVFSCFNQDQPLDLVDF
SGLASRLRQNSVQEKLTSRWIGHCLQQGSSHVS
>NE1689 possible epimerase
MSETIQPGQTAVVLGAGGFLGSHVADALSNAGYKVRLFDRNPSPFRRSDQ
EMIIGDLMDISQVSNAVQGAAAVYNFAAIADIDEAHDNPLGTASINVLGN
MHALEASRLAGVRRFIFASSVYVYSETGSFYRASKQAAERFTETYHERYG
LEYTILRYGSLYGRRSDRRNGIYRMLHEAIQQRAITYRGSGESIREYIHV
EDAARMSVQILAPEFANRHLILTGQEKLRIRDVMTMISEMLPWNVDLHYD
QAKPGHHYQITPYAFQPRVGRKLVLNEHVDLGQGLLDCIQEIYQQISHGN
EEGGTTPSSTDNQA
>NE0092 conserved hypothetical protein
MLCALKHLFQGEPQQQAVSGLISTSYTDNDGETRISLKLEPRYENAGEVI
TKLVELDADPQVVRVGIQSSSIASFGSLENLVNAYGTLYRYLKDNYDSPA
KLKKYWGYLANDVVFIQISTDVSSALKIFETINERGVGLNPMDLLKNLLF
TQVGQAQFTQLKDEWKKITRPLEKGKEKPLFDHFVMQLENFLFYYIFTKT
PTRDMERSFSQWANELRAIADAAECRCHNQWQGIDS
>NE1178 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1100 hypothetical protein
MPSTPGIWKLPFNYFYIVPDLADASQRCFYGLNRVNCQPACKNNRYKNDN
CSYSIRSSYENTASQITPSLRDLDDNIYESCKRAANLLTA
>NE2465 Alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen
MSLINTTVKPFKATAYHNGDFVEITDKTLKGKWSVIVFYPADFTFVCPTE
LEDLADNYDEFQKIGVEVYGVSTDTHFTHKAWHDTSSVVRKVRYPLVGDP
TRELSRNFDVLIEDAGLALRGSFIINPEGEIKAFEVQDNNIGRNAAELLR
KVKAAQYVSMHEGEVCPAKWTEGAATLKPSIDLVGKI
>NE1398 putative DNA polymerase-related protein, bacteriophage-type
MDPNRLKELGLLPVWRVRPGAIAGQSPDSGKNMSDQIETAAESRPFEESE
DRSTSIAHADWGRLRQMVSGCTACPLSQTRKQTVFGVGDEQADWLFIGEG
PGAREDELGEPFVGQAGKLLDNMLQAVSLRRGQDVYIANIVKCRPPGNRN
PQDAEAEQCRPYLLRQIALIQPRLIVALGKVAAQNLLATDASIASLRGRL
HEFSGIPLIVTYHPAYLLRSLGDKAKAWEDLCFARDTMRNLQAAHSS
>NE0029 Short-chain dehydrogenase/reductase (SDR) superfamily
MLADRVILVTGAGQGIGRAAALAYAEQGATVILHGRKTEKLEQVYDEIEA
LGRASAIILPFDYEQATEAGITELVEAIASQLGRLDGILHNAAWTYGPMP
LAFHTSAHWQTIIQVNLLIPAMLTRACFPLLNASPDASVIMTGDTHGQTP
AAYWGAFAVAKAGVEVLVKIQAEEWEIYPNLRINTLIPGAVDTPQRTKTH
PGSNNRILPKPTDLMETYLFLMGPNSAGITGKTFDCQKEQSA
>NE0806 hypothetical protein
MKYSIRFFAVIFAIFLTACSTMYYSGLEKIGIPKRDVLVYRVEKARDTQE
ETREQFKSALEQFSAATNFKGGDLEGIYKKLNGEYEASVNKAKEVRSRIE
DIENVSAALFREWEQEITQYSNPALKRSSQDRLTETRSYYKQLIAAMKNA
ESRIQPVLTVFNDQVMYLKHNLNARAIASLKGELKTLQSNVSTLVAAMEK
SINEANTFISNMEKN
>NE1090 conserved plasmid protein
MAEADLDNIIDYIAQDNPTRTEEFGQELRDKILPLTQNPKMGRTGRPGSS
AFVRELVAHRNYIVFYRVLDEACTVEILRVKHAAQQSS
>NE1182 Helix-turn-helix protein, CopG family
MRNTMTHRVTITLDAETFAFLNDVASSNRSAYVNQLLKQDRKNFLQAALR
KANQEEAEDTNYQEKLQAWESTLSDGLAND
>NE2426 hypothetical protein
MTKPLDRLIGMTLLSAEIASGSAELRFSRCDFSAYSTYSSFPDFGSLVGQ
TVQSIVGSMDRLVIRFAFGEFFISLHPDDYRGPEAFCARFADGPWVVE
>NE1657 conserved hypothetical protein
MFRTVALTGATGFIGRVLIAKLAASGWKVRALARRVPFRQDLPLVEWICG
DMNCDSVLCDLVSGTEAVIHCAGAIRGKSWDDFSRTNVTGTRNILRAASG
APSCSRFLFISSLAAREPHLSWYARSKFEAEQLIPGFSGLASTVFRPAAV
YGLGDKAMQPFFQAMRYGILLVPGDPGNRFGLIHVDDMVAAIHNWLEAGQ
PVKGTYEIDDGTSGGYDYTSIAALAEQVLSRPVHCLQIPLGGIRMLARFN
LWLAHLLNYAPILTPGKVMEFQHPDWTCDISSLKRELAGWSPVTKLETVL
PLLVRM
>NE1492 conserved hypothetical protein
MKIQFNRFLTRSLLTAALTGAASMAAAATMEVYKSPTCGCCAKWVDHMRD
NGFTVNIHDIGNDEARAEAGILPELGSCHTALVNGYAIEGHVPADDIKQL
LKERPRAVGLSAPGMPHGSPGMETGRVDSYNVLLIRKPGDKRSATEIYNR
YGPGKSGAAEKTSENSTTDSVLRLK
>NE1025 Bacterial outer membrane protein
MYKTRVITTLVAGFIISGCADMSGTQKGAGTGALIGAGTGAAIGALAGGG
KGAAIGAGAGAALGAGAGYLWSKKMEEQKIQMENATAGTGVVVSQTPDNQ
LMLNIPSDISFDTGSAQIKPDFRPILDSFATSLLNNPGTRVTIVGHTDST
GSDAVNNPLSVNRAASTREYLASRGVPFQRIQIDGRGSYQPVASNNTAAG
RAKNRRVEIYVSETRDETVQ
>NE0044 hypothetical protein
MLYSTETCQHSSSESLLKKTLVVLLGSILATYLLGTTLPWPITLLHAVWL
QGLIAALASYYLLHMPVWWAAIHLLFFPALLSATLVLNLPAYWYLAGFIT
LLVFGRIHRTRVPLFLSSGEAVDALARLLPQDRQFKLIDLGSGCGGLVCK
LARMLPHGSYHGIETAVLPCWISKLRALLSRQDCQFKWESIWQHDLSGYD
VVYAYLSPVPMPRLWEKARREMRPGSLFISNTFTVPGIKPDRCIRLDDFS
STVLYIWRIA
>NE1515 Ankyrin-repeat
MNRSLWTQWSIASVIVLFCAFCLYSPAHAGVDEDLVRAVEDNKTHRVRDL
LTKGASPDARDLQSETALMLAARNKNPEMGGLLLEAGANPDLRNKYGETA
TMLACYYGQLDLVKRLYAKGAKIDHDGWNPLIYAASKGYKEIVEFLLNYG
VRIDAATDNGTTALMMAVRGNHYDTVELLLKHGANALIRNEADGTALGWA
RKQGHTSIVQLLTRNGTAD
>NE1311 Helix-turn-helix motif
MMSSIRTTEQLGDALRAARKQLGLTQPQLALAAGVGVRFIVDLEAGKPTL
RLENVLRVIDALGGALQITGLSSSDSENHPEGNQHGA
>NE0535 TonB-dependent receptor protein
MDNRSTFFTDQTQGVFMIIKIISLYHGFLFGLVLAAGLFISANVAAQSAV
TQQTIAIDIAAQPLEQAITQLATQAGLLIGVDVSLVAGKQAPALNGRFTP
LQAIGQLLKGSGLTVVENAPGRYTLEAAPAGHTNSNAEAVTLPEMKITSV
IDPDAPDNPSYQKTKAFSATKTDTSIMETPFSVQVVPRAVMDDQKSTKVK
DALENVSGVRPQPSLGGGGGFIIRGFRTGNIYRNGLLSSEGFFGDFDASN
LDSIEVLKGPAQLYGRTEPGGLISVITKRGLEIPYYSVEQQFGSYDYFRT
QWDAGGPVTEDKKLLYRFSGAYQSNNSFRDFISGDRKVFNPSLTWRPTDA
TDVTIDIEGTEKFASADFGIPVIGNRPAPIPISRNLGDPNTPTGDQSSVK
VGSEINHRFSENWAIHHRFLASLTDGSTTFVNPAPAFNAVAALNPATGLL
QRNIFTQNSEQEHYATNLDVTGKFDLGFSRHQVLVGFDFYRTYNKYGTKG
QWIVPDPNLAINIYNPYPSYGISQSTFDTAFLTSSAPGTNFSVIYNNWYG
VYFQDQITLWDKLHIIGGGRYDWAETGRGRSDNFDTADSLVKPNIRKDQG
FSPKAGILYEPWKELSIYGSWTTSFGANNAPAADGRTFDPQTAEQFEAGV
KTQLFDRRLTGTLAYYHLTKDNILVNDLSTADPFDKIANKQRSQGIELDI
SGYITDSFSVIASYAFTDTRILKDYSGATAGNRLANVPKHAGSVWLRYDV
KKFAPLNGLSFGLGVFAAGKREGDVQNTFQLPGYARLDAFAAYRMKLGPT
RLTAQINARNILDKRYYESTDPDSNVAPRLGVYPGAPLTILGSLRLEY
>NE2252 aspartyl-tRNA synthetase
MRTDYCGAINTRHLDKTITLCGWVHRRRDHGGVIFIDLRDREGIVQIVCD
PDNVAAFQIAEKIRSEFVLAITGTVRHRPEGTVNHGILSGEIEVLVNAIE
ILNPSLTPPFQMDDDNLSEAIRLEYRYLDLRRPVMQRNIRLRHQVTMAVR
IFLDQHGFIDVETPMLTKSTPEGARDYLVPSRVNAGHFFALPQSPQLFKQ
LLMVSGFDRYYQITRCFRDEDLRADRQPEFTQIDIETSFLPENEIMGMME
DMIRRLFASVLDISLPDPFPRLSYADAMFLYGSDKPDLRVPLVLTELTDL
MQDVPFQVFRDAAQKAGGRVAALRVPGGGELSRKEIDEYTQFVGIYGAKG
LAYIKINDLTKGIEGLQSPILKFLPESVVQSILERTQAQNGDLVFFGADK
AKVVNDALGALRVKIGHERNLATDSWQPLWVVDFPMFEWDEEEKRWQALH
HPFTSPSQGHEDFLTSDPGKALSRAYDMVLNGMEIGGGSIRIHRQDIQSK
VFQALNISDDEAKLKFGFLLDALQYGAPPHGGIAFGLDRIVAMMTGADSI
RDVIAFPKTQRAQCLLTQAPGAVEEKQLRELHIRLRRTENTNN
>NE0130 conserved hypothetical protein
MITTHIQNFSVLFEALSLAMNQTEFSIKTRSKQMLKWAIIFAIISFISGV
FGFRSTSAGTASIAKFLFFLFALITLVLLVLGLLGIGVVA
>NE1582 putative plasmid stability protein
MATLTIRNVDDVTKRLLRIRAAQHGVSMEEEVRRILRQELSRAGSSQFPF
GQHLLSRFAESTSKEFALPARQVPRTPPSWDEPI
>NE0614 Dihydrodipicolinate reductase
MTPLNIAVAGSTGRMGRAIMETIAEADDLRLSAALEQPGNPYLSQDAGSL
TGTPPGVAISSDYVSALAGSDILVDFTRPAGTLSHLATCRKLGVRMVIGT
TGFSPEEKDIIRNAAQDIAIVLAPNMSVGVNLLFRLLEVAAKALPEGYDV
EIIEAHHRHKVDAPSGTALRMGEVIAQAQSRDLEKVAIYGREGNTGERRA
DTIGFSTIRGGDIVGDHTALFAGIGERLEITHKASSRKTFAAGALHAARF
LMTRKSGLFDMQDVLGLR
>NE1366 conserved hypothetical protein
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVDEFTSKRSSPLIPSGHQRSC
IQQKHHLHQLICSMTGYCRSLLNRVWALFAF
>NE1362 hypothetical protein
MPGTQQGRTGYPQLRFVGLLENGTHVLFGVALGGYQDAEVRLAHQTIAHL
KPGMLCLADRGLSGYPLWAAASRTRRAVALAHPQESPTAYA
>NE0097 Haloacid dehalogenase/epoxide hydrolase family
MKLALFDLDNTLLAGDSDFQWAQFLIEQQVLDREVYEARNIEFYEQYKAG
TLDIHEFLDFQLKPLSRHPREQLNTWRSDFIERKIAPLIAPGARELIARH
QAEKDLCIIITATNSFVTAPIARMLGIDHLIATEPEQKNGEFTGRVTGIP
SFQAGKITRLEQWLDAHNLTWLSFLQSWFYSDSLNDLPLLKRVTHPVAVD
PDATLHEHAKKSGWPIISLRQ
>NE2305 conserved hypothetical protein
MNPPFYTIGHSTRTLEEFIGLLHAAEVEQVVDVRTVPRSRTNPQYNLETF
PDSLAAFQISYEHIPQLGGLRARSKTVSSNVNGFWENQSFHNFADYALSD
TFHEGLAKLIALGRKRRCAIMCSEAVWWRCHRRIISDYLLMHGETVFHLM
GHDKIEIARLTESACPQPSGAVTYPSRNSLVN
>NE0061 Sigma 54 modulation protein / S30EA ribosomal protein
MNLNLTGNHVEITPAMRDYVLSKIERITRHFDNIIDINVILSVDKLKQKA
EATVNLRGKEIFVEADGLDMYASIDNLVDKLDRQILKYKEKNIDRRDNQG
GLKDQEFEQSE
>NE1597 conserved hypothetical protein
MKPDTSSGYSPEDNKNGMQIITTYEAILAITDQMLQAAKNSDWDKLVALE
QDCKRLTTWLMEQHTYEQLSEEQKKKKISLIHGILERDAEIRAITEPWMA
QLQNKLTSYGHKRKLGQTYQTDS
>NE0383 conserved hypothetical protein
MIEFTALDDLSLLRESVSLECKLAQGRDGKGALPDDFWPTYSAMANTDGG
IVILGVRERKGQFEVAGIENPDKLRTELFNHLNNRQKVSVNLLRDEDVRA
WPVEGKTLLVVEIPRARRQQRPVYLRGNPLGGHTYRRLHEGDRPLPDEDV
KRMLAEQVEDSRDERILPGYGLDDLHLESLRAYRQLFANRDPSHPWNALA
DQAFLRQIGGWRQHRETGEAGLTLAGLLMFGQWSSITEAVPLCFLDYQER
PADYATTVQWLDRIVPDGTWSGNVFDFYRRVVGKLTADIKVPFVLKGDIR
QDDTPTHKALREALVNTLVHADYTGRISVLVVKEPAGFVLRNPGGLRVPA
QQALQGGMSDCRNRTMQQMFLMIGLGERAGSGMSRIVHGWRDLGHELHLR
ESYEPHEHSVLEMFWAKGRGATLASESSEESSEESSEETGKKILHLLREE
PSLTASALARRLQLTPRAIEKQLAKLKAQKRLRRIGPNKGGHWEVGE
>NE1345 conserved hypothetical protein
MRAIRFVPDAWEAYLYWQDQDKKTLRRLNSLITAASRDPFVGIGKPEPLR
GELSGYWSRRIDETNRLVYRVTDVELVIIACRFHYE
>NE0562 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNT
>NE0234 hypothetical protein
MQIFDDYSNLTGDVTKDAADRALRHFIYRTQRGGMVIPAELQAFILNGIE
RQLAEGTGGWFVPARGRPTISNDAGWRFVAMIAWHEYYFIAKGHSEIRRK
NVSDFLIKQFGHTYCDFDLSDSGARRMIEDVNNRGFSGIDSGSPSGINNR
DTDLLNAKLFCRTELNMRGLAELSHAVAMVRRLNKNRGHK
>NE2035 Uncharacterized protein family UPF0033
MNHNHQPDLSLDLRGEHCPYNAIATLEALADMTAGQVLEVITDCAQSVNG
IPEDARAKGYDCLAVEQHGPLFRFLIRVPG
>NE2413 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYL
>NE1316 NUDIX hydrolase
MIDRNGYRANVGIILLNSQNQVFWGKRARQDSWQFPQGGIKSGETPTEAM
YRELAEETGLQPVHVEILGRTREWLRYDVPACWTRRDWRKNYRGQKQIWF
LLRLLGRDSDVSLETCAHPEFDAWRWNQYWVELESVVEFKRQVYRQALTE
LSRLLDHEAGLGNDRAYREPLEPVEKNRKKSSDTRQS
>NE0257 Site-specific recombinase
MESQHNHVMKKSCRIIGYARVSTEDQHLDLQIDALKLAGCSSIFEDHGLS
ATAKRRPGFEQALASLQAGDIFVVWKMDRAFRSLKNALDILEEFENRAIE
FRCLTEDIDTTTPMGKCMYQIRHAFSELERNLIRERTKAGMEAARQRGAH
LGRPKKLSRGQIIRMQNLLQRQPDMTPVQIADQFGVSSRTIYRALSKYST
IKEELAIHAG
>NE1058 hypothetical protein
MAAKVNPGTTLTRKCSIRVYLRSLYFYGSSDMKKTTTRIVYAGMFLTAAL
VLGQKMFMENRVVDDGRVHTRAGSAGSVITGGHGRNMSSVEDSVKIAGIH
GGIGDFSVDGDSILFTSEDSVYEYEDQSGEGGYGNATHSAGDSEKITGSH
GRNGNNPIHTGSGSGTVNRPAGSTPGNTGRSYADGNGNGARGNSIGGLGG
GGGGGGNGARGKDDNTDKDPANNAPGMNTGANPDREETRADEDKSDPDRG
GKPDGDDPDNDKAAGPAGNPPDKGIPPEYFDAPPGKELNDPFQPPIPAEE
VGGRDYEEAGEGAGASGDPGARAIPEPVSSLLIGLGSVMLLWSRRRGSRS
>NE1907 hypothetical protein
MNKKRMVTMFVLVGACAGLSACATTGEVEALKSRVDALESNVSATKSDAA
AARAAANDAVNIANQAMDKANEANARSIDTETKIDRMFKRAMHK
>NE1821 conserved hypothetical protein
MRLDLSEKYHTKTLGPRSAQLITELHELRKSTFTVADVRTITGLSPTAAR
TLVHKARQRGLVTRLKPGLYNLVPFELGRATEHVDSPYLIARELAGEASY
FLSHGTAFELHRMVTQPNFTVYVSCTRRVRPQTVGGYDFRFVHVTPEQVF
GVTKHWIDKERFVAISDIERTILDGLRHPAFVGGITEVAKGLWMKRAELD
LPRLVEYVRRLDVGAVVRRLGYLLEHYGLGDAATLESLRGMLTATYQRFD
PLLPAEGPFVSRWRLRLNVTPEELDAVRLG
>NE2440 hypothetical protein
MKAHISTVKLSILVPAALLSGFFLTGSTAAIADSSSDSNRANSEKYEKKE
DIRKYDQRNDLDESRRGIEDTVIPETDSNSTNQQDHNLRNPSEQSPAEIM
PGRN
>NE2347 General substrate transporters
MSVDYRQLPRSALTDVLLQAVLVFAMTLPMLVLYTTSTLGPLLSRDLGFE
PVAIGYLIMSSFGLAAILSLRAGAIVDYIGVRTALIVLFCAVAAAFALIA
ITQAFFSLIMATAICGIAQALANPATNLLIAHQIRPEQRAWIVGLKQSGV
QLAALFAGLVLPAIAFQYGWRIAFGVIVPVAVLFSLAAWLVTPAQHVRKN
RQLIFTRPNMLLSWLMAIQFSVGLALSAFVTFLPTFATLQEMPLAQAGTL
IAVFGITGMLSRIILTPLGNKFSDESHLLFALIAIAAGAIMLTMQAGPGS
HGYLWAGAIGMGMSAVATNAIAMSMLIRDSAFGRVTETSGYVSFAFFSGF
ALGPPLYGQLFSQTGSVSLAWSLLTGVLCLACIMTLRLTAVRKRRSHVSV
>NE2372 conserved hypothetical protein
MNHQARMRWRCRRGMLELDIVLQRFIDNHYEQLDEHQLELFEMLLSLSDH
DLWNIIIGNTKEPNNQFQPVLKLLQEN
>NE2528 AAA ATPase superfamily
MTTQTIDSAPHATWFVGASYGGTDDQMPRFLAEGIWENGYDDKLLEVVRS
MRPGERIAIKSSYTRKHGLPFDSRGRAVSVMGIKAVGTITENLNDGKRVK
VDWAKVEPVREWYFYTHRATIWRVLPGEWMNDALIAFAFDGKPQDVDRFR
NEPFWRERYGTTSPEKQRFEWTDFYEAVAEKLLAHADDRTPLIEGIHEIA
SRVPGLTYLQDKFPDGTSGPLRDICPFTTMGTFNRSMTDANRKTLAGELA
KLLGVTVPVPPSFEGIPVLNNQRSWFFAYADKRGAGDIDALWKVFVAASK
MVDGDQLDTRDAFIRAYDEATQVWGVAWNLSTGLYWAHPWEFLTLDSQSR
HYINKRLGLNVAISGQQGPCDGRAYLKLLDDLRSRFGEDGYPVHSFPDLS
LASWMYKDPVDEPVPAGDIGTNAGAEQETEGEVREAFQVAAPIVPYSVED
ILKDGCFLERAEIDRLLDRLRTKKNLILQGPPGTGKTWLAKRLAFALMGQ
KDDSKVRAVQFHPNLSYEDFVRGWRPTGEGKLSLADGVFMEAIKAASKDP
SSKFVVVIEEINRGNPAQIFGELLTLLEAGKRTPNEALELCYPDADGKRR
PVHIPENLYVVGTMNIADRSLALVDLALRRRFAFVGLEPRLGQVWRDWVV
KECAVDPGLVADIERRIAELNDQIAADARLGKQFRIGHSYVTPAHRLEAG
DTKKWFLQVVETEIGPLLDEYWFDAPDEAQKAIARLTQGW
>NE0501 Bacterial sugar transferase
MSGKTRESPLFVNREVMLRLLDIVLVIPGILVTLPLMVVLYVIGLFDTGS
PLFRQIRVGRRQQPFTLVKFRTMRPGTASVATHLADASAVTPFGGFLRQT
KLDELPQLWNVLKGEMSLVGPRPCLPNQHELIRERQLRGVFDVRPGITGL
AQIQGIDMSTPVLLAEIDAQMIKTLSLTSCFRYILLTALGRGTGDRVRKK
SG
>NE0064 putative signal peptide protein
MRKCNLQAAHGYSGLFNMAWPNRAIFSSFLSLFLIFFSSIVLAERADRDK
PIHLEADHATVEDYKRKGEFRTSIFTGNVVLTQGTLVLRADKVIMKEDAA
GYRYATAYGDLVSYRQKRDGVNEYVEAWSKRAEYDDKTDKIELFGSARLK
RGADEVEGDYISYDIASDFFQVSGRQQSENDKNSDHRVRAVIQPKAKQSD
TETGK
>NE0175 ADP-glucose pyrophosphorylase
MKITERGDDFPARAIILSAGQGRRLLPLTENTPKCLLPVAGKPVIAWQID
ALLANGIEEIVIVAGFQIGKVEALIAERYHDRSNIRVLFNPFYEVADNLA
SCWIARNEMHTGFLLLNGDTLLGDDLLPGLLHTQTAPINLCIDFKAEFDD
DDMKVQLGPQNQVRQVSKKLSAGEMDAESIGLIRFSKEGARLFREAVEQA
LREPEKLGSWYLAIISALARQGVVNGYSVAGSRWCEIDFIQDLQKAEKYF
ACKETDAADQPRSDQLLRVPG
>NE0998 hypothetical protein
MTLKRTIKEFATYLGDRESILDRDYPRVAGQIELLWGYVEFYRYLEKLLI
TEKGRDRSGFPFEAVLELDKLKEIHERLYP
>NE1380 hypothetical protein
MKIINIRENFSRYPAGRYRADGPYNGEKFREELLVPALSEAIDKGEKVKV
ELDGVRGYNSSFLEEAFGGLVRSGKFASTRDLSERFEFVSTDKSLIEEIR
GYMEEATPAVAQ
>NE0300 Bacterial regulatory proteins, TetR family
MTINNDPMRDAIVDTAVELAAHTSWEAVRLYDIAARLAVSLDEIRLYFRE
KDELIDAWFDRADSRMLKEAESAGFLDLVASERIHHLIMIWLDALAVQRK
VTRQMIMSKLEFGHIHIQIPAVMRVSRTVQWVREAAQRDATFMRRALEES
TLTTIYLMTFFFWMRDESENSRHTRQFLKRHLTMAAWLGQKVYGKDPEKS
TVPHKQQIPLQFD
>NE1670 glycolate oxidase iron-sulfur subunit
MKNTDRNDSSVEKLLTEANRCVACGLCLPHCPTYHLTHSEADSPRGRIAL
MSGVASGRIPLNERFIQHMDRCLTCRACEQVCPNNVAYGQLIDGARVMIH
EYSSVQRESQHKKGRLRDFLERELIAKPGRFDRLRPVLRLLAGSGLLSLL
LRLETLKRSDLIQSLLFLATRNLPGQLWRRSYPARGVLRGEVGLFLGCVA
RLIDTETLLSSIFVLNRMGYTVRVPDTQTCCGALHQHSGRIDAARSLAQQ
NLHAFEGLNIQAIINTASGCGVQLTEYSSRLDTGFSVPVMDISQFLDEQE
WDEVGFAPLSRRVAVHEPCTLRNTLRTSKHMYPLLRRIPGIEVVELPGNE
QCCGAAGTYFLDQPEFAKSLLDAKLQAFESTGADCLVTGNIGCALHIANG
LAKKGTNIEVLHPVTLLARQMRIK
>NE0821 conserved hypothetical protein
MLVVFEMRSLFAQLTLTDVHQDIARNIVSLRQSQNLFDDLTDDPAGWLLA
QKVEAEIKPPPYRSYTPIIDRPFEDAEWFNAIIWPFKYWQSSRFSDGTHG
IWYGSESVETTVYESAYHWYRGLLSDAGYEHEAVVAERKVYSVACSAALL
DFRKITEEYTDLLHPSDYTLTQSVGARIHREGHPGLLIQSVRRSSGENMA
IFNPGVLSNPRHNCQLTYWLEGNQIKVEKHPGTVWITMDIATFG
>NE0726 hypothetical protein
MLSLCRIIISLLLVGLSGQAAADISNSPNPYDAGYGFDTPDEAGWGGWMR
GGASTLYAEWDTISDASYGGSGDRTAAPDIGTHNVADAYLSWNPGVFVTS
TGNLITPSVVQEFFIRISPVSLFSGPLVVALQVEMWGDEPAAPLLNGLAA
SSWTRTFTGTSVTDHDLNQYLGLWYFANTVNHFEFDLTNQPFISLAQVAV
DIAQVSEPYMLAIMLTGLILIGSMTRYRSRPI
>NE2240 DUF214
MWKIISFNCYLIWRNMLRQKRRSTISIGSITLGMVALILASGFIEWIYQD
MRESTIHSQLGHLQIIKPGYFEAGKADPYRFLFSDDLEQDLLENPTLQNS
NHLIKTIVPRLSFSGLISHGDATLSFIGEGVDPQEQVYFGNALKISTGSN
LSADHPDHLIMGEGLARNLGVTVGDQVVLLVNTATGGINAVEMTIDGLFS
TVTKSYDDNALRLPISTAQQLLRTQGAHSWVVLLNDTHQTDAALAALRNT
LSQDRFEIVPWYQLADFYNKTTVLFTKQVQAIKLIIALIILLGISNTMTM
NVVERTGEIGTAMALGVKKFDILRQFLCEGALIGGIGGALGILIGWLLAA
IISSIGIPMPPPPGMARGYTGEILLTSNMVLEALALAILTTLIASVYPAW
KASRMQIVDALRHNR
>NE1352 conserved hypothetical protein
MWELRYTHQAQKDAKKLASSGLKDKAEELLAVVRNNPYQTPPPYEKLVGD
LAGACSRRINIQHRLVYQVLERERIVKVLRMWTHYV
>NE1909 Diguanylate cyclase/phosphodiesterase domain 2 (EAL)
MKLIIARYAIRVARFTARGGNFIPEGGKIKGRGLNSIPIRFTLMALVFTA
LCTFLQYHYVLPLIESGTSDSIVVVAILLSISLPAAITYFAANKLTNTIR
ELNKSTDAIAAGDFDRPVDVDCACEVGSLADSFRAMVNRLNSNIVRINTL
AYTDAVTGLPNRAVVSHVLDLAARMRGSADCKGALMFIDLDGFKRINDTL
GHEGGDELLRQVSRRVIEKGFNLTRGQIENCTTAFGELRQTCPEKLVFAR
FAGDEFVAIMPGEFHRRVLEKYAADILKAVNEPFLVSDNELRIGASIGIA
RVTSDSDDPRQLLINADIAMYSAKESGKNQYRFFDESLKNIAVERSQIEA
DLRKAIEEDMLDMHYQPKLNAQTLQVTGVEALVRWKHPQKGAIPPARFVG
IAEQCGLMPALGTNILRMVARQARAWQEVGMPMPIAVNVSAVQFERSGIA
NEILAILEQHAVDPLLIEIELTESMIMSDFATAKSRLEQLREAGAQISID
DFGTGYSNLYQLSHLPFNVLKIDKSLVDDIGKNSKSEAIVTAIVQMVHSL
GHRVVAEGVETHEQYAFLRKARCDQVQGYLFGRPMPAHELIEWVSDHASD
NGSMNNTLIEKVSRVA
>NE2166 UvrD/REP helicase
MNTVLQQEIPDQRERQQALDPWHSFIVQAPAGSGKTGLLTQRFLVLLATV
EEPEEIVAITFTRKAASEMKHRILQALRDTAGDINSDAESETALLNDAYQ
RQLRELANRVLAHDQARGWQLLQNPSRLRIQTIDSLCAWLVDRMPVCSRQ
GALSSVAEDADRLYLEAARLTVEALEEEGEWTAAIEHLIGHLDNRLDRLQ
QLIADMLARRDLWLRGVVDAANSDDMRDRLESVLSGRIAEAIERLADAVP
AGCQSEIIELMQFAAVNLSEAGSADSNTVRWPGNALEDRLVWESMADFLL
TQTGDWRKQVTKANGFPAPSSVRDADVKEYLNGMKQRMSELLVALQSEET
FRQQLQLLRQLPPERYTDEEWETLQALFSLLKVAAGYLLLVFRQHGQVDF
TEIAMAAVRALGEPEMPTDLALALDYRIHHLLVDEFQDTSSSQAELLQRL
TAGWQTGDGRTLFLVGDPMQSIYRFRQAEVGLFLDIRDSGYFGQIQMRFL
RLSVNFRSQSGIVEWVNRYFPRILPDTDSVSTGAVSYASSVAFHAASSGE
AVRIYPYLQKDDRAEAEQVGAIVAQARAAQPDGRIAVLVRNRSHLASIVV
HLRRKGLRFQAVEIEQLAQRVVIRDLMALTRALVHPADRIAWLALLRAPF
CGLSLQDLHTVANTLPQHVLIDSLRACAGSGVLSEEGGQRVNRVLPILER
ALMLYDRMSLRRCVEGIWVSLGGPASVQNETDLADAEVYFQLLENFDVTG
YRPDIQELDERLVRLFALPDVAADDSLQLMTLHKAKGLEFDTVILPGLGK
SPRRDQEKLLNWLEFHDQSQHPGLLCAPISAAGSDKNPISAYILSEEKKR
TALEEARLLYVAVTRAKHNLHLLGHLRIDPDMQENDALKPPEDTLLARLW
PAVAADFLARSREAAIGDLPASNVHTGLQLVGMVRLVSGWQPPPLPKAVA
VAMHANEAGTTEEPVDFDWAGEPARLVGVVVHCLLHRIGLIGVENVDHQD
LEALKLAGRSLLIQSGITPRHLEKAVQQVARALRTMCVEDETGRWILSNR
HQEARCEWALSVPTAIAAGHSISVSIIDRTFVDAAGVRWIIDYKTGSHTG
GSLEEFLDREQLRYRPQLDRYAQVLQRMEDRPMHLALYFPLLGKWRKWIP
SRESA
>NE1574 Serine proteases, subtilase family
MANAKPVCSLFHSTFHDGIASVFHRSFSFPLLLILLIAFSGSGYANSSKE
TRAAARSESAGKTVSSRWAKGRLLVIPRAGLTAMEFDKATKPYGVKSRRR
LNGLNAHVYELPDGVDEVKVLDKLKKDRRFKAVELDRLVEPAQVVTDPAF
GNSWALPKIQAPAAWDIATGDGITIAILDTGVDGTHPDLAANMLPGWNAY
DNNTDTSDIYGHGTKVAGTAAAVANNGAGSSGLAWNARILPVRISMPDGR
AYLSDMAKGIRWAADNGARIANISYGGAESLTVQSAANYMRSKGGVVVMS
AGNSGGLNNFPASNAIIVASATDSKDARASWSSYGPYVDVAAPGVSIYTT
IRGGGYGYVSGTSFSSPIVAAAAALLFSINPDLAPTDIDQMLTATAQDLG
NAGYDQYYGHGRINAASAVNAAHARISVDRTAPLISIASPTGGTVSGSVP
VDVNYSDNKGVVRVDLYVNGRKTIEDTQPPFAFAWDTTTLANGSYTLVAY
AFDAAGNQGTSSTVNVTVKNTVADTTPPTISITSPAGGAVVLGSVPVNVN
SSDNIGVVRTELHVNGKKVMEGTGSSFVWDTKSLADGNYTLVAYAFDAAG
NQGASSPVAVSVKNTGTTTNTEPLPQINSFNLRDGQNVSRNQNIRVTADR
NTRKINLKINGGIIAAANGNSLSYRWNTSAIPKNSNITVTAEAFNAKGDA
TSRTVTVRN
>NE1410 hypothetical protein
MSRWQQALLIMSIMVAAMPDTSARRHDAASGHDRAGKPAGGISEQRAIAI
AQQHFSGRVLAISQTDRVYRIKILSDQGTVHTILIDALNGAVVSAR
>NE2104 conserved hypothetical protein
MIAFEFDEAKSQANLLKHGISFVDAQALWNDPRLLEIPAKTEDEPRYLMI
GLINGKHWSAVITYRGTNIRLISVRCSRTEEVTLYES
>NE1248 hypothetical protein
MARLIEDDHSNLIGLAGVLPRSWLICSASASQSTLLLRGRWMLAENRISD
GSQVLLSAPGYNGFQLRHYACMAARIRITRRTTSGTLRMVMDGIVFILPQ
YMQMQRYISMKSIAAILIYEQQLLTRAHEQLQQDLYRTWQIR
>NE0457 conserved hypothetical protein
MLRTLCKTLLPLILATASVKGFAMHIEEKTTQLQDQQQVSITIYNENLAL
VRDLRHVPLEKGINKLAWCDVSAQIRPETALLRTPEKTSSIRMVEQNFDF
DLLTPEKLLEKYLGRSVNVIYVNPATGAETVEAAIVLSISGGVVLKFKDR
IETGTPGRIAFPDVPGSLHDHPTLSLVLDGATPGKHELELSYLTSGLSWQ
ADYVARLDANDGRLDLSGLVTLANHSGIAYPDAHLQLVSGEVNQVTPEPP
QARKMMAMVADAAEYQAVREESLSEYHLYSLPVPTTLAENQSKQITLMSV
TDIPVSQEFLLRGTNAYYFSRYSNLDDKLKPVVLIQFKNEGEGLGVPLPR
GTIRVYQNDSRDNLQFLGEDHIDHTAKNEEVRVKLGKATDITAMRTQTDF
QQLDTPSRRFTETATDITAMRTQTDFQQLDTPSRRFTETAHQIEIHNARQ
EAVTIRVQEPIPGDWMMISESQPHTKSSANLVEWLVKVPTNQKTILSYRV
RIKH
>NE0237 hypothetical protein
MITNFRFIHPPYKQLYAYVERIVMPLDIEKYRKYLAPLNLGKDHEEEIIR
HIYMIMDEFISAAFNKHPVQQALQAKNRKTLQGQSDVIDSKDRSIQSLYQ
NVASRPDE
>NE1042 hypothetical protein
MVIKWILLITLVFLIFWFFKQFRQIQRKPPDTTRKVIEDMVRCAYCDVHL
PKSESIVEHGRYFCCTKHRQLYSQSQPDDK
>NE1804 Chain length determinant protein
MSELIAQLLVYIKGIWKYRWASVATAWTVAIAGWFFVYQMPDNYQASARI
YVDTQSILRPLMAGMTVSPNPDQQITIMSRTLISRPNVERIIRMVDLDIK
ITDDSAREKLVTTLMKDIKLSTTGSDNIFMISYDNKDPRLAKDIVQSLLT
IFVEGGLEGKKQDSASALRFIDQQIAAYEEKLVAAEAALTAFKQKNIGFL
PGQGGDYYSQLVSAAEELEKARLTLLEAEQARDAVKKQITGDEPVLLFEV
GEVSPQSIVNPEIDSRIQALNTNLDNLMLNFTDQHPDVVATKRLIAQLEE
RKVEEAKLAGTVNNRGRNYDPMMQQLNIALAESEAQVASMKARVAEYEAR
FERLKSMSNTVPAVEVEMQQLNRDYNVNKANYEKLLERRASAEISGELTS
TSGLMSFRIVDPPTVPEAPSGPDRKKFFTLVLIGALAGGIGFAFIISQIR
PTFHSQSTLRELTGLPILGTIPMVWTEQQKLKDKRKIYAFGISLLMLMAC
YVVLMIYVKPHGMTAA
>NE2172 SAM (and some other nucleotide) binding motif
MQNNKTKLLDEVATYYAEKLAEHGDTPRGVDWNGEESQTIRFEQLCKVID
PKKMPGFSLNDLGCGYGALLDYLRDKYAACIYLGVDVSHEMIKAARQRHT
AANQARFITSTEPDQVADYGVASGIFNVRLERTDAEWFDYLLGTLDVLNR
TSSLGFAFNCLTSYSDEDKKRDYLYYADPCRLFDLCKRRYSRQVALLHDY
GLYEFTILIRKA
>NE1122 conserved hypothetical protein
MNDLESLISQVRRCTLCAEHLPLGPRPVFQLHETARILIASQAPGRRVHE
TGLPFNDPSGDRLREWLNMTRTIFYDPRRIAILPMGLCFPGTGKSGDLPP
RPECAPAWRSALLSHLKNIRLTLLVGQYAQAYYFTRQGRKPVATLTENVR
SWQKFWPDIVPLPHPSPRNNLWLRRNPWFEEEIIPALQERVAMILNQTTD
S
>NE1654 DUF152
MIDWIVPDWPAPANVDAIFTTRNIGAAENRGIYAGLNLASHVDDDPLIVQ
QNRNQLRQYLPDHPRWLTQVHGSQPVWVDSSNETLELEADAAMSRRPGVV
CAVLVADCLPVLLCDMAGSVVGVAHAGWRGLAGGVIENTVRELRRFSSSD
RIIAWLGPAISSRHFEVGDEVREVFTQYDHRAACAFLPGKEAGKWYANLF
DLARQRLSHAGVNQVYGGDLCTFSNPEQFYSYRRDGKTGRMAGLLWMTQP
AGMQ
>NE1965 hypothetical protein
MEDTAPITTLLYSILALPIAFMILYWTKVRKDRRRNETGEIEYKSLGHAL
TFFIIEGLALVGSLSILITAVSGIVRYLIYVYA
>NE1144 secF; protein-export membrane protein
MDFFRFERDIPFTRWGKFSMTFSLIISILAIYSLTTKGLNLAVDFTGGTV
MEISYQQPANLDKTRKILAGIGMSDAIVQNFGTSRDVMIRLPVKLEHAGG
NLSETVMTALKVDDPSVEMKRVEFVGPQVGDELLENGLLAMLFVSMGIVA
YLAVRFEWKFGVACIVANLHDVLTILGCFSFFQWEFDLTVLAAVLAILGY
SVNESVVISDRIRENFRKLRKATVTRIIDNAITETMSRTIITHGSTQMVV
ISMFLFGGEALHNFALALTIGILFGIYSSIMIASPILLLLGAKREDLVKP
ERKPQEEALP
>NE0828 HlyD family secretion protein
MPKSRRLLRIAGIALVIAATGSTTWYLTRSAPQEVELVTVSRGTVEATVV
NTRAGTIKACRRAKLSPAAGGQVIHLQIREGDRVKEGQILLELWNADLQA
QYDLSRQQLATAESRQRETCILAGNAQRESVRTQQLVEKGFVSSQRADEA
DATAKAQQAACTAAAAEVKRAHAQIAVSQANLERTRLIAPFSGVVAQITG
ELGEYVTPSPPGIPTPPAVDLIDDSCLYVSAPMDEVDAPKIRIGQTARIT
LDALPDQVFDGKIRRIAPYVTEIEKQARTVDVEAEFVNPTQAVLLVGYSA
DIEAIIDQRENVLRIPTQVIRQDNKVWVLGQDDTLEERTLKTGLANWAYS
EVLEGLSEGDQVLLSSDNGKITAGTRVIPKPPQP
>NE2309 Adenine specific DNA methylase Mod
MASNQKLELTWIGKEKRAKLEPRILLEDPEKSYHAKQRVSESDVFDNRLI
FGDNLLALKALEQEFAGEVKCVFIDPPYNTGSAFTHYDDGLEHSIWLGLM
RDRLEIIKRLLSNDGSLWITIDDNECHYLKVLCDEIFGRANYKTTITWQR
KYSVSNNFQGIASICDFVLVYSKSEAFKNNLLPRSEESAARYNNPDNDPR
GPWKAVDYLNQATPEKRPNLCYDIVNPNTGVVIKNTKKAWKYDPTTHQRH
VDEKRIWWGRDGGNSVPALKLFLSEVRDGMTPHNWWSHEEVGHTDESKKE
MIGLYGPRDVFDTPKPERLLKRILEIATNPGDLVLDSFAGSGTTGAVAHK
MGRRWIMVELGEHCHTHIIPRLKKVIDGEDPGGITNAVDWQGGGGFRYYS
LAPSLIVEDRWGNPVINPEYNATQLSEALCKLEGFTYAPSETRWWQQGHS
SERDFLYVTTQNLSASQLQALSDEVGTEQSLLVCCSAFHGISAAAAAARW
PNLTLKKIPKMVLARCEWGHDDYSLNVANLPLAKPEPETPASQPAPKKKG
KKTLPMPDLFGDVEDGA
>NE0542 possible transmembrane sensor
MHLSNSPNSHDPDAQARFWFARQQGQTLSEAEQQQLAEWLAANPLHQDAW
RRVEEDWLAIDRYRHALNDELKKARRNRPGQHKAAFQLKRRAAAAALLVA
VCIPVLYQGMTGTTTYWNTLKGEQQQITLAGGSTLNLDTATRITVTQNWF
THHVQLHEGEIYLETASSDWHTLRVSAEHYEIRDIGTRFSVRYIPDQFMV
EVAEGTVEVKKQGQLVLLQAGEVLSIITGSNEWRLAALPAGDIAHWRKGL
LVFNAHRLHDVLHEIARYHTVQFDLADSAIGEIQVSGRFKLDELDTNLQI
IADTLKINIEHPAPGRYRLGALRQSTR
>NE0123 conserved hypothetical protein
MSLLQSSCRVAFAALLHDLGKFHERTGQPVNGDLAALTTLYRYSHAAHTG
GMWDVVEKYAPDLLRGDVAPFSGRTSGADITDSMANAAAAHHKPGTLLQW
IIATADRAASGFERTKFDEYNADAEGETPQHKNRFQARMISLFEQIKINA
QAPVGQFKHAYPLRALGPEAIFPDKRAVIEPNENKTAQAEYAALWEQFLQ
ALESIPKSHRSNWPLWLDHFDSAWLTFTQAIPSATNRGVVPDVSLYDHSK
AVAALAAALWRWHEETGNTGADALAKLSDSERPDWDEQKLLLIQGDFFGI
QNFIFAQGAQTQKHAHKLLRGRSFQVALLAEMAALKLLETLQLPSTSQII
NAAGKFLIVAPNTPSAREAVETCRRSFDQWCLQHTYGEISVGIASTSASC
NDLRSDRFRTLTQKLFGALDVAKHRRFDLCGNTAAVRDVSFADGPCDYHG
RYPADCAAEGDKSASCALSRDQIVIGEALTKHARLLVLNTADSFKKPLDL
DYFGYRLIFVNEADASGHYGKLAEQGELVRAWDFDLPDKNGTCFHGYARR
FVNSYVPIWDENEKKDDPAYKRLSQEDLGDTSPGKLKTLHSIAAGSNNEI
ALVTLKGDIDNLGALFQSGLAEPTFARWASLSRQVNAFFALWLPWYCAHG
ENRRFRNTYTVFAGGDDFFLIGPWESTLELAGAMRKAFARYVVRDDITFS
LGAVMTQPKIPARQLAVAAESGLNAAKQHYGKNAVSLWGVTVGWAEWRTL
MKERRDALERLISEAGGLSTGFIYNLLLLSDQAERDDPKRDDRRPEDALW
RSRLAYRCARLPKNQQMVGKALARECGEALKQYRGAYRLPVSVLLYRQRQ
>NE1722 Ferric uptake regulator family
MDIAEQLIQNAGKRSTPVRSAVLGVLLNAEDVLSHSEVLEHLQQLGAFDR
VTVYRALDWLVTQGLVHKVAGAGRAWRFQVTRSESMHRHAHFQCHHCHKV
FCLPEVQPVLPKELPSNFSIDSIELNIKGICADCGRAMMSQ
>NE1061 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNS
>NE2457 Delta-aminolevulinic acid dehydratase
MTFFSQFPQKRMRRNRRDDFSRRLVRENHLRPDDLIYPVFVLDGSNREEK
VASMPGVVRQSIDLLLLQAEKCLRLGIPALAIFPVIDASLKNLTADEAYN
PDGLVPRTVKELKKRFPELGVITDVALDPYTSHGQDGLIDANGYVLNDET
VSVLERQALSHAQAGADVVAPSDMMDGRVAAIRAVLDRDQFIHTRILAYS
AKYASSFYGPFRDAVGSSATLGAGNKYTYQMDPANSDEAIWEAGLDIQEG
ADMIMVKPGMPYLDIVRRVKDELGVPTFVYQVSGEYAMLKAAAMNGWLDE
KACVTEALLACKRAGADAILTYFAVEVAGWLQERA
>NE0067 conserved hypothetical protein
MSISRAPADFDRSISPPENRGEHETGRTLHIATYNIHKGLSFFNQRLILH
ELRDQLHGLDVDVVFLQEVVGEHALHATRFRDWPRNTQYEFLADSMWPDF
AYGKNAVYGHGHHGNAILSRFSIVNWENEDISAHRFESRGLLHCELAIPG
WKDTLHCICVHLGLFRRGRSQQLEAIEKRIRQLVSPDAPLVVAGDFNDWR
GAANPLLASRLNLVEVFQHTHGKAARSFPSVLPLLRLDRIYIRGFQVKNA
QILHNRPWSRISDHAVLSANIMRT
>NE0822 conserved hypothetical protein
MTAITRREESTSRDRGALAKMIMTLLDHWQLSTEDQAALLGIAASNRTAL
ARYRKGEAIGTSRDQYERVGHLLGIHKNLRLLFPQNRDLVYRWMTTRNKA
FDNLTPVEVIKEWGFAGLLMVRGYLDRARGI
>NE0618 mono valent cation-transporting P-type ATPase
MTDHRVTDHTETLQQTAWHALTLPEVRQILHTDSAGLKTDEVNDRFARFG
PNSLIPPKRRGPLLRLLLQFHNVLLYIMIAAAAITAVLGHWVDTGVLLAA
VIINVIIGFIQEGKAETALDSIRAMLSPHATVIRDGTRYEIDAAGLVPGD
LVLLASGDRVPADIRLISVKELQVEEAALTGESLPVRKRIETALPDTLPG
DRYGMAFSGTLVVYGQASGIVVATGSATELGKINRLLEDTQHLSTPLLRQ
IDHFGFKLAVFILAASAATFLLGTLWRNHAPAEMFMMAVALIASAIPEGL
PAIMTVTLALGVQRMARRNAIIRRLPAVEALGSVTVICSDKTGTLTRNEM
TVQRIVCADHTIDVSGVGYVPTGEYSIDGHTIDPAHYPALTLAIRSGVLC
NDALLREKDGLWSVEGDPTEGALLILGAKNGFSNHHANTAWPRRDVIPFE
SQHSFMATYHHDNENEPWIFVKGAPERILDMCTTQLQQNGKQPIDIDYWQ
RMVSATAAKGLRLLALACKRSAPQEDSLKINDMKTGFTLLALVGIIDPPR
EEAVQAVAECHRAGIRVKMITGDHAETARAVGAQLAIGAGRPVLTGMEIA
AMDDDTLRDIVMDVDIFARTSPEHKLRLVKALQAGGQVVAMTGDGVNDAP
ALKRADVGVAMGMKGTEAAKEASDMVLADDNFATIAYAVREGRVVYDNLK
KFILFMLPTNGGEALVIIAAILFEFTLPLTPAQVLWINMVTVSTLGLALA
FEPAERNIMNHRPRSPKEALLSGFFVWRVLMVSVLMMIGGLGLFLWEQHI
GVSTETARTMAVNAIVMSEMFYLINSRHTFASVLHLEGLTGNRYVLLAIV
ACLLLQITYTHLPTMQSIFNSTDLTMLQWCKSIAAGLMVFCVIEIEKSVI
RHTRLASVVTSA
>NE1076 putative CcdB-like protein
MARFDVYVNPGSHAATTPYLLDVQSDLLDVLDSCMVIPLRSLEHFPKVKL
PGRLTPVVTIKGQDFLLETPKMGAIPRRLLTMPVLSLRDMQPEITSALDF
LFHGY
>NE2031 Glycosyl hydrolase family 57
MTSRSSTLDLVLLWHMHQPDYRDYKTGEYMLPWVYLHAIKDYTDMAHHLE
KHPNIRLVVNFVPVLLDQLEDYADQFTHGTIRDPLLRLLAVPDLDRISPA
ERRLILNSCFLNNHDTMLRPYPSYQYLRDLYDHLLKMGDKELMYISGQYL
ADLLVWYHLAWTGESVRRENECVVNLMSQGKLFSHGDRMLLFHLIGKLIQ
NIIPRYRKLAESGQIELSTTPHYHPLAPLLIDFTSARDSLPEAMLPTSPV
YPGGRKRVAFHLQSAIDSHRRRFGSDPAGVWPAEGAVSEPLLDILIEHGR
QWCASGEGVLANSLRKSHPDQPLPARNEYLYRPYRFETGNSSILCFYRDD
QLSDMIGFEYSKWFGHDAAENFMQRLKQIRNEIHNNPAPVVSVILDGENA
WESYPYNGYYFLNDLYEMLEKSPDIRTTTFREYSAGLSESKVKVTRLPAL
TAGSWVYGSFSTWIGDHDKNLAWELLCAAKHSYDLVMQGPRLTDEEKTAA
GKQLASCESSDWFWWFGDYNSPHSVESFDFAFRRNLANLYRLLKIPAPVT
ILEPISRGGTSTGESGAMRHAAS
>NE1714 Undecaprenyl pyrophosphate synthetase family
MPLTPSSTRGIPETGAIPKHIAIIMDGNGRWARKRFLPRIAGHTQGVEAV
RGTIKACIERGVSHLTIFAFSSENWRRPAEEVKLLMQLFLAALEREVTGL
HENGVRFRVIGDISKFDPKIVDFVQQGEALTAGNSRLNFTVAANYGGRWD
IMQAVRKMITENPDSAVTFDEPDIARHLALADAPEPDLFIRTGGECRISN
FLLWQLAYTELYFTDTLWPDFDASALDEAIASYRKRERRFGRTSEQIAGQ
QENKNTVSNEDRV
>NE0220 TPR repeat
MFLRAFLPLLLLSCNVAQAALFGDSETREQLEALRTKVLEMEARMQRTEE
VLMGQALIELHTQAENLKEEMGKLRGQIEVLEDENRSLRKQQKDFYLDLD
NRLRQLEPGSAGTAASDSRISSPSSEQLAADTKDIKSPASGKTAAVLQLP
DTAQRNRYDAAYASIKSGDYSGAVTGFESFLAQYPQSALAPSAAYWVGNA
YYALRDFDKAITAQQRLIEIYPGSPKVADGLLNMASSQAEMGQKAAARKT
LEKLIASYPGTEAATKAKQRLGTLK
>NE0632 conserved hypothetical protein
MADRLLTRILDDEDSDDPILSVVNIIDVFLVIIAGLLIAILENPLNPFAA
QDMVVIRNPDTPQMEMIVRKGEEMKHYKSTGEIGQGEGVRAGVAYRLKDG
SMIYIPEETDKATGTSSK
>NE0893 conserved hypothetical protein
MSITLITAAPGAGKTIFAVWNIIKPAVEADRVVYTAGIPELKLPAISLSY
SQVKRWADRELVEVENPSGIPIPDDEKPSRLQNITEGALIVIDEVQYLWP
ASGSREPGEDIKYLTKHRHHGLEFVLITQAPQLIHKNVLAVVDKHIHMLS
DWHGRKRYEWPEYCATVRATSSKLKAVSQRYELPKEAYGLYRSASMHITQ
KRRKPLMAYVVPIAFFALIYTGFTFKDRFLDSKSSESPKVVKDEQKTDDP
HLSQQKLTSAPVTTVTTVARPVTLALVSDQIDWSKVGACVATQAKCICYG
KSAERLVVPPETCRKAVSSGWPGQETKV
>NE0850 Phospholipase/Carboxylesterase
MPDNSFQLSAIDITTGSNPEYTILWMHGLGADGNDFVPVVQALDLPEIPI
RFLFPHAPQQPVTINSGYIMRAWYDIQHTDFVEQEDETGIRRSQHAIVEL
IEREDRRGIPPDHLILAGFSQGAAMALHTGLRHPDRLAGIIALSGYLPLA
HKIEREAHITNRITPIFMAHGNDDPIVPIELAHASLQQLREYYYPVTWHE
YPMEHTVCDQELVDISRWLKTILK
>NE1792 hypothetical protein
MKVLLLCESGPLGLKVLYCLKGMNAFVQTAGPRGARILKYSRYTDGHSEV
QFWQDGLPSVKTQQLLRERVQGENFDIILPSDMGSAAFLAATKKAYPDLP
CFPCSDQVILDTLHNKWSFAQTCAKYRLPIPPTILLTSSDQLDTSTLDPV
GFPLIVKPLEAESSHGVVRLDNLQALRTYIEQDSHYAQLPLIVQSYIPGR
DIDISILAANDRILCNTVQSWLEDGVLEFTQHPEMHAIATRLVQAFHYEG
LAHFDMRIDARDGKLYVIECNPRAWYTISASMWQGLNFIEMGIHYTKTGT
LPKISEKSGEGRYCLAGSLWKQLLLPHKGWKNLSVGSVRGFVQAITDPVP
HLYSKLT
>NE0289 Helix-turn-helix protein, CopG family
MSQITLYLDDEIQALIEQRAKASGLSKSRWVAEFITKYATQEWPQDCLEL
AGRFADFPLREEANPLPV
>NE2021 Glycosyl transferases group 1
MEQSATPACSICFPDSICRISPSSPLASPNGDEYHMKIALLCSGLGNVQR
GHEIFVRDLFDMLKGSVDITLFKGGGEPAPNELVIDNVSRNAPSLQNIHL
AVSPRWKVAMQEQERIRIETETFAHAALRPLLEGGYDIIHCLEQEVCNRL
YGFRHLFAHTPKFLFSNGGAIPAAKLSNCDFVQEHTEYNLQHSAREKAFC
IPHGVDIQRFNPSISSDFRARYGIAEDAFVVISVGIICYWHKRMDYLIKE
VAQIPEAHLLIVGQECADTPAIKALGEKLMPGRITFTKMPHDELPQAYAA
ADVFALGSLFETFGIVYIEALAMGLPVFCTEHPNQRSIVREGVFLDIKQA
GALRDALLKRDPAQMRYLRERGPQIARETYALEKLKSAYIEHYQRIAASE
VSLPRYTFKTKLQDNVRSLYRRLSRG
>NE0380 Iron-sulfur-dependent L-serine dehydratase single chain form
MFISVLDLFKIGIGPSSSHTMGPMTAANVFRDHVDQLVSQTPDDLNYRVY
CVLKGSLAFTGRGHATDRAVALGLHGHSPVSLAGEDVNALTAKLWATDTL
QLTSGREIGFNPEDDIVFDKSDPLPQHPNGMIFYLLDEEGKKILSETFFS
IGGGFICTLAEINQLNAPLKMESAGLYPYPFDSAIGMLEMSQQSGLSIAE
MKRANELTRMSAQELEQGLDSIWKAMCHCVEQGLTAEGRLPGGLSIRRRA
KDLYEQLQMHPEKATLNDWLCAYAMAVNEENAAGHMVVTAPTNGAAGVIP
AVIYYLLKHEGGTEQQVRDLLLTAAAIGGLIKHLSSISGAEVGCQGEVGS
AAAMAAAGLCAVRGGTTEQVENAAEIALEHHLGMTCDPVGGLVQVPCIER
NGFGAIKAHTAAALACRGTGEHFIPLDNCINAMKQTGLEMSHKYKETSLG
GLAVSVSITEC
>NE0795 Glycosyl transferases group 1
MKLAPEETENKDKKDTNNPGIVVFSSLFPHSGQPNAGLFIRERMFRVGKH
LPLVVVAPVPWFPFQKILRRWRPHFRPDAPYMELQENIEVLHPRFFSIPG
FFKFLDGFFMALGSLPVLWRLKRRFDFKIIDAHFAYPDGYAASLLGRWFR
VPVTITLRGTEVRLSHRFLYRDLMRSAMIRAAKIFSVSDSLKQVAVAIGI
QESKILVVGNGVDLDKFCCIDRSEARKSLEIPEDIPILISVGGLCERKGF
HRVIACLPDLISTYPGIQFLIIGGASAEGDWTERLQQQVSESGFEKNVRF
LGVMPPDQLKIPLSAADLFVLATRNEGWANVFLEAMACGLPVVTTDVGGN
AEVVCRSELGKLVPFNDQQALCQAIANALEVNWDREKIIMYAHANSWDQR
VAVLVEEFVKITCQSELAEQYR
>NE1412 4-diphosphocytidyl-2C-methyl-D-erythritolsynthas e
MAEFIALIPAAGSGSRMGEDIPKQYRPLASKPMIYHALRTLCSAERIGMV
CVVLAPDDTEWARHDWSEFAGKIRVFGCGGATRAESVTNGLKALRAENHV
QDQDWILVHDAARPGLSRALVERLLDQLAQDEIGGLLAVPLADTLKRADD
TGRVMCTEPRERLWQAQTPQMFRTKLLLEALEKAPTGITDDASAVEALGF
SPKLVTGDAYNFKVTYPQDLKLAELILQERTATQN
>NE0435 possible pseudouridylate synthase family 1
MSADRLKNQKPRRIRKASPASKQSGAAPAAIETSRLQKLLARNGLGSRRE
IEDMIAAGRVSINGVTAKPGDRAGPGDMIRIDGKIIRFRLAALLPRVIIY
HKPEGEIVSVRDPQGRPSVFDKLPHIRSSKWIAIGRLDFNTSGLLIFTTD
GTLANHLMHPRYQMEREYAVRIVGELTPEQITRLTTGIELEDGLAKFDQL
LDEGGTGTNHWYRIMLKEGRNREVRRMFEALGLTVSRLIRVRFGPVNLPP
RLKRGMWIELMDIEVEQLLKLVSQPGAGIPDKPPSMQAGRRS
>NE0048 Class II Aldolase and Adducin N-terminal domain
MSGETFDLDKQQLMEREFAQMESKLPGAVYSIRQKVALTCRILFDNGHDS
GLAGQITARGETSGTFLTQRLGFGFDEISAGNLLLVDEDLQVLEGDGMPN
PANRFHSWVYRARSDVRCIIHTHALHTAALSMLEEPLVISHMDNCVLYDD
VAFLPNWPGVPVGNEEGELIAGALGNKRALLLAHHGLLVACSSIEEACLI
ALAFERTARMQLLASAAGRIQPIEPVLAREAHDWILRMSRSASAFAYYAR
RTLKQCSDCLLE
>NE1145 conserved hypothetical protein
MAGLNPQTPVPTELKLHRKSGVLSVTFNDGKAFNFPCEFLRVYSPSAEVR
GHEPGQEILQTGKKNVTITHIEPIGRYAVRLDFSDGHNTGLYSWDLLYDY
GLNQETMWQDYLQRLQAASGSRETG
>NE1430 Peptidase family M23/M37
MIYYQSPRKRRSILAQKLAVLAQKKVRWITILSSLPLFGMVTAFGIVPDA
PPHEIPGKKIVRPLQLPEKFASSDPEMTFWHQESIRRGDTIVAILARLEI
SQEDKNNFLDATRGSKAMSQLKPGEVVHAETRPDGELLTLRYFYGNGEQF
QVEKVDGAFELSEQSGKSDTHIQIGSGVISTSLFAAVDKADLPSTIASQM
IDIFSSSIDFHRDIQKGDYFTVVYESRQDEGGKIQVGRVLAVEFFNKGKS
RRAVYFQPSDGKGEYYTPEGESLRRPFLMAPLKFSRISSGFSNARYHPIL
KRWRAHRGVDYAAPRGVPVMATADGIVEYKGWQNGYGNLVVLKHNAQYSS
AYGHLSGFDKRLQKGKRVRQGDVIGFVGSTGMATGPHLHYELRVNGVQRD
PSRIVMPAAQPISKKHYSTFRRQTSELLSQLDLLRNSSFAALN
>NE0314 hypothetical protein
MGIPSDFSNTGLTGFNVSSTKLVKIEQQYKGVKGKKFISPGGGKFVRIDD
PGAVDSNGLRTADASNTSISLETGNTWLPRLHADGTLTCGGKCAWLECTN
VTIPKNKRLWFRYAFLRFSSLPADSFSVLLCFPNDDTSVPPLPPYWICSV
KELQENRGNINQTDWTECFVEIDKNADFHGTLRWVVATGHNLADQNSIPD
NTRFTRPGCLLIDAIDIR
>NE1502 Integral membrane protein, DUF6
MFACMGVLVKLAAAFFSNTELVFYRSLVGVITTFLVMRAYGMPLVTEHWK
SHCWRGLSGLGGVLLFFYCILQLPLATAISLNNTWPLFLAFLAMILLKEE
FSWLLAGALVVGFIGVIFLLRPTLAEGQWYLALIGIGSGLFAGIAHFHVR
QLSELGESDWLTVFYFTLVCTVATGLWLTFTAFSAVSLQSLTLALGIGVT
ATLAQLAISRAHRGSNILIVGVLSYSSVLFAGLMDLFFWDARLPVSAWVG
MGLIILGGLLSIRGIPVRNSSIVTLDD
>NE2250 putative mannose-1-phosphate guanylyltransferase
MPDDPAENCHFPSDQTNTYMKHTLIPVILSGGSGTRLWPLSRQQYPKQLL
PLVNRYSLLQETVRRLDGLEDIAVQSSIIVSNEEYRFVIAEQLRVLGKIG
KIILEPYGRNTAPALTIAALAAMQDSDDPVLLVMPSDHVITDISVFQAII
HKGMTLAESGMLVTFGITPNAPETGYGYIQYGTALHEPGTFRIARFVEKP
DLATAQNYLEEGSYLWNSGMFMMRASIWLTAIDTCRRDILEACRAAWQSG
SIDGDFYRIDKTAFAACPSDSIDYAVMETLSARPDMLPPGAIIPLSAGWS
DIGAWDALWRTLQKDESGNVVRGDVLLHACHDTLAFSEHRLVACIGVENV
VIVETSDAVLVAHQDKTQEVKHIVDILKQRNRPEIRSHRKIYRPWGWYDS
VDRGERFQVKRIVVNPGAALSLQMHHHRAEHWIVVRGTAQVTRADKTFLV
SENESTFIPLGTPHRLENPGRVPLEMIEVQSGSYLGEDDIVRFEDKYGRK
EN
>NE2005 norQ protein
MTPIPFYVPVGNECELFETAWQRRLPLLLKGPTGCGKTRFVTHMAARLQR
PLFTVSCHDDLTAADLTGRFLIKGGETVWVDGPLTRAIREGGICYLDEIV
EARKDVTVVLHPLTDDRRMLPLERTNEILHAPDTFMLVVSYNPGYQNILK
SLKPSTRQRFVALSFNFPPPEIELEIIASESGLARDRCTALVNLATRLRL
LKDVDLEESVSTRLLVYCATLMAAGLDPYQAAQAALVEPLSDETEVQQGL
LELIHATFG
>NE1059 putative urea transporter
MVIIDWSLSVDRTIPKPLRIILRGVGQVFFCCNAVTGLIFLMALYIGGVT
AGLAATAGVFSSTIAAHLLGFPEKDIDAGLYGFNGTLVGPCLFLFLENSP
QLWLYVVLASILSTIVLAALMRILQPCDIPASTAPFILTCWMFMAAVYAF
DSFSRGPLLPTAGIPVEAANISMIPVEIGYMAAAKGISEVMFADSVVVGI
LFLAGIAIHSLRGAAMALAGAIVGIVIPVLLGMDKSLIEMGLYAFNPVLA
MMAVGWAFLKPSGRSMGLAFLAGIFTVICQAGLTGFLTPLGLPVLTFPFV
LVMWVFLLAASRSKYWEYLNQGK
>NE0294 Cytochrome c, class I
MNRSKPFFLSNLLVQWLAMLIFMVSLHGTAAAAPADDERAQTIVHMLDYV
GVDYPEFVQDGKVLNAEEYEEQREFATQAITLLEQLPKVPEQPALQQQAH
ALLARIEAKAPGSEVSALASQLRAGVIQAWKLSVAPRQAPDLMLGAKLFA
QHCATCHGAEGHGDGPLAKGMEPAPSNFHDEARMRQRSLYGFYNTITLGV
GGTPMRAFTEFSEADRWALAFFAGSLRADPEAVARGEVAWREGQGKAAFD
SLKSLVDQAPGDQAEAGTVLDAVRTYLTQQPQALQAVAPAPLAFSRAKLD
EVVQAYTRGDREGARRLAIAAYLEGFELVESALDNVDALLRAEVEREMMA
LRAAIGDGQSAEAVSAQVVKVKALLNRADDVLSGSSLSPMTAFVSSLLIL
LREGLEAILVLSAIIAFVIKTGRRDALPYIHAGWIGAVVLGVITWAVASY
VINISGANRELTEGITALLAAAMLLYVGWWLHSRANAQAWSRFIREQVNV
ALGKRTLWAMAGISFLAVYRELFEVILFYEALWVQAGVEGHTSVLWGIMA
ATALLVLIGGAILRYSVRLPIGPFFAVMSVLLAFMAVVFVGNGVAALQEA
GMLDATAVRFISLPLLGIHPTEQGLTLQALTLLLIAGGFWFNRRKTA
>NE2467 Band 7 protein
MYTDTVSVITLILTFSIFFLASSLKVLKEYERGVVFMLGRFWRVKGPGLV
IVIPAVQTMVRVDLRIIVMDVPAQDVISRDNVSVKVNAVLYFRVVDPQKA
IIQVEDYNMATSQLAQTTLRSVLGQHELDEMLASRDKLNSDIQLILDEQT
EAWGIKVSNVELKHVDLNETMVRAIARQAEAERERRAKVIHAEGELQASH
HLLEASQVLANQPQALQLRYLQTLTEIAGEKSSTIVFPLPIELLTILQKM
TEEQSDNPTSK
>NE0299 conserved hypothetical protein
MDPVTHTLSGALLMRAVTSSHTQYAQRLPLRERIVAGSVAAAFPDSDVVL
RLIDTLTYLNWHQGPTHSLVMLPFWAFLLAHLFSRFTGGHYPWRSFFVPA
CLGIMIHILGDLVTAYGLMLLAPFSTWRFSLPLVFVIDPWFTAIILAGLV
LSAIFPTKRVYAVASLIGIVAYVSFLWMLHEEAMQAGKVHAAEKMLDQQT
VSVLPQPLSPFNQMVIIRDDTELHVARINLRRSTLLKCTDTSNLLCNMTA
AYRPLAMANWRSYRLHDSRSPEYAALSHEAWQQPVLAPFRQFAQFPVLDG
IDHPAQNICVWFIDLRFQFPGLPPSFRYGVCREEENSLWYLQRQHGAFWI
D
>NE0731 TonB-dependent receptor protein
MAKVSSRSWGIFPIRPVVSALTLAFGGMAGANASEPVPLPEIKVKSGKTS
ERSNEYKVDKSASQKFTAPLIDTPKSITVITDEVIKDSGSLTFQDALRTT
PGITFGSGEGGIASGDRPFIRGFDVFSSIYVDGLRDLGTQTREIFAVEQM
EVLKGPSGSFDGRGSAGGSINIVTKQARAGNFFKGSAGLGTEKFKRGTID
GNYTIGENVAVRLVGMAHKADTPGRDGVDVERWGLMPSITLGLNTATSAT
FSWYHFETDDKSDWGIPFIQNTANGGIPEGKPVGSRSAWYGVKGRDFQDT
SADIGTFKISHAFSDNLVVRNTTRYSITTNEFFVGRPNISSADFAAGLVN
RDATRNRGTRTETVANLTDVSFIFDTGFIKHSLNAGFEVSWEDYRNRTYA
GGAIINPADRLTPLGNPDSSVAFDPVTRNAYPSAEIEAHNKSAYIFDSME
LTEKILFNAGIRFDNYQVDLQNRNASTGANTTSFKQNKSFFNYQVGAVYK
MQPNANIYAAFATSSSPVGLSMGDFGYAGGDLNANTESLKPERTETYEVG
TKWHVLHDLALTAAVFHTIKTNARVNVGPSVENAGRAVVNGFELGFAGNV
TDKWNVFGGYTFLDAEQTRVGDSTDQNAVGSAGSKGKQLHGTPKHAASFW
STYKVLPRVTLGGGLFYTGKVYADPSNNGYLPSYVRFDLMAKYNISQNLD
VQLNVQNLTDKRYFNTTYFRHYAIPAPGRVAFVNLNLKF
>NE2014 hypothetical protein
MSDLYTRFVANVLFPLKKHDTVQVHGKIEVLQWWLSECILRLQAQCLRTL
LEHAGTHVPHYRDLFARISFDPAKVDSVADIRASSDALRADNAVGLAHFN
TEGSSGELSIFFIGTKRVSHDVAAKWHATH
>NE0034 Aminotransferase class-V
MTTESFRPEKKITTFYPPQRTLMGPGPSDTHPRVLSAMARPTLGHLDPVF
TEMMEELKGLLRYVFQTTNLMTFPVSGPGSVGMEMCFVNMIVPGDKVVVC
RNGVFGGRMIENVERCGGIPLVVEDKWGDPVDPQKVEDMLKKNPDAKIVA
FVHAETSTGVQSDARTIAQIARKHDCLTIMDTVTSLGGTPVYMDAWDIDA
IYSGSQKCLSCPPGLSPVSFSERVVDLVRNRTEKVHSWFMDISLLLGYWG
ASRTYHHTAPTNSLYGLHESLVILYEEGLERSWARHRRNHEALKAGLKTL
GIEYVVAEPYRLPQLNSVYVPAGVDEKEVRRRLLDSYNLEIGAGLGDFAG
KIWRFGLMGNSSKLENVVFCLDALEHVLIDMGVKVNRGTASSAAHQYYAT
NPVLA
>NE1789 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE0484 conserved hypothetical protein
MNSFLIKTGYILDVVTGIWSRQDYTGIAYSDGDEVEQRIASIINEASDLG
VLSSELRQHCTDWPSLYHLSGTRANILRPFENDLHGDILEIGAGCGAITR
YLGECGGNVLALEGSPRRAAIARARTRDLQNVTVVSDRFDLFSADQKFDA
ITLIGVLEYANLFTPGESPALNMLERVRALLKPEGKLFIAIENQLGLKYF
AGAPEDHLGQPMYGIEGRYRNDQPQTFGRIILAEMLKEAGFSACDFLAPF
PDYKLPVSVITERGFKTEKFDAAAFAWQSVHRDPQLPPILAFSPELVWPT
LAQNGLALDLANSFLIVASSNKESNLESSTLAWHFTTERKKEFCKTTQFV
QTEADGIEVRYKSLAPALPRPVIGKSLKFDIPDRASYVYGIPLSQEFIRI
VTRDGWRIEELCILLRKYLDIVANVACTKWQIEELSEPTIKLPGSTFDCI
PQNIITSTDGGAYIIDKEWEATEPVQIGFLCFRSLITLFHNVTKFGHSAD
EFGKTYINLIQAVMNGLGWLPDKGMIISYVNLEAVIQTEVSGRVVTPKET
MDWLDGPISKLNNLNQAVTEKDGQIASLSQAVIDRDKWLEQYNQQIKALK
GSHSWKITSPLRIIGKTLRKQRTAQRASMEPGAVPTHKDYPLWRKRISYL
TTRYNNGVQRHGLLSSLPLAMRAFHRFGSVWMEKQLRRRTYERRLRELRT
IIIEHTDFIDLFHVPMGWSTPLFQRFQHMSLQAAQLGGLALYGGHLQVDK
DLFVFNRVEGNVVVFDALDDRVVQCVFDALLEVRQPKMLRLQSIDLATKV
SDLEHFIRDGIIIVYEYIDEISEEITGAIPAFVIERHKWLLRNEAVFVVT
TSDRLFTEVTRYRTSQCILSTNGVDLPHWRKTAPHPPADMEPVLASERIV
VGYHGALAKWIDYDLLKRIASDGKYELVLIGYAHDDSLEQSGILDIPNVH
FLGSKSYFILNEYAVFYDIGILPFKRNELTESVSPVKLFEYMAASKPIVT
TYLPECAKYKSCLVSGTHDDFLSNLAAAAVARDDPTYMKALADDAEKNSW
AEKAALVYQLVGIRTAKS
>NE0387 60Kd inner membrane protein
MDNKKIVLLIIFSTSLLFLWDAWIKEQEKFNNPPAITQADSSAGSTQSRN
DDSLPVPGSELTSSQASPDTNGIPASGGNGDSVTPRLLPSGEQIRVVTDK
VIAEIDTIGGDLRRLELLQQPSSEDKDTPYALLYSEAARTYIAQSGLVGE
GLPNHKTTFRAESDIRNYELSSGEDKIVIRLLAPEAQGVQVIKTYTFHRD
SYVIDVGFEVENKGDATVRPFAYFQMLRDGNPPPAKTMMIPTFLGAAVYT
EEGKYQKIPFSDLDKNKADYPANANNGWIAMLEHYFLTAWLPQQQTPREF
FAKRQSDNLYSAGVIVPAGVIAPGEIAATTMPFYAGPEEQDNLEGLAPGL
DLTVDYGWLTVIAKPLFRLLSFYHSWTDNWGVAIILLTMTVKLLFFPLSA
AGYRSMAKLRLVTPKLKRIQDQYKGDRQRMHQAMMEFYKEEKINPMGGCF
PILVQIPVFIALYWTILAAVELRYAPLALWIDDLSSPDPFYMLPLLMGIS
MFVQTKLNPTPTDPLQAKIMQIMPVAFSAIFFFFPAGLVLYSLVNNILSI
AQQWKITKMYGTAPSKDTPEPPVSKQVNSSENPETTANSPADSPKQPQTP
ANNPRKMYKRTRKK
>NE1011 putative transmembrane protein
MSNEKVSRSRLKLILMMLVILSPIVISSFLHRSNFRPDHTVNYGELLEVR
PLQGEATNLTDNTIFRIRQLKGTWNLLIIDSGKCEEYCQEKLYTLRQVRL
AQHVDKDKVQRVWLINDDIRPDQETIDKFKGTRLVLANGKDLLKEFPAEN
KREDHIYVVDPMGNLMMRYPRNADPRKMVGDLKRLLKLSHLEH
>NE0549 TonB-dependent receptor protein
MLRSGLFLAVVMLASPIVQAQISATQQQPALPLAAALQQLAVRHNLSIVF
DAEIVKDKQASPLSENLSIREALDTLLNGSGLQAKEIAPGRYSIVKATTG
AMQQTLPEMTVTGASDPDSPYSTQYKVPDTTTATRTKTPIMETPMSIQVV
PKSVMNDQQAITLEQSLSNVSGVFPGLGFNGVETFNLRGFNTWDYYRNGV
RFQSALSQTGNREVANLERIEVLKGPASILFGRIEPGGMINLVPKTPQAT
PYYSLQQQFGSYNLFRTTLDATSSLNQDNSLLYRLNFAYDNEGSFREFVN
SHHFFVAPVLQWRISDRTQITAEMEYKTGKYAHDYGFPAIGNRPANLPVN
RQLGESFNSAKFDEITAGFHWSHAFNDNWEIKHRFYLQHTDEDDNVALPL
ELRADNRTLDRFFAGFRNNKIETYTTNVDLTGHIDTWGIKHTLLMGGDYY
NFRNHGLMIDNFNFPSIDIFNPAHSGVAVRDPADDIPYDTKEDWFGLYFQ
DQIKLPYNVHVLAGFRYDNAEIKDNISGRESAQDRISPRVGVLWQPIPAL
SFYGNYVENFGAPNLYGSGLDGQPLPAETAQQWEAGVKTEFFDGRLSATL
AWFQLTKQNIATLHPDPQLALQRISVLTGEARNEGIELDITGELLPGWQV
IANYAYINSEVTKTNDNTLGNRFPNVPEHAGNIWTTYAFQNETLRGLKVG
GGVTLRGKREGNLENDYQMPGYAVFNLMTSYGMKVGKTRVTAQLNVNNLF
NEEYFPSSGGFGRTRIAVGTPRVFLGSLRVEY
>NE0730 Ferric uptake regulator family
MGSRAGHPCSLKLQAAGLRSTSTRLSILKVLEADSQRWIEGEAIFRELIA
RGTTISLATVYRALKDLDHRGVLLREWRVGASGGKAVYRLSSHDLQDRKG
AIVCRQCGASVPMDDSVLLHEHLRQLASRHGFVLTEQPVTIHMTCGRCAG
ASDKKALHEKNTSPTLQHSMPLLPARRRKNKASDVTWPIECE
>NE2145 hypothetical protein
MRKFAAFILFSVVGTSGWCGDTIRPGLWEVTTRSDLLGLIAHVPSEQMQQ
ITSLARQYGLKVPRIQEGAAISKVCITPEMAEQDIPSHFYENQSGCSVVN
ASRSGNRYQVELVCDNPRFKGNGHAEGIFSTPERFTGKTEFNSTVQGTPL
YVYAETSGRWIGAQCEPMR
>NE0107 ATPase component Uncharacterized ABC-type transport system
MAIIDPVIEITGLTTRFGSHIIHENIDLTVYKGEILALVGGSGSGKTTLL
RQMLGLVTPAQGSVRILGARRYDCGREEQRKLRTRSGVLFQQGALFSALT
VFENVALPLREMRTLCEGTIRELVMLKLGMVGVEAQHARKMPAELSGGMV
KRVALARALALDPELLFLDEPTAGLDPALSEGFVDLVRSLREELALTVIM
VTHDLDTLVAISDRVAVLADRRIVALGPIPEIITIDHPFITAFFGGMRGR
NALRALQQH
>NE1752 hypothetical protein
MNIRSTGLSILIGALFAIPATTATAVSPVTDTSIRQGTASYLILADAHQH
QGHQGGSAGQGHSGGSGHAGHGQGGQGEGKGRHGGGGGHSGHGGHGKGDM
EHHGHPPSYAHSVAMQAEALGLSDEQLGKIVRFHLKEDKQAHERIKQKMM
ESMKAFRKAVGEPATDDETLRKLGQAHIDSFNEMVKYHIDERKAVRSILT
PEQIGKLKAVKSDHDH
>NE0090 Guanylate cyclase:TPR repeat:SAM domain (Sterile alpha motif)
MMNDRVSAWLDSLGLTVYHESFEHNAITWDVLPELNEDDLEALGVLLGHR
KILLRAIAQLSQNTASPGPGSILAGVIPEEQSFPLERNQAERRQLTVMFC
DLVDSTALSCRLDPEDLQDVIRQFLVACSQAIGRLNGYVAKYMGDGILAY
FGYPHAHENDAERAVHAGLAILDTVKVFNLNNPHPQLSIAARIGIATGQV
VVGELMGQDTATERSVFGETPNLAARLQALAKPGQLIIDLATKRLVGNEF
EFSDLGASSLKGFDTPVHAWQVLSIKPSASRFESYRSGQLTKFVGREQEI
SLLLGRWHEAVSGEGQVVLLSGEAGIGKSRIACSLRDHLADERHQVIQFQ
CSPYHINTALYPVINFLRQAAGFANQDSAETQLNKLDAMAAKSGIDDPKT
VSLLADLLLIRGDHRYPPLNVSSEKRKDMTLEALVQHLQKLAGHCPLLFI
VEDAHWLDPTTLELITRIIGHIRQMRMLLLITSRPGFKPVWAEYSYVTSL
TLSRLPRRHSAELITTMAGGKVLPPEVQQAILAKADGIPLYIETLTENVL
GSGLLTEGNDSFTLQGPVKKLPIPDSLQALLMERVDRLGSAREIVQAGAA
IGREFTYELLQATVEVPDSELKDALDLFVASGLVLQEGEIPLATYYFKHA
LVQEAAYSMLPRKLRRALHARIAKALESRFSERVSMEPELLAYHYEQAGL
AGPAVEYYHRAARCDVERSASIEALNHFNRALELLKELPQGPERDTLELE
LLIARGVPLLSVKGYASDEMEHNYRRAKDLLQEHSSFVHQFRAIRGLWVF
HLVRGQLANARGLAEDLLALARREQSPELLIEAHHALGATCFYLGQFDEA
RTHLFAAKSLDDSNQHDSQVFFYGQDSGITARILLARTLWILGEVDQAET
LALETMGMARELEHPFTRVFTLIFLAWIYSGAHNAEKTLEITNEAIAIST
QYSFELGLAWATASQGWALADTGQEEGIAKLIDGLSATRATGARIHDSCT
LALLAEVYLRRNRIDEGLAAIEEAQKLAVTGGALFWQAELFRLKGELLQE
QSGESVQEAEECLCEALKIAQDQHATMLELRAATSLAKLWRKLNKIDNAK
YILNDVYFRFSECTDNLDLIEAKTVLEQLGV
>NE2081 sigma-54 dependent response regulator
MRNLPVLIVEDDPDLLEALCATLKLDGYTTLAASSGEQAIDLLRTHPAGL
VISDVQMRPMDGYTLLQAIKRSHPWIPVVLMTAYGEVDKAVAAMRSGASD
YLLKPFDPHSLLAHVRHYLLTVSDEDESEIVAEDPRTLTLLSLAKKVAAT
SATVMLTGESGSGKEVLARFIHRHSPRASRPFVAINCAAIPENLLESTLF
GYEKGSFTGASQAQAGKFERANGGTLLLDEISEMPLGLQAKLLRILQERE
VERIGGSKPVKLDIRLIATSNRDMPMYVQEGRFREDLYYRLNVFPLEILP
LRERPQDIVPLAQRILDRSAQPRCVLAQSAIRSLERYTWPGNVRELENVI
QRAMILATGSIEAEHLNLPGSSARDTFRGEAQREGDSPLRDIKSLERDHI
LQTLAAVNGSRKLAVQKLGISERTLRYKLQQYRLARC
>NE1029 Bacterial extracellular solute-binding protein, family 1
MSRSFIFSGHVGRWCSSFLLIVFLFAVPISSQAEEIVVYSSRIEQLIKPL
FDAFTKETGITVKFTTDKEGALLTRLLAEGKRTPADVLITTDAGNLWEAA
RNGLLKPVTSPVLQANIPENLQDPQGQWFGLTVRARTIVYNTRKVKPEEL
STYADLADPKWKHRLCLRTSKKVYNQSLVAMMIAELGETETERIVKGWVD
NLATDPVADDTLALEFVAAGRCDVTLVNTYYYGRLMRENPTLPLAIFWPN
QATTGVHINVSGAGVIQYSKHGAAAVRLLEFLSSEQAQKLFVELNMEYPA
NPKVEYGELLNSWGSFKGDPRSIVQAGELQADAVRLMDRAGYR
>NE0189 Helix-turn-helix motif
MPMTEKELKERDAKRDLGAELLESVVQMKSGHAGRIHKVEISPVTAARLK
SGLSQADFARLLGVSIRTLQDWEQGRRQPSGAARTLITIAERRPDVLKEI
TL
>NE1369 Glycosyl transferase, family 2
MKSLSEHVDFSESKTGNSSSESETGQTLIFLLADLAQRQQIDNTLQSFLN
ESNAEEILLVANIRCADDEVWLRKYCARFPSVNAILNRRAKSATVLEAQG
RWLARYRSQVVLSVDCDESGVKILRKVLPFFTPPVAMEMQKSTLSCQPGE
QGLTLIIPAHDCDLYLTDCLASALVIESDKTLVVVDDGSLDDTHGVIAGF
ASQYPEIIDHISICKASGLPAVPRNIGLASVQTDTFGFIDADDWISPGEY
TSALYEMQQMSADIATAAGFTRHYPDRTESLLQKCFTIPENEPANVLEGS
FFSNIWNRIYARSILLRSGAYFPRTHFSEDFCFSVWSHFYAEKTIQVSVS
FYHHRYGRQGSTTDNRAGSGAFSHITEFHRELEMYLADPFMRKALATILR
KRMGSFQYTLKLLPPELIQPFKTALQEMLKPLNRYFISDKEPNKRDIATF
AALGISDLLYQRPRSKFLGMRNWGRKWLGMYQQLTKRESHENS
>NE0474 conserved hypothetical protein
MMTLFQRPFVAQLAHRLDGMQPLIQVLTGPRQVGKTTGVRQLMAQCSYPQ
HYANADDVLVSDRSWLLEQWQQALLLGEGALLVVDEIQKVVNWPETIKAL
WDAQPGRLRVLLLGSSALQIQSGVTESLAGRFELLRVHHWTFAELHAAFG
YDLPRYLAFGGYPGAVVLEYDPDRWYAYMKDAIVEAVIGKDILQSRKVAN
PALFRQAFEILCAYPAQEISYTKLLGQLQDKGNTDLVKYYIELYGGAFLL
HALQKYSPKTWLARSSSPKMLPACPALYSMVAGVDVMRSTEQRGRAFELV
VGAELMQLPGQVFYWRERNDEVDFVYQYRERLYAIEVKSGRKKSARGLDA
FCAQEPKALRVIVTPENFAQFSAEPRDFLQQVAI
>NE1532 TonB-dependent receptor protein
MKGTKRFEQSRQHLLKAGDAMIRKPLAVAMVGVLAGMTSVWAQAESKKEP
VQLETIEVVASGQGSLTVPTLVEAQHEMRKIPGGANIIDSESYANGRGST
LQDALGYSPGVFVQSRFGAEEARLSIRGSGIQRFFHLRGIKILQDGSRVN
LADGGADFQVIEPLAARYIEVYRGANALQYGSTTLGGAINFVTPTGYDAE
RFRARGEAGSFGYNRLMLSSGGVHGSLDYFASVSRYSQDGFRDWSEQENW
RMFSNAGYRITPDLETRFYLTYTKTNSQLPGELTKAELKDNPKQANFFNS
NIARYKRDFDLIRVANRTVMKLGEAQQLEFSAFYAHKKVWHPIYVVMEQP
SHDYGLGLRYINEMPIAGWRNRFVAGFEPSWGKVMDNQFVNINGEAGPRI
HKYDQDVSSYTVYGENQFYLLPELALVVGAQYTNMTLKQKDLHTGDPRQK
KNYERFTPKAGLIYDLNPEIQFFTNVSTSFEPPSFSELASWQATPVLASA
QRALTFEVGSRGRFGIAEWDVAFYRSHVRNELLARVDPVSRFPIGMVNAD
KTVHQGVELGLDMELAKMLYLRQMYTFSDFKFDGDPLFGNNQLAGIPRHF
YKAELLYRHPAGYYAGPNVEWSPEKYYVDHANTLSVDTYALLGFKLGKRN
KTGFSWFVEGRNLTNQKYAATVSVTDIAGVGAFGGANSALFFPGDGRSFF
AGLEYRM
>NE0134 transposase
MKASDFREWVGKITQLSRGQKEQTKHKLGGMVPRIEVAKWLESSFEPICP
VCQSNHFYRWGYQAGLQRFYCRMCKHTFTAISGTPLARLRHKDQWLNYSA
ALIEGLTVRASARQCRIDKNTSFRWRHRFLTLPAAAKANHLEGIVEADET
FFPVSCKGQRQLDRPPRKRGKQIHMRGTGKDQVPVLIVRDRSGATADFML
DAIDRKAIEPPLRTVLEKDVIFCSDGAAVYRSVARSLGITHRPVNLAAGV
RVIAGVYHIQNVNAYHSRLKQWMKRFHGVATRYLENYLGWFRWLDQQENL
SSPIVPLQAALGRENQFQLLTNT
>NE2133 Putative peptidoglycan binding domain 1
MKVINLDRFAKCVLMLSLVAFVAGCGDEPEETTAQNQIGVTEEAVQSDEI
VTGSVPVESGETESGEVMEEESLESILAKADEIIQRTGGALNGEVEPEPE
ETTASGAEETGSVAAAPATDSAAGSADDGQEVVKATPDLIRKVQQALTDA
GLNPGVADGKLGPRSRGALIDFQKQHGLAEGKITKETLRELGIDF
>NE2097 Integrase, catalytic core
MPTGQNNRTTVVRKNATCPRDLVNRMFHANRPNQLWVSDFTYVSTWQGWL
YVAFVIDVFARRIVGWRVSSTMSTDFVLDALEQALYDRRPADTLIHHSDR
GSQYVSIRYTERLAQAGIEPSVGSRGDSYDNALAETINGLYKAELIHRRA
PWKTRAAVELATLEWVAWYNHQRLLGSIGYIPPAQAEENYRQTQDNKTLM
DILL
>NE1132 transposase
MPDLCQGLFGQLFADKGYLAQWLTEALDQQNLQLITPLRKNMRPVPRTRF
EKVILRRRSLIETVFDELKNLCQIEHTRHRSLFNFIVNLMAGIVAYCLSD
NKPTLNLTRVNSLAKA
>NE2270 hypothetical protein
MSPKAAIHNRSGSFSDRWIAYCQQRNIPYKTVDCHSSDIIEELADCSFLM
WNWSQTYAEDMLIARGLIKAVESMGIRVFPNSETCWHFDDKVAQKYLMEA
LGVPMVKSYVFLDRARALEWARKTEYPKVFKLRGGAGSSNVFLVNDINKA
ERLINKAFSTGFPPVSRWNALSERWWQFKRDKTFISFFNISRGIFRALIK
NPTLERLPVQRNYAYFQDFVPDNDSDIRVIAIGKRAFGIKRMVREGDFRA
SGSGKILYDPELIPEACIRMTFDLAKKTRSQSLVLDFIFSDGSPLVVEMS
YAFASHGYLSCPGYWRNDMTWVEGSFHPEDFMVDDLIVSSVERS
>NE0574 major membrane protein I
MTDIHQAQTALGDVAARTLANATKTIPMMGTITPRWLTHLLQWVPVEAGI
YRVNKVKDPDEVEVDCSNKDERELPATYVDYEEWGREYVLSAVNTVLDVH
TRVADLYSSPHDQTREQLRLIIETIKERQEKELVNNKEYGLLNNVARNMK
IKARTGAPTPDDLDDLISLVWKEPAFFLAHPKAIAAFGRECTRRGVPPPT
VSMFGSQFLTWRGLPLIPCDKIGITGGKTSILLLRTGESRQGVVGLYQPG
LQGEQGMGLSLRFMGINHKAIASYLVSLYCSLAVLTEDALGVLENVEVGK
YHEYK
>NE2553 conserved hypothetical protein
MAMEKPVLWYIADPMCSWCWGFAPVIENIRQEYSAFLTVKIMPGGLRPGT
NTPLLPEKRAQILHHWHSVHITTGQPFTFENALPEGFIYDTEPACRGVVS
VSLIEPEKVFPFFAAIQRAFYVGQEDVAQLAILKKLAVDLGIPESRFTPV
FQSDEAKQRTLAGFQRVAQWGISGFPALVVESGTDRYLITTGYRPIEALR
QLLDTWLQQHGV
>NE0961 hypothetical protein
MTALTTDRRAQTRWSGRLLPSLGILLVTGGLVLLTWYTWLVLTPDTAPYR
YQQVTTGNASEYPELELDTWPDLTISQYDIHVEGTEQPVAQAWFGQRANQ
PQVLLNWKNQTREPLLALDQKASELSALAAAIDKHASRDALLLGWWDTSR
QLALLTGRDVLFHTPLHEPLIIPPEWQPHEQAIRAYENQQAGTPADPQEQ
ELFMRFAQSLVNPPANGLDDLRQLAGTRDTYLIVHVSDLYKLGLMYPDKF
GIAYKHYRMTGNLHGMISHLKTEMRTRGYYTYTLQSLSDELIRAFFLIDE
ASYDTLLAKLLPFTSQPSPVERTSPRLIYQQGGYWVYHLTAKAPAHNTLQ
SGKDSNETTDSTVSVDQVQ
>NE0228 CHC2 zinc finger
MIEQSFIQELLDRIDIVDVVARHLQLKKAGANFTACCPFHNEKTPSFTVN
SSKQFYHCFGCGRHGNAISFLMEHSGASFVEAVESLATHAGMQIPDQVSI
YPKIPDPGRVPSDKIKIDKEVEATSPLAGLYERMEQAAKFYRGQLKQSDQ
AIAYLKERGISGRTALCFGIGYAPPGWQNLSGIFTDYPADDSSHPLVQAG
LVVAHDGKKNYDRFRHRIMFPILDRKKKIVGFGGRALDGGEPKYLNSPET
SLFVKGRELYNLASASPAIRKSARVIVVEGYMDVVMLVQSGVENVVATLG
TATTAMHIQNLLRHTDEVVFCFDGDAAGTKAAWRALETSLPQLKDGKDIK
FLFLPDKEDPDSYIRKYGRVAFEGLLEKAQPLSVFFCNELSGRVNLGTSE
GRARLVQRAGPLLAQINAPVFGFMLTKRISELTGVGQNQLAAFLKTGKKN
RSSTLRPEASRPLSVTPYRRLIQILLHAPDYANKLDTNLLAVNDEQNEEK
VLLVALVDFLKTSACSMEEELNSVTILLHFDQTPHRVLLEKIVRDAHVKD
ENWNIDAEFTGGMERLREMQRRSRMAELHSRPLVSLTPEEKNELRQLMLS
>NE1811 ATP-citrate lyase/succinyl-CoA ligases:DUF184
MAILINEQTRIIVQGFTGRIGTFHAQEMIDYGSNVVGGVTPGKGGQKHLG
LPVFNTVREAVEQAGAEASIVFVPPAFAADSIMEAADAGIKYCVSITDGI
PTQDMMTVKNFLRLFPEEDRMMLTGPNCSGTISPGRAMLGIMPGHIYSRG
VVGVVGRSGTLGYEAADQMRRLNIGISTSVGIGGDPIIGSSHRNVLQKLE
EDPETKVTLMIGEIGGPMEVEAGLFAKENMSKPLVAYIAGLTAPPGRRMG
HAGAIISSAGESAAEKVERLKELGVTICPTPSLMGETVAKVLAGL
>NE2111 hypothetical protein
MHVWPVQDAKARFSEFLDACITEGPQIVSRRGAEEAVLVPIGEWRRLQAA
ARPSLKQLLLSDSARTEMLVPERGKARRRQVEPLR
>NE0098 conserved hypothetical protein
MRQQLLDITEIGPPSLDNFISSGNEEVLYTLRNLVAGNQQDRFYYLWGKT
GSGKSHLLQAVADAFSEQQCNSRYIDCNQDEPNFNPGTDCIVIDNVERLD
DAAQIRLFNLYNHLRDNKHGIFLASGTKPPAQLDLRQDLTTRLGWGLVYQ
VHELTDEKKIEVMQDYAIRCGFELPLEICHYLLKYEQRNLSSLIRLVHAL
DQLSLTRQRPITLPLLRELL
>NE0372 hypothetical protein
MYKRDTRFSMVFPALLAVVLFVPFVMNFLAHAWYGEKHFPPLTMMKLPAD
SSVISLPQEIKQYKPFEITLQLNTRELARRINDIVKKSHPGTELQGIRSE
VFPEMRARIAGDAFSIDPPEPQVQFFSGQGEMSRWSWIITPEKTGRHHLL
IELHLQTAETTREHPQVADLAEIQLFVRENPEAWMRTHGIWYALFTLLAA
GWWWKKRLIKKRKAAKEQ
>NE1957 Orn/DAP/Arg decarboxylases family 2
MSSFSSFEYRDSQLFVESVPLAEIAAKFGTPCYVYSSAAIRTTYQIFDHA
FGQRDHLICYAVKANSNLAILNLLARLGSGFDIVSGGELQRVLKAGGDPQ
KTVFSGVGKTPEEIRAALAANILCFNVESEMELMVLNEIAGQMGKIAPVS
LRVNPDVDANTHPYISTGMKENKFGIPAAEAGRIYSLAHQLPHIQVSGLD
CHIGSQLTEIAPFIEAADKMLDLLARLQTMGVTIKHLDLGGGLGIRYDRE
SPPSIEAYIKALCTIAENHPQQLLIEPGRSMVGNAGLLLTRVHYLKHTSH
RNFAIVDAAMNDMLRPALYQAYHSILPVTKRTGETKTYQIVGPVCETGDF
LGHDRNLALTKNDLLAVMSAGAYGMSMSSNYNTRPRAAEVMVDGSTVHLI
RARETVEDLYALEKLPI
>NE0376 ABC transporter, fused permease and ATPase domains
MLRYFERLINPFPSGKLTEPPATLYRFCRHYMKGAEIYFILLAIATFCVA
VGEAMLFGVLGTIIDWLAEKDPQAFLQEERLTLLGLSLFILIAMPVIVFF
HSVILYQSLMGNFPMVVRWLSHCYLLDQSYAFFQNEFSGRIATKVMQTAL
AVRETIIKTLDVLLFVTVYLATALLLIANADLRLCIPLIIWLAVYLCILR
YFIPKLRHISRIQADSRSNMTGRIVDSYANILTLKLFSHTRRESDYARDG
MQEFLDTVHPQMRLVTQLNTCVWINNMLSVFAIGTLGILLWLDTSISTGA
IAIALSLAIRLTGMSHWIMLEVGELFENIGIVQDGMNMLAKPYAVTDTPD
AIPLFVTKGQIDYNQVEFSYHNDIKNEDHRIFKALDLHISSGEKVGIVGP
SGAGKSTLINLLLRFYDIRQGSIQIDQQDIRTVTQESLRANIAIVTQDTS
LLHRSIRENILFGRPDATKEEVILAAQRAHAHEFIQELVDSYGRRGYDAM
VGERGVNLSGGQRQRIAIARVLLKNAPILVLDEATSALDSDIEASIQQSL
HELMQNKTVIAIAHRLSTIAAMDRLIVLDNGHIVEQGRHAELLSLNGLYA
RLWNHQSGGFLGHLDHSSGSHDETC
>NE1935 Inorganic H+ pyrophosphatase
MANGLTIAICSAILALIFSGLWIRRIYAQSAGDSRMQEIAAAVQEGASAY
LKRQYLTIGMVGTVLFVIIGLALSWNTAIGFALGAILSGLAGFMGMNVSV
QSNVRTAEAARSGLNEALAIAFRGGAVTGMLVVGLGLLGVAGYTALLVSG
ADETSSISDLIHPLIGFAFGGSLISIFARLGGGIFTKGADVGADLVGKVE
AGIPEDDPRNPAVIADNVGDNVGDCAGMAADLFETYAVTIIATMLLGALL
FKTGTGDAAVYPLALGAASIVASIIGCYFVKMREGGKIMNALYRGLAVAG
GIAFFAYLPITVWFMGGATLTLDGTEVGGGELIMRLFASTTIGLVLTGLM
VVITEYYTSTEYPPVQHIANASTTGHATNIIAGLGVGMRATAAPVLAVCA
SIIVAYSLAGLYGIAIAATAMLSMTGIIVALDAYGPITDNAGGIAEMAGM
PESVRAVTDPLDAVGNTTKAVTKGYAIGSAGLAALVLFADYTHGLEHANK
LMTFDLSNHLVIIGLFIGGMVPFLFGAMSMEAVGRAAGSVVLEVRRQFKE
IPGIMDGSRKPDYSRAVDMLTKAAIREMIVPSLLPVLIPVLVGVFLGPQA
LGGVLMGSIVTGLFIAISMTAGGGAWDNAKKYIEDGNYGGKGSDAHKAAV
TGDTVGDPYKDTAGPAINPLIKIINIVALLIIPLL
>NE1032 Domain of unknown function DUF74
MLLTTTPVIEGKRITHYYGIVAGEAVLGANVLKDLFAGIRDFVGGRSGTY
EKELQHAREIALEELQENAHRLGANAVIGIDIDYEVLGKENGMLMVSVSG
TAVFVE
>NE0622 conserved hypothetical protein
MDIYQLSDLMGRRKESGESYLEFLRVPSLSMGVYTLPAGAEDLQEPHTED
EVYYVVHGKARMRIGSDYREISAGSIIFVAANVEHCFCDITEDLTVLVFF
APAEYTHQGGE
>NE0926 Cytochrome c, class I
MKPYLFAAIGASFFATSLAVHAADVPAVLQSKCASCHALTKPESNSLDRL
WERKGPDLHYAGNKFNKEWLVEWLQDPAKIRPAGEFYRKHVKPGTKEDVV
DESSLAPHPKLARKDAEAAANALMTLTDPDGLIEKGAFKDQKVAMSMGAM
YFNKLRGCAACHSAVPGKGGLSGPEMATAGKRLQPDFIYSYIKDPQKIDK
GVWMPKLDISDADLQKLTSYIVQLSAKEDKK
>NE1277 hypothetical protein
MNTAHKWRFFRSGGFDQVRLETGADVESLGTLDPKLWAALSCPTSNLEFD
DKTLEFIDTDHDGHIRVPEIIAAVEWVSSVLKNPGDLTSGSEALQLSAID
DSTPEGAALLASARQILLNIGKKDEEVITVEDTADLNRIFASTRFNGDGI
IPATAASDAETRAVIEDMMKCVGSVQDRSGLPGVSAELIEQFFTEAKAYS
EWWQDAERDATSILPLGENTEAAKAAFDAVKAKIDDYFTRCKLAEFDQKA
GDPLNPALSDYEALTNTDLSSTTEQLATLPLAKIEAKKPLSLNEGINPAW
VGAIDALKRQVVQPLLADKEQLSADEWQALCDRFTAHQAWLDTKRGATVE
SLGINRIRSILAGRYQDEISALLQKDQSLASASDAIDSVEKLIRYQRNLF
QLLNNFVSFRDFYTAQNKAVFQAGSLYLDGRNCDFCLRVDDIEKHSSMAG
LSGIYLAYCECQRRGGSEKMNIAAAFTNGDADNLMVGRNGIFYDRRGQDW
DATIVKIVEHPISVRQAFWYPYKRIGKMIGEQIEKVASAREKSVQDQAAA
GIADISQKAEAGKPPAAAPFDVGKFAGIFAAIGLAIGAIGTAIASVVTGF
LGLAWWQMPLTVLGLILVISGPSVLIAFLKLRKRNLAPLLDGNGWAINTR
AIINIPFGISLTQMATLPAGAQRSLTDPYAEKKRPWKFYLFVLLLLGSIA
YLFHSGYLNQGTVDTLKKHFLSDKAEIGTEEASTAQEIPSASPAEEIADG
QKTEDDKPPQPAAKEGVENGSVASPEPVSTVVPAPKPVSVPASH
>NE2179 hypothetical protein
MNRIGNGLLVAMAITWSGMVFSAVQNLPQPVFSGQDGEQAAADNAASPAP
AEQAAPASTEAETAAKAEEQALPSGIGWKLVRSLEMGDSGKFVHMVLIEK
GRQADKTIYSSAIHRLCAKEKEFCRIRFWVQSYLIPEKVSLTLEQQKTQQ
ADHLFNRAAGIHRTLWACTVDSTSESCIQ
>NE1043 putative transmembrane protein
MKIVAGLLVLLIGLAQYPLWFGKGSWLAILEMHEQIAALQEANQRLQNRN
TVLEAEVNNLKKGFDAIEELARSELGMIRKNELFFRVVEYENK
>NE1212 PfkB family of carbohydrate kinases
MSIDSYSTLTKRNVILFGEALADIFPDGPVMGGAPLNVSCHLQAFGLHPM
LITRTGTDELHRKLIRLMRNAGMTISGIQSDPHYPTGQVLIKPGENWYGH
QFEILENQAYDFIDPVTAATAASSIEPDLVYFGTLAQRSRLSGSALETIF
RYTPQTPRLLDINLRKPWYTRDTVRYSLTRADHVKVNEDELIELPGLLNL
HTRDPRDTASRLIEEFQLHTLLVTCGAEGAWLLDQSGNEAETPGITDITV
IDTVGAGDGFSAVFILGMLLGWPETLTLERANRFAAALCGIRGAIPESSE
FYSPFLKDWGMIN
>NE0999 phosphate transport system permease protein
MPIPNNTDHSPGSLRRSYKRYLKERVIELLLFLAAFLSVFITFAIIYLLI
SESLVFFEHVSVWDFLTDTQWTPLFDDAHYGILPLVSGTAISSLVALVIA
LPFGTIIAIYLSEFAPFTVREIAKPFLELLGGIPTVVYGYFALLFVTPLL
QTILPDLPGFSLLSAGLVMGIMIIPYVSSMTEDAMRSVPMHLREGSYAIG
ATRFQTAIKVVMPASLSGIAAAYILGISRAVGETMVVAIAAGMQPNLVWN
PMEPAATITTYIVQVSLGDLPHGSVGYQTIFAAGLTLLLLTLVFNILGHL
LRKRYREVY
>NE2502 putative outer membrane receptor for iron transport
MRFAKPFCVTSRMRALALLLLFPSIAAAQEMPTTTLAPVVVQATRLGASI
ADTPASVDVIDGRQLRARQPGINLSEGLVSVPGL
>NE2135 conserved hypothetical protein
MTQLPQPLTPDELQKIDAYWRAANYLSVGQIYLLDNPLLQTPLTLQHIKP
RLLGHWGTTPGLNFIYAHLNRIIRRDDLNMIYIAGPGHGGPALVANTYLE
GTYSEHYPDISQDIQGMKHLFRQFSFPGGIGSHATPEIPGSIHEGGELGY
ALSHAFGAVFDNPDLIAACVVGDGEAETGPLATAWHSNKFLNPVHDGAVL
PILHLNGYKIANPTILARISREELDQLFQGYGYQPYIVEGEDPLTMHQLM
AGTLDQVLAEIRSIQTRARLDGVTQYPRWPMIILRTPKGWTGPATVDGLK
TEGSWRSHQVPLSELAKKPEHIQQLEAWLRSYRPEELFDTQGRLVEPLQT
LAPLGNRRMGANPHANGGSLMKQLRMPDFRKYAVEIVQPGQIEAESTRIM
GSFLRDIMCLNLKTCNFRVFGPDETASNRLGSLYDVTPKTWLAETLPEDE
HLAPDGRVMEILSEHTCQGWLEGYLLTGRHGLFSCYEAFIHIVDSMFNQH
AKWLKVSKEIPWRRPIASLNYLLTSHVWRQDHNGFSHQDPGFIDHVINKK
ADTIRIYLPPDANCLLYITDKCLRSRNFVNVIVAGKQPQLQWLDMDAAIK
HCTAGIGIWGWASNDQAGEPDVVIACAGDVPTIEVLAAVSILREHLPDLR
IRVINVVDLMTLQHDREHPQGLSDREFDTLFTTDKPIIFAYHGYPWLIHR
LTYRRTNHANLHVRGYKEEGTTTTPFDMTVLNDMDRFHLVDDVIDRVPHL
GYKAAYLRQIMRDKLVEHREYINRHGEDMPEIRDWKWTAP
>NE0937 possible Response regulators consisting of a CheY-like receiver domain and a HTH DNA-binding
MNTHSLVPNYGISARQCYEPIRVLIVDDHIDLISNVFAYLENKNFILDAA
RDGESALELSSNGQYDVLILDWMLPRLNGLQVLQRLRAADVDTPTLILTA
KSDIPDKLSGFAAGADDYLTKPFLIAELEARILALHTRRIGRAKILRVGS
LIYHLTSQQVMRDGSIIHLHSGSRKLLQVLMRESPNVVTKDRLESLLWGE
DRPDRDLLRTHIYELRKSIDGDYPLKFVQTVPKVGYRIVDPELGP
>NE1728 DUF173
MDNPTLIDPVAKVQGVPLLEVPQDLYIPPEALQIFLETFEGPLDLLLYLI
RKHNLDILDIPMAELTRQYIAYVETMRADQFELAAEYLLMTALLIDIKSR
MLLPRPVTPREDDADPRAELVRRLLEYERIKQAAIHIHSLPVAGRDFMPA
CIWVDFVTEKQLPGINPQDLYDTWLALLERLRLNRNHTIRYETISVRVCM
SEILRNLQSRGNVPFTELFSDITNVHKLVASFLALLELAREALVDIVQPD
RFGMIYVHAIHTDQAD
>NE1347 conserved hypothetical protein
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKVWDEFTSKPSSTLIPSGQQRSC
IPTKHQLHQLICSMTGYCRSLLNRVWALFAF
>NE1344 hypothetical protein
MLLLLTFIESPIAGIPEIRKHNCLVLPHIFVCLVITKSFGRVVTSGAAVT
KRAAGYFGYLAHCAADETTWYACEQFVIGHARKAVTHFIYLA
>NE2103 Helix-turn-helix protein, CopG family
MKAKDFEQQFDEGVDITASLDLSKAKRVLQEQKRVNVDFPTWMIESLDRE
AEKLGVTRQSIIKVWLAERLEKAALTHPSSGTR
>NE0773 Lactate/malate dehydrogenase
MSSPIRIAVTGAAGQISYSLLFRIAAGDMLGSSQPVILQLLDIPESGKVL
DGVLMELQDCAFPLLTDIIVTHDPMIAFDQADIAILVGARPRGKGMERKD
LLQTNGEIFREQGRALNQVVKRDAKILVVGNPANTNTLITMKNAPDLSPE
NFSGMLRLDHNRALSQVAMKLNQPVSHIRKMIVWGNHSSTQFPDLSHAEI
DHQKVIDLIKDQTWVENSFIPTVQNRGAVVIEARGLSSAASAANAIIDHM
RDWIFGTRDDDWITMGILSDGSYKIPKGVIYGFPVVCKNGGRKIVQGLEI
SPFSRTRLDIAYDELTQELDSIKHLLL
>NE0714 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWPNRT
>NE1682 possible transmembrane protein
MIYWGMGMLVIAWLADGGISRLSDLIKEPLALGVLLFCGVWVLGLLWSDS
AVIFQGKWRKYFILLTFIPLFSLLSRERLPWVTGALLCSYLGMVILGSYQ
WAMQEMQGISLLGMSYLHYSAALGIGVILAVFLGWETLSREKRWLSVLSW
LIAILLLFLQFNQSARGILLATLMALLLMIVLRYRAEWRMLTGGLTAIMA
MVILFATSSDIFHDRLQQAGTDLRSFQQGDYQTSVGYRLAMWDVGLHGIA
EHPLLGHGSGMAKKYFDDSIITYKQGIYRNLPEFQETAHFHNELIEIGMH
LGLLGILAFVFLLWCWFQIFRQNRMALLGSAIVCFICISGLTDTFLLYSG
TPPFLLTVTAIAVCWRKYREDPDRIGQKYTSGERNGNQNPQMFLMSA
>NE1672 RimM
MVVLGRIAGPHGIRGQIKVIPFTEYVDGLMEYPVWCLSRDEKNWQIVRPA
TFFIHDNLLIVTLTGYSDRTSASELKGLLVAVPRSQLPPLSKDGEDGYYW
TDLIGISVVNMQGEPLGTVAGLFETGANDVLRVRLSGSSKDELIPFVDQV
IRQVDLESRQITVDWELGY
>NE1302 putative transposase
MKTPWCYTCPMEHDHGYKLLFSHAEMVADLLRGFVREEWVYELDFSTLEK
INGSYISDDLRERQDDIIWRLRRGKGETGEWLYVYLLLEFQSTVDWFMAV
RIMTYVGLLYQDLIRSESIRTGERLPPVLPVVLYNGDTRWQAPVNMEGLI
FPAPGGLDRYRPQLNYLLLDEGSYSDHELATLRNLTAALFRLENSRTPQD
VEQVLQALIAWLQSPEQSGLRRSFTVWLKRVFLPGRMPGTSFSEIHDLQE
VQSMLSERVKEWTKDWRQQGIEEGKQIGIEEGKQIGIQEGRLEGRQEGRQ
EGRLEGRQEGRLEGESEFLLYLLEQRFGPVSDAVRARIGSADTQTLLVWG
KRILTAQTIEAVFGD
>NE0373 Outer membrane efflux protein
MRKIFLWSFLSGFALFTGSNYPVLAAGGGYTVTDIQRMGLQANGLLQAVR
SQVEIARAEVTSASAFPNPEVTFMAGPDSKRLPEIDTGPASMQRQVTVSQ
SLENPFVRRARIGAAEAGVEASRANLEQARADLAAQLRVHIYEFLLRRGL
AEMESDIHDLMGEIQRRIKLSVESGETARFELIRADTEVMTAASRKEAAL
LNTERARIALLQLTAGALPPEFNVTASLGDPVELPSLEALRVELSTVNPE
ILRLEAEQNRAHLRIDQERASVLPSVNILYSNYQDKQFTSNTAGLSVRIP
LFYQRRGEVDAAVSDAARVRETLDYRRYEINRLLESAWQAMEIARRRVEM
FEGGIVKEAEAALRVAEAAFRLGERGFIEVLDTQRVLRNARSELLQAQFE
LQSAAAEIDRLRAHYPKE
>NE1863 probable chemotaxis transducer
MRVNMPVTNVEREMRDGEFLVSKTTAKGVITYINEPFIRMSGFTEQELVG
QAHNIIRHPDMPPEAFADFWNTLKRGRPWSGMVKNRSKDGSCYWVYANVT
PIREHGKVTGHMSVRSKPTRDQIQAAETLYRQMREGTAKVQIVAGEIVAN
SWLGSLREKFRHLSLKTRITGFGAVPLLVMAGSAWLMMQGQLNLAFAGMG
LSVCLAGGMGVLLYRCIRQPVDTMIHHLDKMAQGDYTSQILLERNDEIGR
LTEALKSMQIRAGIDFTETKRMNIENLRVRNALDIATSAVMIADNTGTII
FMNRMVQEILVNEANEVDVCESLQSRLRDLAGQQQIDNVRIGERTYSLLL
MPVTSESGERAGAVVEWRDRTQELTVEKEVMEIVQAGVAGDFSKRLVLAG
KEGFFRQLAEGINRLMESTSQGLDEMATMLEALAQGNLTMHITHDYQGML
GKLKDDSNSTVDQLASIVRQIKDAAGLITTASKEIADGNTDLSQRTEQQA
ANLEKTAASMEELTSTVKQNADNARQANQLAASASDVAVKGGEVVSQVVQ
TMSAINDSSKKIVDIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVV
ATEVRTLAQRSAAAAREIKGLISNSVHKVEDGTQLVDQAGRTMEEVVLSV
KRVTDIMGEISAASHEQSLGIEQVNQAISQMDEITQQNAALVEEAAAASE
SMREQAEQLSRAVAAFKLEHGMAAAHTDNPPSVERRSPNRATNVERLPQT
RNRKKAATRSSVASVSAGTEGGWEEF
>NE1613 ABC transporter, fused permease and ATPase domains
MRTLITYIMVFPKRSAFVLVALLLAGIAEALSLTALLPLLSIAVGESIDS
SMGKLVVDTLHQVGLSPTLEIILLVIVGGMFLKGAILLLTNQQVGHTVAH
VATALRLDLIKALLASRWQYYLRQPAGALANSIATEAYRAAMGFEHSVNV
LALAIEALVYGIVALFISWEATLASLLIGTILFVVLNRLVRAAKRAGNKQ
THLLRHLLTYLSDVLGSVKSLKAMARDNVADAILHEQTRMLEKATRKEVT
NRAALLALQEPILAALTASGLYLALVVWKLSLPEVMVMVFLLTRILGLLN
KTQRRYQQLVAQESAYWALRNAAEEARMEAERSTGTLVPTLEQGLNLRHI
VFGYERKTIFSDLNLEIPVHSFTALIGSSGAGKSTLLDLLCGLAEPKSGE
ILIDGVSLHHIDLRRWRHMIGYVSQDTVLLHDTILNNILVGEPAISVEDA
ERALRQAGAWDFVSSFPDGIHTVVGERGGMLSGGQRQRIAIARALAHQPK
LLLLDEPTSALDPESERIICATLQNLAKEFTIIAVSHQPAVINAASQVFI
VSNGKAELLSDAAAHLPDITGAKEGN
>NE1726 putative integral membrane transmembrane protein
MDNIIQGIAIYALPVIFAITLHEAAHGYVAKYFGDLTAEMAGRITLNPLR
HIDPVGTILLPLMMFVTSKLLMGSGLLFGWAKPVPVNFGRLRQPKKDMLW
VAAAGPGANLLMAFFWAAIIKLGMNMPDSIYLKPMVLMGIAGIEINVVLM
VLNLLPLPPLDGGRIAVSLLPSRLAWKFAQIEPYGFIILLVLFISGVLSV
VLWPLIIFTKQMIVTLFGLYI
>NE1864 HAMP domain:Bacterial chemotaxis sensory transducer
MLQNMTIKSRLIFIVGLLSLALISVGSVGLYGLYQSKTKLETIYQDRVVV
LGHFSQILDSLLQVRMYALLVTNLRDTGVARQRAAQVVDEDAKINELWRK
FTQTYMTPEEQSLAGRFTEQWKVYQAARNQTLNHAINGNFDAAVEVTLQD
AGPKYEAVHGSLMQLIGLQESVTAQEYEQLQAMTSIIFATTITLTVLGIL
LAGGVGFVLIRGVSRSINQAMEVADGIANGRLDQRIEVTGRDEISRLLQS
MTKMVEVLNGFMSAQQKMSQQHDAGAISYRIPASQFPGSYGEIANSINEL
VKAHIAVKMRVVEVVARYAQGDLSVDMDRLPGEKAKVTETMDKVKASLQA
INGEIKYLVEGATAGDFSRRGKADKYQHDFHAMVSNLNQLMQTCETSLED
VIRIMDALAEGDLTQKITREYQGMFATMKDDANITVGQLASIVRQIKDAT
GLISTASKEIADGNSDLSQRTEEQAANLEETAASMEELTSTVKQNADNAR
QANQLAASASDVAVKGGEVVSQVVQTMSAINDSSKKIVDIISVIDGIAFQ
TNILALNAAVEAARAGEQGRGFAVVATEVRTLAQRSAAAAREIKELISNS
VHKVEDGTQLVDQAGRTMEEVVLSVKRVTDIMGEISAASQEQSLGIEQVN
QAISQMDEITQQNAALVEEAAAASESMREQADQLSRAVAAFKLEHGMAAA
HTDSQPFVERRGPNRATNVERLPQQVKGHRKTASPGASSSIPTGTDGDWE
EF
>NE1494 conserved hypothetical protein
MNSKSANYYMSVEQLSNLAGSDIQGLVPILDWDENEYSFAILDCFEQSLR
KSQRLLFVIDEQVELLTTEGIILSQHARPDTRFVTGFQEGPVKEALTGLS
PLRALLTVASGTMRNGKLSLIDNEGKTHTRAGLRELMPISGKAVVLITPQ
GLRGYDQALDDLTAHIRTSGGTPLTAGNLYRMIDHACVDYVAKPTIEIHQ
EETAFDTATNLIDSYLSIVRMNEYGIIHDLDTEFLHDYRIALRKIRSILS
LFKGVYDIDQAHQLKIRFSALMTPTGPLRDLDVYLLEQESFYSLVPHTMH
SGLDAMFSLMRSRRTDEQARLAIHLKIGRYKKEIKALTKLFSKKRKLKPG
PNSSRTSYDFACELIWKRYRKICRLAATINVATPDQEIHQLRIECKKLRY
PMEFFSAVFPQEPFDSLLKSLKRLQDSLGLFNDCVVQQINLQAFVDDLND
EQHQLEITQSIGSLITILYQRQSAEREKITNAFVRFSSDRMQRSFRTLFQ
SRKEF
>NE1202 hypothetical protein
MITIWQNLSINIQRLFIALCILTGIVLIGMQFHVNSQGSMSDTYPKGFRG
GTCTIESDTLLVGYSAYFIPVDYEIPDDSMSALSVVPVLCDKVPGPGLLS
ITVDLLYPASIREQPVAVSLARKNGERIMEPLLSIPARNYQSGIISQEVR
IDESGEYVLQLSGTDEYQSEFHLDIPVTIGTKWYEPFVPYWPMLVLGVVA
AFFYNLRRIVN
>NE0339 conserved hypothetical protein
MDLSMTTPEEITQYLQDHPEFFEEHPDLLESLRFPHPYEGRVISINERQV
AMLREKNKLLQNRLQELIDVGENNDAISEKMHRLTVALLGFGSLPELLHE
LQYHLCEDFSIPHVVLRLWQIDEFGTEADLPSPEFDPISNNVRILAQGML
RPYCGPEVDDEIRQWFAQDAEYLKSFAVIPLKKQSNFGLLVMASPEAERF
YPDMGTLYLERLGDMVSSSIMRLIQQTPGAANRESAS
>NE1104 conserved hypothetical protein
MRDATINLRALPEQRDLIDQAASLLGKNRSDFMLEAACDRAQAVVLDQVF
FSLDTDKFKQFTTILDAPPGPNPGLERLMAVKAPWNTDVV
>NE0176 conserved hypothetical protein
MSVEIFNGTMQKFGAPDTVICQFQYWSVLLRPAQLTLGALVLVAHEPVQS
FSALSSTSFAELQIVTGKIDTALKKAFQYDKLNYLMLMMVDPDVHFHVIP
RYAQAREFAGKTFLDAGWPGVPDFSRINETDKEMNQQIIEHLISCWECS
>NE0525 CBS domain
MSTKLELPIQEFTTPYPVTAREDSSIEELLDLIKNLKVRHIPIMSDGKVT
GIVSERDLKIISALSTREKFLVRAADLMTPDPIIFRGSTSIEDVILKMSE
KKIGSVLVSDEQGNLQGIFTVTDALDILVEILRGKK
>NE2540 putative bacteriophage related protein
MPMAEIRALWQKLVGGDTPTHNRQFLERRIAYRLQELEFRKADANLLDRN
QRRIESLVETGKVKKRDRDYRPAAGTVLVREYKGVEYRVIATADGQYDFQ
GRIYPSLSMIAREITGMRWSGPLFFGLKPPSNAKAKPSPKKRGGR
>NE1894 conserved hypothetical protein
MKLPMNTSSTPAYPQEIEISPDDLPLYCPNPLMDARSWHPRVFLEIEATG
SAMCPYCSTQYILKGTPNPDHHHS
>NE1956 putative lipoprotein
MTKFSAVFLAVLLTGCTLTPKTLAPVSIYDLGPATSVTVTDSSRLSQAII
QVMDVTAPVWLDTQSIHYRLAYHDPARIYAYAGSRWAAPPAKLLTERFRQ
YFASHAIDSQKDDKNKESHVPAHYLLKIELGEFTQIFHAQNDSRIIIRLR
ASLYEPNTRLPVAQRSFTGERPAQTADAAGAVAAFILVSDNLLDELVQWL
FSIHS
>NE1203 possible immunogenic 75 kDa protein PG4
MKDSTVMKYCSALPVLVLASNLSYAMPGMHGAESSGVTEPPAVSKDMGAP
INSPGSEIEITYSADGKTAIFVSTREGSVESSGTPYNFDIWMARNVDGVW
QEPVHLGADIDPTVGPNINTSAWELEPSFSDDGNVIYFTRYEPGDLLSGD
LYVVQKVNGVWQSAKNWNDVPELPSINTLTGEEHCPIIVSDSLIYFNYSQ
PGVTQESDIWKVEKKDGVWQKPVSLGAKINSPQRDHMHWTGVSKDGKSMV
ITSTRIDPDSRGGHDMWISHQDARGEWQKPINLGDVINTPGEEMCWTFTP
DGKKFTGSWGPQNTFDTDIRWISKEDIPLLKTFDPIGPPPNLLVNSNKG
>NE1089 TonB-dependent receptor protein
MYKFGGVLVILVSSITTIYAQSPSNEDNDTASGKTFLPGITVSTSADASA
EGLSPPYAGGQVARGGRAGILGTNDNMDTPFSITSYTNELIQDRQARSVG
DVLQNDPSVRVARGFGNFQESYFIRGFILGSDDVAFNGLFSLLPRQYIAT
ELFERVEVMRGASAFLTGANPGGGGIGGAINLLPKRAPNEPLTRLTTHVG
SGWHHNVGVDVARRFGPDQKIGIRFNGAYRGGGTAVDKEGVQTGLAAVGL
DWRGDKLRLSADIGWQDNRLKRTRTNVTLGSGITGSPRLPDPTTNFAQPW
SYSNESDIFGTLRGEYDINAYLTAWAAYGLRRTDEKNSLANFTLADRDTG
DGYATRFDNTRKDNVDTGEIGIRGKFSTGPIDHRVVIAGSYFENERKNAY
AWDFFNQLNTNIFSPVSWEKPDFSSSAFRGNNLSSPGLTGRTRLLSFAIG
DTVSILEDRILLTAGFRHQNLRTESYAYDTEAKSPAYDRNRISPSVSAVY
KINKQFSIYGNYIEGLTQGDTAPGTAINSGQVLAPYVSKQKEIGAKYDGG
HIGATIALFTTTKPRSIIDADNVFTSSGRDRHRGVELGIYGEVIRGLRLL
GGATWLDAKQRSTGDVTTDGQRVIGVPSFQASMGADWDIPGLQGVTFDSR
VVHTGSSYANDVNTVKVAGWTRLDLGLRYLTEFRGHLVTLRGRVDNVTNR
KHWASVGGYPGQGYLVAGMPRTFVVSASIDY
>NE2507 AraC type helix-turn-helix:ThiJ/PfpI family
MKLHILVCDGVFDLGLAALTDTVGLANAMSGSLPQAPAHIELTLVGVRRR
IRTAQGLTVPVVTARGVPEPDVVLVPAFGDKMPDTLSARLTRPDVPDAVA
MLQQWSTAGAHLGAACSGSFLLAESGLLDGHRATTSWWLGPMFRQRYPNV
TLDESRMIVNSTRFTTAGAALAHVDLALRIIRGRSPALATLVARYLLVET
RSSQAEFVIPDHLAHADPMVERFECWARRRLAKGFSLAEAASAAGTSERT
LARRLQSVLGKTPLSYFQDLRVEHAVHLLRTGNASVDQVAAQVGYSDGVT
LRALLRRKLGRSVRELRRGG
>NE0655 hypothetical protein
MTLVTRCPVCHAVFRLTGIQLHSCNGDVRCGQCRQVFNGFVALIVVPETC
IQPAARSAESAPDYLESGNVAVVPVAESSFPADHFGVQLSTRKTSRWWLI
PNALLLLLLLGQFVHAYRTEIFIAFPAFQPALDSYCDLMQCEIDLPRHLH
LLSLESSDLRVSSPAEPDVVALSAIIRNHAPFPQALPALLLTLTDSDEKP
LASRIFTAEDYLDSVTDQSVLGGDSEIQVQCFLNTSSLDAVGYKLELIYP
>NE0068 Phospholipase D/Transphosphatidylase
MSWPDFVEGNQITLLHSGTEYFPALESAIDSARQEIHLETYIFQYDATGA
RIAAALKRAAQRGVAVHLLIDGFGSHGLSRIVIQEMLAAGVQVLIYRQEF
FSFRFKRYRLRRMHRKLAVMDASVAFVGGINIIDDCSEPDASFPRFDYAV
QVAGPLLAEIHAAARHLWMLAAWVYFKKRWTNCSPVMASRLPAGHQRASL
VIRDNLRNRHSIERCYLRSIAAARREIILANAYFLPGKHFRQALARAAQR
GVSVILLLQGKSEYRLQHYAMHALYGHLLDAGISIYEYRHGYLHAKVAVI
DSAWSTVGSSNIDPFSLLLAREANVVISDQVFAVELRSSLQQALEESYPV
SLHSWKSRSRIKRSLNWMCYYFVRILQGLLGYRRE
>NE0723 hypothetical protein
MKFSTILTFLSGTTFVYPAAFAQSIIASGNILPGIPAPPLALWQPANLRV
GVNAAGTLSITDGGMVAGPGQALLGSAVGSSGTVMVSGNDSLFSTVLQMH
VGSSGTGTLKIEDRGTANIGTFLYIGRFIGSDGLVTVSGAGSRLTNGNMM
QVGSEGTGALIIEDGATVNSTNVTRIGWSSTGIGTAIVQGSGSSWTTNNS
MSVGFGGSGRLLIVDGGAVSNVEGFVGRETGSTGEVTVSGAGSSWSNSAA
LEIGSFGMGELMVEDGGALSNTDGRIGREAGAIGTVTIKDAGSTWSNTGT
LYIGDLGKGTLTVADGGKANIVASFVIGRQAGAEGLVTLSGAGSSLINTS
STQVGGAGKGTLIVENGGVGQSNNLSVGVSSGSTGSVAVRGADSRWIAGS
ILTIGASGHGTLTIEDGGSVTSTLTTIGSNTSGLGEATVSGADSTWTNSG
ALIVGALGNGTLTVSDGGMVSNATAGIGVGTGRQGVALVSGAGARWINSG
DLTVGTNGSATLTVADGGHVSVGGGNGIVHVAEAATANGVVNIGAAAGSA
PVGAGTLGAAEVRFGDGTGRLNFNHTDSAYLFSPVITGNGALNHYSGTTI
LTGDNTYSGSTMIAGGVLQLGNGGTSGSVTSDIQIDSTGTLRIDRSNDWT
YAGILSGTGVFDQLGTGTTMLTGNSAAFNGTTTVTNGRLIVGMGGAGTLG
GMVNVLDGATLGGSGTVGSAGADVTILGGGVHAPGNSVGVQTIAGNYVNY
GTLRIDGTPAGTDMLIVQGGVDITGATLDLQLSPPVASGWNIINGPFTII
DKQSAGAVAGSFGAVNNNLLFLDPYVNYAGGDGNDITLDFVRNDVAFASV
ALTPNQIATGRGIGTLPYGHPIWNTIALMSDEVAVRRSFDFLSGEIHATA
SSVLLEESRFPRRAVNDRLRSAFDTNTDTAFWAHGYGAWAGWQSDGNAAS
LKRDTGGLLLGLDGQLGNWRTGVMTGRSWTGVTVADRASTAEAGTWYAGL
YGGTQWGDLGLRLGLLHGEHDIDTRRTVAVPGLSGTLRSTHGASTTQAFG
ELAHTLHFGIVRYEPFANLAHIHTRSNRFTETGGSMALAGRRSTMSATVM
TLGQRIEAAHVFRGTGIRTVGMIGWQHVWGDVIPRSTHRFSMGDPFTIAG
TPLARNNLLVEGGFELSLGRRAAIGASYTGRFAHNGHDHAATAVLRIGF
>NE0942 possible (U92432) ORF4 [Nitrosospira sp. NpAV]
MRRESLLKKHEGVVKSTGGGVVLKQSLYSIVTAFVFVILCSASTVWGHGR
VSLEEDNCVRQVGENMVHLNTYQPQYDQAGHYCTEIPAAGDTYLVVDLID
PALRNMPVSMKVFRGEEKGGEAILQVKADYHPDGVINGIGKLDKGLYSVM
VTAEGVPPLNYYYQLRVEMVDYGKLVRTWAGPAVAILFLGWLMYKLVQSG
RLRSWFKSQDD
>NE0710 hypothetical protein
MCISNSDWISLGSAIATLLGVGVALYASWQQMKKMNNQLVIQQFSDYTKR
YQEIILHFPENINEQTFDFSKDTDKNKTMRYMRAYFDLSFEEWHLNQRKL
IDAKTWTVWEGGIKTALSKTAFINAWLEIKKDTGYGQEFEQFINASLPTN
QNLTNHSSGTPNGAP
>NE0799 conserved hypothetical protein
MKISPSRQVKGEIEIIPVSGFRGIGKFIDVPWRLYADDPLWVPPLRLERR
LHLSRFNPYFRHAQWQGWIACRDNQPVGRISAQIDELYQQRYGTDTGHFG
MLESIQDEAVFSRLIQVAESWLVERGVRQISGPFNFSINQECGLLVQGFD
TPPVFMMPYSPEWYTSLLEQNGYQPCKDLLAYWLVTDFDPPPAMQAIDRK
YRHQIRIRPLQRNRFNEEIETMRDIFNDAWSDNWGFVPFTQEEFAELGSS
LRWLVPDEFIQIAEIDGRPVAFMAVLPNLNEVLPALNGKLLPLGWLHLIN
KLKSASITTGRVPLMGVRKQFHHTLVGIALAFKVIDAPRKMVKSRGIGHV
ELSWILEDNQSMRAILEKIGGREYKRYRIYDKTLA
>NE2301 conserved hypothetical protein
MSISSSPILNRRLNIAQLFSHKHCVLCQAPNHQDICNACLQDLPGLPPVH
CPSCLLPMTSPEICGTCLRNPPAWSHIRAALRYTFPADALVQALKYRSDL
PLAPILAGLLLGRFRDDPLPDYLIPVPLHPARLRERGFNQALEISRHLCR
QTGVELLSAACTRIRSTPSQTELPWKNRPQNVRNAFTCNRNFSGKRVAIV
DDVMTSGATLNELAKVIRRHGATDVRAWVIARAFPGAPAAKRATDLPGND
ETKRPIKP
>NE1564 Domain of unknown function DUF71
MLRRRTFLSWSSGKDSAWALHVLRQDPHVDVIGLFCTVNKVFDRVVMHGV
RVALLQQQAESAGLPLHIIEIPYPCSNDEYASAMSAFVDSARKENIECFA
FGDLLLEDVRQYREDRLNGTGITPIFPLWGIPTKTLSREMVAGGLKAVIT
CIDPKRIPESFAGREYNESFLDDIPGSVDPCGEYGEFHTFSFDGPMFQNP
IDVVLGETVHRDGFVFTDLLSLTSSTEPTH
>NE0571 conserved hypothetical protein
MFKRVGIFVALVTLSLIFNQPATAGRTVEDFQVWGNITALGNFGFVNPGN
PDLKKFRWWMEGQGRFGNDSSQFTQAIIRPGLGYAITDKIIIWAGYAWIP
SDEPLVPKSGLPFDEHRIWQQVTWADEFSFGKLSLRSRFEQRFFDHNAPV
SGSDDVAYRFRQLVKLAIPVAMIDPNLTFIIQNELFIGLNTVSNPGFISR
GFDQNRAFVGLGYKVHQNATVELGYMNQFIDRRHNPRPDQMMHNFAVNLF
LNF
>NE0791 conserved hypothetical protein
MRKNISENNGKRPGSVKSRFLVSAFAVFLVAYSQTGRSSEEVLEQYKSLF
AQQQKEFEKQRQIIIEQGKEIEKLKSRLDSLITTQPTDRSPASNVAGKDG
QRPPQVSSPSTPKTVVAGPVGKKNDQVQTRTVPGNLPAGPVGQAPPKQDE
KPRPPEMPRLSDAVGGVLTRKGKIVVEPALEYAFTDSNRIFLDAFTFLPA
IAIGLIDVRQVDQHSLMASIGARYGVTDRLEVEARVPYRARFDEQRSRPV
SIGAGIDETFNASGNGLGDIEFAARYQLNSGAGGWPILVGNMRATVPTGK
GPFDIKYAQAQGVPGAVFPTEVPTGSGFFSFEPSVTALYATDPAVFFANL
AYNYNMGTTEKALDGSGDKFKVDPGYAVGMTFGMGFGINERSSFNIGYGH
RHIFNTKINNRTLKGSQLDIGQLLLGYAFKYSQQTTLNFSIAIGTTDDAQ
DVRLSFRVPMTF
>NE1553 possible transposase
MTHSHRRHDISDRIWSLLEGHLPGREGAWGGVATDNRQFINAVFWIIRTG
APWRDLPPDYGGWSNTHRRFIRWRDKGIWEKLLEILIDDPDYEWLIMDAS
HCKVHPHASGARGGNQDMNRTKGGSTPRYIWPWMRMVCRSESLLHKVPLL
IARRLAA
>NE1721 hypothetical protein
MKKLEALIGLITFTGLSFCVCAQEVTNSPARESEVTDRTSIKTSLKPVVV
TADPLSGDQADIAQPVSVLQRERMQTRDLRNIGEAVSQELGVSSSDFGPA
VGRPVIRGLSGARVRVLEDGIGTMDVSTISADHAVAAEVLFADQVEIFRG
SSALLYGSGASGGVVNIVNHRIPERLPDEAGGDLYTHYNSVADDFTGAFR
LNAGAGRFAWHLDGMKRHTGDYSIPGYAELKPGSDAKKGVLENSDVRTTN
FSSGLSYVGERGFLGVAVSRFVNNYGVPGHDHDHDHDHDHDHDHDHDHDH
DHDHDSHNHEQGKGARIRQKQTRFDIKGAVNHPLPGVQKIKTRWGYNNHV
HKESEGDEVGTLLMNREWDGRVEMLHQPLAGWKGVLGFQYQNRDLATLGE
EAFVPSSRMNSLGVFLLERRDVGRWHFEMSGRFEHQRTKRKDDGFETSHD
AFSISGGVIRKFGDGYSVGGDISRTERAPALEELFSNGAHLATGTFERGD
TSLTREKSSNFDLFLRKKGGSLDWTLNLFAKLVDDFIFQQELDLTGDGLP
DRVNMEGRPDEDGYLLLKYRQNNANFLGMEFETIAHLLSDRRGKLDLRLW
TDYVRGRLSSGGYLPRMTPFRFGGALDYAHGPWRGRIDVMRVHKQADIAL
LETDTAGYTMLNVQLDYRFDWGKMNYNLFIRGANLLNEEARRHTSFLKDR
APLPGRSGLVGVRVNF
>NE2464 Integral membrane protein, DUF6
MTKQKKLLPVASLLLGAAIWGVAWYPYRLLEQAGMRGELSTTLAYSIALL
IGLILFRRQLRISEILNPAAGILFWISLSAGWTGIAYVLGIIHGEVMRVL
LLFYLAPLWTILFSRILLQERLSRQGYAIILLSLAGALLLLWQPGSKLPL
LASYGDWMGLSGGLAFALTNVLIRKDQQHGIQLKLLAVLSGTALTGFAAT
LLMESISDITHLHTHAWLILAGIGGLVFFLCILLQYGMTHIPANQAIVIM
LFELVVAAIAAHFLTNEYLTGRDWAGGLMIASASLFSARVNRD
>NE0020 possible sugar kinase
MNTRPIYTTAEIRKIESLVLSVPHSPPLMEKAGLAAAKVAHTRLLTDDKQ
RILILAGPGNNGGDALVAARHLREWGRQVTLVLTGEAERLPQDARQALEQ
WQSAGGTVIPELPDGGQWDAAIDGLFGIGLNETRLLAESYRQLIRQINQL
NLPVLALDIPSGLLSDSGRVPDVAVKAAITTTFIALKPGLLTHDGCDYCG
EIVVCDLELDVAALIPPQNWLLDRAGIVQRLPSPRRANSHKGTYGRLGIL
GGATGMIGAALLTGRAALKLGAGRVYLGLLAQDDVPVVDPVQPELMLRSP
SDFFNPDFLEGLVIGPGFGSEIAACICLERALQTCLPLVLDADALNLIAQ
HTELSSALQARKAPAILTPHPAEAARLLNTSVTEIQRNRLEAARNLARKF
NCAVVLKGAGSICAFPNGHCHFNTSGNPGLSSAGTGDVLSGFLGALLVQG
LLPENALLLAVYLHGAAADVLLKQQNGPLGMVASEIIPAARNLLNCWIEE
ENPGW
>NE0411 Ribosomal protein L14b/L23e family
MIQMQSLLKVADNTGARTVMCIKVLGGSKRRFAGIGDVIKVAVKDAAPRG
RIKRGEVYNAVIVRTAKGIRRADGSLVKFDTNAAVILNNKLEPIGTRIFG
PVTRELRTAKFMKIVSLAPEVI
>NE0414 Ribosomal protein S14
MAKKSIVNRNLKRLKTVNKYAARRAEIVSILRDAGSDIEAKASARDLLQK
LPRNASPVRLRQRCALTGRPRGVFSKFGLGRIKLREIAMRGEIPGLIKAS
W
>NE2572 conserved hypothetical protein
MSKLIITDNLEMLTEQGATRRRLLQAGLGACALLAMPAANAAYSRVYEKR
VSLLNLHTGERVRTAYWERGKYIPEALRMIEKVLRDHRSGDIHRIDPRLL
DLMQHLHHKTGNSKEFQVVSGYRSPATNAALSVQSHGVAKNSLHMQGKAI
DIRLPGVPLHVLRRAAMSMHAGGVGYYPKSNFIHIDTGNVRYW
>NE0096 hypothetical protein
MKRTSISLLAPAILCLSLSFNAQADTYPEKAGEKLATGVANVITGVAEIP
KNMMITSHKKGTIYGVTAGFFVGLVHTVGRTLSGAVDIVTFVVPTTPIIK
PTYIWDDFDRETTYTTWRMR
>NE0358 Domain of unknown function DUF143:Iojap-related protein
MKSPEKLLETAIMALEDLKASNIHVMDVSKLTSLCTTMIVASADSTRQTR
ALASHVQEKVKATGSMVYGIEGEQTGEWLLVDLGDIIVHIMQPAIRSYYD
LEGLWSEQAWRSPQENSAVYG
>NE2439 conserved hypothetical protein
MKHTCSTSDLGYLKFNKRREMMGWNTVERNWKELKGKLKETWGDMTDDEL
DVIAGKREQLVGKIQTKYEIAREEAERQVNAFAHDCDAAKEPLKNVGEAV
SSRQKSVKKRSLYT
>NE1199 hypothetical protein
MVWVIGLIVLILLIVSAWFRKVAVSVIIVTGVVGSLIYVLNEREEERALS
RISLAELDFENVALKPSYSGYKLSGRIKNNSQEFTLKQVNLLIIMQDCTG
TPDSQDCVTIGESHENMDLNIPPGQARDFEKSLYFPGGNLKLLGKLEWNY
SVSGIKGE
>NE1487 conserved hypothetical protein
MTEESLIEYPCDFPIKIMGKSQQGFTQSVLSIVKTYAPDFDDTTLEVRSS
RNGAYLSLTCTIQATSRTQLDSLYQALHDHPMVTMLL
>NE0894 hypothetical protein
MDDLIYGISVEVLQEITGEELRVIKQWKKGTRKPSESAIRLINLFVHGKA
CALLGNSWNGFYFRNGKLFVPEWRNGFSAGEIRTLFFRCQLVVYLESEIR
LLKAELERRNLDIEELEIKADFYRRQISTESKFGMMLERCFGVM
>NE0375 Acriflavin resistance protein:Heavy metal efflux pump CzcA
MMITSLVRAMLAQRLVIVVLTLILMGFGLHAAQRLSVDAFPDVTNIQVQI
ATEAVGRSPEEIERLVTVPLEIAMTGLPGLEEMRSLNKSGLSIITLVFTD
ETDVYFARQLVMERLLEAASKMPPGIIPRLGPVSTGLGEVYQYTIDHPND
GERALTVEELTERRIVQDWIVRPMLRSIPGVAEINSQGGYVKQYHVLTDP
NKLRHYDLTLNQVEKAVAENNANASGGILPTGSEQYLVRGVGLIRTLQDI
GNIVLSEERGVPVFVRDIAEVKLGTEVRAGAVIKGGYTESASGIVLMVRG
GNAKEIVERVKVKVAEINEQKLLPGGLQIVPYYDRTDLVDAALWTVSKVL
MEGIFLVIVVLFIFLGDVRSSLIVVATLVVTPLLTFMIMNHQGISANLMS
LGGLAIAIGLMVDSTVVVVENVFHRLGHSGNTQESRARTIIEAVGEVGTP
VIFGICIIILVFLPLMTLQGMEGKLFSPLAFTIAIALAISLIVSLTLSPV
LCAYILKGGADHDTTIIAKLKAAYHSLLNTALKRGKLTVTTALGLLLLSF
VLFPFLGKSFIPTLKEGALTPQINRVASISLSEAIEMEMEAMKAVSKVPG
VKSVVSKLGRGESPADPAGQNESDPIVILDPESGRTQDEVDEDIRKLLEV
LPGVNIVLSQPIAERVDEMVTGVRSEIAVKIFGDDLNKLRELSEQIARIL
RSIPGAQDIRIERLSGQQSLTIDVDRNAIARNGINVADVHELIETAVGGR
EVSTLYEGERRFSIVARFPQHFRDSIESIRNLLLRAPDGVTVPLYTVANI
NIVDGPAQVSRENAKRRVVVGSNVEGRDLGGFVAEVQERIAKEVKLPDGY
FFEWGGQFENMERAMATLSIIVPITIFAIFFLLFMLFNSLRLAALIILVL
PFASIGGVFGLFVTGEYLSVPASVGFIALWGIAVLNGVVLISYIRKLRDE
GVSVKDAVIKGCEQRFRPVLMTATVAMLGLVPFLFATGPGSEVQRPLAIV
VIGGLITSTLLTLVVLPVLYRRFDQPPVTQQQIL
>NE1546 putative membrane protein
MGQTESIKTIIHFKETKMEKISQFVARLFLGQIFLLAGISKISSYAGTQG
YMDAMGVPGTLLPLVIALEIGGGLAIIAGWQTRLTSIALAVFTLAAAAIF
HNNLADQTQMIMFMKNIAIAGGFMLLAVHGAGGYSLDSRRARP
>NE0759 DUF150
MSLFELLEPTIAGMGYELVDIEQSAPGRLLRVFIDKKEGSITLDDCVAIS
NHLSQLLAVENIDYNRLEVSSPGLDRPLKKKADFVRYMGESARIRLRIAL
QGQRNFVGTLVEVNDDVLTLNADGKLLQIELRNLEKARLIPKL
>NE1806 ATPase components of ABC transporters with duplicated ATPase domains
MAQYVLIMNRVGKIVPPKRVILKDISLSFFPGAKIGLLGLNGSGKSTLLK
IMAGVDKDFEGECTPMPDLKIGYLPQEPQLDPKLTVRETVQEGLGDVFNA
QQQLEAVYAAYAEPDADFEVLAAEQSRLEAILTTQNGDNLSQQMEIAADA
LRLPEWDAHIEHLSGGEKRRVALCRLLLSKPDMLLLDEPTNHLDAESVEW
LEQFLARFPGTVVAVTHDRYFLDNAAEWILELDRGHGIPWKGNYSSWLEQ
KETRLKQEESAESARQKSLKQELEWVRQNPKGRQAKSKARLARFEELSSQ
EYQKRNETQEIFIPVADRLGNEVIEFINVSKGFGDRLLIDNLNFRIPPGA
IVGIIGPNGAGKSTLFRMITGKEQPDTGEIKIGETVKIAHVDQSRDALSD
SQTVFQAISGGNDMLIVGKYEVPARAYLGRFNFKGPDQQKITGTLSGGER
GRLHLAKTLIAGGNVLLLDEPSNDLDVETLRALENALLEFAGCVLVISHD
RWFLDRIATHILAFEGNSQVTFFTGNYQEYEADKRQRLGEEAAKPKRIRY
KPITR
>NE2141 Sodium:dicarboxylate symporter family
MLAPMAFSRREQSLSQSIGALIAQHLWAKILLGMVLGTLTGLVLSPGGLA
LLENDTVLMAAEWIALPGVIFLEMLKMVVIPLIICSIALGILTSGSPHQL
KRMGAFIVPYFIITSFVAVGIGLAITLIIQPGHTVGTSIDTLAAAMTADG
GIKTFDDLTVPQRIVNLIPSNFVEAAMQTDMLKVVIGAIFLGVAALTIPR
EIAKPFEDLCLFGQVAAMKIISWAMALAPYAVFGLMVDAMVRLGFDILTA
IGWYMGCVLLGLVIILGLYLFGAAVFGHRPPFEFLRAIRDVQLLAFSTSS
SAAVMPLSIRAAEENLGVSEDISRFVIPVGTTINMDGTGLYQAVAAVFLC
QIFKVDLSFTEMILLLCTTVGASIGTPATPGVGIAVLATILSGIGVPPAG
IGIIFGVDRILDMCRTTINVTGDLTAVVIMDKWMKKNFA
>NE2238 Transposase IS200 like
MTFTTKEYQSLSHTRWDCKYHVVFIPKRRKKRIFGMLRWHLGELFHELAS
HKESKIVEGHLMDDHVHMCISIPPKYAVSNVVGYLKGKSAIQIARKFGGR
QKNFTGEHFWARGYFVSTVGLDDNIVRTYIRNQEDEDERYDQMKLEI
>NE2421 SLT domain:LysM motif
MKTKSNTSLVLIAAVVLASLSVTVQAGRTLEKNISITVLDTRQYQVPPHK
QQDDLWMRVRAGFSLANIQSQEVRQHESNFSKNQRFIDHIVGRSQRYLFY
IIEEVERRRMPAEIALLPIIESEFDPDAYSHRHAAGLWQFVPSTAKAFGL
EQNWWHDERRDIIAATQAALDYLQMLYKMFGNWKLALAAYNWGQGSLKKV
IDEGHGGNKSVNYQEIGLPAETRNYISKLIAIRNIIANPRRYGIKLKPIS
NRPYFEQITITQQIDVQLAARLAGISESEFNALNPAYHRPLIKAEDSPRT
LLLPVTKVKTFVDNLENYDKSLVSWRIYQIEQTETLRDLSSRYRIPVARL
AEINGISERATLKKGQILLVPRDGSSAREMTYLAHYQLKQPARPATRRSQ
SSEERIVYTVKKGDTLHAIAKRYGIDIKTIRLWNKGSDQLSIGQKLTLKL
TSPAFSSPYSS
>NE0183 possible (U92432) ORF4 [Nitrosospira sp. NpAV]
MNKVLRNSGIAGLCLALSIFLLSAPASAQLMLAHEGHHDAGGCKIEGGDF
PVTVSVYEVPEGNIPPMHSYCNHLPDAGKINMTVELSDSQTREVPIAVRV
LMEGHENSDHGAHEVLYMPAEKYSSGIIVVATNLEHLGQYTVQLETEDSA
GQVKTAVKIPLHVGGGGGHDHGSNFGMLEMILLAVVGGVGTFIFMRSKKA
ANA
>NE1823 GTP1/OBG family:Conserved hypothetical protein 92
MSLKCGIVGLPNVGKSTLFNALTKAGIAAENYPFCTIEPNVGIVEVPDSR
LTELSRIVNPQRVQPAIVEFVDIAGLVAGASKGEGLGNQFLANIRETDAI
VNVVRCFDDPNVVHVAGKVDPLADIEVILTELALADLATVDKAIARDGKK
AKSGDKEAIKLVTVLEKLIPHLNQGQPARTFPLSNEEKEIIRPLCLLTIK
PAMYVGNVLEDGFENNPYLDRLREYAAKENAPVVALCAKIEAELADLDEA
DKKEFLADLGLEEPGLNRLIRAGYDLLGLQTYFTAGVKEVRAWTIHKGNT
APQAAGVIHTDFEKGFIRAQTVSYADFIACGGEAGAKEKGKMRVEGKDYI
VQDGDVMHFLFNV
>NE0889 conserved hypothetical protein
MAIEKVKVTGITFFDGTLDDGKHIDSGKVFIEHLLDFRKGTAKGSSTTAY
PLASSKEAKALMNHDFPLVCEVEFLTLSSNKGPKTVINALRPVPAAASPA
R
>NE1788 transposase
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEAV
>NE0846 conserved hypothetical protein
MIIRNNSSFYRIDPLKSPIKANPLRNPGESLNHRLQNIFLDGIVPYFIAA
LCFVLIAAWEWIRWYTQTSPNPVLFTVMAIPCIATLLWKIYKGRKEVKRI
KLGLAGELAVGQFLERLRAQGSHIFHDIPGKGFNLDHVVIHSSGIYVIET
KTLSKPDRGESKLVYDGNHILKNSTALDRNPITQVRANSRWLRE
>NE1085 putative transmembrane sensor
MPDTEEIARDSVVPPDVRRQAVNWLVELQSDIVTDETRDQWQKWRMSHPD
HEYAWQQVEVFSRKLRGLPSPLAHAALTPPHSSSRRRAIKALTVLIFLGG
GTWVAGEGRLWCWWPEDYCTGIGEHRAVTLSDGTGIELNSGSAITVNFDD
TRRLIRLMNGEIMVTTAPDPRLANGIGAARPLLIETAQGILRAIGTRFTV
RQFDVSQQGKSSVAVFEGAVEIQPITGRLRQLEVGQQAAFTRDGVIAVGM
ACEADTAWRQGMIVARDMPLIDFLAELGRHRSGWLSCDPAVAQLKVTGTY
PLADIGKILAILQKTLPIDVQLFTRYWIRVGARQEDT
>NE1403 putative uroporphyrin-III C-methyltransferase
MDIQIKRAYEPADPADGCRILVDRLWPRGLTKQQVACDLWLKEIAPTADL
RKWFNHDPAKWAEFQRRYRDELSVNPCVKELLDRAAKGRLTLLYGARDAE
CNQAVVLRDYLLERN
>NE1542 conserved hypothetical protein
MKFFQRTVLEPAYQGAQLHPGKLSGIYQNEILCNSLFHHYKQRTLKLGRE
LESRLKHLYHLVRMALPGTVSANTTRRSPANSIHTEVQIDSRTDTHLPQV
SDDEDLRWIRTGLPHQYHAYLAIDRLKSSHHLAGQSVLCIGGRAALYPNY
HQLIEAAGGHFMVFRGGAQDNSECLLALLARVDSIICPVDCINHEDFFTV
RRYCQRTGKNCVMLERSDLVTFGKAVETLARGDCHNSETDFLNRSAA
>NE0121 conserved hypothetical protein
MQLDTIHKITGTLILKSGLHIGAGDSEMRIGGTDSPVVKDPLTDQPYIPG
SSLKGKIRSLLEWRHGLVVATGGAPYSFKHLAQDENNSAGRDVIKLFGGA
PDKAEDQLVKNIGPTRLAFWDCPLNGDWKKEAADSRHLLTTEVKSENSIN
RIAGTAEHPRFIERVIAGARFDFTLTLKVLEGDDLLNTVLLGLRLLELDS
LGGSGSRGYGKIKFAELKLDGTDLMEQFHAITPFNQTA
>NE0404 Ribosomal protein L2
MMALRKTKPTSPGRRAVIKSVNSFIYKGKPFAALTEKKKKNAGRNNSGRI
TVRHIGGGHKQHYRIVDFCRNKDDIPAKVERIEYDPNRSAYIALLCYADG
ERRYIIAAKDIEVGSYLVSGSSSPIKMGNAMPIRNIPVGSVIHCIELRPG
KGAQLARSAGSSAQLMAKEGDYSQIRLRSGEIRKIHISCRATIGEVSNSE
HNLQSIGKAGAIRWRGVRPTVRGVAMNPIDHPHGGGEGKTAAGRHPVSPW
GTPSKGSRTRKNKRTSNMIVRSRYSKKG
>NE0517 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNT
>NE1993 hypothetical protein
MRQQADNVYLIWYQPGTGRPVQQSTPARTIGYIPSAQAEENYHLTQTGRT
PMTVTSTKHPHNSGGQFRTVAIWEKARIYYNPPLPLVIEDRGHNLA
>NE1559 hypothetical protein
MFKPVVQEEVTGCGIASVANILGKTYSEMKTIANAMGIHASDQSLWSDTQ
YVRRMLSGAGVETSEDEVPFESWDALPDLALLSIKHHQEEGKAFWHWVVF
KRMDGQSFVLDSASYLPSNIRQDFDAMQPKWFIEVKNA
>NE1497 conserved hypothetical protein
MDYFYESEHTKFMRELFAKRPELIEKQKEARAIWWDKDVDREALKCFEEA
EVPQRSYVYFSWPDQEQETEK
>NE2266 Glycosyl transferases group 1
MDKKVIIALNAAWNLVNFRANLIRGLAAAGYEVVAIAPPDEYAPRLAELG
CRYVPLSMDNKGTHPGRDLLLFWRFLQLLRKEKPAVYLGYTVKPNIYGSL
AAHLLGVPVINNIAGLGAVFIQEGWLTRLVQRLYRVALSRSAKVFFQNNV
DRALFVSKNLVSDVVTDRLPGSGIDLNRFVPVPLPDKTPLRFLLIARMLW
DKGVGEFVEAARILKKQGVNAEFCLLGFLDVQNPTAISRQQMDEWIAEGV
IRYLGVSDNVAEEIALADCVVLPSYREGIPRTLLEAAAMARPIVAADAVG
CRDVVDDSINGYLCRPRDATDLADKISRIVALSCIERTAMGLHGREKMER
EFDERIVIDKYLRAIEEILNRPEPSPSGQS
>NE1881 DUF208
MEREKLELPENGKKLLLHSCCAPCSGEVMEAIVASGIDFSIFFYNPNIHP
RKEYDLRKDENIRFAEKHGIPFIDADYDMDDWFKRAKGMEMEPERGIRCT
MCFDMRFERTALYAYENGFDVISSSLGISRWKNMDQINDSGVRAAAQYPG
ITYWTYNWRKKGGSARMLEISKREKFYMQEYCGCAYSLRDTNKWRVEHGR
EKIRIGEKYYGDVTE
>NE1405 DUF214
MNPVSLTRWLLLGEWCSHRLQSVVAVLAIMLGVALGFTIHLINTAAVSEF
SAATRSLSGTSDLTVRAVQSTFDESLYPTLMQHEGVAQASPVLEITAGVP
AKAKSMHNPVFKILGLDVFRATAVTPDLLGVPAEDRQMDILASDAVFLSP
AAMEWLSVKQGDSIEVSAGAKRITLRVAGGLTRAHAGQRIAVMDIGAAQW
HFDQLGQLSRIELKLRQGVDRDTFRATLSKELGASYLLTESEDDDERAHN
MSRAYRVNLNILALVALFTGAFLIFSSQVLSTIRRRSQLALLRVLGMTRR
QILRQLLLESGMLGVLGSLLGLALGYLLAATALYFFGADLGAGFFPGLEP
RLHPDPLAALFYFLLGTGVTLSGSLLPAWEAAHAHPAPALKSGSEYTVLA
ALRAPRLAAACLLTGVIFAGLPPVDDLPVFGYLAIVLLLIGSMAALPYLC
ELFFSTLSRALNPNRSSVLYTLAITRLSNASGLAAIALGGVLVSFSLMVA
MAIMVASFRISVDDWLARVLPADLYLHPTIHQDMRGFIPDEQQIIASQPG
IERIDFIRSIPLTFDPARSNITLLARPIDRNDPGLTLPLTEDRLPPGDIP
PDAIPIWVSEAMVDLYGFKTGKRVTLPFADPEQTFIVAGIWRDYGRIFGA
IQMQLSDYRRLTGDDHASDAALWVKEGAAIDTVISALQALPFGDTLEIME
TGRVRAVALKQFDRSFVITYLLELVAIGVGLMGVAASFSAQTLTRIREFG
MLRHIGLSRRQIHRMLTLEGGLLAGFGIITGFLAGTAISLILIFVVNPQS
FHWTMQLHMPWTWLLLMALAVLGAAALTAFAASRHAASGNVIRAVREDW
>NE0197 conserved hypothetical protein
MIPNPDPESSINRNQVIISGTITDLASPRYTPAGLMIAEFKLSHCSNQQE
AGIQRRIEFEFEAIAIAETAEKIIRIGSGSNVEITGFIAKKNRLSNQLVL
HVRDTRII
>NE0719 Uncharacterised protein family UPF0102
MSSAGNKGSDAEQCAAAFLQQQKLTLLEKNYRCRFGEIDLIMREDDTVVF
VEVRMRSSDRFGGAAASITAAKQSRLIRTARHYLAGHEGDFPCRFDAVLI
SGNRENEIEWIRNAFDES
>NE1895 Phosphoglucomutase and phosphomannomutase family
MSNIAPEIFKAYDIRGIVETALTPATVELIGHAIGSEARERQLTTIAIGR
DGRLSGPELAQALTNGIRKSGIDVIDVGMVPTPVLYYAAHELCGYSGVMV
TGSHNPPEYNGLKIVLGGETLAAEAIQSLRLRIEQANFQHGQGSYRQHDI
VQSYLQRIIADVKLARPVKIVVDCGNGVTGVLAPELYRRLGCEVIELFCE
VDGTFPNHHPDPSVPENLQDVIRTLATTDAEIGFAFDGDGDRLGVVTKDG
SIIYPDRQLMLFAADVLSRNPGGQIIYDVKCTRTLAPWIIRHGGKPVMWK
TGHSFIKAKLKETGALLAGEMSGHIFFKERWYGFDDGLYAGTRLLELLSK
QVDPATVLHALPDTINTPELQIKLKEGENHALIAQLQRDADFPEADQVIT
IDGLRVEYKDGFGLIRASNTTPVAVLRFEADNRAALERIQQDFRRVILQA
RPDAALPF
>NE0258 hypothetical protein
MFIVFSGSAAFAQEALLSAKEDYITCWKQPCVDVAGSEWSEKNPNGVGIS
VRMGTQSGVTDDQIKTVLTRDFKKFGMTNIKFFFEQNDAPAAGIAFHVRG
GTEGLFFIDNVREQVAGIARRAANTNPVFQ
>NE0796 possible capK protein
MAGIYTHLISGFLFPLHEHLKRHASVNVRKSMEQTQWFSPEKIVQLQTEK
LRQLLIHINTHVPYYRTLFAELGFQPEEAGSLADLQYLPFLDKSIIRTHL
EELKSDQARGLVRFNTGGSSGEPLVFFIGKERVSHDIAAKWRATRWWGVD
IGDPEIVVWGSPIELGAQDTLRSWRDHLFRSRLLPAFEMSDQKLDDFLDT
IRRFHPKMLFGYPSALSHIAKYADKSGIKMSNLGIQVAFVTSERLYDDQR
RQISDTFGCPVANGYGGRDAGFIAHECPAGGMHITAEDIIVEIVDSNGHP
LPYGESGEIVVTHLASRDFPFIRYRTGDIGILDDRTCDCGRGLPLLREIQ
GRSTDFVVAQNGTVMHGLALIYILRDMPEVKVFKIIQESLDFTRVQVVAE
TGLHPSQISRITESFQTRLGRDVKIKVEQVSEIPAEKSGKFRYVVSKVAG
V
>NE0900 hypothetical protein
MISKPGRIVGLGIILSGLSACATYQPVLYPNSYYQSVGKVAAERDIRECR
QLAESAGAREGSGSTGNTARRTAIGAGAGAASGAVGGAIAGAAGRGAMVG
AASGATWGLLSGLLGSGSASQPAPAYMNFVNRCLREKGYEVTGWQ
>NE1407 Uncharacterized protein family UPF0033
MNPNNEFFDRELDVRGLICPLPILRTKKSLSEMTRGQVLKIMATDPGAVI
DFQVFADQTGHELLSSSETTGEFLFYLKKR
>NE0667 possible Response regulators consisting of a CheY-like receiver domain and a HTH DNA-binding domain
MMSTMRVLVADDHEETLNFIRRGLTQAGHIVSTVDNGREALFRATEESYD
AIVLDRMLPGLDGLSILKMLRAGGIRTPVLLLTAMTRIADRVEGLENGAD
DYLVKPFAFSELHARLNVIIRRPAVLEPETSLRVLDIELDLGKRIARRGG
RRIDLQPRELLMLEVLLRNPHRVMTKTMLLERVWDFDFDPQTNIVETHIS
RLRAKLNADFDQDAIITVRGAGYMIRSE
>NE2446 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE1080 Outer membrane efflux protein
MNRVVLFLLVWLPAACTSLTAPESNMPSEAINVPAQWRATSDLAAPDVTR
DWWRSFGSTELELLIVQAQKQSLNVVAAVARVHQAEAAARIAGAPLLPEL
SANADFSRRGQYGDNAFHDNTFSGGLAASYEIDFWGKNRARRDAARAILR
ATEFDHDTVSLTVTAGTAQLWLQTTALRQRIDIAERNLDNASRLLTLVEA
RRRAGAATLLDVARQRGLVAAQQRFLAALHQQASDTQTALAVLLGLPAST
FTVKSSRLDALQVPSISAGLPSDLLVRRPDIASAEAQLAAANASIIAARA
AMFPSLQLTASMGVVTPLDGPLYGVAAGLMAPIFNAGRLAAGRDLAVAQH
EELLAGYRSIIISAFGNVEVALNAVTGYDLQATAQTQELIQARRAFDLAE
SRYKAGAETLITLLDVQRTLYAALDNAMQFKLLTLQARVSLYRALGGGWK
TVADARPNVL
>NE0255 hypothetical protein
MTYDTNLPSEDGIPDEGNAKKVARGTLQVIGGAVPLVGGLLSALAGAWSE
REQAKVNRFFEQWVRMLEDEIREKEATVLEVMARLDLHDEKIAARVESKD
FQSLVKKTFRDWAGVESEEKRVLIRNILSNAAASTLSSDDVVRMFIDWIG
QYSELHFQVIGAIYNSGGITRGAIWKKIGKGRVREDSADADLYKLLFRDL
STGGVIRQHRKTDYYGNFVAKSTQKKSPARSGGTKTLTSAFDEEDQYVLT
ELGQQFVHYAMTDLPLRIAYKL
>NE1440 hypothetical protein
MYKQLVRFMMLVSLGCLLYAGTATAVDFEANTPAVTRLKQNMRQRHGQLV
PFYDSGAVGLTRDGLIVLRDANAVPLSGRQSVNSLVAAENQDRNALYREI
ALANNHPEWEGNIRSTFAQRWIASARSGWWYQQENGSWQRK
>NE2106 hypothetical protein
MNIERKQAKQKEISSGHSFLEWGMTILKMPPAKRIKMVRDGVKVTHLVGA
GRYYGISQAKLSKLLGISDATITRKIRSSGKLGPLESERLARIALIEAEA
EKVFDSSDLAKRWMLESNLALGESPLSLLDTDMGADEVRKVLASIAYGGI
A
>NE2565 S4 domain
MTEKSAQSADEKFRIDKWLFAARFYKTRSQAADAVERGQVQVNGMRAKPS
RILNTGDLLAIRIGPYHYLVKVLALSNQRRSATEARLLYQETEESRQARE
AVAENLKSQPTPPHYAGKGRPTKRARRDLERFMSEK
>NE1570 Type III antifreeze protein:CBS domain:NeuB family
MRIEKKIKDHLVFSGDSILDALKRINDNQSRIVFVVQDNGVLIGAVSDGD
VRRWMTQATEFNLNLPVDHVMNRNFIARPVTESQHQIADYFDHKRDIIPL
IDEQGRFVALARKSATGLQIGDFLIADQNPAFIIAEVGNNHNGDIGLAKE
LVNLAVEAGADCVKFQMRDLSSLYSNQGRNAEAGYDLGSQYTLDLLNKFQ
LNHDELCQVFDYCRQQDILPLCTPWDLVSAHVLDEYGLEAFKVASADFTN
YEMLETLAKTGKPLLCSTGMSSEAEIKGSVDLLRRLGAPFALLHCNSTYP
APFKDVNLNYLPHLKQLGGTVVGYSGHERGFSVPLAAVALGARIVEKHFT
VDRSMEGNDHKVSLLPEEFAEMVRQIRNIEEALGQGGERSLTQGEMINRE
NLAKSLVINCDLSQGQLIRRSMITVKSPGQGLQPNRIDELAGKVAQRDFK
AGDFFFETDITPKSVKKQHYVFSRPYGIPARYHDYRALIEGMKIDFVEFH
LSYHDLDVKLSDYFSDPLSIGYAVHSPELFAGDHILDLASHDADYQAHSI
AELKRTVAVAAELRQYFPATPKPVLVLNAGGWTPQNFLPVEARTKLYDKV
AKALDEIDLSTVQLAIQTMPPFPWHFGGQSHHNLFVDPDEIAAFCDKTGH
RICLDISHSMMACNYYQWDFNTFLQKVLPYTIHLHIVDAKGVDGEGVQIG
HGDVDFTLLRDQLNQFARGVQFIPEIWQGHKNKGEGFWSALAFLEKTSL
>NE1457 Ribonucleases G and E
MKRMLFNATQPEELRVAIVDGQKLIDIDIETLGKEQRKSNIYKAAVTRLE
PSLEAAFVDYGAGRHGFLPYKEIAPTCFDRNTDWSGHRVADLLREGQELI
VQVDKDERGNKGAALTTYISLAGRYLVLMPNNPQGGGISRRVSAEERAEL
REVINNLQVPEGMSIIARTAGIGRNEEELQWDLNYLVQLWRAIEEAASNE
KGVFLIYQESSLVIRAIRDYFHAEIGEILIDNPEIYEQAYQFMANVMPAN
VDRLKLYQDDIPLFSRFQIEHQIETAYSRQVTLPSGGSIVIDHTEALVSI
DVNSARAIRGSDIETTALNTNLEAADEIARQLRLRDLGGLVVVDFIDMEV
TRNQREVESSLLKALRLDRARVQIGKISRFGLLELSRQRLRPSLGEGNYI
PCPRCHGTGHIRGTDSSALHILRIIQEEAMKEQTSMLQVQLPVDVATFLL
NEKRSEIYRLEARLKVGIILIPNIYMETPNYTITRLRHEDIKSTELPPSY
EMVEKTSEEVTLPSISQETRATRPKAAVTGIKPEQPAPVPEEKPREQLSF
LDKFFSLFRPAGSNRTETDQAAEESQTKPSRGERSRNRRDRGRSKRTKTS
TQVTANGSLQEAGHSSLLDHPVVVEHPVTDRATDNDRQHRKNGRQNPPQK
LEVASDQLDVSTTPADTAATADSTSERKTEKRGRSRRRRGSLHRDREDQI
NLNTATGQDEVIPIGENETVEPVVSSDDDSKDTENILLLPVEPVGDTSST
PTNDIIENQVTASAEIIETETIHTRGEPDSEYQVTVEPHPHQTIELPDLA
QSGLIMIETAPDKISETEEVPPAPSRRPRSRPRRSVEPAADNAEPLVQVE
TRD
>NE1533 Aminopeptidase I zinc metalloprotease (M18)
MTAQRYVRDLLNFIDQSPSPWHAVATIEATLREFRFIRLEETDKWALQAG
GRYYVVRDDSSIVLFILGSKAPAESGFRIVGAHTDSPGLRIRPNGVSASD
GLARLKVEIYGGPILATFTDRDLSLAGRISYTDDGGQVAHKRICFDQPLL
RLPNLAIHMNRGVNEDGLKLHKQNELPLLFAQLTDDQLPQPYFLALLEQK
AGIPATQILSWDLAVYDTQKGTLWGANQEFYTNSQIDNLASCHAGLQAVL
DDTILDHAESTLVCAFFDHEEIGSESHIGAGGSFLSDVLQRINLAVSRDP
EDGARAFARSFLISADMAHAYHPNFPSSYDADHKVFVNRGPVIKFNANRR
YSSESISAAHFMRWCEAAGVPYQRYSHRSDLPCGSTIGPIASAKLGIRSI
DIGCPMWAMHSIRESAGVQDHEYIIKALKQFFSS
>NE0443 hypothetical protein
MTDDQPDLSAEPEEDELLLGKVDSMLHRHRREQSSLMQSSEQTVAKLDDP
GEMLPAEVGPVEVPQNDESVSSEVDGIPVLTEQVTLVLDEWPSQSEISEL
LYFAFDAALRETSIHLDPAERLTLIQALGKRLPKNL
>NE0865 PAS domain:Domain of unknown function 2
MIMYTTPRINVFKPFLSATVAMVVIGLLIALDNSLIFIESHIVAGLWVTL
LMSILFMITQIAHAEKTFSKQFAQLAAQKERLSSEIKHRIWAEKTSSENK
AKLQIVDENFPVMLAYFNTERQCRYHNRAFRQWFGLRPEQIENRLFNEFL
DKSLFSDISSAVERVLSGETIQNQHVLKLANQSICFITSHLTPHFDSAGK
VIGFYILYTPRLMKKGEELLQNVKDRSTAEHKQAQESKTTRDSNENTAKQ
PTRVSIDSSRRIIQAIEQNEFHLYCQKIIPAGSNQESSSYYEVLIRMAEE
ESNLIPPGAFLPFVEKYNLMPRLDSWVVEQVIHYLADQRADPRISFCINL
ARSTLIDPVFYDRVWKLLNAAGIEPQRICFEIETADVAANLLSAVQFVKK
VQQSGCLVSLCSFNHDRGSFDLLKHIQADFLKIDGSLICNILRDAEDLKK
VENIGKFARALKIRTIGELVETRNILEKLAKIGIDYAQGFIVGRPLPIEK
IEYRADSMDDL
>NE0142 putative aminotransferase protein
MKIQDFIQNRSKENFALHDKYLNSQMVKVLKTIDFDRHYVRAEGAYLFDE
QDNRYLDLLSGFGVFALGRNHAEIVEALKSTLEADLPNLVQLDVSLLAGL
LAERLVKLTPDGLKRVFFANSGTETVEAAIKFSRYATGKSKIIFCHGCYH
GLSLGSLSATGDDHYKKGFGPMVPDFIEIPFNDLEALEQVLAQGDVAAFI
TEPIQGHGVWIPDDNYLPEAARLTRKYGALFITDEIQTGLGRTGKWWAVE
HWSVEPDILCMSKTLSGGFIPVGALVCKEWIFDRVFNRMDRAVVHGSTFG
KNNLAMAAGLATLEVLESGNLIEHSARTGAAIMDALTPLTEKYECFKEVR
GKGMMIALEFTEPKSPMLKLSWKMLESANKGLFSQLITVPLFRQHRVLSQ
VAGHGMNIVKFIPPLILTESDVQWIVTAVSDVVANAHRGPSAVWDFGKTL
AFSALRAKLHA
>NE0478 conserved hypothetical protein
MYTGTVFENNRTQAVRLPVDVRFADDVKKVWVRKLGKERILTPVDHTWDS
FFLAEQGVSEDFLSERASQEQQEREVF
>NE0454 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIQTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNN
>NE1257 Bacterial type II secretion system protein E
MSMRANQALSNLPEYLSSTTSALIEILQQRGLITDEGIERVQAAALAHGI
AFSTAATRLGVVSERDMARTFADILGVAVLTRAEYPQPPHIPNDLNLAFL
RKHRVCPVMVSETYVRLAMANPQDSVAITGIRFALGKDVEPVAALESDID
EYLQAGIADISAGTDNVTPGASSTGDDIDKLSDHDSEAPAIRMAHQLLTQ
AMDAGASDVHLEPMTDSLVVRYRIDGVLQEVDVLSNRWSRSVVSRLKLMA
RLDIAERRLPQDGRIRFTTRGVSLDLRVATFPTLHGESMVLRLLGQHSVS
PELESLGLDPSGLQALRAALDRPHGIVLITGPTGSGKTTTLYAALNAIRS
PERKIVTVEDPVEYTLPGVSQLQVKPEIGLTYGAALRSVLRNDPDVIMIG
EIRDQETADIAIRSALTGHLVLATLHTNTAAGAITRLLDLGVEDYLLAST
LVFAGAQRLVRRICPECCTTREPTGDERHALASVSGHKAETCQLPVAMGC
PACRGRGYTGRLPIFEALPIGLEEQAAIRQPHAEEVLLGITRRKNTPTLW
SHGLDRVIQGETTLEEVLSVVEEVGQ
>NE0136 hypothetical protein
MKQSIFFLTALILFHASSNAGTLVNGTWSPMGCGERPIAPVVSAASVEDY
NRSAEAINEWQKRAQQYNSCLVDEANADNALIARTANDQQAKFREEIDRI
NAETDKARAELDSKR
>NE1064 PIN (PilT N terminus) domain
MIVLDTNVLSEILRPVPDTQVLVWLAAQPRSVLFTTTVTRAELFYGVRLL
PDGQRQTALLDAIQSIFDQDLAGHVLNFDSTAADTYAKIAASRKAVGKPI
SQFDAMIAAMAKSKGASLATRNLKDFVDCGIDLVNPWSTSYLK
>NE2294 Patatin
MASQGSQPSRLGLVLTGGGARAAYQIGVLRAIAEMLPPHARSPFPVVCGT
SAGAINAAGLAMAATHFSTGIKRLEAVWGNLHTGQIYRSDLIGVLHNALR
YFGSIVSSRMAGKPVSLLDNTPLRQLLACHLPFRGIRRSIHAGALHALGI
TAWGYASGQSVTFYQASPSVIPWKRAQRIGVPSRIGIQHLVASASIPFIF
PAIRVNREYFGDGSMRQLAPLSPALHLGADRLLVIGVNEKKETDCERTKV
TGYPPLAQIAGHVMNSIFVDSLDVDLERLQRINQTLSLIPGMQAGNGPAL
RMVDCMVITPSERIDELTQAYANSLPRPIRYFFRAAGAMGPKGSAMLSYV
LFEAPFCQALIDLGYRDTLRRQDELFAFISERRE
>NE2196 Integral membrane protein, DUF6
MTNTRNENLGYVYGLIAVTAFALTLPAMRAALSALDPVFVALGRGAGAAV
LAAVFLWFTRQRLPTREEAKGLIIVAAGAVIGFPLLAAWAMLYVDASHGG
VVLGILPLATAVAAALFSNERPSMKFWLFALIGAGLVVGYSLSRAGGTLH
PADLALFGSIVCAGVSYAEGARLSKSLGGPQVISWALVFSFPILIIPAIH
YAPVSLNLPLESWLGFIYLTVISQYLGFFPWYHGLALGGIAKVGQTQLLQ
PFLTIIASVLLLGEHADLMTWLVATLVVAVVAVGKHAQVKHNDPESATIA
DTSPHS
>NE1927 Sulfate transporter
MMKLGNLQNHMILRDLYASIVVFFVALPLCLGIAMASGAPLFSGIIAGIV
GGIIVGMASGSQLGVSGPAAGLAVIVLNAIATLGSWEAFLLSVMLAGVLQ
IGMGYLRLGAIAYYFPVSVIKGMLSGIGLVIILKQIPHALGYDRDFEGDL
AFFSPDGESTLGYLVDTLYDVSPAAILVTVISMSILLLWEMVLYKKYRIF
QILQGPLVVVAIGILLSYLFDEYIESLNFETDDLVNLPVALSIQDFAGQF
TFPDFSHLANPEIYFIALVIAVIASLETLLCVEATDKLDPLKRVTSTNRE
LVAQGLGNTVSGLIGGLPVTQVIVRSSANITFGGRTKLSAIFHGVLLLIC
VIVIPGVLNMIPLASLACILFVVGYNLAKPSLFKAMYELGWTQFAPFIAT
VIGILLTDLLKGIGIGMVVAIFFILRSNLKTDYELFHEKNDGVKKIRLVL
AEDVSFLNKGSIGKALKEFEPDSWVIIDGSESKYIDHDIIEIIRDFTVNA
HIRDINVEVIGIPTLETKSRYQRIMETRYGRPDSPAVKS
>NE1514 Glycine cleavage T-protein (aminomethyl transferase)
MNSDWFTFLTHRNAHIEQNRVLHFGQPAAELAQAASGPVLIDLSHFGLIR
FSGEDAQNFLQGQLSCDVRSVDSTQASHGGYCTPKGRLLGSFLLWQDSDN
SYLMQLPAERVETITRRLKMFVLRAKVSIQDNTDDLIRIGIAGKNALLSL
QNMLPDTTISPAPLAVTSIPDGQIICHSENRFEIMTTSIQAPSLWEQLNK
QAHCAGAAIWDWLEIREGIPAIFNATQEQFIPQMINLDIIGGVSFKKGCY
PGQEIVARTEYLGKVKRRMYLAHLDADSCQNIAAGDSLFGTDTGDQACGM
IANAAPVPAGGVDVLAVIQTSSMEAGSIHWKTPNGPQLTILPLPYAIT
>NE0948 LysM motif:Peptidase family M23/M37
MFMRLFSFFLLTLSLGACVSKPDPAPVVERLPGSYSGISGDNVKDYERKE
VYVVQSGDTLYGIALKNGLDVNQLAEMNDIADPRELRVGQKLYLRTLAGQ
GQIPDQEDLSSQPTLFSISQPGDVGVAGYQPGTLSDGSNYETTESFKTEP
KGVLLPYSDTARDQLNNQSGAAQVDSEPYHQVSSIKEPSKNKSDQSANVR
TNNSSINWGWPTSGQIISQFSEKSKGVGIGGQLKQAVLASASGTVVYSGS
GLRGYGNLIIIKHNDSYLSAYGHNSKIFVHEGDSVSKGQKIAEMGNTDGG
VVKLHFEIREKGKPVDPLGYLPTR
>NE1424 Thiamine monophosphate synthase (TMP)
MVPAAEPMKQGGVSGLYAITPDMADTGRLCDAVRQALAGGVSWLQYRNKT
ADSRLRLVQAAEIHLLCRQFQVPLIVNDHLDLAMEIDAEGLHVGGDDISP
AVARCHLGQDKIIGVSCYNQLDRAIEAEEAGADYVAFGAFYPSMTKTGTY
QAPIELLTAAKKKLGIPVVAIGGIDLDNAGMLIASGCDAVAVSQALFGAQ
DIQSAARYFSELFD
>NE1576 hypothetical protein
MLHARSVRKDIQPVIARHIGDPEWITGQIEVKESVRINACYTRYRHQRFP
NLQLNFSTDHKTLLPENPEPSRFNKRAIPWFTSRTT
>NE0143 Chain A, Red Copper Protein Nitrosocyanin
MKTTKAMLAGFAVGSLLLAGAAQAEHNFNVVINAYDTTIPELNVEGVTVK
NIRAFNVLNEPETLVVKKGDAVKVVVENKSPISEGFSIDAFGVQEVIKAG
ETKTISFTADKAGAFTIWCQLHPKNIHLPGTLNVVE
>NE0831 DAHP synthetase class I
MKSGSLYCQCGKEVQKEIVIKESSVEHKTDDRRIIRGDELITPVKLLHDH
PLTEQATETVVTARQECHDILRGEDDRLLVVCGPCSIHDVDAAVEYASRL
SHLRQELKDQLHIIMRVYFEKPRTTVGWKGLINDPDLDNSFNIDKGLRLA
RDLLLKLGNINMPAATEFLDLISPQYVADLISWAAIGARTTESQGHRELA
SGVSCPVGFKNGTYGNLNIAIDAIGAASHPHNFLSVTKEGRVATFTTRGN
EDCHIILRGGRSPNYDAESVASAVEALRKAKLPPYLMIDCSHANSHKDYR
RQPEVAADVAQQIAEGNCSISGIMLESHLVAGRQNVEGKRREELVYGQSI
TDGCIDFDTTQQVLFELAEAVEKRRNLQNTPILVPDHSTALS
>NE2386 Bacterial SH3 domain homologue
MNKITYFLAASMFFVFLWGVPLTARAEKSYASDQVEVLMRTGPSQQHAIV
RMLKSGVALEVLERDQNKGYSRVRTTGGTEGWVLSRYLMAEPAARELLEK
LSSDFSGSDSRPDSIRAQADLVRHELGMQKKQAETLEKDNKRLEAELSKI
RQLSANAVQLNGENQEMRQQVADLKMKLGELEQENHALSGQIEREWFYAG
ALVLFAGLFLGLVIPRIQWRKRSRYNDF
>NE0798 hypothetical protein
MDKFIEQVSLYIQEAPVWPFTLLGFILVVGVAVDIINRRRRTAAVEYFDL
AFQEELTGLYPAATRWPDDLAAYMQPRLPILRDAFEVLRNFIPQNQLREY
NAAWNRFYQFSRTGGNERPVSLEDAAQELAVNQPDLQQQQAFQQMISDLL
AFATQFKK
>NE1049 Ankyrin-repeat
MKLTENMSFPVRLKQSCLAGVTALFFLLQIPFAHADADKDADFLKAALTG
DTSGVENMLNEGIQTDLQSPEGFTALSVAAQTGHKEVVKLLLNRKATVDL
ANVQGGTPLLLASKNGHQEIVDLLLAKGANPNLQDKNGLAPLMLAAAKGN
TGIVRSLLEHQAQPDLQNNAKATALHMAATNDYADIIDMLLAKGASVDLQ
DANGASALILASLSGHLSIVRKLLAHGAQPDLKATNDFTALILSAQNGQN
PVIEVLLEKGVHIDFQNKDGMTALMSAVLNENIDTVKLLLEKGADTKLKN
TSGKTALDIAKLPAIIELLKAAKS
>NE2277 NAD dependent epimerase/dehydratase family
MKVLITGSAGFIGSTLALRLLERGDTVIGIDNHNDYYDPKLKEDRLARFA
DHPDYTHLRLDLADREGIKTCFETYKPQRVVNLAAQAGVRYSIENPLAYI
DSNIVGFAHILEGCRHNDVEHLVYASSSSVYGANTMMPFSVHHNIDHPLS
LYAASKKSNELMAHTYSHLYNLPTTGLRFFTVYGPWGRPDMALFKFTRAM
LAGEKIPVFNYGKHRRDFTYVDDIVEGVIRVLDQPARSNPAWSGANPDAG
TSLAPWRVYNIGNNSPVELMDYIAALEKALGKKAEMEMLPLQPGDVPDTY
ADVSDLVEQFDYKPATPVEQGIANFVTWYRNYFNL
>NE2328 putative thioredoxin
MNNQVEPRKLVVYGREGCHLCEEMIASLRVLQKKSWFELEVINIDGNEHL
TRLYNDRVPVLFAVNEDKELCHYFLDSDVIGAYLS
>NE0152 PQQ enzyme repeat
MIGSISLSAPDFTFHNGYCRALRVCILALLILLGGCANLSDITGAHFTDL
FSGDEDEVEIDEAELAELQTLAPIKLLWQVKLSESKTAVFLPVYDNGALY
AADEDGRLIKLDPATGREIWRVDTKSQLSGGVGTGGGMILLGTYKGEVLA
FDEAGNALWQSQVPSEILSPPQTDNGIVVVRTGDSRLYGLNAADGKQIWS
YQGVTPPLTVRSFVGVSITRGAIFAGFPGGKLIALDLFTGNVGWEATVSQ
PHGVTELERMTDISSLPIVDENQVCAVAYRGRAACFEISNGNQIWARDAS
SSAGMVMDNSHVYISEEHGTVAAYDKSSGAAVWKRGKLGSRKLSALMVAR
GTRLIVGDDQGYVTLISRQDGSLLSRAPTDGSAISSRAEYLPDGFVVQTH
KGGLFAFSLQ
>NE2209 hypothetical protein
MKFQPLRGESITTGLLFSLFVLLIFSCRAVAQESTRNEFLSIAKSAVVLY
DAPSLNAGKLYVAGVNLPLEVVVKVVGWVKVRDYHGYLAWVEDKNLGPKR
FVIVKIPVGSVYQSPNPTSSLIFQAQQDVILELLGVVAGGWVKVKHRDGQ
TGYIRTDQIWGV
>NE2527 Restriction enzymes type I helicase subunits and related helicases
MFNESNTVEAYVRDLLAGPIKAVPVNTAQEPQASYGPSPKGVGWRYAAPS
EVPRQNQEVLVEPWLREALIRLNPEIAAQPDRADEVLYKLRAIVLSVRSD
GLIRANEEMTAWMRGERSMPFGANNEHVPVRLIDLDDLSQNHYIVTQQYT
YRAGPTERRADLVLLVNGLPLVLIEAKTPVKKCISWVDAAVQVHDDYEKF
VPELFVCNVFSVATEGKAYRYGSIGLPVKDWGPWHLDGDGEDGQHHPLKS
LKLSAESMLRPHVVLDILGSFTLFATNKKKQRIKIICRYQQFEAANKIVE
RVLAGYPRKGLIWHFQGSGKSLLMVFAAQKLRMHAGLKNPTVLIVVDRID
LDSQITGTFTGADIPNLEKADTREKLQQLLAQDVRKIIITTIFKFGEAPN
GKSGSLNDRSNIIALVDEAHRTQEGDLGRKMREALPNAFLFGLTGTPINR
ADRNTFYAFGADEDEKGYMSRYGFEESIRDGATLKLHFEPRLIDLHIDKA
ALDAAYKDLTGGLSDLDKDNLAKTAAKMAVLVKTPERIRKVCEDIVEHFQ
TKVEPNGFKGQIVTFDRESCLLFKAELDKLLPPEATDIVMSVQAADKKEH
PEYAPYDRSRDEEERLLDRFRDPADPLKLIIVTAKLLTGFDAPILQAMYL
DKPLRDHTLLQAICRVNRTYSEQKTHGLIVDYLGIFDDVAAALEFDDQSV
KQVVSNIQELKDKLPEAMQKCLAFFAGCDRSVQGYEGLIAAQQCLPNNEV
RDNFAAEYSVLNKIWEALSPDTVLGPFEKDYKWLSQVYQSVQPSSGHGKL
IWHSLGAKTIELIHQNVHVDAVRDDLDTLVLDADLLEAVLSNPDPKKAKE
IEIKLKRRLRGHGGNPKFKKLSERLDALKDRFESGQINSVEFLKQLLEIA
KETLQAEKEVPPEEDEDRGKAALTELFNEIKTAETPIMVERVVADIDEIV
RLVRFPGWQGTQAGEREVKKALRKALFKYKLHADEELFEKAYSYIRQYY
>NE0221 Radical activating enzymes
MLRITEIFYSLQGETSRMGLPTVFVRLTGCPLRCGYCDTGYAFSGGKSMS
ISEIMGEVASFSPRYVTVTGGEPLAQADSLVLLAALCDRGYSVSLETSGA
LDIARVDARVSRIVDIKTPGSGEVEKNHWNNLAYLTSHDEIKFVLCDRAD
YDWARQKLLELKLDVICPVLFSPAYGQLVPADLAAWILQDQLPVRLQLQL
HKLLWGECVGR
>NE0925 Cytochrome c, class IC:Cytochrome c, class I
MNSRNSISFSWLTSIGLISLLAFMEPAAAASDDSFEHAERLYDTYCTQCH
GVNRDGNGVNSVFMSVQPRDHSDAKGMGDIPNSEIIDVIKKGGLAANKSV
LMPAWGGVFSDEEVNQLAAYLRHVCKCGSDQ
>NE1120 putative orf; Unknown function
MFNMALVFFLIAVLAGILGFAGIAGTLAWAAKVLFFAGLILTVVFYLLGK
RTPPV
>NE2390 Thioredoxin
MSKPNRFSFYVIVAILSLAAGFAYKSREMGITVSLSSADRKAGADKFFDA
TLSALDGTPQALSQWRGKIVIANFWATWCLPCRDEIPELIDTYTAYHQKD
LVILGIAIDDTDKVAAFSKEFNINYPVLVGEFDAFSLAEAMGNLQGALPF
TVTIDRSGMIVDTHLGRIKKKQIEEIIKPMLQ
>NE1695 Bacterial transferase hexapeptide repeat
MTDISVAGDLLLKSTHLQINEVVAELRNVRTTALLDRQGHLKPPRLPSRK
ILAGVIEGLRTALFPNRLGPSDLAVEGVDHYVGHILNSTLRALLVQIQHE
LHFNSGTEELDQETYAQAAEITQAFAKRLPAVRRLLDGDVLAAYEGDPAA
RGIDEVLVCYPGVAAITWHRLAHELYQLNMPLIARMISEIAHSQTGIEIH
PGAQIGERFFIDHGTGVVIGETSIIGRNVRLYQAVTLGAKHFPVDENGTL
VKGIARHPIVEDDVVIYAGATILGRITIGHGSTIGGNVWLTRSVPPGSLI
TQAQTHND
>NE1349 conserved hypothetical protein
MIGLDTNVLLRYLAQDDAIQSPKSTLLIESLSVEEPGFVPLIVIVESVWV
LSSAYGSTREEITEVLHNLLRTRELRIEQAETVAAALHLYQRGKADFADC
LIERTAMRAGCKAVMTFDKTAVKSCGMQLID
>NE0969 possible N6-adenine-specific methylase
MVKADRIRIIGGQWRSRLIQFADDELLRPTPDRVRETLFNWLGQDLTGKI
CLDLFAGSGALGFEAASRGAKQVTMIEQNMKAVRNLHCSIEKLGASQVKL
EHVDARMFLTANSERYDVIFVDPPFKSGLLAEVLPLLPAKLEEEGVVYVE
SSDKLLPDDTWSIWKQGRASHVHYCLLSLNPDG
>NE0789 conserved hypothetical protein
MKLPLTSFAVFLITQTVWAGEVFALDMIGVAGGANFNIQGKSFAERRFST
VYRQQFDFSCGSAALASLLTFHYDDSVDEQSVFVDMFQHGDQEKIRREGF
SMLDMKRYLERRGYGSDGFKINLDQLYSTGSPAITIINHNGYMHFVIIKG
VDEDRVLVGDPAQGVKSMDRTEFERMWGNRIVFLIHNHVDPEVSYRKIHQ
EWSGRLAPLGEAVDRTSLGVFNVLRPGPWDF
>NE0676 probable oxidoreductase
MTTIGFIGLGIMGKPMAGHLIRGGYPVYLNSRSGIPAELTVPGGLACPTP
RAVAEQADIIILMLPDTPDVEKVLFAENGVAQGLSANPPRSKIIIDMSTI
SPIRTRDFVTRIRQLGHEYADAPVSGGDIGAQNATLTIMVGATEAVFAEI
EPILALMGKTVTRIGDPGTGQVCKAANQIIAALTLEAVAEALVLASRAGA
DPNKVRQALLGGFAASRILEVHGERMIKHAFTPGFRVDLHRKDLGIALDC
ATELGVSLPGTALVQSLYNASIAQGDAQQDSSAIVRILEQLAGHTLDTST
ESAHEH
>NE0682 SCO1/SenC
MRTFLATLFVAISGTLILWISTDGGRAFTAEEARRLEIRENPRSVPDWQL
QNQDAETFTFQNWHGHLIVVDFIYTSCPSVCLILSGNLKNLQKDFSDEGK
SDKLRFLSITFDPEKDTPQRLKEHLSHFSADFKNWVAARPTSPSQKEAIL
DFFKVIVIPDEYGGYTHSAGYHIINPDGKLVAIFGMEQMDELRAYLNQAL
EGKDNASEN
>NE1183 Serine proteases, subtilase family
MATEKRSSRSTSSKSSGSKSAGTGKTRQPGVEHISPFAQFSRSAASTKSA
EFPSILNKEGSSPEVGTIVYIHGIGNKPVASILKCQWDTALFGAPMGDRT
RMAYWVDRNRYPKPKDASCADKDTLPEDAGGMGVQALQENELEPEIKLSA
AERKIMAALEERLRAGEKQPGGVNVKVLPLPESVRLWITRRITKLFLKDV
QDFFFDEHKRNVMEQSLRDRLDVGGGPFIVVAHSQGSMIAYHVLRQLKKA
DCDVRLFITIGSPLGIQEAQDVLGKIGSGKPLAVPECVDRWLNVAERLDP
VALDSDLGNDYQPNSHGVKIENHAGLMINPEWASNPHSGTGYLSLDIVRQ
AVRETAGPTFSNPVGRNILMKDLVDNIEDGHREQRHPTLIQLVSDDDDAT
SLDEVRDRLSNLIKNVLIFNNAEPEEGRIQLMRRFISADLTRSEIEELRS
HCSTLKIDRVWRNALKRTLLHQSAHTIQVRPANLGYGACGQNIAWAVLDT
GIAADHPHFKKHNNVVAQWDCTGSGSPRELRLGDSGFGTLDGHGHGTHVA
AIIAGTLTLPHDEKDKSEILLQGMAPEARLYGFKVLKDNGNGEDAFIIKA
LDTIAELNERAGKLVIHGVNLSLGGNFDPSVFGCGHTPLCQELRRLWRQG
VLVCLAAGNEGYALLDSASGVIPANMDLSIGDPANLEEAISVGSVHKTNP
HTYGISYFSSRGPTADGRMKPDLVAPGENILSARHQWPKSKLTMRNAYVE
MSGTSMATPHVSGLFAAFLSARREFIGYPDRVKTLLLQHCTDLARDPYIQ
GKGMPNLVRMIMNT
>NE1268 conserved hypothetical protein
MLLTAVLLPATEGGFTALNPETGTTSQGETVEEALANLREATELYVEEFP
LTIASRPLVTTFELPAHA
>NE0271 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDHLISVNRP
>NE0285 DUF170
MQLIINGQQQSYDGPMNVQQLVEKLSLQNKRFAIERNGEIIPRSRFPELL
LNEGDQLEIIVAVGGG
>NE1889 conserved hypothetical protein
MRYWLMKSEPSEVSIDDLAARPGQTVPWDGVRNYQARNFMRNQMQPGDLV
FFYHSSCPEPGIAGVVEVSRLAYPDETQFDSASKYFDPKSTRENPRWFNV
EVRFLRKTRLLSLRELRSYPELAGMRILQKGNRLSITPVDPSEWKFIEAK
LQS
>NE1897 Peptidase family M48
MMELDLPATIKRDMKFRYLLIIVPLLFPAHIFGQELPDLGDVSQASVTPH
QERQIGIQIMREIRADSSYLDDPEIADYLTRLGNRLIAASGQTNPDNPFE
FFAINDSSINAFALPGAFVGFHSGLITAAQNESELAGVMAHEIAHVTQKH
LARMISGTSYLGLLGSIAALAIAILASRSNPNAGQAVLATAQASAIQSQL
NFTRKHEKEADRVGFNMLIKAGFDPHGMSSFFERMQHASRYYENGMPSYL
MTHPVTHERIADIQNRTQELGYRQVPDSLEFHLVRAKIRASRGNPASVIN
EFKARLQDKRYINEIAEQYGLIQALLRARQFKQADEELNTLYRTIQSDSS
AQSLKNHRLGKSIQVDGDYLQSAAMIETLAARVKFASGQTDDAFRLYQSA
LQSFPHYRALVYGYADALLQHKDAQAALDFINGQSQFIHDDIRLYRLTAQ
CHAALGNALLQHQAEAEALIREGNLRAAIEQLQIALRHKHDNFYQLSSVE
ARLRQIKEFVAAEKEKK
>NE1093 Transposase IS4 family
MARFKPVQKGLMLLPVDFSRQIIPGSFEHALCYLVDHELDFSGLRERYRN
NTQGAPAYDPAVLLKIILLAYSRGLIGSRRIEAMCRQNILFIAVAGDNQP
HFTTLAAFIAELGDEVAKLFAQVLVVCDRQGLIGRELFAIDGVKLPSNAS
KAKSGTRADYQRQAEKMEKAAKQMLVRHREIDMTPVDERQAQREACMLER
LQKEAKQLKDWLAANPEDRKGPKGGVRQSNLTDNESAKMATGKGVIQGYT
GVAVVDEKHQIILDAQAHGTGSEQELLVPVVQAIKPQMSNQTVITADAGY
HSENNLKMLAAEGIDTYIPDNGYRKRDERYHGQEAHKTKPDPLWDKRGQP
SISKRFGPGDFQLAEDGSHCLCPAGKRLYSNGSNCTFNGYAAMKFRGAER
DCLPCTLRTQCLRTPEKTKTRQVAFFRGKRDGYETHTDRMKRKIDSDQGR
QMITRRFATVEPVFGNLRNNKRLDRFTLRGRSKVDGQWKLYCLVHNIEKL
AHYGVGQ
>NE0349 hypothetical protein
MKGNALCTGEPVMTIETRGGGSEDIAVLPLDPTGNITRLFIRERCNDSSD
PECKTGQWRLGSVDISRENTPASTRYAWRPGNNASEDDFEPLGMSLVPGN
TPGEGTLFVIDIARPQSVRIWQLDISGGEITKATLATPADTQTGARLTAA
NSLQAVRNDDSRSFHLTITRFDEYGLLPFRPTPWPALVRINNGVIQPPPA
QDFRNANGIIRPCAGCDLVIASYWERRLRFVSKENGEIGEYASAELPIRP
DNLRLDGERILIAGQRRVDLTALNLLVSPHIPSPSGVYAIDTRSLGPDTV
PTLLWEGGWKHGHSVATAVALPGNRLAIGQINTPGILIADCSP
>NE1254 putative general secretory pathway protein H precursor
MVIVLLGLLSLIGAQALPGDDRHLQTVSERIEHRLRKARLSAMRSGTAVF
IPCESLAAPSDRTTPVTSSRSTDWMVTCSGEVGRTEGITFFTDGSSSGGI
IELVSGVARIRIHIDTLTGTSRLE
>NE2310 ABC transporter
MAKREISRGSLWNQWDLHVHTPASFHWLGKKFEGDLSSDVNKALVDEMID
AMNKAEPAVFAIMDYWTFDGWFALKRRLAEQDSPKLHKKVFPGIELRLSA
PTAIRLNAHVVFSDEIPDQHLKDFLADLKIERTGRSVSKDALIALARATA
PDMLEKKGHKKDVVVTDDADALLAGYKLAEIKADSYKDAISKVPNGLAIG
FMPYDTSDGLAEVKWEEHYSYFLGLLESSPIFESRNLELRFAFLCEETEK
NKGWIGNFKHGLKGIPRLVVAGSDAHCFIGTPGDNDKRGYGDFPSGKKTW
IKADPSFRGLKQAILEPAKRSFIGQKPPKLQEIESNKVYYIDSLEVAKTG
SESDIGKWLDGCDIPLNPDLVAVIGNKGSGKSALADVIATLGNSKQSRHF
SFLRKERFLGKAGEPARHFTGTLTWCDESTEARPLNELPSEEKPELVRYI
PQGYFEELCNEHVSGKSNAFEKELRAVIFSHTSEEMRLGALDFDQLTEQQ
EQSLRNRLGEYRKVLSSINADIVRIEEQLQPIELERLNELLLLKIKQIDE
HEKLRPVEVAAPAEALDPEQKKASEELVSINQKIESSQKRSQEIALENTD
LAKRQKAITNIREQLRLFQRAFEQAQQTVSDDLKILGITWPDLAVIDIKQ
HILDEKSAEISKKQEQLKSEAEKIAEELKESVASQQSFTAKLNAPQQQFE
AYQQQLSEWNKKLAYLKGSPTEPETKVGLETRIEQIKMLPEQHTELEEKR
LRLSGEIFDTLDAQRKAREELFKPVQDLIQKNSLIREDYKLQFQATLAAS
SEAIADQLFALIKQTWGDFRGQDEAPSTIRKLFDNYDLNSKEGALAFIAA
LQEKVQEASTLSGATVGIFNLVKKGQSAISVYDLIFGLNFLEPRYSLLFQ
DTQIEQLSPGQRGALLLIFYLLVDKGKTPIILDQPEENLDNETVFRLLVP
VLSEAKKQRQIIMVTHNPNLAVVCDAEQIIYAKFDRADNSTVSYESGAIE
NSNLNGIVVTILEGTKPAFDNRSGKYH
>NE0262 conserved hypothetical protein
MPIQKNTSVTLGEHFEKFLAHQIEAGRYGSASEAIRAGLRLLEEREAKLE
ALRRALIEGEQSGPADYSLQNVLDELESAD
>NE0104 Dihydroxyacid dehydratase/phosphogluconate dehydratase
MPDNRRSQTITQGAQRTPNRAMLRAVGFGDGDFDKPIVGVANGFSTITPC
NMGLDTLAKCAEHALKTAGAMPQMFGTITVSDGISMGTEGMKYSLVSREV
IADSIETCVQAESMDGVIAIGGCDKNMPGAMIALARLNVPSIFVYGGTIK
PGHYQGRDLTIVSAFEAVGQHTAHKIDDHELLEVERHACPGAGSCGGMYT
ANTMSSAIEAMGMSLPYSSTMAAEDEEKRISAIRSAEVLVDAVKKQILPR
SLITRKSIENAVSVVMAVGGSTNAVLHLLAIAHAAEVEFSIDDFEAIRAR
VPVLCDLKPSGRYVATDLHKAGGIPQVMKMLLVHDLLHGDCLTISGQTIA
EVLKDIPEQPRADQDVIQPWDNPVYEQGHLAILKGNLSSEGAVAKISGIK
NPSITGPARVFDSEETCMAAILERKIQPGDVVVIRYEGPQGGPGMREMLS
PTSALIGEGLGDSVGLITDGRFSGGTYGMVVGHVAPEAFAGGTIALVQEG
DSITIDAHRRLLQLNVPEAELERRRAAWHPPAPRYTRGVLAKYAKLVSTA
SRGAVTD
>NE1536 TonB-dependent receptor protein
MKQFPLALIGLLCASSAWAAQDVEQQKQFSRKNLSTQADNSSDGIRVSGA
NGEHAGEQLAQAADEQVQTGKSSNSDNLNEQKEKDSGEPAGQNETVFREM
VITGEIERDSHFTSPSMRVTRAQVEQQNAQTTEEVLKYMPSLQIRQRYVG
DPNGVLGIRGADMFSTARSMVYADGLPLHNFLQASFNGAPRWSLVGPNEI
DSVDVVYGPFSAEYSGNSIGGVANIRTRMPHKQEFYFEGSLFIQPYKNYG
EDKGTFIGDRQYFSYGNRIQDKFTVFLAYNRLEAQGQPMSYFIDNTGLSN
VPGGTPVTGGIRTPDTRGTPSIIYGDTGPEKVNTHLFKGKFGYDITSDLQ
ALFTVAYEERTRNQNHPSNYLYTANGDYFWGGNAPTCNSVATCSTPGNAS
LDGTRFDVVRSGFGESRDKRETLNLGLSLRGALTPDWDVDTTFSYFDVLK
DIRATAFFNHADPGNTGAGQLQDFRKFNWWNYDLKLSTPMLLGYDKLSFL
AGYHFDQYNLSFRQYSLTSYEDLTRGALQANRNNDGRTSTHAIFAQSMFR
FMPDWDITAGVRQEWWEASKGVASNVAVPDRTETATSPKVSLGYEPGQWK
FRYSFGRAHRFPVIAELFQSLGTPTSILVANATLKPENGVHHNFMMEYGL
PKGYVRVNVFRDDIKDAIRQITSASGALTTSAFQNIGEVGTTGVELIFEQ
RNILGSRFDFMFNGTWMNSKVEDGPVVRFTESTGVPANYSLTGKQVIRLP
HWRVNVFGTYHVTDAWDVSLSGRYTSDSYNDPDNRDYKNNVFGAQSDFFF
MDFKTSYRYRFNNGLKSRFSFGISNLNNDKAWVFHPYPQRTYLLEAAFSY
>NE1591 conserved hypothetical protein
MTCEVRLRPEAEQDLADAAAWYEEQRQGLGHKFLDEVTTTLSNIAETPLA
YPNVHRGTRRAVIRRFPFGIYFQVKKATIIVVAVMHGSRNPHQWKSRT
>NE0601 Na+/H+ antiporter NhaC
MTSEKTALEELPVAQPLPAFWQAAVCFIGILLLIALGLFVFEISLHALIF
LALVWAGVHTRILGYSFIAIRSMMDEGIVRALPAIYIFMLIGMVIASFMQ
SGTIASLLYFGLDWLDPALFLAAGFVLCSLMSVATGTSWGTAGTIGVVLI
GIGDAMGIPLPLTAGMIISGATFGDKLSPISDTTNLAAMSADTNLYRHIG
SMLYTTIPTFVIVLSIFTVWGWQYGDHHLPQAHLTEIRSALAGSFRMDGM
VTLLPLLVMLGMSIKRYSAEVSMTCSTLLALLIAVVYQDRNGIDVINALW
LNTPGDTGIASIDELLGRGGLYSMAWTLLLSIMALALGGILHHAGFLQVL
LMRLIAPIRRTSTLIAATIASGLTGNLVMGEAYISIILSCQLFKKVYQEK
NLDPAVLSRSVEEGSTLTTGLIPWTTAGIFYTATLGVPTLDYAPYTLLNL
LNPLVSIGMAVMGVGLLRSGHYQKTT
>NE1989 putative transmembrane sensor
MNSTRNITEELPETILEEAAIWQARLQHGETDIELQKAFNIWLAADARHR
QAYEEMESLWGALETPVTQLMAEQSDHTFAAKTVPLHKLRQRSLGRGFQC
LALAACLVLAIVIGWQQDWVTRWQSDYMTVIGQETSFLMADNSRITLNTD
SALAVDFTAERRQVRLLKGEAWFEVTSADNRPFIVATSAGSVRVTGTRFN
IRLHQDTAIVSLDEGQVELSTPDHSNNPNGSPIVLFPGQQATLSPQGISS
PSPFDHTAITAWLRNQLVFYDTPLAEVVDTLNRHRHGRIFITNQKLKALK
VSGVFSTDEPDTALDVMINTLPIHLLRLTDYLVLIH
>NE1984 putative transmembrane protein
MIRADIIKLYKTVHTWTGITCGFALFIAFYAGALTMFQEPIARWASPPAV
GVAAVPLDDAPRLLELVAAAHPEAREEGIKLHLRGYENEPARVTWEEDES
HEAGQPHGHVHWWATLKPDGSLLAKQEEPSELAEFINTIHMRIGIPEPWG
SYFMGVVSLLYGVALIAGVIVLLPSLVKDFLALRVGRNLKRMWLDAHNVV
GITALPFHIAMAITATALSLSHELWSLQEAVIFGGKQAILDERDNEPFRA
PKPIGEAGAMMAPSQLLQRLKAQVPDFEPKVMIFKNIGDKAASVRVAGAE
RGYVGDTHLGGVMLSAVTGELLKDSTRPSRQDPDRRASETFYGLHSGQYE
GATITWAYFFLGFSGAWLFYSGNLLWIETRRKKACKGGELPEQRRDTYWL
GAGTVGVCLGCVAGLSLTIVAAKWLYGRVDDLNHWHEYIYYAVFFGSVVW
AFAWGAARSAVHLLWLCAAATMAIPLTTLLAVLFPALGMWAHGSAATIGV
DVVAFIGALCFAWMAVATTRRIRNGPTDSIWSIRKADGDTVPHKPAATPA
E
>NE0832 DUF157
MNPNQATWFKNYTLDYLEGLRNANMGVYIGIRFLEVGPDFLKASMPVDHR
TTQPFGILHGGASCVLSETLGSVSAWMTIDPEQYRAVGIEINANHIRAVT
QGNVIGVCTPLHVGRRTQVWQTDITEEETGKRIAVSRLTVAIIEQGTLST
QKEKVIVSK
>NE1630 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFTWSRQMPEPR
>NE2548 Bacterial outer membrane protein
MKKMILARKYLMGAVILPGLLASGAVMANTHEEERQGLSAVDKAEAYAVD
DRGVVARNSTGLCWRTGYWTPAKAIYECDPDLVKQPEQVVEAPPPVVPVV
TEPEKVSFSADALFDFDKAVLKPAGKQALDDFAANLEGVNYDLIIAVGYT
DRIGSEAYNKNLSIKRAEAVKSYLVSKGIGSDRIFTDGKGEANPVTGDSC
HGTKATRALIDCLAPDRRVEIEVAGTRETVN
>NE0005 possible 16S pseudouridylate synthase
MEKVRLSKLMSARGICSRREADVYIEQGSVYVDGQPVRELGTKIYPWQEI
VLDRAAQIRQNQQVTILLNKPVGYVSGQPEPGYKSAISLVNNDSHFHQDR
SSLRFVPQHLKNIAPAGRLDIDSQGLLVFTQDGRIAKQLIGEYSQIEKEY
LVRVSGNLTREGLALLNHGLQLDGQALKPARVSRLNQDQLRFILQEGKKR
QIRRMCELVGLNVTGLKRVRIGQIRLADLPEGKWRYLGKNESF
>NE0040 Rhodanese/cdc25 fold
MKDIDQILETARERAEDMGVPYKGALLPSEAHAVLQALPKTRIVDVRCRA
ELDWVGRVPKAIEVELLTYPGMQSNSGFLDQLTSQIADDAVLLFICRSGG
RSSQAAALMSQNGFTDCYNILEGFEGDKDESGHRGQQSGWKAAGLPWIQS
>NE0022 alanine dehydrogenase
MQFNKSEWHERFRASMIIGLPKETKADEYRVGLTPGSVHTLVGRGHRVLV
ETGAGAGSVISDQDYRAAGAEIVTHAGDAWAAELVVKVKEPTEPEYQYLR
KGLLLFTYLHLASNRTLTEKLLATGVSAIAYETVQTASGALPLLTPMSEV
AGRMAVQIGATYLLKTQGGRGVLMGGVPGVAPANVVILGAGVVGTNAATV
AAGMGAMVTALDINHDRLKALDDRFSGRLQTRYCDTVNVRQAVYEADLVI
GAILIPGGRSPWLVTREMLPQMRTGSVIVDVAVDQGGCVETTRPTTHSNP
IFEIDGVLHYCVANMPGAVPRTSTFALNNQTASYVATLADNGLAALQNNQ
ALMNGLNTHRGQVTHPAVAEAFGLSYASPLEVLVA
>NE1599 conserved hypothetical protein
MGYSLIFTDAYNQRAARWLRRHPDLRTQYLRTLQILQTNPYHPSLRLHVL
SGKLQGIYAISINLSYRITLEFLIEDKQIIPINIGSHDVVY
>NE1376 hypothetical protein
MGLLLYRELVHKLKTKYKKIAGKGENQKDKTLPWHLETDQLTNRHPNHKV
RIPYIDEVFDMLAKMTAKNQLTLPKSVTTAVGAGEYFEIEVRNGQIVLTP
VRIQRADAVRAKLAELDLGERDIMDAVTWARQPVKGSATE
>NE1491 putative copper-containing oxidoreductase signal peptide protein
MEEHMKKTLIVFLMLSCLGSVGAYATANHSHAAIGEPGVQENVTRTIRLE
AYEYRFSPSEINIKQGETIRFIVKNTGKKKHEMMIDTMQHLREHEKMMRQ
HDHGAHTEPNQIILEPGEEKELIWHFTQAGTIDFACPVPGHFKGMRGKII
VESK
>NE1577 conserved hypothetical protein
MKVSISNSAFNDLETMISYYTAEGVPDVGFKFAQEIIEHIQILADHPDMG
RIVPEFQLPHIREIIFAPFRVVYLREKGAIKVIRVWRSERPLVLPTET
>NE2247 hypothetical protein
MIKRFNLTICGLGLILVTSHFFPAGAQELILYVDTATKQVYTEPGKNRIK
LGTFQQVKESPVQSQSKPDTESSQPVSNTTTGLAQGEAKFQENSGQSGAE
SDIRRKSEEIAAHSNEPSAEKPKEEKKWYDRIGIRGYTQFRYSSTVSGDK
DAVSYWPDKSVGEDGSFLIRRARLVIFGDINDHLSLYIQPDFASTPSGSS
TGHFAQLRDAYADIHFDKNKEFRVRVGQSKIPYSFENLQSSQNRLALDRN
DAMNSCCRDERDIGAFFYWAPTHIRDRFKEVMAKNLKGSGDYGMFAFGIY
NGQGANRLEGNNGVHMVTRFTYPHQFSNGQILEAGIQALRGRFVPSTGPA
GGFTPVMDAPEKGFKDERVGVHAVLYPQPFGLQAEWNWGRGPQLNDGQTM
LTESSLNGGYVLATYKIDGLRWGTLFPFVKWQHFKGGQKFERNAPRNHVN
DLEFGLEWQVMKEIELTAVYHMMNRTNVASAPYERYKADVLRFQLQWNY
>NE1176 Putative peptidoglycan binding domain 1
MIPVDGNFILAVAPQFSGDKKAAQEQIIGAISAIFSATLDGYAINTRLRI
AHFMGQVTHECAGFRTTEEFASGAAYEGRQDLGNTQAGDGKRYKGRGLIQ
LTGRANYRDIGQILGLPLEDNPAMAAEPVTALKIACEYWKKRNINSAADR
DDLISATRLVNGGLNGLEDRRGYLIKAKSELARIEGLVISADEGGNTMIL
HRGSFGAGVAELQELLINKGFSLSIDSSFGAATELAVMTFQKANSLEANG
IVERNTWTRLRA
>NE1702 hypothetical protein
MKNNNNRLIYTGAFLAATLTFGQVVQADSLVLSRGNQTISRDTVVDSVVV
GHTLDDSIAGSGHLIVNNGAQLVNNGTGHYLGMIDGVNFNAGNGYLGLNT
DSMGVATITGASSLWQNEKDLYVGRYGTGSLNIEKGGKVTNAEGVIGDRA
GSIGTVTVTDAGSLWQASRGIIVGVVGQGTLDIRNGGRVSNEATVIGISL
DGHNGNVTVTGVGSLLESTVSTQIEYGSLNIENGGKVINGHDGFIGANVS
SIGTATVTGTGSRWDNLNKLYVGGWKDGSVLGNGSLIINDGGAVSATNGV
TIWSTGSLSGNGGTIEGDVINYGLISPGNSPGVLTITGDLTLESDSVLLM
EIFGPTAYDQLIIGGNFVAGGILELDFGGYMPEFDVPYDLFQVAGGMSGD
FSEIKFLNPAAGFDAGLLSLSFASGENGGMFQLIMANNGDPGNNTVPEPA
SILLIGLGGLVMLMLRRRSPRISLA
>NE0718 hypothetical protein
MTKAFDAFERAIRDAEDLLARHDAEKTVPNGHNGEVLKRAGLVMALAAWE
TYVKDRLQSEIDTWLQAVEGSPLGKFVRRRLEEDLKRFFNPNDERTRRIF
IDYFDVDITKDWVWENYDSSTSKKVLDSLVAKRGDAAHKANTALHASAEP
HKVKRDELEKGIRFLKGLVAATEKAKITK
>NE0672 putative
MSEQIRLENLTIFMQRRAVKHVYLRVHPPDGRVTLVAPTGMRLEVVRAFV
TAKLDWIHKQQAKLQAQVREVPRQFIGGESHTVWGRHYVLQVVEKSAGPC
VLMDQQTITLQVRPGSDLFKRAAVMHAWHKSLLHAVVPDLIGKWQDRLGV
SVSAYFLQQMKTRWGSCNTRRRHIRLNTDLVRKPQDLLEYVVVHELVHLI
EPSHNKRFVGLMDQHYPAWREATAELNQIPLSAIRPRG
>NE2153 conserved hypothetical protein
MKSALSLLGERQQSLLTALLHHRKGLTVEELSSLLTISRNAVNQHLTHLG
SSGFIQSALQESTGGRPGRVYTLTPNGLELFQRRYSFLAKLLLSWIDKNM
GEHELDVCLTELGKEMASGLEHRLTQHTSRSDKLQEAITIMSELGYDTEI
YRISENQAEIVANNCVFQELAQQHQEFCKLDISFLKHLLKADIEHTECIV
RNGKYCRFRITSPDESGS
>NE0086 hypothetical protein
MKLLLSIALFFWVGMVAHAENLPERIDIEYVLNGSIGQGKAHEILRVRQE
NGVQHYTIDSEASASGILKLIKRGSIHRHSEGTIIPHTGMKPFRFTDQRG
EKPAREVEFDWSEQRIIYRRKGQEMTENLPSGTLDELSLAYHFMFTAPPR
QTLVVHETDYHTLQTTRYTVTREMLDTPIGKLATIVLTKQREQNDPFKKK
IWLATDHHLLPVRIISTEKHGLEVDQIVTKINYSPLVNSAR
>NE0292 conserved hypothetical protein
MQQHGWTLLFHDNLIEQLMRLRAAVLRAQENDPEDFGSNTNVKFFRALIQ
LMQDVVPGDPVRDEYRQGNTMGPTYRHWRRAKLGRRYRLFFRYDSKAKVI
VYTWVNDEQTLRSSGSKSDPYTIFEKMLGRGNPPDDWNALIQASKPNWSQ
LE
>NE2520 ATP-dependent DNA helicase RecQ
MAYDPKRALELLRIGSGRANATFRDGQEDAIRHIVEGKGRLLVVQKTGWG
KSFVYFIATKLLREAGAGPALLISPLLALMRNQIAAAERMGVRAATINSD
NMDDWTVVEGKLAKGEIDILLISPERLANERFRTQVLAGIAAQISMLVID
EAHCISDWGHDFRPHYRLLERIVKTLPPNLRLLATTATANNRVMEDLAAV
LGPKLDVSRGDLNRTSLSLQTIRLPSQAERLAWLAEQLATLQGHGIIYTL
TVRDANQVAQWLKTQGFNVEAYTGETGDRREQLEQALLNNQVKALVATTA
LGMGYDKPDLAFVIHYQMPGSVVAYYQQVGRAGRALDSAYGVLLSGQEES
DITDWFIRSAFPTRQEVADVLGALEDEPNGLSVPELLSRVNLSKGRVDKT
IALLSLEAPAPIAKQGSKWQLTAATLSEAFWDRAERLTALRRDEHQQMQD
YVSLPFGEHMGFLIGALDGDPSVVAEPALPPLPATVDAELVKAAVEFLRR
TSLPIEPRKKWPDGGMPQYGVKGFIAPAHQAESGKALCVWGDAGWGGLVR
QGKYHDGHFSDDLVAACVKMIQEWNPQPSPTWVTCVPSLRHPELVPNFAQ
RLAAALGLPFHMVIAKTDARPEQKTMANSTQQARNIDGSLALNGQPIPPG
PVLLVDDMVDSRWTLTVSAWLLRKNGSGEVWPMALSQTGHDE
>NE1521 possible transposase
MTHSHRRHDISDRIWSLLEGHLPGREGAWGGVATDNRQFINAVFWIIRTG
APWRNLPPDYGGWSNTHRRFIRWRDKGIWEKLLEILIDDPDYEWLIMDAS
HCKVHPHANGARGGNQDMNRTKGGSTPRYIWPWMRMVCRSESLLHKVPLL
IARRLAA
>NE0941 possible (AF047705) unknown [Nitrosococcus oceani]
MTSIKWLGGVLLGMVCSIQVQAHGGLSLAEDMCKLTIGPYTMHFTGYQPE
STQEKEFCEDIPNIGRTIVALDYIDEALRTMTTEVRIIRDTGAEPGSEGN
LDELTVFHSPPKVYMNGSVTFEHDFPAEGKFVGLVTIRDNGTEHISRFPF
AVGTGGKPDMLYILGALALAAGAGIFFFKKKQNL
>NE2108 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1943 conserved hypothetical protein
MNRKRKWQIPPPSEPVTDEYLTSVTIGERKPLNDRIQLAEYDLRWPSMFS
VAADKIRSALSEKALLVEHVGSTSVPGLAAKLIIDMLLGVTDSADEKSYV
LPLEQQGFVLQAREPGWYEHRFFRLKSGDMEWHLHVFSAECEEIDRMLAF
RNWLRVHDDDRQRYENVKRTLAARTWKHMQNYADAKSDIVREILGRAQDD
HVVGITSEAVSAAFRFR
>NE2273 transposase
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEVV
>NE0295 conserved hypothetical protein
MAFSFSLWQLLLPVHHVAVKPVLSLSDGTTRTESDSLTTTKNFRKKLMAI
RSIRSLLCSLFAVLSLLIPFGAISAEKTTVIPESLKILGWVEFTRLEPWG
IKTPARIDTGANTSSMSARDINVFKRNGKTWVRFTFDFKKGTAERSIEIE
RPLVRMVKIKQHDGPSQERPVISMEICLADEIREAEFDLIDRRALNYPVL
LGRKALAGYVVVDPARAYLTKAVCGHTKKKKKGSEPADPGQQHE
>NE0546 TonB-dependent receptor protein
MQHYLSIMILILLLPVAHAQTGAAGSTSTAINWHLSAQPLSTALQQLAEH
SNTSIMFDAATVRNIQAPSLRGQYTPLEALKKLLSGSGLQAEETVPGRFS
IVQAATTVQQLPEMTVTGAPDPDSPYSTQYKVPDTTTATRTKTPIMETPM
SIQVVPKSVMNDQQAITLEQSLRNVSGVFHGIGQGGVEFLNIRGFQTWDY
YRNGVRFTSAQTQTGYREVANLERIEVLKGPASILYGRIEPGGMVNLVPK
TPQATPYYSLQQQFGSYNLFRTTLDATGGLNQDSSLLYRLNFAYEDKGSF
REFVDNHHFFVAPVLQWQISDRTQITAEMEYKTGKYSVDYGFPAIGNRPA
NLPINRQLGESFNSAKFDEITAGFHWSHAFNDNWEIKHRFYLQRTDEGDD
VVIPSALRTDNRTLDRFYAGFRNAKVDTYTTNMDLTGHVETFGTQHTLLM
GGDYYNFRNRRLMISNFDFPSIDIFNPVHSGTAIRDSANDSPYDRRDDWF
GLYFQDQIKLPYNVHVLAGFRYDNAEIKNISGRKSAQDKISPRVGVLWQP
IPALSFYGNYIENFGAPNLGTSGLDGQPLPAETAQQWEAGIKTEFFDGRF
SATLAWFQLTKQNIATPHPDPQLALQRISVLTGEARNQGVEFDITGELLP
GWQLIANYTYIDSEITKTNNNMQLGNRFPNVPEHAGNIWTTYAFQNETLR
GLKIGGGVTLRGKREGNPENDFQMPGYAVFNLMTSYAMKMGKTRVTAQLN
VNNLFNEEYFPGSGGFNRARIFVGTPRVFLGSLRVEY
>NE0669 Acriflavin resistance protein
MLNWLVTNALQQRVLVLALAIIMVFVGINTTRSVPLDVFPEFAPPMVEIQ
TEAPGLSTEEVESLVTIPIEMVVNGVPGLKTLRSKSVLGLSSVVMIFAEG
TDVIRARQLVQERVAVVTPRLPTNIRPPIMLPPLSATSRAMKIGLSSDTL
DQIALSDAVRWTIKPKLMAVPGVANVAVWGQRDRQLQVLVDPARLRASGV
TLAEVQRATGDAAVSLGGGFVDTPNQRLAVRHLSPLETPEDLARTVVKLA
NGAPIRLGDVATVREGHAAPIGNAIINDKPGILLIVEKQQGGNTLDVTRG
VEKALTELKPGLTGIAVDSTIFRPATFIEKSLGNLTEAMAIGCALVTMVL
ILFTRDWRSATISLTAIPLSLLGAALMLSWWNISINTMVIAGLVIALGEI
VDDAIIDVENIGRRLRLNAALDEPRPAFNVVLAASLEVRSAVVFASLIVM
LVFLPIFFLEGVSGTFFRPLATAYVLAIGSSLLVALTVTPALCLLLMPRD
GIRHADTSFVRFLKARYRPVLPRLIERPKLAVMTIVVGLTVTGIGYLSFK
DQFLPDFRETDFLMHFVEKPGTSIEAMDRVTIRASKELRAIPGVRNFGAH
IGRAEAGDEVYGPNFTELWISLDDSADYDASVAKIQEVIDGYPGLQRDVL
TYLRERIKEVLTGAGATIVVRVYGPEIDQLRTMAEQVKTAIANVPGVTDL
KVEMQSLIPQVQIKPKPEALAAYGLTTGEVRRAAATLIQGVKVGEIYRDQ
RSLDVTVWGTEEVRGDLHALRNLMIEAPAAGATVQGAATGGAQIRLGEVT
DIRIMPAANEIKRENGSRRIDITMNISGSDLGAVATAVETRVRQMKFASG
YYPEFLGEYAALQQSQRQLLTLGLLCLAGILLLVWLEFRSLRLTGLIALS
LPFALVGGVIAVALSGGIMSLGALVGFVTVLGITARNAIMMISHFRHLEE
KEGEMFGRALILRGTEERLIPILMTVSCAALALLPLVVRGNAPGHEIEHP
MALVILGGLISSTALNLLLMPTLYLRFARGSQIPPQPEAPAARVTA
>NE1916 hypothetical protein
MEITCYRDPEIRREKRTLPATTYNLAIKLLARCETKQLFIPIRSMQYMAI
VDAEEFVFVDSQRKCWIDIAWQNFHSHEREALNQPIEYDAVFYREDQTDI
MQRLQIEFPLALSAMMAKQAPHKLAKVISFRQKPPAENPKQ
>NE0395 DUF167
MSWYSFGNDRSLLILKLYVQPGARQTEAVGICGEELKIKLAALPVDGKAN
RALTEFLAKRFNVPRKNITLKRGEQSRHKVVEVCQSSNGPEVLFSEMRAE
>NE1510 putative membrane-bound lytic murein transglycosylase A transmembrane protein
MRKINNPSANAPSSAKTHIFFPDKMFLQSSYFNRYLLKTQFTALLVSISF
GLTSCKTETSITTRPATPPAKVAAIPTDPQPAQPALDMGNKVEKKSLKAA
SWSMLPGWQHENLIPAWNTFIQSCKSLERRPSWQEVCAQATTLRPTSERA
IRNFVENHFAPYQVTNPDGTTTGLATGYYEPLLKGSRKYSSRFRYPIYSA
PDNLLTIELKDTQSGAGQTSLRGRLQGRKIIPYYTRAEIENNRNLLKGKE
LLWVEDEVELFFLQIQGSGRVVLEDGKTVKIGFADHNGYPYRSIGKILID
RGELPAWKVSMQAIKQWGRQNPAKLKPLLQQNARYIFFRELPPDLTGPVG
ALGVPLTAGRSLAIDPESVPLGAPVYLSTTWPNTAQPLNRLMIAQDTGSA
IKGGVRADFFWGHSPEAQVQAGKMKQQAQMWVLLPKQYRE
>NE0598 Uncharacterized protein family UPF0038
MALIIGLTGGIGSGKTRAADSFRELGIEIIDTDQIAHELTRSAGKAISPI
RIAFGDCFILDDGSLDRSAMRRLVFSDETARHRLESILHPLIYQETLQRL
PLIQSEYGIVVVPLLLEIDGYLKLVDRVLVIDCPEPLQISRTMLRSKLSE
QEVRDVMAVQCSRDKRLAQADDVIVNDSGEQHLQRQVEELHRKYLMLARK
HGL
>NE0254 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1738 Sensory transduction histidine kinases
MVGALAILMVGYSAAEAGGLRDLLEQGGLSAKVDCVAMADMLPGRLQQQE
WDLVLTDDRPDLSAAQVLGLLARLGLDVPVIALVSRQDEQAAAHLIAAGV
QDVLLKSDPVQLMATIRGCQRYIGHLRDLREAQLALERSEARFRAIASNL
PGLVFQFVLESDGSVFFPYVSEGSRSLLGLSPDYLQKKPNSFARLIVATD
VASYDQSLQDSCEHLSAWNWEGRIQAKGDMDVKWISLRATPRRTPRGAVL
WDGIMLNITRNKQIELEIARSRAQLEELSAYSQKAKEQERTRIAREIHDD
IGGTLTAIKCELVPCLDNSSRTPEFYRNKAEAVESLVDMVIDSTRRIALD
LRPGVLDCGIVAAVQWQAREFNQRTGVSCTVACDYEDISLDGDLAVAIFR
IFQEALTNIAKHANASEVRVKLAETGSQVYLEVTDNGCGITNPDMEKVNS
FGIRSMRERCQQLGGQFHIQGGPLAGTEVLIRIPVNGQSTVGRESRLN
>NE1226 conserved hypothetical protein
MRNQILKKATVSIMLLLCINVSVADTSTSVSCAESGSASAYSRSGNSVSY
SSVNCSSNNEQNAMTPAGIAVSKSYSPEPYTSLVLSGAFNTEIKTSSENR
VIISGDSNYVESVEVNSSDGELIIRRPGPGNDNLNVIVESISLQKLKISG
AGSTNIYGDFPDGLSVRKSGAGSINIEGQASTLKLNLSGAGNTTARDFTV
DNVEIDATGAGNIAVCAKKSVAGSLAGAVHFKVYCNPSQRSVNTRGVSRV
SYR
>NE2519 conserved hypothetical protein
MSNEQLADLTQPQRDRLAFVELRVRFIGEIRRQDLVTRFGIQSAAASRDL
ALYKELAPGNIDYDSKGKSYVLGPDFRPVFDFPPERVLSWLTQGFGDGEP
MRLKAWVASESPSRLTHPDLDVLASVTRAIHQECPLGIEYHSISSGRTER
EIVPFALIDNGLRWHVRAFDRKSQEFRDFVITRIKRPVLIRDAEVQSHER
SDQDIQWTRIVELEMVPHPDQPRPEITEMDYGMVRGSLRMKLRAATAGYI
LRQWSVDCSPDHSLRGYEFRLWLKDHLALYGVKNAVLAPGYRSPDQNKGE
Q
>NE2152 hypothetical protein
MLNISILIFAIAALGGVFLASKVLAGKLAPWPVSIVHALLGAAGLVTLIL
VIMEGPENNRLTAALALLVVAALGGFYLASLHAKSAIAPKGVVFIHAGVA
VAGFLTLLSVLL
>NE2569 probable transmembrane protein
MLIPFTALLSGLVFGLGLILSGMTDPAKVLSFLDVAGLWDPSLMFVMLGA
ISIGFFAFRAAKRRGRTLLSTPVHLPGTRTVDLRLILGSLLFGMGWGLVG
ICPGPGLVLAASGHTGGIVFMVAMLLGMFIFDRLEKHRQSDSNVNYRQPG
NIERK
>NE1014 putative transmembrane protein
MQMGKENEKVTSALQNSDMEKASMYQVAEAVLFSFIGIRKKSDLEHDAAK
IKPVQIIIGGLVGGVIFILSILSVVKLVTG
>NE0586 conserved hypothetical protein
MALKNHDFNDLVRLEQLRHIETGQPQALSYAGIASMEAYPASGFSYAAFL
ERLLDRAHHLVHDNHLDEVLQQPEKLFIRASRIILLLAAVLGGLAAVNAA
SESSTLNIYWLLVVLLGFNFLSMLLWCAGILLSVQGLSSGIAAQLACWLP
FQLKKRESDSTGTFAARAWWETCLSGRVGRWRISMLTHQFWLVYLLAGMG
VLILLMLAKQYDFVWGTTLLPENSLPELTRLLGVPMQHIGLAVPDGQQIA
ASRIGAGVQDSVIRSAWAEFLVGALIVYGLLPRLILMLLAFFMLKLSEYR
YKLDLYLPYYVTLRQSLIAKEFVTSVIDRDPGMAKESLEPITRAKHSRRF
PENALVIGVELDSHAIWPEGLVCQENVADQKTFARVSEMLKKSKGALVIG
VAAHRLPDRGVQRMVRELATLVSGQIWLILLQSSPAIPVAESRRQAWYRL
AQTCAIPAEHILS
>NE1099 Sigma factor, ECF subfamily
MSIDDPAHLNAVDTLYSDHHDWLVRWLRSRLGCAHNAADLAQDTFTRILA
SRDATTVCEPRAYLTTIAKSLMVNWYRRQALERAYLDALAHIPETEVPAP
EQHLIILETLHEIDRMLDSLAPKVRQAFLLSQIEGLKYEVIAEQLGVSLG
SVKRYMQQAFRQCLLLM
>NE1365 Helix-turn-helix motif
MTMNANIEVRLKSPAHPGGFIKHEIIEPLALSVSNAAEVLGVSRAALSAL
LNERAHLPPEMALRIEKAFGVSMDTLMRMQNSYDIAQTRKRAEEIKVAPF
SGKPIESNSVV
>NE0520 hypothetical protein
MKTKFALTLAASTLLVSAAHADPFVNGGFETGNFNGWTVSNSAYRASINN
ANLTPDWVFANDNYAMHSQIISAGTIDPNVGAAFGSTVYAGNYSARIEDT
TWGGYASAITQTVTNYTEDSINFVWKAVLLGAHGVNDAATFKLVLTDLTD
GIDLITREYNAASSGSGVDSRFSLSGGNYYTQDWQIETLNINDTLKGHDF
MLSLVAADCQPTGHWGYVYLDGFGSVAGGGGDDTNNVPEPATLAILGLGL
LGMTATRRRKNS
>NE0497 putative NtrP protein
MFRNRSNTMLSERYVRLFRNGKNQAVRIPREFELNAQEVIMRREGNRLII
EPVPPKGLLVVLAELAPLEENFSDIDTRLAPLDDIDL
>NE2112 PIN (PilT N terminus) domain
MYLLDTNVVSELRKPRPHGAVLAWINSVDDASLHLATVTLGEIQAGIELT
REQDPAKAAEIESWLDLVSDSYNVLVMDGPAFRCWAKLTHKKSNTLIEDA
MIASIAKIHGLTVVTRNVSDFSSFGVRIFNPFEFNANA
>NE0518 transposase
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEVV
>NE2137 putative methylamine utilization protein MauG precursor
MYKILVVFLLFYTGLSVAGDKKAMRSEEIMLGELIFKDKNLSLNRNQSCE
TCHSLSPVAKPGLPAVPGFVDPDNVTAGSPVSKGSDIDPETNLPFNGTLN
TPSVGYAAFSPKFKWDKSLNAYSGGFFWNGRARNLAEQAKQPFLNPVEMA
MPNAADVVNRLSENATYRKLFKKIYGISLPLKKQSDRMVEKVFQSMTQAV
AAFERDRSFSKFNSKFDYVLAGTTSFTPVEQLGFDLFKDPQKGNCAVCHP
VESTFDKKGNITVPPLFTNFGYENIGAPRNMKIPGSPEPDLGLGGRADIQ
KRDKSGLQIGKHKVMSLRNVAVTAPYGHNGVFASLAMIVHFYNTRDALPH
DCDNTSPRFPSQCWPLPEVGRNVSNEFGNLNLTAEERTALIAFMNTLTDD
YPAWGNDPLVPPGTLSPYPHLGIPSR
>NE1231 conserved hypothetical protein
MSFANIIGQFLQQGISGQSRSRLDQALGSLELDGAGGKLEQFLGNLLNDN
DNGSGRDSASTSGGLLNLIRGFFGSKQTGKLTGSQLTGIGALAGALLSGG
VKASKGALGGSAMAILGSLALNALKGHLAAGNTSASAADIDRYAMEAIDD
PDTQRLIVRAMIAAAKADGIIDEQENARILGKVGEDGVTEAERQLVDEEL
RRPADMAALVAEVPNQVVAAEVYSASLLAINVDTEKEENYLRELAKLLGL
DAAAVARLHQLTGAPEIS
>NE0078 putative ATP-binding protein
MSVGADRKKTGTSLTGKVVAAYGRHFEVEVAGGTIYSCVVRGKKKGVVCG
DEVEILPATGDQGIIETTLPRTSLFYRSEIFREKLIAANATQLVFVLAVV
PSCNLELLDRCLVAAESQGIRPLILLNKIDLIGQDEQRQAVAHHLMFYRE
LGYPVLEISAKISVQPLIPLLSGQTSLLAGQSGVGKSTLLNALVPRAQQA
TAEISDALDSGRHTTTHVRLFHFDADSSIIDSPGFQEFGLQQLDEASLAR
GFIEFRPFLGQCKFRDCRHIAEPGCKLLLAAQEGMLNSRRIACYHKLVKG
LKKSHPWMETNKRV
>NE2227 CBS domain
MFFEWVVDPAAWAGLATLVILEIVLGIDNLVFIAILTDKLPPHQRGKARI
VGLSLALIMRLILLASISWVITLTTPLLTLFDVELSWRNLILLFGGIFLL
FKGTMELHSRLEGQDGQKEGKGVHAVFWQVIVQIIVLDAVFSLDSMITAV
GMVEHLSVMMIAVIIAIIVMMVSSGPLMIFVSRHPTVVILCLGFLMMIGF
SLVIEGFNFHVPKGYLYAAISFSILIETFNQIARHNKEKLITTGDLRDRT
AEAVLRLLGGRHGEVGLGETSEVIAQQVAENDLFAREEKEMIEGVLTLAG
RPAMSIMTPRTDIDWLDLGDTSEMIRAKIIDSGHSRFLLAHGNVDEFVGA
AFAKDLLRDMLEEGKINLEKSLRHPIVVLERVPVIKLMEQLRNQTLQLAV
IVDEYGSVEGIVTPADILEAIAGEFLDAGEEKVVAEQQADGTWLMDGWIS
IRKASNLLEHDLVDEAERYSTLGGYLLWQFGYIPAAGEQITVDGLIFEIV
SVNKHNIGKVRVHRTQPENE
>NE1255 Bacterial general secretion pathway protein G
MLELLVVLVIMGLLTAVVTPQVMNMFSGAKSDTAALQVETITTALNYYRI
DTGSYPDMEHGLRALWEAPPGVNRWRGPYVRKQQHLVDPWGRSYRYRLPG
KHGPVDVFSLGADDKEGGSGEDTDIGNWDKQ
>NE0514 possible baeR; Response regulators consisting of a CheY-like receiver domain and a HTH DNA-binding domain
MSRILIVEDEPRLARLEADYLQHAGFQTHCLSQGLEVIPWLKENTVELIL
LDLMVPGRDGLDICRDIRSFSQVPVIIVTARIEEIDRLLGLEMGADDYIC
KPFSPREMVARVKAVLRRLQPPAPNSAQQALTLSPQSFKVQYDGREIELT
AVEFQLLYTLYQQPGRIFSRARLMDLIYQDQRIVSDRTIDSHIKKLRKKI
AELIPEKEVICSVYGAGYRYDPQSVEPEE
>NE1784 ABC transporter, fused permease and ATPase domains
MTSNRWQAVLKTAAFLKPYKLSLSFSLAALFLTAGITLSLGQGIRLMVDR
GFATQSPDLLTRYVLVFLLLVVLLAIGTFTRYYWVTWLGERVIADIRRKV
FAHLIELHPGFFEANRSLEIQSRLTADMTLLQSVVGSSISVALRNLIMLA
GGIVWLFITNVKLTVIVMLSIPLVIVPILLFGRRVRVLSRQSQDRVAAVG
AYVGEMLGQIKTVQAYNHQAMDKMHFDQYVEDAFHIARQRTLQRAWLIAV
VIVLVFGAIGMMLWVGGMDVIEGKISGGELAAFVFYSIIIGSAVGSISEV
LGELQRAAGAAERTMELLEEPSQISSPVDPRQLPAVTGNIVFQRVCFAYP
SRPEIQVINDFSLAVRPGETLALVGLSGAGKSTLFDLLLRFYDVRSGCIS
LEGVNIRDLDLSDLRRSFALVSQNPVLFHGSIEQNLRYACPDASDAEIEM
AAKAAFIHDFIVSLPQGYQTSLGDTGLGLSGGQKQRLAITRAILADAPVL
LLDEATSALDSQSEHWVQQAITRLSTGRTTIVIAHRLSTILTSDRIALIH
HGRLEALGTHVELMESSELYARLAIAQFQIPG
>NE0817 putative transmembrane sensor
MNRKHSDIPAHPRDAAAYWFARVHSGSFTQSDQEAFDSWLQADDLNRKEY
QALDKIWQVAGQISAHELRAMLAEEPEPPQQIQKRLKRWRLTISAVAATI
IIVIGFAGLFMPYGWLGEPFYTAEYVTDRGERRTEVLPDGSILEINTQTT
ALVEYFENRRNVRLAGGEIMFSVETDAARPFTVDAGSGTVLVTGTRFNVL
RDQDQVQIAVESGSVEISSGHWWNRRTEQLTDGLGAQIDPDSMTPASPAD
IDTLTAWRQGKIIFKDQPLASVVKEMNRYLPQPITITDHRLDQVRIAGVF
NTGDSAAFLQALHKSIPVQVILHADGSTELSLPR
>NE1082 conserved hypothetical protein
MAVSTSGRRKNRWLFLLLLIVLLTGTGMWWQGDADPAYQTVPVARGDIEA
SVAAIGTLQPLKSVEIGAQVSGQIMRLHVEVGDTVEKGQLLAEIDARVHQ
ATVDAGRAQLAGLRAQLADQRAQHELAKQQYARQQQMEKDDATRLEDVQV
AAAALKSAAARIDQLRAQIQQVSSTLKGNETLLGFTRIYAPIAGVVVGVD
AKEGQTLNATYQIPTVLRVADLTRMTVWTDVSEADIRQIKAGMPVYFTTL
GGDRRRWHSTVRQILPAPPQAVPAQGGGDNNKTPAAGTKAVQYTVLFDVD
NSDGELMPQMTAQVIFMIASASNVLTVPIPALSSVERKDQNDLYQARILG
QDGQVQVRELQLGIRNRLQGEVVAGLKEGEQIIVSEQTESTKPRRFRW
>NE1246 hypothetical protein
MILKSYSTVLFCCFLIAFDSNRYRMHLRFAHSCNSRFQYQLDLFNCFIAD
LTEVIFCVPICDINVDCAQMPGQKHGNKESHMAMTMQYLLTISK
>NE0648 hypothetical protein
MGALAFYTFIYFVGHFAALGLNIITNKKLLRHRWAGLAGVIIVAIMHGYK
IISTTPPSGHDDDTLYALSYFVIFPVVVISAVLFYLSEKDKKDGGSK
>NE0378 Bacterial sugar transferase
MQWDVPVSGSLISQLPGAVTFTFVILASMAALGMYQLDGRQDFEGILLRL
LPSLALGFGFITLIFYLFPDLYFGRGLLAIVMLLALLLILIIRLLFFRWS
GLDSLRPRALVLGAGVNARELLDLIDKTSILPNLKIVGFVPFDKESRHVP
AEMIIPKAGSLASLVDQYDASEVIVATQERRGGTFPIQELLECRISGIKV
TDLVGFFERECGHLRMDSLYPSWLVFGSGFNQGTLRTVVKRMFDIITSLL
LLIVTLPIMLLTALLIVIEDGWPVFYRQERVGDAGKTYMVIKFRSMFNDA
EKAGKPQWATTDDPRSTRVGRIIRKLRIDELPQIINVLKGEMSFVGPRPE
RPHFVDMLSAQVPYYNMRHSIKPGITGWAQVRYPYGASIEDAIEKLQYDL
YYVKNHSLFLDFVILIDTVGVVLLRKGSR
>NE2283 conserved hypothetical protein
MSDLQISLVIIGAVIIVGVVIFNRIQLARYHRKVQNVFRHEHDDVLLNDE
RKPVFGSGRIEPQFGDEAPSIPDAAVGSANTPVDRQIADDQPAKEPGVQR
KTADIDSMINYIADIRASSPIPHEKLIDLLQQKFDYGKPVRWFGLQEGQS
WEEISLDTLSPRNTYIQLKGCLQLADRSGPVTEINLSRFRDMAEDFSAQV
QAEVECPDISEAHARAIDLDKFCADVDVIMGINIISKDGGAFVGTKIRAL
AEASGFRLESDGVFKYRDSETNEVMFSLGNFESAPFLPANMRTLTTHGIT
FLLDVPRVANGERVFDQMAHIARLFSSTLNGILVDDNRVPLSENGVKRSR
QRLADIQSTMAARNIAAGSETALNLFD
>NE1625 Ribonuclease II domain
MVNVFYEEAGALKVGSILTDNTTSLQIESAHGKRSKIKAALVLLRFENPP
MHGFMQQAQEKAADIDVNFLWECCEENREFTGQELAGDYFGHTPDPLETA
AILILLSQNPIYFHKKGVGRFKAAPPEILQAALASREKKRLAEEKQAGYV
QQLMAFELPDTFRPLLDDLLYKPDKNTIEWKALETACDATRLSVPRLLER
CGAIPSTHDYHYNQFLYEHFPEGIDTDAYPADFHLLDPEDLPVAGVKAFS
IDDATTTEIDDAFSVTPLNFGSFRIGIHIATPALGVAPHSLLDKFAAKRL
STVYLPGRKITMLPESVIQHFTLGEAHHRPVLSLYLEVADDFTVTRTFSR
IETIEVSTNLRLDSLERQFNEITLRENQTDYPFAHELRLLWNFSCKMEKL
RGKDQDINSEKIDYSFIIDQDRVSVYERRRGAPIDKVVSELMIFANAEWG
KQLADAGYAGIYRSQGNGKVKMSLSPAPHQGLGVSQYAWSSSPMRRYVDL
INQRQLIALVSGNPPPYTREDDTLLIAMRDFEMAYATYAEFQRGMERYWC
LRWLLQENISTSGAIVIKENMIKLDRLPLIVRIPSLPELAPGTLVAVGVS
QIDLIDRSLNASFLGKSEG
>NE1565 hypothetical protein
MKALFFLRHYNDIDHITPVISKWSESGHESLVVLLGRPKFLKDYRIKFLS
TLDRVRVAPIRRLLSPLKFMQWRLQTLLLNRSVKRLFLIGKLIEKLARKY
DAQKRTAVWQSTAGRLLEHGFSDGNEGGVVVFDWITSDSPVPIEWVEIIV
TMARTMGLGAVSLPHGDSPHASQLIRHHEWVLKPDALYSAARIFDKLVVP
NELCATRFRPFLSNEAIAVLGSPRYCDEWLDKLATLSPAPRLKTNQDTRL
RIVMFLRKSEFTTFWEEVGEIIGMIATFPGVELVIKPHTRGGWRQPLTGS
ASLRQLANVRVAEDSEHSISLMNWADIIIDLATSVVFEAVKAKKPVMAAD
YLHAGRSALAHFMPETELKCRDDVYTMIDRFLTAGYDSYYVEAHRQRFIE
EMLHVGGADVLPRYVALLEEQTMRKKPDQANNTNTPE
>NE2344 possible unsaturated glucuronyl hydrolase
MTSPVLIVEDRLLSTDEIVNTLEQMFRRMEMMDVLCGQNFPLYSSGENSD
WSVSPGGSWMGGFWAACWWLRAKMTGSAGDRQKAGEISQRLFRKLTADSG
YRSLIFWYGAALGEIWLQNAPARELTHSSIAALAHSFDPRLNCIPLGMAM
GGLTTGSCAISVDNFASLIQLLCFSREKQYHRIAQCHAETLLAACRGDKG
AFHAEASFDGHEFQVKDRAGVWSRGQAWAMLGLSRAAAQWGEPYLSQARA
ACTYWRDVHKGNLPRNRPDQTEDVKDPSAVVIASLAMLSLARLLPDETSW
CKYAHQQISTVLHSPYFTVINPDSAFGSAGLFQGCCYRTRQNREEIVESV
WGNFFLVAALAVLAGLIDPYDC
>NE1408 Sensory transduction histidine kinases
MMSLNKRVLLSAALVLLVFIAGITLTLDRAFYDSARIGVKDRLFARLLML
LGDTEVDESGELEIPTNLLDAEFGHVNSDIYAFVTGPANTIVWRSTSSLN
KPIPGITLLEKGKQEFEQITLDEEPYFIYRYSVAWETSSGNYPLIFNVMT
DTALLEAQIERYREDLWGWLGLMAVFLLATQMLVLRWGLLPLRTVSTELA
AIESGRQESLKDRYPKELKLLTDSINSLITHEHKQQKRYRNGLADLAHSL
KTPLAVLQGAIHGGNDEAARLKTIQEQIDRMDNTIQYQLRRAATAGSSPG
MRLIPLRSMADRIINTITRVYRDKRPHITVKIDDSMNLRIDEGDLMELLG
NLIDNAFKWCHHSIHLSAGYQEDQVVIQVRDDGPGISPHEIARILERGVR
ADQSIPGHGIGLAIVRDIMQVYGGELLISNNPDGGLSVTLRFEKSK
>NE0930 putative lipoprotein
MLLSCCMLYLLWCGTVLAAAADEKEKKPDPVDVTITAPASLKKLLEKHMT
LPAKPFRDEIDQAAYARKIRQESIELLATEGYFTPEIEFDEKKGNGKAKA
KAYELRITPGPRTRVAAVTIEFQGALAADEPAYRARAEQLRAEWPLTPGK
VFRSARWEDAKSALLSSIANRDFATARIVESQAKVDPDTAQATLSITIDS
GPAFRYGELQITGLERYKPDVVQNSVPFRPGDPYRREQLLAFQAALQNSQ
LFNAAAVTVKPDPMQHEAVPVIVTLTEAQSKHIGAGLGYSSNNGARGEVN
YRDYNFLGRAWNLSSLLRLEQKRQTFSTRVDTLPDANHFQYSSGVRVERT
DIKNLQTFNQRVDFSRIRTTANSILQVGVNWQREQREPSGAPKTTNETLA
LDLWYRYHDIDDPVNVRRGFVSEVRLGGGSSYVLSDQDFIRSYLRHQHWL
PIGRRDTLYFRAEAGYTLASSRQGIPQEYLFRAGGIQSIRGYDFLSLGVR
EGDAIVGGRVLATGTAEYVHWLTNDWGAAVFTDVGDAADSLKQFDLAIGY
GIGARWRSPAGPFALDLARRHDTGTLRLHFSIAVAF
>NE1168 Prenyltransferase and squalene oxidase repeats
MSISPTFSGSSLQKSSLSDHSTISEPFTVVDRVNGISAVALDDAITRARS
ALLAQQREDGHWCFSLEADCTIPAEYILMMHFMDEIDTALERRIANFLRN
RQVTDGHGGWPLYYGGDFDMSCSVKVYYALKLAGDSPEAAHMVRARNAIL
ERGGAARSNVFTRLLLAMYRQIPWRGVPFVPAEIMLLPRWFPFHLSKVAY
WSRTVMVPLSILCTLKAKAANPRNIHVRELFTVDPEMEKNYFPVRTPLNH
LLLYLERLGSKLEPLIPSFIRRRALKKAEQWTIERLNGRDGLGAIFPAMV
NAYEALTLLGYDHDHPLLQQCRLALRELLVNEGEDITWCQPCVSPVWDTV
LASLALQEDERADNGPVRHALDWLVPLQALDQPGDWRNSRPDLPGGGWAF
QYANPHYPDLDDTAAAAWALCQADTEDYRTSITRAADWLAGMQSSNGGFA
AFDIDNVHYYLNEIPFADHGALLDPPSSDVTARCIGLLALNGEARHQETV
KRGLTFLFNEQEPSGAWFGRWGTNYVYGTWSVLEALKLARVDHDHQAVKR
AVQWLKSVQRADGGWGETNDSYLDSELAGQLETSTSFQTAWAVLGLMAAG
EVGSTAVRNGIDYLIRTQSAAGLWEEPWFTAPGFPKVFYLKYHGYSKYFP
LWALNRYRAMNSRSVV
>NE1729 conserved hypothetical protein
MSTQSILTKPTEATAFQPDQVVRILETVLLTTPEPLSISDLKKLFEGNID
KKTLHESLTILSEKWHDSGINLVSVAGGWRFQSKPEMQVFLDRLNPQRPP
RYSRAVMETLAIIAYRQPVTRGDIEEIRGVAVSTQIIKTLESRGWIETIG
QRDIPGKPYLYATTRHFLDDLNLQSLEQLPSLDSFNSLDLSADEAESVDT
DTTTIQESGEPASHEPTQLL
>NE1655 Aminotransferases class-I
MSLLEKLDAAAAARKAQLPEGIGAFGIPIEESYSATEARIGKRRVLMLGT
NNYLGLSFAPECREAAHQAIDQEGTGTTGSRMANGNYYGHRALEREFAEF
YKYRECIVFTTGYQANLGTISGLVGAGDIVLIDGDAHASIYDGCILSGAD
IIRFRHNDPADLEKRLRRLGDRSRNTLIIIEGIYSMLGDQAPLAEIVQIK
NTYQSTLLLDEAHSLGVLGETGQGLVEKTGMNDEIDFITGTFSKSLCGIG
GFCVSNHPQLDQLRYVSHPYIFTASPSPATIASTRAALKLLQEGRLLRER
LWQNAHRLYSSLEKSGYRLGPQPGPIVAILLDNPRQALTLWNGLMEHDIY
VNLVLPPAAPEGKSLVRCSINAAHTTEQIDQVCDVFSKLHPIVA
>NE0845 DUF196
MLIIVTYDVSTETRAGRKRLRRVAKLCESIGQRVQKSVFECRINLMQYEE
LERRLLSEIDEQEDNLRLYRLTEPAELHVKEYGNFKAIDFEGPLTI
>NE0243 hypothetical protein
MATAKTKIKDKKQPLALQLVESGTADSARKPPDTENTGADVFFRRLVHHN
DAEIAGSMATEQATHQADQQFGAELTKLRLDSEDIDVAFKVSLKFLSDLE
VKLANTRRYIKSGTLHSIGGKSVENVGWTDWRRKDQILLCVLVFCLTIAA
GLGMGNVYANLVSSGNAVFIEKPWLATMISALMPIASVSVKYVTNFMIYD
SSRRLYAKCIYAATGMAFLFWGGLFGLTYSGVASSIDWDSFGESTDYGFA
FVWSQLLVELLMASALFLAAEDIYMRYSPDVYIENLEYLELEKALKEQRT
VHEALREKRGELHGRLVELEARREAFINDKVMEFVSLRARHVATMNAHTD
H
>NE1528 putative 3-hydroxyacyl-CoA dehydrogenase oxidoreductase protein
MSNPFIVRKAAVLGAGVMGAQIAAHLVNAGIETLLFELPAESGNPDANVL
KAIEKLKKQEPAPLSVIDRANCIEPATYDQHLEKLRDCDLVIEAIAERLE
LKSELYEKIATYLHDQAVLASNTSGLSINQLAAVIPEALQPRFCGIHFFN
PPRYMYLVELIPSAQSDAAVLDALESFLVTSLGKGIIRAKDTPNFIANRI
GVFSMLATMHHARQFGLAFDLVDKLTGVLIGRPKSATFRTADLVGLDVLA
HVVQTMQNGLAEDPWHIYYIIPDWLQKLVDHGALGQKSGKGVYQKQGKDI
HVLNPATDTYEPAQNEVDDEIEALLKQKDPVARFTALHDHSHPQAQFLWA
IHRDLFHYCAIHLAAIADNARDLDLAIRWGFGWERGPFEIWQAAGWNTVA
GWIEADIAAGKAMAEIPLPAWVQTVGQSTIQGVHTQEGSYAPITESWQPR
SQLPVYRRQLFPDRLVGEEIVYGETVFETDALRMWHTGDQVAIVSFKTKK
HTIDNLVLQGMQQAITEAEHHFRALVIWQTEPPFSLGANLKKATERPKTE
APPAPPAPPSAFEKFIKQIRGATQSAILQAARSLDVADMLMAGKLAEVEV
MIAHFQQVSQHLRYSLIPTVAAVDGLALGGGCEFVMHSDRAVTTMESYIG
LVEAGVGLLPAGGGCKEFALRAARNAVDGDPFPYLKHYFQTVAMAQLAKS
AEQAKEMYYLKPADVIIMNRLELLYVAKAQALALAEAGYRPPLRPREIVV
AGATGIATIKSTLVNMLEGGFISEHDYLIGSKIAHVMCGGDIVAGSRVDE
EWLLKLERTAFIELLATEKTQDRIAYTLKTGKPLRN
>NE0397 6-phosphogluconate dehydrogenase
MEFGMVGLGRMGGNMARRLARQSRKIAVMNRSFDVAEALVKETGHIACRD
YTELVAVLEKPRIIWLMLPAGDVTETALSTLLPLLSPGDLIVDGANAHYQ
DDAPRAARCAERGIEFVDAGVSGGIWGLENGYCIMFGGSDTAAARLKPYL
EVLAPTPTTGWLHTGPVGSGHYVKMIHNGIEYGMMQAFAEGFALMKDKSG
FNLDLAEIAELWRHGSVVRSWLLDLSADFLAEDQMFDDIAPFVADSGEGR
WTALASIEQGIPAPVIMLSLMMRFTTQGRNDYAAKMLAKMRHGFGGHAIK
KGEQ
>NE0904 possible cytochrome c
MKFIVFILLAIGSGLVSAQEPVLSSVPDTLEQRVKPCTICHGDEDKAGRD
AYYPRIAGKPAGYLFNQLRNFRDGRRYYQAMAILLENLSDEYLLEIAHYF
SSLKYPYPEPAANTLSPEEIQAVETLVHSGDPERDIPACGACHGKALMGV
EPFIPSLLGLPHAYVAAQFGGWRNGGIMRGQTPDCMSEIAKKLSQEEVNA
LTVWLPAHPVTGEPAEAETLPPELAQRCQTILSGEGFTR
>NE0784 putative (AJ245540) NrfJ [Wolinella succinogenes]
MFRTVSAVILAILLMSFNLSAARAEGASGVETMPANEGVVVSSIDAAGYT
YMELANGGKKFWIAAPTTKVSNGEHIRFVESMRMHNFTSKTLNRTFSELI
FVTSTQAKVEK
>NE0921 conserved hypothetical protein
MSYERFTVLKVPFPFTDRTAAKNRPALVLSDAATFNDPIGHSVLAMITSA
ANPAWPLDCLIDDLVSAGLPAPSVVRFKLFTLDHRLIRGELGRLAVSDSI
QVTRSLYQLFGMAAVR
>NE1522 Transposase IS4 family
MDAHGMPVRILVTQGTTADCTQAGRLIEGIDADHLLADRGYDSNAIVEQA
EKQGMEAVIPPKKNRKIQRPYDKELYKLRHLVENAFLHLKRWRGIATRYA
KNTSSFLAVVQIRCIALWADIL
>NE2243 hypothetical protein
MAPASFSQITGQSVGKIGVLFLLLMVLWSANLCNSSDITPLSANRANVGQ
MVQAFTFDDLFKEDGPSSDLPQRVFRQTSSWRGFSQLEFAETIASPKHAS
KLRLRSELSNLGQLSPNVKWKLSARIDYDAIYDLSDFYSRQVRRDQRFEL
FLRENYLDFSIADFDVRVGRQHIVWGEMVGLFFADVVSAKDMREFVLPDF
DILRIPQWAVRTEYSKNDFHADLIWIPFASLDEIGRPGADFYPFKLPVAA
PVSFLKEDRSGRNVAHSNYGIRLSQLTNGWDVSAFYYHSLDATPTFHRIS
QPWEPLLFQARHGEIDQAGGTVTKDLGSAVLKGEFVYTHGRRFNVTRPTA
ADGLVRQDTIDYALGLDFTLPSDIRLNLQFFQRAYLNYDRDIFQDRLENG
GSIFLQGDLWRDFQGQILLIHSFNRNEWMLRPRLTWNFARNWKLAAGADI
FNGPPTGLFGRFDSSDRVYTELRFSF
>NE2224 hypothetical protein
MKRNALIHSLQTHISDLLAIYAFGSRIQGTARLDSDLDLAVLVAGYTDPL
ILFEVANELADVAGYAVDLLDLRAASTVMQYQIITTGKRWWTLDMQAALF
EAFILSEKTALDVARAGLLADIRQRGTVYGR
>NE1847 Guanylate cyclase
MKSFSSAPMIAVDPPSGTSSNKIESHSPGHGRPIIVTSIEHNVWQELEQE
IVCSLKRLQCPRLLDSTVLLLHKLITNIVRTMHRHAFQRAIEDDLNIDIA
SKDNVFDELLQNELITHGDRNIEQFCRKNHFQVILTFPEEKGTLIRIEYP
ENCDNSAYLDSKLAKAIGFTLLKQIDGTTAPGYHSAKVVRDIARHHPHLE
QAEHSELLLSAEESDKTQAIFCKLNYGIIVFSSAGDILSISPAILNNLKL
EVSPTSVHILTGIIPGHFYNDVLWGLALESQGGVFENYRIRVRLPHDNSL
SLLFNISGYRHHDMTIHTLWQVVSLDHKATHTLAEGSILNEARVHNITRN
YVPQLVEQKAREIVRLGGNALPNEECRLAVLFCDIAGFTSYVENNEKEES
VIHTLNLILGRITKAVRQHEGIIDKFMGDCVMALFREPHKAILAALEIHK
HSYDFNNLRMRAGKDLLLLRIGIHWGKVVIGNVGSSDRLDWTTIGDVVNT
AARLEKNCCPGAILISEALYETTAHMDHPEIRFSKPFHLKLKGKRNKQAV
RYVQTASQPLLHDIFTTHHAKD
>NE2154 possible (AF124349) unknown [Zymomonas mobilis]
MHPSRYNRTGTSLFLIFSLLAATSASAEETGLINTLTGYNINEITPAPSI
KLKFRGWVEAGFTGNPGDPHNRSNFPVAFNDGANQFNLHQVYAYIEKEID
PGRNSWDIGMRADLLYGTDAKFVKTSSFDSTILGDNPKHQLVFPQLYVNL
YAPIGNGVSMSIGHFYTIIGYESPMSPNNFFFSHAYTMRYAEPFTHMGIM
LSYPVNDNLTIKSGVVTRWDAFSRHSPDYLGGLNYITDDRKTMLSASLIT
GDVKTGPLNHDHNRTMYSIELERSITDKLHYVVQHDFGIEAGTPNSSSAT
WYGINQYLLYDISNQLGAGLRFEWFHDQNGTRVMGDGNDEDFIGVTAGLN
YKPIAGITLRSEVRYDLAVHHDIFRDGTDNDQILLSGSAILHF
>NE2384 ATPase component Uncharacterized ABC-type transport system
MADDALIEFSKVNFSYGLRPILKGVDMRMSRGQVIAIMGGSGSGKTTLLR
LIGGEVRPSAGSVTVAGQVVHELGRNELFRLRRKMGMLFQFGALFTDLSV
FDNVAFQMREHTNLPESMIRDLVLMKLHAVGLRGAYHLMPAQLSGGMARR
VALARSIALDPMLMMFDEPFTGLDPISLSVIGGLIRRLTDTLGMTSILVT
HDVQESLRIVDYIYFLADGVIAAHGTPDEVRESKVPFVHQFIHGETDGPV
PFHYQAQAYSQDLGIGVMER
>NE0543 putative (L31491) ORF2; putative [Plasmid pTOM9]
MYNANIPSTHEIPSTGRLIRSTVIALLTAIFLLVTVVMPAEYGIDPTGFG
EITGLKRMGEIKVSLTEEATADRANAAASLQIELSEAVTAVTESLPLAPK
SEMSHEKKITLAPNQGTEIKVTMTKGSKVHYVWRTNGGTAVFDQHGDSKE
LKINYHSYSKGTGQMREGVLEAAFDGDHGWFWRNRTSTPMTITLKTEGEY
TRIRHFK
>NE2216 FAD-dependent pyridine nucleotide-disulphide oxidoreductase
MAFLPLYIPKGRRLKIVIVGGGYAGIAALVTLHRYSPDAEITLIDPGIDH
LKITHLHETFRYPLSDFLVPYAFLEQRFGCRHIRSALAVDESMLCQWQRD
RYLQVDDQQVPFDYLLVASGARSFTRNETDERTAADAENIVRLENFFTVS
GADLLDRFLATQDRAGNTAPSCISVVGGGATGIQFLFELAGYLRRRRINH
GLQLVDDNERVLQRYPGGFARYVEMRMLELDIEFYPGVCFLEQRADTVLL
EDKTSGRHFELPSVLSLVFTGNRQDNLIKTNAFGQVMVDGKPLGNIFAAG
DCSVYPSLGSNTMTAQSAVRKGKLAARNILRHSGVLKVLEPYLHHDLGYV
VSLGPEDAVGWLAVENNVVTGVPALAIKEIVEAQYDLLLSGVDTYLV
>NE0071 Deoxynucleoside kinase
MNLERCRYIVVEGPIGAGKTSLARNMATRLNYSLMLEQPEANPFLEKFYG
DMSRHTLSTQLFFLIQRMQQLQSMERENVFSRSVISDFLFEKDRLFADVT
LSEPEHGLYRQIREHMPLNAPRPDLVIYLQAAPEILIRRIRQRGNALEQR
ISEDYLRRLTERYMRFFYEYDQAPVMIVNSEHIDLAHNLADLDMLLARID
QMRSAREYFNVGAS
>NE1232 conserved hypothetical protein
MNMTIKNRVLKAIHNFLILLLRIERRLEPWFRPQWDYLFREPGSRLIQFL
INRRRKNKDSDLELAEERFDPDEEESLNKIIDLMMDQMRGRFKPGGYERG
GNTKTHGIVRATITIRDDLPEHCRKGIFANPRSYPAYIRYSGPGPNVPAD
INDVGFMSMAMKIMGVPGEKLMSEEKLTQDFIATSGGATFVTPNTRENAK
LQYWSLVDMTLYYFLNPKDSHLLDFFMQSLWTATQYNPLGQRYWSCTPYL
LGEGQAMMYSFVPKTKEVERHIPGLPFGTPPFNYLRENMIKTLNEKDVEF
DLMIQVQTDPHLMPIEDSSVRWPEKLSSFIPAATIHIPRQKFDSDAQFEF
AKRLKMNPWHCLPEHRPLGNINRARFRMYYELSRFRQEMNETTHLEPTGD
EVFD
>NE1517 6-pyruvoyl tetrahydropterin synthase
MLITRKFEFDAGHRISTHLSQCRNLHGHRYVLEVTIAGGIIADAGVPEQG
MVMDFSDVKRVIREALVDRWDHAFLVYAGDTRVLEFLQSLENHKTVVLEV
QPTAENLALIAFDILRHTFRNRYAGRLELEHVRLFETPNCWADVDKAVFK
SATVDRIAES
>NE1309 hypothetical protein
MMPKSHPSKNNRSTSAHSSNNRNENNVSTRVSAEIQSATFLGPIPHPAIL
EGYEKIVPGAAERILIMAESSMKHQQQYDNALLEASKNRQHEVRFLVF
>NE0581 possible predicted diverged CheY-domain
MTVLIVGGDYIASLKQRITAHGYSRIEHWNGRKKGFNKRALPGRTKLVVI
IYDYVSHNLANSVKDQASRIGIPMIFCRHAMHEIDTIFDEKKAEESCCNF
V
>NE1700 Diguanylate cyclase/phosphodiesterase domain 2 (EAL)
MRVLYIDDNFSDADIVRRTLARTTPEIDLTTVASLAECLVCLEKSVHYDV
LLTDLDLPDGSGLEVLRYIREWRLPLAVVIMTGPGEHDRGGAAFKAGADG
YLVREGDYLERLPRTLHSALVRFRTNGTERNSPLSRVLYVGHSRSDIDLM
RHHLAQHAPHLHLTAMTSAQDALALLPVNPDQTADFDIVLVDYCLSGGMD
GLEFIRLLREERKLDIPIVLITRQGSEMIAARALHLGVEDYLSRHEGYLF
EVPATLEKARWLAELTRERINLKQTNLRLSHLLAASPMVLYNLRFTGNVL
QPVWVSENIERLIGYTPGEALAPGWWLSHLYPDDREQVLERQPILLSDGQ
LTHDYRFYHRDGRIIWILDELRLIRNADGQPHEVVGTWLDISEHKQAELI
RQAHQSALNLIVASQPLPVILSDIAKHLETINPDMLVSILLLDKQAKCLK
LGAAPSLPDDFNATVDQLAIGEGVGSCGTAAWRGEAVIVSDIDHHPYWQP
YLEFTQKADLHACWSVPFKDENESVLGTFAIYHRTVREPTSADLVLVEEF
ASFTALAVQKVYAAEALRLAATVFESIREGIVVTDLEPRIVAVNRAYTEI
TGYSAAQVVGKNPKIIRSGRHDKSFYQAMWSSLQEEGYWSGEIWNRRKNG
EIYPQWLTISAVSISSASNGRKELCNYVGVFTDISQIKQSEAQLAHLAHY
DPLTGLPNRLLVQSRLHHAVERAQRYNLRVATLYIDLDRFKNVNDSLGHP
IGDELLIKLTERLKNRLREEDALARLGGDEFLLVMEDVKDPSESAIVAQT
LIDLLATPFVLPSGHEIFINASIGISLFPDDASNATELIQHADMAMYQAK
KEGRNTYRYHTEALSIAANERLIMENRLRHALAAGEFILHYQPLIDARAG
HVIGVEALARWQPPDNTIVPPGKFIPIAEETGLIVPLGEWVLRTACEQGR
AWIDTGLPPLVMAVNLSVRQFQSENLVELVQRILEETRLPAVCLELELTE
SMFMEHAERSIETLNRLKALGIQLAIDDFGTGYSSLTYLKRFPIDKLKID
QSFVRGLAHDPNDREIAATIIAMARGLKLGVLAEGIESEQQLDFLRQQGC
DYYQGFLFHRPVPAIELEAWLREHSAPR
>NE0363 hypothetical protein
MENTAEKFEEEILDACIRHAKEVLAEQLPLVKDKKYDFAPQFRDLTIQLY
LVGVMQQFYDQYEATTTDAQEKAFHALYYMMTKDGVKSRRAKNQAAFIRQ
MSRLDDGDEALALALGYESKPGDRSLAEVFDHYVNESRVSKGLWRFYDQG
KKILLLGGLLFAMAGIWFVTIYLPESDNITILAVGLLAALFFIVPVFLVG
LLIHRYKTRKGSRTPTPPQ
>NE1164 probable transmembrane protein
MQQDHNKISSARIPAIAYFLFLSRWLQLPLYLGLVLAQCVYVYHFWVELV
DLIGSVFGNQSALQHTLEMVTVKGAESSGKLTETTIMLVVLGLIDVVMIS
NLLIMVIIGGYETFVSRMNLEGHPDQPEWLSHVNASVLKVKLATAIIGIS
SIHLLKTFINATAYDEKTLIAQTVIHLAFLLSALAIAYCDRIISRTIHHA
DEHE
>NE0804 DAG-kinase catalytic domain (presumed)
MDLIPARIGLIYNPLGGWFRKHTARMQSLLATLPEIRQIQATDQIEFERA
VTVVVEAKIGWLIVVGGDGTLQGVMSCLFECLPPDRWPEITIVPAGTTNM
TALDLGMNGQAEQILSRIRQHLQRPGDMKQVRRPVLRIEQTGMRNVYGMM
LGLGLIARGVKFSRSQIKQLGMTGNIFTVVIVLRSLIGMFLGRPQAEWAP
VRVAQIDETGVLSEKVYLFALISALEKLLLGIRPYWGQEPAPLHATFIGQ
HSRRFWRAIWPLIAGRGHHLQKEDGYTSYNTASIELWLDDEYIVDGELYY
ASSRNGPLKITADGPILFRIL
>NE0242 hypothetical protein
MKYDEKLIVFIVTIGVLNNPVYKKRLWLAARNMNISRREWPNVIADAIYN
FDGIILLSNGIRWPIPDVDKVLGDPRWFSYYFEEDEKGDPHRDVIMLERL
RLIDLFFKIKHPEIARHFSK
>NE0367 hypothetical protein
MSADNLLLDFTSPGAIPPDPAEVVRRVVDETQTTMRALESLLENERIEDM
TGWRLLAMFYLATDRLNDLAKIEKQYKSITGVSLSADLKQKYPQWFNGEA
VSHPVVFEIPKKITAAALPDSIIIQRGQCSPGGILLDFSQVQEIDNDGLK
KLAQLFSSLAQENTRPKLRQADRFITCLQNKAETGTGTRAIWDVLFAYER
FRDDREAFEEKAIKFAVLYGISPPSWE
>NE0054 ABC transporter transmembrane region:ABC transporter
MPDKQPAPPIELLTRFVTRSGSHWLTLISILIAGGLFALAITLIPLVVGQ
LFTQILPGGNRQLMQSLPLILMILLLAAIFADWVVYYVLERLLGRAILDI
RTELFKKLLALPPACSDFPAETISLYFFQSIEKLGHNVSLLGVCLSRDLL
TAAGLLGVMAWLNPEMSLLVLAMLATIFFIGQIFRANARQQDMLGQKQYE
VSRCLSKALRLNRIIHLDKGYKQEIRHTRNSFEQLQSFLQKQFRQTKLME
LLAYVLLIGVLTASLYYLLQQLASNQLTAGDAAAFFMAGAMLIFPLQRLF
SINLLLKQCSEALQVIFPLLNQDSRIVEENPYTTQFRRGKGKLRFEGVSF
RGGATECQLPHFNLEIASGQKIALINRDANINRLFADLVCGFVQPSTGRI
LLDDNDTKRINPAELCSYIAWIAADEDLLGDTVAINIAYGSACCSREIAI
TTAAHASQAMEFIRKLPQGLQTKINQPALIFSDDQRQRILIARALLKNPS
IIILDETTACFNTDNTALLQALQVLLNNRTVLILSSRPVMLNLAGQQFDP
EKPALISACHSDGCTVR
>NE0708 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE2506 conserved hypothetical protein
MARGRNSALMDSCQLVRTETKMEVAHLNQKQLAARWSISEATLERWRSAG
IGPKFLKLCGRVLYRQADIEAYEESCLATSTKTVVAQVSVS
>NE1276 conserved hypothetical protein
MNLPFQNDALGKLTLRLTVGVLILLHGVHKIFNPGSLDYISTLLANVNLP
QILAYGVYLGEVIAPLMIILGIFSRIGGLLVFGNTVFAIGLAHRSELFAF
TDHGGYALELQAFFLLTGLAVFFLGSGRFAIKPD
>NE0805 hypothetical protein
MGTADLPNILDLLIQEIAIQDARPVHPAFQVFTDALRQRFGEALDAVVLY
GSCLHTSDLTEGIADFYVLVSDYRLAYSGRLLAGLNAWLPPNVFYLEVPA
AAGVMRAKYAVISTADFERGARQWFHPYIWARFAQPARLLYARDDQTGKR
VHTAQASAVLKFISTTLPVLESGPSDLEMIWASGLMLTYAAELRAEREAR
ARHLVRIDPEIYSRLTAAAMPALIPLLSLQADGRYHIGPITPLKRLSARI
HWRLRRWQGRVLSVLRLSKATMTFRDCLDYAAWKIERHTGIKVEITPMLR
RHPILWGYKVMWQLLRRGVLR
>NE0185 Glycosyl transferase, family 2
MHISFIIPAFNEEQLIEPCLRSITDAVAANRSYGYTSEIIVVDNNSTDAT
AQLAKQAGAQVVFEPVNHIARARTAGAGAARGDWLVFMDADCLLNAGLVG
DIFELIRQGRHIGAGSTLYMPGQPWWAEMLLRIWTFLSVQLGWAAGALIV
CNAAAFREVGGFDLTLYAAEEIDLSQKLKKYGRKRKLKFAILNAHPLETS
SRKTQLYSGWEIVGQFLRLVLSPLGTLRNKKKLPMWYDGRR
>NE0431 Cytochrome oxidase assembly
MIFMQTSSYHSVTSVIFDKNTQNHTSIAIWLLICCALVFAMVVVGGVTRL
TGSGLSIVEWKPIVGTIPPISQGDWEILLEKYRQIPQYEQINKGMTLDEF
KGIFWWEYFHRLLGRLIGLVYFVPFVYFMVRKRVDRVLGTKLLGIFVLGG
LQGLMGWYMVMSGLADNVYVSQYRLTAHLGLAFIIYAAMFWVATGLLSPV
NTHSADPASVRKLGRFARILTGLIFFMVLSGGLVAGIHAGKAYNTFPLMD
GFFIPPALFVLEPWYRNFFDNITTVQFDHRMIAWLLMFTIPLFWFKARQL
SLSWSGRLACHLLLIMLCTQVALGITTLLLSVPLTFAAAHQAGAVLLFTA
ALWVCRKLS
>NE2110 Bacterial regulatory proteins, AsnC family
MQEPRILITQELLKLIAEIDEFKGKWEVLKNLSPERLRQLRKVATIESIG
SSTRIEGAKLTDMQVETLLSNLSSMSFKTRDEQEVAGYAEAMDIVFQAYE
DMTITENHIRQLHQTLLRHSNKDERHRGEYKKIDNHVVAIDEHGKEIGVV
FETATPFDTPRKMEELVRWVNKAITENSFHPLLIVAVFVVVFLAIHPFQD
GNGRLSRILTTLMLLRAGYSYVPYASLESVVEDNKDLYYKALRRTQTTLK
TDSPDWEPWLGFFLRCLKKQKANLAAKIEKEKAADDTILPALAVQILELL
KKHERLSIAEMVEHTGANRNTLKVRLRELVSTGRIQRHGKARATWYVLNR
K
>NE1308 Fimbrial protein pilin:Bacterial general secretion pathway protein H
MQHTQKGFTLIELMIVVAIIGILAAVAIPAYSDYQAKSKVTAGLAEISAG
KTAFEERVNNGDTVGTDPTVIGLKTPTSNCTIVTSGTTIECTLTNAPSQV
SGKKITLTRTDGTWSCATDADNKYAPKSCPGVTPTP
>NE2010 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR
>NE0834 DEAD/DEAH box helicase:HD domain
MRQNIGKLVSRKQPRRSEKVNAENNTTYLAHVRQLPNGRWIEHFLEEHLL
AVAVLAAEFASVFNSQDWARLSGLWHDIGKFREKFQKYIKSVSGYDAEAH
IEGAPGRVDHSTAGAIHAIEELGPPGRIIAYLIAGHHAGLPDWNGEPASL
FQRIEDGKQKGYRQEALQNAPTTGLFNQPCPTSSPPQDGSFALWIRMLFS
CLVDADFLDTEAFMDERRKDLRAGYPALNELLSAFDQYMNDKTANATDSP
VNRIRTEVLRQCCEKATLPPGLFSLTVPTGGGKTLSSTAFALNHAMHHGK
QRVIYVIPYTSILEQTAEIFRKIFGDENVIEHHSNLDPDKEDSRSRLATE
NWDAPIIVTTNVQFFESLFAARTSRCRKLHNIVNSVVVLDEAQLLPPEFL
APILHVMQDLSQNYKVSFVLSTATQPAFSPRPKFSGLRGVQELMDDPDGL
YADLKRVEAELPRDFNAPRTWESIAEELQQYDSVLCIVNSRTDCRALHAL
MPRDTIHLSALMCGQHRSEVIADIKQRLKDGIPTRVISTQLVEAGVDIDF
PVVYRALAGLDAVAQAAGRCNREGMLPGMGKVVVFVPPKPAVPGLLRKAQ
QSGQEIMRLTEGDPLTRERFEAYFRHYYASLNSLDEENIIGLLDMHNRVE
ARRAEFSFRTAADKFQLIKGSSQKTENKAR
>NE2174 hypothetical protein
MNGIDWLLDTNFILGLLKSNPETLSMISNQQIDTRRCGYSAITRMELLGF
PGLTAEEEILISGKLACLQYLPLTKEIEDMVIGLRRSHRVKLPDAIIAAS
ALTCNAQTDP
>NE2378 ABC 2 transport system integral membrane protein
MTGFYTLFHKELLRFWKVSLQTILAPVLSSLLYLLIFSHVLAARVEVYGS
IPYTVFLIPGLVIMAVLQNAFANSSSSLIQSKITGNLVFILLSPLSYLEI
FLAYVSASTLRGLMVGLGVYAVAAWFYPLPLQSLFWVVLFALLGSALLGV
MGVIAGIISDKFDQLAAFQNFIILPLTFLSGVFYSIHSLPPVWQFLSHLN
PFFYMIDGFRYGFFGVSDISPYISMIVVSCCLAAISGWALWMLKSGYKIR
S
>NE1793 hypothetical protein
MEEWSRQSINFTVRLGEMSVLAWPLNACVLKTHFTKLPIHPTVPAELPKL
FRDSVDVVVTRSHPIESSLAKLSILPQAIQYIPSSYRRYWVVLDGNFEDY
LKKFSAKSRNTLLRKIKRFAELSGGEIDWREYCKPEEMHEFYKLAMEVSQ
KTYQERLLDCGLPSDQQFQENMLALAAEDNVRGYLLFYQKKPIAYIYCPV
HDGIALYEYVGHDPEYQRWSPGTLLQYFALQRMFTASHIKIFDFTEGEGA
HKAFFATNNQYCADVYYFRRTWLNLIKVALHASSDKLSDGIVRMLDKFGV
KAAIKKLFRSKA
>NE2521 conserved hypothetical protein
MTSVLSPNTQAILLLTAPLIAGRGTASSDLLSPGEYKRLARHLREIQRQP
ADLLSPDAAEILRACQPVIDEGRLQKLLGRGFLLSQVIERWQARAIWVVS
RADAEYPRRLKARLREDAPAVLYGCGDMALLETGGLAVVGSRHVDDALID
YTMTVGRLAARAGRTLVSGGAKGIDQAAMRGALEAGGKVCGVLSDSLEKT
TMNREHRNLLLDGQLVLISPYDPSAGFNVGHAMQRNKLIYALADTSLVVS
SDLNKGGTWAGAVEQLDKLKFVPVFIRSTGESSAGLDGLRKKGALAWPNP
QDVDSFKDVFNVAMPTPTASPQVGFALFSNEEPTSVDAKPTVPVPPDTAP
APQAESEPSAPVDVVSDAQPPAPALEEQPSVTPEAIPPIDDAMESAQPES
SPAEVLFAAVRAAIQQLLSAPMKDADVAAALDVSNAQAKAWLQRLVDEGV
LEKQKKPAGYIVKQKRLFE
>NE2161 Esterase/lipase/thioesterase family active site
MNVQEQAVRFCCHHDWLYGVLHLPQQPVTRGVLIVVGGPQYRVGSHRQFV
LLARYLAERGIAVMRFDFRGMGDSDGEIRTFEHVGEDLRSAADFFFSECP
FLEDIVIWGLCDAASAALFHAHQDSRVSGLVLLNPWVRTEQGIAKAYLKH
YYLERLFDPEFWKKLLGGKFNPLASIRSLYEFGRNSLRGGKSPAVSEKSA
GSACDLTVPLPERMLDGLKRFQGKILIITSGNDLTAREFLDLVDSSADWQ
ATLRTKQTELCHMESANHTFSTREWRDQVTELTANRVLSW
>NE0555 possible transmembrane sensor
MEQDQQAGHPSEENALTEQAAAWFLRMQQSDTNDVEQKAFEAWLAENEAH
RTEYQQYVQLWQTLDHLEQKPRKKSRSTVTWIVTLAILFSSLHWLTRHEE
MITTAIGEHQQIILADGTTIDINTDSTLRLALYGFTRKVTLERGEALFRI
GDERLRSFEVHAGNGILRDIGTEFNVIKEEGNVTVAVLEGAVEVGIDHQN
DTVRLLHGGEQLTYSAYDLSEISSTDKETVTAWRKSRLIFRETPLEEVIR
QINRYHSRPVRLGDPQLNTLRVSGEFNSADRAGLVQALITLLPLRASELD
SVTLLTYEK
>NE0063 ATPase component ABC-type (unclassified) transport system
MSELRAENLKKSYQSRTVVTDVSFSVRSGEVVGLLGPNGAGKTTCFYMVV
GLVPLNGGEIFLDEHNLSRLPIHQRARLGLSYLPQEASVFRRLSVEENVL
AVLELQQLQKDEIQRYLDELLHDLHISHLRESSGMSLSGGERRRVEIARA
LASRPRFILLDEPFAGVDPIAVMDIQRVISFLKSRGIGVLITDHNVRETL
RICDRAYIISGGTVLANGAPAEIITDERVREVYLGENFRL
>NE1381 hypothetical protein
MLDSNKGTSKVIVAPKIFDINKNENRGNLTIFLGKLRKYFTHNGSHEITI
DFTQTEKFIAAGTLLFYSELAYLKQFINNETRLRYIPPKNPKAFEVLIQI
ELYKLCGIRKPKSKNANKYDDVLNWKVACGNVVNNEQCAPTIEAYEGQLA
EPLIDGIFKGLAEAMTNTVHHAYAEIREDGLNHKPSKNNWWMFSQARDGE
LTVVFCDLGIGIPRSLPKKHPSIFHKMLSLGKISDHQCIASSVELNATST
KMPGRGKGLGNIIEIASKNKAGGVIIYSNKGMYRLGPDATEPFSRDLKNS
ILGTIICWNVTLSKVGL
>NE0004 conserved hypothetical protein
MSIEPDGTVNRPEENIHLLQLHDSPVMRWLYLSIGMTALFMGILGIFLPI
LPTTPFILLAAGCFARSSERFHSYLLNHRIAGPIIREWCEYRSVARHVKR
WAYLVMALSFGSSILIVSSWWLKGMLALLAMILFTFIWRLPVRDQSR
>NE1291 GTP1/OBG family
MKFIDEVKIQISAGDGGNGVASFRREKFIPRGGPDGGDGGHGGSIYALAD
HNLNTLIDYRFTPVFRAKRGENGRGSDCYGKGAEDIVLRMPVGTIITNDL
TGELVADLEHDQQKVLLAKGGRGGLGNLHFKSSTNRAPRQFTHGEAGEQF
ELRLELRVLADVGLLGLPNAGKSTLIRAVSAARPKVADYPFTTLYPNLGV
VRVDAGHSFVMADIPGLIEGAAEGAGLGHRFLKHLGRTRLLLHVIDVAPF
DENVDIVHSARALVDELRKFDETLYRKPRWLVFNKVDMLPEDEQQAVCTR
LLQAMNWQERWFAISALTGRGCQALIYAIMGHLQQLQSDSEET
>NE0476 Protein of unknown function DUF79
MIYSISIRQSAVKSLEKIPGPDRLRIIKAIDLLKEHPGAGSILKGEFSGL
RRIRVGMYRVVYEIQDNLLTILVVRINHRRDIYR
>NE1641 hypothetical protein
MNKPSSARLIILAFLVSMLTNTFGWSFNGKVFTHELAHHHYRELFLMYPD
AHLELHHALDDSVDLDAATHLCLHAAGQFQPFYLPASLQINTADVREMTP
EIADSSFPETIPDRLYHPPRLLS
>NE1009 D-lactate dehydrogenase
MLPAEFIKSLQNLIPSDRLYFDPVDCYAYAYDNSRKFFPPECVIFPLTTD
ETAKTVRLCNQFKIPLTPRGRGTGTAGGSLAEQGGATLSLEQMTQIISID
PPNRALVTEPGVLNETIQAAAKPHGFFWPPDPSSAAFSTIGGNLATSAGG
PHAVKYGTTRDHVLGLTAVTGKGDIIKTGCYTTKGVVGYDLTRLLIGSEG
TLAVITEMTLKLTPLPAAKSGLVAHYQDADSCASAIVSIMKEPQGPSALE
FLDSGSLNLIRARNPELLPSDSQAMLMIEVDGTEHEIPETTAKLLAACQN
SGLLKAVSVQNTADLWKVRKTLSPLLRDIAPKKINEDIVVPVDKIPRLLA
GLSALCKQYQIANVNFGHAGNGNIHVNLLIDPDNPSESERAYKCLDQIFD
LVISLNGTLSGEHGIGSEKRPYIGKELNDATLTLMKQIKLTFDPNNILNP
GKLFP
>NE0167 conserved hypothetical protein
MANDGYFEPTQELSDETRDMHRAIISLREELEAVDLYNQRVNACKDKELK
AILAHNRDEEKEHAAMLLEWIRRCDPAFDKELKDYLFTNKPIAHE
>NE2028 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNTRICR
>NE1874 putative ferredoxin 2fe-2s protein
MTYYQHHVFFCINQRANGERCCNDHHAQEMRDYAKARIKELKLSGKGKIR
INNAGCLDRCSEGPVIVIYPEEVWYTYVDQEDIDEIIESHLQNGKIVERL
RI
>NE2542 Bacterial regulatory protein, LacI family
MSDIRIQKTGEPDILQTSDGRLTLSVPIQIKRRSGRKLVTLPNGETAPVR
PWDVAPTSIQLALARGHRWLAMLESGEAKSLKEIATREGIDNSYVSRMVN
LTTLAPDIVAAILDDALPNHVTLFDLAVDPPALWDEQRRKVWDTSFSTSR
LMQDA
>NE2155 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRTFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE0594 putative reductase oxidoreductase protein
MASYRITFRPSGRIITTESTETILEAALHHGLSLPYGCRNGSCGTCKGKI
IQGIVDYGVHSEEALTEQEKNQQLALFCCARPLSDLEIECQEIEAIKDIK
IRTMPCRVHKLEHVASDVMIIYLKLPANERLQFLPGQYIDILMKDGQRRS
FSLANAPANDEFLQLHTRNYAGGVFSEYVFSHMKEKDILRFEGPLGSFFL
HDAPKKDTPIILLAGGTGFAPVKSMLEHIFQTENTRFTHNTIRLYWGART
RDGLYLSNLAEKWAAENENFSYIPVLSEPLITDDWQGRTGLVHQAVINDM
NTLSDCQVYACGAPAMVKAAFDDFTHQRNLQSENFFSDAFVPSKPATA
>NE1024 Peptidase family U7
MADTENINKEKWEHETLRELVFTSLKEQRKARNWGIFFRLLTFSYLFVLL
FWGLGWLDTEVAGGTGKHTALVDLRGEIAPDGLNNAENINNGLKKAFEDR
NTAGVILRINSPGGSPVQAGSINDEIRRLRIRYPDIPLYAVVEDICASGG
YYVAVAADKIFVDKASVMGSIGVLMDGFGFTGTLEKLGVERRLLTAGENK
GFLDPFSPSDPAQREHAKKILAEIHQQFIQVVQDGRGDRLKDNPEVFSGM
VWTGAKSVELGLADALGNADYVAREVIQAERIVDFTVQQGIAERFARRIG
RVATDFLSETRFTWFMR
>NE0716 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE1906 conserved hypothetical protein
MNHAADLPIWATLLIALFLLIGAGLTLVGSFGLYHLRSFYDRIHAPTLGT
SWGTAGIAIASVIYFSVVESQLVLHQLLIGVFVMATTPVTLILLSRAALY
RDRTENNRDVPPLHPLRNLPQASELEQDSMATHKNAGEEQK
>NE0077 Pterin 4 alpha carbinolamine dehydratase
MTNVCDLTDRKCKPCEGGVPPLEMEEAEKLLKQLEQGWQLADNKISRTFS
FKNYYQTMAFVNAIAWVSHREDHHPDMMVGYDWCRVEYMTHAIGGLSEND
FICAAKVDMLFKS
>NE1304 Bacterial regulatory protein, LacI family:Helix-turn-helix motif
MARMHNPPYPGETLREDVLPALGLTVTQAAKELGINRVTLSRVLNGKAGI
SVDLALRLEAWLDGPSAESWLKGQLAYDLWQAEQRGCAKVVVRHINREQI
>NE1357 hypothetical protein
MIQGDSDNGGNGRCQNTFHQMLQLKTAAAHLGFGKKQGQATVSRLYLFNP
APARINDQSRIERIASAARAAPFIGIRHLLST
>NE0113 hypothetical protein
MPEPLQPHEYPHRILLCVTGLSPQIVTETLYALAVARATPFIPTEIHLLT
TTDGARLARAALLHPDGGHFHALLNDQPQIGLPRFDEDCIHIISHHQEKL
ADIRTPAENAAAADTITALVAQLTEDADAALHVSIAGGRKTMGFYLGYAF
SLFARPQDNLSHVLVSSPFEGHPDFFYPPRQPRRLVTRDGHHIDTAEAIV
TLAEIPVVRLRHGLPATLIAGRAGFSETVVTLQQSFAPPCLLIDLEQRNV
VCGTTAVAMKPQLLAWLAWWATLARQGRPETTWREADARLFLDIYRTVVG
IDAIDYEKTAELLGNGMEKEFFQTKNAKLERVLKDTLGPAAAPYLLTTTG
KRPHTRRGLTLPPERIRIVGTGSK
>NE2571 Metallo-beta-lactamase superfamily
MQPNIQAFFDPVTWTVSYVVFDKPGGHCAIIDPVLDYDPKSGRTKHHSAD
VLIKFVHSKELTVDWILETHAHADHLSSAHYLQQELGGKVAIGSRISGVQ
QVFKKLFNMGPDFQPDGSQFDHLFDDGDTFEVGELKGRAIFVPGHTPADM
AYQFGDAIFIGDTMFMPDVGTARADFPGGDARQLYRSIRKLLDQYPPETR
LFMCHDYPPGDRPIQWESTVAEQRAHNIHVHDGINEDGFVAMRTARDATL
EMPVLILPSVQVNVRAGQMPPAEDNGRVYLKIPINVL
>NE1245 Kazal-type serine protease inhibitor domain
MKQQKNQAGTQARNACVLAGKRSPTQFTSALGLPGSGCSGFNPQLYFSSD
FGAGNGAFSSGRGLSPPSITRKTRKELPRYLNALITWLIVLSVSLVVAGC
EEGNPPQPQLQPQVCGTIQGLACPAEQYCDLGIGQCKVADAQGVCKTRPT
ICTREFNPVCGCDGKTYGNACGAAAAGVSIDHEGECKTAEPQACGGIAGI
RCPDGLACVDDPGDTCDPEHGGADCAGICIAGQGQ
>NE1609 Bacterial type II secretion system protein E
MQTNVALPDKPAIDPVMLSQAREQAIARSTSLIRVLEETYQGTPDELVTV
LGEVLRMPVLTMNDLYALHPAFDRLPFALAAKRECMLLQKQDQSYLLAVS
DPFRTGLRDWVEEYVPVDALWHLVHPADLAAFFVQQEQTMRAMDNVLSSV
QENLIQSGIEELSLKTIHEGTSQVVRLVHSTLYDAHKSQASDIHLETTPG
ALSIKYRIDGVLTSIGTMQEANLAEQVISRIKVMSDLDIAERRIPQDGRF
KVSIQGREIDFRVSVMPSIFGEDAVLRILDRQALTDHVEGLTLNQLGFNA
TSIASVRRLSAEPYGMLLVTGPTGSGKTTSLYAAISEVNHGHDKIITIED
PIEYQLPGVLQIPVNEKKGLTFARGLRSILRHDPDKIMVGEIRDAETAQI
AIQAALTGHLVFTTVHANNVFDVIGRFAHMGVDPYSFVSALNGIVAQRLV
RLLCPNCMVAEQPDQQMIADSGITPEQAEQFDFRSAKGCGHCRGSGYRGR
SAIAEILLLNDEIRELIIAQAPIRRIKEAARLNGTRFLREAALDMVKTGQ
TSLQEANRVTVVA
>NE1884 possible homolog of eukaryotic DNA ligase III
MTDFFRFPNTPHLLWLGQGQPRDDKILSDAEIAALLQDEVLIEEKLDGAN
LGISLDEHGELRAQNRGQYLPQPFSGQFSRLNSWLGQHGEILKHTLTPEM
ILFGEWCAARHSLDYNKLPDWFLLFDVYDREAGKFWSVERRNQLAQKLNI
TTVPLLKRTKITCNQLVQLLDDAQSRYRSGKVEGIVIRCDSPLWCESRAK
LVNREFVQAIEDHWRSRSIEWNLVHAGSVKRS
>NE1983 conserved hypothetical protein
MKTIAVLLTALLLATGCGNKESTGETASDEPEIVMMDESTSVGELPSILK
EGLSNKEQDELKKQIMPILDDGLTPEQRFLNLRKDAEAGNAEAQNSLGSM
YFSGEAISRDAQGKVKDKDPETAAGWFFRAAEQGHAGAQFNLGLLYFSGE
GVTRDTAKAVELFTKSAEQGNIDAQNNLGVIYLMGEGVKQNTDKAIEWFE
KAAEQGNEEAIKNLEAVRASQQDSKEAADKQK
>NE0263 Bacterial regulatory proteins, ArsR family
MPYYLFHGTSHYFIIYLINLMLNNENMDRFAALAEPNRRRMIEIIAAQGE
IAASDISNQFDISPSAVSQHLKVLREAELVKMEKRAQQRIYSINPQGVDE
MWEWLSQMRKFWNERFDALDALLLTEHNTPKGNNDEHNTN
>NE0500 putative UDP-glucose 4-epimerase
MNILVTGANGFVGQTLCPALERAGLRAVRAVRISTRYEEISVGEVDGETS
WSRVFDEGIDGVVHLAAKVPLAEKEKEAADSYHRVNTLGTVRLARECAAR
GIRRFVFISTVKVLGEECDKPFQADDSAVPSDAYAISKWEAEQSLRQISA
ETGMEVVILRPPLVYGPGVGGNFLRLLQMVDRRIPLPLGAIHNRRSLIYL
GNLVDIIRLCLTHPDAASKTFMVSDGEDVSTPGLIRRIGSVLGHGSFLLP
VPAAWMRRVGDLLGKRSAIDRLTGSLSVDSMPVQKELGWLPPYGMQAGLA
LTVQWYRQHKPETKS
>NE0534 putative transmembrane sensor
MSEVTDHHTVSLSEQAAEWVVRLSDSQISTAQQADFEAWLAQDIRHREAY
EQISRLWQSVTPQRRRKHRPAGLLASIVLVLIGIYCLPLSTWLADERTAT
GEIRQVALPDGSTITLDSDSAADVVFDTHARRIILHRGRIFAAVVPDAAE
QGAHHRPFIIENRDGTAQALGTRYIVEQTGDHSRVSVVESSVNVTSRDRP
DQSITVYSGQSIRFDSEQIHGQEPIPPAAASWVQSRLVYRNAPLSQVIDD
LARYRTGFLRISGAAAQLRFTGVLPADDPDTALKILQHALPIGIDRYTRV
LIWINLQT
>NE1275 hypothetical protein
MNSTGNTSNNARLHWSHVLLIVLATIVLTVAGTYWVLTTYVFVSSFEPVI
LSKKEEKTLEQKLRTIGYDFSFSSPTAKRNDDLKGEIDEEGFLKPQAYSE
QGAKREVNFTEREINALLAKNTDLAQKLAIDFADDLVSARLLLPLDEDFP
VLGGKTLRLNAGLGMAYRNDKPVIILKGVSIMGVPVPNAWLGGLKNIDLV
SEFGMDPGFWKSFSEGVEHIQVTDGKVDIRLKE
>NE2165 TonB-dependent receptor protein
MDNRSTFFIDRTQGVFMIIKTVSLRHGFLLGSVLATGLFVSASAVAQSAV
TQQTIAIDLAAQPLEQAITQLATQAGLLIGVDASLVAGKQAPALNGRFTP
LQAIGQLLKGSGLIVIENAPGRYTLEAAPAGHTNSNTEAVTLPEMKITSV
IDPDAPDNRSYRRSSAFSATKTDTPIMETPMSIQVIPRVVMNDQKTTTIK
DALENVSSVRPQSSLGRSNAFIIRGFRNGRVYRNGLVALGLSIGEESMFD
SANLESIEVLKGPASILYGRIEPGGLINLTTKKPRGESYYSIEQRFGSYN
SYRTEWDGMGPVTKDKSLNYRFSGSYQNNRSFRDFNFNDRVLVNPALTWR
PDDATELSLEVEALHEDYQVDRGLFAIGDRPAPIPVTRSFIDPDDPVDTH
SRVNLGFNLTHAFNTDWTLRNRFLASFVDDDNTSVKPANAFTVAQFLDPS
KGNRTYLRNIFSQISSSQTYTTNFDLTGNLEFLGTRHQTLVGFDYLRSTG
TYLTRGNYLSPVSGLEIDIYNPVYGIDPSFYAQALATPFPAGENHSFSKD
EWYGVYFQDHITLWDKLHILGGGRYEWATTGRGRGDSFRAAEANLPTRKD
NGFSPRVGILYQPWSWASIYGNWTTSFGANNGVTNTGATIDPERGEQFEA
GLKAEWFDQRLTMTFAYYHLTKENILTRDFNSPDPFAVAAIGKARSQGIE
FDLSGQITDELSVIGNYAFTDARITRDYSGLQGNRLSNVPEHSGSLWLKY
DIRHYQPLNGWQFGMGIFAAGLRQGDNENTFALPGYVRLDAFVAYRMKLG
PTRLIAQVNIRNLLDKRYYESTDPFVNAPPRAGIYPGAPLTILGSLRLEY
>NE1801 ATP/GTP-binding site motif A (P-loop):AAA ATPase superfamily
MYLAYYGFAEKPFQLKPDPNFFFPSRGHKRAAAYLEYGLSQEDGFIIITG
DVGAGKTTLMRYLFQKLRQERVAAAQLVSTRLDPDDALRMVAAAFGLSYE
GLSKAALLLELERFMRKCERESTRVLLVVDEAQNLLPQTLEELRMLSNFQ
SDNRPLLQTFLLGQPEFRRTLLGSQMQQLRQRVIATYHLGPLDQAETRAY
IEHRLQKAGWNGSPVLQDEIFEVIHNFSAGIPRKINLFCDRLLIMGCLEE
LQTLGKAEANEVMLDIQKEFDLAVSPE
>NE0511 hypothetical protein
MAHAKGGKLFDSPVRLNYIIGIKSHILDSNLQLKWSVLLDIDREPAETAE
EQFDSSFFLMTEELRQLLPELMEALGGTASE
>NE1110 Helix-turn-helix motif
MTIHIEELENMDFSDVAEGGKLHPIHPGEILREEFLMPLKITPHALSLAL
QIPATRINDIVRERRAITTDTALRLARYFGNTAEFWMGLQIDYDMTITRD
SLRGALNRIQRFEPTHIS
>NE2352 Mov34 family
MLTIHTKLISAMITQSLKDHPIETCGIIAGLAGSNLPLRLIPMRNVAQSE
NFFMFDPQQQLQVWKEMSARHEEPVVIYHSHTGSEAYPSRSDVELAAEPQ
AHYVIIPTCSPHKEEIRSFRIVDQMVIEERVQIVRQYQPELEFQMMVA
>NE0377 Sensory transduction histidine kinases
MTTLSYAAAAAAFFFLSILLITSWRGRLYGLLLAAACLSSSIWAATIAGL
SYLDYAYPLLIQVLELLRNATLTIFLITLLGPFQPGKTDNTSPKIRPAVA
AIALFYLICLAQIVVFGQTTGIDPDQPGASRDFLIWVAMAIIGMILVEQY
YRNTPREQRWGTKFICLGIGGLFVYDFYLYSDALLFRKISADIWVARGWV
NTLIVPLIALSAARNPKWTVGIAVSRRILFYSSALFGAAVYLLLMAAAGY
YLRFSGGTWGTIFQLTFLFGAIILLIVVLFSGTTRSWLKVFISKHFFSYN
YDYREEWLRFTRTLSEEETELRVRVIKSLAQLVESPAGGVWFKSEQHHAY
LPIARWNIPHIDASVAIDSYFCQFMQESSWVIDLYQHDSESQKYAAMILP
DWLPTIPKARWVVPLILHGELLGFVVLTEPRSAVEFNWEVRDLLKVAGRQ
AASYLAQYEAAQALSTARQFESFNRMSTFIVHDIKNLIAQLSLLLTNAEK
HKDNPEFQQDMLDTVALSVNKMKRMLEKLSSENIREEKNALSLNELLQRI
IASKSFYEPQPVLDILDTELNVLADLSRLERVLGHLVQNAIEATPKDGYV
EVRLKQQPGWAVIEIEDSGHGMSEQFIREKLFTPFESTKTAGMGIGVFES
REYVEEIGGKLHVSSEVSKGTIFTVMLPLVSPP
>NE1086 probable sigma-70 factor, ECF subfamily
MTIQNPENSLASCSIESLYCHHHKWLVGWLRRRLSNALQAPDLAHDIFLR
ILSKDSLPVIHEPRALLTTIAQGIVANFHRHQRIEQAYLDALAQIPESLA
PSPEAQAIMLETLVEIDALLDGMPPLVRKVFLLSQIDGLRQSEIAQRCGI
SVPTVKRYVAKALEQCCFIG
>NE1493 Cytochrome c, class IC:Cytochrome c, class I
MMPARVLLLVSLFLIVACSEQRDTGQTGNTLTLDQILGRDLSPEQVTRNF
DPEQVARGQALFRKNCAACHGQNAEGTPDWRKPLENGRYPPPPLDGTAHA
WHHSTEELKNFILNGGPPGEGRMPGWAGILADQEVDDILVWIKSLWPDEV
YQGWYTRIENRPK
>NE2562 hypothetical protein
MKKQIAGLVLLGLACTATSVSAEEDRELRQKVEALEAKIANLEGRSENEE
HGDSHGFDKHKFHGGVVLKQDAFFGFQTILDAGYEVADNIDFTFYSWLWT
NPNFGKSSVVSGGNNVGGQGLWTEFGIGLNFRFLDNTLSINPNIGMLNGS
LLSSEVVGEDIRAGEGVVPNLVVNYDNDYFAANLYVAYYMATRGPRARDF
LHNWINVGVKPALFGLGKTLPINSVGIHWEHLWAAKNRIDSSLEGVVYNW
VGPYIEFGLPKNLALRFAGGFDVKSDVSNNFYQASIKLNF
>NE1898 conserved hypothetical protein
MFSSRRKIILFLIILFTSVIYGYFSLQGESGKIRYTTRAADQGDIVRTIS
ANGTLTPLELVDIGTQISGRVIELFADFNDQVKTGQILAKLDPALLNAQL
QQSEASLKSAQTALSVTINKANRSRNLIEKGFLSKEAMDEVKELLDSARA
QVIVSQAQVARDQANLDYSIIRSPISGVVVARNVNIGQTVAANFQTPTLF
QVARDLKQMQINISVAEADIGQIHTGQTMVFTVDAFQDREFSAIVKQIRL
NPTVQENVVTYNVIATVINEEGTLLPGMTANVRFIVNEKTEVLRVPNAAL
RYKPSLEELIATAETQPGKRLLYRLEGNRPVAISVTTGITDGNFTEILSN
ELNVNDPLIIEEMIGKKQDKSAASNFRFRMF
>NE0903 probable cytochrome c
MKQPSKFRSAEKKQVAVFIASMLLSLSSIAATQPSANSDAEQVLRGEYLT
RAGNCMGCHTTRGGKPYAGGRELVTPFGKFVTPNITPDNETGIGHWSEED
FWQALHYGKGRDGSYLYPVFPYTEYTKVTRQDADAIFAYLRSLTPVAQTN
PPHKITSPYDNQFLLFLWRTWYFKEGVYEPDDSKSEEWNRGNYLVQGLGH
CNACHTPRNVWGASQDDKLAGGEIMGTYWYAPSLISHREAGMGEWPVDEI
AKLLSTGVTGHAVTSGPMATIVRQSLQYLSKDDMRAVAVYLQSLGEKRGE
EAPRKIPPMPERVSRYLALGEPVYIKHCQECHGKSGEGVPGVYPPLAGNR
SVTMTSPLNTIRSVLYGGFSPATAGNPRPYGMPPFQHIIRDHEIAWVVSY
IRNAWGNFGSLVSPEDVDSSRGNEF
>NE1991 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE1759 conserved hypothetical protein
MKFSHNCTYNLFAFCTSFSYRYAPWVLLIALLFTVASAVYTARNLGMNTD
TTDMLSEHLPFRINSVKYDKAFPQDSGILLLVLSAPTPEQAYSTASRLVV
SLKKDSKNFSDVYAPGTDDFFGRNGLLYEDIPQLNHIADHLAEAQPLIAQ
ISRDPTLYSFIGLLTQAVDELNKGQYLELGPVFNGVSETMDARLAGKSRP
LSWQTLFRGEPAKTSYQEIIIVKPRPDYSELLPGKQPILAVRTAAEQIGL
TDDGPIQLRITGDVALADDELKSSLDGMEVAGVVTFIMVGAVLYSAMHTV
GMTLAVLICLGVGLILTAAFATVAIGQLNVISIAFAVLYIGLGADFAIHF
LLRFREMLENGLSANDAIYKAGGEAGVALTACTVANALGFYAFIPTSYSG
VAELGIISGTGMLISLIVTFIIGPALLRYLSGRSAVEPNGRKTLGKALEF
SLKWRKLTHISVGLLLLLAIVLWPQIRFDYNLLNMQDPKGEAVQTFRELL
ASPDHSPWYAVVLTEDRKEIQQLQNNLIKLPEVSKVVSILDLVPSEQEAK
LAVIEEMSLIMGPGLSVTRSATHEHTIPQQIQALTTLNTALDRYLVQHPD
SAVSASARGLRDSVRSLLDRLDKADAGERQKLLRSLEEDLLSTLSVALQR
LYNLIEAVPFSEQELPESIAGRWHSHTGEYRIAVYPSEDIGDNGLLRHFV
RSVQQVAPQATGMPVISLEAGDAVIDAFFQAFSLALVGVVVALLIMLRSI
KYTVLVLIPLLLSSIFTGVFTVLLDIPFNFANIIALPLLLGLGIDSSLHM
VHRSMDNRVENEILIHTSTARAIFYSALTALVDFASLMFSSHKGTASMGA
LLTVGLAFTLICTLIILPSLLRNPNRQRVVA
>NE2531 possible signal peptide
MCDRAPGPARVRRAGELLLQRRQPRVLPIHAGHETQCGGLRRGLRRTAAL
ALLPGQRLRAAHQRAAVRRAGYSARSRIPAVHRGGVVSVIPWPYRLLTLA
ALSVALVGFGWIKGASHVQAQWDAAIQQQALQAAAVRERQAQATVKVVTE
YVDRVRIVREKGETIIKEVLVYVPVQADSACTINRGFVRLHDAAAAGELP
EPARDADAAATGIALSAVAGTVAANYQTCHENAEQLTALQAWVREMKVAG
EQ
>NE2003 nitric oxide reductase, cytochrome c-containing subunit
MSERLTKSAARNIFYGGTAFFIVLFIALSVHTHLYIVNTSTDETTLTESV
TRGKHVWERNACINCHSLLGEGAYFAPELGNVWVRYGGRDHAEGARMGLK
AWMRAQPTRVPGRRQMPQFNLSEQELDDLIDFFEWTSRIDTQGWPPNTAG
>NE1030 iron transport system permease protein
MVLVPVCVVFASFLSPDDEIWQHLISTTLPSLLTNTLWLSLGVIVGTSLL
GVSLAWFTSVYRFPGYRFFSWALFLPLAIPAYVIAFVALGTFDYTGPVQT
SLRHWFNSDLLWFPDIRSGPGVAIVMTLAFYPYVYLLSRDAFLTQGKRLL
EVAQSLGFTRQQGFFRVVLPMARPWIVAGIMLALMETLADFGTVSIFNYD
TFTTAIYKTWFGMFSLSTASQLASLLILIVLVIIMTEQRFRLRMRFSESR
KSARVERIPLTGWQAMAVTGFAGTVLFFAFALPIIQLGIWAMDALTIGLD
QRYLEFAWHSILLSVLAALITCLVAAMLAYAARLHSDGYTRFAIRLTTVG
YALPGAVLAVGIFIPMAWLDNLLSDWIENISGVETGLLIQGTLTVMLVAY
MTRFLAVSYFPLESALQRVTRSIDEAAAGFGVTGWSMLRMIYFPMLKSSL
FTAATLVFVDVMKEMPITLMTRPFDWDTLAVRIFSLTSEGQWDQAALPAL
TLILTGLIPIILLIRHSET
>NE0177 Fatty acid desaturase, type 2
MADNTTQANFPLYEARALVRDLMTPDPRIYWLDFLFHIVLGWTAFAVALH
TSWLSIWQILSYCVAVFAFYRAAIFIHELAHLERGTFKLFRLVWNLTCGI
PFMIPSFTYDGVHYDHHKPGIYGTGKDGEYLPFATQHPSGLVGYVLLSLI
LPLLLVVRFLLLTPISYLIPPLRKIVWERASSLTINPAYIRQADAVRNDH
DWRLQEWAAFLFAAIVISSVMLGKLPWQVLLLWYAVAVAIFIMNSLRTLA
AHAYRHEGDHSLTLVEQYLDSVNIPGNFLTALWAPVGLRYHATHHLFMSL
PYHNLGKAHRRLVQELGGNDLIMQTRRNGLGHALKQIWQESAAAAANKNN
TQS
>NE2485 putative transmembrane sensor
MTNEVSSQEGTAQATEASHHGADRFPESVVDAAISWAVRLDYNTPTVAQQ
QAFECWLQADPLHGLAWQRVHSLKGFQSDLGELPPGLAYDTLQTAQRHRE
RSGLSRRNVIKLLSLAGFALSTGWIARENTPWQRLLADASTATGEQKTLQ
LDDGSIIVLNTDSAVSTDLAGPHRLLMLWRGEILITTGADTGIAMKRPFW
VHTPFGRIQALGTRFVVRLEADRARISVQEGAVELHPVNGGFPVIVQSGE
SRWLAEEGTLPADLQGFEADSWQKGVIAGRNIRLQDLLSELSRYRPGYIT
CDPGVADLRLSGLFHVKETDRTLQFLTQIQPISITYRTRFWVSVGLRDFH
>NE1118 putative ABC-2 type transport system permease protein
MLTLLNAFHLGIKELRSLGRDKLMLFLILFAFTGQVYLTATGLPESLHKA
PVAIIDEDRSPLSLRIIDAFYPPHFLPPLIIEQYAADPGMDTGLYTFVLD
IPPDFQRDVLAGRRPAIQLNVDATRMSQAFIGSGYIQNIVTGEVTAFSQR
YRMLPAWPAELEIRARFNPNLSSVWFGSVMEVINSITMLSIILTGAALIR
EREHGTIEHLLVMPLTPFEIMLAKVWSMGLVVAIVATASLVFVVQDLLNV
PIEGSIGLFVMGMTLHLFATTSMGIFLGMVARSMPQMGLLMIIILLPLQM
LSGGMTPRESMPEAVQYLMLAAPTTHFVSLAQAILYRGADFGIVWPEFLA
LLAIGSIFFTLALARFRRTISSMA
>NE0174 Aminotransferase class-V
MDLWHICRLVCLMTQIYFDHNATTKVDDAVLDRMLPYFREYYGNASSSHS
VGLAARRAIDRAREQVAQAVGVQPAQVIFTSGGSEANNLFIRGVSDSLKP
SVLAVSTIEHPCIMRPARGLTRKNGEKWQLHYLAVDAAGRVKVNDAVDVL
TATRSAMVSMMLANNETGVIQEVAPVAEVARAQGAWMHTDAVQAFGKIPV
NFTELGVHALTLSAHKIYGPKGAAALIIDPRLPVRPLIDGGGHENGLRSG
TENVPAIVGFGAACELAMSRMAESSVKTAMLRDRLEHGLLAMGATVFGLN
APRLPNTCYFALPDIEGDTLVVRLDKKGFSVASGAACSSATPGKSHVLEA
MNVPPIVARCAVRVSLGMDNTMAEVDSFLAAVKHITDELKNMSALFNV
>NE1587 Helix-turn-helix motif
MDIKPIRTDTDYRTALKEVEMLMTAEPNTPEGDKLDILVTLIEAYEQKHF
PLDLPDPVEAIKFEMEQKGLTVKDLEPMIGKSNRVYEVLNRKRSLTLRMI
QRLHHELGIPAESLIKRSSYTHI
>NE1253 possible general secretory pathway protein I precursor
MAVQRGFSLLETLITLAVAALFFGVLLPSIVTNLDRIKADTLRAEALLVA
QNQMEAHAVIATEMEGHFEGQDGPFSWTATIEPSEKKDRSAGTAGPFSLR
RVHVDVFFQGGQPLVSLEAYSVGAIR
>NE0467 Domain of unknown function DUF81
MIDWSFTLAGALTGFVIGLTGVGGGALMTPILLLVFGVQPVTAVATDLWF
AAITKIAGARVHHTNGNVDWQVVKRLWSGSLPMALLVVLLVSMGTHITKV
DWLTKGIGIVVLITAIGLLTAPRMVALARKKRTGQPERFETMQSILTVIA
GAILGVCVALTSVGAGVLGSVMLLYLYPLRMTPHRLIATDIVHAIPLAVV
AGLGYLFAGKVDWWMLISLLLGSVPTVVAGSMLASKITGRWIQIALACVL
AAAGLKVLI
>NE1972 sun; rRNA methylase
MIEVQLLAVQAIREVFAGANLTEVLRNIWQTGSHLTPQQRGAIQDIAYGV
LRHYGQLDAILHKLLNKPVQDKQLHYLLAVALYQLRYSKAPAHAIVDHAV
SSSRKITRNPAVSGLVNAVLRNFMRKRTSLPDQIRNDDIARYSYPQWWID
KLRQQYPQDYEAILLAGNEHPAMILRVNRLRTSVDHYQAMLDAQNIDSEW
LWDTALRLSRPVAVEKLPGFSEGLVTVQDAGAQFAAPLLDVEAGMRVLDA
CAAPGGKSTHLAELAENIELTVLDKQEDRLARLKENFSRLKIAGYQLVCG
DAMSPSHWWDGQLYDRILADVPCSASGVISRHPDIKWLRRPEDIDHFAQT
QTAILNGLWPLLQRGGKLLYVTCSVFNEENDAVADKFLTTHEDASRLPCS
HGMLGRGQLLPGSHHDGFFYALFRKN
>NE1815 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE0429 conserved hypothetical protein
MAEIEKTVLVGYSASQMFRLVDTVENYPDFLPWCSGASMKLMEDNETAQA
TVHIDYHHIKHSFTTKNTRHPPELIKMELVEGPFEKLNGYWRFIPLSENA
CKIEFQLHYTFSHKLLEKLVGPVFYVIANNFVEAFVEQAEKIYGPSI
>NE1314 probable two-component sensor
MAETGNFVFRAVSDLRDTETKQTNGVKNGSMGILNILNQSPSAKNLQRLF
LLRIIAIISQIIIFWTVYSIIELELPWTAIIVTISLLAVLNFLTWIRLRY
SWPVTNPEFFTQLLIDVAGLTALLYFSGGSTNPFISLYLLPLTIAATVLP
WHYTWTMAAITISCYTFLLFKFIPLPHDHLDDHSRHMFEFNLHISGMWLT
FVLSTLLITSFVVKMNASIRDRDKELSRSREQALQNEQIIALGTLAAGAA
HELGTPLSTMAIITGELQQELPGNTEFQNNIRILRNQISLCKQTLTHLLA
DAGQARVEEGNSQPVDIFLQQVIDKWQLIRPSIRFTSQSEGVQPVPIIMN
TRLLSQSILNLLNNAADASTRQVTVISRWDEKQLHLEILDDGPGLDAEAA
EHAGEPFFTSKGPGQGFGIGLFLANTNIERFGGSVRLFNRPEGGACTHVT
LPVLHGTTSTSL
>NE1467 Fatty acid desaturase, type 2:Fatty acid desaturase, type 1
MDDIASILMNGLIDLPWWGYIVVALVFTHITIASITIFLHRYQAHRSLEL
HPIPSHFFRFWLWLTTGMVTKEWAAVHRKHHAKCETVDDPHSPQVFGIKK
VLGEGAELYKVETKNQETLDRYGHGTPDDWIERNVYTKHSVAGIASMLII
NVILFGPLGLTIWAVQMMWTPVMAAGVINGIGHYWGYRNFRSEDASRNIV
PWGILIGGEELHNNHHAYPTSARLSNKWYEFDIGWMYIRILEMMGLARVK
KIAPKLQLNKAKTECDLETLQAVISHRYEILAKYTKSLKIALKNELVHLQ
QAAIENGVDGLTLRNWVLADAKTLKEEERTKLETVLSHAQKLDKLYRMRE
ELSLIWERSAATKDELLKQLEDWCRRAEESGIELLEKFSRRLRCYAMA
>NE1209 ATPase component ABC-type multidrug transport system
MYPVAALPPSSAAEHDDTAIIIDHVDKHFGNVHALRDLSAEIHYGRLTGL
VGPDGAGKTTLMRILTGLLVPDAGHATLAGFDVVANNDAIHVVSGYMPQR
FGLYEDLTVMENMHLYAQLRGMDAGRHIDLFTKLLDFTRLAPFTGRLAGK
LSGGMKQKLGLACALMARPRVLLLDEPSVGVDPVSRQDLWRMVQALTGEG
MAVVWSTAYLDEAERCDSVLLLNEGRLAFAGPPGELTGRLKGRSFRLEAV
GAQRRAVLAEALDLDTIGDGVIQGAGVRVVLRAGAGPEKIQALAGRVGAR
LTPVPARFEDAFIDLLGGGPGGTSALAERLSPVELTSDIAVSCHNLTKRF
GDFIATDNVSFKVHKGEIFGLLGPNGAGKSTTFKMLCGLLKPTGGEAHVV
GLDLRRAPGIAKAQLGYMAQKFSLYGLLSVRQNLEFSAGVYGLDSTTQRE
RIEEMIRIFDLHDWLSAAPDSLPLGHKQRLALACALMHHPPVLFLDEPTS
GVDPITRREFWTHINGLARKGVTIMVTTHFMDEAEYCDRVAMLYRARLIA
LDTPDVLKRDAASADRPDPTMEDAFIHLVETSALTDEVKI
>NE1284 Band 7 protein
MKAVSSVFGGLLLVLAVLGSSAVYIVDEREQALLFQLGEVVGVKTSPGVY
FKIPVAQNVRFFDSRILTMDSEEPERFITSEKKNVLVDLFVKWRIVDVKQ
YYVSVRGDETLAQTRLAQTINSSMRDEFGNRTVHDVVSGERDKIMEIMRQ
KANADARKIGVEVVDVRLKRVDLPQEVSESVYRRMEAERKRVANELRSTG
AAEAEKIRADADRQHEVILAEAYSEAQKIMGDGDAQATAIYADAFQKDAK
FYEFYRSLEAYRKSFKSKEDILVLEPNSEFFKYMKTPLDRKK
>NE0118 hypothetical protein
MNSAILSQDGRVVIPKKFRNKLEIKEGDEVIWKEEDEQIILTSRRQELER
AERFFGQMMADYAGDSLADELIAERRAGAIYEHTSHG
>NE0783 hypothetical protein
MHTTDPAGWCSNRNSLIRKLLTSFIGSSSGAVKICRKAATVSFILATGSS
QLFASGSSPTATQVDWSKAPPVSPPPRAGIFVKPPMGPGYFSLLDLIDGN
EREKPQVDPLPPSALTTTPAFDFDFRYLEQPGHDKDFFDPVKRIHLGSDW
LLSFGGSFWYRYMHETDSRLNAAGINNDYHLLRTRLHADLWYQDQFRLFA
EMLDARALGLDLPALAIDKNHTDMLNLFADVKLGQFMDGPAYLRVGRQEL
LYGSQRLISTLDWANTRRTFQGVKTFWQTPAFNLDAFWVRPMVTEPNQFD
NWDKDRNFVGLWGTYKAIPGQVLDLYYLSLVDNRNVSPANITQGNVLQGD
SVLHTIGARWVGDYERILYELEGMYQFGKRSHLDISAFSIASGVGYQLPL
PMNPQFWLRYDFASGDKNHRDGRSNTFNQLFPFGHYYFGYIDQVGRQNIH
DFNAQFTLHPQPWVTFLGQYHRFYLANKRDYLYNAAGAGTIRDITGQSGS
HVGDEIDFTINFHLSRHQDVLLGYSKLFTGEFLKNTRPGVSPDLFYAQYN
FRF
>NE0331 putative periplasmic transport protein
MLVMISGLRAESLENEGAERVVITLQNTLIQAMQQGQEIGYSGRLKLLTP
VIEQTHDLAAIIRSVLGTHWAKLDSDQQQAITRVFQANSIATYADRFNQY
DGEQFKIIEQTQLPRGRMLVRSQLIRSDGSPVNFDYVMHEAGGSWRIINI
VVDGVSDLALKRAEYSAVLQKDGVTALIDMLEQKTSRIEQNQQ
>NE1816 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE0036 Outer membrane efflux protein
MMKSLRGFAFLLCLCLYFCEQLQAADLIQIYREALEEDAQYGSARAAYIA
AQERLPQGRAGLLPDLRVTGTAQNQYIDTSGAPTREIKNRGFTVALTQPI
IRFENFIIYQQSKNEVAQADAQFVIAAQDLILRVAQAYFDVLKAKVDLEV
VESQKKAIHEQLEQARRNFEVGVSTIVDTHEAEARYDLTLSQEIAARNKL
EISQRALQVLINRLPSDLQDASLDRIIADPLTLPHDGMDEWVQLAEERNF
QLKVQRIAYEISEQTIDRARAGHYPTLDLVAQYNDQSGVGGTFTGRGIDL
VSKSVGLQLTVPIFQGLSVQSRVRESLANRDKARKDLENVQRTIALQVRQ
NYLNVTNGIAQIKALKQALTSSRSQLDSTMLGQEVGVRMEIDVLNAQQQY
FSARRDLAQAYYDYLMARLRLKAEVGDLDEDDLLEINALL
>NE0550 conserved hypothetical protein
MSSESAHQHAAGVDEERGTLPRRSWRQLWLSIHLYLGLFIGALLVILGLT
GSIAVFWAEIDEWLNPELLTVTVPEQKNLAPGAPAYQSLDEIIRVARQAA
APDSRITTVYGARNSEAVYAVYASQPSSAWQRIFVDPYRAQVTGVRSYGA
NEWIPNYFMDVIFQLHFSLLLGMNGQTLMAVCALLLLVSLITGLIVWWPT
SGQWRKALTIKRGAGPVRFNFDLHKTLSLYLFPVLGAVLLSGVFMNLNEP
FVWVTQLFSPATRQPQHTLTSIPITGIPSIGAEHAWAIATEHYPDGKFGG
MFMPGNAEGVYIVTQKHVPKLSAFWSERQIAIDQYSGEILDVRAPDARRS
AGETFLEWQWPLHSGQAFGWPGRILIFLCGLACPVIYATGVIRWLQKRRV
KVRSPRRPIR
>NE0548 putative transmembrane sensor
MPESFHDDIDATARHWFIRQQGRALSAEEQRALTDWLAASPDHRTAWTRA
EEDWQSLDRFRTDLSSELNKARHYRPRHHSSRYLKWAAAALVLIALPILY
RASTGIPRHWQTATGERQHVTLADGSQLSLDTATRVTITQSWFQRLIQLQ
EGEIYLEVAPDWRPLQVTAADTRIRDIGTRFSVRDTPEKFTVQVEEGAVE
IIRRDKHLVLHAGETVTRHPATDRWQLATVQGDIANWRQGILVFDQHRLP
EVLQELARYHAVRFDLADPALGRKQISGRFRLDELDTTLRIIAETLNLNI
EHPTPDRYLLSAAQKQ
>NE1794 hypothetical protein
MLNVYLTVDTELWPYSDGWPVRALSPYKIAFDEEIAACFYGKTSEGEFGL
PYQIERFNQYGLKATYFLEPLFADRIGSNHLADIVDLIQRNDQEVQLHLH
TEWLSEIYDPTIPVHFKQYMHQFTLDEQVTLIAKGIRSLQAAGVKELHAF
RAGGYGANRDTLRAVAQNKLLFDSSYNSCYLGEDCKIDLNEQLLQPCKIE
GVWEFPISFFQDYPNHWRHVQLAACSTKEMETVLLNAWRQGWFSFVIVLH
SFELVKGRSIGKLSLPDKLNISRFNHLCKFLSDHPDKFRTTLFSELDPIT
IPEIRPQKILYSRLHHTIKRYAEQIHSRFF
>NE2376 hypothetical protein
MPVWLQMSFVLLYAYKQNKGVDCMKKIKLEWSLFSLTVAVLLGTGLEVSA
HTRFETGTVGEGVRVTNNVVIGHPCGTNPIIGTSVVFPDGADSTVLVDGQ
PQGSPLTDFVSNWGDNVRPLLSRAAFDHIQPKQGAAGNVVGFWAAGGPGM
PEGMVAYVPFRLNAVNIVPGSCAKSVTFQVSIVDICQITGPGGLQTEGVV
ELWTHNDLGTIFDSNIDNGPAPLKITRNLTANPLPESCGQGVDVTVKPSA
AQINRDMPIVYERKQIWPSKTGKKHSLKLK
>NE2509 hypothetical protein
MRSCELRTRHQRAGTLAGTAPARDSCDHPAGEPGPPWPTRRATTACAQTL
SRRRPIMRELDKELKDLRLYGMAGAWEDLVKQGGHATLESSRWFLKANCS
RPRGGWSSGCGRACTSSTAAARSA
>NE2233 Fatty acid desaturase, type 2
MNQPHTLSPALAGDKSRLSKQARNEIQALSGARPKEFLIQAGGAWGIIIA
VIALAIHIDNIWMTALAIPIIATRFNILGLLVHEQVHQLGLRGRYGDTLA
NILTAYPIGITVEDYAKVHLSHHKYYFTEDDPDFLRKAGPDWTFPMSTRH
LIKLLLSDLSGLSFYRILKGKRLENKNVYNRPHPIPKWLRPGFYIIVAVL
LTYLELWPVFLVYWLLPLMTFTPLIVRLGAVCEHVYNLPKANIIESSSLI
ILSWWEKLILPNLNFTLHAYHHFFPGVAWCNLPKIHEIFKRENLVNETAV
FYGYWDYLKYLQTTQIENVNEMKYHQS
>NE2458 putative GTP-binding protein
MTHPLFRHAEFYTTVNRLQDLPQTAGVEVAFAGRSNAGKSSAINTLVGRE
RFAFVSKTPGRTQHINFFQLGEERFMVDLPGYGYAQVPLAIRQHWGHLLS
SYLQTRQSLYGMILIMDIRHPLTKLDLQMLDWFRQTKKPVHVLLTKADKL
SKSRALVALNEVRQFLTVNYPHCTVQTFSSLKVAGVEEASQLLQNWFDTG
HASVQQENGEISEQKKTPAKGD
>NE0021 conserved hypothetical protein
MKHEANKRAGRIDFAGWAKRGMLIGMAAIVTSYSLPLLAEDIGSVSTRFK
FLGANDKIVVEAFDDPEVAGATCYLSRAKTGGISGTVGVAEDKSDASIAC
RQTGPIVLSEKIKNGKSDGDEVFKKSTSLLFKTLQVVRFYDAKRNVLIYL
TYSDRVIEGSPQNSISVIPVMPWH
>NE1363 conserved hypothetical protein
MRFHHAHGSLFPRKILICIKARINPDLVMNGIPLCPKLYHIVHVDRLSSI
LKDGFLWCDVHMAQHIPVGTTIGMNNIKQRRLQNCLNSYPDLHVGDCVPF
YFCPRSVMLYLIYRQNTELDYKGGQGPIIHLEADLNAVTTWAKTQSARWV
FTLTNAGSFYFEDRNDLTCLKEVNWTAVHALNWKEHKEGKQAEFLIEQCF
PWNLVERIGVQSEVIYNHVVNALPVNGHRPKVEIKPEWYY
>NE1992 Sigma factor, ECF subfamily
MTKSRFTSLSQAVEAYYDELRQFIHQRTGSSLMAEDVIQETWIRARTTSA
DLPDNLRAYLYRMAGNVAVDHVRRQKNWGREEGDSYEAENSNEHLDQLPG
HTPDLIDAIISQQELAILDAAIRELPDKCREVFLLYRGHGLSMREVAVHL
SISEKTVEKHVARAMLHCRQRLRDAGRNV
>NE1195 ATPase component ABC-type nitrate transport system
MTTAVHAPAHLEVRGLGHKFALQTILEAIDLDVTSGEVVALVGPSGCGKT
TLLHLCAGLMPVRTGQIISSFNNPACMFQQPRLLPWKSTLDNIALGLKAA
GTGRDERTRRARMLGLQTGLAHEDLGKFPHQLSGGMQSRVALARALAIMP
DLLLLDEPFSALDIGLKAELYALLLHHLNTCSLGVLMITHDLMEAIRLSD
TILVMAPSPGRIVHRFRIDLPAMQRDDAFVYKMTAELLQDSAVRTSFRLA
PIKSELLTRQELPC
>NE1270 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE1807 putative ATP-dependent RNA helicase protein
MSFENLNLHPAIVKAVLAAGYTAPTPIQQQAIPDLIAGHDVMASAQTGTG
KTAAFMLPALHRLATPAQIRGRGPRILVLTPTRELALQVSDAASKYGKFL
PRINVVSILGGMPYPLQNKLLSQTVDVLVATPGRLIDHIERGRIDFSRLE
MLVLDEADRMLDMGFIQDVERIALSTPATRQTLLFSATLDVAIEKIATRL
LKAPKRIQVAAQHTKLDHIEQRMHYVDDLTHKNRLLDHLLRDTTIKQAIV
FTATKRDADSLADNLSSQGHKAAAMHGDMTQRERTRTLTGLRQGRLKILV
ATDVAARGIDIADITHVINFDLPKFAEDYVHRIGRTGRAGASGIAVSFAS
GKDVAHLKRIERFTGNRFEFHVIPGIEPRTKPRFGRSDDKPGRRPSSSAA
AHKTRRSWSDNPNTRTASPGHRGDKDAGFGQPFGRETRKRPFRDSKFNSA
DRFARTE
>NE2454 MFS family transporter
MSRVEIRASASLAGVYALRMFGLFIILPVFTFFAKELPGGDNYTLIGIAL
GAYGLTQAILQIPFGWLSDRIGRKPVIYLGLVLFALGSFVAAGATDIYWV
ILGRIIQGAGAISAAVMALAADLTREEHRTKAMAMIGMSIGMVFALSLVI
APLLDQWVGVPGIFVITGCLAILAIGVVHKVVPDPVVSRFHSDTEVTAAR
FSSVIRDVQLLRLNYGIFALHAVLMALWLSIPLSLRETGLAAADHWKVYF
PVMVGSILLIVPAIIYAEKKAQLKRVFIAAIALLLLAQILLAIFNASFWG
LVFALLLFFAAFNLLEASLPSLISKIAPVGAKGTAIGIYSSTQFLGAFVG
AGLGGYLFQHFGFYALAAMCSGLLILWLVLAVTMQAPAAVRSRMYQVRKM
DSDEANGLSRELAALPGVYEALVLANEQVAYLKVDLKGFDEEKVVQLLEG
NI
>NE2518 Patatin
MMESKNEPGASALLRVLTLDGGGAKGFYTLGVLKEIEAMVGCPLHQKFDL
VFGTSTGAIIASLIALGHSVDSILELYRKHVPTVMSQKTAPARSQALKKL
ASEVFGDATFSDVKTGIGIVTAKWLTERPMIFKGSVAQAHGQVGTFVPGF
GVSIADAVKASCSAYPFFERTVVRTSMGEDIELIDGGYCANNPTLYAIAD
AVQALRSDRKDIRLVSVGVGIYPDPKPSLLMWLAKKYLVSVQLLQKTLEI
NTQSMDQLRQILFPDLLTIRINDSYVTPEMATDLLEHDLKKLGILFQRGR
ESFASREKQLREYLI
>NE1192 Sigma factor, ECF subfamily
MLTVTDYYKELLNFCARTLKDRDAAADLVQEAYTRFLVAQRSGAAIADPR
ALLFRTTRNLLIDQHRRAEVRMHLSLDTLLEEEQPLAPRHLQPEEVLAFS
RYAQALYAAIESLPRRCREAFILNRFDGLSHQEVAEKMGISRNMVAQHII
RGVLVCKACDDRFHGRVVLASKPAE
>NE1862 hypothetical protein
MGRILKPDLVMTPIDENIRLGSILAVFASAIGVKQSGSGGNRDIESLFTT
LYLRLNGTCGLAKTKFGADDGRKDMQYTMTESGVVQPEVAEVREVRWGGD
SLPVIITVMTGSRFCSPDGHLP
>NE1459 putative transmembrane protein
MPSILTYLSASLLYGIAGWYFWRAMRTDTSAAGAVPNVRLQQYVMLLPLL
VHGLVLYHAVFMDNVLSFGVGNAISAIVWLTAVIYWVSGFFSSLRGLQNL
IAPLSAIAAVAVLIPLLLPSIHPLAHAGMTAFKAHLLAAMLAYSLFTIAA
LHAVLMTLLERRLHHSEVSPIFSQLPPLLVMEKMLFRLVWVGFILLSLTL
LSGIVFSEELFGQSVPFTHKSLFGFISWGIFAALLAGRHLYGWRGRTAIR
WMLAGFVALVLSYIGSKFVLEIILNR
>NE1066 hypothetical protein
MAITTLSSRELNQDIGRAKRAARNGPVIITDRGKPVHVLLSYDEYQRIIG
QQENIVDQLGLPSGIEDVEVEFPRSRELARPADFT
>NE2246 hypothetical protein
MGTSFRQYFKIVPALTEELKREAYRVRHSVYCEDLQFESSRSDGFEIDEY
DAHSLHLLIRSINNDTFIGCTRIIRPSSNSNDRRLPFEKTCAQTLDRSII
DPSRLPADKIGEVSRLAVIAAFRRRKGEKNHPINISEEDFSTGPMMRFPY
IPLSLYIGTIELARIHDIRVLFMLTEERLASHFSRLGAQLEPIGAPVEHR
GLRFPSMVEINSIISNMRPIFRPLYQAIAEDIKAELEKKNH
>NE0658 Short-chain dehydrogenase/reductase (SDR) superfamily
MNTVLITGANRGIGLEFARQYAADGWQVVACCRQPQQAEALNRLADQYKD
RFSIHRLDVRELAEIDQLSHKLQDLSIDILINNAGVYPHAQNGEFGRISY
DDWMEAFRVNTFAPLKMVEALIEQIACSQLKIVATITSKMGSIADNQRGG
SYIYRSSKAAVNTVVKSLAIDLQPRGIIAVLLHPGWVQTDMGGRGALIST
KQSVTGMKSILDRVTHSDTGKFIAYDGQHIPW
>NE1740 Transposase IS4 family
MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV
LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF
EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK
LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT
EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI
EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA
>NE0171 conserved hypothetical protein
MEKKLSVLLFTLLVSLALAARADEILLKNGDRITGTIINKTGNTLEIKTE
YSDKISVKWSAIESFSTTQPVLLTLKNKEEMTGMTEVSPDNTLKIKKEGV
YQSEPIPLSEIAEINKKFFSGQVNAGGALFSGNTQRQSYNLNTDLVVRGR
DDRVSFGGQFNYADSKSRDSNGQKETILNARNFQLYGDYAHFFTDVWYGY
AHGLFTNDRLQDIKLRSAFGIGAGYQIFATGDLNLSFEAGPDYVNVDFYD
YPYQCEKKLTVDPAACNRIKDRSNVAGRWSTRYDQWIWNRAVQLFHTHEG
LAASDLFVRTRTGFRVPLWYGFQFVNEIQVDYYSKPAPGKEKFDTRYLFN
IGYGF
>NE1418 PpiC-type peptidyl-prolyl cis-trans isomerase
MRLSKFLCVPLTVLGLVCVAPLNAQSGGTVAKVNGVAIPQSRLDLVVKAA
TAQGQPDSAEVRNALRENLITEEILAQEAIKKGLDRSPEVVTQIDLARQG
ILIRAYQADFMRNNPVSDSELRKEYESVKAQMGDKEYKARHILVETEQEA
KDLVAALKKGSAFDKLAGERSIDTGSKSNGGELGWSSAAVYVKPFADALI
RLKKGETTSQPVQSPFGWHVIRLDDVRTAVPPSFEEVKQNMQQRVLQRKF
AAVVESLRKNAKVE
>NE0973 hypothetical protein
MKNIVALLTVFILFMTQSASAELYDPDEVQVYIVPMIDFPEPAAAQLSKI
LSDDMKIWVKSSVRLGDLEAATLPGTRQLSGDSIIEKSYPIVTKLPGSSK
NTMYVLLTTRDINSETGAFRFQFSMHHSEMRVSVVSMARMIEFIDGKPVV
NHLVLNRLYKMCKRAIGEQYFGWKRSTDINDIMYSPIMGMPDLDRIGIHH
KENDDENEVEPVDKNRISI
>NE0432 putative transmembrane protein
MEGSVPGFWRRMLALAYESLLLLGVWFIAAFLFHLLFRDPTAEYFRPLFQ
FYLLIVGGIYFIWFWTHSGQTLAMQTWKLRLVSANNGKVTTQQAMVRYLM
AVIGISFLGFGLLWALFDRNRQFLHDRVAGTRIIRLG
>NE1364 Appr-1-p processing enzyme family
MIEYTSGDILRCEADALVNTVNCVGVMGRGIALQFKNMYPANFKAYEAAC
KREEVQPGRMFVFETKQLTPPRLIINFPTKRHWRGKSRIEDIEAGLVDLV
NVIRDKNIRSIAIPPLGAGLGGLDWKEVRPRIEHALGELEGVQVIVYEPN
GAPASDKMAHVREVPKMTSGRAALVELMQRYLSGLLDPFVTLLEVHKLMY
FMQEAGEPLRLDYIKHHYGPYAKNLRHVLNAIEGHLIAGYADGGDAPDKP
LSLVPGAVAEAKSFLDQHEISRARFERVTRLVEGFESPYGLELLATVHWV
IHREGATQSDSVKRQIYQWNDRKRQFTQRQLVIAEERLRSQGWLSPETTF
TY
>NE0552 conserved hypothetical protein
MMKLFWTPEALQDRDAIYDYIEVDNPRAALALDELFSEKAQRLPDHPALG
HPGRVAGTRELIAHQNYIIIYDVTGELVRVLRVLHAARQWPPSEND
>NE1280 hypothetical protein
MKYLTIVVLSLLMIPASITAGDLSNLQQQTNQAYEQMKQAESQLSNARKD
VQVKQDNLRYHQEKAAETEKELQAAQQVLQRAEENQSAARKKWNENSEEL
YQKWHRKK
>NE1888 Domain of unknown function DUF81
MEAWPVYLLTGSAVGFFAGLLGIGGGLLMVPILASVFMSLGFPADHILHI
ALGTTTAIITLTAISSLRAHHAHGAVNWWIVRYITPGIIAGALAGSTLAG
QLSSRILGIIFVLFIYFAATQMWLNLKPGTGHVLPGKAGMFAAGSVIGAL
SSLVAIGGGLLTVPFLTACQIRLHHAIGTAAAVGFPVALASAAGYAINGL
LLTQPLPDYALGYIYLPALITVGLASTVTAPLGARAAHVLPAALLRKIFA
GLLYLLGTKLLLDLWN
>NE1119 hypothetical protein
MFVCSDFKQRSRKMKKTYLLSTVLLMFLSSSVIAEETLTEKLETKTNDVE
RATNKAINRAQEATCTDSDAECLKQKAKNRASEAYDATKDKASEIKNKVD
>NE2075 Diguanylate cyclase/phosphodiesterase domain 2 (EAL)
MAGWGIRSITGTSKLRAAMLLVVVGLLSVSGWLQRLDWLVYDEIISRQSF
QPDSDIVIVAIDEESLRTIGKWPWSRELHAGLIDRLSQIGSNVVALDLLL
SEPDVDHPEADKRLKTAIFTHGNVVLPVAPAQNASGDRFSLIQPLASLQR
RATLGHVDVELDRDGVARRTFLYAGIDIPAFPALGLALADKGSLPNRMRL
LSRMPEADKTIVNTGNSWVRSREIMVPFAGPPGTYPRISYARLLNDDDLL
DDLRGKIIIVGMTATGLQQGFLTPVSLTRHNYMSGVEWHASVVDMLRNGR
SIYPVPEMVAIAVTVLWVLAVLLVVNIVPGRFRSLVLLVSLAAGIGLVTM
LLGVFHIWLPPGVALIGTLAVYPLQSWQRINEFIQSQFITGSCLQAVFDS
VQEGVITVDAKGNILYLNQESERILGARSEQLIGKPLEQLIGVCLIPDKQ
DRDVNGSGSADTRPDSRIQHCVLTLPSGDRRTIRVTHRALYDRWRIPAGA
VVTISDISDTLEMKQQIARQANYDVLTMLPSRNFLLSRFGELAVSARQHG
AALIVCLVSVDNFSKINDAFGYHSGDALLKMIADRLLKLVCSEDLIARWS
GDEFMLVVDRPVREGQELQFARKVQETINRRFEINGQEVFISASIGIRIS
SQPDQVDEKSEIFLDEAVSAMRRVKRQGGNGFQFYASGQMNAWTREQLEF
ERDFRLAIENQDKALHVLFQPIVDVQQKRVVHNEVLVRWAHPTRGLLSPG
DFIPLAERVGLIGQLGDLVLWISCRIAADLAKVGQAVDISVNVASRQLLH
PDFLQKVAGVLERTGLPAGRLMLEITESAVISDLPRAAEALVRLKALGVA
IALDDFGTGYSSLSLLRELPLDILKIDKSFIQGIEQDHPGDEKFDFAIAR
AIIALGDNLGLGVIAEGVETEKHMEFLRKHHCYLQQGYYFSRPLSTTQLM
QFMGRPDSIRV
>NE2156 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE0993 UDP-N-acetylenolpyruvoylglucosamine reductase
MQTEIQLPIMDTPDLQSRTSGIDTRLLRGELRQHEPMKRHVSWRAGGHAA
CFYQPADLEDLALFLRHWPKDEPVVMIGLGSNLLVRDGGLPGVVITLHAK
LNDLSLVEQNESGGLIYAGAGVPCAKLARFAASHNLTGAEFLAGIPGTVG
GALAMNAGCYGAETWDRVERVTTIDRSGIVRERTPEDYQVGYRHVELQRT
PSSDTPDVWFTGGWFRLRPGKMESSRQAVRTLLTARIKTQPLGFPSAGSV
FRNPPGDHAARLIEQCGLKGFRIGDAMVSTLHANFIVNCGHATASEIEAI
IDTVQNAVHQATGIRLITEVRIIGQLGGSTQ
>NE1197 conserved hypothetical protein
MRTMIQILVAVLMALTVLSVHAERRPHLVLAGPLAAVSFPLIHMVESGTL
ADLAETVEFVSWRDPDQLRVLALKNKADVLAMPTNVAANLYNRGTDLELI
NVSTWGILWLVSRRDKLETLADLKGEEIAIPFRADMPDLLFGLISEAQGL
DPRHDFKLRYVATPMDAMQLLIARRIDHALLAEPAVSMALRRTQSFPISV
IAPELHRSIDLQAEWGRVLERAPRIPQAGIVAMGTIRHDPDLIARIHDAY
RNSLAWCQANALECGRLVATYTDMLSAEAVSDAIAVSPLEAVPAAAARNE
LEFLFTTLMHRNPAVIGGRMPDDDFYGHGSP
>NE2265 hypothetical protein
MDFKSGIKHGVAITGIILGLSITPAQAGVAGSMVLDITGGCFSYGANGAT
GCGISDGSPDAAGAKEYATGSFTFSNTLSGISNPAAYAYQASVSLYAEAP
PDNPVISFYDTRSKSFATLADLQSDPLWNTAYAFVTAVLANTNGSFTATI
PTPPAPPGTTVEASWNYTLSNLTPGPGSTPAYATGEFEAWSKDDLNGLAL
ILFGPEQPLPTSPVNFSLTVALSAIPEPATIALIGLGILGMGAAQRRKTP
AALPV
>NE2163 hypothetical protein
MQHLEAVRNILGDVLNLGERKHTLTASSVLLGNIPELDSMAVVNVITALE
EYFDFSVDDDEISAQTFETLGSLALFVEHKLSH
>NE0763 tRNA pseudouridine synthase B
MLNKPSGISSNRALQISKRLLSAAKAGHTGTLDPMAQGLLPICLGEATKF
SSTLLGVDKTYIASLRLGYISNTGDAEGEIRQVVGSDVNPPDFGQVTGIL
QTFLGRSSQIPPMFSALKQHGKPLYRYAREGITVERKAREIVIHAASLDT
LSGFEMTITVRCSSGTYVRTLAEDIGKALGYGGAYLTALSRISVGHFELS
QACDLDQLESETPVNRQKLLCPIDSLLNDIPSIVLDDDEALRLRQGQKIR
KNMSRYGLPVNTQLKLYDDRNVFLGLGERIDPEVIVPRRMISLHEVVTGA
IAGSVDLQ
>NE2504 Transposase IS4 family
MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV
LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF
EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK
LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT
EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI
EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA
>NE0199 hypothetical protein
MFPIKSRPLKIIIFWQLAFAILAAVLCGLLSGVNAAISGFLGAIISVIAG
AVYAILVSRHSGYSASGTLRTALRAETVKIFIIIMSLWAVFAIFEGLQPV
MFIGSFVVAVLISSMAVFVPEKLNK
>NE0982 Domain of unknown function UPF0040
MFRGSTQLSLDSKGRLAIPAKYRDELFASCGGNIVVTADPSRCLLIYPQP
VWEPIEKKLNSFPSLSPQIRSLQRLIIGNASDVEMDSSGRILISAPLRQF
AGLQKEVVLAGQGEKFELWDMAKWDLEIDTATTYKDGDIPPELEGFSL
>NE1501 hypothetical protein
MINIEETGNLIVVAVIGEFTLDDFRQFEDQVLYQFHFNGEANLLLDLRDM
VSYSVDVAWKEIRFMREHGSKINRVAVVTDSQWQVWSAWVSNLFTDADVV
VFSDYDEAYGWARI
>NE1605 putative prolin-rich transmembrane protein
MESDMAGTLYRKLILWGALAATVLAALLVDEGTELSVDDVVQPAVDISAD
RRTAGQTRQIRQTHETLPVDQLGKRKFSAKADDIFAVTSWEPKRTASTDF
NPQIFQPRKEEVVRRPSAPPLQFEYLGRVVSEGKIRVFLAQADQNYVAGA
GERIGTEYRIDRIREDTIELTYLPLGIRQTLTIDQGTFD
>NE0820 Zinc-containing alcohol dehydrogenase superfamily
MTIKAYGARAGDLPLEPMNITRRTPSAHDVQIDITHCGVCHSDLHQVRSE
WAGTLYPCVPGHEIVGRITAVGAQVSGYKPGDLVGVGCIVDSCQHCADCN
DGLENYCDHMVLTYNGPTSDAPGHTLGGYSQQIVVHERYVLRIRHSEAQL
AAVAPLLCAGITTWSPLRHWKVGPGQKVGVVGIGGLGHMGIKLAHAMGAY
VVAFTTSESKREAARALGAHETVVSRNPDEMARHVGSLDFILNTVAASHD
LDAFFALLKRDGTMALVGAPATPHPSPNVFNLIMKRRSLAGSLIGGISET
QEMLDFCAKHNIVADVEMIRIDEINEAYERMLKGDVKYRFVIDCASLTA
>NE0559 TonB-dependent receptor protein
MPHITGKFRLCWPGIILSFLIMITSAQASNTFFEFDLPSQPLTQSLEQFQ
KTTGINIVYAPKELANYYSPAIQGQYTPRQAIQILLAEHRLNAEFISNSM
IAVKPASAVQQLPEMTVTGTPDPDSPDNRSYNHSSAFSATKTDTPIMETP
MSIQVIPKALMEDQQAIQLKDALKNVSGVFTGNQSGEYGYDKFFIRGFPN
AGSTQIYRDGLLLPRSNNDTFNIEQIEILKGPAAVLYGRVEPGGLINTVS
KKPLDTPYYSLQQQFGSFDHYRTMADATGPITSDKSLLYRLNIAYQNTGT
FVDLVDNERILVAPSLQWKPGDRDVLNFRLEYQNDNGRYYDGIPAVGRRP
ADIPISTFLGWGGDNEYQKQQRIVTGYDWTHHFNGDWKITNRFSYTNLNY
KFINTYYGDSLADDNRTFTRANYHYPSDKTDAYATNLDLTGRFTTYGISH
KVLLGFDYYRKDTHAIGYCCNALNGFIPTVDIYNPIHADVRVPLQNELND
YRRSSQYWYGLYFQDQVTFFDKLHLLFGGRQDWATDTRASASSAASYALA
QDKVFSDSKFTPRAGILYQIQPSLSIYGNYVESFGVNNGRGFNNEPLKPQ
TATQYEAGVKKEWMDGKFITTLAYYDLTKKNITTTDPLHPQFVIPVGEAR
SRGVEFDISGRITENLSVIGSYSYTDTEITKDNRGNQGHKLPNVPKHAGS
LWGKYDVTQGLLRGLNIGSGVFLIGKREGDNANSFELPGYVRWDALVGYR
WRTSKTELSLQLNVNNILDKTYYDASPAGATNIYPGIPRTFLGSIKIAL
>NE1761 hypothetical protein
MKDSSVMKLCSTLPVLMLASHLSYAMPGMHGTEGSGATEPPAASKDMGAP
INSSGSEIEITYSADGKTAIFVSTRAGSVESPGTPYNFDIWMAHNVDGVW
QEPIHLGGDIDPAVGPNINTTAWELEPSFSDDGNVIYFTRYEPGNLLSGD
LYVVQKVDGVWQSAKNWNDVPELPSINTPTGEEHCPIIVSDSLIYFNYSQ
PGVTQESDIWKVEKKDGMWQKPVSLGAKINSPQRDHMHWTGVSKDGKSMV
ITSTRIDPDSRGGHDMWISHQDAKGEWQKPINLGDIINTPGEEMCWTFTP
DGKKFTGSWGPQNTFDTDIRWISKEDIPLLKTFEPIGPPPNLLVNSNKG
>NE0374 putative cation efflux protein
MNLQIQIIAVFLAFVTLSVPGCKSGEEQEPPEKHVKAETDDANTIFLRPD
LRERIKTGKPFYADIAEKLSVPGQIKVNELKLVQVGASFTGRIVEIYAHL
GDSVSAGTKLARISSPELTQAQLAYLRANSLAVLAQRAAMRAQQLFEEDV
ISAAELQRRESELQVSNAELSAAKDHLRLLGMDNQAVQDLTKRGQILPSV
VITAPRSGTVIARDVALGQVVQPSDQLFTLADLSVLWVVGDVPEHAAHFV
ELGQHVEVRVPALGDTSFDGTIIFVSDIVNPLTRTVTVRTEVDNPERRLR
PDMLTTVNIDEHARERLVIPEGAVVRENNEDHVFIAREDGGFTLTPVKLD
EMINQVRPVLEGLTPDQQIIVDGAFHLNNERKRKDLE
>NE1703 DUF190
MNGYQITFFTQQDRHHAGKPLADWLMHLAAEMGLRGATLIPGSEGMGHDK
RFHSVHFFELSDQLLEVVMVVSEEEADQLFARLRAEGVRLFYVKVAAEFG
IMD
>NE1833 hypothetical protein
MWKLLNLTGICTLILVVTLAIMTYLAIDSHPRIEREISITPEQIARAKDI
LDTHRYQVRPGTSATVRIQADDLDNALNYLAYHLAQGHAKVTMHDKSAQI
QLSLPIPPGMITGYLNLQATLTEGKSFPELSSVSIGKIQIPDILAGQLTE
KLLAWLQTASPDARAGLDAFRKLRFSRNEVAISYFWKGWGIDKASYSPVS
LPFFDRQALDKLSHYHHFLNEQNRKRVSHTITLSEILTQIMQETVRHSPN
GNVLEEFRAAILVTAFHVVQFPLRLVIPETADWPDPVRINVTLDGRNDLA
MHFMASAVITAYSDTTLSNAIGLYKELEDSRSGSGFSFNDLMADRSGTRF
AEKAMASQDSARRMRNIILAGIHDTDLIPHWSDLPEHMSETAFKARFGST
SSPRYHEMMDKIEQRVASLKWLRY
>NE1217 putative sigma-70 factor, ECF subfamily
MCSTDSSKDIVPTADSAMQQVYFLYSDHHDWLYSWLRHRLGNAADAADLA
HDAFLRLIIKPAPRGFANFGEARAYLRTMAQGMCINLWHRREIERAWLDT
LAAWPEALAPSPERQVMVLEALHEIGTMLLDLPPKAARAFLMAAACQMTD
KEVAEALGVSSRMVRKYGFP
>NE1299 Guanylate cyclase
MKLWRWRFALAVFIGLLGAAVYLSPQGTAIEEQFGLYWLFHLRGERTAPA
DVVVIAVDQASATHLELPLKPSLWSRALHARLIDRLVEAGARVVVFDLVF
DRPGSDARHDKQFAAALKHAGNVVLVERLDFEETRFPGMAADGSYTHILR
EYTGKILPIIADAARAHAPFPLQKSSWVSYYWTFRPGGSDMPTLPSLLLQ
LSSAEVYEHFLTLLAGIDPALIRQLPASVNEADVENLVLDLRNLFMQKPV
LAEKLLQKSAEGENLDTAQQRILRALIDLYGGSEVRYLNFYGPPRTIRTI
PYHLLLDEGGGQPETAGELDLAGKTIFVGFSAETVSGQDKVRDDYQTVFS
LPDGLTLSGVEIAATAFANLLEGESIHPAAPVNGALILLLLGTGVCLLCF
VIPARNTMVHTLGMALAGILSFCAYGSASVWLFGQWQLWIPLAIPLMVQL
PVGVIGIVTLLYFEADRERRRIEALFGGHRPEPVVRDMVRGAGPIHTEGQ
LVYGACLATDAEKYTALAEAMDPRQLAALMHAYYECLFEPVERYQGKVSD
VIGDAMLAIWAAGSADPLVRERACLACLDIISAIERFNHQGGYGRPPLPT
RLGLHFGEMVLGNIGARQHYEYRAVGDIVNTANRIQGVNKHLGTRLLVSE
EVVNGLNDLLLRPLGRFLLPGKSQPVNLFELVARQSGSNAAQRQLCAAYE
QALHVCQTGNRVDALHRFEMIHERFPEDGPTRFMLTYCREPRLFPVREAG
ELIISIQSK
>NE2074 Heat shock hsp20 (alpha crystallin) proteins family
MAITRYEPWGLLTQLQRELERARDDMATEGASAIAEWAPAVDIKEESDKF
IVHADLPGVKPEAIDVTTENGVLTIKGEKQTEARTEKEGYKRVERTHGSF
YRRFSLPDTADLGAISAVTKDGVLVVTIPKREAVQPKKVSVTAQ
>NE0531 Phosphate-binding protein
MQQSLLRFDTGFLVAILGFALLSGIGTMVQARPLVKISGSSTVFPITEAV
SEDFQAAKKGAIMVTAGIAGTGGGFKKFCRDEVDIVNASRPILSSEMEQC
RQNGVQYIEMPIAFDALTVVVNPGNDWIKSMTVAELKKIWEPAAQEKILN
WNQVNPAWPNARIKLFGAGADSGTFDYFTEAVVGKAKSSRGDFTASEDDN
VLVQGVASDLNALGFFGFAYYIENHKKLKAVAIDNGSGGILPSMLTVQNG
SYQPLSRPLFIYVNARSAGKPEVREFVIFYMQHAQQLVEEVRYFPLKKEF
YDFNLKHLQNKTLGTVFGGEAGMGISIEDLYKRERVQ
>NE0602 hypothetical protein
MEKELEDRKLAELMNSIRHYSTLRFAMLTVYFAVTGGLLVKFFDCDFSVR
YPELHGLFQIAGSMVTVAFFIFEVALDDNLRKLWGSVKKLAGEGDVLLSH
RQLWKGCLVPMATYGIFVGVLIFWLFTSRNYYPCQAAAHKAVQSETVISK
ECRK
>NE1318 SLT domain
MWRLIYSLLFLIPGAAMAGAQVYEPLSASTRTVLHRTISDQAVPMLVTLP
GPEAKDWLDSMTKRLEKRISDIEERRAFLITVHYEATRAGLDPELVLSII
QVESNFRKYAVSSAGARGYMQVMPFWVDTIGNPEHNLFHLRVNLRYGCTI
LRHYLDLEQGDYFRALGRYNGSLGKAGYPNLVFNAWHRHWTYSTASQ
>NE0447 conserved hypothetical protein
MLKQNETKTSMILNYRWLYDTVRKRFESDEAMEAFLPKALTPATLKQKGD
DRYLSAMSQRVFQAGMQHSVVNAKWPAFEEAFWGFVPETMVMLSPEQIEG
YMKNSSIIRHYTKLQTIPRNAQFILDIRQEQGCSFGEFIADWPSADIIGL
WRLLAKRGARLGGRSSAGFLRLAGKDTFLLTSDVTARLIAAGIIDHEPTG
QRDRQIIQDAFNELQQDSGRPLCQLSAMLSLSINPRF
>NE0116 hypothetical protein
MTIWLVTRHPGAIEWVARQGIQWDKHAAHLDPCEITAGDTVIGSLPINLA
AEICNRGARYFNLSLNLPAHLRGRELDAATLTACEARLEEYIVKKVNS
>NE1846 HD domain
MNRCIIDYKTIKSAQPEIISRIIEINEESPNCFREMDHLLRSDQVVASFI
LRVANSPIYNRSQPIRTLPIAISLLGINIIRSLVILAFSRSVFSKSKNAL
VCKHIWHHSLLTAIASYNICSELGKAEDCEEAFVAGLVHDIGKVLLFTHV
RKDYMDALTYALENACSSKEAEQKFMGTDHYQIGEQAVREWKLPPQFQLY
VGTDLDQLLPGQIDNEILLWLAIANSLIKGAGIGTNPCDPVIRKEKLMAY
GLNEEFSDHLLDEAFFQKLINDEIFQFCAHDRC
>NE1238 putative oxygenase
MSHFPIVIVLTGLLLLSGCDALEPEISCLSVLMQGDIQHVAKTREQRFLG
KVTGRRAHCLGGDHAVALNRNPWLDWPNFWGTGDSLSLSSSPLASSFFGP
NERGINSALYELELQRIELIKFNLFDNSGTYQAYVTGRDGRAGPVLQVWP
EMQLPPTHPRYKDVEHNQEHQVCSGELIRFRTVTGICNDIYNPLMGSTHQ
IFARNVQFDTTFPDLGLDEMARNRHGDRLGLLKPDPQVISRKLFTRTQSQ
PDKCRNDDELSGDLEKFACDYKKAPALNVLAAFWIQFMTHDWFSHVEEES
DQSAWMTVGCITQRIDNIEQPLAAKEARQLGCRPGDRIHVAPIDDDTPPA
SFMHDGHLYRTRAPKTTRNHVTAWWDASQLYGYDERSSQRVKRDPEDAAK
LALIHVRESVDRGDESGYLPTFEVDDPIDPAWSGQEAAAFPDNWSIGLSF
FHNVFAREHNAFVEEFRKQAAKTPDADSGLRNPAHPEMIIRYRDVTAGEL
FNVARLVIAAEIAKIHTLEWTTQLLYNEPLYRGMNANWHGLFHEHAAVSE
VLREIIRQLDDTEGISNSLHAAFAGGAGIFGLGNHRYEGAPLYSLVDRNR
KDIWTLTRNEDINGGVNHFGSPFSFPEEFVTVYRLHPLLPDLIEYREWHN
NPNIIRQKIPVIDTFRGKATGAMRQKGLANWALSMGRQRAGALTLQNHPR
FLQNLKIPHLQSSTRQIDIAALDLIRDRERGIPRYNEFRRQYGLKQLTSF
DDFIDPRVPGDSSVRREQEQLVRTLREVYGQHRCDASRLITNAQLNDDKS
PINDCLGHPDGSLVDNIEDVDTVVGWLAEFKRPHGFAISETQFVVFVLNA
SRRLFSDRFFTSSFRPEFYSILGVEWVMHNGPGPEIMEEGTYNGHRQPVS
PLKRVLLRTLPELADELQGVVNLFDPWARDRGEYYSTQWKPRRGAEGDEV
FTR
>NE1269 hypothetical protein
MIEAPFNKAKYAIFMFYGIDDGKQVISSYPIFGQTGTSSSYTTGTITSYG
NTAFYSGTTYKTPTRGVVGSRTSTDTVFKRYLNIDIIDIAKSGNGKVQKV
YEGKAISSGTNGQLAPVMPAIVRSVFEDFPGKSGASRTSRQPVEK
>NE0908 probable hydrolase oxidoreductase protein
MAGTMTVKLETRNQHRCFDGVQSYYQHHSEIIGLPMRFSVYEPPQVQEAG
GQRLPVLFFLAGLTCTEETFMIKAGAQRYAAELGLILVSMDTSPRNTGIP
GEADDWEFGTGAGFYLDATESPWSRYFHMESYVTRELYDIILDRFPVDPE
RVGIFGHSMGGHGALTLALRHPGHYRSVSAFAPIAAPTQCLWGQKAFSRY
LGESPANWRKHDATALIESGCRLPIPLIDQGLNDPFLKDQLHPGYFEAAC
QQAGQSVILRRHAGYDHSYFFISTFIEDHLRHHYTWLTSNGQ
>NE1691 Glucosamine/galactosamine-6-phosphate isomerase
MSSSHLAERHAFSDLSLLSEALAQSVTVTLRNAIERYGKASLVVPGGHTP
VVYLPRLAQMDLPWQQVFITLSDERWVEPTSEQSNERLVREHFLQHMQQQ
PHFIALKTGHIHPDQAITDIDARLAELPQPFSLVILGLGEDGHIASLFPG
MALNPDTTSLCQTATPPAAPSLRISLSLHALINSDRIILVITGKEKRQLI
DRLIESPDQNIPFVRLMQFKPVELFETD
>NE0932 putative isomerase
MPYVNIRLAGTLTREQKQQIATEITDTLERIAHKPKSYTYIAFDELPHES
WAIGGKLLGDDK
>NE0148 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
MSSNNVPASRRKCVGVKVGSVMIGGGAPIVVQSMTNTDTADEVSTTQQVA
QLALAGSELVRITVNSMEAARAVAGIRARLDDMGCHVPLVGDFHFNGHKL
LTAYPECARALAKYRINPGNVGHGRKRDEQFSLLIETACKYDKPVRIGVN
WGSLDPEMLARIMDENARSPDPLGASQVMHKALITSALESAARAEELGLA
RDHIVLSCKVSGVHDSRI
>NE1724 PHP domain N-terminal region:PHP domain C-terminal region
MMRSHVITHFSMPNIDLHSHSTISDGMLSPSRLLAHAAVRGVNVLALTDH
DDIAGLSEASRSAQQENITLIRGVEISVTWHGRTLHILGLGINPEHPPLT
EGLKKIRDGRMDRARAIAAQLDKFGIHGSFEGASAQAGISRLIGRTHFAR
FLVSQGYAKNVKSVFKKYLVKGKPGHVSHVWVSLDEAIGWIRGSGGQAVI
AHPARYKLSNDLLEQLLCEFRELGGAGIEVVSSSHTPEQTRQFAALATRM
NLYASCGSDYHGPGESYFDLGRLPALPPECTPIWNEWEIPDYAETGATTL
NEQSEQTGLQSPGKSV
>NE0282 conserved hypothetical protein
MNLAISFYILLAVTVALLVIGVMIYNSLVDIKHTVSQAWSNIDVLLKQRH
DELPRLVETCRHYMKFEQETLSRVMEARTSIAHARETGNLPALGAAESIL
RAGLGQLYAVAEAYPELRASEKFQHLQTRITSLENGIADRREYYNEAVRN
NNIRIEQFPEILIARWFGFSARSLLKFTTINKHDHDLGKLFD
>NE0940 putative DNA transport competence protein, ComEA
MYYIWPKRLKLRNPEALFLNFCGDSEMKKIFLILVIFFGFNLSVLAGVDI
NTASQADLESVKGLGPVKAKAIIEYRNKYGMFKSVEELANVKGIGAGILK
QLGDQVSVQEGAVLTETKVD
>NE1208 HlyD family secretion protein
MKMGSLSRTVKRILIITVILLAAGSAVFSWVLHTEQDESLLYLYGNVDIR
EVQLAFRQPGRVAQMMFDEGDAVVAGTRMATLDAQPYQEAFATAEASVRV
AQAELAKMRRGLRPQEIIQAQEALNQAQAVARRAERDYVRLSRLLPSKAS
SQQDVDNARAARDQAIAGVKAAKAALSQAREGFRKEDIAAAEAQLAAAMA
TRDQAATALADTELFAPSDGTVITRIREPGSMVVSQSAVYNLSLDKPVYV
RAYIGETALGRIAPGTPVQVRSDSSKKIYHGQIGFISPRAEFTPKTVETT
DLRTDLVYRLRIVVSDADAALRQGMPITIEVDTRPAADLSAKDL
>NE2173 Glycosyl transferase, family 2
MKLSIVATLYQSAPYINEFHERASATARRLVGDDYEIVLVNDGSPDNSLD
LAVKLTEQDSHVVVVDLSRNFGHHKAMMTGLAHAKGERVFLIDSDLEEEP
EWLETFVQQMEQESCDVVYGIQERRKGSWFERWSGQWFYRFFKALTGLAL
PENVVTARLMTRRYVNALLRHEEREVFMAGLWYITGFDQRPYVIKKHNTS
ETTYTFRRKMSLLVNSVTSFSNTPLVSIFYIGISISLFALAYIAYLVTHW
LFLAKPLAGWTSVMASIWLLGGMIISFIGVVGIYLSKIFSETKQRPYTIV
RQIYAKQQD
>NE2188 hypothetical protein
MQSGFVGIFDSGVSGLAAYRAARALLPHQAMLNQYLAPAVSMASDVIVLG
CTHFTFLADQKTAAHNKQANQLLPEQYYLLQGAALNRSKSIGDTI
>NE0151 conserved hypothetical protein
MSAYNFEEQEKIEGLKSWWAANGTTMLFAIAVFAATVAGNRLWNHYKAEQ
AQQAADLYAVLQQQVEKGSELVKITDAAHLLTEGFPGSGYASRAALIAAR
AAGQAGNDQVARDMLQWALDHAEEPEIKDMARLRLASVLVDESKYDQALK
LLDAQHAASFTGLYADLRGDALAAAGKTDEARAAYQKALDSLNAQGAYRN
VVQMKLDVLREQRQ
>NE0752 hypothetical protein
MISTMQAYALVNSEEWVFNTDIPSALSSFTEIIPLQSKVMLELDLIKELS
SNYKIENETHRALAGVFQTNYITQPREDYLLAFRLRQFCVYKLGKASKDG
LLKFLGIVAAGYALTPITGPLAWIAPGIAGWEFIKSIVLAYERIEDPDEK
MVFETIYILERRPIIVDYRAYEKEDFLNAYGHAWPNIDDLNNELNGKLTE
KELKKALVSLKARGIISNK
>NE2302 rRNA_methyl_2: RNA methyltransferase, TrmH family, group 2
MLQIILHQPEIPPNTGNIIRLCANTGAQLHLIRPLGFDLSDRQLRRAGLD
YHEWARLTVHEDLASCLKLMDGCRIFAVTTRATHHHGKVAYTGNDVFLFG
SETRGLPAEVLENFPQDQRIRIPMLSASRSLNLSNAAAVIIYEAWRQIGF
EGGI
>NE1096 Sigma factor, ECF subfamily
MPPPDIHCKQVETLYSDHHRWLRGWLCKKLGCSEQAADLVHDTFLRILNS
GEVLTGLTKPRAYLTTTAKRLLIDRARHRRVEQSYLDELALMTATMEGAP
SPEAILMALEALKQLSDALQSLPANASEAFLQHYLLDETQPAIAARLGVS
VRMVQKYLAQALLCCHRALTD
>NE0603 possible transmembrane protein
MIHKLPKWVEIGGFLLSFNAGYVNAIGLLGFEHQAVSHLTGISTFLSLEL
ANHNMQAVVHLLLVMAGFIIGAAYSGFIIGNVALKLGRNYSLALITESFL
LGISMLLLNYGSPVGHYFASAACGLQNAMTSTYSGAVVRTTHVSGLFTDL
GVALGLRVRGQPADTRRIVLYLILIIGFISGGVAGAVCFGQYRFSAILVP
CIVTTLIGAGYWLFTHQTYWRLK
>NE1926 Prokaryotic-type carbonic anhydrase
MRTLTREMQAHLSPEQAIQLLKDGNQRFVSNLKLNRNLLQQVNETSEGQF
PFAVILSCIDSRTSAELIFDQGLGDIFSCRIAGNILNEDILGSMEFACHI
AGSKVIVILGHTKCGAVQGVCHGIKLGNLTALLDKLQPVVDAELSGKNIS
DISDPEFMENVARLNVHYVINEIPKRSQVIAEMLGNGKVALVGGMYDVDT
GIVTFYDK
>NE1668 DUF179
MKPEANSYLFQDGVIVMQSINLTDHFLIAMPGLEDSFFARTLTYICEHSE
RGALGLVVNRPTDLSVENLLLQLGMFPKGTASSNLPVLLGGPVQIDSGFV
LHEPVGSWKFTLSSNESIGLTSSADILQAVADCEGPKRMLIALGYSGWAA
GQLEQELAQNAWLTVPAESQILFELSSEERLPAAMKLLGIDFCNLSSEVG
HA
>NE1004 conserved hypothetical protein
MKLHIITCSTRPGRIGPYVAKWFSEVAVQHGKFEVVPVDLAEFNLPVYDE
PEHPIRQQYQHVHTKNWAASVSAADAYVFVTPEYNFGPPPSLLNALNYVY
KEWNYKPAGIVSYGGISGGLRSAQILKQTLTTLKIMPMMEAVAIQNVSTL
ISEHKQFMPSEHHTSSAVTMLNELYKWAQALKTLR
>NE0527 FtsJ cell division protein
MKSARTSRAWIKAHINDNFVRKANHEGYRSRAAYKLREIAEHDALFVPGM
TVVDLGAVPGSWSQVALESVGPTGKVFALDMLDMQPLPGMTFIQGDFREN
EVLAMLEAALGGKRADLVISDMSPNLTGIRVSDQAQGMYLAELALIFCRE
HLNPGKNFLVKVFQGSDFEAFRQMMQTDFSKVVIRKPKASRDRSKELYLL
GLEKII
>NE0356 Maf-like protein
MITPAGNCIYLASRSARRRDLLKQIGIRHNILLMREALSRPADVDETPLP
EESPADYVYRITHTKSEAGWLRLKQRGLPLLPVLAADTTVVLDGRILGKP
QDIGHAEEMLHALSGQEHQVYTAVGLTFQGQTRLRLSTTTVRFRDISPRE
IQAYIASGEPHDKAGAYAIQGKAAAFIINIDGSYSGVVGLPLFETSQLLE
ETGISVF
>NE1355 Cro repressor helix-turn-helix motif:Helix-turn-helix motif
MSKKTPLNEIAEFDVSDYLRDDEDIAEYLTQVLAEGDSNELLRAIGYIAK
ARGITQLAKDTGLGRESLYKAFRAGSKPQFETVFKVLHALNINLKAIPKE
VVSTRV
>NE2076 LysM motif
MISLNFSVVAKLLLQALQGLIKGLFFPIQWKHIFILNNQIFFSSIRYYGI
CLFFQYDGLSIRSSYQVCDKTIYWLLPVYALFLLIGSDGVRAEDWLYTIR
SGDNLWNVAERHLVSMKYVPRLQQLNRIHDPHHIPPGKVIRIPVEWATRR
PGDAEIVDCYGTATLRRAASAEILPVTEKLRVAIGDEISTGPDSQVTLEF
RDRSRLRVESESKIRLKQAEVLGQDGVVMTEVELESGRTVNIAPHGSEPA
TRFRIRTPAAVSSVRGTRFRVGADKQDGTTRSEVLEGLLEVSAQGRNVQV
REGYGTVTRPDQSPVSPVELLPGPDFSATPDLYERVPLIISLKPLAGARS
YRVQIALDPEFRQISTDLVAMNLPLRGRELADGDYWLRVRGQDVSGLEGK
DGVKKIRVHARPEPPFIMAPQSGARVGDNRPVFEWATRPDIKRYLIEVDR
TADFRESRKYESDSPEGYFMLSEPLTPGEYWWRIAAESEIIGVGPYSDSV
SFKVPVPGPELDSVEIEREEIRFAWPAGETGEQFQFQMARDSGFDNIISD
VLVSEPRVVIPNSGGGRYYLRIKPIRADGEEGVFGPVQSFEIPYRFPFFL
PFLGFM
>NE1142 Domain of unknown function DUF219
MLITEAFAQTANGAAQTSDPTLMSFLPMIGILVIFYFLLIRPQTKRAKEQ
KEMQNGLQKGDEIVISGGELGRVVSAGDSYITMEIATGVEIIVLKSSVQT
LLPKGTLKSVETGKSNRALKNSRQKAGESRQDEPEDNVTETSEVESSSAP
SKQDSEKN
>NE0436 adenine specific methylase, HemK family
MNPTRQPNDTPGDTPIRQAAEELRTLRDLLRFTVSYFNRSDLFLGHGLPT
AYDEAAYLILHTLHLSADQLEPFMDATLTRNERDQILRIIERRVNEKIPV
AYLAHEAWLGDFNFYVDERVLIPRSFIAELLQDHLAPWVMDPYDIGTALD
LCTGSGCLAILLAHAFEQAQIDAVDISSDALDVASINVRNYDLINRVGLI
QSDLFAELQGKRYDLIISNPPYVNAASMAMLPEEYHHEPSVALAGGSDGL
DIIRRIFKEAAHHLTEDGLLIMEIGHNQAEVERAFPQLPCTWLETSSGDQ
FVLMVRSGDLPA
>NE2285 conserved hypothetical protein
MTTQPSKQLETFENPVQTRDYRIHMEIPEFTCLCPKTGQPDFARLTLDYI
PDKKCIELKSLKLYIWSYRDEGAFHEAVTNRILDDLVAAMKPRFIRLTSK
FYVRGGIFTNVVAEHRKKGWQPQPPVLLEVFEQQFNTHG
>NE1751 putative type-4 fimbrial pilin related signal peptide protein
METGMTEAVSKGFTMIELMLTISIASILLAMAVPSYQSLMRESRLTTQAN
ELMTSLHYARSEAVKRGMRVTICKSSDGASCTNGSSWQDGWLIFSDAGTA
GMVDGGDEVLRVFPGLNGSTLGAGGNFANWVSYLPNGRSQGNTGLPNDTF
RLCNQASGRNVIVNNAGRPSVERVSPC
>NE0472 ABC 2 transport system integral membrane protein
MRQFPATPVEMFTSLWRNRELILASAKREILGRYRGSVLGLLWSFFNPIF
MLAVYTFVFSVVFKARWSVGSESKTEFALVLFAGLIVFNLFAECINRAPA
LILANPNYVKKVVFPLETLTFVSLLSALYHALISLGVWLAAYVILFGVPH
ASALYLPFVVMPFALFILGLSWILASLGVYLRDVSQFIGVITSVLMFLSP
IFYPISALPESYQHLLYLNPLTPVIEMVRDVLYWGKMPDFLLLALYWLVT
GIIAWLGFAWFQKTRKGFADVL
>NE1052 Outer membrane lipoprotein carrier protein LolA
MTRLLFVLVLSVCLLPVPVKASAVKSLKTFVNKALTFQANFSQTLLDKNF
QVIRKASGSMMFERPGKFRWTYDQPYQQLIVGDGKQVWFYDQDLAQVTVH
RLDQALGSTPAALLAGGNTIERDFNLQEIDVQGETEWLEAIPKNQENSFE
LIRLGFSKTGILREMVLRDSFDQVTWLIFSEIEQNPTLTPDLFQFTPPEG
VDVIRD
>NE0251 conserved hypothetical protein
MATPADKLAESLAVLKTLRDQGRKALRSEDMGRTHRERLMRNGFIKEVMK
GWYIPSRPDEPAGESTSWYASFWVFCASYLESRFGDEWCVSPEQSIHLHT
GNWSVPKQLLIRSPKGGNKPTGLLHGTSILDVRLELPPASDTEIKEGMRI
YNLPAALVGCSQTQFSAHPTEMRTALTMMQDASELLGRLLAGGHSKIAGW
LAGACRSIGRKQIADDILGAMRAAGYTVNENNPFKDQAQIIFSPRETSPY
VNRMRMNWASMREDVLHSFPAAPGLPADSAKYLKQVEAVYVNDAYNSLSI
EGYKVSAALIEKVRSGNWNPDSNKDDQDHRNALAARGYWQAFQKVRESIG
KVLSQENAGTVAESDHAQWYRELFGPSVTAGILKAADLAGYRNSPVYIRR
SMHTPPGKEAVRELMPVLFELLQQENEAAVRVVLGHFMFVYIHPYIDGNG
RIGRFLMNLMSASGGYSWIVIPLEQRNDYMAALESASVEGDIKPFSTLLA
NLVSTG
>NE1875 Esterase/lipase/thioesterase family active site
MPNEQKRFVTGPAGRLETVVTLPEGAPRGLAIVAHPHPLYQGSMDNKIVY
ILSRAFIEQQYITVKFNFRGVGASEGSYAEGKGEIEDVMAVTQAMREQYD
TGPEPLPLTLAGFSFGGAVQAHVAQQLKPSRLVLVAPSVERLQAPPVVDH
ARHILVIQGDQDTIVPLQSILNWAAPQTLPVTVIPGAEHFFHGKLHVLKN
VILQSCSISQAATSLYP
>NE1513 hypothetical protein
MNELSIQLQPSSRLAVLLSLAHCTAAGVFWPLALPVVVKLIITLLLAGSL
YYYLRRYAWLTSPRSIVALHLTGRNSCRMKTHADEYIDTVVDTSTFVASY
MTVLYLQKERTRRYYTVVILPDSIDANSFRRLRVWLRWKWQDSSSDGRKR
G
>NE0494 hypothetical protein
MKTSFKRPFAQYVKKATKPLRLAIEDEVEMICETPEIGELKAGDLADVRV
YKFRFNQQEYLIAYRSPTRNTPVEFMIIDFYQIGTHENFYDKLKQYLRHD
KNPREI
>NE1428 Hypothetical hesB/yadR/yfhF family
MGTTIHEETSAAQPPLNFTDGAASKVKELIEEEDNQALKLRVFVSGGGCS
GFQYGFTFDEIVNEDDFVMEKQGVKLLVDSMSFQYLVGAEIDYQESAQGA
QFVIKNPSAASTCGCGSSFSV
>NE2033 conserved hypothetical protein
MSEESSRLGQCDEGKASWKKWGPYLSERQWGTVREDYSDNGDVWNHFPHS
QSGARAYRWGEDGLAGISDDHQLLCFALTLWNGRDPVLKERLFGVTNLQG
NHGEDVKEYYFYLDATPTHSYLQYLYKYPQAAYPYEDLVTTSERRSRQEA
EYELLDTGIFDENRYFDIFVEYAKADPEDILIRISAVNRGPETADLRILP
TLWFRNTWSWAPGLAKPALYQEENQDDCRIIHTQQNESGDYRLYCADAPT
LLFCENETNTCRLFHTDNASSYTKDGINNHIVCGQKDAINPANHGTKAAA
DYALNIPPGETRVLRLRLRRAESSAPATDKIFAGFDTLIDQRKKEADDFY
ASLSGGRLNKEQQRILRQALAGMIWTKQYYEFDVERWLNEHPRHNARNAG
WAHMKCRDIISMPDKWEYPWFAVWDTAFHTLPLAMIDPAFAKQQLGLFLE
NRYQHPNGQIPAYEWNFSDVNPPVHAWAVYMVYQVCQDYHDQNDLSFLKS
AFASLERNFSWWETHREPDKNVYEGGFLGLDNIGVFDRSVELPTGGHLEQ
SDATAWMTLFSQNMLQIALELSLHDPDYEQRVLSYLNRFMATAAAMQDIS
DEHRDMWDDEDGFFYNVLRFPDGHSTRIKVRSLVGLLPLCAVTVIERTTL
DKLPLVAEHFENLVRRRQFLADHIFCPTTPGVEGRRLLAIVDEEKLRRIL
SKMLDEQEFLSPYGIRSLSRYHLEHPYQFHWNGQTFTADYQPGESTSNMF
GGNSNWRGPVWVPINILIIRALLTLYAYYGEDFQVECPTGSGKQYNLFRI
ARMIAGRLLHIFLPDEQGRRPVFGNTEKFQIDPHWRDNLLFYEYFHGENG
SGLGASHQTGWTGALAALLTIFGSLEQEELTELGMQEISAILAGNNGI
>NE2539 hypothetical protein
MSAHKWQFASRFRRHAFGWRSDTPVQRIKEAITEIKQVARKEPVLAAEGA
ITLLEKLSPALEQVDSSSGALGSAVNKAIDTLVPIIVKADVEPKLRQRWL
ERLWQALQDDEMPYIEVLGDYWGELCVTPELASHWADEFLPVVESVWSPK
ASGHGFFKGTSACLASMYAAGRHQELWALLDKAPFKWWHDRRWGVKALAA
MGKKAEAIRYAEESRGLNDPGWQIAQACEEILLSSGFLDEAYRRYAIEAN
QGTTNLATFRAIAKKYPHKQPEEILRDLVASTPGAEGKWFAAAKDAGLFD
VAIELATRSPTDPRTLTRAARDYAEKQPAFALAAGLAALRWISLGHGYEI
TGTDVLDAYSAVTQAAVNAVVPTQQVNEQIRDMIASTQPGNSLMKTILAR
HLAN
>NE0815 hypothetical protein
MSDRVIDSIDDYLSPNSQIQIIVNRYRSLFTWFKIMSLPDRLSCLSIEHC
GCLQLIAGEILIQAVSGVRSLERPFVIDIEHGRLQARGMVGVIRALFL
>NE0186 conserved hypothetical protein
MDINMAHSLISCDLHDYIEVACMYSYQVRLILKDQSTVEGKAKDILTDAE
KREFLLLETESGSQQVELISLDRLQVLTPGARFSEVVFSDTCEP
>NE1026 hypothetical protein
MSGFCHLPDKPSLIMLVLIFSGVSLGGCTTQQSRSMVPKQSAGLKATNAG
KTRNIEVIPIPEKLPCQRTCRDEIIALKQKLAEKDELIRNLNAREQDQAQ
VLQETTSEISRTKNKLHRLATQPEAASKIAEVEIAINAARQAVFDESDAV
FQLLAQRLLDAAMVAYRQKDYSGAMNHAAQSGELIDAIANPARKALESQD
ATLIFRISIPLLVTRTISLRADPTDHSRVIGSLEKDTPLTATAYSGNWLR
VQTRDDLSGWIQSQSVDVRVDSQNFQK
>NE2369 Homoserine dehydrogenase:ACT domain
MKPIYVGLLGVGTVGGGTYTVLKRNQQEITRRTGRNIVIRMIADLDQEKV
RHLTAGDDVIVTSDALEVARHPEIDIVIELMGGQTIAKQAILEAIAHGKH
VITANKALLAIHGNEIFAAAQQRNVIVAFEAAVAGGIPIIKALREGLAAN
HIEWIAGIINGTSNFILSEMRSKGLDFTTVLAEAQQLGYAEADPTYDIEG
IDAAHKITIMSALAFGIPVRFDQTYIEGITKLANDDIRYAEELGYRIKLL
GITRRTANGIELRVHPALIPVKRLIANVEGVMNAIIVKGDAVGPTLYYGA
GAGAEPTASAVIADLVDVTRLLTSGPEHRVPVLAFLPELLSDTPIVPMEA
VETAYYLRLQVLDKPGVLADITRILADNHISINAMIQKEHADKEDKVNII
MLLHKTREKNVNTAIEKIQNLPAMTDKIIRIRLEELGS
>NE0456 Esterase/lipase/thioesterase family active site
MKNSAAIAWKFLSVPAQPGKMKSGRGWRQLAYTDWGNPKNEHVVVCAHGL
TRNCRDFDFLAAALEQDFRVICVDMAGRGRSDWLKEAEDYNSAATYVSDM
EHVLEHVYRQNDSDSFRIYWVGVSMGGLIGMLLAARQRPAVSYRFRTLVM
SDIGPHVSSGILSLFATTIGKDPRFRSLSELESHMRATALPYSPLTDTQW
HHLALYSAREYEDGTIGYRYDPAISSGFRPDRIKDIDLWAYWNRLDLPVL
VLRGEKSGVLTPETAGEMQLHRSNVQITELAGIGHAPMLMDADQINLVRD
YLLKIRNRTE
>NE1955 mce related protein
MENRAHALAAGLFILLLGVATVMAVKWFSRDSVSYNHYFLVSAGGAVSGL
NPEASVRYRGVNVGKVEEIYFDRENIRNIIVRIAVNHNVVLPGDIYAQLA
SQGITGLTYIELNDDVADAETKPLQNEARIPLRSSLIKTLSDSLEEILKN
SNMAIRQISNLLNEQNQTHISSILSHLEQAVQHYDTLTGQLQKGLQTLPQ
LSSELTSTFKQTRQVMENTSQVLQKLNQQQGLVDNLTQGSLEMTRTLTSL
NETGVAITQSAHKLDQLLNLLEDHPQSLVFGKPPALPGPGEPGFSPPQPG
K
>NE0009 MgtC family
MNPARHTMDDLFLLEQERGYLAQFATSLAIGLLIGLERERSPAAKAGLRT
FALVAMFGTLTAMLSHKAQTPWLLISGLLLVGIMVIASYRDKRDLPEDPG
TTTVTAVLICYGLGAMVWYEESTLAVMLAIITTILLYFKTELQGITQNLT
RKDLISILQFAVLSFIILPILPDRNYGPFDAFNPHQIWLMIVLISGVSLV
GYIALRFIGQRYGAVLLGVLGGLVSSTATTLVFTRRNGDRPDITNLAVVV
ILLANLVVLVRLALITEVISPVIFPYLLPVLGGGLFLGLITSLFWWREFS
QQQIIPMPDTKNPAELTTAMGFGLLYAAVLFLSGWLSDIAGSSGLYAVAI
ISGLTNVDAITLSSLRLHGLGKLEIVEVVTAITLGVIANLIFKLGLIFFT
GNNVLARRCFLGTVAIIIGLAGALSTASYLSYF
>NE2244 ATPase component ABC-type (unclassified) transport system
MSSIVRIENVFKEYTLGTSRVHALKDINLQIGNGEFLAIAGPSGSGKSTL
LNLIGCIDIPSSGKIYIDDRDTSGQTPDQLSELRARTIGFIFQTFNLFSV
LSAEENIEYPLLQFKELRAAERRERIVKFLDIVGLAEFADHRPNQLSGGQ
RQRVAIARALATHPKIILADEPTANLDHKTGEGILQLMKDINRNSGTTFI
FSTHDAKVMAMAERLIRIEDGQLVGEDAVHT
>NE2469 Transposase IS4 family
MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV
LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF
EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARRGKTSMGWFYGFK
LHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGLFGQLFADKGYLAQWLT
EALDQQNLQLITPLRKNMRPVPRTRFEKVILRRRSLIETVFDELKNLCQI
EHTRHRSLFNFIVNLMAGIVAYCLSDNKPTLNLTRVNSLAKA
>NE2411 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNNYPQQNGMVKWVTGH
>NE0346 possible cation transporter transmembrane protein
MNYMMKHRYQKIIIAIALILGSAIAGISGYFHSRQAAVHSETNNTNSSHN
PDHLVFPANAPQLAYVKTSRAVISPLPASEPLTARLALAENHTARIIPAV
AGRILQLHAEIGDQVSMGAPLATLDSPDFGTALADLHKAEADVLRKHLTY
KRAQELLAGDAIARRDVESARAEWQIAAAETERARLRIRNLSPNSKIQNE
QLVLRAPVAGIVVERQANPGTEVRPDQGVPLFVISNLSRLWVQVDVPEHL
IGRVHTDNPLLLSFDAFPGESFFATITRIAPQLDPNVRRVQVRAEIDNPD
GRLRPAMFARARLADSQAVNVIRLPAVSVITHGIYPTVFIETAPGQFVQR
QIRIAFQDAESVWLEPENSSVQAGEEVVTDGAMLLASELAAGN
>NE2019 Glycosyl transferase, family 2
MNSLPGSIQTSRTNQQRPLISVIVPAYNAEDFILEALRSITAQDYEPLEI
LLVDDGSTDGTADLVRREMPTVRIIRQDNTGVAEARNTGLRNACGEFICL
LDADDGWCPGKLHAQVQYLQQHPQTGAVYHAWQVWRPDAEGNFVSMPAPV
VADSTAIDPALSGWIYPQLLMDCIVHTSTIMMRREFVEQLGFFRAELING
EDYDYWLRLSRLTRIDKLVGVYSFYRGSPGSLTNSVKPVNYEYNVVSAAA
AKWGLAAPDGRALAAVDFQRRLGKLAFDFAYRHYRWGSARIARRAALQAW
RHDPRRYKALLFWLASFFKREPRSAVQG
>NE2041 hypothetical protein
MLNIFHRFPEVTGNPAHRKRDYFIAAFTFFCVAVSLVIDAHGTLLLQNVL
GVIAWIFLVALLRGENREIRMQVVIAVAFATAGEHFASIYMGGYTYRLEN
VPLYVPPGHGMVYLTAVTLARSGFFLQHARKIAAFVVISCGLWSAWGISG
LPEHGDQVGALLYVVFLIYLFKGRSPMVYLAAFFITTWLELIGTAAGTWQ
WATLEPIFELTQGNPPSGVSAWYCLVDAVAISGAPVFLNAFNRMNGLLKW
LKTNGISLKVILSRK
>NE0492 putative transposase
MNHDHGYKLLFSHAEMVADLLRGFVKEGWVNELDFSTLEKINGSYISDDL
RERQDDIIWRLRRKQGKQDEWLYVYLLLEFQSTVDWFMAVRIMTYIGLLY
QDLIRSESIKTGEQLPPVLPMVLYNGDHRWQAPVNMGELILPAPGGLDRY
RPQLNYLLLDEGRYHEHELVALRNLTAALFRLENSRTPEDVQQVLQALIT
WLHTPEQDSLRRAFTVWLKRVFLPGRMPKTSFDEIHDLQEVYSMLSERVK
DWTKDWKQQGIEEGKQIGIQEGKRIGIQEGKQIGIREGRQEGRQEGRQEG
RLEGEVEFFLRLLERKFGPADEITQTRIKSADSQTLLRWGERILVAQTIE
EVFEE
>NE1173 conserved hypothetical protein
MSRIFLALGAVNAFLCVMLGALGSHGLKSILAPDILTTFQIGVQYHFYHA
IGLILVGLAMDRLPQARALKFSGILMMTGIVLFSGTLYVVSLTGWRGLGM
VAPLGGTSYMAGWLLFAWATWKNKSA
>NE0847 conserved hypothetical protein
MKILPSLFSPAGKNSKLSIVIYHRVLPEPDLLTGEGGIAQFEKGLSYLTN
NFNILPLSEAVNKLRSGTLPGRAACITFDDGYADNAEIALPILQKHGVTA
TFFIASGFLDGGRMWNDTVIESVRRAKGDKLDLNAIGLGNHAIATLEQRR
ETLNLLINKLKYLPHEARASQVDKLSTIIAADLPDNLMMISAQVRQLHDA
GMEIGGHTLTHPILASIDDAAARTEIAEGKAKLEAIIDTPLRLFAYPNGK
PGKDYLSAHVKMIKDAGFEAAVSTAWGAARQDSDMFQLPRFTPWDQSEWR
YVLRMVRNMRQRIQTV
>NE0059 HPr(Ser) kinase
MSQVSITQLFEENQEKLNLQWGEPSAVIDRQLENHQINNSTQELIGHLNF
VHPNWIQVLNQTSVNYLDQLDDVSLKKRLNQLAKSQLACLIVADDAPIPN
AIRQFVNEQSVPLIQSATASLEIIWRLQSYLARMLAPAITRHGVLLDVLG
MGVMITGESGVGKSELALELISRGHGLVADDVVELHRIGPETLEGQCPPL
LRDFLEVRGLGMLNIRTIFGETAVRRRKNMKLIVHLEKTVGSSINAYERL
PLSNLNEIILNVGIRKVIIPVAAGRNLAVLVEAAVRNYILQLRGIDSTQE
FIRRHESEMAGNTAEHFDDSHNE
>NE1755 Formylmethionine deformylase
MIKPVLKMGDPCLLQPARRVDQFGTPELEALLQDMQDTMAALNGAGLAAP
QIGVSLQVVIFGVEHSPRYPDAESVPFTVLINPVLTPLTEQMEEDWEGCL
SIPGMRGLVPRYTRLRYQGVDAAGASIDRTVTGFHARVVQHECDHLNGIL
YPMRINDLRKFGYTDTLFPGQTIADD
>NE1843 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNNYRARAEYATATATLPQAENH
>NE1635 conserved hypothetical protein
MKVEQLSLDIFHTLITGSGLRVKAGPFNLSIQTCLPSLIDQLYRMYSHYS
LVDETEIAEFHVRIIPKHSIRRPLAQFIQFQIDGRSPFPPVPAKQALATL
EWGINLAIAMRINHLLLFHSAAVERNGHVLLLPAWPGSGKTTLCTALIHR
GYRLFSDEFGLMDPQSGEFFPLPRLMPLKNQSIGVIRDYLPEAVLGPEIP
GTLKGTVAHVRPPQESIERADETAKPRWIVFPKWSAGAALRLESLPQSEA
FLLLATNAFNYEVLGETAFDAVTSLVRNCECRKLVYSDFDSALAALDELT
DG
>NE1782 carboxy-terminal processing protease
MHNFIKKISLVFAGVIAGIMVSLNFSAIADKEIKEELQTLPIEELRTFSQ
VFGRIKSDYVEPVEDKKLITEAINGMLVGLDPHSAFLDSEEYKELQIGTQ
GEFGGLGIQVTMEDGLVKVISPIEDTPAFRAGVKTGDLIIKIDDTAVKGM
TLNDAIKRMRGKPDTPITLTVIREGEAEPLTFTLVRAIIKIDSVKSKLIE
PGYAYVRITQFQERTGENLAEAITKLYSGSKVPMKGLILDLRNDPGGLLN
SAVAVSAAFLPSNALVVYTEGRTNDAKMQLRANPEYYLRGSGRDFLANLP
QSIKTIPMVTLVNGGSASASEIVAGALQDHKRSIVMGSQTFGKGSVQTIL
PLSNNTAIKLTTARYFTPNGQSIQAKGITPDILDEETDNDSRLLREADLN
RHLSNGQEEKTDKAAASDKKGAQSSKRSKNKADKKQKKHEPMEFGSSDDL
LLAKAVNHLKSMAQKSKHKTAKSTQS
>NE1945 FAD-dependent pyridine nucleotide-disulphide oxidoreductase
MKTSDIHDVAIVGAGPAGLSAAYELIRTGFTPLVLERTPAVGDVWRNHYD
GLRLNSGRFFSALPGSKFPLSAGGWPSRDEVVSLLETFPARGGFTVQTGI
EVEKVSHDRERDIWLITSNDNRQFESRAVVIAAGANRIPIIPEWEGKNTF
TGTIIHSSQFKSAQDYAGKHVLVVGSGNSAAEIASRLAKYADSVTMSVRT
PPQILPKSIYGIPLIGIGVWTRYLPRALVDGLLNFLRRTMIGDLSVYGLP
SPTISMSKQYAINNVVPILYGPFIDDVRSGRIKIVGPVQKISGGTVEVLS
TVESALNGDQATTTLQPDIIVAGTGFRTGFPELIQVPGITDEKGRSKISG
DQEFKGAPRLYFIGQINPLSGQLREIRLEAGRIARKLDKQLRAK
>NE0522 Glutathione S-transferase C terminus
MGLLINGKWQDQWYTLQNGEFVRENAQYRHWVTPDGQPGPSGEGGFTAES
GRYHLFVSLACPWAHRTLIFRHLKDLQAHIGITIVDPRMLEHGWAFSEIS
RDNPVQDIRYLHQLYTRAKPDYTGRVTVPVLWDKQRDTIVSNESADIIRM
FNSAFDSLTGNDLDFYPEALRNEIGTINEVVYRDFNNGVYKAGFATDQLI
YEKAYDRIFNRLDAIDDSLNTRRYLAGNRLTEADWRFFTTLIRFDAVYYS
HFKLNKQRIEDYPNLSNYLRALYQISGIAGTVNFEHIKTHYYYSHSTINP
TQIIPKGPDIDYTRPHNRVQAFSSPG
>NE2140 TonB-dependent receptor protein
MKSIAGLFSLLLLLWANTANAQNDITYSFNLPAQPLSSTLNHLANITGIK
LIYADEITRNHLSPPLHGNFTLAEALNQVLNKNRLKYEMVDNSMVVISQA
AANTVLPEITITTDSSAAGNEASLTVPNTAQATANIQYTPGAVEVISDTR
FKSTPAQTIKDIVGWIPGVIAQPKSNIDNRISIRGSGLTRNYGNRGINMY
MDGIPINTADGLFDVFEIDPGAYRYVEVFKGANALRYGANSLGGAINFVT
PTGYDASRFSGRVDTGSFGYMKGQASTGGVHGPYDYFITASAQREDGYRD
HSNGHMVLGSANFGYRFSPDAETRFYLNANTWRQRLPGELTRSMALDSPR
SADPEFIRQDQQRNIDSVRLANKTTLRFGSTTVDFGLFGHHRHVDHPIYR
YLDYYVHDYGGFARATDDRLIAGFRNRFVAGANVHNGKINYKEYINPGDA
VKGALAMSTVDKSQNYSIYAENSFYFLQNVALVAGGQFLHAIRDRNDRFL
TDGDQSGRRAYSIFSPRGGLLWDIAPDVQIFANISRSAEVPTFDTNTFMT
PASSELDAQTATTYEIGTRGRKADFKWDIAFYRAHLRNELQCLRTSPFSP
CTVVNADRTVHQGIELGFGAAFLKGIATRKDLLWLDVAYTYNDFFFDNDT
LYGNNKLPGVPPHYLRAEVLYKYPNGFHAGPNVEWMPRAYYADNANSLTI
GSYALLNFRIGYDPGKGWSGYLEGRNLLDTRYISTTITSGTADATSALFN
SGYGRAIYGGIRFNW
>NE2430 hypothetical protein
MLDLIAIVVLVSGLYLWLRKPKTITASASNAEEAVLPLDDDHTAPNNTMV
YINENKDR
>NE0824 hypothetical protein
MKSSFGLSALVGLTLSLHAYAGGNPEFVKFPEKYEQIFTHYDTANRANQT
QLAKFYANEIAAESYKKGEEAAPGSIVIMEIYAPKKDAEGKIQSGEDGLF
VIDKLAAIAVMEKRNDWGSAFKADDRSGNWGFALYDPEGKAKDNDLTCAQ
CHNPLQKQDNLFSFQKLVDYVKAH
>NE2525 possible ATP-dependent DNA helicase RecG-related protein
MRSATDLLDELNAVDESARIEAKRASDMGKSVMETVIAFANEPGLDGGYL
LLGVDWAINDKGDTVYRPVGLPDPDKVQRDLASQCASMLNVALRPEMQLE
QVGGKTLLVVYVPEADVTHKPIYKKATGLPGGAYRRIGSSDQRCVDEDLW
VLRGESQPLHGPDSSILSDARADDFDPTAIAAYRRERARINPQAEELDYG
DEDMLEALGALRRVDGVLQPTLAGIVLFGKPLALRRLLPMVRIDYIRVPG
NEWVEDPENRFQSIDIRKPLMLALPLAEASIIDELPKGFRLPEGQLQSVQ
EPILPRKVIREALANAAMHRNYTLHSPTQIIRYSNRIEIRNVGYSLKEPA
QLGLPGSRLRNPAIAAVLHDLHLAEAKGTGIRAMRRLAADAGLPLPEFHS
DRQANEFKLTLFLHNLLTEEDHTWLRNYAGSTLGPDEAKVLIYVRATGAV
DNTACRDFCGSDTLAASMVLRRLRDRGLLEKRGAGSRTYYTLAGNEPRPT
LESAQSELPLDVPADTPPVGQPQPAEAVAGQQACNPQLATLPPELATLLA
TLQGRISSEALRDGIRRLCQWAPQSGDQLATLLGKDRDYLRNKHLIPMVR
DGQLRFRYPESAKHPHQAYVAPEDKRND
>NE0605 hypothetical protein
MMRVKLFLAVVLSLSVSCFAGSGGDQPESLPETGQTKNASTGDEQPTRIE
TGKKEAESQSSQETGLDKQSLMIDYCRKHTC
>NE2543 hypothetical protein
MADNDRQFSWRQGNVVTLEAAKALNLLAPECDDQHFAVVASHDCDLSASQ
DKEPCVEVVVGKRIDKLGGDSFGKTARRLHIEYQSEAGPVAIELLATSKR
PVAKHELFSTHPRQDIWLDGQGIGILQRWLASRYHRAAFPEAFEDRLRSA
NLPGKRTFLKRIEGILADGGDHIRALLFDLDEGKDVERDGPDDVYQLGIV
VLYDSLRDEPAAAQVAGKAAEALEELFEAAFHPKDSGWKNICLMYCDPIS
DSAITVAQREMLKQWRLEHMSLQEDPPQPMITP
>NE1074 conserved hypothetical protein
MSESRLLDYLDHIQQAATDACSFVEGLAKEDFLENKQTQQAVIMSLIIIG
EAVTKVMDGYAEFAQAHAQVPWRNMRGMRNRIAHGYFDINLDVVWDTVQA
ALPELLKQLPAVRQDADNEARDKPC
>NE2039 hypothetical protein
MLMWLCFPLYIVSMHYQQYVRSMSGPMTEMWGGNRVLSGIADYARMTVSG
IRYSQIYKGSYERFNREILSNGLIFFHRPMTQLENE
>NE2404 putative transmembrane protein
MRNKHLATLLIANIILAALIPGCSLLPENKKIDYKSAGKLPPLEVPPDLT
SPETNDRFAVPDVDASGSATFSTYSNARNGQLRDRSGSPVLSATAAQDGI
HIERSGTQRWLVIPREPATVWPVIKEFWQDMGFLLKMEAPDVGIMETDWA
ENRAKIPQDIIRSALSTFLDGLYSTAERDKFRTRLEQEQPGITEVYISHR
GMIEVLADRNTNRTIWQPRKSDPELEAEMLSRLMQRFGVDEERATAEVAA
SSTVEERAFLDKTRKGVLIIKEPFDRAWRRVGNELDRIGFTVEDRDRSSG
IYFVRYISPDEETRRAAEGKGLLSKLAFWSSDTDDNNKDASAEKYQIKIS
EVGLNSEVSVLNANGVAEGSETAQRILDLLREQLK
>NE2332 CBS domain
MNETSPKIGWFEHISTLLSRKPEDREQLLALLHSAYERNLLDADALAMIE
GVMQVSEMQVRDVMIPRSQMDTIDISEKPEGFIPFVMETAHSRFPVTDGS
KDQIIGVLLAKDLLRYYASREEFNIRDMLRPVIYIPESKKLNILLKDFRS
NRSHIAIVVDEYGGVAGLVTIEDVLEQIVGDIEDEFDFDEDDAYIVADTD
GHYRVRAITEISSFNETLGATFSDKEFDTIGGLVINKFGRLPKNGESITI
EGFNFTVTRADSRRLHLLKVERIEPGVVEDADSHDSV
>NE1259 hypothetical protein
MGVMKGIEFSESKLKEILDQLPIEAQSAFAASCAQRIFTCYVEYARVAKS
KKVDLDAYSEAISYVWNAVIAGNHDAIILNGLLERCMAVLPSEEDAWESG
TPYAEDAAAAIIYSLRSLASGCPQEAIWAAKRVYEAVDNFVVNTYNVNTN
ATDGEKFILDHPIVSNELSRQLRDLNEIINSKRDSESLKKTIKIIMERSK
SESNDLFSEAT
>NE0110 hypothetical protein
MPADHFLKRQAMSDFIICYDITDPRRLGRLYRYLIKRAVPLQYSVFLFRG
DDRQLERCIQDAIELIDEKQDDLRVYPLPGRGLKARIGRPTLPEGIQWSG
LPAKW
>NE2293 conserved hypothetical protein
MDVTDPQFWIAVFQIIAIDIALGADNAVVIALACRNLPENKRNQGIFWGT
AGAIGIRILLVFFALQLLALPYLKITGGLLLLWIGVKLLLPEPASESGPV
VKGGATTLPGVIRTIIIADTVMSLDNVMGIAGAARDSLALVIFGLLFSVP
IIVWGSKLVMKWIERFPVIVVIGAGLLGWIAGDLLTGDTISQEWVATQAA
YLQWLVPAACSLLVIGTGKWLTARIQKKARASMDLLGKD
>NE0122 hypothetical protein
MTPDLKTSPPPATLFSEDAEAIAKYLDESAKNGRDSGKNKSTQIRRFYDE
LVGWQERIGTDEQKFRDYAAFIHMLNAKAAYAQGRNLVTREFVQWLRDCI
KQIDGPRTLNHFRLHFEAMIGFLKAIRG
>NE0082 Proline-rich region
MKRLNWLHLSLILAGMIASGVSWSYSHGHSHGYHNRSHGHYSGKRNFSLG
VGLTSTFGSYGYYNYPGSNVGIYGSFGYGRSYPYSRRPYYRPYGYGYPAS
RFYWPAYPPTVYYPPVVVVPPDPPVYIQQQPARLVPPPPESAVTNYWYYC
ENPAGYYPEEVERCPGGWVKIPPRPAQ
>NE1225 conserved hypothetical protein
MATIVHDLSKYDGSAGFLVDTNIWIDCMDTDSRWHDWSVDQLQICSEQAP
LHINLMIYTELLIPGPDIDALDTMLDIYDTLRSPLPWSCAGLAAKAYLNY
RRRGGTRLVPLPDFYIGTHAAVANLSVLSRDVKPYHNYFRRLRCVGPDET
AEQHTDG
>NE1286 GTP-binding protein HflX
MSHDVATGTVADNTAILVDIDFGEGDKESLEELRELARSDRLSVVAVVEG
TRKQPDPATFIGKGKAEEISQILAQTHAAMVIFNHELSPVQQRNLSMVLA
CRIIDRTSLILDIFAQRAKSHEGKLQVELAQLEYLSTRLVRGWTHLERQK
GGIGLRGPGETQLETDRRLLAKRVKLLKEKLTKLKRQREVRRRARKRAEI
LSVSIVGYTNAGKSTLFNRLVRTDTYAADKLFATLDTTTRRLTLPGRGTI
VISDTVGFIRELPHTLVAAFRATLEETIQADLLLHVVDASSSNRDAQISE
VNKLLREIGADTIPQILILNKIDLLEQYPSGNYMRDEYGRIKSIHLSART
GAGFSYLYDALAEVFDQNLKRLEQHSATPESMNDNVRATFINNEKD
>NE0477 ATPase components of ABC transporters with duplicated ATPase domains
MIRISELTLQRGPLRLLENADLTLHPGHKVGLIGANGAGKSSLFALLRGE
LHPDAGDCVLPMNWRIAHMRQEIDAPDCNAIDYVLDGDGYLRDIQQQLVK
AEQRQDGVELGRLYSELDNADAYTSDARARKLLAGLGFLEEQMGNQVNSF
SGGWRMRLNLAQALMCPSDLLLLDEPTNHLDLDAILWLESWLQSYPGTLL
LISHDRDFLDAVVGHIVHIDQQKLTLYCGGYTAFERARAARIMQQQQAHE
KQRAQRAHMEDFIRRFKAKASKARQAQSRVKALERMEELAPAHFDSPFDF
IFRVADKVSTPLLNLNEAELGYNVGQPILSNVKLQLVPGARIALLGPNGA
GKSTLIKSIVDDLPLLAGQLTRSENLAIGYFAQHQLDSLDPKASPLLHFQ
RIAQDEREQVLRNFLGGFNFRGKRCDEPVLNFSGGEKARLALALIAWGKP
NLLLLDEPTNHLDLEMRQALSMALQDFSGALLLVSHDRHLIKSTMDELYL
VADGRVREFAGDLEDYSKWLNDYRLRQRPDNDATSPDKVDRRAQRQATAA
LRKQLAPLRKHTDTLEKQLDNIQKELQALETILADNNLYEQPQKDQLKQY
LSQQTNLLQKQVKLEETWLNNLEELELLQSELGEDV
>NE1534 conserved hypothetical protein
MIIATRNRLIHGYLGIDNDTIWSIIQDEIPKLLRQLTAMLESIR
>NE2170 possible (AF025396) ORF15x3 [Listonella anguillarum]
MNADSGILSIRKTFRQLFSYALIGILTNVLGYAFYLLLTYLWDAPKITMT
ALYFVGASIGFFANRRYTFRHDGHIGVTGVRYLLAQVAGYLLNLVLLLLF
VDWLGFPHQIVQAIAIIVVAIFLFVMSRFFVFAQSAAESEGVPS
>NE0100 General substrate transporters
MVSPFRQHRRILVASLVGTTIEFYDFYIYATAAALVFGPLFFPAESPSAQ
LMLSFLSFSLAFIARPFGAVLFGHFGDRIGRKSTLVASLLLMGISTLLIA
FLPTYSTAGWIAPLLLCILRFGQGLGLGGEWGGAALLAVENAPPGWRGRF
GMVPQLGAPIGFLAANGLFLLIGLQLSDADFAAWGWRIPFLASSILIVLG
LWVRLKINETPEFTQALAQNPPVSIPFWELIRKHALITFAGTFTVVACFA
IFYLSTAFALAHGTTTLGYDREQFLITQLAAIAFLAAGIIIAGIRADKAS
ANQVLSWGCAATIGLGLTFGPALGAGSLWLVWGMLSLALFIMGFVYGPLG
VWLPSLFPPRIRYTGVSVAFNAGGILGGALAPIIAQALTDAGGTSLVGLY
LTMAGIFSLAGLKLVGKLMPDKTE
>NE2360 conserved hypothetical protein
MKLHLSDSSGLNVFSGYGEGYVAVNQVRYTDNMIVLPNRIIEHWQASSIS
QLGMEHFDALLAMQPEIILLGTGTSLQFPDASLMRMILSRDIGFEVMDTQ
ATCRTYNILSSEGRRVAAAILVRSTDG
>NE0910 hypothetical protein
MKKIGSATVFAAVMLLFSFNNAFADSEGLPADRVITAIQTAVAANPGLIH
EVEVDQEHGKLIVEIKIIDAKGQKTKVKIDPEKNEVIR
>NE0184 NUDIX hydrolase
MTWKPNVTVAAVIEQDDKYLLVEEIPRGTAIKLNQPAGHLEPGESIIQAC
SREVLEETGHSFLPEVLTGIYHWTCASNGTTYLRFTFSGQVVSFDPDRKL
DTGIVRAAWFSIDEIRAKQAMHRTPLVMQCIEDYHAGKRYPLDILQYYD
>NE0735 Cytochrome c, class IC:Cytochrome c, class I
MILSFSGMASAAGNTDSVQSKIVMCQGCHGIEGYRTAYPHVYHVPKLGGQ
HSAYLVKALKDYRSGERNHPTMVGIAGTLSDEDIDALAAYYAGN
>NE1111 Outer membrane efflux protein
MKNSSFSKPATKSAVSALVLLLSGCSLIPEYSRPPAPLPASYTADDTNAM
DAAPESWQQYFTDPVLQNLINIALEQNRDLRIAALRIEEARAQYDIQRSA
KLPTIDATGSYDRSRLIFAKGETFDVDMYRVGIGISNFEIDFFGRIKSLS
EAVLEDYLATREAQQTARTSLIAEVAAAYVDERALSERQLLAEKTLAARE
ASYTRIRRRFDAGIDTAIDLKAAKMQMESARASVSALDREHTRAINTLLL
LMGDQSVTLPGGIAALDTLAFSPIPAGLPSDLLERRPDIRAAEHKLKAAN
ANIGAARAALFPRLQLTTNIGLVNEHFTKLLSNGINAWAFTPQIIFPIFT
HKRNQANLNVSRARMEIAVAEYEKAIQTAFREVADTLLAREQIEKQINAQ
SNATDADRERLKLVTRRYEKGVANYLELLDAQRSLFDSEQALVQLRQLSL
SNAINLFKALGGEWSGAGIN
>NE0848 Phosphoglycerate mutase family
MDLILWRHAEAEDGIPDATRKLTEKGLKQAQKMARWLEPKLPKDTRIIVS
PATRTQQTVSALTHHFETSEQVGTSATPHRVLNTIAWPEAEGTVLVVGHQ
PTLGKIASLLLKGDESGFSVRKGSIWWFSSKQKDERESIILRAVMTPEIL
>NE2478 hypothetical protein
MFILEGNSIVKINQIPPGRTETVAGGSQKKMDRANSAKPGAVGESNNVHI
SSLSMSIQSLDASSETMNTAKVAEIKQAISEGRFKVNPEVVADRLLETVK
ELIQNKR
>NE0886 hypothetical protein
MQILSASKPMRQGEKMNELKKYIETGTEKAGTQKELAKIIGIADANLRTA
KSGIRGLPIEVCIQLANYLKEDELKVILASEMVTAKDEKKRKILESCMQK
SKERISAMIVGGLVISMLTLSPLESAEAKVVENQDSMYIM
>NE0923 Cyanophycin synthetase
MKVLQIRALRGPNVWSKLAAIEATLVFEQNECLPDSIPGFDTRLREYFPD
IALLQPVDWQETTLAHILAFITLKLQERAGCSVSFSRVIKMAEANTWRVV
VEYSEEAVGRLALEQSLALCRAVAEAAPFDTSEAVNRLRELYEDIRLGPS
TNSIVQAAVRRKIPYRRLTDGSLVQFGWGSRQRRILASESDLTSVVAESI
VQDKDLTKMLLHTAGIPVPTGRPVISADDAWAAACEIGAPIVIKPQDGNQ
GKGVTANLTDRDQIKAAYHVAAERSRNVLVERYISGHDYRLLVVGNKLVA
AARRDPPQVVGDGIHSIAQLVKQINSNPLRSEGHANLLTRIHLDEISLAH
LALQGLNAASVPDKGKLVTLRNNANLSTGGTATDVTDEVHPDIAECAVMA
ARMTGIDICGIDVICSSLSRPLGEQGGAVIEVNAAPGLRMHLQPSYGKPR
AVGEAIIDHLFAPGENARIPVIAVTGTNGKTTTVRLIANMLENNRLRVGI
ACTDGVFVNGQCVDTGDCSGPQSARNILFHPEVDAAVLETARGGILREGL
GFDYCDVAVVTNIGRGDHLGLANINTAEELAAVKRTIVENVNPKTGVAVL
NADDPLVLGMASHCPGNVTFFSRNHRHPVILEQRVQGKRVIYMEDHHIIV
AEAGTERRISLSQIRLTKNGMISFQIDNAMASIGAGLAIELDWTTICAGL
ADFVSDAQTVPGRFNLFNYREATLIADYGHNPDAMEALVCAIDHIPAKKR
TVVISAAGDRRNEDIRLQTRILGDVFDEVVLFQDKCQRGRADGEVLGLLR
EGLENAKRVRKVSEIRGEFKAIDTALTNLEAGELCLILIDQVEQALGYIH
SRIAVA
>NE0391 hypothetical protein
MANQLTIDQRLQERRAGASFVCPYQLGIKQGRRVTRRRSGKGAAYVDKYG
WPLVICCLAIVLFSATDAFLTINILSDGGTELNYFMAVLIEESTQKFVHF
KLALTSLAAIILTIHHEVQIRGGFRCRHLLYMISTGYAGLIGYELVLLQI
IDV
>NE0420 Ribosomal protein L15
MKLNTIKPGIGSAKPKRRVGRGIGSGLGKTCGRGHKGQKSRAGGFHKVGF
EGGQMPLQRRLPKRGFTVYGKKQVREIKLSTLQLIDLSEFNPSVLYDHGL
IKNINDPVKIILGNQLIKRAIKIKDLIISRGAKEAVEQMGGLVELTVKDV
NGV
>NE0365 conserved hypothetical protein
MSRDYKSRNSAKTHKNGSLLWLGLFVGYTLGLASAIGVWLYLSQAPSPFL
NGGKVAQNQSSEKSSVRQSPSQEKKTESNTNQTGTSGSRFDFYKILPGID
EPPGEDVFDLTPLPPAATVTRKIPEKTAEKPPEKVPEPKSRYYLQAGSFR
NSSDAERVKAELALLGIIASVQTGRSEGNVPVHRVRVGPFTRMEELDRVR
ASLQENGVTSSLVTQ
>NE0089 Domain of unknown function DUF20
MNDKYSPDSRLFWYLTVIGVVSALIYLLSPILTPFLLAAVIAYICNPLVT
WLEARKIPRTLSTIFVMLMTMGIFIAMALILFPLFEKEVSRLVERIPSFL
DLVKSQFIPWLEDNFNVELQIDIASLKQMLTEHWKSAGGVAAQMLPSLKS
GGLILLTFLMNLVLVPVVLFYLLRDWNNLIRQVGELIPPVWQKQIFTLAR
ETDDVLAEFMRGETAVITIMSIYYVTGLWLVKLEFALPIGLISGILVFVP
YLGTITGLALATFAAITQFQEWSGVIAVWVVVGSGQLLESMLITPRLVGE
RIGLHPVAVIFALLAFGQLFGFIGILLALPVSAVLLVLLRHLHTQYMETM
RE
>NE1681 possible methyltransferase
MNRIVLMIVDPIIEHYMHTLARRSDHPVLDEMEAFALEKSFPIVGRLVGI
SLEIYAKMIGARRVFEFGSGYGYSAYWFGRAVGPGGQVVCTDSNPLNREQ
AEQYLAAAGLWERVRFCTGYAQDIFGQTDGNFDICYNDADKGGYPDIWLM
ARERIRSGGLYIADNVLWHGWVAVEDSADAKPDWTKAIREHNRLILTDPE
FDAFINPTRDGVIVARRKMA
>NE1796 Glycosyl transferases group 1
MSHPPLIVHVIFHLGVGGLENGLVNLINHIPADRYRHAIICLKGFSEFHK
RLNRDDVEIIALNKREGKDFSVYGKLYRVFRQLKPDIVHTRNLTAMEAQV
VAAVAGVKARVHGEHGRDIFDLDGKNWKYNLLRKAIRPFIHHFITVSKDL
ENWLIDTVKVSPVKVHQIYNGVEHLRFHPGGTVPVEIFPPDFFAGRPFVV
GSVGRMAAVKDFPTLVQAFLMLRNELSEIDRPLRLIIAGEGVARAECEAM
LRSAGVEQFAWLPGERDDIPQLMQAMDVFVLPSLGEGISNTVLEAMASGL
PVIATRVGGNTELVLEGETGRLVPSGDPVALARAISQYHQDNTAVYKHGQ
HARAIIEQQFSMRSMTNGYLAVYDRVLGYKSKTETIN
>NE1109 conserved hypothetical protein
MIKTFATKETAALFANEKIRRLPPEILRVARRKMAQLHRVSSIEELRIPP
GNRLEKLSGNRNEQWSIRINDQWRICFRFEAGDVFDVEITDYH
>NE1562 conserved hypothetical protein
MHQILASFSASISELKKNPTALLRKAEGETIAILNHNLPTAYLVPAEVYE
LLMEKLEDYELGEIVKARQAEKHLAIEVSLDDL
>NE0277 Ham1 family
MNKIVIASNNAGKLAEISRLLAPLGIEVVTQSSLGVTEADEPHMTFVENA
LAKARHASLATGLPALADDSGICVSALRGDPGVFSARYAGEPRSDERNNR
KLVEALHGQSDRRAYYYCVIVLLRHGQDPQPVIIEDTWRGEIIAEPIGQG
GFGYDPHFFLPELGKTAAELSIEEKNRISHRGKALARLVQMLSENETVPV
VPV
>NE1146 UbiE; ubiquinone/menaquinone biosynthesis methlytransferase
MTNSTHFGFTTVSETEKARKVAEVFHSVAARYNLMNDLMSMGLHRLWKRF
AIDVSGAKPGDKVLDIAGGTADLTRLFLEKTGSTGEVWLTDINNSMLSIG
RDRMLNDGKSVPVAQCDAEKLPFPDNYFDRVCVAFGLRNMTHKDAALREM
WRVLSPGGSLIVLEFSKVWKPLQPLYDTYSFKALPFMGKIVARDDTSYRY
LAESIRMHPSQEELKQLMQQAGFERVEYFNLTAGVVALHRGYKF
>NE2391 ApbE family
MMKASLSMSPRSIMIIIAAFLIMMALLGRIFFASSLTQSHVHGYTMGTKY
SVKYRHPEGGFTPDTAQKQIESVLDEINRAMSTYDPESELSRLNRATTSD
WIPVSDSLFTVLEAALEIGARSEKAFDITVGPAVNLWGFGPEFHSERIPD
KSEIAAVLPAIGQDKLVLDRETHAIRKLHSDIYIDLSAIAKGYAVDRVAE
IFDARGIEHYLVEIGGEIRARGTNAQNTPWQIGIEKPHHQYGRSAPYKIL
SLQDTGLATSGDYQNFFEIEGHRYSHLIDPTTGWPVENGASSVTVLAESC
MVADAWATALLVLGHERGLTIAEDQGIAVLFIVNQEGVIKGYTSSHFPAE
RASDFTQIFIATFLVMGLALIAMAIGVIGGRQPLAGSCGGLGRMGLGCEA
GCDQSCAKHDGKNSVH
>NE1600 TPR repeat
MKVILTIQLALLLSMAGCTQLPRTGDAIAHGSAASAGAAPATELTADSLF
DFLMGETALQRNMPDVAVESFIRLARETRNPRIAEHATDIALRTRHFGEA
KEAIDLWVALEPDSMHARQAAVALFVANGQLDNVRPHVEQLLKLEPETVD
KAFMQINKLLSHHSDREAVLKLVQQLAASYPDLPEAHFAVSQAAWSANEF
KLAAKAMNQALELRPEWEMAAVHQGQILQKIDKDKALSFYDQYLDRFPRA
NDIRIAYIRMLMEEREFDRGREQFQKLEQVNPSNPDIALAIGLLSAELDD
LGSAEKYFKRALQLGFEDTNTIHFNLGRIHEIAQHNAEAMDAYLRVTGGE
RYIAARVRYAFLLAKRDGIAAARRYLKTVQVENEQQRTQLLISEAQLLRD
SGEFRGAYDLLDAYLRKYPDQVELLYDRALMADKIGKLDVLEQDLRKLIE
LRPDNAHAYNALGYSLAERGLQLPEALALIQKAIELSPDDPYIMDSLGWV
YYRMGDLKKGVNYLKLAFDTRSDPEIAAHYGELLWMNGAKEDAEKIWQSA
LEEHPENELLLDTVKRFMK
>NE0826 DUF214
MKLRDLLHFSLRTITSYPARSFLIMLAMALGVAAVIILTALGDGARQYVV
NEFSSIGTNLIIVLPGRAETAGSFPGAVMGQTPRDLTLEDAHWVGRLPQV
RRYAPLNVGVAELSATGKLREVTLMGTTAEIFPIRHMKLAQGRFLSHSGE
NSAQIVLGAKIAQEFFPDGNALGQRVRLGDRRFLVTGIMAVQGESMGFNS
DELVIIPIQHAQTLLNTTSLFRLLIETRHHNEIENAKEAIHQALIRRHDG
EDDVTVIAHDAVLATFDRILRALTLGVAGIAVISLAVAGILVMNVMLVSV
SQRTAEIGLLKAIGTPASAIRHIFMAEAVWLSVTGAFAGFVLGQAGSWLL
RLAYPLLPAWPPLWANFAGIAVAVLAGVLAGLLPAIRAAKLDPVMALSKR
>NE2268 hypothetical protein
MKTTIDRATFNAIAVLGFVTVIISVPWEEIFRAVNGYGFVDKGVYSEYFL
YKTSVLDYKEFSGVLSYVSNEFLWHYAIGWLVNNTGILIDHVFLAISFIT
LLIFGLLLAKWQSIYALPLLVNPLVIDFAFSQYRLALAISILGAAYLLHG
RYRAIPFALVLTSLFIHSGTTLFILIYAAVWLAQWLSIRFRIGCSTVFAI
LFAVGAFISLLIGPLREALLSAIEDRRAEYHDMSSSLLYSLFWIALLIPF
YLGRRYLVTLDYARYALVIVSIVFVNVFHGGYSTRFLAAAFPGLVSAVLS
LHGVIKIMILVLFIVYAVFQWLYWLRILSG
>NE2538 conserved hypothetical protein
MTDAKKPMAEANRISSMLNTVLGADRFPVKVDELALEYSRQCFSDSPIDT
VRGEDLEGFDGLLKANKARSKWLILYNSATPSEGRKRFTIAHEFGHYILH
RHQQDLFECGGDDIETGDNNERDIEAEADLFASTLLMPLDDFRRQVDGQP
ISFDLLGHCADRYGVSLTAAALRWTEIAPKRAILVASRDDHMLWAKSNKA
ALKSGAYFATRKNTIELPHDALAHSYNAFDMCDNRTGRAQSWFAREPASM
PVTEMTRVAGQYDYTLTLLLLPEAEWQGARHDDEEPEEDTYDRFIRNGQY
PVR
>NE0114 hypothetical protein
MASLQSSKIPIQRRFPLIGNFPVSHFRHHCEAARGNSVSLYCLTGLLRCI
YLAIIVQALEDSVFTAINLLTEYLPGKLFHYARAAPTP
>NE1129 hypothetical protein
MGFFSYWLRGILFLTLLVTFSFAGNALSFDGSVEVITHPGVNSNYLSKNL
LRSIFGMRLRTWQDGLPVRVFVLPDDAPLHSTFTKQKLNIFPYQLRSAWD
RMVYSGTGQAPFVVHTEEEMRARIASTPGAIGYLSGSMINDSVQVVHIDE
>NE0595 putative oxidoreductase protein
MKNKLLIVGCGDIASRAAGLLEKHYQLFGLCRRAENSGHLRALGIRPITG
DLDQPASLNKIAGLAAHTILHLAPPPGHGERDMRTLHLLSALSRHSSKTQ
TGILPQRLIYISTSGVYGDCSGSRVSESHPTNPKNARAFRRLDAERQVRS
WGIRNRIQVSILRVPGIYAHNRLPIERLQQRTPVLLSTEDSYTNHIHADD
LARIIVAVLRSGRPGRIYHASDDSCLKMGEYFDLVADHFALSRPERITRQ
EARKVISPGLLSFMLESRRLTNDRIKRELRVRLRYPTVSDCLAEMRKIPV
KSS
>NE1527 Thiolase
MTRQVQDAYIVAATRSPVGKAPRGMFKNVRPDDLLVHVLQAVLKQCEGLD
PAAIEDVIVGCAMPEAEQGINVARVALLLAGLPVSVPGVTINRFCASGLQ
AVAMAADRIRLGEADVIIAGGTESMSMVPMMGNKVAMNPALFKPGSEQVA
IAYGMGITAEKVAEQWKVSREEQDAFALESHHRAIRAIEQGEFRDEISPY
PVQENRPDLNTHEIQNTAIVRDTDEGPRADTSAEALARLRPVFAAQGSVT
AGNSSQMSDGAGAVMVVSEAALQRFNLTPIGRFIGYTVAGVPPEIMGVGP
VKAIPKVLEQTGIQQDALDWIELNEAFAAQSLAVINDLGLDRAKVNPLGG
AIALGHPLGATGAIRVATLLSGLRRHKLKYGMVTMCIGGGMGAAGVFEAL
>NE0881 putative organic solvent tolerance transmembrane protein
MNTLKLCLILYACLVLLPVRVMSADLSPASSERQPIYIEADHIDGHYQQE
IEAIGNVRMRRGDQTLTADRVKYYQSNENVEVEGNAQLERPDDILWGSYL
QMNLNDNTGQLSEPRYLQKDGNGRGDGNLLLLEGENQYRFKKARYTTCPE
DDHDWYILADDLEIDKEKKVGTARHASVRFKDVPILYVPWMNFSFGNERK
TGFLSPIMGNTSRSGVEVSVPFYWNIAPNYDATITPRLMSRRGVMLNNEF
RYIGQTLNGRFLLDYLPNDLETDTTRYGMQLNHFHNLGAGWFGMINYNSA
SDRNYFRDLGNNILFTSQTNLLQQGFASYFRELGRNGTLTFSTLLQQFQT
LQDPRAPIISPFKILPRFTLNAAKRNVYGLDFDFSGSFTHFSHSTLPHGL
RTTFLPGVSLPLENSFGFIRPRVSLHHTRYDLNEPANPAANDKHLSRTVP
IFSFDSGIVLERDTTLARENFVQTIEPRVFYTYIPYRDQQLLPNFDSAEM
DFSYPQLFLERRFSGEDRINDANEITLAVSSRLIHSATGNERLRFSAGQR
IRFSDRRVILTSPQVTRAGSDFIAELSGGITQNIKTDTGIQLNQNNFLIE
KIRTGISYRPAPGKVINAGYRFTRDVLEQVDLSTQWPFLKKWQGFAAINY
SLKDDKLLAGLLGLEYNACCWSLRFVTSHFTTATQRTSTNIFVQLELNDL
MRIGTNPVRVLQQTIPGYMRTDL
>NE1814 hypothetical protein
MSMLCSLYRITPEQVTKLKDFPDAIGELVGFTAPPPKVSFLSKLFGKPPK
QLSSSGQQFEPVAESDIFELNQAWHILHFLFSGTNAESPWPGGFLISGGE
EIGPDQGYGPIRLFDSELSRAVAGFLDTQSFKMLDSAYVASEIEATEIYW
KVSSEHTERQRQLEELWSMVKELQTFFEHTVRAGNATLLSIY
>NE1849 conserved hypothetical protein
MQVIIISGLSGSGKSIALKVLEDSGYYCVDNLPASLLVVLINHLQTQQHA
YVAVAIDMRSGENITVLPWQLKMIDKSIQIKFIFLEARTETLMQRFSETR
RRHPLSDKNITLEEAIRREREALATLTGLGHHIDTSSLRPNVLRAFIKDF
IADSRSPSQLTLLFQSFGYKHGIPLDADLVFDIRCLPNPFYDPQLKELTG
HDPEVIRFMESQPDASKMLRDISSFLGTWLPAYIRDNRAYLTVAIGCTGG
QHRSVYFAEKLALHFHDSAHVLVRHRGLAEYKPHYARR
>NE2256 GCN5-related N-acetyltransferase
MSSYEMFADKNIVAGELANLMASAGWGTEGDYDATAIEKSLSAYPMIAYC
RDSDGLLVGYISAFTDGAFSTFVGELVVRPTYQQRGIGSALLAMIVEKCR
GVPVYATPFQGTEKFFLDRGFRVPERPMSVVSMRNVT
>NE0490 Helix-turn-helix motif
MNNKLTPVSPGEMLAEEFLIPLGMSNYRLAKEIGVSAQRIGEIVTGKRAI
TVDTDLRLCRFFGLSDGWWLRLQVDYDIEMARGALEETLAKIRPWANTQE
HGTPA
>NE2228 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE0754 TonB-dependent receptor
MVVLGEVTVNSGNTGSLPTRSILTSVDVMGAERVQDKNVMNSWELIGQMP
GIQSTEFRLGAESGKPSFRAFNGEGYINGIKLLIDGVPGNINSGNMRHLD
MIFPLDIVIFRRRLHQISELDSTQLAV
>NE1306 conserved hypothetical protein
MIRGDLVTIAVPGDFGKPGFALVIQANLFSEHTSVTVLPVTSMLVAAPLL
RITVQPGAENGLQKPSQVMVDKIITVKRDKVGPVLGCIDPDTMVEIERCL
AVFLGIAK
>NE1750 putative pre-pilin leader sequence
MLNIYRRSAFIYTQAGVSMIEVLVSIIILSIGLLGMAGLQTAGLKSNHSA
SFRSTASMMAYNILDSMRANRVVAGAGGYNHSLSEEDASETETKVEAEAE
IPEDIKNWLKELALRLPEGLGSIDVDADNKVTVLIQWDDSRGAATAQQFV
MTTRL
>NE1451 Hypothetical hesB/yadR/yfhF family
MAITLTERAARQIRQQLERRGKGVALRLGVKKSGCSGFAYSFDYADEVQE
DDQLFESHDAQVVVQRDQLSFIDGSEIDFIQEGLNSSFKFRNPNIDNTCG
CGESFSLKT
>NE1535 DNA polymerase beta-like domain
MRLKYCVRDAMMSRPDAEENPRGGVGSAGFRRKHMNRDRVLAVLRHSKPM
LASRYGVRRLALFGSTARDDSDVDVLMVFDGVASAARYFGVQFHLEDALG
CSVDLVSEKALRPELRPFIEKEAVYV
>NE1409 possible phoP; Response regulators consisting of a CheY-like receiver domain and a HTH DNA-binding domain
MRILVIEDEFRLQNQIRRQLEAAGYMVDTSSSGDEGLFLATEYRPDAAII
DIGLPGKSGLEIIKALRERGSLLPILILTARSSWQDKVQGLEMGADDYLT
KPFQMEELQARVRALLRRAVGIPQTLLKCGPIAVDVTAQSVSVDGANIEL
TSFEYRLLEELVCHRGEILSKESLADALYPHNEDRDSNVLEVMIGRLRRK
LDPEGTLKPIETMRGRGYRFTLECNSKSP
>NE0252 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE1572 Uroporphyrin-III C/tetrapyrrole (Corrin/Porphyrin) methyltransferase
MPPDATLHGTLYLIPTPLGEGDLARILPAEVRQQVSLLERFIVEHPKTAR
HFLKQINPLRAIQTLKLEVLDEHTPAGEVEALLAPLLAGEDVGLLSEAGC
PAIADPGGALVRMAHQKKIRVVPFVGPSSILLALMASGLNGQRFHFHGYL
PVASDIRNKEIARLEQTSITADETQIFIETPYRNQKLLEALVQQCHTETD
LCVACNLTQADEYVSTKSIGEWRAGNWPDLQKKPTVFLLHGQKQSRKF
>NE0787 hypothetical protein
MDHDKSKRLPPSYYTLLEASQEKETTSTKILAQYLDRSPATVRTQFQRIM
EFLDVSSRYGALREAEKRGLIRRNKRATSAHHQENRSSL
>NE1351 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1828 possible lipoprotein localization protein LolB
MACRSWVLGILLVLVGCAGPEVHQGEVTTVVIQPVQEQADIANEPFVLTG
RLAVNARQHRFSGGIRWQHTGQSDEIYLFSPLGQIVAEIFRDQTGVRLVT
SEPAIYRAQNAEYLTSQVLGWELPLAGMRFWVRGTHFPDTVAEKDLDKNR
RTVAIRQDGWQVVYQNYYPAQEGMSVLPRLLEFSRPDVKMRLLVDQWQGE
TKSGDATGRKLP
>NE0160 Outer membrane efflux protein
MFTLLSLIYASTTVLAGKLPVPRDIAIPSRVISVASAAFPPPLINGGKVR
RVSLDALTLSDAIGIAIAHHPDISRAHAEVIQRNAEVDVAKAAWYPKVDY
GVKPGYGRSYNALNSGNTTALRTSVGVSQLIYDFGVTSNRIKAADAVGEQ
TGHQLADTIETVAYNTATNFIELAAAQDRIAAAHKQIEALKATHSKIIDR
VRAGLSVSSDRTMADLAIRRAEAEAEQANTRRDVAAAKLAELIGVRPENV
ACLGDTVSLVSKLGAPDGNVDQRPSIKAAEAAVQAADAQLQVARGSRYPS
IGVGASRSFTTGPASALNDTWISVVMSGSFDFGNSARHQIEAAKAAANAS
RKALENERLITRSALNSAQIEARGAAERMTSYEKIIELAQVSRNLYWQEY
ILNKRTLTDVLNPERDIYQSELEWINALADATIARIRAQVAVGGFVQQLR
DREGNRHE
>NE1350 conserved hypothetical protein
MTAATLTSKGQITIPAAVRAGLGIDVGDRVEFIEIEPGRYEVIAATQSVK
ALKGIIRKPNHPISIEQMNAAIAREAVKSVR
>NE1000 Phosphate permease component of ATP-dependent phosphate uptake system
MLPMNHPDNLNEIRTIIAYHKRWDLLALVAGVTALMIAILTFIALFGSMV
IDGMPRLTWEFFTSFPSRKPEAAGILSAWVGTTLIMLVTAAAAVPLGVAA
GVYLEEYAPKNLITEIIEINVTNLAGVPSIIYGLLALGLFVYQLGLGQSI
LSAGLTLALLILPIVIVATREAIRSIPVSIREGAYALGATKWQTVSDHVL
PYSAAGILTGVIIGLARAIGETAPIITIGALTFIAFLPPSPIQDQFPFVS
FQWLMEPFTVMPIQMFNWISRPQEAFQHNAAAAGLVLVVMTLLMNGLAIY
LRYRLRKNIKW
>NE0745 hypothetical protein
MSIERVCSLCFDNEDLSDWIVNEDGPRGCDACGKRDAPTCKLSELCAFIE
SRLSQYWGSADNQLFYVSAEGGYQGRTWDTYDLIVDEIGLSFPRAQNDRL
LREILGHLTDQAWCDYDCGALDHDEALKFSWRQFCETIKHKRRFFFLSDG
SDDRDSFTPASLLHEIAHSIEVIGLIREIPAGTKLWRARPDLNKGAKATA
TSFGPPPAEHALQSNRMNPPGIPMFYLASSQKTALLETRTMESRMGKWSV
ARSLLVLDLRRLPHVPGIFSKADRHYRLGLKFLHDFAVDIMTPVARDQRV
HVDYLPSQVVTEYFRDYDFEAGRLDGIVYNSTVHLEGWNIALFANNVDLG
LSRPTWGRAPEPWLTFIKSIRARI
>NE0810 Cytochrome b/b6
MSKQQNSMMEWIDKRFPLTITWKAHLSEYYAPKNFNFWYYFGSLALLVLV
NQLLTGIFLTMNYKPDAGMAFASVEYIMRDVDFGWLIRYMHSTGASMFFV
VVYLHMFRGMMYGSYRKPRELLWLIGMAIFFVLMSMAFTGYILPWGQMSY
WGAQVIVSMFGAIPVIGDTLSNWILGDFMLSDAALNRFFAYHVVTLPVLI
VVLVFVHIIALHETGSNNPDGIEIKANKDPHTGLPVDGIPFHPYYTVKDI
FGVVGFLIVFCGIIFFAPEMGGYFLEDNNFIPANPLHTPDHIAPVWYFTP
YYSMLRAVTVNFLGVDAKLWGVILMAASVVIFCFLPWLDRSPVKSIRYKG
PYFKFALTLFVISFFVLGWLGTKSPTPLYTLLAQIFTVIYFAFFLLMPWY
SKIDKTKPEPKNVTK
>NE1445 Nitrogen-fixing protein NifU
MPKIAEIEGTPNPNALKFVLKEPLTWGVAKSYDHAEQAVDDPLAAALFDI
DHVTNVFYVDRWITITQDGGADWQDLAREVADPIRAAPAATDQSAAVVAA
ASRTLADLSEEDQQRLERINILLDEEVRPFLQHDGGDLHVLALEGNILRI
HYQGACGTCPSSISGTLRGIEQLLRTIEPDIRVVSA
>NE2303 Uncharacterized NAD(FAD)-dependent dehydrogenases
MTSYLSEEAGPDLTQGISLSDFGNQPLLRGHVGDEPVILARIGDEITAVG
ATCTHYGAPLTEGLVVGETVRCPWHHACFSLRSGEALGAPAFDPLPCWQV
ERDGDRIMVRDKITPKPRSIPVAAANQPANVVIIGGGAAGFACAEMLRRR
GYQGQLTMLSEDSDAPCDRPNLSKDYLAGNAPEEWIPLKSDDFYVRNRID
LQLHTTVTKINTTGHTVTTADGRIFPFDRLLLATGAEPVRLPIPGANQSH
VFTLRTLADSRAIIERAKHAKAAVILGSGFIGLEAAAALRARELDVHVVS
LDKHPLEKILGSEPGDFIRSLHEQHGVQFHMGTSLAHIEPHKVVLSNGKE
LTADLVIIGVGVRPCVSLAEAAGITVDNGILVNEYLETSVPGIFAAGDVA
RWRDEASGKTQRIEHWVLAERHGQIAAENMLGANTAFQDVPFFWSAHYDI
SIRYVGYAGPWDTLEIEGDMAAYDCLISYKTGGKTVAVAAIGRDKQALEY
RALIAQQQH
>NE1261 Integrase, catalytic core
MLQVAPSAYWRHAARQRYPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWYQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1853 hypothetical protein
MSDQYTQNESDQSKDKVEWTKPASLLNILGKKFAPIADLQHKQLPSWSLL
VFLGILLLVFIWKQIAVNQAESRLEKGQAQIAQQLEEKSKELVKKAREYA
DSQYKKEEERFGQVLAWAVRGELIRNNLDQIDQYLTELVKTKDTERVVLI
SDEGKLLVSTDKRLESEEASSLYPKDVLGLQTITIKSDVDNRKLLVVPVM
GLNKRLATIVISYNPPSLLN
>NE1760 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE0697 possible transmembrane protein
MIKNVSEEEILLRKRARRRFIGAVTFVILSVVFLPMILDDAPQQEQQQID
IQIPSEELTAETYPWMTPENAPAIDEAEIASDPDKPLPFSDSEYSGKIES
GRSTSGIPVPAKKPPFVKSTPASVPAPVVQEKVPASNNVKVQEGAFVIQL
GAFSDVSKAKQQQQNLVANGIRAYTETIKVGNNEMTRVRIGPFATRDAAE
AEHERLKKTGLSGVVTTK
>NE2550 putative ABC-2 type transport system permease protein
MMMLTIAGKELKLLFASSLVWIFLAGMQLVLAWVFLGRLNTFLEIQPQLA
QLANPPGVTEVIISPVFSVASIVLLAVTPVLSMRLFAEERRNHTLAMLIS
APVSTSAIVLGKFMALMIFFCLIPLLIVTLAISLLTGGTLDFGLLGSNVI
GLILLAGCFAALGLYISSLTSHPTVAALGCLGVLLCLWVMDIVAIESESA
AHHFSLFRHFESFNIGLIDSFSLVFFLLFTITFLVLTIRHLEGERLNG
>NE2568 Bacterial regulatory proteins, ArsR family
MTDQYTQPDFSVMVEKKASAAKACSMLKILANEDRLLILCQLIQGKKNVG
ELEQTLGIRQPTLSQQLTVLRDEKLVSTERQGKYIYYSLASPEAVQIMNT
LFSVYCQNEHHHSPDDVTHNGRQ
>NE0381 Integral membrane protein, DUF6
MIEYRIFRFPMYPLLARFFSSHQQALGVLFALLSAIGFSAKAIFIKLAYV
EPVDAVTLLALRMVFSVPFFVFAMLHGRAQATPMARHDWLAVLLLGLVGY
YLASFLDFLGLQYISAGLERMILFLYPTMVVLISALVFRAAIGRRVWFAL
LLSYVGIGLVFVHDFHITSDGLFMGSSLVFASALAYAIYLIGAGHTIARI
GSMRFTAYAMTVACLACIAQFLLTHPLDDLQQSTRVYGLSIGMALFSTVM
PAFALAAAMRRIGSMQTSMIGALGPVATIYLAYVFLAEQLSLTQLAGSGL
VLIGVMMISMRKME
>NE0738 conserved hypothetical protein
MEYTRLATLVFSVNGCFLMMSRVSQALAPVLFLPHGGGPLPVLGDKEHEK
MVSFLREIATELGEPPAILIISAHWEEEQATITSNSQPGIIYDYYGFPAA
AYEIQYAAPGHPGLANEIYTLLTANGIPARLDEQRGFDHGMFVPLKLMFP
QARIPCVQLSLLNNLNPRMHIALGKAITALRSRNILIVGSGMSFHNLKAF
FSSTVDGRGENEAFDNWLIETCTHPAIAPEMREQRLIEWEKAPFARFCHP
REEHLLPLHVCYGVACVDTPTARTVFNGEIMGRKVTSFLWQ
>NE1055 conserved hypothetical protein
MIVRIVKRNTFIFLSVVLLSLLAVPATNIFTAPSRETIKWGEKSFLYNMD
FISRWAALLLYPVGISTDSNQVIIGRDDWLFLGDLYEETRTIDRRPPSAA
DYVSGQEIGSAIESWNRYLSSKGVKLFRIMIGPNKGTIYPENLPIWAKPS
IPNATDALLVGANTIHYVDFRSILLKSKASHSVALYYKTDTHWNALGAGI
AFQAFAQQVGKVVPEIQWPPQKIYKLNRVDSRVGGDLANFLRLTTYLPDL
EPVTYISSLAVETTQLDYDTGFILRQGGNPQVNAPNKPLLVQSCGALNQK
KVLWLRDSFGTVMSPFMAATFSEVLQLHWAEAMKPGGKFVQLVEEWEPDY
VFFTVVERASRSPWFASYPPPVLVPLGSKFKPIQTTTAVGLNHLLQGTTT
NEFQIIGNDPFFDFTVSEIIKPKEVDYLSISLSCADGSQSVPLQLFWLVD
KQPYFDEEHSARFLFRTGENLIDLHTLPKWDSAKSITRVRVDIDTQDSCV
HFKLGNPIFGVE
>NE1890 conserved hypothetical protein
MNKDTVLNVTVMGREFRIHCPGEEREELLLAVSCLNRKMQEIKSAGKIAG
TEQIAIAAAISMTHELLSIRSQRGFDMNEFKRRIELLECRVSDALTDQGI
YKNS
>NE1264 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1166 DUF185
MALPLPDPSEQAYSDTLKTMLHERIAHSGGWISFADYMETVLYTPETGYY
SGGAAKFGTAGDFVTAPEISPLFGQALARQIAPILSAVNQGSILEFGAGS
GKLAVDLLCALEELNNLPQHYYILDLSADLQQRQRAMIEQHIPHLASRVS
WLSALPEQFEGLILANEVLDAMPVHLVAWQNGNIAERGVIWKDQGPVWQD
QPLAAGELLDVARQLPPADQFSYPLYISEISLTNRHFICSLAMLLQRGAI
LLVDYGFGQNEYYHPQRHQGTLMCHYRHHAHDDPFFLPGLQDITSHVDFS
TIARTALDSGLQLAGYTTQAHFLINCGITDLLARTPADQPGSYLPLVSQV
QRLVSPAEMGELFKVMVLSRDIDIDAASCGFTRGDLRRLL
>NE2059 putative similar to copper export proteins
MKRESFVELFIEKIVTKVKISSFVMGLISLVLLSSSNVVFAHAALTKAEP
ARRAVLTASPKQVRLWFNEEIEADYASLSLHDANGKALTEKKPLVHPDDA
KSIYLELPELIGGQYTVKFRVLSVDGHVVDSEYKFTVKNK
>NE0256 conserved hypothetical protein
MEFNEYELQRLFGHEAAEDEDPQRLKDYYFKSKVYSQVVNDLPLRIIVGH
KGIGKSALFQVAIDEETENKRLTVLIKPDDIIGIGEDTDDFLKLIRDWKI
GINAIITQKALTSFGMLFEGWRGKLNQYGGTALDFLSSTLKLEGKVSLTA
SKEAILRDFLKNNKISVYIDDLDRGWQGRKHDIQRISALLNAVRDISTEN
RGIYFRVSLRSDVYYLARTSDESTDKTEGSVIWYSWTNHEILVLLVKRIE
SYFGREVDEAELLKKHQLELMRYLAPIIEEKFTGKGHWRDAPTYRVLMSL
IRKRPRDLVKLLTLAGREARTKDAERITTNHLENIFEEYSQGRLQDTINE
YRSELPEIEKLILGMRPTKIQRKASQGYVYTTDQLLKKIKAIEEQGKYRW
ANRNQVDTKELAAFLYKINFITARKQIPTGIDRKYFEENRYLSNKFIEFG
YDWEVHPAYRWALQPEEPMQIFNELELSSS
>NE2535 hypothetical protein
MNDAENLTKLLGHLPPAVFREFMADEFSLAMPDLDTKKTKKEQREQMEVA
LSALGVSERQRIEEVAERIVLLSDGAGQDVIDGFKDDIFDDAAREAFAAI
PNQYQRALWLHVNEPVIFEEALNARQADVFRQSASCLAVLDDAAAKTAFH
QTVAQQLGCSDDAVAIQIFKRLRPDTQTGEDVDLYQISIHHNRPPEIIDC
VQASELVPQEVIRAVSSHITYEPANGHLEVLSKDTDGREALARIVADSLL
QSPITGEKIPLKQYDYQSLAAPRNFDIASEPVTSVKVVELGYSAANGRSL
LVKTWTKDADDIYTAARSLINPTFDFRDHHLNYAKLSIKLKKVGKDRARA
ITVILRDDNKCNIKTKREKDQALWAGVLNFWFIGRVFDRVRQ
>NE0137 conserved hypothetical protein
MRPAARVIIAPAKMFLFSVTYLTVEIMKKSTTVEKIKAPELITSDWLNTP
QPVTLASLRGKVVLMHAFQMLCPGCVQQGIPQTQRIFDEFDPDRVAVIGL
HTVFEHHEVMGRDALEVFAYEYRLRFPIGIDKPNEQHSIPHTMMTYHMQG
TPTTILVDKTGHLRLHKFGHVSDLLLGVSIGTLLAEEVSDEELAAGSTDS
AGNESDTQPGCDADGCRV
>NE0809 Rieske iron-sulfur protein 2Fe-2S subunit
MTDSGDNGKMSGRRRFLLVATSVAGAVAGAGVATPFLRSMMPSERAKAAG
APVEVDISKLEPGMLLTAEWRGKVVWVLKRTPEMLDNLEKLNSQLADPDS
QRDQQPPYAQNHTRSIKPEILVVLGVCTHLGCSPVYRKDIAPADLGSDWL
GGFFCPCHGSKFDLAGRVYKSVPAPSNLVVPPYTYLSDNRLLVGSDSKES
A
>NE2461 hypothetical protein
MKTTLIKVIAASVTALFLSMQVYASGHTAHVDEAVKHAEEAVAHGKEGHT
DQLLEHAKESLTHAKAASEAGGNTHVGHGIKHLEDAIKHGEEGHVGVATK
HAQEAIEHLRASEHKSH
>NE0222 ExsB protein
MKKAVVLLSGGMDSATTLAIARQSGFACYALSIDYGQRHVAELAAAARIG
QSLQVSDHQFLKLDLAVLASSVLTDISATVPLHGTSTGIPVTYVPARNTI
MLALALAWAEVLGSHDIFIGVTAVDYSGYPDCRRDYIDAFEKMANLATKA
GREGMVLTVHAPLIDLPKREIIQCGMELGIDYGLTVSCYQADEAGYACGQ
CDACHIRRAGFEAADIPDPTCYRNKQIS
>NE0837 Domain of unknown function 2
MTASRPVVWTATQLAQAVVLGQLELHYQPIVDLRSDQIAGAEALLRWRHP
SLGLLPPGQFLPVAESSGLMPEIGAWVLREACQQMHEWLPVQWRPFRLAV
NVSARQVGPGFDDQVQQALAAAGLLAEYLEIELTESAAFGDPAIFPLLDA
LRAIGVRFAADDFGTGYSCLQHLKCCPITTLKIDQSFVAGIVDDTRDQTI
VRAVIQLAHGLGMEVVAEGVETPASLTQLRLADCDAVQGFLFAKPMPAAA
FAAFVKRWRGVTMNVNEPTTSCCVCCKEIPLDAAFTPEGSEYVEHFCGLD
CYERFQARAKAVAEPVTTPAAGGPTPSG
>NE2333 Uncharacterized protein family UPF0054
MPTRNMPLTKNPVESPDHEQELVLTVQYVADKTDIPNRRLFRKWVKAALS
KPAEVVIRIVDRQEGEILNRDFRGKSSATNVLTFVYDDDVPLLGDIVLCA
PVICNEAQQQGKDLTAHYAHLTIHGILHLQGYDHIRDEDAVVMESLETEI
ITRLGYPDPYVIQH
>NE1840 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1282 ATP phosphoribosyltransferase regulatory subunit
MRNWLLPEYIEDVLPRDAYRIEKIRRLIMDMLFAHGYQFVMPPLLEYVES
LLAGSGSGMNLRMFKVVDQLSGRMMGLRADMTPQAARIDAHLLNISGVTR
LCYASSVVHTVPDEITRTREPFQVGAELYGHSGIESDLEIQCLLLECLSV
SGIHSIHLDLGHIRVFRSLIRDSGIKPEFEMELYAALWAKDISSLKELVR
TGLNKRLTRSVQNALLLLPELYGDGTVLLSARQHLPDFPEIGEALDQLEH
VARILQPYVDRITFDLADLRGYHYHTGMVFAVYTPGCPAPIALGGRYDEI
GKSFGRARPATGFSLDLKQLSQLTDMNGYPSGILAPWKPEDEKLAAMVRQ
LRAEGHIVVTELPGEENQEVTGCDRKLVFRNGNWEIDPVTG
>NE0575 Bacterial transferase hexapeptide repeat
MTASEKTVLPELSYLRTTISNVVIELRNLRETSLEHRQRLNNPPKLPSRK
TLADIIDKLSAALFPNRLGQPDLTGESIDYFVGYTLDTALRDLSAEVIRE
LHFASGLEAVSHTVRERAVEITHVFAGQLPAIRHILDSDIRAAYEGDPAA
HNIDEVLVCYPGITAIIHYRLAHELHKLGVPLIARMISEIAHSLTGIEIH
PGARIGDSFFIDHGTGVVIGETAIIGRKVRLYQAVTLGAKHFPVDEQGAL
IKGNQRHPVVEDDVVIYAGATILGRITIGHGSTIGGNVWLTHDIPPGSHI
TQAQMRHEKVPG
>NE2351 DUF170
MSITVIIPTILRPLTHQQKHIDMEGSRVSEIIDHMDQQYPGIKEKLVANG
NAHRFINIYVNDNDIRFQDGLATSLRDGDVLTILPAVAGG
>NE1704 CRCB protein
MWKPILAIALGSTLGGLLRWGLGLKLNNLFPDVPPGTLVANLIAGYVVGV
AIAFFAHMPNLSPEWRLLVITGFCGGLSTFSTFSAEIVSLLQRGLYAWAM
SAIAVHVAGSLIMTLAGIATVTWFKSS
>NE1928 Smr domain
MGSDDKSSSEEVAGDEDAALFREAMRDVRPLTRTKKIIRAHSKHQALPQP
GASDLPDIQDVLADDGWQWDMAESEEWSFARPGLQRYTLRKLRRGNWPVQ
DELDLHGLNRDEARRILVIFLNQGVLRGLRCVRVIHGRGLSSRNRKPVLK
ILTGNWLMQHGDVLAFCQALPEQGGSGAVLVLLRNADK
>NE1580 Helix-turn-helix protein, CopG family
MSQTKVAITIEEEVLARVDALVRQRVFANRSRAIQEAVQEKLERMDRSRL
AEECAKLDPAFEKAMADEGLSEELVAWPKY
>NE0915 conserved hypothetical protein
MIRCINLLSLVLLLALANGALAEIAIPLLKSHVTDLTETLSSMEISRLEQ
QLTDFEAKKGSQIALLIIPTTQPETIEQYSIQVAEVWKLGRKGIDDGVLL
LVAKNDRTLRIETGYGLEGVLPDALARRIIDEIIVPKFRQGHFFGGLQAG
VEQIISIIEGETLPESEPAGGASLAVENIIPFLFIALVLGRTLQSMFGRM
AGATITGSIAGALTWLISSSIAVALLIAIAIFVISLFEQTGRIIHRGGPG
YRNWPGGGFSGGGFRGGGGGFGGGGASGRW
>NE2385 Staphylococcus nuclease (SNase) homologues
MHFTRALRIQLIPSFFFRAIYPLVVLLILLHAQSGLAETIYRSTDSHGRT
LYSDIPTPAAKPLQPATPPARSKYRVTRVIDGDTIVLENNKRVRLLGINA
PETGNRYHPGEPGGADAKKWLRGKLQGRSVYLEHDRQTHDHYKRMLAHLY
LPDGEHINLSLVEKGLAIANLIPPNLLHANTLIRAQQRAETRKLGIWSMQ
HYQPRPLIKLTEKPFGWQRYRVKAKVLKRNHRFSRLIISDNLDLSFANRD
LALFPPLETYLNRPLEVRGWVSRRKNHFSIRIQHPSALILY
>NE1372 Helix-turn-helix motif
MEKITDSSGNIFTDLGFNPEQSAIYTLRAELMSNLRKTIRERKWTQEEAA
KVLNIGQSRVSDLMRGKWEKFSLDMLITLAIRVGKRIGITVV
>NE0830 DNA mismatch repair protein MutS family, C-terminal domain
MAARCVVLLCIVENCDFRRASCDHHDFACHAAHGGLDQRFQKARSWFLLS
SIQHKVAKSAKVGNTESTHVITTRTMNDTTQDTAASVWRESFILSSGKNP
SGIRDTRPTADNYGVLDAKTFAAVEVDALFDEINQAQTLTGQSILYRSLA
RPVTDAALLQSKQEALRELESNPDLLKVLEQYIKRIAIDEASLHHLLYGE
FAGGLTTDDPRDKTGKDKLEFGGYGYRQFIDGTGFVVDLVEEAEALPMPE
SDYLRTLVQTLRDFARSRTYALMHGPIYVSQGKFMTREEKPRYLLIQRFR
PSMFKWPFISFFLAFVAGLLLFFQNTLNELVASYVGYGLLILVVPIIPII
LQAISASDRDSVIYPLQRLFRQSPELARTIEAMGMIDELLALHRHARSIP
GESVLPEIDMDGRHTLVVSGARNPLLVRTRPDYVSNDIVLDNDKHLLIVT
GPNSGGKTAYCKTVVQIQLLAQAGAYVPAVQARAVPAEHIFYQIPDPGQL
EEGMGRFAHELKQTREIFFNSTPRSLVVLDELAEGTTFEEKMTLSEYVLK
GFHQLGATTILVTHNHELCERLQQENIGNYLQVEFVSEKPSHRLIPGISR
ISHADRIASAIGFSKEDVASHLASLQE
>NE1371 phage-related protein
MPSDFKPMLAVGPGAYEIRIHIMGEWRVIYVAKMQDTIYVLHTFQKKTQK
TSKHDRYRQIIKEITNGKNN
>NE2306 conserved hypothetical protein
MNARVQNHITGRLSLRPPQAESLTRLRQALDAAPEMLGHERDVSAILATL
KAEFPTLADFEREFPSLCFALATGVGKTRLMGAFIAYLHLAHGINNFFVL
APNLTIYNKLIADFTCNTPKYVFKGIAEFAQQPPLIITGDNYDQTGAAVD
DQSMGFAHDVRINIFNISKINSEVRGGKEPRIKRMREVLGDSYFNHLANL
PDLVLLMDESHRYRAQAGMRAINELKPLFGLEVTATPFVESSKGPVPFKN
VVMDYPLARAMEDGFVKEPAVVTQRNFKASEHTPEEVEKTKLEDGVRLHE
TTKVELLTYARENGLQVVKPFMLVIARDTTHAGQLLTLLESDAFFDGRYA
GKVIQVDSSQTGAQEEEMISRLLAVESVDEPTEIVIHVNMLKEGWDVTNL
YTIVPLRAANARTLIEQSIGRGLRLPYGKRTGVAAVDRLNIVAHDKFQEI
IDEANRGDSPIRLKQVILDAPSADDKKVSVQVEPGVAARLGLTDALIVQI
DAAGTPDELVSNTAPTPVFTTEAEKQAARVVMEVIGKYEVRRDLVPTSSA
LLKPEVQSVLLAEVAELLRPMQGSLLAGIDEAAPTLDLSTIVAKTTEIVV
QQTIDIPRIAVVPKGEVTTGFHLFTLDVSQLYLQPGEHEIVGQMLRTNEQ
FTLAAEIGLKEQRPEDYIVHALVDFDDIDYFTHAGLLYDLAGQMVQHLRS
YLSEDEATSILDRDRRLIAREIHAQMMAHFWEKATDYEVQVSRGFTELKP
CNYTAVADQTPCHFRETVEDVGRIKQMLFGGFTKCLYPLQKFDSDTERRF
AVILERDADKWFKPAKGQFQLYYKLGTEQPEYVPDFVVETESIILMAETK
KRDDLKTGEVEAKAAAAVQWCRHASDFTVSVGGKPWKYLLVPHDEINESR
QLTDFLRFAVKG
>NE0388 Domain of unknown function DUF37
MKQLIIDLIKLYRYSIGLLIPPSCRFYPTCSNYMHEALVKHGLIKGLWLG
MKRILRCHPWNQGGYDPVP
>NE2377 BolA-like protein
MVTAESIEHSIKATLPCTWIRVEGDDGHHFSAVIVSESFQGKSIVGQHQL
VYQALGERMREEIHALSMKTYTPEQWEAARTSN
>NE0427 Ribosomal protein L17
MRHRNGLRKLNRTSSHRLAMFRNLTNSLLEHEIIKTTLPKAKELRRVVEP
VITLGKNPSLAGKRLAFDRLRNRDNVIKIFSELGPRYQNRNGGYIRILKC
GFRRGDNAPMAIVELLDRPEAGIISNDSAD
>NE1544 Bacterial regulatory protein, LysR family
MDRFENMNAFVRVVETGSISAAADRMDIAKSVVSRRLKELEEHLGVELFH
RTTRQMNLTDSGRAFYQQSVRILADVLEAEHATSQFHGRLKGHLKVAVPL
SFGLMHLGAAISTFLQTHPDIEFDLDFNDRQVDLLAEGFDLAIRIANLPD
SSLIARRLAPIQAVMCASPAYLERMGTPQAPEELIRHRCLVYNLVSHSDN
WNVYGTTGELIKTRIIPYLKASNGEFLRDAAVDGLGIVLLPTFIVYREIQ
RGALIPILTGYHYAQLAAYAIYPQTRHLSQRVRAFVDFLSQRFEGMPYWD
ACLNQ
>NE2267 Glycosyl transferases group 1
MVVLAPHIEWLMETVHIISGLNDGGAEAVLYRLCSSDKAAEHYVISLMDE
GKYGALLREAGVQVSCLSMPRGRVTLGGLWRLWRLLRQIRPQAVQTWMYH
ADLVGGLIARLAGVKQVFWGIHHTTLHAGQSRRSTIWVARLCARISSCLP
SAIICCAQKALEAHRELGYAAEKLRVIPNGYDLVRFRVDEDARVRLRTEW
NTGSRWLIGMVGRFDPLKDHKNLLDALAIVKYRGVDFCCVLAGRGLDQNN
AQLMAWLTELDLAGEVKLLGQRMDIPDVMNALDVHVLSSSSEAFPNVVAE
AMACGTSVVTTNVGDAALIVGETGWVVPARNPGMLADALLQAYVAMHDEA
AWQVRCTAVRQRVEDHFSLERMVENYHLVWQNKL
>NE0487 possible glycosyltransferase
MQLIYLSPVPWASFAQRPHKFVEWFHGRTGGSVFWVDPYPTRFPLLSDFQ
QFIGKAKAARDEVFIPAWLNIIKPVALPIEPLPSSGTVNIFLWRRLLQEL
SVFAASGETLLVIGKPSIFALTVMDRLNGCRSIYDVMDDFPSFYSGLSRF
AMQQREKALVRQVSVILTSSTTLKHRWSIFRNDVELVHNGLDLDILPPLR
LCSHDNERKVLGYVGTIAAWFDWEWVIVLANVRPQDIVRLIGPVFSTVPA
VLPQNIEILPPCNHQAALLAMQEFDVGLIPFKKILLTESVDPIKYYEYRA
LGLPVLSTDFGEMALRKSTDGTYLSRGVEDVGELAALALQYHASSEVICK
FREFNSWKSRFDNAGII
>NE0193 probable Mg(2+) chelatase family protein
MSLAILYSRALCGMDAPLVTVEVHLSNGLPKFTIVGLPETEVRESKDRVH
AAILNGRFKFPARRITVNLAPADLPKESGRFDLPIALGILAASGQIPADR
LNQYEWAGELSLDGRLRPIRGALAMTYSASRSGRGFVLPEQNAGEAALVQ
EAKIYPATSLLQLCAYLTGQKAWEPYSTKPNDEETAGSYPDMGEVKGQMQ
AKRALEIAAAGGHSVLMIGPPGTGKSMLARRIPGILPPMTGQEALESAAI
QSLGSGRFNLTDWKRRPFRAPHHTASAVALVGGGGIPRPGEISLAMNGVL
FLDELPEFDRRVLEVLREPLESGYITISRATQRADFPARFQLIAAMNPCP
CGYLGHYEGKCRCTPDQVARYRGKISGPLLDRIDIQIEVPALPKEDLLRP
GPGEPSAVIRSRTTVARQRQLDRQNVPNTELRVQEIEKLCHPDAEGKCML
ERAMTRLNMSARAYHRILKLARTIADLSGSEPVLGKHVAEAIQYRRMDIS
>NE1588 conserved hypothetical protein
MRVIALSTLKVFWENDSSRADAIQPTLAWHRHALKADWSLPAEVKADFKN
ASILKDGRAVFNIAGNKYRLVVWINYPYKIVYIRFIGTHAQYDRIDAQTI
>NE2094 hypothetical protein
MFDVNYEKQAQDYYSKAPIIILGSGASATHGMPGMRGLAQHLTDKTDVSG
LSDAEMEPWRSFCRTLTDGVDLESALRQVAVSEELTCRIINSTWSLINSE
DAAIFKNSLQNSSMFPLSRLLEHMFKTSLKKINIVTTNYDRLAEYACDQS
RIHHYTGFTHGFFRQLATPDELTCSRRVNIWKVHGSLDWFQSPLEDTIAI
SGAQEIPENYSPQIVTPGTQKYQKTHLEPFRSIINNADIAINEAGSYLCI
GYGFNDEHVQPKLMAKCQRQGAPVTIITYALSDSTKKLILGGKAQNYLAI
ERGATDGQSVVYSSLSSSSFTVEKNIWSLEGYLSLIM
>NE0519 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQRNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE1626 putative TonB protein
MQITAVFNSPMQPDYRIFPALLISLLLHTAIIAGVSLSPPEQKTTKVAKS
MDVILVNSKSQTRPVDAKVLAQANLDGGGNVAEDRRAKTPLPVLPGVKPV
TSLSEVTRKTRQLEKEVKKLLTAAESKAKVVQPLPHTKPGQDKPVQNQPA
ALNPDNHDNLLQRSLEIARLEAQIAKDHEAYQKRPKRKFIGARTRDYRFA
RYLEDWRLKVERIGNLNYPDVARKQKLYGSLQLTVGIRSDGSLESIGIDR
SSGVQVLDDAAIHIVRLAARNGFSPFPPDIRRDTDILHITRTWVFSRADK
LLSE
>NE2365 conserved hypothetical protein
MLQRSVFFLSDRTGITAETLGHSLLTQFDGIEWKKHYASFLDSAAKAQAV
IEQINTIAEQEGQPALVFSTLLDPVMLASIRRADCVLFDFFETCLGTLEA
VLQQPPARIPGRSHVLRQDASYFRRIAAIQYALNSDDGANAKILADADVI
VVGVSRTGKTPVCVYLALQYGVLAANYPFTPEDMGAIQLPPLLQPLRKKL
FGLTLNTSRLQSIREERYPGSHYASFAECQRELQWQNELYRQFDIPSINT
TDVSIEEISASIVNRAHLERRQHGT
>NE0473 hypothetical protein
MKVITYYQVIADSTAQTDCAFFIEFMLTVIEETLSESQIITPQATLQDIP
QAVLEIMEQYPGLAEFCQHPRSCTELQAFYHLNDREHFRKAVLTPLLDAG
WLRRTQPDKPNSPRQKYFREH
>NE0691 tRNA pseudouridine synthase
MIPPKRNKHFFVKIVLALEYDGRGYCGWQKQPGCLSVQSRLESALSGVAG
RQIQVVAAGRTDAGVHALCQVVHLETCVRRPLNAWIRGTNALLPGDISIL
EASEVSDDFHARFSATERTYLYYLLSRPARPGIHHGKIGWVHYPLDLEKM
QMAAKLLIGKHDFSAFRSSECQSRTAIRQLTRLNISQHQQLFVFEFCANA
FLHHMVRNILGGLVYIGRGKYPPEWMRILLEKRDRTLAAPTFSPDGLYLS
GVRYDARWNLPVFNVTRPLDII
>NE1189 hypothetical protein
MSKLLFKLRNVPDDEAEEVRALLSAHQIDFYETSAGNWGISLPALWVRDE
TQYSQARELLDVYQAERSAHVREEYARLKQEGKHKTVLDSFRENPFAFIA
YLFIVYALLYLPYKIITGLSSQ
>NE1558 putative transposase
MPRTHGYAPIGKRCHGKCNWHARGRINVIGALIGKCLLTVGLFKNNIDAD
TFLGWTIHDLLPKLPPASIVVMDNATFRKRQDIQNVITRGGHTLEYLPAY
SPDLNPIEHKWAQTKAVRKQQNQTVEQLFKIESFYVT
>NE0887 hypothetical protein
MLTESGITPYPALNQLSLQYSIFGFLLRNITKPGNHFAARCYLSALDDTS
SLPPGQIFNLLTVLDFYYQTERNQTERRAPARSEGRGSEGQKILYVPVTR
KVSLTGETFCSF
>NE0537 conserved hypothetical protein
MVPTRSFWVWLHRWAGLIMAAFLIIVGATGSLLAFYPELERLINPHIYPR
QVLEKKLDMATLAELAEQRVPAGRVNGVLMEANQEATLISMDARPDNADP
PNKLGFDQMIVDPYTGEELARRQFGEISEGMINFMPFIYKLHYALALGKF
GVWVLGICALIWTIDCFVAFYLTLPQRRRSVTAPASGNHGNWWRRWQPAW
KIRWHSGSYKLNFDLHRASGLWLWPVLLIFAWSSVYMNLWDTVYTWTTRA
VMEYKAPWTEFSKREIPLAEPGVDWRQAQRIGEQLMSEQANKHGFAVEQV
IALRLDRGNGTYQYIVRSSKEIQDRRGRTSVFFDADTGELKLALLASGQY
SGNTVTNWLFALHMANVFGLPYRIFVCVLGLVIVMLSVTGIIIWIRKRAA
RLSPKNHQYDSRYLRPSVHRDS
>NE0493 hypothetical protein
MMSHTLTAEELYAEIKRMPIAERIRFFSLLADSAFREDDFTHEQIFGETY
QEPFSAPEAAEYLEISLPTLRRYVQSGKLVPSCIVGRNQMFSAQTLRTFK
RNRGN
>NE1531 TonB-dependent receptor protein
MKEKKVKQAGLYESSGGEIMARKPLVIVLAGVLAAMAAPLAQAHEVDEPV
QLETIKVTTSKDDLPLTVPSLPEVQQKMRQIPGGANVIDSESYATGRAST
LQDALGYSPGVFVQPRFGAEEARLSIRGSGIQRTFHMRGIKLLQDGSRLN
LADGGGDFQAIEPLAARYIEVYRGGNALQFGSTTLGGAINFVTPTGYDAE
RFRARGEAGSFGYNRLMLSSGGVHGSLDYFASVSRYSQDGFRDWSEQENW
RMFSNVGYRVNSDLETRFYLTYTKTDSQLPGNLTKAQLRANPKQANAVSL
AGRQKRDFDLIRVANRTVLKLGNSQQLEFSAFYSHKKLWHPIFQVLDQPS
HDYGLGLRYINEMPIAGWRNRFVVGFEPSWGTVMDNRFFNNGGRAGARTA
KFDQSASTYDVYAEDQLYVLPELSLVVGAQYSYTTRKQKDLFGAGAQDRS
KNYQRFTPKAGLIYELNPHIQFFSNVSTSFEPPSFSELTAGPVVAPVFAK
AQRAITFEVGSRGHFGIAEWDVALYRAHVRNELLARVDPITSSSLGTVNA
DKTVHQGVELGLDIELVKMFYLRQMYLFNDFKFDGDASFGNNQLAGIPRH
FYKAELTYRHADGYYAGPNVEWSPQKYYVDHANTVYANSYALLGFKIGKR
SKSGFSWFAEARNLTDQKYAATTGVTDVVKYNNAGVLQDQFFLPGDGRSF
FAGLEYRM
>NE2433 TonB-dependent receptor protein
MPMRNTNLPANPCVTGTCRSQALLRQIMLLACLSMAAAFVPVRAQSTTAP
VQPQASLRYDIPAGTLDQVLNHFAADAGITLSIDGALTAGKHSSGLSGSY
SVPDGLKALLAGTGLEAVATPGSGYVIRRATAATITTSGTVLPTVTVAGH
RAAIDEAPPLYKGGQVGSGARMGVLGNTAVMDTPFSVTSYSAQVIENNQA
RSVADIAAMDPSVRMSSARSNINEDLTMRGFSVSSADFALNGLFGLTPYW
RAPLESIERVEVIKGPSAALFGMAPGSSVGGVVNLVPKRAGDTPLMRITT
GITSNSLFGGHADIGGRFGPDNVFGARLNVMHRGGNTTIDGQSTYQSLGS
LGLDFRQRSLRASLDLLWQQERINNVVRQFQLSPGLTAVPRVPDNTTAYP
GYGWTDGRNISWLFKAEYDISQAVTAYASYGMRKLNWGAIAANPILLNTA
GDYSYAGGWQRMPTRTQSLEAGMRGVFATGPVSHSTALGFTWLDQSQELG
FYTGLPPGISNLYTGQLFSTPSIEGINNLARPYLDTQLTSVVLADTMSFW
DDRLLASLGLRYQKVEGQSYNFATGMASGPRYDKSAITPVVGIVFKLRPS
LSLYASYIEGLSKGDTAPISAAITNPGEIMSPYKSKQKEIGVKFDQGNFM
TTLSLFELTRPSAGISGTTFGVFGEQRNRGVEATIAGEVVRGVRVLGGAS
YIDAVLSKSVSAMLKGNKAIGVPEWQANLGAEWDVGFLPGLTLTGRMVYT
AKTFVDATNTLKIPDWVRFDAGVRYAARIVSRPVMFRLNVENLLDKDYFG
AATAGYLFIGTPRTINFSASIDL
>NE1720 Cytochrome c, class I
MLCTYHAGQWFNNLLNFIDFIRLQFGKTANVLNLNDCLHIWLFSLEVRSL
RDVVKMKLILSIMILSGCLLFILAGCARKNDYMPPAGATGEKIFKGACVQ
CHTPVNGKVMILRSEMANKEAIIERVTNGKGFGMPAFPNLTGDSVQNLAE
YVLENSVTR
>NE2517 hypothetical protein
MPISESQLETWSHQGSITQSSTTYNTIKSVLEASTTPYASKNFKVFLQGS
YGNDTNIYAESDVDIVIRLDDCFHSDLESLSDDEKSAYKQAFNDATYTHA
DFKRDVLSVLEGQYGSAVKAGDKAIAINASGSRRKSDVIVATQFRRYFKF
RSASDSEYVEGICFFNATGERIANYPKQHSANLTAKHQASSKWLKPMVRV
LKNMRSRMVEDGLIKAGIAPSYESPRVLRRLQLLREWSHEQSNKVFP
>NE2230 hypothetical protein
MALTKEDIAEIKELIGEVITERHPEIMNNNVRYELEIRERILRVEEGLKH
QRELMQEGFNRMDKRFEQIDKRFESLIAEMNTRFAQVDKRFEQVDKRFEQ
IDKRFETMTARMDRFMIWSFATTLTVGGIVVAAIKYLP
>NE0053 D-Ala-D-Ala carboxypeptidase 3 (S13) family
MKSFLKVAWLLWGVLIFPAYAVDLPDTVRQALKKAGIPESAVGVYVREVG
ADRPLVSVNADVPMNPASVMKIVTTYAGLEMLGPSYTWRTEIYANGNLEN
GRLQGDLIIKGYGDPSLNLENFWLLLRQIRQTGLRDISGDLVLDYSYYDL
PAEDPGAFDGKRYKTYNVAPEALLVNYRTSTLHLFPEPQYGRVRVTADPD
GQLLNVQNHLRLTQKKCGAGGVRVNIRDDVPQQGHVTVALEGDYSAHCGP
TVYYLSLHESTAYIHQLFTGLWKQLGGTFSGRVRRGVVPETLRPISVYQS
PPLAEVIRGINKFSNNVAAKQLFLSLGEAQSKGNGLVSPDLARIGIRQWL
LSKSMSFPELVLENGSGLSRKERISTRHLNDLLNAAYFAPTMPEFMASLA
VVGVDGTTRKRLKKSAVARKAHIKTGTLKDVSAIAGYVLNHKRRRYAVAF
TVNHPKSGEARAAMDALLEWLYIKI
>NE2056 conserved hypothetical protein
MKYWLYTSLFLALIFSFRSYANSDIWILIDTLEQRLSVMRGDKAQLAFNN
IAIGRYGASSSRMKGDNQTPLGSFQISWIKQHHRYYRFFGVDFPNQEAAD
LALAEKRISRQAWLSITKAIESSRLPPQDTPLGGYIGIHGIGRGDRTVHA
RFNWTNGCVALTNAQIDELSSWIKIGTKVVIR
>NE2160 hypothetical protein
MVKRVLMIAYHFPPLHGSSGMQRTLRFARYLPDHGWEPIILAPSPRAYQQ
IDSGQLADIPQQVRIHRAFALDTARHLKVMGRYPRVLALPDRWVSWWLGA
VPAGWYLIKKYKPDVIWSTYPIATAHLIGLTLQRLTGIPWMADFRDPMVQ
PDYPVAQWHNLLIRTIVSLILYNRRLQKWCLLTDR
>NE1114 Bacterial regulatory proteins, TetR family
MPGKTREDSQRTRDTILDAAEQVILCKGMGRTTMGDIAHAANVSRGAVYG
HYKNKIDLSIAMCERAFASMEIPVRIKGESALQTLYREGIYLLRLYSEPG
PVQRVLHILYLKCDESEDHLALLDIRNEWEAQRMADTEELIIEAVANAEL
PQTTDIKLANLYLHSLVDGIYSTLFCTNRVPEHKWEIAEKLYQAGFAGLK
TLEN
>NE2168 hypothetical protein
MLPFSSYKHVLIYYIAVAAVLFSGFFSYVVAPHRQEIEVGHVELVNLDSS
LIENRKFSDFTNAYIPEITEHLTMARSGWLPLWSNNTELGRPLYQISGFS
SAYLPSWVITRLVDGPWRFITTLSLGFCFLAGLFVLLFTREVGLSPIAGL
IAGLGLATSPLFMYWLTFPMFPAVWCWAAGALWAVTRLAKRPDILGWGVL
AFSGYSLLMTAYPQPVVFHAYLLGGYGLWLAYHQARVSRLELAKFLTLAL
SALVVGAALAFPVYRDLFILSSESARVAPDPSFFTMVLPKFASFTELVRF
FVLSTIPEIFGNPIAPSFPFSYDGLSVTLIAIFFGVVALVTSFKETWGWW
LAILIFCLLAFVHPLYVLGVKYFGFNLSRSTPLGSITLPLTIITAFGIDA
LARRTHHRQFSSAVFAGAAVALVVIAIGVAYGVSQHISIHWEIVIGMLLV
TGLLIAQYDRYRPLFLMMALVLVLGMTSYPLMLKQDPAQIAMTSPLVEKV
RENLPAGSRYAVAAPGISVLPPNLNATLDLSSVHSYNSLSSTRYHTLIKA
LGGEVQTYGRWNGAIDPDYAGTMFWMSNISLILSSGKLAHENLEFLSEES
GIHLYRVVSRMGDSMQVTPPQLDMSSTKLVLDDPRGMVTNTPVKILDQGD
VLEFEVNSSAPSVFLLSQKFHRDWEALAETNQGWQAAQTVEVNGVFQGVL
VPQETRRVRLEFKPLARYAWIAHVFWIFLFVLIIFKFSQTFRRRVLERV
>NE0065 conserved hypothetical protein
MTRSLFLKPSVWLTVLLLLTLWLDKNLQRPDSQQDSGTQQEIDYIIENLD
GIQINHELKVNRFFSADKLTHYPVGDITQLEHIGLVSIEPDKPLLRVTSG
RAELAGGDNDIFLTRNVAIIRGEDKDKDKVTMLTDFLHLIPDTDIAKTDQ
PVTVTRMNSVINAIGLFMNNQTGEILLQSRVTAHDDRTPRTAR
>NE2099 hypothetical protein
MVARKCFYSFHYQPDNWRASTVRQIGAIEGNQPAKDNDWESIASGANQDE
KIKRWIAEQMQGRTCTIVLVGTSTANRKWINHEIVKSWNDGLGVVGIRIH
GLKDRNGNTSAMGKNPFDHITHGPSKKPLSSLVKCYNPAGATSQDRYAWI
AQHLENAVAEAIRIRKANS
>NE2500 conserved hypothetical protein
MLTHPVQTRSRLAHAMFALLNPIPFGFFVAALIFDAIYACNANVFWVKSA
AWLNVIGLIFAIIPRLINLVHVWMPARRSSRVEKLDFWLNLVAIITAIVN
AFVHTRDAYGVMPEGVWLSAVTVALIAIGLIVSALQQATTRGGRHE
>NE1917 hypothetical protein
MSEQVYSGVDQDEDGGLTPLGRIVIDAWVFGILPESEMCTGWSMSQMQNL
YEEVYAAWGPYAHLPSRLPPELQQRHSFFYSQRITVAKNNGWDTDLSDES
>NE0321 TonB-dependent receptor protein
MYKTYAIWIGVVLFVCKDVIAQPDATGQEYSGDIPKVTMKEITVSSEALA
IPNERLLLDTPTTTGSRLGLTPRETPASINIIDRATFELRGAQTTQQILE
RSPGVTVSDQPGAAGTVCMRGFCGAQITQLFNGITVQYDAVAARPIDNWI
TERVEVLGGPSSFLYGQGAVGGSVNYISRTANRDQQGHESLVLLGSWLNR
RAAYGYNGRIGDTNNWLQIHAGYKGSNGYIDKTRHNSGVFSFSLLSDLTS
RISNTVAVEYQIEAREGYWGTPILNPVTAGKYDPETRFRNYNAENSVFDQ
QVIWVRDIVDFRLSEATQVRNTFYWYDAYRKYRNVEVYRWNGDNTLINRS
ASFAVDHKQNLIGNRLELSHQQNLFGLPARWLAGTDIAFNDQTRFPSTES
GLAVDTIDPYNFTVGDYFDNPRASGPIKDRRNKLFTVAGYAENRLTLFPG
FNLVSGIRIDSIQLDSRYFSPATATEPAAFSRNWTPVTWRAGFVYDVTDS
FNFYTQYSTAASPPAGVLTTTNLNSIRDFGLSTGRQIEGGMKFDFWGKRG
TATIAGYYLQRKHLSTRDPDNPINAIPIGAQSSRGVEVNLGVRLSPQWSF
QGNMAFVDARYDDFNELVSGVSVSRKGNRPENVAKWVANTWLTWDFHPDF
QWMLATRYVGDRYANAANTVPVKSHVRLDTQLAWQAHRNARIIGRIMNLT
DTDYIEWATSAPMYLIGAPRSYEVAVKLDF
>NE0480 hypothetical protein
MQQVALISAVFAIANEVKLEAIQLVDCRGAGSPWPDSLAKTRCDALQQSF
INRRHCETAQLVKQSSKILLLLLPKLKWHPDQQIFTVFANANGMKQSDTF
VVITIAGSPRIIPGCRRLRLLAITSFAMTGKIRESCSCLQQK
>NE2012 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNN
>NE2212 putative transmembrane protein
MHHMESTFLVDNIFFIIAAMLSGGMLLWPVVARNSVRDIDPKRAIRLINY
EDALVLDVRDDSEYAGGHPPNAKHIPAEKIEDRWQELEKFKDKPVVIIFT
PGLRVGRAGAVLRKNGFKQVFNLNGGIDTWRRENLPLVKK
>NE1723 conserved hypothetical protein
MNKSGTIWKYLSAAPHRSLFLMGAFQAVLTILWWVIELAGRAGLMESVSF
SSIMPIWVHGFLMVYTFFPFFILGFLFTTFPNWMNGEKIKANQYMTVCLL
MAAGVVLFYAGLIAGKFLLSAGLLLMLSGWAYAIIILFHVLFTAPAGDKR
HAWVAGMAFIAGWLGLCAYALWVLTDGREMLDFSRQAGIWFFMLPILMTV
SHRMIPFFSSRVLENYVVVRPFGLLWLMVGCTVVHGALQLAGLQEYVWWA
DMLLAGCAFYLSYIWRFFRSFRVRLLAVLHISFAWLGIAMLFFGLQGMLL
WLSDGDRYWFGLAPLHMLAVGYFASMVLGMASRVTLGHGGFPLVLDNLTW
LLFIAFQVVVLLRVIPDIFPGVAPLAPQFYLIASSVWLSCFLLWVIKYAP
IYWRARVDGKAG
>NE0260 putative antirestriction protein
MCTEIRIYVADLAAYNNGKLHGVWINATDDLEAIREQVNQMLTDSPEDFA
EEYAIHDHEGFGGFILSEYAGLETAHEVACFITEYPDFGSELLDHLSGDL
EEARTAAEENYCGCYQSLADFAEELTEDTTQIPVNLVYYIDYERMARDME
LNGDVFTLETGWEEVHIFWNH
>NE2178 conserved hypothetical protein
MENPAMTTRPFDARQTNITDLIQRLDGCATIVTGNRRLARALHQAFNQAR
SAEGHGAWPAPDILPWDAWLQQLWQEVVISSRIESAPGVLLTSHQEYFVW
QEILAEQSGDVPLQATNETVARIMEAWQTLHAWCIPCREADFGHNADTRL
FWQLASMFEAKCRKNSWLSVAVLPGILQKYVQIDSLSVPNELVLTGFDEW
TPQQSSFLRAFEQTGCSLQWLQLSGQPDRIGKLACADGRDEIRQAARWIR
QRLEENPAARIALVVPELAAQREMICQTLDEVLIPQALQPEHHDRVRSYN
LSLGKPLDRYPPVSLALDVLGLSETVIELPHVSRVLRSSFIAGGDREMNA
RALLDARLRESGEWNLTLQKLLTNAARSGQPYSCPLLAECLSNLMKQVKV
SLAPTSPGEWAQRFGQWLKAIGWPGERGLSSEEYQVIQAWQGVLREFSTL
DWVIRSVSLTEALQQLRHMVAGTIFQPESAEAPVQVLGLFETSGLQFDYL
WIMGLHDGVFPASSRPNPFLPLTLQREVDAPHSSARRELRVAAALLQRIT
TNATEVVISYPQRKGDEILDSSPLIDAFPALSEEMLAMGTQSAWRDSVYH
SRQQEVLSEDVAPTFVGTGIPGGSKLFKLQAACPFRAFAELRLVARPLGR
IQIGLNALVRGTLLHRVMEMVWAELDSLAALANLSPGELNALVAGKVNEA
IYEIAPRYPHTFGERLQALESKRLHALVLAWLEMEKQRPPFRVSGREMET
ELELNGLRINLRIDRIDTLEEGGELLIDYKTGEVKASAWFGDRPDEPQLP
LYSLAFTDDGLAGIAFARIRAGDIAFEGVASEEVSILGIKSFENLRHTRE
AASWDEVLAGWRQTIEQLVQDFMAGEARVSPKQYPQTCTYCELKPLCRIG
ESLEAVDDC
>NE0316 conserved hypothetical protein
MQYHEPVWIPAHLTTGEPDSFAAFTLEERLPAILDNLANHADARTDQALQ
QLAVEIREGEITPLPPVILGLFDQVIKPYTGRRWTDMPFLTVELYFYARI
LLAFGHTATTLVDPFHPIKNTVSLQAIESLTTMTGYCDSDCDITGLLRWS
MTGNTADLSQQVVTDAGQISLLVDESYVAGGLFDSGLNRIDFVFDNAGMD
VLTDLLLILRISRHCSRIVAHVRPWPMFVSDVTMTDMKYLIRKLVTSSIP
AAKKLGEDITQLLHQNRLIFRSSSALGLPVCFCEEEALTRETFENTELVI
FKGDLNYRYFVGDRRWPHTMEKRYFFERFSQPAICLRTLKSEVLVGLPAD
IATRTLHLQPDWLTSGRYGIIQVFAH
>NE1643 conserved hypothetical protein
MFDQLVIDALDFVRSGKSLQGNVPLLNLERLRDYLTNSAGELAYLVTGLL
DERDRPLLKMSVNGIIDLSCQRCLEKIEYTLDVKTALLLARNEDELSRYD
EDMFVDAIYASNELDILALIEDEVILSLPVSPRHEDTAGCHPSTGTGIHE
AAVKEHPFTVLASLKQSH
>NE1211 putative ABC transporter permease protein
MFYHDFRTMLIRLRAQFIKELLCILRDPRNRVVVFVPPLMQLLIFSYAAT
MEVRNLDIAVYNQDTGRAAQELVWHLEAARFIAQVHHVHSNTALREQLVQ
GKVIAAIAIPADFSRTVAVSGSGHAQVLVDGRRSNSGQIVVGYLSSIARD
IRFTADTAPIPETSVTVRHWFNPNLVYLWFMVPGLTGTLAFFSALMITAL
SIARERELGTFDQLLVSPASTLEIILSKSLPALVISSLLALLMITLAIWF
FRIPFTGSFGLLLIGLVLFILSAVGIGLVISAISMTQQQAILGGFVIGVP
TVLISGFATPVENMPLLLQWLAQAIPLTHFLIIIEGCFLKALPPRDVLAN
LWPLAVIAFTMLPIAMIFARSRLQ
>NE1446 NifU-like N terminal domain
MSLKSIYQEVILDHNRKPRNYGALRSPTHHATGHNPLCGDRIELDINMLD
GHIEEIAFQGESCAICKASSSMMTNAVKGKSHQEAEALIQEFREMLVSGE
DKSFDHLGRLKVLAGVRDLPTRVKCAILPWHTLHAAMNSTDSATTEADDH
ASKLVANNH
>NE0587 conserved hypothetical protein
MHTDMKSVGTVSMQEAPLLEIAVVGHTNTGKTSLIRTLLRSTSFGRVDDA
AGTTRHVERATIFAGSEAVLNLHDTPGIEDVYALQDKLHLIATRNKRSTQ
SELLEKFVAATPLNDPLEQEAKVIRQVLRSDVLLYVIDVREPVLEKYLIE
IEILGKAMKPMIPIFNFTAAHRAELDLWRKKLAAFNIYASLELDTVAFAF
EAEKRLYQKIQSLLEVHYTRLQRLIDHRARVWNQLCMSAARRIAGLIITT
ACYREHTGDERSSAGDTSSAAIRLQDFIRQAEQHCLVDLLKMFNFTDKDI
ELQKIPVQNGYWQLDLFAPGVLKAYGLDIGSAALKGAAAGAGIDLMTGGL
SLGVASMLGALAGTGWSTFRRYGKEIQAKIRGTRWLCVDDSTLQLLYLRQ
RQLLDKLMNRGHAACHTDQVSQQPERGELPDGWQQIIDMLRQNPAWGRSP
GLHTDDTRQYSSIEKRLIDALLKNPAL
>NE1525 putative plasmid stabilization protein ParE
MAEYRLSPAAQRDLDGIFNYTFQQWGAAQAVRYIDILEAACTELVETSSQ
GQDCSYIRPGYRRRHVERHITTE
>NE0357 DUF163
MKFHILAVGNKMPDWVRKGYTEYCQRMPKEAELLLVEIKPEKRVGSKNTR
QLLQAESERIRTVLPPGCHIVVLDETGKQATTMKLAEMMDRWMGSGQDVA
FIIGGADGLHQDIKQMAHEKLALSAMTLPHGLARVLLAEQLYRAFSINRN
HPYHRA
>NE0746 hypothetical protein
MTYAEVAKKIRNRLRRYSLISIINVGLNHLTQQHDNKEKALRAMPWLPAL
VMKLAIEDEMISMHGDLCPSAEFDACCNAIWNAKRGLDESVQVALLGVRA
LMHAQFIFQRSETFGFLRWAALISRIDASHPCRSLFERVFSMTPDDFMMA
AILLISQFKKEAPQQPIDLRDYSALPEELTKPLYQLVRLLSKDLSELRVQ
LQGELRSRLDSKTKRSARQESERHEFPWLAKYPLLKLDQTRVLAWNPTIF
FHGLEEFVHIRLSEFGQDYTDSFSQVFEDYVIELIQESGTHAITDQEFKC
LGNKGMSAVDALIPHAEGNVFIECKMSLFADAVLLSDHPPFVSEKLKRIR
KAIVQGWKVGDLLRSDKIKLSDAKSADNDYLIVVTSRQLLFGNGLHLKQM
VDEQFFDHIFPESNFMSPSKEQLSRMPPQNITILSIEEFEHLVGAVKSKK
VTYLSFVQQLSKNASNPKTAKMVADQEIRKYVDKWYIPNLLTNSRDRVVA
QLNAVFNCRDRIKSRTYK
>NE0283 possible S-adenosylmethionine-dependent methyltransferase
MSSHPPIRSYVLRQGYFSNAQRHAYESLLPRYGIPLTEEPVDLDSIFGRT
APGILEIGSGMGETTAEIARQHPEKDFIAIEVHAPGIGSLLGQIEKHRLT
NLRIIPHDAKLVLQQMFTSESLDGIHIFFPDPWPKARHHKRRLIQPDFVS
LLCDRLKPGGYLHIATDWEDYATHILHVLRSEERFVNTAVDYAARPAYRP
LTKFEQRGMKLGHTIRDIIFTRTA
>NE1101 Sigma factor, ECF subfamily
MSADEFILQQKVHALYSNHRSWLQRWLYRKLGNACDAADLVQDTFISVIT
GGYADDIREPKPFLATIAGRLVAHRWRRNQLETAYLEALAALPVELMPSP
EAHVLALEALLEIDRALDGLPTRVKEAFLLAHLEELSYAQIAERLNVSTS
SVKQYLTRANRQCLFALSV
>NE0635 hypothetical protein
MLTLPLRHQYLIGSILIILMIATREYHFASLHTLPGASWAVFFLAGVYLS
SSWSLLGFLVLAWILDFSAYFTAAGSDFCLTSAYIFLLPAYGALWVAGRW
FAARYQFSWRALASLSISLLIGAMLCELFSSGGFYFFSGQFEETTFAEFW
QRELHYFPLYLQSLLFYVGTAATIHTLFVLIHKSRHPQINATG
>NE0728 Sensory transduction histidine kinases
MATKSLRRQIMLGMLAYTLLLSLGIATYGVTVIKNVEHLIWESLLRSEFE
YFLECQRTEAGYRWDDTELLRLFGKGSDTPIPAEFDLPAGVHDGISVEKK
QFVVLASGIGPDRSVMTLDITNIERVERQLIWMILGSAAVLVCILVLVTI
FGARRLVRPLNSLARAIAELSPDQNGQRVQIEPHAPKEAQVIAERLNSYL
VQIDDFVERERKFIRMASHELRTPLAVVVSTAEVALDPLVSPHSTEAHLR
RILATAHEMQNLVTLLLALARDPARLQSMMELVDLAELVPSIVQDHRLLA
GSKELAFDIITTPPCSIHAPRQIVSATIGNLVRNAIENSEHGTIRVITTG
SGVTVQDSGHGVSESERSRIYTQLARAGSAAAGGIGLELIARACAHLGWR
LDIDSLESGGTKATLTFR
>NE1986 hypothetical protein
MLVFAALALTIAGCVYLYLASPNQKWLVQALPGRPALVAGGLLLAAGLAA
WITVLRPLAGFFVTLHVAMVCLFAFPYIAALRGKGRRN
>NE1997 glucose dehydrogenase B
MKEQSLSMKKNHNSTPVSSVCIVAAIMLLFYGVGFTQAQDFNTPPPNAPY
QQPAFEGQTRAPIIEKNVRLNVQVIADGLVHPWGMDQLPDGSWLVTERPG
RMRLISADGKVSDPIAGLPNVDARGQGGLLDVVVRDDFAQTRQIWWSYAE
PRGKGHNATAVATGILSKDGSKLTDVRVIFRQNPAWNSTAHFGSRLVFDH
DGMLFVTTGDRSLPQPRILAQDVGTHIGKVLRINPEGGPAKGNPQIKGGQ
PEIWSYGHRNLQSAALDPDGNLWTVEHGPRGGDELNQPRAGLNYGWPIIT
YGLDYNGRAIGKGLTAQDGMEQPVYYWDPVIAPSGMAFYQGELFTEWQGD
LLIGGLASQALVRLTLVDGRVTGEARYLQGQGRIRDVDIAKDGAIMILTD
AEDGTLIRVTPAR
>NE0265 hypothetical protein
MTKREISGGMVHELPEDLKKALIAHPEALETWEDITPLARNEWICWVESA
KKIATRNKRINWGCESLSEGKRRPCCWPGCPHR
>NE2113 hypothetical protein
MIVVDSNVLAYFYLPGEYTATAEALFEHDPDWVAPVLWWSEFRNILAGYL
KRGNLTFLQAYNLQCEAEDLLASAEYEVNSPSILELVRDSECSAHDCEFV
ALAMKLGAKLVTMDGKLLRAFPGIAFALSMS
>NE1780 Phosphoglycerate mutase family
MKKLVLLRHGESIWNQENRFTGWTDVDLTPKGLKEAEEAGRLLRENGFSF
DIAYTSLLKRAIRTLWIALDEMDQMWTPIELNWRLNERHYGALQGLNKAE
TAKQYGDEQVLVWRRSYDIRPPSITINDERYPGFDLRYRNMSSGDIPLAE
SLKDTVARFLPYWNQSIAPQIKAEKKVIIAAHGNSLRALIKHLDNISDQD
ILNCNIPTGIPLVYELDDDLKPLNSYYLGDAGQIGEAISAVANQGKSGA
>NE0297 conserved hypothetical protein
MFGFLEKLRQSGVVGMNQRNADFILRYNKRSLYPLVDDKLRTKYLATASG
IAVPELYGVIEIQHDVASFPDIVKDHEDFVIKPAHGSGGNGICVITGRLN
NRYRQSNGDLLTEEDIEYHVSNILSGLYSLGGQPDQALIEYRVKFDPLFE
TVSYRGVPDIRTIVFRGIPVACMVRLPTRMSDGKANLHQGAVGAGIDLMT
GRTTHAIWQNYPIEQHPDTGATISGLAIPHWYKLLTLAAQCHDLVGLGYL
GVDVVLDRELGPLVLELNARPGLSIQIANRRGMSRLLEKIAKLEEIPAAA
ADRATLGQELFLENQT
>NE2387 hypothetical protein
MMKNLFVLLQSITAIFPVSIFFTYIIMDEGDQFTYEHYLVTALSAFPFFM
VLLIKYFISGFENK
>NE0803 CDP-alcohol phosphatidyltransferase
MNMTTAFFVHIIGNSKTSLWGMDGRTRLERMLGTIKTVAVVKEVSQLTET
DPVLLLRADYLFDSRVISALMALHEPAVLVAPMDETPVAILIDGKNAAAM
CEVLNEKRPAPDDLPVRTLKDLPIQVQQNLKKKDPPYVLPIRRCNQAELE
SELFAASYKGVTDLVTKWLWPWPAFMVTRLCVRLGLKPNHVTLLSLVLAV
LAGVAFWYGAFGIGLIMAWLMTFLDTVDGKLARVTLTSSKLGDVLDHGLD
IIHPPLWYLAWGVGLMATATPVADLELLVWLMFIGYVGGRLCEGAFQFWL
ASFDIFIWRKVDSFHRLITARRNPNLILLTAGWMADRPDIGFILVVLWHL
LSTLFLAYRLFAAWQSRSQDGVLHSWMENIDPVRDRQQLAVRIFTRLPFP
QRYVNPAADNE
>NE1432 DUF193
MRCPFCGAEDTSVVDSRISEEGARIRRRRRCVECEKRFTTYETVELRFPQ
VIKQDGNRVEFNREKLYTSFARALHKRPVPTGQVDAAIERILQKLLGSGV
QEISSRTIGEWVMQELYSLDKVAYIRFASVYRSFEDVGDFQEVIREVQSS
PQNGDGPSS
>NE0895 hypothetical protein
MLVPPVRKQRRKCIMSNQKNRYGGLFCALINSSFSSWLIKAMERLGFKQF
WINFFLWLPCRLAMLEIKLEYGSLENFVNRDSD
>NE2236 conserved hypothetical protein
MSRCKRCLLPHSAPGAAVDSSGICAFCRSYVPNDKNLSDQLHQQRQLDLE
KALADCRGKGPYDVLVCLSGGKDSLYLLYRLKVEYKLNVLAFTTDVNIPD
IAWNNIHRTIQKLDIDHLVYRPSAQFYRKLFRYLLMNQEERGAVYTVSYV
YAPLFEGDALQVAMEKGIPLVLAGYSPGQPEPERMEFEFSRKLITETDWT
PPGLRQSGEFTEEELARFWNPKRFPADTRFPRYLAPFHAWDYDQEAMIRK
VHDLELVKRRAHGNPIYSNYPINWLLMYSDLKHFGYNPYAPEFSALIRAG
KASRAYWWFMTPIVNFIIRNKLLIGRNTVDQLQQLGLQEDDLKITRPRGA
YDPEID
>NE2364 NLP/P60
MSLKHIVILLLGTGMVVGCSSVPKDSNQEKVSRHWHKPAFVTPDKTTENN
PYSYFVRDRLYSQYEEWRGVRYRLGGSDHSGIDCSAFVKIIYEAEFGLSL
PRTALSQANLGSEINQNKLMPGDLVFFKTGRYSQHVGIYLDSRQFLHVST
RKGVTISQLDNTYWKSRYWKAVRI
>NE0983 Methyltransferase family
MHVPVLLEEAADALNIRADGIYVDATFGRGGHSRLILSRLGESGRLIAFD
KDPAAISAARSIRDERFQAVHGSYAQIRTALESLSVSRIDGILLDLGVSS
IQLDEASRGFSFRHDGPLDMRMDSSRGKTAAEWLATVTETELKEVIRTYG
EERYAGQIAGAIVMAQTRQPIVTTFQLAEIVAAVVRKFGHRDGRQHPATR
TFQAIRIHLNQELEELSVTLPQCVELLNANGRLVVISFHSLEDRIVKRFM
RMQAGTDTLPRKLPVRDEESRMHSRQTLQIIGKKIRPGENEVAANPRARS
AVMRVAEKLETGSKVDR
>NE1995 Integrase, catalytic core
MLQVAPSAYWRHAARQRCPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWHQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRTFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE2479 appC; oligopeptide ABC transporter
MMSFDPVILWTDALIYLLLAAGIAFGIYARRHVYVLASWRKVTHSASGMS
ALTILLCFILIGLLDTVHFRPVLEQSVRSSDGKQLYSAEVLSLLDLLAAP
LRLQTEKTYSAPLSAHLYAKETITTPDGNQIRDFARLVHGGAHLQYPEME
LVEDILWRSASGAGAGLLIWIGLVAGLGGVKARQYRISFVRMQTAIWRGA
TEIPWRAILMTLAIILAACGAIVALAMHYHVLGTDKVGQDVLYLSLKSIR
TGLIIGTMTTLIMLPFALLLGIAAGYFRGWIDDVIQYIYTTLSSIPSVLL
IAAAVLMMQVYIETHADMLDTAAARADLRLLFLCIILGITSWTGLCRLLR
GETLKLREMEYIQAAHAFGVSHWRILSRHILPNVVHIVLITTVMDFSSLV
LAEAVLSYVGVGVDPSTISFGTMINASRLEMAREPMVWWTLFAAFTFMFA
LVLSANLFSDAVQNAFDPRNRTLAGDPNRLLNNQTEEQAMAETTGTLPSD
KNTSRT
>NE1216 Haloacid dehalogenase/epoxide hydrolase family:E1-E2 ATPases
MRHEHHQPNSHDPATAQPLADASGAIYTCPMHPEIRQDHPGTCPKCGMAL
EPLLPDLDDDDNPELRDFSRRFWWTLPLTLVVLILAMFGEHLQLMDMTQQ
SWVELILSLPVVLWAGWPFFLRGWQSVYNHSPNMWTLISLGTGAAFIYSV
VATVAPQVFPASFMSMGRVGVYFEAAVVIISLTLLGQVLELKARSQTSAA
IKSLLGLAPKTARRIRVDGSEEDVPLSHVHVGDLLRIRPGEKVPVDGVVF
EGSSSVDESMLTGEPLPVSKRQGDKVIGATLNTSGSLVMRSERIGSDTVL
SQIVQMVAQAQRSRAPMQRMADVVAGYFVIAVVVIAITTFFVWGMFGPQP
SWVYGLINAVAVLIIACPCALGLATPMSIMVATGRGATQGVLFRDAAAIE
NLRKVDTLVIDKTGTLTEGRPVFDRVVPADGFTAEEVLRLAASLDQGSEH
PLAEAIVQAARAQGLVLVKPQTFESGSGIGVRGNVEHQQLALGNTVLMEQ
TGVSVTSLMSQAESLRGEGASVMYLAVDGQLAGLLAVSDPVKASTPEALA
ALKEAGLNVIMATGDGQTTARAVGTKLGIDEVYGEVKPADKLSLIECLQK
EGRIVAMAGDGINDAPALAKADVGIAMGTGTDVAMNSAQVTLIKGDLRGI
ATARTLSVSTVRNMKQNLAFAFLYNVMGIPVAAGILYPFTGWLLSPMIAA
LAMSLSSASVVGNALRLRNSSL
>NE2142 hypothetical protein
MSDRVIRMNQQSETDAQFDPDLPTANAVMASLCCVAAQYASRPSTELAKL
ALDLAYKLTAPQYAESELITEVAQQLVRQWKQVLYQQVQARAAGMIIPGN
RFIN
>NE1194 conserved hypothetical protein
MLKFIHSGPFAALFECGFRPFFLLTAATAVIAVAAWVGFLTLGMPLPLTP
NGPVVWHAHEMLTGFAMASVAGFVLTAIPEFTSTRMIERHHVFILLILWI
TGRVAFAIPIVPAGTIVAMIADLGLLGMLAAIIAPRLWRDANRRHLSFLW
ILLAVIVATGGFYFDTLTGTYPMRWLLVITGLMMALIVIAMSRISMRIVN
TALSQVGETGTAYLARPPRRNLAIFCILLYTLAEFVALQHPVSGWIALAA
AAALFNLTNDWHIGRALLQRWAFMLYLVYWFMALGYAVIGAAILMETAWV
SAGRHLLLVGALGLAVFAVMNIAGRIHAGIEPDKRLWVPIAATLIAGAAV
LRAMMNFDQIEASLAIVAASACWITAFSLHLIFHWRILIRPRTDGQPGCE
GIASDA
>NE1228 conserved hypothetical protein
MTPTLIELMGAGLFALAILHTFSTGFFKRLAYIQPAHAGLWHLLGEVEVV
FGFWALVLVVALFTIEGEAAAIHYVDSRNFTEPMFVFAIMVIAGTRPILQ
TAMAAVHLATRLIPLPGSMGFCFTILTLIPLLGSFITEPAAMTLAALILA
EHIFAKGISAHMKYAILAVLFVNISIGGTLTHFAAPPVLMVASKWNWDTA
FMLTTFGWKAAIAVVANALCVTLLFREELRLLPVTENNRHEAVPPVLAGV
HLAFLAGIVMFSHHPAMFMGLFLFFLGITHAYQQYQDRLILREGLLVAFF
LAGLVVLGGQQQWWLQPLLMSMTSDQMFLGATLLTAVTDNAALTYLGSLV
EGLSDDLKYALVSGAVTGGGLTLIANAPNPAGIAILRNHFDDEAVHPFRL
LLTAAIPTFIAALAFRLL
>NE0182 conserved hypothetical protein
MCHTCDGKDFSPELAWTGVPENTKSLVLIVDDPDAPDPQAPKMTWVHWIL
YNIPPATRKLPERVTVAELPSGTLEGVNDWERTGYGGPCPPIGTHRYFHK
LYALDTLLPDLNQPTKAILEKAMQGHIIARTELIGLYHRSDNV
>NE1566 hypothetical protein
MASDKANACLICTGRQPIHPITRLYRYAGFIEWDTGFSQMRKTPPPEVHF
GSPGQTPGHLRNTLAERIAAVPAGGTIDWVTYYFRDLCLAKALVQAHKRG
VRITLTLEGRPRIPYANDAVIALLSGPDGLGDALRIVTLPGVPSPATKSW
KPQLHEKLYCFSHPEPVAFIGSFNPSGNGPEDDPEIIREIGDQDRGYNAL
IGLKDPVLVEKLAEHARQLHQSPPGLFYRFTTDTNNAIHGTDTDIYFWPR
TDTHPIVQFLRKTGSSARVRIAASHIRMESAVDIMISLARRGVDLEILAE
STLRRVTPRVEQRLANAGIRFRRIRPSGSLPMHLKFVLIEEDNRAWSIFG
SFNWTKPSFWLNHEIAAISSNPVIFESLARCWDMLKNESN
>NE1791 hypothetical protein
MEDLLNPSWNNEERNGMISKECFLHLQKKVNPASHWYLTEDKIAFHHHCI
KNNLSVPELVAVFDPNGQSYWENGQGIETKNNLLDGLARYPFDIIMKPVY
GYHGKGVSALDFVDGVHRFTTDLSLSLRDVFKKILAENPDRYILQKRLYS
HQAIAEFTGNTVLQSLRLITCLDENGQPKLIIRKIKFPKQGNLIDNFSWG
ISNGRLCLIDEYGKIESFIKYDHIKKYLVRYDYIEDISGKKTEFTIPFWN
QCVVLVLNAQKAFAPLRTIGWDVAVTNEGPFLIEGNVFWDPLTPQEGSMQ
AICQLLMALNAPLVN
>NE1241 Tyrosinase
MAIRKDANTLTAAERAEFVAAIRVLKAEGIYDRFVLRHANANMSAIHRCS
AFLPWHRRFIYDLELELQRVSGNPNLGIPYWNWPSGSANASMWNDDLLGG
NGDAGGVVRTGPFRSGQWTVINSSGLPAGPLMRAFGQNGLPTLPTQAAIN
QVMAVTPYDTSPWNMNSNPSFRNQLEGWIGPNLHNRGHVWVGGSMLPMTS
PNDPVFFMHHCMVDKIWHEWQLRFPNQGYLPASGGPFGQNLTDPMGSTPS
GQVGSRPIDVLDSAALGIVYDDAAPQPQPQPEIPLIVVGADPIAAAIGVP
GETDVFRFEVPAFGAHTMYTLGSSDTFMTLFGPNDPNFEVASDDDAGEGF
NAQINRNLSAGTYFLRVRLYSPNSTGNYAVGVRAVSATPGPGPGPVPIPE
LIVNGVGIDASISAANESDVYRFNVTTGDFYTIQTNGTTDTFMSLHGPNS
QIPEIASNDDSGISFNALIRRQLSPGEYFVRVRHYSPSGTGAYSVRVTQG
>NE0313 hypothetical protein
MDREDFFLKIAELHLKRVEILQTVEWRITFSLWTFVAGVAMVSLANADKM
KQAAVAMGGILGPVVILSLMGGIYVWLWYLYLYKFCKKNYNSLVTERNRY
QRMQNEAIKLVLKGKSADFLIEAGAEDKRVPESDFSEPSFKQLSGDDFRK
SGVWEFKRGITAALMFFSWLLVLMIVVPSASKHLNDSVVATGKPAGVEVE
SSGRNLRREADQHF
>NE1258 ParB-like nuclease domain
MWTYTYDLDNQLISANKTGSSNSLAYDGAGRLRQTTLAGTITGLTYDGVD
LVAEYNSGGTLLRRYVHGPGVDEPLVWYEGTTTTNKTWLYQDQLGSVIGT
ANSAGTSTAIHSYGPYGEPNIATGIRFRYTGQQFLGSLNLYYYKARFYSP
ALGRFLQTDPIGIADDLNLYAYVGNNPINFNDPSGLSAAAASMLLGKLGS
WGRENAGTLTEIGIGFTPAGFYTDLYSAVAGRTPITGDSLSGWERVAILI
PGVSEIRNAGRIGSSIQGVAESTIKQVNPTNLIPTQTRSEMSGSQIKRLE
KDMRTNGFDQNKPIDAVRRTDGRLEIQDGHHRVEAAKRAGIDQIPVQIWE
K
>NE2077 hypothetical protein
MISNPVNSAVIVSATPTDAISQTRPVSAVTPVPDATQSDTPAFILGQKYR
AQIGERLTNGHSLVNVAGRWLQMRMPASANPGNILELTLIEQSPRLKFLL
HSGTQGGNNPTTLSPAGRLIAQLLSQPAPPAMKTANEAAPLLPIPPATGR
ERIQLPAQLQQALSASGLFYEAHLVQWLSGNRSLQQLRQEPQGKLPAPAT
VSTTITDSATASPVASQAVSLIQQQLHTLETGTIQWRGEIWPGQTMEWDI
TEYPDDQGKEQADNEKTGKSGRWQTRIHLQLPNLGKITATIMIEPQGMRI
RLDADSDEITRQLRKEQITLASAMQTTGLTIRAMDIQQHEAT
>NE0479 PIN (PilT N terminus) domain
MLKYMLDTNIAIYVIKRRPIEVLVTFNRYADMMCVSAVTEAELLHGAEKS
RQREHNLRQVADFLSRLEVLSYTSKAAGHYGDIRADLERKEKPIGVNDLH
IAAHARSEGFILVSNNLREFERVDGLRLENWIT
>NE0052 hypothetical protein
MKKEQEMLRVNVVMITVALLAGCTMAKRPLPGPAQPAPDPVVQQRPSGPL
PPSTRPAYNLAGYPKAAQEGYVDGCETAKQSAYGFKDKKRYAADTQYQMG
WNDGFSICRGKHQQN
>NE1944 conserved hypothetical protein
MAAPDIKTIVNGINIKQSFAIQNQIRDGGDGELAKPRYQATVVWDSGYHT
TSNVTDGQIIIGDEPVHYGGEGMGITPQDLLLTAVGHCLAASYIGGLSAA
HINVESLHVHVSGRVNFRAAFAVEPGNPGFEDISVIVEIQTDAPQEQVTA
LLEKLKHMAPIPDTIMRPVPVNIEIRHQLK
>NE1584 hypothetical protein
MTYKLEFKKSALKEWEKLGHTIKEQFKKKLKERLENPHVHSAALPGAKNI
YKIKLRQPGYRLVYSVEDQTITVTVIAIGKRDRNEIYDIALSRLHDKS
>NE2214 AAA ATPase superfamily
MNDPESIFQRLDRLLARIEDTLPVRPQHADPAEYIALRWRKHGDTGYLQA
INHPHTITLNELLNIEEQKQTLDRNTLQFVSGLPANNVLLTGARGTGKSS
LIKALLNRYADRGLRMIEVDKLGLTDLPDIIEFIGQRPERFILYCDDLSF
EADEPGYKALKVVLDGSISTASDNVLIYATSNRRHLIPEFMHENLATRHV
DGEIHPGEATEEKISLSERFGLWLSFYPFDQEQYLEIVRHWLSQHGISRL
SGPARQEALRWALARGSRNGRVARQFARDWAGQQKLAKTEPVRVDKE
>NE2343 hypothetical protein
MRIQNFTDTLSNRMRHMCCFCVAVITGLCFSAQVYAQETGKTGPSSLLLH
AARVFDGNEMHSGLSVLVKDGRIARVGPRHSFRTGDASADIDLGDATLLP
GFIELHAHLAFQKVPADIVLRHGVTTLRDVGGPVHPPYGGEGSLRVLTSG
RIITAPHGYPIVTMGARDLAIPVASEAEAREAVRHLVGEGAVIIKIALEP
GSEAGAPWSSHHDHAHHDAAHTASAVTHHHAHAASHDQSAWPMLPEPIVK
AVVDEAHRHQRKVSAHVAEPSGVQVALNAGVDEWAHMPCNPIPEPLLKQA
RAQNVTIIGTLDTLSKCSGIAHNTMVWTELGGELLYGTEIAHPDIPWGID
AQELMYLMHLGKMKLLDALGAATAKAGRYLGIPMLGTIQPGAPADLIAIR
GNPEQQLKRLEYPDLVISGGHIVVNHFSSLP
>NE1589 hypothetical protein
MIVVDSNVLAYFYLPGEYTAAAEALFEHDPDWVAPVLWRSEFRNILAGYL
RRGSLTFLQAYNLQCEAEDLLAGAEYEVNSFSILELVRDSECSAHDCEFV
ALAIKLGAKLVTMDGKLLRMFPDIAFALSASQRSS
>NE1590 hypothetical protein
MTIQWNHYGSISIGKLRNAMPTTLTLKNIPDDVYERLKVAAEMHRRSLNS
EIIVCLETVLMPTRISPGERLERARQLRAGLNSEKFQACDIDVMKRQGRP
>NE2291 cAMP phosphodiesterases class-II:Metallo-beta-lactamase superfamily
MKLKVLGCGGGIGSNSHTTAMLLDEDILIDAGTGVGTLALDDLLKIDHVF
VTHSHLDHIAFIPFLVDTTGSLRDRPLIVHALPATLECLQKHIFNWHVWP
DFSRIPDAVRPYMRYQPFAVGETIVLGSRRITPLPACHAVPAVGFQLDSG
QASLVFTGDTTANDGLWEAINRITNLRYLIIESAFSDVKHDIARRSGHFY
PALLADELNKLQHDAEIYITHLRQGEAELTMQEILARISRFDLRRLMSNQ
IFEF
>NE0741 FAD linked oxidase, N-terminal:FAD linked oxidase, C-terminal
MASKYIALPKGVSEKDFEAAVGHFRGILGDDAVLTSAEQLIPYTKTMMPV
PDAEHTPSIALLATTVEQIQKIVAVCNQYKIPVWTISTGKNLGYGSAAPA
ERGQIVLDLKRMNRILEVDKDLAFALVEPGVTYQQLYDYIKEHKLGLWLS
MPAPSAIAGPVGNTLDRGVGYTPYGEHFLFACGMEVVLANGEVLRTGMGS
MPNSNTWQVFKWGYGPYLDGIFTQSNYGIVTKFGFWLMPEPPAFKPFVIQ
YEHEEDIVEIVETIRPLRISGVIPNAVVIAHALYEAPVKARRGDYVSGPG
SIPDEAVYRIMKDHKLGIWNVYAALYGTQEQIDVNWKIVTQAFGQSGKAK
ILTEQEAANDPAFAYRTKLMRGEMTLTEFSLYNWRGGGGSMWFAPVSQAR
GSETLKQMALTKQILAKYGLDYSGEFIVGMRDMHHIVDVLYDKTDAEQTK
AAYQCFDELLTEFSARGYGIYRVNTAFMDKVAETYGPVQRNIHRTLKKAL
DPNGILAPGKSGIR
>NE1301 conserved hypothetical protein
MDHNLHRPRTLILPPAPYAAALFAGWWIDRNLQSLSWDWGTVTQSLGWLG
VITGLALLAWTAITLWRHRTTVNPYKAASSLCTTGPFRYSRNPIYLGDWL
IFIGISMLLSTWWPLFFAPLIWAMLRYGVIRHEEAHLEARFGESYRDYQT
RVRRWL
>NE1252 hypothetical protein
MHGTCKAEKGFTLLELLIAMIIGSGLVYLTSEVFSLIQRGVEQSRQAADQ
VMTEQRAIAVVREALSNLIPPVTSDKRYRIVATESVFEFAAVPVDGRSQW
GSMNTRIVIEPSDSGNFMIVFEQWKSEAGSSFTIPKSRLILFKDLKSASL
QYLYHTSRGFHTQPEEPEQHPELVTLRWTKADYVGQTGAESLSVRTRIDA
GANCQLDLQSLSCR
>NE0244 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNIYRAQQDHFQLVGTDRLEIMRGHGIQRHASKQRWH
ISDKTTQLAAQRFHVKRPETLHEIGMPVTLHDTVTAVTDMSNDIFEQPCL
TGCAERRFALGSEQMPIGRKAATRHRKGRLLRIVVEW
>NE1154 Domain of unknown function, DUF9:HAMP domain
MKPVLYSIKSKIVIFAVIATLLPTAGLGLLSFKQNEALIIDSVTRELRVL
ADNVNRQLNLWMDENTLTIRALSTSNPVIEGLTILKKQTDNTHENTAKQT
QSVMSGYLTAVRDKLDDVLELTVFDSTRKIVASSASTPEVLESPERWTHA
SLTRGSIAITPHWNDRYDTAAFSVIYPVLSYDSQLIGAIAATLDLGIFRN
KLMETRKFSSGEITLLDQNSRVLLSSASGIDHLAVPNPHYLESMQILEES
ITHDGLSYPKAIGLLYASENVPVTILVEQDQSAIQASWMKLRNRFLEFVA
ILIVIVTLVALYMGHSIVTPLKQLISAVRGIVEGNLDIYLPVKRKDEVGQ
LTTIFNQMTDALRNKHTEIMAVNQAMQQQNQLLQKLSITDGLTSLYNRAK
LNTILIEQLARFKRNDRTFCLLMIDVDHFKTINDKLGHITGDKILITVAS
ALLKSIRTIDYAARYGGDEFMAILTETNSSAAIKTAERIRSEVSAACSAL
EEHPIQITLSIGITQSHHDDTTPGDLIARADVALYEAKKRGRNRVYCVDV
AHTDP
>NE1181 PemK-like protein
MTDFKQRDIYWIDLEPTKGAETRKLRPCVIIQSDLVNVQSRTVIVAPLLL
QHKPWPFAVNLEPTEKNGLDKDRHINLKQLRAVDISRIGKKQGRLENRYK
DPIKAALMIIFDL
>NE1905 PH adaptation potassium efflux system protein F
MSATILFWAVTTAQIALGIAMVLALARMISGPRAQDRVLGLDTLYNNSML
LMLVFGIRTGNSLYFEISLIIGALGFVATVALAKFLMRGEVIE
>NE1354 Helix-turn-helix motif
MKMRSQLLIVLQEHLRNSGLTQFKAAELLGVTQPRVSDLMRGKIDLFSLE
SLIDMITSIGLKVEINIKDAA
>NE0475 Helix-turn-helix protein, CopG family
MAQITARLPDDLVSSLDAAAARLRRSRAEVVRQAVEYYLEDFEDISQAID
ILRDPADPILDWEEVKRDLLHLD
>NE0499 Glycosyl transferase, family 2
MITVSIVSHGQSTLVEQLLADLVRLDMSMVTEVLVTLNIPEDISSKPGDY
PYPVRILRNTAARGFGANHNAAFRQAEGEWFCVMNPDIRLINNPFPILIE
EGAYDSAGVIAPMVVTPSGMIEDSVRCFPVLTSLAAKLFGHGDGRYLFAA
GDEAFAADWVAGMFMLFRTEDFRAVGGFDEGFFLYYEDVDICARLWKSGR
SVLACPKASVIHDARRSSRRNLRYMKWHALSLIRYFWKHWGRLPQTPEQ
>NE1996 Transposase IS911 HTH and LZ region
MSNQTKFSPEVRERSVRLVQEHRGEYPSLWAAVESIAPKIGCVPSTLLEW
VKRSEINNGAREGLTSSERDRLKALERENRELRRANEILKTASAFFAQAE
LDRVLKK
>NE0898 conserved hypothetical protein
MRWDIFCHVVDNYGDIGICWRLARQLVTEFDISVRMLVDDLGAMQRICPA
IDPRLAVQNIRGVEILHWVEPFADLVPADVVIEAFGCELPPRYIAAMAAA
SPRFSEPEGKTDRIWINLEYLSAEQWVEGCHGLASPHPSLPLIKYFFFPG
FTAATGGLLREAELFTLRDASRTDPAGLWRELGITNPAADEATVSLFCYD
SAPISDLLEAWAGSISPVRCLLPEGTASASAASWAGISRLAAGDSIQRGN
LTLHVIPFMSQENYDHLLWACDCNFVRGEDSFVRAQWAVRPIVWQIYPQQ
ENVHLIKLEAFLDLYCQELIEPAADAVRAFHRSWNNNEQPDWNCFWKYRD
VLQQHAMAWAERLAQIPNLASSLVNFCRNR
>NE0188 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1622 Protein of unknown function DUF132
MPGLSVVLDTSVLVLGLAYPASIPGHIINAWRQSALNVVLSHYILDEMIR
VLPRLSRIQMTPAEIRDLANSFMFLADVVEPQGSQDSNLRDSGDQPVLLT
LITAQTDYLVTGDKDLLALARDYPIVTPAEFWSRHGE
>NE0729 possible Response regulators consisting of a CheY-like receiver domain and a HTH DNA-binding domain
MNPSFQARILILEDNASLVANLFAYLEPRGYLLDAAQDGLAGLTLAQQGN
YHALIIDWGLPTKEGVDVIRALRARDCHIPILMLTARDDLDDKISGFRAG
ADDYLIKPFAFAELEVRIEALLARSRGRRRKLTVGDLSFDLDTQHIWRGS
RQIQLSASERRLLEVLMRASPSVVSRPELEETLWGDEPPEGNVLRSHMYE
LRRAVDGEQEEKLLHTLRCVGYRLAVQEQN
>NE0931 conserved hypothetical protein
MTVSSPFQPDCRDCPRLAQHLDQVKTDYPDYHARPVAPFGDSSAKLLIVG
LAPGLHGANRTGRPFTGDYAGILLYRTLHKFGFASHDESVSADDPLHLTD
CRITNAVKCLPPANKPQPAEIRQCNAFLAVELDNFARNGGQALLALGTIA
HQAVLMALGCRNADFPFSHGAIHRVTEELKLYDSYHCSRYNTQTRRLTET
MFEQIFDRICQDMAATQ
>NE1634 hypothetical protein
MAKVDTRPDHHRLLLETIVQPASLTGYDNRQWELLLRLARRAQLSGYLAA
KLEKDGLLDSIPARASNLLRSSLIQARKQQQSVSWELNRVMWALDGQEIP
VIVLKGMAYLLQDLPNAPGRMFADLDLLVSKENLGQIESSLLKKGWQHHA
LTDYDERYYREWSHEIPALVHPERSVEVDVHHTLSSPLGKLKIDPLPFRE
AAVKVKDAGIYVLSPEDMVLHCAVNLFQNNELADDLRHLLDFHEMMLFFS
SQQPFFQQKLIERANQLGLGNPLFYSLYFSRLLLRSAIPDDLEKQLDRQP
GWLARRVMHHCVPLALLPQHPDHPSRMAGYARMWLYWRSHWLRMPLYRLV
PHLAYKFYLSVFPARAETSK
>NE1167 Purine and other phosphorylases, family 1
MSRTGIVIAMRSEAACITSQRNLPFDQAVPIDDHLVIRMCGMGPVAARRA
AIDLYDQENVAGLISFGIAGALDDALQPGDLVSPESVQTKQSYSTDSAWR
TRIERSLPAHLNIIRRPVAASDELVSTADEKYALAARSGACAVDMESGAI
AAVASEKGVPFVVIRVISDPVQFSPPAALMDVLHPDGRVKPIALLAYLLN
GSLKLGELLRFGSDAQIAFKTLKQVVQATRHELGRQVSGTCQGISD
>NE2563 General diffusion Gram-negative porins
MNKRLMALAVAGALAAPLAASAEGTHVTIFGRLQAEYATVDMDGFNAQTA
VADDALQSRWGLQIAEDLGAGLRAIGRIEYSLNPGGGEDKHLAREQWVGL
GGDQWGELKFGRVQSVLKDFAGGHTIDPFAYTSLHANGSGGLMASSDNGY
GSGDHGFVNSAVRFDSPVVEGFSVAGLLMPGDADKNDPTRDPSYAGGENG
EWDFQVAAKYSGQFDAVGVDVFGAYSRDNVSKLQKSLGASKDEQIWRGGA
IFSVGDFRLRGQYERVYDANPFGGAASCTHASAFGDYNPLSPTQSADGGR
GQCNSAMNPNGNGSLWIAGADYTIGNTTLIAQGGMAVAKKTGDILTLDPY
YAKRNVENITVGVIHNLSRRSSLFGGYQRVWVKDRNIATDTDRNVYSVGV
RHDF
>NE2427 pyrimidine dimer DNA glycosylase
MSHNSRPVRHHNMRLWSLHPKYLDPQGLVALWRESLLAKAVLRGETRGYT
NHPQLERFKAHPQPHFAINFYLAAIHAEATERGYTFDSSKIGPVCSVQLI
LVNSGQLSHEWNHLQHKLATRSPIVHARWSDLASPICHPLFHPQPGPVAS
WERV
>NE1795 Glutamine amidotransferase class-II:Asparagine synthase
MCGITGVFDTRSTSEIDRNLLHRMNETLTHRGPDEGEVYVESGLGLGHRR
LSIMDVSSGQQPLFNEDGSVVVVFNGEIYNFQKLVGELTALGHRFRTHCD
TEVIVHAWEEWGERCVERFSGMFAFGVWDRNRQTLFMARDRLGIKPFYYT
LLDNGLFLFASELKALLVHPDFDKTFDHRAIEDYFAYGYIPEPKTIFKNA
FKLNPGHLLSLQRGQQTVQSREYWDIPFTPHGSLSEEEAAEELILRLRNA
VDSHLMSEVPLGAFLSGGVDSSAVVAMMGGLMKEPVNTCSIAFSDPAFDE
SDYARLVAERYQTRHFTEQVQQDDFDLIDRLAALYDEPFADSSAIPTYRV
CELARKRVTVALSGDGGDENFAGYRRYRWHMIEERLRSKLPLGLRKPLFG
LLGSVYPKADWAPKFLRAKTTFESLARDSVEGYFHSVSILDNKLRMQLFT
RDFYRDLQGYRAVEVLRDYADKSPTKDALSLIQYLDMKTYLVGDILTKVD
RASMAHSLEVRVPLLDHELVEWVSGLPASMKLRQQEGKYILKRSLEPYLP
NEVLYRNKMGFSVPLASWFRGPLRERVRTALLGNTLAGTGIFDMGFIKKM
LDQHQSGRRDYSAPIWTLLMFEAFLRNILSMGNMSYTGQDKKVA
>NE0790 hypothetical protein
MVKNIRSCLVLMVLSCFTVQTYAEDVTVRLSVQNIHHPEWERATDEELAV
LRGGFVLPNGVHIDMSLEKFIHLNDVLVHSSSLQLPGAGVVLQAGMQNMV
SDSITVPELSTFVQNTLDSQHIEALTTINIEVSNLKGIAANGGGQQVFTE
FLAPALLR
>NE2258 conserved hypothetical protein
MRENSVINLHASDLRTFIPSRDFVLSRDFYSALGCELEWSDDNLALFNLA
GSRFYLQRYYVKEWAENSMLHISVQDAANCFTDITGLIESGRFPSVRVAP
PKRESYGALVTYVWDPSGVLLHLTQWDEG
>NE0560 transposase
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEVV
>NE0565 conserved hypothetical protein
MELEFGCGFPTHDSHPLHHLKRSCKTLSTGIAWSMATLALLLLFSGTTAA
AGNLHNIVIYAERLPGNLYGYRMAGHIVRQPDGTQIDITNRYVTSTATIP
GPTIILDEGDIADIELLHQFDPHTPFQEHVSLHVHGVHYDRESDGTLKYI
NLYKDESAVPHLSYVYRWNAAPGTAGTWPYHDHNMENHNGAEDKGLFGAL
IVRSAAEARQAKTSSNRAHVSPNVAKDYVFYLGDDAFWAMEIDNATGKQT
ALGTNPPLTAQRNTNVRFHLIALGTNFHQFELPKYQWIDPGTRNTINRKV
LGPLEKHVFTVKATHSSRYQDTAFASRLLGMRGDFIITR
>NE1649 Acyl carrier protein (ACP):Phosphopantetheine attachment site
MEITEIEQRIKKIVAEQLGVNESDVKNESSFVSDLGADSLDTVELVMALE
EEFECEIPDEQAEKINTVQEAIDFVAANAAQ
>NE1420 probable rubredoxin reductase
MNQPSVVVVGSGLAGYTVVRELRKLDAAVPITLLSADHGSFYSKPMLSNA
LATGKTPDSILSAGTMQMSGQLDITVRPYTSVNAIDVAAGSVSFEEGGQL
TYDRLVLALGADPIRLPIPGEGVDEILSVNNLDDYRKFREALESKRHIAI
LGAGLIGCEFANDLAAKGYQVSVFDLSPQPLGRLLPPEAGRFFRDKLTAA
GVNFLLGTTVERVSKENGYYQLFYEGGKVVQADMILSAVGLRPRTRLAAV
AGIQVNRGIAVNRYLQTSIQNIYALGDCAEVEGKVLPFILPIAHAGRALA
ATIAGNPTLLHYPAMPVMVKTPACPTVVSPPDPAVQGEWEVVAIENGMKA
LYHDEAGNLHGFALLGTATVERNTLASRLPPVLA
>NE2092 Glycosyl transferases group 1:TPR repeat
MRDHPADRLQSITVTAFYQEPNVPTHSLSDYTSWIERTWQGQVSFTELVT
YAETLNSHPALCAALYRTWLQRNTGVFNSVAWFNLGVILFAENNLIDSIE
AFQKALALSPAFPQARINLGLALERQGNAEAAIEQWQAVVENAITPEADQ
NTGPNQADQIKNLTMALNNIGRLQETRRQYQAATQALEKSLQLDPDQPDA
IHHLIFQRQKQCQWPVYAPVGKVTEAVLHEHTSALAMLNISDAPEAQLTA
ALNYSRRKIPADLPRLSPANGYRHDKIRVAYCSSDFCTHPVAMLTVELFE
HHDKNRFETYAFCWSPDDGSTLRQRILSAVDHYIPVHGKSDDEVAQLIRQ
HEIDILIDLQGQTSGAKTRMLAMRPAPMQITYLGLPATTGLPGIDYVIAD
RYLIPEEYARFYSEKPLYMPDVYQVSDRKREHSPAPTRKDCGLPARKFVF
CSFNNNHKYTLEVFTTWMNILRRVPNSVLWLLADNPWARENLQKQAKAQG
IDPKRLVFAERTMPADYLARYLVADLFLDTFPFNAGTTANDALWMGLPVL
TMSGRSFASRMAGALLTAADLPELITHDLQTYEDKAVALAADAKARKTMR
QKLALAKESGPLFDSLRFTRNLEQQYIALVSELQNPSQHINISAQPEPTK
LGEPAQPNPIIATVQEAEALQARGDTQGAIQLYRQWLEHAHSGDEWIAQF
NLGVLLRDGGDITGAQQAFQAVLKQKPDFVQARAALGKLPAPVTTESQKH
GAIIGPLQTNSPISTKSKEQITQIFSETPKIKLLVEGWRGINHSFALVNQ
YHLYEWMRSSQLHIYHRDMPLLFSHWETNKNRGNTGLPGTYSQRLAQVQH
WSGQTYDACFRIYSPVTLAPDDKHPVSTFLVTELGLDETQIAHFRPNLKA
YFNMGGNIVTPSHWSKERIIEAGIPAEHIHLISHGVASNIFHPMHSDERA
IHRQRLGFDREAVIFLNVAAPIWNKGLDLLIQAFVQCFHQNPHTRLLIKD
QQAVYGISTKDTVLREITLLGESKNESLLNAIRVIPDLNLLQLRELYCIA
DYYVSPYRAEGFNLPVIEALVCGTPVIVTEGGATQDFCSEKNALFIEAVP
YRNVKINDRHVNAYQAPILDSLITHLNVCAQDKPFSESQRLQNAASIAKN
FTWAKAADTISRLFTQSNDCNTSTSQITGALHEEPQYCN
>NE1523 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1642 Maf-like protein
MKNRTNYPPLILGSSSVYRCELLQRLQIPFETASPAVDEFALPGEAPGTT
ALRLAKEKAHAVAKLFPDALIITADQVAALGEIQLGKPLSHENAVQQLRL
MRGREVIFHSALCLFNSRTTRLQARVIPCSVKYRELSDSQIEHYLAKEQP
YHCAGSAKAEGLGIALIERITGEDPNALIGLPLIALVEMLMEEEVELF
>NE0106 possible ABC transport permease
MAMPSDLAAGNNSRYRLVTTAAGDTLLQLTGSFTLTSLDRSFPTVTKELA
KLASQSDASSLHWDLTDVSQLDYAGAVMLWRIWGEQRPAHLLLRPEQERM
FLRLEKSVSLPEQPRRILFLPISMLGKQLLRFLDHLTGMITLSGQIVLDL
FFLMTHPGRIPAREISANLYRTGAQALGITAVVGFLIGIVLSYLTSEQLH
MFGADIYIVNILGMSIIRELGPMLAAILVAGRSGSAMTAQLGVMRVTEEL
DALTVMGIPHSMRLVLPKIIGLGIALPLIVLWTSAIALLGGLVAAELQIG
LSIHYALTALPDTVPIANLWLGLGKGMVCGMTIALIACHFGLRIKPNTES
LGEGTTNSVVTSITAVILIDAIFAVAFSNIGIRIAS
>NE0737 conserved hypothetical protein
MTAELSCHLLVSPFTKSAVMFNPSRDQARRLFFDTWQKYHRKEPLSGMET
IALEVILQHPEYHSMLQDVERYLDKDFPPELGETNPFLHMSMHVAIREQL
AIDQPAGILQRFEQLKTRLQGDEHEAMHHVMECLAEMLWHSQRNQTAPDA
GIYLECMDKRIGNK
>NE0686 hypothetical protein
MKTLSTLFLLLAIVAQLTACNTIQGFGKDVQRGGEAIEKTAK
>NE1431 t-RNA synthetase, class Ib
MSESISHQLQLIRRGCQELLIEEEFAQKLAQGRPLRVKAGFDPTAPDLHL
GHTVLLNKLRQLQDLGHHILFLIGDFTGMIGDPSGKSATRPPLTREQITQ
NADTYASQVFKILKPEQTEVVFNSSWMDKFSAADVIRLAATYTVARMLER
DDFSKRYHENRPIAIHEFLYPLVQGYDSVALKADLELGGTDQKFNLLVGR
ELQKHYGQPPQCILTMPLLEGLDGIQKMSKSLNNYVGINESPAEIFGKLM
SVSDTLMWRYIELLSFESLETVRQWQNEVESGCNPREIKMRFAREIVARF
HSQTDAVRAAEEFEARFSKGVIPDDIPEKKLYIQDAGLALPQLLKLAGLT
ASTSEALRMIEQGGVKLNGDKVSDKTRIIPSNVTVIAQIGKRKFAKVTLV
TEQSGKQAN
>NE0341 Transposase IS4 family
MFSLSPGQAGDAPEGRKLLKTLENKGFSDTHVIMDKAYEGDETRQLVLDL
GMIPVVPPKANRVSPWEYDVEMYKKRNEVERLFRRLKRFRRIFSRFDKLD
SVFRFFIHFALIVDYLISVNRP
>NE2325 possible transmembrane protein
MNPIYSSMNRRQQGVSLPGLLTWSVIIILVAILGMRLVPVYIEFAAIKRA
LVAIASDSELHNAGVHEIRQAFNKRAAVDAIKSVNGNDIVIRKQDGQLVL
DINYTVTKPLFANLSLLIDFDAASDR
>NE2121 conserved hypothetical protein
MTKKLPIGIQTFREIREESYYYVDKTSFALKLAMEGKYYFLSRPRRFGKS
LFLDTLAELFAGNEALFHGLYCHDRWDWSVRYPIIRLSFAEGWLESRAQL
DKRICWLLEQNQQRLGVTCKQESDIPGSFAELLQNAEAKYGQRCVVLVDE
YDKPILDNITEPEIARAMREGLRNLYSVIKGQDAHIRFAFLTGVSKFSKV
SIFSGLNNLNDITIDADYSAICGYTDEDVDTVFAPELPGLDRQQIQDWYN
GYNWTGQPVYNPFDLLLLFDKREFRAYWFETGTPTFLVDWLMQRGYFTPS
LSRQYSSLELLSAFDVDHIEPEALLFQTGYTTLQGVEEYLPGQRIYWLGY
PNKEVQISLNNALLPALGIEGQKVLTHRIRLLELLRANNFAGLQQLFTSF
FASIPHDWYRNNPIAQYEGYYASVFYSHFAALGLDIVVEDTTHHGRIDMA
VTFNANVYLFEFKVVELVPEGHALEQLKTKGYAEKYKIRNEPIYLIGAEF
SKDSRSVVAFDVELFA
>NE0296 conserved hypothetical protein
MRKLHLYLLAFILCLAGLGLAYYKAAVIGLPLTASEEAQVWNIEARISFR
AKPDSAIKVTLPLPFNPAGYSILDEDFVSADYGLAIEQDNTGRVARWAKR
RAQGKQLLFYRAVLFENEEVAPKSDAAPVYPKPPEYPENLAPVIQGVLDK
ARQQSADTISFTQRVIEQLNAAASIPELKLLVKHVSRTRNLAKTLSWVLA
GARIPSRVIRGLRLQDDKDYADLEHFLQVYDGEQWRTLNIKDGQEGLPSN
FIIWQIDDDRSFSIEGASESGIQYSVARTLQETVNIAGFRSAEKGSHLMD
FSLYSLPVHTQNVYKVLLMVPLGALVVVFMRNIIGIRTFGTFMPILIALA
FRETELFWGLILFSLIVGLGLLLRAYVEQLKLLLVPRLAAVLTMVVLLMA
GVSVIMHKLGFEMGLSVALFPMVIMTMTIERMSLTWEEAGPAEAFKQVSG
SLLVAVIGYLAMNISEFQYMFFVFPELLLVLLAVILLLGRYSGYRLMELW
RFRAFARNKP
>NE1925 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1206 Glutathione peroxidase
MNIYDCGIKTMDGQDKLLGDYKGKVLLIVNTASKCGFTPQYQGLEDLYRR
YKDRGFVVLAFPCNQFGHQEPGSESEIQQFCTTRYDVSFPVFAKIEVNGA
NTHPLYRYLKNEKSGVLGTKAIKWNFTKFLVDRSGHVVRRYAPADKPESL
TGDIEQLL
>NE1163 DUF198
MNKQIDLPIADVQGSLDTRHIAIDRVGIKAIRHPVVVADKGGGSQHTVAQ
FNMYVNLPHNFKGTHMSRFVEILNSHEREISVESFEEILRSMVSRLESDS
GHIEMAFPYFINKSAPVSGVKSLLDYEVTFIGEIKHGNQYSFTMKVIVPV
TSLCPCSKKISDYGAHNQRSHVTISVRTNSFIWIEDIIRIAEEQASCELY
GLLKRPDEKYVTERAYNNPKFVEDIVRDVAEVLNHDDRIDAYIVESENFE
SIHNHSAYALIERDKRIR
>NE1637 hypothetical protein
MINVTEEKIQGQTAAAPQPARRRLLKSTVAIPVIMTLHSGAALARTSNLA
GPVEVGDAVKFNTGEREQLVCVNSGDGPLDGGPTYDLGKTPTADFAPLYD
LDGNPMDLEGQAAACREGGAGILISAAAYTSLQGNGFLGNLPPL
>NE2177 hypothetical protein
MDEPEIQNNKYRLIQILTISCLVIFLLFIWQGNKGFNLWDEGYLWYGVQR
VLLGEIPIRDFMSYDPGRYYWVAALLSVAGDNGIMSVRIAVAVFQCLGLF
VGLLLIAQSTKSRDKADILFWIISAAILVLWMFPRHKLFDISISIFLIGI
LTYLVSNPIPKRYLIAGICVGLIAVFGRNHGVYGAVGSLGVIAWLNIRNR
SDTGFLKGFVLWSVGVTIGFLPIIFMALLIPGFAVAFWESVRFLFEQKAT
NLPLPVPWPWTINFAASSIGDAARGVLIGIFFIGTLIFGGLSVIWVVYRG
LKEKPLPPVLVASAFLALPYAHYAFSRADVGHLAQGIFPLLIGILAIASS
ASSKTKWVLAAGLFMTSFWVMHVFHPGWQCLASKQCVNVDISGKYLQVDP
NTASDIALLHQLTDQFAPDGRSFIATPFWPGAYALLERRSPMWEIYALFP
RTGAFEKKEIERIKASDPGFAFIFDLPLDGRDELRFKNTHPLIYQFILNN
FELVPNSHNPAYQIYKTRNAGQ
>NE2190 Maturase; integron/retron-type RNA-directed DNA polymerase (Reverse transcriptase); part of type II intron
MHRALNQDDDHNQDGQDLLEAVLARDNLARAWRRVKSNRGAPGIDGVTTA
EWPEHARAHWPATREQIEAGRYRPQPVRRVDIPKPDGGQRQLGIPTVTDR
VIQQAIAQVLIPIFDPGFSASSFGFRPGRNAHQAIRQVQAHVKAGYRWAV
DLDLARFFDNVNHDLLMSLLSRSIADKRLLALIGRYLRAGVLVGEHPQPS
EVGTPQGGPLSPLLANVLLHQFDLELERRGHRFARYADDVIILVKSRRAA
ERVMQSLTYFLQSTLKLTVNLAKSQVAPMSECSFLGFTLVGKKIRWTEKS
LANFKHRVRQLTGRSWGVSMEYRLEKLGQYLRGWFGYYGISQYYRPIPEL
DEWIRRRVRMCYWKQWRWARTKIRHLLDLGIPLKAAIQHGVSSLSYWRMA
RTPVTQQAMSNDWLRAQGLLSIKDLWCKAQSYGPDKG
>NE1940 Integrase, catalytic core
MGQILHGSARTTAAVRRAIQHSEESLNVLARRYAINPKTVAKWKKRNFTH
DARMGPKEPRSTVLTPEQEAACVAFRKHTLLPLDDCLYALQSSIPTLTRS
SLHRLFQRHNISRLPEVEGEKPARKKFAQYPIGYFHIDIAEVRTEEGKLY
LFVAIDRTSKFAYAELLPKYGKMEAAQFLRNLIAAVPYKIHTILTDNGIQ
FTHRKTDRHAFLHIFDRVCLENGTEHRLTQPNHPWTNGQVERMNRTLKEA
TVKRYHYENHQQLREHLYSFLNAYNFARRLKTLRGLTPYEYIIKCWQNEP
ERFIINPYHHKVGLNN
>NE2549 ATPase component ABC-type multidrug transport system
MFIDVTSNSPHPVIEANGLSRHFGDRVAVHDVNLKLHRGDILGFLGPNGA
GKSTTMRMLTGNLAPSSGSVHICGTNLLDDPLEAKHHIGYLPEIPPLYKE
LTVDEYLRFAARLHGLAKTSLQIALDEVKQQCELNDVGKRLIGVLSKGYQ
QRVAIAQAIIHRPQLIILDEPTVGLDPNQIQKVRALICELGKTHAIILSS
HILSEVESVCNRVQIMHQSKLVLDDSLDNLKRQNMNLENIFIQLTAGNTP
TGESVQ
>NE1790 hypothetical protein
MKNFIRKLIRALTAIRLETDRYKHSTLRIIINWLTAFLKDLFSAQEIRLY
GLADPEDGANRIRQYVSKEMADRFYRKYNPGAAVPNIEDKFIFTTLCLQH
HLPIPATYGIFKQGIVKTFDERIFQESSRFRDFIRELEPGEYLLKPNNGM
LGLGLSILEIDDQGSLKFQGKSITADDLYRELCSIEVTLPASSKGDSVDM
DFEGLLFQQRIANHPEITKLTGFKMLQTIRICTHVTDNNQVEILFAFMKL
AGQEGLADAFNLGKTGNMLAKIDPATGKFCNVYAMDQQQGYLVETTHHAV
TQANLLNFTVPHWQACLTLAMKLSTTFLPLRAVGWDIAITDNRPVVLEGN
DNWVPVVPFDINIDKLKQYKLKS
>NE1912 hemK_fam: modification methylase
MRGNDLLVPARESPDASVTIGELLQRAANVVDRVDARWILQSVLNTDAAF
LIAHAEQLLSTDQVAHFRQMLARRIAGEPVAYLTGERGFYDLVFEVTPDV
LIPRPETELLVEMALSKIPSDRKCNVLDLGTGSGAIAITLARHRASTCVT
AVDFSPGAMAVARRNARMHAVKNVVFIEADWFSSFTSEKFDVIVANPPYV
AAGDPHLEEGDLRFEPLTALVAQDNGLDCIRTIIAQAPGYLEPSGWLMLE
HGYDQADVCRELLAKAGFTHLFTRPDIAGTDRVTGGQYE
>NE2220 conserved hypothetical protein
MTMDFQITRRYPCSYLPGELARSRIVVTDDTLSDSNTYAQLIQQGYRRSG
SLIYRPDCDYCHACIPVRIPVELFDPNRTQRRIWKRHQHLQIAWHPLHFD
PVHFDLYQRYQKSRHTNGSMDQDDENGRKQYRDFLLKSTVDSFLVTFHEN
EQLRMVSIIDRLPDGLSPVYTFFEPDVPGSSYGTFNILWQTAQCRADRLP
YLYLGYWIANSRKMHYKANFQPLQYLIDNQWQYRTTNR
>NE0274 hypothetical protein
MRKFSVFLLLANIFVVFYLHGRPDDNLPAQIALIHSEKIELLPAKVACLK
WENLIGPVVQHVRVEISKWESGQDHITEISRGEVTVHWVHIPPLRNARET
AKQIEQLKKSGISYLHIQENADSPWHNAISLAILPDDSDVAALVEELKGK
GVERIMDSEQVLEQFEFDIRNPTEQITESVRQLAQQFPETKLEVTECSRL
>NE1885 DUF202
MSDLKDPRVLFAAERTLLAWNRTSLSLIAFGFVVERAGKLVRAISPGIIG
PDQLAAMFWLGLAFIVLGAITSVYSARQYAVILKTLTPDEFPPGYAAKWG
LLVNLAVAILGLILALALWWWRLH
>NE1065 putative plasmid stability protein
MATMTIRNIDEQLKARLRVRAAMHGRSMEDEVQDILRTALSAEPVQTVSL
VEVIRSRIEPLGGIELNLPEREAIRDPLEPGA
>NE2105 hypothetical protein
MVNGGITKPYAHKGRDDYQNEEEILLLVQFPLLFFWREAYSPERNFCSSM
KTVVDPHFSAHTVLIASADKNPGIISIRT
>NE1839 Protein of unknown function DUF117
MKYKDLRDFLVQLEQRGDLKRVDIEVDPHLEMTEICDRLLKQAGPAVLFE
RPAGHTIPVLGNLFGTPERVALGMGQTSVSALREVGKLLAYLKEPDPPKG
LRDAWEKLPVLKQVLNMAPKQLASAPCQEIIWEGADVDLGKLPIQTCWPG
DVAPLITWGLTVTRGPHKSRQNLGIYRQQVIAPNKVIMRWLAHRGGALDY
RDFCQIYPGQPYPVAVALGADPATILGAVTPVPDSLSEYQFAGLLRGAKT
EVVKCLTHDLQVPASAEIVLEGYIHPDEMAVEGPYGDHTGYYNEQETFPV
FTIERITMRRNPIYHSTYTGKPPDEPAILGVALNEVFVPLLQKQFTEITD
FYLPPEGCSYRLAVVSMKKQYPGHAKRVMFGIWSFLRQFMYTKFIIVTDD
DIDIRDWKEVVWAMTTRVDPVRDTLIVENTPIDYLDFASPVSGLGSKMGL
DATNKWPGETTREWGCPIEMDAAVKTRIDHLWQQLPF
>NE0168 multidrug resistance protein MexB
MNSRFFIERPIFASVLSIIIVIVGLVSLKNLPIAQFPEITPPVVQIDADY
PGASADVVAEAVARPIEVQLPGIDNLLYYESTSTNDGHMTMRLTFEIGTD
IDIAQVQTQNRQKLAEPQLPDEVIRQGITVKKTSPDLLLVVALSSTDPSH
DTIFLSNYALLRVLDNIKRLPGVGDASIFGGQNYSMRLILDPVRMAQLNF
TPSDIVAAVREQNRDFPAGRIGREPMVNETELTIPVMTKGRMSEVKEFEE
MIIRAYPDGSMVRLKDVSRVELGAQSYDLQARWNGRPNTFLLTYLSPGAN
ALETAQRIRNEMDKLAKDFPAGASYDVPYDTTTFIEVSIQEVVKTLAEAL
LLVVLVVFIFLQSWRATLIPAIAAPISLIGTFMGMEALGFSINTLTLFGM
VLSIGIVVDDAIVVVENVERHIAEGLTPKNAAIKAMQELFGALIAIILVL
ASVFLPVAFLGGITGELYKQFAVTIALSVMISGFVALTLSPALCALVLKP
GHDSPNKLWKTFNRTFDWAQTRYVSMAGTIIKRSALSILIFLMVIFFAIG
LFKTVPGGFLPEEDQGYFITVVQLPDGASISRTQKVLEKIESYFLSIPSV
HSTDAMAGMNFVFSSRGPNHATMFVPLHHWDTRKNADEQVQNLIANAFGE
FAKIPEALVLAFNAPSIRGLGATGGFSIQLQDPSGGDFSEFAETAQAFVN
KAMEHPAIGIAGTNFRVSAPRMYAHVDRERAKALGVPISDVFDTMQAYFG
NLYVNDFIKFGRVYRVQTEASPEYRSSPDNISNIYVRAQSGTTSTMIPLD
SVVTTEFNSGPDPVTHFNGFNSALVLGSAAPGYSSGQALDALQEVANEVL
APKGYTIDWSGITYQEKMAASGQKIWILSFALLMVFLVLAALYESWSVPF
AVILAVPFGILGALLAIWARGLTDDIYFQVGLITLIGLSAKNAILIVEFA
SDLYAKGASLTEAALEAARVRFRPIIMTSMAFIMGVFPLAISSGAGAASR
IAIGTTVLGGMLAATFLAIFFVPLFFVLIGKITRRDKHPTPEPVQAETGD
QAVTTNEKEDTDGK
>NE0049 Sensory transduction histidine kinases
MILTSLTQRFAIWFTLVALLPIALVGYGLLHIFEGEIRKSAIQQVSSIAD
KKVEQIDSYLRERILDATFIQTSDTTRQAIDEFSQIFSQFGVDSETYRAL
DARHQAHYERYLEGTGYYDLFLISPDGWIVFSVSHESDFATSLLTGPYQD
SGLGQVFRQTKETHQSSLSSFEYYPPSHGAIAAFVAVPIMIEGNFHGVLA
LQIYSEHIFDVLTNNVGLGESGETVVTYMENKHLARVMAPLREDPDAALK
RTISLDEPQFSLAIRNSLSGEHGGGVVTDFHGNPVIAAWRYLPRVDWGMV
VQIDTEEAFASVMQMRVMGLVVLGLTFLMAILGALLFNRRVVTPLKNLNE
CAQAINAGNLQHHIPDEGWDEIGQLASAFNMMTERLDISYRHLEARVAER
TARLEQLNAELAIKEEETRSVVEHMVDCVVTTDERGVMLSVNPVMEKLFG
YTVEEAIGQNVAVLVPKPDRDMHEYYMQRYCHTGYGQQYVGRPQQPGGHH
VGLGREVEGVHKDGTLLPVYLAVSEYFVGDKRHFTGIMRDMREHVKIRQD
LEQARRDAELANQAKSAFLAAMSHEIRTPMNGVIGMIDVLSQTSLSSYQK
EMTDLIRESAFVLLEIIEDILDFSKIEAGRLEIEQRPISLEKVVERACGT
LTHMAASKEVELILFVDPALPKNVAGDALRLRQILINLINNAIKFSSGMS
RPGCVIVKAVPADRANGDVVTVLFQVIDNGIGMDDATRERLFTPFSQGDT
STTRRFGGTGLGLTICRRLVELMKGDITVESQPGKGSTFKARIPFRMIQE
GAVQQDEPDEVDLSGLSCIVMGSEDGLSPQLATYLKYAGAAVERISGLSA
TLERIGQLPAGEWLLVADASDGEFPVDELRAAFGDRPDASAGDQLRFVIV
RRGRRHRGRMEEEGVVTLDGNVLYRRVFLHAVAVAAGRAKLEEDQTAAVE
VVSKVVPLSREEALQQGRLILVAEDNEINQKVIHQQLTLLGFAADIVING
REALQSWKNGEYALILTDLHMPEMDGYQLTQSIRSEKTGHYQIPIIALTA
NALKGEAEKCRALGMDDYLSKPVQLEQLKSVLEKWLPRSKQTEQQEERIL
PPSAATGKVVVDINVLKSLVGSDQAVINEFLQDFLHDTRKIAEQLHSAIT
GGQADKAKFAAHKLKSSARSVGALALGEWCAQMEMAAKEGDVEKLVELLP
GFEQELAQVESYIKKVLAQPDVEAQKDEI
>NE2511 AraC type helix-turn-helix:ThiJ/PfpI family
MTAISIGNNDNFDIIVAMKLHILVCDGVFDLGLAALTDTVGLANAMSGSL
PQAPAHIELTLVGVRRRIRTAQGLTVPVVTARGVPEPDVVLVPAFGDKMP
DTLSARLTRPDVPDAVAMLQQWSTAGAHLGAACSGSFLLAESGLLDGHRA
TTSWWLGPMFRQRYPNVTLDESRMIVNSTRFTTAGAALAHVDLALRIIRG
RSPALAALVARYLLVEARSSQAEFVIPDHLAHADPMVERFECWARRQLAK
GFSLAEAASAAGTSERTLARRLQSVLGKTPLSYFQDLRVEHAVHLLRTGN
ASVDQVAAQVGYSDGVTLRALLRRKLGRGVRELRRGG
>NE1083 hypothetical protein
MSVPLAALAAFTLAYAGMTGLSLAMPRHYEQVAGQRVLPSGRRHFFRILG
WLLLILAVVPCIQAWGTAVGVVVWFGFLTAGGLLIILMLPYLPRLAALAA
AGTTIAGVLILLVT
>NE2466 putative lipoprotein
MYQKFNKPVNAALHLILVLSITACASQNKFFDTLDYNRDWEAIQSNLPAY
PQPENLLEFDSGPATSLRYFIDAKSISVDEKRVIRYSIVIQSQQGANNVS
YEGLRCETRERKRYATGNNDIRSWVRANTSEWQPLEAVAQLRAQRELAKY
YFCPRGLVVGSPAEAVRALKAGVHPMVIR
>NE0554 Sigma factor, ECF subfamily
MRHYVAFQTVFLSRIIPLHRAYSRTSPLSTMNPSVILANLYLRHFSELRA
FLFRHTGCREIAAELTQETFIRIMGYQTDTSIQNARALLYRIAGNLAIDH
HRTCVKFPESVSIDDLPQHELPATELTDPARLVAARQLLEKLCIAIDALP
PQCHRAFVLHKFDGYSHAEIAEKLGITRNAVEKLLIRALIRLRQVLV
>NE0111 Protein of unknown function DUF48
MTSLFVDRRGVELGLESGAIVFRENGERIGTVPIAPLTRVFLRGDVNLPA
ALLGKLGERGVGVVILSGRTSRPSLLLARPHNDAARRVAQVRLSLDEPAS
LIIARELIERKLTRQIEWFTELRENDIQARYELSRALRGLEEHRARLGNI
NNAASLRGIEGSAAARYFTGLQAVIPGSLHFHGRNRRPPRDPFNALLSLT
YTLLHSEITIALHGAGFDPYIGFYHRLDFGRESLASDLLEPLRPLADRFA
FALVHRRVLDKDHFTTTESGCLLGKAGRVRYYAAYEEHSEILRKGINQEI
EQLAEQVGSALTPESGNTPDHDSGDWE
>NE2057 hypothetical protein
MKKYLTGVVLFGLLTLFTGSTWAHSDEQLDGVAAPHGGQLRMAGPYHLEL
VAKDGELRLYVTDHMDHEVLTKGGSGKANVFDKDGKKVSVTLIPVFANFM
KGTGEFTITPETVVSVFVVLDGAETQAARFTPLKKASAKAEDEEEHHHGD
ADQGEHHHHGDVEHEQQPTDQSDAAESEENEEHHH
>NE1810 ATP-citrate lyase/succinyl-CoA ligases:ATP-grasp domain
MDIHEYQAKEILAEYGIKLAEGGLAHTVEEAVQRSREIDGNVWVVKAQIH
SGARGKAGGVKVCRTHEEIEVAAESLLGKKLVTHQTGPAGKLCSRLYIEA
GTEIAREVYLAFMIDRSHERIVMVGSAQGGMDIETLAATNPDAIKKIHIE
PAVGLQDFQARTMAFALGLEDVLLNHAVKTIRGCYRAMRDLDANILEINP
LVVTRNNELIALDAKMSFDENALFRRHRISELRDNSQIDSREIAAAEAGL
SYVGLDGDIGCMINGAGLAMATMDMIKLAGGEPANFLDVGGGASAERTEK
AFRLVLADNNVKAMLVNIFAGINRCDWIAEGVVQAVRNIGMTVPLVVRLS
GTNVEEGRRIIADSGLPIITAETLADAAEKVVHARNQAAV
>NE1611 Bacterial general secretion pathway protein G
MLICTLPGSFPEKSKFKDHLNMKYASNIRPCMVYERGFTLLELLVVMVII
GLLAAYVGPKYFSQVGKSEVKMAQAQIDSLEKALHQYRLDVGTYPTTEQG
LNALLTAPANEPRWQGPYLSKRLPADPWGRPYQYKYPGAQNDFDLFSFGK
DGQPGGSGEDADITNW
>NE2408 DNA internalization-related competence protein ComEC/Rec2
MLIGIYSLAFVFGALWLQQQSVLPEFYWAIGLIPVALGVLVLLRFQTRFS
ILAGRGLLLAVMLGAGFFWAALWAQVRLADDLPSAWEGQDIAVIGVVTEM
PQLTRQSMRFRFEVERVLTPDAAVPAHVQLSWYRDGRRDEGNLPRITAGE
RWQLTVRLKRPHGSANPHVPDYEARLFERNVRATGYVRAGESNKRLEVQE
IHPVYLFERKRDEIRSQFQHYLAGYPYAGVLIALTVGDQQAIPSEQWETF
TRTGTSHLMAISGSHITLLAGMVFLLAYRAWRYAGLALWLPARKFALVCG
LIVAIGYALLAGFAVPVRRALFMMAIIVVAFWRNQRVRTLPVLGWVLLLV
VVLDPWSVIAPGFWLSFAAVALICLVISGRIGRPGVVAGWMRIQWAITLG
LFPLLLILFQQISLISPIANAVAIPVISFIIVPLALLATIPGLEFLLLIA
HPVLQITMGVLQWLGELPLATWQQQAPPLWAVIAATLGVVWLLLPGGPGL
GVSAGFPARWLGILAGVPLFLISPEKPAEGELWLTVLDVGQGLSVVVRTR
NHTLLYDTGPGYGENDSGKYVVLPFLRGEGVQALDMMIVSHVDSDHSGGA
LSVLKRIKTDVLLTSIEDNHPIRQAVPDNRHCLAGDAWWWDGVYFEILHP
VKPDTLIRKRKTNESSCVLKVTTSHGSVLLPGDIGRVTEEDLLQRYAREL
ASSVLIVPHHGSRSSSSEVFVRQVDPDHAIFTVGYRSRFGHPHAEIVERY
LEHGSRLYRSDHDGAVLLRFVSGNITADTWRRLNRRFWHDEWPSADRED
>NE0825 DUF214
MSFLDALRFALRALTAHRMRTFLSASGIAIGIAAVILLTSIGSGIQHFVL
SEFTQFGTNILNITPGKIRTRGASTGSIGSVRLLTIEDSLALKSSRHAIY
TNATVTGNAEVRGGGRSRRVTVYGQGPDFDRAFNMHTAIGQFLPADDPRN
PRAFAVLGAKVHKELFGDANPLGSVLQVGGSRFRIIGVMASKGQVLGFDL
DDTVYIPTARALEIFNREGVMEVQVVYPPTTPLDEVMEDIKRIMVERHGR
EDFTITPQQQMLSTLSTVLDVLTFAVAVLGGISLLVGAVGMITLMHITVN
ERMAEIGLLNALGATPMRIRILFLLESTALSTLGGMIGLMTGSGIAGLLS
VLFSDLPVNIPWRYVLAALILSGVIGLGAGVVPAMRAARLNPVDALRAE
>NE0756 conserved hypothetical protein
MNALESLFYEISRLFLTPVLIVLGLMFIYALFTAGTLLMDTLLRTVGSRQ
RQPLVLFMHAHPAATQEMLELQLLRLIEPLRITSRVAPMLGLVATMIPMG
PALVAVSAGNAQGMAESLIVAFSAVIIALLTASLSYFVLTIRRRWLLEEL
EHILGSHEVPTHLDGVVHG
>NE1568 possible Oxidoreductase
MKISALKAVVVGLGSIGVRHLNNLHALGIRELGAVRTRNLPPPAQIIPKD
VQLFQSLDQALKQNFDLVVVANPTSLHLKTLIEALKAGCHVYVEKPVAHE
KRHLSELMRCVDPHGPRVLVGCQLRMHPGLRKIEEWIQQGRLGKIYSVQV
DLGEYLPDWHPWEDYRQSYAARADQGGGVILTLIHELDYLHWLFGKPRSV
FAIGGHRTSLEVTAEDTALISFETEQGICVQLRMDYWRKPPVRHMNIVAE
KAIVDWDYPARLTTLQQNGHLLEEVILAPSWDRNELFLSMMKEFIEGIPG
GSIPRVTLQEGIDVLNTALAAKQSLQTGRQVRL
>NE1130 hypothetical protein
MSNSRFSTIAFFLGSSALSLIFLARSVLAEELPKLQNLDLFKLNGFLSQG
YIKTNKNNFFGHSNDSGSLDFREIGLTASLRPSPKLQLSGQLLHRRAGEG
SNGGIHIDFGFLDYNFANTPAAEIGIRLGRMKNPFGFYNDTRDVPFTRPS
ILLPQSIYFDRTRNLALASDGIQVYGESRADWGNITVQFGAAFPQVDGHD
TEISVLRSLQHRGDLKTKLSYIGRILYEQADGKLRLAISGAQANVGYSPG
YNDTLLNGSFRFSPLILSAQYNAERWSITSEYALRHLAWKDFGNHAAFQQ
SFTGESFYLQGVYRFYPNWEAVLRYDLLFTNRKDRSGKKFSASTGYLYPA
HNYYAKDITVGLRWNITPAIMLSTEYHRINGTAWLSPLDNPDMSHTGKNW
NLFAVQASYRF
>NE2245 conserved hypothetical protein
MNLYSLLLSLITGCFLLAQGGTSFSSPVETEKSAVTAQSILEKADEIRFP
QDSFQVNVAIRTAAPDHAEDLYRYQVLSKGNENSIVMITEPASERGQAIL
MKGRDLWVFMPSVSQPIRLSLSQRLTGQVANGDIARANFTGDYHPQLLRN
ESIDDEDYYVLELTGIDRSVTYQKVLLWVNQSNFRPYKAEFYSVSGRLLK
TSRYENFDNILGEMRPTRIIMEDALKSGEVSVLDYSDMKLRDLPDKIFTK
DYLKRLE
>NE1537 BNR repeat
MKKSTYSFPTLAGFFLTLASVVAIAAGPGTGDPTQQPSVMQASKSPKTAL
AVGVTLDQEGQLWLAKVIDQRLLVSRSEDDGKHFSESVTVTPVPENIGND
GENRPKIQVARDGTVLVTWTELLAEKYAGNIKFSRSTDSGRTFSEPIVLN
DDGRVTSHRFDSLAIDGKGRVVVAWLDARDRDAAREKGEEFKGVSLYSSQ
SFDNGAHFEPNRQIHQHTCECCRTALTWTSEGPVVLLRNIFGTNTRDFAV
ASLDKVEEGVRRVTRDEWQIDACPHNGGSLATDGRGQLHLTWFTSGTAAQ
GQFYKRISGNQESEPMALGDMDAQPNHAAVVAHGETVILTWREFDGNVYS
AKMMFSNDGGETWSEPWRLMLSAGANDYPVPLISDSKALVVWNTENEGLR
VLSVERVINRSDG
>NE1602 Bacterial general secretion pathway protein G
MVKLADQTVIQSVSNRLKRPGGHWRERGFTLIELLVVMVVIAILLTIVTP
RYFNSVDRAKETVLKQNLSILRDAIDKYFADTGKYPTDLHQLVEDRYIRA
IPVDPITEENVWVEIPPPDPEMEGVFDVHTTSDRQALDGTFYEKW
>NE0536 BNR repeat
MTIQNTLTKIFPVGLLVWFCWSLPVSAETGQPGFPIQEITVPASEESKQH
HLAKTDDGRLILSWVESDGQNSTVRFAIREGQGWSPVRTVTSVDGKLGDP
PVVFGLDDGSLAAAWMPYAKGGKSKYAADIFLARSQDGGLTWSKAFKPYG
ESARIYDAQMSLTALPNARIALVWTDMRETGDPGKNDRYQLMATVLAKGQ
QSAGTELRLDDDICSCCRASTTAEGESLLTVYRDHSQGEIRDTGAVRWDT
DGKVQALSAPGDGWRIEGCPSNGPAVDMSASSVALAWFSAADDKGRFKVA
FSTDGGKGFTKPFEVDDDARGYVNVALLSTNVALVSWRKRAGPEDELRIA
KVTTDGISRQTAIYQGDFPGWPSKYPGMIVLDHQAFVAWTDPIKKKVRLV
AVTLD
>NE2414 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVCSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE2399 DnaJ N-terminal domain
MSRSPRQESLSDPVNSSELTITDVLTHAKPQTKQQATFQRLIRQIDEQRA
QVAEWHTYSERYGQRVGSELMPLFAQLREKRIAMLHLLDTQFHQRNVIRG
KQQRAKLCDIILNLAHELLLEQRDEEIVALYDKYSDPSYDDQAGIDKTIS
QDVIEELFGVRLDDDNADVSMEEMLAKAMRQRQEEEAKNKAGSGSRRRKS
AKQAAAEEKRAAAKKEISQSVREIYRKLASSLHPDRASTDLTADEKTTLM
QRVNQAYDRSDLLELLNIQLEIEQIDTNYLTNLPDERIEHYIQVLREQLA
ELKAELASLLMPYWKLVPYTQKIRPTHVDKAMDAEITRLEMDLRQADIDL
IAFQDAKQLATFIKHYQVNDGFDEFDDGLQMLGDLLSSFSAPPPRRRKR
>NE1077 conserved hypothetical protein
MPTIQSVRRTQSGRPGKRAINLSLSADVLDAARQLDINISQVCDTYLREV
VRHEQERRWREEHADFITAYNATIEAENLPLDEWRSF
>NE1078 putative transmembrane sensor
MTHPSVDQENLPQTASLPENIIEAAITWSIRLDYNDPTPETRQSFDQWLH
ADPLHRLAWQRVDSLNNLRNDCRSLPSRLAFDTLQEVDSRRQSRKLSRRN
AMKLLSLAGITIATGWTVREHTPWQRLLADASTVTGEQRVLHLTDGTVVV
LNTDSAVSFDLTEAQRLIVLQRGEMHITTGPDSEVSARIGRKRPFRVRTP
FGELQALGTRFVVRLDEQHARVSVQEGAVELHPSAGGSSAIVQAGESRRL
TGSSILPEADRYFEEDGWVSGVIAGKNIRLADLVAELSRYRPGYIVCDER
VADLPISGIFHVKDTDKALQFLAQTQPVSVMYRTRFWVIVGPKSLYR
>NE1837 putative chorismate--pyruvate lyase
MKVNPPLAWHPAPVSAPVDLHWWLTHRGSLTRLIQDRCEHFRVEPIFQAL
ATACIDELEVMNLRRQTRALVREVYLRCNETPVVFAHSIVRKEHLRGAWR
GLSRLGNQSLGTMLFTNPLIQRTPLAFKKLKPHHPLFERACKRLQNRPAD
LWARRSLFILQHQPILVTEVFLPAIRRL
>NE1328 possible transmembrane protein
MPHSLLRHCTVCLGWAGRGFLLVAILACLLLLALRYWILPDIGKYRSDIA
AAMTRVAGQPVRIDSIRANWDGLRPHLSMQGVSVSDRQDEPVLVFPEIEG
TVSWRSLLRGELNFHEIIIDRPALVTRRDVKGVLHIAGVTLGENQQESGF
LDWLLRQRRLIIRQAVIYWQDELRRRPVHYFESVNLHLQNKRGGRRHQFG
LQAHSSDPLFSRVDLRGDVTGDSVQTLSAWQGRLFIQLQDVDLGSWQEWM
TLPADLVLKKGKGSMRAWADIRAGILARWVSDISLRSTAVHVARQLPVLE
IDRLHGRWGWHEIEDAKGVNRQWFARGMSIELNGLPLTGPIDASWRVLNP
KDGGLPIHSLQAGGLRMDVLTRLAASLPAEEALHESLSRLSARGMVKHVN
LEWRGDWTKKPPFRINAGFNDLAVQSFDDYPAFSGISGIVDATEAGGSLF
LSSKNVEISKSQSPGEKFRFDSLIGRVDWKTAADHAVTRIKFSDIAFESD
AGSGTLQGNCAFDREMPARIDLTGGLSHADMHLLGEYATWMADETWKDKL
DKATLSGKLSDAKFHLRGVLNGRSADKKGAFSIHAETAISNAGIKISDDW
PEVSDMAGRVSIQDGALDLSLSSASVAGIRLQKFMLQSDDVYADRPEIRI
KGMAEGESGEMATLLRRVDVGPHVGELLGQAEFSGKGRLQAEVELSVGQE
KFSVTRMQGRYQFVDNRIDLDRYIPDFHQVNGSLIFTESGVVLEGIRAQV
MGGAVEFSSASLPEGGVRITARGRADFDHFRPDANLVKRVDLSQLWTQFI
RGGSAWQAAVEVESGKVGIVVESSLEGVELMFPAPFSKTAAEVIPVRLEK
VFTLPRDDHVHFHYGDILTAEFQRIREKAHYYHPVRGIISFGGHGTLPQD
RVTRVEGSVSRLEWDQLRELFKRHAGMDASLDHTARGLDNILTRSVQFDL
HIGQFEFLSSYFNDTHFSIDRQGESWRVDVSSQEVDGRIDWQAASPQKVV
ARLSRLKIPEDAPENILTPYKHDPPGDWPAVDIEADELFVKGGLLGQMKL
SAVQRQDGWLVENLDIRHPNSRLQANGLWENHKPPYRMYSHIRLQSSNIG
KLFKRHGYPGRIARGKGVLEGNLDWAGKPFSVDFATLYGSLQLDAQHGQF
TELKPGIGRLLGVFDLKSLPRRLTLNFYDVFGKGFGFDELNGHISIKSGV
ASVDDLYIAGSAAELILKGEWDLVNETQTLNLKVFPSFGLATPIAGIAAM
IAKRALQDPFDRVLLNEYAITGSWSDPVVVRLDEERDKVE
>NE1191 putative transmembrane sensor
MNQPVPNRPAARPDTPPASYPQEEADARDFAEFIAGQDPLDAAAAGWVVR
RQDGLTPEEEAELQEWLAADPAHARALAQVEDVWSRMDELPDEGVVALKA
GLPSHKAGTALPPSVAIPTPVPVAPEPQHHPRPSSSIQPASPGRRSWLIG
LGRFVPQITAAAVAFSVVSVGWYGWSTWQHQPTFQQSFATARGEQKKVNL
PDGSTLWLDTATRAEVALYRQRREVRLTDGQALFAVQSNPDQPFDVLTGG
TRITVVGTRFSVRRTRSGLGAEGSVSVVVEEGRVRVASRSAAHKMDSSSS
VELSAGQSIIADAAGALGPVNRDDAPAATWREGRVSFSGTPLMQALAEFE
RYGDTGLVIRDPAVAALKVNGSFDLRQIGAFARALPRVLPVQLSTRDGQT
EIIAAPGG
>NE0323 Polyphosphate kinase
MAPVAKPEMPAAKQFLNRELSQIEFNRRVLAQAENEETPLLERLRFICIV
SSNLDEFFEVRVAGLKEQIKLDAPGSGADGMAPGQAFERVSEQVRELVSR
QYQLLNDNILPALVEQGICFLRRDSWNDGQREWIRDFFFRELMPMLTPIG
LDPFHPFPRILNKSLNFAVALEGKDAFGRNSGVAIVQAPRILPRVIRLPK
EISGSPYSFVFLSSIIHAHVGELFTGMRVVGCYQFRVTRNSDLFVDEEEI
KNLRIALQGELSQRHLGNAVRLEVADSCSQQMVEFLLGQFSLGNEDLFQV
NGPVNLVRLIQVFDLIDRPDLKFPRFIPGLPFFSQKKKAHNIFELISKGD
ILLHHPYQSFQPVIEFIQQAYADPDVIAVKQTVYRTGSDSTLMEALVAAA
RLGKEVTVVVELLARFDEEANINWAKRLEEVGAHVIYGVFGHKTHCKMAM
IVRREAGQLKRYVHLGTGNYHPGTARVYTDFGLLTCHEEICTDVSEVFLQ
LTGLGKASKLKHLWQSPFSLHGQILNSIQQEIKHANAGEPAAIIAKMNAL
LEPSVIQALYAASGAGVKIDLIVRGACALRPGVPGLSENIRVRSIVGRFL
EHTRIFYFLNAKQHNVYLASADWMYRNFFRRVEICFPVLDPRLKKRVILE
GLKPYLRDNVQAWEMNGDGHYKRKSARKNNRFSAQEVLLEKLAAVKQDAV
E
>NE2432 conserved hypothetical protein
MSLQSWAWIHKWSSIVCTLFMLLLCLTGLPLIFHEEIEHLSGVVEAPPMP
KGTPDASMDRIAQVVLARRPGEVIRFMFWDQEEHPDLTYVSMASRVDAPP
EESNSVVIDSRTAKVLDEPKTNEGFMYVMLRLHVDMFADLPGTLFLGFMG
LLMVVAIVSGVVLYYPFMQRLKFGVLRMQRSARIKWLDIHNLLGIVTVVW
LTVVGLTGSINTLDRIILSLWQMDQMAEMTAPFKDLPPVTQPAHLEASLR
AARAAAPDMEVRLVAFPGTMFSSPHHYTFFLRGNTPITSRLLKPALVDAS
NGELTDSRDMPWYVKTLFLSQPLHFGDYGGLPLKILWAVLDLIAIVVLAS
GLYLWLRKPKTTAASVSKADDAVLPLDDDHTAPNNAMVYINENKDR
>NE0979 putative transmembrane sensor
MPQEPESGADLPSDRTAVFPTPEGPPLPKQVTLQAAKWLVMLQAPNTTQA
TRAKWQQWRDVHPDHERAWQHIEASATQMRAMHSPLAHGTLIRDDTEKRH
ARRRAIKLLGVIAFTGTSTWLLRDNPVWREWHADHRTATSERRTIVLADG
TRIMLNTASIIDVHFDNAIRLVTLIRGEILVTTAPDSASGTGQPFIVETD
QGRLRALGTRFAVRQRENDSHVSLFEGMVEIRPADASDQARVLQAGEQIS
FTRNAIETPQTIDTHAASWVTGMLVVQEMPLTDFIDELARYRPGYLSCDP
ELSHLKVSGIYPIEDTERVLDMLLHVHPIEIYSLTRYWVAIRARGDR
>NE2095 conserved hypothetical protein
MPIFNFRDEEALGKVASVDTTNVIVDVENVAHLKRLQVNHLAVLQSSRPG
QHLIGLITQVTRKRGIENISDDGIVDQNSELNLCRIALIGTMLDRDGSKE
NVFRRTIESVPEIDANCFSLEGENLTGFMRTLSSVSAEGNALTLGKYTLD
ENAVAYLNGNKFFQRHAFIGGSTGSGKSWTTAKIIEQMSGLSTANAIVFD
LHGEYSPLTGPGIQHFKVAGPADVEAKRTISDDVLYLPYWLLSYEALVSM
FVDRSDQNAPNQAMIIAREINQAKRKYLEDNGQQALLKHFTVDSPVPFDL
DFLMERLNSINVEMVPGAKAGTEKQGDFFGKLARMISRLENKISDRRLGF
MFNGGGDILDFAWLEKFANAALGSTGENGKAGIKIINFSEVPSDVLPLIV
SLVARVTFSVQQWTPSELRHPIALLCDEAHLYMPQRNMADSADDISLDIF
ERIAKEGRKYGVSLVVISQRPSEVNKTLLSQCSNFVSMRLTNAEDQGVIK
RLLPDSLGGFSDILPTLDTGEALVVGDASLLPSRIRIDEPQNKPNSGTVN
FWDEWQKPVKDNRLLIAVDNWRKQNIQ
>NE1551 hypothetical protein
MRTKYILELFDTSEQHKPIARFESSTPFTAASVGERFDDIGWERLDGAGK
IASPLSPKRYTVHSAKHLVIVEAGALVIKYCLNLEPFSGPSSPVWGDE
>NE0230 hypothetical protein
MGLRLRRNSAIESLTIRVKAVISESVKGFGFFAIGGHESVAWLARPLPQI
LSAVWRPVGLQLHIFNPPKIYPAVTTITTSGSVAVITKSALGSVTGCSSL
SLT
>NE1327 Carbon-nitrogen hydrolase
MSTDVGIDQAARSKAGDNTRVRVAAVQMASGPSVAANLEEAFRLIEEAAA
KQAKLVVLPEYFCIMGMKDTDKLAVRENPGEGEIQNFLSETAKRFGIWLA
GGSVPLISPVSDKVYNSCLVYDEHGQQVARYDKIHLFGLSLGNENFAEER
TIDAGNRVVALDSPFGRMGLSICYDLRFPELYRMMGKVDVILAPAAFTAI
TGKAHWETLIRARAIENQAYLIAPAQGGFHVNGRETNGDSMIVDPWGVII
DRLPRGPGVVVAEIDRAYQSSVRASLPALEHRCLLAC
>NE0864 conserved hypothetical protein
MNLHLFVPDIFWPDGSQTDIYQHLKLPALETILAKSNRFEVGGEVLESWL
CKIFNVNKQQDWPIASILLQREKNRIEVGESYWLRADPVHLRVENNHILL
GDNQILNISLKEATSFADSINEFFSDEGVTLLPLHSDRWYVKCDETPELQ
TFLLSQVVGKNINDLLSRGKEGAIWNSRINEIQMFLHEHPLNRNREIQGE
LPVNSIWIWGGGVRPSEVRTSYTGIWGNHVLVHALAEMGKVMCHDLPENA
DEVLNHSGNSGEQLIVLDNLQKYACYRDAYNWRNELMKMERDWFDPLFQA
LKKRQIQQLKLTIINESSTKDFVLTPGSLWKFWAAVRPLGTYS
>NE1675 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE0446 conserved hypothetical protein
MTIPAKNTVCLWYDDCAEDAARFYADTFPDSFVGAVHRAPGDFPSGKQGN
VLTVEFTVMGIPCIGLNGGPVFTHNEAFSFQVATTDQAETDRYWNAIIGN
GGQESECGWCKDKWGLSWQITPIVLINAITHPDPAAAKRAFDAMMQMRKI
DIATIEAALRG
>NE1450 Uncharacterized protein family UPF0051
MSAVLQHLVNQPYKHGFVTDIEADIAPKGLNEDIIRLISSKKNEPEWLLA
FRLEAYRKWLDMVEPAWQNVKYPKIDFQDISYYSAPKPKKKLASMDEVDP
ELLRTFEKLGVPMHERAALAGVAVDVIFDSVSVATTYKEKLAEVGIIFCS
ISEAVQKYPELIRQYLGTVVPPGDNYYAALNSAVFTDGSFCFIPKGVKCP
MDLSTYFRINTEESGQFERTLIIAEEGASVSYLEGCTAPRFDTNQLHAAV
VELVALDDADIKYSTVQNWYAGDENGVGGIYNFVTKRGLAKGRNSRISWT
QVETGSAITWKYPSCILRGENSVGEFYSVAVTNHHQQADTGTKMIHIGAN
TRSTIVSKGISAGQSNSSYRGLVRIAPTARNARNYSQCDSMLIGSQCGAH
TFPYIQVQNDSAQVEHEASTSKIGEDQLFYFAQRGIGAEEAVSMIINGFC
KEVFQQLPLEFAVEATKLLSFKLEGSVG
>NE2023 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPKFIWSRQMPEPR
>NE1999 conserved hypothetical protein
MNPGKPMPLVVHGWTIFAHPLFLAQIEVLIQQVEAHKQKDPVGFVKKNAS
KRLAAITKLAFDIILQDPARPEYRQGGTLGDDYKHWFRAKFFQQYRLFFR
YHTLSKVIVFAWVNDEDTRRAYESSDDAYRMFRKMLENGLPPDDWNQLLA
EARAEGQRLQQFAARWW
>NE2412 transposase
MKRYELNREQWCRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEVV
>NE1858 conserved hypothetical protein
MTIQPENFRKLTEWTQFLSEVEIPVLRKTANDLAALREQEPNPSARNVAR
IIRHDPLMTVKLLRHLQQHKHKRQQQEIVQVEQALIMLGVENALDQVVAE
PLVQHLLQQRPAALVNLLQRIHRAHIASNYALEWAIRLYDTHFEEIRIAA
LLHDITEMLLRCFVPDKMLEIDALQRQDSTLRSATVQKTILGFTVYDLQA
TLIEAWALPELLIMLMDKKHIAHPRVRNVVLAVNLARHSAHGWDDAALPD
DYAEIGELLRITPNDARTLVHARY
>NE2176 conserved hypothetical protein
MKKVAIVQSNYIPWKGYFDMIAAVDEFILYDDMQYTRRDWRNRNQIKTPQ
GVQWLTVPVLVKGKYHQKIRETEIDGTDWAAAHWKALVQNYRRSPHFTEI
AAWLEPLYLAETFTHISQLNRRFIEAICNYLGIKTVIKNSWDYTLLDGKT
ERLADLCVQAGGTEYISGPAAKDYVDEQVFKENGIKLTWFDYIGYPEYQQ
LWGEFTHGVTILDLLLNCGKNAKRYMKYVE
>NE1084 conserved hypothetical protein
MKSNADSLTDTATESDPPVRGFRQRMTWLHTWGGLWAGWVLFAIFLTGTL
GVFDDAITRWMKPERPLVAEVAPGSAEQRAQAVRLAQTYLQQAMPRGEFW
SIGLPGESDPAIRLFWRENEDAKFQQTRLDPVTGTELDKAVDRETEGGHH
FVHMHFEFHAGEAGIWMVGFFAMIMLVALVSGVITHKRIFKDFFTFRPKK
GQRSWLDAHNVASVLTLPFQFMIVYTGLAIFYSLYMPAGIFAHYPNKDTY
FSQLLSRPAPREETHIDAQVASLDKLLLTAETELGRRASFVSVNHPGDSS
ASVTVFGLFDEEENEKYLLPPGSGNVIFDGITGETLDIQMSGDHRGGEAQ
AVQRVMGTLHFARFGGDTIKWLYFISGLAGAIMMATGSILFMVKRRQKAL
NEFGSHTRRVYRLIETLNVAVIAGLCIACIAYLWSNRLIPVGIEDRSHWE
ITTFFTVWLMALLHASIRPVASAWVEQLSLAALLCLALPLLNWLTTGQQV
LTYGLQGDWERVGVELTVIGLGLLLATMAQKARSMAPVLPPRSAVVATQQ
KTITASVPYRNSILMRVLAATLGGYAVASGLAILLPMVLPIARAEAVLAS
TLLSFAAYTGVIIWVFSARAPKRAWQGVFFLAIGCALTILFNATFGGM
>NE0573 Aminotransferase class-V
MNTNEFNQWIDKVSGEASAYVPDIESLTQMGNALFNTPPLSRSTEPPALT
VPGAEVSHSRPATGSSYPSTIPLPCEAELKALFTPNQYRAPAHISLSDSD
HSGHPPWESSFYFLDEGIAPERHDRAAQIPAAHPPFDIHAIRRDFPILQE
HVNGHSLVWLDNAATTQKPQAVIDRLTYFYTHENSNIHRAAHELAARATD
AYEESRNKVRQFLNAASTDEIIFLRGTTEAINLVAQSWGRQHISAGDEII
ITWLEHHANIVPWQMLCNETGARLRVVPVDEDGQVLLEEYQKLLNSHTRL
VAFSQVSNALGTITPARQMVEMAHRVGARVLVDGAQSVSHMRVDVQQLDC
DWFVFSGHKVFGPTGIGVLYGKTELLNDTQPWQGGGNMIQDVTFEKTAYH
GAPARFEAGTGNIADAVGLGAAIDYVERIGLENISRYEHELMTYATGCLK
SIAGLRLIGTAPDKAGVLSFTLKDFSTEEVGTALNREGIAVRAGHHCAQP
ILRRFGVESTVRPSLAFYNTYTDIDRLAAAIDRIRGKKNAF
>NE0165 hypothetical protein
MPLSYAGCLCGIAAMVRLLKYIWAAPCSLFGLGCGLFLLLIGGSVRQVSG
ILEFSIGYGNPIPFFPFWAITFGHVVLGLNESALEYSRAHELEHVRQYEV
WGVLFFLAYPVSSLWQLFRGRNPYWYNYFEIQARQRSGQKRLGL
>NE2546 Haloacid dehalogenase/epoxide hydrolase family
MIEAVLFDFDGTLADTAPDLGRALNRQRTARNQPPLPIELIRTEASAGAR
GLLSLGFDLKPGDPEYQAMREEFLSFYTEQLCQDTCLFPGITELLEQLDS
RAIPWGIVTNKPAKFTGPLMHLLGLHHRAACIISGDDTPYSKPHPEPLLT
ACRQINMAPDHCIYLGDDIRDVQASLAAGVKPIVALYGYLGNCAPPETWG
AAELIDHPRDLLHHI
>NE0506 TPR repeat
MLAACGILSEKTVDHSKWSASKFYVEAKNELNEGNYAAAVKLFEALEARY
PYGRYAQQAQLEIAYAYYKDQEHASAIAAAERFIQLYPHHQNIDYAYYIK
GLASFNDDQGLMGYITHKIIKQDMSERDAKASRESFESLKQLVTRYPDSK
YTPDALQRMAYLVNALARGEIHVAQYYMKRKAYVAAIKRAQFILEEYPQT
PATEDALYIMAVAYGELGMTDLREDVEKVIRKNFPESIYLTDSGAVKGKS
WWEIW
>NE1230 conserved hypothetical protein
MCTFQPGSLSNTWLEKKSVIFYPVLTESRGYPNIPLRLPVIRIKNSSCQG
DIVRFLSIIIAAIFCTFPVSAATPASDDSYIAGYAAAILKFQFGIDLPSL
TVRNGNITLPADKLPAEDRTRITQLFSEIPGVTRVEIVEYTAQQPSLASP
EPDEAIVDKGALATRSTMLATGPLPEGHLFKPLLADPRWNHFSAAYRNYV
GRNVDGNHNGSVSFGETIPFYRANIGQSIVQWEVGLQAGIFSDFNLGASS
TDLINTDFIGGIYTSVRAGNASAFARIYHQSSHLGDEFLLRKLTDIERIN
LSYEAADLRLSYEFPYGIRVYGGGGGIFRKEPSAIKPWSAQYGIEFRSPW
QMEFALVRPVFAVDIKNYEQNNWNADISARAGIQFDNFQAFNRKLQFLLE
YFNGYSPTGQFFREKVEYLGIGAHYHY
>NE0918 hypothetical protein
MNEEEETGRIIARLLDRSLNDVTPGTLYRLQAARRAALEHYQPAEKVLHA
GVGISAQSGYHWLSAHAGRLLLTASLLLFLAIHSYWQMNNRVDDTILTPV
ILTNDPPIGSQEIEDTANGYEAADEDIVEETDSREDTDHGNYGGEADTSE
TESSTNGTADSDDVTRSFDSTEIQETENTAEAPYTTNYDQDSVTEEDTGV
ISDHLQNSEDIIDSYDTENTQDSTATIDE
>NE1402 YgbB family:2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
MKKIRIGQGFDIHPLVVGRDLIIGGVTIPYEKGLLGHSDADVLLHALCDA
LLGAAALGDIGRHFSDTDARYQGIDSRKLVREVHRLLTEAGYRVVNIDAT
IIAQVPKMAPHIPGMVTNIAQDLTLPAGDINIKAKTAEKLGVVGRGEGIV
AEVVCLIADGDEV
>NE1879 conserved hypothetical protein
MKKNSSSIRSLVAISICIPFFTGCANVTTGTGGTAVTGAAAGGASAGANA
DLERCDKTLGTLAVDDGREANWYHSFGSATSITTIEPLIRLAVQQSNCFV
LTSIGNARTDSRLSRITDMQRNSGEYRAGSKQQKGQRVAADYYMEPQIII
DNDSTGKVGAAIGGAFNPLVGALAGALETKVSVVTLSLFDIRSSVQISIS
EGSSTANNYGAALGGFGGGAGGALGGFSRTPAGKATVAAFFDAYNGMVRA
LKHYEAQKVEGGLGRGGALKVN
>NE0342 possible transposase
MEISASQFKLIENLLPIQRGNVTLSNLEVLNAILYVAEHGCKWRGLPVKF
GNWHSIYTRANRWARNGVLDRVFLALQQNKLIQLEVDHMSLDSTIVKVHP
DGTGALKKTVFKLSVNHEGAGLPNSSGRGRCQNRGSVFVIPRPGR
>NE1581 PIN (PilT N terminus) domain
MILLDTNVLSEFMRLQPATQVVVWLDRQAPNEIWTNAVSRAEIELGLALM
PESKRQKSLSQAARTMFDEDFAGRCLPFDEIAASYYGRIVSTRTRMGRPI
SVEDAQIAAIALAYRMFLSTRNTVDFEDIAGLNVINPWETEA
>NE1472 possible transmembrane protein
MEARYVRGRQGLQWILSGFYFFRMAPLNWILLCFTYLLIGITLGLIPLLG
SFIGILTVPVFVAGIMVGCRKLDLSGKLELEYLFYGFKKYTVPLITIGGV
YLIGDILITGIFMLLGGDAVVDMWLHGKRFSENELPGVMDDLLFASLLCL
LLAIPLMMSIWFAPMLVVFENMPPLIAIRKSFFACLKNLFAFQIYMAILF
VLGMLAAMLYGLGFIIWFPVAFASVYVSYKDIFHYEQDEDTQPKSDEPST
EENKEDSSQTNEH
>NE1878 conserved hypothetical protein
MNIQIILSLIFSMILMACTTSSPQKDLSRICDSSGCSDRSGNYVSNHSSA
ASSDEEARIRVLEDVARQDPRAAYDLGLRYFRGDGVRQDSYQALQWMREA
AERGDLNAQKALGRLYLTGLEEMGADYREAEKWLRIAASRGDKESEQLLV
EAAAFASEERRSGEIYHRWVNRWQPVFYQRWYYGYPYLGTWRGNYWYY
>NE0892 putative phage gene
MNILIPLGAFLEAAAGPLAVRVLTSLGIGVISYAGLTVSVGAALTYMQTQ
YFGLPSSVANLANLAGLGQCLGIVTAAITFRVAFQVQRKTLGVLNK
>NE2490 putative MinD-related protein
MISHKKDQAAGLRELTALEPNAGVRVFAVVGGRTGVGKTSSVINLAVALA
KTGKRVLILDENPRHKDVNANLGLSARYDLLHVINQDKTLEQVMTQGPED
VLVLTAMRGIHSLAKLSSADQERLIQCFSELSQTVDVVLIDTAIGRTSRV
VPLSLASQQVLIVLSASGKAITDAYALIKLISQEYARRDFLVLVNKVESE
SMGRDIFENIAGVAQKHLAVKLEWMGCILFDEKLQRSTRLCQPVVEIFPT
SASAAGYRQLAEKLMRCAGPWTDNDGVENFMRRLIRTSHLDIADFTV
>NE1121 conserved hypothetical protein
MSNREAYIKKAEAQLKEWGAQIDLLKAKGENLAADTEIEFKKKLDEAEEK
RAELSSYLDQLADKTDSIWDDIKDEAEEKWNTVSKTFSDFVDKFK
>NE0609 Glycine cleavage system P-protein
MPFIPHTEEDVAEMLTSIGARSIDELFDEIPAELKTGKLTQVPPGLSEME
ISRLMYERAQQDGFYLSFIGAGAYEHHIPAAVWQITTRGEFYSSYTPYQA
EASQGTLQLLYEYQTMMASLAGMDVSNASLYDGASALAEAALMAVRQHRT
SRRILVPQTVHPVYRSVMRTIVRNQAIEVVEVPYDPATGQVAIDKLDQFA
QEEFAALIIPQPNFFGVLEQVDALTDWAHDKQSLAVAVVNPTSLAMLKPP
GEWGRRGADIAVGEGQPLGIPLSSGGPYFGFMTCKQELVRQMPGRIIGRT
TDLEGKEGFALTLQAREQHIRRSKATSNICTNQGLMVTAATIYMSLLGPE
GLYRVAAHSHANTVALVEQLEKLPGVKKAFHSPFFHEAALQLSVPADKVL
NRLKAQGVLGGVLLENHYPDLKNTLLVCATETKTAEDLDKYTEVLRQALA
ASA
>NE1091 conserved hypothetical protein
MRSMTLEQLRTASETGGVSSVTLKGQGGAFLVQINTRSGVAAILTKARNS
EPRRFGNPAAALNVLREVGITIGQFDASEWNPDEREPVARSSDNRAKALH
KAHEAAAYNEWLAAEIQEAIDDPRPGIPHDEVMARMDARIVRHKAAGAKR
A
>NE1027 Esterase/lipase/thioesterase family active site
MHNRVIRWILLQLVTALLIISAVTIILGEIVTGSAPTAVETLLPDFPVET
VQIPVNDEYAVHGWLAHGMSGHGAVLLVHSMRSNRLEMLGRARFLNNQGY
HVLMIDLQAHGETPGDRITFGARESADVAAAVGYLRSTFPHDRIAAIGAT
LGAAAIVLANPPLKLDAMILESLHPTFAEAVANRLKLHLGNTGEYLQFLL
LPYFSFLLDLPVNQLNPVDRIGNIAIPVLFIAGTLDRHTTQSEVKRLYDA
ALPPKELWIVEGAGHYNMHTFAGKSYEMQIADFLSTYLQRQ
>NE1499 putative membrane protein
MMIDYLILKHLHVTCVAISYTLFVLRGIWMLNASSRLRQRWVKIAPHIND
TVLLLSAVALAVLTHRNPLVETWLAAKIIGLLLYIMLGLVAFRLGKIRRA
KVMAWILAQIVFAYIVSVALTKNPLLF
>NE0662 Transposase IS4 family
MCGTPARCVAPLAALFEMLKGQCDGISIADATAIAVCDNRRIARHRVSAD
SARRGKTSMGWFYGFKLHAIINSRGELIRLRLTAGNVDDRKPMPDLCQGL
FGQLFADKGYLAQWLTETLDRQNLQLITPFKKNMKPAPRTGFEKAILRRR
SLIETVFDELKNLCQIKHTRHRSFFNFVVNLMAGIVAYCLSDNKPTLNLT
RVNTLVKA
>NE0638 mttA/Hcf106 family
MFDISFAELVVVGIVALIVIGPERLPAVARTAGYFLGRARRYVDQVKHDL
HEEMELDSLRKLRDSMHETVNSFENSVRSEINKIQETTETQSAVIPDKQP
PFEKAAENIEPVNTSETKTSSAPAEPRQPNS
>NE1694 Glucokinase
MDQYLLSGDIGGTKTLLRSAVVRGEEVEFHHEHLYDSHQYDDFDAILADF
LERSGCQPVAACLAVAGPIVEQQVHLTNLPWMISAAGIAEKFSIPAVKII
NDFEGTAASIEILPQDDLITLQAGKPSSSAMRVVLGAGTGMGVAWLAWRG
QYYEPLATEAGHIDFAPTSAIQIELLRYLMVRYHRVSIERLLSGQGLTHI
FNFLQTRATEGTHLKSIELNVDDGATVTRLAFEHHYPIALQALDLFVEIY
GTYAGNLALAGLCRGGVYIAGGIAPRIIRILQQSGFIEAFCNKGRYSALV
RDIPVYVVMNPKAGLLGAGLLAQRMLQHQTVSSRQ
>NE1135 hypothetical protein
MIEKYRIKKPPERAVRIFESYLSQFLLSFSALSTLSGMESRSIDMLGGYW
AGFGSPQAVKKVVTASAAANTSESFIFSSPIN
>NE2420 Biotin / Lipoyl attachment:Carbamoyl-phosphate synthase:DUF183
MFTKVLIANRGAIACRVIRTLHRMGIGSVAVFTKADALSRHVLDADDSIC
IGDGLASESYLRADKILEVAHRTGAQAIHPGYGFLSENASFAEQCAAEGI
CFLGPTPEQIRTFGLKHTARALAQQLDAPMLPGTGLLEHLETACEEAHRI
GYPVMLKSTAGGGGIGMRLCRNRQELIDAYESVKNLAQSNFSNAGLFLEK
YIDQARHIEVQIFGDGRGKVIALEERDCSMQRRNQKVIEETPAPGLSPAT
RNALQETAIRLGESVQYRSAGTVEYVYDPSVDRFYFLEVNTRLQVEHGVT
EMVTGIDLVEWMVLLGADALPPLESFTIQLQGASIQARVYAENPAKNFQP
ASGLLSAVEFPETARIETWVDAGCEVSAFYDPMLAKIIVHASNRDQAIDR
LIAALDKTTLYGIETNLDYLRQVLSGTVFRSGQVSTQFLSGFHYQPHTLD
VLNAGVQTTVQDYPGRTGYWNVGVPPSGPMDSLAFRLANHLVGNPDDSAG
LEITLAGPTLRFNCNSLIAVCGAPIDVWLDGAPLEQWRSHFIKAGSLLRF
GKIRQFGSRSCLAIKGGIQVPDYLGSRSTFTLGQFGGHAGRALCTGDVLH
ILPAEDTEPSILPSENIPHYTSRWEIGVLYGPHGAPDFFTGADIDEFFTT
DWKVHHNSSRTGIRLIGPKPQWARKDGGEAGLHPSNIHDNPYAVGAIDFT
GDMPIILGPDGPSLGGFVCPAVIVQAELWKTGQLKPGDSVRFHPLTAEQA
LKLEKQQDKMIQQLDSFTRVPPDIPASLVEPSDPVLHRIPANNGQVQVVY
RQSGDKYLLIEYGEAVLDIALHFRAHALMAWLQHCVEKGDLPGILDMTPG
IRSLQIHFDSRLLDRDHLLNKLITAEKDLPASDDMEMPSRIVHLPLSWDD
AATKLAIEKYMQSVRPDAPWCPSNIEFIRRINGLDSIDEVKRIIFDASYL
VMGLGDVYLGAPVATPIDPRHRLVTTKYNPARTWTPENAVGIGGAYLCIY
GMEGPGGYQFVGRTVQMWNRYRQTRDFTAGKPWLLRFFDQIRFYPVSEQE
LLELRHDFITGRFTLHTEDTVFSLRNYHAFLHQHADSIQAFKTRQQKAFE
AERERWKIGNVQEENSISDEGEGVLDKPGMQNAPDLPDGAYAISSDVTGT
VWKLLIETRQLVNAGDPVIIIESMKMEITLSAPVSGIVKQISCRQGDYVF
SGQMLLIVQEE
>NE2345 hypothetical protein
MLLNRVIFQKQSDRLLFNASLSNTVLRILFFYCFPAVCLFVSSIAATRAG
QPPEQINQYLRINGEFLGSYENWNYFRPASAVNNSYDLWVVRSRLGLMFS
SDYADGFVQGQYSGLYGLPDDAVLPSGGALGLGAGYFLANRTTGASNVFL
KQGYLNFKFNKLGLPGAAVKIGRFELADGMEYRSGVEKFDALKKKRIAER
LVGGFNAIYVGRAFDGFSVVYDGPGFNATVSGVHPVQGNLTVQGQKQISD
ISILYAALTSKKDAVLPGIEGRLYYLNYDDQRVSQVTDNRPLSARPQLSN
EKLNIHTIGAHLLSLQPLGSGSFDALLMGAYQFGSWTNLSHRAWAFDAEV
GYQWHKLPFKPWVRAVYYRSSGDGNAHDGRHQTFFSAVPSGRLYAKFPFY
NQMNIQDIFFEFIAFPTGKTQINVNLHQLSLANVNDLLYTGLGASLKSGA
FGYSGSTTHGHREIGQLIDVTLTHGFNKYLTSQLYFAHAFGGSAMKSIYP
SKSDASIFLVNFNLVF
>NE2450 putative Mrr restriction system protein
MSIPKHDQIRVPALLLLAERGQLKLSDFEQPLAKYFGLSDGEVQEEYESG
KGKIFYDRISWALSYMNMAELLNKPKRGLYQISELGREKLKTPNIINQFI
AEQIAKKQENKPSKIGKITVSEENLTPQEELYESAKKIRQACYQEIIDTI
LSKKPHAFESLVVLLLQKMGYGGEIKQSGLVTPYSNDGGIDGIIKEDVLG
FGRIHIQAKRYALENSVQRQEIQSFVGALAVAQSNKGVFITTSSFSRGAI
QYAVGLNGSTTLILIDGQKLAEYIYEYGVGLQVEHTVEIKKLDSEFWDSM
ENEAGVPDL
>NE1973 possible proline rich signal peptide protein
MLLSPATLAAEKGHIEIESFHVRKSGESFQIDVEANIDLSRTMKQALKKG
VDLYFVTRLLIMKPRWYWLDEEVARSKERIELSYQALTRQYRLTQHGQPR
NFPTLKAALQALGHQPDMLIRENQPLLPDTTYTAILQIWLDISRLSKPFQ
LEWLDTEDWSLSSQRKIWQIKFPPASDAGNESGLH
>NE2494 hypothetical protein
MLIPYTYVPHQMEKMQAFIDFIFHEIWCKAPASGPFGLHLFNANAELREV
MEAFYYSDAQGADFFYGHVERIYGLFSALTFVQISQFQQWYLGNNDLEKV
CANAPAAQIVRYADIATTHQDLADQLASFFKGLYSQSLLGLATLRAKIGD
IDDHYQAFVAANKMGKCPFCGIGDIKGEHHSKREAYDHYLPKALYPFNSI
NFRNLAPACHECNSTYKLSKDPAHNAVGRRKAFNPYAAADHAIQIQVALP
HADIDALTPADITMHFGPVELAEELETWKDVYGIEERYKAKFCAENDGKY
WLTQVLDEWKEDGRDPADFMTTLVRQVQKNPYAECNFLRKPFLDACQQVG
IFK
>NE1979 Peptidase family U32
MLKAPELLLPAGSLEKMRAAYAFGADAVYAGQPRYSLRARNNEFTLPQLC
TGITEAHQLGKKFFVASNLMPHNSKVKTYLRDMAPVIELQPDALIMADPG
LIMMVRETWPDVPIHLSVQANTVNYAAVKFWQSLGLTRVILSRELSLEEI
SEIRTLCPDMELEVFVHGALCIAYSGRCLLSGYFNHRDPNQGTCTNSCRW
DYKVKAAEENASGDIELPQKIDFDFNEALQSTESISFSNTAIKRHPAADQ
VYLIEESQRPGQLMPVMEDEHGTYIMNSKDLRAVEHVAHLTEIGIDSLKV
EGRTKSAYYVARTAQVYRQAVDDAVHQRPFDPALLGQLEGLANRGYTDGF
YQRHHSQETQNYLRGHSESSRSQYVGDIEHVDATTGMAEILVKNRFQTGD
RLEIVHPSGNRVIQLEQMQDLEGNAVSVAPGSGHRVRIPLSGDIEKAFVA
RFL
>NE0980 putative sigma-70 factor, ECF subfamily
MSTTEEASENEITILYTDHHGWLKRWLYRKLGNSFDAADLAHDIFMRLLA
REQSTVIHQPRALLTTIAQGIVSNFYRRRKIETAYLETLAHLPEAQAPDP
EIRVILLETLIEIDRRLDMLAPLVRRAFLLSQVDGMKQTDIAAELNLSLA
TVQRYIVKAVHQCYFGE
>NE2418 conserved hypothetical protein
MNQVTTYLWETHLPGGSHWSGVVRRGTTLRLTDVSGGANAAVLFYNLEEK
LERYNMADTLKTQHTFCLTKGHACHSDMGRIFCCITEDTAGWHDTVCGLS
DAELIQQKYGTGRYQELRNDMYRNGLDGMLVELGKWGLGRRDVVSNINFF
SKVTANSVGELQFHVSHSKAGDHVDLAFAMDTLVVLSAAPHPLDPAATYR
PGAVNLAVFPSTPAVTGACAHIAENARGLENTRRLYAYGGVK
>NE1102 putative transmembrane sensor
MSTPGSTPPPSVADQPHPLPLANGHPISGATADAAAQWLTLLMSDEMTDS
DYQCWQQWRAAHPDHERAWQHIEAVTARFRELPSTAAYKSLSPLTNPVSA
GKPGSPGRRKAMGTLLWLGTAGVGSMLVSRTQIWQQTVADYRTGTGEQRA
IYLADGTHIMLNTYSAIDVAFDAQHRTIRLITGDILITTHPVHTPISDPR
PFIVETAEGRIRALGTRFTVSQRKGRTHVAVLEHAVEVTPAAAPDRQYIL
PAGQQLSFTRHTLNNATTLDKQTTGWTQGQIIADNIRLGDFITDLGRYRT
GLLRCDPAVAELRLSGVFPLDDTDRILETLPSVLPVRVRLRTRYWVIVEA
AS
>NE0800 conserved hypothetical protein
MSNNDQFAALVLAADRGSQDSVARYAGTVCKAYAPVCGKPMVIRVLNALN
DCHKIKSALLCGPPESLLSACEELEQRIDTRQVDWMENLDSPARSAAHGL
SRIDSQQPVLLTTADHALLTSEIVEYFLQVSSNLECDAVVGVIRYETLQA
RYGDTRRTVIRLQDGNFCGCNLFAFNNVRGRTLVDFWQQAEALRKRPWKL
ISRVLGWQAVWSYLLGRLTLDQALQKISGKTGVRVCPVILPFPDAGIDVD
KVEDLQLVESILIRSDSTRPLM
>NE0657 Uncharacterised P-loop hydrolase UPF0079
MHSSHVVKLDSEAATLALGEQLATLFHPGLTVFLYGDLGAGKTTLARGIL
KGLGHHGKVRSPTYNLVEIYKLSRLYLYHFDFYRFNDSLEWEEAGFREYF
NQDSICLVEWPEKAGEFLHAADLEIRISYSGTRRIAEFSAATEAGEQCLS
HWQKRVSD
>NE1387 Polysaccharide export protein
MNTENHFIRIGLLLAGIMLISALSGCATQPDWVSSSGASREQILKKSEPG
QIGGIGLVGVDDTLARKLFEAKKLGQFADIFPSSSTNNYIIGPGDVIDVT
IWEAPPAMLFGAIVLDPSAGPTTTRGVTFPEQMVTSDSTIAIPFAGRVSV
RERTAQEIEREIIVRLYGKANQPQVLVRVIKNNTSNVTIVGEVNNSTLMA
LTPKGERLLDALAISGGVKQSINRMAVQLSRANVTATMALDSVIRDPKQN
ILLKPGDVITALYQPQSFSVLGATGKNEEVLFEAQGISLAQALARSGGLA
DNRADARGVFIFRFEDAKLVDAGHSLTAGADGTVPVVYQVDLRDPASFFV
TQNFPVQNRDVIYVANSPEAEFNKFMKLLISVAMPGLTINRMFLLN
>NE1336 Glycosyl transferases group 1
MKRPLQTWLTESWLAFSRLLTFHILERRLFEHSGRIPYEPQPGNLLYVAA
SVLPYHTSGYTTRTHEVVCALNKAGARVHVLTRPGYPWDRKDRLCDPVSE
ETTVGDMCYRHAQAPANNRPVLQYALQAARVVAESAVRHRVAVIHAASNH
VNALPALLAARQLGIPFQYEMRGLWELTRISRQPGYEDSQAYKQGLQLEG
LVARHADRLFVISEQLGKYVQTNWDIDAGRMALLPNCVDPERFLIADPQQ
IESNTIGYAGSLMGYEGLDTLIDAVDILVRSGSSVRVVIIGDGEARPQLE
TRVQRLGLSERIYFSGKMPPDAAREKLARCALVCIPRKPFKVCEIVPPIK
LVEALAMGKPVIVPDLPVFRDELGDDPAGWFFRSGDAADLAHVIEAALGN
EEKLREMSDQARDYVLAKRCWHRFVDKMAKPYDS
>NE0232 hypothetical protein
MTNALIPDPVSAGEVMVTDEGHVEKFLNGIFTCEVPAVAVPDYLRKNTVI
GRFAHDVAAQAEMPFGTVFVTILGAASVPAACSFTTRFESGYELPAGLFT
VCEQPPASGKTRVLNYGLHAYQAAIRDLNKQIHEHNKADKQNQKPYFIDL
ITDGTAAAIDSKLAESKSGRLPLASSEQGLFRSLFPAEGGFHSNNDLLLT
GWDGGWVSGARSTRNAFTGRVSTQVVMFAQNGSIRRVLQASNGSGLTERF
LFVAERSLLGRRKFEPHTVDASQYDKAATRCVERMAAEKPPIIIEPCKDG
RAYIRQQRIAHEAELGKLERAGEAVMVGWLGKFENHVLKIASVIHSFEFM
QNEEIDFSYPVQIPLATVEAACELVMSLHEHMRAVIDAAGESGLQTATDS
VLSILRENKAPMAVSAVTAKARRRKPFADMGRDDYKASKAFIDTLIVKGI
LLKSHNKLSVAE
>NE1636 hypothetical protein
MIWVNNLISGFDGIYWYPNSSLLRIEEWDGEFTVFQPESGKTHFLNEMGL
RILTVLDRSPATLEAICQELSAYFSLQLDAQFPGQIIRTLQRYEALGLIT
RVKENE
>NE0757 CobN/magnesium chelatase
MYQSDGGNRFTFFVHDNRYMKRIFLLITLILLAPAAFAHKVALLSTQFVL
EHKFRLMQAASEGSGIDLHWVQADREGEEGVRKALAEARLVIIDAPRAED
QAQIEHIAGKLLRETAVPVVTFQTFGGQISPIRLAPEIAERLYGYYVAGQ
KVNRERLFHYLRIWMQGGDLSEVPPPADLPDGGIYHPGHADSIFPTLPEY
LNWWEKQMADPALERPVIAMEIPSTYIADGQTRMLDEVVSALEQAGALPL
VFYRSTRISRGSTVQVSGSNESNRRPGATGNRGFGERGSAGAGADNSREN
PVDIVQPRGSADFPNPAERRNITIDEPLITYQGKIIPQVILVNTFLGSAP
ENRKAWHQALGIPAINVLSYRGGTRTDYLKDSAGIGSYFVPFTLTTAEYI
GLQDPVVLTTNEGGEFVPLPEQLDLLVGKAINLARLSLRANADKRVALLF
WNHPPGEKNQGASNLNVPRSLEYLVEHLRSEGYAFEAVTEQQMIDTVGQM
LRPAYRAGGIPELMRTPHWDFLPLARYRAWYATLPESVRNDIEARWGNPE
ESVWLAHKDSVNGFVIPRMQLGNLVVMPQPSRGEMAGMEEEKKLFHDTRQ
PPGHAYLASYLWIREQFSADAIVHFGTHGTQEWTPGKERGLWAYDYPNIL
VGNLPVIYPYIVDNIGEAIHVKRRGRGVIISHQTPAFSPAGLSDDFVRLN
DLIREYQSLDQGLVRDNNRKLIIEQAVRMNIHQDMQWKVADLERNFDNFL
RDIEDYLEDLGTAQQPLGLHTLGRDAEQAHQISNVMQMLGQPLYDLLGIE
DARTLLREDYRKIQQSEPYRFVETYVFSDQPLNDINDPGLRAMAERGRTF
RMNLHAAPEIDSIVKGLSSRWIDPSYGGDPIRNPDALPTGRNMYGFDPGR
VPTRAAYEAGGQAVEQLIASYKLTHDKFPEKLAFSMWSTETMRHLGMLEA
QIFYAMGVRPVWDEGGRVTGMEVIPLSELGRPRIDTVISLTGLYRDQFPN
VMERFNQAITLLAEQDEPDSDNPIRANTRRIRAQLKKLGVTPETAREFAL
TRIFGNESGDYSTRLTQATLASDKWDEKDGKLENLYLSRMSWAYGPNPAH
WSQKLTDGKGNEINAYAEHLRGTSAAVFSRSSNLRGLLDTDHPFEYLGGI
SMAVRHLDGTSPQLYISNLRDPARARLESAEKFMARELRAVYQHPNWIRE
MQKEGYSGTLQMLDTINNFWGWQVMDRNITRDDQWEEFHQSYIADRYNLD
MRDWFEKSNPAALAQIAERMLEAIRKDYWEASEQTRRELVEVYTDIANRH
DIQTDNATFKAYVAELAAGYGLNTPDTPVESQANPQPEPQQTADIVRGQQ
MVETPVQPMIEEILWNYAWLLIGIIAAGAVYQSWRTYRERQAFSGLKLS
>NE0823 TPR repeat
MKTRAFNSLVVGLLSAVVSVSCSTIQSDENAAGHHEKHEVSTVEHRSREL
SALADQPLSGDQLASRLQNLGTHSFPVSTQHEWAQLFINQGLSLAYGFNH
AEAGRAFQEAAQLDPGLAMAYWGQALVLGPNVNALMDPADEPRALELVKQ
AESLMVSASPREQALIRALKKRYSGADEDRKANDKAYADAMREVYRSFPD
DADIAVLYVESMMDLRPWDYWMRDGHPHEGTDEIVAVTEDVLRRHPVHPG
ALHMYIHLMEPTNTPERAEHAADTLMTLMPGAGHMVHMSSHIFQRVGRYA
DSVKSNQLAIAADEHYMGQSHAPGLYPMAYYPHNIHFLWFAATASGQRAL
ALESAQKAASKVDDALLREMPFTAIFRVVPYWALARFGQWQAILAEPAPP
AFNAFLKGSWHYVRGLAYVATKQSQRAERELQQLRRIVKNRAALDNPLLS
RNTAYDILRIGPEVLAGEIAAARGRYESAVAHLERAVRYEDALVYTEPAE
WHYPPRLALGAILLKAGRPDEAETVYWEDLKRNRDNGWALFGLQQALIAQ
KKEAEAKVIEARFKKAWEHADITLTASRFGR
>NE1053 Uncharacterized ATPase related to the helicase subunit of the Holliday junction resolvase
MTDSPHTIRNPAAPLAERLRPRTLDDVVGQSHLLGPGKPLRLAFESGKPH
SMILWGPPGSGKTTLARLMAHAFDAEFIAISAVLSGVKDIREAIERAQIT
LQRTGRATLLFVDEVHRFNKAQQDAFLPHVEQGLITFIGATTENPSFEVN
GALLSRAQVYALKALTDQELHQLFERARSIAMLDLEFENTAIELLIGFAD
GDARRLLNLLEQVQNAAETEEIIKIDADYLSRVLARNVRRFDKGGDAFYD
QISALHKSIRGSSPDAALYWLCRMLDGGADPRYIGRRLVRTATEDIGLAD
PRALTLALNACEVFERLGSPEGELALAQATLYLACAPKSNAAYVAYKQAR
AFIKEDISRPVPIHLRNAPTRLMREMGHGAAYRYAHDESESYAAGENYFP
DNILAVQFYRPTTHGLEAKIGEKLAYLRSLDEKTGKKRN
>NE0849 similar to nodulin 21
MTHDNTHYSHRTGWLRAAVLGANDGIVSTASLIIGVASAHAAADDILLAG
VAGVVAGAMSMAAGEYVSVSSQSDTEKADVALEQYHLDRDIDFELQELTD
IYMKRGLQPELAAQVARELMAHDALDAHLRDELGLHERVNAKPVQAAFTS
AGMFILGASMPLAATIAAPATTHIIPVVAISSLLSLTALGTFAAYLGKAN
MLTGAARVAFWGALAMAFTALTGTLFGIIA
>NE0157 probable transmembrane multidrug-efflux system lipoprotein transmembrane
MDFSKFFIDRPIFAIVLSVIIFAAGLIAIPLLPAGEYPEVVPPSVVVRAT
YPGANPKEIADSIAVPLEEAINSVEGIMYMKSVAGSDGSLQVTVTFMPGI
DPDTAAVRVQNRVAQALSRLPNDVRQYGVTTQKQSPTPLMYVNLSSPDGR
YDSVYLRNYMTLNIKDQLSRLKGVGNVGLYGEGDYAMRLWLNPNELAARG
LTASDVVDAVREQNVQVSAGQLGAEPSPQGADFLVPINVRGRLRSEQEFG
DIVLKSADDGQLVRLSDVGRVELGSSDYTLHAMIDNRGSTSAGIFLTPSA
NSLEVARTVYAKFEELAANFPPGIEYRVVWDPTVFVRDSIRAVGQTLIEA
VFLVVLVVILFLQTWRASIIPLIAVPVSIIGTFAWLYLLGYSINTLTLFG
LVLAVGIVVDDAIVVVENVERYIEKGLSPLAAAHQAMKEVSGPIIAIALV
LCAVFVPMAFLSGVTGEFYKQFAVTIAISTIISAINSLTLSPALAAKLLK
GQSAPKDKLTRVLERSLGWLFHPFNRLFLRSSDKYQGTVARTLNRRGIVF
AVYAGLLLATVVLFHIVPTGFIPNQDKLYLFAGAKLPEGASLARTDAVTR
QLNDLASSIEGVEMTESYVGMNALQSVNTPNLAASYVILKPFDQRQRSAE
SITAELNAKLTDIKDGQAYALLPPSIQGLGNGSGYSLYLVDRGGLGYGVL
QNVMTHFQNVIAQTPGMTFPVSSYQANVPQLEVKVDRVKAKAQGVNLTDL
FNTLQVYLGSFYINDFNLYGRIYRVMAQADAAFRQKAEDINNLYTRNSRG
EMVPLGSMVTIQHTFGPDPVIRYNGYPAADLIGDADPRVLSSGQVIAKLS
QIAAEVLPRGITLEWTDLSYQQVTQGNAAAIVFPLSIMLTFLVLAALYES
WTLPLAVILIVPVCMCAALFGIWFTGGDNNIFVQVGLVVLMGLACKNAIL
IVEFARELEIQGKDIVEAALEACHLRLRPIVMTSIAFIAGSVPLVFSYGA
GSEVRYATGITVFAGMLGVTLFGLFLTPVFYVALRKLAHFNHLKSQEG
>NE2495 ABC transporter
MKLLRLKITDSAGFRSLPCGFEHHFRTEWSLQEELAQPEGFGPFVCAGPN
GSGKSNLLEALAAIFFQLEVQRVRRNFLPDIFQYDPDDNPEGIQEHEGHP
NAYELEYLIKLPKEHRSSGSPEFAHVVVIKERDKSPWLRWENNEAFPVEG
FAFSTLTDEERDLLLPQYVLGYSSGENEILSLPFFKMRFIQFDEYSNALA
RQLPYPGRPETRLAYLDSSFSQAILLCNLLFQDAATLQPFREDVGIEALQ
EFRIVLRRSVPVTHQQVAAFTSGEYVLPTETQDGRFTDTNVVYLDPETGD
YRLNLLQGLEANERTERTAIVEKLKRCATLHFHDEATDTLVLDYRVNEAT
KQAFRANFDDPAGPALALFQALQVLLTLNLYSVSDTIKADLYRSTSHYVS
ETVPTLASDQRIMRFKNFYFTKQGVKKPMLLKDLSDGEYQMLHSLGLCLL
FRKTNSLFLLDEPETHFNPHWRASFITRLRQCLPDVEGVGQEMLVTTHTP
FLISDSKPDKVLVFAKDKTSGEVSISKPNYNTLGASINKITMNTFGKRET
IGGYAQVLLDDLRKRFEEGHEDRETLITEINQQLGDSVEKLLLIKTILDG
NQPADEEAQD
>NE1271 Integrase, catalytic core
MLQVAPSAYWRHAARQRYPQLRSARARRDELLMADIRRVWQANMQVYGAR
KIWYQLQREGVTVARCTVERLMRQLGLQGARRGKIIRTTVVRQNATCPRD
LVNRMFHANRPNQLWVSDFTYVSTWQGWLYVAFVIDVFARRIVGWRVSST
MSTDFVLDALEQALYDRRPADTLIHHSDRGSQYVSIRYTERLAQAGIEPS
VGSRGDSYDNALAETINGLYKAELIHRRAPWKTRAAVELATLEWVAWYNH
QRLLGSIGYIPPAQAEENYRQTQDNKTLMDILL
>NE1547 hypothetical protein
MAFTTSSMLQSVCKTSTLKGCGKIMRSLCRTPRHRSYATTFVVLLAIITI
LPAMRFMEDLIRDSFVVTLPENERLSSFALFTS
>NE2442 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE0091 Universal stress protein (Usp):ABC1 family
MKNPLEGSVIRRVMVGTDRSETADQTVQWAAGLADRYDAELFIVQVIVPK
YPSATEFGESEQTSAVAANNDLAHFARQIAGERGHALVVINADPALAIVH
AAEQEAIDVLVVGNLGMAGRKEFLLGNIPNRISHNAHCTVIIVNAAHSAD
ERAPHSVRASLSNDEIPSFKPRLVARATHIAAVMAKHGLTELFSQSDPDI
SIRRQQARRLRGALEELGPIFSKLGQVLSTRPDLLPIEYIEELVLLQSRV
PPMTESEVVRVSEQELGVPWEDVFKSFDPNPLAAGTIAQVHRATLETGDR
VVVKVQRPTARADIEQDLALFEMFAEKVGKRPALNQVINMEDVFKHLSTS
LHRELDFRQEANNIERMRTVLADYDRLAVPSIHWDLSTSRLLVMEEIQGI
PIKQAPAGPERIQAARQLLESYYKQIIVDGFFHADPHPGNLMWWKDCIYF
LDFGMVGAVGADLREHLLLMLMALWQEDAGFLTDVTLMMTNAVNSNDFDV
AQFQSEIGEVMAKYRAASLAEMQIGPLLQEMSTVSLRHGVPLPASLTLAT
KALAQVQLATAELDPTLDPYDVAGKYLMRLMVKRIGAALNPKTFVYQSQK
LKVRTLRVIEALENLVGVRPGGPKLVVNFKANSLENIVRHTGRRLALGLT
AAASILTSGLTTMSTAVAEWVPVTFGAVAGLLTLGLVIDLLRGR
>NE1685 hypothetical protein
MNTAVKCRVGTLCTFIMMHFCRYHISPANAWHDNTAMKALFFLRHYNDID
HITPVITAWVEAGHTCDVVLIGAGKFRHDFRVSYLASLEGVRVAHIRELL
GGWLYGTWRLQMLLLTGNLRRSFIGPLVSRLAEIHSAERRKPLWQKVTRI
LLARSFKDGEQGVLAFDWIERNSVIATEWVETIVSMARDRGVGTVSLPHG
DSPHANQLIRRGEWRLGPDMTFSTARIFDRVVVPNELCAQRFRPFLPENA
IAVLGSPRFSKKWLDRLTELMPPSPLTPPEDRLRIVIFLRKANFTTFWEE
VSEVIHLIAAFRDVEIIIIKPHTRSGWKQSLTRSKSLKKLANVKVADDSV
HSTYLMNWADICIDLATSVVFEAIRCGKPVLAADYLHAGRSAAAVYMPET
ELRCRDDVYQKISDVLAKGRDGFYVEAHRQRFISEMLDVGGQDVLSRYVM
LLEAIGQGQAESLLRKN
>NE0210 Domain of unknown function DUF28
MAGHSKWANIKHKKAAQDAKRGKIFTRLIKEITVAARLGGGDPNSNPRLR
LAMDKAFGHNMPKDNVERAIKRGCGELEGVNYEEIRYEGYGISGAAVMVD
CMTDNRTRTVAAVRHAFTKHGGNLGTDGSVAYLFKHCGQLLFAPGVGEAQ
LLEAALEAGAEDVISNDDGSLEVITGPDTFVSVRDTLEKAGFKAELAEVT
WKPENEVLLQGDDAVKMQKLLDALEDIDDVQDVYTSAVLDT
>NE1244 hypothetical protein
MVSVEGTTLGEVAIMKQSNVIKLFSIVLGLWLLFNPIGTQAETAKYETIK
TEYQYAAKVACSLLLPHQDGTLAKGIYRTIINIHNPASKKITVAAKVALS
TQMGSEPGPFNVTPFKGITLQPDGAVGVNCFDIAGYFCPINGVCVDFAFL
EGFLVVKSPVPLDVVGVYTARPVEGEVQSIDVETVQSKRIHDIVKLGTTE
LPGRGEGKRVDYPPKGSAAYDGQKPKQMCGGIAGFPCPEGMKCVDDPSDD
CDPAKGGADCAGICVK
>NE0138 Metallo-beta-lactamase superfamily
MINVFPVRAFRDNYIWIVHNQQFALIIDPGDATPVLTWLRQQKLQPIAIL
CTHHHHDHTGGISLLVQKFEIPVYGPASEKIPGMTHPLAEGDTLVFPELS
LELSILDVPGHTAGHIACHGQNRLFCGDTLFACGCGRIFEGNAQQMFDSL
QKLTDLPDETQVYCAHEYTLDNIRFARAIDPDNPELIELESNVEEKREQN
MPTLPSSLAAEKATNPFLRCNQPAIIQSASRYAGRQLTDPVSVFAAIRDW
KNNFRGNTDLPM
>NE1361 putative similar to abortive phage resistance protein
MSFRDPVTFSMIASRERQHGDRLPKLGKYNIRILPIAAIYGGNASGKTNF
FKALNFAKMLVVKGTQPGSPLPVEGFRLDNTSIDKPSRFAFELLIDETVY
EFSFSVNRKTIVDEKLVVVTSTSERELYVRSGGQIKFNEALKKDQFLQFA
FKGTRDNQLFLTNSVSQKVDNFQPVYDWFKDSLDLVAPDSRFELFEQFLD
DGHPLYATMNEMLPQLDTGIAHLGGEEIPFENIPLPEPMKMLLQEDVKEG
MTIRLMSDKNERFVFTRKNGELVAKKLVTYHPKADGTEAKFEIRQESDGS
QRVIDLLPAFLELAALGSKKVFVIDEVDRSLHTLLTHRLLEAYLASCSAN
TRSQLLLTTHDVMLMDQQLLRRDEMWVAERKPAGVSTLISFSEFKDVRYD
KDIRKSYLQGRLGGTPRILLSSNFAEGDEARATEEVQ
>NE0025 Metallo-beta-lactamase superfamily
MQITFLGGAGEVTGSCYLVETAEVRFLVDCGMFQGGRDADHKNYTAINFN
PETIDFVLLTHAHVDHSGLLPRLAVLGFRGPVYMTRATADLLAIILKDSA
HIQEKEAEWYTKTALRNKRPSGHKNTPLYTVTQAEAFLRQIRGIDYDETC
QPHPSVRCRYRDAGHIIGSAIIEVWLKSGSQRKKIVFSGDLGQPGHPIVR
DPTLIEEADYLLVESTYGNRRHRSMQDTRDELVEIICRTFLQKHGNIIVP
AFTVGRTQDLLFLLADLQRQGCLNAMDVYVDSPMALAATEITLNHSELLD
KETIDTIQWHRQHNENLRIHFVQEVEDSIRLNHTRSGAIIISASGMCDAG
RIKYHLKYNLPRPECSILFTGFQAAGTLGRRLVDGNKSVRIFGEQIPVRA
SIHTVGGLSAHADQQDLLDWLKGFSRPPLKTFIVHGEPGNAEVFAEVVRK
QLKWPVSIPQRMTTERLD
>NE2470 NUDIX hydrolase:Conserved hypothetical protein 52
MDFDVLEKTVCFQGFFRLERYRLRHRKFNGEWGRPITRELFERGHAAAVL
PYDPQTDEVLLIEQFRAGAISAPGGPWLLEIVAGVIEANETPEQVVARES
MEEANCQIGSLIPLYDYLVSPGGTTERIVLFCGRVDMQTIEAGAVYGNHG
EDEDIKVHVMPLNEAIRLLSTGRINSASAIIALQWLALNRDSVRRRWLPE
>NE0690 possible transmembrane protein
MSVWVVQAAGLGKLTVSSYLGQPLKAEIALESVTDKEINTLSARLASPKI
FQKAGINLAPYHATLSVAVEKRTDGQPYVRVVSSQAISEPFLKLLIELHS
SSGRMLREYNVLLDPADTRKPATAPPVIQSAGNQAEDSAATAAGETQPFV
KAERQPVSQEKSAVQTGNTYGPVMQGDTLSRIARQVSPDSVSLNHMLVAL
YRANRDAFLEKNMNLLKVGVILRIPDEEEISSITVKEAAREVAVQKESWN
SYRQRIADLAGDAPGKNELKQSQSGKIITVSEDTSVAGSGKPEEVLILSK
GELLDDSHSSVKSAESKTAQNYLHMMEEDAIAKERALREANERVAILEQN
VAKLQRLLELKEEASGNKGQAELDQAQHVSDLQPVPVITDQTGTVAETSE
SAAQTPEKSALPVTIPVQPVNPMSSAKSLQPADPVSEEPALMDTLVESAM
ENSEWVVGGLATLLTIALGASVVRRRRAQTEDLNAEDDLYDFNEKDNGSG
TSVMAEAMPSGLENLESSESTIGSAQQAAGPIMMENSSSTDDHTDVDPGL
FFGSRQDERITEDSFLDDSASQIQEEEDATFNELDELGYEIKIDTEEPDQ
TEYKQEAIVETGENDKDDIGEQWIFTATDTGLERKPDSVQQQDTVEEFVS
LHDEHQIDFDLDLPVDTEAQESESRVKNELSDLGSNLANIDLDLGDKSDT
KFQSGEYSEAEEMQWQEVATKLDLAKAYLEMDDRDGAREILEEVLREGDD
EQQSTARSMLDQIS
>NE1802 hypothetical protein
MKSFGCGNDNFSRLRMCLRVGCIVGLSSWGFQVSAAEWNIQPRLTVSETY
TDNVGLGGGGFGGFGGAGRGGEFITQINPGVSITGEGRRFKSNLSYTLNN
LIYAKNERFRIRNQLNTDATAEIIKHHFFVDGRATISQQNAFLFGPQAPD
NAVLTGNRRNIYMWNISPYVRQRFSNLASGEVRYVHGEVSSNANSFSNSS
SDAAIFSLNSGSAFRTLGWGVNYSHTQIDRKYARSNLGRLQTIELERTTG
TLRYIVTSQFSLIGTAGYERNSFISIRGRTSSPLWTVGFSWTPTKRTKID
ASGGKRFFGNTYAASVDHRTRSTVWNLSYVEDITTFGQQSLAGGSILSAS
MLGQLFSGIQGGDALLNQGLPLSFSDPNNFLTNRLFLLRRLQASLTLNGK
KNSLVFRGFSYSRKSFSSDEEDADLIGIENAALTRDTTQTGGNLLWNHRL
SPRTNANINLGYIRTSYDVTSQEDDNIIVTAGLNKRFTSNISGSIMYYHL
HRESNRNNGSYDANAITATLNMNF
>NE2555 Riboflavin synthase alpha chain family lumazine binding domain
MFTGIIEAVGNIRHVHPEKDAVRIYIDPGKLDLADVAIGDSIAVNGACLT
VTAFEEGCFTVDVSGETLSCAHGLDIPGNPVNLEKALRFSDRLDGHLVSG
HVEGIGEVVEFTQTGDRCLLAIELPTQLLKYVIPKGSITVDGVSLTVNQI
EAGRVEINLIPHTLANTTFGQFNPDRKVNLETDMIAKYVARLLDQAQGRQ
P
>NE0913 BolA-like protein
MTTMKNRIENSLQSLQPQFLEVINESYRHAVPEGSESHFKVTIVSDAFDG
KTLVARHRLVNTTLTQEIIRSIHALALHTLTPAEWQARNETARESPPCLG
GSIAEKRAGAGKK
>NE0668 Outer membrane efflux protein
MIRISLFSLVALTSLVACGTTSEVRTPSSLPAAQMEFIEARTPGVVTTAL
PAQWWHLFDDTELDAHVERALTANADLHIAIANLETARAMVRQADAIRLP
ATVIESGAGPDQADKQPSTSSIPKTSYELGATVAYEIDLFGRLSSGVRAA
RADAAASAALVDAARIAVVGDTVAAYIDYCGAVQTRTLTKSLVRAHEHSF
QLIDQQFQEGEVSPLEVAQAQTVLQRARANLAPLEADRRRALFRLATLEG
FPPSATDGWHLSCHSIPKIEVPLPVGDGTALLARRPDVREAEQRVIAATA
RIDIAHADLYPKISLGSSLGLIAGRFDAILTPLITWPFPNRSAVRARIAA
AKGQEAAALATWDAVILRALREVETALADYRAEQVRRTDLMSALIESEKA
VRRAQARFRLGADSYLLVLDAERTRTDIATLTATSDLRIAQIQIALFRAL
GGGWKQPSGQVK
>NE2200 transposase
MKRYELNREQWRRIEPFIPGKIGDRGRHGADNLLFINGVLWVLRSGAHWH
DLPERYGKWKTAHKRFTRWAQAGIWEKIFDVLTEDPDNQYIMIDSTIVRA
HQQAACGKGGRGVRLWGVPEAV
>NE0545 hypothetical protein
MIFRKNQCNHQTTFYQIRKVDDLIGCVYSFAAMGKRYMTDNFILLILFVS
LLTNAVAWSFHQEIFKHELDHFHVSHRFDHHHSYHDHAADTGFHQHHDET
LDNDPDFTDHLILHAAGQFQPFYFILLPIIPSLPGKENIPGFFPAGIPES
TLDLPFRPPRNTASLEIRY
>NE1123 Penicillin amidase
MLRKSLFFLSALVMVILLTAWSLLKGSLPVYEGEQSVPGLTDIVTVERDA
LGTVTLNGGNRLDLARGLGFIHAQERFLEMDLMRRKAAGELAELFGTVAL
PADRKARVHRMRARAQTMLKILPQDQLRLLEAYRDGVNTGLDSLRIRQFG
YLLTQTMPRAWQSEDSLLIVLAMYMTLQGNNFDRELGLSMMHASLPESAY
RFLTASGGEWDAPLDGSYFEWPPYPSATDFDLRSTSKQALADNGFQESPS
VGSNGFAVGGPLTSGSALVANDMHLTLRVPGLWFRTRLIYPDARHANQKI
DVIGVSLPGTPAIVAGSNRHIAWGFTNSYGDFADWVRVNPDPENPARYIS
QGEWKSVKIWRETLHVRGAPDETLEIRETEWGPILAQDFDGTPLALVWTA
HQPGAVNFDIVELEQADNLEKAAAIAWNMGIPAQNFIAGSKNGDIAWTIA
GRIPQRTGNYDPGLPADWSKQSTGWNGWLAPADYPLVINPPDMRLWNGNS
RMIDGALLSKLGDGGYELGARSRQIRDELYAHDHFSPSDLLAIQLDDRAL
LLARWKQLLDEILQKTPSTVWRNEMQQVLLDWNGHASVQSVAYRIVRSFR
LEVMKQVLSGLTAKVKSDYPEFEIPRLSQAEHAVWKLIEQRPLHLLPADD
SDWDSMLAACARRIAEQMQAQPGGIVARNWGEENTADIRHPFSRALPSWI
AAWLNMPADQLPGDHHMPRVQAPDFGASLRFVVAPGEEEQGYFEMPGGQS
GHPLSPYYGSGHSDWVAGRRVSFMPGAAQQILYLHPAEFRADH
>NE0462 possible mopB flagellar biosynthesis
MQPIRIRAGAALWLWLPQQLLAEPAGNQGFTPPPPIISTESMLQLSAGLL
MVLAFIALIAWLFKKIGFHPANRTGLLKVISSASVGQRERIVLAEIGDTW
LVLGVAPGQVSLLHQMDKNTLPAGETAEPPHASRFTEKLQAHLEQGHER
>NE1341 hypothetical protein
MMDVVEATRILRREFAVWGNRLDRFFLARYSLLVDKPPKDFGSRLQHRLR
QFLVFIHLVPPRVVRRAWLPTLKHSSSAPDARALLIWALGMERDSLRNAC
LGFQRFLASRQDLAPVLVTDVADFAWFSRLGWMVEYLPELEGKGLPYQER
KRDYLAWRYRDAVVVPAAAGLLDEENWNRLLQME
>NE0508 hypothetical protein
MNRALIAVVLLMVMGSVSGMRINPDPRVVVSFMDKSIAPELDILRVMADI
SPDNHHLVFQVKTRGERIQGNDHDYLLLHITHGKTYVLLLPINKEKENQM
LVYERLPQPDDDDLLILGKFKGNSHLTNFNITSIFRGGEFSVPLDWIDFN
TNFSFDAYTVQARIKGDTLKISKVYDWARKGKTHNNEKPLSAITLLNKIC
APKSNNQRL
>NE1608 hypothetical protein
MSLSWRNRIQIFLAPDRVDLTGIARGIRPVQQFRQSGVCVQENDSRQQWK
APLRLLEQMIGQMDDRFRRGSELHITLSNHFVRYGVIAPQPSLANPDELM
AYAGFQMREIYGERIDDWELSLSTWDPYGGALCAAIARDLQSELIMFARQ
YDTRFACIEPYLAAALDHWSKRLVEKQVWFVLVETGRFCLVVLSEGAWRC
ARNQRVVENLQEELLAALEQESIILSPDRSVERVYVFAPELTGQLPVHDL
RWQFVRLPDEKHPAPSYFPGVTGMDDSQNHA
>NE2335 Fe-S oxidoreductases family 1
MGSKLYIRTFGCQMNEYDSAKMADILLSEKGMELAETPEEADLILFNTCS
VREKAQEKVFHDLGRVRHLKNSKPDLLIGVGGCVASQEGPEIVKRAPFVD
LVFGPQTLHRLPDLIDARRRTGRPQVDISFPEIEKFDRLPPARTEGSTAF
VSIMEGCSKYCSFCVVPYTRGEEVSRPLDDVLTEVAGLAIQGVKEVTLLG
QNVNAYLGKMINGEIADFATLLDYIHEIPGIERIRYTTSHPREFTARLIE
AYQRLPKLVGHVHLPVQSGSDRILAAMKRGYTTVEYKSIVRKLRLVRPDI
SISSDFIIGFPGETEDDFEATMKLIDDVHFDESFSFIYSPRPGTPAADLP
DNTSHQIKLTRLYRLQEKIQLNAQAISQGMVDTVQRILVEGPSRKDPGEF
CGRTDNNRVVNFAGHAGLTGSFIDIRITAVSSHTLRGEISDMQ
>NE0711 Uncharacterised protein family UPF0102
MSSAGNKGSDAEQCATIFLQQQKLTLLERNYRCRFGEIDLIMREGDTVIF
VEVRMRSSDRFGGAAASITAAKQLKLTRAARHYLAGCEGDFPYRFDAILI
SGERENEIEWIRNAFDES
>NE2201 Transposase IS4 family
MENCSQTLYAVGTGRYLGKDIRCFDRRPGQSIYYDRQHDRARSSAGRLRK
RGARREALGRSRGGLSTKIHMCVDASGRPLRFILTGGQCNDCTQALDLIS
GFRPSHVLADKGYDSDNILDAIASMKAVPVIPPRSNRKIRRTYDREIYKC
RNIIERTFNKLKHWRRLSTRYDRKAIYFSAFIHLAAATLWL
>NE1429 conserved hypothetical protein
MYYVGIMSGTSLDGIDAVLVDFSGPSFSLLHTCYIPYDQSLRAALLGLNQ
AGENELHRAAILSNQLSGWYAQAVGRLLEKSGIDPGEIIAVGCHGQTIRH
CPQPENGYSIQLVNGALLAELTGMTVVTDFRSRDIAAGGQGAPLVPAFHH
EMFAHRDIHRLIINIGGITNITSLPVSGGVNGFDCGPGNMLMDAWCLKHT
GMTYDHNGSWAESGRVINPLLENLLNFPYFSLPPPKSTGREMFSLDWLQP
CLRGDEATQDVQSTLLQLTVRTITDSVETYYPAVRELYLCGGGAHNGTLV
TRLQQQLPGRRINLTDALGIEADWVEACAFAWLARQSIERAPGNLPAVTG
ATGSRTLGAIYPA
>NE2231 conserved hypothetical protein
MQSNDNPSLTQQLTTLLSHHPVIKLAILFGSRADPARTKHFGSDIDLAIM
TGEPISSHFKMELMQAISTELDCPVDIVVVNDAPEPILGEILKGQRLLGD
NNTYAQLLARHLLNTADFLPLRQRILKERRERWIQSY
>NE1880 Integrase, catalytic core
MSSVHHKIIKHKIGLLNLAAELGNVSRACKVMGFSRDTFYRYQAAVETGG
VEALIDANRRKPNIKNRVEEATEAAILAFALEQPAFGQVRVSNELRKRGI
FVSPSGVRSVWLRQNLESFKKRLSALEKHIAETGAVLTEAQVQALEKKQE
DDVAQGEIETAHPGYLGSQDTFYVGTLKGVGRIYQQTFVDTYSKWAAAKL
YTNKTPITSADMLNDRVLPFFAEQSMGIIRILTDRGTEYCGKPENHDYQL
YLALNDIEHSKTKANHPQTNGICERFHKTILQEFYQVTFRRKIYQSIEEL
QHDLDDWMAYYNSVRTHQGKMCCGRTPMQTLIDAKEIWDDKITELNN
>NE1453 DUF214
MNMIRLSLRMLLRDWRAGELHILTLALIIAVSSVATVSFFSDRVQRALVL
ESNRLLGGDLLVISDTPLPPGYAEYARQLGLKTTHLTRFPSMISFGDNSL
LTSVKAVAPGYPLRGELKLADPPAEGVAAQHIRVAQGIPERETVWVDEKV
MVALEVTLGDKIEVGAAELTVMSMVESEPDHSVGFVSMNPRLLMNEEDLA
ATRLIQPGSRISYQLLVAGDEGRVAGFRRWISERLSPGERVEGIQDARPE
IRSALDRAGKFLSLAALASVVVAAAAVALAIRRFIQRHLDGCAVMRCLGA
TESDLLRLYSYYFVMLGSAASLLGCVLAVLTQEFLSYWLSELLGIALPLP
SVLPFIQGILTGMVLLLGFALPPLLNLRQVPALRVIRRDLELANLHSLAG
YGFGFAILSVLFIWKAESFKLGIMIMFGFLLAVILFGLFGLLLIRLFLLT
RPQGSGPWSYGLANIRRRTIPSLVQATALGLGLMALLTLTLVRNDLLEDW
HARLPENAPNRFLVNVQSDQQKKLMGFFDEHAIPRPEIFPMVRGRLMKIN
DRPVNLNDYPDPRAKQLVNREFNLSWTDTLQSDNEIVAGRWWREEIISSE
HELSMEVEFAETLGLKMGDRLTFYTGGGDFSATITSLRKVDWDTFHVNFF
AVVRPGLLDNYPVSYITSFYLPPDKTPILQELIRMFPNFLVIDVASVIER
VQNMIEQVTHAIEFVFLFTLLAGFAVMYAAIASTQDERIYEAAIFRTLGA
RKQQLARAWAVEFAVLGTLAGFFAAAGASALGYLIGKYVLHITYSPSLWI
GAVGVLVSVIGVTVIGLIGTRTTLSQPPLQVLRK
>NE1379 hypothetical protein
MLPELLAMAQLLVSWLLIIGGWLFVHRATLSRERRKEKREEINNTIQEIR
AIENIAIDFHNSKIFDEKAASSLTLRINRLNRKLQAPPTFNELKIPTQLM
IEFRKTITLEHFDKSNFPSMVQRIMKGNPYPATSIEILIRDINSATDDLV
DCIEAEKNNKL
>NE2242 DUF214
MFKLALRNILRQKTHTLMTLAAVIAGVTGIILAGGWVQDIYVQLGEALIH
SQSGHLQIYRQGYYEAGSRSPEKYLIDTPDTIRQQIAGETEVAQVMARLN
FSGLLNNGKSDLPIIGEGVEPSHEVKLGSSVQITAGRQLSDEDTFGILLG
KGVAQALQLKPGDPVTLLVNTLDGALNSLDFQVTGIFQSFSNDFDARAVR
IPLAAAQELLGTQGVNALVISLKHTEETDQIAAQLKQSLGPSDLEVKTWV
ELNDFYEKTVEMYKGQFGVLQIIILIMVLLSVANSVNMSIFERVGEFGTM
MALGNDSHQVFRLIISENLLLGLIGGGLGIGIGVLLATAISAIGIPMPPP
PNADLGYIARIQIAPSVLLLALGIGFTATVLAAILPARHVSRIPVVDALR
QNI
>NE2530 hypothetical protein
MDRIGIDTWANISAEGWSSLLLGNGASIAIHKEFAYPTLHGIADAKGLLA
TTAPIFAKLGTTDFEHVLLACWYAEHVNGALGTPSAAISAAYEEVRTALI
EAVHSVHPVHADVAADLQRVGAFASAFPTVVSLNYDITLYWAMLLFNAAN
GSWFKDAFHDGEFQTDWEYLRRPYGHAAGATLVFYPHGSLAVARDYLGDE
TKLSVGAGAAGDLLGTITRRWASGHYVPVFVSEGTSHQKVAAIRRSHYLT
NVYEEVLPALGESLVVYGWSFDERDQHVLAAIAANPPKRMAVSVFTGQPD
GDQQAFCLQVLKAVGRSLPGTEVTFFDSRSPGCWNNP
>NE1339 H-NS histone family
MKNLTYIEIQEEIKKLQKQANEIRAKEIADIIADIKVKIQLYGITEKDLG
FGEKQKKTIFPPLYKKGNRTWSGRGRQPGWIKEHLEAGGNLEELLM
>NE0540 Acriflavin resistance protein
MKLIIWFVKNPVAANLLMILIIVGGIVGLITADRYVYPPEPRHQLQITTI
YDGAGPGEVEQAICIPIEHAVHDLQGVRHLHAEAKQSLCRVLVEFDPAVD
ATRLQAEVQARLEAVTVFPEDADKPVIRELKTGPQAIIATVRGMADLRTL
QHYRDRLHIRLSTHPDIGQILLFPEIPYEVSIEITREDLRRYGFSFDEIA
EMIRAVSGNISAGELKNTDGRLLIRSQAQAMTAEDFAAIRLRSDRQGMQL
KLGDIAHITETVRDQSMLVRSDGLPAMEILIWPRNQLGKTVDAVNQVITA
FHSELPADVEVITWDDWSKYYDENMSMLRGNAISGFILIFVVLALVLDMR
LALWVSAGILISVFGAFWWIPILGISLNVYTITALILILSVVVDDAIIVG
EHIYLLQQRGQSGISGAIGGVRAMAPLVVPMVLTTVIAFTPGLLLPGLTG
HLLYNISAVAILALLFSMTETLLILPAHLAGQSPKDPDAATGIMPHFLRY
AMATLKEQVDNGLRWLTGCYTRALHLALQWRYIVIAGFTAALLLIAALVI
SGRITTVLDRSVDDYYLVGMLKFPAGTAFEEVDRQLFRLGHIAHEIRAEL
NQKYYCDDSAENGDSVRHVLMFSNDNVAVVNLEMAIDDRIRDQVDDIKQQ
WRERFGPLPAGTTLTLQSFWPRDLGVSTEGPPKAIEWVLTAPDTAVQNAA
GEILKHKLAAYAGVHSVTSSMQPGKPELHLELKPAAAFYGLTMRDLSEQV
RHGFFGLEVQRYYHEHDEIRVMLRFPQEDRQALEQLQNMPIRLPGGNSVP
FGTIARAEYQSGFASIQRTDRERIQLIGAEVYQDQVDVESILADMRVHVI
PQLKAQFPGLDIKPGESRQKQEEVMSDLWLYGVLALFGMYALLAIPLRSY
AQPLIIMLTVPFGFIGAVVGHLLFRIPLSLESYFALFAVSGIVINDSLVL
IAQINKNLQQKISVIRAAVAAGKSRFRAILLMNVTTLAGLLPQLSSQGYD
AEKIMPMAVTIAFGMLFTFFVTLFLVPAVCVVFNKNIQSG
>NE1124 atoC; response regulatory protein
MNANQIQKKLLVIEDDPGLQKQLRWSFDDCNVAVASDRESALAQVRRHEP
AVVTMDLGLPPDPDGATEGLATLQQILAVAPDTKVIVLTGNQNHENALKA
INMGAYDFHQKPLDANVLSLIVNRAFYLHRLQRENRELLKSSSSTAIPGL
ITNDPGMVKICQQIERIASSDVTVTLLGESGTGKEVLAKALHQLSDRNSK
RMVAVNCAAIPESLLEAELFGYEKGAFSGAVKQTPGKIELAQGGTFFLDE
IGDLPLSLQAKLLRFLQERVIERIGGRELIPVDVRIVCATHQNLKKMIEA
GTFREDLYYRLCEISIHIPALRDRIGDAVLLAHHFKNQFRVKEKRQSLNF
SQDALDAIGSYSWPGNIREIENCIKRAVIMAEGSQITAADLGLQPAADSF
APVTLREVRDKAEHEMLIKVLARVNGNVARAAELLGVTRPTLYDLMNRHG
LK
>NE2496 Restriction modification system, type I
MIPMKPTPLRQLAIVSAGQAAPKSDEFSDYGTPFVRAGSLDRLLSGEPES
GLELVSEETARRRKLKTYPRGTVLFAKSGMSATKDRVYVLQNPAHVVSHL
ATLIPKSGTHIDYLRLALKHFPPSSLIKDPAYPAIGLGDIEDFKIPTPDS
SDAQIRIAHLLGKVEGLIAQRKQHLQQLDDLLKSVFLEMFGDPVRNEKGW
DKPALTAFGKISTGNTPPRSESVNYDGDFIEWIKTDNITGDAVCVTPSTE
HLSEIGARKARTVTSGALLVACIAGSVESIGRAALTDRTVSFNQQINAIQ
PGKDVNPLYLYGLFKLSRSYIQSHATKGMKKILTKGDFEKITMIKPSFEM
QNRFAVIFEKVESIKSRYKQSLADLETLYGALSQQAFKGELDLSRVPIPA
QLVAPVSPGDQATVPEPVVQTVPAIHLPDTGNLLAALENSEARKALIAEW
LEAYRQQLGDAPFSVQDFMVAVSDRLTEWLQTVVEDEGVTDEQKNRLAEF
YPSNDVGLDVNDYEHIKKWVFEALAAGALTQGGNKVGNRIELKAVQS
>NE0149 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
MQSQWCARQQDMIAVYGALAEQCDYALHLGLTEAGMGSKGIVASTAALAV
LLFKGIGDTIRVSLTPEPGGDRAREVIVAQEILQTMGLRAFVPLVAACPG
CGRTTSTYFQELAESIQIYVREQMVIWREQYEGVENMSLAVMGCVVNGPG
ESKHADIGISLPGSGETPVAPVFVDGQKTVTLKGDNIAGEFRMIVDEYVR
TKYRKKAANA
>NE0538 Outer membrane efflux protein
MPIRLKPAAVLGFVMFVSGCHTAPERQANRLDVSVPSVWSASANTFPLAG
QKWVESFDDPVLYSLVEQALTSNFDLKTAVARVDAAIAQARIDGAALWPQ
LSFAPGYQYTQVRSAGFGSAQYSVFEALFNLSWEADIWGRIRSFREAARQ
EVEATNADFQAARLSLAARVAQNYFALLEAKLQAQVAEQSVADRGTIASL
VQGRFERGLVRALEVRLVLTDLAQARSQLAAARNQVQLVTRQLEILLGDY
PDGSLPQIDDTVTALSGSTIRQALPVPPSQLPAGLPAELLTRRPDLIASF
ARLRAADARLESTSKLLLPRITLTAAGGTRDSALTDLIDPRSVVWNLFAG
AVQPLFTGGRIRGSIQLNEAYVEEAFNRYQDVALNAFREVEQALAAEEWL
RAQEKELREAVQQTEAGQELALYSYRQGFIQILTLLDSYRSTLNAQSAHL
AVRRQLLNNRVNLYLALGGSI
>NE2151 conserved hypothetical protein
MRHPIHPMLVHFPVATLFLATLGDIASLFMDEQVSRVAGVLLVIGTITTL
LAMVAGLMELGKIDQQSPAMKVANQHMMLMMASWSFYAVSLFLRLDGTRL
GQPGMVAVAMSVAGLIVLCIGGWLGGKLVYEYGVGTRSSQP
>NE2513 putative (AF322013) ID483 [Bradyrhizobium japonicum]
MAKPVVIGSRSFRTQSSALDHYKALLHRYQDGQRIADPADHTDLVALIER
FDPVLDAVGEPAKGAGQIAHFERRLNTGIGWSTPGFWVVRQDGTETDFSY
IDAVKGRPKGRSQDFYNACRQAVALDLVLAKKQAFAQYGDDQGRVECELT
GKMVTIDDAHLDHAWPYFSHMVSGFRAARGWSRDIPDGIVSTPADGQTTA
TFLDSAVTEAFRAFHHDQAVLRVLSREANLQTASSARRPKVARPVRLA
>NE2429 hypothetical protein
MDEIIKQSPVLWVLSAVVTGFIAGIAAYIGLLKITNQETIIKGTYEPKKN
LVGRVLKNEVLIECGKLIELAGRIDGATMPDKVEAYMTQTLIFLEGLDLP
KVQQYHQLKMSWPAYTIQLLLVNDKLSSSQKLGRA
>NE2471 conserved hypothetical protein
MRILTFLAIALFAFGLTTLDVEARRMGGGGSVGKQRQSINLNRQQQAAPQ
APQSGKAASPASQAAPAAGGSKWLGPLAGLAAGGLLASLLMGGAFDGINM
MDILVLVGIAAAIFFIIRMMRGSAGGRQTQRPMQYSGAGAGGMGGVPFPN
RDTSAPAGGASSGYDRQSTSTDAPDIPADFDVESFLKQAKRSFIALQLAH
DAGDLEEIRAYTTPDLFAEISKQVAERGNMAQQTDVIFVDAALLDVTNQG
NHAIASVRFTGELRDTPNAQPEPFDEIWHVEKDLAASDSGWLLAGIQQAD
DLKH
>NE1908 possible glutathione S-transferase family protein
MLIYDAHSPAPRCLRMFLLEKQLQLSAVTVDVMTGENRQPAYLAVNPAGQ
TPALRLDDGSTLTEAVAIAEYLEELHPLPALIGNTPEQRAQTRQWWRRVE
LNITEFIHNAYHYAEGLARFEPRIPVLPEAADGLKRVAQDRLRWLDGMFG
TGPYLCGERFTAADIWLYVWLDFGVAVNQPFDRDLPKIGPWFERIAARPS
AELSRVLLKVDGND
>NE1097 TonB-dependent receptor protein
MLFASLVHPIRFPLDPVRLLSLLLPIIVVTALPAQAAENALDTPAAARKH
YDIPAGPLGRALSHFAMVAGIALSFDPALTEGRTSPALTGNFSVDEGFNR
LLAGSGLELIRNDDGSYTLHRAPVKATQAGVTALPVVTVSTSTENADDTG
YVTQRSSSSTKTDTPLIEVPQSISVITRRELDARLVQNISQAVSYTPGVL
TEMYGPVMRDDYFNIRGFDAPQFLDGLGLWGVNYANLRIEPYGMESIEVL
KGPSSTLYGQSSPGGLVAMVSKRPTAVPVREAFITGGSFGRIQGGLDLGG
PIDSNGQFLYRLTGLVRNSDTQINHAHEDRYFIAPSISWRPTSDTTFTLL
SHFQKDDAGNTLQFLPPEGSLLENPNGKIPTSRFIGEPGFDKYNREQYTI
GYAFDHRFNESWGVQQNLRYANVSSNYPTTFYLDFLRDDDGVPLDFRTID
RLAALYQDKAGTFTMDTRVQGIFDTAMLRHNLLLGIDYRYLSGSNRRGFS
EDMLLLDIYDPVYGEPFGLPVIDYISKQQRDQIGLYAQDQIKYDRWILTL
GVRYDFSSAHTRNHDLFFDERSSTRQNDRAFTYRTGINYLFDNGIAPYAS
YAESFQPIAGTDFFGSPFKPTTGRQYEAGIKYQPVNHNALLTVAAYHLTQ
KNVVTPDLTPGHFGYNVQTGEARVRGVEVEGKARLMNDLNLIVAYSFADS
KVTESNNPDELGNRLSLTPRHQTSAWLDYTFRGNQLAGLGLAGGVRHIGS
NFGDLVNNLKAPVYTLFDAAVRYDMKHLHSVLQGASLSVNVNNLFDNKYI
ATCADMACFYGNRRTVYATLRYKW
>NE1239 ALOX5, Lipoxygenase
MNFILLKERHMMNKLPQQEENRRTVENRKNYLLRRQAQYQYAYEYANTIA
VVRKLPCREIPGPGYWLRGGINLLQLIPSLPSLLVTYMRYLLGKPMESYR
DYIFYPFSPPNPALVDNFQQDLIFGLQRVIGVNPVVLRAVTSQHPLPQKL
PESEIQRVFAKYVDETDYATAITQKRVYILDYADLEILQRNPGQIDGGRK
QYVTTPIVVLFLQADGILRPIAIQLYQDAGPDNPIYTPNDGNLWLAAKTF
AQVADGNHHILVTHATRIHYVMEAIIMASRRQLYKSHPLCVLLNPHLRHT
LNVNHQHTFLRDRKGRPGRYGELFAGDYDATTQCMANGMTSFDFRASAFP
NDIASREVDNPDLFYPYRDDGVLLWNAIQHFATEYIDVCYQSDGDVAEDC
EIQAWAHDIGARDRGRIPGFPARFASRQELAETIGHVIFLCTAFHSCIHF
NQYKYPGFVPNMPHSAYAPPPVGKGAEMDADGLLKFQPAFRAAYSQTWTY
FQTNFTVNRIGQYPLRQFDPAARDVIERFRKRLQEIEGRIDQRNSSRPVP
YDRMNPRIIPNGVTV
>NE1543 Heph,sla, Multicopper oxidase type 1
MHTRRNNNPFLYSCLTLFILIFLSLSSSLAQAGKVREYWIAAEKTDWNYA
PTGANQIDLASDLGVWGKTLTYTKYRYIGYTDGSYSHPLPQPEWMGILGP
TIRAAVGDTIKIHFLNKTDMPLSMHPHGVMYDKDNEGADGGKGGSIPPGE
RYTYTWIVDQDAGPGPGDPSSIVWLYHSHVMAEEEVNLGLIGTLIITAAG
KAYSDKNPAPRDVDQEFIALYMIFNEENDEESGLKHAINGRIFGNLSGFE
TRRGQRVRWHLVALGNEVDNHTVHWHGQTVLDHGRRTDVVEIMPASMTSV
DMVPRSPGNWLFHCHVNDHMIAGMATRWLVK
>NE2400 PatB, Aminotransferases class-I
MSVDFDREIDRTGTCSTKYDGRQATFGHPDVMPLWVADMDFAAPAAVTEA
LVARAAHPVYGYTLFPDSLYEALINWLKLRHGWKVEREWIVLCPGVVPSL
STVILALSGPDESVIVQPPVYYPFFSVIKKTGRKLLTSALQLVEGSYRMD
FADLSQQAVSARMLLFCSPHNPVGRVWSRPELERLLDIARQHRLIIVSDE
IHADLIYPGNKHHILASLAGSPESIITAVAPSKTFNIPGLNLSALIVPDP
AYRAAIRDRLEMMHVSAANPFSVVAFEAAYRAGGEWLEALMTYLADTREM
VRQYLAEYLPEIRLILSQGTYLLWLDCRRLDMSDDEIKHFFIFEAGIGMN
PGISFGKEGSGFMRMNIGAPRHRIRQALESIRSALERRRNSIS
>NE0458 Plec1, hypothetical protein
MLFKSHLRSILNISPQSENEAESNVVPLYTGKTASDDTLLTTNDTPAQEE
YDAPADDSNASRLEAENLAIREAKSRIETEARARVTAEARARVEASARLA
AEARIKAETAATEEARARARAEALAAHEAHARQELEARLRQTVEDGLKAE
KETVTALRAKVQAEAATTEKARRRLQEEALALEKSRERELAEQRAIEAAI
ARRRTHEEALKIALAASAAEAEATALARARIEEDEKNIALANAKAEAERQ
AIEEIRLRTEAEADLTSKAQQKLQGEISAREAEQSRLDAEKKAIEAAQSR
RDLDLTAKSEAEARAAAELEAATAQRTRIEAERKARAMAEQVALAEQEAA
NAALERSRADTLLLEKTRAHTLAENEACAAAEARMQAKEQETAIFNEKAQ
TDQAVTDTIKERIQAQETAIRRARARAAAEAVARKTAEDKITAETHAAEL
AEKRIALDRQVEKEANELAETEARLIENKRKQAEAVQQAKSAAAARIEME
QKLTELSTRIAQNQVIAMAKTEERLKAAETAAATVLHKIKLESSALKAIQ
ERIEQDALAVERAIAREAVEAMAVEAALARIRTDEAAIAQASRKIREEIE
TTKMIQDCFDEEIPSDTVMHDKEQGDTHSSDDGETLLAATEERMAAETSD
ALDESDDSANQPESGMLNDPVQSDVSGNSIESNDTEVLPDKSETAEIE
>NE1240 Ptgs1,COX1,Cox-1,Pghs1, putative cyclooxygenase-2
MNEFFFQIIFRLVNRFPWISRVASRITWLRRWISDTFINWQAYATNPRPR
PFSMAAPYTTWQALTDRTFTGRHLPEAEGEQNLPDLKSVVNLWRRKENRE
IPSVDTSILFSFFAQWFTDSFLRTDFFDRRKNTSNHEIDLCQIYGLREDI
THLLRLKKDGKLKYQVIDGEIFPPYLFNVEETTADNWVFADREFENLHPR
AVLEFVFDNVPEERLKRMFATGLEHGNSSIGYTLMNTIMLREHNRICDVL
KEAHPTWDDERLFQTARNIMIVLLIKVVLQDYVSHFTQFGFTLDPTPGMA
ERQRWYRTNWISLEFNLLYRWHSMVPEYYFVGDQRYTLDEFRNNTALVTH
QYGIGTMISAASQQKAGRVGLYNTPQFFFDPLPVGADNRSVMERSVEMGR
QAKLRSFNDYRQAFSMPRLRSFEELTADPALQRELKELYNDRIDDLEWQV
GIFAEDHDEGFSLGRLMVRMVGYDAFTHALTNPLVSGYVHNEKTFSSVGQ
SIIEETSLLADIVKRNVRDSDTVIASFRTSAVA
>NE0448 Rh50, Ammonium transporter family
MSKHLCFTAFSSIALFLLCFSSWASAVAPAEINEARLVAQYNYSINILAM
LLVGFGFLMVFVRRYGFSATTGTYLVVATGLPLYILLRANGIFGHALTPH
SVDAVIYAEFAVATGLIAMGAVLGRLRVFQYALLALFIVPVYLLNEWLVL
DNASGLTEGFQDSAGSIAIHAFGAYFGLGVSIALTTAAQRAQPIESDATS
DRFSMLGSMVLWLFWPSFATAIVPFEQMPQTIVNTLLALCGATLATYFLS
ALFHKGKASIVDMANAALAGGVAIGSVCNIVGPVGAFVIGLLGGAISVVG
FVFIQPMLESKAKTIDTCGVHNLHGLPGLLGGFSAILIVPGIAVAQLTGI
GITLALALIGGVIAGALIKLTGTTKQAYEDSHEFIHLAGPEDEHKAERLV
LEAKTEIQGLKNRIDAAVLSAKSEG
>NE0509 SCO1,SCOD1, SCO1/SenC
MIDEKTDNMKLRISRSYCFQVITTLFLIFVQADIRAATITLSRPVSLQAE
TITHLKQADTTNTNQWKLVVFGFTHCKDVCPMSLANLSMLVKAAVSEQIE
LNGVFVTVDPDRDTEEILSGYTKGFGPGITYLRFEGEELEHFKNAFQVEV
VFYTKNAGNQTHYQVDHSTTAFLIDPTGKIRVIFDALKDAVDVARIFKDN
KGLFKS
>NE0777 TGL2, Esterase/lipase/thioesterase family active site
MSDPNNGAIIFVHGLLGFSSFSIFGKKVHYFRNLRSSLRNSTRQVLFPEL
PATGYIEDRARVLANFLAHISADRIDLIAHSMGGLDCRYLIHHLDPMHRV
RSLTTVATPHHGSPLAKWTIEGSDMCFRLMHSISTPAVNDLTPESCARFN
IEISNRKDVRYCSYASMRCPTDMSFILRSWGNKIAANSGDNDGMVPVASA
QWGEFRDVLQADHFELTGWSFAWPDARKARPFNHLQFYLNLVRELTENHS
>NE1869 aarF, ABC1 family
MRFFRLLKIILIAFRFGLDEILFTQVRLRILKVFSALLPFRSRLQLPRAV
RLRLALETLGPIFIKFGQMLSTRRDLLAQDFAEELALLQDRVPPFPSEQA
VQILETVYGRPVHEVFLEFDIKPVASASVAQVHYAVLHDGTRAAVKILRP
TIAPVIAHDVALMETGAWLLESIWPDGKRLKLREVVAEFARHLGDELDLI
REAANCSQLRRNFLDSPLLLVPEVYWDYCHTEVMVMERVVGTPISHVASL
RTQGIDIPQLARMGVEIFFTQVFRDGYFHADMHPGNIFVGSDGRYIAVDF
GIMGSLSDQDKNYLAQNFLAFFRRDYRRVAQTHIEAGWAPRNTRVDDFES
AIRAVCEPIFDRPLKEIYFGRVLLRLFQASRQFNVEIQPQLVLLQKTLLN
IEGLGRDLDPDLDLWKTAKPFLENWMAEQVGLRGLVTHLQKEATNWAVIL
PQFPRLLHYNLSQERAQNLEDRLAQLVAQEKRQSRLLMLLALLLAGLLLA
QIYL
>NE2219 aat, Leucyl/phenylalanyl-tRNA--protein transferase
MIRTLYSDTPFPPLEQALIEPNGLLAAGGDLSPERLISAYRQGIFPWFNP
GEIILWWSPDPRMVLFPRELKISRSLHKTLKKNDYQIRTDSAFTEVMQAC
AAPREDQAGTWIHEEMIAAYTALHQMGVAHSVETWIEGELAGGLYGVAIG
RAFFGESMFSRATDASKIALVHLARQLENWGYGLIDCQMKTAHLMSMGAR
EIPRSQFSKRLNQLNALPGQNRKWYFDFTYPGRSEQ
>NE0786 aatA, Aminotransferases class-I
MKLSQRVQAIKPSPTLAVTAKAARLKAEGKNIIGLGAGEPDFDTPLHIKD
AAITAIRNGFTKYTAVGGTASLKQAIISKFKRENSLEFMPGEILVSSGGK
QSFFNLVLATIDPGDEVIIPAPYWVSYPDIVLIAEGKPVFIDTGIEEKFK
ISPDQLEKAITPRTRMFVVNSPSNPSGSVYSLEELQALGAVLRKYPDILI
ATDDMYEHILLSGDGFVNILNACPDLKARTVVLNGVSKAYAMTGWRIGYC
GGPAAIITAMENIQSQSTSNPNSIAQVAAEAALNGDQSCMVPMIEAFRER
NQFLTNALNSIAGIHCLLSEGAFYAFVDVRQAISRLNTQQILQNSSDIAF
CNYVLEKAEVAAVPGSAFGCEGYMRLSFATSMDNLQEAVKRIASLLS
>NE2452 abcT3, ABC transporter, fused permease and ATPase domains
MDESHHPAHAPALPSEPFAFLLHFFRHYYGWCVLVVVLEIGSSASSILTP
YAIGQIVGGVTDSLTVREQIFSAVAFPLGLYLLLNLGEVIFSRAGASCRI
VIAPRLRAQVTSELYAYLQHHSHRFLSNNFAGALASRISETSTSVNMTLW
TLVFDFLPIVVTLTVSIILLWYASIPLSLFTLVWALLYLSISYWLARRCR
PLAHKSAEARSITVGKVVDAVTNLASVRLFAMLGFERGYLESFLNNEIAA
SRRWLWSNEKILWFQFLLALTLKVGTLLFALWLWQQNQISVADFVMSVSL
SLLIIGEVRNISRRFIDLFEYIGNIANGVGIIVREHEIVDRPGAQPLHIT
RGCIELRGVTFGYSPDNPVFENLDLRIEGGQRVGLVGYSGSGKSTLLNLI
LRLYDPQQGSILIDGQDIRDVTQQSLHEQISLIPQDPGLFHRTLLENIRY
GRPGATREEIELAARGADAHEFIERMPNQYESLVGERGVKLSGGQRQRIS
IARVIAKNAPILIMDEATSSLDSITEQAIQNSLAALMKNKTVIVVAHRLS
TIAHLDRILVFNRGRIIEDGSHAQLLAVNGAYTQLWSRQADGFLVEETEQ
DD
>NE0734 abcZ, abcZ; ABC transporter ATP-binding protein
MPLLTLDNACLAFGHHALLDHAALQLDPGERIGLIGRNGAGKSSLLRVLA
GEIKLDDGQLWVAPGMNVAYIPQEPTLDESASVFAEVARGLGTLAQTLLD
YHEVSHALGEEGADTKALLDRMQHLQGVLEAQNGWSLHHKVETVINRLEL
PEDAIAGTLSGGARKRVALARALVVSPNVLLLDEPTNHLDFSSIEWLEET
LQNFPGSVIFITHDRRFLDNVATRIIELDRGELKSFAGNFSAYQQKKAEL
MEVESVHNRKFDKVLDQEEVWIRKGIQARRTRNEGRVRRLEALRLERAAR
RERIGNVNFRVDAGQHSGQLVAELEHVTKSFGDKTIIQDFSCRIMRGDRI
GLLGPNGAGKSTLLKLILGELQPDSGMVRLGTRLSVAYFDQLREQLNEDM
TLVDSISQGSEFIEIDGKRRHVISYLEDFLFPPQRARSPVKSLSGGERNR
LLLARLFTRPANVLILDEPTNDLDIETLELLETLLQDYTGTLFLVSHDRA
FIDNVVTQAIVFEGNGHLREYAGGYQEWLQSRSATKAIRKENSDSPVPVA
GLGQIRKDKSSLPAGLSYQETNELAALPGKIDVLEQEQIVVTRKLSDPAL
YKNNHNEAMELQARAAALEKELSLYYTRWEALEHKQTMAESARKKN
>NE0489 abiR, possible abortive infection phage resistance protein
MNINASIIDQRLAGVQDKIKERATEEFGISDGDRLRSLAFVYLCVETMLD
LDVDEAFDCLTDGGGDFGVDALHITEEMDGEFGVTLFQGKYKKNLEGNSN
FEQNSIEAMVNAIRHIFDPSADLGAINDRLRVKVEQARSLIRDGLIPRVR
AIACNNGLKWNTDGQQSIDRAVLGDQVTWEHVNHDVLIGILQSIKPVDET
LRLTGKAMVEDMNYSRVCVGRMPVAEVAALMKNHGEKLLERNIRRYLGLH
GNRVNEGIRATLSSNTPENFYFFNNGITLVCDKFTYNALQQGDFQIKVKN
LQIVNGGQSCMTILKTAEELEKNGQTLPAQASVLIRLYELSSDNDDVVLQ
ITHATNSQNPVDLKDLRANDARQQQLEQSIQNLGYSYRRKRMDATTKATD
ITTGAAAEAVLAIWRKAPHQAKFLTREHFGKLYDTIFSESLNGAQVVIAA
LLYRIAENHRRRPHGDDPLFVRYASCFIAMQMGKRLLKALDIKLNGLDHR
NFAQARQWIEEQGEAVFVVSRQDIDTALKALYGNQEISVQQLSATFRRGD
LIGKLDQVEV
>NE1021 accA, Carboxyl transferase, alpha subunit
MKIVFLDFEKGIEEFEAKIEQLRFAQDNSALDISAEIARLQTKSLGLTKS
VYAKLTPWQISQVARHPQRPYTLDYVQHLFTDFEELHGDRNFADDQAIVG
GLARFNGQTVMIIGHQKGRDTKEKIHRNFGMPKPEGYRKALRLMRLAEKF
SIPLITFIDTPGAYPGIDAEERGQSEAIGKNLYVMAGLRIPIICVIIGEG
GSGGALAIAVGDTSLMLQYSTYSVISPEGCASILWKSADKASDAAEILGI
TADRLKEMGLIDSIIPEPIGGAHRDYPVVMQSVKQTLQESLRKLQDIPLE
TLLQKRLDRLLGYGRFKINQPD
>NE0652 accB1, possible accB1; biotin carboxyl carrier protein of acetyl-CoA carboxylase (bccp)
MDLRKLKKLIELVEEYSITELEVTEGEEKVRISKSITLTQSTATIVPQYH
APAPAGPVTATVSETDAPDKPGLPEGHIVKSPMVGTFYRASAPGAKPFVE
IGQYVKSGETLCIIEAMKLLNEIEADRDGKIKTILPENGQPVEYGEPLFV
IE
>NE0653 accC1, accC1; biotin carboxylase protein
MFEKILIANRGEIALRIQRACRTMGIKTVAVHSEADAEAKYVKLADESVC
IGPAASAQSYLNIPAIISAAEVTDAEAIHPGYGFLSENADFAERVEKSGF
FFIGPRPDTIRLMGDKVSAKNAMRQAGVPCVPGSGGALPESLDEVKKIVK
EIGYPVIIKAAGGGGGRGMRVVHTEAALSNAVITTRNEAQTAFGNPVVYA
EKYLENPRHIEFQVLADEYGNVVHLGERDCSMQRRHQKIIEEAPARGISV
QLRDKIGELCVEACKRINYRGVGTFEFLYENNEFYFIEMNTRLQVEHPVT
EAITGIDLVQAQIRVAAGEKLSLQQKDIVFKGHAIECRINAEDPYKLTPS
AGRITQYHAPGGPGIRVDSHIYHNYRVPPYYDSMIAKVIAYGDDRDLAIA
RMRIALTEMIVEGIKTNIPLHLDLLTDCNFLSGPVSIHHLEGKLALYNKK
ST
>NE0695 accD, Acetyl-CoA carboxylase carboxyl transferase beta subunit
MSWFESIIPPKIKRKENGEKKAVPEGLWSKCASCEAVLYCTDLAKNLNVC
PKCGHHNRISARKRLELLLDSQDRHEIGTDIIPQDPLKFRDSKRYTDRLS
DAHEATGETDSLVAMRGNIKSVPVVAAVFEFGFMGGSMGSVVGERFVRAV
KACVTHHLPFICFSASGGARMQEGLFSLMQMAKTTAALEQLSKERLPFIS
VLTDPTMGGVSASFAFIGDVVIAEPGALIGFAGPRVIEQTVRQTLPEGFQ
RAEFLLAHGAIDLIIDRREIRDRLANLCTLLMRIPVPVLPTAVPADP
>NE0361 aceE, aceE; pyruvate dehydrogenase e1 component oxidoreductase protein
MDPMDIDPQETQEWLDALETVLMNEGTERAHFLLEKLVEKARRSGAYLPY
SANTAYINTIPPGKEERSPGDHALEHRIRSYIRWNAMAMVLRANRHTNVG
GHIASFASAATLYDVGYNHFWRAANEHQGGDLVYVQGHSAPGIYARAFLL
DELTSDQLDNFRQEVGNNGLSSYPHPWLMPDFWQFPTVSMGLGPLMAIYQ
ARFMKYLGARGLAQTEGRKVWAFMGDGEMDEPESLGAISLGAREKLDNLI
FVINCNLQRLDGPVRGNGKIIQEMESTFRGAGWNVIKVIWGSYWDPLLAK
DTKGLLQQRMMECVDGEYQTFKSRDGAYVRQHFFGKYPELLEMVANMSDD
DIWRLNRGGHDPHKVYAAYSTAMKHTGQPTVILAKTIKGYGMGEAGEAQN
ITHQQKKMGTISLKAFRNRFGLPVPDDKIDEIPYLKLEEGSPEYNYMRAR
QQAMGGFIHHRRRKAAALQIPPLSAFETLLKASGEGRESSTTMAFVRILN
VLVKDKNIGKHVVPIVADESRTFGMEGMFRQLGIWSSVGQLYTPEDADQL
MYYKEDRNGQILQEGINEAGAMASWMAAATAYSTHGVQMIPFYIFYSMFG
FQRIGDLAWAAGDMRCRGFLLGGTAGRTTLNGEGLQHEDGHSHLAASTIP
NCVSYDPAFAYELAVIMQDGLRRMYQEQEDVYYYITVMNENYPHPEMPKG
AEQGILQGMYLFRSGGKQRSKSHVQLLGSGTILREVIAAAEILEKDYKVS
ADIWSVTSFNQLRREALAVSRHNLLHPDEPARLSYVETCLKDREGPVIAS
TDYMKIVADQIREFIPGRYFVLGTDGFGRSDTREKLRQFFEVDRHYVVIA
ALKALADEGKIPSSQVTKAIRTFAIDPDKPEPVKI
>NE0360 aceF, aceF; dihydrolipoamide acetyltransferase component of pyruvate dehydrogenase complex (e2) protein
MQVAEVKKVLVPDIGDFEDIPVIEIMVKPGDSVQVEDPLIVLESDKATVE
VPSPYSGIIREIRVQMGSKVSKDSEILTMEVVSAESDNKTTSSQPQPSAG
SQPAQPTRPIETGAGQSEEEPAAKPAATTTKPATPSAPIQIPDHTIDQHN
KIIPHASPSVRRFARELGVDLSKVVGTGPKQRILKEDVQAFVKQALTGGR
NARGGTLDLLPWPHVDFAKFGPIELKSLSRIREISGANLHRNWVMIPHVT
QFDEADVTDLEALRKNHNETRQNNGTKLTILAFLIKAVTAALKKFPEFNA
SLDNSTTESQLIIKRYYHLGFAADTPNGLVVPVIRDADQKGVIGIAEELT
RLSSLAREGKLKPGDMQGASFTISSLGGIGGTGFTPIINAPEVAILGVSR
ASLKPVYQNGQFVPRLVLPLSLSYDHRVIDGASAARFTAHLASILADMRL
ALF
>NE2134 ackA2, Acetate and butyrate kinase:Acetate kinase
MIYRVKSKIAQIDNDTRQLFFRRITMNKTILVLNAGSSSLKFALYQTTHE
SISMICRGKIEIGGNASSRFQVKDSRGELLWQQSLHITHHENAIEVLLQW
LETRTDKRQPIVAGHRVVHGGMLFNRPVLIDEQVITRLLQLVPLAPLHQS
DSLAGISALRRLRPDLPQVACFDTSFHRTMPDEEQIYALPGHLTDQGIRR
YGFHGLSYEYIAHILPNHLGDRATGRIVVAHLGNGASLCALRQGKSIATT
MGFTPLEGLPMGTRCGNLDPAVLLYLMREQQMDHDTLTNLLHRQAGLLGV
SGISADMRTLLASDDPGAKRAVDLFVWQVCRQLGAMVALLEGIDGLVFTA
GIGEHASQIRTRICARLKWLSIELDESANQANAIRISTADSHTPVLVLPT
DEETIIARHTLRLIAA
>NE1002 acnA1, acnA1; aconitate hydratase protein
MAHNLFNSLSTLSLASGKTVQFYSLPALGAAGLGAISRLPVSIRIVLESI
LRNYDEKKITETHVRQLAGWQPRAERTQEIPFVVARIVLQDFTGVPLLVD
LAAMRSAARRLGKNPELIEPLVPVDLVVDHSIQVDYYGDSQALARNMEVE
FQRNQERYQFIKWGMQAFDTFGVVPPGIGIVHQVNLEYLARGVHQKDGLM
YPDTLVGTDSHTTMINSIGVVGWGVGGIEAEAGMLGQPVYFLTPDVVGVN
LTGRLREGVTATDLVLTVTEMLRRANVVGKFVEFFGEGAASLSLPDRATI
ANMAPEYGATMGFFPVDDVTLTYFRHTGRTDEEIDAFERYFKAQELFGMP
HPGEIDYSQELTLDLNTIVPSLAGPKRPQDRIALKDLKYSFTELFSKSVK
ENGYGKPAEVLDDTYSTRDSRHVESKLCIAPDTSQSELTLPPGSARNAVE
MVNNHPTQQTAVPVSLARPIKLHHGDILIAAITSCTNTSNPSVLIAAGLL
AKKAVEKGLSVRKHIKTSLAPGSRVVTEYLGAAGLLPYLEQLGFSLAGYG
CTTCIGNSGPLLEVIEETIVKDDLVCAAVLSGNRNFEARVHPSIRANFLA
SPPLVVAYAIAGTVLKDLTREPLGEGKDGKPVWLQDIWPDSAEIESMMKF
ATQAETFRRLYSDFTRDHPLWNGITAVTGRLYDWPDSTYIAEPPFFEDFS
LEIDRTRTPDAIHGARALAIFGDSVTTDHISPAGAIKESSPAGQYLLAHG
VTRMDFNSYGSRRGNHHVMMRGTFANVRIRNLMIPGSEGGVTLYRGKDDP
DGKQMSIFDAAMRYIEDGTPVIVFAGKEYGTGSSRDWAAKGTQLLGVKAV
VAESYERIHRSNLVGMGVLPLQFKEGDSVSSLGIQGNEKFELPDLGNIQP
QQEVTLVIHGQDGSRREIRLRCRIDTAIEVDYYYHGGILPYVLRKAYA
>NE2321 acpS, acpS: holo-(acyl-carrier-protein) synthase
MIYGIGTDLVDPARIASSLERYGEQFARRVLADSEWPDYLEHIKPALFLA
KRFAAKEAFSKATGTGLRAPVMFGNMAVQHDSQGKPYFEFQQELAEWIGQ
RGITRHHLSISDELTMVSAFVVLEK
>NE0169 acrA,mtcA,lir, HlyD family secretion protein
MYQTLRQSIMQTGRTITDIRTGVLLIAFLAGCSMQSSTDSQPPVPQVYVT
TVTTRTIVEEPEFIGQTESFRPVEIRSQVNGIIKKIFFTEGRNIKKGDKL
YLIDPVPFNAVYKNSRAMVTQARARLDQANKDLARVQPLLKQKAVSKKEV
DDAVVEVRSARAALEAAQNVMIKAKFDLDNTLITAPVEGRINRSQFYEGR
LVEAQTNLLTTIDQLDPMYVNVNIPESYLLRLRRELSEHKLERPESIFQL
SGVMTFSDGSIYPEEGLLDFEDIVIRPETGMLLGRFVFSNPAGKNAPGEA
HLYPGQFVKVRIKGYSRTGAILIPQRAVQQQPSGSFVYVINEGKAELRPV
QASAWQGNEWLIESGLQAGEQVVVEGIHRIHPGVQVNPVPYQQEESQTS
>NE2036 acrD4, Acriflavin resistance protein
MTLPELSIKRHVLAWMVSGLLVLLGVISYQRIGVDRWPYIEFPMISVTTT
LIGANPDIVDASITSIIETAINTIPGIEHVQSSSSPSVSVTTITFSLDKN
IDVAYNEVQANISQVLRRLPDDTDPPVVRKIETNAQPVMWLSLQGDRTLQ
QLNLYGFNVIKKKLETIDGVGEVRLGGRRDRTIRVNILPERMASYNLAAS
DVISAFRREHIQLPGGFLVGRNTERLIKLDLEFHNLKDMSNLIVAYRDGA
PIRLKEIAEIEDELADYRQVARFNGQPSLGVGIVKVSNANTVAIIEEVER
RLKEDIVPSLPPGMRIGVSTSDAIFIKELVNSLQEHLIEGTLLAALIVWL
FLRSVRSTLIIATAIPVSLLGAVAVMYFSGFTFNTMTLLALLLLIGVVVD
DAIVVLESIFRHMSRDQARYGKVDTAAASVRGSHEVVFAVIAATLSLVCI
FAPVIFMDGIIGKFFKSFAVVVTFGVLISLLVSLTLTPMLCSRYLRVDRK
HGRLYHWLDNGFNRMDRIYGSLLEFSIQHRWKVILITTLTVLSSSFFFGK
VEKELSPEADEGVFMITFRTPLGSSIDYTDSRLKLIEGVLTSYPSEVASY
LGMIGAGQDGQVNRGFVNVRLYDRNDRTMTQKMLIRELRERFDKIPGVRA
FPAPVAIVRGQRPEKLQFNLTGPNLQEVGRLAKEMQRRLGEIPGMGKVDL
DLDLDLPQLVVDLDRTRAASLGISATDVAMAINMFTGGVDVARFSDEPGD
GQRYQIRIKGKENRFNQLEDLKKVYLRSTGNELVRLDTVAKFRDNLGAAV
IGRFDLQYAAMFYANPAFSLGNATDMVFQQAAEFMPVGYTVKMTGQAEEL
AKTMTNTLFIFTLALILLYMVLASQFNSFLQPLIVMVAQPLAIIGGIFAL
WLTGNSLNIFSMIGLVLLIGLVAKNSILLIDLTNQLREQGKSVDDALREA
CPVRLRPVLMTSLTVILALLPAAMGLGAGAETNRPLSIAVIGGMISSTLL
TLVVVPAVYSLVMQAVDKLHLSRHGSAD
>NE0907 adhC1, Alcohol dehydrogenase class III and related dehydrogenases
MKTRAAVAWQAGQPLTIEEVDLAGPRTGEVLVEIKATGICHTDYYTLSGA
DPEGLFPAILGHEGAGIVVEVGADVKSLRPGDHVIPLYTPECRECKFCLS
RKTNLCQAIRATQGRGLMPDGTSRFSLDGKAIMHYMGTSTFSNYIVVPEI
ALAKIREDAPFDKACYIGCGVTTGIGAVLFTAKVEVGANVAVFGLGGIGL
NVIQGARMAGADKIIGIDLNPEREALGRQFGMTHFIDPSQVENVVDAVIQ
LTDGGADYSFECIGNTTVMRQALECTHKGWGRSIIIGVAKAGAEISTRPF
QLVTGRKWEGSAFGGARGRSDVPRIVDWYMDGKISIDPLITHILKLEEIN
EGFKLMEAGESIRSVVVF
>NE1933 adk, Adenylate kinase
MPICLTRLLATAFHRLSAAHTTAGCSPSPTILKEANIRIILLGAPGAGKG
TQASFIRECFSIPQISTGDILRNAVKAGTELGMMAKKIMESGGLVPDEII
IDLVRKRIEEPDCANGFLFDGFPRTIPQADALRTANVHIDHVIEIDVPDA
EIIKRLSGRRVHPPSGRIYHIEFNPPAVADRDDVTGEALVLREDDREETV
RKRLQVYHEQTRPLVGYYFEWANSGDPNAPGYIKIAGTGNVEAIQENIIA
ALKQTK
>NE0660 ahcY, S-adenosyl-L-homocysteine hydrolase
MSATINPVVDNSVFTDCKVADLSLADWGRKEIAIAETEMPGLMALREQYA
DKKPLAGARIAGSLHMTIQTAVLIETLVALGAEVRWASCNIFSTQDHAAA
AIAARNIPVFAYKGESLEEYWDYAHQIFEWASDGSHTANMILDDGGDATL
LLILGSKAERDPSVIANPTNEEEQVLFASIRSRLASHPGWYSRNLAAIRG
VTEETTTGVHRLYEMEKKGELPFPAINVNDSVTKSKFDNLYGCRESLVDG
IKRATDVMIAGKIAVVCGYGDVGKGCAQSLRGLGATVWITEIDPICALQA
AMEGYRVVTMDDACDKADIFVTATGNLRVITHDHMLKMKDQSIICNIGHF
DSEIDIASVQKYQWENIKPQVDHVIFPTGRRIIVLAQGRLVNLGCATGHP
SFVMSSSFTNQVLAQIELWQNGKDYQKKVYVLPKRLDEMVARLHLGKLGV
KLTELTDEQAHYLNLDKNGPYKPEMYRY
>NE1930 alaS, Alanyl-tRNA synthetase:DHHA1 domain
MKSSEIRQKFLEFFEARGHVIVPSSPLVPGNDPTLLFTNAGMVQFKDVFL
GQDKRPYVRAVSSQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGDY
FKRKAILFAWEFLTGSLGIPREKLWATVYAEDDEAADIWLNEVGIDPSRL
VRIATSDNFWQMGDTGPCGPCSEIFYDHGPAVAGGPPGSENAEGDRYIEI
WNLVFMQYNRDASGELHPLPKPSVDTGMGLERIAAVMQQVHSNYEIDLFR
SLIEAAARVTGSKDLSNNSLKVIADHIRACAFLITDGVIPGNEGRGYVLR
RILRRAIRHGYRLGQKQPFFYLLVDDLVDVMGPAYPELTRARKHVAGVIR
QEEERFAETLENGMEVLEAALSHGNETLSGEIVFRLYDTFGFPVDLTADI
ARERGIVIDMAGFETCMAQQRDRARASGKFTMQTGLEYTGQPTEFHGYAT
LQHEAQILALYKQGSRVDFIEADDEAVVVLDQTPFYAESGGQAGDSGELL
SGSGTFTVNDTQKIQAGVFGHNGMLSSGRMAIGDRVIARVDQIARTNTAY
NHSATHLLHAALRQVLGNHVTQKGSLVDASRLRFDFSHNSAMQENEIRQV
ENLVNAQIRKNHEVATQLMTYDDAVKQGAMALFGEKYGDTVRVVAMGDFS
TELCGGTHVSYSGDIGFFRIVAESGVAAGIRRIEALTGDAALAYTQQQEQ
QLQQVADALKAAPQEAAQKLSQILDNIRQMEKEIATLRSKLAGVQSTSLI
EQAQEIKGIRVLATTLENVNVKTLRETLDNFKNRLKSCVVVLGTIEKDKV
TLIAGVTDDLTVKLKAGDLINFVAQQVGGKGGGRADMAQAGGTLPEKLPQ
ALASVPSWVEQNL
>NE1054 algI, putative alginate O-acetylation protein
MIFSSWQFILVFLPFSFFVYFWLNYKRLVIAGKVWLVVASLFFYAYWNIK
YLPLILVSIFLNFAIGTGLAQAHEQSLHREQKIRHRINRKVVLATGITAN
LLLLGYYKYTDFLLGNVNLIFGTDLELPQITLPLAISFFTFTQIVYLVDS
YKGETAEYDLLNYSLFVTFFPHLIAGPIVHHRQIMPQFSSQWTLIRRYSN
ILKGLFIFSIGLFKKVVIADSFAIWATAGFDGGQPLDFFTAWATSLSYTF
QLYFDFSGYCDMAIGASLLFNIWLPINFNSPYKALDIQDFWRRWHITLSN
FLRDYLYIPLGGNRCGKYRTYLNIFITFVLGGLWHGATWMFVIWGAMHGC
ALVIHRFWKLLKYPLHSALAWILTFTFVNVAWVFFRAKTLDDAFRVLRGM
VDFSSVLGHTAVVIPVADLAWGGWLSDILLKFMPASFIGQFPIYLAIFAT
FMLIPQKNSVEMVSETIGMRELAFSIVLSVTAMYFMLAATSSVFLYFNF
>NE0656 amiB, Cell wall hydrolase/autolysin
MNRFSTRLLTCFSLLVATLFLPPLSYGNGAVAGTQITAARYWADSAYTRL
AMDVSKPVKYKISTLDAPKRIVMDIENVSFPGALGDLPAKLKSHDPLIKT
LKVSRFTPQTTRLVVELKTDAVPKAFALAPAEQYGHRLVLDIHAKPADKA
RKTEYTDPLMALLQKDTQPAVTPPTRPVAQKATPNLVMAAINRQPAGKRT
FVVAIDAGHGGKDPGAVGPQGTMEKNITLSIAKKLKARIDREPGMRAVLV
RDGDHFISLAGRRIKARQANADLFVSIHADAAPRREAHGASVYALSENGA
TSTTASWLAKKENEVDLLGGVKLDDKDRYLKQTLIDLSMNATIDDSVRLA
SHVLNEIGSVNHLHKRNVEQAGFAVLKSPDIPSILVETAFISNHTEEARL
NSEAYQNKLVDAISAGLKRYYGGKGWKTRVDVADTR
>NE0944 amoA1, Ammonia monooxygenase
MSIFRTEEILKAAKMPPEAVHMSRLIDAVYFPILIILLVGTYHMHFMLLA
GDWDFWMDWKDRQWWPVVTPIVGITYCSAIMYYLWVNYRQPFGATLCVVC
LLIGEWLTRYWGFYWWSHYPINFVTPGIMLPGALMLDFTLYLTRNWLVTA
LVGGGFFGLLFYPGNWPIFGPTHLPIVVEGTLLSMADYMGHLYVRTGTPE
YVRHIEQGSLRTFGGHTTVIAAFFSAFVSMLMFTVWWYLGKVYCTAFFYV
KGKRGRIVHRNDVTAFGEEGFPEGIK
>NE2063 amoA2, Ammonia monooxygenase, subunit A
MSIFRTEEILKAAKMPPEAVHMSRLIDAVYFPILIILLVGTYHMHFMLLA
GDWDFWMDWKDRQWWPVVTPIVGITYCSAIMYYLWVNYRQPFGATLCVVC
LLIGEWLTRYWGFYWWSHYPINFVTPGIMLPGALMLDFTLYLTRNWLVTA
LVGGGFFGLLFYPGNWPIFGPTHLPIVVEGTLLSMADYMGHLYVRTGTPE
YVRHIEQGSLRTFGGHTTVIAAFFSAFVSMLMFTVWWYLGKVYCTAFFYV
KGKRGRIVHRNDVTAFGEEGFPEGIK
>NE0943 amoB1, ammonia monooxygenase, 43 kDa subunit
MGIKNLYKRGVMGLYGVAYAVAALAMTVTLDVSTVAAHGERSQEPFLRMR
TVQWYDIKWGPEVTKVNENAKITGKFHLAEDWPRAAAQPDFSFFNVGSPS
PVFVRLSTKINGHPWFISGPLQIGRDYEFEVNLRARIPGRHHMHAMLNVK
DAGPIAGPGAWMNITGSWDDFTNPLKLLTGETIDSETFNLSNGIFWHVVW
MSIGIFWIGVFTARPMFLPRSRVLLAYGDDLLMDPMDKKITWVLAILTLA
LVWGGYRYTENKHPYTVPIQAGQSKVAALPVAPNPVSIVITDANYDVPGR
ALRVTMEVTNNGDIPVTFGEFTTAGIRFINSTGRKYLDPQYPRELIAVGL
NFDDESAIQPGQTKELKMEAKDALWEIQRLMALLGDPESRFGGLLMSWDA
EGNRHINSIAGPVIPVFTKL
>NE2062 amoB2, AMMONIA MONOOXYGENASE, subunit B
MGIKNLYKRGVMGLYGVAYAVAALAMTVTLDVSTVAAHGERSQEPFLRMR
TVQWYDIKWGPEVTKVNENAKITGKFHLAEDWPRAAAQPDFSFFNVGSPS
PVFVRLSTKINGHPWFISGPLQIGRDYEFEVNLRARIPGRHHMHAMLNVK
DAGPIAGPGAWMNITGSWDDFTNPLKLLTGETIDSETFNLSNGIFWHVVW
MSIGIFWIGVFTARPMFLPRSRVLLAYGDDLLMDPMDKKITWVLAILTLA
LVWGGYRYTENKHPYTVPIQAGQSKVAALPVAPNPVSIVITDANYDVPGR
ALRVTMEVTNNGDIPVTFGEFTTAGIRFINSTGRKYLDPQYPRELIAVGL
NFDDESAIQPGQTKELKMEAKDALWEIQRLMALLGDPESRFGGLLMSWDA
EGNRHINSIAGPVIPVFTKL
>NE0945 amoC1, ammonia monooxygenase subunit C2
MATTLGTSSASSVSSRGYDMSLWYDSKFYKFGMITMLLVAIFWVWYQRYF
AYSHGMDSMEPEFDRVWMGLWRVHMAIMPLFALVTWGWILKTRDTKEQLD
NLDPKLEIKRYFYYMMWLGVYIFGVYWGGSFFTEQDASWHQVIIRDTSFT
PSHVVVFYGSFPMYIVCGVATYLYAMTRLPLFSRGISFPLVMAIAGPLMI
LPNVGLNEWGHAFWFMEELFSAPLHWGFVVLGWAGLFQGGVAAQIITRYS
NLTDVVWNNQSKEILNNRIVA
>NE2064 amoC2, ammonia monooxygenase subunit C
MATTLGTSSASSVSSRGYDMSLWYDSKFYKFGMITMLLVAIFWVWYQRYF
AYSHGMDSMEPEFDRVWMGLWRVHMAIMPLFALVTWGWILKTRDTKEQLD
NLDPKLEIKRYFYYMMWLGVYIFGVYWGGSFFTEQDASWHQVIIRDTSFT
PSHVVMFYGSFPMYIVCGVATYLYAMTRLPLFSRGISFPLVMAIAGPLMI
LPNVGLNEWGHAFWFMEELFSAPLHWGFVVLGWAGLFQGGVAAQIITRYS
NLTDVVWNNQSKEILNNRIVA
>NE1411 amoC3, ammonia monooxygenase 3 subunit C
MATSILKDKTAQQVTDKPAYDKSEWFDAKYYKYGLLPILGIAVFWVWYQR
TFAYSHGMDSMEPDFDRIWMGLWRVQMVVIALAAFSIWGWLLKTRNTAEQ
LASLTPKQEIKRYFYFMMWLGVYIFAVYWGSSFFTEQDASWHQVIIRDTS
FTPSHIPLFYGSFPVYIIMGIAMIIYAKTRLPLYNKGWSFPLIMVVAGPL
MSLPNVGLNEWGHAFWFMEELFSAPLHWGFVILAWAALFQGGLAIQLITR
YSNLVDVEWNKQDRAILDDVVTTP
>NE2182 ampD, N-acetylmuramoyl-L-alanine amidase (family 2)
MSMLIDAEGRLVPARYIPSPNCDERPDRNDISLLVIHAISLPPEEFGSDS
VIELFTNRLDPQAHPYYQGLQDLKVSSHFFIRRNGEIIQFVPCTLRAWHA
GISCWQERPRCNDFSIGIELEGSDNQPFELMQYNRLIELTRAIQAAYPVS
DIVGHADIAPQRKTDPGPYFDWQSYRQALETNR
>NE2191 ampG, putative transport transmembrane protein
MKPGACVGWLHALRIYTHPRVLGMLLLGFSAGLPMLLILGTLSFWLREAG
IDRATIGHLSWVGLAYGFKWAWAPLVDRMPLPLLTRWLGRRRAWLLLSQL
AISMALIGMARTDPAEDLVRMTVCAIIVAFASATQDIALDAYRIEAVALR
LQGAMAATYQAGYRLAMILASAGVLWIAAALDASPGEYSAASWQVAYTIM
ASCMLIGMMTTLIIREPEVPVAQLTVNSGSRGATASARLLAWLDSAVIAP
FRDFIMRYGYHALLILALIAIYRISDVVMGIMSNPFYVDMGYTKDEVATI
SKVYGVIMTILGAAMGGVLVARIGVIRTLFLGAVLSAATNLLFVWLAGRG
HDVGGLVFTISADNLAAGIASSAFVAYLSGLTHAAYSATQYALFSSIMLL
LPKFIAGFSGEFVDAYGYATFFTGTALLGVPVLVLVWKVGRIDFAGSVRN
NSQGE
>NE2032 amyA, Glycosyl hydrolase family 57
MTSSVSFLFGVHAHQPIGNFPEVVIDAHERCYKPFLHTLHRYPEFRFAVH
FSGWLLDFLFEHYPQDMALLREMVTRGQVELFGAGDTEPVLAAIPERDRV
GQLRTFSTKLKKKLGQRPQGAWLTERVWESTVIPALTRCGIRYVIVDDYH
FICAGKRKTELDGFFTTEEDRHSLDLFPISEALRYRLPFSPAQEAIAYIE
SLTDQATTGRQPAAVYFDDIEKFGIWPETYQWVYERGWLEQFIQGVLASP
RIRMQHYRDYHASEKTRGILYLPTTSYIEMNEWTLPAEPAHTYADLVQQA
KAAGWYEPNKSFLRGGIWKNFFSRYPESNWMHKRMLGLSARYARLPASQR
TDAMQQCLYASQANDAYWHGLFGGLYLPHLRRAVYNNLIELEALLDQCVP
RTARYQEDIDLDGTEEIHLQNGIVQIILKLDGGASICELDTYPLKHNFAD
TLTRQTEHYYRKIRLNSGENSTHDQSGIASAHDRVSFKHKIGHADLEPDP
HAQSLFIDRLNGESLHYQLQPSTREDEIIFQTVAGAGLIHKYFRISANRL
LVTYECTNEVAGHMQTEINLAMPSCDGPGGRYVRDRIALSGFGSAIELGD
LTALTLEDDTLGGNLSLKISSAATFHACPHFTVSQSEGGFEKIMQAAKLT
LTWPVTTREQRITLQFDKKR
>NE0924 aniA, Multicopper oxidase type 1
MKYKQLLRGMLASGFLLMGIQAEAKTVQVTLHAVETDVAYDNKGSTYRAW
TFDGKVPGPVVRVTEGDTVEFTLINDKNSKNSHSMDFHAARLDVVEDFES
IKPGETKKYTFTADNPGVFFYHCGSDPMIQHIARGMYGVIIVDPKDANAL
PKADREYVLIQAEHYENPDDKTAMMQNKWSNVVFNGGVFKYDPVHDSEAT
SWLQAKPGERVRIYFVNAGPNELSSLHPIAGIWDRVYPSGNPKNVQYALQ
SYLIGAGDAATLDLISPVEGANAIVDHSMRHAHSGAIAVIMFTNDADPEA
GRGENILIR
>NE1573 apaG, conserved hypothetical protein
MENERKYSIKVEVRTIYLPDQSDPEAERYVFAYTITINNTGSVASQLVSR
HWIITSGDGVTREVRGLGVVGEQPLLKPGETFEYTSGTAISSIAGSMKGS
YQMVAEDGFHFSVEIPEFILSVPRVLH
>NE1153 appB, appB; oligopeptide ABC transporter
MLNYIIRRILYALPILIGVNLITFALFFVVNTPDDMARMHLGTKHVTEEA
IWKWKEEHGYNKPLLYNEAAHGFEKFTDTIFFEKSVSMFAFDFGRADDGR
DIAYEIQSRMWPSLAIALPVFLLGLLTYITFALVMIFFRATYVDFWGVIL
CVMLMSISSLFYIIGGQFLISKLWHLVPISGYGNGLDIGRFLLLPVIIGV
VSSAGANTRWYRTLFLEEINKEYVRTARAKGLAESVVLFRHVLKNAMIPI
LTGTVVVIPLLFLGGLITESFFGIPGLGSYTIDAIQSQDFSVVRAMVFLG
SLLYIAGLVLTDISYTLADPRVRLE
>NE0181 apt, Phosphoribosyl transferase
MSIKSRIRTIPHYPHEGIMFRDITTLLKDPAGLRSTIDGIVQRYQTEKID
KVVGIESRGFIIAAPVAYALSAGFVPVRKQGKLPAETIGCDYQLEYGYDK
VEIHVDAIDKGDRVLLIDDLIATGGTMEAAIKLVQEAGGEVIECCFVIDL
PDIGGSRRLRQQGHRLFSLCSFED
>NE0778 argA, GCN5-related N-acetyltransferase:Aspartokinase superfamily
MKSDTEFVTWFRSVTPHIHASHGKVFVIAFGGEVVENGKFVELVQDFNLL
ASLGIQLVLVHGARPQIESRLRANQQEMSYVQGMRVTDAATLQCVKEAVG
KVRVEIEALLSMGLPNTPMANAAIRVASGNFVTARPVGVLEGVDLQYTGE
VRRINITAILDQLEQGTVVLLSPLGYSPTGEIFNLTLENIAAEVAAALQA
DKLIFLVDTPGIRQQTESGAVLLPELTVRQGKTLLATMETATDQPDEDTR
LYLPWALHACEKGVKRVHLVSRHIDGALLLELFTHSGIGSMITRDPLQII
RQAEIEDIGAILQLIEPLENAGILVRRNRELLEMEIERFTVIEHDNMIIA
CAALYPFPDDKACELACLAVHPEYRKAGIGRILIDHLEDQAREQGYRRLF
ALTTRTTHWFVERGFSETTPDQLPRLKQNFYNYQRRSKVFVKLI
>NE1005 argB, Aspartokinase superfamily:Acetylglutamate kinase
MPSPSAINEKVNILAEALPYIRRFHDKTIVIKYGGNAMIEEALKQGFARD
VVLLKLVGMNPVIIHGGGPQIDHMLKRVGKEGVFIQGMRVTDAETMDIVE
MVLGGLINKEIVNLINRHGGQAVGLTGKDGMFIRAKRMLIQDKEKAGEWI
NIGQVGEIEYIDPSLIALLDTRDFIPVIAPIGVGEDGESYNINADLVAGR
LAETLKAEKLILMTNTPGVLDKNGNLLTGLTAGRVDELFADGTISGGMLP
KIKSALDAVKNGVKSCHIIDGRVQHALLLEILTDEGVGTLIKGNG
>NE1482 argC, argC; N-acetyl-gamma-glutamyl-phosphate reductase
MINAGIVGGTGYTGVELLRILVQHPKVKLKVITSRQEAGTGVDELFSSLR
GQIALKFSDPAKVDFSKCDVVFFATPNGIAMQQAKALLDSGIKVIDLAAD
FRIKDVAEWEKWYGMTHAAPELVAEAVYGLPEVNREKIRDARLIANPGCY
PTAVQLGFIPLIEAGAVDADHLIADTKSGVSGAGRKAEIHTLYAEASDNF
KSYAVPGHRHLPEIRQGLSERSNGPIGLTFVPHLTPMIRGIHATLYARLT
RDVDLQTLYENRYANEPFVDVLPAGSHPETRSVRGSNFCRIAVHRPGNGD
TAVILSVTDNLVKGAAGQAVQNMNIMYGLPETTGIRHVPLLP
>NE1439 argD, argD; acetylornithine aminotransferase
MSHVMNTYARLPVTFVKGEGVWLWDDQGNRYLDALSGIAVCGVGHCHPVL
VKALCEQVSTLIHTSNVYHIQHQERLADRLTSLSGLEKAFFCNSGAEANE
AAIKLARLYGHNQGINLPTIIVMERSFHGRTMATLTATGNRKTQAGFEPL
LTGFVRVPYDDLEAVNKVAANNREIVAILLETYQGEGGVNFPQANYLQGL
RRICDQNGWLLMLDEVQCGLGRTGKWFAFQHSEVMPDAMTLAKGLGSGVP
IGACLAGGKAAEVFKPGNHASTFGGNPLACRAALTTLDIIEQEGLMDNAV
TIGNFMWEEFGRRLQAWQDVLKIRGQGMMIGIELPVPCSELVPEALKRRV
LVNVTSEKVVRLLPALNMQKAEAEQVVTEVSALITWFLESRVK
>NE1438 argF, argF; ornithine carbamoyltransferase protein
MQIRHFLQFKDFDRQEYEYLFDRTRWIKNEFKQYRRYWPLTDRTLAMIFE
KHSTRTRLSFEAGMHQLGGSAIYLSTRDTQLGRGEPVEDAARVISRMVDI
ITLRTHEHGVIERFAENSRVPVINGLTDEYHPCQILADIFTFMEHRGSIA
GKTVAWIGDSNNVCNTWLQAAEVFDFNVHVSTPPGYEVEPERAGLYGTDH
YEQFVSPHDAVKDADLVTTDVWTSMGFEDEADTRKNDFADFRVDAEMMAC
AKEEALFMHCLPAHRGEEVDAEVIDGPQSVVWDEAENRLHTQKALMEYLL
LGRIGVR
>NE1437 argG, Argininosuccinate synthase
MDKVKKAVLAFSGGLDTSVILKWLQDTYQCEVVTFTADIGQGEEIEPARA
KAVQFGIREIFIEDLREEFVRDYVFPMFRANTIYEGEYLLGTSIARPLIA
KRQVEIAQQTGADAVSHGATGKGNDQVRFELGYYALQPDIRVIAPWREWD
LTSREKLLTYAEKQGIPIEMKQKAGSPYSMDANLLHISYEGRALEDPAAE
PEESMWRWTVSPETAPSEPEYLDLEYERGDIVALNGERLSPAAILTRLNQ
LGGKHGIGRLDLVENRYVGMKSRGCYETPGGTIMLRAHRAIESITLDREV
AHLKDDLMPRYAALIYNGYWWSPERKLLQVLIDESQVNVNGRVRVKLYKG
NVMVVGRDSRTDSLFDPDIATFEEDGGAYHQADAAGFIKLNALRMRIAKA
LRRC
>NE1854 argH, argH: argininosuccinate lyase
MEKDKKTWSGRFSEPVAQLVQRYTASIGFDYRLAEYDIQGSLAHARMLAA
TGIIQPADLAAIEQGLAQIREEISKGEFEWQLEQEDVHLNIERRLTALTG
DAGKKLHTARSRNDQVATDIRLYLRTAIDEIIDLIHTLQYVLLDLAEQQA
ATIMPGFTHLQVAQPVSFGHHLLAYHEMLQRDGQRLQDCRKRVNQLPLGA
AALAGTSYPVDRAMVAYELGFDDICHNSLDAVSDRDFAIEFCACAALIMM
HLSRLSEELILWMNPAFGFIRLADRFCTGSSIMPQKKNPDVPELVRGKTG
RINGHLVALLTLMKSQPLAYNKDNQEDKEPLFDTVDTLKDTLTIYADMLA
GLHVNPEAMRQAALRGYATATDLADYLVKKGIPFRDAHEAVAQAVRFAES
KACDLSELSLADLRQFSEVIEQDVFEVLTLEGSLQSRNHPGGTAPEQVRE
AICRARSQLPG
>NE2213 argJ, ArgJ family
MPVNIPPLLPEQLLPIPGLSLGTAEASIKRPGRKDILVITLAENTRVAGV
FTRNRFCAAPVTVARSHLTGSLPIRALVINTGNANAGTGQSGIDHAHATC
ASLARLIGCQTQQVLPFSTGVIMEPLPVEKIITHLPQALANLAPDNWFAA
AQAIMTTDIVPKGVSRQIQINGTTVTITGIAKGSGMIHPNMATMLGYIAT
DAAVTQPLLDDLVRYATDRSFNCVTVDGDTSTNDALILMATGQAGNTPIT
VSTDPAFISLQAAITEVAALLAQMIVRDGEGATKFITVQVESGKTREECT
KVAYAIAHSPLIKTACFASDPNLGRILAAIGYAGIEDLDVNLVQLYLGNI
LVAEHGGRAASYREEDGQRIMQAPEITIQVKLNRGNASTTVWTCDLSYDY
VKINADYRS
>NE0364 argS, Arginyl-tRNA synthetase
MVTTTLPDFKSHCIQLLDQAARQVLPDEVGVQIELLRPKLADHGDYSSNL
AMKLARRLRRNPLELAKALIGALPDSSCVEKADVAGGGFINFFLKKTAKQ
QFLHAVLQAGDSFGHSRLGAGKTIQIEFVSANPTGPLHVGHGRGAAFGAS
LANIMTAAGYAVTREFYVNDAGRQMDILTLSTWLRYLDLCGLSFSFPANA
YRGQYVADMASEIYQAQGDRYAHRSDATIRQLTEISTSTTIDSEDERLDR
LITAAKSILDQDYADLHNFVLTEQLADCRNDLMEFGVEFETWFSEQSLFD
SGMVARAVQLLDDKKLLYRQDGALWFRSTDFGDEKDRVVQRENGLYTYFA
SDIAYHLSKYERGFDYLLNIWGADHHGYIPRVKGAIEALSLDPGRLEIAL
VQFAVLYRDGKKVSMSTRSGEFVTLRQLRQEVGNDAARFFYVLRKSDQHL
DFDLDLAKSQSNDNPVYYVQYAHARICSVLGQWGGAEDILARAETELLTD
PAELVLLQKMIDFTDTIEAAAKERAPHLIAFFLRELAGEFHSYYNSTRFL
VEDESLKITRLALISAVRQILSKGLTLLGVTAPREM
>NE1964 aroA, EPSP synthase (3-phosphoshikimate 1-carboxyvinyltransferase)
MQWLDLPHVQRAQGNVRLPGSKSISNRILLLSALAEGTTMVSNLLESDDT
GRMLDALRLLGVAIVRTDDGKYRVAGCKGKFPVREAELFLGNAGTAFRPL
TAVLALMQGHYRLSGVPRMHERPIGDLVDALRQIGAVITCLEHEGFPPLE
IHPAVIRPGNISIKGNISSQFLSGLLMALPLTGEPVTIVVSGTLISQPYV
ALTIAQMARFGVQVKQESWQRFMLPENQTYRSPGKIAVEGDASSASYFLA
AGAIAGGPVRIEGAGSDSCQGDIRFVEALEAMGARISMGSDWIESGAPDG
GALKAIDFDCNHIPDAAMTLATMALFARGTTTLRNIASWRVKETDRIAAM
SAELRKLGARVEAGDDFLRITPPDGPLTADAVIDTYDDHRMAMCFSLVSL
SVPVRINDPGCVAKTFPDYFEKFAAITHTPF
>NE1981 aroB, 3-dehydroquinate synthase
MNAIESIEVALDTLPENRSYSIHIGQGLLSRMDLLLPHLPGKKAAIVTNT
TIAPLYLEKLRSALAEHHVETFAITLPDGERYKHWETLNLIFDALLEHRC
ERRTPLIALGGGVIGDLTGFAAATYLRGVPFIQIPTTLLAQVDSSVGGKT
GINHPLGKNMIGAFYQPQLVLTDSATLTTLPDRELRAGIAEIIKYGLIYD
ADFFDWLEQHMNSLLARDPAAVNYAIRRSCEIKAEIVSLDERESGLRALL
NLGHTFGHAIENAMGYGAWLHGEAVAAGTLMAADLSRRLQRITSQEVDRI
RYLFENTGLPVKGPRISPERYLESMQLDKKVKEGAIRFILLDSIGKASPG
DTVPTPLLLETLSACVADA
>NE1877 aroC, Chorismate synthase
MSGNSIGKLFSVTSFGESHGPAIGCIVDGCPPGLTLSVEDIQQELDRRKP
GTSRHVTQRREADRVEILSGVFENTTTGTPIALLIRNEDQRSKDYSKIMD
VFRPGHADYTYWQKYGIRDYRGGGRSSARETAVRVAAGAIARKWLQQRYG
IVIRGYMAQLGSIAIPFKSWDVVNQNPFFVADNDYVQKLETFMDSLRKSG
NSAGARINVVAEGVPVGWGEPVYDRLDADIAYAMMSINAVKGVEIGAGFN
SITQKGTEHSDEITPEGFLSNNAGGILGGISSGQPVVVSVAIKPTSSIRL
ARRSIDKAGNPVLVETHGRHDPCVGIRATPIVEAMLAIVLMDHALRHRAQ
NADVVCSTPRIPGSTTNQIHPVEMQASAPRAEDPEPDESS
>NE1627 aroE1, Shikimate / quinate 5-dehydrogenase
MDTYAVIGNPVAHSKSPFIHARFAQQTGRIIHYTALLAPLDRFEQTVLDF
RKTGGKGMNITVPFKFEAFTLASRLTDRASAARAVNTFRFEETGEILGDN
TDGVGLIRDIEVNLNFPLAGKRILLMGAGGAASGVILPLLQQQPDLLAIA
NRTPDKAVSLQRQFASYSNITTGHYHDFAGQHFDLIINATSASLHNELPP
VPADLFRNAFAYDMLYSSRLTPFLELARVQGAGYLADGAGMLVEQAAESF
LLWHGIRPETQTVIRQLRDNLRHPTS
>NE1980 aroK, Shikimate kinase
MHFRYNYRMQRSKTPNTKNSDTGSIPGNIILIGMMGSGKTTVGKLLANLV
GKTFIDIDHEIQRRTGVGIPVIFEIEGEAGFRKRESEVLRDIVRQQNIVL
ATGGGAILHPDNRALLRQHGTVVYLCAPVTELRRRTYLDKNRPLLQTGNV
HAKLIELFTQRDPLYRETAHIIMDSGRQSARAFVQKLIQKLRQSNQEFTA
AGSPPCVKPSE
>NE0651 aroQ1, Dehydroquinase class II
MVDMAANILVIHGPNLNLLGRREPAVYGQTTLEDINRNLTVKAQAAPVAL
SIFQSNAEHELIDRVQGAMSDGTDFIIINPAALTHTSIALRDALAATSLP
FVEIHLSNVYARESFRRTSYFSDIAVGVISGLGAAGYELALQFALTR
>NE0689 asd, Semialdehyde dehydrogenase
MKQVGFVGWRGMVGSVLMQRMREENDFSLIESVFFSTSQVGEKGPDIGRE
AGLLKDAYDIAALREMDIIISCQGGDYTGDVFKKLRESGWQGYWIDAAST
LRMSDDAIIILDPVNRSVIDDALNKGVKNYIGGNCTVSLMLMAVGGLFDQ
DMVEWASIMTYQAASGAGAQNMRELLQQMGETHDVAKDLLEDPASNILDI
DRKVAEKLRDKEFPAQNFGVPLAGSLIPWIDREVAHGQSREEWKGQAETN
KILGHGDDPVPVDGICVRVGAMRCHSQALTIKLKKDIPLEKIEEIIAASN
KWVHVIPNEREATTRELTPVAVTGKLDIHVGRIRKLNMGSEYLSVFTVGD
QLLWGAAEPLRRMLRILVEDKAA
>NE1127 asnB,asn, Asparagine synthase
MSGICGWLGADHLAIENQQIIDRMTGSLIRFDSNPVQTAVQGNAALSVAA
KTSHSHLYEKEGLLVGIWGRIKSRNTDLQTAIDARGMAAALADQWKQKGR
QAFSELAGEFVCCIVDSHAREVALAIDPLGTHVLYYQSLGDDGILFGTSA
DALLTHPRANTEINPQGLLNYLYFHMIPGPDTIYLAQKRLLPGEYLHFHD
GHTEIGRYTQIRFDEDIKRPFPELQQEFLSLLRKSVSEAVQGQKAGTFLS
GGTDSSTLAGILTEVTGEPAKTYSIGFEATGYDEMHYARIAARHFATDHH
EYYVTPDDVVETIPLIASVFDQPFGNASALPAYYCGRMARNDGLDLLLGG
DGGDELFGGNVRYAKQYIFSLYDKIPAPLRNYLLSPFILSLPENKGPKLL
RKARSYVAQATVPMPHRMETYNLLAHYGFETVFSPELLAKIDTAQPALLI
ERTYQDVSHARSLINRMLAFDWKFTLADNDFPKVVQACHLAGMEVAFPFV
NDEILAFSLALPPEQKLKGTQLRYFFKQALRGFLPDEIITKQKQGFGLPF
GVWLQTHRPLREIAADSLSDLKSRNIIRTDFIDQLLDQHLHEHASYHGTM
VWLLMMLEYWFKQHHPIKV
>NE0204 atpA, FoF1-type ATP synthase alpha subunit
MQLNPSEISELIKSKIEGLSVTSEFRTQGTIVSLTDGIVRVHGLSDVMQG
EMLEFPGGTFGLALNLERDSVGAVILGAYEHLKEGDIVKCTSRILEVPVG
EALLGRVVNALGQPIDGKGPIAAEGTEPIEKIAPGVVWRKSVDQPVQTGL
KSIDSMVPVGRGQRELIIGDRQTGKTAVAIDAIINQKGEDVICIYVAIGQ
KASSIANVVRKLEEVGAMAYTIVVVASASESAAMQYIAPYSGCTMGEYFR
DKGQDALIVYDDLTKQAWAYRQISLLLRRPPGREAYPGDVFYLHSRLLER
AARVNADYVEKATGGKVKGKTGSLTALPIIETQAGDVTAFVPTNVISITD
GQIFLESDLFNAGIRPAINAGISVSRVGGAAQTKVIKKLGGGIRLALAQY
RELAAFAQFASDLDEATRKQLERGKMATELMKQSQYATLKVSEMALTLFA
LNKGYFDDVDIKRALAFESALKSHIRSHHGAILDKIEASKELDAETEKAL
EAAIQEFKQNGTY
>NE0200 atpB, ATP synthase A subunit
MSSEAELNPTTYIQHHLTNHTISVGDGAFWVLHTDTLFMSVLLGVVSLGL
IWMVVRKATSGVPSKTQAFVELLIEFIDDQVKTTFHGNRHAFVAPAALTI
FVWVLLLNAMDFLPIDIMAWIYEHIFGLHNWRSVPTADVNTTFALALSIW
ILTIYFAIKVKGFGGWVTELVCTPFGKNPLLWPFNLLLNVIEYISKPLSH
SLRLFGNMYAGEIIFMLLGMWAATGVTGTFFGAILGAGWAIFHILIVVLQ
AFIFMMLAVVYLSMAHESH
>NE0207 atpC, ATP synthase, Delta/Epsilon chain
MGTIFHLDIVSAEESIYSGPAEFIVAPAVMGEVGIFPQHTPMLTRIKSGV
VRVKAPLQDDEEVYVSGGMLEVQPDVVTILADTAVRGKDLDEAKALEAKR
KAEEIMKNKISEIEYARAQAELIEATAQLAAIQKLRKRGH
>NE0206 atpD, FoF1-type ATP synthase beta subunit
MSHQGKIVQCIGAVIDVEFTPGEIPKVYDALVMEGSELTLEVQQQLGDGV
VRTIALGSSDGLRRGMMVTNTQKQISVPVGTKTLGRIMDVLGRPIDEMGE
IGANSLMPIHRAAPAFDELSASTELLETGIKVIDLVCPFAKGGKIGLFGG
AGVGKTVNMMELIRNIAIEHSGYSVFAGVGERTREGNDFYHEMKDSNVLD
KVALVYGQMNEPPGNRLRVALTGLTMAEAFRDEGRDVLFFVDNIYRYTLA
GTEVSALLGRMPSAVGYQPTLAEEMGRLQERITSSKTGSITSIQAVYVPA
DDLTDPSPATTFGHLDATVVLSRDIASLGIYPAVDPLDSTSRQLDPLIIG
EDHYNTAREVQQTLQRYKELRDIIAILGMDELSPEDKLSVSRARKIQRFL
SQPFFVAEVFTGSPGKYVPLKETIKGFKGIVSGEYDDIPEQAFYMVGGIE
EVLEKAKSIQ
>NE0201 atpE, ATP synthase subunit C
MENAQFLAMIQAYTGIGIGLMIGLGAAGACIGVGVMCGRFLEGAARQPEM
IPTLQGKVFLLLGLTDASFIIAVGLAMLFAFGNPLLAVIQ
>NE0202 atpF, ATP synthase B/B' CF(0)
MNINFTLISQAIAFSLFILFTARFVWPYLLRAIEERQQKIADGLAAGERG
KKELELASQRSSEVLKEAKQRASEIVIQAEKRASDIIEEAKQNARIEGEK
IIAGAKAEIQHETFSARESLRQQVAGLAVQGASKILRREVNAKVHADLLA
SIEAELK
>NE0205 atpG, ATP synthase gamma subunit
MPSSREIRNKIKSVKNTQKITRAMEMVAASKMRKAQDRMKKARPYGEKIR
NVAAHMSNASVEYRHPFLISRDSVKRVGIIVVTSDKGLCGGLNTNVLRRA
LNEIRTWETEGNHVDACCIGNKGLGFMSRLGTQVISQVTGLGDAPNMERL
IGAVKVVLDAYTEGQLDRVYIFYNRFINTMKQMPVMEQLLPLTDDRISSE
DGEARPTRAPWDYIYEPEAKPVIDDIMVRYIEALVYQAVAENMASEQSAR
MVAMKAASDNAGNLIDELTLIYNKSRQAAITKELSEIVGGAAAV
>NE0203 atpH, ATP synthase, delta (OSCP) subunit
MAEAITIARPYAEAVFKLARESGSLFSWSETLDAVNSIVRESQIRELISN
PLISSVKLREIIFSVCGKKLNEDGKRLVSLLIDNQRLLVMPQIHELFEQL
KAQHESILEAEVVSAFPLDSGQLEKLVSILEAKFQRKVKAEVSVDSELIG
GVRIKIGDQVVDSSVHGKLEAMATALKS
>NE2574 attINeu, qacE-like protein; integron orf
MSEQIFFEHDGVRVSSARFVVKGATYPISAITSVRAVRSKTFPLLAIVLI
LIGFGILLGGEPTLLIFGLATIALGVVWIIKKKELYSVVLQTSSGESQVL
ESQDRQYIHSVVDALNNSIVQRG
>NE0039 bacA, Bacitracin resistance protein BacA
MDWLILLKALLLGIVEGLTEFLPISSTGHLILAGDLLNFNDDKAKVFTVA
IQLGAILAVCWEYRERLVNIIRNLGTRQANRFVINLFIAFLPAAILGLLF
IKTIKHYLFHPMPVAIALVTGGILILWAERREHRIEAETVDDMSWKQALQ
VGCAQCLALIPGTSRSGATIIGGLLFGLSRKAAAEFSFFLAIPVMFAATF
YDVYKHREFLYIDDLGMFATGSVAAFISALIAIRGFIRYISHHDFTLFAW
YRIGFGLIVLLTAYSGLVDWSVD
>NE0515 baeS, Sensory transduction histidine kinases
MSIRIKLFLAFLLTTFVVVMGMHFFTRWSLEKGFTEFIEKRQQERLDKII
DVLEEHYAEHRGWGELAGNKQHWISLLWHAGPHRHRPPKWIIEQAQHEPD
RQWPPAVTPGTAERKWSPFELRVILLDADRSILFGRKELISQLSLHPVQY
EHRTVGFLGLLPGNPVSQASDIYFMEQQADFFFWIALTMVVLSALIAWLM
AYHLGRPLKQIAAVVQRLATGDYKARLPVISKDEMGQLALNFNDTAAALE
QAEQSRRRWVADISHELRTPLAVLRGELEAILDGVHPLTPAAIESLSGDV
LRLNRLTEDLYQLALSDQGALSYRKVLLDPVPLLREDLATFTTDFNHKQI
SVRWINRLFKPVLVNADPDRLSQLFRNLLTNSLNHTDARGQLEITVQRLA
DKLVIELADSSPGVSDQDIAHLFDRFYRVDHSRNRHLGGAGLGLAICSNI
VRAHEGTLMASHSALGGLAIHIELPITP
>NE0772 bcp, bacterioferritin comigratory protein
MSDDLIPDFELASTGNKQFRLSDVKSRYLVIFFYPKDDTPGCTSESQQFR
DLYPEFVQIGCEIVGISRDNLKSHDKFRGKYDLPYDLLSDSDEVVCELFG
VMKMKNMYGKQVRGIERSTFVLDRDGKVYKAWRGVKVPGHAEEVLAFIKS
VTA
>NE0863 bfr, Bacterioferritin
MKGNADIIRWLNRQLQHELTAINQYFLHARMYKNWGFNKLGEHEYHESIE
EMKHADQLIERILFLEGLPNLQELGKLLIGENVQECVSCDLELERTSRET
LISAIAFCETAQDFVSREIFKEILEDTEEHIDWLETQLDMVKNIGTQNWL
QSQI
>NE0026 bioA, Aminotransferase class-III pyridoxal-phosphate
MNDKKSAMRSHRVVWHPCTQMKHHEEFPPVLITRGEGVWLYDADGQRYLD
TISSWWVNLFGHCNPRINAAIIDQLGRLEHTMLAGLTHEPVVELSERLST
ITPGSLSHCFYASDGASANEIALKMSFHYWQQSGFPEKTQFINLQNGYHG
ETLGALSVTDVPIFRQVYAPLLRPAHPVLTPDWRFAEAGETPEAHALKAA
AALESYLTQHHASTAALILEPLVQGAAGMGMYHPVYLQKAREICDTYRIH
LITDEIAVGFGRTGTLFACEQAAITPDIMCLAKGLSGGYLPLSVVLTNDT
IYRAFYSDQTSQAFLHSHSHSGNALACRAALATLDIIEQDNVIETNRHKT
KFLNQCLQSVRTHPNVRHFRNCGMIWAFEVDNPRPDFTNRFAQMGLERQL
LIRPMGNSVYIMPPYIINEEEMEFMAQAILNILNEI
>NE2300 bioB, Biotin synthase
MIMESATSLISEKQCECAHPNSDSAVQGSSLRWSIAAIESLLDLPFSDLI
FQAQTVHRQYHDANAVQLSTLISVKTGGCSEDCAYCPQAARYHTGVENQA
ILSREEVVAAATQAKESGATRFCMGAAWRGPKQRDIEYMTEVISAVKALG
METCATLGILKPGQATQLKQAGLDYYNHNLDTAPEFYGEIITTREYQDRL
DTLEEVRGANINVCCGGIVGLGESRMARAGLIAQLANLDPYPESVPINYL
VQVEGTPLYGTPELDPFEFVRTIAAARITMPKAMVRLSAGRRQMPEAIQA
LCFLAGANSIFYGDKLLTTGNPDTEKDLALFEKLGLHAL
>NE2297 bioC, SAM (and some other nucleotide) binding motif
MHYDHVLDKRMLRRSFEQAAAGYDQSAVLQREICDRMLSRLEYIKYVPAR
ILDAGSGTGYGTRKLIERYPAAEIMPMDIALTMHRCARMAISEQIPGWQR
WLPFRRHWPRDYICADIEQLPLGEASIGMIWSNLAIQWCNDLRQTFAEAY
RVLENGGLLMFSTFGPDTLKELRQAFKSADSFSHVNRFTDMHDIGDMLVN
CGFSLPVMDMEYITLTYEDVRGVMQDLKAIGARNVTQGRRRGLTGKAAWQ
QVIERYEALRQDGRLPATYEVVYGHAWKPESRQVQLKPETRRKLGLEP
>NE2296 bioD, Cobyrinic acid a,c-diamide synthase:Dethiobiotin synthase
MAPAGLFVTGTDTGIGKTVVSCALLHAFSAEGHTVIGMKPVAAGCENGQW
QDVENLKKASNVSMPQSLINPYAFVEPVAPHIAARQAGIVIDLRVIRQAC
EQLQAAAEVTIVEGIGGFRVPLSSSGTPNGRYDTSDLARVLGLPVILVVG
MRLGCLNHALLTARAVESAGLKLAGWVANTIDPEMLQPAANLQTLTEWLD
CPLLGVLPFQEQLDVQQLAGLIDLSRLE
>NE2299 bioF, Aminotransferases class-I
MLADLSEALRERQQEGLYRSRPVLEGPQSPHVTIDGRDFLAFCSNDYLGL
ANHPALIEAAAEGARCYGVGSGASHLISGHFRAHHELEEALAAFVGLPRT
LLFSTGYMANMAVVTALAGRGDAIFADRLNHASLNDAALLSRARFIRYPH
LDLDTLARQLETTKARRRLVVTDAVFSMDGDMAPVAELLTLCQRFDAWLL
LDDAHGFGVLGERGKGSLYHSQRIERDTPYLIYMATLGKAAGVSGAFVAA
QAPVVETLIQHGRTYGYTTAAPPLLAHTLLTSLQLISQESWRRERLALLI
ERLRQRLHSLPWPLLLSETPIQPLLVGESQEAVRLDLALRERGIWVPAIR
PPTVPQGMARLRISLSAVHTEADVDRLGAALRDLAQC
>NE2298 bioH, possible BioH, catalyzes some early step in biotin biosynthesis
MIWPSVDMASIHIETTGNGPDLVMLHGWAMHSGVWDGVVESLSQRFRLHQ
VDLPGHGASRDCALDSLDQMTEVIADRLPGRYSVCGWSLGGQVAIRLALQ
APERVQQLVLVASTPCFVRRADWPWGMEDSTLTLFMENLARDYTQTLNRF
LTLQVSGSEDQARVLAWLRKSILRGQPPTPATLQAGLKILQTSDLRAELN
QVSQPVLLIHGRNDVITPAGAADWMQQHLPRARLVLFPHCGHAPFLSFPE
QFVSCFDAL
>NE2460 birA, birA_ligase: biotin--acetyl-CoA-carboxylase ligase
MRQLTFAILRMMSDGNYHSGTTLGQALKVSRSSISNTLRDLESYGLTIHK
IPGRGYRWLNPVQWLDSEQIHQHLAEYADSIQIEIVEAVESTNSLLLQRA
VEQGISANRIKQVLVTELQTQGRGRRGRSWSSGLGDSLTFSVLWPSQCPV
NTLSGLSLAVGVAIVRALTLLGIRDIALKWPNDVLSGSHAKLAGVLIELH
GDMLSLGTVVIGVGLNLRLSGVTKARIDQKVTDITSITGQMPDRNQLLAV
LLKELARVLDTFERHGFEPFIEEWTRYHAYQDKMVQVSFAEGSVRNGVAI
SVAADGALLVDTSTGVVQLRSGEISLRSLAEHSSPFN
>NE2459 birA, putative BirA bifunctional protein
MNTSSLLLAVDSGNTAIKWGLHDGRQWLAHGKTLQSERRMLKQNWTALPA
PASILIANVAGLQAADDLKALLAPWHVHLEWATASARQCGVISRYSNPAQ
LGCDRWVALIAAWHRLQQACLVVDVGTAMTVDALSASGEFLGGVIVPGPD
AMRQALADRAGIFSMPLSGSFQNFPINTGNALYSGMIQALTGAVERMYDQ
LSEYTGKQMVVETILTGGGAALLAPHIHIPHQIVDDLVLEGLVIIAGAQA
EIPATR
>NE2262 bktB, Thiolase
MRNVVVLSGARTAIGDYGGALKDLAPATLAAITVREAITRASVDSDKIGH
LVAGHVIPTEAQDMYLARVAALEAGLPQSVAALTLNRVCGSGLQAIISAA
QAVMLGDTEVAIAAGAESMSRGGYLLPALRWGQRMQDSTAVDMMVGALTD
PFDRVHMGITAENIAERWSITREQQDRFALESHRRAIHAVHNGYFREQIV
PVEVSTREGVAMFDTDQHPRDNISMETLGKLRPVFREDGTVTAGNASGIN
DGAAALVLMEERAAERNHHQPMARLVAYGHAGVNPRYMGIGPVPAVKQAL
NRAGLDLAEMDVIESNEAFAVQACAVARELNFDPDKVNPNGGAVALGHPI
GASGCILAIKAIYELQRINGRYALVTLCIGGGQGIAAIFEKL
>NE0355 cafA, S1 RNA binding domain:Ribonuclease E and G
MRFSSTSPPQETRVAIIEHGITQELHIERTSSRGIVGNIYNGRVSRVLPG
MQSAFIDIGLERAAFLHVADIHESSQNDHHKPIEQLLSEGSNILVQVVKD
AIGVKGARLSTQVSLAGRLLVYLPQESYIGVSQRIEDNSERESLRQKLQN
ILPPDSTGGYIIRTMAETAEEPELQADIVYLHKLWQDIQKKSLTISSPSL
LYQDLDLSLRVLRDFVNADTYRVRVDSRETFQKLTTFAHEYMMSDLEKIS
HYIGERPLFDLYGVEQEIEKSLARRVDLKSGGYVIIDQTEALTTMDVNTG
GFVGIRNFDDTIFKTNLEAAQVIARQLRLRNLGGIIIIDFIDMDREEHRA
AVLAEFNKALQKDRTKMTVNGFTALGLVEMTRKRTRESLAQILCETCPTC
QGRGEIRTAQTICYEILRELLRESRQFDAKEFRILASQPVIDLFLDEESQ
SLAQLGDFIARPIGLQVGASYTQEQYDVILI
>NE0606 cah, Eukaryotic-type carbonic anhydrase
MKTKILASLASIIFTGYVSIAASAPPHWSYDEEPIWGAIEDAALPVPLRY
PYAECGIGQHQSPVDLADAKIVSAKPLNKLSAQYKTDTPTFFNTGHAIQV
NTSKNFAGGLKVGKELLPLIQLHFHEPSEHVFGGKKFPAELHFVHINKDG
RIAVLGVVINIGKENAVFQTILDNMPRHEGEANSPSRVRFNPAKLLPSGV
NAANLKYLTLAGSLTTPPCSEGVQWYILTKSITISADQLEQMKSFYHDNA
RSAQSLNERSILSNQ
>NE1662 carA, carA; carbamoyl-phosphate synthase (small chain) protein
MSNRPRAVLVLADGTVFHGDSIGAEGITIGEVAFNTAMTGYQEILTDPSY
CRQIVTLTYPHIGNTGVNDEDIESTTASRIIHAAGLVVRDLSLVTSSWRS
RHTLPDFLKQHGVPAIAGVDTRKLTRILRTKGAQAGCIMTGHIDETEALR
QAQAFPGLTGMDLARVVSCSQPWEFSEGAWSPETGYQAAGESRCRVAVLD
FGVKRAILRKLVEHGCHVTVYPAQTRIDEILAARPDGVLLSNGPGDPEAC
DYAIDMARALLASKIPVFGICLGHQIIALAIGARTIKMKFGHHGANHPVL
DVDSGRVMITSQNHGFAVDVDTLPANVRVTHHSLFDGSLQGFELTDHPVL
CFQGHPEASPGPQDVNYLFSRFISRIVKNKQSA
>NE1661 carB, Carbamoyl-phosphate synthase:Methylglyoxal synthase-like domain
MPKRTDLKSILIIGAGPIVIGQACEFDYSGVQACRALREEGYRVILINSN
PATIMTDPELTDATYIEPITWQAVEKIIEIERPDALLPTMGGQTALNCAL
DLVRHGVLEKYGVVLIGASREAIDKAEDREKFKQAMTRIGLQSARSAMAH
SMEEAMQAQAMIGYPAIIRPSFTLGGSGGGIAYNREEFIEICERGLEASP
TRELLIEESVIGWKEYEMEVIRDKQDNCIIVCAIENVDPMGVHTGDSITV
APAQTLTDKEYQRMRDASIAVLREIGVETGGSNVQFAINPENGNMLVIEM
NPRVSRSSALASKATGFPIAKVAAKLAIGYTLNELGNDIAGGIIPASFEP
TIDYVVTKVPRFAFEKFPQANDRLTTQMKSVGEVMAIGRTFQESLQKALR
GLESGVDGLDEKTTDIEIIKAELGHPGPERLWYIADAFRCNIPFDEIHAL
TRIDSWFLVQIEDLIRQEQALSERRLEILDRRELYHLKRCGFSDQRLAAL
LGTSQDKVRARRHEQGVRPVYKRVDTCAAEFDTHTAYMYSTYEEECESRP
TDNKKIMVLGGGPNRIGQGIEFDYCCVHAALALREDGYETIMVNCNPETV
STDYDVSDRLYFEPLTLEDVLEIAAVEKPAGVIVQYGGQTPLKLARALET
NGVPIIGTSPDMIDCAEDRERFQAMLQGLGLKQPSNFTARTAETALAAAD
KIGYPLVVRPSYVLGGRAMEIVHEARDLERYIQEAVKVSNDSPVLLDHYL
NNAIEVDVDAISDGKAVLIGGIMEHIEQAGVHSGDSACSLPPFSLSASLQ
EELRRQTEAMAYALNVNGLMNIQFAIQDDTVYVLEVNPRASRTAPFVSKA
TGLQLAKIAARCMVGRTLADQGVTKEVYPSHFSVKEAVFPFAKFSGVDTI
LGPEMKSTGEVMGVGKNFAEAFVKSQLAAGVQLPVSGNVLISVRDSDKLE
AVSIAIKLVELGFSLYATRGTARTLQQAGLPVIAVNKVAEGRPHIVDMIK
NGEIVLIINTVANKRSAVRDSYSIRRVALQARITYYTTIAGARAACAGMA
SRQELQVYRLQNLAI
>NE0324 cbbA, fba, fda, Fructose-bisphosphate aldolase, class-II
MALVSLRQLLDHAAENGYGLPAFNVNNLEQIHAIMQAADECDSPVIMQGS
AGARKYAGEAFLRHLIAAAVEAYPHIPVVMHQDHGASPAVCINAIRSGFS
SVMMDGSLEEDGKTPSSYEYNVDVTSKVVAMAHAVGVSVEGELGCLGSLE
SGQGEAEDGHGAEGQLSHDQLLTDPEQAADFVKQTGVDALAIAIGTSHGA
YKFTRKPTGDILAIQRVKEIHERIPNTHLVMHGSSSVPQEWLEIIREYGG
DMKETYGVPVEEIQQGIKYGVRKVNIDTDIRLAMTGAIRRHLAKNKSEFD
PRKFLKDATAAAKDICKARFEAFGSAGQAGKIKPISLETMANRYLRGELK
AIIK
>NE2148 cbbE, rpe, ppe, Ribulose-phosphate 3-epimerase
MYRIAPSILSANFARLGEEITQVLDSGADIIHFDVMDNHYVPNLTIGPLV
CEAIRPLTQAMIDVHLMVEPVDRIIPDFAKAGANIISFHPEASRHIDRTI
GLIKEQGCKAGLVFNPATPLSYLDHVLDKLDLILIMSVNPGFGGQTFIPE
ALNKLRLTRERITASSRDILLEIDGGVKVDNIAEIARAGADTFVAGSAIY
GAGKESDPHRYDSIIAAMRAELAKVA
>NE0521 cbbF, fbp, fructose-1,6-bisphosphatase/sedoheptulose-1,7-bi sphosphatase
MHTGTTLTQFIIEEQRHIAGASGDFTALLNDIVTAIKTISNAVNKGALIG
VMGALDTENVQGETQKKLDVITNEIMIRNNEWAGHLSGMASEEMDDVYSI
PSPYPLGKYLLVFDPLDGSSNVDLNISVGTIFSILRSPVPSRAASMEDFL
QPGVKQVCAGYALYGSSTMLVLTTGHGVNGFTLDRDIGEFVLTHPNMRIP
ADTREFAINASNQRFWEAPVQRYVAECLAGSTGPRGRDFNMRWVASMVAE
VHRILTRGGIFMYPRDTKDPSKPGRLRLMYEANPMSFIVEQAGGLSTTGY
ERILDVVPQDLHQRVPVILGSKNEVEVVLEYHNA
>NE0327 cbbG, gapA, Glyceraldehyde 3-phosphate dehydrogenase
MTIRIGINGFGRIGRMVFRCSIEEFDDIEVVAINDLLEPDYLAYMLTHDS
VHGRFRGDVSVSGNNLIVNGKQIRLTAIKDPTELKWDEVGADIVVESTGL
FLTKELAQKHIQAGAKKVVLSAPSKDDTPMYVYGVNDKAYGGEAIISNAS
CTTNCLAPIAKVLNDTWGIKRGLMTTVHAATATQKTVDGPSNKDWRGGRG
ILENIIPSSTGAAKAVGVVIPELNKKLTGMAFRVPTSDVSVVDLTVELEK
DASYEDICNAMKEASAGSMKGILGYTDQKVVSTDFRGETCTSVFDAEAGI
QLDKNFVKVVSWYDNEWGYSCKLLEMVRVIASK
>NE1743 cbbI, ppi, rpiA, Ribose 5-phosphate isomerase
MTQDEQKRAVAQAALQYVPTGEIIGIGTGSTANLFIDELAKIKHRIEGAV
ASSEVTANRLKQHGIEVLDLNSVGELPVYIDGADEITRNMHMIKGGGGAL
TREKIVAAVARKFICIADQSKLVKVLGKFPLPVEVIPMARSYVAREITLL
GGQPAWRQGFTTDNGNIILDVHNLNIMNPVELETALNQIAGVVTNGLFAR
RAANVLLMGTDQGVETITV
>NE1779 cbbJ, tpiA, tim, Triosephosphate isomerase
MGRAGFVAGNWKMHGSLVRNQELLDAVVSGTRSLQNVSCVVCVPYPYIAQ
TQSVLEGSHVLWGGQNVSQYHHGAYTGEVSADMLADLGCRYVIVGHSERR
TLFGEGNQVVAEKFRAAQERSITPILCVGETLAQRESDETEQVIAMQLDA
VIDLAGIEALGQSVIAYEPVWAIGTGKTATPQQAQDVHKFIRSRIAVHSG
GIAENIQILYGGSVKADNARELFTMPDIDGGLIGGASLVAAEFISICLAA
QN
>NE0326 cbbK, pgk, Phosphoglycerate kinase
MSVIKLVDLDLKNKRVFIRADLNVPVKDGKVTSDARITASMATINHCLKQ
GAKVMVTSHLGRPEEGVWTEENSLQPVADDIARRLGKPVRLIKDWVEGGF
EVASGELVVLENCRINKGEKKNLEETAKKYASLCDVFVMDAFGTAHRAEA
STHGIAKYAPIACAGILLTEELDALTKALHQPAHPLVAIVGGSKVSTKLT
VLESLAEKVDQLVVGGGIANTFLKAAGNNIGKSLCEDELVPVAKSLMDKM
NKRNATIPIAVDVVVGKKFAEDEAAVLKAANAVSDDDMIFDIGPESAQEL
VDIIMKAGTVVWNGPVGVFEFDQFGEGTRAIAKAIAETDAFTLAGGGDTI
AAIQKYDIYDKVSYISTAGGAFLEFLEGKKLPAVEILELRAK
>NE1921 cbbL, rbcL, Ribulose bisphosphate carboxylase, large chain
MSAKTYNAGVKEYRHTYWEPHYNVQDTDILACFKIVPQPGVDREEAAAAV
AAESSTGTWTTVWTDLLTDLDYYKGRSYRIEDVPGDDSSFYAFIAYPIDL
FEEGSVVNVLTSLTGNVFGFKAVRSLRLEDVRFPIAYVKTCGGPPNGIQV
ERDILNKYGRAYLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVN
SQPFMRWRQRFDFVMEAIHKAERETGERKGHYLNVTAPTPEEMFKRAEYA
KELKAPIIMHDYIAGGFCANTGLANWCRDNGILLHIHRAMHAVIDRNPHH
GIHFRVLAKMLRLSGGDHLHSGTVVGKLEGDREATLGWIDIMRDSFIKED
RSRGIMFDQDWGSMPGVVPVASGGIHVWHMPALVTIFGDDACLQFGGGTL
GHPWGNAAGAAANRVALEACVEARNRGVPIEKEGKAILTEAAKHSPELKI
AMETWKEIKFEFDTVDKLDVAHK
>NE1918 cbbO, von Willebrand factor type A domain
MSLNLTDYSELLESLQAESRSLLENTWSDAIRTLSPRGIDNYLKGASALQ
GLGRGRTLVDVWINEIPLVVKEVGDDVVSDLATTCLMLSSKTSGAVIEII
IATSVTAASRLGDPELFRNYLQFLNTFTAQAPRGVRPMLDNLEVLFSQLT
LGGLRRWALWGAHAHRTDYEAQLQYFSLSSKESHAVLRKERKGTLFVDIQ
RRINIYLRALWARDFYMRPTSGDFENREGYRPFIEDYIIHLPDAYDNYGD
IAASEVYRAAAAHAAAHLVETKKPISAESLNPLQMAVISVIEDARVEALS
IRRFPGLQHTWSALHIATADMNTTIGDYLNRLARALIDPDYSDPDEWVIQ
GRLLFSQVQDRLTDNQICWDIGVQLAHEIMAKKIPFNPRTDVLTALYRDD
NRYFWEFEEFDFERSIAAGYEPLKQVRKHVSVMEFINELDVETAGDDAQE
IWVLGTELFPYEDYGVSYNTLEGKEPVASPFHYAEWDYQIQLERPAWSTV
QEKRAKPGDIQIIDEIAAQYKREIHRMKFLLDAMQPQGVQRIRKLEDGDE
IDINAAISSLVDIRMGQHPDPRVMMRAVRKVRDISVMVLLDLSESTNNQV
NGQEFTVLDLTRQATVLLADAINRIGDPFAIHGFCSDGRHDVEYYRFKDF
DQPYNEVPKMRLAGMTGQLSTRMGAAIRHASHFLKLQRAAKKLLLVITDG
EPADVDVRDPQYLRFDTKKAVEEAGRSGILTYCMSLDPRADQYVSRIFGM
RNYMVVDHVERLPEKLPMLYAGLTR
>NE1474 cbbP, prk, Phosphoribulokinase
MSQKHPVIAVTGSSGAGTSTVKTSFEHIFQRERLNVAVVEGDSFHRYDRE
SMKQAVEESEKGGGRPISHFGPEANEFEKLEELFRSYGQSGSGEIRLYLH
NEQEAQPYNQKPGTFTPWKSIQPGTDLLFYEGLHGGVKSDTADVSQYVDL
LVGVVPIVNLEWIQKIYRDTAARGYSAEAVTHTILRRMHDYVHYITPQFS
RTHVNFQRVPTVDTSNPFIARDIPTLDESMVVIRFRDHRGADFPYLLNMI
SGSFMSRPNTIVVPGGKMGMAIELILTPLILDLVAKSKK
>NE1919 cbbQ, nitric oxide reductase NorQ protein
MSDVIEQYFVKNEPYYRPVADEVKLYEAAYSVRMPMMLKGPTGCGKTRFV
EYMAWKLGKPLITVACNEDMTASDLVGRFLLDAQGTRWQDGPLTTAARYG
AICYLDEVVEARQDTTVVIHPLTDNRRVLPLEKKGELVGAHPDFQLVISY
NPGYQSLMKDLKQSTKQRFGALDFNYPQHDIETEIVSHETGIDSAIADKL
VSIAERARNLKGHGLDEGISTRMLIYAGNLIAKGVDVHAACRMALVRPIT
DDPDMRDALDAAVTTFF
>NE1922 cbbR, rbcR, Bacterial regulatory protein, LysR family
MLHITLHQLKIFESVARLLNFTHAAEELHLTQPAVSIQIKRLEEMVALPL
FEQIGKRVHLTEAGKELFHYSRGITRQLADMELALDELKGLERGKLSISV
VSTANYFVPNLLAKFCQRYPGITISLHVSNRENVLKQLSDNIMDLAIMGQ
PPEGLDITSESFMENPLVIIAPPEHPLCGQKQIPVKRLEQEIFLVREPGS
GTRNAMERFFAHEKISIQKGMEADTAEAIKQAIQAGMGLGIMSLHTIRLE
LEAGRLKILDIQGLPIMRYWNVVHQKNKRLSNISSVFKQFLLNEAADLVA
LEYPHRNTT
>NE1920 cbbS, rbcS, Ribulose bisphosphate carboxylase, small chain
MSEVIDYKSRLSDPGSRKFETFSYLPSLDQDQIRKQVEYIVKKGWNPAVE
HTEPEFLMSNYWYMWKLPMFGETDVDRILAEAEACHKANPNNHVRLIGYN
NFNQSQGTAMVIYRGKTV
>NE0328 cbbT, tkl, Transketolase
MKEAFKDFSAPVFKNLTSAIRALAMDAVQKANSGHPGMPMGMAEIAEVLW
NHHMRHNPTNPDWVDRDRFVLSNGHGSMLIYALLHLTGYDLSIEDIKNFR
QLHSKTPGHPEYGYTPGVETTTGPLGQGITNAVGMALAEKILAEEFNRPY
FPIVDHYTYVFLGDGCMMEGISHEACSLAGTLKLGKLVCFYDDNGISIDG
HVEGWFTDNTPKRFESYGWHVVPDVNGHDPVAIQAAIEAAKKVQDKPTLI
CCKTVIGMGSPNKANTSHVHGAALGNDEIAAARPHIGWNYLPFEIPQEVY
DAWDARAKGQALESAWNSMFAEYGKKYPAEAAEFTRRMADELPVGWADHV
EGTISQINAREETIATRKASQNSIEALAPVLPELIGGSADLAGSNLTMWS
GSKGISTESTGNYVFYGVREFGMSAIMNGLSLHGGIIPYGGTFLMFSEYA
RNALRMAALMKIRNIFVFTHDSIGLGEDGPTHQPVEQTATLRMIPNMDVW
RPCDTVESIVSWVSAIERREGPSSLIFSRQNLPFQKRDAVTISLIRKGGY
ILSEAANRKPQAIIIATGSEIGLAMAAQKTLAESGIAVRVVSMPCTNIFD
RQDQAYRDSVLPKGIARVAVEAGVTDYWRKYVGLEGDVIGIDRFGESAPA
EKLFEYFGFTVENVVNAVKRVI
>NE0947 cbbY, hydrolase family
MALSAVLFDVDGTLADTERDGHRIAFNQAFNEFQLDWEWDVDLYGVLLQI
TGGKERIRFYIENYAPSLLSKNNLDEWIAQIHKTKTNYFLNLLKEGKIPL
RPGIKRLLDELRKNNIKIAIATTTTYENVSTLLQCTLGDSALEWFDVIGA
GDIVSKKKPAPDIYEWVLNQLNLPAEACIAIEDSENGLKSATAAGIKTII
TISEYTREQNFSYAALVLEDLESTDHTHQIAAQSFDKPLSVQTLSDLLN
>NE2149 cbbZ, pgp, phosphoglycolate phosphatase
MTQLNTQPSDSGLFPLPLKAIIIDLDGTLLDTAQDLALAANEMLRELRMA
ELPSSTIQTFIGKGVPKLVKRTLTNSPDGEPDPELFEQALPIYERCYAEN
LHVHTRPYPGVVEGLEQLKQSGFRLVCITNKTEIFTLPLLHKTGLLDYFE
LVLSGDSLPKRKPDPLPLLHACKHFDILPKAALLIGDSSNDAIAARAAGC
HIFCVPYGYNEGHDVRELDCDAVVDTIVDATRLITYQHDT
>NE0572 cbl, Bacterial regulatory protein, LysR family
MNFQQLRIISETIRQNYNLTEVANALFTSQSGVSKHIKDLEDELNIDLFI
RKGKRLTGLTAPGKELVKIVDRILLDTRNIKRLAEQFSQHDQGELTVATT
HTQAQYALPDVVRQFKEIFPKVHLVLHESSPGEIVSMLLNGQADIGIATE
ALESTSELVSFPYYTWHHAIIVPAEHPLQSIHPLTLEAVAQFPIITYHKG
FTGRSRIDQAFSRAELTPNIAMSALDADVIKTYVTLGMGIGIVASVAFTP
ERDTKLIKLDGSHLFEQNTTCISIRRNHYLRGYAYRFIELCIPSLSRDTI
CSEIQ
>NE1614 cca, Poly A polymerase family:HD domain
MKIYRVGGSVRDELLGLPVKDQDYVVVGATPEEMVRLGYRPVGKDFPVFL
HPETHEQYALARTERKIARGYKGFEVYAAPGVTLQEDLARRDLTINAMAR
DEAGDIVDPFGGIADLQAGILRHIGPAFVEDPVRVLRVARFAARFGFQIA
PETLELMKEIVHTGETEALVPERVWQEIAHGLMESHPSRMFYVLRECGAL
ARILPEVDALFGVPQPAHAHPEIDTGIHVMMVIDHAASKQYPLEVRFAGL
THDLGKGTTSPDEWPRHIGHEARSVELVMGLCERIRVPGESRDLALLVAR
FHGDVHRALELRPATIADMLQATDAYRKKARFQAFLQACASDFHGRPGFA
DKPYPQKEHLSRALQVAVDVDAGAIAMQLNQSHAGKADLPMRINRQVYAA
RVERISSLLSHL
>NE0764 ccmA, ccmA; ABC superfamily (atp_bind), heme exporter protein, cytochrome c-type biogenesis protein
MTTGPSPIEMLTARNLECIRGEHRLFSNLSFSVNPGELMFVGGPNGSGKT
SLLRLLCGLSLPDDGEIYWNGTDIRKLGVDYRDVMTYLGHLGGIKDDLTA
IENLRISCALAGCEIDEDQAADALGQIGLAGREMLPARVLSQGQRRRVAL
ARLLVTRTKLWILDEPLTALDVAAVELIKGILEHNLAKGGMVIMTTHQEL
VMSTVTVHHLVLS
>NE0765 ccmB, ABC transporter, permease domain, ccmB, heme exporter protein B
MIWWIIKRDLLLAMRRRSDVLTTLFFFVIVVSLFPLGVGPEASILRIMAS
GVVWVAALLASMLSLGRMFSSDYADGTLEQMLLSPVSLSALVMGKAISHW
LVTGVPLVLMAPVLGIQYDLPVEALIVLTLSLLLGTPVLSLIGAIGAALT
LGLRGGGVLVSILVLPLYIPVLIFGTGAVEASTAGMGFGAHFSILGAFLL
VSIVFSPWATAASLRISMG
>NE0766 ccmC, ABC transporter, permease domain, ccmC, heme exporter
MTINWFKYASPASFYFLAGRMIPYFSIAAIIFLIVGLYIAFFSAPTDFQQ
GEAYRIIFIHVPAAWMSMFLYVVMAFWAGIGLAFNTRLSSMVASAIAPTG
AMFTFLALWTGAWWGKPMWGTWWVWDARLTSELILFFLYIGFMSLQAAID
DPRRADKAGAIIALVGVVNIPIIYFSVQWWNTLHQGASVSLIQSPKMATA
MLSGMLIMALAVWMYSIVMILIRARLIILERERHTAWVSELKEGVLE
>NE0767 ccmE, CcmE/CycJ proteins
MKPRHKKMAVIALSVSALTVAVVLVLNAFQSNLVFFFSPSQVAAKEAPIG
KSFRIGGLVEEGSLKREGDGTTLNFAITDTAEVIRVVYTGILPDLFKEGK
GVVAQGKMADDGIFYADEVLAKHDENYMPPEAASALEQAAKAQKTSLAQ
>NE0768 ccmF, Cytochrome c-type biogenesis protein (CcmF)
MIPELGNFALILAMLLALVQGTLPIIGAARGIPSWIALARPVTQGHFVFI
AIAFGCLAYSFVNNDFSVLNVASNSNSDLPVHFRFAATWGSHEGSLLLWV
TMLAGWSVAVSVFSRQLPDDMVARVLGVLGLVSAGFLLFLLFTSNPFDRL
LPAAIEGNDLNPLLQDPGMVMHPPMLYMGYVGFSVAFAFAIAALLNGQLD
AAWARWSRPWTVVAWIFLTIGIMGGSWWAYYELGWGGWWFWDPVENASFM
PWLVGTALIHSLAVTEKRGSFKSWTVLLAITAFSLSLLGTFLVRSGVLTS
VHAFATDPARGIFILVYLSLVVGCSLTLFAWRASKVGLGGNFEIVSRETT
LLANNVLLLVAAGSVLLGTLYPLLIDALDLGKISVGPPYFEAVFVPLMAP
AIFLIGLGPIARWKKASLPALAVLMRWAFVVSLVAAGIMPFIMGEWKPMV
SFGLLLAFWVIASIFVSISHRLRNGGEGGLFTRLAKQPRGYYGMHCAHLG
IAVFIIGVTLVNGYETEKDVRMEVGSTVTLKEYTFQFNGTKSVTGPNYNA
SQGQVEVFREGKKVSELQPEKRTYTASGMPMTEAAVDMGLLRDLYVALGE
PLENGAWIVRVYHKPFVNWIWLGCMLMALGGALAASDRRYRLAIGKKSRN
KAASGPVAPETADIPAGSMVTNSVILSEEGGKV
>NE0769 ccmG, Periplasmic protein thiol:disulfide oxidoreductase DsbE
MMRFLLPLAAFVVLVIFLGVGLTLNPRQVPSPLVDKPAPPFRLQQLYDTT
KTLSSEENLGKVWMLNVWASWCVACRDEHPVLVELSKLGIVPIYGLNYKD
QRDTAMQWLKQFGNPYEISIVDADGKVGINYGVYGVPETYVIDKQGIIRY
KHIGPVTVESLKNKILPLIKELKS
>NE0770 ccmH, putative cytochrome C-type biogenesis protein CcmH
MMKRLIHSSLKRYLVVLILSFTSLQGYAATEAVPVAEDPELEKRLVNLAE
NLRCLVCQNESLAASRADFANDMRREIREQMRMNKSDDEIVEFLVARYGD
FILFNPPLKSTTWLLWFGPFALFLGGGAALIFYLRRRRIQINEKDLPLTE
SQRLRAESLIKEMNKDQSI
>NE1315 ccp, cytochrome c551 peroxidase
MIKRTLTVSLLSLSLGAMFASAGVMAANEPIQPIKAVTPENADMAELGKM
LFFDPRLSKSGFISCNSCHNLSMGGTDNITTSIGHKWQQGPINAPTVLNS
SMNLAQFWDGRAKDLKEQAAGPIANPKEMASTHEIAEKVVASMPQYRERF
KKVFGSDEVTIDRITTAIAQFEETLVTPGSKFDKWLEGDKNALNQDELEG
YNLFKGSGCVQCHNGPAVGGSSYQKMGVFKPYETKNPAAGRMDVTGNEAD
RNVFKVPTLRNIELTYPYFHDGGAATLEQAVETMGRIQLNREFNKDEVSK
IVAFLKTLTGDQPDFKLPILPPSNNDTPRSQPYE
>NE0236 ccrB, Site-specific recombinase
MTQQAVIYCRVSSLKQVTEGHGLASQETRCREYAKHKGYEVVEVFHDEGI
TGKLLDRPNMKAMLIYLKQHRATRPVVIIDDISRLARDIETHLHLRASIS
AAGGKLESPSIEFGDDSDSRLVEHLLASVAAHQREKNAEQVFNRMKARMM
NGYSVFNAPIGYRYDKVGKHGKLLVPDQPCASVIAEGLEGFASGRFETQI
ELMRFFEASPHYPKDRFGTVHMQRIKEILSRVLYAGYLDKPDWGIHLVKG
HHEALVSYETWKKVQARLNGQAKAPVRKDINEDFPLRGFVTCACCGSPLT
ACWTRGGGGLYAYYLCYGKTSGVKCSQNGKSIPKDKLEGEFGALLSEMKP
SKEMFLLAAEIFTDLWNIKRDTAKQEAETIRRNLLQIERKTEQFLDRIAD
TDNSILITAYEKKIRQLEEEKIALDEKIAQCGRPLQSFDETFRTAFSFLS
NPYQLWVSSRLEHKRAVMKLAFSERLRYCRNEGFRTPEKSLPFLLLEGSD
EGKHEMVGLVGLEPTTKGL
>NE1713 cdsA, possible cdsA; phosphatidate cytidylyltransferase
MLSTRLLTAFILLTAFLAALFYLPDIFWAVLLLGLTVTAAREWCRLGQFS
VGQTVLYMILTTLLGGELLFLVSVVLSKNPADSSMVWFYAASAVFWLLAV
PFQLAMNRPIRNIWVLMSIGWLVLLPACLSLYQLRAMDPLLLLGFMGVVW
VSDSVAYFIGRAYGKHKLAPRISPGKTWEGVAAALIGVLVYALIWFYSVH
KDQALVGWLIPLLLLLAVLGIVGDLYESLLKRQAGVKDSGTILPGHGGVL
DRIDALLPVLPVAALAAILFYSTES
>NE1866 cheA, probable chemotaxis sensor histidine kinase transcription regulator protein
MSTDLEQFYEIFFEEAGELLAEMETRLLRLDILSPDPEDLNAIFRVAHSI
KGGAGTFGFTDMAELTHVLESLLDRLRRNELELRAEMVDAFLHARDVLKH
QLDGHKGNGSADPAEVEAVNRKLKELERDTGSSHTPSECLSETIESASVE
ESASAVGAAVPVAVAELSVAVAQPVSVPENAATYRIEFDCTNLARPVMEK
LLADLSRQGQLKNLTEPGNETVCTLQLSTQISEEDVWESLAFLVDPTSLS
IEIAGKIADDSPLAAVVPEPVIECEQKHPPAIDQTAATVVEVQEEKEETI
TPVLSNAASPVPVSLVSSGAKQPAGATAGAPRSKSSAEQSSANSESSSIR
VSVEKVDQMINLVGELVITQAMLAQTASQIDPVIFEKLLNGLNQLERNTR
DLQEAVMSIRMMPISFVFSRFPRVVRDVASKLGKQVDLKTVGEGTELDKG
LIEKITDPLTHLIRNSLDHGIETPEKRRELGKPARGTITLRASHQGGSVI
IEVADDGGGLNREKILAKAQSRSLPVNENMTDQEVWMLIFEAGFSTADVV
TDVSGRGVGMDVVKRNIEGLGGRVEIDSITGQGTRISIRLPLTLAILDGL
SVAVGDQMFIIPLTYITESLKPAAEDIRTIQGQGRVVQVRGEYLPVIALH
EIFNLEPTVKNIHEGILVILDAEGSKAAMFVDTLVGQHQVVIKSLESNYR
RVAGVSGATIMGDGSVALILDAVGLVNVTKQRMIHAA
>NE1859 cheB, CheB methylesterase:Response regulator receiver domain
MKKISVLIIDDSALIRKLLTEIINARPDMEVIGAAPDPIVAREMIRTLNP
DVLTLDVEMPKMDGLEFLEKLMRLRPMPVVMVSTLTERGSEVTFRALELG
AVDFVAKPKMDIRNSLEAYTDEITGKIRTASTARIHRLEPVASNGAASGS
IVQMPRHHGISTEKLIIIGASTGGTEAIKDVLIHLPPDCPGILITQHMPE
GFTRSFAERLDRLCKIRVKEAEGGERVLPGHAYLAPGHSHLLLKKSGANY
VTELSQTDPVNRHRPSVDVLFQSAAHCAGKNAVGVILTGMGKDGAAGMLD
MHHAGAYNLAQDETSCVVFGMPKEAITLGAVDEIVPLQDMARRILARLVG
AGKQAVRV
>NE1861 cheR, CheR-type MCP methyl-transferase:Generic methyl-transferase
MNARASGEHNTREYAFSSHDFECVKKLIYARAGIALAENRQEMVYTRLSK
RLRATGTNNFNDYLQRLERDSEDVEWESFTNALTTNLTSFFREAHHFPML
AEHVKKRQPGHTVKLWCSAASTGEEPYSMAMTMVETFNSMKPPVRILATD
IDTEVLAAAREGVYPLDRLESMSSERIRRFFLKGKSSTSGYARVRQELRD
LVTFRPFNLLDPKWPMRGPLDAIFCRNVMIYFDKPTQYRILQKFVPLLDR
DGLLFMGHSEVFQHASDLIRLIDRTVYKPVKK
>NE1865 cheW, CheW-like domain
MLLTQAAAAGESLNPVGNLASSEFLIFRLGKEEYGIEILKVQEIRGYDAI
TRIANAPEFIKGVINLRGVIVPIVDMRIKFNLGEALYDQFTVVIILNLSG
RVIGIVVDGVSDVINLDTEHLRPAPEFGGVIDTEYIIGLGTMEERMLILV
DIEKLMSSSEMGLLEQTGK
>NE1923 cheY, Response regulator receiver domain
MTDKNLRFLVVDDFSTMRRIIRNLLKELGFNNVEEAEDGAMALKKLRDGG
FDFVVSDWNMPNMDGLTMLQNVRADDALKDIPVLMVTAEAKKENIIAAAQ
AGASGYIVKPFTAATLDEKLNKILQNMAVHA
>NE1924 cheZ1, putative chemotaxis protein CheZ
MNNSFDTDKKTIDHPGQLARVLHNCLSELGYDWRLRQASHGISNSKDGLN
YIASKTTQAAECSLLAVENARPILNNLSADAANLHKLWCQIPETTAAIIA
NQPTLNNALKQTLDFLNNVSSQAASTQACLTEIMIAQNFHDLTGQVIQNI
SRTIETVEQEMLQLLANDSANEEKEIKLDNGLLNGPVVNPQKQDDVYVNQ
AQADNL
>NE1733 clpA, ClpA, ATP dependent protease, chaperonin
MIAQELEVSLHMAFVESRQKRHEFITVEHLLLALLDNPSAAKVLVACTVD
IEDLRKSLQDHIARHTPVVEGSEDVDTQPTLGFQRVIQRAILHVQSSGKK
EVTGANVLVAIFGEKDSHAVYFLQQRGVTRLDVVNYISHKIGKAAQSSES
EKNEENADGEQQDSGSTLENYTVNLNSQAIANRIDPLIGREKEVERVIQT
LCRRRKNNPLLVGEAGVGKTAIAEGLARRITENRVPDVLANHQVYALDMG
ALLAGTKYRGDFEQRLKVVLKQLTDNPKAILFIDEIHTLIGAGAASGGTL
DASNLLKPALSSGRLKCIGATTYNEYRGVFEKDHALSRRFQKIDVSEPDI
GETVEILRGLKSRYEKHHNVKYTEVALTAAAELSARFINDRHLPDKAIDV
IDEAGAAQRILPKSRQRKIIGRQEIEQVIAGIARIPPQNVSSDDRNKLKT
LDRDLKAIVFGQDAAIDALTSAIKMARSGLGNTCKPIGSFLFSGPTGVGK
TEVARQLAYTLGIPLHRFDMSEYMERHAVSRLIGAPPGYVGFDQGGLLTE
TIIKQPHAVLLFDEIEKAHPDIFNVMLQIMDYGTLTDNSGRKADFRNVII
VMTTNAGADVLTRTSIGFTEHTKSGDEMVEIKRLFTPEFRNRLDAIISFT
SLTEDIILRVVDKFLIELETQLQEKKVDVTFTDNLRKYLARHGFDPLMGA
RPMARLIQDIIRRALADELLFGHLANGGKVTVDIDEDGKARLTFEDKENI
ASPAIS
>NE2402 clpB, ClpB ATPase dependent protease, chaperonin
MRFDKFTTKFQQALADAQSMALGQDHPYIEPQHLLLALMQQDDGSITSLL
QRAGVNAQPLRQALTQSLKLLPKVEGTGGEINVSRDLANLLNLTDKEAQK
RGDQYIASEMFLLAALEDKGETGRLLKQYGATRAALEQAVDSVRGGEKVT
DAEAEGSREALKKYTLDLTERARSGKLDPVIGRDDEIRRTIQVLQRRTKN
NPVLIGEPGVGKTAIVEGLAQRIVNGEVPETLKNKRVLSLDMAALLAGAK
YRGEFEERLKAVLKELAQDEGRTIVFIDELHTMVGAGKAEGAMDAGNMLK
PALARGELHCVGATTLDEYRKYVEKDAALERRFQKVLVDEPGVEATIAIL
RGLQEKYELHHGVEITDPAIVAAAELSHRYITDRFLPDKAIDLIDEAAAR
IRMEQDSKPEVMDKLDRRLIQLKIEREAVRKEKDDASKKRLALLEEEISK
LEREYADLDEILKAEKSRAKGSQEIKEELDKLRREEEAARRKGDLQRASE
LLYGRIPQLEAQLAEQLHHAESAEEAVQPKLFRTQVGAEEIAEVVSRATG
IPVSKMMQGEREKLLFMEDKLHERVIGQDEAVRLVSDAIRRSRSGLADPN
RPYGSFLFLGPTGVGKTELCKALAGFLFDSEEHLIRVDMSEFMEKHSVAR
LIGAPPGYVGYEEGGYLTEQVRRKPYSVILLDEVEKAHPDVFNVLLQVLD
DGRMTDGQGRTVDFKNTVIVMTSNLGSQMIQQMSGDDYQVIKLAVMGEVK
TYFRPEFINRIDEVVVFHALGEAHIKSIARIQLSNLGKRLAQMEMKLVVS
EPALTKLAEVGFDPVFGARPLKRAIQAQIENPLAKELLEGHFSAGDTILV
EYSNGHMQFTAQR
>NE0031 clpP,lopP, Clp protease
MQPMFDRERNGLDTTGLGLIPMVIETSGRGERAYDIYSRLLRERIIFLVG
PVTETSANLVIAQLLFLESENSEKDIFLYINSPGGLVTAGLAVYDTIQFI
KPDVSTLCVGQAASMGAFLLTAGAKGKRYCLPNSRVMIHQPLGGFQGQAS
DIEIHAKEILALKSRLNEIMAKHTGQTVKAIERDTDRDNFLGAEAAVKYG
LVDAVLTSREVKQE
>NE0032 clpX, clpX; ATP-dependent protease (ATP-binding specificity subunit)
MSEKTNDEKLLYCSFCGKSQREVRKLIAGPSVFICDECIDLCNDIIREEI
QVDETAKLAKTSLPTPHEIRETLDQYVIGQESAKKILSVAVYNHYKRLKN
LSKVNNGDDVELSKSNILLIGPTGSGKTLLAQTLARLLDVPFVIADATTL
TEAGYVGEDVENIIQKLLQKCNHDVEKAQRGIVYIDEIDKISRKSDNPSI
TRDVSGEGVQQALLKLIEGTTALVPPQGGRKHPNQEFIQIDTTNILFICG
GAFDGIDKIIRGRSEKSGIGFGADVINQNDRKELNKILKDIEPEDLIKYG
LIPEFVGRLPVVATLSELNEAALIQILVEPKNALIKQYNKLFSMEGGVEL
EFREQALVAIARKALSRKTGARGLRSILEETLLDIMYDLPSIENVSKVVI
ESGSNHDELQPIVIYAEKPKLARSSK
>NE1963 cmk, Cytidylate kinase
MDKQNVPVITLDGPSASGKGTIARLVSQALGFHYLDSGALYRLVALAAMK
RNTDVSDEHSMVDIARHLNVSFRDSSIWLEGKDVSDEVRAEACGEYASRI
AQYSALRVELLGYQRDFRKSPGLVTDGRDMGSVIFPDATLKIYLTASEEE
RALRRHKQLMEKGINASIADLIQALRARDERDSSRSTSPLQQCEDACLLD
TTGLSIDQVVSRVLNMYAEARKA
>NE0968 coaD, Coenzyme A biosynthesis protein:Cytidylyltransferase
MDKVIYPGTFDPITRGHEDLIQRASRLFDQVVVAVAANSGKSPCFSLEER
VEMARAVLAEYANVEVTGFSGLLMEFTRQQQAHVIVRGLRAVSDFEYEFQ
LAGMNRSLYPDVETIFLTPSEQYMFISATIVREIARLGGDASKFVHPLVA
ERLYEKRKK
>NE0634 cobO, ATP:corrinoid adenosyltransferase BtuR/CobO/CobP
MPPADSTDMDADKAQRYRSRMQRQKAVVDAAIVRADKSKGLLLVLTGNGK
GKSSSAFGMVARALGHGMRVGVAQFIKGRSDTGEEAFFTQQKNIVWHVLG
EGFTWDTQDLARDRETARRGWSVVQTMLRDPSIDLLILDELTYPLKFGWL
EISEVLSDLQNRPAMQHVVITGRAAPDTLCGAADTVTEMRDIKHAFQAGI
QAQTGIDF
>NE1019 copA, copA; copper-transporting ATPase
MTCAACAARIEKNLNKLPGVHAAVNFANEKALIKFDHDSTHPEELVQSIE
KAGFQVPEQTVQLQISGMTCAACANRIETVLNEIPGVRAILNPAAEIAYI
SFNPAITSVEQLVSAVEKAGYGANQISDDNYVKEQSRNQAAYRKELRIFW
ISAALTVPFMLEMIMMLTGNHNNLLPYWLQWLLATLIQFWPGRRFYISAW
RTLRGGGANMDVLIALGTSMAYLFSTAVIILQLDQHVYFEASAMIITLVL
LGKLMETRAKRKTSAAIETLIKLQPKTARVERDGEIIEIDINSLKNEDIF
LVRSGESLPADGIVIEGSSSINEAMLTGESQPVTKQVGAKVYAATQNQHG
LLKCRVTGVGKNTQLAAIIRLVEIAQGSKAPIQRMADTVSGIFVPIVIGI
SILTLGLTWWLTGHFVIALINAVAVLVIACPCALGLATPTAIMVGTGRGA
QEGILVKNATALELAEKIQMLVVDKTGTLTEGHPSVTDIIVADEVNEHDL
LQIAASLEQGSEHPLAKAVSQYASSSKIRLLAITNFASVTGSGIKADIDN
ASYILGSPKFLAEKGAVLDQQRITALQTEGKTVVGVAIQINESVRVIGYL
AIADSLRETSIKAIQRIQNLGIDVMMLTGDNPTTAAAIAKQAGIKIFHAE
VSPQNKAAEIEKIKANGQFVGMVGDGINDAPALAAANVSFAIGAGSDIAI
EAADITLMRNDLMSVADAISLSRSTLHKIRQNLFFAFFYNILGIPLAAAG
MLSPVIAGAAMAMSSVSVITNSLLLKRWRAGV
>NE1016 coxA, Cytochrome c oxidase, subunit I
MAVIQDVHGHGHDHPTGIMRWLTTTNHKDIGTLYLCFSLVMLFVGGFLAM
GIRAELFMPGIQILEPEVFNSFTTLHGLIMVFGAVMPAWVGFANWQVPLM
IGAPDMAFARLNNWSFWLLPVAALLLITSFFVPGGAAAGGWTFYPPLSTQ
GGMGTDMMLFAVHILGMSSIMGSINIITTILNMRAPGMTLMKMPMFVWTW
LITAYLLVLVMPVLAGAVTMVLTDRNFGTSFFNAAGGGDPVMFQHIFWFF
GHPEVYIMILPAFGVVSEIIPTFSRKPLFGYSSMVYATASIALLSCFVWA
HHMFTAGMPVAGQLFFMYATMLIAVPTGVKVFNWTATMWRGSMTFEVPML
FSIGFIFLFVIGGFSGVVCAIVPVDIQVQDTYYVIAHFHYVLVSGALFAL
FAGAYYWLPKWTGHMYDESLGKWHFWLSMVFFNVAFFVQHFLGLAGMPRR
IPDYALQFADFNMISSIGAFGFGLTQLLFVYIVIKCVRGGEKAADKTWEG
ATTLEWTHLPSPAPYHSFETPPVVK
>NE0683 coxA2, Cytochrome c oxidase, subunit I
MSATEVNGKIEYSDPENRNALIAVKSHLIAGFLVFLLMMIAGFTMRAAQG
TWLEIGADLFYQIMTVHGVGVVGAAMITSTGVQWYFLRQYVRLSNGIFWT
NFFIFLTGVVLILGSIFVGKYGGAWTFLFPLPAISMGAWSNGAAALFQAG
LLIIGVGFLILYLDFGRAILARYGSLARALGLTQLFGKEPVDYKHPAAVV
ASTMVLIVNFMGIAAGAVVLVMGLINLFYPEFKPDALLMKNLIYFFGHVV
INATIYASVIAVYELLPRYTGRPWKTNKPFYAAWFAIVFMVMGVYPHHLM
MDFAMPNWALVVGQVLSFGSGIPVMVVTGYGALMIVYRSGIKWTTSAKLL
FLSMFGWWAGVIPAILDAMVTVNRAFHNTLWVPGHFHFYLLLGLLPMLIG
FAYFLAGEGDSKEQPKATDKLGFYAYLIGAFMTSMVFLASGASSIPRRWA
AHMPEWVSFAQVGTVAAALVVLGMLLFIIRALVSLPGASNPR
>NE0684 coxB, possible cytochrome-c oxidase (EC 1.9.3.1) chain II
MISTVYAITLVGIGLVIAVFYFVIANSKEADPDYPSITKKWYGVRSKWFL
FLLVLGITVSVLSTNPFPTPDQKKEYSGGDYQNVTVDSHQWYWIMTPSTV
KAGQPVEFQVSSADVNHGFGIYDEKLTLVAQTQAMPGYTNKLIHTFDKPG
KYKILCLEYCGLAHHAMISELTVE
>NE1017 coxB, coxB; cytochrome C oxidase polypeptide II precursor (cytochrome AA3 subunit 2) transmembrane protein
MSGSKLMAALSGFTVLGLYSGMAMSSKYNLPEPQTPIAQQIYDQHMMALW
ICLVIFIGVFGVMGYSIIKHRKSVGYKAANFHHSTAIEFIWTTIPILILV
AMSWPATKTVIEMKDTSEADVTIKATGYQWMWGYDYIQGEGEGISFYSQL
STPQDQIRNEATKGQDYLLEVNNRVVVPTGKKIRILMTANDVIHAWWVPA
LGVKQDAIPGYIRDAWFKVEKPGVYRGQCAELCGKEHGFMPIVVEVMEQE
KYTQWVAEMQKSASAGVVNTALVEQSSGKN
>NE1013 coxC, Cytochrome c oxidase, subunit III
MSQGTGYYVPSPSKWPIIGSTALFFMGFGAAFTMNKLPLGYGMLITGLVI
LAFLIYGWFREVSLESEAGKYKAQEDKSFRWSMGWFIFSEVMFFASFFGA
LFYMRVLSVPWLAGADQELLWSGFSADWPTAGPGFDEKFTPMGAWGIPAI
NTMLLLTSGVTITLAHWALKKNDRGALKLWLFATIALGVTFLGFQIYEYA
HAYSDLNLKMTSGAYGSTFFMLTGFHGFHVTVGTIMLIAIFFRCLSGHFK
PEHHFGFEGVAWYWHFVDVVWLGLFIFVYLV
>NE0922 cphA, putative cyanophycin synthetase
MSTPIAIHTDPSVHSTKDLKFLEIRYLNGPNMWTYHPVLEAIVDIGELEN
YPSDTIPGFYERLSSWLPSLIEHRCSYGEPGGFLRRVQEGTWPCHILEHV
TLELQNLAGMRGGFGRARETSTSGVYKVVVSAWHKEITLKALYLARELVL
AAMGFNQMQLAYDVQHAIGQLRDMIDSHWIGPSTACIVDAAAAHNIPSIR
LLDKGNLIQLGHGSRSRYIWTAETDRTTAIAENISRDKELTKSLLRACGV
PVPEGRIVTDPADAWLAAEDIGLPVVIKPGDGNHGRGVFIELSNREEIEA
AFPIALQEGSDVLVERYIPGIEHRLLIVGGQLVAAARGDSVFVTGNGVST
IAKLIEDQINSDPRRGTSENHPLNFIELDNAIRMELAHQGYTGDSILPSG
TKVRIQRNGNHAIDITDEVHPATADLAVLAARIIGLDIAGIDLVVEDISR
PLAEQGGAIVEVNAGPSLLMHIKPAIGKPRPVGEAIIDQLFPNQDQGRIP
VVGITGSKGMTTIARFVASLLTLSGRRTGLACSSGLYLDHRKIDHGNCAN
HKSARRILMNRTVEAAVFENGFDTMLEEGLPYDSCQVGIVTRIEPALHFA
HQWIDTSEQVFKVLRTQVDVVSPTGGKREGDILPEIILPTGAAILNAQDP
MLVQMADLCQGEVIYISCDPTLPVISVHRQEGTGPTRGKRAVTVRNWEIL
LIDGTAETILIRLDELQAKTGNHPVETAHLLAGVGAGWALGMSPDLIRTG
LITFLQHQEKQSIVYQTF
>NE1731 cspD2, Cold-shock DNA-binding domain
MATGTVKWFNDSKGFGFITPDDGSEDLFAHFSAINMTGFKTLKEGQKVSF
EVTQGPKGKQASNIQSA
>NE1312 cspD2, Cold-shock DNA-binding domain
MTTGTVKWFNDAKGFGFITPDDGSEDLFAHFSAINMNGFKTLREGQKVSF
DVTQGQKGKQASNIQAP
>NE2410 cstA, Carbon starvation protein CstA
MNQLSKWVGWSLFSLLGAAAFGVIALNRGESISAIWIVIAAVCVYLIAFR
FYSQFIANRVLKLDTTRMTPAYKHNDGLDHVPTNKYVLFGHHFAAIAGAG
PLVGPILAAQMGYLPSMLWLLAGVVFAGAVQDFIVLFISMRRNARSLGDL
IKSELGHVPGMIALFGAFFIMLILLAVLSLIVVKALAESPWGTFTVAATI
PIAMFMGIYSRYFRPCAIAEISLIGFTLLILSIIAGQYVQEDPAWAAVFS
ATGEQLTWVLIGYGFIASVLPVWLLLAPRDYLSTFLKIGTIVGLAIGVII
LSPELKMPALTQFIDGSGPVWSGDLFPFLFVTVACGAVSGFHSLIASGTT
PKMIQNETDARFIGYGAMLMESFVAIMALVAAAALEPGIYFAMNSPAAII
GTTVESAVQTISQWGFHITPEMITQTTQNVGEHSIISRTGGAPTLAVGMA
QILSGVIGDTAMMAFWYHFAILFEALFILTAVDAGTRAGRFMLQDLLGSF
IPAMKRTDSLTASLIATGICVSGWGYFLYQGVIDPLGGINTLWPLFGIAN
QMLAGIALILCTCVLFRMKLDRFAWITVIPAVWVCICTLTAAWQKIFHDN
PRIGFLAHAEKYQDAVIQDIVLAPARSIGQMQQVIFNDYINVTLTALFMA
VLISILMFGIRTILQARIILHPTTREAPFELLPAYASIEK
>NE1010 ctaB, UbiA prenyltransferase
MTTSSLAWQQATARVQQFYRLTKPRVVSLIVFTAVIGMFLSVPGAVPLDK
LIFGTVGISLVAGAAAALNCLVEYKFDAIMARTKGRPLPQGKVSVPETLF
FLVLIGGFGLFMLHQWVNPLTMWLTLGTFVGYAIIYTVILKPLTPQNIVI
GGASGAMPPVLGWAAVTGEISADALLLFLIIFAWTPPHFWALALYRKTDY
AKIGMPMLPVTHGDEFTRLHVLLYTIILCVVTVLPYLTQMSGLIYLGSVL
ILDAIFFYYAIRIYLHYTDQIAREAFRYSIAYLALLFTALLVDHYFYF
>NE1015 ctaG, putative cytochrome C oxidase assembly transmembrane protein
MKQDVAKTNVDILKKLLVFSVVMFGFGYALVPLYKKFCEVTGIYELERPD
TLTKHTEVDNTRSISLLLDANVRGLPWKFKSLQANIHLHPGKLTEVMYEI
SNESEEVQVGQAIPSYSPKNLERYLKKIECFCFSQQELQGKEVKQLPVRF
LIDPEIPGDIHTATISYTFFNVKN
>NE1330 cti, cis/trans isomerase
MLSACQDSKPLLFDNNPTPTPADLPAPAGHQISFAKEVQPIIETKCLSCH
SCFDAPCQLKLESAESLLRGASQEPVYSSARTTEMKPTRLGIDELTVAGW
RKRGFYSVLQSDREHAQSLLKNMITLGKQYPFPPNSKLPDSIKTGFARKN
QCVSEEEFPGYAHDHPFEGMPFGTSGLTDREYSLLAGWLNQGAAVTDEPV
TLTRDEEQTIRTWETLFNRDDKRGRLVARWLYEHLFLAYLYFPEAGDEPR
FFELLRSSTPSGEAIIPVATVSPNSDPGGPFFYRLRPISGTIVHKRRISY
PLDQMKLRRISELFFSEDWPVGDLPGYDYTERSNPFVTFAAIPARARYQF
MLDEAEYFVRTFIHGPVCRGQIATDVIRDHFWTLFQSPESDLFITADTYR
RQAIPLLGIPGQDDDLLDAGENWLRYLKRHNDYLALRQQHYVAQQPQGAS
LAHIWNGDGHNENALLTIFRHHNSASVVRGLAGAVPQTIWLMDYPLLEQT
YYQLVVNFNVFGNVAHQVLTRLYFDLIRNGSEQNFVRLLPAGQRKTILND
WYQDLGKLKFGIVYEDIDDRSPSAERFVTENPKLELASHMLERFQSINTL
SHDPLNRCEQGNCSRIDQPHWIQQADRALSGIAAQPAASLPGISLLPEVT
FVRVQHGQNERTVYTLLRDRAHSNVAFMLGEELRYQPEKDRVTVYPGITG
SYPNFMFDVPAAQVGEFVAKLSKAGKIKDFEQIVETWGIRRTHPQFWEIL
HDITAWQKQQQPLQAGIFDINRYENF
>NE2388 cutA, CutA1 divalent ion tolerance protein
MTGSSQILLVLTNFPNDTSARELAEMLVDRRLAACINILQGCTSVYRWQG
LTETASEVPVLIKTTRQRYEAVEQAIKSLHPYELPEIIAVPVDNGLSAYL
QWIAHETTETDT
>NE0698 cvpA, Colicin V production protein
MTVFDYIVIGIISFSALLSITRGLVHEIVSLLAWIIAFFAASRYSINVAP
LLAGMVENESIRMLVAFSATFFIVLLITMLASKLLSALVRGVGLGLIDRM
LGALFGMIRGLVIVLFLITAAGFTPLPQQPFWKQAVLSEPLEVMTADIIP
WLPQDFRNLIGFDRNS
>NE0273 cyc, Cytochrome c, class IC:Cytochrome c, class I
MKFIGTTVAAIALTFSGFVFAESPAAGDVEKGKEIAAGICAGCHNADGNS
AIPLYPILAGQYPGYIAKQLNDFKVVEGETVKRDNQIMAPMVATLSQDDI
ENLAAYYSQQKPQPGTASDASLVETGKKLYQGGNLENSIPACSSCHSPNG
QGIPPHYPRIDGQHPAYTLSQLQAFRQGTRKNDTNNTMQTIVSRMSEQEM
QAVSEYIATLR
>NE2337 cycA1, Cytochrome c-554 precursor
MKIMIACGLVAAALFTLTSGQSLAADAPFEGRKKCSSCHKAQAQSWKDTA
HAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDCVGCHVDGFGQKGGYTI
ESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQ
DFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVK
AMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK
>NE2042 cycA2, Cytochrome c-554 precursor
MKIMIACGLVAAALFTLTSGQSLAADAPFEGRKKCSSCHKAQAQSWKDTA
HAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDCVGCHVDGFGQKGGYTI
ESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQ
DFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVK
AMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK
>NE0960 cycA3, Cytochrome c-554 precursor
MKIMIACGLVAAALFTLTSGQSLAADAPFEGRKKCSSCHKAQAQSWKDTA
HAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDCVGCHVDGFGQKGGYTI
ESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQ
DFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVK
AMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK
>NE0771 cycH, TPR repeat
MTAFWVVAGIFIVVALLFVIPVLLRSKRNESQEQIERQAANITIYRDQLA
ELERDLRNDTLSREQYDSSKQELQKRMLQDVSENGESVTHLVPTSRHGVV
AGIIVTLVIPLAAIYLYLVIGDTRGLLPQSQLANATQFSQNGAGGEEGHI
DISSMVESLAARLRENPEDIEGWVMLGRSYAIMERFDDASATYAKLVQMV
PDNPQFLSDYADMLAMINNGSLLGKPAEMITRALAIDPNFPKALALAGTL
EFEQDKFDQAVAYWERLLSAIPADSRLHKSVSDSIVQAKSLAMRSKGESA
PVQLAQNSNVGTDATAGSSSAEKQEISAGVPSISGSVTLDPSLADKVSPD
DTLFVFARASQGPKMPLAILRLNARDIPVSFKLDDNMAMTPAMKLSSFPE
VVVGARISKTGQAIPASGDLEGHSDPVKIGDGEVSITIDHVVP
>NE2336 cycX1, putative tetraheme c-cytochrome
MTRLQKGSIGTLLTGALLGIVLVAVVFGGEAALSTEEFCTSCHSMTYPQA
ELKQSTHYGALGVNPGCKDCHIPQGIENFHLAVATHAIDGARELYLELVN
DYSTLEKFNERRLEMAHDARMNLKKWDSITCRTCHKKPAPPGESAQAEHK
KMETEGATCIDCHQNLVHEEVPMTDLNASIAQGKLVLKPEDDGDDEEADE
DEDEETEEADDSSDSESASSSDNSDNEDDNNDE
>NE0959 cycX3, putative tetraheme c-cytochrome
MTRLQKGSIGTLLTGALLGIVLVAVVFGGEAALSTEEFCTSCHSMTYPQA
ELKQSTHYGALGVNPGCKDCHIPQGIENFHLAVATHAIDGARELYLELVN
DYSTLEKFNERRLEMAHDARMNLKKWDSITCRTCHKKPAPPGESAQAEHK
KMETEGATCIDCHQNLVHEEVPMTDLNASIAQGKLVLKPEDDGDDEEADE
DEDEETEEADDSSDSESASSSDNSDNEDDNNDE
>NE0011 cyp, cytochrome P460 precursor
MNFRKQLTGGLSSLILSAVMSGSLLAAGVAEFNDKGELLLPKNYREWVMV
GTQVTPNELNDGKAPFTEIRTVYVDPESYAHWKKTGEFRDGTVTVKELVS
VGDRKGPGSGNGYFMGDYIGLEASVKDSQRFANEPGNWAFYIFYVPDTPL
VAAAKNLPTAECAACHKENAKTDMVFTQFYPVLRAAKATGESGVVAPK
>NE0576 cysA, cysA; sulfate transport ATP-binding ABC transporter protein
MTIEIHDLSKQFGSFTALNDINLKVNPGELLALLGPSGSGKTTLLRVIAG
LETADSGQVLFNEEDSTDKHIRDRHVGFVFQHYALFRNMTIFENVAFGLR
VRPRKQRPNAPEINHRVTELLQLVQLDWLADRYPHQLSGGQRQRIALARA
LAVEPSVLLLDEPFGALDAKVRKELRAWLRKLHDDMHITSVFVTHDQEEA
LEVADRIVVMNRGRIEQIGTPDEVYEKPANPFVYEFLGHVNLFHGRVHQG
HAWIGDLEVDAPEYSEAEDLSAIAYVRPHDIEVDRTLNGEPALAAHIVHI
LAIGPVVRLELAGKDNQSTNSIYAEISKERFRELQLARGDQVFIKPRKLD
LFPNHAQNGSIH
>NE0854 cysB, Bacterial regulatory protein, LysR family
MKLQQLKYLHEIAKQKLNLSKAANALHTSQPGISRQIQLLEQELGVEIFT
RHGKRIIGITQPGLAILQTAQRMLQDADNLKKIGNEFNDKESGTFIIATT
HTQARYVLPPVIKRFIEHYPKIRLSLRQGSPIEIATWVSSGEADLAIATE
AIELFDKLVMLPCYEWNRCIVVPPRHPLLQLKELTLESIAQYAIVTYDFA
FTGRSKINQAFESKGLKPNVVLTALDADVIKTYVEIGLGVGILAKMAFDP
VRDKKLRAIDAAHLFEPSTTRIGICRNSYIRHYVYDFIEMFAAHLDMETV
NKLLKNPENQQLSIANDK
>NE0856 cysD, Phosphoadenosine phosphosulfate reductase
MNQRSYFTHLDALESEAIHIMREVAAECTNPVLLFSGGKDSVVLLRLAEK
AFRPSRFPFPLMHIDTGHNFPEVIEFRDYRAKELGERLIVRSMEDSIQRG
RVVLRSGNQSRNPFQSITLLDAIEEFGFDACIGGARRDEEKARAKERIFS
FRDEFGQWDPKNQRPELWSLYNTRTHPGENIRVFPISNWTELDIWEYIGR
EQLEIPSIYFAHEREVIVRDNGLLPISYLVQPKAGEHVQNMMVRFRTVGD
MSCTCPVASHAKNVEEIIAETALTRITERGATRMDDQTSEASMELRKREG
YF
>NE0532 cysG, cysG_2; Uroporphyrinogen-III methylase
MRSLHHSGNIFNGSNDSLLVKFRCLYMDYLPVFLNIKQRDCLVVGGGEIA
VRKIRLLLRAHARIHVVSPAISEELSNLLLQSPVITHTAESFRPDHLQDR
ALAIAATNDHEVNRAVSAAARKAGIPVNVVDNPDLCTFIMPSILDRSPII
VAVSSGGTSPILARLLRSRLEALIPSAYGRLAEYAARFRDKVRQRFIHQE
NRRFFWERMLQGPFAEMVFAGRDQAAQDYLSEALENSTDQFPTGEVYLVG
AGPGDPDLLTFRAMRLMQQADVVIYDRLVSPAILDMVRQDATRIYVGKVR
NQHTLPQTSINELLVKLAQEGKHVLRLKGGDPFIFGRGGEEIETLSQHHI
PFQVVPGITAASGVASYAGIPLTHRDHAQSCVFVTGHLKDNTIQLDWPAL
ARPNQTIVVYMGLLGVTELCRQLIAHGLQATTPAAIVQQGTTPNQRVLTG
TLETLPDIIQQNPLKPPTLIIVGNVVKLHQKLAWFNSTSEPMGTSSGPGY
P
>NE0855 cysH, probable cysH; 5' adenylylsulfate aps reductase protein
MGEQALQQKVEQLHSLLGNIQRDYSPAVFASSFGAEDMVLTDLIARHYPE
IGIFTLDTGRLPEETYDLMQKVKERYGVSIHAYFPDASAVEHYVAQHGPN
GFYDSVALRKRCCYIRKVEPLKRALADKKAWITGMRRDQSTTREELELSA
YDSDHGLHKFNPLCDWTEKDVWNYIRQHDVPYNALHDRFYPSIGCAPCTR
AITPGEDIRSGRWWWENPESRECGLHTRKIEA
>NE1443 cysM, Pyridoxal-5'-phosphate-dependent enzymes, beta family
MHNTIENFIGHTPLVRLKRIPSETSNTILAKLEGNNPAGSVKDRPAYSMI
MRAQQRGDIRPGDTLIEATSGNTGIALAMAAAILGYRMILVMPENLSLER
RQVMSAYGAEIVLTPKEGSMELARDTAEQLCNAGKGIILDQFSNPDNPLA
HYETTAPEIWQDTAGTVTHFVSSMGTTGTIMGCARYFREQEQKVEIIGVQ
PQENAQIPGIRKWPEAYLPKIYQPELVDRILLVSQQEAEEMARRLAREEG
LFTGISSGGALAAALKVSAQVQNAVIVFIVCDRGDRYLSTGVFPS
>NE0857 cysN, GTP-binding elongation factor:Elongation factor Tu domain 2
MSVVENVGLEHAELLRFITAGSVDDGKSTLIGRLLHDSKSIFEDQLNAIA
KTSNKRGMEKVDLSLLTDGLQAEREQGITIDVAYRYFTTPKRKFIIADTP
GHEQYTRNMVTGASTANLAIILIDARKGVLTQSRRHAYLASLVGIPHLVV
AINKMDLVGYSEDVFNQIRTEFSAFLQGLGSRSVEYIPMSALVGDMVVER
GDNLGWYSGQPLLDYLETIDIKQDVNTHDFRFPVQWVSRPQTEAMHDFRG
YMGRIESGSVHVGDTVTVLPSGLTSRVKEILFYNGTLETAFAPQSVTLTI
EDHLDISRGDMLVNGGEPPRVTREFDAMLCWLSEQSFDPGRKYIVKHTTH
MIKGLIARIDYRVDINTMNHEVVDALTMNDIARIGIKVQQPLVIDAYTQN
RGTGCFILIDEVSNNTVAAGMIC
>NE0043 cysS, Cysteinyl-tRNA synthetase
MLKIYDTLTRSRREFIPLTPGEVRMYVCGMTVYDYCHLGHARVLVVFDSV
VRWLQTLGYKVIYVRNITDVDDKIIRRALENHEPFSALTARYIQAMEEDA
MALGVISPSFEPRATEYVDSMIAMIESLLNRELAYVASNGDVFYDVRRFP
GYGKLSGKSLDDLRAGERVEIDTNKRDPLDFVLWKAAKPDEPSWDSPWGK
GRPGWHIECSAMSEHYLGDQFDIHGGGQDLQFPHHENEIAQSEGVHGHSH
VNYWMHNGFVRVDNEKMSKSLGNFFTVREVLTRYQPEVVRFFIVRAHYRS
PLNYSDAHLNDARSALERLYTVLKNHAPSQDSEVATAVIDWENNVYARRF
MSAMNDDFNTPEAVAVLFDLASEANRTGDSHYASLLKALGGVLGLLQQPP
QQYLQYPAHLQDGQYSVEEIENMIQQRLQARKERNFAQADTLRQQLAEAG
IILEDSPQGTTWRRE
>NE0578 cysU, cysU; sulfate transport ABC transporter protein
MSTFKQYSVLPGFNLALGFTLLYLSLVVLIPLSAAFIHSAKLTWPEFWST
VTAPRVVASYRLTFGASFAAAVVNTFFGLLVAWVLVRYPFPGKRLVDALI
DLPFALPTSVAGITLTAIYAGNGWLGQYLEPLGIKVAFTPVGVFVALTFI
GLPFVVRTVQPVLEDIEKELEEAAAMLGATRWQTFRYVIFPAVLPALTTG
FALAFARAIGEYGSVIFIAGNIPMVSEITPLLIITKLEQYDYAGATAIAV
VMLVISFILLLIINLLQWWVRHRSIKA
>NE0577 cysW, cysW; sulfate transport ABC transporter protein
MTAVPLLQTSPAPHKVVTQESPWARWTLILLALSFLSLFLLLPLVAVFFE
ALRKGWEVYLAAITEPDALAAIRLTLIAAAIAVPLNLIFGIAAAWAIAKF
EFRGKSILTSLIDLPFSVSPVVAGLIYVLIFGLQGWIGPWLREHDLSIIF
AVPGIVLATIFVTVPFIARELIPLMQAQGSEEEEAAIILGASGWQTLWYV
TLPNIKWGLLYGTILCNARAMGEFGAVSVVSGHIRGLTNTLSLHVEILYN
EYNFVAAFAVASLLALLALITLVLKTLVEAKVAQQTSRGNHS
>NE0102 cyt_c552, Cytochrome c-552 precursor
MMKTAWLGTFAASALLVAGYAQADADLAKKNNCIACHQVETKVVGPALKD
IAAKYADKDDAATYLAGKIKGGSSGVWGQIPMPPNVNVSDADAKALADWI
LTLK
>NE1638 czcA, Acriflavin resistance protein:Heavy metal efflux pump CzcA
MFERMLAAVIRHRGLVMLAVLAMAALGLYSYQKLPIDAVPDITNVQVQIN
TAAPGYSPMEVEQRITFPVETIMAGLPGLDYTRSLSRYGLSQVTVVFREG
TDIYFARQLVSQRIQEARGQLPAGIEPEMGPISTGLGEIFMWIVESIPDA
RKPDGTAYTSTDLREIQDWIIRPQLRMVEGVTEVNTIGGFVKQYHVTPYP
EKLVAYGLTLQEIVAALERNNLNLGAGYIEKRGEQYLVRLPGQVSNLDEI
GEITLGSRQGTPLRIKDVADILIGKELRTGAATRNGEETVLGTALMLIGE
NSRTVAHAVAGKLTEIGRSLPAGVVVVPVYDRTTLIDGTIHTVSSNLIEG
AALVIVVLFLFLGNFRAAVIAALVIPLSMLFTFSGMVVNHVSANLMSLGA
LDFGIIVDGVIVIIENSIRCFAQEQARLGRALSREERLKLVFTATQQVRR
ALIYGQMIIMIVYLPIFALSGVEGKMFHPMAFTVVVALLGALILSVTFVP
AAVALFLTGEVADEESRIVVWCKRFYQPMLVVVMRNQGLTITVAAVVMGL
SVLLATRLGSEFVPSLNEGDIALHAIRIPGTSLSTAIAMQGELEKVIQSF
PEVEQVFSKIGTAEVATDPMPPSVADIFVTLRPQSEWSEQYRTKDELIVA
MEKAVLQVPGNNYEFTQPIEMRFNELISGVRADVAIKVFGDDLDKLLALG
EDIEALVATVPGAADVKVEQVTGLPVLSIRADRANISRYGLNVADVQDAV
AIAVGGKSAGLIFEGDRRFELQVRLPESLRADIEALRYLPLGLPVQPGTQ
GGLLAGQLPLAGSTAFVTLGEVADFRIMQGPNQISRENGKRRVVVTANVR
GRDLGSFVREVQTLVNEKIRIPPGYWLVWGGQFEQMISAAERLQIVIPVA
LVLIFMMLYTVFGNFRDGLLVFTGVPFALTGGVLALWLRDIPLSISAGVG
FIALSGIAVLNGLVMLTFIRELRQEGRALVDAVQIGALTRLRPVLMTALT
DAVGFIPMALATGLGAEVQRPLATVVIGGILSSTILTLLVLPVLYYVFYK
RQEEDGKSYQEKTEMLPDANIP
>NE1639 czcB, HlyD family secretion protein
MNVTGRLPIVAVILAGILLGTWLLIGGSGLPDIVERFSSAEPETESGSAT
GSRGGELFTKDDLSLELIIHEEGVFPHFRIYPYWQKQSLSPAEMKVTVAL
SRLGRPAQLFHFRPESDYLISDQEVEEPHSFELVVAAEYQGKAYRWNHSQ
VEARVEMSDTMMQSSDIELATAGPAVIRSEITLPGEIIFNEHNIVHVVPR
LPGMVVSIKRHHGQRVKKGEVLAVMESAMLADLHSQYLLARRRQTLAQTI
YDREEQLWREKITARQDYLTARQQREEASITTQLAAERLLALGVQPKSDL
SGKNLARYEIRSPISGIVIDEAIVTGEVVKEDKTAFIIADISTIWAAVRV
FPGNLHQVHVGQRAVIRANAYDLTREGEVTYVTTLLDEQTRTAVARVQLD
NQDERWRPGMFVKADLQTEAAEVPVAVSLEAIQTIGDQSVVFGRYGDYFE
MRPLKLGRSDNRMAEVIEGLFAGERYAVSNSFIIKSELGKAGAAHDH
>NE1640 czcC, Outer membrane efflux protein
MTSLSIIMMEYQVERRPGSFHQGGAIFFLAFVLTGMMSIQNYGVALANAA
APTADVQAGGISEKGDLTLRQVLQLVLQNNPELSAFSREIAAHEGTKLQA
GLFNNPEFSIEAEDINSSNSAIQKFATFRISQLIELGGKRPARVNVATLG
QELADQAHAAKRLEIIARTASAFVDVLENQAQVSVMDDTLHLVQVAMETV
VKRVEAGKAPPMEAIRSKVALSTASIELEQARRSLSAARTKLALLWGEAE
PRFDRVLGELESFVEIPEFDQLVKRLEENPVVRQSLKNVAQREAMVELEK
ARKIPDITVDAGIRRYLGTDDTTAVVGMSIPIPIFNRNQGNELEARQRLN
KAMDERMSVELQLRTEFVRNYESLLAARNEIRVLHDAVLPGAQNAFEITN
RGYQLGKFSFLEMLDAQRTFFQNRILYVRALANYQRLVNTIEQLIAAPLA
DSATDSTRLNNQGKEHNQ
>NE1485 dac, D-alanyl-D-alanine carboxypeptidase 1 (S11) family
MTRVIIKLLQILMKQFIFSLFLLATISPAWSQQPELAVAAKSFILIDFHS
SQTLASSNPHERLDPASLTKLMTAYVVFSALQQERIKIDQVVPVSNRSWR
MVGSRMFIEPNKQVTVDELIHGVIVQSGNDACVALAELIAGSEDLFVHMM
NEEAKRLGMHNTHFTNSTGLTHSDHYSTAHDLALLAAAIIRDFPERYPLY
SLREYTYNGITQQNRNRLLWTDPNVDGMKTGWTEAAGYCLVTSAKRDQRR
LISVVMGTASPNARSIESQRLLNYGFQFFDTAHPYKKDQPVANVQIWKGA
QNKLKVGFDRDIYFSLPKGKVEGLKARMEYRQPLIAPIDRGREVGMVKFI
LDGQEIATYPLVALETVDTASFLGRSWDNLKMLFN
>NE1417 dadX, Alanine racemase
MSRPIRAFINCAALRHNLAVVRRHVHQARIMAVVKADAYGHGLLRVARAL
DAVDGFAVLELEAAIQLREAGFSQLILLLEGFFSIEEIEAINHYRLSTVI
HCHEQLSMLLAHKKTGKPDIFLKINTGMNRLGFRPEEGNSVLNRLRQWHT
DISITLMTHFACADDLLEADHVDQQLGSFARLEEKREGCIPRTLANSAAI
LRYPGTHADWVRPGIILYGASPLPDKTGIELGLQPVMTLTSRIIAVQHLD
FSDRLGYGGQFVADQPMRVGVVAAGYADGYPRHAPTGTPVLVNGRRTRLI
GRVSMDMLTVDLSGINEAGAGSLVTLWGEGLPVEEVARSAQTISYELLAA
LSPRVQTVSSIP
>NE2403 dapA, Dihydrodipicolinate synthetase
MFTGSLVAIVTPMLEDGALDLDRFCALIDFHIEQKTDGIVVVGTTGESPT
VDFDEHHLLIRTAVTHAAGRIPVIAGTGANSTREAIELTVFSKNAGADAC
LSVAPYYNKPTQEGLYQHFKAIAEAVDIPMILYNVPGRTVVDISNDTALR
LAQIPGIVGIKDATGNIARGCDLLQRVPDNFAVYSGDDATALALLLLGGH
GTISVTANVAPRLMHEMCTAAFAGDLARAREINTRLFRLHIDLFVEANPI
PVKWAVARMGLINDSLRLPLTALSSQYHELIRKAMLQAGITV
>NE2463 dapC, Aminotransferases class-I
MNPSLENLQLYPFQKLTKLFEDLVPRDSLPTPIGLHIGEPRHSTPEFIRQ
ELINSLEGLAHYPTVLGTDRLRASIATWLTQRYHLSSIDPDTEVIPVNGS
REALFSFAQAVIDTDHKAIQPAVVCQNPFYQIYEGAALLAGATPYFLNQL
PENNFSADTAQLPDTVWERTQLVYICSPNNPTGKVLTLDEWKHLFDLSDR
YGFVIAADECYSEIYFDEGHPPLGALEAAEKLGRSGFPRLVVFSSLSKRS
NVPGMRSGFVAGDATILRKFLLYRTYHGSAMNPAVQAASTVAWSDEQHVA
ENRKLYYEKFTGAMHILGDTLPVSMPDAGFYLWLRTPVSGIEFSRRLYRE
GNVTVLPGSYLAREAHGMDPGEDFVRIALVAPVVECMEAMERIRDIARKF
>NE2462 dapD, Bacterial transferase hexapeptide repeat
MEQLQAVIENAFERRAEITPRNVEANLKESVAQVINMLDTGKLRVAEKIN
GEWVVHQWIKKAVLLSFRMEDNSFIKGGFSNYFDKIPSKFADYSSRDFRD
GGFRVVPPAAVRKGSFIASNVVLMPSYVNIGAYVDEGTMVDTWATVGSCA
QIGKNVHLSGGVGIGGVLEPVQASPTIIEDNCFIGARSEIVEGVVVGENS
VISMGVYIGQSTKIYNRETGEITYGRIPPGSVVVSGNLPAENGRYSLYCA
VIVKQVDAKTRSKTGINELLRGI
>NE0108 dapE, Peptidase family M20/M25/M40
MSNSTLTLAQMLIARRSLTPDDDGCQKMIMHRLAGLGFKSDSMTFGEVEN
LWTRKGSDAPLVCFAGHTDVVPTGPVTQWDSDPFTPVVRDGFLYGRGAAD
MKTSLAAFVTAIEEFIELHPDHKGSIALLITSDEEGPAVDGTVKVVEALQ
TRGEMIDYCIVGEPTCTNQLGDTIKNGRRGSLSGNLTVRGIQGHIAYPHL
ARNPIHTAAPAIAELAQTVWDNGNEYFPATTWHISNIHGGTGATNVIPGE
INLLFNFRFSTASTVDSLKARVHEILDRHGLDYELIWELSGKPYLTPRGT
LADAVSAAIREVTGIEPELSTTGGTSDGRFIADICQQVVEFGPRNATIHK
INESVEVADVERLARIYRLTLENLLL
>NE1612 dapF, Diaminopimelate epimerase
MKLKFTKMHGLGNDFIVIDAINQSVSLDPATIRRWADRHFGIGFDQLLVV
EKPGESGDFRYRIFNADGGEVEQCGNGARCFARFVRDHDLTRKNTIRVET
ACGIIMPTVEENGEVSVDMGIPRFDPARIPFITQERALTYPLNVNDREIE
ISAVSMGNPHAVQIVPDIDLAPVTSEGPAIESHPFFPEKVNAGYMQIVDR
THIRLRVFERGTGETLACGTGACAAVVSGISRGLLDSEVQVTMRGGNLRI
RWEGEDQPVWMTGPAVSVFEGTIDL
>NE0994 ddlB, D-alanine--D-alanine ligase
MNTRDVGKVAVLLGGRSAEREISLRSGQAVLAALQRSRVNAHAFDPAGQP
LENLLQQGFDRVFIALHGRYGEDGSVQGALELMELPYTGSGILASALAMD
KWRTKMIWQAAGINTPDYVMLDASSRFRDVADRLGLPLIIKPAREGSTLG
LNKVDNEQDFRSAYQAAAEYDSLVLAEQFIQGIELTAAILDDMPLPLVRI
DVAEGLYDYQAKYFSESTRYTCPSGLSAALTTRIQEQALYAHRILGCTGW
SRVDLILDENEQPFFLETNTSPGMTDHSLVPMAAKAAGISFDELVVQILE
LSCEH
>NE1758 dedA, DedA family
MFLVDFIIHIDSHLQELVSEYGVWVNGILFLIVFCETGLVVFPFLPGDSL
LFAAGSLASLQGSQLDPHFLFVGLTLAGILGDSVNYWVGKKFGITIFTSG
KFRFLKQEHLDKTHAFYLKYGGKTIIIARFIPIIRTFAPFVAGIGTMPYR
KFIAYNVIGAVLWVGIFVYAGFYFGQLPLIQKNFKLVILAIIILSITPPL
IEYLRHRFGKNRPGVS
>NE1970 def, Formylmethionine deformylase
MIEPLPRILVSELCKFVMAILNILRYPDERLHKIATEVPSITREIRTLVS
NMAETMYAAPGIGLAATQVDVHQRIIVIDVSETRDELLVLINPEIIASSG
NAETQEGCLSVPGIFDKVTRAEEVTVRATGIDGKSFEMDASGLLAVCIQH
EMDHLMGKVFVEYLSPFKQSRILSKLKKQARRQIA
>NE1463 dfp, Flavoprotein
MILVDALAGKRLLLGITGGIAAYKAAELVRLLMQEGVEVQVVMTESACRF
VGTATLQGLTGRQVFTELWDTGMLNGMAHINLSREVDALLIAPASADFIA
KIAHGLADDLLSALCLARDCPLLIAPAMNVQMWENAATRRNLATLRQDGV
TVLGPGSGYQACGEIGEGRMLEPEDLLDKVRCFFQPKYLSGKRVLVTAGP
TFEALDAVRGLTNLSSGKTGIAIAQAALEAGAQVTLVCGPVCPPFPAVDK
LTHIVSASEMFSAVKAEVEGHDIFISVAAVADYRPAECHPSKLKKTVADI
TLTLVPNPDILQYVANLPNPPFCVGFAAETEAIEQYAAEKRKRKKLPLIV
ANNAVETIGSNESSLILLDDEGTHYLPKANKNIQARLVMAHIAHLYDKRK
GYTHHNETS
>NE1170 dfrA, putative dihydroflavonol-4-reductase
MPASRGCSLVTGGGGFIGTHLVRLLLEQGERVRVLELDDVPVLDGAEVIR
GSVADEAVVHRAVKGVRRVYHLAAHTDLWAPDKRIFRQINYESTRTVLRE
AMYADVEVVVHTSTEAILTGRGDPDQSDRTLSSSSRKRKTLGPYCQAKLM
AEQAALEASRNGLSVVVVSPTLPVGPGDRHITPPTRMIVDFLNRKIPAYL
DCRLNLVDVRDVAEGHILAAQRGRSGERYLLGHENIMLSRLLLMLKEITG
VVMPEHKIPYWLALVVGILQEFVADHVTHKAPMAPLTGVRLAGRGNDFDS
DKAVRELGFRQTPIRQALIDEIIWLVETGHINLPVHSFRKHFGSTEKK
>NE0567 dfrA,tmp, Dihydrofolate reductase
MTIPSLSPPRLAILAAVSANRVIGLNNTLPWHLPADLKHFKQLTTGQIVV
MGRRTFDSIGRPLPDRTNVVLTRQRHFNQPGILTAGSIQEVLEHFCGDDR
QIFIIGGAEIYQQTLPFCQRLYLTEIQQDFAGDTFFPEYDRDNWREISRE
MHQATDSGIEYHFVVLDRKQPVNAGACSTDLS
>NE1982 dgt, HD domain:Metal dependent phosphohydrolase HD domain
MRSNLAPYAVSDTNSRGRRIHEELPAGRSQFQRDRDRVIHSTAFRRLEYK
TQVFVNHEGDLFRTRLTHSLEVAQIGRSIARNLNLNEELVEAITLSHDLG
HTPFGHAGQDALNDCMKTYGGFEHNLQSLRVVDVLEERYATFNGLNLCFE
TREGILKRVPKSKAAALDELGTRFLHGHSASLEAQLANLADEIAYNNHDV
DDGLRSGLITLAQLEEIEIFARHLHQVKQHYPDITGRRLIHETVRGMINT
LVVDLTVQSGTRIRDASPDSPDSVREKPVLIGFSDTIKRQQQELKRFLHK
NLYKHYQVMRMSNKARHTIEKLFTTFETEPALLPYEYQQKFQEYGHQAIA
DYIAGMTDRYAIREYQRLFAITEN
>NE0001 dnaA, dnaA; chromosomal replication initiator protein
MQKIETFWHFCLKHFRQELNGQQFNTWIKPLKLEVCPDEKNTLILIAPNR
FVLQWIKDNFVTRIDEMAQDHFNERISFRLELREPAESEAQTVRTSAQKN
REDKKPAAEKTQGVTSRKTNPSQLNASFTFDAFVTGKANQLARAGAIQVA
ERPGIAYNPLFIYGGVGLGKTHLMQAIGNYVLELDAGAKIRYVHAEKYVS
DVVSAYQHKSFDKFKLYYHSLDLLLVDDVQFFSGKNRTQEEFFYAFNALI
EAHKQVIITSDCYPKEISGLEERLVSRFGWGLTVAIEPPELEMRVAILLK
KALAEKIELDENTAFFIAKYIRSNVRELEGALKRVLAFSRFTGHSISLDL
AKEALKDLLAIQNRQISIENIQKTVADYYKIKVADMYSKKRVRTIVRPRQ
VAMAIAKELTQLSLPDIGEAFGGRDHTTVLHAHRKIIELRTSDPGINRDF
NALMHILRG
>NE0194 dnaB, dnaB; replicative DNA helicase protein
MNQLTSSFIQNLAEDNIYKLPPHSIEAEQSVLGGLMLDNQAWDKVADIII
ESDFYRQDHQLIYQHISRLIEQNKPADVITVAESLENAAQLQHAGGLAYI
GAIAQNTPSAANIRRYAEIVRERSIMRKLAQVSTQITDSAYNPAGRSAGD
LLDEAESRIFEIAEQSAHGKQGFVDIQPLLKQVVERIEVLYNRSNPSDIT
GIPSGFNDLDQKTSGFQPGDLIIVAGRPSMGKTAFALNIGEHVALETSKP
VAVFSMEMGGVQLAMRMLGSIGRLDQHKMRTGQLNDDDWPRLTHALGKLN
DAPIFIDESAGLNSLELRARARRLYRQHEGLGLIIIDYLQLMSATSPGSE
NRAAEISEISRSLKALAKELQVPVIALSQLNRGLEQRPNKRPIMSDLRES
GAIEQDADVILFIYRDEVYNPDTPDKGIAEIIIGKQRNGPIGKVDLTFLG
EFTRFENCARTADYY
>NE1978 dnaE1, dnaE1; DNA polymerase III (alpha chain) protein
MPIDPVFIHLRLHSEYSVVDGIVRVEEAVAKARDVGMPALALTDLSNLFG
LVKFYQCAFKAGIKPIAGCDVWVTNENDADRPFRLLLLCQSFSGYLLLSR
LLSRAYRENMCRGRAELKKSWFREEDAGTEGLIALSGGGQGEVEQLLLAD
PPAAVTAAQQWADLFPGRFYLEIQRCGRPNEETSGYALLDLASSLKLPVV
ATHPVQFMRPEDFRAHEARVCIAQGYVLGDRRRPKEFTGQQYFKTPAEMG
ELFRDVPEALANSVEIARRCSLMLELGVNRLPDFPTPAGISVEQHLRELA
QTGLEARLLQSFPQVLQRDERRPIYQMRLDFEVETIIQMGFAGYFLIVAD
FIGWAKQHDVPVGPGRGSGAGSLVAYSLGITDLDPLLYDLLFERFLNPER
VSMPDFDIDFCQDRREQVIEYVRDRYGAESVAQIATFGTMAAKAVVRDVG
RVLDLPYNFVDQLAKLVPFELGMTLRKAREIEPLLNQRAEEEEDVRNLLE
LAERLEGLTRNVGMHAGGVLIAPGKITDFCPVYCADSGDAVVSQYDKDDV
EKVGLVKFDFLGLRTLTILDRAVADIRQYRAASPGSAVAEPDVQSAEESH
FSLESISLEDAATFSLMAKGNTVGIFQFESRGMKDLLQRARPDRFEDLIA
LVALYRPGPMDLIPDFIERKHGKRVDYLDPRLQPILGPTYGIMIYQEQVM
QIAQVIGGYSLGGADLLRRAMGKKKVEEMAQQRAVFVEGAIRNEMAEADA
VTLFGLMEKFAGYGFNKSHAAAYALIAYQTAYLKTHYPAEFMAACMSSDM
DDTDKVNVFYEDCKLNGIVILPPDINESGYYFVPVDHKTIRYGLGAVKGS
GEAAISAIVQVREQGSTFTGLFDFCRRVDRRIVNRRTIEALIRAGAFDSV
ETNRAALLESVGNAMEYAEQCSLAASQVSLFDENTDLIQPPAITGVAQWP
EREKLQNEKMALGFYLSGHPYDSYARELSCFIPVRLSRIVPGREPQLIAG
VIYAIRTQMSRRGKMAIVTLDDGLARVEVVVYSDLLSTGSHFMKADQLLV
VRALVSHGNGENADRRIVAKEIYDYVTARSMHARKLRIMIDDSGLLTPAQ
LKELLAANLPENGVNNVIPSSGCAVSIDFRNQVGSCEIDLSSRWRVHLHE
GLIESLMDILGRDKVEVVY
>NE1948 dnaJ, DnaJ molecular chaperone
MSQSDYYEVLGVGRDADENELKKAYRKLAMKYHPDRNAGDTKAEERFKNI
KEAYEILSDPNKRAAYDQFGHAGLNGGMGGAGAQGFSDAFSDIFSDLFGM
RGGGRSSVHRGADLRYNLEITLEQAARGAETQIRIPRQEVCDTCHGSGAK
PGTSPKTCPTCNGHGQIRMQQGFFSIQQTCSHCQGSGKVVSDPCGDCHGA
GWVKRQKTLSVRIPAGVDEGDSIRLTGEGEAGANGGQAGDLYIVIHLASH
PVFQREGNHLHCEIPISFTVAALGGEIEVPTLDGHARIKVPAGTQTGKIF
RLRSKGITGVRNQSTGDLLCHVAVETPVDLTARQKELLEEFESISQKDGS
RHHPRAKSWMEKAREFFAE
>NE1949 dnaK, Heat shock protein hsp70, molecular chaperone
MAKIIGIDLGTTNSCVAVMEGNKPKVIENAEGARTTPSIVAYAEDNEILV
GASAKRQAVTNPENTLFAIKRLIGRRFDEEVVQKDISVTPYKIVRADNND
AWIEARGRKIAPPEVSAQVLIKMKKTAEDYLGEPVTEAVITVPAYFNDSQ
RQATKDAGRIAGLEVKRIINEPTAAALAFGLDKKEGDRKIAVYDLGGGTF
DISIIEIAEVEGEHQFEVLATNGDTFLGGEDFDSRVIEYLVDEFRKESGI
DLKKDMLALQRLKDAAEKAKIELSSSQQTEVNLPYITADASGPKHLAVKI
TRAKLESLVEELIERTAGPCRTALKDAGLSVSDINDVILVGGQTRMPKVQ
EKVKEIFGKEPRKDVNPDEAVAIGAAIQGGVLKGDVKDVLLLDVTPLSLG
IETLGGVMTKLIQKNTTIPTKAQQVFSTADDNQTAVTIHVLQGEREVASG
NKSLGQFNLTDIPSAPRGMPQIEVIFDIDANGILHVSAKDKATGKENKIK
IQASSGLSEDEIQKMVKDAEAHAEEDKKALDLVNSRNQCDAMIHSVRKSL
AEYGDKLEGDEKSRIEAALKEAEEALKSGDKQTIDAKTQALTEASHKLAE
KMYAQEQAQAGQQAGAGTASDQSQDKPVEGEVVDAEFEEVKDKK
>NE0002 dnaN, DNA polymerase III, beta chain
MKLTITDRDLLFKPLQTVSGIVERRHTLPILSNTLIEIRNGQLTLVTTDL
EIEAEATSNIPELENQGALQTTVSVRKLQDILRALPSGAAIELTRSENRL
QIVSGKSRFSLQLLPAEDFPRMIRDSEPCSATYTLAQRVLKKHLQRVAHA
MAQQDLRYYLNGMLLLIEDNKLTLVATDTHRLGITSIDLDGNFEKSETIV
PRKTVLELIRQLEDSDKPVIVEIYPKKVCFRFSDAVLVSKVISGKFLDFR
RAIPQTSVFQFDVNRLDFLHALQRTAIISSSNDLFRNVHLNITNGKLNIS
AKNKEQEEAQEEIDIVYSNETIDTSFNIVYLMEVLNNLDSEQIRCSFESM
QSAILITLPDDEQFKHVLMPMRE
>NE0141 dnaQ, probable DNA polymerase III (epsilon chain) protein
MRYVFLDTETTGLDPALGHRIVEIAAVEVCNRRLTDRHFHRYLNPGRESD
EGALRVHGLTREFLRDKPVFQDVCSEFLEFIADAEIFIHNAPFDVGFINR
ELDLIRFESMQNHCLQIIDTLVLAKELHPGKRNNLDALCERYQIDNSHRT
LHGALLDAELLAEVYLAMTRGQESLLMEMDAPASRQADNPAVGKVENLAL
IVQPATQAELELHSRLVERINAESKGNCLWNG
>NE0433 dnaX, dnaX; DNA polymerase III (subunits tau and gamma) protein
MTDSQVLARKWRPKDFSELVGQEHVVRALINSMEQNRLHHAYLFTGTRGV
GKTTVARILAKALNCEQGVTAAPCGKCAACMAIDQGNFIDLIELDAASNT
QVDAMRELLDNAQYAPVAARYKVYLIDEVHMLSRSAFNAMLKTLEEPPEH
VKFILATTDPQKIPVTVLSRCLQFNLKQIPPSLIVERLTEILSMEGIPAD
AAGLRLLAQAAKGSLRDALSLLDQAIAFGNSVVNESDTRAMLGVLDQDHI
FALLEALAEQNGAAIFAIADQLEAASVSFDQALQDLAALLHRLATAQVIP
QMLDETQPDGDRLLALTKRFSPEDIQLFYQIVLHGRTDLAHAPDEYSGFT
MTLMRMLAFMPDSRQPGRAYADTGTDHAREVKVEAPSCPREAKPVSDQSP
NEAWLALVNQLKLSGMTRMLAQYSEAKSFSESRIELYVAEMHKHLLEKSY
QDRLRSQLEIHFGKPVEVIFSQGSITGVTSAALQDRDKLARQSKAVEAIE
SDPYVQELIEQFDARLNVSSIKPID
>NE2480 dppF, ATPase component ABC-type dipeptide/oligopeptide/nickel transport system
MTALLEVTDLRVLLHTGRQPVRAVDGLSLAIHPGETFALLGESGSGKSIT
ALSIMRLLPDAGEIVHGSVRLNGDELLTLPESAMRKVRGNRIGMIFQEPM
LSLNPVMTTGAQIGEVLLQHSGLRGAALQIRILELMRQVGIPDPARRMAE
YPFQFSGGMKQRVMIAMALAGKPELLIADEPTTALDVTIQAQVLDLMRGL
QQQENMAVLLITHDLGVVAEMAHRVAVMYAGQIIETADRERFFQSPAHPY
SHKLFAALPTRKKRDQGLIVIPGNVPALSKVFTGCRFADRCDRAWEKCHQ
IIPPWVETAPQHHVRCHLYSDDSTERSPQSRLQSLSRTARSALDDLPLSS
HTSDSTQSKPLLRVDDLKIHFPVHKGLFKRVAGHVKAVDGVSLQIDGGRT
LALVGESGCGKTTVGKGIMQLIPVTSGSVRLQDKELRDLDRKQLLQKRSA
FQIIFQDPYSSLNPRMRIVEIIEEGIRALGRNSDKIAASEKNQHDVDTLL
MQVGLPAEAKWRYPHEFSGGQRQRIAIARALAVDPQLLICDEPTSALDVS
VQAQILNLLKTLQQEHKLAYLLITHNIAVVDYLADEVAVMYLGRIVESGR
TEEVLDNPKHPYTQALLSAVPTYEPGSQREIIRLQGEPPSPANVPPGCHF
HPRCPHVMPICREVYPAVSRFSASHTTYCHLYHSVSQEPQDLQ
>NE0366 dsbA, Thioredoxin:DSBA oxidoreductase
MTVRKMICFLLSFIVVNLAGVLIAHAEIVEGRDYTVLSAPQPTEGDSEHI
EVIEFFWYGCPHCSDLHPHLSRWLENKPADVAFRFVPAILRNNWVPGAKT
FYAMESLGLTQTLHDKVYHAIHREKTDLSKEATLFDWIGKQGVDRDKFIG
AYNSFTVQNQANRSAQMIRQYKLTGVPALVVDGRYLTSGKAGGTPQDTIS
VLNQLIEKVREEKKSR
>NE1175 dsbB, Disulfide bond formation protein DsbB
MRIIFLLIALICAGLVSYALYLQLADGLLPCPLCIFQRMAYWLVGITALF
AFIHHPQRLGRRIYCGLIILFSLAGAIVAGRQAWLVRFPEAFECGISPEE
AFLNELPLARWWPDMFEANGDCTDGTWQFLSLTIPDWSLLIFLAFSLIAG
LLWRSRSISSSNLK
>NE1511 dsbC, hypothetical protein
MHLFLRSLILPGLFFAGIVLADSADLKETIQAHFPESKIESVTQTPYLGL
YEVVIDGEVFYTDKKADYFFMGHVVDAKRRVSLTSERMQQIRDARRIAID
TLPLEHAMKTVKGNGKRMLIVYSDPNCPYCKRLEKELVNVTDVTIHTLLY
PILNGSMTTAEAIWCSEDRVKAWDDFMLRSVAPAGKDCQTPLQKLLESGR
ENRVTGTPTLIFADGSVVPGFIPQQEIEKRLDQAASK
>NE2389 dsbD, Thioredoxin:Cytochrome c biogenesis protein transmembrane region
MRWIKTILLLLCLVSTHTMAEEGSSGGLMSILQRLGVGSEQTEQELLPPD
EAFKLSLEVRDDRTLIARLTPAKDYYLYRDKINFESKSAGIHVDQVALPP
GKMKQDPAFGQTEVYYQPLEAIISLRRDASAPEQLSLSATYQGCNEPIGV
CYAPINKVNDILLPTVKAAIDTVAGTISGDVQAAGATGTDATAELFQTDN
APLFETESYKIERLFASGNFWLILSGFFGIGLLLAFTPCVFPMFPILSGI
IASKGQQMSKLRGFMLALAYVLGMAITYAIAGAAAGLSGTMLSAALQNAW
VLGTFALVFVALAFSMFGFYELRMPSFIQNKLVEETGHFKGGQLTGVFGM
GALSALIVSPCVAAPLAGALIYISQTRDVVLGGSALFIMALGMGMPLLLL
GVSAGALLPRSGAWMKAVQQFFGVLLLAVAIWLVSPVITEVVHMLLWAAL
LIISAIYLHALDPLPGHASGFNKFLKGIGVIALLVGVALLIGVLSGSRDI
LQPLSKLSLAAATPSGQLAEPSVNTNESLPFKRVKTTAELDELIRQSQGR
YVMIDFYADWCISCKEMERFTFTDPQVQARLKNVELVQINVTDGTPDDAA
LLKRFKLFGPPGILFFDRQGVEIPNIKIIGYLDKRDFITVLDAILL
>NE1462 dut, dUTPase
MKQVDIKLLDPRLQDVFPGYATPGSAGLDLRACIDERMEIHPGETLLIPS
GIAIHLADPGFAAMVLPRSGLGHKHGIVLGNLVGLIDSDYQGQILVSCWN
RGQAGFTLDPMERIAQLVIVPVVQAGFNVVENFQPSQRGEQGFGSTGKC
>NE1712 dxr, 1-deoxy-D-xylulose 5-phosphate reductoisomerase
MTKTRHLTILGSTGSIGESTLDVVARHPGRYQVVALTADRNVEKMFEQCI
QFHPPYAVMLDAQSAEQLEDRLHAAGLDTRVLSGIESLEKVASLPEIDTV
MAAIVGAAGIRPTLAAARTGKHILLANKETLVMAGRVFMDTLRQHHATLL
PIDSEHNAIFQSLPQHFNGDLAGSGVRRILLTASGGPFRTVDLKILETVT
PEQACAHPNWVMGRKISVDSATMMNKGLEVIEAHWLFNAVPEKIQVVIHP
QSVIHSMVEYIDGSVLAQLGNPDMRTPIAHALSYPERMESGVQSLDMFKV
ARLDFESPDFKRFPCLRLAYEALAAGGNMPAVLNAANEVAVEVFLAGRIP
FTAIPVMIEDVMKSTERRDVPDLEGVLLADLQARATAREWLACNIRQSTG
QPGKPASSLSAGQ
>NE1161 dxs, Transketolase
MYPLLDRIEIPAQLRTLKRNQLPQLADELRNFLVESVAGTGGHLSSNLGT
VELTIALHYVFDTPFDRLIWDVGHQTYAHKILTGRRTGMARLRMQGGIAG
FPRRDESEYDAFGTAHSSTSISAALGMAVAARLKGVKQHAIAVIGDGAMS
AGMAFEALNNAGVMDANLLVILNDNDMSISPPVGALNNYLAKLMSGRFYA
TARRAGEKMLGVVPPVLELAKRAEEHVKGMVTPSTLFEEFGFNYIGPIDG
HDLDILLTTLNNIKQLDGPQFLHVVTRKGKGYKQAEEDPILYHGVGKFQP
DQGIVSKPSAKLAYTQIFGDWLCDMAAKDSRLIGITPAMREGSGLVRFSK
EYPDRYFDVGIAEQHAVTFAAGAACEGLKPVVAIYSTFLQRAYDQLIHDV
AIQNLPVVFAIDRAGLVGADGPTHAGSFDLSYLRCIPNITVMTPADENEC
RQMLYTAFQLDTPAAVRYPRGSGPGVQIQQEMQTIPLGKGEIRRQGKQIA
LLAFGSMLTPCLEAGDELDATVVNMRFVKPLDQELVATLAAEHELLVTIE
ENTIMGGAGSAVMESLSSLDKNVRLLQLGLPDSFIDQGDPAHMLSDCGLD
KAGIIQSIKERFSL
>NE0897 efp, Elongation factor P (EF-P)
MKTAQELRVGNVFMLGKDPMVVLKTEFTKSGRNSSVVKMKYKNLLTESPG
EAVYKADDKFDIVVLDKKEVNYSYFASPMYVFMDAEFNQYEVEEETMSDA
LSFLEDGMPCEVVFYNDKPISVELPNTVVREIIYTEPAIKGDTTGKVLKP
AKIPTGFELAVPLFCEIGDKIEIDTRTREYRSRVK
>NE1044 eno, Enolase
MSAIVDVIAREIIDSRGNPTIEADVLLESGVLGRASVPSGASVGTREAVE
LRDGDAQRYYGKGVLKAVESVNGEISETIMGLDAMEQCFIDKTLIELDGS
ENKSRLGANAILAVSLAVAKAAAEESGLPLYRYLGGINAKWLPVPMMNLV
NGGVHANNRLDMQEFMIIPLGLPNLREAVRCGAEVFSTLRTLLNKRNLPT
TVGDEGGFAPSFARNEEALALIVQAIDEAGYQPGSEVAIGVDCASSEFFR
EGKYHLDVDGLGLTSAQFVDYLATWVEKYPIISIEDGMSEQDWEGWGLLT
ERLGKTVQLVGDDVFVTNTRILKEGISRNIANSILIKINQIGTLTETLNA
IEMAKCAGYTAIVSHRSGETEDTTIADIAVATNALQIKTGSLSRSDRLAK
YNQLLRIEEDLGEMAQYAGRSAFYQLKP
>NE2280 epsB, Chain length determinant protein
MPSDSSRNLMEPVTSQANDEIDLGELLDVLADNKKLILVITFITCLIGAA
ISFLSRPIYKADSLLQIKENSQSMVQQLDSLSNFFETKTPVQTEIELIKS
RMILGKTIRNLHLDIVAMPKFFPLIGNAVARWLQRFNGSDVLSSPPFGLS
HYAWGGEAIQVDTLEVPESWKNEEIILQAINESQFKLIYQDEVFLEGEVG
KLAVKQPEGYLQPVKIFVTYLQSRPDTMFTLTRRSKGEAIRRLKEALSIT
EIGKGTGILELAIESDDRKEAVRIVNEIANTYLQQNVEQKSAESQKTLEF
LEKQLPALKEQLETATSALNDYRIRKGSINLDLETQNILTGTVELNTQVT
LLQQKRDELRQKFTESHPNVIAIDKQISRLQAQINAFDKKIGGLPETQQI
ILRLSRDVEVNAELYTTLLNHAQTIRVAKAGTVGNVRIIDYAMLPDLPIK
PRKVLIMGVALMAGLMMGIIITFIRKSLSYGIKDPGIIEKSLGIPVYATV
QYSKYQKLIEKKLKSEPISGNHPPLLLALENKEDLAMESLRSLRTTLHFA
FLEAHNNIVMITGPSPGIGKTFVSANLAVTMADAGKKVLLIDGDLRRGNI
HKHMRLSRENGLSELISRSIDLNDAIKSIPLAAIDFIPTGKVPPNPSELL
LHERFGQLLETVSNQYDLVIIDSPPILAATDAAIIGRLASVTLMAVRAGT
HPLRELEQSVKKLVQAGVNLKGVIFNGIPETLSRHRYGQYVYQYDYRSKK
R
>NE2278 epsP, Low molecular weight phosphotyrosine protein phosphatase
MAEAVLKHTLVQAGKTDHFVSSAGLGALIDYEADPTACRLMAEKGLDIST
HRARQLTDDMIRRADLILVMETWQKAAIEARTPSAKGKVFRLGEWEKIDI
SDPYRKDSSEFIRSLILIEQGAAQWATKL
>NE2323 era, Type 2 KH domain
MNAPGYKTGYISIVGRPNVGKSTLLNHLIKQKISITSRKAQTTRHRIHGI
LTDAQSQFIFVDTPGFQTRHRSQLNQVMNRVVLQSMQDVDVVVFVVEAGR
FGREDEQVLEQLPRNLPVVLVINKIDLLPDKLQLLPFMQKMADVFEFSAI
VPVSALQNRQLSALIEAIRQHLPGNPFLFAEDEITDRSERFLAAELLREK
VFRQIGEEVPYSVSVVIEQFTVEGNLRRIHACILVERENQKAIIIGKQGK
KLKDMATQARKDMEMLFGSKVYLEVWVKVKSGWADDITALKSLGYE
>NE0370 eryB, NAD binding site:D-amino acid oxidase
MPTTRHYDIVIIGGGIQGAAVAQAAAALGYNVLVLEKTALAAGTSSRSSK
LIHGGLRYLESGQFGLVWESLKERAALLRLAPSLVRLQPFHIPIYDQTSR
DTLTIRAGLSLYALLAGLSKGAGFHSIPRRRWGELDGLSTRGLRHVFQYW
DAQTDDQALTRAIMHSASSLGVELHCPAEFIHARISGEICEIEFLENNQA
YQCTASSLVNAAGPWVSQVAERISPAPPAYPVELVQGTHLILEGSLDKGC
YYLESPQDRRAIFALPWKQHILLGTTETVFEGAPDKALPLVSEENYLLGC
FRYYFPDHGATVRERFTGLRVLPVSRNNPFGRSRETHLQCDSPESPRIVG
IYGGKLTVCRATARKVLHTLRPSLPVRIPKADISQLTLKLP
>NE0353 exbB1, MotA/TolQ/ExbB proton channel family
MMEKALGFSNFLTQIDHVGMCVLVLLLALSVASWYLIVTKSISNTIASRR
ANAFLKHFWNIDTIPQLETEIRTMTGNHAFIQLAKTALGVTTDSQKHGLE
KLAAAGGINELLTRSLRNSLDQEAARIENGLTIIASAGSAAPYIGLFGTV
WGIYHALIQIGLSGQGTLDKVAGPVGEALIMTALGLAVAIPAVLAFNAFT
RRNRLWMAQLDSFAHDLYTILTVGSKVGGENSEKAPLKVVTTTSQRSAPA
AGLANATAGIASGEGH
>NE1171 exbB2, MotA/TolQ/ExbB proton channel family
MFSIIQAAGWPIWPIILASVLAVGIIGERLWSLRRSKVSPRDLLPRVLQE
YRRSGVHTDTLTRLQEHSPLGQVLVAGLKNIDSPREAMKESIEEAGRVAA
HELERYLTTLGTIASLSPLMGLFGTLIGMIEIFGSGTPTGGNPIQLAHGI
SVALYNAAFGILVAIPSIVFYRYFRAKVDELVLEMELQALKLVEVVHGER
RI
>NE0352 exbD1, Biopolymer transport protein ExbD/TolR
MAFGDSSRYQSKTAMSEINVVPLVDVMLVLLIIFMITAPLLTHSVKVDLP
KASSSPNLTQPEHIEFAIRADGSFYWGGEPVTLDQLPSRFAAAVEQSQQT
ELHIRADRDTHYELVAKVMSIAASAGLARIGFVTDPTGEQQP
>NE0869 exbD2, Biopolymer transport protein ExbD/TolR
MNFQRGQKKEEPEINLVPMIDVLLVILIFLVITTTYSKFSELEITLPQAA
TVETDHAEDTSKVIDVSVSATGDYTINLVPIKFASIENLREALQSAAKNQ
ENPIIIISADAKATHQSVITVMEAARLAGYNQVTFTTEMTDNN
>NE2022 exoT, Polysaccharide biosynthesis protein
MTRSTPRHTTVRQALILSSRDLGSRVARGAGFTLLGIVLRTTLTIGSMAI
LARLLTPADFGYLAMATVVTEFAGLLGSFGFANILIQRRVITRLQLDTMF
WATLALGCTIAAVIFALSFLTHWLFGDEATGPLLRVMCLTFIFGSLSTVH
QAILSRLMRFGTEFIIQIGTIGLRSAAAIILAYLGFGVWSLVYGSLAGSI
IGTLLMVSAIRYRPRLRFHRQYLLSTWKTSSSYMGNTVLYYLNMNSDLLL
IGRQFGASALGYYQNARSLTDEVRGRIAMPLQRVLFPAFSSLQADQVRLQ
HSVLRSGRLLAAIICPIGIGLSAVATEIVPVLYGEQWLPMIPILSLLGIS
AALRGSTAIGSSLFNSQNRVPLAFRYNIIYTALLLCSILFAMPYGLNVVA
LAIAANSLFSVFVFRVALGLIGLGTSHLLHILARPFIAALLMWAAIAFLR
NLPILTALHPGMHLGALIACGAISYASVLHLLSRQYLQDFTELAARFTKR
R
>NE1647 fabD, fabD; malonyl CoA-acyl carrier protein transacylase
MKAAFVFPGQGSQSVGMMNSYSELPSVRGTFNEASDILQQDLWSLVSNGP
EDSLNLTTNTQPVMLAADVSIYRAWQQAGGIEPHYLAGHSLGEYAALVAA
EVLTFTDALKLVRYRAKVMQETVPEGTGGMAAIVGLDDDIVSSVCTEVIN
AMPDTSLEPANFNAPGQVVIAGHSQAVARAVVLAKSKGAKLAVILPMSIP
SHCSLMQSAAEKFALLLEEIALQPPRIPILHNADVQQHSETASIREILVR
QLYRPVRWTETIQAIAAQNVKYVVECGPGKVLSGLNRRIDKNLENIALTD
SGSLLKTVETLK
>NE1650 fabF1, Beta-ketoacyl synthase
MSKRKVAITGLGIISPVGNTVSEAWENVIAGKSGITRITRFDASDFSSKI
AGEVNGFDITEYLSAKEARRMDIFIQYGMAAAIQAIRDAGISDVSGFNAD
RIGVNIGSGIGGLPMIENTDAAYHAGGPRKISPFFIPSTIINMVAGNLSI
MFGYKGPNLAIVTACTTATHCIGSSARMIEYGDADIMVCGGTESCVTPLA
VGGFASARALSSNNDDPAAASRPWDLKRDGFVLGEGAGILVLEEMEHARK
RGAKIYAELAGFGMSADAHHMTAPCEDGEGAARCMTNALSDAQMHADELH
YINAHGTSTPLGDIAETIAVKRCFGEHAKDLAVSSTKSMTGHLLGAAGGV
EAIFSALAVYHQIAPPTINLDDPDPACDLDYVPNNAREMKIDAALSNSFG
FGGTNGTLVFRKI
>NE1648 fabG1, Short-chain dehydrogenase/reductase (SDR) superfamily
MLLEHKIALVTGASRGIGKAIALELGKHGATVIGTATSETGAGHISQYLS
EARISGMGLIMDVSNIEHIKSGIETIQQSLGDVAILINNAGITRDNLLAR
MKDDEWDNVIQTDLKSVFCLSRAVLRTMMKARSGRIINISSVVGATGNPG
QTNYAAAKAGMIGFSKSLAKEIGSRNITVNCVAPGFIDTDMTRSLSPDQQ
QSLIQHIPLGRFGRPEDVAAAVVFLASPAADYITGATLHVNGGMYME
>NE1646 fabH, fabH; 3-oxoacyl-[acyl-carrier-protein] synthase III
MYSRIIGTGSYLPEKVLTNQDLESMVDTSDEWIRTRTGIERRHIAAEGQM
ASDLALEASRNAIEAADIQAKDIDLIIVATTTPDMIFPSTACILQNKLGM
SNGPAFDVQAVCSGFIYALATADMFVSSGKARNALVVGAEVYSRIMDWND
RSTCVLFGDGAGAVILTRDQKPGILSSHLHADGSYSQVLTAPASIHSGKI
QGTPFITMEGGTVFKFAVKVLEEAALEALQANQLQPSDIDWLIPHQANIR
IITSTAKKLGIPEEKVVATVSQHGNTSAASVPLALDQAVRDGRIQPDQHI
VLEGVGGGFTWGSVLVRW
>NE1708 fabZ, Bacterial thioester dehydrase
MKQGDRQTVMDIHEILKYLPHRYPLILVDRVVSLESGKRIHAYKNVSINE
PYFSGHFPHHPVMPGVLIIEALAQAAALLTIRSENMEKDNGQVYYFVGID
AVRFKKPVIAGDQLVLKVEITRQLKGIWKYAACAEVDGQVVTEAQLMCTA
RAI
>NE1125 fadD, AMP-dependent synthetase and ligase
MSNLFHELIYQSAARYPDSTALIDQKRHLSYAALSEAVQSIASALHTLGL
GRGERSAVYLEKRLETVIALFGASAAGGAFVPVNPLLKAEQVAYILKDCN
VRILITSAERLDLLSPVLPQCHDLHTVIITGDLHKASLPGLNVVSWRQTQ
TLSDAARLPDCIDSDMAAILYTSGSTGKPKGVVLSHRNLVTGAISVSRYL
NNRPDDRILAVLPLSFDYGLNQLNTAFYTGATAILMNYLLPRDILTTVKQ
EQVTGLAAVPPLWAQLAQLDWKDAQSLRYITNSGGAMPRATLAHLRSALP
DTQIFLMYGLTEAFRSTYLPPGEVDKRPDSMGKAIPNAEVMVLREDGSHC
APGEPGELVHRGPLVSLGYWNDADKTAACFRPLTPRQSGLTIPELAVWSG
DTVRMDDEGYLYFIGRRDDMIKTSGYRVSPTEVEEVIYATEKVAEAAAFG
VPHPTLGQAIVVVAVPRTGFALDRDVLQSACKQHLPAFMQPALIELRQTS
LPRNPNGKIDRKMLAGEFQQAFQAGES
>NE2348 fadE1, Acyl-CoA dehydrogenase
MLPAFSSGHRETARVLAQKVREFVDEIIIPNEPQLSRPGAQALQLQSDLA
LEARQAGLYGLFYPLSHGGKIASLEDYLLVAEQEGRTEFSQAIFASHTAL
DAHMLSRFGNAIIRQQFLQPMANGEALPCYGMTEPGQSGSIPGLITTTAH
LSNGRWHVNGRKWFISNADRATFMTVLARTAGKETALDHALSMIIVPADT
PGFRVERQLMMMGHACGQGEVSLADVQVPEHYLIGVCGGGLELMNKRLGL
GRLLRAMNWIGLARRCMDLMGGRVHSTRGKLGLLVEKQLVRQHFFNTYQA
IAGARELVRIAARGVDAQCPDEIAINVAKMAASRALCIASDSAMQLYGAE
GLSDLTPLYGIHRIARTSRILDGNDESLISSVGRRLISHYEHHTVYPFD
>NE1958 fdfT, Squalene and phytoene synthases
MTPTMSNIATLDDVRYQEHILQGVSRTFALTIPQLPPALREVVGNAYLLC
RIVDTVEDDSALTLEQTRELAEMFIEVVSGTVEAETFARALYPLLSEHTI
PAEHDLIKHTPSVIRMTHSFNPVQRQALERCIRIMGQGMAAYQESESLSG
LENLAEMDRYCYHVAGVVGEMLTELFCDYSPEINQHKPALMQLSVSFGQG
LQMTNILKDIWEDRKRGACWLPRDIFLQEGFDLNDLQPGCPERGFHAGLG
TLIGIAKTHLQNALAYTCLIPSHEAGVRRFCLWAIGMAVLTLNKINHKRD
FITGREVKISRRSVRATVLITSLLASQDWALKKIFAISSRNLPEVQIARS
>NE0967 fdx1, 3Fe-4S ferredoxin:4Fe-4S ferredoxin, iron-sulfur binding domain
MALIITDECINCDVCEPECPNRAISQGEEIYEIDSDLCTECVGHYNTPQC
VEVCPVSCIVGDPDRVENREQLMEKYQFLTTGANRVPQG
>NE0006 fdxA, 7Fe ferredoxin:4Fe-4S ferredoxin, iron-sulfur binding domain
MTYVVTESCIKCKYTDCVDVCPVDCFREGPNFLVIDPDECIDCTLCVAEC
PVEAIYAEDDVPEDQRQFIALNAELSKIWDPIIEKKDALPDADEWASVTD
KLDKLER
>NE2435 fecI, Probable fecI Specialized sigma subunits of RNA polymerase
MLDNSIPTDSCLVSGATVSTDIETLYCNHHDWLQVWLRRRLSSAVDASDL
AHDTFVRVLVSGRMPNAEQSRAFLTQIAKGLAVDLYRRRAIEEAYLAALA
QLPENLAPSAEERVLALQVLLLLDTALGSLPSRVREVFLLSRLDGLTYSD
IAQRRGISVATVRKYMLRAAQACHCVLNSPTGTAP
>NE1218 fecR, putative FecR protein
MEQAAEWYALLRSGEATHDDHARWQSWLEHSSDHRLAWQYVETISRSFEP
IHATPDPRQTADGLWAANSRVIQRRRILTSIVALAGTGLFGWATWRHTPL
PGIVLAQMADHHTATGEQREVVLSDGTHVWLNTATALNEAYHASLRRLCL
ITGEILIDTATDPQRPFVVDTPQGRLRALGTRFTVRLESDKTFLGVYQGS
VEITSAAGTSSLIQAGQQTRFTTDASAAVEPADSARETWSRGVLVAWDMT
LGDVVKELRRYRSGYLGIAPEVAGLRVFGSFPLRDTDDTLTMLASALPIR
IQRTLPWWVSIEARTDADPHR
>NE1460 ffh, Signal recognition particle GTPase ffh protein
MFDNLTERLSDVIKTLRGEARLTESNIQDALREVRMALIEADVALPVIKI
FIEQVKQRAIGHEVLDKLSPGQALIGVVHEELVAIMGGDKAELNLNVAPP
AVILMAGLQGAGKTTSSGKLAKWLMDQKKKVLLVSCDVYRPAAIDQLALL
AEQTGADFFPVQTGRQPAEICTAALDFARKHHHDVVIVDTAGRLGIDEAM
MREISQLEQLLKPAETLFVVDAMQGQDAVNTAKAFAEVLPLTGVILTKLD
GDARGGAALSVRHITGKPIKFAGVAEKLNGLEPFYPDRMASRILGMGDVL
GLIEEAQRTSDQKEAERLMKKMKSGKSFDLNDFQQQFQQMKKMGGMSAML
EKLPHQLSQAAQNMNVDEKIINRTEGIINSMTPQERIKPEIIKASRKRRI
AAGSGVSVQEVNRLLSQFEQARKMMKMMNKGGMAKMMRAVKGMLPHMR
>NE0875 fis, probable factor-for-inversion-stimulation transcription regulator protein
MTVINENEIALCIRRAVEAYFQDLDGEKPCPIYEMVIRSVEKPLIEIAMH
YAQGNQSKAAELLGINRNTLRNRLTKHQIR
>NE2125 fiu, iron-regulated outer membrane protein
MLLTIDDVLTQEELAIARSMLARSAWVSGLVTAGTQAAQVKNNQQVQEND
PQIVNLRRLVLGALNRNALFFTATLPEKIVPPFFNRYSGETNHYGFHVDN
AMRLLPDGSGYVRTDVSATLFLSDPQEYDGGELVINDTFGQHGVKLQAGS
MVIYPSSSIHQVTPVTRGERLACFMFIQSMVRNPDQRRLLYEMDMALLQL
RQNIGETPAVVSLTGTYHNLLRQWADS
>NE0079 fkpA, FKBP-type peptidyl-prolyl cis-trans isomerase (PPIase)
MINVKKLLSLALSGWLFLSMSAQATEEQATSHHTNAADVTTLEKIDTQVG
TGEEADIGKTAKVHYTGWLYDAAAEGHKGRKFDSSYDRGSHFSFLLGAGR
VIKGWDQGVMGMKVGGKRTLIIPSSMAYGSQGAGRVIPPNSALVFDVELV
GLE
>NE2082 fleS, putative two-component sensor
MKTEVGVTPVADHEQLRQAFAVFSQASSQLSGIYRELQQQVFRLTEELAL
ANGELQRELAAKEALSQQLGQLLTALPGGVVVLDHQNSISRVNPAAIRLL
GEPLLGMTWQQIVQERLQPTGVAGEWHAGKEDLALFCPRRLYIESSVSEM
TGECILLLHDMTEAYALREQNRRNQRLAAMGEMAAGLAHQLRTPLSTALL
YAGHLSGEALDPQERKRFAVKAIERLHHLEHLIRNMLQFVKGEPVPAGRV
KLSVLLRKMQRVMAPQMQQRNLRFVVQDNSRGVSLNVDRAALSNAVANLL
DNAVQVSPVGSLVTLTCETTEDEARLIVSDEGPGIDPTLRERVFEPFFTT
RTEGTGLGLAIVHNLIQSMRGEIRIDSVPGTGTRFTISLPRTAKEFENVP
DLRSKAETAEK
>NE0301 flgA, putative flagella basal body P-ring formation protein
MKIKFLLIVLVYALTPMAATASEDSSASTILEVVNDFVRNETRHLPGKVI
IKSNRPDTRQTPRSCRQPQPFLPTGGRVWGKFSVGVRCQDEATWTLYVPV
EIEVITRVVHAAQPVSMGKQLEAQDIVSKEADLVRIPGEVATDPDQVIGK
VATTFLASGQPIRTHQLRAPHVISRGQKVRLTATGAGFAVSMEGEALAAA
AAGEVVQVRNHTGRIIRGIARAGGVVEIKQ
>NE0302 flgB, Flagella basal body rod protein
MISKLENLVGFNQRALGLRATRQELLSGNIANADTPGFKARDFDFAKAMQ
NVQSPQRVQHSSLATTAPGHIGMADFRSITSVPKQYRVPQQPSIDGNTVD
MDTERMQFMDNAMRYDAGLTFISGQFRTLLSAIKEGN
>NE0303 flgC,flaW,flaFIII, Flagella basal body rod protein
MSLFSVFNVAGSSMAAQSQRLNVVASNLANADSATSSTGEAYRSRQVVFR
TIDVTADKAKGVKVAGVVNDPSPMRVVYDPKHPMANEKGYVTLPNVNVVD
EMVNMMSASRSYQNSVDMMNVTKTLLQKALTIGQ
>NE0304 flgD,flaV,FlaFIV, putative basal-body rod modification protein FlgD
MSQVQNNSSVSELLASNSGSIKSYRKDEEDPQDRFLKLLVTQMQNQDPLS
PMDNAEVTSQMAQISTVSGIDKLNTTLEKLVADSDARRSFEAATMIGRGV
LVPGTDMQLENQAAIGGFELAESVDKLTVTVKDSAGITVREIDLGAQAAG
VGTFVWDGMANNGNAAADGQYSFALKAIRADKEVSSSALAFGSVSSAQAG
KDGALLEVGRLGYVGMDAVKKIF
>NE0305 flgE, Flagella basal body rod protein
MGFQHGLSGLKAASAKLDVTGNNIANANTVGFKQSQAQFADVYANALGGG
RNQIGIGTQLAAVAQEFNQGNVTPTNNPLDVAITGNGFFRMDNNGSISFT
RNGQFHLDKDGFLVNASNYNVTGFGADASGQIIPNIPVNLRLPTADMTPN
PTTSFMTGVNLDARADVVTDPFNVANPDTYTHSTSGEIFDSFGNSHILNL
YFQKTAASTWGVYTAVDGTVVDPSTGLSGTDSPITLSFNPSGIMTSTGSS
TINVAASVLGAGVSPLAFELDLTRSTQFGSKFGVNVMSQDGFASGRLAGY
SISDDGQIRASYSNNVTRTVGQLVLAGFSNPNGLKPLGSNQWGETAESGL
PLVGTPETGVLGSLQSSAVEESNVDLTTELVTLIMTQRVYQANAKTIETQ
DAVMQTIMNI
>NE0306 flgF, Flagella basal body rod protein
MDRLIYTAMTGALHTMTQQATVSHNLANASTTGFRAQTDAFRAVPVYGES
LPTRAFVLDSTTGADFTPGAIQQTNRPLDVAVQGSGWIAVQLENGEEAYT
RHGNLKMDANGILQTQQGFNIKGESGPITIPPDSRIAIGKDGSVSVLPSD
SRMTAVSIVGRIKLVDPPAEQLVRGEDGLFRLKEGGQAPVSEKVALIDGA
LESSNVNVISEMVKMITLARQFDMQMKMLENAQQNAQQAGEIMTLRG
>NE0307 flgG, Flagella basal body rod protein
MVRSLWISKTGLDAQQTKMDVIANNLANISTNGFKRMRPVFEDLLYQTIR
QPGAQSSERTRLPSGLQLGTGVRPVATENIFLQGNLQQTGNTRDVAIQGE
GFFQILLPDGTLAYTRDGAFQSDQDGQMVTASGFPLQPSITIPPNTLNIT
IGRDGTVSVLTPGSALPVPVGNIQLTTFINPAGLQRFGENLYLETMSSGV
PNPGIPGTNGIGLLNQGFVETSNVNVVEELVEMIQTQRAYEINSRSISTS
DQMLARLTQL
>NE0308 flgH, Flagellar L-ring protein
MGDSGFSVFSPDRAAPEFVDGQCSGVMESFMRNRGHLVLPVLIVMICVVS
GCAITPTPKVQQPMSVRPPDLPVIPPSQTSGGIYQAVFDYQGRGRYIPLF
EDRRARSIGDTIIVALNEKTNASKKTGSNAQRAGEIGLSVPSFLGLPFKS
IFQDLDLEATSSNKFDGKGESSSNNNFTGTIAVTVLDILPNGNLLVSGEK
QIGINQGHEFIRLSGVINPINIINNTVSSIQVADARIEYRGNGYLDEVQT
MGWLSRFFLSISPF
>NE0309 flgI, Flagellar P-ring protein
MTLSKWILSFGLSVCLIVSHPVSAERIKDLANIQGVRANQLIGYGLVVGL
DGTGDQTQQTPFTVQSILSMLGQLGVNLPPGTNLQLRNVASVMVTATLPA
FAKPGQQIDVTVSSMGNAKSLRGGTLLMTPLKGIDNQVYAVAQGSLVIGG
AGASSAGSSVQINHLGAGRISAGAIVERAVPTVLGQGEYINLELRDTDFT
TARRIVDTINSRFSYGTATALDGRVIQLRAPLNNNQRVTFISQIEDLDVI
PAQGIAKVIINARTGSVVMNQMVTLESSAVAHGNLSVIINTQPIVSQPGP
FAQRGETVVVPQSQIEVRSEEGNLMLLPGSASLADVVKALNAIGATPQDL
LAILQALKASGSLRAELEII
>NE0310 flgJ, Mannosyl-glycoproteinendo-beta-N-acetylglucosamidases
MINSADLSGQLAIDAQAVDQLRTRARHDPQEALKQTAKQFEALFLNMMVK
SMREATPKGGLFDSDQSRFYTQMLDQQLVQNLSEKGIGLADVLVQQLSKT
MGEEQSPGNAIDQAESLLTTITGKSGEPLVSLAPGQTKDLPSQLWSNLNR
SSVKPSFSTVSPTMPAGRVGIASSSGGIGSAESVAAHPREFVRDVLPHAR
KVARETGIPEHFMIAQAALETGWGRHQIRRADNQPSFNLFGIKASGNWRG
GVVETVTTEYVDGKPQKLREKFRTYDSYEDAFRDYAKLLQNNPRYAAVLK
SRSATAFAWGLQQAGYATDPSYAEKLLKIINSDALSI
>NE0311 flgK, Flagella basal body rod protein
MANGIMNVGVSGLRAAQVALLTTSHNISNATTPGYNRQQAIQSASTPLAN
SDGFIGRGVQITTVQRIYNQHLVSQSLQAQSQSSQLDSHYNEIKQIDNLL
ADTTSGLTPALHNFFNAVQDVASNPASISSRQAMLSNAEALTARFQQMDQ
RLSEMREGVNSQITGSVTEINSLAQQIAGLNMNIQAVESSANGQPANDLL
DQRDALVRDLSRIINVDVVRQNNGHYNVFIGNGQSLVVGSSAMTLKAVPS
SSDPQRLVVAMDQQGGEIVLPEHQLQGGSLGGLLAFRSETLDSVQNSFDQ
LAWGIADTFNTQHQVGFDLQSVAGSNFFEGTDPSDPSQAARNIRVAISDP
AKIAAADNNDGGVTNNINALKLAGLQTENTLQGGTASYQSAYARLVSQVG
NKTRELEVTSKAQESMLNQTDQALQSLSGVNLEEEAANLLRYQQMFQASS
KVIEIGNSLFDSLLRI
>NE0312 flgL, Flagellin, N-terminus
MRVSTSTLYDQGTRSMLQQQDSLFRLQQHLSTGKRIMTPSDDPIGAARAH
ELAQSLSLNTQYADNRYRAADSLQQVDSTLGSVSNLIQSVRTMAVAVGNS
AFSDSERKMMAVELRGHFDELLGLANTKDEQGNYLFSGFKGNTRPFEQTA
NGVEYKGDQEQRLIQVSSSRQLPVSETGDAIFEPGGLSLFQTIEDFIDEL
ETPGGAALGGAVSTALQDLDIALDNVLAKRAAVGSRLQEVDALQQIGEDT
AIQYQQSLSRLQDLDFAQAISDMTRQQALLEAAQKTFTRVSGLSLFNLI
>NE2488 flhA, Bacterial export FHIPEP family
MNNILSLSTLVRGGQLQSLAGPMLIIMILAMMVLPLPPFVLDVLFTFNIA
LAVIVLLVSLYSRNALEFAVFPTVLLITTLLRLSLNVASTRVVLMHGHTG
PDAAGKVIEAFGHFLVGGNYTVGLVVFIILVVINFVVITKGAGRIAEVSA
RFTLDAIPGKQMAIDADLNAGLIREDEARKRREQVAQEADFYGSMDGASK
FVRGDAIAGILIMLITIIGGLAVGVLQHDLSFGDAAKNYTLLTIGDGLVA
QIPALIISTAAGLVVSRVSTDEDLSQQFVSQLLSKPQVLWLTAGILGSLG
LVPGMPHFVFLLLAGMIGTLAYYVTHRPASPAEKSEEPEVVSSEGQDASW
EDVVQVDTLGLEVGYRLIPLVDKQRQGDLLKRINGIRKKFAQEIGFLPPV
VHIRDNLELRPNGYRITMKGSVIGQGESYNGMHLAINPGQVTMALPGNNT
EDPAFGLPAVWIETEQRELAQTSGYTVVDASTVVTTHLNHLIRMHASELL
GREELQKLLDHLGQRSPKLAEDLIPKLLPLATVQKILQHLLEEGVNIRDM
RTIIDTLTEHAGRTQDISELLALVRIALGRAIVEQCFPSDAEMQVMSLHP
ELETILLQVVQSDPVADGGIEPGLADTLTREASNAVKRLEQIGLPVVLVV
PDPIRLLLSRFLRRSLPQLRVLAHSEIPDTRKIKFISIIGGNKK
>NE2487 flhB, FlhB HrpN YscU SpaS Family
MADDNDLERTEPASPRRLEKAREEGQVARSQELTTFTLLIAASSSLWVIG
SIIIQKLSAVLESGLRMEQEVAFNPALLLPRLFQLALDGLLAIAPLLGWL
VIIALVAPMLLSGWLFSSKALFPDLKRLNPVNGLKRIFSSRGLIELVKAI
AKVMVIGGVAAGIIWSHKQDVLDLVGMPLDTSLISMSRLIGLTFMLIVGA
MLLIVVIDVPFQIWNHARQLRMSREDLRKEAKEDEGDPQVKGRIRNMQRQ
IARRRMMAEVPKADVVVTNPTHYAVALKYQDRSMRAPKVVAKGTQLIAAR
IRELADEHHIPVLEVPPLARALYHHVELDTEIPETLYTAVAHVLAYIFQL
KRYQTTGGIAPQLSSEIAVPEEMDHA
>NE2406 flhC, probable flagellar transcriptional activator transcription regulator protein
MKGKSILSEGKQIQLATELVRLGARLQVLEASTTLSRERLVKLYKEVKGA
SPPKGMLPYSEDWFTGWQPNMHSSLFINIYNYITRYTKVRDIDAIIKSYQ
LYLEHIEANRLQRILSFTRAWTLVRFVESKVLSVTSCVKCTGNFLVHSLD
IQSNHVCGLCHVPSRAGKTKRVAQEARAAQEAGAGELCVI
>NE2407 flhD, probable flagellar transcriptional activator transcription regulator protein
MGTNQILDEIREVNLSYLLLAQQMLREDRIAAMYRLGIDEDIADILVKLT
NSQLLKMAGSNMLLCRFRFDDSLIAEILTSHKQDRALTQSHAAILMAGLP
AEKIS
>NE2489 flhF, possible flhF; flagellar biosynthetic protein FlhF
MKVRKFLAATSHEVLRKVKDELGPDAVILSNKQVPGGIEIMALAGKDISS
LTSDTPPEATSPAVKTTAAPRQKPASEKKQAPPVQKQPAVADIQSANILQ
EIHAMRQLLEEQLITMGWSNFSQRDPGGMKVLRTLLSAGFSPLLSRHLLE
KLPADRDFEQSLKKTIALLTLNLRTTAGDEIIEQGGIYALIGPTGVGKTT
TTAKLAARAVIRHGADKVALLTTDSYRIGGHEQLRIYGKLLGISVRSIKD
IDDLQLMLHELRGKHMVLIDTVGMSQRDQMLAEQITMLSQCGTEVKHLLL
LNATSSGDTLDEVISAYQQHGIHGCIITKVDEAASLGIALDAVIRRKLVL
HYVTNGQKVPEDLHEANSRYLLHRIFKPSAENSPFSLQDPEFAMLMAGNY
ADQSVTRQQTRVEASHDQS
>NE2491 fliA, Sigma-70 factor family
MYTDTGIVDKDQFITEFTPLVKRIAHHMMARLPASVQVDDLIQAGMIGLL
DAINRYEGSYGRQFESYAAQRIRGSILDELREADWLPRSLRRKMRQIETA
MRALEQRLGYPPSEQEIANEMNLPLVEYQEMLQEAGGGQLIYYEDFQEDE
EDHFLDRLRGDQSNEPLDRLLEKNLRELLVAAIEKLPAKEKLVMGMYYEQ
EMNLREIGKVLGVSESRVCQLHTQAISRLRTHLRDK
>NE1595 fliD,flbC,flaV, Flagellar hook-associated protein 2
MISAPGIGSGLDVSGIISKLMEIEQQPLTQLNTKEAKQQAQLSAFGSLKS
VLSTFQDSVKALAKPALFNGYKATLADTTAATVSTSSSASAGTHDIEVQS
LAQAQKIKSEAFATTDTVIGSGTLTIEFGTYNEDGTFTANAEKTAKTITI
DPAKSTLADIRSAINEANAGVTASIVNDGSGNRLVISSKDSGLANALKIS
VNDTDGNHTDNAGLSKLAFDASTGGVSNMTETVAARNAVMVIDGIPVTKS
SNTINDALEGVTFNLLKANPGTTTTLTVEKDKSNVEAAVNAFVKAYNDLE
KTIGNLSRYDAANKQASVLTGDSTMRMIQNRMRAMLGGNQSAAGGINSLS
ELGISFQKDGTLALDNNKLSAVLNNPDKNIAAFFTTQGGDDTTSVEGFAS
RLSELIDGMTRSDGLISSRMDGINSTIKGIGKQREALEFRLESVEKRLRA
QFTALDTMIASMNQTSNYLQQQLANLPKIGE
>NE2080 fliE, Flagellar hook-basal body complex protein FliE
MNVTSGMDQILAQLKATSDLAAGGSKPSATVSTVSQADFGQLLKSAVDQV
NTVQQTASQLSREFVGGNQDVELHDVMISLQKANVSFQSMIQVRNRLVTA
YQEIMNMQV
>NE2083 fliF, fliF; flagellar M-ring transmembrane protein
MATVQEEIQEEKKTFSQLDKIKQLPTQKKLGLMVAAAAVIALIAGTWLWS
QSPDYRVLYASLSDQEGGAIIEALQKMNVPYQFSESGGSILIPASQVHEV
RLRLAGQGLPKGNLSGFEILENQKFGSSQFLEQVNYQRALEGELSRSIQS
LSAVQGARVHLAIAKPSVFTRERPQPRVSVLLNLHPGRMLSTEQVSAIVH
LISSSIPDMPVKNVTVVDQQGNLLSGQKDNQAETGLDPGQLKYVQDMEQN
FVRRIEAILTPITGEDNVHAQVTADVDFSRIERAEEIYKPNNNPAEAAAV
RSQQQSESVSISQPEGGVPGALSNRPPAPAEAPIEAKQSSEAAAKTTPTD
TRRESTTNYEVDKTISHTRQATGRINRLSAAVVINYRTKLDTEGNPANEP
LSGEEIEKITALVKQTIGFDETRGDTLTVTNSQFNLKGDSLEELPPWKDP
DTVLLAKDIGKQLLIAAIVLFFLQKIFRPFLRNLLPPPPPPPVPALAVKN
GLEHSEAEVSTVRVVTLEENLEKARLLAVTEPAIVANVVKGWVSGNER
>NE2084 fliG, Flagellar motor switch protein FliG
MSDEGIFKSAILLMSLGEEEASRVFRHLGPKEVQKLSEAMAALNDIKQET
IESVLEEFCHLANGKTSLGQGALDYLRTVLTKALGEEKAAGLMDRISQGD
DTSGIESLKWMDAASAAELIKNEHPQIIATILVHLERDQSSAILALFTER
LRNDVLLRIATLDSVQPTALRELNGVLSKLLSGTDSIKKPRMGGIRTTAE
ILNFMPTALESNIIENFNQYDEEIAQKIMDEMFVFDDLLHVDDQGIQLLL
REIQSDALITALKGAKEELRIKIFKNMSQRAAETLREDLESKGPVRVSEV
ETEQKKILKTLRQLADDGRIALGGKEGDGFI
>NE2085 fliH, Flagellar assembly protein FliH
MEAAYVIPKKELSSWQKWEFGSLDPLKSRQKTESPEKTPHTARSADQPEN
QIAAGAKTTAHEAIVLPTAEQIEQIYQQAREEGKTAGYQEGMQQAKHAAL
VEVKHLQSLTGALEQELKQIDQTMAQDLLTLAIDLARKITSHALEIKPEL
ILPVVEEALRQLPAVSQSIRLTLHPDDAARIRDHLENHPAHPKWHIYEDT
QIEPGGCRIESGGCEVDATLATRWQRTLAVLGQEQPWLV
>NE2086 fliI, Flagellar ATP synthase
MTIPDGYHDHVDTTGSVHTPLWRDFLQNCQRLIEPANSFLVSGTLTRVAG
LVMEAVGLKLAVGTSCIVFLPNGNSVTAEVVGFSGERLFLMPSGDIYGLT
PGAKVVPTEYTGAVPRIGTLQHPRRRAADRTKHISVGPELLGRVLNGSGD
PLDRYGPLHCGHSVPLYSRPFNPLERAPIEKTLDVGVRAINTLLSVGRGQ
RMGLFAGSGVGKSVLLGMMARYTSADVIVVGLIGERGREVQEFIEHILGP
EGLARSVVVAAPADTSPLMRLHGAAYATAIAEYFRDEGKHVLLIMDSLTR
VAMAQREISLAIGEPPATKGYPPSVFARLPQLVERAGNGRRGSGSITAFY
TVLTENDDPNDPIADNARAILDGHIVLSRRLAEAGHYPAIDIEASISRVM
ASLVTREQLGQAQRFKALYSRYQRSHDLISVGAYASGSDPQLDRAIELYP
VLESFLQQGMFEQESYIESMEKLSALQL
>NE2087 fliJ, Flagellar FliJ protein
MATPHSLKLLLDHARKQTDDAAINLGKLNLKQQEAEKTLQLLVEYRENYQ
SQFMESAGSGISPVEWRNFKAFICKLDTAIQSQQRLVTMTQQHTEAGSTQ
YHAHRQKLKSYDTLSQRAELHHQARLQKQEQRQLDEHTAHNFSKQQKNAD
>NE0465 fliL, putative flagellar fliL transmembrane protein
MSEATLPAGTGNKKKMILFVLIGVLSVGAGIGGTWYYMKQQQEEGSEVTV
EKKKKKKPTTFIKLESFTVNLQSDDREPHYLQVELSLKVNESDAVKIIED
KKPEVRNQILLLLSSKKPSEINTLEGKQKLSEDIIQAVRSKIDSEELEDD
ILDVLFTSFIIQ
>NE0464 fliM, Flagellar motor switch protein FliM
MAENFLTQDEVDALLNFENGSKHQQTAQKENEVSYPGGVRPYNMATQERI
VRGRMPALEIINARFANSLQIGLYNFLRRGVDISAGSIKTIKYSEFTRSL
VVPANINLVSMKPLRGTALFTIDPNLIFLIVDNMFGGDGRFHTRVEGREF
TLTEQGIIRRLLDVLFEHYEKAWQSVYPVKFEYIRAEMNPQFANIVTPTD
IVVASAFELNLGGNRGEFYTCIPYTMLEPIRDLLNNSMQEDRAEVDKNWI
KSLTRQVQGAEIEIIANLGQTQVTLHQILNMQEGDVIALDIPDTVVAIAN
NVPIMDCHYGILNGHYALKVKTVRSPNETD
>NE0463 fliN, possible flagellar motor switch FliN
MNESTVTEPQTEQDDWAAAMAEQQAADAQPTTDDDNGSPPPSSAHLAGDS
LTKPSPVFEQFSRNDVLNDTRNDIDMILDIPVQLTVELGRTRIAIKNLLQ
LAQGSVVELNGMAGEPMDVLVNGCLIAQGEVVVVNEKFGIRLTDIITPSE
RIRKLNR
>NE0461 fliP, Flagella transport protein FliP family
MNDKLPAHPKTSFRHSLPKRSINCGILLLVAGIGVAGPVLAQKTGFPAVT
STPAAGGGATYTLSLQTLLLLTSLTFLPAVILMMSSFTRIIIVLSLLRQA
MGTQSSPPNQVLLGLALFLSFFIMSPVIDKVYTEAYLPFSEDKISIVEAM
EKAGDPLKSFMLRQTREADIALFARIAENGEIESPDQVPMKILVPAYITS
ELKTAFQIGFVVFIPFLIIDMVVASTLMAMGMMMLSPMIISLPFKLMLFV
LVDGWNLLIGSLTQSFYV
>NE0460 fliQ,flaQ, Bacterial export proteins, family 3 (FliQ)
MNPEQAMTIGRQALEVTFTIAAPLLLAALVTGLVISIFQAATQINEMTLS
FIPKLLAIFITLTLAGPWILQIMLDYITRLYTSIPWIISGG
>NE0459 fliR, Bacterial export protein, family 1
MFSITTEQLNSWLSTLIWPLARILALLASTPLLGSSSIPTQTKLGLAVLL
AILIAPLLPPLPQIDPGSGVGLIVLMQQIIIGLAMGFSMRIVFTAVEMAG
EITGLQMGLGFATFFDPQQSGQVQLVGRFYGLLATLTFLAIDGHLQVIAI
LAHSFTTLPIGAEGMSTTSFTALVNWGIKIFMLGLHLSLPVLTALLITNL
ALGILTRAAPQLNIFAVGFPLTLGIGLLVMSWALPYFTPLLNQLFQESFS
VMRSLSSKASVAGIP
>NE1596 fliS, Flagellar protein FliS
MFTESPSRAISSYQRVGVESGVVSADPHKLILMLFEGARQALDCSLLYIQ
QNRIAAKGEMISKAIMIIDHGLKASLDKTAGGELANRLEQLYDYMTARLL
TANLQNDAVIIEEISCLLGELHEAWASIGNQPAGKTDVQTATRNTIEDTA
GTGAMA
>NE1971 fmt, Formyl transferase N-terminus:Methionyl-tRNA formyltransferase
MRIIFAGTPDFAARALEELQKAGLDIVLTLTQPDRPAGRGMKMQASPVKI
LAQQYDIPLLQPETLKSSDIQAQLATFKPDVMIVAAYGLLLPEAVLRIPR
HGCINIHASLLPRWRGAAPIQRALLEGDTETGISIMQMNQGLDTGAVLLK
RSLPIEPYDTTATLHDKLADLGGKCIVEALTLLDQGRLISEPQNEVDACY
AAKIRKIEAEIDWTCDAAYIDRMIRTFDPHPGAFTHLQGNTIKLWQARIV
SHVNHNSSHQAGKIITVDPDGIVVACGRDALSIDILQKAGGKKLTAAQFL
AGHPLHPGESFHKATQDNQGASET
>NE0696 folC, Cytoplasmic peptidoglycan synthetases, C-terminal
MIDIRPATLESWLSYLEKLHPKMIDMGLERVKKVRDALKLKPEFPVITIA
GTNGKGSVCAMLESILGCASYRVGCYTSPHLLRYNERIRIDRHEITDEVL
CEVFAEIESARESVQTTLTYFEFGTLAAMLIFLRSRVDVAILEVGLGGRL
DAVNVFDTDCAILTSIDLDHTDYLGSTREAIGQEKIGIFRADKPAICAEP
DIPGNLREKMRAMNVRLYCIDEAFSYTADHLQWRYQGVSGRHCSLPLPAL
KGDCQLQNASAVLAALDVLEEVFPVPMDAIRRGLVEVTLAGRFQMISARP
VIILDVAHNPAAARRLSANLETLPVKGCTYAVVGMLKDKDMAGTLRALKN
NVDCWLISGLDVARGASANEVLQALEEAGVERRNILHAFPDVQSAYVYAH
EHASDDDRICVFGSFHTVSPVMTGLGESDS
>NE0362 folD, Tetrahydrofolate dehydrogenase/cyclohydrolase
MSATIISGSLIASKFREELKQRVKILSETWMQPGLAVILAGDNPASCVYV
RNKAKTCEELGIRSEIFNFPGDISQKALLQQIQDLNVNPEIHGILVQLPL
PGHIRIDEVIAAIAIGKDVDGFHPCNVGALVTGHALFHPCTPFGVMKMLA
EYDIPLQGQHAVIIGRSNIVGKPMALMLLEKGATVTVCTSRTRDLASHTR
NADIVVMAAGKANLLTSDMIRTGATVIDVGINRLADGRLCGDVEFSGVKE
KAGYITPVPGGVGPMTIVMLMNNTIEAAERAKAVALAGGWHSSVQ
>NE0529 folP, Dihydropteroate synthase
MNRLQRILQQNRPLIMGVINVTPDSFSDGGYFDTTEKAIEQARRLIQEGA
DILDIGGESTRPGSLSVGGDEELLRVMPVIEFALSMDIPVSVDTSKPEVM
RATIDAGVDLVNDINALRAPGALDVVADSAVMICLMHMQGKPETMQHNPQ
YSDVVAEVISFLEQRVAAAVSTGIERERIIIDPGFGFGKTFEHNLMLLRG
LDRLVSTNFPVLAGLSRKSMLGTITGNAVNDRVHASVAAALLAVGQGARI
IRVHDVKATRDAFSVFAAVNRIAPGFLLHL
>NE1087 fpvA, TonB-dependent receptor protein
MIIIKHYFLLPSFQRLWNGRAIARLFLIVMLAVSYLTMISPVSAQQSGVV
RHYHIPAGSLTTVLNDFGREAGILLSFSTELTNSLQSAGLNGNYTAHDGL
SALLAGSGLEAVRTADGSYTLRGSSATVSQPSEGALTLPAMTVMTSGIVD
PTTEDTGSYTTGSTNTSTRLPLTLRETPQSVSVMTRQRMEDQGLTQLSDV
VNQTTGMVFQSGGSSQSDSATFYARGFAVDNYQIDGVPQIYNNYNRIFQT
NDMAIFDRVEIVRGANGLMNSVGTPGASINLIRKRPTDTFRAATRFEGGN
WGYRRAEGDISMPLIPSGKVRGRLVGVLNTSDSYIDREEQKRSLLYGIVE
ADLTPSTLATAGFMFQEQDQTANARGALPAFYSDGTRTTWGRSDTAAANW
AYSKRNGEMVFVTLDHRFNEDWLARISWNRTVTKYDEVLGYALGGYPDKA
TGAGVNLWAGRWKGAPTQNTLDVYTMGSFSLFGRKHDLIAGTLFSFTKDH
TPTYRLWFFDNWSNSIGNIFTWDGNTPTMPPTNDTPIGEWGQDEQVISGY
ATGRFRLLDSLSLIGGARVTHWKFKRFSENYQTGNITRINSNQDPEIIPF
AGITYDFLDNWTVYASYTSIFKPQTVRSENGAFIDPLLGNSYEAGIKAAF
FDNRLNLHGAVYRIQQDNLAVALPNNVLAPDGSTAYRSVSGARTDGFEME
LAGMLTRNWQASVGFAHNVTEDRDKVKLNTQVPRNTFKLFSTYRFPTVAE
GLTIGGGVRWQNRIYRNDQGPARVEISQGGYAVVDLMARLEITKMVALSA
NLYNLFDEKYYQTVPASYYGAPRNVRVALNIRF
>NE1190 fpvA, TonB-dependent receptor protein
MPRLTPLALAAALALPSLVFIVSPAHAQTSTAMPISLPAQSLGAALNELA
WQAGLQLMVHPDLVEDKQAPAVSGSLTPRQALDRVLAGSGLKADIQGTEV
IIRRMPAAESGVTTLVEMKVTAQGMDDGTTEGTGSYTTRSMGTVTGLVLS
PRETPQSVSVITHQRIEDQRLVSVAEALRNAPGVSYKAIDRGRGGTTVRG
FSLTNFQIDGVPTILDANMDIDNASIAIYDRIEVVRGATGLRSGAGDPGA
TINLVRKHANNKVFTGNLMLEAGSWNRYGATADLTAPLNTDGSVRARVVA
NYRDQGGFIDFEQTRTTVFYGIVDADLGNRTRLSLGFSDQRNERKGTYWG
GLPVWYADGTRTEWDRSKTTAAKWHRWDDHLQSVFASLDHYFDGGWSIQV
NAQYLRAKEVTNLLWFGGLPDRTTGLGMSAWPYYYRGKPRQYQINIQAGG
PFELFGRTHELMVGAVHMHGKSGWDSADQIGDPAPVGDFNHWDGSYPEPV
YGPTYVGRGQTETQSAAYAAARLNLSDPLKLIVGTRVTRFDRDITAMWGA
PPYEMRESAVFTPYLGVLYDLSTHLTAYAGYTSIFSPQNLRDRNGGYLDP
LEGQSYEAGLKSEFFDGNLYASASVFRIQQENFGVLDGGTLVPGTTEWAY
RAEKGVKSEGYELEITGRILPRWDVSLSWTQFSAKNREGERVAFRYPSRM
LKLFTKYDLSGALNGLSIGGNVEWQNDMPDWRANPATGLQENVGQSAFAT
VGLMARYQLNKSLSVQANIYNLFDKQYYEGSWGTFTYGEPRRILANLTYR
F
>NE0996 ftsA, Cell division protein FtsA
MSKVKEGKDLIVGLDIGTSKIVAIVAEMKPEGGFEIIGLGSHLSRGLKKG
VVVNIEATVNAIQRALEEVELMAGCRISEVYAGIAGNHIRGFNSHGMVAI
KDKEVTQADVEKVMETAKAVNIPADQQILHILNQEFIIDGQEDVREPVGM
SGIRLEVKVHIVTGAVSAAQNIAKCIHRCGLDVRDLVLQPLASAKAVLSE
DEKDLGVCLVDIGGGTTDIAVFTDGAIRHTAVIPVAGDQITNDIAMALRT
PTKDAEDIKCRYGIALRTLADIREMVEVPDVGNRGARPLSRQTLAEVIEP
RVEELYLLIQAELRRSGFEQLLSSGIVITGGSSSMLGMVELGEEIFHMPV
RLGLPVYNGSLEEVVRTPRYSTAIGLVMVGMEDRLHHHQAKLKSSSTRQI
LAKMKGWFQENF
>NE1414 ftsE, Uncharacterized ABC-type transport system ATPase component/cell division protein
MAAMISFNDVSKCYPGGFEALKGVTLSIDQGELILFAGHSGAGKSTLLKL
IAAIERPTSGSIIVGQQNIGTLRRSAIPYLRRNIGMIFQEQKILYDRNVF
ANVMLPLQVTGFDTRTSASRVRAALDKVGLLDKERADPITLSGGEKQRLC
IARAVVHRPSLLIADEPTANLDSNYARDIMAVFESFNQVGVTVLISTHDR
MLLDSTRHRIIELKQGKLVA
>NE0906 ftsH, ftsH; cell division protein
MNNQSPMDKKAQINFWYVLIAVLSILFIQNLYNQYTRIEPIPYSRFQSLL
EQDKVSEVAITDQQIFGKLKESTSEKFTEFVTTRVESDLAEMLDKHNVTY
TGVVQSTWLRDLLSWIVPMAVFVGIWLFIIRRMNKGMMGSGLMSIGKSRA
KVYVEKETKVTFANVAGVDEAKEELVEIVNFLKNPKEYSRLGGRAPKGIL
LVGPPGTGKTLLARAVAGEAGVPFFSISGSEFVEMFVGVGAARVRDLFEQ
ARQMAPAIIFIDELDSLGRARGAGGFGGHDEKEQTLNQLLAELDGFDPSS
GIVLLAATNRPEILDAALLRAGRFDRQVLVDRPDKKGRQQILGVHIGKIT
LAPDVDTEQIAALTPGFTGADLANLINEAALLATRRGGQAVSMDDFNNAI
ERIVAGLEKKNRLLNPEERRTVAYHELGHTMVALALPGSDEVHKVSIIPR
GIGALGYTIQRPTEDRFLMTRKELENKMAVLLGGRAAERLVFDEISTGAS
DDLARATDIARAMVLRYGMSEAIGNVVYDREQMAFLQPGFPMPQSRDYSE
ETANKIDQTVRSLLDLALERAIKILDKNRELLDRTAQQLLETETLNQPEL
LELKRNLRAEETNPSTQQN
>NE0985 ftsI, Penicillin binding protein transpeptidase domain
MKILFNPSRKSAAVDLPEWRSRLIQGFLLISLIILVARAVYLQALNKDFL
QQQGQSRHLRVIEQNSQRGSIKDRNGEILAISAPVKSVWVDPKRVSATSE
QIGQLADLLDMNKATVQERLSSDKRFVYLKRQLPPERANKIAELNIKGLY
LKHEFYRYYPSRELAAHILGFTDVDGRGQEGVELAWQDVLTGEDGKRRVI
KDRIGRVVEDVGQIRSPKSGQDVVLSIDSKIQYLAYRELARAVKEQHAKA
GSIVALDVKTGEVLAMANYPAFNPNQRASMNNEVIRNRVLVDTFEPGSTL
KPFAIAVALETGRVKANTLMETSGGMMRIGRAVIRDVRDKGNLTVSQVIQ
TSSNVGAAKIALLLPPKTFWEMLNRSGFGTETGIGFPGEASGQLRAYNKW
RPIEQATMSYGHGISVSLMQLVRAYTLFATDGELKPITLLKRDTPAVGQK
VISRETAQSVRKMLELAVQPEGTGSGARINGYRVAGKTGTAHKRLEKRKG
YASDRYISSFVGFAPASDPRVIMAIMIDEPSDGRYYGGTVAAPVFSRVME
GTLRILNVPFDDSLGNLVTSPVPARLEDKG
>NE1051 ftsK, FtsK/SpoIIIE family:AAA ATPase superfamily
MSFADKPMAKKVASNPLPPRTSRLLREAVSLVLSGIALYLALILISFDRT
DPGWSHSGTLRQVSNAGGSAGAWLADLMLYFFGISAWWWVVFFFATVWWG
YRRIDIASVFDPRVLMLSFTGFITLLVASSGIEALRFHTLRISLPLAPGG
LLGEILSKQLSSLLGFTGATLAMVIIFAIGFSLFSGLSWVRLSEKIGGGI
EEICFAVRDICMGWLNRRNSTLPSSEREVRIEEIGKQPFPLVPLHIEMPE
TTPPRSTRSNREKQTPQFSNSPDGIIPPLHLLDEPQNNVEMLSSDTLEFT
SRLIERKLQEFGVEVKVVAAYPGPVITRYEIEPAVGVKGNQIVNLVRDLA
RALTVASIRVVETIPGKTVMGLEIPNPNRQTVRLHEILASGVYANHPSPL
TIALGKDISGRPVVSDLAKMPHALVAGTTGSGKSVAINAIILSLVYKASP
DNVRLILIDPKMLELSVYDGIPHLLTPVVTDMRDAASALNWCVAEMERRY
KLMSALGVRNLAGYNQKVREAVKNEEPLTNPLNPVPGSPELLEEMPLIVV
VIDELADLMMIVGKKVEKLIARLAQKARAAGIHLLLATQRPSVDVITGLI
KANIPTRIAFQVSSKIDSRTILDQMGAEALLGQGDMLYLPPGSGYPQRVH
GAFVADHEVHKVVEYLKQHGEAHYIEEILQAGEEGALSDENGGESGKPAG
GESDPLYDEAVSIVIKSRRASISLVQRQLRIGYNRAARLIEEMERAGLVS
SMQSNGNREVLTPDRNE
>NE0984 ftsL, putative cell division ftsL transmembrane protein
MIKLNIFLFMVLVICGLGIVTARYESRKLFMEQEEAQQLTEQLETEWNQL
RLEQTTLAMPARVEKIARKELGMTMPPPAAGNILAIHPDHSSGATE
>NE0995 ftsQ, putative cell division transmembrane protein
MWNDHQSLNFLANILLTGVLLATIYVVGTRILALPFFSLREVRVEAMDKN
RTGNVSLVHITRDQIEQVVRNSANGNFIMIDLKTLQNAFMELPWVRSVKI
LREWPPALNILLEEHKPLAYWEETALVNTNGEIFHAIMDNVRLPVFAGPD
NSSRLITQQYRIFNKLLQPTGQTAIEIVLTPRHAWHVRLNTGTWLKLGRE
QIEQRLKRYVAVRTHIMKVWIGMEVPLMWICAMPAGLQYAYPDIALNLRA
DKHYDYEIGGIFK
>NE0990 ftsW, Cell cycle proteins
MMSPSTSAPHNQPIQPELDVLLVSTVLLLLGLGLVMVYSASIAIAEAKFG
EGSSYYFLARQASYILAGIAVGIGCFRIPLRWWQAYSHYLLGLGILLLLV
VLIPGISHEINGSRRWIPLGITSFQPSELMKLIILIFTADYVVRKAAFKD
HFFKGFLPILALLTIVSLLLLMEPDLGATVVIAAIVLSIMFMNGMSLKMF
FGLICLVPVLLALLIIIEPYRMDRINAIFDPWNDPFDKGYQLTHALIAFG
LGEWWGVGLGSSVEKLNYLPEAHTDFMFAVLAEELGFAGVVTVISLFFFL
LVRIFKVGRTAARLGDQFGSLVAQGIGVWLGLQAFINMGVNMGLLPTKGL
TLPFMSYGGSSIVINSIAIAILLRIDWENRLKRRGLNA
>NE1413 ftsX, DUF214
MSNWLSQHGYALIRALRQLAGTPMTSLLSIIVFSIVLSLPAGIFILLENL
RALSGHATDSQQMTLILDTAAGSADVELINTRLEEMTVIEGFQFISREVA
LQELQRESGMAEVMRNLEQNPLPDAFVVNLGSLSAGEIEKMQAMMQAWPK
IAHVLVDTDWARKLDAMLDVGRLVVIMLTSVFGTTLVIVMFNTIRLQILT
RRDEIELSKLIGATDSYIRRPFLYFGAIQGLAGAALAWLLLYFTIIKLNE
ALSELARLYATTFTLDPLSWQDSLILILFSGTLGWIGARWSVARHFAQID
HESAIS
>NE0997 ftsZ, Cell division protein FtsZ:Tubulin/FtsZ family
MFEITNTEPLEAVIKVIGIGGCGGNAVDHMIRNEVKGVEFICMNTDAQAL
QGNRAQTLLQLGTSVTRGLGAGANPDIGKEAALEDRDHIAEIVQGADMLF
ITAGMGGGTGTGAAPVVAQIAKEMGILTVAVVSKPFSFEGKRLKAAQAGM
EALAEHVDSLIVIPNDKLMKVLGNDISMLDAFKAANDVLYGAVAGIAEVI
NCPGLVNVDFADVKTVMSEMGMAMMGSAAASGVDRSRMAAEEAVASPLLE
EITLTGARGVLVNITASSAMKMREVQEVMDIVKKMTAEDATVIVGTVIDE
NMGDSLRVTLVATGLGNINQQSQRPMTVIHTRTGTDDRISSHQVDEPAVM
RTGRRSNAAVTAMQQAGMDPMDIPAFLRRQAD
>NE2286 fumC, Fumarate lyase
MDQYREEHDAIGTVQVPASALWGAQTQRSLNNFNISGERMPSALIHALAL
VKRAAASVNHDLGLLDENIARAIITAADEVLAGEHAGEFPLVVWQTGSGT
QTNMNMNEVLANRASEILGGTRGKGRKVHPNDHVNKGQSSNDVFPTAMHI
AAVEAIRNRLIPALEALRKTLSSKSAAFSDIVKIGRTHLQDATPLTLGQE
FSGYVSQLDHGLAHLESALPHLLELALGGTAVGTGLNTHPEFARRVAAEI
ARLSGYPFITAANKFEALAAHDALVHAHGVLKTLAAILIKIANDVRWLAS
GPRCGIGEILIPENEPGSSIMPGKVNPTQSEAVVMLACQVMGNDVAINLG
GAMGNFELNTMKPLIIHNFLQSTRLLADGAESFNTHCAAGITANTVRIKQ
HLQESLMLVTALNPHIGYDKAAEIAKKAHHEMLTLKEAAIRLGYVTAEQF
DVWVDPQKMTEV
>NE0616 fur1, Leucine-rich repeat:Ferric uptake regulator family
MKKSENLKNIDLKTMGLKTTLPRLKILNLFENSLIRHLSAEDVYKELLNG
GEDIGLATVYRVLTQFEQAGLLERHHFESGKAVFELASDNHHDHLVCLQC
GRVEEFYDPEIEKRQARIAKERGFVLQEHSLSLYADCTKENCPHRK
>NE2053 fusA1, Translation elongation and release factors (GTPases)
MAKKTPLERYRNIGIMAHIDAGKTTTSERILFYTGVSHKLGEVHDGAATM
DWMEQEQERGITITSAATTCFWRGMAGNYPEHRINVIDTPGHVDFTIEVE
RSLRVLDGACTVFCAVGGVQPQTETVWRQANKYGVPRLAFVNKMDRSGAN
FMRVREQMISRLKTNPVPIQLPIGAEDGFAGVIDLVKMKAIYWDDASQGT
KFEEREIPASMQADAAIWREKMVESAAEASEELMNKYLETGDLSIEDIKQ
GLRVRTINNEIVPMLCGTAFKNKGVQAMLDAVLDYLPSPLDIPAIKGTNE
NGVEDEREPSEDKPFSALAFKIATDPYVGQLIFFRVYSGTIKSGDTVFNP
VKGKKERIGRLLQMHANQREEIKEVGTGDIAAAVGLKEVTTGDTLCDLDH
IITLERMDFPEPVIHVAVEPKTKIDQEKMGIALNRLAQEDPSFRVRTDEE
SGQTIISGMGELHLEIIVDRMKREFGVEANVGAPQVAYREAIRKQVEIEG
KFVKQSGGRGQYGHVWLRMEPNEAGKGFEFLDEIKGGVVPREYIPAVEKG
LQDSLANGVLAGYPVVDVKVALFDGSYHDVDSNENAFKMAASIAFKDGMR
KANPVLLEPMMAVEVETPSDFMGNVVGDLSSRRGIIQGMDDIPGFKVIRA
EVPLAEMFGYSTILRSATQGRATYSMEFKHYSEAPKNVAEAIINKK
>NE2000 gabD, Aldehyde dehydrogenase family
MQTYTPYSEAMVFSMLEHAHAAHLAWRQVPLHERTALLLKLADVLRENSE
SYARLMTEETGKTIRAARAEIEKCAWACEIYADKAAAWLAEEEIAADGIK
HRVVIEPLGVILAVMPWNFPFWQVMRFLIPALLAGNGALLKHATNVTGSA
LKIQEAVNEAGFPKGLFVTLLVSHTVVEQVIAHPLCQGVSLTGSAEAGRA
VASVAGRHLKKVVLELGGSDPFIVLGDADIPAAAKAAVIGRFQNNGQSCI
AAKRLIVLKEIEEVFTSTLLAEVEKLVVGDPLDEATDIGPLVSEQAAETM
EQFVLDAVAKGAMVRTGGTRKGAYFTPTVLTGVHSAMEVMTQEVFGPVLP
VITADTVEEAITLANATRFGLGASVWSRDLEKGEQVARQLAAGATFVNSI
TKSDPRMPFGGIRESGLGRELSYWGVREFANIKTVNVYGGEEV
>NE1382 galU1, ADP-glucose pyrophosphorylase
MKIILIKAYELESELERRGKTEMLQYVRNLLPDDMKCVYIRQSEALGLGH
AVLCAQPAVGDEPFAVLLADDLLEGDPVVIRQMVDVYEYYKCSVLGVQQV
PREDTRSYGIVASQPIRDDLEAVQFIVEKPDPEEAPSTMAVVGRYVLTPR
IFHHLIRLGKGTGSEIQLTDGIAALMQEEQVLAYRYKGQRYDCGSKIGYL
EATLAMARKHPEVGAQFEQLLQSFQHQGNSEERQ
>NE1383 galU1, probable UTP--glucose-1-phosphate uridylyltransferase protein
MKIRKAVFPVAGLGTRFLPATKASPKEMLPVVDKPLIQYAVEEAWAAGIT
EMIFVTGRSKRSIEDHFDKGV
>NE2072 gatA, Amidase:Glutamyl-tRNA(Gln) amidotransferase A subunit
MLNASLRQLSLLLSEKKISSTELTSEFLSRIKALNPDLNAFITIDEEKSL
DQANVADKMIAAGRSTPLTGIPIAQKDIFCARGWLTTCGSKMLSNFVSPY
DATVVERFDQAGMVNLGKTNMDEFAMGSSNETSYYGPVKNPWDRLAVPGG
SSGGSACAVAARLAPAATGSDTGGSIRQPAALCGISGIKPTYGLVSRYGM
IAFASSLDQGGPMAKSAEDLALLLNTMVGFDERDSTSLQRAEENYTQDLE
KPVNGLRIGLPKEFFAEGMSSDVSNVIEAALAEYRKLGATFVEVSLPNSK
LAVPVYYVLAPAEASSNLSRFDGVRYGYRTAQYSSLEDLYTKTRAEGFGE
EVKRRILIGTYVLSHGYYDAYYLQAQKLRRLIAEDFRKAFEQCDLIMGPT
TPTVAFNIGEKCDDPIQMYLSDIYTSTASLAGLPGMSIPAGFGSKNRPVG
LHIIGNYFREAQMLNVAHRYQQVTNWHELTPPETSN
>NE2073 gatB, gatB: glutamyl-tRNA(Gln) amidotransferase, B subunit
MQWETVIGLEIHAQLSTRSKIFSGASVAYGAEPNSQACAVDIALPGVLPV
LNRQAIEYAIRFGLAIDATINSPSVFARKNYFYPDLPKGYQISQLEHPIV
EGGHVAIQTGDQEKTIRLTRAHLEEDAGKSTHEEFHGMTGIDLNRAGTGL
LEIVSEPDMRSSAEAVAYAKVLHSLVRWIGICDGNMQEGSFRCDANVSVR
PAGSEALGTRCEIKNLNSFRFLERAIDYEVARQIDILESGGSITQETRLY
DADRDETRTMRSKEDAHDYRYFPDPDLLPVIIPQEWIDQIREKLPELPKD
KRNRYIQDFGLQTYDANILTAARELADYFEQMVSVLPGQSKLCANWIMGE
VSARLNKEGLEIARSPISPEQLAGLLLRITDGTISGKIAKDVFDSMWQSN
GDSADTIIDAKNLRQISDDSEIEKWVDDVLAANPQQVADYRAGKEKAFNS
LVGQVMKTSKGKANPAQVNSVLKKRLTE
>NE2071 gatC, Glu-tRNAGln amidotransferase C subunit
MTLSLNDIKRVAKLARIEISETDAQQNLVRLSGIFDLIEQMRAVDTQGIK
PMSHSQDMVQRLREDIVTESDQRTLFQSVAPQIEDGYYLVPKVIE
>NE0225 gcp, Glycoprotease, (M22) metallo-protease family
MLVLGIETSCDETGVALYDTCQGLLGHTLYSQVDMHREYGGVVPELASRD
HIRRILPLIRQLFRQSDTSLESVDAIACTQGPGLAGALLTGASFSSALAF
ARNIPVLNIHHLEGHLLSPLLSDPAPDFPFVALLVSGGHTQLMRVDGIGQ
YRLLGETVDDAAGEAFDKTAKLLDLDYPGGKLLAELATQGRAEQFRLPRP
MLNSNDLNFSFSGLKTAAALLIGKHEMNSQTRADIAFAFEDAVTDVLVKK
SVTALNITGLQQLVVAGGVGANSRLRQKLLHHLSGTDITVFFPALEFCTD
NGAMIALAGALRLQQLDERLRAGGSFTVKARWNLEDL
>NE0608 gcvH1, Glycine cleavage H-protein
MSVPAELKYTESHEWVRLEADGSVTVGITQHAQELLGDMVFVQLPDVGRA
LAQREDCAVVESVKAASDIYAPLGGEVIAVNSEVETSPEKINEDCYAAWL
FKLKPANAGEVDGLLDAGGYQKLLDSEAH
>NE0607 gcvT, Glycine cleavage T-protein (aminomethyl transferase)
MLKTTPLNAAHRAMNAKMVDFGGWDMPLHYGSQLDEHHSVRRDAGMFDVS
HMLTVDIHGENVRQFLRGLVANNVDKLTLPGKALYTCMLTPTGGIIDDLI
IYFLSESWFRLVVNAGTADKDIDWITGQSRQLAPALTITPRRDLAMIAVQ
GPNARTKVWNVIPDSQAISENLKPFQSVMLGDYFIARTGYTGEDGFEITL
PAGQAADFWQKLHAAGVAPAGLGARDTLRLEAGMNLYGQDMDETVNPLES
GLAWTVDLKSERDFTGKQTLLETPVNRQLVGLVLLDKGVLRNHQKVITRQ
EGEAGEGEITSGGFSPTLNQSIALARIPAGIAAGEQVHVVVRDKQLAAKV
VKYPFVRNGQALV
>NE1616 gdhA, Glutamate/leucine/phenylalanine/valinedehydrogen ase
MKYNSIEEFKNYVSERNPGQPEFLQAVSEVIESLWPFIVDHSRYAEQGLL
DRLIEPERMIIFRVAWVDDRGEVKVNRGYRIQYNSAIGPYKGGTRFHPSV
NLSILKFLAFEQTFKNALTTLPMGGGKGGSDFDPKGKSPGEIMRFCQAYA
AELFRHVGADTDVPAGDIGVGGREVGYMAGMVKKLTNRSDCVFTGKGLTF
GGSLLRPEATGYGLVYFAEEMLNHSGCSLKGMRVSVSGSGNVAQFAIDKA
MSLGAKVVTVSDSSGTVVDEAGFTPEKLAILAEVKNRLYGRVNEFAERVE
AQFLPGEKPWHVPVDVALPCATQNELNENDAAILIRNGANCVAEGANMPC
TAGAVERFHHAKVLFAPGKASNAGGVATSGLEMSQQAMRLSWTSGEVDMR
LQEIMRAIHHSCTEYGKKPDGTVNYVDGANVAGFVKVAEAMLAQGVI
>NE2020 ggaB, possible galactosamine-containing minor teichoic acid biosynthesis
MWMRVPRSSIGAEKMSGSEPVYQEPPTAYWRAPSSYDARLREDFLASLKS
SSTTLGAVPQPMQIDVLRRLHWYFTVDGRERAPTAIVGVEAAQAFHALIG
EILQYVDPGLIGRFSDPAVSSEIRHVLYSWHGKPVCSAAILDCYDHAQQL
VKLRYFVHGEPPVEAWLVDGKAVEPAFAKYRGCRYYHRSCMQQRIVWLPV
AQGSKLQLRLNGQPHAIELDESGFFARSVSEDETFDLAGARAAFWPGRGG
RRRSRPLLKSLKAGLLALYAALPWVRARYRRAWVFLDRHENADDNAEHLY
RWVTAKQPQINAWFLLKPDSPDWARLEQEGFQLLAPNGLQRKLLVLNSEN
IISSHAEYGAGGFDPRVYAPYMRWRYTFLQHGTILNDLSHWLGPLQFDLF
STSSLVEYQSIAEDGGNYPYSKREVSFTGLPRHDCLLRKARERKPPSSKT
LLVMPTWRGGTFEEQAKDLSADERQQLFAQTDYARAWKSLLHNPALHAAL
QQHGWQLSFMPHMNTLPFLDVFELSPEIRLVSVLDGHIQEALVSADAFLT
DYTSVTFDIALLRRPSFYYQFDRTLFYGGGHNWRPGYFDYERDGFGPVAF
SENELLQQLLAFLENGGEVPALYRERMERAMPLDDELACQRCFDRISSLN
QPWQG
>NE1331 ggt2, Gamma-glutamyltranspeptidase
MWIFRIKLATLLLTVLLVACNTQPARLNYTVPVQPEGSSGYTEKPGWQTA
TFAVAVANPLATDAGYQILKAGGSALDAAVAVQMVLTLVEPQSSGIGGGA
FLMYFDGESVEAYDGRETAPAAANEDLFLKPDGKPMSFMEGAVGGRSVGV
PGVVRMLEQVHQQHGKLPWATLFEPAIRLAEQGFKVSARLNTMLAADRYL
RQDPEAAAYFYDSNGQPWPVGHVLRNPELARVLRGIAMHGSMALLEGPVA
QSIVDKVRSHVTNPGEITLADLARYQPRKREPLCHDHAVPPKVYRLCGFP
PPGSGAIAIGQIFGILQNTPGAALPLSSDGLPTADWLHFYTEAARLAFAD
RAKYVGDPDFVAPPAGAWMSLLSPGYLAGRARLIGDRPMKVTRPGQPEGM
RINLAPMPDQPEYGTSHISIVDAQGHAVALTTTIENAFGSRQMVRGFLLN
NELTDFSFTPRDAQGNFVANRVQPGKRPRSSMSPTLVFDKATGQLVMVGG
SPGGAFIIHYIAKILWGTLHWGLNMQQAIDLPNFGSANSFTLLEEKGFPA
ATVKALEAKGHNVQEMDMTSGLQAIQRIPGGYFGGTDPRREGVVMGD
>NE2476 gidA, gidA; glucose inhibited division protein A
MHFSKDFDIIVVGGGHAGTEAALAAARMGQKTLLLTQNLDTLGQMSCNPS
IGGIGKGHLVKEIDALGGAMAAATDEAGIQFRILNSSKGPAVRATRAQAD
RVLYRQAIRRRLEAQPDLLLLQSTVDDLLLNGDKITGVVTHLGMTFSARA
VVLTVGTFLGGVAHVGHQNFQAGRAGDPASIRLAHRLREMNLSVGRLKTG
TPPRIDARTIDFRILREQPGDEPVPVFSFLGNITQHPRQISCWMTRTNER
THEIIRTGLDRSPLYTGKIEGIGPRYCPSIEDKVVRFSERDAHTIFLEPE
GLETSEIYPNGISTSLPFDVQVELVRSIAGLENAHITRPGYAIEYDYFDP
RTLKKSLETKVIDGLFFAGQINGTTGYEEAAAQGLLAGLNASLKIKEQEP
WCPSRDEAYIGVLVDDLVTRGVTEPYRMFTSRAEFRLQLREDNADMRLTE
TGYRLGLVSEERWQAFAAKREAIETEKTRLRNTWISPGTLSKLQTPDQPA
DDRSYSLYDLLRRPEIGYAELVSLSARGQSVIDSQVAQQVEIDVKYEGYI
ERQRQEVVRHAQHEAMILPKDMDYRAVRGLSNEVTQKLNQHQPETIGQAA
RISGITPAAISLLLVHLKRGMARQAVKREEMHESGKTVA
>NE2475 gidB, Glucose inhibited division protein
MNLEKQLHDGLKAIPELSADTGHLCSRLLRYIELIAKWNSTHNLTSVRNP
ESMITRHMLDSLVILPHVSGPGIVDVGSGAGFPGIPVALARPEWQVTLVE
SNQKKAAFLLQAVLELGLPNISVKQGRVEKIKLENKVDTVVSRAFSSLER
FMSLSKHLSENDSDHCRFIAMKGEFPDMELMQLSSEFVVEKIVAVTVPGL
KAKRHLVVIRYQPG
>NE0675 glcD, glcD; glycolate oxidase subunit GlcD
MNTNQLIEAFRKFLPAEAILHEAEDLRPYECDALSAYRQLPMIVVLPRTE
AEIAAILQTCQAERIPVVARGSGTGLSGGALPHAEGVLLSLARLNHILEI
DPAAGTARVQPGVRNLAISEAAAYYGMYYAPDPSSQIACSIGGNVAENSG
GVHCLKYGLTVHNILRLRVLTIEGDLLEIGADALDSPGFDLLALMTGSEG
MLGIVTEITVRLLPKPETVKLVMAVFDDVRKAGEAVASVIQAGIIPAGME
MMDKITIHAVEDFVHAGYDLEGAAILLCESDGMAEEVADEIERIRTILQA
SGATRISLAQDEAERQRFWAGRKAAFPAAGRVSPDYYCMDGTIPRKHLAH
VLEQIEQLSAEYGLRCMNVFHAGDGNLHPLILYDANQPGELQKAEAFGAR
ILEICIEAGGSLTGEHGVGIEKLNQMCLQFNDAERNRFHAIKAAFDPQSL
LNPGKAIPELHRCAEFGAMHIHRGAEKFPEIPRF
>NE0674 glcE, FAD linked oxidase, N-terminal
MQTLPDHFIDTINAAIENKQPLRIRGGGSKDFYGNPQALQHSSILDTSTW
QGVVDYEPSELVITARSGTPLAELEKLLHDHSQMLAFEPPHFSTAATLGG
CVAAGLSGPRRAYTGAVRDYVLGTKLLDGQGSILSFGGRVMKNVAGYDVS
RLMTGAMGTLGVLLEISLKVLPKPAVERTLRFQASTPEALAIMRRCTAEP
LPISATCFHNDQLYVRLSGAESAVQTAHARLGGDEIKDKDDFWESIRDHT
HPFFQEGKSAEKSLWRLSIKPTTPPLSLPGKQLIEWGGALRWFVTDEDTD
AAVIRKHAADAGGHATLFHGNKTAIPVFHPLSPALLKIHRRLKQQFDPAG
ILNPQRLYTEF
>NE0673 glcf,gox, glcF; glycolate oxidase (iron-sulfur subunit) protein
MQTRLTDLIKDTPEGQEADAILRSCVHCGFCLATCPTYQLLGNELDSPRG
RIYLIKQMLEGQEVTEKTQLHLDRCLTCRACETTCPSGVQYGHLLDIGRG
MVEQQIQRTPYATFKRYALRKLLPNRTLFSTLMGVARMGRPLLPARLKKT
IFPKPVTARSPSGKTHTRSMLILAGCVQPALTPNTNHATTRVLDRLGIAV
MAAEQAGCCGALAYHLNAQEEGLNAMRRNIDAWWPFVTRSENPVEAIIVT
ASGCGVTVKDYGHLLQHDPDYAEKAAKISSLTRDISEIITAEASTLIPLL
EAEKATSKSSSPRNLAFHAPCTLQHGMKLKGKVEPVLQAAGFNLTKVADP
HLCCGSAGTYSILQPKLSRQLRDNKIAALTMENPDLVVTANIGCQMHLQG
GTSLPVRHWIELLDETLTG
>NE2264 glgA1, Glycosyl transferases group 1
MSSSPSRKNPRVLFVTSEVFPLCKTGGLGDVSAALPAALRELKADVRLLV
PGYPSVLSGLKYKRKLAEFDLLPHFPPTTLFSSRLQINESVSLPLYVIHC
PELYQRPGGIYLDDDGQDWPDNAQRFGLLSKMGALLASDASPLSWIPDII
HCNDWQSGLTPAYLHYHSGKKAASLMTLHNLAFQGCFPPDEVARLGLPPE
SFSVHGVEYYGNLSFLKAGIYYATRITTVSPTYAREIQHEPLGFGLQGLL
AERSNAITGIINGIDNTVWNPATDPHIVKKYSSRNLAAKKINKLALQREM
GLEENETIPLFAGISRLSYQKGYDILLQVAPMLADLPAQLVLLGKGDQSL
EKQLVMLAQTNPARIAVRIDYDEALSHRINASADCFLMPSRFEPCGLNQM
YSQRYGTPPIVHTTGGLIDTVTDLAPDTPAGESASGFHFHEMTADAFMNG
IGRAIDAYYNTRLWKTLQHNGMRKDFSWRSSALAYLSIYSLLMQR
>NE2029 glgB, Glycoside hydrolase family 13:Isoamylase N-terminus
MKSLSSSSPGSSEDSAQSLLLTARLYDPCSFLGMHAAPNGRLIRVFQPYM
SRVWLHTLSGYQPMRSVHQDGIFEWEGEGVMHPYLLRMENTATGVIEERH
DPYAFPMQISGHDLYLFNEGRLLQAYHMLGAHRVRNHGVTGTRFAVWAPN
AERVSVVGDFNRWDGRVYPMMVHGHSGVWELFIPDLPEGAIYKYEIRNRI
SGEILLKTDPYATTYELRPNNAALTPIEQKYDWKDDDWIARRKGWDWLHA
PLNIYELHVGSWKRHPDGRFYSYHELADHLIPYLQDMGYSHVELLPISEH
PLDESWGYQATGYFAVTSRYGSPEAFMSFVDRCHQAGIGVILDWVPAHFP
QDSFSLARFDGTALYEHEDPRLGYHHDWGTYIFNYGRNEVKSFLLSSAHY
WLSAFHIDGLRVDAVASMLYLNYSRKEGEWLPNRYGGHENLEAIEFLRAL
NTMVHGEFPGALTFAEESTSWPAVSRPAYLGGLGFSMKWNMGWMNDTLSY
MQHDPVHRRYHHNELTFNQLYAYTENFVLPLSHDEVVHGKKSMLDKMPGD
GWQKFANLRLLFTYQMTCPGKKINFMGNELGQGHEWRVGHELDWYLLERD
PHRGIQALTRDLNHLYLNTPALHELDFFAEGFSWIDCHDVEQSVISYQRH
ARDGSFVLVVLNFTPILRTGYRVGIPGSSAYQEVFNSDSIYYDGSNAGNA
GKISPTGQPWSGQPDSIIITLPPLAGVILKAADG
>NE0209 glmS, Glutamine amidotransferase class-II:SIS domain
MCGIVGAIAKNDVVPFLLEGLSRLEYRGYDSAGIVVADGMLHRLRTTGRV
SELSKLVDSGKTSGLTGIAHTRWATHGVPSERNAHPHFSGDQKKIAVVHN
GIIENHETLRQRLQQDGFEFLSDTDTEVIAHLISSYLCKTNDLLEAVCRS
LDELQGAYAIAVMEESRQDRIIVARNGAPLLLGIDDDGIYAASDASALVQ
VTQRIVYLEEGDVAELSSDGYRILNCLDSVNGYEVTRKITESSLTRDAVE
LGPYSHFMQKEIFEQPVAVANTLEMVLNAQSVSPQLFGSEAQSILAQTQG
VLILACGTSYHAGLVAKYWLETIARLPCNVEIASEYRYRDPIADPTTLVV
GISQSGETADTLAALSYAKSLGHRYSLAICNVPESALIRQTDLRFLTRAG
PEIGVASTKAFTTQLAALLLFTAVLTKIRDQLSTGDEQKMIAALRHLPVA
IQHALQCEPEIRKWAGDFSQKHHALFLGRGVHYPIALEGALKLKEISYIH
AEAYAAGELKHGPLALVDSGMPVVAIAPNDALLEKLKSNLHEVRARGGEL
YVFADADSRIEESEGVHIIRLAEHSGVLSPILHTIPLQLLAYHVALQKGT
DVDKPRNLAKSVTVE
>NE0208 glmU, glmU; UDP-N-acetylglucosamine pyrophosphorylase protein
MLQVDVVILAAGMGKRMCSSLPKVLHPLAGKPILSHVLDIARTLSPERIC
VVFGYGGELVRQVIGDHSDLIWVKQAQQLGTGHAVKQALPYLGNKGVTLV
LFGDVPLVKSDTLKALIEKAREDNLVLLTVELDNPTGYGRIVRDPVTNRI
QAIVEEKDASQSQKKIREINTGIMVLPNGRLGNWLDNLSDANTQGEYYLT
DIIAMAVDAGIPIETSSPASDWEVSGVNDKIQLSILERAHQQDTANRLME
QGVMFADPARFDVRGRLVCGNDVEIDINCIFEGNVRLGNNVKIHANCILR
NVIVSDGSVVHPFSLIEDAEVGKNCRIGPYARIRPGTQLDDAVHVGNFVE
IKNSHIASESKVNHLSYVGDTEMGRRVNIGAGAITCNYDGAFKHRTVIED
DVFIGSDTQLVAPVTVARGSTIGAGSTITRDTPEGQLTLSRTKQTSIANW
KRPRKDRN
>NE0504 glnA1, Glutamine synthetase type I, glnA
MAADVLKMIQDHEVKFVDLRFTDTNGKEHHVTVPAHTFDTDKFEDGHAFD
GSSIAGWKNIHASDMLLMPDPETAALDPFMEETTLLLTCDVVDPADGQGY
NRDPRSLAKRGETYLKSTGIGDIAYFGPELEFFIFDSVRWHTDMSGCFVK
IESEEAAWSSKKRYEKSNNGYRPAVKGGYMPVPPVDSLHDIRSAMCMVLD
ELGIPVEVHHHEVANAGQCEIGTRFSSLVKRADWNQLMKYVIHNIAVSYG
KTATFMPKPIVGDNGSGMHVHQSIWKEGKNLFAGNGYAGMSETALYYIGG
IIRHAKALNAFANPGTNSYKRLVAGFEAPVNIAYSASNRSAAVRIPYASS
PKQRRIEIRFPDPSANPYLTFTALLMAGLDGIQNKIHPGDPMDKNLYDLP
PEEASMVPNTCASLEEALASLDSDREFLTRGNVFSNDMIDAFIKLKSEEA
TLLRTTTHPVEFALYYSL
>NE1329 glnE, putative glutamate-ammonia-ligase adenylyltransferase (glutamine-synthetase adenylyltransferase) protein
MSTPIDSSRAASIVASILPYSRYLKRTLASEPGLQQELLDQLPNPFLWEE
MLDFLQHPAISLEDEADLHRLLRQLRKRVILRLAARDLAGLADLNEVMTT
MTALADTTIRFALEFLHTAMTHPGHFGKPTGEKTGTEQQLLIVAMGKLGG
GELNVSSDVDLIFIYPEDGETDGRKSITNHEFFVRLGRKLIASLSDYTVD
GYVFRVDMRLRPHGENSPLAISLPMLEDYFITQGREWERHAWIKSRVITG
SSVAEATLMELIVRPFVFRKYLDFEAYEAMRSLHAQLRKEVDRRELHDNI
KLGPGGIREIEFITQVFQLIRGGRDADLCIRPTLGVLRRLRQKQPLPGQT
IEELTEAYCFLRKLEHRLQYLDDQQTQNLPQHPDEQTLIARSMGFTGYGD
FLDHLDLHRQNVTRHFEQIFAARRKSPRHDTFARIRPEQSGDHETVEAFS
KQLQTLGYLDPGKITARVRQFYDSTFFRQLTHSSQERIFELMPTLMEVIA
RFPPVDITLERILRLLEKIGQYPAYLALLQEHPQTLPRVAKLASVSQWAS
DYLGRHPILLDELLTSSGLHILPDWPALKTELTHQLHHVNIPKVQMVEWQ
MDVLRHFQHAQVFRLLVTDLEGDLLLEKLSDHLTELADLILDNVLQLAWQ
GLKKKHRELPAFAIIGYGKLGGKELGYASDLDIVFLYRDDHPDAASIYTK
LAQNINLWLTSHTSAGILYETDLRLRPNGTSGLLVNSIEAFTQYQYEQAW
VWEHQALTRARFVVGDREAGEMFEQMRKNMLCQPRDLVKLKREILMMRSK
MLEAHPNPTPLFDIKHDRGGIIDVEFIVQYLVLGYAHRYPQLTGNIGNIA
LLKLAGELGLTSAGKATAALTAYRELRRTQHQLRLSGTPEPAGTALSRDV
SQKFARVANNHLSDARQAVFQLWEDIFGT
>NE2363 glnS, Glutamyl-tRNA synthetase:Glutaminyl-tRNA synthetase GlnS
MNTPSSPAHFIRNIIEEDNRTGKWNGRVETRFPPEPNGYLHIGHAKSICL
NFGLALEYGGVCHLRFDDTNPEKEAQEYVDAIIESVRWLGFDWKEHLYYA
SDYFDQLYEFAEYLITQGKAYVDSLSADEIRRLRGTLTEAGTNSPYRDRS
AEENLDLFRRMRAGEFPDGVHVLRARIDMASPNINLRDPVIYRIRHIHHQ
RTGDKWCIYPMYDYTHCISDALERITHSLCTLEFEDHRPLYDWVLDQLAE
KIPCHPQQIEFARLNLTYSVMSKRKLIDLVENKLVDGWNDPRMNTLAGLR
RRGYTPESIRLFAERIGISKADSWIDMTILEDCLREDLNERALRRIAVLD
PVSLIIDNFPDGHEETCYAPNHPQKPELGTRELRLTKQLYIDREDFMEIP
NKGFFRLAPGAEVRLRYAFIIKCTHVVKDDQGKILEIHCVYDPDTKSGTA
GAETRKVRGNIHWLSATYAKAVEIRLYDRLFIDSHPDTEGKDFKISLNPN
SKEVITGYVEPSLCEAQPEQRFQFERHGYFVADLADTGPGKPIFNRTVSL
RNTWKK
>NE2356 glnS, Glutamyl-tRNA synthetase:Glutaminyl-tRNA synthetase GlnS
MNTPSSPAHFIRNIIEEDNRTGKWNGRVETRFPPEPNGYLHIGHAKSICL
NFGLALEYGGVCHLRFDDTNPEKEAQEYVDAIIESVRWLGFDWKEHLYYA
SDYFDQLYEFAEYLITQGKAYVDSLSADEIRRLRGTLTEAGTNSPYRDRS
AEENLDLFRRMRAGEFPDGVHVLRARIDMASPNINLRDPVIYRIRHIHHQ
RTGDKWCIYPMYDYTHCISDALERITHSLCTLEFEDHRPLYDWVLDQLAE
KIPCHPQQIEFARLNLTYSVMSKRKLIDLVENKLVDGWNDPRMNTLAGLR
RRGYTPESIRLFAERIGISKADSWIDMTILEDCLREDLNERALRRIAVLD
PVSLIIDNFPDGHEETCYAPNHPQKPELGTRELRLTKQLYIDREDFMEIP
NKGFFRLAPGAEVRLRYAFIIKCTHVVKDDQGKILEIHCVYDPDTKSGTA
GAETRKVRGNIHWLSATYAKAVEIRLYDRLFIDSHPDTEGKDFKISLNPN
SKEVITGYVEPSLCEAQPEQRFQFERHGYFVADLADTGPGKPIFNRTVSL
RNTWKK
>NE1427 gloA, possible gloA; lactoylglutathione lyase
MRILHTMLRVGNLERSIRFYTDVLGMQILRRKDYPEGKFTLAFVGYQSET
EGTVLELTHNWETDHYDLGTGFGHIAIEVDNAYEACEKVRNLGGRVTREA
GPMKHGATVIAFIEDPDGYKIEFIQKKTA
>NE2373 gltA, Citrate synthase
MPPQNVATLSPGKGKQEIELPIVSGNEGPDVVDIRSLYAQSGMFTYDPGF
VSTASCKSAITFIDGDKGVLLYRGYPIEQLATRCDFMEVSYLLLNGELPT
PDQKAKFVDNIKNHTMLHEQLIKFLSGFRRDAHPMAVMVGVVGALSAFYH
DAMDVSDAQHREQSAFRLIAKLPTITAIAYKYNIGQPFIYPHNDLGFTEN
FLHMMFATPAEQYTPNPVIVRALDRILILHADHEQNASTSTVRLAGSSGA
NPFACISAGITCLWGPAHGGANEACLNMLEQIGDVSRINEYIARAKDKND
PFRLMGFGHRVYKNFDPRATLMRETCHEVLDELGLHNDRLFKLALELEKI
ALEDDYFITKKLYPNVDFYSGIVQRALGIPTSMFTAIFATARTVGWVAQW
NEMISDPEQKIGRPRQLYVGAPRRDVP
>NE1624 gltX, gltX; glutamate-tRNA synthetase (catalytic subunit)(sye protein)
MVKTRFAPSPTGYLHIGGARTALFSWAFARKQGGKFVLRIEDTDLERSTQ
QSVQAILDGMAWLGLDYDEGPYYQMQRLNRYQEVAEQLLTQGLAYQCYAS
REELDALREQQRLAGLKPRYDGRWRDSRQTPPAGVKPVVRLKTPQHGYVV
FDDLVKGKISVANHELDDLVLLRSDGTPTYNFGVVLDDLDMGITHVIRGD
DHVNNTPRQINILKALGAAIPQYAHVPMILGADGERLSKRHGAVSVMHYR
DQGYLPEALINYLARLGWSHGDEEIFSREQLVEWFDLAAISRSPAKFNPE
KLTWLNQHYLKMADDARLVELVMPFLLERNYPLPATENLLKIVNLLKDRA
STIEELADASAYFFRTIEPPEELRTQYFTAEIRPVLEYLVDRLAQIEWKR
ETIHHEIKQTVSSHNLKFPSLAMPLRVMVTGEAQTPAIDAVLELLGKEET
LHRLRGRMEIFPG
>NE1433 glyA, Serine hydroxymethyltransferase (SHMT)
MFSQSLTIEQVDPDLWQAIKGEVQRQEDHIELIASENYASPAVLQAQGTV
LTNKYAEGYPGKRYYGGCRYVDIVEQLAIDRLRNLFNAEYVNVQPHSGSQ
ANAAVYLSALKPGDTLLGMSLAHGGHLTHGSAVNMSGKIFNSISYGLNPE
TEEIDYAELERLAHEHKPRMIVAGASSYARVIDWKAFRQIADNVGAYLFV
DMAHYAGLIAAGYYPNPVGIADFVTSTTHKTLRGPRGGVIMAKPEHEKAL
NSAVFPQTQGGPLMHVIAAKAVAFKEASSQAFKDYQKQVIENARVMARVL
QQRGLRIVSGRTDCHMFLVDLRAKNLTGREAESALEAAHITVNKNAIPND
PQKPFVTSGIRIGTPAITTRGFKEPESEELANLVADVLDAPANTAVLDQV
ARKAQALCTKFPVYGN
>NE1187 glyQ, Glycyl-tRNA synthetase, alpha subunit
MGPVHHQAAINPSLNRRNMSIPTFQEIILTLQQYWGQQGCALLQPYDMEV
GAGTSHTATFLRALGPEPWRAAYVQPSRRPKDGRYGDNPNRLQHYYQFQV
VLKPAPHDILDLYFRSLQILGLDLQQNDVRLVEDDWENPTLGAWGLGWEV
WLNGMEVTQFTYFQQVGGINCRPITGEITYGLERLAMYLQGVENVFDLTW
TDGLTYGDVYHQNEVEQSTYNFEYSDTGFLLLSFDKLEAQANQLIDAQLA
LPAYEQVLKAAHTFNLLDARGAISVTERAAYIGRIRNLSRRVAQAYYDSR
LRLQPPFPLAPREWVAQILNQAESA
>NE1186 glyS, Glycyl-tRNA synthetase, beta subunit
MMIENLLIELLTEELPPKSLDKLGNAFAAVIADSLKSQNLTTPDTILTAF
ASPRRLAVHLTAIPAQAPDQVVALKLMPITVGLDAQGQPTPALHKKLAAL
GMENVDASALKRVQESKAEMLFLEQNVTGILLAAGLQKAMEDAIRQLPVS
KVMTYQLDDGWENVHFVRPVHGLIALHGQKIIPVSAFGLTAGNTTRGHRF
EVKQTELIIDHADRYASLLETEGAVIPGFDRRRSWIREGLEAAASAVQLR
CISDEVLLDEVTALVEYPNILMGAFPTDFLEVPQECLISTMKINQKYFPL
LDTDGKLTNQFLIVANITPADPGQIISGNERVIRSRLADAKFFFDHDRKR
TLASRLPDLDKVIYHHQLGSQGERTRYVQTLARIIGRLLGDDNLAGQADQ
AAMLAKADLLTDMVGEFPELQGIMGRYYARFEGMDETIAFAIEDHYKPRF
AGDVLPRSMAGICVALADKLETLISLFSIGQLPTGDKDPYALRRHALGVI
RILIEKNLPIGLDVLISRAADVLQDEMIGKQDSGPGHARPVTPQLVGQLQ
DFFYDRLAASLRDQGYTAQEVEAVLNLRPSLLCEIPRRLAAVRAFAALPE
AASLAAANKRVGNILKKSECDATVAIDEACLQASAEITLYRALSEIESDA
RQAFQNGDYVTALQILAALKAPVDAFFDQVMVNDENEALRRNRLALLMAL
QATMNRVADISRLAA
>NE2254 gmk, Guanylate kinase
MSCLFVISAPSGAGKTSVIRTLLQTDINLTLSISYTTRPPRRDEKNGHDY
FFVDHATFKDMQARGEFLESAEVHGNLYGTSRKWIEETMAAEQDVLLEID
CQGAQQIRTVYPQAASIFILPPSMEALKQRLEQRGQDENKVIERRLAAAR
SEISHVNRFDYVVVNHELETAARDVASIVQAERLKTIRQLVRQRSLIAEF
S
>NE0178 gpmA, Phosphoglycerate mutase family
MNEIQEPIRLVLLRHGQSIWNQDRHFTGWGDIVLSPQGEQEALRAGHLLK
QAGFTFDACFCSELQRASDTLAIVQSVMGLNHLSTYRTWRLNERHYGALE
GMRPWAAIRKFGIWSTMKSQIRFDAAPPLLMPDDPRAPVNQPRYAAVDRT
QLPLAESMQQTLERVRPLWQETILPEIRQGKRLLIVSHQNLLKTLVMQLE
GLTGAQIMRLSITTGHPLCYELDHSLVPVKRYYL
>NE2208 gpsA, NAD-dependent glycerol-3-phosphate dehydrogenase
MNIAVLGAGAWGTALAICLSARHRVTLWTRNVEHLAELAALRTNQRYLPR
QPLPDSIHLVSALSEALERAELVFVVVPVAGLRTTLQQMVALNPSLPLIL
ACKGFETGSAKLPCQVVEEVYPASITCGVLSGPSFAREVAQGLPAALTLA
SHDEIFARSVAGEIRTASLRVYSGNDVIGVEVGGALKNVIAIAAGISDGI
AFGNNARAALITRGLAEITRLGMALGGCRETFTGLTGIGDLILTCTGNLS
RNRRVGMMLAAGRQLAEILPEIGHVTEGVYTVREAYGLGQRLQIDMPVTQ
AVYSILYEQVPVEIAIQDMLDREPGAETD
>NE1660 greA, Prokaryotic transcription elongation factor GreA/GreB
MNIVLLFFHGKRNQVMNAIPITQAGAEKLRAELHEMKTVHRPAVIAAIAE
ARSHGDLSENAEYDAAKERQGFIEGRIAELESKLSNAQIINPATLNADGA
CVFGATIDLMDLGNNTTVTYQIVGDDEADIKQGKISISSPIARALIGKYA
GDIAEVQAPGGVREYEILDVKYI
>NE1848 greB, Prokaryotic transcription elongation factor GreA/GreB
MSKAFTKESDGEEELDGNPQLPQGVKNYITPGGYQRLKDEFDQLWRVERP
ELVKVVSWAASNGDRSENGDYIYGKRRLREIDRRLRFLSRRLDNAEIIDP
KQRGECDQVFFGATVTVCNQRGEEQTYSIVGIDEAEPGRGWISWISPLAK
ALLKAREGDVVPLQTPGGPQELEVVEIRYEAL
>NE0028 groEL, TCP-1 (Tailless complex polypeptide)/cpn60 chaparonin family
MAAKEVRFGDSARQAVISGVNVLADAVKVTLGPKGRNVVLERSYGAPTIT
KDGVSVAKEIELKDKFENMGAQMVKEVASKTSDTAGDGTTTATVLAQSIV
KEGMRYVAAGMNPMDLKRGIEKAVTGAVEELKKLSKPCSTSKEIAQVGSI
SANSDTEIGRIIAEAMDKVGKEGVITVEDGSGLENELDVVEGMQFDRGYL
SPYFVSSADKQIAALESPFVLLHDKKISNIRDLLPVLEQVAKAGKPLLII
AEDVDGEALATLVVNNIRGILKTCAVKAPGFGDRRKAMLEDIAILTGGTV
IAEEVGLSLEKTRLEDLGQAKRIEVGKENTTIIDGAGDVKTIEARVAQIR
KQIEEASSDYDREKLQERVAKLAGGVALIKVGAATEVEMKEKKARVEDAL
HATRAAVEEGIVPGGGVALLRTINAVSKIKGDNHDQDSGIKIVLRAMEEP
LRQIVTNCGDEASVVVNKVKEGQGTFGYNAATGEYGDLVAMGVLDPTKVT
RSALQNAASVAGLILTTDAMVAELPKEDSPGAGAGMGGMGGMGGMDM
>NE0027 groES, Chaperonins cpn10 (10 Kd subunit)
MNIRPLHDRVIVKRLEEERKTASGIVIPDTAAEKPDQGEIIAVGKGKTGE
DGKIRALEVKVGDRVLFGKYAGQAVKIKGEEFLVMREEDIMGVIEG
>NE1950 grpE, GrpE protein, molecular chaperone
MQEPHDQEPIEKQKLPGMDDVLETEHSGTVAGNTERAGEDAAPSLEQQLK
EAEIRAAEHHDAWLRAKAETENIRKRAQTDIASAHKYAIDNFSVQLLAVM
DSLDAALATENSTLENLRDGVELTRKQLAAVFEKFNIHTIDPQGEKFDPH
QHEAMCAVESDFAPNTVIQVMQKGYMLHDRVIRPAMVTVSKAKGT
>NE2211 grxC, Glutaredoxin
MPKIVMYVSGYCPYCTMAEKLLRARGVEEIEKIRVDLQPGLRAEMMQRTG
RRTVPQIYIGPVHVGGYDDLAMLDRQGELSGLLAG
>NE1295 gshB, gshB; glutathione synthetase protein
MKLAFILDPLDSIKIGKDSSYAMMREAAVRHHQLYTLQQNDLAWKDHQVI
GFARPLTLLDPPEGDHRWYEEGAIEEIPLSGFDAVLMRKDPPFDTEYIYS
TYLLELAERQGAYVVNSPRGIRDHNEKLAITEFPRFTPPSLVTSQEQLIL
EFLAEHEDIILKPLDGMGGAGIFRIQNTDHNIGVIIETLTRYGTRTIMAQ
RFLPEIREGDKRILLIAGRPVDYALARIPKPGETRGNLAAGGTGVARPLS
ARDREIAEELGQILYARGLMLVGLDVIGNHLTEINVTSPTGMREISDQTG
TNVAGLMIDALEQNIARKNR
>NE0094 guaA, guaA; GMP synthetase
MHSAILILDFGSQYTRLIARRIRETNVYCELHPFDVSPQFIREFAPIGII
LSGGPASTFTEDAPRVPQIVFELGVPVLGICYGMQAMAAQLGGEVEDAQT
REFGYAELLTAPCKLFHGIQDRINSDGKPALDVWMSHGDRVNKLPPGFTA
IASNAATPFAAMADETRNFYGVQFHPEVTHTRQGKAILDHFVHDICSAGY
DWNMPDYVEEAIGRIRARVGNDKVVLGLSGGVDSSVAAALIHRAIGDQLV
CVFVDNGLLRLNEAKQIMETFSRNLAVNVIYIDAGRQFLEQLKGITDPEQ
KRRTIGREFVEIFQQEAEKIENAKWLAQGTIYPDVIESAGSHTKKAGVIK
SHHNVGGLPETLHLKLLEPLRELFKDEVRELGLALGLPHDLVFRHPFPGP
GLGVRILGEVKYEYTELLRQADAIFIEELRNAGWYEKTSQAFAVFLPIKS
VGVMGDNRSYEYVVALRAVQTEDFMTAHWAELPYTLLSRISNRIINEIRG
INRVVYDISGKPPATIEWE
>NE0095 guaB, guaB; inosine-5'-monophosphate dehydrogenase oxidoreductase protein
MRLIQKALTFDDILLVPAYSEVLPKDVDLATQLTRTLRIKIPIVSAAMDT
VTEARLAIAIAQEGGIGIIHKNMPIKAQAAQVAQVKRFESGVVTDPIIVS
PDMTVRKVLELIRQHNISGLPVVKSKKVVGIVTNRDLRFETNLDQPVKNI
MTPKKHLVTVREGVSKEDALALLHKHRLEKALIVSENFELRGMITVKDIT
RTTEHPYASKDNQERLYVGAAIGVGEGSDERAAALVEAGADVIVVDTAHG
HSQGVLDRVRWVKKKFPEIQVIAGNVATATAAKALVDHGADAVKVGIGPG
SICTTRVVAGVGVPQISAIDNVATALLGTGVPLIADGGIRYSGDIAKALA
AGASSVMLGGLLAGTEESPGEIELLKGRSYKSYRGMGSLSAMQQGSSDRY
FQEAERHEADKLVPEGVEGRVPYKGYLANVIHQLTGGVRSSMGYLGCRTI
SDMHTKAEFIEITSSGIRESHVHDVQITKEAPNYHVE
>NE0332 gyrA, DNA gyrase/topoisomerase IV, subunit A
MEQFAKETLPVSLEDEMRRSYLDYAMSVIVGRALPDVRDGLKPVHRRVLY
AMHELSNDWNRPYKKSARIVGDVIGKYHPHGDTAVYDTIVRMAQPFSLRY
MLVDGQGNFGSIDGDNAAAMRYTEIRMSRIAHELLADLDKNTVDFGPNYD
GSEQEPLILPAKIPNLLINGSSGIAVGMATNIPPHNLGEVIDACLLLLRD
PDVDIAELMACIPAPDFPTAGIIYGISGIKDGYQTGRGRVIMRARTHFEE
LDKGNRHSIIIDELPYQVNKANLLVRIGELVRDKRIEGISDLRDESDKSG
MRVVIELKRGEVPEVVLNNLYKETQMQDTFGINMVALVDGQPRLLNLKQM
LDHFLRHRREVVTRRTLFELRKARERGHLLEGLAVALSNVDEIIALIKAA
PTPAEAKKGLMARTWRSSLVEEMLLRAMIDAAVFRPETLAAGFGMSDQGY
RLSDAQAQAILDLRLQRLTGLEQEKIVSEYREILDKIRDLLDILANPERI
TTIIVEELTAIKGQFGDPRRSEVVIDAQNLNTEDLITPADMVVTLSHAGY
IKSQLLDDYRAQKRGGRGKQAITTREDDFIDNLFIANTHDFILCFSSLGR
VYWIKVYNVPQGSRTSRGRPVNNLVPLEQNEKINAVLPVKSFDDTRYVFM
STAGGTVKKTPLSEFSRPRTNGIIAIDLDEGDYLIGVALTEGKHDVMLFS
DAGKAMRFDENDVRPTGRNARGVRGMKLGAGQQVISLLVADNENMAVLTA
TENGYGKRTPITEYTRHNRGTQGMIAINTNVRNGKVVAAQLVESSDEIML
ITTGGVMIRTRVSEIREMGRATQGVTLINLDAGEKLAGLERIVETDED
>NE0003 gyrB, DNA gyrase, subunit B:DNA topoisomerase II gyrB
MNTNQPESAKKTDNSHRDYNSDSIKILKGLDAVRKRPGMYIGDTSDGTGL
HHMVFEVVDNAIDEALAGYCDDISVIIHADNSVSIHDNGRGIPTDIKQDD
ELKRSAAEIVMTELHAGGKFDDNSYKVSGGLHGVGVSVVNALSEWLRLTI
RRNGNVYQMEFREGVAVAPLKVTGQTEKHGTEVHFLASQSVFGDITYHYD
IFAKRLRELSFLNHGIKIRLADQRDDREEVFAFTGGIRNFVEYINRSKTV
LHPSIFYAKGLKDNITVEIAMQWNDSYAEQVLCFTNNIPQKDGGTHLTGL
RAAMTRTLNNYIEKNELAKKAKVDTTGDDMREGITCVLSVKLFEPKFSSQ
TKEKLVSSEVRPAVEEIVVQKLSDFLLENPNEAKTICNKIIEAARAREAA
RKARELTRRKGVLDSMGLPGKLADCQEKDPKLCELYLVEGDSAGGSAKQG
RDRKFQAIMPLKGKILNVEKSRFDKLISSQEIVSLITALGTGIGKDEYNP
DKLRYHRIIIMTDADVDGSHIRTLLLTFFYRQMPELIERGHIYIAQPPLY
KIKHGKQERYLKDDYELKHYILGLALVGAELHTGANNPPITGEALARIAD
EYLLAETVIERMSRLIDRTVMYALLKQPDIDLSSETSARDSAARLAILLD
DVEILAEYDENFERYRLKIIRKQHGNLRTSYLDDDFLQSGDFARIRQAAQ
ILHGLIGEGAKVKRGEQEISVREFKEALEWLLEETKKGITIQRYKGLGEM
NPEQLWETTMDPGNRRLLRAQIEDSILTDEIFTTLMGDVVEPRRAFIESN
ALRARNIDI
>NE1593 hag, hag; flagellin
MPQVINTNIASLNAQRNLNVSQNSLSTALQRLSSGLRINSAKDDAAGLAI
SERMTSQIRGMNQAARNANDGISLAQTAEGALVEIGNNLQRIRELAVQSA
NATNSEDDREALQKEVTQLIDEIQRVGEQTSFNGTKLLDGSFASQIFQVG
ANEGETIDFTDIADVTASGLSVDSVDITGTDGTAAASVITTIDDALKIVN
STRADLGAIQNRFSSAIANLQTSAENLSASRSRIQDADFAAETAALTRAQ
ILQQAGVAMLSQANALPNNVLSLLR
>NE2044 hao1, hydroxylamine oxidoreductase
MRIGEWMRGLLLCAGLMMCGVVHADISTVPDETYDALKLDRGKATPKETY
EALVKRYKDPAHGAGKGTMGDYWEPIAISIYMDPNTFYKPPVSPKEVAER
KDCVECHSDETPVWVRAWKRSTHANLDKIRNLKSDDPLYYKKGKLEEVEN
NLRSMGKLGEKETLKEVGCIDCHVDVNKKDKADHTKDIRMPTADTCGTCH
LREFAERESERDTMVWPNGQWPAGRPSHALDYTANIETTVWAAMPQREVA
EGCTMCHTNQNKCDNCHTRHEFSAAESRKPEACATCHSGVDHNNWEAYTM
SKHGKLAEMNRDKWNWEVRLKDAFSKGGQNAPTCAACHMEYEGEYTHNIT
RKTRWANYPFVPGIAENITSDWSEARLDSWVLTCTQCHSERFARSYLDLM
DKGTLEGLAKYQEANAIVHKMYEDGTLTGQKTNRPNPPEPEKPGFGIFTQ
LFWSKGNNPASLELKVLEMAENNLAKMHVGLAHVNPGGWTYTEGWGPMNR
AYVEIQDEYTKMQELSALQARVNKLEGKQTSLLDLKGTGEKISLGGLGGG
MLLAGALALIGWRKRKQTRA
>NE0962 hao2, hydroxylamine oxidoreductase
MRIGEWMRGLLLCAGLMMCGVVHADISTVPDETYDALKLDRGKATPKETY
EALVKRYKDPAHGAGKGTMGDYWEPIAISIYMDPNTFYKPPVSPKEVAER
KDCVECHSDETPVWVRAWKRSTHANLDKIRNLKSDDPLYYKKGKLEEVEN
NLRSMGKLGEKETLKEVGCIDCHVDVNKKDKADHTKDIRMPTADTCGTCH
LREFAERESERDTMVWPNGQWPAGRPSHALDYTANIETTVWAAMPQREVA
EGCTMCHTNQNKCDNCHTRHEFSAAESRKPEACATCHSGVDHNNWEAYTM
SKHGKLAEMNRDKWNWEVRLKDAFSKGGQNAPTCAACHMEYEGEYTHNIT
RKTRWANYPFVPGIAENITSDWSEARLDSWVLTCTQCHSERFARSYLDLM
DKGTLEGLAKYQEANAIVHKMYEDGTLTGQKTNRPNPPEPEKPGFGIFTQ
LFWSKGNNPASLELKVLEMAENNLAKMHVGLAHVNPGGWTYTEGWGPMNR
AYVEIQDEYTKMQELSALQARVNKLEGKQTSLLDLKGTGEKISLGGLGGG
MLLAGALALIGWRKRKQTRA
>NE2339 hao3, hydroxylamine oxidoreductase
MRIGEWMRGLLLCAGLMMCGVVHADISTVPDETYDALKLDRGKATPKETY
EALVKRYKDPAHGAGKGTMGDYWEPIAISIYMDPNTFYKPPVSPKEVAER
KDCVECHSDETPVWVRAWKRSTHANLDKIRNLKSDDPLYYKKGKLEEVEN
NLRSMGKLGEKETLKEVGCIDCHVDVNKKDKADHTKDIRMPTADTCGTCH
LREFAERESERDTMVWPNGQWPAGRPSHALDYTANIETTVWAAMPQREVA
EGCTMCHTNQNKCDNCHTRHEFSAAESRKPEACATCHSGVDHNNWEAYTM
SKHGKLAEMNRDKWNWEVRLKDAFSKGGQNAPTCAACHMEYEGEYTHNIT
RKTRWANYPFVPGIAENITSDWSEARLDSWVLTCTQCHSERFARSYLDLM
DKGTLEGLAKYQEANAIVHKMYEDGTLTGQKTNRPNPPEPEKPGFGIFTQ
LFWSKGNNPASLELKVLEMAENNLAKMHVGLAHVNPGGWTYTEGWGPMNR
AYVEIQDEYTKMQELSALQARVNKLEGKQTSLLDLKGTGEKISLGGLGGG
MLLAGALALIGWRKRKQTRA
>NE1152 hbpA, Bacterial extracellular solute-binding protein, family 5
MLFKTSRSLMKKIRLQVICAGYSLVSTGLLLLSLILSGCTDIWNNPYPAA
DSGKNILYNVFAERPKHLDPVQSYSSNEIQLTSQIYQPPLQYHFLKRPLT
LIPQTASRMPTVSYFDSNDTRLPANSADTRIAYSVYEISIRPGIFYQPHP
AFARDEKDNLRYHELTVQDIQPIHKISDFEHTGTRELVADDYIYQIKRLA
HPGLHSPIFGLMADYITGLREYANTLRDQARTRPGNSFLDLRDFPLTGVE
RVDDYTYRIRIKGKYPQFIYWLTMAFFAPIPWEADHFFSQPGMAEKNFSL
DWYPVGTGPYMLAENNPNRIMVLERNPNFQGEYYPDEGMPEDIQKGLLRD
AGKPLPFIDKIVYSRERESIPYWNKFLQGYYDASIIGSDSFDQAVQITGQ
GEATITDEMEAQGIRLGTAVAPSTYYMGFNMLDPVVGGRTEAEKAAARKL
RQAISIAVNYEEYLSIFANERGIAAQGPIPPGIEGYHEGKEGMNPIVYEW
HNGEIRRKSIKQAQALLAEAGYPDGISRKTGKPLVLYYDVTARSADDRSI
LDWMRMQFRKLNIQLVVRSTDYNRFQDKIRKGNAQIFEWGWNADYPDPEN
FLFLLYGPQRKVGNSGENAANYDNAEYNHLFEQMKDMEHGPQRVRIIDRM
ITILREDAPWLWGYHPKEFALYHAWYQNVKPNRMAYNTLKYYRIDPALRD
QKRNEWNKPVLWPIGAGTALLVIILLPAWVAYRRKQQQQSITEQAT
>NE1914 hemA, Glutamyl-tRNA reductase
MQLFAFGVNHHTAPLDIREHVAFSEESMQHALHDLVGHQLVKEAAIVSTC
NRTEIYCNTDTPDKAVGWLADFHHLRFGDLEPYLYRLLREQAVKHAFRVA
SGLDSMVLGEPQILGQMKNAVKSAEQAGTLGLLLHKMFQRTFFVAKEVRT
STEIGACSVSMAAASARLAERIFGNISEQKVLFIGAGEMIELCAAHFVAR
HPVHVTVANRTVERAEALARRFNAHPISLGELPDQLALHDIVVTSTASPL
PILGKGMLERAIKQRKHRPVFIVDLAVPRDVEAEVAELDDVFLYYVDDLA
DIVKEGLDNRQGAVTQAETIIESNVVDFMHWVATRQSVPTIRALRNQAER
YRRHELARAHKLLAKGEDPEKVLESLSSGLTNKFLHLPSSVLNHATDDER
EQLIELVNRLYQLHHS
>NE0590 hemC, Porphobilinogen deaminase
MSSPKKIVIASRESQLALWQANFIRGRLLELYPQTDITILGMTTKGDQIL
DVSLSKIGGKGLFIKELELALEDGRADIAVHSMKDVPMIVPSGFTLAAIT
EREDPRDAFVSNDFSSLEELPAGSVVGTSSLRRESQLRARFPHLQVRPLR
GNVQTRLRKLDEGEYSAIILAAAGLKRLELGYRISMLLPPELSLPAVGQG
ALGIECRDNDPDMVEWMKPLHHAATACCVEAERAMSRMLGGSCQVPLGGF
AEIFEDVLTLRGFVATPDGSRMIADKLCGKPESGEQVGQQLAQNLKAHGA
EEILAALA
>NE0591 hemD, Uroporphyrinogen III synthase HEM4
MDLSSNRLAGKSILITRPLHQAGGLATWVRELGGEPWLFPVLEISDSENK
QPLLDLIARLDEFDLAVFVSPNAVEKVIPLVQVSHSWPRHVLVATVGKGS
ARVLERYGITNVIVPEEGSDSEALLRMPQFQVMQGRHVVIFRGNDGRRLL
GDTLRERGASVEYIECYRRHKPEADPLPLLKHWRDDGIQAVIISSSEGLD
NLFDMIGETGQQLLKATPVFTAHERIERKARELGIRKIYRTLLGDEGTVQ
GLLEYFEKM
>NE1876 hemF, Coproporphyrinogen III oxidase
MNFAQVKDYLVDLQNRIVTGLEQVDGQSFRRDTWDRPEGGGGTSCVIEEG
NVLERGGVNFSHVFGKGLPASATAARPELAGRAFEAAGVSLVLHPRNPYA
PTVHMNVRFFAATKEGAEPVWWFGGGMDLTPYYGFEEDAVHFHQACKDAL
QPSGEEYYPRFKKWCDEYFYLKHRKEPRGIGGVFFDDLNQPDFATCFNLT
RSVGDHFLAAYVPILQKRRDLPYGERERDFQAYRRGRYVEFNLVWDRGTL
FGLQSGGRTESILMSLPPVVKWRYDWSPAAGSPEAKLYTDFLTGRDWLPL
G
>NE1476 hemH, Ferrochelatase
MTRMLPEPAYRHGSVGKIGVLMINLGTPDAPTAKALRAYLKQFLSEPRIV
EFPRWLWWFILNGIILNVRPAKSAKKYEQIWTSEGSPLRVHTARQTALVA
ALLEQQADSSLVVEYAMIIGNPSIAEKLQQMKVQGCDRILVLPLFPQYAA
SSTGCVLDGVFSELRKMRNIPDIRTVRHYHDDPGYIAALAQNVRDYWEKH
GQPDKLIISFHGVPRKTLEMGDPYHCECQKTGRLLAEALELADDRYQICF
QSRFGFAQWLGPYTAEILAELGKQKTGRVDVVCPGFVSDCLETLEEIALE
GKAIFTEAGGGEFHYIPSLNEHPLWIEAIGNIIQTHLTGWADRRLSEEAA
ERSRKRALALGARE
>NE1423 hemL, hemL; glutamate-1-semialdehyde 2,1-aminomutase protein
MTITNQQLFERSRQYIPGGVNSPVRAFKSVGGTPVFFQRGQGAYFWDVEG
KSYIDYVGSWGPLILGHAHPDVVRAVQIAAGHGTSFGAPTAAELEIAELL
CRLLPSLEMVRLVSSGTEAGMSAIRLARGYTGRNRIIKFEGCYHGHDDAL
LVKAGSGALTFGHPSSAGVPAETAGHTLVLNYNDVAGVEETFSKMGTEIA
AVIVEPVAGNMNLIKATSQFLETLRTLCTKHGSLLILDEVMTGFRVGLEC
AQGLYGIKPDLTILGKVIGGGLPMAAFGGRRDVMECLAPLGSVYQAGTLS
GNPVAVAAGLETLHQIQVPGFFDKLSTMTRKLTEGLTAVAAKHSVAFCAQ
AVGGMFGLYFRKSPPESFAEVMESDREAFNHFFHAMLKEGVYFAPSAFEA
GFVSAAHSNEEIDKTLAVADRIFGQGMRRTEKATL
>NE0593 hemY, putative protein porphyrin biosynthesis
MKLVLWVLALLAAAAVIVLTAYYNTGSVLFTVPPYKVELAFNTFVLILLF
AFILFYALLRALSGLSGLRSRKVEQLTRSGLKAFFETRYDRAVALAEKAA
RLADRQTIKVLNAVVAARSAHQQRNYALRDRLLAVAREQAPAGRALALIA
EAELLLDEGRHGDALAALQSLYSTGGLQSTAVLLLELKARQMAGNWDAVL
ELTKVLVNRPAVDRTLIDELRFRAHLENIRKNAKDIASLRKYWGSLSYRE
KLDGRLAVAAARALIFLGDNATAQKIIENGLDAQPYPELVTLYADCKSGV
VSWQIQRAESWLAKYPNNAGLLLTLGRLCTYGELWGKAQSYLEASLSIEP
GYPVHLALAQLFEKLGKQEAASEHYRKGLDFALKRIGTA
>NE2234 hetM, possible polyketide synthase
MNTYFITGATGVLGSAIVKELLSNPENRLVLLVRAENELVLQKRITELLS
FLGVGEKTRDRMDFIRGDVELERFGLHPGNFIKLGDQVTHIIHSAASVRM
NYSLERARLAAVTATEHVLQLARLSRKNGLLQKMEAVSTVGVGGRYYGAL
PEQWLNTPRNFHNTYEQSKAEAEIILKTEVDQGMPITVHRPSMIVGDSQT
GRILNFQIFYYLLEFITGRRTWGVMPDISDRHVDAIPVDYVARAISWSSK
NPETIGKILHLCAGLENITSLENAEKLARSKFMKMNLPVPHKRVLPVAQF
RFLVNSVTPFLPRKIRRSVSALPIFLNYTKNQIFDNTETNIILSSAGLQM
PRSDEFLGPVIDYYLSQAYLT
>NE1333 hetN, Short-chain dehydrogenase/reductase (SDR) superfamily
MSRLPDKSGRSILITGATGAIGAALAEIYAQHGVTLHLQGRNAVKLAEVA
ERCRLKGAHVLMQCLDLRDSVALQDGLKVLEPLDLVIVNAGMNTHVGSAG
ESELLDEVEALLDVNLKAAMVIVHAVLPSMRMRGSGQIAFVSSLAAYFGL
PVTPAYCASKAGLKAYGEALRGWLAREGIKINVIMPGYVKSPMCDDMPGS
KPFLWPPDRAAKVIKRGLERDQARISFPFPLNWGAWWLAVLPASVSILIV
RLLGYGG
>NE0528 hflB, hflB; ATP-dependent zinc metallopeptidase (cell division ftsh) transmembrane protein
MNNLIKNMAIWLVIALVLMTVFNQFSTRQPSQPPMEYSQFISEMHQGRIA
KVVIDGRTLRGSKTDGRQFIVHSPSDPWLVSDLLKAGVSVEAKPEEEPSM
LMSILVSWFPMLLLIAVWIFFMRQMQGGGRNGGAFSFGKSKARMLDHKNN
NVTFADVAGCEEAKEEVAELVEFLRDPSRFQKLGGRIPRGVLMCGSPGTG
KTLLARAIAGEAKVPFFSISGSDFVEMFVGVGASRVRDMFEQAKKHAPCI
IFIDEIDAVGRQRGAGLGGGNDEREQTLNQLLVEMDGFEGNMGVIVIAAT
NRPDVLDPALLRPGRFDRQVVVPLPDIRGREQILQVHMRKVPVSPDVKAG
ILARGTPGMSGADLANLVNEAALFAARAGKRLVDMDDFERAKDKILMGAE
RRSVVMPENERRNTAYHESGHAVVAYLLPKTDPVHKVSIIPRGRALGVTM
QLPSEDRYSMDRDQILQTIAVMFGGRIAEEVFMSQMTTGASNDFERATDT
ARKMVMQWGMSETLGPMVYGENEGEVFLGRSVTTHKNLSEATMQKVDAEI
RRIIDEQYALARKLIEENKDKIEAMTQALLEWETIDSDQIKDIMEGRPPR
PPQAPTSSARQDKGSAGSAASEVKQPAGEPATTEAPAAQKVEI
>NE1287 hfq, putative host factor-i protein
MGVKGQLLQDPFLNILRKERIPVSIYLVNGIKLQGQIDSFDQYVVLLKNS
VTQMVYKHAISTIVPAKAISIPIPADTQTEQDEP
>NE1310 hipA, possible HipA protein
MARELEVWLFAERIGTLALIEDRLNFRYSPDWLSRPDAATLSSSLHLQAE
SFDDHHTRPFFGGLLPEGQLRRLIAQQFQVSSQNDFALLDHIGGECAGAV
TLLEPGQSLSSPGQGDDVQWLSEEEIVAILDELPHRPMLAGKDGVRLSLA
GTQDKLPVVSDGARIGLPRNGSPSSHILKPAIRTLMDTVTNEGFCLALAE
AMQLKPAKSQVHSVLGRQFLLIERYDRVVDAQGQRQRLHQEDFCQALGVV
PEMKYQNEGGPDLVQCFDLVRRITRPSAPQILRLFDYVIFNALIGNHDAH
AKNFSLLYAGKSAILAPFYDVLSTAIYPTLTPKMAMKIGSKYKFSEVQTR
HWDQFSEAVGLGKAQARKRILALAKSMPPTARELQSSREHGFAGHAVVEQ
IVILIEQRCALTVRRLSAPAADMEDETVL
>NE0644 hisA, hisA; phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase protein
MLIIPAIDLKDGHCVRLKQGIMENATVFSENPETVALHWLDNGARQLHLV
DLNGAFAGKPKNGEAIRAIVEAVDGRIPIQLGGGIRDLETIEYYLDNGIT
YVIIGTAAVKVPGFLHDACYAFPGQIMVGLDAKSGKVAVDGWSKVTGHDV
IDLAKKFQDYGVEAIIHTDIGRDGMLSGLNIEATVELAQALTIPVIASGG
VTNLDDIRKLCEVQSEGITGVITGRAIYQGSLDFKEAQALADQLDAATI
>NE0646 hisB, Imidazoleglycerol-phosphate dehydratase
MLMRTAQVTRNTQETQITVTINLDGQGKAELDSGVPFLDHMLDQIARHGM
FDLSVAAKGDLHVDAHHTVEDIGITLGQALNRAVADKKGLVRYGHAYVPL
DEALSRVVIDLSGRPGLQFNTTFTRAVIGNFDVDLIQEFFQGFVNHAMVT
LHIDNLTGKNAHHQAETIFKAFGRALRMAVTTDPRCDNLIPSTKGVL
>NE0647 hisC1, Aminotransferases class-I
MTSPSPDQVIRQEILALSAYHVPPAKDMVKLDAMENPYRLPPFLCEEISR
IAADTSINRYPDPHAAALKEVLSTTLSVPAGMEIMLGNGSDEIIQIIMLA
AAKPEAKLLTIEPGFAMFKMIATFANMQYIGIPLKPDFSLDIDRMLAAIE
RHQPSVIFLAYPNNPSGNLFDTSALEKIIEISPGLVVIDEAYHPFAGKSF
IGRLADYPNLLVMRTLSKLGLAGLRLGLLAGRPEWLSHLEKLRLPYNVNV
ITQLVATKIMQHYDVLQQQADAIRQTRTRLRTFLENLNGIEVFPSNANFI
LFRLDGASQIFRLLQQHGILVKNLDNSHPLLKNCLRVTVGTPEENDRFCN
TLQDLIAGN
>NE0336 hisC2, Aminotransferases class-I
MMNLSELAPDYIRAIQPYQPGKPISELVRDLGLNKDEVVKLASNENPLGT
SPLAKEAMIQALNESARYPDGSGFELKAALSERLGMPADQIVLGNGSNDV
LELATRIFLHPDSTAIYSQYAFAIYPLLAQAVGARGIAVPARNYGHDLEA
MLAAVTPETRIIFIANPNNPTGTLCDARDLLRFMERVPQDVLVILDEAYD
EYLPEANKANSIAWLKNFQNLVITRTFSKAYGLASVRVGFALAHADIANL
MNRIRQPFNVNSIGLAAAQAALKDVEFVKLAYMTNRTGMQQMTHGLDQLG
IEYIPSFGNFVCCHIDGHPANTLKIYRNLLQQGVIVRPLGNYDMLNHLRV
TIGIEEENKRFLQALEQALKELD
>NE0872 hisD, Histidinol dehydrogenase
MIKIRRLSSVDDHFQAELDQLLSFEVSVDSEIERTVTQILHQIRTHGDRA
LLELTRQFDNPDIDRIEEIELPRDEWQSALMSLDKVQREALEQAASRIRA
YHEKQLAQSWDYVELDGTRLGQKITALDRVGLYVPGGKAAYPSSVLMNAI
PARVAGVRELIMVTPTPKGEKNPLVLAAAAICEVDRVFTIGGAQAVAALA
YGTTTVPKVDKIVGPGNAYVAAAKRHVFGTVGIDMLAGPSEILVICDGKT
NPDWIAMDLFSQAEHDEQAQSILLCPDKAFLDRVADSISRLIDTLPRRDV
IRSSLENRGALIHVRDLEEACMIANRIAPEHLELSVDEPEQWVDSIRHAG
AIFLGRYTCEALGDYCAGPNHVLPTSGTARFSSPLGVYDFQKRTSLIQVS
AAGASRLGETASILAKGEGLDAHARSAESRYQ
>NE0641 hisE, Phosphoribosyl-ATP pyrophosphohydrolase
MTASTILQRLARTIEARKNADPSISYTAKLLNSSQDKVLKKIAEEAAETI
MACKDNDREQIIYETADLWFHCLIMLTRHDISPEDILRELERREGISGIE
EKLSRSQPNKTE
>NE0643 hisF, hisF Imidazoleglycerol-phosphate synthase
MGLAKRIIPCLDIKDGRVVKGVNFVSLRDAGDPVEIARSYNEQGADELVF
LDITASSENRDLILHIVEKVAAQVFIPLTVGGGVRKAEDVRRLLNAGADK
VSINTSAVLNPMLIKESADHYGSQCIVIAIDARQIPDANPESPRWEVFTH
GGRKPTGIDAIEWAQKIQALGAGEILLTSMDRDGTRSGFDLTLTRAISDS
VDLPVIASGGVGHLDHLVEGILAGHADAVLAASIFHYGEYSILQAKQYLS
SHGIEVRL
>NE0871 hisG, ATP phosphoribosyltransferase
MPDITIALSKGRIFEDTIPFLKAAGIVPSDDPDTSRKLIIGTNRPDVRLV
MVRATDVPTYVQYGAADLGVAGKDVLLEHDGIGLYQPLDLKIARCRMMVA
VRDDYDYASAVFRGARLRVATKYVKTARNHFAAKGMHVDLIKLYGSMELA
PLVDLADAIVDLVSTGSTLKANHLQAIEEIMPISARLIVNQAALKLKNTA
IQPLLETFSAAVPKNL
>NE0645 hisH, Glutamine amidotransferase class-I
MNSIAVVDYGMGNLRSVSKALEYVDSSAVVTVTSDPETIRSAARVVVPGQ
GAMPHCMQALDDQGLRESVIEAAKNKPFLGICLGLQMLFEESEEGNIRAL
GILPGRVKKLESTDTADSDIPKIKIPHMGWNQVHQTLEHPLWHGIDTDTR
FYFVHSYYVATDEPQIVAGSTEYPVPFTCAVARDNIFAIQFHPEKSHSAG
LALLSNFLKWTP
>NE0642 hisI, Phosphoribosyl-AMP cyclohydrolase
MTDKWLDTINWSADGLIPAIAQDKNNGKILMVAWMNREALKRTVESGEAV
YWSRSRKKLWHKGEESGHTQKISAIHLDCDEDILLLSVEQKGGIACHTGR
QSCFFRQLKNGEWVVTEPVIKDPSQIYTK
>NE0150 hisS, hisS; histidine-tRNA synthetase protein
MNDILPDESGLWAFFEDTIRTWLAAYGYRNLRTPIVEQTDLFVRSIGEVT
DIVEKEMYTFVDHLNGENLTLRPEGTASCVRAVIEHNLLYAGPQRLYYSG
PMFRHERPQKGRYRQFHQVGVEALGFAGPDIDAELIVMCARLWRLLGISD
VRLEISTLGSTESRSVYRARLISYLEKFCDDLDEDARRRLKTNPLRILDS
KNPAMREILAGAPRLFDDLDEDSLAHFEALQRILRNQDISFEINNRLVRG
LDYYNRTVFEWVTDKLGAQGTICAGGRYDGLIAQIGGKPAPACGFAMGIE
RILALMGENGAVGAHAIPDIYVVHQGEAAAEFSWKVAESLRDDGLKVVLH
SGGGNFKAQMKKADASGARFAAIIGEDEVVAGQISIKPLREAAEQIRVDL
AGAAALLGNI
>NE0640 hitA, HIT (Histidine triad) family
MEDCLFCKIVRGEIPATKIHEDEDTLVFLDIHPAAPVHLLVVPKQHIGSL
SEVDASHQQLLGKMLWLAPRLAASQGCTDGFRTIINTGRVGGQEVFHLHL
HVIGGKDRLPAMVHHD
>NE1137 holA, putative DNA polymerase III (delta subunit) protein
MRLDPEHLARQLDGSIAPLYVVLGDELLLVMEAVDGIRAYVRGQGYTERT
ILTADQRFDWMNLFQWGRQSSLFSERRMLDLRIPSGKPGREGGVAIETFC
RELPRDTVTVVTLPEIDKQGRASKWFKALEQAGQVIEVKPVGRDRLAHWI
KQRLDRQNQMIDQDTLQFFAGKVEGNLLAAHQEIHKLGLLYPPGRLTFEQ
VKNAILDVTRFDVLQLPETMLTADMVRYRHILEGLQGEGVAPPLILAILS
EQIRLLIKIHLLKNSSRGMTIEQAMTALRIWPARQKLMMGAIQRIRYPLL
VQALLQAAVIDRIIKGVEQGDIWEELLNLGICFAADSSFKIIGRKDLSFI
INLSLK
>NE2180 holB, putative DNA polymerase III (delta' subunit) protein
MATAEIFPWQRVIWQQARQSGSAQRHHALLLKGRRGIGKLGFALALAKSI
LCGQGDAAGVACGKCQDCYWFEQGLHPNFRLLEPEALSAQEGATDKDDEE
NRREAGSTKSGRKPSQQISIAQIRALDDFIYLSAHQARDKVVLIHPAEAM
NTAAANALLKKLEEPPPEVLFILVTHNVSLIPPTVLSRCRQTAMPGPDHE
MAKDWLIHQGITDPDFHLAMSGFSPLLALQYDERLAASHTDFIQCLCAPE
RFDPIELAEKLHKLDLSSVTGWLQKWCYDLMSCRTSGRVRYHLKQVAVIR
QQAAVIDPVAFGFLWRNLIASQQLARHPLNPRLFLEAMLLTYMDSIRPAG
SAG
>NE0442 holC, putative DNA polymerase III (chi subunit) protein
MLIACRLCAKAVQQGLKTVVYVPDERLAGQFDKLLWTFTPTGFVPHCRVD
NKLADVTPVIMNSRPVLMEAGCFGVLLNLDADVPPGFEQFPRVVEIVDEA
EDGKLQARKRYRHYQEQGHDVRHHRLDGN
>NE1687 hpaI2, putative 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase protein
MSLKTRLARHELTIGSWITLGHPSIAEIMAKAGFDWLVLDTEHSVLELSE
VQAIIQVLDGQQCPAIVRLTSNHPDQIKRVMDAGASGIMVPMIKSAEDAR
AAVNAVYYPPEGTRGVGLARAQGYGSAFQAYRQWLRDNAVVIAMIEHIQA
IEDIDAILSTPGIDAYIIGPYDLSGSMGRPGELEHPDVQAAIRQILQAGI
RHHKAGGVHVIEPDPAMLQQRIEQGFTFLGYSLDIRMLDSLCRNHLNIIK
APV
>NE0101 hprA, D-isomer specific 2-hydroxyacid dehydrogenase
MLTVFLDFGSVTRGDIDRTVLEQVVSPWVYHDNTSREQVAERIREAEIVV
SNKTLLDRSALDAANKLKLICVAATGYNNVDLIAAAERNIPVCNVRNYAT
GSVAQHVFMFMLNFACRFVEYQQLIKRGGWQASSYFCPLDFGITELAGKT
LGIVGYGELGNAVANIAKAFGMKLLIAEHKSASTIRPGRTAFDEVIRQTD
FITLHCPLSEDTRHLISNRELNLMKPSAYLINTARSGLIDETDLLKSLYS
KHIAGAAIDVLKEEPPVSGNPLLDYPHPNLIITPHSAWASVESRQRMLNL
LADNIRNFLHNKPFNQIKDALA
>NE1762 hptG, probable hptG; chaperone (heat shock protein htpg)
MQTAENIEHLNFQAEANQLLKLMIHSLYSNKEIFLRELISNASDAADKLR
FEGLSDAALYESDPDLKIRIAYDKEARTITIIDNGIGMSRQEVINNIGTI
AKSGTREFFDSLTGDQAKDANLIGQFGVGFYSAFIVADKVTLTTRRAGLT
IEHGVRWESGGEGEYTLETVEKPDRGTEIVLHLREGEDELLSSFQLRSII
RKYSDHITLPIVMKKEVWDDESKSYRLSDEDETINQASAIWARPKNEITQ
EQYDEFYKHVAHDFEPPLAHVHARVEGKQEYIQLLYIPAHAPFDLFDREH
RHGLKLYVRRVFIMDDAEKLLPGYLRFVRGIIDSSDLPLNVSREILQESK
DIDSIRAGSVKKVLGLIEDLAMSDKSEDQEKFKTFWREFGQVLKEGIAED
YSNRERIAKLLRFTSTHDEREEQTVSLDDYIARMKPEQEKIYYITADGLK
AAQSSPHLEIFRKKGIEVLLLCDRIDEWLVANLNEYTGKSLQSIAKGNLD
LGKLEDEEEKKEHEKEAGDFQELTNKMKEVLGEQVKDVRITYRLTESPAC
LVADTHDVSGNLGRLLKSAGQKVPDSKPFLEINPHHPMIQRLKYEEAKFA
DWSHILFDQALLAEGGQLEDPAGFVKRLNDLLLQNILSGK
>NE1477 hrcA, Negative regulator of class I heat shock protein
MLNEREKILLKTLVERYIHEGQPVGSRSLAKFSGLDLSPATIRNVMTDLE
EMGYVSSPHTSAGRMPTTLGYRFFVDTLLVVKSLDNEQITLLENQLHPNN
PTHLMNVTSRLLSELTRFVGVVVTPKRMGGAVFRHIEFVALTEKRILLIL
VTPEGDVQNRIILTETAYSQSDLIEAGNFLNQHYAGCTLEEIRSGLQREL
TQLRRDMTGLMNAAIEIGNDALQESSEAVVIAGEHRLFDVRDLSDNLSSL
KSLFEMFERKSKLLQLMELSRQAHGVKIFIGGESDETMLEEVSVVTAPYE
MEGKIVGTVGVIGPRRMAYERIIPIVDITAKLLSSNLS
>NE0833 hrpA, HrpA-like helicases
MTYLPEQPACITYPEDLPVVARREEIAHAIQQHQAIIICGETGSGKTTQL
PKICLELGQGAGRQGTGHLIGHTQPRRIAARTVAARIAAELNSPLGKLVG
YKVRFSDQTHPNTRIKLMTDGILLAETQQDPLLRAYQTIIIDEAHERSLN
IDFLLGYLKQLLPRRPDLKLIITSATIDAQRFASHFNDAPIIEVSGRLFP
VEIHYRPNDPIDGEDRDLPRAILSTIDEAMRMGEGDTLVFLPGEREIRET
AETVRKYAFSGPGGKAGLEILPLFARLSHTEQARIFAPGQQRRIVLATNV
AETSLTVPGIRYVIDTGLARINRYSYRNKVEQLLVEKISQASANQRAGRC
GRVMNGVCFRLYSEEDFNARPEYTDPEILRSSLAAVILRMKSLKIGDVEQ
FPFIQPPAPRMIADGYQLLSELGALDERKGLTQIGHQLARFPTDPRIARM
IMAAKQENCLSEVLIIAAALSLQDPRDRPFEHQQAADQAHQPFRDDRSDF
MGYLKLWDFYDELLKHKKSNKKLIEQCQKNFISHRRMREWREIHGQLHIL
ISEMGLRPNQVSAGYDEIHRALLSGLLGNIGFKSDEKGVYEGARAIKFSI
FPGSSLRKKQPKWVVAAELAETTKLYARCAAAIDPAWLERIAGKLCKRHY
FDPHWEKQRAQAMAFERITLYGLTIVPKRRIAYGPIDPAHAREIFIRQAL
VAGEYESTAPFLQHNQQLIDEIRELESKVRRQDILVDEQQIFEFYAARIP
AGIYSGTAFEKWRKQAEQTEPELLYLTREVLIRQAVDGTAAEQFPETLTA
AGHVLPLSYRFDPGHPLDGVTVTVPLPLLNQIMPFHFDRLVPGLIREKIG
WYVKMLPKQVRRHAIPVPQFVTRFLEWLDSCPDQAMLLAESLTAFIRSET
GIKVPLDTWDSRLLPVHLQMNVKVIDDAGMTLGMGHDLIELKAQFGQTAQ
QLFARGAGAEPDSIERDDITRWDFGELPVETRFSRAGKLLTGYPALVDQE
QSVAVRIFDTQEGAQRSMRGGVLRLLCLALKDRIKQLEKNLPVDRQAILL
MSSLIEMDRLKEDIRSAIIDLALIGDDPLPRNEDEFNSQTSRARTRLGSV
SQEIAGLIHTIAQPCQELKKRLSVLDKSAVFLKKDMEEQLHHLIYPGFLS
TTRWQYLQHLPRYLKGMILRLDKYNKNPARDQEQTEIISTLWNQYIQRLN
KHRQAGVIDPNMEIFRWQIEELRISLFSQELKTPAPVSVKRLQKLWESVR
E
>NE0385 hsdM, hsdM; site-specific DNA-methyltransferase, type I modification
MKPIEQQFFNDLEKKLWTAADKLRSNLDAAVYKHVVLGLIFLKYVSDAFE
ERQRELREQFTNPQHDYYMDPEEYGGAGTPEYEDNIAAELEVRDYYTEKN
VFWVPVEARWQTLRDCAQLPPKAALPWNKPGKDEPEEMRSVGWLIDNAME
AIERENIRLKNVLNKDFARVQLDSSKLGELIALFSDTDFAAKTYKGQPLS
LQSRDILGHVYEYFLGQFALAEGKKGGQYYTPKSIVTLIVEMLQPFKGRV
YDPAMGSGGFFVQSEEFIGQHGGKAANGKSGQISVYGQESNPTTWRLAAM
NMAIRGIDFNFGSGPADTLLNDLHPDLRADFVMANPPFNMKEWWNEKLAN
DPRWIAGTPPQGNANFAWLQHMLWHLAPTGSMALLLANGSMSSNTNSEGE
IRKRLTEDDYVECMVALPGQLFTNTQIPACIWFLTRDKQNGFALDKKKRD
RRGEFLFIDARQMGYMKDRVLRDFTVDDIQKIADTFHAWQQGDGYEDVPG
FCKSASLEEVRKHEHVLTPGRYVGTAEQEEDGEPFADKMQRLTAQLAEQF
AESARLETEIRKNLAGLGYGW
>NE0382 hsdR, Restriction enzymes type I helicase subunits and related helicases
MITEQKLEDTAVGWFAELGWPHANGPEIAPDGDQPARTDYRQVVLREHLL
AALARINPHIPAAALEQAAHELLTVGEPLLIARNRRVHRLLLSGIPVEFS
VGDEKRSDLVNLIDFANPRNNDFLLVSQFTVRATRQPRRPDLVAFVNGLP
LVVIELKNPANEQTDIWDALNQIQTYKEEIGDLFNTNVAVVVSDGFTARL
GSLTANQERMQPWRAIANEDDRPLLEFELETLVRGFFEPALFLDYVRHFV
LFEQDADQIIKKIAGYHQFHAVREAVKATVIAASNPNKGLLEVQEPRATY
GKEVQPGSRKAGVVWHTQGSGKSITMACYAGKLLQQPEMKNPTLVVVTDR
NDLDGQLFATFCAAEDLLRQTPVQAGSREELREILASREAGGIIFTTVQK
FALLDDEEQHPLLSDRSNIVVISDEAHRSQYGMKGRLDTKTGKYVFGYAK
HMRDALANATFIGFTGTPIALEDKDTRAVFGDYVSIYDIQDAVDDGATVP
IFYESRLAKLDVNQAEIDALNAQVDEVIEDEEDITAREKTKSDWAALTKL
VGAQPRLEQVAADLVQHFETRTATLEGKVMIVCMSRDICAELYNATVALR
PDWHDADPEKGAIKVVMTGSAADKPLLQPHLYSQQVKKLLEKRFKDPADP
LKLVIVRDMWLTGFDAPCCHTMYVDKPMKGHNLMQAIARVNRVFRNKPGG
LVVDYIGIANELKAALKTYTESKGKGDPTHNAAEALAVLLEKLDIVRGLM
HGFDYRGFETDAMKLLVPVANHILGLKDGKQRFLDAMLAVSKAFSLCSTL
DETAALRTEIAFFAAVKAAIVKFTTVDRKRSDADKNSALKQILDNAIVAD
GVADIFALAGLDKPNIGLLSEEFLEDVRQMKNRNLAVELLEKLLSDEIKA
RARNNVVQEKKYGDRLLETLRKYHNRAVETAQIIEELIQMAKDFQAALER
EAALGLNPDEIAFYDALANNESAVRELGDDTLKKIAVEITEKLRNSTTVD
WQVRESVRAKLRILVRRTLQKWKYPPDKQLEAVELVLQQAEVLSNSWSK
>NE2499 hsdR, DEAD/DEAH box helicase
MSETEAQTRADLIDQQLALSGWNVKDPTQVIEEFDILTVLTEGVAEPRAP
YQGHQFSDYVLLGRDGKPLAVIEAKKTLRDAALGREQAKQYCYNIQKQLG
SELPFCFYTNGHELYFWDLENAPPRKVVGFPTRDDLERFAYIRRNRKPLT
QEFINTSIAGRDYQIRAIRSVLEGIEQKKRDFLLVMATGTGKTRTCIAMV
DALMRAGHAEKVLFLVDRIALREQALATFKEHLPNEPRWPNVGEKLIAMD
RRIYVATYPTMLNIIRDDAQHLSPHFFDFIVVDESHRSIYNTYGEILDYF
KTITLGLTATPTDIIDHNTFQLFHCEDGIPTFAYTYEEAVNNVPPYLCNF
QVMKIQTKFQMEGISKRTISLEDQKKLILQGKEVEDINFEGTQLERQVIN
NGTNTLIVREFMEEAIKDANGVLPGKTIFFCATKAHARRIEEIFDKLYPQ
YHGELAKVLVSDDPRVYGKGGLLDQFTNNDMPRIAISVDMLDTGIDVREI
VNLVFAKPVYSYTKFWQMVGRGTRLLETSKPKPWCLEKEVFLILDCWDNF
EYFKLNPKGKELKPQLPLPVRLVGLRLDKIEKANDSGHADIASREIAKLR
LQIAALPKESVVIKEAAAALARLDDDNFWISLSHDRLEFLRAEIKPLFRT
VSEADFKAMRFERDLLEYSLAVLNEDKEQAETIKEGIVEQISELPLSVSF
VKQEEALIRAAQTSHYWAKADEEAFDELVAKLGPLMKFREQSTGQEQTHL
DLADELHKKEWVEFGPQHEAVSISRYREMVEALITELTEHNPVLQKIKNG
EAVNSQESNELAELLHEEHPHITEDLLRQVYKNRKARFIQFIRHILGIEV
LQSFPDEVSAAFGQFIRAHTTLSSRQMEFLNLLKNFIIEREKVEKRDLIN
APFTVIHPQGIRGVFSPAEINEILQLTEGLVAS
>NE2261 hslU, hslU; heat shock protein chaperone
MSYMTPQEIVHELDKHIIGQDTAKRAVAIALRNRWRRQQVDEPLRHEITP
KNILMIGPTGVGKTEIARRLARLANAPFIKIEATKFTEVGYVGRDVDSII
RDLVESAIKQAREREIRKNQPLAEDRAEERILDALLPPARDLGFEASPSE
ESNATRQKFRKKLREGELDDKEIEIEVAMAQTSMEIFAPPGMEELTSQIQ
GMFQNMGSGKRKMRKLRIREARKLLTEEEAARLVNDEELKLGAVQNVEQN
GIVFLDEIDKITSRSEVSGSDVSRQGVQRDLLPLVEGTTISTKYGMIRTD
HILFIASGAFHLAKPSDLIPELQGRFPIRVELESLSAEDFKQILTNTDAC
LIRQYQALLKTEGIELNFSEDAIGRLAEIAFSVNERTENIGARRLHTVME
KLLEDISFNATRYGGSTHVIDAVYVDERLGKLSQSEDLARYVL
>NE2260 hslV, Multispecific proteasome proteases
MTTIVSVRRGRQVALGGDGQVTLGAVVAKASARKVRRLYHNKVLAGFAGG
TADAFTLFERFEAKLEKHQGHLMRSAVELAKDWRTDRILRRLEAMLVVAD
HEATLIITGAGDVIEPEQGIAAIGSGGAYAQAAARALLENTDLSPKEIVT
KALTIAGDICIYTNQVHVIEQLD
>NE1498 hss, putative homospermidine synthase protein
MDRYEKFAAPAGKFILIGFGSVGQGFLPLLLRHFDLPRERILIITGDDTG
RAQAEKEQISFKVCPLTPGNYREILNSIVEHGDFLLNLSVNVSSLALIEY
CHEREILYLDACIEPWPGGYTNIHISPSERSNYGFREQALALRQKLGGGT
TAVLCHGANPGLISHWVKRALLNIAHDIEGITTVPRTQDEWGQLAQKLDI
KIMQCAERDTQIARPFKHINEFVNTWSIDGFIGEGQQPAELGWGSHEKYE
PADARHHRFGCDAAIYLQRPGMRTQVLGWTPLTGTYQGFLITHSESISIA
NYFTLKNGNDAHYRPTVYYAYHPTDATMLSINEFSGSNFRLQNNVRLLMH
DIEQGTDALGVALMGHARGIYWYGSMLSIEQARDLAPYNNATSLQVAAGI
LGGTVWAIENPASGIVEPDEMSFERVLQVATPYLGEVTGRYSDWTPLQGR
QALFEEDLDTSDPWQFKNFRIS
>NE1421 htpX, Peptidase family M48
MMKRIFLFIVTNLAILLMLSITLRLLGVDRILDAEGSELNFNALLVFSAV
LGFGGSLISLAMSKWSAKHMTGAMVIDVPSNSTEGWLVETVRRQAKAAGI
GMPEVAIYDSPDINAFATGMNRNNALVAVSTGLLQKMNRDEAEAVLAHEV
SHVANGDMVTLALIQGVVNTFVIFLSRIIGHIVDRAVFKSEEGHGPAYFV
TSLIAQMVLGILATIIVMWFSRQREFRADAGSAQISGRNKMVAALRRLQQ
EYEPSHLPDKIAAFGISGQKSQIGRLFMSHPPLEERIQALQSA
>NE2207 hupB, Bacterial histone-like DNA-binding protein
MNKSDLIDVIAQSADLTKAQAGNALDGALSAIKDALGKNDSVTLVGFGTF
KVGKRAARTGRNPRTGAEIKIKAAKVPKFTAGKALKDAVN
>NE1730 icd, icd; isocitrate dehydrogenase [NADP] oxidoreductase protein
MYQHIKVPTTGSKIRVKDDSSLVVPDNPIIPFIEGDGIGVDITPVMISVV
DGAVKKAYGDSRKISWMEIYAGEKSTRVYGQDIWLPDETLQAAKEFVVSI
KGPLTTPVSGGIRSINVALRQLLDLYVCLRPVRYFQGVPSPVKQPEKVDM
VIFRENSEDIYAGIEWEAGTEAVEKVINFLTAEMGITKIRFPRSSAIGIK
PVSKEGSERLIRKAIQYAIDHKRRSVTLVHKGNIMKFTEGAFKKWGYELA
AKEFGAELLDGGPWMILRHNGNEIIIKDVIADAFLQQILLRPDEYDVIAT
LNLNGDYISDALAAQVGGIGIAPGANMSDSIAIFEATHGTAPKYAGKDQV
NPGSIILSAEMMLRHIGWHEAANIVISSMEKTIQDRIMTYDLARLTPNAK
QATSSEFGNAMVARM
>NE0952 ihfA, Bacterial histone-like DNA-binding protein
MALTKAELTDLLFENIGLNKREAKEIVECFYEEMRAALQNGDGVKLSGFG
NFQLRTKPQRPGRNPKTGEEIPISARRVVTFHASQKLKSMVEANYRGESG
TN
>NE1961 ihfB, Bacterial histone-like DNA-binding protein
MTKSELISKLAERFPQLLAKDAELVVKIILDAMAKSLSRGERIEIRGFGS
FDLNYRPSRVGRNPKSGEKVHVPEKYVPHFKAGKKMRELIDSGPKQHKVL
DRVTG
>NE1149 ileS, t-RNA synthetase, class Ia:Isoleucyl-tRNA synthetase
MTIDYKKTLNLLDTSFPMRGDLARREPAMLKAWQERNLYRKIRAISQGRP
KFILHDGPPYANGDIHIGHAVNKILKDIIIKSKTLSGFDAPYVPGWDCHG
LPIEHQIEKKYGKHLPADQVRKLCRAFAQEQIDRQKADFMRLGVLGDWEH
PYLTMNYAIEAGIIRALGKIFRNGHLYQGQKPVNWCIDCGSALAEAEVEY
ENKRSPAIDVGFEIIGRKDVCHPLAGVMAAEIPAGTRIFAVIWTTTPWTL
PANQAVCVHPEFDYSLVSTSRGWLLLASELTNACLARYQLEGRVVATCKG
LALEGLSLQHPFADRIVKVICGRHVTLEAGTGLVHTAPAHGLDDYFIGQQ
YGLPSDSPVKGDGKFSEQISLVGGMFVWKANDVVIDTLRSSGHLLHAEEI
EHSYPHCWRHKTPIIFRATPQWFIGMQRQTAESQSFGESLRDLALRAVEL
TRFYPAWGRARLEAMIGNRPDWCISRQRNWGVPMTFFIHKEAHTLHPRTP
ELLEKVAGLVEQQGIEAWFSLDATELLGEEAQHYQKLTDTLDVWFDSGTT
HETVLKQNIQLRHPADLYLEGSDQHRGWFQSSLLTGCAIDGCAPYTALLT
HGFVVDGQGYKMSKSKGNVIAPQKIADTLGADILRLWVASTDYSGELSIS
DEILKRTVETYRRIRNTLRFLLANLADFNLAADALPPAEWVEIDRYMLAY
TAALQNDLLGFYERYEFHQAVARLHHFCSEDLGGFYLDILKDRLYTSMAN
GIPRRSAQNALYHIVHSLVRLFAPVLSFTAEEVWQELGESAEDSVFLHTW
HCFPDQSEILSDAQILIPRWQRLRELRARVLKQLEDARIQGEIGSSLAAI
VEIHAAGEDFALLDSLGDDLRFVLITSEVHLQRVDDAAGEVIRVTASPHM
KCERCWHYRQDVGSVPEHSSLCSRCVSNLAGSGEYRRFA
>NE0701 ilvA, ilvA, threonine dehydratase
MVNDYLERILTAHVYDVASETPLDFAPSLSARIHNRVLLKREDMQRVFSF
KLRGAYNKMARLSPDVLQQGVVAASAGNHAQGVALAAQKLGCSATIVMPA
TTPQIKINAVMAMGAAVLLQGDSYNDAYEYARNLAEEKEAEFVHPYDDPD
VIAGQGTIAMEILRQHPGRIHAIFVPIGGGGLISGIAAYVKRLRPDIKVI
GVEPVDSNAMYRSLQAGYRVELSQVGLFADGVAVRLVGEETFRLCRELVD
DIILVDSDAICAAIKDIFEDTRSIVEPSGALSIAGLKAYVKQKNLLHEVL
VAIASGANMNFDRLRHVSERAELGEQREAILAVTIPEKPGSFKKFCNLLG
ARNITEFNYRYADPEMAHVFVGVSVRNREESLHLIEILQENGLKTEDMTD
NEMAKLHIRHMVGGRSREVMHEILYRFEFPDRPGALMNFLNHMSDHWNIS
LFHYRNHGADFGRVLIGIQVPPEEKQEFREFLDKLGYSFWDENDNPAYQL
FLA
>NE1323 ilvC, probable ketol-acid reductoisomerase oxidoreductase protein
MKVYYDKDADLSLIKKKKVAIIGYGSQGHAHANNLNDSGVKVTIGLRKGG
ASWDKAKKAGLTVKEVGAAVKDADVIMLLLPDEQMASVYQTEVEPVLKKN
ATLAFAHGFNVHYGQIIPREDLDVIMVAPKGPGHLVRSTYTQGGGVPSLI
AVHQDKSGKARDLALSYAAANGGTRGGVIETTFKEETETDLFGEQVVLCG
GLTSLIQAGFETLVEAGYAPEMAYFECLHEVKLIVDLIYEGGIANMRYSV
SNNAEYGDVSRGPRIITDATRNEMRKILREIQTGEYAREFILENRAGAPV
LKSSRRLASEHPIETVGAKLRDMMPWISKNKLVDQAKN
>NE1893 ilvE, Aminotransferases class-IV
MSMADRDGVIWYDGEMVPWRNATTHVLTHTLHYGMGVFEGLRAYQTPGGP
AIFRLPEHTERLFYSAHVFRMKIPFDQPTLIQAQLDVVRQNRLTSGYIRP
IVFYGSEAMGLSAKNLSVHVAIAAWSWGTYLGADALENGIRVKTSSFTRH
HVNINMCRAKSVATYTNSILAHQEVAQDGYEEALLLDVDGYVAEGSGENI
FIIKKGKIYTPDLTACLEGITRASVIQLAQEAGIEVIQKRITRDEVYCAD
EAFFTGTAAEITPIRELDCRMIGNGKRGPVTEQLQTAFFDCVNGKSDKHA
DWLHYAD
>NE1324 ilvH, probable acetolactate synthase isozyme III (small subunit)
MRHIISLLMENEAGALSRVAGLFSARGYNIESLSVAPTEDPTLSRMTLVT
NGPDEIVEQITKQLNKLIEVVKLIDLSSEGYVERELMLVKVRAVGKDREE
MKRLADIFRGNIIDVTNELYTIELTGTRSKLDGFLQAVDCNLILEIARTG
VSGLSRGERVLKL
>NE1325 ilvI, Thiamine pyrophosphate dependent enzyme
MSTELTGAEITIRCLQEEGVSHIFGYPGGAVLFLYDELFKQDKVKHILVR
HEQAALHAADGYARSSNKVGVALVTSGPGVTNAVTGIATAYMDSIPMVII
SGQVPTAAIGQDAFQEVDTVGITRPCVKHNFLVKDVSELAVTIKKAFYIA
STGRPGPVLVDIPKDVTQQKAEFHYPASISLRSYSPVINGDAQQLRKAVQ
MILEAKRPMVYTGGGVILDDAAAELTELVQMLNFPCTSTLMGLGGYPATD
RQFVGMLGMHGTYEANMAMQYCDVLIAVGARFDDRVIGNPKHFYSEERKI
IHIDIDPSSISKRVKVDVPIVGSVSAVLKELIMLLRAGRETTDVHALNKW
WEQVELWRARDCLKYDRSASVIKPQLVVEKLYEITGGDAFITSDVGQHQM
WAAQFYKFDKPRRWVNSGGLGTMGFGLPSAMGVQMANPGSTVACITGEAS
IQMCIQELSTCKQYRLPIKIINLNNRYMGMVRQWQEFFHGNRYAESYMDA
LPDFVKLAESYGHVGMRIDKPEDIDGALREAFKLEEQLVFMDFITDQTEN
VFPMVPGGKGLSEMILV
>NE0422 infA, S1 RNA binding domain:Translation initiation factor IF-1
MAKEETIQMQGEVIETLPNATFRIKLENGHIVLGHISGKMRMNYIRILPG
DKVTVDLTPYDLTRARITFRTK
>NE0957 infC, Initiation factor 3
MSIAEAKEMAEEAGVDLVEIAPGAEPPVCRLMDYGKYLYQESKKKHDAKL
RQKQIQVKEIKFRPNTDEGDYGIKLRNLIHFLNDGDKVKITLRFRGREMA
HQEFGVRLLERVRGDLADHAVVEQFPKLEGRQMVMVLSPKKKEVKVGKSA
KQSETDAKSEAITQ
>NE0450 int, Phage integrase
MQWIKRFILFHGKRHPQEMGSAEIEAFLTHLAVAGKVSASTQNQALSALL
FLYKEILSIDLPWLNEIVRAKQPQRLPTVLTRTEVQAILVRMSGTYGLMA
NLLYGTGMRLMECVRLRVKDVDFERGEILIRDGKGSKDRVTMLPESLAGP
LQAHLLHRRTLFDDDSRLGKASVYLPDALERKYPNAATDWVWQYIFSSGS
FSIDPRSGTERRHHIDEKLLQRAMKKAVQASGITKLATPHTLRHSFATHL
LDSGYDIRTIQELLGHKDVHTTMIYTHVLNKGGRGVRSPLDM
>NE0235 intF, Phage integrase
MLTKVRLTPSRIAAHTCPADASQAFLWDTATPGLAVRATAGKRAFIFQGR
FAGKSIRITIGDTEVWTIEQARQRARELQGLVDQGRDPRLVKQEKIAADV
QARITDEPALPAWRDYIAARSGKWSEAHAADHLKMARDGGEPVTRGRRIG
APAYTEKGILRPLLDLPLKGITREKVAQWLDNEATRRPAQARLALSLLGT
FLSWCGNQPAYRNQVNSDACAKLKRELPKPTARTDCLQREQLASWFAAVR
SIDNPVMSAYLQSLLLTGARREELAGLGWEDVDFQWQTIHLADKVEHSGR
TIPLTPYVSQLLQSLPKINEFVFASKRAKSGRLQEPRKAHNQAIEAAGLP
PLSIHGLRRSFATLSEWVEAPSGITAQIMGHKPSAIAERHYKRRPVDLLR
VWHTKIEEWILSNANI
>NE2189 intINeu, Integron integrase; Phage integrase; Phage integrase N-terminal SAM-like domain
MGNTNTPPKLLDQVRDRIRIKHYSLRTETQYVQWIKRFILFHGKRHPQEM
GAAEVEAFLTHLAVVGKVSASTQNQALSALLFLYKEVLSIDLPWLDKVVR
AKQPQRLPVVLTRTEVQAILVRMSGTYGLMANLLYGTGMRLMECVRLRVK
DVDFERGEILIRDGKGAKDRVTILPESLVSPLQTYLLQRRVLFDDDIRLG
KASVYLPDALERKYPNAATDWIWQYIFPSGSFSIDPRSSVERRHHIDEKL
LQRAMKKAVQTSGITKLATPHTLRHSFATHLLDSGYDIRTIQELLGHKDV
HTTMIYTHVLNKGGRGVRSPLDM
>NE1827 ipk, ipk; 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase
MDIFPAPAKLNLFLHVIGRREDGYHLLQTVFRFIDHSDRLHFDITHDGVI
RHENLIPGLTETDDLCVRAAKLLRQRFGRESLGVKIHLEKNIPLGGGLGG
GSSDAATTLIALNRLWGINWKRERLMALGLELGADVPVFIYGRNAFAEGV
GEELHAVDLPSAWYVVLTPPVQISTAAVFTSKELTRNTIPIKMAAFSMGQ
GHNDLEPVAMRMQPVIAGWLGWLKQQHGTTKVAMSGSGSCMFAEFPSESA
AREVFGRLPGDMSGFVVSGLARHPLSDF
>NE1160 ispA, Polyprenyl synthetase
MHPDFQSWASEQLALVETCLEKHLPETNCAPARLHDAMRYSVLGGGKRVR
PLLSFAAGELSGADKTHATIAGAAVELIHVYSLVHDDLPCMDNDTLRRGK
PTCHVRYDEPTALLVGDSLQSLAFQLLTETNLTEDPHVQLEMVRHLAFAA
GSRGMAGGQAIDLDSVGKTLSLPELEFMHIHKTGALIRAAVILGARCGNR
LDETQFKLLDHFAKCMGLAFQVIDDILDTEATTAALGKTAGKDAENNKPT
YVSILGIKQARELAHELRQEATGVLNQFGSEALRLQQVTDFIVQREF
>NE1915 ispB, Polyprenyl synthetase
MSIESIRNVIAKDMGEVDNVIQQKLHSQVALVRQVSEYIIHSGGKRLRPA
LVILAAGLFDYKGLHHHNLAAVVEFIHTATLLHDDVVDESELRRNQATAN
ALFGNAASVLVGDFLYSRAFQMMVEVGNMRVMEVLADATNTIAEGEVLQL
LNCRDPDVEEENYLQVIRFKTAKLFEAAARLGAILGGATPAEEEALAAYG
MHLGTAFQLVDDLLDYSGNDQDTGKNLGDDLAEGKPTLPLIFAMRSGIPE
QVDTVRHAIETGGKDGFEPILKIIQESGALDYAKQCAQNEANMAISAITA
IRDSIYKQCLLELAEFAITRNY
>NE0055 ispZ, hypothetical protein
MKFLFDLFPVILFFITYKVYGIYNATAVAIIATFVQIGWVWLRHRKVDNM
LWVSLAIIVVFGGATLIFQDETFIKWKPTVLYWLFAAVLFLANQIFEKNL
IRVMMKDQIRLPEPVWPRLNASWAVFFSVMGIINLYVAYNYSTDTWVNFK
LFGFMGLMFAFVILQAILLGKYVEKSDDQEI
>NE1886 katA, Catalase
MDDAKKKLTTSAGAPVADNQNVMTAGPRGPMLLQDIWFLEKLAHFDREVI
PERRMHAKGSGAYGTFTVTHDITRYTRANIFSQIGKKTDLFVRFSTVAGE
RGAADAERDIRGFAIKFYTEEGNWDLVGNNTPVFFLRDPLKFPDLNHAVK
RDPRTNLRSARNNWDFWTLLPEALHQITIVMSDRGLPATYRHMHGFGSHT
FSFINARQERYWVKFHFRTQQGIKNLTDAEAEALVGKDRESHQRDLYESI
ENGEFPRWTLFVQIMPESDAASLPYNPFDLTKVWPHKDYPLIEVGVMELN
RNPENYFAEVEQAAFNPASVVPGISFSPDKMLQGRLFSYGDAQRYRLGVN
HYQIPVNAPRCPFHHYHRDGAMRVDGNQGSTLGYEPNSYQEWQQQPEYAE
PPLTLEGAAAHWNHRVDEDYYIQPGNLFRLMNSSQQQVLFDNTARSIGEA
PREIQIRHITNCLKADRAYGEGVARALGIALSEVD
>NE1567 kduD, Short-chain dehydrogenase/reductase (SDR) superfamily
MNQPYKGTPFDLSGRAVLISGATGLLGTEFALAAASAGADLILGDLDGNR
LELLKNEIIASHPDVHVLIQVLDVTRADSCQSIAQLCEDRFGRIDGVVHS
AAIDPKFEQGSDTSRFSKFTEFPLALWQTSLDVNLTGAFQLAQATCRIME
KSGRGSVVFLGSNYGLVGPDQRIYKKAGQEAQTYKPAVYSVCKAGLLGLT
KFLAAYYMNTSIRTNLLTPSGVWNKHDSEFTGHYSSRTILGRMSEKEEYR
GAILFLLSDASSYMTGANLVIDGGWTAL
>NE1384 kpsE, Chain length determinant protein
MTQLQQSVIESQPGKLSDYLGHTRRNRFSIRLFSGLIGFILVLAVMATGY
WLVFASDRYVSEASVIIRKTDSVTAPSFDLSMLIAGVAGVNRADQLLLRE
YLLSVDMLKKLDTELDLHAHYSDEHYDLLSRMWLKDMEWFHRHYLSRVSI
EYDDYAGVLRINAQAYDPEIARAISNLLVREGERYMNQISHELAETQVSF
LTTQVNLAQLRFQQVRQKLLAYQNKKGLLSPQATVESLNTLIAKLEEQRA
QLKTQLASLPRTLDRNHPNLVMLRQSLAAIDRQITDEKAKLANPTGKTLN
ITMEEFQRLEMEVTFAQELYKAALTALEKGRIDATRMLDKVSVLQAPTLP
EYPMEPRRIYNTLVTLLFALMLAGILKLLEGIILDHVD
>NE0884 ksgA, ksgA; dimethyladenosine transferase
MRHTPRKRFGQHFLVDTSVIAEIIHIIHPVPGDRMIEIGPGLGALTKPLL
NVLDELQVIEIDRDIVDYLSRTYPGKLVIHNIDALKFDFSELGEGLRIIG
NLPYNISTPLLFHLSRFSSLITDMYFMLQLEVVERMVALPSTPDYGRLSI
MLQNRFEMEQMLVVPAESFDPPPRVQSAIVCMRPKAEPTIPLKHERLFAE
LVSAAFSQRRKTLRNTLRHYLTADDFERLEIDSGLRAENLSLAQYAAIVR
QVYEDRQ
>NE0899 ldhA, D-isomer specific 2-hydroxyacid dehydrogenase
MKITFFSTQPYDRESFLKHHIDTQFELVFLEEKLTEHTVSLASGSQAICV
FVNDNLNEAVIHQLSQLNVQLIALRCAGFNNVDIKAAHACNIRVVRVPAY
SPHAVAEHTLAMIMTLNRKTHKAYNRVREQNFSLNGLLGFDLHKKTVGVI
GTGHIGEVFCRIMHGLGCNILACDPVKKLEIEKMGIPYVPMNELFSRCDI
LSLHCPLNEETRYLIDSSVIAQMKTGVMLINTGRGGLIDTKAVIAGLKSG
KIGYLGIDVYEQEADLFFQNLSEQIILDDTIARLMTFPNVLITAHQGFFT
QEALDQIALTTFANIKRFVAGEIPANEVKI
>NE2327 lepA, GTP-binding elongation factor:Elongation factor Tu domain 2
MIQHIRNFSIIAHIDHGKSTLADRIIQFCGGLSDREMEDQVLDSMDLERE
RGITIKAQTAALHYQAKDGKNYLLNLIDTPGHVDFSYEVSRSLSACEGAL
LVVDASQGVEAQTVANCYTAIEQGVEVIPVLNKIDLPAADPDRVIAEVED
IIGIEAKGALRISAKTGEGVDQVLEMIVAQIPPPEGDVDAPLKALIIDSW
FDSYVGVVMLVRVVDGVLRPGNKILLMSSKANYLCEEVGVFQPKAVSHKS
LSAGEVGFIISGIKDLKSAKVGDTVTLADRPAGEPLAGFKEIKPQVFAGL
YPVESNQYDALRAALEKLQLNDASLHFEPETSQALGFGFRCGFLGLLHLD
IVQERLEREYDMDLITTAPTVVYQVVLRDGKITEIENPSRLPDLSSIEEI
REPIITATILVPEEYVGTVMTLCTGKRGIQKNMQYMGRQVMLVYEMPLNE
VVMDFFDRLKSVSRGYASLDYEFKEFRAADLVKLDILINSDRVDALSLIV
HRASSQHRGRELAQKMRELIPRQMFDIAVQAAIGAHIVARENVKALRKNV
LAKCYGGDITRKRKLLEKQKAGKKRMKRVGNVEIPQAAFLAILQVDGK
>NE2326 lepB, Bacterial leader peptidase 1 (S26A) family:Signal peptidase
MNFPLVLLILLVVTGGIWLLDYFVLRHKRVSGNTEEPWWVEYPKSFFPII
LIVFSLRSFLVEPFKIPSGSMIPTLLIGDFILVNKYTYGIRLPVANLKII
DMNEPQRGEVMVFRFPEDPSIDYIKRVIGVPGDMVTYRNKQLSINDVPVQ
LEQGGDYKYIDGPAYIYTQRFKENMDGSEHDILINEDMPDIQLSAIHHFP
NRENCTFDRTGFSCKVPEGNYFTLGDNRDGSSDSRYWGFVPENHIVGKAF
LIWWNFNDLSRIGTLIK
>NE1320 leuA1, HMG-CoA Lyase-like family
MKEHLVIFDTTLRDGEQSPGASMTMEEKVRIARQLERMGVDVIEAGFPAA
SRGDFEAVRAVAEAVSNSTVCGLARAMEADIDRTGEALQVNQNVRIHTFI
ATSPIHMKNKLRMSPDQVIDQAIKAVKWARQYTDNVEFSPEDAGRSEIDF
LCRVLEAVIDAGARTLNIPDTVGYTMPDQFGGLIRTLRERIPNSDKAIFS
VHCHNDLGLAVANSLSAVMNGARQVECTINGLGERAGNAALEEIVMAVRT
RQDYFPCDTRIDTTQIVPASKLVSGITGFPVQPNKAIVGANAFAHESGIH
QDGVLKHRETYEIMRAEDVGWGANKLLLGKHSGRNAFRSRLKELGIGLES
EEKLNAIFLRFKDLADKKHEIFDEDLHALVSDEAQIPEEHYRLLSLHAVS
ETGEIPSAQVVIAVGGSEKQAVSEGSGPVDATFRAIEKILDSKVELQLFS
VNNITSGTDAQGEVTVRLQKAGRIVNGHGADTDIIAASAKAYLSACNKLH
SSLERTHPQI
>NE0688 leuB, leuB, 3-isopropylmalate dehydrogenase
MKIALLAGDGIGPEIMAQAERVLRYFQQEGLQIELEPGLLGGCAVDATGE
PFPEATRQLAIEADAILLGAVGGTQYDGLPREKRPEQGLLAIRKELNLFA
NLRPVVLYPELSNASTLKPEVVAGLDILIVRELTGDIYFGRPRGIEYRDG
QKVGFNTMIYSESEIRRIAHVAFQAAGRRNGRLCSVDKMNVLECTQLWRD
VVTETAQSYPDVTLSHMLVDNAAMQLVRNPRQFDVVVTGNMFGDILSDEA
SMLTGSIGMLPSASLDDRNKGLYEPIHGSAPDIAGKGIANPLATILSAAM
MLRYSFNQETAASRIEHAVQKTLQQSYRTEDIYEAGMKKVGTVEMGDRVL
ANLQSR
>NE0685 leuC, leuC; 3-isopropylmalate dehydratase (large subunit)
MKTLYDKLWSDHVVHAESDDPNGMVILYIDRHLVHEVTSPQAFESLKLAG
RKPWRTGSILAVADHNVPTTDRSSGISDPVSRLQVETLDQNCEEFAITEF
RMNDERQGIVHVIGPEQGATLPGMTVVCGDSHTSTHGAFACLAFGIGTSE
VEHVLATQCLVARKSKTMLVRVEGDLPPGVTAKDIALAVIGEIGTAGGTG
YAIEFAGSAIRSLSMEGRMTLCNMAIEAGARAGMVGADEVTIDYIKGRPF
APQGALWDQAVAYWRTLKSDEDAVFDRMVELKAVNIKPQVTWGTSPEMVT
TVDGYVPDPADISDPTKRHDVEHALGYMGLKPKMPIQEITLDKVFIGSCT
NSRIEDLRAAAEIVKGKRIAPNIRLAMVVPGSGLVKSMAEKEGLDKIFLS
AGFEWREPGCSMCLAMNDDRLLPGERCASTSNRNFEGRQGPGGRTHLVSP
AMAAAAAIAGHFVDVRSFIR
>NE0687 leuD, leuD; 3-isopropylmalate dehydratase small subunit
MEKFEQFKGIVAPLDRANVDTDAIIPKQFLKSIKRSGFGQNLFDEWRYLD
YGEPGKDISLRQLNPDFILNQPRYQGARILVARDNFGCGSSREHAPWALQ
DYGFAVIIAPSFADIFYNNCFKIGLLPIVSEASIVDRLIRDTLETDGYRL
EVNLDAQTVTTPSGEIHRFEVDSFRKHCLLNGLDEIGLTLQHADKIRTFE
MNRRDRQPWLFLQQNGDKLRT
>NE1139 leuS, t-RNA synthetase, class Ia:Leucyl-tRNA synthetase
MQEKYHPQEIERQARQSWQETNIFNVTEIPDRPKYYCLSMFPYPSGKLHM
GHVRNYTIGDVLSRYRRMQGYNVMQPMGWDAFGLPAENAAIQKGVPPAKW
TYDNIAYMRSQLQSLGFAIDWQRELATCDPQYYRWNQWLFLRMLEKGIAY
QKTQVVNWDPVDQTVLANEQVIDGCGWRTGAVVEKREIPGYYLAITRYAD
ELLADLEKLPGWPERVKTMQANWIGKSFGVDITFPPDTASGMPQALKVFT
TRADTLMGVTYVAVAAEHPVALHAAHNQPDLVAFIESCRQGATMEAELAV
QEKKGMATGLYVLHPLTGERLPVWVANYVLMSYGEGAVMAVPAHDERDFD
FARQHSLPIKPVIRPENGELSVPLVQAFTEYGVTFNSDRFSDLTSPEAID
AIAVELGQKALGEKRVRYRLRDWGISRQRYWGCPIPLIHCDSCGVVPVAD
DQLPVVLPEDLVPDGSGNPLAKTPSFYECTCPRCGRQARRETDTMDTFVD
SSWYFIRYACPDQSAAMTDQRANYWLPVDQYIGGIEHAILHLLYSRFWSK
VMRDLGLVSFDEPFANLLTQGMVLNEIFFRKTSSGRIQYFNPAEVDVQHD
GEGKRVGAVLQADNQPVESGGIGTMSKSKNNGIDPQEIIEQYGADTARLF
MMFASPPTQTLEWSDAGVEGAFRFLKRLWRQVYLHRQLDGEAAATTTLVP
HQEYPADLRDLRCQLHQTIVKVTDDLERRHTFNTAIAAIMELMNELSDVQ
GTHPAARQLMQEALENIVLLLSPIVPHICHVLWRELRPGTELLDQPWPQA
DDQALIQDEVEIVVQINGKLRGQIRIAREADRAAVERTALQDEHIQKSIA
YRPVKRVIVVPGKLINIVV
>NE0105 lgt, Prolipoprotein diacylglyceryl transferase
MLVHPQFDPVAISIGPLAVRWYGLMYLLGFSLFILLGRYRIRQQPNGVFT
REMLDDALFYGVLGVILGGRLGHVLFYEPGYYLQHPLEILAIWQGGMSFH
GGFLGVAIAMLCLARKYQLSWLAVTDFIAPLVPLGLGAGRIGNFINGELW
GRPTDVPWGMIFPYADNLPRHPSQLYEFALEGLVLFALIWLYSAKPRPLG
AVTGMFMIGYGAFRSFCEFFREPDDGFLGIMTLGISMGQWLSLPMIAAGI
ALLYWAYRYDGKPEKAARKLPRERKNRD
>NE1753 lig, NAD-dependent DNA ligase
MISENTIEERLQALRAAIALHDFHYYVQDAPVIPDAEYDALFRTLQQLEQ
QYPHLVTPDSPTQRVGAPPLKVFAQLTHQTPMLSLANAFSEEEVTAFDRR
IREALNIDRVDYAVEPKFDGLAISLIYANGILTKGATRGDGYTGEDITLN
LRTIPSIPLRLQVPFPTGQFEVRGEVVMLKTDFERLNEQQRKNGEKTFVN
PRNAAAGSLRQLDSRITAMRRLTFFAYGIGAYHEDQPIFSTHSEILAYLA
TQQFLVARQSSTVMGANGLLAYYREMNAVRLSLPYEIDGVVYKVNDLAQQ
EKLGYVSRAPRFAIAHKFPAQEVSTELLAIEIQVGRTGALTPVARLAPVF
VGGVTVTNATLHNEDEVQRKQIMIGDTVIVRRAGDVIPEVVAVIVERRPT
HAQAFVMPDHCPVCGSKAVRLPDEAVTRCTGGLYCPAQRKQAILHFASRR
AIDIDGLGEKLVDQLIDRELVHTPADLYRLDIDTLAGLERMAGKSARNLV
TAIEDSKKTTLPRFIYALGIRHVGEATAKALASHTGDLDRLMDMNAEQLQ
QIPDIGPIVAQSIADFFSEAHNREVIEQLLSCGLQWEKPSHIAQPSSRTN
LAVPGKTFVLTGTLPTMTRDQAKNRIEQQGGKVTGSVSSATSYVVAGSDP
GSKYARAIELGIPVLDEDQLLSLLRDTSSSE
>NE1401 ligT, putative 2'-5' RNA ligase
MSSVAPSVRLFFALWPQEAVRKQLARLARRIADQCAGRCVRPENLHLTLA
FIGEVDSSVVPILRQAGSAIHRTSFDLLLDKLHCWGKGSIIAAGVSQPCG
ELTVLVQDLRNQLSAAGVRYDTVKFVPHVTLVRNAKREPEQKHFPETIPP
VAWAADRWSLVQSRPTPYGSAYQSFADWALASQDFDEHHRA
>NE1489 lipA, Lipoate synthase
MAADTHQRGAAKTARNPVKIDLEQSGQLLRKPSWIRVRSSNSQKYLEVKR
LLRENRLHTVCEEASCPNIGECFGRGTATFMILGDLCTRRCPFCDVAHGR
PHAPDPDEPMHLANSIAVLKLNYVVITSVDRDDLRDGGAQHFADCIRAIR
TQSPQTRIEILVPDFRGRLEVALEKLSACPPDVMNHNLETVPRLYKQCRP
GADYMHSLQLLKDFKAAFPHIPTKSGLMLGLGETDEEIIEVMRDLRAHQV
DMLTVGQYLQPSKGHHPVMRYVSPEDFKTFERIATNLGFSHAACGPMVRS
SYHADQQAHEAGIE
>NE1488 lipB, lipB: lipoate-protein ligase B
MFTVTVKNMGTTEYLTAWQAMKNFTAQRTCETPDEIWLLEHPPVYTQGIA
GKPEHLLFPNQIPVIKTDRGGQITYHGPGQIILYLLLDLHRWRLNIRQLV
RKMEGAVIDLLSEYDVVAQGDENAPGVYVDGAKIASLGLKIRRGACYHGI
AFNADMDLTPFTAINPCGYRGLRVTQAKDLGIADCKEMLATKLAQSFINQ
LTDV
>NE1188 lnt, Carbon-nitrogen hydrolase:Apolipoprotein N-acyltransferase
MNGYFRLIAAFALGVATVSGFAPFYLYPIPVVTLALLALLWRRSRTPGQA
ALTGFTFGMGLFGAGVTWLYVSLHDFGHMEPALAVLALIILCAYLALFPA
LTGWITAFRHFRASWAWPGMVAALWALAEWLRGTLFTGFPWLTVGYSQAP
ASPLAGFAPVIGVYGLSLLLMLSAAWLACWLENRQSHRFWLGLGSVWLIG
FGLQQIHWTQPEGEPVTVSLLQGNIPQNMKWQPEHLAATMQIYAELVQES
PSRLIVTPEISFPLFYEQAPQDYLALLAEHARSRQGDLLIGMAERSSSDN
GYYNTMFSFGTSPEQSYRKYHLVPFGEYIPLKPVFGWIIDVLHIPLSDFS
RGGLDQQPLDLAGQQVAVNICYEDVFGEEIIMQLPQASLLVNVSNDAWFG
RSIGPRQHLQISQMRALETGRYMLRATNTGVTAIIDERGRVLEQLDMFTT
AGLHSTAQGFGGATPYVRFGNSLVFALIGLLLLAGSLAAFSGRRKTL
>NE1056 lolD, lolD; lipoprotein releasing system ATP-binding ABC transporter
MSKIIIACRDLYKSYFQGNLEVPVLHGIDLQVNEGEMVAIVGASGSGKST
LLHVLGGLDKPTRGEVTLLDRELSTISEAERGSLRNHALGFVYQFHHLLP
EFSAQENVAMPLFIRRMNKKAAMEQAAAMLQRVGLGHRLTHTPGELSGGE
RQRAAVARALVTRPACVLADEPTGNLDRHTAEAVFDLMLELNHEANAGLV
IVTHDTQLASRADRVLHLVDGMLQ
>NE0033 lon, lon; ATP-dependent protease la protein
MTSDIIDAGESSELDLPLLPLRDVVVFPHMVIPLFVGRPKSIKALEVATE
AGTNILLVAQKSAAKDDPAPQDLYRVCCVSSILQMLKLPDGTVKVLVEGN
YRAKIESFSDSETSFSGKTIQVRIEETDTPEIEALRRALLSQFDQYVKLN
KKIPSEILASLTGIDEAGRLADTIAAYLPLRLEQKQEILEIFDVQPRLER
LLGLLEAELDILQVEKRIRGRVKRQMEKSQRDYYLNEQVKAIQKELGEGE
DGADLEEIERKIKTASLPKEALAKAESELKKLRLMSPMSAEATVVRNYID
TLVALPWKQKTKISKELKQAESILDEDHYGLERVKERIVEYLAVQQRVAK
SKAPILCLVGPPGVGKTSLGRSIAHATNRKFIRMSLGGMRDEAEIRGHRR
TYIGSMPGKILQSMTKVGVKNPLFLLDEVDKMGMDFRGDPSSALLEVLDP
EQNSTFVDHYIEVEYDLSDVMFVATANTLNIPPALLDRMEVIRLSGYTED
EKLNIAKRYLLPKQMKDHGLNEDELSISESALRDIVRYYTREAGVRSMER
EVSKICRKVVKTLLVKKSKEKIIINSRNLSKYLGVRRYTYGMADEKNQVG
QVTGLAWTEVGGELLRIEAVVLPGKGKTITTGKLGEVMQESIQAALSVVR
SRSRILGISDDFYLKNDIHIHLPEGATPKDGPSAGIGICLAMVSALAGIP
VRASVAMTGEITLRGEILPIGGLKEKLLAAHRGGISTVLIPEDNMKDLSE
IPGNIKSKLNIKPVKWIDQVLEIALEYKPEMLPSDISGQVISSASEERVT
SAPSIKH
>NE1278 lonA, lonA; ATP-dependent proteinase La 1 (lon) (class III heat-shock protein)
MSAMEQSPSNLPELPADVIALVPMRNVVLFPHVIMPVAVGRTRSIAAIQH
TLQSKVPVGIVLQKNPSVDEPGLDALCQIGTIANVVRHIASEDGTHHAVC
LGVERFRIEALVEGYPFLAARIRRIPEAIPDTTQVEALTLQLRERAMEIV
SLLPSVPAELAHALQATRAPSDLADITASLLDTEVAEKQKLLETIDIEER
LHSVLQILARRIEVLRLSQEIGERTKEQMEDRERKFLLREQLRTIQKELG
EDGENEQEVVKLEEAITAAGMPTDIEQQTHKELQRLQRMPAASSEFSMLH
TYLEWMTELPWQLPEDKPIDLDAARTILEADHFGLERIKQRIIEFLAVQK
LKPQGRAPILCFAGPPGVGKTSLGQSIARALQRPFVRVSLGGVHDEAEMR
GHRRTYVGAMPGNIIQGLRKAGARNCVMMLDEIDKMTASAHGDPAAALLE
ILDPEQNSTFRDNYLGVPFDLSRVVFIATANVIDQIPPPVRDRMEIIDLP
GYTQEEKLQIALRYLVQRQSEANGLQTDQCMLTSAALQGIIANYTREAGV
RQFEREIGRIMRHAALQIAQGTQQQVQIDAQNLEDILGPEKYEHELALHT
DLPGVATGLAWTPVGGDILFIEATRINGSGRLILTGQLGDVMKESAQAAL
TLVKARAEDLHIPTSVFEGIDVHLHVPAGAIPKDGPSAGVAMFIALASLF
ANRPVRHDVAMTGEISLRGLVLPVGGIKEKILAAQRAGIRTVLLPARNRK
DLHDVPEATRTAIQFVFLETADNAVQAALGKSDNLESV
>NE2319 lpdA3, pdA3; dihydrolipoamide dehydrogenase E3 component
MDNLFDVAVIGAGPGGYVAAIRCAQLGLNTVCIDDWKNEQGKPSLGGTCL
NVGCIPSKALLESSENFERAGHKFAEHGIKVDGLSIDVPAMIARKNKIVK
AFTGGIGMLFKKNKVTALHGRGTLQKHDQARDGGDDSWEIQVSADGKEQS
VHARHVIIATGSTPRTLKVAPVDGNNVLDNAGALALQQTPGKLAIIGAGV
IGLELGSVWRRLGAEVTLLEAQADFLPAADEQVAKEAYKALTRETGLTIH
TGVEIKSTRASENGVEIDYVDRDKNAQNLKVDKLIVAVGRVPNTSGLGAE
AAGLKLDERGYISVDEFCQTSLQNVYAIGDVVRGPMLAHKASEEGVAVAE
RIASGRQGATDSSGHVDLGMMPWVIYTAPEIAWVGKTEQTLKAEGVAYKA
GQFPFMANGRARALGETTGFVKILADAESDRILGVHMVGPYVSEMIAEAV
VAMEFSASSEDLARIVHAHPSLSESLHEAALGVDKRAIHI
>NE2164 lpxK, Tetraacyldisaccharide-1-P 4'-kinase
MNWYELYWQRITPLHLFLWPVSQLLILFQSVRRFLYRRAILTSIHLPVPI
IIIDSITTDSPVKTSLIIQIANILKAAGLRPGIISRGYPDNHRPPTRVTI
SSHPHLTGEKSLLLTYHLRETCPVWIGYDRIETAKALLNAHKECNVLICD
DGLQDLRLQRDFEAVIVDTSVINSGNGLIMPAGPLRDSFARLKHTDAVIL
AGHQRRIPDITDEIRTIHTRPQKEHFFNLSWPELTADAAGLAGKRIHAIV
CDPDTQNFLDNLEFLKLTVTPRVFPENHHFIATDFQSDEAEIILIPEEDA
VKCLSLHDDRIWVLQQEYRVDPGLREIILKKLREKFMDPKLLDILVCPLC
KGSLIYKKDRLELICKADRLAYPIRDGIPVMLEDEARKLPDEEEIK
>NE1148 lspA, Signal peptidase II/lipoprotein signal peptidase family (A8)
MKLYFSLTLAVIILVMDLLTKRWVELNLTYGQQIMITEFFNLVLTYNAGA
AFSFLSDASGWQRWFLSTIAVLVSVLIVYLLYKNTANRLFCFALSLILGG
ALGNLWDRIMLGHVVDFLDFHLNGYHWPAFNLADSAIFCGAFLLILDSIR
NGRNDSVQEG
>NE2132 lysC, Aspartokinase superfamily:ACT domain
MAFLVQKYGGTSVGSTERIKQVAHRVAEFRAQGHELVVVVSAMSGETNRL
ISLAKEIQPNPDPRELDVILSTGEQVTIGLLAMALRELGIKAKSYTGPQV
SITTDSAHTKARILKIDEDPIRADLAAGYVIIVAGFQGVDETGDITTLGR
GGSDTTAVALAAALKADECQIYTDVNGIYTTDPRVVPEARKLDTITFEEM
LEMASLGSKVLQIRSVEFAGKYKVKLRVLSSFEEGEGTLITFEEDNKMEQ
PIISGIAFNRDESKITVVGVPDHPGIAYQILGPVADANIDVDMIIQNVGH
DDTTDFSFTVHRNEYARTMDVLKQQVLPHIGAREVIGGDKIAKVSVIGVG
MRSYAGIASKMFRVLAEEGINIRMIATSEIKISVVVDEKYMELAVRVLHK
AFDLDQVNNTSTKTC
>NE2355 lysS, lysS; putative lysyl-tRNA synthetase protein
MTQEEISGISQDENNLIAERRSKLTALRQTGNAFPNDYRRDNLARILHEK
YDSCSREELESSQVTVKVAGRMLFKRVMGKASFATIQDMSGRIQLYISND
HTGETAHEAFRHYDLGDILGAEGVLFKTRTDELSLRVTQLHLLTKSLRPL
PEKFHGLADQEQKYRRRYLDLITNEDTRRVFAIRSKIIQAIREFLVDRDY
LEVETPMMHSIPGGATARPFVTHHNALDMSLYLRIAPELYLKRLVVGGME
KVFEINRNFRNEGISTRHNPEFTMLEFYEAYQDHNYLMDLTESMLREVAL
KVSGTTRIVYQQREMDLAQPFARLTIAQAILKYHPEYSDAQLNDRDFLTK
ALQAKGVTVNPDSGIGGLQLALFDETTEHLLFEPVFIVDYPAEVSPLARC
NDANPEITDRFELYIAGREIANGFSELNDPEDQANRFLEQARAKEAGDLE
AMHYDADYIQALEYGLPPTAGEGIGIDRLVMLLTDSPSIRDVILFPQLRK
ED
>NE0649 lytB, LytB protein
MIRVLLANPRGFCAGVDRAIEIVERALAMYGAPIYVRHEVVHNRFVVEDL
EKKGAVFVENLEEVPEGSMLIFSAHGVSHEVRREAAARKLQIFDATCPLV
TKVHVEVAKMDKEGKEIVMIGHQGHPEVEGTMGQIAKSKGTMYLVETAED
VARLQVKNESNLAYVTQTTLSVDDAARVIEALKQRFPKIIGPKKDDICYA
TQNRQDAVKKLVKLCDLVVVVGSPNSSNSNRLCEVARNENVEAYMVDQAE
QLQESWLTNKRCIGITAGASAPEILVQQVLERLEQIAAKQSNQGVIIEEL
SGVLESVTFPLPKAEPVSFNKYI
>NE0275 map, Methionine aminopeptidase
MQAGAGVVIHIKTAPEIETMRIAGRLASEVLDYIEPYVVPGVTTGELDEL
CHRYMVDVQKTIPAPLNYAPPGYSPYPKSICTSVNHQVCHGVPGDKKLKD
GDVVNLDITVIYEGYHGDTSRMYYVGEPSIQAKRLCELTYEAMWRGIEEI
RPGKHLGDIGHAIQRLAEGAGYSVVREFCGHGIGAKFHEDPQVLHYGRAG
TGIELKPGMIFTVEPMINAGKAAIKQLPDGWTVITKDHSLSAQWEHTILV
TDESYEVLTVSSGSPARPAYRF
>NE2529 mcrC, possible mcrC protein
MTAVAEQEESASNSAEGFIGRIPVRNLWLLMLYASDLFRTRGIGKIGLED
SPDDLPDLVAEILAHAVEVRQRRRLSLGYRSRDAVINRVRGRIDVLTTER
HQLMDRGLVACWFDELTIDTPRNRFVRAALESISRIVQRKDVAHRCRALA
GGMKAMGVSGDAPARAQMSTDRFGRNDADDRFMVAAAKLALDLALPTEAS
GANVLSLPDREATWVRRLFERAVGGFYEVVLSPQGWRVLCGGTMGWQIEQ
RTAGIDKILPTMRTDVVLDHPSTGQRIVIDTKFTSIVTSGWYREETLRSG
YVYQIYAYLRSQVGCGDALADHASGLLLHPAIGQMVDETAVIQGHRIRFA
TVDLTASTSDIRLQLLRFFDPNQPVTGQ
>NE2437 mdoG, periplasmic glucans biosynthesis protein
MRKFLNNRMQMASVSSRSDCLDYHQNKWTGSIKYTKDKRMRHKPQIIKMR
WLGVAVTLLVLYASSAWAFSINDVAKQAQLLASESYETPETNLPSIFRDM
KYADYQQIQFNHDKAYWNNLKTPFKLEFYHQGMYFDTPVKINEVTATSVH
QIKYTPDYFNFGNVQHDKDTVKDLGFAGFKVLNPINSKNKNDEILSMLGA
SYFRMIGAGQVYGLSARGLAIDTALPSGEEFPRFREFWIERPKPADRHLI
IYALLDSPRATGAYRFVIMPNRDITVNVQSKIYLRDKVGKLGIAPLTSMF
LFGSNHPSPMVNFRPELHDSNGLSIHASNGEWIWRPLNNPRRLAISSFST
ENPQGFGLLQRGRQFSRFEDLDDRYDLRPSAWITPKGKWGKGSVELIEIP
TNDETNDNIVAFWTPDQLPEAGKEINFKYTLTFSKDEDKLHAPDNAYVMQ
TLRSTGDVKQSNLIRQPDGTIAFIIDFTGRKMKKLPQDTPVAAQASIDDN
GTIVESSVRYNPIIKGWRLTLRVKVKDVEKITEMRAALVNGDQILSETWS
YQLPADE
>NE2438 mdoH, Glycosyl transferase, family 2
MNKTTEYIDALPLTVAEKATLPATDIRTLHETLNPEHHDYAREDDSPLGS
VKARLEQSWPDSLVNRQLTEDDEGRTQLETMPKATRSSISPDPWRTNPIG
RFWDHLRGHNATPHHVSRLTKEEQAHEQKWCTVGTIRRYILLLLTFSQTA
LATWYMKTILPYQGWALIDPIDMIGQDIWISFMQLLPYILQSGILILFAI
LFCWISAGFWTALMGFLQLLIGKDKYSISASLAGDVPINPEHRTALIMPI
CNEDVDRVFAGLRATWESVKATNQQQHFDIYILSDSYDPDICVAEQKAWI
ELLAEVQDKGQIFYRRRGRRVKRKSGNIDDFCRRWGSQYSYMVVLDADSV
MSGECLTSLVRLMEANPNAGIIQSWPRASGMDTFYARCQQFATRVYGPLF
TAGLHFWQLGESHYWGHNAIIRVQPFIEHCTLALLPGEGTFAGSILSHDF
VEAALMRRAGWGVWIAYDLPGSYEELPPNLLDELKRDRRWCHGNLMNFRL
FLVKGLHPVHRAVFLTGVMSYLSAPLWFMFLALSTALQVVHAFTEPHYFL
QPHQLFPVWPQWQPELAIALLASTMVLLFLPKLLSILLIWCKGTKEYGGF
IRVTISLLLEVILSVLLAPVRMLFHTVFVVSAFLGWKVAWKSPQRDDDST
TWREAFMRHGSQLLLGLVWATGMAWLDLRFLFWLAPIVFSLILSPFVSVF
SSRASVGLRAKRWKLLLIPEEYSPPKVLVDTDNYLMMNRNRTLNDGFMHA
VFHPSFNALTTATATARHRKSKVLEIARDHHIEQALNEPPDKLNRDCRLT
LLSDPVIMSRLHYCVWAMPEKYASWVNHYQQLTLNPSALKQCEPNIEEEA
DYAEPTAQIDMALPGTP
>NE2066 mdrB, Cell cycle proteins
MITLDIKKIWYYFTRYIDNFLLAGIFLLMLTGLIVLYSATGGNLTRVISQ
LINMIVAFVVMWTVANIPLQRIMRLAFPIYVMGIILLVAVALFGEVQNGA
RRWLNLGFINIQPSELLKIAAPLMMSWYFDKAHITLRWRDYVVAVLILLL
PVLLIARQPDLGTALLILISGFYVIFLAGLSWRIIVGLAVAVAVSLPLLW
TFGMHDYQRKRVMTMLDPSQDALGAGYHTIQSSIAIGSGGISGKGWLKGT
QSQLDFLPEPSTDFIFSVFSEEFGLIGNSLLLSLYLIVIGRCLVITARAP
TRFTRLVAGSITLTFFTYVFVNMGMVSGILPVVGIPLPLISYGGTSMVTI
LLGFGILMSIHTHPKLVKT
>NE0839 merA, merA; mercuric reductase
MTTLKITGMTCDSCATHVKQALEKVPGVQSAVVSYAKGAAQLDLDPGTAP
DALTAAVAGLGYKATLADAPPTDNRTGLLDKVRDWMGAADKGSDGEHPLQ
VAVIGSGGAAMAAALKAVEQGAKVTLIERGTIGGTCVNVGCVPSKIMIRA
AHIAHLRRESPFDGGIPATAPAIDRSKLLAQQQARVDELRHAKYEGILDG
NPAITVLHGEARFKDDQSLVVRLNAGGERVVVFDRCLVATGASPAVPPIP
GLKESPYWTSTEALVSDIIPERLAVIGSSVVALELAQAFARLGSQVTILA
RSTLFFREDPAIGEAITAAFRAEGIKVLEHTQASQVAHVDGEFVLTTARG
EIHADKLLVATGRTPNTRSLALEAAGVAVNAQGAIVIDKGMRTSAPHIYA
AGDCTDQPQFVYVAAAAGTRAAINMTGGDATLDLTAMPAVVFTDPQVATV
GYSEAEAHHDGIETDSRTLTLDNVPRALANFDTRGFIKLVIEEGSGRLIG
VQAVAPEAGELIQTAALAIRNRMTAQELADQLFPYLTMVEGLKLAAQTFN
KDVKQLSCCAG
>NE0840 merC, putative mercury transport protein
MGLVTRIADKTGALGSVVSAMGCAACFPALASLGAAIGLGFLSQYEGLFI
SRLLPLFAAVAFIANALGWLSHRQWHRSVLGMIGPAIVFAATVWLLGNWW
TANLMYVGLALMVGVSVWDFVSPANRRCGPDGCELPAKRG
>NE0838 merD, Bacterial regulatory proteins, MerR family
MNAYTVSRLADDAGVSVHVVRDYIVRGLLRPVACTTGGYGLFDAAALQRL
CFVRAAFDAGIGLGALARLCRALDAADGDGAAAQLAVLHQLVERRREALA
SLEMQLAAMPIESARQTESLP
>NE0841 merP, Mercury scavenger protein:Heavy-metal-associated domain
MKKLFASLALVAVVAPVWAATQTVTLSVPGMTCAACPITVKKAISKVEGV
SKTDVSFDKREAVVTFDDTKTNVQKLTKATEGAGYPSSVKR
>NE0843 merR, Bacterial regulatory proteins, MerR family
MQTILENLTIGAFARAAGVNVETIRFYQRKGLLPEPDKPYGSIRRYGEAD
VTRVRFVKSAQRLGFSLDEIAELLRLDDGTHCEEASSLAEHKLKDVREKM
ADLARMESVLSELVSACHLRQGNVSCPLIASLQGDASPAMP
>NE0842 merT, MerT mercuric transport protein
MSEPQNGRGALFAGGLAAILASTCCLGPLVLVALGFSGAWIGNLTILEPY
RPIFIGAALVALFFAWRRIYRPAEACKPGEVCAIPHVHTTYKLIFWIVAV
LVLVALGFPYVMPFFY
>NE1436 metE, Methionine synthase, vitamin-B12 independent
MRTLTHNLGFPRIGAQRELKKALETYWKGQNDVDQLLATARAIRTGNWLL
QQKAGIDLIPVGDFSLYDHILDMTTLLGAIPHRFGNTGDKISPDLYFAMA
RGTADQPAMEMTKWFNTNYHYIVPEFDDTTQFRLASDRLFQEIEDAKALG
ITAKAVLIGPLTYLYLGKEVTPGFQRLDLLPRLLPVYREILQKIASLGVE
WVQIDEPILSLDLEQPWRDSFAEAYHTLHDGSCRLLLTTYFGTVDHHLTL
LKNLPVDGLHIDVSSAPEQLESFLTEDFSGKTLSLGCIDGRNIWRADLSQ
KLETLSQAASRFAGELWIAPSCSLLHCPVDLALETKLEPEIKNWLAFSAQ
KLEEMTTLGRGLNQGRESVEAILTASDEARRSRTESSRIHNPIVHQRVDN
LTERDSQRNHPFATRKHLQQQRFNLPLLPTTTIGSFPQTATIRQARAAFR
KNELSHLEYLSAMRAEIREMIRKQEEIGLDVLVHGEPERNDMVEYFGEQL
WGYAFTENGWVQSYGSRCVKPPILYGDVYRPEAMTVEWIKYAQSQTGKPV
KGMLTGPVTMLMWSFVRDDQPRSTTALQLALAIRDEVADLENAGIGMIQI
DEPAFREGLPLRKQDWKSYLDWAVKAFRVASSGVKDETQIHTHMCYSEFH
DILPAIAELDADVITIETSRSRMELLDAFVKFSYPNEIGPGVYDIHSPRV
PDASEMFELLKKASRYIDPTLLWVNPDCGLKTRNWPETQTALQKMVDCAK
TLRSALQA
>NE0661 metF, metF; 5,10-methylenetetrahydrofolate reductase oxidoreductase protein
MQSQKKFTPTFSFEFFPPQTPEGMEKLRATRIQLAQFNPKFFSVTFGAGG
STRERTLETVLEIQAEGYPVAPHLSCIGSTRDNIRSILEKYHSHGISRIV
ALRGDLPSGMAQAGEFRYANELVAFIRKEFGDTFWIEVAAYPEYHPQARS
ALEDFTNFRRKVEAGSNAAITQFFYNVDAYLHFVEMCEAADLNIPIVPGI
MPISKFSQLARFSDGCGAEIPRWIRRKLESFGDDIPSIQAFGLDVVTALC
ARLLEAGAPGLHFYTLNSAVLPTKIWQRLGL
>NE0625 metG1, Methionyl-tRNA synthetase
MTIRNILVTSALPYANGSIHLGHLVEYIQTDIWVRFQKMQGHTVYYVCAD
DTHGTPVMLRAEKEGISPEALIARVHAEHLRDFTGFHIAFDQYYSTHSDE
TRYYAEDIYRKLKEAGLIAVRAIEQLYDPIKNLFLPDRFVKGECPKCGAA
EQYGDSCEACGAAYTPTELKNPYSAVSGATPVRKTSEHFFFKLSDSRCAD
FLRRWTHEGNHLQAEAANKMAEWLGEAGENKLSDWDISRDAPYFGFEIPG
ETGKYFYVWLDAPIGYMGSFKKLCARKGIDFDAYWKKDSTTELYHFIGKD
ILYFHALFWPAMLENAGYRTPTQIFAHGFLTVNGEKMSKSRGTFITAESY
LEQGLNPEWLRYYYAAKLNGSMEDIDLNLDDFVARVNSDLVGKYINIASR
CAGFISKRFGGKLVSGEDYRLLQQMVDEHFAGWQPGVIEAAYEARDFSAA
VRHIMRRADEVNELIHELAPWEIARDETRERELHRACSLGIQMFYLLSCY
LKPILPRTAAQIEDFLNLGELSWQKQQAGQPLSDTLLPPGHVINPYQHLM
TRIDPKQITALITANQQTMQQTMNTETESHSPQRHGQAQQHPVAPIAETI
SIEDFVKIDLRIARIVDAQHVPGADKLLQLTLDIGSEQRTVFAGIKSAYD
PEQLKGRLTVMVANLAPRKMKFGLSEGMVLAASGENGGGPFLLAPDSGAQ
PGMRVK
>NE1623 metH, metH Methionine synthase I, cobalamin-binding domain
MTMHERADLLKRLLAERILMLDGAMGTMIQSYKLTESDYRGERFADFPHD
LKGNNDLLCLTRPEVIRSIHRAYLEAGSDIIETNTFNSNAPSMADYHMQD
LVYELNVAGARLACEEARAMETQQPDRPRFVAGVIGPTTKTASLSPDVND
PGFRAITFDDLVESYTESVRGLIDGGADILLVETIFDTLNAKAALFAIDQ
YFETHGLRLPVMISVTITDASGRNLSGQTPEAFWNSVRHARPLSVGINCA
LGAELMRPYVEELSNVAEVFTSAHPNAGLPNPLAETGYDETPEYTARLIK
DFAQSGFVNIVGGCCGTTPKHIAAIAEAVRDIPPRPLPDIPKKLRLSGLE
PLNIDEHSLFVNVGERTNVTGSKAFARLILNGGYAEGLVIARSQVENGAQ
IIDINMDEAMLDSQKAMVTFLNLLAAEPDISRLPIMLDSSKWSVIEAGLK
CVQGKAVINSISLKEGEAEFLHHARLARRYGAAVIVMAFDETGQADTLQR
KVEICTRCYHTLIEQADFPPEDIIFDPNIFAIATGIEEHSNYAVDFIEAT
HVIRQTLPYAKVSGGVSNVSFSFRGNEPIREAIHTAFLYHAVKAGMTMGI
VNAGQLGVYSDIPPDLLEHVEDVLLNRRPDATERLVEFAEHFKGQKKEQI
EDLSWRDEPVRQRLIHALVRGISTYIVEDTELVRQEIDSQGGKPIEVIEG
PLMDGMNVVGDLFGAGKMFLPQVVKSARVMKQAVAYLLPYIEAEKKISGD
SKPKGKVVIATVKGDVHDIGKNIVSVVLQCNNFEVINMGVMVPSAQILET
ARREQVDMIGLSGLITPSLEEMAHVAREMEREQFTVPLLIGGATTSRMHT
AVKIAPHYGGVTVWVPDASRAVGVCSNLMSQDLRDDYVRQVKAEQEKSRV
QHRNKKGPSKLLTFEEARANALKTDWARYTPPAPDFLGLRTLNNYPLETL
VPHIDWTPFFQAWELHGRYPAILQDELVGEAASNLFRDAQNMLRKIVEQK
WLTANAVIGLFPANTVNGDDIEIYADRSRSQVIMTWHTLRQQTAKPAGRP
NLALADFIAPRETGLDDTIGLFAVSAGFGIDERIRAFEAANDDYSAIILK
ALADRLAEAFAEHMHARVRREFWGYVKDESLDNEQLIDEQYLGIRPAPGY
PACPDHTEKGPLFALLEAEKRSGIVITESFAMVPTAAVSGFYLSYPESSY
FAVGKIGKDQVEDYARRKGWTLEEAERWLAPVLAYER
>NE0659 metK, S-adenosylmethionine synthetase
MSNYLFTSESVSEGHPDKVADQISDAILDAILQQDPHARVACETMCSTGL
IVLSGEITTDATIDYNAIPRGIVREIGYTSSEIGFDASTCAVLTAFNKQS
PDIAQGVNRSKDEEMDQGAGDQGLMFGYACDETPQLMPLPIYYAHRLVEQ
QAKLRKSGRLSWLRPDAKSQVSVRYEDGFPKNIETIVISTQHSPDVPRDE
LVEGVIEEVIKPVLPAEMLSNHIQYLINPTGRFVVGGPMGDCGLTGRKII
VDTYGGTAHHGGGAFSGKDPSKVDRSAAYAARYVAKNIVAAGLARKCEVQ
VAYAIGVAKPVSLMVQTFGTGKIPDGKLAELIARHFDLRPRAIIHELDLL
RPIYGKTAAYGHFGREEPSFTWEKTDMAEQLKADAGI
>NE1435 metR, Bacterial regulatory protein, LysR family
MIELRHLRTLNALHETGSISLAAQRVHLTQSALSHQIKALQEYYQLPLIQ
RSGHAVQLTEAGKRLVKLAEKILGEVQSAERDLAKIARQAAGSLRIVLEC
HTCFDWLMPIMDDFRQHWPEVELDLVSGFHSDPVALLRQDIADVVIGSEN
RPQQGIVHFPLFRFEILAVLAPDHVLGNKRVLEAVDFAQDVLITYSVPEE
RIDLIRQVLIPAGVSWQRRTTELTVAILQLVASRRGIAALPGWGIKNYLD
YEYVIAKRIGTRGLWSDLYLSVREEDASLRYLQDFLETIGQICFARLDGI
RPLSGKRKEKKREARKTIGQSSPV
>NE2187 metW, SAM (and some other nucleotide) binding motif
MLDLGCGDGTLLHYLRDKLDIHGYGVEIDAHNILACMKNGINVIQNDLEA
GLSEFEGESFDYVILSQTLQAMKNTEYIIREMLRVGKEGIVSFPNFGYWK
NRIQVAGGHMPVSPTLPYQWYDTPNVHLCTLHDFEQLCQQHRVNILERRV
MNNDKKVTFLPNLFGILAFYRFSHAA
>NE2186 metX, Alpha/beta hydrolase fold
MSTQDSDSIGIVSARRAHFDTPLSLKSGAVLDSYELVYETYGELNADRSN
AVLICHALSGNHHVAGVYADNPKNTGWWNNMIGPGKPVDTRKFFVIGINN
LGGCHGSTGPISINDKTGKRFGPDFPLVTTADWAKTYVRFADQFSIDCFA
AVIGGSLGGMSAMQLALDAPERVRHAIVVAASARLTAQNIAFNDVARQAI
LTDPDFHDGDYYSHGTHPRRGLRLARMLGHITYLSDDSMASKFGRELRNG
SLAFNYDVEFQIESYLHHQGDKFADLFDANTYLLMTKALDYFDPAQDYDG
NLSAAFARAQADFLVLSFTSDWRFSPERSRDIVKALLDNKLNVSYAEIPS
SYGHDSFLMQDDYYHQLIRAYMNNIAL
>NE1697 metY, Cys/Met metabolism pyridoxal-phosphate-dependent enzymes
MKRETLAIHGGFAGDPQTHAVAVPIYQTTSYYFDDTQHGADLFDLKVQGN
IYTRIMNPTTAVLEERVALLEGGVGALAMASGMAAITACVQTLARAGDNI
ISTSQVYGGTYNFFCHTLPNLGIEVRMVDGRNPAAFADAIDDNTRMIYCE
SIGNPAGNVVDIAALAEVAHAAGVPLVVDNTVPTPVLCRPFEHGADIVVH
ALTKYMGGHGTSIGGIIVDSGKFPWEGNSRFPQFNQPDPSYHGVVYVDAF
GPAAFIGRARVVPLRNMGAAISPFNSFLILQGIETLPLRMERHCTNALAI
ARYLQRHPKVSWVNFAGLEDNRDYALVQKYMDGGIPSSILSFGIKGGREA
CARFMDRLMLIKRLVNIGDAKTLACHPATTTHRQLNDEELAKAGVSADLV
RLCVGIEHIDDLIADVEQAFQD
>NE0700 metZ, Cys/Met metabolism pyridoxal-phosphate-dependent enzymes
MTNDLDPETLAIHTGVHRSQFNEHSESLYLTSSFVFDSAAQAAARFSGQE
PGNIYSRFTNPTVTAMQERLAVLEGAEACIATASGMSAILTCVMGLLSAG
DHIVASRSLFGSTVSLFNNILSRFGIQTTFVSATDPAEWQAAVRPNTRLF
FLETPSNPLTEISDIAALAEIAKRAGVWLAVDNCFCTPIIQQPLKLGADL
VIHSATKYLDGQGRVLGGAILGKRDLLMDSGIFSFLRTAGPSLSAFNAWI
ILKGMETLSLRVKAHSDHALEVARWLETHPRVGRVFYPGLPSHPQHELAM
RQQKTGGGIVSFEVKGGREAAWRVVDAARLMSITANLGDTKSTLTHPATT
THGRISQEAREAAGIRDGLLRIAVGLESPDDLKADLARGLQ
>NE0156 mexE, HlyD family secretion protein
MCSSAMRQSCVFIIFIVLSIFIAGCNSESGTPVESPPPDVMVASVLSRSV
RTWDEFNGRVRAIQTVELRPRVSGYINRIAYKEGSEVKPGDLLFVIDPRP
YRDALHRAQAELERARSAASLAQSKTEHVHALFAKQAISREEFNGRKNDF
NQTAAEVRAAKAAVATAKLNLNFTEVRAPIAGRVSRAYLTAGNLARADQS
MLTTLVSQDPMYVYFDCDEHSFLRYKKFALKNDHENTIYSVRIGLADEEG
FPHHGTVDFFDNQLNPATGTIRVRAVVPNPDRLLTPGLHARVQLQGSDEL
PVLLIDAKAILTDQDRKYVYVLGEGDKAFRRDITLGRQIDGLRIVQSGLD
TTDKVVVIGAQKIFHNGMIVKPQHVAMETQATTPLLVP
>NE0008 mfd, mfd: transcription-repair coupling factor
MSSKLNPLSSESLPRYTGLEGSSDACALARLANRNPAGQLLAVITASALD
AQRLLEEIPFFAPDLRVSLLPDWETLPYDIFSPHQDLISERLATFYQIAH
NACDVLIIPVTTALYRMPPREFLAAHSFFVNQGSTLDLQSFRSQMSLAGY
SHVSQVLSPGEYSIRGGLIDLFPMGSPLPYRIDLFDDEIESIRTFDVDTQ
RSIYPVKEIRLLPAREFPLDDNGRSRFRTGFREKFEGDPTRCRLYQEISK
GNIPAGIEYYLPLFFEQTATLFDYLAQHSTVCLHGEITPAIENFWQDTRS
RYQLMRNDPDRPLLPPMDLFLPEDQFYGYLKSYKRIEMHTGQQVKTDKPF
ARSLPPVRVDRRASNPIEQLTAFVHTFTQKGGRVLLLAESMGRRELMAEY
LREYGLKLKLCEDFAAFQSDTASCMLSVASLHSGFILAAENLALVTENEL
YATHVRGQRTRDARKTVSADSILRDLSEIKPGSPVVHEQHGIGRYLGLVN
MNMGEDDSGQSSEFLALEYQGGDKLYVPVTQLHLISRYSGAAPEAAPLHK
LGSGQWEKAKRKAMQQVRDTAAELLNLYAQRAARKGHIFRFNQHDYNAFA
DGFGFEETPDQATAINAVIQDMVSGKSMDRLICGDVGFGKTEVALRAAFV
AVTDGKQVAVLVPTTLLAEQHYQNFSDRFGLIADQWPVKIAELSRFRSAR
EQAEALQSLAQGTTDIIIGTHKLIQDKVKFKNLGLVIIDEEHRFGVRQKE
QLKKLRAEVDVLTLTATPIPRTLAMSLEGLRDFSVIATAPQRRLAIRTFV
HPYSEGIIREACLRELKRGGQIYFLYNEVSTIQNMYTRLTTLLPEARINI
AHGQMRESELEHVMRDFYQQRFNLLLCTTIIETGIDIPTANTIIIHRADK
FGLAQLHQLRGRVGRSHHQAYAYLLTPPEKAALTTQATRRLEAIQAMEEL
GSGFYLAMHDLEIRGAGAVLGDSQSGEMQEVGFSLYSSLLDAAIKSLKAG
HEPDMQQPLGVSTEIRLHVPALLPESYCGDIHERLILYKRMAGCSDETEL
DEIHQELIDRFGLLPDPARALLDSHRLRIEARQLGITRIDAGPDNIQLQF
VPEPPIEAIKIIQLIQSSKEYSLSGPDRLSVRLQIPDVGERVKKIKKLMT
LLKN
>NE1633 mgtE, CBS domain:Divalent cation transporter
MTEENHENLEDKLQKVTYLLQKYKLVEGFVRLQDTPEPELIESIIQKQNL
SNLRQALDKLHPADIAHILEALPLEDRLIIWGLVKAEQDGEVLLEVSDAV
RQTLIEDMDSEELVAAAEQLDADEIADLAPDLPSDVMEDVFQALPMEERE
QLRAAMSYSEDAVGALMDFDVITVRGDVRIEVVLRYLRRLGELPDHTDQL
FVVDRTEQLQGVLLLNQLLVSDPEATVMEVMARDTVKFLPDDKAEQATHA
FERYDLVTAPVISSEGKLLGRVTVNAVIDFMREKADLEARSQAGLSEEED
LFAPIWKSVRNRWAWLAINLVTAFIASRVIGLFENSIEKLVALAALMPIV
AGVGGNSGNQTITMIVRAIALGQVNQDSTLKLISKEIGVGIVNGIVWGSV
VGLFTYAIYRNMQLGLVMALAMLLNLLLAAFVGVLIPLTRKKFGRDPALG
SSVLITAVTDSGGFFIFLGLATLFLL
>NE1976 miaA, tRNA isopentenyltransferase
MNYPDIESPPAIFLMGPTASGKSAMALEIARRFPVEIIGVDSAQVYRFMD
IGSAKPDKLILSEIPHHLIDLIDPDENYSAARFREDALSVMREITARGRV
PLLVGGTMLYFKVLRQGLAALPPADDSVRRALEQQALDKGWPAMHAVLSQ
LDPVTAGRIQPNDSQRIQRALEVCYLTGKPMSEMLEQQQNADFPFRVFNI
ALLPGDRSVLHDRISQRFATMLEAGLIDEVRLIREQFHVNGDMPSMRCVG
YRQVCMYLDNEISFARMQETGVFATRQLAKRQLTWLRSMSGLQIFDCLEN
RLARQIIDLIQAQRLFS
>NE1831 minC, putative cell division inhibitor MinC
MDQSSILDFRSGTFFAPILVLYNNDLEKIEQRLQEKIASAPDFFSSSPVL
FDLSELNENDKKIDVEALISLLRRLSLFPVGIRGGDPSQEKRALELSVPV
DSGRSRNDAILPEIQRKDDTMPEVQPVIREAVRAPAMMITQPVRSGQRIY
AASDLVILAQVSAGAEIMAEGNIHVYNTLRGRALAGVQGDTAARFFCLDL
QAELVSIAGIYKTSEDLKETPKKKPVQVYLRDQALIIEELS
>NE1830 minD, ParA family ATPase
MARIIVVTSGKGGVGKTTTSAAIAMGLAKRGHKTAVIDFDVGLRNLDLIM
GCERRVVYDFVNVINGEANLNQALIKDKNCNQLYILPASQTRDKDALNLE
GVGRVLEELSKDFKYIVCDSPAGIEKGAYLAMYYADDAFVVTNPEVSSVR
DSDRMLGILASKSRRAELNMEPIKEYLLLTRYDPDRVESGEMLGLDDVQE
ILSLHLLGVIPESKSVLNASNSGIPVILDEKSDAGQAYADVVARYLGEKK
PLRFIDSKKGFLRKLFGGK
>NE1829 minE, cell division topological specificity factor MinE
MSLLDYFRSSKSKTASVAKERLQILVAHERYYRNKPSYLPQLQEELMQVI
RKYVQVDQDAISVKFEQDDNQETLELNIILPDSQNTRNTQQDAVRNSF
>NE1033 mltB2, putative membrane-bound lytic transglycosylase
MFLRPAVTNPIKHNTFGHSRNRLTLFILTSLVTHHTSCMAEQPVSGLRLE
IRSFVTDMVSRHGYDAHELEDAFIQTNFRPEILKLISTPASAISWDEYRK
RFVNMQRIKGGVDFWNKYAADLERARKIYGVPEEIIVAIIGIETSYGSST
GNYRVMDALTTLAFDFPRRADYFREELENYLLLAREQKFGLLDIKGSYAG
AIGVPQFMPGSYRRYAVDFNNDGKIDLSRSAVDAIGSIGNYLKEYGWEAG
KPVAIRARAGSQNLQEFLDTDIKPLHSVWKLRQAGITPLDPVADDTLSAL
LELNSQGKRQLWLGFKNFYVITRYNRSTFYAMSVFELARAIHTARSSYSA
DR
>NE0315 mnxG, possible multicopper oxidase
MKNFNFSTALICLAGIFISACKVTDHNSAFGPSGKRIVHADVVAIDQPIY
YNRFGSVNPYGMVYALKRDITVVKQGEEWLPGLSCPSEVRLLEGKRPRPL
VLRGNAGDVLEITFTNKLMKVQPDISRCTLIDKHSPYAHLVDAKGRIDPI
DPPEHEAELRTDIGETQHLPPADWPKTRSANLVIPGLVSLSGTDPRCNGL
QSIDPGESVTCRWKLENEGTHLFSSHGAPAGGEGDGGSLTHGLFGVVIVE
PENSKWYRSQVTASVLDKVWPRLKEDDIVRKGSLQYDASDSEGTPYLNML
KPAGRDQSGSMRYELIYGDLNAVVREDKPENPDAPAFREFTVVFHDELKT
FYANNFQELAVSRQLSGVGDGFGINYGASGMGSILLANRKGIGPAAGCAE
CFYEEFFLQSWANGDPALLEHFEDDPSNVAHSYLNDRVKFRNLHAGPKET
HVFHLHAHQWLSSTDKNSGTYLDSQTIAPHQGFSYSIYDGGLSEWAGKGE
NVHQTLGSGNRNRTVGDSIFHCHLYPHFAQGMWGLWRVHDVIEDGTRTLP
DGQPEAHFYGQPRVALSIEKTKIGGKRPGTDPVTGAAGAGTPVPAILPLP
DQGLPPLPTYAQDPRVGQSEQYAMPGYPFYIPGEPGHRAPQPPLDFARDE
NGLLMHGGLPRHVIRNGERKPVFMSDDLLNQLPADADERAAMVLRQSLAL
GDFRLELTRADIVLLDHDGESIERQAMDFHAGKAARIRLADGSYVNHDPD
KSGYPSLTPEGRRARFYVNGAPSAPGAPYADPCHDPAFKGNRQYEVSAIG
LDLVVNRAGWHDPQARINMLTKDAEQIEGQRRHDMEPFFFRASSGECIEF
KHQNRTEKELELDDFQVATPTDTIGQHIHLVKFDVTSSDGSGNGFNYEDG
TFSHGAVEERIHAASGLAVTSAGERKRLAIGKDLDGKQIDRYQTTVQRWF
ADPLLAVNSKNEGHTSHAMHGAATPDRTCPAHADKECIDRTLRTVFTHDH
FAPSSIQQHGFYSALLVEPAGSTWLKPNGEALEEGVGSKATIIEAVDWLT
HENHREYALAVADFALLYEPALYQPWRDHSTDGMARLLRETQSNVRNSAT
GALFRDLKRHAEHWWKQHGKPVDPPFKPEAISKDHHNPYLVNYKHEPIPL
RIGKMEKKLQTPEETHCGEHAQERDSKGYFKKRMSVSQQQEGNNGDMAYV
FASREHGDPCTPILEAYEGENIQIRMVQGAQEVQHTFAIEGLYWPRIIDY
STSHQFSEEVQKKQDNPSALVSAQEIGISEHFEMRMPPFQNVSNGASVND
YLYHLGTADALWNGAWGLLRIHNGVSAPDPAGCQPDENLAAAVDKWFARR
NLFGRDVEGVPVSCKKLVGNRLKPLPGHGDGQIVITNEKEIFGEDSAPDP
ETGRELPLGCPGGSRLKNFKVVATRVDLLVSSKENKMFYDKENKLFDPDA
LILLTMPPGEANIRLQDVKNIYQQKADAGVLEPLVLRANANDCIALELYN
LLLAEGETELPDTAGDALMPRIVPLNVDQPTGSAAGDVKPSTKLSLSIPL
LATRDPVYFRNLGIGMNRSAEPLNNGQKRIYTFYAGKLELESEAKTKDGP
ACAGGECKFKLNKLPYVFGTLPIKAFGDVINHGAHGLIGTLVIEPEGAVY
LDPVTEKEIPEEDQWKLGSEALIKYEDEQRKKKMFREFVLAYQDGLNWHW
PSPWSRKTEPVGDCPICDDSYDHGEKGINYRAAPFWARLRQGMDSNGRRV
FNDIGAGADLNQVQFPKNFLLSSWADIPTPVFQAKAGEEVRFRVSQPYGR
ARQRAFVSFGADYADMMPGFGSPHSALISAGKAMTATLSGSAKEGCYIYR
DGPMHMFASGVWGHFRVSPADGTTLKCEIPASYEGL
>NE2353 moeZ, Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2
MRLPPLVNPAAELSKDEISRYSRHLLIPEVGLIGQRRLKSSKVLVIGAGG
LGSPVLLYLAAAGVGTIGIIDFDVVDESNLQRQIIHGQSDIGRPKSESAC
DAINELNPYVKVNLHRERLEVVNAHAVISGYDLIVDGTDNFATRYLVNDA
CVLADKPYVWGSIFRFEGQVSVFWENAPGGKGLNYRDLYPEPPPPEMAPS
CAEGGVLGILCASIGAMMATEAIKLITGIGETMLGRLAVYDALEMSYRFI
PLHRAPNRTAIDGLIDYQRFCSVNPVSATYADTIPVISARELKAIKENGT
DVQLIDVRGIEEWDIVHIEGAKHIPKSRIMSEEVLAAMNKEDFIVLHCKM
GMRSRDVLLEMQKQGFTNVKSLDGGILAWIREVDQSLPIY
>NE0046 motA, probable chemotaxis (motility protein A) transmembrane
MLVAIGYVVVIVSVFGGFAMAGGHLGSLFQPVELIMIAGAAVGAFFVGNS
GKAIKATFKALPYIFRDANYTKSVYMELMTLLYEILGKIRKEGLMSIEAD
VDDPGKSPLFGKYPKILSDHHATEFITDYLRLMVGGNLNPLEIENLMDGE
IETHHQEGEVPVHVISKMGDALPAFGIVAAVMGVVHTMESVGIPPAELGK
LIAAALVGTFLGILLAYGFIGPLANRLEQRLHESGKLLECVKVTLLANLN
GYAPALAVEFGRKVLFSTERPSFAELEKHIKQSKAK
>NE0045 motB, Bacterial outer membrane protein
MAEDISQKPIIIKRIKKIAGQHHGGAWKIAYADFVTAMMAFFLLMWLLGS
TTQGDKEGIAEYFKTPLKVALLGGDGSGDSTSVVKGGGDDISKKIGQMKR
SDFDDLKRSIDMDALQALQDRFELEQLEKLKARIEDTINSNPALNKFKSQ
LLLDITSEGLRIQIVDEKNRPMFSMGRAELQSYTKMILHEIGKMLNDVGN
KISLSGHTDATPYPTGEKSYSNWELSADRANASRRELIAGGMSPEKMLRV
VGLSSAVMFDKADPFSPFNRRISIIVMNKKAEDAITRESLGEVDIHSKEE
VDSQIIQ
>NE2217 mpl, mpl;UDP-N-acetylmuramate:L-alanyl-gamma-D- glutamyl-meso-diamino pimelate ligase transmembrane protein
MHIHILGICGTFMGGLAVLARAAGHTVTGCDANVYPPMSTQLRDQQIELI
EGYDADQIKLKPDLFVIGNVVSRGNPLMEAILDAHLPYLSGPQWLAQHLL
HNRWVLAVAGTHGKTTTSSMLAWILEYAGLSPGFLIGGVPENFGLSARLG
IIEAEQTPSPFFVIEADEYDTAFFDKRSKFVHYQPRTVVLNNLEYDHADI
FPDLAAIERQFHHLIRIVPRNGLLVVNGQDTALQRVLNQGCWTPVEQFDV
DDGWQITMEQNESLSVRFQGVSQGILHWELLGEHNCMNALAALAAARHAG
VPAQLGIEALSRFKNVKRRMELRGVINDIHVYDDFAHHPTAIRTTLAGLR
AKIGQARIIAVLEPRSNTMKLGVWQNDLAASLDQADQIFCYAHQIGWNID
SALASLDDKTEIHHDLTTMISAIAATAHPGDHILIMSNGSFSGLHDKLLD
TLRT
>NE0988 mraY, mraY;phospho-N-acetylmuramoyl-pentapeptide-trans ferase
MLLALSQWIAEDIRAFNVFSYITLRTMLAALTALSISFLIGPAMIRSLTA
RKVGQSVRNDGPQSHLIKAGTPTMGGTLILTAVIVTTLLWADLSNRYIWV
VSLTTLGFGAIGWVDDYRKVIQRNSKGLSASSKFFWQSIIALLVAVYLAM
TADLPQHTEMIVPFFKEVAIPLGTFLFIVLTYLVIVGTSNAVNLTDGLDG
LAIMPTVMISGALAIFAYVAGHAVFAKYLGIPHIPNAGELAVFCGALTGA
GLAFLWFNTYPAEVFMGDVGALALGAALGVITVIVRQEIVLVIMGGVFVM
EALSVMIQVASYKLFGQRVFRMAPLHHHYELKGWKENQVVVRFWIITLIL
VLIGLSTLKLR
>NE2317 mrcA, mrcA; penicillin-binding 1 (peptidoglycan synthetase) transmembrane protein
MLFRWFYRFFLVALATGLLAALLVVFAALVTLPSLPSLETLTDYRPKIPL
RIYSADNLLIGEFGEERRDFIKIEEVPKSLKQAILAAEDDRFYQHYGVDY
IGVLRAIYSNFKAGGAHQGASTITMQVARNFFLTKEKTLTRKFSEALLAF
KIEHSLEKDQILELYINQIYLGQRSYGFSSAAQSYFGKQLTDINLAEAAM
LAGLPKAPSRFNPIVNLERAKTRQQYVLKRMHELGYISAEQFARAKETAI
AIHRRVKVFAMPADYVAEMVRQIVYDRYREEAYTRGIKVYTTLRKTDQEA
AYRALRKGVMEYDARHGYRGPEAFLKLPKDLKDREALQDIMQDIGNSDDI
LAAVVLSATPGAVEAFRKNGETIRITGDGLKAAHRFLSNDPKIGEKRIRP
GALIRVKKTEKGSWQIVQLPEIEAALVAMDPNNGAIRALVGGFDYHLNKF
NHATQAWRQPGSSFKPFVYSAALEKGFTPATIINDAPLYLGPEQTGGKAW
EPRNYDGKFSGPIRMRTALTHSKNLASVRILQAIGPAYAQDYATRFGFDK
KHHPAVLPMALGAGSATPMQMAIGYATFANGGYHIHPYFIERIEDEHGNI
IEQARFATAGRNAKQVIDPRNAFLMTSMMQDVVQRGTAARARALKRNDIA
GKTGTTSNAVDAWFCGYQKDLVAVTWMGFDEPRSMGNRETGGQAALPIWI
DYMATALKSIPVAKTSSPKGIIVAKINPETGMRDIFGTLNEYFFQEQLPP
EMDYSFQDPEFTDQTESPMF
>NE2070 mreB, Heat shock protein hsp70:Cell shape determining protein MreB/Mrl
MLNFLNNNFLGNYFSTDMAVDLGTANTLIYVRGHGIVLNEPSVVAIRHEG
GLGGKRMIQQVGLAAKQMLGRTPGNITAIRPMKDGVIADFTVTEQMLKQF
IRKAHPPRLFSANPRIVISVPCGSTQVERRAIREAAYGAGARKVELIEEP
MAAALGADLPVETPTGCMVVDIGGGTTEVAVISLGGVVYSNSVRVGGDRF
DEAIINYIRRNYGIMIGEVTAEIIKSEIGSAYPGSEVREIEVKGRNLAEG
IPRSFTISSNEILEALAEPLNSIVSAVKTALEQTPPELGSDIAEMGMVMT
GGGALLHDIDRLLMEETGLSVIIADDPLTCVVRGAGIALESIDHSVGIFA
RDY
>NE2069 mreC, putative rod shape-determining MreC transmembrane protein
MLRGYMHTPPQFFKVGPSPLTRMFFFVLLSVFIMIADSRSDFIAEARRVI
GVAISPLQKLAYLPFSLQEKVDQYISDFKLIEEIDHLRREYLTSRHDFFR
LQALASENEHLRALLGAARQTEIQTVLAEILYTPRDPFSRKITLGKGSSN
GVKPGQVVIDDLGVIGQVTRVFPWTSEVTLITDKNHMVPVQIDRNGLRTV
ISGAGKNSELELRFLSINTDIREGDLLATSGIGGVYPPGLPVGRVAHIEY
DRSHKFARIICVPVAGVDRHRQVLVLIGSVPTPEISDNPESYAPQKNRQT
KNHGT
>NE2068 mreD, possible rod shape-determining MreD transmembrane protein
MRPRKTGKPRITEHDYVAQEFSLPARRSLVYLNLIMAVLVNLMPFDKVIL
VLRPDFVALTLLYWNIHQPQQAGMGIAFLCGLVMDVVDTSIMGQHAIAYC
LMTFFALILHRRLRLFSAFRQIPAVLWILLLGQAMIFLTGILAGTYIPEW
YFFLGSVTGALCWPLFVFLLGSFRKQRIEPDEI
>NE0624 mrp, Domain of unknown function DUF59
MITQQQIETVLGQIIDPTTGKDYLTSKAVSDIQIKQDNVSVNIELGYPAK
SVLNTVHQQIEQAIRTVPGIGSITVNVTSNIIAHSAQRKLKLIPGVKNVI
AVASGKGGVGKSATAVNLALALAAEGATVGILDADIYGPSQPQMLGVSGR
PDSPDGKTIEPMQAHGIQMMSIGLLIDVETPMVWRGPMVTQALQQLLNDT
RWHDLDYLVIDLPPGTGDIQLTLAQKIPVTGAVIVTTPQDIALLDARKGL
KMFEKVGIPILGIVENMSLHTCSHCGHTEPIFGTGGGEKMCRDYNVELLG
ALPLDIRIREHTDAGKPSVVAEPDGQIADIYRTIARLVAAKISDMARDYS
DVFTQIIMEDD
>NE0530 mrsA, Phosphoglucomutase and phosphomannomutase family
MKKKYFGTDGIRGKVGDFPITPDFFLRLGYAVGKVLLASDRQLAADKRPT
VLIGKDTRISGYMLESALEAGFSAAGVDVLLSGPLPTPAVAYLVRALRIQ
AGAVISASHNPFDDNGIKFFSSAGSKLPDSMELQIEAELDQPMKTTPSIK
LGRVQRLRDAAGRYIEFCKSTFPNQLDLRGLRIVVDCANGADYHIAGHVM
HELGADVITTHASPDGFNINYECGATHIETLQGSILQHKADIGIAVDGDG
DRVLMVSREGVLYDGDSLAYIIAKHRQQLGELQGGVAGTLMTNLAVEQAF
ERLGIPFARANVGDRYVSELLQQNDWYLGAENSGHIICRDKHTTGDGIIS
ALQVLYALRDTGLTLADFMRDVPFFPQRLINVKVSGNFDFRSNPAVAACK
NEAEQALGNDGRILLRASGTEPLIRVMVEGKVLQQTDYWAEKIAETIRQQ
AASSMTGS
>NE1628 mtgA, Glycosyltransferase family 51
MKLAQARRPATSKPRLISTWLLRPLLLLLTAALLYQSWFLLHIVYWRSYS
PTTSAFMQDRLKIMRQQNPAASLQHQWVDYEQISSHLKRAVIAAEDARFL
QHQGFDYKAIETAWKKNLKQRKWAAGGSTISQQLAKNLFLSTEKTVWRKS
RETLITLMLEEFLTKRRILEIYLNIIEWGDGIFGIEAAARHYFGISAASL
TPAQAAWLASIIPNPRFYDTRRTLPKLLNKSRIILSRLPAAKIP
>NE2329 mucD, mucD; serine protease MucD precursor
MADIISRNNFHSGIGSRFLINTVTVADIWRKGIFAICLLVAAFMLSASLH
AKDLPDFTDLVEKHGQAVVNISTVQTQQIDANHFFPGIPNIPEDSPFYEF
FRRHMQPFSGPRKYESRSLGSGFIISKDGYILTNAHVVESANEITVRLTD
KREFGAKVIGTDRKTDIALLKIDADDLPVVTQGSPDQLKVGEWVIAIGAP
FGFENTVTAGIVSAKGRSLAQENYVPFIQTDVAINPGNSGGPLFNMKGEV
VGINSQIYSRTGGFMGLSFAIPIDVAMEITSQLKAYGKVSRGKIGVMIQE
MTDELAESFNLDKSRGALVVSVEKDGPADKAGIKIRDVILRFDGKGIDTS
SDLPRIVGNTKPDARVSVEVWRNGSVKKLTVTVGEMPGDDSSTTVQKQSK
SGDATSRLGLALRELSASQKNELGIDSGLLVEEVYDGIASSAGIRPGDVI
LGFNNQDIKSIQQFNKLLDDAKKGRNIALLIKRGDITTFITIKING
>NE0379 mucD, Serine proteases, trypsin family
MNGVLQSLRAPLFNPFYSCHMFNPAAFPVLRLFIVCLVLLYSPFCQADLV
KTVERIKPAIVGIGSFQKLHSPPVNFLGTGFVVEDGLHAITNAHVISDLS
SNTSKGSLIIMTGKGEKPELRNATIIALDKEHDLALLRFEGTALPAMKLG
DAGTIKEGKLLAFTGFPIGMVLGFYPVTHRATISSITPVILPAQNARQLD
AAKIRQLQKSAYKIFQLDGTAYPGNSGSPLYDPDTGEVYGVVNMVFIKGK
KESILSDPSGISYAIPANYVIDLLKQGGF
>NE1852 murA, EPSP synthase (3-phosphoshikimate 1-carboxyvinyltransferase)
MQKLVIHGGAKLQGEISISGAKNAALPVLCASLLTADTFTIQNLPHLRDI
TTMLALLEQIGVRILTNDPGTAELSAASITNPTASYDMVKTMRAAILVLG
PLLARTGQAYISLPGGCAIGMRPVDQHIKGLQAMGADISIEQGYIRAQAG
RLSGTRIVMDLVTVTGTENLMMAATLASGTTILENAAREPEVVDLADCLI
GMGAKIEGAGSDIIVIEGVDHLHGSSHTVMPDRIETGTFLTAVAACGGDI
TLTRTRADTLDVVLGKLIETGAAIDTGEDWIRLRMQHRPQPVSLRTAPYP
AFPTDMQAQFMALNSIADGTSVMTETIFENRFMHVQELKRLNADIQVEGN
TAIVHGIPQLDGASVMATDLRASACLIIAGLVAQGETIVDRIYHLDRGYE
RIERKLAQAGAQIKRIN
>NE0992 murC, murC; UDP-N-acetylmuramate--alanine ligase protein
MKHKIRHIHFVGIGGSGMGGIAEVLINLGFQISGSDMHSNSTTRRLQCLG
AVIHHTHAAENIQSADAVVISTAIHSDNPEVIAARERRIPVVPRAMMLAE
LLRLRRGIAIAGTHGKTTTTSLVASILAEAGQDPTFVIGGKLKTVDSHAR
LGKGEFIVVEADESDASFLYLQPVLTVVTNIDADHMSTYEHDFNRLKQTF
VEFIEHLPFYGMAVLCVDDPHVREIISMITRPVTTYGIASEDAQICATNI
RHDRCRMHFLAHIGVNGSPRTLEVTLNLPGKHNVLNALAAIAVGNELGVP
DEAIVKALATFGGVDRRFQQYGEIPLPDQGSFALIDDYGHHPAEIAATMA
AARNAFPGRRLVLAFQPHRYSRTRDLFEDFVRVLSGADVLLLTEVYPAGE
EPIIAADSKSLARAIRVQGKIEPIYIEQIDELKATIHTIAQDGDVILIMG
AGSIGKSAPDLAEPAMKLTLITG
>NE0989 murD, UDP-N-acetylmuramoylalanine-D-glutamate ligase
MNYTGKKILVLGMGKTGISMVKWLSRLGAQLSVADTRTSPPNLELISRIV
PGEAIFCGPLKEELFQGIDAIAISPGVAVAEPLVQAALQQGVPVIGDIEL
FAVALDQYAPPGTKILAITGSNGKTTVTSMVGEMVKNAGWDVEVAGNIGP
AALDALMQRMDANKWPHLWALELSSFQLETTSSLRPDAATVLNLSEDHLD
RYDSIEEYAAAKARIFSRPHNNGCVQILNRDDARVYAMADKNSKQVTFGL
SAPVSDEEFGLLPGGSDVWLAQGSTHLLKTSELAVAGLHNAANALAALAL
CRAVDLPFEPLLHALRTFRGLPHRMQKVAEFNGVTFYDDSKSTNIGSAVA
ALNGFRKNVILIAGGDGKGQDFSPLEQPVSKHVRSVVLLGRDADKVAQAI
QASNVPIHRVTTMDEAVQVSFLLAEHGDVVLLSPACASLDMFNNYIHRAE
VFTAAVRLIERKFVLTAQTCH
>NE0986 murE, murE; UDP-N-acetylmuramyl-tripeptide synthetase
MMSTSSDSRSDEGHAACLLNQLDVKVRHLVADSRKLKPGDTFLACAGERH
DARNDIPQAIARGVNAIIWEKQGFSWKPEWKIPNLGVAGLRHEAGKIASE
AYGHPSRHLWLVGITGTNGKTTCSQWYAQAMAALGKKTAVIGTLGHGFPG
ALYPSEHTTPDAVYLQQLMAEYLHQGASSLVMEASSHGLSQDRLIGSEFA
VAVLTNLTRDHLDYHDSMDAYAAAKAKLFFWEGLQYAVLNLDEVLGVELS
QQLAGKDLSIIGYGFKQPRQTAQTAGNQKILYGSNLQFTTQQIGFDVEFC
NRYASLQCNVTGRYNAYNLLAVLAALLASNIDLDDAITALRQVQPIPGRM
EKLGGGNQPVVIVDYAHTPDALNEVLTGLRETLAGTRIKKRIRKNQAKLI
CVIGCGGDRDRGKRPLIGEIASRLADEVIITSDNPRNENPADIISEVMLG
ASGKHCTSEVDRTAAIYRAIHGARKGDIVLIAGKGAETSQEIQGKKYPYD
DREVVRQVLHDLAGPELQVQG
>NE0987 murF, murF; UDP-N-acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate--D-alanyl-D-alanyl ligase protein
MMSTQEAALALHADWSGDNAVFTGVSTDSRTLKPGDLFIALSGEQFDGHR
FISAAIENGAVTAMVSADTAILPTQPDFGWIKVKDTRLGLGQLAASWRRR
YTLPLVAVTGSNGKTTVKEMIAAIFRCEFGLENVLATTGNLNNDIGVPQM
LLQLDSRHVGAVIEMGMNHAGEIAWLSRLAAPTIAVITNAGTAHIEYLGT
TEAIARAKGEIFEGLEEQGVAIINADDPHARLWRQLAGNRPVVDFSMNGT
AAVSARQPAHPSGDRWLLQLPDDTIEITLQVPGRHNVYNALAAAAAATAA
GISTSSIAEGLHSFRGVPGRLQKKTGLNRSVLIDDTYNANPDSMQAALNV
LAEMPGKKILIIGDMGELGADTATFHHRIGQQAASAGVDILLALGESSRQ
AVAGFGRGAQHFADLDTLLEKAKSCLDEQHVFVLVKGSRFMQMERVIEQL
QA
>NE0991 murG, Glycosyltransferase family 28
MIMAGGTGGHVFPGLAVARSMQANGWRIVWLGTRNGMEAALVPQHGFSIE
LINFSGLRGKKLSSYLLLPWRLAQACWQSFRILRRQQPQVVLGMGGYPAL
PGGIMAVLLGKPLLIHEQNRIAGLTNKILAKIADRILLAFPGALTSPENK
TRVTGNPVRTEIARLPSPEARYAHRTGNLHILVVGGSLGAQVLNTVLPQA
LSMIPEDQRPYVTHQSGKAHLDALQQAYADHGVTGNLVAFIENMAAHYQD
CDLVICRAGALTISELAAAGVASILIPYPYAVDDHQTANARFLSDYQAAV
LWPQSELTAASLAQWLMTCSRAQLQSMATHARALAMPEAAQTVAEACQQL
SGQTNEA
>NE0449 murI, Aspartate and glutamate racemases:Glutamate racemase
MQSGFVGIFDSGVGGLAVYRAARKLLPHQAFVYVADSGFAPYGSRESAYI
IRRVTEIADALVNNGAKALVVACNTATVTTVTALRAKYSLPIIGIEPAIK
PAAAISRNGRIVVLTTRRTAESEAVAQLCVRYGTHVQIMLQPCPGLADQV
EAGNIDGEEVKEMLRQYLAPAVSLANDVVVLGCTHFTFLADQIQNLAGPQ
VTIIEPSEAVARQLAHRLSSSKMISVNQAQAAEAFYTTAASPSAVGDIMS
KLLGRRIEVLAANGLGLASIA
>NE1742 mutL, mutL; DNA mismatch repair protein
MRPIKLLPDGLISQIAAGEVIERPASVLKELLENAIDAGTTDISVNIAQG
GLKLIRVTDNGGGISGEELPLALTRHATSKIASQEDLYRITSLGFRGEGL
ASIASVSNLLLISHQPGGKHAWQIRSEGIRVMQPEPSSHAAGTTVEVRDL
FFNLPARRKFLKTEATEFAHCEEIIRRMALSHAGIAFTLRHNGNLRGHWQ
SAEAAVRIKTVLGEEFTRSAAWIDERSAGIGLQGMLALPAYSRAARDMQY
FFVNGRFVRDKLITHALREAYRDVLHLDRHAAFVLYLDIDPEQVDVNVHP
TKTEIRFREARAIHQFIYHGVSKALSLPRSGTELSQSSSQLMADDIVPPA
EKRVPAAPMLNYPRQTGLPSEMIAQPFNFYQVLSGSESDSTATQNPFRQT
GAGESNEHPALPPLGFALGQLHGVYILAQNWKGLVIVDMHAAHERIVYEQ
LKLQMDEQTLSAQRLLIPVTFHADSLDIATAEENQSLLQQLGFEVTVLTA
TTLAVRAVPAILQDADTEKLVCNVLDEIRNGDPGQLLAARRNELLATMAC
HGAVRANRPLTLIEMNELLRKMEVTERSDQCNHGRPTWFEISLAELDKMF
MRGK
>NE2552 mutM,fpg, Formamidopyrimidine-DNA glycolase
MPELPEVEITRRGIDTHLAGRVITQISIRNPVLRWPISAGLIALLPGQRI
NAIARRAKYLLFACSRGTLIMHLGMSGNLRVLPESTPPQLHDHFDLQVDN
GMMLRFRDPRRFGAILWWDGDIRQHPLLQKLGPEPLSDDFDGQFLYTKTR
GRNASIKEVLMNQHIVVGIGNIYANEALFQAGISPLAAAGSLNTMQCERL
VDAVKATLLRAIKAGGSSLRDFTDCEGSPGYFQQQYWVYGRAGQSCRQCG
ELVSKTRQGQRSTFFCARCQH
>NE1705 mutS, mutS; DNA mismatch repair protein
MNKAEQSSHTPMMQQYLRIKAQHTDKLLFYRMGDFYELFYEDAEKAAKLL
DITLTQRGSSAGEPIKMAGVPFHAADQYLARLVRLGESIAICEQTGDPAT
SKGPVERQVIRILTPGTLTDAGLLEERSNSIVLALALHRGSIGLAWLNLA
AGDMRVLETSSDNLTSELERLHPAEILLPESLDLPATLNNFAGPKRLPDW
QFDYEHAMQQLTRQFGTRDLNAFGCEDLHAAIMAAGALFEYVRLTQQTAT
DGSSGQLPGHLHTLQVERQDAYLRMDAATRRNLEITLTLRGEDAPTLSSL
LDTCSTGMGSRLLRHWLHHPLRNRITLQQRLDTVSDLIGAQPETLYAGIR
QQFKHIADIERITSRIALRTARPRDLSGLRDSLMRLPGIIELIATSAAAA
VHRFIPPMQPDPLLTQLLVRALQPVPGAVIREGGVIADGFDAELDELRGL
QGNCDEFLLQLEARERERTGIPNLKVEYNRVHGFYIEVTRAQGEKIPPDY
RRRQTLKNAERYIIPELQAFEHKTLSAREQALAREKMLYERLLEQLADFI
IPLQEIARSVAELDVLCAFAERAALSGYTKPVFTDDPVLIIEAGRHPVVE
NQVEHYIANDVQLGAITRENRQMLVITGPNMGGKSTYMRQTALTVLLAHC
GSFVPAQIARIGPIDQIFTRIGAADDLAGGRSTFMVEMTEAAGILRNATA
QSLVLVDEIGRGTSTFDGLALAFAIARHLLTQNQSYTLFATHYFELTRLA
EEFPQAVNIHVTAVEHKRRIVFLHRIEEGPASRSYGLHVAALAGVPDRVI
RNAAKILARLEQETLSRSPQQTLFETVEENAKAVPASVHPVLDYLERIHP
DELTPRGALEQLYLIKSMLNQTD
>NE0056 mutY, HhH-GPD
MTPRTAGTIHFPADAPDSFAGRLIRWQLECGRHSLPWQGTRDPYAIWVSE
VMLQQTQVSSVIPYYQRFMASFPDVASLAGVPVGDVLTLWSGLGYYSRAR
NLHRAACVIMEQYSGVFPQDAATLQRLPGIGRSTAAAIAAFAFGERGTIL
DGNVKRILARYFGISGYPGEKSVEERLWQLAESLLPAEESNHQIVVSYTQ
ALMDLGALVCARSRPRCQYCPLQADCIACQNDLTADLPVPKPRKTLPVRE
TVHLILLDQERILLKKRPASGIWGGLWCFPEMSVDQDSIDYCEKNLHVRV
TKLARLPHLQHTFTHFKLIIQPHLLQSIMHQPVCEEKCEENSYLWLTIEQ
AMQQAIPVPVRKLLSMAYPYFQYHIHE
>NE2405 mviN, Virulence factor MVIN-like
MNLLKALATVSSMTLVSRILGFVRDLIIARIFGAGVATDAFFVAFRIPNL
LRRLFAEGAFSQAFVPVLAEYKNNRTEEQTRELIDHVATLLGSALFIVTL
VGILAAPLIIYISAPGFAGVPDKFELTIALLRITFPYIFFISLVALAGGI
LNTYSHFSVPALTPVLLNLSFIGCALWLAPLMDPPVLALAWAVFIGGMLQ
LAFQIPFLLRLKRMPRLRFGFRDSGAWRVLKLMGPAVFGVSIGQISLLIN
TIFASLLITGSVSWLYYADRLMEFPAGMLGVALGTVILPSLSRHYTQNST
EEFSRLLDWGLRLTFLLTLPAAVALALLATPLITTLFYYGAFTVEDVWMT
REALIAYSVGLLGLILVKVLAPGFYARQNIKTPVKVAILTLAATQLMNLA
FIIPLKHAGLALAISLGACLNAGVLYSKLRSQGIYQPLPGWGIFIFKILV
ALIVMGAGLWLATGNSAEWFVLTATERAIKLGLVVILGGIGYFACLWMLG
FRLRDFARQ
>NE0066 nadA, Quinolinate synthetase A protein
MQANAIDFENYELLQDDVCARRIVAAKQKLGKRAIILAHHYQRADVYCHA
DLTGDSLKLSYLAARTDTEFLVFCGVHFMAEVADILSSSEQTVILPDLSA
GCSMADMANLTKVERVWRELAEVLDPDERVTPVTYINSSADLKAFCGQHD
GIVCTSSNAVKILQWAFSRREKVLFFPDQHLGRWSGYKMGLSPDEMPVWD
FDEPMGGLTPEQIEKARILLWKGHCSVHQMFQPQHILRFRNQYPDGLVIS
HPECSFEVCKASDYVGSTEYIINTIRAAKSGTRWLVGTELNLVGRIAEEF
KAEGKIIQFMSPMVCMCSTMARIDPQHLAWSLENLAEGRVVNQIRVPEPQ
ASLAKLTLEKMLEVSK
>NE1892 nadB1, nadB1; l-aspartate oxidase (quinolinate synthetase B) oxidoreductase protein
MQPLRFDALIIGSGLAGLTLALNLAETQKVALVTKRTLHDSSSAWAQGGI
AAVLSIEDSPEAHIRDTLIAGAGLCNEAMTRHIVENGPQAVRWLIARGVD
FTRDDHNETGYHLTREGGHSVRRIIHSGDATGKVVQLTLTRQAAQHPNIT
VLEHHIAIDLITSEKLSHAGENNRCYGAYVLDIRAGKVRTIAARNTILAT
GGAGKVYLYTTNPDVSTGDGIAMGWRAGCRVANMEFVQFHPTCLYHPHAK
SFLITEAVRGEGGILKLPDGERFMLRHDERAELAPRDIVARAIDFEMKKR
GLDCVFLDISHQPADFLVQHFPTIYKRCLELGIDITREPIPVVPAAHYTC
GGIMTDQCGRTDLDNLYAIGETAHTGLHGANRLASNSLLECLVTGLSAVE
DILVRSPAPDLLLPDWDESRVTDADEEVVISHNWDELRRFMWNYVGIVRT
SKRLQRAQHRIRLLSEEIGEYYSNFRVTSDLLELRNLVCTSDLIVRSAML
RHESRGLHFSRDYPEILPQAVDTILSR
>NE2122 nadC, nadC; nicotinate-nucleotide pyrophosphorylase (carboxylating) quinolinate phosphoribosyltransferase (decarboxylating)
MLQSEAISNQVRLALSEDIGTGDLTASLIPPGKCLSAQVIVREAIVICGI
PWFDESFRQLSPSVRIDWHVSEGQQTSAGQTLCVLDGDARALLTGERTAL
NFLQMLSAVATRTRKFVEAIEGTGTQIVDTRKTLPGLRLAQKYAVRCGGG
VNHRTGLYDGILIKENHIIAAGSIEAALRKAIEIAPPEVFIQIEVETQGE
LMQALSAGARMVLLDNFDLPALRDAVAFNRQFPGGPAVLEASGNVTLDTV
RAIAETGVDRISIGSLTKDIRAVDLSMRFYEAVL
>NE0359 nadD, Cytidylyltransferase
MAEITRYSLTGIYGGTFDPIHYGHLRIAEELADIVELNHLFFLPAGRPRL
RTPPFVAGEHRVAMLQEAIRGNTRFSVDDREVRRPGETYSVESLREIRQE
YEASESVALCFITGTDAFIKLPYWHRWRELFELCHLIIVNRPGSVPIRYP
SDLPDELRGVCQDRWTTMADELKNSPVGLIFTAPTTLLDISSTSIRNIIA
SGKSARYLLPESVLNYIDKYGFYAGGK
>NE1896 nadE, Carbon-nitrogen hydrolase:NAD+ synthase
MKIALAQINCTPGDLRGNQLKILHACRQAREAGADLVITPEMSLCGYLAE
DWLLRREFVQACHQALTELTAQVYDVTLIVGHPHNMNGNLFNAVSAVRDG
RLLATHCKQHLFSDRLQDERRYFSAGNSLCTFECSGILFGLMTGSDYRHA
AHLQSLHAAGAQVLLAVDASPYSIDSQIDRYQILREGITQTGLPAVYINP
VGGQDELVFDGASFAMDHSGKLVCQLPAFQEALALIAIHGNQSIFGECST
LPDQAGSIYTALRLGLHDFITKNRLPGVLIGLSGGVDSALVLAIAVDALG
AERVRTVMMPSPYTADISIQDAQTMADNLGVRHAGIPITGLFDQFQQALQ
AELQACSDSGTSATVENLQARIRGTLLMALANQSGMLVLPTSNKSETAVG
YSTLYGDMAGGFSILKDVSKTLVYRLCHYRNQISPIIPQRILQRPPSAEL
RPGQIDQDSLPPYDVLDAIIEAYVENDLSAAEIIAMNYPEETVRRVLRMI
HSSEYKRRQAAPGIRITRRDFGRSWRFPLTSGFPD
>NE0144 ndk, Nucleoside diphosphate kinase
MAVERTLSIIKPDAVAKNVIGQIYARFEAAGLKVVAARMAHLSRVEAENF
YAIHRERPFFKDLVEFMISGPVMIQVLEGENAIARNRELMGATDPRKAEK
GTIRADFAESIDANAVHGSDAPETAVVEIACFFPSLEIHSR
>NE1569 neuA1, Cytidylyltransferase (CMP-NeuAc synthetase)
MLIYSLIPARGGSKGVPHKNIRLLCGKPLITHSIEISLKSPSIQKTFVST
DSEKIAEVARNAGAEVPFLRPADLAQDDTRDLPVFLHFLGWLEQNHVPLP
DAIFQFRPTSPARRVEKIEEAVELLKKHPDADSVRGVTEPAQNPYKMWTI
DDNGFLRALLNIPGVPEPFNEPRQSLPAVYWQVGYLDLIRTRTILEKKSL
TGIRILPLKIEGRDSIDIDDEFSFQLAEFLMEKRGVTL
>NE1184 nlaB, Phospholipid and glycerol acyltransferase (from 'motifs_6.msf')
MAGLRSLIYMLLQAIITPPYALFTLMCFALPPHSRYQVTYGWTRLMLFML
RTICGLRYEVLGRENIPDQPSIILSKHQSAWETLALQQIFPPQVWVLKKE
LLRIPFFGWGLAMTSPIAIDRSAGKAALEQIVEQGRERLQQNFWIVVFPE
GTRIPPGKKGKYRIGGAWLAVHTGALVVPVAHNAGEFWGRNSFIKYPGTI
TLSIGHPIHPDGIEAGELMAQAEAWIEAETARISRPSHRPIE
>NE2004 norB, Cytochrome c oxidase, subunit I
MKYSTQKLAYPYFIAALLLFMVQVLAGVLAGSVYAMPNFLSESLPFHIIR
MIHTNALLVWLLLGYFGAAYFLIPEESERDIHSPKVAWLQLFIFVFAALA
AVVSYLAGVHEGREFLEQPLWIKILLVVAFLLFLYNVSMTVLKGRKTVVT
TILLLGMWGAALMFLFAFYNPGNLALDKMYWWYVVHIWVEGVWELIMASM
LAFLLIKMTGVDREVIEKWLYVIVGLALFSGLLGTGHHYYWIGTPGYWQW
IGSIFSVLEVLPFFAMVLWCFHMVYRSGRNHPNKAAMLWSLGCPVMAFFG
AGVWGLLHSFSQVNFYSHGTQITAAHGHMAFYGAYVMLNLAFFTYALPQL
RNAQPYNQILNMWSFWVMTAAMAFMTFTLTFAGVLQTHLQRVVGMGFMEI
QQQLDLFYWMRLLAGVAFLVGALMYLYATLGPTTKQVSSSPNMQPAAR
>NE2006 norD, von Willebrand factor type A domain
MWQWLELEEQIGRFWHRLVGQTANSYPEYPSVAVSLECVHTALCTFFHGM
GGHHGLPLVAGMPQTTPHRRNLKQRLGGDVEKLPLAMLDRERLMLPASIS
HFPETSLNRQCYFWLAAFFASAPESVSASFPDDPLQADLVFLHYADQISL
HVCERYPGLEKTYRTLCAHVLQSRPHRSLPAQEQAVEAAVLALLGQPCID
PAGANLLARIRQPEPDFNDLRADKGYHPFLPVPLWGVVQPNSSQQDAHQT
MPADTNTDHADRSARQEEDKHRRKARRGRFDQSERDDPLLLNRFEKLITW
SEMVNVNRAVEDDDEADARNAAESMEELAITSHPRSASTVLKFDLDLSPE
DTDPASLLAQLTYREWDYRRKQYHAAHCQIWCQIAGETGEQWEPDDRARQ
QFRRVRRQFEALRPDKEILPRQLDGAELDLDALVRARADQTASGKGSDRL
YLAVRQQTRNLAVMILVDVSLSTDSWINNRRILDIEKETLVALATGIAAC
RDTFSIHAFTSRKRHFVKLTTIKDFDSPFSSRTLRRIAALRPGYYTRIGA
ALRHTSQLLAQRPERHRLLLLLSDGKPNDLDHYEGRYGIEDTRQAVMEAR
RLGLKLFGITIDREAKDYFPYLFGRGGYAIINRPERLIEIVPALYRQLTG
>NE2397 nqrA, Na+-translocating NADH:ubiquinone oxidoreductase subunit Nrq1
MFIKLNKGLDLPISGEPEQCVYAATAVRHIAVAGVDYIDLKPTMKVTEGD
RVRLGQPLFEYKKLPGVVFTAPGAGRIVAINRGTRRILLSVVLQLDEQED
EETFIKYAPDELSNLADEQVKENLLISGLWTTLRTRPYSKVPDPATRPAA
IFITAIDSNPLAADPVPIITAEMESFNHGLRILSRLTDGPLWVCQSPQAK
LSLPEDLPQLQQARFAGPHPAGLPGTHIHFLQPVSAYKTVWYLNYQEVIA
IGKLFVTGRLWTERIVALGGPGVKKPRLLRTRLGASLEELLAGELVESMD
KRVISGSVWSGRKAVDELAFLGRHHLQVTVIGEKSEREFLGWLNPGGNKY
SKLNVLFSSFFRKKRKFEFTASQQGSPRAMIPIDTFEEVMPLDILPAQLL
RALLVMDTDMAQKLGCLELDEEDLALCSFICVGKHDYGVILRENLRQIEK
EG
>NE2396 nqrB, Na+-translocating NADH:ubiquinone oxidoreductase subunit Nrq2
MRRLLNNIKPHFDRGGRFEKYYALYEMVDTFLYTPDDSTRSAPHVRDAID
LKRLMSYVVIALLPCILWSWYNTGYQANLALQELGEIGQNWRNNLLLTLN
VGFDPGSVLASTIHGLLYFLPIYLTTLAVGGFWEVLFALVRKHEVNEGFL
VTSMLFALTLPPNMPLWMVALGISFGVVIGKEVFGGTGKNFLNPALVGRA
FLYFAYPADISGDLVWVAVDGYTSATPLGLGAIGGMEAVTAGGYTWWNAF
IGLMPGSLGETSTLACLLGAIFLIYTKVASWRIIFGVFFGMVATSFLFNI
LESDNPMMAMPWHWHLVLGGFAFGMVFMATDPVSAAMTNAGRWVFGLLIG
FMTVLIRVVNPAFPEGIMLAILFANVFAPLIDYVVIQLNIRRRLRRHG
>NE2395 nqrC, Na+-translocating NADH:ubiquinone oxidoreductase subunit Nrq3
MAEPNSAPVQHSARRTVGVALAVCLVCSLFVTSAAVTLRPIQIANQVKER
QRVLVELAGLTALGLSIESAYRLFDVRMVELASGNFVDDQDTERFDITKA
AKDEAQSTALTREEDIAQIQRKPKYLPVYLIHNEAGKLETLILPIYGHGL
WSTLYGFIALEGDLRTVKGLRFYQHAETPGLGGEVDNPRWLTSWEGKVIF
DEAWQPRIELIRGSVGSATPDSQHKVDGLSGATMTTRGVDNLLKFWFGEN
GFGPFLLRLREERS
>NE2394 nqrD, RnfA-Nqr electron transport subunit
MMAQSVRKILFTPVVNENPITLQVLGICSALAVTSKVSTALTMCIALTLV
VACSSMLISLIRHYIPTNIRIIIQMTIAASLVIVVDQFLRAFAYEVSREL
SVFVSLIVTNCIVMGRAEAFAMKNPPWPSFLDGLGNGLGYSAILLIVGII
RELFGSGSLLGVTILPLIRDGGWYEPNGLLLLPPSAFFLIGLIIWGVRSW
KTNQVEQPEFRIQQTHNPTEIIR
>NE2393 nqrE, RnfA-Nqr electron transport subunit
MNSLAGLFITAVFVENLALTFFLGMCTFLAISKKIEVAFGMGIAVIVVQT
LTVPINNLVYQYLLRDGALVWAGLAEIDLTFLGLVSYLGVIAAIVQILEM
FLDRFMPALHSALGIYLPLIAVNCAILGGSLFMVERDYNFTESLVYGLGS
GFGWALAIVALAGVRERLKYSDVPDGLQGLGITFISAGLMAMGFMAFSGI
RL
>NE2392 nqrF, nqrF; Na(+)-translocating NADH-ubiquinone reductase subunit F
MLEIFLGVFFFTAIIIVLVFVILGARAKLVTHGRVTINVNDERTIEALTG
GKLLGTLATAGIFVSSPCGGSGTCGQCRVKVFEGGGDILPTETSHINKHD
ARAGYRLSCQVSVKQNMKIVVPDEAFGVRKWSCKVRSNHNVATFIKELVL
ELPESEQFDFRAGGYIQIECPPYVSAFKDFDIEERFRKDWDHYDFWRYVS
KTQEPVMRAYSMANYPEERGIIMLNVRIATPPPQREDVPPGQASSYIFSL
KPDDSVTVTGPFGDFFARDTENEMIFIGGGAGMAPLRSHILDQLCRLKSR
RKISFWYGARSRNEMFYVEDFDRLSKEYDNFKWYVALSDALPEDNWQGYT
GFIHQILFDNYLKDHPAPEDCEYYLCGPPMMSKAVIDMLIGLGVEQENIL
FDDFGG
>NE2423 nrdA, Ribonucleotide reductase large subunit
MQLATDLPSSSPVQQKNFLEEMTDLPATRQNYSQYKIIRRNGAVVAFEPG
KISIALTKAFIAVRGGQSAASSSMREIVAQLTGEVVHALTRRRPEGGTFH
IEDIQDQVELALMRSGDHDIARAYVLYREERARERARQQSAQPASGQVAV
LHVVENGQKIPLDPAQLMARIESACQNLGNTVDAALIVKSMLKDLYDGVP
AEEVRKSAILSARALIEKEPAYSFVTARLLLDNIRYEVLGEEVAHEAMQS
RYAEYFPEFVRKGIEAGLLDERLGQFDLARLSEALAADRDLQFGYLGLQT
LYDRYFLHVSERRIELPQIFFMRVAMGLALNEIDREARAIEFYHLLSKFD
FMSSTPTLFNSGTRRSQLSSCYLSTVPDSLDGIYEAIKENALLSKFAGGL
GNDWTPVRAMGARIKGTNGKSQGVVPFLKVVSDTAVAVNQGGKRKGAVCA
YLETWHLDIEEFLELRKNTGDDRRRTHDMNTAVWVPDLFMKRMQEDQEWT
LFSPSDTPDLHDKFGRAFEEAYIAYEEQAAQGRLTLFKRIPARQLWRKML
TMLFETGHPWITFKDPCNIRSPQNHVGVVHSSNLCTEITLNTSEKEIAVC
NLGSVNLLAHIVDGKLDQARLKQTVTTAMRMLDNVIDINFYAVAKARNAN
FRHRPVGLGIMGFQDCLHKLGISYASPEAVEFADRSMEAVAYHTYWASTE
LAEERGTYATYRGSLWDRGILPQDTLELLREERGGHVEIDTSSTQDWDAL
RDRIRQYGMRNSNCLAIAPTATISNIIGVSASIEPTYQNLYVKSNLSGEF
TVVNQWLVNDLKKQDLWDEVMIADLKHYDGAVSKIDRVPENLRNLYATAF
EVDPRWLVEAASRRQKWIDQAQSLNLYMAGASGKKMDELYRLAWLRGLKT
TYYLRALGATTAEKSTVHTGTLNAVPAQHEAHAGNLAHSAPQQCAIDNPE
CEACQ
>NE2422 nrdB, Ribonucleotide reductase
MLTFEDEVFQPGNILAGTSAQAGMANKMIAERQVLDEEIPAEDHRRVSVE
DKRIINCKADVNQLVPFKYKWAWEKYLSACANHWMPQEINMTRDIALWKD
PNGLTEDERRIVKRNLGFFVTADSLAANNIVLGTYRHITNPECRQYLLRQ
AFEEAIHTHAYQYIVESIGLDEGEIFNAYHEIGSIRDKDEFLIPFIDTLT
DPSFRTGTLETDQQLLKSLIVFACIMEGLFFYVGFVQILALGRQNKMTGA
AEQYQYILRDESMHCNFGIDVINQIKLENPHLWTREFREEISALMRKAVE
LEYRYAEDTMPRGVLGLNAPMFKEYLRFIINRRSHQIGLDPLFPEAGNPF
PWMSEMIDLKKEKNFFETRVIEYQTGGVLSWD
>NE1737 nrtD, nrtD; nitrate/nitrite transport system ATP-binding protein
MGTVNSMNDIATHSVGARLEFSNIRKSFGNTVSVEADNLTIEPGSLTVLL
GRSGCGKTTLLNLAAGLDFPDHGSVSYDNQILQGPAPATALIFQTHNLFP
WMTAEENVAFPLRNQGMAKSDAQEHARQYLKQVGLESFSGHKPSQLSGGM
RQRIALARTIATKPRLLLLDEPFSALDMQTRRMMQRYLLSVWNDSAATIL
MITHDLHEALMLADRIALMASSPRGHIAEIMDIELTRPRNPEDPAFRAIQ
QQLDRFLEHETLLAEHDPTLESTDQQK
>NE2223 nth, HhH-GPD:Iron-sulfur cluster loop (FCL)
MNTTKRREIFTRFRAANPRPTTELEYQTPFQLLIAVILSAQATDKSVNLA
TRKLFLVADTPEKILQLGETGLSPFIQRIGLFRTKTRNILATCQLLIEQY
NGEVPRTRTELEKLPGVGRKTASVILNTAFGEPTIAVDTHIFRVANRIGI
APGKNVLEVERKLLKVVPDEFRHDAHHWLILHGRYICKARKPLCHQCLIV
DLCEFKEKNLEGTASSLDMKQLT
>NE2253 ntpA, NUDIX hydrolase
MQRYKLPVSVLVVIYTADLQVLLLERADHPGYWQSVTGSQDPGETLLQTA
VREVREETGLNTDDYVLSDWQIQNRYEIFEEWNWRYPPGTTHNTEHVFGL
ELPKTIPAVVSSREHLGYVWLPWREAAEKVFSSSNACAIRMLASKRKSEN
SR
>NE0498 ntrR2, PIN (PilT N terminus) domain
MISPRYLLDTNILSDLVRYPQGVIARRIEEVGEAAVCTSIIVAAELRFGA
ARRNSLRLTRQVEAILAAIEVLPLDTPVDRAYAQLRWVLEQSGQVIGPND
MLITAQAMASQCVLITANLDKFSRVGELQVENWLVR
>NE1777 nuoA, NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
MLNTMLGNYFPILLFILVGLAIGVLSMLAGWLLAPNKPDAEKLSPYECGF
GAFEDARMKFDVRYYLIAILFILFDLEIAFLFPWAVVLKEIGWFGFVAML
VFLGLLVVGFIYEWVKGALEWD
>NE1776 nuoB, Respiratory-chain NADH dehydrogenase 20 Kd subunit
MGIEGVLDKGFVTTSLDSLINWGRTGSMWPMTFGLACCAVEMMQTGASRY
DLDRFGIVFRPSPRQSDVMIVAGTLCNKMAPALRKVYDQMAEPRWVISMG
SCANGGGYYHYSYSVVRGCDRIVPVDIYVPGCPPTAEALLYGIIQLQNKI
KRTNTIAR
>NE1775 nuoC, Respiratory-chain NADH dehydrogenase 30 Kd subunit
MTNSRLEKLAADLQRILGDRQIDISCALGELTLLVHSRDLPDIAEVLRDH
QDLGFDTLIDLCGVDFSEYSTDTHAGYKREDRRFAVVYHLLSVKHNHRLR
VRVFAEDNEFPMVDSVMPVWPSANWFEREAFDLFGIIFNNHPDLRRILTD
YGFIGNPFRKDFPLSGHVEMRYDPDQKRVVYQPVTIEPREITPYVIREEQ
YGREEI
>NE1774 nuoD, NADH-ubiquinone oxidoreductase 49Kd chain
MAEIRNYTMNFGPQHPAAHGVLRLVMELDGEVIRRADPHIGLLHRATEKL
AENKTYVQSVPYMDRLDYVSMMVNEHAYVMAIEKLLQIEVPIRAQYIRVM
FDEITRILNHLLWLGAHALDVGAMTVFLYAFREREDLMDCYEAVSGARLH
AAYYRPGGVYRDLPDNMPQYQPSAIHDEKATRARNENRQGSLLDFIEDFT
RRFPGYIDDYEALLTDNRIWKQRLVDIGVVSPDRAKALGFTGPMLRGSGV
EWDLRKKQPYEVYDQVDFDIPVGANGDCYDRYLVRIEEMRQSNHIIKQCV
EWLRKNPGPVITDNHKVAPPSRLAMKQNMEEMIHHFKLFTEGMHVPRGEA
YAAVEHPKGEFGIYIVSDGANKPYRLKIRAPGFAHLAALDEMTKGHMIAD
LVAIIGTQDIVFGEIDR
>NE1773 nuoE, Respiratory-chain NADH dehydrogenase 24 Kd subunit
MSSSMLSTEALRKIDREVAKYPADRKQSAVMSALAIAQDEKGWLATETMD
FIADYLEMPAIAVYEVATFYNMYNLKPVGKYKLTVCTNLPCALSGGNQTA
DYLKQKLGIGFNETTTDGLFTLKEGECMGSCGDAPVLLVNNKRMCSFMTE
DQIDKLLEELNR
>NE1772 nuoF, Respiratory-chain NADH dehydrogenase 51 Kd subunit
MTQPTQVIMAGVDTADSDNWRLKNYLKRDGYAALKKILSQKISPEAVIDE
VKKSALRGRGGAGFPTGLKWSFMPKQYTGEKYLVCNSDEGEPGTFKDRDI
MRYNPHILIEGMLIAAYAMGIRTGYNYVHGEIWDVYERMEEAVEEANVAG
FLGDNILGSGFSFRLYNHHGYGAYICGEETALLESIEGKKGQPRFKPPFP
ANFGLYGKPTTINNTETFASVPWIIRNGGERYLQLGKPNNGGTKIFSVSG
HVNKPGNYEVPMGTPFAKLLELAGGMRGGRKLKGCIPGGSSMPVLPGDIM
METDMDYDSIAKAGSMLGSGAVIIMDDTTCMVRALERLSYFYYEESCGQC
TPCREGTGWLYRIINRIEHGKGRVEDLDLLDNLADNIQGRTICALGDAAA
MPVRAMLQHFRDEFVFHIEHKKCMV
>NE1771 nuoG, Ferredoxin:Prokaryotic molybdopterin oxidoreductases
MINIEIDGKQVTVAQGSTVMDAARQIGIYIPHFCYHKKLSIAANCRMCLV
EVEKAPKPLPACATPVAEGMKVSTHSQQAVTAQKGVMEFLLINHPLDCPI
CDQGGECQLQDLAVGYGSSGSRYQEPKRAVTNKNLGPLIATDMTRCIHCT
RCVRFGQEIAGIMELGMAGRGEHSEILTFVGRAVDSELSGNVIDLCPVGA
LVSKPFRYSARTWELSRRKSISPHCGLGSNLVVQVKKNRVMRVLPRENEA
VNECWLSDKDRFSYEGLNSEDRLTVPMIRKSGQWQVCGWQEALAYAAEGI
SNVVKNQGPLSIGALGSAHSTLEELYLLQKLMRGLGSHNIDHRVRQSDFR
TDAVLRGAPWLGMQIAEVSQLKAVLVVGSTLRKDHPLLAQRMRQAVKEGG
QLSLINPSGDDQLTKIAYRAIVAPSAMLDMVLQVLKAAAEIKNIPVPGHI
KSAANAVEVSEVARGIANSLINHAPAAVFLGNLAQHHPHFADLHCAALAV
SKITEAQFGLLGEAANSVGAYSAGALPWSTGTHSVNKTVSEKVGLNAGQM
LGLGSPDGEPACSAYVLLNLEPELDCYDTGRALQVMNKAEFVVSLSSYKG
NIPDYADVILPVAPFTETSGTFVNTEGRIQSFSGVVPPLGEARPAWKILR
VLGNLLQLEGFDYDSSEQVRAEIVSAEEEFVRGLNNALEAFSISENHQKN
SGIERIGEIPIYQADPIVRRASSLQKTHDAVSPVAFASPGLLDKSGIQAG
ETVKIKQQDETIQLEISADTSLPDNCVRLACAHLQTSRLGGMFDEIKLEK
L
>NE1770 nuoH, Respiratory-chain NADH dehydrogenase subunit 1
MDSLQPLFEQLFGSEWGGPLFLLIKNLLLILAIVIPLLLAVAYLTFAERK
IIAYMQVRVGPNRVTFFDIPWLRGWGQPIADAVKAIMKEIIIPTGANKFL
FLLAPVLAIGPALAAWAVVPFSPELVLADINAGLLYILAMTSLGVYGVII
AGWASNSKYAFLGAMRSAAQVVSYELAMGFALVCVLMMSSSLNLGDIVAG
QQGGSFLNWYLIPLFPMFLVYFISGVAETNRAPFDVAEGESEIVAGFHVD
YSGMAFTVFFLAEYANMILVATLASIMFLGGWLPPVDIAPFNLIPGMVWL
LLKIAFMLFFFLWFRATFPRYRYDQIMRLGWKVFIPLTLVWIVVLGMVMQ
LPEVVRQSFPLNLWFN
>NE1769 nuoI, nuoI; NADH dehydrogenase I (chain I) oxidoreductase protein
MERIKQFFKSFLLVEMLKGMKVTGRYLFAPKITVHYPEEKTPQSPRFRGL
HALRRYPNGEERCIACKLCEAVCPAMAITIESEQRDDSTRRTTRYDIDMI
KCIFCGFCEEACPVDAIVETRVLEYHGEVRGDLTYTKEMLLAVGDRYEEQ
IARDRAADAPYR
>NE1768 nuoJ, NADH-ubiquinone/plastoquinone oxidoreductase chain 6
MNLQDFVFYSLASVVVIAALGVITLRNPVHSALLLVLAFVTSSGLWLLLE
AEFLAIVLILVYVGAVMVLFLFVVMMLDINLDRMREGFWKWFPFGAILAL
IMTGEIGIVLMSKQFGPENIQIPVSKSVDYSNTKELGQLLYTDYVYAFEL
SAVILLVAMVAAIAITLRYRTDKKSSNVSKQVAASREKRLKVISIPAERK
E
>NE1767 nuoK, possible nuoK; transmembrane NADH dehydrogenase I (chain K) oxidoreductase protein
MVSLSHYLVLGALLFAIGVVGIFLNRKNVIILLMSIELMLLAVNMNFVAF
SHFLQDTAGQIFVFFILTVAAAEAAIGLAILVALFRNLRTINVDDLDELK
G
>NE1766 nuoL, probable nuoL; transmembrane NADH dehydrogenase I (chain L) oxidoreductase protein
MENLYLLVPMAPLVGAIIAGLFGRRVGLIWSHRATIILVALSLVSSIIIF
VDVLQGNTYNGSVYTWLTSGETIFEIGFLIDTLSATMMVVVTSVSLMVHI
YTIGYMHGDPGYQRFFSYISLFTFSMLMLVMSNNFLQLFFGWEAVGLVSY
LLIGFWYTRPTAIYANLKAFLVNRVGDFGFLLGIGLVLMYFGTLDYELVF
AHAEHMASRSVEIWPGQSWMLMTVICLLLFVGAMGKSAQFPLHVWLPDSM
EGPTPISALIHAATMVTAGIFMVARMSPLFEHSEAALSTVLVIGGITTLF
MALVAVVQNDIKRVVAFSTLSQLGYMTVALGASAYSAAIFHLMTHAFFKA
LLFLGAGSVIIAMHHEQDMRKMGGLRRYMPITFATMFIAALASAGVPGFS
GFFSKDAIIEAVHFSQLPGSGFAYFCVLTTVFVTALYTFRLMFMTFYGEP
RMDKHTREHLHESPWVVTVPLVVLAVPTVASGWLIGTLLFGDYFARVIEI
QEQHDVVALMKEHYTGIVGMMIHSLMTLPFWLAMAGIFTAWFLYQYKVEL
PGKIRKIAGPVYTLLDRKYYIDEFYSWLFAGGLRSLSNVLWKYGDIKVID
GLMVNGSAKAVAWFSTVVRRFQSGYIYHYAFSMIVGVFVLMSLWLFHF
>NE1765 nuoM, possible nuoM; transmembrane NADH dehydrogenase I (chain M) oxidoreductase protein
MLFGFPLLSLIIWLPIIFGLIVLVMGDRNVRAVRWFSLVGAVAGFLVALP
LYSGFDPSTSAMQFTEHLVWFERLNVFYSLGVDGISMPLIILNCFITPIV
VIAGWEVIRERVSQYMGAFLIMSGVVNGVFSSLDAVLFYSFWETSLIPMF
IIIGVWGGPNRIYAAIKFFLYTLLGSLLMLVALIYLYQTSGGSFSILDYH
HLPLPMTAQILIFIAFMLAFAVKVPMWPVHTWLPDAHVEAPTGGSVVLAA
ILLKLGGYGFLRFSLPVAPDASHELSGILIVLSLIAVVYIGIVALMQQDM
KKLIAYSSVSHMGFVTLGFFLFNAYGIEGAMVQMISHGFISGAMFLCVGV
LYDRLHSRQIADYGGVVNRMPVFAALFMLFAMANAGLPGTSGFVGEFMVI
MGSIEVNFWYGFFAALTLILGASYTLWMYKKVIFGEVTNSNVEGMQDISS
REFLILAILAVMVLGLGIYPLPLTEVMHVTVDNLLEHVARTKL
>NE1764 nuoN, possible nuoN; transmembrane NADH dehydrogenase I (chain N) oxidoreductase protein
MDFLLPDFTPAYPEIFLLLMVCVVMLADLFAGERNRYLAFYLSLLTLAGC
ALVTCGIYSTEVRYTFTGMFVGDAMSDILKLLIYVTVAAVLIYSRSYIST
RGLLKGEFFSLALFATLGMMVMVSANHLITLYLGLELLSLSLYAMVALQR
ESAIATEAAIKFFVLGALASGFLLYGMSMLYGATGTLHLPELAKVIHSGQ
ADHEIFIIGLVFVVAGIGFKLSAVPFHMWAPDIYEGAPTAVTLFIGSAPK
FAAFGFVMRLLVGGLGDLVTDWQGMLVLLAVASMAVGNIAAIAQQNIKRM
LAYSTISHMGFVLLGFIAAGENGYSSSMFYVIAYVLMTLGAFGIIMLVSR
EGFEADKISDLKGLNQRNPWLAFMMLLVMFSMAGIPPMIGFYAKLSVLQA
VLEAGYIWLVVVAVMLSLIGAFYYLRIIKFMYFDAPEQTQPIMFKPDVKV
LVSINGLAIILLGMFPQMLMGLSLSAIQHSM
>NE0760 nusA, S1 RNA binding domain:KH domain:Type 1 KH domain
MSREILLLVNALAHEKNVDKNIVFTALEMALASAAKKHTRDDIDVRVSID
RDTGGFHCFRRWLIVADDPVEHPERQITVSEAVGRNPQAVPGEYIEDLME
DVTFDRIGAQAAKQVIFQKIRDAEREQNLNDFLDRGEYLVYGVIKRMDRG
NAIIEFGKIEAVLPRDQMISHENLRMGDRVRAYLLRIDRSVRGPQLVLSR
TSPAFLAKLFELEVPEIEEGILEIKAAARDPGSRAKIAVKSNDSRVDPIG
TCVGMRGSRVQAVTSELAGERVDIIQWSDDQATFVVNALAPAVISSIVVD
EEKHCMDIVVDEENLAQAIGRGGQNVRLASELTGWQLNIMTEEESQEKTE
EEAASICELFMEKLDVDEDVGRILVEEGFSTLEEVAYIPIEEMLEIQEFD
RETIEELRNRARNALLTDAIATEEKIGHASEALLALEGMDIELVRELAEK
GISTQEELADLAADDLVELTGIDFERASKIIIKAREPWFA
>NE2558 nusB, Antitermination protein NusB
MMSTETVPPVKRVQGHKGYKNRRRLSRELALQGIYQWQVTGGNARDIGMQ
LQQVSFFSKADGPYFSDLLQGVLEHAADLQTQIQPHLDRQIAELSPVECS
ILLIGTYEMVYRPEIPCRAIINEAIELAKSYGGTDGHKYVNGVLDKLATR
LRAVELQHPPARNKDSG
>NE2051 nusG, nusG; Bacterial transcription antitermination
MSKKWYAVHTYSGFEKSVKRALKDRIAQHKLEDKFGEILIPVEEVIEIKG
GQKSISERKFFPSYILIEMEMTDETWHLVKGTPKVTGFVGGTSTQPVPIS
HKEIESIFNQVREGVEKPKPKVIFEAGEAVRIKEGPFTDFHGNVEEVNYD
KSKLQVSVSIFGRPTPVELDFHQVDKV
>NE2374 odhA, Transketolase:Dehydrogenase, E1 component
MTKQLLEDSKLFGNNAAFIEMLYERYLEDPFAVSDQWRQYFDSLQPTETI
SVRDIPHTPVVESLVRSATLHKPPLSGTPSKQAIEQPSDTETQERKQVAV
LQLINAYRFLGVRRANLDPLDLQQKQDILELDPGFHGLTDTDMDKVFNTG
SLVGPEHATLQEILHRLQQTYCGSIGAEYMYIADTKQKRWIQNRLESINA
QPGFTPEYKRHILERLTAAEGLEKYLHNRYVGQKRFSGEGNESLIPMLDK
LLQHAGISGVQEIVMGMAHRGRLNVLVNTLGKMPSELFQEFEGKHPQALT
SGDVKYHQGFSSAVMTSGGIMRLALAFNPSHLEIVNPVVEGSVRARQHRF
GDKNGDHVIPVLIHGDAAFAGQGVVMETLNLSQTRGYGTGGTIHIIINNQ
IGFTTSDPRDSRSTLYCTDVVKMIDAPVFHVNGDDPEAVVLATEIAFDFR
MQFHRDVVIDLVCFRKQGHNEQDEPMVTQPSMYRVIHQHPGTRKLYADGL
IRQGVVDNADVEHMVQSYQDAMDEGRNPNTTICYDYKSPNVANWVPFQAS
EKWNQPVTTGVPVEDLKYLSERLTTIPATFKLHPRVEKIIQDRRKMGEGS
LPLDWGMAENLAYAALLKEGYPVRISGQDCGRGTFFHRHAVLHDQNREQD
QWEDGTYIPLRHITPRQPDFVVIDSILSEEAVLGFEYGYATAQPNELVIW
EAQFGDFANGAQVVIDQFIASGEAKWGRLCGLVLLLPHGYEGQGPEHSSA
RLERYLQLCAEYNIQVCVPSTPAQIFHLLRRQIIRPIRKPLIVMSPKSML
RHKEAVSSLEELANGRFQPILPETEAFEIEKVKRLIVCSGKIYYELTAYR
REHHITNMAIIRLEQLYPFPHEDFQAEINRYDHATEILWCQEEPGNQGAW
HRIQHYLLRHMRPDQILGYALRPSSASPSVGYLAMDRFRQKELIEAAFRD
TI
>NE0885 ogt, Methylated-DNA--protein-cysteinemethyltransferase
MNYYTFLESPVDRLLLTSDGEFLTGVYMEIEIQKLLPRMTDDWRQDAAPF
AEAIAQLNAYFAGELIQFDLPMKATGTPFQEAVWQSLSTIPYGETVSYKN
IAERLHLPKAARAVGMANGQNPISIIIPCHRVIGANGKLTGYGGGIHRKQ
WLLAHEDKQTSFA
>NE0615 omlA, possible outer membrane lipoprotein OmlA
MHAFFPRLLLLLLFLPLTHCTYLPSLPYKIDIQQGNVVTDEMVAKLKPGM
TRSQVRFTLGTPLVMDIFHGDRWDYIYRTAPGGRVAEEKKLTVFFQDDRL
SHIQGDFPQPPAFSESEPAQNFFSPEQTFTPAPDTDSNMNEEPDKKGTVN
FLKENQTNFYKDNQ
>NE0617 oprC, TonB-dependent receptor protein
MKYRIKPVTAAVAVALSAFAVQQTQAQTTLGEVVVKTSKIRDAVTPEEIG
SAEIAAQHAITRDTASLLRDVPGVHLYGAGGVSSLPVIHGLADDRLRITV
DGMDFISSCANHMNPPLSYIDPTNVAKIRVYAGVTPVSVGGDSIGGTIIA
ESALPQFAAPGQRSLIKGEIGTFFRSNNNAIGGNLSATLANEFFSINYSG
AITTADNYKAGGNFKTTTATGRPGHTLPLDEVGSTAYKSLTHTVGVALRA
DNHLLEAKLGYQNVPYQFYPNQRMDMLDNEQKRVNVRYLGQYDWGTVDGR
VYYEAVDHYMNFSDDKQLVYGTALNGMPMYSRGRTLGFNLKGDIDLTSND
LLRVGMLYQHYTLEDWWPPSGTGAMSPLTFENINDGKRDRIGVFAEWERR
FGSQWTTLLGARYEHLETDANNVHGYGNMMGNQIPDTNAFNARGHKRTDD
NVDVTALARYTPGDMIDLELGLARKVRTPNLYERYAWSTWSMAAIMNNFV
GDGNGYVGNMDLKPEKAYTASLTIDLHAVDREWEVKATPYFTYVNDYVDA
IRCSSGPNCTVTNATTGNQFVILQYANQSAHLYGLDLSGRTVLAKTSLGE
FGLRGLMNYTRGENKKTGDELYNMMPLNGKVTFTHQYSGWSSAVELVGVA
RKSSVSSVRNEIHTGSYFLTHLRTSYTWKNMRLDLGVENLFDRLYALPLG
GAYTGQGMTMGINGIPYGIAVPGMGRSVYAGLNIKF
>NE0170 oprM, Outer membrane efflux protein
MSRYLKFSDQLFAMIASIFTHPAEVSDSRYRASLPTKPEHRQSLLLLLCS
SLLVTSCAMGPDYSRPQVDVAQNYRLTPTEGRSIANLPWWELLKDKKLQL
LIDQALQENKDLKQTTASVEELQARLRISNMDFIPDVNIEGNAPALGTMG
GFSRPGFPTPYNYFGQTILNWELDIWGRLRRANEAARADLLAQEENRRAV
VLTLVSSVAQSYFDLLQFDMQLDIAHHALSSWDESIAISRAQLQGGLISR
LDLDQFEAERARAAAQVAELERQIIQKENELSVLTGKNPTSVLRKQSLNE
QLIPPEIPAGLPSELLQRRPDILQAEQTLKAATARIGAAKAARFPRISLT
GFLGLSSPALSGLLDSGSEFGVGGFGLAAPLLNSQSLGFEQRAAEAKARQ
ALAKYEQTILVAFKEVEDSLAAIRTANEQYKAQQAQVEALASALQTAELR
YQGGITSYVDVLLAKRTLFDAESALTATRRLHLISIVQLYKALGGGWLAE
GPNAPLPVSTGPLHNG
>NE1115 oprN, Outer membrane efflux protein
MIDKIGWFLIAVILSGCAVGPDYRPPLPGIPDSWQAEKEEIVEQQSLSVQ
PIDQQVLKEWWKNFNDPRLDSLIDQALSDNFDLKIALSRIDQARAERSAI
RAGLFPGVNATANAQRSHNPFPGFAPGIKFNLFELGFDALWEIDLFGRLR
RQLEAASADLDAVNIQYSQALVTLTADLARSYIEYRSFQNQLRITRANLA
AQQTTQELTEKLFKAGVGTRYDAVRARAQTETTRAQIPDLEGSLVAALRQ
LEMLTGQQPGALAAALSEPDAVPMAPGRTILASPVATIRHRPDLHIAERR
LAAATALQGAAIAELFPRLSLSAFLGLRNTELDALFKSAAFSYNTAAGLL
QPLLNFGRIRAGIDLANARQQETYLTFEKTVLEALQETETALTRYLKEEI
RRQALARSVMDLREAVRLSQLRFQVGTVSFLDVLDAQRTLYSAEIDLARS
ETKVSTDLITVYKTLGGGVEIISEKQ
>NE0132 osmY, putative osmotically inducible protein Y
MKIKTLLSIITAMALLILGTEVATAQSTGVPTIDSGSPESNQPLNDTMIT
TKVKAELAIAEGIKSGDISVETVNGVVILTGTQPNEMLIKKAEEVAKSVK
DVKQVDISGLTVNTTTAD
>NE0219 pal, Bacterial outer membrane protein
MRKIFAVLVICLLSACASDNAQQTSDVEDHSYSREGEGTRGGAGSDGFGM
NPLQDPSNILSRRSVYFDFDSYTVKSEYRDLVLAHAAYLRDNTNAQVLLQ
GNTDERGSREYNLALGQRRANSVKDILLLSGARDQQVEAVSLGEEKPRAL
GSDESSWAENRRTDILYQGEY
>NE0927 pan1, Multicopper oxidase type 1
MYLIYTKRTVFMKNSISLFSSYRFTHIILMLIVLALIPLTSQAEKREFDL
SIEDTRIVLVGKRDFHTFAFNGQVPAPLIHVMEGDDVTVNVTNMTTLPHT
IHWHGMLQRGTWQSDGVPHATQHAIEPGDTFTYKFKAEPAGTMWYHCHVN
VNEHVTMRGMWGPLIVEPKNPLPIEKTVTKDYILMLSDWVSSWANKPGEG
GIPGDVFDYYTINAKSFPETQPIRVKKGDVIRLRLIGAGDHVHAIHTHGH
ISQIAFKDGFPLDKPIKGDTVLIGPGERYDVILNMDNPGLWMIHDHVDTH
TTNGDKPDGGIMTTIEYEEVGIDHPFYVWKDKKFVPDFYYEESLKKDLGM
HNSKVFKGEPIEE
>NE0072 panB, Ketopantoate hydroxymethyltransferase
MDQSAAKRMTITTLQNACEQGEKIAVLTCYDATFAAVLEEAGVDILLVGD
SLGNVVQGKSSTLPVTLDEMIYHVRCVERGTHRVFIMADMPFGTFQVSPQ
EAFGNAVRLMAAGAQMVKIEGGQHMAETVEFLSCRGIPVCAHIGLMPQFV
HQLGGYRVQGKTPNDARQLREDALLLQEAGAAMLLMELIPAVLGEEITRL
LSIPTIGIGAGAACSGQVLVLHDMLGISSGTLPRFVRNFMMDADSIQTAV
SNYVEAVKLGAFPAYEHTF
>NE0073 panC, Pantoate-beta-alanine ligase
MEIITDIAPLRARLRHEASVAFVPTMGNLHAGHLSLVRIAQKHASCSVVS
IFVNRLQFAPHEDFDRYPRTWSDDCRLLEEQGADIVFMPDEKTLYPVPQE
FQLLLPPVADTLEGACRPGFFRGVTTVVLKLFNIVQPHIAVFGEKDYQQL
QVVHRMVDQLNLPVEIIAGETVRDEDGLALSSRNNYLDATQRQEAGELAH
HLKQIRDSIASGERDFPLLEQLAAEKLSKRGWVVDYVAVRQQHTLLPVAA
SDSSLVILGAAWLNQTRLIDNFLLTLP
>NE2474 parA2, ParA family ATPase
MTRIFAIANQKGGVGKTTTSINLASSLASIDKRVLLVDLDPQGNTTMGSG
VDKRLLDRTVYQILLGEQVAAEVRLSTKPGKYDLLPANQELAGAEVEMVA
LEQRESRLKKALQVIQADYDFILIDCPPALNLLTLNGLCAAHAVIIPMQC
EYYALEGLSDLVNTIKRVRTGFNPAIRIEGLLRTMFDPRNLLAQQVSDQL
KQHFGDKVYRTIIPRNIRLAEAPGFGLPVLYHDKQSRGAQAYLELANEIL
ATAN
>NE0037 pcm, possible pcm; protein-L-isoaspartate o-methyltransferase
MVNLEQTRFNMVEQQIRTWNVLNQDILDLLYQVKREEFVPAAYRFMAFVD
MEIPLEHGAVMLTPKMEARILQELHIRKTDKILEVGTGTGYMTALLSKLG
THVFSVEIVPELHTMAHINLQTHDITNVTLELGDAARGWPGHGPYDVIVL
TASTPVLPEAFQQNLAPGGRLFAIIGEEPVMEATLITCTSPGNYSSTCLF
ETCTAPFRNALQRESFHF
>NE0069 pcnB, Poly A polymerase family
MIRKFLHRIFSFGSTESVVADPAFRIISCEQHGIPRNRISSGSLKVALAL
QQAGYSAYVVGGAVRDLLLGLKPKDYDVATNATPEEVRAVFRHSRIIGRR
FRLVHVISKGEIVEVSTFRGKMADDLVETEDVQANTDATGRLLHDNVFGS
QEEDVRRRDFTINALFYDPATEEILDYLDGFEDVMAKRLRIIGEPEQRYR
EDPVRMLRAVRLSAKLGIQIDDRTAAPIGDLAPLLLNVPPARLFDEMLKL
LFSGHALAAVIDLRARGLHHGLLPMLDVILEQPLGERFISLVLKNTDERV
RQDKPVSPGFLFAALLWHEVLAVWDTHLKASEKMIPALHRAMGDVLATQR
SRLAIPRRHDGIIYDIWSMQPRFQARSGRKPFRLLEHPHFPAAYDFMLLR
CKSGEVDQELGQWWQIFKNTDNAAREAMLLQEAPAKRRRKRRRSSKPAGE
KIESANQEIS
>NE0279 pcoA, Multicopper oxidase type 1
MNDPISRRAFLKRSGALGTFAALDWMLPIYARSDSLSVVTPTTQLSGDVI
NLTIARTPFRIDDHIATATTINGTVPGPLLHLREGQDITLNVTNHLDETS
SIHWHGILLPPEMDGVPGISFPGIEPGATFSYRFTIRQYGTYWFHSHSGG
QEQAGVYAPMIIDPIEPDPVKYDREYVIMLSDWSFSSVDSMIEKLKKQPG
YFNFQKRTLGDFIQDAMRDGWRPALDDYLMWARMRMDPTDLADVTGHAYT
YLMNGLTPAANWTGLFHPGERIRLRFIQVGVMTFQDIRIPGLGMTVVQAD
GQNVQPVEVDEFRIGPAETYDVIVEPKEERAYTIFAETLDRSGFARGTLA
PRPGMAAEVPSRRPRSLRTMADMGMQHMSGVDHGNHAPGNATMGEPEEGT
QHDTPDRHGEHAKEGIHGRHDRHAQPGFPDSRPVRHGTDDHGSGNQFIPD
FTQSRMDEPGIGLGNDGRRVLVYTDLKSLKPYPDQREPERELEIHLTGHM
ERFIWSFDGEKYSDAKESILFRHGERLRWIFVNDTMMEHTMHLHGTWMYL
ENGAGAYLPRKHTVLVKPAERVSVAITADAPGPWAFHCHLLLHMETGMFR
VVEILDAAPGTES
>NE0280 pcoB, possible copper resistance protein B precursor
MKSFPQFAAGMVVILSPGVDMPVANAETLLDKEKQILPVPHPRGKYHHPT
HSDQNYAFLRADVLEYRPGRDDSDFRWDIQGWYGGDFNRLWFKTEGERNT
AFKAEHDIDMQLLYGRFIGKYYDVQIGVRGETQTVRGKDVARAHAVIGFQ
GLAPYRYEVESALFISQQGDVSVRFQTSRDFLLTQRLILQGRFETHAAVQ
KVEKFTTGQGLNNIELGLRLRYEIRREIAPYIGISYDRSFFRTADLVREE
GGDSSQVRFVAGVQLWL
>NE0883 pdxA, 4-hydroxythreonine-4-phosphate dehydrogenase
MTLSSIARLVLTAGEPAGIGPDLCVQIAQRNLPCRLTVIADRDVLDQRAE
KLRLPLQLHEDDPDNVQPHRAGQLSVQHVPVNNPVVAGQLDVNNAAYVLT
TLRLAVEACTTHEADAMVTAPVHKGIISESGIPFSGHTEYLSELTNSHAV
MMLAGGGMRVTLATTHLPLKDVPAAITPERLEQKLRIIHRDLVTRFAIEN
PRIAVAGLNPHAGESGHLGREEIDVIIPVLEKLRNDGMNLLGPLPADTLF
NPPKLKEYDCIFVMYHDQGLPVLKHASFGHGVNISLGMPIIRTSVDHGTA
LELAGTGQADPGSMNTAIETALMLIEKTHRARNDLS
>NE2322 pdxJ, Pyridoxal phosphate biosynthetic protein PdxJ
MIALGVNIDHVATVRQARGTTYPSPVEAALVAEVAGADAITLHLREDRRH
IQENDVVILRDRLKTRMNLESAVTEEMIAFACRIKPHDICLVPERREELT
TEGGLDVIRHFQQVSVACKRLAEAGIRVSLFVDAQAGQIDAAVEAGAPVI
ELHTGRYADAATPEMQQVELETIRTMAAYAFGRGLQVNAGHGLHYENTVQ
IAAIPELSELNIGHAIVARALFVGFAEAVREMKALLQQARA
>NE0626 pepN, Aminopeptidase N, APN (CD13)
MSDISGTPITPVVAVTRLADYRVPEFLIEKTHLTFELHEGYTDVSSRLTI
TRNPQSTKADHLVLHGEQLELLSLAIDEQPLDAGGYQITPHNLTIRNVPE
QFVLHCKTRIYPEKNTALEGLYRSSGMYCTQCEAEGFRKITYYLDRPDVM
SEFETVIIAREGTFSTILSNGNCLSDTVRDGKRIVHWHDPFKKPSYLFAL
VAGNLVCLEDSFITQSGRQILLQIYTEAKDSDKCHFAMQSLQKAMHWDEQ
VYGREYDLDRFMIVAVDDFNMGAMENKGLNIFNTSCVLAHPASTTDTAFQ
RVERVIAHEYFHNWSGNRVTCRDWFQLSLKEGLTVYRDSEFSADMNSRAV
KRIEDVSLIRGVQFVEDAGPMAHPVRPDSYMEISNFYTVTIYEKGAEVVR
MLANILGPEQYRQATDLYFSHFDGQAVTCDDFVQCMEEVSGRDLCQFRLW
YSQAGTPQLHISDSYDADREQYSLTVRQHTPATPGQTEKEALHIPLKVAL
YGSKGALPLVIDGLARGTETVLDICEAHQTIVFEQVAENPVPSLLRGFSA
PVRVFYDYQLTQLRQLILLDSDGFCRWDAMQSLMRIVLSKAIDGEDNLAE
QTVLIEVLKQLLADHAAMDAALLACMLDLPGEQYLASLYDKANPPAISWA
LTQLQYRLAMQLQTELLSCYQALQDNASGLSARAMAARSLRNRALAYLLH
IDDEYYRDLAQQQFIQADNMTDQFAALRGLVHAEHAGAAAQEALQAFYRQ
WQHDALVVNMWFQVQATRPWGNVLQEVQQLLNHEAFDACNPNKLRAVIAA
FANQNFAGFHDSSGKAYQFLADQIADIDQRNPQMAARLLAPLTHWHQFID
EHATLMRKALESLSQRELSKDVYEVVSKSLN
>NE2147 pepP, metallopeptidase family M24
MTPIQTFIDRRKHLLSKIQHGVAVIATSPERYRNRDTHYPYRFDSYFYYL
TGFREPEAVLVLVATGDASTSQQILFCRDKDIEREIWDGFRYGPEAAREV
FGFDAAYSIARLDELAGELLADQPVVFHAFGHDTSWDQRVVGWISRVREQ
VRKGVSAPAEIRDIRHLLDEMRLIKDENELAVMREAAGISAEAHKRAMQA
TRPGRYEYEIEAELLYEFRRQGAEAPAYTSIVAGGANACVLHYIQNDAQL
QAGDLLLIDAACELHGYAADITRTFPVNGRFSAVQKDVYQLVLSAQSAAI
DAVRPGSNWDSPHQAALRVLVQGFIDLNLCQGSPDAVIETESYKRFYMHR
TGHWLGLDVHDAGEYKQTGQWRELVPGMTLTVEPGCYIRPAEDVPKHFWN
IGIRIEDDVAVTPAGHEVLTGAVPKSVAEIEEWMQCTTISER
>NE0811 petC, Cytochrome c1
MRMIKFLLLAILLAPFVAHSSGQEVKLDKAPIDRADKESLQRGAKGFVEY
CLTCHGANFMRFNRHHDIGMSEDDIRADLIHTGQKTGDLMEAAMRKKEAE
GWFGVVPPDLSVIARARGADWLYTYLRTFYQDTSTYSGWNNLIFDKVAMP
HVLHHLQGWQVLEPGTGNLVQTKPGTMTKEEYDRFVADLVNYMVYLGEPH
APYRRELGITVLLFLFGMLGLTYLLKKEYWRDIH
>NE2263 pgi, Phosphoglucose isomerase (PGI)
MSTLTRSPAWLALSAHQTEIANRPLRTLFDEDPQRFDKFSLQFKDLLLDY
SKQPVDATTIRLLLALAEQQKLVDWLGRMANGEKINFTESRAALHIALRA
NSPVFMDGVDVTQEVRRVLRQIEQFVQKIHNQSHRGYSGLPITDVVNIGI
GGSDLGPVMVTEALKPYHLPQLRTHFVSNLDGAHLSDTLQHINPATTLFV
IASKTFTTQETLANAHSARKWFLAGGGSKQDIAAHFVAVSTNREAVSQFG
IDPDNMFEFWDWVGGRYSLWSAIGLPIALAVGMEHFHSLLAGAQAMDDHF
LTAPFEQNLPVILGLIGIWQINFFHITSQAILPYDQSMHRFPAYLQQLEM
ESNGKRVTRNGEAVDYSTGIIIWGEPGTNGQHAFYQLLHQGTQKFTADFL
APCRCHHPLSEHHPMLLANFFAQTEALMRGKTEQEVRSELANQNLSADLV
EHLLPHRVFPGNHATTSILFKKLDPATLGMLIALYEHKVFVQSVIWEINP
FDQWGVELGKQLAGKILAELALDKPVTGHDASTNGLINFYRAQQ
>NE2560 pgpA, probable phosphatidyl-glycerophosphatase hydrolase transmembrane protein
MITSLPDNYSPQRQIPIKPDITFVHVRLAHFIAMGAGIGLIRYMPGTAGT
IAAFPFYRFLDSIIEPVSLLLIIDVFFIMGIWACAVTGRALNSPDHSGMV
WDEIVAFMLVLFFTPDHWTWQLAAFFLFRFFDIVKPAPIGYLDARLKGGL
GVMLDDLLAAFYTLLCLAGWKSMVLFESYF
>NE2127 pgsA, pgsA:CDP-diacylglycerol--glycerol-3-phosphatidyl transferase
MLMNLPNFLTWLRIMAIPLFVGIFYLPTDWMSPATQNLVATIVFAGAALT
DWLDGYLARVLNQTSAFGAFLDPVADKLMVAAALIVLVYLDRLDAPVALI
IIGREITISALREWMAQIGKSKSVAVSFLGKIKTAAQMVAIPLLLYHDRI
NDAFDSQTAGTWLIYIAAILTLWSMIYYLRAAIPHVIHHSQH
>NE2271 pgsA,ywtB, possible poly-gamma-glutamate synthesis protein [UI:99417512]
MISLIFCGDYAPCRRFEAIVLERGSAILGNAAIEIKTADFSFVNLECPLT
DHQVAINKSGPALRAGPQCASGIADFTVAGLANNHSLDYGVQGLIDTITA
CRSVGVSTVGAGINLAEAQKIHISKVKGKKLAVIAVAEHEFNQSENNGPG
SAPLDPVDNYYQIREAQAKADIVIVTIHGGNEHFHYPRPGLRKLCKHYID
LGVNAVICHHPHVPGAYEIYNGRPIVYSLGNFVFDTLSMVHEWDVGYMAK
LKFNEVDCTFEAIEIIPYRQSITVEGVELLRGDERDKAVSKIEALRNAVQ
ENEVWLNEWNSFVKQRTHNYLLRQFFPFIFPGAGRLARNIPIIKLFFNRK
NSLAKLNLIRCQSHREVLISVIQAESPRREL
>NE1903 phaD, phaD; PH adaptation potassium efflux system D transmembrane protein
MNLSPEHLVIAPVLIPFIAGVLLLFLDDRARHVKAFVSVLSTIVLLVISI
LLLCIVHTDNSANEDRIAVYLLGNWPPPFAINLVLDRLSSMMLVLTAMLA
LPSLVFSLGRWHKAGVHFHTLFLLLLMGLNGAFLTGDLFNLFVFFEVMLA
ASYGLVLHGSGLLRVKAGLHYIAINLVASLLFLIGVALIYSVTGTLNMAD
LAIRIPQINEANRTMLESGAAILSVAFLIKAGMWPLNFWLPTTYAAAPAP
SAAIFAIMSKVGIYVLLRLSLLFFGSDPGFPESLGAQILLYGGLATISFG
TVGVLASQTPGRFAGFSVLISSGTLLASIGLVQTEVTAGALFYLASSTLA
ISALFMLIELVERGQNMAASVLAVTMEAFGYDDEETENEEIIGVAIPRTI
AVLGTCFAACTILLAGLPPLSGFIAKFVILNALFSPGESGQMHFIPATLW
WFLALLIFSGLATLIAMSRSGIRTFWAPPVDTALPRVLLIEIVPVAALLC
MTLVLTIQAGPVMRFMEATANSLHAPRPYIEGVMKTQSAVLPQPMETTE
>NE0335 pheA, Prephenate dehydratase (PDT):Chorismate mutase:ACT domain
MTEQLKQLRNEIDAVDDELLRLISTRARLAQQVGRQKSGTAYRPERESQI
FSRLQQSNPGPLRDEHIVHLFTEIISVCRALEEPLTVAYLGPQGTFSEEA
VTKRFGSAVTSLPCSSIDDIFRKVESGAASYGVVPVENSTEGAVGRTMDL
LLQTPLKICGELELPVHQCLMAQQTDLTAIRKIYSHPQSFAQCQVWINEN
LQSLTAVDRIDAASNADAARQAAADSSAAAIAGKKAAEVFGLKVCAANIE
DNPNNTTRFLVIGSRDVARSGRDKTSLAMATHNRPGAVHELLAPFARYQV
SMTRLESRPSRASLWEYVFFTDIAGHQEDENVARALQALRDNTTFLKIFG
SYPAA
>NE0954 pheS, pheS; phenylalanyl-tRNA synthetase, alpha-subunit
MNHSIAMGNLEDLVNAAIKLFDNAESIVDLEQIKAQYLGKTGEITILLKG
LRELTPEERPVMGERINQAKKLLEDALIERRNLIQEKNMSARLAEESLDV
SLPGRGLGMGGVHPVTRTLIRIESLFHSIGFGVATGPEIETDFYNFTALN
IAENHPARAMHDTFYVDGGKLLRTHTSPVQIHYMQNHRPPIKIIAPGRVY
RCDSDVTHTPMFHQVEGLWIDENVSFSALKGVLVEFMRNFFEKDNLSVRF
RPSFFPFTEPSAEMDIACVMCNGKGCRVCGETGWLEVLGCGMVHPNVMNH
VGLDSEEHIGFAFGLGVERLAMLRYGVNDLRLFFENDLRFLKQFN
>NE0953 pheT, pheT; phenylalanyl-tRNA synthetase beta chain protein
MKFSGNWLRKLVDLQYSDEELAHKLTMAGLEVESVAPVAPFFDKVVVAQV
VSVQKHANADRLNVCKVDVGTQSDGFLQIVCGAQNVKEGMKTVCALVGAR
LPELDIRQGKIRGVESLGMLCSAKELGLASDADGLLELPGDTPVGVDFRK
YYSLDDCIFTLKLTPNRADCLGMFGIAREVAAITASELDLPEIVFVNPVI
EDVLQIRVEEPESCPLYCGRVIKGVATDTAIPLWMSQRLERAGLRMINPV
VDIINYAMLETGQPMHAFDLNQINQEICVRFAGENEHLLLLNGERLTLQK
DMLVIADSSKPLALAGIMGGSESGVTNTTIDVFLESAFFSPVVISGKSFK
LGFTSDSAHRFERGVDFSMTRSVLERATALILEICGGKAGPVTEIGNDLP
RREAVRVRQKRIAKILGVDFSVELISEYFQRLQFSYTIAGEMFYVVPPAA
RFDLVIEEDFIEEIARVHGYDLIPARLPKAPVHMLAEPETGVSPVKRLRQ
ILTAKDYQEVINYTFVDTQWEADFSGNDIPIRLKNPIASHMDVMRSSLFG
GLIDNLQFNLNRKQSRVRIFELGSCFSKEGDEEKEVENLAALCSGSAYPE
QWGVPDRDIDFYDVKMDIESLFWPRSVYFELALHPALHPGKSARILIDKK
PVGWIGELHPRWQSKYHLSQSAILFELRTEALAAELLPAMYPLSKFPPVR
RDIAVVVESDVSVASLLETMHLEKDRSISEISLFDLYSGEKLVQGKKSLA
FRILLQDSEKTLTDQEIDRAVSQLVGILERKFGATLRS
>NE0971 phnB, conserved hypothetical protein
MTTPYVQPYLFFGGRCEEALEFYRTTLGAQVDMLMRHKESPESPPPGMLA
PGFENKIMHASFHIGATTLMASDGCGEDSHFDGFSLSLTVPTEAEADRTF
AALAEGGQVRMPLTKTFWSSRFGMLTDRFGISWMITVQE
>NE2131 phoB, possible phoB; Response regulators consisting of a CheY-like receiver domain and a HTH DNA-binding
MSATVLVVEDESAIQELIAYNLRNAGYTAVCTNSAEQAAAMVNEVLPDLV
LLDWMLPGKSGIELARSLRRDPRTKPIPVIMLTARVDERDKVIGLETGAD
DYVTKPFSPRELIARIKAVLRRSLPEASDEILDIGGVQINPMTHRVTVTD
LNDGTHEKELLLSPTEYRLLYFLMAHAERVHTRAQLLDRVWGDHVFVEDR
TVDVHIRRLRKALDVAGKADLVQTVRGAGYRFSTRGESEL
>NE1288 phoR, Sensory transduction histidine kinases
MNSFWLSLAPVLWLTIIAIPVGILFNAAIVFGLYSIILLGIVIYHIHTLY
CLEQWLHHSDLSYATVPDRSGLWGVVFARLARFVRRHDKERLGLNTSLER
LQRATSALPEGIVILGDKDQIEWCNSAAERHLGLNLEMDAGQHITRMVRQ
VRFVAYLDMGDFSKPFLLNQSRLRDVLLSIQIVPYGDREKLLISQDVTRF
EKIEIMRRDFVANVSHELRTPLTVIGGFLETLLEENKISGEMEKNALRLM
SDQAIRMRRLVEDLLTLSRLENAGNNLTEEQVDVVKLLQGLYQEAQSLSA
GRHVISLNLASHAMLLGSEDELRSAFENLISNAVRYTPESGIISLNWVVE
NGKGLFYVQDSGIGIEPNHIPRLTERFYRVDRSRSRETGGTGLGLAIVKH
VLNRHQASMEITSQPGAGSIFRIWFPSNRVLPAETAPVISSSPEHTKQNS
AA
>NE1744 phoU, Protein of unknown function DUF65
MVNKEHFSKKFDADLEEIRTRVLQMGGLVEEQIKNAIEALTEGNEHMIEQ
VIDNDHRVNAMEVKIDELCSQIIVRRQPTAIDLRMIMTVINTITDLERIG
DEAAKIARMAKLIYAADRMYTPRFVEIRHIAAIAMDMLRKALDAFARVDP
NASAEIVRQDEQVDEEFRSIIRHLVTFMMEDPRKISTAIEILFAAKAVER
IGDHAKNMSEYVVYMVKGKDVRHVSADEIEREVGK
>NE1415 pilA, Signal recognition particle GTPase, FtsY protein
MIFTFLKQKQPTRMFSIFKSKKNPDEKNPAQVETRPVSFTDKIRQGLTRT
RQQLGKQLSGLFGGRKIDEDLYEELETALLTADTGIEATNRLLESLRTRV
RRDALDDSEQLKTVLQDVLTELLQPLERPLVTTGHTPFVIMIAGVNGVGK
TTTIGKLAHYFQSQGHSVLLAAGDTFRAAAQEQLKVWGERNQITVISQES
DPGKKSDPAAVIFDAINAAIARKIDIVLADTAGRLTTQLHLMEEIRKIKR
VIAKAIPDAPHEVLLVLDANTGQNAVSQLKAFNDALGVTGLVMTKLDGTA
RGGVIAAIAAQFTDTPPALRFIGVGEGLDDLRPFNAREFAEALFD
>NE0596 pilC, Bacterial type II secretion system protein
MATVAASEKKIKEFNFLWEGKDKAGKAVKGEMRAAGTVVVTSTLRRQGIR
VTKITKARAGGKITDKDITLFTRQLATMMKSGVPLLQAFDIVGKGHSNRA
VGKLLMDIKSDVETGNSLANAFRKHPLYFDSLFCNLVAAGEAAGILDSLL
DRLATYKEKIQAIKGKIKSALFYPISIIVVAFVITAVIMIFVIPAFKELF
QGFGADLPAPTLMVMAISDFFVDYWWAIFGCVGGGLYAFFYTWKRSTVMQ
HVMDRLALRLPIFGEVIRKATIARWSRTLSTMFAAGVPLVEALNSVAGAA
GNQVYFEATRNIQNEVSTGSSLVDAMTATNVFPSMVLQMVSIGEESGSLD
AMLSKIADFFEAEVDDAVEALSSLMEPIIMVVLGTLIGGMVVAMYLPIFK
MGQVVG
>NE0597 pilD, Prepilin cysteine protease (C20), type IV
MNFLDTLQASSLFFVSFISIIGLLLGSFLNVVIYRLPEMLKREWQQQCAE
LRGEQIKALPVYNLAIPGSGCPQCGHKISILENIPIISYLFLRGQCSGCH
TRIPFSYPLVEALTAILSGLAAWHFGFGGLLFAVLVFVWAMIVLTFIDLN
TQLLPDSITQPLLWTGLIVNLNNGFTDVHSAVVGAVAGYLTLWSVYWLFR
LLTGKEGMGYGDFKLLAAIGAWLGWQLLPLVILFSSVVGAVVGTVLILSA
GHSKNMTLPFGPYLAGGGLIALFWGKQINQAYLAMF
>NE0146 pilF, TPR repeat
MNRLIRTGVLGFLVWLAGCGHAPVQEKPTPEELKQRALQSAKIHTELAGQ
YYHRGQYRVAIEEAEIALQKKTDYAPAYNMLGLVYMDLQEDDRAEWNFER
GLGITPNDPDIRNNFGWFLCQRKPDGIEQAIGHFMAAVRDPLYETPERTY
TNAGLCVLKQNDFERAQSYFQEALVIRPGYPLARLGLVELDFSRGEVKKA
WAAINRYLQTYPPAPGSLWLAVRIARANGDVNAETNYAFQLQKRFPDSRE
ARESRAGRLNHER
>NE1836 pilG, Response regulator receiver domain
MTETANDLAGIKVMVVDDSNTIRRSAEIFLAHSGCEVILATDGFDAMAKV
IDCQPDIIFLDIVMPRLDGYQACMLIKKNPRYQSVPVIMLSSKSGLFDRA
RGRMVGSDEYLTKPFTKEALLDTVRKHTSHHRVPA
>NE1007 pilH, Response regulator receiver domain
MTVRTILVVDDSPTDRHIISGILIRSGYQVGVAESGEEGVIKAREIWPDL
VLMDVVMPGINGYQATRILARDGATRHIPVILCTTKDQQTDKIWGLRQGA
RDYITKPVVAAELLQKIAGLN
>NE1006 pilI, CheW-like domain
MTINAMPERQSSLTEQINKKAQETSATIPVLGITIGEDRWLIPMSNISEI
LPVPKITPVFLTQPWFLGVINVRGNVYGLCDLSHYLDNTSTCISAKNRVF
LTIPRLGVSYAMLAGSVLGIRNLMEFVPQSDHEDKRPVVAGIYKDRQDRL
WRMLNLPALLQLESFQRAGS
>NE1251 pilJ, Bacterial chemotaxis sensory transducer
MTGYNNANNMTVLLPVTRYLQIFRQSWFSGVLSVLFLSIALIMLRIDTQQ
ARTHTASLEIVGRIHANIQHLTIVSPEALRGNEQAFVQLRNNLDQLNHYA
TLLQYGGEYQRETLPAIAELLPADLFKTFRNTLHIKENRARQILESREAL
VSLAEILKRVDLVNHGLQKKLQEFSVELTQTGHASNQAVAVETVKILVQF
ITGSVRSLIEGGFQVSSTADQQSVESEQITGMIQTLIKGRDWLYSTVQGN
QLSSGTLSQIRVQFNALEDLLRVAQRLTPEVTGAWHAIHETFSASDSLST
LVSQIEQAITEHNNNAGMLTTVLFYLTAVLTIISGLVFIHLFSTSLRKHI
HQGEQNVETTQKAILRLLDEMEIPAGGDLTARMSVTEDITGTIADSINLT
IEALQELVRKVNHASSQVMSASHQAESISSDLLDATREQAGKIEDATVAV
LGIAESLEAVSGSAEECADVARQTLHAAESGANAVQDAMAGMNEIRVYIQ
ETSKRIKRLGESSQEIGEIVTLISDITEQTNILALNASIQATAAGDAGKG
FSVIAQEIQRLAEHSAEATRQISTLIRNIRGDAQDTIIAMERSTAGVVEG
ARRANTTGSALEEIETVSKRLAQLVIRITEATHKQTRVSNKVAKSMEDIL
SITRQTSRGTQQNADSIKQITEHVTDLKSSVASFKV
>NE2316 pilM, putative type 4 fimbrial biogenesis protein PilM
MALSSSILKKPLDIKFDIDFSLLAGTARMLGVDISASSIKVVELSRKGRL
PHSYRLERYVIEPMPVDALQDGGINQIEQVSECLRRAVKRMGARQKKIVM
ALPLTSVITKKINVPAGLREDDLEFQVETEANQYVPFALDEVNLDFQVIG
PVPGHPEEVEVLLAAARKDKVEDRVATAVSAGLKVMVMDVEQFTAQAVSA
RAIRQQLPDDGKDKVIALIDIGASVTRVNVLLNGESVYMRDLSFGGDQLT
QEIQNQFNLSPEEAEIAKRNGTLPENYRNDVLQPFCETVALEVSRALQFF
FTSTQYIQVDYIFLSGGCAAIPGLEEIISERTKVITQIINPFADMELSGR
VAPRQLKMDAPVLLTACGLAMRGFDPS
>NE2315 pilN, putative type 4 fimbrial biogenesis protein PilN
MIRINLLPHRELKRKARQQQFAVLAGLTVILGMLIIWGVHEMVLDKIDYQ
NGRNQYLKDEIAVLDRQIAEIRSIREKIQEMLARKGIVESLQGNRAKVVH
MLDEVARRVPEGVYLKGFKQTNEHLRLAGYAQSNAWVSTLMRNLDASSWM
ESPLLIEIKAATANGIRLSEFDLNVRLTELPEEGSHTGGEEMTSKITAAA
GQP
>NE2314 pilO, putative fimbrial type-4 assembly membrane transmembrane protein
MNLLEELRHVNPNEPGSWPAVIKAGALALLLATLMVAGYFLDWEEQWTTL
EQVREEENTLRSTFLAKKTSAVNLDVLRQQLAEVQQTLGTLLKQLPNKSE
LDALLVDINRAGLGRGLQFELFKPDARETVREFYAELPVTIRVTGNYHDI
GAFASDVAQLPRIVTLHNIEITPGKDHQLIFNATAKTFRYLDDEEIAAHK
REMSEKDKQKNPKGAS
>NE2313 pilP, putative type 4 fimbrial biogenesis protein PilP
MSSRFSWWIAVMMLSLVGACSQEGYEDLESFVRESGAGLKGKVDPLPEIR
PLEHFVYQAFDIPDPFSNRKNKPGKPDRNELEPDLKRPKEALESFPLENL
VMVGSLRRGNNVFGLVRAPDNSVHRVRTGNYLGQNFGLITGVSEVEIKLR
EITRDSGDEWSERASVLMLQTQEQKQ
>NE2312 pilQ, probable pilQ; fimbrial type-4 assembly signal peptide protein
MNWITGNKEYENRVIGVIHRIWQCLPACILLLAWGYVQTVTASVNTLRAV
DVSSQTDGRTVVRVTLDKPVEVPPAGVLLNDPDRLYFDLEQMDSALGKSG
RISGRGVIKNIDVVPAEGRIRLVMNLSNTAAYETDVEGNHLLISLRSNEI
SGGTPARPSVSTSSPARSETMSLRDIDFHRGTQGEGRVEVELSQPGAVIN
VHSHGSRLLVEFMKAYLPPNLERRLNVLDFATPVQSVETVARDGNVQMII
EPKGRWEHTSWQTGTSFVVEIRPMAEAEDIPIHKKLIDGGYTGEKLSLNF
QDVEIRSILQVIADFTDLNIIASDSVKGNLTLRLRDVPWDQALDIVLQTN
DLDKRRAGNVIFIAPREEMAAREKLQLEQNQQITELEVLQTEAFKLNYRT
ASSIPLKGILSQRGSVEVDDISNTLTVTDIPTRLAEVARHVANLDTQVRQ
VMIETRIVEATDTFSRNLGARFGVQNATRIGSETNGRRLGISGNLGNSSG
LAGGTVSPSGGDNLNVNLPAAALAGVAGGPAALGLSLIKINNGTLINLEL
SALESDSKGRVIASPRLVTANRVEASIEQGTEIPFQTISVTRPQIQFKKA
VLGLKVTPQITPDDNIIMKLQVNQDTRGENTPAGPAIDTKQIVTEVLVEN
GGTVVIGGIFEQVEKQDRNQVPLLGDIPVLGNLFKNTAKRDDKRELLIFV
TPRILNENLSNNIRSALN
>NE0965 pilT, pilT; twitching mobility protein transport fimbria
MNIVELLSFVVKNNASDLHLSAGMPPMIRVHGEIRRINLPALEHKDVHDM
VYDIMNDSQRKHYEEHLECDFSFAIPDLARFRVNAFNTQRGAASVLRTIP
SRVLTLEELKAPKIFAEIAQQPRGVVLVTGPTGSGKSTTLAAMVNDINEN
QYGHILTLEDPIEFVHESKKCLINQREVGRDTHSFSNALRAALREDPDII
LVGEMRDLETIRLAMTAAETGHLVFGTLHTSSAAKTIDRIIDVFPAEEKE
MVRAMLSESLRAVISQALLKTKDGKGRVAAHEIMIGTPAIRNLIREGKVA
QMYSAIQTGQGVGMQTLDQNLTDLVKRGVISAVEARTKAMNKDNFRG
>NE0964 pilU, Bacterial type II secretion system protein E
MEKEQALKFMHDLLRLMLSKKASDLFITAGYPPAMKIDGKMTPVTQQLLS
SAHTAALARAIMNDKQAVEFESSKECNFAIHPEGIGRFRVNTFVQQQRTG
IVLRTITTKIPNFDDLGLPQVLKEVVMSKRGLVIFVGGTGSGKSTSMAAL
IGHRNQNSHGHIITIEDPVEFVHEHINCVVTQREVGVDTESWEAALKNTL
RQAPDVILIGEIRDRETMEHAIAFAETGHLCMGTLHANSANQALDRIINF
FPEERRAQLLMDLSLNMRALVAQRLIPKKSGSGRAAAIEVMLNSPLISDL
IFKGNVHEIKEIMKKSSELGMQTFDMALFELYENGTISYEDALRNADSMN
ELRLQIKLHGSEARKISAETNIEHLTIT
>NE1747 pilY1, putative type 4 fimbrial biogenesis protein PilY1
MKTIRTAMGIRLLMALFILVPIRTEAALSLSDVPLFLTTAVDPNIIFTLD
DSGSMQFEIMPDDLIPSTNSNSNTGDVRFVYPVREGTYGGSTYSNYVVGF
DANNGYTAKLRSSHVNTIYYDPTVRYLPWSKADGSLMIDAKITCAPHNPW
LIPNKNDPAKDCRNLTVNNSQSAGWLQSNNTLKTESRTFYPAVYFHYKGS
GNINSASSYDQIEIKPENAPFNGGENRTDCANSSACTYDEEIQNFANWYT
YYRSRVLLARAGVGRAFAAQGGNMRIGFGAINKGSSTIDGKSTATLITGV
RQFSGTDRTNFFDQLYKHTIPQQGTPLRNAIKDVGEYLKRSDDQGPWSDT
PGQTGGTQPVCRQNYHILMTDGYWDGTDSPGIGNSDGADGSVIINHFPNA
APATYQYKPVLPYSDNRSDTLADVAMHYWKNDLRSNLANKVPTNAKDPAF
WQHMVNFTVGLGVQGSLTSLPGSPGGPTGWPNPTSSDSAKIDDLWHAAVN
SRGEFFSAQDPATFAAGLTNALTNITARVSSAAAVTTNTSRLNTGAQIYQ
AKFNSADWSGQLLAFNLESDASLGELQWEASEQLPAHADRSIFTHNGTNG
LAFTVANFSSLSTAQQAALDKNDGQGMARIGWLRGDKSGEISQGGSFRNR
NNGVLGDIVNSDLLYIKSLDFSYDSLPAATPGQASYHSYRQSNNSRTPVV
YTGANDGMLHAFNAETGVELFAYVPAAVYSGLNRLTASSYTHRYFVDGNA
YVGDAYMGATPAWKTVLLGTLGGGGKSVFALDITDPVNFSEPDVLWEFTD
ADLGYVHGQAKIARLNDGSWAAIIGNGYNGNSDRAWLFIVDLQTGALIKK
IPTNASTSNGLSTPALVDTNGDKIIDVVYAGDLRGNVWKFDLSQSDSTQW
DVAYKSGSASVPLFTARNASNQAQPITSPLEVGYHKQGGYMIFFGTGQFV
AESDKTDKKVQSLYGIWDKGSPIAETNRSVLQQQTITAETVIPPFERTVS
DHSVDWSTKRGWYIDLLQSPEGTQQGERSVIMPMLNFGRIIFTTLIPSID
PCEAGGRNWLMLLDAETGGMMPKPQFDTNGDGKLNNNDRVIAAVGSDGIR
SESVAISAGSLTHLIAATTTGAVEKVTISNGAPPPRRSWKQLQ
>NE0612 plsC, Phospholipid and glycerol acyltransferase (from 'motifs_6.msf')
MAATGVNRLTRSIRFIRLILHIASGLLQSFALPYISTARQNRMACNWAQK
FLRILKVKLCPGGVLPVCGRQGVVFVANHISWLDIMVILAVYPVHFVAKA
EIGTWPILGQLCRNAGTLFIEREKRGDTLRINQQISSILKDGRSVVIFPE
GTTGDGDVLQHFHASLLQSAVTAETLLYPVAIRYRNRDGSRNSSVAYVSV
TILQSLMQILAEPEIQVELIFREPIPGAGRNRRELARLAEKAIAQTLSLN
VVRTVPETLSDLPAERM
>NE1645 plsX, Fatty acid synthesis plsX protein
MGGDHGPHITVPSAVEYMHHDCETNIILVGKPEVISRELDALKVSPGSRL
RLHPANEVVGMDEHPAVALRNKKDSSMRVALNLIKSGEAQACISAGNTGA
LLATSRFVLKTIPGIDRPALAVILPTISGHTYVLDLGANVNCTAEHLLQF
GIMGATLVSSVENKPNPSIGLLNVGEEDIKGNDVVKRAAELFRSSGLNFY
GNIEGDDIYRGTTNVVVCDGFVGNVALKTSEGLAQMLASYLREEFRRSLF
SRLAGLVALPVINAFRRRVDHRQYNGASLLGLRGVVIKSHGSADSYAFRC
AIRRAVEEVRGGTLRNIVEHIETLRNNIEQNIAQPVSIE
>NE2195 pmbA, Putative modulator of DNA gyrase
MDNSNHHPEQPQSFSYSVETLQQIADDVLTLAHKGGADACEMNVSEGSGQ
NVTVRQGEVETIEYTRDKGLSITVHIGHKRGNASSSDFSPQAIRETVSAA
LSIARYTADDIYAGLADQDLLATSFPDLDLYHPWSLPVEEAIELARQCEA
AALATDKRITNSEGASVSAGASHFVYANSLGFCAGYPLSRHSISCAVIAG
EQNNMQRDYWYSVARAAGDLEAIEEIGKKAGMRSLARLGASKIATCEVPV
LFESTIASSLIGYFVQAISGGSLYRKSSFLLDSIGRQVFPATIQISELPH
LQKGLASCAFDDEGVATHPRKVVENGVVQGYFLGSYTARKLGMRSTGNAG
GNHNLIVENNVSLSFDALLKKMNKGLLVTELLGHGVNLVTGDYSQGAAGF
WVENGEITHPVEEITIAGNLKNMLSGIVAVGNDVIVRGSRQCGSLLIERM
TIAGH
>NE0172 pnp, pnp; polyribonucleotide nucleotidyltransferase protein
MKPIKKSITYGRHTLTIETGEIAKQAHGAVIVSMDDTVVLVTAVGDKKTK
PGQDFFPLTVDYQEKFYSAGRIPGSFFKREGRPSEKETLTSRLIDRPIRP
LFPDGFYNEVQVVAMVLSSDTEIDADIPAMIGTSAALILSGIPFDGPVGA
ARVGYINNEYILNPTTTQLKESQLNLVVAGTQKAVLMVESEADELSEDVM
LGAVTYGHDQMQQVIDMINELADEAGVTAWDWQPAEQDLSLVEKVTQLAE
ADLRNAFRLKQKSARVEAINEIWQRVFTELKVGTEEGPSEQAVREIGFAL
EARIVRNSILDGESRIDGRGTRTVRPITIRHGVLPRTHGSALFTRGETQA
LAITTLGTARDEQKIDALQGDYSERFMLHYNMPPFATGETGRVGTPKRRE
IGHGRLAKRALLAVIPPVEEFGYSMRVVSEVTESNGSSSMASVCGGCLSL
MDAGVPLKAHVAGIAMGLIKEGNRFAVLTDILGDEDHLGDMDFKVAGTEH
GITALQMDIKIQGITKEIMQVALLQAKEGRLHILEIMKQSLPIARESISV
HAPRIIKFKINPEKIRDVIGKGGAVIRALTEETGTTIDISDDGSVTIACV
SSEGGEQARKRIEDITADVEVGRIYEGTVLKLLDFGAIVSVLPGKDGLLH
ISQIANERVENVADHLKEGQTVRVKVLEADEKGRLRLSMKAASISTEITA
DNPDNTEK
>NE0859 pntAa, Alanine dehydrogenase and pyridine nucleotide transhydrogenase
MHIGIPAETRAGETRVAATPETVKKLTAKGLHTVLVQAGAGAGASIPDSA
YQEAGATIISTASQLYEQSQIVLKVRGPEADELVLMKKDAVLVGLLTPHH
TEEIESVARHGLTAFAMEKLPRISRAQSMDVLSSQANIGGYKAVILAADI
YQKFFPMLMTAAGTVKAARVLVLGAGVAGLQAIATAKRLGAVIEAFDVRP
AVKEQVESLGAKFVEVPLTDEEKAKAETAGGYATEMSEDYKRRQGELIQE
RAAAADIIITTALIPGRPAPVLITEETVKAMKPGSVIVDMAVEAGGNCPL
SELGKTVVKYGVHLVGVANLPGLVAADASALYARNLLNFLNLILDAQTGE
LNVNREDEIIDGSLVCMAGEVISKS
>NE0860 pntAb2, probable transmembrane NAD(P) transhydrogenase (alpha subunit part 2)
MIGEIDPTIINLTVFVLAVFVGYHVVWNVTPALHTPLMSVTNAISSIILV
GAMLAAGSTEADWGAWLGAAAVILASINVFGGFLVTQRMLEMFKKKDKKG
QA
>NE0861 pntB, NAD(P) transhydrogenase beta subunit
MSANFIALAYLIAAVFFILSLKGLSSPLTARRGNLLGMLGMGIAVFTTLT
ITENWPLILGCIAVGAIIGSIVAQRIQMTAMPELIAFMHSLVGLAAVFIA
IAAVNNPTSFGLSEILPTGSKLELFLGTFIGAMTWSGSVIAFLKLSGRMS
GAPIIFSGQHMVNLLLAIIMIGFGLWFFFSETTNWTAFSAMTAIAFLLGF
LIIIPIGGADMPVVISMLNSYSGWAAAGIGFSLGNPMLIIAGSLVGSSGA
ILSYIMCKAMNRPFLSVILGGFGSDGGSQSTDDGIQKSYRSGSADDAAFL
MGNADSVIIIPGYGLAVARAQHAVKELAQALHKKGINVRFAIHPVAGRMP
GHMNVLLAEAEIPYEMVQEMDEINSDFPNTDVVLVLGANDVVNPAANVPG
SPIYGMPILEAHKARTVMVVKRSMAVGYAGLDNDLFYMDKTMMVFGDAKK
VVENMVKAL
>NE1468 polA, polA; DNA polymerase I protein
MKTLLLVDGSSYLYRAFHALPDLRNRLNEPTGAIYGVLNMLRRLHKEYRP
DYSACVFDAKGKTFRDDIYPQYKAHRPPMPEDLVCQIGPLYACIRAMGWP
LLIEEGVEADDVIGTLVERAIARQAQCVIATGDKDIAQLVRPGIWLVNTM
NNESLDESGILQKFGVTPAQIIDFLALVGDSVDNIPGVEKVGPKTAVKWL
DQYGTLDDLIAHADEIKGVVGENLRKALDWLKVSRKLLTIKCDVPLAMDW
QDLVAVPPDTARLTELYEHLEFRSWLRELKQPGPEKNEKAESSVMAAIVD
DPSVPEGENDDGRDYQIILTDAQLGDWLAQCESAELVSIDTETTSLNPME
AKLVGLSFCMELGQAAYIPLAHHYPGVPSQLNREQVLQRLKPWLESDEKL
KIGQNLKYDRHVFANHGVMLNGIVHDTLLQSYVLESHLSHDLDSLASRHL
GIQTISYDEVTGKGAKRIGFEQVEIHRAGIYAAEDADIPLRLHRVLYPVI
SQDAHLEYIYQQIEIPLLEVLFRIERNGVLLDTDLLRVQSGELTQQLVAL
EQQAHSLAGHAFNLNSTKQIQEILFGQHKLPVIKKTPKGVPSTDEEVLQR
LASDYPLPKVLLDYRGLAKLKSTYIDKLPQMVNKQTGRVHTHYAQAVAVT
GRLASNDPNLQNIPVRTPEGRRIREAFIAPDGWLIMSADYSQIELRIMAH
ISGDAGLIHAFSEGQDIHRATAAEVFGVPVEQVNPEQRRYAKVINFGLIY
GMSEFGLATQLGIERTAARTFIDRYFARYPGVADYMQRTRELAKQHGYVE
TVLGRRLQLSDIRSNQRNRQMGAERAAINAPMQGTAADIIKLAMISVHRW
LAEAQLQSKLIMQVHDELVLEVLVDELPVIKENLPRLMENVLKLDVPLKV
QTGIGKNWDQAH
>NE1870 potA, potA; putative spermidine/putrescine transport system ATP-binding protein
MALLELRDVTRRFGDFTAVDCVNLSIEAGELFTLLGPSGCGKTTLLRMIA
GFDVPDSGQILLDGQDIANTPPEKRPIHTVFQSYALFPHMTVADNVAFPL
KMSGKTPAEIKKRVEKALEEVQLSRFTHRFPHELSGGQKQRVAFARGLIN
RPRLLLMDEPLGALDAKLREDMQRELISLQKEVGITFVFVTHSQDEALAL
SQRIAVMNQGQVEQIGEPSVIYSHPANRFIADFIGKINLMAARVTQVSDN
DMTLEIDQLGTTTLPLKQGIKTGDQGVMAIRPEQVSVHALARHAELPHAH
TGKVLDFLYVGDVTTYIVELDCGIRVEALLANSSPGRARFFEVGDPVIVS
WTREAAQFLMN
>NE1871 potB, potB; putative spermidine/putrescine transport system permease protein
MNRLRLTRWLVSLPPTLFLLLFFVVPSLLMIVTSFRYPGEFGGLAPLSVP
ATGLAEEGSGLYGLTLETYRFFFSDILYAEIFLKSFAVATATTLICLVMA
YPVAMLIARSAKKYRNLMVLLVVLPFASNFLIRIYAWMIILGPESTFSHL
INSVLDVLGLEPVMLLFSPFAVLVGTVYVHLPFMILPLYTNLEKHDPVLL
DAAQDLGANRWQRFWRVTWPQSLPGVFSGSALVFIPVLGMFAIPDILGGT
GDILIGNLIKDQFLGTRDWPFGSTLSIMLTLAVLSVAGLATWFARSRTAS
SN
>NE1872 potC, potC; putative spermidine/putrescine transport system permease protein
MKRNNLTLWLAAIPVYAFLYIPLIVVVVYSFNDSRLNAEWVGFTLDWYYK
LFNNGEMLLAARNSLIIAITASLLATVLGTMAGLAIHRYRLKVLPILAFT
PIAMPEILLGVSLLLFFLQVLNLTLGMVSIIIAHTTFCIGFVAIIVRARL
QGMDDSIFEAARDLGATPWQTFRLITLPLIMPAVAAGALMSFTLSIDDFV
ITFFTKGVGEPILPIQIYTMIKISVTPEVNAISTLLMLLTLALIILANRL
DAGALRGE
>NE1873 potD, Bacterial extracellular solute-binding protein, family 1
MKASIIRSYFVCLLISILAGCTSGSGSGDNPSNQQDNVLRLFNWNNYISQ
KTVERFEQMCQCRVTQDYYSDNEELLAKLAAGATGYDLLVPTGDAMDTLI
RQGALRELDKSKIPNLKNIDPGYLKTKFDPDNRYSVPYANTITLLGFNRE
KITGLGLPTDSWAIIFEPEYLKKIKGRVTVLDSQRELLAAALKYLGYSAH
DTDEAHWQQAKTLIIRAKPYWAAFNNTSYIRELAVGNLWVAHGYSNDMFQ
AAQDAKKTGRQFSIDFVIPKEGAVLSLDSMVLHRSGHRPDLAHRFINFML
EGENSAELTNLTGSGNPNLDARPHIHPEIIANPAIFPDAAQFSRLEMLED
INHTQRRLLSRIWTEIKLR
>NE0589 ppc, Phosphoenolpyruvate carboxylase
MSLNTRANIIPEKNTPNEKDYPLREDIRLLGRMLGDTIRELEGETMFNLV
ETIRQTSVRFRRDQDEAAEHELDTILNHLSHKETIAVVRAFSYFSLLSNI
AEDLHHNRRRRAHLRAGSPPQDGSVTLALQRVVKKGIDAEQLQNFFASAL
ISPVLTAHPTEVQRRSILDYQLKIQRLLKERDRTQLTPNEMRHNEEDLRS
AIQTLWQTRVLRSVRLTVQDEIENGLIYYHYTFLRQIPYIYAKLEDILER
HMDKAAPRIASFLRIGSWIGGDRDGNPFVTHQIMLHAAERHSALILDFYI
SEVERIGQTMSLSERLIKVSSDLEGLASTAPGLPASRIDEPYRRVFLGIH
ARLIATSRHLGSSIRGCCQENNAEPYADSAEFVHDLDIVIRSLRQHRSDR
LAQGALRDLRRAADVFGFHLAPLDMRQHSKIHEQVISELYEKNTRDDRNY
LEMSRSERVEWLLAELRHPRSLVTSFSDFSDVTQGELRILKMAAEIQRRF
GHAALPNYIISMATGVVHILEVALLLKEAGLLQFGDDPRSTVNIIPLFET
IDDLRGCASVMDELFSLPDYRKLLLSRDNLQEVMLGYSDSNKDGGFVTSN
WEIYKAEIELTRVFDRHGVRLRLFHGRGGTVGRGGGPSYQGILAQPPGSV
SGQIRLTEQGEVIASKYTDPEIGRRNLETLVAATIESTLLDRDAVHYHAP
HYHQIMEELSSSACAAYRDLVYKTPGFKQFFLESTPIREIAGLHIGSRPT
SRKPSDKIEDLRAIPWVFSWSLNRTMIPGWYGFGTAVENFVQQAGNEQEA
LKQLQEMYRTWPFLQTLLSNMDMVLSKSDLGIASRYAELVTDPELRQSVF
TSIRTEWELCMKWLFAITGYSELLQDNPTLARSIRIRTPYIDPLNHLQIE
LLRRYRSGDDDDTVRRAIHLTINGVATGLRNSG
>NE0042 ppiA, Cyclophilin-type peptidyl-prolyl cis-trans isomerase
MKTFPFERQTARPMKSITSTGLFFILCLMISATSAFAANPKVEIKTNLGA
IQVELYPDQSPKTVENFLNYVKDDYYTGTIFHRVIAGFMVQGGGFDQNYT
QKPTRQPVENEAANGLKNTIGTIAMARTQDPHSASSQFFINVANNHFLDY
TAPTLQGYGYTVFGKVTSGMDVVNNIASSPTGSNGPFNRDVPRKMIIIES
IKLLPAAVSPENP
>NE0041 ppiB, Cyclophilin-type peptidyl-prolyl cis-trans isomerase
MVKIHTNHGVITLELDNDKTPITVENFLRYVDSGHYENTLFHRVIDGFMI
QGGGYAPGMKEKSTLAPIQNEAALGSGNEAYTIAMARTSDVHSATAQFFI
NVANNHFLNHTNMTPQGFGYCVFGRVVEGKEVIDAIKKVKTGRHAGHQDV
PLEDVIIQKAERV
>NE2206 ppiD, PpiC-type peptidyl-prolyl cis-trans isomerase
MFDFVNKKKTVVQIVLLIAVLPFMFWGVESYRSAGDAGYAAVVDGEEISR
QEYEQAIRNQQENLRNMLGEKFDASLLDNPQMRLAVLENLIQERLLRREA
ERVGLTVLDSRLTAEIQNISFFHEDEKFSYQRYRDLLQRQGMSPAMFEAR
VAGELMRQQLLEGITGSVIVPRTVAGKVASLSATMYEINRMTISPEQYID
QAEPDEAAIQSYYDSHYQDFTLPERVKVEYVVLSLDELARQEQISDEEIR
KYYDEHQDEFGQAEERRASHILLSVPADATEEQKTSTKARAEQILEQVRQ
DPEKLPELAAELSEDPGSAKEGGDLGFFARGLMVKPFEDEVFQMQRGEIR
GPVETPFGFHIIRLTEVKGADVAGLDDVKEQIRQLLQHQKVADRFGELSE
DFSNIVYEQNDTLQPVAQMFNLSVQQSDWIDRNSREPSVITNERMLQAIF
SESAVRDHFNTESIEVAPDTFVAARVVEHRPASAQSIGLVKDRIVALLKQ
QLASEQAENEGKQKLAGLQAGEVDESLEWGESSEVSFAQKKGDNEEILHA
LFQTDVEKLPAYTGVVTSKGGYDLIRINQVKRPDIGNESSQYNLLFDQLQ
QIYGQEELSAYIAGLRQRYEVTIRPLEEND
>NE2359 ppsA, ppsA; phosphoenolpyruvate synthase
MTDHPDYILWLNQVGMNDIASVGGKNASLGEMIGNLARAGVKVPGGFATT
TRAYREFLEIDGLGDRISQKLSGLDVGDVTALADTGKSIRSWLLAAPLPD
NLLKAVAGAYQKLVSDMGEEVSFAVRSSATAEDLADASFAGQQETLLNVH
GLENLISAIREVFASLYNDRAIAYRVHHGFHHDQVFLSAGVQKMVRSDRG
ASGVMFTLDTESGFREVVFVTSTYGLGETVVQGIVNPDEFIVYKPNLEAR
LPAILSRRSGTKAIEMIYSDGQTGQATEIRPVPPDRSQRFSLSDEQIESL
ARQAVIIERHYNRPMDIEWALDGIDGQIYIVQARPETVKSRAEQSIEQFR
LDERGQVLVQGRAIGQKIGQGRARVILNAGQMSEVQPGDVLVTDMTDPDW
EPIMKRAAAIVTNRGGRTCHAAIIARELGIPAVVGCEQATAKISNGHAVT
VSCAEGDTGFVYEGLLPFQRMSSEIGTLSELPVRMMLNIGNPAQAFAFSR
LPNQGVGLARLEFIINNAIGIHPRALLEFDTLPDDLQASIRKRMAGYPDP
VSFYVEKLVEGIATLAAAFHPHPVIVRLSDFKSNEYANLIAGDRFEPHEE
NPMIGFRGASRYLADSFRACFELECRALRKVRDDMGFANVEVMVPFCRTL
EEAQQITELLAANGLKRGSNGLRLIMMCEIPSNAVLAEEFLEYFDGFSIG
SNDMTQLTLGLDRDSGLVAHLFDERNPAVKRLLKMAIDACHRQGKYIGIC
GQAPSDYPDFARWLVEQRISSLSLNQDTVVDTWLYLAGEKQ
>NE2366 ppsA, ppsA; phosphoenolpyruvate synthase
MTDHPDYILWLNQVGMNDIASVGGKNASLGEMIGNLARAGVKVPGGFATT
TRAYREFLEIDGLGDRISQKLSGLDVGDVTALADTGKSIRSWLLAAPLPD
NLLKAVAGAYQKLVSDMGEEVSFAVRSSATAEDLADASFAGQQETLLNVH
GLENLISAIREVFASLYNDRAIAYRVHHGFHHDQVFLSAGVQKMVRSDRG
ASGVMFTLDTESGFREVVFVTSTYGLGETVVQGIVNPDEFIVYKPNLEAR
LPAILSRRSGTKAIEMIYSDGQTGQATEIRPVPPDRSQRFSLSDEQIESL
ARQAVIIERHYNRPMDIEWALDGIDGQIYIVQARPETVKSRAEQSIEQFR
LDERGQVLVQGRAIGQKIGQGRARVILNAGQMSEVQPGDVLVTDMTDPDW
EPIMKRAAAIVTNRGGRTCHAAIIARELGIPAVVGCEQATAKISNGHAVT
VSCAEGDTGFVYEGLLPFQRMSSEIGTLSELPVRMMLNIGNPAQAFAFSR
LPNQGVGLARLEFIINNAIGIHPRALLEFDTLPDDLQASIRKRMAGYPDP
VSFYVEKLVEGIATLAAAFHPHPVIVRLSDFKSNEYANLIAGDRFEPHEE
NPMIGFRGASRYLADSFRACFELECRALRKVRDDMGFANVEVMVPFCRTL
EEAQQITELLAANGLKRGSNGLRLIMMCEIPSNAVLAEEFLEYFDGFSIG
SNDMTQLTLGLDRDSGLVAHLFDERNPAVKRLLKMAIDACHRQGKYIGIC
GQAPSDYPDFARWLVEQRISSLSLNQDTVVDTWLYLAGEKQ
>NE1745 ppx, Ppx/GppA phosphatase
MHEYSTLAAVDLGSNSFHLQVARIAEKQLYLLDSLKEMVQLAAGLSDDQI
LDEASQNRALACLERFGQRLRGFPHHAVRVVGTNSLRVARNAADFLRKAE
NALGFPIEIIAGHEEARLIYLGVVHSLPISDNHLLVVDIGGGSTELIIGN
RLKSNKLESLPVGCINHSLHFFPDGKITKGSLKQAELAARTEIQAIAAEF
SSSHWQKAYGSSGTARALAHILTLNGYNNGNNEEVITLAGLEKLREFLLK
TGDIKKLEITGLNPARKAIIAGGFAIMSAVFTELNIDHMAIATGALREGV
LYDMLGRFHKEDTREISVQEFMQRYRVDPVQAARIESLAVTLGKQLLADH
PDTEKVEDALKLLSWAARLHEIGISVAHTGYHKHSAYILDNADIPGFSKM
EQSQLSQLVLSHHGSLVKTRDFLSQPVNFARSIALRIATIFYRSRTHINL
PDMTLSMNGKRCTLLIPQAWLERNPLTGTLLNNETETWAKVDIDFRIENR
VDGRRTKSGAA
>NE1913 prfA, prfA: peptide chain release factor 1
MNKGIIDQLTRLSMRLGELDKLLSTEKITADLDNYRKLSRERAEIEPVTE
LYRTYQQVEQDLATAREMSSDPQFRDFAEAEIEADRRKLTNIETEILRQL
LPKDPNDERNIFLEIRAGTGGDESALFAGDLFRMYSRYAEREGWQVEVVS
QNPSEVGGYKEIIVRIIGHGAYSRLKFESGGHRVQRVPATETQGRVHTST
CTVAVLPEADEIADITLNPADLRIDTFRASGAGGQHINKTDSAVRITHLP
TGIVAECQEGRSQHKNKAQAMSVLIARILDKQVRAQQAEQAATRKSLVGS
GERSERIRTYNFPQGRITDHRINLTLYKIEQIIDGELDELCSALAAEHQA
AQLAAMTEK
>NE1229 prfB, Peptide chain release factor 2
MICGGIFDFDAKQIRLTELERQSEDPAIWNDPDRAQEMGREKKSLENIVH
TLQSVDRQLRDTHELLQIAQEENDKETLQSIADDVTELDQTLAGMEFRRM
FSDPMDPNNCFVDIQSGSGGTEAQDWASMLERMYLRYCERRGLGVELLEE
SPGDVAGIKSATLKISGEYAYGYLRTETGVHRLVRKSPFDSGNRRHTSFA
SVFVYPEVDDSIEIDINPADLRVDTFRASGAGGQHINKTDSAVRITHIPT
GIVVQCQSGRSQHRNRADAMTVLKSRLYEAELRKRNETKQAIEDSKTDIA
WGHQIRSYVLDQSRIKDLRTNIEIGNTQAVLDGDLDTFIEASLKQGV
>NE2481 prfC, prfC; peptide chain release factor RF-3
MTIDHEVKRRRTFAIISHPDAGKTTLTEKLLLFAGAIHIAGSVKARKASR
HATSDWMEIEKQRGISVASSVMQMEYRDCVINLLDTPGHQDFSEDTYRVL
TAVDAALMVIDAANGVESQTLRLLQVCRARNTPIITFVNKLDREVREPLD
LIDEIERTLGMDVIPFTWPVGSGKRFHGVYDLRHKLMRVFRAGMDRVEQE
ETAIITNLEDPAISERFGANLEQARQEIELITGAAPEFDQTAFLAGQQTP
VFFGSAINNFGVQEVLDTLVELAPPPGSRKAIQREIQPAEKKFSGVVFKI
QANMNPAHRDRIAFVRICSGEFRRGMNLKVVRSGKDVRTSTVVSFLSQRR
ELLETAYAGDIIGIPNHGTLQLADTLTEGDHLQFTGLPFFAPEIFQTVEI
ADPLRSKQLKLGLAQLGEEGAIQVFRPHIGSMLLLGAVGVLQFEVVTHRL
KHEYGVEARIAPAKYQLARWVTAETPQELQRFIDANAHRIAYDAVNAPTF
LASFSAEISVAEENWPGIRFHKMREHAGLMFQTAG
>NE1505 priA, probable priA; primosomal protein N' (replication factor Y)
MVIIRVALDVPIDRLFDYLAPDADTADIGRCVRVPFSSRQISGIIISVCE
TSSVPEGKLKYAGQIDRQTPPLPQPLLGLFEFCSRYYHHPIGQVVMNGLP
VLLRKFKHTGKEQPPSWRLTDTGKSITLADLPIRAKAKRQLISLLSEHGI
ITAEICKAMSSHSRKLLHEFKDLGWVEQFTALPEKAVFSTASSPAPTAEQ
AQAISEILDRTGTFTPWLLNGITGSGKTEVYLQVTASLLAQQKQVLILVP
EINLTPQLEAVFRKRFPGTTLVSLHSGLNNSERLQGWLQAQRGKAGIVLG
TRLAIFTPMPELSLIIVDEEQDHSFKQQDGLRYSARDLAIYRARQANIPV
ILGSATPSLESYHQARTGRYRLLQLHSRAISQAALPTIRCIDLRVIPAQE
GLSEPVLDALRHCLARKQQSLVFINRRGYSPVLLCKSCRWIATCKRCSSR
LVVHLRDRQLRCHYCGDQQPVSPACPQCGDPDVLPFGHGTQRVEAALIRH
FPEARILRVDRDSIRHKGAWQQMLDRIHRGEADILVGTQLLAKGHDFPNL
ALVCALNADASLYSTDFRAEEHLFAQLIQVAGRAGRANVPGSVLIQTEFP
QHPLYQALIRQDYAAYAQAHLKERRSAGFPPFVYLAVLRAEAPVLTDALE
FLRQAAALAAVTENYPHIQLFDPVPAHMTRLKGLERAQLLIQARSRRHLQ
TFLGDWHQRITALPVHSRIRWHLDVDPLTL
>NE1663 prlC, Peptidase family M3
MKNPLLDFSTLPRYEEIRNEHITPAMDELLRDCRAVVNRVKNATESPDWQ
DFVQPVVDANERLSRAWGQIAHLNAVMNNPELREIYNANLPRITQYYAEL
SQDPVLFEKFRQLRADPAFNDLSQARRKIVDNQLRDFHLGGAELPLEDKA
RFMQIQEELSALSSKFNDNLLDATNAFSLFIENRDELAGIPEDVLQAARE
AATKDTSITVPGWKFTLHAPSYLPVMQYADNRSLRERMYRAYVTRASELE
AVPEQTVNRDNMPLIEKMLRLRQEEARLLSYDCYAQVSLTPKMAETPQQV
LDFLNELAAKARPYAERDLAELQQFAADKLKLDRLEMWDIAYASEKLRIE
RYAFSEQEVKQYFPENKVLPGMFRLVETLYGIRISEAEPARNIQCWHPDV
KFFDIADANGNLLGQFYLDLYARPGKRGGAWMDDAITRRKIEVPEIGRSE
IQAPVAYLTCNFSAPVTIDGQLRPALFTHDEVITLFHEFGHGLHHLLTRM
DELGVSGINGVEWDAVELPSQFMENFCWEWEVLRGMTAHVETGNPLPRAL
FDKMLAAKNFQSGLQTLRQIEFALFDMHLHTDFDPQGSETVLQRLDKIRQ
RVAVIIPPAFNRFPDSFGHIFAGGYAAGYYSYKWAEVLSADAYSLFEENG
AGQVVSAKTGERFWHEILAVGGSRPALESFIAFRGREPKIDALLRHHGMA
A
>NE0654 prmA, prmA; putative ribosomal protein L11 methyltransferase
MSWLSLTFRIGSDYVDLVGDRLLERGALSVDVHDAGEGTSQEQPLFGEPG
APLDQFWQQAEVTVLLEENANIDEIIQGVAEVIGLPALPEYQLAQVMEQD
WVRLTQAQFEPIRISSRLWVVPSWHEPPDPAAISLRLDPGLAFGTGSHPT
TRLCLTWLDQFLQPGDSVLDYGCGSGILAIAALKFGADRVTGMDIDPNAI
TASLDNARNNFCDPDRLLFTTVLPPLVEDDRASAEWAPVTIVVANILANP
LIMLAPVLMKALQPGGRIVLSGILETQADEVLQVYSEWFDMHIAAKEQGW
VLLAGQKSGGGFA
>NE2158 proA, Gamma-glutamyl phosphate reductase:Aldehyde dehydrogenase family
MEKTEKIQADMQALGRAARAAARIVAKADTAVKNHALIAMARAIRCHEAS
LLAANAADVAQARNKGLEPAMIDRLTLTPKGIASMAAGLEQIAALSDPIG
AVTDLDYRPSGIQVGRMRVPLGVIAIIYEARPNVTADAAGLCLKAGNAAI
LRGGSEAIQSNQAIAACVQEGLRSAGLPEHAVQVVETTDRAAVGELITMS
EYVDMVVPRGGKGLIERIANEARVPVIKHLDGVCHVYVDLSADLEKAVRV
ADNAKTQRYGTCNTMETLLVHAGIAERFLPRICKILLEKGVELRGDEAAR
ALVAGIKPAVEEDWYAEYLAPVLSVRIVEDIDQAITHIATYGSQHTDAIV
TEDYSRARQFLREVDSSSVMINASTRFADGFEYGLGAEIGISTDKLHARG
PVGLEGLTSQKFIVLGDGHIRE
>NE1290 proB, Aspartokinase superfamily:Glutamate 5-kinase:PUA domain
MTQSILQSARRIVVKVGSSLVTNEGHGLDRCALGGWAGQISRLRQMGKEV
VLVSSGAVAEGMQRLGWKMRPAALYELQAAAAVGQMGLAQAYADCFSEYE
LQTAQILLTHDDLSSRKRYLNARSTILTLLNLGIIPVINENDTVVSDEIR
FGDNDNLAALVANLIEAEVLVILTDQEGLFTGDPRKNTDARLIPEVSVSD
PDIEVMAGGAGSSIGRGGMQSKVMAARRAARSGAHTVIASGRAKDVLVRL
AGGEAIGTILLADMPVKVARKLWLADHLQVRGSVVLDDGASRALLSGGKS
LLPIGVVEIKGEFERGEAISCLDASGREIARGLINYDARESRKIMRRPTS
QIEAVLGYVDEYELIHRDNLVIL
>NE0393 proC, Delta 1-pyrroline-5-carboxylate reductase
MKITFIGGGNMAGAMIGGLIHHGSTPSHICAVETDAMRRQQLAEACGVTV
TASIPDGIHGSEIVILAVKPQQLRAVANQAAQFLQNKLVISIAAGVRASD
LSQWLNRHEFVIRAMPNTPALIGEGITGLYALPLVDSRQKEQATTILEAI
GTVIWVNEEAQLDAVTALSGSGPAYVFYFLEAMQEAGTQLGLTADTAKQL
AMQTFLGATHLAAQSNDTLATLRMKVTSQGGTTEQALLSMEQADIRGAFI
RAMHAACDRSRQMGEIFGRT
>NE1317 proS, Prolyl-tRNA synthetase
MRVSQFFLATLKEAPAEAELISHRLMLRAGLIKRLGSGLYTWMPLGLRIL
HKIEHIIREEMNNSGALELLMPAVHPAELWQETGRWDVFGPQMLKIQDRH
KHDFCFGPTHEEVIVDIARREIKSYRQLPINFYQIQTKFRDEIRPRFGVM
RAREFIMKDAYSFHADVDSLEQTYRLMQETYSRIFTRIGLKFRAVAADTG
AIGGSGSHEFHVLADSGEDAIAFCPASDYAANIELAEAVPHGIPRKEPAG
IMAKIATPDRKSCQDVADFLGIPVEQTLKALAVTAGGKFYLLLLRGDHQL
NETKVRKIPFLGDFEFAEESRIIAEMSCPPGYLGPVGVKAEIIADRAVLE
MSDFTCGANEEGFHLSHVNFGRDLPLPDQVFDIRNIVAGDPSPDGKGVLE
ICRGIEVGHVFQLRTKYSEKMKATYLDESGQTRIMEMGCYGIGVSRIVAA
AIEQNYDERGIIFPQTIAPFQLSIIPVGYHKSQQVRTEAEKLYQACRSAG
IEVLLDDREERPGVMFADQELIGIPHRIVIGERNLREGMVEYQGRLDKTP
QMLSLSEAASIISKICGD
>NE1322 psd, psd, phosphatidylserine decarboxylase
MSTPYYPHPIIAREGWPFIAGAFAVALLVQFMAGWLWALPFWLIALFVLQ
FFRDPPRVVPALAGAVLAPADGRIVAVDKVQDPYLPREALKVSVFMNVFN
VHSNRSPVDGEIRNRWYFPGNFLNASLPKASLENERNALWIRTDGGQDVT
CVQIAGLIAKRIVCHVHPGEHLARGQRFGFIRFGSRVDVYLPLGTKVNVA
IGDKVYATQTVLAEFH
>NE1321 pssA, pssA; CDP-diacylglycerol--serine O-phosphatidyltransferase
MQESDPEQLPVTGPRWRRRGIYLLPNLFTTAALFAGFYAIVQAMNGHYEH
SAIAIFVAMVFDGLDGRVARLTHTQSEFGAEYDSLSDMVSFGVAPALIVY
EWALRDLGKLGWIAAFIYCACAALRLARFNVTVEVVDKKYFQGLPSPAAA
ALIAGMIWVVLGFQVDAADITWLAWAMTLFAGLTMVTNIPYYSGKEFNLH
KKVSFFVVLLLLLFFFVLIPSHPPLVLFGVFLTYALSGYGMKIWRLYKNR
KRNEIRKET
>NE1001 pstB, pstB; phosphate transport system ATP-binding protein
MQIFKPASGHTHSTVQKSVVNKLNFYYGGYQALKNIDMMVYEKQVTALIG
PSGCGKSTFLRCFNRMHDLYPRNHYEGEIILHPDNANILSPEVDPIEVRM
RISMVFQKPNPFPKSIFENVAYGLRIRGVKRRSILEERVENALRNAALWE
EVKDRLGDLAFNLSGGQQQRLCIARALATDPEILLFDEPTSALDPIATAS
IEELISDLRNKVTILIVTHNMQQAARVSDYTAYMYMGELIEFGATDTIFI
KPKNKQTEDYITGRFG
>NE1785 pt, Low molecular weight phosphotyrosine protein phosphatase
MKHSEAEKVGVLFVCMGNICRSPTADAVFNHHVKSARLEHLFHIDSAGTH
AYHIGEPPDRRSQQAALRRGYNMQSLRARRVVPEDFSRFQYILAMDRHNL
EELQQNCPSRYTSRLGMFLQYSNQWDIRQEIPDPYYGGSHGFERVLDLVE
NASQGLLKYILENLNT
>NE0298 pta, Phosphate acetyl/butaryl transferase:Phosphate acetyltransferase
MHTFFVTSTGFGVGLTSTSLGLVRALEYGGLKAGFYKPVAQQHPGNSKLE
YSTELISRTLGLAPPAPLPLATVEHLLGEGQIDDLMEDIVRRFKQASEGY
DVMVVEGMVPTRHVSYASRVNTRLASSLDADIILVSSAEDDALQAITDRI
EIQAQFFGGAQNPRLLGVILNKIRTDHSDDLFEQLKNHSTLFHQSSFQIL
GCIPWEDSLNAPRMADVVTQLQAQIVNAGDSEKRRVQDIVLFASAAPNSV
TLLRPGVLVVTPGDRDDIVMAASLAVLNGVPLAGLLLCSDFPPDPRVLEL
CKGALTKGLPVCTVTTNSYDTAANLHRMNREIPLDDHERAERITNFVANH
IRQELLVKRCGEPQEQRLSPPAFRYRLVKRAQEADCRIVLPEGYEPRTIQ
AATICQERGIARCVLLAKPDAVKAVASARGITLPEGLEMIDPEKVRRNYV
AAMVELRKHKGLNEPMALAQLEDNVVLGTMMLATGEVDGLVSGAINTTAN
TIRPALQLIKTAPGFKLVSSVFFMLLPEQVVVYGDCAVNPNPTAEELADI
ALQSAASAQALGIEPRVAMLSYSTGDSGSGQEVEKVREATRLARLARPDL
LIDGPLQYDAAAIASVGRQKAPGSPVAGRATVFIFPDLNTGNTTYKAVQR
SANVVSVGPMLQGLRKPVNDLSRGASVEDIVYTIALTAVQAASQR
>NE1824 pth, Peptidyl-tRNA hydrolase
MELPVRLVVGLGNPGEKYASTRHNVGFDWLDRLASSQRVAFTLETRFRGL
CARIMLADTDIWLLKPQTYMNASGMSVAAVCRYYKIVPEQMLVVHDELDL
QPGVIKLKSGGGTGGHNGLKSIVADLSSQVFWRLRIGVGHPGDRNQVVDY
VLHPPRREEAALIDEAIDHSMQVWPLIARGDFAAAMQRLHTRQEN
>NE2184 ptsH, Phosphocarrier HPr protein
MQTESLTILNKLGLHARASAKLTQLAGKFESEVWLTRNGRRVNAKSIMGV
MTLAASIGTVVELETSGPDEVQAMEALKALINDYFGEGE
>NE2185 ptsI, PEP-utilizing enzyme
MSFILYGTGVSDGIAIGHAHLASSATLNVPHYLLPKNQINAELTRLRNAF
STVRLELETLKTSAAQVSGLAEFNAFLELHQMILDDPTLSHAALETVEQA
QCNAEWAITQQMEVLVARFEEIEDAYLRERKTDVVQVVERVLKVLLGHPG
YTPPPSKHDGSSILVAHDLSPADVMQYKQHQFSAFLTDMGGPTSHTAIVA
RSLNIPSIVAMQHAQQLIYENDNLIVDGNQGIVIVNPDKYILAEYRLKQS
QLELEKRKLWRIRSVKAVTLDGTTVDLLANIELPQEVEQARECGAMGIGL
FRSEFLFLNRDDLPDEEEQFEAYSTVVKGMHGQPVVIRTFDLGADKNLRS
AYRTAPNPALGLRAIRLSLAEPGMFLVQLRAILRASVLGKTRILIPMLCS
SREVDQTLQMIEFAKQSLRDENKPFDENVRIGCMVEIPATALSLELFMRK
LDFLSIGTNDLIQYTLAIDRTDETVAHLFDPLHPAIIRLLVQIIQNAGKA
GIPVSICGEMAGDSEYTRLLLGMGLRQFSMYPAQLLTVKREILGSHLPEI
SRLMQKILKADEPEKISELLQKLNS
>NE0060 ptsN, Phosphotransferase system mannitol/fructose-specific IIA domain (Ntr-type)
MNLIAQLLPESNIIIDLDATSKKRVFEQVGLLFENTLHIARSQVFDSLFA
REKLGSTGLGQGVAIPHGRIKGLREAAAALVRMKEAIPFDAPDGLPVSIA
CILLVPEKATDQHLLILSELAQMFSNKNFREKLLHGRDAKEIHQLISDWV
PEHVPG
>NE1281 purA, Adenylosuccinate synthetase
MARNVVVIGTQWGDEGKGKIVDWLTDRATGVVRFQGGHNAGHTLVVGGEK
TVLHLIPSGILRENVTCYIGNGVVLSPGALLEEVDMLEQAGVDVSGRLRI
SETCPLILPYHIAVDGARELAKGMEKIGTTGRGIGPAYEDKVARRAIRLQ
DLFRPERLACKLKEVLDYHNFLLKNYYHAATVDYHQVLDDCQIKAERIRP
MVADVPKLLFEASQADNNLLFEGAQGALLDIDHGTYPFVTSSNCIAGAAS
VGSGVGPQMLNYVLGITKAYTTRVGSGPFPTELSDATGEHLAQRGNEFGS
TTGRPRRCGWFDAVAARRSIQINGVSGLCITKLDVLDGLETLRIGVGYKS
KQSGEMYDALPFGADDLAAAEPVYEELPGWQERTAGIRNFDQLPQAAQNY
LKRMEEVCQTPISMISTGPDRTETIVFHHPFG
>NE0866 purC, SAICAR synthetase
MATAALFETSITSLPLLHRGKVRDIYAVDENHLLIIQTDRVSAFDVILPT
PIPEKGKILTKISRFWFDKLAHIIPNHLTDITPESVVSSREQDQVSDRAF
IVRKLKPLPVEAIVRGYISGSGWKDYQRSGTICGIALPAGLREADKIPDG
AIFTPSTKAEAGSHDENISYSVCEQLLGVSLAAAVSRHSIALYTAAADYA
LTRSIIIADTKFEFGLDEANQLYLIDEALTPDSSRFWPAESYRPGKTPPS
YDKQFIRDWLEQINWNKTPPAPPIPEEVLVQTIEKYQAACRVLTQ
>NE0877 purD, Phosphoribosylglycinamide synthetase
MRLLVIGSGGREHAIAWKLSQSPLIEKIFVAPGNAGTVLEAKLENVPMTS
IPDLVTFARQEKIDLTVVGPEVPLTEGIVDTFQASGLRIFGPTRKGARLE
GSKSFAKAFMLSHGIPTAACETFDDAAAAHRYIDQHSLPVVIKADGLAAG
KGVVVAQTLSEAHAAVDAMLVDNRLGTAGAHILIEDYLEGEEASFIVLSD
GKNVLPLATSQDHKRLRDNDEGPNTGGMGAYSPAPVITPELHDRIMQKVI
MPVIQGMTQQSQPYVGFLYAGLMISPQGEVRVLEFNCRLGDPETQTIMLR
LKSDLFILLEHAVDGTLDQTNIEWDQRVALCVVMAAQGYPDNPRKNDIIH
GLDSVMAAQHHSDEFHIFHAGTALNSTDSTEILTFGGRVLGVTALGGDIR
QAQSRAYAIAADIHFDGCQMRRDIGSRGLAQGSGGKPG
>NE0868 purE, purE; phosphoribosylaminoimidazole carboxylase catalytic subunit protein
MDHPVVSIVMGSSSDWQVMRHAASVLKNFCIPHETMIISAHRTPDRMFEY
AETAHDRGIRCIIAGAGGAAHLPGMIAAKTTLPVLGVPVTSKHLQGVDSL
LSIVQMPKGVPVATFAIGEAGAANAALFAAAILAGGDISLAEKLDQFRKD
QAASVAAMTLPESCT
>NE0699 purF, Glutamine amidotransferase class-II:Phosphoribosyl transferase
MCGILGVVARSPANQLLYDGLLMLQHRGQDAAGIVTAQGNSFHMHKGLGL
VRDVFRTRDMRALPGYMGIGHVRYPTAGSSTSPAEAQPFYVNSPFGIVLA
HNGNLTNAETLNQELFLADRRHVNTHSDSEVLLNVLAHELQERAVGYQLD
LAEIFSAVAGVHERCQGAYAVIAMIAGYGLLAFRDPYGIRPLVFGSVETD
AGTEYMVASESVALDTLGFRLIRDVAPGEAIFIDMEGHFYSHQCAALPSL
NPCIFEYVYLARPDSMLDGISVYETRLNMGINLADKISTSMQHLDIDVVI
PIPESSRPSAMQLANRLGISFREGFVKNRYVGRTFIMPGQQQRRKSVRQK
LNAIEIEFRGKNILLVDDSIVRGTTSREIVQMAREAGANKVYFASAAPPV
RFPNVYGIDMPTRQELIATDRTDEEICREIGADYLVYQNLDALRQAITQI
DPQIRHFETSCFDGRYIAGNITQEYLCRIESQRNNPGLSQGERTTQLDLA
LLSTD
>NE0876 purH, probable phosphoribosylaminoimidazolecarboxamide formyltransferase and IMP cyclodydrolase transmembrane protein
MSIKQALISVSDKTGIVELAAALHQFNITILSTGGTARLLQESGIPVIEV
SDYTGFPEMLDGRVKTLHPKVHAGILARMDLPEHRQVMEQAQIPAIDLVV
VNLYPFTQTITRPDCSFEEAIENIDIGGPTMIRAAAKNFQRVAVVTDPQD
YGPLLEQIRANDGSIDHDYRFQLAQKAFSHTARYDGAISNYLTALEPLSS
QRKRFPDQLNLSFTLAQPLRYGENPHQQAAFYRDTTNVPGSLADYDQLQG
KELSYNNIADTDAAWACVRTFDKPACVIIKHANPCGVAIADTALAAYKKA
FTTDPVSAFGGIIAINRAVEADLAKAVLQQFVEVIIAPEFSPEARHLLAD
KPNIRVLTVPLDKHNNVYDLKRVDGGLLVQTPDDLDITVDQLQVVTRIKP
TREQIEDCLFAWRVAKFVKSNTIVFCANGQTLGIGAGQMSRVDSARIASI
KAQQANLDLHGSVVASDAFFPFRDGLDVLAQAGAGVVIQPGGSIRDEEVI
AAADEQGVAMVFTGVRHFRH
>NE0867 purK, phosphoribosylaminoimidazole carboxylase, ATPase subunit; ATP-grasp domain
MYIKPGSMLGLLGGGQLGRMFAMAAQQMGYRVTVLDPAAESPAGSIAERH
LQADYLNDQALDELGSTCAAVTIEFENVPAQALRYLARRCVVSPDAESVS
IAQNRIREKQFLAENGFPVGPFTVVHDGQDLPRIDVSLFPGILKSSQSGY
DGKGQMRIEHADILSEALTALGNKPCVLEKRLSLAHEISVVLARDNLDQI
TFFPVPENHHERGILDTSIIPATLSDEIVGKARAVARQVATRLNYVGILC
VEFFVLGDGQILVNEIAPRPHNSGHYSLNACVTSQFEQQVRMLCGLPHGS
TKLVQEAAVMTNILGDLWKNGEPDWNSILQHPQTKLHLYGKRAAKPGRKM
GHFTVLADTVEEALQQTARIRQDLQF
>NE0019 purL, AIR synthase related protein
MLQFCGRNALSPFRLERLLHSIQAIVPQISGITANYRYFCQVQRDLTQEE
TNRLRQLLDVDETEAQPFPDSKLLLVVPRPGTISPWSSKATDIAHCCGLN
GIERLERGISFELRCQVTLLPAQLSRIEACIHDRMTEVVLHSLAEATLLF
HHSEPGMLNEIDLTGRGIDALLQANREMGLALSSDEIDYLLDYFTRIQRD
PTDVELMMFAQANSEHCRHKIFNADWIIDGVQQPHSLFGMIRHTHQSHPQ
HTIVAYSDNAAILEGEIIERFYPGKNDQYGYAPELTHWLAKVETHNHPTA
ISPFPGAATGVGGEIRDEGATGSGAKPKAGLTGFSVSNLRIPEAIQPWET
NAYGKPGHIASALDIMLTAPIGGAAFNNEFGRPNLAGYFRTYEVEINGRM
RGYHKPIMLAGGIGQISALHTAKEPFPEGTLLIHLGGPGMAIGLGGGAAS
SMDAGTNEEALDFNSVQRGNAEIQRRAQEVIDRCWQLARSGQPNPILAIH
DVGAGGLSNALPELVHDAGRGGRFDLRAVPSDEPGMSPMQIWCNEAQERY
VLAIRPESLSLFQAICERERCPFAVVGEALAELQLIVTDERSSTHPADMP
LPVLLGKPPKMVRDVKRIHPDPALLDKTGMQLQEAARRVLRLPAVANKSF
LITIGDRSVGGQTARDQMVGPWQVPVADVAVTTMGFQTYSGEAFALGERT
PLAVINAASSARMAVGEAVTNLAAAAIAGLGTVRLSANWMAAAGHPGEDV
ALYDAVHAVAMELCPALGISIPVGKDSLSMKTAWHDDQPKEVVAPLSLII
SAFATVSDVRKTLTPQLRTDKGETKLILIDLGNGKNRLGGSALAQVYTQS
GDDTPDLESPEKLNAFFDAIQTLNGSGLLLAYHDRSDGGLFVTLCEMAFA
GHCGITLALDSLISLSAEPDDDRDGQVLSTLFNEELGAVIQIEARHHDAV
MTILADAGLGHISHTIGSPNRQDDITLMVDGGIVFQEKCVALQRIWSETS
FRMQKLRDHPECAQQEFDQILDVDDPGLHAQLTFSLTESVASAPAILASR
PAVAILREQGVNGHVEMAAAFDRAGFDAVDVHMSDILSGRVKLAEYKGLI
AGGGFSYGDVLGAGRGWAQSILFNARARDEFATFFARTDTFALGVCNGCQ
MMSHLQAIIPGTAHWPRFGRNRSEQFEARFVMVRIGTSPSLFFDGMAGSR
LPVTVAHGEGLAVFRDAEQLLAAHPYVALQFVDNRGTLTETYPLNPNGSP
AGITGMTSADGRVTILMPHPERVFRTVQHSWHPDDWPEDSPWMKMFRNAR
KWVD
>NE0088 purM, purM; phosphoribosylformylglycinamidine cyclo-ligase (airS) protein
MTASNPANPDKNPISYRSSGVDIDAGNRLVERIKPFARRTMRPEALGGIG
GFGALFEVPGKYRQPVLVSGTDGVGTKLKLAFQYARHATIGIDLVAMSVN
DILVQGAEPLFFLDYFACGKLDVEIATQVVQGIAHGCEDAGCALIGGETA
EMPGMYPEGEYDLAGFAVGVVEKDRIIDGSGIRENDIVLGIASSGPHSNG
FSLIRKILDLNTVQSDTRLGETSLIDALLTPTRIYVKSLLTLMAELPVKG
LAHITGGGLLENIPRVLPDGVMAHIDSSTWVRPSLFDWLQQHGNVAREEM
LRVFNCGIGMVVVIAPEHAQTAIEILHNAGETVWQIGEIRPRQSETLPIV
IA
>NE0087 purN, purN; phosphoribosylglycinamide formyltransferase protein
MKSVVILISGRGSNMQAILEAGLPVAAVISNNPAAEGLMFAQTRGIPTQV
IDHRTFPDRKAFDAALAETIDTYQPDLVVLAGFMRILSEAFVDHYQGRLV
NIHPSLLPAFPGLDTHTRALQEGVKIHGCTVHFVTSQLDHGPIIAQAAIP
VLTDDTPTMLATRVLAQEHRIYPQAVRWFLQGQLTLVENRVEIKTTSDSQ
EVLYSPGIEP
>NE0325 pykA1, Pyruvate kinase family
MMRRTKIVATLGPASSNAEVLGRMLEAGVDVIRINFSHGTKDEHIASVEL
VRSLARSLGRTVGVLADLQGPKIRIGKFEQGKIRLKTGDEFILDAECQLG
NQERVGLDYRELPNDVEAGATLLLDDGRIVLTVAKVRESEIFCEVLQGGI
LSNNKGINRKGGGLSAPALTAKDLLDIKTSAVIRADYLAVSFPRSGDDIR
RARALMQEAQGHSLLMAKIERSEAILALDDILEASDAIMVARGDLAVEVG
DAAVPALQKRMIRSAREANKLVITATQMMESMISNPIPTRAEVSDVANAV
LDGTDAVMLSAESAAGQYPVEAVEAMARVCLEAEKEYTPSLRARRSPDTQ
SISIEDAIARATMYAAGSLNIQAIAALTQSGVTALFMSRRSSKAPIFALS
PQEDTLGKVTLFRGVYPIEFGQGLCDPEIILGMAEDELLKRGAVRDEDLI
IMTIGEPVGKAGGTNTMKIIKVGSSKKAAQAVREVAQQPFSDPQNL
>NE1665 pyrB, pyrB; aspartate carbamoyltransferase (catalytic chain) protein
MGVHPQLNKNGELQHLLTTEGLPAVILRHILDTAESFTGVTERDVKKIPL
LRGKSVFNLFFEPSTRTRTTFEIAAKRLSADVINLNMAVSSQTKGETLLD
TVDNLSAMHADMFIVRHNQSGAAHLIARHVRPEIHVINAGDGWHAHPTQA
LLDMFTIRRYKQDFHALRVAIIGDILHSRVARSQIHALTTLGVPEIRVIA
PKTLLPAKVERLGVHVYHNMVQGLQDVDVLMMLRLQHERMESAHLPSTEE
YFKYYGLTPEKLALARSDAIVMHPGPMNRGVEIDSEVADGTQSVILPQVN
FGIAVRMAVMSILAGN
>NE0727 pyrC, Dihydroorotase homodimeric type
MNDTADKLTFTRPDDWHLHLRDGNAMRSVLPDTARRFARAIIMPNLKPPI
VTTEQAAAYRVRILSALPAELAGRFEPLMTLYLTDTTSPEEITRAQASGI
VQGIKLYPAGATTHSDAGVTDIARCEATLEKMEELDMPLLVHGEVVDPAV
DVFDREKIFIDRVLTPLLQRFPGLRVVFEHITTREAVEFVQTAPNRIAAT
ITAHHLMLNRNALFTGGLRPHHYCLPVLKREIHRQALLEAATSGHSRFFL
GTDSAPHPVRDKESACGCAGIYSAHAAIEFYAEVFEQAGRLDRLEAFTSF
HGPDFYGLPRNTDRISLIRESWQIPEQVELGEEKLIPLRAGEHARWKLA
>NE2221 pyrD, pyrD; dihydroorotate dehydrogenase oxidoreductase protein
MMYNLLRPLLFLLDPETAHTVTLDTLNLLKRTGLLPDRTIDCTPVRVMEL
DFPNPVGLAAGLDKNGTCISALASLGFGFIEVGTITPRPQVGNPRPRLFR
IPQAEAIINRMGFNNVGVDKLIENVQQSGYRGILGINIGKNADTPLQNAI
DDYLICLRKVYLHASYVTVNISSPNTQQLRQLQNETELEHLLGALKAEQT
RLSDLHGRYTPLAIKIAPDLESEQITVIASLLMKHRMDAVIATNTTTARE
GVEHLPRGNEKGGLSGAPLTERSTTVIRQLANHLQHAIPIIGAGGIMSAR
DAQLKIEAGASLVQIYSGLIYRGPDLVTKTVSLLCSRTTHAG
>NE1734 pyrE, Phosphoribosyl transferase:Orotate phosphoribosyltransferase
MSDFQWRFIDFALQYDVLRFGNFRTKAGRPSPYFFNAGLFNDGFALKQLG
QFYAQAILASGIRFDALFGPAYKGIPLVSTIAIALAEAGHNHPFSFNRKE
IKDHGEGGDIVGAPLAGRILIVDDVVSAGLSVGESITLIHAAGATPCGIM
VALDRMEKGKSECSTLQEIKNKYDIPVISLITLDDIIAYLHTRQDLVHHI
PAIETYRTFYGAKVPDTVNHTGRSTSV
>NE1959 pyrF, Orotidine 5'-phosphate decarboxylase
MNDPRIIVALDFPDQCTALNFAAGLDSTLCRVKVGKELFTLAGPQLVEKL
MKLGFDVFLDLKFHDIPNTVAAACSAASSLGVWMVNVHALGGSKMLLAAR
QALDGKRTRLIAVTLLTSLNQNDLSELGIADTPETMVQRLALLAQRCGLD
GVVCSALEAVSLREVTGEDFCLVTPGIRSFGDGNDDQARIATPAMAIRSG
ASYLVIGRPITRSPDPLGALRRFNDEVASVL
>NE1045 pyrG, Glutamine amidotransferase class-I:CTP synthase
MTKFVFVTGGVVSSLGKGIAAASLAALLETRGIRVTILKLDPYINVDPGT
MNPFQHGEVFVTDDGAETDLDLGHYERFISTKMTRQNNFTTGQIYESVIR
KERRGDYLGGTVQVIPHITDEIKLFIRNGVSDAQVAIVEIGGTVGDIESL
PFLEAIRQMSVQLPHHDTCFIHLTLLPYISSAGELKTKPTQHSVKELREI
GIQPDVLLCRSDRPLPLEERRKIALFTNVREESVISAIDVDNIYKIPALL
HEQMLDEIVCHRLDILARPANLTVWKKLVHALEHPEHEVSIALVGKYVDL
TESYKSLSEALIHAGIHTRCKINIHYIDSENIEQHGTGCLTNMDAILVPG
GFGKRGVEGKIMAISYARNHRIPYLGICLGMQLAVIEFSRNRLQLENAHS
TEFDPDTPYPVLGLITEWRDRCGRVEKRSAQTDLGGTMRLGGQECLLKPH
TLAHKIYGADKVIERHRHRYEVNAEFIPQLEQAGMHISGLSAEGDLCEMI
ELPQSEHPWFVACQFHPEFTSTPRNGHPLFKSYIQAAISFAGQSDRTKLH
SRNVQESVTTDSSN
>NE1716 pyrH, Aspartokinase superfamily
MPVVYKRILLKLSGEALMGDGHYGIDRAVVEHIVVEVAGVLQLGVEVAIV
VGGGNIFRGMKSAGDGMDRVTADYMGMLATTMNALALHDAMRRNGVVSRV
QSALRIDQVVEPYVRGKALRYLDERKVVVFAAGTGNPFFTTDTAAALRGM
EMNANIVLKATKVDGIYTSDPLKNKDAQRFQSLTFDEAISKNLQVMDATA
LTLCRDQKLPINVFSIFKTGALKRVIMGEDEGTSVFV
>NE1666 pyrR, Phosphoribosyl transferase
MQLPDAEQLLTQLIEKIRPDIAGNTAIVGIHTGGAWLARRIHQALEIALP
VGVLDISFYRDDYSKIGLHPQVRPSQLPFDAENSHIILVDDVLYTGRTVR
AAVNELFDYGRPASIDLAVLVDRGGRELPIAARYTGEVLTLPENSMLELR
QSDDGKLSLDLRSLTTG
>NE1664 pyrX, Dihydroorotase:Dihydroorotase multifunctional complex type
MHIHIRNGRLIDPGSGMDRTDDLYLAEGKIVSIGSRPDGFHDDREIEAGG
MIVCPGLVDLSARLREPGLEYMATLESELEAAVAGGVTSLACPPDTDPPL
DEPGLVEMLKHRARNLDQARVYPVGALTQGLKGARLTEMVELHDAGCVAF
SQVDAPIANLHVLMRAMQYAATFGFKVWLRPQDIHLANHGVAHEGEIATR
LGLPAIPVCAETIALASILSLMKETGASVHLCRISSAEGTALVRAAKRQG
LPLTCDVSINHVHLTDMDIGFFDANCHLMPPLRSLADRDALCVGLMDGTI
DAICSDHAPVDEDAKSLPFAQAESGATGLELLLPLTLKWAVENRLSLPEA
LARITFHPARILGIETGQLAIGAVADLCIFDPDANWVVSHTELRSQGKNT
PFLGMELYGRTRFTLIGGRIVYG
>NE1808 radA, sms: DNA repair protein RadA
MTRLKTLYVCNSCGGQTLKWQGQCPHCREWNTLTETVQEKNVLRFSPDSM
QKQVLSLSEVETEDMPRFLTGMDEFDRVLGGGLVQGGVVLLGGDPGIGKS
TLLLQALSRMSVHHRVLYVSGEESMQQVALRARRLSLDVARVDLLTEIRL
EAIQSILAEHRPRIAIIDSIQTIYSETLQSAPGSVAQVRECAARLTRFAK
TSGTCIVLVGHVTKDGALAGPRVLEHMVDTVLYFEGDTHSTFRLIRAFKN
RFGAVNELGVFAMTEKGLREVGNPSALFLSHHSVRVAGTCVMVTQEGTRP
LLVEIQALVDEAHAPSPRRLCVGLEQNRLAMLLAVLHRHAGIPCFDQDIF
INAVGGVKITEPGADLAVMLAIVSSLKNRVLPEKTVIFGEIGLAGEVRPV
QRGQERIKEAAKLGFTCAVIPKANQSRQAVKGIEIIAVERVEEAVSALF
>NE1464 radC, DNA repair protein radC family
MAISDWPEAERPREKLIEKGAAALSDAELLAIFLRTGITGVSAVELARKL
LTHFGSLTKLCAASLHEFSELPGMGPAKFAQLQAVMEMAKRALAEELKNG
DIMDSPQSVRNYLCLSLKGKPYEVFVGIFLDARHRTIVTEELFNGTLTQA
SVYPREVVKRALYHNAAAMIFAHNHPSGIAEPSTADEILTQSLKQALALV
DVKVLDHFVIGSSEVVSFAERGLI
>NE0762 rbfA, Ribosome-binding factor A
MSRDFSRTVRVADQIQRELALLIQNEIMDPRVGMVTLTGVEVTRDYAYAK
VFYTTLGGDENIQLVEEGLKHAAGFLRSQLAGKIRLRVVPQLQFVYDESV
ERGMKLSRLIDEAVGKA
>NE0507 rdgC, putative recombination associated protein rdgC
MWFRNLLIYRLAGEVITSDELEAYLAKQTLQGCLGLEPQSRGWVPPGIAE
ADLVYSYGQQMLIALGTEKKLLPASVVNQLAKVRAQEMESHQGYAPGRKQ
MKEIKEAAYRELLSRAFAIRQRSHAWIDPVGGWFIVEGASASKADALIEA
FIKSTGIGLKRIRTTMAPTSAMTAWLSGDDPPAIFSVDSDSIFRSREDKK
VSVSYIRQSPDPQEITRHVRTGKEVIRLAMTWRDKISFILDENLQLKRLT
LLDIDREPAETAEEQFDSNFFLMTEELRQLLPDLVEILGGMTAD
>NE1932 recA, RecA bacterial DNA recombination protein:AAA ATPase superfamily
MDENKNKALSAALAQIEKQYGKGSIMRLGDSDVAKDIQVVSTGSLGLDIA
LGVGGLPRGRIIEIYGPESSGKTTLTLQAIAEMQKLGGTAAFIDAEHALD
PQYAQKIGVNVQELLISQPDNGEQALEITDMLVRSGSVDVVVVDSVAALT
PRAEIEGEMGEPQMGLQARLMSQALRKLTANIKRTNTMVIFINQIRMKIG
VIFGNPETTTGGNALKFYASVRLDIRRTGSIKRGEEMVGNETRVKIVKNK
VAPPFKQADFDILYGEGISRESEIIELGVLHKLIEKAGAWYSYNGEKIGQ
GKDNVRDYLKEHKSIAHEIEQKIRAAVGLAETDSRVVPPSSGE
>NE1850 recG, RecG-like helicases
MAAHFFDSLDEALRKKLEKLGLFSDFDLVLHLPLRYEDETRLSPISQAVP
GSTVQVEGVVAEQEVLVRPRRQLVCRVDDDSGTLYLRFFNFYASQVTAWS
PGTRLRVLGEVRAGFHGVEMVHPKCRVVRGSMVLANTLTPVYPGMAGLPQ
RTLARLIMQAFERLRAKRLLQETLPATILSACQFPAFEDSLSILHCPPAG
VSITSLQQRSHPAWFRIKFDELLAQQLSMRCHYHQRRSQQAPVLQQQTGL
QQALLEVLPFGLTDAQCKVVTEISKDLAQPYPMQRLLQGDVGSGKTIVAA
LAALQSIGNGYQVAVMAPTEILAEQHFRKLSDWLTPLGVGVGWLSGSQKK
SLRNQELERTATGEAMLVIGTHALFREAVQFKCLGLVIIDEQHRFGVGQR
LALRMKGGDEEVIPHQLMMSATPIPRTLSMSYFADLDVSVIDQLPPGRSP
VVTRLIDSSRREEIVARIREACLAGRQAYWVCPLIEESEALQLKTAVETY
ETLSQTFPDLRIALIHGRLDSDEKSVIMAEFSQGEVQLLVATTVIEVGVD
VPNASLMVIEHAERMGLSQLHQLRGRIGRGSATGVCVLMYQQPLSEVARK
RLQIIFEHRDGFEIARQDLLLRGPGEFLGTRQSGVPLLRFANLEEDIDLL
EMARNAAENMLRDHPLAAQCHMQRWLGRKEDYLRA
>NE0010 recJ, recJ: single-stranded-DNA-specific exonuclease
MANITIREFPAHAYEILSAHGFPSVLARIFAARGINHPEQLETTFARMAS
FEQLKNIQRIAVLLADAIAAKKRLLVIADYDTDGATACAVALRALRQFGA
MVEYLVPNRFEYGYGLTPEIVRLAADQVPPPDILITVDNGIASVEGVEEA
NRLGMQVFITDHHLPGDRLPDAAVIVNPNQPGCSFPDKHIAGVGVIFYVM
LALRAELRERSAFTATGKEPNLASLLDLVALGTVADVVRLEGTNRILVQQ
GLQRIRNGYCCAGIHALFKAAGRDFSRVTTYELGFILAPRLNAAGRLDDM
SLGIECLLTEDESHALRLASELDELNRQRREIESGMRDEAMDKLDDVIDL
LNQSDTPADNGKQSVYSLCLYDPAWHQGVIGLIASRVKDRLHRPVIIFAQ
GNEGEIKGSGRSIPGLHLRDALDLVAKRYPGLIVKFGGHAMAAGLTVYEQ
HFEQFRTAFEQVAQSLLTPADLIQVIETDGELAETDLTLELAQYLTNQVW
GQGFPEPSFNGCFRVENQRIVGEKHLKLKLRKTGAAQVYDGILFFHTERL
PTEIDAVYRVQINEYNGSTRMQLLLEHWFESGQAHYG
>NE1479 recN, ABC transporter:DNA repair protein RecN
MLQNLSIRNFIIVDHIDLHFKSGFTVLTGETGAGKSILIDALELVLGRRA
DTSQIRYGCKRAEITAQFSVNTIPALQEWLVENALEDETGICLLRKIMES
GGRSRNFINGHPATLQQLRTVGEWLVDIHGQHAHQLLMHGHKQCELLDAW
AGESNLAREVASAYRHWQDLCQQRLAWEQHSEQNLQEHETLTWQLQELAA
LNFSLEEWENLQIEHNRLTHTASLLETAQFSLESLSENETAVLAQLSTVL
TRLNSLIDIDNTLEPLCNQLQSAQIQLQEIVYELKRYQQHLDIDPRRLQE
TETRIAAIHGTARKYRIMPEILPDLLETTRQRLESLENAASSEALMKAEK
SARNNFENLAARLSQARQHAADQLSGLVTETMQTLAMAGGRFNVALIPIP
SGNLHGMEQIEFQVSAHRDLPLRPLNKVASGGELSRISLAIQVITSKAGT
VPTLIFDEVDTGIGGRIAAIVGKLLQQLGKTRQVMSITHLPQVAARGDHH
WRVSKTSETEDEQLPASHISELDAAERTEEIARMLGGENLTAATRQHAAE
MLGYDKQNQST
>NE2564 recQ, ATP-dependent DNA helicase RecQ
MISHAQTLLREIFGYSEFRGQQAEIITHVVNGDSCLVLMPTGGGKSLCYQ
IPALLRKGTAIVISPLIALMENQVAVLCRQGVRAVYLNSALTPEAAAAVE
RRMLAGEYDLVYVAPERLLTVRFRALLQRIPIALFAIDEAHCVSQWGHDF
RPEYGKLSILPEKFPQIPRIALTASADARTRADILRCLDLHQARSFISSF
DRPNLCYRITARSNSRIQLLNFIRSQHAGEAGIVYCQSRRKVEETAAWLN
SNHIPALAYHAGMETSIRTRHQKKFLQGHGIVMVATSAFGLGIDKSDIRF
VAHLDLPKSIESYYQETGRAGRDGLPASAWMVYGPGDIIRLRSQTESGTE
RLPAPIRQAAAARLDALLVLCETTVCRRKPLLDYFGEPTGSLPCGNCDAC
LETIPVQDVTIAAQKALSCVYRTGQCFGMEYLIDILSGKRTDRVRQWGHD
CISTFGIGHELSTEGWRIVFRHLLALDYLVAGEDRAGGERIALQLTSAAR
SVLRGETRIKLRLSHHHHSAPYQQISTGLSVPSSRCQAFSCEPQTKCGG
>NE1931 recX, RecX regulatory protein
MSLYARALECLARREYSRHELEKKLSCHEQLPDELKSVLDRLEQQKLLSD
ERAVEQILHARSRKYGSKRIRYELQMKGIADHLIEAALGEFKQTEFSSAH
ALWCKKFGVAPSTPEERGKQIRYLAGKGFSSEVISKVLSDAREAEN
>NE0468 rfaH, putative transcriptional activator
MHWYLVHTKPKQEKCAFQNLQQQGYRCYLPMLPVEKLHLGNLTVVDEPLF
PRYLFIQLEQGDSAKSWVPIRSTRGVNRLVCFGSEPARIDDILIDLLRMQ
EASFQHKPERLFKPGERVRLTEGAFAGIEGIYQMAEGERRVMVLIELLSK
PVAMRVSPTSLRKTN
>NE1023 rfbD, dTDP-4-dehydrorhamnose reductase
MPDRTTPIKILLFGKNGQVGWELQRSLAPLGELIAPDKQDLRYCGDLADL
AGITHTLQTIRPDIIVNAAAYTAVDQAESEPELAFRINAEAPELLAQQAE
QIGAWLIHYSTDYVFNGNGNCPWQETDLTSPINIYGLSKLRGEEQIRKSN
CKYMILRTSWVYAARGKNFIKTILRLAREKEQLTIIDDQIGAPTGAELLA
DITAQAIPQLLEYPDKSGVYHVAASEEVSWYSYARFLLDFAREHDIPIKV
HPDAVIPVHSEAFVTAARRPLNSRLNTEKFCNTFQLCLPHWQTGVTRVLE
EIYL
>NE2281 rfe, Glycosyl transferase, family 4
MNELTSTFFSISTDISQPVRPDLSEQITLFTISTILAGLIIRLAISLAHT
YGILDRPGQHKQHKHLTPFVGGTGIFAALLIALCFLIGYYPEQSVKWLGL
GISSAIIFIMGFADDILQLHYKTRLIVQTVAVLIMNLVSGVVLTDLGSIF
PGGILALGIFAIPFTLFATIGGINALNMIDGIDGLSGSVTLMSLILLGSA
ALIADNQPNLIIIIALTGGTMGFLFFNLRYRLQPRARVFLGDNGSMLLGF
VITWLLIDLSQGSNRAMTPVTALWLFSVPLMDTISIMLRRIWQHKSPFEP
DHNHLHHILLNAGYRVSDVIFAIVSIHLLFGLIGLTGLYLGVNELTMLTG
FLLFFSGYFYLSLHPWYFITALRQFHTLWGLTPTQSHGFFLGSYTPKEAE
NLVRMLSQELRPSMDSLVHVIKNKAPSSSDEDQYAVVVNIRLLDADVRTI
KDKIENFVTSAQKRINERCGIQLRPFVEHNPRNDRRIQNQSNPSGNKRVA
DRRKPNQKLLVFEAMFDQTSISKRNYHQESVEPPINHSNS
>NE0486 rffE,wecB,nfrC, UDP-N-acetylglucosamine 2-epimerase
MKILIVVGTRPEAIKMAPVILALKKEPWANVRVLATAQHRNMLDQVNEFF
GIDPDIDLNIMRPNQALTTLTARLLPELDDVLQAEKPDAVLVQGDTTTVM
TVALACFYHHIPVGHVEAGLRTWDMQNPFPEEANRVITGKFARWHFAPTE
GSRQNLLKEGVADSKIIVTGNTVIDALLMSASKDLQLGIELDSNKRLVLV
TSHRRENFGEPFRNICRALQTLAEKNPDVQFLYPVHPNPNVKDVAHEFLA
GLHNFTLCEPLDYASFIAAMKRAYIILTDSGGIQEEAPALGKPVLVLRDE
TERPEAVEQGVVKLVGPNYDAIVQETQCLLDDEFAYRAMARGISPYGDGK
AAERIVQVLRKYFT
>NE2040 rhlE, rhlE; ATP-dependent RNA helicase RhlE
MSNDVTFAQLGLSSEILHAVNDEGYVNPTPIQAQVIPSILAGKDVMASAQ
TGTGKTAGFTLPLLYRLQAYANTSVSPARHPVRALIMAPTRELAMQIDES
VRKYGKYLALRTAVVFGGINIEPQIAALQAGVEILVATPGRLLDLVEQKA
VNFSKTEILVLDEADRMLDMGFLPDIKRVMALLSPQRQSLMFSATFSGEI
RKLADSLLKQPVRIEAAVQNTVNESISHVIHWVKPDSKFALLLHLIRQQN
LKQALIFVKTKHGASHLAQMLSRHEISAVAIHGDRNQQQRTQALAEFKHG
DVQILVATDVAARGIDIEKLSHVINYELPGNPEDYVHRIGRTGRAGSKGK
AISLVSEHEKELLANIEKLLNAKLETEQIAGFDAEQFARSLPDRKNRMSA
GNSRYGNKPMENGSEKSRSEKHRKLPSSQKYSGSRRGGTQKYSDPIFTQP
YVPQANSTQSTTPKQPEIQSLFLTYRQEKKTIPALFTALSKSKAGQEN
>NE1035 rho, rho; Transcription termination factor
MRLSDLKSIHVSELVKMAVANDIEGANRMRKQDLVFALLKNQARKSESIF
GEGTLEILQDGFGFLRSPDTSYLAGPDDIYISPSQIRRFNLHTGDSVDGE
IRPPKEGERYFALVKIDKVNNEPPENSKRKILFENLTPLFPTERLTLERD
IKSEENITGRIIDLIAPIGKGQRGLLVASPKSGKTVMLQHIAHSIAANHP
DVVLMVLLIDERPEEVTEMVRSVKGEVISSTFDESAARHVQVADMVIEKA
KRLVEHKKDVVILLDSITRLARAYNTIVPASGKVLTGGVDANALQRPKRF
FGAARNIEEGGSLTIIATALVDTGSRMDDVIYEEFKGTGNMEIHLDRRMA
EKRIYPAINVNRSGTRREELLIPSEILQKIWILRKLLYPMDEMDAMSFLL
DKIKATKNNADFFDSMRRV
>NE2307 rhuM, putative cytoplasmic protein
MSKKKQDVSIVRSSAAEYLTFIAAMGDQSQSVEMRYEDENIWLTQKMMAS
LYDVTVPAINQHLKRIFDDGELLPEAVLKDYLITAADGKQYRTKHYNLQA
IISVGFKINNVRAVQFRKWAGQIVKDYTIQGWTMDVERLKKGHLFTDEYF
DRQLEYIREIRLSERKFYQKVTDLYATAFDYDKDALTTREFFALVQNKLH
WAVHRHTAAELIVSRADAGRTNMGLTHWAAAPQGKIIKSDVSIAKNYLNV
QEMEYLERIVSIYLDFAELQAMRKIPMSMQDWARRLDGFLEFNGNEILMG
PGKVSQEQAKLHAESEFEQYRIVQDRLFQSDFDRLLLQLESKNKEEGQ
>NE2556 ribAB, ribAB; riboflavin biosynthesis bifunctional protein: GTP cyclohydrolase II and 3,4-dihydroxy-2-butanone-4-phosphate synthase
MTISSTEEILADFRNGKMVILIDEEDRENEGDLVLAADFVTPEAINFMAR
YGRGLICLTLTEERCHQLKLPMMVADNHSPLGTNFTVSIEAATGVTTGIS
AADRACTVQAAVKADARPEDLVQPGHIFPLKAQKGGVLARAGHTEAGCDL
GRLAGLTEAAVICEILKENGEMARLPDLIEFAGKHTLKIGTIADLIHHRN
LTESLVSRFAERPLHTAHGEFRLIAYHDKIAGVTHIALVKGTWNTGEEVL
VRVHEPLSVIDLLEADDHSHSWSIHDAMAVIAQAGKGVVVLLHRKNDASV
LTDRILRARPRQPDKPVPDLRQHGIGAQILKDLGVSKMRLLATPRRMPSM
TGFSLEVAGYLEPGNRIRK
>NE0793 ribD, Riboflavin biosynthesis bifunctional RibD
MFTALDYEYMSHALRLAERGLFTTSPNPRVGCVIVNGGKVVGTGWHERAG
EPHAEIHALRKAGDLAKGATVYVTLEPCSHHGRTPPCVEALIQAGVCKVV
MAMNDPNPHVNGQGKEWLQKAGIAIQAGLLADQAERLNIGFVTRMRHDRP
WVRTKIAASFDGRTALKNGKSQWITSEPARRDGHKWRARSCAILTGISSV
RKDDPQLTVRYITVSRQPMRIVVDSNLETPLQAKLLQNADTTWIFTAQTS
KEKIHRLEDTGAHIVVLPDLAGKVDLKAMMVKLAELGINELLVEAGPVLN
GALVTAGLVDEIIFYFAPSLLGNSAQAMLALPEIGDLSEKYDLQISDIRK
IGMDVRLIARFMK
>NE1150 ribF, Cytidylyltransferase:Riboflavin kinase / FAD synthetase
MRITRRSVIYRGEPVALTIGNFDGVHLGHQAMIARLKRVAGRLGIASCVM
TFEPHPRERLMPEQAPARLTDLREKLELLAGYGVSRTQICRFDHEFAGIS
AESFITRILLQEMNVRWLLIGEDFRFGARRSGDLTLLRKFFSEPDELEVM
SPVTVDELRVSSTAVRDALASGNLDLAADFLGRPYSISGRVVDGIKLGKK
LGFPTANIWLKHHHLPLSGIFVVEANWCNDSGHTRRIRGVASVGVRPTVF
EHAQPLLEVHLFDFDEQIYGQHLRVDFLQKLRDEEKFPDMETLVKQIEQD
VDRAKAYFSRTDAAMHQTICLDC
>NE2557 ribH, 6,7-dimethyl-8-ribityllumazine synthase
MSYYDNILEIDSNLDGNGLCIGIVMSRFNIDVSEGLLGACTAELTRLGVQ
EADIVLTTVPGALEIPVALQKLAESDRFDALIALGAVIRGETYHFEVVAN
ESARGLAAVGREYNIPIANGILTTDDEDQATARMAEKGGETARVAIEMAN
LVLTLDEMDQSAPT
>NE1389 rkpA, putative type I polyketide synthase WcbR
MGRSINKHATDDSRSESLNTGTQESGTDKSSSSPVAIVGFAFRFPGDVSD
ETDFWNALKQKRDLVTQIPADRWAVDELQHDKRSEHGRSITFSAGVLSHI
EEFDASFFGISPREAAVLDPQQRLLLELTWETMENAGIPPSSMAGSDCAV
YVGISGFDYGMCEVDDLAVITSHSMTGNTLSIAANRISYAFDLHGPSLAV
DTACSSSLVALHHACNCLRNGEASTALVGGVNLLLHPYPFVGFTKASMLS
AKGRCKPFDASGDGYVRSEGAAVLLLKPLEKALADGDDVHAVILSSGVNA
DGARKTGITIPSSDGQAELMRAVLSRSGLSSKEVDFIEAHGTGTVVGDPV
ETRAIGMVYGQERTRPLPIGSVKANLGHLEPASGMAGLIKTILALKHRAL
PPAIHLHTPNPHIDFQALNLELIRKYKPFSRKRKKPLVAGVNSFGFGGAN
AHVLLKEFVPQDSKADTSSTMESLPPLFLSARTEAALRATAEHYVELLKG
KSPQEFYDIAHAAAYRREQMEKRLALHTGKVDSTVDLLDRYAKGDMVSRV
FVEDELPQSGGVAFVYSGNGTQWAGMGRALLAESPRFKEILSEIDEIMLP
QAGFSLITELEAEDTNSRLDDTRVAQPLLFAIQVGVTTLLREQGIEPSAV
TGHSVGEIAAAWASGALDLQQAIHVICVRSQAQGLTRGKGRMAAVGLSVE
AIQEIITTLGVDAEVEIAGINSPGNVTLSGPLEDLQQIQSVAESRGIFFR
LLDLDYAFHSRQMDAIEEPLIRQLGELAPAYTERATFVSTVTGNEIDGRL
LDAGYWWRNVRQPVQFAAAINRIAGLGCRILIEIGPHAILQRYMGECLAA
AEVQGRVIPTLRRDDNGLQRITETVLRTQLLAERPSLQTYFSVPGRRVRL
PNYPWQRERHWIPWTNEGLHSLDRRRVHPLLGWRIPGVDTLWENTLDPVT
LPWLADHKVGGAIVYPGAAYAEMSLAAAREWLGGKHFTVEQLDIISPMVF
DGEHARTTHLILNPRDGGFQIKSRQRLSDDEWTLHASGRILEAVNRIPVA
RIDAPAVSAEHVGHEIHYQRASQLGLDYGPGFQGVSNINIAAERLEATLS
CPELQHFDDYLLHPAILDLCFQSLIDFFGEAIDAGQGIALLPVKMGKLNF
CGNGKVAFLRARLCRHGTRSALADFELLDNQNNLIASASACRFRAAHLKR
HVQQNISNWRIVPWLKPHPLDGMTTAMPPVAGLIEQVRAQFDRVKHDRHT
LFKEILPLTEALTLSFAWQAVRQIQQYRPLDWQQIFDGSHATAHARWLAN
LLITEGLLNKDDAGQWFVTIDADLPSPETVWQTLLNDFPVYLPQLMLMGR
VGQQLPALLCGETDALKLLNELKHSPVAEMRYHDDPVYLGMRLALEEIFR
NLAGSWPVSRRLRILEITAGSSELPKILTGSLPEDRFDYVLALPDEAMQA
RQQIEYQEDASIVVARYEPATLKLSADRRLPDAFDVIVLHHALHRTNSPH
TALAQIRHLLAAGGILLLAEQHSDWSTNFLEGIDPGWWRQRNKTPDLPVS
PLLSPATWQQLLKDAGFTDTETFFEPAAEGLAEGAYLLLARCPARDSAII
PEPVTASWLLVADESSATLARHLSNRLQSSGQQVTITSRIDDDLLQNTDH
IVHLPGWNDTPDQAADTVTRLMEDVKLLAARIGKTPQFWLVTRGGTLISG
CPAESESSPAQSALWGFGRVVMNEYPQLACTLIDLVCNPATVNLADRLVN
ELLVPDGSNEIVLSAAARYCLVMQEATNQQIRTTEQSEKKDERFHLDFHV
PGQLRNLVWLPDVKRQLGDDEVEVCTRATGLNFRDVMYLMGLLPDEAVEN
GFAGASLGLEFSGIISRVGARVKGLLPGDAVMGFGSSCFASHVITRADAV
APMPENWSFEAAATVPTVFLTVYYALKQLAALQPGERVLIHGAAGGVGIA
AIQLARYLGAEIYATAGSDEKRDFVKLLGADHVFDSRSLAFADDILDATA
GEGVDVVLNSLAGEAMRRSIDVLKPFGRFLELGKRDFFENTPVGLRPFKN
NISYFGIDADQLLTARPRLAVQLFHEVMALFREQALAPLPYRVFSANRIT
DAFRVMQQARHIGKIVVSLADARPDIEQPLQPVPAIRFERNSTWLVTGGL
SGFGLESARWLAERGVDHLVLAGRRGMDTPGAKEIIETFAVQGVKVIAQA
CDITHAAAVDTLIERIGKTLPPLKGVLHAAAVFDDQLISSLDQKRISNVM
DAKLLGAWHLHQATLGTPLDYFILYSSVTTAIGNPGQANYVAANAGLEGL
AAMRRHMGLPATCIAWGPVSDTGYLARNQAVRDSLEQRLGKPPLPAAEAL
TQLDVALNDETGFVIPADFDWNTLSHLLPSASGNRFAILNRNRRSTSQTT
ESIDIRTFIIGKTRAEVTEIVRGFVVEEVAQILSISPDRIESGRSLHDHG
MDSLMAVELALGLERRFGIQLPVMMLNDAPTIHTVTARIVEKLTNKGDMP
EGEQSVSQVTEFIRQHGEELMPEEIDILSEDVRHFAQQGTSLIA
>NE1388 rkpG, Aminotransferases class-I
MNKLSLTRKARTHLIDHILGRKMEVAGGAAPASAECPTRYSSIPDAFTRF
DRFPGYEKMLVPKAASERLGLVNPFFRSHDGVAGATTLIGGREFINFSNY
SYLGLAGHPAVSRAAKEAIDRYGTSASASRLVAGERPAQRKLEEALANLY
EVDDCIVFVSGHATNVSTIGCLFGPKDLVIHDSLIHNSVLQGIQLSGAAR
RSFPHNDMAALEQILAEIRAQFERVLIVTEGLYSMDGDIPDLPELIRIKQ
HHKAFLMVDEAHSLGVLGETGKGVREHFGIQGKAVDIWMGTLSKTLAGCG
GYIAGERALVEHLKYAAPGFVYSVGMAPSLAAASLEALRIMQREPERVAR
LRERGQQFLELMQSLGVNTGLAQGYAVIPAIIGSSLKAARLSNQFFDAGI
NVQPIIHPAVEEKAARLRFFLSAMHTDQHVRYTCEVMKKLTGLH
>NE1332 rkpI, possible capsular polysaccharide biosynthesis/export transmembrane
MVAEESFISVLTASLTGLVLSVIIERLMVPRPVLARPWAAWALHGGLWFL
SYSLITLVSGRPWFSVAIVSAFMLMLVLVNNAKVKSLHEPFVFQDYEYFT
DAIRHPRLYIPFLGWWKFLGAAAGFILAVSIGLWGESVPEQQFILSGQLG
GILVLSLLGILFLLAGNHESLPVSFNPRQDVVALGLLASFWRYGREERTP
LKIPSPFDFSVSERQMNDLPHLVAIQSESFFDPRPLFPGIRPDVLAEFDR
LKKEALFHGRLKVPAWGANTVRTEFAFLSGIGEDGLGVHRFNPYRAIAAG
WNIPSLASFLKSLGYRTVCIHPYPASFYRRDRVYPCLGFDEFLDIRTFDD
TMRSGPYIGDAAVADKVGAILRDAAGPLFIFVITMENHGPLHLEQVAHSD
IESLYTEPPPAGCDDLTIYLRHLRNADQMVARLRQTLAECRQPASLCWFG
DHVPIMSAVYEIFGKPGGEVEYVCWSNQGKAAFCDANLSANTLSMSWLRE
LGLISNLKSLFQ
>NE1385 rkpS, ATPase component ABC-type polysaccharide/polyol phosphate transport system
MIVIEDVYKRYKTDHGPGKWILQGVNLTIPRNVNVGLIGSNGAGKSTLLR
IIGGIDHPNKGKVERRCRLSWPMGQGGLEPTLTGRQNAKFVCRLHGHQDD
LPERLIFIQDFSGLKDAFDEPVNTYSSGMRSRLQFAMSLAFDFDIYISDE
VTAAGDAAFRDKAAAAFKGMTNRAGLIMVAHSESTLRQFCEAGILLYQGK
AQWFDKIEEAFKAYKDTVQK
>NE1386 rkpT1, ABC 2 transport system integral membrane protein
MAEARNPLLVSFAVWKAIFLREALDRLFDMRAAWFWLLVEPVLHIGFLVF
IFTVIRMRTVGNADVAVWIIVGMLAFFLFRRTAVQVTYAVDSNRPLFTYR
QVKPFDPVLARAVLEAFLMSIISIIIIGTAILLGHEAWPYDALLVLQANF
GLWLFGIGYGLVASVLMELVPEMEHVLKILMLPLYLISGVILPLAAIPQP
YLDLLLLNPIAHGLELARAGFFPYYHTVPGLSMEYLYAWGISGIFLGLIL
YRRFALQLVMR
>NE1138 rlpB, putative lipoprotein B precursor transmembrane
MTTLRTLMAVSLVLNLAACGFKLRGQVSELPFERVYITAPAGLTIGSDLE
RVISTHTRAKVVNKAEKSEAIIQIVHAIREKRILSLSESGRVREFELVYR
VAARLLDAHNAELASLQEIRLTRILPFLDAQELAKAAEEEMLYKDMQKDA
VQQILRQVSAVTSAG
>NE1456 rluC, rluC; ribosomal large subunit pseudouridine synthase C
MKSIGKSRVPGKEKIMPGIISEHTVVRQIDETAESQRIDNFLIRSLKGIP
RRHIYQLLRSGQVRVNSKRIDASYRLQLGDKVRIPPVIRSVKPVSSQGVY
RTDRFNILYQDDHLLVIDKPAGIAVHGGSGISHGVIEQLRGQHPDWKFLE
LVHRLDRETSGILLLARKRQSLLELHRQIREGTVEKHYLTLVRGKWKNSV
QNVRLPLNKYLTAAGERRVAVAAGRNDQDKAQQAHTLFTLQKAWEDFSLL
DAELKTGRTHQIRVHLAHLGFPIAGDDKYGDFGLNKQLQKDDGKGQVLRR
MFLHASAFTCTHPVTGDLLRLEAPLPDELRVFLRNLDLNIHLP
>NE0505 rluD, rluD; ribosomal large subunit pseudouridine synthase D
MMNRLINEQDGRNYSAKPDSSAAGSQENENTIELTVPDNLAGLRLDQALA
QLLPQWSRSRLQGWIEQKCVSVDSAAATCKQKVWGGESIRVIAGQTENDQ
SHQAEAIPLKILFEDDHLIIIDKPAGLVVHPGNGNWHGTLLNALLNHAPQ
LSQVPRAGIVHRLDKDTTGLLVVAKTIEAQFDLARQLQQRTVKRHYLALV
LGKLEKDGVVDAPIGRHPIHRTRMAVVQNGKPARTHYRVLEKFTASTLLH
CSLETGRTHQIRVHLLSIGHPLAGDPVYGRTSPDPSTADAVVRLPRQALH
AWQLELTHPHSGQILLWESPLPDDMAALLQAIRNLPHSSVSRPL
>NE0677 rmlA, ADP-glucose pyrophosphorylase
MTRRGLILAGGSGTRLHPATLALSKQLLPVFDKPMIYYPLSTLMLAGIRD
ILIISTPQDTPRFQQLLGDGEQWGLNLQYAVQPSPDGLAQAFLIGEDFIG
NHPSALVLGDNIFYGHDLQRLLTHAMMRTEGASVFAYHVHDPERYGVVEF
NAQGKVLSLEEKPIQPRSSYAVTGLYFYDTQVVDHAKALKPSARGELEIT
DLNRLYLEQGNLSVEIMGRGYAWLDTGTHATLLDASQFIATLENRQGLKV
ACPEEIAYRQGWIDAARLEMLAQLLAKNGYGQYLLKILRENVF
>NE0469 rmlB, NAD dependent epimerase/dehydratase family
MILVTGGAGFIGSNFVLDWLAQSEETVINLDALTYAGNRANLASLEGDSR
HIFVKGSITDFDLVARLLHEHHPRAVINFAAESHVDRSIHGPENFIHTNI
VGTFRLLECVRAFWNDLREPDQQDFRFLHVSTDEVYGTLSKEASPFTEVS
RYEPNSPYSASKAASDHLVRAWHHTYGLPVLTTNCSNNYGPYHFPEKLIP
LMIVNALAGKPLPVYGDGMQIRDWLYVKDHCGAIRRVLEVGKPGEIYNIG
GWNEKPNIEIVNTVCKLLDELRPKADGTSYASQITYVADRPGHDRRYAID
AHKIERELGWRPAETFETGIRKTVQWYLDNPEWVVRVQSGAYREWITRQY
GE
>NE0678 rmlC, dTDP-4-dehydrorhamnose 3,5-epimerase
MKVTPFAIPEVVLIEPKVFGDERGFFFESFNQARFEETVGREINFVQDNH
SRSVKNVLRGLHYQIQQPQGKLVRVVQGEVFDVAVDLRKSSPTFGRWVGQ
VLSAENKHQLWIPEGFAHGFVVLSDTAEFLYKTTDYYAPAHERCLLWNDP
VLNIQWPLGIAPVLSAKDTQGKPFSEAEVFA
>NE2324 rnc, rnc; ribonuclease III (RNAse III) protein
MTSSRPIAHNTKNKQTKTLKGRDYAVFLQKLGYTFKQPDLLREALTHRSL
GFPNNERFEFLGDSVLNCAVSTLLFKRFPSLPEGDLTRLRANFVNQQALH
RLASALGIGELILLGEGERKSGGHHRPSILANAVEAIIGAIYLESGFAVV
EQVIVALYEPLIRQLDPDTSGKDSKTLLQEYLQSRKIALPEYSVLLAQGD
PHAQVFHVECVIPGFGIRTRGEGTSRRRAEQEAARQAYELAIVRH
>NE0140 rnhA, probable ribonuclease hi protein
MQLEEGVKLVEIFTDGACKGNPGIGGWGVCLKFDGEVREFFGGEPVTTNN
RMELLAAIRALQALESLPDTGQSLRVQLHTDSQYVQKGISEWVHSWKKRG
WLTADKKPVKNEALWKELDQLSRRYQVEWFWVRGHNGHDGNERADMLANR
GVVSVLSEKAD
>NE1707 rnhB, Ribonuclease HII and HIII
MAERRIPLKHEYAQDGKVIYGVDEAGRGPLAGPVYAACVVLDPADVIEGL
ADSKQLSEKKRISLADQIKQRARAWAIASASVEEIDRLNILQASLLAMQR
AVVSLRPISNALVLVDGNHAPRLDCEVQTVIRGDSLVAEISAASILAKTA
RDIEMLRLHEAYPVYGFDRHKGYPTKAHLEAIRLHGITDIHRRSFAPCVG
QSVSGARTTSFINQKEA
>NE2130 rnk, Prokaryotic transcription elongation factor GreA/GreB
MSIKPKIMISSLDAERLEILLETLSQNAFPGRDDLEAELARAEVVDPEEI
PPTVVTMNSTVRFRVESSAEEFCLTLVYPKDVDTSGEKISILAPVGSALL
GLAQGDEIEWPKPGGGVLRVRIVEVTYQPERSGEYYR
>NE0389 rnpA, Bacterial ribonuclease P protein
MTTRQICTLPRQCKLRKADEFRAVLRNRIVFESLSLRLYVKPIDVDYARI
GLIVAKRVERKAVRRNRIKRLIREAFRRHRQMLMGLDCVMQLRHPVELLD
STRIYQEAVMLFNKAARQL
>NE0350 rnr, rnr; exoribonuclease RNAse R (vacB protein)
MSIKRKRNKKNQLRELDPFLKREQTRYGQALPSREFILEVLDKQGVPISE
SKLLKLLDITSEESEFFRRRLAAMIREGQIICNRKGDICVTEKLELVKGV
VQGHSDGFGFLIPDDGSTDLFLTPKEMHKVLHGDRVIARVTNIVDRRGRR
ECRIIRVLESVNTQLVGRLFEEHGIFFVVAEDKRINQDILIPKENIMDAH
AGQIVVARIIQQPREHAQPIGHIIEILGDYAAPGMEIEIALRKFDLPHEF
PPEVISVKFPQKVLKKDLTKREDIRHLPLVTIDSETARDFDDAVHCCQEG
KDYRLFVAIADVSHYVRDQDALDKEAFNRGNSVYFPRRVIPMLPEVLSNG
LCSLNPQVDRLCMVCEMLWTARGELVEYRFYPAVMLSHARLTYNIVAELL
DQPKGAVAKEHRQLLPHIQLLYRLFKVLHKSRVKRGAIDFETIETEMVFN
DQGKIEQIYPVKRNDAHRLIEECMLAANVCAADFLQKHEQPVLYRIHEGP
TDEKLVALRDFLKEFGLQLRGGEKPQARDYARILDRIKERADAQLLQTVM
LRSLSQAMYNPDNIGHFGLAYEMYTHFTSPIRRYPDLLVHRAIKAVLADK
QYEPGDWHEIGKHCSQTERRADEATREVQSWLKCFYMQDKIGDCFGGVVT
GVTAFGLFVTLDEIYVEGLVYISELPSDYFHYDAVKHVLRGERSGISFRL
GDRLQVKLVRVDLATRKIDFILADNSVRKKSSR
>NE0276 rph, 3' exoribonuclease family
MPRCNNRAPAQMRPVRIIRHYVRHAEGSVLIEYGETRVICTASVIEKVPP
FLKGAGQGWLTAEYGMLPRSTGERMQREAAKGKQSGRTMEIQRLIGRALR
SILDLEKLGERTIQMDCDVIQADGGTRTASITGAFVALYDAIDYLRAERM
ISQNPIRDHVAAVSVGILKGQPLLDLDYLEDSGCDTDLNVVMTGSLGLVE
VQGTAEKVVFSRQELDVMLNMAQQGLQELFDVQRKALETVA
>NE2049 rplA, Ribosomal protein L1
MGYDMAKTSKRYREIVQKIDRSQLYSLVDALALVKETAVAKFDESVDIAI
NLGIDVRKSDQVVRGSVVLPSGTGKSVRVAVFAQGDKAKEALDAGADIVG
FEDLAERVKAGEINFDLAIASPDAMRVVGQLGQILGPRGLMPNPKVGTVT
VDVINAIRNAKAGQVQFRADKAGIVHCTVGRASFDVEALRANIMALVDAL
NKSKPTTSKGVYLRKMAISSTMGVGVRVDHTAIV
>NE0401 rplC, Ribosomal protein L3
MKLGLIGKKIGMTRVFTESGNSIPVTVLDVSGNRVVQVKTEEKDGYSAVQ
LTQGYRRKNRITKALSGHFSQAGVEAGTVIKEFRVDHDIGSDIKIGSEIS
VELFEVGSKVDVCGISIGKGYAGTIKRHNFSSSRASHGNSRSHNVPGSIG
MAQDPGRVFPGKKMTGHLGNAQCTVQNIEVVRVDAGRGLLFLKGSVPGSK
GNGVFIRPGVKQPSK
>NE0402 rplD, Ribosomal protein L4/L1e
MVKIPCIYENGQIEDIEASESVFGRVYNEALVHQVVKSYLANARAGTRAQ
KGRSDVTGSTRKQWRQKGTGRARTGAATNPLWRGGGKIFPNKPTENFKQK
LNRKMYRAGMCTIFSELLRNNKLVAIDEFQIEMPKTKVCLQKLKNYQLEN
VMIITSEIDSNLYLASRNLPNLKVVEVDLIDPVSLLAYDNVVITRDTVNK
IENVLQ
>NE0416 rplF, Ribosomal protein L6
MSSVFVTTKPIEIPKSVEVQLTEFGVFIKGPLGSLFQAIDAHNAKIVVSS
DQLTIEALNATKQTKSIVGTLKALISNMIKGVTAGFEKKLLIIGVGYKAQ
AAGNILNLNLGFSHPVAYTVPEGIKVETPSTTEILIKGIDKQKVGQVAAE
VRSYRRPEPYKGKGIRYFDEKVTIKETKKK
>NE0195 rplI, Ribosomal protein L9
MQVILLEKIAKLGSLGSIVNVKPGYARNYLIPQGKARRVTEKVIAEFEAQ
RAELEKKQSEILAAASAQAARLDGLLVQISQKAGVDGKLFGSVTSANITE
ELRKQDFPVEKSMIRMPEGQIKQIGDYTVTVVLHSEVSAHITVSVLGETT
I
>NE2048 rplJ, Ribosomal protein L10
MSHNLEGKKAIVAEVSSQVAKAQAIVIAEYRGLGVDQFTRLRVKARESGI
YFRVIKNTFARRAVADTPFSGLAESMVGPLAYGIGSDPVATAKILHEFAK
DNDRFVIKAGAMAGIVMSDKDVAALAVLPSREELLSKLLGTMQAPIAKFV
RTLNEVPSKFVRGLAAVRDKK
>NE2050 rplK, Ribosomal protein L11
MAKKIVGYIKLQIPAGKANPSPPVGPALGQRQLNIMEFCKAFNAATQKME
PGLPVPVVITAYADKSFTFILKTTPASVLIKKLAGLSKGSAQPHVDKVGK
LTRSQAEEIAKIKMADLTAADMDAAVRTIAGSARSMGVEVDGV
>NE2047 rplL, Ribosomal protein L7/L12 C-terminal domain
MAIAKAEILESIANMTVLELSELIKEMEEKFGVSAAAATVAVAAAPAAAA
AVEEQTEFSVILTAAGDNKVNVIKVVRAVTGLGLKEAKDLVDGAPKTVKE
GISKEDAESLKKQLVEAGAGCEIK
>NE1484 rplM, Ribosomal protein L13
MKTFSAKPHEVRHEWFVVDATDKVLGRLAAAIAHRLRGKHKPIYTPHVDT
GDYIVVINADKLRVTGNKAEDKKYYRHSGYPGGIYETTFKKMHERFPTRP
LEKAVKGMLPKGPLGYAMIKKLKIYAGDTHPHAAQQPQPLEINA
>NE0408 rplP, Ribosomal protein L16
MLQPARTKFRKQHKGRNTGIATRGAKVSFGEFGLKAIGRGRLTSRQIEAA
RRAMTRHIKRGGRIWIRVFPDKPVSQKPAEVRMGKGKGNPEYYVAEIQPG
KMLYEMDGVDESLAREAFRLAAAKLPMQTTFVIRHLGS
>NE0417 rplR, Ribosomal protein L18P/L5E:Ribosomal protein L18
MLLNTKQMRLRRAKATRIKIGNSRQFRLSVHKSNNHIYAQILDPSSNRVI
VSASTVEAEVKKQYPSGGTIEAAKYVGHLVAKKSIESQIYEVAFDRSGFK
YHGRIKALADAARSAGMKF
>NE1674 rplS, Ribosomal protein L19
MNLIEQLEREEIERLGKTIPDFSPGDTLVVNVNVVEGDRKRVQAFEGVVI
AKRNRGLNSSFIVRKISSGEAVERTFQTYSPLIASMEVKRRGAVRRAKLY
YLRDRSGKAARIREKLPARSVQQENVAEALQP
>NE0955 rplT, Ribosomal protein L20
MPRVKRGVTAKARHKKILKLAKGYRGRRKNVYRIAKQAVMKAGQYAYRDR
RQRKRQFRTLWIARINAAARELGMTYSTFMNGIRKAGISLDRKILADLAV
FDKVAFEKITNQVKAGLAN
>NE1293 rplU, Ribosomal protein L21
MYAVIKTGGKQYRVEVGNKLKVETLPAEVGSDIQLDQVLMIADGEAISAG
APLLDQAKVSATVVSHGRHDKIRIFKMRRRKHYRKQQGHRQNYTEIQITG
ISA
>NE0406 rplV, Ribosomal protein L22p / L17e
METSAVLRSVRLSAQKGRLVADQIRGLQVERAIRLLTFSPKKGASIILKL
LESAVANAEHNEGADIDELKISQIFIGQGATLKRVSPRAKGRGNRISKPT
CNIFLTVSNK
>NE0403 rplW, Ribosomal L23 protein
MRNITISSERALEVIKAPQISEKSTFIAEKTKQIIFYVSRDANKTEIKSA
IEQIWRSQNIQVKSVQVVNVKGKKKRFGRYLGQKSDWKKAFVSLKGDREI
DFTDVKLFEDK
>NE0412 rplX, Ribosomal protein L24/bacterial NUSG:Ribosomal protein L24
MKKIRKGDSVIVIAGKDKGKQSTVIRFQSTERVIVREVNKVKSHIKPNPN
RNIAGGIVETEKPLHISNIAIFNPEKNKADRVGFRFNESGNKVRYFKSDG
TLIDS
>NE1825 rplY, Ribosomal protein L25
MQIEISANSRKLHGTGANRRLRSQGRLPGVIYGGNGDAQSIELDHKDLYY
KLKMEAFHASILSISIDGKKEQVLLRDVQMHPFKQQVLHIDFQRVRQDQK
IHVKVPLHFINADIAPGVKLSGGMISHVATEIEISCLPKDLPEFITVDLS
GMTAGSTLHLSDLILSENVEIPALLKGDNLPVATLIAKRGEAGESSEE
>NE1292 rpmA, Ribosomal protein L27
MAHKKAGGSSRNGRDSHSKRLGVKRYGGEIIRAGGIIVRQRGTQFHPGDN
VGIGRDHTLFAKVDGKIVFAVKGRMNRRTVAVIPS
>NE1465 rpmB, Ribosomal protein L28
MARVCQVTGKRPMSGHNVSHANNKTKRRFLPNLQSRRFWLESENRWIRLR
LTNAALRTVDKNGIDSVVADMRARGERV
>NE0409 rpmC, Ribosomal protein L29
MKVQELREKNLSELGKELLSLRRAQFGLRLQHRTQQLANVSQINKVRKDI
ARLKTIIREKTGQL
>NE0419 rpmD, Ribosomal protein L30
MEKRKTIKVTLVKSLIGTRHSHRLVIKGMGLRRLNHTVSLCDHPSIRGMI
NKTAYLLKVEE
>NE1036 rpmE,rpL31, Ribosomal protein L31
MKEGIHPEYHEITVTCSCGNEFKTRSVLSKPLHIEVCSSCHPFYTGKQKI
VDTAGRVEKFNQKYGRHLQKQAQT
>NE1644 rpmF, probable 50S ribosomal subunit protein L32
MAVQQNKKSPSKRGMHRSHDALTNPPLAIEPTTGEIHLRHHISPNGYYRG
KKVIKTKNDD
>NE1466 rpmG, Ribosomal protein L33
MREKIKLESSAGTGHFYTTTKNKRANPEKLEIKKFDPVARKHVTYKETKL
K
>NE0390 rpmH, Ribosomal protein L34
MKRTYQPSVISRKRTHGFRVRMKTRGGRAVIRARRAKGRAKLSV
>NE0956 rpmI, Ribosomal protein L35
MPKMKTKKSAAKRFKVRAGGSIKRSQAFKRHILTKKTTKNKRQLRGVAAV
HASDMVSVRVMLPYA
>NE0426 rpoA, Bacterial RNA polymerase, alpha chain
MSSNSFLTPRIVEVHNISPLHAKVVMEPFEHGFGYTLGNALRRVLLSSIP
GCAPTKVSISGVVHEYSTIDGLQEDVVDLILNLKGVVLKLHNNKTQSVLT
LKKSSEGIVTAGDIEATHDVEIVNPDHVIAHITSGGKIEAQITVEKGRGY
WPSANRSKEDKSSSSIGDILLDASFSPIRRVSYAVESARVEQRTDLDKLI
IDIETNGSVDPEEAIRYAAKVLVEQFSFFADLESTPLPTEQPKAPVIDPI
LLRPVDDLELTVRSANCLKVENIFYIGDLIQRTEAELLRTPNLGRKSLNE
IKEVLASRDLSLGMKLENWPPANLENYIKEPGHASS
>NE2046 rpoB, RNA polymerases beta subunit
MSYSFAEKKRIRKSFAKRASILPFPFLLATQIQSYTDFLQAEIAPGKRKN
QGLQAAFNSVFPIESHSNNARLDFISYMLGSPVFDVKECQQRGLTYAAPL
RARVRLTILDKEASKPTVKEVKEQEVYMGEIPLMTDTGSFVVNGTERVIV
SQLHRSPGVFFEHDRGKTHSSGKLLFSARIIPYRGSWLDFEFDPKDYVYF
RIDRRRKMPVTTLLKAMGYSPAQILADFFEFDHFILVDNKIFFNLIPERL
RGELAGFDIVSEDGKVFVQKDKRITAKHVRDLQQANLTKIPVPEEFLLGK
ILAADLVDKETGEIIALANSEISETLLGRIRQTQSSEISTLFVNDLNYGP
YISQTLRIDETTDQMSAQVAIYRMMRPGEPPTEEAVLALFNGLFYSPERY
DLSVVGRMKFNRRVGREELTGSTTLSNDDIIDVIKILVELRNGRGEIDDI
DHLGNRRVRSVGELAENQFRAGLARVEKAVKERLSQAESENLMPHDFINA
KPVSSAIREFFGSSQLSQFMDQTNPLSEVTHKRRISALGPGGLTRERAGF
EVRDVHPTHYGRVCPIETPEGPNIGLINSLALYARTNEYGFIETPYRMVR
NGRVTEEVVYLSAIEESQYVIAQANANFDQNGVFTDEVVSCRHKNEFTLA
SRDQIEYVDIAPAQIVSVAASLIPFLEHDDANRALMGSNMQRQAVPCLRA
EKPLVGTGIERVVAVHSGTAVRTIRGGVVDYVDASRIVIRVHDAEARAGE
VGVDIYNLTKYTRSNQNTNINQRPIVRMGDVLSRDDVIADGASTDLGELA
LGQNMLIAFMPWNGLNFEDSILISERVVSDDRFTSIHIEELAAVSRDTKL
GTEEITADIPNLSERQRARLDESGIVYIGAEVEAGDVLVGKVTPKSETQL
TPEEKLLRAIFGEKASDVKDTSLHVPAGISGTVIDVQIFTREGVDRDKRS
KQIIADELGRFKKDLADQMRIVEADAFQRAERLLTGKVAAGGPKKLAKNS
TITRDYLENVEKHHWFDIRLVDENTSLQLEQIKDSLVQKRKLFDLAFEEK
HRKLSQGDELPPGVQKMVKVYIAVKRRLQSGDKMAGRHGNKGVISKIVPI
EDMPYMADGTPVDVVLNPLGVPSRMNIGQVLEVHLGWAAKELGKRVNEML
ASQRNVSDIRDFLNKIYNNSGKQEDLTSLEDDEVLALARNLSSGVPFATP
VFDGAHESEIKQMLKLAGLPESGQTTLYDGRTGEAFDRPVTVGYMHVLKL
HHLVDDKMHARSTGPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGASY
TLQEMLTVKSDDVNGRTKVYESIVKGDHKIDAGMPESFNVLVKEIRSLGL
DIDLEEH
>NE2045 rpoC, RNA polymerase, alpha subunit
MKALLDLFKQVTQKEEFDSIKIGLASPEKIRSWSYGEVKKPETINYRTFK
PERDGLFCAKIFGPVKDYECLCGKYKRLKHRGVICEKCGVEVTLSRIRRE
RMGHIELASPVAHIWFLKSLPSRLGLVLDMTLRDIERVLYFEAYVVTDPG
MTPLNRGQLLTEDDYLNKTEEFGDDFSAVMGAEGIRTLLSNMDIPLEIES
LRLEIQTTGSETKIKKAAKRLKVLEAFNKSGMKPEWMILTVLPVLPPELR
PLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLELRAPEIIIRNEKRMLQ
ESVDSLLDNGRRGKAMTGANKRPLKSLADMIKGKGGRFRQNLLGKRVDYS
GRSVIVVGPQLKLHQCGLPKKMALELFKPFIFNKLETMGVASTIKAAKRE
VENESPIVWDILEEVIREHPVMLNRAPTLHRLGIQAFEPILVEGKAIQLH
PLVCAAFNADFDGDQMAVHVPLSLEAQMECRTLMLSTNNVLSPANGDPII
VPSQDIVLGLYYMTRKKIGAQGEGMVFSDISEVVRAYENKVVELNAGIIV
RIKERKKSRSHGEEPVEAITRYETTVGRALISEILPAGLPFSIINKVLKK
KEISKLINASFRLCGLRETVIFADKLMYSGFSYATRGGISICLDDLVTPS
QKNDIIHAAEQEVHEIANQYISGLVTQGERYNKVVDIWGRAGDQVAKAMM
DQLSVEPVTDRETGQVRADKNGQVVTQESFNSIYMMADSGARGSAAQIRQ
LSGMRGLMAKPDGSIIETPITANFREGLNILQYFISTHGARKGLADTALK
TANSGYLTRRLVDVTQDLVITEDDCDTDGGVIMKALVEGGDVIESLRERI
LGRVAATDIVNPETGAVIYAAGMLLDEDAVDEIETCGIDEVKVRTPLTCE
TRYGLCAKCYGRDLGRGMLVNVGEAVGVIAAQSIGEPGTQLTMRTFHIGG
AASRTVVANQVESKSNGVIRYSHHIRYVKNAQNELIVISRSGEVYIQDEN
GRERERHKIPYGATLLVQDGEVIKAGQILASWEPHKRPIIAEYAGKVRFE
NVEEGVTVVRQIDEITGLATLVVIDPKRRNVAQSKGLRPLVKFLDENDNE
INIPGSDQPVSITFHVGSIITVRDGQQVNIGEVLARIPQETSKTRDITGG
LPRVAELFEARVPKDVGFLAEATGTVAFGKDTKGKQRLVITDLDGVAHEY
LIPKDKHVTAHDGQVVNKGEVIVDGPIDPHDILRLQGVEALARYISNEVQ
DVYRLQGVRINDKHIEVIVRQMLRRVQIMNAGDSSFIPGEQVERAEVLTE
NEKLIAENKMPATYEYVLLGITKASLSTDSFISAASFQETTRVLTEASIM
GKKDDLRGLKENVIVGRLIPAGTGLSFHNIRKKQRLSESAAYLDTDLTEN
EVTE
>NE0229 rpoD, DNA-dependent RNA polymerase sigma subunits (sigma70/32)
MAKAKVSGSAKKNIVTSTEVTETKSGKSRVETPDNGVKVTGKNAKNTFSA
ETEVVSVKKEATSASRKKSVKAAKSDDDSVLVGQNKNTLLPAEKSEDLGA
VDEVSEKTRVSKRNKSTLSEKQITDLSESAVSVSSDNDIELRRMRLKKLI
MQGKERGYLTYSEINDHLPDDMLDADQIESIISMINDMGISVHDEAPDAE
ALLMSDAATAVTDEDVVEEAEAALSSVDSEFGRTTDPVRMYMREMGSVEL
LTRESEIEIAKRIEDGLRHMIQAISACPTIIENILELAADVENGRLRADE
LVDGFLDDDTDEIMGKEMSEESLDQDFDSEDGEDEQVAAASADMLKMKAE
VLERFAVIRQTYDGMKKIINKKDGRQDKKYKQLQEKISTELMAMRFSAKM
VEKLCDTQRDIVNDIRNCERKIAELCVTRAGMPRNYFIKAFPGNETDIDW
ISSEVESGQPYSQSLEHFRPAVMEQQQKMLELQKQAGIPIKELKEINRRM
TTGEAKARRAKREMTEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKA
VDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKMNR
ISRQILQETGQEPEPAVLAEKMEMPEEKIRKILKISKEPISMETPIGDDE
DSHLGDFIEDASTMEPADAAVYAGLRTVTKDVLDSLTPREAKVLRMRFGI
EMNTDHTLEEVGRQFDVTRERIRQIEAKALRKLRHPARSDRLRSFLDSGN
S
>NE2331 rpoE1, Sigma factor, ECF subfamily
MSDREIDQQLVEQVQRGDKRAFDLLVIKYQRRLARLLSRFIRDPAEVEDI
TQETFIKAYRALPSFRGDSAFYTWLYRIGVNTAKNFLVSQGRKLPATVNG
FGTEEAENFEGADQLREVNTPESELMGKQVAQTVNQALEALPEELRTAII
LREIEGLSYEEIADIMNCPIGTVRSRIFRAREAVADKLRPLLGTDKSKRW
>NE0584 rpoH, Sigma-70 factor family
MNEIMALPVMAGGDLDSYIRATSSFPVLSREEEASLAKRMREEGDVNAAH
QLVLSHLRVVVTIARGYLGYGLPHADLIQEGNIGLMKAVRRFDPERGVRL
ISFAIHWIRAEIHEFIMRNWRLVKIATTKQQRKLFFNLRSMKKGLDTMSQ
AEVASMAEQLGVKAEEVVEMETRFSGRDISLEPVTDSDEDDFSPIAWLTD
GSDPSRELEKEQLERLTHARLKNSLDSLDERSRRIIEARWLREDKAATLH
ELADELGVSAERVRQIEAKAMKKMRAEIAI
>NE0062 rpoN1, Sigma-54 factor family
MKPTLQLKLNTGLTLTPQLQQSIRLLQLSTLELNQEITRLVQENPLLELD
EGIDQDGFETGGPDISLSPSDSSAATESNGISEIGKNTDIQSDWPDDPLY
ADQEFFFDTRRRSDEDHDQDFSRLISKPVSLREHLLSQISLGQYCERSKK
IAELLVDSLNDDGYLSQDLDELAELLPPELEISTADLESALDYIQQLDPP
GVGARNLQECLSLQLKALPGNTPLRDEALKLVNGHLESFAAKNFQLLKKV
LDCNETCLQSIYQLITQLNPKPGDDFNTTTARHVIPDVIVTRSDSGWGVR
LNSDSVPRVRINSLYAGILKHHRTEATQMLMGQLKEARWLVRNLDQRMDT
ILRVSQAIVDCQQAFLEQGETAIRPLVMREIAVTLGLHESTISRVTTQKY
MRTPHGIFELKYFFGSHIPAERGEAHSAIAIRGLIKRLIQDEDRRKPLND
SQISQMLAQQGIVIARRTVAKYREFMHIPPTNLRKTL
>NE2255 rpoZ, RNA polymerase omega subunit
MARITVEDCLEVIPNRFDMTLAATVRARQVSVGSTPMIDSERDKPIVIAL
RELAQKKYGEEILNTLR
>NE2143 rps4, Ribosomal protein S4:S4 domain
MSRFTGPRLKIMRALGVDLPGLSRKTIASRPTPPGQHGAKLVRRRKSDFG
IKLQEKQKLRFNYGLSERQLRHLMLNARKSTEPTGETLLQLLERRLDNVV
FRAGFAPTVIAARQLVSHRHVRLNGKPVNIPSIRLNVGDEITIKPESLNL
PIVLGTLQDLPLSRPEWLLWDEKDKTGKITHLPTAEDVPFPIDVQQVVEY
YANRM
>NE1962 rpsA, Ribosomal protein S1:S1 RNA binding domain
MTIATSVAGSSENFAELFNESLSHKEMRVGEVITAEIIRVDYNFVVVNAG
LKSESYIPIEEFKNDRGEIEAREGDFVSVAIEALENGFGETRLSRDKAKR
LNAWYELEEAMEKGKIVTGMVNGKVKGGLTATINGIRAFLPGSLVDIRPV
KDMTPYENKEMEFKVIKLDRKRNNVVVSRRAVLEETQGADRDELLASLQD
GAVVQGIVKNITDYGAFVDLGGIDGLLHITDLAWRRVKHPTEVINVGDEV
TAKVLKFDQEKNRVSLGLKQLTEDPWIGLSRRYPQGTRIFGKVTNMTDYG
AFVEIEQGIEGLVHVSEMDWTNKNVYPSKIVQLGDEVEVMILEIDEDRRR
ISLGMKQCRPNPWEEFSLSHEKGDKVTGQIKSITDFGLFIGLPGNIDGLV
HLSDLSWNQPGEKAISEFKKGDEVEAVVLSIDVEKERISLGIKQLEGDPF
NIYVASHDKNSIVKGTVKSVDAKGAVIALTDEVEGYLRASEFSRDKIEDI
RSRVNEGDEIEAMIINIDRKNRSISLSVKARIQDEEDKAVKSLASVSNPA
SAGTTNLGALLKAKIDSKSSEE
>NE1718 rpsB, Ribosomal protein S2
MSVTMRQMLEAGVHFGHQTRFWNPKMAPYIFGQRNKIHIVNLEHTLVMLR
EALDYARRLTANKGTILFVGTKRQARDIVMEEAIRCGAPYVNQRWLGGML
TNFKTIRQSIKRLQDMEKMVQDGTLNKLTKKEALDFQRELEKLNNSLGGI
KEMKGLPDAMFVIDVGYQKGAIVEADKLGIPVVGVVDTNHSPAGVRYVVP
GNDDSSQAIRLYARAMADAILEGRNQSVQEIIEMSKSDDEIMSGRDQSAG
A
>NE0407 rpsC, Ribosomal protein S3:Type 2 KH domain:KH domain
MGQKINPTGFRLSVLKNWSSRWYTNTKKFSDFLNEDISVRQYLQKKLAHA
SVGSIIIERPSKNAKITIHTSRPGVVIGKKGEDIEILRRNVEKLMNVPVH
INIEEIRKPEIDAQLIAASITQQLEKRIMFRRAMKRAIQNAMRLGAQGIK
IMSSGRLNGIEIARTEWYREGRVPLHTLRAEVDYGTSEARTTYGIIGVKV
WVFKGEQLGIKERQN
>NE0425 rpsD, Ribosomal protein S4:S4 domain
MARNINPKCRQCRREGEKLFLKGDKCFSDKCPIERRNYPPGQHGQKKVRL
SDYAVQLREKQKIRRIYGLLENQFRNVYKRADKQKGVTGDNLLQLLESRL
DNVAYNMGFGSSRSEARQIVRHNCVLLNGKRANIPSHLVEPGDLIEIAEH
AKSYLRIKASIEAAKRRSIPSWLEVDFDNLKGLYKSKPERSDLSSTINES
LVVELYSK
>NE0418 rpsE, Ribosomal protein S5
MTKTTKSANTKTPDEVKTDGLKEKMVSVNRVTKVVKGGRILGFAALTVVG
DGKGSVGMGKGKSREVPLAVQKAMDEARQRMVRVNLINGTLHHSVIGRHG
AARVYMQPASEGTGIIAGGPMRAIFEVMGVHNILAKCLGSTNPYNIVRAT
LDGLSKVQTPAMIAAKRGKSIDEITGA
>NE0198 rpsF, Ribosomal protein S6
MRHYEIVFIVHPDQSEQVSAMTERYSNIVTGRSGKIHRLEDWGRRQLTYP
IQKLHKAHYVLMNIECDQESLNELEHSFKFNDAILRHLVIRMNGPITTPS
PMMQEGKSRPPHSSDEDSENTAPAKAKTADSPGEDTRTTEESDPKP
>NE2054 rpsG, Ribosomal protein S7
MPRRREVPKREILPDPKYHNIELAKFVNVLMTRGKKSVAEQIIYGALNHL
EKKTGKDPVEVFTQALSNIRPVVEVKSRRVGGANYQVPVEVRSIRRSALA
MRWLRDAARKRSEKSMDLRLASELLEASENRGAAIKKREEVHRMAESNKA
FSHFRF
>NE0415 rpsH, Ribosomal protein S8
MCMTDPIADMLTRIRNAQSAEQKEIKMPSSKLKKAILKILKEEGYIENFQ
EDPNHKKPSISVILKYFNGEPVITSISRVSKPGLRSYKSKNDLPRVMNGL
GVAIVSTSKGVMTERTARMAGVGGELLCVVT
>NE1483 rpsI, Ribosomal protein S9
MAIQYNYGTGRRKSAVARVFIKPGTGVITVNHKPADEYFSRETGRMIIRQ
PLELTGNTSNLDIMVNIHGGGESGQAGAVRHGITRALIGYDETLKPILSK
AGFVTRDAREVERKKVGLRKARRRKQFSKR
>NE0400 rpsJ, Ribosomal protein S10
MQNQKIRIRLKAFDYRLIDKSAIEIVETAKRTGAVVKGPVPLPTRIERFD
VLRSPHVNKTSRDQFEIRTHLRLMDIIDPTDKTVDALMKLDLPAGVDVEI
KL
>NE0424 rpsK, Ribosomal protein S11
MVKPSLKIRKKVKKNVVEGVAHIHASFNNTIVTISDRQGNALSWATSGGV
GFKGSRKSTPFAAQVAAEHAGRAALEYGVKNLEVRVKGPGPGRDSAVRAL
NATGFKITSITDVTPIPHNGCRPPKKRRI
>NE2055 rpsL, Ribosomal protein S12
MPTISQLVRKPRKAKRTKSKVPALESCPQKRGVCTRVYTTTPKKPNSALR
KVARVRLTNGYEVSSYIGGEGHNLQEHSVVLIRGGRVKDLPGVRYHTVRG
SLDTAGVKDRKQARSKYGSKRPKSA
>NE0423 rpsM, Ribosomal protein S13
MARIAGVNLPSNKHVNIALTAIYGIGNTTARKICSDLQIPPFIKLKDLED
IKLDELRESVSKLIVEGDLRREISMNIKRLIDLGSYRGLRHRRGLPVRGQ
RTKTNARTRKGPRKAIGAK
>NE0173 rpsO, Ribosomal protein S15
MAVTTDQKSQVMRDYQRAAGDTGSPEVQVALLTTRINSLVDHFKQHVKDH
HSRRGLLRMVSRRRKLLDYLKSKNNDSYRALIERLGLRK
>NE1671 rpsP, Ribosomal protein S16
MVVVRLARGGAKKRPFYSMVVADSRNRRDGKFIERVGFYNPRATGGEESL
RIQMDRLTYWQSKGAQLSDTVSRLVKQFGRQQKAAQPQE
>NE0196 rpsR, Ribosomal protein S18
MRFSRNKDADKRMKDRMANRPLFKRKKFCRFTAEGIKHIDYKDIDLLKDF
VSENGRIIPARITGTRAYYQRQLNLAIERARFLALLPYTDQH
>NE0405 rpsS, Ribosomal protein S19
MSRSVSKGPFVDLHLVNKVLSARQNNDKKPIKTWSRRSTILPDFIGLTIS
VHNGRQHVPVFVTENMVGHKLGEFSHTRTFKGHAGDKKAAGSKR
>NE2340 rpsT, Ribosomal protein S20
MANTAQAKKRVRQTATRRERNFGLRSKLRTAIKGVRKAVAAGDKNVAEVV
FRKAVSVIDSVASKGIIHKNKASRHKSRLSGAVKAMG
>NE0226 rpsU, Ribosomal protein S21
MTTVKVKENEPFEIAMRRFKRSIEKTGLLTELRAREFYEKPTAVRKRKHA
AAVKRTYKRLRSQMLPPKLY
>NE1715 rrf, Ribosome recycling factor
MIADVKKSAEQKMQKSLDALKVDFSKVRSGRPHTGLLDHIMVDYYGTPTP
IKQLANVTLADARTIGIIPWDKKIFSAIEKAIRDSDLGLNPMTVSDMVRV
PMPPLTEERRKDLTKIVKTEAEAARVAMRNIRRDANAHLKELLKDKLIAE
DEDRRAQDEIQKLTDRYIAEVDKLLQTKEAELMAV
>NE0212 ruvA, probable Holliday junction DNA helicase subunit
MIGRIAGLLLEKHPPLVLVDVNGIGYEIDVPMSTFCRLPGIGEQVTLHTH
FWVREDAHLLFGFMTEPERVLFRQLTKISGIGARTGLAILSGLSVNDLHQ
IVVSQDSTRLTRIPGIGKKTAERLLLELRDKISPAITLPETGTAMASSTD
KDILNALSALGYNDREANWAVGQLSEGVTVSDGIMQSLRLLSKAK
>NE0213 ruvB, ruvB; holliday junction DNA helicase protein
MIESDRIITASPFSSQEEVIERALRPVQLDDYVGQEKIREQLKIFIEAAR
LRQEALDHVLLFGPPGLGKTTLAHIIAREMGVNLRQTSGPVLERAGDLAA
LLTNLETNDVLFIDEIHRLSPVVEEILYPAMEDYQLDIMIGEGAAARSVK
IDLPSFTLVGATTRAGMLTNPLRDRFGIVSRLEFYTADELGKIVTRSAGL
LNVDVTADGAREIACRSRGTPRIANRLLRRVRDFAEVRANGRIDRPVADA
ALQMLDVDATGLDVLDRKLLLAVLEKFGGGPVGVDNLAAAINEERDTIEE
VLEPYLIQQGFLQRTPRGRMATTMAYQHFDIIPSHQTTVPSLFDPD
>NE0211 ruvC, Crossover junction endodeoxyribonuclease RuvC
MTSLVYAAKGIRILGIDPGLRITGFGIVEKIGNRLVYIGSGCVVTGESGL
PDRLKTILDGLNEIILQHKPEQVAVEQVFVNINPKSTLLLGQARGAAISA
AVLHELSVYEYTALQVKQAVVGNGHARKEQVQEMVMRLLGLGERPRPDAA
DALACAICHAHGGTGLLTLSARNRSKRSKRL
>NE0671 sbcB, exodeoxyribonuclease I
MQTGNSTLYWHDYETSGATPRWDRPFQFAGLRTDEALNEIGDPLVIYCQP
ARDRLPHPEACLLTGITPQMAEARGLPEPEFIALIHAQLAQPGTCGVGYN
TLRFDDEVTRFTLYRNFYDPYAREWQSGNSRWDVIDLARMTFALRPEGIN
WPINGEGKPSFRLEDITTANGLVHDSAHDALSDVRATIALARLLRAQQPR
LYDWLFRLRDKRAAGNLLDMKTHAPVLHTSRMYSSEYGCTTLVMPLLPET
GNANSVLVYDLRHDPAEFVLLDIDALAERLFTPKEELAEGLQRLPVKAVR
LNKCPALAPQKVLNDEVANRIGLNVEQCQQHWQLLLQHPDFMQRIKQAYS
GNKVFAENDADLALYDGFASDHDRNLFPLVRDAEPGKLADLAGKFQDERY
IELLFRYRARNFPDTLSVQEEHHWQMHCRRQLGENAINGSLTLNEYHQKL
LQLRTDCPQQAQLDILNELEAWGRVLAQENDLPWPPDHSGSEEQTD
>NE1392 sbcC, ATP/GTP-binding site motif A (P-loop):ABC transporter
MQILQVRLKNLNSLVGEWEIDFTDPAFVSDGIFSITGPTGAGKTTVLDAI
CLALYGRTPRLGKVSKSENEIMSRQTGECFAEVTFAAQSGCFRCHWSQHR
AHKKPDGELQNPRHELAEADSGKILETRLTEVGRQIEKITGMDFARFTRS
MLLAQGEFAAFLQAAPDERAPILEQITGTEIYSRISIRVHETRVSARREL
DRLSAGLNGIQPLTRADEQQFHTDLAQKIQQDAGLNEQIAHERQILVWLE
NIARLESELHLIAGQQQAWLSRKEALTPDISKLDSASRALELLGEYSRLT
SIRNEQETDRNNLAICAASLAGLEQAVKQAEQSLKSLNEQCDRQRAKQRE
TIPLLRKVRELDIQIREKESPISTASKGITAQKKTLAALRNQYQQNEIQL
AGLQTTLAGLLQQLHVIQPDGAQMDFAHNQNLLNRKQAEYRQLLENRSLA
DWRQEIAVLSGQKTLATRAIEAMQSLAASKQISAELEKHTCSLLAGKTQL
AKQLGAEEEMLGALEREISLLETQALLLKKISHLEEARQQLKDDEPCPLC
GALQHPYAAGNTPRPDDNITALNQARTMLKTRIDTISTLKIRQAETNRDI
EQTACRQQEIHRQIQADETLLQQCAVSLFPGLPSAAMFPELPRLLQETDD
KLARMTRILQTAEILENEISVQRESLDKTRELEQKIGILRVQHQHQSTQI
RQHEAELQLRQEQLDQLQQELGNLRTTRLQLFADKQPDQEEQSLTTAIEA
AQKSADNARQQLETEIQQYNRLKNRAEDLVKTITTRAVQLEKLQETFAAR
LTQSGFADEAGFTAACLPEEERRRLAQRAQQLADEKTMLDTRQKDKTIAL
QAEQLKSMTDQPRDFHDQVLAQLITRQQVLQQEIGGLRQKLADNENSKQK
QQEQLQVIEAQKRECARWDLLHGLIGSADGKKYRNFAQGLTFEVMIRHAN
RQLQKLSDRYLLIRDPVRPLELNVIDNYQAGEIRSTKNLSGGESFLISLS
LALGLSRMVSRNIRVDSLFLDEGFGTLDEEALDTALETLAHLQQEGKLIG
IISHVTVLQERISTRIQVIPRSGGRSVLAGPGCRHCQ
>NE1390 sbcD, Serine/threonine specific protein phosphatase:Exonuclease SbcD
MKILHTSDWHIGKTLYGHKRYDEFEAFFSWLVETIEQEQVDVLLIAGDIF
DTSTPGNRSQQLYYRFLHRVAASACRHVVIIAGNHDSPSFLSAPRELLRA
LDVHVTGSLSGNPADEILVLHDPKGDAELIVCAVPHLRDRDIRTAEAGES
MEDKSRKLVEGIRDHYAEVINLARLQRTALSSSIPIIAMGHLFVAGGQTV
EGDGVRELYVGSLAHVPAGIFPPDIDYLALGHLHVPQRVNGSSVMRYSGS
PLPIGFGEADQEKSVCLIEFNRQISATRPAVSLINIPVFQPLERIRGNWQ
VISDRISMLSAANSCAWLEINYEGDEMITDLQERLQSAIEGSRLEILRIR
NNRIMNQILDQIDDGGTLEELSVNEVFEHCLSAAAIPVEQRTELWRTYQE
TLVSLDEEDIRAE
>NE0582 sbp1, Prokaryotic sulfate-/thiosulfate-binding protein
MKFNIWLSVGLLTFALAGASQAFADRTLLNVSYDPTRELYEEYNREFIRY
WQEKTGEKITVKQSHGGSGKQARAVIDGVPADIVTLALAHDIDAISEQSG
LIPAEWQKRLPNNSSPYLSTIVLLVRKDNPKGIRDWDDLIKPGVAVITPN
PKTSGGARWNYLAAWGFALKKYDGDEAKAQEFLTRLYKNVSILDSGARGS
TINFIQRGIGDVLISWENEAFLALKEYGPEKFEIIIPSISILAEPPVAVV
DKNVDKHGVRNIAQAYLEYLYDKKGQEIAARNFYRPSDPEIAKKYAHQFP
AINLFTINEVFGGWPQAQSIHFKDGGLFDKIYVNQ
>NE1048 sdhA, Succinate dehydrogenase/fumarate reductase, flavoprotein subunits
MKVKQRKFDVVIVGAGGAGMRAALQLSEAGFKVAVLSKVFPTRSHTVAAQ
GGIAASLGNVTEDDWRWHMYDTVKGSDYLGDQDAIEFMCRHAAEVVYELE
HFGMPFDRLENGKIYQRPFGGQSLNFGGEQASRSCAVADRTGHAMLHTLY
QRNVSANTQFFVEWLGLDLIRDHEGEVLGITALEMETGEIMILQARATLL
ATGGAGRIFEASTNAFINTGDGLGMVARAGIPLEDMEFWQFHPTGVYGAG
ILISEAVRGEGGYLVNAEGERFMERYAPHARDLASRDVVSRALITEIRQG
RGCGPKEDHLLLKLDHLSAETITSKLPGIREIALKFAHVDPLVDPIPVVP
TAHYMMGGIPANYHGQVVAPYKTSPDEVVPGLYAVGECACVSVHGANRLG
TNSLLDIVVFGRAAGNQIIKDLTRNNHHKPLPPDAAEQTLTRLDRLETQK
DGENVAATSEALRKTMQTHCGVFRFPDVLKEGVEKIGEVAERVRHTEICD
KSRVFNTARVEALELDNLIEVAIATMIAAEARHESRGAHTRDDFPERDDQ
KWLRHSLFFSKDQRLDYKPVRLKPLTVESFPPKARTY
>NE2371 sdhB, sdhB; succinate dehydrogenase (iron-sulfur subunit) oxidoreductase protein
MLFSISRYDPDKDEKPYMQDYDVELEPTDKMLLDALIRIKTIDDSLSLRR
SCREGVCGSDAMNINGRNGLACITPLAGLREPVEIRPLPGLPIIRDLIVD
MSQFFKQYHSIKPYLVNNDPPPETERLQSPEERARLDGLYECILCACCTT
SCPSFWWNPDKFVGPAGLLQAYRFLADSRDQATSERLDDLEDPYRLFRCH
SIMNCVDACPKGLNPTAAIESIKIMMLNKTV
>NE1047 sdhD, Succinate dehydrogenase, cytochrome b subunit
MVKPVHRVVTGAHYGLKDWLVQRITAVLMAAYTGLLIILITVYQPESHEE
FKALFSIQWMKIASLLFFAGLCWHAWVGVRNVLMDYVHPMPVRLTLQITC
IVALLGYLIWFLDILWG
>NE0808 secA, SecA protein:SEC-C motif
MLSNLLKSIFGSRNDRLIKQYLKIVRTINELEAAISPLSDEELRAKTSEF
KQRVANGEKLDQLLPEAFAVVREAGKRVLGMRHFDVQLIGGMVLHEGKIA
EMRTGEGKTLMATLPTYLNALSGKGVHIVTVNDYLAKRDAEWMGQIYRFL
GLTVGVVLSQMPHEEKQAAYAADITYGTNNEYGFDYLRDNMVGHSAERVQ
RVLNFAIVDEVDSILIDEARTPLIISGMAEGDTEIYKRIDTLIPGLTRQE
DEKSPGDYSVDEKTQQVLLSEEGFEHAEKLLSEAGLLSAGSSLYDPMNVS
LIHHLNAALRARALYNRDQHYVVQNGEVIIVDEFTGRLMPGRRWSEGLHQ
AVEAKENVPIQKENQTLASITFQNYFRMYEKLAGMTGTADTEAFEFQQIY
GLETVVIPTHRPMTREDRMDQVFRTPQEKYQAIIADIKDCYERKQPVLVG
TTSIENNELLAALLTKEKLPHQVLNAKQHAREADIIAQAGQPKMVTIATN
MAGRGTDIVLGGNPEQEINRIRADETLDEAAKSKKIEEIHQAWQARHDEV
IKLGGLHIIGTERHESRRIDNQLRGRAGRQGDPGSSRFYLSLEDPLLRIF
SSDRVANIMTRLKMPEGEAIEHPWVTRAIENAQRKVEARNFDIRKQLLEY
DDVANDQRKVIYQQRNELLDAEQGVSETISAIRESVVHQLIDRYIPEQSI
EEQWDIPGLEKALASEFHLQIPLQKWLEEDSELHEENLHDRIIELVDTSY
LNKVEQVGAPIMHQYERMIMLHSIDTHWREHLAALDHLRQGIHLRGYAQQ
NPKQEYKREAFELFTSMLDAIKADVTKILMTVQIRSEQQVESVAETSALR
NLEYHHDTHSELAEEQPPVAENRENKQQPFVRKNEKVGRNDPCPCGSGKK
YKQCHGKLN
>NE2210 secB, Bacterial protein export chaperone SecB
MTESQQQPVFVIEKIYVKDLSLEIPHAPRIFLEREAPEINFQLATSHNAV
DGEIHEVVVTATVTARLKEKDQVMFLVEAHQTGIFRIGNVPGDEVEPVLS
VLCPNILFPYLRETISDTVTRAGFPPVILNPVNFEAIYHQKKQQETAGEQ
PDQPADTITRH
>NE1778 secG, Preprotein translocase SecG subunit
METLIWSVHIIAAVIIIILILLQQGKGADMGAAFGSGASGSLFGASGSAN
FLSRMTALATTVFFITSLTLTYFSGTSHSGEESVMQKLDMKSGEVAVPAA
GGDKLGTSEKEGEGDASRVGEIPE
>NE0421 secY, SecY protein
MVSKSKAALTDRFYDLKKRFWFLVLALIVYRIGAHAPVPGIDPVVLKDLF
DSQEGGILGMFNMFSGGALSRFSIFALGIMPYISASIIMQLGTVAVPYLE
SLKKEGESGRKKITQYTRYGTLFLSTLQSYGISIALQSQPGLVIQPGFYF
VVTTVITLVTGTMFLMWLGEQITERGIGNGISLIIFAGIAAGLPSAIGGT
LELVNTGAMHFLTALFIFVAAIAITYFVVFIERGQRKILVNYAKRQVGKQ
VMGGQSSHLPLKLNMSGVIPPIFASSIILFPATLAGWFSDSLSLDWLKDF
SSALTPGQPIYVALYAALIIFFCFFYTALVFNPKETADNLKKSGAFIPGI
RPGDQTTRYIERIMLRLTLIGSIYVTLVCLLPEFMILKWNVPFYFGGTSL
LIIVVVTMDFMAQVQSHVMSGQYESLLKKANFKAGGSSFSR
>NE0733 selD, AIR synthase related protein:Selenide water dikinase
MPWAGTAGVFDTSTSFPIPSSSQERFLFRTIMPSEKIALTQTVQKGGCAA
KVAATELRRILQQVGFPAAHPALMVDGRYFDDAAIYRVNEETALVQTLDF
FTPIVDTPKLFGEIAAANALSDVYAMGGRPVTAMGILAFPLAALPEQVIV
DVLQGASDKIAEAGANFVGGHSIDDDTLKFGLSVTGFVNPRQVWTNAGAR
PGDHLILTKALGTGTMTAAVKRQQLREEDIIEALDSMTAINNAIDCMSPD
LQASIHAATDITGFGFSGHAMQLANASDVTLCIETGNLPRFDKTFHCLEN
GCLTKAHRTNAEYTTPHIDDSTLDALHKLLIHDPQTSGGLLLSVAPETSQ
TMLQALHTRFPSAVIVGTVHPHQDKAVQFA
>NE1688 serA, D-isomer specific 2-hydroxyacid dehydrogenase
MSKIVVSTSSFGFDHNPAIQQLRAQGFTITGNPYQRKLTEDEIITLLGND
TVALLAGVEPLTEHVLTSASALRVIARCGTGMDNVDLEAARRLNIQVSNT
PEAPAQAVAELTLGLMLDCLRQINRIDRSVRQGEWPRSQGRLLAARTVGI
VGLGHIGRRVAKLCQAFGAQVIAHDPHLQLAPDGVELVALTTLLEQADLV
TLHLPYSPAVHYLIDAEAIDRMKPGTILINAARGGLVDETALCAALNTGH
LEAAALDSFEQEPYHGPLCECKQAILTSHIGSLARETRQRMEIEAAENLK
QGLIEAGLIDE
>NE0439 serB, possible serB; phosphoserine phosphatase protein
MNLIIQGNDVQNNDLRAIAKLAEASRIDRITGEAFRLLDAQPHPDITEYC
EEASLDHAFVPSGKKLTDFGLIAMDMDSTLLAIESIDEIADMHNVKPQVS
AITQSTMRGEISFAESLTRRTALLEGLPQEALQKVYDERVRLNRGAEKML
QRMQSAGIKTMVISGGYTFFTDRVKDRLNLDYAFANTFEVQDGKLTGRVL
GNIIGASGKGEILKRIRDELGLSKEQVIAVGDGANDLKMLEESGVGIAFH
AKPILREKATFSLNHVGLDGIVNLFE
>NE0333 serC, Aminotransferase class-V:Phosphoserine aminotransferase
MRKIYNFSAGPAVLPEEVLEQAREEMLDWHGSGMSVMEMSHRGKEFMSIA
DETESALRELAGIPDHYKVLFLQGGASSQFAMVPMNLLGKKGKADYVNTG
QWSAKAISEAKNYGSVQIAASSESDGFNSVPPLAQWHISPDAAYVHYASN
ETIGGVEFQWTPDLSAVAGDNKNIPLVADMSSNFLSRPFDVSKFGLIYAG
AQKNVGPAGLVVVIVREDLLDIPPLAGTPAMFRYKTHADNASMYNTPPTY
AIYIMGLVMEWLKKQGGLTAIEQRNIAKAKLIYDLIDVSSFYHCPVNQAD
RSRMNVPFTLSDPGLDDAFLKQAQAHGLIQLKGHRSVGGMRASIYNAMPL
EGVQTLVTFMREFEKNHA
>NE0180 serS, tRNA synthetases, class-II (G, H, P and S):Seryl-tRNA synthetase
MLDIQQLRSNLQNIITRLAQRGYDFPVADFESLESQRKSVQTLTQTLQAK
RNSASKQIGIARQRGEDVSLIMAEVANMGDELKQAENQLEAVQTRLQQLL
LEIPNLPHDSVPAGKDENDNLEMRRWGTPKTFDFPVQDHASIGEHLKLID
FETAAKLSGARFSLLKGGLARLHRALAQFMLDTHTQENGYNEIYVPYLVN
ADCLRGTGQLPKFEQDLFAVRSGAQEEADNPDKTGPAGLHLIPTAEVPLT
NIVRNTIVPLENLPLQFVAHTPCFRSEAGSYGRDTRGLIRQHQFDKVELV
QITHPEKSHEALESLVGHAEKILQKLELPYRVMLLCTGDMGFSAAKTYDI
EVWLPAQQAYREISSCSNCEAFQARRMQARFRKGQEKPELLHTLNGSGLA
VGRTLVAILENYQNEDGSITIPEILRAYMGGIERIGL
>NE2537 sinR, Helix-turn-helix motif
MPSPLGDKIRALRKQKKLSLEQLAELTDSSKSYIWELENKDDPKPSAEKI
GKIATVLEVTTEFLLTESATTPDEEVLDEAFFRKYKNMSEPDKKKIRKIL
DAWEDE
>NE1615 slt, SLT domain
MSKIFIGVLLAFWSQLLVAGGDEDYLEIREAFRAGNAARVAEYAERMKYH
VLSPYAEYFRLRLSLVTAETGAVRAFLARHDGSFVADRLRADWLQILGKR
QQWATFAEEYPKLVNREESLQCYALQHRLATGDKTANGEVRSLWFTGRDM
PASCVTVFDSLVRIGAISVEDVWTRIRLAFEAGNTGVARNINKYLPRHQA
LDLQKLNAVIKDARRFLDRPVSVQSRADREIVLFALLRLLRSETNQALAR
WHKIRGQFPEADRSYFAGNLAYWAAIRQDSRAMNWFMDATTGRHVYPLSE
TKQAWKARIALREGNWKMLLDTLGQMPAAMQQEDVWRYWRARALKVSRKN
SEANTLLVSLSREHSFYGQMAREELGEMLSVPADEYQVRPEEIRLMEQNP
GIRRALALYRLDQRIEANREWIWTIQHFSDEQLLAAARVAQRHGIYDRAI
NTAIKTVARHDFSLRYLAPYREQVHTVLQQHQLDEAFVYGLIRQESRFIS
DIKSSAGATGLMQLMPATAKWVAGKLGLQDFHTSLVTDINTNLQLGAYYL
KHVLGQLDDQPLLAAAAYNAGPGRARQWRDIRPLEGAIYAETIPFNETRD
YVKVVLSNSMYYANNFGHPNKPTLKQRLGTVAPKR
>NE1706 slyD, FKBP-type peptidyl-prolyl cis-trans isomerase (PPIase)
MRIVKNTVVSLHYKMIDQEGNVMEETEEPVSYLHGGYDGIFPAVEEALHE
KETGAALSLQMEPEDAFGEYDEELMRVEPRNAFPEEVAVGMQFEGGEEDS
DDFCIYTVREITDDVVVVDGNHPYAGMTFQFECTVTEVRPATAEELSHGH
VHGPHGHEH
>NE1968 smf, SMF family
MQIDQDIESWLRLGLTEGVGGGALRRLLIAFGDPARVLAASRPALEGVVK
KPVATSIFLRKVDEERLARTIKWLEDPLNSLITLADSDYPKLLLNISDPP
PILYFKGQRQFLAQPAMAMVGSRNATPQGLANADAFAEAASNAGFCIISG
LAQGIDTAAHQGGLRGASSSMAIVGTGLDLVYPSRNHELAHKLANEGGLI
SEFPLGTPAISRNFPRRNRIISGMCHACLVVEATLYSGSLITARLALEQG
REVMAIPGSIHSPLSKGCHALIKQGAKLVENIQDILDELHYQPQPVPRFE
SVADEGGGTGVLTGEGDDTGLLMYFSYDSTDIDTLCARSGLTVETVSAML
LGLELEGRIGSLPGGKYQRIR
>NE1967 smg, conserved hypothetical protein
MFDILVYLFENYFDAGNCPDSATLTRKLTMAGFDDEEITLALDWLSDLSR
HDEEGYLAGLAESDSMRHFTEEEMEIIDTEGRGFIFFLEQAGVINPLQRE
LLIDRVIRMDGDTASIEKIKLVVLFDLWIQNQLTDRSIVEGLFVVSDSHQ
RH
>NE0430 smpB, SmpB protein
MSIVQNKKAFHDYFIEEKYEAGIVLEGWEVKAIRAGRAQLKEAYILIRKG
ELFLIGSHISPLATASTHVNPDPVKTRKLLLHASQIRELIGKVERAGYTL
IPLDMHYKSGRIKLEIGLARGKKQYDKRETEKRKEWERDKQRLLRIRRSS
D
>NE0870 sodB, Manganese and iron superoxide dismutase (SODM)
MSTNLDNFSRAAQSGDTYVLAPLPYADNALEPVISAHTLSFHYGKHHKAY
VDNLNKLVAGTPFSGQSLEQIITATAGQADKAGIFNNAAQIWNHMFYWYS
LSPKGGGEPPAALKQKIEESFGSVEAFKQEFANAAITQFGSGWAWLAQEG
SKLAIIKTSNADSPLTRGIRPLLTIDVWEHAYYLDFQNRRPDYVNTVIDK
LINWEFAAANLG
>NE1998 sohA,prlF, HtaR suppressor protein
MPTTLEIESTLTDRYQTTVPESVRRALQLGKRDKIHYTIRPGGEVVLTRA
EPPEGDDPVIGQFLTFLALDIASHPERLQAVDASLVQHLQSLVGGIQVDL
NAALSVDDA
>NE1495 soj, ParA family ATPase
MQIVACYSNKGGVGKTATAVNLAYAFATSGRRTLLCDLDPQGASGFYFRV
KPSKKLTEARFFEDVEHFTKSIRGSDYDNLDILPANMSFRDFDVFLSKMK
DARSRLKKALKAVKGDYDIVLLDCPPNISILSENIFRAADAVVTPVIPTT
LSQRTFEQLLEFFREHDLPMEKIHAFFSMIQGTKTLHGEMIVELTHNYPK
RIMAAKIPFASEIERMGVVRAPVLATAPDSPAGKAYQALFDELLERIVP
>NE0909 speC, Orn/DAP/Arg decarboxylases family 2
MKKSFAAVQQQTETSIVFDTHPEVSIDLSLVEARLKQGYQKPFLLIDSNI
IRNKARRFKTSMPRVRPHYAAKANPDPRVLKTLIEEGVGFEIASIAELDL
LMSLGVPAAEIFYSNPMKSRAYIEYAAAKGVEWYVLDSIEELRKIVSIKP
DAKLYLRIDTPNIGSDWPLAGKFGTHLVDVSEIIDEAVRLKADLAGVTFH
VGSQCRNPQNWRVGIERAQTVFASMREAGLKPRLLNLGGGYPVRHVKPIP
SIEVIAEVINEAIADLPEDIHVMAEPGRYLVSDSACFVCRVVGTATRNGK
RWMYWDAGIFGGIIEVSEGLRYEILTQRNGSLIPWSVAGPTCDSVDVLMH
DEMLPEDIQENDFIFIPNAGAYTTSYASNFNGFPLPDVVVI
>NE0347 speE,ywhF, possible speE, ywhF; spermidine synthase
MSTSERPPSPSSMAEDFAVEPLSPDFGFYLRTTELLAERHSPVQHIEIVQ
TPLFGRAMRIDGCFMTSEQDEFFYHEPMVHLPAITHGDPRQALVVGGGDG
GTAYNLLRYPNMERVVLAELDRDVIDMARTWLPKVHRGAFEDPRLELHLG
DGRAFTGNCKNQFDQIVLDLTDPFGPAISLYTRDFYRACRRALKPGGVLS
LHIQSPIYRSPIMARLLASLRDVFGVVRPYLQYVPLYGTLWAMAMASDSA
DPLSLSATEIDARLTQNGLIDLKLYSGGTHHALLNLPPFVQTLLSEPAYP
IDDGNSLDDINLDPREAGKLKLIQT
>NE2473 spoOJ, ParB-like nuclease domain:ParB-like partition protein
MVKPKGLGRGLDALLAGNPPETDSLQNLDVGLLQSGKYQPRTRMDEASLH
DLAESIKAQGIMQPILVRPLMMGGYEIIAGERRWRAAQIAGLEQIPAIVR
EVPDESALALSLIENIQREDLNPLEAALGIQRLIEEFGMTHQTAGQALGY
SRSAISNLLRLLNLAAPVQDLIMQGEIDMGHGRALLVLDAGKQLEVAHLI
IQKQLSVRETENLLKRMNEMPSVKKRSFPDRDLLRLQEDISARLGANVII
KPGKKGTGNVVIHYASLEQLDGILAKF
>NE0368 spoT, spoT; bifunctional enzyme (p)ppgpp synthetase II and guanosine-3',5'-bisdiphosphate 3'-pyro
MQNPILNDSDDSAVISPEAELLISEVSQYLKPEDLALLKSAYFFSQKAHS
GQFRKSGEPYISHPITVARILGELRLDAVTLTAALLHDVVEDTGILKQEI
SERFGSSVAELVDGVSKLDKIRFQTQADMQAENFRKMLLAMAQDVRVILI
KLADRLHNMRTLEVMSPEKQHRIAQETLEIYAPIAHRLGLENIYQELQEL
GFCFSYPTRYKVLLKATKAARGNRREVVGKILDAIKQRLQEAELDAVVTG
REKHLYSIYKKMAEKHLSFSDVLDIYGFRVIVRDVSSCYVALGALHSLYK
PIPGKFKDYIAIPKPNGYQSLHSTLLGPYGLPIEIQIRTHEMHHIAEAGV
ASHWLYKSKNTDAGIDDLHMKANQWMKGLLETLNDSSDSLEFLEHLKVDL
FPGEVYVFTPQGKILTLPRGATVVDFAYAVHTDVGNCCVAARINGEITPL
RTRLKSGDRVEIITAPAAKPNPIWLSYVATGRARSSIRYFLRTIQYNESV
KLGERLLNQALHSFGVNPDAIEPSQWEKLVKESKAKSKEALLADIALGKQ
LAAVLAKRLAVPGESVSNIQSNNSITILGTEGMAVKFARCCHPIPGDGIV
GLIKKDQGLVIHMQDCPAVIRIKDHKNMENQLDVVWGTDIDRTFPVSIFM
TVVNKSGVLARVTAEIAKADSNIDDITLENDKDYTTMRFILQTRDRQHLA
QIFRRLKHIDEIVKMGRIKNL
>NE1213 sps, Glycosyl transferases group 1
MMTDQKLYILMMSVHGLVRGHDMELGRDADTGGQITYVVELARALGRNSH
IAQIDLLTRQIEDPNISPDYAAEIEELGPNARIVRLPCGPRKYLRKELLW
PHLDQMVDRCLHYLRQQGRLPDLIHTHYADAGYVGQHLSNLLGIPQIHTG
HSLGRPKRARLLASGRKEQAIERQFNLSRRIAAEEEVLVHASLIITSTSQ
EIEDQYGMYKNTDPRRCQVIPPGTDTSRFSPPGRKPLDPAIQAGIDRFLN
TPEKPVILTICRPDTRKNLHGLIQAYGSDPSLQDMANLVIIAGSREDIRA
MEESQRKIMNDVLLDIDRYDLWGKIAIPKHFMVEDVPEVYRLAVRRRGIF
VNSALTEPFGLTLIEAAASGLPIIAPEDGGPRDIITNCRNGLLVNTLNPS
DIASALKDALSDRKRWRNWSRNGIASVRRHYTWDAHVSKYLREADKLLYR
ERKRLRRQLAATLHAGRSPMPLARKVIISDIDNTLLGDEQGLAEFLQWLR
MHAGNISFGIATGRTVESAVRILKKWRVPMPDILITSVGSEINYWPSLRP
DKGWSNHIRHRWRREALAEALKEIPGLALQAPENQREFKLSYLVTPERMP
PLKQLYQHLHKQNLHAKLIYSHEAFLDVLPVRASKGLAVRYLAYKWGLPL
QSFLIAGDSGNDEEMLVGDTLGVVVGNHSPELESLRDREQIYFAKNTYAL
GILEGMKHYHFDQ
>NE1214 ss2, Sucrose synthase:Glycosyl transferases group 1
MTTIDTLATCTQQNRDAVYTLLRRYFTANRTLLLQSDLREGLLQTEQDCG
QSDMLRAFVFRLQEGIFSSPWAYLALRPEIAKWEFMRIHQEHLIPEKLTI
SEFLKFKETVVKGEATESVLEVDFGPFNRGFPRLKESRSIGQGVIFLNRK
LSSEMFSRIEAGHTSLLHFLGVHAIEGQQLMFSNNSHDIHAVRNQLRQAL
EMLETLDGTTPWIELAPKMNQLGFAPGWGHNANRVAETMNMLMDILEAPS
PSALEEFLACIPMISRLLILSPHGYFGQDNVLGLPDTGGQVVYILDQVRA
LEKEMHDRLQLQGVQVEPKILIVTRLIPDAGDTTCNQRLEKVSGCTNTWI
LRVPFRKHNGEIIPHWISRFEIWPHLEIFAGDVEREALAELGGHPDLIIG
NYSDGNLVATLLSRRLGVTQCNIAHALEKTKYLHSDIYWQENEDKYHFSC
QYTADLLAMNSADFIVTSTYQEIAGTREAEGQYESYQAFSMPDLYRVIHG
IDLFDPKFNIVSPGANADIYFPYSDPNRRLHSLIPEIESLIFDDATNLPA
RGYLQDPDKPLIFTMARLDRIKNITGLVELYAASPRLRSLANLVIVGGKI
DPQHSSDHEEQEQIHRMHQLMDEHELDQQVRWLGMRLDKNLAGELYRYIA
DKRGIFVQPALFEAFGLTIIEAMASGLPTFATRYGGPLEIIQNNRSGFHI
DPNQGAATADLIADFFEKNLENPQEWERISQGALDRVASRYTWKLYAERM
MTLSRIYGFWKFVSGLEREETDRYLNMFYHLQFRPLANRLAHEI
>NE2453 ssb, Single-strand binding protein family
MASLNKVMLIGNLGRDPEIRYMPSGDAMANLNIATTDTWKDKGGEKQERT
EWHRVVMFGKQAEIAGEYLKKGSQIYIEGRLQTRKWTDKSNVERYTTEIV
ADRMQMLGGRSGGGSYDPPADRDHDYQSQSTPPAKSNTGFDDMEDDIPF
>NE0812 sspA, putative sspA; transcription modulator protein
MMTLYSTATCPFSHRCRIVLHEKDMDFQVIDVDPNNIPEDIAVISSYSKV
PILVERDLVLYEANIINEYIDDRFPHPQLMPAEPVMRARARLLLHRFEKE
LFCHIESLEQGDHKTADKARAEIADGLTMIAPIFEKQKYMLGDEYTMLDV
AIAPLLWRLDHYGVKLPKQAAPLLKYAERLFSRPLFIDALTPSEKLMRK
>NE0813 sspB, putative stringent starvation protein B
MSDVSSIKPYLIRAVHQWCTDNMNCPHVSVLESGCSGIPAELFKDGEIIL
NISYQATSDLLIDNETIQFVARFNGVSRKVEIMIGAVIAIFARESGQGLT
FTPEISKTAVADKQEGDVDHAVSQDSQVLSIEGKRGKPSLKIIK
>NE0338 sss, Phage integrase:Phage integrase N-terminal SAM-like domain
MNQPHDERDTPLPPLLSEYLAYLASTRSLSLLTQHSYRRDLVALVCCIAA
QHQSEHENGHEVTDASLTRLHSHDIRHFIAHLHHGGLSGRSLARMLSAWR
GFYRYLMRHHHHTENPCQDIRVPKSPRKLPHALSPDEAAQLLAFDPADAL
ATRDLAMFELFYSSGLRLAELTRLQPTDIDFSEGIVRVTGKGSKTRIVPV
GEPALRALQAWLPLRSAWLTSGETALFLSRHGQRIHPRTIAVRLHQRARL
QNLDDRVHPHALRHSFASHLLQSSGDLRAVQEMLGHSSIRSTQVYTHLDF
QHLAKIYDQAHPRAKKRPKTG
>NE2375 sucB, sucB; dihydrolipoamide succinyltransferase (component of 2-oxoglutarate dehydrogenase complex) protein
MLIEVKVPALSESVAEATLINWHKQPGEYVERGENLIDIETDKVVLELPA
PQSGILAEIIRNDGATVTSGEIIARIDTAAKETKTAAQQPAPIDSGHLEI
TESTVASMHPAQPLMPSAKKAAEENGLTMEEIAAIHGTGRGGRITRQDVL
AHVRNKNSAVTDQQSDSRTDQSAAGIPQADTSPIPVDQTEKPDRLEKRVP
MTRLRMRIAERLVQSQSTAAILTTFNEVNMQAIMDLRARYKDSFEKEHGI
KLGFTSFFVKAVVAALKKFPIINASVDGNDIIYHDYYDIGIAVASPRGLV
VPIIRDADKLTFAGIEKQIADLARRAQEGKLTLEELTGGTFSITNGGVFG
SMLSTPIINPPQSAILGIHATKQRPVVENGQIVIRPINYLALSYDHRIID
GREAVLSLVAIKEALEYPVSPLFEG
>NE0050 sucC, ATP-citrate lyase/succinyl-CoA ligases:ATP-grasp domain
MKIHEYQAKAILSRYGVSVPAGVACFSVEEAVLAAEKLGGNKWVVKAQIY
AGGRGKGGGVKLAKSIAEVRQFAEAMLFAPLVTHQTGPQGRIVHRLYIEQ
GVDIRHEYYLALVVDRTSQCVSLIASDAGGMDIEQVAAGSPEKIHKIQID
PLAGLNLRDAERIVREIDLPESCQSAAITMLDALYRAFDENDASLLEINP
LIVTPDNRLIALDAKMNFDDNALFRHPEIVALRDLDEDDPLEAEASQHGL
SYIPLDGDIACLVNGAGLAMATMDIIKLYGGNPANFLDVGGGATVEKVTE
AFKLMLRNPGLQAILVNIFGGIMRCDVIAEGVVSAAREVKLTVPLIVRLE
GTNVELGKKILADSGLTIISAGNMADAARKAVDAAARHCAEAATGGV
>NE0051 sucD, ATP-citrate lyase/succinyl-CoA ligases:DUF184
MAILIDRHTRVMTQGITGKTGQFHTRQCREYAYGRDCFVAGVNPKKAGEE
LESIPVFATVEEAKQKTGATVSVIYVPPAYASAAIDEAVEAGLDLVVCIT
EGIPVRDMLRTRARMRGKKTLLIGPNCPGIITPGELKIGIMPGAIHQQGR
IGVVSRSGTLTYEAVAQLSELGLGQSTCIGIGGDPINGLKHIDVLKLFNE
DSETDAVLMVGEIGGTDEEECARWVKEHMKKPVVGFIAGVTAPPGKRMGH
AGAIIAGGKGTAQEKIEVMEACGIRVTRNPAEMGKLLKSVI
>NE0782 sufI, putative periplasmic cell division protein (SufI)
MTNIDFNRRRFLQYAALGTLTTSIPGFASTANRHLNQTPDKVFKPDVEIA
LTAQTAEIPILPGAATRVLKYTGKLLKGPQAAIKQLPGYLGPILNLEQGQ
RVRIFFYNQLPEPCVTHWHGMHVPQIMDGHPMYAILQGEQYVYEFEVKNP
AGTNWYHSHTHEMTARQVYQGLTGLITITDEAERKLDLPSGEFDIPLVIQ
DRSFTSGNQLHYSLTMRQRMQGFLGDTILVNGQRNHVIPVKTRAYRLRIL
NGSNARIYKLGWSDGSPITAIGTDGGLLEKPETFPYIMLSPAERVELWVD
FSGHKTGSELALQTLPYQGFSMGMGGGMRGGRGGMGMQEGAVDQGSKDTL
VRFTITEQVSDSPKLPDTLVPIHRLTVKDVSNPDKTVPINIGMRRMTFNL
NGRIFEMLDYTEQERIPLNTVQKIRISNANPAMGGGMGRGMRGGQGGGMM
GMMMALPHPIHLHGQQFQILSRKPGYADNAYATVKDGFINSGWKDTVLVM
PGEEVEIIKPFMDYTGLFLYHCHNLEHEDMGMMRNFFVS
>NE1954 suhB, suhB; inositol monophosphatase (extragenic suppressor protein)
MHPMLTIAVKAARRAGSIINRASMDLERLTISRKAHSDFVSEVDKAAEDA
IIKILLDAYPDHSILAEESGSRGNTRKPEYQWIIDPLDGTTNFLHGFPKY
SVSIALLHRGILTQAVVYDPVKDELFTATRGSGAFLNDHRIRVSKRVQLG
ECLIGTGFPFRDFTHMEAYLAIFRDMIPKTAGIRRPGSAALDLAYVAAGR
YDGFWETGLAPWDIAAGCLLILEAGGMVGDFEGNGSYMQSGQIVAGNPKI
FTQLLQIIAPHLTERLIAENREPVE
>NE0882 surA, PpiC-type peptidyl-prolyl cis-trans isomerase
MIQFSSDRQLKFRKYWLIYAVFATMLAADVFAQSSYSREDIKPIDRIVAV
VNEEVITQQEINEVLQNTVQQLQRQNTQLPRMEILEKQLLERLILKRIQL
QRAKEIGLTVSDNDLDQTLRRIIQDNHLTMDEFRQVLLQEGTDMNRFREE
IRGEILMSRLKEQEVNSRVNVTENEIDNFLQNQANSPAGNEEYRIAHILV
QISEQMDEAQIEARHKRAETAYESLRQGADFVRVSAEFSDAPDAMQGGEL
GWRPLGQLGSPFTEMLVNMQPGEVTPVVRSPVGFHILKLLERRQQEQKVT
IIEQTHAQHILIKVSELVSEEDAHQLINQLMERIHNGADFMDVAKAHSED
ASASAGGDLGWVSPGDTVPEFEQAMNALLPGQVSPPVRTPFGWHLIKVIE
RRSQDVSERQQREAARHTIHARKADAIVQEWLQQLRDQAYVEYKVEDN
>NE0950 surE, Survival protein SurE
MRILLSNDDGYFAPGIANLAKVLLEIADVTVVAPERDRSGASNSLTLDRP
LSLHKSHNGFYYVNGTPTDCVHLAVTGMLDELPDMVISGINDGANMGDDT
VYSGTVAAATEGFLLGLPSIAVSLVSMSRGNFPTAARIVVDLVKRFTENR
FHIPILLNVNVPDVPYDELQGVEVTRLGRRHKAESVIKYQTPRGETVYWV
GAAGAAQDAGEGTDFFALQNNRVSITPLQIDLTRYDQIGYVKNWLTL
>NE0154 tISRso8a, Transposase IS911 HTH and LZ region
MERLPKGIYTPEFRAEAVKLVEAEGLSVDAAAKRLLVPKSSLGNWVRASR
TGSLAKVGQGQRVPTETEIELARLRKELAEVKLERDLLKKCAAYFAKESR
>NE2136 tal, Transaldolase:Transaldolase subfamily
MKATQLLHNLGQSLWLDNITRDLLTNGTLKRYVDELSVTGLTSNPTIFNQ
AIRNSSAYEAGIRSGLEQSKSSEAIFLDLALEDLTRVADLLKPVYDRTHG
VDGWVSLEVSPLLAYDTASTIVAAKEICNRAARPNLLIKIPGTREGLPAI
EEAIFAGIPINVTLLFSREQYVDAAEAFLRGIERRIDAGLNPDVMSVASL
FVSRWDVAVAGKVPETLRNRLGIAIAGRTYKAARDLFDSPRWLRAYNAGA
RPQRLLWASTGTKDPAASPVLYVMSLVAPFTVNTMPEATLKALADYDDQS
LDYMPDDGGDCEQVLEQFARAGVDIDALAIRLQQEGAKGFVTSWQELMAV
IDSKCTALKRGKTD
>NE0637 tatC, Uncharacterized protein family UPF0032
MDDDITFISHLVELRSRLIRALLVLFLGFLPCAFFARELYSFLAFPLLEK
LPQGGQMIATEVVTPFFIPMKVALMVAFLITLPHTLYQLWAFVAPGLYSH
EKRLFLPLVFISSLLFFTGMAFAYFAVLPLVFEFIVYFAPEGVAVMTDIS
EYLGFVLSMFLAFGITFEVPVFVLILIRTGLVSIEKMKSVRRFVLVGAFV
IGAIFTPPDVMSQILLAVPLYLLYELGIILAALSMKSSSRKSPLPVTQNR
HNDTGENE
>NE0344 tctD, possible Response regulators consisting of a CheY-like receiver domain and a HTH DNA-binding
MRILVVEDDSLVASGIKQGLTNAGYTVDVARNAASAERHLREENFDLAVV
DIGLPDIDGLTLVQRLRHRKMCLPVLVLTARGSMEDTIAGLDIGADDYMT
KPFRLPELIARIRALIRRAHSITSTELQHDRLVLNTGSYTATLNDQPLLL
TRREWTILETLLMASPRVVSKDKLLQNLTGWDKNITPNAIEVHVSRLRAK
IAPGGIEIRTVRGIGYRIDQSHS
>NE2197 tex, S1 RNA binding domain
MTSTAPSAVSEPTQKKIINTIAEELDVAPRQISAAVMLLDEGATVPFIAR
YRKEATGNLDDTQLRLLEERLAYLRELEARRQVILASIEEQGKLTDELRH
AIDQAATRQLLEDIYLPYRPKRRTRAQIAREAGLEPLADTLLADPTLDPE
QEAAKYIKVVPAAEGVEAINVPDTKTALEGARDILAERFAETADLLVVLR
SKLWQEGVVTSTVVAGQEKAEEEKFRDYYAYTEPVRLIPSHRALALFRGR
ALGVLRIDLDLDETARESIPHPCIAMIATHFGIENRDRKADKWLGDVCQW
AWRVKVHLHLTTELLLQVREAAETEAIRIFSRNLRELLLAAPAGPKAVLG
LDPGYRTGCKVAVVDATGKLLETVTIYPHQPRNDWQGSLATLVQLVHRHG
VELISIGNGTASRETDKLATEVVRLVAEQSPESKLTKIVVSEAGASVYSA
SALAAAEFSDLDVSLRGAVSIARRLQDPLAELVKIEPKSIGVGQYQHDVN
QRILARSLDATVEDCVNAVGVDVNTASAPLLAQVSGLNRVLAQNIVSYRD
AHGPFSNRQTLLKVPRVGEKTFEQAAGFLRINDGDNPLDRSAVHPEAYPV
VERILARLKKGIGQVMGQPGTLKGLSAEEFTDETFGLPTVRDILTELEKP
GRDPRPEFKTAVFQEGIESITDLQPGMILEGVVTNVAAFGAFIDIGVHQD
GLVHVSALANKFIKDPHQIVKPGQVVKVRVLTVDAVRQRISLTMRIDDIP
ETISQPETRKARPDDRQSRRGSRSLQTASRTGKSEPAGALALALARAKEK
R
>NE1141 tgt, tgt; tRNA-guanine transglycosylase
MKFQLHCRDHEARRGTLTLAHGTVETPAFMPVGTYGAVKGLSPDELHTLG
AGIILGNTFHLWLRPGLEVIGAHGGLHRLMNWDGPILTDSGGFQVFSLGA
LRKICEEGVRFRSPVNGDTCFLTPEESMRIQQVLNSDIVMIFDECTPYPV
DMQIAESSMQLSLRWAERSKTAHAGNPNALFGIVQGGMYESLRDHSAAGL
CAIGFDGYAIGGLSVGEPKADMQRILRHTAPQLPADKPRYLMGVGTPEDI
VHAVAQGIDLFDCVLPTRNARNGWLYTSQGILRLRNSRYRLDTSPPDEHC
DCYTCRHFTRAYLHHLQRTGEMLGARLNSLHNLHYYQRLMANIRKAIETG
QFEQFARKFSGQDFMLKCASV
>NE0386 thdF, GTP-binding protein (HSR1-related):tRNA modification GTPase TrmE
MTSNDTIAAIATPPGRGGIGIVRISGTNLESLARGILGKLPDPRHAGLFS
FLDQNSQIIDQGIALYFPSPHSYTGEEVLELQGHGGPAVMNLLLDRCLQL
GARLAEPGEFTLRAFLNDKLDLAQAEGVADLIAASTANAARCAVRSLHGE
FSSTIHQLVSALIDLRVLVEATLDFPEEEIDFLQSAHAAEQLATIRAKLE
QVLVASRQGNLLQEGIKVVLAGQPNVGKSSLLNRLAGDEVAIVTDIPGTT
RDTVRQSIEIEGIPLHLIDTAGLRETSDIVEQHGIARTYAAIEQADLVLL
LVDSRHGVTEEDRSVLTRLPERLPVLTVHNKIDLSAQPPRLEENTSGPTI
YLSAINGEGIELLRAALLKTAGWQANIAGEGAYMARQRHLQALIQAKELL
ERAAAWLHRADQLEILAEELRLAQQALSSITGEFTSDDLLGEIFSSFCIG
K
>NE0038 thiC, ThiC family
MKTPVSKLEKFTNDNARVDTAAVQPLPNSRKIYIQGSRADIQVPMREITQ
SDTATGRGAEKNPPIYVYDTSGPYTDPDTRIDIRHGLPPLREKWIDERGD
TEILTGLSSSYSLKRLQDPALTKMRFNLMRPVRHARNGANVTQMHYARRG
IITPEMEFIALRENQRRERVADLASNELLNRQHPGQSFGAIIQEWITPEF
VRDEVACGRAIIPANINHPETEPMIIGRNFLVKINANIGNSALGSSIQDE
VEKMTWAIRWGGDTVMDLSTGKNIHETREWIIRNSPVPIGTVPIYQALEK
VDGKAEELTWEIFRDTLIEQAEQGVDYFTIHAGVRLPFIPMTAKRMTGIV
SRGGSIMAKWCLAHHQESFLYTHFEDICEIMKAYDVSFSLGDGLRPGSIY
DANDEAQFAELKTLGELTEIAWKHDVQVMIEGPGHVPMHLIRENMDLQLK
YCHEAPFYTLGPLTTDIAPGYDHITSAIGAAMIGWYGTAMLCYVTPKEHL
GLPDKDDVKDGIITYKIAAHAADLAKGHPGAQVRDNALSKARFEFRWNDQ
FNLSLDPDKAREFHDETLPQEGAKLAHFCSMCGPNFCSMKITQDVRDYAA
QQGISETVALQEGMARKANEFIEKGGELYSKQ
>NE1425 thiD, putative phosphomethylpyrimidine kinase
MSLPPPIVLSFAASDPSGGAGIQADILTLAGMGCHPLTVLTAVTVQDTTG
VEDIFVMDAEWVTDQARTVLQDMPVQAFKIGMLGSVEIISAIAEIISDYP
DIPLVLDPVLASGRGDTFANEEVIAAMREMLFPQATIVTPNSMEARRIAL
DDEDDPEILDLKQSADRLLQWGCGYVLIKGAHENTPEVVNILYDANGVVR
SDAWQRLPGSFHGSGCTLSSAIAASLAHGMSIMDSVYEAQDFTWHTLKAG
FRPGMGQYIPDRFFWTESENPEEETGGGTSS
>NE1783 thiF, NAD binding site:UBA/THIF-type NAD/FAD binding fold
MDDQQLLRYSRHLLLPEIDIPGQKKLTHSSVFILGAGGLGSPAALYLAAS
GVGKLTICDHDQVDLTNLQRQILHETASIGKFKTDSARNTLQRINPEIEI
ISLPEQATARLLNREIKSVDAVIDASDNFSTRHLINQACLAHRKPLISAA
AVRFTGQITVFDLRHPDSPCYHCLFPDSGDSDDPACAIMGVFSPLTGIIG
CIQAAETIKVLLGIGETLHGRLLLLDGLTMHWRSLKLSKDPQCPTCHSAS
VQNLSDSQ
>NE0284 thiG, Proteins binding FMN and related compounds core region
MEPLVIAGKSYSSRLLLGTGKYRDFAETRAAVDASGAQIITVAIRRTNIG
QNPDEPNLLDILPPSQFTLLPNTAGCYTAEDAVRTLRLARELLDGHALVK
LEVLGDQKTLFPDVVATIEAAKILVKEGFQVMVYTSDDPIVARQLEDIGC
AAIMPLASLIGSGMGILNPWNLQIIIDKATVPVIVDAGVGTASDAAIAME
LGCDGVLMNTAVASARNPILMASAMRKAVEAGREAYLAGRMPRKIYQASP
SSPAEGMFTGTQHPAANS
>NE2559 thiL, AIR synthase related protein
MNSEFDVIDRYFTRPRRNALLGPGDDAALIACKPGMDWAISVDTLVAGRH
FFPDVDPATLGHKVLAVNLSDMAAMGATPCWATLSLTLTETLAQDDIWLK
AFSSGFFALADQYQVELIGGDTTGGPLNISVQIIGEVERGKALRRSGAQP
GDDIWVSGHLGDAALALQSLQQHVTLTAQEAAPCLAALHTPMPRVALGKA
LVNVAHSAIDISDGLLADLGHILDRSGVAATIDFDHIPRSPALKNKLQPE
NTVTNTICQLAIDCLLAGGDDYELCFTVPESKRDAVIQLAQESGILLSRI
GKIIPGKGLTVLGTDSNPLIFKNKGYDHFSAR
>NE1471 thrB, putative homoserine kinase protein
MSVFTPVTKEQLAVWLKNYSLGSLIDLQGISSGIENTNYLVTTTQDKFIL
TLFEKLTSTELPFYLNLMAHLSEQSIPCPRPVESQNHRLLGQLNGKPACI
VTFLPGRSMVQVAEKQCAQVGEMLARMHLAGRNYSGWNQNPRGLNWWQTT
AETVMPFLSSSEQNLLDEELQFQAAQMTANLPQSVIHADLFRDNVLFTSD
GIGGVIDFYFACNDTLLYDLAITANDWCTLTDGIMDKTRMHALVTAYHAV
RPLTADEHSAWPAMLRAGALRFWLSRLYDYYLPRPGELTHKKDPGHFKRI
LEHHLSNPGVLPSFQA
>NE2370 thrC, thrC; probable threonine synthase protein
MSPRTFTEILLTGLAPDGGLAMPEAYPEISSATLESWRPLDYPSLAFEIL
SRFMDDIPADDLRTLIARTYTPEIFGSKDITPVQTLEPDLHLLHLSNGPT
LAFKDIAMQFMGNLFEYVLAKNNDELNILGATSGDTGSSAEYAMRGKQGI
HVFMLSPYGKMSAFQTAQMFSLQDENIFNIAIEGVFDDCQDIVKTVSNDQ
AFKQQYRIGAVNSINWARIAAQVVYYFKAYFAVTRSSDEVVSFSVPSGNF
GNIFAGHVARMMGLPIRKLILATNENDVLDEFFRTGLYRPRSTAETRQTS
SPSMDISKASNFERFIFDLTGRDAERVKELWKSVDQGGSFDLSGTDLWAK
IRDFGITSGTSCHAERINTIREIYRKFNVLIDTHTADGLKTGMALREPGI
PLICLETALPVKFADSIREAVGFELEQPAGYDNLESLPQRFERMAADAEQ
VKQFMASRIDSSDKINKS
>NE0958 thrS, thrS; threonyl-tRNA synthetase (threonine--tRNA ligase) protein
MTVVRLPDGTDRVFDNSVTVREVAESISPGLARAALAGKLNGKLVDLSEQ
IETDSDLVLITDKDSEGLEIIRHSCAHLLAHAVKELFPGTQVTIGPVIEN
GFYYDFSYERPFTPEDLVAIEKRMQEISKRALKIERKVWDRSRAINFFKD
IGEHYKAQIIESIPDNEPVSLYSQGDFTDLCRGPHVPYTSKIKVFKLMKI
AGAYWRGDSKNEMLQRIYGTAWVSNEEQNNYLRCLEEAEKRDHRKLGKQL
DLFHTQEEAPGMVYWHPKGWVVWQQIEQYMRQTLAGNGYVEIRTPQVLDR
SLWESSGHWENFRENMFITESENRHYAIKPMNCPGHVQVFNHGLRSYRDL
PLRLAEFGSCHRNEASGALHGLMRVRSFTQDDAHIFCTEDQILGEVTKFI
DLLNQVYINFGFSETLIKLSTRPLKRVGTEDQWDKAETALATALNQKELN
WEVQPGEGAFYGPKIEFTLKDSLGRKWQCGTLQLDFSMPARLGAGYIAED
NTRKIPVMLHRAILGSMERFIGILIEHHAGALPLWLSPEQVIILNISRNQ
AEYAQLITDELKQSGIRASSDLRNEKISYKIREHSMQKIPYLMVVGDKEM
ENRTVTVRGRAGQDYGAMSVESFVVRAQEEIAKRL
>NE0568 thyB, Thymidylate synthase
MYQYLDLMRHVLQYGHKKSDRTGTGTLSVFGYQMRFDLQTGFPLVTTKKC
HVKSIIHELLWFLRGETNIDYLKRNGVSIWDEWADENGDLGPIYGHQWRS
WAASDGTVIDQISQVIQQIKETPDSRRMIVSAWNVGDLDKMKLAPCHVLF
QFYVADGRLSCQLYQRSADIFLGVPFNIASYSLLTLMIAQCCDLQPGEFV
HTFGDAHLYLNHLEQARLQLEREPRALPAMQLNSTVRNIFDFGYEDFTLH
DYDPYPPIKAPVAV
>NE0030 tig, FKBP-type peptidyl-prolyl cis-trans isomerase (PPIase)
MQTQGEASNPLERNIELSVSREKVEAEVGVRLKRLAPKVKVQGFRPGKVP
LKIVAQQYGHQVEHEVLGELLQQQFNDAVNQENYRVAGIPGFESRNSDAN
GATSYEFRATFEIYPDIELGDLSSITVNKPVLQIGDAEIQKTLDILRKQR
ATYEPTDRPAQTGDRVTIDYRGVLDGEGFPGGQADDYSVILGNGHLLEDF
ESSILGMTAGQEKTFDMTFPADYPGKDVAGKKVSFTIRLNKLEAPKLPEV
DGEFAKSLGVAEGDIDKMRSEIKANLQREISQRIRTKLKEQVMQSLLDKV
LIQVPKVLIRQEADRLAEEMQNSRAARGFRKDQSLSGDVFLEKAERRVRL
GLILSKLIDTHELSVKPEQVRSFIEEYAQGYENPEQVIKWHYASPERLKE
IEPLILEDNAVSWLLDKAKIIDQSVTFDELMGYSHATNV
>NE1326 tldD, Putative modulator of DNA gyrase
MMDSFTIADQYLLAPYELNTGRLQDVFGHILTHQIDYADIYFQYSRSEGW
VLEEGIVKSGSFNIDQGVGVRAISGEKTAFAYSDDISSQALISAARATRA
IAAQGGGTHASITGLSGNDARQQALYYSSLDPIALCKDADKIGTLERLEG
FARTLDKRVIQVMASLAGEYEVVMVARSDGLLAADVRPLVRVSLQVIVEE
NGRREQGVAGGGGRFDYAYFTDAILQDYARKAVHQALTNLASQPAPAGSM
TVVLGSGWPGILLHEAIGHGLEADFNRKGSSAFSGRIGERVAAPGVTVVD
DGTIRDRRGSLNIDDEGNPTQCTTLIEDGILKGYLQDNLNARLMNQRVTG
NGRRESFAHIPMPRMTNTCMLNGNKEPEEIIASVKQGLYAANFGGGQVDI
TSGKFVFSAAEAYMIENGKITYPVKGATLIGNGPDVLTRVSMIGNDLALD
PGVGTCGKEGQSVPVGVGQPTLRIDGLTVGGTNG
>NE2181 tmk, Thymidylate kinase
MQRGKFITVEGIDGAGKSTHLAWLERFLQDKGLEVVVTREPGGTALGEAL
RQLLLDHRQAMHPETEALLMFAARREHLDKVILPALDRGAWVVSDRFTDA
SFAYQGGGRGVAQSKLDNLEQWVQAELSPDLTVYFDVPVIVGRERLQSTR
VADRFEMESNLFFERVRQAYLQRAEQFPQRIRVVDGSRLLAEVKTAVAEI
VEDFWSDLSDTQFRG
>NE0835 tnpA, Transposase Tn3 family
MPRRSILSAAERESLLALPDTKDDLIRHYTFSDTDLAIIRQRRGPANRLG
FAVQLCYLRFPGIILGVDQPPFPPLLKLVANQLKVGIESWDDYGQREQTR
REHLVELQNAFGFQPFTMSHYRQAVHTLTERAMQTDKGIVLADALIEHLR
RQSIILPALNAIERASSEAITRANRRIYEALSEPLSNGHRHGLDDLLKRR
DNSKTTWLAWLRQSPAKPNSRHMLEHIERLKAWQALDLPPGIERLVHQNR
LLKIAREGGQMTPADLAKFEPQRRYATLVALAIEGMATVTDEIIDLHDRI
LGKLFNAAKNKHQQQFQASGKAINAKVRLYGRIGQVLIDAKQSGGDPFAA
IEAVMSWDAFAESVTEAQKLAQPDDFDFLHRIGESYATLRRYAPEFLDVL
KLRAAPAAKDVLDAIEVLRGMNTDNARKVPADAPTDFIKPRWQKLVMTDA
GIDRRYYELCALSELKNSLRSGDIWVQGSRQFKDFEDYLVPPAKFASLKQ
SSELPLAVATDCDQYLDDRLTLLEAQLATVNRMAAANDLPDAIITESGLK
IMPLDAAVPETAQALIDQTAMILPHVKITELLLEVDEWTGFTRHFAHLKS
GDLAKDKNLLLTTILADAINLGLTKMAESCPGTTYAKLAWLQAWHIRDET
YSTALAELVNAQFRHPFAEHWGDGTTSSSDGQNFRTGSKAESTGHINPKY
GSSPGRTFYTYISDQYAPFHTKVVNVGVRDSTYVLDGLLYHESDLRIEEH
YTDTAGFTDHVFALMHLLGFRFAPRIRDLGDTKLFVPKGEASYDALKPMI
SSDKLNIKAIRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVA
LRELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFNRLGE
IRDRSFEQQRYRASGLNLVTAAVVLWNTVYLERAAHALRGNGHAVDDALL
QYLSPLGWEHINLTGDYLWRSSAKIGEGKFRPLRPLQPA
>NE0751 tnpR, Site-specific recombinase
MPGKRIGYVRVSSFDQNPERQLEGIQVDRVFTDKASGKDIQRPQLDMLLD
FVREDDTVVVHSMDRLARNLDDLRRLVQDLTGRGIRVEFVKEGLIFTGED
SPMANLMLSVMGSFAEFERALIRERQREGITLAKQRGAYRGRKKSLNSEQ
VAELKRRVVAGEQKALIARSFGISRETLYQYLKTVD
>NE0836 tnpR, Site-specific recombinase
MQGQRIGYVRVSSFDQNPERQLEHVEVGRVFTDKASGKDTQRPELDSLLA
FVREGDTVVVHSMDRLARNLDDLRRLVQKLTKRGVRIEFVKESLTFTGED
SPMANLMLSVMGAFAEFERALIRERQREGIALAKQRGVYRGRKKALSPEQ
VAELRQRAAAGEQKAKLAREFGVSRETLYQYLRLDQ
>NE0217 tolA, Proline-rich region
MVRLPGDNSEPGKLRAALFALLVHAAFLALLVFGLNWKNEVSEMMSVDLW
AELPRHPVEPPSSAAKVIPEPVKVKPQPQQKTQPQPQPVIKAAPPPVRKP
DIALKDKTEKPQLKEEVKKPEPVKKEVPKKEPTKKPVEKLVPKEEVKKPE
PVKKVEQKTEKKEDVRQQAEAQKQAQQREREAAAAKAERARADGEIEKYR
EMIKAKIRSRIIMPPDLPGNPAVEFTVTLLPGGDVLTVTLRKSSGYTAFD
EAVERAIYLAKPLPLPPDPGLFNAFRNLDLKVYYRE
>NE0218 tolB, probable tolB-related transport protein
MRNFLYCTGVLLLLWMSTSSQAALNIEIFGGGTNKIPIAIAPFNGERGLP
QSISAIVSADLERTGLFKLVDTLGLTSQEPEQIRYIDWQGRGASALVVGS
ISPLPDGRIDVRFRLLDVVKQTQLTGFSGAVTTEQLRAFAHRIADIVYET
LTGEPGAFSTRIAFVRKQGSQYALQVSDYDGFNARSLIEYTEPIISPAWS
PDGSRIAYTSFEKKKPVVYVQTLATRERKAVANFKGSNSAPAWSPDGSKL
AVVLTLHGGSQIYLINADGSGLQRISQSPGIDTEPSFSPDGRWLMFTSDR
GGSPQIYRMPVSGGVAERMTFEGDYNVSPHYSPDGKRFVYIHRNAGRFNV
AIQDLSTRQMQLLTDSNFDESPSFSPNNRMILYTTEIGGRGILSTVSSDG
QAKSRLSQEAGNIREAVWGPLLKQR
>NE0215 tolQ, MotA/TolQ/ExbB proton channel family
MSQVITQDLSFFHLVSGASLPVQLVMLLLLLASFVSWWFIFRKLFTLRQE
IKQTDEFENIFWRGSDLNALYQRAAGARHAVGSMERIFEAGFREFTKYQS
GVDIGPIMDSTRGAMRAVYQREMDRLESHLSFLATVGSVSPYVGLFGTVW
GIMNAFRELSNVGQATIAHVAPGIAEALIATAMGLFAAIPAVIAYNRYAS
DTAQLATRYESFIEEVSNVLQRRVAKPRDMANFS
>NE0216 tolR, Biopolymer transport protein ExbD/TolR
MIQRRSKRRLMNEINVVPYIDVMLVLLIIFMITAPLIQPSQIELPEIGKS
SAPPAEPLEVMIAANGNLTLRDRAGADKEQQVDRNQLVELIRARQAQNTD
QPVVIAADKNVRYEEVIQVMDLLQQQQIRKIGLLTKSK
>NE1966 topB, topB; DNA topoisomerase III protein
MSKKLIIAEKPSVASDIARALGGFVKQKDYFESDEFVVSSAIGHLLELIV
PEEYEVKRGKWSFDHLPVIPPRFDLAPIEKTTDRLKLLSKLIKRKDVDML
INACDAGREGELIFRYIVRHVGSKKPIKRLWLQSMTPSAIREAFANLLND
AEVQSLADAAVSRSEADWLVGINGTRVMTAFNSQEGGFHKTTVGRVQTPT
LAILVEREEAIKKFVVRDYWEVHATFQAESGVYKGKWFDEGFSKRKDESE
SRADRIWDHAKAEVIRDKCAGRTGVVTEESKPSRENCPLLYDLTSLQRDA
NSRFGFSAKVTLGLAQALYEKHKVLTYPRTDSRALPEDYPAIVKDTLQVL
KGSRYDRFASQILESDWVKPNKRIFNNAKVSDHFAIIPTALVPKKLNEAE
EKLYDLVTKRFLAIFYPAAEFLITTRITRVENEPFKTEGKVLVHAGWQTV
YGKVESAQGQEEESVLVAVTPGETVLAQEVAVVAGKTRPPARYNESTLLS
AMEGAGKLVEDEELRAAMSAKGLGTPATRAAIIEGLIHENYVERSGRELQ
PTAKAFSLVTLLRGLKIPELISPELTGDWEFKLRQIEQGQLKRDVFMEKI
AAMTRHIVEQAKNHRDKTISGDFATLQVPCPGCGGVIKETYKKFQCQQCD
FALWKILAGRQFEAAEMETLISTREIGPLSGFRSKMGRAFNAIVRLTDDY
EMKFDFGNEADQAQEKVDFSAQQPLGKCPQCGHSVYEHKLLYVCEKSVGA
GAPCSFRTGKIILNRAIEAEQVVKLLQTGRTDLLAGFVSRKGRPFSAYLV
VGPAGKIGFEFEQKKTKSKPADTVPETGKAAS
>NE1834 trkA, Potassium uptake system NAD-binding component
MKIVILGAGRVGSTVAESLASESNDITVVDLVRNRLSLLQERLDLRTVVG
SASHPDIMIQAGMEDADMILALTGSDETNLVACKLAASMFNTPTRIARIR
TVDYLDHPDIFSRENFCVDFAICPERILTKYIEKLIEFPGALQVLDFASG
KVSLVAVKAIRGGPLIGRQLKELRRHVPNLDTRVAAIFRHNHPIIPEGST
IVEEGDEVFFIASSSDIRAVMGELRRMDKSVRRVMIAGGGRIGRRLAEVL
QQSHQVRIIERDIKVCESLTQNLHNTLILHGNVTDEELLESENISEMDIF
CALTDDEENNIMAALMAKRMGARKVIALINRSMYVDLMQDSGIDIAISPA
QVTIGSLLAYVRQGDVAVVHSLRRGAAEALELVAHGDTRSSRVVGRKIEE
IKLPRSTTIGAIVRGLPPGDVYRNLSEQEEDSILAQWKQAQVIIAHHDTV
IEQNDHVILFVVNKKMVRQVEKLFQVNVGFL
>NE1673 trmB, tRNA (guanine-N1-)-methyltransferase
MPFEFDVITLLPDMFDAVTQHGVTGRAHKSNLYRLHTWNPRNYAMNHYRT
VDDSPYGGGPGMVMMAEPLDKAITDAKARQGEDGVSKTRVIYLSPQGKRL
DHKKVLQISQLDGVVLLCGRYEGIDERLIEDQVDEEISIGDYVISGGELA
AMVLIDAVVRQLPGALGDTRSAGQDSHTDHLLEYPHYTRPEVHKEKPVPR
ILLSGDHAKIERWRLQQSIGRTWLKRPDLLAEKYPEGLPDREKELLEEFK
QLRYRVVANQLMDNTKEQEQ
>NE0963 trmU, trmU; tRNA 5-methylaminomethyl-2-thiouridylate-methyltransferase protein
MNKSRVVVGMSGGVDSSVAALLLKQQGYDVTGLFMKNWEEDDTDEYCSSR
QDFLDAASVADILDIPLEVVNFSTEYRERVFNLFLKEYQAGRTPNPDVLC
NSEIKFRAFLDHALNLGADWIATGHYAQVHETDGLFQLLKGEDGNKDQSY
FLYRLNQQQLSHTIFPIGHLYKREVRKIAREHRLPNSTKKDSTGICFIGE
RPFREFLNRYLPANPGEIHTLDDQVVGEHLGVMYYTIGQRQGLGIGGTRQ
GSEQPWFVSGKDIKKNVLYVVQGHDHPALLRSSLTAADLSWISGTPPHQN
WVYAAKIRYRQTDAPCAITHFEHDSCQIGFAAPQWGITPGQSVVVYESKV
CLGGGVIIGSND
>NE0694 trpA, trpA; tryptophan synthase (alpha chain) protein
MNRIQSVFSQLKSQNRAALIPFITAGDPDATTTVALMHRLTQAGVDLIEL
GVPFSDPMADGPTIQRSSERALKHHISLKDVFSMVAEFRKTNQSTPVVLM
GYANPIEAMGYKDFVQTAGHAGVDGVLVVDYPPEECTEWVRYLKEQNIDP
IFLLSPTTPESRIRRVAELARGYVYYVSLKGVTGASHLDLHEVGDKLSQL
RSYINIPIGVGFGIRDEQTARRIAEQADAVVIGSRIVEEIEHSPAADLLA
NVGALVESLRRAIDAKSDHSSITEK
>NE0693 trpB, Tryptophan synthase, beta chain
MYDLPDKRGHFGPYGGTFVAETLISALDELCKQYEYYRDDAEFQAEFFHE
LKHYVGRPSPIYHAKRWSEHLGGAQILLKREDLNHTGAHKVNNTVGQALL
ARRMGKGRIIAETGAGQHGVASATVAARYGMECVVYMGSEDVKRQATNVY
RMKLLGATVIPVDSGSCTLKDALNEAMRDWVTNVSNTYYIIGTVAGPHPY
PMMVRDFQAIIGNEARQQMREEYGRQPDALIACVGGGSNAIGLFYPYIDE
ENVRMIGVEAAGKGIDTHEHAATLVTGRPGVLHGNRTYLIQDENGQIIET
HSISAGLDYPGVGPEHAWLKDCGRAEYVAVTDDEALAAFHALCRFEGIIP
ALESSHALAYAAKLAPTLNKDQLLLVNLSGRGDKDMATVAQQSGISL
>NE0012 trpC, Indole-3-glycerol phosphate synthase
MSDILDRILAVKKQEVAAARARKSLEGIRKQAEEMPAPRDFLQAIRGRIS
QHRAAVIAEIKRASPSKGVLRGRSEAPNPQGSGHAENSSGKNLIPQDFIP
AEIAASYARNGAACLSVLTDEQFFMGSADFLRQARAACDLPVLRKDFILD
EYQVYEARAMGADCILLIVAAFLSPVFQQDASVNQGDSALERMRILETTA
QALGMAILVEVHDADELDLALQLTTPLIGINNRNLRTFETTLDTTVQLVR
RIPSERIVVTESGIRIPADVEMMLSHHIYAFLIGETFMRAPDPGAALASL
FTTNLT
>NE0013 trpD, phosphoribosylanthranilate transferase
MNPQAILARILEQHEIPYEEMIELMRAIMSGNVSPVMTAALVTGLRIKRE
SIGEISAAAQVMRELAVRIEVPDASHLVDTCGTGGDGCNTFNISTTSAFV
AAAAGAQVAKHGGRSVSGKVGSADVLEAIGINLDQTPDQIARSITEVGIG
FMFAPNFHHAMKHAAPVRRELGVRTVFNILGPLTNPAGASNQLLGVYHAD
LTGVLAQVLLRLGSRHAMIVHGSDGLDEITLSGPTKIAELNAGEVREYSV
QPEDFGLERAALTSLQVNSTEDAQAMLLSVLDNHPGPARDIVLLNAGAAI
YVAGKADSWARGVETARDMLASGAAKQKMQALVEFSNQVSA
>NE2150 trpE, Anthranilate synthase component I and chorismate binding enzyme
MNHCITESEFNDLAAQGYNRIPLVLETFADLDTPLSIYLKLANQPYSYLL
ESILGGERFGRYSVIGLPAEIRLEACSSRVRVISGNETKEEIDTSDVLGF
IGTFLNRFKVAPHAELPRFSGGLAGYFSYDTIRYIEPKLAGHARPDTIQT
PDILLLLSEELVVMDNLSGKLYLIIYTDPTLENAYQTSKNRLRELLHKLR
EPLSIPVEKSTSSKVAVSEFPEADFIAAVEKAKHYILEGDIMQVVLSQRT
SKPYSASPLALYRALRSLNPSPYMFNYHLGNFHIVGASPEILVRLEKDTV
TVRPIAGTRPRGQDTQADLALAADLLADPKERAEHIMLMDLGRNDIGRVA
QTGSVKVTENMQIEYYSHVMHIVSNVEGKLKSGLNAIDVLRATFPAGTVS
GAPKVRAMEIIDELEISKRGIYAGAVGYLEFNGDMDLAIAIRTGLIKDGI
LHVQAGAGIVADSVPQSEWTETCNKARAVLRAAEIAENGLDSTIT
>NE0692 trpF, N-(5'phosphoribosyl)anthranilate isomerase (PRAI)
MRIRVKICGITRLEDAMAAVQHGADAIGFILWPQSERYISPEEAGRIVKC
LPPFVRAVGVYVNPDKSWVEETSATAGLDLLQFHGDESADFCSRFHLPYI
KAVRVRDGLDLLQYAQHYAGARGLLLDAYTAGIPGGTGHVFDWKLIPAEL
PLPWILSGGLHPGNITDAIGQTHLSAIDVSSGVEVAKGIKDVNKISAFMQ
RVRSCEDVRSS
>NE0014 trpG, trpG; panthranilate synthase component II (glutamine amido-transferase) protein
MLLMIDNYDSFTYNLVQYLGELGEEVMVVRNDEITLEAVQQLNPASVVIS
PGPCTPDEAGISVELINRFSGKIPILGVCLGHQSIGQAFGGRIVRAGKVM
HGKTSLVFHDGKGVFQGMPDPFVATRYHSLVIERSSIPDCLEISAWTEDG
EIMGVRHRALPVEGVQFHPESVLSEHGHLLLDNFLHGNSVQKHSSMCEA
>NE1727 trpS, t-RNA synthetase, class Ib:Tryptophanyl-tRNA synthetase
MFVDRVLSGMRPTGNLHLGHYHGVLKNWLALQQKYECLFFVADWHALTTH
YDSPEIIGQNSWDMVIDWLAAGVDPARATLFIQSKVPAHAELYLLLSMIT
PLGWLERVPTYKDQQEKLAGKDLSTYGFLGYPLLQSADILIYRANLVPVG
EDQAPHLEFTREISRRFNHIFGKEPDFENKVAQAIKKMGSRKGKLYSELR
NRYQEQGDESALIAARSLLEEQQNLSTDDHERLSGHLEGKSKIILPEPQT
LLTQASRMPGLDGQKMSKSYGNTIGLREDADSITRKVRTMPTDPARIRRS
DPGNPEKCPVWQFHQLYSSAETRNWVQQGCTSAGIGCLECKQPVIQAILD
EQAPILERARMYEKDPAQVRKIIADGCEKANWLAEETMRDVRAALGLGYL
>NE1034 trxA, Thioredoxin
MSQHIHYVTDASFESEVLQCPVPVLVDYWAEWCGPCRMIAPLLDEIASEY
GDRLKIAKLNIDENQSTPQKYGIRGIPTLMIFKNGNIEATKVGALSKSQL
TAFVDSHL
>NE1929 trxB, FAD-dependent pyridine nucleotide-disulphide oxidoreductase
MTTTRHCKLLILGSGPAGYTAAIYAARANLNPVLITGMAQGGQLTTTTDV
DNWPADVAGVQGPELMERFLKHAERFQTEVIFDHIHTANLSEKPFELIGD
QGTYTCDALIIATGASAKYLGLPSEEAFMGKGVSACATCDGFFYKNQDVA
VIGGGNTAVEEALYLSNIARKVTVVHRRDKFRSEKIMIDKLMGKVKSGKI
ELALDHVLEEVKGDDSGVTGIRIRHVRDDTARDLELQGVFIAIGHHPNTD
LFQGQLEMKNGYIITQGGNEGNATATSIPGIFAAGDVQDHIYRQAITSAG
SGCMAALDAERYLEKSA
>NE1140 tsaA, Queuosine biosynthesis protein
MLAIGIIRSMKTDDFDFHLPDELIAQFPLANRSDSRMLYVNGKHADLRDA
AFRNLPDYLKRGDVIVLNNTRVVKARLSGVKSTGGKVEVMVERILDTHRA
RVLIRASHALVIGSTLLLENKITAQVEAREQDIYTLCFMHSLPLIELLDQ
FGHTPLPPYIGRNATVSDENRYQTVFAQESGAVAAPTAGLHFDETMLQTL
RTLGVKIIYVTLHVGAGTFQPVRVQNIDQHVMHTEPYHIPTETVEAIQMC
KSAGGSVLAVGTTSLRALESCMLTNDGTLVAGAGETNLFITPGFRFRIVD
RLLTNFHLPRSTLLMLVSAFAGMETIRHAYRHAIDNHYRFFSYGDAMLID
SQP
>NE1717 tsf, Ubiquitin-associated domain:Elongation factor Ts
MAEITASMVKELRELTGLGMMECKKALVEADGDMKAAEDLLRIRSGAKAS
KAAGRIAAEGVISGFVTADGKQGALVEVNCETDFVAKNEDFINFAGNLAR
LMADKNISDTALLEDMSIAEGETVESVRKALVMKLGENISIRRGISYQTK
DHLAMYLHGSRIGVMIDYAGGDEALGKDLAMHIAASKPVCVSSDQVPADA
LERERQIFTAQAAESGKPANIIEKMVEGRIAKYLAEVTLLGQPFVKDPDQ
TIEKLLKTKSAKVSGFTLYIVGEGIEKKSDDFAAEVMAQVTQSK
>NE2456 ttuD2, putative hydroxypyruvate reductase oxidoreductase protein
MNPRELLLDSFRAAIDAADPQKIVPAHLPEPPSGNTLVIGAGKAAAAMAR
AVETHWPPGNHLEGIVVTPYRHGLATNSIMVIEASHPIPDTRSEMASRAI
LESVRMLKPDDLLLGLFSGGGSSLLSAPVPGVILEELKSITHQLLLCGAS
IQEINIVRKHLSTVLGGKLAAASRAPVRCLIISDVTDNDPSSIASGPCAP
DPSTWQDVLTLIERYAITVSPQVIEVLRENGRKSSNGGEGETPKPGDPVF
RNVKNRIIATARQSLAAAAQFFQAQGITPLILGDTVTGEAREVAKMHAVI
AREIRHYNNPYRPPVALISGGETTVTVKGAGKGGRNAEFLLSLAIALGGL
EEVYALACDTDGIDGSETNAGAVMTPDTLSRARQAGIDPTALLDDNDAYT
FFEKQGDLVVTGPTYTNVNDYRAILIV
>NE2052 tuf2, GTPases-translation elongation factors and sulfate adenylate transferase subunit 1
MAKSKFERVKPHVNVGTIGHVDHGKTTLTAAITTILTKKFGGEAKSYDQI
DSAPEERARGITINTSHVEYETDKRHYAHVDCPGHADYVKNMITGAAQMD
GAILVVSAADGPMPQTREHILLARQVGVPYIIVFMNKADMVDDAELLELV
EMEIRELLSNYDFPGDDTPIIIGSALKALEGDKSDIGEAAILKLAEALDS
YIPEPERAIDGAFIMPVEDVFSISGRGTVVTGRVERGIVKVGDEIEIVGL
KPTIKTVCTGVEMFRKLLDQGQAGDNVGILLRGTKREEVERGQVLAKPGS
ILPHTKFTAEIYVLSKEEGGRHTPFFAGYRPQFYFRTTDVTGSIELPAGV
EMVMPGDNISVTVNLIAPIAMDEGLRFAIREGGRTVGAGVVAKVIE
>NE0399 tuf2, GTPases-translation elongation factors and sulfate adenylate transferase subunit 1
MAKSKFERVKPHVNVGTIGHVDHGKTTLTAAITTILTKKFGGEAKSYDQI
DSAPEERARGITINTSHVEYETDKRHYAHVDCPGHADYVKNMITGAAQMD
GAILVVSAADGPMPQTREHILLARQVGVPYIIVFMNKADMVDDAELLELV
EMEIRELLSNYDFPGDDTPIIIGSALKALEGDKSDIGEAAILKLAEALDS
YIPEPERAIDGAFIMPVEDVFSISGRGTVVTGRVERGIVKVGDEIEIVGL
KPTIKTVCTGVEMFRKLLDQGQAGDNVGILLRGTKREEVERGQVLAKPGS
ILPHTKFTAEIYVLSKEEGGRHTPFFAGYRPQFYFRTTDVTGSIELPAGV
EMVMPGDNISVTVNLIAPIAMDEGLRFAIREGGRTVGAGVVAKVIE
>NE2276 tviB, UDP-glucose/GDP-mannose dehydrogenase family
MQFQDIRLAVVGLGYVGLPLAVEFGRKRSVVGFDINQRRIDELKNGNDFT
LETTREELAAAKYLTYTTNIDDLRDCNCYIVTVPTPIDEHKRPDLTPLIK
ASETVGKVLKKGDIVIYESTVYPGCTEEDCVPVLERVSGLKFNQDFYCGY
SPERINPGDKEHRVTTIRKVTSGSTSEVADLVDELYNEIITVGTHKAESI
KVAEAAKVIENTQRDLNIALINELALIFNKMGIDTEAVLKAAGSKWNFLP
FRPGLVGGHCIGVDPYYLTHKAQAIGYHPEIILAGRRLNDSMGTYVVAQL
VKAMTKKRIQVEGARVLVMGLAFKENCPDLRNTRIVDIVAELKDYNCVVD
VYDPWVSLQEAQHEYGITPIGTPKPGSYDAIILAVAHHQFKDMGASAIRA
LGKPDAVLYDLKYVLDPQGADLRL
>NE2554 typA, GTP-binding elongation factor:Elongation factor Tu domain 2
MTRLLRNIAIIAHVDHGKTTMVDKLLHQAGTFAAHQTVAERVMDSDDIER
ERGITIFSKNCAIDYAGVHINIVDTPGHADFGGEVERVLSMVDGVLLLVD
AVEGPMPQTRFVTRKALASGLRPIVVVNKIDRPGARPDWVVNQTFDLFDK
LNATEEQLDFPVVYASALNGYATLDANQPGSDMRPLFDMILKHVPAPVGD
PDQPLQLQISALDYSSFVGRLGIGRINRGRLKPGQEVMVLTGERAPKKAR
VNQVSGFRGLDRIQLSEAAAGDIVLISGIEELGIGTTLADIDHPEALPMP
VVDEPTLSMNFQVNTSPFAGKEGKFVTSRQLRERLEKELLTNVALRLEET
GETDSFLVSGRGELHLTILLENMRREGYELAVSRPKVVIREIDGEKCEPF
EVLTVDIDEANQGAVMEALGTRRGDLLDMVTDGRGRVRLDYRIPARGLIG
FQSEFMTLTRGTGIMSHVFDEYAPMRPDMASRKNGVLISAEMGEAVAYAL
WKLQERGRMFVSPGEPLYEGMVIGIHSRENDLVVNPIKGKQLTNIRASGH
DEAVVLTPPIQLTLESAIEFIADDELVEITPKSIRIRKRFLLEHERKRAS
RSAA
>NE0337 tyrA, Prephenate dehydrogenase
MAFPAISKLVVVGVGLIGGSFALALRRAGLVDRVVGMGRSPENMQRALEL
GIIDEQTSDFAAALSGADFVLLAIPVKQTAGVMQQMAPHLKAHTIISDVG
STKQNVVHAARANLGKRIERFIPAHPIAGTEFNGAEAAFPDLFQDKPVIL
TPLQENDQQIVDRVADLWQHCGASVSSMLPEQHDQLLAAISHLPHMLAFS
LMQHIRTLSHTLSEGDPLALLRFAGSSLNDMTRITASSPEMWRDICLENR
AALLAQIEAYQQELSGLQQMLADHDGESLEKLFAEARAIRQAWSAFRNQS
>NE1838 ubiA, UbiA prenyltransferase
MTSVNMTFADRLSAYTRLIRLDKPIGILLLLWPTLWGLWLAADGMPDPMI
LVIFILGTILMRSAGCAINDFADRRIDPHVSRTRNRPLAAGLITAREALL
IAAGLSLCAFLLILPLNLLTIQLSVPALFLAASYPFTKRFFAMPQAYLGI
AFSFGIPMAFAAQTGTVPPLAWLLVLANLFWVIAYDTEYALVDRADDLKI
GIRTSAITLGRFDVAGILLCHIIFLSTLTYAGILLQRGIWFYGALLVALG
LVIVQYGMIRKREPSRCFQAFLHNNWIGAVIFAGILLDTLFRTDQSF
>NE2547 ubiG, ubiG, 3-demethylubiquinone-9 3-methyltransferase
MDNDGINADPMELEKFSQLAHHWWDPNSEFKPLHEINPLRLNYIDEIIGG
LSEKTVIDVGCGGGILSESMAARGASVTGIDLSDKALKVAKLHLLESGNQ
VDYRKITVEALATERPRYYDVVTCMEMLEHVPDPASVIQSCARLVKSGGW
VFFSTLNRNPKSYLYAIIGAEYILRLLPRGTHEYAKFIKPSELAHMARSA
GLMESGIVGMTYNPITKVYALEADTSVNYIMAFRA
>NE0873 ubiH, Aromatic-ring hydroxylase (flavoprotein monooxygenase)
MALALTLREYEFSVLLLEARELPEKVDDPRPLALSYGSRLILQRLDIWKN
LTQPAPITTIHISNRGHFGHAVLTPDDAGVPALGYVVNYHDLYHSMGRAI
RGSNAVYQTNALVTAIRTDLSLAQVAYQQGSTTRQVRARLLVLADGGKLA
EQIDGVRYHTYDYQQTAIVANVKVESPRPGTAFERFTLNGPLALLPSGEG
YALVWTMPAAAAEKILKLDDRSFLLHLHEQFGDRMGEFTHVGPRSSFPLI
LKHASTITGHRAVLIGNAAQALHPVAGQGFNLGLRDAWELASEISRIPSG
TPGEPGSTRMLAAYSRRRQSDKRISRLFTDSLVKLFTADLPLVQTCSGMG
LAALDNLPPARRLIAHQMIFGTKGWFIG
>NE1343 udg, UDP-glucose/GDP-mannose dehydrogenase family
MKKITIAGTGYVGLSNAMLLAQHHAVTALDIDAHKVELLNNRQSPIEDAE
VQNFLAHKSLNFTATQDKQQAYAEADWVIIATPTDYDPQTNYFNTRSVES
VVRDVLAINPRAVMVIKSTVPVGFTRRLREETGSGNIFFSPEFLREGKAL
YDNLHPSRIVVGECSDRAQVFADLLREGAIKKDVPVLLTDSTEAEAIKLF
ANTYLAMRVAFFNELDTYAAVHGLDTRQIIEGVGLDPRIGGHYNNPSFGY
GGYCLPKDAKQLLANYAQVPQNLVQAIVEANRTRKDFVADDIVRRNPKVV
GIYRLVMKAGSDNFRASSVQGVMKRLKAKGIEVVVYEPVLDTPDFYKSRV
IRDLAAFKAMSDVIVANRCSAQLDDVKDKVYTRDLFGNDL
>NE2455 uvrA1, ABC transporter:Excinuclease ABC A subunit
MELIRIRGARTHNLKNIDLDLPRNQLIVITGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLQLMEKPDVDLIEGLSPAIAIEQKATSHNPRSTVGT
VTEIHDYLRLLFARVGEPHCPEHGIGLAAQSVSQMVDQVLQLPTDTRLMI
LAPVVTGRKGEQAELFDELRAQGFVRVRLDGEVYDIDALPKLQKTKKHTI
EVVVDRLKISPEVKQRLAESFETALRHAEGRALAVEMDSGKEYLFSARFS
CPVCSYALSELEPRLFSFNNPAGACPKCDGLGQITFFDPARVVAFPYLSL
AAGAIRSWDKRNQFYFQMLQAVANHYHFDLEIPFEQLSKEVQQAVLYGSG
KEKITFTYLNEQGRAHQQVHPFEGIIPGLERRYRETESQTVREELAKFIN
ARECPECGGTRLCREARHVTVNGETIFAISAWPLRQAKQFFDDMELTGHK
QSIAERIIREISSRLQFLNNVGLDYLSLDRSADTLSGGEAQRIRLASQIG
SGLTGVMYVLDEPSIGLHQRDNERLLDTLRHLRDLGNSVIVVEHDQDAIL
LADHVVDMGIGAGEHGGCVVAEGTPTAIQANSASLTGQYLSGKRSIAIPS
TRTPPNPERMLTIRGAAGNNLKQVQLNLPVGLLICVTGVSGSGKSTLIND
TLYRVVARHLYGSHTDPAAYQEIDGLGFFDKVIDINQSPIGRTPRSNPAT
YTGLFTPVRELFAGVPQARERGYSPGRFSFNVKGGRCEACQGDGVIKVEM
HFLPDIYVACDVCHGQRYNRETLEIQYKGKNIHEILQMTVENAHAFFEAV
PTIARKLQTLLDVGLGYITLGQSATTLSGGEAQRVKLSLELSKRDTGRTL
YILDEPTTGLHFQDIDLLLKVLHRLRDNGNTVVIIEHNLDVIKTADWIID
LGPEGGAGGGRIIAEGTPETVASIPGSFTGYFLQPLLSTTLTG
>NE0785 uvrB, Helicase subunit of the DNA excision repair complex
MIITFPGSPYKLNQAFQPAGDQPEAIRILVEGIESGLSFQTLLGVTGSGK
TFTIANMIARLGRPAIIMAPNKTLAAQLYAEMREFFPENAVEYFVSYYDY
YQPEAYVPSRDLFIEKDSSINEHIEQMRLSATKSLLEREDAIIVATVSCI
YGIGDPVDYHGMILHVREHEKISQRDIIQRLTGMQYQRNEFEFARGTFRV
RGDVLDVFPAENSETALRISLFDDEVESMTLFDPLTGQTRQKVSRYTVYP
SSHYVTPRSTTLRAIETIKTELTGRLNYFHENHKLVEAQRLEQRTRFDLE
MLNELGFCKGIENYSRHLSGRQPGDPPPTLIDYLPDNALMIIDESHVTVP
QIGGMYKGDRSRKENLVAYGFRLPSALDNRPLRFEEFEKLMPQTIFVSAT
PADYEIQRSGQIAEQVVRPTGLVDPVIIIRPVTTQVDDLMSEVSLRAAQN
ERVLVTTLTKRMAEDLTDYFSDHGIRVRYLHSDIDTVERVEIIRDLRLGK
FDVLVGINLLREGLDIPEVSLVGILDADKEGFLRSERSLIQTMGRAARHV
NGTVILYADKITNSMRRAIDETERRRNKQKLFNQQNNITPRGVNKRIKDL
IDGVYDSENAAEHRKVAQIQARYAAMDEAQLAKEIQRLEKSMLEAARNME
FEQAAQYRDEIKNLRSKLFIGIIDPDEIREVPQTAGKKSRRKAGR
>NE0933 uvrC, uvrC Nuclease subunit of the excinuclease complex
MPDAHFDGKAFVLTLPAQPGVYRMLNAAGDVIYVGKAIDLRKRVSSYFQK
SGLSPRIQLMVSQIAGIETTVTRSEAEALLLENNLIKSLAPRYNILFRDD
KSYPYLLLTRHIFPRLAFYRGALDDRHQYFGPFPNAGVVKSSIQLLQKVF
RLRTCENSVFDHRTRPCLLYQIKRCSGPCVGLITPEAYQQDVKSAAMFLQ
GKQDEVLKTIEQKMFTASDQQDYEQAAQLRDQMQALRKIQEKQFVDSGKA
LDADVIACAIEPDSHAVAVNLVMIRSGRHLGDKTFFPQNVYEADISTVLE
AFVTQHYLNRSVPPLIILGQKIRVTLLQKLLSDQAGHKITLTTNPIGERR
KWLDMAAENAQLALQQMLIQQASQEDRLQALQEALNLPGLARIECFDISH
TMGEATIASCVVYDRFAMRNGEYRRYNITGIVPGDDYAAMRDVLQRRYAK
LAMEEGKLPDLILIDGGKGQIRVASEVMIELGLNDIPLVGVAKGETRKPG
LEQLILPWQEEALHLPDDHPALHLIQQIRDEAHRFAIQGHRAKRAKTRKI
STLEQISGIGTKRRQSLLTRFGGLKGVKNASIEELQQTEGISRSLAEKIY
RELR
>NE1473 uvrD, UvrD/REP helicase
MTALLTDLNPEQLEAVTWSHQSALVLAGAGSGKTRVLTTRIAYLLQSGRT
RPQNILAVTFTNKAAREMVARIGAMLPVNTRAMWVGTFHGLCHRVLRAHH
EDAGLPQAFQILDMADQLAVIKRVLKERSLDEKMLPPRQLQWFINNAKEE
GLRASQVDVHGGFNQTLAECYQAYEIVSMREGTVDFAELLLRCYELLSRN
EILRDHYRSRFEHILVDEFQDTNRLQYKWIKLLAGPGSQQHAAIFAVGDD
DQSIYAFRGAHVGNMRDLEKDFSVPKIIRLEQNYRSHGNILDAANALIEH
NKGRLGKNLWTAAGKGEPVRVYHAATDMDETSFIIDEIKALHADGLALSD
IALLYRSNAQSRVLEHGLFNASVSYRVYGGMRFFDRQEVKHALAYLRLIA
LPDDDNALLRIINFPPRGIGARTLEQLQDQAAMLGTSLWQAAFKVYEGGK
AVATRNSQPGRGIAGFVSLVLSMQQDGEGLPLPEIIRRVIDQSGLAAHYQ
AEREGGERLENLKELINAATSFVHESEDDSLTAFLAHASLEGGEHQAEGY
QDAVQLMTVHAAKGLEFHSVFISGLEEGLFPHENSRNEPDGLEEERRLMY
VAMTRARQRLYLSYAESRMLHGQVRVNIPSRFIDEIPQDLLKRLRSDFSG
RSFRQGVSGTGQTVASTINSSQKGRSTMAAAVGMTSSGLNSAGFHVGQQV
SHAKFGTGIILNYEGSGTDMRIQVNFHQAGTKWLSLAYAKLEPL
>NE1422 vacJ, putative lipoprotein precursor (vacJ) transmembrane
MFTCSLCFRKLLILLALGSVLSACATVDNRHDPLESMNRAVFSFNEKLDE
VVLEPVARGYQRFIPSPVRMIVGNFFSNLDDVVVAANSLLQLKFMDAMAS
TTRIVINTTFGMLGAIDLASDISQVSDIDISKRNEDFGQTLGFYGIGSGP
YLVLPILGPSTVRDTVGIGVDSTIFHPARAIYTSFDLLTVRLPVTTVEFI
DRREQLLDLDKTLEEASLDKYEFVRDAYLQRRESLIKNDGAASEEEPVNF
DEEEIIIDD
>NE1169 vacJ, possible lipoprotein precursor (vacJ) transmembrane
MRNLLLCLVLICTSCGSIPREGHQEGVYDPYETFNRKSYRFTDAIDRRLL
EPIADVYLKYVPEAIQRPVSNFYRNLSYPSVVLNSFLQGKIHQGFSDTMR
FVINTTIGVGGLADMAGHMGFPEHNEDFGQTLAVWGMNSGSYLFIPIYGP
STSRDVADLPVSVFTDGLYYLAYVLSGPVIIPLAVLRIVDKRAGLSGPMK
VRDESALDPYLFVREGYMQQREYLIHDGNLPASHYDDLLIDDSGLIADPE
KTDSAP
>NE0444 valS, probable valyl-tRNA synthetase (valine--tRNA ligase) protein
MELAKSFDPKEIETRWYSTWETAGYFSPTDHKGADPYCIMLPPPNVTGTL
HMGHAFQHTLMDALIRYHRMLGDNTLWQPGTDHAGIATQIVVERQLDQEG
KDRRQMGREAFLERVWQWKEESGSTITRQMRRMGASCDWSRERFTMDETL
SRAVTEVFVRLYREGLIYRGKRLVNWDPVLQTAVSDLEVVSVEEQGSLWH
ILYPFEQLEGGSEHQGLVVATTRPETMLGDMAVAVHPEDERYRHLIGRHL
RLPLSERTIPIIADSYVDPAFGTGCVKITPAHDFNDYQVGLRHKLIPLNV
FTLDGKINDNAPAEFQGLDRFDARKKVIADLQAQGLLVETRPHKLMVPRG
DRTNTVIEPMLTDQWYLAMEGLAKQGLAVVASGKVRFVPENWTHVYNQWL
ENIQDWCISRQLWWGHRIPAWYDEDGNVTVAHDLEEARKLSGKEILRQDD
DVLDTWFSSALWPFSTLGWPEQTPELKTFLPGSVLVTGFDIIFFWVARMV
MMSLHFTGEVPFREVYITGLIRDAEGQKMSKSKGNVLDPLDLIDGVSLTE
LIRKRTTGLMNPKQAESIEKTTRKQFPQGIPAFGTDALRFTFASLASHGR
DIKFDLQRCEGYRNFCNKLWNATRFVLMNCEGKDTGLDETLPLNFSQADK
WIVGRLQQAEQGVVQAFDEYRFDLAAREMYEFVWDEYCDWYVELAKVQLS
SADEAHQRATRRTLVRVLEAALRLAHPIIPFITEELWQAVAPLAGKTGTS
IMHQPYPQADPARMDEHAAANVQLLKEIVNACRTLRGEMKLSPAERVPLL
IEGDPSRLADFSPYLLALAKLSEVTILPDRLPDTDAPVAITGEFRLMLKI
EVDVAAERERLNKEMQRLTLEIGKARTKLDNPDFLQRAPEKVVLQEKERL
ATFSATLEKLDQQLHKLG
>NE0287 vapC, PIN (PilT N terminus) domain
MLILDSNTISYYFRGDPQVVLRLQAQRPQDVAVPAIVEYELRYGLLRLPP
EMAAPRLAALTTLLLPMQKLPFDSECADHAARIRTTLEAAENPIGPHDTL
IAATALRHGATLITRNVREFSRVPGLQWINWHEG
>NE2249 wbpI, UDP-N-acetylglucosamine 2-epimerase
MQKKVYLIAGARPNFMKIAPIVRALQQQDRLTYKIVHTGQHYDREMNDVF
FEELGIPQPDIFMAAGGGSHSQQTAKIMVAFEEYCQTEPPDAVLVVGDVN
STLACSIVAKKLHIPVAHVEAGLRSGDMCMPEEINRLVTDSITDWFFVTE
PSGQQHLLQEGKPASAIHYVGHVMVDNLLYQVEQLARADTSNFETSQLKT
SLGNQSYGVVTLHRPSNVDSPDALERISLTLKQIAKRLPLVFPAHPRTQN
NLKKFNIDLGPNILLMGPQAYMPFLHLWKDATLVLTDSGGLQEETTALGV
PCITIRENTERPITVNEGTNILAGTRPERILAAVDDILGGRGKQGRRPHL
WDGNAARRIVEILTRELHP
>NE0502 wbpM, Polysaccharide biosynthesis protein CapD type
MLEKYLAQLASPVLALPRTAKRMVVLSVDLSLCILTVWLAYYLRLGEFIS
FSGQGYWATGAVRAAAASVVLALPIFIVTGLYRAIFRYSGWPALLAVARA
IGIYGLAYASMFTVIGLPGVPRTVGIIQPILLLLFVGASRAMARVWLGDQ
YQNILKRASQPKALIYGVGRTGRQLASVMTNSPEIQVVGFLDDDDRLHGH
VLNGLPIYNPGDLTGLVLTLNISDVLLAMPGISRRRRNEILSQIRSARVS
VRTLPSMADLIQGRVSIADLRELDIDDLLGREPVMPNHILLAKNILDKVV
LVTGAGGSIGSELCRQILAVGPARLLLIEQNEFALYLIHQELEEKLASRE
IVLVPLLASVQDEDRMCEIMSTWHPDTVYHAAAYKHVPLVEHNPAEGIKN
NALGTLCLAQAAEENGVADFVLISTDKAVRPTNIMGASKRLAEMVLQALA
DRGSATRFSIVRFGNVLGSSGSVVPRFRRQIREGGPITLTHPEITRYFMT
IPEAAQLVIQAGAMAKGGGVFVLDMGQSVKIIDLAHRMVELSGLKIRDEQ
NPEGDIEIVITGLRPGEKLYEELLIGDDLEPTSHPRIMKAREEFVPWADL
EDKLSALEVALNVNDVSVIRLMMEQLVPGYTPSQEIVDWVYLAQEAEIQV
PGLGN
>NE1805 wza, putative polysaccharide export protein, outer membrane
MNISIKAWIVSLSVIFYLLALGGCTTYPLLSTQGEQVPHDYLIGPGDTLN
IIVWRNPEISMSVPVRPDGKITTPLVEDLPASGKTSTELARNIEETLSKY
LQQPVVTVVITGFVGPFSEQIRVIGEAAQPQALPYRENMSLMDVMIAVGG
LTDFAAGNRARIIRNVDGEQQQFRVRLEDLIRDGDISANVPVYQGDVLVI
PESFF
>NE0483 wzt, probable ATPase component ABC-type polysaccharide/polyol phosphate transport system
MNQEQNDIAIRVSNLSKCYHIYDTPRDRLKQFILPRLHHLTGQTPRQYFR
EFWALRDVSFEVKKGETIGIIGRNGSGKSTLLQMICGTLNPTSGSITTYG
RIAALLELGSGFNPEFTGRENVYMNASVLGLSQEETDARFDDIAAFADIG
EFIEQPVKTYSSGMMVRLGFAVQAQVKPDILIVDEALAVGDAKFQAKCFD
RLQQLKDGGTSILLVTHSSEQIVTHCSEAILLDGGDVIEHGEPRYVVNHY
MDLLFGKERKSYSASAFSTESNSVEPTQDDNLSLNFIDDVFSSHANYNEH
EYRWGDGTATILDFYLLTDNEPYPSVITSGQSVVLGVAVRFYEDLVRPIL
GITIKTKEGVTVYGVNSETLETDEIKVLGSKGTTALIKAEFICRLAPGDY
FISLGLATRKGEEIVPHDRRYDAIYLQIAPVTKFFGLVDLGLKLSAQNTS
L
>NE1458 xerD, Phage integrase:Phage integrase N-terminal SAM-like domain
MRQSPDFFRDMLRMNITDTNIRMLDEFTDALWLEDGLSRNTLASYRADLM
QLVEWLGRQPRTNGSLSDVTQADLLAFLSDRIGQGVKASTTCRALTCIKR
FYRYLLRQGKILADPATNIDSPKISRHLPVSLTETEVEALLAAPDTRQPL
GLRDRAMLEILYAAGLRVSELVGLSISQIRQDMGVVRILGKGSKERLIPL
GEEALHWLSLYLQEARPVLLAGKHSNMSFVTTRGDAMTRQAFWYLIKRHA
RQAGIVKLLSPHTLRHAFATHLLNHGADLRVVQLLLGHSDISTTQIYTHV
ARERLKQLHARHHPRGTL
>NE1172 xseA, xseA; exodeoxyribonuclease vII large subunit protein
MTDHNLLPEPKKILWRVSELNRNARVILEQTFPLLWVSGEISNLKRYPSG
HWYFSLKDDSAQVRCVMFRHKNLYLDWIPQEGMQVEAQALVTLYEARGEF
QLTVEQLRRAGLGALFEAFERLKARLQQEGLFSPEYKQPLPRFPRQIGII
TSPNTAALRDVLTTLQLRLPSIPVVIYPAPVQGEGSAAAITTALHTAAVR
GECDVLILCRGGGSIEDLWAFNEEIVARAIAACPIPIVTGIGHETDFTIA
DFVADARAPTPTGAAQLASPDRQAILHRLQYWLHRLQQTMERHIERRMQA
TDLLAHRLIHPGERIRHQQMHLLQLRGRLQNAWNRQVEIRTWRIEETGRR
IHSAKPDIQAGIRHQQELAARLQRAMAHRLENLQFKLRQQQQHLIHLDPK
AVLARGYSIAYTARGDILHDSRQTRAGDNVRLVFASGWAKADITETGE
>NE1159 xseB, Exonuclease VII, small subunit
MRKKSSSNKEETALHPPPENFETATAELEQIVAGMETGQMSLEDALSAYK
RGVELLQYCQNILKNSQQQIKILEADMLKHFSPAEHDAS
>NE0023 xthA1, Exodeoxyribonuclease III:Exodeoxyribonuclease III xth
MKIATWNVNSLKVRLQQVIDWLNLNQPDILCLQETKLQDEFFPMDAIAQA
GYRSIYIGQKTYNGVALLSKETGEDICTALPGFDDMQKRLIAATYGDLRV
ICAYVPNGEHVDSEKYIYKLEWLSQLNRFLQQQRACYGKVALLGDFNIAP
EDRDVYDPEAWRGQVLCSEPERQAFRGLLDTGFVDSFHLFEQPEKTYTWW
DYRMMAFRRNRGLRIDHILLSHEMADRCTIWQVDKLPRKLERPSDHAPVL
VELA
>NE2192 xthA2, Exodeoxyribonuclease III:Exodeoxyribonuclease III xth
MRIITLNVNGLRSAAGKGLFDWLPRQEADVICVQELKAQQGDINGVMRAP
DGYSGYFHCAEKKGYSGVGLYTRYSPDQIIEGTGIPEIDMEGRFLRVDFG
NLTVISIYLPSGSSGEHRQAAKFFFMEHFLPLLQSLAECGREVLLCGDWN
IAHKAIDLKNWRSNQKNSGFLPEERAWLSTVFDELKLVDVFRKINPEPDQ
YTWWSNRGQAWAKNVGWRIDYQIATPGLAAMATGVSIYKAERFSDHAPLT
IDYDFNL
>NE0732 ybbB, Rhodanese/cdc25 fold
MRNPDIGIDDLTALFIADTPLIDVRAPVEFTQGSLPGAVNLPILNDEERA
LVGTTYKQQGSEAAIKLGYEMVSGSVKQNRLQQWLDFIHQHPRAILYCFR
GGKRSQITQQWLRDTGIDSPLITGGYKRARQFLISTIDRFSEHRKLLVIT
GPTGSGKTRLIHDISNSHPVLDIEALARHRGSAFGGMSVPQPSQIDFENH
LAVNLLKLEQNNLSEPVIVEDESRHTGKVYLPDSFFHHLRNSEIIWVDEP
LATRVDNIFEDYILTTPIGQAQRIRQAIPPLASTVETREILRQQARQLFD
KYAGALQAISKKLGGDRFQEVSEDLENARSDFENKNEIQSNKIWIEKLVR
YYYDPLYLGSLQRRRVNPCFKGSGQAVMDYLQARK
>NE2573 ycbB, putative periplasmic protein
MTPLHQKSRKAFLSFDQGVSEFRLNYLAIAQYLVFFCFALTVLSAAAENR
PVAVSADAIREQLAKNAWEGVNEAELIKLRHFYAQRDYQPVWALKEDSGV
LLDTALTFIGRADEEGLASSDYHIETLRRWRAESPDQVSLPLELGTTRSL
LALVHDLSNGRLTATLADPDWYVPQRRLDPVNFLQQSIASVDSAEQLEQV
LASLPPNMPQYHTLKRLLVRLRILVAAGTVWTRIPDDIPSIHPRTRHAAI
PLIRQRIREAYSVFEKPEYDIASDDSELYDDQLETAIKAFQYQYGLNTDG
VVGKNTRRAMNMTAVEHIQQLRITLERLRWLPREFSNRYILVNIAGFNLA
AIRNNVRVLNMRIVVGRDYRSTPSFNSRISHLVLNPYWNVPASIASKDLL
PKQKHNPDYFASEGIRVFSDYHYELELDPDAIDWHAFSRSFPYVLRQDPG
KRNALGTIKFMFPNPFSIYLHDTPSKSLFQRDIRTFSSGCIRLEKPMQLA
EFVLGPSFEKANILEKIDSGKTQTVHLPEPIPVYLLYLTAWNDGQGEVHF
SADVYGRDKRALAYARWLQPEPHLSQSF
>NE2279 yccZ, Polysaccharide export protein
MIIAFTRFGLILLVFCQLAACTIIPGQHMSPFSRQSSVEMPTRENDEAIL
EKLNIQTINAELIIELEKNYKNFSLAPDNVANHYFDYRIGSSAIKGTPMK
NEPYTQYRVGPRDILNITVWDHPELTIPAGEFRSAEAAGNVVGEDGTFFY
PYVGIVQAAGRTVEDIREELTRRLSKYIEFVQLDVRVASYRSQRVYVVGE
VAQPGVQLVRDIPLTVLEAINNAGGVNSDADLRNIILTRDDKTYSINLLS
LYEGGDVTQNVLLRHGDVLNVPDSSLNKVFVLGETNHFVAGGAIGRSRSL
VMNKARMTLTEALSEAGGFDQETSDPARIFVFRGGLGKPEIYHLNAKSPD
ALLLADRFPLQPRDVIYVDRAEGIRWNQIIGQIQPTINLLNAFDGALRVQ
PFLRR
>NE1449 ycf16, Iron-regulated ABC transporter ATPase subunit SufC
MINANSKVLLEVRDLHATVGGAEILKGVNLTVRSGEIHAIMGLNGSGKST
FAKVLAGHPSYEVTQGTVLFDGQDLSVLAPEERAQAGLFLAFQYPVEIPG
VANDQFLRLAYNTVQGHRGKEELDPLEFDDFVREKMKLLKMNPDFLDRSV
NEGFSGGEKKRNEILQMALLEPRLALLDETDSGLDIDALRIVANGVNQLS
SKDNAMILVTHYQRLLDYIVPDYVHVMEAGRIIRTSGKELALALETHGYD
WIHANDQATGAQP
>NE2349 ydiD,ppsA, AMP-dependent synthetase and ligase
MNHAGIIDLVPAEVRQQWTNDGIYPDKSLFDLFCEHARKNPEKPAVVTLD
HTLTYRQLLHKVTRLANSLRHLGVVAGDVIAYQLANSAHHCAIDLAAAAL
GAIVAPFPPGRGRLDIQSLLQRCDARVIVVEPLFLQQDLCELIESLRPAL
LSLRILVVDGVARTGWHTLNDLFQSRPIATEELPDVDPDSPARFLISSGT
EADPKWIAYSHNALAGGRGRFLQHIHTRDKDFRALYLVPLGTAFGSSATF
GVLSWMGGTLIMLPRFDVVATIRAIGQLRPTHVFGVPTMFQRIAADPDLA
KIDISSLVAIVSGGAKIDETSILRCCGAFGCSFINLYGSADGVNCHTMLD
DNMTTVLHTTGRPNPEVCSIRIVDDRKNELAQGQTGEIAARGPITPMQYV
NNPELDALYRDAEGWVYTGDLGFIDEQGNLVLTGRKKEIIIRGGINISPA
QIEDIAASHPAVVSAACIPVEDEDLGHRVCLCLVMSEGAERPSLAQFARF
LLDRGLEQNKLPEYLRYLRQLPLSPAGKVDKKQLIAELENTQFKSNAAVV
SRVDFSARMH
>NE1400 yeaZ, Glycoprotease, (M22) metallo-protease family
MNIIALDTSTEYCSLALWLNGNLVSREVLAGQRHSELLLPMLQTLLTEAG
IALNQIDGIAFGAGPGSFTGLRIACGVTQGLAFAQDIPVIGISTLEALAQ
QVDAPRVLAALDARMGELYFAAYEKTATGWLCVHEPLLCRPETAPGVTGD
GWTGCGSGFDLYQPVLSEKYSSNLQRIVPGCYPRAREMAEMSAIKFASGE
GRVAEEALPVYIRNKVALKESER
>NE0539 yegM, possible putitive HlyD family secretion protein
MGGLAAWLLIQFPPENGKQEETLTPPIVRVAQVEMQSKRLHVHSQGQVVA
HTEIDLVTEVSGRIIDTSPVFVSGGYFNKGDVLVTIDPADYDLRVAQAQA
QVREARHLLMREEAEAAQAHDEWKHLGQGDPGPLSQHIPQLQEMRAKLAA
AEAALKYARQLRQRTRIRAPFDGRVRNHNTGIGQYVTQGNVLGVIYSSDY
AEVRLPVSTRDLAFIDVPDTLVADEENSKRMPRVVLAAEYQGEKRFWQGR
IVRSEGVIDRNTGMLMLVARIPDPFLRTSSSPQSGEDRLSRLTNTTALPV
GLFVEALIQGRRFDRLVILPTSVVFKDDQVAVVDQQDRLHLRTVKLLKRE
HEQAIIQAGLTAGERVLLSGLLQPVEGMQVTPELPNTNDQASGDSHP
>NE1678 yfgD, putative arsenate reductase
MGDKITIYQKPTCSKCREALSILKESGREFDSINYYDDRLTVEVLRELVR
KLGISVRDLLRADEPLARGTESVDDDELLRLMAANPDLIQRPIVVRGDSA
VLGRPPERIKKLLDN
>NE0017 yfhA, putative transcriptional regulator of two-component regulator protein
MIQDKKILLVDDDRDLLELLSIRLTAAGYETTLADSTEAAINYLDISRPH
LVISDMQMGGLDGMALFEYIHRNIPTLPVIILTAYGTIPDAVAATQRGIF
GYLTKPYDPKILLGQVERALNLAPAVETLSSTTPVSSWRKTIVTRSAVME
DLLAKVGRVAQGDASVLLSGESGVGKELLARAIHQASRRCDQPFVTINCA
AIPEQLLESELFGHAKGAFTGAVRDHKGLFQLADGGSLFLDEIGDMPLLL
QAKLLHALQERVIRPVGSTQSIPINVRIISATHRDLKSEIQAGNFREDLY
YRLFVVGLTIPSLAQRSEDIPLLANHFLRVFAEKHQKDINVFSPEAISFL
LASSWPGNIRQLMNVVEQSVVMSAVPLISGELVRDAMHKDEEQMTSFDEA
RRQFERDYLVKILKITAGNVTQAAKLAKRNRTEFYKLLQRHQLDFTLYKS
LQEKV
>NE0015 yfhK, Sensory transduction histidine kinases
MKISTDAVSPEQTRRKTGSGYKPKSFLALILIGFSIVGLPLIGALVYSAV
RIDQLSEQSRHNVYQATQITNGTSVIIGEIMAMQRSVQHALVLDDASLLE
GYFLAHTKFENITNHLSVISLYPEQRLPLEKLRLLETSLFKEILNLNEYP
QELQYLLERFAGLLTLAQEFSTNSFRMIGENVGSMSEIATQTRSLVEWEL
LILVPLVIFLALAFSVFIARPIRQIDEAIAQMGRGRLSRPIQVNGPQNLV
YIGERLDWLRQRLLKLEDQKMQFFRHISHELKTPLTAIREGADLLAEGIT
GNLNRKQQLIAGILHTSSMQLQKRIEDLLNFSALQAEIITLVKQSVKLEK
IINSAIQAQNLSILSKNLKISLNCPELSLECDKQKLDIILDNLLSNAVKF
SPVGGLIEITASYHENRAQIDMTDSGPGVDDSDGSRIFEPFYQGHNTPES
HVRGTGLGLAIAKEYAIAHGGNIELIRNTGKRGAHFRLTLPINSSESTT
>NE2561 ygaD, Competence-damaged protein
MSTYPTVPDDETLLELARRAGKLLEQNGLKLVSAESCTGGWIGQIITAIP
GSSAWYDRGFITYSNSSKQQMLHVQPSTLTQSGAVSEQTAREMALGALTL
SQAQVAVSVTGIAGPAGGSAEKPVGTVCFAWMLESASATSANSKICRFSG
NREAIRRQSVAIALQGMLELLENTTPLNLA
>NE1857 ygcA, SAM (and some other nucleotide) binding motif
MSFQGEITHLSQKGLGVVQHPENGLSYFVAGTWPGDRGEFEITDRALNNR
KYGYARLIRLIQSSRHRKTPECRFLGFSGNDCSGCPWMIADYDSQLEQKK
NRFLYAMHRVGFDLADLNIGAVQPSPDLFGYRNRFQVKTDGEKLGFVAEG
SHHIVPIEDCLILNPACRQHLQTLRKHLPSREWSPAPGDDWNFIDLDDQS
PAEPVLLNWKQPFRQGNDAQNQWMRSWLKYALEQHGHSHKIVELFCGSGN
FTEVIAQTGCPEILAYEADPQAITVLRQKNLPGVDARTADLYHPFIWKIL
KKNVQDAGILVLDPPRSGLKTLRGFFDAFAALETICYISCDPVTFARDAW
IFCKNGWKFTDIQLIDLFPHTPHIEITATFHKQWGNTKKGNKR
>NE0928 yhdE, Uncharacterized protein family UPF0074
MRLTNYSDYALRILTYLGLKREELSTITEIADCYGISRNHVVKIVHHLGQ
LGYVDTLRGKNGGIRLAHAPEKINIGEVIRHTETSMDIVECFSNQNSCII
GCSCVLRTAISEALSAFMAVLDDYTLADLIAPRRQLSRKLHVMQISDSLS
D
>NE1117 yhiH, ATPase component ABC-type multidrug transport system
MSVDISTKPVIRVQNVNLRYGDTLALDALDVEIPAHCMAGLIGPDGVGKS
SLMSLLTGARVMQQGQIEVLGGDIADPAHRRVVCPRIAYMPQGLGRNLYH
TLSVFENIDFFGQLFGHEQIERHQRIDELLESTGLAPFRDRPAGKLSGGM
KQKLGLCCALIHDPDFLVLDEPTTGVDPLSRAQFWELISRLRTQHVNMSV
LVSTAYMDEAERFDWLAAMDAGKILATGTAAELRTRTDTDSLEAAFIRLL
PAEKRRGYREIIIPPRDIRDDQEIAIEAHDLTMRFGDFVAVDRVNLHISR
GEIFGFLGSNGCGKSTTMKMLTGLLPPSEGEAWLFGQVIDAGNLATRKRV
GYMSQAFSLYSELTVRQNLELHAKLFHLPDADIPARVTEMAERFLLEAVM
DSFPDALPLGMRQRLSLAVAVIHQPDLLILDEPTSGVDPIARDMFWELII
GLARRDQVTIFISTHFMNEAERCDRISLMHAGKILASDTPAALIERSGHS
TLEAAFISYLKQASGISDKPVKMPVPSTTQAQATVAPEETHAVMTLNTRF
SLLRLFSYTLREALELRRDPIRGSIALFGTVLLMFIMGYGISMDVENLRF
AVLDRDQTTLSRNYILDLSGSRYFIEQPPITDYGELDQRMRSGELSLALE
IPPNFARDLARGDNVRIGAWIDGAMPTRAENILGYVQAMHQNWLLEQARH
HATDHLPAGLMTIETRFRYNPDVKSLPAIVPAVIPMLLMLILAMLSALSV
VREKELGSILNLYVTPVTRLEFLLGKQLPYIALGMINFFLLCILAVFVFG
VPHKGDFLLLTVAALLYVTAATGLGLLVSAFTRSQIAAIFATSLATIIPS
SRFSGMIDPVSSLEGGARLVGEIFPATYFLTISRGTFSKALGFAELGSLL
LPLALAIPVITSLAVILLKKQER
>NE1947 yjeP, Uncharacterized protein family UPF0003
MNGLIRIIVFTVLVVMSMGSSHASELDEELQAIQAKLESLNKGEETPQKK
RLREIFNDHQQVLLDHQEYLKKTANYSQQLEKYPKQLESLKKSAPPTITT
VDPEKLTKLSPEELERRYVTTQATLSDLQNQQQKLTTEISKLRQRAVTVR
EELASINTLRAELTENSPETRKTTDKKILAALENYQEAQLQALTDKTRML
ELEALVLPNAIEITTLQEKLSLQPKIQALEQETELLAEEINRQKRAETEQ
VVEKSQQLLVEKAWQYPGLEKYAQDNQSLAKKLSNYAELSTRLINQKTGS
EKRLALITQSYAALQQRLELKGEDTALGTEVRKQFKEIMVEPAIAQTEKD
LSSARLDLFRLEQEKLQMLNETGYFKQIMNGEEIPANDTSAYQQIMNAFH
ELKQSREQMIDQFTQVLHGYTKELQLHLAIQQQLIEKISQHKLLLKENLL
VTRSAEPISLDVVQDIQDAITWLTSRKTQTATGKAINAAWKNILIVYAVF
GLIFIIFRYTYWPVYRIWTRTGNSLLGKVNQDRLLYPMGMLLATLLISGC
IYLPFQLSSVILRYGTNTEINHAFSVSLHVAAIAAFIWSFILLLFHPEGL
LIGQFKCSEKLVSKMYQDIKRFAPVVTLLCIIIAFTDALDDDLVRNGFGR
LTFILLCLVLAVFAFGWMAVTRSGKALYQGESFYLMLNPRFWMTLLFMLQ
IGMIIMAAIGYYFAAIYQYLLVFQSISWIIFCALTFFISFRSLLIIQRRI
AFERAVEKREEILAQRITTGKSEADLLDDNYIDVKTISKQSATVLKISIW
ALLITGLGFIWIDVLPALSFLEKITIWRTSVETDGEVIMRPITLQTLLVA
LIVLSLAMIAAHNLPGTLELLVLRHLSLNPGTGYAITTLLKYSIVIIGIM
VTLQKLGMEWSNIQWLVAALSVGLGFGLQEVFANFVSGLILLFERPIRIG
DMITLNNTSGTVSKIHIRATTLIDSDRKEIVVPNKIFITQQLTNWTLSDQ
ITRLIIPVRIARGSDSNKVSALLLEIAKNNPMVLRDPEPAALFLEFGSNA
LNFELRVFVGQISDRVKLTHQLHLEINRRFAEEGIDLVSP
>NE1335 ykvP, possible spore protein [UI:20467420]
MIGRAIRKLGSLVYKYPEIPDAVERPGPFGRLKIAMVTDYFTADCLSAEC
RVKALTPGNFRMVIGEWKPDLIFVESAFHGSRGSWRYELAKQPKWLRLSK
PTAIYQLVEFARSRGIPTIFWNKDDDVFFDAFIDVAKAFDYVFTTDNECI
ESYRQQLPAHVPVNPLIMPYQPAFHNFTGFEFTRNEACFTGSYYQRILNE
RKLFLDMVFDACERTDLSLNIFDRNHDRLSRHFEFRFPENSRLHLHGRVP
HRETAKIYKSHAISLNVNSITRSETMYSRRLLEILACGGIVVTNPSQAVD
RYFRDYCHVVSSSDEAQELFSRLRYGPSPDDMARAEAGAAYVRQNHTWVH
RLEEICTVVKI
>NE0911 yliG, Uncharacterized protein family UPF0004
MTSASSATIQPPRIPRVGFVSLGCPKATVDSERILTCLRAEGYLISPSYA
DADLVVVNTCGFIDSAVAESLETIGEALTENGKVIVTGCLGAKEDVIRQA
HPSVLAVTGPQATEEVMQAIHRHLPKPHDPYLDLVPPQGIKLTPKHYAYL
KISEGCNHRCTFCIIPSMRGDLVSRPVGNVLQEAQNLVDAGVRELLIISQ
DTSAYGVDIKYRTGFWQGRPIRSRITELARALGELGIWIRLHYVYPYPHV
DELIPLMAEGKLLPYLDIPFQHGSKRILKLMKRPANSENVLARIRQWRDI
CPDIALRSTFIVGFPGETEQEFEELLAFLEEAQLDRVGAFAYSPVKGAAA
NALPDPVPSEIQQERLARLMQWQEEISKKRLAGKKGRILKVLVDTVDENG
VIARSYADAPEIDGVVYIEPDFSIKPGDWVDVRITRTGIHDLWAKKI
>NE2078 ylqH, possible similar to flagellar biosynthetic protein
MKRPEPTGQENQGEPTDSPAPPEHTPIPGAVALAYHSGMTAPQVVAKGRG
LIAEEIIRRAQEAGVYVHESAELVALLMQVDLDDHIPPELYIAVAELLAW
LYRLENDLAMPADSPDPSQ
>NE1910 yqaA, putative inner membrane protein
MICGHLRTTLRFTYCYQKIDTIPMNENASLLALFTSSFLAATLLPGGSEA
VLAGVLVAYPDLFWPALNIATLGNTLGGMSSYVIGRLLPDEQALSEKIGR
HVHGLEWIRRHGAPVLFLSWLPLIGDILCVAAGWLRIHWASAALFIAAGK
FARYWVIALAIS
>NE0721 yraL, possible methyltransferases
MPAAAGVSKGTLYVVGTPIGNLRDITLRALEILSAVDCIAAEHIQHAQKL
LAGHALHTTSTRIMPLHQHNEGSAVEKIIELLGSGKSVALISDAGTPAIS
DPGALLVQQVLARSLPVVPIPGANAALCALSASGLIAPHFFFYGFLPAKS
GERQRKLAGLKTLYACILVFYEAPHRVLECVADMVAVLGTTREITFAREL
TKLFETIHTCALGEALDWLQADENRLRGEFVLLLAPAEEPGQEDISPQAV
HALAILQRELPLKQAVQLAAEITGEGRKKLYARALLERKTET
>NE0680 yrhG, Formate and nitrite transporters
MAKFRPPCSFCARLDHHQFPQLTHLKNMKESEQPSIILNERVIQSSIAAE
TMSTGGSVASAPVKVSNEPTAVIDSVSPIQMGHDLVEDATKKKKFKVGQI
LIRGFLCTPFLAYATALCALLVSQGWPTAAAGLLFPAGYVMLSILGLEMA
TGSFSVMPMGLFAGRFGLGSVIRNWSWTFVANLIGGVFFAYLLWFSLTKG
GAVEPPGVLTTLAHLAEKKASYAGYGVNGWFAAIGMGILCNWLVSLAPVF
AKGSRSVPGKIMLMWLPLATFFSLGFEHAVVNMFVFPIGILSGADVTISQ
WWLWNQIPVTIGNMLGAVIFNSTLWYRTHRA
>NE1803 yveL, possible similar to capsular polysaccharide biosynthesis
MSLIERAADKLVKNDMNGFMPGSVSQHDSREGIGPDSLFSSDGVPGKKTG
HGRIIKEEVGADRINKAADTKKTVKRIEVDLENLHRKGVVTLHHTKSEIA
EQFRLIKRPLLANAFNPDSGVKNGNLIMVTSSLSGEGKSFCSLNLAMSIA
MEMDHRVLLVDADVARPSIPATLGFSPEEPGLLDMLRDSQLSISDVMMKT
NIKKLTLIPAGRRHTHATELLASQSMHSILVELAQRYHDRVVIFDSPPLL
LTSEARVLASQMGQIVLVVEAERTTQQTVKEALQQIEACDVVNLIYNKAR
AHGSTEYYGYY
>NE0852 yvgQ, Nitrite and sulfite reductase
MANTLPPTDRSCDISQPLERLSPDESLKAESDYLRGTIALGLLDRITSAV
PGNDIKLMKFHGIYEQDDREIRDERRRQKLEPAFQFMIRVRLPGGICTTE
RWLKISELACAHGNETLRMTTRQTFQFHWVLKQNIVPLIRGLHEVLLDTV
AACGDDSRGVMATVNPQFPALQAELAALAKTVSDHVIPKTRAYHEIWYGE
ERIASSEPEEPFYGQTYMPRKFKIGFVIPPNNDIDIYAQDLGYIAIIGEN
GKIAGFNVAIGGGMGRTDKAPHTYPRTASVIGFITPDRLISVTEAVMGVQ
RDYGNRADRSRARFKYTIDDKGLDWIKLAIEDRAGPLESARPYDFTSNAD
IYGWIESGDGFHHFTLFIENGRLNRDMLDKIAQIAHVHKGHFRLTPNQNL
MIANVATADKPEIEALLRETGLIAFNERSVLRLNSMACVALPTCGLAMAD
SERYLPDLITKIEGILTRYNLQNEPITLRMTGCPNGCSRPFIAEIGLTGR
APGKYNLYLGGGFHGQRLNRLYRENIGEPAILETLNEVLGRYATERLPDE
HFGDFTIRAGIIREVTEGRFSND
>NE0853 yvgR, Sulfite reductase flavoprotein subunit
MMASQLSRGTLTAEQWQLVEALGSSLTPEHAYWISGYFAGLGSALLRSSG
TASQSPDVCIPPVVTQAEPVAARSLTILHISETGNSTELAIRLAALAVEQ
GLSPTLVGIADYKVRKLKEEQDLLIITSTHGEGDPPQSGMEFFEFVEGRK
APSLSGLRYAILALGDMSYEHFCGAGKRLDERFEALGAKRLQPRVDCDVD
YEDPAAVWSTGILALLAAEQASAISSVASPSKTTGGTQQANNSVYSKRNP
FPATVIDNIVLSGRGSTRETHHIEISLADSGLTYEPGDALGIAAHNDPAM
VEALLAALNMNPDAPLTVKGQTTTLFDALGKSFEITTATLRFLDHWGELT
GTTDIFSAMDNTKRAAFLHDNHIIDIIRAYPLKGLDPQRFVDGLRPLQPR
LYSIASSLSICPDEAHITIAPVRYKLHGEPRSGVASGHLADRAIADSVLP
VYIQSNPHFRLPSNGEPIIMIGAGTGVAPYRAFMQEREANGGGRSWMFFG
ERNFRTDFLYQTEWQDWLKNGVLTKMEVAFSRDGAEKVYVQHRIYEHAHD
VYAWLEEGAYIYLCGDASHMAPDVHNILVTVIEEQASVQREEAEEYLRDL
QRDGRYQRDVY
>NE2292 yxiE,n17E, Universal stress protein (Usp)
MLKMLLPVDGSEASGRAIEEFVKRLDWYREKPEIHLLNVRMPLTGNVSMF
VSKEEIGDYYREEGLRNLQQARGYLEEKGVAYRHHIVVGEVVTMILQFAE
EIQCDQVVIGPRGLGAVKGLLLGSVASKLIQLSTVPVLLLK
>NE0103 yyaL, putative similar to unknown proteins
MPNHLAGETSPYLLQHAENPVDWYPWGEEALEIARMLDKPILLSIGYSAC
HWCHVMAHESFEDAQVATAMNEHFVNIKVDREERPDIDQIYQSAHYTLNH
RSGGWPLTMFLTPEQKPFFGGTYFPKEARYSMPGFLELLPKVAELYRTRK
TDIEKQNAVLLKLLAQSLPAPDTRASALSRQPIDRAWEQLNRLFDETDGG
FGDAPKFLHPAELQFCLRRYVTDNDTRALHVVTHTLEKMAQGGLYDQLGG
GFCRYSTDHSWQIPHFEKMLYDNALMLPLYAETWLVTGNPLFKQVVEETA
AWVIREMQSGIDGEGGYFSSLDADSEHEEGKFYVWDRQAVSAILTPEEYR
VTAAYYGLDRSPNFENHHWHLAVTESIETVAARHQISQEAVQQLIDSARR
KLLNEREQRIRPGRDEKILTSWNALMIKGMTRAGQIFEREEWISSAVRAL
DFIRSRLWQNDRLLATFKDDKAHLNAYLDDHAFLLDSLLTLLQADFRQTD
LDFAITLADVLLTRFEDKTSGGFFFTSHDHETLIHRPKTGHDGAIPAGNG
IAATTLQRLGHLLNEQRYLEAAERTLNVFSSGLSLHASSHCSLLITLEEF
LEPTKTVILHGNRPELQIWLKALLPYSLDKIVIALPLELSELPDSLKMRS
TPDGKISARVCEGRRCLPEIHSLNELLQMCKLHGRMALP
>NE0398 zwf, Glucose-6-phosphate dehydrogenase
MNDAKMSQTEKLPCLFVIFGATGDLASNKLLPALFELENAGRLADNFSIV
AFSRREWTTDDWLEHLREILKNRIDQSFPEGVLERFFARFRYQQGDLNDI
ESYRVLAAGLAPGSACSRTVFYLAIRPADFVAVIRNLKAAGLNEPRGMNR
VVIEKPFGEDLESAQVLNRLLHQHFDEEQIYRIDHFLGKETVQNLLVFRF
ANTLIEPLWNRNFIDHVQITVAESGGIEKRAGYYDRAGALRDMLQNHLMQ
LLTVVAMEPPPALEADALRDEKVKVLRSIRPIAKRAIHAHAVRAQYTHGN
IDGQVVAGYQQEDNVERDSITETFVAAKFYIDNWRWRGVPFYLRTGKRMP
KQNSMIAIRFKHPPQQLFRETPLEWIAPNWVLLSIQPEESMHMEIHVKQP
GLDMNTRVMQMNASFLKTDEQALDAYEALLLDVIEGDRSLFIRFDEVEWA
WQVIDPILKHWVVEREYIPTYPAGSWGPAEANRLFDNDDQTWRNEL