TitleGenColors Logo

Gene list

Applied filters:

COG category: Unclassified
Gene type: CDS
Genomic element: chromosome

Number of genes found: 322

Free access
Sort by:

 



# Nitrosomonas europaea ATCC 19718, ATCC 19718

>NE1357 hypothetical protein
MIQGDSDNGGNGRCQNTFHQMLQLKTAAAHLGFGKKQGQATVSRLYLFNP
APARINDQSRIERIASAARAAPFIGIRHLLST
>NE2100 hypothetical protein
MLYTYLQRSMHRLRCILNVGRHTEPSEATMDENSQANKHKHLDFIQAAIN
RMAGNLFLLKGWSITLIAALFALAAKDSNKLYIVIAYFPLFIFWALDGYF
LSQERKFRALYDHVRTLDESQIDFSMDTRPFSSDIRNTWAGSASSKTLVV
YYAGLAVVMLILMYVVR
>NE1362 hypothetical protein
MPGTQQGRTGYPQLRFVGLLENGTHVLFGVALGGYQDAEVRLAHQTIAHL
KPGMLCLADRGLSGYPLWAAASRTRRAVALAHPQESPTAYA
>NE1294 putative glutamate--cysteine ligase
MPVPHLTHALNAPTLDLEKRILAAKPTIEHWFRKQWQHHTVPFYCSVDLR
NSGFKLAPVDTNLFPGGFNNLNPTFIPLCIQAMMIAVEKICPNAHNVLLI
PENHTRNVFYLQNIAMLQSIMQQAGMNIRIGSLLPDLDQPLPVLLPNGDS
LLLEPVQREGNRLFTRDFDPCAILLNNDLSGGIPEILQNLEQIIIPPLHA
GWATRRKSWHFSAYDNVAQDFADLIGIDPWLINPYFISCGKVDFHEKEGL
DCVAAGVNEILHLIREKYAQYGISEKPFVICKADSGTYGMGIMSIKDGAE
LYKLNRKQRNKMATIKEGLVVKNVLVQEGVHTFESIHQAVAEPVIYMIDR
HVVGGFYRVHTGRGIDENLNAPGMHFVPLAFDTTCMQTDPTARPDAAPNR
FYTYGVIARLALLASAIELERYLQTPLPVLAEV
>NE0187 hypothetical protein
MLHFYPTKVDLISNWNNNNRNNYMKVSLNLITTSMLIGLPMLAQAEVFTI
PQSSLIPSTTLWTDDLGISIGNTLVMTGGGSAANVGDPTGRNDDGFSGPI
SFGLDFSGLTLFGTTYSQFYANNNGNISFGNGISAFTPMGLQGATQPIIS
AFFADVDTRNAASGVMSFQTHTTAAGSEVIITWPSVGYYSQQATPLNTFQ
LVVREDDYLIPDGEGQIGFFWTTMGWEVGSASGGGSGGLCSGIGGIGTGC
VPAAVGFGDGLNNGYVLEGSTLNGIAGVLQNHRLWVNLSDGGVPVIDPGA
VPEPGTLALLSIGLAGLGIKRRYKAKIA
>NE0704 hypothetical protein
MMAVFYSAEELTISERRLRADLAHESAVEMVIYDVLVKGNRSIWIGDGIV
TRNVQISEQMFSVSVQQATGLIDAMTSDSRILSRLLAWLAIPRNSREPDF
LSARSTGTLRPATYTDLQAMLGLSHSAFACLYPHITFYSGRVEPDWRYAS
NDLVELVGLRSRSAGTHSVLNDDTSSHNVTGATLRVNVLPGNTSDEAAGL
SVEVTITGQIDPSHLIRSWKRITRMDNSKQCRNLNTQ
>NE2245 conserved hypothetical protein
MNLYSLLLSLITGCFLLAQGGTSFSSPVETEKSAVTAQSILEKADEIRFP
QDSFQVNVAIRTAAPDHAEDLYRYQVLSKGNENSIVMITEPASERGQAIL
MKGRDLWVFMPSVSQPIRLSLSQRLTGQVANGDIARANFTGDYHPQLLRN
ESIDDEDYYVLELTGIDRSVTYQKVLLWVNQSNFRPYKAEFYSVSGRLLK
TSRYENFDNILGEMRPTRIIMEDALKSGEVSVLDYSDMKLRDLPDKIFTK
DYLKRLE
>NE0165 hypothetical protein
MPLSYAGCLCGIAAMVRLLKYIWAAPCSLFGLGCGLFLLLIGGSVRQVSG
ILEFSIGYGNPIPFFPFWAITFGHVVLGLNESALEYSRAHELEHVRQYEV
WGVLFFLAYPVSSLWQLFRGRNPYWYNYFEIQARQRSGQKRLGL
>NE1091 conserved hypothetical protein
MRSMTLEQLRTASETGGVSSVTLKGQGGAFLVQINTRSGVAAILTKARNS
EPRRFGNPAAALNVLREVGITIGQFDASEWNPDEREPVARSSDNRAKALH
KAHEAAAYNEWLAAEIQEAIDDPRPGIPHDEVMARMDARIVRHKAAGAKR
A
>NE0391 hypothetical protein
MANQLTIDQRLQERRAGASFVCPYQLGIKQGRRVTRRRSGKGAAYVDKYG
WPLVICCLAIVLFSATDAFLTINILSDGGTELNYFMAVLIEESTQKFVHF
KLALTSLAAIILTIHHEVQIRGGFRCRHLLYMISTGYAGLIGYELVLLQI
IDV
>NE2041 hypothetical protein
MLNIFHRFPEVTGNPAHRKRDYFIAAFTFFCVAVSLVIDAHGTLLLQNVL
GVIAWIFLVALLRGENREIRMQVVIAVAFATAGEHFASIYMGGYTYRLEN
VPLYVPPGHGMVYLTAVTLARSGFFLQHARKIAAFVVISCGLWSAWGISG
LPEHGDQVGALLYVVFLIYLFKGRSPMVYLAAFFITTWLELIGTAAGTWQ
WATLEPIFELTQGNPPSGVSAWYCLVDAVAISGAPVFLNAFNRMNGLLKW
LKTNGISLKVILSRK
>NE1232 conserved hypothetical protein
MNMTIKNRVLKAIHNFLILLLRIERRLEPWFRPQWDYLFREPGSRLIQFL
INRRRKNKDSDLELAEERFDPDEEESLNKIIDLMMDQMRGRFKPGGYERG
GNTKTHGIVRATITIRDDLPEHCRKGIFANPRSYPAYIRYSGPGPNVPAD
INDVGFMSMAMKIMGVPGEKLMSEEKLTQDFIATSGGATFVTPNTRENAK
LQYWSLVDMTLYYFLNPKDSHLLDFFMQSLWTATQYNPLGQRYWSCTPYL
LGEGQAMMYSFVPKTKEVERHIPGLPFGTPPFNYLRENMIKTLNEKDVEF
DLMIQVQTDPHLMPIEDSSVRWPEKLSSFIPAATIHIPRQKFDSDAQFEF
AKRLKMNPWHCLPEHRPLGNINRARFRMYYELSRFRQEMNETTHLEPTGD
EVFD
>NE1862 hypothetical protein
MGRILKPDLVMTPIDENIRLGSILAVFASAIGVKQSGSGGNRDIESLFTT
LYLRLNGTCGLAKTKFGADDGRKDMQYTMTESGVVQPEVAEVREVRWGGD
SLPVIITVMTGSRFCSPDGHLP
>NE1338 hypothetical protein
MSIFEAAQIVWQELVTRIARSEWNLLKHHDLLSNKLPVLFRNRAFLFIRS
PLVAARSIPFHITKYFWCSSTKYGVSVRP
>NE0314 hypothetical protein
MGIPSDFSNTGLTGFNVSSTKLVKIEQQYKGVKGKKFISPGGGKFVRIDD
PGAVDSNGLRTADASNTSISLETGNTWLPRLHADGTLTCGGKCAWLECTN
VTIPKNKRLWFRYAFLRFSSLPADSFSVLLCFPNDDTSVPPLPPYWICSV
KELQENRGNINQTDWTECFVEIDKNADFHGTLRWVVATGHNLADQNSIPD
NTRFTRPGCLLIDAIDIR
>NE0296 conserved hypothetical protein
MRKLHLYLLAFILCLAGLGLAYYKAAVIGLPLTASEEAQVWNIEARISFR
AKPDSAIKVTLPLPFNPAGYSILDEDFVSADYGLAIEQDNTGRVARWAKR
RAQGKQLLFYRAVLFENEEVAPKSDAAPVYPKPPEYPENLAPVIQGVLDK
ARQQSADTISFTQRVIEQLNAAASIPELKLLVKHVSRTRNLAKTLSWVLA
GARIPSRVIRGLRLQDDKDYADLEHFLQVYDGEQWRTLNIKDGQEGLPSN
FIIWQIDDDRSFSIEGASESGIQYSVARTLQETVNIAGFRSAEKGSHLMD
FSLYSLPVHTQNVYKVLLMVPLGALVVVFMRNIIGIRTFGTFMPILIALA
FRETELFWGLILFSLIVGLGLLLRAYVEQLKLLLVPRLAAVLTMVVLLMA
GVSVIMHKLGFEMGLSVALFPMVIMTMTIERMSLTWEEAGPAEAFKQVSG
SLLVAVIGYLAMNISEFQYMFFVFPELLLVLLAVILLLGRYSGYRLMELW
RFRAFARNKP
>NE0520 hypothetical protein
MKTKFALTLAASTLLVSAAHADPFVNGGFETGNFNGWTVSNSAYRASINN
ANLTPDWVFANDNYAMHSQIISAGTIDPNVGAAFGSTVYAGNYSARIEDT
TWGGYASAITQTVTNYTEDSINFVWKAVLLGAHGVNDAATFKLVLTDLTD
GIDLITREYNAASSGSGVDSRFSLSGGNYYTQDWQIETLNINDTLKGHDF
MLSLVAADCQPTGHWGYVYLDGFGSVAGGGGDDTNNVPEPATLAILGLGL
LGMTATRRRKNS
>NE1223 putative transposase
MMDHDHSYKALFSHAEMVADLLRGFVREEWVNELDFSTLEKVSGSYISDD
LREREDDIIWRIRWGKDWLYVYLLLEFQSTVDWFMAVRIMTYVGLLYQDL
IRSESIHKGEQLPPVLPVVLYNGDNRWQAPVDISELIIPIPGGLERYRPQ
LHYLLLDEGSYHDHELATLRNLTAALFRLENSRTPEDVQQVLQALIAWLQ
SPQQSGLRRTFTVWLKRVFLPGRMPKVRFDEIQDLQEVHSMLAERVKEWT
KDWKQQGIEEGLQKGLQQGLQQGRQEGRQEGREEGLQQGEAEFLLRLLER
RFGPINETIRTRIRAADSQTLLTWGEQILTAQTVEEVFEA
>NE0783 hypothetical protein
MHTTDPAGWCSNRNSLIRKLLTSFIGSSSGAVKICRKAATVSFILATGSS
QLFASGSSPTATQVDWSKAPPVSPPPRAGIFVKPPMGPGYFSLLDLIDGN
EREKPQVDPLPPSALTTTPAFDFDFRYLEQPGHDKDFFDPVKRIHLGSDW
LLSFGGSFWYRYMHETDSRLNAAGINNDYHLLRTRLHADLWYQDQFRLFA
EMLDARALGLDLPALAIDKNHTDMLNLFADVKLGQFMDGPAYLRVGRQEL
LYGSQRLISTLDWANTRRTFQGVKTFWQTPAFNLDAFWVRPMVTEPNQFD
NWDKDRNFVGLWGTYKAIPGQVLDLYYLSLVDNRNVSPANITQGNVLQGD
SVLHTIGARWVGDYERILYELEGMYQFGKRSHLDISAFSIASGVGYQLPL
PMNPQFWLRYDFASGDKNHRDGRSNTFNQLFPFGHYYFGYIDQVGRQNIH
DFNAQFTLHPQPWVTFLGQYHRFYLANKRDYLYNAAGAGTIRDITGQSGS
HVGDEIDFTINFHLSRHQDVLLGYSKLFTGEFLKNTRPGVSPDLFYAQYN
FRF
>NE0635 hypothetical protein
MLTLPLRHQYLIGSILIILMIATREYHFASLHTLPGASWAVFFLAGVYLS
SSWSLLGFLVLAWILDFSAYFTAAGSDFCLTSAYIFLLPAYGALWVAGRW
FAARYQFSWRALASLSISLLIGAMLCELFSSGGFYFFSGQFEETTFAEFW
QRELHYFPLYLQSLLFYVGTAATIHTLFVLIHKSRHPQINATG
>NE1300 hypothetical protein
MNKVIVAAFVSAFVLGSTATFASGNLESSLAPISAKDMLDYLACKDKKPT
DVVKSHTEVENGKIVRVKCGDIVALVQKAREQSGDAWQGGY
>NE1130 hypothetical protein
MSNSRFSTIAFFLGSSALSLIFLARSVLAEELPKLQNLDLFKLNGFLSQG
YIKTNKNNFFGHSNDSGSLDFREIGLTASLRPSPKLQLSGQLLHRRAGEG
SNGGIHIDFGFLDYNFANTPAAEIGIRLGRMKNPFGFYNDTRDVPFTRPS
ILLPQSIYFDRTRNLALASDGIQVYGESRADWGNITVQFGAAFPQVDGHD
TEISVLRSLQHRGDLKTKLSYIGRILYEQADGKLRLAISGAQANVGYSPG
YNDTLLNGSFRFSPLILSAQYNAERWSITSEYALRHLAWKDFGNHAAFQQ
SFTGESFYLQGVYRFYPNWEAVLRYDLLFTNRKDRSGKKFSASTGYLYPA
HNYYAKDITVGLRWNITPAIMLSTEYHRINGTAWLSPLDNPDMSHTGKNW
NLFAVQASYRF
>NE0471 hypothetical protein
MNEYFFPKLTAVEALAPYRLRTTWSTGEVLEVDVGDILRKIPDLAPILDP
EAFARVHIAEWEGSVEWFDTEFGRDNVYAWAKEQAGEVSHEMFGDWMHRN
NLSLTTAAEALGISRRMVSYYRTAHKIIPRTIWLACLGWEATRPETKTLP
RTLPAAYAKGVSASLS
>NE0136 hypothetical protein
MKQSIFFLTALILFHASSNAGTLVNGTWSPMGCGERPIAPVVSAASVEDY
NRSAEAINEWQKRAQQYNSCLVDEANADNALIARTANDQQAKFREEIDRI
NAETDKARAELDSKR
>NE2103 Helix-turn-helix protein, CopG family
MKAKDFEQQFDEGVDITASLDLSKAKRVLQEQKRVNVDFPTWMIESLDRE
AEKLGVTRQSIIKVWLAERLEKAALTHPSSGTR
>NE1302 putative transposase
MKTPWCYTCPMEHDHGYKLLFSHAEMVADLLRGFVREEWVYELDFSTLEK
INGSYISDDLRERQDDIIWRLRRGKGETGEWLYVYLLLEFQSTVDWFMAV
RIMTYVGLLYQDLIRSESIRTGERLPPVLPVVLYNGDTRWQAPVNMEGLI
FPAPGGLDRYRPQLNYLLLDEGSYSDHELATLRNLTAALFRLENSRTPQD
VEQVLQALIAWLQSPEQSGLRRSFTVWLKRVFLPGRMPGTSFSEIHDLQE
VQSMLSERVKEWTKDWRQQGIEEGKQIGIEEGKQIGIQEGRLEGRQEGRQ
EGRLEGRQEGRLEGESEFLLYLLEQRFGPVSDAVRARIGSADTQTLLVWG
KRILTAQTIEAVFGD
>NE2461 hypothetical protein
MKTTLIKVIAASVTALFLSMQVYASGHTAHVDEAVKHAEEAVAHGKEGHT
DQLLEHAKESLTHAKAASEAGGNTHVGHGIKHLEDAIKHGEEGHVGVATK
HAQEAIEHLRASEHKSH
>NE1200 conserved hypothetical protein
MTWILTKYFITAAVVVIVSEFAKRSDKLGALVAALPMVTILTLIWLHVEN
QPETKIANHAWYTFWYVVPTLPMFLIFPFLLQHFGFWLALTLSAFITIAC
FGVFALLVRRYGINLM
>NE0543 putative (L31491) ORF2; putative [Plasmid pTOM9]
MYNANIPSTHEIPSTGRLIRSTVIALLTAIFLLVTVVMPAEYGIDPTGFG
EITGLKRMGEIKVSLTEEATADRANAAASLQIELSEAVTAVTESLPLAPK
SEMSHEKKITLAPNQGTEIKVTMTKGSKVHYVWRTNGGTAVFDQHGDSKE
LKINYHSYSKGTGQMREGVLEAAFDGDHGWFWRNRTSTPMTITLKTEGEY
TRIRHFK
>NE1119 hypothetical protein
MFVCSDFKQRSRKMKKTYLLSTVLLMFLSSSVIAEETLTEKLETKTNDVE
RATNKAINRAQEATCTDSDAECLKQKAKNRASEAYDATKDKASEIKNKVD
>NE2477 hypothetical protein
MNNIQTNTRDQDSLEYILQLELDNIRLFLELLEHERNLLAAGNLDDLALL
VADKDRLIDQFARLDMRRNRFLNAAGLPEGTQGMNAWVSGSDEESTVARD
WEELLGLADLAKQLNQTNAVITSSWLQYTRRTLNALHSAAGRPPLYNTKG
QTT
>NE1230 conserved hypothetical protein
MCTFQPGSLSNTWLEKKSVIFYPVLTESRGYPNIPLRLPVIRIKNSSCQG
DIVRFLSIIIAAIFCTFPVSAATPASDDSYIAGYAAAILKFQFGIDLPSL
TVRNGNITLPADKLPAEDRTRITQLFSEIPGVTRVEIVEYTAQQPSLASP
EPDEAIVDKGALATRSTMLATGPLPEGHLFKPLLADPRWNHFSAAYRNYV
GRNVDGNHNGSVSFGETIPFYRANIGQSIVQWEVGLQAGIFSDFNLGASS
TDLINTDFIGGIYTSVRAGNASAFARIYHQSSHLGDEFLLRKLTDIERIN
LSYEAADLRLSYEFPYGIRVYGGGGGIFRKEPSAIKPWSAQYGIEFRSPW
QMEFALVRPVFAVDIKNYEQNNWNADISARAGIQFDNFQAFNRKLQFLLE
YFNGYSPTGQFFREKVEYLGIGAHYHY
>NE2354 hypothetical protein
MSGYMARIFCKKAMLLLLLTGTGIAGHIPVSDARQVAEAIGADTFTGYQE
IRDRRKSRPAAVLSSTEAGQDNSELKKPPYRRTRSTDAAAAFIPPQLLAP
GRCGIVDRIEFVMQPYYGDPGRVDTFIPGMAAIPYGQHGEKLFYADVAVA
GSSGYFPPHKNIVEPVYFKIGLLADDGHYHVMTQRTPPQFQTGETVRLGH
SGFLEKADCVMPEPDQHRPGR
>NE0536 BNR repeat
MTIQNTLTKIFPVGLLVWFCWSLPVSAETGQPGFPIQEITVPASEESKQH
HLAKTDDGRLILSWVESDGQNSTVRFAIREGQGWSPVRTVTSVDGKLGDP
PVVFGLDDGSLAAAWMPYAKGGKSKYAADIFLARSQDGGLTWSKAFKPYG
ESARIYDAQMSLTALPNARIALVWTDMRETGDPGKNDRYQLMATVLAKGQ
QSAGTELRLDDDICSCCRASTTAEGESLLTVYRDHSQGEIRDTGAVRWDT
DGKVQALSAPGDGWRIEGCPSNGPAVDMSASSVALAWFSAADDKGRFKVA
FSTDGGKGFTKPFEVDDDARGYVNVALLSTNVALVSWRKRAGPEDELRIA
KVTTDGISRQTAIYQGDFPGWPSKYPGMIVLDHQAFVAWTDPIKKKVRLV
AVTLD
>NE1242 hypothetical protein
MTHHTEVFEGGTIDIEDDTSLTINGKEISYVHDAVKNKWSSRYLPYTQYD
SLLDLARAIIRDTVEFSGVKE
>NE0822 conserved hypothetical protein
MTAITRREESTSRDRGALAKMIMTLLDHWQLSTEDQAALLGIAASNRTAL
ARYRKGEAIGTSRDQYERVGHLLGIHKNLRLLFPQNRDLVYRWMTTRNKA
FDNLTPVEVIKEWGFAGLLMVRGYLDRARGI
>NE0363 hypothetical protein
MENTAEKFEEEILDACIRHAKEVLAEQLPLVKDKKYDFAPQFRDLTIQLY
LVGVMQQFYDQYEATTTDAQEKAFHALYYMMTKDGVKSRRAKNQAAFIRQ
MSRLDDGDEALALALGYESKPGDRSLAEVFDHYVNESRVSKGLWRFYDQG
KKILLLGGLLFAMAGIWFVTIYLPESDNITILAVGLLAALFFIVPVFLVG
LLIHRYKTRKGSRTPTPPQ
>NE1011 putative transmembrane protein
MSNEKVSRSRLKLILMMLVILSPIVISSFLHRSNFRPDHTVNYGELLEVR
PLQGEATNLTDNTIFRIRQLKGTWNLLIIDSGKCEEYCQEKLYTLRQVRL
AQHVDKDKVQRVWLINDDIRPDQETIDKFKGTRLVLANGKDLLKEFPAEN
KREDHIYVVDPMGNLMMRYPRNADPRKMVGDLKRLLKLSHLEH
>NE1641 hypothetical protein
MNKPSSARLIILAFLVSMLTNTFGWSFNGKVFTHELAHHHYRELFLMYPD
AHLELHHALDDSVDLDAATHLCLHAAGQFQPFYLPASLQINTADVREMTP
EIADSSFPETIPDRLYHPPRLLS
>NE0707 hypothetical protein
MARRALALAVGVLLLTSVWSIVVRLLYVLPTTAIERLDDTRFELQRLQTL
AAENSNLTTDDFTRIEQSISTLVFPSSNDNAAFVDAVNMLIHDSDVQLLE
LRTADPFNDGNLTRFALDVRINAPEEKLVHLLKSLERHRPLLIIDRAVVL
ATAAASDGTSPPLSVELRIWAFAAEY
>NE1360 putative similar to abortive phage resistance protein
MADRRRKLKRPMGERRYKKLFFIAAEGVKTEPIYFGIFTDETSIVHVSYL
KGKHDSSPPQVLKRMTDHLKNKELKSYDEAWLVVDKDQWTDEQLTQLYQW
SLQQENYGFALSNPKFEYWLLLHFEDGVGIKSSHDCTDRLKRWIQDYDKG
INMRKISQEQINDAISRAKKRDHPPCKDWPRTLGQTTIYRLIENILKSSK
GFVK
>NE1693 putative transmembrane protein
MNGYETLIATLALTMGSSWASGINLYAALLILGLGGATGNIALPNELAVL
ENPFVIGAAAVMYLIQFFADKIPGVDSIWDAAHTFVRIPAGAMLAAGAVG
DVSPALEIAAGILGGGTAATSHATKTGTRLMINTSPEPVTNWTASISEDL
MVIAGLWTALNHPILFIILFIGFIGLAIWLLPKLWTLIRGLLMKMARFLR
ITSPPVSTGDSVGQEEK
>NE1973 possible proline rich signal peptide protein
MLLSPATLAAEKGHIEIESFHVRKSGESFQIDVEANIDLSRTMKQALKKG
VDLYFVTRLLIMKPRWYWLDEEVARSKERIELSYQALTRQYRLTQHGQPR
NFPTLKAALQALGHQPDMLIRENQPLLPDTTYTAILQIWLDISRLSKPFQ
LEWLDTEDWSLSSQRKIWQIKFPPASDAGNESGLH
>NE1598 hypothetical protein
MNTINANDLKTRGIAAIEAQLEEQPEAIIAVRGKDRYVVMQLEHYYYLRE
CELTAALAETRADLAAGRCEQESPEAHLARLDTLK
>NE1571 hypothetical protein
MAATATSASTLIIPVENQVRELDAKLLLACVAAERGFPVIIGSRAFVHFE
IASLPRGIYLAKSMRSLSNSMFRIIRMLGHEIVAWEEEALVHPPADTYYT
LRLSPTTIRNVSHIFAWGQENVDLLQHYPQFPENLPVHLTGNPRGDILRP
EMRAYFAAEVERLRNLYGDFILINTNFTDVNPFIPNIGLFIPAKDGDKKS
RRGQAGIGMSEEFAEGLWHHKKAILEDFRQLIPALEQVFPDVTIVVRPHP
SENFQVYHDIAARCQRVKVTNEGNVIPWLLASKTMVHNGCTTGLEAYALG
VPAISYLATFNEYYDYDFQGLPTRLSYQSFNFSELQDTLSRILNGDLGAP
GGEERKTLIDYYLAAQNDRLACERIVDVLEESGYSQSQPPARATPVYLAG
WALANLKATLTQLNMRRPGPNRLSYHDHRFPEIPVEQIEQKIARFGNLLN
RFDSIKVKQHSRHLFRINSSL
>NE1076 putative CcdB-like protein
MARFDVYVNPGSHAATTPYLLDVQSDLLDVLDSCMVIPLRSLEHFPKVKL
PGRLTPVVTIKGQDFLLETPKMGAIPRRLLTMPVLSLRDMQPEITSALDF
LFHGY
>NE1561 hypothetical protein
MRLLIVFLLLLPLPSLALPQCGSEAILQAQKLLSFHVDGDDRAHVDPKAI
ALPSIRNPANRKQKFLVLEVDGTVYKSKYRMRLIYYPLGSECVLMGQEIL
ELASL
>NE0888 hypothetical protein
METSPVFIDYVTIRQEYFGGGLPVLNDGKVLKVDADGEIEYSTDVRCIIE
GSYDSRVQVRCDGNTVEFTGNISRYGRRDNLFGYDWPTTIARINDLLSLL
GLPPFTSGKLYKFADTGWSWSGARVSRIDLTCNYSTGSKESMHAVLCHMA
GQHVGRQKGSLSPDSGTVEYGRGSKYVYGKLYAKYQELEKHRSKKSGSHV
SDDVIDYCKTEGILREEFTLKSRFLLQNNLAYLGAITQDNLNQIYADRTQ
LQRLEDMKYENFNDLPKHLRSTYASWKLGLPLDISRATRYRHRTELLAYG
VDISIPNNVHHLPSRVRVVELKPLTAPDWYIQNYG
>NE1791 hypothetical protein
MEDLLNPSWNNEERNGMISKECFLHLQKKVNPASHWYLTEDKIAFHHHCI
KNNLSVPELVAVFDPNGQSYWENGQGIETKNNLLDGLARYPFDIIMKPVY
GYHGKGVSALDFVDGVHRFTTDLSLSLRDVFKKILAENPDRYILQKRLYS
HQAIAEFTGNTVLQSLRLITCLDENGQPKLIIRKIKFPKQGNLIDNFSWG
ISNGRLCLIDEYGKIESFIKYDHIKKYLVRYDYIEDISGKKTEFTIPFWN
QCVVLVLNAQKAFAPLRTIGWDVAVTNEGPFLIEGNVFWDPLTPQEGSMQ
AICQLLMALNAPLVN
>NE1965 hypothetical protein
MEDTAPITTLLYSILALPIAFMILYWTKVRKDRRRNETGEIEYKSLGHAL
TFFIIEGLALVGSLSILITAVSGIVRYLIYVYA
>NE0052 hypothetical protein
MKKEQEMLRVNVVMITVALLAGCTMAKRPLPGPAQPAPDPVVQQRPSGPL
PPSTRPAYNLAGYPKAAQEGYVDGCETAKQSAYGFKDKKRYAADTQYQMG
WNDGFSICRGKHQQN
>NE2146 hypothetical protein
MYDDFRKIIIRRIVYRGQTGPEGDPLKTPPANKWLFYLTLIPAVVVVVVL
GAFFFSIVLALFVAVAGVIGARFWWLRRKFRKSMSAAAEQKNSMIEDAEI
IEIRENDKSDRGHH
>NE2535 hypothetical protein
MNDAENLTKLLGHLPPAVFREFMADEFSLAMPDLDTKKTKKEQREQMEVA
LSALGVSERQRIEEVAERIVLLSDGAGQDVIDGFKDDIFDDAAREAFAAI
PNQYQRALWLHVNEPVIFEEALNARQADVFRQSASCLAVLDDAAAKTAFH
QTVAQQLGCSDDAVAIQIFKRLRPDTQTGEDVDLYQISIHHNRPPEIIDC
VQASELVPQEVIRAVSSHITYEPANGHLEVLSKDTDGREALARIVADSLL
QSPITGEKIPLKQYDYQSLAAPRNFDIASEPVTSVKVVELGYSAANGRSL
LVKTWTKDADDIYTAARSLINPTFDFRDHHLNYAKLSIKLKKVGKDRARA
ITVILRDDNKCNIKTKREKDQALWAGVLNFWFIGRVFDRVRQ
>NE1162 hypothetical protein
MDLAQKSDAEILAVATPIMDNLMDASTAIDYERHTRDFTERARSVLSEES
LQSICEHYQSTKGFFAKREFVAAFRRPDSVAIVWRQQFTKQPGEFVAELI
LVQQGGKYLVDHVMVF
>NE1472 possible transmembrane protein
MEARYVRGRQGLQWILSGFYFFRMAPLNWILLCFTYLLIGITLGLIPLLG
SFIGILTVPVFVAGIMVGCRKLDLSGKLELEYLFYGFKKYTVPLITIGGV
YLIGDILITGIFMLLGGDAVVDMWLHGKRFSENELPGVMDDLLFASLLCL
LLAIPLMMSIWFAPMLVVFENMPPLIAIRKSFFACLKNLFAFQIYMAILF
VLGMLAAMLYGLGFIIWFPVAFASVYVSYKDIFHYEQDEDTQPKSDEPST
EENKEDSSQTNEH
>NE0367 hypothetical protein
MSADNLLLDFTSPGAIPPDPAEVVRRVVDETQTTMRALESLLENERIEDM
TGWRLLAMFYLATDRLNDLAKIEKQYKSITGVSLSADLKQKYPQWFNGEA
VSHPVVFEIPKKITAAALPDSIIIQRGQCSPGGILLDFSQVQEIDNDGLK
KLAQLFSSLAQENTRPKLRQADRFITCLQNKAETGTGTRAIWDVLFAYER
FRDDREAFEEKAIKFAVLYGISPPSWE
>NE1551 hypothetical protein
MRTKYILELFDTSEQHKPIARFESSTPFTAASVGERFDDIGWERLDGAGK
IASPLSPKRYTVHSAKHLVIVEAGALVIKYCLNLEPFSGPSSPVWGDE
>NE2512 hypothetical protein
MTKLALFVRLEAKPGQEAALADFLASALPLANAESGTTAWFALKFGPSTF
GVFDAFADEAGRQAHLNGQIAAALMANAATLLSSPPNIEKVELLAAKLPA
>NE0726 hypothetical protein
MLSLCRIIISLLLVGLSGQAAADISNSPNPYDAGYGFDTPDEAGWGGWMR
GGASTLYAEWDTISDASYGGSGDRTAAPDIGTHNVADAYLSWNPGVFVTS
TGNLITPSVVQEFFIRISPVSLFSGPLVVALQVEMWGDEPAAPLLNGLAA
SSWTRTFTGTSVTDHDLNQYLGLWYFANTVNHFEFDLTNQPFISLAQVAV
DIAQVSEPYMLAIMLTGLILIGSMTRYRSRPI
>NE2174 hypothetical protein
MNGIDWLLDTNFILGLLKSNPETLSMISNQQIDTRRCGYSAITRMELLGF
PGLTAEEEILISGKLACLQYLPLTKEIEDMVIGLRRSHRVKLPDAIIAAS
ALTCNAQTDP
>NE2545 hypothetical protein
MGQKMKNAPVYFTIAQVRHNPVLRLGSYAPDIQDRMRKAGYPDFKKGIAM
AFTLAPQLGDAPQTQPPVVEQVERLMFFSTDSTRGFIVEQNALSFHTTEY
ETFEALADEFMRGLAIVHECVTLAHSERIGLRYLDAVVPPGGETGLAEYL
APGVLGLSSRLPEDVTVSHSFSETHIQTAKCAVLARTIIQSGPLGFPMDL
QPIGVKVADRFREINGVHAIVDTDASIEGRHPFNLELIKSQLQVLRDGVG
IAFDATVTPIAVSAWNS
>NE0583 hypothetical protein
MKHRTWIWLYLITLPSLTNMVHAKETLPDNQNGQTTAVDSGSAASKEVII
QTAANQQTESTKPTFIAGNAPSSFYQRARSYSTHPESDPPRYVRTLSKTG
IDAFKNLYWLDVGLDYRVRYEHRHNDIRRSRITTDDPVLLRTRAYLGIKE
ILDPLRFVVEFEDARRYNGKFPKDNRDWNEFELIQTYGELYFKDALGRDD
LGNYRPVRIRGGRMAWETLDRRLLGSNQWRNTTNNFEGFRVTLGQESNDW
EFDAWGVQPVIRLIDKFDRRDKGQWFYGAIGHFRQWSKIITIQPYYMGLI
QDDDGGTRVKREIHSPAIRAYGVVPNTEVDFDLGAIYQFGRDGGQKKSAH
AYLLEFGYTFQQAAWKPRVSAFYGYVSGDRDPNDRTNNRFERFFGFARPW
SADDYIIMENIQAPKIKVEFQPHPDLQIDGGYNGFWLASKTDRFNNLLNG
SGNNRDRSGNSGSFIGHSADIRARYKLTPHISTTLGYSHWFNGGFIKNQQ
LAELGETTAGTDFFYVEVAISAFK
>NE1559 hypothetical protein
MFKPVVQEEVTGCGIASVANILGKTYSEMKTIANAMGIHASDQSLWSDTQ
YVRRMLSGAGVETSEDEVPFESWDALPDLALLSIKHHQEEGKAFWHWVVF
KRMDGQSFVLDSASYLPSNIRQDFDAMQPKWFIEVKNA
>NE0918 hypothetical protein
MNEEEETGRIIARLLDRSLNDVTPGTLYRLQAARRAALEHYQPAEKVLHA
GVGISAQSGYHWLSAHAGRLLLTASLLLFLAIHSYWQMNNRVDDTILTPV
ILTNDPPIGSQEIEDTANGYEAADEDIVEETDSREDTDHGNYGGEADTSE
TESSTNGTADSDDVTRSFDSTEIQETENTAEAPYTTNYDQDSVTEEDTGV
ISDHLQNSEDIIDSYDTENTQDSTATIDE
>NE0798 hypothetical protein
MDKFIEQVSLYIQEAPVWPFTLLGFILVVGVAVDIINRRRRTAAVEYFDL
AFQEELTGLYPAATRWPDDLAAYMQPRLPILRDAFEVLRNFIPQNQLREY
NAAWNRFYQFSRTGGNERPVSLEDAAQELAVNQPDLQQQQAFQQMISDLL
AFATQFKK
>NE1916 hypothetical protein
MEITCYRDPEIRREKRTLPATTYNLAIKLLARCETKQLFIPIRSMQYMAI
VDAEEFVFVDSQRKCWIDIAWQNFHSHEREALNQPIEYDAVFYREDQTDI
MQRLQIEFPLALSAMMAKQAPHKLAKVISFRQKPPAENPKQ
>NE0571 conserved hypothetical protein
MFKRVGIFVALVTLSLIFNQPATAGRTVEDFQVWGNITALGNFGFVNPGN
PDLKKFRWWMEGQGRFGNDSSQFTQAIIRPGLGYAITDKIIIWAGYAWIP
SDEPLVPKSGLPFDEHRIWQQVTWADEFSFGKLSLRSRFEQRFFDHNAPV
SGSDDVAYRFRQLVKLAIPVAMIDPNLTFIIQNELFIGLNTVSNPGFISR
GFDQNRAFVGLGYKVHQNATVELGYMNQFIDRRHNPRPDQMMHNFAVNLF
LNF
>NE2224 hypothetical protein
MKRNALIHSLQTHISDLLAIYAFGSRIQGTARLDSDLDLAVLVAGYTDPL
ILFEVANELADVAGYAVDLLDLRAASTVMQYQIITTGKRWWTLDMQAALF
EAFILSEKTALDVARAGLLADIRQRGTVYGR
>NE0369 hypothetical protein
MNLKKWTDEELVSTRDQIEAWCAKYAQSVWDGRKGYLTGLLGVFGISTGV
VFLMFDGIEVVSFVPILLGVIVCFTWWKTKQQHKKNNGFLEEIKEEIARR
AKKMEKIEKNKPQSNHAVLP
>NE2243 hypothetical protein
MAPASFSQITGQSVGKIGVLFLLLMVLWSANLCNSSDITPLSANRANVGQ
MVQAFTFDDLFKEDGPSSDLPQRVFRQTSSWRGFSQLEFAETIASPKHAS
KLRLRSELSNLGQLSPNVKWKLSARIDYDAIYDLSDFYSRQVRRDQRFEL
FLRENYLDFSIADFDVRVGRQHIVWGEMVGLFFADVVSAKDMREFVLPDF
DILRIPQWAVRTEYSKNDFHADLIWIPFASLDEIGRPGADFYPFKLPVAA
PVSFLKEDRSGRNVAHSNYGIRLSQLTNGWDVSAFYYHSLDATPTFHRIS
QPWEPLLFQARHGEIDQAGGTVTKDLGSAVLKGEFVYTHGRRFNVTRPTA
ADGLVRQDTIDYALGLDFTLPSDIRLNLQFFQRAYLNYDRDIFQDRLENG
GSIFLQGDLWRDFQGQILLIHSFNRNEWMLRPRLTWNFARNWKLAAGADI
FNGPPTGLFGRFDSSDRVYTELRFSF
>NE1344 hypothetical protein
MLLLLTFIESPIAGIPEIRKHNCLVLPHIFVCLVITKSFGRVVTSGAAVT
KRAAGYFGYLAHCAADETTWYACEQFVIGHARKAVTHFIYLA
>NE1750 putative pre-pilin leader sequence
MLNIYRRSAFIYTQAGVSMIEVLVSIIILSIGLLGMAGLQTAGLKSNHSA
SFRSTASMMAYNILDSMRANRVVAGAGGYNHSLSEEDASETETKVEAEAE
IPEDIKNWLKELALRLPEGLGSIDVDADNKVTVLIQWDDSRGAATAQQFV
MTTRL
>NE1907 hypothetical protein
MNKKRMVTMFVLVGACAGLSACATTGEVEALKSRVDALESNVSATKSDAA
AARAAANDAVNIANQAMDKANEANARSIDTETKIDRMFKRAMHK
>NE0109 hypothetical protein
MDILSLVAGFLAGALTIYLASYFMKPSENKTQKTVDAGTINLFDQLWQTH
ERLLNEMKQDVENPDFKFHREFYVLKKGWGWERWGFHRRGPCIAYFLEDH
SDLLPLLDSLTSYGLISQTGETGKNTARFQLSEKLVELLRGKNTNKS
>NE1363 conserved hypothetical protein
MRFHHAHGSLFPRKILICIKARINPDLVMNGIPLCPKLYHIVHVDRLSSI
LKDGFLWCDVHMAQHIPVGTTIGMNNIKQRRLQNCLNSYPDLHVGDCVPF
YFCPRSVMLYLIYRQNTELDYKGGQGPIIHLEADLNAVTTWAKTQSARWV
FTLTNAGSFYFEDRNDLTCLKEVNWTAVHALNWKEHKEGKQAEFLIEQCF
PWNLVERIGVQSEVIYNHVVNALPVNGHRPKVEIKPEWYY
>NE0083 Proline-rich region
MRHFSRLIVLSAAVSLVACVNIPLGPSVMVLPGVGKNFDQFRGDDYLCRQ
FANQQTNYETPKNSAVSSGMESAALGAALGAAAGAALGGGRGAAVGAGMG
LLGGGLAGSGTAQSSGSISQERYDIAYIQCMYANGHRVPVSAGLIEGSGG
NMGQGVTSNPSSSGRYIPPPPPGHPPPPPPY
>NE1042 hypothetical protein
MVIKWILLITLVFLIFWFFKQFRQIQRKPPDTTRKVIEDMVRCAYCDVHL
PKSESIVEHGRYFCCTKHRQLYSQSQPDDK
>NE2429 hypothetical protein
MDEIIKQSPVLWVLSAVVTGFIAGIAAYIGLLKITNQETIIKGTYEPKKN
LVGRVLKNEVLIECGKLIELAGRIDGATMPDKVEAYMTQTLIFLEGLDLP
KVQQYHQLKMSWPAYTIQLLLVNDKLSSSQKLGRA
>NE0740 hypothetical protein
MDETRRDFIKGMFASGTFLALGVPGIARAASVGPLFDSTRNCRLLLGNTT
GAESFAKGVQSACFNHGSRHHGALPVFRFESELSTGFLHLVDLLMQSRNT
RWIAVMDHADAAIFTELIRNSAAHLLASGSHTFAAGDHAALPLRHVWAAA
SPAYSTGGLLASMLAREQYSFSIVERFLTQSAGESIENAALSLPEFLPYH
RADQPVTRLYCAGVPLPEAGRLLGWETSKNQESLFSRTIASTASRNETAG
STTVEYPQSGDWVEATGYAVAAAALGMKINRESCSERAFVYRSGQGHPDH
KGLSGVNFASFVIDV
>NE0256 conserved hypothetical protein
MEFNEYELQRLFGHEAAEDEDPQRLKDYYFKSKVYSQVVNDLPLRIIVGH
KGIGKSALFQVAIDEETENKRLTVLIKPDDIIGIGEDTDDFLKLIRDWKI
GINAIITQKALTSFGMLFEGWRGKLNQYGGTALDFLSSTLKLEGKVSLTA
SKEAILRDFLKNNKISVYIDDLDRGWQGRKHDIQRISALLNAVRDISTEN
RGIYFRVSLRSDVYYLARTSDESTDKTEGSVIWYSWTNHEILVLLVKRIE
SYFGREVDEAELLKKHQLELMRYLAPIIEEKFTGKGHWRDAPTYRVLMSL
IRKRPRDLVKLLTLAGREARTKDAERITTNHLENIFEEYSQGRLQDTINE
YRSELPEIEKLILGMRPTKIQRKASQGYVYTTDQLLKKIKAIEEQGKYRW
ANRNQVDTKELAAFLYKINFITARKQIPTGIDRKYFEENRYLSNKFIEFG
YDWEVHPAYRWALQPEEPMQIFNELELSSS
>NE0495 hypothetical protein
MSLFSPEQLKKASHIKRLSPVLFINTYLAHCKISQRANRANPAEVQTASP
VELDCLNFHGLRESQRDMAIQSLYRLSLWIAALRTPKTQQIQPPQSLRTL
QG
>NE0508 hypothetical protein
MNRALIAVVLLMVMGSVSGMRINPDPRVVVSFMDKSIAPELDILRVMADI
SPDNHHLVFQVKTRGERIQGNDHDYLLLHITHGKTYVLLLPINKEKENQM
LVYERLPQPDDDDLLILGKFKGNSHLTNFNITSIFRGGEFSVPLDWIDFN
TNFSFDAYTVQARIKGDTLKISKVYDWARKGKTHNNEKPLSAITLLNKIC
APKSNNQRL
>NE0117 possible A. fulgidus predicted coding region AF1859
MSTHHMAEFSSPTLSLGRYRLDWQVTRSIRLPDYAGSMLRGAFGHALRSI
GCITREKDCTTCPLRRDCPYTILFEPVPPEHHPLQDFSRIPVPYIIEPPE
WGTRVLHPGDTLSFHFTLIGCALQELPIAILAWRRALARGIGPDDGTAEL
TSIVLEQPDSSIPVYTPENGQIEPHETRLACPPPPAATIRLHITTPLRLQ
NNGVPLKAQTVTERALLMALVRRFALISEFHGEAAWQPDFRHLGELTSSV
TGKRQLSWRDWQRYSSRQKQKMALGGLTGRWDLHGELAAFWPALWFGQWL
HAGKNASFGLGRYRIIAA
>NE2160 hypothetical protein
MVKRVLMIAYHFPPLHGSSGMQRTLRFARYLPDHGWEPIILAPSPRAYQQ
IDSGQLADIPQQVRIHRAFALDTARHLKVMGRYPRVLALPDRWVSWWLGA
VPAGWYLIKKYKPDVIWSTYPIATAHLIGLTLQRLTGIPWMADFRDPMVQ
PDYPVAQWHNLLIRTIVSLILYNRRLQKWCLLTDR
>NE2060 possible (AF047705) unknown [Nitrosococcus oceani]
MTSIKWLGGVLLGMVCSIQVQAHGGLSLAEDMCKLTIGPYTMHFTGYQPE
STQEKEFCEDIPNIGRTIVALDYIDEALRTMTTEVRIIRDTGAEPGSEGN
LDELTVFHSPPKVYMNGSVTFEHDFPAEGKFVGLVTIRDNGTEHISRFPF
AVGTGGKPDMLYILGALALAAGAGIFFFKKKQNP
>NE0551 putative yacA [Plasmid ColIb-P9]
MSESTFTFRVDEDLKTEFSAAAKDCDRSGAQLLRDYMREFVKTRREVAEH
DAWFRKQVQIGLDSANTGNLVPGDEVEAEFAARRAATRRRLKASE
>NE1307 conserved hypothetical protein
MNSHRGQMMSKDATALLHVTCVFRHNVACNASGGLHMGTTHVNARVKKHR
DTLRMAGLRPVQIWVPDTRRPDFAEECRRQCLLIAQADKADTSMQQFMDE
ALADSDGWTE
>NE0289 Helix-turn-helix protein, CopG family
MSQITLYLDDEIQALIEQRAKASGLSKSRWVAEFITKYATQEWPQDCLEL
AGRFADFPLREEANPLPV
>NE0920 conserved hypothetical protein
MNLSDCKLSRNAFGKLIVTTKDQIHEGVVPVRSFPITAINEGIALVDGHG
HEVTWIDSLAELSETERILIEQELASREFMPEIKCIDRVSSFATPSTWQV
QTDRGETCFILKGEEDIRRLSLATLLITDSYGIHFLIRDRSMLDRHSNKL
LDRFL
>NE1699 conserved hypothetical protein
MLKFNLAVLISMLLISFFPQAYAVGWEKIEIPDNVSTILDDDGRYKDIKT
GCAFSHLPDEAGLPNKPFHFYYRKGTKAKTLIYFNGGGACWNGATCLTSL
TVPVTQTTRPAYNPSIENENNPEELGGILDFTRADNPLKDWNMVFIPSCT
GDAHLGSKNEVYVDPSGIINHGDAVLVQHRGFDNFMAVREWLKHRADRPG
TEQVLVAGSSAGAYGALMNFPRLHSIYPDKTKISLLSDAGTGVFTSNFLN
TVFEPDGPWGTEHTLATWIPGINRIGSYNALNFFTSLATGIERHFVNSKF
AYVTTAWDDVQMLFLNIMRKTGQGVNDPNQWFNLTPVTAVEWSLRMLTTL
HANALINRNSKYYISAGTYHIGLVDAFAPGVFYTEKSAGGIYLKDWVNRL
VTDDRNYPLINLMCSGTCGAPFPP
>NE0879 Esterase/lipase/thioesterase family active site:Lipase (Class 3)
MKTAFSRLFLVMLTAILLSACTSNEIYRSNFSNCIVTAQESCESHAIQLH
DKGTEREYLLGFVEIDDQGQLRNRVQMQALLNELYTLASKESLLINVFVH
GWHHNAKQGDANVESFKLNLAELSKVESHLHQDRTPRKVVGIYVGWRGES
IDIPWINNVTFWDRKNTAHEVGYLGMAELLLRLEEIRNIKNTQEPPVKSR
LVILGHSFGGAAVYSAAAQILADRFINSAGDKNYVDNAEGFGDLVVLLNP
AFEALKYAPLYDLAQARCSYFQDQPPRLVILTSEADFATKYAFPAGRVFS
TFFETHSTIKRNDCNRPLSYSEGAADRQAVGHFEFLQSHELHPASKSMAA
VYHQAKAIWKNQQPGEAIQFGSTELRNLEHTVIHNPYLNVKVDKRLIENH
NDVFRPEIMEFIRMLIVLSTEE
>NE0086 hypothetical protein
MKLLLSIALFFWVGMVAHAENLPERIDIEYVLNGSIGQGKAHEILRVRQE
NGVQHYTIDSEASASGILKLIKRGSIHRHSEGTIIPHTGMKPFRFTDQRG
EKPAREVEFDWSEQRIIYRRKGQEMTENLPSGTLDELSLAYHFMFTAPPR
QTLVVHETDYHTLQTTRYTVTREMLDTPIGKLATIVLTKQREQNDPFKKK
IWLATDHHLLPVRIISTEKHGLEVDQIVTKINYSPLVNSAR
>NE0710 hypothetical protein
MCISNSDWISLGSAIATLLGVGVALYASWQQMKKMNNQLVIQQFSDYTKR
YQEIILHFPENINEQTFDFSKDTDKNKTMRYMRAYFDLSFEEWHLNQRKL
IDAKTWTVWEGGIKTALSKTAFINAWLEIKKDTGYGQEFEQFINASLPTN
QNLTNHSSGTPNGAP
>NE2175 hypothetical protein
MERGMTMSTIERLYKLSSTLPPAALAELLDFAEFLHQKNMLPQPDEPFRL
IDMAGGLEHSACFAGEPLAVQEALRREWD
>NE1157 hypothetical protein
MTVNSKPAIIMNQTQLKIRIHLDQSADHPKIEQNLPELPEESVFPPLPPV
YEYNWPRIIGAGLVLLFVLITSIWIAADWLSDDEKLETSSTEISLSAVSP
ASSETPSTEPVPLLPSVGFSSDQPSENNPAIGNVPDGDIGSDQAVQAPDP
QPQRAAPSASAPPVKPGIKPDITIPQAKTKNVSQGSNHSSGLIKAQLTSN
IRQRQPVDNINQISLGSKSSRPIFLFLHLNKFRGEKILINWYYRDQSIAR
IVLPVGNNDWRTYSSKVLNRNRLGPWHVTASDQAGNLLAEFKFRVTR
>NE0706 hypothetical protein
MAKPMFNRQTTILLVLNTCVISAFVLIHWIVLDHPDDLSTKSRNVPSETT
PLSIPMETLPNALEQTLLFSSSRTRTVISSPESTVPLDTAPPRLVGIVEE
EGHKRFALLEDETATSRKLVAQGDTFETWMVVSVTSDAIHLRSRSDINDG
VHPSSDIELRLRPSVPPSQNFNP
>NE0574 major membrane protein I
MTDIHQAQTALGDVAARTLANATKTIPMMGTITPRWLTHLLQWVPVEAGI
YRVNKVKDPDEVEVDCSNKDERELPATYVDYEEWGREYVLSAVNTVLDVH
TRVADLYSSPHDQTREQLRLIIETIKERQEKELVNNKEYGLLNNVARNMK
IKARTGAPTPDDLDDLISLVWKEPAFFLAHPKAIAAFGRECTRRGVPPPT
VSMFGSQFLTWRGLPLIPCDKIGITGGKTSILLLRTGESRQGVVGLYQPG
LQGEQGMGLSLRFMGINHKAIASYLVSLYCSLAVLTEDALGVLENVEVGK
YHEYK
>NE1303 hypothetical protein
MWQTICRGAIAGGVLLLTVQPVSAAEPDPAQVEKTVQAYIGKANQEAAQN
RESVEESQVVTADLNGDGRAEIILWSTRYGGTYSFNDVTIFTDSGRGYQV
AAGTEDVLGMVESIEVKNGLIHIHALWPGPNDPRCCPTVKKTAVYQWQGK
ALADVTSRVSGKK
>NE0496 hypothetical protein
MVFPAKSRRFIAPGLRYRQHATNLIGADGGFIAKFSRFGLMLSGRYRQWN
ARNIEALSGLRRN
>NE0232 hypothetical protein
MTNALIPDPVSAGEVMVTDEGHVEKFLNGIFTCEVPAVAVPDYLRKNTVI
GRFAHDVAAQAEMPFGTVFVTILGAASVPAACSFTTRFESGYELPAGLFT
VCEQPPASGKTRVLNYGLHAYQAAIRDLNKQIHEHNKADKQNQKPYFIDL
ITDGTAAAIDSKLAESKSGRLPLASSEQGLFRSLFPAEGGFHSNNDLLLT
GWDGGWVSGARSTRNAFTGRVSTQVVMFAQNGSIRRVLQASNGSGLTERF
LFVAERSLLGRRKFEPHTVDASQYDKAATRCVERMAAEKPPIIIEPCKDG
RAYIRQQRIAHEAELGKLERAGEAVMVGWLGKFENHVLKIASVIHSFEFM
QNEEIDFSYPVQIPLATVEAACELVMSLHEHMRAVIDAAGESGLQTATDS
VLSILRENKAPMAVSAVTAKARRRKPFADMGRDDYKASKAFIDTLIVKGI
LLKSHNKLSVAE
>NE0900 hypothetical protein
MISKPGRIVGLGIILSGLSACATYQPVLYPNSYYQSVGKVAAERDIRECR
QLAESAGAREGSGSTGNTARRTAIGAGAGAASGAVGGAIAGAAGRGAMVG
AASGATWGLLSGLLGSGSASQPAPAYMNFVNRCLREKGYEVTGWQ
>NE0602 hypothetical protein
MEKELEDRKLAELMNSIRHYSTLRFAMLTVYFAVTGGLLVKFFDCDFSVR
YPELHGLFQIAGSMVTVAFFIFEVALDDNLRKLWGSVKKLAGEGDVLLSH
RQLWKGCLVPMATYGIFVGVLIFWLFTSRNYYPCQAAAHKAVQSETVISK
ECRK
>NE2247 hypothetical protein
MIKRFNLTICGLGLILVTSHFFPAGAQELILYVDTATKQVYTEPGKNRIK
LGTFQQVKESPVQSQSKPDTESSQPVSNTTTGLAQGEAKFQENSGQSGAE
SDIRRKSEEIAAHSNEPSAEKPKEEKKWYDRIGIRGYTQFRYSSTVSGDK
DAVSYWPDKSVGEDGSFLIRRARLVIFGDINDHLSLYIQPDFASTPSGSS
TGHFAQLRDAYADIHFDKNKEFRVRVGQSKIPYSFENLQSSQNRLALDRN
DAMNSCCRDERDIGAFFYWAPTHIRDRFKEVMAKNLKGSGDYGMFAFGIY
NGQGANRLEGNNGVHMVTRFTYPHQFSNGQILEAGIQALRGRFVPSTGPA
GGFTPVMDAPEKGFKDERVGVHAVLYPQPFGLQAEWNWGRGPQLNDGQTM
LTESSLNGGYVLATYKIDGLRWGTLFPFVKWQHFKGGQKFERNAPRNHVN
DLEFGLEWQVMKEIELTAVYHMMNRTNVASAPYERYKADVLRFQLQWNY
>NE1441 hypothetical protein
MTMTKSYYTPILLVAAGFVLTGCVTINIYFPAAAAEKVADKIIEEVWQTD
GNSGKNDRSGNKPGNDVSDKTDSETGKTKP
>NE1550 hypothetical protein
MLALFKGDYATNEPDYRTSSVVYITLGFSMFRPPRCYRKNFIVPLYRLKK
ATPPVIWVVMFLLCCFMLPFGIAIKIMHPGFFATLRAGLEIAAENQLYGA
VFVTTFLVLGSFLMFIVNGAFVSLRVAAGYEVRRNGTILKWLINKVRRAY
KILNKPND
>NE2142 hypothetical protein
MSDRVIRMNQQSETDAQFDPDLPTANAVMASLCCVAAQYASRPSTELAKL
ALDLAYKLTAPQYAESELITEVAQQLVRQWKQVLYQQVQARAAGMIIPGN
RFIN
>NE1601 putative transmembrane protein
MVTMSCRQERGYIYIWMLFAVMLAGVMLAAAGLIWQTEVKREKELELLFA
GDQFRRAIESYYNDSQVSGRAGEAGASRYPASLEQLLKDERSLVVKRHLR
RVYPDPMTNSYNWGLVRQQDGGITGVYSLSTGVPIKRANFPADYIAFEKA
GNYQGWKFVHAASTAGGQEKQQAEGRGNIQGDTSMPGLPGTGFNPLPQNQ
PAPNLSPPTGNDAF
>NE1883 hypothetical protein
MKPRYVVDTNVLITASVADPVAPKDIDATPQDPALRFRVWQWLVEFESSP
ARLVLDSAGKIKEEYDRKLGFNDYGIQVVIHKWSMAAVDNVDVQYDTDGH
AVLPPPLDSVVHDLADRKMVAAALEAQKCHGESAIAFAGDTDWHDWEQTL
IQAGLSIEPIIEGWSRAKHAEKAHRKQNHD
>NE0237 hypothetical protein
MITNFRFIHPPYKQLYAYVERIVMPLDIEKYRKYLAPLNLGKDHEEEIIR
HIYMIMDEFISAAFNKHPVQQALQAKNRKTLQGQSDVIDSKDRSIQSLYQ
NVASRPDE
>NE1014 putative transmembrane protein
MQMGKENEKVTSALQNSDMEKASMYQVAEAVLFSFIGIRKKSDLEHDAAK
IKPVQIIIGGLVGGVIFILSILSVVKLVTG
>NE1248 hypothetical protein
MARLIEDDHSNLIGLAGVLPRSWLICSASASQSTLLLRGRWMLAENRISD
GSQVLLSAPGYNGFQLRHYACMAARIRITRRTTSGTLRMVMDGIVFILPQ
YMQMQRYISMKSIAAILIYEQQLLTRAHEQLQQDLYRTWQIR
>NE0938 hypothetical protein
MINRSRLRRLIVISLVSYTVVLSLTVSLHGYLVNEYIEELIWESMLESEM
AYIKRKIAQDPEYDWSGLDRFHWYDEHRDSSIPPQFQALPAGMHDEVRID
GSEFAILIEDGPEGRKILALDITDLENRELMIAFAIVASTVLLITVLTLL
SFYSVDRLLRPLTRMADEISNLSPDGEGPKIPIGDKDAYV
>NE1680 conserved hypothetical protein
MQIHVYDTYVKAKDGHVMHFDVFTDVRDDKKAIEFAKQWLSSIGEEGATV
TSEECRFCHSQKAPDEVIEAIKQNGYFIYKMEGCN
>NE1380 hypothetical protein
MKIINIRENFSRYPAGRYRADGPYNGEKFREELLVPALSEAIDKGEKVKV
ELDGVRGYNSSFLEEAFGGLVRSGKFASTRDLSERFEFVSTDKSLIEEIR
GYMEEATPAVAQ
>NE2427 pyrimidine dimer DNA glycosylase
MSHNSRPVRHHNMRLWSLHPKYLDPQGLVALWRESLLAKAVLRGETRGYT
NHPQLERFKAHPQPHFAINFYLAAIHAEATERGYTFDSSKIGPVCSVQLI
LVNSGQLSHEWNHLQHKLATRSPIVHARWSDLASPICHPLFHPQPGPVAS
WERV
>NE1189 hypothetical protein
MSKLLFKLRNVPDDEAEEVRALLSAHQIDFYETSAGNWGISLPALWVRDE
TQYSQARELLDVYQAERSAHVREEYARLKQEGKHKTVLDSFRENPFAFIA
YLFIVYALLYLPYKIITGLSSQ
>NE1537 BNR repeat
MKKSTYSFPTLAGFFLTLASVVAIAAGPGTGDPTQQPSVMQASKSPKTAL
AVGVTLDQEGQLWLAKVIDQRLLVSRSEDDGKHFSESVTVTPVPENIGND
GENRPKIQVARDGTVLVTWTELLAEKYAGNIKFSRSTDSGRTFSEPIVLN
DDGRVTSHRFDSLAIDGKGRVVVAWLDARDRDAAREKGEEFKGVSLYSSQ
SFDNGAHFEPNRQIHQHTCECCRTALTWTSEGPVVLLRNIFGTNTRDFAV
ASLDKVEEGVRRVTRDEWQIDACPHNGGSLATDGRGQLHLTWFTSGTAAQ
GQFYKRISGNQESEPMALGDMDAQPNHAAVVAHGETVILTWREFDGNVYS
AKMMFSNDGGETWSEPWRLMLSAGANDYPVPLISDSKALVVWNTENEGLR
VLSVERVINRSDG
>NE2001 conserved hypothetical protein
MKLKSFLNYFSSREKGLAVSILLPTHRTFPDNKQDAIMLKNLVTEARNRL
QSWPDTQEAETIMEAIDKKISTHDHNYNLDGLGIFANREGVTLINFPFTV
KEQMIVDEAFAIRDLIREINGAVHYRVLVISRTDARLIEGFNSHLVHEFD
ARTELRTGSFPMENPFLFAAKGLDRAQIPNEAANLKEFFNRVDKSLQEIQ
NKPEQERLPVIIAGDARNAAFFREVCDQPADIIGEVTSIPDLRIPAEKII
VEVQGLVSDHRRQKAETALQYIAQARNNHQLLTDSSMIFRAIDAGNAARL
FVRQGYIQPGIIDFDQKIVTLQEDSATVDAAGGTVTDDVVGTLIELVIRQ
RGEVHFLSTDQLGKEAPLSLQTRY
>NE0718 hypothetical protein
MTKAFDAFERAIRDAEDLLARHDAEKTVPNGHNGEVLKRAGLVMALAAWE
TYVKDRLQSEIDTWLQAVEGSPLGKFVRRRLEEDLKRFFNPNDERTRRIF
IDYFDVDITKDWVWENYDSSTSKKVLDSLVAKRGDAAHKANTALHASAEP
HKVKRDELEKGIRFLKGLVAATEKAKITK
>NE2345 hypothetical protein
MLLNRVIFQKQSDRLLFNASLSNTVLRILFFYCFPAVCLFVSSIAATRAG
QPPEQINQYLRINGEFLGSYENWNYFRPASAVNNSYDLWVVRSRLGLMFS
SDYADGFVQGQYSGLYGLPDDAVLPSGGALGLGAGYFLANRTTGASNVFL
KQGYLNFKFNKLGLPGAAVKIGRFELADGMEYRSGVEKFDALKKKRIAER
LVGGFNAIYVGRAFDGFSVVYDGPGFNATVSGVHPVQGNLTVQGQKQISD
ISILYAALTSKKDAVLPGIEGRLYYLNYDDQRVSQVTDNRPLSARPQLSN
EKLNIHTIGAHLLSLQPLGSGSFDALLMGAYQFGSWTNLSHRAWAFDAEV
GYQWHKLPFKPWVRAVYYRSSGDGNAHDGRHQTFFSAVPSGRLYAKFPFY
NQMNIQDIFFEFIAFPTGKTQINVNLHQLSLANVNDLLYTGLGASLKSGA
FGYSGSTTHGHREIGQLIDVTLTHGFNKYLTSQLYFAHAFGGSAMKSIYP
SKSDASIFLVNFNLVF
>NE0480 hypothetical protein
MQQVALISAVFAIANEVKLEAIQLVDCRGAGSPWPDSLAKTRCDALQQSF
INRRHCETAQLVKQSSKILLLLLPKLKWHPDQQIFTVFANANGMKQSDTF
VVITIAGSPRIIPGCRRLRLLAITSFAMTGKIRESCSCLQQK
>NE0815 hypothetical protein
MSDRVIDSIDDYLSPNSQIQIIVNRYRSLFTWFKIMSLPDRLSCLSIEHC
GCLQLIAGEILIQAVSGVRSLERPFVIDIEHGRLQARGMVGVIRALFL
>NE0372 hypothetical protein
MYKRDTRFSMVFPALLAVVLFVPFVMNFLAHAWYGEKHFPPLTMMKLPAD
SSVISLPQEIKQYKPFEITLQLNTRELARRINDIVKKSHPGTELQGIRSE
VFPEMRARIAGDAFSIDPPEPQVQFFSGQGEMSRWSWIITPEKTGRHHLL
IELHLQTAETTREHPQVADLAEIQLFVRENPEAWMRTHGIWYALFTLLAA
GWWWKKRLIKKRKAAKEQ
>NE0737 conserved hypothetical protein
MTAELSCHLLVSPFTKSAVMFNPSRDQARRLFFDTWQKYHRKEPLSGMET
IALEVILQHPEYHSMLQDVERYLDKDFPPELGETNPFLHMSMHVAIREQL
AIDQPAGILQRFEQLKTRLQGDEHEAMHHVMECLAEMLWHSQRNQTAPDA
GIYLECMDKRIGNK
>NE1055 conserved hypothetical protein
MIVRIVKRNTFIFLSVVLLSLLAVPATNIFTAPSRETIKWGEKSFLYNMD
FISRWAALLLYPVGISTDSNQVIIGRDDWLFLGDLYEETRTIDRRPPSAA
DYVSGQEIGSAIESWNRYLSSKGVKLFRIMIGPNKGTIYPENLPIWAKPS
IPNATDALLVGANTIHYVDFRSILLKSKASHSVALYYKTDTHWNALGAGI
AFQAFAQQVGKVVPEIQWPPQKIYKLNRVDSRVGGDLANFLRLTTYLPDL
EPVTYISSLAVETTQLDYDTGFILRQGGNPQVNAPNKPLLVQSCGALNQK
KVLWLRDSFGTVMSPFMAATFSEVLQLHWAEAMKPGGKFVQLVEEWEPDY
VFFTVVERASRSPWFASYPPPVLVPLGSKFKPIQTTTAVGLNHLLQGTTT
NEFQIIGNDPFFDFTVSEIIKPKEVDYLSISLSCADGSQSVPLQLFWLVD
KQPYFDEEHSARFLFRTGENLIDLHTLPKWDSAKSITRVRVDIDTQDSCV
HFKLGNPIFGVE
>NE2530 hypothetical protein
MDRIGIDTWANISAEGWSSLLLGNGASIAIHKEFAYPTLHGIADAKGLLA
TTAPIFAKLGTTDFEHVLLACWYAEHVNGALGTPSAAISAAYEEVRTALI
EAVHSVHPVHADVAADLQRVGAFASAFPTVVSLNYDITLYWAMLLFNAAN
GSWFKDAFHDGEFQTDWEYLRRPYGHAAGATLVFYPHGSLAVARDYLGDE
TKLSVGAGAAGDLLGTITRRWASGHYVPVFVSEGTSHQKVAAIRRSHYLT
NVYEEVLPALGESLVVYGWSFDERDQHVLAAIAANPPKRMAVSVFTGQPD
GDQQAFCLQVLKAVGRSLPGTEVTFFDSRSPGCWNNP
>NE0485 conserved hypothetical protein
MKRVNQNLYQLLRRIYWRLPLPEETKELLVGFARRFLRGMKKALAEPVTT
ASSASVSREKILQEYANQILAIPRKSGNEYVEISSSSYQRKEGDAKILAY
YLPQFHPTKENDMWWGKGVTEWNNVSRAMPQYVGHYQPRLPGELGYYDLR
ILDNMRRQVELAQMYGIYGFCFYYYWFDGKRLLDKPLDMFLEAKTIDFPF
SLCWANESWTRRFDGSCGEILVKQSETVESYIAFINSVVPYMRDSRYIRM
NGKPIFTIYRPSFIPECASTITAWREHCVKAGIGDIYIIGIKEHTWDVNL
IELGFDAQSEFHPGTLFKHCVDISSQINYMQDFGGIVLDYRDIVEHKKYF
LYNHPKLHRAAMPMWDNSARRDNKGMIFEGASPDLYERWLTDILLEAKNR
EDLEDHYIFINAWNEWGEGAYLEPDKKYGYAYLNATRQAIEGVRS
>NE0099 hypothetical protein
MCARIQQLPVFFIFILYGSCATTKRNIYVLAINKLMEQQIRKYESFNDLP
ADCKKLFDSGEKDSFDLSRDWFLLLETTVIRQTKEICIFTLEIEGVTQGI
WPTLLQKKGKLSLRQISSFTCFYSSLYQPLISSSLTVDKLADCLRWILSD
TRTDVLRFDIMDPSQSSFNLHEQALKKIGFKTDRFFCWGNWYLPVNNQPF
SVYLQNLSSRVRNTLERRKKKFLAGGHGKLEILTTHDKLPIAIQAWEKIY
NASWKIPEPYPEFMPSLISLCAAKGWLRLGIAYYDEEPIAAQLWIVNQGR
AAIYKLAYDEKFAHLSPGTILTAHLMQHAIDVDKVHEVDYLTGDDAYKKD
WMSHRRERLGLVAYNLRSFWGLIGISKHIAGKIRKKILKSLK
>NE0082 Proline-rich region
MKRLNWLHLSLILAGMIASGVSWSYSHGHSHGYHNRSHGHYSGKRNFSLG
VGLTSTFGSYGYYNYPGSNVGIYGSFGYGRSYPYSRRPYYRPYGYGYPAS
RFYWPAYPPTVYYPPVVVVPPDPPVYIQQQPARLVPPPPESAVTNYWYYC
ENPAGYYPEEVERCPGGWVKIPPRPAQ
>NE2494 hypothetical protein
MLIPYTYVPHQMEKMQAFIDFIFHEIWCKAPASGPFGLHLFNANAELREV
MEAFYYSDAQGADFFYGHVERIYGLFSALTFVQISQFQQWYLGNNDLEKV
CANAPAAQIVRYADIATTHQDLADQLASFFKGLYSQSLLGLATLRAKIGD
IDDHYQAFVAANKMGKCPFCGIGDIKGEHHSKREAYDHYLPKALYPFNSI
NFRNLAPACHECNSTYKLSKDPAHNAVGRRKAFNPYAAADHAIQIQVALP
HADIDALTPADITMHFGPVELAEELETWKDVYGIEERYKAKFCAENDGKY
WLTQVLDEWKEDGRDPADFMTTLVRQVQKNPYAECNFLRKPFLDACQQVG
IFK
>NE0942 possible (U92432) ORF4 [Nitrosospira sp. NpAV]
MRRESLLKKHEGVVKSTGGGVVLKQSLYSIVTAFVFVILCSASTVWGHGR
VSLEEDNCVRQVGENMVHLNTYQPQYDQAGHYCTEIPAAGDTYLVVDLID
PALRNMPVSMKVFRGEEKGGEAILQVKADYHPDGVINGIGKLDKGLYSVM
VTAEGVPPLNYYYQLRVEMVDYGKLVRTWAGPAVAILFLGWLMYKLVQSG
RLRSWFKSQDD
>NE0274 hypothetical protein
MRKFSVFLLLANIFVVFYLHGRPDDNLPAQIALIHSEKIELLPAKVACLK
WENLIGPVVQHVRVEISKWESGQDHITEISRGEVTVHWVHIPPLRNARET
AKQIEQLKKSGISYLHIQENADSPWHNAISLAILPDDSDVAALVEELKGK
GVERIMDSEQVLEQFEFDIRNPTEQITESVRQLAQQFPETKLEVTECSRL
>NE2443 conserved hypothetical protein
MKKLIFLAVAIFAGYSYLGNPYTSIPDRLQHPTFSESQSGTDATIADAFS
NRKSNLQISGEGVVTKLLPDDNDGSRHQKFIISLRSGQTLLIAHNIDIAP
RIGSLRKGDSIQFSGQYEWNEKGGIVHWTHQDPNGSHVAGWLKHNGQIYQ
>NE1243 hypothetical protein
MKRWVVSACGLSGKDAMKQAAVILTGILVLLAGKSTTAISREPFHLENMT
CSITSASMNGCTGVPTLHSGILTLSKEGTFKLKARYEGCFMVENVTKSGD
FAASFFEKAMSLELLTTRTTGMKNALSDFPGTLGFAHLNYDGLDGYFIDI
SAVIRARGREMENVIVTANLQCTVIDDAARTAVKADRLSFIQKGVYK
>NE1259 hypothetical protein
MGVMKGIEFSESKLKEILDQLPIEAQSAFAASCAQRIFTCYVEYARVAKS
KKVDLDAYSEAISYVWNAVIAGNHDAIILNGLLERCMAVLPSEEDAWESG
TPYAEDAAAAIIYSLRSLASGCPQEAIWAAKRVYEAVDNFVVNTYNVNTN
ATDGEKFILDHPIVSNELSRQLRDLNEIINSKRDSESLKKTIKIIMERSK
SESNDLFSEAT
>NE2002 hypothetical protein
MKKTAFILSVILILLVAVMALIALFSEIEGDNVRHGIMGAGISTLPLVAY
CISVVRVCRGWYLVAATLNGLFFALTVVSIVIILMDDPSTMKNLLAVLLV
LLVPLTLNIFALIHIRRTDSRLMPHPDIPGAAGGKNLEGLGGWLILVGSN
VVLSPFVIAARTYKSYAEMFASGVWDVLTSPDSMAYHALWAPLAIGEIIL
NSALILAWIYIAFLFFSKRRAFPFWFIAIHIATVCLIVIDAIVVHHILPD
APIFDANTLRELSRPIGAILIWAPYMLMSKRVKSTFLH
>NE0745 hypothetical protein
MSIERVCSLCFDNEDLSDWIVNEDGPRGCDACGKRDAPTCKLSELCAFIE
SRLSQYWGSADNQLFYVSAEGGYQGRTWDTYDLIVDEIGLSFPRAQNDRL
LREILGHLTDQAWCDYDCGALDHDEALKFSWRQFCETIKHKRRFFFLSDG
SDDRDSFTPASLLHEIAHSIEVIGLIREIPAGTKLWRARPDLNKGAKATA
TSFGPPPAEHALQSNRMNPPGIPMFYLASSQKTALLETRTMESRMGKWSV
ARSLLVLDLRRLPHVPGIFSKADRHYRLGLKFLHDFAVDIMTPVARDQRV
HVDYLPSQVVTEYFRDYDFEAGRLDGIVYNSTVHLEGWNIALFANNVDLG
LSRPTWGRAPEPWLTFIKSIRARI
>NE0241 hypothetical protein
MGKKKNKKTEVQQPDPMRKNWIMENMDSGVIYLLESWLKAKSQETGKEIS
DIFANAVEFNIVLKDWGKEKLEETNTEYQNQQRKLRKTYIEYYDREMK
>NE1275 hypothetical protein
MNSTGNTSNNARLHWSHVLLIVLATIVLTVAGTYWVLTTYVFVSSFEPVI
LSKKEEKTLEQKLRTIGYDFSFSSPTAKRNDDLKGEIDEEGFLKPQAYSE
QGAKREVNFTEREINALLAKNTDLAQKLAIDFADDLVSARLLLPLDEDFP
VLGGKTLRLNAGLGMAYRNDKPVIILKGVSIMGVPVPNAWLGGLKNIDLV
SEFGMDPGFWKSFSEGVEHIQVTDGKVDIRLKE
>NE1100 hypothetical protein
MPSTPGIWKLPFNYFYIVPDLADASQRCFYGLNRVNCQPACKNNRYKNDN
CSYSIRSSYENTASQITPSLRDLDDNIYESCKRAANLLTA
>NE1608 hypothetical protein
MSLSWRNRIQIFLAPDRVDLTGIARGIRPVQQFRQSGVCVQENDSRQQWK
APLRLLEQMIGQMDDRFRRGSELHITLSNHFVRYGVIAPQPSLANPDELM
AYAGFQMREIYGERIDDWELSLSTWDPYGGALCAAIARDLQSELIMFARQ
YDTRFACIEPYLAAALDHWSKRLVEKQVWFVLVETGRFCLVVLSEGAWRC
ARNQRVVENLQEELLAALEQESIILSPDRSVERVYVFAPELTGQLPVHDL
RWQFVRLPDEKHPAPSYFPGVTGMDDSQNHA
>NE2121 conserved hypothetical protein
MTKKLPIGIQTFREIREESYYYVDKTSFALKLAMEGKYYFLSRPRRFGKS
LFLDTLAELFAGNEALFHGLYCHDRWDWSVRYPIIRLSFAEGWLESRAQL
DKRICWLLEQNQQRLGVTCKQESDIPGSFAELLQNAEAKYGQRCVVLVDE
YDKPILDNITEPEIARAMREGLRNLYSVIKGQDAHIRFAFLTGVSKFSKV
SIFSGLNNLNDITIDADYSAICGYTDEDVDTVFAPELPGLDRQQIQDWYN
GYNWTGQPVYNPFDLLLLFDKREFRAYWFETGTPTFLVDWLMQRGYFTPS
LSRQYSSLELLSAFDVDHIEPEALLFQTGYTTLQGVEEYLPGQRIYWLGY
PNKEVQISLNNALLPALGIEGQKVLTHRIRLLELLRANNFAGLQQLFTSF
FASIPHDWYRNNPIAQYEGYYASVFYSHFAALGLDIVVEDTTHHGRIDMA
VTFNANVYLFEFKVVELVPEGHALEQLKTKGYAEKYKIRNEPIYLIGAEF
SKDSRSVVAFDVELFA
>NE2325 possible transmembrane protein
MNPIYSSMNRRQQGVSLPGLLTWSVIIILVAILGMRLVPVYIEFAAIKRA
LVAIASDSELHNAGVHEIRQAFNKRAAVDAIKSVNGNDIVIRKQDGQLVL
DINYTVTKPLFANLSLLIDFDAASDR
>NE2090 conserved hypothetical protein
MYYFLHREISMNTLPETILQQARSLQEGGILSPREFLHLGSRSAVDQAFS
RLAKAGRLLRVARGTYAIPVSSRFGSRAPAPEKVIRALAEQSGEIVVPHG
ASAANVLGLTQQVPIREVYLTSGRTRKLKLGRSEVLIKHAPRWMLALGTR
PAGAAVRALAWIGPTHAGKSLASLRRILPLPEWQALISARATLPGWMAQA
IGEEAARG
>NE0891 hypothetical protein
MDTVRKLIVCAILGVFYVPVVSAYEFHDPYPNVKVIDGEVHIQRPGGGTS
LAYPNAGISKDIPVNTSKGQFLVPVEKTFPVEPSKVGKAATRFLKTLPAI
GTAIALYDTVCDLTDICRNSQSGEIEYAPDMPAGYPVTTETGYWRHPFYV
SLTHVTADLLCKSFDYRAAVHFAPNNLTFLRVEVSGGTSYCIYSDKTNNP
PTETAPPNYSIVKVNSGNCFTGYTKVGNECVHNNAPVPVTETHWTDAETK
LNAQPQQTAEALYNSDAPVPVLASTQSAPVIQQIAQTSTQTKDAQGNITG
TQVATTSVKVEDTSTTNNVTYNVTEVTTITTYNENNEITNTQTSTSDNSP
PKTGTDETTVSFDDVPPAQLEEEQPEFNLQTPESWGEGTCPPDEILTVQG
VTFPVSWQPACDTAVQLRPIFVLFASVAAMFIVAGISRAGT
>NE1565 hypothetical protein
MKALFFLRHYNDIDHITPVISKWSESGHESLVVLLGRPKFLKDYRIKFLS
TLDRVRVAPIRRLLSPLKFMQWRLQTLLLNRSVKRLFLIGKLIEKLARKY
DAQKRTAVWQSTAGRLLEHGFSDGNEGGVVVFDWITSDSPVPIEWVEIIV
TMARTMGLGAVSLPHGDSPHASQLIRHHEWVLKPDALYSAARIFDKLVVP
NELCATRFRPFLSNEAIAVLGSPRYCDEWLDKLATLSPAPRLKTNQDTRL
RIVMFLRKSEFTTFWEEVGEIIGMIATFPGVELVIKPHTRGGWRQPLTGS
ASLRQLANVRVAEDSEHSISLMNWADIIIDLATSVVFEAVKAKKPVMAAD
YLHAGRSALAHFMPETELKCRDDVYTMIDRFLTAGYDSYYVEAHRQRFIE
EMLHVGGADVLPRYVALLEEQTMRKKPDQANNTNTPE
>NE0784 putative (AJ245540) NrfJ [Wolinella succinogenes]
MFRTVSAVILAILLMSFNLSAARAEGASGVETMPANEGVVVSSIDAAGYT
YMELANGGKKFWIAAPTTKVSNGEHIRFVESMRMHNFTSKTLNRTFSELI
FVTSTQAKVEK
>NE2077 hypothetical protein
MISNPVNSAVIVSATPTDAISQTRPVSAVTPVPDATQSDTPAFILGQKYR
AQIGERLTNGHSLVNVAGRWLQMRMPASANPGNILELTLIEQSPRLKFLL
HSGTQGGNNPTTLSPAGRLIAQLLSQPAPPAMKTANEAAPLLPIPPATGR
ERIQLPAQLQQALSASGLFYEAHLVQWLSGNRSLQQLRQEPQGKLPAPAT
VSTTITDSATASPVASQAVSLIQQQLHTLETGTIQWRGEIWPGQTMEWDI
TEYPDDQGKEQADNEKTGKSGRWQTRIHLQLPNLGKITATIMIEPQGMRI
RLDADSDEITRQLRKEQITLASAMQTTGLTIRAMDIQQHEAT
>NE1592 hypothetical protein
MKVLQSDKAIMMNRKLTELPIDERIQLVEDLWDSIASDQKMLRLTTEQKA
ELDRRLNAYEVDKNPGRSALEAIAEIRRNL
>NE0488 hypothetical protein
MIRSRCDRWPSTKPGQLRRIALSATLPRKERRGIERITMQRQSEQEQLFS
IFIVFTRPLGRGDPENLNKVAYLNIAVTDC
>NE0494 hypothetical protein
MKTSFKRPFAQYVKKATKPLRLAIEDEVEMICETPEIGELKAGDLADVRV
YKFRFNQQEYLIAYRSPTRNTPVEFMIIDFYQIGTHENFYDKLKQYLRHD
KNPREI
>NE0166 conserved hypothetical protein
MKKLYTANHLLEAHIVRDLLENAYIPTRLFNEYAQGGMGEISFTHTYPEV
WVMRDLDFERGRKIIAAYEQAPQVTDIVFCLQCGEENPGNFQLCWQCGSG
LEVAREKS
>NE1244 hypothetical protein
MVSVEGTTLGEVAIMKQSNVIKLFSIVLGLWLLFNPIGTQAETAKYETIK
TEYQYAAKVACSLLLPHQDGTLAKGIYRTIINIHNPASKKITVAAKVALS
TQMGSEPGPFNVTPFKGITLQPDGAVGVNCFDIAGYFCPINGVCVDFAFL
EGFLVVKSPVPLDVVGVYTARPVEGEVQSIDVETVQSKRIHDIVKLGTTE
LPGRGEGKRVDYPPKGSAAYDGQKPKQMCGGIAGFPCPEGMKCVDDPSDD
CDPAKGGADCAGICVK
>NE1993 hypothetical protein
MRQQADNVYLIWYQPGTGRPVQQSTPARTIGYIPSAQAEENYHLTQTGRT
PMTVTSTKHPHNSGGQFRTVAIWEKARIYYNPPLPLVIEDRGHNLA
>NE0788 conserved hypothetical protein
MEIFMRTSIRKGLVALAIWVPVGMSYGASGSELLEGKDIYLPHAERATLE
ELDNATGREGVDITTLNRMNVRAFLADNSATNNVSGFNSIDNGSFVGASG
MFSVIQNSGNNVLIQDSTIVNVTILP
>NE1794 hypothetical protein
MLNVYLTVDTELWPYSDGWPVRALSPYKIAFDEEIAACFYGKTSEGEFGL
PYQIERFNQYGLKATYFLEPLFADRIGSNHLADIVDLIQRNDQEVQLHLH
TEWLSEIYDPTIPVHFKQYMHQFTLDEQVTLIAKGIRSLQAAGVKELHAF
RAGGYGANRDTLRAVAQNKLLFDSSYNSCYLGEDCKIDLNEQLLQPCKIE
GVWEFPISFFQDYPNHWRHVQLAACSTKEMETVLLNAWRQGWFSFVIVLH
SFELVKGRSIGKLSLPDKLNISRFNHLCKFLSDHPDKFRTTLFSELDPIT
IPEIRPQKILYSRLHHTIKRYAEQIHSRFF
>NE1679 conserved hypothetical protein
MRRIFAKVLMLSAVSFMTSNLVQAADPEVIGEFDDWIAYVYTEDSSKVCY
MVGKPKKEEGNYTKRGAVYALVTHRPAEKSKNVFSFVAGYPYKQSSEVTV
SIGNQRFKLFTQNETAWAPDSAIDNKLVAAIRGGSQMVVSGTSSQGTATT
DTFGLKGSTAAYTAISKECGIK
>NE0242 hypothetical protein
MKYDEKLIVFIVTIGVLNNPVYKKRLWLAARNMNISRREWPNVIADAIYN
FDGIILLSNGIRWPIPDVDKVLGDPRWFSYYFEEDEKGDPHRDVIMLERL
RLIDLFFKIKHPEIARHFSK
>NE0081 possible transmembrane protein
MSGLFSALSRASANLLNPRMLWLWSWPMLVSAIFWWLIGMFFWTPLSGWV
LTVIPADTLQNWLESSRLQVIADSVESIINVIIFVTLAITTSLVITALVT
MPALVNFVAKRYYPDLARMQGGTITGSLRNVISAITIFFILWIITIPLWF
TGIGLLAPLLAAAYLNQRLFFYDALSEHANSSELDKLSSIDRSMRWSLGF
LTGLLQFIPFLNFFAPTLTALAFTHFELGRLAKLRHTAAA
>NE1246 hypothetical protein
MILKSYSTVLFCCFLIAFDSNRYRMHLRFAHSCNSRFQYQLDLFNCFIAD
LTEVIFCVPICDINVDCAQMPGQKHGNKESHMAMTMQYLLTISK
>NE0890 hypothetical protein
MKNFKQRLTNAAYASPLVLFAASARADLPEDVTTAITAAKADIAAAGALV
ITIVVGIKVWKWITRVF
>NE0545 hypothetical protein
MIFRKNQCNHQTTFYQIRKVDDLIGCVYSFAAMGKRYMTDNFILLILFVS
LLTNAVAWSFHQEIFKHELDHFHVSHRFDHHHSYHDHAADTGFHQHHDET
LDNDPDFTDHLILHAAGQFQPFYFILLPIIPSLPGKENIPGFFPAGIPES
TLDLPFRPPRNTASLEIRY
>NE0243 hypothetical protein
MATAKTKIKDKKQPLALQLVESGTADSARKPPDTENTGADVFFRRLVHHN
DAEIAGSMATEQATHQADQQFGAELTKLRLDSEDIDVAFKVSLKFLSDLE
VKLANTRRYIKSGTLHSIGGKSVENVGWTDWRRKDQILLCVLVFCLTIAA
GLGMGNVYANLVSSGNAVFIEKPWLATMISALMPIASVSVKYVTNFMIYD
SSRRLYAKCIYAATGMAFLFWGGLFGLTYSGVASSIDWDSFGESTDYGFA
FVWSQLLVELLMASALFLAAEDIYMRYSPDVYIENLEYLELEKALKEQRT
VHEALREKRGELHGRLVELEARREAFINDKVMEFVSLRARHVATMNAHTD
H
>NE2033 conserved hypothetical protein
MSEESSRLGQCDEGKASWKKWGPYLSERQWGTVREDYSDNGDVWNHFPHS
QSGARAYRWGEDGLAGISDDHQLLCFALTLWNGRDPVLKERLFGVTNLQG
NHGEDVKEYYFYLDATPTHSYLQYLYKYPQAAYPYEDLVTTSERRSRQEA
EYELLDTGIFDENRYFDIFVEYAKADPEDILIRISAVNRGPETADLRILP
TLWFRNTWSWAPGLAKPALYQEENQDDCRIIHTQQNESGDYRLYCADAPT
LLFCENETNTCRLFHTDNASSYTKDGINNHIVCGQKDAINPANHGTKAAA
DYALNIPPGETRVLRLRLRRAESSAPATDKIFAGFDTLIDQRKKEADDFY
ASLSGGRLNKEQQRILRQALAGMIWTKQYYEFDVERWLNEHPRHNARNAG
WAHMKCRDIISMPDKWEYPWFAVWDTAFHTLPLAMIDPAFAKQQLGLFLE
NRYQHPNGQIPAYEWNFSDVNPPVHAWAVYMVYQVCQDYHDQNDLSFLKS
AFASLERNFSWWETHREPDKNVYEGGFLGLDNIGVFDRSVELPTGGHLEQ
SDATAWMTLFSQNMLQIALELSLHDPDYEQRVLSYLNRFMATAAAMQDIS
DEHRDMWDDEDGFFYNVLRFPDGHSTRIKVRSLVGLLPLCAVTVIERTTL
DKLPLVAEHFENLVRRRQFLADHIFCPTTPGVEGRRLLAIVDEEKLRRIL
SKMLDEQEFLSPYGIRSLSRYHLEHPYQFHWNGQTFTADYQPGESTSNMF
GGNSNWRGPVWVPINILIIRALLTLYAYYGEDFQVECPTGSGKQYNLFRI
ARMIAGRLLHIFLPDEQGRRPVFGNTEKFQIDPHWRDNLLFYEYFHGENG
SGLGASHQTGWTGALAALLTIFGSLEQEELTELGMQEISAILAGNNGI
>NE2513 putative (AF322013) ID483 [Bradyrhizobium japonicum]
MAKPVVIGSRSFRTQSSALDHYKALLHRYQDGQRIADPADHTDLVALIER
FDPVLDAVGEPAKGAGQIAHFERRLNTGIGWSTPGFWVVRQDGTETDFSY
IDAVKGRPKGRSQDFYNACRQAVALDLVLAKKQAFAQYGDDQGRVECELT
GKMVTIDDAHLDHAWPYFSHMVSGFRAARGWSRDIPDGIVSTPADGQTTA
TFLDSAVTEAFRAFHHDQAVLRVLSREANLQTASSARRPKVARPVRLA
>NE1576 hypothetical protein
MLHARSVRKDIQPVIARHIGDPEWITGQIEVKESVRINACYTRYRHQRFP
NLQLNFSTDHKTLLPENPEPSRFNKRAIPWFTSRTT
>NE0475 Helix-turn-helix protein, CopG family
MAQITARLPDDLVSSLDAAAARLRRSRAEVVRQAVEYYLEDFEDISQAID
ILRDPADPILDWEEVKRDLLHLD
>NE2007 hypothetical protein
MDFTIKAIALLTIGHRLHQFVMYQPCCKIAHTQLTLERQGRQTDLGLTNQ
INYQEPDGQRQFGALENCSGKSMRSDADRPCIEKPCVNQIL
>NE1083 hypothetical protein
MSVPLAALAAFTLAYAGMTGLSLAMPRHYEQVAGQRVLPSGRRHFFRILG
WLLLILAVVPCIQAWGTAVGVVVWFGFLTAGGLLIILMLPYLPRLAALAA
AGTTIAGVLILLVT
>NE0116 hypothetical protein
MTIWLVTRHPGAIEWVARQGIQWDKHAAHLDPCEITAGDTVIGSLPINLA
AEICNRGARYFNLSLNLPAHLRGRELDAATLTACEARLEEYIVKKVNS
>NE2094 hypothetical protein
MFDVNYEKQAQDYYSKAPIIILGSGASATHGMPGMRGLAQHLTDKTDVSG
LSDAEMEPWRSFCRTLTDGVDLESALRQVAVSEELTCRIINSTWSLINSE
DAAIFKNSLQNSSMFPLSRLLEHMFKTSLKKINIVTTNYDRLAEYACDQS
RIHHYTGFTHGFFRQLATPDELTCSRRVNIWKVHGSLDWFQSPLEDTIAI
SGAQEIPENYSPQIVTPGTQKYQKTHLEPFRSIINNADIAINEAGSYLCI
GYGFNDEHVQPKLMAKCQRQGAPVTIITYALSDSTKKLILGGKAQNYLAI
ERGATDGQSVVYSSLSSSSFTVEKNIWSLEGYLSLIM
>NE2517 hypothetical protein
MPISESQLETWSHQGSITQSSTTYNTIKSVLEASTTPYASKNFKVFLQGS
YGNDTNIYAESDVDIVIRLDDCFHSDLESLSDDEKSAYKQAFNDATYTHA
DFKRDVLSVLEGQYGSAVKAGDKAIAINASGSRRKSDVIVATQFRRYFKF
RSASDSEYVEGICFFNATGERIANYPKQHSANLTAKHQASSKWLKPMVRV
LKNMRSRMVEDGLIKAGIAPSYESPRVLRRLQLLREWSHEQSNKVFP
>NE1481 conserved hypothetical protein
MLHNRQSISAPKLVVRPHVPWYRRLLMSFVGLLLIALLAYGMYVIGQSTA
QPAGNITVTADPVLEQILESNSCLEKYDTALCSQLAELVRQLQIGNATRA
DLVKQVKSLDEENERLREDLTLFQQMISGNEESSNVELIIHRFSLEAGQL
PGEYLYTLLLAQGGQRLKEFSGKLEFVVGLLQNGEEKFISLVDENASKEF
PINFRFYHRLEKSFQIPADTVVKSLQVFIYENGSSKAVLTKTIQLPLKES
EHVRKKT
>NE0130 conserved hypothetical protein
MITTHIQNFSVLFEALSLAMNQTEFSIKTRSKQMLKWAIIFAIISFISGV
FGFRSTSAGTASIAKFLFFLFALITLVLLVLGLLGIGVVA
>NE2506 conserved hypothetical protein
MARGRNSALMDSCQLVRTETKMEVAHLNQKQLAARWSISEATLERWRSAG
IGPKFLKLCGRVLYRQADIEAYEESCLATSTKTVVAQVSVS
>NE0819 conserved hypothetical protein
MPTVDQSFPSFFNDAPTVTLQDPLARFLGAAHDGIMEYQYVDAVRLAGHS
CPTVAGTWLMTVHGLRALYGDALPVRGEIEVYMADARDAGTTGVMATVAQ
LVTGAAPETGFQGIGGRFGRNDLLHFDQPMQGSIGLRRKDTGAAVQVELD
ASVVPWPDEMRVLLPKAVSGQASTAELQRFGELWQERVCKMLVDHADDVN
LVRVSNWAVD
>NE2509 hypothetical protein
MRSCELRTRHQRAGTLAGTAPARDSCDHPAGEPGPPWPTRRATTACAQTL
SRRRPIMRELDKELKDLRLYGMAGAWEDLVKQGGHATLESSRWFLKANCS
RPRGGWSSGCGRACTSSTAAARSA
>NE2430 hypothetical protein
MLDLIAIVVLVSGLYLWLRKPKTITASASNAEEAVLPLDDDHTAPNNTMV
YINENKDR
>NE0007 hypothetical protein
MKTSITQVVFLILFCVLPQTTMAQRNMPQSYPVAASEKLVNGIANAVTGV
IELPKTVILTSRRDGPAYGLTVGLVTGIMHTIGRTVFGVLDAATFFIPTQ
PTVRPPYIWQDFDKETTYG
>NE2038 Myeloperoxidase, thyroid peroxidase, cyclooxygenase catalytic domain
MTWHGSNKSGGYNPPKSISYDQGKFGRMFPSLPPFAQDTRQIRDALKELG
RKGGIMDAKEDTDIAVNPNLARDLIIDPALSLINPNNPNLVAGMTFLGQF
LDHDITFDPVSNLERQSDPESIRNFRRPLFELDSMYGSGPSASPYLYDQS
ADGEGIKFYVEEISGAAAVSAGGFVRYDLPRNSQGTALLGDPRNDENLMV
SQLHLAMLRFHNAVVDYVKAQSSLTDPDEVFTEAQRLVRWHYQWIIIHEY
LVRTVGKPLVDNILINGRKFYKWHNQPFIPIEFSAAAYRFGHSQVRPSYR
SNFGPIPSDINSQIFRLIFNDNLADEPDPDDLRGGKRAPSRFIDWQTFFD
FGDGKVRPSKKIDTKLSTTLFDLPAVRGDIQSLAQLNLLRGLTFSLPSGQ
SVAKAMNLPILNTTDLADLVDFKLHQRTPLWFYILREAEVKENGERLGPV
GGRIIAEVFLGLLQGDSMSYLRQDPRWIPTLPSTVEGTFRMADLLRFAGV
VAPL
>NE1605 putative prolin-rich transmembrane protein
MESDMAGTLYRKLILWGALAATVLAALLVDEGTELSVDDVVQPAVDISAD
RRTAGQTRQIRQTHETLPVDQLGKRKFSAKADDIFAVTSWEPKRTASTDF
NPQIFQPRKEEVVRRPSAPPLQFEYLGRVVSEGKIRVFLAQADQNYVAGA
GERIGTEYRIDRIREDTIELTYLPLGIRQTLTIDQGTFD
>NE1241 Tyrosinase
MAIRKDANTLTAAERAEFVAAIRVLKAEGIYDRFVLRHANANMSAIHRCS
AFLPWHRRFIYDLELELQRVSGNPNLGIPYWNWPSGSANASMWNDDLLGG
NGDAGGVVRTGPFRSGQWTVINSSGLPAGPLMRAFGQNGLPTLPTQAAIN
QVMAVTPYDTSPWNMNSNPSFRNQLEGWIGPNLHNRGHVWVGGSMLPMTS
PNDPVFFMHHCMVDKIWHEWQLRFPNQGYLPASGGPFGQNLTDPMGSTPS
GQVGSRPIDVLDSAALGIVYDDAAPQPQPQPEIPLIVVGADPIAAAIGVP
GETDVFRFEVPAFGAHTMYTLGSSDTFMTLFGPNDPNFEVASDDDAGEGF
NAQINRNLSAGTYFLRVRLYSPNSTGNYAVGVRAVSATPGPGPGPVPIPE
LIVNGVGIDASISAANESDVYRFNVTTGDFYTIQTNGTTDTFMSLHGPNS
QIPEIASNDDSGISFNALIRRQLSPGEYFVRVRHYSPSGTGAYSVRVTQG
>NE0313 hypothetical protein
MDREDFFLKIAELHLKRVEILQTVEWRITFSLWTFVAGVAMVSLANADKM
KQAAVAMGGILGPVVILSLMGGIYVWLWYLYLYKFCKKNYNSLVTERNRY
QRMQNEAIKLVLKGKSADFLIEAGAEDKRVPESDFSEPSFKQLSGDDFRK
SGVWEFKRGITAALMFFSWLLVLMIVVPSASKHLNDSVVATGKPAGVEVE
SSGRNLRREADQHF
>NE0921 conserved hypothetical protein
MSYERFTVLKVPFPFTDRTAAKNRPALVLSDAATFNDPIGHSVLAMITSA
ANPAWPLDCLIDDLVSAGLPAPSVVRFKLFTLDHRLIRGELGRLAVSDSI
QVTRSLYQLFGMAAVR
>NE1936 hypothetical protein
MRAASCSMFSQILKLILRTGCDGLSRRLGQNLTVVRKAVKLCLVLDHNGY
LSAPASLSTGKVVEVKVDEIMVGCVKKPTKFGGLF
>NE1887 conserved hypothetical protein
MTDNLLKYGPLGLPEYSERRLLHTELNADVYELVNIPLQLSHLVLLSDRQ
WVNRERELIVQICEHFGIRMLNGAFDQLSVELGGFQLRWERHTEYSTYTF
YSEGPFEVPFAQPAIAHVPPEWLEKLPGEVLVATHIALEDRRRPSRSMSE
LSSLFSSNTVIGSKVSAGSASVWSDNQIHPDGFTRFLIHDDNLRSRQVGR
LVQRLLEIETYRMLAILPMTMTREIIPQLERYGDQLTELISTNIAPNSIE
DEQLLLVKLTALATEIERISAQSSHRFSASQTYHTIMQQRITELREERIE
GLQMLYEFRKQRVTSAMSTFDLVWSKLETLSLRVERATSMLRTRVDISME
SQIRDLLRSMDTRAYLQLRLQETVEGLSVVVLSYYLLGITGYGLKAAKAA
GLNIDIELMTGIAIPVIVTIVFFAIRRFRRIVSKSAFGENKGGE
>NE0799 conserved hypothetical protein
MKISPSRQVKGEIEIIPVSGFRGIGKFIDVPWRLYADDPLWVPPLRLERR
LHLSRFNPYFRHAQWQGWIACRDNQPVGRISAQIDELYQQRYGTDTGHFG
MLESIQDEAVFSRLIQVAESWLVERGVRQISGPFNFSINQECGLLVQGFD
TPPVFMMPYSPEWYTSLLEQNGYQPCKDLLAYWLVTDFDPPPAMQAIDRK
YRHQIRIRPLQRNRFNEEIETMRDIFNDAWSDNWGFVPFTQEEFAELGSS
LRWLVPDEFIQIAEIDGRPVAFMAVLPNLNEVLPALNGKLLPLGWLHLIN
KLKSASITTGRVPLMGVRKQFHHTLVGIALAFKVIDAPRKMVKSRGIGHV
ELSWILEDNQSMRAILEKIGGREYKRYRIYDKTLA
>NE0805 hypothetical protein
MGTADLPNILDLLIQEIAIQDARPVHPAFQVFTDALRQRFGEALDAVVLY
GSCLHTSDLTEGIADFYVLVSDYRLAYSGRLLAGLNAWLPPNVFYLEVPA
AAGVMRAKYAVISTADFERGARQWFHPYIWARFAQPARLLYARDDQTGKR
VHTAQASAVLKFISTTLPVLESGPSDLEMIWASGLMLTYAAELRAEREAR
ARHLVRIDPEIYSRLTAAAMPALIPLLSLQADGRYHIGPITPLKRLSARI
HWRLRRWQGRVLSVLRLSKATMTFRDCLDYAAWKIERHTGIKVEITPMLR
RHPILWGYKVMWQLLRRGVLR
>NE1297 hypothetical protein
MGTHDREGLLTRYRISFLVNIILLVIILAYLFKLSQTGIPVKVVSYEKQD
LTVKNLETLTRNELDSERTEGVLVIKKKDGEQADLVFYGKGLAGMTAGAG
SSIKELMLQLGSGSTDAYAGDDNCRSVLVFRGQVYCIPW
>NE0392 conserved hypothetical protein
MEQITRWLMIAGAALLVIGVVLHFAPWLFNWFGKLPGDIRIETRHSKIFI
PITSMLIVSIVLSVIINLFKK
>NE1636 hypothetical protein
MIWVNNLISGFDGIYWYPNSSLLRIEEWDGEFTVFQPESGKTHFLNEMGL
RILTVLDRSPATLEAICQELSAYFSLQLDAQFPGQIIRTLQRYEALGLIT
RVKENE
>NE0655 hypothetical protein
MTLVTRCPVCHAVFRLTGIQLHSCNGDVRCGQCRQVFNGFVALIVVPETC
IQPAARSAESAPDYLESGNVAVVPVAESSFPADHFGVQLSTRKTSRWWLI
PNALLLLLLLGQFVHAYRTEIFIAFPAFQPALDSYCDLMQCEIDLPRHLH
LLSLESSDLRVSSPAEPDVVALSAIIRNHAPFPQALPALLLTLTDSDEKP
LASRIFTAEDYLDSVTDQSVLGGDSEIQVQCFLNTSSLDAVGYKLELIYP
>NE0183 possible (U92432) ORF4 [Nitrosospira sp. NpAV]
MNKVLRNSGIAGLCLALSIFLLSAPASAQLMLAHEGHHDAGGCKIEGGDF
PVTVSVYEVPEGNIPPMHSYCNHLPDAGKINMTVELSDSQTREVPIAVRV
LMEGHENSDHGAHEVLYMPAEKYSSGIIVVATNLEHLGQYTVQLETEDSA
GQVKTAVKIPLHVGGGGGHDHGSNFGMLEMILLAVVGGVGTFIFMRSKKA
ANA
>NE1180 hypothetical protein
MINTITILTILCLSHNQPKNFVMILFFNNLATRRCVPADFLSGIHDHSYQ
IKHSFTLNIGITGQVGAILQQQCLDAIRIAN
>NE0114 hypothetical protein
MASLQSSKIPIQRRFPLIGNFPVSHFRHHCEAARGNSVSLYCLTGLLRCI
YLAIIVQALEDSVFTAINLLTEYLPGKLFHYARAAPTP
>NE1050 hypothetical protein
MNYSKPQKVEFATFPESKRNLLRFPGSGEIPGNDKIKASPTLAPLSKHSA
NRFHSGVILFSARKMKNMATFYAPAIILKLKMYT
>NE2265 hypothetical protein
MDFKSGIKHGVAITGIILGLSITPAQAGVAGSMVLDITGGCFSYGANGAT
GCGISDGSPDAAGAKEYATGSFTFSNTLSGISNPAAYAYQASVSLYAEAP
PDNPVISFYDTRSKSFATLADLQSDPLWNTAYAFVTAVLANTNGSFTATI
PTPPAPPGTTVEASWNYTLSNLTPGPGSTPAYATGEFEAWSKDDLNGLAL
ILFGPEQPLPTSPVNFSLTVALSAIPEPATIALIGLGILGMGAAQRRKTP
AALPV
>NE0179 hypothetical protein
MQNDNHSTAQPEKDVARLTEEVREAIAHGGDIENAIRNLTLKAMHSNGLD
IESLKQIATAVMKGVQEGAQQKMTHAAEQSHAAQSQITQAVVGLDTAFAQ
LAGASKLALEEAASKAKQFSDSELTKAQADLKDLESVFLDTLKHTATAAQ
GLIAETLRDMLSHAQHNGTAVGMQLKDTLAVFAHQMASTGRAQFEAGVKL
TQTTADLLYKISTGVLSGITSQTNRDDK
>NE0270 hypothetical protein
MPEQITRKASPFCLRLTPEERTLLEREAAGLPLGEYIRQQVFDENRVKRR
SRNKQPVKDHRLLSQLLGELGRSRLANNLNQLARAANCGLLNLTPEVKTS
LLNACADIRHIRETLMKSLGLNR
>NE1937 conserved hypothetical protein
MSVKNIGNYVIVLALIAIVTSCNFGSSDWRTASRARTGLAPDPAETPEAV
IQVYAARAYSWRGIFGVHTWFAVKPSHAESFTVYEVAGWYARWGGSVVAI
HEQAPDKRWFGNAPMLLAEKRGEGVDELIKRIDKTVQTYPYSKEYTIWPG
PNSNTFTAWLSRAVPEIGLDLPPTAIGKDYLGNSMTATAPSGSGWQLSVL
GLFGIIVSDVEGFEINILGLTFGIKPDPLAIKLPLIGRIDLSV
>NE1279 hypothetical protein
MKSGLQCESPGRYFSYLSYICLVQLLSLHFESMKGESFMRNETQTTHSHS
KHEQHCQHVYETGQLRRAKMHVARLTGNFDSRKILSPSLLQLLETSIILE
TTDSKVLASRLKRKPAAIRADLQKICHLLAEEPRL
>NE2531 possible signal peptide
MCDRAPGPARVRRAGELLLQRRQPRVLPIHAGHETQCGGLRRGLRRTAAL
ALLPGQRLRAAHQRAAVRRAGYSARSRIPAVHRGGVVSVIPWPYRLLTLA
ALSVALVGFGWIKGASHVQAQWDAAIQQQALQAAAVRERQAQATVKVVTE
YVDRVRIVREKGETIIKEVLVYVPVQADSACTINRGFVRLHDAAAAGELP
EPARDADAAATGIALSAVAGTVAANYQTCHENAEQLTALQAWVREMKVAG
EQ
>NE0776 hypothetical protein
MSGEKSGRNFIFYFYPSLFIAANALFIGMVLFLFYASMQPPHLLEDTVNT
SPYRQIASWFDPIFFQYNLVLLVFAVGIIPLITLCYTSSMREEKKRRLQR
ELPPAIYSANSNYIQNYLSKISSIRSYLGSMMSLMFVVMFGCMIILLLKP
APLALPDLAYANGVDYSKGANFLMLGTYMKSYMVGNRDYINVLVYTLTAF
QFGFLGGYVYFIGLMVRSYFTLDMTPNIFINSSVRMITGSLLAMVLSYFL
IDPDKFSEPDAILIRSLPVWSFFIGHFPDRGLVFLENIATKALGLVRIHE
FASPLSDLPGINYNQEILLKREGYDNIENLANANALDIALRTGFSYQQLV
QWISQARLHGHLRNDYHAFVNCTGIVSLDDFVHFYRTVKLQNSAADPIEL
IIASLKNEKHDLDDKIRILRYLADPRDLATDIPHSMRDENSAADQETISN
KS
>NE0570 possible lipase
MNDEVKQRLQSTETWIRVLFMLLFMFIQGSVKFLIVLLALFQLGSTVLTG
QANTRLLKLGRQLAMYDYQISLFLTFNSEQRPFPFSSWPSDTDNRTSDNA
DNRTPNENPEKTSWFQ
>NE0503 hypothetical protein
MKRTYCTIMLTGALSFGLSTACTASGIHKLVDERGRVIFTNDPAKNTRQI
QSSKSVSVVPSRRNGTSTEPITVAITGSNYPRVSKLQQDQRDSKRRQILS
QELANETRLLEDALKTIDLTQQKTDNYLPGRPYFTSDHFDILQLRNQAAA
HERNIEALKMELNNL
>NE0998 hypothetical protein
MTLKRTIKEFATYLGDRESILDRDYPRVAGQIELLWGYVEFYRYLEKLLI
TEKGRDRSGFPFEAVLELDKLKEIHERLYP
>NE1606 possible transmembrane protein
MARIALENWLVRARWQVTRLGTVGRAGAGLLVLTLVFFIAAVMPQKERLK
ELKSKVQVMQQAQPDSAGQTKLNNNQALQVFYDFLPRSDSSPYWISELDR
IAKDSGVELNSSDCRLKVEKESKLVRYEIQLPLRGTYPQIRAFIASALQA
VPALALADIIIRRETIQAGRVDARLNMHLYLNDY
>NE1379 hypothetical protein
MLPELLAMAQLLVSWLLIIGGWLFVHRATLSRERRKEKREEINNTIQEIR
AIENIAIDFHNSKIFDEKAASSLTLRINRLNRKLQAPPTFNELKIPTQLM
IEFRKTITLEHFDKSNFPSMVQRIMKGNPYPATSIEILIRDINSATDDLV
DCIEAEKNNKL
>NE2540 putative bacteriophage related protein
MPMAEIRALWQKLVGGDTPTHNRQFLERRIAYRLQELEFRKADANLLDRN
QRRIESLVETGKVKKRDRDYRPAAGTVLVREYKGVEYRVIATADGQYDFQ
GRIYPSLSMIAREITGMRWSGPLFFGLKPPSNAKAKPSPKKRGGR
>NE0113 hypothetical protein
MPEPLQPHEYPHRILLCVTGLSPQIVTETLYALAVARATPFIPTEIHLLT
TTDGARLARAALLHPDGGHFHALLNDQPQIGLPRFDEDCIHIISHHQEKL
ADIRTPAENAAAADTITALVAQLTEDADAALHVSIAGGRKTMGFYLGYAF
SLFARPQDNLSHVLVSSPFEGHPDFFYPPRQPRRLVTRDGHHIDTAEAIV
TLAEIPVVRLRHGLPATLIAGRAGFSETVVTLQQSFAPPCLLIDLEQRNV
VCGTTAVAMKPQLLAWLAWWATLARQGRPETTWREADARLFLDIYRTVVG
IDAIDYEKTAELLGNGMEKEFFQTKNAKLERVLKDTLGPAAAPYLLTTTG
KRPHTRRGLTLPPERIRIVGTGSK
>NE0895 hypothetical protein
MLVPPVRKQRRKCIMSNQKNRYGGLFCALINSSFSSWLIKAMERLGFKQF
WINFFLWLPCRLAMLEIKLEYGSLENFVNRDSD
>NE2542 Bacterial regulatory protein, LacI family
MSDIRIQKTGEPDILQTSDGRLTLSVPIQIKRRSGRKLVTLPNGETAPVR
PWDVAPTSIQLALARGHRWLAMLESGEAKSLKEIATREGIDNSYVSRMVN
LTTLAPDIVAAILDDALPNHVTLFDLAVDPPALWDEQRRKVWDTSFSTSR
LMQDA
>NE1337 hypothetical protein
MIGRNRYAMLTVDTEALPKRAVQDHIKRLMWGEHDGGCAGIREMCAIGNE
CGVKQVFFVDMCGAYACLDQTLDVVRWLDQDGQDVQLHAHPEYLPEQFWK
EHGFKYRPRFLNQYGIEKATFTIKYFGKLISDLTGKPLRAFRAGSFRWNA
DTLRALQEAGVSLSFNNSMNARLKGQCTYSEPTNHSYLWSNGVIEVPVTE
RKFFPLFGKEWWGRLQFPVGDWLGSPPWRVLRPYTVGADPSFLVVLLHSW
SLLYWDKDGYAVYRDDKRIEDYRKLVRRLARDYDIITTADFLDLYACGKI
KTTHTADLSLAEFKAVKK
>NE1629 hypothetical protein
MKNTQPYSWWLLPCLLTTLPGCLSPITLHHAVSAYDDAITSTISRQLLTN
IARARHHQPIHFTGVSNVAATFDFRFSAGATPALGGLAGTTLMPLFGGSV
AENPTISIVPIEGEEFTRRLLTPFQQNKFMLLLRQRFDIDLLLRLMAQEV
RIQESTSQTTYRNTPSDMTGYETFRKVVLHLSAIQDRDQLYAEPLNLEYD
WTLPAAAVSAEGFHTLAKEFVVHHDRQNDLFILHQKKQGPILITNYDPGI
LSEKERAQLSKEAEGWEPNDVAFDIRPGGMGGEWPMKGIFRLRSFHAIIS
ALGRSLSDEPEYHVEKDLRTLPVSRDENPVATMALLVTDTPSPNTDLSIR
SHGRHYAVDTQGQQARWNRDAFQMLYLLFQMTVTDLPRTGAPGITIAK
>NE1340 hypothetical protein
MVREEELSVIISDIKAKIEWYDITFAEDLYRVADRNAPKLKKLPVQVKHI
ELQAIANRVFCGRMLFHYSSLRHRILQGKN
>NE0648 hypothetical protein
MGALAFYTFIYFVGHFAALGLNIITNKKLLRHRWAGLAGVIIVAIMHGYK
IISTTPPSGHDDDTLYALSYFVIFPVVVISAVLFYLSEKDKKDGGSK
>NE0510 hypothetical protein
MILRSWARGVTLATICMFSLSSGVILAKNNRPTPLPTYQYPDVYNNYLQP
VATAIDYHSGMVYLFNYENDKLIIVDPTSLSGWPGNVPLQHTLVFPEGDK
IFITSDNTEEHAAYIIILKVNDINWDAGTVSLAVETVVAADNPGTPTEFP
FVEPVNNVQAIPNWLVGRGTTQIHGPTILPYSDFVYLTELTSDRIRVINR
KTNEFVSGVDPIAIPGYTEQTHGINFNRSGTIGLGTGYFFDNSVIDVYKP
NRETGELQTIGQIRLGDEKRHAAFTHFVYWLDERYAVTASMQFDKTSLTP
TTTKKIIPPSVWLLDTLEGTATKILDHTNHANGKGIFRSPSDIAVVNGKL
YIAEEDSLDYTFANDGYISVFDLTDRYKPRFLKRLKPGRELPTGYAVAHT
ISPTPDNRYLIVASWVSGYVLKIDTETDTVVKIWGPNDGLIKPHGIYTAG
GLR
>NE1841 hypothetical protein
MFPAHPTIKEVDLKKIFTIVLGFVMTAAMADGVDQRQILPMNEMQRNHLL
SEMRMLLTGTGAILEALAQDDRAAAARHARSLGTDMPHKMEGHMDNILPE
QFMQLGMAMHQAFDRIAQDAESGKDTKHTLQQLSETLGRCTACHATYQIS
TGRQLAGQGSQKNHGEHGAHSHTY
>NE0016 hypothetical protein
MTYTFKLLTGLTLSVMLTGCMHTPVQPVDEPSEQAEIRIIQESGSDLSEL
MHYYDSLQNKSRVELWEEYKYANSHYRESTDMQQRLKLLILLLLPNTSFQ
SNRVALNLVEDLPEQAETTPDTTAFKNLLVLLLKRQRAANLQIQNLSEKL
RSAETEVKTLKNKINAIKIIEKDLMRNNTP
>NE0791 conserved hypothetical protein
MRKNISENNGKRPGSVKSRFLVSAFAVFLVAYSQTGRSSEEVLEQYKSLF
AQQQKEFEKQRQIIIEQGKEIEKLKSRLDSLITTQPTDRSPASNVAGKDG
QRPPQVSSPSTPKTVVAGPVGKKNDQVQTRTVPGNLPAGPVGQAPPKQDE
KPRPPEMPRLSDAVGGVLTRKGKIVVEPALEYAFTDSNRIFLDAFTFLPA
IAIGLIDVRQVDQHSLMASIGARYGVTDRLEVEARVPYRARFDEQRSRPV
SIGAGIDETFNASGNGLGDIEFAARYQLNSGAGGWPILVGNMRATVPTGK
GPFDIKYAQAQGVPGAVFPTEVPTGSGFFSFEPSVTALYATDPAVFFANL
AYNYNMGTTEKALDGSGDKFKVDPGYAVGMTFGMGFGINERSSFNIGYGH
RHIFNTKINNRTLKGSQLDIGQLLLGYAFKYSQQTTLNFSIAIGTTDDAQ
DVRLSFRVPMTF
>NE1135 hypothetical protein
MIEKYRIKKPPERAVRIFESYLSQFLLSFSALSTLSGMESRSIDMLGGYW
AGFGSPQAVKKVVTASAAANTSESFIFSSPIN
>NE0319 hypothetical protein
MRAWIEYRGGYVMNGRFCIWRSPEFRYKLMSVLFPGMILSSSCFGESEIG
QAPLGNRQILQSQFQEFRISSSLQYFPHSYFPVQPILIYPGVQWAYPCFP
FVSCMELQQYRRYKRREKRQQPKPVFGQGASLMDESMEDWRAGLRPAVEP
FRTDEHQIVPALRGHSLIRPEYREAGSILPRFSNGTE
>NE0349 hypothetical protein
MKGNALCTGEPVMTIETRGGGSEDIAVLPLDPTGNITRLFIRERCNDSSD
PECKTGQWRLGSVDISRENTPASTRYAWRPGNNASEDDFEPLGMSLVPGN
TPGEGTLFVIDIARPQSVRIWQLDISGGEITKATLATPADTQTGARLTAA
NSLQAVRNDDSRSFHLTITRFDEYGLLPFRPTPWPALVRINNGVIQPPPA
QDFRNANGIIRPCAGCDLVIASYWERRLRFVSKENGEIGEYASAELPIRP
DNLRLDGERILIAGQRRVDLTALNLLVSPHIPSPSGVYAIDTRSLGPDTV
PTLLWEGGWKHGHSVATAVALPGNRLAIGQINTPGILIADCSP
>NE1814 hypothetical protein
MSMLCSLYRITPEQVTKLKDFPDAIGELVGFTAPPPKVSFLSKLFGKPPK
QLSSSGQQFEPVAESDIFELNQAWHILHFLFSGTNAESPWPGGFLISGGE
EIGPDQGYGPIRLFDSELSRAVAGFLDTQSFKMLDSAYVASEIEATEIYW
KVSSEHTERQRQLEELWSMVKELQTFFEHTVRAGNATLLSIY
>NE0281 conserved hypothetical protein
METAATNLVRSLTPQEYGFTLIILILAALTGFYCFIRAWKRWHLIKDTPT
ARLRSAHQGRIELEGKGRSLPDQPVFAPLSNHECLWYHSRIERKETILEQ
KRTRTEWKILYRNTSNHPFLLDDGTGICQVDPEEAEIISNEKLVWYGNTE
WPVRTGILDNGSAIIGLASRYRYTEQLILPGQRLYITGHLQTRSPATERS
VRDIARDLLSDWKQDRRQLLERFDTNRDGEIDLAEWEIARETALSQAQTV
HRQLLHETEIHHVSTLKDGRYPFIISVRPQAELIRKYRRNALIALTGCFS
VAGCIIWLLHVHG
>NE1199 hypothetical protein
MVWVIGLIVLILLIVSAWFRKVAVSVIIVTGVVGSLIYVLNEREEERALS
RISLAELDFENVALKPSYSGYKLSGRIKNNSQEFTLKQVNLLIIMQDCTG
TPDSQDCVTIGESHENMDLNIPPGQARDFEKSLYFPGGNLKLLGKLEWNY
SVSGIKGE
>NE0255 hypothetical protein
MTYDTNLPSEDGIPDEGNAKKVARGTLQVIGGAVPLVGGLLSALAGAWSE
REQAKVNRFFEQWVRMLEDEIREKEATVLEVMARLDLHDEKIAARVESKD
FQSLVKKTFRDWAGVESEEKRVLIRNILSNAAASTLSSDDVVRMFIDWIG
QYSELHFQVIGAIYNSGGITRGAIWKKIGKGRVREDSADADLYKLLFRDL
STGGVIRQHRKTDYYGNFVAKSTQKKSPARSGGTKTLTSAFDEEDQYVLT
ELGQQFVHYAMTDLPLRIAYKL
>NE2152 hypothetical protein
MLNISILIFAIAALGGVFLASKVLAGKLAPWPVSIVHALLGAAGLVTLIL
VIMEGPENNRLTAALALLVVAALGGFYLASLHAKSAIAPKGVVFIHAGVA
VAGFLTLLSVLL
>NE0887 hypothetical protein
MLTESGITPYPALNQLSLQYSIFGFLLRNITKPGNHFAARCYLSALDDTS
SLPPGQIFNLLTVLDFYYQTERNQTERRAPARSEGRGSEGQKILYVPVTR
KVSLTGETFCSF
>NE0752 hypothetical protein
MISTMQAYALVNSEEWVFNTDIPSALSSFTEIIPLQSKVMLELDLIKELS
SNYKIENETHRALAGVFQTNYITQPREDYLLAFRLRQFCVYKLGKASKDG
LLKFLGIVAAGYALTPITGPLAWIAPGIAGWEFIKSIVLAYERIEDPDEK
MVFETIYILERRPIIVDYRAYEKEDFLNAYGHAWPNIDDLNNELNGKLTE
KELKKALVSLKARGIISNK
>NE2543 hypothetical protein
MADNDRQFSWRQGNVVTLEAAKALNLLAPECDDQHFAVVASHDCDLSASQ
DKEPCVEVVVGKRIDKLGGDSFGKTARRLHIEYQSEAGPVAIELLATSKR
PVAKHELFSTHPRQDIWLDGQGIGILQRWLASRYHRAAFPEAFEDRLRSA
NLPGKRTFLKRIEGILADGGDHIRALLFDLDEGKDVERDGPDDVYQLGIV
VLYDSLRDEPAAAQVAGKAAEALEELFEAAFHPKDSGWKNICLMYCDPIS
DSAITVAQREMLKQWRLEHMSLQEDPPQPMITP
>NE2145 hypothetical protein
MRKFAAFILFSVVGTSGWCGDTIRPGLWEVTTRSDLLGLIAHVPSEQMQQ
ITSLARQYGLKVPRIQEGAAISKVCITPEMAEQDIPSHFYENQSGCSVVN
ASRSGNRYQVELVCDNPRFKGNGHAEGIFSTPERFTGKTEFNSTVQGTPL
YVYAETSGRWIGAQCEPMR
>NE1226 conserved hypothetical protein
MRNQILKKATVSIMLLLCINVSVADTSTSVSCAESGSASAYSRSGNSVSY
SSVNCSSNNEQNAMTPAGIAVSKSYSPEPYTSLVLSGAFNTEIKTSSENR
VIISGDSNYVESVEVNSSDGELIIRRPGPGNDNLNVIVESISLQKLKISG
AGSTNIYGDFPDGLSVRKSGAGSINIEGQASTLKLNLSGAGNTTARDFTV
DNVEIDATGAGNIAVCAKKSVAGSLAGAVHFKVYCNPSQRSVNTRGVSRV
SYR
>NE1917 hypothetical protein
MSEQVYSGVDQDEDGGLTPLGRIVIDAWVFGILPESEMCTGWSMSQMQNL
YEEVYAAWGPYAHLPSRLPPELQQRHSFFYSQRITVAKNNGWDTDLSDES
>NE1277 hypothetical protein
MNTAHKWRFFRSGGFDQVRLETGADVESLGTLDPKLWAALSCPTSNLEFD
DKTLEFIDTDHDGHIRVPEIIAAVEWVSSVLKNPGDLTSGSEALQLSAID
DSTPEGAALLASARQILLNIGKKDEEVITVEDTADLNRIFASTRFNGDGI
IPATAASDAETRAVIEDMMKCVGSVQDRSGLPGVSAELIEQFFTEAKAYS
EWWQDAERDATSILPLGENTEAAKAAFDAVKAKIDDYFTRCKLAEFDQKA
GDPLNPALSDYEALTNTDLSSTTEQLATLPLAKIEAKKPLSLNEGINPAW
VGAIDALKRQVVQPLLADKEQLSADEWQALCDRFTAHQAWLDTKRGATVE
SLGINRIRSILAGRYQDEISALLQKDQSLASASDAIDSVEKLIRYQRNLF
QLLNNFVSFRDFYTAQNKAVFQAGSLYLDGRNCDFCLRVDDIEKHSSMAG
LSGIYLAYCECQRRGGSEKMNIAAAFTNGDADNLMVGRNGIFYDRRGQDW
DATIVKIVEHPISVRQAFWYPYKRIGKMIGEQIEKVASAREKSVQDQAAA
GIADISQKAEAGKPPAAAPFDVGKFAGIFAAIGLAIGAIGTAIASVVTGF
LGLAWWQMPLTVLGLILVISGPSVLIAFLKLRKRNLAPLLDGNGWAINTR
AIINIPFGISLTQMATLPAGAQRSLTDPYAEKKRPWKFYLFVLLLLGSIA
YLFHSGYLNQGTVDTLKKHFLSDKAEIGTEEASTAQEIPSASPAEEIADG
QKTEDDKPPQPAAKEGVENGSVASPEPVSTVVPAPKPVSVPASH
>NE2344 possible unsaturated glucuronyl hydrolase
MTSPVLIVEDRLLSTDEIVNTLEQMFRRMEMMDVLCGQNFPLYSSGENSD
WSVSPGGSWMGGFWAACWWLRAKMTGSAGDRQKAGEISQRLFRKLTADSG
YRSLIFWYGAALGEIWLQNAPARELTHSSIAALAHSFDPRLNCIPLGMAM
GGLTTGSCAISVDNFASLIQLLCFSREKQYHRIAQCHAETLLAACRGDKG
AFHAEASFDGHEFQVKDRAGVWSRGQAWAMLGLSRAAAQWGEPYLSQARA
ACTYWRDVHKGNLPRNRPDQTEDVKDPSAVVIASLAMLSLARLLPDETSW
CKYAHQQISTVLHSPYFTVINPDSAFGSAGLFQGCCYRTRQNREEIVESV
WGNFFLVAALAVLAGLIDPYDC
>NE0633 hypothetical protein
MLRLIPNKGLKKTINTLTVAEKARINHQDIEELLLRYQRSCPEDAARNIT
EVTNPNQEKHEGLRVFVIQTWMALMVANSYGLETYQTLKTFMDRQGYKTQ
PDSTYMATEQKD
>NE2176 conserved hypothetical protein
MKKVAIVQSNYIPWKGYFDMIAAVDEFILYDDMQYTRRDWRNRNQIKTPQ
GVQWLTVPVLVKGKYHQKIRETEIDGTDWAAAHWKALVQNYRRSPHFTEI
AAWLEPLYLAETFTHISQLNRRFIEAICNYLGIKTVIKNSWDYTLLDGKT
ERLADLCVQAGGTEYISGPAAKDYVDEQVFKENGIKLTWFDYIGYPEYQQ
LWGEFTHGVTILDLLLNCGKNAKRYMKYVE
>NE0544 putative ORF1 [Plasmid pTOM9]
METICMTEMKMSSPVEHGVLTCLCMKALFIAVIFGVTSDVTAHGVTEGDK
GYILESTGILPIPFIYLGAKHMMTGHDHLLFLLGVIFFLYRLKDICIYVT
LFAAGHVITLLSGVLFEVAVSPYLIDAIIGFSIVYKSLDNMGAFQRWFGF
QPDTKIATFIFGLFHGFGLATKILEYELPADGLLPNLIFFNIGVEIGQIL
ALAAILIAIAYWRKSGKFMNHAYNTNTFMMMLGFLLMGYQITGHFILFST
I
>NE1802 hypothetical protein
MKSFGCGNDNFSRLRMCLRVGCIVGLSSWGFQVSAAEWNIQPRLTVSETY
TDNVGLGGGGFGGFGGAGRGGEFITQINPGVSITGEGRRFKSNLSYTLNN
LIYAKNERFRIRNQLNTDATAEIIKHHFFVDGRATISQQNAFLFGPQAPD
NAVLTGNRRNIYMWNISPYVRQRFSNLASGEVRYVHGEVSSNANSFSNSS
SDAAIFSLNSGSAFRTLGWGVNYSHTQIDRKYARSNLGRLQTIELERTTG
TLRYIVTSQFSLIGTAGYERNSFISIRGRTSSPLWTVGFSWTPTKRTKID
ASGGKRFFGNTYAASVDHRTRSTVWNLSYVEDITTFGQQSLAGGSILSAS
MLGQLFSGIQGGDALLNQGLPLSFSDPNNFLTNRLFLLRRLQASLTLNGK
KNSLVFRGFSYSRKSFSSDEEDADLIGIENAALTRDTTQTGGNLLWNHRL
SPRTNANINLGYIRTSYDVTSQEDDNIIVTAGLNKRFTSNISGSIMYYHL
HRESNRNNGSYDANAITATLNMNF
>NE1077 conserved hypothetical protein
MPTIQSVRRTQSGRPGKRAINLSLSADVLDAARQLDINISQVCDTYLREV
VRHEQERRWREEHADFITAYNATIEAENLPLDEWRSF
>NE1999 conserved hypothetical protein
MNPGKPMPLVVHGWTIFAHPLFLAQIEVLIQQVEAHKQKDPVGFVKKNAS
KRLAAITKLAFDIILQDPARPEYRQGGTLGDDYKHWFRAKFFQQYRLFFR
YHTLSKVIVFAWVNDEDTRRAYESSDDAYRMFRKMLENGLPPDDWNQLLA
EARAEGQRLQQFAARWW
>NE1234 conserved hypothetical protein
MTPQSAFMIAATVRVGQLQDLRTLLASMNTIPGHADPDNDLIPFGKLDRL
HFARFVIIEAKTLQEIKEFGVKPRPWRPMLAFLGDIDGDMHTFLAELVER
AESGLTKIFSHCDDFSTGNQNLLEWMKMRNVSPGANYVNWVGRTVRQIHE
EAALHHSLSDCLQKIVAEVGRENIHTLRQKLLSHVEMEKYKGRLVLSPPE
PTPSEWRTRNLLHKIGVPLVLLLFSPLLLVIAPFFALWLRKRERSDPELF
IRPAYSHIEALSEQEDWDVSNQYSVFGDVKPGLFRLLTFKFILLLTDWFA
RHVYNHGFLARIKTIHFARWVFMDNNHRVFFASNYDGSHESYMDDFINKV
GWGLNLTFSNAVGYPTTRWMIKEGAQREHAFKYTQRRHQIPTEVWYKAYP
GLTAVDLVRNSRIRQGVEIRQSDDAEIREWLSLI
>NE1497 conserved hypothetical protein
MDYFYESEHTKFMRELFAKRPELIEKQKEARAIWWDKDVDREALKCFEEA
EVPQRSYVYFSWPDQEQETEK
>NE1204 TPR repeat
MRSVRKRTVLVSTLLALVLTQVARADDKGSHPAVIGDVHFKVECNATAQA
KFNVAVAYYHSFQWQRVIATADDVLKVDPTCGMAHWVKALAMLDNPFAWP
VTLSEKAIAEGPVLLDAARKAGLKTQRERDYVDALAIFFKDLNTTNYRER
AESFEKAMAQLAQQNADDSEATVLYALILSRNFDPTDKTYRNQLHAAELL
EPIFAREPNHPGVAHYLIHSYDYPPLAKRGIDAARKYAKIAPDTPHSLHM
PSHIFTLTGFWQESIDTNRRAAEMADDSITHDGHHASDYMVYAHLQLGQD
LAARKIMEQEQVRHGIDMIGVAYPYAAIPARIALERRAWREAADLPLYAR
DTYPWKKYPQAEAVNAFARGVGSAMSGEPAKANFEAKRLIKLRDAATAMK
LNYWADQIDIQAEVVRGLADFAEGKRDEGIAILHRAAEREDASAKNVVTP
GPVVPAREMLATILERDRKPADALAEFEKVLEQHPNRYRTIAGAAQNAKQ
AGNEQKADHYAELLLKLAEHADSPRPEIAEAKSMLGM
>NE1977 putative transmembrane protein
MKILAILLLLSILYSLGSALYFMIKDKGDSTRMVKSLTIRVTLSLVLFML
MMLWVYIEYIHGN
>NE1245 Kazal-type serine protease inhibitor domain
MKQQKNQAGTQARNACVLAGKRSPTQFTSALGLPGSGCSGFNPQLYFSSD
FGAGNGAFSSGRGLSPPSITRKTRKELPRYLNALITWLIVLSVSLVVAGC
EEGNPPQPQLQPQVCGTIQGLACPAEQYCDLGIGQCKVADAQGVCKTRPT
ICTREFNPVCGCDGKTYGNACGAAAAGVSIDHEGECKTAEPQACGGIAGI
RCPDGLACVDDPGDTCDPEHGGADCAGICIAGQGQ
>NE2562 hypothetical protein
MKKQIAGLVLLGLACTATSVSAEEDRELRQKVEALEAKIANLEGRSENEE
HGDSHGFDKHKFHGGVVLKQDAFFGFQTILDAGYEVADNIDFTFYSWLWT
NPNFGKSSVVSGGNNVGGQGLWTEFGIGLNFRFLDNTLSINPNIGMLNGS
LLSSEVVGEDIRAGEGVVPNLVVNYDNDYFAANLYVAYYMATRGPRARDF
LHNWINVGVKPALFGLGKTLPINSVGIHWEHLWAAKNRIDSSLEGVVYNW
VGPYIEFGLPKNLALRFAGGFDVKSDVSNNFYQASIKLNF
>NE2539 hypothetical protein
MSAHKWQFASRFRRHAFGWRSDTPVQRIKEAITEIKQVARKEPVLAAEGA
ITLLEKLSPALEQVDSSSGALGSAVNKAIDTLVPIIVKADVEPKLRQRWL
ERLWQALQDDEMPYIEVLGDYWGELCVTPELASHWADEFLPVVESVWSPK
ASGHGFFKGTSACLASMYAAGRHQELWALLDKAPFKWWHDRRWGVKALAA
MGKKAEAIRYAEESRGLNDPGWQIAQACEEILLSSGFLDEAYRRYAIEAN
QGTTNLATFRAIAKKYPHKQPEEILRDLVASTPGAEGKWFAAAKDAGLFD
VAIELATRSPTDPRTLTRAARDYAEKQPAFALAAGLAALRWISLGHGYEI
TGTDVLDAYSAVTQAAVNAVVPTQQVNEQIRDMIASTQPGNSLMKTILAR
HLAN
>NE2241 hypothetical protein
MIKILLLLTSVLVAMPVAAVDVAPRISDREIIESLAELKAGQKALEEKMD
LRFNAMQEQIDQRFTAIDQRFTAMQEQMDQRFTAVDQRFTAVDQHFTAMQ
KQIDQRFIAVDQRFEAIDRRLDFIQQLMLVTIAGIFGLIGFIIWDRYSTL
RPMDMRLQRLEEDLERDLELQSPEGSKLTRLIHALRELAKEDKKVEAILR
SFSLL
>NE2039 hypothetical protein
MLMWLCFPLYIVSMHYQQYVRSMSGPMTEMWGGNRVLSGIADYARMTVSG
IRYSQIYKGSYERFNREILSNGLIFFHRPMTQLENE
>NE2239 hypothetical protein
MKQTNLKKQLVAVAIGGVFALGVTAQATAAGIFQYDLDGQGGSGETVIAD
AIQGVANESLSLLADGKTLDGQGWVKFNTFLLSTVDQDYKYSEVLLYATF
KITTELVDGTIGASGSEYKVTSFTFDLYKDLGNDNTFTVADASSSTHASV
TAVGVDDYIASGELIVGSANIQAASGAAINVETTFNLQPGGGEYFFDPDP
FYNILKAGFNSTGGNWAFTNNMLAVGSATGVIDFNSTPTEVPEPATLALL
GIGLLGFGARRALVASKNA
>NE2168 hypothetical protein
MLPFSSYKHVLIYYIAVAAVLFSGFFSYVVAPHRQEIEVGHVELVNLDSS
LIENRKFSDFTNAYIPEITEHLTMARSGWLPLWSNNTELGRPLYQISGFS
SAYLPSWVITRLVDGPWRFITTLSLGFCFLAGLFVLLFTREVGLSPIAGL
IAGLGLATSPLFMYWLTFPMFPAVWCWAAGALWAVTRLAKRPDILGWGVL
AFSGYSLLMTAYPQPVVFHAYLLGGYGLWLAYHQARVSRLELAKFLTLAL
SALVVGAALAFPVYRDLFILSSESARVAPDPSFFTMVLPKFASFTELVRF
FVLSTIPEIFGNPIAPSFPFSYDGLSVTLIAIFFGVVALVTSFKETWGWW
LAILIFCLLAFVHPLYVLGVKYFGFNLSRSTPLGSITLPLTIITAFGIDA
LARRTHHRQFSSAVFAGAAVALVVIAIGVAYGVSQHISIHWEIVIGMLLV
TGLLIAQYDRYRPLFLMMALVLVLGMTSYPLMLKQDPAQIAMTSPLVEKV
RENLPAGSRYAVAAPGISVLPPNLNATLDLSSVHSYNSLSSTRYHTLIKA
LGGEVQTYGRWNGAIDPDYAGTMFWMSNISLILSSGKLAHENLEFLSEES
GIHLYRVVSRMGDSMQVTPPQLDMSSTKLVLDDPRGMVTNTPVKILDQGD
VLEFEVNSSAPSVFLLSQKFHRDWEALAETNQGWQAAQTVEVNGVFQGVL
VPQETRRVRLEFKPLARYAWIAHVFWIFLFVLIIFKFSQTFRRRVLERV
>NE2209 hypothetical protein
MKFQPLRGESITTGLLFSLFVLLIFSCRAVAQESTRNEFLSIAKSAVVLY
DAPSLNAGKLYVAGVNLPLEVVVKVVGWVKVRDYHGYLAWVEDKNLGPKR
FVIVKIPVGSVYQSPNPTSSLIFQAQQDVILELLGVVAGGWVKVKHRDGQ
TGYIRTDQIWGV
>NE1269 hypothetical protein
MIEAPFNKAKYAIFMFYGIDDGKQVISSYPIFGQTGTSSSYTTGTITSYG
NTAFYSGTTYKTPTRGVVGSRTSTDTVFKRYLNIDIIDIAKSGNGKVQKV
YEGKAISSGTNGQLAPVMPAIVRSVFEDFPGKSGASRTSRQPVEK
>NE1547 hypothetical protein
MAFTTSSMLQSVCKTSTLKGCGKIMRSLCRTPRHRSYATTFVVLLAIITI
LPAMRFMEDLIRDSFVVTLPENERLSSFALFTS
>NE1986 hypothetical protein
MLVFAALALTIAGCVYLYLASPNQKWLVQALPGRPALVAGGLLLAAGLAA
WITVLRPLAGFFVTLHVAMVCLFAFPYIAALRGKGRRN
>NE1504 hypothetical protein
MKTDNKQTTIQLLITSLFALTLTACDSQQEATSNKKPVAQAQGPATGILA
DSAVEGVSYSASSGASGVTDVTGLYKFNHGDSIEFHIGKLNLGKIPGTGL
TTPIELAAGDRNKLLNLLVLFQSLDADNNLANGISIPKTAADALDASLDL
KADPGTFPSSPALATAREAAGIAGSIKTADEANAHFLSQAVNLLGGHLWV
NQDDTSLNFFRFSTDGSGEYLHGIATPDDSCDANRACGSKLVFTAGVEYG
TAKATEYDERGFKLVSTPEVDTDLQSGLSHPRPNWRVYTNGNELIISDIV
IVQREREQASLFGELFHISKPIELSSDDEVAETTVQEIRYHKMDNSQSIV
GAWTMDKDSIKSPVFLFFPDNRYMLVDPVGSATQSTPAACAKPGVELATY
AFDAASGTLKLSSFTYNTAGCAGLSEYSGKPITFKIDTGAQNATLSGERL
APITLQRLSN
>NE2105 hypothetical protein
MVNGGITKPYAHKGRDDYQNEEEILLLVQFPLLFFWREAYSPERNFCSSM
KTVVDPHFSAHTVLIASADKNPGIISIRT
>NE1833 hypothetical protein
MWKLLNLTGICTLILVVTLAIMTYLAIDSHPRIEREISITPEQIARAKDI
LDTHRYQVRPGTSATVRIQADDLDNALNYLAYHLAQGHAKVTMHDKSAQI
QLSLPIPPGMITGYLNLQATLTEGKSFPELSSVSIGKIQIPDILAGQLTE
KLLAWLQTASPDARAGLDAFRKLRFSRNEVAISYFWKGWGIDKASYSPVS
LPFFDRQALDKLSHYHHFLNEQNRKRVSHTITLSEILTQIMQETVRHSPN
GNVLEEFRAAILVTAFHVVQFPLRLVIPETADWPDPVRINVTLDGRNDLA
MHFMASAVITAYSDTTLSNAIGLYKELEDSRSGSGFSFNDLMADRSGTRF
AEKAMASQDSARRMRNIILAGIHDTDLIPHWSDLPEHMSETAFKARFGST
SSPRYHEMMDKIEQRVASLKWLRY
>NE1985 hypothetical protein
MANEICLDWWGKASAGAILGLGLALSLVGLYAYLGPGGIDAPGGRYSLMR
YLEVFVWVAVFGFCFLFRSGRAAWAWLGAANLIAFTTLFACRFYFFV
>NE0972 hypothetical protein
MLSIKPAAEDLAARQPVWEALSDMFLDTDTSLSRQWRADQLARSPYSIDQ
LEFILINEVYPICKYNLLSVAGEWAGFDPEWLKEKILRHLGSRFRFLHTL
NLGCFTVHASVEWHATRHAILAARSIGTKNTT
>NE2057 hypothetical protein
MKKYLTGVVLFGLLTLFTGSTWAHSDEQLDGVAAPHGGQLRMAGPYHLEL
VAKDGELRLYVTDHMDHEVLTKGGSGKANVFDKDGKKVSVTLIPVFANFM
KGTGEFTITPETVVSVFVVLDGAETQAARFTPLKKASAKAEDEEEHHHGD
ADQGEHHHHGDVEHEQQPTDQSDAAESEENEEHHH
>NE1818 hypothetical protein
MAHRIRSSGKSATETGEVSQDVWFDNWEKGLDLWELARHYLRTDTTISLL
WFDSDDLPEVEVSRFGARIQDDGGLAELTGELPWPGRSRRR
>NE1793 hypothetical protein
MEEWSRQSINFTVRLGEMSVLAWPLNACVLKTHFTKLPIHPTVPAELPKL
FRDSVDVVVTRSHPIESSLAKLSILPQAIQYIPSSYRRYWVVLDGNFEDY
LKKFSAKSRNTLLRKIKRFAELSGGEIDWREYCKPEEMHEFYKLAMEVSQ
KTYQERLLDCGLPSDQQFQENMLALAAEDNVRGYLLFYQKKPIAYIYCPV
HDGIALYEYVGHDPEYQRWSPGTLLQYFALQRMFTASHIKIFDFTEGEGA
HKAFFATNNQYCADVYYFRRTWLNLIKVALHASSDKLSDGIVRMLDKFGV
KAAIKKLFRSKA
>NE2436 hypothetical protein
MSEVITVGDKPILLKAIELVLAEPAAIRKEALQLKDKYVTRYGSDRSEDE
INAYAADKIISNYSYYTAFVGGTTALTGVIPGLGTVLAAFGGATANTALS
MKYQIEMTMAIATIYGRDITIEEEKRLCLMIAGLGAISEMTRVGGKEPGK
KASVKMMQQYLQDASLQTLRELFKKVGITFTKKAAEKAIPFGVGVIIGFS
ANKGLTWYVGTKARDFFSVTDSIV
>NE1607 possible transmembrane protein
MTARIMRSLKLNFPYRRQQIPLVDYLLLFLGMVLLLAVMYTLKQTMSKIT
YWEAREARIVQQQKHTRQPRTPMARINKATQQELKQADDILRQLNLPWEA
LFDALELAASEQIALLSLQPSVTGQTIRITGEARDLAALVEYVQALELEP
VLKNAHLASYKARQDHLRRPIVFSIIATWHESL
>NE0473 hypothetical protein
MKVITYYQVIADSTAQTDCAFFIEFMLTVIEETLSESQIITPQATLQDIP
QAVLEIMEQYPGLAEFCQHPRSCTELQAFYHLNDREHFRKAVLTPLLDAG
WLRRTQPDKPNSPRQKYFREH
>NE1381 hypothetical protein
MLDSNKGTSKVIVAPKIFDINKNENRGNLTIFLGKLRKYFTHNGSHEITI
DFTQTEKFIAAGTLLFYSELAYLKQFINNETRLRYIPPKNPKAFEVLIQI
ELYKLCGIRKPKSKNANKYDDVLNWKVACGNVVNNEQCAPTIEAYEGQLA
EPLIDGIFKGLAEAMTNTVHHAYAEIREDGLNHKPSKNNWWMFSQARDGE
LTVVFCDLGIGIPRSLPKKHPSIFHKMLSLGKISDHQCIASSVELNATST
KMPGRGKGLGNIIEIASKNKAGGVIIYSNKGMYRLGPDATEPFSRDLKNS
ILGTIICWNVTLSKVGL
>NE1556 hypothetical protein
MDFDTFRPILSGLVGGLVVYLLTYSGRKPAATEGGRRLLIYGLGIRIFTA
ILIPSSLFIAYAAAHAHPDQAILAVCIAAAFFSYQVFFVSLAYDNDNIYY
RSPIGGNHVIPWPDVVEVGYSWLMQSYYLRTKQVRRIWCSNMLRGYNELE
EFIPKKADKLFHPELKSYSEAHIN
>NE1597 conserved hypothetical protein
MKPDTSSGYSPEDNKNGMQIITTYEAILAITDQMLQAAKNSDWDKLVALE
QDCKRLTTWLMEQHTYEQLSEEQKKKKISLIHGILERDAEIRAITEPWMA
QLQNKLTSYGHKRKLGQTYQTDS
>NE2193 hypothetical protein
MALPVTSSANQPTGTNNVAATKCKGCFCPGNPCQLCRLPPHTDDPIPENE
PETCRLIREAVPPASFQPGENEYFANLDKATIQCIRSGDVIPNTRRVPGY
PGRVYCKPGLPALGAH
>NE1235 hypothetical protein
MNNTVRAWLGKSREELDEIYRHATPGNIPAGDTRGTAILAGSFFSKTVAA
FARLFAWQGKVFDLFCPGGQAGVLVNKITPFGLTFIVAKVYRDKSWLDGQ
DTIVIDYSKTSFVAKVIRDEIREVEPGVYLGKVWWGKTRVLDFALTQSDT
Q
>NE0746 hypothetical protein
MTYAEVAKKIRNRLRRYSLISIINVGLNHLTQQHDNKEKALRAMPWLPAL
VMKLAIEDEMISMHGDLCPSAEFDACCNAIWNAKRGLDESVQVALLGVRA
LMHAQFIFQRSETFGFLRWAALISRIDASHPCRSLFERVFSMTPDDFMMA
AILLISQFKKEAPQQPIDLRDYSALPEELTKPLYQLVRLLSKDLSELRVQ
LQGELRSRLDSKTKRSARQESERHEFPWLAKYPLLKLDQTRVLAWNPTIF
FHGLEEFVHIRLSEFGQDYTDSFSQVFEDYVIELIQESGTHAITDQEFKC
LGNKGMSAVDALIPHAEGNVFIECKMSLFADAVLLSDHPPFVSEKLKRIR
KAIVQGWKVGDLLRSDKIKLSDAKSADNDYLIVVTSRQLLFGNGLHLKQM
VDEQFFDHIFPESNFMSPSKEQLSRMPPQNITILSIEEFEHLVGAVKSKK
VTYLSFVQQLSKNASNPKTAKMVADQEIRKYVDKWYIPNLLTNSRDRVVA
QLNAVFNCRDRIKSRTYK
>NE0790 hypothetical protein
MVKNIRSCLVLMVLSCFTVQTYAEDVTVRLSVQNIHHPEWERATDEELAV
LRGGFVLPNGVHIDMSLEKFIHLNDVLVHSSSLQLPGAGVVLQAGMQNMV
SDSITVPELSTFVQNTLDSQHIEALTTINIEVSNLKGIAANGGGQQVFTE
FLAPALLR
>NE1202 hypothetical protein
MITIWQNLSINIQRLFIALCILTGIVLIGMQFHVNSQGSMSDTYPKGFRG
GTCTIESDTLLVGYSAYFIPVDYEIPDDSMSALSVVPVLCDKVPGPGLLS
ITVDLLYPASIREQPVAVSLARKNGERIMEPLLSIPARNYQSGIISQEVR
IDESGEYVLQLSGTDEYQSEFHLDIPVTIGTKWYEPFVPYWPMLVLGVVA
AFFYNLRRIVN
>NE1555 hypothetical protein
MDGTKAPPVSSTLEGIMNTAPELIIARKAIAKIAIRFPSLTMIEEPTVPV
ELSIRLPVQPGLNYEVWLALQNNDELHFSVGNFWLEWFPCTESSRVKEYI
SAVTGFLSSQYRVLEHYRGKHCVKAELQAPSGGDWKTVGTWSNLLSFLPL
RSSLREVSNTQPIIPPDLPQQAAPDR
>NE0941 possible (AF047705) unknown [Nitrosococcus oceani]
MTSIKWLGGVLLGMVCSIQVQAHGGLSLAEDMCKLTIGPYTMHFTGYQPE
STQEKEFCEDIPNIGRTIVALDYIDEALRTMTTEVRIIRDTGAEPGSEGN
LDELTVFHSPPKVYMNGSVTFEHDFPAEGKFVGLVTIRDNGTEHISRFPF
AVGTGGKPDMLYILGALALAAGAGIFFFKKKQNL
>NE1790 hypothetical protein
MKNFIRKLIRALTAIRLETDRYKHSTLRIIINWLTAFLKDLFSAQEIRLY
GLADPEDGANRIRQYVSKEMADRFYRKYNPGAAVPNIEDKFIFTTLCLQH
HLPIPATYGIFKQGIVKTFDERIFQESSRFRDFIRELEPGEYLLKPNNGM
LGLGLSILEIDDQGSLKFQGKSITADDLYRELCSIEVTLPASSKGDSVDM
DFEGLLFQQRIANHPEITKLTGFKMLQTIRICTHVTDNNQVEILFAFMKL
AGQEGLADAFNLGKTGNMLAKIDPATGKFCNVYAMDQQQGYLVETTHHAV
TQANLLNFTVPHWQACLTLAMKLSTTFLPLRAVGWDIAITDNRPVVLEGN
DNWVPVVPFDINIDKLKQYKLKS
>NE0258 hypothetical protein
MFIVFSGSAAFAQEALLSAKEDYITCWKQPCVDVAGSEWSEKNPNGVGIS
VRMGTQSGVTDDQIKTVLTRDFKKFGMTNIKFFFEQNDAPAAGIAFHVRG
GTEGLFFIDNVREQVAGIARRAANTNPVFQ
>NE1133 putative protease
MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV
LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF
EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARPGKTSNGLGLLDS
NSMPSSIPGVN
>NE1442 hypothetical protein
MKPDRLWFHRGDFLDCVYRLFTRFWLFFLILVLCFPAIAATPRFDFTLEN
IRHPVFSIRSAGIKLIGAPSPTLEINLGEVAIGKQTWHGLRLRCNPVHID
RESMNCNTGTLQIGERFMTMIFRLSLQHKQFVLEIRPASNKSKEKWRLEV
NWQASKWQGVLQVVNGEGKFLADLLPQGDDRIQVHQAILNGNIRLSGNNA
SVSALSARLGISKLSFSDASGLHAGEGIDLQLDADAQQKRNDWQWRGKIT
WPEGEIFWQPFYFSGEGHQLTARGTVKDERINITQGEFNLAGTGKADFSA
VAGIADQSLQQAWLSARDLELSALFGSIIRPLAVDTALAETEAAGQMNID
WRYQGNDNQELIVGLQDVSLTDAHGRFAVERLNAHIPWNSNEKRDGSIRF
SNAQMVGIPLGETYIPIGTDGMRFSIPRAEIPVLDGKMLIENFVASMQAS
GWQWQFDGLLAPISMEKLTESLHIQPMFGTLSGTIPRMSYANSIMTMDGE
LVFGIFDGVAVARNLALSGPLSLTPHLTMDMAMYHIDLDLLTRAYSFGNM
QGRVDVEIDDLELINWEPVKFDAKLASSAGDYKRRISQAAIKNLIALGGG
LAVTAIQKSFLGLFEQFGYAEIGWSCKLRGSVCNMGGIGPATHDGGYMLI
KGSGIPAITITGYNRKVDWPELLERLRHAIESGNPIIH
>NE2426 hypothetical protein
MTKPLDRLIGMTLLSAEIASGSAELRFSRCDFSAYSTYSSFPDFGSLVGQ
TVQSIVGSMDRLVIRFAFGEFFISLHPDDYRGPEAFCARFADGPWVVE
>NE2179 hypothetical protein
MNRIGNGLLVAMAITWSGMVFSAVQNLPQPVFSGQDGEQAAADNAASPAP
AEQAAPASTEAETAAKAEEQALPSGIGWKLVRSLEMGDSGKFVHMVLIEK
GRQADKTIYSSAIHRLCAKEKEFCRIRFWVQSYLIPEKVSLTLEQQKTQQ
ADHLFNRAAGIHRTLWACTVDSTSESCIQ
>NE0824 hypothetical protein
MKSSFGLSALVGLTLSLHAYAGGNPEFVKFPEKYEQIFTHYDTANRANQT
QLAKFYANEIAAESYKKGEEAAPGSIVIMEIYAPKKDAEGKIQSGEDGLF
VIDKLAAIAVMEKRNDWGSAFKADDRSGNWGFALYDPEGKAKDNDLTCAQ
CHNPLQKQDNLFSFQKLVDYVKAH
>NE1341 hypothetical protein
MMDVVEATRILRREFAVWGNRLDRFFLARYSLLVDKPPKDFGSRLQHRLR
QFLVFIHLVPPRVVRRAWLPTLKHSSSAPDARALLIWALGMERDSLRNAC
LGFQRFLASRQDLAPVLVTDVADFAWFSRLGWMVEYLPELEGKGLPYQER
KRDYLAWRYRDAVVVPAAAGLLDEENWNRLLQME
>NE2229 possible long-chain N-acyl amino acid synthase
MQIANHLQSGTAVKNIQPAHIPFSPYQPDSTSSDDTDPRFPRSQKYQISR
HPGSNSSHIAETDCLLQRNGYSIHLVNSLKQRIKASTLIKRMYASRGYQT
ESASVFSTSSNQYTFEARQSQQLIGTLTLTIDTGKGLLADTLYQPELDQF
RRQGRRLCEVSKLAFNPETSSKEIFASLFHMAYIFAHRIHGVDDSFIEIN
PRHATFYKRMLGFRQVGELRTCPRVNAPAVLLYLDLEYMKEQITTQAGQF
DQKTKSIYPHFLSQNREKEITQRIQIEHTHFVPPSSRKSTFNHHQDYFQP
A
>NE1530 putative signal peptide protein
MSVKHFITAVSLAMVSTVIPLTVTAEQHAHDAKSAKPGHDMNKMWAEMRT
RAVGMAVSVAADEKGKLWLVRMQDGHIRVSHSEDGGKHFSEGVTVNPQPE
AILAENQNRPKIAVRNGVIAVTWVQALPKVFAGNIRFARSVDGGRTFSEP
VTVNDDQGEISHGFSALTLGDNGRVTLTWFDGRERDAADKGGQKYVGTTV
YYATSEDGGASFSANRKLADHACECCRIGMTLDSDGVPVVFWRHVFEGSM
RDFALARLDSQPKVLRASEDGWEINACPHHGGDIAVDEAGSRHLAWFTGN
PQNPGLFYRRADGENMTAPHAFGDLDFQPGYPAVFAYGKKVYLVWREFDG
NNYQLMASVSADRGDTWSAARAVATTGGAADLPVFVVGAQKPLVVWNSAR
DGVRFFNAEGDL
>NE0115 possible M. jannaschii predicted coding region MJ1674
MTTLISFLGKGIADKTTGYRTATYRFDDDSKHTTPYFGLALAGYLRPERL
ILVGTAGSMWDVFFEQQDASDDDVLALIDAVRESRVDADMLSAQEKRLTK
RLGLPVICRLIPYARDAAEQTEVLLTLAKLVHRSEEVFLDVTHGFRHLPM
LALVAARYLAHVKDVKVRGLYYGALEMTSTNGETPVLQLDGMLQMLDWVE
SLATYNKDGDYGVFASLLQQDGLPEGKAKQLTRAAYFERSSNPVKARETL
GSVFSAIKTHNGPMGVLFRDALTERINWFKEPDRAAWELALADAYLERRD
YVRAVIYLYESFVTRAVLEHKLNPNDFSERDEAWKDARQDNKQVRKLEYL
RNALAHGIKSDDKEIIRMVNDENCLDDQLKKFRRSLFN
>NE1752 hypothetical protein
MNIRSTGLSILIGALFAIPATTATAVSPVTDTSIRQGTASYLILADAHQH
QGHQGGSAGQGHSGGSGHAGHGQGGQGEGKGRHGGGGGHSGHGGHGKGDM
EHHGHPPSYAHSVAMQAEALGLSDEQLGKIVRFHLKEDKQAHERIKQKMM
ESMKAFRKAVGEPATDDETLRKLGQAHIDSFNEMVKYHIDERKAVRSILT
PEQIGKLKAVKSDHDH
>NE1599 conserved hypothetical protein
MGYSLIFTDAYNQRAARWLRRHPDLRTQYLRTLQILQTNPYHPSLRLHVL
SGKLQGIYAISINLSYRITLEFLIEDKQIIPINIGSHDVVY
>NE1120 putative orf; Unknown function
MFNMALVFFLIAVLAGILGFAGIAGTLAWAAKVLFFAGLILTVVFYLLGK
RTPPV
>NE1238 putative oxygenase
MSHFPIVIVLTGLLLLSGCDALEPEISCLSVLMQGDIQHVAKTREQRFLG
KVTGRRAHCLGGDHAVALNRNPWLDWPNFWGTGDSLSLSSSPLASSFFGP
NERGINSALYELELQRIELIKFNLFDNSGTYQAYVTGRDGRAGPVLQVWP
EMQLPPTHPRYKDVEHNQEHQVCSGELIRFRTVTGICNDIYNPLMGSTHQ
IFARNVQFDTTFPDLGLDEMARNRHGDRLGLLKPDPQVISRKLFTRTQSQ
PDKCRNDDELSGDLEKFACDYKKAPALNVLAAFWIQFMTHDWFSHVEEES
DQSAWMTVGCITQRIDNIEQPLAAKEARQLGCRPGDRIHVAPIDDDTPPA
SFMHDGHLYRTRAPKTTRNHVTAWWDASQLYGYDERSSQRVKRDPEDAAK
LALIHVRESVDRGDESGYLPTFEVDDPIDPAWSGQEAAAFPDNWSIGLSF
FHNVFAREHNAFVEEFRKQAAKTPDADSGLRNPAHPEMIIRYRDVTAGEL
FNVARLVIAAEIAKIHTLEWTTQLLYNEPLYRGMNANWHGLFHEHAAVSE
VLREIIRQLDDTEGISNSLHAAFAGGAGIFGLGNHRYEGAPLYSLVDRNR
KDIWTLTRNEDINGGVNHFGSPFSFPEEFVTVYRLHPLLPDLIEYREWHN
NPNIIRQKIPVIDTFRGKATGAMRQKGLANWALSMGRQRAGALTLQNHPR
FLQNLKIPHLQSSTRQIDIAALDLIRDRERGIPRYNEFRRQYGLKQLTSF
DDFIDPRVPGDSSVRREQEQLVRTLREVYGQHRCDASRLITNAQLNDDKS
PINDCLGHPDGSLVDNIEDVDTVVGWLAEFKRPHGFAISETQFVVFVLNA
SRRLFSDRFFTSSFRPEFYSILGVEWVMHNGPGPEIMEEGTYNGHRQPVS
PLKRVLLRTLPELADELQGVVNLFDPWARDRGEYYSTQWKPRRGAEGDEV
FTR
>NE0605 hypothetical protein
MMRVKLFLAVVLSLSVSCFAGSGGDQPESLPETGQTKNASTGDEQPTRIE
TGKKEAESQSSQETGLDKQSLMIDYCRKHTC
>NE0234 hypothetical protein
MQIFDDYSNLTGDVTKDAADRALRHFIYRTQRGGMVIPAELQAFILNGIE
RQLAEGTGGWFVPARGRPTISNDAGWRFVAMIAWHEYYFIAKGHSEIRRK
NVSDFLIKQFGHTYCDFDLSDSGARRMIEDVNNRGFSGIDSGSPSGINNR
DTDLLNAKLFCRTELNMRGLAELSHAVAMVRRLNKNRGHK
>NE0889 conserved hypothetical protein
MAIEKVKVTGITFFDGTLDDGKHIDSGKVFIEHLLDFRKGTAKGSSTTAY
PLASSKEAKALMNHDFPLVCEVEFLTLSSNKGPKTVINALRPVPAAASPA
R
>NE0292 conserved hypothetical protein
MQQHGWTLLFHDNLIEQLMRLRAAVLRAQENDPEDFGSNTNVKFFRALIQ
LMQDVVPGDPVRDEYRQGNTMGPTYRHWRRAKLGRRYRLFFRYDSKAKVI
VYTWVNDEQTLRSSGSKSDPYTIFEKMLGRGNPPDDWNALIQASKPNWSQ
LE
>NE2440 hypothetical protein
MKAHISTVKLSILVPAALLSGFFLTGSTAAIADSSSDSNRANSEKYEKKE
DIRKYDQRNDLDESRRGIEDTVIPETDSNSTNQQDHNLRNPSEQSPAEIM
PGRN
>NE2387 hypothetical protein
MMKNLFVLLQSITAIFPVSIFFTYIIMDEGDQFTYEHYLVTALSAFPFFM
VLLIKYFISGFENK
>NE1156 Bacterial regulatory proteins, MerR family
MDYDILHGIVIEESEALSLSELCQICNVEVEWIMALVNEGIFEPAGTRPE
DWFFSGVALRRVLVVRHLQRDLDVNLSGAALVLELLEERNALLAKINLY
>NE2061 possible (U92432) ORF4 [Nitrosospira sp. NpAV]
MRRESLLKKHEGVVKSTGGGVVLKQSLYSIVTAFVFVILCSASTVWGHGR
VSLEEDNCVRQVGENMVHLNTYQPQYDQAGHYCTEIPAAGDTYLVVDLID
PALRNMPVSMKVFRGEEKGGEAILQVKADYHPDGVINGIGKLDKGLYSVM
VTAEGVPPLNYYYQLRVEMVDYGKLVRTWAGPAVAILFLGWLMYKLVQSG
RLRSWFKSQDD
>NE0973 hypothetical protein
MKNIVALLTVFILFMTQSASAELYDPDEVQVYIVPMIDFPEPAAAQLSKI
LSDDMKIWVKSSVRLGDLEAATLPGTRQLSGDSIIEKSYPIVTKLPGSSK
NTMYVLLTTRDINSETGAFRFQFSMHHSEMRVSVVSMARMIEFIDGKPVV
NHLVLNRLYKMCKRAIGEQYFGWKRSTDINDIMYSPIMGMPDLDRIGIHH
KENDDENEVEPVDKNRISI
>NE0725 hypothetical protein
MTTPNDADNETYSQSLENDMSRLLPDQSVLGEPEPWESWETSLCLWSIGI
GIAALVILGILVDWFLLPGQK
>NE0238 hypothetical protein
MDNYTGQSARLLPLVFYQDCSLVDIQLYNPSNNNKLIFSAIFNDEYALLL
NGKSDTLYSLNRHVLSLDDSKSAIDYLRFFCSYVQSEYGPFQIITHLDEI
PFKDEGMDQNIRDTIKASIHNPEYLEGSFERDGWQAFKACVLYGGALFSS
VMRVFSNGRVSLEEDRAIADNLLLLQRQYHGIFRTPL
>NE1239 ALOX5, Lipoxygenase
MNFILLKERHMMNKLPQQEENRRTVENRKNYLLRRQAQYQYAYEYANTIA
VVRKLPCREIPGPGYWLRGGINLLQLIPSLPSLLVTYMRYLLGKPMESYR
DYIFYPFSPPNPALVDNFQQDLIFGLQRVIGVNPVVLRAVTSQHPLPQKL
PESEIQRVFAKYVDETDYATAITQKRVYILDYADLEILQRNPGQIDGGRK
QYVTTPIVVLFLQADGILRPIAIQLYQDAGPDNPIYTPNDGNLWLAAKTF
AQVADGNHHILVTHATRIHYVMEAIIMASRRQLYKSHPLCVLLNPHLRHT
LNVNHQHTFLRDRKGRPGRYGELFAGDYDATTQCMANGMTSFDFRASAFP
NDIASREVDNPDLFYPYRDDGVLLWNAIQHFATEYIDVCYQSDGDVAEDC
EIQAWAHDIGARDRGRIPGFPARFASRQELAETIGHVIFLCTAFHSCIHF
NQYKYPGFVPNMPHSAYAPPPVGKGAEMDADGLLKFQPAFRAAYSQTWTY
FQTNFTVNRIGQYPLRQFDPAARDVIERFRKRLQEIEGRIDQRNSSRPVP
YDRMNPRIIPNGVTV
>NE0458 Plec1, hypothetical protein
MLFKSHLRSILNISPQSENEAESNVVPLYTGKTASDDTLLTTNDTPAQEE
YDAPADDSNASRLEAENLAIREAKSRIETEARARVTAEARARVEASARLA
AEARIKAETAATEEARARARAEALAAHEAHARQELEARLRQTVEDGLKAE
KETVTALRAKVQAEAATTEKARRRLQEEALALEKSRERELAEQRAIEAAI
ARRRTHEEALKIALAASAAEAEATALARARIEEDEKNIALANAKAEAERQ
AIEEIRLRTEAEADLTSKAQQKLQGEISAREAEQSRLDAEKKAIEAAQSR
RDLDLTAKSEAEARAAAELEAATAQRTRIEAERKARAMAEQVALAEQEAA
NAALERSRADTLLLEKTRAHTLAENEACAAAEARMQAKEQETAIFNEKAQ
TDQAVTDTIKERIQAQETAIRRARARAAAEAVARKTAEDKITAETHAAEL
AEKRIALDRQVEKEANELAETEARLIENKRKQAEAVQQAKSAAAARIEME
QKLTELSTRIAQNQVIAMAKTEERLKAAETAAATVLHKIKLESSALKAIQ
ERIEQDALAVERAIAREAVEAMAVEAALARIRTDEAAIAQASRKIREEIE
TTKMIQDCFDEEIPSDTVMHDKEQGDTHSSDDGETLLAATEERMAAETSD
ALDESDDSANQPESGMLNDPVQSDVSGNSIESNDTEVLPDKSETAEIE
>NE1240 Ptgs1,COX1,Cox-1,Pghs1, putative cyclooxygenase-2
MNEFFFQIIFRLVNRFPWISRVASRITWLRRWISDTFINWQAYATNPRPR
PFSMAAPYTTWQALTDRTFTGRHLPEAEGEQNLPDLKSVVNLWRRKENRE
IPSVDTSILFSFFAQWFTDSFLRTDFFDRRKNTSNHEIDLCQIYGLREDI
THLLRLKKDGKLKYQVIDGEIFPPYLFNVEETTADNWVFADREFENLHPR
AVLEFVFDNVPEERLKRMFATGLEHGNSSIGYTLMNTIMLREHNRICDVL
KEAHPTWDDERLFQTARNIMIVLLIKVVLQDYVSHFTQFGFTLDPTPGMA
ERQRWYRTNWISLEFNLLYRWHSMVPEYYFVGDQRYTLDEFRNNTALVTH
QYGIGTMISAASQQKAGRVGLYNTPQFFFDPLPVGADNRSVMERSVEMGR
QAKLRSFNDYRQAFSMPRLRSFEELTADPALQRELKELYNDRIDDLEWQV
GIFAEDHDEGFSLGRLMVRMVGYDAFTHALTNPLVSGYVHNEKTFSSVGQ
SIIEETSLLADIVKRNVRDSDTVIASFRTSAVA
>NE0944 amoA1, Ammonia monooxygenase
MSIFRTEEILKAAKMPPEAVHMSRLIDAVYFPILIILLVGTYHMHFMLLA
GDWDFWMDWKDRQWWPVVTPIVGITYCSAIMYYLWVNYRQPFGATLCVVC
LLIGEWLTRYWGFYWWSHYPINFVTPGIMLPGALMLDFTLYLTRNWLVTA
LVGGGFFGLLFYPGNWPIFGPTHLPIVVEGTLLSMADYMGHLYVRTGTPE
YVRHIEQGSLRTFGGHTTVIAAFFSAFVSMLMFTVWWYLGKVYCTAFFYV
KGKRGRIVHRNDVTAFGEEGFPEGIK
>NE2063 amoA2, Ammonia monooxygenase, subunit A
MSIFRTEEILKAAKMPPEAVHMSRLIDAVYFPILIILLVGTYHMHFMLLA
GDWDFWMDWKDRQWWPVVTPIVGITYCSAIMYYLWVNYRQPFGATLCVVC
LLIGEWLTRYWGFYWWSHYPINFVTPGIMLPGALMLDFTLYLTRNWLVTA
LVGGGFFGLLFYPGNWPIFGPTHLPIVVEGTLLSMADYMGHLYVRTGTPE
YVRHIEQGSLRTFGGHTTVIAAFFSAFVSMLMFTVWWYLGKVYCTAFFYV
KGKRGRIVHRNDVTAFGEEGFPEGIK
>NE0943 amoB1, ammonia monooxygenase, 43 kDa subunit
MGIKNLYKRGVMGLYGVAYAVAALAMTVTLDVSTVAAHGERSQEPFLRMR
TVQWYDIKWGPEVTKVNENAKITGKFHLAEDWPRAAAQPDFSFFNVGSPS
PVFVRLSTKINGHPWFISGPLQIGRDYEFEVNLRARIPGRHHMHAMLNVK
DAGPIAGPGAWMNITGSWDDFTNPLKLLTGETIDSETFNLSNGIFWHVVW
MSIGIFWIGVFTARPMFLPRSRVLLAYGDDLLMDPMDKKITWVLAILTLA
LVWGGYRYTENKHPYTVPIQAGQSKVAALPVAPNPVSIVITDANYDVPGR
ALRVTMEVTNNGDIPVTFGEFTTAGIRFINSTGRKYLDPQYPRELIAVGL
NFDDESAIQPGQTKELKMEAKDALWEIQRLMALLGDPESRFGGLLMSWDA
EGNRHINSIAGPVIPVFTKL
>NE2062 amoB2, AMMONIA MONOOXYGENASE, subunit B
MGIKNLYKRGVMGLYGVAYAVAALAMTVTLDVSTVAAHGERSQEPFLRMR
TVQWYDIKWGPEVTKVNENAKITGKFHLAEDWPRAAAQPDFSFFNVGSPS
PVFVRLSTKINGHPWFISGPLQIGRDYEFEVNLRARIPGRHHMHAMLNVK
DAGPIAGPGAWMNITGSWDDFTNPLKLLTGETIDSETFNLSNGIFWHVVW
MSIGIFWIGVFTARPMFLPRSRVLLAYGDDLLMDPMDKKITWVLAILTLA
LVWGGYRYTENKHPYTVPIQAGQSKVAALPVAPNPVSIVITDANYDVPGR
ALRVTMEVTNNGDIPVTFGEFTTAGIRFINSTGRKYLDPQYPRELIAVGL
NFDDESAIQPGQTKELKMEAKDALWEIQRLMALLGDPESRFGGLLMSWDA
EGNRHINSIAGPVIPVFTKL
>NE0945 amoC1, ammonia monooxygenase subunit C2
MATTLGTSSASSVSSRGYDMSLWYDSKFYKFGMITMLLVAIFWVWYQRYF
AYSHGMDSMEPEFDRVWMGLWRVHMAIMPLFALVTWGWILKTRDTKEQLD
NLDPKLEIKRYFYYMMWLGVYIFGVYWGGSFFTEQDASWHQVIIRDTSFT
PSHVVVFYGSFPMYIVCGVATYLYAMTRLPLFSRGISFPLVMAIAGPLMI
LPNVGLNEWGHAFWFMEELFSAPLHWGFVVLGWAGLFQGGVAAQIITRYS
NLTDVVWNNQSKEILNNRIVA
>NE2064 amoC2, ammonia monooxygenase subunit C
MATTLGTSSASSVSSRGYDMSLWYDSKFYKFGMITMLLVAIFWVWYQRYF
AYSHGMDSMEPEFDRVWMGLWRVHMAIMPLFALVTWGWILKTRDTKEQLD
NLDPKLEIKRYFYYMMWLGVYIFGVYWGGSFFTEQDASWHQVIIRDTSFT
PSHVVMFYGSFPMYIVCGVATYLYAMTRLPLFSRGISFPLVMAIAGPLMI
LPNVGLNEWGHAFWFMEELFSAPLHWGFVVLGWAGLFQGGVAAQIITRYS
NLTDVVWNNQSKEILNNRIVA
>NE1411 amoC3, ammonia monooxygenase 3 subunit C
MATSILKDKTAQQVTDKPAYDKSEWFDAKYYKYGLLPILGIAVFWVWYQR
TFAYSHGMDSMEPDFDRIWMGLWRVQMVVIALAAFSIWGWLLKTRNTAEQ
LASLTPKQEIKRYFYFMMWLGVYIFAVYWGSSFFTEQDASWHQVIIRDTS
FTPSHIPLFYGSFPVYIIMGIAMIIYAKTRLPLYNKGWSFPLIMVVAGPL
MSLPNVGLNEWGHAFWFMEELFSAPLHWGFVILAWAALFQGGLAIQLITR
YSNLVDVEWNKQDRAILDDVVTTP
>NE2574 attINeu, qacE-like protein; integron orf
MSEQIFFEHDGVRVSSARFVVKGATYPISAITSVRAVRSKTFPLLAIVLI
LIGFGILLGGEPTLLIFGLATIALGVVWIIKKKELYSVVLQTSSGESQVL
ESQDRQYIHSVVDALNNSIVQRG
>NE2337 cycA1, Cytochrome c-554 precursor
MKIMIACGLVAAALFTLTSGQSLAADAPFEGRKKCSSCHKAQAQSWKDTA
HAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDCVGCHVDGFGQKGGYTI
ESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQ
DFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVK
AMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK
>NE2042 cycA2, Cytochrome c-554 precursor
MKIMIACGLVAAALFTLTSGQSLAADAPFEGRKKCSSCHKAQAQSWKDTA
HAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDCVGCHVDGFGQKGGYTI
ESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQ
DFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVK
AMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK
>NE0960 cycA3, Cytochrome c-554 precursor
MKIMIACGLVAAALFTLTSGQSLAADAPFEGRKKCSSCHKAQAQSWKDTA
HAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDCVGCHVDGFGQKGGYTI
ESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQ
DFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVK
AMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK
>NE2406 flhC, probable flagellar transcriptional activator transcription regulator protein
MKGKSILSEGKQIQLATELVRLGARLQVLEASTTLSRERLVKLYKEVKGA
SPPKGMLPYSEDWFTGWQPNMHSSLFINIYNYITRYTKVRDIDAIIKSYQ
LYLEHIEANRLQRILSFTRAWTLVRFVESKVLSVTSCVKCTGNFLVHSLD
IQSNHVCGLCHVPSRAGKTKRVAQEARAAQEAGAGELCVI
>NE2407 flhD, probable flagellar transcriptional activator transcription regulator protein
MGTNQILDEIREVNLSYLLLAQQMLREDRIAAMYRLGIDEDIADILVKLT
NSQLLKMAGSNMLLCRFRFDDSLIAEILTSHKQDRALTQSHAAILMAGLP
AEKIS
>NE2529 mcrC, possible mcrC protein
MTAVAEQEESASNSAEGFIGRIPVRNLWLLMLYASDLFRTRGIGKIGLED
SPDDLPDLVAEILAHAVEVRQRRRLSLGYRSRDAVINRVRGRIDVLTTER
HQLMDRGLVACWFDELTIDTPRNRFVRAALESISRIVQRKDVAHRCRALA
GGMKAMGVSGDAPARAQMSTDRFGRNDADDRFMVAAAKLALDLALPTEAS
GANVLSLPDREATWVRRLFERAVGGFYEVVLSPQGWRVLCGGTMGWQIEQ
RTAGIDKILPTMRTDVVLDHPSTGQRIVIDTKFTSIVTSGWYREETLRSG
YVYQIYAYLRSQVGCGDALADHASGLLLHPAIGQMVDETAVIQGHRIRFA
TVDLTASTSDIRLQLLRFFDPNQPVTGQ
>NE0842 merT, MerT mercuric transport protein
MSEPQNGRGALFAGGLAAILASTCCLGPLVLVALGFSGAWIGNLTILEPY
RPIFIGAALVALFFAWRRIYRPAEACKPGEVCAIPHVHTTYKLIFWIVAV
LVLVALGFPYVMPFFY