Gene list
Applied filters:
COG category: Unclassified
Gene type: CDS
Genomic element: chromosome
Number of genes found: 322
# Nitrosomonas europaea ATCC 19718, ATCC 19718 >NE1357 hypothetical protein MIQGDSDNGGNGRCQNTFHQMLQLKTAAAHLGFGKKQGQATVSRLYLFNP APARINDQSRIERIASAARAAPFIGIRHLLST >NE2100 hypothetical protein MLYTYLQRSMHRLRCILNVGRHTEPSEATMDENSQANKHKHLDFIQAAIN RMAGNLFLLKGWSITLIAALFALAAKDSNKLYIVIAYFPLFIFWALDGYF LSQERKFRALYDHVRTLDESQIDFSMDTRPFSSDIRNTWAGSASSKTLVV YYAGLAVVMLILMYVVR >NE1362 hypothetical protein MPGTQQGRTGYPQLRFVGLLENGTHVLFGVALGGYQDAEVRLAHQTIAHL KPGMLCLADRGLSGYPLWAAASRTRRAVALAHPQESPTAYA >NE1294 putative glutamate--cysteine ligase MPVPHLTHALNAPTLDLEKRILAAKPTIEHWFRKQWQHHTVPFYCSVDLR NSGFKLAPVDTNLFPGGFNNLNPTFIPLCIQAMMIAVEKICPNAHNVLLI PENHTRNVFYLQNIAMLQSIMQQAGMNIRIGSLLPDLDQPLPVLLPNGDS LLLEPVQREGNRLFTRDFDPCAILLNNDLSGGIPEILQNLEQIIIPPLHA GWATRRKSWHFSAYDNVAQDFADLIGIDPWLINPYFISCGKVDFHEKEGL DCVAAGVNEILHLIREKYAQYGISEKPFVICKADSGTYGMGIMSIKDGAE LYKLNRKQRNKMATIKEGLVVKNVLVQEGVHTFESIHQAVAEPVIYMIDR HVVGGFYRVHTGRGIDENLNAPGMHFVPLAFDTTCMQTDPTARPDAAPNR FYTYGVIARLALLASAIELERYLQTPLPVLAEV >NE0187 hypothetical protein MLHFYPTKVDLISNWNNNNRNNYMKVSLNLITTSMLIGLPMLAQAEVFTI PQSSLIPSTTLWTDDLGISIGNTLVMTGGGSAANVGDPTGRNDDGFSGPI SFGLDFSGLTLFGTTYSQFYANNNGNISFGNGISAFTPMGLQGATQPIIS AFFADVDTRNAASGVMSFQTHTTAAGSEVIITWPSVGYYSQQATPLNTFQ LVVREDDYLIPDGEGQIGFFWTTMGWEVGSASGGGSGGLCSGIGGIGTGC VPAAVGFGDGLNNGYVLEGSTLNGIAGVLQNHRLWVNLSDGGVPVIDPGA VPEPGTLALLSIGLAGLGIKRRYKAKIA >NE0704 hypothetical protein MMAVFYSAEELTISERRLRADLAHESAVEMVIYDVLVKGNRSIWIGDGIV TRNVQISEQMFSVSVQQATGLIDAMTSDSRILSRLLAWLAIPRNSREPDF LSARSTGTLRPATYTDLQAMLGLSHSAFACLYPHITFYSGRVEPDWRYAS NDLVELVGLRSRSAGTHSVLNDDTSSHNVTGATLRVNVLPGNTSDEAAGL SVEVTITGQIDPSHLIRSWKRITRMDNSKQCRNLNTQ >NE2245 conserved hypothetical protein MNLYSLLLSLITGCFLLAQGGTSFSSPVETEKSAVTAQSILEKADEIRFP QDSFQVNVAIRTAAPDHAEDLYRYQVLSKGNENSIVMITEPASERGQAIL MKGRDLWVFMPSVSQPIRLSLSQRLTGQVANGDIARANFTGDYHPQLLRN ESIDDEDYYVLELTGIDRSVTYQKVLLWVNQSNFRPYKAEFYSVSGRLLK TSRYENFDNILGEMRPTRIIMEDALKSGEVSVLDYSDMKLRDLPDKIFTK DYLKRLE >NE0165 hypothetical protein MPLSYAGCLCGIAAMVRLLKYIWAAPCSLFGLGCGLFLLLIGGSVRQVSG 
ILEFSIGYGNPIPFFPFWAITFGHVVLGLNESALEYSRAHELEHVRQYEV WGVLFFLAYPVSSLWQLFRGRNPYWYNYFEIQARQRSGQKRLGL >NE1091 conserved hypothetical protein MRSMTLEQLRTASETGGVSSVTLKGQGGAFLVQINTRSGVAAILTKARNS EPRRFGNPAAALNVLREVGITIGQFDASEWNPDEREPVARSSDNRAKALH KAHEAAAYNEWLAAEIQEAIDDPRPGIPHDEVMARMDARIVRHKAAGAKR A >NE0391 hypothetical protein MANQLTIDQRLQERRAGASFVCPYQLGIKQGRRVTRRRSGKGAAYVDKYG WPLVICCLAIVLFSATDAFLTINILSDGGTELNYFMAVLIEESTQKFVHF KLALTSLAAIILTIHHEVQIRGGFRCRHLLYMISTGYAGLIGYELVLLQI IDV >NE2041 hypothetical protein MLNIFHRFPEVTGNPAHRKRDYFIAAFTFFCVAVSLVIDAHGTLLLQNVL GVIAWIFLVALLRGENREIRMQVVIAVAFATAGEHFASIYMGGYTYRLEN VPLYVPPGHGMVYLTAVTLARSGFFLQHARKIAAFVVISCGLWSAWGISG LPEHGDQVGALLYVVFLIYLFKGRSPMVYLAAFFITTWLELIGTAAGTWQ WATLEPIFELTQGNPPSGVSAWYCLVDAVAISGAPVFLNAFNRMNGLLKW LKTNGISLKVILSRK >NE1232 conserved hypothetical protein MNMTIKNRVLKAIHNFLILLLRIERRLEPWFRPQWDYLFREPGSRLIQFL INRRRKNKDSDLELAEERFDPDEEESLNKIIDLMMDQMRGRFKPGGYERG GNTKTHGIVRATITIRDDLPEHCRKGIFANPRSYPAYIRYSGPGPNVPAD INDVGFMSMAMKIMGVPGEKLMSEEKLTQDFIATSGGATFVTPNTRENAK LQYWSLVDMTLYYFLNPKDSHLLDFFMQSLWTATQYNPLGQRYWSCTPYL LGEGQAMMYSFVPKTKEVERHIPGLPFGTPPFNYLRENMIKTLNEKDVEF DLMIQVQTDPHLMPIEDSSVRWPEKLSSFIPAATIHIPRQKFDSDAQFEF AKRLKMNPWHCLPEHRPLGNINRARFRMYYELSRFRQEMNETTHLEPTGD EVFD >NE1862 hypothetical protein MGRILKPDLVMTPIDENIRLGSILAVFASAIGVKQSGSGGNRDIESLFTT LYLRLNGTCGLAKTKFGADDGRKDMQYTMTESGVVQPEVAEVREVRWGGD SLPVIITVMTGSRFCSPDGHLP >NE1338 hypothetical protein MSIFEAAQIVWQELVTRIARSEWNLLKHHDLLSNKLPVLFRNRAFLFIRS PLVAARSIPFHITKYFWCSSTKYGVSVRP >NE0314 hypothetical protein MGIPSDFSNTGLTGFNVSSTKLVKIEQQYKGVKGKKFISPGGGKFVRIDD PGAVDSNGLRTADASNTSISLETGNTWLPRLHADGTLTCGGKCAWLECTN VTIPKNKRLWFRYAFLRFSSLPADSFSVLLCFPNDDTSVPPLPPYWICSV KELQENRGNINQTDWTECFVEIDKNADFHGTLRWVVATGHNLADQNSIPD NTRFTRPGCLLIDAIDIR >NE0296 conserved hypothetical protein MRKLHLYLLAFILCLAGLGLAYYKAAVIGLPLTASEEAQVWNIEARISFR AKPDSAIKVTLPLPFNPAGYSILDEDFVSADYGLAIEQDNTGRVARWAKR RAQGKQLLFYRAVLFENEEVAPKSDAAPVYPKPPEYPENLAPVIQGVLDK ARQQSADTISFTQRVIEQLNAAASIPELKLLVKHVSRTRNLAKTLSWVLA 
GARIPSRVIRGLRLQDDKDYADLEHFLQVYDGEQWRTLNIKDGQEGLPSN FIIWQIDDDRSFSIEGASESGIQYSVARTLQETVNIAGFRSAEKGSHLMD FSLYSLPVHTQNVYKVLLMVPLGALVVVFMRNIIGIRTFGTFMPILIALA FRETELFWGLILFSLIVGLGLLLRAYVEQLKLLLVPRLAAVLTMVVLLMA GVSVIMHKLGFEMGLSVALFPMVIMTMTIERMSLTWEEAGPAEAFKQVSG SLLVAVIGYLAMNISEFQYMFFVFPELLLVLLAVILLLGRYSGYRLMELW RFRAFARNKP >NE0520 hypothetical protein MKTKFALTLAASTLLVSAAHADPFVNGGFETGNFNGWTVSNSAYRASINN ANLTPDWVFANDNYAMHSQIISAGTIDPNVGAAFGSTVYAGNYSARIEDT TWGGYASAITQTVTNYTEDSINFVWKAVLLGAHGVNDAATFKLVLTDLTD GIDLITREYNAASSGSGVDSRFSLSGGNYYTQDWQIETLNINDTLKGHDF MLSLVAADCQPTGHWGYVYLDGFGSVAGGGGDDTNNVPEPATLAILGLGL LGMTATRRRKNS >NE1223 putative transposase MMDHDHSYKALFSHAEMVADLLRGFVREEWVNELDFSTLEKVSGSYISDD LREREDDIIWRIRWGKDWLYVYLLLEFQSTVDWFMAVRIMTYVGLLYQDL IRSESIHKGEQLPPVLPVVLYNGDNRWQAPVDISELIIPIPGGLERYRPQ LHYLLLDEGSYHDHELATLRNLTAALFRLENSRTPEDVQQVLQALIAWLQ SPQQSGLRRTFTVWLKRVFLPGRMPKVRFDEIQDLQEVHSMLAERVKEWT KDWKQQGIEEGLQKGLQQGLQQGRQEGRQEGREEGLQQGEAEFLLRLLER RFGPINETIRTRIRAADSQTLLTWGEQILTAQTVEEVFEA >NE0783 hypothetical protein MHTTDPAGWCSNRNSLIRKLLTSFIGSSSGAVKICRKAATVSFILATGSS QLFASGSSPTATQVDWSKAPPVSPPPRAGIFVKPPMGPGYFSLLDLIDGN EREKPQVDPLPPSALTTTPAFDFDFRYLEQPGHDKDFFDPVKRIHLGSDW LLSFGGSFWYRYMHETDSRLNAAGINNDYHLLRTRLHADLWYQDQFRLFA EMLDARALGLDLPALAIDKNHTDMLNLFADVKLGQFMDGPAYLRVGRQEL LYGSQRLISTLDWANTRRTFQGVKTFWQTPAFNLDAFWVRPMVTEPNQFD NWDKDRNFVGLWGTYKAIPGQVLDLYYLSLVDNRNVSPANITQGNVLQGD SVLHTIGARWVGDYERILYELEGMYQFGKRSHLDISAFSIASGVGYQLPL PMNPQFWLRYDFASGDKNHRDGRSNTFNQLFPFGHYYFGYIDQVGRQNIH DFNAQFTLHPQPWVTFLGQYHRFYLANKRDYLYNAAGAGTIRDITGQSGS HVGDEIDFTINFHLSRHQDVLLGYSKLFTGEFLKNTRPGVSPDLFYAQYN FRF >NE0635 hypothetical protein MLTLPLRHQYLIGSILIILMIATREYHFASLHTLPGASWAVFFLAGVYLS SSWSLLGFLVLAWILDFSAYFTAAGSDFCLTSAYIFLLPAYGALWVAGRW FAARYQFSWRALASLSISLLIGAMLCELFSSGGFYFFSGQFEETTFAEFW QRELHYFPLYLQSLLFYVGTAATIHTLFVLIHKSRHPQINATG >NE1300 hypothetical protein MNKVIVAAFVSAFVLGSTATFASGNLESSLAPISAKDMLDYLACKDKKPT DVVKSHTEVENGKIVRVKCGDIVALVQKAREQSGDAWQGGY >NE1130 hypothetical protein 
MSNSRFSTIAFFLGSSALSLIFLARSVLAEELPKLQNLDLFKLNGFLSQG YIKTNKNNFFGHSNDSGSLDFREIGLTASLRPSPKLQLSGQLLHRRAGEG SNGGIHIDFGFLDYNFANTPAAEIGIRLGRMKNPFGFYNDTRDVPFTRPS ILLPQSIYFDRTRNLALASDGIQVYGESRADWGNITVQFGAAFPQVDGHD TEISVLRSLQHRGDLKTKLSYIGRILYEQADGKLRLAISGAQANVGYSPG YNDTLLNGSFRFSPLILSAQYNAERWSITSEYALRHLAWKDFGNHAAFQQ SFTGESFYLQGVYRFYPNWEAVLRYDLLFTNRKDRSGKKFSASTGYLYPA HNYYAKDITVGLRWNITPAIMLSTEYHRINGTAWLSPLDNPDMSHTGKNW NLFAVQASYRF >NE0471 hypothetical protein MNEYFFPKLTAVEALAPYRLRTTWSTGEVLEVDVGDILRKIPDLAPILDP EAFARVHIAEWEGSVEWFDTEFGRDNVYAWAKEQAGEVSHEMFGDWMHRN NLSLTTAAEALGISRRMVSYYRTAHKIIPRTIWLACLGWEATRPETKTLP RTLPAAYAKGVSASLS >NE0136 hypothetical protein MKQSIFFLTALILFHASSNAGTLVNGTWSPMGCGERPIAPVVSAASVEDY NRSAEAINEWQKRAQQYNSCLVDEANADNALIARTANDQQAKFREEIDRI NAETDKARAELDSKR >NE2103 Helix-turn-helix protein, CopG family MKAKDFEQQFDEGVDITASLDLSKAKRVLQEQKRVNVDFPTWMIESLDRE AEKLGVTRQSIIKVWLAERLEKAALTHPSSGTR >NE1302 putative transposase MKTPWCYTCPMEHDHGYKLLFSHAEMVADLLRGFVREEWVYELDFSTLEK INGSYISDDLRERQDDIIWRLRRGKGETGEWLYVYLLLEFQSTVDWFMAV RIMTYVGLLYQDLIRSESIRTGERLPPVLPVVLYNGDTRWQAPVNMEGLI FPAPGGLDRYRPQLNYLLLDEGSYSDHELATLRNLTAALFRLENSRTPQD VEQVLQALIAWLQSPEQSGLRRSFTVWLKRVFLPGRMPGTSFSEIHDLQE VQSMLSERVKEWTKDWRQQGIEEGKQIGIEEGKQIGIQEGRLEGRQEGRQ EGRLEGRQEGRLEGESEFLLYLLEQRFGPVSDAVRARIGSADTQTLLVWG KRILTAQTIEAVFGD >NE2461 hypothetical protein MKTTLIKVIAASVTALFLSMQVYASGHTAHVDEAVKHAEEAVAHGKEGHT DQLLEHAKESLTHAKAASEAGGNTHVGHGIKHLEDAIKHGEEGHVGVATK HAQEAIEHLRASEHKSH >NE1200 conserved hypothetical protein MTWILTKYFITAAVVVIVSEFAKRSDKLGALVAALPMVTILTLIWLHVEN QPETKIANHAWYTFWYVVPTLPMFLIFPFLLQHFGFWLALTLSAFITIAC FGVFALLVRRYGINLM >NE0543 putative (L31491) ORF2; putative [Plasmid pTOM9] MYNANIPSTHEIPSTGRLIRSTVIALLTAIFLLVTVVMPAEYGIDPTGFG EITGLKRMGEIKVSLTEEATADRANAAASLQIELSEAVTAVTESLPLAPK SEMSHEKKITLAPNQGTEIKVTMTKGSKVHYVWRTNGGTAVFDQHGDSKE LKINYHSYSKGTGQMREGVLEAAFDGDHGWFWRNRTSTPMTITLKTEGEY TRIRHFK >NE1119 hypothetical protein MFVCSDFKQRSRKMKKTYLLSTVLLMFLSSSVIAEETLTEKLETKTNDVE 
RATNKAINRAQEATCTDSDAECLKQKAKNRASEAYDATKDKASEIKNKVD >NE2477 hypothetical protein MNNIQTNTRDQDSLEYILQLELDNIRLFLELLEHERNLLAAGNLDDLALL VADKDRLIDQFARLDMRRNRFLNAAGLPEGTQGMNAWVSGSDEESTVARD WEELLGLADLAKQLNQTNAVITSSWLQYTRRTLNALHSAAGRPPLYNTKG QTT >NE1230 conserved hypothetical protein MCTFQPGSLSNTWLEKKSVIFYPVLTESRGYPNIPLRLPVIRIKNSSCQG DIVRFLSIIIAAIFCTFPVSAATPASDDSYIAGYAAAILKFQFGIDLPSL TVRNGNITLPADKLPAEDRTRITQLFSEIPGVTRVEIVEYTAQQPSLASP EPDEAIVDKGALATRSTMLATGPLPEGHLFKPLLADPRWNHFSAAYRNYV GRNVDGNHNGSVSFGETIPFYRANIGQSIVQWEVGLQAGIFSDFNLGASS TDLINTDFIGGIYTSVRAGNASAFARIYHQSSHLGDEFLLRKLTDIERIN LSYEAADLRLSYEFPYGIRVYGGGGGIFRKEPSAIKPWSAQYGIEFRSPW QMEFALVRPVFAVDIKNYEQNNWNADISARAGIQFDNFQAFNRKLQFLLE YFNGYSPTGQFFREKVEYLGIGAHYHY >NE2354 hypothetical protein MSGYMARIFCKKAMLLLLLTGTGIAGHIPVSDARQVAEAIGADTFTGYQE IRDRRKSRPAAVLSSTEAGQDNSELKKPPYRRTRSTDAAAAFIPPQLLAP GRCGIVDRIEFVMQPYYGDPGRVDTFIPGMAAIPYGQHGEKLFYADVAVA GSSGYFPPHKNIVEPVYFKIGLLADDGHYHVMTQRTPPQFQTGETVRLGH SGFLEKADCVMPEPDQHRPGR >NE0536 BNR repeat MTIQNTLTKIFPVGLLVWFCWSLPVSAETGQPGFPIQEITVPASEESKQH HLAKTDDGRLILSWVESDGQNSTVRFAIREGQGWSPVRTVTSVDGKLGDP PVVFGLDDGSLAAAWMPYAKGGKSKYAADIFLARSQDGGLTWSKAFKPYG ESARIYDAQMSLTALPNARIALVWTDMRETGDPGKNDRYQLMATVLAKGQ QSAGTELRLDDDICSCCRASTTAEGESLLTVYRDHSQGEIRDTGAVRWDT DGKVQALSAPGDGWRIEGCPSNGPAVDMSASSVALAWFSAADDKGRFKVA FSTDGGKGFTKPFEVDDDARGYVNVALLSTNVALVSWRKRAGPEDELRIA KVTTDGISRQTAIYQGDFPGWPSKYPGMIVLDHQAFVAWTDPIKKKVRLV AVTLD >NE1242 hypothetical protein MTHHTEVFEGGTIDIEDDTSLTINGKEISYVHDAVKNKWSSRYLPYTQYD SLLDLARAIIRDTVEFSGVKE >NE0822 conserved hypothetical protein MTAITRREESTSRDRGALAKMIMTLLDHWQLSTEDQAALLGIAASNRTAL ARYRKGEAIGTSRDQYERVGHLLGIHKNLRLLFPQNRDLVYRWMTTRNKA FDNLTPVEVIKEWGFAGLLMVRGYLDRARGI >NE0363 hypothetical protein MENTAEKFEEEILDACIRHAKEVLAEQLPLVKDKKYDFAPQFRDLTIQLY LVGVMQQFYDQYEATTTDAQEKAFHALYYMMTKDGVKSRRAKNQAAFIRQ MSRLDDGDEALALALGYESKPGDRSLAEVFDHYVNESRVSKGLWRFYDQG KKILLLGGLLFAMAGIWFVTIYLPESDNITILAVGLLAALFFIVPVFLVG LLIHRYKTRKGSRTPTPPQ >NE1011 putative transmembrane protein 
MSNEKVSRSRLKLILMMLVILSPIVISSFLHRSNFRPDHTVNYGELLEVR PLQGEATNLTDNTIFRIRQLKGTWNLLIIDSGKCEEYCQEKLYTLRQVRL AQHVDKDKVQRVWLINDDIRPDQETIDKFKGTRLVLANGKDLLKEFPAEN KREDHIYVVDPMGNLMMRYPRNADPRKMVGDLKRLLKLSHLEH >NE1641 hypothetical protein MNKPSSARLIILAFLVSMLTNTFGWSFNGKVFTHELAHHHYRELFLMYPD AHLELHHALDDSVDLDAATHLCLHAAGQFQPFYLPASLQINTADVREMTP EIADSSFPETIPDRLYHPPRLLS >NE0707 hypothetical protein MARRALALAVGVLLLTSVWSIVVRLLYVLPTTAIERLDDTRFELQRLQTL AAENSNLTTDDFTRIEQSISTLVFPSSNDNAAFVDAVNMLIHDSDVQLLE LRTADPFNDGNLTRFALDVRINAPEEKLVHLLKSLERHRPLLIIDRAVVL ATAAASDGTSPPLSVELRIWAFAAEY >NE1360 putative similar to abortive phage resistance protein MADRRRKLKRPMGERRYKKLFFIAAEGVKTEPIYFGIFTDETSIVHVSYL KGKHDSSPPQVLKRMTDHLKNKELKSYDEAWLVVDKDQWTDEQLTQLYQW SLQQENYGFALSNPKFEYWLLLHFEDGVGIKSSHDCTDRLKRWIQDYDKG INMRKISQEQINDAISRAKKRDHPPCKDWPRTLGQTTIYRLIENILKSSK GFVK >NE1693 putative transmembrane protein MNGYETLIATLALTMGSSWASGINLYAALLILGLGGATGNIALPNELAVL ENPFVIGAAAVMYLIQFFADKIPGVDSIWDAAHTFVRIPAGAMLAAGAVG DVSPALEIAAGILGGGTAATSHATKTGTRLMINTSPEPVTNWTASISEDL MVIAGLWTALNHPILFIILFIGFIGLAIWLLPKLWTLIRGLLMKMARFLR ITSPPVSTGDSVGQEEK >NE1973 possible proline rich signal peptide protein MLLSPATLAAEKGHIEIESFHVRKSGESFQIDVEANIDLSRTMKQALKKG VDLYFVTRLLIMKPRWYWLDEEVARSKERIELSYQALTRQYRLTQHGQPR NFPTLKAALQALGHQPDMLIRENQPLLPDTTYTAILQIWLDISRLSKPFQ LEWLDTEDWSLSSQRKIWQIKFPPASDAGNESGLH >NE1598 hypothetical protein MNTINANDLKTRGIAAIEAQLEEQPEAIIAVRGKDRYVVMQLEHYYYLRE CELTAALAETRADLAAGRCEQESPEAHLARLDTLK >NE1571 hypothetical protein MAATATSASTLIIPVENQVRELDAKLLLACVAAERGFPVIIGSRAFVHFE IASLPRGIYLAKSMRSLSNSMFRIIRMLGHEIVAWEEEALVHPPADTYYT LRLSPTTIRNVSHIFAWGQENVDLLQHYPQFPENLPVHLTGNPRGDILRP EMRAYFAAEVERLRNLYGDFILINTNFTDVNPFIPNIGLFIPAKDGDKKS RRGQAGIGMSEEFAEGLWHHKKAILEDFRQLIPALEQVFPDVTIVVRPHP SENFQVYHDIAARCQRVKVTNEGNVIPWLLASKTMVHNGCTTGLEAYALG VPAISYLATFNEYYDYDFQGLPTRLSYQSFNFSELQDTLSRILNGDLGAP GGEERKTLIDYYLAAQNDRLACERIVDVLEESGYSQSQPPARATPVYLAG WALANLKATLTQLNMRRPGPNRLSYHDHRFPEIPVEQIEQKIARFGNLLN RFDSIKVKQHSRHLFRINSSL >NE1076 putative CcdB-like protein 
MARFDVYVNPGSHAATTPYLLDVQSDLLDVLDSCMVIPLRSLEHFPKVKL PGRLTPVVTIKGQDFLLETPKMGAIPRRLLTMPVLSLRDMQPEITSALDF LFHGY >NE1561 hypothetical protein MRLLIVFLLLLPLPSLALPQCGSEAILQAQKLLSFHVDGDDRAHVDPKAI ALPSIRNPANRKQKFLVLEVDGTVYKSKYRMRLIYYPLGSECVLMGQEIL ELASL >NE0888 hypothetical protein METSPVFIDYVTIRQEYFGGGLPVLNDGKVLKVDADGEIEYSTDVRCIIE GSYDSRVQVRCDGNTVEFTGNISRYGRRDNLFGYDWPTTIARINDLLSLL GLPPFTSGKLYKFADTGWSWSGARVSRIDLTCNYSTGSKESMHAVLCHMA GQHVGRQKGSLSPDSGTVEYGRGSKYVYGKLYAKYQELEKHRSKKSGSHV SDDVIDYCKTEGILREEFTLKSRFLLQNNLAYLGAITQDNLNQIYADRTQ LQRLEDMKYENFNDLPKHLRSTYASWKLGLPLDISRATRYRHRTELLAYG VDISIPNNVHHLPSRVRVVELKPLTAPDWYIQNYG >NE1791 hypothetical protein MEDLLNPSWNNEERNGMISKECFLHLQKKVNPASHWYLTEDKIAFHHHCI KNNLSVPELVAVFDPNGQSYWENGQGIETKNNLLDGLARYPFDIIMKPVY GYHGKGVSALDFVDGVHRFTTDLSLSLRDVFKKILAENPDRYILQKRLYS HQAIAEFTGNTVLQSLRLITCLDENGQPKLIIRKIKFPKQGNLIDNFSWG ISNGRLCLIDEYGKIESFIKYDHIKKYLVRYDYIEDISGKKTEFTIPFWN QCVVLVLNAQKAFAPLRTIGWDVAVTNEGPFLIEGNVFWDPLTPQEGSMQ AICQLLMALNAPLVN >NE1965 hypothetical protein MEDTAPITTLLYSILALPIAFMILYWTKVRKDRRRNETGEIEYKSLGHAL TFFIIEGLALVGSLSILITAVSGIVRYLIYVYA >NE0052 hypothetical protein MKKEQEMLRVNVVMITVALLAGCTMAKRPLPGPAQPAPDPVVQQRPSGPL PPSTRPAYNLAGYPKAAQEGYVDGCETAKQSAYGFKDKKRYAADTQYQMG WNDGFSICRGKHQQN >NE2146 hypothetical protein MYDDFRKIIIRRIVYRGQTGPEGDPLKTPPANKWLFYLTLIPAVVVVVVL GAFFFSIVLALFVAVAGVIGARFWWLRRKFRKSMSAAAEQKNSMIEDAEI IEIRENDKSDRGHH >NE2535 hypothetical protein MNDAENLTKLLGHLPPAVFREFMADEFSLAMPDLDTKKTKKEQREQMEVA LSALGVSERQRIEEVAERIVLLSDGAGQDVIDGFKDDIFDDAAREAFAAI PNQYQRALWLHVNEPVIFEEALNARQADVFRQSASCLAVLDDAAAKTAFH QTVAQQLGCSDDAVAIQIFKRLRPDTQTGEDVDLYQISIHHNRPPEIIDC VQASELVPQEVIRAVSSHITYEPANGHLEVLSKDTDGREALARIVADSLL QSPITGEKIPLKQYDYQSLAAPRNFDIASEPVTSVKVVELGYSAANGRSL LVKTWTKDADDIYTAARSLINPTFDFRDHHLNYAKLSIKLKKVGKDRARA ITVILRDDNKCNIKTKREKDQALWAGVLNFWFIGRVFDRVRQ >NE1162 hypothetical protein MDLAQKSDAEILAVATPIMDNLMDASTAIDYERHTRDFTERARSVLSEES LQSICEHYQSTKGFFAKREFVAAFRRPDSVAIVWRQQFTKQPGEFVAELI LVQQGGKYLVDHVMVF >NE1472 possible transmembrane protein 
MEARYVRGRQGLQWILSGFYFFRMAPLNWILLCFTYLLIGITLGLIPLLG SFIGILTVPVFVAGIMVGCRKLDLSGKLELEYLFYGFKKYTVPLITIGGV YLIGDILITGIFMLLGGDAVVDMWLHGKRFSENELPGVMDDLLFASLLCL LLAIPLMMSIWFAPMLVVFENMPPLIAIRKSFFACLKNLFAFQIYMAILF VLGMLAAMLYGLGFIIWFPVAFASVYVSYKDIFHYEQDEDTQPKSDEPST EENKEDSSQTNEH >NE0367 hypothetical protein MSADNLLLDFTSPGAIPPDPAEVVRRVVDETQTTMRALESLLENERIEDM TGWRLLAMFYLATDRLNDLAKIEKQYKSITGVSLSADLKQKYPQWFNGEA VSHPVVFEIPKKITAAALPDSIIIQRGQCSPGGILLDFSQVQEIDNDGLK KLAQLFSSLAQENTRPKLRQADRFITCLQNKAETGTGTRAIWDVLFAYER FRDDREAFEEKAIKFAVLYGISPPSWE >NE1551 hypothetical protein MRTKYILELFDTSEQHKPIARFESSTPFTAASVGERFDDIGWERLDGAGK IASPLSPKRYTVHSAKHLVIVEAGALVIKYCLNLEPFSGPSSPVWGDE >NE2512 hypothetical protein MTKLALFVRLEAKPGQEAALADFLASALPLANAESGTTAWFALKFGPSTF GVFDAFADEAGRQAHLNGQIAAALMANAATLLSSPPNIEKVELLAAKLPA >NE0726 hypothetical protein MLSLCRIIISLLLVGLSGQAAADISNSPNPYDAGYGFDTPDEAGWGGWMR GGASTLYAEWDTISDASYGGSGDRTAAPDIGTHNVADAYLSWNPGVFVTS TGNLITPSVVQEFFIRISPVSLFSGPLVVALQVEMWGDEPAAPLLNGLAA SSWTRTFTGTSVTDHDLNQYLGLWYFANTVNHFEFDLTNQPFISLAQVAV DIAQVSEPYMLAIMLTGLILIGSMTRYRSRPI >NE2174 hypothetical protein MNGIDWLLDTNFILGLLKSNPETLSMISNQQIDTRRCGYSAITRMELLGF PGLTAEEEILISGKLACLQYLPLTKEIEDMVIGLRRSHRVKLPDAIIAAS ALTCNAQTDP >NE2545 hypothetical protein MGQKMKNAPVYFTIAQVRHNPVLRLGSYAPDIQDRMRKAGYPDFKKGIAM AFTLAPQLGDAPQTQPPVVEQVERLMFFSTDSTRGFIVEQNALSFHTTEY ETFEALADEFMRGLAIVHECVTLAHSERIGLRYLDAVVPPGGETGLAEYL APGVLGLSSRLPEDVTVSHSFSETHIQTAKCAVLARTIIQSGPLGFPMDL QPIGVKVADRFREINGVHAIVDTDASIEGRHPFNLELIKSQLQVLRDGVG IAFDATVTPIAVSAWNS >NE0583 hypothetical protein MKHRTWIWLYLITLPSLTNMVHAKETLPDNQNGQTTAVDSGSAASKEVII QTAANQQTESTKPTFIAGNAPSSFYQRARSYSTHPESDPPRYVRTLSKTG IDAFKNLYWLDVGLDYRVRYEHRHNDIRRSRITTDDPVLLRTRAYLGIKE ILDPLRFVVEFEDARRYNGKFPKDNRDWNEFELIQTYGELYFKDALGRDD LGNYRPVRIRGGRMAWETLDRRLLGSNQWRNTTNNFEGFRVTLGQESNDW EFDAWGVQPVIRLIDKFDRRDKGQWFYGAIGHFRQWSKIITIQPYYMGLI QDDDGGTRVKREIHSPAIRAYGVVPNTEVDFDLGAIYQFGRDGGQKKSAH AYLLEFGYTFQQAAWKPRVSAFYGYVSGDRDPNDRTNNRFERFFGFARPW SADDYIIMENIQAPKIKVEFQPHPDLQIDGGYNGFWLASKTDRFNNLLNG 
SGNNRDRSGNSGSFIGHSADIRARYKLTPHISTTLGYSHWFNGGFIKNQQ LAELGETTAGTDFFYVEVAISAFK >NE1559 hypothetical protein MFKPVVQEEVTGCGIASVANILGKTYSEMKTIANAMGIHASDQSLWSDTQ YVRRMLSGAGVETSEDEVPFESWDALPDLALLSIKHHQEEGKAFWHWVVF KRMDGQSFVLDSASYLPSNIRQDFDAMQPKWFIEVKNA >NE0918 hypothetical protein MNEEEETGRIIARLLDRSLNDVTPGTLYRLQAARRAALEHYQPAEKVLHA GVGISAQSGYHWLSAHAGRLLLTASLLLFLAIHSYWQMNNRVDDTILTPV ILTNDPPIGSQEIEDTANGYEAADEDIVEETDSREDTDHGNYGGEADTSE TESSTNGTADSDDVTRSFDSTEIQETENTAEAPYTTNYDQDSVTEEDTGV ISDHLQNSEDIIDSYDTENTQDSTATIDE >NE0798 hypothetical protein MDKFIEQVSLYIQEAPVWPFTLLGFILVVGVAVDIINRRRRTAAVEYFDL AFQEELTGLYPAATRWPDDLAAYMQPRLPILRDAFEVLRNFIPQNQLREY NAAWNRFYQFSRTGGNERPVSLEDAAQELAVNQPDLQQQQAFQQMISDLL AFATQFKK >NE1916 hypothetical protein MEITCYRDPEIRREKRTLPATTYNLAIKLLARCETKQLFIPIRSMQYMAI VDAEEFVFVDSQRKCWIDIAWQNFHSHEREALNQPIEYDAVFYREDQTDI MQRLQIEFPLALSAMMAKQAPHKLAKVISFRQKPPAENPKQ >NE0571 conserved hypothetical protein MFKRVGIFVALVTLSLIFNQPATAGRTVEDFQVWGNITALGNFGFVNPGN PDLKKFRWWMEGQGRFGNDSSQFTQAIIRPGLGYAITDKIIIWAGYAWIP SDEPLVPKSGLPFDEHRIWQQVTWADEFSFGKLSLRSRFEQRFFDHNAPV SGSDDVAYRFRQLVKLAIPVAMIDPNLTFIIQNELFIGLNTVSNPGFISR GFDQNRAFVGLGYKVHQNATVELGYMNQFIDRRHNPRPDQMMHNFAVNLF LNF >NE2224 hypothetical protein MKRNALIHSLQTHISDLLAIYAFGSRIQGTARLDSDLDLAVLVAGYTDPL ILFEVANELADVAGYAVDLLDLRAASTVMQYQIITTGKRWWTLDMQAALF EAFILSEKTALDVARAGLLADIRQRGTVYGR >NE0369 hypothetical protein MNLKKWTDEELVSTRDQIEAWCAKYAQSVWDGRKGYLTGLLGVFGISTGV VFLMFDGIEVVSFVPILLGVIVCFTWWKTKQQHKKNNGFLEEIKEEIARR AKKMEKIEKNKPQSNHAVLP >NE2243 hypothetical protein MAPASFSQITGQSVGKIGVLFLLLMVLWSANLCNSSDITPLSANRANVGQ MVQAFTFDDLFKEDGPSSDLPQRVFRQTSSWRGFSQLEFAETIASPKHAS KLRLRSELSNLGQLSPNVKWKLSARIDYDAIYDLSDFYSRQVRRDQRFEL FLRENYLDFSIADFDVRVGRQHIVWGEMVGLFFADVVSAKDMREFVLPDF DILRIPQWAVRTEYSKNDFHADLIWIPFASLDEIGRPGADFYPFKLPVAA PVSFLKEDRSGRNVAHSNYGIRLSQLTNGWDVSAFYYHSLDATPTFHRIS QPWEPLLFQARHGEIDQAGGTVTKDLGSAVLKGEFVYTHGRRFNVTRPTA ADGLVRQDTIDYALGLDFTLPSDIRLNLQFFQRAYLNYDRDIFQDRLENG GSIFLQGDLWRDFQGQILLIHSFNRNEWMLRPRLTWNFARNWKLAAGADI 
FNGPPTGLFGRFDSSDRVYTELRFSF >NE1344 hypothetical protein MLLLLTFIESPIAGIPEIRKHNCLVLPHIFVCLVITKSFGRVVTSGAAVT KRAAGYFGYLAHCAADETTWYACEQFVIGHARKAVTHFIYLA >NE1750 putative pre-pilin leader sequence MLNIYRRSAFIYTQAGVSMIEVLVSIIILSIGLLGMAGLQTAGLKSNHSA SFRSTASMMAYNILDSMRANRVVAGAGGYNHSLSEEDASETETKVEAEAE IPEDIKNWLKELALRLPEGLGSIDVDADNKVTVLIQWDDSRGAATAQQFV MTTRL >NE1907 hypothetical protein MNKKRMVTMFVLVGACAGLSACATTGEVEALKSRVDALESNVSATKSDAA AARAAANDAVNIANQAMDKANEANARSIDTETKIDRMFKRAMHK >NE0109 hypothetical protein MDILSLVAGFLAGALTIYLASYFMKPSENKTQKTVDAGTINLFDQLWQTH ERLLNEMKQDVENPDFKFHREFYVLKKGWGWERWGFHRRGPCIAYFLEDH SDLLPLLDSLTSYGLISQTGETGKNTARFQLSEKLVELLRGKNTNKS >NE1363 conserved hypothetical protein MRFHHAHGSLFPRKILICIKARINPDLVMNGIPLCPKLYHIVHVDRLSSI LKDGFLWCDVHMAQHIPVGTTIGMNNIKQRRLQNCLNSYPDLHVGDCVPF YFCPRSVMLYLIYRQNTELDYKGGQGPIIHLEADLNAVTTWAKTQSARWV FTLTNAGSFYFEDRNDLTCLKEVNWTAVHALNWKEHKEGKQAEFLIEQCF PWNLVERIGVQSEVIYNHVVNALPVNGHRPKVEIKPEWYY >NE0083 Proline-rich region MRHFSRLIVLSAAVSLVACVNIPLGPSVMVLPGVGKNFDQFRGDDYLCRQ FANQQTNYETPKNSAVSSGMESAALGAALGAAAGAALGGGRGAAVGAGMG LLGGGLAGSGTAQSSGSISQERYDIAYIQCMYANGHRVPVSAGLIEGSGG NMGQGVTSNPSSSGRYIPPPPPGHPPPPPPY >NE1042 hypothetical protein MVIKWILLITLVFLIFWFFKQFRQIQRKPPDTTRKVIEDMVRCAYCDVHL PKSESIVEHGRYFCCTKHRQLYSQSQPDDK >NE2429 hypothetical protein MDEIIKQSPVLWVLSAVVTGFIAGIAAYIGLLKITNQETIIKGTYEPKKN LVGRVLKNEVLIECGKLIELAGRIDGATMPDKVEAYMTQTLIFLEGLDLP KVQQYHQLKMSWPAYTIQLLLVNDKLSSSQKLGRA >NE0740 hypothetical protein MDETRRDFIKGMFASGTFLALGVPGIARAASVGPLFDSTRNCRLLLGNTT GAESFAKGVQSACFNHGSRHHGALPVFRFESELSTGFLHLVDLLMQSRNT RWIAVMDHADAAIFTELIRNSAAHLLASGSHTFAAGDHAALPLRHVWAAA SPAYSTGGLLASMLAREQYSFSIVERFLTQSAGESIENAALSLPEFLPYH RADQPVTRLYCAGVPLPEAGRLLGWETSKNQESLFSRTIASTASRNETAG STTVEYPQSGDWVEATGYAVAAAALGMKINRESCSERAFVYRSGQGHPDH KGLSGVNFASFVIDV >NE0256 conserved hypothetical protein MEFNEYELQRLFGHEAAEDEDPQRLKDYYFKSKVYSQVVNDLPLRIIVGH KGIGKSALFQVAIDEETENKRLTVLIKPDDIIGIGEDTDDFLKLIRDWKI GINAIITQKALTSFGMLFEGWRGKLNQYGGTALDFLSSTLKLEGKVSLTA 
SKEAILRDFLKNNKISVYIDDLDRGWQGRKHDIQRISALLNAVRDISTEN RGIYFRVSLRSDVYYLARTSDESTDKTEGSVIWYSWTNHEILVLLVKRIE SYFGREVDEAELLKKHQLELMRYLAPIIEEKFTGKGHWRDAPTYRVLMSL IRKRPRDLVKLLTLAGREARTKDAERITTNHLENIFEEYSQGRLQDTINE YRSELPEIEKLILGMRPTKIQRKASQGYVYTTDQLLKKIKAIEEQGKYRW ANRNQVDTKELAAFLYKINFITARKQIPTGIDRKYFEENRYLSNKFIEFG YDWEVHPAYRWALQPEEPMQIFNELELSSS >NE0495 hypothetical protein MSLFSPEQLKKASHIKRLSPVLFINTYLAHCKISQRANRANPAEVQTASP VELDCLNFHGLRESQRDMAIQSLYRLSLWIAALRTPKTQQIQPPQSLRTL QG >NE0508 hypothetical protein MNRALIAVVLLMVMGSVSGMRINPDPRVVVSFMDKSIAPELDILRVMADI SPDNHHLVFQVKTRGERIQGNDHDYLLLHITHGKTYVLLLPINKEKENQM LVYERLPQPDDDDLLILGKFKGNSHLTNFNITSIFRGGEFSVPLDWIDFN TNFSFDAYTVQARIKGDTLKISKVYDWARKGKTHNNEKPLSAITLLNKIC APKSNNQRL >NE0117 possible A. fulgidus predicted coding region AF1859 MSTHHMAEFSSPTLSLGRYRLDWQVTRSIRLPDYAGSMLRGAFGHALRSI GCITREKDCTTCPLRRDCPYTILFEPVPPEHHPLQDFSRIPVPYIIEPPE WGTRVLHPGDTLSFHFTLIGCALQELPIAILAWRRALARGIGPDDGTAEL TSIVLEQPDSSIPVYTPENGQIEPHETRLACPPPPAATIRLHITTPLRLQ NNGVPLKAQTVTERALLMALVRRFALISEFHGEAAWQPDFRHLGELTSSV TGKRQLSWRDWQRYSSRQKQKMALGGLTGRWDLHGELAAFWPALWFGQWL HAGKNASFGLGRYRIIAA >NE2160 hypothetical protein MVKRVLMIAYHFPPLHGSSGMQRTLRFARYLPDHGWEPIILAPSPRAYQQ IDSGQLADIPQQVRIHRAFALDTARHLKVMGRYPRVLALPDRWVSWWLGA VPAGWYLIKKYKPDVIWSTYPIATAHLIGLTLQRLTGIPWMADFRDPMVQ PDYPVAQWHNLLIRTIVSLILYNRRLQKWCLLTDR >NE2060 possible (AF047705) unknown [Nitrosococcus oceani] MTSIKWLGGVLLGMVCSIQVQAHGGLSLAEDMCKLTIGPYTMHFTGYQPE STQEKEFCEDIPNIGRTIVALDYIDEALRTMTTEVRIIRDTGAEPGSEGN LDELTVFHSPPKVYMNGSVTFEHDFPAEGKFVGLVTIRDNGTEHISRFPF AVGTGGKPDMLYILGALALAAGAGIFFFKKKQNP >NE0551 putative yacA [Plasmid ColIb-P9] MSESTFTFRVDEDLKTEFSAAAKDCDRSGAQLLRDYMREFVKTRREVAEH DAWFRKQVQIGLDSANTGNLVPGDEVEAEFAARRAATRRRLKASE >NE1307 conserved hypothetical protein MNSHRGQMMSKDATALLHVTCVFRHNVACNASGGLHMGTTHVNARVKKHR DTLRMAGLRPVQIWVPDTRRPDFAEECRRQCLLIAQADKADTSMQQFMDE ALADSDGWTE >NE0289 Helix-turn-helix protein, CopG family MSQITLYLDDEIQALIEQRAKASGLSKSRWVAEFITKYATQEWPQDCLEL AGRFADFPLREEANPLPV >NE0920 conserved 
hypothetical protein MNLSDCKLSRNAFGKLIVTTKDQIHEGVVPVRSFPITAINEGIALVDGHG HEVTWIDSLAELSETERILIEQELASREFMPEIKCIDRVSSFATPSTWQV QTDRGETCFILKGEEDIRRLSLATLLITDSYGIHFLIRDRSMLDRHSNKL LDRFL >NE1699 conserved hypothetical protein MLKFNLAVLISMLLISFFPQAYAVGWEKIEIPDNVSTILDDDGRYKDIKT GCAFSHLPDEAGLPNKPFHFYYRKGTKAKTLIYFNGGGACWNGATCLTSL TVPVTQTTRPAYNPSIENENNPEELGGILDFTRADNPLKDWNMVFIPSCT GDAHLGSKNEVYVDPSGIINHGDAVLVQHRGFDNFMAVREWLKHRADRPG TEQVLVAGSSAGAYGALMNFPRLHSIYPDKTKISLLSDAGTGVFTSNFLN TVFEPDGPWGTEHTLATWIPGINRIGSYNALNFFTSLATGIERHFVNSKF AYVTTAWDDVQMLFLNIMRKTGQGVNDPNQWFNLTPVTAVEWSLRMLTTL HANALINRNSKYYISAGTYHIGLVDAFAPGVFYTEKSAGGIYLKDWVNRL VTDDRNYPLINLMCSGTCGAPFPP >NE0879 Esterase/lipase/thioesterase family active site:Lipase (Class 3) MKTAFSRLFLVMLTAILLSACTSNEIYRSNFSNCIVTAQESCESHAIQLH DKGTEREYLLGFVEIDDQGQLRNRVQMQALLNELYTLASKESLLINVFVH GWHHNAKQGDANVESFKLNLAELSKVESHLHQDRTPRKVVGIYVGWRGES IDIPWINNVTFWDRKNTAHEVGYLGMAELLLRLEEIRNIKNTQEPPVKSR LVILGHSFGGAAVYSAAAQILADRFINSAGDKNYVDNAEGFGDLVVLLNP AFEALKYAPLYDLAQARCSYFQDQPPRLVILTSEADFATKYAFPAGRVFS TFFETHSTIKRNDCNRPLSYSEGAADRQAVGHFEFLQSHELHPASKSMAA VYHQAKAIWKNQQPGEAIQFGSTELRNLEHTVIHNPYLNVKVDKRLIENH NDVFRPEIMEFIRMLIVLSTEE >NE0086 hypothetical protein MKLLLSIALFFWVGMVAHAENLPERIDIEYVLNGSIGQGKAHEILRVRQE NGVQHYTIDSEASASGILKLIKRGSIHRHSEGTIIPHTGMKPFRFTDQRG EKPAREVEFDWSEQRIIYRRKGQEMTENLPSGTLDELSLAYHFMFTAPPR QTLVVHETDYHTLQTTRYTVTREMLDTPIGKLATIVLTKQREQNDPFKKK IWLATDHHLLPVRIISTEKHGLEVDQIVTKINYSPLVNSAR >NE0710 hypothetical protein MCISNSDWISLGSAIATLLGVGVALYASWQQMKKMNNQLVIQQFSDYTKR YQEIILHFPENINEQTFDFSKDTDKNKTMRYMRAYFDLSFEEWHLNQRKL IDAKTWTVWEGGIKTALSKTAFINAWLEIKKDTGYGQEFEQFINASLPTN QNLTNHSSGTPNGAP >NE2175 hypothetical protein MERGMTMSTIERLYKLSSTLPPAALAELLDFAEFLHQKNMLPQPDEPFRL IDMAGGLEHSACFAGEPLAVQEALRREWD >NE1157 hypothetical protein MTVNSKPAIIMNQTQLKIRIHLDQSADHPKIEQNLPELPEESVFPPLPPV YEYNWPRIIGAGLVLLFVLITSIWIAADWLSDDEKLETSSTEISLSAVSP ASSETPSTEPVPLLPSVGFSSDQPSENNPAIGNVPDGDIGSDQAVQAPDP QPQRAAPSASAPPVKPGIKPDITIPQAKTKNVSQGSNHSSGLIKAQLTSN 
IRQRQPVDNINQISLGSKSSRPIFLFLHLNKFRGEKILINWYYRDQSIAR IVLPVGNNDWRTYSSKVLNRNRLGPWHVTASDQAGNLLAEFKFRVTR >NE0706 hypothetical protein MAKPMFNRQTTILLVLNTCVISAFVLIHWIVLDHPDDLSTKSRNVPSETT PLSIPMETLPNALEQTLLFSSSRTRTVISSPESTVPLDTAPPRLVGIVEE EGHKRFALLEDETATSRKLVAQGDTFETWMVVSVTSDAIHLRSRSDINDG VHPSSDIELRLRPSVPPSQNFNP >NE0574 major membrane protein I MTDIHQAQTALGDVAARTLANATKTIPMMGTITPRWLTHLLQWVPVEAGI YRVNKVKDPDEVEVDCSNKDERELPATYVDYEEWGREYVLSAVNTVLDVH TRVADLYSSPHDQTREQLRLIIETIKERQEKELVNNKEYGLLNNVARNMK IKARTGAPTPDDLDDLISLVWKEPAFFLAHPKAIAAFGRECTRRGVPPPT VSMFGSQFLTWRGLPLIPCDKIGITGGKTSILLLRTGESRQGVVGLYQPG LQGEQGMGLSLRFMGINHKAIASYLVSLYCSLAVLTEDALGVLENVEVGK YHEYK >NE1303 hypothetical protein MWQTICRGAIAGGVLLLTVQPVSAAEPDPAQVEKTVQAYIGKANQEAAQN RESVEESQVVTADLNGDGRAEIILWSTRYGGTYSFNDVTIFTDSGRGYQV AAGTEDVLGMVESIEVKNGLIHIHALWPGPNDPRCCPTVKKTAVYQWQGK ALADVTSRVSGKK >NE0496 hypothetical protein MVFPAKSRRFIAPGLRYRQHATNLIGADGGFIAKFSRFGLMLSGRYRQWN ARNIEALSGLRRN >NE0232 hypothetical protein MTNALIPDPVSAGEVMVTDEGHVEKFLNGIFTCEVPAVAVPDYLRKNTVI GRFAHDVAAQAEMPFGTVFVTILGAASVPAACSFTTRFESGYELPAGLFT VCEQPPASGKTRVLNYGLHAYQAAIRDLNKQIHEHNKADKQNQKPYFIDL ITDGTAAAIDSKLAESKSGRLPLASSEQGLFRSLFPAEGGFHSNNDLLLT GWDGGWVSGARSTRNAFTGRVSTQVVMFAQNGSIRRVLQASNGSGLTERF LFVAERSLLGRRKFEPHTVDASQYDKAATRCVERMAAEKPPIIIEPCKDG RAYIRQQRIAHEAELGKLERAGEAVMVGWLGKFENHVLKIASVIHSFEFM QNEEIDFSYPVQIPLATVEAACELVMSLHEHMRAVIDAAGESGLQTATDS VLSILRENKAPMAVSAVTAKARRRKPFADMGRDDYKASKAFIDTLIVKGI LLKSHNKLSVAE >NE0900 hypothetical protein MISKPGRIVGLGIILSGLSACATYQPVLYPNSYYQSVGKVAAERDIRECR QLAESAGAREGSGSTGNTARRTAIGAGAGAASGAVGGAIAGAAGRGAMVG AASGATWGLLSGLLGSGSASQPAPAYMNFVNRCLREKGYEVTGWQ >NE0602 hypothetical protein MEKELEDRKLAELMNSIRHYSTLRFAMLTVYFAVTGGLLVKFFDCDFSVR YPELHGLFQIAGSMVTVAFFIFEVALDDNLRKLWGSVKKLAGEGDVLLSH RQLWKGCLVPMATYGIFVGVLIFWLFTSRNYYPCQAAAHKAVQSETVISK ECRK >NE2247 hypothetical protein MIKRFNLTICGLGLILVTSHFFPAGAQELILYVDTATKQVYTEPGKNRIK LGTFQQVKESPVQSQSKPDTESSQPVSNTTTGLAQGEAKFQENSGQSGAE SDIRRKSEEIAAHSNEPSAEKPKEEKKWYDRIGIRGYTQFRYSSTVSGDK 
DAVSYWPDKSVGEDGSFLIRRARLVIFGDINDHLSLYIQPDFASTPSGSS TGHFAQLRDAYADIHFDKNKEFRVRVGQSKIPYSFENLQSSQNRLALDRN DAMNSCCRDERDIGAFFYWAPTHIRDRFKEVMAKNLKGSGDYGMFAFGIY NGQGANRLEGNNGVHMVTRFTYPHQFSNGQILEAGIQALRGRFVPSTGPA GGFTPVMDAPEKGFKDERVGVHAVLYPQPFGLQAEWNWGRGPQLNDGQTM LTESSLNGGYVLATYKIDGLRWGTLFPFVKWQHFKGGQKFERNAPRNHVN DLEFGLEWQVMKEIELTAVYHMMNRTNVASAPYERYKADVLRFQLQWNY >NE1441 hypothetical protein MTMTKSYYTPILLVAAGFVLTGCVTINIYFPAAAAEKVADKIIEEVWQTD GNSGKNDRSGNKPGNDVSDKTDSETGKTKP >NE1550 hypothetical protein MLALFKGDYATNEPDYRTSSVVYITLGFSMFRPPRCYRKNFIVPLYRLKK ATPPVIWVVMFLLCCFMLPFGIAIKIMHPGFFATLRAGLEIAAENQLYGA VFVTTFLVLGSFLMFIVNGAFVSLRVAAGYEVRRNGTILKWLINKVRRAY KILNKPND >NE2142 hypothetical protein MSDRVIRMNQQSETDAQFDPDLPTANAVMASLCCVAAQYASRPSTELAKL ALDLAYKLTAPQYAESELITEVAQQLVRQWKQVLYQQVQARAAGMIIPGN RFIN >NE1601 putative transmembrane protein MVTMSCRQERGYIYIWMLFAVMLAGVMLAAAGLIWQTEVKREKELELLFA GDQFRRAIESYYNDSQVSGRAGEAGASRYPASLEQLLKDERSLVVKRHLR RVYPDPMTNSYNWGLVRQQDGGITGVYSLSTGVPIKRANFPADYIAFEKA GNYQGWKFVHAASTAGGQEKQQAEGRGNIQGDTSMPGLPGTGFNPLPQNQ PAPNLSPPTGNDAF >NE1883 hypothetical protein MKPRYVVDTNVLITASVADPVAPKDIDATPQDPALRFRVWQWLVEFESSP ARLVLDSAGKIKEEYDRKLGFNDYGIQVVIHKWSMAAVDNVDVQYDTDGH AVLPPPLDSVVHDLADRKMVAAALEAQKCHGESAIAFAGDTDWHDWEQTL IQAGLSIEPIIEGWSRAKHAEKAHRKQNHD >NE0237 hypothetical protein MITNFRFIHPPYKQLYAYVERIVMPLDIEKYRKYLAPLNLGKDHEEEIIR HIYMIMDEFISAAFNKHPVQQALQAKNRKTLQGQSDVIDSKDRSIQSLYQ NVASRPDE >NE1014 putative transmembrane protein MQMGKENEKVTSALQNSDMEKASMYQVAEAVLFSFIGIRKKSDLEHDAAK IKPVQIIIGGLVGGVIFILSILSVVKLVTG >NE1248 hypothetical protein MARLIEDDHSNLIGLAGVLPRSWLICSASASQSTLLLRGRWMLAENRISD GSQVLLSAPGYNGFQLRHYACMAARIRITRRTTSGTLRMVMDGIVFILPQ YMQMQRYISMKSIAAILIYEQQLLTRAHEQLQQDLYRTWQIR >NE0938 hypothetical protein MINRSRLRRLIVISLVSYTVVLSLTVSLHGYLVNEYIEELIWESMLESEM AYIKRKIAQDPEYDWSGLDRFHWYDEHRDSSIPPQFQALPAGMHDEVRID GSEFAILIEDGPEGRKILALDITDLENRELMIAFAIVASTVLLITVLTLL SFYSVDRLLRPLTRMADEISNLSPDGEGPKIPIGDKDAYV >NE1680 conserved hypothetical protein 
MQIHVYDTYVKAKDGHVMHFDVFTDVRDDKKAIEFAKQWLSSIGEEGATV TSEECRFCHSQKAPDEVIEAIKQNGYFIYKMEGCN >NE1380 hypothetical protein MKIINIRENFSRYPAGRYRADGPYNGEKFREELLVPALSEAIDKGEKVKV ELDGVRGYNSSFLEEAFGGLVRSGKFASTRDLSERFEFVSTDKSLIEEIR GYMEEATPAVAQ >NE2427 pyrimidine dimer DNA glycosylase MSHNSRPVRHHNMRLWSLHPKYLDPQGLVALWRESLLAKAVLRGETRGYT NHPQLERFKAHPQPHFAINFYLAAIHAEATERGYTFDSSKIGPVCSVQLI LVNSGQLSHEWNHLQHKLATRSPIVHARWSDLASPICHPLFHPQPGPVAS WERV >NE1189 hypothetical protein MSKLLFKLRNVPDDEAEEVRALLSAHQIDFYETSAGNWGISLPALWVRDE TQYSQARELLDVYQAERSAHVREEYARLKQEGKHKTVLDSFRENPFAFIA YLFIVYALLYLPYKIITGLSSQ >NE1537 BNR repeat MKKSTYSFPTLAGFFLTLASVVAIAAGPGTGDPTQQPSVMQASKSPKTAL AVGVTLDQEGQLWLAKVIDQRLLVSRSEDDGKHFSESVTVTPVPENIGND GENRPKIQVARDGTVLVTWTELLAEKYAGNIKFSRSTDSGRTFSEPIVLN DDGRVTSHRFDSLAIDGKGRVVVAWLDARDRDAAREKGEEFKGVSLYSSQ SFDNGAHFEPNRQIHQHTCECCRTALTWTSEGPVVLLRNIFGTNTRDFAV ASLDKVEEGVRRVTRDEWQIDACPHNGGSLATDGRGQLHLTWFTSGTAAQ GQFYKRISGNQESEPMALGDMDAQPNHAAVVAHGETVILTWREFDGNVYS AKMMFSNDGGETWSEPWRLMLSAGANDYPVPLISDSKALVVWNTENEGLR VLSVERVINRSDG >NE2001 conserved hypothetical protein MKLKSFLNYFSSREKGLAVSILLPTHRTFPDNKQDAIMLKNLVTEARNRL QSWPDTQEAETIMEAIDKKISTHDHNYNLDGLGIFANREGVTLINFPFTV KEQMIVDEAFAIRDLIREINGAVHYRVLVISRTDARLIEGFNSHLVHEFD ARTELRTGSFPMENPFLFAAKGLDRAQIPNEAANLKEFFNRVDKSLQEIQ NKPEQERLPVIIAGDARNAAFFREVCDQPADIIGEVTSIPDLRIPAEKII VEVQGLVSDHRRQKAETALQYIAQARNNHQLLTDSSMIFRAIDAGNAARL FVRQGYIQPGIIDFDQKIVTLQEDSATVDAAGGTVTDDVVGTLIELVIRQ RGEVHFLSTDQLGKEAPLSLQTRY >NE0718 hypothetical protein MTKAFDAFERAIRDAEDLLARHDAEKTVPNGHNGEVLKRAGLVMALAAWE TYVKDRLQSEIDTWLQAVEGSPLGKFVRRRLEEDLKRFFNPNDERTRRIF IDYFDVDITKDWVWENYDSSTSKKVLDSLVAKRGDAAHKANTALHASAEP HKVKRDELEKGIRFLKGLVAATEKAKITK >NE2345 hypothetical protein MLLNRVIFQKQSDRLLFNASLSNTVLRILFFYCFPAVCLFVSSIAATRAG QPPEQINQYLRINGEFLGSYENWNYFRPASAVNNSYDLWVVRSRLGLMFS SDYADGFVQGQYSGLYGLPDDAVLPSGGALGLGAGYFLANRTTGASNVFL KQGYLNFKFNKLGLPGAAVKIGRFELADGMEYRSGVEKFDALKKKRIAER LVGGFNAIYVGRAFDGFSVVYDGPGFNATVSGVHPVQGNLTVQGQKQISD ISILYAALTSKKDAVLPGIEGRLYYLNYDDQRVSQVTDNRPLSARPQLSN 
EKLNIHTIGAHLLSLQPLGSGSFDALLMGAYQFGSWTNLSHRAWAFDAEV GYQWHKLPFKPWVRAVYYRSSGDGNAHDGRHQTFFSAVPSGRLYAKFPFY NQMNIQDIFFEFIAFPTGKTQINVNLHQLSLANVNDLLYTGLGASLKSGA FGYSGSTTHGHREIGQLIDVTLTHGFNKYLTSQLYFAHAFGGSAMKSIYP SKSDASIFLVNFNLVF >NE0480 hypothetical protein MQQVALISAVFAIANEVKLEAIQLVDCRGAGSPWPDSLAKTRCDALQQSF INRRHCETAQLVKQSSKILLLLLPKLKWHPDQQIFTVFANANGMKQSDTF VVITIAGSPRIIPGCRRLRLLAITSFAMTGKIRESCSCLQQK >NE0815 hypothetical protein MSDRVIDSIDDYLSPNSQIQIIVNRYRSLFTWFKIMSLPDRLSCLSIEHC GCLQLIAGEILIQAVSGVRSLERPFVIDIEHGRLQARGMVGVIRALFL >NE0372 hypothetical protein MYKRDTRFSMVFPALLAVVLFVPFVMNFLAHAWYGEKHFPPLTMMKLPAD SSVISLPQEIKQYKPFEITLQLNTRELARRINDIVKKSHPGTELQGIRSE VFPEMRARIAGDAFSIDPPEPQVQFFSGQGEMSRWSWIITPEKTGRHHLL IELHLQTAETTREHPQVADLAEIQLFVRENPEAWMRTHGIWYALFTLLAA GWWWKKRLIKKRKAAKEQ >NE0737 conserved hypothetical protein MTAELSCHLLVSPFTKSAVMFNPSRDQARRLFFDTWQKYHRKEPLSGMET IALEVILQHPEYHSMLQDVERYLDKDFPPELGETNPFLHMSMHVAIREQL AIDQPAGILQRFEQLKTRLQGDEHEAMHHVMECLAEMLWHSQRNQTAPDA GIYLECMDKRIGNK >NE1055 conserved hypothetical protein MIVRIVKRNTFIFLSVVLLSLLAVPATNIFTAPSRETIKWGEKSFLYNMD FISRWAALLLYPVGISTDSNQVIIGRDDWLFLGDLYEETRTIDRRPPSAA DYVSGQEIGSAIESWNRYLSSKGVKLFRIMIGPNKGTIYPENLPIWAKPS IPNATDALLVGANTIHYVDFRSILLKSKASHSVALYYKTDTHWNALGAGI AFQAFAQQVGKVVPEIQWPPQKIYKLNRVDSRVGGDLANFLRLTTYLPDL EPVTYISSLAVETTQLDYDTGFILRQGGNPQVNAPNKPLLVQSCGALNQK KVLWLRDSFGTVMSPFMAATFSEVLQLHWAEAMKPGGKFVQLVEEWEPDY VFFTVVERASRSPWFASYPPPVLVPLGSKFKPIQTTTAVGLNHLLQGTTT NEFQIIGNDPFFDFTVSEIIKPKEVDYLSISLSCADGSQSVPLQLFWLVD KQPYFDEEHSARFLFRTGENLIDLHTLPKWDSAKSITRVRVDIDTQDSCV HFKLGNPIFGVE >NE2530 hypothetical protein MDRIGIDTWANISAEGWSSLLLGNGASIAIHKEFAYPTLHGIADAKGLLA TTAPIFAKLGTTDFEHVLLACWYAEHVNGALGTPSAAISAAYEEVRTALI EAVHSVHPVHADVAADLQRVGAFASAFPTVVSLNYDITLYWAMLLFNAAN GSWFKDAFHDGEFQTDWEYLRRPYGHAAGATLVFYPHGSLAVARDYLGDE TKLSVGAGAAGDLLGTITRRWASGHYVPVFVSEGTSHQKVAAIRRSHYLT NVYEEVLPALGESLVVYGWSFDERDQHVLAAIAANPPKRMAVSVFTGQPD GDQQAFCLQVLKAVGRSLPGTEVTFFDSRSPGCWNNP >NE0485 conserved hypothetical protein 
MKRVNQNLYQLLRRIYWRLPLPEETKELLVGFARRFLRGMKKALAEPVTT ASSASVSREKILQEYANQILAIPRKSGNEYVEISSSSYQRKEGDAKILAY YLPQFHPTKENDMWWGKGVTEWNNVSRAMPQYVGHYQPRLPGELGYYDLR ILDNMRRQVELAQMYGIYGFCFYYYWFDGKRLLDKPLDMFLEAKTIDFPF SLCWANESWTRRFDGSCGEILVKQSETVESYIAFINSVVPYMRDSRYIRM NGKPIFTIYRPSFIPECASTITAWREHCVKAGIGDIYIIGIKEHTWDVNL IELGFDAQSEFHPGTLFKHCVDISSQINYMQDFGGIVLDYRDIVEHKKYF LYNHPKLHRAAMPMWDNSARRDNKGMIFEGASPDLYERWLTDILLEAKNR EDLEDHYIFINAWNEWGEGAYLEPDKKYGYAYLNATRQAIEGVRS >NE0099 hypothetical protein MCARIQQLPVFFIFILYGSCATTKRNIYVLAINKLMEQQIRKYESFNDLP ADCKKLFDSGEKDSFDLSRDWFLLLETTVIRQTKEICIFTLEIEGVTQGI WPTLLQKKGKLSLRQISSFTCFYSSLYQPLISSSLTVDKLADCLRWILSD TRTDVLRFDIMDPSQSSFNLHEQALKKIGFKTDRFFCWGNWYLPVNNQPF SVYLQNLSSRVRNTLERRKKKFLAGGHGKLEILTTHDKLPIAIQAWEKIY NASWKIPEPYPEFMPSLISLCAAKGWLRLGIAYYDEEPIAAQLWIVNQGR AAIYKLAYDEKFAHLSPGTILTAHLMQHAIDVDKVHEVDYLTGDDAYKKD WMSHRRERLGLVAYNLRSFWGLIGISKHIAGKIRKKILKSLK >NE0082 Proline-rich region MKRLNWLHLSLILAGMIASGVSWSYSHGHSHGYHNRSHGHYSGKRNFSLG VGLTSTFGSYGYYNYPGSNVGIYGSFGYGRSYPYSRRPYYRPYGYGYPAS RFYWPAYPPTVYYPPVVVVPPDPPVYIQQQPARLVPPPPESAVTNYWYYC ENPAGYYPEEVERCPGGWVKIPPRPAQ >NE2494 hypothetical protein MLIPYTYVPHQMEKMQAFIDFIFHEIWCKAPASGPFGLHLFNANAELREV MEAFYYSDAQGADFFYGHVERIYGLFSALTFVQISQFQQWYLGNNDLEKV CANAPAAQIVRYADIATTHQDLADQLASFFKGLYSQSLLGLATLRAKIGD IDDHYQAFVAANKMGKCPFCGIGDIKGEHHSKREAYDHYLPKALYPFNSI NFRNLAPACHECNSTYKLSKDPAHNAVGRRKAFNPYAAADHAIQIQVALP HADIDALTPADITMHFGPVELAEELETWKDVYGIEERYKAKFCAENDGKY WLTQVLDEWKEDGRDPADFMTTLVRQVQKNPYAECNFLRKPFLDACQQVG IFK >NE0942 possible (U92432) ORF4 [Nitrosospira sp. 
NpAV] MRRESLLKKHEGVVKSTGGGVVLKQSLYSIVTAFVFVILCSASTVWGHGR VSLEEDNCVRQVGENMVHLNTYQPQYDQAGHYCTEIPAAGDTYLVVDLID PALRNMPVSMKVFRGEEKGGEAILQVKADYHPDGVINGIGKLDKGLYSVM VTAEGVPPLNYYYQLRVEMVDYGKLVRTWAGPAVAILFLGWLMYKLVQSG RLRSWFKSQDD >NE0274 hypothetical protein MRKFSVFLLLANIFVVFYLHGRPDDNLPAQIALIHSEKIELLPAKVACLK WENLIGPVVQHVRVEISKWESGQDHITEISRGEVTVHWVHIPPLRNARET AKQIEQLKKSGISYLHIQENADSPWHNAISLAILPDDSDVAALVEELKGK GVERIMDSEQVLEQFEFDIRNPTEQITESVRQLAQQFPETKLEVTECSRL >NE2443 conserved hypothetical protein MKKLIFLAVAIFAGYSYLGNPYTSIPDRLQHPTFSESQSGTDATIADAFS NRKSNLQISGEGVVTKLLPDDNDGSRHQKFIISLRSGQTLLIAHNIDIAP RIGSLRKGDSIQFSGQYEWNEKGGIVHWTHQDPNGSHVAGWLKHNGQIYQ >NE1243 hypothetical protein MKRWVVSACGLSGKDAMKQAAVILTGILVLLAGKSTTAISREPFHLENMT CSITSASMNGCTGVPTLHSGILTLSKEGTFKLKARYEGCFMVENVTKSGD FAASFFEKAMSLELLTTRTTGMKNALSDFPGTLGFAHLNYDGLDGYFIDI SAVIRARGREMENVIVTANLQCTVIDDAARTAVKADRLSFIQKGVYK >NE1259 hypothetical protein MGVMKGIEFSESKLKEILDQLPIEAQSAFAASCAQRIFTCYVEYARVAKS KKVDLDAYSEAISYVWNAVIAGNHDAIILNGLLERCMAVLPSEEDAWESG TPYAEDAAAAIIYSLRSLASGCPQEAIWAAKRVYEAVDNFVVNTYNVNTN ATDGEKFILDHPIVSNELSRQLRDLNEIINSKRDSESLKKTIKIIMERSK SESNDLFSEAT >NE2002 hypothetical protein MKKTAFILSVILILLVAVMALIALFSEIEGDNVRHGIMGAGISTLPLVAY CISVVRVCRGWYLVAATLNGLFFALTVVSIVIILMDDPSTMKNLLAVLLV LLVPLTLNIFALIHIRRTDSRLMPHPDIPGAAGGKNLEGLGGWLILVGSN VVLSPFVIAARTYKSYAEMFASGVWDVLTSPDSMAYHALWAPLAIGEIIL NSALILAWIYIAFLFFSKRRAFPFWFIAIHIATVCLIVIDAIVVHHILPD APIFDANTLRELSRPIGAILIWAPYMLMSKRVKSTFLH >NE0745 hypothetical protein MSIERVCSLCFDNEDLSDWIVNEDGPRGCDACGKRDAPTCKLSELCAFIE SRLSQYWGSADNQLFYVSAEGGYQGRTWDTYDLIVDEIGLSFPRAQNDRL LREILGHLTDQAWCDYDCGALDHDEALKFSWRQFCETIKHKRRFFFLSDG SDDRDSFTPASLLHEIAHSIEVIGLIREIPAGTKLWRARPDLNKGAKATA TSFGPPPAEHALQSNRMNPPGIPMFYLASSQKTALLETRTMESRMGKWSV ARSLLVLDLRRLPHVPGIFSKADRHYRLGLKFLHDFAVDIMTPVARDQRV HVDYLPSQVVTEYFRDYDFEAGRLDGIVYNSTVHLEGWNIALFANNVDLG LSRPTWGRAPEPWLTFIKSIRARI >NE0241 hypothetical protein MGKKKNKKTEVQQPDPMRKNWIMENMDSGVIYLLESWLKAKSQETGKEIS DIFANAVEFNIVLKDWGKEKLEETNTEYQNQQRKLRKTYIEYYDREMK >NE1275 
hypothetical protein MNSTGNTSNNARLHWSHVLLIVLATIVLTVAGTYWVLTTYVFVSSFEPVI LSKKEEKTLEQKLRTIGYDFSFSSPTAKRNDDLKGEIDEEGFLKPQAYSE QGAKREVNFTEREINALLAKNTDLAQKLAIDFADDLVSARLLLPLDEDFP VLGGKTLRLNAGLGMAYRNDKPVIILKGVSIMGVPVPNAWLGGLKNIDLV SEFGMDPGFWKSFSEGVEHIQVTDGKVDIRLKE >NE1100 hypothetical protein MPSTPGIWKLPFNYFYIVPDLADASQRCFYGLNRVNCQPACKNNRYKNDN CSYSIRSSYENTASQITPSLRDLDDNIYESCKRAANLLTA >NE1608 hypothetical protein MSLSWRNRIQIFLAPDRVDLTGIARGIRPVQQFRQSGVCVQENDSRQQWK APLRLLEQMIGQMDDRFRRGSELHITLSNHFVRYGVIAPQPSLANPDELM AYAGFQMREIYGERIDDWELSLSTWDPYGGALCAAIARDLQSELIMFARQ YDTRFACIEPYLAAALDHWSKRLVEKQVWFVLVETGRFCLVVLSEGAWRC ARNQRVVENLQEELLAALEQESIILSPDRSVERVYVFAPELTGQLPVHDL RWQFVRLPDEKHPAPSYFPGVTGMDDSQNHA >NE2121 conserved hypothetical protein MTKKLPIGIQTFREIREESYYYVDKTSFALKLAMEGKYYFLSRPRRFGKS LFLDTLAELFAGNEALFHGLYCHDRWDWSVRYPIIRLSFAEGWLESRAQL DKRICWLLEQNQQRLGVTCKQESDIPGSFAELLQNAEAKYGQRCVVLVDE YDKPILDNITEPEIARAMREGLRNLYSVIKGQDAHIRFAFLTGVSKFSKV SIFSGLNNLNDITIDADYSAICGYTDEDVDTVFAPELPGLDRQQIQDWYN GYNWTGQPVYNPFDLLLLFDKREFRAYWFETGTPTFLVDWLMQRGYFTPS LSRQYSSLELLSAFDVDHIEPEALLFQTGYTTLQGVEEYLPGQRIYWLGY PNKEVQISLNNALLPALGIEGQKVLTHRIRLLELLRANNFAGLQQLFTSF FASIPHDWYRNNPIAQYEGYYASVFYSHFAALGLDIVVEDTTHHGRIDMA VTFNANVYLFEFKVVELVPEGHALEQLKTKGYAEKYKIRNEPIYLIGAEF SKDSRSVVAFDVELFA >NE2325 possible transmembrane protein MNPIYSSMNRRQQGVSLPGLLTWSVIIILVAILGMRLVPVYIEFAAIKRA LVAIASDSELHNAGVHEIRQAFNKRAAVDAIKSVNGNDIVIRKQDGQLVL DINYTVTKPLFANLSLLIDFDAASDR >NE2090 conserved hypothetical protein MYYFLHREISMNTLPETILQQARSLQEGGILSPREFLHLGSRSAVDQAFS RLAKAGRLLRVARGTYAIPVSSRFGSRAPAPEKVIRALAEQSGEIVVPHG ASAANVLGLTQQVPIREVYLTSGRTRKLKLGRSEVLIKHAPRWMLALGTR PAGAAVRALAWIGPTHAGKSLASLRRILPLPEWQALISARATLPGWMAQA IGEEAARG >NE0891 hypothetical protein MDTVRKLIVCAILGVFYVPVVSAYEFHDPYPNVKVIDGEVHIQRPGGGTS LAYPNAGISKDIPVNTSKGQFLVPVEKTFPVEPSKVGKAATRFLKTLPAI GTAIALYDTVCDLTDICRNSQSGEIEYAPDMPAGYPVTTETGYWRHPFYV SLTHVTADLLCKSFDYRAAVHFAPNNLTFLRVEVSGGTSYCIYSDKTNNP PTETAPPNYSIVKVNSGNCFTGYTKVGNECVHNNAPVPVTETHWTDAETK 
LNAQPQQTAEALYNSDAPVPVLASTQSAPVIQQIAQTSTQTKDAQGNITG TQVATTSVKVEDTSTTNNVTYNVTEVTTITTYNENNEITNTQTSTSDNSP PKTGTDETTVSFDDVPPAQLEEEQPEFNLQTPESWGEGTCPPDEILTVQG VTFPVSWQPACDTAVQLRPIFVLFASVAAMFIVAGISRAGT >NE1565 hypothetical protein MKALFFLRHYNDIDHITPVISKWSESGHESLVVLLGRPKFLKDYRIKFLS TLDRVRVAPIRRLLSPLKFMQWRLQTLLLNRSVKRLFLIGKLIEKLARKY DAQKRTAVWQSTAGRLLEHGFSDGNEGGVVVFDWITSDSPVPIEWVEIIV TMARTMGLGAVSLPHGDSPHASQLIRHHEWVLKPDALYSAARIFDKLVVP NELCATRFRPFLSNEAIAVLGSPRYCDEWLDKLATLSPAPRLKTNQDTRL RIVMFLRKSEFTTFWEEVGEIIGMIATFPGVELVIKPHTRGGWRQPLTGS ASLRQLANVRVAEDSEHSISLMNWADIIIDLATSVVFEAVKAKKPVMAAD YLHAGRSALAHFMPETELKCRDDVYTMIDRFLTAGYDSYYVEAHRQRFIE EMLHVGGADVLPRYVALLEEQTMRKKPDQANNTNTPE >NE0784 putative (AJ245540) NrfJ [Wolinella succinogenes] MFRTVSAVILAILLMSFNLSAARAEGASGVETMPANEGVVVSSIDAAGYT YMELANGGKKFWIAAPTTKVSNGEHIRFVESMRMHNFTSKTLNRTFSELI FVTSTQAKVEK >NE2077 hypothetical protein MISNPVNSAVIVSATPTDAISQTRPVSAVTPVPDATQSDTPAFILGQKYR AQIGERLTNGHSLVNVAGRWLQMRMPASANPGNILELTLIEQSPRLKFLL HSGTQGGNNPTTLSPAGRLIAQLLSQPAPPAMKTANEAAPLLPIPPATGR ERIQLPAQLQQALSASGLFYEAHLVQWLSGNRSLQQLRQEPQGKLPAPAT VSTTITDSATASPVASQAVSLIQQQLHTLETGTIQWRGEIWPGQTMEWDI TEYPDDQGKEQADNEKTGKSGRWQTRIHLQLPNLGKITATIMIEPQGMRI RLDADSDEITRQLRKEQITLASAMQTTGLTIRAMDIQQHEAT >NE1592 hypothetical protein MKVLQSDKAIMMNRKLTELPIDERIQLVEDLWDSIASDQKMLRLTTEQKA ELDRRLNAYEVDKNPGRSALEAIAEIRRNL >NE0488 hypothetical protein MIRSRCDRWPSTKPGQLRRIALSATLPRKERRGIERITMQRQSEQEQLFS IFIVFTRPLGRGDPENLNKVAYLNIAVTDC >NE0494 hypothetical protein MKTSFKRPFAQYVKKATKPLRLAIEDEVEMICETPEIGELKAGDLADVRV YKFRFNQQEYLIAYRSPTRNTPVEFMIIDFYQIGTHENFYDKLKQYLRHD KNPREI >NE0166 conserved hypothetical protein MKKLYTANHLLEAHIVRDLLENAYIPTRLFNEYAQGGMGEISFTHTYPEV WVMRDLDFERGRKIIAAYEQAPQVTDIVFCLQCGEENPGNFQLCWQCGSG LEVAREKS >NE1244 hypothetical protein MVSVEGTTLGEVAIMKQSNVIKLFSIVLGLWLLFNPIGTQAETAKYETIK TEYQYAAKVACSLLLPHQDGTLAKGIYRTIINIHNPASKKITVAAKVALS TQMGSEPGPFNVTPFKGITLQPDGAVGVNCFDIAGYFCPINGVCVDFAFL EGFLVVKSPVPLDVVGVYTARPVEGEVQSIDVETVQSKRIHDIVKLGTTE 
LPGRGEGKRVDYPPKGSAAYDGQKPKQMCGGIAGFPCPEGMKCVDDPSDD CDPAKGGADCAGICVK >NE1993 hypothetical protein MRQQADNVYLIWYQPGTGRPVQQSTPARTIGYIPSAQAEENYHLTQTGRT PMTVTSTKHPHNSGGQFRTVAIWEKARIYYNPPLPLVIEDRGHNLA >NE0788 conserved hypothetical protein MEIFMRTSIRKGLVALAIWVPVGMSYGASGSELLEGKDIYLPHAERATLE ELDNATGREGVDITTLNRMNVRAFLADNSATNNVSGFNSIDNGSFVGASG MFSVIQNSGNNVLIQDSTIVNVTILP >NE1794 hypothetical protein MLNVYLTVDTELWPYSDGWPVRALSPYKIAFDEEIAACFYGKTSEGEFGL PYQIERFNQYGLKATYFLEPLFADRIGSNHLADIVDLIQRNDQEVQLHLH TEWLSEIYDPTIPVHFKQYMHQFTLDEQVTLIAKGIRSLQAAGVKELHAF RAGGYGANRDTLRAVAQNKLLFDSSYNSCYLGEDCKIDLNEQLLQPCKIE GVWEFPISFFQDYPNHWRHVQLAACSTKEMETVLLNAWRQGWFSFVIVLH SFELVKGRSIGKLSLPDKLNISRFNHLCKFLSDHPDKFRTTLFSELDPIT IPEIRPQKILYSRLHHTIKRYAEQIHSRFF >NE1679 conserved hypothetical protein MRRIFAKVLMLSAVSFMTSNLVQAADPEVIGEFDDWIAYVYTEDSSKVCY MVGKPKKEEGNYTKRGAVYALVTHRPAEKSKNVFSFVAGYPYKQSSEVTV SIGNQRFKLFTQNETAWAPDSAIDNKLVAAIRGGSQMVVSGTSSQGTATT DTFGLKGSTAAYTAISKECGIK >NE0242 hypothetical protein MKYDEKLIVFIVTIGVLNNPVYKKRLWLAARNMNISRREWPNVIADAIYN FDGIILLSNGIRWPIPDVDKVLGDPRWFSYYFEEDEKGDPHRDVIMLERL RLIDLFFKIKHPEIARHFSK >NE0081 possible transmembrane protein MSGLFSALSRASANLLNPRMLWLWSWPMLVSAIFWWLIGMFFWTPLSGWV LTVIPADTLQNWLESSRLQVIADSVESIINVIIFVTLAITTSLVITALVT MPALVNFVAKRYYPDLARMQGGTITGSLRNVISAITIFFILWIITIPLWF TGIGLLAPLLAAAYLNQRLFFYDALSEHANSSELDKLSSIDRSMRWSLGF LTGLLQFIPFLNFFAPTLTALAFTHFELGRLAKLRHTAAA >NE1246 hypothetical protein MILKSYSTVLFCCFLIAFDSNRYRMHLRFAHSCNSRFQYQLDLFNCFIAD LTEVIFCVPICDINVDCAQMPGQKHGNKESHMAMTMQYLLTISK >NE0890 hypothetical protein MKNFKQRLTNAAYASPLVLFAASARADLPEDVTTAITAAKADIAAAGALV ITIVVGIKVWKWITRVF >NE0545 hypothetical protein MIFRKNQCNHQTTFYQIRKVDDLIGCVYSFAAMGKRYMTDNFILLILFVS LLTNAVAWSFHQEIFKHELDHFHVSHRFDHHHSYHDHAADTGFHQHHDET LDNDPDFTDHLILHAAGQFQPFYFILLPIIPSLPGKENIPGFFPAGIPES TLDLPFRPPRNTASLEIRY >NE0243 hypothetical protein MATAKTKIKDKKQPLALQLVESGTADSARKPPDTENTGADVFFRRLVHHN DAEIAGSMATEQATHQADQQFGAELTKLRLDSEDIDVAFKVSLKFLSDLE VKLANTRRYIKSGTLHSIGGKSVENVGWTDWRRKDQILLCVLVFCLTIAA 
GLGMGNVYANLVSSGNAVFIEKPWLATMISALMPIASVSVKYVTNFMIYD SSRRLYAKCIYAATGMAFLFWGGLFGLTYSGVASSIDWDSFGESTDYGFA FVWSQLLVELLMASALFLAAEDIYMRYSPDVYIENLEYLELEKALKEQRT VHEALREKRGELHGRLVELEARREAFINDKVMEFVSLRARHVATMNAHTD H >NE2033 conserved hypothetical protein MSEESSRLGQCDEGKASWKKWGPYLSERQWGTVREDYSDNGDVWNHFPHS QSGARAYRWGEDGLAGISDDHQLLCFALTLWNGRDPVLKERLFGVTNLQG NHGEDVKEYYFYLDATPTHSYLQYLYKYPQAAYPYEDLVTTSERRSRQEA EYELLDTGIFDENRYFDIFVEYAKADPEDILIRISAVNRGPETADLRILP TLWFRNTWSWAPGLAKPALYQEENQDDCRIIHTQQNESGDYRLYCADAPT LLFCENETNTCRLFHTDNASSYTKDGINNHIVCGQKDAINPANHGTKAAA DYALNIPPGETRVLRLRLRRAESSAPATDKIFAGFDTLIDQRKKEADDFY ASLSGGRLNKEQQRILRQALAGMIWTKQYYEFDVERWLNEHPRHNARNAG WAHMKCRDIISMPDKWEYPWFAVWDTAFHTLPLAMIDPAFAKQQLGLFLE NRYQHPNGQIPAYEWNFSDVNPPVHAWAVYMVYQVCQDYHDQNDLSFLKS AFASLERNFSWWETHREPDKNVYEGGFLGLDNIGVFDRSVELPTGGHLEQ SDATAWMTLFSQNMLQIALELSLHDPDYEQRVLSYLNRFMATAAAMQDIS DEHRDMWDDEDGFFYNVLRFPDGHSTRIKVRSLVGLLPLCAVTVIERTTL DKLPLVAEHFENLVRRRQFLADHIFCPTTPGVEGRRLLAIVDEEKLRRIL SKMLDEQEFLSPYGIRSLSRYHLEHPYQFHWNGQTFTADYQPGESTSNMF GGNSNWRGPVWVPINILIIRALLTLYAYYGEDFQVECPTGSGKQYNLFRI ARMIAGRLLHIFLPDEQGRRPVFGNTEKFQIDPHWRDNLLFYEYFHGENG SGLGASHQTGWTGALAALLTIFGSLEQEELTELGMQEISAILAGNNGI >NE2513 putative (AF322013) ID483 [Bradyrhizobium japonicum] MAKPVVIGSRSFRTQSSALDHYKALLHRYQDGQRIADPADHTDLVALIER FDPVLDAVGEPAKGAGQIAHFERRLNTGIGWSTPGFWVVRQDGTETDFSY IDAVKGRPKGRSQDFYNACRQAVALDLVLAKKQAFAQYGDDQGRVECELT GKMVTIDDAHLDHAWPYFSHMVSGFRAARGWSRDIPDGIVSTPADGQTTA TFLDSAVTEAFRAFHHDQAVLRVLSREANLQTASSARRPKVARPVRLA >NE1576 hypothetical protein MLHARSVRKDIQPVIARHIGDPEWITGQIEVKESVRINACYTRYRHQRFP NLQLNFSTDHKTLLPENPEPSRFNKRAIPWFTSRTT >NE0475 Helix-turn-helix protein, CopG family MAQITARLPDDLVSSLDAAAARLRRSRAEVVRQAVEYYLEDFEDISQAID ILRDPADPILDWEEVKRDLLHLD >NE2007 hypothetical protein MDFTIKAIALLTIGHRLHQFVMYQPCCKIAHTQLTLERQGRQTDLGLTNQ INYQEPDGQRQFGALENCSGKSMRSDADRPCIEKPCVNQIL >NE1083 hypothetical protein MSVPLAALAAFTLAYAGMTGLSLAMPRHYEQVAGQRVLPSGRRHFFRILG WLLLILAVVPCIQAWGTAVGVVVWFGFLTAGGLLIILMLPYLPRLAALAA AGTTIAGVLILLVT >NE0116 
hypothetical protein MTIWLVTRHPGAIEWVARQGIQWDKHAAHLDPCEITAGDTVIGSLPINLA AEICNRGARYFNLSLNLPAHLRGRELDAATLTACEARLEEYIVKKVNS >NE2094 hypothetical protein MFDVNYEKQAQDYYSKAPIIILGSGASATHGMPGMRGLAQHLTDKTDVSG LSDAEMEPWRSFCRTLTDGVDLESALRQVAVSEELTCRIINSTWSLINSE DAAIFKNSLQNSSMFPLSRLLEHMFKTSLKKINIVTTNYDRLAEYACDQS RIHHYTGFTHGFFRQLATPDELTCSRRVNIWKVHGSLDWFQSPLEDTIAI SGAQEIPENYSPQIVTPGTQKYQKTHLEPFRSIINNADIAINEAGSYLCI GYGFNDEHVQPKLMAKCQRQGAPVTIITYALSDSTKKLILGGKAQNYLAI ERGATDGQSVVYSSLSSSSFTVEKNIWSLEGYLSLIM >NE2517 hypothetical protein MPISESQLETWSHQGSITQSSTTYNTIKSVLEASTTPYASKNFKVFLQGS YGNDTNIYAESDVDIVIRLDDCFHSDLESLSDDEKSAYKQAFNDATYTHA DFKRDVLSVLEGQYGSAVKAGDKAIAINASGSRRKSDVIVATQFRRYFKF RSASDSEYVEGICFFNATGERIANYPKQHSANLTAKHQASSKWLKPMVRV LKNMRSRMVEDGLIKAGIAPSYESPRVLRRLQLLREWSHEQSNKVFP >NE1481 conserved hypothetical protein MLHNRQSISAPKLVVRPHVPWYRRLLMSFVGLLLIALLAYGMYVIGQSTA QPAGNITVTADPVLEQILESNSCLEKYDTALCSQLAELVRQLQIGNATRA DLVKQVKSLDEENERLREDLTLFQQMISGNEESSNVELIIHRFSLEAGQL PGEYLYTLLLAQGGQRLKEFSGKLEFVVGLLQNGEEKFISLVDENASKEF PINFRFYHRLEKSFQIPADTVVKSLQVFIYENGSSKAVLTKTIQLPLKES EHVRKKT >NE0130 conserved hypothetical protein MITTHIQNFSVLFEALSLAMNQTEFSIKTRSKQMLKWAIIFAIISFISGV FGFRSTSAGTASIAKFLFFLFALITLVLLVLGLLGIGVVA >NE2506 conserved hypothetical protein MARGRNSALMDSCQLVRTETKMEVAHLNQKQLAARWSISEATLERWRSAG IGPKFLKLCGRVLYRQADIEAYEESCLATSTKTVVAQVSVS >NE0819 conserved hypothetical protein MPTVDQSFPSFFNDAPTVTLQDPLARFLGAAHDGIMEYQYVDAVRLAGHS CPTVAGTWLMTVHGLRALYGDALPVRGEIEVYMADARDAGTTGVMATVAQ LVTGAAPETGFQGIGGRFGRNDLLHFDQPMQGSIGLRRKDTGAAVQVELD ASVVPWPDEMRVLLPKAVSGQASTAELQRFGELWQERVCKMLVDHADDVN LVRVSNWAVD >NE2509 hypothetical protein MRSCELRTRHQRAGTLAGTAPARDSCDHPAGEPGPPWPTRRATTACAQTL SRRRPIMRELDKELKDLRLYGMAGAWEDLVKQGGHATLESSRWFLKANCS RPRGGWSSGCGRACTSSTAAARSA >NE2430 hypothetical protein MLDLIAIVVLVSGLYLWLRKPKTITASASNAEEAVLPLDDDHTAPNNTMV YINENKDR >NE0007 hypothetical protein MKTSITQVVFLILFCVLPQTTMAQRNMPQSYPVAASEKLVNGIANAVTGV IELPKTVILTSRRDGPAYGLTVGLVTGIMHTIGRTVFGVLDAATFFIPTQ PTVRPPYIWQDFDKETTYG >NE2038 
Myeloperoxidase, thyroid peroxidase, cyclooxygenase catalytic domain MTWHGSNKSGGYNPPKSISYDQGKFGRMFPSLPPFAQDTRQIRDALKELG RKGGIMDAKEDTDIAVNPNLARDLIIDPALSLINPNNPNLVAGMTFLGQF LDHDITFDPVSNLERQSDPESIRNFRRPLFELDSMYGSGPSASPYLYDQS ADGEGIKFYVEEISGAAAVSAGGFVRYDLPRNSQGTALLGDPRNDENLMV SQLHLAMLRFHNAVVDYVKAQSSLTDPDEVFTEAQRLVRWHYQWIIIHEY LVRTVGKPLVDNILINGRKFYKWHNQPFIPIEFSAAAYRFGHSQVRPSYR SNFGPIPSDINSQIFRLIFNDNLADEPDPDDLRGGKRAPSRFIDWQTFFD FGDGKVRPSKKIDTKLSTTLFDLPAVRGDIQSLAQLNLLRGLTFSLPSGQ SVAKAMNLPILNTTDLADLVDFKLHQRTPLWFYILREAEVKENGERLGPV GGRIIAEVFLGLLQGDSMSYLRQDPRWIPTLPSTVEGTFRMADLLRFAGV VAPL >NE1605 putative prolin-rich transmembrane protein MESDMAGTLYRKLILWGALAATVLAALLVDEGTELSVDDVVQPAVDISAD RRTAGQTRQIRQTHETLPVDQLGKRKFSAKADDIFAVTSWEPKRTASTDF NPQIFQPRKEEVVRRPSAPPLQFEYLGRVVSEGKIRVFLAQADQNYVAGA GERIGTEYRIDRIREDTIELTYLPLGIRQTLTIDQGTFD >NE1241 Tyrosinase MAIRKDANTLTAAERAEFVAAIRVLKAEGIYDRFVLRHANANMSAIHRCS AFLPWHRRFIYDLELELQRVSGNPNLGIPYWNWPSGSANASMWNDDLLGG NGDAGGVVRTGPFRSGQWTVINSSGLPAGPLMRAFGQNGLPTLPTQAAIN QVMAVTPYDTSPWNMNSNPSFRNQLEGWIGPNLHNRGHVWVGGSMLPMTS PNDPVFFMHHCMVDKIWHEWQLRFPNQGYLPASGGPFGQNLTDPMGSTPS GQVGSRPIDVLDSAALGIVYDDAAPQPQPQPEIPLIVVGADPIAAAIGVP GETDVFRFEVPAFGAHTMYTLGSSDTFMTLFGPNDPNFEVASDDDAGEGF NAQINRNLSAGTYFLRVRLYSPNSTGNYAVGVRAVSATPGPGPGPVPIPE LIVNGVGIDASISAANESDVYRFNVTTGDFYTIQTNGTTDTFMSLHGPNS QIPEIASNDDSGISFNALIRRQLSPGEYFVRVRHYSPSGTGAYSVRVTQG >NE0313 hypothetical protein MDREDFFLKIAELHLKRVEILQTVEWRITFSLWTFVAGVAMVSLANADKM KQAAVAMGGILGPVVILSLMGGIYVWLWYLYLYKFCKKNYNSLVTERNRY QRMQNEAIKLVLKGKSADFLIEAGAEDKRVPESDFSEPSFKQLSGDDFRK SGVWEFKRGITAALMFFSWLLVLMIVVPSASKHLNDSVVATGKPAGVEVE SSGRNLRREADQHF >NE0921 conserved hypothetical protein MSYERFTVLKVPFPFTDRTAAKNRPALVLSDAATFNDPIGHSVLAMITSA ANPAWPLDCLIDDLVSAGLPAPSVVRFKLFTLDHRLIRGELGRLAVSDSI QVTRSLYQLFGMAAVR >NE1936 hypothetical protein MRAASCSMFSQILKLILRTGCDGLSRRLGQNLTVVRKAVKLCLVLDHNGY LSAPASLSTGKVVEVKVDEIMVGCVKKPTKFGGLF >NE1887 conserved hypothetical protein MTDNLLKYGPLGLPEYSERRLLHTELNADVYELVNIPLQLSHLVLLSDRQ 
WVNRERELIVQICEHFGIRMLNGAFDQLSVELGGFQLRWERHTEYSTYTF YSEGPFEVPFAQPAIAHVPPEWLEKLPGEVLVATHIALEDRRRPSRSMSE LSSLFSSNTVIGSKVSAGSASVWSDNQIHPDGFTRFLIHDDNLRSRQVGR LVQRLLEIETYRMLAILPMTMTREIIPQLERYGDQLTELISTNIAPNSIE DEQLLLVKLTALATEIERISAQSSHRFSASQTYHTIMQQRITELREERIE GLQMLYEFRKQRVTSAMSTFDLVWSKLETLSLRVERATSMLRTRVDISME SQIRDLLRSMDTRAYLQLRLQETVEGLSVVVLSYYLLGITGYGLKAAKAA GLNIDIELMTGIAIPVIVTIVFFAIRRFRRIVSKSAFGENKGGE >NE0799 conserved hypothetical protein MKISPSRQVKGEIEIIPVSGFRGIGKFIDVPWRLYADDPLWVPPLRLERR LHLSRFNPYFRHAQWQGWIACRDNQPVGRISAQIDELYQQRYGTDTGHFG MLESIQDEAVFSRLIQVAESWLVERGVRQISGPFNFSINQECGLLVQGFD TPPVFMMPYSPEWYTSLLEQNGYQPCKDLLAYWLVTDFDPPPAMQAIDRK YRHQIRIRPLQRNRFNEEIETMRDIFNDAWSDNWGFVPFTQEEFAELGSS LRWLVPDEFIQIAEIDGRPVAFMAVLPNLNEVLPALNGKLLPLGWLHLIN KLKSASITTGRVPLMGVRKQFHHTLVGIALAFKVIDAPRKMVKSRGIGHV ELSWILEDNQSMRAILEKIGGREYKRYRIYDKTLA >NE0805 hypothetical protein MGTADLPNILDLLIQEIAIQDARPVHPAFQVFTDALRQRFGEALDAVVLY GSCLHTSDLTEGIADFYVLVSDYRLAYSGRLLAGLNAWLPPNVFYLEVPA AAGVMRAKYAVISTADFERGARQWFHPYIWARFAQPARLLYARDDQTGKR VHTAQASAVLKFISTTLPVLESGPSDLEMIWASGLMLTYAAELRAEREAR ARHLVRIDPEIYSRLTAAAMPALIPLLSLQADGRYHIGPITPLKRLSARI HWRLRRWQGRVLSVLRLSKATMTFRDCLDYAAWKIERHTGIKVEITPMLR RHPILWGYKVMWQLLRRGVLR >NE1297 hypothetical protein MGTHDREGLLTRYRISFLVNIILLVIILAYLFKLSQTGIPVKVVSYEKQD LTVKNLETLTRNELDSERTEGVLVIKKKDGEQADLVFYGKGLAGMTAGAG SSIKELMLQLGSGSTDAYAGDDNCRSVLVFRGQVYCIPW >NE0392 conserved hypothetical protein MEQITRWLMIAGAALLVIGVVLHFAPWLFNWFGKLPGDIRIETRHSKIFI PITSMLIVSIVLSVIINLFKK >NE1636 hypothetical protein MIWVNNLISGFDGIYWYPNSSLLRIEEWDGEFTVFQPESGKTHFLNEMGL RILTVLDRSPATLEAICQELSAYFSLQLDAQFPGQIIRTLQRYEALGLIT RVKENE >NE0655 hypothetical protein MTLVTRCPVCHAVFRLTGIQLHSCNGDVRCGQCRQVFNGFVALIVVPETC IQPAARSAESAPDYLESGNVAVVPVAESSFPADHFGVQLSTRKTSRWWLI PNALLLLLLLGQFVHAYRTEIFIAFPAFQPALDSYCDLMQCEIDLPRHLH LLSLESSDLRVSSPAEPDVVALSAIIRNHAPFPQALPALLLTLTDSDEKP LASRIFTAEDYLDSVTDQSVLGGDSEIQVQCFLNTSSLDAVGYKLELIYP >NE0183 possible (U92432) ORF4 [Nitrosospira sp. 
NpAV] MNKVLRNSGIAGLCLALSIFLLSAPASAQLMLAHEGHHDAGGCKIEGGDF PVTVSVYEVPEGNIPPMHSYCNHLPDAGKINMTVELSDSQTREVPIAVRV LMEGHENSDHGAHEVLYMPAEKYSSGIIVVATNLEHLGQYTVQLETEDSA GQVKTAVKIPLHVGGGGGHDHGSNFGMLEMILLAVVGGVGTFIFMRSKKA ANA >NE1180 hypothetical protein MINTITILTILCLSHNQPKNFVMILFFNNLATRRCVPADFLSGIHDHSYQ IKHSFTLNIGITGQVGAILQQQCLDAIRIAN >NE0114 hypothetical protein MASLQSSKIPIQRRFPLIGNFPVSHFRHHCEAARGNSVSLYCLTGLLRCI YLAIIVQALEDSVFTAINLLTEYLPGKLFHYARAAPTP >NE1050 hypothetical protein MNYSKPQKVEFATFPESKRNLLRFPGSGEIPGNDKIKASPTLAPLSKHSA NRFHSGVILFSARKMKNMATFYAPAIILKLKMYT >NE2265 hypothetical protein MDFKSGIKHGVAITGIILGLSITPAQAGVAGSMVLDITGGCFSYGANGAT GCGISDGSPDAAGAKEYATGSFTFSNTLSGISNPAAYAYQASVSLYAEAP PDNPVISFYDTRSKSFATLADLQSDPLWNTAYAFVTAVLANTNGSFTATI PTPPAPPGTTVEASWNYTLSNLTPGPGSTPAYATGEFEAWSKDDLNGLAL ILFGPEQPLPTSPVNFSLTVALSAIPEPATIALIGLGILGMGAAQRRKTP AALPV >NE0179 hypothetical protein MQNDNHSTAQPEKDVARLTEEVREAIAHGGDIENAIRNLTLKAMHSNGLD IESLKQIATAVMKGVQEGAQQKMTHAAEQSHAAQSQITQAVVGLDTAFAQ LAGASKLALEEAASKAKQFSDSELTKAQADLKDLESVFLDTLKHTATAAQ GLIAETLRDMLSHAQHNGTAVGMQLKDTLAVFAHQMASTGRAQFEAGVKL TQTTADLLYKISTGVLSGITSQTNRDDK >NE0270 hypothetical protein MPEQITRKASPFCLRLTPEERTLLEREAAGLPLGEYIRQQVFDENRVKRR SRNKQPVKDHRLLSQLLGELGRSRLANNLNQLARAANCGLLNLTPEVKTS LLNACADIRHIRETLMKSLGLNR >NE1937 conserved hypothetical protein MSVKNIGNYVIVLALIAIVTSCNFGSSDWRTASRARTGLAPDPAETPEAV IQVYAARAYSWRGIFGVHTWFAVKPSHAESFTVYEVAGWYARWGGSVVAI HEQAPDKRWFGNAPMLLAEKRGEGVDELIKRIDKTVQTYPYSKEYTIWPG PNSNTFTAWLSRAVPEIGLDLPPTAIGKDYLGNSMTATAPSGSGWQLSVL GLFGIIVSDVEGFEINILGLTFGIKPDPLAIKLPLIGRIDLSV >NE1279 hypothetical protein MKSGLQCESPGRYFSYLSYICLVQLLSLHFESMKGESFMRNETQTTHSHS KHEQHCQHVYETGQLRRAKMHVARLTGNFDSRKILSPSLLQLLETSIILE TTDSKVLASRLKRKPAAIRADLQKICHLLAEEPRL >NE2531 possible signal peptide MCDRAPGPARVRRAGELLLQRRQPRVLPIHAGHETQCGGLRRGLRRTAAL ALLPGQRLRAAHQRAAVRRAGYSARSRIPAVHRGGVVSVIPWPYRLLTLA ALSVALVGFGWIKGASHVQAQWDAAIQQQALQAAAVRERQAQATVKVVTE YVDRVRIVREKGETIIKEVLVYVPVQADSACTINRGFVRLHDAAAAGELP 
EPARDADAAATGIALSAVAGTVAANYQTCHENAEQLTALQAWVREMKVAG EQ >NE0776 hypothetical protein MSGEKSGRNFIFYFYPSLFIAANALFIGMVLFLFYASMQPPHLLEDTVNT SPYRQIASWFDPIFFQYNLVLLVFAVGIIPLITLCYTSSMREEKKRRLQR ELPPAIYSANSNYIQNYLSKISSIRSYLGSMMSLMFVVMFGCMIILLLKP APLALPDLAYANGVDYSKGANFLMLGTYMKSYMVGNRDYINVLVYTLTAF QFGFLGGYVYFIGLMVRSYFTLDMTPNIFINSSVRMITGSLLAMVLSYFL IDPDKFSEPDAILIRSLPVWSFFIGHFPDRGLVFLENIATKALGLVRIHE FASPLSDLPGINYNQEILLKREGYDNIENLANANALDIALRTGFSYQQLV QWISQARLHGHLRNDYHAFVNCTGIVSLDDFVHFYRTVKLQNSAADPIEL IIASLKNEKHDLDDKIRILRYLADPRDLATDIPHSMRDENSAADQETISN KS >NE0570 possible lipase MNDEVKQRLQSTETWIRVLFMLLFMFIQGSVKFLIVLLALFQLGSTVLTG QANTRLLKLGRQLAMYDYQISLFLTFNSEQRPFPFSSWPSDTDNRTSDNA DNRTPNENPEKTSWFQ >NE0503 hypothetical protein MKRTYCTIMLTGALSFGLSTACTASGIHKLVDERGRVIFTNDPAKNTRQI QSSKSVSVVPSRRNGTSTEPITVAITGSNYPRVSKLQQDQRDSKRRQILS QELANETRLLEDALKTIDLTQQKTDNYLPGRPYFTSDHFDILQLRNQAAA HERNIEALKMELNNL >NE0998 hypothetical protein MTLKRTIKEFATYLGDRESILDRDYPRVAGQIELLWGYVEFYRYLEKLLI TEKGRDRSGFPFEAVLELDKLKEIHERLYP >NE1606 possible transmembrane protein MARIALENWLVRARWQVTRLGTVGRAGAGLLVLTLVFFIAAVMPQKERLK ELKSKVQVMQQAQPDSAGQTKLNNNQALQVFYDFLPRSDSSPYWISELDR IAKDSGVELNSSDCRLKVEKESKLVRYEIQLPLRGTYPQIRAFIASALQA VPALALADIIIRRETIQAGRVDARLNMHLYLNDY >NE1379 hypothetical protein MLPELLAMAQLLVSWLLIIGGWLFVHRATLSRERRKEKREEINNTIQEIR AIENIAIDFHNSKIFDEKAASSLTLRINRLNRKLQAPPTFNELKIPTQLM IEFRKTITLEHFDKSNFPSMVQRIMKGNPYPATSIEILIRDINSATDDLV DCIEAEKNNKL >NE2540 putative bacteriophage related protein MPMAEIRALWQKLVGGDTPTHNRQFLERRIAYRLQELEFRKADANLLDRN QRRIESLVETGKVKKRDRDYRPAAGTVLVREYKGVEYRVIATADGQYDFQ GRIYPSLSMIAREITGMRWSGPLFFGLKPPSNAKAKPSPKKRGGR >NE0113 hypothetical protein MPEPLQPHEYPHRILLCVTGLSPQIVTETLYALAVARATPFIPTEIHLLT TTDGARLARAALLHPDGGHFHALLNDQPQIGLPRFDEDCIHIISHHQEKL ADIRTPAENAAAADTITALVAQLTEDADAALHVSIAGGRKTMGFYLGYAF SLFARPQDNLSHVLVSSPFEGHPDFFYPPRQPRRLVTRDGHHIDTAEAIV TLAEIPVVRLRHGLPATLIAGRAGFSETVVTLQQSFAPPCLLIDLEQRNV VCGTTAVAMKPQLLAWLAWWATLARQGRPETTWREADARLFLDIYRTVVG IDAIDYEKTAELLGNGMEKEFFQTKNAKLERVLKDTLGPAAAPYLLTTTG 
KRPHTRRGLTLPPERIRIVGTGSK >NE0895 hypothetical protein MLVPPVRKQRRKCIMSNQKNRYGGLFCALINSSFSSWLIKAMERLGFKQF WINFFLWLPCRLAMLEIKLEYGSLENFVNRDSD >NE2542 Bacterial regulatory protein, LacI family MSDIRIQKTGEPDILQTSDGRLTLSVPIQIKRRSGRKLVTLPNGETAPVR PWDVAPTSIQLALARGHRWLAMLESGEAKSLKEIATREGIDNSYVSRMVN LTTLAPDIVAAILDDALPNHVTLFDLAVDPPALWDEQRRKVWDTSFSTSR LMQDA >NE1337 hypothetical protein MIGRNRYAMLTVDTEALPKRAVQDHIKRLMWGEHDGGCAGIREMCAIGNE CGVKQVFFVDMCGAYACLDQTLDVVRWLDQDGQDVQLHAHPEYLPEQFWK EHGFKYRPRFLNQYGIEKATFTIKYFGKLISDLTGKPLRAFRAGSFRWNA DTLRALQEAGVSLSFNNSMNARLKGQCTYSEPTNHSYLWSNGVIEVPVTE RKFFPLFGKEWWGRLQFPVGDWLGSPPWRVLRPYTVGADPSFLVVLLHSW SLLYWDKDGYAVYRDDKRIEDYRKLVRRLARDYDIITTADFLDLYACGKI KTTHTADLSLAEFKAVKK >NE1629 hypothetical protein MKNTQPYSWWLLPCLLTTLPGCLSPITLHHAVSAYDDAITSTISRQLLTN IARARHHQPIHFTGVSNVAATFDFRFSAGATPALGGLAGTTLMPLFGGSV AENPTISIVPIEGEEFTRRLLTPFQQNKFMLLLRQRFDIDLLLRLMAQEV RIQESTSQTTYRNTPSDMTGYETFRKVVLHLSAIQDRDQLYAEPLNLEYD WTLPAAAVSAEGFHTLAKEFVVHHDRQNDLFILHQKKQGPILITNYDPGI LSEKERAQLSKEAEGWEPNDVAFDIRPGGMGGEWPMKGIFRLRSFHAIIS ALGRSLSDEPEYHVEKDLRTLPVSRDENPVATMALLVTDTPSPNTDLSIR SHGRHYAVDTQGQQARWNRDAFQMLYLLFQMTVTDLPRTGAPGITIAK >NE1340 hypothetical protein MVREEELSVIISDIKAKIEWYDITFAEDLYRVADRNAPKLKKLPVQVKHI ELQAIANRVFCGRMLFHYSSLRHRILQGKN >NE0648 hypothetical protein MGALAFYTFIYFVGHFAALGLNIITNKKLLRHRWAGLAGVIIVAIMHGYK IISTTPPSGHDDDTLYALSYFVIFPVVVISAVLFYLSEKDKKDGGSK >NE0510 hypothetical protein MILRSWARGVTLATICMFSLSSGVILAKNNRPTPLPTYQYPDVYNNYLQP VATAIDYHSGMVYLFNYENDKLIIVDPTSLSGWPGNVPLQHTLVFPEGDK IFITSDNTEEHAAYIIILKVNDINWDAGTVSLAVETVVAADNPGTPTEFP FVEPVNNVQAIPNWLVGRGTTQIHGPTILPYSDFVYLTELTSDRIRVINR KTNEFVSGVDPIAIPGYTEQTHGINFNRSGTIGLGTGYFFDNSVIDVYKP NRETGELQTIGQIRLGDEKRHAAFTHFVYWLDERYAVTASMQFDKTSLTP TTTKKIIPPSVWLLDTLEGTATKILDHTNHANGKGIFRSPSDIAVVNGKL YIAEEDSLDYTFANDGYISVFDLTDRYKPRFLKRLKPGRELPTGYAVAHT ISPTPDNRYLIVASWVSGYVLKIDTETDTVVKIWGPNDGLIKPHGIYTAG GLR >NE1841 hypothetical protein MFPAHPTIKEVDLKKIFTIVLGFVMTAAMADGVDQRQILPMNEMQRNHLL SEMRMLLTGTGAILEALAQDDRAAAARHARSLGTDMPHKMEGHMDNILPE 
QFMQLGMAMHQAFDRIAQDAESGKDTKHTLQQLSETLGRCTACHATYQIS TGRQLAGQGSQKNHGEHGAHSHTY >NE0016 hypothetical protein MTYTFKLLTGLTLSVMLTGCMHTPVQPVDEPSEQAEIRIIQESGSDLSEL MHYYDSLQNKSRVELWEEYKYANSHYRESTDMQQRLKLLILLLLPNTSFQ SNRVALNLVEDLPEQAETTPDTTAFKNLLVLLLKRQRAANLQIQNLSEKL RSAETEVKTLKNKINAIKIIEKDLMRNNTP >NE0791 conserved hypothetical protein MRKNISENNGKRPGSVKSRFLVSAFAVFLVAYSQTGRSSEEVLEQYKSLF AQQQKEFEKQRQIIIEQGKEIEKLKSRLDSLITTQPTDRSPASNVAGKDG QRPPQVSSPSTPKTVVAGPVGKKNDQVQTRTVPGNLPAGPVGQAPPKQDE KPRPPEMPRLSDAVGGVLTRKGKIVVEPALEYAFTDSNRIFLDAFTFLPA IAIGLIDVRQVDQHSLMASIGARYGVTDRLEVEARVPYRARFDEQRSRPV SIGAGIDETFNASGNGLGDIEFAARYQLNSGAGGWPILVGNMRATVPTGK GPFDIKYAQAQGVPGAVFPTEVPTGSGFFSFEPSVTALYATDPAVFFANL AYNYNMGTTEKALDGSGDKFKVDPGYAVGMTFGMGFGINERSSFNIGYGH RHIFNTKINNRTLKGSQLDIGQLLLGYAFKYSQQTTLNFSIAIGTTDDAQ DVRLSFRVPMTF >NE1135 hypothetical protein MIEKYRIKKPPERAVRIFESYLSQFLLSFSALSTLSGMESRSIDMLGGYW AGFGSPQAVKKVVTASAAANTSESFIFSSPIN >NE0319 hypothetical protein MRAWIEYRGGYVMNGRFCIWRSPEFRYKLMSVLFPGMILSSSCFGESEIG QAPLGNRQILQSQFQEFRISSSLQYFPHSYFPVQPILIYPGVQWAYPCFP FVSCMELQQYRRYKRREKRQQPKPVFGQGASLMDESMEDWRAGLRPAVEP FRTDEHQIVPALRGHSLIRPEYREAGSILPRFSNGTE >NE0349 hypothetical protein MKGNALCTGEPVMTIETRGGGSEDIAVLPLDPTGNITRLFIRERCNDSSD PECKTGQWRLGSVDISRENTPASTRYAWRPGNNASEDDFEPLGMSLVPGN TPGEGTLFVIDIARPQSVRIWQLDISGGEITKATLATPADTQTGARLTAA NSLQAVRNDDSRSFHLTITRFDEYGLLPFRPTPWPALVRINNGVIQPPPA QDFRNANGIIRPCAGCDLVIASYWERRLRFVSKENGEIGEYASAELPIRP DNLRLDGERILIAGQRRVDLTALNLLVSPHIPSPSGVYAIDTRSLGPDTV PTLLWEGGWKHGHSVATAVALPGNRLAIGQINTPGILIADCSP >NE1814 hypothetical protein MSMLCSLYRITPEQVTKLKDFPDAIGELVGFTAPPPKVSFLSKLFGKPPK QLSSSGQQFEPVAESDIFELNQAWHILHFLFSGTNAESPWPGGFLISGGE EIGPDQGYGPIRLFDSELSRAVAGFLDTQSFKMLDSAYVASEIEATEIYW KVSSEHTERQRQLEELWSMVKELQTFFEHTVRAGNATLLSIY >NE0281 conserved hypothetical protein METAATNLVRSLTPQEYGFTLIILILAALTGFYCFIRAWKRWHLIKDTPT ARLRSAHQGRIELEGKGRSLPDQPVFAPLSNHECLWYHSRIERKETILEQ KRTRTEWKILYRNTSNHPFLLDDGTGICQVDPEEAEIISNEKLVWYGNTE WPVRTGILDNGSAIIGLASRYRYTEQLILPGQRLYITGHLQTRSPATERS 
VRDIARDLLSDWKQDRRQLLERFDTNRDGEIDLAEWEIARETALSQAQTV HRQLLHETEIHHVSTLKDGRYPFIISVRPQAELIRKYRRNALIALTGCFS VAGCIIWLLHVHG >NE1199 hypothetical protein MVWVIGLIVLILLIVSAWFRKVAVSVIIVTGVVGSLIYVLNEREEERALS RISLAELDFENVALKPSYSGYKLSGRIKNNSQEFTLKQVNLLIIMQDCTG TPDSQDCVTIGESHENMDLNIPPGQARDFEKSLYFPGGNLKLLGKLEWNY SVSGIKGE >NE0255 hypothetical protein MTYDTNLPSEDGIPDEGNAKKVARGTLQVIGGAVPLVGGLLSALAGAWSE REQAKVNRFFEQWVRMLEDEIREKEATVLEVMARLDLHDEKIAARVESKD FQSLVKKTFRDWAGVESEEKRVLIRNILSNAAASTLSSDDVVRMFIDWIG QYSELHFQVIGAIYNSGGITRGAIWKKIGKGRVREDSADADLYKLLFRDL STGGVIRQHRKTDYYGNFVAKSTQKKSPARSGGTKTLTSAFDEEDQYVLT ELGQQFVHYAMTDLPLRIAYKL >NE2152 hypothetical protein MLNISILIFAIAALGGVFLASKVLAGKLAPWPVSIVHALLGAAGLVTLIL VIMEGPENNRLTAALALLVVAALGGFYLASLHAKSAIAPKGVVFIHAGVA VAGFLTLLSVLL >NE0887 hypothetical protein MLTESGITPYPALNQLSLQYSIFGFLLRNITKPGNHFAARCYLSALDDTS SLPPGQIFNLLTVLDFYYQTERNQTERRAPARSEGRGSEGQKILYVPVTR KVSLTGETFCSF >NE0752 hypothetical protein MISTMQAYALVNSEEWVFNTDIPSALSSFTEIIPLQSKVMLELDLIKELS SNYKIENETHRALAGVFQTNYITQPREDYLLAFRLRQFCVYKLGKASKDG LLKFLGIVAAGYALTPITGPLAWIAPGIAGWEFIKSIVLAYERIEDPDEK MVFETIYILERRPIIVDYRAYEKEDFLNAYGHAWPNIDDLNNELNGKLTE KELKKALVSLKARGIISNK >NE2543 hypothetical protein MADNDRQFSWRQGNVVTLEAAKALNLLAPECDDQHFAVVASHDCDLSASQ DKEPCVEVVVGKRIDKLGGDSFGKTARRLHIEYQSEAGPVAIELLATSKR PVAKHELFSTHPRQDIWLDGQGIGILQRWLASRYHRAAFPEAFEDRLRSA NLPGKRTFLKRIEGILADGGDHIRALLFDLDEGKDVERDGPDDVYQLGIV VLYDSLRDEPAAAQVAGKAAEALEELFEAAFHPKDSGWKNICLMYCDPIS DSAITVAQREMLKQWRLEHMSLQEDPPQPMITP >NE2145 hypothetical protein MRKFAAFILFSVVGTSGWCGDTIRPGLWEVTTRSDLLGLIAHVPSEQMQQ ITSLARQYGLKVPRIQEGAAISKVCITPEMAEQDIPSHFYENQSGCSVVN ASRSGNRYQVELVCDNPRFKGNGHAEGIFSTPERFTGKTEFNSTVQGTPL YVYAETSGRWIGAQCEPMR >NE1226 conserved hypothetical protein MRNQILKKATVSIMLLLCINVSVADTSTSVSCAESGSASAYSRSGNSVSY SSVNCSSNNEQNAMTPAGIAVSKSYSPEPYTSLVLSGAFNTEIKTSSENR VIISGDSNYVESVEVNSSDGELIIRRPGPGNDNLNVIVESISLQKLKISG AGSTNIYGDFPDGLSVRKSGAGSINIEGQASTLKLNLSGAGNTTARDFTV DNVEIDATGAGNIAVCAKKSVAGSLAGAVHFKVYCNPSQRSVNTRGVSRV SYR >NE1917 hypothetical 
protein MSEQVYSGVDQDEDGGLTPLGRIVIDAWVFGILPESEMCTGWSMSQMQNL YEEVYAAWGPYAHLPSRLPPELQQRHSFFYSQRITVAKNNGWDTDLSDES >NE1277 hypothetical protein MNTAHKWRFFRSGGFDQVRLETGADVESLGTLDPKLWAALSCPTSNLEFD DKTLEFIDTDHDGHIRVPEIIAAVEWVSSVLKNPGDLTSGSEALQLSAID DSTPEGAALLASARQILLNIGKKDEEVITVEDTADLNRIFASTRFNGDGI IPATAASDAETRAVIEDMMKCVGSVQDRSGLPGVSAELIEQFFTEAKAYS EWWQDAERDATSILPLGENTEAAKAAFDAVKAKIDDYFTRCKLAEFDQKA GDPLNPALSDYEALTNTDLSSTTEQLATLPLAKIEAKKPLSLNEGINPAW VGAIDALKRQVVQPLLADKEQLSADEWQALCDRFTAHQAWLDTKRGATVE SLGINRIRSILAGRYQDEISALLQKDQSLASASDAIDSVEKLIRYQRNLF QLLNNFVSFRDFYTAQNKAVFQAGSLYLDGRNCDFCLRVDDIEKHSSMAG LSGIYLAYCECQRRGGSEKMNIAAAFTNGDADNLMVGRNGIFYDRRGQDW DATIVKIVEHPISVRQAFWYPYKRIGKMIGEQIEKVASAREKSVQDQAAA GIADISQKAEAGKPPAAAPFDVGKFAGIFAAIGLAIGAIGTAIASVVTGF LGLAWWQMPLTVLGLILVISGPSVLIAFLKLRKRNLAPLLDGNGWAINTR AIINIPFGISLTQMATLPAGAQRSLTDPYAEKKRPWKFYLFVLLLLGSIA YLFHSGYLNQGTVDTLKKHFLSDKAEIGTEEASTAQEIPSASPAEEIADG QKTEDDKPPQPAAKEGVENGSVASPEPVSTVVPAPKPVSVPASH >NE2344 possible unsaturated glucuronyl hydrolase MTSPVLIVEDRLLSTDEIVNTLEQMFRRMEMMDVLCGQNFPLYSSGENSD WSVSPGGSWMGGFWAACWWLRAKMTGSAGDRQKAGEISQRLFRKLTADSG YRSLIFWYGAALGEIWLQNAPARELTHSSIAALAHSFDPRLNCIPLGMAM GGLTTGSCAISVDNFASLIQLLCFSREKQYHRIAQCHAETLLAACRGDKG AFHAEASFDGHEFQVKDRAGVWSRGQAWAMLGLSRAAAQWGEPYLSQARA ACTYWRDVHKGNLPRNRPDQTEDVKDPSAVVIASLAMLSLARLLPDETSW CKYAHQQISTVLHSPYFTVINPDSAFGSAGLFQGCCYRTRQNREEIVESV WGNFFLVAALAVLAGLIDPYDC >NE0633 hypothetical protein MLRLIPNKGLKKTINTLTVAEKARINHQDIEELLLRYQRSCPEDAARNIT EVTNPNQEKHEGLRVFVIQTWMALMVANSYGLETYQTLKTFMDRQGYKTQ PDSTYMATEQKD >NE2176 conserved hypothetical protein MKKVAIVQSNYIPWKGYFDMIAAVDEFILYDDMQYTRRDWRNRNQIKTPQ GVQWLTVPVLVKGKYHQKIRETEIDGTDWAAAHWKALVQNYRRSPHFTEI AAWLEPLYLAETFTHISQLNRRFIEAICNYLGIKTVIKNSWDYTLLDGKT ERLADLCVQAGGTEYISGPAAKDYVDEQVFKENGIKLTWFDYIGYPEYQQ LWGEFTHGVTILDLLLNCGKNAKRYMKYVE >NE0544 putative ORF1 [Plasmid pTOM9] METICMTEMKMSSPVEHGVLTCLCMKALFIAVIFGVTSDVTAHGVTEGDK GYILESTGILPIPFIYLGAKHMMTGHDHLLFLLGVIFFLYRLKDICIYVT LFAAGHVITLLSGVLFEVAVSPYLIDAIIGFSIVYKSLDNMGAFQRWFGF 
QPDTKIATFIFGLFHGFGLATKILEYELPADGLLPNLIFFNIGVEIGQIL ALAAILIAIAYWRKSGKFMNHAYNTNTFMMMLGFLLMGYQITGHFILFST I >NE1802 hypothetical protein MKSFGCGNDNFSRLRMCLRVGCIVGLSSWGFQVSAAEWNIQPRLTVSETY TDNVGLGGGGFGGFGGAGRGGEFITQINPGVSITGEGRRFKSNLSYTLNN LIYAKNERFRIRNQLNTDATAEIIKHHFFVDGRATISQQNAFLFGPQAPD NAVLTGNRRNIYMWNISPYVRQRFSNLASGEVRYVHGEVSSNANSFSNSS SDAAIFSLNSGSAFRTLGWGVNYSHTQIDRKYARSNLGRLQTIELERTTG TLRYIVTSQFSLIGTAGYERNSFISIRGRTSSPLWTVGFSWTPTKRTKID ASGGKRFFGNTYAASVDHRTRSTVWNLSYVEDITTFGQQSLAGGSILSAS MLGQLFSGIQGGDALLNQGLPLSFSDPNNFLTNRLFLLRRLQASLTLNGK KNSLVFRGFSYSRKSFSSDEEDADLIGIENAALTRDTTQTGGNLLWNHRL SPRTNANINLGYIRTSYDVTSQEDDNIIVTAGLNKRFTSNISGSIMYYHL HRESNRNNGSYDANAITATLNMNF >NE1077 conserved hypothetical protein MPTIQSVRRTQSGRPGKRAINLSLSADVLDAARQLDINISQVCDTYLREV VRHEQERRWREEHADFITAYNATIEAENLPLDEWRSF >NE1999 conserved hypothetical protein MNPGKPMPLVVHGWTIFAHPLFLAQIEVLIQQVEAHKQKDPVGFVKKNAS KRLAAITKLAFDIILQDPARPEYRQGGTLGDDYKHWFRAKFFQQYRLFFR YHTLSKVIVFAWVNDEDTRRAYESSDDAYRMFRKMLENGLPPDDWNQLLA EARAEGQRLQQFAARWW >NE1234 conserved hypothetical protein MTPQSAFMIAATVRVGQLQDLRTLLASMNTIPGHADPDNDLIPFGKLDRL HFARFVIIEAKTLQEIKEFGVKPRPWRPMLAFLGDIDGDMHTFLAELVER AESGLTKIFSHCDDFSTGNQNLLEWMKMRNVSPGANYVNWVGRTVRQIHE EAALHHSLSDCLQKIVAEVGRENIHTLRQKLLSHVEMEKYKGRLVLSPPE PTPSEWRTRNLLHKIGVPLVLLLFSPLLLVIAPFFALWLRKRERSDPELF IRPAYSHIEALSEQEDWDVSNQYSVFGDVKPGLFRLLTFKFILLLTDWFA RHVYNHGFLARIKTIHFARWVFMDNNHRVFFASNYDGSHESYMDDFINKV GWGLNLTFSNAVGYPTTRWMIKEGAQREHAFKYTQRRHQIPTEVWYKAYP GLTAVDLVRNSRIRQGVEIRQSDDAEIREWLSLI >NE1497 conserved hypothetical protein MDYFYESEHTKFMRELFAKRPELIEKQKEARAIWWDKDVDREALKCFEEA EVPQRSYVYFSWPDQEQETEK >NE1204 TPR repeat MRSVRKRTVLVSTLLALVLTQVARADDKGSHPAVIGDVHFKVECNATAQA KFNVAVAYYHSFQWQRVIATADDVLKVDPTCGMAHWVKALAMLDNPFAWP VTLSEKAIAEGPVLLDAARKAGLKTQRERDYVDALAIFFKDLNTTNYRER AESFEKAMAQLAQQNADDSEATVLYALILSRNFDPTDKTYRNQLHAAELL EPIFAREPNHPGVAHYLIHSYDYPPLAKRGIDAARKYAKIAPDTPHSLHM PSHIFTLTGFWQESIDTNRRAAEMADDSITHDGHHASDYMVYAHLQLGQD LAARKIMEQEQVRHGIDMIGVAYPYAAIPARIALERRAWREAADLPLYAR 
DTYPWKKYPQAEAVNAFARGVGSAMSGEPAKANFEAKRLIKLRDAATAMK LNYWADQIDIQAEVVRGLADFAEGKRDEGIAILHRAAEREDASAKNVVTP GPVVPAREMLATILERDRKPADALAEFEKVLEQHPNRYRTIAGAAQNAKQ AGNEQKADHYAELLLKLAEHADSPRPEIAEAKSMLGM >NE1977 putative transmembrane protein MKILAILLLLSILYSLGSALYFMIKDKGDSTRMVKSLTIRVTLSLVLFML MMLWVYIEYIHGN >NE1245 Kazal-type serine protease inhibitor domain MKQQKNQAGTQARNACVLAGKRSPTQFTSALGLPGSGCSGFNPQLYFSSD FGAGNGAFSSGRGLSPPSITRKTRKELPRYLNALITWLIVLSVSLVVAGC EEGNPPQPQLQPQVCGTIQGLACPAEQYCDLGIGQCKVADAQGVCKTRPT ICTREFNPVCGCDGKTYGNACGAAAAGVSIDHEGECKTAEPQACGGIAGI RCPDGLACVDDPGDTCDPEHGGADCAGICIAGQGQ >NE2562 hypothetical protein MKKQIAGLVLLGLACTATSVSAEEDRELRQKVEALEAKIANLEGRSENEE HGDSHGFDKHKFHGGVVLKQDAFFGFQTILDAGYEVADNIDFTFYSWLWT NPNFGKSSVVSGGNNVGGQGLWTEFGIGLNFRFLDNTLSINPNIGMLNGS LLSSEVVGEDIRAGEGVVPNLVVNYDNDYFAANLYVAYYMATRGPRARDF LHNWINVGVKPALFGLGKTLPINSVGIHWEHLWAAKNRIDSSLEGVVYNW VGPYIEFGLPKNLALRFAGGFDVKSDVSNNFYQASIKLNF >NE2539 hypothetical protein MSAHKWQFASRFRRHAFGWRSDTPVQRIKEAITEIKQVARKEPVLAAEGA ITLLEKLSPALEQVDSSSGALGSAVNKAIDTLVPIIVKADVEPKLRQRWL ERLWQALQDDEMPYIEVLGDYWGELCVTPELASHWADEFLPVVESVWSPK ASGHGFFKGTSACLASMYAAGRHQELWALLDKAPFKWWHDRRWGVKALAA MGKKAEAIRYAEESRGLNDPGWQIAQACEEILLSSGFLDEAYRRYAIEAN QGTTNLATFRAIAKKYPHKQPEEILRDLVASTPGAEGKWFAAAKDAGLFD VAIELATRSPTDPRTLTRAARDYAEKQPAFALAAGLAALRWISLGHGYEI TGTDVLDAYSAVTQAAVNAVVPTQQVNEQIRDMIASTQPGNSLMKTILAR HLAN >NE2241 hypothetical protein MIKILLLLTSVLVAMPVAAVDVAPRISDREIIESLAELKAGQKALEEKMD LRFNAMQEQIDQRFTAIDQRFTAMQEQMDQRFTAVDQRFTAVDQHFTAMQ KQIDQRFIAVDQRFEAIDRRLDFIQQLMLVTIAGIFGLIGFIIWDRYSTL RPMDMRLQRLEEDLERDLELQSPEGSKLTRLIHALRELAKEDKKVEAILR SFSLL >NE2039 hypothetical protein MLMWLCFPLYIVSMHYQQYVRSMSGPMTEMWGGNRVLSGIADYARMTVSG IRYSQIYKGSYERFNREILSNGLIFFHRPMTQLENE >NE2239 hypothetical protein MKQTNLKKQLVAVAIGGVFALGVTAQATAAGIFQYDLDGQGGSGETVIAD AIQGVANESLSLLADGKTLDGQGWVKFNTFLLSTVDQDYKYSEVLLYATF KITTELVDGTIGASGSEYKVTSFTFDLYKDLGNDNTFTVADASSSTHASV TAVGVDDYIASGELIVGSANIQAASGAAINVETTFNLQPGGGEYFFDPDP FYNILKAGFNSTGGNWAFTNNMLAVGSATGVIDFNSTPTEVPEPATLALL 
GIGLLGFGARRALVASKNA >NE2168 hypothetical protein MLPFSSYKHVLIYYIAVAAVLFSGFFSYVVAPHRQEIEVGHVELVNLDSS LIENRKFSDFTNAYIPEITEHLTMARSGWLPLWSNNTELGRPLYQISGFS SAYLPSWVITRLVDGPWRFITTLSLGFCFLAGLFVLLFTREVGLSPIAGL IAGLGLATSPLFMYWLTFPMFPAVWCWAAGALWAVTRLAKRPDILGWGVL AFSGYSLLMTAYPQPVVFHAYLLGGYGLWLAYHQARVSRLELAKFLTLAL SALVVGAALAFPVYRDLFILSSESARVAPDPSFFTMVLPKFASFTELVRF FVLSTIPEIFGNPIAPSFPFSYDGLSVTLIAIFFGVVALVTSFKETWGWW LAILIFCLLAFVHPLYVLGVKYFGFNLSRSTPLGSITLPLTIITAFGIDA LARRTHHRQFSSAVFAGAAVALVVIAIGVAYGVSQHISIHWEIVIGMLLV TGLLIAQYDRYRPLFLMMALVLVLGMTSYPLMLKQDPAQIAMTSPLVEKV RENLPAGSRYAVAAPGISVLPPNLNATLDLSSVHSYNSLSSTRYHTLIKA LGGEVQTYGRWNGAIDPDYAGTMFWMSNISLILSSGKLAHENLEFLSEES GIHLYRVVSRMGDSMQVTPPQLDMSSTKLVLDDPRGMVTNTPVKILDQGD VLEFEVNSSAPSVFLLSQKFHRDWEALAETNQGWQAAQTVEVNGVFQGVL VPQETRRVRLEFKPLARYAWIAHVFWIFLFVLIIFKFSQTFRRRVLERV >NE2209 hypothetical protein MKFQPLRGESITTGLLFSLFVLLIFSCRAVAQESTRNEFLSIAKSAVVLY DAPSLNAGKLYVAGVNLPLEVVVKVVGWVKVRDYHGYLAWVEDKNLGPKR FVIVKIPVGSVYQSPNPTSSLIFQAQQDVILELLGVVAGGWVKVKHRDGQ TGYIRTDQIWGV >NE1269 hypothetical protein MIEAPFNKAKYAIFMFYGIDDGKQVISSYPIFGQTGTSSSYTTGTITSYG NTAFYSGTTYKTPTRGVVGSRTSTDTVFKRYLNIDIIDIAKSGNGKVQKV YEGKAISSGTNGQLAPVMPAIVRSVFEDFPGKSGASRTSRQPVEK >NE1547 hypothetical protein MAFTTSSMLQSVCKTSTLKGCGKIMRSLCRTPRHRSYATTFVVLLAIITI LPAMRFMEDLIRDSFVVTLPENERLSSFALFTS >NE1986 hypothetical protein MLVFAALALTIAGCVYLYLASPNQKWLVQALPGRPALVAGGLLLAAGLAA WITVLRPLAGFFVTLHVAMVCLFAFPYIAALRGKGRRN >NE1504 hypothetical protein MKTDNKQTTIQLLITSLFALTLTACDSQQEATSNKKPVAQAQGPATGILA DSAVEGVSYSASSGASGVTDVTGLYKFNHGDSIEFHIGKLNLGKIPGTGL TTPIELAAGDRNKLLNLLVLFQSLDADNNLANGISIPKTAADALDASLDL KADPGTFPSSPALATAREAAGIAGSIKTADEANAHFLSQAVNLLGGHLWV NQDDTSLNFFRFSTDGSGEYLHGIATPDDSCDANRACGSKLVFTAGVEYG TAKATEYDERGFKLVSTPEVDTDLQSGLSHPRPNWRVYTNGNELIISDIV IVQREREQASLFGELFHISKPIELSSDDEVAETTVQEIRYHKMDNSQSIV GAWTMDKDSIKSPVFLFFPDNRYMLVDPVGSATQSTPAACAKPGVELATY AFDAASGTLKLSSFTYNTAGCAGLSEYSGKPITFKIDTGAQNATLSGERL APITLQRLSN >NE2105 hypothetical protein MVNGGITKPYAHKGRDDYQNEEEILLLVQFPLLFFWREAYSPERNFCSSM 
KTVVDPHFSAHTVLIASADKNPGIISIRT >NE1833 hypothetical protein MWKLLNLTGICTLILVVTLAIMTYLAIDSHPRIEREISITPEQIARAKDI LDTHRYQVRPGTSATVRIQADDLDNALNYLAYHLAQGHAKVTMHDKSAQI QLSLPIPPGMITGYLNLQATLTEGKSFPELSSVSIGKIQIPDILAGQLTE KLLAWLQTASPDARAGLDAFRKLRFSRNEVAISYFWKGWGIDKASYSPVS LPFFDRQALDKLSHYHHFLNEQNRKRVSHTITLSEILTQIMQETVRHSPN GNVLEEFRAAILVTAFHVVQFPLRLVIPETADWPDPVRINVTLDGRNDLA MHFMASAVITAYSDTTLSNAIGLYKELEDSRSGSGFSFNDLMADRSGTRF AEKAMASQDSARRMRNIILAGIHDTDLIPHWSDLPEHMSETAFKARFGST SSPRYHEMMDKIEQRVASLKWLRY >NE1985 hypothetical protein MANEICLDWWGKASAGAILGLGLALSLVGLYAYLGPGGIDAPGGRYSLMR YLEVFVWVAVFGFCFLFRSGRAAWAWLGAANLIAFTTLFACRFYFFV >NE0972 hypothetical protein MLSIKPAAEDLAARQPVWEALSDMFLDTDTSLSRQWRADQLARSPYSIDQ LEFILINEVYPICKYNLLSVAGEWAGFDPEWLKEKILRHLGSRFRFLHTL NLGCFTVHASVEWHATRHAILAARSIGTKNTT >NE2057 hypothetical protein MKKYLTGVVLFGLLTLFTGSTWAHSDEQLDGVAAPHGGQLRMAGPYHLEL VAKDGELRLYVTDHMDHEVLTKGGSGKANVFDKDGKKVSVTLIPVFANFM KGTGEFTITPETVVSVFVVLDGAETQAARFTPLKKASAKAEDEEEHHHGD ADQGEHHHHGDVEHEQQPTDQSDAAESEENEEHHH >NE1818 hypothetical protein MAHRIRSSGKSATETGEVSQDVWFDNWEKGLDLWELARHYLRTDTTISLL WFDSDDLPEVEVSRFGARIQDDGGLAELTGELPWPGRSRRR >NE1793 hypothetical protein MEEWSRQSINFTVRLGEMSVLAWPLNACVLKTHFTKLPIHPTVPAELPKL FRDSVDVVVTRSHPIESSLAKLSILPQAIQYIPSSYRRYWVVLDGNFEDY LKKFSAKSRNTLLRKIKRFAELSGGEIDWREYCKPEEMHEFYKLAMEVSQ KTYQERLLDCGLPSDQQFQENMLALAAEDNVRGYLLFYQKKPIAYIYCPV HDGIALYEYVGHDPEYQRWSPGTLLQYFALQRMFTASHIKIFDFTEGEGA HKAFFATNNQYCADVYYFRRTWLNLIKVALHASSDKLSDGIVRMLDKFGV KAAIKKLFRSKA >NE2436 hypothetical protein MSEVITVGDKPILLKAIELVLAEPAAIRKEALQLKDKYVTRYGSDRSEDE INAYAADKIISNYSYYTAFVGGTTALTGVIPGLGTVLAAFGGATANTALS MKYQIEMTMAIATIYGRDITIEEEKRLCLMIAGLGAISEMTRVGGKEPGK KASVKMMQQYLQDASLQTLRELFKKVGITFTKKAAEKAIPFGVGVIIGFS ANKGLTWYVGTKARDFFSVTDSIV >NE1607 possible transmembrane protein MTARIMRSLKLNFPYRRQQIPLVDYLLLFLGMVLLLAVMYTLKQTMSKIT YWEAREARIVQQQKHTRQPRTPMARINKATQQELKQADDILRQLNLPWEA LFDALELAASEQIALLSLQPSVTGQTIRITGEARDLAALVEYVQALELEP VLKNAHLASYKARQDHLRRPIVFSIIATWHESL >NE0473 hypothetical protein 
MKVITYYQVIADSTAQTDCAFFIEFMLTVIEETLSESQIITPQATLQDIP QAVLEIMEQYPGLAEFCQHPRSCTELQAFYHLNDREHFRKAVLTPLLDAG WLRRTQPDKPNSPRQKYFREH >NE1381 hypothetical protein MLDSNKGTSKVIVAPKIFDINKNENRGNLTIFLGKLRKYFTHNGSHEITI DFTQTEKFIAAGTLLFYSELAYLKQFINNETRLRYIPPKNPKAFEVLIQI ELYKLCGIRKPKSKNANKYDDVLNWKVACGNVVNNEQCAPTIEAYEGQLA EPLIDGIFKGLAEAMTNTVHHAYAEIREDGLNHKPSKNNWWMFSQARDGE LTVVFCDLGIGIPRSLPKKHPSIFHKMLSLGKISDHQCIASSVELNATST KMPGRGKGLGNIIEIASKNKAGGVIIYSNKGMYRLGPDATEPFSRDLKNS ILGTIICWNVTLSKVGL >NE1556 hypothetical protein MDFDTFRPILSGLVGGLVVYLLTYSGRKPAATEGGRRLLIYGLGIRIFTA ILIPSSLFIAYAAAHAHPDQAILAVCIAAAFFSYQVFFVSLAYDNDNIYY RSPIGGNHVIPWPDVVEVGYSWLMQSYYLRTKQVRRIWCSNMLRGYNELE EFIPKKADKLFHPELKSYSEAHIN >NE1597 conserved hypothetical protein MKPDTSSGYSPEDNKNGMQIITTYEAILAITDQMLQAAKNSDWDKLVALE QDCKRLTTWLMEQHTYEQLSEEQKKKKISLIHGILERDAEIRAITEPWMA QLQNKLTSYGHKRKLGQTYQTDS >NE2193 hypothetical protein MALPVTSSANQPTGTNNVAATKCKGCFCPGNPCQLCRLPPHTDDPIPENE PETCRLIREAVPPASFQPGENEYFANLDKATIQCIRSGDVIPNTRRVPGY PGRVYCKPGLPALGAH >NE1235 hypothetical protein MNNTVRAWLGKSREELDEIYRHATPGNIPAGDTRGTAILAGSFFSKTVAA FARLFAWQGKVFDLFCPGGQAGVLVNKITPFGLTFIVAKVYRDKSWLDGQ DTIVIDYSKTSFVAKVIRDEIREVEPGVYLGKVWWGKTRVLDFALTQSDT Q >NE0746 hypothetical protein MTYAEVAKKIRNRLRRYSLISIINVGLNHLTQQHDNKEKALRAMPWLPAL VMKLAIEDEMISMHGDLCPSAEFDACCNAIWNAKRGLDESVQVALLGVRA LMHAQFIFQRSETFGFLRWAALISRIDASHPCRSLFERVFSMTPDDFMMA AILLISQFKKEAPQQPIDLRDYSALPEELTKPLYQLVRLLSKDLSELRVQ LQGELRSRLDSKTKRSARQESERHEFPWLAKYPLLKLDQTRVLAWNPTIF FHGLEEFVHIRLSEFGQDYTDSFSQVFEDYVIELIQESGTHAITDQEFKC LGNKGMSAVDALIPHAEGNVFIECKMSLFADAVLLSDHPPFVSEKLKRIR KAIVQGWKVGDLLRSDKIKLSDAKSADNDYLIVVTSRQLLFGNGLHLKQM VDEQFFDHIFPESNFMSPSKEQLSRMPPQNITILSIEEFEHLVGAVKSKK VTYLSFVQQLSKNASNPKTAKMVADQEIRKYVDKWYIPNLLTNSRDRVVA QLNAVFNCRDRIKSRTYK >NE0790 hypothetical protein MVKNIRSCLVLMVLSCFTVQTYAEDVTVRLSVQNIHHPEWERATDEELAV LRGGFVLPNGVHIDMSLEKFIHLNDVLVHSSSLQLPGAGVVLQAGMQNMV SDSITVPELSTFVQNTLDSQHIEALTTINIEVSNLKGIAANGGGQQVFTE FLAPALLR >NE1202 hypothetical protein 
MITIWQNLSINIQRLFIALCILTGIVLIGMQFHVNSQGSMSDTYPKGFRG GTCTIESDTLLVGYSAYFIPVDYEIPDDSMSALSVVPVLCDKVPGPGLLS ITVDLLYPASIREQPVAVSLARKNGERIMEPLLSIPARNYQSGIISQEVR IDESGEYVLQLSGTDEYQSEFHLDIPVTIGTKWYEPFVPYWPMLVLGVVA AFFYNLRRIVN >NE1555 hypothetical protein MDGTKAPPVSSTLEGIMNTAPELIIARKAIAKIAIRFPSLTMIEEPTVPV ELSIRLPVQPGLNYEVWLALQNNDELHFSVGNFWLEWFPCTESSRVKEYI SAVTGFLSSQYRVLEHYRGKHCVKAELQAPSGGDWKTVGTWSNLLSFLPL RSSLREVSNTQPIIPPDLPQQAAPDR >NE0941 possible (AF047705) unknown [Nitrosococcus oceani] MTSIKWLGGVLLGMVCSIQVQAHGGLSLAEDMCKLTIGPYTMHFTGYQPE STQEKEFCEDIPNIGRTIVALDYIDEALRTMTTEVRIIRDTGAEPGSEGN LDELTVFHSPPKVYMNGSVTFEHDFPAEGKFVGLVTIRDNGTEHISRFPF AVGTGGKPDMLYILGALALAAGAGIFFFKKKQNL >NE1790 hypothetical protein MKNFIRKLIRALTAIRLETDRYKHSTLRIIINWLTAFLKDLFSAQEIRLY GLADPEDGANRIRQYVSKEMADRFYRKYNPGAAVPNIEDKFIFTTLCLQH HLPIPATYGIFKQGIVKTFDERIFQESSRFRDFIRELEPGEYLLKPNNGM LGLGLSILEIDDQGSLKFQGKSITADDLYRELCSIEVTLPASSKGDSVDM DFEGLLFQQRIANHPEITKLTGFKMLQTIRICTHVTDNNQVEILFAFMKL AGQEGLADAFNLGKTGNMLAKIDPATGKFCNVYAMDQQQGYLVETTHHAV TQANLLNFTVPHWQACLTLAMKLSTTFLPLRAVGWDIAITDNRPVVLEGN DNWVPVVPFDINIDKLKQYKLKS >NE0258 hypothetical protein MFIVFSGSAAFAQEALLSAKEDYITCWKQPCVDVAGSEWSEKNPNGVGIS VRMGTQSGVTDDQIKTVLTRDFKKFGMTNIKFFFEQNDAPAAGIAFHVRG GTEGLFFIDNVREQVAGIARRAANTNPVFQ >NE1133 putative protease MDSLTELFCLIDDFCCQFEPALERRLLETGVKKRKRCSGLSLSELMTLTV LFHQLRFRQFKSFYLVYVCRHLQAEFPKLPSYQRCVELLPRCVAPLAALF EMLKGQCDGISIADATAIAVCDNRRIARHRVFADSARPGKTSNGLGLLDS NSMPSSIPGVN >NE1442 hypothetical protein MKPDRLWFHRGDFLDCVYRLFTRFWLFFLILVLCFPAIAATPRFDFTLEN IRHPVFSIRSAGIKLIGAPSPTLEINLGEVAIGKQTWHGLRLRCNPVHID RESMNCNTGTLQIGERFMTMIFRLSLQHKQFVLEIRPASNKSKEKWRLEV NWQASKWQGVLQVVNGEGKFLADLLPQGDDRIQVHQAILNGNIRLSGNNA SVSALSARLGISKLSFSDASGLHAGEGIDLQLDADAQQKRNDWQWRGKIT WPEGEIFWQPFYFSGEGHQLTARGTVKDERINITQGEFNLAGTGKADFSA VAGIADQSLQQAWLSARDLELSALFGSIIRPLAVDTALAETEAAGQMNID WRYQGNDNQELIVGLQDVSLTDAHGRFAVERLNAHIPWNSNEKRDGSIRF SNAQMVGIPLGETYIPIGTDGMRFSIPRAEIPVLDGKMLIENFVASMQAS GWQWQFDGLLAPISMEKLTESLHIQPMFGTLSGTIPRMSYANSIMTMDGE 
LVFGIFDGVAVARNLALSGPLSLTPHLTMDMAMYHIDLDLLTRAYSFGNM QGRVDVEIDDLELINWEPVKFDAKLASSAGDYKRRISQAAIKNLIALGGG LAVTAIQKSFLGLFEQFGYAEIGWSCKLRGSVCNMGGIGPATHDGGYMLI KGSGIPAITITGYNRKVDWPELLERLRHAIESGNPIIH >NE2426 hypothetical protein MTKPLDRLIGMTLLSAEIASGSAELRFSRCDFSAYSTYSSFPDFGSLVGQ TVQSIVGSMDRLVIRFAFGEFFISLHPDDYRGPEAFCARFADGPWVVE >NE2179 hypothetical protein MNRIGNGLLVAMAITWSGMVFSAVQNLPQPVFSGQDGEQAAADNAASPAP AEQAAPASTEAETAAKAEEQALPSGIGWKLVRSLEMGDSGKFVHMVLIEK GRQADKTIYSSAIHRLCAKEKEFCRIRFWVQSYLIPEKVSLTLEQQKTQQ ADHLFNRAAGIHRTLWACTVDSTSESCIQ >NE0824 hypothetical protein MKSSFGLSALVGLTLSLHAYAGGNPEFVKFPEKYEQIFTHYDTANRANQT QLAKFYANEIAAESYKKGEEAAPGSIVIMEIYAPKKDAEGKIQSGEDGLF VIDKLAAIAVMEKRNDWGSAFKADDRSGNWGFALYDPEGKAKDNDLTCAQ CHNPLQKQDNLFSFQKLVDYVKAH >NE1341 hypothetical protein MMDVVEATRILRREFAVWGNRLDRFFLARYSLLVDKPPKDFGSRLQHRLR QFLVFIHLVPPRVVRRAWLPTLKHSSSAPDARALLIWALGMERDSLRNAC LGFQRFLASRQDLAPVLVTDVADFAWFSRLGWMVEYLPELEGKGLPYQER KRDYLAWRYRDAVVVPAAAGLLDEENWNRLLQME >NE2229 possible long-chain N-acyl amino acid synthase MQIANHLQSGTAVKNIQPAHIPFSPYQPDSTSSDDTDPRFPRSQKYQISR HPGSNSSHIAETDCLLQRNGYSIHLVNSLKQRIKASTLIKRMYASRGYQT ESASVFSTSSNQYTFEARQSQQLIGTLTLTIDTGKGLLADTLYQPELDQF RRQGRRLCEVSKLAFNPETSSKEIFASLFHMAYIFAHRIHGVDDSFIEIN PRHATFYKRMLGFRQVGELRTCPRVNAPAVLLYLDLEYMKEQITTQAGQF DQKTKSIYPHFLSQNREKEITQRIQIEHTHFVPPSSRKSTFNHHQDYFQP A >NE1530 putative signal peptide protein MSVKHFITAVSLAMVSTVIPLTVTAEQHAHDAKSAKPGHDMNKMWAEMRT RAVGMAVSVAADEKGKLWLVRMQDGHIRVSHSEDGGKHFSEGVTVNPQPE AILAENQNRPKIAVRNGVIAVTWVQALPKVFAGNIRFARSVDGGRTFSEP VTVNDDQGEISHGFSALTLGDNGRVTLTWFDGRERDAADKGGQKYVGTTV YYATSEDGGASFSANRKLADHACECCRIGMTLDSDGVPVVFWRHVFEGSM RDFALARLDSQPKVLRASEDGWEINACPHHGGDIAVDEAGSRHLAWFTGN PQNPGLFYRRADGENMTAPHAFGDLDFQPGYPAVFAYGKKVYLVWREFDG NNYQLMASVSADRGDTWSAARAVATTGGAADLPVFVVGAQKPLVVWNSAR DGVRFFNAEGDL >NE0115 possible M. 
jannaschii predicted coding region MJ1674 MTTLISFLGKGIADKTTGYRTATYRFDDDSKHTTPYFGLALAGYLRPERL ILVGTAGSMWDVFFEQQDASDDDVLALIDAVRESRVDADMLSAQEKRLTK RLGLPVICRLIPYARDAAEQTEVLLTLAKLVHRSEEVFLDVTHGFRHLPM LALVAARYLAHVKDVKVRGLYYGALEMTSTNGETPVLQLDGMLQMLDWVE SLATYNKDGDYGVFASLLQQDGLPEGKAKQLTRAAYFERSSNPVKARETL GSVFSAIKTHNGPMGVLFRDALTERINWFKEPDRAAWELALADAYLERRD YVRAVIYLYESFVTRAVLEHKLNPNDFSERDEAWKDARQDNKQVRKLEYL RNALAHGIKSDDKEIIRMVNDENCLDDQLKKFRRSLFN >NE1752 hypothetical protein MNIRSTGLSILIGALFAIPATTATAVSPVTDTSIRQGTASYLILADAHQH QGHQGGSAGQGHSGGSGHAGHGQGGQGEGKGRHGGGGGHSGHGGHGKGDM EHHGHPPSYAHSVAMQAEALGLSDEQLGKIVRFHLKEDKQAHERIKQKMM ESMKAFRKAVGEPATDDETLRKLGQAHIDSFNEMVKYHIDERKAVRSILT PEQIGKLKAVKSDHDH >NE1599 conserved hypothetical protein MGYSLIFTDAYNQRAARWLRRHPDLRTQYLRTLQILQTNPYHPSLRLHVL SGKLQGIYAISINLSYRITLEFLIEDKQIIPINIGSHDVVY >NE1120 putative orf; Unknown function MFNMALVFFLIAVLAGILGFAGIAGTLAWAAKVLFFAGLILTVVFYLLGK RTPPV >NE1238 putative oxygenase MSHFPIVIVLTGLLLLSGCDALEPEISCLSVLMQGDIQHVAKTREQRFLG KVTGRRAHCLGGDHAVALNRNPWLDWPNFWGTGDSLSLSSSPLASSFFGP NERGINSALYELELQRIELIKFNLFDNSGTYQAYVTGRDGRAGPVLQVWP EMQLPPTHPRYKDVEHNQEHQVCSGELIRFRTVTGICNDIYNPLMGSTHQ IFARNVQFDTTFPDLGLDEMARNRHGDRLGLLKPDPQVISRKLFTRTQSQ PDKCRNDDELSGDLEKFACDYKKAPALNVLAAFWIQFMTHDWFSHVEEES DQSAWMTVGCITQRIDNIEQPLAAKEARQLGCRPGDRIHVAPIDDDTPPA SFMHDGHLYRTRAPKTTRNHVTAWWDASQLYGYDERSSQRVKRDPEDAAK LALIHVRESVDRGDESGYLPTFEVDDPIDPAWSGQEAAAFPDNWSIGLSF FHNVFAREHNAFVEEFRKQAAKTPDADSGLRNPAHPEMIIRYRDVTAGEL FNVARLVIAAEIAKIHTLEWTTQLLYNEPLYRGMNANWHGLFHEHAAVSE VLREIIRQLDDTEGISNSLHAAFAGGAGIFGLGNHRYEGAPLYSLVDRNR KDIWTLTRNEDINGGVNHFGSPFSFPEEFVTVYRLHPLLPDLIEYREWHN NPNIIRQKIPVIDTFRGKATGAMRQKGLANWALSMGRQRAGALTLQNHPR FLQNLKIPHLQSSTRQIDIAALDLIRDRERGIPRYNEFRRQYGLKQLTSF DDFIDPRVPGDSSVRREQEQLVRTLREVYGQHRCDASRLITNAQLNDDKS PINDCLGHPDGSLVDNIEDVDTVVGWLAEFKRPHGFAISETQFVVFVLNA SRRLFSDRFFTSSFRPEFYSILGVEWVMHNGPGPEIMEEGTYNGHRQPVS PLKRVLLRTLPELADELQGVVNLFDPWARDRGEYYSTQWKPRRGAEGDEV FTR >NE0605 hypothetical protein MMRVKLFLAVVLSLSVSCFAGSGGDQPESLPETGQTKNASTGDEQPTRIE 
TGKKEAESQSSQETGLDKQSLMIDYCRKHTC >NE0234 hypothetical protein MQIFDDYSNLTGDVTKDAADRALRHFIYRTQRGGMVIPAELQAFILNGIE RQLAEGTGGWFVPARGRPTISNDAGWRFVAMIAWHEYYFIAKGHSEIRRK NVSDFLIKQFGHTYCDFDLSDSGARRMIEDVNNRGFSGIDSGSPSGINNR DTDLLNAKLFCRTELNMRGLAELSHAVAMVRRLNKNRGHK >NE0889 conserved hypothetical protein MAIEKVKVTGITFFDGTLDDGKHIDSGKVFIEHLLDFRKGTAKGSSTTAY PLASSKEAKALMNHDFPLVCEVEFLTLSSNKGPKTVINALRPVPAAASPA R >NE0292 conserved hypothetical protein MQQHGWTLLFHDNLIEQLMRLRAAVLRAQENDPEDFGSNTNVKFFRALIQ LMQDVVPGDPVRDEYRQGNTMGPTYRHWRRAKLGRRYRLFFRYDSKAKVI VYTWVNDEQTLRSSGSKSDPYTIFEKMLGRGNPPDDWNALIQASKPNWSQ LE >NE2440 hypothetical protein MKAHISTVKLSILVPAALLSGFFLTGSTAAIADSSSDSNRANSEKYEKKE DIRKYDQRNDLDESRRGIEDTVIPETDSNSTNQQDHNLRNPSEQSPAEIM PGRN >NE2387 hypothetical protein MMKNLFVLLQSITAIFPVSIFFTYIIMDEGDQFTYEHYLVTALSAFPFFM VLLIKYFISGFENK >NE1156 Bacterial regulatory proteins, MerR family MDYDILHGIVIEESEALSLSELCQICNVEVEWIMALVNEGIFEPAGTRPE DWFFSGVALRRVLVVRHLQRDLDVNLSGAALVLELLEERNALLAKINLY >NE2061 possible (U92432) ORF4 [Nitrosospira sp. 
NpAV] MRRESLLKKHEGVVKSTGGGVVLKQSLYSIVTAFVFVILCSASTVWGHGR VSLEEDNCVRQVGENMVHLNTYQPQYDQAGHYCTEIPAAGDTYLVVDLID PALRNMPVSMKVFRGEEKGGEAILQVKADYHPDGVINGIGKLDKGLYSVM VTAEGVPPLNYYYQLRVEMVDYGKLVRTWAGPAVAILFLGWLMYKLVQSG RLRSWFKSQDD >NE0973 hypothetical protein MKNIVALLTVFILFMTQSASAELYDPDEVQVYIVPMIDFPEPAAAQLSKI LSDDMKIWVKSSVRLGDLEAATLPGTRQLSGDSIIEKSYPIVTKLPGSSK NTMYVLLTTRDINSETGAFRFQFSMHHSEMRVSVVSMARMIEFIDGKPVV NHLVLNRLYKMCKRAIGEQYFGWKRSTDINDIMYSPIMGMPDLDRIGIHH KENDDENEVEPVDKNRISI >NE0725 hypothetical protein MTTPNDADNETYSQSLENDMSRLLPDQSVLGEPEPWESWETSLCLWSIGI GIAALVILGILVDWFLLPGQK >NE0238 hypothetical protein MDNYTGQSARLLPLVFYQDCSLVDIQLYNPSNNNKLIFSAIFNDEYALLL NGKSDTLYSLNRHVLSLDDSKSAIDYLRFFCSYVQSEYGPFQIITHLDEI PFKDEGMDQNIRDTIKASIHNPEYLEGSFERDGWQAFKACVLYGGALFSS VMRVFSNGRVSLEEDRAIADNLLLLQRQYHGIFRTPL >NE1239 ALOX5, Lipoxygenase MNFILLKERHMMNKLPQQEENRRTVENRKNYLLRRQAQYQYAYEYANTIA VVRKLPCREIPGPGYWLRGGINLLQLIPSLPSLLVTYMRYLLGKPMESYR DYIFYPFSPPNPALVDNFQQDLIFGLQRVIGVNPVVLRAVTSQHPLPQKL PESEIQRVFAKYVDETDYATAITQKRVYILDYADLEILQRNPGQIDGGRK QYVTTPIVVLFLQADGILRPIAIQLYQDAGPDNPIYTPNDGNLWLAAKTF AQVADGNHHILVTHATRIHYVMEAIIMASRRQLYKSHPLCVLLNPHLRHT LNVNHQHTFLRDRKGRPGRYGELFAGDYDATTQCMANGMTSFDFRASAFP NDIASREVDNPDLFYPYRDDGVLLWNAIQHFATEYIDVCYQSDGDVAEDC EIQAWAHDIGARDRGRIPGFPARFASRQELAETIGHVIFLCTAFHSCIHF NQYKYPGFVPNMPHSAYAPPPVGKGAEMDADGLLKFQPAFRAAYSQTWTY FQTNFTVNRIGQYPLRQFDPAARDVIERFRKRLQEIEGRIDQRNSSRPVP YDRMNPRIIPNGVTV >NE0458 Plec1, hypothetical protein MLFKSHLRSILNISPQSENEAESNVVPLYTGKTASDDTLLTTNDTPAQEE YDAPADDSNASRLEAENLAIREAKSRIETEARARVTAEARARVEASARLA AEARIKAETAATEEARARARAEALAAHEAHARQELEARLRQTVEDGLKAE KETVTALRAKVQAEAATTEKARRRLQEEALALEKSRERELAEQRAIEAAI ARRRTHEEALKIALAASAAEAEATALARARIEEDEKNIALANAKAEAERQ AIEEIRLRTEAEADLTSKAQQKLQGEISAREAEQSRLDAEKKAIEAAQSR RDLDLTAKSEAEARAAAELEAATAQRTRIEAERKARAMAEQVALAEQEAA NAALERSRADTLLLEKTRAHTLAENEACAAAEARMQAKEQETAIFNEKAQ TDQAVTDTIKERIQAQETAIRRARARAAAEAVARKTAEDKITAETHAAEL AEKRIALDRQVEKEANELAETEARLIENKRKQAEAVQQAKSAAAARIEME QKLTELSTRIAQNQVIAMAKTEERLKAAETAAATVLHKIKLESSALKAIQ 
ERIEQDALAVERAIAREAVEAMAVEAALARIRTDEAAIAQASRKIREEIE TTKMIQDCFDEEIPSDTVMHDKEQGDTHSSDDGETLLAATEERMAAETSD ALDESDDSANQPESGMLNDPVQSDVSGNSIESNDTEVLPDKSETAEIE >NE1240 Ptgs1,COX1,Cox-1,Pghs1, putative cyclooxygenase-2 MNEFFFQIIFRLVNRFPWISRVASRITWLRRWISDTFINWQAYATNPRPR PFSMAAPYTTWQALTDRTFTGRHLPEAEGEQNLPDLKSVVNLWRRKENRE IPSVDTSILFSFFAQWFTDSFLRTDFFDRRKNTSNHEIDLCQIYGLREDI THLLRLKKDGKLKYQVIDGEIFPPYLFNVEETTADNWVFADREFENLHPR AVLEFVFDNVPEERLKRMFATGLEHGNSSIGYTLMNTIMLREHNRICDVL KEAHPTWDDERLFQTARNIMIVLLIKVVLQDYVSHFTQFGFTLDPTPGMA ERQRWYRTNWISLEFNLLYRWHSMVPEYYFVGDQRYTLDEFRNNTALVTH QYGIGTMISAASQQKAGRVGLYNTPQFFFDPLPVGADNRSVMERSVEMGR QAKLRSFNDYRQAFSMPRLRSFEELTADPALQRELKELYNDRIDDLEWQV GIFAEDHDEGFSLGRLMVRMVGYDAFTHALTNPLVSGYVHNEKTFSSVGQ SIIEETSLLADIVKRNVRDSDTVIASFRTSAVA >NE0944 amoA1, Ammonia monooxygenase MSIFRTEEILKAAKMPPEAVHMSRLIDAVYFPILIILLVGTYHMHFMLLA GDWDFWMDWKDRQWWPVVTPIVGITYCSAIMYYLWVNYRQPFGATLCVVC LLIGEWLTRYWGFYWWSHYPINFVTPGIMLPGALMLDFTLYLTRNWLVTA LVGGGFFGLLFYPGNWPIFGPTHLPIVVEGTLLSMADYMGHLYVRTGTPE YVRHIEQGSLRTFGGHTTVIAAFFSAFVSMLMFTVWWYLGKVYCTAFFYV KGKRGRIVHRNDVTAFGEEGFPEGIK >NE2063 amoA2, Ammonia monooxygenase, subunit A MSIFRTEEILKAAKMPPEAVHMSRLIDAVYFPILIILLVGTYHMHFMLLA GDWDFWMDWKDRQWWPVVTPIVGITYCSAIMYYLWVNYRQPFGATLCVVC LLIGEWLTRYWGFYWWSHYPINFVTPGIMLPGALMLDFTLYLTRNWLVTA LVGGGFFGLLFYPGNWPIFGPTHLPIVVEGTLLSMADYMGHLYVRTGTPE YVRHIEQGSLRTFGGHTTVIAAFFSAFVSMLMFTVWWYLGKVYCTAFFYV KGKRGRIVHRNDVTAFGEEGFPEGIK >NE0943 amoB1, ammonia monooxygenase, 43 kDa subunit MGIKNLYKRGVMGLYGVAYAVAALAMTVTLDVSTVAAHGERSQEPFLRMR TVQWYDIKWGPEVTKVNENAKITGKFHLAEDWPRAAAQPDFSFFNVGSPS PVFVRLSTKINGHPWFISGPLQIGRDYEFEVNLRARIPGRHHMHAMLNVK DAGPIAGPGAWMNITGSWDDFTNPLKLLTGETIDSETFNLSNGIFWHVVW MSIGIFWIGVFTARPMFLPRSRVLLAYGDDLLMDPMDKKITWVLAILTLA LVWGGYRYTENKHPYTVPIQAGQSKVAALPVAPNPVSIVITDANYDVPGR ALRVTMEVTNNGDIPVTFGEFTTAGIRFINSTGRKYLDPQYPRELIAVGL NFDDESAIQPGQTKELKMEAKDALWEIQRLMALLGDPESRFGGLLMSWDA EGNRHINSIAGPVIPVFTKL >NE2062 amoB2, AMMONIA MONOOXYGENASE, subunit B MGIKNLYKRGVMGLYGVAYAVAALAMTVTLDVSTVAAHGERSQEPFLRMR 
TVQWYDIKWGPEVTKVNENAKITGKFHLAEDWPRAAAQPDFSFFNVGSPS PVFVRLSTKINGHPWFISGPLQIGRDYEFEVNLRARIPGRHHMHAMLNVK DAGPIAGPGAWMNITGSWDDFTNPLKLLTGETIDSETFNLSNGIFWHVVW MSIGIFWIGVFTARPMFLPRSRVLLAYGDDLLMDPMDKKITWVLAILTLA LVWGGYRYTENKHPYTVPIQAGQSKVAALPVAPNPVSIVITDANYDVPGR ALRVTMEVTNNGDIPVTFGEFTTAGIRFINSTGRKYLDPQYPRELIAVGL NFDDESAIQPGQTKELKMEAKDALWEIQRLMALLGDPESRFGGLLMSWDA EGNRHINSIAGPVIPVFTKL >NE0945 amoC1, ammonia monooxygenase subunit C2 MATTLGTSSASSVSSRGYDMSLWYDSKFYKFGMITMLLVAIFWVWYQRYF AYSHGMDSMEPEFDRVWMGLWRVHMAIMPLFALVTWGWILKTRDTKEQLD NLDPKLEIKRYFYYMMWLGVYIFGVYWGGSFFTEQDASWHQVIIRDTSFT PSHVVVFYGSFPMYIVCGVATYLYAMTRLPLFSRGISFPLVMAIAGPLMI LPNVGLNEWGHAFWFMEELFSAPLHWGFVVLGWAGLFQGGVAAQIITRYS NLTDVVWNNQSKEILNNRIVA >NE2064 amoC2, ammonia monooxygenase subunit C MATTLGTSSASSVSSRGYDMSLWYDSKFYKFGMITMLLVAIFWVWYQRYF AYSHGMDSMEPEFDRVWMGLWRVHMAIMPLFALVTWGWILKTRDTKEQLD NLDPKLEIKRYFYYMMWLGVYIFGVYWGGSFFTEQDASWHQVIIRDTSFT PSHVVMFYGSFPMYIVCGVATYLYAMTRLPLFSRGISFPLVMAIAGPLMI LPNVGLNEWGHAFWFMEELFSAPLHWGFVVLGWAGLFQGGVAAQIITRYS NLTDVVWNNQSKEILNNRIVA >NE1411 amoC3, ammonia monooxygenase 3 subunit C MATSILKDKTAQQVTDKPAYDKSEWFDAKYYKYGLLPILGIAVFWVWYQR TFAYSHGMDSMEPDFDRIWMGLWRVQMVVIALAAFSIWGWLLKTRNTAEQ LASLTPKQEIKRYFYFMMWLGVYIFAVYWGSSFFTEQDASWHQVIIRDTS FTPSHIPLFYGSFPVYIIMGIAMIIYAKTRLPLYNKGWSFPLIMVVAGPL MSLPNVGLNEWGHAFWFMEELFSAPLHWGFVILAWAALFQGGLAIQLITR YSNLVDVEWNKQDRAILDDVVTTP >NE2574 attINeu, qacE-like protein; integron orf MSEQIFFEHDGVRVSSARFVVKGATYPISAITSVRAVRSKTFPLLAIVLI LIGFGILLGGEPTLLIFGLATIALGVVWIIKKKELYSVVLQTSSGESQVL ESQDRQYIHSVVDALNNSIVQRG >NE2337 cycA1, Cytochrome c-554 precursor MKIMIACGLVAAALFTLTSGQSLAADAPFEGRKKCSSCHKAQAQSWKDTA HAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDCVGCHVDGFGQKGGYTI ESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQ DFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVK AMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK >NE2042 cycA2, Cytochrome c-554 precursor MKIMIACGLVAAALFTLTSGQSLAADAPFEGRKKCSSCHKAQAQSWKDTA HAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDCVGCHVDGFGQKGGYTI 
ESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQ DFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVK AMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK >NE0960 cycA3, Cytochrome c-554 precursor MKIMIACGLVAAALFTLTSGQSLAADAPFEGRKKCSSCHKAQAQSWKDTA HAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDCVGCHVDGFGQKGGYTI ESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQ DFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVK AMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK >NE2406 flhC, probable flagellar transcriptional activator transcription regulator protein MKGKSILSEGKQIQLATELVRLGARLQVLEASTTLSRERLVKLYKEVKGA SPPKGMLPYSEDWFTGWQPNMHSSLFINIYNYITRYTKVRDIDAIIKSYQ LYLEHIEANRLQRILSFTRAWTLVRFVESKVLSVTSCVKCTGNFLVHSLD IQSNHVCGLCHVPSRAGKTKRVAQEARAAQEAGAGELCVI >NE2407 flhD, probable flagellar transcriptional activator transcription regulator protein MGTNQILDEIREVNLSYLLLAQQMLREDRIAAMYRLGIDEDIADILVKLT NSQLLKMAGSNMLLCRFRFDDSLIAEILTSHKQDRALTQSHAAILMAGLP AEKIS >NE2529 mcrC, possible mcrC protein MTAVAEQEESASNSAEGFIGRIPVRNLWLLMLYASDLFRTRGIGKIGLED SPDDLPDLVAEILAHAVEVRQRRRLSLGYRSRDAVINRVRGRIDVLTTER HQLMDRGLVACWFDELTIDTPRNRFVRAALESISRIVQRKDVAHRCRALA GGMKAMGVSGDAPARAQMSTDRFGRNDADDRFMVAAAKLALDLALPTEAS GANVLSLPDREATWVRRLFERAVGGFYEVVLSPQGWRVLCGGTMGWQIEQ RTAGIDKILPTMRTDVVLDHPSTGQRIVIDTKFTSIVTSGWYREETLRSG YVYQIYAYLRSQVGCGDALADHASGLLLHPAIGQMVDETAVIQGHRIRFA TVDLTASTSDIRLQLLRFFDPNQPVTGQ >NE0842 merT, MerT mercuric transport protein MSEPQNGRGALFAGGLAAILASTCCLGPLVLVALGFSGAWIGNLTILEPY RPIFIGAALVALFFAWRRIYRPAEACKPGEVCAIPHVHTTYKLIFWIVAV LVLVALGFPYVMPFFY