GenColors

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Nitrosococcus oceani ATCC 19707, ATCC 19707
Gene type: CDS

Number of genes found: 204

# Nitrosococcus oceani ATCC 19707, ATCC 19707

>Noc_0288 DNA polymerase III chi subunit, HolC
MTKVDFYLLTSRQPQASKRFTCKLIEKVYRLGHQVYIQVEDQAQAQEMDD
LLWTFRQGSFVPHALVNSEEATGTPVLIGYDGEPGGKLELLINLGAEVPA
FFSRFQRVAEVIGPDETHRHAGRQRYRYYRDHGCSLETHELNF
>Noc_0171 DNA gyrase, subunit A
MDEIAQEVFPVNIEDEMKHSYLDYAMSVIVGRALPDARDGLKPVHRRVLY
AMRELGNDWNKPYKKSARVVGDVIGKYHPHGDAAVYDAIVRMAQLFSMRC
PLIDGQGNFGSIDGDSPAAMRYTEVRMAKIAHELLADLNKETVDFVPNYD
ESEHEPAVLPTRIPNLLANGSSGIAVGMATNIPPHNLGELLSACIALIDD
PALSIAGLMSHIPGPDFPTAGIVNGARGIQEAYLTGRGRIFMRARTHVET
DEGSGKQSIIVTELPYMVNKARLLEKIGNLVKEKKIVGIVGLRDESDKDG
MRMVVELRRSEMPEVVLNQLYLHTQLQSVFGINMVALVDGQPQLLNLKLA
LEAFLRHRQEVVTRRTMFELRKARERAHVLEGLAIALANIDSIIALIKAS
SSPAEARDGLIARFWPPGVVIDMLQRAGAESSQPEKLETAYGLQQDGYRL
SPAQAQAILEMRLHRLTGLEQSKIIDEYEQVIIFILGLLEILANPERLMA
VIREEFMEMREQYSTPRRTEIQDNHVDLRLEDLIQEENVVVTLSHSGYAK
SQTLDTYRSQKRGGKGKSATAMKEEDFIDKLFIANTHDTLLCFSSRGKVY
WKKVYELPQGSRVARGKPFVNLLPLEEGERINTVLSIREFEKDKYIFMVT
ASGVVKKTSLADFSRPRSSGIIALDLREGDQLVGADLTDGVQEIMLFSSG
GKVIRFSERDVRPVGRTARGVKGMELPSEHRVIALIIAASGTILTATELG
YGKRTAIAEYRLQGRGGRGIISIRTTPRNGEVVGAVQVEKEDEVVLISDG
GTLIRTPVDNISLMGRNTQGVKLINLQEGERLVGIERIVEDGEAGI
>Noc_1585 DNA mismatch repair protein MutS-like
MSSSLKEYLREMIHGIYPPILQGSEAAPSPHSTQPSRVGEGVIDESTFQV
IEADRLFDAINTAHTVIGQAVLYRSLAQPLADIKIIKAKQEALQELASNP
SLREKIESLTKKASKREKSFYRLLFSKFTGFFGSSRGDTEIEGYGYATYE
RGTTFMLELVKDARTLPAPESNYLRILIDDLKGFGATKIHSLMKGPVYLT
ESGIRTREEKKWFIPAVKFRPTLFKPLFILAVLLGIVALFMYGPMVLGIS
FSSSPILILFLLPALIFYMPMVGTFDRDSCIYPLQKRYQESEDVHTALEA
LGKLDELLAFHHYGKSFGSPTVLPRVIAAKNHTLILREAKNPILGKDNPN
YVPNDIDLDGQKLTFISGPNSGGKTAFCKTIAQIQLLSQVGCYVPAEDAE
ISVADRVFYQVPEISSLEDVEGRFGKELKRTKDMFLMTSPESLIILDELS
EGTTHAEKLETSFHVLNGFYRIGNNTLLVTHNHELAERFKENKIGQYFQV
QFIGEGPTYKIIEGISKVSHADRVARKIGFGKEDIERYLKEKGFVSG
>Noc_1657 TatD-related deoxyribonuclease
MLVDSHCHLNLLDLTPFDGSVHSVMEESRKAGISHMLCVSVDLETFPDIR
ALAQEYAEISVSVGIHPNASKDVKITVEQLLELAATPKVVAIGETGLDYF
RSEGDLAWQRERLRIHIAAAKECGKPLIIHTRQAKRDTVRILKEEGADEV
GGVLHCFTEDWETAKEGLDLGFYISFSGIITFRNAGVLRKVATQIPSNRL
LIETDSPYLAPVPHRGTPNRPAYVRYVAQCMADLRKISLADLEDLTTENF
FSLFSEAALPA
>Noc_0995 Helicase RecD/TraA
MNAFHPEASPEFLAGTIERVTFHSEETGFCVLRIKVRGEHNLVTVVGNTA
TVTPGEYVECRGEWINDRTHGLQFKAKHLKVIPPSTLEGIEKYLGSGMVR
GIGPHFAKKLVQAFGEQVFEVIEHTAERLTELDGIGPKRKARVIQSWAEQ
KAVRAIMVFLQSQGVGSARAVRIYKTYGDHAIERVRENPYRLALDIHGIG
FKTADRIAQRLGIPADSLIRAQAGVRHVLQEFTTEGHCAMEQDKLVETAS
KLLEIPVPTIEQAIAQEVTAGQLIAEVIHDKSCLFLTPLYRSETGVAKHL
QRLLQGRPPWGEINTDKAIPWAEEKTGLRLSPSQRAAVTQAVGSKIIVIS
GGPGVGKTTVVNSIIRIVQVKQIQIHLGAPTGRAAKRLAESTGFEAKTIH
RLLEFNPHAIGFKHHRGNPLHTDLVIIDEVSMVDVGLMNQLLQAIPSHAA
CILIGDRDQLPSVGPGQVLADIIASKKITAVHLTEVFRQAARSKIVVNAH
RINQGKMPERTHEAAPLSDFYFIPTESPETIQDRLLELVSTRIPKRFGFD
PIRDIQVLTPMNRGSLGAQALNALLQQQLNAQASPKINRFGWTFALGDKI
IQTVNNYDKEVFNGDIGKITRIDIEESLVYISIDDREVEYELGELDEIAL
AYAISVHKSQGSEYPAVVIPLATQHYTMLQKNLLYTGVTRGKKLVVIVGQ
PKALAIAVKRTHVTSRLTHLSARLANG
>Noc_0636 Exonuclease
MYKELLWIFTQLPDDYIVLDTETTGLPDENGLPDIVTLGITVVSNREIAE
SVEFKTRPQKRISEEAQSIHGITNKQAAAFDSFDSQWQQISEYLKHQLIV
IHNASFDWPILLDHVLRHGLAMPEIQGVFCSQKAAIPWAQAMDLPCSHRG
PSLDTLTKALGVEDLRVKEDGLHGAKIDSRQLAQVVQVISRNENMEWVLI
VNHRG
>Noc_0419 DNA methylase N-4/N-6
MSVYSHARGIGDLPSVSEKKSIPFDRKIKMDGLKFLGRLVDNSVPAAFFD
PQYRGVMDKLKYGNEGSRQQLRAQMKQMPEETIFDFVSEIARVLCPSGHL
FLWIDKFHLCEGTSSWFDGTPMKTVDLITWDKERIGMGYRSRRKAEYLLV
AQKKPVRAKGVWTNHSIPDVYQEKFQRDMSFAHPKPIGLQAKLIDAVTNS
GDVVIDPAAGSFSVLAAAQQTGRKFLGCDIALGEFEWS
>Noc_2402 conserved hypothetical protein
MAAVIPPRKNRVGWRDDDKNLYKVRPPMEKAFRPLKRWRGRATRYAKNSA
SFLAAVPIGCSALWANIS
>Noc_2283 Gram negative topoisomerase IV, subunit B
MSGNYDAAAIEVLTGLDPVRKRPGMYTDTQRPNHLAQEVIDNSVDEAIAG
FATQIDVVLQPDGSLTVCDDGRGMPVDLHPEERLPGVEVILTRLHAGGKF
SSKSYRFAGGLHGVGVSVVNALSSRLEVWVRREGCEYHIAFANGERVSAL
KAINKVGKRNTGTVLRFWPDPVFFDTPKFAVARLKHILRAKAVLCPGLCV
TFSDVATGETVNWRYQEGLLAYLRDELRELSYLPDPPFTGEMQGNTEEVC
WAFTWLPEGGEVVTESYVNLIPTPQGGTHVNGLRAGMLEAVREFCEFRNL
LPKGVKLAPDDVWDRVSYVLSVKLLDPQFSGQTKERLSSRECASFVSGVV
KDSFSLWLNQHVVTGETIAELAIGNAQRRLRQAKKVVRKRIQAGPALPGK
LADCTSQEVSLTELFLVEGDSAGGSAKQGRDRVFQAIMPLRGKILNTWEI
DSAQVLASQEVHNIAVALGVDPGSDKLDSLRYGKICILADADSDGAHIAA
LLSALFFRHFRPLVEAGHIYVAMPPLYRIDVGKRVFYALDEAERQGVLDR
IAAERIKGKINIQRFKGLGEMNPMQLRETTMAPETRRLVRLTVTPNDDPD
QLLDRLLAKRRAADRRRWLEESGNLAEL
>Noc_0703 Exonuclease SbcD
MRVLHTSDWHIGRTLYGRKRYEEFEAFLNWLAETIQQHEIDALLVAGDVF
DTSTPSHRAQELYYRFLCWVAASSCRHVIVVAGNHDSPSFLNAPKELLKA
LDVHVVGSTTSDLEEEVLVLRNEQDAPELIVCAVPYLRDRDIRVAEAGES
VEDKERKLIDGIRTHYAAVAALAEQKREELGGDIPIVAMGHLFTAGGQTV
DGDGVRELYVGSLAHVTAEVFPASFDYLALGHLHVPQKVKGSETMRYSGS
PLPMGFGEAKQQKSVCRVAFDPIEGHSRAASVQLIDVPVFQKLERIKGDW
GGISSRLLELSAMGPPNGLRIWLEVIYEGDEVVGDLRERLEAAIAGTPME
ILRIKNNRIIDRMLGQIHAEETLDDLNVNDVFERCLAIHEVPEDQRPELL
RVYRETLSSLYEDDVQAQ
>Noc_0544 conserved hypothetical protein
MTYNPEIHHRRSIRLQGYDYDQAGAYFVTVCTQNRACLFGNIVAGEMQLN
DAGLTILRWYAELPNKFPAIACDEFICMPNHVHFIVVNAVGADLCVRPDL
CVRPDLCVRPDLCVRPDLCVRPDLCVRPDLCVRPDAVPTINKGEHVGSPL
HGVVQWFKTMTTNEYIRGVKQNGWPSFPGKLWQRNYWERIVRNETELDRI
RDYIRNNPVQWESDKLFVNPA
>Noc_0655 Protein of unknown function DUF1156
MAEIKTPKKLIEVALPLDDINTAAAREKSIRHGHPSTLHLWWARRPLAAA
RAVLFAQMVNDPGYQQGEGFKYGVNKKEAEIKREKLFQIIRDLVKWENTN
NEEVLNRAREAIWESWRETCHLNRNHPQAAELFNPDKLPAFHDPFAGGGA
IPLEAQRLGLESYASDLNPVAVMINKAMIEIPPKFAGQRPVGPLPQGEKQ
GKLMDDWSGARGLAEDVRRYGHWMREEAFERIGNLYPRIKITQEMVAERP
DLKPYQGQELTVIAWLWARTVKSPNPAFSHADIPLASSFLLSTKKGKESY
VNPLVEGHNYQFEVCMGVPPAEARNGTKLGRGANFTCLLSDTPIDPKYIY
AQAQSGNLGQRLMAVVAEGKSGRIYLTPTAEMEQAASAASPDWKPDALMP
ENPRWFSPPMYGMKSYGDLFTPRQLVALNTFSDLVQEACYKAIADAKAAG
MTDDGIGIDDGGRGATAYGDALAVYLTFAINKLADRGSTICTWDSSRSST
RNTFGRQAIPMTWDFAEPNPLSDSTGNFMGGIGWANDVLSRMIPSSGGIA
VQQDAATQNISAEKVISTDPPYYDNIGYADLSDFFYVWMRRSLKSFYPSL
FATMAVPKAEELVAIPYRHGTKEKAETFFLDGMTQAIHNMADKGHPAFPV
SIYYAFKQSETKEGATSNTGWETFLEAVIRAGFSIDGTWPMRTEMSNRMI
GSGTNALASSVVLVCKKREIEAESISRRDFQRELREQMPDALEAMIGGET
GTTPIAPVDLAQAAIGPGMAIFSKYEAVLNQDGSRMSVHDALILINRAIT
EYLSPESGSFDADTQFCSSWFDQYGWSTGPFGEANVLAQAKGTTVDGVNT
AGVVESGGGKVRLLKWAEYEADWDPIKDNRTPIWEACHQMIRSLNNQGES
AAGELLAKMPEKGEPIRQLAYHLYTLCERKKWAEDARAYNELIGSWHAIV
TASHEVGHSGSQAELGLDF
>Noc_0139 DNA recombination protein, RuvA
MIGRLRGILLEKRAPFLLLDVQGVGYELEAPLSTFYVLPAMGAEVILYTH
LVVRDDAHLLYAFASEKERGLFRSLIRVNGVGAKLGLGILSGIEAESFTR
CVQEGDTVSLTRLPGVGKKTAERLIVEMRDRLLDMPESGVAGMRPDRVDG
SAPGTVAEAVSALVALGYKPNEASRAVRRLDTEALTTEEIIRQALQRML
>Noc_2686 Transposase
MKTTLQIKLLPDGTQHSALKETMRVFNDACNAIAEVAFREQCASKFELQK
LVYADVRKQFGLSAQLTIRAIAKVVEAYKRDKSKQCFFKPTGAVVYDQRI
LSFKGLDRASLVTMQGRVSIPIQMGQYQRVQWHRAKGQADLVLVKGAFFL
LVVIDTPEAPPIDPSGFIGIDLGITKVATDSDGGSFCGSTVERVRQRYHR
LRRRLQSKGTRSAKRHLKKIRRKEAQFRRSQNHIISKRLVEKAKDTGRGI
ALEELKHIRSRTTVRKSDRAKHSGWSFFQLQSFIEYKAKLAGVFVQYIDP
WYTSRTCSACGHADKANRKTQSHFQCVSCGYTDNADINAAINIAARADVM
QPMVMRATTAKDSPSTATSLPL
>Noc_0318 Tyrosine recombinase XerC
MEEEQQIWIQKFFTHLQYERGLSPQTVVSYRRDLAKAIAFCGRNGIGNWQ
ELDAQRVRALVVAHHQAGLSGRSIQRLLSALRSFYVYLQRENIVDHNPAQ
GISAPKGKRALPPSLDVDQTAQLLNTQPCSDLLLRDQAILELFYSSGLRL
AELVGLNLSALDLDTALVRVVGKGAKTREVPLGRRAKVALLAWLPVRAGW
INQSQEAVFITRHGRRLSPRAVQKRLRLWGLRQGFDVAIHPHRLRHAFAS
HLLESSGDLRAVQELLGHADISTTQIYTHLDFQHLAKIYDQTHPRARKKR
>Noc_0877 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_2036 Exonuclease VII, small subunit
MPKRNKLDFESALAELETLVSRMEQGEVSLEESLRLYERGVKLSQLCQET
LRDAEQKVQILQQKQGKEQINDFPNEE
>Noc_0138 Crossover junction endodeoxyribonuclease RuvC
MARILGIDPGSRITGYGLIETNNKKTVYIAAGCIRAGEGGLAERLGQIFQ
GITGIIQAYHPDEVAVEQVFMHQNPGSALKLGQARGAAICAAVNAVLPIF
EYTPSQVKQAVVGRGDAAKSQVQYMIRLLLKLPAEPATDAADALACALCH
EHAGPLLAGLAGQVRGRRRGRYYR
>Noc_2052 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_2740 conserved hypothetical protein
MKFDDNFNVIIGRNDVGKSTILEALEIFFNNETVKMEIGDHNVHVDDPEM
SIQVSFRPEDKQYTIDTVPTDLHREYLLDENGKLTIKKSWDCSKDKLTAT
SLKTYIIANYPTAFEEPLISLKIADLKKLLDGYADKHVARERVAHPAISY
>Noc_1882 putative transposase
MPRTHRYARVGQRCYGLCDWHAKGRTNAIGALIGKALLTVGLFTANITAN
IFTAWVKQDLLPQLPENAVIVMDNATFHKRLDTQESLRKAGHTLLFLPPY
FAELNPIEQKWAHIKAIRKPLSCSIDDLFKIESFYVT
>Noc_0988 UvrD/REP helicase
MINPSIPDADARRQALNPHGSFIVQAPAGSGKTELLTQRYLMLLARVEAP
EEIVAITFTRKAAAEMRYRIIAALASAKENQPPKTEPAKTTWDLACAVRR
RDEGKGWSLEDHPARLRIQTIDSLCASLTRQMPLLSRFGAQPGITEDAER
LYRQAAHRTLAEVESGAEWSASVETLLRHLNNNWGKIERLLSAMLARRDQ
WLRHLAGDEQLQREILEEALQGVIQDGLKRASRYCPESMLAEILELARFA
ASNVAATAPASPIQACRDLWNWPGIQAEALPHWLGLAQLLLTDEGEWRQQ
ANKNIGFPAPSAARGQEERQFLKEQKRALLDCLARLEGEEALRRALHGLR
FLPSASYTAAQWQLMEALFQLLPLAVAQLQLVFQEQGEVDFTEVAQRAVV
ALGEAEAPTDLALALDYQIRHLLMDEFQDTSLSQYILLERLTLGWEAGEG
RTLLVVGDPMQSIYRFREAEVGLYLQAWWQGVGLLPLTPLTLSVNFRSQQ
GIVDWVNQSFRQILPVAQEIATGAVPYSQSHAFHPPLMAPAVQIHPFFEK
GSLAEAKRVVHLVQQTRQAAPNSTIAILVRSRGHLAGIVPLLREAGVGFR
ALEIEPLGSRPIVQDLLALTKALFHLGDRLSWLTVLRAPWCGLTLADLYV
LAGEDKQAAVWDRIQEDACIQGLSEEGRERLGRLRQVFAKALGERRRTSP
RRQVEGVWLALGGPACVTDSIALEEASVYLDLLEREAVAGEILDFRALEE
SVAQLFAPPDKEADETLQIMTIHRAKGLEFDTVIVPGLGRPPRHESHQLL
LWAERPTVHYPTGDDGALLLAPISGAGRSQDSIYQYIKRLHEEKGEFENG
RLLYVAVTRAKHRLHLFGQVDLDDTGKPKVPSKHSLLASLWPAVKGKFEA
TALLQRETENREEGRDPGDSSISGFISRLSTQWQCPAPIAGVKADIEVLP
EAASALDPIEFDWAGETARHIGTVVHRYLQIIAGEGPEHWSLARITALAP
ALRQALMQQGVMPNEMENAIERAREVLRQTLEDPRGRWILASNYRQARNE
YSLSGVLNGRIIRAVIDRTFIDEAGVRWIVDYKTGVHEGGGREEFLNREQ
ERYHPQLERYAALMALKVPGTEKIRLGLYYPQIGGWREWEWSKKIF
>Noc_2515 Phage SPO1 DNA polymerase-related protein
MDSRQLHYLETMGIQVWQQRQPTSTGKKPLVETGEELIPLAPSAEVASEW
EELAVQVASCTACLLHCGRTQTVFGVGDQAAKWLIVGEAPGVEEDRQGEP
FVGRAGQLLNAMLEAVGLQRGQVYIANILKCRPPNNRDPLPEEVAHCEPY
LRRQVALLRPRIILAVGRVAGQNLLKSSLPLGRLRGSVHQYPETTIPLVV
TYHPAYLLRAPREKRRAWQDLQLAHKVYREF
>Noc_0344 DNA adenine methylase
MPPNSISVRGYQRPFLKWAGNKYRLLTRIISVLPPGKRLIEPFAGSAALF
LNAEYERYWVNDINPDLIALYQILQKEGKEFIHYAGCLFTPHNNTPKAYY
RLRARFNTTADVAEKAALFVYLNRHGYNGLCRYSGSGIFNVPFGRYQRPY
FPSKEMMAFHIKARRARFTCLDFRKVLARIRRGTIVYADPPYTPLSQTAC
FTHYSGNGFGPEEQKALTQSAQRLAKRGIPVLISNHDTPSVRQAYRNAQL
TGFSVTRLISCKGTQRTPANEVLALFI
>Noc_0957 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_1659 DNA-directed DNA polymerase
MPEFYPWHQSHWNSLMASQIAKRMPHAILLMGPEGMGKRSFASNLIQALS
CEYPQKGGHGCGSCAGCRLQLAGSHPDNLTLFPREGKQTISVNQIRQLHS
HLALKARQSLFKTVIIYPAEEMTLSAANSLLKILEEPPGRTVLLLITSAS
LRLPITIRSRCQQLFFTISKSQEVKMWLQQRLPPSMNLDAVTLLRLSGGG
PLRALEYVEQNFLFYRKEFINNLFSLTQGKLDLFEMVRSCLKDNLGEPLY
WISTLVEDTIRLRSGVSKRFLVNHDVANSLQLLARRASFEMLFALLKKAS
CNREIWKGQININPQLLLEEILIQWLICFSKVNHEHT
>Noc_2255 Tyrosine recombinase XerD
MISSEQARFSADQGHLERFLNALWLEEGLAENTLAAYRRDLERFSRWLHS
QGRTLIGAQREDLLAYLAHRLEGGDKSRSVARSVSSLRRFYRYLMGEKIR
DNDPSDRVESPRLGRLLPESLSEEEVEALLAAPKTEDSLGLRDRTMLETL
YATGLRVSELVNLTLPQLNPRQGVVYLSGKGNKERLVPLGEVALSWLDRY
CREARLGLVRSQVNEILFLTRRGGAMSRQAFWYLIKRYARQSGIQKALSP
HTLRHAFATHLLNHGADLRVVQILLGHADLSTTQIYTHVARARLQQLHQQ
HHPRG
>Noc_0860 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_1216 ATP-dependent DNA helicase RecG
MPLSILTSEKPCSLESRGNPNPLALDTPVTALKGVGPSLANRLARLGLCK
VQDLLFHLPQRYQDRTRVAPIGTLQKGREALIEGEIQLCELRPGRRRSLL
CSLSDGTGQIFLRFFHFSAWQQNSLTPGARVCCFGELRQGAAGLEMVHPD
YRCLPEDRPTVTETYLTPVYPATEGLRQSSLRDLIQGVFKDLGEEGIIDY
LPSGLLEGVGLPTLNEALIYLHQPPPDAPLNLLAEGKHPAQQRLAFEELL
AHHLSLRQLHLRASRLQAPLLSSEGQLQQRFLATLPFPLTAAQSQVVQEI
LADMAQESPMQRLLQGDVGSGKTVVAALAILQAVEAGYQAVLMAPTELLA
EQHLRVLQGWFCPFEIRVEWLTAKRTTKARRESLERLEGGETPVAVGTHA
LFQEGVNFHKLGLVVVDEQHRFGVEQRLALREKGRYGNCCPHQLIMTATP
IPRTLAMTAYADLDTSVIDQLPPGRIPVATAVASDRRRDEVVARVRRACR
EGRQAYWVCTLIEESDSLQAQAAEESAAELAEALPELCIGLIHGRMKTQE
KERAMAAFKSGDFHLLVATTVIEVGVDVPNASLMIIENAERLGLSQLHQL
RGRVGRGAADSYCVLLYHGPLSELSRARLACLRATNDGFEIARRDLELRG
PGEVLGTRQTGLPQYRVANLMRDQELLVSVGQVADRFQQRYPAQVASLIR
RWLGEESRYGEV
>Noc_2196 Adenine-specific DNA methylase containing a Zn-ribbon-like
MTIETKFNIPLVASLALREKQIQQNYRPIIAVHKWFARRPGSLFRALALA
EFGEAPLADLYFTANNFPGRQVADPFMGGGTPLIEANRIGCDVTGFDINP
MAAWIVREEIEHLDITVYQEEASRFLHKLRHEIGPLYVTDCSLYGDTDVP
VKYFLWVKVISCESCGQEVDLFPGYVLSQNARHPKNVMVCADCGELSEVN
DRAVPGVCKSCDATLHAKGPAGRGRCVCAHCDHENRYPRASQGPLQHRLF
AIEYYNPHRKAQHKGRFFKKPDAKDLARVAEAKQRWHEFHAHFVPEQKIL
SGDETDRLHRWGYSHYREMFNHRQLLGLELSCRLIARVKNERVRHALATN
LSDLLRYQNMLCRYDTWALKSLDIFSVHGFPVGLIQCESNLLGIINNKGT
NVGSGGWTNIIDKYTKAKRYCDTPFEVQRHGTRNVQIPIQGEWIGEKLNG
EQRRNVAIHCADATTVKLAPNSLDAVFTDPPYFGNVQYGELMDFCYVWLR
RLVGNEAEGFWRPSTRTDGELTGNVTRSWGLPRFTEGLARVYRHMAEALQ
PGAPLAFTYHHNKLNAYFAVGVAILDAGLTCSASLPCPAEMGGSIHIHGT
TSSIIDTVFVCRDTGHVPRRWLFESTDQLAAIVAHDLAQVAEGGHKPSMG
DTRCIIFGHLTRMAIWNLRLTWEAKLSTDEKLDRFAKAVRALADPDDLLE
RLQADGRVSAPAEPLFSAATTLGRIRDAVSFRSSLCRDRKRSRHLCR
>Noc_0325 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_0048 Transposase
MTRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKADLTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_0672 Transposase, IS4
MKPQTPPDLPTDDLFRHRLENLIDTRHELAKLAALIDWEFFDAQWGEAFC
ENGRPAIATRLIAGLHYLKHTYGLSDEQVVQRWAENPYWQYFCGERYFQH
ELPLNPSSLTRWRQRLGDEGMESLLSATIDAAIASKAVKARDLKCVTVDT
TVQEKAIAFPTDSKLYNRARERLVRLAKAHGVPLRQSYVRVGPRLLFKNN
RYGYARQTRRMRRTAAKLKTVLGRVVRDIERKLPKQSASVQAAFAESMAL
TKRLLDQQRHDKNKLYALHAPEVECIAKGTAHKRYEFGVKVSIATTNRSN
LVVGAQSLPGSPYDGHTLKKALHQVERLTGQRPERCYVDLGYRGHDVDDV
DVFKARQKRGVTRTIRRELKRRNAIEPIIGHMKNDGLLHRNYLKGVEGDA
INAILCGAGQNLRLILRYLRIFWLKIQPAFIQYLLLAPPRAA
>Noc_0919 MutS 1 protein
MPANDPQKPHTPMMQQYLRIKAEYPNTLLLYRMGDFYELFYDDAQRASEL
LDIALTSRGRSAGEPIPMAGIPYHALDSYLARLVRQGESVAICEQIGNPA
ASKGPVERQVVRIITPGTVTEEALLEARRDNLLAALQKEGDVFGFAVLDL
CSGRFNILEVASESAATSELARIRPAELLVSEDLALILVDSKTEAVVRPL
PPWYFDRESAQRQLCRQFGTQDLAGFGCEEMKTAIAAAGCLLHYVQDTQR
TQFPHIHALQVERQETSIILDPSTRRNLELEESLSGDSGRNTLIAVLDHT
ATAMGSRLLRRYLHRPLRDQTLLKQRQQALATLLEGGLSDVLQTLLRGIG
DIERILSRVALRSARPRDLVQFRQALGLLPKIQESLLQLNRDSLLLQSLQ
EDLGPFPNLHELLQRAICENPPVLIRDGGVIALGFDSELDELRHLSGNAG
QFLVKLEQRERERTKIPTLKVGYNKVHGYYLEITRAQAHQAPPDYIRRQT
LKGAERYITPELKGFEDQVLSARERALAREKALYEELLEQFMEPLPALRA
CANALAELDVLHNLAERAKTLEYVAPLLSDQPGIFIERGRHPVVEQTLED
PFVPNDLTLHEARRMLIITGPNMGGKSTYMRQTALIVLLAHIGSFVPARR
AVIGPIDRIFTRIGAADDLAGGRSTFMVEMTETANILHNATEHSLVLLDE
VGRGTSTFDGLSLAWAVVSHLANKVRSLTLFATHYFELTTLPECLPGVVN
LHLTATEHKEHIVFLHAVKEGPASQSYGLQVAALAGVPQEIIAQARQQLM
ELENNTWQKSINGGGPQLDLLAPPADHPAVQILQDLDPDELTPRQALEKL
YELKQLLDLAVTH
>Noc_2749 NUDIX hydrolase
MSYLHQIKACNSYTLKDFRPFYVDEVQIGHIRSSFAEKLRSWPAVFRVSP
AAVYLAPDLHSFATRTEKVKTVLKALVEEGALPRWHGEEYPVTASSREAA
LFAIDRGAAPYFGIRAFGQHLNGFVNDGDQLKIWIGRRSPNKWNAPDKLD
NLVAGGVPHGVPLRENLAKECWEEAAIPPELAAQALSVGYISYRMETAQG
FKPDVMYCYDLELPPDFVPQCQDGEVEEFYLWPVEKVAALVRETNSFKKN
CNLVIIDFLIRRGFITPEHPDYLEMVAGLRVPLEA
>Noc_2244 Protein of unknown function DUF927
MKANNEKGNLHNLFLFPEGETEAPGAASGKATKAKIPSSAKRAPKVKADA
VNDKGTPAPSLNGESNPKTQAAETTPKAKADKKKQVDIRPPCFAVHDDFC
LVNGRETCPGVWRYDVKETKEGTVLVREWVATPIHVRAITRNEQGENYGF
LLEFLDDDKKWKTWSMPRRMLSGSGEDVRKALLDRGARIAPGKGGLLNRY
FMKQFPKRRVTSTSRVGWTDDGETFVFPRECISSLRGKEAIFQAEMLAEA
DYPKKGTLEGWRRNIGQLCEGNPVLTMAVSAALAGPLLLKTDKSEGAGIH
FLGDSSKGKSTALQVAASVWGNHEFMQSWNSTANGLEGIAAARNDTCLII
DEISEGNPYELGKIAYMIANGRGKSRANRIGEAKGIRRWRIVALSTGEKT
LSSMLESVKIDANSGQNVRLLNIPSTGFSYGAFDCLHGFASGRELADALK
QARHHDYSLVGYAFIENLLKRRSPNLPSRLKDITDELKPLVNTTIEGRAA
DTMALFILAGELGIEYGLLPWKPGAAMEAGKILFELWRDNQTGDGTEDKQ
ILKNVKDFIDRHGDSRFQFRGTQPDKDFVTIRDRAGWFEMNDDNERVYLF
HSSGLKEAGGGFELKRVAQALDRAGWVTKKGKGRLNLSYDFRDFKGRLYA
IKPESNLD
>Noc_2403 possible transposase
MGIARAAPSRGQGRLGRHRQRQSPIFINAVFGIMRTVAPWRDLPPDLGHG
SNTHRCFIRWRDKGIREKLLATRIDEPDYEWLMMEDSHCKVHPHGAGARG
GHRDMSRTKGGSTRPYIGPWMRWACRSALLSQEVAERIVGRLAA
>Noc_0690 hypothetical protein
MRRQRMELKQDDTPEFWAEAMKRVLEYYKQGLSLAEAAERLSIPKVTLTT
WGQQSQRGKAPAAPLVPWLPGPGCVCSKLPQTTGGYLKQERPR
>Noc_1794 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_1505 Transposase
MTEHDSTTVQRSYTFRFYPTSVQRQQLAMEFGHARWVWNTCLTWRGRQYR
VHDKRVTGVDFSRQLTFLKGLGPYAWLKEASATCLIQKLRDQDTAFRHFF
AGRAKYPRFKKRTHTQSIRYQLDQRQVAGMYRAGEFLKLPKLGALKLKWS
RKPQGIPKMVTVTQDCVGRYCVSFMCEETLQPLPRKPNGIGVDLGVCDVV
VTSEGWKSGNPRHLRTHTRQLRKTQRRLSRKRKSSVRWHRQRIRVAKAHA
RVSNTRQDWLHKLTTALIRQAGFIAMETLNVRGMMANRRLSKALGDVGMH
ELKRQMEYKAKWYGREFRQVDRWAPTSKTCSVCGAVQKAMPLKVREWTCP
DCQTAHDRDINAARNILILATGGRPGSHARGGVYQPAAAYGC
>Noc_1594 Transposase
MYTSLYHVYMITQRAYKFRFYPTPTQKRQLALEFGHARFVWNWALETRTK
AYQERGERLNNIGLSRQLTALKKAEYPWLSEATAGCHTQKLRDQDTAFKN
FFAGRAKYPRFKRRHHTQSVRYQLDQRHVARNFNAESQRLRLPKLEALKL
KWSRDIEGIPKMVTVSKDPAGHYFVSMACEVTIVPLPARRNALGVDVGVK
DIAITSEGWKSGAPKYTDRYARQLKRAQRRLSKRQKGSGRRYQQRQRVAR
IHARIKDSRRDFLHQISSKLINENQVICLEDLHIKGMLRNRRLSKAIADC
GLYELRRQIEYKAAWTGRDVLIVDRWAPTSKTCSACGTVQESMALKVRAW
TCGCGASHDRDINAAKNVLFFGTAGSAGTSKARGAVKPPRAVA
>Noc_0993 hypothetical protein
MAKRTTKRKTAKSARQQVPQVSEAISRAEKATTLAQKKLDKAAERLDKAK
ERAAQAVTKAKETKRATTIAAAERAREALTKAKEAKREAAGELRQAKANL
KAEEKAERDRIKSEEQEKRLAEKKATAKAHAIAVFEKKWEQQWERKQRTK
KTVKRTQRKVSTPSKEKKITAL
>Noc_1331 Helix-turn-helix, Fis-type
MTYSLDMRNAIISFVKNGGSKTEAARLFNVSRNTLYRWLGLDDLAPKKHG
PRSRKIDKGKLKEHVEDYPDMFVHERAEIFGVHASSISRALKRLRIVKKR
ARV
>Noc_3024 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_0223 DnaB helicase
MPELAYQEKKDEFVEGLRIPPHSMEAEQSVLGGLLLDNQAWEQIADILGS
EDFYRQDHRLLFLSIGEMLEQSRPCDVVTLSEWLKNKRLLEETGGLAYLT
NLARNTPTAANIKAYAEVVRERSVLRQLIRVGTDIAGSAYQPEGRESGEL
LDEAEKRVFEIAEQGARGQRSFVNIKDLLGKVVDRIDHLFEQDSPITGLP
SGFADFDLLTSGLQPADLIIAAGRPSMGKTTFAMNIAEHVAIKRRVPVAV
FSMEMPGEQLAMRMMSSLGHIDQHRVRTGRLEDDDWPRLTSAISLLTEAP
LFIDDTPALSPTELRARARRLMREQEGLGLILIDYLQLMQVPGHRENRAT
EISEISRNLKAVAKELKVPVLALSQLNRSLEQRPNKRPVMSDLRESGAIE
QDADVIVFIYRDEVYNEESPDRGTAEIIVGKQRNGPIGTVRLTFRGQYTR
FENYVQDAYGAGSGFPAADKMN
>Noc_2831 DNA mismatch repair enzyme MutH
MTIFSLPPPANESELLARAQALAGSSLEQIAARLDRRVPQDLRRAKGWVG
ELLELALGATSGSQAVPDFPALGVEMKTLPLQADGRPKESTYVCTVPLTD
SEACWETSWIRRKLNRVLWLPVEAQAGMPLSVRRVGMPLLWSPNSEEEAI
LRADWEELMDMVCLGELEMITAHYGIYLQIRPKAANGHALCEGIGEDGAR
IRTLPRGFYLRSSFTAALLGRYYAT
>Noc_1364 Helicase c2
MEYALFSAAELLGEDGPLARQIEGFTPRRPQQEMAEVVAKALEGKQTLVT
EAGTGTGKTYAYLVPALLSGIKVIISTGTKHLQDQLYHRDLPLVKNALGI
SSRVALLKGRSNYLCLHRLHLFTQEGEASHGRLAGDLVKAQLWARQTSTG
DISELGGVADDSPVWPRITSTADNCFGQGCPSFSKCHLMAARRQAQEADI
LVINHHLLLSDMALQEEGHGDLLPPAKAYILDEAHQLPETASRFFGRRVS
GRQLLELARDSGAEQRNDAPDMLAIRDACQWLEKTVADFRLALGLGERRL
AWRKVADMPPVALALEQLSRALADLEVELEPAAVRGKGLEHCFKRCGSLR
EQLAMFAEEASEDFVYWFEAYSRSFILSASPLEISENFQAYMECRPGAWI
FTSATMAVGEHFDHFNHQLGLEMPASYRWESPFDFQRQALLYQPVGLPEP
HDSGYTQAVIEESLPVLEASRGRAFLLFTSHRALKEAYSLLKDRIAFPLL
VQGSAPRSELLRRFRALGNGVLLGTSSFWEGVDVRGEALSCVIIDKLPFA
SPGEPLLQARIEAIRRQGGNPFMDYQLPNAVIQLKQGVGRLIRDVHDRGV
LMLCDPRLRSKSYGRLFLKSLPSIAQSQDIKEVEAFFQTG
>Noc_1187 chromosomal replication initiator protein DnaA
MVTQQLPLPIGDSGAPSFENYYLAAANRESVAAVERCGQGKGDRFLCLRG
PSGVGKTHLLLAACQIAAQKGERVAYVPLKRAVIMAPEILGGLEVAAFVA
IDDIDHIAGYRHWEESLLHLYNLLQEGRGRLLLASTDKPSTLHWLLPDLR
SRLGWGLGYQLQPLDDHQKHAALQFQAAKRGLELPDEVAGFLLRHSERDM
HSLSSILAQLERASMAAQRRLTVPFVRQVLDI
>Noc_0324 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_2282 Gram negative topoisomerase IV, subunit A
MNISMNPDVEERPLGEFAEKAYLDYSMYVILDRALPHLGDGLKPVQRRII
YAMSELGLSAGAKYKKSARTVGDVLGKYHPHGDSACYEAMVLMAQQFSYR
YPLIAGQGNWGSPDDPKSFAAMRYTEARLAPFAELLLSEVGQGTVNWAPN
FDGTLQEPRLLPARVPHVLLNGTMGIAVGMATDIPPHNLHEVTAACLHLL
KYPNATVAELCEYLPGPDYPTEAEIITPSAALVKMYETGQGSVRLRAQYM
VEQGNIVVNALPHQVSGARVLEQIAQQMQAKKLPMLADLRDESDHENPTR
LVLVPYARRMDVETVMSHLFATTDLERSYRVNLNVIGLDGRPQVKNLRDL
LGEWFYFRIETLRRRFRYRLEKVEDRLQVLAGLLIAYLNIDEVIAIIRDE
EAPKPVLMARFKLSDRQAEAILELKLRYLARLEETKIRAENKALAAERKG
LEKMLGSESRIKDLLRQELLADAKKFGDARRSPIVERSSATVLDKQSLLP
AELITVVLSEKGWVRAAKSQDVNPAALSYRTGDSYRASARGRSNQSAIFL
DSTGRSYTLPAHGLPSARGQGEPLSGHLQVAPGAEFIAILLAEPEEFYLM
ASNAGYGFLCQARHLYTKNRAGKAVLNVSAGAKALPPVPVTDRELDQVAV
ATNVGRLLLFPLRDLPFIARGKGVKLINISSTRLAQGQEAVRALAVVSQG
GALQVFAGRRHLTLKSEDLEPYQGERARRGLTLPRGFQQVERMVPIG
>Noc_1057 hypothetical protein
MTNKEDSNFTQNLKNPAFWKRFLFMMLFAVAYTLAEFAVWAAVIFLIFYN
LITGGSNERAVTFGRQVSAYIYHLLLYLTYNTEERPFPFTDWPRPENMPT
GLGYPLTPNAGEATTGRNPSPQPPNPTTGAAQSVPETTAANPAFRKEGSS
QTE
>Noc_0966 hypothetical protein
MFYNPKRESEPLQRQWSAHQESVCTYAAEQLIFLDEMGAVFNLSLDYAPK
GQRVYDEKPTAKGERISTLGVLSLQGLVTGMRFEDTLNGSVFLYFLEQFL
CPPLKPGQCVILDNAAAYKAEGGAEPMSKLALDSFTCLPIFQILTPLK
>Noc_0217 Deoxyribonuclease V
MKTHSWNLSPKEAVALQRRLADQVVAEDRLGQVHFVAGVDVGFEEQGKIT
RAAVVVLRLADLSLVEQVVARRPTHFPYIPGLLSFRECPTVLAALEKLTV
TPNLLLCDGQGIAHPRRFGIACHLGVLTGLPSIGVAKTRLVGKHGPAPDK
RGGWTPLTDKGETIGVVLRTRIKVNPVFVSTGHRISLLTAIQYVMACTTR
YRLPETTRLADKLASGSKK
>Noc_1793 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_0986 hypothetical protein
MARAKKPSEMENMRAENPVTMLEVKFATVRQKLPAEYEKAITGAGRVLAS
RIKKLDTLNGRLTKARERLAKAREQQKIKATAAVQARFEKARLAVAELKV
ESADLRAEIVGLRKEISNLKKRSRQFQTIEKAIQKLEKKAAKPHKPRRRK
KTKASDRKIALESQWERFTAGSSLNKESAFAEHEIEAEVE
>Noc_0355 Protein of unknown function UPF0102
MKPATHRDKGEQAEQLACHYLQARGLRLTQRNYHCRLGEIDLIMEDRESL
VFIEVRYRRKGRFGDAIDSITPAKQARLIAAAQHYLQRTGGAQNKPCRFD
VVGITSEKGADNIMWLRDAFRVES
>Noc_1366 Transposase
MTEPDSSKIQRSYKFRFYPTSIQRQQLALEFGHARWVWNTCLTWRGRQYR
VHDKRVTGVDFSRQLTFLKKLGAYAWLGEATRDGLMQKLRDQDRAFRNFF
AGRAKYPKFKKRAHTQSIRYCFDHRRRGKMKAWIEGKLVLPKLGALKLKG
SRRPQGVPKMVTVTQDGAGRYFISFMCEETIQPLPRKPNGIGVDLGVKDI
VVTSEGWKSGNPRHLRKIQRGLSRKRKGSHRWHRQRIRVAKAHARVSNTR
QDWLHKLSTVLIRQAGFIAMEDLNVRGMMANRRLAKALGDVGMHELKRQL
AYKAKWYGRALVQVDRWAPTSKTCSACGTVIDVMALKVREWRRIPTGGGR
MTAKGRPRNANGSKASEAACLCDARRQVGNKRTRLSRKLQYSYHQLAARG
DVT
>Noc_3010 DNA topoisomerase
MSKNVVVVESPAKAKTIKKYLGKNFEVLASYGHVRDLMPKEGAVDPEHGF
KMKYQAIEKNGRHVNAIAKALKSADFLLLATDPDREGEAISWHLLELLKE
EGVLEDKAIQRVVFYEITSQAVNEAVAHPRDISLDLVNAQQARRALDYLV
GFNLSPLLWKKIRRGLSAGRVQSPALRLICEREKEIDAFKVREYWTLEAD
AAASKQEFVAKLTHLDGKKLAQFDIESKDQALALVDRLTKAASGELRVIK
VERKQRRRNPAAPFITSTLQQEASRKLGFSTKRTMSVAQQLYEGVDIGDG
AIGLITYMRTDSVNLANEAVGEIRNFITERFGQSGLPAKPRTFKTRAKNA
QEAHEAVRPTSVYRVPEALKPHLKPEQFKLYQLIWRRTIACQMKHATIDT
VAVDLNTQKLAPEGGNSASGHIFRATGSTVVDPGFMAVYQEGRDDIKGEE
EQRKLPPMKEGDRVTLLQIRPEQHFTEPPPRYTEASLVRALEEFGIGRPS
TYATIISTLQQRDYAVLENKRFQPTDVGRVVNRFLTEHFNSYVDYDFTAR
LEDELDAVSRGEKVWIPVLEEFWGPFSARIQEKEQNVSREEAVQARELGI
DPQSSRPVSVRMGRYGPYIQIGSKEDEEKPRFAGLQPGQKMDAITLEEAL
TLFKLPRELGFTPGGEQVSVNVGRFGPYVKYDNKYVSLRGEDPHTISLER
ALALIEEKKQADANRVIKVFPDSGIQVLNGRYGPYVTDGERNARVPKEQA
PEALSLEQAQALINEAPVKRARRKAGTRKKAKG
>Noc_1139 Excinuclease ABC, B subunit
MKDIFQLVSDYQPAGDQPEAIERLTEGLAAGEMYQTLLGVTGSGKTFTIA
NMIQQVQRPTIVLAPNKTLAAQLYSEMREFFPRNAVGYFVSYYDYYQPEA
YVPASDTYIGKDASVNDHIEQMRLSATKAFLERPDAIVVASVSAIYGLGD
KDSYLNMVLHLMVGDTVDHRGILRRLAELQYQRNDRELYRGTYRVRGEII
DIYPAESEQEAIRVELFDDEIESLSYFDPLTGEILHRVSRLTIYPKTHYV
TPREVLLQAVDEIKIELAERLEQLYGANKLVEAQRLEQRTRFDIEMILEL
GYCTGIENYSRFLSRRQPGEAPPTLFDYLPKNALLVIDESHVTVPQLGAM
YRGDRARKETLVEYGFRLPSALDNRPLKFEEWEQLAPQTIFVSATPGPYE
QQHSGAVIEQVVRPTGLVDPAVEVRPAGSQVDDLLSEIRQRTAADERVLV
TVLTKRMAEDLTQYLEQHEVRVRYLHSDIDTVERVEIIRDLRLGKFDVLV
GINLLREGLDIPEVSLVAILDADKEGFLRSERSLIQTIGRAARNLHGRAI
LYGDKVTGSMGRAIAETERRRKKQLAFNETHRIIPRGIQKAVREIIDGVY
TPGSGKGHRSPDRVEEKAAEYTRLPPQQLAKRLQQLERQMHKHAQNLEFE
QAARLRDEIKRIKGWVFNGADSASAPRQENSQARAIC
>Noc_0701 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_2475 Histone-like DNA-binding protein
MKKVILSLILMMVSLSVYGVGDKELAGRIAKQTGMAPDQIEEVLVAFKKQ
IIADLASGQEVRLSHFGKFYAKHMNAREARNPRTGEAIQVPPRSYLRFKA
FDTGNSRLN
>Noc_1644 Excinuclease ABC, C subunit
MANFDIDDFLRNLTPCPGVYRMLDAKGKVLYVGKAKNLKRRIKSYFRNSK
LAPKIHVLVKQICDIKITVTHTENEALILESNLIKALQPRYNVLLRDDKS
YPYIFLSADDFPRLGFHRGVKQVSGQYFGPYPNIRSVWQTLKLLQRVFPV
RQCEDNFYRNRSRPCLQYQIKRCTAPCVGLISKKDYSQDIQHVVMFLKGR
DQQVINELVIRMEEASGQLAFEQAAYYRDRIASLRQIQARQYISGEKKDI
DVLGVALTEKMACVEVFFIRGGHNLGNKTFLPKLEGNLTPEELLSTFIAQ
YYLNRETPPILILSHQPKDMGLLTEVLSKQAGRKIALIKPVRGPKVQWIK
MALANAKINLNQHLAEKSNITARFKSLQQLLSLANFPQRIECFDVSHIQG
TATVASCVVFDREGPRKADYRRFNITGIIPGDDYGALRQALMRRFKKKEG
VFPDLLVIDGGKGQINQSLRVLKEIGITEITVLGIAKGPERKAGNETLFL
AGYENPVMVTSDSPALHILQHIRDEAHRFAIVSHRKRRAKGGKLSLLEGI
SGLGPKRRRKLLIQLGGLQEITRAGVEDLAQIEGISLELAQRIYDVFHR
>Noc_0181 Integration host factor, beta subunit
MAPELVVVCGFGSALFVMTKSELIERLAQRQALLSYRDVELAVKMLLEVM
SQALAQGERIEIRGFGSFSLHYRPPRSGRNPKTGDAVPLSSKYVPHFKPG
KELRERVDRGESI
>Noc_2902 Transposase
MKEIVRTTLIKVDLPFEAAKNTVIAWTEACNAVSCRAFENGKLSNAIKLQ
QLCYETAKSFGLSSQVAVSCIRQVASKYAAARTAKKTLSKPVYFRPCAVV
LQGGKRGRDFSFTQQGLSVWTVNGRIKSLVYHGAPKLQAYVANWRLGDGR
LFVRKGKVFLSVSFKHEAETISKPNDAVVGVDRGIKVLATVTDGQRQLFF
GGGHTHHVRHRYAKTRASLQKKKARTGSRSTRRVLKRLSGRERRFMRNIN
HVMSRHLVDFARDTGNPTIAVEDLGGIRNGRRLRKQQRTDLNRWAFYELE
QFIRYKADTFGMEVIGVDPKHTSQGCSRCGHTEKANRHQRRFLCKACGYE
LHADLNASRNIRLRGVLARQVLDEDGVLSITPEARPVDPGSKPGEGTGKP
LALASGH
>Noc_2219 hypothetical protein
MPVSMPFVQPSPGPPLERLDGQGPSAHPPGFRNASPRRGKSRTFPLLPAP
YGPELHLIKILWRFIQYRWLLFSAFQSYANLKGALENRLANLMDVASAKK
LA
>Noc_0388 Transposase (probable), IS891/IS1136/IS1341
MINAFKYRLYPNAHQARELETMLETHRRLYNECLAQRKERYEIAQKSVKY
TQQSAWFKSERGVNEWYARLNFSSAQATMRRLDKAFQAFFRRIKAGEKPG
YPRFKARGRFDSWTYPAHGDGARLLDGQLRLQHVGLVKVRQHRPVEGAVK
TVQVKNEAGKWFVIASCNIGDGPAPRAEDTSVGLDVGLAYFLSTDQGEHV
DNPRYQKEALRQLRIAGRAVSRKRKGGRNRAKAVARLRAIHARVANKRRD
YHHKVARDLVSRYAFIAAESLTVSNMVRNRRLSRSISDAGWRQFLNILRA
KAESAGSVWVEVPPQGTSQACSGCDTLVRKSLSVRQHHCPECGLRLQRDV
NAARNILARGQARMEPVRLNVAQ
>Noc_1520 Transposase
MTEPDSSKIQRSYKFRFYPTSVQRQQLAIEFGHARWVWNTCLTWRGHQYR
VHDKCVSGVDFSRQLTFLKKLGAYGWLGEATRDCLMQKLRDQDTAFRNFF
AGRAKHPKFKKRAHTQSIRYCFDHRHRGKIKAWMAGQLVLPKLGALKLKW
SRRPQGIPKMVTVTQDGAGRYFISFMCEETIQPLPRKPNGIGVDLGVKDI
VVTSEGWKSGNPRHLRTYRRLLTKTQRRLSRKVRGSDRWRRQRVRVAKAH
ARVSNTRQDWLHKLSTALIRQAGFIAMEDLNVRGMMANRRLSKALGDVGM
HELKRQLEYKAKWYGREFVQVDRWAPTSKTCSACGTVQKAMPLNVREWTC
PDCKSVHDRDINAARNILRLTTVGRTGSDARGGVYQPEVAYGC
>Noc_2611 DEAD/DEAH box helicase-like
MTSSTELLSFDSFAISAPVLQAIKQVGYETPTLIQARAIPHLLAGHDLVG
QAQTGTGKTAAFAIPTLERLELSQKEPQVLVLVPTRELAIQVAEAFQSYA
RYLEDFHVLPIYGGQSMGNQLRQLKRGAHVIVGTPGRIMDHLRRKSLILT
KLTTVILDEADEMLKMGFIEDVEWILKQVPTKRQVALFSATMPTSVRNIA
SRHLQAPQDIKIKGETASLPAINQRYWLVSGLHKLDALTRMLEAEEYDAT
IIFVRTKIATEELAKKLMARGYAAAALNGDIPQSSREKVVDQLKKNTIDI
IVATDVAARGLDVKRIGHVVNYDIPYDTGTYIHRIGRTGRAGRTGTATLF
VAPRERRMLSAIKRATGQPLYEMQLPSHQQVIDRRIERFKQQVMNTLEKE
NLVAFRHLIQQWAQQEDLSPLDIAAALTYSMQRERPLVGPPRSEPVTHGA
PPPRKVAKHKSRKVAPAETTKRKKPYKKAIKKQKDFRSLYLNNSHPPSTP
>Noc_0387 Transposase IS200-like
MHHSQMSRYAKNAGAVFSLKLHLVWCPKYRRGVLVGEIAERLCALLHKKA
DELEMTIHALEVMPDHVHLFVEFDPRWGVAEMVNRFKGSTSKELRKEFPI
LRSRLPTLWNRSYYAGTVGHVSESTVRAYIENQKGK
>Noc_1301 ATP-dependent helicase HrpA
MQRDQHRLKRRLQRLTKGNSSYNLGHLTQAIEDSRLWREQRQSQLPRPAF
EQSLPVIERREEIGAAIRNHQVVILCGETGSGKTTQLPKICLELGRGVAG
MIGHTQPRRIAARTVANRIAKELNSDLGQIVGYKVRFHDQVSPSTYIKLM
TDGILLAETQGDRFLDQYDTLIIDEAHERSLNIDFLLGYLKQLLPKRPDL
KVIITSATIDTERFSQHFGQAPIIEVSGRTYPVEIRYRPLCGEQETQERN
LSEGILDAVDELSRLGPGDILVFLPGEREIRETAEALRKHHPPHTEILPL
YARLSSTEQNRVFKPHSGRRIVLATNVAETSLTVPGIHYVVDPGLARLSR
YSVRSKVQRLPIEKISQSSANQRAGRCGRVATGVCIRLYSEEDFLGRPEF
TDPEVLRTNLASVILQMKSLQLGAVEDFPFLDPPLPKMINDGLRLLAELG
AVDKAQNLTPLGQRLARLPIDPRIGRMVLAGDEFHCLSEMLIIASALSIQ
DPRERPLEAQQAADEAHSRFQDERSDFLSYLKLWEDLHRQRARLSQNKLR
AYCREHFLSYLRLREWRDIHQQLKLLATNIGFRPNQVAAEYGAIHRALLT
GLLGNIAVKSEKDHYLGARNIKLQIFPGSALFKKSPKWIMAAELVETSRL
YARCAGKIEPEWLEALALHLVKRSYFDPHWEKRPAQVIAYERITLYGLTV
IPKRRIHYGPVNPEEAREIFIREALVNGDYDTQAPFFRHNQKLIAEIEEL
EHKSRRRDVLIDEQSLYQFYEERLPAGVYNGAGFKKWRQQAEKKNPQLLF
LSREELMRHDAKEITGVRFPDQMTVKGLPLALSYHFEPGHPADGVTLTVP
LAVLNQLEASHFQWLVPGLLKEKIICLIKALPKGLRRNFVPVPDFAEACI
RALSPAQGPLLDRLARHLQSMTGVPLSATCWQEVDLPLHLQMNFRLVDEK
DKELATGRDLAILQRQWASKAQRSFRGWDNSELTREGITQWDFGELPERI
ELERQGLKLKGYPALQDTETAVSLVIMDSAEAAQEITHLGLRRLFMLALT
QQIKYLRKNLPGIQKMCLHYTSLPAMPWGDSAPSQSSCESLKDALIQGII
DRTFILDHPPVRNGEKFMARKEKGCGELMSTANEFCRLIEEILTEYHEVV
RQLKGNLPFAWLNSIRDMKEQLTHLVYHGFINQTSPIWLIHLPRYLKGIK
LRLAKLQENPRRDQQRQAEITPLWQAYQKRMEIQHQEDGVVPALETYRWM
LEEYRISLFAQELGTKRPVSPKRLAAQWKEI
>Noc_1915 hypothetical protein
MVENKAYRSIFSPCSTQEQAANFDSYWIFSQRHAGELLEEDKDLTNKREK
LNYFQNNPIRSRNPLPNPELFYRNHVKFKDDPQSIDRKTLLLTTIYKFAR
HEWVGISGAWDIIPSMAEAKTLTDKISRVHLAEEFCHVRFFHEMLKTFHL
NKVEWLPLGPVKQKIYKIFPRLPGFLMDTPAFVTELMGMTFYQHLDKLFE
EVFADEPEARQRLREILHEIMIDEMAHVGQRRNFIGPNGVRAAKLMLAPL
YRAFFRDIPEAKYLFDIEKMIQDGLAFDYSSVSPSLLARSWVPSYCQV
>Noc_2051 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_1415 NUDIX hydrolase
MKIKQRSAGVVVIRKTVNYCQYLLLRAYHYWDFPKGLVQPGEDPVMAACR
EVEEETGLTQLQFRWGYQCRETPPYGRGKVAIYYLALASRSEVHLPVSLE
LGRPEHHEFRWVTYREGHQLLGGRLREVLNWAQRISDCKAGCGSTPFNPS
Y
>Noc_0615 Phage integrase
MLTDTKLRNLKPKDKLYKVNDRDGLYVAVTPAGTISFRHNYSISGRQETL
TIGRYGTGGITLAEARERLSEAKKMITAGKSPAREKARDKARVKDAETFG
AWAEKWMRGYQMADSTRDMRKSVYNRELKKRFGNQKLSEITHEDLRAVTD
SIVERGAPATAVHAREVVLQVFRWAIERGQKVENPAELVRPTSIARFEPR
DRALSPAEIGLMYRYMDRVGTSPQYRAAIKLLLLTMVRKSELSNATWSEI
NFSEALWTIPKERMKRRNPHLVFLSRQALDIFIALKTFAGGSEFVLPSRY
DSDIPMSAATLNRVLALTYRAAQKDGKQLSKFGPHDLRRTASTLLHESGY
NTDWIEKCLAHEQKGVRAIYNKAEYREQRVEMLQDWADMIDEWTNVKRQ
>Noc_0193 NUDIX hydrolase
MSKPTFFVVAVAVFLVHDNRFLALRRSTSKAVAPGAWEVISGKVERGELP
HETARRETYEETGITVALDERPVTTYQADYGMAPMIVLVYRGKRLAGEAS
LSSEHEAMAWVTEDEFAQLCLYGELVEAARWALKVP
>Noc_2663 DNA polymerase III, delta subunit
MRVKLEQLPAHLGRGKLAPVYLLFGDEPLLIEEGADLIRSRVQSQGFYER
EVMHIERGFDWAQLQWATRGLSLFAAHRLIELRLPTNAVSEAGAKVLRAY
GENPSPDTLLLIISGKLERRHQGTRWFSALEQAGVVVQVFKVDISSLSRW
IERRMHSRNLSPTQEAVTVLVERLEGNLLACAQEIDRLALLFPEETIDVS
RVEEVVADNARYDVFRLMESALAGDVVQIARIWRGLQAEGVEPVIVVGAL
AWELRRLARMARICAQGMPVDQALREQRVWERRKALSKQALSRHSAHRWL
LFLRQLGEIDRMVKGAARGRPWEAILQLCLAIAGFELFPPAVEH
>Noc_0074 putative transposase
MPRTHGYAPKGLRCFGERDWDAKGRTHAMGALIGKALLSVALFSVALFNV
NVNADVFTAWVAQDLLPKLPAQSIVVMDNATFHKREDTQNMIRNAGDGLL
YLPAYSPDLNPIEQKWAQAKALRRQTHRSIDEIFTENQFI
>Noc_0800 conserved hypothetical protein
MNVFVFDIETVPDVEGARRFYGLEGLDDAAVAEIMFSKRRQETGGGEFLR
LHLHRIAAISVALRSQDRFKVWSLGDPASPEEELIQRFFDGIEKFVPTLV
SWNGSAFDLPVIHYRALLHGIGGARYWEIGEKEQSFRWNNYLNRYHQRHL
DLMDVLSGYQTRAVAPLDEIATLLGFPGKMGMSGAKVWDLFQSGELEAIR
NYCETDVLNTYLVFLRFELIRGQLEKATYQQECQRVREVIGAENKTHFSE
FLALWNI
>Noc_0612 Exodeoxyribonuclease VII
MESPSHNLNPTREIYTISRLTREARHILEGSFPLLWIEGEISNLSRPSSG
HFYFTLKDKIAQIRCAMFRNRNRLLGFSLEEGTQVLARVQVGLYEARGEF
QLIVEYLEEAGDGALRRAFEALKQRLSAEGLFAAAHKRSLPILPQRIGII
TSPSGAAIRDILSVLKRRFPAVPVLIYPTPVQGEGAFQRIAAAIAKAEQH
RACDLLILARGGGSLEDLWAFNEEALARVIYHCPLPIICGVGHEIDFTIA
DFTADQRAPTPSAAAEMAVPDSREWYRNFLNLEQRLKLLFQQHLRHRRQL
LENLTKRLRHPHIRLQEGIQQVDELEQRLERAWTHLNRERFHQLGGLSIQ
LQRLNPAQHLKAYHLRLRELNRRLPTCQQQRLGQQQMRLEMVQRALHAVS
PQATLERGYAIVTGPKGIVLRKASQVQPGAKIEARLAEGHIRGEVTEILD
KP
>Noc_1398 putative transposase
MTYPSSFRRKVLSVREKEGLTMAQVAARFCVGVASVPRWVKNPEPKLTRH
KPATKMDRDALARDVLEHPDAYHYERARRLGVSEKGIGHALPRRGITYKK
NAKASAPMRRRAASVPDIH
>Noc_2768 Transposase
MTEHDSTTVQRSYTFRFYPTSVQRQQLAMEFGHARWVWNTCLTWRGRQYR
VHDKRVTGVDFSRQLTFLKGLGPYAWLKEASATCLIQKLRDQDTAFRHFF
AGRAKYPRFKKRTHTQSIRYQLDQRQVAGMYRAGEFLKLPKLGALKLKWS
RKPQGIPKMVTVTQDCVGRYCVSFMCEETLQPLPRKPNGIGVDLGVCDVV
VTSEGWKSGNPRHLRTHTRQLRKTQRRLSRKRKSSVRWHRQRIRVAKAHA
RVSNTRQDWLHKLTTALIRQAGFIAMETLNVRGMMANRRLSRALGDVGMH
ELKRQLAYKAQWYGRAFRQVDRWAPTSKACSECAAVQETMPLNIREWTCP
DCKSVHDRDINAARNILRLATVGRTGSDARGGVHKPEVAYGC
>Noc_0401 Transposase
MKLLSKIQDNLRKLECKSMLLAHKIELRPKASQAEYLNKSCGSRRHCYNQ
LLEHFSKPDNKWSKAAAYQYYIKVIRPAYPWYNEVSSRVTRNAIDDLDDA
FKHFFRRVKKGGKPGFPKFKKKDINDSFALREKTKFEVKGRKLRIEKLKT
LIPMRQRLRFEGTPKQVAMSKQAGKYFASVLVDTTDYKDYSQNRSPSVGV
DFGVKSLAVTSDNEVIPSNNKLKKSLKKLKHLSRSLSRKRKGSNRRAIAK
QRLAKLHYRIAQQRKAVLHELSHSLTANYDRIAIEDLNVKGMVRNRTLAR
SIADAGFGMLRQLIEYKAFLRGCTVELVDRFFPSSRMCSGCGQLHDITLA
DRALACDCGLTIDRDLNAAINLNRYRRDTLKPDVKRTQEPSKTALAASVW
TV
>Noc_2018 NUDIX hydrolase
MIDRDGFRANVGLILCNQDDRVLWARRAREKAWQFPQGGVKESETTEEAA
YRELEEEVGLGVEHVKIIGCTRSWLRYRLPNRYVRYGNKPLCIGQKQIWY
LFRFVGEEQDVQLNLTDKPEFDYWCWVNYWYPLREIVYFKRKVYQRALNE
LAPLIFPDHQSLPPARSNYRKRRRQKTRSRI
>Noc_2512 NUDIX hydrolase
MPNQRTLLYRGRIIDLGLELASLPNGQQISLEIVRHPGGAVIAAVDDKQQ
ICLLHQYRHAAGGFIWEVPAGKLDPGESPFATAQRELAEEAGLRASHWTE
LGAIYSTPGFCDEILHLYLAQNLTATSRDPQPEEYLESYWFPLAKTLEWA
HRGRIKDAKTLVILFRAAAALD
>Noc_2595 RecR protein
MSLFGLSLGRLLEALRCLPGVGPKSAQRMAFHLLESDREGGRHLAQALLE
ALDKMTHCQTCRILSETDLCSLCADSRRDRGQLCVVEMPSDVQAIEQATS
YSGRYFVLMGHLSPLDGVGPEALGMDLLAKRLDTDQIREVILATNLTVEG
EATAYYISELAQTRGITTTRIAHGVPLGGELEFIDSNTLSHAFQSRRHL
>Noc_0768 Protein of unknown function DUF1568
MPRYLRACVPGGTFFFTVALLERQSHLLTEHIDDLRLAFADARRRRPFTI
EAIVILPDHLHCLWTLPPGDSDFSVRWHDIKARFSARIPPGERLSARRLK
KSERGIWQRRFWEHVIRDERDFWLHGDYIHYNPVKHGHTEKAADWPYSSF
QRFVQRGFYPLNWAASEKVCDLTME
>Noc_0656 helicase, DEAD/DEAH box family
MNEYNTNVSIFINLDIDEELSAEEYELYADQVVDQATAAETIPELEAEIL
ILKDLEHQALGVVQSGNDKKWEELSHLLQDRPEMYTESGSRRKLIIFTEH
KDTLNYLVGRIRGMLGNPEAVITIHGGVNRDDRRKAQEEFRNNRDVQVLV
ATDAAGEGVNLQNANLMVNYDLPWNPNRLEQRFGRIHRIGQTEVCHLWNL
IASETREGEVFQRLFDKIEIEKKALGGKVFDILGEVFEGKSLKDLLVEAI
RSSESDEARHAYQESFFGKALDTEHLKEILRRNALVEQHMSLEDLYAVKE
EMEKAEARKLQPYFIRAFFTEAFQNLTGEMRPREAGRYEVRHVPASIRER
DRIIGESRTPVLKKYERICFEKDLVRAHGKPMADLIHPGHPLMHATTDLI
LSAHRSKLKQGAVLVDSNDDGLEPRILFMVDHSVREAPANDQHDKPRVAS
RRLQFVEIDQHGKAFHAGWAPHLDLQPIDDYDLKLVQDILNAPWISADLE
GLALNHASQHLVPEHYREVKARREHQADKVLAAVNERLVKEINYWSDRYI
KLSDDVAAGKQPRMQPEMARRRVDELTERLNQRKRELEAMKAVVSSTPVV
IGGALVIPQGLLAQRKGETAFCADAEARARIEMVAMNAVIAVEQGFGHEV
KDMSAEKCGWDVTARPPANPDGSIKPDRHIEVKGRAKGQSTITVSRNEII
YALNQTDKFLLAVVIVDGDSFDGPHYIRNPFSTEPDFGVASINYDLSDLL
SKAVSPELTL
>Noc_3012 SMF protein
MDERAYWLALHRAPGVGSVSFCRLLEKYGSPTALFTSPERLAGLSDGIQH
YLRQPDWKAVEQDLKWLEQPDHYLLTLADPGYPPLLREIPDPPPILFVHG
DPSLLSLPQLAIVGSRNPSPAGAETAAQFATYLANSGLVISSGLALGIDA
AAHEGALAAKAATIAVAGTGLDRVYPARHHALAHAIAESGALVSEFPIGT
PPLPQNFPRRNRLISGLSWGILVVEAALQSGSLITARLGAEQGREIFAIP
GSIHNPLARGCHHLIREGAKLVEAAQDIWEELGSLAGAIPNLQCQEAPQK
IEASTDDLEYQLLLDCLGYDPLPIDLLVERCGLTAEAVSSMLLILELQGR
ITALPGGRYLRCGKEGQS
>Noc_0059 hypothetical protein
MKPKQESLFQTRLVPAEAGSGRLFDEELVAGSDGPVKCLGLEFENDEARR
THFTEELRKKLQDPEFRKIEGFPIGSDEDILNLSDPPYYTACPNPWIADF
ITEWEAQKPEQPEGYHYHREPFAADVSEGKNDPIYNAHSYHTKVPHKAIM
RYILHYTEPGDIVFDGFCGTGMTGVAAQMCGDREVVMSLGYQLKPDGTIL
QEEMDEDGKKVWRPFSKLGVRRAVLNDLSPAATFLAHNYNLYVDTASFEN
EAKKFIKIIERECGWMYETTHTDGVTKAKVNYTIWSDVFVCPDCTNEIVF
WDVAVDKESETVNDEFQCPNCQTSLTKRNMDRAWVTTYDRFLGETIRQAK
QIPALINYSVGGKRYEKKPDEGDFAILEKIENEGLDGWFPINRMIEGHES
RRNDPVGITHTHHFYSPRNLSVMSKILDLADKSEFTPFKFGFLNTSWHAT
QMRQYNPGGGHRPRTGTLYMPSIHSEGNMIPVYKKKLNQLVQFYKVKSHR
NRVAIIQTMSSTVESSIEAGLDYVFIDPPFGANLNYSELNSIWEAWLKVS
TNNAEEAIENRSQNKGIDEYRSLMTQCFRQAYNQLKPGRWMTVEFSNTSA
GIWNNIQTAISDAGFIVANVSVLNKKQGSIMAYTTPTAVKQDLVISAYKP
NGGFEERFQKEAQTEEGVWDFVRTHLKYLPVTKQQGALLQFVPERDPRIL
FDQMVAYYVRKGYPVPISSQEFQIGLSQRFIERDGMFFLPDQVAEYDRKK
MTSGELKQMSMFVSDEASAIQWLHQLIKEKPQTFSDINPQFMQQLGGWSK
NEAQLDLRELLNQNFLSYDGKGPVPEQIHAYLSTNWKELRNLPKDDPTLV
AKARDRWYVPDPNKAGDLEKLREKALLKEFEEYKEVKKKLKVFRLEAVRA
GFKKAWQERDYAVIVAVADKIPNNVLEEDPKLLMWYDQAVTRMGGGD
>Noc_2043 conserved hypothetical protein
MPPLQTSLIKLLGISMIAAGLGVLGGCASVPPPRGEIAEATFAVGEAQEA
EAPQYAPAELRSARKKLKAAESAMVDENYEKARRLAEQALVDAQFAEVKA
RAEIQRQGVEELRKSIEILRRELNKRSGNP
>Noc_2814 DNA polymerase III, epsilon subunit
MRQIILDTETTGLEPKEGHRIIEIGGVELSCRRRTGRTFHCYLNPEREVD
KGALKIHGLSNEFLSDKPRFIEIAEEFLAFIEGAELVIHNAPFDVGFLDH
ELCLLGSKWGKISTDCQVLDTLQLARQRHPGQKNNLDALCKRYGIDNSRR
DLHGALLDAEILAGVYLAMTGGQVSLSLHEYGDNGLPFRVGGGNYRPVSA
ERASLTIARASREERMAHDARLTAIDKASGGACLWRQLDSSPD
>Noc_0049 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_2420 NUDIX hydrolase
MADKEPIKPQIIATETVARTQLFRVETVDLCFSNGVETRYERLRSGRHGA
VLIVPLLDRETVLLIREYAVGTERYELALPKGRVETGETLFAAANRELME
EVGYGASRLAYLTALTVAPGYMEHTTHIIMAEELYEERRPGDEPEEIGVV
PWRLAELPALLAREDCTEARSIAALFMVKEKLSL
>Noc_0236 DNA repair protein RadC
MAITDWPVHERPREKLLQRGPNALSDAELLAIFLRTGVAGKTAVDLAREL
LESFGSLRALLEADLKQFCHAPGLGIAKYTQLQACLEMGRRHLEDTLKRG
DVLTDPQTTQRYLVARLRAYPFEVFSCLFLDNRHRVLAFEELFRGTIDGA
SVHPREVLKRALAHNAAAVILAHNHPSGVAEPSRADECITQRLKEALALV
DIRVLDHIIVGDGETLSFAERGLL
>Noc_1464 Micrococcal nuclease (SNase-like)
MNKLILLFFLLPLAASADYTGRVVGISDGHTLKLLAAGNLQVKVRLAEID
TPEKRQPYSNRARQALSSLAFGKQARVVVENVDRCGRTVGHVFVDGVNIN
REMVRQGAAWVYMAYLRDKSLFGVEQEARAAKRGLWVLPEAQQLPPWEWR
>Noc_1587 Transposase
MSTVVKTLKVRVKDNKANALNRMAFEVNQVWNNANEITAEYSSVPMPGFG
YLRSNFSAYDLHPFQKRYRKERGLNITAQTVQEVTEAHAKARRQFKKDKL
RWRVSGGPRRSLGWVPFKKGAAKWKNGCLYFAGHYFKVWDSYGLSKYEFR
SGSFSQDARGRWYFNIAVSVPVEKTTATKAVGMDLGLKDTATCSNGFKLE
AHRFYRNGEAQLGKAQRANKKKRVKAIHAKIKNRRLDSIHKFTTQVVREN
AFIVVGNLSSSGLAKTKMAKSVLDAGWFMLKTMIEYKSKRTQSEFIEVNE
AYTTQACSCCGCISGSSPRGRAGLGIREWSCSECGAHHDRDVNAAMNILA
AGHRRLAGGIPVL
>Noc_1413 ATP-dependent DNA ligase
MQKLAGVLQNFFNQAHAKGRLRRECFGNLIYYRLIDDSQPFRRGTVIFEE
GTLIPGYPQIGRMIRLDKGLKEQFTKPFWAEEKVDGYNVRIFLLGERLLG
VTRGGFLCPFTVDRLPDLIDERIFSDHPDFILCGEIAGPENPYLIGSPPF
IKEDIQLFIFDCQRKGEFDYLSQAEKHQLMEHYSLPSVRNFGLFKAEDIL
AIKQLMMTLDTEGCEGLVFKEDVPRGKRSKYVTSDASLSDIQAMARYLPD
FPPEYFIGRILRSVIFLDEEGISSTHELKAQLGSAFIDGLLKAIKQCQRE
HRVYHRFRCRLHDRANARQLLAHLARGDGHIQIVKHRLVREGEFYIFEFD
KVFQRTTGLLGELLSGEMVYD
>Noc_0001 chromosomal replication initiator protein, DnaA
MPSSLWKHCLNHLEGELDPQEFNTYIRPLQAIQQGTSLQLYAPNQFVIDW
VQNCAESRINALLSHYSSGRIEKALLEVGSCSLQPQPHIQAVELTSKSAR
SSSRVVDRIPESRLNKNYTFDSFVEGKSNQLPRAASHQVAENPGSAYNPL
FIYGGVGLGKTHLMHAVGNYIRSRNPSARVVYLHSEQFVAEMIKALQLNA
INEFKTRYRSVDILLIDDIQFFAGKERSQEEFFYTFNTLLEVQHQIILTC
DRFPKEVNGLEERLTSRFGWGLTVAVEPPELETRVAILMNKASIENIILS
DDVAFFLGRLIYSNIRELEGALRRVIAYSRFTHRPITMELTREALKDLLT
LQEKLVTIENIQKTVAEYYKIRVSDLSSKRRSRVVARPRQTAMSLSKELT
DHSLTEIGKFFGGRDHTTVLHACRKINELKSIDRRMAEDYHNLLKKLST
>Noc_0499 Transposase
MYDDPLARVRSGTAHRARGSHKVMLAAGKIRGKISAARGCLQHHLATCHR
PNEKSIEAMTEHDSTTVQRSYTFRFYPTSVQRQQLAMEFGHARWVWNTCL
TWRGRQYRVHDKRVTGVDFSRQLTFLKGLGPYAWLKEASATCLIQKLRDQ
DTAFRHFFAGRAKYPRFKKRTHTQSIRYQLDQRQVAGMYRAGEFLKLPKL
GALKLKWSRKPQGIPKMVTVTQDCVGRYCVSFMCEETLQPLPRKPNGIGV
DLGVCDVVVTSEGWKSGNPRHLRTHTRQLRKTQRRLSRKRKSSVRWHRQR
IRVAKAHARVSNTRQDWLHKLTTALIRQAGFIAMETLNVRGMMANRRLSK
ALGDVGMHELKRQMEYKAKWYGREFRQVDRWAPTSKTCSVCGAVQKAMPL
KVREWTCPDCQTAHDRDINAARNILILATGGRPGSHARGGVYQPAAAYGC
>Noc_0073 putative transposase
MTYSSDFHYKVLSVRKKEGVTIAEAVSGFCVGVASVTRWLKGPEPKPSRN
KPATKIDMDVMAGDMFGTIRMRINTSERSRFGVRVQGINYALHRLGVSYK
KKLDAPKGMRRRTARLPDSHHMAQTE
>Noc_0658 putative transposase gene of IS630 family insertion sequence ISY100h
MKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPKGRRVYGLIS
GHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKELCPLLNSNHI
VIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKDFGNIKKIRE
YNEHETLENIVAAYQ
>Noc_1085 Phage SPO1 DNA polymerase-related protein
MDDFLEKAVTLQELKWHCERSLGPEVPKGEKLVFGEGPSPAEIMIIGEAP
GVQEAKTGRPFVGSSGKLLTQLLHQIGLKREHVYISNILKTHPPGNRKPY
RSEIKRELPFLLRQIELLQPQLLILLGATALQALLDPKAKITALRGQWVE
VKKLPTFVTYHPAAALRDETKKTALEQDFAVLQRHLESR
>Noc_0659 DEAD/DEAH box helicase family
MIKLEDIKKDAQVLGIQGNEIVRIVQVEPVGDSAITVYYKDNQGRLGEQM
LFRSDEARLELAQAGRPWAFDAPGEDFKLGLEAYRISQAALFDPMMAVHT
SNVEPLPHQISAVYEAMLPRQPLRFVLADDPGAGKTIMAGLLIRELLMRA
DAKRILIVSPGGLTEQWQDELLEKFGVQFEIFSREKQEQCASGNYFDEQN
QLLCRLDQLSRNEEYQEKLKNTEWDLIIVDEAHKLSANYFGNKVNKTKRF
LLGELLGSITRHFLLMTATPHNGKEEDFQIWLSLLDGDRFYGKFREGAHK
VDVSDMMRRIVKEELLKFDGTPLFPERRAYSANYDLSDAEAALYAQVTDY
VRNEMNRADNLDGKRRGTVGFALTQLQRRLASSPEAIYQSLKRRRKRLES
RLDEMKLVARGHSVQKGVAETLGEYTVKRQIDLPDNFDEL
>Noc_0846 DNA polymerase III, alpha subunit
MAPSFVHLRLHTEYSLVDSLVRIRPLIQATSEAGMPAVAVTDQSNVFAMI
KFYRAAQAAGVKPIIGADIFLVGSGEHSRVSRFTLLCQNELGYRNLSSLL
SRAYREGQYQGIPRLQWDWLQNLNDGLIILSGGREGDVGQALLAGNDIQA
GKLLERWQALFPGRYYLELHRTGREGEEDYLHAAIALALARDTPVVATND
VRFLRPQDFEAHEARVCIHEGRTLNDPRRPRHYSEQQYLRTPSEMTELFA
DLPEALENTVEIARRCNLELTLGKHYLPNFPVPEDLSIEAFLAAEARRGL
EQRLVKLYPREAKRETEQPAYEARLAEELAVINQMGFPGYFLIVADFIRW
AKKNGIPVGPGRGSGAGSLVAYALQITDLDPLAFDLLFERFLNPERVSLP
DFDIDFCMERRDRVIDYVSHYYGRDHVSQIITYGSMAAKAVVRDVGRVLG
HPYGFVDQIAKLIPFDLKMTLDKALAESEGLRSRYEGEEEVRFLIDLARK
LEGLIRNAGKHAGGVVIAPKKLTEYVPYYCEQGASGVVTQFDKDDIETIG
LVKFDFLGLRTLTILDWALQAINRQRTQQGEALLDLALLPMDDPKSYALL
KRCATTAVFQLESRGMKDLIKRLQPDCFEDIIALVALFRPGPLQSGMVDD
YINRKHGRAQVNYPHPALEPILKPTYGVIVYQEQVMQIAQVLAGYTLGEA
DLLRRAMGKKKPEEMAEQRAIFTAGAKAREVDEKTATAIFDLMEKFAEYG
FNKSHSAAYAVIAYQTAYLKAHYPASYMAAVLSADMDNTEKVKAFVEECW
AMNLDLLPPDVNASNYSFSAQGETAIRYGLGAIKGVGAAALEGIIKVRER
HGPYQDLFEFCRRIDLRKVSRRVLEALIRAGALDSLGVERATLEASLETA
LALAEQHCRNASLGQNDLFGLDLVSEEGEAGNYVEVREWDKEQRLALEKE
TLGLYLSGHPIDCYKQELKQIAPCRIVELIDRANNRQSRNQQIVIAGLVG
SVRTNKARQGGRNAFVTLEDGSARLEIKAFAEVFDKYRERLQLDHIVVIE
GALKWDSYADSTAVTAEKIYSIAEAQEVFAKSLEIGLDGTRMGQEVIAEL
AQILAPFRQGRCPVAIDYRNRIASARLILGEEWQVRPNEVLLARLRLLPG
AEHVRISY
>Noc_0958 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_1615 Transposase
MTYSLDLREAAISYINSGGSKVEASRLFGFSRNTLYRWLNTDDLQPKKHG
FRNRKLDKAALKKHVEDHPDMFLHERAEVFGVHTSSISRALKAMRIVKKR
ARV
>Noc_0696 Resolvase-like
MKIGYARASTLDQNPSLQRDALEAAGCEKMIVDQISGTVAKRPGLEKVKE
LLREEDTLVVWRLDRLGRSLRDLIEQIRTLDAQRVGLQSLHESIDTTTPT
GRLTFHLFGALAEFERNLIQERTQAGLAAARARGRLGGRRKSLNSDKRAL
VVSLYEEKKLPVTKICEMMGISKPTLYSYVREAQEKTNRV
>Noc_0664 TatD-related deoxyribonuclease
MIDFHCHVDLYPDPHAIARECRERKLNVLSVTTTPSAWAGTSALGGGAII
TALGLHPQLAHERKGELPLFDRILPGSAYVGEVGLDGASEFKTHWQDQID
VFRHILGACTEAGGRVMSIHSRRASTPVLDLLELYPESGTPILHWFTGTA
RELDRAISLGCWFSVGPAMLRSKRGKDLVMRMPRERVLTESDGPFAQIKE
RSIFPWEVNLAEAKLADLWDSDPGSTGQLLSENLNILLSIGR
>Noc_0053 toprim domain-containing protein
MDKTFSDFGIDVPPAASGQLSLTCPQCSAQRKKKRAKCLSVNVEKGAWIC
HHCSWRGGLSQREQSNRTLYWRRPDYRQPAPFSPGALPEDIQRWFAKRGI
TPAVLERNHIATKKVYMPQLERWVSAIAFPYYRGETLINAKYRDGRKHFR
LEAGAERILYGLNDLEQTTLIVEGEMDKLALEVAGFRNVVSVPDGAPPPQ
AKDYARKFEFLQADEEALKTVKTWVIAVDNDAPGQYLAEELSRRFGREKC
KRVLWPEACKDANEVLLKRGPEVLTDCIKNAQPYPLAGVLTVSHLSEDID
FLYTHGLKRGMSTGWPSVDICYTVKPGELTVVTGVPNSGKSNWLDCLALN
LAQQGWRFGVFSPENQPVGHHMARMIEKWAGKPFNKGSIARLSRSTLAQG
KDWVHEHFYWILPEDDQDWTVEHVLDRARALVLRYGIKGLLLDPWNEFEH
LRAPNVTETEYISLVLKRVRQFARYYQVHVWIVAHPAKLFRGKNDQYPVP
TLYDISGSANWRNKADNGLVIWRDLGDPKKDLVEIHIQKIRFREVGRLGA
VRLRFDPVTAVYREPEPDDEAAFPPADGADKADEQAYLDSLYAEYEAQGG
K
>Noc_1441 hypothetical protein
MNTSVQQLHNFAEKHHLVGLSAPSLQALMEGRTGYSTSKARRFLSFQDRI
NRELLSHRIILHNRYTDWFKRGEQNLEQIKAFIVQFSVFSNQFLVAQLFK
TINADSLESMRASKEILANEIGVVFNPGRGKVKKVSDLEREGDPKLVSTE
GTVEGGVFRFQAAHFEWLLQIAHKIGLGFNEVGKRSQGTPATLFFCDELN
RLYGSEDYRISQASSYAVENWAAAGFWDELIEGFSKFNRRTGIALPLAFF
TWHSRLEAQHARHTQEELEEVYFSREIKENEFISYGNEMLAGVAAFWDGL
EDQRKALAAVH
>Noc_0071 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNVPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_1169 Endonuclease III/Nth
MSCPSAFKNARPSMKKNAEIQEIFSRFQAANPKPTTELKHQTPFELLIAV
ILSAQATDKGVNKATAQLFPVANTPQAILDLGEEGLKHYIKTIGLFNSKA
KNILQTCRLLLEQHGGQVPSDRVALEALAGVGRKTANVMLNTAFGQPTIA
VDTHIFRVANRIGLASGKTPRQVEDTLTRVVPDEFLHDAHHWLILHGRYV
CTARNPRCQECLINDLCDYYSKIAKEKSRGLKKTAS
>Noc_2218 Transposase
MTQCDSTTLQRSYTFRFYPTSVQRQQLAMEFGHARWVWNTCLTWRGRQYR
LHDKHVSGVDFSGHLTKLKKTAAYGWLKEASATTLNQKLRDQDTAFKNFF
AGRAKYPRFKKRAHAQSIRYQLDQRQVAGRYRAGKLLKLPKLGALSLKWS
RKPQGIPKMVSVTQDCAGCYCVSFMCEETLQPLPRKPNGIGIDVGISDVV
VTSEGWKSGNPRHLRTYRRLLTKTQRRLSRKRKGSVRWHRQRVRVAKAHA
RVSNTRQDWLHKLTTALIRQAGFIAMETLNVRGMMANRRIAKALGDVGMH
ELKRQLEYKAPWYGRAFRQVDRWAPTSKACSECAAVQEEMPLNVREWTCP
DCQTVHDRDINAAKNILRLATVGRTGSDARGGVHKPEVAYGY
>Noc_1856 Protein of unknown function DUF1568
MTLARRQQISLEETPFYHCMARCVRRAFLCGEDSLTGQSFEHRKQWIVDK
LKALAGIFAIDVCAYAVMSNHYHVVLRVDPVRAQGWSDEEVIDRWRRLFS
GGVLVERFLQGETATQAERDQVAELAVQWRERLWDISWFMRCLNESIARQ
ANQEDGCKGRFWEGRFKSQALLDERALLACMVYVDLNPVRAGIADTPEAS
DYTSLQARIRAYAEQRQLPNNSEGDTRSGKDKRPRRAVSPEAGSPPTRNR
LSEPSAALLPFRGSEPVDQSLAGIPLAFSDYLTLTDWTGRAIRNDKRGVI
PEDVPPILRRLGIDENAWVETVRDYGRHFCRVVGPVERLRRLAGKLGHRW
LRGLKPSGVLYPRPQTGSPS
>Noc_2267 A/G-specific adenine glycosylase MutY
MNKSDFSQRLLTWFDAYGRKDLPWQQNPTLYRVWVSEIMLQQTQVATVIP
YYQRFIERFPSLPALAHASVDEILGLWAGLGYYARARRLHQAARIAWETH
GGELPATLEALMELPGIGRSTGGAILALALGQRYPILDGNVKRVLTRQEA
IEHWPGQPKVEKQLWQRAATLLPRTRLADYTQAIMDLGATVCTRHRPHCP
SCPVKKTCQAHLQENPEAYPRSRPRKRLPLRATCMLILLNDQGEVLLERR
PPVGIWGGLWSFPECPPQTEAALWCQEQFGWPIGEVQHWPPLRHHFTHFT
LDIQPVIARIRGEARQVMEPNSQVWYKMEPMYKRGLPAPTLRLLKRLREP
SKGE
>Noc_2068 putative transposase gene of IS630 family insertion sequence ISY100d
MALKAHAQQFPDLYLHERAAIFDVHTSSMGRMLKKLGIVKKERQYKERCL
MKRQEFCKKLKAEHRMFGFKNLIDVDETGFDAPTPRPSSWAVKGCRIFGE
ITGQRKRRTNLLMAQRHGAKGREKEWLAPMLFKGSCNAQLFEMWVEQCLM
KELHEPTIVVMDNSSFHNHKRVQDILAKGYLTI
>Noc_0336 DNA mismatch repair protein
MAAPSIPRIQILPPALANQIAAGEVVERPASVLKELVENALDAGAQRIEI
ETEAGGIGLIRVRDDGCGIHHNDLPLALSSHATSKVRHGEELLNITTLGF
RGEALASIDAVSRLSLSSRMADNEHGWCIRENTPVQPIAHPLGTTVEVRD
LFYNTPARRRFLRGEKTEFIRLRTIQTQLALSHFEISFRISYNRRPFLTL
PACTCPPEQLKRITELCGRNFAEHSMYFKREIEGLCLWGWLGHPEFARSQ
TDLQYCYVNHRMVRDKLLSHAARQAYGNRLSQGRHPAYLLYLELPTHQVD
VNAHPAKHEVRFRESRQVHGFIVRTLAEILEQTEPEGEHRLASGEFRSHP
HEVLGKEQAGDTYLVAEVPGSYGPRKHGKHNPLSKGRNDAPSRFGQVQAF
VLGRYLLTENSQGLMLVDLPIARAHLAQARLRTAYAAGHIIRQPLLLPLT
FQVSLQQAEWTERHVQELRKLGLGLHRLGPQTVVLREIPAAIRELDLEGL
LLALLAQLTRQQHIMPAEIPLGELIVALTAQYPASTTSRPSLQEMNAFLQ
ELENLYQIETGLKAPLPWRELPEHEIAQWFLPS
>Noc_0625 DNA repair protein RadC
MASKSSPTQVIPKNRFARVRMASLNDPEKQSLLELAFAVLHDLHQPGVEL
PSPNHTRDFLRMLLAERKAEVFGCLYLDNRHRVIETVELFQGTIDGASVY
PRVVVQQALSVNAAAVMFFHNHPSGVAEPSNADEAITRRLKEALALVDIR
VLDHFVVTAGESISFAERGLL
>Noc_2543 Transposase
MTQCDSTTLQRSYTFRFYPTSVQRQQLAMEFGHARWVWNTCLTWRGRQYR
LHDKHVSGVDFSGHLTKLKKTAAYGWLKEASATTLNQKLRDQDTAFKNFF
AGRAKYPRFKKRAHAQSIRYQLDQRQVAGRYRAGKLLKLPKLGALSLKWS
RKPQGIPKMVSVTQDCAGCYCVSFMCEETLQPLPRKPNGIGIDVGISDVV
VTSEGWKSGNPRHLRTYRRLLTKTQRRLSRKRKGSVRWHRQRVRVAKAHA
RVSNTRQDWLHKLTTALIRQAGFIAMETLNVRGMMANRRIAKALGDVGMH
ELKRQLEYKAPWYGRAFRQVDRWAPTSKTCSACGAVQKAMPLKVRQWTCS
DCKSVHDRDINAAKNILRLATVGRTGSDARGGVYPLEVVL
>Noc_0046 Phage integrase
MIAECRDKLASGNITRYNQKNSGGQRSPGSVNRYLAALSHTFTIAVKEWG
WLEDSPMGRISKLKEPRGRVRFLSDSERERLLKACRGSSNSFLYPAVVLA
LSTGARRMEIMALRWRNVDLQRGLISLHETKNGERRALPLAGHALDCVKR
LSKVRQIDTDLLFPSNHNPQQPLDLRKPWEIALKQAGIEDFRWHDLRHSA
ASYLAMNGATLAEIAEVLGHKTLQMVKRYAHFSETHTARVVASMNAKIFG
E
>Noc_0545 Helicase-like
MTLWRFSSRTTRLDQSFLAEHLQGARAYRRIAGYFTSSLFEVAGEWLREI
PEVRIVCNGDLSPEDLRVAKVREIRLLGRWHEQAVEADALLNRDRYRQLH
AFLTARGPMIWVAPNTVCGFLHGKAGVIERADGRKVGFIGSMNETRQGWQ
THYEILWEDDSPEGVAWIEAEFAHLWQAAKPLPEAVIQEVGRHARRVEIP
LDDRVTPEEMAPAALAESPLYREGFSLQPWQQGFIAECLNHYRAYPAVRL
LLADEVGLGKTLSLGTAAVALCLLAEQAGKGRKLVAIFAPATLIEQWQTE
VMDKLGLPCGRWDSQGKIWLDPEARPVSPKGAAQIGRCPFRMGIISTGLL
TQPTQEREFLGNLTFELLVLDEAHKARTRQGLGKNAGAPNELLKFMRAAA
GRARHVLLGTATPIQTRPEDLWDLMGILHQGEGRFVLGEDLGPWHHPDRV
LPVLTGQERVTDEEQGWALLRSPLPPVSSSAEGSLRRLLRDIRLELAMPD
QRYEARVPVVNLPKDIREDLEDVLHEEREGTYFFQRHNPIVRHTVLRQRR
TLEAKALLPRIGVNVHPERRLSRDPSAFSVLFADRALRTGEAFDRAYKGA
EAFSRVLGQRGQGMGFMKNLLRQRVCSSCVAGLATAERMLAGRTEQETQD
EQEGDLTMQTQAERNELQVLAAPLRALAEDPKLKAIRHYLTVEGWLNHGC
IIFSQYYDTAAWVAGKLADLFPQERIGLYAGAGKSRLYHGGETVQMEREP
LKRLVAEREVRLMVATDAACEGLNLQMLGTLINVDLPWNPTRLEQRLGRI
KRLGQLRENVDMLNLVYQGTVDETIYERLSERMRDRYDLFGALPDTLKDE
WIEDITRLGEEMDRYIEARRQATGFDLRYNATLEPGEDSWRNCAKVFARR
DLVNLMGQGW
>Noc_0861 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_1772 Histone-like DNA-binding protein
MNKSELIESVADAGNLTKAAAARAVDSVIEAVTDALRRGDQVTIVGFGTF
SVRDRAARTGRNPQTGEEIKIKASKMPSFKAGKALKDAVN
>Noc_1991 Transposase
MKAYQLRLYPTLRQRRQLEEAFSACRYVWNWALDRRTRAYKEKGESLNAI
ALSRALTALKKEKVFLKAASATALTYVLKSQDEAFQKFFNKQARYPKFKR
RGRVHSCTFQLDKRRGEKVFMPGQLLRLPKLGPVRVVWSYQDIPVFPNSA
TVSCNACGQWFVSLQCDCIDVIHPPATDKTIGLDLGLSTLIAMSDGRKEK
PRRFLKNALRRLRFAQRRLSKTAKGGSNRRKQRSRVARLHQRIASKRANF
LHGLSTSIVRENQAIAIEDLNVRGVMANGKLARSVGDCGWYELRRQLTYK
AKWYGRQLNVVPRFQRTTGVCPDCGTVGEKLPLRVRSWTCGHCGSAHDRD
IAAARVIDLMGNTARSAGIDACGLAHKPEEAVS
>Noc_3021 Excinuclease ABC, A subunit
MNRICIRGARTHNLKNIDLDLPRERLIVITGLSGSGKSSLAFDTLYAEGQ
RRYVESLSAYARQFLSLMEKPDVDHMEGLSPAIAIEQKSTSHNPRSTVGT
ITEIYDYLRLLYARAGEPRCPEHGIILAAQTVSQMVDQVLGLPEGGRYML
LAPIVEGRKGEHLQVLENLLRRGFIRARIDGEVVELEQAPQLDGNKKHTI
EAVVDRFKVRPDLQLRLAESFETAIQLSDGLARVASLEESALAEQVFSAH
LACPVCRYSLSELEPRLFSFNNPKGACPSCEGLGVKPFFDSSRVVAHPEL
SLAAGAVRGWDRRNAYYYQMILSLARHYGFEVDLPFQDLPEAVRRVVLYG
SGQEKITFRYLDSQDKQTIRRHAFEGVIPNMERHYRETETSAVREELARY
LAVQVCPACQGTRLRQEARHVFVADYSLPEITALPVGQAQSLFSQLCLPG
RRGAIANPILKEIQDRLGFLINVGLGYLTLNRRAETLSGGEAQRIRLASQ
IGAGLVGVMYILDEPSVGLHQRDHQRLLETLIRLRDRGNTVIVVEHDEEA
IRAADQVIDMGPGAGRHGGEIVAQGTPLEIMANPASLTGQYLKGQKEIPM
PCQRVPFNTSRLLSLRGAHGNNLDQVDLDIPLGAMTCITGVSGSGKSTLI
NDTLLRAATRILHRASVESAPYESIEGLEHLDKVIAIDQNPIGRTPRSNP
ATYTGFFASIRSLFAGTHEARSRGYGPGRFSFNVKGGRCESCQGDGLIKV
EMHFLPDLYVACDVCQGKRYNRETLEIRYKGKSIDEILAMTVEEAQDFFA
NVPAVARKLLTLLEVGLSYITLGQNAVTLSGGEAQRIKLAKELARRDTGR
TLYILDEPTTGLHFHDIAQLLQVLLRLRDGGNTIVIIEHHLDVIKTADWI
VDLGPEGGEGGGRIIATGTPETVAACQASYTGRYLARILPKAKRGQSPVA
AKP
>Noc_0367 Holliday junction resolvase YqgF
MSATLDAPSPSKPRIVLGFDFGLRYIGVAVGQEVTHSANPLTTLKAHEGN
PDWNQITQLIRQWNPDLLIVGLPLNMDQSEQFLTKAARRFGHRLHGRYGL
AVEWVDERLSTVEARERLNIKSSASGRRQGIDQMAAQCILQTWLTEQQTI
RH
>Noc_1642 Transcription-repair coupling factor
MPPALNLASLAPVLPRQAGDHHRFGQLYGSSFGLVLAASAYYHPGPILVI
TPDTITANRLEDELRFYRNGQEDSPILHFPDWETLPYDTFSPHQDILSER
LATLYQLPRLERGILIVPVSTLMQRLAPQEYLETHSLLVATGDHLHLENW
RKQLEKGGYRCVSQVMEHGEFAIRGSLIDLFPMGSTLPYRIDLFDNEVDS
LRSFDPETQRSLQSVAQIQLLPAREFPLVEETITRFRKNYRGAFNGDPQR
SLIYREVSEGHPFPGIEYYLSLFFDHTDTLFDYLPNNTLAVTVEGVNTTA
ESFWREINARYEQYRHDIERPLLPPPKLYLQAGEVFSSLKHLPRISLQSS
KVEEKAGYQNFSTEAPPSLMLNARISQPLKTLNQFIKSFTGRVLFAAETA
GRRETLRDLFKDSGIRPHFFENWETFLQAEERLGITVAPLQHGLLLSEPR
TAVVAESQLFGQQAMQYRRRKERTRDADAVVRDLVELSIGAPVVHEEHGV
GRYLGLQTLEVGKVRTEFMALEYAGGDKLYVPVSSLHLINRYTGATPEAA
PLHKLGSNHWERAKRKARERVRDVAAELLAIYAQRAARKKLPLPTPDSHY
TAFARAFPFEETPDQADAIQAVIADLTSDQPMDRLVCGDVGFGKTEVAMR
ATFIVSQAGKQVAVLVPTTLLAQQHYQSFKDRFADWPARVEVISRFRSRK
EQEAVISGIADGRADIVIGTHKLLQENIRFKNLGLVIIDEEHRFGVRQKE
RMKALRTEVDILTLTATPIPRTLHMSLSNLRDLSIIATPPARRLAIKTFV
RQWNDNLLREALLREIKRGGQVYFLHNEVESINKMAQRVQTLFPEAKVGI
AHGQMRERELEQVMLNFYHRRFNVLICTTIIETGIDIPSANTIIIHRADK
LGLAQLYQLRGRVGRSHHRAYAYLIVPPRSVMTADAIKRLDAIESLEELG
AGFTLASHDMEIRGAGELLGKDQSGQMQEIGFDLYHDLLERAVNSLKSGQ
ALDLEQPPEQGSEVDLHAPALIPEDYLPDVHTRLVLYKRIATAKNHQALT
ELQVEMIDRFGLLPEATKTLFATHELRLKANEIGIRKIEAGAHGGRIHFQ
SEPKVDPMAIIDLIQTQPSVYKLDGQEKLRFTRKLPTVQARLETLEKLLK
ILIMKKAA
>Noc_1184 Transposase, IS605 OrfB
MIIQCAYKFRFYPTPTQKRQLALEFGHARYVWNWALETRTKAYQAQGESS
NTISLSRQLTALKKTQCPWLSEATASCHTQKLRDQDTAFRNFFAGRAKYP
RFKKRHHTQSVRYQLDQRHVAKNFNAESKLLKLPKLGRVKLRWSRGIEGI
PKMVTVSQAPAGRYFVSLTCEVEILPLPVRRNAIGVDVGVKDVVITSEGW
KSGAPKYTYHYARQLKMAQRRLSKKCKGSHRRRRQQVRVARIHARIKDSR
RDFLHQISSTLIHENPVICLEDLNIQGMLRNRRLSKAVADCGLYELRRQM
EYKAAWYGRDVLIADRWAPTSKT
>Noc_2593 DNA-directed DNA polymerase
MSYQVLARKWRPRDFTQVVGQEHVVRALTNGLDKGRLHHAFLFTGTRGVG
KTTLARILAKSLNCKEGVRSTPCGKCQNCQAIDGGNFVDLIEVDAASRTG
VDDTRELLENVHYAPSRGHYKVYLIDEVHMFSTSSFNALLKTLEEPPPHI
KFLLATTEPKKLPVTVLSRCLQFNLRRITPKAIAEHLNSILEAEEIPSES
YALTLIARAAEGSVRDALSLLDQAINYGRGQVMVADVRTMLGSIEQGDLF
ILLDALLAGNGQGLIEKVREICAYSVDISSILADLLHLLQRLALYQLAPD
TVDDIDERGIFSSLAARTTPEEVQLFYQIGLIGSRDLTYAPDHHTAFEMI
LLRMLCFRPADGASSCQALTNPKPEKTGLASSDPPLAQETTSPPLSGNDH
WPGLVTQLKLTAIARQLAENCALERREEGIIYLQLAPSMANLHSKRAEER
LQQALEEYYGEAIQLIIRIADSTLGTDTVASRRKQEDATWQQAAVESIQN
DANIKALSETFNARLPLDSVRPLTKPKQE
>Noc_1199 Transposase, IS605 OrfB
MEIVRTTKTRLDWDVAAAKRTVEAWSAACNDISQQAFAQGCLSNTVRLHR
LVYRDIRTRFGLSAQVAQNAIRHVASKYAGARIKKIQLKRPVTFSKQCAV
ALQGGERGRDFGFRHKGVSLWTVDGRIKGLPFHGEPRLCEYLSEWKMGDG
RLFIGKGKVYLSISFKREVETVFKPNDAVVGVDRGIRVLATVTDGQRQLF
FGGGHTHHVRNRYAKTRASLQKKKARTGSRSTRRTLKRLSGRERRFMRND
NHVMSRRIVDFSRDTGNPTIAVEDLGGIRNGRKLRKQQRTDLNRWAFYEL
EQFIRYKADTFGMEVIGVDPKYTSQGCSRCGHTEKDNRHQHRFLCKACGY
ELHADLNASRNIRLRGILARQVLCEDGSLSCGPEARLVDPGSKPGEGAGK
PSALAVTVHD
>Noc_1946 Transposase IS200-like
MPDYKRLTHTKWGCKYHVVFIPKKRREWIYGNLRKYWGEIFRALATRRRV
EIVEGPLMRDHVHICLNIPPKYSVSQVVGYLKGKSAIAIARRFRGKQRKF
NGEHFWA
>Noc_1189 RecJ exonuclease
MAEKVLKQRPVNELEWPEAIHPILRRVYGARGIKAPDELDYTLERLPSPW
LLSNIKMAVTLLMEALVRDWRILVVADFDADGATSCAVAVRALRLMGAHK
VDYLVPNRFIHGYGLTPAIVAEAMARGQPDLIITVDNGISSLAGVQAARA
ANIRVLITDHHLPGISLPAANAIVNPNLPNDPFPSTCLAGVGVIFYVMLA
LRAHLREQGWFIRRDEQEPALAPLLDLVALGTVADVVPLDQINRILVAQG
LARIRQSRCCAGIQALVACARRPLETLTTSDLGFAVGPRLNAAGRLEDMS
LGIACLLTDSLELAQQQANQLDGLNRERREIESTMQEQAVTHLENLVFQG
EERAPLGYCLFDESWHQGVIGLLAARIRERVYRPVIAFAPHDSEELKGSA
RSIPGLHIRDALDRVATCYPDLLTKFGGHAMAAGLSLRRGHLEPFRVAFL
EVLETLLDKEALEDVILSDGSLEQWDLEMAETLRNGGPWGQGFPEPLFDG
VFRVAGFRIVGEAHLKLTLTTLDGRQQLEGIAFRCLPPDGFALGIKIRLA
YRLDVNIYRGSRTAQLMVEHLELI
>Noc_0442 Transposase
MIIQCAYKFRFYPTPTQKRQLALEFGHARYVWNWALETRTKAYQAQGESS
NTISLSRQLTALKKTQCPWLSEATASCHTQKLRDQDTAFRNFFAGRAKYP
RFKKRHHTQSVRYQLDQRHVAKNFNAESKLLKLPKLGRVKLRWSRGIEGI
PKMVTVSQAPAGRYFVSLTCEVEILPLPVRRNAIGVDVGVKDVVITSEGW
KSGAPKYTYHYARQLKMAQRRLSKKKKGSQRRRQQQQRVARIHARITDSR
RDFLHQQSSKIVNENQVICLEDLNIQGMLRNRRLSKAIADCGLYELRRQM
EYKAAWYGREVLIVDRWAPTSKTCSACGAVQESMPLKVRAWACECGATHD
RDINAAKNILFFGTAGSAGTSKARGAVKPPRAVA
>Noc_2966 Histone-like DNA-binding protein
MNKTELVNFVVLKANLSQATAQRAVNALFQTITGVLSEEGRVNLVGFGSF
SVQKRAARSGRHPQSGEVIIIQAKSAPTFKPSKALKEIVQQRKQ
>Noc_2002 UvrD/REP helicase
MPNLNPQQRLAVRHIDGPLLVLAGAGSGKTRVITHKIVYLIEQCHLSARS
IVAVTFTNKAAREMKSRIGQLLTKGESRGLVVSTFHALGLNILRREHEIL
RLKAGFSLLDAQDSRALICDLHQQEFSSGGEESSFQWQISTWKNALVTPE
EALCRASNDQEAIAAQLYAAYDRRLRAYNAVDFDDLIGLPVHLLTTRPEI
LSRWQNYFRYLLVDEYQDTNAAQYQLVKYLAGVRGAVTVVGDDDQSVYAW
RGAQPENLHQLKEDFPQLTVIKLEQNYRSTTRILRVANQLISSNPHVFEK
RLWSALGEGDSIRVLTCRDEHHEADRVVAELMYHRFKYRTACRDYAILYR
GNYQSRPFERALRAHGIPYVLSGGTSFFERGEVKDIMAYLRLLANEDDDN
AFLRVANTPRRGIGAVTLEKLAGYAALRGQSLLVSGFELGLGEHLSGEAL
PRLRRFCEWVVDLADRGRRGDPIAVIKDLIADIDYRAWLDENCNDRRTAE
RRMANVEELVGWLERLYQRGDERRALGDLVAEISLQDILERTQEKKDRDA
VNLLTLHAAKGLEFPYVFMVGMEEELLPHRTSVEQGTLEEERRLAYVGIT
RAQRSLCFTMAEKRQQYGETILCEPSRFLSELPAADLQWEREGIPRDPAE
RMERGQVHLANLREMLRQ
>Noc_0078 Excinuclease ABC, A subunit
MSKAFIKVKGARQNNLKGLNLKFPLNELIVITGVSGSGKSSLAFDTIYAE
GQRRYVETFSPYARQFLDRMDKPQVDRIEGIPPAIAIDQINPVRTSRSTV
GTMTELNDHLKLLFARAGKLYCQGCGQGVRRDTPEGIYKELLLHSQEQDD
TPRILITFEITVPENFSFQEVKDQLLRQGYTRFHHQHDQVLEAIQDRVRL
EPSRRERIMEALETALKHGRGHVTVYPLDEHRQPQQPWRFSSDLHCPQCD
IAYQDPFPSLFSFNSPLGACGTCRGFGRTMGIDYDLVVPDENKTLAEGAI
KPWQSASYEECQQDLMRFARKRGIPTRLTWRELTPEQQHWVLAGEGEWEE
NKWYGVQRFFDWLESKSYKMHIRVLLSKYRAYHLCPTCQGARLKPESLLW
RLGTKAGADEVLPDKRRFMPAGVRFSQRRLAALPGLTLHDVMQLSLKRCQ
QFFAQLTLSAPLDEAADLLLGEIRSRLNYLVEVGLGYLTLDRQSRTLSGG
EVQRINLTTALGTSLVNTLFVLDEPSIGLHARDLQRIIRILQRLRDAGNS
LLLVEHDSQIMLAADRILDLGPGPGERGGQQVFFGPPQELMQAKRSLTGQ
YLTGKKRVVSERRKTRKLPASAYVEIRGAAEHNLKNIDVRFPLNGLVCVT
GVSGSGKSTLIQEILYKGLRKLKGKPVGTPGRHQALEGHEHIQTVVLVDQ
SPIGKTTRSNPISYVGALDGLRKAFAAEPLAQERGYHAGTFSFNTGKGRC
PTCGGRGFEQVEMQFLSDIYLRCPDCNGQRYRPEILEVKLQPSAQGLGKS
IAEVLAMTVAEALDFFADYPEIKRGIEPLQAVGMGYVTLGQPVPTLSGGE
AQRLKLAGHLAKARGNKQGRNTLFLLDEPTTGLHFDDIATLLQAFYRLLA
EGHSLVVIEHNLEVIQAADWIIDLGPEGGEGGGKVIAMGSPQKVARSQHS
HTGQALRDHAQAFDEIFQAASAVGEAPAQYRTACGTAQTEAAPQESGAAA
EGNGFIYIHNAREHNLKNIDVQIPRERLTVITGISGSGKSTVAFDILFAE
GQRRYLESLNAYARQFVQPAARPDLDAIFGIPPTVAIEQRTSRGGLKSTV
ATLTEIYHFLRLLFVKLGQQHCPDCKIPIEAQTLEAIQARLMGDYRNRRL
GILAPLIVARKGYYTELAKWAAAKGFTHLRVDGTLLPTDPWPRLDRFKEH
SIELPVGELKVTPAAEKELHALLEQALNLGQGRVEILLPQKKQKILYSTQ
RACPSCSRSFGELDPRLFSFNSPQGWCPRCYGTGRVLPNFDGEQTGEEKE
WAEKQGNPGQICPTCQGQRLQPEALAVRFQERNIAEFTGMPIAVAEATFQ
SMKLQGRAADIARDILPELRTRLKFLREVGLSYLTLDRAAPTLSGGEAQR
IRLASQLGSNLRGVCYILDEPTIGLHPRDNRMLLNTLGKLEGQGNTIVVV
EHDEDTIRRAEHVIDLGPGAGVNGGKVVAAGTVEALLQHPESVTGRCLAQ
PLPHPLMEKRRPETAEAHLEIRGASLHNLKSLTLRLPLRQLVGVSGVSGS
GKSTLIRQVLHGNLLRLLNRKQTAQHKETTTFQGCQEIRGWKAIQRVLEV
DQTPIGKTPRSCPATYVGFWDSIRRLFADTPEARIRGYGAGRFSFNTKEG
RCPECEGQGVKRIEMSFLPDVTVPCESCRGARFTQETLMIRYKDKNIGEI
LALSVDQAVEFFAAHKRIHRPLQLLQAVGLGYLSLGQPSPTLSGGEAQRI
KLVTELAKAPLNPYRSANQHTLYLLDEPTIGLHMADVEKLIRVLHQLVEV
GNTVIVIEHNLDIIAEVDWLIDLGPEGGDGGGQIVAQGTPETVAKVKKNS
YTAKFLTRFLKTRRAIN
>Noc_0209 Exodeoxyribonuclease III xth
MVDWLEIHQPDVLALQETKLVDDSFPQEAFKEIGYHAAYSGQKTYNGVAI
LCRQAPKDILTDLPNLVDSQRRILGVTVDDIRLLNLYVPNGSEVGSKKYA
YKLDWLGRIKDYLQEALVEYPKLIVLGDFNVAPADQDVHDPDIWHETILC
STPEREALKEILALGFQDSFRLFEQEAQSFSWWDYRGGAFRRNRGLRIDL
ILISKALVPKCTGCVIDKEPRRLTRPSDHAPVIATFA
>Noc_2648 DNA-formamidopyrimidine glycosylase
MPELPEVETVRRGIEPHLVGRQIHTVIVRESRLRWPIPLSLTQNLIGQSF
LAVGRRGKYLLLSCTQGTIILHLGMSGSLRLVTTNTPHGKHDHLDIVLNN
GRCLRFNDPRRFGSVSWTQANPLHHPLLEILGPEPLESLFDGHYLFKHSR
HRRTSVKAFIMNHRIVAGVGNIYANEALFLAGIHPRRSASRIGLARYQRL
AETTKTVLYNAIQAGGTTLRNFLTSDGKPGYFANQLQIYGRSAHPCPICG
TPIRLERIGQRASYYCTQCQH
>Noc_0140 Holliday junction DNA helicase RuvB
MTLNRDMVSPQEDRGEKAVEFSLRPARLADYVGQPQVQEQMEVFIPAARA
RHEALDHVLIFGPPGLGKTTLSHIIANELEVNLRQTSGPVLERPGDLAAL
LTNLEPRDVLFIDEIHRLSPVVEEVLYPAMEDRQIDIMIGEGPAARSIKL
DLVPFTLVGATTRAGLLTSPLRDRFGIVQRLEFYAVDHLVLIVERTARIL
GMAMEKEGALEIARRSRGTPRIANRLLRRVRDYAEIKGDGQVTRQVAQKA
LDLLDVDSHGFDTMDRKLLLAMLEKFDGGPVGVDSLAAAIGEERGTIEDV
IEPFLLQQGFVMRTPRGRMATRHAYLHFGLKPAVKMVPQEVSDLFPNE
>Noc_2104 Methylated-DNA-(protein)-cysteine S-methyltransferase
MNASTKNMQLSTAIANDPRWASAVARDPKVDGKFYCNVEIRFAIGKCSLG
ATLVAQSNRGICAIFLDDDPEKLVHDLEDQFPRANLVGGGAQFEQLIAEV
VSFIEAPSIGLNLPLDIRGTAFQQRVWRALRGIPAGSTASYSAIARHIGA
PKSARAVARACAVNTLAVAIPCHRVVRSNGNLSGYRWGLERKRALLKKEA
KDLEFIHKCQA
>Noc_0876 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_1146 Integration host factor, alpha subunit
MALTKADMTETLYQELGLNKREAKEIVEMFFEDIRCALEQGEAVKLSGFG
NFELRDKGERPGRNPKTGEEIPITARRVVTFRPGQKLKARVEAYAGGKQ
>Noc_1289 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_0019 DNA gyrase, B subunit
MKASTAYDSSHIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNSI
DEALAGFCNEISIIIHSDDSITISDNGRGIPTDIHQEEGRSAAEVIMTVL
HAGGKFDHNSYKVSGGLHGVGVSVVNALSAVLFLEIHRDGKIYQQHYQEG
VPQGPLVAVGTTERSGTTIRFSPSPKIFSNIVFSFDIFAKRLRELAFLNS
RVRILLEDERIGRKEEFFYEGGISAFVEHLNKNKTPLHGNVFYFQGERDG
VAVELAMQWNDSYQEHFFCFTNNIPQRDGGTHLAGFRSALTRTLNQYIEK
EALAKKTKVGTTGDDAREGLTAVLSIKAQDPKFSSQTKEKLVSSEVKPVV
DSLVSEHFQNFLLEHPVDAKAIGGKMIEAARAREAARKARELTRRKGALD
MAGLPGKLADCQERDPTLSELFLVEGDSAGGSAKQGRDRRNQAILPLKGK
ILNVEKARFDKMLSSVEVATLITALGCGIGREEYNPDKLRYHRIIIMTDA
DVDGSHIRTLLLTFFYRQMPELVERGHIYIAQPPLYKVKRGKQEQYIRDD
AEMERYLINLSTDNAKIYSENGATSIAGETLSTLFTDYFQLRRAIERLSY
RYDPLLLDQMLYLDVVPTPADPMIEIKQLEEWATQLEIRSNSAAKGNARY
EIEVATVPENSAPVLQVKITRHGVTAYQRIYMDFFTSPDYQRIQNLGKRL
VNSVKAESYVQRGDRRQLVNDIQEAVDWLMEEARRGQAVQRYKGLGEMNP
DQLWETTMNPATRRLSQVRIDDVITADEIFTTLMGDHVEPRRDFIESHAL
SVAHLDV
>Noc_0920 Superfamily I DNA and RNA helicases and helicase subunits-like
MNTSHLLVNDQPVGHPANQKVVRLIHYLTRLASLRSKLIRDLTEYEKILW
ISDVPHEHGCFTRAWGQEEEKELDEWLAVQNRREPELPIIPEQCKEWVNQ
ATLREKDRLPELLPEIIRQSQDPDGRKELDRPATISMTERLEEHPGVQQA
WNRYLENKWLPWMEEHNIWEQIHKVYSALFAIHQAQLRLGEEYELVLGLG
LLTWQTPTGQHVRRHLVVANALLEFEARLGKFTVRPYTESVELRPELDML
DIEERPAHAEETAKSSLSGAEDDPWAKERIDSVLQALVHSINSQGTYDDS
LEMKNIRASARPVVEYAPALILRKRSTKGLSEILGRIKKQIESDENVPSE
FADLAEISARDERELDNDGLEGLRATFNGEVYFPKPSNDEQRRIIDKLRS
ANGVLVQGPPGTGKSHTIANLICHLLATGQRTLITAKTPRALKVLEELVP
GGLRPLCINLLGDGPEERRSLEASVGGILRKSEEWEEEDAERQREELETR
LQELREEKAKINRRLHDIREAETHPQSVAEGTYRGTAARIAESVNKNRST
FGWFTDSVPLDKTLQISEIDLREILMALRHFTPEKREELNLAWPDSVPSS
ERFVQLVQNEAKASEEERRLADRADDHVADLLANHSSATIENIRDALARF
RDTRRKLIMASHSWMKEALRDILSGNSALWRELFRVTDHAISEVEGLVAI
ADNTSIEFLESIDIRVLREDARKLKEHMESGGKLGWGPLRSKRVKERLYV
IKSVKVDGRSCSTIEHFSILAAVLHVYIECEKAWGFWAGQGDKSQEPYVL
QLTMLKSLRGALEEALSLQGIIDKCRKAVQECPALDEPSWNDESKIERII
ASCRLALARISKQLAAAELQGIETPISRIAAGGNAHPVTIHVLHAIRDRE
RDGFAQCMSKIQDLEKQYQYLLKRDEDLSKLRQLLPQLADCLERTCNEPY
WEERVRHIGNAWHWAQARYWIGDYIRREDVPALGKRVKQIDDTINDIIAK
LASLHAWSFCFSRLTESHRRHMEAWQQSMRRLGKGTGKHAPRHRREAQGH
LNECREAVPAWVMPLHRVWDTVSPTPGMFDVIIVDEASQCGLEALPLLYL
GKKVVIVGDDKQISPESGFVGKEAMFQLMEQFLYDFQYKDYFHHDASLFD
HGKLRYGTRRITLREHFRCMPEIIRFSNDLCYSDTPLIPLRQYSSNRLPP
LEHVFVTGGYCEGSKNRTINRPEANAIVTRIAELCKDSRYDHKSMGVVVL
QGEAQAYLIENQLAKRLGPEEMERRRLVCGNPYSFQGDERDIMFLSLVAA
ANKRIGPLTKAADERRFNVAASRARDKMILFHSVTFEDLSASCLRRRLLE
FFANTQPQQIAGIERDELERRAAQDNRRIVNPPAPFDSWFEVDVALELLR
KEFRILPQHEVAGRRIDLVVEGGQARLAVECDGDHWHGADRYEADMQRQR
QLERCGWEFFRVRESAFYANKDDALASLWRMLEEREIFPISQCVDLTPKV
NMEEEGYNKSDSDASAADHASYSLNNVEKSIHPSGRRSEEITPSEIEDAI
LQALSKRPNQSCTLDSLTARVLKEIGVSTRGKPREKFERRTLQGVNLLKR
RGRIETYKAKNQRLRLIHQEKT
>Noc_0687 Phage integrase
MEKKTADFWQARKDFLAHLHYAKGYSQGTCYAYHSDLGIWGRWLEEASKD
WRQATHLDTEQFVAWQMRKRGTKAHIVARRSSCLGSFYKWAMKNALVESD
PIYLADKPKRPYRIPVWLEKEEQRAFQEAVQRVEDLPENIFGRTQEHIKA
VRRRYDVLFGLILNSGLRISEALAVKVRDVRMVNGVAKSVRIIGKGNRER
LVPLPEAFGQVLGAWLQGRGGEDFVFAKAPGEKPPGPHAVRAYLRRLIER
AGIDKPVTPHKLRHTYATRLLESGAELVDIQALLGHVDLSTTQIYTHVSE
ERMAGIVAKL
>Noc_2211 Transposase, IS605 OrfB
MKRTVSIKLVPTPEQAQALLELQSELAKACNLIVPFARDNRCWNRVALHH
LAYYPVREATRLGSQMVCNAVKAVADAYKVLKLGRHDEIPVIRFRETGSV
HFDARTYRLKGDAVSLYTLTGRAIVKMSPGEFQAQYLAAGKPKEGKLVRR
GKQWFFNLVLDWPDTAPAKGSGILGIDLGENNLASTSSGKILGGGPLRHV
RDRHLALRRRLQSNGRQSARQLLKKVSGKERRHMRQVNHEASKAMVGEAL
KQGASTIVLETLTHIRKRIKGGKRLRARLHRWAWRELQDFIAYKAEAAGI
RVIYVNPAYSSKSCSACGCLGSRIQHKFSCSSCGHLAHSDRNAAVNLAKL
AKSIGIARLGVAPAHVAVPTH
>Noc_0599 UvrD/REP helicase
MDISSLLNPLNKAQREAAAAPPGHHLVLAGAGSGKTRVLVHRMAWLIRSQ
GIAPVNLLAVTFTNKAAGEMRGRIEELLETPVGGMWVGTFHGIAHRLLRA
HWQEAQLPQDFQILDSEDQYRLIRRILQNLNLDESRWPPRQAQWFINSHK
DKGLRPQHLEEGTNPHVRQQIRIYHDYQSHCERSGLVDFAELLLRAHELL
RDHAHVLQHYQNRFTHVLVDEFQDTNAIQYAWLRLLAGHQGELFIVGDDD
QSIYGWRGAQIENIQQLSRDFPAIRTLRLEQNYRSTGVILAAANAVIANN
TERLGKNLWTESEDGEPIQIYQAFNERDEAHFITERIHAWKAQGGMGADT
AVLYRSNAQSRIVEAALVEAGIPYRVHGGLRFFERAEIKDALAYLRLVTH
QNDDSAFERIVNTPPRGIGERTLSQVREHARQAGVSLWQATVQLVAQQHL
PGRSATALRHFHSLMERMATDIMGLSLPAQIESVLDHSGLLNHYRKDRGE
KSQSRLDNLKELINAASQFKPEDPSLETLSEFLAHAALESGGTQAEGWED
CVQLMTLHAAKGLEFPLVFMIGMEEGLFPSPQSRNEPGRLEEERRLCYVG
MTRAQRHLYLIYATRRWLYGSDSYPQPSRFLHEIPTELTSEQQPHIDIIH
PGGAPQPTAPVSSIDGFRIGQRVNHPKFGEGVVLNLEGKDNQRRVQVNFI
QGGAKWLVVIYANLQII
>Noc_2873 putative transposase gene of IS630 family insertion sequence ISY100h
MKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPKGRRVYGLIS
GHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKELCPLLNSNHI
VIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKDFGNIKKIRE
YNEHETLENIVAAYQ
>Noc_0649 DNA topoisomerase I
MHPQMDFSPIIQQVFNALWYLIPLAILAGIFKSPWFKGIAGEFLVNTAAR
LFLPKDEYHLIKNVTLPTDDGTTQIDHIIVSRYGVFVIETKNMKGWIFGS
ANQRTWTQKIYKHTNKFQNPLHQNYKHVKTLEVLLDIPASAIHSLVVFVG
DSTFKTDIPDNVTYAGGYIRHIKSRREVVLSQADVEAVTAQIEQLRLQRG
LTTNRQHVRHLRQKNAPSPPTTPPTTPQCPKCGGAMVLRTARKGKSVGSA
FWGCATFPACRGVMKQP
>Noc_1905 hypothetical protein
MQNKLNRLQVFSFALVIVSLIPAIAVTACAKSADQKQAASIDQKQAAPAL
AFDQGQPLYTIDIVSAAGVEILDHIVQARADIHNGDIKQAKEELSNAHRH
FEAIRAIEPTTQIKKHISIVAKYLGYGEKEEIQPYFIPLYASLDTIADLV
PISSAKAHIKKAEDNMNKNQREAAAQELSEAKQSLIYTEIDLPLAATEGN
VTMAQQALAENKPEIADKALKAAEQSVMLFSFGSTETATSARNNLSMANH
NYTAGKYQAAKLDLAAAIKNLEEATASNNYLTANLADGLLQEAKAIEPAI
EKKSNETTTRLQALLEHTNALSEHELELMGIGWNDLPKTIEQTRQALANA
KLYLNYAEIDQLTLHDQDKTQEDLKQAQSYLQKGAENETAHKGVIENIEK
KVQALQKTVGEKRDENASQQYADTLAQLRLIINNKL
>Noc_1565 phage integrase
MRAAIRVRHNSIRAEQAYRGWIKRFIFFYGKRHPGDMGKVEISAFLTHLA
VKGKVAASTQNQALNAILFLYRGGLKQDVEWLDEVERAKKPSRLPVVFTP
NEARKVLALPPQGK
>Noc_2022 Ribonuclease H
MLLDGGILSTSVLGQRVAGVDEVGRGPLAGPVIAGAVILDPECPISGVKD
SKQLTAPARERLAALIQAQAVAWALGRAEAAEIDQFNILQASLLAMERAI
SALSVVPDLVLVDGKHCPPTVCPVRAIVKGDQQIMAIGAASIVAKVARDA
EMIAFEESYPGYGFGIHKGYPTRAHLAALKALGPCSIHRRSFRPVRRFLE
A
>Noc_0076 DNA helicase II
MNFNIGWIARYRNFDTMATFIPALTTIPRMTRGERRFGRRLDSLLEEDYL
VWYDIPLGRRRRYPDFIILHPARGLLFLEVKDWKIETIRSITPDSVVIDT
QEGRKTVSNPLAQARQCAFAAIDQLKRDPQLTQSDKCYRGKLCFPYGHGV
VFPNITRRQWNQAIPEAEQEILLPAHRVICKDEMLTTADPEAFQQRLWNM
FDYRFGEQFSVPQLDRIRWQLFPEVRIDAPTTDLFGNDEAAEDEPASNLV
PNIVRVMDLHQEQLARSMGDGHRVIHGVAGSGKTLILGYRCLHLAQAISK
PILVLCFNITLAARLRCFIAEKGISEKVKVHHFHEWCSLQLKTYQADLAP
GKGPIWERQVESVIRAVDQSRIPRAQYGAVMIDEGHDFEQAWLKLVVQMV
DPDTNSLLLLYDDAQSIYQKSSLKFPLSSAGVQARGRTTILKLNYRNTRE
ILTFAYDFAQDFLKAHDADDDHIPLIAPEVAGVSGPRPAFRRLSSPRDEA
RYLVRCIQTWRSQGSGLNSIAVVYTGNSQGRLFYDALREASIPSRCLQQS
ADKRSYDPQADEVVLLSRQSSKGLEFDTVLLCGLGALSNDEERLAQEARL
LYVGMTRARRRLLVTSCKPNWYTQRLTELASA
>Noc_1093 Transposase
MLKATKIRLYPTREQAEFLNRQFGSVRFCYNTGLRIMSHRYKRHGQSLSA
KYDTKKLLPVAKKSRKYGWLKEADSVALAQACINLDKAFQRFFKEKKGYP
RFKRKRGKQSSYHCMSVSCGESWVKVPKLGPIKARVHRSVEGKLKSITLS
RTVTGKHYASLLYETEQPVPEPMTAIDATKVLGLDMGLSHLAIDSTGRKV
ANPRFIKQVQKNLKRKQQSLSRKQKGSSKRAKARLLVAKAHERVADARSD
FQHKLSRQIVDDNQAVIVETLKVNNMMKNAKLAKHIGDASWHALIAKLAY
KAKEQGKHLVKIDPWFASSKTCHVCQHKMDAMPLNIRSWACPTCHTRHDR
DINAALNIQHQGILKLKAEGLSVSAHRGLRKSGMPPVAAVEVGSSVR
>Noc_1931 Conserved hypothetical protein 95
MPPRSRAKPLRSQVRIIGGLWRGRKVDFSARPGLRPTPDRVRETLFNWLQ
PVISGARCLDLFAGSGVLGLEASSRGAAAVVMVEKDLRAYQAIQLQVEHF
SAEKIEVIAGDALAYLRGSVRSFDIAFLDPPFESGLLEPCCAYLERGGWL
VPGAYIYLETRRRDPLPSLPVTWTLLHSKEAGEIGYYLARRISNVSGAET
E
>Noc_1986 hypothetical protein
MVKSSNPLNIHWNFKNGLPPEPLSITISKDAAGRYFVSMLCD
>Noc_0657 Transposase
MRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYERP
GPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLTR
KKNDGVHAAL
>Noc_1034 hypothetical protein
MRVIHTSDWHLGQYFIGKSRKRDRQGETLPIIATGHLATVGNEFLSKLIK
KTIVYCISTLAEGFTGYDVCEILAAVKQGVTEWTGQCILDLLI
>Noc_2256 Methylated-DNA-(protein)-cysteine S-methyltransferase
MVHSVYGAIVATPLGKLGLSTSGNILIGLDFLPPNIPEYFPSDSTARVAL
AQLQAYFADPRVMFTLSLLPQGTAFQKRVWHTLRLIPPGSTVTYGKLAKK
LKTSPRAIGAACRSNPLPIFIPCHRVISSQGLGGYSGATEGPYLDIKAWL
LRHETEGG
>Noc_0794 Recombination protein O, RecO
MRVVLQPAYVLHSRPYRETSALVEVFTPEYGRVGLVAKGVKRQRTHRFSL
LQPFCPLLLSWTGRGDLVTLTGAEAAGPIPVLTGEGLICAFYLNELLLRL
LPRRDPLEALFSVYAHSLPSLIHAQQRQQILRLFERDLLAYLGYGLILKY
EAGTSRPIEAGQWYSYQLEKGPVRLLSEDLEGMKVRGHTLQALARGALAD
PASLGEAKRLLRWLLAFHLGDKPLKSRGLLEELRRLGNVSRKEERP
>Noc_0704 ATP-dependent dsDNA exonuclease (SbcC)
MRIRQVRFKNLNSLVGEWEIDLTHPAFVSDGIFAITGPTGAGKTTILDAI
CMALYGRTPRLNKVTKRGNEIMSRQTGECFAEVTFETQTGRYRCHWSQHR
ARKKPDGELQAPRHEIANADSGEIFESKIRGVADQIESATGMDFHRFTRS
MLLAQGGFAVFLQAVQDERAPILEQITGTEIYSQISIRVHERQREEREKL
NLLQAETEGIVMLEPEQEQEIGQTLEIKRKEEADLTAKFADTGQAMAWLT
TIDGLKKEIVNLADEVRKLQNDIEAFRPDREKLNRALSAASLDGAYATLT
AIRKQQVEDREALKAEGEALPGLESSAKEQAESLKSAEQQTARVKEELKV
AAPTLQKVRSLDQELANLKKTAAEDKQDCQQDLEKIDTDKQARLEEQEKR
STAHGNLELVDSYLKEHAQDEWLISGLAGVEEQVSSLLSRQNEIHQKEID
QDKAAKALEQATKSLDDCQKQSDLRKQALEDSSKQLQQGKDALSQLLGNR
LLREYRTEKETLLREMAFLAKIAELEDHRAKLEDGKPCPLCGATEHPFAA
GNVPVADESEQKIDALTRLISEVEDQETAIKEHEKAESLAHKDLTEAEKQ
ESAAANGRKVAEKALAEVTDSLEKLRADFAERRQAVAAKLLPLGITDIPE
TDISLLPEILRARLKAWQAQVKKKADIEKQITDLDSEVKRLDAVIETQST
ALAEKLKRLESLKKELATVSDERNALYGGKNPDDEERCLNKAVADAEGVE
RWVREQHNELQQQWKTGKALVESLKKGIDQREPELSRLETEFFAALVSVD
FSNEEQYLAALLSSERRAELVTTAKDLDDCQTDLKARQKDRETHLATEMA
KKVTDQSIEELESQSKEYENTLKELRDIIASLKHKLSENMAAKERLKEKQ
GAIEAQKKECRRWKNLHELIGSADGKKYRNFAQGLTFEVMVGHANRQLRK
MTDRYLLVRDEAQPLELNVVDNYQAGEIRSTKNLSGGESFIVSLSLALGL
SHMASKNVRVDSLFLDEGFGTLDEEALDTALEALAGLQQDGKLIGIISHV
PALKERISSQIQVTPQTGGRSKISGPGCGGLSAAKWAKEAG
>Noc_1290 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_2815 Ribonuclease H
MTEIVEIFTDGACRGNPGPGGWGALLCYQGREKTLSGAESKTTNNRMELM
AAIRALETLKRPCRVHLTTDSQYLRQGITCWLSNWKRRGWKTANRQPVKN
IDLWQRLDQVAAQHRIEWFWVRGHEGHPGNERADALARSAITNGEEK
>Noc_2928 Site-specific DNA-methyltransferase (adenine-specific)
MKKIQPKEGESADIVSENIERLKELFPEAFSESGVNFDVLHQLLGDAKVL
DEGEEKYGLNWHGKKRARQIALTPSTGTLLPCPEESINWDTTKNLFIEGD
NLEVLKLLQKSYANKVKMIYIDPPYNTGKEFIYPDKFQENLDTYLKYTGQ
VDDEGMKFSSNTESTGRKHTNWLSMMYPRLKLAKQLLSQEGVIFITIDDN
EVATLRQVCDEIFGEENFVTSIVWQKKVSPSNDATWFSSDHDHILVYAKN
KLIWRPFKLPMNERQKSNYTNPDNDPRGNWNSATYTCNKDSDERPNLYYP
LVNPNTGQDVWPKKTAVWKYSREVSQKHAEENIIYWGKDGTSNSPRLKKF
LSKAKGVVPRTVWLYSDVGHTQEATKVLSELIDDIKFDTPKPVRLIEHML
RISTGGDSEEIVLDFFAGSASTAHAILNINANEGSNRRFIMVQLPELLES
ESYRSIADIGKRRVKEAGKKITNENPDATFDQGFKVFKLSSSNIQAWNPD
RQDLEQSLLSQQEHLIEGRSENDILYELLLKRGVDLAVPIESREVLGKNI
YSIGYGVLFACLDESINKDQVEEIGQSIVEWHRELAPSSDTHVFFRDSAF
SDDVSKTNMAAILEQSGITHVRSL
>Noc_3023 Single-strand binding protein
MASRGVNKVILVGNLGRDPEVRYTASGGAIANITLATSETWKDKTTGEQQ
ERTEWHRVVFFGRLGEIAGEYLKKGAKIYVEGRLQTRKWQGQDGQDRYTT
EIVASEMQMLDRATGGSAPYNEDNSMPRGGTAGHPPHSPSSPQPRPSAPP
SSSNDDFEDDIPF
>Noc_0063 Type III restriction enzyme, res subunit
MPWQYSTVHNSACKVIEEQTLWGQAVCRIWLPNQDAVVRVPRSALRPLSA
DLQPEIEAGRIAYVAAAAKVAEVLEGSTSATDGHVLLAPMESNVIPLPHQ
IHALSRAISGDRVRYLLADEVGLGKTIEAGLVMRELKLRGLVRRILVVSP
KGIATQWVAEMQTHFNEQFQLVLGDDISTLQRLAPGADHRNSAWSMFDQV
IVPLDSVKPMDKRRGWTAGRVAEYNRSRFEDLITAGWDLVVVDEAHRLGG
STDQVARYKLGKGLAEAAPYVLLLSATPHQGKTDAFHRLMNLLDEDAFPD
MDSVSRDRVAPYVIRTEKRKAIDADGKPLFKARRTQMAPVVWESRHHLQQ
LLYEAVTDYVREGYNQALREKKRHIGFLMILMQRLVVSSTRAIRTTLERR
LAALKEGEQQASLRLAELENSAGGSENTDDEITELYDMDGQELLDELLKS
HVLALQSEGSHVETLLDAAVRCEQAGPDAKAEALIEWIYELQAEENEPDL
KVLIFTEFVPTQEMLKEFLEARGISVVTLNGSMDMEVRGAAQDTFRKSHR
VLLSTDAGGEGLNLQFAHVIINYDIPWNPMRLEQRIGRVDRIGQPKMVRA
INFVFEDSVEFRVREVLEQKLSVIFDEFGIDKTGDVLDSAQAGELFEDVF
AQAFANPDGIETSVDQTVTRIRDEIQQVRESSAIYGISEELNVQAAEQLR
SHPLPHWVERMTVGYLNSHGGTASRKRSWWDLNWPDGQEHRKAVFNAREA
DRLTDATLLNLENSRVRGLALNLPQIAAGQPLPCVSVSGLPTSISGLWGL
FEIRLQAGMHQKTQLLRIPMVRRGYVSVFLSEEGKLFLPTARHIWDALQT
AEAQVQATLGRDESITAHERLRIAAEQAGQELFDALQQVHLAAVAYEEER
GIVSFASRRKAIERVGLPEVRQFRLARCDAEESEWRHELQSARQIVPEIR
SLLMLRIIKRGA
>Noc_1218 Micrococcal nuclease (SNase-like)
MNAPARMVAAGKLLLLGVLALLALAPSEAGVFRWIDSAGHTHYSDRPQPG
AQELKLNKLAAPYYYVQRVYDGDTLLLKGDIRVRLLGIDTPEIEGRYRLE
QAGGSSARDWLRQRIEGQKVRLEFDQERHDHYQRLLAHVFTVGGEHLNLL
LVEEGLAVVSIFPPNFKYGTQLARAQDRAEADRRGLWSMPDYVPQPILAI
PREGYQRGWRRYQGTPVAIRSSRKYARLVFSKQVEVRIPQAQLDLFGKLE
RYLEKQLEVRGWVSRRKENYSILVRHPSGLKLLF
>Noc_3025 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_1501 Transposase
MLKATKIRLYPTREQAEFLNRQFGSVRFCYNTGLRIMSHRYKRHGQSLSA
KYDIKKLLPVAKKSRKYGWLKEADSVALAQACINLDKAFQRFFKEKKGYP
RFKRKRGKQSSYHCMSVSCGESWVKVPKLGPIKARVHRSVEGKLKSITLS
RTVTGKHYASLLYETEQPVPEPMTAIDATKVLGLDMGLSHLAIDSTGRKV
ANPRFIKQVQKNLKRKQQSLSRKQKGSSKRAKARLLVAKAHERVADARSD
FQHKLSRQIVDDNQAVIVETLKVNNMMKNAKLAKHIGDASWHALIAKLAY
KAKEQGKHLVKIDPWFASSKTCHVCQHKMDAMPLNIRSWACPTCHTRHDR
DINAALNIQHQGILKLKAEGLSVSAHRGLRKSGMPPVAAVEVGSSVR
>Noc_0385 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNVPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_0002 DNA-directed DNA polymerase
MRFSIKREEIVRHLVTVCGVVERRHTLPILSNVLLSVKDSQLSLMATDLE
IEIRTALKVLSSEEGKVTVSARKFLEICRALPSGSALEAQYKEGQFYIHS
GRSRFTLSTLPAEDFPSIDSIDAVAELELTQAELKQLLHRTVFCMAHQDV
RYYLNGLLLELTEESIHAVATDGHRLALASLAKGADQEGTEIQSIIPRKA
ILELVRLLEESQEPVRLKFGTNQMRAEFQGLSFSTKLIDGQFPDYKRVIP
VGCEKQFVADRERFKQALVRVNILTNDKYRGVHLHLSDLKLQAIVTNLEQ
GSAEEELDIKYQGENLEIVFNNFYLIDVLNIIDTKEVRLTFTNASSSCLI
TPIDASDSKYVVMPMRL
>Noc_2582 DEAD/DEAH box helicase-like
MTNPDFSEISFDSLNLAEALIQGIQKAGFAICTPIQALALPLLLAGEDVA
GQAQTGTGKTASFLLATMQCLLREPSAGAGQQPRGLIVAPTRELALQVSK
DAQFLGQYTNLKCVAVHGGANYRKQRHLLEQGCDILIGTPGRLIDYYQQR
IIGFKKIQVVVLDEADRMFDLGFIRDIRYILRRLPPPDKRLGMLFSATLS
LRVMELAYEYMNNPQLLRIEPQKVTVDKITERVYFPANEEKISLLLALFK
RLVPRHAIVFTNTKHVAEKVWGYLEGNGFKAALLSSDVPQGKRQRLLAAF
QDRIYPILVATDVAARGLHISTVTHVFNYDLPQDPEDYVHRIGRTARAGA
SGLAVSFACEDYAFSLPDIEEYIGHKVVPFSVEEEELLVPEQRVFVSHKD
KKKSHRRAPHPRQKKKSFQKKETYSGGKNHT
>Noc_0017 putative transposase gene of IS630 family insertion sequence ISY100e (ISS1987)
MFNVGVSRRMHDALKRLGIRKKRVMPYEKQAFLGKLAAGDRTFGFRNLVY
IDESGFKAHCHRDAGWVDKGQQLLRLSSASANAAPTWAWPSAITPGGRKK
EWLAPMLLEGSCTSQLLETWVEQCLIKALHDPTLIIMGNASFHHHKRIQD
IVAKDYHDMIPLLPSSADLNPIEKTKIIHGTGGGN
>Noc_0638 Integrase, catalytic region
MAKHELSKTKACAAVQLSRSSWYRQPSQQAVRDQPVIDALNTMLKKYPRW
GFWMCYDRMRLDGHAWNHKRVYRVYTAMKLNLPRRKKRRLPQRVQQPMVV
EARANAEWSLDFMSDALYHGRRFRTLNVLDEGVREALDIVIDTSIPGARV
VRTLDRLIEWRGKPDAIRVDNGPEYISQVFSEWCEKHGIKLNYIQPGKPN
QNAYIERFNRTYRHEVLNAYVFESLRQVREITRAWIIEYNEERPHDSLGK
IPPAMFRRQVENARNSTLELCH
>Noc_1693 DNA ligase (NAD+)
MVAPAPAKERIQALKELINNYDYAYYVVNNPLVPDSEYDRLIRELQALEE
NYPELITLDSPTQRVGAKPVKSLGEIKHEIPMLSLNNAFHEGELADFHRR
VKTRLGIERVDYAAEPKLDGLAVSLLYQDGVLVQGATRGDGITGEDITHN
IRTIPTVSLRLRGEKIPSLLEVRGEVYMPRQGFEQFNREQIAKGEKPFVN
PRNAAAGSLRQLDPRITANRPLALFCYGVGQVEGGILPDRHSEILFQLKQ
WGLRILPYSEVVEELAGCEKYYQHLLDLRDKLPYEIDGVVFKVDYLDQQQ
ILGSLARAPRWALAYKFPAQEELTQILDIEVQVGRTGALTPVARLQPVFV
GGVTVSKATLHNEGEIQRKDIRIGDTVYVRRAGDVIPEIVKVIMERRLPD
SRPFQMPRQCPVCGSEIVKEEGGAVARCSGGLYCPAQRKEAIKHFAGRRA
MDINGLGDKLVEQLTKQGLLKDVADLYGLTKEQLAGLERMGQKSATNLIN
AIQQSKHTTLPRFLYALGIREVGEATAQVLAKEFGSLEALASVSEERLQQ
VTDIGPIVAAHIAAFFRQPHNRQIIQGLQKAGVCWPEVEDKVQIVQPLLG
RTFVLTGTLESMTREQAKERLQALGGKVNGSVSPHTDYLIIGANPGSKLV
KARNLGITILDETYFRNFLDDTSFP
>Noc_0028 Primosomal protein n
MAVPPILRLAIPSPLRRYFDYLPPAKTPYQQLQPGIRLQVPFGRRTLIGI
LVTIASQSEIETSKLRRAQYCLDKTPVVPNSLLQLLTWAASYYHHSPGEV
FFSALPQLLRQGKPATSPIYRQWLLSSKDCTTDATILSRAPRQQQLVELL
RYHPEGLTSSQIKAQLGTYQSSLRSLIAKGWVYSQEKPIYKSIPPSKEHL
RHPLNDAQKAAVTTILASQQKFRPFLLEGVTGSGKTEVYLRAIEEIIAKS
RQALVLIPEIGLTPQMVERFHRYLQTPITVLHSALSDRERLAAWLAAYEG
KTPVVIGTRSAIWAPLPRLGIIIIDEEHDSSFKQQEGFHYHARDLAIMRA
YQTKIPIILGSATPSLESLDNVKRQRYHLLQLLQRAGAAQTPRIQLLDVR
SRPLEFGLSPPLLAALRYHLGQGNQALLFLNRRGFAPTLICHECGFAVPC
HRCDAYMTVHQHTNRLRCHHCGAEGSLPTSCPQCRNLLLRPRGLGTEQVE
AGLKHFFPEIEIARVDRDTTRRAGTLNKMLDGIHNGKYRLLIGTQMLAKG
HHYPNITLAGILDADQGLFGSDFRAGEHMSQLILQVIGRTGRGNKPGEVL
IQTHHPEHPLLTALVGHGYRHVAEILLEERRQIGFPPYGYLALLRARAVS
PDSPMKFLEMARSMATAHNLHGVTLLGPVPAPMERRAGRYRAQLLLQGSA
RAPLQRLLTTWIPTLERLQAARKVRWSLDVDPIDLM
>Noc_1191 DNA repair protein RecN
MIRELLVRDLAIITELVLPLETGMTALTGETGAGKSILVDALGLVLGDRG
DVNLIRHGQERAEVSAIFELGANRALLTWLQARDLEQDEECVLRRILSRE
GRSRAYINGRPVPIQTLREVGRQLVDIYGQHAHQSLLRPGVQRSLLDAYG
AHTLLFDEWDRAYRHWRQLSQALETLTQTAGEQAARLELLRYQVEELEAF
RAEPGELARLEQEQQQLAHGAELVQGVELALDLLYQNEQGAIYALLGKVS
RQLAQLGAIDPKLTPLVELLENAAIQIDEAVGGLRHYLDSMDIDPERLRW
VEERLSGFYDLARKHRIAPAELPTLAEHLASELRQMESVDNRLGNLQGEF
ATALQTCRALAADLSKARASAAVKLAQGVTEVMQSLAMPGGCFKVILASL
KEQEFSAHGAEQVQFQVSANPGQPPGPLSKVASGGELSRIGLAIQVLTSQ
WGGVSALIFDEVDVGIGGAVAETVGARLRELAKYRQVLCVTHLPQVAAQA
HAQIRISKSYQKKTAIVELTCLDEKERVEEIARMLGGREITERSRAHAHE
MLEMVRRIE
>Noc_1049 Helix-turn-helix, Fis-type
MTLQAESGGHMSDEPARTTERKLTINEEQRNSPISECLRQALDEYFDRLN
GHDPADLYEIVMKEIEPPLLQTTLKHTGGNQTKAAKFLGMNRSTLRKKLR
QYGISAIG
>Noc_1309 Transposase
MSAAGKIGGRISAARQYYQYPLATCHRPNRTSIEAMTQCDSTTLQRSYTF
RFYPTSVQRQQLAMEFGHARWVWNTCLTWRGRQYRLHDKHVSGVDFSGHL
TKLKKTAAYGWLKEASATTLNQKLRDQDTAFKNFFAGRAKYPRFKKRAHA
QSIRYQLDQRQVAGRYRAGKLLKLPKLGALSLKWSRKPQGIPKMVSVTQD
CAGCYCVSFMCEETLQPLPRKPNGIGIDVGISDVVVTSEGWKSGNPRHLR
TYRRLLTKTQRRLSRKRKGSVRWHRQRVRVAKAHARVSNTRQDWLHKLTT
ALIRQAGFIAMETLNVRGMMANRRIAKALGDVGMHELKRQLEYKAPWYGR
AFRQVDRWAPTSKTCSACGAVQKAMPLKVRQWTCSDCKSVHDRDINAAKN
ILRLATVGRTGSDARGGVHKPEVAYGC
>Noc_1698 Phage integrase
MSHELTPKQAAEMLGYVESESTLTLQHQRYIEAATAENTRRAYRSAIRHF
ERWGGHLPADASMVSAYLLAHAEILNPRTLSLRLTALRYWHQLQEFPDPT
VAPEVRKLFQGIARRQGKPKRQAKAFRLEHLKAMVNHLSAQSNLKAYRDR
ALLLVGFFGAFRRSELVQVHLAHLQWEPEGILITVPRSKTDQSGEGQLKA
LPYGEGELCPVTALNAWLRIAGIESGPLFRRVNRWETLLDAPLHPAGVNL
ILKAVATQVGLGFVSELSSHSLRRSLATTAHRAGASFESIKRQGGWVHDG
TVWEYIEAARHFEENAAAVLLTKRQKQG
>Noc_0554 DNA polymerase A
MNMAENPVLILIDGSSYLFRAFHALPSLTTSKGQPTGAIYGVINMLRKLL
DEYQPQYIAVVFDAKGKTFRHELFEQYKDHRPPMPEELACQIQPLHDLIR
ALGLPLLCVKGVEADDVIGTLARQATAQRLETLISSGDKDLAQLVNPHVS
LVNTMNLSKLDPAGVKAKFNVSPEQIVDYLALVGDTVDNIPGIPGIGPKT
AAKLLCQYHSLDQIMAYASEIKGKMGESLRSHLTQLPLAKELATVRQDLL
LDLGPKDLRCAPPNIPALRELYAALEFKSWLRELLDNENAHSSISNSSTN
SAPAYETVFSEESFENWVARLEKAELFAFDLETNNLDYIEAEIVGLSFAI
QPHEAMYIPLGHEDATAPPQLPREQVLARLKPLLEDPRHGKVGQNLKFDC
NVLANYGIELQGIRHDSMLESYVLDSTATRHNMDSLALKYLQRTTITYEM
VAGKGAKQLPFNQVTIEKAAPYAAEDADISLQLHHCFWPRLQQEEGLRQL
YQELEIPLIPVLSRMERNGVQVNTEQLKAQSDELAARLKRLEQEAFELAG
ESFNLASPKQIQAILYEKLKLPVTRKTPTGQPSTAETVLQELALDYPLPQ
LLLEYRTLSKLKSTYTDRLPLQVNSHTGRVHTSYHQAVTATGRLSSSDPN
LQNIPIRSTEGRRIRQAFIAPPGYRLVAADYSQIELRIMAHLSEDEGLLA
AFEAEEDIHQRTATEIFRTPLEDVTPEQRRSAKAINFGLIYGMSAHGLGR
QLGINRTAAQHYIERYFQRYPGVKAYMENICQQARQKGYVETLFGRRLYL
PEIHSRQTQRRNQAERTAINAPMQGSAADIIKRAMIHADRWLQEQKANAR
MIMQVHDELVFEVAEDKLEATIRAIRENMAAAAQLKVPLIVEIGSGTNWD
EAH
>Noc_0637 Transposase IS3/IS911
MKRSRFTESQIVSILKEADAGAKIKDLCRKHGISDATYYNWKAKYGGMST
SDLRRLKETEAELSQYKKMYAELAHENYALKDLIEKKL
>Noc_0702 putative transposase gene of IS630 family insertion sequence ISY100h
MTGYTQRCNMKRKSFLRLRERYRRRGKRFVYLDESGFEPEVSRRYAYAPK
GRRVYGLISGHRRPRTSLLAARMDEGFEAPFLFEGTCNTAVFNAWLEKEL
CPLLNSNHIVIMDNAPFHKAVSSREIIKKTGAGILFLPPYSPDFNPIEKD
FGNIKKIREYNEHETLENIVAAYQ
>Noc_1500 Transposase IS200-like
MSVQEDYRRGRHSVTRLVVHLVFTTKYRGKVFDGYILGQLREAFESACEK
LDCRILEFDGEEDHVHLLVEYPPKLSISVLVNNLKSTSSRRVRLLNTHLP
NLSKSAALWSRSYFACSAGGATIETLKAYVQSQKTPD
>Noc_2240 Phage integrase
MKPAPQRTKLTKTVVDRLPAPTRGQAFYWDSALPCFGVRVSAGGVKSFVI
QKRIQGREKRITLGKYGHLTLMQARKEAARLLGEIAVGRNPLAEKAQAKL
RAVTLGEALEHYLTSRPLKARTIQGTRHTMGKCFSDWMKRPLTSITRDKV
AARHKQLGTASKSHANLAMRYLRAVFNFAMADYTDNEGRPVIADNPVNRL
SEARTWFRVERRRTVIKSHELKPWMQAVQRLENGAARDYFMLVLLTGLRR
TEALNLRWQNVDLVANTLTVQDTKNHQAHTLPLSDYLTEMLAARLEDTYS
EYVFSTSRGRLSNLRGPLAEVRSYAGISFSIHDLRRTFATVADSLDVPGY
AVKALLNHKAANDVTAGYIVVDTERLRAPMQKITDFMLRAGGLWEGGEVV
ELRQYG
>Noc_0929 putative integrase DNA protein
MIGAIRVNAITTEDILKILSPIWTTKTETAKRVQGRMENILD
>Noc_0546 probable predicted DNA methylase containing a Zn-ribbon
MLAPQQEQTLCLEAPPLKNTPALLERVFPAQKISAEAQKERKAGSGKTLT
ALGSYWKGRKPLILVRALILGSLLPATDDPETDLAIFEQLMALDEASFGR
REPKLSAAQVAARITLPRPWDYFDYSFKDATVEPTKIEELTFPLRAGDIP
GLSLRWKRAIPLADKQTLLAAALKELPYPDKVALCKRPEECDPATLYGPI
WDSVNQHLGRFGVQAHSHEELVAQLGMLRFGHRPRVGDTFCGSGSIPFEA
ARLGCEVYASDLNPIACMLTWGALNIIGASPERRDEIAQAQQAVAAAVNQ
EITALGIEHNSQGDRAKAYLYCLETRCPETGWQVPLAPSWVISKTRQVYA
KLIPNPREKRFEIDIVSGASPEEMAAAEQGTVQQGQMVYTLEGKTYRTSI
KTLRGDYRDAQGVNRNRLRQWEKHDFRPQPEDVFQERLYSIQWITQETLG
KSRQQTYFAPVTEEDRARERQVEQIVAENLASWQEQGLVPDMAIEPGKET
TRLQRERGWRYWHQLFNARQLLISSLFCKHRHPVSAICLLKAADWNNRLC
RWEPYWAKSQQVFYNQALNTFYNYGTRAYDMHMQAYDLPMRRSQTLDVSN
YVEMLDCRSITAVADLWITDPPYGDAVHYHEITEFFIAWLRKNPPAPFNE
WIWDSRRALAIQGASDKFRRDMVEAYQAMTEHMPDNGRQCVMFTHQDSRV
WSDMAAIFWAAGLQVINAWYIATETSSELKKGGYVQGTVILLLGKRPPGQ
RAGFTPRLLPQVRKEVNAQIQDMMHLNARTQEQMGAPVFTDSDLQMAGYA
AALKVLTGYTEINGEEVTRLALRPRRKGEKTVVSEMVQQAAATANSLLVP
EGLPKATWEVISGIQRFYLRMVALETTGASKLDNYQNFAKTFRVDNYQAV
MASLKPNRARLKGAQDFKPRELAGTEIGETLLGQVLVALQELLGEKEPPI
VMDNLREALPDYFQQRPHLQAMAQFLGDQLAQRRPQEARAAEIIASRVRN
ERL
>Noc_2198 Transposase, IS605 OrfB
MKLVANLKLTPTPAQERELRLTLARCNEACNWLSERAWETKTFRQYDLHK
LCYQAVRAKFALSAQVAVRCIAKVAHAYKLDQKTQRAFRKHAAHPYDDRI
LRFVCDEKVSLWLLSGREKIGYVGSDHQRQLLEHRKGEVDLMFVRGQWYL
AAVCDFDDPKLLTPEGMLGVDFGIVNIATDSLGERYCGAKVQAYRERYAK
RRATLQRLGTRAAKRCLRHISGRQKRFQKYENHCISKRIVSTAERSRLGI
GLENLKHIRARVKANKAQRKRLHNWGFAQLRAFIEYKAKRAGVPVVIVDP
RNTSRECPACGRIDKANRPTQSEFRCVECGHSNHADHNAAGNIARRAAVT
QPMFAHKCAPCAVESRQL
>Noc_0018 RecF protein
MMHITHLDIRNFRNLKHIELHPSKGVNILSGANSSGKTSFLEAIYLLGLG
RSFRTVQLISAIQAGMESLRVVAKVKQVGGSHTAGVEFGPAGFRARINKD
TVKKRSQLATQLPLLYMSSYSHVVLDGGPRYRRQWLDWSLFHLEPGFHDL
WWCYQRTLKQRNHVLRVHKPSWQQEINAWNKKLSTYGEQITSLREAILFK
LQDSVSQLFTALAHQPISPVTMEFKQGWARTVRLEEILNESLNYDRAAGY
TRYGPHRAEVAFYVDGKDVREILSRGQQKVFCYSLALSQANLLYRTKEQN
CIFLIDDFTSELDADHRKRLLTLLNKLGMQVFATTIESLGSEIKAHPNIK
EFHVKLGQVEEMV
>Noc_0070 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKADLTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_0924 DNA recombination protein, RecA
MDENRKKALGAALSQIEKQFGKGAVMRLGDASIVREVEVISTGSLGLDIA
LGVGGLPRGRVVEIFGPEASGKTTLALQVAAEAQALGGTAAFVDAEHALD
PQYAERLGVSVEDLLVSQPDTGEQALEIADMLVRSGAVDVVVIDSVAALT
PKAEIEGEMGDSHVGLQARLMSQALRKLTANIKRSNTLVIFINQIRMKIG
VMFGSPETTTGGNALKFYASVRLDIRRVGALKKGDEIVGNETRVKVVKNK
MAPPFKQVSFDILYGSGVSREGEIIDLGVREGFIEKAGAWYSYNGERIGQ
GRDNVRQFLKEHRELAQGIEAHIREKLLPGKAALEEVQEAVT
>Noc_1367 conserved hypothetical protein
MKVVSQKTRISRKLAVNPSMLEEDRPDTIDISVIPGPRGLEDDKFPFEML
SDIAERESWRKEINRPLSHIHKWWAQRLGTVFRAMTIGALVPKGSNIIDL
FYKPVRIKDAIVFDPFMGSGTTIGEALKLGARGIGRDINPVAYFLVKNAL
SIHDRPAILATFRDIERDVVSKVRPLYQATLPDGTVVDVLYYFWVKIVDC
PACAESVDLFSSYIFARHAYPKKFPRAQAVCPTCGAINAVRNDAQKAYCH
TCNRAFNPQIGPASRQKATCPACAHAFLIAKTIRATDRPPAHRLYAKLVL
MPDGAKAYLPATDEDRALYAQTKETLNKFKNAYPIVPIEPGYNTNQALGY
NYRYWHEMFNVRQLLGLSILADRIRQIPDTILCNLFTCLLSGALEFNNMF
ASYKGEGTGAVRHMFAHHILKPERTPLEANLWGTPKSSGSFSTLFEGRIK
RSLDYAENPFELRLSNRSGKRISEKVFGLSEKIGFSIADSFSSFAAGKRV
YLSCADSSATDLPEHSVDAVLTDPPFFDNVHYSQLADFFHVWQRHILGSN
GYRQDYTTRSRNEVQSAEVNAFTDRLTAVWIEVHRILKDDGILAFTYHHS
RPEGWRSVLHALMAAGFGITAAHPMKAEMSVAMPKHQAKEPINLDIIIVC
RKRSQLQRHCWNGDLWETAMPIAAEQIRRLREGGRRLSRNDVRVIVMAQI
LRRLSVSHTVETALTLLDACSAETEALIEQLYTADKERTISNKMKE
>Noc_0987 conserved hypothetical protein
MLKLPIPLICEDDVFAVLGAGALLLTINNRLARELQHRYDRVQQVKGLTV
WETPQILPWSVWLQRCYDYQTLTLTETDNSSPALLSPLQEQSLWERVIYD
SPYSGALLQVPATVRTAQEAWRLWHAWRLPLAGQSLFLTEDTQAFLEWAQ
VFEDYCRVDHWLDNARLPDAVGGMLESGQIPLPGTVILAGFDEYTPQQQE
LLAVLERQGVLLQVFANQGGSQQTRRVALADTIEEITVAARWARHRLEHS
PGEKIGVIVPELEFLRVQVARIFDDILHPEAVLPGRGRIERAYNLSLAQP
LADNPLVHTALLILELSKGELSMVEMGAFLRSPFVGAAEQEFSHRALLDA
YLRKTREERVSLERLWKAAITAREEDANHRCPALGERLQQFKVEVDSLPA
RQPPSGWAQSFTCWLQLLGWPGERPLDSEEYQAVSAWHKSIQSFSSLDRV
VPSLKKNVAIGKFRHLLVETLFQPENPIVPVQVMGVLEAAGEQFDAAWML
GLHDGIWPTAPRPNPLLPIELQRHYRLPHASAERELAYTRVVTERLLASA
PVVIVSHPRREGDRDLRPSPLIAELASVLPERLQLASVESYVNEIQQTGR
METLVDAQGPPLEAGAQVGGGTGLLKAQAACPFRAFAEYRLGAKGLEEPS
VGLESLDQGILIHVALQYLWEKLQNQHTLLSCSAEELHGLIAEAAKQAIA
TQTAVRPRIFTERFTAIEQERLEQLLLEWLERDKQRPPFAVLHQERSQPL
NLGGLSLDTRADRIDQLESGERVIVDYKTGRSNPRHWFGERPEEPQLPLY
CIAHEAPLAAVLFAQVRRGEMKYLGVTKEEGSMPEVAVFTRVAGGLDSWE
ELLARWFQVLHALAVEVVEGYAAVAPRDANSCDYCALPGLCRIKELGGGA
AKNKKERDRND
>Noc_0680 hypothetical protein
MAMYNELLWIFTQLPDDYVVLDTETTGLPEENGLPDIVTLGLTAVKNREI
SESVEFETRLQRRILEEAQSIHGITNIQTARFESFDSRWHQITDYLKDER
STHRHSQCQF
>Noc_0519 Transposase IS200-like
MTQYRRQCFTQAMLQRLEVICQEQCAKWGIALDEFGGEADHVHLLLDMHP
NIMPSKFINSLETVTSRLIRKEFNEYLLNFYWKPVLWTRAYCLITAGAVP
LEVLREYIKKQERAKD
>Noc_0386 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_0044 DNA primase
MPMGERIPQEFIDELVARTDIVELIDSRVPLRKASHNYVACCPFHNEKTP
SFTVSPQKQFYYCFGCSVHGTAIGFLMAFDRLSFIEAVEELAQRAGMMVP
QSSKQQDYYNRHQGLYEVLACAAEFYQQQLEASAYQGQVKAYLRERGLSG
PIIAEFGLGFAPPRWNALLHYTRPSLKSYLQAAGLTISKGEDRYYDRFRD
RLIFPIHDYRGRVIGFGGRLLGDGSPKYLNSPETALFHKGRELYGLYQVR
KSLHRCDRLLVVEGYMDVLALAEHKIRYAVATLGTATTSDHLTRLFRITP
AVIFCFDGDRAGYQAAWRALETALPLLSQGRQVQFMFLPQGEDPDTMVRA
EGQTAFEARLAEAVPLSDFLLNNLRQKVNLSSVDGCARLVELARPLLARI
PPGVYQDMLLARLAELAQIEQTTLIRHLSPGKKPTAVPLRRLEQGAASPT
RRAVAILLQRPKMIQWVDKNLSLRGLEGAGAELLQKLVDLLQNNPHLNTA
ALLERWRDSEMGRYLEQLAGWELLLTDEDMVLELQAALERLQVQGAEQRI
ITLSNQPSLTGAEQRELLALLAEK
>Noc_2996 hypothetical protein
MPKVSNIIQIRFILGIFFILAVLYGLSSSVQAGLLYRYTSEHGTQMLSDN
LPPEAVQGGYEVINNQTMVVVKRVPPAKTQKQLVEEARLAQIEAEKQRRL
KEQANYDRTLLATFGSETDLLRIRNSQIEAIEGLIELIQGKIRTLQEALI
AHQNQAAHLERNGQLLPQQLLTNIQTVKDRLMNNRRYIVKKRQEQETIRQ
KFIKDLERFRELTQPRINTIGSSARAGLRDR
>Noc_0079 DNA-3-methyladenine glycosylase II
MTDLLPPRFYARDALEVAADLLGASLCREQVVLRITEVEAYRWPEDTANH
GRHGQTLRNEPLWGPPGRVYLYLCYGIHHLLNLVTGEEGQAAAVLIRACE
PVAGLDLIQRRRRGKIKPGLLTGPGKVGAALGLDLSWNHHPLYEPGGLEV
RRGTPVAALLAGPRVGIAYAHPEHRDAPWRLAIPDNPWVSCRSQLQPRQQ
N
>Noc_2751 Protein of unknown function DUF48
MPILPSHRQGLYYLEHCRVMAKDERVVYACQEGAFTKFFAIPPANTNVIL
LGSGTSLTQAAARLLASEQVMVAFVGGGGSPLFLASQNEYRPTEYCQAWM
RLWQDNDQRLKVAKTFQRNRAEFLMQQWPKLAEPKPHKASLEKLAERYLA
DIELAGDNGTILAQEAKFAKKLYKFWANCTETENFTRDPGKRDFNDPFNS
YLDHGNYLVYGIAAAVLWVLGIPHSLPVIHGTTRRGALVFDVADIIKDTC
VMPIAFQHAAAGRSDQEMRQACIAWLDESHAMTFLFQSIKRVAQL
>Noc_2874 Transposase
MMRCSIDLRKRVIDFVRGGGSKAEAARRFQVGRASIYRWLSQDDALCYER
PGPRRSHKLDWEALRVHVEDKAALTYKERARHFGVSYYCIWHAMHKMGLT
RKKNDGVHAAL
>Noc_0784 hypothetical protein
MRALMGYGAVLGVGWAMLALFLLGGCTGLKPGSPIPVYELSSPAYKLSVP
SPNSLEPEPGDSELLLQYYRGLHALSEAELQRELEQAWQTTAKEPTAFDR
LQLILLLSLPEVPFQDLEQARAMLRSFLKTELEGAKEYEGAKGLYDLALF
LQGFLMEEAQQKRRYRLLQEQLEQKQEQVKRLRSGLKYLDGRRKQEQEWA
HSLEQQLENERGRAETLEQKLEALKTIEKRLEYRNQSQENLQLPEQKNES
ND
>Noc_1335 hypothetical protein
MILPGAKLQWQFKAFYLYGAVEPLSGERFFLECSHHYSPELNPIERLWQH
MKEQLSWVLFSTLDSLQQSVTDILHELTPRIIRSLIAYPFILSAFKYLEK
WY