Gene list
Applied filters:
COG category: Replication, recombination and repair
Organism: Helicobacter pylori J99, J99
Gene type: CDS
Number of genes found: 104
# Helicobacter pylori J99, J99 >gid:20869 M.HpyI TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MNYIGSKYKLIPFIKENIHAVAGHDLSGAIFCDLFAGTGIVGRTFKKAVN KVISNDLEYYSFVLNQNYIGNIQEIPNKEELINKINSVALKKGFIYSHYS LGGSSRQYFSETNAQKIDAMRLKIEELKLSQNIDNHSYYFLLASLLESAD KVANTASVYGAFLKRLKKSAQKELILKGAHFDVSLNANEVYQQDSNDLIG KISGDILYLDPPYNARQYGANYHLLNTIAAYTPFTPKGKTDLPSYQKSSF CSRFQILNAFENLIKKARFKYIFLSYNNEGLMSETEIKNILKKYGAYSLV TKTYMRFKADNKRAHKAVHTKECLHVLIK >gid:20227 cagH cag island protein MAGTQAIYESSSAGFLSQVSSIISSTSGVAGPFAGIVAGAMTAAIIPIVV GFTNPQMTAIMTQYNQSIAEAVSVPMKAANQQYNQLYQGFNDQSMAVGNN ILNISKLTGEFNAQGNTQSAQISAVNSQIASILASNTTPKNPSAIEAYAT NQIAVPSVPTTVEMMSGILGNITSAAPKYALALQEQLRSQASNSSMNDTA DSLDSCTALGALVGSSKVFFSCMQISMTPMSVSMPTVYAKYQAVATKALT SGVNPMTTPACPIGDKVLAVYCYAEKVAEILREYYIEFVKNNTNLLQNAS QMILNQSGLATSTYDTQAISNISSLYNYNIVANKSFLKSHLTYLDYIKDK LKGQKDSYLTERVQTKIIVK >gid:20226 cagI cag island protein MKCFLSIFSFLTFCGLSLNGAGAVITLEPALKAIQADAQAKQKTAQAELK AIEAQSNAKEKAIQAQIEGELRTQLATMSAMLKGANGVINGVNSMTGGFF AGSDILLGVMEGYSSALSALGGNVKMIVEKQKINTQTEIQNMQIALQKNN EIIKLKMNQQNALLEALKNSFEPSVTLKTQMEMLSQALGSSSDNAQYIAY NTIGIKAFEETLKGFETWLKTAMQKATLIDYNSLTGQALFQSTIYAPALS FFSSMGAPFGIIETFTLAPTKCPYLDGLKISACLMEQVIQNYRMIVALIQ NKLSDADFQNIAYLNGINGEIKTLKGSVDLNALIEVAILNAENHLNYIEN LEKKADLWEEQLKLERETTARNIASSKVIVK >gid:20220 cagS cag island protein MSNNMRKLFSMIANSKDKKEKLIESLQENELLNTDEKKKIIDQIKTMHDF FKQMHTNKGALDKVLRNYMKDYRAVIKSIGVDKFKKVYRLLESETMELLH AIAENPNFLFSKFDRSILGIFLPFFSKPIMFKMSIREMDSQIELYGTKLP PLKLFVMTDEEVNFYANLKTIEQYNDYVRDLLMKFDLEKYMEEKGVQNA >gid:19973 deaD ATP-DEPENDENT RNA HELICASE DEAD MELNQPPLPTEIDDDAYHKPSFNDLGLKESVLKSVYEAGFTSPSPIQEKA IPAVLQGRDVIAQAQTGTGKTAAFALPIINNLKNNHTIKALVITPTRELA MQISDEIFKLGKHTRTKTVCVYGGQSVKKQCEFIKKNPQVMIATPGRLLD HLKNERIHKFVPKVVVLDESDEMLDMGFLDDIEEIFDYLPSEAQILLFSA TMPEPIKRLADKILENPIKIHIAPSNITNTDITQRFYVINEHERAEAIMR LLDTQAPEKSIVFTRTKKEADELHQFLASKNYKSTALHGDMDQRDRRSSI MAFKKNDADVLVATDVASRGLDISGVSHVFSYHLPLNTESYIHRIGRTGR AGKKGMAITLVTPLEYKELLRMQKEIDSEIELFEIPTINENQIIKTLHDA 
KVSEGIISLYEQLTEIFEPSQLVLKLLSLQFETSKIGLNQQEIDAIQNPK EKTPKPPNKKTQHEPAHSFKKSHHRDRHPKTNRYSKKHKRR >gid:21155 dnaA CHROMOSOMAL REPLICATION INITIATOR PROTEIN MDTNNNIEKEILALVKQNPKVSLIEYENYLSQLKYNPNASKSDIAFFYAP NKFLCTTITAKYGALLKEILSQNKVGMHLAHSVDVRIEVAPKIQVNAQSN INYKATKTSVKDSYTFENFVVGSCNNTVYEIAKKVAQSDTPPYNPVLFYG GTGLGKTHILNAIGNHALEKHKKVVLVTSEDFLTDFLKHLDNKNMDSFKK KYRHCDFFLLDDAQFLQGKPKLEEEFFHTFNELHANSKQIVLISDRSPKN IAGLEDRLKSRFEWGITAKVMPPDLETKLSIVKQKCQLNKITLPEEVMEY IAQHISDNIRQMEGAIIKISVNANLMNATIDLNLAKTVLEDLQKDHAEGS SLENILLAVAQSLNLKSSEIKVSSRQKNVALARKLVVYFARLYTPNPTLS LAQFLDLKDHSSISKMYSSVKKMLEEEKSPFILSLREEIKNRLNELNDKK TAFNSSE >gid:21018 dnaB REPLICATIVE DNA HELICASE MDHLKHLQQLQNIERIVLSGIVLANHKIEEIHSVLEPSDFYYPPHGLFFE IALKLHEVNCPIDENFIRQKMPKDKQISEDDLVAIFAASPIDNIEAYVEE IKNASIKRKLFTLANTIREQALESAQKSSDILNAVEREVYALLNGSTIEG FRGIKEVLESTMNLITENQRKGSLKVTGIPTGFVQLDNYTSGFNQGSLVI LGARPSMGKTSLMMNMVLSALNDDRGVAVFSLEMSAEQLALRALSDLTSI NMHDLESARLDDDQWENLAKCFDHLSQKKLFFYDKSYVRMDQIRLQLRKL KSQHKELGIAFIDYLQLMSGNKATKERHEQIAEISRELKTLARELEIPII ALVQLNRSLENRDDKRPILSDIKDSGGIEQDADIVLFLYRGYIYQMRAED NKIDKLKKEGKVEEAQELHLKVNEERRIHKQNGSIEEAEIIVAKNRNGAT GTVYTRFNAPFTRYEDMPVDSHLEEGQETKFEMPTT >gid:21091 dnaE DNA POLYMERASE III, ALPHA CHAIN MKENKAFTHLHLHTEYSLLDGANKIKILAKRIKELGMKSVSVTDHGNMFG AIDFYTSMKKEGIKPIIGMEAYIHNDDNLSSKETKQRFHLCLFAKNQEGY ENLMFLSSMAYLEGFYYFPRINKKLLREHSKGIIASSACLQGEVNYHLNT NNERNRKYGAKGYDEAKRIACEYQEIFEDDFYLEIMRHGILDQRFIDEQV IKMSLETGLKIIATNDTHYTMPNDAKAQEVAMCVAMGKTLNDKGRLKHSV HEFYIKSPEEMAKLFADIPEALENTQEIADKCVLEIDLKDDKKNPPTPPS FKFTKAYAQNEGLSFEDDASYFAHKAREGLRERLILVPEEKHEQYKERLE KEIEVITNMKFPGYMLIVWDFIRYAKEMGIPVGPGRGSAAGSLVAFALKI TDIDPLKYDLLFERFLNPERVSMPDIDTDFCQRRRKEIIEYMIEKYGKYN VAQVITFNKMLAKGVIRDVARVLDMPYKEADDFAKLIPNRLGITLKGYEK NGEFIEGAWELEPKIKELVESNEVAKQVWEYSLNLENLNRNAGVHAAALV VDSQKELWHKTPLFASEKTGGIVTQYSMKYLEPVDLIKFDFLGLKTLTVI DDALKIIKTQHNIDVDFLSLDMDDPKVYKTIQSGDTVGIFQIESGMFQGL NKRLRPSSFEDIIAIIALGRPGPMESGMVDDFVNRKHGVEPIAYAFKELE PILKPTYGTIVYQEQVMQIVQTIGGFSLGEADLIRRAMGKKDAQIMADNK 
AKFVEGAKNLGHDGQKAANLWDLIVKFAGYGFNKSHSAAYAMITFQTAYL KTYYKHEFMAAMLTSESNKIESVARYIDEVRALEIEVMPPHINSSMQDFS VAEFKNQKGELEKKIVFGLGAVKGVGGEPIKNIIEERAKGDYKSLEDFIS RVDFSKLTKKSLEPLVKSGSLDNLGYTRKTMLANLDLICDAGRAKDKANE MMQGGNSLFGAMEGGIKEQVVLDMVDLGEHDAKTLLECEYETLGIHVSGN PLDEFKEEIKGFKNLVKSIDIEELEIGSQAYLLGKIMEVKKKIGKRSGKP YGTADILDRYGKFELMLFEKQLNALEELDINKPLVFKCKIEEQEEVARLR LFEILDLESAREVKIPKARYKDPEKQKEDVREIPPMEMLASSSCSLAIVL ENDVKKEFLRQIKESALKHQGKRPLYLIIKDKDKQFKIQSDLMVNEKIKD DFKGLEWRDLA >gid:19752 dnaG DNA PRIMASE MILKSSIDRLLQTIDIVEVISSYVDLRKSGSNYMACCPFHEERSASFSVN QVKGFYYCFGCGASGDSIKFVMAFEKLSFVEALEKLAHRFNIALEYDKGV YYDHKEDYHLLEMVSSLYQEELFNAPFFLNYLQKRGLSMESIKAFKLGLC TNKIDYGIENKGLNKDKLIELGVLGKSDKEDKTYLRFLDRIMFPIYSPSA QVVGFGGRTLKEKAAKYINSPQNKLFDKSSLLYGYHLAKEHIYKQKQVIV TEGYLDVILLHQAGFKNAIATLGTALTPSHLPLLKKGDPEILLSYDGDKA GRNAAYKASLMLAKEQRKGGVILFENNLDPADMIANHQIETLKNWLSRPI AFIEFVLRHMAGSYLLDDPLEKDKALKEMLGFLKNFSLLLQNEYKPLIAT LLQAPLHVLGIREPVSFQPFYPKTEKPNRPQKFAHVSSMPSLEFLEKLVI RYLLEDRSLLDLAVGYIHSGVFLHKKQEFDALCQEKLDDPKLVALLLDAN LPLKKGGFEKELRLLILRYFERQLKEIPKSPLSFSEKMIFLKRARQAIMK LKQGELVAI >gid:20190 dnaN DNA POLYMERASE III, BETA CHAIN MKISVSKNDLENTLRYLQAFLDKKDASSIASHIHLEVIKEKLFLKASDSD IGLKSYISTQSTDKEGVGTINGKKFLDIISCLKDSNIVLETKDDSLVIKQ NKSSFKLPMFDADEFPEFPVIDPKVSLEINAPFLVDAFKKIAPVIEQTSH KRELAGVLMQFNQKHQTLSVVGTDTKRLSYTQLEKISIHSTEEDISCILP KRALLEILKLFYENFSFKSDGMLAVVENETHAFFTKLIDGNYPDYQKILP KEYTSSFTLGKEEFKEGIKLCSSLSSTIKLTLEKNNALFESLDSEHSETA KTSVEIEKGLDIEKAFHLGVNAKFFLEALNALGTTQFVLKCNEPSSPFLI QEPLDEKQSHLNAKISTLMMPITL >gid:20393 dnaX DNA POLYMERASE III SUBUNITS GAMMA AND TAU MQVLALKYRPKHFSELVGQESVAKTLSLALDNQRLANAYLFSGLRGSGKT SSSRIFARALMCKTGPKAVPCDTCIQCQSALNNHHIDIIEMDGASNRGID DVRNLIEQTRYKPSFGRYKIFIIDEVHMFTTEAFNALLKTLEEPPSHVKF LLATTDALKLPATILSRTQHFRFKKIPENSVISHLKTILEKEQVSYESSA LEKLAHSGQGSLRDTITLLEQAINYCDNAITESKVAEMLGAIDRSVLEDF FQSLINQDEARLQERYAILENYETEGVLEEMMLFLKAKLLSPDSYSILLI ERFFKIIMSSLSLLKEGANASFVLLLLKMKFKEALKLKALDDAIVELEQT PFNQSPSISYNAPKQEPKSTERIEGREKLEKRERIETPQTPMLSAKDRIF 
HNLFKQVQTLVYERNYELGAVFEKNIRFIDFDSQTKTLTWESLATDKDKE LLRERFKIVKSIVDGVFGKGENIKIALKHHLENKSAPEETKEVKEFKFPP LKPKLTTETTAEMQEKETKEAVGKALQTKENDTKEVQENETKETKEAQPK EAPTALQEFMANHSNLIEEIKSEFEIKSVELL >gid:21153 exoA EXODEOXYRIBONUCLEASE MKLISWNVNGLRACMTKGFMDFFNSVDADVFCIQESKMQQEQNTFEFKGY FDFWNCAIKKGYSGVVTFTKKEPLSVSYGIDIKEHDKEGRVVTCEFESFY LVNVYTPNSQQALSRLSYRMSWEVEFKKFLKALELKKPVIVCGDLNVAHN EIDLENPKTNRKNAGFSDEEREKFSELLNAGFIDTFRYFYPNKEKAYTWW SYMQQARDKDIGWRIDYFLCSNPLKTRLKDALIYKDILGSDHCPVGLELV >gid:20379 gyrA DNA GYRASE SUBUNIT A MQDHLVNETKNIVEVGIDSSIEESYLAYSMSVIIGRALPDARDGLKPVHR RILYAMHELGLTSKVAYKKSARIVGDVIGKYHPHGDTAVYDALVRMAQDF SMRLELVDGQGNFGSIDGDNAAAMRYTEARMTKASEEILRDIDKDTIDFV PNYDDTLKEPDILPSRLPNLLVNGANGIAVGMATSIPPHRIDEIIDALAH VLGNPNAELDKILEFVKGPDFPTGGIIYGKAGIVEAYKTGRGRVKVRAKV HVEKTKNKEIIVLGEMPFQTNKAKLVEQISDLAREKQIEGISEVRDESDR EGIRVVIELKRDAMSEIVLNHLYKLTTMETTFSIILLAIYNKEPKIFTLL ELLRLFLNHRKTIIIRRTIFELEKAKARAHILEGYLIALDNIDEIVRLIK TSPSPEAAKNALIERFSLSEIQSKAILEMRLQRLTGLERDKIKEEYQNLL ELIDDLNGILKSEDRLNEVVKTELLEVKEQFSSPRRTEIQESYESIDTED LIANEPMVVSMSYKGYVKRVDLKAYERQNRGGKGKLSGSTYEDDFIENFF VANTHDILLFITNKGQLYHLKVYKIPEASRIAMGKAIVNLISLAPNEKIM ATLSTKDFSDERSLAFFTKNGVVKRTNLSEFGGNRSYSGIRAIVLDEGDE LVGAKVVDKNAKHLLIASYLGMFIKFPLEDVREIGRTTRGVMGIRLNEND FVVGAVVISDDSNKLLSVSENGLGKQTLAEAYREQSRGGKGVIGMKLTQK TGNLVSVISVDDENLNLMILTASAKMIRVSIKDIRETGRNASGVKLINTA DKVVYVNSCPKEEEPENLETSSVQNLFE >gid:20191 gyrB DNA GYRASE SUBUNIT B MQNYQSHSIKVLKGLEGVRKRPGMYIGDTNVGGLHHMVYEVVDNAVDESM AGFCDTINITLTEEGSCIVEDNGRGIPVDIHPTEKIPACTVVLTILHAGG KFDNDTYKVSGGLHGVGVSVVNALSKRLIMTIKKEGQIYRQEFEKGIPIS ELEIIGKTKSAKESGTTIEFFPDESVMEVVEFQAGILQKRFKEMAYLNDG LKISFKEEKTQLQETYFYKDGLKQFVKDSAKKELLTPIIAFKSMDEETRT SIEVALAYADDYNENTLSFVNNIKTSEGGTHEAGFKMGLSKAILQYIDNN IKTKESRPISEDIKEGLIAVVSLKMSEPLFEGQTKSKLGSSYARALVSKL VYDKIHQFLEENPNEAKIIANKALLAAKAREASKKARELTRKKDNLSVGT LPGKLADCQSKDPLESEIFLVEGDSAGGSAKQGRDRVFQAILPLKGKILN VEKSHLSKILKSEEIKNMITAFGCGIQESFDIERLRYHKIIIMTDADVDG SHIQTLLMTFFYRYLRPLIEQGHVFIAQAPLYKYKKGKTEIYLKDSVALD 
HFLIEHGINSVDIEGIGKNDLMNLLKVARHYRYTLLELEKRYNLLEVLRF LIETKDALSLDMKVLEKSILEKLEGLNYQILRSFATEESLHLHAQTPKGL VEFNLDDNLFKDVLFEEAHYTYQKLMEYNLDFLENKDILAFLEEVENYAK KGANIQRYKGLGEMNPNDLWETTMHKENRSLIKLKIEDLEKTDAVFSLCM GDEVEPRRAFIQAHAKDVKQLDV >gid:20890 holB putative DNA poymerase III subunit delta' MKNSNRLIYTDNLEESLEEAASLFKHHIKFYTEIIEKDKKVIKTFNKDFK IEHAKEVISKAHLKHSELNAFLIAAPSYGVEAQNALLKILEEPPNNVCFI MFAKSPNHVLATIKSRLIKEDKRQKIPLKPLDLDLSKLDLKDIYAFLKNL DKENFDSRENQRERIESLLESIHRHKIPLNEQELQAFDLAIKANSSYYKL SYNLLPLLLSLLSKKKTP >gid:19785 jhp0043 putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MIQIHHANAFEIIKDFHQQNLKVDAIITDPPYNISVKNNFSTLKSAKRQG IDFGEWDKNFKLLEWIARYASLINPNGCMVIFCSYRFISYIADFLEENGF VVKDFIQWVKNNPMPRNIHRRYVQDTEFALWAVKKKAKWVFNKPKNEKYL RPLILKSPVVSGIERVKHPTQKSLTLMEKIISTHTNPNDTVLDPFMGSGT TGLACKNLKRNFIGIESEKEYFQIAQKRLS >gid:19786 jhp0044 putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MLVSRFLNAIDPFNLGVLLSRFQIKNGCIYGVCSYKVSKFTPGYEESKAR VLNALNILSKHQIWQSNQESVTKVKGTFVFILENDLHLDENSFYKKLLNL IIDNDFFNRSHLVTPSNGTNSHPELHRSITPREAARIQSFSDDYIFYGNK TSVCKQIGNAVPPLLALALGKAILKSARNDTNPSR >gid:19787 jhp0045 putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MQNLLIQAENAIALLFLLNDKNLKGKIDLIYIDPPFATNNHFTITNGRAT TISNSKNGDIAYSDKVVGMDFMEFLKQRLVLLKELLSEQGSIYVHTDYKI GHYVKVMLDEIFGIQNFRNEITRIKCNPKNFKRIGYGNIKDMILFYSKGK NPIFNEPKIPYTPQDLEKRFPKIDKDKRRYTTVPIHAPGEVESGECSKAF KGMLPPKGRHWRTDIATLERWDKEGLIEYSNNNNPRKKIYALEQVGKRVQ DIWEFKDPQYPSYPTEKNAQLLDLIIKTSSNKDSIVLDCFCGSGTTLKSA FLLQRKFIGIDNSDLAIQACKNKLETITKDLFVSQNFYDFLVF >gid:19793 jhp0051 putative MEVKDRLNFNFVATTHSAMDLIASVLSDSKYYLESFYNQASQELGDKRSD KGEKLAELFDLLFEYIKDSKFERLKEPSAYDYSCKKLYPEQNTSQKMRRV VLRGYKHNDKMYHTIVDMGS >gid:19799 jhp0057 putative MSRVQMDTEEVREFVGHLERFKELLNEEVNSLNGHFHNLESWQDARRDKF SEVLDNLKGTFNEFNEAAQEQIAWLKERIRVLEEDY >gid:19802 jhp0060 putative MLFSHKVFLEGCTNELRRICDSFVEGAMQDDLGQKLKSEALENMLKIAHD LENLEQETQYEMRKINEQLEEAKRLEKQVDMQDRHSQSEIDRLMREAKEH EREAKRRYGEYLKDKND >gid:19826 jhp0085 TYPE II DNA MODIFICATION ENZYME 
(METHYLTRANSFERASE) MIIAHSNEIAHPIFKSADKLFTLYQGDCNEVLPQFENAFDLIFADPPYFL SNDGLSIQSGKIVSVNKGDWDKENGINDIDEFNYQWINNAKKALKNTGSL LISGTYHNLFSLGRILQKLDFKILNLITWQKTNPPPNFSCRYLTHSAEQI IWARKSYKHKHVFNYEILKKINNNKQMRDVWNFPAIAPWEKANGKHPTQK PLALLVRLLLMASDDNSLIGDPFSGSSTTGIAANLLKREFIGIEKESRFI KISMNRKLELDARHQEIRSKIKDLNFQ >gid:19926 jhp0185 putative MQFEMRKIAFNAPKAFSLEHEGVVLEGEIVRVGAKLFRLKARLKGELMLI CDTSGKEFKKSLDESLVLHISDGLWDTQSQSLDFDNLDVIESFNGFIDLS EILRSEVESIKLDYHYAD >gid:19968 jhp0227 putative MRDYSELEIFEGNPLDKWNDIIFHASKKLSKKELERLLELLALLETFIEK EDLEERFESFAKALRMDEELQQKIESRKTDIVIQSMANILSGNE >gid:19985 jhp0244 TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MHKVFIMEALECLKRIEKESIQTIYIDPPYNTKSSNFEYEDAHADYEKWI EEHLILAKAVLKQSGCIFISMDDNKMAEVKIIANEIFGTRNFLGTFITKQ ATRSNAKHINITHEYVLSYAKNKAFAPGFKILRTLLPIYAKALKDLMRTI KNVFKQKGQAQAQLILKEQIKELSQKEHFNFLKNYNLVDEKGEIYFAKDL STPSNPRSVAIQEINLFLEPLKSRGWSSDEKLKELYYQNRLIFKNNRPYE KYYLKESQDNCLSVLDFYSRQGTKDLEKLGLKGLFKTPKPVALIKYLLLC STPKDSIILDFFAGSGTTAQAVIEVNKDYYLNWSFYLCQKEEKIKNNPQA VSILKNKGYKNTISDIMLLRLEKIIKRSEYEILKTKSILF >gid:19989 jhp0248 TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MKPYFSLEKLDLYHGDASVLETFEKGFYDLCVTSPPYNLSIEYQGSNDFR AYDDYLNWCKNWLKNCYFWGKEQARLCLNVPLDTNKHGKQSLGADITIVA KECGWKYQNTIIWNESNISRRTAWGSWLQASAPYAIAPVELIVVFYKNEY KRKKQTSTMSREEFLLYTNGLWNFSGESKKRLKHPAPFPRELPRRCIQLF SFLEDTIFDPFSGSGTTILEANALGRFSVGLEIEKEYCELSKKRILESLS LV >gid:20055 jhp0316 putative MKSNFQYSTLENIPKAFDILKDPPKKLYCVGDIKLLEAPLKVAIIGTRRP TPYSKQHTITLARELAKNGAVIVSGGALGVDIIAQENALPKTIMLSPCSL DFIYPTNNHKVIQEIAQNGLILSEYEKDFMPIKGSFLARNRLVIALSDAV IIPQADLKSGSMSSARLAQKYQKPLFVLPQRLNESDGTNELLEKGQAQGI FNIQNFINTLLKDYHLKEMPEMKDEFLEYCAKNPSYEEAYLKFGDKLLEY ELLGKIKRINHLVVLA >gid:20056 jhp0317 putative MILACDVGLKRIGIAALLNGVILPLEAILRHNRNQASRDLSDLLRKKDIQ VLVVGKPNESYADTHARIEHFIKLVDFKGEIVFINEDNSSVEAYENLEHL GKKNKRIATKDGRLDSLSACRILERYCQQVLKKG >gid:20075 jhp0336 putative MNLEKLFLEKAPLFVFSSTRRLKHFYLEQGEGFLPNAMSMGSFFEQAFYI PNQKKIPKSARQILMIDTIKAIAKEKKFILEGLLLFENSFLGYLESTSFL 
FDLFDELSSACIKLNELSFKDIYLDYEKHLEVLEMIYDRYVKKLEELGFY DKIMQKKPTILKEFFEHFSSIEWHLDGFMSVFERQCLLEVAELVPITLYL SCDKYNQKFLEFLNLKLETDCDYSIDFKTQKILSQTFNDQKIEPKLYANS SYLKQGALVLQTIEEYLQENNDPNKMAIITPNADFLPFLKLLDRNNNLNF AMGLGAKNSPYYTELVKILEDLQTSDCNLSGSALLDLENITLALLEQQSS KEKAPLKEAHSQIMHQYHLLKDTLKNYSLKDLLHLYLQEFEANFRLDDSS GGKIRVMDTLETRGMQFDKVVIADFNETCVPNLKDCDLFLNSALRQSLNL PTLLDKKNLQKHYYYQLFKNSKEVALSYIESETLKASNMLLELDLHTEPI KDAYTLFETSPIKEYQEEEIKAAIPKDFSFSASSLNAFLTCKRRFYYHYI KRFKETPKDESNSAVGSLIHELLKEAYEKDKNPHALEERLIWLSETRENV TPKERLDTLVALKKIQAFYKKERERFNAEITILDLEKSFETIIQGVIFKG RIDRIDKTADNEIILLDYKFKSDLKLDSMSEKQRKGLSPIEIAQISTDYQ MAIYVRALKNLGYKEPIKAFFYDLRKGELLEEDELTLQAKMDHLEFSLIP KLKQEIDFEKTLEVKDCEYCSFKDMCNR >gid:20137 jhp0398 putative MSLTVLLNPKSLEEFLGQEHLIGKDAPLFKALQSKHFPHAFFYGPPGVGK TSLAQIIARSLKRPILSFNATDFKLEDLRLKLKNYQNTLLKPVVFIDETH RLNKTQQEFLLPIMEKNRALILGASTQDPNYSLSHAIRSRSFTFELTPLK KSDLDRLCDKALTLLKKQIEPGAKTYLLNNSAGDARALLNLLDLSAKIEN PITLKTLQSLRPHSLNDGSYNDDTHYNLTSALIKSLRGSDENASMYYLTR LVAGGENPEFIARRLVIFASEDIGNANPNALNLAASCLFSVKQIGYPEAR IILSQCVIYLVCSPKFNTAYKAINQALDCVQKGLFYLIPKHLLPNVKDYL YPHDYNGYVKQDYLEKPLNLVSSQGIGFEKTLLEWLDKIRN >gid:20141 jhp0402 putative MRQAFNKTRSTHSRTLLLDIDCVIPNIVRRLLSNKTLPKRFATYSLQEVG VIFLTTQILSIMRKTRCSKTLFFITRGRESFRYQLCDHYKQKRHQFDEDF RSLLKALKIALVEKYPLKKGAKIQGEHCFEYEADDIISFYKKKDPNNYVI ASMDKDILYSNRGSHFNLKTNAFFNVSQKEAHFFAYYQCVVGDKGDNIKG VKGIGGFNYKDFLNEDAKEHELWEQIIQAFKIKEDLSDSEAKEKALLNMR LVNMHQMTHHGVIKLWEPEFKKAFFPKKPQRPDFKRIS >gid:20156 jhp0418 putative MEIILLIVAAVVLFYFYNTLKEYLKNPLNPKTKTEEYDLKNDPYLLVQSS PLDKFKQTQIGAYMRLLKFLDIQKNALDNALRTLFIHELEQPLNSEQQNL AKELLNEPVDKKENFESLCQEIADHTHGEYTKRLKLVEFLMLLAYADGIL DSKEKELFLDVGAFLQIDNQDFNELYDNFEHFNSIEIPMSLEEAKNLFEI QTHTTMQDLEKKALDLSAPYYHKMNDNKRYSEQDFISLKKIALASQLLEN DLKDS >gid:20171 jhp0433 putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MHANLFNQNASKKDIFLHNLRSNNGRYKRYVKAPLRYGGGKSLAVGLIVE CIPNGVRRVISPFIGGGSVEIACATELGLEVLGFDIFDILVNFYQALLKD KQALYDNLFSLEPNQETYSIIKQELKAHYKKECVLDPLILARDYYFNFNL 
SYGPGFLGWMSKIYTDKQRYLNTLLKIKDFNAPSLKVECSSFEEVLIAYP NDFFYLDPSYVLENSKMFKGIYPMRNFPIHHNGFKHEVLAHMLKRHKGPF ILSYNDCELVRNAYKDFKILEPSWQYTMGQGETRMGKNRLERGDNNHVKQ SHELLIIKE >gid:20173 jhp0435 TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MYKVADIFCGAGGLSYGFSTHPYFELIWANDIDKDAILSYQANHKETQTI LCDIAQLHCHNLPRVPIDILLGGPPCQSYSTLGKRKMDEKANLFKEYLRI LDLVKPKIFVFENVVGLMSMQKGQLFQRICNAFKERGYILEHAILNALDY GVPQVRERVILVGALKSFKQKFYFPKPIKTHFSLKDALGDLPPIQSGENG DALGYLKNADNVFLEFVRNSKELSEHSSPKNNEKLIKIMQTLKDGQSKDD LPESLRPKSGYINTYAKMWWEKPAPTITRNFSTPSSSRCIHPRDSRALSI REGARLQSFPDNYKFYGSANAKRLQIGNAVPPLLSAALAHAVFDFLRGKN V >gid:20179 jhp0441 putative MNYPNLPNSTLEITQQPEVKEITNELLKQLQNALHSNALFTEQVELSLKG IVRILEVLLSLDFFKNANEIDSSLRNSIEWLSNAGESLKTKMKEYERFFS EFNTSMHANEQEVTATLNANTENIKSEIKKLENQLIEETRMLLEQETQKS VKAYNAMMGHQPQNSLKHGLKNFNNPLSKEG >gid:20195 jhp0457 putative MSYFKNIFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIK SLDSVAVLLYEKESDCFVIVKQFRPAIYARNFYFKRDQDQTIDGYTYELC AGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSATGLSGSLQT LYYAEAHEGLKVSKGGGIDTEKIEVLFLERSKALDFIMDFQYAKTTGLSL AILWHLKKFKNV >gid:20287 jhp0549 ENDONUCLEASE III MLDSFEILKALKSLDLLKNAPAWWWPNALKFEALLGAVLTQNTKFEAVLK SLENLKNAFILENDDEINLKKIAYIEFSKLAECVRPSGFYNQKAKRLIDL SKNILKDFQSFENFKQEVTKEWLLDQKGIGKESADAILCYVCAKEVMVVD KYSYLFLKKLGIEIEDYDELQHFFEKGVQENLNSALALYENTISLAQLYA RFHGKIVEFSKQKLELKL >gid:20333 jhp0595 putative MQHSRLQTLRSLYMERLLGETYTDISLIKPQNKPLNKQVHEGIENCNLCK RHQHSKPITGLFNPTSKLAFITLTPMLDSQLHFLNNLKAAMLESIIQKVF NYPLKDCSILSLLKCDSNSLNLEEEINACLPHLTWQLDNSAPKVIIVFGE VLPKRLLNLSKEESFGRIVSLKTKHFLSTHALEDMLKNPTLKKEALAHFK IALQFLNQS >gid:20347 jhp0609 putative MEKLPKKRVSKTKSQKLIHSLTTQKNRAFLKKISANEMLLELEKGAFKKN EAYFISDEEDKNYVLVPDNVISLLAENARKAFEARLRAELERDIITQAPI DFEDVREVSLQLLENLRQKDGNLPNINTLNFVKQIKKEHPNLFFNFDNMF KQPPFNENNFENFDNSDEENF >gid:20355 jhp0617 INTEGRASE-RECOMBINASE PROTEIN (XERCD FAMILY) MKHPLEELKDPVENLLLWIGRFLRYKCTSLSNSQVKDQNKVFECLNEFNH ASINSNQLEKVCKKARNAGLLGINTYALPLLKFYEYAQKLSLKSLKNIDE VMLAEFLSIYTGGLSLATKKNYRIALLGLFSYIDKQNQDKNEKSYIYNIT 
LKNISGVNQSAGNKLPTHLNNEELEKFLESIDKIEMSAKVRARNRLLIKI IVFTGMRSNEALQLKIKDFTLENGCYTILIKGKGDKYRAVMLKAFHIESL LKEWLTERELYPVKNDLLFCNQKGSALTQAYLYKQVERIINFAGLRREKN GAHMLRHSFATLLYQKRHDLILVQEALGHASLNTSRIYTHFDKDRLKEAA SIWEEN >gid:20366 jhp0628 putative MRKGRVMLCVFDIETIPNISLCKEHFQLKEDDALKICEWSFEKQKEKSGS EFLPLYLHEIISIAAVIGDDYGQFIKVGNFGQKHENKEDFASEKELLEDF FKYFNEKQPRLISFNGRGFDIPLLTLKALKYNLTLDAFYSQENKWENYRA RYSEQFHLDLMDSLSHYGSVRGLNLNGVCSMTNIPGKFDVSGDLVHAIYY NPHLSQKEKKGIIDGYCQSDVLNTYWLFLKYEVLKGALNKEQYLGLLNDF LAKFPKEKSYSSVFINALEKEIREFA >gid:20367 jhp0629 putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MPIISKNKPYSFIKNLEFYTIKRIKDMGSHPSQDHLNELLELFKQDLSID LKREIASSIGRQLDDDIIYNFLKQEAFKEHYMEVIYQFLRTALYKSKDMR FAKLCDDLLQHYQNENMQKMKQYYDYRHTKKPPLKIIENIIKKPSLLVGD NAQTLNKIAPSSVNLIFTSPPYYNARIYSDYKNYKDYLSAMSQSLKACFR VLEEGRFIIINVSPIITKRAGREFESVRYPIHFDFHQILIDNGFYFVDEI LWIKPDFSVPNRIGGYLQNKKPLGYKPNCVSESLLVYRKKAPFLLDKNIK IAEKRLKPIKQNHTLFGKKELPIETTNCWYITPKSSKDHPAVFPESLCER VLNYYSFENEVVCDPFAGSGTFGMVAKSMGRIPLLCEQHPKYAQNLIKLG FKEI >gid:20395 jhp0657 putative MRRSLAFCLLALLGLQVLGARDFSQLKNEELLKLAGTLPSNEAIDYRMEV SKRLKALSAEDAKKFRANFSRIARKNLSKMSEEDFKKMREEVRKELEEKT KGLSAEEIKAKGLNVSVCSGDTRKVWCRAVKKKDEHCSPK >gid:20456 jhp0718 putative MPYALRKRFFKRLLLFFLIVCMINLHAKSYLFSPLPPAHQQIIKTEPCSL ECLKDLMLQNQIFSFVSQYDDNNQDESLKTYYKDILNKLNPVFIASQTPA KESYEPKIELAILLPKKVVGRYAILVMNTLLAYLNTRNNDFNIQVFDSDE ESPEKLEETYKEIEKEKFPFIIALLTKEGVENLLQNTTINTPTYVPTVNK TQLENHTELSLSERLYFGGIDYKEQLGMLATFISPNSPVIEYDDDGLIGE RLRQITESLNVEVKHQENISYKQATSFSKNFRKHDAFFKNSTLILNTPTT KSGLILSQIGLLEYKPLKILSTQINFNPSLLLLTQPKDRKNLFIVNALQN SDETLIEYASLLESDLRHDWVNYSSAIGLEMFLNTLDPHFKKSFQESLED NQVRYHNQIYQALGYSFEPIKNESETKKE >gid:20484 jhp0746 putative MPNHQPVKKFKIIGGACKGLGLNLPNISSTRPTKAIVRESFFNTLQTEIN GAHFIEVFSGSASMGLEALSRGAKSAVFFEQNKNAYATLLENISLFKNRL KKEIEIQTFLDDAFKLLPTLRLKNGVLNIIYLDPPFETSGFLGIYEKCFH ALERLLNRSHSKNLFVVFEHESLHEMPKSLATLAIIKQKKFGKTTLTYFQ >gid:20494 jhp0756 putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) 
MTAKIKPNIQSLLNNFYVDSCVNFMQHKLQNESIDMILTSPPYDNLRNYQ GYTFAFENIANEIFRVIKKGGVVVWIVGDKIKNGNKSLTSFRQALYFQQI GFNMHDVMIYAKKNTPFMRSNAYTNAYEYMFVLSKGKPKTFNPLKEPTAR NGMEMLVTNKGADAKNNKILKELKKEKTKNNIWHYAVGLGGSTNDKIAFN HPAIFPEQLALDHILSWSNERDIVFDPMCGSGTTCKMAFLHNRNFIGVDI SKEYIQIAQKRLQQYQQGLFVC >gid:20500 jhp0762 putative MRFLNNKHREKGLKAEEEACGFLKTLGFEMIERNFFSQFGEIDIIALKKG VLHFIEVKSGENFDPIYAITPSKLKKMIKTIRCYLSQKDPNSDFCIDALI VKNGKFELLENITF >gid:20512 jhp0774 DNA-BINDING PROTEIN HU MNKAEFIDLVKKAGKYNSKREAEEAINAFTLAVETALSKGESVELVGFGK FETAEQKGKEGKVPGSDKTYKTEDKRVPKFKPGKILKQKVEEGK >gid:20540 jhp0802 putative MQDELNAYQQEIEDTREVLKKIRLELKQVQEILRKKKSVLKGLKQEICQK KLEKENFRSNKETQNTEEDVIFPKALEEVEVYAKDNQVIMAKPCKRLFNE GLYLQYRSVLRENRLLKNHLSKKDFENSLLKIELRDLHKEIKLYQVQNLL KDK >gid:20570 jhp0832 putative MPNTTNKDYTKYSQKQLFNFLNSIKAKQKRALEKLKEIQTQKQRIKKALQ FKALHLTENGYTIEEEREILARAKDTKNRLCFKSIEDFKKHCENL >gid:20585 jhp0847 putative ATP-DEPENDENT HELICASE MLETLQLNPEQLKAASALQGHNLIIASAGTGKTSTIVGRILHLLNNGIKP EEILLLTFTNKASNEMIARVAKYSKLSSKIEAGTFHAVAYRYLKEHYPNL SLKQPKELRKLLESIVDTKNALTDEDKKPYTSQHLYALYSLYTNALKQED FSAWLSSKNPEHAPYAAFYENILEEFENTKKKHDYIDYNDLLLLFKKAML ERSSPYKEVLCDEFQDTNPLQESILDAINPPSLFCVGDYDQSIYAFNGAD ISIISNFTQKYKNARVFTLTKNYRSSKEILDLANQVIQHNQRIYPKNLEV VKSGKFNKPTLLNYNDNIAQCQDIAKRIVMRKNFKEVAVIFRNNASADQL EAALRSHNVPSKRKGSASFFESKEVALALDICALLFNPKDIMAAIHILSY ISDIGSNTAKDIHEALMLLGNGDLKSALIQPNQEAKIYTKKKEITSMGLF EEIFALENSSRFNSVIDKAFHSHPVLMHPKISLNGAKMLSDFFILYTKAP IHSPSALIKHILESAFFQTFKTRLLKERSKNKDGSYNEFKKLQAQKRFNE KMDLLSSLAKNYQNLGRFLNGTLIGSSEATQGEGVNLLSVHASKGLEFKD VYIIDLMEGRFPNHKLMNTGGGIEEERRLFYVAITRAKENLWLSYAKNEL RENAKPKEHKPSVFLYEAGLLKLDSK >gid:20634 jhp0896 putative MRLNEVIGLFKESVDKVFDRVSAFTWEKYKAKNEDEEDDEANYREFEKIK KMALYFRDYCMFCLDWYELSQEKIQEEYRDCIDYDNKLLQLHYSLENLQT LRELKEEADNNYQESLNDEKLQNNLREWRDLKNTPEEENYREFEEIKKMV LYFRDWCMFRLDWYKLRQEEIQKHRDLMDNDNRLLQLDYSLKNLSILKRF KEINEKNYQDHLNNEKLQNDLREWRRSKRR >gid:20664 jhp0926 putative MNNLNNLSDEQINGMIDYLQNILMEMGGLKQEPPQNSNYSHNLEETNAQR 
TQSTQENSESKEALLKAKSGQINYVKTRIIQGLKEKNSPFWDKPEIVANK ERGHNALNGEPYCNLNDMILDMEKNRLGFQSNAWVSLEEAKMLGASKEER DTIFKATQNKEISPVRLMFIKNKEPVPLVDNNGELVIDKNTKKPKHKQFA VIENGQKVFKPAYRDIEPQAQFKFVYNIEMFPSINKEKIKPLNLDKLSNY AYKTRLFHQKDYLQERRDKTNIIYEDLHRDLSPDNRNEALLERMRSYTLL KNEKYNQLANETKQQTQTRSQSQSYYQKKNKASSGIER >gid:20666 jhp0928 putative MERSSDEDLSHQDPSLFIESREQGGTRGVYRSSDQQAVSEESHRERDRIH EHVSRGDGVSARADARANSNGASSPASRMENGARSEEKGDNPSDERGIPQ TPQSPSHQQNSSRDLGLSLSREQPGQTGRLRLFDHGQMGSLFPTDHENQR KRSDNELDRRSDKANENGDKSPRQNGSANQESARSERYGIAQGSSNQSVL LPAQSRLHHAGLSAQNGLRDLEENRDQEGRLLSNLDNLESLLNAIRNNTI ASEPDFRSRLLEAIQNNDPLKDSIVGAQLLKDPTTKIFYDKFQLKISPKK VLEILENRLKKSIETTNETLNAFNVLDSQAIDLNAISNSVGLNPTQESKI TDNSVELNNAQEQTAQEQTTQEQTTQEQTTQEQTTQEQTTQEQTTQEQTT QEQDTQENAPTTIKQETPITPAIPLNPKIDFKPSEEVLIKGAKTRYKANI KAIELLKELQAKQEILKGDYYATLKEQEILAQFSGWGGLESYFKKAQHPE EFKELNALLTKDEFRRAYLSARDAYYTPKLVIDSIYQGLDQLGFNNDNHP KEIFEPSLGTGKFIAHAPSDKNYRFIGTELDPISANLSKFLYPNQVIQNT ALENYQFYQEYDAFVGNPPYGNHKIYSSNDKELSNESIHNYFLGKAIKEL KDDGIGAFVVSSWFMDAKNPKMREHIAKNATFLGAIRLPNSVFKATGAEV TSDIVFFKKGVEKATNQSFTKAMPYYDKILNSLDDDTLFALQNNRFDSFI PSDQLKIVNAVANHFGFKQEKLQRWYEKIDTANFGYSTQDYKIIKDFIDK VGKNSINLNEQTLNEYFIHHPENILGHLSLEKTRYRFETNGEQIYKYDLQ ALEDESLDLSQALKQAIEKLPKDVYQYHKTTLKTDVLIIDSSNERYQEVQ KLIKNLERRELVKWDNLYFQLEQNNEMGIFLKPTKINSKVQDSRLKAYFK IKDALNDLTSAELNPLSSDLELENKRAKLNLVYDEFVKKFGYLNENKNRK DIRQDLYGAKVLGLEKDFEKEITPRSAKMQNIEPRQAQAKKAQIFFERTL NPKKELIITNAKEALIASINQKGGLDLHFIRDHFTTQSLETTIKELLEQK LIYKDHKDNGGYILANDYLSGNVKRKLKEVKEAINQGVEGLEANVKDLEL IIPKDLKATEIMANINSPWIPTQYLEEFLMELSANHYEKQYGDKMTDYQL SNLKEDIKIEHLSGAYEVFVRNNELNELYGIRHKDKPHSYKVPFESLLNK VLNNKDLSVKYAQVDPNDPKKEIFITDEEQSNLARQKAEELKEAFKDWIY KDYSRRTHLEQIYNDTFNNSVLKTYDGSQLELEGFNYHISLRPHQKNAIF RTIQDRAVCLDHQVGAGKTLCAIASCMEQKRMGLVNKTLIAVPNHLTKQW GDEFYKAYPNANVLVVDSKDTTEKERELLFNQIANNNYDAVVIAHTHLEL LSNPRGIIEELKEEELVNAEKNFERQELAYKNNPRETKKPNERAFKNKLD KIRAKYDAILEKQGSHIDISQMGIDNLIVDEAHLFKNLAFETSMEKIAGL GNQQGSNRARDLFIKTRYLHQNDKKIMFLTGTPIANSLSEMYHLQRYLTP 
DVLKERGLEFFDDWAKTYGEVVNDFELDTSAQSYKMVNRFSKFSDVQGLS TMYRAFADIVSNDDILKHNPHFVPKVYGDKPINVVVKRSEEVAQFIGVAL ENGKYNEGSIIDRMQKCEGKKSQKGQDNILSCTTDARKVALDYRLIDPNA KVEKEFSKSYAMAKNIYENYLETHATKGTQLGFIGLSTPKTHSQKVSLEA LDNAHETENKNPLDKAQELLESLSSYDEKGNLIAPSKKELENELKEKEAK SVNLDEEIAKGCSFDVYSDVLRHLVQMGIPQNEIAFIHDAKTEEQKQDLF KKLNRGGVRVLLGSPAKMGVGTNVQERLVAMHELDCPWRPDELLQMEGRG IRQGNILHQNDPENFRMKIYRYATEKTYDSRMWQIIETKSKGIEQFRNAH KLGLNELEDFNMGSSNASEMKAEATGNPLIIEEVKLRAEIKSEESKYKAF NKEHYFNEESLKNNASKLDYLKQELKDLETLQRSVIIPTHTEIKLYDLKN EESKDYELIKVKEVEPLKENASMSEELTHKKLKEQNKQIAEQNKEKLDAI KKQFASNLNTLFVNEEEDYKLLEYKGFVVNAYKTKYQVEFSLSPKDIPNI AYSLAIWFIKTILSTCLALIISALRSSLMGF >gid:20679 jhp0941 INTEGRASE/RECOMBINASE (XERCD FAMILY) MHEQCSISFVGGQGAKRLLYILYKLAFNAKSNKIALDRHYAKMFLQVVAR TLIKNVNILEEQGFIEVIKGKQRYLYVYLKDYRELECLVKSKMAKYVMYL RQFFDYLDRKRRYGFDFTLKNLAFAKTKESLPRHLNDKDLKSFLKTLLDY KPATSFEKRNKCILLIVILGGLRKCEVLNIELKHIQVEEQNYSILIQGKG RKERKAYIKKSLLEPSLNAWISDDYRLKYFNGAYLFKKDKQKSQNSLTLY NFIPLIFKLAQIKHYKQYGTGLHLFRHSFATLIYQETQDLVLTSRALGHS SLLSTKIYIHTTQEHNKKVALVFDSLIENKK >gid:20689 jhp0951 INTEGRASE/RECOMBINASE (XERCD FAMILY) MSDCKMSRVSRELFDNIKSFLHYKFKTMIRIQSVNDLELILKWQDRVLEC QSFIALKELNHKLYNQGVRHTIMMQGLFLFFEYFDNRIKLKSLRNLAEEQ VIDFLFGLVKNRKPSSMAKYVMVLRQFFDYLDRKRNYSFDFELKNLSFAK KEMYLPKHLNKNDFKAFIQALLKYHPKTSFEKRNQCILLLIALGGLRKFE ALDLELKNIALENNHYRLLIKGKNNKERYAYIEKEFLQVPLNAWLSDTKR LKSFKGRFVFKKAKNNTTQKTCSLKGFIAKIFKLSNIDVKSYGLGLHLFR HSFATFIYDETQDLVLTSRALGHSSLLSTKIYIHTTQEHNKKVTLVLKGW LKNEKSE >gid:20780 jhp1042 putative MNYPNLPNSALEISEQPEVKEITNELLKQLQNALRSNAHFSEQVELSLKC IVRILEVLLSLDFFKNANEIDSSLRNSIEWLTNAGESLKLKMKEYERFFS EFNTSMHANEQEVTNTLNANAENIKSEIKKLENQLIETTTRLLTSYQIFL NQARDNANNQITKNKTQSLEAITQAKNNANNEISNNQTQAITNITEAKTN ANNEISNNQTQAITNINEAKESATTQINANKQEAINNITQEKTQATSEIT EAKKTDHYQNIDFFEFE >gid:20782 jhp1044 putative MKLPKALNEATAGAALKYHIKRALERSHLISDFSKNLELSAKNSKFTNNT LKIIEELNNGVKQASEEIKEKAFDFSNEKLTNEQIKELLNNAKIPTSGRD AITFGTNNLNPEMVEFLHKNNKKMIIEKASNKELELLKDANFKHPENIRA SLDHDAIAHILKRHGVNSVNVKNGESPITYEDIANYRYIVNNADAILRTL 
DNEDKEVISAFKQINGYAVVVEQAINKKNELVLKTMYKSNGDYKDNNAYK KFSSTLTLNADAKVNHGLSSHSGATENLTQKPLTSQEDLLKDTENLNETT PKPTHLSPLELANAEKLAKLESEQLQSEQEFLKAKEQELKRKEALKKKLE HERGNAGHIESQTKIEVGEDIPTQVQAQIPKSRVRLNEREIYDLDYAIVK AKDLKPSFTTGGTQKRTDMNEEQIKSIAENFDPKKIFGSGGFEDLPIILH DGQVIAGNHRIQGMLNFTPKSRYIYNKAIKEYYHIDLEPDELLVRVPNKR LDNTEINNLAASSNQGRFNSESDHAIAVLSHYEAKLKELEKKLDADSIYS LKNIVANNLNFDKATHPNVGDSNLALLMFNMPRTKTQGIELLNCWQKAFS NDIKSYEKVKKMFVDNAGSFHNLIHDMNFPNVSLNAYLSDIMDRSFANLK NYQSTSESLKDLSEKFYKTSSLDMFEKSDQRASDISEILGGAIARFARFD DPSKALFEALKSDNIKKGLKEFKIADVTKDMFDPKSKEFKDIDIYDFTHY LLMVNREPNENNPVLKRLIQAVKDMQKEKKKGIKKPKLETPSEWGHHYSE FKGDGLGAINKLLKTKKGFVAGAFYKEGLGDIDLVWGNKDYGLEHILKRR EDQALNNGINEAEAKDYAISVIKTIPEVIDKGVKVERNGRVAIEYQNIRV GLKDNWKGEKSPNHWVITGYEKRLEDSESLYTSPPITKGETLPLNSNKPD PTTNAIKTQEPLYPLELANAEKLAKLETEKAFKAEAVKKLDFNEIKKLID ESPRTGSSMPILGMQNLNAEAVEYIQKNHKRIAVEKIEPSFAKDLKLKYP DDARAVMDYQAINHILKEHKNLAYEDIANYRELSKQANETLKLKDNQNRP VVASFNQINGFFVVVEQVSNAKNELMLKTMYKARGNYKDSLIYKKTLAKS QNSN >gid:20786 jhp1048 putative MVVLHSHLENALNQLKELIDLTERDIRDIKLAKHAEIFERNHQKQLAIQA FEQEKTNIDAQMLSLKNQFPNKEMSELLDEKTSDFLNQMREFLLVLKEKN LIYSRMAFAVSEFYSSLIQQIIPHDTCDYKGSRHVGSHFLRVQA >gid:20788 jhp1050 TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MGILTFMDFCSGIGGGRLGLERCHLKCVGHAEINHEAIRTYELFFKDTHN FGDLMRINPNDLPNFDVLVSGFPCQAFSINGKRKGLEDERGTIIYGLIRI LKVKQPKCFLLENVKGLIHHKQQETFKTIIKALQEAGYTTHYQILNSADF QLAQKRERLYIVGFRKDLKRPFNFPLGLANDYCFKDFLDADNECYLDVSN ATFQRYLRNPYNHNRVFLEDILTLENAVLDTRQSDLRLYFNVFPTLRTSR HGLFYTQKGKIKRLNAVESLLLQGFPRDLIAKIKNNPNFKESHLLSQAGN AMSVNVIAAIAKQMLKAFNNE >gid:20906 jhp1168 putative MYRKDLDNYLKQRLPKAVFLYGEFDFFIHYYIQTISALFKGNNPDTETSL FYASDYEKSQIATLLEQDSLFGGSSLVILKLDFALHKKFKENDINPFLKA LERPSHNRLIIGLYNAKSDTTKYKYTSEIIVKFFQKSPLKDEAICVRFFT PKAWESLKFLQERANFLHLDISGHLLNALFEINNEDLSVSFNDLDKLAVL NAPITLEDIQELSSNAGDMDLQKLILGLFLKKSVLDIYDYLLKEGKKDAD ILRGLERYFYQLFLFFAHIKTTGLMDAKEVLGYAPPKEIVENYAKNALRL KEAGYKRVFEIFRLWHLQSMQGQKELGFLYLTPIQKIINP >gid:21009 jhp1271 TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) 
MDFLKENLNTIIEGDCLEKLKDFPNKSVDFIFADPPYFMQTEGELKRFEG TKFQGVEDHWDKFGSFEEYDTFCLGWLKECQRILKDNGSICVIGSFQNIF RIGFHLQNLGFWILNDIVWHKSNPVPNFAGKRLCNAHETLIWCAKHKNSK VTFNYKTMKYLNNDKQEKSVWQIPICMGNERLKDVQGKKVHSTQKPEALL KKIILSATKPKDIILDPFFGTGTTGAVAKSMDRYFIGIEKDSFYIKEAAK RLNSTRDKSDFITNLELETKPPKIPMSLLISKQLLKIGDFLYSSNKEKIC QVLENGQVRDNENYETSIHKMSAKYLNKTNHNGWKFFYAYYQNQFLLLDE LRYICQRDS >gid:21022 jhp1284 TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MKSETIYKDFCFELEGILFNFDTSLLESKKYNEKVDLIFDLKSIVKDKDT KTNTLNFSVTLSSKGTQTKTNEILKECSNQGVKLDEEILKKAFVKFKKQG SMDYFIHKNAQGFLKEQLDLYLFEYLFKEMTAFDHKRLNGINTIKEVALE VIALVSEFENELCKIWNKPRLVLNSHFIVSLDKLKAKNYDLSKITNHPNY PKQVKEWQDLNLEIADNLLGNEFLPLDTLYFKDLEEEVKSLFNENEINGT LIKSENYQALNSLKNRYKEAIDCIYIDPPFNTGSDFAYIDRFQDSTWLSL MHNRLQLAYDFLSPQGNFYLHLDYRANYLGRMLLNDIFSKENFRNEIIWH FRTYQGQIQSNFPRKHDSLLWYSKNCNVNNFFKITYSDNYKDTVDYRRWR EFIVDNNKIVYPNYPKADSRFDGYLKRYLQSTKEPKNGDIIATINGYVID DVWTDIQAIDPKKADERLQGTLTQKPEKLLERIIKASSNENSIVCDFFAG SGTTCAVAHKLKRKYIGVEMGEHFESVILPRLKKVIGGFKSGALKEFNGG GVIKVYELESYEEILRKIKYEDNDKPLAYEEQYSDLVERKNESYTLNIEA LENMGVDIKETLENLHGVGVEFFNEKVVKFKGNDKEVSILKALKEALIW >gid:21041 jhp1303 putative MPKKELLKMSKKRIFKDFLEEVKRHRPIIFYTDNDCDGMLAGSVLMSMCY RLGIKDFFFFSPLRNAHGYGFTDLALNDLLSQLCIFNPKTNQLVRLDYIK NQFQKTPLLFSADLGADLVSNIELQKILLERFEQCIITDHHKSFEVDWID KNKIAYINLNDEKDANYYSGAFTSALVFSQIFQIQTTPLEEELIAITLLS DRIDLDNGDNLDMVLNLAQPKHDRIECFFKDKDLSLAQDDLDEISNLYGF NCINYINALSRLSGAREFKGCYNSYLHYLVLKHFNPISDPRLSVFNVKEF KRYNDIKKKMVKESEENAQIFSCNKILVAILDESCSIKVGVSGLVANNFL KKYPFNHSLCIYKDSKDGYSGSARGDGTFLSQIKTIPLIQAGGHEEAFGL SFAKKRILKK >gid:21042 jhp1304 putative MLLDYDFLLLLNDESGKPTRYYYLLQDFEKDFVASEVAQNRARQFVKEII GSKKASKTKNSAIKVSHTKASAIGSETIGSCDLKKACEKIKSGLPFGIIS AFKPFKDAFYRDFNHNEQKLLIGAAKSGCIQSSADKLAQLKTRLIYWQDK SVKVDWDKPILIKDFFKGNNYLYRRLCFLLGKHFMDRFLKNNAKASVKDF MSSKEFVNKYRYTPKQNTERAKKLQSYLESKRDFIGFVQTLNSLKDSPQD PFLPNEEISFLVFANEPTIIFNLRDYLLVLAQIFNQQAICYCESKCPIEL INASPGKGL >gid:21113 jhp1375 putative MQDELFETEKAPQKNAKNAKNAPKKSFEEHVHSLERVIDRLNDPNLSLKD 
GMDLYKTAMQELFLAQKLLENAYSEYEKLQTPNKKA >gid:21130 jhp1392 putative MSSVQILSNFNYPISKVINEGLRNSLDTHIAVAFLKYSGVEIIQDVLINF LEKGAEFEIIVGLDFKTTDSKSIRFFLDLNKTYKKLKFYCYGDKENNKTD IVFHPKIYMFDNGKEKTSIIGSANLTKGGLENNFEVNTIFTEKEPLYYSQ LNAIYNSIKYADSLFTPNEEYLESYDEVFSAIIKNEQKVSKDKSIQEKIK KIEKQEKLLPGTIPSIKAMIVEFIFACEKKGVKQVALQDIYQALEERIKK EEWGYKYKSDTFKNSIRGELNHHQKDSHSKHCFGLFERLQKGFYALTPKG RSYKGR >gid:21176 jhp1438 DNA POLYMERASE III MIKISLNSNKRAWMWWFQGVIFLNPKIVSWLLKAYRMSDNLLHKDIQALI ARLKHQDLNLSALEKSLSRLIHDEINLEYLKACGLNFVETSENLITLKNL KTPLKDEVFSFVDLETTGSCPLKHEILEIGAVQVKGGEIINRFETLVKVK SVPDYIAELTGITYEDTLNAPSAHEALQELRIFLGNSVFVAHNANFDYNF LGRYFVEKLHCPLLNLKLCTLDLSKRAILSMRYSLSFLKELLGFGIEVSH RAYADALASYKLFEICLLNLPSYIKTTMDLIDFSKCANTLIKRPPRAKYQ EIPSPFPLFERTKGLLDIMKATS >gid:21219 jhp1481 putative MFIDTHCHLDHKDYENDLDEVLKESLEKGVTQCVIPGADMKDLNRAVGIS EKFEGVFFAIGAHPYDVESFDEGLFEKFVSHQKCVAIGECGLDYYRLPEL SERENYKSKQKEIFTKQIEFSIQHNKPLIIHIREASFDSLNLLKSYPKAF GVLHCFNADSMLLELSDRFYYGIGGVSTFKNAKRLVEILPKIPKNRLLLE TDSPYLTPHPFRGTRNSPTYIPLIAQKIAEIIHIETEELASLSTHNAQTL FNFP >gid:20296 lig DNA LIGASE MIKSQKEYLERIEYLNTLSHHYYNLDEPIVSDAVYDELYQELKAYEEENP NRIQANSPTQKVGATATNEFSKNPHLMRMWSLDDVFNQNELRAWLQRILK VYPNASFVCSPKLDGVSLNLLYQHGKLISATTRGNGLEGELVTNNAKHIA NIPHSIAYNGEIEIRGEVIISKEDFDALNKERLNANEPLFANPRNAASGS LRQLDSNITKKRKLQFIPWGVGKHSLHFLRFKECLDFIVSLGFSAIQYLS LNKNHQEIEENYHTLIREREGFFALLDGMVIVVDELDIQKELGYTQKSPK FACAYKFPALEKHTKIVGVINQVGRSGAITPVALLEPVEIAGAMVNRATL HNYSEIEKKNIMLNDRVVVIRSGDVIPKIIKPLESYRDGSQQKIMRPKVC PICSHELLCEEIFTYCQNLNCPARLKESLIHFASKDALNIQGLGDKVIEQ LFEEKLIFNALDLYALKLEDLMRLDKFKIKKAQNLLDAIQKSKNPPLWRL INALGIEHIGKGASKTLAKYGLNVLEKSEAEFLEMEGFGVEMAHSLVNFY ASNQEFIRSLFELLNPKSSDTAEEKQKSSSVFSDKTIVLTGTLSKPRQEY AQMLENLGAKIASSVSAKTDFLIAGENAGSKLALAQKHGVSVLNEEELLK RLKELD >gid:21196 mfd TRANSCRIPTION-REPAIR COUPLING FACTOR MIQSSLYRALNKGFDYQILACKDFKESELAKEVISYFKPNIKAVLFPELR AKKNDDLRSFFEEFLQLLGGLREFYQALENKQETIIIAPISALLHPLPKK ELLESFKITLLEKYNLKDLKDKLFYYGYEILDLVEVEGEASFRGDIVDIY IPNSKAYRLSFFDAECESIKELDPATQMSLKEDLLEIEIPPTLFSLDEPS 
YKDLKTKVEQSPLNSFSKDLTSFGLWFLGEKANDLLGVYQSIISPRALEE IQELASLNELDDERFKFLKVLENAQGYEDLEIHVHALEGFIALHSNRKIT LLAPNKTILDNSISVLDAGNMECVIAPFVLNFKTPDRIFISLNSFERKKK RQKSKLALNELNAGEWVVHDDYGVGVFSQLIQHSVLGSKRDFLEIAYLGE DKLLLPVENLHLIARYVVQSDSVPVKDRLGKGSFLKLKAKVRAKLLEIAG KIIELAAERNLILGKKMDTHLAELEIFKSHAGFEYTSDQEKAIAEISRDL SSHRVMDRLLSGDVGFGKTEVAMHAIFCAFLNGFQSALVVPTTLLAHQHF ETLKARFENFGVKVARLDRYIKTSEKSKLLKAVELGLVDVLIGTHAILGT KFKNLGLMVVDEEHKFGVKQKEALKELSKSVHFLSMSATPIPRTLNMALS QIKGISSLKTPPTDRKPSRTFLKEKNDELLKEIIYRELRRNGQIFYIHNH IASISKVKTKLEDLIPKLKIAILHSQINANESEEIMLEFAKGNYQVLLCT SIVESGIHLPNANTIIIDNAQNFGLADLHQLRGRVGRGKKEGFCYFLIED QKSLNEQALKRLLALEKNSYLGSGESIAYHDLEIRGGGNLLGQDQSGHIK NIGYALYTRMLEDAIYELSGGKKRLEKSVEIQLGVSAFLNPELIASDSLR LDLYRRLSLCENVDEVGQIHEEIEDRFGKMDDLSAQFLQIITLKILANQL GILKLSNFNQNITLTYSDEKKESLKAPSKDDNDILETLLKHLHAQISLKR R >gid:21034 mod_1 putative TYPE III DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MLKTPLKTLLDILINHFTKECLITLITEHDEKLLIFMLEHENANDYKKHF FKTIANSLVFNEKALLECLEIKELEKSFTRFKNKIGLFSQEGFIKSSELV VLNFPFKDNVLLGNAKDNSTKSNELFYHEILHKNEIDTLLSPKALCRFEM HGQGDLQSALKDENTNYLIKGNNLIALHSLKKKFAKKVKCIYIDPPYNTG NDSFNYNDNFNHSSWLVFMKNRLEAAREFLSDDGVIFVQCDDNEQAYLKV LMDEIFGRENFIACFVWEKTSNSLSRIRIKTEYILCYEQTKFGLIFNGDM AEEGQDFPILNEVNVKRTLQFPPNSIYFKTFKGVIKPTKFNKMELIDDLR IVNKTNSNMVRINAKFKWTQDKLDDEIKEGTTFVIKSDEFSMRYIRKGDR EVKASNVFNAECGVTTNIKATSEIKVLFANSNTDLFSTPKPEALLQRILE ISTNENDLVLDFFAGSGTTCAVAHKMKRRYIGIEQMDYIETITKERLKKV IEGEQGGISKKCDFKGGGSFVYAELKEVNSGIKKQILNAKSASECLKIFN DLNKRILKRADNKMDAIHSEEFQNLDLNEQKRKCCASLDANEDYLNLGDI DEDAWEIDEITKKYNEIFYS >gid:21149 mod_2 TYPE III DNA MODIFICATION ENZYME (METHYLTRANSFERASE) MQNKEIGGEKSVNEKNVEVFNRYFPGCLSIENDNKLTLDTGKLKALLGDF SEIKEEGYGLDFVGKKIALNQAFKKNHKILKPLNESTSKHILIKGDNLDA LKILKQSYSEKIKMIYIDPPYNTKNDNFIYGDDFSQSNEEVLKTLDYSKE KLDYIKNLFGSKCHSGWLSFMYPRLLLAKDLLKQDGVIFISIDDNECAQL KLLCDEIFGEGNFIETFLWNKTQTPPSASNKTRKTHEFILCYQKNKDNKK MIAKFSDGGDAPLWNETNSERILTFPANYVFSNLKNGIYKKGIRDKIEVL DDIEVYNGKIINSFRLKGHFRWKQETLNEEIINNVFLVVKSDKFSIRYCR 
EGERIVMPTNEISKKDNVGTNETASKELFKLFDNNKIFNFNKPVSLIKYL ISICSNNTNEGDIILDFFAGSGTTAHAVLESNKSDYQKLSEGGGLFNGLN AVFKERRFILVQLDEKIEKNKSAYDFCLNTLKSTSPSIFDITEERIKRAG AKIKEACAHLDVGFRAFEIIDDETHTNDKNLGEAHQKDLFAYSNLDRMET QTILIKLLGCEGLELTTPIICLIENALYLALNTAFIVGDIEMSEVLENLK DKGVEKISVYMPAISNDRLCLELGSNLLDLKLESGDLKIRG >gid:20303 mutS DNA MISMATCH REPAIR PROTEIN MSDASKRSLNPTLMMNNNNTLPKPLEESLDLKEFIALFKTFFAKERGSIA LENDLKQAFTYLNEVDAIGLPAPKSVKESDLIVVKLTKLGTLHLDEIYEI VKRLRYIVVLQNAFKPFTHLKFHERLNAIILPPFFNDLILLLDDEGQIKQ GANATLDALNESLNRLKKESTKIIHHYAHSKELAPYLVDTQSHLKHGYEC LLLKSGFSSAIKGVVLERSANGYFYLLPESAQKIAQKIAQIGNEIDCCIV EMCQTLSRSLQKHLLFLKFLFKEFDFLDSLQARLNFAKAYNLEFVMPSFT QKKMILENFSHPILKEPKPLNLKFEKSMLAVTGVNAGGKTMLLKSLLSAA FLSKHLIPMKINAHHSTIPYFREIHAIINDPQNSANNISTFAGRMKQFSA LLSKENMLLGVDEIELGTDADEASSLYKTLLEKLLKQNNQIVITTHHKRL SVLMAENKEVELLAALYDEEKERPTYTFLKGVIGKSYAFETALRYGVPPF LIEKAKAFYGEDKEKLNVLIENSSTLERELKQKNEHLENALKEQEDLKNA WLLEMEKQKEIFHHKKLELEKSYQQALNILKSEVASKDTSSMHKEIHKAS EILNKHKTDQEIPQIITSFQINEKARYKNESVLVIQILDKGYYLVETELG MRLKAHGSWLKQIQKPPKNKFKPPKTIVPKPKEASLRLDLRGQRSEEALD LLDAFLNDALLGGFEEVLICHGKGSGILEKFVKEFLKNHPKVVSFSDAPI NLGGSGVKIVKL >gid:20887 mutT putative DGTP PYROPHOSPHOHYDROLASE MLHKKYRPNVAAIIMSPDYPNTCEVFIAERIDIEGAWQFPQGGIDEGETP LEALYRELLEEIGTNEIEILAQYPRWIAYDFPSNMEHKFYSFDGQKQRYF LVRLKHVNNIDLNKHTPEFRSYQFIQLKDLLKKIVPFKRQVYRQVIAYFR KEGYLGC >gid:19871 mutY A/G-SPECIFIC ADENINE GLYCOSYLASE METLHNALLKWYEEFGRKDLPFRNLKGINAPYEVYISEVMSQQTQISTVI ERFYPPFLKAFPTLKDLANAPLEEVLLLWRGLGYYSRAKNLKKSAEICVK EHHSQLPNDYQSLLKLPGIGAYTANAILCFGFREKSACVDANVKRVLLRL FGLDPNIHAKDLQIKANDFLNLNESFNHNQALIDLGALICSPKPKCAICP FNPYCLGKNHLERHTLKKKQEIIQEERYLGVVIQNNQIALEKIEQKLYLG MHHFPNLKENLEFKLPFLGTIKHSHTKFKLNLNLYLATTKDLKNPIRFYS LKDLETLPISSMTLKILNFLKQKNLFGG >gid:20270 nth ENDONUCLEASE III MSLKRAKTKAQQIKELLLKHYPNQTTELRHKNPYELLVATILSAQCTDAR VNQITPKLFEKYPSVNDLALASLEEVKEIIQSVSYSNNKSKHLISMGAKV VKDFKGVIPSTQKELMSLDGVGQKTANVVLSVCFDANYIAVDTHVFRTTH RLGLSNANTPIKTEEELSDLFKDNLSKLHHALILFGRYTCKAKNPLCDAC FLKEFCVSKASFKA >gid:20356 ogt 
METHYLATED-DNA--PROTEIN-CYSTEINEMETHYLTRANSFERASE MTLYHYYFKTPKSFPLEYLHLCANESHLLRLDFDATNFSHHTPMNTPLKL SVQALERYFLGQLFEFDAPLDLIGTFFQKQVWSALMTIPYGKTKSYDEIA KLINNPKSCRAIGNANRNNPISLIVPCHRVVRKNGALGGYNGGIEVKKWL LEFESKILNERAKNFLIS >gid:20208 orf7 cag island protein MSLATSYNVSNNFSKFNIKRVRGYLICLVCNTPKMIQRGLNGVSFYGCSD YVNKGDCKGVLREINGSMKMVCLHCENTPIMEKVESGRGGAYACKNCNRK FYFIDLAKQNERKKDLEKEKKELLNKIEKQKIKHLERFILAGVKANIKEN SFFLGCKNYPKCEWTASMDSQDLKCPKCNRLMKRKKNFKNNEFFTATSLT LNAIEFCLYINLKKKETNV >gid:21184 pcrA putative ATP-DEPENDENT HELICASE MDTKRQCMALKASAGSGKTFALSVRFLALLFKGANPSEILTLTFTKKATA EMKERILDYLKILQKENLESEKEKSQNILKELEEKYHLDPSLVRNNAQKI YQRFLNAEVRISTIDAFFQSILRKFCWFVGLSANFEVNEDTEAHQRQLNE GFLSALNNKQLEELSAFIVQCLSYDNYTSDSILERLRFLKNKLYLFDPNK KDPVFDEEGFLEKLRSLNNQIQSIETASNEAKKAIKCDSFRGFLNSSLTW LEKKSEYRYFKKLKNEIPTLESECEEIENDLKRYYEAQETAIFKKFPKFI QLYDNATSKIQALDFDAIKDKVHVLLNGYEEMPAEFFYFRLDSKIAHILI DEFQDTSLNDYKILAPFIDEIKAGIGQAKWHRSVFFVGDVKQSIYAFRGS FSSLFESVSKDFYHDNLEFNHRSAPLIINYVNTIFKKAYQNSPTAYLEQK YPKTSNNKHVTEGYVKVSLVADEKELLLEQILQEAQNLLEHRIDPKDITI LCATNKDALEIKNYLQEYLSAIRPSTESSAKLSQLVESKIIKNALEYALA EEPYKPFYKHSVLKLAGYLHDDVIALPGFNPKKESVASFVWKIMEQFKLY EEPAQSCLELAVGCEDADGFLEKLEAKEIASFNPKGAQIMTIHKSKGMQF PYVIVCERLGNPNSSHANQLLEEYDGTELARLYYRMKNREVVDKDYARAL DKEEAAKDHEEINVYYVAFTRAELGLIVVAKDKKESKKESKNKKMHEQLE LAPLEEGEIAPVISPQKEPLMTSVVIKPHAYGEQVQEIEEESDSDYEKNN DQEAINFGIALHKGLEYQYAYNIPKQSVLEYLNYHYGFYGLDYQALEESL ELFENDAGIQALFKNHALKGEAAFLFQGVVSRIDVLLWDRGQNLYVLDYK SSQNYQQSHKAQVSHYAEFLRTQAPHFKIQAGIIYAHKRLLEKLWV >gid:21101 polA DNA POLYMERASE I MEQPVIKEGTLALIDTFAYLFRSYYMSAKNKPLTNDKGFPTGLLTGLVGM VKKFYKDRKNMPFIVFALESQTKTKRAEKLGEYKQNRKDAPKEMLLQIPI ALEWLQKMGFTCVEVGGFEADDVIASLATLSPYKTRIYSKDKDFNQLLSD KIALFDGKTEFLAKDCVEKYGILPSQFTDYQGIVGDSSDNYKGVKGIGSK NAKELLQRLGSLEKIYENLDLAKNLLSPKMYQALIQDKGSAFLSKELATL ERGCIKEFDFLSCAFPSENPLLKIKDELKEYGFISTLRDLENSPFIVENV PILNSTPILDNTPALDNAPKKSRMIVLESAEPLSMFLEKLENPNARVFMR LVLDKDKKILALAFLLQDQGYFLPLEEALFSPFSLEFLQNAFSQMLQHAC IIGHDLKPLLSFLKAKYQVPLENIRIQDTQILAFLKNPEKVGFDEVLKEY 
LKEDLIPHEKIKDFKTKSKAEKSELLSMELNALKRLCEYFEKGGLEEDLL TLARDIETPFVKVLMGMEFQGFKIDAPYFKRLEQEFKNELNVLERQILDL IGVDFNLNSPKQLGEVLYDKLGLPKNKSHSTDEKNLLKILDKHPSIPLIL EYRELNKLFNTYTTPLLRLKDKDDKIHTTFIQTGTATGRLSSHSPNLQNI PVRSPKGLLIRKGFIASSKEYCLLGVDYSQIELRLLAHFSQDKDLMEAFL KGRDIHLETSKALFGEDLAKEKRSIAKSINFGLVYGMGSKKLSETLSIPL SEAKSYIEAYFKRFPSIKDYLNGMREEILKTSKAFTLLGRYRVFDFTGVN DYVKGNYLREGVNAIFQGSASDLLKLGMLKVSERFKNNPSVRLLLQVHDE LIFEIEEKNAPELQQEIQRILNDEVYPLRVPLETSAFIAKRWNELKG >gid:20732 priA PRIMOSOMAL PROTEIN N' (REPLICATION FACTOR Y) MFYHLIAPLKNKTPPLTYFSKERHLKGALVNIPLRNKTLLGVVLEEVSKP SFECLELEKTPYFLLPFQIELAIFIAQYYSANLSSVLSLFAPFKECDLVG LEKIEPTLNALSQTQTNALKELQKHPASLLFGDTGSGKTEIYMHAIAQTL EQKKSALLLVPEIALTPQMQQRLKKVFKENLGLWHSKLSQNQKKQFLEKL YSQEIKLVVGTRSALFLPLKELGLIIVDEEHDFSYKSQQSPMYNARDLCL YLSHKFPIQVILGSATPSLSSYQRFKDKALVRLKGRYTPTQKNIIFEKTE RFITPKLLEALKQVIDKNEQAIIFVPTRANFKTLLCPNCYKSVQCPFCSV NMSLHLKTNKLMCHYCHFSSPIPKICNACQSEVLVGKRIGTMQVLKELES LLEGAKIAILDKDHTSTPKKLHNILNDFNAQKTNILIGTQMISKGHDYAK VSLAVVLGIDNIIKSNSYRALEEGVSLLYQIAGRSARQISGQVFIQSTET DLLENFLEDYEDFLQYELQERCELYPPFSRLCLLEFKHKNEEKAQQLSLE ASQTLSLCLEKGVTLSNFKAPIEKIASSYRYLILLRSKNPLSLIKSVHAF LKTAPNIPCSVNMDPVDIF >gid:19882 recA RECA PROTEIN. 
MAIDEDKQKAISLAIKQIDKVFGKGALVRLGDKQVEKIDAISTGSLGLDL ALGIGGVPKGRIIEIYGPESSGKTTLSLHIIAECQKNGGVCAFIDAEHAL DVYYAKRLGVDTENLLVSQPSTGEEALEILETITRSGGIDLVVVDSVAAL TPKAEIDGDMGDQHVGLQARLMSHALRKITGVLHKMNTTLIFINQIRMKI GMTGYGSPETTTGGNALKFYASVRIDIRRIAALKQNEQHIGNRAKAKVVK NKVAPPFREAEFDIMFGEGISKEGEIIDYGVKLDIVDKSGAWLSYQDKKL GQGRENAKALLKEDKALADEITLKIKESIGSNEEIMPLPDEPLEEME >gid:21150 recG ATP-DEPENDENT DNA HELICASE MQETDDLLKTLNVKSLLEALLVYTPKGYKDLSLLERFETGLSGVLEVGIL EKKNYAKVLKIFAYSKRFYKNLELVFFNHSAFYYNQFKTGESLFIYGKLE QSSFNQAYIINTPKILTEFGKISLIFKKVKNHKKIQENLQKLLSLENLKK EGVKENVARLLLEIFFPTPHFVKDFETNKNFPSQHLNALKYIEMLFYMKN LERKKLQFNAKIACPNNSERLKAFIASLPFKLTRDQQNAIKEIQSDLTSP IACKRLIIGDVGCGKTMVILASMVLAYPNKTLLMAPTSILAKQLYHEALK FLPPYFEVELLLGGSHKKRSNHLFEKITHVVIGTQALLFDKRDLNEFALV ITDEQHRFGTKQRYQLEKMASSKGNKPHSLQFSATPIPRTLALAKSAFVK TTMIREIPYPKEIETLVLHKREFKIVMEKISEEIAKNHQVIVVYPLVNKS EKIPYLSLSEGASFWQKRFKNIYTTSGQDKNKEEVIEEFRELGSILLATT LIEVGISLPRLSVIVILAPERLGLATLHQLRGRVSRNGLKGYCFLCTIQE ENERLEKFADELDGFKIAELDLQYRKSGDLLQGGKQSGNSFEYIDLARDE NIIAEVKQDFLKNASVSQGTFEN >gid:20061 recJ putative SINGLE-STRANDED-DNA-SPECIFIC EXONUCLEASE RECJ MKQKLKAQIKERVASIAYNEKGFPSPFLFKDLKKAALKIIEAMRANTEIL VVGDYDADGVISSAIMAKFFKSLNYKHVRVAIPNRFMDGYGISKKFLEKH HAPLIITVDNGINAFEAAQFCKEKNYTLIITDHHCLHHDEIPDAYAVINP KQPDCDFIQKEVCGALVAFYLCYGIHQLLGKEKSHSSELLCLAGVATIAD MMPLTFFNRFLVSKALYFLQKESLGAMGFLRQREVFRKRSLKASDISFNI APLINSAGRMQDAKMALDFLSANNFQDGCSLYERLKACNMKRKMIQQQVF EEAFRHAMVGEKIIVAFKDNWHEGVLGIVASKLVEATQKPSLVFTFKEGV YKGSARSSPNIDLIDALNGVSSLLLGYGGHRQACGLSVGKNNIVSLFETL ENFDFKVLPFYETEPPLTLNLKDIDRELLEIIEMGEPYGQENPEPLFQAK NLEVIEEKIIKESHQVLRFKDKECVKEAIYFNADRFLKAGERVSVLFSVE LDECSNEPKMFVKSLL >gid:21172 recN DNA REPAIR PROTEIN(RECOMBINATION PROTEIN N) MRDFNNIQITRLKVRQNAVFEKLDLEFKDGLSAISGASGVGKSVLIASLL GAFGLKESNASNIEVELIAPFLDTEEYGIFREDEHEPLVISVIKKEKTRY FLNQTSLSKNTLKALLKGLIKRLSNDRFSQNELNDILMLSLLDGYIQNKN KAFSPLLDALETKFTRLEKLERERRSLEDKKRFQKDLEERLNFEKMKLER LDLKEDEYERLLEQKKLLSSKEKLNDKIALALDVLENTHKITHALESVGH SAEFLKSALLEAGALLEKEQAKLEECERLDIEKVLEKLGMLSGIIKDYGS 
IAHAKERLGHVKNELHNLKEIDHHCETYHKEIERLKTECLKLCEEISGFR KEYLAGFNALLSAKAKDLLLKSPSLVLEEAPMSEKGAQKLVLHLQNSQLE TLSSGEYSRLRLAFMLLEMEFLKDFKGVLVLDEMDSNLSGEESLAVSKAL ETLSSHSQIFAISHQVHIPAVAKNHILVFKENHKSLAKTLNNEERVLEIA RMIGGSENIESAISFAKEKLKV >gid:20597 recR RECOMBINATION PROTEIN MNTYKNSLNHFLNLVDCLEKIPNVGKKSAFKMAYHLGLENPYLALKITHA LRNALENLKTCASCNALSETEVSEICSDESRQNSQLCMVLHPRDVFILED LKDFLGRYYVLNSIEDVDFNALEKRLIGENIKEIIFAFPPTLANDSLMLY IEDKLQHLHLTFTKIAQGVPTGVNFENIDSVSLSRAFNSRIKA >gid:21109 rep putative ATP-DEPENDENT DNA HELICASE MGFEKSILDNLNGAQKIAACHIQGPLLILAGAGSGKTKTLTSRLAYLIGA CGVPSENTLTLTFTNKASKEMQERALKLLKNQALIPPLLCTFHRFGLLFL RQHMNLLKRACDFSVLDSDEVKTLCKQLKISNFRASISQIKNGMMDLSVQ DSECYKAYELYQNALKKDNLVDFDDLLCLSLKILQDNEKLAKETSERYHY IMVDEYQDTNALQLEFLKQLSFTHHNLCVVGDDDQSIYGFRGADISNILN FSKHFKGAKIVKLETNYRSSAEILACANSLISHNQHRHIKTLQSFKGSHK SVICKEYPTQKEESLDVAYQIKALLKKGENLENIAILYRLNGLSRSIEES LNALNIPYRLIGAVSFYERAEVKDALALMHVVAKKDDRFFIKRVLNKPPR GLGKITQEWIFSLLDEEGLNLEEALKIGAFKDKLNPKNEYALKKFTAMIG RLREAFEISVEKFCERFLEETNLLKSYEKEDNYEEREGFVKELLSLVKEH FKTNPTHSLLDFLNESALDVHNTENAQKVSCMSVHMSKGLEFKHVFVIGL EEGFFPHRGFNQESDLEEERRLAYVAITRAKEELQLSYVKERSYFGRKIS CSPSVFLEEAQLLQQDKPPKQNHQKDTPIKVGDLIKHKIFGTGRVLGVEK GLSGLCLKINCGGNVYDKISEKFVEKVDNEF >gid:20344 rnhA RIBONUCLEASE HI MQEIEIFCDGSSLGNPGPGGYAAILRYKDKEKTISGGENFTTNNRMELRA LNEALKILKRPCHITLYSDSQYVCQAINVWLVNWQKKNFAKVKNVDLWKE FVKVSKGHSIVAVWIKGHNGHAENERCNSLAKLEAQKRVKTTT >gid:20981 rnhB RIBONUCLEASE HII MGCVSMTLGIDEAGRGCLAGSLFVAGVVCNEKIALEFLKMGLKDSKKLSP KKRFFLEDKIKTHGEVGFFVVKKSANEIDHLGLGACLKLAIEEIVENGCS LANEIKIDGNTAFGLNKRYPNIQTIIKGDETIAQIAMASVLAKASKDREM LELHALFKEYGWDKNCGYGTKQHIEAINKLGATPFHRHSFTLKNRILNPK LLEVEQRLV >gid:20553 ruvA HOLLIDAY JUNCTION DNA HELICASE MIVGLIGVVEKISALEAHIEVQGVVYGVQVSMRTAALLQTGQKARLKILQ VIKEDAHLLYGFLEESEKILFERLLKINGVGGRIALAILSSFSPNEFENI IATKEVKRLQQVPGIGKKLADKIMVDLIGFFIQDENRPARNEVFLALESL GFKSAEINPVLKTLKPHLSIEAAIKEALQQLRS >gid:20105 ruvB HOLLIDAY JUNCTION DNA HELICASE RUVB MKERIVNLETLDFETSQEVSLRPNLWEDFIGQEKIKSNLQISICAAKKRQ 
ESLDHMLFFGPPGLGKTSISHIIAKEMETNIKITAAPMIEKSGDLAAILT NLQAKDILFIDEIHRLSPAIEEVLYPAMEDFRLDIIIGSGPAAQTIKIDL PPFTLIGATTRAGMLSNPLRDRFGMSFRMQFYSPSELSLIIKKAAAKLNQ DIKEESADEIAKRSRGTPRIALRLLKRVRDFALVKNSSLMDLSITLHALN ELGVNELGFDEADLAYLSLLANAQGRPVGLNTIAASMREDEGTIEDVIEP FLLANGYLERTAKGRIATPKTHALLKIPTLNPQTLF >gid:20549 ruvC CROSSOVER JUNCTION ENDODEOXYRIBONUCLEASE MRILGIDPGSRKCGYAIISHASNKLSLITAGFINITTTRLQEQILDLIEA LDCLLDRYEVNEVAIEDIFFGYNPKSVIKLAQFRGALSLKILERIGNFSE YTPLQVKKALTGNGKAAKEQVAFMVKRLLNITSEIKPLDISDAIAVAITH AQRLKPR >gid:20904 ssb SINGLE-STRAND BINDING PROTEIN MFNKVIMVGRLTRNVELKYLPSGSAAATIGLATSRRFKKQDGTLGEEVCF IDARLFGRTAEIANQYLSKGSSVLIEGRLTYESWMDQTGKKNSRHTITAD SLQFMDKKSDNPQANSMQDSMTHENFNNAYPTNYNAPSQDPFSQAQSYPQ NAYTKENSQAQPSKYQNSVPEINIDEEEIPF >gid:20565 tnpA IS606 TRANSPOSASE MSVSKLVNSLKGVSSRLTRQHHFKSVEASLWGKHLWSPSYFAGSCGGTPL EMIKQYIQEQETPH >gid:20564 tnpB IS606 TRANSPOSASE MKVNKGFKFRLYPTKEQQDKLQHCFFVYNQAYNIGLNLLQEQYEKNKDLP PKERTRKKSSELDKAIKHHLNARGLSFSSVIAQQSRMNVERALKDAFKVK NRGFPKFKNSKSAKQSFSWNNQGFFIKESDEERFKIFTLMKMPLMMCMHR DFPPHSKVKQIVISCSHRKYFVSFSVEYEQDITPIKNPKNGVGLDLNILD IACSCGVNNHKKLTDFKRYSTDMKELLGIEIDEELDTKRLIPTYSKLYSL KKHSKKFKRLQRKQSRRVLKSKQNKTKLGGNFYKTQKKLNQVFDKSSHQK TDRYHKITSELSKQFELIVVEDLQVKNMTKRAKLKNVKQKSGLNQSILNT SFYQIISFLDYKQQHNGKLLVKVPPQYTSKTCHCCGNINHKLKLNHRQYW CLECGYREHRDINAANNILSKGLSLFGVGNIHADFKEQSLSC >gid:19849 topA_1 DNA TOPOISOMERASE I MKHLIIVESPAKAKTIKNFLDKNYEVVASKGHVRDLSKFALGIKIDETGF TPNYVVDKDHKELVKQIIELSKKASITYIATDEDREGEAIGYHVACLIGG KLESYPRIVFHEITQNAILNVLKTPRKIDMFKVNAQQARRLLDRIVGFKL SSLIASKITKGLSAGRVQSAALKLVIDREKEIRPFKPLTYFTLDALFEPH LEAQLISYKGNKLKAQELIDEKKAQEIKNELEKESYIISSIIKKSKKSPT PPPFMTSTLQQSASSLLGFSPTKTMSIAQKLYEGVATPQGVMGVITYMRT DSLNIAKEALEEARAKILKDYGKDYLPPKAKVYSSKNKNAQEAHEAIRPT SIILEPNALKDYLKPEELKLYTLIYKRFLASQMQDALFESQSVVVACEKG EFKASGRKLLFDGHYKILGNDDKDKLLPNLKENDPIKLEKLESNAHVTEP PARYSEASLIKVLESLGIGRPSTYAPTISLLQNRDYIKVEKKQISALESA FKVIEILEKHFEEIVDSKFSASLEEELDNIAQNKADYQQVLKDFYYPFMD KIEAGKKNIISQKVHEKTGQSCPKCGGELVKKNSRYGEFIACNNYPKCKY 
IKQTENANDEAKQELCEKCGGEMVQKFSRNGAFLACNNYPECKNTKSLKN TPNAKETIEGVKCPECGGDIALKRSKKGSFYGCNNYPKCNFLSNHKPINK RCEKCHYLMSERIYRKKKAHECIQCKERVFLEEDNG >gid:20657 topA_2 topoisomerase I MYKNCVFIIESPNKIAKIKELTGSSFVFATGGHFVELVNIEVNKEFNPIF EIKKSTDKKKDRSTHINHMINQCKDKVVYIATDPDREGYGIGYKFYEKIK NLAKTIYRTEFHEITKSGVEKGLNNAGLFSQSNLNLYYSWLGRIVSDQFI GFTLTPYLRKNIKNFEVGAGRVQTPALSILVELDRKIQAFEQKNNDEKLS YSIEAIIDALGSQISITLVEENKRKAFETKELAQNFLNDLKNNLNPLAFL DAIEQKDKEKAPPKPFTTSNLLKDGARILGMGVKQIQEHAQKLFEAGLIT YIRTDSEALSKEYLQEHEAFFEGIYPSVYEYREYRAGKNSQAEAHEAIRI THPHCYEDLKKVCEEHNITDIDDLKVYTLIFFNTICSQSKNAIYENTVLN FKVKTHRFNAVLANSNLKVLKRLKT >gid:20669 topA_3 topoisomerase I MNNSVIIIESPNKVAKIREITGAKVFATIGHFMQLKSYDENNGFKPTFDY DQEKKKHIFEMIEACKNKKVYIATDPDREGYAIGYMFYQKIKNVASSIYR AEFFEITPSGINKGLQNALLFENTNRQMYQSALARRVADMLLGFTLSPYL GKALGQMKGSSAGRVQTPCLKLIVDRDREIEKFKALPENEKVSYQIQAKI NDSANREVTIKHCDEKGEEIKFNDKEEALKLFESLKDNKACLLKDLKNSV VETKPKKPFITSTLLEKASSMLGLSISEVQSLAQNLFEAGLITYIRTDAE SLSVEFLDETESFYAPIYKDLYLKREYKAGKQSQAEAHEAIRITHPHTTE DLESIVYNANITNQDALKLYQLIFERTIESQGKNAIYDKQDLLFKIKNEY FKCSVKGLKSAGFLAMFSKKELENDESNDDKDNKEKEQNAQFNLKIDDVL SLNDLVLSTIKRNAPSAYKEADFVKLLENKGIGRPSTYASYLPTLVKREY ISISQDKKHIITPTHKGKRVVEVFENAYQFIIDLTYTKQMEEVLDEIVEN KSSYVDFISNLNSKCPKIEKLERNDDEIKPSSEGQITYIENILRDLQLNL SEEFKNYKEDNRVAKAFLDRYIKEHEFFKKNNKKASSSNNDENRPATPKQ ISFAEILAKKHNVKLPKGFKYSMKVCGDFINEYHKK >gid:21004 ung URACIL-DNA GLYCOSYLASE MKLFDYAPLSLAWREFLQSEFKKPYFLEIEKRYLEALKSPKTIFPKSSNL FCAFNLTPPYAVKIILLGQDPYHSTYLENEQELPVAMGLSFSVEKNAPIP PSLKNIFKELHANLGVPVPCCGDLSAWAKRGMLLLNAILSVEKNQAASHK YIGWEAFSDQILIRLFETTTPLIVVLLGKVAQKKIALIPKNKHIIITAPH PSPLSRGFLGSGVFTSVQKAYREVYRKDFDFSL >gid:20382 uvrA EXCINUCLEASE ABC SUBUNIT A. 
MQHKTIMDKIIIQGARENNLKNIFLEIPKNQFVVFTGLSGSGKSTLAFDT LYAEGQRRYLESLSSYARQFLDKVGKPNVDKIEGLTPAIAIDQKTTSKNP RSTVGTITEIYDYLRLLFARVGEQFCPTCLEPISSMSASDIISQICHLEE NSKIIILAPIIKDKKGSFNDKLESLRLKGYVRAFVDGVMVRLDEEIHLHK TKKHTIEAVVDRVVINSENASRIASAVEKALKESYGELEVEILQDNAPSI RKHYSEHKACFKCKMSFEELEPLSFSFNSPKGACESCLGLGTKFSLDISK ILDPNTPLNQGAIKVIFGYNRSYYAQMFEGFCTYNGIDSALCFNELNKEQ QDALLYGNGTEISFHFKNSPLKRPWKGIIQIAYDMFKEQKDLSDYMSEKT CSSCNGHRLKASSLSVQVAGLKMADFLTKPIEEVYHFFNDPTHFNYLNEQ EKKIAEPILKEILERVFFLYDVGLGYLTLGRDARTISGGESQRIRIASQI GSGLTGVLYVLDEPSIGLHEKDTLKLINTLRNLQKKGNTLIVVEHDKETI KHADFVVDIGPKAGRHGGEVVFSGSVKDLLQNNHSTALYLNGTKKIERPK FEPPKEKHFLEIKNVNINNIKNLSVQIPLKQLVCITGVSGSGKSSLILQT LLPTAQTLLNHAKKNQSLNGVEIVGLEYLDKVIYLDQAPIGKTPRSNPAT YTGVMDEIRILFAEQKEAKILGYSTSRFSFNVKGGRCEKCQGDGDIKIEM HFLPDVLVQCDSCKGAKYNPQTLEIKVKGKSIADVLNMSVEEAYEFFAKF PKIAVKLKTLIDVGLGYITLGQNATTLSGGEAQRIKLAKELSKKDTGKTL YILDEPTTGLHFEDVNHLLQVLHSLVALGNSMLVIEHNLDIIKNADYIID MGPDGGDKGGKVIASGTPLEVAQNCEKTQSYTGKFLALELK >gid:20779 uvrB EXCINUCLEASE ABC SUBUNIT B MPLFDLKSPYPPAGDQPQAIEALTKSLKNNNHYQTLVGVTGSGKTYTMAN IIAQTNKPALIMSHNKTLCAQLYSEFKAFFPHNRVEYFISHFDYYQPESY IPRRDLFIEKDSSINDDLERLRLSATTSLLGYDDVIVIASVSANYGLGNP EEYLKVMEKIKVGEKRAYKSFLLKLVEMGYSRNEVVFDRGSFRATGECVD IFPAYNDAEFIRIEFFGDEIERIAVFDALERNEIKRLDSVMLYAASQFAV GSERLNLAVKSIEDELALRLKFFKEQDKMLEYNRLKQRTEYDLEMISATG VCKGIENYARHFTGKAPNETPFCLFDYLGIFEREFLVIVDESHVSLPQFG GMYAGDMSRKSVLVEYGFRLPSALDNRPLKFDEFIHKNCQFLFVSATPNK LELELSQKNVAEQIIRPTGLLDPKFEVRDSDKQVQDLFDEIKSVVARGER VLITTLTKKMAEELCKYYAEWGLKVRYMHSEIDAIERNHIIRSLRLKEFD VLIGINLLREGLDLPEVSLVAIMDADKEGFLRSETSLIQTMGRAARNANG KVLLYAKKTTQSMQKAFEITSYRRAKQEEFNKIHNITPKTVTRALEEELK LRDDEIKIAKALKKDKMPKSEREKIIKELDKKMRERAKNLDFEEAMRLRD EIAQLRTL >gid:20498 uvrC EXCINUCLEASE ABC SUBUNIT C MADLLSSLKNLSSSSGVYQYFDKNRQLLYIGKAKNLKKRIKSYFSVRNNE ITPNPRTSLRVQMMVKQIAFLETILVENEQDALILENSLIKQLKPKYNIL LRDDKTYPYIYMDFSIDFPIPLITRKILKQPGVKYFGPFTSGAKDILDSL YELLPLVQKKNCIKDKKACMFYQIERCKAPCEDKITKEEYLKIAKECLEM IENKDRLIKELELKMERLSSNLRFEEALIYRDRIAKIQKIAPFTCMDLAK 
LYDLDIFAFYGGNNKAVLVKMFMRGGKIISSAFEKIHSLNGFDTDEAMKQ AIINHYQSHLPLMPEQILLSACSNETLKELQEFISHQYSKKIALSIPKKG DKLALIEIAMKNAQEIFSQEKTSNEDRILEEARSLFNLECVPYRVEIFDT SHHSNSQCVGGMVVYENNAFQKDSYRRYHLKGSNEYDQMSELLTRRALDF AKEPPPNLWVIDGGRAQLNIALEILKSSGSFVEVIAISKEKRDSKAYRSK GGAKDIIHTISHTFKLLPSDKRLQWVQKLRDESHRYAINFHRSTKLKNMK QIALLKEKGIGEASVKKLLDYFGSFEAIEKASDQEKNAVLKKRK >gid:19984 xseA EXODEOXYRIBONUCLEASE LARGE SUBUNIT MHVLSVSEINAQIKALLEATFLQVRVQGEVSNLTIHKVSGHAYFSLKDSQ SVIRCVLFKGNANRLKFALKEGQEMVVFGGISVYAPRGDYQINCFEIEPK EIGSLTLALEQLKEKLRLKGYFDEANKLPKPNFPKRVAVITSQNSAAWAD MKKIASKRWPMCELVCINTLMQGEGCVQSVVESIAYADSFYDTKNAFDAI VVARGGGSMEDLYSFNDEKIADALYLAKTFSMSAIGHESDFLLSDSVADL RASTPSNAMEILLPSSDEWLQRLDGFNVKLHRSFKTLLHQKKAHLEHLAA SLKRLSFENKHHLNALKLEKLTIALDNKTLEFLRLKKTLLEKISTQLSTS PFLQTKTERLNRLENALKLAYANLKLPQFGALVSKNHQAIELEALKRGDK IELSNEKARASAEILSVDRV