TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Helicobacter pylori J99, J99
Gene type: CDS

Number of genes found: 104

Free access
Sort by:

 



# Helicobacter pylori J99, J99

>gid:20869  M.HpyI  TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MNYIGSKYKLIPFIKENIHAVAGHDLSGAIFCDLFAGTGIVGRTFKKAVN
KVISNDLEYYSFVLNQNYIGNIQEIPNKEELINKINSVALKKGFIYSHYS
LGGSSRQYFSETNAQKIDAMRLKIEELKLSQNIDNHSYYFLLASLLESAD
KVANTASVYGAFLKRLKKSAQKELILKGAHFDVSLNANEVYQQDSNDLIG
KISGDILYLDPPYNARQYGANYHLLNTIAAYTPFTPKGKTDLPSYQKSSF
CSRFQILNAFENLIKKARFKYIFLSYNNEGLMSETEIKNILKKYGAYSLV
TKTYMRFKADNKRAHKAVHTKECLHVLIK
>gid:20227  cagH  cag island protein
MAGTQAIYESSSAGFLSQVSSIISSTSGVAGPFAGIVAGAMTAAIIPIVV
GFTNPQMTAIMTQYNQSIAEAVSVPMKAANQQYNQLYQGFNDQSMAVGNN
ILNISKLTGEFNAQGNTQSAQISAVNSQIASILASNTTPKNPSAIEAYAT
NQIAVPSVPTTVEMMSGILGNITSAAPKYALALQEQLRSQASNSSMNDTA
DSLDSCTALGALVGSSKVFFSCMQISMTPMSVSMPTVYAKYQAVATKALT
SGVNPMTTPACPIGDKVLAVYCYAEKVAEILREYYIEFVKNNTNLLQNAS
QMILNQSGLATSTYDTQAISNISSLYNYNIVANKSFLKSHLTYLDYIKDK
LKGQKDSYLTERVQTKIIVK
>gid:20226  cagI  cag island protein
MKCFLSIFSFLTFCGLSLNGAGAVITLEPALKAIQADAQAKQKTAQAELK
AIEAQSNAKEKAIQAQIEGELRTQLATMSAMLKGANGVINGVNSMTGGFF
AGSDILLGVMEGYSSALSALGGNVKMIVEKQKINTQTEIQNMQIALQKNN
EIIKLKMNQQNALLEALKNSFEPSVTLKTQMEMLSQALGSSSDNAQYIAY
NTIGIKAFEETLKGFETWLKTAMQKATLIDYNSLTGQALFQSTIYAPALS
FFSSMGAPFGIIETFTLAPTKCPYLDGLKISACLMEQVIQNYRMIVALIQ
NKLSDADFQNIAYLNGINGEIKTLKGSVDLNALIEVAILNAENHLNYIEN
LEKKADLWEEQLKLERETTARNIASSKVIVK
>gid:20220  cagS  cag island protein
MSNNMRKLFSMIANSKDKKEKLIESLQENELLNTDEKKKIIDQIKTMHDF
FKQMHTNKGALDKVLRNYMKDYRAVIKSIGVDKFKKVYRLLESETMELLH
AIAENPNFLFSKFDRSILGIFLPFFSKPIMFKMSIREMDSQIELYGTKLP
PLKLFVMTDEEVNFYANLKTIEQYNDYVRDLLMKFDLEKYMEEKGVQNA
>gid:19973  deaD  ATP-DEPENDENT RNA HELICASE DEAD
MELNQPPLPTEIDDDAYHKPSFNDLGLKESVLKSVYEAGFTSPSPIQEKA
IPAVLQGRDVIAQAQTGTGKTAAFALPIINNLKNNHTIKALVITPTRELA
MQISDEIFKLGKHTRTKTVCVYGGQSVKKQCEFIKKNPQVMIATPGRLLD
HLKNERIHKFVPKVVVLDESDEMLDMGFLDDIEEIFDYLPSEAQILLFSA
TMPEPIKRLADKILENPIKIHIAPSNITNTDITQRFYVINEHERAEAIMR
LLDTQAPEKSIVFTRTKKEADELHQFLASKNYKSTALHGDMDQRDRRSSI
MAFKKNDADVLVATDVASRGLDISGVSHVFSYHLPLNTESYIHRIGRTGR
AGKKGMAITLVTPLEYKELLRMQKEIDSEIELFEIPTINENQIIKTLHDA
KVSEGIISLYEQLTEIFEPSQLVLKLLSLQFETSKIGLNQQEIDAIQNPK
EKTPKPPNKKTQHEPAHSFKKSHHRDRHPKTNRYSKKHKRR
>gid:21155  dnaA  CHROMOSOMAL REPLICATION INITIATOR PROTEIN
MDTNNNIEKEILALVKQNPKVSLIEYENYLSQLKYNPNASKSDIAFFYAP
NKFLCTTITAKYGALLKEILSQNKVGMHLAHSVDVRIEVAPKIQVNAQSN
INYKATKTSVKDSYTFENFVVGSCNNTVYEIAKKVAQSDTPPYNPVLFYG
GTGLGKTHILNAIGNHALEKHKKVVLVTSEDFLTDFLKHLDNKNMDSFKK
KYRHCDFFLLDDAQFLQGKPKLEEEFFHTFNELHANSKQIVLISDRSPKN
IAGLEDRLKSRFEWGITAKVMPPDLETKLSIVKQKCQLNKITLPEEVMEY
IAQHISDNIRQMEGAIIKISVNANLMNATIDLNLAKTVLEDLQKDHAEGS
SLENILLAVAQSLNLKSSEIKVSSRQKNVALARKLVVYFARLYTPNPTLS
LAQFLDLKDHSSISKMYSSVKKMLEEEKSPFILSLREEIKNRLNELNDKK
TAFNSSE
>gid:21018  dnaB  REPLICATIVE DNA HELICASE
MDHLKHLQQLQNIERIVLSGIVLANHKIEEIHSVLEPSDFYYPPHGLFFE
IALKLHEVNCPIDENFIRQKMPKDKQISEDDLVAIFAASPIDNIEAYVEE
IKNASIKRKLFTLANTIREQALESAQKSSDILNAVEREVYALLNGSTIEG
FRGIKEVLESTMNLITENQRKGSLKVTGIPTGFVQLDNYTSGFNQGSLVI
LGARPSMGKTSLMMNMVLSALNDDRGVAVFSLEMSAEQLALRALSDLTSI
NMHDLESARLDDDQWENLAKCFDHLSQKKLFFYDKSYVRMDQIRLQLRKL
KSQHKELGIAFIDYLQLMSGNKATKERHEQIAEISRELKTLARELEIPII
ALVQLNRSLENRDDKRPILSDIKDSGGIEQDADIVLFLYRGYIYQMRAED
NKIDKLKKEGKVEEAQELHLKVNEERRIHKQNGSIEEAEIIVAKNRNGAT
GTVYTRFNAPFTRYEDMPVDSHLEEGQETKFEMPTT
>gid:21091  dnaE  DNA POLYMERASE III, ALPHA CHAIN
MKENKAFTHLHLHTEYSLLDGANKIKILAKRIKELGMKSVSVTDHGNMFG
AIDFYTSMKKEGIKPIIGMEAYIHNDDNLSSKETKQRFHLCLFAKNQEGY
ENLMFLSSMAYLEGFYYFPRINKKLLREHSKGIIASSACLQGEVNYHLNT
NNERNRKYGAKGYDEAKRIACEYQEIFEDDFYLEIMRHGILDQRFIDEQV
IKMSLETGLKIIATNDTHYTMPNDAKAQEVAMCVAMGKTLNDKGRLKHSV
HEFYIKSPEEMAKLFADIPEALENTQEIADKCVLEIDLKDDKKNPPTPPS
FKFTKAYAQNEGLSFEDDASYFAHKAREGLRERLILVPEEKHEQYKERLE
KEIEVITNMKFPGYMLIVWDFIRYAKEMGIPVGPGRGSAAGSLVAFALKI
TDIDPLKYDLLFERFLNPERVSMPDIDTDFCQRRRKEIIEYMIEKYGKYN
VAQVITFNKMLAKGVIRDVARVLDMPYKEADDFAKLIPNRLGITLKGYEK
NGEFIEGAWELEPKIKELVESNEVAKQVWEYSLNLENLNRNAGVHAAALV
VDSQKELWHKTPLFASEKTGGIVTQYSMKYLEPVDLIKFDFLGLKTLTVI
DDALKIIKTQHNIDVDFLSLDMDDPKVYKTIQSGDTVGIFQIESGMFQGL
NKRLRPSSFEDIIAIIALGRPGPMESGMVDDFVNRKHGVEPIAYAFKELE
PILKPTYGTIVYQEQVMQIVQTIGGFSLGEADLIRRAMGKKDAQIMADNK
AKFVEGAKNLGHDGQKAANLWDLIVKFAGYGFNKSHSAAYAMITFQTAYL
KTYYKHEFMAAMLTSESNKIESVARYIDEVRALEIEVMPPHINSSMQDFS
VAEFKNQKGELEKKIVFGLGAVKGVGGEPIKNIIEERAKGDYKSLEDFIS
RVDFSKLTKKSLEPLVKSGSLDNLGYTRKTMLANLDLICDAGRAKDKANE
MMQGGNSLFGAMEGGIKEQVVLDMVDLGEHDAKTLLECEYETLGIHVSGN
PLDEFKEEIKGFKNLVKSIDIEELEIGSQAYLLGKIMEVKKKIGKRSGKP
YGTADILDRYGKFELMLFEKQLNALEELDINKPLVFKCKIEEQEEVARLR
LFEILDLESAREVKIPKARYKDPEKQKEDVREIPPMEMLASSSCSLAIVL
ENDVKKEFLRQIKESALKHQGKRPLYLIIKDKDKQFKIQSDLMVNEKIKD
DFKGLEWRDLA
>gid:19752  dnaG  DNA PRIMASE
MILKSSIDRLLQTIDIVEVISSYVDLRKSGSNYMACCPFHEERSASFSVN
QVKGFYYCFGCGASGDSIKFVMAFEKLSFVEALEKLAHRFNIALEYDKGV
YYDHKEDYHLLEMVSSLYQEELFNAPFFLNYLQKRGLSMESIKAFKLGLC
TNKIDYGIENKGLNKDKLIELGVLGKSDKEDKTYLRFLDRIMFPIYSPSA
QVVGFGGRTLKEKAAKYINSPQNKLFDKSSLLYGYHLAKEHIYKQKQVIV
TEGYLDVILLHQAGFKNAIATLGTALTPSHLPLLKKGDPEILLSYDGDKA
GRNAAYKASLMLAKEQRKGGVILFENNLDPADMIANHQIETLKNWLSRPI
AFIEFVLRHMAGSYLLDDPLEKDKALKEMLGFLKNFSLLLQNEYKPLIAT
LLQAPLHVLGIREPVSFQPFYPKTEKPNRPQKFAHVSSMPSLEFLEKLVI
RYLLEDRSLLDLAVGYIHSGVFLHKKQEFDALCQEKLDDPKLVALLLDAN
LPLKKGGFEKELRLLILRYFERQLKEIPKSPLSFSEKMIFLKRARQAIMK
LKQGELVAI
>gid:20190  dnaN  DNA POLYMERASE III, BETA CHAIN
MKISVSKNDLENTLRYLQAFLDKKDASSIASHIHLEVIKEKLFLKASDSD
IGLKSYISTQSTDKEGVGTINGKKFLDIISCLKDSNIVLETKDDSLVIKQ
NKSSFKLPMFDADEFPEFPVIDPKVSLEINAPFLVDAFKKIAPVIEQTSH
KRELAGVLMQFNQKHQTLSVVGTDTKRLSYTQLEKISIHSTEEDISCILP
KRALLEILKLFYENFSFKSDGMLAVVENETHAFFTKLIDGNYPDYQKILP
KEYTSSFTLGKEEFKEGIKLCSSLSSTIKLTLEKNNALFESLDSEHSETA
KTSVEIEKGLDIEKAFHLGVNAKFFLEALNALGTTQFVLKCNEPSSPFLI
QEPLDEKQSHLNAKISTLMMPITL
>gid:20393  dnaX  DNA POLYMERASE III SUBUNITS GAMMA AND TAU
MQVLALKYRPKHFSELVGQESVAKTLSLALDNQRLANAYLFSGLRGSGKT
SSSRIFARALMCKTGPKAVPCDTCIQCQSALNNHHIDIIEMDGASNRGID
DVRNLIEQTRYKPSFGRYKIFIIDEVHMFTTEAFNALLKTLEEPPSHVKF
LLATTDALKLPATILSRTQHFRFKKIPENSVISHLKTILEKEQVSYESSA
LEKLAHSGQGSLRDTITLLEQAINYCDNAITESKVAEMLGAIDRSVLEDF
FQSLINQDEARLQERYAILENYETEGVLEEMMLFLKAKLLSPDSYSILLI
ERFFKIIMSSLSLLKEGANASFVLLLLKMKFKEALKLKALDDAIVELEQT
PFNQSPSISYNAPKQEPKSTERIEGREKLEKRERIETPQTPMLSAKDRIF
HNLFKQVQTLVYERNYELGAVFEKNIRFIDFDSQTKTLTWESLATDKDKE
LLRERFKIVKSIVDGVFGKGENIKIALKHHLENKSAPEETKEVKEFKFPP
LKPKLTTETTAEMQEKETKEAVGKALQTKENDTKEVQENETKETKEAQPK
EAPTALQEFMANHSNLIEEIKSEFEIKSVELL
>gid:21153  exoA  EXODEOXYRIBONUCLEASE
MKLISWNVNGLRACMTKGFMDFFNSVDADVFCIQESKMQQEQNTFEFKGY
FDFWNCAIKKGYSGVVTFTKKEPLSVSYGIDIKEHDKEGRVVTCEFESFY
LVNVYTPNSQQALSRLSYRMSWEVEFKKFLKALELKKPVIVCGDLNVAHN
EIDLENPKTNRKNAGFSDEEREKFSELLNAGFIDTFRYFYPNKEKAYTWW
SYMQQARDKDIGWRIDYFLCSNPLKTRLKDALIYKDILGSDHCPVGLELV
>gid:20379  gyrA  DNA GYRASE SUBUNIT A
MQDHLVNETKNIVEVGIDSSIEESYLAYSMSVIIGRALPDARDGLKPVHR
RILYAMHELGLTSKVAYKKSARIVGDVIGKYHPHGDTAVYDALVRMAQDF
SMRLELVDGQGNFGSIDGDNAAAMRYTEARMTKASEEILRDIDKDTIDFV
PNYDDTLKEPDILPSRLPNLLVNGANGIAVGMATSIPPHRIDEIIDALAH
VLGNPNAELDKILEFVKGPDFPTGGIIYGKAGIVEAYKTGRGRVKVRAKV
HVEKTKNKEIIVLGEMPFQTNKAKLVEQISDLAREKQIEGISEVRDESDR
EGIRVVIELKRDAMSEIVLNHLYKLTTMETTFSIILLAIYNKEPKIFTLL
ELLRLFLNHRKTIIIRRTIFELEKAKARAHILEGYLIALDNIDEIVRLIK
TSPSPEAAKNALIERFSLSEIQSKAILEMRLQRLTGLERDKIKEEYQNLL
ELIDDLNGILKSEDRLNEVVKTELLEVKEQFSSPRRTEIQESYESIDTED
LIANEPMVVSMSYKGYVKRVDLKAYERQNRGGKGKLSGSTYEDDFIENFF
VANTHDILLFITNKGQLYHLKVYKIPEASRIAMGKAIVNLISLAPNEKIM
ATLSTKDFSDERSLAFFTKNGVVKRTNLSEFGGNRSYSGIRAIVLDEGDE
LVGAKVVDKNAKHLLIASYLGMFIKFPLEDVREIGRTTRGVMGIRLNEND
FVVGAVVISDDSNKLLSVSENGLGKQTLAEAYREQSRGGKGVIGMKLTQK
TGNLVSVISVDDENLNLMILTASAKMIRVSIKDIRETGRNASGVKLINTA
DKVVYVNSCPKEEEPENLETSSVQNLFE
>gid:20191  gyrB  DNA GYRASE SUBUNIT B
MQNYQSHSIKVLKGLEGVRKRPGMYIGDTNVGGLHHMVYEVVDNAVDESM
AGFCDTINITLTEEGSCIVEDNGRGIPVDIHPTEKIPACTVVLTILHAGG
KFDNDTYKVSGGLHGVGVSVVNALSKRLIMTIKKEGQIYRQEFEKGIPIS
ELEIIGKTKSAKESGTTIEFFPDESVMEVVEFQAGILQKRFKEMAYLNDG
LKISFKEEKTQLQETYFYKDGLKQFVKDSAKKELLTPIIAFKSMDEETRT
SIEVALAYADDYNENTLSFVNNIKTSEGGTHEAGFKMGLSKAILQYIDNN
IKTKESRPISEDIKEGLIAVVSLKMSEPLFEGQTKSKLGSSYARALVSKL
VYDKIHQFLEENPNEAKIIANKALLAAKAREASKKARELTRKKDNLSVGT
LPGKLADCQSKDPLESEIFLVEGDSAGGSAKQGRDRVFQAILPLKGKILN
VEKSHLSKILKSEEIKNMITAFGCGIQESFDIERLRYHKIIIMTDADVDG
SHIQTLLMTFFYRYLRPLIEQGHVFIAQAPLYKYKKGKTEIYLKDSVALD
HFLIEHGINSVDIEGIGKNDLMNLLKVARHYRYTLLELEKRYNLLEVLRF
LIETKDALSLDMKVLEKSILEKLEGLNYQILRSFATEESLHLHAQTPKGL
VEFNLDDNLFKDVLFEEAHYTYQKLMEYNLDFLENKDILAFLEEVENYAK
KGANIQRYKGLGEMNPNDLWETTMHKENRSLIKLKIEDLEKTDAVFSLCM
GDEVEPRRAFIQAHAKDVKQLDV
>gid:20890  holB  putative DNA poymerase III subunit delta'
MKNSNRLIYTDNLEESLEEAASLFKHHIKFYTEIIEKDKKVIKTFNKDFK
IEHAKEVISKAHLKHSELNAFLIAAPSYGVEAQNALLKILEEPPNNVCFI
MFAKSPNHVLATIKSRLIKEDKRQKIPLKPLDLDLSKLDLKDIYAFLKNL
DKENFDSRENQRERIESLLESIHRHKIPLNEQELQAFDLAIKANSSYYKL
SYNLLPLLLSLLSKKKTP
>gid:19785  jhp0043  putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MIQIHHANAFEIIKDFHQQNLKVDAIITDPPYNISVKNNFSTLKSAKRQG
IDFGEWDKNFKLLEWIARYASLINPNGCMVIFCSYRFISYIADFLEENGF
VVKDFIQWVKNNPMPRNIHRRYVQDTEFALWAVKKKAKWVFNKPKNEKYL
RPLILKSPVVSGIERVKHPTQKSLTLMEKIISTHTNPNDTVLDPFMGSGT
TGLACKNLKRNFIGIESEKEYFQIAQKRLS
>gid:19786  jhp0044  putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MLVSRFLNAIDPFNLGVLLSRFQIKNGCIYGVCSYKVSKFTPGYEESKAR
VLNALNILSKHQIWQSNQESVTKVKGTFVFILENDLHLDENSFYKKLLNL
IIDNDFFNRSHLVTPSNGTNSHPELHRSITPREAARIQSFSDDYIFYGNK
TSVCKQIGNAVPPLLALALGKAILKSARNDTNPSR
>gid:19787  jhp0045  putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MQNLLIQAENAIALLFLLNDKNLKGKIDLIYIDPPFATNNHFTITNGRAT
TISNSKNGDIAYSDKVVGMDFMEFLKQRLVLLKELLSEQGSIYVHTDYKI
GHYVKVMLDEIFGIQNFRNEITRIKCNPKNFKRIGYGNIKDMILFYSKGK
NPIFNEPKIPYTPQDLEKRFPKIDKDKRRYTTVPIHAPGEVESGECSKAF
KGMLPPKGRHWRTDIATLERWDKEGLIEYSNNNNPRKKIYALEQVGKRVQ
DIWEFKDPQYPSYPTEKNAQLLDLIIKTSSNKDSIVLDCFCGSGTTLKSA
FLLQRKFIGIDNSDLAIQACKNKLETITKDLFVSQNFYDFLVF
>gid:19793  jhp0051  putative
MEVKDRLNFNFVATTHSAMDLIASVLSDSKYYLESFYNQASQELGDKRSD
KGEKLAELFDLLFEYIKDSKFERLKEPSAYDYSCKKLYPEQNTSQKMRRV
VLRGYKHNDKMYHTIVDMGS
>gid:19799  jhp0057  putative
MSRVQMDTEEVREFVGHLERFKELLNEEVNSLNGHFHNLESWQDARRDKF
SEVLDNLKGTFNEFNEAAQEQIAWLKERIRVLEEDY
>gid:19802  jhp0060  putative
MLFSHKVFLEGCTNELRRICDSFVEGAMQDDLGQKLKSEALENMLKIAHD
LENLEQETQYEMRKINEQLEEAKRLEKQVDMQDRHSQSEIDRLMREAKEH
EREAKRRYGEYLKDKND
>gid:19826  jhp0085  TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MIIAHSNEIAHPIFKSADKLFTLYQGDCNEVLPQFENAFDLIFADPPYFL
SNDGLSIQSGKIVSVNKGDWDKENGINDIDEFNYQWINNAKKALKNTGSL
LISGTYHNLFSLGRILQKLDFKILNLITWQKTNPPPNFSCRYLTHSAEQI
IWARKSYKHKHVFNYEILKKINNNKQMRDVWNFPAIAPWEKANGKHPTQK
PLALLVRLLLMASDDNSLIGDPFSGSSTTGIAANLLKREFIGIEKESRFI
KISMNRKLELDARHQEIRSKIKDLNFQ
>gid:19926  jhp0185  putative
MQFEMRKIAFNAPKAFSLEHEGVVLEGEIVRVGAKLFRLKARLKGELMLI
CDTSGKEFKKSLDESLVLHISDGLWDTQSQSLDFDNLDVIESFNGFIDLS
EILRSEVESIKLDYHYAD
>gid:19968  jhp0227  putative
MRDYSELEIFEGNPLDKWNDIIFHASKKLSKKELERLLELLALLETFIEK
EDLEERFESFAKALRMDEELQQKIESRKTDIVIQSMANILSGNE
>gid:19985  jhp0244  TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MHKVFIMEALECLKRIEKESIQTIYIDPPYNTKSSNFEYEDAHADYEKWI
EEHLILAKAVLKQSGCIFISMDDNKMAEVKIIANEIFGTRNFLGTFITKQ
ATRSNAKHINITHEYVLSYAKNKAFAPGFKILRTLLPIYAKALKDLMRTI
KNVFKQKGQAQAQLILKEQIKELSQKEHFNFLKNYNLVDEKGEIYFAKDL
STPSNPRSVAIQEINLFLEPLKSRGWSSDEKLKELYYQNRLIFKNNRPYE
KYYLKESQDNCLSVLDFYSRQGTKDLEKLGLKGLFKTPKPVALIKYLLLC
STPKDSIILDFFAGSGTTAQAVIEVNKDYYLNWSFYLCQKEEKIKNNPQA
VSILKNKGYKNTISDIMLLRLEKIIKRSEYEILKTKSILF
>gid:19989  jhp0248  TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MKPYFSLEKLDLYHGDASVLETFEKGFYDLCVTSPPYNLSIEYQGSNDFR
AYDDYLNWCKNWLKNCYFWGKEQARLCLNVPLDTNKHGKQSLGADITIVA
KECGWKYQNTIIWNESNISRRTAWGSWLQASAPYAIAPVELIVVFYKNEY
KRKKQTSTMSREEFLLYTNGLWNFSGESKKRLKHPAPFPRELPRRCIQLF
SFLEDTIFDPFSGSGTTILEANALGRFSVGLEIEKEYCELSKKRILESLS
LV
>gid:20055  jhp0316  putative
MKSNFQYSTLENIPKAFDILKDPPKKLYCVGDIKLLEAPLKVAIIGTRRP
TPYSKQHTITLARELAKNGAVIVSGGALGVDIIAQENALPKTIMLSPCSL
DFIYPTNNHKVIQEIAQNGLILSEYEKDFMPIKGSFLARNRLVIALSDAV
IIPQADLKSGSMSSARLAQKYQKPLFVLPQRLNESDGTNELLEKGQAQGI
FNIQNFINTLLKDYHLKEMPEMKDEFLEYCAKNPSYEEAYLKFGDKLLEY
ELLGKIKRINHLVVLA
>gid:20056  jhp0317  putative
MILACDVGLKRIGIAALLNGVILPLEAILRHNRNQASRDLSDLLRKKDIQ
VLVVGKPNESYADTHARIEHFIKLVDFKGEIVFINEDNSSVEAYENLEHL
GKKNKRIATKDGRLDSLSACRILERYCQQVLKKG
>gid:20075  jhp0336  putative
MNLEKLFLEKAPLFVFSSTRRLKHFYLEQGEGFLPNAMSMGSFFEQAFYI
PNQKKIPKSARQILMIDTIKAIAKEKKFILEGLLLFENSFLGYLESTSFL
FDLFDELSSACIKLNELSFKDIYLDYEKHLEVLEMIYDRYVKKLEELGFY
DKIMQKKPTILKEFFEHFSSIEWHLDGFMSVFERQCLLEVAELVPITLYL
SCDKYNQKFLEFLNLKLETDCDYSIDFKTQKILSQTFNDQKIEPKLYANS
SYLKQGALVLQTIEEYLQENNDPNKMAIITPNADFLPFLKLLDRNNNLNF
AMGLGAKNSPYYTELVKILEDLQTSDCNLSGSALLDLENITLALLEQQSS
KEKAPLKEAHSQIMHQYHLLKDTLKNYSLKDLLHLYLQEFEANFRLDDSS
GGKIRVMDTLETRGMQFDKVVIADFNETCVPNLKDCDLFLNSALRQSLNL
PTLLDKKNLQKHYYYQLFKNSKEVALSYIESETLKASNMLLELDLHTEPI
KDAYTLFETSPIKEYQEEEIKAAIPKDFSFSASSLNAFLTCKRRFYYHYI
KRFKETPKDESNSAVGSLIHELLKEAYEKDKNPHALEERLIWLSETRENV
TPKERLDTLVALKKIQAFYKKERERFNAEITILDLEKSFETIIQGVIFKG
RIDRIDKTADNEIILLDYKFKSDLKLDSMSEKQRKGLSPIEIAQISTDYQ
MAIYVRALKNLGYKEPIKAFFYDLRKGELLEEDELTLQAKMDHLEFSLIP
KLKQEIDFEKTLEVKDCEYCSFKDMCNR
>gid:20137  jhp0398  putative
MSLTVLLNPKSLEEFLGQEHLIGKDAPLFKALQSKHFPHAFFYGPPGVGK
TSLAQIIARSLKRPILSFNATDFKLEDLRLKLKNYQNTLLKPVVFIDETH
RLNKTQQEFLLPIMEKNRALILGASTQDPNYSLSHAIRSRSFTFELTPLK
KSDLDRLCDKALTLLKKQIEPGAKTYLLNNSAGDARALLNLLDLSAKIEN
PITLKTLQSLRPHSLNDGSYNDDTHYNLTSALIKSLRGSDENASMYYLTR
LVAGGENPEFIARRLVIFASEDIGNANPNALNLAASCLFSVKQIGYPEAR
IILSQCVIYLVCSPKFNTAYKAINQALDCVQKGLFYLIPKHLLPNVKDYL
YPHDYNGYVKQDYLEKPLNLVSSQGIGFEKTLLEWLDKIRN
>gid:20141  jhp0402  putative
MRQAFNKTRSTHSRTLLLDIDCVIPNIVRRLLSNKTLPKRFATYSLQEVG
VIFLTTQILSIMRKTRCSKTLFFITRGRESFRYQLCDHYKQKRHQFDEDF
RSLLKALKIALVEKYPLKKGAKIQGEHCFEYEADDIISFYKKKDPNNYVI
ASMDKDILYSNRGSHFNLKTNAFFNVSQKEAHFFAYYQCVVGDKGDNIKG
VKGIGGFNYKDFLNEDAKEHELWEQIIQAFKIKEDLSDSEAKEKALLNMR
LVNMHQMTHHGVIKLWEPEFKKAFFPKKPQRPDFKRIS
>gid:20156  jhp0418  putative
MEIILLIVAAVVLFYFYNTLKEYLKNPLNPKTKTEEYDLKNDPYLLVQSS
PLDKFKQTQIGAYMRLLKFLDIQKNALDNALRTLFIHELEQPLNSEQQNL
AKELLNEPVDKKENFESLCQEIADHTHGEYTKRLKLVEFLMLLAYADGIL
DSKEKELFLDVGAFLQIDNQDFNELYDNFEHFNSIEIPMSLEEAKNLFEI
QTHTTMQDLEKKALDLSAPYYHKMNDNKRYSEQDFISLKKIALASQLLEN
DLKDS
>gid:20171  jhp0433  putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MHANLFNQNASKKDIFLHNLRSNNGRYKRYVKAPLRYGGGKSLAVGLIVE
CIPNGVRRVISPFIGGGSVEIACATELGLEVLGFDIFDILVNFYQALLKD
KQALYDNLFSLEPNQETYSIIKQELKAHYKKECVLDPLILARDYYFNFNL
SYGPGFLGWMSKIYTDKQRYLNTLLKIKDFNAPSLKVECSSFEEVLIAYP
NDFFYLDPSYVLENSKMFKGIYPMRNFPIHHNGFKHEVLAHMLKRHKGPF
ILSYNDCELVRNAYKDFKILEPSWQYTMGQGETRMGKNRLERGDNNHVKQ
SHELLIIKE
>gid:20173  jhp0435  TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MYKVADIFCGAGGLSYGFSTHPYFELIWANDIDKDAILSYQANHKETQTI
LCDIAQLHCHNLPRVPIDILLGGPPCQSYSTLGKRKMDEKANLFKEYLRI
LDLVKPKIFVFENVVGLMSMQKGQLFQRICNAFKERGYILEHAILNALDY
GVPQVRERVILVGALKSFKQKFYFPKPIKTHFSLKDALGDLPPIQSGENG
DALGYLKNADNVFLEFVRNSKELSEHSSPKNNEKLIKIMQTLKDGQSKDD
LPESLRPKSGYINTYAKMWWEKPAPTITRNFSTPSSSRCIHPRDSRALSI
REGARLQSFPDNYKFYGSANAKRLQIGNAVPPLLSAALAHAVFDFLRGKN
V
>gid:20179  jhp0441  putative
MNYPNLPNSTLEITQQPEVKEITNELLKQLQNALHSNALFTEQVELSLKG
IVRILEVLLSLDFFKNANEIDSSLRNSIEWLSNAGESLKTKMKEYERFFS
EFNTSMHANEQEVTATLNANTENIKSEIKKLENQLIEETRMLLEQETQKS
VKAYNAMMGHQPQNSLKHGLKNFNNPLSKEG
>gid:20195  jhp0457  putative
MSYFKNIFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIK
SLDSVAVLLYEKESDCFVIVKQFRPAIYARNFYFKRDQDQTIDGYTYELC
AGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSATGLSGSLQT
LYYAEAHEGLKVSKGGGIDTEKIEVLFLERSKALDFIMDFQYAKTTGLSL
AILWHLKKFKNV
>gid:20287  jhp0549  ENDONUCLEASE III
MLDSFEILKALKSLDLLKNAPAWWWPNALKFEALLGAVLTQNTKFEAVLK
SLENLKNAFILENDDEINLKKIAYIEFSKLAECVRPSGFYNQKAKRLIDL
SKNILKDFQSFENFKQEVTKEWLLDQKGIGKESADAILCYVCAKEVMVVD
KYSYLFLKKLGIEIEDYDELQHFFEKGVQENLNSALALYENTISLAQLYA
RFHGKIVEFSKQKLELKL
>gid:20333  jhp0595  putative
MQHSRLQTLRSLYMERLLGETYTDISLIKPQNKPLNKQVHEGIENCNLCK
RHQHSKPITGLFNPTSKLAFITLTPMLDSQLHFLNNLKAAMLESIIQKVF
NYPLKDCSILSLLKCDSNSLNLEEEINACLPHLTWQLDNSAPKVIIVFGE
VLPKRLLNLSKEESFGRIVSLKTKHFLSTHALEDMLKNPTLKKEALAHFK
IALQFLNQS
>gid:20347  jhp0609  putative
MEKLPKKRVSKTKSQKLIHSLTTQKNRAFLKKISANEMLLELEKGAFKKN
EAYFISDEEDKNYVLVPDNVISLLAENARKAFEARLRAELERDIITQAPI
DFEDVREVSLQLLENLRQKDGNLPNINTLNFVKQIKKEHPNLFFNFDNMF
KQPPFNENNFENFDNSDEENF
>gid:20355  jhp0617  INTEGRASE-RECOMBINASE PROTEIN (XERCD FAMILY)
MKHPLEELKDPVENLLLWIGRFLRYKCTSLSNSQVKDQNKVFECLNEFNH
ASINSNQLEKVCKKARNAGLLGINTYALPLLKFYEYAQKLSLKSLKNIDE
VMLAEFLSIYTGGLSLATKKNYRIALLGLFSYIDKQNQDKNEKSYIYNIT
LKNISGVNQSAGNKLPTHLNNEELEKFLESIDKIEMSAKVRARNRLLIKI
IVFTGMRSNEALQLKIKDFTLENGCYTILIKGKGDKYRAVMLKAFHIESL
LKEWLTERELYPVKNDLLFCNQKGSALTQAYLYKQVERIINFAGLRREKN
GAHMLRHSFATLLYQKRHDLILVQEALGHASLNTSRIYTHFDKDRLKEAA
SIWEEN
>gid:20366  jhp0628  putative
MRKGRVMLCVFDIETIPNISLCKEHFQLKEDDALKICEWSFEKQKEKSGS
EFLPLYLHEIISIAAVIGDDYGQFIKVGNFGQKHENKEDFASEKELLEDF
FKYFNEKQPRLISFNGRGFDIPLLTLKALKYNLTLDAFYSQENKWENYRA
RYSEQFHLDLMDSLSHYGSVRGLNLNGVCSMTNIPGKFDVSGDLVHAIYY
NPHLSQKEKKGIIDGYCQSDVLNTYWLFLKYEVLKGALNKEQYLGLLNDF
LAKFPKEKSYSSVFINALEKEIREFA
>gid:20367  jhp0629  putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MPIISKNKPYSFIKNLEFYTIKRIKDMGSHPSQDHLNELLELFKQDLSID
LKREIASSIGRQLDDDIIYNFLKQEAFKEHYMEVIYQFLRTALYKSKDMR
FAKLCDDLLQHYQNENMQKMKQYYDYRHTKKPPLKIIENIIKKPSLLVGD
NAQTLNKIAPSSVNLIFTSPPYYNARIYSDYKNYKDYLSAMSQSLKACFR
VLEEGRFIIINVSPIITKRAGREFESVRYPIHFDFHQILIDNGFYFVDEI
LWIKPDFSVPNRIGGYLQNKKPLGYKPNCVSESLLVYRKKAPFLLDKNIK
IAEKRLKPIKQNHTLFGKKELPIETTNCWYITPKSSKDHPAVFPESLCER
VLNYYSFENEVVCDPFAGSGTFGMVAKSMGRIPLLCEQHPKYAQNLIKLG
FKEI
>gid:20395  jhp0657  putative
MRRSLAFCLLALLGLQVLGARDFSQLKNEELLKLAGTLPSNEAIDYRMEV
SKRLKALSAEDAKKFRANFSRIARKNLSKMSEEDFKKMREEVRKELEEKT
KGLSAEEIKAKGLNVSVCSGDTRKVWCRAVKKKDEHCSPK
>gid:20456  jhp0718  putative
MPYALRKRFFKRLLLFFLIVCMINLHAKSYLFSPLPPAHQQIIKTEPCSL
ECLKDLMLQNQIFSFVSQYDDNNQDESLKTYYKDILNKLNPVFIASQTPA
KESYEPKIELAILLPKKVVGRYAILVMNTLLAYLNTRNNDFNIQVFDSDE
ESPEKLEETYKEIEKEKFPFIIALLTKEGVENLLQNTTINTPTYVPTVNK
TQLENHTELSLSERLYFGGIDYKEQLGMLATFISPNSPVIEYDDDGLIGE
RLRQITESLNVEVKHQENISYKQATSFSKNFRKHDAFFKNSTLILNTPTT
KSGLILSQIGLLEYKPLKILSTQINFNPSLLLLTQPKDRKNLFIVNALQN
SDETLIEYASLLESDLRHDWVNYSSAIGLEMFLNTLDPHFKKSFQESLED
NQVRYHNQIYQALGYSFEPIKNESETKKE
>gid:20484  jhp0746  putative
MPNHQPVKKFKIIGGACKGLGLNLPNISSTRPTKAIVRESFFNTLQTEIN
GAHFIEVFSGSASMGLEALSRGAKSAVFFEQNKNAYATLLENISLFKNRL
KKEIEIQTFLDDAFKLLPTLRLKNGVLNIIYLDPPFETSGFLGIYEKCFH
ALERLLNRSHSKNLFVVFEHESLHEMPKSLATLAIIKQKKFGKTTLTYFQ
>gid:20494  jhp0756  putative TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MTAKIKPNIQSLLNNFYVDSCVNFMQHKLQNESIDMILTSPPYDNLRNYQ
GYTFAFENIANEIFRVIKKGGVVVWIVGDKIKNGNKSLTSFRQALYFQQI
GFNMHDVMIYAKKNTPFMRSNAYTNAYEYMFVLSKGKPKTFNPLKEPTAR
NGMEMLVTNKGADAKNNKILKELKKEKTKNNIWHYAVGLGGSTNDKIAFN
HPAIFPEQLALDHILSWSNERDIVFDPMCGSGTTCKMAFLHNRNFIGVDI
SKEYIQIAQKRLQQYQQGLFVC
>gid:20500  jhp0762  putative
MRFLNNKHREKGLKAEEEACGFLKTLGFEMIERNFFSQFGEIDIIALKKG
VLHFIEVKSGENFDPIYAITPSKLKKMIKTIRCYLSQKDPNSDFCIDALI
VKNGKFELLENITF
>gid:20512  jhp0774  DNA-BINDING PROTEIN HU
MNKAEFIDLVKKAGKYNSKREAEEAINAFTLAVETALSKGESVELVGFGK
FETAEQKGKEGKVPGSDKTYKTEDKRVPKFKPGKILKQKVEEGK
>gid:20540  jhp0802  putative
MQDELNAYQQEIEDTREVLKKIRLELKQVQEILRKKKSVLKGLKQEICQK
KLEKENFRSNKETQNTEEDVIFPKALEEVEVYAKDNQVIMAKPCKRLFNE
GLYLQYRSVLRENRLLKNHLSKKDFENSLLKIELRDLHKEIKLYQVQNLL
KDK
>gid:20570  jhp0832  putative
MPNTTNKDYTKYSQKQLFNFLNSIKAKQKRALEKLKEIQTQKQRIKKALQ
FKALHLTENGYTIEEEREILARAKDTKNRLCFKSIEDFKKHCENL
>gid:20585  jhp0847  putative ATP-DEPENDENT HELICASE
MLETLQLNPEQLKAASALQGHNLIIASAGTGKTSTIVGRILHLLNNGIKP
EEILLLTFTNKASNEMIARVAKYSKLSSKIEAGTFHAVAYRYLKEHYPNL
SLKQPKELRKLLESIVDTKNALTDEDKKPYTSQHLYALYSLYTNALKQED
FSAWLSSKNPEHAPYAAFYENILEEFENTKKKHDYIDYNDLLLLFKKAML
ERSSPYKEVLCDEFQDTNPLQESILDAINPPSLFCVGDYDQSIYAFNGAD
ISIISNFTQKYKNARVFTLTKNYRSSKEILDLANQVIQHNQRIYPKNLEV
VKSGKFNKPTLLNYNDNIAQCQDIAKRIVMRKNFKEVAVIFRNNASADQL
EAALRSHNVPSKRKGSASFFESKEVALALDICALLFNPKDIMAAIHILSY
ISDIGSNTAKDIHEALMLLGNGDLKSALIQPNQEAKIYTKKKEITSMGLF
EEIFALENSSRFNSVIDKAFHSHPVLMHPKISLNGAKMLSDFFILYTKAP
IHSPSALIKHILESAFFQTFKTRLLKERSKNKDGSYNEFKKLQAQKRFNE
KMDLLSSLAKNYQNLGRFLNGTLIGSSEATQGEGVNLLSVHASKGLEFKD
VYIIDLMEGRFPNHKLMNTGGGIEEERRLFYVAITRAKENLWLSYAKNEL
RENAKPKEHKPSVFLYEAGLLKLDSK
>gid:20634  jhp0896  putative
MRLNEVIGLFKESVDKVFDRVSAFTWEKYKAKNEDEEDDEANYREFEKIK
KMALYFRDYCMFCLDWYELSQEKIQEEYRDCIDYDNKLLQLHYSLENLQT
LRELKEEADNNYQESLNDEKLQNNLREWRDLKNTPEEENYREFEEIKKMV
LYFRDWCMFRLDWYKLRQEEIQKHRDLMDNDNRLLQLDYSLKNLSILKRF
KEINEKNYQDHLNNEKLQNDLREWRRSKRR
>gid:20664  jhp0926  putative
MNNLNNLSDEQINGMIDYLQNILMEMGGLKQEPPQNSNYSHNLEETNAQR
TQSTQENSESKEALLKAKSGQINYVKTRIIQGLKEKNSPFWDKPEIVANK
ERGHNALNGEPYCNLNDMILDMEKNRLGFQSNAWVSLEEAKMLGASKEER
DTIFKATQNKEISPVRLMFIKNKEPVPLVDNNGELVIDKNTKKPKHKQFA
VIENGQKVFKPAYRDIEPQAQFKFVYNIEMFPSINKEKIKPLNLDKLSNY
AYKTRLFHQKDYLQERRDKTNIIYEDLHRDLSPDNRNEALLERMRSYTLL
KNEKYNQLANETKQQTQTRSQSQSYYQKKNKASSGIER
>gid:20666  jhp0928  putative
MERSSDEDLSHQDPSLFIESREQGGTRGVYRSSDQQAVSEESHRERDRIH
EHVSRGDGVSARADARANSNGASSPASRMENGARSEEKGDNPSDERGIPQ
TPQSPSHQQNSSRDLGLSLSREQPGQTGRLRLFDHGQMGSLFPTDHENQR
KRSDNELDRRSDKANENGDKSPRQNGSANQESARSERYGIAQGSSNQSVL
LPAQSRLHHAGLSAQNGLRDLEENRDQEGRLLSNLDNLESLLNAIRNNTI
ASEPDFRSRLLEAIQNNDPLKDSIVGAQLLKDPTTKIFYDKFQLKISPKK
VLEILENRLKKSIETTNETLNAFNVLDSQAIDLNAISNSVGLNPTQESKI
TDNSVELNNAQEQTAQEQTTQEQTTQEQTTQEQTTQEQTTQEQTTQEQTT
QEQDTQENAPTTIKQETPITPAIPLNPKIDFKPSEEVLIKGAKTRYKANI
KAIELLKELQAKQEILKGDYYATLKEQEILAQFSGWGGLESYFKKAQHPE
EFKELNALLTKDEFRRAYLSARDAYYTPKLVIDSIYQGLDQLGFNNDNHP
KEIFEPSLGTGKFIAHAPSDKNYRFIGTELDPISANLSKFLYPNQVIQNT
ALENYQFYQEYDAFVGNPPYGNHKIYSSNDKELSNESIHNYFLGKAIKEL
KDDGIGAFVVSSWFMDAKNPKMREHIAKNATFLGAIRLPNSVFKATGAEV
TSDIVFFKKGVEKATNQSFTKAMPYYDKILNSLDDDTLFALQNNRFDSFI
PSDQLKIVNAVANHFGFKQEKLQRWYEKIDTANFGYSTQDYKIIKDFIDK
VGKNSINLNEQTLNEYFIHHPENILGHLSLEKTRYRFETNGEQIYKYDLQ
ALEDESLDLSQALKQAIEKLPKDVYQYHKTTLKTDVLIIDSSNERYQEVQ
KLIKNLERRELVKWDNLYFQLEQNNEMGIFLKPTKINSKVQDSRLKAYFK
IKDALNDLTSAELNPLSSDLELENKRAKLNLVYDEFVKKFGYLNENKNRK
DIRQDLYGAKVLGLEKDFEKEITPRSAKMQNIEPRQAQAKKAQIFFERTL
NPKKELIITNAKEALIASINQKGGLDLHFIRDHFTTQSLETTIKELLEQK
LIYKDHKDNGGYILANDYLSGNVKRKLKEVKEAINQGVEGLEANVKDLEL
IIPKDLKATEIMANINSPWIPTQYLEEFLMELSANHYEKQYGDKMTDYQL
SNLKEDIKIEHLSGAYEVFVRNNELNELYGIRHKDKPHSYKVPFESLLNK
VLNNKDLSVKYAQVDPNDPKKEIFITDEEQSNLARQKAEELKEAFKDWIY
KDYSRRTHLEQIYNDTFNNSVLKTYDGSQLELEGFNYHISLRPHQKNAIF
RTIQDRAVCLDHQVGAGKTLCAIASCMEQKRMGLVNKTLIAVPNHLTKQW
GDEFYKAYPNANVLVVDSKDTTEKERELLFNQIANNNYDAVVIAHTHLEL
LSNPRGIIEELKEEELVNAEKNFERQELAYKNNPRETKKPNERAFKNKLD
KIRAKYDAILEKQGSHIDISQMGIDNLIVDEAHLFKNLAFETSMEKIAGL
GNQQGSNRARDLFIKTRYLHQNDKKIMFLTGTPIANSLSEMYHLQRYLTP
DVLKERGLEFFDDWAKTYGEVVNDFELDTSAQSYKMVNRFSKFSDVQGLS
TMYRAFADIVSNDDILKHNPHFVPKVYGDKPINVVVKRSEEVAQFIGVAL
ENGKYNEGSIIDRMQKCEGKKSQKGQDNILSCTTDARKVALDYRLIDPNA
KVEKEFSKSYAMAKNIYENYLETHATKGTQLGFIGLSTPKTHSQKVSLEA
LDNAHETENKNPLDKAQELLESLSSYDEKGNLIAPSKKELENELKEKEAK
SVNLDEEIAKGCSFDVYSDVLRHLVQMGIPQNEIAFIHDAKTEEQKQDLF
KKLNRGGVRVLLGSPAKMGVGTNVQERLVAMHELDCPWRPDELLQMEGRG
IRQGNILHQNDPENFRMKIYRYATEKTYDSRMWQIIETKSKGIEQFRNAH
KLGLNELEDFNMGSSNASEMKAEATGNPLIIEEVKLRAEIKSEESKYKAF
NKEHYFNEESLKNNASKLDYLKQELKDLETLQRSVIIPTHTEIKLYDLKN
EESKDYELIKVKEVEPLKENASMSEELTHKKLKEQNKQIAEQNKEKLDAI
KKQFASNLNTLFVNEEEDYKLLEYKGFVVNAYKTKYQVEFSLSPKDIPNI
AYSLAIWFIKTILSTCLALIISALRSSLMGF
>gid:20679  jhp0941  INTEGRASE/RECOMBINASE (XERCD FAMILY)
MHEQCSISFVGGQGAKRLLYILYKLAFNAKSNKIALDRHYAKMFLQVVAR
TLIKNVNILEEQGFIEVIKGKQRYLYVYLKDYRELECLVKSKMAKYVMYL
RQFFDYLDRKRRYGFDFTLKNLAFAKTKESLPRHLNDKDLKSFLKTLLDY
KPATSFEKRNKCILLIVILGGLRKCEVLNIELKHIQVEEQNYSILIQGKG
RKERKAYIKKSLLEPSLNAWISDDYRLKYFNGAYLFKKDKQKSQNSLTLY
NFIPLIFKLAQIKHYKQYGTGLHLFRHSFATLIYQETQDLVLTSRALGHS
SLLSTKIYIHTTQEHNKKVALVFDSLIENKK
>gid:20689  jhp0951  INTEGRASE/RECOMBINASE (XERCD FAMILY)
MSDCKMSRVSRELFDNIKSFLHYKFKTMIRIQSVNDLELILKWQDRVLEC
QSFIALKELNHKLYNQGVRHTIMMQGLFLFFEYFDNRIKLKSLRNLAEEQ
VIDFLFGLVKNRKPSSMAKYVMVLRQFFDYLDRKRNYSFDFELKNLSFAK
KEMYLPKHLNKNDFKAFIQALLKYHPKTSFEKRNQCILLLIALGGLRKFE
ALDLELKNIALENNHYRLLIKGKNNKERYAYIEKEFLQVPLNAWLSDTKR
LKSFKGRFVFKKAKNNTTQKTCSLKGFIAKIFKLSNIDVKSYGLGLHLFR
HSFATFIYDETQDLVLTSRALGHSSLLSTKIYIHTTQEHNKKVTLVLKGW
LKNEKSE
>gid:20780  jhp1042  putative
MNYPNLPNSALEISEQPEVKEITNELLKQLQNALRSNAHFSEQVELSLKC
IVRILEVLLSLDFFKNANEIDSSLRNSIEWLTNAGESLKLKMKEYERFFS
EFNTSMHANEQEVTNTLNANAENIKSEIKKLENQLIETTTRLLTSYQIFL
NQARDNANNQITKNKTQSLEAITQAKNNANNEISNNQTQAITNITEAKTN
ANNEISNNQTQAITNINEAKESATTQINANKQEAINNITQEKTQATSEIT
EAKKTDHYQNIDFFEFE
>gid:20782  jhp1044  putative
MKLPKALNEATAGAALKYHIKRALERSHLISDFSKNLELSAKNSKFTNNT
LKIIEELNNGVKQASEEIKEKAFDFSNEKLTNEQIKELLNNAKIPTSGRD
AITFGTNNLNPEMVEFLHKNNKKMIIEKASNKELELLKDANFKHPENIRA
SLDHDAIAHILKRHGVNSVNVKNGESPITYEDIANYRYIVNNADAILRTL
DNEDKEVISAFKQINGYAVVVEQAINKKNELVLKTMYKSNGDYKDNNAYK
KFSSTLTLNADAKVNHGLSSHSGATENLTQKPLTSQEDLLKDTENLNETT
PKPTHLSPLELANAEKLAKLESEQLQSEQEFLKAKEQELKRKEALKKKLE
HERGNAGHIESQTKIEVGEDIPTQVQAQIPKSRVRLNEREIYDLDYAIVK
AKDLKPSFTTGGTQKRTDMNEEQIKSIAENFDPKKIFGSGGFEDLPIILH
DGQVIAGNHRIQGMLNFTPKSRYIYNKAIKEYYHIDLEPDELLVRVPNKR
LDNTEINNLAASSNQGRFNSESDHAIAVLSHYEAKLKELEKKLDADSIYS
LKNIVANNLNFDKATHPNVGDSNLALLMFNMPRTKTQGIELLNCWQKAFS
NDIKSYEKVKKMFVDNAGSFHNLIHDMNFPNVSLNAYLSDIMDRSFANLK
NYQSTSESLKDLSEKFYKTSSLDMFEKSDQRASDISEILGGAIARFARFD
DPSKALFEALKSDNIKKGLKEFKIADVTKDMFDPKSKEFKDIDIYDFTHY
LLMVNREPNENNPVLKRLIQAVKDMQKEKKKGIKKPKLETPSEWGHHYSE
FKGDGLGAINKLLKTKKGFVAGAFYKEGLGDIDLVWGNKDYGLEHILKRR
EDQALNNGINEAEAKDYAISVIKTIPEVIDKGVKVERNGRVAIEYQNIRV
GLKDNWKGEKSPNHWVITGYEKRLEDSESLYTSPPITKGETLPLNSNKPD
PTTNAIKTQEPLYPLELANAEKLAKLETEKAFKAEAVKKLDFNEIKKLID
ESPRTGSSMPILGMQNLNAEAVEYIQKNHKRIAVEKIEPSFAKDLKLKYP
DDARAVMDYQAINHILKEHKNLAYEDIANYRELSKQANETLKLKDNQNRP
VVASFNQINGFFVVVEQVSNAKNELMLKTMYKARGNYKDSLIYKKTLAKS
QNSN
>gid:20786  jhp1048  putative
MVVLHSHLENALNQLKELIDLTERDIRDIKLAKHAEIFERNHQKQLAIQA
FEQEKTNIDAQMLSLKNQFPNKEMSELLDEKTSDFLNQMREFLLVLKEKN
LIYSRMAFAVSEFYSSLIQQIIPHDTCDYKGSRHVGSHFLRVQA
>gid:20788  jhp1050  TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MGILTFMDFCSGIGGGRLGLERCHLKCVGHAEINHEAIRTYELFFKDTHN
FGDLMRINPNDLPNFDVLVSGFPCQAFSINGKRKGLEDERGTIIYGLIRI
LKVKQPKCFLLENVKGLIHHKQQETFKTIIKALQEAGYTTHYQILNSADF
QLAQKRERLYIVGFRKDLKRPFNFPLGLANDYCFKDFLDADNECYLDVSN
ATFQRYLRNPYNHNRVFLEDILTLENAVLDTRQSDLRLYFNVFPTLRTSR
HGLFYTQKGKIKRLNAVESLLLQGFPRDLIAKIKNNPNFKESHLLSQAGN
AMSVNVIAAIAKQMLKAFNNE
>gid:20906  jhp1168  putative
MYRKDLDNYLKQRLPKAVFLYGEFDFFIHYYIQTISALFKGNNPDTETSL
FYASDYEKSQIATLLEQDSLFGGSSLVILKLDFALHKKFKENDINPFLKA
LERPSHNRLIIGLYNAKSDTTKYKYTSEIIVKFFQKSPLKDEAICVRFFT
PKAWESLKFLQERANFLHLDISGHLLNALFEINNEDLSVSFNDLDKLAVL
NAPITLEDIQELSSNAGDMDLQKLILGLFLKKSVLDIYDYLLKEGKKDAD
ILRGLERYFYQLFLFFAHIKTTGLMDAKEVLGYAPPKEIVENYAKNALRL
KEAGYKRVFEIFRLWHLQSMQGQKELGFLYLTPIQKIINP
>gid:21009  jhp1271  TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MDFLKENLNTIIEGDCLEKLKDFPNKSVDFIFADPPYFMQTEGELKRFEG
TKFQGVEDHWDKFGSFEEYDTFCLGWLKECQRILKDNGSICVIGSFQNIF
RIGFHLQNLGFWILNDIVWHKSNPVPNFAGKRLCNAHETLIWCAKHKNSK
VTFNYKTMKYLNNDKQEKSVWQIPICMGNERLKDVQGKKVHSTQKPEALL
KKIILSATKPKDIILDPFFGTGTTGAVAKSMDRYFIGIEKDSFYIKEAAK
RLNSTRDKSDFITNLELETKPPKIPMSLLISKQLLKIGDFLYSSNKEKIC
QVLENGQVRDNENYETSIHKMSAKYLNKTNHNGWKFFYAYYQNQFLLLDE
LRYICQRDS
>gid:21022  jhp1284  TYPE II DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MKSETIYKDFCFELEGILFNFDTSLLESKKYNEKVDLIFDLKSIVKDKDT
KTNTLNFSVTLSSKGTQTKTNEILKECSNQGVKLDEEILKKAFVKFKKQG
SMDYFIHKNAQGFLKEQLDLYLFEYLFKEMTAFDHKRLNGINTIKEVALE
VIALVSEFENELCKIWNKPRLVLNSHFIVSLDKLKAKNYDLSKITNHPNY
PKQVKEWQDLNLEIADNLLGNEFLPLDTLYFKDLEEEVKSLFNENEINGT
LIKSENYQALNSLKNRYKEAIDCIYIDPPFNTGSDFAYIDRFQDSTWLSL
MHNRLQLAYDFLSPQGNFYLHLDYRANYLGRMLLNDIFSKENFRNEIIWH
FRTYQGQIQSNFPRKHDSLLWYSKNCNVNNFFKITYSDNYKDTVDYRRWR
EFIVDNNKIVYPNYPKADSRFDGYLKRYLQSTKEPKNGDIIATINGYVID
DVWTDIQAIDPKKADERLQGTLTQKPEKLLERIIKASSNENSIVCDFFAG
SGTTCAVAHKLKRKYIGVEMGEHFESVILPRLKKVIGGFKSGALKEFNGG
GVIKVYELESYEEILRKIKYEDNDKPLAYEEQYSDLVERKNESYTLNIEA
LENMGVDIKETLENLHGVGVEFFNEKVVKFKGNDKEVSILKALKEALIW
>gid:21041  jhp1303  putative
MPKKELLKMSKKRIFKDFLEEVKRHRPIIFYTDNDCDGMLAGSVLMSMCY
RLGIKDFFFFSPLRNAHGYGFTDLALNDLLSQLCIFNPKTNQLVRLDYIK
NQFQKTPLLFSADLGADLVSNIELQKILLERFEQCIITDHHKSFEVDWID
KNKIAYINLNDEKDANYYSGAFTSALVFSQIFQIQTTPLEEELIAITLLS
DRIDLDNGDNLDMVLNLAQPKHDRIECFFKDKDLSLAQDDLDEISNLYGF
NCINYINALSRLSGAREFKGCYNSYLHYLVLKHFNPISDPRLSVFNVKEF
KRYNDIKKKMVKESEENAQIFSCNKILVAILDESCSIKVGVSGLVANNFL
KKYPFNHSLCIYKDSKDGYSGSARGDGTFLSQIKTIPLIQAGGHEEAFGL
SFAKKRILKK
>gid:21042  jhp1304  putative
MLLDYDFLLLLNDESGKPTRYYYLLQDFEKDFVASEVAQNRARQFVKEII
GSKKASKTKNSAIKVSHTKASAIGSETIGSCDLKKACEKIKSGLPFGIIS
AFKPFKDAFYRDFNHNEQKLLIGAAKSGCIQSSADKLAQLKTRLIYWQDK
SVKVDWDKPILIKDFFKGNNYLYRRLCFLLGKHFMDRFLKNNAKASVKDF
MSSKEFVNKYRYTPKQNTERAKKLQSYLESKRDFIGFVQTLNSLKDSPQD
PFLPNEEISFLVFANEPTIIFNLRDYLLVLAQIFNQQAICYCESKCPIEL
INASPGKGL
>gid:21113  jhp1375  putative
MQDELFETEKAPQKNAKNAKNAPKKSFEEHVHSLERVIDRLNDPNLSLKD
GMDLYKTAMQELFLAQKLLENAYSEYEKLQTPNKKA
>gid:21130  jhp1392  putative
MSSVQILSNFNYPISKVINEGLRNSLDTHIAVAFLKYSGVEIIQDVLINF
LEKGAEFEIIVGLDFKTTDSKSIRFFLDLNKTYKKLKFYCYGDKENNKTD
IVFHPKIYMFDNGKEKTSIIGSANLTKGGLENNFEVNTIFTEKEPLYYSQ
LNAIYNSIKYADSLFTPNEEYLESYDEVFSAIIKNEQKVSKDKSIQEKIK
KIEKQEKLLPGTIPSIKAMIVEFIFACEKKGVKQVALQDIYQALEERIKK
EEWGYKYKSDTFKNSIRGELNHHQKDSHSKHCFGLFERLQKGFYALTPKG
RSYKGR
>gid:21176  jhp1438  DNA POLYMERASE III
MIKISLNSNKRAWMWWFQGVIFLNPKIVSWLLKAYRMSDNLLHKDIQALI
ARLKHQDLNLSALEKSLSRLIHDEINLEYLKACGLNFVETSENLITLKNL
KTPLKDEVFSFVDLETTGSCPLKHEILEIGAVQVKGGEIINRFETLVKVK
SVPDYIAELTGITYEDTLNAPSAHEALQELRIFLGNSVFVAHNANFDYNF
LGRYFVEKLHCPLLNLKLCTLDLSKRAILSMRYSLSFLKELLGFGIEVSH
RAYADALASYKLFEICLLNLPSYIKTTMDLIDFSKCANTLIKRPPRAKYQ
EIPSPFPLFERTKGLLDIMKATS
>gid:21219  jhp1481  putative
MFIDTHCHLDHKDYENDLDEVLKESLEKGVTQCVIPGADMKDLNRAVGIS
EKFEGVFFAIGAHPYDVESFDEGLFEKFVSHQKCVAIGECGLDYYRLPEL
SERENYKSKQKEIFTKQIEFSIQHNKPLIIHIREASFDSLNLLKSYPKAF
GVLHCFNADSMLLELSDRFYYGIGGVSTFKNAKRLVEILPKIPKNRLLLE
TDSPYLTPHPFRGTRNSPTYIPLIAQKIAEIIHIETEELASLSTHNAQTL
FNFP
>gid:20296  lig  DNA LIGASE
MIKSQKEYLERIEYLNTLSHHYYNLDEPIVSDAVYDELYQELKAYEEENP
NRIQANSPTQKVGATATNEFSKNPHLMRMWSLDDVFNQNELRAWLQRILK
VYPNASFVCSPKLDGVSLNLLYQHGKLISATTRGNGLEGELVTNNAKHIA
NIPHSIAYNGEIEIRGEVIISKEDFDALNKERLNANEPLFANPRNAASGS
LRQLDSNITKKRKLQFIPWGVGKHSLHFLRFKECLDFIVSLGFSAIQYLS
LNKNHQEIEENYHTLIREREGFFALLDGMVIVVDELDIQKELGYTQKSPK
FACAYKFPALEKHTKIVGVINQVGRSGAITPVALLEPVEIAGAMVNRATL
HNYSEIEKKNIMLNDRVVVIRSGDVIPKIIKPLESYRDGSQQKIMRPKVC
PICSHELLCEEIFTYCQNLNCPARLKESLIHFASKDALNIQGLGDKVIEQ
LFEEKLIFNALDLYALKLEDLMRLDKFKIKKAQNLLDAIQKSKNPPLWRL
INALGIEHIGKGASKTLAKYGLNVLEKSEAEFLEMEGFGVEMAHSLVNFY
ASNQEFIRSLFELLNPKSSDTAEEKQKSSSVFSDKTIVLTGTLSKPRQEY
AQMLENLGAKIASSVSAKTDFLIAGENAGSKLALAQKHGVSVLNEEELLK
RLKELD
>gid:21196  mfd  TRANSCRIPTION-REPAIR COUPLING FACTOR
MIQSSLYRALNKGFDYQILACKDFKESELAKEVISYFKPNIKAVLFPELR
AKKNDDLRSFFEEFLQLLGGLREFYQALENKQETIIIAPISALLHPLPKK
ELLESFKITLLEKYNLKDLKDKLFYYGYEILDLVEVEGEASFRGDIVDIY
IPNSKAYRLSFFDAECESIKELDPATQMSLKEDLLEIEIPPTLFSLDEPS
YKDLKTKVEQSPLNSFSKDLTSFGLWFLGEKANDLLGVYQSIISPRALEE
IQELASLNELDDERFKFLKVLENAQGYEDLEIHVHALEGFIALHSNRKIT
LLAPNKTILDNSISVLDAGNMECVIAPFVLNFKTPDRIFISLNSFERKKK
RQKSKLALNELNAGEWVVHDDYGVGVFSQLIQHSVLGSKRDFLEIAYLGE
DKLLLPVENLHLIARYVVQSDSVPVKDRLGKGSFLKLKAKVRAKLLEIAG
KIIELAAERNLILGKKMDTHLAELEIFKSHAGFEYTSDQEKAIAEISRDL
SSHRVMDRLLSGDVGFGKTEVAMHAIFCAFLNGFQSALVVPTTLLAHQHF
ETLKARFENFGVKVARLDRYIKTSEKSKLLKAVELGLVDVLIGTHAILGT
KFKNLGLMVVDEEHKFGVKQKEALKELSKSVHFLSMSATPIPRTLNMALS
QIKGISSLKTPPTDRKPSRTFLKEKNDELLKEIIYRELRRNGQIFYIHNH
IASISKVKTKLEDLIPKLKIAILHSQINANESEEIMLEFAKGNYQVLLCT
SIVESGIHLPNANTIIIDNAQNFGLADLHQLRGRVGRGKKEGFCYFLIED
QKSLNEQALKRLLALEKNSYLGSGESIAYHDLEIRGGGNLLGQDQSGHIK
NIGYALYTRMLEDAIYELSGGKKRLEKSVEIQLGVSAFLNPELIASDSLR
LDLYRRLSLCENVDEVGQIHEEIEDRFGKMDDLSAQFLQIITLKILANQL
GILKLSNFNQNITLTYSDEKKESLKAPSKDDNDILETLLKHLHAQISLKR
R
>gid:21034  mod_1  putative TYPE III DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MLKTPLKTLLDILINHFTKECLITLITEHDEKLLIFMLEHENANDYKKHF
FKTIANSLVFNEKALLECLEIKELEKSFTRFKNKIGLFSQEGFIKSSELV
VLNFPFKDNVLLGNAKDNSTKSNELFYHEILHKNEIDTLLSPKALCRFEM
HGQGDLQSALKDENTNYLIKGNNLIALHSLKKKFAKKVKCIYIDPPYNTG
NDSFNYNDNFNHSSWLVFMKNRLEAAREFLSDDGVIFVQCDDNEQAYLKV
LMDEIFGRENFIACFVWEKTSNSLSRIRIKTEYILCYEQTKFGLIFNGDM
AEEGQDFPILNEVNVKRTLQFPPNSIYFKTFKGVIKPTKFNKMELIDDLR
IVNKTNSNMVRINAKFKWTQDKLDDEIKEGTTFVIKSDEFSMRYIRKGDR
EVKASNVFNAECGVTTNIKATSEIKVLFANSNTDLFSTPKPEALLQRILE
ISTNENDLVLDFFAGSGTTCAVAHKMKRRYIGIEQMDYIETITKERLKKV
IEGEQGGISKKCDFKGGGSFVYAELKEVNSGIKKQILNAKSASECLKIFN
DLNKRILKRADNKMDAIHSEEFQNLDLNEQKRKCCASLDANEDYLNLGDI
DEDAWEIDEITKKYNEIFYS
>gid:21149  mod_2  TYPE III DNA MODIFICATION ENZYME (METHYLTRANSFERASE)
MQNKEIGGEKSVNEKNVEVFNRYFPGCLSIENDNKLTLDTGKLKALLGDF
SEIKEEGYGLDFVGKKIALNQAFKKNHKILKPLNESTSKHILIKGDNLDA
LKILKQSYSEKIKMIYIDPPYNTKNDNFIYGDDFSQSNEEVLKTLDYSKE
KLDYIKNLFGSKCHSGWLSFMYPRLLLAKDLLKQDGVIFISIDDNECAQL
KLLCDEIFGEGNFIETFLWNKTQTPPSASNKTRKTHEFILCYQKNKDNKK
MIAKFSDGGDAPLWNETNSERILTFPANYVFSNLKNGIYKKGIRDKIEVL
DDIEVYNGKIINSFRLKGHFRWKQETLNEEIINNVFLVVKSDKFSIRYCR
EGERIVMPTNEISKKDNVGTNETASKELFKLFDNNKIFNFNKPVSLIKYL
ISICSNNTNEGDIILDFFAGSGTTAHAVLESNKSDYQKLSEGGGLFNGLN
AVFKERRFILVQLDEKIEKNKSAYDFCLNTLKSTSPSIFDITEERIKRAG
AKIKEACAHLDVGFRAFEIIDDETHTNDKNLGEAHQKDLFAYSNLDRMET
QTILIKLLGCEGLELTTPIICLIENALYLALNTAFIVGDIEMSEVLENLK
DKGVEKISVYMPAISNDRLCLELGSNLLDLKLESGDLKIRG
>gid:20303  mutS  DNA MISMATCH REPAIR PROTEIN
MSDASKRSLNPTLMMNNNNTLPKPLEESLDLKEFIALFKTFFAKERGSIA
LENDLKQAFTYLNEVDAIGLPAPKSVKESDLIVVKLTKLGTLHLDEIYEI
VKRLRYIVVLQNAFKPFTHLKFHERLNAIILPPFFNDLILLLDDEGQIKQ
GANATLDALNESLNRLKKESTKIIHHYAHSKELAPYLVDTQSHLKHGYEC
LLLKSGFSSAIKGVVLERSANGYFYLLPESAQKIAQKIAQIGNEIDCCIV
EMCQTLSRSLQKHLLFLKFLFKEFDFLDSLQARLNFAKAYNLEFVMPSFT
QKKMILENFSHPILKEPKPLNLKFEKSMLAVTGVNAGGKTMLLKSLLSAA
FLSKHLIPMKINAHHSTIPYFREIHAIINDPQNSANNISTFAGRMKQFSA
LLSKENMLLGVDEIELGTDADEASSLYKTLLEKLLKQNNQIVITTHHKRL
SVLMAENKEVELLAALYDEEKERPTYTFLKGVIGKSYAFETALRYGVPPF
LIEKAKAFYGEDKEKLNVLIENSSTLERELKQKNEHLENALKEQEDLKNA
WLLEMEKQKEIFHHKKLELEKSYQQALNILKSEVASKDTSSMHKEIHKAS
EILNKHKTDQEIPQIITSFQINEKARYKNESVLVIQILDKGYYLVETELG
MRLKAHGSWLKQIQKPPKNKFKPPKTIVPKPKEASLRLDLRGQRSEEALD
LLDAFLNDALLGGFEEVLICHGKGSGILEKFVKEFLKNHPKVVSFSDAPI
NLGGSGVKIVKL
>gid:20887  mutT  putative DGTP PYROPHOSPHOHYDROLASE
MLHKKYRPNVAAIIMSPDYPNTCEVFIAERIDIEGAWQFPQGGIDEGETP
LEALYRELLEEIGTNEIEILAQYPRWIAYDFPSNMEHKFYSFDGQKQRYF
LVRLKHVNNIDLNKHTPEFRSYQFIQLKDLLKKIVPFKRQVYRQVIAYFR
KEGYLGC
>gid:19871  mutY  A/G-SPECIFIC ADENINE GLYCOSYLASE
METLHNALLKWYEEFGRKDLPFRNLKGINAPYEVYISEVMSQQTQISTVI
ERFYPPFLKAFPTLKDLANAPLEEVLLLWRGLGYYSRAKNLKKSAEICVK
EHHSQLPNDYQSLLKLPGIGAYTANAILCFGFREKSACVDANVKRVLLRL
FGLDPNIHAKDLQIKANDFLNLNESFNHNQALIDLGALICSPKPKCAICP
FNPYCLGKNHLERHTLKKKQEIIQEERYLGVVIQNNQIALEKIEQKLYLG
MHHFPNLKENLEFKLPFLGTIKHSHTKFKLNLNLYLATTKDLKNPIRFYS
LKDLETLPISSMTLKILNFLKQKNLFGG
>gid:20270  nth  ENDONUCLEASE III
MSLKRAKTKAQQIKELLLKHYPNQTTELRHKNPYELLVATILSAQCTDAR
VNQITPKLFEKYPSVNDLALASLEEVKEIIQSVSYSNNKSKHLISMGAKV
VKDFKGVIPSTQKELMSLDGVGQKTANVVLSVCFDANYIAVDTHVFRTTH
RLGLSNANTPIKTEEELSDLFKDNLSKLHHALILFGRYTCKAKNPLCDAC
FLKEFCVSKASFKA
>gid:20356  ogt  METHYLATED-DNA--PROTEIN-CYSTEINEMETHYLTRANSFERASE
MTLYHYYFKTPKSFPLEYLHLCANESHLLRLDFDATNFSHHTPMNTPLKL
SVQALERYFLGQLFEFDAPLDLIGTFFQKQVWSALMTIPYGKTKSYDEIA
KLINNPKSCRAIGNANRNNPISLIVPCHRVVRKNGALGGYNGGIEVKKWL
LEFESKILNERAKNFLIS
>gid:20208  orf7  cag island protein
MSLATSYNVSNNFSKFNIKRVRGYLICLVCNTPKMIQRGLNGVSFYGCSD
YVNKGDCKGVLREINGSMKMVCLHCENTPIMEKVESGRGGAYACKNCNRK
FYFIDLAKQNERKKDLEKEKKELLNKIEKQKIKHLERFILAGVKANIKEN
SFFLGCKNYPKCEWTASMDSQDLKCPKCNRLMKRKKNFKNNEFFTATSLT
LNAIEFCLYINLKKKETNV
>gid:21184  pcrA  putative ATP-DEPENDENT HELICASE
MDTKRQCMALKASAGSGKTFALSVRFLALLFKGANPSEILTLTFTKKATA
EMKERILDYLKILQKENLESEKEKSQNILKELEEKYHLDPSLVRNNAQKI
YQRFLNAEVRISTIDAFFQSILRKFCWFVGLSANFEVNEDTEAHQRQLNE
GFLSALNNKQLEELSAFIVQCLSYDNYTSDSILERLRFLKNKLYLFDPNK
KDPVFDEEGFLEKLRSLNNQIQSIETASNEAKKAIKCDSFRGFLNSSLTW
LEKKSEYRYFKKLKNEIPTLESECEEIENDLKRYYEAQETAIFKKFPKFI
QLYDNATSKIQALDFDAIKDKVHVLLNGYEEMPAEFFYFRLDSKIAHILI
DEFQDTSLNDYKILAPFIDEIKAGIGQAKWHRSVFFVGDVKQSIYAFRGS
FSSLFESVSKDFYHDNLEFNHRSAPLIINYVNTIFKKAYQNSPTAYLEQK
YPKTSNNKHVTEGYVKVSLVADEKELLLEQILQEAQNLLEHRIDPKDITI
LCATNKDALEIKNYLQEYLSAIRPSTESSAKLSQLVESKIIKNALEYALA
EEPYKPFYKHSVLKLAGYLHDDVIALPGFNPKKESVASFVWKIMEQFKLY
EEPAQSCLELAVGCEDADGFLEKLEAKEIASFNPKGAQIMTIHKSKGMQF
PYVIVCERLGNPNSSHANQLLEEYDGTELARLYYRMKNREVVDKDYARAL
DKEEAAKDHEEINVYYVAFTRAELGLIVVAKDKKESKKESKNKKMHEQLE
LAPLEEGEIAPVISPQKEPLMTSVVIKPHAYGEQVQEIEEESDSDYEKNN
DQEAINFGIALHKGLEYQYAYNIPKQSVLEYLNYHYGFYGLDYQALEESL
ELFENDAGIQALFKNHALKGEAAFLFQGVVSRIDVLLWDRGQNLYVLDYK
SSQNYQQSHKAQVSHYAEFLRTQAPHFKIQAGIIYAHKRLLEKLWV
>gid:21101  polA  DNA POLYMERASE I
MEQPVIKEGTLALIDTFAYLFRSYYMSAKNKPLTNDKGFPTGLLTGLVGM
VKKFYKDRKNMPFIVFALESQTKTKRAEKLGEYKQNRKDAPKEMLLQIPI
ALEWLQKMGFTCVEVGGFEADDVIASLATLSPYKTRIYSKDKDFNQLLSD
KIALFDGKTEFLAKDCVEKYGILPSQFTDYQGIVGDSSDNYKGVKGIGSK
NAKELLQRLGSLEKIYENLDLAKNLLSPKMYQALIQDKGSAFLSKELATL
ERGCIKEFDFLSCAFPSENPLLKIKDELKEYGFISTLRDLENSPFIVENV
PILNSTPILDNTPALDNAPKKSRMIVLESAEPLSMFLEKLENPNARVFMR
LVLDKDKKILALAFLLQDQGYFLPLEEALFSPFSLEFLQNAFSQMLQHAC
IIGHDLKPLLSFLKAKYQVPLENIRIQDTQILAFLKNPEKVGFDEVLKEY
LKEDLIPHEKIKDFKTKSKAEKSELLSMELNALKRLCEYFEKGGLEEDLL
TLARDIETPFVKVLMGMEFQGFKIDAPYFKRLEQEFKNELNVLERQILDL
IGVDFNLNSPKQLGEVLYDKLGLPKNKSHSTDEKNLLKILDKHPSIPLIL
EYRELNKLFNTYTTPLLRLKDKDDKIHTTFIQTGTATGRLSSHSPNLQNI
PVRSPKGLLIRKGFIASSKEYCLLGVDYSQIELRLLAHFSQDKDLMEAFL
KGRDIHLETSKALFGEDLAKEKRSIAKSINFGLVYGMGSKKLSETLSIPL
SEAKSYIEAYFKRFPSIKDYLNGMREEILKTSKAFTLLGRYRVFDFTGVN
DYVKGNYLREGVNAIFQGSASDLLKLGMLKVSERFKNNPSVRLLLQVHDE
LIFEIEEKNAPELQQEIQRILNDEVYPLRVPLETSAFIAKRWNELKG
>gid:20732  priA  PRIMOSOMAL PROTEIN N' (REPLICATION FACTOR Y)
MFYHLIAPLKNKTPPLTYFSKERHLKGALVNIPLRNKTLLGVVLEEVSKP
SFECLELEKTPYFLLPFQIELAIFIAQYYSANLSSVLSLFAPFKECDLVG
LEKIEPTLNALSQTQTNALKELQKHPASLLFGDTGSGKTEIYMHAIAQTL
EQKKSALLLVPEIALTPQMQQRLKKVFKENLGLWHSKLSQNQKKQFLEKL
YSQEIKLVVGTRSALFLPLKELGLIIVDEEHDFSYKSQQSPMYNARDLCL
YLSHKFPIQVILGSATPSLSSYQRFKDKALVRLKGRYTPTQKNIIFEKTE
RFITPKLLEALKQVIDKNEQAIIFVPTRANFKTLLCPNCYKSVQCPFCSV
NMSLHLKTNKLMCHYCHFSSPIPKICNACQSEVLVGKRIGTMQVLKELES
LLEGAKIAILDKDHTSTPKKLHNILNDFNAQKTNILIGTQMISKGHDYAK
VSLAVVLGIDNIIKSNSYRALEEGVSLLYQIAGRSARQISGQVFIQSTET
DLLENFLEDYEDFLQYELQERCELYPPFSRLCLLEFKHKNEEKAQQLSLE
ASQTLSLCLEKGVTLSNFKAPIEKIASSYRYLILLRSKNPLSLIKSVHAF
LKTAPNIPCSVNMDPVDIF
>gid:19882  recA  RECA PROTEIN.
MAIDEDKQKAISLAIKQIDKVFGKGALVRLGDKQVEKIDAISTGSLGLDL
ALGIGGVPKGRIIEIYGPESSGKTTLSLHIIAECQKNGGVCAFIDAEHAL
DVYYAKRLGVDTENLLVSQPSTGEEALEILETITRSGGIDLVVVDSVAAL
TPKAEIDGDMGDQHVGLQARLMSHALRKITGVLHKMNTTLIFINQIRMKI
GMTGYGSPETTTGGNALKFYASVRIDIRRIAALKQNEQHIGNRAKAKVVK
NKVAPPFREAEFDIMFGEGISKEGEIIDYGVKLDIVDKSGAWLSYQDKKL
GQGRENAKALLKEDKALADEITLKIKESIGSNEEIMPLPDEPLEEME
>gid:21150  recG  ATP-DEPENDENT DNA HELICASE
MQETDDLLKTLNVKSLLEALLVYTPKGYKDLSLLERFETGLSGVLEVGIL
EKKNYAKVLKIFAYSKRFYKNLELVFFNHSAFYYNQFKTGESLFIYGKLE
QSSFNQAYIINTPKILTEFGKISLIFKKVKNHKKIQENLQKLLSLENLKK
EGVKENVARLLLEIFFPTPHFVKDFETNKNFPSQHLNALKYIEMLFYMKN
LERKKLQFNAKIACPNNSERLKAFIASLPFKLTRDQQNAIKEIQSDLTSP
IACKRLIIGDVGCGKTMVILASMVLAYPNKTLLMAPTSILAKQLYHEALK
FLPPYFEVELLLGGSHKKRSNHLFEKITHVVIGTQALLFDKRDLNEFALV
ITDEQHRFGTKQRYQLEKMASSKGNKPHSLQFSATPIPRTLALAKSAFVK
TTMIREIPYPKEIETLVLHKREFKIVMEKISEEIAKNHQVIVVYPLVNKS
EKIPYLSLSEGASFWQKRFKNIYTTSGQDKNKEEVIEEFRELGSILLATT
LIEVGISLPRLSVIVILAPERLGLATLHQLRGRVSRNGLKGYCFLCTIQE
ENERLEKFADELDGFKIAELDLQYRKSGDLLQGGKQSGNSFEYIDLARDE
NIIAEVKQDFLKNASVSQGTFEN
>gid:20061  recJ  putative SINGLE-STRANDED-DNA-SPECIFIC EXONUCLEASE RECJ
MKQKLKAQIKERVASIAYNEKGFPSPFLFKDLKKAALKIIEAMRANTEIL
VVGDYDADGVISSAIMAKFFKSLNYKHVRVAIPNRFMDGYGISKKFLEKH
HAPLIITVDNGINAFEAAQFCKEKNYTLIITDHHCLHHDEIPDAYAVINP
KQPDCDFIQKEVCGALVAFYLCYGIHQLLGKEKSHSSELLCLAGVATIAD
MMPLTFFNRFLVSKALYFLQKESLGAMGFLRQREVFRKRSLKASDISFNI
APLINSAGRMQDAKMALDFLSANNFQDGCSLYERLKACNMKRKMIQQQVF
EEAFRHAMVGEKIIVAFKDNWHEGVLGIVASKLVEATQKPSLVFTFKEGV
YKGSARSSPNIDLIDALNGVSSLLLGYGGHRQACGLSVGKNNIVSLFETL
ENFDFKVLPFYETEPPLTLNLKDIDRELLEIIEMGEPYGQENPEPLFQAK
NLEVIEEKIIKESHQVLRFKDKECVKEAIYFNADRFLKAGERVSVLFSVE
LDECSNEPKMFVKSLL
>gid:21172  recN  DNA REPAIR PROTEIN(RECOMBINATION PROTEIN N)
MRDFNNIQITRLKVRQNAVFEKLDLEFKDGLSAISGASGVGKSVLIASLL
GAFGLKESNASNIEVELIAPFLDTEEYGIFREDEHEPLVISVIKKEKTRY
FLNQTSLSKNTLKALLKGLIKRLSNDRFSQNELNDILMLSLLDGYIQNKN
KAFSPLLDALETKFTRLEKLERERRSLEDKKRFQKDLEERLNFEKMKLER
LDLKEDEYERLLEQKKLLSSKEKLNDKIALALDVLENTHKITHALESVGH
SAEFLKSALLEAGALLEKEQAKLEECERLDIEKVLEKLGMLSGIIKDYGS
IAHAKERLGHVKNELHNLKEIDHHCETYHKEIERLKTECLKLCEEISGFR
KEYLAGFNALLSAKAKDLLLKSPSLVLEEAPMSEKGAQKLVLHLQNSQLE
TLSSGEYSRLRLAFMLLEMEFLKDFKGVLVLDEMDSNLSGEESLAVSKAL
ETLSSHSQIFAISHQVHIPAVAKNHILVFKENHKSLAKTLNNEERVLEIA
RMIGGSENIESAISFAKEKLKV
>gid:20597  recR  RECOMBINATION PROTEIN
MNTYKNSLNHFLNLVDCLEKIPNVGKKSAFKMAYHLGLENPYLALKITHA
LRNALENLKTCASCNALSETEVSEICSDESRQNSQLCMVLHPRDVFILED
LKDFLGRYYVLNSIEDVDFNALEKRLIGENIKEIIFAFPPTLANDSLMLY
IEDKLQHLHLTFTKIAQGVPTGVNFENIDSVSLSRAFNSRIKA
>gid:21109  rep  putative ATP-DEPENDENT DNA HELICASE
MGFEKSILDNLNGAQKIAACHIQGPLLILAGAGSGKTKTLTSRLAYLIGA
CGVPSENTLTLTFTNKASKEMQERALKLLKNQALIPPLLCTFHRFGLLFL
RQHMNLLKRACDFSVLDSDEVKTLCKQLKISNFRASISQIKNGMMDLSVQ
DSECYKAYELYQNALKKDNLVDFDDLLCLSLKILQDNEKLAKETSERYHY
IMVDEYQDTNALQLEFLKQLSFTHHNLCVVGDDDQSIYGFRGADISNILN
FSKHFKGAKIVKLETNYRSSAEILACANSLISHNQHRHIKTLQSFKGSHK
SVICKEYPTQKEESLDVAYQIKALLKKGENLENIAILYRLNGLSRSIEES
LNALNIPYRLIGAVSFYERAEVKDALALMHVVAKKDDRFFIKRVLNKPPR
GLGKITQEWIFSLLDEEGLNLEEALKIGAFKDKLNPKNEYALKKFTAMIG
RLREAFEISVEKFCERFLEETNLLKSYEKEDNYEEREGFVKELLSLVKEH
FKTNPTHSLLDFLNESALDVHNTENAQKVSCMSVHMSKGLEFKHVFVIGL
EEGFFPHRGFNQESDLEEERRLAYVAITRAKEELQLSYVKERSYFGRKIS
CSPSVFLEEAQLLQQDKPPKQNHQKDTPIKVGDLIKHKIFGTGRVLGVEK
GLSGLCLKINCGGNVYDKISEKFVEKVDNEF
>gid:20344  rnhA  RIBONUCLEASE HI
MQEIEIFCDGSSLGNPGPGGYAAILRYKDKEKTISGGENFTTNNRMELRA
LNEALKILKRPCHITLYSDSQYVCQAINVWLVNWQKKNFAKVKNVDLWKE
FVKVSKGHSIVAVWIKGHNGHAENERCNSLAKLEAQKRVKTTT
>gid:20981  rnhB  RIBONUCLEASE HII
MGCVSMTLGIDEAGRGCLAGSLFVAGVVCNEKIALEFLKMGLKDSKKLSP
KKRFFLEDKIKTHGEVGFFVVKKSANEIDHLGLGACLKLAIEEIVENGCS
LANEIKIDGNTAFGLNKRYPNIQTIIKGDETIAQIAMASVLAKASKDREM
LELHALFKEYGWDKNCGYGTKQHIEAINKLGATPFHRHSFTLKNRILNPK
LLEVEQRLV
>gid:20553  ruvA  HOLLIDAY JUNCTION DNA HELICASE
MIVGLIGVVEKISALEAHIEVQGVVYGVQVSMRTAALLQTGQKARLKILQ
VIKEDAHLLYGFLEESEKILFERLLKINGVGGRIALAILSSFSPNEFENI
IATKEVKRLQQVPGIGKKLADKIMVDLIGFFIQDENRPARNEVFLALESL
GFKSAEINPVLKTLKPHLSIEAAIKEALQQLRS
>gid:20105  ruvB  HOLLIDAY JUNCTION DNA HELICASE RUVB
MKERIVNLETLDFETSQEVSLRPNLWEDFIGQEKIKSNLQISICAAKKRQ
ESLDHMLFFGPPGLGKTSISHIIAKEMETNIKITAAPMIEKSGDLAAILT
NLQAKDILFIDEIHRLSPAIEEVLYPAMEDFRLDIIIGSGPAAQTIKIDL
PPFTLIGATTRAGMLSNPLRDRFGMSFRMQFYSPSELSLIIKKAAAKLNQ
DIKEESADEIAKRSRGTPRIALRLLKRVRDFALVKNSSLMDLSITLHALN
ELGVNELGFDEADLAYLSLLANAQGRPVGLNTIAASMREDEGTIEDVIEP
FLLANGYLERTAKGRIATPKTHALLKIPTLNPQTLF
>gid:20549  ruvC  CROSSOVER JUNCTION ENDODEOXYRIBONUCLEASE
MRILGIDPGSRKCGYAIISHASNKLSLITAGFINITTTRLQEQILDLIEA
LDCLLDRYEVNEVAIEDIFFGYNPKSVIKLAQFRGALSLKILERIGNFSE
YTPLQVKKALTGNGKAAKEQVAFMVKRLLNITSEIKPLDISDAIAVAITH
AQRLKPR
>gid:20904  ssb  SINGLE-STRAND BINDING PROTEIN
MFNKVIMVGRLTRNVELKYLPSGSAAATIGLATSRRFKKQDGTLGEEVCF
IDARLFGRTAEIANQYLSKGSSVLIEGRLTYESWMDQTGKKNSRHTITAD
SLQFMDKKSDNPQANSMQDSMTHENFNNAYPTNYNAPSQDPFSQAQSYPQ
NAYTKENSQAQPSKYQNSVPEINIDEEEIPF
>gid:20565  tnpA  IS606 TRANSPOSASE
MSVSKLVNSLKGVSSRLTRQHHFKSVEASLWGKHLWSPSYFAGSCGGTPL
EMIKQYIQEQETPH
>gid:20564  tnpB  IS606 TRANSPOSASE
MKVNKGFKFRLYPTKEQQDKLQHCFFVYNQAYNIGLNLLQEQYEKNKDLP
PKERTRKKSSELDKAIKHHLNARGLSFSSVIAQQSRMNVERALKDAFKVK
NRGFPKFKNSKSAKQSFSWNNQGFFIKESDEERFKIFTLMKMPLMMCMHR
DFPPHSKVKQIVISCSHRKYFVSFSVEYEQDITPIKNPKNGVGLDLNILD
IACSCGVNNHKKLTDFKRYSTDMKELLGIEIDEELDTKRLIPTYSKLYSL
KKHSKKFKRLQRKQSRRVLKSKQNKTKLGGNFYKTQKKLNQVFDKSSHQK
TDRYHKITSELSKQFELIVVEDLQVKNMTKRAKLKNVKQKSGLNQSILNT
SFYQIISFLDYKQQHNGKLLVKVPPQYTSKTCHCCGNINHKLKLNHRQYW
CLECGYREHRDINAANNILSKGLSLFGVGNIHADFKEQSLSC
>gid:19849  topA_1  DNA TOPOISOMERASE I
MKHLIIVESPAKAKTIKNFLDKNYEVVASKGHVRDLSKFALGIKIDETGF
TPNYVVDKDHKELVKQIIELSKKASITYIATDEDREGEAIGYHVACLIGG
KLESYPRIVFHEITQNAILNVLKTPRKIDMFKVNAQQARRLLDRIVGFKL
SSLIASKITKGLSAGRVQSAALKLVIDREKEIRPFKPLTYFTLDALFEPH
LEAQLISYKGNKLKAQELIDEKKAQEIKNELEKESYIISSIIKKSKKSPT
PPPFMTSTLQQSASSLLGFSPTKTMSIAQKLYEGVATPQGVMGVITYMRT
DSLNIAKEALEEARAKILKDYGKDYLPPKAKVYSSKNKNAQEAHEAIRPT
SIILEPNALKDYLKPEELKLYTLIYKRFLASQMQDALFESQSVVVACEKG
EFKASGRKLLFDGHYKILGNDDKDKLLPNLKENDPIKLEKLESNAHVTEP
PARYSEASLIKVLESLGIGRPSTYAPTISLLQNRDYIKVEKKQISALESA
FKVIEILEKHFEEIVDSKFSASLEEELDNIAQNKADYQQVLKDFYYPFMD
KIEAGKKNIISQKVHEKTGQSCPKCGGELVKKNSRYGEFIACNNYPKCKY
IKQTENANDEAKQELCEKCGGEMVQKFSRNGAFLACNNYPECKNTKSLKN
TPNAKETIEGVKCPECGGDIALKRSKKGSFYGCNNYPKCNFLSNHKPINK
RCEKCHYLMSERIYRKKKAHECIQCKERVFLEEDNG
>gid:20657  topA_2  topoisomerase I
MYKNCVFIIESPNKIAKIKELTGSSFVFATGGHFVELVNIEVNKEFNPIF
EIKKSTDKKKDRSTHINHMINQCKDKVVYIATDPDREGYGIGYKFYEKIK
NLAKTIYRTEFHEITKSGVEKGLNNAGLFSQSNLNLYYSWLGRIVSDQFI
GFTLTPYLRKNIKNFEVGAGRVQTPALSILVELDRKIQAFEQKNNDEKLS
YSIEAIIDALGSQISITLVEENKRKAFETKELAQNFLNDLKNNLNPLAFL
DAIEQKDKEKAPPKPFTTSNLLKDGARILGMGVKQIQEHAQKLFEAGLIT
YIRTDSEALSKEYLQEHEAFFEGIYPSVYEYREYRAGKNSQAEAHEAIRI
THPHCYEDLKKVCEEHNITDIDDLKVYTLIFFNTICSQSKNAIYENTVLN
FKVKTHRFNAVLANSNLKVLKRLKT
>gid:20669  topA_3  topoisomerase I
MNNSVIIIESPNKVAKIREITGAKVFATIGHFMQLKSYDENNGFKPTFDY
DQEKKKHIFEMIEACKNKKVYIATDPDREGYAIGYMFYQKIKNVASSIYR
AEFFEITPSGINKGLQNALLFENTNRQMYQSALARRVADMLLGFTLSPYL
GKALGQMKGSSAGRVQTPCLKLIVDRDREIEKFKALPENEKVSYQIQAKI
NDSANREVTIKHCDEKGEEIKFNDKEEALKLFESLKDNKACLLKDLKNSV
VETKPKKPFITSTLLEKASSMLGLSISEVQSLAQNLFEAGLITYIRTDAE
SLSVEFLDETESFYAPIYKDLYLKREYKAGKQSQAEAHEAIRITHPHTTE
DLESIVYNANITNQDALKLYQLIFERTIESQGKNAIYDKQDLLFKIKNEY
FKCSVKGLKSAGFLAMFSKKELENDESNDDKDNKEKEQNAQFNLKIDDVL
SLNDLVLSTIKRNAPSAYKEADFVKLLENKGIGRPSTYASYLPTLVKREY
ISISQDKKHIITPTHKGKRVVEVFENAYQFIIDLTYTKQMEEVLDEIVEN
KSSYVDFISNLNSKCPKIEKLERNDDEIKPSSEGQITYIENILRDLQLNL
SEEFKNYKEDNRVAKAFLDRYIKEHEFFKKNNKKASSSNNDENRPATPKQ
ISFAEILAKKHNVKLPKGFKYSMKVCGDFINEYHKK
>gid:21004  ung  URACIL-DNA GLYCOSYLASE
MKLFDYAPLSLAWREFLQSEFKKPYFLEIEKRYLEALKSPKTIFPKSSNL
FCAFNLTPPYAVKIILLGQDPYHSTYLENEQELPVAMGLSFSVEKNAPIP
PSLKNIFKELHANLGVPVPCCGDLSAWAKRGMLLLNAILSVEKNQAASHK
YIGWEAFSDQILIRLFETTTPLIVVLLGKVAQKKIALIPKNKHIIITAPH
PSPLSRGFLGSGVFTSVQKAYREVYRKDFDFSL
>gid:20382  uvrA  EXCINUCLEASE ABC SUBUNIT A.
MQHKTIMDKIIIQGARENNLKNIFLEIPKNQFVVFTGLSGSGKSTLAFDT
LYAEGQRRYLESLSSYARQFLDKVGKPNVDKIEGLTPAIAIDQKTTSKNP
RSTVGTITEIYDYLRLLFARVGEQFCPTCLEPISSMSASDIISQICHLEE
NSKIIILAPIIKDKKGSFNDKLESLRLKGYVRAFVDGVMVRLDEEIHLHK
TKKHTIEAVVDRVVINSENASRIASAVEKALKESYGELEVEILQDNAPSI
RKHYSEHKACFKCKMSFEELEPLSFSFNSPKGACESCLGLGTKFSLDISK
ILDPNTPLNQGAIKVIFGYNRSYYAQMFEGFCTYNGIDSALCFNELNKEQ
QDALLYGNGTEISFHFKNSPLKRPWKGIIQIAYDMFKEQKDLSDYMSEKT
CSSCNGHRLKASSLSVQVAGLKMADFLTKPIEEVYHFFNDPTHFNYLNEQ
EKKIAEPILKEILERVFFLYDVGLGYLTLGRDARTISGGESQRIRIASQI
GSGLTGVLYVLDEPSIGLHEKDTLKLINTLRNLQKKGNTLIVVEHDKETI
KHADFVVDIGPKAGRHGGEVVFSGSVKDLLQNNHSTALYLNGTKKIERPK
FEPPKEKHFLEIKNVNINNIKNLSVQIPLKQLVCITGVSGSGKSSLILQT
LLPTAQTLLNHAKKNQSLNGVEIVGLEYLDKVIYLDQAPIGKTPRSNPAT
YTGVMDEIRILFAEQKEAKILGYSTSRFSFNVKGGRCEKCQGDGDIKIEM
HFLPDVLVQCDSCKGAKYNPQTLEIKVKGKSIADVLNMSVEEAYEFFAKF
PKIAVKLKTLIDVGLGYITLGQNATTLSGGEAQRIKLAKELSKKDTGKTL
YILDEPTTGLHFEDVNHLLQVLHSLVALGNSMLVIEHNLDIIKNADYIID
MGPDGGDKGGKVIASGTPLEVAQNCEKTQSYTGKFLALELK
>gid:20779  uvrB  EXCINUCLEASE ABC SUBUNIT B
MPLFDLKSPYPPAGDQPQAIEALTKSLKNNNHYQTLVGVTGSGKTYTMAN
IIAQTNKPALIMSHNKTLCAQLYSEFKAFFPHNRVEYFISHFDYYQPESY
IPRRDLFIEKDSSINDDLERLRLSATTSLLGYDDVIVIASVSANYGLGNP
EEYLKVMEKIKVGEKRAYKSFLLKLVEMGYSRNEVVFDRGSFRATGECVD
IFPAYNDAEFIRIEFFGDEIERIAVFDALERNEIKRLDSVMLYAASQFAV
GSERLNLAVKSIEDELALRLKFFKEQDKMLEYNRLKQRTEYDLEMISATG
VCKGIENYARHFTGKAPNETPFCLFDYLGIFEREFLVIVDESHVSLPQFG
GMYAGDMSRKSVLVEYGFRLPSALDNRPLKFDEFIHKNCQFLFVSATPNK
LELELSQKNVAEQIIRPTGLLDPKFEVRDSDKQVQDLFDEIKSVVARGER
VLITTLTKKMAEELCKYYAEWGLKVRYMHSEIDAIERNHIIRSLRLKEFD
VLIGINLLREGLDLPEVSLVAIMDADKEGFLRSETSLIQTMGRAARNANG
KVLLYAKKTTQSMQKAFEITSYRRAKQEEFNKIHNITPKTVTRALEEELK
LRDDEIKIAKALKKDKMPKSEREKIIKELDKKMRERAKNLDFEEAMRLRD
EIAQLRTL
>gid:20498  uvrC  EXCINUCLEASE ABC SUBUNIT C
MADLLSSLKNLSSSSGVYQYFDKNRQLLYIGKAKNLKKRIKSYFSVRNNE
ITPNPRTSLRVQMMVKQIAFLETILVENEQDALILENSLIKQLKPKYNIL
LRDDKTYPYIYMDFSIDFPIPLITRKILKQPGVKYFGPFTSGAKDILDSL
YELLPLVQKKNCIKDKKACMFYQIERCKAPCEDKITKEEYLKIAKECLEM
IENKDRLIKELELKMERLSSNLRFEEALIYRDRIAKIQKIAPFTCMDLAK
LYDLDIFAFYGGNNKAVLVKMFMRGGKIISSAFEKIHSLNGFDTDEAMKQ
AIINHYQSHLPLMPEQILLSACSNETLKELQEFISHQYSKKIALSIPKKG
DKLALIEIAMKNAQEIFSQEKTSNEDRILEEARSLFNLECVPYRVEIFDT
SHHSNSQCVGGMVVYENNAFQKDSYRRYHLKGSNEYDQMSELLTRRALDF
AKEPPPNLWVIDGGRAQLNIALEILKSSGSFVEVIAISKEKRDSKAYRSK
GGAKDIIHTISHTFKLLPSDKRLQWVQKLRDESHRYAINFHRSTKLKNMK
QIALLKEKGIGEASVKKLLDYFGSFEAIEKASDQEKNAVLKKRK
>gid:19984  xseA  EXODEOXYRIBONUCLEASE LARGE SUBUNIT
MHVLSVSEINAQIKALLEATFLQVRVQGEVSNLTIHKVSGHAYFSLKDSQ
SVIRCVLFKGNANRLKFALKEGQEMVVFGGISVYAPRGDYQINCFEIEPK
EIGSLTLALEQLKEKLRLKGYFDEANKLPKPNFPKRVAVITSQNSAAWAD
MKKIASKRWPMCELVCINTLMQGEGCVQSVVESIAYADSFYDTKNAFDAI
VVARGGGSMEDLYSFNDEKIADALYLAKTFSMSAIGHESDFLLSDSVADL
RASTPSNAMEILLPSSDEWLQRLDGFNVKLHRSFKTLLHQKKAHLEHLAA
SLKRLSFENKHHLNALKLEKLTIALDNKTLEFLRLKKTLLEKISTQLSTS
PFLQTKTERLNRLENALKLAYANLKLPQFGALVSKNHQAIELEALKRGDK
IELSNEKARASAEILSVDRV