GenColors

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Gloeobacter violaceus PCC 7421
Gene type: CDS

Number of genes found: 208

# Gloeobacter violaceus PCC 7421

>gid:534806  dnaA  chromosomal replication initiator protein
METLWDGILSHLKGRLSRPTFETWIKPATAQQFEQDCLIIRTPNPFARSW
LQQHYAGAIAQAGEQVIGRPIQVDFIVSEQSEEALKPVIEREPAPAAPPA
NVASLNSKYTFSRFVVGANNRMAHVAALAVAEMPGCNYNPLFLCGGVGLG
KTHLMQAIGHYRLDIDTRTKIAYVSTERFANELIEAIRRDAMQTFREHYR
RVDLLMIDDIQFIEGKEYTQEEFFHTFNALYESGKQIVIASDRPPQLIPR
LQERLSSRFSMGLITDIQQPDIETRMAILQKKAEYENMFVPQDVIHHIAS
AYTTNIRELEGALIRAVAYVSISGLPMTVETISPILSPPRTRGEITDEAI
LELVCDELKVSAEDMRSDSRRRDISQARQLCMYLLRKYTDLSLPKIGQAL
GGKDHTTVLYAIDKIEQSKIRDPEVQRLLQRLGNRLEADARH
>gid:537297  dnaE  DNA polymerase III alpha subunit
MQYVNLHTHSDYSLLDGASQVPDLIERAKGLEMPAIAITDHGVMYGAIEL
IKKCRAAGLKPIVGTEGYVINGDIRDKSRRYRRYHQILLAKNLTGYKNLV
KLVTISHLDGVQGKGIFSRPCMNKEMLVQYHEGIICTSACLGGEIPQLIL
AGQFEQARRTTQWYKDLFGDDFYLEIQDHGQREDRVVNVAVVHLGRELGI
KVIATNDSHFTSCEDVDAHDALLCIQTGKLVVEENRMRYTGTEFLKSAEQ
MAGMFRDHLSDEVIAQALTTTLEIAEKVEKYDRELLGDSRLPRFPIPHGH
TAESYLEQVTWEGMRERFGSEIAPDYRERLKYELGIIEQMGFATYFLVVW
DYIKYARDHGIAVGPGRGSAAGSLVAYALRITNIDPVRYNLLFERFLNPE
RKSMPDIDTDFCIERRDELIRYVTEKYGEQYVAQIITFNRLTSKAVLKDV
ARVLDISYSEADKLAKLIPVVRGKPVPLKKMISEETPAPEFKQSYEGDER
VRKWVELAMRIEGTNKTFGVHAAGVVISSVPLDELVPLQRNNDGQVITQY
YMEDIADLGLLKMDFLGLRNLTMIHRAQELIERYRGVRIDLDNLPLEDSG
TYALLGKADLEGIFQLESSGMKAVVRDLRPSNIEDISSILALYRPGPLDA
GLIPEFINRKHGRKKIEYLHPLLEPILKETYGVLCYQEQIMKVAQELGGY
TLGQADLLRRAMGKKKREEMEKHRSIFVDGSAKNGVPSRVADELFEQMVV
FAEYCFNKSHSAAYGYVTFQTAYLKAHYPVEYMAALLSSVKGDQDKVQKY
IASCMGMGIEIEPPDINRSGFDFTPQEKSILFGLGAVKNVGEGAIQALLD
ARDSQGPFKNLANLCDRVDMRVINKRALESLIKAGAFDAFSSNRGQLIAD
MEKVVDWAQDRARDRALGQFNLFDALSGGGDRNPTGYQSAPSAPPTPDLP
PADKLKFERELLGFYISDHPLKAVRESARLLAPVNLADLADCRAESTVSA
IALVREVKNVVTKKGDRMAIVQIEDLTGSTEAVVFPKTYERVGGYFIPDA
RLMLWAKIDMRDDRPQLIIQDAEPVEEVRMVVIDLEARADLQTWHQIRDV
LQAHQNETAHIAVIASVKTDQRHKLVRFGPQFRVSNDSSLIQALQCKGFS
AHTTRIVGG
>gid:537447  dnaG  DNA primase
MSAPQLHPRTIDEVRAKADLVDVVSERVVLRKAGRDFKGLCPFHEDRSPS
FYVSPGKQIYKCFACGASGDVFKFVMELDKSAFGEVVLELARRFGVPVQT
LKPEQKAEYTRKLSKQQQLGEILELAAQFYSYALWGERGAAARQYLLEAR
ALSEKTIRQFRLGFAPEGWQSLYTYLVDQKRFPASLVEEAGLIIPRSSGQ
GYYDRFRHRLIIPICDLKGKVIAFGGRAMGDEQPKYLNSPETELFNKGQT
LYALDQARESVGRTDGAIVVEGYFDAIALHQSGISHVVATLGTSLRTDQI
KQLLRYTESKRVVLNFDADAAGVQAAERAIAELRPLVVKSGVQLRIVALP
AGKDPDEFLRSHPPQEYLKLASEAPLWIDWQIERLFAGRDLSLAADFQQV
SQGLVELLSGLLSSMTRAHYIHLIAGQLSQGNGRLASQLEDELRRRIRTH
RWGGGDGKSKKKREPFPPACFQAEVQLIQIYLHFAEYREDIHRGLDEHDL
EFSVLHHRQLWQKILQLREEIGEDDDVVVALRTMYAGDPELNQRLGQLLW
LNEHNRIALMRPRMVIRAGLATLQLDRCERRYNYLNQLCDEAERRGAWEE
ADYFVEQRNAEYHRINDLKTHLTLKLGEISETAVWQES
>gid:536586  dnaN  DNA polymerase III beta subunit
MKIVCSQAVLNQNLSLVGRAVPSRPTHPILANILLEADGRSGTVTLTGFD
LDLGIDTRFAAEQVEGSGRTTLPARVLSDIISRLPNESLTMEVSEDNAIS
LVCGSSQYQVQGASAEEFPKLPELPDSAAHVLPVSEFLEGVQRSLFAAST
DESKQILNGVSVKALKEGMEFVATDAHRLSFYRTDFTLPEAGTQAIAAVL
PVRSVRELEKILGAQAGDSVEVRFDEKQMIFQFPNQTLTTRLLGGRYPDY
QQLLPKQFQQTADMERKRLIGCLERIAVLADQKNHIVKLDFAPSEHLMTV
SVDAPDVGRGRESVPVQYSGQDFSVAFNVRYLLEGLKAMDATDVSFCLNG
ASDPAVLKPVGDGNYQYLIMPVSIRG
>gid:534759  dnaX  DNA polymerase III delta prime subunit
MNCTAQSSVKGQPLAMALLERAIVTGRIAPAYLFCGPQGVGKAMAARHFA
ARLLESALGRIERGNHPDLLWVEPTYKKGDKLLSRAEALAEGGNLPRALP
QVRLEQVRGINQFLSRPPLEAARQVIVIEGAEAMGDGAANALLKILEEPG
GAVFVLIAPSAGGLLATVRSRCQKVPFCCLPHELVAQILAALGKSVDERL
LAMAQGSPGRALELQTWLFGIPPELLAEAEKWAYRPLDLRSALTLARRID
AELNLEQQIPLVDYLQQLAWAAGRVERLAPLEALRTQLLGYVSPRLAWEV
GIPPLPPL
>gid:534943  dnaX  DNA polymerase III gamma and tau subunits
MAYEPLHHKYRPQRFGDVVGQGPIVTTLTNALKAGRIAHAYLFTGSRGTG
KTTTARLIAKALNCIHGPTPDPCGRCEQCLAIATGSALDVIEIDAASNTG
VDNIRELIERAQFAPVQSRQKVYILDEVHMLSSAAFNCLLKTLEEPPAHV
TFVLATTDPQKVLPTVISRCQRFDFRRIPLTDMVKHLEEIAWKEDIDIEH
EAVELVAQIAQGGLRDAESLLDQLALLEGTIGAERIWQLVGAVPERELLV
IGECIQAGDSTRLIEQVRRLLDSGKEPLQVLQDLLGFYRDLLIAHTAPDR
RDLVAVTAASWQQMCERAKTLSIPQLLALQERLRQAEPQIKTSTQPRLWL
EVGLLGLLASPSQNQPVALAPAAARPAAPVPVPPPRPAEERPTPPPPAAP
RGVEERPNSVPQPPVEPTPPQRPQVPPPSIEPAAAIHRPARPSGSTPDGT
GLRERWPEWIRLIPQPTQSLMSSSFWLEETPKRLVIGFTTEPLVKRASEP
RRLKQIQEVLQQFLGRPIEVQFHIGKAPAAAPAVAVASSQSAAYTPPPVS
FSPPAPPEVTPVSPTTVAPPSAAKPPLSDEEPDELDRAARGLAEFFNGEV
ISFDGEAEELPGATQPGAAEVSDIDDDEDIPF
>gid:536765  fpg  formamidopyrimidine-DNA glycosylase
MPELPEVETLRRDLLIHLPGERVVGVEVLRSDSVGYPADPAIFIEQMQGQ
IFSDRMLRRGKYLLLYFARGAALGVHLRMSGRLLWRCGEAPLEPHTRVRI
PMASGHELRFEDMRVFGRLWLIPVGVPPERVMGGLTRLGPEPFAEMFDGP
YLAGRFAGRNQPVKSALLDQQLVAGVGNIYADEALFSSGIHPALPVGGLD
AAALERLHRAVVKVLEAGIAQRGATLRNYTDAQGINGNYAGTAWVYGRKG
QPCRVCNTPIERIRLAGRSTHFCPTCQRAQQSVQ
>gid:533338  gll0025  primosomal protein N'
METFRPGPLLRPAAWVQVLVDAPARERTYTYRLADGMTAAGGDVVCVPFG
SQLVGGIVLGCLDQLPTGLDPGRLKTVESVVGTGLFAPGFWPLLVRVADY
YLVPLARVLETALPPGILSRARRRVRLVADAAPMAQWGLSAAAKSAFDFL
RSQAHADFSWRFCVQRLSGGVSSLRELQARRLLESYYVFGETARPRTQQF
VVLAGAGADLCGRSAAVVQCLRRLGGEVSVEQLLAEARTTRALLQKLDAG
GHVRIYERQILRLAAPTASADAPKLLTDAQKQVLDRLQGATPGPVLLEGV
TGSGKTEVYLQAIAPVLDRGESALVLVPEIGLTPQLTDRFAARFGARVRV
YHSALSEGERFDCWRQMLTGEAQVVVGTRSAVFAPLPKLGLIVLDEEHDG
SYKQDRPAPCYHARTVALWRGELAGCPVLLGSATPDLESFDRATAGRYLH
LEMPERVAGRPLPAVEVVDMREELNRGNFTPFSATLQRAVAEMHGAGRQG
ILFINRRGYSTFVLCRNCGETLRCPHCAVSLTYHRLEAGDHLRCHYCNHG
APQPRACPHCTSPNLRYFGAGTQRIAAQLSEQFPTLRVLRFDRDTTARKD
AHRQILEQFGRGEADVLVGTQMLTKGLDLPQVTLVGILAADGLLNLPDFR
ASERAFQLLTQVAGRAGRGSEPGRVILQTYAPEHPVVEAASTHDFRRYAA
AELTQRRALGYPPFVQLVALQLSAAEQQSVVESAEALARRLDGSADFEGR
LLGPAPCTVERVAGRYRWQLLIKNPNGEAGRASLRALLTQFVPCAGVTVA
VDVDPLRLL
>gid:533347  gll0034  
MTSSELDLRIEERLPPQNIDAEETILGGLMMDPEALTRVVEHLRPEAFYV
ESHQIIYRAALALHAQGRPTDLLTLSAWLEDNKLTEKIGGRTYLRRLTDA
AINTINIDGYGRLVSDKYALRMLIRAGQEIAALGFDSATEIPKLLDRAEQ
TLFAVTQERVQRSLVPASEVLMNIFEQLETRYQDGSNVFGIPTHFYDLDN
YTQGLQPSDLIIVAGRPGMGKCCAADTPIADPVTGALVTIEEIYRRGEAG
KLVEVLTLLGDGRLARVEPSHFVDDGIKPVYRVRTGLGREVKTTLTHPYL
TPTGWKPLAEIAAGARIAVPCRIPVFGSESLPPKEISLLCSRATRDNRIP
DPVFRLPRAQLVAFLKQLCITADSARVSDRTVEFTSPSKSFCHQLQHLLL
RLGVLSALREVSGIFYLDIKPAAETPIKPASLWSQDLAHHCDLHWDEIAS
IEYVGNEQVYDLTVPVTHNFVAADICLHNTAFSLSIAQRIAQKAGLPAVV
FSLEMSKEQLVQRLLCSEAGVESHRLRAARISENEWQRIGQAIGELASIP
LYIDDSPNATVTEIRSKARRLQAEQGGRLGLVMIDYLQLMEGAGSDNRVQ
ELSKITRGLKGLARELRVPVMALSQLSRSVEARTNKRPMLSDLRESGSIE
QDADIVLMLYRDDYYNPDSPDRNIAEVNIVKHRNGPTGTVKLLFENQFTR
FLNLTSGNH
>gid:533367  gll0054  serine/threonine kinase
MASESLQGFAIGRVVGGRYRLLGRLDGGSMGSVYEAADTKLAGKVVALKV
MHRSLAGDTEVVKLLRQRFEEEARLSAILGSHPRIIQVTDYGVEGPQPYL
VMELLKGRSLKEVLAQGPMPPGRAVRLAVQLCDGLQHAHAAQATVEGRTI
RGIIHRDIKPGNLFLIEDESLGETVKILDFGIAKANSDISLALGTQAGFV
GTSGYASPEQLRGEALDARSDIYSLGVVLYQMLTGQMPLKPKTETFAGWY
HAHNYTTPTGFQAHRLPYMLPPALAAVVLSCLEKDPANRPQSMQMLGMQL
QWAMGWFPREETSQPLLPRDVFIGGAIALLALVGFLFLLS
>gid:533379  gll0066  
MPVHLYWGEDAYRRGQAVEQLRAAAVEPAWEAFNFGRYAGEALIDALNQA
ASAPFGGGGRLVWVEEARIFSHCPEGELAELARTLPHLHPNGHLLFTLQG
KPDGRLKSTKLVAKVGEVREFSLIPPWQEAKLKAQVQQMAQERKVTLSAP
ALQLLTEAVGNDTRRLDNELEKLALFAQGRSVDAEAVGALVATTAHTSFQ
LAAALLAGEAATALRVLDELLRRNEPALRIQAVLVNQFRTWLWVRVLIEA
GERDAQAIAAAAEIANPGRVYFLQKEVERTSAERLGRVLPILLGLEVALK
SGRPERAALESAVLHIVNALR
>gid:533397  gll0084  
MSGKTYRATGINLRRMPLGESDLLMTILTRENGLVRAVARGARKANARIG
GRTEQFVVNDLQLYRGRSLDQLTQAESLRTFPGLLQDLGRLTAAQYLAEG
VLQEATEGQAQEDLYDLLLVHLERLAATPSHQIAARLVHGVYQLLAVGGV
APEVHFCTVSHRPISAESAGFSVEGGGLVALECLSHERVGFRLDIEQVAA
LQLLADADLTPASLDWNYLWIGLERLLRRHIEFHFDRPLRAATLLEICFE
PLAVPAAASPQ
>gid:533464  gll0151  
MQPSVWQPPVEPSPAEQAILKRIRRAKLFVFLRRIRHQLFDAAFQSELAG
IYKDSPCGQPPIPPAQLALATVLQAYTGISDDETIEVLTMDRRWQMVLDC
LDCEEAPFGKATLIRFRQALIAHGLDRRLIERTIELATSDGGFGSRALRA
ALDSSPLWGAGRVEDSFNLLGHAMRKALRIMVCQTGVGLAQWAEQTGTTL
VAGSSLKAALDIDWSQPDECAEALGRLLGALESLESYLDGQTQPPQAGVA
RCLQAAEQVHQQDVQLNSRGQFVLRRGVSRERRISIEDPAMRHGRKSRAV
RIDGYKRHVLKDIDSGLVRAVGITAANRPEAAVTRDIEADLEPQRVKLIE
LHIDRAYLSSEWVRLRPEDLRIYCKAWPVRNGPYFQKTAFVLDWEAMRIR
CPQGVTQPFEVGGKVRFPAARCRRCPLQERCTTSPKGRSVSIHPDELLLV
ELRERQQTKEGRAKLRERVSVEHSLAHIGQWQGRRARYLGVRKNLFDLRR
VAVVHNLHVLARQEAAGRSQAA
>gid:533490  gll0177  
MVEWNSLNWRQIQRRVFKLQTRIFKASQRGDFKAVRKLQKLLIRSWSARC
LAVRRVTQDNQGKKTAGVDGVKSLTPKARLALTKNLRISEKAKPMRRVWI
AKPGTQEKRPLGIPTMTDRARQALLTLALEPEWEARFEPNSYGFRPGRSC
HDALQAIYNAIRQQSKFVLDADIAKCFDRIDQQALLKKMNTSSAIRRQIR
AWLKAGVMEGSELFPTPTGTPQGGVISPLLANIALHGMEERVKQVSKMAQ
LIRYADDFVCIHTDQQIVQSCQTVLEEWLAGMGLELKPSKTRIAHTLLLE
EGQPGFDFLGFTVRQFPVGKYHTGTNSRGKPLGFKTIIKPSKKSIKTHTE
KLRRIIHKHKAQPQSFLIEHLNPIIRGWTHFFSKVVSGATFKKLDRTLYM
QLKAWATSHHRNQSQKWLARKYWLTIGTQNWCFAVQKGEGIKRLVQHSDR
PITRHVKVKERRSPYDGDWVYWSNRIGQHPQIKAEVARLLKSQTGKCTHC
GLYFMNGELLEVDHRKPKRFGGRNISSNKQLLHRHCHHQKTAAERIQQLQ
SQSRITEAERLQLQAKRQALLEDYQPYKSKKRLTDAEWMQQWH
>gid:533518  gll0202  pilin gene inverting protein
MRLFWESTLRSCPFRLRCSSRASCAPRASKIPPPDLNNCVTGSISSNSNT
YMLAWKPLVAISKLWLIICINRVIQSAVNPKRIKGYAQAQMQRSKTDQLD
ATVIAHFCAALHPQTWQPPAEAVLVLQSLLRRFEGLQQMRQQEENRRQMP
QVCAEVKSSIEQMLSFLDQQIAQVKQQIREHFEQHPDLKHQQQLLCSIPG
IAELTAARLLGELLGFAPFGNARQLAAFAGLSPRQYQSGTSISGRTHLSK
MGNSRIRKALYCPAIVALRYNPKIQEFSERLLAAGKTKMCVVGAVMRKLL
HYAFGVLQSGRPFDTQARQAQAG
>gid:533525  gll0209  
MLLDFPQLVKTALSSLPNDDFPVLDSRLFFSCWLALVMDKSTVSMRDLFK
RVNHTGIPVDISTFSKACKSRSLQIFEQLYQALLVRVRRELPAKKLHPCP
IDSTVVGLTSKLLWAQDYHQVKLLTCLEHGSGTTEGSLINFGYDHDSNFV
NEMLQAIPENGVGIFDRGFAGLEYLKNAQASSKYFLMRIPSNYKLAFEGN
AGRMRVGTGKESGMYRVVNFCDIENRAEYRLVTNLPGEGEWLVRDEEVME
LYRQRWQIELLWKFLKMHLKLKRLMTKSENGIRMQLYITLIAHLLLELVS
VPKIWGSQRLDKLRYLQCCMCQEISYVHWLGKLLGSQRRRARLPRACTYV
H
>gid:533527  gll0211  
MKATPSINTSGGVVGNPGSGVLSASASDTVLINGTNLNTLNSVTINGIAA
YAVAVSATQIQVSLPENASTGPLVVKVAGVNLTASSSFKVNASTAPPSAH
LDLVGVLGALQLLADVGGLIPGLNVPSAILGIGVAAALGDAAGALASGLA
LVPFGGVAKAGVGLLKAGTGLSRGIVAASRACGCFSEGTEVQTEAGAKPI
ELVEPGEKVLARNEQTGEQSLRRVKSTFQFDDRPVYRLELRETNGQGERD
TLTVTGEHPFFLKDKGWTAAERLKSGERVQAADGKWLRVAGLEAQPHRQR
TYNLEVEGDHTFFVGHNQAWVHNECPLFKLNTRVVEQLKDKRLGGLAGQL
TEERLNRLINEPGARRFLDAGPINVSPFDKPNINVIQEVQGRLLRITVAS
DKFEIISVGPIRERNIINSIRSGRYVPF
>gid:533528  gll0212  
MGVLGALQVLADVGGLIPDLNVPSAILGIGASVVLGDAAGALASGLALVP
FGGFAKAGVGLFKARTGLSRGIVAASRACGCFAEGTEVQTETGTKAIEKV
EPGEKVLARNEKTGEQNLRRVQSTFQFDDRPVYRLELRQTDGDGERDTLT
VTGEHPFFLQGQGWTAADKLQAGDRVQAVDGRWLRVVGLAAQEQRQRTYN
LEIEGEHTFFVGHNQAWVHNECISLYRAVLDKELQDIKSSGVFRNPYGIE
IKYFSVTAEGAARYARSQFINRPFEGPYTLLRTRVPSSLLANPQVTRLIL
DRGIGGVDTVTIPTELLKNLSPPTMLRAAPLPSLK
>gid:533593  gll0277  
MKCPACQSDNFSKNGHRRGVQYYICKDCCKQFLEYYTPQGYPEKVKRNCL
TMYLNGMDFRGIERTTGVCRNTVINWVKQAALALPDLPQTKKISTVAQLD
KL
>gid:533743  gll0427  
MEPQPSLSDFSVPLGMSAPASEPETLAPELVSPPEPLGQDIQLPQALIAP
VGASPMAAADLTALIPEGIPGYDRPQNPFEGWPFQAPEPPELPPLEVPEP
LAPGQMSLEEAMAADASNRNDDRRGEDPRREQQEERTVELPRYQPGELGE
LGDEFEPPEALRFAPEPDDDLPAAADLPPPAAALADFTSADDPEALSAEY
LSQVPESPVAEVPGDTLENPLDPAEHSQQPEAFSPPEEALQAYNALSPLP
PTEQLFPSTAAQDPLVPPGSPPQAVPGAPELSGAGSAEGSGPSLGTGDPR
SLFSLEASFAAEPGSTPFVAGNLNPPFPGDASALVPPGGPGQDVLPAVDP
PDTAAQEFLSLEDRFVQEGFLLPDQLAAPEILPDEILPELPSPELLEAEV
PPPPDGLPTAAVDLAASPPTDNGEPLLGEAESLIPAVDPEAAEPTMQPSI
LEAAADGTASPFAGYPPELLTGAHGSPVGEGPIPPVDFFSAPDEGSLPPS
VAEPFAQGDELASTGQPGMIPQPEPLGAQSQPREQSDPAIEPQSIAQAEP
LSAATATPSEWPEAEPSLPAAPAEIPFLEQAEPFYADESAPESPPGDGTS
PVADAEEAFPVPPLIPQATDPAADFPLPVEPVMLTPSELPGEVTASEALR
ATASSEGEPQRYVPPDDAGASAFSSGVPEFSTLPPESESSVEAPGLDAPV
ETLEEVAFLPTQDELADNEPVEFVEPLPVVSEMESEAAPEESVESVPGPE
DALIREVGGGGQAFGAPAGEAGIADTSLLSTETSPQVFPSPATPDEIGQS
AADTAAGREAEMPPRLQDPELPESPLVSILDSQTVAEQPTVVESDPQPTT
GSEPQEFVPSAFEAASRSPSEAQIFQPEPLADPGVPLREEPSIEPAVLST
AESINAGDLGETSESPFVALEEQFPREGLPESVLPAAAEPFVPADIPAPS
SGGELQIPIGDEQPPAAFYEAAGLTAQPLQAQMPTRPEASGEPMPPEGFA
AQVEPSMLESLAPEPSPIAEISSPTGSESIAVPQLSLPEQPAAEPDLTVP
LELPLDDETTAEQVAQAQASELSGETPALSLPQTVELSGQSESLLSFETQ
APGEAPEVSFPVETETVESAGKSDGSVSSLEAVESIGELEPFSVLEAEAS
EPSGQSDTFLSPVTTEPTDETLSVGMPASQLGIPGGESEIFALQEAETAA
AVPEATLLSALQGPPAAEALGAQPEAILPLEGQDPQATEFLETAGEPVFV
SLEAQFSREQPPGSEPVAGLTAQTPERLAQPQAVGETTAEVAEWFAPPEA
IETPTEAPESLAQPEAFAEGASEVLDLRSQPQAVGEGGVFDELDPLADRP
PDLLGRLQPLSEQALPEQSGASLLERPEPLGEAQPLTPQEPLAETELAIP
LEPPSATASTDDAVPFARLQTPEEFQPSEAPLRQTDSPIEPTADELEPEY
EALVEAGLETQPVAVPTPELAAPALDVSEGEFFSEPLLQAEQAELPPQPE
PLSALSAFTEPEEGFAPSPTAEVAAVAPELAAAAEPTSDDEIAVEEVVAF
QPPELPGEPDDFLPLQPAELSGQPEALLSPPLEALEPSGGPETVASIGDG
DLDAVLPPPEAAAAGGEAGIIPQPERTVVIEPSEPDSPSPSLEPQAAEFL
ETAGEPVFVSLETQFTHEHPPGSEPVAGLTAQTPERLAQPQAVGETTAEV
AEWFAPPEAIETPTEAPELLAQPEAFAEGASEVPDLHSQLEAVGGGAVPD
GLNLLADRPPSEQASPEQSSATASTDDAVPFARLQTPEEFQPSEAPLRQT
DSPIGPTADELEPEYEAPSLEEQFAQEQEQPTSTISVDGTAVAQEAPDTE
VGFTEQVTWTPPEVLDPVESVMPPAEPLLPAEESVPSEVGVIEPSAEPAV
QLEPQAVEPEALISSGLPGEVLTFGEETVAPPEALVEAGLETQPVAVPTS
ELAAPALDVSEGEFFSEPLLQAEQAELPPQTEPLSQSPPEESPAQPWIPL
EEQFVGEELPESQSQVLPDEASAVTSPAEVPGEGQETPELTFAALEDRFS
QEEGFEPGMFTAEPPVASEDPSINLELPLETVQSELPAELPGPPQSTDGL
PQTVVPTGTAIEQATAAETAAAAAEPFGPSLEGDSALFEEQPIEPEAQEL
PLSALSAFVEPEESFAPEPTAEVAAVAPELVAAAEATADTEFLETAGEPV
FTSLEAQFSREQPPGSEPVAGLTAQTPERLAQPQAVGETTAEVAEWFAPP
EAIETPTEAPESLAQPDEISEGAFEATELPAPTQAIGEGGAYDESDPLAD
TPADLLGPPHPPGEEAPPEHSGTLLLEQPEPLGQSADEPESILPAEALGE
LEPLAVSEPLGELELLGQPEPLSEAQPLSSPEPLGEGGSIAPELLIAQDA
PVGGASEEDLLSLEEAFVDRSAAPAATESAAAEAVALEPGDSATPATEEL
PPAVARGVEPSLSVSQPTFESSAPLPQAELPEDRTAGEHPEPIHQTAPET
VYPELLLPERGDGQAGLEDLPPAPTAVSPGQEAPSLEAPLVTEAPPVAEL
PVQSEDAAEPAGLTGELPEISAEPEWRQDAQTLPIGESQAEETESADGPA
VAGREETFEIDELAAATEGSLEPLGAAFDLGAFPGLQTQHLGSSDPLGST
DYLGVGSSAALLLPSVVQDLTVLRSLVEDEDDLAATLPDGPELPALAAIE
ADLTPVDLVPAVEPDLPPLSEREIWPGWSGDRSTPDPTALEALPVDGEAA
DDRSESPLGVARDLTVLRSLSEAALPLEADGIVSASPAFALPDEQSETTE
IEPTRPTIDDTQEIPDADGAFDPALRTDEPAPEDDPVAAEGPLGVAQDLT
VLRPLAGDGFSPTVAQDLGVLTLLGVAAGVFATAGSEAIDPVEYGPLLYA
LAEQAEGPLDEADGGLEALLTPLDATVPEPLPLESAAGRSAAADTATSRP
LGMDVQPLGVPEVPALGLETAVPLASVADLQDFDADLPEADLEAPTLPIA
SERTLGQGPLLFAAQLLADQQPEEDAEEEPEVPPDDVALQSDFAQQEPDG
FSGAGEDEMEVEPLVLQEYLLQLEARKNAEEEDEEEEEEDGGGSDLPTVA
GEAAALSGAASAPLLFAGSGALTDALGGRDHQIPDNWSDIEELLTVSAAE
IGLGTTLPADTGSSSSTGGGGGGGGGGGGGGGGGGVPEAASTYSGYMGDS
MEQVLLSGQLPAHVPPPAGAAAAAAAAPGDPEEAEEEGEGENADLEMLAQ
EIYGLLRQRLEVERERSGHHYLGRMPW
>gid:533834  gll0518  
MTTRDIQAQLQEMYGVEVSPTLISNVTDAVMDEVRQWQNRPLETVYPIAY
FDCLQVKVRDNGRVVNKAVYLALGVDLEGRKELLGLWLSAHEGAKFWLGI
LTELNNRGLKDILIACVDGLTGLPEAIESVYHGCLVQLCMVHMVRNSCKY
VSWKDRKSLCADLRSIYSAATEEEAELHLGGVVPTPLELLSEKWDKQYSS
VSRMWRENWGSLPQRPWVRVIPIFRFGEDIRKVIYTTNAIESLNMTIRKV
SRNHHIMPNDESVMKMVYLAIQNQMKKWTMPIRDWRPALNRLTIEFEGRL
KV
>gid:533835  gll0519  
MVAAKKVQYSLRQEELMTIRKEILDELLKDYDGTDPQTILGEGGLLKQLT
KAVIERALEAEMETHLGYKKHEAAGKGTGNSRNGKSQKTLQAECGPVELH
IPRDRNAEFEPVVVRKGHTRWINGWSDTPGGTR
>gid:533902  gll0585  serine/threonine kinase
MTPEQWQQLSEILGDALDYQGAERLAFLDGCKLEAEQRRFIDQLLTIHEQ
QPDFLSAPMLSLLAGLADEAVPVGAPLRAGQRLGAYRVVRELGRGGMGVV
FLAERADGQFQKQVCIKVLQTGWAAALQVGRFLSERQILANLEHPYIARL
IDGGSTEAGVPYLVMEFVDGMPIDRYCEAQQLGLRPRLELFSKVCQAVQY
AHTCRVIHRDLKPSNILVNCEGEPRLLDFGIAKLLDPQGRSSEPTRTDLR
VLTPRYASPEQIAGAELTPASDVYALGVVLYELITGQRPAGAQAAASYEL
AWTLSDQTALLPSRAVGEQPTRAFAPAAAEGGAQGLSRQLEGALDRIVLR
SLSKPLDRRYATVGELLTEVRRYLDGAERTGMRPWPAGALARAVAVVALN
GAAFWVANELTARSLLAGPGWFAAAGLALLAGMMIYGLAEAPRASGRRRR
LAVGTVLLELGLLAGLFWAANPFVRLLSNPPNHAIAVLPFVNLGGDKQSA
YFSAGMAVDITNQLGKIADLKVIASSAAAQYAESPKSLGEIARELGVSSV
LTGSVRREGNKVRIVSQLVDPATGEQLWSGDYDRQLSDVFKIQEDVAQQI
AGRLQAKLSPAEKERVTQTPTGNITAYDYYLKGREYFERGRNAENDLAIE
LFKRALALEPNYALAHAALGNAYFRKAVDYSQGDVWLEASLGASRKALAI
DPQLGEAHRALGNGLRGKGFFRKAQDAYRRAIALAPNNAAALGSYGSVNF
IRCELEEGVNWTLRAVALDPLSTPLQTNLGDFFTILGDDARARQALEAAV
TLRPDDAYALWKWSQFYLLNGNFEAARQTARRLIEAEPRQLSGLYAAGDV
ERFAGNRSQARAYYRRILQGSGDLLAGTGYLLPTTVLGELAWRDSRRTEA
KALLERSLKLDKAAIARGNEDSFYRFDLAAVYAVQGKRDEALRWLRAAVD
EGYCHYRFLRREPVFASLHSDSEFLQLLKALEQRVEAMRVRVANL
>gid:533904  gll0586  
MLMNFKYRLEPTTEQASTLQCWFETSRRVWNFVLAERKDWINSRKAPLNA
CSIRSEFIIAADAPWPTYARQCKALTLARKHNPHLQAVHSQVLQQVLQQL
EKAFISMREGGLGFPRFKKPGRLRSICFPQFASAPVRGERLKVPTIGYVR
MRLSRSIPEGFRVKQVRVLRRASGWYAVMAIQSDAALSEAQPHGEAIGID
LGLHHFAAFSDGELVHRPKFFVDAQSKLKLLQRRVSRKKRGSNNWRKVQQ
KVARLHERIGNLRKEFHRQPAHRLCDRAGTIFAEDLGCQALAASMLAKHV
LDAG
>gid:533924  gll0606  
MQPSVWQPPVEPSPAEQAILKRIRRAKLFVFLRRIRHQLFDAAFQSELAG
IYKDSPCGQPPIPPAQLALATVLQAYTGISDDETIEVLTMDRRWQMVLDC
LDCEEAPFGKATLIRFRQALIAHGLDRRLIERTIELATSDGGFGSRALRA
ALDSSPLWGAGRVEDSFNLLGHAMRKALRIMVCQTGVGLAQWAEQTGTTL
VAGSSLKAALDIDWSQPDECAEALGRLLGALESLESYLDGQTQPPQAGVA
RCLQAAEQVHQQDVQLNSRGQFVLRRGVSRERRISIEDPAMRHGRKSRAV
RIDGYKRHVLKDIDSGLVRAVGITAANRPEAAVTRDIEADLEPQRVKLIE
LHIDRAYLSSEWVRLRPEDLRIYCKAWPVRNGPYFQKTAFVLDWEAMRIR
CPQGVTQPFEVGGKVRFPAARCRRCPLQERCTTSPKGRSVSIHPDELLLV
ELRERQQTKEGRAKLRERVSVEHSLAHIGQWQGRRARYLGVRKNLFDLRR
VAVVHNLHVLARQEAAGRSQAA
>gid:533932  gll0614  
MLLATLAKMLIKLTAPTPDERPLEPQCAIGGRLNAPVPVSPWPSPATVQL
NANVGGSQGRQMSPYTRQMISLSDAFEDVARVLPGYEVRQPQLDMARAVE
RGFAERIAVVAEAGTGTGKSFSALVPAILSGKRVVISTATIALQEQYLYK
DIPLLQKALPVRFEARLVKGRSHYVSKRRWSESLLAPGITWLREWYEQTE
SGDLADLPTSPPSEIWDDIRSDKDDCLREKCPHFDSCFYFESRRSLGQAQ
ILITNHALLLIDRASHGQILPDFDLLVIDEAHQFAEYATRALTLTLSNFG
MGRTLSRIKKQFPNLGIALARAEAEANAFFEVLLTEAHQTRRYNLDPAMA
DDLSAAVVRLLQSLKELNLGSEDNLETNVARMRRDRLVETLTGYEGNLRV
LAEPGENWVNWVEYQTTRGGGTNVALNCTPLDVAPPLGQWFSHPDGPTAV
WMSATLSTAGADPFDFFRRQVGLPAGTAEERIFASPFDYPSQGLLYLPTH
LPDPNDASYANAIAAEIEKLVNFSEGRAFVLFTSLQQMKQVYALLEAKLL
WPAQHQEQMPKRRLVEWFRTVKNPVLFATASFWEGVSVEGPQLSLVVIDR
IPFQSPGDIVYDARCELLSRTSGDRWAWFDRLALPYAQLRLKQGAGRLVR
TRTDKGIVAILDPRMHRKAYGKTILKSLPPMQVSRRFDPRLFRAFIDPAG
TGVSVEDDELWVQDFGLGI
>gid:533975  gll0656  endonuclease III
MSAIRQERRARAERLLVKLKVAYPRGLTLGLSSTNPFEYLVATVLATQCR
DERVNKITPALFARYPDPAAFAAADYEALLPLVRPTGLGPTKARNLTAIG
RLLLERHAGKVPATMAELTALPGVARKIANLVLADCHGIVEGVAVDTHVR
RISKLLGLTDSTDAAKIERDLMDCLPRDAWRSWNNLMVEHGRQCCVAGAP
RCTACPLVEDCPGGRELTAELERRDAAGAVLP
>gid:534051  gll0731  
MSCWVYLVRCRNGALYCGQTADLEARLRLHASGKGARSVRMAGFDRLAAS
WPVADRSAALRLEAAIKRLDKATKEALVRQPERLVTLAPPTGPGLQQSNE
IFHLNGVGDVADSGCASL
>gid:534188  gll0866  A/G-specific adenine glycosylase
MGYSSAKPSLNNCSEPEQVVRLRAQLLEWYGRMGRDLPWRRTRDPYAIWI
SEIMLQQTQVKTVLPYYQRWLAALPTVAALAAAELEAVLKLWEGLGYYTR
ARNLHKAAQVIVKEHGGVFPETAQQLQQALPGIGRSTAGAIASSAFGRCE
AILDANARRVLGRLFAVGDPPARAEAKLWEISQRLVDPQAPHNFNQALMD
LGATVCTARSPLCLLCPWQVDCLGRRSGDPTHFPVRPARAVRSEIAGVSV
AIECQGKFLLVRRPERGLLAGLWEFPFVESVGGGEPEETVRVAFGNRLES
LERLGQVEHEFTHRHLTAQVLRAQWIAAPAALPKVFDCREHTWQPPECWL
KFPMPGYVHKICKLLKEALPKVSVD
>gid:534408  gll1085  
MIFWYTVPVLPARLVRDAAGRLHHPHQILESFMSSESLDSWQKLDSETLG
TNPYWSYRRDHFRTAGGNTGIYYYVQTPGSVLVVPLMDEKTVLMVRQYRY
LRDCESLELPGGGRKIGQSALEGAQNELREETGFRAALWQEVGGFNPCKG
LTDEWCTVFVCRQLQSDPLEGEDPFEVTAAVPVAVADIPSKVADGEIWCG
MTVAAWFLAARTLTD
>gid:534416  gll1093  
MLLDFPQLVKTALSSLPNDDFPVLDSRLFFSCWLALVMDKSTVSMRDLFK
RVNHTGIPVDISTFSKACKSRSLQIFEQLYQALLVRVRRELPAKKLHPCP
IDSTVVGLTSKLLWAQDYHQVKLLTCLEHGSGATEGSLINFGYDHDSNFV
NDMLEAIPENGVGIFDRGFAGLEYLKNAQASSKYFLMRIPSNYKLTFEGN
AGQMRVGTGKESGVYRVVNFCDIENRAEYRLVTNLPAEGEWLVRDEEVME
LYRQRWQIELLWKFLKMHLKLKRLMTKNENGIRMQLYITLIAHLLLELVS
VPKIWGSQRLDKLRYLQCCMCQEISYVHWLGKLLGSQRRRARLPRACTYV
H
>gid:534428  gll1105  
MLLDFPQLVKTALSSLPNDDFPVLDSRLFFSCWLALVMDKSTVSMRDLFK
RVNHTGIPVDISTFSKACKSRSLQIFEQLYQALLVRVRRELPAKKLHPCP
IDSTVVGLTSKLLWAQDYHQVKLLTCLEHGSGTTEGSLINFGYDHDSNFV
NEMLQAIPENGVGIFDRGFAGLEYLKNAQASSKYFLMRIPSNYKLAFEGN
AGRMRVGTGKESGMYRVVNFCDIENRAEYRLVTNLPGEGEWLVRDEEVME
LYRQRWQIELLWKFLKMHLKLKRLMTKSENGIRMQLYITLIAHLLLELVS
VPKIWGSQRLDKLRYLQCCMCQEISYVHWLGKLLGSQRRRARLPRACTYV
H
>gid:534455  gll1132  
MRVVSVLGLDVGSKRIGVAGCDPTGLIASGLETIVRCNLGADLDAIRHWI
ERRRAQAVVIGLPRNMNGSLGPQAHRIQHFGQQLARVIDVPIDYVDERLS
TVQAGRALQSVSATRRKALIDQQAAAIILQQWLDIRRCQHRPTQESLDER
HIDTER
>gid:534518  gll1195  
MLLDFPQLVKTFLSALPDRDFPVLDSRLFFSCWLALIMDKSTVSMQDLFK
RLNHTGIPVDISTFSKACKSRSLQMFEQLYRDLLARVRRELPVKQLHPCP
IDSTVVGLTSKLLWAQSYHQVKLLACLEHGSGATEGSLINFGYDHDSNFV
NEMLQAIPENGVGIFDRGFAGLEYLKSSQASSKYFLMRIPSNYKLTFEGN
AGQMRVGTGKESGMYRVVNFCDIENRAEYRLVTNLPAEGECFVSDEEVME
LYRQRWQIELLWKFLKMHLKLKRLMTKNENGIRMQLYVTLIAHLLLELVA
VPKMWGSQRLDKLRYLQCCMCKEISYVHWLGKLLGSRRRRVWSLKLCANV
Y
>gid:534945  gll1611  
MYVDQSGMDERDDYGYGWSPLGERFYGLKAGRRQGRINMIAGYRAGQLIA
PFTVEGACNRTVFEIWLESCLIPVLQPGEWVILDNATFDHGGRIAALIEA
AGAHVLYLPPYSPDLNRIEKCWAWLKSRIRKRLRDCGHLRNAMDAVLKQA
AS
>gid:534946  gll1612  
MAKPYSYDFRQKVLQAIELNGLKKSEASELFDISRNTINLWSQRKAETGD
VQAKPRPASHKGQKITDWEKFRAFVEAHGDKTQAEMAQLWDGQISSRTIS
RALHKLGITRKKRPTGIANAMRRNAQHS
>gid:534962  gll1628  
MTTRRYALRDDQWERIKDLLPGREGTVGVTAKDNRLFIEAVLFRYRAGIP
WRDLPARFGDFRVVHTRFSRWSKSGVWQRVFEHLAQDADNEYAMIDSTIV
RAHQHSAGAKGGSVTLSNRSQSRGTKYEDPCHGRCFGQPVELSSDGRSGE
>gid:535097  gll1761  
MTSERAYWVAWRQSPGVGPLTIRRLREHFGSLQAAWEAAETEFSAVEGWS
ARKGLESRRYRERTDPGLLLEALEDRWPSFWTPADGDYPPLLREIPDPPP
LLFYRGPLRQLSPAVAVVGTRHPSDYGRRWAAQIAETLAAAGFLVISGLA
AGIDGIAHEAALGVGKTAAVLATSLTRVYPPEHAGMAAEIAHSGLLLSEY
APEEETLTHNFPRRNRIVVGMSLATIVIEAPERSGALISAYLACDYNREL
FALPGAIDTPQAQGCLKLISQGARVILGLEALLADLGAQLVRATVPSALP
PVLDGDEKRIWDQIQGDGCSFDELALGTGLTTDRLASALLTLELRGLLVQ
QAGSRYGRAL
>gid:535157  gll1821  endonuclease III
MAIETMVTPGGVRAVMEGLAATYRGRGSVELGEPYRVLVSTVISQRTREE
QTTAVSQRVFARYPDMASLAAADEKELLVLLAGSEYREAKGPRLIAMATI
LLEKYGGRVPDDIDALLALPGIGRKTANCVLIYAFNREAICVDTHMHKIA
NRLGWVTTKTPEQTEKALEVVMPRDLWAGSNRLFLQHGRAICLSGAPPLC
SRCPVRPWCAYGQEPTARKR
>gid:535304  gll1968  
MKVMSTTSHKAAYLQQVIKVLGFASLALSCKVVQQPVYAQSPDPRVPEFC
SQVGTDNKLKNQCDDLRRKEIEYLSQQFDAAISAQNKPILPTSGKDVKAL
EGKTEIEGNAGQIEATILAYSSMELIAQAIRKQIKERASDVKTLIIEDPD
VVKSLSNYELLREIGSSLIAAYANIFDSPLVPDKMIEKSFDEMGKGSKKR
LYKKSFEGGVDLDLPLSIASSTIRAVADIFSLFKQDITIKPTNISLEDNA
REALISQLAADFQQCNESRCTNVYHPSLLLLDLTHYREEFSDKEKMSKNH
LDRILRDFVILLAYKQMADSKITELKDNSDLESKQLVRKLTDLGASTNIF
LSGLGSTEFAIYSLAKGAKIEDLLKDEGTYLLRLNVINSGGSTQAKQNLF
FTSDPAYNGGVAVTYMLFARDGSIRLSDTLYKTSGYLHVTSETTNIYRAR
KFNAERPMPTAAQTKETEPVSQIP
>gid:535342  gll2006  
MKAFPPLFSGNGRTRSIRLERYRELRGIVTVKTHWYLSSIEASASELGRR
IRGHWGVENQVHYPKDVTFGEDRSRIRTLPLVQVWSVARSFALNLYRSLL
MANRAQAQRRCMFGLSTLKILFRMK
>gid:535354  gll2018  
MQTSDTLLLAEAVGHLKRSDPILAAIIERVGDCSYQTSAAGTHFDAVVRA
IVYQQLSGKAAATIHKRLCDLFDGRPPLPAELLAVEAAALRGVGLSRQKL
NYLKSLAAQVESGALAIETLHILEDQAILAELMRLKGIGRWTAQMFLMFR
LGRPNVLPEGDLGIQKAIQLAYSLKALPSPKQMAAVSEPWHPYCTIACWY
LWRSLE
>gid:535441  gll2103  serine/threonine kinase
MPFSEGELVRERYRLCRHLGVNGGRETWLAADGLCAEVSVTLKTLYFGRS
ATWQDHERLEREARTLASLDYPGIPRHRDAFWVELPEGHYYCLAQEYIAG
ITLAERVGSGGRLGEADIVRLAANLLETLAYLHSQAPPVIHRDIKPSNIV
CTADGGYALIDFGSVQAQSSTATLTVAGTFSYMPPEQFIGRAAPGSDLYA
LGATLLFALTGTDPADFPRRGLHLIFQSRVGVGRPLVRWLERLLEPALEE
RLGNAREALETLKHREALYAASSCGRALRPLAAGNPWKPFALLGGSLLFA
VAVPLVTLFGAGASLILYLYLRQVLDPFPTHVHLLNALAIVAVGSLLCLW
WAKSTADDAQP
>gid:535465  gll2126  
MLLDFPQLVKTALSSLPNDDFPVLDSRLFFSCWLALVMDKSTVSMRDLFK
RVNHTGIPVDISTFSKACKSRSLQIFEQLYQALLVRVRRELPAKKLHPCP
IDSTVVGLTSKLLWAQDYHQVKLLTCLEHGSGATEGSLINFGYDHDSNFV
NDMLEAIPENGVGIFDRGFAGLEYLKNAQASSKYFLMRIPSNYKLTFEGN
AGQMRVGTGKESGVYRVVNFCDIENRAEYRLVTNLPAEGEWLVRDEEVME
LYRQRWQIELLWKFLKMHLKLKRLMTKNENGIRMQLYITLIAHLLLELVS
VPKIWGSQRLDKLRYLQCCMCQEISYVHWLGKLLGSQRRRARLPRACTYV
H
>gid:535466  gll2127  serine/threonine kinase
MVVALSREGLIRGRYRLVERLGHNVHRQTWLAGDERTAEQVVLKALAFDN
EMQWQQLKLAEREAATLQSLTHPCIPRLVESFWLELPEGHYFCLVQSYIP
GQSLAAMLRAGRRWSPGEVLNIARQVLEILDYLHSQSPPVVHRDIKPSNI
ILSDDGRLFLIDFGAVQAQSLPDQTVTVVGTFGYMAPEQFYGKTSPASDL
YSLGMLLLCILTGTEATEIPRRGLQVELPSTSLVEQPMRAWFACMLAPEH
QDRFDSAREALESLGKAETAKMPEVLNVRN
>gid:535526  gll2187  
MPRANRIEIQESAEELRALLRQRASSEVKERIQVLYLYKTGIVTTEQGLA
AVVGRSTSTVFRWLQIYRSDGIAGLTRTQRSSGRPADIQGEVLEKLKERL
KQPDAFKSYKQIQEWLASECGIDVSYKVVYDTVRYRLKVTLKSTRSRSLT
LSASQFRLTAQRRFSHRPL
>gid:535583  gll2243  
MKTHIAGLLALGLLLSAAPGMAQTTTPYGGSSSTTPSNPRSNPTTDESGP
RSTEPRTDATPSAGTPAAERPGSSMDPMTRPGSSMDSKNKMQPTTPSSGN
PRSNPMNDKSGPTSEPRTDATPSAGTPAAERPNTSMDPMKPDTSTKKSNK
TSSGMGKSSDRVDVTRTTDLPESQSRSNDAVIADPPSGVSNTDPNLQPKS
TTRPAGEPKPKP
>gid:535589  gll2249  
MSQPPPEIQVQNLDHLGIVAGIIDAFGLVEEVDQLLSIHPQEIVTCGQVL
KGLILNGLGFVSAPLYLFEQFFVGKATEHLIGPGVLPEHFNDDRLGRVLD
KLYEQGTTKVFVHLALKAARHFGVRTGSVHLDSTSFHVDGEYTPKGRVAP
QAEDEPQPIVITHGYSRDHRPDLKQFLLSMITSGDGDVPLYLRVGNGNAA
DKAIFAQIIQDFRAQWDVDALFVVDSDLYSAQNLSAVQAMHWLSRVPSTL
AEVKHLLAALSDEQFAPAQPGYRVTEVGSTYADIQQRWVVVASDERRKSD
LAALDKKLNELEGKLGKELTTLCKTAFACEADALEAAERFAAGLKFHLLG
AVRAVEQARHDKAGRPARGSKPEKTGVWLLSAQLVRNPTAIELEQHSAGK
LVLATNVLDEQQLSTAQVLSQYKAQQSVERGFRFLKDPLFFTSSVFLKSP
ERVEAVAMVMGLCLLVYSLGQRQLRLALSAAQLTVKSQTKKPTATPTLRW
IFQVFQAVHLLEVAGVKQVSNLNAERRRIVECLGPTCGQYYLMG
>gid:535648  gll2308  
MILGIDPGRSKCGLALVGLDRKLYFRSVVASDQLLKQVERLLEDFPVAAL
VIGDQTTSEYWQAQLRAAFPEVRLVAVPERRSSEQARARYWQFNPPRGLN
RLLPQDFRVPPEAYDDVVALILVERYLVGLIDADRR
>gid:535665  gll2325  
MTFADLQLHPDLVRAVEQLGFTQPTPIQTLAIPPALAGRDVLACAMTGSG
KTAAFLLPILQQLIAQPRSTTRALVLTPTRELAGQIEMHLGQLARYSALR
GASIIGGVSPEPQARAFHKGVEVLIGTPGRLLDHFDQPYARLANLEILVL
DEADRMLEMGFLPDIRRILRHTGHRRQTLFFSATMPGPIAKLATEILRNP
VTLNQERKAAPAVGITHSVYPVAQNLKSKFLLELLEREGMRQVIVFTRTK
HRANRLSEYLSKARISCERIHGNRSQGQRTEALAGFKSGKYRVLVATDIA
SRGIDIEALGHVVNFDVPQAAEDYIHRVGRTARAERTGDAFTFVSPEEED
QLRAIERAIGKRLPRLTLLNFDYAVAPQGSRLSG
>gid:535709  gll2367  
MATLSGLVDTHVHINYENFAADLDAVAHNWRAAGVVQLVHACVKPEDFPG
MQALADRYPEVFLAVGLHPLDVERRWDAALAGRIRDYARSDKRVVAIGET
GLDFFKADNPAEQERAFRDQIALAQELDLPVIIHCRDAAPATRRILEEMG
PMRGVMHCWGGTPEETRWFVELGMFVSFSGIVTFKNAILLQQSVGVVPDE
LLLVETDCPFLAPVPKRGKRNEPAYVAHVAEKVAELRSKTPEAVAQLTAA
NARRLFRLPTPAA
>gid:535760  gll2418  
MHILHGTWIPDQSDCYVAPGAFYLWVETPPHKSRTADAANVHPAHLKKQA
LQQFLINTLGIAPNPQSGIVACYFLLPTADDRPLPSLESARHQQIELPES
SALQYWQVDCLRLPLVPQAVGSPGLIRLLHDLHFQLQEQRSEVQFGSDLL
FWHHFCQFLKVQIFKDRYIPALRYRELPRGRGKSKTAPDELYPGWELVCE
HYAAAVAQYAEGMPTVCASGRADRPQVAACFAKQDLLHHFGENLLHAAVG
AIPVTAALGQQIHNSLVQSCLTPRIQPWSAAVPGALEDYRHWRRWRERLS
AARVAAPFVLCLQLLEPEEADEDGDEAVLWQLRFAVASRSDPSWRLSLLE
YWQAKAEERESWRNHLGEAFERHLLAALGYAARIYPALWEGLHTSHPVGL
DLEVEEAFAFLQDAAWVLEEAGFRVLVPAWWTPEGRRRAQVRLRARAGRS
ATGPDGAGRGHFSLAALVEYSYALAIGAQEVTESEWQALVALKTPLVRFR
GQWLVLEAERMEKLLTFWREHRQERPAMPLLEWLKLTASQPDLDIECDAP
LQTLLTRLHDPSRLVPLGDPPGLLGTLRPYQRRGVSWLSYLEQVGLNGCL
ADDMGLGKSLQVIARLLYEREEQAGEGPTLIVAPTSVIGNWRKEIEKFAP
GLRVWMHYGTGRHQQADSLCQVCLEHDVVLTSYTLARKDEKLLAAVPWRR
LVIDEAQNIKNPKAAQTRALLKIPAACRLALTGTPVENRLLDLWSIFHFL
NPGYLGSEAQFRKRFELPIHKEDDRSRTAVLKRLVEPLILRRVKTDPAII
QDLPDKVEQKLFCQLTREQASLYAAVLKEVEAQIEQVEGIARKGLILATL
TRLKQICNHPMQFLQDGSAFSPERSHKLCRLDEMAEEVLAEGESLLVFTQ
FHEIGAALERHFRQVRRWGTYYIHGGVSREKREKLIADFQDPDSEPAVFI
LSLKAGGVGITLTRANHVFHFDRWWNPAVEDQATDRAFRIGQTKNVFVHK
FVALGTLEERIDRMLEEKKRLASAIVGSDEGWLTELDNERFKALIALGES
ALIE
>gid:535835  gll2491  
MRVHHPYHEIPVETLNVDFFNLSPLSVARRLIGCAVVRVLAGERLSGRIV
ETEAYGGLRDPSCYVVRRDERIWSLLSGPPGVLYLHRAYRHWLLNITCDA
VGEPACVLIRALEPTGGEERMRQLRRGARDLTNGPARLVEALAIDSAWEA
SALPRAEFWLEAGEPVPEEQVLNTVRIGLTRGKDLPWRFAVRDSPWVSRS
VEAVLSEASLSAGL
>gid:535852  gll2508  
MATAFSALGLSESVVKALDELGFEQPTPIQLKSIPFLLDGRDLLAQAQTG
TGKTAAFGLPLVDRSDPQDARVQALVLAPTRELAVQVCEAIHTYSKHSGV
RVLPIYGGQPIDRQMRRLRAGAQIVVGTPGRVLDLMRRGSLDLSALRTLV
LDEADQMLDMGFIEEVQTILDAAPPERQLVFFSATLPASIRKLAARHLRT
PMTLTMPAEERDTPAIAQRVYFVNFKNRAQALTRVLAAEDPASALIFTRT
KQAADELAEQLQDDGHRAEALHGDLNQSAREAVLGRFRRQQLNVVVATDV
AARGLDIADLSHVINYDMPQDGESYIHRIGRTGRAGRTGVAISFALPTDR
YRLRLIERATGSTLVPAKIPSAGEIQVRRTERLVEQLRRQLQADESDPSL
VLYREVVGQLREEFDLGDIAATFLKLAQQRPGSTGSG
>gid:535906  gll2561  
MTTRDIQAQLQEMYGVEVSPTLISNVTDAVMDEVRQWQNRPLETVYPIAY
FDCLQVKVRDNGRVVNKAVYLALGVDLEGRKELLGLWLSAHEGAKFWLGI
LTELNNRGLKDILIACVDGLTGLPEAIESVYHGCLVQLCMVHMVRNSCKY
VSWKDRKSLCADLRSIYSAATEEEAELHLGGVVPTPLELLSEKWDKQYSS
VSRMWRENWGSLPQRPWVRVIPIFRFGEDIRKVIYTTNAIESLNMTIRKV
SRNHHIMPNDESVMKMVYLAIQNQMKKWTMPIRDWRPALNRLTIEFEGRL
KV
>gid:535907  gll2562  
MTIRKEILDELLKDYDGTDPQTILGEGGLLKQLTKAVIERALEAEMETHL
GYKKHEAAGKGTGNSRNGKSQKTLQAECGPVELHIPRDRNAEFEPVVVRK
GHTRWINGWSDTPGGTR
>gid:535940  gll2595  
MVASAQLDLLQLFPFDLDDFQREAIAALDENESVVVCAPTGSGKTVIAEY
MVYRALAREKRVFYTTPLKALSNQKFRDFCSQFGPEQVGLLTGDISLNRD
APVVVMTTEIFRNMLYGMPLGEMGTTLAQVEAVILDECHYMNDSQRGTVW
EESIIYCPANIQLVALSATIANAGQLTDWITRVHGPTRLIYSDFRPVPLE
IHFCSPKGLFPLLDRGNQRINPHFKNIKKHLRGERNLQADAPSHKYVIGQ
LARRDMLPAIYFIFSRRGCDQALEELGDLCLLDAHEQEQLARQVDDFVRE
HPEAVRTHQLSQIYNGLAVHHAGVLPAWKALIEELFQQGLIKVVFATETL
AAGINMPARTTVISMLSKRTDSGHRPLNASEFLQMAGRAGRRGMDEVGHV
VTLQSPFESAPEAAALALSQADPLVSQFTPSYGMVLNLLERHSLETAQRL
VGNSFGQYLATLHLEPVRREHAEVSAELEALAGGDAPVSEAELAAYEKLR
GQLREARRLQMILKEQADREREQALEGQLKFALVGSLVVLTDPLDRVAKT
AVLVRKVPGPPLMLVCLTADNRWVVTGVGGVLDFQPHSGRFDEAEGLSIP
TYLMLRPGGHIAGGSESLDLSCRLPSLPVPEPPEAVTAQKNEVEQLRLAQ
ENHPAHRWSGRSAHQRAQHHREKLLKRHQRLAEQLSGESDRYWQEFLRLV
RVLEKVEFLDNHKPNALGAVAAAIRGDNELWLALALLNPEVEKLNAVQMA
GLAAALVSEPPRPNTWATVTPSPQVEEAIAALQQTRRNLVRLQRRQQVLI
SVWLEERLVGLVELWAKGVDWQTLCGSTNLDEGDLVRLLRRTADLLRQVP
HVPYLTDTVRQTCAESQRLLDRFPVSEAV
>gid:535965  gll2620  
MSQPPAEIQVHNLDHLGIVAGIIDSIGLVEEVDRLLGTHPQEHVSCGQVL
KGLILNGLGCVSAPLYLFEQFFVGKATEHLIGPGVLPEHFNDDRLGRVLA
KLYDEGTTKVFVHLALKAARQFGVKTGRVHLDSTSFHGDGEYTAGGRVVP
QAEDEPQPIVITHGYSRDHRPDLKQFLLSMITSGDGELLGASIDKDGLRG
ADLRGAVMPDGAVHS
>gid:536022  gll2675  
MASLCDLVGLPRSSFYRQSVFKVLEDQECETELRHQMQLICLEMPGYSYR
RITAELHRRGWPVNHKRVLRLMRQDNLLCLRRKAFVRTTDSEHGFRVYPN
LATNLMPSGLNQLWISDITYIRLQEEFVYLAIILDRYSRRVIGWSLSRHI
DMELSLSALRMALSNREVLPGLIHHSDRGVQYASKAYIELLEEHKIAISM
SRRGNPYDNAFAESFMKTLKYEEVLLNEYGDYREAQGNIARFIEEVYNRK
RLHSSIGYLPPVEFEARVEHEISPASHPT
>gid:536023  gll2676  
MHIMSKSRRSFTAEFKLQVVREVEADATVACVARLHQLHPNLISHWCTQY
RANPQTAFRQSARGQQQEAQRVAELEQMVGKLTMENQFLKKVLERLERHA
KERNTPH
>gid:536024  gll2677  
MITPDSRYRGHRFPPEIIAHCVWLYFRFPLSYRDVEEMMAVRGVQLTYET
VRAWCRKFGQTYANQIRRRRPRPGDKWHLDEVVLKINGQTSYLWRVVDQQ
GNVLDILVQSRRNKAAAKKFFRKLLKGLRYVPRVLVTDKLASYGAAKKEI
LKSVEHRQPRGLNNRAENSHQPIRERERRMRRFKSAGHAQRFLSAYGPIR
QHFCARRHRLCAEAYRQTLQQRLASWREIVGIAEVS
>gid:536031  gll2684  TetR family transcriptional regulatory protein
MFSEVGYEAATTHGIAERAGTAVGSLYQFFPDKRALFKALEQRHIERVHI
AWAQLDALPLEKLGFEGFMQQLLATYAGIFADATSRVVFVLFYTSRQIFQ
SIDEGFTADAVRFTAKLLTRRNPRLVAEEAHLLAEVCVHAGNALVLRALQ
SDEDHGTRLFAQIPELLGAYLRPHVGEACGADQVMKVMKCPRCGSERLSR
NGHRHDRQRLLCKDCSRQFLLPVGQSA
>gid:536084  gll2737  
MPATLWETSFTETFLNELLNVPQTIQDRVKRTIKLLKRDPVSAKGNIKRL
KDFKNNVYRIRLGDYRLIYSFGSGWIKLLSIRKRDESTYELGLPEFEVPG
APPDPALLEPQATDEPVLVPVYIPEEPESLPQTVTRENRVTTSLPFELTP
ELLKQWQIPEEDWSEVLAVRTSEQILDLPLPNNLISRLLDNLYPRPIEEI
AVQPEFVLQQPEDLERFVEGDLIAFLLKLDPEQEKLRDFGSSGPILVKGG
PGTGKSTLALYRVKKLLDAGHSPVLFTTYTNALVNYSAQLLEQLLGQNAA
TAGAEVSTIDRMAYQYFSQTYGKPHIVEDNEAVVLLEEALKTTAIPATNA
FDRGVRLEVLKRLGVPYLLSEIRTIIEAWELTTPEQYLEIERRGRGTPLK
ANIREAIWAVYQTWSQLLAQKKLITWEQLHSRARDLVESLPQPPYQAVVV
DEAQDLPPVKLRFMMSLATSPGGVYLTADASQSLYHRGFSWKQVHADLKV
AGRTLLLKRNYRNTEEIAGACIAILQNSEAGDEECLYQHPSPHRGDAPTI
LLSDDFEGQVSAIREFLIAAAQKFRLPLHGSAVLCPSNHMGMAIAKQLDS
LDLNAKFVSRREIDIRKPYIKVMTLHSAKGLEFPFVVVAGFDEGNIPYLD
AYIPPDEVPTVLDEQRRLFYVGCSRAMRALMVCGSRSTPSRFLDSLVAPY
WCRQELL
>gid:536088  gll2741  
MSVFLPGTEVQARGLRWAVVTADSLGPQTLYRLRGLEGAVLGQELDLLSP
FEEINPIERDLRPDKAAPLRNWLVYHQAFLLEQALGPNALLAVQPGRLRL
EPYQLVPVLRAIRMSRVRLLLADAVGLGKTVQAGLVITELMARRIAHRIL
VVCPAGPLLEQWKVEMSERFGLRLEVIDRAKLEEVRRGAELGANPFDHIS
LGLVSIDFLKQERILDQLERASYDVVVIDEAHHCMDLGSNEREDSQRRRL
AEVLARQCDAFILATATPHDGSDRSFASLCELLDPSLVDGQGSLRPERYR
AHVVRRLKSHIKDASTGQPLFRERQVKPCPVIPLPNAHSRFIELQRALLE
LLAPQLRRAFKNRNYSDVLAFIALLKRSVSSVAACKRTLSVVAERFQAFL
SEGVENQERRRQRLRTLRDYNRKLERFGSLSAEEEEAQSLLEMEDLAEQL
ASLEREVRGGSREVAKFSSLVEALDNLVHLAGEALEQDPKLDQFIQVIQA
IRAEEPRANVLVYTEYIDTQQAAVRALKQAGFRDVLTMSGEDDEKMRTGT
TERFRSEDGLILVSTDAAAEGLNLHQRCHQLIHLELPFNPNRLEQRNGRI
DRYGQQHDPIVRYLFLRGTFEERILLRLIVKYEKQRARLTFVPNTLGLNT
STEAGEIRLLKGLMDEDTRLFEAEDTLFDLTEAKEDEGADEATRELLEEV
DRSLKSFEKAAQTNTWLGNLGMNAGEDLVREASEARALGHRAGTVDLAQF
VADAVRLGGGSVQPTIHPEIQQIRLPSDWTYGLDDLPGYDAASRTLRLTT
NLDLMRDPAGLPVGFLGRAHPLVRRALDRVRNLSFGQDIHVGQDPRVSAV
KGKVSEPTLLFTFLGRVSSGAGREFERVLAVCSTAKSEPEFYQTAEDWLP
LADPGQAIKTTDVYKNHFVGWFTAAQGKARRVASEGFQPMAQVFSGQQQQ
ETQQEQERQQSWFEQRVSEIVPQLQQANLFAPLRTAASAASVPADPNWGS
MSEPVAKLAAFAADGTQLPRLRSEAEGVLRLYRQRLALIDARATLGEPEM
VLLGVLMIVPETAHAH
>gid:536216  gll2867  
MPRTSSRERLIRAAAELFAAQGVRETTTRQIAERAELNEVTLFRQFGNKH
ALMLAVIKESGLFAALEESTDPSSAGGGAVRTRFAHLADDCLQAFAQLPE
LLLSAVVEAERDAAEERIAVGRSLEGVNRSLANRLEAALGERTGGQDLFK
IATLLNTLLLGRAVSAYAGAPQTPWVEREDFLASVVELFFCGVFDKTSAL
QTVNDLDGELVHRILDGAKKSGAQDFALVYVLFAAGLSAEEVSTLARTDQ
VFDKDRALLQVANRQVPLNRQILGRRYGSYNNNPLTRWLKTRRDGCQAVF
IDEIGKPMTPEGVRFRWSVCTRELVTTGAAPPALEQACETWCVEMLTRGA
NLETLALLTGQPPELLQPFARRAQEKLALETALKLDR
>gid:536279  gll2930  
MSLRIYGNRLLRTPGGLDTRPTLGRVRTAVFGRWHGRVEGCHWLDLCAGA
GTMGAEALARGAAEAIGIEQAAGAARIAAANWDKIAPGLGTIHKTDALTG
VARLAKQGRVFDLIYFDPPYRGELYLPVLEAVVACGLLADDGELAAEHGE
DLPDLPEAVGSLVRLDRRVYGGTVVSYYGFLGAAASNSCG
>gid:536333  gll2984  
MQPSVWQPPVEPSPAEQAILKRIRRAKLFVFLRRIRHQLFDAAFQSELAG
IYKDSPCGQPPIPPAQLALATVLQAYTGISDDETIEVLTMDRRWQMVLDC
LDCEEAPFGKATLIRFRQALIAHGLDRRLIERTIELATSDGGFGSRALRA
ALDSSPLWGAGRVEDSFNLLGHAMRKALRIMVCQTGVGLAQWAEQTGTTL
VAGSSLKAALDIDWSQPDECAEALGRLLGALESLESYLDGQTQPPQAGVA
RCLQAAEQVHQQDVQLNSRGQFVLRRGVSRERRISIEDPAMRHGRKSRAV
RIDGYKRHVLKDIDSGLVRAVGITAANRPEAAVTRDIEADLEPQRVKLIE
LHIDRAYLSSEWVRLRPEDLRIYCKAWPVRNGPYFQKTAFVLDWEAMRIR
CPQGVTQPFEVGGKVRFPAARCRRCPLQERCTTSPKGRSVSIHPDELLLV
ELRERQQTKEGRAKLRERVSVEHSLAHIGQWQGRRARYLGVRKNLFDLRR
VAVVHNLHVLARQEAAGRSQAA
>gid:536603  gll3249  
MICPLPETFEQALLQAQRSVRNALEAGRTRLQVEIQTGRKSATAITRPLL
DVLPQPLLAVSGTGIADYAYTLWGETPYKLLNISEREFIGNSWRSLVLMD
ASSIDVDEVQIYAERARVGDKVLMMVNNWPEGPGLTGVGRGKESTRNAFR
GSVEVAYFLQAFRYRPVVLFRRFPEPWQLWERKEDRFILVRESQAIFTPR
ELAAFNERMGPLQAVERFFKGPSFFDNW
>gid:536616  gll3262  
MMPRPPGVPGTMAQEAEEILSMSIERLERQPVHSGSRLRLFRDRVRLANG
LVRTWDILEQPPVAVILPYQSGPDGGRVLMVRQYRYAIGQDLLEFPAGIV
EAGEDPAHAARRELAEETGLEAARWVTLPPVFRMPGNSNERTHFYLAGEL
SPASGYGVDPEEEIALEWLAVEEFESRVLSGRIEDGKSLILWLLAQPHLQ
P
>gid:536644  gll3289  
MHQTVVRRLLPLVAITALLIACAAETEAPKPTATASAPATESMAMGGGRK
VNINTAILSELDKFEGLLAIPALSNRIQAARPYASPEELVSKNVLTQEQF
DRIKDQVTVEEIVLTGRERDVDYLIKLGLMRGHLIVARELIERQQPEHAL
PHFGHPVEEIYLDIEEQLAERQVPEFKSDLLKLQDLVKFKPDSPEITPGL
SAAFAAIDQAEQAIPAAERTQPAMVLGVIDGLLEAAAAEYSAAVNNGKIA
ARIEYEDSRGFVLYAGQLYATIGPQLDKADPKTAEAIAVGLRKLAGAWPT
IEAPNPPALSAEEVAASVKSVEQAAQKFAT
>gid:536706  gll3349  
MSQPPAEIQVHNLDHLGIVAGIIDSIGLVEEVDRLLGTHPQEHVSCGQVL
KGLILNGLGCVSAPLYLFEQFFVGKATEHLIGPGVLPKHFNDDRLGRVLA
KLYDEGTTKVFVHLALKAARQFGVKTDSVHLDSTFFHVDGEYTPNGRVAP
RAEDEA
>gid:536904  gll3545  
MNYLEPLNRQQLLQQMASRVDGLTQKTAGAALDAALEVIADALAQGRTVK
LSGFGSFQVRRRPARTNIHPRTGQPVLVPAAWNAVFSPAQALRERLRVLP
HPPEA
>gid:537003  gll3642  
MPAYEATLRPAQREVLHYSGGRLGISAVPGSGKTFTLEALIAELVIRRGV
PPERIGVFTFMRSSRANLTGRINRQLWEGGVAGRLEAFTLHSLALKVLQH
FQGQLGLAAIDVLEGYEQERFISRLTQAWLRNHSEIWEPLLPPTAEAERA
ARNRAAFGRGFKAMCREVIRTAKNYRLPPQAIEPVQAGFLPWALGIYRSY
QAELLRAGKLDYDDLAWRAVDLIERDAGVRGEVEGWYDYLFEDESQDSSP
LQERLLDLLSARSGNLVRVGDPNQSIMSTFTTAEPRFFRRFCRLSRRVVL
EESSRSAPMIIALANALVDWAASDHPNPSLRGALVRQHIRTASSGPANPG
DSEAALHFEVVCGPPEEELASVARCAAEALAARPEHSFAVLVGTNELGAQ
VLRQLQRFAGVRTLDLLRSNPTQRELIDRLRVMAEFFAQPSSPVRLAAAI
ESLADWAGLGRRQVAGAQPRLLRIAPEKLLFPVFGSEIALPVAAEERAGW
QKICSTLAGWLLATRSPRADSLRLVVQTLYRSSAEIYLGHYVVDQLERTL
GERPAVDWQEVADEIRAILDGSLNNLPSEAFHFAPEPGAITVATAHRAKG
LEWDEVFLTGISAYEYPVLREDRPVGLYFLDGLDMRAEALSELRASARLR
RGHTSATEQAFLDLAAEKLRLLYVGITRARRRLVLSVATRDLFGREQRPS
RLFQVLQCFDSRR
>gid:537116  gll3754  
MDRRHRFALQAEIWVADHLAAQGGLVLARRWRCRGGEIDLVVRLGGVLCF
VEVKARGGNSWDSAGWEAVGAVKQRRLLLAAALFLAAHPELARSVCRFDV
ALVGRDPGGGVRLVAYIAGAFEGSGR
>gid:537199  gll3836  
MQPSVWQPPVEPSPAEQAILKRIRRAKLFVFLRRIRHQLFDAAFQSELAG
IYKDSPCGQPPIPPAQLALATVLQAYTGISDDETIEVLTMDRRWQMVLDC
LDCEEAPFGKATLIRFRQALIAHGLDRRLIERTIELATSDGGFGSRALRA
ALDSSPLWGAGRVEDSFNLLGHAMRKALRIMVCQTGVGLAQWAEQTGTTL
VAGSSLKAALDIDWSQPDECAEALGRLLGALESLESYLDGQTQPPQAGVA
RCLQAAEQVHQQDVQLNSRGQFVLRRGVSRERRISIEDPAMRHGRKSRAV
RIDGYKRHVLKDIDSGLVRAVGITAANRPEAAVTRDIEADLEPQRVKLIE
LHIDRAYLSSEWVRLRPEDLRIYCKAWPVRNGPYFQKTAFVLDWEAMRIR
CPQGVTQPFEVGGKVRFPAARCRRCPLQERCTTSPKGRSVSIHPDELLLV
ELRERQQTKEGRAKLRERVSVEHSLAHIGQWQGRRARYLGVRKNLFDLRR
VAVVHNLHVLARQEAAGRSQAA
>gid:537200  gll3837  transcription-repair coupling factor
MPLTALIQSLRGSPFLEEMSQRLGKALPVRLQGGNRVARGIAASALARRQ
GTPLLVVTANLEEAARWSAQLEAMGWGSVYLYPSSEATPYEPFDPEEEVT
WGQLQVLAELTGRSSAHWAIVCTSRALHPHLPPPEYLAEYCLSLEAGAGL
SIEKLTGELVRLGYLRVPQVEAEGQFSRRGDILDFFPVSAEIPVRAEWFG
DELERLREFDPATQRSLDAVQQVAITPVGFGPVVLPELQLRLTPARVAEL
PNSWQEVVRTQIQQGQAPEGLRRWLGLAFDQPASLVDYLPDALTVCVDEP
EQVRSAEHRWCEQAEEQWSHQPVGPGPLHTDFAAITASLGRYALIEMREL
VAEGEAFGLGGRPLPSVPHQFGNLAEHLRQYRAQGLQIWMLSAQPSRAVA
LLGDHDCPAQFIPNPQDLPAIEKARSTRTPIALKYSGLAEMEGSILATLR
VVLVTDREFFGQRVLATPNFVRKRRRAASKQIDLDKLNPGDFVVHRSHGI
GRFAKLEKLTVSGSAREYLVIEYADGILRVAADQMNSLSRYRSTGGTVQL
SRMGSKSWEKTKQKVKKAIQKIAFDLLDLYARRAQESRIPFPPDQPWQRE
MEESFPYPLTPDQARAIQEVKIDMESERPMDRLVCGDVGFGKTEVAIRAA
FKALTSGVQCAVLVPTTVLASQHYHTFKERFAPYPISIGLLNRFRTASEK
KDLLARLATGELDLVIGTHQLLGAGVRFQNLGLLVIDEEQRFGVAQKEKI
KTLKTQVDVLTLTATPIPRTLYMSLSGVREMSLITTPPPSRRPIKTHLAP
YDPEHVRTAILQELGRGGQIFYVYNRIEDIQDVAARLQAMIPTARVCVGH
GQMEEGELESTMLAFSGGEFDILVCTTIIESGLDIPRVNTILVENAHQFG
LSQLYQLRGRVGRSGVQAHAWMFYKQEEALTDEARKRLRAIQEFTQLGSG
YQLAMRDMEIRGVGNLLGAEQSGQLNAIGFDLYMELLEEAIQEIRGRKLP
KVEDTQIDLRVTAFIPADYIPDLEQKMRAYRQVAAAPDRAQLQAAALEWT
ERYGPVPPAAQQLLRVMELKQVARALGFARIRPEGTNIVLETPMEAPAWE
QIHQALPAEVRGRFFFQPGKVTVRNLGVLPSAQQLENLVQWLDKVQLPSL
EMAG
>gid:537308  gll3945  
MEWEAKKALQRLQESGQITVIVQDNYSVHRHWRVREKWPQWQEQGLYIFF
LPPYSPQLNDIEGEWLQIKRHGMQGRSYEMEYELGVAVIEAIDGRYRHKG
YACERYLFN
>gid:537366  gll4003  
MDLQAEKRSIEAIYQERLERFGAARRLYERRTGRIANARVAVFLIALAFV
IVGLADRNAFQPLMLTLAAAGAVGFVGLLVLFGRNERELRRLEALEEENR
EALARHRRDWQAAPVPETPEFAEQATFARDLDLFGHASLFQLTCTAHTPM
GRRLLARWLLEPAGPAAIAGRQRAIADLAPLLDWRQDLAAAGRFLAQKPP
DPASFVAWAEAPSWLDQRPWLVWVARLSAAATAALIALQAAEVLSLPLWL
GPLTLNIVLSGIYTASIHQTFAAVSNRSGSVRAYAGLFAALSRLECRDET
LKSLQAESLNAEDRLRKLDTLVGCSDLRFSQLIYLAVQWITLWDVHVLGL
LESWRRQSGDRVRSWLEALAQIEALASLASLAHDQPEWVFAEVDTRLQVI
AAGQLGHPLIGEAVRVANDVTLGPPGTFLLVTGSNMSGKSTLLRSIGLNI
VLALAGAPVCARSLRLPPVTLKTSMRVQDSLASGLSFYMAELQRLKEVVD
AARPAERPLLYLLDEILLGTNSAERQVAVRRVLSFLLGRGALGAISTHDL
ALADLPELAAAARTVHFREHFAAGAAGPVMTFDYRMRPGVAPTTNALKLL
ELVGLGDPEDNRPDDRTNV
>gid:537430  gll4066  
MSEASLEVLLKALRLGFIGEQVERIEAQAVAEGWSHSRFLKCLCEFEHTE
RENRRLARYLKDARLPVGKSLSGFDFAACTKLERRRVQQLAADSTWVKRA
ENVLLFGPSGVGKTHLAAGVGLAMVEKGIPVRYFTATNLVQLLQQAKLNL
ALEKQLVRLDHYPVVVIDDIGYVKRSESESSVLFELIAHRYERHSLVITS
NHPFRDWDQIFSDTTMTVSAVDRLVHHATLIEIEAESYRKKAAQARQQRQ
SAT
>gid:537516  gll4152  
MSAYVKSPQNIVYPSGDGEPVAETFVHLYALLTILEVLKQYLEGQQATVL
ANQFLYFIEGNPRARVAPDVMVIIGVAPGGRDNYKLWEEGGQVPAVIFEI
TSKGTQEKDKAFKKMLYEQLGVHEYWLFDPKGEWIAGQLQGYRLVPVEVD
GEQEELYTPIVDSRIVPLGLRVAVDGQLLAFFREDTGAKLLLPSELHAEL
RRTAALLEQEHGRAERERERAERLAEYLRRQGIDPDSIA
>gid:533389  glr0076  
MSLPRRRFSRELKLHILAQIEAGQTIAAVAREHQIHPTLITQWRQQLAKY
AEEAFSGNGRTYHEEARVAELERMVGQLTMENVLLKKALTRLESRSRPTR
PDGNK
>gid:533390  glr0077  
MLQLVAEQSDAQGGLSIAQACRTLGLSRAEYYRCKGALQKSDSDMEVRVR
IQAIALEWPSYGSRRIRFALKRQGLTVNRKRVQRLMREDNLLCLRKQKFV
KTTDSEHGLTVYPNLSAGLQLSGIDQLWVADLTYIRLSGEFVYLAVILDA
FSRRVIGWSLERFLDAGLAVLALRMAITSRSFGSELVHHSDRGVQYASKE
YTAILKDRGIQISMSRRANPYDNAKAESFMKTLKYEEVYMFEYENMTEAR
NRIGDFLEEVYNQKRLHSSIGYLPPTEFEQRLTMPDPA
>gid:533425  glr0112  
MNERIDQIQSDATLVFERYANYIKHVQNEFSSLDERLRELRLHSKHSWRE
KIEQIPQIFLYAVDYVSAQLEREPLQRLLLVTLKQIAKGLTGQLPRHGEQ
IRYLYNDIKTFVQENEPKQPWYSKYKGYLFIAAICVTFYWILPVAQRKEV
IDLVLGYQDSWKSFLEGLGRSEQIHITMIEFAFAASLWSLMLIAPYLLFR
VSVERKRRTVVRFAQELPTNWNPSSLYIFYRQKPMQLRFSESKPTDVELA
QARLWRNLEVKPEVRREHITEFRANLIPPIVMLFIILVIEAFALRLVVIR
PQIFGSIWLAFLLGLLCIGLLLWEVKIALKTQKEWQTVFEAETFWLEEGL
DSQVELDEKQISEIDSNIEEASMEIIDGLPVVHTRVDTDLVDAIRSMREE
>gid:533453  glr0140  exodeoxyribonuclease III
MSNLTVATWNVNSINVRLGGVCAWLSAHRPEVLCLQETKVPDERFPVAAF
EALGYEVAFAGQKAYNGVAIVSRKPVTVVRRGLPGDEADAPRRLIAATVE
DTQIINVYVPNGSEAGSEKFAYKLLWLERLRQYLLADFDPKAAVLLCGDF
NIAPEERDVWDPKAVAGKVLFHPDEHAALERIRTWGFIDAFRLHSPAAGQ
FSWWDYRAAAFRRNLGMRIDHIWVSSPLAERCSACWIDAQPRAQPSPSDH
VPVAASFR
>gid:533463  glr0150  
MPSLEALFCHVDDFCRRFEPLWQQQLLDDGLRHRRRPRRLCLSEILTILI
AFHQSAYRHFKAFYTQMVWGYWRSAFPGLVSYPRFVEWMPSTLLPLSAYL
RHCFGRCTGISFIDSTPLHVCHVRRVHAHKVFAALAAWGKSSVGWFYGFK
LHLVVSERGELLAMSVTPGNTDDRKPVPELLKDLHGKVFGDRGYISGKLG
RQLREDFGIALMTKLRRKMTNRLMVMTDKLLLRKRGIIEAINDQLKNISQ
IEHTRHRSEVNFLVNLVCGLIAYCHKPKKPSVASDVDQLNA
>gid:533468  glr0155  
MPSLEALFCHVDDFCRRFEPLWQQQLLADGLRHRRRPRRLCLSEILTILI
AFHQSAYRHFKAFYTEMVCAYWRSAFPGLVSYPRFVEWMPSTLLPLSTYL
RHCFGPCTGISFIDSTPLHVCHVRRVHAHKVFAGLAAWGKSSVGWFYGFK
LHLVVNERGELLAMTLTPGNTDDRKPVPELLKDLHGKVFGDRGYISGKLG
RQLREDLGIALITKLRRKMTNRLMVMTDKLLLRKRGIIEAINDQLKNISQ
IEHTRHRSEVNFLVNLVCGLIAYCHKPNKPSVASDVDLLSA
>gid:533485  glr0172  
MTLSRTATETVALVDCYCRAYEHLFVDVRSFEHFKLLHFGLIAVSPRKTL
PAIARVLGTEDAQALHHFVANSPWDARILRQQRLNLVRQTLRQRPFLLCI
DETGDRKKGRTTDYAARQYITNLGRVENGIVSVNACGVLDGAVFPLTFKV
FKPEHKLKSSDQYKSKSQLAVQIIEELMGQEFCFEMVLADCLYGESRQFI
EALEGWGFKYAVMLRGSQGVWMPQGRNIRLTRWRQFNCVFDADEHQPYYI
REALFGRRLTPRFCFVTTDPRLLPVETTRLIMTDLEGDLPGIVGHYYRLR
ARIVQRFKRMKNGLGWADYRLTEYAAIERWWELVLSACWMVSQQSQAFAE
SLAFWADGSERSVLTEETPEP
>gid:533649  glr0333  
MTIRKEILDELLKDYDGTDPQTILGEGGLLKQLTKAVIERALEAELETHL
GYKKHEAAGKGTGNSRNGKSQKTLQAECGAVELAVPRDRNAEFEPVVVRK
GQTRLAGLDEKILALYARGMTTRDIQAQLLEMYGVDVSSTLISNVTDAVM
DEVRQWQNRPLEAVYPIAYFDCLHVKVRDNGRVVNKAVYLALGVDIEGQK
ELLGIWLSAHEGAKFWLGILTELSNRGLKDILIACVDGLTGLPEAIESVY
PGCLVQLCMVHMVRNSCKYVSWKDRKALCADLRSIYSAATEDEAELHLEL
LREKWDKPYPSVGRMWRENWSRVIPIFRFGEDIRKVIYTTNAIESLNMTI
RKVSRNHRIMPNDESVMKMVYLAIQNQMKKWTMPIRAWRPALNRLMIEFE
GRLKV
>gid:533738  glr0422  serine/threonine kinase
MPFTSGALVQDRYLLERQLGSTGARQTWLVQNSATGQALTLKALYFGTGM
DWRNLALFEREAQTLKSLDHPRIPRYHEFFQWQQPEGDYFCLVQDYIPGV
SLAEQVHSGKRWSEAQIEQAALEILEILDYLHSLAPPVVHRDIKPSNVIC
GEDGRLYLVDFGSVQAEQVSGRTVTVVGTYGYMAPEQFGGRAVPGSDLYS
LGATLVHLATGMNPADLVDGGFHIRIPEQLPLSPGLRHWVEKLVDSDPER
RFKNAREAISGLRCKDSLAHPSETTYQGRIALRPGRERFCVEVAAREPAF
EDILTGFACFILFVIATASTHTLKVLEPGDGLTVWIFMVLLTVGFWGVLV
FTASNTLLRLLCCTCLEVDRELFTLSHWILGKRIFNKSGRVMALLGKQRL
SFDLTTAEHHLILAHVQRWLNR
>gid:533829  glr0513  
MVRLLHISDIHLGSGLSHGRINPATGLHTRFEDFLYCLSQAIDRGLAEGV
DLALFGGDAFPNATPEPTHQEEFARQFKRLTDAGIPTVLLVGNHDLHGRG
VGGASLNIYAALQVPGFVVGSRLQIHPIATRSGPVQVLSLPWVNRSTLLT
REEMRGKSLEQVDLALVERMKLALEAQVRRLDPAVPTVLLGHLMVENAVF
GAERHLAVGRSFSIPLAMLARSEFDYVALGHVHRHQVLCEDPPIIYPGSI
ERVDFGEEKESKGFILAEVERGRCRYEFVSVPARSFKTIQANLADSGDPQ
GDLAAILRKHKIEGAIVRVLYRLHPHQIERIDTASLRQMLEGAFSYQLQP
ELISQLSQPRVPGLGESCALDPIDALRQYLESRPELADLRLALVEAAEAL
IKGDSPGLADTDECESDAEVLEVTTTAALDLLAREEGLPAGGTNGQLGLF
HA
>gid:533948  glr0629  exodeoxyribonuclease V alpha chain
MGLAMQRSIEGLAGGERDEMLQGVVERVTFHNPQNGYTIARVAVRGLADL
ATVVGNFAQLQPGQTMQFWGCWKDHPQYGPQFLAHRHEETRPATIGGLEK
YLGSGLIKGVGPVTARRIVAHFGLASLEIIESDCSRLAEVPGVGAHRIRL
IQAAWQEQKAIKEVMLFLQSHQVNTTHAVKIFKTYGDEAIERVRTNPYQL
AQDIWGIGFRTADQIAQNLGVAPDSDERLKAGILYALITATEEGHCYLPL
EEMLDQAVALLRLEEVAESVRPRLVEMARALVREGQIKAERPSGGAEAPV
VCFQPSLWQCEVGLARRLCERPPAPVDTGRVEAWLERYTHHHGLQLSAEQ
RQAVMLAAREPVTVLTGGPGTGKTLTTRAVAALWKAMGKKVLLASPTGRA
AQRLAEVSGQEAKTIHRLLEFDPSTMGFKRCAENPLDAQAFVIDEASMID
VVLAYNLLKAIPAGAQVLLVGDQDQLPSVGPGNVLADLVRSPAIPTARLT
QVFRQAAASRIITNAHRINSGQMPDLAGEGSDCLFIEAHEPAQVVERVRE
FVVQELPRRGFRSLADAQVLCPMNRGLVGSNHLNTVLQEALNPLPPDGGE
LDRGRRLFRVGDRVIQLRNNYDLGVFNGDLGTIAGIDFENQKLQVQFFER
TMGYDFADLNELSLAYAISIHRSQGSEYPVAIIPVHTQHFPMLSRNLLYT
GLTRARKLAVLVGTRKAIAIAVREVKAMQRYTRLSERLSPLQADE
>gid:533958  glr0639  
MTTQTPLPPESTDYPASRRILAGVGWGVLWGGGIAAVIFPLVAANLTVGI
LLALIILIAIAMGISLHWNAAREPCPQCATVFIATPSGGRCPKCGERVRV
VDRRMIKI
>gid:533976  glr0657  serine/threonine kinase
MSGSFARPDTLVGRTVGRYRLVEKIGAGGMGSVYRAVHIEIEDLVVAVKL
LLPGLIDDEALRRRFKDEAAICARLSERSSHIVQIRDYGILEDLDLPYFT
MEYLQGRSLQHLMHKVAAPVHQGLAIARQICLGLRVAHDMNVVHRDLKPS
NIHLIPDLQLGEKVKLLDFGIARLVRDAQRGPLTQGYLGTPQYSAPEQLR
GLEVDARADIYSLGMILYELFSGVCPFAVEDQNFETWYVLHTEGEPSPMA
AANPRRPVPMAIEQLVLHCLAKKPADRPAGVSEILERLELVMGHLPPAPP
AAAIPPPPTITLSAEQVAHLEKQLATQVGPIAPTLVRRALGSSHTPGELV
EQLAAQLPATQREKFSRTVLANLSAQTSAAPSPGVREGSVPPVLRLDPRF
VERCGHELSRLVGPIAAFLLQSALKETPPTPAVLVERLAALVGDQAKAEQ
LRRKLL
>gid:533985  glr0665  serine/threonine kinase
MESLTQPISIGALIDGRYRLTRYIDGGGMGKVYEAVDTRLGDKAVAVKLL
QQNLNVDDRLFEQLRRRFEQEAQLCALLGGQHGIIAVSDYGLDGPQPYLV
MEYLGAAPRGRSLKELVSAEGPLSPERTVRLAVQICESLQYAHGVRTHLG
GRQITGVVHRDIKPSNIFVIDRALVGETTKVLDFGIAKAVSDVTIAMGTN
MGFVGTCDYASPEQLRGEELDARSDIYSLGIVLYQMLTGQLPLQPKTHSF
AGWYQAHNHESPVPLPRLAVGQAIPPRVAAVVMACLEKEPARRPASMQEL
SQRLQDELLKAAPAPVAPEREAPPPASDEALGAALAAAWPVLRERMERWR
SQALDRLEKLPGRIRLGVGLLAAALALKTGRRKCDD
>gid:534047  glr0727  DNA ligase
MSTTVPPEIEEHTRTLRALLHRWGYAYYVLDAPEVSDAIYDQHYRELVDL
ESRYPELVSPDSPTRRVGERPASAFVSVTHRVPMFSLENAFSQAELEKWG
ERLLRAIGPGLEFICELKIDGSATALSYEDGVLVRGATRGDGVEGEEITQ
NLRTIRAIPLKLLGGEVPAVLEVRGEAFIPRDEFERINQERQAAGEKLFA
NPRNACAGTLRQLDSRVVASRRLGFFAYTAHYGRAESQWEALAELESHGF
RVNPHRSLCRDLAEVRTFCEHWENHRHELPYDTDGVVVKVNAFDHQREVG
FTSKFPRWAIAFKYPAEEKSTVVEAIAVQVGRTGALTPVAELQPVAVAGT
TVSRATLHNQDRIESLDVRVGDTVIIRKAGEIIPEVVRVIGELRPPEAVP
YVFPQTCPECGTAVVRAPGEAAVRCPNPRCPALIRGKLGHWCAALEIDGI
GDKLIARLVSLGLVHTVADLYELSAEQLAGLERLGARSAAKIVEQLDRSH
RQPWSRVLYGLGLRHIGASVSVELARAFASADALARADLAAIASLYGFGE
ELARSVVEWFAQAENRALLERLKAHGLQLAGGGRAAQSSALAGLTFVITG
TLPTLSREECTALIESHGGKVTSSVSSRTSYVVAGEKAGSKLARAQDLKV
AVLDEEQLRALIETREMP
>gid:534082  glr0761  
MLLDFPQLVKTFLSALPDRDFPVLDSRLFFSCWLALIMDKSTVSMQDLFK
RLNRTGIPVDISTFSKACKSRSLQMFEQLYRDLLARVRRELPVKQLHPCP
IDSTVVGLTSKLLWAQSYHQVKLLACLEHGSGATEGSLINFGYDHDSNFV
NEMLQAIPENGVGIFDRGFAGLEYLKNAQASSKYFLMRIPSNYKFTFEGN
AGQMRVGTGKESGMYRVVNFCDIENRAEYRLVTNLPAEGECFVSDEEVME
LYRQRWQIELLWKFLKMHLKLKRLMTKNENGIRMQLYVTLIAHLLLELVA
VPKMWGSQRLDKLRYLQCCMCKEISYVHWLGKLLGSRRRRVWSLKLCANV
Y
>gid:534194  glr0872  
MLLDFPQLVKTALSSLPNDDFPVLDSRLFFSCWLALVMDKSTVSMRDLFK
RVNHTGIPVDISTFSKACKSRSLQIFEQLYQALLVRVRRELPAKKLHPCP
IDSTVVGLTSKLLWAQDYHQVKLLTCLEHGSGATEGSLINFGYDHDSNFV
NDMLEAIPENGVGIFDRGFAGLEYLKNAQASSKYFLMRIPSNYKLTFEGN
AGQMRVGTGKESGVYRVVNFCDIENRAEYRLVTNLPAEGEWLVRDEEVME
LYRQRWQIELLWKFLKMHLKLKRLMTKNENGIRMQLYITLIAHLLLELVS
VPKIWGSQRLDKLRYLQCCMCQEISYVHWLGKLLGSQRRRARLPRACTYV
H
>gid:534216  glr0894  
MQQVSGTGCLRILERYDGTLSPNRQYRLLELARSSLYYRPAPVSGKTCIG
CDSSLSSVWTRHSTLRSIWLPGLREGYQANRKHMQSLVWQTSIEAVYPKA
DTSRKAHEHKIWPSPSRGVLVVRTNQIWAADIICVLISNSYLYLAITVNF
HRRYELSKFVEYARSTLLRGGFSARSAV
>gid:534237  glr0915  serine/threonine kinase
MAVEPPPQHDEYRCAAAQPMGEEPLSAGQRLGAYRIIRPVGQGGMGAVYL
AERDDGQFDKRAAVKILQPQLHGPGLRERFIGERQILASLDHPYITRLLD
GGTTEQGLPYLVMDYVEGQPINLYCNERKLDVDERLRLFLKVCEAVHYAH
AHRVIHRDLKPANILIDPEGNPRLLDFGIAKLLPADPEATAVFSQSMTRL
LTPGYASPEQIKGEAITPASDEYSLAVVLCELLSGRRPGEATPQQLTGKL
AGPLASLVLKALCEDPLDRFGSVAAFSLAIERHLTGQALPWPRTGTLLWP
IRAHRRLVTAMAAACLAVGLGWWSLQAAVPPARRAIAVLPFENLSGDGRS
AYFSKGVAVDLADQLGKTGRFTIIAGRAATGGSHHHLRRPFGTEAVLSGS
VRRTGGHIRVVSQLIDARSGVQLWSESFDRPLQSGFADQTEVARQIVGAL
ESQYSPVRKTIAPKSRSAKANCLQNFCPSGGPN
>gid:534392  glr1069  
MARRSARRSRGLAALLASLLLGAEPLTAEPDPAVSQLADVRSGEWAERTL
TSLERQLGLPDPARPHAQLISRAEFAVRLGRVLAAVEKSAQQANALDAEQ
FDALRRLQLEFLPEREGLTARLENLEKRLDRARYAPFDSKTTMSGQVIFG
LSTLSNTLDDDEGDSTAVRFSGRARLTFDTTFGRNDRLRVRLQAGDFPNY
GSLAGTDMARLGVGGNTNRVFNVARLEYRFDLAKNLRVYIGALGGRLDDF
TDALHPLVGSTGSGAISRFGQRNAIYRLISGTGVGVRWEATKTLTLSAGF
VADFGSLAEDDDLDDVRQTDRFLPTGILAQLNFQPSKTTGIGLSYIRTFN
ATDTGTGSDLANDPFDGDSKRTFADSYGLQGFWKISPGFQIGGWFGLTEA
RAEDLPGRPGSSIVNYALSFAFADLAGKGNLGGLVIGQPPRVTRSDLGNK
FQDPDGAVHLEIFYRIRLTDRISITPGLLTIFNPENNRNNPTVQVGTLRA
TFDF
>gid:534409  glr1086  
MKRMPPKAAAAAAPALPDTFIPQPLYDQILANLPIACVDVAIVAQGAVLL
VKRKVPPAMGQWWVPGGRVLKNEMMKDAARRKAFEEVGIECHVGPIIHTA
ETIFPDGPSGIPIHSINSCFFLYPVAAVGAPVLDRFHEEYLWVQRIADGL
HPYVVKCLMGAGLD
>gid:534419  glr1096  serine/threonine protein kinase
MLTTEQWQRVKNLFAEALEKPADERRAFLERSCADCAQMLAEALSLLEHH
RESENFLNRPVVQLLEEPIPTLQPGDNLGPYRVVGKLGQGGMGSVYLAER
ADAQYSKQVAIKVLRAELGTETLVRRFRLERQILADLDHPNIAHLIDGGT
TAGGLPYLVMDYIDGEPIDRYCERRRLAVRARLELFLGVCGAVSYAHQRR
IVHRDLKPGNILVTAEGMPRLLDFGVAKLLEHAESIAPEASLTVGTEQPA
GVTGGALTPEYAAPEQLHGGAITPITDIYALGVVLYELLHGSRPPRPGPE
PAARPQSSQLEELDSIACKAMSVEPEGRYPSVDAFAADIRNYLDGRPVTA
RGVSWLYGARKFLGRHRVSTALAAAALALASGYGWSLLEERLLAASGKTV
VVLPFSERGSTGDERLADGLTLSLTTHLGKVAALTVISDRAAMQYRDSSL
AQIGGDLGATGVLTGSVLRSGERVRIAARLVDPRSGTQLWAEQYDRKLQD
IFAIQTEVAQRIAAAMKAKLTPEEKIRLARAPAQNAMAYQYYLRGREYNN
RFTVKDNEYAIGFFKQALALDPDFALARSGLAVGYLNRFIYGGKVSWREA
ACREAGRAVALESGLAEAHNAMAACYSQMELRLQPQAMAEYRRAIDLNPN
FFSAMYNYSAILASLGRLDEGLYWMKKAVRVNPLCIGCYRSIASYYWLLG
EAAKGDVWIDKGLALIPDGARLHVSRGRAYLPRGRYDEARQIALQVLKEE
PNNAEALSLAGTAERLSGRWERSRRYYERMLKLTETTDLEDAGDTSLRAR
TALGYLALQENQPQRAHRLLAQSLDFDRKRIAEGNEEWLYFRDMAVIHAL
GGEREAAVVQLREAIKNGYRDVHTLQFDPIFENLRRDKRFEQLLTDLRAQ
IEKMRLRALAQEL
>gid:534425  glr1102  
MFTPLNNTGPNERAFDYWTPRRKVKVVLDLLKGNATVFDLARKYKLPPSE
IKAWLFQAEMHMESGFRNRTRDAAQLHEAQLTEAHAALGKALLRARFQRT
PTRR
>gid:534542  glr1219  
MHLRVANRRKDFHHKTAVKLLAKSKVIAHEDLNIQGLASTRLAKSVSDAG
WGQFITILTNKAARAGCLVIAVNPNGTTQMCSGCDTQVPKTLGERWHSCP
RCGLELNRDQNAARNIKSRAVGHPVPARGGYRSTVPTNREACALC
>gid:534673  glr1346  serine/threonine kinase
MANFQGRRDPLLGRTVGRYRLVEKIGVGGMGSVYRAVHVEIDDLTAAVKL
LSQSLNTNDLHQRFRNEAAICARLSERSPYIVQIYDYGILDDLDLPYFTM
EFLKGHALDDLAGSPLPVEQVVAIGVQLCEGLQVAHDLGVVHRDLKPGNI
YLSGDPQSGWRVKILDFGIAKLVSDAMVYGERRQLTHGYLGTPRYSAPEQ
LRGEAVTVLSDVYGLGMILYELFSGTDPFALVDQSFNSWYHAHTERMPRA
MAQANPYRAVPPAVERVVLACLQKDPRSRPAGMREIAQLLRSAVHSEPSR
PLRRDETRIAYPLELNANQLDRLQKHLAALIGPIAPILMRQAQALAQDAQ
DLVERLSEQLPEKQRPSFSQKAMDLLGATPSRPPSTNPGAAPSPALDPNF
VQRCERELAHFVGPIAARLVQVASQSSHLTQADLIDRLAAQIGDPTKAAQ
FRRRLS
>gid:534691  glr1364  
MARIPTLEYDRGTLILHPPPRGKGWIDYATWDDRVEKFRIPASDYRALVQ
ALEAEGARFEDRAGAFAAVELTPGFEMTPYPHQTEALAAWQAAGRRGVVV
LPTASGKTYLAQLAMQSTGRSTLIAVPTLDLMHQWYAHLRAAFPDTPVGL
LGGGSRDTSPILVATYDSAAIHAETLGGRYGLLVFDECHHLPGDFYRKVA
EYAVAPFRLGLTATPERADGRHSELTGLVGPEVYRRTPAELAGTALAPHR
TVRLKVRLSAEEQARHDELLALRNAFLREAKLSLGSAEGWQRFVQTSARS
CEGRRAMKAHTEARAIAMGTESKLRVMADLLVRHYPEPALIFTYDNATVY
RIARDYLIPAITHQTPVKERHAILAAFRTGEYRAVVASQVLNEGVDVPEA
SVAILLSGTASAREYIQRLGRILRKGRDPNKHAVLYEVYSEDTLEERTSA
RRHAHTRPHHQPPGNASR
>gid:534709  glr1382  
MVSLSPFATDIHYPSGDGEPVAETFVHLYAILVTLEVLRQYLAGRQATVL
ANQFLYYAQGYPKLRVAPDVMVIFDVAPGGRDSFRIWEEGQVPAVVFEIT
SKKTQDKDLETKRDLYESLGVREYWLFDPQGEWIEEKLLGYRLARIETEV
EPTDRYIRIRDGRSEPLGLRLQIEGELIGFYREDTGQKLLIPSELAEELT
RESQARRQAEQRAEVLAQRLRELGVDPDTL
>gid:534777  glr1449  
MGIEVQVHRQLLQLLRSEQLPTVWPHQLTMGRLVARALSQGRSTLVQVSG
SGEHRLSYLLPALSAPEPVILCASETIQQQLLDVDLPLLRRNLATEKPIG
LGDRWPGTGWQGVLITDPLTFLRDRLRGGSHFPEGVAVIFDGAHGLEQWA
IQALCERIEPADWDDLREGDFERAEFWLDWQVGLSRALSSQPHHRAALSP
GRQAELASALGESPPSEAWRRFRAALGAPERAHWVQYYRQSNQFSLVSSP
IAVAGDLARALWPRQPAVVIGEALDPAREARTFRSWVGLEEVTCLRFPAD
PREQEVQLYLPTQLSDPTSSQYRQQIEPVLVELAALAAGPLVILLPEGPL
RSHLGTFLASQFGSRVGIHRLQPPPNGITLASWEFWEANHHHLGAPTTLA
VCALPFPRLDDPLVAARVQWLKDRRRDWFREYLLPVAIGRLQRGISPLRQ
TQGLVALLDNRINHRSYGGQLLDALTPARRLRYLE
>gid:534789  glr1461  
MANSPVYERIYAVVRRIPAGKVATYGQVALWAGLPGRARQVGYALFRVAP
DADVPWQRVINAKGEISASPHRLGNDDYQQVLLKAEGVTFDCSGRIDLGE
FGWQPGACAQPPAEFEPPASPPQNRCQC
>gid:534885  glr1552  serine/threonine kinase
MIGRLLDGRYKILQILGAGGFSQTYLALDTRRPSSPTCVVKHLKPTSENP
GSLQIARRLFRSEAETLERIGHHDQIPRLLAYFEEDEEFYLVQEFIEGHV
LATELQPDAPMGEARVAAMLQDVLSTLAFVHSQGVIHRDVKPDNLIRRSS
DGKLVLVDFGAVKTVWSRPAALPGGQRGGTVAGTIIGTPGYMSTEQGRGK
PRPSSDIYALGMIGIQALTGLNPMELPEDSRTGEPLWQDNAQASPGLCAI
LSKMVSYHFKDRYQTAHEALEALQHLYASPGSTSAATVERTPPPLEAPQA
REETYYGQPLPVEMPVLQQTPPPVETPQAREETYYGQRLPLSVSELPPQT
ASASAPQQVSPRRTLALTGAVLAAVALGFLAYRSLANRPSPQLSPVPQAP
ERVRPKQALGVPAKLPVANPALRDASKSAPQAIRPVRPPAPVPTKPPTAA
APVVKTTAPPPAVKAAPTASVRPKPMVKDAAVQPKPKPPQNRPAPVQLPT
VTAKGPPAKTASPPELFLPAKAPVSAPETPDILRPADKPLEPPK
>gid:534927  glr1594  
MTFVLPSVTYPSSDGEPVAETFDHLYAILVTLEVLRQYLSGQQATVLANQ
FLYYAQGFPRLRVAPDVMVIFSVAPGGRDNYKIWEEGQVPAAIFEMTSAG
TQQNDTVEKKHLYERLGVREYWLFDPRGEWVEGRLQGYRLQPVEVMGEVV
EVYLPITDGRSAVLGLRLQVEDRLPAFYREDSGEKLLIPAELAEELHRTA
ALLEQERRAREQERQGREQERRAREQAERRAAALAERLRELGVDPDTV
>gid:535061  glr1726  
MAKPYSYDFRQKVLQAIELNGLKKSEASELFDISRNTINLWSQRKAETGD
VQAKPRPASHKGQKITDWEKFRAFVEAHGDKTQAEMAQLWDGQISSRTIS
RALHKLGITRKKRPTGIANAMRRNAQHS
>gid:535062  glr1727  
MYVDQSGMDERDDYGYGWSPSGERFYGLKAGRRQGRINMIAGYRAGQLIA
PFTVEGACNRTVFEIWLESCLIPVLQPGEWVILDNATFDHGGRIVALIEA
AGAHVLYLPPYSPDLNRIEKCWAWLKSRIRKRLRDCGHLRNAMDAVLKQA
AS
>gid:535080  glr1745  
MKGASRARPLSLSPTALKLYVRCAYAYALDKIVKVPGYRRIIAAHLHTGR
AVHTVLEQLVRQEQIRPEDAVSSLERTFSWGAYTDRQTAEAAFQTARARL
DAWVGTPYGWGAGEQLAVEKMLRTRPRPLGSASIGSVVLIGKPDLVRVDP
EGTLEVVDHKSGKPGDGVEALKRDFQAAIYRILAEERWPDYSAYRVSFSY
LATGTVIGLTYTREEVEDWWQALLTVAERIARARLAVENDIALEEAFVPA
PGEQCAACTFRRVCAFRAS
>gid:535085  glr1749  photolyase
MIRSLVWFRKGLRLHDNPALLDAARDAARLYPLFIVDPWFVNPERVGVNR
MRFLLESLGEIDGNLRRLGSRLIVLQGRPQEVLERVLSRWQIGRLCFERD
TEPYARRRDEAIRSMAERVGVRVISPTAHTLYDPDELIELGRGKVPTTYG
AFGRLAAKLGEPDAPVASPSHLPPPGELDADYGIPTLAELGYPDPECPSR
GIIPPGGEGEGLRRLHVYLSDRQRSAGFAKPDTDPTAFDPPSTTALGAHL
KFGCLSARTFYAEVQKVYREVGEHTEPPMSLIGQILWREFFYTVGYATPN
YDRIEGNPVCRQIPWDDNPEYLAAWSEARTGFPWIDAAMTQLRTEGWLHH
LSRHAVACFLTRGDLWVSWEKGQAVFERLLLDQDWSLNASNWMWLSASAF
FNAYYRVYSPISFAKKYDPEGRYVRRYLPKLARVPAEFIYEPWRAPLLVQ
KQAGCVVGRDYPDPIVDHEQAKACNLERMRLAYEQNAKGG
>gid:535173  glr1837  
MSIELTQLLDLPNVYVERQSINELGIFFYLQPLAQEILCPGCGQLTDIQH
QARPLQVRDLMMRKKPVFLRIPRRQFYCKACQRYCTEKLEFLDWRRRHTR
RFEEDVYERVQHSSLEQIAREEGISPEAVRGIFEHVAVESKKRLGRSKAH
QHR
>gid:535174  glr1838  
MRRGHDFKTVVSDIETGELLEVVDSHKQKGIIESLSQQPFKVRQAVREVS
IDMWGGFTKVVQQVFPKAVIVYDRFHGTRMVVEAVKKIAKQCGFRKCKEQ
ACLLKNGVDLSIEEQEELETRLKSSRRLRKAYAYKEEFRSI
>gid:535344  glr2008  
MSYSQLTLPERHRIYILRYQDCLSLRAIGRLLGRHCSTIARECQRNQLDG
HYLPAEGWLSATRRKEAKTPFLKVSSELLACIKSALKQFHSPEQIAGRLE
AEGQVFVSHETIYKLIYADYEGLGACRKYLRQGRKVRRRRGGAKDKRGLI
PKLVDIEFWPGEAEQKQVIGHWGSLGQRPWEGDTVIGANHQGGLVTYVDR
ASKFLVTGLLKNKKAGPVTALSIRLLQSEAAGKVKTITFDNGKEFSRHQE
LTGALEAECYFAKPYHCWERGLNEHTNGLLRQFFPSRQIFGQSNQSRCKG
QWT
>gid:535447  glr2109  
MEGRKYPSLLRLNVDLFERHREQQQAAEAPLADRMRPRSLDQFVGQGHIV
GPGRLLRRAIQADQLSSLIFYGPPGTGKTTLARIIAGTTRAHFIAINAVL
AGVKDIREAIDEAKSRRGQFGRRTILFVDEVHRFNKSQQDALLPWIENGT
IVLVGATTENPFFEVNKALVSRSRLFQLKLLESEDLRAVALQALADTERG
YGKRNVRLDPEALAHLVDVAGGDARTLLNALELAVETTPADGDGAIRITL
PVAEESIQRRAVLYDKEGDVHFDTISAYIKSLRGSDPDAALYWLARMIYA
GEEPRFIFRRLLIQASEDVGLADPQALAVVVACAEAFDRVGMPEGRYHLA
QATLYLATAPKSNSAMAFFDALAAVEGERASDIPNPLKDANRDKEGFGHG
AGYLYPHAYRDHWVAQQYLPTSLQGQLFYQPGDQGYEAGIQAAVARRREA
QLAALVETDSPERLSFGPADGPSERWLARAMGRQGEQLARLRDRIFELAA
PERHHVILDAAAGSGLLTWEAIRRTPEGGVYARAANAADADALAEQAAAL
SPLRRPVILTAALEHLTAAVGQHSPGLRFERIVGRNLLKGTPHKIAVLTA
LQNLLAPRGVLVIAENMPSRGSRLWQLLDPAWLAADLYRRVVEAEELAFG
DDPQLGWEPDDLQSVFEQAGLACEITSESVEGELRVSGPLLARWFARSHP
RGAYAAQLARLLSPAEVESLEAIFRRQLLDRVVSWSSAVAYAVGRPV
>gid:535590  glr2250  
MAIFSLGSPVPSHPNWRPTMPEETNDLFTGAMGEFMAQGAGVIPPGYEDI
YQQSIDEQRGEPPSEGIEMPLPEETYEQPDPDAEPHPNQMEILPEESYRG
SSSADTDLWMAAIPEELEPQPSGDVPPDSEETEPFDPYPEEEQFDTDDPF
SGEIN
>gid:535606  glr2266  
MAILHGIWVHQPPRAGLFLWGETWRQVAKRRKRSEAPAPHPYVQQPAELS
PRLAAQFPQIPLSLLVPETLALQLPATVENVVYSASIAPEGKLLELEPWL
VEGFWLDGHQAFELLLGVPLGGGDASIGDDLRFWSQCARWVLDLLVRAKY
LPDLESGDGQEIPTARWVPLLDSAVDQARLKEFAARLPGACRAATPELSP
HQILKSFLSAMLDARVRTLLACEPPDPRTLPAGAVRPWLLALAHAQPQLK
SPDPETPALAEALATWRAPLSYQVRSRTCFRLQPPEESQGEWKLHFLLQT
GDDPDSLMAAQQVWSSAGELQEVFLAGLGLASRIFVPVERGLLVPQPTCC
TMSTVEAFQFLKAATWRLRDSGFGVLLPESLADAGSLRNRLGLKLEANAP
GRNGSGLGMQSLLAFKWELSLAGKTLSRAEFDRLAASSEPLVKVNDNWVE
LRPQDVRAAHSFLQSRKDQVGLSLEDVLRLNFGDTPKIDGLPIVNFDSSG
PIQQLLETLTDQRKLTPIDEPPGFKGTLRPYQKIGVGWLAFLQKWGLGAC
LADDMGLGKTVELIAFLLFLKSKNELDGPILLICPTSVMGNWEREIKKFS
PSLSVHVHHGARRPKGRNFVETAQKKQIIVSSYALVQRDSKDLKRVEWLG
LVLDEAQNIKNPDAKQTQSIRELTARFRIALTGTPVENRLAELWSILDFL
NPGYLGARNFFQRRFAVPIEKYGDRSSANALKALVQPFILRRLKSDPQII
QDLPEKQETNVFCPLTPEQAALYERVVNESLAKIEQSTGIQRRGTVLATL
VKLKQICNHPSHYLGDDGPLANRSGKLSRLGEMLEEVLADEERALIFTQF
AEWGHLLQAHLSRQLGSEVFFLYGGTSKNQREAMIERFQSDPQGPRIFIL
SLKAGGVGLNLTRANHVFHFDRWWNPAVENQATDRVFRIGQTKNVQVYKY
VCTGTLEERINALIESKKALAEQVVSAGENWLSDLNTDQLRQLLVLDRSE
IIDTEDTA
>gid:535736  glr2394  ATP-dependent DNA helicase
MVLGMDALLNPSQQKAVERFNGPLLVVAGAGSGKTRVLTYRIAHLIETYQ
VDPEHILAVTFTNKAAGEMKERILQLFCERTARSRGGEAFAALPDPERRA
IAGRVRRELIQPLWVGTFHALCARMLRYDIDKYTDQRGRTWQRNFTIFDE
SDVQDQIKDIVTKQLNLDDRRYVPRSVRFAISGAKNQGLSPEEYQHSEGG
SLRARTIAQVYERYQDQLARNNALDFDDLIWVPVQLLRRRPDVLDYWHRR
FRHILVDEYQDTNRTQYDLIRLLATNNQPKSEWDWRDRSVFVVGDADQAI
YSFRQADFRILMDFQSDFGDGLADDETETLIKLEENYRSSATILDVANEL
IANNIERIDKVLRATRPAGLPVSLHEAEDEVAEAEYIVGQLRKLKDADTR
PWNDFAVLYRVNAQSQPIEQALSRWGVPYTVVGGLRFYDRREIKDVLSYL
KAIHNPADSVALKRSMSAPRRGIGKTTLDKLEAGAGMLQTSLWQLLTDET
SIKNLAGRTSGPILQFVRFVERLQELAARVRVSELLDTVLKESGYVQMLQ
EEGTEEAEGRLENVLELRSVVQRFEEESEDPENATLEAFLANVSLASDLD
NLEAGAERISLMTLHAAKGLEFPVVFLGGLEDGLFPHFRAIQDGDSAAIE
EERRLCYVGITRAKERLFLSYAQARRLYGDRQPAIPSQFLKELPAEKLAG
TRVRKGLADRRRDRAERIAARPAAVPPLAPPARRKSPAQNWAVGDRVEHD
QFGLGQITHVLGVGEKQYLAISFPGQGKKVIDPRLAPLRKLAE
>gid:535750  glr2408  
MAEEQVSLFNLSEAAPPQVQFDRIPNDAGVTIVPGTYSDFEQIRTHCNAC
FRCELGKTRTHAVVGRGNPQALLMVIGEGPGENEDLTGIPFVGKAGQLLD
KILEAVQFDTEKDVYVANIVKCRPPGNRKPAPEEMKACLGYLNEQIRMID
PKILLLAGGTAVEGLTGDKRGITKLRGQWLQWQGRWVMPFFHPAFLLRNP
SRDAGSPKWLAWQDIQQVRAKYDELVVPR
>gid:535794  glr2451  
MNLRDAVEHWLDHLIAARSLAGNTLKAYQRDLEEFARFVESEHLDWQTFG
ATGTHHFAQILQKTHTPRSAARKLSALRTFYRHALVQGWAGGVPPACTRT
LPSAERGLPRILSVAQTEKLIESAASPLESAVLELLYATGLKAGELCELR
VRDVAFAEAYLSVQPAASQPRVVPVGEPALAAVEAYLGSEPVLPERWLFV
GRQNRPLNRFHIYRIVREAAARSAIDWPVTPDTLRHSFAVHLLEGGADLA
TVRELLGHASLATTGIYTRLARNYAVGRPRADSTPNHPG
>gid:535831  glr2487  single-stranded DNA-binding protein
MALLNQVHLVGRAGRDPEVRYFESGNVLCTVTLAVNRIRRKGEQEDQPDW
SDLEIWGKTAEIASEYVRKGSLIGISGALAFNRWSDRTTQQARERPVIKV
DQLELLGRAARPDEPESF
>gid:535832  glr2488  
MNTIALLGTLHSAPQLRHTQDGLAQASVVLSFTTLKADEADYTVRVVLFA
TAAEEFHTNFHQGDAVIVEGRLHSESRAREDGTKERHVEVIARRVHAVAI
PAAPATPAPASVPPATTKQHPPAAERKTAAPTARRPAPSARPPVAAAVAD
DDIPF
>gid:536056  glr2709  site-specific DNA-methyltransferase
MTVASRARPFLKWAGGKTQLLDQIAERFPAVLKHGQIDRYVEPFIGGGAV
FLYVAQRYAVEQFVLFDINRELILAYRTLQRAADDLIEKLEALGLHYHAL
DGDERRGFFYRVRERFNTLAGEIDYDHFDGRWVERTAQIIFLNRTCYNGL
FRMNTKAQFNVPFGRYRNPSICMPENLKAVAALLARARIEWGDFTDCAAL
AGPGTFMYFDPPYRPLSKTARFTAYSAFGFDDAEQLRLAQLYRTLDAAGA
KLMLSNCDPLNTDPADDFFERAYAGFEIRRVHASRLVNCRAARRGVITEL
LITNYPQTAGG
>gid:536089  glr2742  
MLMTYQYRLKPTAAQSESMERWLQLLCKQYNYRLAQRFDWMEHHRCSLNA
CSIRSCSIATPADAPDYSSQKRDLRETKKRFPEYAQIYSQVLQDCIGRVK
KTFDRFVKNDTSGNRSGRPRFKSQSRYRSFTYPQILAGWLEGNRIRLPKL
GCLKIWMHRPLPPDFAVKTATITRKADHWYIAFVLENKDASTAEPVITPT
VQNTTGFDLGLESFLVTDKADRVEIPHFHRRAEARLARLHKRQSRTRKGS
SARRKANRKLSRAYQKVVNQRKDFHYKTAWQLIRTSEVIAHEDLTVQNMA
RTNLAKSIYDAGWSTFIAILTRKAANAGVRTIAVNPAGTTLRCSRCDRDV
PKQLSDRWHECACGIRLHRDHNAAINIRNFAIARAVGHHDLVKNARRSPL
RRETCAELSEASAKGRCHEPRALYCTGRFSKFVAGRCHETYALYSQKDVR
TIE
>gid:536120  glr2772  
MRRAFSVSVFLCREYRLLLIRHKRLGSWLPVGGEVNPGETPLEAALREVR
EETGIEALFVRLGDDNDIDGAPPGLLGYEEHHAGSKGVHLNFAFVAFLHD
GAIIRPNHEFDEFRWVNLDELVGLRDGNHTPLNVAQLGFKALRRVRTLGL
R
>gid:536211  glr2862  
MIPLHLSLSNFLSYRDGRLDFSGIHTACICGANGSGKSSLLEALTWVLWG
KSRADSDDDVVRRGATEARVDLTFSCEGQRFRIIRTRVTGKTSSLEFQVG
DGDSFRTLTRSSLRVTQEAINEVLKMDYDTFINSAYLRQGRADEFTVKRP
AERKEVLVEILNLNRYEQLCERSKEHEKQFAGRAQVLADQLQRSRLALGE
RPMVQARRDTTAGELEALRAQQTQIQQRLELLSTRSRQREQLAGQLERLD
SQITQTHQTHSRLHAQCERQTRVVRELEVLLEQEAQIAQGCARYQQLLAE
EAAMGERGSRHQTLTVRRTELERRLDGERHQLELQLQRHKSRFDGLTQQR
REAQAVLEEAPKIEQGLAELARVRELLTSYDQRQREVFPHTEEKLQLERE
IHRVYSELAAQIPVYEQQQRKLQQDLSRRGFLEQTFQKVSEKVEQLEKKR
VYQQRVTEKGLERSTFEAKLRAQQQQYRKELDQLAEKAALLAQGETDCPL
CGGPLDAEHRELLGEQHQRQHAELEDHLTLLAHQISAAEHEVRVLRTEYS
ELVRELEQLPSLLQQQAQLRQQLSAVDESSGTSRQLRSQIQAIRTQLERG
DYAQDLKARHTGVLQQIQIINYDEKDHAMARSEADRWRWAEIRAGELERA
RRQLGQIEAEIPALEQTCHQIESQLTHQQYGQELQNHLRAAAEQLQSLGY
DGTAHQNLRKHLQDHLHWLSRQQELIKARNRHPEERGVLQELQKAWAEGK
DLLVRLGEQRVALQAELDATPDPREALAAAQAEELLNRRHQEHKLSELGA
ADQRLAQIDALQEQIAHQQSQLEHAQKQQALYKELTRAFGKNGIQALIIE
NVLPELETETNRLLSRLCDSQLHVQFITQRASKNAKKLIDTLDILIADAR
GTRPYETYSGGEAYRVNFAIRLALSRLLARRSGAALQTLIIDEGFGTQDL
EGRNRLVQSINTVADDFACILVITHIHELKEAFQSRIEVEKDHQGSQLSV
VL
>gid:536291  glr2942  single-stranded DNA-binding protein
MSLNMVTLVGRAGRDPEVRYFESGNVKCTLTLAVNRLRRKGEEDKPDWFD
LEVWGKTAEIAAEYVRKGSLIGVSGALTFSRWQDKVTKEAKERPIIKVDR
LELLGSKRDSTGEAPSMDEDF
>gid:536298  glr2949  
MSYPRSCRLSSTNTVSTQPATSIEHACLDVISYWHTSLLDAERLGLDGQV
FKEETIHRAVTRAQLATGALNAQIVEWLFEKDDQDLQTAGETHPRAVGVR
PDPLSPQAAREQAVGPTVAQQETEFVEVQLCPIRLIDERSGASVAPVWMP
AVVSRCGQLSVGSQTPWIARRLLEPNEAYLTIGTIEAFNDYVSRAALPTD
NWSKYWEYCHTMLQHVAGAGYDNLEMQGFRIAPEAYVFKGSQIPGITKHV
RKLYEYMLANTVPPLAYHFTGAHLSTLCPLLDRDQQRRQALRHLGQMSGT
YPLSPSQREALHHFFNCGVGEVLAINGPPGTGKTTLVQSLVASLWVECAV
AASEPPVIVAASTNNQAVTNVIASFGNQEPMGGELARRWLPELLSYGLYL
VSAAKPDEETKAFQCIKGSNNFFATLENEDYVRQAIPHFLDYSRRQYGQA
IDSVQVAVEHLHRELKQLVECIHRLLSELLDCNGRYRQIIDLVEHMRGTL
WQGSALPEPFEKLVVDTMRQIQSAEDGRKATEATYELLDTARYLAFLLTT
HYWEGRWLLETQKLLQKALTGPRVARHKEEERRWRRYAKLTPCFVSTFYM
LPRFFSVYEAGQDNAPLLEFLDLLVVDEAGQASPEVAGASFALARQALVV
GDEQQIEPVWSIPEAIDCGNLLKHRLLSEQTQKPDFDITGRSASAGSVMR
IAQCLSKYQKYPEQRGMFLSEHRRCVRQIINYCNELCYRGRLEPKTKEPT
KMPVLSGEHHLTPMAYCDIHGKAQRVAGSWQNEIEAQAIVAWLVRERSAL
KAHYNQPIEKIVGIVTPFKVQARLIRTALRNVGILNLTVGTVHALQGAER
PIVIFSSVYDALHKGSFFFDAKPNMLNVAVSRAKESFIVFGDMRIFRSQL
EKASGASALPSHLLSRYLREHEGNALPGLLTQGQRSREESTIVPPTESVD
SVLAEAKTDPLPKEIGTDEQPILDSESVSTHSAVWACPVCSAALEAGLSF
DEDGRPTVLLRCSNREARQQPDHDGAIFTLRQIWWSARFGYLSP
>gid:536502  glr3151  
MDSARNWRWLLAVPIVNWLGLMLAGQEVRKPSWVRWAVVYAILSSLGVIA
GTVAGEWFLFWTVWGLVAAHTFQVQSEYQNRLLLMQDSQQRLQSDQDMRL
ARELGMGIDINRCSIDDLLRLPGLSIIEARRIVEARRSGGPYLSADELIE
RADLSSIKVRRLEPLLQFCYYEPPVALPVTLDVNEASVVQLEQLEGLDTH
LAVRIVREREQHGDYPSLGDLRDRLNLTPKVVARLLNRLTF
>gid:536576  glr3222  
MSCQQLSLAERQQIYVLRDKNCLGIQAIARVLHRSPSTISRELLHNQSNG
RYLPETAQALAQTRRHNRKRPFAKVSLELVLLIKQHLAAFHSPEQLCGRL
ELEGKDFVSHELVY
>gid:536577  glr3223  
MIYANHAGLGAYQKYWRQGWRKRQRRGGEKSKRGQIPNRVDIDERPAIAG
FKVETGHWEGDTVIGENHQGALVTLVDKHSKYLLVELIKSWRAAPARSVS
EGLLS
>gid:536606  glr3252  
MTRRYALRDDQWERIKDLLPGREGTVGVTAKDNRLFVEAVLYRYRAGIPW
RDLPERFGDFRVIHTHFRRWSQSGVWQQVFEQLAEEADNEYAMIDSTIVR
AHQHSAGAKGGSLSKRQSVAAEGD
>gid:536620  glr3266  
MPHRIALTPEQEQALLELKNDLSVPRRVRERAEALRLSAHGMNVPRIARY
LDWAPSTVHATFERWWNGGIDQLFEADGRGSKNTWSEADLQYLEECLQRE
PRSYTSSQLAAKLAEERGVQLSADHLRKLLKKRASPGNESVAAL
>gid:536621  glr3267  
MVIKYLDESGFSLPSIMNYTWAKRGEQKRIEQPARHGKRISVLGIYCPES
SFDYGVCFGSIKSATYIELMEWEAKKALQRLQESGQITVIVQDNYSVHRH
WRVREKWPQWQEQGLYIFFLPPYSPQLNDIEGEWLQIKRHGMQGRSYEME
YELGVAVIEAIDGRYRHKGYACERYLFN
>gid:536630  glr3276  
MASRWNPQPTPSHLHPEVRALVGEYGARALGSLGIGDPQSARAFLWPERS
PLGHCPTWPELERAAQWIAQTISEGKSITLQTADRFESAVAALLLQEALR
EAGTELTVVAGAQPVAQSLLIAVGSMPAAPPGCRTIAIEARLGEASGEGL
VALNALQLALDHPLRFLPEAAVAWMLVEALLGQLNRQPPPDRWADLLLAG
CVAGLVSLAHCRPQVLAGLAQDGGGSHPLLAALISSHRRQIFKMAAPLDA
LGARATAWLAGESDGGWEPLWREHEARIARVVQEGALAIESLGLQSQGVA
VLARVHWPRQALSLAAARLAGRYGLAVVLIACPEAEALAHGSGYAPEGTD
LIAALAALRPLWVEAGGRPEAVRLVCESRWVAALQRELGREIARRVARER
LEPPVAIAIDAETTLDLPAELDRAFHELERLAPFGPGRPRPRVAVRNYRP
QWTLSRDGRHLECKIGPRTLRLRDGAEERARRQEAGLMELIFEMEPWEAN
LWWGRLHAVQPAEARPALPAEAGRQIQVEDFRRMGATLPDALPALILREW
PLCAEELSVLLRQSPWSTVVLAGRSRDWSRVHARLPVLVQRWEQGERLPE
ALADSELPEPVVVRIVARCRAGGAVARAACAVLREASAFQRWLDAAPAAE
IARLCNRLVQDG
>gid:536631  glr3277  
MVGWPRGGLSGACLERYRELMATGVAPERIVVLAAGREEAQRMSARLEGA
FERFAGPLRVETWMSFAVRMLAEFWGEVLGREPTLGTTFEPVVLDFALTR
HCVERACALCPDHEARFENCGLKEERVWDQIASAAWIACTSGISLEAVGE
RLRAAWPDGEDTQRLALLGALSCCARRTRDYIRERGAIDAAGAVELFGRV
VMGLDAFWGSFDHLVVDRAEDSCAVALDLFGRCQERGKGLFLAYTVGGGG
LYTGVPQLAAEFVVRRTRFRYLDRPDPAGEALRWLADRIARRIHPEFKNP
LPVAVPDPIPQPVLLEGQTVIDAAEAVAAAIRALLATGVLPGRIAVVAPL
IDAPVATVIESVLGLPLHTPRPPASLLQQPLVRTLLSALDLAYPQWGRFP
TFGEVRLMLGLLLDLDPVRAELLAADVFDPVGRVLRSREAVRYPERVGFE
KLDRYQDLIDWLADNPQQDAPDCTLGRLYTDLLTEVICEPADQQLLFELM
GIARRLRLSGIAAVPQGLGAVLRFAHAPPTRPPASDRLVFSTPWSYINQG
FAADYQFWFDITSERWSRPSWVGLYNHRVLTPEWDGCRYDYQRDQNSRTR
RLARTLFNLCCRTSRGLHLVRSALSARGEANSGQLDRLILSAAERAI
>gid:536970  glr3609  AraC family transcriptional regulatory protein
MDIDHLACYRAVLARDERFDGRFYTAVKTTGIYCRPVCPTPPPKAHNCMF
FASAAAAQAAGFRPCIRCRPELAPGHAGWSGTHETLRRAISLMAEGAPET
ENLPDIAAAVGVGERQLRRIFREQTGASAIEFKTVHRILFAKALIVDTDL
AMAEIAFASGFGSIRRFNAAFQQMYGRPPHTLRKQGCIAGGCGIVLRLGY
RKPFNWSAFLAFLAPRAIRGVEAVTGETYRRSLRAADGAALVTVTDDPQA
GVLVARVQSDRVAALSAVAARLRRFFDLDADPAAIAAVLGADPLLAPLLE
TVPGRRIPGTMDSFELAVRAIVGQQVSVTGARTIVGRIAERWGEPLDLHL
GALPPDGPLLLFPQVQALVNAPLEDVGVMPARARAIRALAAALVEDPELL
NPAPPPARTVARLLQLPGIGPWTAQYVAIRALGDPDAFPAGDLGLLKATS
SGRPHLTPRALELRSQAWRPWRAYAAIHLWASQAATTDKHEGEGEGSCHR
RNPSTALYKSGTSQHRSEPYGVRSTATVPSFTSTS
>gid:536971  glr3610  methylated-DNA--protein-cysteineS-methyltransferase
MTLQPEAVAAVASQLQAYFAGERRSFDLALAPVGNAFQQTVWRQLCRIPY
GVTITYGELAGRIGNPSAARAVGRANALNPIAIVVPCHRVIGSDGSLTGY
AGGLARKAALLALEGTTLNLGWSP
>gid:536998  glr3637  
MLLADLGAETRGLLGWARLCAQLASFAQTKAAKGECETLLPFEARSEAER
WLQRAEEALRLAESVPGGLAFDGVHDIASDVERAGRGGLLTGEALLAVAS
TLAAARRLRRAIEEHSGQAEELALLVAEVRTFPELEQEIYRCIDDTGEVA
DRASEKLRDLRSGHRRLRAEIQRTLLQLLQRRANCFQESLITQRGERFVV
PVKVSHRDQVPGIVHDSSASGQTLFVEPMAVIDTTNRLVEGMRAEQVEIE
RILAELAALVAERATELLHLHRVLVDLDLAAARARYASWLGAVRPRFGER
GCGLVQVRHPLLVWQERHEQGTPVVPVDLPVDPAVRAVVITGPNTGGKTV
TLKTLGLVVLMAQAGLFVPARDPAVLPWFDRVLADIGDEQSIEQNLSTFS
GHIRRIVRILAALTPDALVLLDEVGAGTDPQEGAALARALLVHLAERAGL
VLATTHYGELKALKYTQSHFENASVEFDLATLSPTYRLLWGIPGRSNALT
IAERLGLDAQVVAVAQASLSEGDVELDRVIGALQEQLQIQEEQVRSTTRL
RGEVERLQSDLLRQQVLLDAREAALRARQDQQVREVVAEARAEVAQVIRT
LQRGDATAQQAQQASEALKAVGEAYLGEESAAPAEYRPQPGDKVEIVPLG
QMGEVLSPPDNGDQVRVQVGILKLTVPASQLRRPGSPATRPKPRPQAEVP
RPPSPPKQEPLVRTEAQTIDLRGRRVAEAEALLEPELNRQSGPLWIIHGH
GTGKLRDGIHEILERHPRVARFEFADRTEGGNGVTVVFLK
>gid:537221  glr3858  
MTAKPQPQYTPEQRAEAVRISEQSGKSTYQVARDLGISQTTLSRWRRQAR
AEHKHANDPDAPLGSDERRELTRLRRENKQLQLERDFLKKAAAFFAKDHS
>gid:537222  glr3859  
MEAHKDDYPVALMARVLQVSRSGYYAWRRRVPSKRQLHNQKLAECIEEVF
TASRATYGSPRVHATLRAQGIAAGRHRVARLMRRAGLVARVRKRRYPRTT
DSRHGYPVADNLLARQFGACEADSKWVADITYLPTCEGWLYLAVVMDLFS
RRVVGWSMATHLRTELVLTALQAALAKRVPASSGLLFHSDRGSQYASWAY
QQALSAAGITCSMSRSGTCLDNAVAESFFGTLKVELVYRLGPLDRRQMRT
TVAEWLEVFYNRQRRHSALGYRSPDEYERHYCKEVKLSKTAVHQPTVH
>gid:537249  glr3886  
MGKLARLPIGAAGLVYGYARVSTAEQADSQHALTQQCERLKRAGTDELLI
DVQSGRKDDRRNFQKLLRLAEGGHVREIVATRLDRLGRNVRAILELVDKL
DDLGVALRLLDEKVDTSTASGRMYLTIRAAVDEQESRLLSERVSHGMKHR
RLRGAAHPKPPFGYRMGADDRYVLNVTALCRIADHSEWTIEGIARWLIAE
FLRTGRVRGTLKGCVEMFGFTPFTHIGFSGWLTSPALRGHIVYGDGTTYP
DAHPAYLDADTARAAKLALERGKQLGGWGSAEGPRNRPLTGLVRCPLCSG
GTYYRDQRTQRKMADGSTKTYRRESYHCSAAGREGSCSNTKSVTVECIEA
AIEAALRAKAVELASMVSAGLGQPVEEPETVKALRAQLSALEAIPNPVGA
LQDAMGHLRRQIDNLLGAAAGERKEADLKAEQLHQAFADPDFWATITQEE
RRTVYRDLVRKILIAGGKVVAVELMI
>gid:537315  glr3952  
MPRLSGFEIAESVQELSRLLRECVSTRGKERLQVLYLYKAGLMGCERDLA
AFVGRNPSTVYRWLQCYRRGGLRRLLSPKSGGGRTAGIRGTVLDKLVAYL
EAAQGFNSYKQVQAWLRNECGLEVSYKVVHATLRYRLGLRPSSGPPGTGN
A
>gid:537381  glr4017  
MPIVESMVVQNCYRPVRLLGANGTRQTWLAAEEKTGQTVTLKALYFGEGA
LWQDFKLFEREAETLQSLSHSRIPRQRDAFWWEQPEGNYFCLIQDFIPGQ
SLEASVRNLGPLPEAEVRTIAAEVLEILIYLHRQSPPVIHRDLTPANLIW
GEDGRIYLIDFGAVQAVSAPEKTMTVVGTYRLYAPGAVRGPHGPCVRSVR
PRGDAAFFAHRDQSWGTAPRELSDPPGGEGERRSEALAGANARSRFAVPF
CECLRRPRSTRSAGKSACADSERSTNRQLHCPEANCRTTHHRG
>gid:537405  glr4041  
MERYFATVARGLEAVAAAELERLGAQRVEAAFAGVQFRGDRALLYRVNLW
ARTIFRVLAPIAEFAAPDRERLYRAVQKIDWQRYVPVEATLAVDATGGNA
RLNHTHFTALQVKNAIVDQQRDRTGRRSSVDAHRPDVRLNIHIDNDRAVL
SLDSSGESLHRRGYRPAMGAAPLKETLAAALIELAGWDPAQAFVDPLCGS
GTLPLEAATKALAIAPGLFRDRFGFEGWPDFDQPLWESLRAEAKSMQRTQ
LSAPIAGSDSDVEVLQLARENARRCGVEKLVTWSATELAHLEAPSDRGVI
LCNPPYGERLGEASALGELYSSLGDVFKQRFKGWQAGILTGNRELAKQIG
LRPSRRLPVFNGSLACTLLLYELY
>gid:537429  glr4065  
MRFFTPPCRPSAQKRDLQGRCPNDPQPHRVRAWLTAKHDEHFDDKVRHIC
ALYQQAPGLLAENAVVYSTDEMNGVQALERLHPDKPMRPGEPVKKEFEYI
PHGTLSLIVHREVATGQVIAPFAAPTLEAENCVLAVVLAISERPEVKKAG
GIGSGLCAGQPHGQCRRVLPQGCPERFGGGGGGRGQQPLGEDRLRLQSFW
NRQHRRRQPGEHLAAEQIDRSVGVPGWLGFVPVIGSALQAIGDFQAFRCT
ASTR
>gid:537436  glr4072  
MTPEQWKQIREALSVALDRQGTDRLAYLDALRLEPQLRQCLDKLLAAHDQ
QDGFLSTPILGSPLESLLANRSPTDVSWVAGRRLGAYRLVGELGRGGMGV
VYLAERADGLFSKRVAIKVLQPGRGAPLLLERFVQERQILANLEHPHIAR
LIDGGTSEEGLPYLVMEYIDGEPIDCYCRKQQLPVRERLALIEKVCRAVH
HAHTLQVLHRDLKASNILVDGAGEPKLLDFGIAKLLDEQAPEAEQTATEW
RMLTPSYASPEQIRGEAAGPSSDVFSLGVVLYELLTARRPAGLNVGPLDE
MLWTLGEQAAVPPSRAVAAGTDAGLLADPQKVQIDGSIDPLVLCALAKAP
ADRYTSALAMAEAIRGYLADHTPASGPLSFPPKAPRPALTGRLRQPAAIA
AGIGVLAVSLGVGGFWWTSNFGSPASAAARTIAVLPFANIDGDSRSAYFS
DGMTFDITNQLGKIADLTVIASSAAMQYRGTTKALREVARELGAGTILTG
SVRRNGNRVRIVSQLVDPATGQQLWSQDYERQLKDVFAIQAEVSEQIARR
LQARLSATEKLRLTQVPTASITAYDYYLKGRDYLGRRSRADNDLAIELFK
RALALDPNYALAHAGLGSAYGSKATRYGQEERWEAESLKAIKKALVLDPN
LSQAHKALGSYYYGRGRYRQALASFKRGAELNPSLPVVAGAYGGLSAAMG
NLEEGVRWSKRSIALNPTRNGYPNLGMIYTILGEDARARRALEAAVSIQP
DNVYALSYLSTLHKLRGQYDEARKTAQKILARDPQEVFGLTAAGDAERFA
GRWSEAKAHYEKVLKITDRLDGESGQLQSTTILADIARREGQPTRAAQLL
ARSFQIDREAIDGGNEYYFYPFDLAAAYAVQGDKSSALRWLRRAIKAGWR
DYRWLKLDPVFERLRGDGEFEQLVAQLKAQVEAMRQRVIAAESAES
>gid:537439  glr4075  
MAKPYSYDFRQKVLQAIELNGLKKSEASELFDISRNTINLWSQRKAETGD
VQAKPRPASHKGQKITDWEKFRAFVEAHGDKTQAEMAQLWDGQISSRTIS
RALHKLGITRKKRPTGIANAMRRNAQHS
>gid:537440  glr4076  
MYVDQSGMDERDDYGYGWSPLGERFYGLKAGRRQGRINMIAGYRAGQLIA
PFTVEGACNRTVFEIWLESCLIPVLQPGEWVILDNATFDHGGRIAALIEA
AGAHVLYLPPYSPDLNRIEKCWAWLKSRIRKRLRDCGHLRNAMDAVLKQA
AS
>gid:537471  glr4107  serine/threonine kinase
MARSKVPSECCERHGGLTRFCPVCGRLLAGGEVLENPESGHRYERVATLA
QGGMSTTYLVFNHQNDRLAVLKEIDADLSRKAKARELFLREAQVLAELDH
GGIPRFYDYFSSDERHYLVMEMIHGLTLEQVQPRSAAQAAGWMIEACNVL
VYLHGLQPPVIHRDIKPANLILRYNPREVVLIDYGAVKLAGGRQGTRIAT
PGYSPPEQGRGRPCLQSDIYGVGMTLVFLLTRQFPGRFYHPRERRLVGLE
EAGIEAPLAAVIAKATAYLPQERHRDSQELAHALAPFALG
>gid:537556  glr4191  
MVAAKKVQYSLRQEELMTIRKEILDELLKDYDGTDPQTILGEGGLLKQLT
KAVIERALEAEMETHLGYKKHEAAGKGTGNSRNGKSQKTLQAECGPVELH
IPRDRNAEFEPVVVRKGHTRWINGWSDTPGGTR
>gid:537557  glr4192  
MTTRDIQAQLQEMYGVEVSPTLISNVTDAVMDEVRQWQNRPLETVYPIAY
FDCLQVKVRDNGRVVNKAVYLALGVDLEGRKELLGLWLSAHEGAKFWLGI
LTELNNRGLKDILIACVDGLTGLPEAIESVYHGCLVQLCMVHMVRNSCKY
VSWKDRKSLCADLRSIYSAATEEEAELHLGGVVPTPLELLSEKWDKQYSS
VSRMWRENWGSLPQRPWVRVIPIFRFGEDIRKVIYTTNAIESLNMTIRKV
SRNHHIMPNDESVMKMVYLAIQNQMKKWTMPIRDWRPALNRLTIEFEGRL
KV
>gid:537608  glr4243  endonuclease V
MSVPNEPDSSLAMHSWNLTPQQAIEVQKQLAVQTVRTGNPEGVQRVAGVD
VSFNPREPKALVHAVVVVLSYPGLEVVDRQAVSAAVDFPYIPGLLSFREA
PPILAAIGQLSQKPDLVIVDGHGYAHPRRLGIASHLGLFLDLPTIGCAKS
ILVGRADGDLAEAAGSLTDLLWRGEVVGRAVRTRSRVQPVYVSPGHRLGL
DSAVEWVLRCCRGYRLPEPTRQAHNYSNLVRKARQPISLNGLSGAK
>gid:537687  glr4322  
MQPSVWQPPVEPSPAEQAILKRIRRAKLFVFLRRIRHQLFDAAFQSELAG
IYKDSPCGQPPIPPAQLALATVLQAYTGISDDETIEVLTMDRRWQMVLDC
LDCEEAPFGKATLIRFRQALIAHGLDRRLIERTIELATSDGGFGSRALRA
ALDSSPLWGAGRVEDSFNLLGHAMRKALRIMVCQTGVGLAQWAEQTGTTL
VAGSSLKAALDIDWSQPDECAEALGRLLGALESLESYLDGQTQPPQAGVA
RCLQAAEQVHQQDVQLNSRGQFVLRRGVSRERRISIEDPAMRHGRKSRAV
RIDGYKRHVLKDIDSGLVRAVGITAANRPEAAVTRDIEADLEPQRVKLIE
LHIDRAYLSSEWVRLRPEDLRIYCKAWPVRNGPYFQKTAFVLDWEAMRIR
CPQGVTQPFEVGGKVRFPAARCRRCPLQERCTTSPKGRSVSIHPDELLLV
ELRERQQTKEGRAKLRERVSVEHSLAHIGQWQGRRARYLGVRKNLFDLRR
VAVVHNLHVLARQEAAGRSQAA
>gid:537699  glr4334  
MPMPSSDSPTANGFGKTDTEVRPQPERRHFSAEYKLKILEETDKASQPGQ
MGSIRRREGLYSSLIAEWRKQRKQATLQALKDQPAGPKSSADTGLKAENA
RLQKHVQQLHNQLKRAELLLEIQKKASELLGLDLSQTPSSDAPGTR
>gid:537700  glr4335  
MLSVLRSERFVDQSPRQIYATLLDEGTYLCSYRTMYRLLAEAGEVRERRH
QRRQPLYSKPELLATAPNQLWSWDITKLKGPQKAQHYHLYVLLDVYSRCV
VGWLVAAQESAELAEQLIAQSSTREGIARDQLTIHSERGAAMTSRTVAVL
LSDLGITKSHSRPHVSNDNPYSESQFKTMKYQGRCPNDPQPQFPERFGSL
EDARAFCQRFFAWYNHEHCHSGIGLMPPASLHTGEAHQRQERRRQVLAAA
YTQHPDRFVRKIPEPPALPEAVWINKPKDSSTASSAETQPSCSVPDSESR
L
>gid:533601  gsl0285  
MDNDAEYYVLLRTGQEEQFLTEVELQAVLADAVRRAEGLEGEALAHQVKF
LLDTACDYPTLPGEYLQWYSVRLEKK
>gid:534961  gsl1627  
MLLVGLASGTVIGEKGYDADERVIEVLECAGKTAVIPSKSNRKVAREYDK
DLYKARHLIENFFARLKQYRAIATRYDKRAIHFLGAVYLAAAVVWLN
>gid:534966  gsl1632  
MVPNPPSAFVPTVANYDKIITPAMANVIKNQKYEDLMVSYQGVILGQGEA
QINGTCLDNSCNKVDVKVVTIQGKSSSK
>gid:535189  gsl1853  
MLKNYGQVLASKRHRAAGKATGTTSCIERFNNTVRQRVGRLVRKALSFSK
CLSNHNAGARKLRYCGNTPAG
>gid:535468  gsl2129  
MLLNEYEDYREAQENIARFIEEVYNRKRLHSSIGYLSPVEFEAEIKCETS
QASHPT
>gid:536782  gsl3424  DNA binding protein HU
MNKGELVDAVAKKAKLAKKDVDAIVGAAFDVIVEAVSRGDKVTLVGFGSF
EPRKRAAREGRNPKTREKMTIAATTVPAFSAGKQFREAVSGSAQAD
>gid:537084  gsl3722  
MQTQAYLFDLSEYQWQLIASYIPPARFGGRPRSADTRAVVSAIFYKEMTG
CPWRQLPADLPPWPTVYAYFRQWQDSGVWRRICSVLDTRFARESSAA
>gid:537219  gsl3856  
MATRAEYLRNPSHRLVFHYTPKHASWMNQVEIWLSILARKVFKRGSFQSV
EQLREKVWSFIEYYNAQWARPFKWTYQGKPLEA
>gid:537497  gsl4133  
MELQRPFRRYRCSVQRGLDRWPQPGNILLNVPELGPGSRWFVCPRCEHKW
SLTPEDILAGTDEAPTLLGEAVCPACGAHYHIEQGRIRPLPDIELRR
>gid:533474  gsr0161  
MDLEALHSKRDLLYREAREIHQQRLVLDRNLRRVTGTDPMACQRRALLQA
EFDELTIHLRTLSTKIRALSQQIDARLTTVEVWRNEHTG
>gid:534307  gsr0985  
MREVSIDMWGGFTKVVQQVFSKAVIVYDRFHGTRMVVEAVKKIAKQCGFR
KCKEQACLLKNGVDLSIEEQEELETRLKSSWRLRKAYAYKEEFRSI
>gid:535059  gsr1724  
MVDVLGLVPKVFNSSDHEGLQALAMSMKGRLVRLKKIIANQGYTGSCTEA
VERVCGWKVDIV
>gid:536607  gsr3253  
MIPPKRNRTQPRTYDKHLYKARHPMENFFARLKQYRAIATRYDKRASNFL
GAIHLAAAVIWLN
>gid:536925  gsr3564  
MGSMKYRGWTIATSKASEGFVALLTDPDGKRFDEPLVFLASPELAELYAR
NFINWYIDLEEERRMTEGSMRTSMI
>gid:537228  gsr3865  
MPAMSALRCNGVLERVWKRLLERGKSKMAALVAVMRKMVHSMYGVLNSQR
PFDPNYGSSTA
>gid:536432  gyrA  DNA gyrase subunit A
MTTIIPTNLRNEMQRSYLEYAMSVIVGRALPDARDGLKPVHRRILFAMHE
LGLGPDRPYRKCARVVGDVLGKYHPHGDSAVYDALVRLAQDFSTRYLLID
GHGNFGSVDNDPPAAMRYTECRLTPLTRDTLLADLEAETVDFSDNFDGSQ
QEPTVLPSRLPQLLLNGSAGIAVGMATNIPPHQLGELVDGVTALLANPNL
GIPELMRYIPGPDFPTGGTILGQSGIREAYTTGRGSITMRGVAAIETIQV
RGRPDREAIVITELPYQVCKAALIEKIAELVNDKRIDGISDIRDESDRDG
LRAVIELKRDAYPRVVLNNLFKLTPLQTNFGCNMLALVNGEPRLLNLKEF
LQTFIDFRAEVITRRTHYELRKARERDHILMGMLVALANLESVIALIRAA
DDTASARAELVNRFELSEAQAEAILQMQLRRLTGMEAGRIEAEHRELIAK
IADLSDVLERRERIDQIIRDELLTAKQRLNDPRRTIIEKAEGEIDDADLI
ANQEMVVLVTEQGYIKRLPVDTFDRQRRATRGKSGARMREDDAVEHFITC
CNHDTVLFFTNRGVVYALRGYQIPAGSRTAKGTPVVQLLPIPIEEKVTSI
VPVQEFASDEFLVMLTRTGFIKKTALAAFANLRANGLIAIGLEEGDELRW
VRRARAEDDILVGSRAGMAIRFRADEEQLRPLGRTARGVKSMELRKGDEL
VSMDILSCEQRERAEANGECGPWVLVVTADGFGKRVPFSEFRAQNRGGMG
VIATKFRDQGDELAALRVVSCEDELMIVTARGVIIRQQSEEISQQSRAAT
GVRVQRLDEDDTIVGVAVVPESAEDGGSAFAAEEEEEAEA
>gid:535387  gyrA  DNA gyrase subunit A
MPRQLELTGPEPKARVVPTALHSEMQRSYLEYAMSVIVGRALPDVRDGLK
PVHRRILFAMHELGLAPDRPFRKCARVVGDVLGKYHPHGDTAVYDALVRL
VQDFSSRYPLLSGHGNFGSIDNDPPAAMRYTECRLAAVGVDALLAEIDDQ
TVEFTDNFDGSQREPEVLPARLPVLLLNGSAGIAVGMATNIPPHNLGELV
DGLVALIDNPELADEQLLRIIPGPDFPTGGQIIGTEGIREAYLTGRGSIT
MRGITHCEEVGSGRRRKPAIIVTALPYQVNKAAWIEKVAELVNLGKVNGI
SDLRDESNREGVRVVLEMKREARPEAILAGLYRLTPLQANFGTILLTLVG
GQPRLLKLRELLEQFLEFRVATLTRRIQYNLRRASERAHQLEGQLVALAN
LQPIVQMLSAARDAAQARAELEASYGLSERQSEEILQMPLRRLTQLDRER
LASEHRELREKVIELQSQLDSRRKLLGLLKRELRDLKKKHADPRRTCIEA
RTAPLPTLAEITSDDEVVVQLTHRGYVRRIPAAAFERRSRSRRAVSRTED
NDFVVESYPTRTRNELLVITRSGRAYSLKVSEIPETGPRSRGTPLVTLLS
IQGEQIAATFVRENYPPDLFLVLLTREGRIKKLLLSECANLTGRGLMMLK
LSEEDQLIQVGLATAHSQIVVGTSAGRLLRFVADEAQLPAMGRSALGLQA
IKLRRGESLAAMTVVGNGEDLLMVSRLGLGKRLKTGEIRLQERAGLGTPV
GLLRERNGDSLLAVLAVGAEGEVAMATDQERLVQVDLGELAATDRSSPGK
PLAVLNPAEQIIAVTRLSAEPEETD
>gid:535850  gyrB  DNA gyrase subunit B
MTETYGAEQIQVLEGLEHVRKRPGMYIGSTGPRGLHHLVYEIVDNSVDEA
LAGHCKHIAISLGADGSATITDDGRGIPTDTHPRTGKSTIETVLTVLGAG
GKFGGGGYKVSGGLHGVGAAVVNALSTILTATIWREGKQYIQRFHRGIPE
GGLAVSSDSQKRRGTMISFLPDPEIFTTGVDFDFDTLLSRFRELAFLNAG
VEFRFSDLRGETERTESYIYEGGIREYVKYMTREKAALHPDIIFINQERD
GVSVECALQWSTDVYNDNVLGFANNIRTIDGGTHLEGLKAVLTRTFNSFA
RKANKLKDGDKNLTGEHVREGLTAVLSVKVPNPEFEGQTKTKLGNPEVRG
IVDGVISEKLTEYLEFHPDVVASILEKALQSMQAEEAARKARDLVRRKSA
LESSTLPGKLADCQSRDPAESEVFLVEGDSAGGSAKQGRDRRFQAILPLR
GKILNIERADDRKIYGNNEIQAMITGLGLGLKTEEFDVGRLRYHRIIIMT
DADVDGAHIRTLLLTFFYRYKRDLVEQGYIYIAQPPLYKISIGGGRNIDV
RYCYSEQEKEAILRGLRENQKYELQRFKGLGEMQADQLWETTMNPETRTL
RQVTIEDAAEADRVFNILMGDRVEPRREFIETYGPRLQMADLDI
>gid:534154  hup  DNA binding protein HU
MNKGELVKAVAERTKITLKEADAVISAVFDEIQDTVAGGEKVTLVGFGTF
EARKRAEREGRNPKTNEKMIIPATVAPAFSAGKTFKEAVLTNNK
>gid:534604  mutL  DNA mismatch repair protein
MWCEHPREYRALRSYCFLRRFVCDQSRHRLALISTLVRSNSPLGAIRPLA
DQTVRLLAAGEVIDSPAAVVRELVDNSLDAGADRIRVSFWPESWRVQVQD
NGLGFEAEELPMAARSHATSKLGAIEDLWRLRTLGFRGEGLHSIAVVARL
EILTCTPTARTATRARYDHKGELVESQPAAAAPGTVVTVSELFELQPVRR
RFLADTKAQVRAVTQLVHRYALAYPGHLFELAVDGRLQLQLWAAPGLKQR
ALQLLNSYDEHDLREVHLERDGRRVRLAIGLPDRCSRARADWLQIYINGR
FVRHGELEQAVRAGFERTLRPGRQPVCIVQLILPADEVDWNRHPAKLEVQ
LAGTAPACELVVAAIGEGLRHFAPKPAAALLRLAEAHLPYAAESTGGELL
LKALAQLHQTYILAEYPGGVCLVEQHVAHERVLFEALESDWQVVPLEPPL
ALELSSRQAQNLTEHGIEVEAFGERSWLVRSAPVALVGRVDCAEGLMELA
DQEDAAQMRAALSCRTAIRNGTALSPLQMQQLLDRWQRTRNPHTCPHGRP
IYMPLTDGELARFFRRRWHICGS
>gid:534355  mutS  DNA mismatch repair protein
MADPATQLSQWDFRRFARSDLTPMLQQYVEVKAQHPHCLLLYRMGDFYET
FLADAEIVSRELEIVLTGRQAGDKIGRIPMAGIPHHALERYCAQLIEKGY
AVVICDQVESPEQAKERARQAKVARRSKSDGDAPLLPLLLEDGEQIDWEG
AESVLVRRAVTRVLTPGTVLEDQLLVGRRNNYLAALVQAGECWGLAFADI
STGEFQVTQLESAEALVQELLRLQPAEVLLSGDAPDPLVLLRPGEASSER
PECLPSQFCYTLRPRRYFELDEARRLLMETFGVRSLEGFGCENLPLAVRA
AGGLVQHLLETQRGVSIPLEGIRTYTLSQYLILDHQTRRNLELTQTVRDG
AQYGSLLWALDRTRTVMGGRALRRWLLQPLLDTRAIGRRQDSVAELYDEG
LLRERLQRILESVYDLERLAGRCGSGTANARDLVALGESLLKLPALAEAV
AASTSPYLKALQSIPVELERLGEKLRRTLVDTPPLILTEGGLIRAGVHPE
LEGMRGQLIEDRDWLVDLEARERARTGIQTLKVGFNKAFGYYLSISRGKA
EKAPPEYLRKQTLTNEERYITPELKERETRILNAQQQTNQLEYDIFNILR
QEAGRHVSALRQVARRVAALDALAGLAEVAVYHDYCRPVLGEGREVHIEA
GRHPVIEQAIPAGFFVPNDARMGAEAEPDLIILTGPNMSGKSSFIRQVAL
IQLLAQVGAFVPARGAVLGVADRIFTRVGAVDDLATGQSTFMVEMTETAN
ILNHATPRSLVLLDEIGRGTATFDGLAIAWAVAEYLASHIRCRTIFATHY
HELNELASVVSGVANYQVTVQELADRIVFLHRVTPGGADRSYGIEVGRLA
GLPPSVVARARTVLAQVEQHSQIAVGLRDSNGSASESAAG
>gid:534191  mutT  mutator protein
MPKAIAIGIVCFAGKVLIDRRPVDAALGGLWEFPGGKILPGETPEACVAR
EVLEEVGLTVTVGELLAILEHDYSDFFVRIRAYLCHSESDAARAIACDAV
EWVEPRELDGYTFPVANAPLIPLIQQRLCP
>gid:534157  phrA  
MSTKTVLVWYRNDLRVHDHEPLTSALHKNARVVALYCFDPRQFGKAPFGF
EKTGPFRARFLLESVADLRRSLRQLGSDLLVRRGHPEEVIPALVSELEIA
AVHYHGEVTSEELVVERDLQAALAPLNVPVRSFWGTTLVHPDDLPFAIEA
IPELFTDFRKQVERSAAINPPLPAPAKLPPLPAVDPGEIPQLADLGLESP
VTDERAVLQFKGGETSGLARLEEYFWQKSLLKSYKQTRNGMLGADYSSKF
SAWLALGCLSARYIHEQVQTYETKRIKNDSTYWLIFELLWRDYFRFIAAK
HGDRLFYTAGLRGLDIPWKEDWERFELWRTGQTGFPLVDANMRELAATGF
MSNRGRQNVASFLTKNLGIHWHMGAEWFESRLIDYDVASNWGNWNYTAGV
GNDARGFRFFNILKQARDYDPDGAYVKHWLPELAGLPPARVHEPWKLLPV
EQKRFGVRLGVDYPQPVVDLFQSAAANEAIYNAAWEHHHRKRRGQPRSRT
>gid:534105  phrA  DNA photolyase
MSSIAIVWHRRDLRVHDNPALWQASRTGGQVLAVFIVDPTIVERDDTAPA
RIYFLRESVLELQKAYRTIGGRLAVRVGEPVQQLVALAQAVGAGAVYFND
DIEPYARERDARAAEALRAAGITVHACAEILLHPAGEVLTAAGGKPYTVY
TPFWRQWSAKPKPKPFPTPERLEAPTVEEQSFPELAQLGRPFAGELLVSP
GEQSGLEQLEAFAREGLYRYGERRDLPGCDGTSRLSAHLKFGTVGIRAVW
ARTMDAWKQAEHDRDRAGLAVWQQELGWREFYKYELFHLPQLAGRPFRRE
FENFQWDEDQERFERWCQGETGYPIVDAAMRQLNTVSWMHNRLRMIVASF
LTKDLLLPYGWGERYFMQKLVDGDLSANNGGWQWAASVGTDPKPLRIFNP
STQAGRYDPKAVFIRRWLPELEGVDTALLVTSERLPPLMRAQHRYAQPIV
EHKRQQQVFKERYRAVRAQQADPGARDSE
>gid:533955  polA  DNA polymerase
MPDSSAPVLLLVDGHSLAYRAYFAYVRGGETGLRTSGGTPTSVSFGFLKL
LLDAIERDRPSMVAVTFDTRMPTFRHEVDATYKSGRAETPDEFIDDLQNL
REILTALDLPQFELPGYEADDLIGTLAVHGAGQGYDVKILSGDQDLFQLI
TDEGAPGGSIRVLHQNTRTGTEEFGPAQVKEKLGIAPRQVVDYKALCGDS
SDRIPGVRGIGAKSAVKLLEEYGSLAQLIEAVDTIPGALGKKLKEGVEDA
RHSYWMATIETNVPLVVDFEACRLVGFDAERVAPLLEKLEFRSFLRQLQR
LQRSFGGTPSARPTALNEDADNPLAGIGTASEDELWFDFAPSVPMDLEVR
VVQTAEDFQAFLDALLAQDGLVAWDTETNNLDPRHARLVGIGCAWEPGVA
YYLPLAHQQGSNLETDAVVAALTPYWQDRERPKVLQNAKYDWLVLRNYGV
ALAGIAFDPMLASYVLDPEGKHNLMTLAQNHLQITMGSYEALVPKGQTID
AVEIAAVSRYCGEDAAVTLRLVPVLQAKLDEDPRLAGIFKEIEVPLEPVL
ARMEERGIRIDKAYLGELAQELDRDLESLEQEAYTLAGSKFNLGSPKQLS
DLLFNKLGLSAKKSRKTSLGYSTDAAVLEKLRDDHPIVEAILSYRTLAKL
KSTYVDALPLLVDPRTDRVHTDFNQTVTTTGRLSSSNPNLQNIPVRTSFS
RRIRRAFVPEPGWLLVAADYSQIELRILAHLTQEPVLLEAFQTGGDVHTL
TARLLLGREEVTSEERRLAKIINYGVVYGMGARRFARETGVSATEAEDFI
KAFYRRYPAVFGFMEQTRRMAVEQGYVETLLGRRRYFRGLGQLNQRDREG
ALRAAFNAPIQGTAADIIKIAMVRLEQTLAGRRTRLLLQVHDELVFEMPP
EERPEVEPLIRSGMENALDLLVPLKVELNAGPNWLEAK
>gid:536823  radC  DNA repair protein
MLPAMHTPRIAELPIAERPRERLLTHGARGLATAELLAVLFGGGQAAVRL
SALGLAQQVLHTLGRAGGEPLVHLRDVSAAELMLQPGVGPARAAAVLAAI
ELGRRVFVVRSSGERPVIDGPQAVAAVLGGELAFARQEQFAVLLLDVKNR
LIAHRIVSVGTIDETLAHPREIFREAIRQGAAGIIVAHNHPSGVTEPSPE
DLRLTGQLIECGRTLQIPVLDHVILGQGNFTSLRRLTALWRA
>gid:536774  recA  recombination protein
MARTTDDSKKAAPAAGTADEAQKQKALKMVLTQIKRNFGEGAIMRLGENT
RIRVETVPSGAITLDLALGGGLPRGRVIEIYGPESSGKTTLALHAIAEIQ
KTGGVAAFVDAEHALDPAYAKVLGVNVDDLIISQPDTGEMAMEIVDQLVR
SAAIDVIVIDSVAALVPRAEIEGEMGDAHVGLQARLMSQALRKITGNIGK
TGCMVIFLNQLRSKIGVMYGNPETTTGGNALKFYASVRLDIRKAETLKKG
QDEYGNRVRVKVVKNKVAPPFRKAEFDIIFGKGISSLGCILDLAVEMEIV
ERKGAWYSYGSERLGQGRENVLALLEENAAQAQEIEIKVREKIASGAAVP
AAAVAAPDEGDDDLGDEEV
>gid:535747  recF  DNA repair and genetic recombination protein
MFLRSVQLHDFRNYAEADLELTSPKTILVGDNAQGKSNLLEAVQLLATGR
STRALRDRELIARGKEQARVAATVERLGDTVELEMILRAGKRRTVRVGGE
TRRTQVEALGYLHCVSFSSLDLDLVRGAPETRRDWLDGILLQLEPVYTNV
LAQFVQALHQRNALLRSTELSPDALAEQLPCWDDLLVRAATPVMRRRHRL
IERLAPLARRWHGSISGGRETFAVRYQPQISFEQEDAQSVQQALQELLKE
KRTLEGRRGTSLVGPHRDEVDLSIDEIPARQFGSQGQQRTLVLALKLAEL
ELLEQVTGEVPLLLLDDVLAELDLHRQDQLLGAIQERVQTIVTTTHLSLF
DSQWLQSATVLTIEKGRIGSPPAPA
>gid:534603  recG  DNA recombinase
MSHTEAFAELLQRLHRALAAEADRGFVNLKGSRQYFAEFLSETLATAPGG
LEDKDSARRWQDLGARYARYADLENAARAHLVAETRRFLHRIRRSLETPA
PERPNGKPTAPPEAAILDQPMTKLGGIGPKLAAQLEKLGLTTIGQVLRYY
PRDYLDYSNRTTIKVCQPGEMVTLLGQVRRCRCFTSPRNHKLSIFTLTLG
DGTGQMQLSQFFAGTRFTHRGWQEAQMKQYPRGATVAASGLVKRSTTGLT
LQEPQLEVLDEGEDLQNLTKIVPVYALAEGVGAGVVRRAVKAALPFANAF
TDPLPAAVRTSLGLLDLPGAIRAVHYPESAEHKLQARRRLVFDEFFFLQL
GLLQRRHRQKRQSAGIAFRTRGELIEQFYKLLPFAFTGAQKRVVEEVLAD
LGSPEPMNRLIQGDVGSGKTVVAVVAMLTALQSGYQTALMAPTEVLAEQH
YQKLVQWLSQLHLPVELVTGSVRAARRRDVLRQLASGELNVVVGTHALIQ
DGVQFANLGLVVIDEQHRFGVGQRARLQNKGRNPDLLTMTATPIPRTLAL
TLHGDLDVSQIDELPPGRKPVRTTVVTPSERTQVNELIRRQILEGRQAYI
VLPLIEESEKVDLRSAIEEHERLKEKIFAEFRLGLLHGRLKSEEKEAVIG
AFRRHELDLLVSTTVVEVGVDVPNATVMLIEHAERFGLAQLHQLRGRVGR
GANQSFCLLMSATKTESALQRLRVLEQSNDGFLIAEMDLRLRGPGEVMGT
RQSGLPDMVLSSLVEDQDSLELARREAQSLIERDPELTAHPLLRAELAGR
LDRLMDGAILN
>gid:533354  recJ  single-strand-DNA-specific exonuclease
MPIGLSSMLPKQRWRIAQEDPVQARQLAEAFGLSPLIAQVLINRGLTTCE
LAEHFLAPESWVLPSAGEAFDQLPRALDLLAQAIESGTAIAICGDYDCDG
MTSTALLLRAVRSLGGHIEYAIPSRLQEGYGINLRIVEAFYERGIGLILT
VDNGICAYAPIARAVELGMGVIITDHHDLPERLPPADAILNPKMIPRTSP
YAALAGVGVAYVLAGELARRLGRTGLADCLLELFTLGTVADLAPLVGVNR
AWVLSGLAKIPGSSNTGVQALLAASGLAGRSDLKPEAIGFALGPRINAVG
RIGDPVTVIELLTTDDPAVAAERAAECELANQHRQALCAQIEAEAARMLE
VEQAGGLDLGCERVIVLAREGWHHGVIGIVASRLKDRHGAPVFIAAIEGE
HARGSARGTPEFHVYEALKDSAELLTGFGGHPMAGGFSLLAANLPAFRER
LVQFARARLAPEQVAPLVEVDAQVQLARVDRRLLGEIDRLQPCGLGNAEP
VFWVAGARVVEQKAMGKTRTHLNLTVSDGTAERRAVGWRMAEWLPLPPFV
DIAFQVKENNWQGHSKVELELVGLRAAHPAPIELPEPPPLDRPIRWHDHR
HCEAAAFLHAFVAAQASPVLLYGHARPQLPRGLPVHYDRPQPAHDYEHLL
LWSLPPSPLHLRWLIAATRPNCVHLFAQAVPTAHAWTLQRHLRATYLPER
PVEILRLAQLWWVSPRALCEALVEAGYLADWRSVVADEWHDWLGAELAGL
AAWYASPVEVLEALLAPQPTLKR
>gid:533587  recN  DNA repair protein
MLTHLRIENFALIDNLALDFAGGLNVLTGETGAGKSIILDALDVVLGGRV
SGAQVRTGAQRAVIEATFGPSAAIADWLAAQQIDSLEEGLVVVREISGKT
NRARVNGVLVNQAVLRELREQLLEITAQGQSLQLEKPEVQLELLDNYGEI
APRRERFRQVYETLQRRKGELARKKAARDDRLQQLDLFRFQFEELSRAAL
DDPQEEEQLLADRSRLAHAVELQHNSLKLYEMLYEGALDQPAVTDLLGQS
SELLIQMGEYDASLVPLGEMLENALVQVQECARAVNRYGETVESDPETLE
YTEKRLRQLKNLRQKYGPTLADVIAHYRNLERDLAAIEGSGEDLEVEERE
LAGETRRATELAAELTRARTEAAGRLEGDLVRELAPLGMGKVRFAVRIEP
SRLASSGADRVGFLWSPNPGEPLQPLTETASGGEMARFLLALKACLGGAD
RIATLVFDEIDVGVSGRVAQAVAEKLLQLGASHQVLCVTHQPLVAALADS
HYRVHKAVLAEAGEERTVVRVETLNKTEARREELAQLVGGHSAHEALEFA
GALLADADRRRSAVRP
>gid:535974  recQ  ATP-dependent DNA helicase
MPDFGAGDAEWTVDDQKLGEALKRHFGHERFRPGQRRIVELAIAGHDQLI
LMPTGGGKSLTYQLPALLLPGLTVVVSPLIALMHDQVDRLRENGIAATFL
NSTLAAGERTRREQAIAQGRMKLLYLSPERLLSEECLAFLEYVQRQGGLS
LLAVDEAHCVSEWGHDFRPEYRQLAAVRERFAALPTLALTATATERVRQD
ILVQLKLRDPHIHIASFDRPNLHYAVLAKDKGAYAELLGRLRRLDGASAI
VYCQSRRAVEALAERLVADGLNALPYHAGMAAEMRSRHQTQFLRDDAPVL
VATVAFGMGIAKPDVRAVFHYELPRNLEGYYQESGRAGRDGQPADCVLFF
SPGDRAKIEYLVAQKSDPHEQRLARSQLAQMLAYAESTVCRRRILLGYFG
EALAEADCGGCDNCRSPVATQDRTVDAQKLLSCVARCQERFGLRHIAQVL
RGANTQKIRTQGHDRLSTYGIGADLSQAEWLHLGRTLVHQGLLIETTDGY
PVLKLNALSWEVLRRQRTVTAARLPARPTSPQPADAAADDAAQGLFEHLR
RLRKRLADEQNVPPYVVFADAALKAMAQRRPQTMAQFLAIPGVGTRKAEA
YFTPFTAEIRTYCEALALAPEPAHGEISPPEPPRRARSPQPAEVIPSTCA
LTLDLWRQGLTVEQIAERRSLRIETIEGHLAELVESGEDIAIERLVGPDR
RRTIEAVLLKLGTAVLKPVKEQLGDEYSYGEIRLVRAQLLGGHGDF
>gid:537343  recQ  ATP-dependent DNA helicase
MPDCAAATCPGGPPRGQPALSPGAKLIAMHDARQVLQKLWGYADFRPQQR
PIVEAVAGGRDVLAVLPTGGGKSVAFQVPALLRSGTTLVVTPLVALMEDQ
VARLRSLKVAAACLHGEQSAAVRSESLAGIETGRWALLYLSPETLLSPPT
WARLQGCPIARMVLDEAHCLTGWGSSFRPDYHRLGAARRALGHPPLCAFT
ATAAPADRARIERWLGLVDPVRLVIAPYRPNLAIRVRWMYTTRQRCGAVG
AFLASRPDTSGLVYLRTRAGTEKLAFELAKAGYRTTHYHAGLGAAARRSA
ERDWLAGRLQFLVATNAFGMGVDAPHVRWVIHAHTPPSLEEYLQEIGRAG
RDGEAATALLLASEPTGWLDPTDRLLHRHFAANRREQWQAAEKALARLPA
EGEYRPELALSLALLHERGRLVWKTPFDYRLVETPPTRPPEMGAGVQDFV
RTCRCRWQFLMAAFGEAASPPCGRCDRCTG
>gid:536857  recR  recombination protein
MYTRPLARLIEHLQRLPGIGPKTAQRLAFHLLRRPKSEALQLAQALVEAT
EQIGVCSRCFNLSAEDPCDICRQPGRQGETICVVAEPRDLVAIERTREFK
GHYHVLGGLINPMEGIGPEQLRIKELLQRVGADTVKEVILAINPSTEGEM
TTMYLSKYVKVLGPRVTRIAFGLPVGGDLEYADEMTLARALEGRREI
>gid:536973  rnhA  ribonuclease H
MFRIATDGACSGNPGPGGWAALVVGEGTYEEVFGFEPHTTNNRMEMRAVI
EGLTRVPADAKVKVLTDSQYVIKGMSEWLPGWKRRGWITSTGKPVENRDL
WEALERAVGGRVLWEHVRGHTGHPENERANTLAQTAARGGAPAVGARTVP
TNGTTYLSLVDGHLSRSTGWSACQALVQGVSGARYKKCRNRAEELDTISA
WGLPPESLVQLEG
>gid:534840  rnhA  ribonuclease H
MDEVGRGALAGCVVAAAVVLPPDIPLAALAGITDSKLLDRPRREALHERI
LALASAVGIGSASVGEIDRLNILRATALAMGRALHRVAPVEHVLVDGLPV
AELGITQTAIVGGDRTSLSIAAASIVAKVTRDRWMERLDRRFAGFGWATN
AGYGTAFHREALARLGVTPLHRRSFAPVRLVVQKNLFDEPPPGGHFVE
>gid:536233  ruvA  Holliday junction DNA helicase
MITFVRGMLAEVGPRSGQSWATVDVGGVGYRVWTHARTVGKLPRIGEEVK
LFTLMIVREDAMQLFGFLEPGERELFGQLVSVSGIGPRMGLALLETLAPT
ELVQAILQGNTRALALAPGVGAKTAQRLALELRSRLSKWREESGLSAMGA
RASSRVYEEVELALLALGFAPGEVVRALDAVAPAMAGEEQTEAWLRAAIA
WLSEQG
>gid:536470  ruvB  Holliday junction DNA helicase
MAIISSRPSGAEGEPAKSRTAPAEPAQRSRTAVQSERAHEDQHEESLRPK
NLVDYAGQKDLKAVLGIAVAAAKSRGESLDHLLFYGPPGLGKTSISLILA
REMNVQIHLTTAPALERPRDIAGLLVKLRRGDILFIDEIHRLPRLTEEIL
YPAMEDYRLDITIGKGQNARITSVPLPRFTLIGATTRVGALSSPLRDRFG
LIQRLRFYDVEELAGIIVRNAALLNISIDEPGAAEIARRARGTPRIANRL
LKRVRDYAQVEGDGRITEPVACAALELFEVDPRGLDWTDRRLLATLIEHY
NGGPVGVESLAAATGEDTQTIEEVYEPYLMQIGYLLRTARGRMASPAAYR
HLGYAPPQRFEQVSLLDSIDSAHGRN
>gid:536655  ruvC  crossover junction endodeoxyribonuclease
MCGMRILGLDPGVAILGYGVLDFFDSAPPVVCDYGIVQTSAKTAFEARLA
AIYEDINSLFSAHKPDLVAIEKLFFYKMGNTISVAQARGVVLLCAAQHGV
PYVEFSPPQVKLALTGDGRADKRAIQEAVQRELGLITMPKPDDAADALAI
ALTGWFHHLPPAAREAVPAYSVG
>gid:536551  topA  DNA topoisomerase I
MSKSLVIVESPAKAKTISKILGKNFVVKASAGHIRDLPQKEMGVNVKNDF
EPKYVIIPKKEEVVEELKGAARNADRVYLAPDPDREGEAIAWHLAQILDI
PGDRLQRIEFHEITKQAIQNAVAHPRDIDINRVDAQQARRILDRLVGYKL
SPLLWKKVQRGLSAGRVQSVAVRLLCDREKEIQAFVSEEYWTVHGRFRQS
AQVEATPFSADLVRWSGKKPDLGNETAARAVVEVLTGAASRVDSVKTRER
QKEPQPPFITSTLQREGASALGLTVKRTMAIAQQLYEGIDLGEEGPVGLI
TYMRTDSTRVAEEAQEAAREFILAAYGKSYYPSRRRQYGAKKGAQDAHEC
VRPTDINRPPDAIKKSLTPDQFKLYRLIWQRFTASQMTAATLETRTVEIA
ATPTAGRPDALFRVSVTRTIFDGYTRVYEEAREEAATGDEEAAGAAPVLE
EGEALTLLTVDPKQHFTQPPARFSEATLVKALEEQGIGRPSTYAPTIGTI
QERGYVNKDGRTLIPTDLGMKVNDQLVQHFPNIVDTEFTANMEAQLDEVE
KGSQRWTQLLADFYGPFVETLKAADEEMKRVVIVTDHLCESCGRPLLNRY
GRFGNFLGCSGYPECDFTHQLTRDNKPVPKDRPAEGISCNQCGHAPMLIT
YGRYGEYLKCPACGKSQPKSTGITCPKCSKGQIVERRSKMGKNFYGCDQY
PDCDFVLWSRPIDKPACPECGSILLYKPRKRGDDMVACSACKFTAPAQEM
VGEEEVERQAELLTG
>gid:534342  topA  DNA topoisomerase I
MMRLLICESPGKIKTFKAILGAGWDVQASLGHVMELANDGADHLGFDVGP
EAITCRYVPRGERGMATLKKLRTAAAKAQEVYLATDPDREGEAIAWHLAR
ELRLKSPRRIRCTQITEGAVRTALQSPGRLDMDLVSAQRARQCLDKLVGY
KVSPLLWNATGGKSAGRVQSATLHFVCEREREIIAFVPVDYWSVWVEYAP
GWRAYYQGDVQETPESTEATDDAKSNKEAPPAESTRVLSEAEADRLVALA
RSHPHAVERVERRTVTKIPPAPFTTSTLQQAAGALLSYNPERTMGIAQQL
YEGIDLPTGRKGLITYMRTDSVEVAPEFVEQARTYLQINDPDNLPARSAT
HRSRAGAQQAHEAIRPTDVSLTPRTIRAHLSEEQLRLYDIVWRRAVASQC
APARLAKTKILTRSGPVHWQALGMTVAFAGYTRYWNDLEAALALPALEAG
QNLELLRSAHEKKRTQPPPRYSEPKLVQLMERKGVGRPSTYASTIKTLKE
RAYVELQGRALVPTELGLATDALLGRTLPELVDSAFTARMETGLDAIAAS
REPWEKYLIGWNTDYLVPAVARARAAIAAEFPKRPAAPRTAQAPQISRTP
CPRCRQALSKVPSKKVKRGYFLKCPACADLVMFWNPWRKIWELPKPKTDP
QTP
>gid:537601  uvrA  excinuclease ABC subunit A
MMNGNGGRDCIHVQGARQHNLKNISLKIPRNQLVVITGVSGSGKSSLAFD
TIFAEGQRRYVESLSAYARQFLGQLDKPDVDHIDGLSPAISIDQKSTSHN
PRSTVGTVTEIYDYLRLLFGRAGKPFCPVCGLPIEPQTIEQIVDQVLELP
EGTRFQLLAPVVRGKKGTHAKLLSALASEGFVRVRVDGRVHELGEKIELE
KNQSHTVEIVVDRLVRREGIAERLADSLATALERAEGTVVVELLPRDDEE
RARLVEISRHHGLHDSESEVGIEMVFSANYACPVHGSVMEELSPRMFSFN
SPYGACPSCHGLGAIQEFSPDLVVPDPAKPVSEAVLPWAESSNPYYGELL
KSLGKQLGFSPASPWHLLTRTQRQTILYGSEERVFVDAESFYNNTRGYFT
KFEGVIPGLKRRLAEATSDRVKLKLEQYIVSQPCAECKGTRLKPQVTAVK
IAGRSILEFTSVPIDTCLGMLDTLTLSERQARIAHEVLREIRARLQFLVD
VGLEYLTLDRSANTLSGGEAQRIRLATQIGSGLTGVLYVLDEPSIGLHQR
DNERLLRTLFRLRSLDNTLLVVEHDEDTMRAADHLIDIGPGAGVHGGRVV
AQGTVEDIVTCPESITGAYLSGRRRIETPAARRCGNGLSLSMRGASRNNL
RSVDVEIPLGKFVCVTGVSGSGKSTLINEILYPYLKHHFGRTSPRPVGVE
AVEGVAHLDKVIVIDQSPIGRTPRSNPATYTGAFDAIREIFSMTIEAKAR
GYEPGRFSFNVKGGRCEACKGEGVNVIAMNFLPDVYVQCDVCKGKRYNRE
TLQVKYKNKTIADVLEMTVEEACQFFENIPKAINKLETLRQVGLDYIRLG
QPAPTLSGGEAQRIKLATELSRRSTGKTLYLLDEPSTGLSFYDVHKLLDV
LVKLVDTGNTVVVIEHNLDMLRVCDWIIDLGPEGGRKGGQIVATGTPEQV
AGVEDSYTGQFLRRVLEHAPGILVGSEAAQQ
>gid:535191  uvrB  excinuclease ABC subunit B
MTDDRFVVSAPYRPTGDQPRAIAQLSAGALGGVTFQTLLGATGTGKTFTI
ANVIEKVGKPTLVLAHNKTLAAQLCNELREFFPDNAVEYFVSYYDYYQPE
AYIPQTDTYIEKSASINDEIDMLRHSATRSLFERRDVIVVASVSCIYGLG
MPEEYLRAAIPLKVGSNIDQRELLRQLVTVQYERNDIDLGRGRFRVRGDV
VEIGPAYEDRIIRVEFFGDEVEAVRWLDPVTGEVVRSVNSLNIYPAKHFV
TPEEQLEQACIAIEQELEARVAELEGENKLLEAQRIKQRTRYDLEMLREV
GYCNGVENYSRHLAARRPGEAPSCLIDYFPQDWLLVVDESHVTIPQIRGM
YNGDAQRKKVLIDHGFRLPSAADNRPLKAPEFWDKVRQAIFVSATPGDWE
VELSGGGRDPETGRMAGEHVAEQIIRPTGVLDPEVFVRPVAGQVDDLLHE
IHDRVARRERVLVTTLTKRMAEDLTEYFQERGVKVRYLHSEIQAIERIEI
LQALRQGDFDVLIGVNLLREGLDLPEVSLVAILDADKEGFLRAERSLIQT
IGRAARHVRGQVIMYADRLTASMDKAISETERRRQIQRAYNAAHGLTPQP
IVKRLDANSILDYLAVSRRLNQQELEAAAAAPAEVALADIPELVSQLEIQ
MRDAAKKLEFEKAAEYRDKIHKLRERLLGK
>gid:534770  uvrC  excinuclease ABC subunit C
MQPWFRVSLDRLEDLPDAPGVYRFVDCDGRLLYIGKSVHLRTRVRSYLRR
DGGHSRQTERLKFEAQAVEVLCTGSELAALLLEGRLIREYLPPFNQAQKR
YRQYPFLRLSVQEDYPRLHLTRVLAGDGAEYYGPYSQARFVSWMADLLSA
SLGLRTCRDFSAIYHGCLLDQLGKCLGPCRTGALVAQYRERVERLRALLR
GEDTGSGIIVDFERQMLNAAQREDFEQAARWRDRRQALANFVTHQGHLRE
RVSLDAVAVYPGAPRSPTSVQLFWIRQGKLTLQQSFAGDLPQERLRAELA
HTLAGHYSEGLPPAPLFALPQQDLDEVQMVSGWLYRHRTDDTLLWLAGIE
PERAAELLLTLIDQAHRRSS
>gid:536953  uvrC  excinuclease ABC subunit C
MPNLIADPERLKERLELLPTSAGVYLMRDEAGEILYVGKAKNLRNRVRSY
FQPGHDHSPRIAIMVGKVHDFELILTDTEAEALVLEDNLIKTHKPRYNVL
LKDDKQYPYLCITWSEEYPRIFVTRRRGSGHPEDRYFGPYTDAGALHSTL
GLLKKLFPLRQRNTPVFKDRPCINYEMGRCPGLCQRLISPQEYRATIRQI
QMILQGRTAELLAQLEDQMQTAAAAMNFEHAARLRDRITGLNQLGAHQKI
TVPDSSVSRDAVALAADAALVSIQLFQVRSGKLIGRLGFSAAASGEDPGH
ILQRVLEEHYRASASEEIPLEVLTQHPLPEADILATWLAEKKGRKVEIHA
PQRQIKAELVEMVARNAEAELQRLERFSRRQEKGLLNLAEALELPSVPRR
MECYDISHIQGTDTVASRVVFVDGAPAKQYYRHYKIRDPRIVAGRPDDFA
SMAEVISRRFARAESEPEGDLPDLVVIDGGKGQLSAARAVMEELSYGDVP
TIGLAKRLEEVFLPGRSDPVLIAQGDPALHLLQRIRDEAHRFAVSFHREQ
RGKRMTRSSLDDIPGIGPAKRKILLDTFRSVPVLEKASFEEIAKTPGIGS
RLAQVIHTYFHGEPEAVARALEAEQAAN
>gid:534315  xer  
MLSEKEVQAVLKKGFIDAKYQALFQAMLYTGCRVSEACSLRTADVYELDE
TGSVRTVGKKSRPRYVVQEVITFRRASTKGGISTRSIPVHPALRACLESF
APGGVYVFESTSPGTAMLRRTVDYAFRAAFKACRIQGASTHSLRRTAITR
LHTSGVGLRTIQRISGHRSLAALQRYIEVSDEQVSAAIQLL
>gid:533765  xer  integrase/recombinase
MSCLDDFWDYLERERRLQPATCRAYRCDLKQWFGYLQTANPLAARPEQLR
SYLVQLGERGIGHGTLRRKRSSLRVFYGYLHRQGVIAADPAAALPTETSR
VPGPPPLRPQQLEALLGAVGDDACGVCHRALILALYAGGLRSAEVSAANL
ENLDWERGLWRLPQRDVLLDPRLYAALLDYLRCGRPGLAAGIAERALFVS
RRGHRLSVRALQAYLSRYGVTAQQLRNSYAAHLLEGGTDPVDVQALLGLR
STQAAAGRQPTVAQGLRRVYDLAHPRSGLHKK
>gid:533749  ycf41  
MNAIALMGTLQSEPELRFTQDGLARLSTLLGFSAGRPEEPDFQIRMVAFG
NLAEEVARTLHHGDGVLVEGRLQAETRGRPDGTKEKVTELIARKVYPATV
GESLAGATPAPAAAPAASAKNGAPAARPAPRPAAPARPTAPEPDLDDIPF