TitleGenColors Logo

Gene list

Applied filters:

COG category: Posttranslational modification, protein turnover, chaperones
Gene type: CDS
Genomic element: chromosome

Number of genes found: 105

Free access
Sort by:

 



# Geobacillus kaustophilus HTA426, HTA426

>GK0062 GK0062, cell-division protein and general stress protein (class III heat-shock)
MNRIFRNTIFYLLIFLVVIGVVSFFNGTNQRTEPMTYDAFITHLENGDVK
SFSIKPERGVYEIKGQLKTYSEGQYFSTYVMNSDKVLDRIDAAAARTRVE
VVPADETSGWVTFFTSIIPFVIIFILFFFLLNQAQGGGSRVMNFGKSRAR
LYTDDKRKVRFRDVAGADEEKEELVEIVEFLKDPRKFAELGARIPKGVLL
VGPPGTGKTLLARAVAGEAGVPFFSISGSDFVEMFVGVGASRVRDLFETA
KKNAPCIIFIDEIDAVGRQRGAGLGGGHDEREQTLNQLLVEMDGFNGNEG
IIIIAATNRPDILDPALLRPGRFDRQITVDRPDVKGREAVLRVHARNKPL
DESVDLKTIAMRTPGFSGADLENLLNEAALVAARRNKKKIDMSDIDEATD
RVIAGPAKKSRVISEKERRIVAFHEAGHTVIGMVLADAEMVHKVTIVPRG
QAGGYAVMLPKEDRYFMTKAELMDKITGLLGGRVAEEIVFNEVSTGAHND
FQRATNIARRMVTEFGMSEKLGPLQFGQPSGQVFLGRDLHNEQNYSDKIA
YEIDLEIQRIIKECYEKAKQILTQHRDKLDLIANTLLEVETLDAEQIKHL
FEHGTLPKDRNGREESGKSDGDVKINIQKKEE
>GK0064 GK0064, chaperonin (heat shock protein 33) (HSP33)
MKMGDYLVKALAYNGQVRAYAARTTETVAEAQRRHQTWPTASAALGRALT
AGVMMGAMLKGEETLTIKIDGGGPIGVILVDSNARGEVRGYVTNPHVHFE
LNEHGKLDVARAVGKNGMLTVVKDLGLRDFFTGQVPLISGELGDDFTYYF
ASSEQIPSSVGVGVLVNPDHTIRAAGGFIIQLMPGTEENTITRIEERLKQ
IPPVSRMIERGLSPEQLLEQLLGDGGVRVLETMPVSFVCRCSRERIADAL
ISLGPEEIQDIIDKEGQAEASCHFCNETYHFDKAELEQLKQLAKKE
>GK0078 GK0078, ATP-dependent Clp protease ATPase subunit
MMFGRFTERAQKVLALAQEEAIRLGHNNIGTEHILLGLIREGEGIAAKAL
MALGLGPDKIQKEVESLIGRGSEVSHTIHYTPRAKKVIELSMDEARKLGH
SYVGTEHILLGLIREGEGVAARVLNNLGVSLNKARQQVLQLLGSNESMSG
HGGGASHVSTPTLDSLARDLTAIAREGRLDPVIGRSKEIQRVIEVLSRRT
KNNPVLIGEPGVGKTAIAEGLAQQIVNNEVPETLRDKRVMTLDMGTVVAG
TKYRGEFEDRLKKVMDEIRQAGNIILFIDELHTLIGAGGAEGAIDASNIL
KPALARGELQCIGATTLDEYRKYIEKDAALERRFQPIYVDEPTVEESIQI
LKGLRDRYEAHHRVSISDEAIVQAVKLSDRYITDRFLPDKAIDLIDEACS
KVRLRSFTAPPKLKELEQKLEEVRKEKDAAVQSQEFEKAASLRDMEQRLR
EELEETKRAWKEKQGQENLEVTVEDIAAVVSSWTGIPVSKLAETETERLL
KLEEILHSRVVGQDEAVKAVAKAVRRARAGLKDPKRPIGSFIFLGPTGVG
KTELARALAEAMFGDEDALIRIDMSEYMEKHSTSRLIGSPPGYVGYEEGG
QLTEKVRRKPYSVVLLDEMEKAHPDVFNILLQVLEDGRLTDSKGRTVDFR
NTIIIMTSNVGADALKRNKYVGFNIQDGNQQYKDMKSKVMDELKKAFRPE
FLNRIDEIIVFHSLEKDHLKQIVRLMADTLVKRLKEQDIDLELTEAAIEK
IAAEGFDPEYGARPLRRALQKHVEDRLSEELLKGTIAKGQKVVVDVKDGE
FVVLSKQAVV
>GK0079 GK0079, DNA repair protein
MAKKKTKFVCQECGYESAKWLGRCPGCQTWNSFVEEIEQEKPTVRGAFLH
SERAGSAKPVPITAVEAVQEPRMKTNSAELDRVLGGGIVKGSLVLIGGDP
GIGKSTLLLQTSAQLAAAGHQVLYVSGEESVKQVKLRAGRLRAECDQLYV
LAEADLEYIVAAVETVQPACVIIDSIQTVYRTDISSAPGSVAQVRECTAE
LMKLAKTKGIAIFIVGHVTKEGAIAGPRLLEHMVDTVLYFEGERHHTYRI
LRAVKNRFGSTNEIGIFEMRDIGLKEVENPSEVFLEERSRGAAGSTVVAA
MEGTRPVLVEIQALVSPTSFGNPRRMATGLDHNRVSLLMAVLEKRVGLLL
QNQDAYLKVAGGVKLDEPAIDLAVAVSIASSFRDRPTNPADVIIGEVGLT
GEVRRVSRIEQRVQEAVKLGFSRIIVPKNNLAGWQPPKGVQVIGVSHVAE
ALEHTMC
>GK0210 GK0210, intracellular alkaline serine protease
MFGYSIVQLARRHAGRLDRPLRHALVNMYKPMTRMPCIFHRWIERWLRKW
RTLSVLIEVENAEGMEAVAKAERDHFRMKVRHHFRHVPFYSARVTPAALE
QLLEHPKVKKVYFNRTVKALLNNAVPSANAKRVAVNGTELSGKGVTIAIV
DTGIYPHPDLEGRIAAFVDFVNGRTTPYDDNGHGTHCAGDAAGNGRMSDG
LYAGPAYEANLIGVKVLDRSGSGTLETIMRGIEWCIDYNERHPSKRIDII
SLSLGGEPQPFPIENDDPLVQVAEQAWEQGIVVCAAAGNEGPNYGTISSP
GISDRIITVGALDDHDTATTRADDDVASFSSRGPTEYGVTKPDLVVPGVN
IISLRAPRSFLDKMNKQSRVGDHYISMSGTSMATPICAGIVALMLQAKPN
ATPDEIKRALKDGADLWKGRDPNVYGAGYVNGKRAIEQLLQR
>GK0237 GK0237, glycoprotein endopeptidase
MKVLGIDTSNMPLGIALVDGDVVKGEFITNVKKDHSSRAMPAIELLLRRC
DVAPKDLDLIVVAKGPGSYTGVRIGVTIAKTLAWSLGIPIAGVSSLEVLA
ANGRYFPGVIVPLFDARRGQIYTGLYRYEGGALRCLEEDRIVAADEWARR
LSEREEDVLFIGADAPLYRDLFRSHLGERVHVAPPSLALPRPSELVMLGK
QKERENAHVFVPNYLRLAEAEAKWLAKQKGEQNDGDQCSISANDHS
>GK0239 GK0239, glycoprotein endopeptidase
MGEAEMNEDVYVLGIETSCDETAAAVVKNGREVLSNVVASQMESHRRFGG
VVPEIASRHHVEQITLVIEEAMQQAGVSFASLDAVAVTAGPGLVGALLVG
VNAAKALAFAHGLPLIGVHHIAGHIYANQLVAEMKFPLLALVVSGGHTEL
VFMKEHGNFAVIGETRDDAAGEAYDKVARALGLPYPGGPHIDRLAHEGEP
VIDLPRAWLEEGSYDFSFSGLKSAVLNALHNAKQRGEEIDPRQMAASFQA
SVVDVLVTKTVQAAKEYRVRQVLLAGGVAANRGLRAALQDKMKELPDVEL
VIPPLSLCTDNAAMIAVAGTVLYQQGKRADLALNANPSLPLV
>GK0248 GK0248, chaperonin (GroES protein)
MKPLGDRIVIEVVETEEKTASGIVLPDTAKEKPQEGRVVAVGAGRVLDNG
QRIAPEVEVGDRIIFSKYAGTEVKYDGKEYLILRESDILAVIR
>GK0249 GK0249, chaperonin (GroEL protein)
MAKQIKFSEEARRAMLRGVDKLADAVKVTLGPKGRNVVLEKKFGSPLITN
DGVTIAKEIELEDPFENMGAKLVAEVASKTNDIAGDGTTTATVLAQAMIR
EGLKNVAAGANPMGIRRGIEKAVAVAVEELKAISKPIKGKESIAQVAAIS
AADEEVGQLIAEAMERVGNDGVITLEESKGFTTELDVVEGMQFDRGYVSP
YMITDTEKMEAVLENPYILITDKKVSSIQELLPVLEQVVQQGRPLLIIAE
DVEGEALATLVVNKLRGTFNAVAVKAPGFGDRRKAMLEDIAILTGGEVIS
EELGRELKSTTIASLGRAAKVVVTKETTTIVEGAGDSERIKARINQIRAQ
LEETTSEFDREKLQERLAKLAGGVAVIKVGAATETELKERKLRIEDALNS
TRAAVEEGIVAGGGTALMNIYNKVAAIEAEGDEATGVKIVLRAIEEPVRQ
IAQNAGLEGSIIVERLKNEKPGIGFNAATGEWVDMIEAGIVDPTKVTRSA
LQNAASVAAMVLTTEAVVADKPEENKGNNNMPDMGGMM
>GK0256 GK0256, small heat shock protein
MTNENPFMPFSDWTKHWQQFFQNDFWGSIQPLLPSSNSNKPSSGMNIYKK
DNELLVVISLPGLEKIEDAEVYVSYKTLEVKATINLNFKGFELVEEGLFQ
GQFQKTIPLPFAVKEDRIEATYHNGLLFIHLHRLIPDEPKKKIVIKKSE
>GK0332 GK0332, serine protease Do
MDMMPFEPTPTPQPKRRGRFVSWLAASVAGAVIGSAATWYVAPKWIYGET
HSQTEAAETTAKSEVLPLQPTANVKTNMIDAINKVADAVVGVVNIQKQVD
FFSDQAQDTEAGTGSGVIFKKEGNVAYIVTNNHVIEGANKVEVALPNGKK
VNAEIVGADALTDLAVLKIPAEGVTNVASFGDSSKVKIGEPVAAIGNPLG
LDLSRTVTEGIVSGKRTMPVSTSAGDWEIDVIQTDAAINPGNSGGALINS
AGQVIGINSMKIAETGVEGLGFAIPSENVKPIVEQLMKDGKIKRPYLGVQ
LVDVADLSDEVRADELKLPSNVTYGAAITSVEPFSPAADAGLKSKDVIVA
INGDKIDSVSALRKYLYTKTSVGDRIKLTIYRDGFETTVSVTLKARDSSQ
S
>GK0381 GK0381, hypothetical conserved protein
MESYHAVLEAIAAAPGDGVLAIIISVEGSAYQKEGTCMWIGANGETVGLL
SGGCLEEAAAAYAREVLANRQAAAVSFDLRTEDDWSWGQGAGCNGLVHIW
LEPVTSDAKLDWLALKRCLDRGEHVLMVRSIMYPDERPFFQSETGEQFGG
CRRTPHREWREWLHKTPFYQGRSGLYEVDGETFYVQHFWPKPRLVVFGAG
PDAPPLVSAAKAAGFSVTVSDWRPAFCHPSHVSDADEWVVGFPHETVPML
HLNERDFVIVMTHQFERDRELVSLLADKPLAYLGVLGPRRRTDRLFPSGS
APPFVRSPVGLSIGARSPHEIAISITAELISVLRRHGAEAAQ
>GK0383 GK0383, hypothetical conserved protein
MREQKAWRIDGFIGIGCVALFVLAGFVSLIQAQLLLAVFCFALAAFLATG
ITIVQPNQAKVIIFFGRYFGTIRDSGLFFTVPLTVRKKVSLRVRNFTSKK
LKVNDVQGNPIEIAAVVVFRVIDSAKAVFDVDDYEQFVEIQSEAAIRHVA
TKYPYDTFEDDNEITLRGNADVISDELAQELQERLRIAGVDVMEARLTHL
AYSPEIAGAMLQRQQAAAILAARKKIVEGAVSMARMAIEQLDKENVLELD
DERKAAMVNNLMVAIVSERAAQPVINTGSLY
>GK0477 GK0477, bacterioferritin comigratory protein
MTIAIGQPAPDFTLPASNGKMVSLSDFRGRYVVLYFYPKDMTPGCTAEAC
DFRDRHAQFAELNAVILGVSTDPMNRHEAFIEKYELPFLLLSDERHEVAE
LYGVWKKKKNFGKEYMGIDRSTFIIAPDGTLVQEWRGVKVKGHVDEALAE
VARLASSR
>GK0510 GK0510, hypothetical protein
MFYLWHHRKVLTIDELANRLNREPKAVYEKLKQLLQKGGISDAG
>GK0530 GK0530, serine protease (phage related-protein, ClpP family)
MKTEQKNKFWEIKMSADGSNSADIFIYGDIVSYQWDETDTSAASFKKDLD
AVGDVDTINLYINSPGGNVFEGVAIHNMLKRHKAKINVYVDALAASIASV
IAMAGDTIHMPKNAMLMIHNPWTWTYGNAAELRKVADDLDRIGNSIKQTY
LQKAGDKLTEEKLQEMLDAETWLSADEAYEHGLCDVVLEASQIAASISDE
LFSKYKNVPKQLKNQVKNCQKTVISAEEMAKRRQIAEESKANLAYINTIL
GGMLG
>GK0567 GK0567, hypothetical conserved protein
MKKLLAFGGIIVVLFAAIAFITMYEQKEAASNNPYHKSELNPATIAQLDD
PNYRNIILPAELKQQLADGKTLTVYFYSPTCPHCQRTTPIVVPLAKELGI
DLKLFNLLEFEDGWDAYHIEATPTIVHYENGKEIKRIEGYHDEQTFRNWF
SSLHSDK
>GK0568 GK0568, protein-disulfide oxidoreductase
MEREKRAENLLLAAWATALIATLGSLYFSEVLGFIPCDLCWFQRIFMYPQ
VVILGIAIVRKDAAAARYSFTLSLIGGGISLYHYGLQKIPLLQEYAISCG
RIPCTGQYINWLGFITIPFLAFTAFMIIMLLSWTVMRQQRKESER
>GK0656 GK0656, post-translocation molecular chaperone
MKKWMMAAAVVSLMALSACSNDGSEAIVETKNGNITKDEFYNEMKERVGK
SVLRDLIDEKVLSKKYKVTDEEIDREIERIKEAYGTQYDLAVQQNGEKVI
REMVKLDLLRTKAAVEDIKVTEKELKEYYDNYKPKIRASHILVKDEKTAK
EVKAKLDKGEDFSKLAKEYSQDPGSASNGGDLGWFGPGKMVKEFEEAAYK
LKVGEVSDPVKTDYGYHIIKVTDKEKKKSFNEMKDEIAFEVKRNKLDPAT
MQSKVDKLVKDAGVEIKDKDLQDVIEQQGKQ
>GK0718 GK0718, hypothetical protein
MKSIKPGRGPSMQGLFGSIAAVLFGIFWMVMAFSITADSPFPAARFFPFF
GLVVIAIGVFQAFYHYKNATGKQRMSLLDIVDSEEEPDPLNVRFGSHKQP
NKHCPHCGGHVQHNFQFCPQCGKELLR
>GK0755 GK0755, hypothetical protein
MILDNRGLEPPQPMMRTLAALAKLNQGETLTIINDRRPMFLYEQLDELGY
RHETVAREDGSFEIRITKG
>GK0799 GK0799, ATP-dependent Clp protease ATP-binding subunit
MDASRLTEKLQEALMAAQSLAKERHHQQLDVEHLLLALLEQEDGLAPRLF
ALCGADRAQAIRWLQDRIRQKPEVHGAGEGQMYVAPALARLLEGAENEAK
RMQDEYISVEHVLLALSHGAEPVAQQLASFGLTEEALVEAVRKVRGNQRV
TSPHPEATYEALTKYGRDLVAEAKAGKIDPVIGRDSEIRRVIRILSRKTK
NNPVLIGEPGVGKTAIVEGLAQRIVRKDVPDGLKDKTIFALDMSALVAGA
KFRGEFEERLRAVLNEIKKSEGRIILFIDELHTIVGAGRAEGAIDAGNML
KPMLARGELRCIGATTLDEYRQYIEKDPALERRFQQVLVQEPTVEDTISI
LRGLKERYEVHHGVKIHDRALVAAAVLSDRYISDRFLPDKAIDLVDEACA
TIRTEMESMPSELDEVMRRVMQLEIEEAALSKETDEASRERLAALQKELA
DLREKANAMKAQWQKEKEALDRVRRLREALERAKRELEEAENEYDLNKAA
ELRHGRIPQLEKQLKQLEQEISEQSEGKLLREEVTEEEIAEIVSRWTGIP
LTRLVEGEREKLLRLHELLHRRVIGQDEAVELVADAILRARAGMKDPNRP
IGSFLFLGPTGVGKTELAKALAEALFDSEEQLIRLDMSEYMEKHAVSRLI
GAPPGYVGYEEGGQLTEAVRRKPYSVLLFDEIEKAHSDVFNILLQLLDDG
RLTDSHGRTVDFKNTVVIMTSNIGSPLLLEHKDDDIDEQTRSQVFDQLRA
HFRPEFLNRIDDIVLFKPLSMNEVKGIIEKFARELSARLADRHIELVLTE
AAKQYIAEAGFDPVYGARPLKRFMQKQIETPLAKELIAGRVKDYSTVTVD
AENGRLVIRPSA
>GK0806 GK0806, hypothetical conserved protein
MGLLPTDRWLEEDEGDPLRLCARLAPLFGKMPAREIYSYLRLYGMYRSTR
QAERLLPEMKERRLWERLGRLYERQRGIWKGPDVPVFLLPADDENRKLVR
EFQGKGGVAFADKLFFFLLPDHGDEEIAALVAHEYNHVCRLKQLPNEGED
ATLLDAVVMEGLAEHAVAETVGVKQCAGWTKYYTDREMERFWRRYIVPNQ
HLPVSHPHTSRILYGLGWHPKMGGYAVGYAIIRRCLERGYSLAQLMKMEA
KDIAELAGFSGKQQGAS
>GK0819 GK0819, negative regulator of genetic competence
MEIERINEHTVKFYISYVDIEERGFDREEIWYNRERSEELFWEMMDEVHG
ESDFSLDGPLWIQVQALEKGLEVVVTKAQLSKDGSKLELPLPEEKWREFS
LSAGEKVESLFGHHFHFGQNGDAEEREEQDVLQFILSFKDIEDVISLAHR
ADFSRLSNRLFQFEGRYYLFVEFDERYTEQEVDNMLSLLLEYGSDSQLTI
YRLEEYGTEIMGDNALETIKTYFPAL
>GK0859 GK0859, thioredoxin
MKRIETVEQFEAVISGDKPTIIKFYTTWCPDCVRLNMFIDDIVRDYRQYD
WFEIDRDQFPELGEKYQVLGIPSLLVFRNGKKIAHLHSADAKTPEEVRAF
LQSLPE
>GK0898 GK0898, hypothetical conserved protein
MYSFLSQISNLLSQPFLNMANSTTALPVLSAFLLGIVGAMAPCQLTSNLG
AITLYSNQSLQKGIAWKELLLFIFGKVIAFSGLGLIVWLMGKEIQSTLTL
YFPWLRKLIGPILVLVGLYLLGLFKMYWNVTLFKVPERWVKGKIGSFFMG
FGFSLAFCPTMFVLFFVTLMPLVYSTSYGVLLPSVFAVGTSVPVIFFILI
LWYLGLSGAVMKKGRRVGKLIQQTAGIVMVLLGVFDTITYWF
>GK0899 GK0899, cytochrome c biogenesis protein
MTQLALGVMMVSIAVASIFNNQQEEGKIKPKPVVSTSLDSGKIGTNKGEV
APDFELLSITGDKIKLSDLRGKTVILNFWATWCPPCRAEMPEMQKFYENN
KDSNVEILAVNLTNSERGSNAVSDFVEAKGITFKVVLDEQGDIGNLYGAI
TIPTSYIIDKNGVIRNKYVGPMSYETMDRMISGIQ
>GK0900 GK0900, cytochrome c biogenesis protein (holocytochrome-c synthase)
MNDINIFLAFGAGLLSFISPCCLPLYPAFLSYITGVSVDEIKKENGMLQK
RAILHTFFFLLGFSVIFIAIGFGTSVIGKLFVDYQDLIRQISALFIVFFG
LVILGVFSPSFMMKDKRLVFRNKPSGYFGSILIGIGFAAGWTPCTGPILV
SVIALAATKPSAAMLYMFAYVLGFAVPFFIMSFFIGKLNWIKKYNTVIVR
VGGILMVIMGIMLFFDWMTKIIIFFTDLFGGFTGF
>GK0968 GK0968, ATP-dependent Clp protease
MRCQACQQREATVFVNLQWNGEKQQLHLCHECYEKQKQQLSIPMMNFGFS
PFSFDDFFTSPFTAANAGMSSPETMAKRPQSHSGGFLDQFGRNLTQMAKA
GLIDPVIGRDKEIARVMEILNRRNKNNPVLIGEPGVGKTAIVEGLALKIA
EGQVPEKLLNKEVYLLDVASLVANTGIRGQFEERMKRLIAELQERKNIIL
FIDEIHLLVGAGAAEGSMDAGNILKPALARGELQVVGATTLKEYRQIEKD
AALERRFQPVIVHEPTVEEAIAILKGIQPKYEQFHHVRYTDEAIEACVKL
SHRYIQDRFLPDKAIDLLDEAGSKANLRLGPTDEKQLQERLIQIAKEKEQ
AAKEENYELAAKLRAEELKLEKQLEQGVTQERPTVDVADIEQIIAEKTGI
PVGKLQADEKEKMKHLEENLAKKVIGQAEAVKKVAKAIRRSRAGLKAKHR
PIGSFLFVGPTGVGKTELAKTLAEELFGTKDAMIRLDMSEYMEKHSVSKL
IGSPPGYVGFEEAGQLTEKVRRNPYTIILLDEIEKAHPDVQHIFLQILED
GRLTDSQGRTVSFKDTVIIATSNAGVTDKKITVGFEKEGNGQTSVLDSLG
AYFKPEFLNRFDAIIEFKPLEKAHLLEIVDLMLADVKAAMRGTRH
>GK0977 GK0977, coenzyme PQQ synthesis
MARTIPVLEIFGPTIQGEGMVIGQKTMFVRTAGCDYRCRWCDSAFTWDGS
AKDEIEQLTADEIWRRLEAIGGRRFRHVTISGGNPLLIAALGELIALLHE
KGMRVAVETQGSRWQDWLLDIDDVTLSPKPPSSGMDTDWAALDQIIERLQ
ADQSRVRRISLKVVVFDDADLAYAKEVHCRYPSVPFYLQAGNADVADSDV
DALRAKLFSRLEWLVEQAADSEELADVHILPQLHTLLWGNKRGV
>GK1279 GK1279, hypothetical conserved protein
MEKERYFQAETEGEQPETKAEEATAAITQLGQTNVPQMDPDTNIHCLTIV
GQIEGHIQLPPQNKATKYEHVIPQIVAIEQNPKIEGLLVILNTVGGDVEA
GLAIAEMLASLSKPTVSVVLGGGHSIGVPIAVSCNYSFITETATMTIHPI
RLTGLVIGVPQTFEYLDKMQERVVRFVTKHSKISEEKFKELMFSKGNLTR
DIGTNVVGPDAVRYGLIDEVGGVSQAMAKLRELIAMRKNGEGKMVQ
>GK1339 GK1339, cytochrome c biogenesis protein
MNDLNLFLAFGAGFLSFISPCCLPLYPAFLSYITGVSVSELKEENAMLNR
RSLWHTLCFLLGFSLIFISLGFGTSLLGRLFVDYQDAIRQIGAVVIVALG
LVVAGLWKPAFLLKDRRISFRERPSGYVGSVLIGMAFAAGWTPCMGPILV
AVVALAATNPGSGMMYMLAYTLGFAVPFFMLSFFIGKLQWIRRHSASIMK
AGGYVMVAMGVVLYFDWMTKLIAYTTSLFGGFTGF
>GK1341 GK1341, hypothetical conserved protein
MAFITSLIAVVMAFFIFFVRMKASEKPTNAKKIILPPLFMSTGALMFIDP
VFRVTRGELIEAVILGLFFSLFLIKTSKFEIRGSDIYLKRSKSFIWILLG
LIVIRLGLKTYLGRTIDYRQLSGMFYLLAFSMIVPWRIAMYVSYRKLAAQ
LKPPVLT
>GK1517 GK1517, subtilisin-type proteinase
MKRRYRIWAAASAVVALLVAAAVLGRPAPEENAPAPPRLKPFDAEVAPTH
PTVVALDSLSAGEQLKQQLNKHPRIREILHNRRGDRSHYFANEIVVRFRA
LPSESRLQQMEAAIAGQFIKQVDHVFVFRSREQTYEEMRRYFRSLPTVDY
CEPHYIYMQNEWNKPAPVPNDSFYARYQWNLPAIHTEDGWTLSRGKRNVP
VAIIDSGVDLTHPDLTRRLLPGYNVLADDRSPNDENGHGTHVAGIIASQP
NNGEGVAGMTWFNPIMAVKALNADGYGTSIDVAKGIRWAVDHGAKVINLS
LGNYQPSSVLEEAIRYADAHDVVLVAASGNDSTSQASFPAAYPEVISVGA
VNPDLSYALYSNYGDYVDVVAPGTNIASTFAGHRYAALSGTSMAAPHVTA
LAALIRSVNPRLSNDEVRDIILESADDLGERGKDPYYGYGLINVYRALEL
AKQ
>GK1558 GK1558, hypothetical conserved protein
MRKWLVVLLFLAVAGYGLWNVMAADKPNEANGAGPEVDQTAPDLTLPALG
GQSVKLSALRGKAVVLNFWTSWCPPCKKEMPELAKFYERHGREVALLAVH
LTTQDTLDNAERFAKANRLVFPVGLDVRGEALRQYRIQTIPTTYIIDPNG
VIRRKIVGPVTAKRLEQETALFR
>GK1635 GK1635, hypothetical conserved protein
MEERVLGMPAVESNMFPLGKQAPPFALTNVIDGNVVRLEDVKSDAATVIM
FICNHCPFVKHVQHELVRLANDYMPKGVSFVAINSNDAEQYPEDSPENMK
KVAEELGYPFPYLYDETQEVAKAYDAACTPDFYIFDRDLKCVYRGQLDDS
RPNNGIPVTGESIRAALDALLEGRPVPEKQKPSIGCSIKWKPSA
>GK1665 GK1665, hypothetical conserved protein
MPKLTFSSDVKWSGEGVRSVADINGKQVIIDEPPALGGTDQGPNPVELVL
AALGGCINVLISLFASHHGVELKGVQVHVEGDLDPDGFMEKADVRPGFLE
IRYHIDIDSPSDPKNVQALIEHVERVCPVKDTLRGVPTVSIANQKTS
>GK1669 GK1669, cytochrome aa3 controlling protein
MLGGMGMKSLAFWTLCTTYVLIVFGGYVASSNSGMGCGPDWPLCNGMLIP
MLKGATLIEYAHRLIGALLVLLTGVLCLRLWRSSRSGTDRFISAAAAVLL
ALQIMLGALVVWFDLPPIIVTIHFFIAFLFLGCLLWIWRNVHRSPQMHQP
VGQTTIEAEAKRHIHVLLFLLAAALLLGAYVKHQHYGLACGWLICGEQAW
PSSEAQMWQTAHRTAAATTVLYTIWTAFVAHRRRWGRPLERRLLLAATVG
LLQAVIGIVTVATEIHLSWAVIHLAVGTWLALLLVELKIYLHRPGVTRLS
ARGRMRKYTQTPAR
>GK1707 GK1707, hypothetical conserved protein
MDPLKTLLGKHVKMEISGGTTVQGVLVDFGLDIVVIYTGKEYLYVPHLHI
HYIKRHTDPSFEIASPFPEVPLQEESGISYRKTLLNAKGRFVEIYVIGNK
SIHGYITNVFNDYFVFYSPVYKAMLISLNHLKWLTPYQKNITPYTLGNDV
LPVQPTPLSLQRSLEEQLKKMEGQLVIFDLGDHPMKIGLLKRVQNNIAEL
VTAGGDSIYWKVIHLKTIHSPS
>GK1765 GK1765, hypothetical conserved protein
MKTTVVWNGNMSFSGQSASGVVIPIDAAKDVGGNDSGARPMELLLHALAG
CTGIDIVLILRKMRLDVRAFSMEVEGTRADDHPKRFTEIHIHYALEGDLP
EDKVVRAIRLSKEKYCSVSHSLNAAITASYSINGVRGKETI
>GK1784 GK1784, hypothetical conserved protein
MRLRIKAIEPTPSPNTMKVLLDEELPFGTSHNYKPDNVAAAPPIIQALMR
IEGVKGIYHVADFLAVERHPKYDWRDILTKVREVFGEDVGEAEEEKPTVN
EHFGEVKVFVQMLYGLPMQVKLVDGEREHRVGLPKPFMDAVIEAQKYAGN
VVLERKWVEKGVRYGTFEEIGREIVDELSAAYPPERLERIVNMFRRGEQE
KTVQKRPSLKVTSEMLDDPDWRKRYAALEQMAEPTEDDIPVLAKALKDEK
MAIRRLATAYLGMIGGKKVLPYLYEALKDPAVAVRRTAGDCLSDIGDPEA
IPAMIEALKDESKLVRWRAAMFLYEVGDESALPALKAAENDPEFEVSLQV
KMAIERIEGGEEAKGSVWKQMTESRKKENETEQ
>GK1785 GK1785, glutathione peroxidase
MSVYEFSVKTIRGEEQPLSAYRGKVLLIVNTASRCGFTPQYKELQELYDE
YRDRGFVVLGFPCNQFGGQEPGTEAEIEQFCQLNYGVTFPLFAKVDVNGD
HAHPLFQYLKEEAPGALGTKAIKWNFTKFLVDRHGRVVARFAPQTKPSEL
KEDIEKLL
>GK1863 GK1863, hypothetical conserved protein
MQPLYQTSVKAQGGRNGKVVSNDGVLSLDVKMPKELGGPGGGTNPEQLFA
AGYAACFDSALNLVIRQRKVKVEGTEVTAQVTLGKDEADGGFQLAVVLRV
RVLGVDRKTAEELVHAAHQVCPYSKAVRGNIDVTLSIEE
>GK1960 GK1960, hypothetical conserved protein
MFMKETREFLESLGYPGGDCYDLPTSAKRFPDGAQYRVEIPSVEGPRALE
AVLDEADRLGITIHRVSQGSGIMLLTDEEISDMCEMCASRGIELSLFVGP
RGTWDISALPLTPSGKAAGLRHEGMDQLVYAIEDLKRAAQLGVRGALVAD
EGLLLLTKEMKHRGILPQNFVIKVSVQMMASNPVSIKLMEQLGADTYNVP
TALTLPKLAAIRQTVNLPLDVYVEVPDGFGGFIRHYEIPEIIRILAPVYI
KFGLRNHPDVYPSGKHLESVNIELCRERVRRAALGMRIVEQYYPEAVTSK
LGAEGLGVPVTGVQNAKV
>GK2066 GK2066, hypothetical conserved protein
MIKVDMTVDAKGLSCPMPIVRTKKAINELQPGQVLEVQATDKGSKADIKA
WAESTGHQYLGTIEENGVLKHYIRKSAEHETRKETTFPHVVSNEELQEKL
HDPDSFVLDVREPAEYAFGHIPGAVSIPLGELENRMAELPKDKTIYVVCR
TGTRSDLAAQKLAEKGFDRVRNVIPGMSQWNGPLDSAQS
>GK2070 GK2070, hypothetical conserved protein
MNVAKVLDAKGLACPMPVVRAKKAMDELQSGQVLEVHTTDKGAKNDLPAW
AKASGHTVLDMKEDNGVLMFWIQKG
>GK2075 GK2075, glutaredoxin
MAQVTVYTTTTCPYCVMAKNFLRAQGIPFKEVNVEFDPEAARRLVETTGQ
MGVPQIEIGGRWVLGYDPDAIMALWNQTNHR
>GK2079 GK2079, thiol:disulfide interchange protein (thioredoxin)
MVLVLLILTGWAIYDTLSKNMASSVKDGGKSEGQVAEGIEVGNRAPDFVL
RTLNGEEVRLSDFRGKRVIVNIWATWCPPCRAEMPDMQKFYEQYKDERVE
IVAVNLTQSERQPEHVARFIQEYGITFTVVLDEKGEVSRQYEAQAIPTSY
LIDSKGIIRKKMIGPMSYDWMVDQMESIQ
>GK2080 GK2080, cytochrome c biogenesis protein
MSVIFAFVAGALSFFSPCIFPLVPAYVAHLTGTPIRENRIRVGKREILMR
SLAFIGGFSVIFVLMGATASAIGQMLVQYREFIEKLAGLFIIIFGLQMAG
ILSFRLLMKEKRWDTSTSKAKGWLSSIFVGMAFAAGWTPCVGLTLSSILL
LASSTETLYSGMVLLLVYSLGLGVPFLLLSLAMTKSLQIVKNVNRWLPLL
SKVNGWLLVVLGVLLYTGQMAKISALLSSYSIFSF
>GK2146 GK2146, heat shock protein
MALIPYDPFRHLESIRRDMNRFFASDFPSLFTHMDEQHWMPRIDMHETAN
EYVVSCDLPGLERKEDVHIDVQNNMLTISGTIQRHHDVREEQMHRRERFF
GRFQRSITLPADAATENIRATYKNGVLDIHIPKTTTGTKKRVDIEFH
>GK2159 GK2159, heat shock protein class I (low molecular weight)
MNESFQPPANGGGHHPFHHLRKMMNQWFDERPLQKLFETLDDYFAQTFAA
AYIPIEVKETKHDYQIIVRLPDIKREQISLQWQEDGLQLIVDHNETIESA
DHHGHVYERRHARRRVARLIPFPYPVAEHEVKASFQNGTLTIQLPQKRKY
IDID
>GK2234 GK2234, thioredoxin reductase
MKEEKIIIVGGGPCGLAAAIALQDAGFSPLVIEKGNIVHSIYRFPTHQTF
FSTSDRLEIGGVPFITENRKPTRNQALAYYREVVVRKQVRVNTFEEVKTV
KPQEDGAFLIETTKGTYRAQYVVIATGYYDHPNYMNVPGEELPKVMHYFK
EAHPYFNTDCVVIGGKNSSVDAAMELVKAGARVTVLYRGNEYSKSIKPWI
LPEFDSLVKKGVIRMEFRAHVKAITEDAVVYEVDGETKTIKNDFVFAMTG
YHPDHRFLMNIGVQIDPESGRPHYDSETMETNVPGVFIAGVIAAGNDANE
IFIENGRFHGDAIAACIAKREREGSAKQLQ
>GK2236 GK2236, negative regulator of competence
MRLERLTHNKIKIFLTFDDLLDRGLTKDDLWKDTFKVHQLFRDMIEEASE
ELGFEVNGSIAVEVYSLPAQGMVVIVTNEGDYDDMEEEFADDYIEMQVTL
DESDDIFYEFQTFEDVIQLAHRLHAVGCLDGTLYSYQGRFYLHVPEEPPI
PLDNFVALLAEFGNPATITIHRVQEYGKRLIERRAIEQLVRYFRAN
>GK2280 GK2280, cytochrome c biogenesis protein
MVQLSSTLLYIAFVLYLIGTFFFGGAIRDKRKERQEKDRWSQLGISVTII
GFLAQVGYFVTRWIAAGHAPVSNLFEFTTFFGMMLVAAFIIIYFIYRLSV
LGLFALPVALLVIAYASMFPREISPLIPALQSDWLHIHVTTAAAGEAILA
ISFVAGLIYLIRVVDQSKPSKRTFWLEVVMFCLITTLGFVIVSTAFGLSG
YEAKFTWVDKNKQTVEVIYDMPALVGPHKGELMKESAGRMEPLVEMPAII
NARKLNTVIWSLIAGTALYIMLRLLLRKRVAAALKPLVKNVNLDLTDEIS
YRAVAIGFPVFTLGALIFAMIWAQIAWTRFWGWDPKEVWALITWLFYAAF
LHLRLSKGWHGEKSAWLAVIGFAIIMFNLVAVNLVIAGLHSYAGT
>GK2282 GK2282, thioredoxin (cytochrome c biogenesis)
MKKQQRLVMRTAILLVLLAAIGYTIYTNFFTEKTAVAVGSTAPDFVLTDL
KGHEHRLSDYRGKGVFLNFWGTWCKPCEREMPYMNELYPIYKKQGVEILA
VNVGEPKLSVEKFAERFGLTFPIVIDRQDQVLNAYNVGPLPTTFLIDKNG
EVKQIITGTMTKEDIERHLESIKP
>GK2298 GK2298, peptidyl-prolyl cis-trans isomerase (rotamase)
MAKKGYILMENGGKIEFELFPNEAPVTVANFEKLANEGFYNGLTFHRVIP
GFVSQGGCPRGNGTGDAGYTIPCETDNNPHRHVTGAMSMAHRGRDTGSCQ
FFIVHEPQPHLDGVHTVFGQVTSGMDVVRTMKNGDVMKEVKVFDEP
>GK2503 GK2503, chaperone protein (heat shock protein)
MAKRDYYEILGVSKNATKDEIKKAYRKLSKQYHPDVNKAPDAAEKFKEIK
EAYEVLSDDEKRARYDRFGHADPNETFGGGGFQGGGFDFGGFSGFGGFED
IFETFFGAGPRRRASGPRKGADVEYMMTLTFEEAAFGKETEIEIPREETC
DTCQGSGAKPGTSPTSCPHCHGSGQVTSEQATPFGRIVNRRTCPVCGGTG
RYIPEKCPTCGGTGRVKRRKKIHVKIPAGVDDGQQLRVAGQGEPGVNGGP
PGDLYIIFRVEPHEFFKRDGDDIYCEVPLSFAQAALGDEIEVPTLHGHVK
LKIPAGTQTGTRFRLKGKGVPNVRGYGQGDQHVIVRVVTPTKLTEKQKQL
LREFERLGGDTMHDGPHGRFFEKVKKAFKGEA
>GK2504 GK2504, chaperone protein (heat shock protein 70) (HSP70)
MSKIIGIDLGTTNSCVAVLEGGEVKVIPNPEGNRTTPSVVAFKNGERLVG
EVAKRQAITNPNTIISIKRHMGTDYKVEIEGKQYTPQEISAIILQYLKSY
AEDYLGEPVTRAVITVPAYFNDAQRQATKDAGRIAGLEVERIINEPTAAA
LAYGLDKEEDQTILVYDLGGGTFDVSILELGDGVFEVKATAGDNHLGGDD
FDQVIIDYLVNQFKQEHGIDLSKDKMALQRLKDAAEKAKKELSGVTQTQI
SLPFISANENGPLHLEMTLTRAKFEELSAHLVERTMGPVRQALQDAGLTP
ADIDKVILVGGSTRIPAVQEAIKRELGKEPHKGVNPDEVVAIGAAIQGGV
IAGEVKDVVLLDVTPLSLGIETMGGVFTKLIERNTTIPTSKSQVFTTAAD
NQTTVDIHVLQGERPMAADNKSLGRFQLTGIPPAPRGVPQIEVTFDIDAN
GIVHVRAKDLGTNKEQSITIKSSSGLSEEEIQRMIKEAEENAEADRKRKE
AAELRNEADQLIFMTDKTLKEVEGKVSADEIKKAQDAKEALKAALEKNDI
DDIRKKKDALQEAVQQLSIKLYEQAAKQAQSAGSQGGAANHKDNVVDAEF
EEVNDDK
>GK2505 GK2505, chaperone protein (heat shock protein) (HSP-70 cofactor)
MEQGEKQVMEQATYDEPEREQPIEEEAAPQPEEESGGVPLEEAGGEEAAE
PAEKAPTAEELAAAKAQIAELEAKLSEMEHRYLRLYADFENFRRRTRQEM
EAAEKYRAQSLASDLLPVLDNFERALKIETDNEQAKSILQGMEMVYRSLV
DALKKEGVEAIEAVGKPFDPYLHQAVMQAEAEGYEPNTVVEELQKGYKLK
DRVLRPAMVKVSQ
>GK2549 GK2549, protease
MLLKNDRISEIIDGKRVIVKKPELLAPAGNLEKLKIAVHYGADAVFIGGQ
EYSLRANADNFTIEEIREGVEFANRYGAKVYVTANIYAHNENIPGLDDYL
RALEDAGVCGIIVADPLIIETARRVAPKLEVHLSTQQSLTNWKAVQFWKE
EGLERVVLAREVGAEEIRQIKEKVDIEIEAFIHGAMCSAYSGRCVLSNHM
TARDSNRGGCCQSCRWDYDLYQLSDGREIPLFEKGDAPFAMSAKDLNLIR
AIPVMIELGVDSLKIEGRMKSIHYVATVVSVYRKVIDAYCADPDHFTIRE
EWVRELEKCANRETAPSFFDGFPDYTNHMYGTHSRKTTHEFAGLVLGYDP
ETGIATVQQRNHFRPGDEVEFFGPEIENFTQVIEKIWDEDGNELDAARHP
LQIVKFKVKRPLFPYNMMRKEN
>GK2550 GK2550, protease
MKKPELLVTPTSVAHIDELAEAGADAVIIGEQRYGLRLAGEFSRHDVAAA
VKAAHRRGMNVYVAMNAIFHNDKVDELGDYVAFLADVGADAIVFGDPAVL
LTVRETAPHMKLHWSTETTATNWYACNYWGRKGAKRAVLARELNMDAILE
IKAHAEVEIEVQVHGAMCMYQSKRSLIGSYFEYQGKVMEVERKKYEKGMF
LYDKERDNKYPIFEDENGTHIMSPNDVCMIDELGDMVEAGIDSFKIDGIL
HEPRYITEVTKLYRRAIDLCADDRERYEREKEELLAAVEALQPPHRRIDT
GFFFKETIY
>GK2574 GK2574, NADH-peroxiredoxin reductase
MLLDADIKAQLAQYLQLLENDIVLTVSAGDDNVSRDMLALIDELTAMSSK
IKVEKAKLERTPSFSVNRVGENTGITFAGVPLGHEFTSLVLALLQVSGRP
PKVSQDVVDRIQQIRGKHHFETYVSLTCHNCPDVVQALNIMSVLNPDISH
TMIDGAAFKEEAEQKGIMAVPTVFLNGKLFASGRMSLEDILAKLGSAPDA
SSFADKEPFDVLVIGGGPAGATAAIYAARKGIRTGIVAERFGGQILDTLG
IENFISVKYTEGPKLAASIEEHVKQYNVDIMNSQRAKRLEKKDLIEVELE
NGAVLKSKTVVIATGARWRNLGVPGEEEFKNKGVAYCPHCDGPLFEGKHV
AVIGGGNSGVEAAIDLAGIASHVTLLEFAPELKADAVLQDRLYSLPNVTV
IKNAQTTEITGTDKVNGLTYIDRETGEEHHIELQGVFVQIGLVPNTEWLE
GTVERNRFGEIIVDKRGATNIEGVFAAGDCTDSAYKQIIISMGSGATAAL
SAFDYLIRH
>GK2575 GK2575, peroxiredoxin
MSLVGKKVQPFRAQAYHNGEFIEVTEQDFMGKWSIVCFYPADFTFVCPTE
LEDLQDHYATFKELGVEVYSVSTDTHFTHKAWHDTSPAISKIEYVMIGDP
SHQLSRMFDVLDEEQGLAQRGTFIIDPDGVIQAVEINADGIGRNASTLID
KIKAAQYVRNHPGEVCPAKWKEGAETLKPSLDLVGKI
>GK2593 GK2593, bypass-of-forespore protein C (forespore regulator of the sigma-K checkpoint)
MRILSLILWIAAIVLPVHSASAAPVKMTIVLERQYLDGEMSEEKVTETVD
SMTEIWKKYRGWQLVTLDDQTIVFRKTINDISPLLKTNGYFGITDDGTLS
IFNGKPGRSSEIIQSFFQIDVQKLESRQQEKLKKGIRVLSKERYEQVIEM
YRHFAVVQ
>GK2620 GK2620, hypothetical protein
MDKQRMRIMVNINGKPRPFTVERSSEGRPSEESDGYPVDSHLGKGQRAEN
EADDSRQVSAFQEGGPSAGKPSTGLWFTEDGQVREDVETTGWEEAAALEA
DERVISVSEAMQQKKSARRRTWRLPAQAKSLLAAALVAALVGMAFGMTVL
RIIPKEKLASSPASEAMTELTKPETAAGEEGRESVVRAPFSIAVIQAGVY
SNAAAAKQASESIKTAGVPVVVAGQKPAALYIAVGADKETLRAVNDQYRQ
RVPSTYVKELSFAAEAAARRSEAIQKGEVLYENMAAVSAALLGGRKAGED
DWQALQKAYDALEKSNASSDKTVGRYVEALKQAYVALAAYKERRDEALLA
KAQQQLLEALAAYIELVPLGS
>GK2625 GK2625, signal peptidase (late competence protein)
MILFLIFLYSLLLASFFNVVGLRVPVGESIIRPRSHCPACGRTLSAGELI
PVVSYVVQRGRCKGCGGRISPLYPLMELTTTALLTAAPMWIGWGGRLIVA
WTLISLLAIIVVSDLRYMLIPDRVLLVFAGLFLMERLVIPFLPWVDMLLG
AAVGFSLLWLIAVLSNGGMGGGDVKLFAVLGFVLGWKMVLLAFFLATLYG
TIIGLIGMALGRVRRGKPMPFAPAIALGSLTALFFGDQLVDAYMDLFV
>GK2646 GK2646, negative effector of the concentration of HemA
MVALLYELSILLYIASILFYFIDFLQQNRKANDIAFWLLSIVWLLQTVTF
VSRIMETKRFPILTMSEGLYFYAWLLITLSLVINRLLRVDFIVFFTNVLA
FFILAIHTFAPSSQSPAVAERLVSELLIIHITLSLGAYAAFTLSFLFSAL
YMLQYSLLKKKKWGARLWRMADLSRLDFLSYVLNALGLPMLLLGLILGVI
WAYIQIDHFHWYDAKVLGSFVVLLVYGVYFYKRAVRQMQGKAIALWNIGS
FLFLLVNFFLFGSLSKFHFWYS
>GK2650 GK2650, ATP-dependent Lon protease
MNGKKKETVVPLLPLRGLLVFPTMVLHLDVGREKSVKALEQAMVEDHMIL
LTSQKDVAIDEPDMDDLYKMGTIARVKQLLKLPNGTFRVLVEGVARALIT
EVISEEPYFLVKVEKFADRAAKDLEDEALKRTMLEYFEQYINLSKRLSVD
IYASIVDIDEPGRMADIIASHLPLKLEEKQRILETIDVKERLNKIIQILH
NEKEVLQLEKKISARVKQSMERTQKEYYLREQMKAIQKELGEKEGKTSEV
EELKEKIEAAGMPEHVKQTALKELDRYEKIPATSAESAVIRNYLDWLIAL
PWSKETEDIHDIKRAEAILNEEHYGLDKVKERVLEFLSVKQLTKSLKGPI
LCLAGPPGVGKTSLARSIAKALGRRFVRVSLGGVRDESEIRGHRRTYVGA
MPGRIIQGMKKAGTINPVFLLDEIDKMSSDFRGDPSAAMLEVLDPEQNHT
FSDHYIEEPYDLSKVMFIATANHLAAIPQPLLDRMEVIHIPGYTEVEKLH
IAKRHLLPKQITEHGLKKAALQIRDDAMLDIIRHYTREAGVRELERQLAA
ICRKAARLIVSGEKKRVVITENNLEEFLGKRKYRYGRAEAEDQVGVATGL
AYTAFGGDTLAIEVSLAPGNGKLVLTGKLGDVMKESAQAAFSYVRSRAEE
LDIDPKFHEKYDIHIHVPEGAVPKDGPSAGITIATALISALTGKPVSRFV
GMTGEITLRGRVLAIGGLKEKTLSAHRAGLKTVILPKDNEKDLADIPETV
KRDLRFVLVSHLDEVLPHALVGWKR
>GK2651 GK2651, ATP-dependent protease
MDWTNIVLVIQLFFGVIIGLYFWNLLRGQRVQKVSIDKESRKEMEQLRKL
RSISLTEPLAEKVRPKSFDDIVGQEDGIKALKAALCGPNPQHVIIYGPPG
VGKTAAARLVLEEAKKNPLSPFKKNAVFVELDATTARFDERGIADPLIGS
VHDPIYQGAGAMGQAGIPQPKQGAVTNAHGGVLFIDEIGELHPIQMNKLL
KVLEDRKVFFESAYYSKENPQIPSHIHDIFQNGLPADFRLVGATTRTPNE
IPPAIRSRCLEVFFRELDQDEIALIAKKAAEKIRLNVSESGIRLLAAYAR
NGREAVNMMQIAAGLAITENRENILDKDIEWVIQSSQMTPRYEKKIASAP
AVGVVNGLAVYGPNTGALLEIEVTALPAKGKGLINVTGIVEEESIGSAEK
SVRRKSMARGSAENVITVLRAMGVPADRYDIHVNFPGGVPVDGPSAGVAI
AVGIYSAIYQLPVDHTVAMTGEISIRGCVKPVGGVFAKIKAAKQAGAKKV
IIPIENMQSLLGEVSGIQIIAVRRLEEVLVHVFGEEALRRGPALLPAAMD
RSEKKLV
>GK2652 GK2652, ATP-dependent Clp protease ATP-binding subunit (class III heat-shock protein)
MFKFNDEKGQLKCSFCGKTQDQVRKLVAGPGVYICDECIELCTEIVEEEL
GNEEEFEFKDVPKPLEIREILDEYVIGQDEAKKSLAVAVYNHYKRINSGS
KIDDVELSKSNILMIGPTGSGKTLLAQTLARILNVPFAIADATSLTEAGY
VGEDVENILLKLIQAADYDVERAEKGIIYIDEIDKIARKSENPSITRDVS
GEGVQQALLKILEGTIASVPPQGGRKHPHQEFIQIDTTNILFICGGAFDG
IEPIIKRRLGKKVIGFGAEMNQTDVDEKNLLSKVLPEDLLKFGLIPEFIG
RLPVITTLEPLDEQALIDILTKPKNAIVKQYQKMLELDGVELEFEEAALR
EIAKKAIERKTGARGLRSIIEGIMLDVMFELPSREDVQKCIITLDTVRGT
KPPTLIRHDGTVIELERKTSA
>GK2653 GK2653, trigger factor (prolyl isomerase)
MSVKWEKLEGNEGVLTVEVDAEKVNKGLDAAFKKVVKNITLPGFRKGKVP
RVLFEKRFGVEALYQDALDILLPEAYAKAVEEAGIEPVSMPEIDIEQMEK
GKSLIFKAKVTVKPEVKLGQYKGLEVEKMDTTVTDEDVENELKRLQEDYA
ELVVKEDGTVENGDTVVIDFEGFVDGEPFEGGKAENYSLEIGSGTFIPGF
EEQLVGMKAGEEKEIQVTFPDEYHAKQLAGKPATFKVKVHEVKAKQLPAL
DDEFAKDVDEEVETLDELKAKIRARLEEAKKNEAETALRNAVVEKAAANA
EMDIPEVMIKNETDRMLREFDQRLQMQGLNLQLYYQFSGQDEASLREQMK
EDAEKRVRAALTLEAIAKAENIEVTDEEVNKELEKMAEAYKLSVDKLKEL
LGSLDGVKEDLKWRKTVDFLVEHSKVAA
>GK2685 GK2685, thioredoxin (TRX)
MAIVNATDQTFAAETKDGLTLVDFWAPWCGPCRMIAPVLEELDREMGDKV
KIVKVNVDENQETASKFGVMSIPTLLVFKNGELVDKAIGYQPKEALVQLV
GKHVS
>GK2710 GK2710, peptide methionine sulfoxide reductases
MEKATFAGGCFWCMVMPFEELDGIYGIVSGYTGGHVENPTYEQVKTGTTG
HYEAVQITFDPDVFPYERLLELYWCQIDPTDDGGQFHDRGPQYRTAIFYH
NEKQRQLAEQSKRALEESGRFSKPIVTKILPATTFYPAEEYHQNYHKKNP
EHYKQDRAASGRDEFIAKHWGTKR
>GK2787 GK2787, thiol peroxidase (superoxide-inducible protein 8)
MAYVTFKGQPVTLVGNEVKVGDKAPDFTVLDQNLQEVTLADTKGHVRLIS
VVPSLDTGVCDAQTRRFNEEAAKLDNVKVLTISVDLPFAQKRWCGAAGIE
NVQVLSDHRDVSFGQAYGVLIKELRLLARAVFVIDSNDTVTYVEYVPEVT
NHPNYEAAIEAAKAAK
>GK2791 GK2791, protease IV (signal peptide peptidase)
MNRKRWTALAIAAALFVISVLVQAVGVLLSDKAGGWSESWLALMESPFSE
EVIAEGDPLKKIAVLEVNGVIQDAGEAESLLSSSQYNHQTFLQMIKQAKE
DQDVKAIILRINSPGGGVAESAEIYDQLMKLKKKTNKPIYVSMGAMAASG
GYYIAAAGDKIFASPETITGSIGVIMQSVNYEGLAKKYGVELVTIKSGPY
KDIMNPARKMTEAERKILQRLINDSYDAFVDVVAKGRKLPEDTVRKLADG
RIYDGRQAKALRLVDEFGYLDDAIAALKKEHHLTGAQVVKYVNDAPWSSL
FEMVSNRMKPDSEAAGLIRLLSRPSSPRLMYLYAE
>GK2867 GK2867, hypothetical protein
MKRMPWKVMATSAAASLLLASGCSAVGEKENGRSAEAPRLHVPVADDHFD
TAGNMFAYSEFELSGEPLAEGLGLDLDTLDARKPDEPTKFDYTAGIESYE
YSEEAMYEVTEKSGLGLHLINGPIAKQRAEQRHKPADEALADRFYELADS
VGYPREEIFRNMFPTFIEYAGGDPHYAQKVDTDVYAENDDGTYVPVYQVD
FQSLRWDRGKMDKVLTPSAYGGVFLKQALWAGDFLGNFHQKDSDEEVEAK
TPNDDQSGNVALGVSSADGMQGMILTEQIWNKLAFIRDHLFYDAKQQALT
KAAGSRYDPSGGFVYLPHAVEVAERGSELAPDAAKLVVKDPRSLLEDQWL
MLWPAAEFFGMTDQRPENKNQNPAFLAVFDGKPFPKAAKENVDADPANDR
VADDPYSVNRDVLLQVFRNIDAMHFNEKAGAFVTEHDGRTQGNRVDTFQA
GYTMEALRIFERAIDGMPVGYASGESAKGLGTPEGKRALELIRRQADFIM
HNLMRKDGLVANGYTIGQGPDQDEPTLLAQLGAIRGLTAAFLATKDEAYR
DAARLVYEAMDRHFWDQKWHVYHTGETEDKYTPWLAGALSGVFRVALQNL
NNDQADETAKSLDRETIISRYVDFYDRIVDGPTLQEGMQASEFWDTGDVY
IKGKKLGNTDRDHVPQVQAAGGPYGVAPVLRTVKVNVGSGNK
>GK2951 GK2951, hypothetical conserved protein
MTKTLTMNDCPYCEGHGYVQLLLGGSETCYSCQGTGSDHEEDDDH
>GK2954 GK2954, thioredoxin reductase
MSVKEDRNVYDVTIIGGGPTGMFAAFYGGLRQMKVKIIESLPQLGGQLAA
LYPEKYIYDVAGFPKVRAQELVNQLKEQMDLFSPTVCLNESVDTLEKQED
GTFKLVTNQQIHYSKTVIITAGNGAFQPRRLEIESASRYEGKNLHYFIND
LGQFSGKRVLVCGGGDSAVDWSLMLEPIAQSVTIVHRRDKFRAHEHSVEQ
LKKSSVQVKTPFVPVELVGDEHAIRQVILEHVKEGTKETIDVDAVIVNYG
FISSLGPIKNWGLDIEKNAIKVNSRMETNIPGVYAAGDICTYDGKIKLIA
CGFGEAPIAISSAKTYIDPTARMQPAHSTSLF
>GK2961 GK2961, nitrogen fixation protein (NifU protein)
MEMTDQEIKEQVQEVLDKLRPFLLRDGGDCELVDVEDGVVKLRLLGACGS
CPSSTITLKAGIERALFEEVPGVVEVEQVF
>GK2991 GK2991, hypothetical conserved protein
MAKKAPEIGEYKYGFVDKDVSVFRAQRGLTREVVEEISRMKNEPQWMLEF
RLKALDIFYSKPMPQWGGDLSSLDFDEITYYVKPTEKSGRSWDEVPAEIK
ATFDKLGIPEAEQKYLAGVSAQYESEVVYHNMKEDLEKLGVIFKDTDSAL
KENEDLFREYFAKVVPPTDNKFAALNSAVWSGGSFIYVPKGVKVDTPLQA
YFRINSENMGQFERTLIIVDEGAHVHYVEGCTAPIYTTNSLHSAVVEIIV
KKGAYCRYTTIQNWANNVFNLVTKRAVCEENATMEWIDGNIGSKLTMKYP
AVILKGEGARGLTLSIAIAGKGQHQDAGAKMIHLAPNTSSTIVSKSISKQ
GGKVTYRGMVHFGRKASGSRSNIECDTLILDNQSTSDTIPYNEILNDNVS
LEHEAKVSKVSEEQLFYLMSRGISEQEATEMIVMGFIEPFTRELPMEYAV
EMNRLIKFEMEGSIG
>GK2994 GK2994, hypothetical conserved protein
MATETKIPFDETYIRTFSSGRGEPDWLTARRLEALRLAERLPLPKPEKTK
IDKWNFTEFARHTVDSAPYDGLDDLPEAVKALIEAGEGTKNLYVQRNHTP
AYVSLSDELKEKGVIFTDIFTAAREHGDLLKNYLMTAVKPDEHRLAALHA
ALLNGGVFVYVPKNVEIETPLQAVYIQDEDDIALFNHVIIVAEDNSRVVF
VENYISASREGKAVVNVAAEVFAQANASVFFAAVDHLAKGTTTYVNRRGI
AGRDGRIEWALGLMNDGNTVSENITRLVGDGSFGDTKTVAVSRGEQVQNF
TTSVVHYGRHTEGYILKHGVVRDSATSIFNGIGKIEHGASKSNADQESRV
LMLSEKARGDANPILLIDEDDVMAGHAASVGRVDPIQLYYLMSRGIPRRD
AERLIIHGFLAPVVEAIPLEGVKNQLIEVIERKVQS
>GK2995 GK2995, ABC transporter (ATP-binding protein)
MAVLTIRNLHVAVEGKEILKGVDLEVKGGEIHAIMGPNGTGKSTLASAIM
GHPKYEVTEGSVTLDGQDVLEMEVDERARAGLFLAMQYPSEISGVTNADF
LRAAINARLGEGNEISLMKFIRKLDEKMAFLEMNPDMAHRYLNEGFSGGE
KKRNEILQLMMLEPKIAILDEIDSGLDIDALKIVAKGVNEMRSSEFGCLI
ITHYQRLLNYITPDYVHVMMQGRIVKSGGPELAQRLEAEGYDWIKKELGI
EDETVGQEA
>GK3001 GK3001, thioredoxin
MKAIDRNDVMRVVEVEPLLALYLYTPLCGTCQLARRMLTVVEQLFPALPF
YETDINYIPEQAVAWKIESVPCLLLFRDGTVAGKWYAFHSVPYLYEVIQA
CLPSR
>GK3020 GK3020, hypothetical conserved protein
MYDIAIIGAGPAGASAAIFAAKAGKKTVLFDSDKGMTKRAWVENHYGVPE
ISGPELVETGKRQAAKFGAELVETQVTDVQKTDGGFRLETENGSFEAKHV
IFATGVATDLAEKIGLRTKPGTEPRIKTVIDVDANGKTNIDGIWAAGTVA
GVSVHTIITAGDGAKVAINVISELNGERYVDHDVLKK
>GK3043 GK3043, ssrA RNA (tmRNA)-binding protein
MPKGEGKVIAQNKKAHHDYFIEETYEAGLVLQGTEIKSIRNGRVNLKDSF
AKVEKGEVFLHNMHISPYEQGNRYNHDPLRTRKLLLHRREINKLIGYTKE
QGYTLVPLKLYIKNGFAKVELGVAKGKKKYDKREDMKRREAQREIERAFR
ERQKL
>GK3060 GK3060, hypothetical conserved protein
MHIRLYTKTNCPLCDKAKAVLTELLADYSFTLEEIDIYKDDELLEKYQLM
IPVVELDGEEIGYGAIEKEAVRKRLRRAQNS
>GK3062 GK3062, ATP-dependent Clp protease proteolytic subunit (class III heat-shock protein)
MYLIPTVIEQTNRGERAYDIYSRLLKDRIVFLGSPIDDQVANSIVSQLLF
LAAEDPDKDISLYINSPGGSITAGLAIYDTMQFIKPDVSTICIGMAASMG
AFLLAAGAKGKRFALPNSEIMIHQPLGGAQGQATEIEIAAKRILFLRDKL
NRILSENTGQPIEVIERDTDRDNFMTAQKAMEYGIIDRVLTRADEK
>GK3068 GK3068, thioredoxin reductase
MADEKIYDVIIAGAGPAGLTAAVYTSRANLSTLMIERGVPGGQMVNTEEV
ENYPGFETILGPELAAKMFEHAKKFGAEYAYGDVKEIIDGEAYKTVIVGD
KEYKARAVIIATGAEYKKLGVPGEAELGGRGVSYCAVCDGAFFKGKDLVV
VGGGDSAVEEGVYLTRFANKVTIVHRRDKLRAQKILQDRAFANEKIDFIW
NHTVKQINEKDGKVGSVTLVHTQTGEEREFPCDGVFIYIGMVPLSKPFAN
LGITNENGYIVTNEKMETKVPGIFAAGDVREKTLRQIVTATGDGSIAAQS
AQHYVEELKEKLNKQGVS
>GK3088 GK3088, hypothetical conserved protein
MATWWLEWLEGVASVWRQPLLYYGAALALAVGWRRVKRERRDFHVRVYSL
WQEWRGLWTKGWMAGAMLSAAAVGIGIAVPQEAVWTVTVLTVLFSLTMEA
RLLSPAYTVGGALLVLGLAERSSALSRWLPDGMAAAPALAVWLALLLLSE
GWLIIRTRNEAASPQLAKSKRGMTVGLQWTQRLWFVPVVLPVSGGALPPA
SWWPLVSTGDSYSFWLVPFLLGFSQRRQHMLPAEAARAEGRLVVRLAFVV
ALLAASGIWYPPLAVAAGAMAIIGREWIAFSGHRADRARPPRFARHPHGL
VIVGVLPGSKAEKMGLSIGEVIIKTNGAPVRTETEFYEALQRNRAFCKLD
VVGHNGEVRFVQGALYEDEHHELGLLFVRDRNASASEAVS
>GK3143 GK3143, flagellar biosynthesis
MDFLTEEWIYQKNSQQLTALLYEGLMECLEEAIAALEQKDYWKANKQLQK
GNDILRRLGVGLRYDAGIIAHQLDALYNYMAERLIEANMKKDVKIVREVL
QLTTTIATAWNEALKSGASPAQTALKQKTAAYEQFIAYEKS
>GK3281 GK3281, hypothetical conserved protein
MPGGISHHAKDILFKSLSALYQNQALDVYGLHGLPRIKALLPNEFPSVRA
DERRADTVFLLEDDSILLLEYESNERFLDNHLKYLDYACRILHTYYQQEK
RIRPIRIVVIYTSDVTTARERLDAGDVFLSSKAVLLGEFNGDAIFHAIEE
KVHNGEPLTPEETMKLILVPLMHTRFDRQTMIEKTIELAKAIGDEPKQLH
IIAGVLTATDKFIDRSYAEKVKEWIKMNKVFRLLVEELEQEKEEMLKKVM
QEKEQAVQRAIQEKEQAVQRVIQEKEQAVQRAIQEKEQAVQRVIQEKEQA
VQRVMQEKEQAIKQTEKRKAIEIAKNLLDVLPIHEIAKRTGLTVAEVADL
AKEMDK
>GK3288 GK3288, hypothetical conserved protein
MTVKILSIGLRGLEGYRVQVEVQEVPGIAAMVIVGLPDASVKEAKERVLA
SLYAFGCDFFDKRLVVHLSPPERKKHSPMFDLAMAIGILKAMGKLTAPVS
PDAAFLGSLSLDGAIQPVDGMLPAILAAKKLGFQKVYLPYDPALPLHHLK
DLDCIFVQTLEEAVQYLQGQRVLSLPPAFRTPEIIDSPRKHHRDFQSIIG
HHQAKRALEIAAAGGHHVLMVGPPGCGKSLLAETFPTILPSLTHDAQLEV
ISLYQLAGEKIESGHPPFRHPYHSASSVSLIGGGTHPKPGEVSLAHRGVL
FLDEMAEFAKKTLDMLRQPLETGKVTISRISSTVTYPADFILLGAMNPCP
CGYLGSRTRYCTCSPKQIQAYRNRVSGPIYDRMDVLLSLEVIDFTKETRV
SESSETIRKRVEEARRKQYERYGKEITNGRVSFELLMEKSPLANRQQLLL
QQWASQHQWSNRVQTKIIRLARTISDLKGTEKIADESLWEAMTLRWMKTY
TKQQATAR
>GK3403 GK3403, hypothetical conserved protein
MKMGVKLIKISVVYFVIGVCMGLGMSMTHSFALTPVHVHINLLGWTALTL
AGIIYHLFPQAAATKWAKVHFWLHNIGLPVMMIGLIFVVYGNEAFVPVTA
IGGVLVVIGVILFAINVLKNVRASS
>GK3469 GK3469, serine protease Do
MGYYDDHYEPYEQTRRKRRSGSFVSALVGAVLGGLLVLMSIPALSRWDIL
PYDVVPNQRAEEEPKTEENGTPPIRQSVSVDVTTAVTKAIDQVSDAVVGV
VNIQEASFWSQGGEAGVGSGVIYKKAGGRAFIVTNHHVVENASQLEVSLK
DGTRVPAKLLGSDVLMDLAVLEIDAKHVKKVAQFGNSDTVKPGEPVIAIG
NPLGLQFAGSVTQGIISGTNRTVEVDLDQDGAPDWNAEVLQTDAAINPGN
SGGALVNIKGQVIGINSMKIAQEAVEGIGFAIPINTAIPIISDLEKYGQV
RRPYMGVELRSLSDIPSYHLQATLHLPPNVTEGAAVIQVVPMSPAAQAGL
KQFDVIVALDGEKIRNVLDLRKYLYTKKSIGDRMEVTFYRDGKKRTVTMK
LARESY
>GK1213 clpQ, proteasome Clp protease subunit
MGAFHATTIFAIRHNGASAMAGDGQVTFGNAVVMKHTAKKVRRLFQGNVL
AGFAGSVADAFTLFEMFEGKLEQWNGNLPRAAVELAKEWRSDKVLRRLEA
MLIVMDKQHLLLVSGTGEVIEPDDGMLAIGSGGQYALAAGRALKKYAGGS
MTAKEIAKAALEIAADICVYTNGHIIVEEL
>GK1214 clpY, ATP-dependent Clp protease ATPase subunit
MMAETLTPRQIVEKLDQFIVGQKEAKKAVAIALRNRYRRSLLDEKLRDEV
MPKNILMIGPTGVGKTEIARRLAKLVGAPFIKVEATKFTEVGYVGRDVES
MVRDLVETSVRLVKERKMNEVKDRAEQQANKRLVELLVPGKPKQTIKNPL
ELLFGGQGAQADNSYSHEDEQVEQKRRQVAWQLANGQLENEMVTIEIEEQ
TPLWFDFLQGAGIEQMGMNMQDALSSLMPKRRKKRRLKVSEARKVLINEE
AQKLIDMDEVTQEAVRLAEQSGIIFIDEIDKIARSGAVSGSADVSREGVQ
RDILPIVEGSTVMTKYGPVKTDHILFIAAGAFHMAKPSDLIPELQGRFPI
RVELAKLSVDDFVRILVEPNNALIKQYQALLATEGISLEFSDDAIRKIAE
VAFEVNQTTDNIGARRLHTILEKLLEDLLFEAPDIGIDKVVITPQYVEQK
LGSIVKNKDLSEFIL
>GK1080 ctaA, heme O oxygenase
MQRSLKWFASATTLAMLFVLIGGALVTKTGSGMGCGRSWPLCNGQWVPDH
ITPELIIELSHRLVSGLAGIMVLILSIWAWRAIGHVQETKFLAVISFVFL
VLQGLIGAAAVVWGQSDFVLALHFGISLISFAAVLLLTLLIFEIDKTFSA
ASLSLDGKMRFHIYGITIYSYIVVYTGALVRHTNASLACPSWPLCAKTRL
LPVQFHEWVQMGHRLAAAVIIIWIAAAAIHAVRHYRRQPVIYYGWLIALL
LVLAQMTTGALVVFTQLNLYIALAHAFFISCLFGVLSYLLLLALRTRRAP
VKAADHSAGEAAPATLK
>GK1081 ctaB, heme O synthetase
MAELKAVHQDAADAGHRSHVSVKAVWRELSSVVKIGIVNSNLITTFAGMW
LAFYFTGEHFLENLHLVFFTLFGAALVIAGSCAINNYIDRDIDQYMERTK
ARPTVTGTMDPRRVLWLGIGLVAIGEMGLLMTTVTAAVVGLIGMATYVFL
YTLWTKRHYTINTVVGSISGAVPPVIGWTAVDPEFHIVPLILFLIMFLWQ
PPHFLALAMKRCEEYRAAGIPMLPVVHGFAMTKRQIIVWVACLLPLPFYL
FSLGVPFLIVATLLNVGWLLLGLAGLKMKDDIKWAKWMFVYSLNYLTILF
VAMIIATLW
>GK3113 fliS, flagellar protein
MATNNPYQHYQANAVQTASPGELTLMLYNGCLKFIKLARQAMEKGDIAAR
NENLIKAQNIILELMKTLKMEYEVAKSMMTMYDYIYRRLVEANVKNDAAI
LDEVEGYVIEFRDTWKQVIQLNRQRQYAEGGQA
>GK0686 gerPE, spore germination protein
MKRTSVVQTFHAETLIISSVLQIGDSERISARTRAFAVQRQYELFFGPEG
EQIFPVFAKPIPRWTSPPPVAARQTLHESPVISVQSVRVLAISSSAIVHI
GSTSTAEAEARIKHIRQLAGSESSAPTRGRDL
>GK1314 spoVK, spore formation protein
MSELTMNKAKGQINIVLNSKTVNHLVKEERNDWLDGEEHQALRNIQKELD
QLIGLDHVKKIIKEIYAWLYINRLRKENGLKANRQALHMIFKGNPGTGKT
TVARLLGKLFFEMNVLSKGHFIEAERADLVGEYIGHTASKTRDLIKKARG
GILFIDEAYSLARGGEKDFGKEAIDTLVKGMEDYCDDLVVILAGYPKEMD
YFLSLNPGLPSRFPLTIEFPDYTVEELVQIAKQMLREREYEMTPEAERKL
YIHLEGTLEATGRLKFSNGRYVRNLIEKAIRKQAVRLLHEGRYDKKELMT
IRDRDLVIHA
>GK1926 ureD, urease accessory protein
MSWTGRLRCTAVVKNGRTVILDNYSEGALKLMRPVYLDPAHPTLYLVHVG
GGYVDGDSYDMEIFLEPSARLMVTTQSAAKIYKTPSTPVRQYTRLSLGEQ
SVLEYFPDPTIAYEHARFYQETTVYITPSSTFVYGEIITPGWSESGELFR
YDWIRSKLKVYQDEALVLFDHLYLDSRQHLTSMLQLGGYTHVGSFVALSP
FITKEVLEQFNQFMEEMPQEVRCGFSAAAVPGFSVRILAYETSVIEAIFQ
RVQQFIRQQCGEKAPVCWRKY
>GK1929 ureE, urease accessory protein
MIIETIIGNIQTLPSLPPHIERVYMASDDLVKRIQRVVTDHGRELGIRLK
EAKELADGDVLWMDDHNAIVVSVLPEDLLVIKPVSLKQMGEIAHQIGNRH
LPAQFEEGEMLVQYDYLVEELLQQLAIPYKREKRKVKQAFRHIGHRHD
>GK1928 ureF, urease accessory protein
MTDQQLLWLLQLSDSNFPSGAFSHSFGFETYMYNEQICDAKTFREALVVY
IQTQLTYTDGLACRIAYEQLEANSMEGLQRLNETLFALCLAKETREGTRM
IGERLWKLCRDIYGVDELDEIVQTTRSIHPAIVFAAVGRKIGAAKQTTVL
TYLFASVQTMVQNAVRGIPLGQTDGQKLLVMAQPYLIHAASIIETLDEEE
LGAAAVGLEIAQMQHERLPVRLFMS
>GK1927 ureG, urease accessory protein
MEPVRIGIGGPVGAGKTMLVEKLTRAMHRRFSIAVITNDIYTKEDAQFLI
KHSVLPEDRIIGVETGGCPHTAIREDASMNFAAIDELKRRHPDVELILIE
SGGDNLAATFSPELVDFSIYVIDVAQGEKIPRKGGQGMIKSDLLVINKID
LAPYVGASLEVMERDAKAARGAKPVIFTNLKEEIGLFDVVDWIEKQVLLA
GLEE