GenColors

Gene list

Applied filters:

COG category: Function unknown
Organism: Rhodopseudomonas palustris CGA009, CGA009
Gene type: CDS

Number of genes found: 348

# Rhodopseudomonas palustris CGA009, CGA009

>RPA1953 possible FusB/FusC Fusaric Acid resistance pump
MSLPDEQQWLFAFKTFAAAMLAMAIGLWLDLPRPYWAVATVYITSHPLSG
TTRSKAVFRVIGTLIGACVAVAIVPNLAGAPPLLVLAIALWSALCIYVSV
LDRSPRSYVAMLAGYTTALIGFPTVDTPNQIFDIALARTEEIVLGITCAA
VVSSVVFPRSVGPLAAQRVKAWFKHAHASVRDAVSLNKSPAAEAHRLQVA
ADSADIENLVTFLSYDTNLDQSATRWIKQLQPRMLLLLPVMSSVADRLDE
LQPLGGPSAPVAALVERTRVWMEGDEPGDGAVRDALLADVQAEIDQRPHR
GAWRDLLELGLLMRLRDLLNIQSDCIALAAATTGNATTLDAPLHYPIEQH
VAVVKHQDHGMALLAAFVTLFTVITCCTFWIVCAWPAGGTAAMMATIGTS
LFAARDDPYPSIVGLTKWTIVAAFAAAAYLLMILPAVHNFESLALVLAPA
FILYGVLMGAPATYPIGLGLAVNTASLIALQEMYTFDTASFLNSTVAVVT
GTGLAALVTAVLRPVGAEWSALRLINANRATLAEAADISSENQRASVAGL
MIDRMMLLAPRAAAAGHTMPEALRELRAGFNILDLHRARRELRRGARRRL
ELLLIRLERHYASASSAPPAGLLRTLDRALAAVRDEPSATARRAVLGLVG
LRRCLFPAAKPPQLTPQEAA
>RPA0048 conserved hypothetical protein
MPIAVWFRRLRGPVAFASAALVALYVGGTVPATAQSAVQGVPNAMQGFSQ
NRDKPIQIESDTLEVRDKNKEATFTGNVKVVQGDTTMTSQTLVVFYDQDQ
APAKGAKGKPMPAAAPGPDGASSIRRLEAKGHVVVTQKDQVVTGDKAVFE
TKTNLVTMTGNVVLTQARNVVRGDRLLVDMTTGVSRVESDSGRVQGLFQS
SGANGGGPLPLGPPTSSGKPK
>RPA3779 conserved hypothetical protein
MNRPTIKHGTSRRFGTPALLTLATLLAVAAPAPDADAKRARPAATTEATA
PREAGEPIMAIVSIKGQRVTLYDSEGWIYRAPVSTGTTGRETPAGVFAVV
EKDKDHRSTMYDDAWMPNMQRITWNGVALHGGPLPGYPASHGCVRMPYEF
AEKLFDKTRIGMRVIVSPEDVEPADISHPVLFSPSAEALAAAPTRAETAV
REAEQAAQAADEAKTAAAAAARAVKPLKDSLRKLERAKARAEAALKAADK
VLVAAATDEAKAKAEERQQQAAQQLGEATTQLETAKADADAKHAAAAATK
EAAKATAAKKAETAKLATDAKLAQEPVSIYISRATQKLYVRRNTRKPLPD
GGELFDFSIEVPVTILDPERPIGTHIFTAMARNDAGLRWSAVTIESADNA
KSALDRVTIPPEVLERIGPTALPRSSIIISDEPLSAETNYRTEFVAVLSD
QPQGGFITRKPTSSDVPVASSDDWNDGGFGFFFQPREQRVPAQSGRGRYG
EGYYRQPQDYYRQEQPGWW
>RPA3545 conserved hypothetical protein
MVKKTSPNTVQFSLIRLVDCARMTRGVRLYLVGSFVLVSLAGCGRGLFQT
AEREPWRAEAEIACLKSGVVREGPDLVRIDPISGPGVCGAEFPLKVAALG
ETGAIGFADDLRPPGAIGGASQSQPRWPGGQPQPNYATPQRGYAEPPARA
PNYGAQPQTGYGAPQGGYGKAPVSLNAPGVGPAQDDIELPPEGEPSAERP
PAENVTGYPRGAAPQGGYPGEAERPLPRLGPGQQGGITGSVGPVAIKPTA
TLACPIVSALDRWLAESVQPSAMRWFGVRVVEIKQISAYSCRGMNGNPNA
HISEHAFGNALDIAAFVLADGRRITVKGGWRGLPEEQAFLHDVQNSACQM
FTTVLAPGSNVYHYDHIHVDLMRRRSQRTICKPAAVSGEVIAQRLQQRNP
YAGSASPGPGWNGVTGSIGRNASRHKVDRDEAEDD
>RPA1324 possible FusB/FusC fusaric acid resistance pump
MSVFDHSGREARRFWHGLRQAVLSRLQALAPRVLFGLRLSASVCLALFAT
YYLELQNAFWAATTAAIVCQPNLGASIQKGRFRAIGSALGALVMVALLAA
FPQQREPALLLLALFCGLCAAAATLLRNLAAYAAALAGITSAIIFADTVT
DPSSAPFLAIVRVGEIYIGIWSATVVAMLTGSGSARRQLCQTLGRISANL
LAGFSTDVTSGRGAADSRAARIALARELGPLNLAIDAALGEQSPVLADRA
RLRRIPFALLDALTSWRNAARFGERSSVAETTSRGELRASLAAIDPARFQ
SDPAGFRDSCRTALRRIDATRSSDVDAPIVVAARDVVQGLAAAADGMLAK
GAVGSNGRTPRASFVVADPLPALVNGARTAAAILAVTWFWVVTAWPSGPF
AIVFTAVATLIFASFGDHAGDLAKDYTIGVALMAVVGSVLYFGVLPALSS
FAALIAVLSVLFIVLGVMQAGPWHSVVFLAMTISSLPLLGVGNPITYDAS
AYFNLALAIVSGSAVGAMFFVAIPVVPPSLRLRRLIALSLRDLRRLLSLR
STPDQLRWTALMARRIELLPSEATADDVADLLALHAVGRAALRLEQEVSD
HRALGLLRAALTALADGHLADARMMLVTWRGQSLALDPPDDAEHRRWIAS
QSHIAVIIDAIDNHPVLLATPFRGWQPFFKAFR
>RPA3421 conserved hypothetical protein
MSGNRGFRGLAVIAAVVALTATAQAEDRVRYASVGDETRAPIGWIEFCSD
NPVECRGSRSQARDIVLTQTAWQDLLKVNRWVNDTIKPLTDMDHWGVIEK
WSLPSDGYGDCEDYVLLKRKMLIDAGWPREALLITVVRDKKNEGHAVLTV
KTDKGEFILDNQNEEVVAWTETGYRFVKRQSQSDPNAWVALGDGRPAIAT
ASAKQR
>RPA2377 conserved hypothetical protein
MNSSSASIPRTAHAAVPTAHASRYLQQLCKHWAHKFEAEFDPIQARIALP
LGEARLTARPDALLIDLTATDAANMTTFQQVVISHIERFAFREELRFVWG
>RPA3494 possible TctC subunit of the Tripartite Tricarboxylate Transport(TTT) Family
MNLFDGVSAAGRAAIALMALWLLPASAAEPDRLDQIDFPVRTVTVVVPFA
KGGPTDTVARLITAEMAKTLGQPIEIENMLGAGGTLAATRVAHAAPDGHT
LIVGHLGTHGAAVALFPKLAYRPDKDFTPVALLTEMPVLLLARKQFPPKD
LSEFASYVKSHTDNLNVAHAGFGSVSYASCLLLNRLLKVDPTGVPFSGTG
PALQALVEGQVDYMCDQIVNAVPALREGKVKAYVIAASERDPVVPDVPTA
REAGLPGFQVGAWTGLFAPRGTPEPIVAKLNAAVSRALDQSDVRTRLTDL
GALVPRPEQRAPVVLAQLVQEEISRWEDVVKGTTP
>RPA4242 conserved hypothetical protein
MSTPDATSPGSQQQDVFDFLGRGAGDAPVVRIDTHGAAVFLEGNRAMKIK
RAVKFPFLDYSTLAKRKIACEQELEVGHRFAPTIYRRVVPITRTDKGALQ
IGGEGPAVEWAVEMMRFDDSATLDHLARAGSLGPGLIDAVADAIAASHQA
APLAATAPWGASIEPILADDTNELAAGGFAAADVAALDNGSRNALGRLRP
LLEQRGVAGFVRWCHGDLHLANIVVIDGKPTLFDAIEFDPALASVDVLYD
LAFPLMDLLHYGRGSDSAQLLNRYLAVTNADNLDALSALPLLLSMRAAIR
AKVMLARPAADETIRRANRAIAESYFELALRLIAPPPPRLIAVGGLSGTG
KSVLARALSGNVPPLPGAVVLRSDVARKRLHGVADTERLPATAYTTEVTE
AVYRGLVERAAHILKQGHSVIVDAVFSKPEERDAIESVAAGLGISFHGLF
LTADLATRVARVAGRTADASDATPEIVRQQQSYAQGVIGWTSIDAGGTPA
ETLSRAVAALPQTAQVCST
>RPA4286 Catalytic LigB subunit of aromatic ring-opening dioxygenase
MPRQPTIFLSHGGGPCFWMEPPRRFGPHAYDGLRSYLSGVLASLPARPRA
ILAVSGHWEAALPTVSTSAAPPMLFDYYGFPEHTYRLSYPAPGDPALARQ
VQELLGEAGIATASDAARGFDHGVFVPMLIVDPAAQIPVVMLSLQQDLDP
AQHLAIGAALAPLRDDNVLIIGSGNSFHNLTTFFDGQEAESVAFDDWLTE
AATAPDAAVRNDRLTHWASAPGARACHPREEHLIPLMVTAGAAGPDPGRR
VFHDAIGNKRISGYAFG
>RPA1617 ErfK/YbiS/YcfS/YnhG
MVRRLAASALALISIATALPTAASAQELDLRSVMGFGPFYRSGPSANPIP
RETVMFEGNYAPGTIVISTRERRLYLVQGDGTALRYGIGVGRDGFRWSGT
HRITAKREWPGWTPPAQMLRRRPDLPRYMAGGEDNPLGARAMYLGSTLYR
IHGSNEPETIGQAVSSGCFRMTNDDVKDLYDRVRVGTVVVVKN
>RPA1659 conserved unknown protein
MSRIIELLNDEHRNIARLLDVLEQELSVFDRRERPDYEIFQAIIQYFKEY
PEACHHPKEDVVYRILKERNPAVAGTVGDIEKEHELEVDRLANFAKVVDD
VLADQELPRQTFHAAARDFIEHQRRHMLKEEQLLFPAALETLTPEDWAKI
DARIDDRKDPMFDGETASKFQNLYATILRWEQETAADRSKTSAVRA
>RPA0863 possible MgtC-magnesium transport
MRFLTTFQTADFFDTLVSLAVAFVLGMLIGAERQYRTRTAGLRTNVLVAV
GAAAFVDLAMHLAGADGAVRVIAYVVSGIGFLGAGVIMKEGMNVRGLNTA
ATLWSSAAVGCCAGGDLVAQAVALTVFVIAGNTLLRPLVNAINRIPFDER
TSEATYSVRLTADGGAADRLREHIEKRLEQADYPVAEVEVESVEHADDKV
EIVATLVSTAVEPNELDAVVAELGKESGVDHATWETSTKD
>RPA3726 conserved unknown protein
MVATKDKDLNDLFLDTLKDIYFAEKQILKALPKMAKAATSDKLRAAFEKH
RDETEGQIERLEQVFELLEKPARGKTCDAILGIIDEGKEIMDEYKGTEAL
DAGLLAAAQAVEHYEISRYGTLKQWATQLGMKDAAKLIDQTLQQEKTTDQ
TLTKLAESSINVAAAA
>RPA3881 conserved unknown protein
MNFFHRTRGFVTALAVAMSLAMPLSLAVTSAADARIGGGSSSGSRGSRTF
SAPPSTSTAPGAAQPFNRTYSQPGSPGMSGPAAGAAANKGGFFNRPGMGM
LGGLAAGFLGAGLLGMLFGGGFLSGLGSFASIIGLVLQVALVVILARLAW
GWWQRRNNPQPAYAAGQAPGQGPQPMNQGPQSPGPQAAYRTGFGFGGGAS
NDRPLEIKPEDYEAFERLLGDIQAAWSNEDIDRLRQLATPEMASYFAKDL
EENKAANDINKVSDVKLLQGDLAEAWREGESDYATVAMRFSLVDKTLERG
TNRLVAGSETPIEVTEIWTFTRRPGASWELAAIQQTN
>RPA3393 conserved hypothetical protein
MGRAVLSCLNGGALGRVAALGIVLLGFAADSSIPVTAGAGSLEARRQPMK
FGWVACDPDCGGWISAVGIVTTDTPKDFEDFARDRKLGGATVVLDSSGGS
VNDAITLGRRWRKLGLATTVGTSVPSASPLGSRARIEPGAYCESMCVFLL
LAGNSRYVPEGAHVRVHQIWMGDRAEDARAASYSAQDMSIVERDIGRLAK
FTFEMGGTGDLLSLALSVPPWEDLHELSRTELRDTNLITTQAIADLLPGL
GGAPAKTVASAEQKPVQDRFTPQPAAATKTAEALPTGGTAAAQPGH
>RPA2691 DUF88
MSTSSSNKIALFIDGANLYATAKTLGFDIDYKRLLKEFQSRGNLVRAFYY
TAIIEDQEYSSIRPLIDWLDYNGYTVVTKATKEFIDASGRRKVKGNMDIE
LAVDAMELAEHIDQMVLFSGDGDFRSLVEAVQRRGVRVTVISTISSQPPM
IADELRRQADIFTDLVELQSKIGRDPADRPPPREPREPREPRHHAPQFLQ
RPTALASRAGADDFED
>RPA3604 Uncharacterized protein family UPF0114
MSMTPETPLPPKGLGLRPIPMLIFGSRWLQLPLYVGLIVAQGIYVVLFLK
ELWHLFQHSFDFSEQQIMLAVLGLIDVVMISNLLVMVIVGGYETFVSRLN
LRGHPDEPEWLSHVNASVLKIKLAMAIIGISSIHLLRTFIEAGNLGAAGK
VGGYTETGVMWQTIIHTVFILSAIGIAWVDRISNLATEEAKAHAGH
>RPA2426 hypothetical protein
MSRSLSLVGLAALIVAPLIAFAPPAQAQIGNIFSDPPPRPPGTVPRGAQP
PADDEEEEVPDLPPQGRVLPAPGRAMQGSAMPGPVQSQPLPPPPGSTVIP
QNPPPGAQQPAEPNTGVANAPAATNPLPGLPPGQRQPRGTPPANPPPAPA
TLQPGDEIVTEPPAQKIVNKKASFAGLDKITGRTINFDADIGETVQFGAL
RVKTDACYTRPSTEAANTDAFVEVDEITLQGEVKRIFSGWMFAASPGLHA
VEHPIYDIWLTDCKDPETSNVASAQPDAPKPAAPPPPQRRRQQQQSQQTL
PPPSSLQQPAPQPQRPQRPPPPPFPGFQQ
>RPA3596 hypothetical protein
MNRRDLLKAVAALAPAALTTTIAGRVWAAPATDAKLLVVFLRGAYDAANV
LVPVSSSFYYESRPNLAIAKPDVGNPNAAVALDADWGLHPALRDSLAPLW
TSREIAFVPFAGTSDDTTRSHFETQDTIELGQSTKGSRDYRSGFMSRLAA
ELTRVKPIAFTEQLPLIFRGQAEIPNIALGNVGKPGVDDRQAELIKQMYA
KTKLASAVAEGFRVRDEVVKSIADEMTAANRGAVSPRGFELSARRIGRLM
REQFNLGFVDVGGWDTHVNQGAATGYLADRLGELGRALAGFREEIGPAAW
RDSVVVVISEFGRTFRENGDRGTDHGHGSVYWVLGGGLNGGRIAGEQIKV
AQPSLFENRDYPVLTDYRALFAGLVQRMYGLDGAALQRIFVGVRPADLGL
V
>RPA3041 conserved hypothetical protein
MGMHMPALIALGRFLFSILFIYSGASKLLDLPATTQAAGKFVVPEMLATY
TTQLEQAAGMPFAQMLALAAGGIELLCGLLIALNFGARFCALVLIVFVAI
GTYYFHDFWNQTGADAQANLAVALKSLSLVGGLLIIAGIGRGGAASGGA
>RPA0523 conserved hypothetical protein
MTEHPDDDPLLLRFRKLPRIVRLVYSRPRLFASIAIGVVAFALLPGWLRP
VTRALIGWDVSIIVYLALAYTMMARCGVAYIRRNAVLQDDGRFLILMVTA
IGAYATIAAIVTELGTAHRGAAELALATFTIALSWAAVHTTFALHYAHEY
YRGDREGGLAFPGSDEHTEPDYWDFVYFSFVIGMTAQVSDVGITDRTIRR
TATAHGVVSFVFNTALVALMVNIAASAI
>RPA3597 conserved hypothetical protein
MPSCIQRRFSDPARIIPAPHDERDAAMRTNTAGRWSARGSAILGIGIAAI
ITGGSAGAAEISAHDLALIDRLTWGINGSSVAQFQKLGAARWVNQQLHPT
ADSALPQPVAAQIDAMPDAAGLTPAAINAFQAQGKDADQLTDPEARKTAK
QAYQQALNDRAKQAATRSILHALYAPEQLRERMSWFWLNHFNVHQSKAEL
RLLVGDYEDHAIRAHALGKFGDLLRATLRHPAMLRYLDNAGNANGHLNEN
YAREIMELHTMGVGSGYTQADVESLAKILTGVGIDLKPEDPKLKPALAPQ
LVRHGAFEFNPARHDYSDKTFLGHTIRGSGFAEVDEALDLIVHNPATAQH
VSRKIATYFVSDEPPQPLIDKMAKTFTASDGDIAQVLATMIAAPEFDASL
KTAERFKDPVGYVYSAVRLAYDDKVVLNTVPIQRWLGRLGEGLYQRQTPD
GYPLTASAWNGPGQMMLRFEIARQIGSGSAGLFKPEQADAKDRPAFPLLQ
NALYFGGLSRTLSSTTRGALDQAISSQDWNTLFLSSPEFMVRQRAEAPHE
PS
>RPA0601 probable DMT superfamily transporter
MNPDTLLSWQLWAILSAVFAALTAIFAKVGVADINSDLATFIRTVVVLVA
LGGLLAATGKLVADGPIGPKTWTFLILSGLCTGASWLCYFRALKLGPASL
VAPIDKLSVVLVVLFGVLFLGERPSGTEWIGIALIAAGAVIIALK
>RPA3615 conserved unknown protein
MFLQFFTSLRDAQVPVTLREYLTLMEALDADLADQSVENFYYLSRAALVK
DERNLDKFDRVFGATFKGLENLLDAMDKAEIPAEWLKKLAEKYLSEEEKK
QIEAMGWDKLMETLKKRLEEQKKRHQGGNKWIGTAGTSPFGAEGYNPEGV
RIGQEKSRHQRAVKVWDKREFKDLDGNVELGIRNIKVALRRLRKFARTGA
PDELDLDTTIRETANHGYLDVHMRPERRNAVKVLVFFDIGGSMDSHVAQV
EELFSAAKSEFKHMEYFYFHNCLYEGVWKQNKRRFTDRTPTWDVLHKFPH
DYKVVFVGDASMSPYEIMVPGGSVEHVNEEAGHVWLERVLRTYPHAVWLN
PVAQRHWDYSESTTIIRRLFSERMYPITIEGLEGAMRELVR
>RPA3037 conserved unknown protein
MHRTVSMEISMNRRASQFRLSRFKSTTLALLALAVTGASTAAVAADDPDL
IFRRSTVFKLLSPNDKLAVYGIDDPEIKGVACHFTVPERGGFKGWLGLAE
EVSDISLACRQVGPIHFTKKLDQGDDMFSQRRSMFFKRMQIVRGCDAKRN
VLVYMVYSDRLIEGSPKNSTSSVPVMPWGGDAAVEKCGDFIK
>RPA3070 conserved unknown protein
MSLSSMTGFARSHGASGPYVFEWELKSVNAKGFDFRMRLPPGWDDIEPPV
RKRAAEVLSRGTIYANLTVKRANAVSAIQINQDVLASVLKVASEIAGKVD
AVAPSIDGLLGIKGVIEVVEPEADEAEEKAARAAVESAFGEALKSLIEMR
KREGSSLAAVLAQRLDELEALAKQAEAAPGRKPDAIKARLAEQIAALLDT
SDRFDSDRLHQEAIMMATKADIREELDRIASHIAQSREMLAKGGAVGRRL
DFLAQEFNREVNTCCSKSIDLELTNAGLAMKNVVEQFREQVQNLE
>RPA2102 conserved hypothetical protein
MRSAGEPRWVVPILASPLLWHAARFALVSAYLLGGVVKLFDFAAAVAEQE
RFGLHPGWLWATLAIVVELGGSLLVLADRLVWLGAGALGVLTFVAMLTAN
AFWSAPAADRWIQANAFFEHFGLIAGFVLLAMLSDQRRSTS
>RPA1062 conserved hypothetical protein
MFELLAFAIAIVALVIARKTQTQILQLRARLDALTAGQPLPATPQAATAA
PIAINEAALAPEAEAAPEIAATIAIEPSSPDTTATPAAEAVAEAAKPGFE
ERIGTRWVVWIGGLTLALGGFFMVRYSIEQGLLGPGVRVLLGGLFALALL
IAGEVTRRKESVAQLAALPIANIPAILTAAGTAVAFATVYASYALYDFLA
PATAFILLGLVALGTLAAALLHGPALAGLGVVAGFATPVLVSSGEPDYWA
LYIYLAVITAASFALARIRLWRWLAVTTVILALLWTLPGLEGAATLIAPH
AFHVIAGFTLAALLVVCGLLFGPSIEPGRIEPVSSAALATYLLGAALIVV
ASTHSDAALIVFAGLVAATLGIAWRAAAATAAVGAAAGLVGLVFLSWVVR
GSPELLVLPGGALPGIGADPLSGPVTAHLIAAAVFGLGFAGFGILAQGRS
PSAAVPVIWAAAGVFTPLALLIALYARIAQLDRSIPFAIVAVVLAAVFGF
ATEHLSKRDNRPGLPISIALFATGTLGALALALTFALEKGWLTIALALMA
AGTAWISLQRPIPFLRWLAAILAGIVVARIGYEPRIVGDTVGTTPVFNWL
LWGYGVPAASFWLASIWMRRRGDDAPLRMMESAAILFTVLLAFMEIRHAV
NGGDVYRDSAGLTEVALQVCVTMAMAIGLERLRLRSHSPVHNVAAILLAI
FAALGSVFGLLLWENPTFWATDVGGLIINRLVLAYALPAVLALLLSYAVA
GVRRPAYANGFAALALIMGLAYVTLQVQRVYHGPILAYGPTTDAEQYTYS
VAWLICGVLLLGAGLLFNSQRARLASAVVIGLTVLKVFVIDMSTLTGVYR
ALSFMGLGVVLVAIGWLYQRILFRPRGTPVTGDAPEDPVR
>RPA0923 conserved hypothetical protein
MMTGLYLGALAGIAVALAVLMAGAWVVQQLTGNSGWVDTIWTFSLGLTGA
VSSLWPIDGAAPDARQWLVAVLVATWSLRLGSHIAARTRHITDDPRYAAY
AAQWGTDAPKRMFFFLQNQAYGTIPLVFAIFVAAHAPAGSLRLQDYLGVL
ILIVAIAGEGLSDAQLKAFRENSANKGKVCDAGLWRWSRHPNYFFQWFGW
LAYPVIAIPFAEPLSYLWGYAALLAPLFMYWILVYVTGIPPLEEQMLKSR
GDRYRDYQARTSMFFPLPPRRSATL
>RPA0465 possible transmembrane protein
MIGWLKAISAAATTALLLMTAPARSEPAPISDEEALSLGVDAYLYFYPLV
TMDLTRKQSTNVEAGKEFGKGPMNSFVSVPAYPPADFRTIVRPNFDTLYS
IAWLDLSREPMVVTAPDTNGRYYLLPMLDMWTDVFASPGWRTTGTQAQKF
LVTAPGWSGAVPTGMQRVESPTPIIWIIGRTKTDGPSDYAAVHRIQAGYT
ATPLSQWGKPPVTPPAFVPDPAIDMKTPPKQQVDSMPAGQFFAYAAELLK
QNPPHITDQPMIARLRRIGLEPGKGFDLDKAAPNVRKALDAAPAEALKLM
SWKMPTMARVTNGWSMNIDTMGVYGNYYLKRAIVAQFGLGANVPEDAIYP
VNVADETGKPLDGTSNYVLHFQKGATPPVDAFWSLTLYDSEGFPVPNALQ
RQALSSWMPLKPNADGSLDLLIQNASPGTEHESNWLPAPKGPFTLTMRMY
APKQEALTGRWAPPPVTRVQPPTGFSQ
>RPA4694 Uncharacterized protein family UPF0065:Tat pathway signal
MSLIRRIVLSRRAVLTTAAAALAAARIPSSARAQGLYPTRPVRIVLPFAA
GGVADITARLIADQLGTKLGQRFYVENQPGAGGIAAARTVISSPPDGTTL
ALLSNGTAISVSLFKKLPFDPVKDFAPISSLGTFDFLFAVRSESKFKTLE
EVIKAAKQKPGALNVGTINTGSTQNLAAALFKTAAGVDLVIVPFRGTPEV
LVALLQDSVDLTIDSYSALKGNLSDGKIRALAATGPLRSKITPDIPTLRE
SGIEASIESWNGLFAPAGTPPAVISALNTALQEILADPALKKKMLELGID
AKPSTPDQLAARLRADIEKWRAVIEQSGIERQ
>RPA1106 conserved hypothetical protein
MSRPPLPPFTRETATQKVRMAEDAWNSRDPARVSLAYTPDSRWRNRAEFL
QGRDAIVEFLTRKWAKEHDYRLIKDLWAFDGRFIAVRFQYEWRDEAGLWY
RSYGNEQWEFDDDGLMRRREASINDVAISEGDRRFHWPAPGPRPDDVPGL
AADPF
>RPA0062 conserved unknown protein
MATPAFVKRITRSDAVLSLGVNFSKIYFKFVMWTGRYDRQAVPIPEPYIL
AMWHGRLIMAPMLQTKSKPLVALISGHRDGKIISRVGAAFGIQTAVGSSS
KGGMRAAREMMRLGKSGHSLFVTPDGPRGPRMRINSEGVLDMARLTGLPI
LPVSVSLSRHKLLRSWDRLMLPGLFAKVAIRFGEPITVEAGSPTAEMAER
LQAALTAAQNETDRIVGLQPVETA
>RPA1222 conserved hypothetical protein
MWGDCATRRHRRVNLLGYRGRSSSSPAWDNMRRVLKKRTAKPGTGPAHDI
VLSQNAGSAEAFRAIATELLRLTAAQQPKVRRRDPTGVHQMRIALRRMRA
AISVFGELIEGRDTQRLKRELKWLAGRLAPARDLHLLEVKIKSAQLGAGS
PAFLKRLGSDRGAAFASASATVELQRFRKLMSDLQRWIDAGEWTRGANGA
ERPSAAEFGQQVLARRARKLNKRLEKLEQLDDEERHQVRIAAKKLNYAIG
FFESLFDGRTGKRLERFRKHLKKLLDALGALNDVAVHRKLAGKFSRRATA
KLDPDAARQLADLDDVEIRQQMKAAAKAAAKLAEDPLFGD
>RPA0772 Pentapeptide repeat
MLQRIEQATAKKGRHDFGGERTHNLSHRNFDCSDFASADMRRVDFTGASF
VGADLTAAELQGGNFVNAILDGAKASFVSLEGSAFQDSSMRRAQVVGAKF
DHAIVLRSDFSGTVFDYSSLRGVRCERSVFDGGSFKIADLLGAENDLCNF
RGASFQGARLDGAYFHSSDLRFASFVNSKVRGVEFAGGSLVGAAFSSAEV
QASEFKWVSLDFVEFDDAVLDGASFAYSQLSTVLMRNASVWRTEVGSCDG
YDVASVKTGRFRDAVAGRRRKEAHGGEVLMREILADIDSFALPGSAKLRV
TKMFAAAGIGSDERSDSKRGVSEKAWKACIARSSNFEGKETFSSGILKAV
KSLACTPLASRREIIAGLIEDWSARMRLRRSLSVGFARQMLELEGQNCGI
NSLLTGQQVRELYRMLSASRAYIPETVPRSEVKILAS
>RPA0212 ErfK/YbiS/YcfS/YnhG
MSFGFRSAKLAMVAVVLAGASVLTAPRAEARPDLVVFRGDYSPGTIVVRT
GERRLYLVVEPGHAVRYPVGVGKAGKQWAGVTKIDGKFRNPAWAPPAEVK
RDVPTIPDVIPGGSPANPMGVAAMTLAGGEYAIHGTNRPQSIGGFVSYGC
IRMLNDDITDLYERVPVGTQVVVMR
>RPA3108 conserved hypothetical protein
MAITFDPAKRDWTRRHRGLDFATDAATAFAGRIVTKLDDRFDYGESRYIT
AGYVGSRMVVIVWTPRNGGRHVISMRYCHAKEEVRWAALFRRTSD
>RPA3148 DUF174
MLQPSLSHSAAGQAGDRINRPLAMAGDGGRSGPSTPSTSVITKTKPRTKR
PNLYRVLILNDDYTPMEFVVHVLEKFFQMDVEAATKVMLHVHHHGIGECG
VFTYEIAETKVTQVMDFARKHQHPLQCVMEKK
>RPA1146 conserved unknown protein
MTPQERQLVDDLFERLARLEATPRDPEAAAAITQGLRIAPNAIYGLVQTV
LVQDEALKRAHERIQELEADHAPQSQERGGFLDSMRDAFFGQGQTPQRGS
VPNVPPPSTGSRPAWNTGQVLGQQSATPPYGQPPQAPYGQPQGQYGGPFG
GQGGSAFGGGGGGGGSFLGTAAAAAAGVVGGSLLLSSIRGMMGGGSHQSL
ADQSGLGGGTQSPWGGDSSGGNLAQQAGLGDIGSQRGGVDDSSRYGMMDQ
NGNDYDRVADAGSDRDDMDLDSDDFDDGGDSDFA
>RPA3088 conserved hypothetical protein
MKIAIPTQDWATISGHAGQASRWMVYDLAEHRDGRPLPPPSQVELTKEQL
PHYFKDDGPHPLDGVELIVAGSAGDGFVRHMKKRGADVLLTGETDPATAL
EHIVKGEALPDQRFDITTTLCRLRDLFSRH
>RPA1892 probable large terminase, Rhodobacter capsulatus GTA orfg2 homologue
MLGGRGAGKTRAGAEWVRAMVAGTPPYATRPHGRIALVGESWHDAREVMV
EGESGLLRITPRRERPEWIATRKRLEWPNGAVGEVFSADDPDSLRGPQFE
AAWCDELAKWRYAEASFDMLQFGLRLGSRPRQLITTTPRPLPLIKRLLGD
PHTRVTRAPTRANAAHLSPAFLETVVARYAGTRQGRQELDGELIEDRPDA
LWSRDRIERARVAAAPPLVRIVVAIDPPGSSRPGADACGIVAAGRSAAGD
YYVLEDVSSDRLSPAAWAARAIALYHRLEADAIIAEVNMGGEMVRAVLHE
TDAAIPVREVRASRGKFVRAEPVAALYEQGRVRHVGCFPLLEDEMCDFAP
TGLSSGHSPDRLDALVWAITALIGGAHAGGPRMRVL
>RPA1936 possible serine protease/outer membrane autotransporter
MSTVGRFRHLSSLLLCTTFLVSAPMSAVLYAAESPSKSKSVAKSVLSPEL
QTKFSDAMRQGERALIAQAYDSTYRDPGLRDAVVRHVSTLAPTALRDVTT
AADLGVLTSGQRMVLAPNAAQVLATRTASASGLTASSYQNVAMAVNNSPN
LPNPMPSRTEQTWNLDMIGAQGAYNRGFTGAGVTVTVADTGFDTTNAGLV
NKLRTNLGKNYMVEIGKAFDPNDLSPESAQKTDIHGSHVAGIIAAEKFDN
VDAHGVAYDASIIPLRAIAEKGYTTYGNVDSSALALNYFASLSGTMVYNA
SYGPNSDGLTNLKLWTVGNIDDEANAAFNVLKAGKIIVAAAGNDRADNPI
AARNANGLALLPFFNPAHAAVGVYDDQGQQLDGSALQHQQGQIISVMSVG
ITKAAASYSNLCGVTASWCVAAPGGDDATGALVYSTVPVNTYGFAQGTSM
AAPTVSGAIAVLIQANPSYNAQDLSRLLFSTTEDLGAPGVDAVFGYGLIR
LDRATDGPTTLAANTAVSVVADQTTYWSRLLTTDGDFSKVGSGILSISGR
TNASGNVFAQLGTLAVDGTLTMSAGHRLEVAQPATLAGFGTIAGDTVIAG
TLSPGKMANIGDLVSNNVVPAGTVLNGNSVGSLTFNGNVTLTSTATTRID
IDGSLIVPGGPGTYDKIYVTGAGNVFYAAGTLTPVLRGSVGTVSNYTPAI
GTEFAFVQATDGASTAGSFSKLVQPTSGLPANGRFDLIYNPTSITLVVTP
SSFSQLADADKLGPTQRSIAGILDKDRAASGEMPTANEKALYDALYKLNS
EAEFDKALNQLSGPGQPAMASAPLQAFTGFLGAIGDRQDMLTSGSEIGQN
GTAQSFAMSYAGRNTMSAGTNAAMNAFASISPAERVQDGWSVWGQGFGRN
SRVGDSGDLSGSKAVSAGFAVGVDRRFSNSFNAGGAFGYTRTTATSTDMQ
GTVDTYAGAAYASWTPGAAVLDFRIAAGPSQMATSRQILLSPTSLQGSAN
GVGVGTTFEAGYRFAMGHDVTLKPFVGMTWQGFRRDAYSESQLPIGLVYA
ARTYDKLTSTVGAAVSARLRTTDGTTLAPELKVGWGYDLRDTTLVSEAAL
LDEAFLVDAAQPGRNAALVSAKLSGWRTETFRMFAAYTGEYRSNATSHQV
SAGARVNW
>RPA3139 DUF482
MTATGLERSTANAQSLRNDSHQRSFHRFDGLTASSDITLEAVSSVDAIAA
ADWDACARSGTIVPHPDGAGGACTTATAAYNPFLSHAFFTALEQSGSASP
RTGWGPRHLIAKHEGAIVGIVPCYLKSHSQGEYVFDRGWADAYERAGGNY
YPKLQVSVPFTPATGPRLLIRDRIDSARVAGALANGLMALCDLSKASSVH
VTFARESEWRFLAECGFLQRTDQQFHWHNAGYSSFDDFLATMNSRHRKGI
KRERRDAVASGITIHHLTGADITEDAWDAFFEFYIETGSRKWGRPYLTRE
FYSLIGQSMSEDVLLVMAKRNGRWIAGAINFIGGDTLFGRHWGAIEHHPF
LHFEVCYYQAIDFAITRGLKVVEAGAQGEHKIARGYLPQTTYSAHFIADP
ALRRAIADYLKRERMYVDEMGRELTEAGPFKKGNIADPA
>RPA3583 conserved hypothetical protein
MAKSRGAPAKSARRSAAAAARGATSDAKGSINLRIETGTRQLIDDAAAVL
GKTRTEFMVESARRQAVDVLLDQRLFTLDPERYDAFMQALDNPPAPGPKL
KALLRRVPAWRT
>RPA2549 conserved hypothetical protein
MTASWIAMLPYLAHQAAFGALAAAGFGVLFNFGWRTLAWCAVAGALALAV
RTVVQQTGGSLEAATFAAAFVTSFTAILALRWLGPACNAVALAGCIPMVP
GAFFGQAMLGYMAVAADTTGSSTAQIVAASQAFVRVLSIVGAIGAGLAIP
AYLLKSRQF
>RPA0611 conserved unknown protein
MSAVSLSAAEVIARLGLQPHPEGGHYRETFRDPRCDASGRSYSTAIHFLL
RAGERSHWHRIDAVEVWHYYAGAALMLEVAGEDGHRAATLGPDLVAGHLP
QVIVPPQAWQAATSTGEWTLVGCTVAPGFDFAGFELAPPGWEPAR
>RPA4303 conserved hypothetical protein
MKAFSDLTEREILAVAISGEEEDSRIYLAFAEDLAERYPDSARVFTEMAQ
QEKGHRHMLLRMYEQRFGPDLPPIRREDVKGFIRRRPIWLTRNLPLDRIR
KEAETMEFEAQRFYERAAERATDVHIRKLLSDLAEFEKRHEQRATQLTDK
ILTPDARSAEDHAARRMFVLQYVQPGLAGLMDGSVSTLAPLFAAAFATHQ
NWPTFLVGLAASIGAGISMGFAEALSDDGSMTGRGSPWLRGGICGLMTTL
GGLGHTLPYLVPDSWANAFWIATGIAGLVVFVELWAIAYIRARYMDTPFL
HAVFQIVLGGVIVLAVGILIGGA
>RPA2851 DUF173
MTAEILQFETGRPVEQQADQDEALVIDVEGYEGPLDLLLTLARQQKVDLH
KISILALADQYLLFIEEARKIRLELAADYLVMAAWLAFLKSRLLLPEPPA
QEGPSAEDMANALANRLRRLEAIREAANRLMTRAQLNRDIFPRGQVEEIA
EIKHPKFTATLYDLLSAYASQRQSRVLTTVHLAKRTVWSLSEARASLERL
VGLAEDWSRLDEYLLRYMPDPTQRATVLASSFAAALELVREGEVELHQSG
PLAPLYFRKRPPSPAMEAAALPDTPVG
>RPA4613 DUF683
MSDIETLKAEIKKLSARATTMKMNLHDLSEELPINWQTIMTVAQETQDAY
AALEAARKALKELEAKAA
>RPA3969 Metallo-phosphoesterase
MSDDGPERRFRTLFISDVHLGARGSQTNLLLDFLRVHDADTIYLVGDIID
GWALKSSWHWPQSHNDFVQKLLRKGRKGARIIYVPGNHDEFLRNYYGTHF
GGIEVVENAIHTGADGRRYLVIHGDIFDLVVQNAKWLAHVGDKAYDLAIR
LNRVVNAFRRWFGVPYWSLSQWAKHKVKNAVNYIGAFEETLAQEARRHGT
DGVICGHIHTAAIRDFHGINYMNCGDWVESCTALAEHEDGRFEIITWTDL
LKRNLPVPTVAARAA
>RPA3109 conserved hypothetical protein
MPRKKSAGPRSFGEPLTDDPDDAPELLDEFFRTGEIRVDGKIVRRGRPPL
GTQPKSSVTLRLDADVLDAYRALGRGWQSQINADLRRVRKLKKA
>RPA4429 Uncharacterized iron-regulated membrane protein DUF337
MRVPALKPILLQLHSVAGLVLAAVLVLVALTGAVMSFEDEIRGALDAGRV
QLAPHAGSRMPLDALIAKLQAGGDPVASITMPRSAASAAEVRFARKPDHT
RPAPLYLDPYTGESLGHPAADAFFATVRKLHRWLLLPGDGNGYGRTITGV
AALGLLAMLLTGLVLRWPHRPGSIKVWLKPHWQLRGRGLHRSLHAVIGTW
VLLLYLIMTLTGLWWSFDWYKTAATWLLARSPVTMQPPNRAAVTAAPSAA
ISLDRVWATLVAERGDRFEAARLTLPSGREGTVRVRAWLTDARDGAHDEF
RIDGRSGQVLSADIYADKTIGEKVLARVLDIHRGSIFGWPGQLLFMLAAA
AMPLFGVTGVLLYLSRRRHQRMQRAARAKVG
>RPA3044 Protein of unknown function UPF0061
MTAHFPFDNSYVALPPNFFARVAPTPVAAPRLIKLNRPLAVQLGLDPDLL
ETPEGAEILSGNQMPETAASIAMAYAGHQFGNFVPQLGDGRAILLGEVVD
RNGVRRDIQLKGAGRTPFSRMGDGRAALGPVLREYIVSEAMAALGIPTTR
SLAAVLTGETVLRDPIQPGAVLTRVASSHIRVGTFQYFAARGDLASVRAL
ADHAIARHYPEAAQAPSPYLALLEGVIGRQAELVASWMMVGFIHGVMNTD
NCSVAGETIDYGPCAFMDTFDPKTVYSSIDQFGRYAYGNQPPIALWNLTR
LAECLVRLLADDDDKGIEIAQTALGGFAEQFNAAYLAKLAAKLGLFTSQP
DDQQLSQEFLTALAKGEADFTLAFRRLSDAAVDPSDLGEVRALFADPAAF
DEWAPRWRARIATEPQDATTRQAAMRRVNPAYIPRNHRIEAVIRAAVDRD
DFAPFEEILTVLANPFEEKAEFARYAEPPQPHEQVLETFCGT
>RPA3119 DUF205
MMIGIYIAALVIGYLFGSIPFGLILTKIAGTQDLRSIGSGNIGATNVLRT
GRKGLAAATLLLDALKGTAAVIVAAYLASGTDAIAANAAMLAALGAFLGH
LFPVWLKFKGGKGVAVYIGVLIGLFWPAAVVFCIMWLATAFTSRYSSLSA
LVASFVTPIFLWWFGHDSLASLFAVLTLLLFWMHRENIKRLQAGTESKIG
QKK
>RPA1092 Carboxymuconolactone decarboxylase
MHKNWIDLTGELSVALREVRTGAPDVMKGFSAIAQAALKANALDTKTKEL
IALAISVATRCDGCIGFHAEAAVKHGATRDEVMETMGMAIYMGAGPSVMY
AAQAVEAYDQFVKKKAAAASPAE
>RPA0351 hypothetical protein
MAKNKKIARKFATDQANLLENSIRRELDWAGDRSVVEKGSATDKKNYAEG
LSRALSQRFADALRKSFDGILPDVDGFGQESKARTGKGLKKLDVNYSTVE
LGLGLGVSIKTINFRDAKTKRYTKNYTRVDNELRAEAADYHERQPYAVLC
AVVFIPLDACDDGGSAPSSFGQAVQIFRYRAGRERPVDDATLFERILVGL
YDAGSPCFGTTGFFDVMDAPPRTGRPSALKTFEQAIDKIVEAYDARNKSA
FKWAEGETEILAAPEAEDDEDET
>RPA1097 DUF28
MAGHSQFKNIMHRKGRQDAQRSKLFSKLAREITVAAKLGTPDPAMNPRLR
AAVLAARAENMPKDNIERAIKKAIGGDSENYDEIRYEGYGPGGVAVIVEA
LTDNRNRAASDIRSFFTKSGGNLGETGSVSFMFDRTGIIEYDADKASADD
MLDAAIEAGADDVVSSEAGHEIYASQETFRDVAKALEAKFGEARKAAVIW
KPQNTVAVDDETGEKLFKLMDALNDHDDVQNVYANFEVSDALMAKMAG
>RPA4142 PilT protein, N-terminal
MIVIDTSAIIAIFREEPEAAQFARLIASDDQPVLSSGNLLETAIVLRGLK
DIQPAKAERWLDDFIAEAGIRIEPVTTEQAGYARSAHLRFGKGTGHKAAL
NYGDCFAYALAKALNAPLLCKGNDFPLTDIPLVA
>RPA1591 conserved unknown protein
MTIKTLRHILVACAVVAAAGAAVAQQSGIKRTPLQKIEFPDGYVTVSGLA
ELPPGGNIGRHTHPGIETGYLLEGEAVMSIDGEPDKHLKAGDSYVIQAGV
VHDAKVHGDKGAKVMAVWVVDKTKPLATPAK
>RPA4357 conserved unknown protein
MTQTNNRLFDEIGRLMNDAAGAAQGVKREIDGVVRSQAEKILRDLDLVKR
EEFEAFKEMVRLTREENEALKARIAALEARQGGGSEPPTALA
>RPA2320 possible TctA subunit of the Tripartite Tricarboxylate Transport(TTT) Family
MELFSQLGHGFEAALTLSNLGYCLLGVTLGTLVGVLPGLGPVATIAMLLP
ATYGQPPLSALIMLAGIYYGAQYGGSTSAILVNLPGESSSVVTCLDGHAM
ARKGRAGAALGIAALGSFFAGTVATLLIAALAEPLSNLALKFGPAEYFSL
MVLGLVAAVVLANGSLTKAVAMTVLGLLLGLIGTDVNSGTERYSFGIPEL
SDGFGFVVVSMGLFGIAEIIHNLEKGIERNVFATSVGGIWPTREDFKAAW
KPVLRGTFLGSLLGVLPGGGAILSSFAAYTIEKKLAKDPSRFGQGAIEGV
AAPESANNAGAQTSFIPLLTLGIPPNAVMALMVGAMMIHGIVPGPQVMTE
KPELFWGLIASMWIGNLLLVLLNMPLIGIWIRLLRVPYHVLYPAILVFCC
IGIYSVNQSTIEVAFAVGFGLLGWMFIKLRCEPAPLLLGFVLGPLMEENL
RRALLLAHGDPTVFVTRGLSLALLLIALALIVILVLPSVRQTREVAFHED
>RPA1866 conserved hypothetical protein
MRLIVLTLLALLIAAGVGIGATWMTATRGTDFGTLTIGAWTARPRVGTSE
VDPYARAAIVRSGALPVGAGDGISFLATTDDSKRPLDGRCDVEVSGITPA
ARFWTLTLYDPRGNLIANTLERYGFTSQEIIRDAEGKFSIRIAARARGGN
WLPTGGVDRYWLMLRLYDTPVGIATRTQRDAPMPAVATIGCP
>RPA0897 conserved hypothetical protein
MSDKPRSFFGVFTERGPLGALAVAGVLLFGIVGPASAQFFDFGGPPPARQ
QRGGGGGGFGWFGNDVFQPFHQNQPQRRAAPREDYSKAPAPEKRETIPDR
NVLVLGDSMADWLGYGLEQAYAEQPEMGVVRKFKTISGLLRYAPKGEPSD
WVAAAKEVIAQENPDAIVVMLGLSDRIAIREQATPEKDKKKDDKAAAKPG
EEAKPDAKADNKTDNKTDAKPDGKTDAKPDDKAADNAAADDDEDDDDSLR
IMTEKGKRAAGGVAQFREDRWSELYAKKIEDLINVLKAKNVPILWVGLPA
VRGTKATSDMQFLNALYRDGAAKAGITYVDVWDGFVDEGGRYVLQGPDFE
GQIRRLRSYDGVYFTKAGARKLAHYAEREIARLLAARSGPIALPTEPAAP
DTAAKPPAGPAPRPAAGPILPLVASSVSTDRLLGGPGTQPAPVDALVART
LVKGEPLSAPAGRADDDVWPRREVSIEKAQEPPPPKEQPKPEIPVASAKP
SGSAPSASAGSQPQQQQQQRRVARSAPPPPPPTASGFFGFGGPPQQQGRR
GPPPSASGFFSIFR
>RPA0827 conserved hypothetical protein
MKLLFDQNLSFKLCQDVSDLFPGSCHVRDVGLTQADDRTIWSFARANQLT
IVSQDADFSDMAMLLGPPPKVIWVRSGNRPRVAIALLLRTYSGLIAEFEV
NDAVCLEIY
>RPA0620 Uncharacterized BCR:Protein of unknown function DUF195
MAVQSATPAAMDQILFRLGDLPVNIGMAAAVFAALALVLLTTIVVLIARG
SATRRTAEIDQRLALLMRAQHEANGRVDAMGRALAGRQAEMARAMSERLD
SVTHRFGQSLTQSTRYTMQSLQALHERLGIIDRAHDNLTELTDQVTTLRD
VLANKQARGAFGQARMETIVQDGLPKGSFAFQYTLSTGKRPDCVVLMPNQ
PPLCIDAKFPLEAVTALREATTEEAKKAASQRLRTDVMRHVDDIASKYLI
PGETQDTALMFVPSESVYAEIHDGFDDVIQKAYRARVVLVSPSLLMLAIQ
VMQQILKDARMRDAADQIRTEVLSLGDDLARLRERVTKLQTHFGQVTDDV
RQILISADKIERRAARIEELDFTDQAAGTASAPAPEAPLAPDRADLFAAA
SFRIDEMTRS
>RPA1610 conserved unknown protein
MTDALDIDHLRQWIGRSEQATDFVTPQLVKGLRATLFLDIGKPQTGDAAP
FTVHWCLGQPVYPMDQLGPDGHPTRGGFLPPVPLPRRMWAGGELQFVDAL
KVGDEVTRTSTIGDVTIKQGSTGTLCFVTVNHEITTPRGVAIRDRQDIVY
RDVPPPSSAPAAPAKPAAAPPAAKHRESHLADPVLLFRYSALTFNGHRIH
YDRDYVTKVEGYPGLIVHGPMQAALLVEFAAKLKGSVPKTFSYRGVQPLF
DGADFSVNANETAAGLDLWTANADGVPTMKATASW
>RPA0530 conserved unknown protein
MLTLLHCVSACCREAAGSLSSVCVGDMAKSDQPTTIKKYANRRLYNTGTS
TYVTLEDLATMVKDGEDFLVYDAKTGDDITRSVLAQIIFEQENKAGQNLL
PTTFLRQLIRFYGDSMQMVVPKYLEQSIDSLTREQEKFRKQMASTFSLTP
FAPLEETVRRNMELFQQTFSMFVPRPPSHESTDEVPAETSGGADSIEDLR
RQMKEMQDRLERMSLEQPKKDE
>RPA0284 conserved hypothetical protein
MKKITKSAPSRKSPRPLTRVQVRAAAGQRSRGWLITGGLTIPVALGRGGI
LANKREGDGGTPRGTFHPLRLWWRPDRGPRPRTHLPIRIIGPDDAWCEDP
KSRHYNRPIHRSSAGEGDRLMRDDHLYDLIVEIDHNTRPRIAGRGSAVFL
HLARDNFGPTAGCVAMTRGNLQRLLARIGPRTKIVIG
>RPA2996 conserved hypothetical protein
MLMTKPPVPQLNLTVVREAAERAARAIPPLWPLESSVAVNPFLGQTGEPL
AMAAARLRRVAGAAVTMPRTWYAERIASGELSDVDLAAAIDAAPPTTRPL
TIAELKRAAQIEIAPPQALPTVAELASAVSGFDWTGFVAERISAWASGYF
DRGQALWAAPKGPNAYAAWRLTATHDLTPEIFGLTGFAADVAAAPESADA
ALIRAVEQLGLSEAASESYFHRLLISLGGWAQLARYRLWQAELSGSTDTA
VTDLIAIRAVWDSTLLRKYQPQIAAEWTDAINGYVQPLQPTEDDHINAIL
QDAVERAAQRKLQTVLAASAQPKPDDRPALQMAFCIDVRSEPFRRALESL
DPRIRTLGFGGFFGLPIAHRRFASDVVEARLPVLLPPRVTTSCSGHTHAH
EANDRAKRVAARAKRAWGRFKLAAISSFAFVESMGPVYVAKLLSDGLRSG
TRRTNADPAPQFDPPLALGARVDTAEAVLRAMSLTGPFAPLVLIAGHGAS
VVNNPHASALHCGACGGFPGDVNARLLAGLLNDPQVRTALIGRDIAIPAD
TLFVGALHDTTTDAVTLYDADHPSPAHASALAQTRDWLATAGALTRSERA
LRLPRAATGGAIARRARDWAEVRPEWALAGCRAFIAAPRPHTSGRDLQGQ
AFLHDYDWRKDTDFSVLELILTAPVVVASWISLQYYGSTVAPETFGAGNK
LLHNVTGGIGVVEGNGGLLRAGLPWQSVHDGERLVHQPLRLSVLIEAPHE
AISTILDRYPEVRALFDNRWLHLFALDDDGRMNWRYVGDGGWEHADNPPT
NQRVASFE
>RPA1487 conserved hypothetical protein
MTEAATTAERVLDVREIPPYQRHEIIPRLFDHLAPGQAMQIVVDHDPRPL
RQFFASVHGDDCQWTYLEQGPAVWRVRLRRAA
>RPA4442 ErfK/YbiS/YcfS/YnhG
MALNGTKLGLLALAGLLLSGCMQTTYQAAPEANLKPNDKAQLAKARYAKV
SVPEPFRRAIVDYHRKEAPGTIVVDSDNHFLYYVLDNGKALRYGVTVGEE
ALAFSGIARVGNMAEWPKWTPTADIHKRIEGLPSSVPGGIDNPLGARALY
LYQGNKDTLFRIHGTNQPEYIGASISSGCIRMTNEDVIDLYNRVKMGTIV
VVLDPKQGDSPMNSKMALQGGSGTATQ
>RPA1770 conserved hypothetical membrane protein
MSLITNAKLYPPRHPRAVHLAHHLAAIAPGVLVTAAIAAAAVGLRAIPGM
PGISPMLLAILIGIVAHNGIGTPSWAVPGVKFSLRRVLRFAIILLGLQLT
TAQLIEVGGEGLAIIAATLLATFCFTVWFGRVLGVDRKLTELIAAGTSIC
GASAIIATNTVTRADDEDVAYGVACVTVFGSLAMVSYPLLQSVLNLDAHG
FGLWTGASIHEIAQAVAVSFQGGQRAGEFGTIAKLSRVMMLAPLVIGLGM
LARVRAKHQPAGDHTATPPMPWFVLGFVAMIGVNEFVAIPHEAKSWIVAI
TAFLLSMALAAMGLETDLRKLAARGLRPALLGLTASLFIAGFSLALIKLT
GWHG
>RPA3981 conserved hypothetical protein
MRRFLLSAAKILVSAALLYLALRKTDFAALAARIDATSFGWLAAAVAATI
FQLFVNAVRWRAIGACCEAPLTTGRSMRYAMIGSFFNQTLPSAIGGDAVR
VWLLARAGAGWRAATYSVFVDRATGLIALSAIIFVTLPWTLRLIDNADGR
LGLMLLDFAALGAGVVFLTLPFLRFGLLDRIWATRHMRGCSEVALKALSS
PKSAVILIATSLIIPILAVVIAWCVARAIESPSSFVQLFELVPPIMLITM
IPISIAGWGVREASMGLAFGYAGLNPSDGVAVSLLFGAVYFVVGGIGGLI
WIMSAEKAAKGDAPIEVPE
>RPA0478 conserved hypothetical protein
MGGLIWILVVGFVAGIIARVLSPGPNNPQGFVLTTVLGIAGAFLATAIGQ
MIGHYGPNQGAGFITATVGAVSVLFIWNRLVARRVISDPGNRYPYDPR
>RPA1112 DGPF domain
MMRFMMLMIPKGYEAAKPGVIPEADAVAAMMKYNEALQQSGVLITCDGLH
PPSMGARVSFDTGKPVVTDGPFAEVKEVLGGYWMIEVASREEAIAWATRC
PAGPNEIIEIRQVQEMADFPAEMQSELAGFEAMQAAGRR
>RPA4654 conserved unknown protein
MTKPAISRFPVPEIASLPDDIRTRILAVQEKSGFVPNVFLTLAHRPDEFR
AFFAYHDALMDKPGPITKAEREMIVVATSNANQCQYCVIAHGAILRIRAK
NPLLADQIAVNYRKADITPRQRAMLDFAMKVSAQAYEVGDADIEALTRHD
FSEEDIWDIAAIAAFFGMSNRLANVTSMRPNDEFYAMGR
>RPA0231 Uncharacterized protein family UPF0114
MTDQPDPTPSPSSAKRVERGFETLLFNSRWLMAPFYFGLVISLVVLLYKF
VMLLYEFIVHATLAKESDIILGVLSLIDVSLTGNLVLIVVFSGYENFVSR
IDPGNHPDWPEWMTKVDFAGLKQKLLASIVAISAIQVLKAFMNIDSYDQT
KLAWLVGIHLVFVVSTLIMALSDRWGHSDDKGGH
>RPA1105 possible OpgC protein, required for succinylation of osmoregulated periplasmic glucans
MGQGERDLRLDFLRGVGQWMIFVDHIPYNFLNWFTLRNYGFCDAAEFFVY
ISGYSIGFAYGPAVRAGEMVAATKRLWTRAAQLYVAHIFLFLFFTAQISR
AARRFDNPMYKDEFNVAQFLENPDVMIQQALLLKYKPVNLDVLPLYIVLV
VAAPLILWGLLRRPTLTLVCSGLLYLVSRHFGWNLPSFPDGHWYFNPFAW
QFLFVFGIWCGFGNGPQIRPWVKSLPNQILSWTIVVVALVIALSWNFEAL
YGLVPESIGKILYPIDKTSLAPVRLIHFLALLAIVVKLLPPDLPALRSEH
LRPIILCGQRSLPVFCFGVILAFTAHWILVQVSGSIAMQILVSVGGIGLM
TGVAWLATWYRTLPTLVTVPATVPVDHGDEVPAEAEKASIAEPSAPERVP
TERA
>RPA4580 Uncharacterized protein family UPF0065:Tat pathway signal
MEDAMTISRRALLGTAAGAAICAVGARLVSPALADAAAYPNRPVKWIVPY
AAGGATDVLSRLVCQYLSEKLGQSFVVENKPGAGSTIGTQAVINSPADGY
TLLLTSTANAINASFDRALPFDFAKSITPVAGIARIPLVLVINNDIPATN
VAEFIAYAKAHPGKISVGSSGIGTSLHLSGELFKSMAGIEMVHVPYRGSA
PGLTDLMSGQIQAMFDNVTSSFALAQAGKIRALGVTSRERSAVLPEVPPI
GDSLTGYDTSSFYGVGAPQGTPQPIVDLLNREINAALDEPAIKQRIADLG
AIALHGDARQFGAMLAAETESWRKIVEGSGVHKEG
>RPA2071 hypothetical protein
MRVINILTLLLIIVGGLNWGLVGLFDFDLVSALLGNGSAETATSSTAARI
VYILVAISAVYQIVSLSRLVAARDSVLGTNTTY
>RPA3828 transcriptional regulator, XRE family
MIAIRDVVDDWKLTQAEAAKRLGVTQPRMNDLLRGRIDKFSLDALMLLAT
AVGLTVEWRVVKPAA
>RPA1573 LemA family
MRKFLTVLAALAALSLSNCGYNTIQSEDQEVKSTWSEVVNQYQRRADLVP
NLVNSVKGFAQQEKDVLLGVTNARAKVGSIQATPEVLNDPAAFQKFQQAQ
GELSSALSRLLIVTENYPQLKSDELFKNLMAQLEGTENRITVARNRYIKA
VAAYNVTVRSVPTNFTAMMFGYKEKPNFTVENEKAISTAPKVDFTPAPAP
APAK
>RPA3413 Uncharacterized iron-regulated membrane protein DUF337
MTRRAIRLWVWGHRWSSLVCTVFALLLCLTGLPLIFKDELGPDLKLAATT
APPGSASVDSMIARSLAARPGEVVPYLFYDREHPILKVPTAASMTADPAS
FHYQVFDTRTGLQIEIPQPNEGFLYVMTRLHLDLFGGVSGTLFLGFMGLL
LVVAIVSGLVLYGPFTRRLAFGVVRTDGSRRRTWLDLHNLLGIVTLAWFG
VVSFTGVINTLAAPIELAWQANQLVEMAAQAEPAKIEGQRASTDAVLRDV
RAAVPGMNVMTVAFPGTPFATPSHVAIFLTGNTPITSRILKPALADASTG
QVVSVRDMPWYVTALFMSQPLHFGDYGGLPLKIIWALLDIATIVVLISGL
YLWWAKRPRRDADAEAGSPAS
>RPA0477 conserved hypothetical protein
MGLLDILNGMQNGPRGPADPQDKSGGMSKFTMAILALLVWKAYKHMTSGQ
PQAAPANQPRPMPAPPPANTGGGWGDFLKGGLGGLVLGGAAGSVLSGGLG
DLLRQLQHNGLGDAANSWVGHGPNQQIGPSDLANALGADQIDAMTRQTGM
SRDELLNGLSRYLPDVVDQLTPDGRLPTDDEASRWI
>RPA3029 conserved hypothetical protein
MKLMTILKWALIFLVVSIIAGIFGFTGISAASADIARILFYVFVVIFVVL
LILGFTIFRA
>RPA0269 DUF343
MSLSPSERPDTVDRKLLDILVCPVTKGPLEFDPARQELISRGAKLAYPIR
DGIPIMLPEEARKLG
>RPA4146 DUF323
MLLAFKAKIALAALAGFVTPLAVTPLVIDHGDAALDGQQAMVEIAPGAMD
YRLPGEFTRDGRQIEAPRTKLQFDRPLSIMLHQVSAADYQRCVDDGACHP
MPAATAVRADRPAVQVSWQDATTYADWLSRKTGAHYRLPTDAEWAFAAGS
KFKDDGLALDSDDPSVRWIARYERESDRDNLSAALRSFGGFGANERGLLD
LSGNVWEWTTTCFDRSAIDAAGRPQAQTANCGVRVAEGLHRAYVSDFIRD
AKAGGCSAGLPPIHLGFRLVREPAAPLARLRGQLDRVVTAAGI
>RPA3100 Uncharacterized protein family UPF0065
MQMLGHLFRRTLPPTLALLGITACAISSAAAAQPAAWPPKTVRIVVPFAA
GATPDLVGRVLADQLHTQYPGSTFVIENKPGASGNIGTDTVAKAAPDGAT
LGISIPGPLAINTLLFPKLPYDPERDLAPVTLLTRMPSVLAVPATAGIGS
VDEFVAKVKSDKGGFAYASIGAGSLSQLCMEAIAQKAGAAMVHIPYAGSP
NAMTALIRGDVQAACLPAISVAPQHAVGTVKILAVTTPERSPFLPDVPTL
KESGIDVQSDAWNALIAPAGTPPELITAIHRAVEAALADPAVVTKLKTQM
MMPEASTPEVLRQKIADEKRSWADVIRAAGIKVQ
>RPA0536 Acyltransferase 3 family
MTANGTLAPRMHGGRVDWVDYAKGICIIMVVMMHSVLGVEAAAGQTSFMH
YVVEFARPFRMPDFFLISGLFLSVVIDRGWRTYLDRKVVHFAYFYVLWVT
IQFAFKAPSFAAEMGWAGVAKLYALSFIDPFGTLWFIYLLPIFFVVTKLT
RRLPPLLIWGVAALLESLHVATGWMVPDEFCARFFYFYTGYLFARHVFAF
SDAARARPALALFGLAVWAVVNAVLVHVGAADLPVISLLLGLAGACAIIT
VGTLLAEKRWLDGLRFCGEHSIVIYLAFFLPMAISRSLLLKYAPFLDLGV
IAIIVNIAGVVGALIIWKLAMKSGATFLFERPDAFWIAPRKQATRLQAAE
>RPA3046 conserved hypothetical protein
MIVRPQPNLLQLFFLVQGSVVQRIFPQALVIAGLSVAVVWAHHAYPGLVS
DFNSAPFTVLGIALSIFMGFRNNACYDRWWEARRHWGELICLSRNLARQT
QILPYSGDDPEQSRRKLLTLAMAFAQALVLHLRPGSDTTKVTRRLSAETR
ARYEASRNAPEVILAAMQAELAALHRSGELRDIPFQIIDRTIGQMAMVQA
ACERIRSTPVPFGYSLLLHRTAYVFCLLLPFGFANTLGWLTPFATALAAY
TFFGLDALGTELEEPFGQLPNDLPIAALADTIEINLREALGETDLPPLPA
PVDHILV
>RPA4304 conserved hypothetical protein
MRKLLLGSVAVFALATSAALAQTEGEFPATLAGHAVLPATSFIDAPADAP
ADLKTSGKYTTGQRVEAQGSVMGKSNGRPTGVSVPFKGQPLQGHSGIKAM
PDGTFWVLTDNGFGSRYNSADSMLYLDNYKIDWATGAVDRKQTVFLHDPD
KKVPFRIVHEDTDKRYLTGADFDTEGFQIIGERFWIGDEFGPYIIEADLT
GKIIGVYDSMADGKPIKSPDHWSVQSPGAPGATYTGVNLKRSKGYEGFAA
SKDGKFLYGLLEGPLWDADKKDWEKVDGREASRILEFDVAQKKFTGRSWH
YVFEQNGNAIGDFNMIDATHGLVIERDNGEGTKDKACPEGKTGTDCFNDL
AKFKRVYKIELTDANAGKPANKIGFIDLMKIRDPDKKAKKPLTDGVLAFP
FFTIENVDKVDDRHIIVGNDNNLPFSSSRDPNKADDNEFVLLEVADFLKA
K
>RPA1209 conserved hypothetical protein
MPEASALYCHRGTRAIQRMVEFAPSTGGLALWVQHRDLAADAPADVPAAG
TDGTTVYYSAAFERLPLPEQVGVVAHEVLHIALRHSQRFVELQRTQGDVD
LELFNICADAIVNSTLAHLSWLQLPANAVMLEQIVFQALGREQQPEAALL
EWDVEKLYRAIDDRDRDGTSGKQQSGRQSQQGSEADASGSGGKDQAQSQS
GAGETETVRPSDGAKSAKVRALGAGGSRDLVPNPDSQSAPEHEAERAREW
SERILRGHAGDGAFSMLRALIADLPRTRTPWAQVLRVQLARGLARKPALT
WSRPTRSYIANQGRAGNHRMPFEPGFSPTKNEPRLVLIIDVSGSIDDALV
QRFANEIETIIRRQEAGLVLIIGDERVRQVEVFEPGRRFVLSEIEFSGGG
GTDFTPLLAEADRHKPDITVVLTDLEGPAHFKPRWPVIWAVPQEHAHAVQ
PFGRLLALD
>RPA4212 conserved hypothetical protein
MPLVKNGQITADEFVAVADDVELPAEGAILISAARLLADPERLVARNSPL
GVIWPNNRDVAELKPWLDRLALIALVFPTYKDGRAHSQARRLREVYGYRG
ELRATGQVLRDQFTFLVRNGFDALDVKKETDAHAFGEATHRYTEFYQPTG
DGHRSALQLRRQRHLTSAT
>RPA4011 possible serine protease/outer membrane autotransporter
MQGHHFGGDMSNSEAIDNTTAKLRLAQSSSLLALALLIGSAPAQAADTDW
GWLAIGAPAATAQGWTGKGVVIGVVDTGIDFSHPALSGRAFDYNYGSFVA
GSNHPHATHVAGIIGATDINRGMEGVAPDVRFSSMKIFTGAGGSYLGDAA
VADAYDGAIGSGVRIFNNSWGSSDSIANFTSREELLAHEPLLVGAFTRAV
NADAVLVWSTGNDGRSQPSWQAAAPYYIQELKANWIAVTSVGENGTIASY
ANACGVAKAWCLAAPGGDFNPGIYSTIPGKDYGYMSGTSMAAPYVTGATA
IARQMFPKASGAQLAQIVLQTSRDIGAPGIDDVYGWGLLAVDNIVDTINP
RGAALFASAAWGRFTTLSAIGNTVLDRISDLRNGRGDVVTAPLAFAGQNG
AFSQSGSNPRNAYAADLAAAPQPSPLGFGSVWARGLAGRATLSGSASSPQ
TTADISGGLLGFDLVNNQNLLVGIAGGGSNTNLTASGISDKAGAQAWHVL
GYAAAMYGPAFVNVAGGWNSFDQSYQRRVIPGTAGTVFASTISAAQSSST
DVAYFFQGRGGWTFQTEVGRIEPYVHGATRNQSFGGFSETNASIFSLSVP
SASLSEAEYGAGVRWACAPIKTVDQRVAVAPTIDLAYVRFTNDGPIQVET
NLLGTSVVGQTAALGADAIRVAAGLSLTSLAGISGSFGYTGTVRDAATAH
TVSGGLSIKF
>RPA2819 conserved hypothetical protein
MRWLADECVSASLVAALRADGHDVLYVAEMAAGMRDADVVTMAATDGRLL
QTEDKDFGELTVRFGRAVPGLVLLRIDPTNAKLQAVRLREAITRHGADLF
GRYVVVEEARMRGRLLRPAS
>RPA2079 hypothetical protein
MSSAKPLLQASLKEEEAMAAWIDQNIDSVTRSYVQAQAA
>RPA3114 SEC-C motif
MNCVCGSGKTYDDCCGPLLARTRSAASPEALMRSRYAAYALKDFDYIVET
TDPERRDLFDHDVNRAWMEESDFLELRVLGSSEKGSRGTVEFIARFRRGG
GPEQSHHERSQFRKARGRWYFSEGEAVD
>RPA0253 DUF589
MAYWLVKSEPSVWSWDQQVAKGAAGEAWTGVRNHSAKLHMVAMRRGDRAF
YYHSNEGKEIVGIAEIIREAYPDPTDASGKFVCVDIKADKPLKTPVTLAA
VKAEPRLADMALMKYSRLSVQPVTAEEWKLVCKMGGL
>RPA3565 ErfK/YbiS/YcfS/YnhG
MIRFPRTSRAATVAIVAIGTIAIAAPANAAQPAPFPFLPFLPQPAAMQPL
AYAPVEQTAPQSQVAPQDDEDGTVAELPARLKRQIVAYQTREAPGTIVVD
TPNTYLYYVLGGGRAIRYGIGVGRDGFTWSGVKSVARKAEWPDWTPPPEM
IARQPYLPRHMAGGPGNPLGARAMYLGGTVYRIHGTNAPSTIGTHVSSGC
IRLTNEDVKDLYSRVAVGAKVIVLPDNRRAADAGTGRRG
>RPA2749 conserved unknown protein
MDAAAAPERIDLAGLVADLNALLRLKTTVIGMKLFRTVAEMEAVPKIRRP
NAIHTTDQIVSMASRLGWTVGITAADLVGEQCRAVIGLAPQDEQWLAGRS
YVGVWHATPEDASARQQALDVVPFGHHQAMAVSPLASGRLDPPDICLVYA
TPGQMIILINGLQYAGYKKFEWSVVGETACADSWGRALKTGEPSLSLPCF
AERRYGGVPDEEMLMALSPAHLAKAIDGMKQLAKNGLRYPIAPYGIQADV
RAGMGVSYGNK
>RPA0303 conserved unknown protein
MWGLPTPPAVPKRAIGAPRLMPTWCKPALCRVGAARPETKAAPAAPARSE
SECDVDIYTIIFLALAVFIFLRLRNVLGQRTGSERPPFDRAAARDMIPGK
QDNNVVSMPGTVIDQAPMAPNADVVPPSDRWKGIAEPDSELEHGLNAIAQ
NDSSFDAQHFITGAKSAYEMIVMAFANGDRRSLRDLLSSEVYESFDAAIK
DREKNDLKVETRFVSIEKAELISAELRDRTAMLTLKFVSQMISATRDKTG
AVVDGSPDKVVDITDVWTFARDTSSRDPNWKLVGTGSGT
>RPA2576 conserved hypothetical protein
MTTEHETHAAELGAPEPECARCGKVLPLCICDTVEPIASRTQLLILQHPQ
EQDRALGTARLTAQHFKNAEVRIGLSWPSLSKALGRTVHDPSRWAVLYLG
SARAAELAEGREILAVDAKGQPEPHQDMILDEIEGVVLLDGTWSQAKALW
WRNAWMLKCQRVILDPSAPSRYGRLRKEPRRDGLSTLEAAAMLLSRLEHR
PEIETKLLEAFDRMLARFKQVQAERPDLAPKPKKRDWRRKKRG
>RPA4515 Uncharacterized protein family UPF0065:Tat pathway signal
MIEPLYPRSRISRRSILRGAAAIATLPLLPRLARAADWPTRQVTLVVPFT
SGGTTDMLARLIAARLSEHYGQSFVIDNRSGESGNIAASYVAKVPADGYT
FIIGTPGIHATNRLVYRTMGYDPATDFTPVIVIARVPNLLSVTKSLPVTS
VADLISYARQRPRELFYGVSALGSTGHLSTELFKTMTGVEITAVPYKGSA
PMLRDLAEGRVHLTIDNLPASKPLLEAGEIRPLAVTTAKRWPPLSHLPTI
AEAGVPGYETASWFTVGAPRGTPTEIVTSLNTTVAAFLGSDSGTVKLREI
GAEPGGGSPQDMQRHVEAEIARWEKVAKTAGIAPL
>RPA1907 putative protein
MDNVSNVRTGGCHCGAVRFEVTLSDGFDSIRRCTCSYCRMRGAVVAMAEM
GGIKILQGEEVLTTYRFHTRAAQHFFCSRCGIYTHHQRRSNQNLYAVNVA
CLDGVSPFDFTEVPVMDGINHTNDTGQPTRRAGTLRFIASE
>RPA0625 conserved hypothetical protein
MDWTTLFFSFRGRINRAKYWLVGLIYVAAWMVFTVLAITWLGGVDPDQLF
RFAGGAILIWLIAIALGLAGTWSGLATGVKRLHDRDKSGWWILLFWFGPS
LLGGSNQTMEDSPVSLVLALIGLGIAIWGFVELGCLRGTPGPNQYGPDPL
EPQGTLT
>RPA0423 conserved hypothetical protein
MTMKYLMAVTAFAGAMICAATFAHAGKESPLSASGLPVPRYVSLKSDHVN
VRVGPTKDNDVAWVYTRAGLPVEVTAEFENWRRVRDSEGAEGWVYHSLLS
GRRTAVVTMKDKDGLAPLYESASSGSAVVARLQAGVVAQVKRCDMKWCRI
VGSGFDGWIEKLQLWGVYADEQVN
>RPA2570 conserved hypothetical protein
MALLAANTSPSLGAPPAAAASLYVGEVMHARLKPVGHRFQYRVMSLLIDL
DRLDEADRMSPLFGVNRRALYSFHEADHGPRDASSLRAYAQASAEAKGVD
LTGGRVLLLTYPRIAGYTFNPLSVYFCYDASGALAVVIYEVRNTFGDIHP
YVLPVHAGEMGPAGLRQEQDKLFYVSPFIEMAMRYHFRIVPPGEIVRLRI
LETDVDGPVLAATFAGTHRVLSTASLLQAFLALPLMTLKVIAAIHWEALR
LWIKGAKLVPRPAPPSPPTGFAAGGHDAYTH
>RPA1927 hypothetical protein
MVAQARRQGCGERLTTATAVATPATAEQGETTMETVEEFLAHSIKLEQEA
ALRFGQLADAMDSCGNKEVSKLFRQLADYSRMHQADAQARAGFRDIPQME
PGDFKWPGLESPEAAAIWGTDPFIGRDLALQIALEAETGAFDWYKNVLDT
TDNPEIKMLAKEFVEEESGHVAELHRWIALHKAGKPLPMEIVPF
>RPA1280 conserved hypothetical protein
MSISGKVDPVVRPITIGDIAEALGQGLRDFQAAPLYGLAFGAFYAAGGLL
ILACLTAFHMVYLAYPLGAGFALIGPFVALGLYEVSRQREAGKRPSLLQI
VGLMRSRSELGWMAFVTLFLFVIWMYQVRLLIALFLGVGASFGSLQEFIS
AVLTTNEGLVFLAVGNCVGACLALVLFSLTAVSFPLLLDRDVDFVTAMVT
SVRAVVKSPLPMIGWAATIVVLLAISALPYFLGLVVTLPVLGHATWHLYR
KIVAPVAAELPDTADTEASNNVVAMPKRAATG
>RPA0706 possible endonuclease III
MTSASLSVCPAGPGAYVVAIAIAGPLTVRLGGALSARLRPGRYLYCGSAY
GPGGLQARLARHFRKDKSIRWHVDQLTTAGEVKGAWAIALGNECALVDRL
GFLPVPIEGFGATDCAHCRSHLLRWPSRVPARAIRAALAAGGEAPVWLRV
SATPRSRA
>RPA1049 conserved hypothetical protein
MGPDATDAAGSESPVPFPRRAGLFSVGPPSPSDCFSRIAPPARVPGPARL
SRRRRGLLPCRPPAQMRPSPMPMIATPSVHSLALLLLLSFFLGFAFEDFF
EQRKSARPGGVRTFPLLSLGGGVLYWLDPTHLVAFTGGLLVLGVWLSIFY
TVHLRERDDQGERNAGLVVLLLNVHAYLLGAIALALPHWVAVGVTVTAVM
LLTGRDWLHRLVRKVDTREITTAAQFLILSGVVLPLLPAEPVTPLTSITP
RQVWLALTLVSALSYASYLAQRYWQRAAQGLWMAGLGGLYSSTATTVVLA
RQAAVSESFRRQATAGITLATGIMYLRILVVIAVFNLTLARALAAPMLGL
AAVALAIAALQYLIIKAPSGEATQPSERGNPLELGAAAVFAALFVAISLA
STWVKTEFGTQGIYWLAAIVGFSDIDPFVLNLAQGGTAGIGEHAVAIAVL
IAASSNNVLKAAYAVAFGGRRTWPSAVVLVGLAIAGVVLAAWLAKIAA
>RPA2141 conserved hypothetical protein
MSDKAVITCSLNGVLTDPKQHHVPVTPEQMAREAKAAYDAGAAIVHIHLR
DQRPDKGHLPSWDVNVSREIQQAIRDACPGIIINHTSGTSGPNYQGPLAC
IRETRPEIAACNAGSLNYLKVKSDNSWAWPPMMFDNSVEKIADFLGAMKD
AGTIPEFECFDVGIVRCVGMYVQTGMYSGPLEYNFVMGVASGMPADPELL
PILLKLKRPEAHWQVTAIGRAEIWPLHQACADLGGHLRTGLEDTFYLANG
EKVTSNGQLIEEIAGCARRAGREIASPEEARKIFGVLH
>RPA3741 putative oxidoreductase
MSKTVHVIGAGISGLAAAIRLARAGLTVHVHEAMQQAGGRCRSYFDAQTG
LVIDNGNHLLLSGNHAACEYARTIGTEAGLVGPERAEFDFIDLPANARWR
LKLGGGKLPLWLFDANSRVPDTSIGDYLGLMPLLWAPTTKLIGDTINCSG
PLYDRLVAPLLLAALNVDPPEGSAGLAGAVVRETLLAGGKACRPLIARDG
LSAVLVEPAVAQLAARGPGVQFGHELRALTPAGDRVGALQFGGEDVVTLG
PDDAVVLAVPPRPAASLLPGLKTPQEYRAIVNAHFNYAPPPGMPALTGVI
GGVVEWLFAFPNRLSVTISNGDRLVDAPREQLAAEIWGEICKIAGISANL
PPWQIVRERRATFAATPAQNALRPGPVTQWRNLYLAGDWTDTGLPATIEG
SVRSGNRAADLVLAAGRA
>RPA1203 conserved hypothetical protein
MILVNDDYTPREFVVSVLKGEFRMTEDQATKVMLTAHQRGVCVVGVFTKD
VAETKATRATDAGRAKGYPLLFTTEPEE
>RPA2058 conserved hypothetical protein
MTDQITTVVDVRLFPRPLRHPIVFSLFDQLTPERGLELVSDHDPRPLRYL
FDVKRPGSFCWNYLENGPATWRVSIRKALHVPHREAALG
>RPA4073 DoxD-like family
MNQSFARWQPFALSLLRFITGLLLLQYGVAKLFKYPPVPTFAKVELFSLY
GAAGSLELILGALLMLGLFTRPVAFILSGEMAFAYFLGHVFKGATPVWLP
LLNGGTLAIAMCFTCLYLATSGGGPISLDRVLRRDR
>RPA2177 DUF404
MAVAFDEMNGPGGDVRAAYQELSRWLGETPPEALTHRRQEAELLFRRIGI
TFAVYGDAEAQERLIPFDVIPRILSAQEWARMELGLKQRVRALNMFLRDI
YHGRDILRAGVIPDDLIFQNPVFRPEMNGQAVPHDVYVHIAGIDIVRVDD
EHFYVLEDNARTPSGVSYMLENREIMMRLFPDLFARHRIAPVERYPDELL
TALRSVAPTHSSSEPTVALMTPGVFNSAYYEHSFLADKLGIELVEGRDLV
VKNDEVFMRTTEGLKRVDVIYRRVDDDFIDPLTFRPDSVLGVPGLMSAYR
AGNVTLANAVGTGIADDKAVYSYMPDIVKFYLDEEPILKNVPTWRCREPS
DLAYVLDHLSELVVKEVHGSGGYGMLIGPTATKATIEAFREKLKREPEGF
IAQPTLALSTCPTCTEAGLAPRHVDLRPFVLTGRDRITVVPGGLTRVALQ
EGSLVVNSSQGGGTKDTWVLDE
>RPA0166 Iojap-related protein
MKAEETVTIDLRGKSAMFDYVVVTTGRANRHVGAIAENVVKALKQAGIAA
PHVEGLPNCDWVLIDSGDVVLHVFRPEVREFYNLERLWTQGPSPAKTL
>RPA2352 conserved hypothetical protein
MATLLFDDGREATAAEIEAIARAHRIVVEHRPVPAALAETLARPLLDDTS
AATVLDALPPRPEFPSRDLIVLHPDRPDNEQLATKFENWHRHAGDEIRHI
LDGAGIFGVIVDGQRADLHVGPGDFIVVPAGLEHNFRLTAARRIKAVRYL
SDAEGWSAEFTGRAA
>RPA0415 Family of unknown function YGGT
MAPCAGGTCRTASERRHLQCRSRPSQGRQGGSKSAGRGFIAAEFPCLDGG
EPISTARAFRDRNREFSPAMRAILDIVLIILDLYIWLLIASAILSWLIAF
NVVNTRNQFVGAVSEFLYRITEPLLAPIRNLLPSLGGLDISPIILILLIM
FLQRVITYYIYPAVF
>RPA1583 conserved hypothetical protein
MTERVPPRRPATFKLSDPSVVLIDSDDGGGSYTAKPSAKADARPAASAAG
AAPPPPPPRARVELAREAEPPISAPKAPKSVINPKKGFRWGTVFWSAATG
LVSLAFWLWISKLVEDLFAQSQTLGTIGMVLALLAGGSLAIIIGREAFGL
IRLARIEQLHARAARVLETDNSAEARAIIRELLKFEHPNPQLAHGRATLQ
KHIDDIIDGADLIRLAERELMAQLDLEAKVLISKAAQRVSLVTAISPKAL
IDVLFVAIAATRLIGQLARLYGGRPGALGMFKLMRQTVSHLAITGGIALS
DSVMQSVLGHGLASRLSAKLGEGVVNGMLTARLGLAAMDLTRPLPFDALP
RPQLGDLVKDLMKKREKDE
>RPA1875 possible uncharacterized iron-regulated membrane protein
MTGTRALRIWSKVHTWTSLISMLFLLMLCLTGLPLIFHEEIEELTEQQFA
APHLPGNVPAAPVDDVLLAAQQARPGDHVLFVTWRDEQPDTVVVTMSPTP
KPMRGKFYRLVMDGRTKAVLGEERPQQGVMDIILLLHKNMMLELPGELFL
GAMGLLLVVSIVSGVVVYAPFMRRLEFGTVRQRSRRLKWLDLHNLFGIVT
TVWLFVVGATGAINTLAMPMYDYWRGQVLPPLLAPYRGAPVAQPSSLDQA
IARVRAALPEGRLSSITMPTAENFGSPRHLVVWMKGNSALTALIATPTLV
DVDQTVPVLVPQMPWYLTLLQMSRPLHFGDYGGLPLKIVWGLLDLVCIVV
LITGLYLWIAKQRFGGSVRSRTAAPSDARPA
>RPA0420 conserved hypothetical protein
MELPGTGAGLRAFHFHAIERRAVAPIERRETMLDKLRQFITDVVAPSAPD
ELTLDDGGYRLAATALLIHVISIDGDPSEVEKQKLHSLLEQRFGLDAATA
TRLIASATLVEGEAVDLYHFTSVIMRSVNEQGRLRIVEMMWELVYADGTV
TEFEENVVWRAADLLAVSTRDRVTLRQKVAAAAPVAPGAAQGGTTMGSVA
EVDPAN
>RPA3651 Ku domain
MPRAYWKGTLKLSLVTCPVALYPATTSVEKTRFHMINAETGNRLKQQMID
EDTGDVVEKDQKARGYEVRKGKYIEIEKDELEAVQIESTHTIDIDSFVPA
DEIDQRFLNHPYYIVPDGKAGTDAFAVIRDAMKDKGHVALGRIVLTNREH
VLAIEPFGKGLLATTLRYPYELRDADEYFDGIKSPKISKDMVELAGHILE
TKAAHFEPKKFKDEYETALKALVKRKAAGKSIKLPEPEEKESNVVSLMDA
LKQSLEGGKSGSKTGKAHGRRKASRSSARTKARRSTARQRKAG
>RPA0616 Uncharacterized BCR
MADFLGMMKQAAQLQSKMKAMQAELDQIEVEGLSGGGLVKVRMSAKMEVR
GISIDPSLIKADEREVLEDLLAAAHGDAHRKAEAAMQEKMQALTGGLGLP
PGLGLG
>RPA4190 conserved unknown protein
MGAMPLDGRRRGEECPAQMRRARRPPRRFSQSLICQECTMATSAAAVQED
PATNFAKDQLRAIIERIERLEEEKKTISDDIRDVYAEAKGNGFDVKALRT
IVRMRKQDANERAEQETILETYMQALGML
>RPA0305 possible DNA mismatch repair protein (MutS)
MKRIIPPLDPPMSRRKRSLTEDERALWDGVARQIKPLRGRPRLIKVEIAD
PQALVPAIKPAPVAQPQPLKRTAKPSLPTAPPPLATLGRRERAHLSRGRK
DIDARIDLHGMTQARAHHALLYFLQRASHDGLGFVLVITGKGKSGDTERG
VLRRQVPQWLALPEFRAYVVGFDEAHIGHGGEGALYVRVRRARG
>RPA4818 conserved hypothetical protein
MGRLFIFIFAALIAVAPLPSARLLAHEHNHGAHDHRQDHDEARLAVERGE
IRPLAELLGRVRDKLPGEITGVEIERHHDQWLYEFIVVDKAGRLYEVYVD
ALSGEIRRTKEK
>RPA3394 DUF37
MSAGRGSPQRNRANHRPPKRHLLMQLPSRGTDWIAQVLRLPRNAGRGLIW
LYRHTLSPLVGYNCRHYPTCSMYGDEAIRKFGLWAGGWMTLARLLRCQPW
GTSGIDLVPQTAPSRARWYLPWRYARWRGVNAPPPDVAEPCGCGSHSQLT
PH
>RPA4706 DedA family
MLRKIYDWCIDAAHKPYALWILGMVSFAESSFFPIPPDVMLIPMSLARPQ
RAWFYAALCTVTSVAGGVVGYAIGALLYDSVGHWLIQLYGYGDKVEAFRA
GYAQWGAWIILLKGLTPIPYKLVTITSGFAGYDIWLFILFSIIARGGRFF
IVAVVLNRYGVVIREQIEKRLGLWVTIGAVVLVLGFVIAFKLV
>RPA2910 conserved hypothetical protein
MTDPLLQIGAPVGIIAGGGTLPFAVADSLAARGLTPVLFALKGSCDPERV
TAYRHHWLRMGAFGRLLRLLRDEGCRDLVFIGSLVRPALSDMRLDWGAIK
VLPAVLAAYRGGDDHLLTGVGRLFERHGFRLLGLKDVATDLLMPAGCLTR
AAPDAGVEADIAKGRAVLAALSPFDIGQGCVVIDGHVVAVEDTGGTDELL
RRVAQLRDSRRIRAKPGHGVLVKAPKTGQDLRFDLPALGPKTIEGLITAQ
LGGVAVVAGHTVVAEPQEMIAAADKAGVFAIGMPA
>RPA1017 Nitrogen fixation-related protein
MLIAVASQNFRTVTGHAGKTRRFLVFDAAPCRPPQEVDRLDLPKEMSIHE
FKEDGAHPLDKVSVVIAGSAGPGFLARMAARGVIAITTSETDPVTAIKNY
LAGSLAPAAPHDHDDEEEEGGCNCNCGRAA
>RPA2346 hypothetical protein
MTIISKPPSDPMQPTGSRFRAGITRRRLVGTALIGAATLLSAAPARATEA
YRDYRRTIFVARRGVAALDAIDVDSDTVTGTLALGLEPRELQISQRGGRL
AAIDLRSPRLVSIDLAAQSRSDVPLPFVPSRLRISPDGQRLAAFDDARGT
IALVDTVDGRERSRIEGPREIREAIFSGDSSALLVAAASVGGLSAYDIAT
GQPVPPVEGPTLHALLRAPNGREGFALTAETPRRVLHLDLRSRQVLASVP
ASDHPVLFATGTGIQLLAIDQHAGTLSILPSEPLQPGGVTLPAAASTAYA
AWFDTVAFVPAPATRKLLIFDLERRKAAGSIALDGIPGTGVVTPDGDKLY
LPIEDQGTISVIDTHLRQRTASITIGAAPVQAIIAGGYGLCH
>RPA1406 conserved hypothetical protein
MTAIPASARIVLDTEIPARVPWSAIIRKGQTLRIVDSHGQQAVDTLFYAA
DDHGERYSGQDTLRAQGSAYVTTGTRIMSTEGRTMLRMVADSCGLHDTSA
GACSCESNTVRFGHQTKYLHACRENFVLEAAKHGLSKRDIVPNLNFFMNV
PLDPDGNFTVVDGVSKPGDYVEMVAEMDVLCLISNCPQVNNPCNGFFPTP
IQVVIYEAGED
>RPA2846 conserved hypothetical protein
MAVGQAVAEPFGGLALVHLHRDGFVEGGGITALSGAGRNDDAGFSTLGSR
LTTSFTLSPVWWRCPGWWHPGSTHSVPRRRSRIWRSAAPASGSVAIRCAA
S
>RPA0826 DUF433
MDYREIITIEPGKRGGRPCIRHMRIAVADVLGWLADGQSPEEIISDFPEL
TDQDIRAALAYAADRERRLITAS
>RPA0509 conserved hypothetical protein
MGCGFAGRRGSFVVKKSPLVRALMLTAALATMPLLAGCNSDQMALATNAK
ANQPVSPKLVAAMQEKNMDLQSPILVRLFKQEAELEVWKQDRTGVFQLLK
SYPICRWSGDLGPKIREGDRQAPEGFYSISPGQMNPQSSYYLSFNTGYPN
AFDKSLGRTGSQLMVHGDCSSRGCYAMTDEQIAEIYALGRESFFGGQRAF
QLQAYPFKMTPVNFAKHRNNPNMPFWKMLKEGYDHFEVTRQEPKVDFCEH
KYVFDAAPPPGSSRPLQFNASAKCPTYEVRPDIAEAVREKDKQDNIRIAE
LVAKGTPVARLNTGIDGGMNHVFASKIPDASTGLSEVPEDKSLAVASFDR
APGTIPPTVNPPRPSDNLLVAAPAESSASVAKARVASASKDDIGGLLAGS
RKPGNESTASARPAPSAPVSTAARHEPAPRPQGGAPKAAETKQAAASRPP
LKPSVDAETTSSAPARPSGMISGGAPVVSSNSFDSRFSAFR
>RPA2558 conserved hypothetical protein
MLIQYGQELGDYLYDVFVAKFDFWLAFGLIAQLLFTARFLVQWISSERAG
KSVVPMAFWFFSMGGGLMTLVYGIAKREPVIILGQAMATIIYVRNIMLII
KSHARGSESLPN
>RPA0796 conserved unknown protein
MQSRPATKPQALAKLQDGRYGRPNRSGIVRATEGVRVRYAGFGFALPTCA
TYCLSGCFGCPGVAGPIAARPNNQPQRREFAPLRTIYKICDAAAWRGAEQ
AGRYGGSADDARDGFIHFSTAPQLAGTLAKHYAGQTGLKLIAVDVEALGD
RLRWEPSRGGELFPHLYGELALSAVTGVQDLVARPDGSHVVPELAS
>RPA3943 conserved hypothetical protein
MGLFTKDIKTMDDLFVHQLQDIYYAEKQLLKAIPKMADKASDPMLKQGFL
THLDETKGHVKRLEQVFEMHGVQPKAIDCPAIDGIIEEADETAGEVEDKK
VLDAALINAAQAVEHYEIVRYGSLVSWAKLLGRNDCAAVLQKTLDEEKAT
DKKLNTLAESQVNLRAAG
>RPA4217 conserved unknown protein
MDWNRVEGNWKQFKGNVKEKWGKLTDDDLDVIEGRRDQLEGKLQERYGYA
KDQVRKDVDDWFTTLK
>RPA1865 conserved hypothetical protein
MIRFAIAIIAGLVLGGIVHLVTVLVLPRIAGQDAYARLAPLTQVNAVTQL
PEITPDNAPLPLLDPAFAMAVCRYDLSKGPIKLAVPVSQAYTAVSFYTRN
EVAYYAINDRSAGRKVIELDLMTAEQHAELPEDEDVTSADRLIIDSPTAT
GLILMKALASEPGLMPQARAALKAATCKSQSDAAAKSS
>RPA1213 conserved unknown protein
MTRIAVGGFLHETNTFAPTKATWEAFVHGGGWPSMTIGANVLKVMRGINV
GLAGFVEDAEAKGWKLIPTIACGASPSAHVTEYAFERVVKIMIDGIKAAG
PLDAVYLDLHGAMVTEHLDDGEGEILARVREVIGPDLPLVVSIDLHANVT
PRMVEHADALIAYRTYPHVDMADTGRAAAKHLELILTRGQKFAKAFRQLP
FLIPISWQCTFDQPTKGIYDKLAALESEAVPTLSFAPGFPAADFPDCAPS
VFAYGRTQADADAAADSVASLVASLEDDFDGKIYSPDEGVKHAIELARTA
AKPVIIADTQDNPGAGGDSDTTGMLRALVRNNAQRAAIGVIYDPDAAKAA
HAAGVGAVLSLALGGKSGIPGDAPFEEIFVVEQLSDGRFVAPGPYFGGRE
MEMGPSACLRIGGVRVVVASHKAQLADQAMYRYVGIEPTAQAILVVKSSV
HFRADFQPIAERLLICAAPGAMPADTAALPWTRLKPGIRVRPNGQPFVTP
N
>RPA3131 conserved hypothetical protein
MLSTIMRLIARNVLVEFWGKHPEAKPSVERWYALVKAANWTSTDDVQKAA
PKAKVLNSERVRFEIAGGNFRLVAAFDFRRQIAFVKFVGTHAEYDRIDAL
TVSHF
>RPA4284 YceI like family
MIQRRLLFPILAAALVAAPAVFAETAPKTAKAQAVKTEPVKVESGTYAID
PDHTQVGFVVSHMGFSEFRGRFGDVSGTLQLDGANPAKSSFDIKLPTASV
NTPVDKLTEELKAADWLDAKAHPEIVFKSQNVTVTGPRQAKVTGELTLHG
VTKKVTLDARLHGAGVNPLNKKVTVGFDLTGTIKRSEFGVTKYVPLISDE
VGLEINAAFEKQN
>RPA4405 conserved hypothetical protein
MMESFAWMAWASPTAAFFVALAGTLATMTWLAVVDPEVERVGILRIPTTR
GDRLFLSLILAAVIHLVWIAVVGTDPLLTLPIGEGVEVSSLWLATLLSLV
SAVAIFRTV
>RPA3141 possible VirR protein
MMDRGEIREGEVVVEAPPPTDAGLVFIGRIRTPWTSRGETPRQGSPDGPV
CRLEIFEPWTQALQGFDKFAHAQVVYWLHMSRRDLVIQRGHGIDGPRGTF
ALRSPLRPNPIGISSVAVVGIEGATVLVRGLDCLDGTPLIDLKPDRCVHS
MVK
>RPA1128 conserved hypothetical protein
MTDMLRSDVMIAFAIMTAVTVASRLGGYFMMRYVDVTPRVRRMLDALPGS
IIVAAALPVAVNGGPIVMLALAAAIAVSVLRRNDFLAVITGMLVAAAARA
LGLPG
>RPA0236 conserved unknown protein
MWINAKAPSLADLEQMAHEMFARLPGEFRSLCEDVIIRVDDFATEEVLDE
LGAETEFDVLGLFQGIGLPQRSSQDVAPMPNMIWLYRRPILDYWAEHDDS
LGEIVRHVLVHEIGHHFGLSDDDMEAIEAKADELAED
>RPA2820 DUF433
MGVNGAAGRSPGPKAVKPASVAIAVDYAMLTHEQAAVCAMRHERIDINPE
VMGGKPVVRGTRIPVEMILRKLGAGLSTAEIIADHPRLTADDILAVQTFA
ADYLADQDVIFG
>RPA0386 Ku domain
MAAPPLRSFAPKIVGPGNESESFDFESGSSSSGVCVMAPRANWKGFLRLS
LVTCPVALYPATSDSDKISFNQINRATGHRIKYLKVDAETGDEVPVDEIM
KGYKVDTDSYIEITKDELDNLALESTRTIEIDQFVPRAEIDDLYLVRPYY
LVPDGKVGHDAYAVIRETIRSMDKVALARVVLTNREHVIALEARDNGLVG
MLLRYPYEVRDAADYFDDIQDVKITKDMLDLAKHIVEQKSGHFQPDLFED
HYETALTELINTKMAGQPLKPQARPRGDNVVDLMDALRRSLGDAGKPVPA
TASKNSGADKPAAAKPAKVKKRKTAPGQKEMLLPISGAGASKAKDETAKT
KKPARSPATVKTRKAG
>RPA4191 conserved unknown protein
MDDSTLTELEAAAFRRLVAHLRERTDVQNIDLMNLAGFCRNCLSNWLKDA
ADAQGVAMSRDESREAVYGMPYETWKAKYQSTATPDQVEAFKKSHPHSH
>RPA2025 possible MgtC Mg2+ transport protein
MLPWWEILLRLGVAAIAGGLIGLNRDLKNKPIGMRTLPLVALTSALLVAH
ADHSATSDQLSDPTSRVIQGILTGIGFLGAGVIVRSGNRLEIHGLTTAAC
TWLAAGIGIVCGAGQWLIVGVALMITFAVLIGGHPAERLLHRLLRGRQGR
TRP
>RPA4508 conserved hypothetical protein
MSLCVASAGVVKTLAVVAFTLAWTHSIEKTEWQEDWRVTPQGLELIAARV
KGSGAGMEPAPDARLVDGWFQWPVTRPPQREVLLGNSGLAGEWRLCAHER
CWTLTQILGRAVGVHPTTMYSCN
>RPA1330 conserved hypothetical protein
MLPEPIPAQQAAHVALRLGRLMLANGADTSHVVSAVTSFAERLGCRVRLF
VGLEGLILTLEDDEGFSTRLGPAIAGTAVNMGALGALGELRSRDIADGDD
LAAIDRALDDIEHRGSDYPRWLVALGMGVTAASLARLFGAAWPVVGVAVL
VGIVTQGLRQSLAAAATNPVAGAGLAAFGGALVGLLAMRAFPGLSPTLCL
VAAGMILVPGVPLLNGVRDTLGSHVGTGLGRLMLAAVTVLAITLGLFLSA
SLMDDALPVNGAPPLLATSEDLLFSALAGAGYALLFNVPARAAWVCVICG
MAGHGLRTGLEHLGVELSLACLIGAFVAALIGRIFAGRFRVPAVTFAFPG
IVAMIPGSYAFRAGIGGLAIMKAGAAASPILIAETLGLIVTVAVVTAALG
VGLCLALAAPLPAALGGPSNQEGAPR
>RPA1281 conserved hypothetical protein
MSKPEFAYVTYIRTTPDRLWHALTDADFTRRYWMDCTLRSDWTVGSEMTM
ERGGEIKNRCTIVESDPPHRLAYDWVSVWDPAMRQEKPSRVTYEIEPQGD
LVKLTVTHQNFTQGSTTLPSISFGWPMVLASLKSILETGQPLPFAPQSHA
AAESAHV
>RPA4055 hypothetical protein
MTFALPSSWNDARELFRRHETKIRFLLAGGLNTAFGLTSYPVFFFLLRPV
QLHYMVVLAITQVTCIAFSYLTNKFLVFRTKGDYIRETSRFILFHASYFA
LNLAALPVLVEIVGLPPVWAQTIFAFAVIVTSYFWHSRITFAAKRSDDND
K
>RPA3659 DUF176
MIYVVATLTVKPEMRAELIAAAKACIAETRKEPGNIAYDLHESVTDPGKM
VFVEQWENAEALEPHRKAEHMKTFGRVAVTCFAAPPKIEIITPANVVTR
>RPA2401 conserved unknown protein
MRFLDEHDNDDPLLSVVNLTDVFLVVIAVLLIIVVQNPLRLTTAKSAVLI
ENPGQPDMKMTIKDGQELKKYQASSEIGAGQGTRAGITYRLNDGRMIYVP
EKEGEPVLQRN
>RPA1387 conserved unknown protein
MTFTDTNVTKGPAALSAHSDPVATAGLRPGVIVHDPAETVDELLAAFAIG
LRDRGFKVEGYVQKQLPPDDDASVGRIEFLDLSSGFLRTGDQRAGIAYLQ
TALTGQADLLVIGRFAACLDATDSLRTTKIQPRPGHSLPMLTAIAGHSIH
QWHSYARHEGAMLAPDLTSLWQWWGPEQLYRDLALGVADDEVRQIACGQR
WIMVEGPHGAGLAYLPRHPRALQPRLASLGKQSLRQLAALAMSWDPLETA
LGIAAINAHYNRYDLAAAAGNGVKTFRHVAGRVGVIGAFPGVDGILPNCA
VMEADPRPGEYPLAAMDSILPACDATIVNASSLINRTLPRVLRLSQHRPV
ALIGPATPMTPRLFDYGLSVLGGLVVSDPKGLATAIRAGALPREFSRFGR
FAHLMRGASPDAARRSDTTSTTASLRRSHA
>RPA0277 conserved hypothetical protein
MSEDIKNRHSSVPWKQIAGSGNVYRHDYEDVAAQMIWETVQRALPALKAM
VAEELARCDEQRPQ
>RPA2548 conserved hypothetical protein
MTMIEPDPADFEPIHPYSLARISHVALQAGAVLAQSGASVRVVHEGARMV
AEGLGVEVLGMRSGYASFEITLARGHHSFTRMTQIGPHGVNHRLDFAVRD
LLKRAARGDMTPDAIEAELKRLQSETPRHPPWLVAIATGAACAAFGRLLG
SDWLSFGPVLAAASIGQGVRHLLLGRRFNVFVVAAIVGCISAALGGLGAR
LIGSSTTELAMMASILLLVPGVPSTNAQTDVMDGYPTMGSARAVTVIMIM
VFAVTGLWLAEFVLRIHT
>RPA0838 possible surfeit 1
MNSAAARRRSIAGMSLIALTMVAVLLALGIWQLQRRDDKHRLIAALSDRL
AAEPVALPAARDWSALDPVPDEFRRVRFTATYLKLPDVMVYSSGSAVRDD
VTGPGTWAFLPAQLPDGRIVVINAGFVPNTMQERGAEDRAVQPLLSGEAA
TVTGYLRFPEQPGLFAPAPNLDKRLWFTRDVPAMAAALGWDKPAEIAPFY
IDLEAPMPAGGVPKPGPLGVHLRDNHLQYAVTWFGLAAAVLITFVFWLRG
QAWQAAAPSAKASRRIA
>RPA1690 conserved unknown protein
MAVRSGFVVAALVALALSPTGAIAQSAGPQGTWLTQAGDAKVRIKSCGNA
LCGSIVWLKTPIDPNTGKPQVDDKNADPQLASRPIIGLQIFGDMRPVSQG
KWSGHIYNADDGKTYESSISHTGPTSLRVEGCVGTLCGGETWTLTR
>RPA0187 conserved hypothetical protein
MPAACSCAPPLTLGETMPPMPTEITVLGWSVVLLIVQMFLASIPTTMELG
GDYQAGPRDELRTPTGRFAGRANRAFRNLLETYPAFVGLALALAITGKAG
GYAATASVVWLIARTLYAPLYIVEIPFARSIVWFVSLLALIAMLVRLLS
>RPA2170 hypothetical protein
MANDNAAAPHSAAGERVGVRAQTAPGRWDGVAVMPYKQTAEAPFQDVSRQ
LLFADPNLACEWRYFEVDEGGYSTLERHAHVHAVMIHRGHGRCLVGETIS
DVAQGDLVFIPPMTWHQFRANRGDCLGFLCVVNAARDRPQLPTADDLAEL
RKDERIADFIRTGGE
>RPA3611 ErfK/YbiS/YcfS/YnhG
MSPDDPRYGRPTGATSYSNAAPGAAPQGPVMSPDDPRYGRPMGPPSVIYS
DRGDGGVRPPAGVGGGAPSAQPGADGRPMQVSALPPEEQPDADQTVQLPP
NLRRQEVDFATKEPAGTIVVDTANTHLYYVLGNGRAIRYGVRVGRDGFTW
NGVQKISRKAEWPDWHPPAEMIERQPYLPRFMAGGPGNPMGARAMYLGST
VYRIHGTNQPSTIGKFVSSGCIGMLNDDVSDLFERAKVGTRVVVLPGSPP
KNTATAQAQPAMAPGGQSPALPPAPLPGAQPTSVAPLPAPVTIR
>RPA4420 conserved hypothetical protein
MAGENDLNDRVEALEVRLAYQDETIEALNQTVTAQWKQIDALTRQLAALS
ERLDQAESSAGAPANERPPHY
>RPA4569 DUF344
MKIKTKQFRVGEGEKVDLGKWPTKVDPFYESKEHYHELLRTQVERLSDLQ
QLLYASNRHAVLLIFQAMDAAGKDGVIRHVLSGINPQGCQVFSFKHPSAT
ELQHDFLWRTTRDLPERGRIGVFNRSYYEEVLIVRVHPDILQSEAVPNGE
NFGKSFWHKRYRSIRNLEQHLHANGTRIVKFFLHLSKDEQRKRFLARIDE
PEKNWKFSAADLEERQYWDDYMDAYEKCLSETSSEDSPWYAVPADDKENA
RLIVSQVIAETMESLKMSYPETTPARRKELLQMRQQLLK
>RPA0298 DUF299
MLTDGSYFHLHLVSDSTGETLITVSRAVTAQYANVTPVEHVYPLVRSQKQ
LDRVLQEIEEAPGIVLFTLLETELVNRLEAKCQEINSPSLSIIGPVMQLF
EAYLGASTMGRVGAQHTLNAEYFQRIDALNYSMMHDDGQHVEGLEEADVV
LVGVSRTSKTPTSIYLANRGIRTANVPLVAGIPIPHQLETLKKPLVVSLH
ASPERLIQVRQNRLLSLGAGSGNDSYIDRQAVTDEVLLARKLSAKYGWSL
LDVTRRSIEETAAAIMKLLADRQRQRVPE
>RPA2191 DUF188
MTDALTRIYVDADACPVKDEVYKVAERHHLPVTLVAGGFIRVPQHPLIER
VAAGSGMDAADDWIAERIKPGDIVVTADIPLASRCVKAGATAIAPNGKPF
TEESIGMTLAVRNLMTDLRSTGEITGGPRAFSPRDRSTFLSALDSAIRRI
ARRRAAPT
>RPA3479 2OG-Fe(II) oxygenase superfamily:Prolyl 4-hydroxylase, alpha subunit
MLVCIPEVLPKSEVAEFRRLMDAADWEDGRSTAGAQSAMVKRNEQLPPDS
DLARALGRRIVSALTGNPKFVSAAVPLQIFPPLFNRYAASGGHHFGIHVD
NAVRGDHLTGLRIRTDLSVTLFLAEPDEYDGGELVIEDTYGSHEVKLAAG
DAVLYPSTSLHMVTPVTRGARVASFFWLQSMIRDAQARSMIYDLDNAIQA
LVERLGRDDPETVKLTGIYHNLIRYWAEV
>RPA1868 conserved hypothetical protein
MFRTQPCRLSACAAGQTPPRTAASPPQSSIQGMTARPKKPSAQDGFFWKT
KTLEQLSPTEWESLCDGCARCCLEKLEDEDTGRIYFTHVGCRLLDGDACA
CRDYPNRSDRVPDCVRLTPKEVRELTWLPPSCAYKLVAEGRDLYWWHPLI
SGDPNTVHEAGVSVRGRVERTETEIPVEELEDHIVSWPALLPKRAKLKQR
PR
>RPA0439 DUF150
MTEPTPAVPETDDLAEPRLVIEPGVAARFCAVAEPVLMAMGYRLVRIKVS
GDAGCTVQIMAERPDGTMLIDDCEAVSRALSPVLDVADPIDRAYRLEISS
PGIDRPLVRRSDFARYAGHLVKIEMAVPHQGRKRYRGLLDGVEGDAVRVQ
REGATKDEDPLVLLPMHDIGDARLVLTDELIAESMRRGKQAERDLKQSLG
LAPEPPPHAKRSDPKKSHAPKRGPKAAKAPTKPKPQNTKQHRLAADKSRR
GEIDTSEGD
>RPA2527 possible DNA-dependent RNA polymerase
MTPEALSFWTQADKPFLFLAACIELTNALAYGPGYVCGLPCSWDGSCSGA
QHLCAATRSEDAWKVNLIGNDEVHDLYTIVGNAAKQSMARDTDPMVSTIL
AYEGDWRKVFKRNTMTTFYGSKGFGMAQQHMDDLMEPLSREVMQKKRAKH
PFGQTRKEHHAASRLLAAHAAAAIKGEIPLAIKAMEFIQKLSAMMAHEGK
PLEWTTPIGLPWSNRYHDPVTERVNLWMHDHGVRRPFRPVVADGYKKTIN
KNHSVNGSAPNFVHACDAAHLLRTVNAARAEGITQFALVHDSFGSLPSHA
ARFRQIIREEFVRMYEEHDVLTEVLEQAKCDLDEHNDKLSALVDMRADLT
GKLNIKEVLNAEYAFAYPKLSEPDTKGEYADGKYKTEGTATEDYTERFQE
EIQKVADDNFPGKKNVHLPWKETKEGDIAFIFKNPKKKPILTDSRGKPLK
EGTVIRGGSKIRIAGVIAAWEKGAKRGVSLWPDAVRVIKLSEGFDANAAF
GAAEDGFNADDYEPTNFGNDGEDADNDADDAEAGDDAFKL
>RPA4476 conserved unknown protein
MTAAAGAVSVRVKQTSGRSTMKSLFKAAAAVALLIAAAVPGFAADAKPHR
VAIQVDQNDPAVMNLALNNAANIIEYYKAKNEDVQVEVVTFGPGLHMLRT
DSSPVKDRIKQIADGAFPSSIKFAACENTKTGMEKHEGHAVSLIPQASAV
PSGAVRLMELQDDGWAYLRP
>RPA4208 SCP-like extracellular protein
MFRGVMAVLGLLVLAGCAGGDVAPSVDTPSMYQSMAREGAKVDVAAAAVM
ISQYRQNNGLGLVTVDPALTKLAEDQSSTMARRNKLDHDVKAPLAQRLNA
SGYPAAVAVENVSAGYHTLAEAFSGWRDSPPHRANMLKNGVDRIGIAASY
APGTKYKVFWTMILASTDTPR
>RPA2244 conserved hypothetical protein
MATIGTFKKSGNEFSGEIVTLSVQAKGVRIVPESNRSGDNAPSHRVYVGR
AEIGAAWSKTSNEGREYLGLKLDDPSFTAPIYANLFDDEEGDTFSLIWSR
AAKRNGD
>RPA2993 Domain of unknown function UPF0126
MDKLPIVLDLAGTFVFALSGAVAGARRRLDLFGVLVLAFAAGNAGGITRD
VLIGAVPPVAVADWRYLGVSLLAGLVTFYLVPVVVRMRSAILMLDAAGLA
LFAVTGASKALAHGLSPVTAVALGVVTGIGGGMARDLLLAEIPTVLRAEL
YAVAALIAAAIVVVGQMLQLPAAPVTAAALLACFALRVTAIRRGWRLPVA
KLDEAGDPVKPPPQDGG
>RPA4090 conserved hypothetical protein
MSQFAVIIFGDQAKANEASRVLTELQADNSLVLFSQAVIAKDLAGTVSLM
APASGGAVGSAVSGLASRLIGVPVGGHGDLSDAGVLDSFCQEAADALKPG
ATAIVAEISEDRLGPLNTQMHALGGQVLRSCNSDSEGDEIAKQTARQIAN
IERLNAEYVRAPQDADKPGETPVDAVTLAPDERASNQSIVKANAEARSAK
LSQACGLTKDALG
>RPA2930 conserved unknown protein
MRSRSVTAAAALTAAAFGAAPALAHVTLQPNEAVAGNYFQAALSVPHGCD
GTATVAVRVKIPDGVLSVKPQMKPGWSVEIKTRKIEGEQPKLHGKTVTET
VDEVTWRGGPLPDNLYDTFGLVMKLPDAAGKTLYFPVVQECEKGVSRWIE
IPAAGQKSDALHEPAPTLRLKPKAP
>RPA2732 conserved hypothetical protein
MSGVSGIKEHMEVIGADGTHVGTVDRVDGDRIKLTKKDSGEGHHKGHHHF
IDGGLVAEVEGDKVRLSANADVAVTMEEEK
>RPA3323 unknown protein
MLQRLLYFIAGVDRELIARCPPTDRIWAMQIGFSLCLSFLVVFGVSYYAL
GYIVGSTPLRASIAVVIALTVMMFDRALFQSDWFVQGAFWLDDDQSKKVG
EGLRRSVWQFVRITARLAISLGLAWVIALFLELALFSDAISQKIDRDRVA
GNRPIYEQIDKYASGLDREIAESRAQLTAIEQHQRELLAQAPLPSDTPTA
ADPEVDRELKSLAEREATVRGEVRELEATIRDQSLDMSAEQFGEKLRPTN
SGRAGAGRRYELAKKQKEAAETLLQSRQAELSDVVARRERVKNDAAARMA
ADLAQRDDQKQDFAAKRAAIQKQVDTARANLQLLESQREAKVQRYRERTL
EGSFVQQRKDSNDPLVRLTAYQALKSDPQEGQAITLFSWMIRLFVVFLEI
VPVVTKIFFSPPSMYAINVQSQIKSARVAARSMTWKLAEDEPERERAIPE
AVAPVEPAHADDRVTPIFPRFEPPEAARAPSNPTARRRWGLPLRKAKEAY
PAPTNDQAIIATIPVPAESSDKIEARLADEVSAEPVAPVHREPVVEEAVP
ASQPEEGERLVVVYNGGGMRDVGPGSHAPSETELVAPRRA
>RPA2487 possible DedA family. Putative integral membrane protein.
MEAYIHDIAEFVRQHEAWAAPVVLALAFGESLAFVSLLIPAWGALVAIGA
LIGVSGLSFWPVWIAGGVGAALGDWVSYWFGYRYKDNVAQIWPLSKYPGL
LRRGEDFVRKWGVPSIFIGRFFGPLRASVPLAAGIFEMPFWPFQIANVVS
ALVWSAVLLLFGDGLSIGLGWMLRSV
>RPA1848 Uncharacterized iron-regulated membrane protein DUF337
MSDLHTWTGLLLGWVLYAMFLTGTVSFFKEELSQWMRPELPRVTQALDPA
VVAERVADEIGRIAPNATQWSIKLPEGRSNSVYAFWRLPIAQDPRGFGEG
HFDAVTGRQVEARGTLGGDFFYRFHFQFYYMSPFWGRLLAGLAAMFMLIA
IVAGVITHKKIFTDFFTFRWGKGQRSWLDAHNALSVFGLPFHVMITYTGL
VTLMALYVPWGERAAIKTPAERQQLMAELSAFIQPGKPAAEAAPLGSIET
MVRQAQVRWGTPDVGRVNAADPGNAAARIAVTRGDAGRVSMSPDYLEFDG
VTGKLLTVHDHVGAAAETRGVLYALHIGRFSDLETRWLYFIVSFMGTAMV
GTGLVMWTVKRRQKLPDPERPYFGFRLVERLNIASIAGLSIAMSAFLWAN
RLLPTTMAERAFWEIHVFFIVWGLTLLHALLRPARAAWVEQLWTAAALLA
LIPVLNAMTTLRPLWHSFAVGDWVFVGTDLMCWTLALLHAVLAIRTARHG
ARVRPPRGSAKRHALPTMSSEAAT
>RPA4200 YbaK / prolyl-tRNA synthetases associated domain
MSLESVRAWFAQHAPDIAVEESTMSSATVPLAAEAYGVPPAQIAKTLSLR
VGERVVLIVTSGTMRLDNKKAKALLGGKPKMLGVHEVADLTGHEVGGVCP
FGLKAPLPIYCDVSLKAFDVVVPAAGSTHSAVRIAPQRMAELVGAEWVDV
CEDRSAPAQP
>RPA0850 Protein of unknown function UPF0047
MSSSRTLSRAPLARATATSVASSVLTVQTGGTGFTDITREIAAFVAEAQA
KDGGVTVFVRHTSASLTIQENADPTVLTDLSTVLNRLAPENAGWRHDTEG
PDDMPAHVKTMLTSVSLQIPVLQGRLALGTWQGVYLIEHRARPHRREVVL
QFIGTTG
>RPA2176 DUF403
MLSRTAENLFWLSRYVERAEYLARTIEATLRVTALPNTYSGQGNEWDSAL
LTAGVSASFYAQYDEANEPNVVEFLSMSTSNPSSIRNCIEAARLNARSVR
TALTGAMWDTINSAWIELQENWSKGAKTREQLTQFLRFVQETSLRFDGSA
YRTMLRNDAYWFSRLGVHLERADNTARILDVKYHLLLPEEEHVGGPLDYY
QWSSILRSVSALTAYHWVYRETLKPWLIADLLILNDSLPRSLASCYGNLV
RNLDQIGVDYGRQGPAQRHARAIRNRLEHSNMDDIFQRGVHEFVQEFIAD
NSRLGEIVAKQYLI
>RPA4359 DUF185
MIDQTALATEIKRLIKAAGPMPVWRYMELCLGHPEHGYYVTRDPLGREGD
FTTSPEISQMFGELLGLWSASVWKAADEPQTLRLIEIGPGRGTMMADALR
ALRVLPILYQSLSVHLVEINPVLRQKQQTLLAGIRNIHWHDSFEDVPEGP
AVILANEYFDVLPIHQAIKRETGWHERVIEIGASGELVFGVAADPIPGFE
ALLPPLARLSPPGAVFEWRPDTEILKIASRVRDQGGAALIIDYGHLRSDV
GDTFQAIASHSYADPLQHPGRADLTAHVDFDALGRAAESIGARAHGPVTQ
GAFLKRLGIETRALSLMAKATPQVSEDIAGALQRLTGEGRGAMGSMFKVI
GVSDPKIETLVALSDDTDREAERRQGTHG
>RPA2133 conserved hypothetical protein
MAMLSQRVHLWLSGNPEDAVLRQIFRLLIVGTIAALGYDLAGAHSASDDG
ATRETAPFSLPSLPAVLSPLLPDGDRQTPLPQPGGALGQPMTFELRSGGR
LQASGTITPGSAEAFAKELARHGEYIKTVVLNSPGGSVSDALAMGRLIRE
HKLATEVEAGKYCASSCPLMLFGGTERRVGAKAAVGVHQVFAVKSTEVAA
PRDQMSDAQRISARCQRYLRDMGIDLEVWIRAMETPKDRLFVFKPEELTR
YGIVTATPAPAKP
>RPA0581 hypothetical protein
MNAAATFLVALVAALHVFFLVLEMFLWTKPLGLKVFRNSPDKAAASAVLA
ANQGLYNGFLAAGLIWSLLHPNAAVALQLATFFLGCVIVAGLYGAWTVSR
RILYVQAAPAALALIAAWIA
>RPA1994 conserved unknown protein
MLTRRGFAAVAGCAICSIAGFGATEAFAQGAAATTANGVTRKILSRTDGP
AAGYETIIMDVTLDAGATVGRHTHPGIESTYVMEGELELPIQGQPTKTAK
AGDAFQVPPNTPHAGGKPSPSKTRLMVNYIVEKGKPLASPA
>RPA1285 conserved hypothetical protein
MMSDRLFVYGTLMRGYDHPMARLLSSNAEFLGAASIPGRLYLVRHYPGLV
PSDDPADRVHGEVFRLHRPDEVLAQLDDYEGCGPRDQRAEYRREIACVTL
EGGTELDAFVYYFNWDVARLPHIATGRFQPEAFSG
>RPA2223 conserved hypothetical protein
MTMLKLGALVDDRPVRLTIELPAAIHRDLAAYADVLARETGTKTEPTKLI
APMLARFMASDRAFAKARRAKAQPSSDGDGST
>RPA2067 hypothetical protein
MLDWPTVFRIIHVLGVVHWIGGLLFVTFVILPSLLDMEATRRLPLFDALE
RRFAAQARISTLLVGISGFYMLYAYDRWAQLADPQFWWLSAMLGLWAIFI
VMLFVAEPLFLHAWFDTRARRDPDRAFRVALRGHRVLSTISLITVLGAVG
GAHGLL
>RPA0403 conserved hypothetical protein
MTTDHSVGFEPADRLAHDPEQVRRRFWRKIKSVAVRLPFVEDILAAYYCA
FDRQTPRHVQIALLGAIAYFILPFDFLPDILPVLGFTDDAAILATALRMV
ASNITPEHREAARAAMQRGLEDEA
>RPA2517 conserved hypothetical protein
MSLPDDSLRPEDVERLQARVEAAIRDHWKAYLFEGILLLIIGFAAILLPL
LASITFTIVLGWLFFVAGVAGIGFSIWARKGPGFWWSLLSAVLALVAGLV
LIAMPLQGTLTLTFIVGVYFLAEGVATIMYAFQHRGRMSERWGWMLAAGI
ADILVAFVIISGWPGTVWAIGLLIGINLVFGGTSLIAMALAARKST
>RPA3895 conserved hypothetical protein
MNALRDIRILPVVLVAVFGLAVLKIAGLVLDGGYVFDYDPNATKPSWAQE
NLNFPGRKPIDPDITGSVHGEPKKKEEEKPAVAPPEQTKPAETEPETPAI
SASERAILERLQARRQELEARAREVEIRESLLKAAEKRIESKVQEMKETE
GQIGKATEEKSEAEAARFKGIVTMYESMKPKDAAKIFDRLEMPVLIEIAT
QIAPRKMSDILGLMSAEAAEKLTVEMARRATGKASLSASVAVLPKIEGRP
TPRQ
>RPA2884 conserved hypothetical protein
MAAPSRRTANAQLCPACGVVPSVIVLLILSVAPTRAESPGSAPPGVVTGG
SQDVIVGGKPAARAGDATSDGAIVEGSKNVFINGKPAAIGGSRTGCGSVV
IGSGSGVFINGKPVARAGDSTANCK
>RPA4189 conserved hypothetical protein
MLAALSRRLKSLSMPMAGYGAVLTTALLLAGAGSVHDASAVGDSRTLSFH
HTHSGESLTVTFKRSGRYDEDALKQLNHFLRDWRSQEQTVMDRQLFDILW
EVYRDVDAKQPIQIISAYRSPATNAMLRRRSSGVARHSQHMQGHAMDFFI
PGVALEQIRFAGLRLQRGGVGFYPTSGSPFVHLDTGGIRHWPRMTPDQLA
RVFPDGRTVHIPTNGKPLRGYELALADIEKRRDGSTVAPAKTNFLATLFG
GKSRDDEDETAATAAPSGAKPMADIKVAAADAVAAAAGMKPADVGSSDPV
PMPRAKPAAAIQIASAGDVVLPAPRPAQAAKAEAKTAEPKTAESKTADAK
LQSPADIINARGFWDDIPVAPKQASPAQVAAISARQALAAADKSEQATAM
NALAYAPMAQENSSKHAPTRHPHVVTASAPLPPTRASLQRQAAVSGKVDS
VIGKSSGQGKTVIATSARLAAAGSRDNDVWIRAMILMPRAMHTAATVIGD
PDMTLLSGYLAKPEATLATSFADDPQPGLYADAFSGSAVATLTTTAFPGD
ASR
>RPA4416 conserved hypothetical protein
MSKIVPCLWFAGEAEQAAQFYVSLLPGSRIDNIQRSVVDTPGGPEGSVLV
VEFTVAGQPFMALNGGTPLPFNHAVSFKIDCVDQAEVDRLWDALLAEGGE
PVACGWLRDRYGLSWQIVPADALKLFAGSNREGAKRPMQAMMQMVKLDVG
ALQRAYEGV
>RPA4192 conserved hypothetical protein
MIITDSLPLRHLWARPAATMTAALAFGVLLSSSVPAAADFRLCNNTSSRV
GIALGYKDVDGWTTEGWWNVSSRSCETLLRGALVARYYYIYALDYDRGGE
WSGQAFMCSRDKEFTIKGTENCLARGFDRTGFFEVDTGEQRAWTVQLTES
NEQNMQKLPGMPGAPGTGLPGIPPSPGRTAPATPGTPGEAGTKP
>RPA1275 conserved hypothetical protein
MSLARDQIKPLELVFDDDGLIPNTPLPLLLYKRAIDVSGREPEQAIEELF
EQNGWGDRWRNGIFDYHHYHATVHEALGVARGRAMVLFGGEQGEAIELTP
GDIAVLPAGTGHKCLFASHDFSVVGAYPPGPKMQVTRPTPANYRRALQTI
PQVALPKTDPVYGADGPLRKLWLK
>RPA3499 conserved unknown protein
MATDHIRYDVLARDALRGVLRHVLTDVAQHGLPGEHHFFITFQSKGDGVK
LSPRLLAQYPEEMTVVLQHQFWDLVVTEDRFEVGLSFGGIPERLVVPFAS
IKSFFDPSVKFGLQFEAADEVDETGEADDGTELVTAAPAPVALPTSSATT
DSSAPSDDEPPRSGEGAEVVRLDRFRKK
>RPA3753 DUF404
MAQQGGPARKARPRDRKVAQWSRDYVRLPGIPDEFIGADGQPRAVWSRYF
DAFAALSTDEIERRFAAADRHLRDAGVTYRAPGDAVDRAWPLSHVPLLIG
ESEWQQIAAGIAQRATLLEGVLQDIYGDGRLIADGALPAAAIAGSADYLR
PMVGVKPPGGRYLHFYAADIGRGPDGQWWVLGDRTQAPSGAGYALENRLV
LSRAFNDLYKSMNVERVASFFEAFRDSLRATADRDEPRIGLLTPGRFSET
YFEHATLARYLGFLLVEGDDLAVADDRLHIRTVAGLKRIDVLLRRVDANS
LDPLELDASSQLGVAGLIDVIRKSGVVVANMPGSGVLEARSLLGFLPPLA
RRLLGEDLKMPHIATWWCGQTAAREEVLDRLDEFVIEGAYGQNVPGFSHH
GPVLPGELPPVVRDHLRDAIAARGLDYVAQEQVRLSTTPVWDDGKLAPRP
FVLRVFAAATEHGWAVMPGGFCRIADRLDSRAVSMGEGARAADVWVVADH
AVAPSSLLPAVDSVRIRRITGVLPSRAADNLFWLGRYLERAEATLRLLRA
LSAPQRDPGKGPLHHAIEKIQRLLIAWGATSLGPRAQTAKIAADALQSPD
EFGSALSLIRAAQRNASSLRERLSPDAWHVITQMEARLAVPVEGEEAVVQ
AAEVALQELASFSGLSNENMNRAAGWRFLRIGRRVERAVNTARFARQFSC
DGATAEDLDVLLTLVDSQITYRSRYLLAPLLAPVRDLAVLDPYNPRSVAF
QVEELNDHVASLPALHEGGLIERPQRLAVSTLAKLTAAEVETIDASWLFA
LEQDLLSLAEAVGSHYFPHGASAMRPEKLMGLA
>RPA3831 conserved unknown protein
MRQLLIGRRAFGAAVLAVLGAAMSGTAVRADDVAGTWLRETGASKVKFAP
CGGAVCGTLVWLKPGVETPAKLGQKIFFDMKPTGPDAWAGSAFNPEDGKT
YTGKMNLSGGTLTTQGCAMGGLICKSATWTRAN
>RPA0342 conserved hypothetical protein
MAIEPDLGLDSTEPKLFSARLKPHRALSHNGFLILIGLVGTASFAAGLAF
WLIGAWPVFGFFGLDVLALAFAFKVSFARARASEEISMTCSELRVRRTSA
RGEVAEWVLNPLWVRLEKIVHHDAGIERLYLVSSGRRLAIANFLGAEEKA
NFAKALMAALDAARRGPLYPA
>RPA4751 DUF80
MVVLNRIYTRTGDDGTTALGSGERRPKSDPRIAAYGTVDETNAALGVVRL
HLSELPELDAMIGRIQNDLFDLGADLAVPEREGKAERLRVLSSQVERLER
DIDQLNEGLAPLTSFVLPGGKPAAAYLHVARTICRRAERAMVELSAKPNE
KITPAAVQYVNRLSDFLFVAARTVNDGGARDVLWVPGQNR
>RPA0263 Protein of unknown function UPF0047
MQIQRHRIELATTAPIQLIDITDQVRRFVTSSGIKEGLVTVSCLHTTARI
NVNEREEKLERDMLTFLKRFVPRDGDYLHNLDPVDGRDNAHSHLIGLFMN
SSETIPVAKGTMVLGEWQSVFFIELDGPRERRGVELQIIG
>RPA1151 conserved hypothetical protein
MSRYDFRSPRLFVDAALAPGAQVPLERDQSNYLGNVLRLGAGAAVLAFNG
RDGEWRATIAGRKRPEALEVAEQTRPQDRLPDVAYVFAPLKHARLDYMAQ
KAVEMGAGRLQPVLTQHTQVHRLNTDRMRANVIEAAEQCGILSLAEVGEP
IGLDRFLAGRANGERLLVFCDEDAEIADPVAALQAARDAAARGVDLLVGP
EGGFSTEERALLLKQPAILRLALGPRIMRADTAAVAALTLVQAVLGDWKG
N
>RPA4141 conserved hypothetical protein
MHLNIKNDEAHQLATELARMTGESLTAAVTNALREQLAREHRRRQTDQIA
QRLLKIGGRYAALPDTGRTPDEILGYDENGLPT
>RPA3960 conserved unknown protein
MPSEHRSPRLAVLIDADNASAKIADGLFEEIAKIGEASVRRIYGDFSTPR
SKGWADILAKHAIIPQQQFAYTTGKNASDITLVIDAMDLLHSGRFEGFCL
VSSDSDFTRLASRIREQGVDVFGFGEQKTPESFRQACRRFVYTENLIAGA
ADSKGTASPAKPLQPPSAATPIIERVIKQMESEDGWVSLGEVGKHLSNLA
SDFDPRTFGFRKLSDLVRKTNAFELEQQNGHSMRIRLKPTGAPAAKTTSP
RKRPARQDTRRAGGVAS
>RPA3793 conserved hypothetical protein
MQFDTKIALVIRTDLEAWQKLNVAAFLTSGIAAAFPECIGEPYQDGSGTP
YHPLIGQPILVYGADGPGLSRALERALARNVKPAIYTEAMFKTTHDAANR
EAVKAVARADLPLVGLAVRADRKVVDKILDGLKFHA
>RPA4551 conserved unknown protein
MTDLTITCPNCASSVPLTESLAAPLLKDTQAKYERLIKQKDQDIAGREQA
LRAQQAEVEKAKAAVAQQVADQVTAARARIAAEEAAKAKRLAENDLADKA
RQLAELQEVLKSRDVKLAEAQQAQAEFVKKQRLLEDEKRELHLTIEKQVQ
AGLDEARQKAQQAAEDNLRLKVTEKEEQIAAMQRQIEDLKRKAEQGSQQL
QGEVLELELEASLRAKFPHDQIEPVPKGEFGGDVLQRVVSAAAQPCGSIL
WEFKRTKNWSDGWLTKLRDDQRKAKAELALIVSNALPKGVHTFDHIDGVW
VTEARCAIPVAIALRQSLIELAAARQAGVGQQTKMELTYQYLTGPAFRQR
IEAIVEKFTEMQSDLDKERRSMMRMWAKREAQIRGVLEATAGMYGDLQGI
AGKALAEIDGMALPMLEDFSDDDGDSEAA
>RPA2661 DUF502
MTEQLPTPTGDLPPDPPRGVMGRIRNYFLTGLIVAGPVAITFYLTWWFVN
WVDGFVRPLVPPDYRPETYLPFAVPGSGLVVAFVALTLLGFLTANLIGRS
LVDLGERLLGRMPVVRAIYRGLKQVFETLFSGNGNSLRKVGLVEFPSPGM
WSIVLISLPPNQEVATKIPSQDEHISVFLPCAPNPTTGFFFYVPKNKVIP
VDMSAEEAATLIMSAGVVQPGSDPQKKIAALAATALAAKKVREPGEPESA
FSRRRAKPAEKASDKVE
>RPA1636 conserved unknown protein
MAHSEDHSALAAEYALGTLDAAERAEAEALTRSDAEFAALVKAWELKLGA
LNQMVGLVEPRAEVWDRIKAAIAAESQRAVPPQAAPPIGDVANAGVLGAP
GDPILPRPANDTGPAPSSASSELSPESKRVVHFAKRAYIWRGVAMLSSAV
AASLLLVVGAQVYQPDLLPDGVRPPIRTKIVKVETPPPAVPAQYVAVLQK
DGNAPAFILTVDAGSKTFTVRRVAATPEPGKSYELWIVSDKLQKPRSLGV
IGNRDFTVNPVLSSYDPDLVSKATYAVTVEPEGGSPTGVATGPIVFTGKL
VESVPQEPIPTTPRQ
>RPA1434 conserved hypothetical protein
MKIAAYVDAEGEITKVSDKGVILLFEQHGEVWKVRKTIRFAVRPDDGLAE
IHASLAAMVPELEDCRLFLSRSVRGVVNSLLQEMGVQTWQSHGPLFTQLE
TIARKENERAQQSAKSARAERRSRRRRRRQDDAGSDLDEIKPVLVGDDDS
GHFRLDLVRLLQDDPGLNSWDILIPFLTGTPFRQLDLVCDHIPRWFSRAM
RALDMDAEIMELPGLGVAAIITPKPQAEAT
>RPA0297 conserved hypothetical protein
MEDRMIDWVDVYPWVKTLHVVAVISWMAGMLYMPRLFVYHCTAEVGSTQS
ETFKIMERRLYRAIMNPAMMVAWVAGLAIAWEQSYFTSGWFHAKLAAVVL
MTVIHLLLGRYVRDFAADRNTRSHKFYRIINEIPTLLMIAAVLFVIVKPF
>RPA3218 Protein of unknown function UPF0047
MKSYRKELWFQTPGRRAFINITGEVEQALRDSGVREGLVLVNAMHITASV
FINDDESGLHADYEKWLEQLAPHEPVSSYRHNRTGEDNADAHMKRQVMGR
EVVVAITEGRLDFGPWEQIFYGEFDGGRRKRVLIKIIGE
>RPA1075 DUF159
MCGRFVITSAPAAIRQLFGYADQPNFPSRYNIAPTQPVPVVIVDEGARRF
RLMRWGLIPSWVKDPRMFSLLINARADTIQDKPAFRNAFRRRRCLVPADG
YYEWKAGGSRKQPYFIHPAGGGPIGFAALWETWTGPNGEELDTVAIVTTA
ARGGLADLHDRVPVTIAPHHFARWLETDETDTEAVMALLGPPGEGEFVWH
PVSTAVNRTANDNPQLILPIAAEEVEAPPPAVAKPAPPKRAARPKISADD
SGQGSLF
>RPA3947 conserved unknown protein
MITHRPIRLVARAAVAAFALLLSVAAPSVAAEPTAAGLWQRTDTGKPGGK
PVVWVLMLDRGNNMYEGIVAKSFPQAGQPNLTICEECEDDRKNQPILGIS
LIRDMKRKGRVYENGNILDPRNGDVWKAMLTVSPDGQTLTLRGYLMTPAL
GKDEDWFRLPDTAISQLDPEIVTKFMPEQAAMQAAQAGKPPVAAPGTAGK
AKAPPAPAPKGGAMAPAPAK
>RPA1885 putative portal protein, R. capsulatus GTA orfg3 homologue
MDLPMFDRLKAFLTVPEAKTSRTAQLLAVGFGGVARFTPRDYAGLAREGY
VRNAIVYRCVRLVAENAAACVFGVFDGAQEKEAHPLAALLARPNPRQDGA
ALLETLYAHLLLAGNAYIEAVTLGEAVHELYALRPDRIKLIPGTDGWAEA
YDYSVAGRTVRFDQHATPVPPILHLTFFHPLDDHYGLAPLEAAAVAVDTH
NAAARWNKALLDNSARPSGALVYAGPEGAVLSENQFERLKRELENTYEGA
ANAGRPLLLEGGLEWKAMALSPKDMDFLEAKHAAAREIALAFGVPPMLLG
IPGDNTFSNYQEANRSFVRQTVLPLATRVGNALAQWLSPQFGDGVRLVID
TDRIDALSPDRTALWDRVTRAPFLTLNEKREAVGYAPIEGGDRLG
>RPA1108 DGPF domain
MIVLQPRETHAMLYAILCYHDESFVGSWSREEDAAVMEKLKAVHGKLAEA
GRLGPVARLLPTTAATTLRKDSEPPVVIDGPFAETKEQLLGFYVVECDGL
DGALEVARDLGRANPGGAYEIRPIGLFMPGTAQP
>RPA0499 DedA family
MSSLLDSLVHLAASHSGLAYLALFMAALLEAVPVVGSLVPGSTVILALSA
LIPGGELSLAGVLASAIAGALIGDGAAYWVGHRAQRSILSAWPLSNYPYV
VAQAEAFFVRYGTWAVLFGRFVPPIRAFVPVTAGALQMTPQRFFAVDVPA
ILLWAPAHVLPGVVAATALEHNHATLHHWLPVLAGVAGTLLLGLWAYRRW
RGTPA
>RPA0713 conserved unknown protein
MHRTEAARLAARSLPCVPTKLDSCRDDSSPMDSQPPADVSADPPRVLPSA
PVMVSVCTTCKTIDGVAVGAPMLEAVRAALAGSASVQVRAVQCLSACKRA
ATVAVSSEGGYTFVFGDLDRASGADAVATFVASYQDSHYGLVPWRQRPEV
LRKGTVVRLPPPQWSPDDGRAPA
>RPA3437 conserved unknown protein
MPWNWEEPNWPVFSYETKGLEKFESDFLLRSGEFIGAFKHVSQDEQTNLR
IELISEEALKTSEIEGEILNRDSVQSSLRYQFGLAKDRPGVPAAERGIAQ
MMADLYRRFDEPLTHETLFEWHQMLMAGTKSIPIVGAYRTHADPMQVVSG
PIAHRKVHFEAPPSERVTEEMSTFINWFNDTAPGGSQALPALTRAGLAHL
YFVSIHPFEDGNGRIARALAEKSLAQNLKQPTLIALAYTIQHKRKAYYDA
LEHNNKSTEVTDWLVYFGTTVLEAQDNTNKRVEFSLAKTRFYERLRGQLN
ERQEKAIARMFAEGIEGFKGGLSAENYITITKATRSTATRDLQDLVAKDA
LTRTGELRYTRYHLKLD
>RPA3489 conserved hypothetical protein
MRSFAFADLLIGVGLLFVLEGLIFAASPNWMRRAMKSALETPDNVLRAVG
IGSAVAGLILIWVVRH
>RPA3548 possible serine protease/outer membrane autotransporter
MQVRGIGEGLRSSVTGRPRGARRRMVLSSLLASTALVAVAMPALAQQVWV
GADQIYNTPSNWSGPATVPDTGDTAVFRNNGASTAVVLSTTRNPNGFTFE
AGAPGFLIGVASGGQLNFNGAGIVNNSGTTQQLLIGPSSQIDFRGSSTAA
DVTITNTGGTLIFSETSSGGTAAVVSNGGDLALRTTSGAISIGSLSGSGT
VQATAVGGVPVQALTVGSLNTSTEFSGTFVDNGAQFALGKTGTGTLTLTG
DNFYTGGTTISSGTLQLGNGGISGSITGDITNNATLTVNRSNGTSLGGEV
SGTGQLIKLGGGILALLGNNTYTGGTIISAGTLRVGNGATSGSIVGDVVN
NGVLQFNRFDSIGFNGVISGSGSVTKLGNNAMILGGDNTYTGGTTISGGY
LQVGNGGTGGSIVGDVLNNGTLEFARSDAHTFSGAISGTGNLISFGGSAG
SGVFTMTGTNTYTGGTTVSRGTLQIGDGGTSGSIVGDVTNNATLAFNRSD
ATSFGGAISGGGNLIKRGAGNLSLTGVSSYTGATTVEAGTLGVNGSIASS
SLTTVNAGAALGGNGTVGTTLINGGALAPGNSIGTLNVSGNLTLTAASSY
MVELSPSSADRVNVSGTATLGGATVNASFASGGYVERQYTLVNATGGVVG
TFGTLVNTNLPSGFRSNLGYDSNNAYLNLVLDYTPGPSPDINSGLNRNQT
EVANALSGYFARTGSIPILFGALNPSGLSAVSGETATGAQQSTFSAMTQF
LGVLTDPSSNGRGARDAAPEPLGFADRTPRGSASDAYAMITKSAAERFVP
HWNVWGAGFGGSQTTDGNASLGSATATSRLAGIAAGADYWLSPQTVAGFA
MAGGATQFGLTSGLGSGTSDLIQVGGFIRHSFGASYLTAAAAYGWQDITT
ERTVAIGGLNQLRAKFNANAYSARVEAGHRWIAPEIGGVGLSPYAAAQVT
AFDLPAYAEQAVGGTGVFALGYTAKTVTATRSELGVRTDKSFALDGALLT
LRGRAAWAHDFDVERSIAATFQALPGASFAVNGARPARDAALTTVSAEVS
WLNGFSVAASFEGEFSDVTRSYAGKGLLRYAW
>RPA4364 conserved hypothetical protein
MPLLEGIIESRNNPLAVVEDIAASNDLAFERSGEDEITIVAKGQWTDYTL
SFTWMNEIEALHLACAFDMKVPAARRNETLRLIAAINEQLWVGHFDVWNH
TGTVMYRQALVLPGGLAATEAQCETMLISAIHACERYYPAMQFTVWAGKT
AAEAMSAAMFDTQGEA
>RPA3511 ErfK/YbiS/YcfS/YnhG
MRWYLASLSGLILLAAASTAEAKVAITVDKDSQQLSVAVDGVERYRWPVS
TGNPSHETPNGKFQTFRMEADHYSKEFDDAPMPHSIFFTKQGHAIHGTDA
AGKLGVPVSHGCVRLSRDNAATLYALVEKEGVLNTTVTLTGSSQVALARL
KSKGGNAIARAEPLPSPTYDANGQPIDLVPSRPRAVRPQVTDDGYIYPAD
GNYADRRYPAPPGYRRVAPPQYGYVESDSYAQDYAPRRSYQPRGYYDGGY
GGYSAY
>RPA3826 conserved hypothetical protein
MVTYMLPSAMLDVRYYSADRESPFADWLDGLDVMAAAKVTTAVARLSLGN
LSNVKSVGEGVLETKIDFGPGYRIYFGRDGDTLVILLCGGTKKRQRSDIA
RAIVYWRDYKQSKKRRSG
>RPA3178 hypothetical protein
MKRYGLLYLATFIVLIPLDFLFLGTIAKSFFQSQVGEMLGEVRLLPAVLF
YLLYVVGIVIFVNGSAQATWQSSLVYGALFGVFCYATFELTSLALLKHWT
WSVVAVDISWGAFVTAVSGTLGLLLADWWSPR
>RPA0047 conserved hypothetical protein
MTAVQLSSYQTGMEARFAAAARHSRLVRTLRLAVPLVVVLSMAVIIAISI
FNPFRYLSKLPIDVGDLVVSGTKITMESPHLAGFTNDGRPYEMWARTATQ
DLTSQDHVDLHTLRAKVQSEDQSTVIIESREGQFETKAQLLKLNKDVYLH
NSTGTEAWMTQADIDMGKGTVTSDSPVDVKWDGGTLRGQRMRITEKGDLI
RFEGGVVMNIDNASLGEPPPAAQPAPPSTPTKPRAAPNSGQRAVAR
>RPA0338 conserved unknown protein
MPKGYWVGRIDVHDLDGYKRDYVAHNGAVFAKYGAKFLTRGGTYEAKEGQ
ARSRNVVIEFKDYETALACYHSPEYQALIKARAPFSEGEMVVVEGYDGPQ
PF
>RPA3198 conserved hypothetical protein
MTVPWHLVALAFAVCLVVAAPGFLRVYYFVSLCYAGAIAAQSLVFALLFS
TTISGWVTLQLLLLLIYGVRLGAFLAVRERNPAYAAELARAERRTAELKL
WHQIAIWLGVSLLFTLLFLPALLTLSLQAQGAWPVSTPLGVMIMAVGLWV
ESLADWQKYRYKAEHPSHYCDTGLYRLVRCPNYLGEMVFWFGVWFSGLSA
YGSGGAWTLTLLGMLYILALMTAAAAGLERKQDERYGDRPDYQEYVRSVP
ILFPWVPLFTLRKLLIRFR
>RPA0167 DUF163
MRLLILAVGKLKQGPERELAERYRARFDDLGRKLGFRGLDVHEIAESRAR
EAPARMAEEAAAIIAQVPDGAVLVTLDERGQSLGSTAFAAQLGRWRDEQV
PGTIFVIGGADGLLPELRRKAKLSMSFGGATWPHQMVRVMLLEQIYRAAT
ILAGHPYHRA
>RPA2762 conserved hypothetical protein
MTGHFETRRLEQLSNTIFGVAMTLLAYQAPREKFASADPQWREIWHLYGA
FLSTLLLSFIVAGMFWYSHQRRLTYATDAGRIEVLVNLFFLLSIIVLPVT
CGLYGNNYDSPNVTTLYSFNLFMISLFNTVLWTIAVARRRDWLTLITPAF
ALVVMGAALLGCLIQPHWPKFIWPLAFLSPAISAWIEKRR
>RPA4220 ErfK/YbiS/YcfS/YnhG
MTSTMLSAFVPQRGRAFTPVAATAAALLLALALPATPAAAQALGYAAQSP
QFAPFETDESVIAPTDNDVSVDAGEDAQLPDRLRRAVISYPTKEPAGTIV
IDTANTYLYYVLGGGRAIRYGVGVGREGFTWAGVQTITRKAEWPDWRPPA
EMIARQPYLPRFMAGGPGNPLGARAMYLGSTTYRIHGTNDPSTIGKFVSS
GCIRLTNEDVEDLFGRTGIGTRVVVLPKSASRPIEARSTVRPGRQAMNIS
ISSID
>RPA3495 possible TctA subunit of the Tripartite Tricarboxylate Transport (TTT) Family
MDLFSNLALGFQVAASPTNLLLCLTGALVGTLIGVLPGIGTIATVAMLLP
ITFGLPPVGALIMLAGIYYGAQYGGSTTSILVNIPGEATSVVTTLDGFQM
AKQGRAGPALAIAAIGSFAAGCFATVLIALVGEPLTRLALEFGPAEYFSL
MVLGLVFAVVLARGSVLKAVAMIVLGLLLSTVGSDIETGVSRMTFDVPEL
ADGLGFATVAMGVFGFAEIIRNLDFGAATDRELVQQKITGLMPTRKDLRD
AAPAIGRGTLLGSLLGILPGGGAVIASFAAYTLEKKIARDPKRFGRGAIE
GVAAPESANNAAAQTSFIPLLTLGIPPNAVMALMVGAMTIHNIVPGPQVM
KNQPELVWGMIASMWIGNLMLLVINLPLVGIWVRLLRVPYRLMFPSIVVF
CCIGIYSVNNAPVDVVLAGAFGLIGYWLVKHDFEPAPLLLGMVLGPLMED
NLRRALLISRGDATAFVTRPLSASLLVVAAGLLILSVLPMLRRKRDEVFV
ESEG
>RPA3785 DUF72
MKKAQAGHIRIGIGGWTYEPWRGVFYPDKLPQRRELEYAASKLTSIEING
TFYGSQKPESFRKWASEVPDDFVFSLKGPRFATNRKVLAEAGDSIERFYH
SGVLELGDRLGPVLWQFAPTKKYDEADFGKFLELLPREIDGRKLRHVVEV
RHDSFKTPAFIALLREHNIPVVYSEHATYPEIADVTGDFVYARLQKGKDD
IATGYPQKSLSAWAKRLQLWADGGEPDDLPKVDASSAKPHKRDVFAYVIH
EGKVRAPAAAMELIKLVG
>RPA3537 conserved hypothetical protein
MMRLVHVLVIGMLVFAAAYVYRIKMESTVRTEKVLQIQAELRKEREAIAR
LRAEWAQLDSPGRLQGLAARHLKLKPVGARQFDALKNLPERPPAVVDPSA
PDPIASMIEKIDPDIVTGSLPAPEPAQ
>RPA1661 DUF156
MHKDIKASCQKRLGRIEGQVRGISKMVEEGRYCIDIVTQISAVRAALRRV
EEEVLKDHVAHCVEHAIASGDKADQRQKIAELMDVIQRSGR
>RPA2162 possible serine protease/outer membrane autotransporter
MLRPQLLQRRHSDPISLLSLEVTMKKQLLLTTSLAPLLAVGVLGSAPALA
DCTISGNTTGLTCSSGSISSSPSTAVTVGDGAGANGSVTLSGAGASWTDT
NEVSVGRDGGVGSINVSNGAALTTRRLALSTGIWEAGSGGNGSLVVTGPQ
TVWTSTGGVDIGRTVGAIGSLTVSNGAAAYIRNTGIYTGAGAEITIRDAG
THVEIGDPNNSSAAAWLSPSGGTVNVLSGAYLYASGIYVGSEGSNLTTMN
ISGRGTVVDTPVRIYVGGQNGSRNVDPMNGNGILNVTDGAVVTSGTVGAG
MDPLSQGIINVSGNGSRLWAKADSAQNILGNVYAGYNGNGTVVVSSGAEL
KADNEVRIGYDTQGQGVLAIGAASGQAAAAPGTITAPRIVFGSGTGQLVF
NHTSSNYTLASDIVGNGSIKAIAGRTTLTGDSSQFTGTMEVSGGILTMTG
VIGAVMAVESGAVLTGSGTVGGVNAQTGATVAPGNSPGTLTVIGNYQQAA
GSTYNAELVPGSTVSDRIAVGGTATIAPGAILKLTKYGAAPYALDARYTV
LTAAGGINGTYILTGDTSVSAFYSLIASYDPNNVYLQSVQTRGFTDAALT
ANQRATAGVLQTMGNGGGLRAGVAASPTDTAARVAFDQLSGEAFASTKTA
LLDDSRFVREATSLQVQSALRGPDTAVWARAYGSWAKADGNGNAADVRRN
AGGVLFGADGELFNALRLGVVGGYGSTSISLDGGRGSVSGDTYSVGVYGG
RAWNALALSFGANHAWHNLSSSRTVALTGFSDRLTADYNARTTQVYGDVG
YTIELGRAALQPFAGLAYVNLDTSSLTERGGAAALRSAGDNVDATFSTLG
LRANGALTLGTTAVTASGMVGWRHTMNNVVPTVTNAFSTGNAFTVSGIPL
AKDVAVFEAGLGTALNPSTALSVNYSGQVGSSISDHGVRANLRVTF
>RPA2840 conserved hypothetical protein
MRIPDRFHIAADHEHARDAAERLFAVVKQQLIAVLPKTSEFLHIGATSVP
GCLTKGDLDIVVRVEREDFQVTEAALAARYARNSGSVRTNEFAAFEDSAC
MPHLGIQLTVKGGEFDVFHRFAAALRADPALVRRYNELKRAYDGQPMDRY
RAAKDAFVTNALRGGSAERGNERAE
>RPA1993 possible virulence-associated protein
MTSRTAKIFTTGRSQAVRLPAEFRFEGSEVFVRRDPRTGDVILSRKPDSW
DGLLQLHQSADVPDDFLGPSDRSQLPQDRDPFEGWTE
>RPA2992 conserved hypothetical protein
MEERTVDPRVWSDHRGADGSSILPLAIASDRITGEIDMKRALVLLSCILL
ACPAIAQTPAPSTTTTSQAPLSAATESFIQQVAISDLFETASARLALARG
NEAQQQFANQMLQDHGKTSGDVRRLIVRDSLKVDVPSQLDAPHQALLDKL
SANNGDDFVVTYAAQQVEAHQKAVALFQSYASNGDLPALKQWAGETLPVL
KHHLEMAQKLGPAKAGAPTVGSSPAPK
>RPA0201 conserved hypothetical protein
MAGERKAAYAKGRLFVLRLLDGWRRAAVPKHSAKAARQTTMNGVGGFSRW
SGALGLCVTIATADFAAAEPRAVVELFTSQGCSSCPPADKIIGELARDPS
VIALSLPIDYWDYLGWKDTLADSRFTARQKAYSQVRGDRDVFTPQVVVNG
SAHLVGSNRAGIDSAIKTTDKTDGVMTVPVSVAVEGKEIKVSVAAAPQGE
TAKSGEVWICAVSKAIPIEIGRGENRGRHVTYHNVVRNLLKVGDWTGAAE
HWTVPLENVVGDGVDAAAVYVQNGSREKPGAMLGAAFTSLN
>RPA2319 Uncharacterized protein family UPF0065
MTATTVSRRILLCATFAASLVAALPAAAQPADWPNKPLKLIVPFPPGGAA
DAVGRVVAEKLSEVLKQPVVVENKAGAGTAIAADAAAKAAPDGYTLSLAP
AGQLTILPHLNKSLSYDPFKSFAPVSLVAEVPYVIGASADTPATSLKDLI
AAAKKEPGKLSYSSCGNGTLCHLSGELFKNLTGTDLLHVPFKGSAPAIQA
LLGGQVNLSFDTLTILAPQVKSGKVKGLVITSRTRSPLLPDVPTAAEAGL
PEFVVSSWFGIVVPAGTPKEIVARLTTEINALAKLPDVRDRLAAQGLDAI
PSTPETFTKVIHEDYARWGKVVEASGAKLD
>RPA1604 conserved hypothetical protein
MNVHVAVQTQAHRLDSNTAAFARDVLDGLTKPQKELSPKYFYDDTGSELF
VEITRLPEYYPTRTELGILRDRAAEIAALIPPDAAIVEFGAGATTKIRLL
LAAHQVAAYVPVDISGDFIITQAEALQRDFPQLAVHPLAADFTKPFALPP
AVRDLPKVGFFPGSTIGNFEPQEARRFLESARDILGRGARMIIGVDLEKD
ENVLVPAYDDAAGVTGKFNLNLLIRINRELGADFDLSAFAHRAIYNRDQH
RIEMHLVSLRDQSVHLFGERIGFRAGESIHTENSYKYSLERFRALAQQAG
WTPARCWTDADAMFSVHAIETNGSRV
>RPA2829 conserved hypothetical protein
MAQRSEIPHFPRTAAIDAYGKGGFYFADMSHQGSLLFLPDAVWGWDVTKP
EQIDRYSLQRVFDNANAIDTLIVGTGADVWIAPRQLREALRGVNVVLDTM
QTGPAIRTYNIMIGERRRVAAALIAVP
>RPA1407 conserved hypothetical protein
MILTEQEKAEVEANRRRYEQHKALGQQQAPKALPPPTPRDGAPIAPESII
HRETVPGGWYYVTKLGRGEAVRIINTSGTSCVSIQAWNALDPSERLNHAD
TIKVQWAASLRKGRVILSDMGRALLSIIEDTSGAHDTMVGGSNAATNAER
YGVGSFRNARDNFVLAAGKLGLDRRDVHPSIALFAPVAVDSQGRFEWHGE
RRHKGDFVDLRAELDLLIAVSNCPHPLDPSPSYAPGDIELIRYQAPQPAS
DDLCRTVSLEAKRAFENNAFYLAGAAGRA
>RPA3358 conserved hypothetical protein
MHPLDRPSHRRVHDQLRSMLNKLLCPILSLAAVIVVVSPSAARPMAGRSG
AIQLLAEVAPTLAPFQHVRFCIRYPLDCASDRDQTDRVELSPSTLKLLEA
INRSVNSEIAPIQKVYGSDLKIGWTISPKAGDCNDYAVTKRHRLIESGLP
ARALRLSVVRTPDQIGHLVLLVSTTDGDLVLDNLTPSIRSWQNTDYEWLK
IQSRADARLWSDVGRKIAEPADLKAPARLQLAAQRTAE
>RPA0947 conserved hypothetical protein
MSSSDSAGSANHVSVTINGRQYRMACEPGQEPQLLGLAENLETRIQSLRG
RFGEIGDARLTVMAALMMADELLDAHGRIASLQQEVDALRNDRAAALDRT
VSTNRAVAAALNSAAERIERTTQVLNRTIGGGIAIG
>RPA2856 Protein of unknown function, HesB/YadR/YfhF
MTESSVTISERAARRIGEILKGEGQGAMLRISVEGGGCSGFQYKFDVDRT
KTDDDLVIAKEGAVVLVDSASVPFLAGSEVDFVDDLIGASFRVNNPNATA
SCGCGTSFSI
>RPA3594 conserved hypothetical protein
MCRNIKTLFNFEPPATDDEVRAAALQFVRKLSGFNAPSQANAAAFDRAVT
EITDVARTLLLSLQTQAAPRDREVEAAKARERSRGRFG
>RPA0152 Protein of unknown function, UPF0066
MDATDDIRAGELASDWSGSPDAGVVFIGRIHTPWNRLKECPRHGRADGPV
CRIEVFETWLPALAGIDDGTLLEVFYWLHRSRRDLLLQCPRNDGDARGTF
SIRSPLRPNPIGTSIARVDRRDGANLFIRGLDCLDGTPLVDLKPDRAEFM
PLAPPKPGDFQVGEPRR
>RPA4272 conserved unknown protein
MSNAPLMPKATAVWLLDNTALTFDQVADFTKMHVLEVKAIADGDAAQGIK
GMDPISTGQLTRDEIEKGEGDPDYRLKLAESKVVLPQAAKKKGPRYTPVS
RRHERPSAILWLVRNHPELKDAQIMRLVGTTKTTIASVRDRTHWNAATLT
PMDPVTLGLCSQLDLDFEVQRAAKEKPIDQQYGGATLLPASETTRKEAEF
EAATSGKSQEDLDVDAVFAKLKTIGGKKHDDEDEDGGF
>RPA0686 Uncharacterized protein family UPF0065
MPITRRALLASTAAAFTASSWPARAAPDWPARIVKTISPYGAGGANDISL
RIINDFLERELHQQFIVENKPGAGTRLANEMVAHSQPDGYTFLYAAAPYA
TAEALFGKLNYQRRDLRPVAMAAFAPIFLIVNAKSDFKTLPDLIAYGKSK
PEGLTFGSPGPGSQLHLAAELLFRIAGVKGLNVPLRGDAAAYTELLAGRV
DATFTAISSALPHIQAGNFRVLGAGSAQRSAIYPGAPTLVEQGFPQIVAT
GWYGFMAPAATPAPIIDRLQDAVLRALGDADVKKKLIAQGLEAHGLNGSD
FAVFIDAETAKWSRVIEEAGLREN
>RPA0414 DUF167
MSAGMAEAWRYSAQGVAVAVRVTPRGGRDDIDGLETLSDGRPVVKVRVRA
IADGGEANRAVTELLAKAVGVPKRNVRLLSGATSRQKQIAIDGDPKQLGE
ALRRLVAAKPAE
>RPA3838 Protein of unknown function UPF0060
MTSLLTFCAAALMEIAGCFAFWAWLRLDKSPLWLIPGMLALALFAYLLTL
ADSPLAGRAYAAYGGIYIASALLWGWAIEGNRPDQWDVIGAAICLVGMSV
ILFGPRTLPA
>RPA2844 SCP-like extracellular protein
MSFPVIASASKQSTAPCTVLDCFVAALLAMTVILAVITTSSAPAFAAGPA
EMISDFRAKNGEGKVVMDATLNAIAQRQAVAMASKDVLSHEAGGSFTSRV
APANVSRAAENIAYGYPDFPNTLKQWINSPGHRANLLLKGASKVGVASVR
SSASGRTYWAMVIAGGDEKPKARQPAKSAKPAATSAKGADKEKPAKPKSV
QPKSVQPKSVQRDCSIKMLGLCF
>RPA3879 conserved hypothetical protein
MSDVKVLTGGCHCGLVRFECTTDLTMVTACNCSICTKKGLHFTFLPPKSF
QLRAGQDSLKEYLFNKRAISHQLCSECGVEVFARGTKPDGTQVVALNVAC
IDGIELSQLKMTPIDGRHR
>RPA3309 conserved unknown protein
MNRTALVLGCLLVATPALAQSVTEKTGVNSVLGIAPTTPDFVKQVAISDM
FEIQSNQLAERKGNAAQQAFAKQMIADHTKTSSELKAMVSDGKVKAELPT
ALDSAHQSKLDKLKNANGAEFSETFDEQQVSAHKDAVSLFERYAKGGDND
ALKKWAGTTLPKLQHHLEMAQQLERDRTNTTSESKAKK
>RPA2536 putative opgC protein
MEIQAILPSKGRDLRLDLFRGVANWAIYLDHIPNNVVNWITTRNWGFSDA
ADLFVFISGYTASFVYAKMMLERGFIVGGTRLTKRVWQLYVAHIVLFVIY
IVAIGYVAQRFSDPEIINEFNVAGLVANPIETLRQGLLLKFKPVNLDVLP
LYIVLMGLFPPVLWLMLRRPDAVMIGSFVLYFAARYFEWNLQAFPEGTWY
FNPFCWQLLFVFGAWCALGGTVRSRSIIASRGMLWFCLAYLVFALVMTMA
GRFQDFGDLFPRWLYDAFNPNDKTNLAPYRFLHFAVIVVLVIRFVPKDWK
GLEWRGFDPLIKCGQQSLAVFCVGVFLSFVAHFELTMSSGSLMAQILVSV
IGIALMTLVAYYISWSKQQDKPARPAAKPATQAAPAAPAAASGLEAQPAR
SQLQDSGVPAVSPAMIAAAQSSPE
>RPA3824 conserved hypothetical protein
MVRKSIDRSTAAEITRGIGNVFADLGMPDAEERQTKLRLAYALNAVIDRA
RLSQAAAAARLGINQPKVSALRNYKLEGFSVERLMTLLNALDQDVEIVIR
KKPRSRAAARISVVAA
>RPA2510 conserved hypothetical protein
MMRHHWTWHRIKALAHEEWRELVTFNPSDRPWQMPFSAALAAGVPMLIGA
YFDHLDYGLISSLGGMAFLYLPRTPLHHRMVSMMAAGFGYAACYTLGLVV
HLLPWLLVPAITFTAILVTMLCRFYRVGPPGSMFFVMAAAIAAYTPGDLM
QLPLKAGLFVLGSLLATLIAMIYSVVVLRIRDPLPVPPLATDDFEHVVLD
AVVIGVFVGGALALAQALQLERPYWVPVSCLAVIQGLSVRAIWNRHLHRL
LGTAIGLGLAAVLLALPLEKWSIAFAVIALSFVIETAVIRHYGFAVIFIT
PLTIFLADAATLGQEPVHQIIEARFIDTVVGCVVGLVGGVCLHHPGFRHV
TAGLIRRLIPAYLIPAKHDRGD
>RPA2290 conserved hypothetical protein
MRLSGWMIGAAVLLAAGSGANLAVAQEVGKASAVNPAATANLRTISIGES
IAHKERIKTTDKGSVQILFVDKTSMTVGPNSDLTIDEYVYDPNSGTGKLA
ARLGKGALRFVGGQISHTGDAEIKTASATVGIRGGVAMIGTGFVFAGYGS
STVTTPGGTVTLGAGEFTQTPGGGEPPSPPGPPPPNFVQNLIQSFQSTPG
QSGGAGQGTASRGNVQRAEARATGTPNGSVAGSLTPQLPTPLPPINQTAS
TLNQTIQTSTQQTAVEQTATRQASGTLFGFIGGLMTTWLTDTANSTYGNY
GYATVGPFTGTAVVGIDAARRRVQANFDGTVAHDANGNYLPGSFSYQFGS
LDPNEDPQGGYYDQSNFAAGPAIRRNGAIVSTIDDAPITNQTGAMVTISR
EMAQQYGSVLGFGNIQPCDCDYTKWGFWSSGSNQGPYYDWAHGFWVAGRP
PTAGEVPLSGQASYVGHAVAAIQNDGSMYYAAGAFNNTVNFGARTGQVSV
TGLDGTNYVGQVSLVSGTTGFVGSLAGNVGSRNMALVGSFFRGVSSPVGE
MGGSLAVSGPGYLGAGIFAAKMK
>RPA1046 conserved hypothetical protein
MVKRQISDFKRATANKLRGNSTAAEDILWRRLRRMNVEGSHFRRQVVIGP
YIADFACLAKRLIIEVDGSQHGDEDGLKRDEVRTQWLQSEGYRVIRFWNN
DVMSKTDAVMDAIYNATAVTPPRSPLASDPPPPGEGDTVRGGKE
>RPA3167 conserved hypothetical protein
MRLWRLSLARHAQVFDGGYGLLFDGRWNTAGRPVTYCATSPALCVLEKLV
HVEDPLLLPELMMVRYEAPDDLAIDDVPRSALPVDWRRRESWTQERGDAW
HASLTAPLLRVPSAIIPIEGSPDLNVLINHRHPDAERIRIAAIEPFALDP
RLF
>RPA2582 unknown protein
MTQPAKAQEPSMEEILASIRRIIADDEAKPATPAPAPAAVAPKPEPPKPA
PPPSKPAAMTPPPSAPSPVAQAAPPKPAAAPKPAPPPMPAPAESNSQDDI
DALLAGLDAGTTEEEVRPPQPDGDVLELTDEMALPEPEPEPAPPPPPPPP
PPVQKAPIEDDIEFAEAAPKPRAPEPAYEPPAAAAAWSEEPQQPILSRTT
AAAVESAFNSLATTVLSNNARTLEDLVKEMLRPMLKAWLDDNLPTLVERI
VRQEIERVSRGR
>RPA0036 conserved unknown protein
MDDPKQPTWNVSPSLRRHARELRKKSTEAERLMWGELRDKRLNGFSFKRQ
VPIGPYIADFACHAKKLVVELDGGQHFSDDGERSDAARTAAIEARGFRVI
RFSNHEVMTNRAGVLQSIADVLAARVLTPTLSRKRERERAKRAAKPRSDA
GAANDQGDE
>RPA2024 conserved unknown protein
MTNPREPLLHTIPILSLRPTQMTVGMREVKEKRLRWRQHKPKKQAELLGN
HMIPVVLGPGKTHYVIDHHHLARALHDEGVKEVLVTVIADLSMVDREAFW
VVLDSRRWVYPYDAKGERHHYREIPKTVAGLKDDPFRSLAGELRRVGGYA
KDTTPFSEFLWADFLRRRLSRKSVTANFQNAAEKALALAKSKDAIYLPGW
CGPAADD
>RPA3205 Type III secretion proteins, related to flagellar biosynthesis protein FlhB
MTTTDTKNALAIALHYDYAGAPRVVAKGKGVLGAKIIEVAKQHDIPIEEN
EMLAGALSHVELGDEIPEELYKAVAEVLAFVLRLSGRGPVR
>RPA4782 conserved hypothetical protein
MRIAIRRTIEVLREKQILHKLGVAVSIAVIAAACYVLYHILRGIDHDRVL
DAMGQTEPTSIALAALFVAAGYFTLTFYDLFAVHAIGRDDVPYRVNALAA
FTSYSIGHNVGASALTGGAVRYRIYSAWGLDAIDVAKVCFLAGLTFWLGN
AAVLGLGVAYHPEAASAVDLLPPAVNRVLALLILAGLMVYVAWVSFKPRC
VGRGSWTVTLPGGKLTLLQIAIGIVDLGFCALAMYVLTPDEPNVGFVVVA
VIFVSATLLGFASHSPGGLGVFDAAMLVGLWQMDKEELLAGMLLFRLLYY
IVPFVISVLVLAVREIVLGARLKRVPPLAEALKPKPVRGDPSPG
>RPA2708 possible fusaric acid resistance pump
MTRTCPAARQALTNFLRQPARKEDADAILFSAKSFAAAMLAYYVSLRIGL
PKPFWAIVTVYIVSQTSAGASLSRGVYRFAGTFVGAIATVAIVPNFVNDP
IVCCMILAGWIGRCLFLSLLDRTPRAYAFVLAGYTTSLIGFPSVLDPGAV
FDTASLRVQEICIGILCAVLIHRYVWPKPMTGLFTGKLSAMLQDARRLAS
VALTASSEENRQRREQLAVDLLMLQGLATHLPYDSASARPRRETLQLIHD
RLARLLPLTSEIEDRVRSLNIERVATVDELTALIIDVGGWIAVEKPDDQH
AEAARLIRRARSIQKRLGPDATMPADRVGANLAGHLAEMIGLLNDCDKLS
QSMTASGRVRKAAALHGPELAKGYVYHRDPWMAARAALGAMVSILGGCAF
WIWSAWPDGGTAVSILGVCCTLFGNVDTPVPFVVKYIVGSIFGVLISLVY
SFVILPHVTNFAVLVAVLAPAFLFAGSLQARAPTAFMAMGITLTIPILSG
LGTSYTGDFAASLNMAIALFVAVGFAAVSMSIFQTVPVDVSINRLLHLSR
RDVGRRALGGAPNEARWTSLMIDRTALLLPRLRAARKAYADVVDDTLRLL
RIGHVAGQLRKMVPQLKGEARAAIDGLLTEIAAHFRGRRPTTPAVLIDFD
QRIERLTAMMEHCSPKSRLRIFDLLIDLRFALGASGAAGKGALPHDH
>RPA1278 GatB/Yqey
MLRDDINNAVKEAMKAKDERKLSTLRMVNSTIKNADIEARGQGKPPLSDG
DLLSLLQKMIKQRQESVELYDKGGRAELADQERAEIAVIQAYLPQQMSDD
EVKAAIAATISETGAAGIKDMGKVIGALKAKYAGQMDFGKASGMVKAALT
G
>RPA2882 conserved hypothetical protein
MSPRSKEPVFDGENPEWTEQDFANARPPEEILPPEILAQFKNTRGPQKAP
TKVPVSIRLSADVVEHYKATGPGWQSRIDETLRKAAKLKAG
>RPA1145 ErfK/YbiS/YcfS/YnhG
MQSTSMFNKTTLALLTSVSCLTAGYHPAVAYEFSSGPQPTVVYAEPQPAP
PARVRTAYADRPNMGGGFIEFLFSDGAPPPSSRYYQREPAYEQRREYIAP
TAPQLMQEQEAAIEPAHPEFDPRYEKQLVDYHGGEKAGTIVIDTPNKFLY
LVQGDGKALRYGVGVGRPGFTWSGVKTITAKREWPAWTPPKEMLARRPDL
PRHMEGGPANPLGARAMYLGSSLYRIHGSNEPWTIGTNVSSGCIRMRNED
VIDLYGRVGVGTKVVVI
>RPA1311 possible 4-carboxymuconolactone decarboxylase
MVATEMKSPREAARAFTPQLTHFVEDPLFSKVWEDGALSKRDRSLVTVSA
LIALGAADELPAHLRRAIENGVTKVELSGLITHLAFYAGFPAAISASARA
AETLGALDGG
>RPA0677 hypothetical protein
MNMNIMHVDYSTKIPNNVDLGSDRQVLKALEGWHPGYIDWWNDMGPEGFQ
ESLVYLRTAYSVDPRGWAKFDYVKMPDYRWGILLAPKEEGRVVPFGQHYG
EPAWQEVPGEHRAMLRRLIVIQGDTEPASVEQQRHLGKTAPSLYDMRNLF
QVNVEEGRHLWAMVYLLQKYFGRDGREEADGLLRRRSGSDDAPRMLGAFN
EATPDWLSFFMFTYFTDRDGKMQLHSLAQSGFDPLSRTCRFMLTEEAHHM
FVGETGISRVVQRTCDAMRVAGISDPTDVDKVRALGVIDLPTVQKKLNLH
YSLSLDLFGSEVSTNAANAFNAGIKGRFHETQIQDDHKLERDTYPVLKLV
DGEIKRVDEPALTALNMRLRDDYSQDCVKGLLRWNKVITTAGYDFQLKLP
HVAFHRQIGEFKDVEATPDGVLIDPATWAKRRHEWLPSTDDGEFIASLMV
PVSEPGQYAPWISPPKVGIDNRPGDFEYVKIET
>RPA4491 conserved hypothetical protein
MRRAASCQSACLPTVIPLSIAEPTSAPKFGGGSMLKWRAGASWMVFACLG
VVSVSLPASAASFDVSSGATDTTAKTVTVTDTGTVESGGTLNVSGTAITF
TGPAAGVVINNSGTISSGNRGIDTSGANPPRSLTLNNFAGALISTVDDAF
RLNTAIGNGTVVINNSGTIVSNSGQVFDFASNNSATGTVQINNNAGGIIR
ALGNDAIRPGSGSTTITNAGLIDATTTANRAISLNISNLGTVTAFTVINT
STGTIQSQDDAIRATASTLATTGSFTIDNAGIITSTATGQAIDFNDLTSG
TIQIINRATGVIRSTGADAVRPGEGATVTNYGLIYSDGAVGSSNDGIDWQ
SHSGTVINKSGGTISGFRHGITTDANVTVTNEAGATIIGRNGSGVGSDGT
GTVVNYGRITGAYNGSGTGDGDGVDVDFAATVTNYGIIEGTGAGGYDSGG
RLNNSEGLSIGGGTVTNFGTISGASYGIVVNNDSNPDGSRSGVAATTITN
NAGATIIGQNGFAIRLENKTGTAADNDTIVNAGTIIGNGAIPDPNAVVLL
QNGTPDTGSVGTLDGVTYTGTGSARFIRGDGSAIQMGEGNDVLTNSGTII
GNNGRAVNMEGGNDTVNILSGSRIVGLVNGGAGTDTLNYYKVGLSDAKRA
QLAAGQTVNIGGTLYTSFEVANAFAPSFSAFASPSKPGSAGPAWLFDNLS
NSIAASADAQTMIDRLASAADVGAALAQLSPASYQALGRMTLNSVSQTGT
LIDQRMMQTRFGGGSDLGGAGTALAMADAGLFGTSGSPMERTLASLGAWN
MRDALTASGWGANADALAYAPITKAPPLRAALDPGFGVFVGSSLSSSREG
ARPDVSATRSTTASVVAGADWRVSDRWVVGAFGGYALTSGDLDSLGSTTK
ITSRSIGGYGSYRAPAWYANLLGFYGWDGYDNRRVALGSANSSRFDGNHY
TVRGAIGTDLRMFGGFVVTPN
>RPA3166 conserved hypothetical protein
MGRTMSIIEIVANKLGGKPVLGATVRSQADLAMVVRNRLPLAALQGLSRA
GLSDQEIEQFVIPQRTRRHRAERNQPLTIDESDRAVRLLRIQSLAEDTFG
GVDKASLWLRRPLAELNGETPLTVAQTEAGARLIEMVLGKIAWGAAA
>RPA2669 conserved hypothetical protein
MIKTRTAKFQIGQIVRHRIFSFRGVVFDIDPEFANTEEWWLAIPEEVRPS
KDQPFYHLLAENAESEYVAYVSEQNLLPDDSGEPIRHSQVAEIFIKDKSG
GYRPRNPSLN
>RPA1599 conserved hypothetical protein
MKLPGPDHPISITANPNGVRISADGKVIAETRRALTLQEASYPPVQYIPR
EDADMRLLEKTARVTHCPYKGDASYFSIVAGGEPQANAVWSYENPFPAMA
EIAGYLAFYPQKVQTEEFKP
>RPA1186 conserved hypothetical protein
MAKQGQSSPHNLDGLTEAAREAAASAAKGKGLPPVHLWNPPFCGDLDMRI
ASDGTWFYLGTPIGRPALVRLFSTILKREGDKHFLVTPVEKVGIKVDDAP
FLAVEMVRDTDERGALLRFRTNVDDWVTCDPDHRLRFEVAGDGGVTPYLH
VRADLWAKVTRAVYYDLVDIGEERVVDGRSMFGIASGGAFFAMADAELMR
EAH
>RPA4360 DUF152
MIITSPLLGAAPGLRHAFFTREGGVSDGIYAGLNGGIGSNDDPAKVAENR
RRMAEALGVAPERFLTVYQVHSPDVAVAEAPWDQASRPKADAIVTRTPGL
AIGVTTADCGPILFADPAAQVIGAAHAGWKGALTGVLESTLQEMEKLGAQ
RDRVVAAIGPLIRQPSYEVGDEFVARFTATDPAYAAFFTAAERPGHAMFD
LGGFIRMRLEAAGVGMIDDLGIDTYPDERFFSYRRTTHRSEPDYGRHVHA
IVLDPGV
>RPA1115 conserved hypothetical protein
MTPRLNPAAAAPDAYKAMVALEKYIQGSGLEPSLIELVKMRASQINGCAF
CLDMHSKDARARGESEQRLYLLNAWQESPLYTDRERAALGWTEALTLVAQ
THAPNQDYAAVKSHFSDAEQVNLTLLIGAINTWNRIAIGFRLLHPTPAVA
HAKAS
>RPA4610 Protein of unknown function, HesB/YadR/YfhF
MIILTEQAGTAVKAAMSRAGKLDAGLRVMVEAGGCAGYKYLIGLDAEPRV
DDAVVEAGGVKVFIDPDSQPLLSGMTIDFVESLEGSGFTFDNPNAGTKCG
CGKSFG
>RPA3784 putative dolichol-phosphate mannosyltransferase
MIDASPAAVATRAQPAAVPLQLSVVVPTFNERDNVELLFRKLEAALAGVA
WEVIFVDDNSPDGTWQVVREMARRDPRARCIRRIGRRGLSGACIEGILAS
GAPFAAVMDADLQHDETQLAKMLSLLESNAADLVIGSRYIEGGSADSFDR
QRAGISAIATQVAQKALKIEVADPMSGFFMIRRDRFEQLAPELSTQGFKI
LLDVIATARGSLRIVEVPFTFGSRLHGESKLDSMVALDFLGLVLAKFSHD
VLSLRFILFALVGSTGVVVHFVTLFVVLKLFEQPFAEAQGVAALVAMTSN
FILNNFLTYRDQRLKGFAIVRGLLLFYLVCGVGLLANVGVAFAVYNEHIT
WWLAGAAGALMGVVWNYAMSGLFVWRKR
>RPA2396 DUF208
MMSSSERQKLVAPVPQGRVLLHCCCAPCAGDVIETLLWSEVDHEVLFYNP
NIHPHGEYLRRKDELQRFAARNGIGVVDADYDSAAWFRAVRGFETEPERG
ARCTVCFDVRLERTAAHAEQHGFAMIATSLGISRLKNIRQVDACGHRAAA
RHAGLIYWGHNWRKSGGADRAAELARQEQFYRQDYCGCVYSRSKLKLIS
>RPA1836 DUF636
MTGTGKAVLSGGCQCGAIRYALSAPPLSTALCHCRMCQKATGAPFASMAE
VAKDNFAWTRGQPASFRSSSVAERDFCKDCGTPLGYRLMFCDSIEILTGT
FDRPDRVVPTMQYGTEARLGWTGTIANLPSKTTLQNYGPDRLAQVFSYQQ
PDHD
>RPA2931 DUF461
MKMFIRSVASVAVATLLSTAAVQAAEVKAGDLVISQSWSRATPGGAKVAG
GFLTIENKGGSADRLVSGTADIAGKVEIHEMSMDNGVMKMRPLDKGLPIE
PGKTVTLMPGSYHIMLMDLKGPLKQGDKVPMTLQFEKAGKVQVTLDVEGV
GAKAPGDTGGSMMKMDHGTMDHSGHGMKK
>RPA3944 conserved hypothetical protein
MHRLLSALGRGFKTYIGWKRVGIVASILIIGFAISSLIRTLKGVDHNVIL
TALTDKSPTQIGMAALCVVGAFCTLTFYDFFALRTIGKLHVPYRIAAMSA
FTSYVIGHNLGATVFTGGAIRFRIYSDYGLSAIDVAKICFISGLTFWLGN
LFVLGIGMIWHPAAASAMDLLPDQINRLIGVACLAGIAAYFIWLATGKKR
RELGQNGWKVVLPSAKLTLVQVLIGVVDLGFCALAMYLLMPSAPYIDYVS
LAVVFILATLLGFASHAPGSLGVFDAAMLVALPMFAREDVIATLLIYRVL
YFLLPFGIAISIMGVREIWLSVIKPWQERRAACNGNEHAATAAPAAAPAR
APVGQVVQRSSKL
>RPA1904 DUF262
MTASQNLLPVYQTLDLPLKSILNEIHQSRWVLPAIQREFVWSDQQICRLF
DSMMRGYPFGTFLFWQIEGSTASKFKFYEFMRRYHERDAKTCRELPALGD
RQVTAVLDGQQRLTALNIGLAGSRAMKLPNKRRSNNQAYPETFLHLDLLG
EGAEQEDGMVYDFRFLTLEKAAEQSKPAENHIWFRVGDSLQLKEGIGIFE
RVQQHELNRDQQTVAFERLDLLHRLVHKTPLISCYTERVQDLGRVLDIFI
RMNSGGTVLSYSDLLFSIAVANWTELDARREVNEIVDALNGVGGGFSFSK
DFVLKAGLMLAEIKSVGFKVENFDRTNMATLERAWPAIRQALDLTVRLVS
NFGFTGQTLRADSALLPIAYYLFKRKAQQDFLTHSRHSDDRRNILRWLAR
SYLKASGIWGSGLDTLLTSLREVVATQGTNAFPATQIEEEMARRGKTLSF
TPDELDDLVELPYGDSRTFALLTLIFETVDTSKSLHMDHIVPISKFKRRL
LLAEGVQEAAIEEWSEQANTLPNLQLIEGAINQEKLAMMPADWLRMREPD
DLRREKYIYLHMLEGLPEKLGDFEVFYQRRRRSLQSQIALLLGGSGAESA
SAPKSSAA
>RPA0170 conserved hypothetical protein
MAADELSTPLGQKRNRWQRLRHYRLPFNATQGIAALLGLFVLTFLGFALF
GNDPFGGEPVVRVEIKAGPEEAKQAAAEAGKDQGGKPENAAAAGAEKKDA
EPKDAAAGQKTITIIDGSKGTRQDVVVGTDSPDKAEPPAGPTTSDGINPK
LLEQSRYGMIPVMAGGLKPFSAYAMTTDADRAKAERMPTVTIVVGGLGIG
AARTNDAVMKLPAAVALAFTPYGSDPGKLATEARAKRHEVILQIPMEPFD
YPDNDPGPQTLLTSSAPEQNLDRLNWHLSRIQGYVGLSNFMGARFVATEP
AMQAVIRDAAKRGLGYLDDGTAPRSVAGTLAKSLAIPFVRADLTIDQVPA
GADIDKSLARLESIAKERGSAVGMASALPVTIERIVNWSKSLESRGILLA
PLTTAMLKSKSS
>RPA4188 conserved unknown protein
MRDHSTGRRGFEGVLMAVAATFLTVSATSVLAQSPTRSPADLAIDAAVPV
PSPVDLPPPSVGDFKPETHAAKPETAAAPAPTEPAKDVAATPAPAAPEPS
KDTAATAPATEPKTEAPAATATAPAAVPAKPATDQAETTKPAEPQKPASM
VAAADQPVAEQLRDLIAKSASRYFERKSERSAVEKFYETRDYAPVWTKSG
SLTAQAKGVIARLKDAAADGLDPDDYPVPSFTAATSPEALAEAELRLTES
MMDYARQAQSGRMHWSQVSADIQYPEHPIDPAQVLAKVTTAADASAALDS
YNPPHKLYRELKKKLAELRGEGDKPVIKIADGETLRYQPARKKRAEVKMD
DPRVPQLRARLGVTENPDSTTYDATVAAAVRKFQDHADLKASGVLDERTV
KALNTPKRDRTIDTIIVNMERWRWLPRDLGAPSLGDAYVILNIPDYTLKV
MQHGAEVWTTRVVTGKPGKHATPLLTETMKYITVNPTWNVPPSIIYNEYL
PALQQDPTVLDRMGLKLERNRDGSIHISQPPGEANALGRIRFNFPNKFLV
YQHDTPDKNLFAREERAFSHGCMRVQNPDVYASTLLNIAMPEKDYTPAKV
RAMYGRSEVDLKFPTPIPVNITYQTAFVDDAGKLQLRKDVYGRDAAMLAL
LKNEKGKNLEAVVAHAQPNYSRPTRLPDGVSVAGDYTGSSGASSSPFSFL
ENLFGGPQPRQQPAQRNQRRVYAR
>RPA3308 ycfI, putative structural proteins
MGFFSRDIQTMEDLLLHGLRDIYYAEQQITKALPKMIEQATNRDLSQGLT
SHLEETQKQIERLDQVFKKLGQKPSGVNCPAIDGLIKEADETAGEIADKT
VLDAAIVANAQAVEHYEIARYGTLIAWAEELGHDDIVRFLTTNLNEEKAA
NTKLNTVALRKGVNRKAAS
>RPA3804 conserved unknown protein
MAMTMNGEVQLDAPKDLVWSKLNDPEVLKVCIPGCEELEKTEDDSGFRAV
AKLKVGPVSARFKGKVTLSDLDPPNGYKITGEGEGGVAGFAKGGATVGLS
DKDGGTLLSYDVEAHIGGKLAQLGQRLINGSAKKLADEFFANFSKAVQNG
NGAA
>RPA4418 conserved unknown protein
MGSTMDKIKGQANELAGKAKQGIGEATGSDKLKGEGAIQEAKGHGQQALG
NAKDAVKDTADKVAGAAHKNL
>RPA4290 conserved hypothetical protein
MNDLPPPTPPMPLIRKGFQLIENAFLLLIGTFAVVAMAQEVYETVLNVRV
TLKELLLMFIYVEVIAMVAVYYESKKIPITLPMFIAITAISRLLILQGKD
QPPANLLYESGAILILAIACAVISYRPPRHSHHDHGHAKSAHPGQDQQA
>RPA3038 ErfK/YbiS/YcfS/YnhG
MRVFQCEADFTDERYSRTSKRSGAGRLTISQATLAAGIMATAMAAAPAQA
SPAWFWSDEVPIYDEPPPAAPKRHYQHPRKRLQLDHKAEKQIEKQATKPQ
GPVVIAVSIEQQKLRVYDANGLFAETPVSTGMRGHSTPMGVFSVIQKSKY
HRSNIYSGAPMPYMQRITWSGIALHAGALPGYPASHGCIRMPMTFAVKMW
GWTRMGARVIVAPGDVTPVSFSHPLLATKKVAPDPEPMMSDSPKPATTTK
SDKAAVNDQPAEPALSAGELRRGLALGSDKPSASPVRTADAAAATVTLTD
ASSSKAPETNDAAPATTGTVDAAKTQDKPTDNKPADAKAPAAAEAAPAAE
PKASAATSDDKAPASQGAKDQSHAPTSDEHTAAAPALPAPNPREQIAAFV
SGKDGKLYVRQNQTPLFDVPVTIAASDRPLGTHIFTAELDKDQSVRWSVV
SLPDAPAKAEPEVQRKSRRHDKALTEESKASLHTNSPAEALDRLTIPPEA
MKRIAEALTNGGSLIVSDQGIAAGETGKGTDFILRLR
>RPA3842 Uncharacterized iron-regulated membrane protein DUF337
MTQRPDHAVRRLSARRIWRALHLSLALTVGAGLALIGASGALLVLKEPLV
ALEVGREIVKPPRIAGLPDATADRWIERTSQRYPELKVIGANAPGAGFLP
GDAAIVFGRLPPGDLGVVFVHPQSGEPLGLFAFDRSWFAAVVNLHRRLLL
PNSLGTDVVGWCGIGLSLSLLSGAYLWWPRQRKWWRSFWINASSGRRNLI
EWHGVSAAYLLAPMLVLAATGVALTKPQWLGLPGPAPAKPMAQAIAPACA
AGTDILAALAATAQAHPTAHIVGIVLPRGPAGAYQIRLRSPTSSRNDLTV
EVRTGCTTPLVETVEADSALRREWLNPLHADMRLGISGQAVVFLCGIALP
ILYVTGLWLWIGRRRKRAG
>RPA1124 conserved hypothetical protein
MSSGFAVSARSLALMATLAFAAPALAQQYGGEVDPEIRIQQLEDRLRTLT
GQNEELQHRNRRLEEQVRQLQSGAGAPGAQQPAPNQATTPPAAPQQPYSQ
PAQQPAYGQQQPGYGQPQAPMVQPQPAAPQGGGRRGDAFDPSRDPNAPGV
PRALGGGQLPMEQSGGMPAGRAAGAPMDLSNGNGGYVQGGYPQGGAPSAA
KQPPAATAGGALTTQPPSQTPRDEFDLGIGYMQRRDYALAEETMRNFAQK
YPDNPLTADAQYWLGESFFQRQMYRDAAEAFLAVTSKHEKSGKAPDALLR
LGQSLSALKEKEAACAALGEIGRKYPQASSSVKKAVDREQKKLKC
>RPA3432 conserved hypothetical protein
MSDHVVPHFQNDAGVATIEIGSREFMCVGAAPPFDHPHVFLDLGNDNEII
CPYCSTLYRYAADLKPGEARPPECVLKDKVA
>RPA2241 conserved hypothetical protein
MDPRRAAEETEVYWLANYDPRLIQLVADTTGLGDVTAGQLPLRHDIASYD
GAYAKIGASKASVTAKLAPGLPLDAPLMAIVPLDANIPTRIEAMARLWRA
LTGQQPLPYARSELPASRRERLAMSLRAVDARHAGATRRDIAVTIFGPEQ
VPEGVAFDDHHLRSRTARLIRDGLALIAGGYRKLLKS
>RPA0774 conserved hypothetical protein
MSPVSLLLNILWIALGGFWMAVGWAVAAVIMAITIIGLPWARAAFSIGVY
TLFPFGQTAVPREAVTGSEDIGTGPLGVIGNIIWLVFAGWWLAIGHLVTA
VLLAVTIIGIPFAWAHLKLAGIALWPIGKVIVPIDDLGLRR
>RPA4453 Metallo-phosphoesterase:Conserved hypothetical protein 282
MRILFIGDVVGKTGRTAIAEYLPGLIRDWQLDCTIINGENAAGGFGITEA
IYNDFIDAGADAVTLGNHAWNQKEALVFIERAPRLVRPVNFPRHTPGRGA
ALVDTRSGARVLVINAMGRVFMEPLNDPFAAIARELEACPLRDAADAIVV
DFHGEASSEKQGMGHFCDGRVSLVVGTHTHVPTADHQILPNGTGYMTDAG
MTGDYDSVIGMHKDEPVHRFLTGIPQGRFEPANGDATLSGVAVETDNATG
LAIKIAPVRIGGRLEPAKPAFWTAN
>RPA0600 conserved hypothetical protein
MIRYSLRCDKGHVFESWFQSSSAYESQVRRKLVSCPQCDSVKVEKAIMAP
QIVGKKGRGRAPEPAAAEPAATNLPAETASANSPTPLLMAQERELRAKLK
ELRDHIVKTADNVGERFPEQARAMHYGDIEHRPIYGEASPTEAKALIEEG
IEVAPLPVLPDDRN
>RPA4298 ATP/GTP-binding site motif A (P-loop)
MLAGQPCPMAARVRVRSLARVPRNDGAGDMMALHCVGLRAVTMCAPAHAG
APSCYLPRVPAMSKSTRATQMLEKLGVAFKLHSYDYDPNAEAIGLAAAAA
VGVEPKRMLKTLMAELDGKPVCAVIASDKEVSMKKLAAALGGKSARMMKP
ADAERLTGYHVGGISPFGQKKRVPVAIDDAALAETTVYLNGGQRGLQIEL
DPSAAVTALGAVARPIAAD
>RPA4734 conserved hypothetical protein
MTAKGAQTRSIARPMELSPDRNPISPSSDRPVRRGGRLRATIVAVLALGF
LVGAGGFVVFLSQLRGAEQKPRHNADGIVVLTGGSSRVSDAVELLAAGYG
KRLLISGVHRTNGARDISRSVPESRDWFSCCVDLDRSAVDTRSNASETRR
WVQERGFRSLIVVTSNYHMPRAIAEMSHAMPDVELIPFAVIGDKWRDEPW
WTSGGTLRLLLSEYAKYLAVELRVRLSKLGIEVMPEPADQADPTGARKPA
TAAAN
>RPA2685 conserved hypothetical protein
MDIFAAGSRPTQRAPKDYFTGTVWQDPIVAAPAPARLAATRVAFEPGART
AWHTHPLGQTLYVVTGVGRIQTEGGPVREIRAGDVVWIPPGEKHWHGASP
SNGMTHIAMQEALDGSFATWMEQVSDDDYAVAPSA
>RPA1982 conserved unknown protein
MSADHLRRIGMKYLSKALLAAALVFGVALVGHQAQAQTKQPSPAAIASAK
EILELKRAGGIYAQAVPNIIQRTMDTLMQSNLNYQQDLKEVAVVIAKKLA
GREKEIGDGMAKIYATDFTEQELKDLVTFYKSPLGQKLLTQEPKSIAESM
QFMNQWAQKFAEEVNGEFRAEMRKRGKNI
>RPA1747 conserved hypothetical protein
MTVFAAEQQFHDAERPLPSHQARLACRAGLAETTAGVAPGFVQGNLAILP
EKYAAAFHRFCQLNPKPCPIVGMSDVGNPMIPSLGIDLDIRTDLPRYRVW
RDGELVEEPTDIVAHWRDDLVAFVIGCSFSFEEALLADDIPIRHIEEKVR
VPMYRTNIPCAPAGPFSGPMVVSMRPLKPKDAIRAIQITSRFPSVHGAPV
HIGLPQSIGIADIAKPDYGDPVPIGPDELPVFWACGVTPQAVIAAAKVPF
AITHAPGLMLVTDLKNKHLAVL
>RPA4796 ErfK/YbiS/YcfS/YnhG
MRFPRLLSRVVAAAALVMLASVPSYAQQQDAGDEPGLIADDGYVLPPEWQ
KQMVYFRTTEAPGTIIVQTSERYLYLVQGNNRALRYGIGVGREGFTWQGL
LKISRKAEWPDWVPPPEMIQRQPYLPRFMAGGPGNPLGARAMYLGSTVYR
IHGTNRPDTIGTAISSGCFRLVNADVMDLYARVPVGTKVVVRQRPEL
>RPA2470 Protein of unknown function, HesB/YadR/YfhF
MATARPRPQVMRLTDAAAGRVKELMVRADSEILGLRVGIKNGGCAGQSYT
VEYAHDIKPNDEVIEDKGVKILVDPKAVLFLLGTEMDYKADRMQAQFVFN
NPNQISACGCGESVELKPASIDG
>RPA3160 DUF218
MFFLLSKILGFFVHPSNTIAMICVAGLVLMLTRWWRAGRALLIGGVVLLL
IAGYSPLGNVLLLSLSERFPPWHEAGRAPDGIIVLGGAIASELSAARNAL
EVDASAERVLAALDLARRYPKARIVYSGGSGNLIQRSVAEAPLAGELLER
FGLASDRLVLEDKSRTTAENAAYTRALVAPKPGELWLLVTSAFHMPRSII
AFRAAGFDVVPYPVDWRTRGSEDAARTFSTLAAGLARTDVAVHEWAGLLA
YRLGGKTTSLLPSP
>RPA0893 conserved hypothetical protein
MASATARGLRKRLTPQEARLWAGLRELKLRGYHFRRRAPIGCYVVDFVCL
SAKLVIDADGEQQAMPAHFAADRQRGVVLRRQGFRVLHFWNSDIDRYFEG
VMQTVLDALPPPSVRPNVSRTGHSIHEGAGESG
>RPA1685 possible serine protease/outer membrane autotransporter
MAYKRTRSTDIQVLLAGGSRVGARSIAALLRAITVMTTLTVPAMADGGAG
GQGGDSSAAGGIGGSGYVGASGGSSGPSTTAGSGGGGGGAAGGGSGGSSG
NTDSVGGAAGGTTPGQDGSNGTTAASPSGGGGGAGGNNGNVATAPFSNAA
NSTGQNGGKGGNGGAGTDSGSTGGGGGGGGAGGVGFVFSDSSSFTNFNVT
IQGGAGGAGGAGGSAAAANGNGGAGGDGGTGLLLTSPGATVTNRGTIKGG
TGGVAGAAGAGTGTAGAAGAAGAGGAGIVGSGLTIINSGSIQGGVAGDGT
QANSITFTGGDNTLKFMSAGAGVRGAIGLATSSTTLTIEQSANSTLDQMI
TGAGSVAKTGSGVLTLTGSNSYLGGTTISAGTLQIGNGGNTGSITGDIIN
NASLAFNRSGTYTFDGAISGTGGVSKTSNGTTVLTGTNTYTGGTTISAGI
LQVGGGGSTGSITGDIINRGSLRFKRSDAYTFGGAISGTGAILQDGDGVT
ILTGTNTYTGNTAISRGKLQIGDGGTTGSITGGVTNNSELWFNRSDTYTF
GGVISGSGRVSQLGPGTTVLTGKSTYTGGTTISAGTLQIGNGGTTGSIAN
NVNNAGTLAFNRSDAYTFGGAISGTGSVSQLGSGNLTLSGANTYIGGTTI
SGGTVTVGNNSAFGTGDVKTTQGATVAFGAADYTLANNFVVSGTSIFDIQ
TGTTQTITGAVSDGDTAGIIEKNGGGKLILNGRNTFSGGTIINAGTLQLA
NSNMLSATTAATVASGTTLDLGGFNQTIGSLAGAGSVALGAALLTTGADN
SSTNFSGTMSGTGGLTKQGTGNLILSGTNDYTGPTTVNAGTLSVNGSIAS
AVTAASGGRLGGSGSVGSTTIASGGTLAPGNSIGTLTVNGDLTLAAGSSY
AVEVSPTASDRTNVTGRATLAGTVQASYATGSYISKRYTIVNAGGGVAGT
FGALVDTNLPANFRSALAYDANNAYLDLMLAFVPPTAPDYGNGLSRNQQQ
VVGALVNSFNVAGGIPTTLGTLRPDGLSQASGEAGAALQQSLFGSADLFM
RSVFSNALDPTDGSAGRECAAPGSQAGPRDACAALPPAAASASRWSAGYG
GTLRVGGDASTGSHDTTSRVAGAAAGASYQVSPQARVGFALGGSGSSFSL
AQGLGSGSSDSFNAALYGRYQLGPSYLAAALGYVWQDATVDRTVTVAGGE
ALRARMHPQALTARVEGGQRLQVSGIGVTPYAALQTTTMFLPTFGETGTS
VFALDYAGRQSTVTRSELGARFDDEVFIDGRPLALRARAAWAHGWNTDWQ
AAATLRQIPAAQFNVYGAALPGDSVQVSLGASLGLGRGWTVSAAFDGEFW
KHAESYAGRALVSYQW
>RPA1147 conserved hypothetical protein
MSKPDQRRIWLVADDYGISPGVNRAIRDLIERGRINATSVMMVGPAIGRD
DAAELLNAAAANPHAAIGLHATLTAPFAPLTMHYHPLHGGQFLPLGRKLR
GTLLRRHDRAVIATELSAQIEAFTDRFGRPPDYIDGHQHVQLFPQVRDAF
LGAVKTLAPDAWVRQCGRSVPLTRRLGNPKPLLLDALSAPFRNRAAQAGI
AFNSGFAGAYDFLRETDFDTLMGSFLHGLPDGGLVMCHPGEVDDTLISLD
PFTDQREREYAYLGSDRFPELLAKQNVSLAAVPPQAPGQTVLSHTEI
>RPA1575 conserved hypothetical protein
MGIRRIGKHLFTSRGAVHRAFSPDAFDAIERAIKASETRHVGQIRFVVEG
ALDGAPLFRDQPVRERALDIFSQLRIWDTEHNNGVLIYLLLADRNVEIIA
DRGIDGRVGADTWEAICREMEAEFREGRFEQGVLRGIDRITAHLAEHFPR
SGPSPNEIPDAPVVV
>RPA2489 hypothetical protein
MTRTAFPFATQDVSALARALNRELDAQGHKLGHVEMLNLLARSAGYRNFQ
HFRAQFDAEERLLRPPEPEPVADLQKVAQATRYFDASGVLTRWPKKASHR
LLCLWVMWSRVPAGRELSEKQFNELLLGHHGFGDHALLRRMMCDYGLMTR
TRDGRVYRRVEQKPPPEGVALIRHLAPRLAA
>RPA2784 possible nodulin-related protein
MHPLHRENHLINRIGWLRAAVLGANDGIISTASLVVGVAAAATSSEEVLL
AGVAGLVAGAMSMAAGEYVSVSSQSDTEQADLARERKELADAPDSELDEL
TKIYVDRGLEPALARQVAEQLSAKDVFAAHARDELGLSAHVVARPVQAAL
TSALTFSVGAALPIGIVLLAPTGSTSMVVSGGSLICLAILGAVSAHIGGA
GLLKPTLRVTFWGALAMAASAAIGALVGHTI
>RPA1605 conserved hypothetical protein
MLVTILSSAASAAAESSTSSVSTRDLAAQLATAYRAVRDETERRAAPLSP
EDQVVQSMPDASPAKWQRAHTTWFFEQFLLGPHCPDYQVFHPDYAFLFNS
YYVSAGPRHTRAARGLLTRPGVAEIAAYRRHVDEAMLNWFVTADAAKLEE
VAPLIEVGLNHEQQHQELLLTDILHAFAQNPIPPAYDPAWRLPTTDHHGE
DWVTLPEGIHPIGHAGDSFHFDNEKPVHRALVGPVRLARHLVTNAEWLQF
MAEGGYRTATLWLMDGFACAEREGWEAPGHWRQVDGEWKIMTLGSLQSID
PDAPVCHVSYYEADAFARWAGKHLPTEMEWEVAARAGQLPDAFGAVWQWT
RSAYAPYPGYKAIDGALGEYNGKFMVNQLVLRGSSCATPEGHSRVTYRNF
FYPHHRWQFTGMRLADYGA
>RPA2526 DUF192
MQASEMFKFRRLRWLGALAMAVMLCGLLVGGAVAAQMQSLEIVGKTGVHV
FTVEIATTDQEREVGLMYRKSLPDGQGMLFDFRPEQQVSMWMKNTYIPLD
MIFIRGDGTILRIAENTEPLSTRIIASGGPVAGVLEVSGGTAKKLGIAAG
DRVAHPLFKSR
>RPA4696 FldA, conserved unknown protein
MQTAIPCLFMRGGTSRGPFFNAADLPSDIATRDKVLLAVMGSPDRRQIDG
LGGAHPLTSKVGIVSKGSKPGVDLDFLFAQLQPDKDTVDTTPNCGNMLAA
VVPFALETGLVKPQGPTTTLRVLTLNTDMQCDITVQTPDSHVAYEGDARI
DGAPGTSAPIKINFLDTAGSVAPGLLPTGNVRDVIDGVEVTCIDNGMPLV
MFRTEAIGRTGYEGVAQLNADTELKARLEKLRIACGHAMQLGDVTSKNYP
KMTLISAPRAGGSISTRSFIPHVCHDAIGVLAAVTVATACVLKGSVTEGI
AQVPDGTVKQISVEHPTGEFSVEIEVDPQNPQNVTRAALLRTARLIMRGD
AMVPGSVWGGKQTSDAALQRAG
>RPA1902 GTA orfg11, homologue of Rhodobacter capsulatus gene transfer agent (GTA) orfg11
MTGSGTGEGCDCACVDDTTQKVDQLTTKTQALSSEAKGFARAMTSAFSQS
VTGGKQFEDVLKSLALKVSDLALKAALKPLTTSLATGFDSLFSGLFGGSL
LKNADGAIKPFAAGGVIGTPTYFPMLGGGVGLAGEAGPEAILPLARGADG
RLGVASSGGATTVNIQIATPDADSFRRSETYLTGQIARAVARGQRGL
>RPA1903 GTA orfg12, homologue of Rhodobacter capsulatus gene transfer agent (GTA) orfg12
MTPFHEILFPLDVALKSAGGPERRTEIVGFGSGREQRNARWADSRRRYDA
GYGVKTFEALQQVVAFFEERRGPLTGFRWRDRLDCSSAAPGAPVSPLDQG
IGIGDGTRASYQLVKTYGAGFAPYVRTIAKPAAGSVRVAVGGVEAAAATF
ACDPATGVVTFAPGHLPPSGAAVTSGFQFDVPVRFDTDYLEVDLSAFAAG
AIPKIPLVEIRV
>RPA1905 GTA orfg13, homologue of Rhodobacter capsulatus gene transfer agent (GTA) orfg13 (37% identity)
MLTIPTALQARLDAGVTTLAQCWIVRRRDGAVLGFTDHDRDLVIEGVSCR
AGTGFSASEASQRFDLSVDGAEISGALDDTLLREADLAAGRFDAAAIESW
LVDWSAPELRVLMARGTLGEVRREGSAFTAELRGLADLLSQETGRLYTAS
CSADLGDARCRVDLGSPARRGEGRVVAAPGTSTITVSGLDGFAPGLFTAG
RLSWRSGANAGAAVEIKQHRIVGGEVRLSLWQAMAEPIVAGDSFVVTAGC
DKLFATCRDRFGNSDNFRGFPQIPGNDFVVSYPVPGAPGNTGEPIGPVLR
GDV
>RPA1899 GTA orfg9, homologue of Rhodobacter capsulatus gene transfer agent (GTA) orfg9 (56% identity)
MGAQKGKDLLIKIHDGTNYVTVAGLRSRKIAFNAELVDITHAESADRWRE
LLAGAGVRRASISGRGLFKDAGSDALVRTAFFSGLISDCQVVVPDFGSIT
GPFQIASLEFAGEHNGEVTFDLTLESAGALGFVAI
>RPA0248 bp26, DUF541
MNMLRSLSIAAAACCLFVALPARAQDVPPPGIVVRGEAQKSVAPDTAVIE
AGVSNFAKSARAAAEATNLAIGKVLLELKNAGIDGKDIQTSQLSLQPQYA
DRPGPSEVTGYVAKNIVSVRVRDITALAGILDRLMVAGANEVRGISFTVS
NASKLLDEARTEAVADARRKAEIYARAAGLTLGAPVSIAEDSGPGVMPMR
KMAADLSAGAQVAPGEQSLNVSVTVNWGVKAQ
>RPA1256 fmdB, putative formamidase regulatory protein FmdB
MPVYEYMCDACGPFTDLRPMAECDDPQLCPTCETSSPRVILTAPNFSCMP
ASQRNAHATNEKSRNAPMTVGEYKAKHAPGCSCCSGIKKPARLQTRTKSG
AKGFPTARPWMISH
>RPA4628 hesB, Protein of unknown function, HesB/YadR/YfhF
MDMSFTPSAEKFIRRMLRFSGGTGGFRLVVSAGGCSGLSAQFDIAGAPHT
GDAVVDRGDYKLFLPESSLKLLEGVVIDFMETPTSSGFMFHDPKGSNCQC
ASDSGPKPAAGLHQLREL
>RPA1663 hpaD, putative 3,4-dihydroxyphenylacetate 2,3-dioxygenase
MGKLVLAAKVSHVPSLMLSETSDSPLRQARAGAVAALRELGRRAKEREVT
TFVVFDTHWLSNFGYHINANAQHRGSFTSHEAPHMIQDLRYDLPGDPALA
QAIAAEAQSHGLKVLAHQVPTLGLEYGTIVPMHYLNSDGWAKIVSIASPL
FTSIEESRALGEATRRAIDASDERVAILASGSLSHRLWPNKDLGPDAWTT
IASEFNRQVDLRVLQLWQERRYREFTAMLPDYAVKCNGEGGMADTVMLFA
ALGWDDYTGEAEPLCDYFPSSGSGQVNVEFHVS
>RPA4110 mycA, myosin-crossreactive antigen
MHYSSGNYEAFVRPRKSESADKKTAWLVGSGLAGLAGAAFLIRDGGVAGE
RITILEELDVPGGALDGLDVPEKGFVIRGGREMEEHFECLWDLYRSIPSL
EVEDASVLDEFYRLNKDDPNFSLQRATQNQGQDAPDKALLTLNDRAQKDL
LSVFLATREEMENKRINEVFSEDFLKSNFWLYWRTMFAFEEWHSALEMKL
YLHRFIHHIGNLADFSSLKFNRYNQYESMVLPLVKWLQDHGVRFRYGIEV
TDVDFDIHAEGKQATRIHWTEKGVEGGVDLGPDDLVLITIGSLTENSDNG
DHHTPAKLREGPAPAWDLWRRIAAKDPAFGRPDVFGAHVTETKWASATIT
ALDQRIPQYVEKIAKRNPFTGKIVTGGIVTVKDSSWLMSWAVHRQPHFKK
QPKDQMIAWLYALFVDRPGDYVKKPMLDCTGEEITQEWLYHLGVPVEDIP
ELAAAGANTVPVMMPYITAFFMPRQAGDRPDVVPEGAVNFAFIGQFAESK
QRDCIFTTEYSVRTPMEAVYTLMNVERGVPEVFNSTYDIRTLLAAIGPLR
DGKGIDLPGPSFLHKLLMKKLEGTEIAELLKEFHLISE
>RPA1485 nnrU, putative NnrU protein
MCDAGLLRRFAPRNDGLRRQRVRSTLSGWTEFVAAFAVFLLSHAIPARPA
VRARLVGALGERGFLIAYSIESLLVLTWLIVATERAPFVELWPFETWQMW
VPNLALPLACQFAAFGIGAANPLSFGGDPRKPFDPQHPGVVGIVRHPLLW
AIGLWAGAHVVPNGDLAHVLLFGFFAMIAVIGMLIIDRRKRRQLGAERWA
ELAARTSFWPFAALISGRFRPQTWRISSLRLCIGLAAWLSLLLLHPLVIG
VSPLP
>RPA4769 opgC, putative opgC protein
MTPPVTTVAEPVTGAPPAGGPPTEPAAPPPLPAASPPRKVVPKRELRLDL
FRGLALWLIFIDHLPANVLTWFTLRNYGFSDATEIFIFISGYTAAFVYGK
AMTELGFVVAAARILKRVWQIYVAHVFLFTIFLAEISYVATSFQNPLYSE
EMGILDFLKQPDVTIVQALLLRFRPVNMDVLPLYIVLMFFLPPILWLMRR
WPDITLGLATLLYAATWQFDLHLTAYPSGAWVFNPYAWQLLFVFGAWCAM
GGAKRLSRVLASKVTLWLAAAYLVAAFYVTLTWYTPQLFHTLPKWLEQWM
YPIDKPNLDVLRFAHFLALAALTVRFIPRDWPALNSPWLRPLILCGQHSL
EIFCIGIFLAFAGYFILAEVSGGAVLHFFVSVTGICIMSAAAWLFSWYKN
VASKAGSRTPPDADLAGGDAA
>RPA3768 paaA, phenylacetic acid degradation protein paaA
MYTQALNVSDGDERTIEDAERAARFQARIDAEERIEPNDWMPAAYRKTLV
RQISQHAHSEVVGMLPEGNWITRAPTLRRKAALLAKVQDECGHGLYLYAA
AETLGASREELVDQLLSGKAKYSSIFNYPTLTWADIGAIGWLVDGAAIMN
QIPLCRCSYGPYARAMIRVCKEESFHQRQGYEIMLTLAKGSAEQKALAQD
ALNRWWWPCLMMFGPPDQASQHSDTSTKWKIKRFSNDELRQKFVDATVPQ
AHYLGLTVPDPDLKKNDATGHWEYGEIPWDEFKQVLAGNGPCNRDRMAAR
RKAHDDGAWVREAAAAYAEKRKKKLAA
>RPA3766 paaC, phenylacetic acid degradation protein paaC
MPAASIEINDTPLFTTTLRRADDALVLGHRLSEWCGHAPMLEEDMALSNI
ALDLIGQARELYSYAGQVEGKGRGEDQLAYLREERQYQNLLLVEQPNGDF
ARTIARQLFYSAFADPYWRAMMQSSDATLAAIAAKSEKESAYHLRHAAEW
MIRLGDGTDESHARAQDAVEALWAFTGELFEADNAERRLIETGVAVDPAS
LRATWQSTIDTVLREATLTAPSNPWMQQGGRSGRHTEHLGRLLAELQHMQ
RTYPGLTW
>RPA4740 pcaC, putative 4-carboxymuconolactone decarboxylase
MDKTTHDKGLEIRKAVLGEAYVENALKNADDFNQPFQELVTEYCWGAIWG
RDGLPKKTRSMLNLAMIAILNRPHELRAHIKGALTNGVTKDEIREIFMQV
ACYAGIPAGVDSFRIAREVFAELDKA
>RPA3263 phnB, conserved hypothetical protein
MKIVTSLSFQGQCREAFEFYAKVLGGKIIAAFPYSDAPPEMPITDPKYKT
WLMHCWLEVGDQALMGADMDTGWASNIDKPKNGFDVALHTHDKAQAQRWY
EQLSEGGKPMMPFGETFWSPGFGSLIDRFGIPWMINTQPAQA
>RPA4752 rbn1, putative ribonuclease RNAse BN (RBN) transmembrane protein
MRQIRYAYAVAVDALYTFLADDGWAIASHIALSTLMALFPFLIVLTSLAG
FVGSRELADSAAELLLDVWPAQVASTLSGEIHDVLTTTRGGVLTIGLVLA
LYFASNGVESLRVGLNRAYAVIEPRPWYLLRLESIGYTLVAAFTALAMGF
LIVLGPLIVATARHYVPLLVHDNEPLLTFARYGIAITALTVALFLLHAYL
PAGRRSFRQILPGIVFTITASLISGMTFGMYLARFANNYVSMYAGLASVI
IALVFLYFIAAIFVYGGELNAAIIKSRLPEGTTLQEAQLRAPGAKPV
>RPA1247 rbn2, Ribonuclease BN
MSLGRSRGKALRSKIRTRPTSRIWSAGTRPTRTDSSKDQASIEMNAAEPS
AARRVWRAPQRTSPAGLHRFEDSPWQVLGAEWKPIAVGTYERIGDDRLFL
VAAGVVFYWLLALFPAITALVSSYALFADAATIGDHLAQLSSIVPAGTYS
VVEEQVGRVLANGQTKLGFAFLISLSLALWSANGGVKAIIDALNAVYDVE
EQRGFFKLNGFSLLLTLGALGAVLAAIGLVIAAPIVLARIGLGNLVAVAI
DYGRWPVLALMTFGGLSLLYRTAPNRPSPPWRWVAPGSIVATLSWLAGSA
ALSYYLANFADYNATYGSLGAAIGLMIWMWMTAIVVLAGGEVNAEIEARA
GRLGFTETSS
>RPA0012 rdxS, fixS, possible fixS
MEVMVILVPLALGLGLLGLVAFLWSLKSGQYEDLDGAAWRAIADDDPPLP
PPANVPAEKRG
>RPA3833 rrm2, tRNA/rRNA methyltransferase
MKDLPMSGSGTDRSKPPAALDGPVVILVEPQLGENIGMCARAMGNFGLTR
LRLVKPRDGWPNIAAQRSAAGADHILNAVELFDSVAEAVKDCTLLFATTA
RAHDQAKPVRGPEAAAQEIVVETASGGTTGIMFGRERHGLENDEVALANR
IVTFPVNPAFASLNLAQAVLLMGYEWFKHATQNALPYEMPERSPRASQHQ
IDAFFSNLVAELDRVEFLRPPEKRDTMLVNLRNIFTRMEPSKQDMHTLHG
VVMAIADGRKGPAKGGVLDGDQATRLRALLAERAAAGGPDAEGGSLRGLA
RMLRRNPTDAERLLWEHLRKDRRFAGNFKRQTPVGRHIPDFVSFTRRVAI
ELVNPDESDAIVRDRAMRKAWLEARDYRVALVAATDVTSDIAAVLARLEA
VLAA
>RPA1367 soxY1, putative sulfur oxidation protein
MQHLDDPPLAVDRRQMLIAASVGLVASIVPFARASTSDEVDAAIRDLIGN
ATPRDGGIALQVPETAENGAVVPVTVVVDSPMTTERYVRAIHLVATKNPT
PGIASFRLSPASGRAQVSMRIRLAEKQMLLVFAEHSDGTVNRAAAEIKVS
VGGCLT
>RPA4467 soxY2, putative sulfur oxidation protein soxY
MVFMKTETSRREALALAGIAGLAALLAPRMSFADAAMVDAEIKKLYGDKK
FDSGKIKLDVPEIAENGLVVPITVEVESPMTDADYVKAVHVFADGNPMPG
IVSYKFTPACGKASASTRMRLAQTQNIICIAEMSDGKLYSTKSSVKVTIG
GCGG
>RPA2965 yjeF, Protein of unknown function UPF0031:YjeF-related protein, N-terminal
MELLTPAEMDRADLLTIAGGSSGFALMLHAGRHVAQAAIEMADEGPILVI
AGPGNNGGDGLIAATELVALGRTVHVMLLGEREALKGDAALAAREWRGPL
LPFLTQSIGAPALIIDALFGSGLNRPVKDQALKVIEAVNHSGVPVLAVDL
PSGINGATGAVMGAAIRARETVTFFRRKIGHLLLPGRLHCGQVRLVDIGI
EPGVLGEIRPQAFENDPDLWLPDFPVPRADGHKYGRGHAVVVSGELSQTG
AARLAARGALRAGAGLVTVASPRDALAVNAAALTAVMVRPVDTPDELGTM
LADRRFNAIGIGPGAGIGEETRGKVLAALAAGAAVVLDADALTSFAGHPD
ELFEAIKSASSPQVVLTPHEGEFPRLFSDMSNKNPLRSKLERVRVAAQRS
GAVVLLKGADTVVASPDGRAAIAANAPPWLATAGSGDVLTGIITGLLAQR
VPAFEAACIGVWMHGEAACEAGPGLIAEDLTETMPAVHRRLYGALGIEY