Gene list
Applied filters:
COG category: Function unknown
Gene type: CDS
Genomic element: chromosome II
Number of genes found: 201
# Burkholderia thailandensis E264, E264; ATCC 700388 >BTH_II1653 conserved hypothetical protein MRTAYVACALAAGFASAAAFADAPAMPSGGMLVAANGMTLYTFDKDAPNA GKSLCNGPCAANWPPYKASATDRPTGGYTIIKRDDGTLQWAYQGKPLYFF AKDAAKGDKKGDGFKDVWHAVKE >BTH_II0511 conserved hypothetical protein MNTKPIPTTTTKRPAFTKTTLLVSLAATAALSLSACVAPNAYGPYGSQPQ YGTPAYSQPSYAQPDYSQPGYPQQGYAQPGYQQGYQQGYQSGYSGYSTQY GTIAGIRPIGGATSPSGVAGTVVGALVGGVLGNQIGRGHGRDAATVIGAL GGAVAGNQIGQQMGAQPSAYRVDVQLSDGSTRSFDLQTPGDLRPGDRVRI DGNQISRY >BTH_II0589 CHAD domain family MARVLEIVLDFPLQGWQAGRGARARARDLGAELARAWRICPPVKMRRGHE RVTIEPCRFVEAEPDDGGRWQTWIETTAQARRALAVRCHPFVPGVMVRER LDDYRGDVRVATPAAGASVASDASGATDAGGDSRPSASTAASVFAESAAQ SADSAPMPGFAAGRAAFPESRSPYGPAYGFVADRRRGRWLDADGIDVELT LDDIAFAPAAASSEAARAASSRVCELRLAVADPDDSSARAAALRALFNAA RELSGAWPASLATASVLDRACMGDAPDAAGSPAKAQPVDLSTMRTQRAAF FALGCGVTAQWLGNEAGARDMADPEFVHQMRVALRRLRTLVRLFPRYADE AWKDAFSGDIRWLAGMLGAVRDWDVCVTSTLPALAAADSDEAAWAGTLDA ARAQGDAARAELRQALGTARYTRLVFAWLEWLSLFSLGEDDPARGKAPSL KRHAAKRVSRLFGHLYGAGRLTTLDAAARHRVRIDAKRLRYALEFFSSLA SRRTREDTVRLLARLQNALGDANDAAVALRCLERLSAPPYQLGFARGYGA AAQRYAAEAGEQMLRGMRVPKIGGRKA >BTH_II0933 Rhs element Vgr protein, putative MPTTGRSTATCIPCADGGLTSWQIGFSSALHFLRHRRDEHFWLDRDAQEI LSEVFNRYPPLQGAFRFELSGALAKRSYCRQSETDWHFVNRIMEDEGLYG YWIHDEREQKTTLRIVDRVEALPAAKPIDFYRGNAGDEIGGFTQWATLRQ LNSVHVASRSGDYKRPSTPFEVRQSVQTTRYVEQTNWRTQEQKAIPYPPL EDYSAGAYRYPDSDRGAAWARIRAEEYESRSRRYAGVGGSRWIDAGGRFV LNDHPAHAESDPKAREFVAMAARWTIENNVSIARSSRHFPYSLQADIERA RSGFGSAFAVAPHPQDGATGLYVIEVEAQRTDIEYRSPFEHRKPAMSVEM ATIVTPNGEDVWTDPLNRVRARFHWDRQSPPDAFETSPPLLVAQSDTGPQ YGGVHVPRRGETVYVDFVGGDCDRPYIVSRAPGGATPPMWHSDGLLSGFQ SREYGGGGGYSEMQLDDATGQVRARVLSKTRGDYSHLTLGYGIVQQGNTR GRYLGSGFTLHADQYGAVRANRGLYIGTHATRHDAEQLDVDAARDQLKAA ADVLARQSSLSEQHRAESLKAGHDALTELTDATRQPVEPGASGGRTSGGG TGSANGFKVPAMLLGSAGGMGLTTFQSLHASADRHVNVVAGQSAFVATGK SFVASAGEKVSVFAQSGIKLFSKDAVQIESHRETIDLIGQKTVRIVSATE RIEIAADKEILITSGQAYIRLKGGDIQIHAPGKIDIKGSLHNFSGPASMP YPMPTQPDAVCVPCMMKQAAGRGAFVAMGA >BTH_II0866 Bacterial protein of unknown function (DUF879) 
superfamily MATTTLNRYYEDELVRLRELAAEFGRAHPLLAPMLGAPSGDPDVERLLEG VAFLTGLARQKLDDGLPELVQALANLLFPHSLCPVPAATLIAFEPRGALR ERAVVAAGTEIESMPVDGTACRFRTCGDLDVEPIALAGCRFVPPADGGPA LRLDFEMLGIDASEWDATRIRLFVGGERLHASRLFAFLMQHVVSVDIASG PPELPGPRCTLGGRALRAAGFDDALLPCPERAFPGFRLLHEYFAYAEKFL FVELSGIERWRTARSGSQFSVRLALDCAPDWLPGIERDSFRLNVVAALNL FAHEAVPIQHEHRATDYRLQPEGDAHGHCRIYSVDRVVGYRPGHPVDRHY VAFGAAGDGALTASYRLIRRAALDGRGHDVHLALAYPPGEALAAETLSIG LSCTNGTLPARLRIGDVCRPTDSSPERFAFANIAPVSPPLDPPLGEPLLW RTIGHLALNFLSLGNVDNLKQMLALHAFGERGDAARAQADRRRIDGIEAV DVRAETRIVGERMLSGQHVALRCRAHAFGGAGELYLFGCVLERFLAEYAA LNTYTRVEIDASPDDGRFAWPPRMGAQCLL >BTH_II1902 ImpA-related N-terminal family MNDDALPVHSTDLLDFDEDFIKIDAAICEYDSVGHAPQRKGESAFQWASV ETACLALLKKAKDVRVAIWYLRACIARRGLSGLADGVRLLAELMSAPVEE LHPRALPDESPGETLLIHLGWLAGPQFLHQLGSSRFEDRDATLNDLIGGR AAAIVEDRDYCVRANTLVHDIKDSLSRIRESVAVVEQELNLSRSLDLLSV AASRLARTDAGRAESESVAGEKPAHAEAGFSPAPDPQQPAAGPGGALRSR QEVGVALERIVEYFRLHEPSHPAPIFLSRIQRMLGAGFEEVMAELYPEAA SLVAQLNRPQSSIK >BTH_II1937 membrane protein, putative MRRIRYPLCRSLRQIIRVDLDSTSNTASPASPVPHSAMQGFKRKIVYVTC FELIAIAITTTGFSLLTGQAPAHASIAAVASSAIAVAWNLAYNTAFEWWE ARQTRRGRGLLRRIAHAVGFEAGLVVMLVPLFAWWLDVSLWQAFVLDLGL IAFFLAYTFAFNLAFDRLFGLPASARPAPPEASSS >BTH_II1901 Protein of unknown function (DUF770) superfamily MVRQKDGQKFIGESRAPRVQIEYDVEVYGSQKKVELPFVAGVMADLSGDN TEPLGPVEDRRFQEVDVENFDERMAQIAPSLSYHVKNVLTNDGTLIPIDL TFTSMESFEPAAVVKRIPELSTLLEARNRLKELLTYMDGKAAAEDVIQEL LKSPQWANEADAAPEQSGGAGEGADHQPEEGAK >BTH_II0136 Bacterial protein of unknown function (DUF879) superfamily MDPQFLDHYNRELTYMRELSAEFAAQHPKIARRLGMQGIEVADPYVERLI EAFCFMSARTQLKLEAEFPRFTQRLLEVTYPNYVAPTPSMAVARLRPSLR EGDFSKGFKVPRHSMLRSSIPPGEQTACEFRTGQDITLWPIEIAGATLTA VPPDLPDLQRSLLPHTKLRGALRLRVRTVGEIKFSQIAGLDRLSLYIGGD ERIASHLFELIHASSVASVVRAQGAARGEGVVVAKNAVDFEGLSPDQSLL PLVWNTFHGHNLLHEYFTCRQRFYFFALTQLNAGLSRIDGKEAEIVLLLD RLPDELVTHVEAARFLLFCAPIVNLFPKRTDRVEINRAQTAFHLIPDRTR PLDYEVFSVSRVFGQKAETSTEVTFNPLYQTLHSDIGNYGRYFSILREPR TTSTNARKYGTRTPYVGTEVYVSLVDQAEAPYADDIRYLSVDAWVTNRDL 
PRLIPRNGVNDLTMQDSVPIEGVSLVHPPSAPREPYATGETAWRLIRQLS FNYMPLAELDHRDGGQALRNMLRLFVGTSEREQATQIDSLVGARTEPVVR RLPGHGLLVYGRGVRCELTVDESGFSGLSPYLFGLVLEQYLTRHVSINVF TETELRSMQRGLVTRWKPRMGGRGAV >BTH_II0261 Protein of unknown function (DUF1316) subfamily MNDDTRARGGREGGLRAARDRLQPALLDRLTDDERARCTEPPDAQAIGGE RLRAAVLRDLAWLLNTRNGEDGFVDWAAFAHAQASVLNYGMRPLVGKPMS GVERMSVEASIRDAIVRFEPRIAPDSVEVRSVLDAPGGAAGERRHNVLMF EIKGTLWSVPHPVEFVLRSALDLETGAMALQPAAGG >BTH_II1479 Protein of unknown function (DUF1089) superfamily MREVRWASLEHDGIEHLAFERHARGSVAESVVVGRAGGLAYGLAYRVVCD ERWRAKHVIVKMMGGGTLELRADGEGRWRNAADAPLAALDGCIDIDIAAT PYTNTLPIRRLGLARDERRLIDVVYLSIPDLTARRMQQAYRCIEPDRVYR YESVASGFTARLEVDRDGLVIDYEALFKRLPSDAR >BTH_II1470 membrane protein, putative MLSHLPPRVRGLWLAAALAALLYGLSLGHAPYPGQPAAKAALGALLLAAA LRHPPTRERVWLGAALAASALGDVLLALASWPPSFIAGLGAFLLAHLAYC ALFAPWRAAPRGARAVALAALWIAAPALYAAFFPHLAALAAPVAVYVAVL AVMASLALCAHTPGPQIAAGALVFVASDALIGIDRFLGAFAGVDYFIWFL YAIAQLTIAFGVLQRKSK >BTH_II0359 conserved hypothetical protein MASIIDNLIAEHRRLERLVRLLECQSMLRDAQAAENAALLVDALYYLTRF PDVNHHALEDRIIDKLLEKTVLPLELGGELSAQHATLARQGHALIQDLES MVRDENMSRELLEFRIRLYAERLRHNMAVEELTMFPIAKRYLDTGDWSSI LQTGAHRSADPLFQTPVHERFVQLHRMIAVEADCGCKEGNA >BTH_II0990 uncharacterized protein conserved in bacteria MNSTPDAFQRPAIRALTPLEARVLGVLIEKQHTVPDTYPLSLNALTAGCN QKTARSPVMNVSEAEVLTSIDGLKRLSLASEGSSSRVPRFEHNMNRVLGI PSQAAALLTMLLLRGPQTAAELRLNTARLHGFADISSVEAFLDELAARAP ALVVKLPRAPGERESRWMHLLCGDVALDEALAHGVQEDAVPPSEFEALKA EQKALTAELARLRAFVEYMANELGIDADKFTRES >BTH_II1269 conserved hypothetical protein MSQPHLHLSPTAIYPFDARGVAKRFRHAAIFGAIDALQSGETMRFVNDHD PLPLLEQIRQHYGERVGIEYRQREPGAIVIDFVVQ >BTH_II1913 membrane protein, putative MTSTLQSRLPGFARNALRPVLDPYRRYRHAKLIHAARVALSVLASIALTT GLRVPHGEWATITVLIVIGGLQHHGNIRKKAAERALGTLIGAIAGLSLIL LQTTVHLSPLTFLVMSAACGVCAYHAIGKAGYIALLSAITMVIVAGHGDN EIADGLWRAVNVLVGIVIALAFSFALPLYATYSWRYRLADALRGCAAVHA RIAGERYVSDSEHLKDMAKLNALLVQLRSLMPSVSKEISVSMPQLEAIQR GLRLCMSSLEILSSLQPRADDEAGRRFVQLRMKADNRRIQEMLVGAGRAL 
KFGTLSRLGPLHGPALPAGEPAPPTHLSGYVSLTAKLSHEIEQLRQRLHD TAPQWNI >BTH_II0040 conserved hypothetical protein MHSTTTAFTHRGYLLNCAPARASDGSFQPYVVISRSSDGELVANRFFPSD LHFNDEDAAVAHARDWAVRWIDASSLTI >BTH_II0808 conserved hypothetical protein MRRAAAGGAGHAAQPRVASAAAPPPGAASAIVGIPRMHCSNPTRRPKPNR SASSSSSPARPRGSGRRSPNPRRSRAGGRWAAGDASPVPGHRFMLDMVPW GRRPREVIAVEPERRFAIAFAQGTLDTTIAWRLEPAAGGTRVFLEHAGFD ADAPRARIDIEYEGMKRGRPSVLARIEPAIDG >BTH_II2238 OpgC protein, putative MSSQAPAARSIEVDFFRGLVLLTIVVDHIGASVLSRVTLHAFALCDAAEV FVFLGGFATASAFLAVSARHGPGAARRRFVRRAAQIYRAFLATSTLMLVV SAVLDHYGIDAPNMALDDISVLLASPLTGLVELLTFKRQPYLASVLPMYV LFALATPAIVPLARAKPWWLLFGSVLMWGCAPWLAAELLDTDSFRWSFNP FAWQLMFVLGALMRCWPLHRDVATRPGGAAITAIAFAIVLACAYYKLCAG LPLPEGELKRHLAWPRVMNFVAFAWLMAELVRYGWIARVARAAQPVVAVG QRGMPCFVAGAAISLTLDSVLHGVRGNARLPQLGMGLAADACALALMLTV AHSGPLFRRKRRSVAA >BTH_II2251 Uncharacterized small protein MFSDLGGELRTAGRYLGQALRLMVGLPDYDGYVAHMRATHPDRPVMTYEA FFRERQNARYGSGAGKCC >BTH_II1429 serine protease, subtilase family MARRNKQKTMKRRGATLLAPVVVAAAAAVAARPGWTQAAPYQDPGRRGDP ASWRTPEFTNAWGLGAMHAEYAYAAGHTGANVAIGVLDSGYYAQHPELPG SRFVPVTAAGVSGVLNANNNNHGTLVSGVVGGVRDGVGMHGVAPDATVFE GNTNATDGFRFGVSDPKFPASDAKYFSEVYDALAAKGVRIISNSWGSQPA NENYSTLNTLTDAYKLHEAVRTATGQGTWLDAAAKVSRDGVINNFSSGNT GYDNASLRGAYPYFHPELEGHWMTTTGYDQLGGQVYNKCGVAKWWCVMAP TGVPSTSYSGGAAAPTGATYANFNGTSAAAPHASAALALIMERFPYMTGE QALSVLFTTAQNMEADPSRPDYTNNGLFSPVHPAKPGASGVPNGFGGWGL VDLRRAMNGPGQLLGTFHAALPAGVADVWSNDISDVALAARKLEDDAEHR AWLDTLKTKGWERGLPAGASDGDWIDYALGVARDAAYQAREYQGSLVKSG GGTLTLAGANTYRGLTTVDGGELRIDGSIAAGAVVNPAGRLTVTGRAADI AVDGGVATIAGTSANLSVDRQGRAAVTGTTADVRVANGFASLGGTSGNVA VGALGVTVITGRTADVAVDGGRASLDGASGNVSVGNGGIVNGNGTVRTLT AAANGTVAPGHSVGTLTVSGDVRFAPGSAYAVEVSQGGASDRIVAGGRAQ IDGGALTLALENAPPPLTPDQSRSVLGRRFEILNAAGGVAGRFDAPGGYL FVDPVLAYGPTSVSLTIDRNATPFASVARTANERSVADALETANPGSAVY NSVLLAASAQAPQATLSQLTGEIYPAAYAALVNESRQVRDAALDRLWAAR GEPGRAGAWARLLGSWGGARGSGDVNGYTSSTGGLLAGADAAVLDGVRAG GFAGYRHTGVNLRNQPSSASFDSFQLGAYAGWQPGALGVRVGAAHAWHRG GVDRAVQYGTVAESETTTLHAETTQVFGEAGYQLALGGAATVEPFFGIAY 
VHLKNEGTTETGGAAALRVQEGNHDVTFSTLGVRGETRLGLTSRLQLTLQ GSAGWQHALTDGQPTGALAFATGSNTFIVASVPVAKDAAVLNVGAGLELG KNGLLRVGYSGALASRQSEHAVQGGLHWKF >BTH_II1755 conserved hypothetical protein MTEQEAERIATHRHYKGGLYRVIGVARHSETEERLVVYEQLWPKEHSLWV RPEAMFNETLADGTPRFRKLGD >BTH_II0255 conserved hypothetical protein MPTTLACDGEPPAWYGKIPGAGDFVNHRLSHELAGWWERWLQQGMAAMRQ RGGDELARYYTVAPVWNFLIPAGAGAQCVQPGCLAPSCDRVGRYYPVIAT LPMRTADYWSALPDVADAFYWQVGSALLDAIRHARAPVQLEQALAKVRLV RGADARSAAFGGAGGERGELGESGWRGERVAARESGEGGCIGGAGDIDGA GESGKSGASDERGESGGEARSGPFARPRPSPPATPAWPGLSQYFDPYGAT SFWWTNRADGSPLRTHAHTGAPDSRLFLRLFGGVQHGV >BTH_II0227 conserved hypothetical protein MNGVDAKRFVNRQRERGAGAGAERARCGSGSGGGGGSDVAINRRRLRRCV RRTHLCHTRRPILSTGQIGRASLMVKKRIRAMFAVALVGLAAHAAHAQYT TDWIANTYGTIASHVGNNARSMWVSPEGVIYTASFWDENAGGVAIYQNGK TLGSIGTHAEFQGGAITGNATSIFAAMQYGTPQGSGTVGRYNRATLQRDL TIPVSVWNAVSRADVITGLATAGTLLYASDYFGNRVRVFTTGGVWQRDIG IANPGALALDDAGNLWVAQKNAAKIVEFDPSGALMNTIQMASASRPASLY YDASKRQLMVGDQGPDMNIKLYAIAGVPRQIGTFGVQGGYLDTTTGIKGQ VGDRRFTRVVGIGKDAAGTLYVLNNPWGGGWDLGRNGATDIHAYDALGNA LWKLQALNFEAIAAPDPTTDGALFYSGMNIYSGTAGGTFIANTVDPFTYP SDPRLDMNDYQRGQHFGQLVSVGGNRILVASGQNPGNFNFYHFNAASGYI AIPDASLPGKGFNTSLQVTAGFSIDNKGDVWVGLNGTNAISHYPLAGIDA NGKPSWGAPTSIPTPASVQPTARILYLSDSDTMILAQGIAGSWDWTAMNG RIEVYHGWSAGNVTQPNPVIALTSANPKSIAAAGNYLFVGYVHTVPNIDV FNLNTGQLVATLINSNSGVMDVGNDVDSMYGLRAYLRSTGEYVITKDNYN GSSIVVYRWRP >BTH_II1887 Bacterial protein of unknown function (DUF876) superfamily MSSLPVGPVAWSDGMLIETQHFQQLERHLAHQAALRLGQTSNHGWGFTLL DLDQDGLGLGRLGLRHARGVFQDGTAFSLPSDDPLPPPLETELAQAGDVA CLALQAARTGGPEMAFGDVASASRYRAVSTEVPDLAVGLDAPGTPRRLTI ETGQLITRLCWKSQLRSDEVALPIARVAGRNASRTVSLDPRFIPPLLDTR AHLVLRSLIDELQSTLRVRLASTSAQRVLSTGGGVADLIELLLRQAIAEY RMRLANLDAFDPLPPAMLYHELVGLLGRLSVLPGVDEELADRELGYDHDD LQTSFEPLAMMLRQALARVIETPVLPLRFEDRGDQVHICIVDKQWNLKKL IFAFSAAMPAEKLRQLLPQQTKLGAVEQIQKLVDLQLPGARLNALPNPPR QIPYYAQSTYFEVESTDPFWKQTLAGSAMALRIVGDFPDLRFEAWGLRDG KVA >BTH_II1406 Protein of unknown function (DUF636) family MQTGDSPNYEGGCTCGAVRYRMTSRPLIVHCCHCRWCQRETGTAFALNAL 
IESDRLLLLRGEVDIVDTPSNSGKGQKIARCPHCRIAVWSHYAGGGGAVS FVRVGTLDEPDRLSPDIHIFTSTKQPWVILPPEARAVPEYYSSDEVWSAE SLRRRAALRAKRER >BTH_II1061 Bacteriophage lambda tail assembly protein I MNETLCTIRLHSTLGVRFGRIHRLAVSSTAEAVRALSVLIPGFRAFLMSA RDDGLTFAVFNGRRNIGEDELEHPVGRDEIRIAPVIIGSKRGGLFNTILG AALVAVGAIATFGFAQPWGASLMGLGASMALGGIVQMLSPQQAGLAGAAN NGTSYYFNGPVNSAAQGEPVPLVIGEMIVGSKVGSSGIYAEDQV >BTH_II1424 conserved hypothetical protein MTAAMRIAAVSLQKSVRDRLEGLRVTGLYYGLAWASVILPIALLAIGLFN ATLMPSEKGFYAMSFALALSGSVAVQKNTRDLKAAGRGRAETEIVADVAE >BTH_II1998 unnamed protein product MEFPIAVHKDDGSVYGVTVPDIPGVHSWGETIDDAIKNTREAIVGHVETL IELGDDVEFTCSTVEELVAKPEYAGAVWALVSVDLSQLDSKPERINVSIP RFVLHKIDAYVASRHETRSGFLARAALEALNEGKVRHA >BTH_II1703 uncharacterized domain protein MSHDDARGILDAEYRPPIRRGDLDGFGQGIIVAIIDGVFDQQLAVTPSEL RAALARGARIFGASSMGALRAVEVPGVVGVGRIYEMFRDGVVDRDDEVAV TFDAQSLTALCQPLVNIRHALERLAATGTLARPLAKRILRTAQLMPYFDR TYPLILARVGLDDHRDAAQLAEMLASHDLKREDAITLLEYLRNVDAEPVG SAPRMQSAITLPTNARDVPSPDEPLHLWEFGPPLPFRELLEFLAFTGGLR AVALRAVAALASEDIPEITDASELQALFDRRMAQIGRTWHWLTEEEVTTS LRGLGIGTDALQASVVSESIDELRAMVLLRNASAPFLQALRNHLFLDELM LKREAARALSLRWLATRATSCGAAPSAHDRDSARRALCRQLDVRDFKGAV RQLSAWGITPSRCEDFVTELALARRAWQADSVVPQPNSRRWRWLPASRKA AGSRRFCMPAATAYTIATRLRNVVGITRVAMITGLGTLGIPNAQAFRPDG QWSSTVGSGKSESAIGARIGAIMEEVEKWAQERYSQNLDRHVVCVSSYRG LRRRAESAVDPATLDLPYDSQYSAKLVMPWVRGFDLAAGAPCLLPAAAAS HMRLPLDIFYSPQGARKTVTTNGLASGMTLAEALTHALCEYVERHARTID AIVNDNPGAPYAARSPVTDLDRAPASTRRLLRRIERAGYRLVARSIAVDI AIPTFIATILLPEGHADGTLFGDGWQQASGWAAHPDPETALNMAILEASQ TIMSHIAGAREDLTLAARSLGRHERTESRRRPALVPEFDGDAPRLPFDAI RGLVSDDAAADVRWIVARLRDAGLTRIVMIDYSIAEIAPARVVRVIVPGL ETTNPFHTGMRARIALLSDLLGVQYGKPRVGT >BTH_II0870 Protein of unknown function (DUF770) superfamily MTTNGSVAPKERVNIVYRPATGDAKAEVELPLKVLVLGEFSTADDKPPIE ELAPVNVDKDDFNDVMKAQRLTLALSVPNLLDDQAGEDDRLSVALGFESI ADFSPDAIVETVPELKQLVALRDALKALKGPLGNIPGFRKRIQEIVADKG ARKQLLDELGLDEQ >BTH_II0258 Protein of unknown function (DUF770) superfamily MTSKYKASASGQKFIARNRAPRVQIEYDVETYGAERRVQLPFVMGVIADL 
AGKRAEPLPDLPERKFLEVDVDNFDERMKSIAPRVAFQVPNTLSGDGMLS VDMTFESIDDFSPAAIARNVDALRRLLEARTELSNLLSYMDGKHGAEQLI ENAINDPELLKTLVRQPLAASADGGAPAVADTGTADDSEIRHE >BTH_II0257 ImpA-related N-terminal family MQSSDLIESLLAGVALDAPCGANLEYEQDFLRLQESATPRPEQQYGDTVI PAEAPDWGAVERLALELTARTKDLRVAAHLARSWTELRGIPGYADGLKLV AGMLGRWWDDVHPRLDADGDRDPAPRANALAEIAGAHGCARAARRQALFD GGPSVRDAERVFDGRDGGEHGYPGGRERLIADLVRARDGGQPALQAALAA LDALDAIRARVASALGGEWAPDTSDFEKALRRIVRDGLPPPVAATETDAR AGAANGAANAAHGGAANGAASGGTPAFAANGRAWRDAEVTSRDDVQFGLE KMCRYFELHEPSHPAPILLRRAQRLLSLDFYEIIRDLAPESLPKLDLLSG QRSE >BTH_II1896 conserved hypothetical protein MTTLERVSRALTPRRTFFELMRRVEALQRKHDKRLARKRRMPKWLRIEQP AEMHFASTEVERVHVALARFIEDDDHPQVTVVQRHFGLFAPYGPLPLHVT EHAMQEKRFERNAAFERFVNVVCGDLAWLHYSAWSSMHPVLGYERARNPF VERVTALADARRAQQEGGEPFEQHALACRRAFPGIYCAPRRSLADLQRML CAYFGVALQVVPRHGRWVPVPAAQSHARRLGGWRLGARIWDVQHSVEIVV GPIEADEFYRWQRRAAAVMALSAVVTDFVDGRIYPVIKVQVWTRPELAGR VGCMRVGVDAWSRPNRALRTLTVFESFRD >BTH_II1458 4-carboxymuconolactone decarboxylase MSERDREQGKARRAQVMGDAFVERAMSNLDGFSRPLQDWLNEHAWGSTWQ RGGIDLKTRSLCTCAMLAALGRGHELKGHVRGALNNGATLVEIREVLLHS ALYAGAPAAVEAFRNAREVIDALGLDMPDDGA >BTH_II1925 chitin binding domain protein MRRFHSSESGSLHAACRPERPASRELAAPAIFGRPRRRAASSTIVTESSD ECASARHFDIQNNRRISTTLRNANRVTSHLEERYMKSHFDAPSSRPRARA ALTLGAAATLTASFAALLAPVDADAHGAVGFPIARQYQCRLEGGYWDPPN GSAIPHDDCRAAYRAGNNSAYPFTQWNEVSANPVGQGNDLAQLKAAVPDG LLCAGGDTSKAGLDKAPASVWRKTQLTPRNGHIELQWENTTAHNPARMRV FISKPSYDRSRPLRWDDLQQIYDAPAPAPVPANGAGHLPGSIQSFYKLDV TLPAGRTGDAVLYSYWQRIDAGNEGFFNCSDVTIASDERASGFPWVATRA FVEPGIAPQAGQQVRFRVMGADARGAEIVDVRQPITPYNADRSVWAKQIA DQVNGRYGSVAKIGVRSGNTIYFDTANLNANKVWLQPNYSSALGVVGAK >BTH_II1763 conserved hypothetical protein MTTYRDTHATSFRIGDVVTLKTGGPRMTVTYAGPVVFDTGEWVICQWFDE HGEFRQEMFPNETVVLEPRTISAGLARMRSLSVRGGMQA >BTH_II1921 Bacterial domain of unknown function (DUF403) superfamily MLLGRTASGLYWMYRYIERAENTARIVDAGLRMALTRTSDAPAEWSSVLV SSGADDGYRRKYETYAADTVADYLLRDRDNPSSVLSCIECARSNARMVRT ALTREAWESVNGAWLAIKRALAQPIRASALPAILDEIKRETALILGSFYS 
TMLRNEIFDFAQIGAFVERADNTARILDVKYHLLLPSVSHVGTILDNYQW ESILRCVAAHRSYRWVYDVQYKPLNIADYLILNGRMPRSLRYCYGRVVSS LEHLAKDYGLTHGCHETAANIKRSLEDNTVERVFKSGLHEFLTDFIAKNN SLGLEIAQAYNFD >BTH_II1317 hypothetical protein MPDFILLTRLSPEGLRSPSSLETLEKRTVKEIEQACPGVEWRHCYAILGP YDYLDIFSAPDIETAFKVSAILRTLGRSHAEVWAATEWRAFKAIIDSIG >BTH_II0531 Bacterial protein of unknown function (DUF883) family MALTDTVEYKLDRGLSEARRAGHRFARDARSAARDLNGEVKDNMRSLVDE LDALLKEDVDADALRKRLRGRLEAARDTLDDASWHASRRLRRSAERVTQA VHDNPWQAAGIVAGLAFAAGILLARR >BTH_II1201 2OG-Fe(II) oxygenase superfamily:Prolyl 4-hydroxylase, alpha subunit MMLHIPGVLTKEQVAQCRDILDAADWADGNATSGAQSALAKRNRQLPEGS PAARAIGDAIQDALARNALFFSAALPLKVFPPLFNRYAGGDAFGTHVDNA IRLLRGTDFRVRSDLSATLFLEEPDAYDGGELCVEDTYGVHRAKLPAGDM VLYPASSLHHVTPVTRGERVASFFWIQSMVRDDADRTLLFQLDTQIQQLT AEKGGRDASVIALTGIYHNLLRRWADA >BTH_II0011 Protein of unknown function (DUF1348) superfamily MSDATEIRPPVPPFTRETAIQKVRAAEDGWNTRDPERVSLAYTPQSKWRN RAEFATGRAEIVELLRRKWTRELDYRLIKELWAFTGNRIAVRFAYEWHDD AGNWFRSYGNENWEFDENGLMAHRHASINDMPIREADRLFHWPLGRRPDD HPGLSDLGL >BTH_II1045 head portal protein MRFRLWGSIRQRKANRSTKRAAFVFSEVGMGWFDFIRRGKQPEADARPHV EPSFQVAAPATSPPGESFSGLDDPRLKEYIRRGELDGGAGRETRALRNMA VLRCVTLISGTIGMLPMNLINSDDSKRVQADDPAHRLLKYRPNDWQTPME FKSLMQLRALLDGQSMARIVWTGNRPIRLIPMDRGSAKGRLTASWQMVYD YTTPAGDKVELPAREVFHLRDLSLDGINGISRVRLSRDALELAEQAERAA SRTFTTGVMAGGAIEVPKELSDNAYGRMKSSIRENHSGSENAGSWMLLEE GATAKQFSNTAESAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIE QLAIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLN DQAAFFSKALGAGGQSPWMKQNEVRETLDLPRVDDPVADQLRNPMTQKPK GSGDEPPATT >BTH_II1897 Bacterial protein of unknown function (DUF879) superfamily MNFLDHYNDELRQLRDAGARFAKEHPQVASALGLHPDAVTDPFVERLLEG VAYLSARVQKRLDRECAEFAQQALGRICPLYMASTPAISTFAFHPDLGSP DAFRGNTLPRGSLVAAHLPGRKLPVMFSTARDVTLLPLRLASVECSRSIT GLPSLLSQRLASSHAVLRFRFEVEGGASIAELAREDEGFKPLHLSLAGDL PRAYALHRAMLADTTAWYALVSTNRGDEVLTLPMSGIRLSGVDDAEALLP EEFGGLPGLRLLREYFAQPTRLLGVYVDALAAIAAKAPSARAFELFFALR DAPGDLVGDVDASQFRLFATPAINLYSKRFDPVPYDANQPEQWIPVDRMR 
PAAHHLWALTEVFVCETNGRAHRARSVLETAGYEGHEGSIRYGMRREDAL LVDGVRRDRFDPLASHDLIAVSMVDDTLDPDDVATITGRALVADRDWRPT GLLDADLQLLDPTAVKRIECLWPASAPRGKPSVDACWEAVSHVGRNPLAL HASQQQDVTARIVEQLLLAIDKDDALDRQRLESLRSVLLRSRFVAAGRAT PTALVRATQVEIDIAESLHADRGGWLFGRLLAQALAEAATLNDGIEIVVR LDGEAASTHTNVAARDGRLA >BTH_II0530 lipoprotein, putative MRKTLVMKVAIATALGGLALAGCTTTPDKPSTAASNASTREAIDANVNAT LTRLYSTVPGSRELVAKSRGVLVFPNVLQAGFIVGAQSGNGALRVGGATL GYYNTSSLSVGLQAGAQSKALIFLFMTQDALDRFRGSEGWAAGADASVAL VKMGANGAVDTSTATAPVEVIVLTNAGLMGDLSVNGTKVTRLKL >BTH_II1354 pANL56 MELEYDPNKRDKTLTERGLDFARAVEVFAGHHFTLEDTREDYVESRYITV GMLDGRMIVMVWTPRGEARRIISMRKANDREQARYAYRLG >BTH_II0856 conserved hypothetical protein MMKLSDSMMPLVAYARQLAQSPRGDAAEVALRLDILIERARDQAACAGAP EADVDDALFAVCAWIDEALLNSGWNDAERWTLRLLQKRYFDTSHAGVVFF ERLDALDGARADVLDVYLLCLQLGFRGRYGYDGGSGPLDAIRQRALDALS AHAPPCAAHALPFPAAYPHAETQPGAHAIAAKARARRRIALAGRNFGLPL AILLSLYLIYHAVIVRMVDSVFPRLQ >BTH_II1282 serine/threonine kinase MTLKLADNPNRLSDRAAMALPDDLFVARTSATVFSTHVDLLRMTSAQWIA IAESPDAPFERRYAAGTLLGFAGDPRIRALDPPMCDVPAARARLGTEPAR VPQVIAEWARVGVIDEWIEKECPRHTVELAAYRMMRYPVTNLEYRRFLEE TGAPWLPTSWTFGVYPLERANHPVWSVPAEAADAYATWLGEATGRRFRLP TEAEWEYAASGGADVEYPWGDAFDPRAANTVEAGPLSTTPVGIFPLGRSA FGIDDLAGNVEEYVADDYRPYPGGEAIADDLAVTQGGYRIARGGSFTRFG DLARCARRHGRYQRDIYAMGFRLAESR >BTH_II0734 conserved hypothetical protein MNETNPYFQEVIDAHVDIERWLSGHAEFDRLPALLGRFSPHFSMIATRGA PLDHAGLDELFRRGHGQRPGLRIAIDELQQIGAWRGGAVIGYRETQTDGQ GRTNTRRSTVVFERGAASRIVWRHLHETPLAA >BTH_II0571 pectin degradation protein kdgF MAGLNERGFGSWNRVERETLTERIERQVVSGDALTMAKLYLKKGAFVGTH SHPNEQFTYILEGRLRFRYGEHLEHEVEVGPGEILHLPANVPHNALCLED AIDLDVFTPVRADWLAPDGNRYFAGTPAAASPAPSASR >BTH_II0926 conserved hypothetical protein MASRSAVAILCLLVWLPARAATVAITVVDASNGRPLAGATAYASGIARTS AADGTFAIPAAAGDTATAMSVTAPGYARAEVALHEADAARIVRMTPVRPK AIYLSASGVANRALRDAALALPGKTAINAIVIDVKGDDGATPYRSAARRS VGAAAERRAAAHAIDLPALVRALHARGLYLIARIVVFKDDPLAAAHPDWA VRDAAGAVWRDRERQRWIDPTVRAAWAHPFDLAEEAARMGFDEIQFDYLR FPDANGLRFGEPNTEANRVAAITGFLAGARARLRPYNVYLSVDIFGYVCW 
NSNDTHIGQRIETLGRLVDYISPMLYPSGFTWGLPGIRKPTEQPGAIVGR SLAQAQRRTGLPGVRFRPWLQAFRDYAFDRRTFGADEIREQIAAAEAAGT DGWMLWNPHNRYDYAALPH >BTH_II1900 Protein of unknown function (DUF877) superfamily MQQLESSAEKVVVDQNNVNEDLKDILRRSFRPRTNEAAEAVQNAVETLLT YARRSRVVVREDVAQTIEQLVAELDKKISEQLTLVLHNKRFQSLEGAWRG LHYLVSNTDTSENLKIRYLNISKADLGKTLRRFKGVVWDQSPIFKMIYEQ EYGQFGGEPFGCLIGDFYFDHSMQDVSILTEMSKISAAAHAPFIAAAAPG LLQMDDWSELSNPRDVSKIFTATEYAFWRRLRESNDSRYLALTLPRFLAR VPYGPKTQPVEEFGFEEKVDPNRAEDFCWANSAYAMGANITRAFKTYGWC TKIRGVESGGAVEVLPKFVLPSQDREVDLHCPTEIAISDRREHELSESGL MPLVYRKNSDTAAFIGAKTVHRPAIYEDDDATANSNLSSRLPYIFATCRF AHYLKCIVRDKIGSFKSAEDTQRWLNDWLMNYVDGDPSISSEVTKSQRPL SAAEVVVDEIPENPGYYRAQFFLRPHFQLEGLTVSLRLVSKLPSTKQEVT T >BTH_II0130 Rhs family protein MTIPAARVGDAIGHGGVLVTGSGDVFMNGIPAGFVAGSVAVCALHASAQA VVAASGTVFINRLPAAFVGGVTSCGAPIVSGSPDVMIGS >BTH_II0895 conserved hypothetical protein MVERSENRELVSFWNELVSLYQLPDEFRPSKIHSDPVCARSDARRARRRN DENPRKNPMNPNLAALLATVSDQPTIDELMRLMHVPACFERCEPGVVTRA ELAQALNMALFADLLERVPTGRAYTRDVARDGGRVTFDHGALRTVRWPSC GALPPGEAAFTRILRPLGYRLNGTYPLERLRMTGRSYAHLDAPDQIAQFF VSELHPERFSAAFQAAVTNVLGTSADPLTPQAAATLAELERDASLPLADA RALMPVLVRCFDRQHASPALADYETLLAESAEMAWIATEGNAFNHATDRV PDVDALAAAQRALGRPIKPAVEVSRSGRVRQTAFRADPVVRAFVDASGAL VEREVPGSFYEFITRDAVADAETGRRALDLSFDAGNATGIFKMTAAA >BTH_II1891 pentapeptide repeat family protein MSKIRSAVPPPPLPEIVEGQRYVSAQRDVALADTLFVDCHFERVEWTGCR LSNLRFVNCTFDANRFDRCELEKLSYESSRVREGAWTQSALQRVSFNECE IDGGAWAGCLLKDVVCSQSKGGAWTFDAVRGAHVSLVAGEYAGVTLRGGH WSDTSWIGSRLVDLRLESVGLENLIAGQSGFERAVLVECRGINVRWIDSR IERMTVQGCELKQAAWSHSTWATGEIHASRLPIASFDHASVNGLTVTNSE LPQAIFDSASVADSALQGVRAPRIALRDAWLTRVNLAGAQLQQLDARGVR LERVDLRGADCRSGNLVGQLRQTWAAADTRDAVFEEATSADDRLWWQRVQ PGARGV >BTH_II2176 MgtC family protein MLNNVELIMRLILAAALGSVIGIERERLSWAAGLRTHMLVCVGSALIMIV SAFGFADVLGQAHVDLDPSRMAAQVVSGIGFLGAGSILLRGEIVRGLTTA ASLWSVAAVGLAVGGGLYAAAIAATVIILVILAGIKPLERRYLTVRQRRH LVLLVERGTLTFDSLHAALGVDSARVKRFIVQQSEDAADSDEVTIALGRV SDAEYGSICARLRQLPGVKGFAEQKSGLPDD >BTH_II2166 lipoprotein, putative 
MTTNRQESTVSLRRACASLAALALLAACTSTPEAIQRKTGWMRSEVADSY IYAYPLVLMDVAKEAATADGAPGAMPVNTLRHAQALPAPGAANPPLASVD MLDSTAWLDVAEEPVVVVLPDGRGRYVDARALDMWTNVLWSSGPSANPRS GASRARTLAFVGAGWEGSLPEGVTRVDAPARNVWLDVRVQTSGGRDLAAA KRLQRAIRVEPLSVYTGDARRAPRAARADAGAASAEPASGVASPVASPVA QVAALDPPGFFSRVARALQDNPPPAADAHAQEILADIGVTAGAPVQWKGD KLIGAENGAAEARERLAALPPNALAANGWSWLGDGVGNYGQDYALRAYAA STQFGAGTRDDEIVAVVKTDSAGRPLNGAHRYVIRFAPNALPPARAFWSL APYTPDGAVPELGRARRSIGERDRLHRNRDGSLEIVVSATPPGKGYASNW LPAPRADFELALRLYAPKRQAADGNWQPPAVVRK >BTH_II0560 conserved hypothetical protein MNMSTDRIERRIVLHAPRSRVWRALTNADEFGAWFRVDLAGQAFEAGRRV EGRITYPGYEHLVLQMRIERIEPEHHFSYRWHPAAVDPAVDYSQEAPTLV VFELADAEGGPLLTVVESGFDALPVERRADAFRMNSGGWDEQMTNIAAHV DAR >BTH_II1918 u1937b; B1937_F1_4 MQTLFDTGSQQDAAELGAALAAPAITGRYDELRGGAAALPSACLAPAWRA FFTQLGSVGFADLDRRAEALQRRMRENGLAYHPHERAAGGGAVRPWSLDL LPLIIAPDDWAAIERGVLQRVRLLNAILADLYGEQTILRRGLLPPALVTG HPGYLRPMCGARAPGGTWLHVVAFDLARGPDGAWRLMAQHAQGPSGLGYL LENRLIVSRLFPRAFRGLHVQRLAASYRALLQSMQALSPERKNSRIVLLT PGPHAAAYFEHAYLARYLGLTLVEGGDLTARDQRVYLKTLRGLEPVHGIL RRVDDEWLDPLELRPDSLLGVPGLMQAVRAGNVLLANLPGSGFLESPGIL GFLPRLAQSLLGETLSLPAVPSWWCGEQAACDEALPLLARSIVKPTFPAS VQAGGAFEPVIGARLSPQQLAEWRARIAAQPAHYTIQADLPLSQAPTWAL GAGMGDGGARIVPRPLLLRVFALADGARSWRVLPGGLARVGTRDELFNAP MPRGGSSVDTWVMTEGAVDPTTLLQTHLGPDDLQERSRAIASRAAENLFW LGRYTERATNLTRLARAALERLRGEDDVETPVHLHLLDALCRDNGLLPAD APPAVDAPRAFQQALTRSLTPRADRSMGIASCLFGMRGAAAAIRERLSRE QWRLIDDATQLFDEGGDDVEPEEQLGNDALQRLERLNLLLSAITGAQTDN MTRDDGWRLLSIGRQIDRLEFLGGVLGHAFDGGAIHKQEGFELVLELFDS GITFRSRFQRCFDVAPLLSLVVLDTDNPRSLAWVAQALRGRLSKVERGEG YALSELADGIPDIAGWPLHVLCETDGDGRHAALLARLQACGKAAWDVSNR IGERYFSHVRDAGRSLWG >BTH_II0370 conserved hypothetical protein MVLNAALSRLYVAPDNADVVSVIDTAASKIVSTVAPARLVTEMQYRGASP NGLMLSADEHTLYATNLGTNDVAVISLAGASPAVTGLIPTGCYPSDLAVG AANALSVVYTKNMPGPNPGNCMDSGRTVPCPVKSTPVKLVENQYIEQLSK SSPMWMPAPGGKTLDLLTTQVANNNSVNAALTPNDITTMAALRKKIKHVM LQGGGVRPAREHATADSAGRHQDGGARSDAQDALLGARDARHGFLGRGSR RCSRVQQGALEGADERSRVSGARRRACRAAPARR >BTH_II0608 conserved hypothetical protein 
MNKPTRRPTTGDLSDRREPNRRRRARRSPARQRGSLAVIAAIAIGVVIAA LGAVDLGNLFYQRRALQSVADLAALAAAQTMDDGCTQPAATAQSAALGNG FDSAASGQSMTVVCGRWDVKDNAGPSFFAGSASGTAAGSDAQLNAVQVTL TRVVPYYFLGAQRTVSATSTAQATNVGAYSIGTTLAQLQGGVVNALLNGL LGANLNLSVLSYQGLANARIRIKDLMAAANVGTVNALLNTQTTVPQLANW MLTALSQTSVANADLQTSIGALQTIVSANVPGGQTFTIGSTANSTGIFSV GLSDPQAALDATFSPFDALLVAAEIATGQTAFSLANGLNVGGLNASLQVQ IIQPPVLGIGEAGIDPATKTWRTIARTAQVRLYLNIGLGTANLPLGLLGA LVPVQVNLPLSMQIAPGQAWLQSANCTASPSTCASAIGVQTGIANLCVGD TPANLSASLPFTCSTPATLVNVANLVTIQSLASLPADVPASETPTLTFYG TTGGYQSTNSNGVGSVLGNALSGLGTSLQQTQISLIGISLPLDPIQAALD SFLGAVLPPMLSGLDAAVVPLLQLLGVQIGESTIHDMSLTCGVSQLVY >BTH_II0077 conserved hypothetical protein MAEKKRRIHAALLAAAIAATAPACHAQNATDWLANTYGTLAAHVGNAARS MWVAPEGVIYTASMWDEYEGGVAIYQNGRSVGSIGTHAEFQGGAITGNAT SVFAALQYDKSHGSGAVGRYNRATKTRDLVIQVSASNNQPRVDVVTGLAA AGSLLYASDFYGNRVRIFTTDGIWQRDIGVSGPGALAVDRAGNVWVARKS AGAIVEFSPAGALLNIIRMPGGSQPSALYFDASSGQLMVGDEGPDMNIKS YRASGTPALVGTFGIRGGYLDATTGIKGQVGAKRFTRVAGIGKDSAGTLY VLNNPWGGSWDLGRNGGTDIHAYDSAGNLQRTLQSLNFEGVAAPDPATDG ALFYGGTNIYAGGAGGTFVANTVDPFSYPSDPRIDMNDTQRDEHFGQLVA VGANRILVVSGQNPPVFHFFHFNQANGYVAIPDASIPGPAFNTTQRVTGG FCIDGNGGVWAGLDKTGSIYHYPLTGFDAGGKPAWGAGIPIRIPASVQPL TRIVYLADSDTMILAQGIVGSADWTAIGTRIEVYHGWRAGNTTAPDPVIN LAGAGAKSIDAAGNHLFVGYWFSGSGPARPNIDAFNLATGKLDATLVNTS SGTVDASSAVDSMYGVRAYLRSTGEYVVTKNNVKGNSITVYRWKP >BTH_II0902 conserved hypothetical protein MKTSRLIKSAALASLASLVLVGTLGIRAAVADSGDDCRAPLADWKPRDAV HALAQQKGWRIDKLKADDGCYEIKGHDAGGKRFKAKLDPVTLDVLRMKRE GDREREHGHDDGDSDDHGRAPDARAAAGGPPAGAPPGGVLKPGSKPDVQI R >BTH_II2258 GtrA-like protein family MSAAADIGMRARIVRFGVSGAASTALHAAIAGTLMGALAATAVQANAIAF VCATGASYLLNTLWSFSVPLRWRNVARFLAVSVAGLMLTMAISHGVQALG IAAAWSIAAVVVLVPPLTFAMHRLWTYR >BTH_II0123 Protein of unknown function (DUF796) superfamily MSHDIFLKINGIDGEAEDATHKGEIEVLSWSWNVSQQSNMHLGSGGGAGK ATVDDLLFEHYIDRASPNLVQYCLLGKHIDEARLVVRKAGGSPLEYIKLT MNDVLVTQVSPAGVAQDESRPRELVRLSFSRLKQEYVVQNAQGGSGGAIT ATFDIKKNAA >BTH_II0559 conserved hypothetical protein MTFYPQLLRNRPRMVIAAAAGVAFGLLFPYPLRPFARVLIGWDCTIWLYL 
VLMWVRMVRAHHHKVREIAMREDENATIVLTIICFATVASIAAIVLELVS AKSVGFRSGLGHYAVTGATMFGAWFLIPTIFTLHYARLYYLSPKEARAMA FPDRELEPDYWDFLYFSFTIAVASQTSDVSLRGRSIRRAALAQSILSFYF NMAVLGLSVNVAAGLLG >BTH_II0263 Protein of unknown function (DUF1305) family MTAQPKPNPEPAGADAGSHANSKAGRRAGAASNAEPGAMPNADWDAAADA TPNATPSATTQADAARRDAWWRRLRAAPHGYDLFHALRWLDALSPEHAPS GYASRPRDEPVRFGQAPSLAFAAAMLADVRDEAPRPRVAIHGFGLFGPNG PLPSHLTEYAYERAAQHDDPTFAAFADLFHHRLILLFYRAWADAQPTVSL DRPARARFDRYVASLIGPCDARSCDAPRGANAIAPHAKYHQAGHLVRHTR NPEGLVQILQRYFGVRARIVEHVPRWVMLDRAQRCAIRATRPTQPLGGAV LGRAVRDAQSRFRIVLGPLTLDEYRRFLPGGAHAQQLAQWVREYVGIEFD WDVQLELARGEVPALALGSRDGLGRTAWLGERLDPGPARDLVLGYDGRAR GGAGVARARSTAADQAAAADAFDRQPA >BTH_II1355 pANL12 MTVSKRATHTDWVDPDDAPELTDEFFERADEYVGDRLVRRGPGRPLGSHK TATTIRLDDDVLDAFKATGRGWQTRVNAALKEWLKTHKLA >BTH_II1885 ImcF-related family MIRTSLRVFAAILIAILIWWVGPLFAFGIYHPLGPVWVREILVALVLIWG FWPTLARLWARLAMSPRQVKVAPKTKQLDFVDKHLRNLDQQLKERWRKEP RGRWKRWVGTLTREHRAMLPWYLVLGSEGSGKTSLVAKAVSVSGSLQDRV LGSDATYGRGDDFNFRITREAVWFDVGGRWSLRAGADEAEFDAWRKLLRG MRRLRKGAPISGVVLCVDGLDMVDAPLDARKRLADSVRARLEEMREAFGQ QVQVYVALNGLDRLDGAVSTLSLLDASKWAKGVGFSLPDDGAEADAARAD ANWQHALQGLQQRVQQQVLYSAPAATEVSMNHAQLRFVETLSRLQKALVA WLHVALAPGEPHTAARLRGVWLGSMAELAEAHPAGVGSSELPVPSRPLSE LWTPLIRQVSLERDAVRPSGPKSWRGRLGEALRWGAVPLVALSLLLWFGW GYVTERDYLDGVWAQFTEAKRLAQAEASYGNDGGSALIEIANQMRYAQLQ AEDAAQGMATPYFEHGLVAETARETYYRHLQKMLMPELYNEVRRTLVSQV DGTPGDIYQTLKVYLMLCRPDRRSADDVVRWLDGRWDALSGGQYSDDDRR SLLGHVRTLMSLKNVPATPEDANLVRSARAKAAQIPLVTRVLQHIHAQGL PQQVNDISLSRAAGFEASMSLRMRSNVPSTDTAVSGWFTRAGYTDVFLPR LQKSARAMLEEESWVLRDETLSGNSFQIDGLVQKLADSARNQFLQDYISA WQNFLNDVTVRGVTGLDDASQLAAAMMEAQSPLANLLRFAARETTLTGAS DEGNIDSWIDRQKYRFEKGRRQIVGELSGQHYRTVLLPEHVVEEHFQAIR QLAAQLNRNNTIANNPLSRLFEPLYRQLGLVNGALQAGQVLPAQYDAFSR LKETAARQPEPVRGIMLDLVSSGSTMTTRESGALLNRGAAGATKMVCDQG FTGRYPMRRNAQADAGVEDFERLFSAQGLMATYFRDHLAAYVDTSAKPWQ ALRSNGGPNGMVSQSVLNSYETADRIRGAMLDDSGHLRVSTVLRFIDMDS QLSEAQLSVAGQTVRFAHGVTSPHRVDWTNQNTQLAIKLQLKSVDGRMTT LQFDGPWALFRFFDAGQAVGGTGTADRRERLYQTSLGTVRIEWQALTLPS PIWSGILQSFRCPS 
>BTH_II1906 membrane protein, putative MILLSLGTAAFVALLAWQGLGAVAATLASAGWGLALVAAFHLVPLVVDAT AIAVMFRAGEPGSRLGDALRARWVGESVNSLLPAGQIGGPVLMVRYLAQR GARLADAAASITVSTTMQALAQMVFALVGIAAFSAYATHGAASHLRTPAL VATAVLGGCAALFYFAQRRGLFGRGLRAASKLLGPRDWSSLATRADAIDD AIGRLYRERAKVRATFVLSFVGWVVGTAEVWLALRFLDHPVSWLDALLLE SVGQAIRGAAFAIPGSLGAQEGGYLLLAPLVGLPPDAALALSLAKRAREL ALGLPGLLYLHFSERNWQRRRAPLPIAD >BTH_II1207 carboxymuconolactone decarboxylase family protein METRLDYRKANPHALNAMLALEERIAQSGLEPTLIELVRLRASQINGCAY CVDMHTRDARKHGETDRRLATVVVWREAPFFTDRERAALEWTEAVTLVAR DHVPDAVWEAVRPHFTDAELVDLTLAVATINSWNRFAVSFRKLPA >BTH_II1767 Bacterial protein of unknown function (DUF899) superfamily MTEHSMPVSAAELVKRNTTRWPNESDAYRRARDALLVEEIELRRRVERVA VLRRALPPGGEVTGDYRFEGERGPCDFAQLFGDRQTLVVYSYMFGPQRER PCPMCTSLLGAWNGEARDIEQRVALAVVARSPLERLVAFKRERGWRDLKL YCDLDGRYSRDYHAIGADGGEDPAINVFTRRDGTIRHFWSGEMGGWSADP GQDPRGAPDPMPLWTILDMTPEGRGVDWYPSLDYPD >BTH_II2330 PAAR motif family MVKRAIICVGDTTTHGGKVLEGSPTFTLNGRNVAGVGHKVLCPRCKGIFP ILPDLLGRRYPHTIADRDTAVEGMRTACGAELIASQGTGTIDDVGAGERG DGGSPGGSAAAAAAAVAPSPTLCLECLKAAAKNAATMVARG >BTH_II0074 chromate resistance protein ChrB MLRELADAIAESGGAAHLLRAPSLDTSQEAELRALFDREEDYASFVRGLA QARKTLAGQSATELARLLRRLRKDFEAIRAIDYFPDDAATRAELAWQDFV ALVDTVLSPGEPHAAERAIRRLAIDDYQGRTWATRQRMWVDRVASAWLIR RFIDARARFIWLASPSECPDGALGFDYDGAAFTHVGERVTFEVLLASFGL DKDPALLRLGTIVHALDVGGPGVPETVGFEAVLAGARRRAENDDRLLEQM SDVLDSLYAHFAANEAGETGERS >BTH_II1059 phage minor tail protein L MTITADIQQLEPGRLIELFEVDCTEIGADVLRFHGHMQSTSIVWQGNEYK PWPIQAAGFEQTSDAQQPSPTLRVGDINGTISALCVALGDLVGAKVFRRR TLARYLDAVNFPAGNPTADPNEEMPTQQWRIEQKSDEQPGLHVEFTLSSP LDFGGQQLPKRQIISICQWEYRGPECGYTGAACFDKDDNPVSDPALDRCS KKISGCERRFGVNNALPFGGFLCDTMA >BTH_II1358 gp29 MGFAFICEGDTTTHGGRVVGCNTANLVYGKAIALLGDMVTCPRCGGIFPI VSVKSGLNMTFGDRPVATDGDKTACGATLIASQGTATVAPTAGQGSPIGG GKSVIAQARSAPNEPYRGRFQLLDDHTREPLANHAYTITSADGRTVHGQT DANGFTSWLDSDEASSLTFTNSGASPA >BTH_II0110 fusaric acid resistance protein, putative MKYLLESPLACGTFLPHGPLPCAATRVASWRGSKRSVVVPIRRTRTATVA 
SSPSVPSKEFDVSHASVAPLRSRDPLSIDHRKLFFSISTFIAAALVLLIS FTVSFPRPWWALLTVYVTAQPMAGAFRPKVLYRLAGIAAGAMVAIVVAPN LQNSPLLLVLCLALWIGFCIYLAVLDRTPRAFLFQMAAFSSAVICLPYLD DPADIFITAISRVEEMTVAILCVTVAHMVLRPAGVRPVIHERALSFLDDA CRWTAEAFGTHHARLEHEHRRKLAADVVELGMIAMNLPFDQRFALATRET VTALQHRLAALLSIASAAANRLDRLRSLNAVDAETAALIDSLIVDLRASQ EVGDALDIDLASRCRALAAKRLRDPAWTSLLAASLFDRTADFIDTLHAAR SLVRTFGDSDVELERHARLVDGAHRFRLARDHGLALLAGAATTTAIMIYC AVWILLAWPSGSATAAFAALVTCSFAAQDDPAPAIGRYLVATLTTFPLAA LYLFVILPRVDGAGMLILTLAPAFLWMGYIQADPARSARALPMFSCFIVA MGFLDRFQADFATFVNTGLAQVGGIVTTLVVTKLFRSASTRWTAYRIVRQ NWAELAQMADPREALDAQAWTARAVDRLGQVASRVALADPGDALHAVDGL SDLRIGRNIIQVRQRIGRGSARTRHAIERALGEVSNLYRARADASLPVPA SAPLLRALDLAIHSAAHDPAGRDDATLLALVGMRCNLLPNVQPIEGGAR >BTH_II1899 Protein of unknown function (DUF796) superfamily MANALVDYFLQIDGVEGESTDSQYPGLIQIQSWQWAEENSGRWGFGSGGG AGKVEMKDFEFRMVSNKASPKLFLMCATGEHIQNAKLICRKSGKGQQEFL TISFASGLVSSFRTLGNMPISQLGHASGEVDGVLPTDQIRINFAQIEFEY REQRNDGTMGAVIKAGYDLKQNAPI >BTH_II0873 ImpA-related N-terminal family MGMNERRQPGGAASGALLPEDFDALGALGRADIDPAAPAGADVRADARFD ALHAELAKLASPGASGHVDWRAAMSLAAGLLRDRGKDLLVGCYLAGALLQ IGGAAGLRCGLEVVGDLVERHWAAMSPPVSRMRARRGALQWLLDRVDATR DAGAAACGAACSVELVEQLRAAARRIDALLAERDDEAPTMRAVTAFAARL PVESGESGEPGEPGEPGEPGEPGEPGESNSTHAHGSVGAPAERAALSFAE HASIEPAGRAAPRANADAARHPASLDDAAGRERALADALAQLHRIATAFA QADWADTRGFRLRRIACWSSVHAMPDTEADSGRTRIAAPNAQVVDVAKGI DAQGDPAAAVRFAEEHAQAFPLWLDLQRIAARALARAGGDCTGAQREVET AVRALLMRLPGLDALKFADGTPFADAATRAWLAELCTPIGAANAALTSPP PPSPPAPSLPMTGESDRARGDARDANADDAHARARALAASGRLDLALGAI QQAIDRAPSAERRLRARIRLCEFARDHWEHEIPDAFARGVIEPIRRHDLL AWEPELALDGLSAAYALLIRRDGDSAHARAVLNEIAGVDAARAMRLST >BTH_II0259 Protein of unknown function (DUF877) superfamily MNEHAQTRADTRAAAQPAVARDEFAALLQKEFKPKTAEARESVERAVRTL AQQALEHTAGMTTDAYGSVKQIIAEIDRKLSEQINLILHHQEFQTLEGAW RGLHYLVTNTETDELLKIKALPASRNELARTLKRYKGVAWDQSPLFRKVY EEEYGQFGGEPFGCLVGDFYFNHSPPDVEMLGELSKIAAAAHAPFIAGAS PELMQMDSWQELANPRDLTKIFQNTEYAAWRSLRQSEDSRYVGLAMPRFL ARLPYGARTNPVDEFDFEEDTDAASHDRYTWANSAYAMAANINRSFKLYG 
WCSSIRGVESGGAVEGLPCHTFPTDDGGVDQKCPTEIAISDRREAELAKN GFMPFVHRKNSDFAAFIGAQSLYQPAEYHDPDATANARLSGRLPYLFACC RFAHYLKCIVRDKIGSFRERDDMERWLNDWIMNYVDGDPANSSQETKARK PLAAAQVVVEEIDDNPGYYASKFFLRPHYQLEGLTVSLRLISKLPSAKAA SE >BTH_II1435 conserved hypothetical protein MDGRARRDGLFPRASARATSTSTSRPDMRYSVNEGHLELPGQWLDRSVNA LLPAMAEVTGSNLVLTRDELPYGVEFADYVDVQRAKYRKELSGLQMQRDE PGVLDGRPCQFLAFTWNKDDLLIHQMAVIALDAPLVLALTYTSPGRLPDG VRDAIGAALASFRFHRSAQLPAS >BTH_II1062 host specificity protein J MKKLHAERGLKRIYGAKGGGGGGGSSESPDSLHSIARAKVLDVISAGPIV GLVNGLQSVYLDGTPIQNADGSLNFQNYTVDVRTGTQDQDYIPGFPAVER EAGVGVPLTSDAPWVRQIQNTQLTAVRVRFGVPALQRQDTSNGNITGYRV DYAIDLSVDGGSYTQVLAGAFDGKTTSLYERSHRIELPRAKNGWLIRVRR ITPNAHTATIADAINIEAITEIIDRKLRYPMTALVGMTFDARSFSSVPVR SYHVRGMIFRVPTNYDPETRTYSGTWDGTFKAAWTNNPAWVYYGLLLDKL NGLGDRVDASMVDKWALYAIARHCDELVSDGKGGKEPRFTCNCVIQTKAD AFKVVQDIASVFRGISYWGAGSVVASADMPSDPVYLYTAANVVGGSFKYV GSERKTRYTVALVSYNDPTNQYKQAVEAVQDDDGIARYGVIKTEVTAFGC TSQAQAHRLGRWLLLTSRYETGTVSFQVGLDGTLCAPGQVIAVADPKKAG RRIGGRIRAAAGETITLDKAPTIAAGDRFTAILPSGIAQARVVKAVNGDT VTLAARFDADPVPGAVWMVESNELAAQQYRVVSVQESDDNGQIVYTINAT QYEPGKYAAIDDGAQIQQRPITIVPPSVQPPPSNVRLSTYSVVDQGISKT SMVIAWDAANHATSYVAEWRKDNGEWVRAPSTGGLQVEVPGIYQGKYLAR VRAENALGVTSIPAYGVDTQLTGKTTPPPSVVSLTAAGIVYGIDLKWAFP GDGSAGDTQRTEIWYSRTPNRDDATKFSDFAYPQASTSYQGLAVGQVFYF WARLVDTSGNVGPWFPAKGPGVQGQPSTDQSDYEKYFAGQIGKSALGTEL RAPIDLITPPMAGDATIYAGDERLNAGVWSLQAAIAEGDMAVAKKVETVA AQLHSGSNLLNAAVQKETIARVEADRAMAQDITTVQAQVDDNVAAVQTVA KSYADLNGRVAASYQIKVQTTADGHKYMASIGVGIDNENGVVESQVLVSA KRFAVIDEDGSGVIGAPFVVQGGQVFLRQALIGAGWITNAMIGSYIQSDN YIAGRQGWRLDKTGWFEINAADGSGNRLVMDGSSVRVYDGNGVLRVRMGM W >BTH_II2016 conserved hypothetical protein MAASSGTSSPTPSHSAEPPFVPAPLARAHARYWRFNVALIAVLMTIGFAV SFVVPLFAPALAHLRFAGFSLPFYVGAQGAILVYLALIGAYIVLMQRADR TLRRDYDAYADEAKRKEVISTDTDAC >BTH_II1314 Uncharacterized conserved protein MDLTRLTRHGEFEWHIPATGAMRVPGVIYADRKLIADMDDKVYEQVCNVA MLPGIVGASYAMPDAHWGYGFPIGGVAAFDAHAGGVISAGGVGFDISCGV RTLHTGLVRDDIDAVKKTLADALFAHIPAGIGSTGRLRLSAAKTDDMLTG 
GAVWAVEQGYGTPSDLERIEEGGMVRHAKPSMVSALAKRRQRDELGTLGS GNHYLEVQEIEDIYDPACAQRYGLQRGQVVVTIHCGSRGLGHQIGTEFLK AMVIAAKSYGIALPDRELACAPILSDLGERYLGAMRAAINCALANRQVLT HLTREVFAKVLPAAQLTLFYDVSHNTCKVEDHVIDGRRRQLYVHRKGATR AFGPGHPALPDALRDAGQPVLVGGSMGTASYVLAGANAPGGERAFGSACH GAGRAMSRFAASRRWRGRALVDELAARGIVIRSLSDRGIAEEAPGAYKDV GAVVDAAAEAGLARKVARLAPLVCIKG >BTH_II1423 Uncharacterised protein family (UPF0187) superfamily MIVRPREHWFRMLFVWNGSVLKSILPQLALMSAVSVVALLTNGRILGEKV PLNPTPFTLAGLALAIFAAFRNNASYDRYWEARKLWGGVLTAARALTSQA LGYDASADGASFARATAGFVYALKHQLRGTDPAEDLRARLPADWLEPVLA ARHRPVAILHALRGRLAGRHRGGALTDTQLWMLDAQLNELGAKLAGCERI ASTPIPFPYHVLLHRTVYAYCVMLPFGLVDSIGIATPFVSVFVSYTLIAL DAIAGEIAEPFGDGPNHLALDALARQIERSLLELAGLPLPDEIRAGPSYR LS >BTH_II0534 DGPF domain protein MRFMIMVKANATSESGAMPDESLIAAMATYHEELAKAGVLLDASGLQPSL KGWRVRYSGGRRAVVDGPFAETKELIAGYTLIQVRSRDEALEWTRRFPAP FGEHEDGEIEVRQLFELDDFEPGDAVERFRELESKLG >BTH_II0252 Bacterial protein of unknown function (DUF876) superfamily MNEPVLSATPAAALRQRVIWTEGMFLRPQHFQQLERHWERYVGMRCLPLQ GFYWGYDALQIDRELLALGKVALLAATGVMRDGTPFDLSHPDDRPEPLDV PADAKDQLVVLALPLWRGGAQEVSFGGEGNGSGNGSGNGNAGAGFARYVV REHEIADANEVALGPALLQTGRLNVRLMLESELTGDWEALGVARIVERRT DGRLLVDDGYIPPRLVAQRDPVLLRHTRELHGLLTQRSEALGERLSEPGR GGVSEVADFLLLQLVNRYLALTWHAQQDVAAHPETLFRDWLKLACDLSTF TAAGRRPQSLAIYRHDDLRASFGELMAELRRSLSTVLEQNAIQIELRDAG NGMKVATIADPALRDTAGFVLAVRADVPADSLRARFPAQAKLGPVERIRD LVQLQLPGIAMRQLPVAPRQIPYHAGHTYFEIDKGGEMWKQLERSGGLAF HFAGEFPGLSMEFWAIRG >BTH_II0857 Bacterial protein of unknown function (DUF876) superfamily MDNVYWHQGMLLQPQHFQLAELHQQFRIEPWLASAPPHFWGVGALSIAQA AIDRRVVEIRSAQLLFSERSYVEYPGNAVVAARAFDPAWLDDGRALVAHV ALKRLARGANNVTVAASPDALPDAPTRYATLPCADEIGDLYSDHPGAPVR TLKHVLKIVFGHELDALAGHETIPIARIVRDGERLRLDDDFAPPCYAVSG SRALLDRVRCIRDELAGRARQLQQYKNPREMQRAEFDASYAAFLLALRSL NRFGPLLFHLAECDRLHPWTVYGVLRQLVGELSAFSERFDMLGETPDARG GLPPYDHLDLGGCFSRAHALIGHLLDEISVGPDCVATFEPDGERQPAQRS AQLPPDVFADRHLIYLAIRSAHDPDTLAQRFLLGGRIAATDEMPQLAALA LPGVELTRLPGAPPRLPRRGDARYFRIEQTGRPWDAIRRDGRVSLRWADA PDDLHAELVAVRHA >BTH_II1892 conserved 
hypothetical protein MLGISVGMGFRLDQPSILVHEAAVWEALKAAAPSLPLYEAALPKQRAEWL LAGHSVHAVGGGTRSRDIDWTAWVELDGVRKIVSCATQLGDEHEPGGCVR VAIDHRHAAAGGAQENPFGMASGTPPLQQLRSFGVGPAPLAAMGAIGCDW PERMQWMPTRPGTVDAMAQDGTHMGWPADVDLRFFQQAAPDQWARGECWM PGARFELNGFGPRGEGFAGELPRLAPVALVTRNGRPGIERPTFKQQTVWF LPDRGIGVLWWNGAVALDFLLDDSPTMLVIAFKDDAERIDVDALMKFADQ RADLNCTDPLQQADHELMPAVAKGWTWEMILDTEDHPRFAPAPRGYEEVR ARVEQNRRDLVEARDASERLSAFEEANRNAKLPGAPRGGENWRTRLRQAK TPELANVTIRDADLSSLRFDGWKFDDVRFERCTLDRSEWTNCRLNQVHAV DCSFADVKMSDGWWKGGKLQRCNLERSAWLNVEIERISLDECRLDDLKVA GGSWSMLSVQGRGGVRGDVQDVQWNSVSWSEVSAPGWTWTRVRADDLAIV ECAMAGLTVSQCTLAKPSILLTDLSASVWQRSMLTFAVLSHGTSINGARL TDCVFKSSSLQELRADRVQVDHCSFMQLNAQHLHAQQSHWSRTVLDGANV MHAQLTGTSFDRCSLKEAMFYGADMRQTRMRDCNLVRVRTSWIHPPEAGA WRGNLNAGQLDVPRRV >BTH_II1146 Uncharacterized protein family UPF0029 family MRFSPATLMTYTLAATYTRELEIRKSRFIAYAIPVENRDAAMAELQRLRA EHPAATHVCWALLAGGQSGMSDDGEPSGTAGRPILEVLRHHDLDGVLGAV VRYFGGVKLGAGGLVRAYTDAIAATLIDAERIERIAYARLAIEIGYPDEA RVRRWIEQEGHDLVDSAYGMTVRLVIRLPATALDAARDALFDQTQGRAGF PSAD >BTH_II0124 lipoprotein, putative MVSQFFFRREAARRTAVKRIVAAMVCASLAACASLPARKETADLIVKIKV SEGANPDEHERPAPVMVRLYELKSAGAFENADFFTLQSDSRKVLGDDAIA TDEFVMRPGDTRDIHRRADSAATSIGVLVGYRALGKSVWRAVHKLPPVPE DAWYRAFTPRTKIKLNVDVGQQTVSITELE >BTH_II1056 gp14 MATSLRELIVSVTANTTEYDRRMRGLSSTAGSYFNAVRDGGRTADAAFAS NAASVQVTVRALDAARSSIREYAQAAAAAFGVHQLIEYADEWTNLSNRLR IVTRDEIDFAIAQNDVLRIARDTRQPLDATAELYQRIANNASHLGLSIKQ VGPLVTTISKAVALSGVSADTARMGLVQLGQAFAAGQLRGQDLNSVLEEL PGVADAIARGMGKSSAQLKSMAEEGKLTVGNLVEALTRAAGGTDTLFEKM QATVGQTMTRLQTEIVKYIGESDQATGASARLAQGIAYVAEHLDGIVKLG VSLAAGRIAVYFGQSAVAATQAATAWVGARRALVEETIKQHEAAQAALAK AQGDRAAAAAKLQNAQAAEASAQAELAGMRAMRESLAMQSALTAGSIKYT EAKLAEARAVEATAQAHVATARANVAGSQEIGARIAGTPYAAIIARETAA AQQELERAEASLALALQRRTALEAAAKQGTIDKARYTASLAETDRGLAQA ERDVALATQARERAERAATATAAGLKTATESAATAQTALARTGTMMRSVG SGLLAAVGGLPGILATVGTVALGAAANWLLFRDNASSATSSLIDMQAPLD QIIDKYRQLTPLLQESERLRTKQEASRAADDAQSAYRSLATRAAQSVMVP AFGDAPSVVSDADQAALDRFLAGLDRLKTSNLGVDEKSREIGRLIDRFVS 
ATSGGEALREELVRAAGAIDTAGLASQKGAQALAAMDAAARGAAEGVRLL SDANNFFAGGMASEAWEKYVHKLREESDVIGMTARQKAEYEARTKGANDA QARMAGLVAGRADAYKSLEKAIADKDAKAAAGARTNIDNLTRELALMNQQ MVVAAALADFQAAIRTNNFGKFAKYGFGEKSGDVELAAMVARAEANGRGQ QAFDETIASAAAQTARVSTNAAAARVTKGGGVHSLESERMLDNIRQRIAQ LRVEAVATDKLTQSQKDLLAFDQKVTDLRSKRKKLSDDDKSLLRDQQAIR GMYEQASQLEKEVRYRDAINKLKERSAQIDAELGDYAAERQRDVQRELGA MSMGDNARELNQAINRVSDEFRRRRDELTKGARKDGTLGSPEYIAEIERI NTAEAEQVARERGYLEQRLALQADWRVGVKRAMAVYQESAQNAAQMAEEA LTSSFRNAEDALVSFAASGKLNFRGLIDSMIADLARFSARAAMSQVFGAI GSALGFGGVSDAVGALGGAASAAVGSNAYGFHLATGGAVWGPGTSTSDSI PAQLSNGEFVVRAAVVSQPGVRAHLERLNAGGRSGFARFAAGGLVGGSAG GGDSPARNGGISVSAPVSIEGGSSNPASLIAVGEFRKMLEQMIRELIQRE RRQGGTLWRAQNGIAG >BTH_II1647 fusaric acid resistance domain protein MKHEPALQRFLIDPQRWQERLRSAGRLARDWAGRDGLVWLHLAKTVAAAL LAMGIAMLLDLSQPRIAMTTVFVLMQPMSGMVLAKSFYRVIGTGVGLVAA LALGGLFAQQPELYMAGITLWVACCIAAAVRNRHFRWYGYVLAGYTAALI GLPAVMAPNTLFLSALTRAAEVAVGIFCSGAVSALVFPLSSSDALMRTLN ARHADFVAFAASTMAGTVERRDFERRFADFVDGIVGFEATRAFATFEDPH IRARSRRLARLNSEFMNACARLHAFHQLLKRLRANHAAEVLAAIAPHVDA LSGALSALRRDLAQPRATPSPSLSPAPALVELSAYLAALAKRARASRRAL ETLAPAGVLDFDTAIELLYRFVDEFLGYAQTHASLALDSHVLERSITRYA VKTNRYFVGFTFLRTLVAMGAMSAFWIASEWPSGALAVIGTAIACALSST APRASRFVAQMAAGAALATLTGYLYVCHVYPNIDGFPLLCAALAPALAAG AYLATRPGKSGYGIGFVVFFCLLAGPDNVIVYAPDVLINNGLALVVSMLA ASIAFAVVFPAEMPWLTGRIARDLRRRIALACDGPLQGLDQRFQSSSHDL MSQLRNLLMKRSRRHRDALRWMLATLEVGHAVIDLRREMQAFTAAQPAQA LRWSALIDAVRDALPRLFETPDAHRLARALKSVNLAIRAVQHTQHLWYAV PDERRRMQRIVSCLHFIRSALIDQDAPFNRGSRARERVRARRM >BTH_II0861 pentapeptide repeat family protein, putative MSDTVHAALAALTDTRTLSGVDLSDADLSGLDLSGCTLHRVILRGANLSA AQLDATRWLHCDLTGARVDGATLGESSWHAVALRGASLRATTGDAFAMTD ADLGGATLTDALWARATFERVDFSAAQCARAKLLRCEAADCRFERTDFSS AELERFSAMRADLSSARFDATRLTNALLCEADLRGQRFARCDLTMTHLNG ATLAGSDFSGTSLVQTMFFAADLEGATLAGARGRHVRFADATLVGARLAE AVFDECDFARARLSSANARGLRARMSLFSHADCAGATLAGGHFVYCDFSH ATLSRADCTDADFSHANLHGIDDRAARWDGACKTGACATDPTLALAERWT APER >BTH_II1385 YceI like family protein MSVAPVPDARGAPGAPPGTPRLFVAAASAEVVRDAEPAGTLDASPHAAPR 
YRLDPRHSGVTFRVDNFWHAHLTMRFTRMRAELAGIDDDGLASRVDVTVD AASLGANVPFVAALLKGSAMLDVARYPEIRFVGTRFERTGATEGRLTGDL TIRSTTRPITLAVRFAAGQPGTGAREGVERGAARPRAEWGQRESGSRDAR TLAFVADGHFSRAAFGLSRWLPAVGDDVRMRIRAEFVRERAEP >BTH_II0009 YceI like family protein MKPARWAERFACAIALVGMAASCTPLRVVTHTVSTTEAAVPAGRYTLDPH HWSIVFDVDHFKYSRFTMRFDRASAQLDWRAGGLADSGVTASIDAASIDT NVPLLDKLVAGSDALDAARAPRIRFDGTRFAHTSATQGTLTGNLTIRGAT HPVTLAVTFNGYGRNPLTKQDTLGFSASGTFSRAQFGVTSWYPAVGDDVR VRIEAEFVKQGEAPAT >BTH_II1592 conserved hypothetical protein MSISIVRLGTPRADDEGVRIGTVRRPPRGVPKDEFASRDYYDVWLPTLSP SPELVAEAQAAESDAEWKAFARKFRAEMKHGDASKVLDVLAVLSTTSNFA IGCYCENEARCHRSVLRELLEERGASIRS >BTH_II1413 DoxD-like family protein MAGRSRQGGGASRVTRIARRLGSQPARARCVARSAHGARIRLSTTMRQTF LAPQKDLLLLLSRILLVILFVIFGWEKLLNFSGTVQFMGAEGTPLPSIAA VVAIVMEFFVGIAILLGFCTRPLALLLGLYTVGTAFIGHHYWSMPAAEQM NMMIHFYKNIAITGGLLALCAAGPGRYSLDRG >BTH_II1774 serine protease, subtilase family MSCTRYRKNNDHPKLAQSMGVLVALVGAGIVPAHATCTAAGTTVTCSGAA DPLAPSYANSGNNLGVTVNSGASLGVLLGVGGTALSLTGSGVTLTNNGTI DPTVLGFGLGVLSSGAVVGNASPSTTTVTNNGTMNGSTGVSISGLTGMAL AVENGTGGVSNITNTGTIGSTPLAGATLLGPDSPVVAAYGGGQVNFSNSG TITGRVAFQSNGTAGQGNTFVNSGTIDGSVSMGTNSTNTFTAMTGSTVSA AGGTGLSLNIGVGSLTLGFAATGIVDGGAGGNNTLVLQQATGGPATGAIA VDNYINFNHLDVTSGAWTISGASSAQDATLSGGVAIIGNNASLGTGAITG NGGALQAGAAGLDVSNNVALGAGGLTVQGATGLTLSGAISGSGALTKNDT GTLTLTGANTYTGGTTINAGTLAIGAGGSLAATGAVNLAGAGAALDISAA GANQTIGALSGVAGTNVNLGANGLTFGDGTNQTFAGAIGGTGGVTKQGAG VETLTGANTYTGGTTINAGTLAIGAGGSLAATGAVNLAGAGAALDISAAG ANQTIGALSGVAGTTISLGANTLGFGSAANQTFGGSIAGTGGIVKNGTGT ETLTGANTYTGGTTVNAGTLALGAGGGLSGSTTVNLAAAGAGFDISGATG NQTIGGLSGAAGTTVALGGNSLTLAGSGSATFGGTIGGTGGLTFAGTGTQ ALTGNNTYSGGTTLAGGTVALGSGGALGTGAVTVAAPTTIDTTSAVNLSN AVALNATATVGGTQSLTLSGAVSGPGGVVMNGSSTLTLGGANTYAGGTTV NAGMVVVGNGSALGTGGLTVNGGGVSLGGSSVTLPTLNGAAGGTIDTGAG SLAVTGGGSFGGALTGGGSLAVSGGAPLTLTGANTFTGGTTIASGGALQI GNGGTTGSLAGNVADNGALVFNEAANLAYGGAISGSGLLTQAGSGVLTLT GASTLAGPTTVAAGTLAVDGSLANSTVTVQNGATVTGTGTLGGLIVASGG TASLPQPGQALNVAGNVTFEAGSTLQVAANPQQSGSLAATGSATLNGGTV 
QVLASQASYQANTTYTILSANAGVAGQFAGVNSTYAFVTPTLGYDANHVF LRLAPNGNAFTSVATTQNQTAVAGALGTLGAGNPLFDTVLVSDAPTARGA FSQLDGELNASLQSMLLSDSRYVRDAVTDRVRQGLAPGSGPLAALSAGGA ALCDDAGGGAARHDAMPPERRLGSRDSCVGRTPYRPVVWGQAYGGRSRLA SDGNASTLNRSMTGFIAGADVALNDRWRAGAAAGVTHSSLDNDLNASASL NSYYVALYGGAQYGAWGVRGGAVYTWYRINADRSPAFANFRDHDSAGYDA NSGQVFGEVGYAIPVGRFALEPFAGLAYVSLHTDGYQESGGAAALKSGAQ TSNVAFSTLGVRAATALDVLAKGTLSAHAMAGWRHAFGSARPTSTLAFAR GGASFQVAGVPIARDSAVLELGIDASVTKNLTLGVSYSGQYGSGVRDNAV LGNALWRF >BTH_II1893 Rhs element Vgr protein MRLIELRSPLLDPDVVALSFVVHENLSQEPSYQLDLLSRDPNLDFDELLG STLSADIDLGGGDIRTFNTHVFGGHDTGQMSGQYTYTLELRSWLSFLAEN RNSRIFQDLSVPQIVEQVFQGHQRNGYRFELEGTYEPREYCVQFQETDLN FVKRLLEDEGIYFWVEHEPDRHVVVISDTQRFEDLPLPNETLEYLPDGEE SRAIQGREGVQRLQRTRRIKASNVALRDFDYHAPSKQLDSDAQIEQQSLG GIPLEYYDYAAGYRDPEQGERLARLRLEAIQADAHALGGEANARALAVGR AFTLVGHPALSRNRRYYVTNSELTFIQDGPDSTSQGRNVAVKFRALADDR PYRPLLVTKRPRVPGIQSATVVGPEMSEVHTDKLGRIRVHFHWDRYKTTE ADASCWIRVTQAWAGKGWGVLAMPRVGQEVIVVYVDGDLDRPLATGIVYN GENPTPYDLPKDIRYTGLVTRSIKRAGGIPNASQLTFDDQHGAERVMIHA ERDLQQTVERNSSTSIAQDLNLSVNGTSTSVIGIKVSFTGISVSYTGLSV SFTGVSASFTGVSTSFTGVSTSFTGVSTSFTGVTTGFTGVSTSFVGVDTS FTGISTGFVGVSTSITGSKNSVTGVSNSMTGISSSWTDVSMSTTGQSQSI TGVSLSYTGTSNSMTGTSTSVTGTSTSITGTSMSNTGSSTSITGTSMSTT GSSVGTTGSSMSATGSSVSTTGSSVSTTGSSMSVTGFSFSYIGASYSDVG IDLKKLGMQTKN >BTH_II0302 Uncharacterized conserved protein MVPIGKVASPRVELIDDHWGAIESTITIDSPALSDDALMGLDTFSHIEVI YHLNQVPSQEIERGARHPRGRADWPKVGILAQRSKGRPNLIGVSRCQLLS VEGRTLRVRGLDAVDGTPVLDIKPYLEEFGPIGRVSQPAWSHEVMKDYYI LQPSE >BTH_II0092 PAAR motif family MRGVIRIGDSTSHGGRVVTGREGSTVMGRAVACVGDRCTCPMNGHEHCVI VEGDEGVRIEGRAVAFDGHRTSCGATLISSIPTSGRT >BTH_II0706 Uncharacterized ACR, COG1434 family MTLALLTAIVFVAALLRLAKRRRASLALFVASAAAFFAIGCGPVPAWLLR ELQAPYAARPAIAWGERNAIVMLGLATEKIAATGAVEPGTFSYSRVVEAA SLYRDCRRARANAGCKILVSGGDARRNGAPEASIYRDALIGLGVDAADVL SEPRSMNTWQNAQFTRAVLDGYRADRVLLVTSGVHLRRSALYFAHFGVAA IPVRAEYLQAVTFPLPLAYNFSVADLALHECLGIARYRLYDALGWNPART QPGDA >BTH_II0619 GatB/Yqey family protein 
MSLRDQISEDMKAAMRAKESERLATIRLLLAAIKQREVDERVTLDDAGVT AVVDKMIKQRRDSISQFEAAGRADLVEKEQAEVAVLTAYMPAQLSEAEIA AEVQAAVAQTGAAGPQDMGKVMGVLKGKLAGRADMTAVSALVKAALSK >BTH_II0251 lipoprotein, putative MRLRFSGSVALGGALLLAGCGATERSVAVPYSIALDVAPDVNPDINRKPS PIVLKVFQLKTASAFESADFFSLQDKPQSVLGADLLGVDRIILRPGDART LHYRGNVDAGAIGIVAEYRVLEKNRWRMTVPLPRAKQLNLYRFWQTSPSE MKLQVAVKNGGIGLGGDSRVRR >BTH_II0137 Protein of unknown function (DUF1316) subfamily, putative MRHPEGRHRAAYLPSLLDRLQDDAPHSFSESPDAYAPSADEMRRIVQRDL SLLLNTSNLDDEVDAGRYPLVAASVVNYGVPPLSGSYLHDPNRETIDRLV RTAIVHFEPRLIADSLAIRPLAAQQGASYNKLTFEISALVQWSPYPLELR IQSTFDLELNRVTLDKTTLNGK >BTH_II1712 conserved hypothetical protein MSEAAAGVANETKRDSAFGRDLLAGLRRAPRSIAPKYFYDAAGSALFDRI CELPEYYPTRTELAILKRHAHEIAAQIGRDANLIEFGAGSLSKIRVLLDA CAASGPPARYLPVDISAEHLAQSAAALRDAYPWLDVQPVVADYLQSEQLR AIECVAGRRVGCFLGSTIGNFSPDEASAFLRHAASLLKGGGLLIGVDLVK DVSILHRAYNDAAGVTAAFNLNLLKRANAELGADFALDAWAHRAFYDVDQ QRIEMHLVSRRAQTVRLAGYAFRFDAGETLHTENSHKFTVDGFRALAQAA GFTPGTVWIDDARLFSVHWLESRG >BTH_II2144 membrane protein, putative MHTLYLIAIVAEAMSGALMGMQRGMDRFGLALVGAVTALGGGTVRDVLLG RYPLTWVAHPEYLLLTLAAATFASMTATHVARLKSLFTTVDALGLAAFSI IGCDVAATVNGSPVVIVLAGAITGVCGGMLRDVLCNEMPLVLRKELYASI ALLTGGLYVGMKALGVAEGLATVVALIAGFALRMLAVRRGWRLKAFHAAE AG >BTH_II0360 carboxymuconolactone decarboxylase MYPQPGPEIARRRRELAPEALAAFRAFSNSVFADGALPTKTKQLIAVAVA HVTQCPYCIRGHTKEALKAGATEGEIMEAIWVAAEMRAGAAYAHSALALD AMKEEAQSREAQHGEHH >BTH_II1371 Protein of unknown function (DUF355) superfamily MLQLLTVAIDKPETANFILGQTHFIKSVEDIHEALVGAVPGIRFGLAFCE ASGKRLVRHSGTDGALTELACRNATAIGAGHCFVVFLGDGFYPLNVLNAI KAVPEVCRIFCATANPTEIVVVQSDQGRGILGVVDGFAPLGVENDEDVRW RKELLRNIGYKA >BTH_II1531 Rhs element Vgr protein, putative MPTTGRSTATCIPCADGGLTSWQIGFSSALHFLRHRRDEHFWLDRDAQEI LSEVFNRYPPLQGAFRFELSGALAKRSYCRQSETDWHFVNRIMEDEGLYG YWIHDEREQKTTLRIVDRVEALPAAKPIDFYRGNAGDEIGGFTQWATLRQ LNSVHVASRSGDYKRPSTPFEVRQSVQTTRYVEQTNWRTQEQKAIPYPPL EDYSAGAYRYPDSDRGAAWARIRAEEYESRSRRYAGVGGSRWIDAGGRFV LNDHPAHAESDPKAREFVAMAARWTIENNVSIARSSRHFPYSLQADIERA 
RSGFGSAFAVAPHPQDGATGLYVIEVEAQRTDIEYRSPFEHRKPAMSVEM ATIVTPNGEDVWTDPLNRVRARFHWDRQSPPDAFETSPPLLVAQSDTGPQ YGGVHVPRRGETVYVDFVGGDCDRPYIVSRAPGGATPPMWHSDGLLSGFQ SREYGGGGGYSEMQLDDATGQVRARVLSKTRGDYSHLTLGYGIVQQGNTR GRYLGSGFTLHADQYGAVRANRGLYIGTHATRHDAEQLDVDAARDQLKAA ADVLARQSSLSEQHRAESLKAGHDALTELTDATRQPVEPGASGGRTSGGG TGSANGFKVPAMLLGSAGGMGLTTFQSLHASADRHVNVVAGQSAFVATGK SFVASAGEKVSVFAQSGIKLFSKDAVQIESHRETLDLIGQKTVRLVSATE RIEIAADKEILITSGQAYIRLKGGDIQIHAPGKIDIKGSLHNFSGPASMP YPMPTQPDAVCVPCMMKQAAGRGAFVAMGA >BTH_II1099 Protein of unknown function family MALPQDSVNMALFCDFENIALGVRDTKFEKFDIKPVLEKLLLKGSIVVKK AYCDWDRYKTFKAAMHEASFELIEIPHVRQSGKNSADIRLVVDALDLCYT KAHVDTFVIISGDSDFSPLVSKLRENAKRVIGVGVKNSTSDLLVANCDEF IFYDDLAREQQRALAKRDARKAEAGAKRTADDDRHRRHDGDARKAEAISL AVETFDALASERGESGKIWASVLKSAIKRRKPDFNESYYGFRAFGNLLEE AQARGLLEIGRDDKSGTFVFRPIQTVAVEFAGAAEAAATVDDNHAGAKPS GKKAHGKGRGAKKRVPEQMPLIVETGAEAHADADIEADAEIAEANVSAPR DFRRERTAEAARDTHPAHEAHEPREERAAREPARTDESAEAQEAEAASPA RSRTRKTAAKKAKGTKARAADTSPASPAVAAEANAAGEPPPEAAAEAAPA KAPRKSAPRARRPRKTVATNES >BTH_II1769 glyoxalase family protein MPTAVKPIPEGMHSLTPHLICDGAAAAIEFYKKAFDAVEITRLPSRDAGR LMHAAVRIGDSTLMLVDESAQCGALGPRALKGSPVFIHLYVPDVDAVVAR AVEAGAKLTMPPADMFWGDRYGQLEDPFGHRWSVATHKRDLTPEQIREGM ENCVPPGQ >BTH_II2167 conserved hypothetical protein MPGAGGGRSGAVAAGGRRSASAGRRGGRCGFSGLREGAAFGHVSAAGNVT VRRASAWVPRPWPWPMRPDAPGVFACGCRSLYHRHRHRHRHRHRHGHGHG HGHGHGHRHRHRNRNRHRHRPHFRCAPHERSRNCVLEAPQGGSMATNTKR LMAAALGAALAFGAPFARAASFDCARAASAAERAICGTPALGELDVRMAA YYELLQNARPADEGIAYREFRDALRDGQQSWRQRVRDACGARIDCLTNAY TARIAALRGVAAERLALRMTGGSAPSPDAAGATYAIEGESIKLTNGESVR PAAPGSAAKRVTTLVARSAPATIAGGPLEAVLLSDDPGGSGRFLYVATVQ PGGGAPAVLLGDRVKPVSVSIERAATGGAIVVVEYLDRPEGAPFAQAPTV KVVRRFALEQGRLVEQRG >BTH_II0855 lipoprotein, putative MKTLCSMLARIAALIALAALSWATALYFDWPWWGAPAVFCAALAVWPLFG VARRALRAVRARAQLARLDGAKRLPADRDAPLRRVVARWRAALDALERAA PAPGLRGRPRDALPWYLVIGRAGAGKTTALARARIASPLRRARHDASLAP TEDCDWWCFDDAVVLDLAGRFAEPDATDDDRRAWGALLEQLGRARARRGI NGVVVAIDAPRLVTADRDALTIDGCAVRERLEQLIRLFDRRFPVYVLVTQ 
CDRLYGFDEWAAQLAPEQRERAFGYLGDHDADAFVAHALEHVDARLAALR IALAARGEPPSPRALMLPHELARLRPALEALARAAFGPNVYQETPYLRGL LFSSGRQAGGAPSLTLPDWLDAAPARTPGDAGLFLHDVFARVLPGERDAS RPVERPSRSRLTPRRLALAAWLFACVAAGLLMSASFVGDIRTVELIRRDY PAHPRFTGELAHDAATLARIARVIADVERCDERRLVRRLARPIADATPVG RLEAQLKRRYIAHYRRAIEPAADRLLFGEPDGASGARAADDGDIAVRIRN LVRYVNLMQARRRGADRETLARMPAPALARARDGGGASRADGARSAGGMS DTGDSGDTSDRTGRLSALIGALAVDRIAWSAPDDATLAARIAAAQAQLER LAYRDPDGAWLLALPDASAPRDVTLADFWPAAAPRTPPPEPRDARVPAAL TAAHRPAIDAFLDEMAQAVANRPKFAFHRDAFDTWYRARRIDAWRDFVAR FPQGEQGLATQAQWRAVIDTIADRRDPFAALLARVDREFESVRDDALPPW LRFVRTASRMLAPARVPAASGGLGAALGSISRSGGRALREALGGAPEQGR RTLERDAALRDALVDYERRVAALAADALAGPGAAYRLAADFHGFGVDPSV EASAMRAADDALRDVKRLAGERDVGGDVVWSLVGGPLHAVIAYVERQASC ALQEDWERDVLWPLRRAATREDADDRLYGPQGAFWAFVDGPAKPFVRVGA ARASAVDTLGYRLPFTDAFLPLVDDAAARRVAQARRADEQRARQQAAAEL DERIASLGKQIDAIRAQTVRIEIVAQPTDVNPEARARPFETVLTLQCAPQ ARTLANYNLRVSERIDWQPDQCGDATLRIALGGVTLTRRYAGPLGIARFV QDFRYGVRRFTPGDFPDAKAQLERLGVRHVDVRYDFSGHDALLAHVERID ALERARRDDIARQRRIASRQDDGASDSGAAAIARAGGFAANPANPANTAR APSASSRSSPSPAPGASAPSGTPDAGLPRRIGACWGDPARDGGMRP >BTH_II1714 Domain of unknown function MVYVKGDAHRQRRISLSRAGAESARTRRWPTHARRFRLPNFLQLTAFATG IVRSLELQNSRLARRLRRRRGVRCRRHPTCFATGGAFMTQSSHTVKIPGP DHPITIEATGERVVVKAAGQTLADTRDALTLREASYPPVQYVPRKDVDLA QLERTTHESHCPYKGDASYYSIKGAGARGVNAIWSYETPHDALKRIAGHL AFYPDRVDSITIG >BTH_II0312 Protein of unknown function (DUF636) family MRFARCPPIVETGRSPCDPIAGAPHARDNGASRRQLRAVSIYCAPTAGET MSATRSLRCLCGAVGVKLTGEPAARAHCHCMACRDFYGAPMLSATAWPAG QVIVAEGDVASFAHPTRRLSRAFCATCGETVFGTNRLGMRVVPNAIVARA AGGELPAGLRPTMHLFYRHRIVDVRDDLPKYLDGWDGPTDDA >BTH_II0884 Uncharacterised protein family (UPF0261) superfamily MVPHRKQIYVAATVDTKGAEAHFVKDRIADAGLAAVVVDLSTRAPGLAAD IGAAAVAAHHPDGAAAVFCGDRGRAIAAMAVAFEHYIRSRDDVAALIGIG GSGGTALVTPAMQALPVGVPKLMISTMASGDVSAYIGSSDIAMLYSVADI AGLNRISRQVLANGAYMIAGAVRDMQPLPADLKPALGLTMFGVTTPCIQA VTSRLDARFDCIVFHATGHGGHAMEKLADSGLLDGVLDLTTTEVCDLLMG GVLACGDDRFDAIARSKVPYVGSCGALDMVNFGHIDTVPPRYAQRLLYKH NPQVTLMRTTPDENRRIGEWIGAKLNACDGPVRFLIPEGGVSALDAPGQA 
FWNPEADAALFDALEATVVQTENRRLVRVSAHINDPLFADIAVEHFLSLH AAHRN >BTH_II2213 dedA family protein MLHDLVARFGPLIVFVNVLAAAIGLPVPAMPTLVLFGAMATLHPGAIGAQ LAPVLALAVLAALIGDTVWYVAGRHFGGRALKTLCKLSLSRDSCVKKTER FFGRWGVRVLAVARFIPGLSLISVPMAGALGTRYRIFVGYDGLGALLWAG CGVAIGFVFAKQIDWLFAGANQLGRTVLVVIVALLAAYTAVRWMRRRALI RQLANARIDVDELDRLLQADPTPVVFDARSPEHRKLDPYAIPGAQFADER DLRDIVAHYPATQKFVIYCSCPNEVSAAVMARRLKQAGFADALALRGGLD AWRDAGRRLIELDPQPGGEAPVRAPAPKTA >BTH_II0863 Rhs element Vgr protein MSSSHRHYADTALADVAALTDAASRADAAPLANARRFTFASTAYDAATFD VVDIDGRDAISQPYRFEITLVSRSVRIDFAKMLSCEATLAILPPFGEAGT TRYAGVLAEFEQKERFRDFTVYRAALVPRLWRLSLYKASDVYLNEQTIPD IVKRVLRAASFGKRNFRMRHRGVYRKRSFVCQYDESHLDFVSRWMEKEGL YYYFEHDGRREKLEIVDDRRDQPGPADDLALRYLPATCLDAGIESDRVQA FACRATPLPREVVLRDFNHRKAELSLEVREHVAHDGVGERVSSDEHFHTK DEGRRYAKLRAEALVCEGRRFAGESTAAGLRAGRFFALSGHYRKDFDGRY LVTAVTHRGSQAHLLFPDLDAPFGATPGEPVYRAEFEAIAANLQYRPPRT TPKPRAAGVVSAIVDGEGSGKRAELDEHGQYKVRFPFAHTAHPTNKASAR IRMATPYAGDDRGMHLPLLKRTEVKIAFDGGDPDRPVIVGAVPNSSHRSV VTRSNPDAHRILTEHNQLYMKDGSGAATWLHAPNNHIGIGAVGPGDGLAL LTSGNKFDFSLGNAYSFSGGLKCSVSMGGNTDIYVGVRNSLDVSANFLTT LQGNLRWMLPGSRSFEINDSASTLLQTLHKQSATGAIRLSAGQDASALLQ KQLDKLKGTVRKFMIVSGLANAGAAATAAGLIKGGGALADLPWAGFGVSA AQFAGATGFSTALMATSRTLLSKIAKLQEALPLVADLSLDKQGIALAAKN LTHATRMSLTVDGVSWSTHAKGPGAAGAAMSVGKGRWGVEAAEHAHVHAN DTLLFAVPADPTSKFDLKELIGLRRDLDECVKGIADLEADISENEVLSTD QNTFGVGALVPTPPSPANAVAAVAIKAKEAKLVELNAKRKLVATKIDNLQ QKLAKHAKNLSAARMSASDAEVGFKGNRLVATAEGVTLAHAQGKAKLDVR EAKIGVEAGKSSLELDESKLAAGCGGASLKLGSDGAIDVRATNVKLNGSA SLKLDGQLIQLG >BTH_II2019 conserved hypothetical protein TIGR00645, putative MPRRCKRPGRRVTITRFFSKTLHIPASPLRPTMSAAPTDPRAARPARRKM RPLPAVIFMSRWLQVPLYLGLIVAQAIYVFLFLKEVWHLLSHATGLDETN IMLAVLGLIDVVMISNLLIMVIVGGYETFVSRLGVEGHPDEPEWLDHVNA GVLKVKLSMALISISSIHLLKTFINPDQHTTHAIMWQVLIHVAFLVSALV MAWVDRLTTHTHPQHFHEASTDPSAPREPAQQSA >BTH_II1369 YbaK / prolyl-tRNA synthetases-associated domain family protein MSMSVTLQDCLRQKASRYEVIHHPYSHSNMEAAAAAHIPGDRLAKTVLLE DAQGYVAAVLPTTHAVRLSELWAKTGRHLVLAKEVELRELFKDCDVGALP 
PVCMAYGMQTFLDDSLARQPDVYFEAGDHEELIHMDRDEFLSLMDKAERA SFSHKIQGVVS >BTH_II0364 Protein of unknown function, DUF488 superfamily MKHAIEIQRVYEHAGDDGHVHFLVDRLWPRGVKKESVKLDAWLKDVAPSN ELRDWFGHDPQRWDEFRQRYEHELDVSPASWQPILDAARKKPVTLLYGAR DTEHNQAVVLRDYLLRQLQLHRH >BTH_II0028 conserved hypothetical protein MKAFDRDASEPRIGLAYGPGMAEFAARNAHLVDYIEVPFEQLRFSPAVAE LQQTIPFVLHCASLSVAGFVPPDDSTVDAIERTAVQTGTPWIGEHLAYIS ADPVGEALGGTGEPTSLSYTLCPQLSDETVRRVVDNLAALRPHFPVPLIV ENSPQYFPIPGSTMGMTDFIRAITDRCDVGLLLDLSHFLITAHNTGAEVH RELARLPLERVVEVHLSGMSVQSGTAWDDHSLPASPILFELLERLLDVAR PRALTFEYNWSPYFPLSVLTTHIERARQLMGLA >BTH_II0495 Protein of unknown function (DUF445) superfamily MLDDKERELRNSKRRALALLLAAAGVFAATLFAPRGFWIDGVKAVAEASM VGALADWFAVVALFRRVPIPLVSRHTEIIPQNKDKIADNLAAFVREKFLD PASIVALIKRHDPAARLAQWLATPRNADVLGGYSARLVAFGLDMTDDARI QTFVKDAFHALLDRIDLSQSAGAILDTLTKDGRHQALLDDGIAQIVEFLR DPDNRASIATYIVDWLRYQFPKMEKLLPTNWLGEHGAELISNVVTRVLTQ IAEDPEHRLRRGFDDAAARLVTRLKSDPAFIEKGEEIKRYLRDGEAFNRY VKDMWDQLRAWLKADLARDGSIVRQRATALGGWLGERLAQSPQLRDSMNE HVERAASEMAPEFAEFLTRHISDTVKNWDAREMSRQVELNIGKDLQYIRI NGTLVGGLIGLGLYAVSSIARWAGALPY >BTH_II1928 probable transmembrane protein MTVSQTCLLITVLMPFVWTMCAKSSNRYDNREPRRYLGQLEGWRARAFAA HQNSWEALALFTAALVVAWHNGANVQRVDQLAIVFVASRVLHGVLYLLNW ATLRSLAWTVGLVCVVWLFFAAP >BTH_II2008 DNA primase MSNTPMSEFERASVALGYVPADDRDTWRHAGMALKAEFGEEGFALWNEWS QGAQNYNARDTRDVWKSFKGGKITINTLFHLAKLGGFDPRAHRAKPVDPE QRERQHAERAAREAAELAALAEKQQAASALAESIWSAAEPAPTDHPYLVR KRIPADALRVYRGNLSLGTAACDGALVIPARDADGQLWTLEFILTDGQKR YLPNGRKAGCFSLIGGPVSSVLLIGEGYATCATLAAVTGYPAAVAFDAGN LHAVATALRGRYPDARIVVCADDDHATNGNPGVTKARAAANAVGGAVAVP DFGPNRPAAGTDFNDLAAHLGPDAVAAAVRAALVPAGASDADRGKASPFA TKPAKRPKTARAQDGTWRFEVDDEGVWYHGFNNQGDPLPPHWISTRIDVI AETRNEMSSEWGYLLEFTDRDGIRKRWAVPAGLFAGDGTELRRMLLDMGV KLGVTQTARTQIANYIQMARPDERVRCVPRVGWHHGAFVLPDRVIGTGKE ALIYQADTPIQSQFKERSTLDDWRRDVAAYCVGNSRLLFCVATAFAGPLL HFSGLQSGGFHLLGTTSKGKSTGGVIAASVFGSPDYVRSWKATDNALEAV ATQHSDALLILDEIGQVEPRLVGDVIYMLANESGKARASRNGSAKPVLTW RLLFLSNGEKSVSALMAEANKPMKGGIEVRLPAIPAEVGDMGVVEELHGF 
PTPAALIEHLERHAGKHYGTAGPAFIEFASAQADELAEHLRTRVDELVTE WVPEGAHSQVARVAKRFCLVAVAGELATAHGLTGWPEGASVKAARRCFEG WMELRGGAGNSDEAEAVRQVLHFLVAHGDNRFVWMNRAQDDHRPNVPHRA GFKQHVKRDERRTAIASDREYYAEFGGKMGADDAEHVETEYLIEAAVFRK DVCAGFDHKMVAKALMKRGVLMPRSDGYPYRQEYIPGHGKFMVYRVRPSI FTLEL >BTH_II0391 Uncharacterized BCR, COG1937 family MSHTIREKQKLLNRVRRIKGQVEAIERALEEECGCGDVLQRITSCRGAMN GLLAVVLEDHIRTHLVDAEAHDDHEGSASEQLIDVVHSYFK >BTH_II0580 ebsC protein, putative MPAVRAATSDAAAAFSRAGAGHRAARAVTEATSPAQSPCQAPSRPIPLQP PQRILHNRSIHSAPRTIAYPFRGFTKDAGARIAEPALAPADKTKAGTPAA RRATNVSVESVRQFLAEKAPDIDVIALSESSSTMTLSAAWDIKPAQIAKT LAMKVGDAHVLLVSCGDSRLDNQKIKAALGGKAKMLSAEETVAVTGHPVG GVCPFGLSTPLPVYCDVMLKSYDFVVPAAGSTHAALRIDPVRLAELVEAE WVDVCK >BTH_II0533 DGPF domain superfamily MSYMLLIVEPRGQRAARTQAEGEALYERMRHFAGELQSRGVLIGAESLVS DDKSTRVQVRNGEVRLVDGPYAEAKEMVGGFFLLDVGTQGEALAIAKDCP AAEWCSVEVREIGPCFR >BTH_II1564 dedA family protein MWHFPQAVPASLGPWAVFASVLVTQLGMPVPAVPMLIVGGTMAAMGQASY ASMLVAAVAATVLADSMWFFAGRARGRRLLNALVRFSLSLDTTLRVARKV FEKHGAPLLVLAKFLPGLGLVSAPLLGTTAVAVWVFLFWDVAGASLWASV WLFGGAALHDEIVRLMQWVSASGGTLFDAFAAIFVTFLLYRWAMRMRFRR WLAKIRISPQQLDAMLKSAAPPVVFDARPRAVREKEAYRIAGAYPLDLDS PDPLHPDLTTRPIVVYCVCPNEATAKRIVSQLHRKRIRHALALKGGLDAW EKQGYPVEPLPADFDAARYVAPPLAEPAEPAELDAGGYPMRAGLTD >BTH_II1894 Rhs element Vgr protein MRLIELRSPLLDPDAVALSFVVHENLSQEPSYQLDLLSHDPDLDFDALLG STLSADIDLGGGDIRTFNTHVFGGHDTGQMSGQYTYTLELRSWLSFLAEN RNSRIFQNMSVPQIVEQVFQGHQRNGYRFELEGTYEPREYCVQFQETDLN FVKRLLEDEGIYFWVEHEPDRHVVVISDTQRFEDLPLPNETLEYLPDGEE SRAIQGREGVQRLQRTRRIKASNVALRDFDYHAPSNKLDSDAQLVSPPNL EGIPLEYYDYAAGYREPEQGERLARLRLEAIQADSHMLVGEANARALATG RAFTLIGHPALGRNRRYYVTNSELTFIQDGPDSTSQGRNVAVKFRALADD QPYRPLLTTPRPEVPGIQSATVVGPEMSEVHTDKLGRIRVHFHWDRYKTT EADASCWIRVSQAWAGKGWGVIAMPRVGQEVLITYVDGDLDRPLVTGIVY NGENPTPYDLPKDIRYTGLVSRSIKRAGGYQNASQITFDDQRGAERVMIH AERDMQQTVERNSSTSIAQDLNLSVKGTATSVVGISVSFTGISVSYTGLS VSFTGVSASFTGVSTSFTGVSTSFTGVSTSFTGVSTSFIGVSTSFTGVDT GFKGVSTAMIGVSTSVVGSSNSVTGVSNSMTGISSSWTDVSMSTTGQSQS MTGVSLSYTGTSNSMTGTSTSVTGTSTSITGTSMSNTGSSTSITGTSMST 
TGSSTSITGSSVSTTGSSVSTTGSSVSTTGSSVSTTGFSFSYTGASYSDV GIDLKKVGMQVKS >BTH_II1041 pANL12 MSSKRKIVMPTDEEDAAINRGIAADPDTFEVPAEDFAKMTRRGKRGRPPL EAPKVQLTVRYDVDIVDAFKATGEGWQTRMNDALREWLKEHQPA >BTH_II0862 pentapeptide repeat family protein MKIVKPESLALLCRTLRFEGIDRLSIGALACFALRADAPAGPGDLAPEAS LWQVARQWLGEHAPLDDGLPKPSGEFLVYGDACAPPGRDRAARAPFAVRA RIGAACKERLVDARDAAGRALAEFRALPPSHPERSRDLGPFDERWLAARW PHLPAGTRAEHFHTAPRDQRIAGFWRGDEDIELVNLHADRPAIAGALPRV RARCFVERWVGGVARIDACPMRAETVWLFPGAACGIVLYRALVAIDDEDG DDVVRVIAGWEHADAPPLPDEAYIGRPAPEDEGSRPALAPAAAPAAIADD DARADAGDAADRAPGAPASAAHAHSPAAPESSAEPPAPDLSALERDAAAL AAQTDALLAAAGLTEADVARLLPPRDAPADMTLDELTALAAELDARTAQW QAQYDAAAAERDEASSPASPNSAAAHDASLADLLRQADAQIRALVDQHGL SRAQMEAAARDRPELAALADALDALDAPLDIDALTAGLAAPAGDEAIVEP DAPAGPDRPAGADRPADGAPASMHAAAPSAGDAPPAEPLTREQVIERHAR GLGFAGLDLSGLDLSSAALERADLRDARIERTCFAGCRLRGASFERALLS RADFSNADLREATFVDASAPGASFRGAALDRARLAHADFTGADFTRASLA DGHCAHARFDESAMTQLAAARLDGAHASFAGCALDAADFTSARMPRANFQ HATLTAATFAFAQCDGAEWYGAQASGAQLRSASLRGSRADASTSFRQAVL SGAALDDANWDGVDLRYANLHKATLDRASLARAIASGAQLTLSLARRADL TKADLTHADARFSNLQGASLRRARLDGTQLQSSNLYGADCYGTALGRSQL AGANVERTLFVVPGRPELASSR >BTH_II1822 PepSY-associated TM helix family MDASLRSRWRAVHRGAGALFGVVLFVILFTGTWSLAQESMQGWWRPPALA VAGPPLPLERLAARAAALGFSLRDARIVLPQPSDPAVRFCSARQVCALAL NPATGEPLAEAARAMPLVTLHKTMFAGFPGRIFVSLWGIALLVLIVAGLV LHRRHWPDSARVRRDRGVRIALFDLHGWIGLWGAPWLVLFALTGALSGLG ALGTVSLAPVAFPGQPQRAFAALMGAPPPAAVDKPWSRAPDLDALLRRDA ARAPAFRPEVVALHHWGDANASVEIAGTAAGLPSTALFERHLYRAADGQW LADATSRGRGFWLRAFIAVQPLHFARYGWSGAAGGSLRALHFLMGVAACV LCATGLVLWIERRHAQRDARARTLAALGAGVCGGLVLAGGVLLFAGRVLP PGARADDALAALFWSTWLGSALLAARVRDRAALVRALMGAAGAAYLLAGA AHLSIALLGAGAPVYAHVDAALACLGALLLRAARRPRRVAAPPAGLPPPR SELP >BTH_II2078 Uncharacterized ACR, YkgG family COG1556 family MSARDDILGRIRAALGGERASLAAQFPAPARTAAAADASDAPRARGTLDS SAASSGSPAAARGPAISLGDVDLVARFVAKAQQVSATCAHIATAAGAPAA VDAYLRDVGLADAPLVVAAALDALPWRTHRALPGVDLRRDGLVSVTPSFA AIAETGSVVCLSSSATPTSLNFVAATHVVLVNRSAIVATMEDAWARVRAT IATLPRAINVITGPSRTADVEQTVQVGVHGPKRVLVLICDDA >BTH_II1642 
glyoxalase family protein MARARIRFRARRCHQRDAQGRIAHAEMAFGGGIVMIGASGWRDFAVSPES LGGGNAQRVHVQLRSGIDAHCEPARAAGAAILQAPAGQFYGDRTYCPCGP EGHVRTFGQTVRRVSREDAGRHGGLSIEGGR >BTH_II0917 conserved hypothetical protein MIKADRERLSVVSLIEKVARLREPAAWPDGTQSVKAIETHMSWVFLTDRH AWKLKKPVRAPQLDFRSLAARERFCHEEVRLNRRLAEGVYLGAVPVTMDS DGRLRPGGAGEVVDWLVEMKRLPAERMLDRALLHGSATRADARRIAQRLS AFYRSLAPVRGDPALYRDDLRRTIDCNERALCRPMFDQPVAAVRAVCALQ RALLDAEAGRFDARVRQGRIVEGHGDLRPEHICIDARIAIIDCLEFSKRL RTQDAADEIGFLALECERLGAPEFARALLGEYRAASGDDVDDALVHFHQS CRAMTRARLAAWHLREKAFRATPAWRDRARAYVALAQRHIGCCEQLWTAA RVSAAAGP >BTH_II0099 conserved hypothetical protein TIGR00149 MQQSIQHITVEARGRGLVEFTPQVRAFVEVQSVSTGLLTVFCRHTSASLL IQENADPSVQRDIERYFAALAPEDDARYEHDTEGADDMPAHLRTALTQVQ LSIPVEHGRMVLGTWQGIYLFEHRRAPHRRDVVLHLIGE >BTH_II1754 Protein of unknown function (DUF1006) superfamily MRRFALQWRIRFHPPFPAPLVTAPLSIASARALHLAAQGLLTPPRRKAVK ADVLAAIRRMAQLQIDTIHVVARSPYLVLFSRLGAYAPQWLDEHLADARL FEYWSHEACFLPIEDFGLMRHKMLNPVGMGWKYAAEWHAKHRDAIDALLA HVRASSPVRSADFARGAGKGNGWWDWKPEKRHLEVLFSTGQLMVAERRNF QRVYDVAERVLPHWDDARDLPPRETVLPRLVGNTCRALGIVRADWVADYY RLPKRSYRDELHALANAGELLPVAVEGWSADAFVHREFAPLVDAARDGAL HPTVTTLLSPFDPVVWDRRRASALFGFDYTIECYTPAHKRRYGYFCLPIL HRGRLVGRIDAKAHRAQRVFELKAVHIEPGVRVGAGLAADVGRAIRKLAD WHETPVVEAGNAPKEIARAIGAD >BTH_II0125 Bacterial protein of unknown function (DUF876) superfamily MVPGIHAKDEDQVERRCRAANRVDYRIGMIAMSWHNKVVWNEGLFLLPQL FQQQERYFEYFAHKRAAVLSPFFWGFSRYEIDQESLSFGKLVFKSGTGIF SDGTPFDVPGHTPPPPPLTIASEHQDQVIYLAVPLRLPNTEETAFDEQAG SLARYSAFEIELRDSNAIGQGPKPVQLANMRLRLLPEKELTQSWIGIALT RVKTLHADGSVALHDGDHIPPVSQYGANPLLREWATQLHGLAKLRADALA TRLSGSDGRAGAAAEVADYLLLQVLNRYEPLLEHICRIREMPPVTLYREL SMLAGELSTFVRPQTRRPRPTPGYDHAQLYASIRPLVDEVHYLLNQVLIR GAQPIPLTEQPHGIRVATMLPSELAGYSSLVLAVGAQMSPDVLQQQFASQ TKISHPQRLPELIRSHLPGMTMIPLPVPPRQIPFNSSYIYYELSRTGPFW EQIAQQGGLAMHIAGHFPELKLELWGVRHK >BTH_II1708 unnamed protein product MSELTLAALLSPITVDAFMERYWGRKPLIVRRQAPHLYACLPDSEEFAFL LHSLTDPERGWFSIVNGVARPPSDSLLTQEGLLNLSEVYAAYRDGNSLLM 
NQVQRRHRETAMLCRRIESALSAHGIALARHIGANGYLSPPSSQGFNIHY DPHDVLILQIEGRKHWRLYGRHVAWPTQPPATPIPPEEAGSPRREFVLSP GELVYIPRGVLHDANTTDSRSLHLTLSIETLTWTDLLIEAMSDNPAFRRN LPVCPPFGKRIGDEARAELTRLTASLNNPRALRRALAAMSGRLLGNLDPL PNGGFAEVDGLHLIEPKTWLSLAPGTFGHVEVNGDEAILHLPGSALRAAR EMAKAFYYLLRARRVRACDLPVSASEADKLTFVRKLVQMGFLVKASE >BTH_II0134 ImpA-related N-terminal family MNATTPAATLRYADLLAPVSQDAPCGPDLEYDPAFVMLQSAIAPKKDAQY GEFVEAPQPANWAEAERDCRALLLRTKDIRLVVILTRCRIRQSGAQGLRD GLALLNEMLARYGEALHPVPFFEGERDPVVYANAIATLADPDATLADIRE IQLPKASGLQLQLRDIEKALAVTRVKDALAPESASRLLKEWWNRRDGTIV ALAQAQCLMADLIASTRESLGDDAPDLSGIAKLLHPFTQAQLESPYSASA AQPQGDAKPANGDTAHSRAADALAPAGDAATQTPAAMSIANPQPPMDRRG ALAAIQATRLWFEQNEPSSPVIVLLRQSERMVGKRFSEIANAIPAELLAQ WDALDV >BTH_II0260 Protein of unknown function (DUF796) superfamily MGVAMFMKVDGVTGESADAQHKGWTDIQSFSWGASQPGAMASGSGGNAGK ASFNDLVVAAYMDKGATAIIKNCANGKHLSSVEISACKTGGSQIEFMRVT LQEVLVTSAQIAGVDPGDAADRLMMQYGFQAAKVKKQYWQQNDNGGKGAE VSVGWNIKENTEM >BTH_II0234 class III extradiol-type catecholic dioxygenase, putative MLISRPAARPASCCSCRDTPSFARSPTHRTGCLFMGKIIGAGLISHAPVV MMPRAVRLRENDGRDFTLATGLARLRREVFDAHDYDTVLVLDSHWRTTTE AVVTAHARRTGRFTSDEMPNAIRQLPYDLAGDPELARAIAELATRRACWI AAVDDPCLPIHYATLNPWTYLGRPDKRWISMSVCQTATTDDFLRMGEIVA QAISRLDRNVLLVASGGLSHAFWPLAELRRRMAGAASNIVTPAARAADER RIAWLEQGRHDRVIDAMSEFLRFDPEANFGHYLMMAGAIGARACAARARR FSEYENGIGTGHVHLWFGPVDGGWTRAETRAEREAARA >BTH_II2074 DoxD-like family protein MTRSVDSGVIFFARLLLAVLFLWGGTMKVTGYGEFVGYLKGLGVPFTQVA APAIVALEALGGVLLVVGYKVKPLALMLAIYTIATALIGHNFWDATSPAL QRDMAVHFWKNVAIAGGFLLLYVTGAGGASIDGARRPSSSYGSLR >BTH_II0703 YihY family protein MQKRQTTPARDARPYAHPMKSLIPQQPQKLVRHNVNWALDAFRRFSADRC SSMAASIAFYSAFSLAPTLVMVIAVAGWFFGADAARGEVFSHVHELIGNE AAAGVQTIVENAHRSGSRGGTAALISFAMLAIGASATFASLNTALSVIWP ATETRASSVLGLVRVRLISFGLVLGVAFLLIVSLVLDTAITFIGRWLWGA SPYVAIGNLLQFSIGIAVLAFAFGTLMKFLPDARVSNRDAMTGGIVSAVL FSAGKKLFALYLAHAGTASAFGAAGSFAVLLMWLYFSAAVLLLGAEFAAA RGAAHASIDAASQADAAPARGGPLD >BTH_II2000 gp29 MGSALIREGDTTSHGGRVLAGTSTNIVYGKPLALEGDMVSCPKCGGIYPI 
VGVRNRSMTFGDRPVATEGDKTACGATLIASQGTATVELTSGAGGPVGKG KSVVPRPAAQSNEAYRGRFQLVDDKTREPIANHPYTVTSADGQTIQGTTD ATGHTDWLSSHQASSLSFQQPGSDA >BTH_II2314 conserved hypothetical protein MSSRRAAQSGRMTTGAMAARRFRRHGSRVAGRAPSRATTIHGGDEMRRSR SPKPAKYAYADVPRRPRGFARTTAMRGDGLRRRAVRERAARRLSRELDAT LRRASTYPHPAGRIVRIETHISVVYLVGRFAYKRLKPFDFGFANFGGLAA RRRACEAELALNRPLAAPIYLATGPVVRRAHGLRLFGAGAAVDHVVRMRR FDERMLFSRLLARGALGAADIDAAAARLAAYHLHAPRDVPRRAYGSAREL RKQIDDVLAPLERALGPALPPALRAWCARRCDELAAHLDARRADGYVRAC HGDLHLDNVVKHGRDALMFDCIDFDDALRWIDVINDLSFLLMDLHAHDRA DLAHRLLNRWLDETGDFAGLAALPLYVAYRALVRALVATMRAGDDAAARA ERARRYVDAAAHAARARRPCLLLCHGYSGSGKSVASRALADVSGAIRLSS DSERKRARPFAAVDARPLAASAYTAQQIDAQYERLRALARDVLRAGYTAL VDATFLSHARRARFAALARETGVPMFILDFHASRACLERRVDARAAARND RSDAGAAVLATQLATADPLDAGERACTIGFDTDVPLATIRSAGYWRPALD ALDAADANAPATC >BTH_II1057 Phage minor tail protein MKDTFEWPSTVQGHGGDTTLRVRKAQFGDGYTQRAADGLNNRESTFDLRF VGNAAKVAAIIDFLDRHAGAESFYWTPPLRARGLFVCEKYSEPIKNGAVY TMTAQFEETFSV >BTH_II1623 conserved hypothetical protein MGYSGRLGRRGGGGHDAVRRPRLAPQCQRWPPSQAPRRRCEASGQCRAVS SRPRIGHRQCAMPDLPFDRHGDAPAAVDRARMGDHQQQDARGLRRAVARE GCRRARQVLVQHQREKERPGRGCPFGRRHAGQLRRRRPAMARSQGGMRSV NRSRVSPCSPSRRPSLPRALPSIAPHVFAGARRRATVARRRARSGIVTNA PPPSRRRECARPARAAAMPPAARKAGTRPRRAVPAGQPRRHPKAIRTPPE APMKNTDSPENQAASELIDQRIAELGDWRGDTLSRMRRLIHEAAPDVVEE WKWRGTPVWSRDGIVCTGESYKSVVKLTFAKGASVDDPAGLFNSSLDGNV RRAIDIREGETLDAGAFKALVRAAIAVNQSSGKAKTRAKRAKPDAP >BTH_II2323 conserved hypothetical protein MTNPFVQWLRASAPACALAIACVAQTGAAFAARATANVAANVAAAVTAPP SVALPDAPAGATSSGTSAAIRGTVVDAQTGKPIAAAIVTIDGHPIRADDQ GAFSADTAATDIAARAPGYLAARAPIEAGRPVTVALAPFRPKAVYLSAFG ITSKTLRDAAVNLKDTTAINALVIDMKGDRGVTPYPSAARRASGAAAQAP NAPVVRDFAALVADLHRRGLYLIARIVVFKDDPLAAAHPDWTVRDAGGDI WHDREELRWIDPSLREAWTHNLDVAEEAAKLGFDEIQFDYVRFPDARGLR FSVPNTRANRTAAISGFLQAARERLAPYNVFIAADIFGYVCWNEDDTAIG QQIEMLGGPLDYISPMLYPSGFTWGLPGCTQPTADPGQIVRRSLAEARSR TKLPGVRFRPWLQAFRDYAFDHRDFAAAEIRAQVDAAEAADTDGWMLWNA RNRYDPQQLPK >BTH_II1313 Uncharacterized conserved protein 
MTAFWEHFRHGADIGVRGVGETLAQAYEQAALALTAIVANPASVRALRDV EIACAEADRELLLVDWLNAIVYEMAVRKMLFSRFEVTLAENGLRARIAGE RIDAARHRPAVEPKGATYTALHVGRRGDGAWAAECVVDV >BTH_II0858 conserved hypothetical protein MSDRRIFMPAAAQARRPTQPAQACRSSRQLRPRRACAARACAATLAALAA LAGCAAGVTEEQARADVRWDYAPDALRIDIDASPRLNEYLNAPHTLLLAV FQSADARTFRQLADDPDRLRMTLAAGGPATDFIQTTRYVVEPGARVALSI DRAQQARYVGIVAGYYDADSPRAARLFDVPLRIDKRGWFSSTYRAAPRTL GLKLRLGAQSITDAREAPLNLPPPGARAWTTLDGGAKTLTLPAGDANGSE NGGGGDENDASAHAPRKR >BTH_II0254 lipoprotein, putative MSHAVARIVRPLPSREIWTFAGLVGLACFVWLAGPLFAFAEFHPFESGWA RALTIAALFVAWGARIAWRNWRAGQLNAQLLNQLREASPRPAAPGDPARA QLDELRSRFDEASTLLKKVRFGAADGARKGLPQWLERMSRQYLYQLPWYV FIGAPGSGKTTALVNSGLSFPLAEQFGRAAIRGVGGTRHCDWWFTNDAVL IDTAGRYTTHESNRALDEAEWNGFVDLLKKYRARQPLNGAMLTISVADLL GASEAERTQHAMVLRKRLLELRSQLGIRFPVYLLVTKADLLAGFAEYFGG FGRAECAQVWGFTFPLAESEAPGFDLRAAFDREYRLLHKRLNDGLPELLA SQTDAHQREMSYLLPQQIADLQDMLGQFVAEVFSVSSFEPMPMLRGVYLT SGTQEGTAFDRVMSGIKRFLKIEGVPPAAQTGSTGRSFFLKSLLQDHIFR EAALAGSNLRWHRRQRVLQIVGYAAIVLLCVAVLFAWLRSYSRNRGYLDQ VAARVPAVDAQISRAKFTGAADVVQLLPVLDELSGLPSAGGVDLRHPPLA YRWGLFQGEKIEEASDAVYRRALDDVLLPIAASRMEQALREARPDEVEYA YAALKAYLMLYDGAHYDPAFVQAVVDLEMERSLPADFSSAQRSALRSHLG ALFGNRVAVSPFPMNERLVADVRERLRQVPFSQRLYRQLARTLRPSTATY DFSVARAVGPDASLVFRRQSGKSLADGVPGLYTRDGYRNVFAPRLPGAID AYGREEVWVLNLGASETPNPADAAAWARDIRQLYLNDYIKNWDDYLADIR LQHTSTLAQSIQVARTLSSADSPLTRLMVALARVTPLGDAPGGARNLASR AQDKVDEARNSLAQIFAGQPGADAGAAAASPASPEQIVDSHFAGLRAFAP GGGDQAASFDAVLKAIDALYTYLTATDDALRGGATPPPSDAPARLRAQAG RLPTPFREVLDDLSNVANGSVASVEQRNVAQRAGANVGDFCRQAIAGRYP FARGASRDVAPSDFAQMFAAGGLMDDFFQKNLQTLVDTTTHPWRFNNRNA EADPAAAAMLGSFEKAAVIRDVYFGGGARTAQIKVEIVPLEMDPSISEMV LDVDGQIVRYAHGPQVPTAVQWPGPRGSDQVRLQVTEQSGATGGFTTEGP WALHRLFDRAGVSGGRGPEQMVARFAVDGKPIVLQVTASSVRNPFRLPQM ESFTCPPKQ >BTH_II0265 Rhs element Vgr protein MFTLDSLHGDDLKFHRLYGEEALGRMFDFRIEALADNHSLSLKELLGKPV TVRIRQQDESERHLNGIVARAALVGRRAQRHYGYQLIVRPWLWLATRRSD CRIFQNKTVPEIVQDVLVTYGFPIENHLTDTYAPRDYCVQYNETDAAFVS RLMEFEGIYYYFKHAAQTHTLMLCDAMASHVALPGYEHIPFIARDRTAIA 
DEEHIDSWLPAQEVSIGKHETSDYDYTKPRADLSAQKIDPRGHDHDGFAS FEWPGGYRDDEPGAHYSRVRLEEQQAEHERALARTDVRGIAPGYLFTLEH CPRADQNREYLIVRCQYRFQENAYATDSGNEAVVHESQVLVQPSSLPYRS PRATPRPRTNGPQTATVVGPVGEEIWTDQYGRVKLQFRWDRYGQSDQNSS CWVRVSSPWAGGGFGGVQIPRIGDEVVVDFLNGDPDQPIVTGRVYNGEKM PPWGLPGSATQSGLLSRSSPGGTTEHSNAFRFEDKKGAEQLWMHAERNFD AETEQDHTLSVGHDHSHSVGNNETMSVANDRQRSVGQNETVNIGKHRVAQ IGGNETHGVAGNRTRQVGQNEAVTIGANREATIGGNHVETVAKDKTETIG QGKTLNVTQHYQTNSKSMKTAVVQHHAEEIGSRTSTIKNAHVLNVGDSQS VNVGASHTMSVRNNVHVGAGDEIALVCGHASITLKKDGTILINGVTVESS ASGSHSVRGKTVTSSATGEHTVEGTILKLNP >BTH_II1713 Domain of unknown function MTKNEESTLGLASDLARGFTDVRRYSVELAKPLSAEDQALQSMPDASPTK WHLAHTTWFFETVILARHARGYKLFDSRYPYLFNSYYEALGPRHARAQRG MLSRPSLGDVHRYRRHVDDALLDLLRTSDLPTLIAIEPEITLGLHHEQQH QELILTDILHAFSLNPLLPAYRSDDATPPADARRANGAMRWLSGPSGIVE IGHDGRGFSFDNERPRHRTMLHPYQIAERLVTNGEYAAFIDDGGYARPEF WLSDGWAIVQREEWKAPLYWIASDGGEGLGWREFGFGGLQPLMLDAPVSH VSFYEAAAYAEWARARLPTEAEWEAAFDAPDIVQMTGCVWQWTRSSYGPY PGFRPMAGVAAEYNGKFMVGQQVLRGGSVATPPGHARATYRNFFPPAARW QFTGVRLARDI >BTH_II1898 conserved hypothetical protein MSDLQLYNKLSRRIRRHSLQEVVADHLVDLMNHAIRGARMRIADDSPAAH SVLNFGCPPMQMAGATKINPVHAAAHICEVIRRFEPRVDPVATTVKPRTE SRKRLAQTIYFDVSMKAREDGAELRASLALDYLSGYFSLADDH >BTH_II0135 putative cytoplasmic protein MTAHPPATPPIDARSPLHPDALRAWFDPQAPWRAGFLSLLRAIAARDARM PLPGTACLPREEPFRIGQRPSMAFAPREIASLDVQRGRLDIQLFGLGLWG PQGPLPLHMTELAYNRAESYQDHAIAHFSNLFHHRALALFYRAWASSQAT VSLDRADRETFSFYIGSLMGTDPEEAARTHPPTHARYAACAHLVREARNP DGVAATLSHYFGVPISVDEYVFHWIRIAEPERCLLGARAASTVMGEGALL GDMVPDCQHKFRLVIGPLDLDQYLRLTPHGNDLPTLVDWVRAFIGHEYDW EIKLLVKPRAAPPARADTAHRLGYSTWLGESRDDKPVVGMVFEPEKYCS >BTH_II1384 conserved hypothetical protein MVKILLTAIACAVPAAALAEPPKASGGMVVDDAGMTLYTFDRDTVPGKSA CAGGCTANWPAALADAYDKPGGDLGLIAAAGGRHQWTYKGRPLYRFSGDT RPGQHNGDGFGGMWHVARP >BTH_II0868 hcp protein MPMPCYLTLEGQTQGSIEGSCDIQGHEGKILVQAVEHTIEIPKSPQTGLP TSKRLHGPMTLTKEIDKSSPKLSQALASGEQMKSVVLEFYRILKEGKEEH YYTVKLENAILTSIRSWTPNCLVLDNKQLGHMEDVSFTYEKITWTYVPDG IEAEDSWRAPKV >BTH_II1958 efflux transporter, RND family, MFP subunit 
MNNVLATIAIGFALAVSNAAQAAGEMGGMDMQGGAQQAGAAHAGMSHGEV KKVDAAAGKLTIKHGPLENLGMDAMTMVFKVKDPAMLSQVKAGDKIDFVA DEVDGALTVVKLIKQ >BTH_II0976 ygbK domain protein MSTDQAFRPLLGCIADDFTGATDLANMLVKSGMRTVQTIGVPAAGAPVQA DAIVVALKSRTIAAADAVAQSLAALEWLRAQGCRQFFFKYCSTFDSTDAG NIGPVADALLDALGGEHAFTIACPAFPENGRTVYRGHLFVGDALLSESGM ENHPLTPMKDANLVRVLQRQTRSKVGLIRHDAIALGTSAVRETIDTLRRE GVRIAIADALTDLDLYVLGEACADLPLITGGSGVALGLPSNFRLGALLPE RGDAAALPAIEGASAVLAGSASKATNAQVAAWRAARPAFRIDPLAAARGE PVVEQALAFARLHLPQPVLIYATAAPDEVKQVQQALGVEAAGHLVEATLA AIARGLRELGVRKFVVAGGETSGAVVQALGVKALRIGAQIDPGVPAAATT EGSPGGATEGSPRETPRAQPLGLALKSGNFGSIDFFEKALRALEGAA >BTH_II0129 Rhs element Vgr protein MPNHFSNGRTNQSRTVVIRSGAMPRLLGQPALKFLSLRGEEHLGKLYTYE LLLRTPDDFHVPLATSANLDLKAMIGTEMTVGIQLDGIGTGAQGGVGAGA REISGLVVKAGFLRSEGRYNVYRIELRPWLWLATLTSDYKIFQDKSVVEI IDTVLHDYPYPVEKRLDIDKYSVAGESARNEPRAFQVQYGETDFDFVQRL MEEWGIYWFFEHSDNKHRLVLCDHIGGHRKAPSEAYHEIAHHPEGGKIDI EYINYFSTDEALRPGRVVIDDFDFTRPLASLVTSNHQPRETNWGEGELFE WPGDYTDSKHGDLISRVRMEERRATGSRAYGRGNVRGLACGHTFVLAKHK HDDANREYLVIESALMLTEVADETGSGYRYECDNELVVQPSNEVFRMPRE TPKPTTNGPQSAIVVGPPGHEVWTDEFGRVKIRFLWDRYARNDATDSCWV RVSQAWAGVNFGGIYIPRIGQEVIVGFMNGDPDRPLILGSLYNTITPPPW DLPGDATKSGFKSKSITGGRENYNGIRFEDKLGAEEFHMQAEKDMNRLTK NDESHTIGANYSIGVGITHTRAVGAMFSSIVGGAASYAVGGAESTMIGGA YALNVGGAHAVAVGGASSVSVGGAYARNVGGAYALTVGGVLSIVCGASSI TMTACGSIKIVGKNIRIIGSDEVVVQGAPLQLNPGDSDCGGGGGGGGGGG AIPPIPLPSFFLDITKPILPPPPPPPTEVPPEPTPTPEPTPTPEPTPTPT PTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPT PTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPT PTPTPTPTPTPTPTPTPTSSEI >BTH_II0865 Protein of unknown function (DUF1305) superfamily MAAADGGAMPALEPVLLGEAKHFAYFQAIRLLRRIVRERREHHAGAASAP AAPMPIHTRPNLALSFPDTDVERIDKADDGGYRVVANFFGLYGVSSPLPT FYTEDLIDEAFKGRHAARGFLDVLHRALYPLLFDAWLKHRLSLRIVEERD AHALRPLYALAGVDARIARDAGLPEHALLRHVGLLSQRPRSASGLRALLA DAFAPAAVDIEPCVPQWLPIPDDQRTRVGARAHRLGVDARVGARMRDDGA RLRIVLRDVPGPLFRALMPGGDAFRRLRFLVRLYLTQPFTVDVAIRVRAR DALPARCGGGAWSRVGLDAWLGGPPAERAAAPQFRLPTSLFDQARPHHAA G >BTH_II0089 Rhs element Vgr protein, putative 
MTNLNDTLRNFASGAVDWNKRPVALHFGAAQGALGHLLALQHASVQEGLM TGIGGRLTCVSTRRDIPPGVFLGMPVSIRLITDRGQPHMVNAIISDVQIG QSDGELCVYQLTVCDALSLMDKRTNSRVFRKRSVIDVLATLFNEWQQRSP ALARAFEFDLSGLRADRYPPRELTRQVNESDAHFVRRLLRREGITVFAKA GPAKGEQPLQGDATVHTLVCCDEPMSLPQAPAGTVRLHPRDAGTEQRDTV TLFALRRQLVPGKAGRPSWDYKKARIDESSVASSLDQGEAGNDLAKLLTD IAIDIPHAGDSWSDHERLTRARMLAHEFEAERYDGVSSVRDLAVGAWITL TGDPDWDRQLADKRQFVITSIDHDIWNNLPKGLNDRVHALFAASRNLVNA PGALPAALANDADTRYENTFTCVRRGVPLAPAYDPQADLPPVHLLTGTIV GAEGEEVFCDENGRVRVRVHGLDPADHAHAQGAGTNDNAGDSAPIRVASS LAGAHFGASFLPRVGMEVLLGCLGGDPDRLVIIGVLGNGANPPATFSHAG GLPGNRYLSGIKTKEIKGQRYNQLRLDDTPNQISAQLASEHAHSQLNLGY LTQPRENGHGNDRGEGAELRTDAAAALRAAQGILLTTYARTQASGGQLDR DELIRLLGECSELFKALGDYAGQHGGQAADTAGQHAVAAAFRNWTPGAGG ADAPPDGGDRALMAFGAQAGSVNVTPKTHVTYAGENIDQVAQQHLQLMSG QRLNATAGQGMQLFARGAGVQAVAGEGPMLLQAQADTLTANAQKGVKITT NEHEVFVSAPRIRLVAEDGSYLEIGNGITLGTNGDIKLLSASHQWGGPST AQAAKTAFDNQPTDQRFKLHYPGEDGDLPAAANKPFRITLNDGRVIEGKT DASGLTDLVKDDAMRIAKIDYLKPKL >BTH_II0065 creA protein, putative MITLRTSTWIASVALAIFFASAVDAEELARISPHSQRYGTHIGISAYDDP LLKGVTCFVSEPHTSDERPSFRDGHGAEASVSCHQTGTLAATARLPRQAQ VFDESVDPVFRSVHVVRIFDIRRLVVLYFSYMESDVAGNLPGHVDVVRLP VHWGRTGASAK >BTH_II0068 DNA-binding response regulator MRILLVEDDVPLGDGIRAGMRQQGFQVDWVRDGDSGPARAARAWSRRDGP RSRFAARGRHGCSATHSQTIMRMTDTTGQCRRVAGARENGRWRTVRRVRY RTRRTGRSLAGGAACGAAGLAVHVAGTRAHPHPLPAMHAAHGGRVGELDS ARVRTLLHLGMMFGADFLALGIRRCRAHPRTVFNPLGRRHQLTAVNHASR LDGRALSWLARLGVGQTGRHRDRDADQARRSERIRTEHGNLQTICVHSTV HGKAADPLCQIKWRRVAANTIEMHAGHHVDVNCTVHHRAARVAAWRAALI GPLRSRSAMSNSLRYRERTVELADQMRKLRQSQPDPMSAFGALAVAGAKD GALTKKTRGLIALGIGISCRCGDCIGFDRQSPPIKRKATREEVEEAAGVA VYMGGGPSMMYAAPTLMAVGEPAA >BTH_II2285 MlrC C-terminus family MKILVAGFRHESNTFAPSKATYASFAADGGRYPLSRGAEIGRLKGMNLPV AGALAALDDAGHVALPATWADATPSGRVDSVAFERIAGEIVDAAKRRDAD GIYVDLHGAMATERYDDGEGELLRRLRETVGARVPIVASLDLHANVTQQM LDSADGLVTYRTYPHVDMAETGRRAVALLDALLGRRGRHRHFSSARRVPF LIPVNAMCTSLEPSKSLFRLLERLETGSVRSLSFAPGFPAADFPECGPTI WGYGADPVQLARAVDALYEHVVSAEAQWSVPFMSADDAVTEAIRIARRAR 
KPVVIADTQDNPGAGGGSNTTGLLRALVRHRAPDAALGLFFDPAAACAAH AAGLGASVEITLGADSGLPFTGTFRVESLSNGRCHCNGPMLRGATFELGP TACLRIGDVRVVVTSARVQMTDRSFYRIAGIAPETMKILVNKSSVHFRAD FDAIADCVLIAKAGGWMAADPADLRWTSLADGMRTSPCGAPFFGCGGRHA PHADGITGEMRL >BTH_II1126 DGPF domain protein MRVMVMVKATNESEAGKLPTKAQFEAMGKFNEELVKAGVILAADGLHPSA KGKRVRFSGSARTVIDGPFAQTKELVAGFWLWKVASMDEAVEWVRRCPNP MDGDSEIEIRPLYEMEDFGEEFTPELREREARLRDGIDEPREGAR >BTH_II0098 PAAR motif family MRGVIRIDDSTSHGGRVVTGREGSTVMGRAVACVGDRCTCPMNGHEHCVI VEGDEGVRIEGRAVAFDGHRTSCGATLISSIPTSGRV >BTH_II0869 Protein of unknown function (DUF877) family MNRDTERTTMEGEHLYSPKNDDAPPAPAPDSPASLLDELIEAARVKRDEE AYPITRHGIEAFVAHLARPKRPIETVSQATIDDMIAEIDRKLCRQVDAIL HHPDFQQLESTWRSLKFLVERTDFRENIKIQFLDVGKAALLDDFDDSPDI TKSGLYQKVYAAEYGQFGGQPIGAIVANYTFGPGAQDVKLLQYVASTSAM AHTPFIAAAGPAFFGIDSFGKLPNVKDLASLFEGPQFAKWNAFRESEDAR YVGLTLPRFLLRLPYGANTTPVKRFNYDERVDGGDADFLWGNAAFAFATR LTASFADYRWCANVIGPKGGGTVADLPLYAYEAMGEIQNKIPTDVLISER REFELAEQGFIALTMRKHSDNAAFFSANSTQKPKFFGISKEGKDAELNYR LGTQLPYIFVVNRLAHYIKVIQRENIGTWKERGDLEQELNQWIRQYVADM DNPTEGVRSRRPLRQAEIFVSDVEGEPGWYRVDMKVRPHFKYMGASFTLS LVGKLEKR >BTH_II0400 conserved hypothetical protein MRVGRPVPALPQPVCDYCGAKALLARFGDDAYPYREDHGELWICAPCDAW IGVFARSRRHVPLGRLANAELRHAKSELHAALEPLVAAKMRRDGCNAFEA RAKGIRWLATQLGLDAASSTIHTFDLDACRNALRLVEQFTSRKSLSSNS >BTH_II1601 conserved hypothetical protein MRYCLALDLKDDPDAIARYEAHHERIWPEVAAHLRAHGVVAMEIYRLGTR MTMVMETDDTRFDAARFDADARADAKIVEWEALMSTFQRPTPWTPAGVKW TPMARIFDLSKQ >BTH_II1455 opgC protein, putative MAQCATIVVRIQSCSDAALRHAPFVRPRFRRSSPAPRPSRVAQMRRRMNA APAQRYAELDFFRGLVLLVIVVDHIGGSMLSRVTLHAYALCDAAEVFVFL GGFATAIAYNSLAARHTEAAARQRFIKRAFEIYRAFLFTAGLMLFITAVL NAFEIDAPNMPINDLDGLMHAPLAALRDILLLRRQPYLASVLPMYTFFAL LVPLALPIARSRGWWLLVAASAATWLGAREIAAYLPTVDGVPWDFNPFAW QFLFVLGIVARCQPIYPVLAKRPVGWFATAAALAVVAAGAYYRLRIEPFP TDPSIKQNLGALRLANFIAIAWLAAKLIHLGWMHRIARAMPWIGTIGRQG LLCFVAGTGISLLVDSLLYTATDGYLDVRLGLVADAAAVGLLYVVAKLYA PLVARASDFVRQARLLRPQRPFRLPLRRPKR >BTH_II1151 conserved hypothetical protein 
MIQPTVVFKDNLAQLPAIDGIERIDLVDGGGAVIASIENKPGKQGSLAVY HYLREAFGTLDARAAEHGLAVFAEHTADARHRPGAHPNVDRLLAIAAGGD ALRIDVVARG >BTH_II0937 conserved hypothetical protein MTVRQAGTPITDDDLKQSAIDSGLMVARRQCPAEKADMVACRLFPSLPFF FDGTNNNMDRDVPLNKHSNVAKLFRITKDSIQSDVRRTYIPGVGTPFKFE KVAGYTDRLNDDGGGVLGLGLGTGGDLRIKFALAEFSRLLEVEWGPGSWK HMREVTVAIFGFSRGATQARAFARRFIEQKCGKDGGRLYWAAPSGMRVPL RITFMGIFDTVASVGGPALHLDWASELAIPAEVERCVHYVSAHEVRRAFP LDSVRVDKSYQGSCEEVVYPGVHSDVGGGYGPEEQGRVHDLSLIPLRHMF AEALRAHVPIIPLDQMPRNIRKDFDLSDDARIVGLYTEYMATLPSASGDT LEALIQPHRYLNFRWRSVLARHHADERVLGRLYAKVGESFCRTVPVGTDA DHSACRPNEWVYDVPKDPQEQATQLLREQRLLARHIEFLRYPIERRPGPQ SYPPTPRELTPYEKMILSAWDEQAPPSLVVDQLLAEYVHDSVAAFTSWPC ALWDQRGIWCDQRRYLAENDPMHAGDLAVA >BTH_II1848 Prokaryotic protein of unknown function (DUF849) superfamily MNQEVIVTCAVTGAGDTVGKHPAIPVTPKQIADAAIEAAKAGATVAHCHV RDPLTGRGSRDPRLYREVVDRIRSADVDIIINLTAGMGGDLEIGAGEDPM RFGQGTDLVGGLTRLAHVEELLPEICTLDCGTLNFGDGDYIYVSTPAQLR AGAKRIRELGVKPELEIFDTGHLWFAKQLLKEGLLDDPPLFQLCLGIPWG APADTGTMKAMVDNLPPGAHWAGFGIGRMQMPMVAQAMLLGGHVRVGLED NLWLDRGVHATNGSLVERAREIAERLGARVLAPAEGRRKLGLPPRGERPL ERRAIAGYA >BTH_II0121 Protein of unknown function (DUF770) superfamily MAISNSSQKFIARNRAPRVQIEYDVEIYGSEKKVELPFVMGVLADLSGKP IEPLPAVGDRKFFSIDIDNFDERMKAMKPRVAFSVPNTLSGDGQLMVDIT FESMDDFSPAAIAKKVDALSQLLDARTQLANLQTYMDGKSGAENLVSKVL KDPALLSALAKAPKPAAQQAENRESKEKH >BTH_II1142 pvdS MDMSEKDTRRDTAPLHERQRRFEEDLVDAYDEELEMELDDRRFDDESLFS PERREARKRYFRELFRLQGELVKLQDWVVSSGHRLVVIFEGRDAAGKGGA IKRITQRLNPRVCRVAALPAPSDRERTQWYFQRYVAHLPAGGEMVLFDRS WYNRAGVERVMNFCTDAEYEEFFRSVPEFEKMLVRSGIQIVKYWFSITDE EQEVRFQNRIEDPLKQWKLSPMDLESRRRWEAYTAAKEEMLMRTHIPEAP WWVVQAVDKKRARLNCIHHLLSLVPYYEIERDSVHLPQREHHDDYIRRPV PGDMIVPEIY >BTH_II0285 Prokaryotic protein of unknown function (DUF849) superfamily MSHPVVVTCALNGIFTDPKQFNVPVTPEQMAREAKGAYDAGASCMHVHFR RQEPDRGHLPSWDPRLAAEIAQAIREACPGVIFNQSTGIVGPNVEGPLAC IRAIRPEIAACNAGSLNYLKVKADGAWAWPPMLFDNPVEKVASFLTTMAE TGAVPEFECFDTGIVRCIDMYARAGLFAGRSNYNFVMGVESGMPADPELL PILIRLLRPDSTWQVTAIGRANIWALHRRTAELGGQLRTGLEDTFYLPDG 
ARATSNAQLVDAIAAIAREAGREIASPQDARRILGTRAETASHA >BTH_II0867 Protein of unknown function (DUF1316) subfamily, putative MARAEGVSARLPARAPARAPSGERRLLERIADREAGGERSPSADALARSI IDHLRRILNTRQGHVPIDPAFGVPDFTNLAGGFAQGSAREIEAQIERVIA CYEPRLKSPRVTLAERALDAATLHFSLDARLVLDAREVPARFLTTVSGNG KIDIRTIS >BTH_II0806 conserved hypothetical protein MNTMCGPIAAAAPAHDKTAREAALAQRIDALPWASIERDLDRDGHAIVRG LLAPRTCEALAALYARDALFRSRVTMARHGFGRGEYQYFGYPLPRAVHAL RTALYPHLAPIANRWHERMRIDARFPAEHERFIERCHAAGQLRPTPLLLR YRENDYNCLHQDLYGEHVFPLQAAILLSKPGADFTGGEFVITEQRPRMQS RVDVVPLEQGDAVVFAVHHRPVQGARGAYRVNLRHGVSRLRSGQRYTLGI IFHDAS >BTH_II0313 conserved within P. aerophilum MKCPVCVTPDLLMTERQSIEIDYCPTCRGVWLDRSELDKLIARADDDANE RRRDAAREVGRDAPLARYDHGDRDRRRHDAHGYDEHDRSRHGGYRKKKSL FDMFDFD >BTH_II0127 lipoprotein, putative MSYLKRFFRFVFSWQMLACIAVLLVSVAVWFVGPLLAFDELRPLAGVVVR VAVIVLLVALLAFWLMRWPLSPVGVAALCLLIWHAGPLFAFGDHRPFGPA WVRVLIVAVILFCYAVYGLYRLWQAVRTNDALLRRILDPSAGKPDAAARA DIRAVNVAVSKAIGQLKRLRGGAFGWRRLLESGRYLYELPWYMMVGAPGA GKTAAIARSGLKFPLADQMEASTERARGGTINCDWWFANEAVLIDTAGRY ARHEVPGDEEATLANEAEWKGFLGLLRKHRPRAPVNGVVLSVSVEDLVGR TPAERTAHAAALRARLGELHQELGIHFPVYVIVTKLDLLPGFPEYFQSLT AEGRTQIWGFTLPYDAENRKSAVGALREHCADELKRLEMRIDAGLNNRLL EEYENDRRKRLYALPQEFRSLSEALTDMLGLVFLDSRYDDAQLQNTLRGV YFTSAEQTDQVMAADRETILQRLKRQLGRMLGGDAGAQTRGNGAMSGSRG YFLRDVFQHVIVPEAHLVRPNVMWEVRFRLMRWAGHLLAVALLVWLASSL TVSFDSNRGYLDAISDKTTALAARVNAYNKAPKPAGVAGVLDGSRDLPQY GNLDLEAPGASFRYGLYVAPGIVDASDATYRNLLRRSLLPQIVRRVENAL SAQIDAKNADEVYRTLTIYLMLYDAARHDAKAVKDWVMRDWERSDSAAEM GGRNRMARHLDALFVDGQPFEPSGRQNAALVQRARLFLNANPAPRRLYER AMAAIEKEAPENFTLARAVGLQGAGIFRLVDGSRFQRGVPGLYTYEGYHQ VFSARLPEFLARAQSDDAWVMGSADSAARWGDAIRNTKIVAGRSALADDI RRQYLTDYGNYWQQYLADIRPVSSGENGGSGTLAFDLATLRALAAPDSPL VRLARAVVRETSLSVVDAREDASLTDTALSAVGRRSGTAKEVADGAQKLA ARRPEQRLEKELVDNRFAALREVVTGQADTGSGPAMTDMPISSGGKALQL DAILTLINEQYTRLVVADNALSSHSMPPALDIGTTLQMEAEKLPAPLRAV LGGIATQAADKVGREVGSLLAMQVDSSVGKACRAAVDGKYPFARSSQEVD IEDFNRLFAAGGLFDEFFQKALAAHVDTNSKPWRYKALNPGMPPIRGPSL EPFERAAAIREVFFREPGAKRMAWKMDAKVASIDPEITEFIVDVDGQSQR 
YVHGPVLPFSVNWPGPRGGAIAEITAKPRIRPDTSTITTTGPWALFRLIE RGRLTGTTSASRLMLDFDFDGRRAALELRTNGQANPLTSGLLTNFRCPGS LG >BTH_II1040 pANL56 MDITFDPTKNETNIAKHGVSLALAAQLDWSDVLSYVDDRRDYSEVREVGF GVIGDRLYCVVFTQRGDSMHIISMRKANKREVKSYVEQA >BTH_II1436 Rhs element Vgr protein MKMSDIAGFLSLQNSRLLTIKTPLAGRAELVLSDFQCSEGLSVLFDMRLG LASRDPTIELKQMIGQAVTISLQPPGGIVGGSARHFHGYVTQFSHTGADG GLATYSATVQPWLWMLSRRVDSRIFQDKSARDILDEVFSQYSALASYEFR AGRTLKPYSYCTQYRETDLNFVLRLMELEGLFFYFEHAEDGHKLIIDDDS TRAKPIDGLPSLRYASGEILEDEAVVTQWAAQRQLMSGAVSMKAYDYKVP AARRYVSGESDFNQGEVERYEIYDYVGLHGFDSTDRGEELARFRLESLAA SGKTFSGTSTGRTLAPGRYFELSAHYDHDNGPMHDRQFLLTNVRHHGVNN YQSNEGSGSYHASFQCIRKKIPFRPPLAHARPVIPGPQTAIVVGPKGEQI HTDALGRVKIQFHWDRIGQRNQGSSCWVRVSQPWAGGGFGSVQIPRIGDE VVVTFLDGNPDRPLILSSVYNAQNMPPWALPAGATQSGFLTRSHKGTSEN ANAIRFEDKLGEEEIWLHAERNQRIEVEHDESHTVGAKRTKTIGADEIVT IGGAQTHTITGARTQTIGADHTQTIKGAHKQNVAGTHAQVIGGNASITTT GPQPGAAGTVGDIEIQSSQGKIHLKAATEIVIEVGASVIHLKADGTIEIS GPTHIGLNSKS >BTH_II0732 conserved hypothetical protein MNSLSSAIPAVHSTCADARAAVREVHVALAGCDAELVLFFCSSRFDLDAL ADEMRERFRGTRVIGCTTAGEIGPAGYRNGSLVAVALPRALFTIETALLE GLQTFTIASGHACTLDALHDLERRAPRASGANPFALLLIDGLSVREEPVT RTLQGALGDIPLVGGSAADDLRFERTAIFYDGRFRDDCAALIVASTALPF RTFKTQHFRCGAERLVVTQADAERRTVSEINGLPAAEEYARLIGARVEDL SPGHFAAAPVVVLIDGTDYVRSIQKLNPDGSLTFYCAIEEGLVLRVARAL DLVDNLQATFGDLRDSFGEPQLVLAWDCILRHLEMMQRGTRDTAAELLKA NRAVGFSTYGEQYGGVHVNQTLTGIVFSHAPRPERA >BTH_II1705 oxidoreductase domain protein MPHYVEYHAGFGHFHWHNDYSHESEEAPRKLTVIVQLSEPHEYEGGDLEV FGSSIAVAPRHRGSIICLPSFVEHRVTPVVAGVRRVLVAWIAGPRLK >BTH_II0880 Domain of unknown function (DUF323) superfamily MRAPTSRHTFRAASAFRLAGFAIVALTLAFTATAAHAARFVRLPGGDFES ALPQDVPGRSTPVHIDAFELQDTPVTVRAFAAFLRAHPEWRRERVARVFA GPAYLADWADPLHPAPATPPDAPVTGVSWFAARAYCASEGARLPTWLEWE YAAAADATRTDARNDPLWRQQILSWYEQPAARVLPSVGGAPNVYGVRDLH GLIWEWVDDFNALFISGDSRTQGDPDQQRFCGAGAISIVRRDSYAVLMRV ALLSSLTGADSTGSLGFRCARSISGD >BTH_II0615 conserved hypothetical protein TIGR00294 MPLQGSNKMNLMNPEFVMPDVQSTVDTRQMPIQRVGVRAVRHPLTVRTAE 
GETQATVGTWNLDVHLPADQKGTHMSRFVALLEESGGPLTADAFRAMLAT MLEKLEAQAGRIEVSFPYFVNKTAPVSGVRSLLDYEVTLTGDVRDGLTRV FAKVLVPVTSLCPCSKKISQYGAHNQRSHVTIDAELAADVPVEDLIRIAE EEASCELWGLLKRPDEKFVTERAYENPKFVEDLVRDVARRLDADERIVAY VLEAENFESIHNHSAYALIERDKRRRA >BTH_II0262 Bacterial protein of unknown function (DUF879) superfamily MDTRLLDYYNRELAYLRELGGEFAQQFPKVAARLRMHESGPPDPYVERLL EGFSFLTARVQLKMDAEFPRFTQALLDAVYPGYVAPVPSMAIMQFTPMMN EGSLAQGYRLPAGTALRARPAASEQTACEFRTAHDLTLWPLELTAASVTG APAYLPRSATAARRDVRGALRIRLKACGGASLAKLPIERLMFHLAGPERD ALHLLELIAGHTIGVVCHDAAQPPRWLHALGADALAHQGFDAAQALLPDD GRSFQGYRLLREYFAFPARFLFFSIEGLRPALARATGDTFELTLLLDRHD AALENSVDARHVALNCTPAVNLFARRGDRIPVHPGAREHHVVVDRSRPLD YEVYAVRRLAGEPRDDAQAREFRPFHASFAGDDGNYGAYYTVRREPRLVS ARARANGTRTGYVGSETFVSLVDSECAPYDESIRYLSADTLCTNRDLVLL LPAGDANAFTLRVSAPVERIAAIRGPSRPRAPIADAQTAWRLVSHLGLAR QTLTDVDDEEGARVLRELLGLHADPADAAMRRQIDGVHRVAFTPVFRRLP AAGPLMFGRGVQVDVTVDDHAFSGDSPYLLGAVLEQFFARHVSINSFAEC VLSSAQRGRLAQWPARVGRRPAI >BTH_II0361 conserved hypothetical protein MEAAMAMPRATAGELIDVRPLGDALPGSKSITLMRSDHLEVVRLVLPAGK HIPEHRVPGEITVQCLEGIVKFGTDAGTQLMRRGDMLFLQGGERHWLEAA ENASVLVTLYLPHGH >BTH_II1509 MgtC family protein MTFEFALRLFTAFACGVAIGLERQIRQRTAGLRTITLVASGACLFVTLGV LTGNGIPGVTQIAAYVVSGVGFLGGGVIMRDKGSIQGINTAATLWCSAAV GVLSGAGHYLPALAGTGVVLLTNTLLRGVSQAINSTPVSNADLVREYQIT VICLASDEVHIRTLLSNSMYAKPLSFQSLTSEDVPREADAPERIKVTATL KLHPKDQPKLEQIASRMSMEKSISSVSWTAKEAEPLME >BTH_II0122 Protein of unknown function (DUF877) superfamily MAARESQARSADAQLATQSDFNALLSREFKPKTEQAREAVEHAVKTLAEQ ALANSVTLSDDAYKSIEAIIGEIDRKLSEQINLILHHDDFQKLESAWRGL HHLVTNTETDEKLKIRFMDVSKDDLRRTMKRYKGVAWDQSPFFKQIYEEE YGQLGGEPYGCLVADYYFDHTPPDVDLLSSIGKVAAAAHAPFITGASPSV LQMDSWQELANPRDLTKIFTQNLEYAPWNSLRNSEDARYIGLAMPRFLAR LPYGIRTNPVDEFDFEESTDGSDHRKYVWANAAYAMAVNVNRSFKHYGWC TLIRGVESGGVVENLPCHTFPTDDGGIDMKCPTEIAISDRREAELAKNGF IPLIHRKNTDYAAFIGAQSLQKPAEYYDPDATANANLSARLPYLFACSRF AHYLKCIVRDKIGSFKEREDMQQWLNEWIMNYVDADPANSSQETKARRPL AAAEVVVEDVEGNPGYYQAKFFLRPHFQLEGLTVSLRLVAKLPSVKEAA >BTH_II2255 MlrC C-terminus family 
MHSRRVSSDRRRPEMPFAIQESDMNILIAGFQHETNTFAPTRASYRSFVL GEGFPPLVRGGGVLSLRDVNVPIGGFIRAAQANGHALLPVVWAGACPSAH VMSDAFERIGGEIVAAAEAGGFDAIYLDLHGAMVTEQFDDGEGELLARVR RIVGDRMPIVVSLDLHANVTARMAAHASALVAYRTYPHVDMAQTGERAAQ VLERLAAEARPLHCAMRRLPFLIPINGMCTHAEPASGAYRLLAQLERDGV VSMSFAPGFPAADFPECGPTVWAHAFEADAAQRAADALFAKLVGDEARWS VPFLAPDAAAAEAIRLSRTATKPVVIADTQDNPGAGGDADTMGMVRALLR NGADDAAVGLIWDPDAAAAAHGAGVGARVSLRLGGRSRVRGDAPLDAVFE VEHLSDGRFRFDGPMFNGAQGDLGPVACLRIDGVRMAVSTNKMQTFERNQ FRVAGIEPERMRIVVSKSSVHFRADFEAIADAILIAKSPGPMAADPSDLP WARLDPDIRVRPNGPTFGALRAAAR >BTH_II1922 u1937b; B1937_F1_4 MKPFDEMLQPGDTVRAPYERLKQWLDTQDPASLAQKAHDAEGVFRKTGIT FAVYGDAEAAERLIPFDIVPRIISGREWNRLSQGIEQRVMALNAFLDDIY HRQEIVRAGIVPKHLISHNDAFLPEMIDFRPPGNVYTHIIGVDIVRTAEN QFYVLEDNARTPSGVSYMLENRETMMQLFPELFQQVKVRPVETYPQLLRQ SLAAVCPPGGNADNPTVAVLTPGIHNSAYYEHAFLADQMGVHLVEGSDLQ VIGDRVAMRTTEGFRPIDVLYRRLDDAFLDPLTFRPDSVLGVAGIMDVYR AGNVTIANAPGTGIADDKAIYSYMPEIVEFYTGRRAMLENVPTWRCAEAD SLKYVLEHLEELVVKEVHGSGGYGMLVGPAASKAERDAFAAKLRAKPSNY IAQPTLALSTTPILTERGLAPRHVDLRPFVLVSDRIRITPGGLTRVALKE GSLVVNSSQGGGTKDTWVLAD >BTH_II1206 conserved hypothetical protein MAPDRKGIAASARIAGIARSAAFGAAALAAALFAAPGSAHRGDVGQVRPV LTQPLAEAPGNDAQVVTVAYAPGAASGPHAHAGSIFAFVTQGRVVSQLEG EPPRTYGPGEAWYEPPGSHHIVSRNASDTEPAQIVVFAVVGEHRALKTPL PR >BTH_II1732 hpaD, 3,4-dihydroxyphenylacetate 2,3-dioxygenase MGKLSLAAKITHVPSMYLSELPGRHHGCRAEAIRGHQAIGERCRALGVDT IVVADTHWLVNAGYHVNCNGHFAGVYTSNELPHFIRDMRYEYPGNPALGH LIAASANERGIGTRAHEIDSLELEYGTLVPMRYMNADQHFKVVSIAGWCM WHSLDESRRFGEALRHAIEASDANVAFFASGSLSHRFNDNGSPEEAIHMI SREFYRQVDLRVVELWKQGDFATFCKMLPEYNEHCHGEGGMHDTAMLLGL LGWDRYDKPVEIVTDYFASSGTGQINAIFPLP >BTH_II0050 pcaC, 4-carboxymuconolactone decarboxylase MNDEQRYEAGMNVRRAVLGDAHVDRSLENRTELTEDFQNLITRYAWGEIW TRDGLPRHTRSLLTIAMMVALNRGEELALHLRAAKNNGVTRDEIKEVLLQ TAIYCGVPAANSAFHFAQRIFGEEDAAS >BTH_II2186 prpF, probable AcnD-accessory protein PrpF MTRRRGSVRPSGERTRITGFARDRSRCPSARSDRHRTSRTMAHVPQIKIP ATYLRGGTSKGVFFRLQDLPEAARQLGAARDALLSRVIGSPDPYGKQIDG MGGASSSTSKTVIVAKSARPDHDVDYLFGQVSIDKPFVDWSGNCGNLSAA 
VGPFAIAGGLIDPARVPHNGVATVRIWQANIGKTIVAHVPITDGAVQETG DFELDGVTFPAAEVQLEFMDPAADEEGAGGAMFPTGNVVDDLDVPGVGTL KATMINAGIPTIFVDAEAIGYTGTELQDAINSDAKALAMFETIRAHGALR MGLIGSLDEIATRQHTPKVAFVAKPADYVASSGKRVAAADVDLLVRAMSM GKLHHAMMGTAAVAIGTAAAIPGTLVNLAAGGGARREVRFGHPSGTLRVG AEAKQDGGEWVVTKAIMSRSARVLMEGWVRVPGDAF