GenColors

Gene list

Applied filters:

COG category: Function unknown
Gene type: CDS
Genomic element: chromosome II

Number of genes found: 201

# Burkholderia thailandensis E264, E264; ATCC 700388

>BTH_II0732 conserved hypothetical protein
MNSLSSAIPAVHSTCADARAAVREVHVALAGCDAELVLFFCSSRFDLDAL
ADEMRERFRGTRVIGCTTAGEIGPAGYRNGSLVAVALPRALFTIETALLE
GLQTFTIASGHACTLDALHDLERRAPRASGANPFALLLIDGLSVREEPVT
RTLQGALGDIPLVGGSAADDLRFERTAIFYDGRFRDDCAALIVASTALPF
RTFKTQHFRCGAERLVVTQADAERRTVSEINGLPAAEEYARLIGARVEDL
SPGHFAAAPVVVLIDGTDYVRSIQKLNPDGSLTFYCAIEEGLVLRVARAL
DLVDNLQATFGDLRDSFGEPQLVLAWDCILRHLEMMQRGTRDTAAELLKA
NRAVGFSTYGEQYGGVHVNQTLTGIVFSHAPRPERA
>BTH_II0808 conserved hypothetical protein
MRRAAAGGAGHAAQPRVASAAAPPPGAASAIVGIPRMHCSNPTRRPKPNR
SASSSSSPARPRGSGRRSPNPRRSRAGGRWAAGDASPVPGHRFMLDMVPW
GRRPREVIAVEPERRFAIAFAQGTLDTTIAWRLEPAAGGTRVFLEHAGFD
ADAPRARIDIEYEGMKRGRPSVLARIEPAIDG
>BTH_II1041 pANL12
MSSKRKIVMPTDEEDAAINRGIAADPDTFEVPAEDFAKMTRRGKRGRPPL
EAPKVQLTVRYDVDIVDAFKATGEGWQTRMNDALREWLKEHQPA
>BTH_II1355 pANL12
MTVSKRATHTDWVDPDDAPELTDEFFERADEYVGDRLVRRGPGRPLGSHK
TATTIRLDDDVLDAFKATGRGWQTRVNAALKEWLKTHKLA
>BTH_II0560 conserved hypothetical protein
MNMSTDRIERRIVLHAPRSRVWRALTNADEFGAWFRVDLAGQAFEAGRRV
EGRITYPGYEHLVLQMRIERIEPEHHFSYRWHPAAVDPAVDYSQEAPTLV
VFELADAEGGPLLTVVESGFDALPVERRADAFRMNSGGWDEQMTNIAAHV
DAR
>BTH_II0313 conserved within P. aerophilum
MKCPVCVTPDLLMTERQSIEIDYCPTCRGVWLDRSELDKLIARADDDANE
RRRDAAREVGRDAPLARYDHGDRDRRRHDAHGYDEHDRSRHGGYRKKKSL
FDMFDFD
>BTH_II0110 fusaric acid resistance protein, putative
MKYLLESPLACGTFLPHGPLPCAATRVASWRGSKRSVVVPIRRTRTATVA
SSPSVPSKEFDVSHASVAPLRSRDPLSIDHRKLFFSISTFIAAALVLLIS
FTVSFPRPWWALLTVYVTAQPMAGAFRPKVLYRLAGIAAGAMVAIVVAPN
LQNSPLLLVLCLALWIGFCIYLAVLDRTPRAFLFQMAAFSSAVICLPYLD
DPADIFITAISRVEEMTVAILCVTVAHMVLRPAGVRPVIHERALSFLDDA
CRWTAEAFGTHHARLEHEHRRKLAADVVELGMIAMNLPFDQRFALATRET
VTALQHRLAALLSIASAAANRLDRLRSLNAVDAETAALIDSLIVDLRASQ
EVGDALDIDLASRCRALAAKRLRDPAWTSLLAASLFDRTADFIDTLHAAR
SLVRTFGDSDVELERHARLVDGAHRFRLARDHGLALLAGAATTTAIMIYC
AVWILLAWPSGSATAAFAALVTCSFAAQDDPAPAIGRYLVATLTTFPLAA
LYLFVILPRVDGAGMLILTLAPAFLWMGYIQADPARSARALPMFSCFIVA
MGFLDRFQADFATFVNTGLAQVGGIVTTLVVTKLFRSASTRWTAYRIVRQ
NWAELAQMADPREALDAQAWTARAVDRLGQVASRVALADPGDALHAVDGL
SDLRIGRNIIQVRQRIGRGSARTRHAIERALGEVSNLYRARADASLPVPA
SAPLLRALDLAIHSAAHDPAGRDDATLLALVGMRCNLLPNVQPIEGGAR
>BTH_II0806 conserved hypothetical protein
MNTMCGPIAAAAPAHDKTAREAALAQRIDALPWASIERDLDRDGHAIVRG
LLAPRTCEALAALYARDALFRSRVTMARHGFGRGEYQYFGYPLPRAVHAL
RTALYPHLAPIANRWHERMRIDARFPAEHERFIERCHAAGQLRPTPLLLR
YRENDYNCLHQDLYGEHVFPLQAAILLSKPGADFTGGEFVITEQRPRMQS
RVDVVPLEQGDAVVFAVHHRPVQGARGAYRVNLRHGVSRLRSGQRYTLGI
IFHDAS
>BTH_II0285 Prokaryotic protein of unknown function (DUF849) superfamily
MSHPVVVTCALNGIFTDPKQFNVPVTPEQMAREAKGAYDAGASCMHVHFR
RQEPDRGHLPSWDPRLAAEIAQAIREACPGVIFNQSTGIVGPNVEGPLAC
IRAIRPEIAACNAGSLNYLKVKADGAWAWPPMLFDNPVEKVASFLTTMAE
TGAVPEFECFDTGIVRCIDMYARAGLFAGRSNYNFVMGVESGMPADPELL
PILIRLLRPDSTWQVTAIGRANIWALHRRTAELGGQLRTGLEDTFYLPDG
ARATSNAQLVDAIAAIAREAGREIASPQDARRILGTRAETASHA
>BTH_II2258 GtrA-like protein family
MSAAADIGMRARIVRFGVSGAASTALHAAIAGTLMGALAATAVQANAIAF
VCATGASYLLNTLWSFSVPLRWRNVARFLAVSVAGLMLTMAISHGVQALG
IAAAWSIAAVVVLVPPLTFAMHRLWTYR
>BTH_II2176 MgtC family protein
MLNNVELIMRLILAAALGSVIGIERERLSWAAGLRTHMLVCVGSALIMIV
SAFGFADVLGQAHVDLDPSRMAAQVVSGIGFLGAGSILLRGEIVRGLTTA
ASLWSVAAVGLAVGGGLYAAAIAATVIILVILAGIKPLERRYLTVRQRRH
LVLLVERGTLTFDSLHAALGVDSARVKRFIVQQSEDAADSDEVTIALGRV
SDAEYGSICARLRQLPGVKGFAEQKSGLPDD
>BTH_II1708 unnamed protein product
MSELTLAALLSPITVDAFMERYWGRKPLIVRRQAPHLYACLPDSEEFAFL
LHSLTDPERGWFSIVNGVARPPSDSLLTQEGLLNLSEVYAAYRDGNSLLM
NQVQRRHRETAMLCRRIESALSAHGIALARHIGANGYLSPPSSQGFNIHY
DPHDVLILQIEGRKHWRLYGRHVAWPTQPPATPIPPEEAGSPRREFVLSP
GELVYIPRGVLHDANTTDSRSLHLTLSIETLTWTDLLIEAMSDNPAFRRN
LPVCPPFGKRIGDEARAELTRLTASLNNPRALRRALAAMSGRLLGNLDPL
PNGGFAEVDGLHLIEPKTWLSLAPGTFGHVEVNGDEAILHLPGSALRAAR
EMAKAFYYLLRARRVRACDLPVSASEADKLTFVRKLVQMGFLVKASE
>BTH_II0261 Protein of unknown function (DUF1316) subfamily
MNDDTRARGGREGGLRAARDRLQPALLDRLTDDERARCTEPPDAQAIGGE
RLRAAVLRDLAWLLNTRNGEDGFVDWAAFAHAQASVLNYGMRPLVGKPMS
GVERMSVEASIRDAIVRFEPRIAPDSVEVRSVLDAPGGAAGERRHNVLMF
EIKGTLWSVPHPVEFVLRSALDLETGAMALQPAAGG
>BTH_II0580 ebsC protein, putative
MPAVRAATSDAAAAFSRAGAGHRAARAVTEATSPAQSPCQAPSRPIPLQP
PQRILHNRSIHSAPRTIAYPFRGFTKDAGARIAEPALAPADKTKAGTPAA
RRATNVSVESVRQFLAEKAPDIDVIALSESSSTMTLSAAWDIKPAQIAKT
LAMKVGDAHVLLVSCGDSRLDNQKIKAALGGKAKMLSAEETVAVTGHPVG
GVCPFGLSTPLPVYCDVMLKSYDFVVPAAGSTHAALRIDPVRLAELVEAE
WVDVCK
>BTH_II0099 conserved hypothetical protein TIGR00149
MQQSIQHITVEARGRGLVEFTPQVRAFVEVQSVSTGLLTVFCRHTSASLL
IQENADPSVQRDIERYFAALAPEDDARYEHDTEGADDMPAHLRTALTQVQ
LSIPVEHGRMVLGTWQGIYLFEHRRAPHRRDVVLHLIGE
>BTH_II1913 membrane protein, putative
MTSTLQSRLPGFARNALRPVLDPYRRYRHAKLIHAARVALSVLASIALTT
GLRVPHGEWATITVLIVIGGLQHHGNIRKKAAERALGTLIGAIAGLSLIL
LQTTVHLSPLTFLVMSAACGVCAYHAIGKAGYIALLSAITMVIVAGHGDN
EIADGLWRAVNVLVGIVIALAFSFALPLYATYSWRYRLADALRGCAAVHA
RIAGERYVSDSEHLKDMAKLNALLVQLRSLMPSVSKEISVSMPQLEAIQR
GLRLCMSSLEILSSLQPRADDEAGRRFVQLRMKADNRRIQEMLVGAGRAL
KFGTLSRLGPLHGPALPAGEPAPPTHLSGYVSLTAKLSHEIEQLRQRLHD
TAPQWNI
>BTH_II1061 Bacteriophage lambda tail assembly protein I
MNETLCTIRLHSTLGVRFGRIHRLAVSSTAEAVRALSVLIPGFRAFLMSA
RDDGLTFAVFNGRRNIGEDELEHPVGRDEIRIAPVIIGSKRGGLFNTILG
AALVAVGAIATFGFAQPWGASLMGLGASMALGGIVQMLSPQQAGLAGAAN
NGTSYYFNGPVNSAAQGEPVPLVIGEMIVGSKVGSSGIYAEDQV
>BTH_II0902 conserved hypothetical protein
MKTSRLIKSAALASLASLVLVGTLGIRAAVADSGDDCRAPLADWKPRDAV
HALAQQKGWRIDKLKADDGCYEIKGHDAGGKRFKAKLDPVTLDVLRMKRE
GDREREHGHDDGDSDDHGRAPDARAAAGGPPAGAPPGGVLKPGSKPDVQI
R
>BTH_II1056 gp14
MATSLRELIVSVTANTTEYDRRMRGLSSTAGSYFNAVRDGGRTADAAFAS
NAASVQVTVRALDAARSSIREYAQAAAAAFGVHQLIEYADEWTNLSNRLR
IVTRDEIDFAIAQNDVLRIARDTRQPLDATAELYQRIANNASHLGLSIKQ
VGPLVTTISKAVALSGVSADTARMGLVQLGQAFAAGQLRGQDLNSVLEEL
PGVADAIARGMGKSSAQLKSMAEEGKLTVGNLVEALTRAAGGTDTLFEKM
QATVGQTMTRLQTEIVKYIGESDQATGASARLAQGIAYVAEHLDGIVKLG
VSLAAGRIAVYFGQSAVAATQAATAWVGARRALVEETIKQHEAAQAALAK
AQGDRAAAAAKLQNAQAAEASAQAELAGMRAMRESLAMQSALTAGSIKYT
EAKLAEARAVEATAQAHVATARANVAGSQEIGARIAGTPYAAIIARETAA
AQQELERAEASLALALQRRTALEAAAKQGTIDKARYTASLAETDRGLAQA
ERDVALATQARERAERAATATAAGLKTATESAATAQTALARTGTMMRSVG
SGLLAAVGGLPGILATVGTVALGAAANWLLFRDNASSATSSLIDMQAPLD
QIIDKYRQLTPLLQESERLRTKQEASRAADDAQSAYRSLATRAAQSVMVP
AFGDAPSVVSDADQAALDRFLAGLDRLKTSNLGVDEKSREIGRLIDRFVS
ATSGGEALREELVRAAGAIDTAGLASQKGAQALAAMDAAARGAAEGVRLL
SDANNFFAGGMASEAWEKYVHKLREESDVIGMTARQKAEYEARTKGANDA
QARMAGLVAGRADAYKSLEKAIADKDAKAAAGARTNIDNLTRELALMNQQ
MVVAAALADFQAAIRTNNFGKFAKYGFGEKSGDVELAAMVARAEANGRGQ
QAFDETIASAAAQTARVSTNAAAARVTKGGGVHSLESERMLDNIRQRIAQ
LRVEAVATDKLTQSQKDLLAFDQKVTDLRSKRKKLSDDDKSLLRDQQAIR
GMYEQASQLEKEVRYRDAINKLKERSAQIDAELGDYAAERQRDVQRELGA
MSMGDNARELNQAINRVSDEFRRRRDELTKGARKDGTLGSPEYIAEIERI
NTAEAEQVARERGYLEQRLALQADWRVGVKRAMAVYQESAQNAAQMAEEA
LTSSFRNAEDALVSFAASGKLNFRGLIDSMIADLARFSARAAMSQVFGAI
GSALGFGGVSDAVGALGGAASAAVGSNAYGFHLATGGAVWGPGTSTSDSI
PAQLSNGEFVVRAAVVSQPGVRAHLERLNAGGRSGFARFAAGGLVGGSAG
GGDSPARNGGISVSAPVSIEGGSSNPASLIAVGEFRKMLEQMIRELIQRE
RRQGGTLWRAQNGIAG
>BTH_II0137 Protein of unknown function (DUF1316) subfamily, putative
MRHPEGRHRAAYLPSLLDRLQDDAPHSFSESPDAYAPSADEMRRIVQRDL
SLLLNTSNLDDEVDAGRYPLVAASVVNYGVPPLSGSYLHDPNRETIDRLV
RTAIVHFEPRLIADSLAIRPLAAQQGASYNKLTFEISALVQWSPYPLELR
IQSTFDLELNRVTLDKTTLNGK
>BTH_II2314 conserved hypothetical protein
MSSRRAAQSGRMTTGAMAARRFRRHGSRVAGRAPSRATTIHGGDEMRRSR
SPKPAKYAYADVPRRPRGFARTTAMRGDGLRRRAVRERAARRLSRELDAT
LRRASTYPHPAGRIVRIETHISVVYLVGRFAYKRLKPFDFGFANFGGLAA
RRRACEAELALNRPLAAPIYLATGPVVRRAHGLRLFGAGAAVDHVVRMRR
FDERMLFSRLLARGALGAADIDAAAARLAAYHLHAPRDVPRRAYGSAREL
RKQIDDVLAPLERALGPALPPALRAWCARRCDELAAHLDARRADGYVRAC
HGDLHLDNVVKHGRDALMFDCIDFDDALRWIDVINDLSFLLMDLHAHDRA
DLAHRLLNRWLDETGDFAGLAALPLYVAYRALVRALVATMRAGDDAAARA
ERARRYVDAAAHAARARRPCLLLCHGYSGSGKSVASRALADVSGAIRLSS
DSERKRARPFAAVDARPLAASAYTAQQIDAQYERLRALARDVLRAGYTAL
VDATFLSHARRARFAALARETGVPMFILDFHASRACLERRVDARAAARND
RSDAGAAVLATQLATADPLDAGERACTIGFDTDVPLATIRSAGYWRPALD
ALDAADANAPATC
>BTH_II2323 conserved hypothetical protein
MTNPFVQWLRASAPACALAIACVAQTGAAFAARATANVAANVAAAVTAPP
SVALPDAPAGATSSGTSAAIRGTVVDAQTGKPIAAAIVTIDGHPIRADDQ
GAFSADTAATDIAARAPGYLAARAPIEAGRPVTVALAPFRPKAVYLSAFG
ITSKTLRDAAVNLKDTTAINALVIDMKGDRGVTPYPSAARRASGAAAQAP
NAPVVRDFAALVADLHRRGLYLIARIVVFKDDPLAAAHPDWTVRDAGGDI
WHDREELRWIDPSLREAWTHNLDVAEEAAKLGFDEIQFDYVRFPDARGLR
FSVPNTRANRTAAISGFLQAARERLAPYNVFIAADIFGYVCWNEDDTAIG
QQIEMLGGPLDYISPMLYPSGFTWGLPGCTQPTADPGQIVRRSLAEARSR
TKLPGVRFRPWLQAFRDYAFDHRDFAAAEIRAQVDAAEAADTDGWMLWNA
RNRYDPQQLPK
>BTH_II0873 ImpA-related N-terminal family
MGMNERRQPGGAASGALLPEDFDALGALGRADIDPAAPAGADVRADARFD
ALHAELAKLASPGASGHVDWRAAMSLAAGLLRDRGKDLLVGCYLAGALLQ
IGGAAGLRCGLEVVGDLVERHWAAMSPPVSRMRARRGALQWLLDRVDATR
DAGAAACGAACSVELVEQLRAAARRIDALLAERDDEAPTMRAVTAFAARL
PVESGESGEPGEPGEPGEPGEPGEPGESNSTHAHGSVGAPAERAALSFAE
HASIEPAGRAAPRANADAARHPASLDDAAGRERALADALAQLHRIATAFA
QADWADTRGFRLRRIACWSSVHAMPDTEADSGRTRIAAPNAQVVDVAKGI
DAQGDPAAAVRFAEEHAQAFPLWLDLQRIAARALARAGGDCTGAQREVET
AVRALLMRLPGLDALKFADGTPFADAATRAWLAELCTPIGAANAALTSPP
PPSPPAPSLPMTGESDRARGDARDANADDAHARARALAASGRLDLALGAI
QQAIDRAPSAERRLRARIRLCEFARDHWEHEIPDAFARGVIEPIRRHDLL
AWEPELALDGLSAAYALLIRRDGDSAHARAVLNEIAGVDAARAMRLST
>BTH_II0400 conserved hypothetical protein
MRVGRPVPALPQPVCDYCGAKALLARFGDDAYPYREDHGELWICAPCDAW
IGVFARSRRHVPLGRLANAELRHAKSELHAALEPLVAAKMRRDGCNAFEA
RAKGIRWLATQLGLDAASSTIHTFDLDACRNALRLVEQFTSRKSLSSNS
>BTH_II1896 conserved hypothetical protein
MTTLERVSRALTPRRTFFELMRRVEALQRKHDKRLARKRRMPKWLRIEQP
AEMHFASTEVERVHVALARFIEDDDHPQVTVVQRHFGLFAPYGPLPLHVT
EHAMQEKRFERNAAFERFVNVVCGDLAWLHYSAWSSMHPVLGYERARNPF
VERVTALADARRAQQEGGEPFEQHALACRRAFPGIYCAPRRSLADLQRML
CAYFGVALQVVPRHGRWVPVPAAQSHARRLGGWRLGARIWDVQHSVEIVV
GPIEADEFYRWQRRAAAVMALSAVVTDFVDGRIYPVIKVQVWTRPELAGR
VGCMRVGVDAWSRPNRALRTLTVFESFRD
>BTH_II0262 Bacterial protein of unknown function (DUF879) superfamily
MDTRLLDYYNRELAYLRELGGEFAQQFPKVAARLRMHESGPPDPYVERLL
EGFSFLTARVQLKMDAEFPRFTQALLDAVYPGYVAPVPSMAIMQFTPMMN
EGSLAQGYRLPAGTALRARPAASEQTACEFRTAHDLTLWPLELTAASVTG
APAYLPRSATAARRDVRGALRIRLKACGGASLAKLPIERLMFHLAGPERD
ALHLLELIAGHTIGVVCHDAAQPPRWLHALGADALAHQGFDAAQALLPDD
GRSFQGYRLLREYFAFPARFLFFSIEGLRPALARATGDTFELTLLLDRHD
AALENSVDARHVALNCTPAVNLFARRGDRIPVHPGAREHHVVVDRSRPLD
YEVYAVRRLAGEPRDDAQAREFRPFHASFAGDDGNYGAYYTVRREPRLVS
ARARANGTRTGYVGSETFVSLVDSECAPYDESIRYLSADTLCTNRDLVLL
LPAGDANAFTLRVSAPVERIAAIRGPSRPRAPIADAQTAWRLVSHLGLAR
QTLTDVDDEEGARVLRELLGLHADPADAAMRRQIDGVHRVAFTPVFRRLP
AAGPLMFGRGVQVDVTVDDHAFSGDSPYLLGAVLEQFFARHVSINSFAEC
VLSSAQRGRLAQWPARVGRRPAI
>BTH_II0124 lipoprotein, putative
MVSQFFFRREAARRTAVKRIVAAMVCASLAACASLPARKETADLIVKIKV
SEGANPDEHERPAPVMVRLYELKSAGAFENADFFTLQSDSRKVLGDDAIA
TDEFVMRPGDTRDIHRRADSAATSIGVLVGYRALGKSVWRAVHKLPPVPE
DAWYRAFTPRTKIKLNVDVGQQTVSITELE
>BTH_II0068 DNA-binding response regulator
MRILLVEDDVPLGDGIRAGMRQQGFQVDWVRDGDSGPARAARAWSRRDGP
RSRFAARGRHGCSATHSQTIMRMTDTTGQCRRVAGARENGRWRTVRRVRY
RTRRTGRSLAGGAACGAAGLAVHVAGTRAHPHPLPAMHAAHGGRVGELDS
ARVRTLLHLGMMFGADFLALGIRRCRAHPRTVFNPLGRRHQLTAVNHASR
LDGRALSWLARLGVGQTGRHRDRDADQARRSERIRTEHGNLQTICVHSTV
HGKAADPLCQIKWRRVAANTIEMHAGHHVDVNCTVHHRAARVAAWRAALI
GPLRSRSAMSNSLRYRERTVELADQMRKLRQSQPDPMSAFGALAVAGAKD
GALTKKTRGLIALGIGISCRCGDCIGFDRQSPPIKRKATREEVEEAAGVA
VYMGGGPSMMYAAPTLMAVGEPAA
>BTH_II0619 GatB/Yqey family protein
MSLRDQISEDMKAAMRAKESERLATIRLLLAAIKQREVDERVTLDDAGVT
AVVDKMIKQRRDSISQFEAAGRADLVEKEQAEVAVLTAYMPAQLSEAEIA
AEVQAAVAQTGAAGPQDMGKVMGVLKGKLAGRADMTAVSALVKAALSK
>BTH_II1455 opgC protein, putative
MAQCATIVVRIQSCSDAALRHAPFVRPRFRRSSPAPRPSRVAQMRRRMNA
APAQRYAELDFFRGLVLLVIVVDHIGGSMLSRVTLHAYALCDAAEVFVFL
GGFATAIAYNSLAARHTEAAARQRFIKRAFEIYRAFLFTAGLMLFITAVL
NAFEIDAPNMPINDLDGLMHAPLAALRDILLLRRQPYLASVLPMYTFFAL
LVPLALPIARSRGWWLLVAASAATWLGAREIAAYLPTVDGVPWDFNPFAW
QFLFVLGIVARCQPIYPVLAKRPVGWFATAAALAVVAAGAYYRLRIEPFP
TDPSIKQNLGALRLANFIAIAWLAAKLIHLGWMHRIARAMPWIGTIGRQG
LLCFVAGTGISLLVDSLLYTATDGYLDVRLGLVADAAAVGLLYVVAKLYA
PLVARASDFVRQARLLRPQRPFRLPLRRPKR
>BTH_II2238 OpgC protein, putative
MSSQAPAARSIEVDFFRGLVLLTIVVDHIGASVLSRVTLHAFALCDAAEV
FVFLGGFATASAFLAVSARHGPGAARRRFVRRAAQIYRAFLATSTLMLVV
SAVLDHYGIDAPNMALDDISVLLASPLTGLVELLTFKRQPYLASVLPMYV
LFALATPAIVPLARAKPWWLLFGSVLMWGCAPWLAAELLDTDSFRWSFNP
FAWQLMFVLGALMRCWPLHRDVATRPGGAAITAIAFAIVLACAYYKLCAG
LPLPEGELKRHLAWPRVMNFVAFAWLMAELVRYGWIARVARAAQPVVAVG
QRGMPCFVAGAAISLTLDSVLHGVRGNARLPQLGMGLAADACALALMLTV
AHSGPLFRRKRRSVAA
>BTH_II1921 Bacterial domain of unknown function (DUF403) superfamily
MLLGRTASGLYWMYRYIERAENTARIVDAGLRMALTRTSDAPAEWSSVLV
SSGADDGYRRKYETYAADTVADYLLRDRDNPSSVLSCIECARSNARMVRT
ALTREAWESVNGAWLAIKRALAQPIRASALPAILDEIKRETALILGSFYS
TMLRNEIFDFAQIGAFVERADNTARILDVKYHLLLPSVSHVGTILDNYQW
ESILRCVAAHRSYRWVYDVQYKPLNIADYLILNGRMPRSLRYCYGRVVSS
LEHLAKDYGLTHGCHETAANIKRSLEDNTVERVFKSGLHEFLTDFIAKNN
SLGLEIAQAYNFD
>BTH_II1958 efflux transporter, RND family, MFP subunit
MNNVLATIAIGFALAVSNAAQAAGEMGGMDMQGGAQQAGAAHAGMSHGEV
KKVDAAAGKLTIKHGPLENLGMDAMTMVFKVKDPAMLSQVKAGDKIDFVA
DEVDGALTVVKLIKQ
>BTH_II1269 conserved hypothetical protein
MSQPHLHLSPTAIYPFDARGVAKRFRHAAIFGAIDALQSGETMRFVNDHD
PLPLLEQIRQHYGERVGIEYRQREPGAIVIDFVVQ
>BTH_II1436 Rhs element Vgr protein
MKMSDIAGFLSLQNSRLLTIKTPLAGRAELVLSDFQCSEGLSVLFDMRLG
LASRDPTIELKQMIGQAVTISLQPPGGIVGGSARHFHGYVTQFSHTGADG
GLATYSATVQPWLWMLSRRVDSRIFQDKSARDILDEVFSQYSALASYEFR
AGRTLKPYSYCTQYRETDLNFVLRLMELEGLFFYFEHAEDGHKLIIDDDS
TRAKPIDGLPSLRYASGEILEDEAVVTQWAAQRQLMSGAVSMKAYDYKVP
AARRYVSGESDFNQGEVERYEIYDYVGLHGFDSTDRGEELARFRLESLAA
SGKTFSGTSTGRTLAPGRYFELSAHYDHDNGPMHDRQFLLTNVRHHGVNN
YQSNEGSGSYHASFQCIRKKIPFRPPLAHARPVIPGPQTAIVVGPKGEQI
HTDALGRVKIQFHWDRIGQRNQGSSCWVRVSQPWAGGGFGSVQIPRIGDE
VVVTFLDGNPDRPLILSSVYNAQNMPPWALPAGATQSGFLTRSHKGTSEN
ANAIRFEDKLGEEEIWLHAERNQRIEVEHDESHTVGAKRTKTIGADEIVT
IGGAQTHTITGARTQTIGADHTQTIKGAHKQNVAGTHAQVIGGNASITTT
GPQPGAAGTVGDIEIQSSQGKIHLKAATEIVIEVGASVIHLKADGTIEIS
GPTHIGLNSKS
>BTH_II0260 Protein of unknown function (DUF796) superfamily
MGVAMFMKVDGVTGESADAQHKGWTDIQSFSWGASQPGAMASGSGGNAGK
ASFNDLVVAAYMDKGATAIIKNCANGKHLSSVEISACKTGGSQIEFMRVT
LQEVLVTSAQIAGVDPGDAADRLMMQYGFQAAKVKKQYWQQNDNGGKGAE
VSVGWNIKENTEM
>BTH_II0136 Bacterial protein of unknown function (DUF879) superfamily
MDPQFLDHYNRELTYMRELSAEFAAQHPKIARRLGMQGIEVADPYVERLI
EAFCFMSARTQLKLEAEFPRFTQRLLEVTYPNYVAPTPSMAVARLRPSLR
EGDFSKGFKVPRHSMLRSSIPPGEQTACEFRTGQDITLWPIEIAGATLTA
VPPDLPDLQRSLLPHTKLRGALRLRVRTVGEIKFSQIAGLDRLSLYIGGD
ERIASHLFELIHASSVASVVRAQGAARGEGVVVAKNAVDFEGLSPDQSLL
PLVWNTFHGHNLLHEYFTCRQRFYFFALTQLNAGLSRIDGKEAEIVLLLD
RLPDELVTHVEAARFLLFCAPIVNLFPKRTDRVEINRAQTAFHLIPDRTR
PLDYEVFSVSRVFGQKAETSTEVTFNPLYQTLHSDIGNYGRYFSILREPR
TTSTNARKYGTRTPYVGTEVYVSLVDQAEAPYADDIRYLSVDAWVTNRDL
PRLIPRNGVNDLTMQDSVPIEGVSLVHPPSAPREPYATGETAWRLIRQLS
FNYMPLAELDHRDGGQALRNMLRLFVGTSEREQATQIDSLVGARTEPVVR
RLPGHGLLVYGRGVRCELTVDESGFSGLSPYLFGLVLEQYLTRHVSINVF
TETELRSMQRGLVTRWKPRMGGRGAV
>BTH_II0252 Bacterial protein of unknown function (DUF876) superfamily
MNEPVLSATPAAALRQRVIWTEGMFLRPQHFQQLERHWERYVGMRCLPLQ
GFYWGYDALQIDRELLALGKVALLAATGVMRDGTPFDLSHPDDRPEPLDV
PADAKDQLVVLALPLWRGGAQEVSFGGEGNGSGNGSGNGNAGAGFARYVV
REHEIADANEVALGPALLQTGRLNVRLMLESELTGDWEALGVARIVERRT
DGRLLVDDGYIPPRLVAQRDPVLLRHTRELHGLLTQRSEALGERLSEPGR
GGVSEVADFLLLQLVNRYLALTWHAQQDVAAHPETLFRDWLKLACDLSTF
TAAGRRPQSLAIYRHDDLRASFGELMAELRRSLSTVLEQNAIQIELRDAG
NGMKVATIADPALRDTAGFVLAVRADVPADSLRARFPAQAKLGPVERIRD
LVQLQLPGIAMRQLPVAPRQIPYHAGHTYFEIDKGGEMWKQLERSGGLAF
HFAGEFPGLSMEFWAIRG
>BTH_II0703 YihY family protein
MQKRQTTPARDARPYAHPMKSLIPQQPQKLVRHNVNWALDAFRRFSADRC
SSMAASIAFYSAFSLAPTLVMVIAVAGWFFGADAARGEVFSHVHELIGNE
AAAGVQTIVENAHRSGSRGGTAALISFAMLAIGASATFASLNTALSVIWP
ATETRASSVLGLVRVRLISFGLVLGVAFLLIVSLVLDTAITFIGRWLWGA
SPYVAIGNLLQFSIGIAVLAFAFGTLMKFLPDARVSNRDAMTGGIVSAVL
FSAGKKLFALYLAHAGTASAFGAAGSFAVLLMWLYFSAAVLLLGAEFAAA
RGAAHASIDAASQADAAPARGGPLD
>BTH_II1207 carboxymuconolactone decarboxylase family protein
METRLDYRKANPHALNAMLALEERIAQSGLEPTLIELVRLRASQINGCAY
CVDMHTRDARKHGETDRRLATVVVWREAPFFTDRERAALEWTEAVTLVAR
DHVPDAVWEAVRPHFTDAELVDLTLAVATINSWNRFAVSFRKLPA
>BTH_II1714 Domain of unknown function
MVYVKGDAHRQRRISLSRAGAESARTRRWPTHARRFRLPNFLQLTAFATG
IVRSLELQNSRLARRLRRRRGVRCRRHPTCFATGGAFMTQSSHTVKIPGP
DHPITIEATGERVVVKAAGQTLADTRDALTLREASYPPVQYVPRKDVDLA
QLERTTHESHCPYKGDASYYSIKGAGARGVNAIWSYETPHDALKRIAGHL
AFYPDRVDSITIG
>BTH_II0360 carboxymuconolactone decarboxylase
MYPQPGPEIARRRRELAPEALAAFRAFSNSVFADGALPTKTKQLIAVAVA
HVTQCPYCIRGHTKEALKAGATEGEIMEAIWVAAEMRAGAAYAHSALALD
AMKEEAQSREAQHGEHH
>BTH_II0254 lipoprotein, putative
MSHAVARIVRPLPSREIWTFAGLVGLACFVWLAGPLFAFAEFHPFESGWA
RALTIAALFVAWGARIAWRNWRAGQLNAQLLNQLREASPRPAAPGDPARA
QLDELRSRFDEASTLLKKVRFGAADGARKGLPQWLERMSRQYLYQLPWYV
FIGAPGSGKTTALVNSGLSFPLAEQFGRAAIRGVGGTRHCDWWFTNDAVL
IDTAGRYTTHESNRALDEAEWNGFVDLLKKYRARQPLNGAMLTISVADLL
GASEAERTQHAMVLRKRLLELRSQLGIRFPVYLLVTKADLLAGFAEYFGG
FGRAECAQVWGFTFPLAESEAPGFDLRAAFDREYRLLHKRLNDGLPELLA
SQTDAHQREMSYLLPQQIADLQDMLGQFVAEVFSVSSFEPMPMLRGVYLT
SGTQEGTAFDRVMSGIKRFLKIEGVPPAAQTGSTGRSFFLKSLLQDHIFR
EAALAGSNLRWHRRQRVLQIVGYAAIVLLCVAVLFAWLRSYSRNRGYLDQ
VAARVPAVDAQISRAKFTGAADVVQLLPVLDELSGLPSAGGVDLRHPPLA
YRWGLFQGEKIEEASDAVYRRALDDVLLPIAASRMEQALREARPDEVEYA
YAALKAYLMLYDGAHYDPAFVQAVVDLEMERSLPADFSSAQRSALRSHLG
ALFGNRVAVSPFPMNERLVADVRERLRQVPFSQRLYRQLARTLRPSTATY
DFSVARAVGPDASLVFRRQSGKSLADGVPGLYTRDGYRNVFAPRLPGAID
AYGREEVWVLNLGASETPNPADAAAWARDIRQLYLNDYIKNWDDYLADIR
LQHTSTLAQSIQVARTLSSADSPLTRLMVALARVTPLGDAPGGARNLASR
AQDKVDEARNSLAQIFAGQPGADAGAAAASPASPEQIVDSHFAGLRAFAP
GGGDQAASFDAVLKAIDALYTYLTATDDALRGGATPPPSDAPARLRAQAG
RLPTPFREVLDDLSNVANGSVASVEQRNVAQRAGANVGDFCRQAIAGRYP
FARGASRDVAPSDFAQMFAAGGLMDDFFQKNLQTLVDTTTHPWRFNNRNA
EADPAAAAMLGSFEKAAVIRDVYFGGGARTAQIKVEIVPLEMDPSISEMV
LDVDGQIVRYAHGPQVPTAVQWPGPRGSDQVRLQVTEQSGATGGFTTEGP
WALHRLFDRAGVSGGRGPEQMVARFAVDGKPIVLQVTASSVRNPFRLPQM
ESFTCPPKQ
>BTH_II0976 ygbK domain protein
MSTDQAFRPLLGCIADDFTGATDLANMLVKSGMRTVQTIGVPAAGAPVQA
DAIVVALKSRTIAAADAVAQSLAALEWLRAQGCRQFFFKYCSTFDSTDAG
NIGPVADALLDALGGEHAFTIACPAFPENGRTVYRGHLFVGDALLSESGM
ENHPLTPMKDANLVRVLQRQTRSKVGLIRHDAIALGTSAVRETIDTLRRE
GVRIAIADALTDLDLYVLGEACADLPLITGGSGVALGLPSNFRLGALLPE
RGDAAALPAIEGASAVLAGSASKATNAQVAAWRAARPAFRIDPLAAARGE
PVVEQALAFARLHLPQPVLIYATAAPDEVKQVQQALGVEAAGHLVEATLA
AIARGLRELGVRKFVVAGGETSGAVVQALGVKALRIGAQIDPGVPAAATT
EGSPGGATEGSPRETPRAQPLGLALKSGNFGSIDFFEKALRALEGAA
>BTH_II0234 class III extradiol-type catecholic dioxygenase, putative
MLISRPAARPASCCSCRDTPSFARSPTHRTGCLFMGKIIGAGLISHAPVV
MMPRAVRLRENDGRDFTLATGLARLRREVFDAHDYDTVLVLDSHWRTTTE
AVVTAHARRTGRFTSDEMPNAIRQLPYDLAGDPELARAIAELATRRACWI
AAVDDPCLPIHYATLNPWTYLGRPDKRWISMSVCQTATTDDFLRMGEIVA
QAISRLDRNVLLVASGGLSHAFWPLAELRRRMAGAASNIVTPAARAADER
RIAWLEQGRHDRVIDAMSEFLRFDPEANFGHYLMMAGAIGARACAARARR
FSEYENGIGTGHVHLWFGPVDGGWTRAETRAEREAARA
>BTH_II1922 u1937b; B1937_F1_4
MKPFDEMLQPGDTVRAPYERLKQWLDTQDPASLAQKAHDAEGVFRKTGIT
FAVYGDAEAAERLIPFDIVPRIISGREWNRLSQGIEQRVMALNAFLDDIY
HRQEIVRAGIVPKHLISHNDAFLPEMIDFRPPGNVYTHIIGVDIVRTAEN
QFYVLEDNARTPSGVSYMLENRETMMQLFPELFQQVKVRPVETYPQLLRQ
SLAAVCPPGGNADNPTVAVLTPGIHNSAYYEHAFLADQMGVHLVEGSDLQ
VIGDRVAMRTTEGFRPIDVLYRRLDDAFLDPLTFRPDSVLGVAGIMDVYR
AGNVTIANAPGTGIADDKAIYSYMPEIVEFYTGRRAMLENVPTWRCAEAD
SLKYVLEHLEELVVKEVHGSGGYGMLVGPAASKAERDAFAAKLRAKPSNY
IAQPTLALSTTPILTERGLAPRHVDLRPFVLVSDRIRITPGGLTRVALKE
GSLVVNSSQGGGTKDTWVLAD
>BTH_II0364 Protein of unknown function, DUF488 superfamily
MKHAIEIQRVYEHAGDDGHVHFLVDRLWPRGVKKESVKLDAWLKDVAPSN
ELRDWFGHDPQRWDEFRQRYEHELDVSPASWQPILDAARKKPVTLLYGAR
DTEHNQAVVLRDYLLRQLQLHRH
>BTH_II1317 hypothetical protein
MPDFILLTRLSPEGLRSPSSLETLEKRTVKEIEQACPGVEWRHCYAILGP
YDYLDIFSAPDIETAFKVSAILRTLGRSHAEVWAATEWRAFKAIIDSIG
>BTH_II1406 Protein of unknown function (DUF636) family
MQTGDSPNYEGGCTCGAVRYRMTSRPLIVHCCHCRWCQRETGTAFALNAL
IESDRLLLLRGEVDIVDTPSNSGKGQKIARCPHCRIAVWSHYAGGGGAVS
FVRVGTLDEPDRLSPDIHIFTSTKQPWVILPPEARAVPEYYSSDEVWSAE
SLRRRAALRAKRER
>BTH_II2008 DNA primase
MSNTPMSEFERASVALGYVPADDRDTWRHAGMALKAEFGEEGFALWNEWS
QGAQNYNARDTRDVWKSFKGGKITINTLFHLAKLGGFDPRAHRAKPVDPE
QRERQHAERAAREAAELAALAEKQQAASALAESIWSAAEPAPTDHPYLVR
KRIPADALRVYRGNLSLGTAACDGALVIPARDADGQLWTLEFILTDGQKR
YLPNGRKAGCFSLIGGPVSSVLLIGEGYATCATLAAVTGYPAAVAFDAGN
LHAVATALRGRYPDARIVVCADDDHATNGNPGVTKARAAANAVGGAVAVP
DFGPNRPAAGTDFNDLAAHLGPDAVAAAVRAALVPAGASDADRGKASPFA
TKPAKRPKTARAQDGTWRFEVDDEGVWYHGFNNQGDPLPPHWISTRIDVI
AETRNEMSSEWGYLLEFTDRDGIRKRWAVPAGLFAGDGTELRRMLLDMGV
KLGVTQTARTQIANYIQMARPDERVRCVPRVGWHHGAFVLPDRVIGTGKE
ALIYQADTPIQSQFKERSTLDDWRRDVAAYCVGNSRLLFCVATAFAGPLL
HFSGLQSGGFHLLGTTSKGKSTGGVIAASVFGSPDYVRSWKATDNALEAV
ATQHSDALLILDEIGQVEPRLVGDVIYMLANESGKARASRNGSAKPVLTW
RLLFLSNGEKSVSALMAEANKPMKGGIEVRLPAIPAEVGDMGVVEELHGF
PTPAALIEHLERHAGKHYGTAGPAFIEFASAQADELAEHLRTRVDELVTE
WVPEGAHSQVARVAKRFCLVAVAGELATAHGLTGWPEGASVKAARRCFEG
WMELRGGAGNSDEAEAVRQVLHFLVAHGDNRFVWMNRAQDDHRPNVPHRA
GFKQHVKRDERRTAIASDREYYAEFGGKMGADDAEHVETEYLIEAAVFRK
DVCAGFDHKMVAKALMKRGVLMPRSDGYPYRQEYIPGHGKFMVYRVRPSI
FTLEL
>BTH_II0734 conserved hypothetical protein
MNETNPYFQEVIDAHVDIERWLSGHAEFDRLPALLGRFSPHFSMIATRGA
PLDHAGLDELFRRGHGQRPGLRIAIDELQQIGAWRGGAVIGYRETQTDGQ
GRTNTRRSTVVFERGAASRIVWRHLHETPLAA
>BTH_II0533 DGPF domain superfamily
MSYMLLIVEPRGQRAARTQAEGEALYERMRHFAGELQSRGVLIGAESLVS
DDKSTRVQVRNGEVRLVDGPYAEAKEMVGGFFLLDVGTQGEALAIAKDCP
AAEWCSVEVREIGPCFR
>BTH_II1429 serine protease, subtilase family
MARRNKQKTMKRRGATLLAPVVVAAAAAVAARPGWTQAAPYQDPGRRGDP
ASWRTPEFTNAWGLGAMHAEYAYAAGHTGANVAIGVLDSGYYAQHPELPG
SRFVPVTAAGVSGVLNANNNNHGTLVSGVVGGVRDGVGMHGVAPDATVFE
GNTNATDGFRFGVSDPKFPASDAKYFSEVYDALAAKGVRIISNSWGSQPA
NENYSTLNTLTDAYKLHEAVRTATGQGTWLDAAAKVSRDGVINNFSSGNT
GYDNASLRGAYPYFHPELEGHWMTTTGYDQLGGQVYNKCGVAKWWCVMAP
TGVPSTSYSGGAAAPTGATYANFNGTSAAAPHASAALALIMERFPYMTGE
QALSVLFTTAQNMEADPSRPDYTNNGLFSPVHPAKPGASGVPNGFGGWGL
VDLRRAMNGPGQLLGTFHAALPAGVADVWSNDISDVALAARKLEDDAEHR
AWLDTLKTKGWERGLPAGASDGDWIDYALGVARDAAYQAREYQGSLVKSG
GGTLTLAGANTYRGLTTVDGGELRIDGSIAAGAVVNPAGRLTVTGRAADI
AVDGGVATIAGTSANLSVDRQGRAAVTGTTADVRVANGFASLGGTSGNVA
VGALGVTVITGRTADVAVDGGRASLDGASGNVSVGNGGIVNGNGTVRTLT
AAANGTVAPGHSVGTLTVSGDVRFAPGSAYAVEVSQGGASDRIVAGGRAQ
IDGGALTLALENAPPPLTPDQSRSVLGRRFEILNAAGGVAGRFDAPGGYL
FVDPVLAYGPTSVSLTIDRNATPFASVARTANERSVADALETANPGSAVY
NSVLLAASAQAPQATLSQLTGEIYPAAYAALVNESRQVRDAALDRLWAAR
GEPGRAGAWARLLGSWGGARGSGDVNGYTSSTGGLLAGADAAVLDGVRAG
GFAGYRHTGVNLRNQPSSASFDSFQLGAYAGWQPGALGVRVGAAHAWHRG
GVDRAVQYGTVAESETTTLHAETTQVFGEAGYQLALGGAATVEPFFGIAY
VHLKNEGTTETGGAAALRVQEGNHDVTFSTLGVRGETRLGLTSRLQLTLQ
GSAGWQHALTDGQPTGALAFATGSNTFIVASVPVAKDAAVLNVGAGLELG
KNGLLRVGYSGALASRQSEHAVQGGLHWKF
>BTH_II0227 conserved hypothetical protein
MNGVDAKRFVNRQRERGAGAGAERARCGSGSGGGGGSDVAINRRRLRRCV
RRTHLCHTRRPILSTGQIGRASLMVKKRIRAMFAVALVGLAAHAAHAQYT
TDWIANTYGTIASHVGNNARSMWVSPEGVIYTASFWDENAGGVAIYQNGK
TLGSIGTHAEFQGGAITGNATSIFAAMQYGTPQGSGTVGRYNRATLQRDL
TIPVSVWNAVSRADVITGLATAGTLLYASDYFGNRVRVFTTGGVWQRDIG
IANPGALALDDAGNLWVAQKNAAKIVEFDPSGALMNTIQMASASRPASLY
YDASKRQLMVGDQGPDMNIKLYAIAGVPRQIGTFGVQGGYLDTTTGIKGQ
VGDRRFTRVVGIGKDAAGTLYVLNNPWGGGWDLGRNGATDIHAYDALGNA
LWKLQALNFEAIAAPDPTTDGALFYSGMNIYSGTAGGTFIANTVDPFTYP
SDPRLDMNDYQRGQHFGQLVSVGGNRILVASGQNPGNFNFYHFNAASGYI
AIPDASLPGKGFNTSLQVTAGFSIDNKGDVWVGLNGTNAISHYPLAGIDA
NGKPSWGAPTSIPTPASVQPTARILYLSDSDTMILAQGIAGSWDWTAMNG
RIEVYHGWSAGNVTQPNPVIALTSANPKSIAAAGNYLFVGYVHTVPNIDV
FNLNTGQLVATLINSNSGVMDVGNDVDSMYGLRAYLRSTGEYVITKDNYN
GSSIVVYRWRP
>BTH_II0559 conserved hypothetical protein
MTFYPQLLRNRPRMVIAAAAGVAFGLLFPYPLRPFARVLIGWDCTIWLYL
VLMWVRMVRAHHHKVREIAMREDENATIVLTIICFATVASIAAIVLELVS
AKSVGFRSGLGHYAVTGATMFGAWFLIPTIFTLHYARLYYLSPKEARAMA
FPDRELEPDYWDFLYFSFTIAVASQTSDVSLRGRSIRRAALAQSILSFYF
NMAVLGLSVNVAAGLLG
>BTH_II1423 Uncharacterised protein family (UPF0187) superfamily
MIVRPREHWFRMLFVWNGSVLKSILPQLALMSAVSVVALLTNGRILGEKV
PLNPTPFTLAGLALAIFAAFRNNASYDRYWEARKLWGGVLTAARALTSQA
LGYDASADGASFARATAGFVYALKHQLRGTDPAEDLRARLPADWLEPVLA
ARHRPVAILHALRGRLAGRHRGGALTDTQLWMLDAQLNELGAKLAGCERI
ASTPIPFPYHVLLHRTVYAYCVMLPFGLVDSIGIATPFVSVFVSYTLIAL
DAIAGEIAEPFGDGPNHLALDALARQIERSLLELAGLPLPDEIRAGPSYR
LS
>BTH_II2213 dedA family protein
MLHDLVARFGPLIVFVNVLAAAIGLPVPAMPTLVLFGAMATLHPGAIGAQ
LAPVLALAVLAALIGDTVWYVAGRHFGGRALKTLCKLSLSRDSCVKKTER
FFGRWGVRVLAVARFIPGLSLISVPMAGALGTRYRIFVGYDGLGALLWAG
CGVAIGFVFAKQIDWLFAGANQLGRTVLVVIVALLAAYTAVRWMRRRALI
RQLANARIDVDELDRLLQADPTPVVFDARSPEHRKLDPYAIPGAQFADER
DLRDIVAHYPATQKFVIYCSCPNEVSAAVMARRLKQAGFADALALRGGLD
AWRDAGRRLIELDPQPGGEAPVRAPAPKTA
>BTH_II0884 Uncharacterised protein family (UPF0261) superfamily
MVPHRKQIYVAATVDTKGAEAHFVKDRIADAGLAAVVVDLSTRAPGLAAD
IGAAAVAAHHPDGAAAVFCGDRGRAIAAMAVAFEHYIRSRDDVAALIGIG
GSGGTALVTPAMQALPVGVPKLMISTMASGDVSAYIGSSDIAMLYSVADI
AGLNRISRQVLANGAYMIAGAVRDMQPLPADLKPALGLTMFGVTTPCIQA
VTSRLDARFDCIVFHATGHGGHAMEKLADSGLLDGVLDLTTTEVCDLLMG
GVLACGDDRFDAIARSKVPYVGSCGALDMVNFGHIDTVPPRYAQRLLYKH
NPQVTLMRTTPDENRRIGEWIGAKLNACDGPVRFLIPEGGVSALDAPGQA
FWNPEADAALFDALEATVVQTENRRLVRVSAHINDPLFADIAVEHFLSLH
AAHRN
>BTH_II1998 unnamed protein product
MEFPIAVHKDDGSVYGVTVPDIPGVHSWGETIDDAIKNTREAIVGHVETL
IELGDDVEFTCSTVEELVAKPEYAGAVWALVSVDLSQLDSKPERINVSIP
RFVLHKIDAYVASRHETRSGFLARAALEALNEGKVRHA
>BTH_II1892 conserved hypothetical protein
MLGISVGMGFRLDQPSILVHEAAVWEALKAAAPSLPLYEAALPKQRAEWL
LAGHSVHAVGGGTRSRDIDWTAWVELDGVRKIVSCATQLGDEHEPGGCVR
VAIDHRHAAAGGAQENPFGMASGTPPLQQLRSFGVGPAPLAAMGAIGCDW
PERMQWMPTRPGTVDAMAQDGTHMGWPADVDLRFFQQAAPDQWARGECWM
PGARFELNGFGPRGEGFAGELPRLAPVALVTRNGRPGIERPTFKQQTVWF
LPDRGIGVLWWNGAVALDFLLDDSPTMLVIAFKDDAERIDVDALMKFADQ
RADLNCTDPLQQADHELMPAVAKGWTWEMILDTEDHPRFAPAPRGYEEVR
ARVEQNRRDLVEARDASERLSAFEEANRNAKLPGAPRGGENWRTRLRQAK
TPELANVTIRDADLSSLRFDGWKFDDVRFERCTLDRSEWTNCRLNQVHAV
DCSFADVKMSDGWWKGGKLQRCNLERSAWLNVEIERISLDECRLDDLKVA
GGSWSMLSVQGRGGVRGDVQDVQWNSVSWSEVSAPGWTWTRVRADDLAIV
ECAMAGLTVSQCTLAKPSILLTDLSASVWQRSMLTFAVLSHGTSINGARL
TDCVFKSSSLQELRADRVQVDHCSFMQLNAQHLHAQQSHWSRTVLDGANV
MHAQLTGTSFDRCSLKEAMFYGADMRQTRMRDCNLVRVRTSWIHPPEAGA
WRGNLNAGQLDVPRRV
>BTH_II0531 Bacterial protein of unknown function (DUF883) family
MALTDTVEYKLDRGLSEARRAGHRFARDARSAARDLNGEVKDNMRSLVDE
LDALLKEDVDADALRKRLRGRLEAARDTLDDASWHASRRLRRSAERVTQA
VHDNPWQAAGIVAGLAFAAGILLARR
>BTH_II1435 conserved hypothetical protein
MDGRARRDGLFPRASARATSTSTSRPDMRYSVNEGHLELPGQWLDRSVNA
LLPAMAEVTGSNLVLTRDELPYGVEFADYVDVQRAKYRKELSGLQMQRDE
PGVLDGRPCQFLAFTWNKDDLLIHQMAVIALDAPLVLALTYTSPGRLPDG
VRDAIGAALASFRFHRSAQLPAS
>BTH_II1822 PepSY-associated TM helix family
MDASLRSRWRAVHRGAGALFGVVLFVILFTGTWSLAQESMQGWWRPPALA
VAGPPLPLERLAARAAALGFSLRDARIVLPQPSDPAVRFCSARQVCALAL
NPATGEPLAEAARAMPLVTLHKTMFAGFPGRIFVSLWGIALLVLIVAGLV
LHRRHWPDSARVRRDRGVRIALFDLHGWIGLWGAPWLVLFALTGALSGLG
ALGTVSLAPVAFPGQPQRAFAALMGAPPPAAVDKPWSRAPDLDALLRRDA
ARAPAFRPEVVALHHWGDANASVEIAGTAAGLPSTALFERHLYRAADGQW
LADATSRGRGFWLRAFIAVQPLHFARYGWSGAAGGSLRALHFLMGVAACV
LCATGLVLWIERRHAQRDARARTLAALGAGVCGGLVLAGGVLLFAGRVLP
PGARADDALAALFWSTWLGSALLAARVRDRAALVRALMGAAGAAYLLAGA
AHLSIALLGAGAPVYAHVDAALACLGALLLRAARRPRRVAAPPAGLPPPR
SELP
>BTH_II1906 membrane protein, putative
MILLSLGTAAFVALLAWQGLGAVAATLASAGWGLALVAAFHLVPLVVDAT
AIAVMFRAGEPGSRLGDALRARWVGESVNSLLPAGQIGGPVLMVRYLAQR
GARLADAAASITVSTTMQALAQMVFALVGIAAFSAYATHGAASHLRTPAL
VATAVLGGCAALFYFAQRRGLFGRGLRAASKLLGPRDWSSLATRADAIDD
AIGRLYRERAKVRATFVLSFVGWVVGTAEVWLALRFLDHPVSWLDALLLE
SVGQAIRGAAFAIPGSLGAQEGGYLLLAPLVGLPPDAALALSLAKRAREL
ALGLPGLLYLHFSERNWQRRRAPLPIAD
>BTH_II1059 phage minor tail protein L
MTITADIQQLEPGRLIELFEVDCTEIGADVLRFHGHMQSTSIVWQGNEYK
PWPIQAAGFEQTSDAQQPSPTLRVGDINGTISALCVALGDLVGAKVFRRR
TLARYLDAVNFPAGNPTADPNEEMPTQQWRIEQKSDEQPGLHVEFTLSSP
LDFGGQQLPKRQIISICQWEYRGPECGYTGAACFDKDDNPVSDPALDRCS
KKISGCERRFGVNNALPFGGFLCDTMA
>BTH_II0868 hcp protein
MPMPCYLTLEGQTQGSIEGSCDIQGHEGKILVQAVEHTIEIPKSPQTGLP
TSKRLHGPMTLTKEIDKSSPKLSQALASGEQMKSVVLEFYRILKEGKEEH
YYTVKLENAILTSIRSWTPNCLVLDNKQLGHMEDVSFTYEKITWTYVPDG
IEAEDSWRAPKV
>BTH_II0855 lipoprotein, putative
MKTLCSMLARIAALIALAALSWATALYFDWPWWGAPAVFCAALAVWPLFG
VARRALRAVRARAQLARLDGAKRLPADRDAPLRRVVARWRAALDALERAA
PAPGLRGRPRDALPWYLVIGRAGAGKTTALARARIASPLRRARHDASLAP
TEDCDWWCFDDAVVLDLAGRFAEPDATDDDRRAWGALLEQLGRARARRGI
NGVVVAIDAPRLVTADRDALTIDGCAVRERLEQLIRLFDRRFPVYVLVTQ
CDRLYGFDEWAAQLAPEQRERAFGYLGDHDADAFVAHALEHVDARLAALR
IALAARGEPPSPRALMLPHELARLRPALEALARAAFGPNVYQETPYLRGL
LFSSGRQAGGAPSLTLPDWLDAAPARTPGDAGLFLHDVFARVLPGERDAS
RPVERPSRSRLTPRRLALAAWLFACVAAGLLMSASFVGDIRTVELIRRDY
PAHPRFTGELAHDAATLARIARVIADVERCDERRLVRRLARPIADATPVG
RLEAQLKRRYIAHYRRAIEPAADRLLFGEPDGASGARAADDGDIAVRIRN
LVRYVNLMQARRRGADRETLARMPAPALARARDGGGASRADGARSAGGMS
DTGDSGDTSDRTGRLSALIGALAVDRIAWSAPDDATLAARIAAAQAQLER
LAYRDPDGAWLLALPDASAPRDVTLADFWPAAAPRTPPPEPRDARVPAAL
TAAHRPAIDAFLDEMAQAVANRPKFAFHRDAFDTWYRARRIDAWRDFVAR
FPQGEQGLATQAQWRAVIDTIADRRDPFAALLARVDREFESVRDDALPPW
LRFVRTASRMLAPARVPAASGGLGAALGSISRSGGRALREALGGAPEQGR
RTLERDAALRDALVDYERRVAALAADALAGPGAAYRLAADFHGFGVDPSV
EASAMRAADDALRDVKRLAGERDVGGDVVWSLVGGPLHAVIAYVERQASC
ALQEDWERDVLWPLRRAATREDADDRLYGPQGAFWAFVDGPAKPFVRVGA
ARASAVDTLGYRLPFTDAFLPLVDDAAARRVAQARRADEQRARQQAAAEL
DERIASLGKQIDAIRAQTVRIEIVAQPTDVNPEARARPFETVLTLQCAPQ
ARTLANYNLRVSERIDWQPDQCGDATLRIALGGVTLTRRYAGPLGIARFV
QDFRYGVRRFTPGDFPDAKAQLERLGVRHVDVRYDFSGHDALLAHVERID
ALERARRDDIARQRRIASRQDDGASDSGAAAIARAGGFAANPANPANTAR
APSASSRSSPSPAPGASAPSGTPDAGLPRRIGACWGDPARDGGMRP
>BTH_II1099 Protein of unknown function family
MALPQDSVNMALFCDFENIALGVRDTKFEKFDIKPVLEKLLLKGSIVVKK
AYCDWDRYKTFKAAMHEASFELIEIPHVRQSGKNSADIRLVVDALDLCYT
KAHVDTFVIISGDSDFSPLVSKLRENAKRVIGVGVKNSTSDLLVANCDEF
IFYDDLAREQQRALAKRDARKAEAGAKRTADDDRHRRHDGDARKAEAISL
AVETFDALASERGESGKIWASVLKSAIKRRKPDFNESYYGFRAFGNLLEE
AQARGLLEIGRDDKSGTFVFRPIQTVAVEFAGAAEAAATVDDNHAGAKPS
GKKAHGKGRGAKKRVPEQMPLIVETGAEAHADADIEADAEIAEANVSAPR
DFRRERTAEAARDTHPAHEAHEPREERAAREPARTDESAEAQEAEAASPA
RSRTRKTAAKKAKGTKARAADTSPASPAVAAEANAAGEPPPEAAAEAAPA
KAPRKSAPRARRPRKTVATNES
>BTH_II1358 gp29
MGFAFICEGDTTTHGGRVVGCNTANLVYGKAIALLGDMVTCPRCGGIFPI
VSVKSGLNMTFGDRPVATDGDKTACGATLIASQGTATVAPTAGQGSPIGG
GKSVIAQARSAPNEPYRGRFQLLDDHTREPLANHAYTITSADGRTVHGQT
DANGFTSWLDSDEASSLTFTNSGASPA
>BTH_II1040 pANL56
MDITFDPTKNETNIAKHGVSLALAAQLDWSDVLSYVDDRRDYSEVREVGF
GVIGDRLYCVVFTQRGDSMHIISMRKANKREVKSYVEQA
>BTH_II0391 Uncharacterized BCR, COG1937 family
MSHTIREKQKLLNRVRRIKGQVEAIERALEEECGCGDVLQRITSCRGAMN
GLLAVVLEDHIRTHLVDAEAHDDHEGSASEQLIDVVHSYFK
>BTH_II0370 conserved hypothetical protein
MVLNAALSRLYVAPDNADVVSVIDTAASKIVSTVAPARLVTEMQYRGASP
NGLMLSADEHTLYATNLGTNDVAVISLAGASPAVTGLIPTGCYPSDLAVG
AANALSVVYTKNMPGPNPGNCMDSGRTVPCPVKSTPVKLVENQYIEQLSK
SSPMWMPAPGGKTLDLLTTQVANNNSVNAALTPNDITTMAALRKKIKHVM
LQGGGVRPAREHATADSAGRHQDGGARSDAQDALLGARDARHGFLGRGSR
RCSRVQQGALEGADERSRVSGARRRACRAAPARR
>BTH_II1146 Uncharacterized protein family UPF0029 family
MRFSPATLMTYTLAATYTRELEIRKSRFIAYAIPVENRDAAMAELQRLRA
EHPAATHVCWALLAGGQSGMSDDGEPSGTAGRPILEVLRHHDLDGVLGAV
VRYFGGVKLGAGGLVRAYTDAIAATLIDAERIERIAYARLAIEIGYPDEA
RVRRWIEQEGHDLVDSAYGMTVRLVIRLPATALDAARDALFDQTQGRAGF
PSAD
>BTH_II2019 conserved hypothetical protein TIGR00645, putative
MPRRCKRPGRRVTITRFFSKTLHIPASPLRPTMSAAPTDPRAARPARRKM
RPLPAVIFMSRWLQVPLYLGLIVAQAIYVFLFLKEVWHLLSHATGLDETN
IMLAVLGLIDVVMISNLLIMVIVGGYETFVSRLGVEGHPDEPEWLDHVNA
GVLKVKLSMALISISSIHLLKTFINPDQHTTHAIMWQVLIHVAFLVSALV
MAWVDRLTTHTHPQHFHEASTDPSAPREPAQQSA
>BTH_II0917 conserved hypothetical protein
MIKADRERLSVVSLIEKVARLREPAAWPDGTQSVKAIETHMSWVFLTDRH
AWKLKKPVRAPQLDFRSLAARERFCHEEVRLNRRLAEGVYLGAVPVTMDS
DGRLRPGGAGEVVDWLVEMKRLPAERMLDRALLHGSATRADARRIAQRLS
AFYRSLAPVRGDPALYRDDLRRTIDCNERALCRPMFDQPVAAVRAVCALQ
RALLDAEAGRFDARVRQGRIVEGHGDLRPEHICIDARIAIIDCLEFSKRL
RTQDAADEIGFLALECERLGAPEFARALLGEYRAASGDDVDDALVHFHQS
CRAMTRARLAAWHLREKAFRATPAWRDRARAYVALAQRHIGCCEQLWTAA
RVSAAAGP
>BTH_II0361 conserved hypothetical protein
MEAAMAMPRATAGELIDVRPLGDALPGSKSITLMRSDHLEVVRLVLPAGK
HIPEHRVPGEITVQCLEGIVKFGTDAGTQLMRRGDMLFLQGGERHWLEAA
ENASVLVTLYLPHGH
>BTH_II0880 Domain of unknown function (DUF323) superfamily
MRAPTSRHTFRAASAFRLAGFAIVALTLAFTATAAHAARFVRLPGGDFES
ALPQDVPGRSTPVHIDAFELQDTPVTVRAFAAFLRAHPEWRRERVARVFA
GPAYLADWADPLHPAPATPPDAPVTGVSWFAARAYCASEGARLPTWLEWE
YAAAADATRTDARNDPLWRQQILSWYEQPAARVLPSVGGAPNVYGVRDLH
GLIWEWVDDFNALFISGDSRTQGDPDQQRFCGAGAISIVRRDSYAVLMRV
ALLSSLTGADSTGSLGFRCARSISGD
>BTH_II1767 Bacterial protein of unknown function (DUF899) superfamily
MTEHSMPVSAAELVKRNTTRWPNESDAYRRARDALLVEEIELRRRVERVA
VLRRALPPGGEVTGDYRFEGERGPCDFAQLFGDRQTLVVYSYMFGPQRER
PCPMCTSLLGAWNGEARDIEQRVALAVVARSPLERLVAFKRERGWRDLKL
YCDLDGRYSRDYHAIGADGGEDPAINVFTRRDGTIRHFWSGEMGGWSADP
GQDPRGAPDPMPLWTILDMTPEGRGVDWYPSLDYPD
>BTH_II1592 conserved hypothetical protein
MSISIVRLGTPRADDEGVRIGTVRRPPRGVPKDEFASRDYYDVWLPTLSP
SPELVAEAQAAESDAEWKAFARKFRAEMKHGDASKVLDVLAVLSTTSNFA
IGCYCENEARCHRSVLRELLEERGASIRS
>BTH_II0865 Protein of unknown function (DUF1305) superfamily
MAAADGGAMPALEPVLLGEAKHFAYFQAIRLLRRIVRERREHHAGAASAP
AAPMPIHTRPNLALSFPDTDVERIDKADDGGYRVVANFFGLYGVSSPLPT
FYTEDLIDEAFKGRHAARGFLDVLHRALYPLLFDAWLKHRLSLRIVEERD
AHALRPLYALAGVDARIARDAGLPEHALLRHVGLLSQRPRSASGLRALLA
DAFAPAAVDIEPCVPQWLPIPDDQRTRVGARAHRLGVDARVGARMRDDGA
RLRIVLRDVPGPLFRALMPGGDAFRRLRFLVRLYLTQPFTVDVAIRVRAR
DALPARCGGGAWSRVGLDAWLGGPPAERAAAPQFRLPTSLFDQARPHHAA
G
>BTH_II0121 Protein of unknown function (DUF770) superfamily
MAISNSSQKFIARNRAPRVQIEYDVEIYGSEKKVELPFVMGVLADLSGKP
IEPLPAVGDRKFFSIDIDNFDERMKAMKPRVAFSVPNTLSGDGQLMVDIT
FESMDDFSPAAIAKKVDALSQLLDARTQLANLQTYMDGKSGAENLVSKVL
KDPALLSALAKAPKPAAQQAENRESKEKH
>BTH_II1897 Bacterial protein of unknown function (DUF879) superfamily
MNFLDHYNDELRQLRDAGARFAKEHPQVASALGLHPDAVTDPFVERLLEG
VAYLSARVQKRLDRECAEFAQQALGRICPLYMASTPAISTFAFHPDLGSP
DAFRGNTLPRGSLVAAHLPGRKLPVMFSTARDVTLLPLRLASVECSRSIT
GLPSLLSQRLASSHAVLRFRFEVEGGASIAELAREDEGFKPLHLSLAGDL
PRAYALHRAMLADTTAWYALVSTNRGDEVLTLPMSGIRLSGVDDAEALLP
EEFGGLPGLRLLREYFAQPTRLLGVYVDALAAIAAKAPSARAFELFFALR
DAPGDLVGDVDASQFRLFATPAINLYSKRFDPVPYDANQPEQWIPVDRMR
PAAHHLWALTEVFVCETNGRAHRARSVLETAGYEGHEGSIRYGMRREDAL
LVDGVRRDRFDPLASHDLIAVSMVDDTLDPDDVATITGRALVADRDWRPT
GLLDADLQLLDPTAVKRIECLWPASAPRGKPSVDACWEAVSHVGRNPLAL
HASQQQDVTARIVEQLLLAIDKDDALDRQRLESLRSVLLRSRFVAAGRAT
PTALVRATQVEIDIAESLHADRGGWLFGRLLAQALAEAATLNDGIEIVVR
LDGEAASTHTNVAARDGRLA
>BTH_II1282 serine/threonine kinase
MTLKLADNPNRLSDRAAMALPDDLFVARTSATVFSTHVDLLRMTSAQWIA
IAESPDAPFERRYAAGTLLGFAGDPRIRALDPPMCDVPAARARLGTEPAR
VPQVIAEWARVGVIDEWIEKECPRHTVELAAYRMMRYPVTNLEYRRFLEE
TGAPWLPTSWTFGVYPLERANHPVWSVPAEAADAYATWLGEATGRRFRLP
TEAEWEYAASGGADVEYPWGDAFDPRAANTVEAGPLSTTPVGIFPLGRSA
FGIDDLAGNVEEYVADDYRPYPGGEAIADDLAVTQGGYRIARGGSFTRFG
DLARCARRHGRYQRDIYAMGFRLAESR
>BTH_II1894 Rhs element Vgr protein
MRLIELRSPLLDPDAVALSFVVHENLSQEPSYQLDLLSHDPDLDFDALLG
STLSADIDLGGGDIRTFNTHVFGGHDTGQMSGQYTYTLELRSWLSFLAEN
RNSRIFQNMSVPQIVEQVFQGHQRNGYRFELEGTYEPREYCVQFQETDLN
FVKRLLEDEGIYFWVEHEPDRHVVVISDTQRFEDLPLPNETLEYLPDGEE
SRAIQGREGVQRLQRTRRIKASNVALRDFDYHAPSNKLDSDAQLVSPPNL
EGIPLEYYDYAAGYREPEQGERLARLRLEAIQADSHMLVGEANARALATG
RAFTLIGHPALGRNRRYYVTNSELTFIQDGPDSTSQGRNVAVKFRALADD
QPYRPLLTTPRPEVPGIQSATVVGPEMSEVHTDKLGRIRVHFHWDRYKTT
EADASCWIRVSQAWAGKGWGVIAMPRVGQEVLITYVDGDLDRPLVTGIVY
NGENPTPYDLPKDIRYTGLVSRSIKRAGGYQNASQITFDDQRGAERVMIH
AERDMQQTVERNSSTSIAQDLNLSVKGTATSVVGISVSFTGISVSYTGLS
VSFTGVSASFTGVSTSFTGVSTSFTGVSTSFTGVSTSFIGVSTSFTGVDT
GFKGVSTAMIGVSTSVVGSSNSVTGVSNSMTGISSSWTDVSMSTTGQSQS
MTGVSLSYTGTSNSMTGTSTSVTGTSTSITGTSMSNTGSSTSITGTSMST
TGSSTSITGSSVSTTGSSVSTTGSSVSTTGSSVSTTGFSFSYTGASYSDV
GIDLKKVGMQVKS
>BTH_II2078 Uncharacterized ACR, YkgG family COG1556 family
MSARDDILGRIRAALGGERASLAAQFPAPARTAAAADASDAPRARGTLDS
SAASSGSPAAARGPAISLGDVDLVARFVAKAQQVSATCAHIATAAGAPAA
VDAYLRDVGLADAPLVVAAALDALPWRTHRALPGVDLRRDGLVSVTPSFA
AIAETGSVVCLSSSATPTSLNFVAATHVVLVNRSAIVATMEDAWARVRAT
IATLPRAINVITGPSRTADVEQTVQVGVHGPKRVLVLICDDA
>BTH_II0870 Protein of unknown function (DUF770) superfamily
MTTNGSVAPKERVNIVYRPATGDAKAEVELPLKVLVLGEFSTADDKPPIE
ELAPVNVDKDDFNDVMKAQRLTLALSVPNLLDDQAGEDDRLSVALGFESI
ADFSPDAIVETVPELKQLVALRDALKALKGPLGNIPGFRKRIQEIVADKG
ARKQLLDELGLDEQ
>BTH_II2000 gp29
MGSALIREGDTTSHGGRVLAGTSTNIVYGKPLALEGDMVSCPKCGGIYPI
VGVRNRSMTFGDRPVATEGDKTACGATLIASQGTATVELTSGAGGPVGKG
KSVVPRPAAQSNEAYRGRFQLVDDKTREPIANHPYTVTSADGQTIQGTTD
ATGHTDWLSSHQASSLSFQQPGSDA
>BTH_II0123 Protein of unknown function (DUF796) superfamily
MSHDIFLKINGIDGEAEDATHKGEIEVLSWSWNVSQQSNMHLGSGGGAGK
ATVDDLLFEHYIDRASPNLVQYCLLGKHIDEARLVVRKAGGSPLEYIKLT
MNDVLVTQVSPAGVAQDESRPRELVRLSFSRLKQEYVVQNAQGGSGGAIT
ATFDIKKNAA
>BTH_II0862 pentapeptide repeat family protein
MKIVKPESLALLCRTLRFEGIDRLSIGALACFALRADAPAGPGDLAPEAS
LWQVARQWLGEHAPLDDGLPKPSGEFLVYGDACAPPGRDRAARAPFAVRA
RIGAACKERLVDARDAAGRALAEFRALPPSHPERSRDLGPFDERWLAARW
PHLPAGTRAEHFHTAPRDQRIAGFWRGDEDIELVNLHADRPAIAGALPRV
RARCFVERWVGGVARIDACPMRAETVWLFPGAACGIVLYRALVAIDDEDG
DDVVRVIAGWEHADAPPLPDEAYIGRPAPEDEGSRPALAPAAAPAAIADD
DARADAGDAADRAPGAPASAAHAHSPAAPESSAEPPAPDLSALERDAAAL
AAQTDALLAAAGLTEADVARLLPPRDAPADMTLDELTALAAELDARTAQW
QAQYDAAAAERDEASSPASPNSAAAHDASLADLLRQADAQIRALVDQHGL
SRAQMEAAARDRPELAALADALDALDAPLDIDALTAGLAAPAGDEAIVEP
DAPAGPDRPAGADRPADGAPASMHAAAPSAGDAPPAEPLTREQVIERHAR
GLGFAGLDLSGLDLSSAALERADLRDARIERTCFAGCRLRGASFERALLS
RADFSNADLREATFVDASAPGASFRGAALDRARLAHADFTGADFTRASLA
DGHCAHARFDESAMTQLAAARLDGAHASFAGCALDAADFTSARMPRANFQ
HATLTAATFAFAQCDGAEWYGAQASGAQLRSASLRGSRADASTSFRQAVL
SGAALDDANWDGVDLRYANLHKATLDRASLARAIASGAQLTLSLARRADL
TKADLTHADARFSNLQGASLRRARLDGTQLQSSNLYGADCYGTALGRSQL
AGANVERTLFVVPGRPELASSR
>BTH_II1314 Uncharacterized conserved protein
MDLTRLTRHGEFEWHIPATGAMRVPGVIYADRKLIADMDDKVYEQVCNVA
MLPGIVGASYAMPDAHWGYGFPIGGVAAFDAHAGGVISAGGVGFDISCGV
RTLHTGLVRDDIDAVKKTLADALFAHIPAGIGSTGRLRLSAAKTDDMLTG
GAVWAVEQGYGTPSDLERIEEGGMVRHAKPSMVSALAKRRQRDELGTLGS
GNHYLEVQEIEDIYDPACAQRYGLQRGQVVVTIHCGSRGLGHQIGTEFLK
AMVIAAKSYGIALPDRELACAPILSDLGERYLGAMRAAINCALANRQVLT
HLTREVFAKVLPAAQLTLFYDVSHNTCKVEDHVIDGRRRQLYVHRKGATR
AFGPGHPALPDALRDAGQPVLVGGSMGTASYVLAGANAPGGERAFGSACH
GAGRAMSRFAASRRWRGRALVDELAARGIVIRSLSDRGIAEEAPGAYKDV
GAVVDAAAEAGLARKVARLAPLVCIKG
>BTH_II1769 glyoxalase family protein
MPTAVKPIPEGMHSLTPHLICDGAAAAIEFYKKAFDAVEITRLPSRDAGR
LMHAAVRIGDSTLMLVDESAQCGALGPRALKGSPVFIHLYVPDVDAVVAR
AVEAGAKLTMPPADMFWGDRYGQLEDPFGHRWSVATHKRDLTPEQIREGM
ENCVPPGQ
>BTH_II0857 Bacterial protein of unknown function (DUF876) superfamily
MDNVYWHQGMLLQPQHFQLAELHQQFRIEPWLASAPPHFWGVGALSIAQA
AIDRRVVEIRSAQLLFSERSYVEYPGNAVVAARAFDPAWLDDGRALVAHV
ALKRLARGANNVTVAASPDALPDAPTRYATLPCADEIGDLYSDHPGAPVR
TLKHVLKIVFGHELDALAGHETIPIARIVRDGERLRLDDDFAPPCYAVSG
SRALLDRVRCIRDELAGRARQLQQYKNPREMQRAEFDASYAAFLLALRSL
NRFGPLLFHLAECDRLHPWTVYGVLRQLVGELSAFSERFDMLGETPDARG
GLPPYDHLDLGGCFSRAHALIGHLLDEISVGPDCVATFEPDGERQPAQRS
AQLPPDVFADRHLIYLAIRSAHDPDTLAQRFLLGGRIAATDEMPQLAALA
LPGVELTRLPGAPPRLPRRGDARYFRIEQTGRPWDAIRRDGRVSLRWADA
PDDLHAELVAVRHA
>BTH_II1900 Protein of unknown function (DUF877) superfamily
MQQLESSAEKVVVDQNNVNEDLKDILRRSFRPRTNEAAEAVQNAVETLLT
YARRSRVVVREDVAQTIEQLVAELDKKISEQLTLVLHNKRFQSLEGAWRG
LHYLVSNTDTSENLKIRYLNISKADLGKTLRRFKGVVWDQSPIFKMIYEQ
EYGQFGGEPFGCLIGDFYFDHSMQDVSILTEMSKISAAAHAPFIAAAAPG
LLQMDDWSELSNPRDVSKIFTATEYAFWRRLRESNDSRYLALTLPRFLAR
VPYGPKTQPVEEFGFEEKVDPNRAEDFCWANSAYAMGANITRAFKTYGWC
TKIRGVESGGAVEVLPKFVLPSQDREVDLHCPTEIAISDRREHELSESGL
MPLVYRKNSDTAAFIGAKTVHRPAIYEDDDATANSNLSSRLPYIFATCRF
AHYLKCIVRDKIGSFKSAEDTQRWLNDWLMNYVDGDPSISSEVTKSQRPL
SAAEVVVDEIPENPGYYRAQFFLRPHFQLEGLTVSLRLVSKLPSTKQEVT
T
>BTH_II0926 conserved hypothetical protein
MASRSAVAILCLLVWLPARAATVAITVVDASNGRPLAGATAYASGIARTS
AADGTFAIPAAAGDTATAMSVTAPGYARAEVALHEADAARIVRMTPVRPK
AIYLSASGVANRALRDAALALPGKTAINAIVIDVKGDDGATPYRSAARRS
VGAAAERRAAAHAIDLPALVRALHARGLYLIARIVVFKDDPLAAAHPDWA
VRDAAGAVWRDRERQRWIDPTVRAAWAHPFDLAEEAARMGFDEIQFDYLR
FPDANGLRFGEPNTEANRVAAITGFLAGARARLRPYNVYLSVDIFGYVCW
NSNDTHIGQRIETLGRLVDYISPMLYPSGFTWGLPGIRKPTEQPGAIVGR
SLAQAQRRTGLPGVRFRPWLQAFRDYAFDRRTFGADEIREQIAAAEAAGT
DGWMLWNPHNRYDYAALPH
>BTH_II0125 Bacterial protein of unknown function (DUF876) superfamily
MVPGIHAKDEDQVERRCRAANRVDYRIGMIAMSWHNKVVWNEGLFLLPQL
FQQQERYFEYFAHKRAAVLSPFFWGFSRYEIDQESLSFGKLVFKSGTGIF
SDGTPFDVPGHTPPPPPLTIASEHQDQVIYLAVPLRLPNTEETAFDEQAG
SLARYSAFEIELRDSNAIGQGPKPVQLANMRLRLLPEKELTQSWIGIALT
RVKTLHADGSVALHDGDHIPPVSQYGANPLLREWATQLHGLAKLRADALA
TRLSGSDGRAGAAAEVADYLLLQVLNRYEPLLEHICRIREMPPVTLYREL
SMLAGELSTFVRPQTRRPRPTPGYDHAQLYASIRPLVDEVHYLLNQVLIR
GAQPIPLTEQPHGIRVATMLPSELAGYSSLVLAVGAQMSPDVLQQQFASQ
TKISHPQRLPELIRSHLPGMTMIPLPVPPRQIPFNSSYIYYELSRTGPFW
EQIAQQGGLAMHIAGHFPELKLELWGVRHK
>BTH_II2255 MlrC C-terminus family
MHSRRVSSDRRRPEMPFAIQESDMNILIAGFQHETNTFAPTRASYRSFVL
GEGFPPLVRGGGVLSLRDVNVPIGGFIRAAQANGHALLPVVWAGACPSAH
VMSDAFERIGGEIVAAAEAGGFDAIYLDLHGAMVTEQFDDGEGELLARVR
RIVGDRMPIVVSLDLHANVTARMAAHASALVAYRTYPHVDMAQTGERAAQ
VLERLAAEARPLHCAMRRLPFLIPINGMCTHAEPASGAYRLLAQLERDGV
VSMSFAPGFPAADFPECGPTVWAHAFEADAAQRAADALFAKLVGDEARWS
VPFLAPDAAAAEAIRLSRTATKPVVIADTQDNPGAGGDADTMGMVRALLR
NGADDAAVGLIWDPDAAAAAHGAGVGARVSLRLGGRSRVRGDAPLDAVFE
VEHLSDGRFRFDGPMFNGAQGDLGPVACLRIDGVRMAVSTNKMQTFERNQ
FRVAGIEPERMRIVVSKSSVHFRADFEAIADAILIAKSPGPMAADPSDLP
WARLDPDIRVRPNGPTFGALRAAAR
>BTH_II0867 Protein of unknown function (DUF1316) subfamily, putative
MARAEGVSARLPARAPARAPSGERRLLERIADREAGGERSPSADALARSI
IDHLRRILNTRQGHVPIDPAFGVPDFTNLAGGFAQGSAREIEAQIERVIA
CYEPRLKSPRVTLAERALDAATLHFSLDARLVLDAREVPARFLTTVSGNG
KIDIRTIS
>BTH_II1062 host specificity protein J
MKKLHAERGLKRIYGAKGGGGGGGSSESPDSLHSIARAKVLDVISAGPIV
GLVNGLQSVYLDGTPIQNADGSLNFQNYTVDVRTGTQDQDYIPGFPAVER
EAGVGVPLTSDAPWVRQIQNTQLTAVRVRFGVPALQRQDTSNGNITGYRV
DYAIDLSVDGGSYTQVLAGAFDGKTTSLYERSHRIELPRAKNGWLIRVRR
ITPNAHTATIADAINIEAITEIIDRKLRYPMTALVGMTFDARSFSSVPVR
SYHVRGMIFRVPTNYDPETRTYSGTWDGTFKAAWTNNPAWVYYGLLLDKL
NGLGDRVDASMVDKWALYAIARHCDELVSDGKGGKEPRFTCNCVIQTKAD
AFKVVQDIASVFRGISYWGAGSVVASADMPSDPVYLYTAANVVGGSFKYV
GSERKTRYTVALVSYNDPTNQYKQAVEAVQDDDGIARYGVIKTEVTAFGC
TSQAQAHRLGRWLLLTSRYETGTVSFQVGLDGTLCAPGQVIAVADPKKAG
RRIGGRIRAAAGETITLDKAPTIAAGDRFTAILPSGIAQARVVKAVNGDT
VTLAARFDADPVPGAVWMVESNELAAQQYRVVSVQESDDNGQIVYTINAT
QYEPGKYAAIDDGAQIQQRPITIVPPSVQPPPSNVRLSTYSVVDQGISKT
SMVIAWDAANHATSYVAEWRKDNGEWVRAPSTGGLQVEVPGIYQGKYLAR
VRAENALGVTSIPAYGVDTQLTGKTTPPPSVVSLTAAGIVYGIDLKWAFP
GDGSAGDTQRTEIWYSRTPNRDDATKFSDFAYPQASTSYQGLAVGQVFYF
WARLVDTSGNVGPWFPAKGPGVQGQPSTDQSDYEKYFAGQIGKSALGTEL
RAPIDLITPPMAGDATIYAGDERLNAGVWSLQAAIAEGDMAVAKKVETVA
AQLHSGSNLLNAAVQKETIARVEADRAMAQDITTVQAQVDDNVAAVQTVA
KSYADLNGRVAASYQIKVQTTADGHKYMASIGVGIDNENGVVESQVLVSA
KRFAVIDEDGSGVIGAPFVVQGGQVFLRQALIGAGWITNAMIGSYIQSDN
YIAGRQGWRLDKTGWFEINAADGSGNRLVMDGSSVRVYDGNGVLRVRMGM
W
>BTH_II1893 Rhs element Vgr protein
MRLIELRSPLLDPDVVALSFVVHENLSQEPSYQLDLLSRDPNLDFDELLG
STLSADIDLGGGDIRTFNTHVFGGHDTGQMSGQYTYTLELRSWLSFLAEN
RNSRIFQDLSVPQIVEQVFQGHQRNGYRFELEGTYEPREYCVQFQETDLN
FVKRLLEDEGIYFWVEHEPDRHVVVISDTQRFEDLPLPNETLEYLPDGEE
SRAIQGREGVQRLQRTRRIKASNVALRDFDYHAPSKQLDSDAQIEQQSLG
GIPLEYYDYAAGYRDPEQGERLARLRLEAIQADAHALGGEANARALAVGR
AFTLVGHPALSRNRRYYVTNSELTFIQDGPDSTSQGRNVAVKFRALADDR
PYRPLLVTKRPRVPGIQSATVVGPEMSEVHTDKLGRIRVHFHWDRYKTTE
ADASCWIRVTQAWAGKGWGVLAMPRVGQEVIVVYVDGDLDRPLATGIVYN
GENPTPYDLPKDIRYTGLVTRSIKRAGGIPNASQLTFDDQHGAERVMIHA
ERDLQQTVERNSSTSIAQDLNLSVNGTSTSVIGIKVSFTGISVSYTGLSV
SFTGVSASFTGVSTSFTGVSTSFTGVSTSFTGVTTGFTGVSTSFVGVDTS
FTGISTGFVGVSTSITGSKNSVTGVSNSMTGISSSWTDVSMSTTGQSQSI
TGVSLSYTGTSNSMTGTSTSVTGTSTSITGTSMSNTGSSTSITGTSMSTT
GSSVGTTGSSMSATGSSVSTTGSSVSTTGSSMSVTGFSFSYIGASYSDVG
IDLKKLGMQTKN
>BTH_II0895 conserved hypothetical protein
MVERSENRELVSFWNELVSLYQLPDEFRPSKIHSDPVCARSDARRARRRN
DENPRKNPMNPNLAALLATVSDQPTIDELMRLMHVPACFERCEPGVVTRA
ELAQALNMALFADLLERVPTGRAYTRDVARDGGRVTFDHGALRTVRWPSC
GALPPGEAAFTRILRPLGYRLNGTYPLERLRMTGRSYAHLDAPDQIAQFF
VSELHPERFSAAFQAAVTNVLGTSADPLTPQAAATLAELERDASLPLADA
RALMPVLVRCFDRQHASPALADYETLLAESAEMAWIATEGNAFNHATDRV
PDVDALAAAQRALGRPIKPAVEVSRSGRVRQTAFRADPVVRAFVDASGAL
VEREVPGSFYEFITRDAVADAETGRRALDLSFDAGNATGIFKMTAAA
>BTH_II0863 Rhs element Vgr protein
MSSSHRHYADTALADVAALTDAASRADAAPLANARRFTFASTAYDAATFD
VVDIDGRDAISQPYRFEITLVSRSVRIDFAKMLSCEATLAILPPFGEAGT
TRYAGVLAEFEQKERFRDFTVYRAALVPRLWRLSLYKASDVYLNEQTIPD
IVKRVLRAASFGKRNFRMRHRGVYRKRSFVCQYDESHLDFVSRWMEKEGL
YYYFEHDGRREKLEIVDDRRDQPGPADDLALRYLPATCLDAGIESDRVQA
FACRATPLPREVVLRDFNHRKAELSLEVREHVAHDGVGERVSSDEHFHTK
DEGRRYAKLRAEALVCEGRRFAGESTAAGLRAGRFFALSGHYRKDFDGRY
LVTAVTHRGSQAHLLFPDLDAPFGATPGEPVYRAEFEAIAANLQYRPPRT
TPKPRAAGVVSAIVDGEGSGKRAELDEHGQYKVRFPFAHTAHPTNKASAR
IRMATPYAGDDRGMHLPLLKRTEVKIAFDGGDPDRPVIVGAVPNSSHRSV
VTRSNPDAHRILTEHNQLYMKDGSGAATWLHAPNNHIGIGAVGPGDGLAL
LTSGNKFDFSLGNAYSFSGGLKCSVSMGGNTDIYVGVRNSLDVSANFLTT
LQGNLRWMLPGSRSFEINDSASTLLQTLHKQSATGAIRLSAGQDASALLQ
KQLDKLKGTVRKFMIVSGLANAGAAATAAGLIKGGGALADLPWAGFGVSA
AQFAGATGFSTALMATSRTLLSKIAKLQEALPLVADLSLDKQGIALAAKN
LTHATRMSLTVDGVSWSTHAKGPGAAGAAMSVGKGRWGVEAAEHAHVHAN
DTLLFAVPADPTSKFDLKELIGLRRDLDECVKGIADLEADISENEVLSTD
QNTFGVGALVPTPPSPANAVAAVAIKAKEAKLVELNAKRKLVATKIDNLQ
QKLAKHAKNLSAARMSASDAEVGFKGNRLVATAEGVTLAHAQGKAKLDVR
EAKIGVEAGKSSLELDESKLAAGCGGASLKLGSDGAIDVRATNVKLNGSA
SLKLDGQLIQLG
>BTH_II0312 Protein of unknown function (DUF636) family
MRFARCPPIVETGRSPCDPIAGAPHARDNGASRRQLRAVSIYCAPTAGET
MSATRSLRCLCGAVGVKLTGEPAARAHCHCMACRDFYGAPMLSATAWPAG
QVIVAEGDVASFAHPTRRLSRAFCATCGETVFGTNRLGMRVVPNAIVARA
AGGELPAGLRPTMHLFYRHRIVDVRDDLPKYLDGWDGPTDDA
>BTH_II1642 glyoxalase family protein
MARARIRFRARRCHQRDAQGRIAHAEMAFGGGIVMIGASGWRDFAVSPES
LGGGNAQRVHVQLRSGIDAHCEPARAAGAAILQAPAGQFYGDRTYCPCGP
EGHVRTFGQTVRRVSREDAGRHGGLSIEGGR
>BTH_II0706 Uncharacterized ACR, COG1434 family
MTLALLTAIVFVAALLRLAKRRRASLALFVASAAAFFAIGCGPVPAWLLR
ELQAPYAARPAIAWGERNAIVMLGLATEKIAATGAVEPGTFSYSRVVEAA
SLYRDCRRARANAGCKILVSGGDARRNGAPEASIYRDALIGLGVDAADVL
SEPRSMNTWQNAQFTRAVLDGYRADRVLLVTSGVHLRRSALYFAHFGVAA
IPVRAEYLQAVTFPLPLAYNFSVADLALHECLGIARYRLYDALGWNPART
QPGDA
>BTH_II1385 YceI like family protein
MSVAPVPDARGAPGAPPGTPRLFVAAASAEVVRDAEPAGTLDASPHAAPR
YRLDPRHSGVTFRVDNFWHAHLTMRFTRMRAELAGIDDDGLASRVDVTVD
AASLGANVPFVAALLKGSAMLDVARYPEIRFVGTRFERTGATEGRLTGDL
TIRSTTRPITLAVRFAAGQPGTGAREGVERGAARPRAEWGQRESGSRDAR
TLAFVADGHFSRAAFGLSRWLPAVGDDVRMRIRAEFVRERAEP
>BTH_II1937 membrane protein, putative
MRRIRYPLCRSLRQIIRVDLDSTSNTASPASPVPHSAMQGFKRKIVYVTC
FELIAIAITTTGFSLLTGQAPAHASIAAVASSAIAVAWNLAYNTAFEWWE
ARQTRRGRGLLRRIAHAVGFEAGLVVMLVPLFAWWLDVSLWQAFVLDLGL
IAFFLAYTFAFNLAFDRLFGLPASARPAPPEASSS
>BTH_II1601 conserved hypothetical protein
MRYCLALDLKDDPDAIARYEAHHERIWPEVAAHLRAHGVVAMEIYRLGTR
MTMVMETDDTRFDAARFDADARADAKIVEWEALMSTFQRPTPWTPAGVKW
TPMARIFDLSKQ
>BTH_II0074 chromate resistance protein ChrB
MLRELADAIAESGGAAHLLRAPSLDTSQEAELRALFDREEDYASFVRGLA
QARKTLAGQSATELARLLRRLRKDFEAIRAIDYFPDDAATRAELAWQDFV
ALVDTVLSPGEPHAAERAIRRLAIDDYQGRTWATRQRMWVDRVASAWLIR
RFIDARARFIWLASPSECPDGALGFDYDGAAFTHVGERVTFEVLLASFGL
DKDPALLRLGTIVHALDVGGPGVPETVGFEAVLAGARRRAENDDRLLEQM
SDVLDSLYAHFAANEAGETGERS
>BTH_II0258 Protein of unknown function (DUF770) superfamily
MTSKYKASASGQKFIARNRAPRVQIEYDVETYGAERRVQLPFVMGVIADL
AGKRAEPLPDLPERKFLEVDVDNFDERMKSIAPRVAFQVPNTLSGDGMLS
VDMTFESIDDFSPAAIARNVDALRRLLEARTELSNLLSYMDGKHGAEQLI
ENAINDPELLKTLVRQPLAASADGGAPAVADTGTADDSEIRHE
>BTH_II1899 Protein of unknown function (DUF796) superfamily
MANALVDYFLQIDGVEGESTDSQYPGLIQIQSWQWAEENSGRWGFGSGGG
AGKVEMKDFEFRMVSNKASPKLFLMCATGEHIQNAKLICRKSGKGQQEFL
TISFASGLVSSFRTLGNMPISQLGHASGEVDGVLPTDQIRINFAQIEFEY
REQRNDGTMGAVIKAGYDLKQNAPI
>BTH_II0092 PAAR motif family
MRGVIRIGDSTSHGGRVVTGREGSTVMGRAVACVGDRCTCPMNGHEHCVI
VEGDEGVRIEGRAVAFDGHRTSCGATLISSIPTSGRT
>BTH_II0040 conserved hypothetical protein
MHSTTTAFTHRGYLLNCAPARASDGSFQPYVVISRSSDGELVANRFFPSD
LHFNDEDAAVAHARDWAVRWIDASSLTI
>BTH_II0937 conserved hypothetical protein
MTVRQAGTPITDDDLKQSAIDSGLMVARRQCPAEKADMVACRLFPSLPFF
FDGTNNNMDRDVPLNKHSNVAKLFRITKDSIQSDVRRTYIPGVGTPFKFE
KVAGYTDRLNDDGGGVLGLGLGTGGDLRIKFALAEFSRLLEVEWGPGSWK
HMREVTVAIFGFSRGATQARAFARRFIEQKCGKDGGRLYWAAPSGMRVPL
RITFMGIFDTVASVGGPALHLDWASELAIPAEVERCVHYVSAHEVRRAFP
LDSVRVDKSYQGSCEEVVYPGVHSDVGGGYGPEEQGRVHDLSLIPLRHMF
AEALRAHVPIIPLDQMPRNIRKDFDLSDDARIVGLYTEYMATLPSASGDT
LEALIQPHRYLNFRWRSVLARHHADERVLGRLYAKVGESFCRTVPVGTDA
DHSACRPNEWVYDVPKDPQEQATQLLREQRLLARHIEFLRYPIERRPGPQ
SYPPTPRELTPYEKMILSAWDEQAPPSLVVDQLLAEYVHDSVAAFTSWPC
ALWDQRGIWCDQRRYLAENDPMHAGDLAVA
>BTH_II1928 probable transmembrane protein
MTVSQTCLLITVLMPFVWTMCAKSSNRYDNREPRRYLGQLEGWRARAFAA
HQNSWEALALFTAALVVAWHNGANVQRVDQLAIVFVASRVLHGVLYLLNW
ATLRSLAWTVGLVCVVWLFFAAP
>BTH_II1413 DoxD-like family protein
MAGRSRQGGGASRVTRIARRLGSQPARARCVARSAHGARIRLSTTMRQTF
LAPQKDLLLLLSRILLVILFVIFGWEKLLNFSGTVQFMGAEGTPLPSIAA
VVAIVMEFFVGIAILLGFCTRPLALLLGLYTVGTAFIGHHYWSMPAAEQM
NMMIHFYKNIAITGGLLALCAAGPGRYSLDRG
>BTH_II0861 pentapeptide repeat family protein, putative
MSDTVHAALAALTDTRTLSGVDLSDADLSGLDLSGCTLHRVILRGANLSA
AQLDATRWLHCDLTGARVDGATLGESSWHAVALRGASLRATTGDAFAMTD
ADLGGATLTDALWARATFERVDFSAAQCARAKLLRCEAADCRFERTDFSS
AELERFSAMRADLSSARFDATRLTNALLCEADLRGQRFARCDLTMTHLNG
ATLAGSDFSGTSLVQTMFFAADLEGATLAGARGRHVRFADATLVGARLAE
AVFDECDFARARLSSANARGLRARMSLFSHADCAGATLAGGHFVYCDFSH
ATLSRADCTDADFSHANLHGIDDRAARWDGACKTGACATDPTLALAERWT
APER
>BTH_II1902 ImpA-related N-terminal family
MNDDALPVHSTDLLDFDEDFIKIDAAICEYDSVGHAPQRKGESAFQWASV
ETACLALLKKAKDVRVAIWYLRACIARRGLSGLADGVRLLAELMSAPVEE
LHPRALPDESPGETLLIHLGWLAGPQFLHQLGSSRFEDRDATLNDLIGGR
AAAIVEDRDYCVRANTLVHDIKDSLSRIRESVAVVEQELNLSRSLDLLSV
AASRLARTDAGRAESESVAGEKPAHAEAGFSPAPDPQQPAAGPGGALRSR
QEVGVALERIVEYFRLHEPSHPAPIFLSRIQRMLGAGFEEVMAELYPEAA
SLVAQLNRPQSSIK
>BTH_II0077 conserved hypothetical protein
MAEKKRRIHAALLAAAIAATAPACHAQNATDWLANTYGTLAAHVGNAARS
MWVAPEGVIYTASMWDEYEGGVAIYQNGRSVGSIGTHAEFQGGAITGNAT
SVFAALQYDKSHGSGAVGRYNRATKTRDLVIQVSASNNQPRVDVVTGLAA
AGSLLYASDFYGNRVRIFTTDGIWQRDIGVSGPGALAVDRAGNVWVARKS
AGAIVEFSPAGALLNIIRMPGGSQPSALYFDASSGQLMVGDEGPDMNIKS
YRASGTPALVGTFGIRGGYLDATTGIKGQVGAKRFTRVAGIGKDSAGTLY
VLNNPWGGSWDLGRNGGTDIHAYDSAGNLQRTLQSLNFEGVAAPDPATDG
ALFYGGTNIYAGGAGGTFVANTVDPFSYPSDPRIDMNDTQRDEHFGQLVA
VGANRILVVSGQNPPVFHFFHFNQANGYVAIPDASIPGPAFNTTQRVTGG
FCIDGNGGVWAGLDKTGSIYHYPLTGFDAGGKPAWGAGIPIRIPASVQPL
TRIVYLADSDTMILAQGIVGSADWTAIGTRIEVYHGWRAGNTTAPDPVIN
LAGAGAKSIDAAGNHLFVGYWFSGSGPARPNIDAFNLATGKLDATLVNTS
SGTVDASSAVDSMYGVRAYLRSTGEYVVTKNNVKGNSITVYRWKP
>BTH_II1458 4-carboxymuconolactone decarboxylase
MSERDREQGKARRAQVMGDAFVERAMSNLDGFSRPLQDWLNEHAWGSTWQ
RGGIDLKTRSLCTCAMLAALGRGHELKGHVRGALNNGATLVEIREVLLHS
ALYAGAPAAVEAFRNAREVIDALGLDMPDDGA
>BTH_II0129 Rhs element Vgr protein
MPNHFSNGRTNQSRTVVIRSGAMPRLLGQPALKFLSLRGEEHLGKLYTYE
LLLRTPDDFHVPLATSANLDLKAMIGTEMTVGIQLDGIGTGAQGGVGAGA
REISGLVVKAGFLRSEGRYNVYRIELRPWLWLATLTSDYKIFQDKSVVEI
IDTVLHDYPYPVEKRLDIDKYSVAGESARNEPRAFQVQYGETDFDFVQRL
MEEWGIYWFFEHSDNKHRLVLCDHIGGHRKAPSEAYHEIAHHPEGGKIDI
EYINYFSTDEALRPGRVVIDDFDFTRPLASLVTSNHQPRETNWGEGELFE
WPGDYTDSKHGDLISRVRMEERRATGSRAYGRGNVRGLACGHTFVLAKHK
HDDANREYLVIESALMLTEVADETGSGYRYECDNELVVQPSNEVFRMPRE
TPKPTTNGPQSAIVVGPPGHEVWTDEFGRVKIRFLWDRYARNDATDSCWV
RVSQAWAGVNFGGIYIPRIGQEVIVGFMNGDPDRPLILGSLYNTITPPPW
DLPGDATKSGFKSKSITGGRENYNGIRFEDKLGAEEFHMQAEKDMNRLTK
NDESHTIGANYSIGVGITHTRAVGAMFSSIVGGAASYAVGGAESTMIGGA
YALNVGGAHAVAVGGASSVSVGGAYARNVGGAYALTVGGVLSIVCGASSI
TMTACGSIKIVGKNIRIIGSDEVVVQGAPLQLNPGDSDCGGGGGGGGGGG
AIPPIPLPSFFLDITKPILPPPPPPPTEVPPEPTPTPEPTPTPEPTPTPT
PTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPT
PTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPTPT
PTPTPTPTPTPTPTPTPTSSEI
>BTH_II1898 conserved hypothetical protein
MSDLQLYNKLSRRIRRHSLQEVVADHLVDLMNHAIRGARMRIADDSPAAH
SVLNFGCPPMQMAGATKINPVHAAAHICEVIRRFEPRVDPVATTVKPRTE
SRKRLAQTIYFDVSMKAREDGAELRASLALDYLSGYFSLADDH
>BTH_II0933 Rhs element Vgr protein, putative
MPTTGRSTATCIPCADGGLTSWQIGFSSALHFLRHRRDEHFWLDRDAQEI
LSEVFNRYPPLQGAFRFELSGALAKRSYCRQSETDWHFVNRIMEDEGLYG
YWIHDEREQKTTLRIVDRVEALPAAKPIDFYRGNAGDEIGGFTQWATLRQ
LNSVHVASRSGDYKRPSTPFEVRQSVQTTRYVEQTNWRTQEQKAIPYPPL
EDYSAGAYRYPDSDRGAAWARIRAEEYESRSRRYAGVGGSRWIDAGGRFV
LNDHPAHAESDPKAREFVAMAARWTIENNVSIARSSRHFPYSLQADIERA
RSGFGSAFAVAPHPQDGATGLYVIEVEAQRTDIEYRSPFEHRKPAMSVEM
ATIVTPNGEDVWTDPLNRVRARFHWDRQSPPDAFETSPPLLVAQSDTGPQ
YGGVHVPRRGETVYVDFVGGDCDRPYIVSRAPGGATPPMWHSDGLLSGFQ
SREYGGGGGYSEMQLDDATGQVRARVLSKTRGDYSHLTLGYGIVQQGNTR
GRYLGSGFTLHADQYGAVRANRGLYIGTHATRHDAEQLDVDAARDQLKAA
ADVLARQSSLSEQHRAESLKAGHDALTELTDATRQPVEPGASGGRTSGGG
TGSANGFKVPAMLLGSAGGMGLTTFQSLHASADRHVNVVAGQSAFVATGK
SFVASAGEKVSVFAQSGIKLFSKDAVQIESHRETIDLIGQKTVRIVSATE
RIEIAADKEILITSGQAYIRLKGGDIQIHAPGKIDIKGSLHNFSGPASMP
YPMPTQPDAVCVPCMMKQAAGRGAFVAMGA
>BTH_II1384 conserved hypothetical protein
MVKILLTAIACAVPAAALAEPPKASGGMVVDDAGMTLYTFDRDTVPGKSA
CAGGCTANWPAALADAYDKPGGDLGLIAAAGGRHQWTYKGRPLYRFSGDT
RPGQHNGDGFGGMWHVARP
>BTH_II1712 conserved hypothetical protein
MSEAAAGVANETKRDSAFGRDLLAGLRRAPRSIAPKYFYDAAGSALFDRI
CELPEYYPTRTELAILKRHAHEIAAQIGRDANLIEFGAGSLSKIRVLLDA
CAASGPPARYLPVDISAEHLAQSAAALRDAYPWLDVQPVVADYLQSEQLR
AIECVAGRRVGCFLGSTIGNFSPDEASAFLRHAASLLKGGGLLIGVDLVK
DVSILHRAYNDAAGVTAAFNLNLLKRANAELGADFALDAWAHRAFYDVDQ
QRIEMHLVSRRAQTVRLAGYAFRFDAGETLHTENSHKFTVDGFRALAQAA
GFTPGTVWIDDARLFSVHWLESRG
>BTH_II0065 creA protein, putative
MITLRTSTWIASVALAIFFASAVDAEELARISPHSQRYGTHIGISAYDDP
LLKGVTCFVSEPHTSDERPSFRDGHGAEASVSCHQTGTLAATARLPRQAQ
VFDESVDPVFRSVHVVRIFDIRRLVVLYFSYMESDVAGNLPGHVDVVRLP
VHWGRTGASAK
>BTH_II1564 dedA family protein
MWHFPQAVPASLGPWAVFASVLVTQLGMPVPAVPMLIVGGTMAAMGQASY
ASMLVAAVAATVLADSMWFFAGRARGRRLLNALVRFSLSLDTTLRVARKV
FEKHGAPLLVLAKFLPGLGLVSAPLLGTTAVAVWVFLFWDVAGASLWASV
WLFGGAALHDEIVRLMQWVSASGGTLFDAFAAIFVTFLLYRWAMRMRFRR
WLAKIRISPQQLDAMLKSAAPPVVFDARPRAVREKEAYRIAGAYPLDLDS
PDPLHPDLTTRPIVVYCVCPNEATAKRIVSQLHRKRIRHALALKGGLDAW
EKQGYPVEPLPADFDAARYVAPPLAEPAEPAELDAGGYPMRAGLTD
>BTH_II1142 pvdS
MDMSEKDTRRDTAPLHERQRRFEEDLVDAYDEELEMELDDRRFDDESLFS
PERREARKRYFRELFRLQGELVKLQDWVVSSGHRLVVIFEGRDAAGKGGA
IKRITQRLNPRVCRVAALPAPSDRERTQWYFQRYVAHLPAGGEMVLFDRS
WYNRAGVERVMNFCTDAEYEEFFRSVPEFEKMLVRSGIQIVKYWFSITDE
EQEVRFQNRIEDPLKQWKLSPMDLESRRRWEAYTAAKEEMLMRTHIPEAP
WWVVQAVDKKRARLNCIHHLLSLVPYYEIERDSVHLPQREHHDDYIRRPV
PGDMIVPEIY
>BTH_II0122 Protein of unknown function (DUF877) superfamily
MAARESQARSADAQLATQSDFNALLSREFKPKTEQAREAVEHAVKTLAEQ
ALANSVTLSDDAYKSIEAIIGEIDRKLSEQINLILHHDDFQKLESAWRGL
HHLVTNTETDEKLKIRFMDVSKDDLRRTMKRYKGVAWDQSPFFKQIYEEE
YGQLGGEPYGCLVADYYFDHTPPDVDLLSSIGKVAAAAHAPFITGASPSV
LQMDSWQELANPRDLTKIFTQNLEYAPWNSLRNSEDARYIGLAMPRFLAR
LPYGIRTNPVDEFDFEESTDGSDHRKYVWANAAYAMAVNVNRSFKHYGWC
TLIRGVESGGVVENLPCHTFPTDDGGIDMKCPTEIAISDRREAELAKNGF
IPLIHRKNTDYAAFIGAQSLQKPAEYYDPDATANANLSARLPYLFACSRF
AHYLKCIVRDKIGSFKEREDMQQWLNEWIMNYVDADPANSSQETKARRPL
AAAEVVVEDVEGNPGYYQAKFFLRPHFQLEGLTVSLRLVAKLPSVKEAA
>BTH_II0257 ImpA-related N-terminal family
MQSSDLIESLLAGVALDAPCGANLEYEQDFLRLQESATPRPEQQYGDTVI
PAEAPDWGAVERLALELTARTKDLRVAAHLARSWTELRGIPGYADGLKLV
AGMLGRWWDDVHPRLDADGDRDPAPRANALAEIAGAHGCARAARRQALFD
GGPSVRDAERVFDGRDGGEHGYPGGRERLIADLVRARDGGQPALQAALAA
LDALDAIRARVASALGGEWAPDTSDFEKALRRIVRDGLPPPVAATETDAR
AGAANGAANAAHGGAANGAASGGTPAFAANGRAWRDAEVTSRDDVQFGLE
KMCRYFELHEPSHPAPILLRRAQRLLSLDFYEIIRDLAPESLPKLDLLSG
QRSE
>BTH_II0530 lipoprotein, putative
MRKTLVMKVAIATALGGLALAGCTTTPDKPSTAASNASTREAIDANVNAT
LTRLYSTVPGSRELVAKSRGVLVFPNVLQAGFIVGAQSGNGALRVGGATL
GYYNTSSLSVGLQAGAQSKALIFLFMTQDALDRFRGSEGWAAGADASVAL
VKMGANGAVDTSTATAPVEVIVLTNAGLMGDLSVNGTKVTRLKL
>BTH_II1313 Uncharacterized conserved protein
MTAFWEHFRHGADIGVRGVGETLAQAYEQAALALTAIVANPASVRALRDV
EIACAEADRELLLVDWLNAIVYEMAVRKMLFSRFEVTLAENGLRARIAGE
RIDAARHRPAVEPKGATYTALHVGRRGDGAWAAECVVDV
>BTH_II1531 Rhs element Vgr protein, putative
MPTTGRSTATCIPCADGGLTSWQIGFSSALHFLRHRRDEHFWLDRDAQEI
LSEVFNRYPPLQGAFRFELSGALAKRSYCRQSETDWHFVNRIMEDEGLYG
YWIHDEREQKTTLRIVDRVEALPAAKPIDFYRGNAGDEIGGFTQWATLRQ
LNSVHVASRSGDYKRPSTPFEVRQSVQTTRYVEQTNWRTQEQKAIPYPPL
EDYSAGAYRYPDSDRGAAWARIRAEEYESRSRRYAGVGGSRWIDAGGRFV
LNDHPAHAESDPKAREFVAMAARWTIENNVSIARSSRHFPYSLQADIERA
RSGFGSAFAVAPHPQDGATGLYVIEVEAQRTDIEYRSPFEHRKPAMSVEM
ATIVTPNGEDVWTDPLNRVRARFHWDRQSPPDAFETSPPLLVAQSDTGPQ
YGGVHVPRRGETVYVDFVGGDCDRPYIVSRAPGGATPPMWHSDGLLSGFQ
SREYGGGGGYSEMQLDDATGQVRARVLSKTRGDYSHLTLGYGIVQQGNTR
GRYLGSGFTLHADQYGAVRANRGLYIGTHATRHDAEQLDVDAARDQLKAA
ADVLARQSSLSEQHRAESLKAGHDALTELTDATRQPVEPGASGGRTSGGG
TGSANGFKVPAMLLGSAGGMGLTTFQSLHASADRHVNVVAGQSAFVATGK
SFVASAGEKVSVFAQSGIKLFSKDAVQIESHRETLDLIGQKTVRLVSATE
RIEIAADKEILITSGQAYIRLKGGDIQIHAPGKIDIKGSLHNFSGPASMP
YPMPTQPDAVCVPCMMKQAAGRGAFVAMGA
>BTH_II1848 Prokaryotic protein of unknown function (DUF849) superfamily
MNQEVIVTCAVTGAGDTVGKHPAIPVTPKQIADAAIEAAKAGATVAHCHV
RDPLTGRGSRDPRLYREVVDRIRSADVDIIINLTAGMGGDLEIGAGEDPM
RFGQGTDLVGGLTRLAHVEELLPEICTLDCGTLNFGDGDYIYVSTPAQLR
AGAKRIRELGVKPELEIFDTGHLWFAKQLLKEGLLDDPPLFQLCLGIPWG
APADTGTMKAMVDNLPPGAHWAGFGIGRMQMPMVAQAMLLGGHVRVGLED
NLWLDRGVHATNGSLVERAREIAERLGARVLAPAEGRRKLGLPPRGERPL
ERRAIAGYA
>BTH_II1354 pANL56
MELEYDPNKRDKTLTERGLDFARAVEVFAGHHFTLEDTREDYVESRYITV
GMLDGRMIVMVWTPRGEARRIISMRKANDREQARYAYRLG
>BTH_II1045 head portal protein
MRFRLWGSIRQRKANRSTKRAAFVFSEVGMGWFDFIRRGKQPEADARPHV
EPSFQVAAPATSPPGESFSGLDDPRLKEYIRRGELDGGAGRETRALRNMA
VLRCVTLISGTIGMLPMNLINSDDSKRVQADDPAHRLLKYRPNDWQTPME
FKSLMQLRALLDGQSMARIVWTGNRPIRLIPMDRGSAKGRLTASWQMVYD
YTTPAGDKVELPAREVFHLRDLSLDGINGISRVRLSRDALELAEQAERAA
SRTFTTGVMAGGAIEVPKELSDNAYGRMKSSIRENHSGSENAGSWMLLEE
GATAKQFSNTAESAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIE
QLAIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLN
DQAAFFSKALGAGGQSPWMKQNEVRETLDLPRVDDPVADQLRNPMTQKPK
GSGDEPPATT
>BTH_II1206 conserved hypothetical protein
MAPDRKGIAASARIAGIARSAAFGAAALAAALFAAPGSAHRGDVGQVRPV
LTQPLAEAPGNDAQVVTVAYAPGAASGPHAHAGSIFAFVTQGRVVSQLEG
EPPRTYGPGEAWYEPPGSHHIVSRNASDTEPAQIVVFAVVGEHRALKTPL
PR
>BTH_II0130 Rhs family protein
MTIPAARVGDAIGHGGVLVTGSGDVFMNGIPAGFVAGSVAVCALHASAQA
VVAASGTVFINRLPAAFVGGVTSCGAPIVSGSPDVMIGS
>BTH_II1885 ImcF-related family
MIRTSLRVFAAILIAILIWWVGPLFAFGIYHPLGPVWVREILVALVLIWG
FWPTLARLWARLAMSPRQVKVAPKTKQLDFVDKHLRNLDQQLKERWRKEP
RGRWKRWVGTLTREHRAMLPWYLVLGSEGSGKTSLVAKAVSVSGSLQDRV
LGSDATYGRGDDFNFRITREAVWFDVGGRWSLRAGADEAEFDAWRKLLRG
MRRLRKGAPISGVVLCVDGLDMVDAPLDARKRLADSVRARLEEMREAFGQ
QVQVYVALNGLDRLDGAVSTLSLLDASKWAKGVGFSLPDDGAEADAARAD
ANWQHALQGLQQRVQQQVLYSAPAATEVSMNHAQLRFVETLSRLQKALVA
WLHVALAPGEPHTAARLRGVWLGSMAELAEAHPAGVGSSELPVPSRPLSE
LWTPLIRQVSLERDAVRPSGPKSWRGRLGEALRWGAVPLVALSLLLWFGW
GYVTERDYLDGVWAQFTEAKRLAQAEASYGNDGGSALIEIANQMRYAQLQ
AEDAAQGMATPYFEHGLVAETARETYYRHLQKMLMPELYNEVRRTLVSQV
DGTPGDIYQTLKVYLMLCRPDRRSADDVVRWLDGRWDALSGGQYSDDDRR
SLLGHVRTLMSLKNVPATPEDANLVRSARAKAAQIPLVTRVLQHIHAQGL
PQQVNDISLSRAAGFEASMSLRMRSNVPSTDTAVSGWFTRAGYTDVFLPR
LQKSARAMLEEESWVLRDETLSGNSFQIDGLVQKLADSARNQFLQDYISA
WQNFLNDVTVRGVTGLDDASQLAAAMMEAQSPLANLLRFAARETTLTGAS
DEGNIDSWIDRQKYRFEKGRRQIVGELSGQHYRTVLLPEHVVEEHFQAIR
QLAAQLNRNNTIANNPLSRLFEPLYRQLGLVNGALQAGQVLPAQYDAFSR
LKETAARQPEPVRGIMLDLVSSGSTMTTRESGALLNRGAAGATKMVCDQG
FTGRYPMRRNAQADAGVEDFERLFSAQGLMATYFRDHLAAYVDTSAKPWQ
ALRSNGGPNGMVSQSVLNSYETADRIRGAMLDDSGHLRVSTVLRFIDMDS
QLSEAQLSVAGQTVRFAHGVTSPHRVDWTNQNTQLAIKLQLKSVDGRMTT
LQFDGPWALFRFFDAGQAVGGTGTADRRERLYQTSLGTVRIEWQALTLPS
PIWSGILQSFRCPS
>BTH_II2166 lipoprotein, putative
MTTNRQESTVSLRRACASLAALALLAACTSTPEAIQRKTGWMRSEVADSY
IYAYPLVLMDVAKEAATADGAPGAMPVNTLRHAQALPAPGAANPPLASVD
MLDSTAWLDVAEEPVVVVLPDGRGRYVDARALDMWTNVLWSSGPSANPRS
GASRARTLAFVGAGWEGSLPEGVTRVDAPARNVWLDVRVQTSGGRDLAAA
KRLQRAIRVEPLSVYTGDARRAPRAARADAGAASAEPASGVASPVASPVA
QVAALDPPGFFSRVARALQDNPPPAADAHAQEILADIGVTAGAPVQWKGD
KLIGAENGAAEARERLAALPPNALAANGWSWLGDGVGNYGQDYALRAYAA
STQFGAGTRDDEIVAVVKTDSAGRPLNGAHRYVIRFAPNALPPARAFWSL
APYTPDGAVPELGRARRSIGERDRLHRNRDGSLEIVVSATPPGKGYASNW
LPAPRADFELALRLYAPKRQAADGNWQPPAVVRK
>BTH_II1653 conserved hypothetical protein
MRTAYVACALAAGFASAAAFADAPAMPSGGMLVAANGMTLYTFDKDAPNA
GKSLCNGPCAANWPPYKASATDRPTGGYTIIKRDDGTLQWAYQGKPLYFF
AKDAAKGDKKGDGFKDVWHAVKE
>BTH_II0251 lipoprotein, putative
MRLRFSGSVALGGALLLAGCGATERSVAVPYSIALDVAPDVNPDINRKPS
PIVLKVFQLKTASAFESADFFSLQDKPQSVLGADLLGVDRIILRPGDART
LHYRGNVDAGAIGIVAEYRVLEKNRWRMTVPLPRAKQLNLYRFWQTSPSE
MKLQVAVKNGGIGLGGDSRVRR
>BTH_II0990 uncharacterized protein conserved in bacteria
MNSTPDAFQRPAIRALTPLEARVLGVLIEKQHTVPDTYPLSLNALTAGCN
QKTARSPVMNVSEAEVLTSIDGLKRLSLASEGSSSRVPRFEHNMNRVLGI
PSQAAALLTMLLLRGPQTAAELRLNTARLHGFADISSVEAFLDELAARAP
ALVVKLPRAPGERESRWMHLLCGDVALDEALAHGVQEDAVPPSEFEALKA
EQKALTAELARLRAFVEYMANELGIDADKFTRES
>BTH_II0135 putative cytoplasmic protein
MTAHPPATPPIDARSPLHPDALRAWFDPQAPWRAGFLSLLRAIAARDARM
PLPGTACLPREEPFRIGQRPSMAFAPREIASLDVQRGRLDIQLFGLGLWG
PQGPLPLHMTELAYNRAESYQDHAIAHFSNLFHHRALALFYRAWASSQAT
VSLDRADRETFSFYIGSLMGTDPEEAARTHPPTHARYAACAHLVREARNP
DGVAATLSHYFGVPISVDEYVFHWIRIAEPERCLLGARAASTVMGEGALL
GDMVPDCQHKFRLVIGPLDLDQYLRLTPHGNDLPTLVDWVRAFIGHEYDW
EIKLLVKPRAAPPARADTAHRLGYSTWLGESRDDKPVVGMVFEPEKYCS
>BTH_II1057 Phage minor tail protein
MKDTFEWPSTVQGHGGDTTLRVRKAQFGDGYTQRAADGLNNRESTFDLRF
VGNAAKVAAIIDFLDRHAGAESFYWTPPLRARGLFVCEKYSEPIKNGAVY
TMTAQFEETFSV
>BTH_II1755 conserved hypothetical protein
MTEQEAERIATHRHYKGGLYRVIGVARHSETEERLVVYEQLWPKEHSLWV
RPEAMFNETLADGTPRFRKLGD
>BTH_II1918 u1937b; B1937_F1_4
MQTLFDTGSQQDAAELGAALAAPAITGRYDELRGGAAALPSACLAPAWRA
FFTQLGSVGFADLDRRAEALQRRMRENGLAYHPHERAAGGGAVRPWSLDL
LPLIIAPDDWAAIERGVLQRVRLLNAILADLYGEQTILRRGLLPPALVTG
HPGYLRPMCGARAPGGTWLHVVAFDLARGPDGAWRLMAQHAQGPSGLGYL
LENRLIVSRLFPRAFRGLHVQRLAASYRALLQSMQALSPERKNSRIVLLT
PGPHAAAYFEHAYLARYLGLTLVEGGDLTARDQRVYLKTLRGLEPVHGIL
RRVDDEWLDPLELRPDSLLGVPGLMQAVRAGNVLLANLPGSGFLESPGIL
GFLPRLAQSLLGETLSLPAVPSWWCGEQAACDEALPLLARSIVKPTFPAS
VQAGGAFEPVIGARLSPQQLAEWRARIAAQPAHYTIQADLPLSQAPTWAL
GAGMGDGGARIVPRPLLLRVFALADGARSWRVLPGGLARVGTRDELFNAP
MPRGGSSVDTWVMTEGAVDPTTLLQTHLGPDDLQERSRAIASRAAENLFW
LGRYTERATNLTRLARAALERLRGEDDVETPVHLHLLDALCRDNGLLPAD
APPAVDAPRAFQQALTRSLTPRADRSMGIASCLFGMRGAAAAIRERLSRE
QWRLIDDATQLFDEGGDDVEPEEQLGNDALQRLERLNLLLSAITGAQTDN
MTRDDGWRLLSIGRQIDRLEFLGGVLGHAFDGGAIHKQEGFELVLELFDS
GITFRSRFQRCFDVAPLLSLVVLDTDNPRSLAWVAQALRGRLSKVERGEG
YALSELADGIPDIAGWPLHVLCETDGDGRHAALLARLQACGKAAWDVSNR
IGERYFSHVRDAGRSLWG
>BTH_II0856 conserved hypothetical protein
MMKLSDSMMPLVAYARQLAQSPRGDAAEVALRLDILIERARDQAACAGAP
EADVDDALFAVCAWIDEALLNSGWNDAERWTLRLLQKRYFDTSHAGVVFF
ERLDALDGARADVLDVYLLCLQLGFRGRYGYDGGSGPLDAIRQRALDALS
AHAPPCAAHALPFPAAYPHAETQPGAHAIAAKARARRRIALAGRNFGLPL
AILLSLYLIYHAVIVRMVDSVFPRLQ
>BTH_II1151 conserved hypothetical protein
MIQPTVVFKDNLAQLPAIDGIERIDLVDGGGAVIASIENKPGKQGSLAVY
HYLREAFGTLDARAAEHGLAVFAEHTADARHRPGAHPNVDRLLAIAAGGD
ALRIDVVARG
>BTH_II0511 conserved hypothetical protein
MNTKPIPTTTTKRPAFTKTTLLVSLAATAALSLSACVAPNAYGPYGSQPQ
YGTPAYSQPSYAQPDYSQPGYPQQGYAQPGYQQGYQQGYQSGYSGYSTQY
GTIAGIRPIGGATSPSGVAGTVVGALVGGVLGNQIGRGHGRDAATVIGAL
GGAVAGNQIGQQMGAQPSAYRVDVQLSDGSTRSFDLQTPGDLRPGDRVRI
DGNQISRY
>BTH_II1369 YbaK / prolyl-tRNA synthetases-associated domain family protein
MSMSVTLQDCLRQKASRYEVIHHPYSHSNMEAAAAAHIPGDRLAKTVLLE
DAQGYVAAVLPTTHAVRLSELWAKTGRHLVLAKEVELRELFKDCDVGALP
PVCMAYGMQTFLDDSLARQPDVYFEAGDHEELIHMDRDEFLSLMDKAERA
SFSHKIQGVVS
>BTH_II2016 conserved hypothetical protein
MAASSGTSSPTPSHSAEPPFVPAPLARAHARYWRFNVALIAVLMTIGFAV
SFVVPLFAPALAHLRFAGFSLPFYVGAQGAILVYLALIGAYIVLMQRADR
TLRRDYDAYADEAKRKEVISTDTDAC
>BTH_II2330 PAAR motif family
MVKRAIICVGDTTTHGGKVLEGSPTFTLNGRNVAGVGHKVLCPRCKGIFP
ILPDLLGRRYPHTIADRDTAVEGMRTACGAELIASQGTGTIDDVGAGERG
DGGSPGGSAAAAAAAVAPSPTLCLECLKAAAKNAATMVARG
>BTH_II0263 Protein of unknown function (DUF1305) family
MTAQPKPNPEPAGADAGSHANSKAGRRAGAASNAEPGAMPNADWDAAADA
TPNATPSATTQADAARRDAWWRRLRAAPHGYDLFHALRWLDALSPEHAPS
GYASRPRDEPVRFGQAPSLAFAAAMLADVRDEAPRPRVAIHGFGLFGPNG
PLPSHLTEYAYERAAQHDDPTFAAFADLFHHRLILLFYRAWADAQPTVSL
DRPARARFDRYVASLIGPCDARSCDAPRGANAIAPHAKYHQAGHLVRHTR
NPEGLVQILQRYFGVRARIVEHVPRWVMLDRAQRCAIRATRPTQPLGGAV
LGRAVRDAQSRFRIVLGPLTLDEYRRFLPGGAHAQQLAQWVREYVGIEFD
WDVQLELARGEVPALALGSRDGLGRTAWLGERLDPGPARDLVLGYDGRAR
GGAGVARARSTAADQAAAADAFDRQPA
>BTH_II0359 conserved hypothetical protein
MASIIDNLIAEHRRLERLVRLLECQSMLRDAQAAENAALLVDALYYLTRF
PDVNHHALEDRIIDKLLEKTVLPLELGGELSAQHATLARQGHALIQDLES
MVRDENMSRELLEFRIRLYAERLRHNMAVEELTMFPIAKRYLDTGDWSSI
LQTGAHRSADPLFQTPVHERFVQLHRMIAVEADCGCKEGNA
>BTH_II0255 conserved hypothetical protein
MPTTLACDGEPPAWYGKIPGAGDFVNHRLSHELAGWWERWLQQGMAAMRQ
RGGDELARYYTVAPVWNFLIPAGAGAQCVQPGCLAPSCDRVGRYYPVIAT
LPMRTADYWSALPDVADAFYWQVGSALLDAIRHARAPVQLEQALAKVRLV
RGADARSAAFGGAGGERGELGESGWRGERVAARESGEGGCIGGAGDIDGA
GESGKSGASDERGESGGEARSGPFARPRPSPPATPAWPGLSQYFDPYGAT
SFWWTNRADGSPLRTHAHTGAPDSRLFLRLFGGVQHGV
>BTH_II1887 Bacterial protein of unknown function (DUF876) superfamily
MSSLPVGPVAWSDGMLIETQHFQQLERHLAHQAALRLGQTSNHGWGFTLL
DLDQDGLGLGRLGLRHARGVFQDGTAFSLPSDDPLPPPLETELAQAGDVA
CLALQAARTGGPEMAFGDVASASRYRAVSTEVPDLAVGLDAPGTPRRLTI
ETGQLITRLCWKSQLRSDEVALPIARVAGRNASRTVSLDPRFIPPLLDTR
AHLVLRSLIDELQSTLRVRLASTSAQRVLSTGGGVADLIELLLRQAIAEY
RMRLANLDAFDPLPPAMLYHELVGLLGRLSVLPGVDEELADRELGYDHDD
LQTSFEPLAMMLRQALARVIETPVLPLRFEDRGDQVHICIVDKQWNLKKL
IFAFSAAMPAEKLRQLLPQQTKLGAVEQIQKLVDLQLPGARLNALPNPPR
QIPYYAQSTYFEVESTDPFWKQTLAGSAMALRIVGDFPDLRFEAWGLRDG
KVA
>BTH_II0302 Uncharacterized conserved protein
MVPIGKVASPRVELIDDHWGAIESTITIDSPALSDDALMGLDTFSHIEVI
YHLNQVPSQEIERGARHPRGRADWPKVGILAQRSKGRPNLIGVSRCQLLS
VEGRTLRVRGLDAVDGTPVLDIKPYLEEFGPIGRVSQPAWSHEVMKDYYI
LQPSE
>BTH_II2167 conserved hypothetical protein
MPGAGGGRSGAVAAGGRRSASAGRRGGRCGFSGLREGAAFGHVSAAGNVT
VRRASAWVPRPWPWPMRPDAPGVFACGCRSLYHRHRHRHRHRHRHGHGHG
HGHGHGHRHRHRNRNRHRHRPHFRCAPHERSRNCVLEAPQGGSMATNTKR
LMAAALGAALAFGAPFARAASFDCARAASAAERAICGTPALGELDVRMAA
YYELLQNARPADEGIAYREFRDALRDGQQSWRQRVRDACGARIDCLTNAY
TARIAALRGVAAERLALRMTGGSAPSPDAAGATYAIEGESIKLTNGESVR
PAAPGSAAKRVTTLVARSAPATIAGGPLEAVLLSDDPGGSGRFLYVATVQ
PGGGAPAVLLGDRVKPVSVSIERAATGGAIVVVEYLDRPEGAPFAQAPTV
KVVRRFALEQGRLVEQRG
>BTH_II1901 Protein of unknown function (DUF770) superfamily
MVRQKDGQKFIGESRAPRVQIEYDVEVYGSQKKVELPFVAGVMADLSGDN
TEPLGPVEDRRFQEVDVENFDERMAQIAPSLSYHVKNVLTNDGTLIPIDL
TFTSMESFEPAAVVKRIPELSTLLEARNRLKELLTYMDGKAAAEDVIQEL
LKSPQWANEADAAPEQSGGAGEGADHQPEEGAK
>BTH_II0589 CHAD domain family
MARVLEIVLDFPLQGWQAGRGARARARDLGAELARAWRICPPVKMRRGHE
RVTIEPCRFVEAEPDDGGRWQTWIETTAQARRALAVRCHPFVPGVMVRER
LDDYRGDVRVATPAAGASVASDASGATDAGGDSRPSASTAASVFAESAAQ
SADSAPMPGFAAGRAAFPESRSPYGPAYGFVADRRRGRWLDADGIDVELT
LDDIAFAPAAASSEAARAASSRVCELRLAVADPDDSSARAAALRALFNAA
RELSGAWPASLATASVLDRACMGDAPDAAGSPAKAQPVDLSTMRTQRAAF
FALGCGVTAQWLGNEAGARDMADPEFVHQMRVALRRLRTLVRLFPRYADE
AWKDAFSGDIRWLAGMLGAVRDWDVCVTSTLPALAAADSDEAAWAGTLDA
ARAQGDAARAELRQALGTARYTRLVFAWLEWLSLFSLGEDDPARGKAPSL
KRHAAKRVSRLFGHLYGAGRLTTLDAAARHRVRIDAKRLRYALEFFSSLA
SRRTREDTVRLLARLQNALGDANDAAVALRCLERLSAPPYQLGFARGYGA
AAQRYAAEAGEQMLRGMRVPKIGGRKA
>BTH_II0615 conserved hypothetical protein TIGR00294
MPLQGSNKMNLMNPEFVMPDVQSTVDTRQMPIQRVGVRAVRHPLTVRTAE
GETQATVGTWNLDVHLPADQKGTHMSRFVALLEESGGPLTADAFRAMLAT
MLEKLEAQAGRIEVSFPYFVNKTAPVSGVRSLLDYEVTLTGDVRDGLTRV
FAKVLVPVTSLCPCSKKISQYGAHNQRSHVTIDAELAADVPVEDLIRIAE
EEASCELWGLLKRPDEKFVTERAYENPKFVEDLVRDVARRLDADERIVAY
VLEAENFESIHNHSAYALIERDKRRRA
>BTH_II2251 Uncharacterized small protein
MFSDLGGELRTAGRYLGQALRLMVGLPDYDGYVAHMRATHPDRPVMTYEA
FFRERQNARYGSGAGKCC
>BTH_II1509 MgtC family protein
MTFEFALRLFTAFACGVAIGLERQIRQRTAGLRTITLVASGACLFVTLGV
LTGNGIPGVTQIAAYVVSGVGFLGGGVIMRDKGSIQGINTAATLWCSAAV
GVLSGAGHYLPALAGTGVVLLTNTLLRGVSQAINSTPVSNADLVREYQIT
VICLASDEVHIRTLLSNSMYAKPLSFQSLTSEDVPREADAPERIKVTATL
KLHPKDQPKLEQIASRMSMEKSISSVSWTAKEAEPLME
>BTH_II2285 MlrC C-terminus family
MKILVAGFRHESNTFAPSKATYASFAADGGRYPLSRGAEIGRLKGMNLPV
AGALAALDDAGHVALPATWADATPSGRVDSVAFERIAGEIVDAAKRRDAD
GIYVDLHGAMATERYDDGEGELLRRLRETVGARVPIVASLDLHANVTQQM
LDSADGLVTYRTYPHVDMAETGRRAVALLDALLGRRGRHRHFSSARRVPF
LIPVNAMCTSLEPSKSLFRLLERLETGSVRSLSFAPGFPAADFPECGPTI
WGYGADPVQLARAVDALYEHVVSAEAQWSVPFMSADDAVTEAIRIARRAR
KPVVIADTQDNPGAGGGSNTTGLLRALVRHRAPDAALGLFFDPAAACAAH
AAGLGASVEITLGADSGLPFTGTFRVESLSNGRCHCNGPMLRGATFELGP
TACLRIGDVRVVVTSARVQMTDRSFYRIAGIAPETMKILVNKSSVHFRAD
FDAIADCVLIAKAGGWMAADPADLRWTSLADGMRTSPCGAPFFGCGGRHA
PHADGITGEMRL
>BTH_II1647 fusaric acid resistance domain protein
MKHEPALQRFLIDPQRWQERLRSAGRLARDWAGRDGLVWLHLAKTVAAAL
LAMGIAMLLDLSQPRIAMTTVFVLMQPMSGMVLAKSFYRVIGTGVGLVAA
LALGGLFAQQPELYMAGITLWVACCIAAAVRNRHFRWYGYVLAGYTAALI
GLPAVMAPNTLFLSALTRAAEVAVGIFCSGAVSALVFPLSSSDALMRTLN
ARHADFVAFAASTMAGTVERRDFERRFADFVDGIVGFEATRAFATFEDPH
IRARSRRLARLNSEFMNACARLHAFHQLLKRLRANHAAEVLAAIAPHVDA
LSGALSALRRDLAQPRATPSPSLSPAPALVELSAYLAALAKRARASRRAL
ETLAPAGVLDFDTAIELLYRFVDEFLGYAQTHASLALDSHVLERSITRYA
VKTNRYFVGFTFLRTLVAMGAMSAFWIASEWPSGALAVIGTAIACALSST
APRASRFVAQMAAGAALATLTGYLYVCHVYPNIDGFPLLCAALAPALAAG
AYLATRPGKSGYGIGFVVFFCLLAGPDNVIVYAPDVLINNGLALVVSMLA
ASIAFAVVFPAEMPWLTGRIARDLRRRIALACDGPLQGLDQRFQSSSHDL
MSQLRNLLMKRSRRHRDALRWMLATLEVGHAVIDLRREMQAFTAAQPAQA
LRWSALIDAVRDALPRLFETPDAHRLARALKSVNLAIRAVQHTQHLWYAV
PDERRRMQRIVSCLHFIRSALIDQDAPFNRGSRARERVRARRM
>BTH_II2074 DoxD-like family protein
MTRSVDSGVIFFARLLLAVLFLWGGTMKVTGYGEFVGYLKGLGVPFTQVA
APAIVALEALGGVLLVVGYKVKPLALMLAIYTIATALIGHNFWDATSPAL
QRDMAVHFWKNVAIAGGFLLLYVTGAGGASIDGARRPSSSYGSLR
>BTH_II1713 Domain of unknown function
MTKNEESTLGLASDLARGFTDVRRYSVELAKPLSAEDQALQSMPDASPTK
WHLAHTTWFFETVILARHARGYKLFDSRYPYLFNSYYEALGPRHARAQRG
MLSRPSLGDVHRYRRHVDDALLDLLRTSDLPTLIAIEPEITLGLHHEQQH
QELILTDILHAFSLNPLLPAYRSDDATPPADARRANGAMRWLSGPSGIVE
IGHDGRGFSFDNERPRHRTMLHPYQIAERLVTNGEYAAFIDDGGYARPEF
WLSDGWAIVQREEWKAPLYWIASDGGEGLGWREFGFGGLQPLMLDAPVSH
VSFYEAAAYAEWARARLPTEAEWEAAFDAPDIVQMTGCVWQWTRSSYGPY
PGFRPMAGVAAEYNGKFMVGQQVLRGGSVATPPGHARATYRNFFPPAARW
QFTGVRLARDI
>BTH_II1774 serine protease, subtilase family
MSCTRYRKNNDHPKLAQSMGVLVALVGAGIVPAHATCTAAGTTVTCSGAA
DPLAPSYANSGNNLGVTVNSGASLGVLLGVGGTALSLTGSGVTLTNNGTI
DPTVLGFGLGVLSSGAVVGNASPSTTTVTNNGTMNGSTGVSISGLTGMAL
AVENGTGGVSNITNTGTIGSTPLAGATLLGPDSPVVAAYGGGQVNFSNSG
TITGRVAFQSNGTAGQGNTFVNSGTIDGSVSMGTNSTNTFTAMTGSTVSA
AGGTGLSLNIGVGSLTLGFAATGIVDGGAGGNNTLVLQQATGGPATGAIA
VDNYINFNHLDVTSGAWTISGASSAQDATLSGGVAIIGNNASLGTGAITG
NGGALQAGAAGLDVSNNVALGAGGLTVQGATGLTLSGAISGSGALTKNDT
GTLTLTGANTYTGGTTINAGTLAIGAGGSLAATGAVNLAGAGAALDISAA
GANQTIGALSGVAGTNVNLGANGLTFGDGTNQTFAGAIGGTGGVTKQGAG
VETLTGANTYTGGTTINAGTLAIGAGGSLAATGAVNLAGAGAALDISAAG
ANQTIGALSGVAGTTISLGANTLGFGSAANQTFGGSIAGTGGIVKNGTGT
ETLTGANTYTGGTTVNAGTLALGAGGGLSGSTTVNLAAAGAGFDISGATG
NQTIGGLSGAAGTTVALGGNSLTLAGSGSATFGGTIGGTGGLTFAGTGTQ
ALTGNNTYSGGTTLAGGTVALGSGGALGTGAVTVAAPTTIDTTSAVNLSN
AVALNATATVGGTQSLTLSGAVSGPGGVVMNGSSTLTLGGANTYAGGTTV
NAGMVVVGNGSALGTGGLTVNGGGVSLGGSSVTLPTLNGAAGGTIDTGAG
SLAVTGGGSFGGALTGGGSLAVSGGAPLTLTGANTFTGGTTIASGGALQI
GNGGTTGSLAGNVADNGALVFNEAANLAYGGAISGSGLLTQAGSGVLTLT
GASTLAGPTTVAAGTLAVDGSLANSTVTVQNGATVTGTGTLGGLIVASGG
TASLPQPGQALNVAGNVTFEAGSTLQVAANPQQSGSLAATGSATLNGGTV
QVLASQASYQANTTYTILSANAGVAGQFAGVNSTYAFVTPTLGYDANHVF
LRLAPNGNAFTSVATTQNQTAVAGALGTLGAGNPLFDTVLVSDAPTARGA
FSQLDGELNASLQSMLLSDSRYVRDAVTDRVRQGLAPGSGPLAALSAGGA
ALCDDAGGGAARHDAMPPERRLGSRDSCVGRTPYRPVVWGQAYGGRSRLA
SDGNASTLNRSMTGFIAGADVALNDRWRAGAAAGVTHSSLDNDLNASASL
NSYYVALYGGAQYGAWGVRGGAVYTWYRINADRSPAFANFRDHDSAGYDA
NSGQVFGEVGYAIPVGRFALEPFAGLAYVSLHTDGYQESGGAAALKSGAQ
TSNVAFSTLGVRAATALDVLAKGTLSAHAMAGWRHAFGSARPTSTLAFAR
GGASFQVAGVPIARDSAVLELGIDASVTKNLTLGVSYSGQYGSGVRDNAV
LGNALWRF
>BTH_II0011 Protein of unknown function (DUF1348) superfamily
MSDATEIRPPVPPFTRETAIQKVRAAEDGWNTRDPERVSLAYTPQSKWRN
RAEFATGRAEIVELLRRKWTRELDYRLIKELWAFTGNRIAVRFAYEWHDD
AGNWFRSYGNENWEFDENGLMAHRHASINDMPIREADRLFHWPLGRRPDD
HPGLSDLGL
>BTH_II0866 Bacterial protein of unknown function (DUF879) superfamily
MATTTLNRYYEDELVRLRELAAEFGRAHPLLAPMLGAPSGDPDVERLLEG
VAFLTGLARQKLDDGLPELVQALANLLFPHSLCPVPAATLIAFEPRGALR
ERAVVAAGTEIESMPVDGTACRFRTCGDLDVEPIALAGCRFVPPADGGPA
LRLDFEMLGIDASEWDATRIRLFVGGERLHASRLFAFLMQHVVSVDIASG
PPELPGPRCTLGGRALRAAGFDDALLPCPERAFPGFRLLHEYFAYAEKFL
FVELSGIERWRTARSGSQFSVRLALDCAPDWLPGIERDSFRLNVVAALNL
FAHEAVPIQHEHRATDYRLQPEGDAHGHCRIYSVDRVVGYRPGHPVDRHY
VAFGAAGDGALTASYRLIRRAALDGRGHDVHLALAYPPGEALAAETLSIG
LSCTNGTLPARLRIGDVCRPTDSSPERFAFANIAPVSPPLDPPLGEPLLW
RTIGHLALNFLSLGNVDNLKQMLALHAFGERGDAARAQADRRRIDGIEAV
DVRAETRIVGERMLSGQHVALRCRAHAFGGAGELYLFGCVLERFLAEYAA
LNTYTRVEIDASPDDGRFAWPPRMGAQCLL
>BTH_II1705 oxidoreductase domain protein
MPHYVEYHAGFGHFHWHNDYSHESEEAPRKLTVIVQLSEPHEYEGGDLEV
FGSSIAVAPRHRGSIICLPSFVEHRVTPVVAGVRRVLVAWIAGPRLK
>BTH_II1925 chitin binding domain protein
MRRFHSSESGSLHAACRPERPASRELAAPAIFGRPRRRAASSTIVTESSD
ECASARHFDIQNNRRISTTLRNANRVTSHLEERYMKSHFDAPSSRPRARA
ALTLGAAATLTASFAALLAPVDADAHGAVGFPIARQYQCRLEGGYWDPPN
GSAIPHDDCRAAYRAGNNSAYPFTQWNEVSANPVGQGNDLAQLKAAVPDG
LLCAGGDTSKAGLDKAPASVWRKTQLTPRNGHIELQWENTTAHNPARMRV
FISKPSYDRSRPLRWDDLQQIYDAPAPAPVPANGAGHLPGSIQSFYKLDV
TLPAGRTGDAVLYSYWQRIDAGNEGFFNCSDVTIASDERASGFPWVATRA
FVEPGIAPQAGQQVRFRVMGADARGAEIVDVRQPITPYNADRSVWAKQIA
DQVNGRYGSVAKIGVRSGNTIYFDTANLNANKVWLQPNYSSALGVVGAK
>BTH_II1754 Protein of unknown function (DUF1006) superfamily
MRRFALQWRIRFHPPFPAPLVTAPLSIASARALHLAAQGLLTPPRRKAVK
ADVLAAIRRMAQLQIDTIHVVARSPYLVLFSRLGAYAPQWLDEHLADARL
FEYWSHEACFLPIEDFGLMRHKMLNPVGMGWKYAAEWHAKHRDAIDALLA
HVRASSPVRSADFARGAGKGNGWWDWKPEKRHLEVLFSTGQLMVAERRNF
QRVYDVAERVLPHWDDARDLPPRETVLPRLVGNTCRALGIVRADWVADYY
RLPKRSYRDELHALANAGELLPVAVEGWSADAFVHREFAPLVDAARDGAL
HPTVTTLLSPFDPVVWDRRRASALFGFDYTIECYTPAHKRRYGYFCLPIL
HRGRLVGRIDAKAHRAQRVFELKAVHIEPGVRVGAGLAADVGRAIRKLAD
WHETPVVEAGNAPKEIARAIGAD
>BTH_II1126 DGPF domain protein
MRVMVMVKATNESEAGKLPTKAQFEAMGKFNEELVKAGVILAADGLHPSA
KGKRVRFSGSARTVIDGPFAQTKELVAGFWLWKVASMDEAVEWVRRCPNP
MDGDSEIEIRPLYEMEDFGEEFTPELREREARLRDGIDEPREGAR
>BTH_II0608 conserved hypothetical protein
MNKPTRRPTTGDLSDRREPNRRRRARRSPARQRGSLAVIAAIAIGVVIAA
LGAVDLGNLFYQRRALQSVADLAALAAAQTMDDGCTQPAATAQSAALGNG
FDSAASGQSMTVVCGRWDVKDNAGPSFFAGSASGTAAGSDAQLNAVQVTL
TRVVPYYFLGAQRTVSATSTAQATNVGAYSIGTTLAQLQGGVVNALLNGL
LGANLNLSVLSYQGLANARIRIKDLMAAANVGTVNALLNTQTTVPQLANW
MLTALSQTSVANADLQTSIGALQTIVSANVPGGQTFTIGSTANSTGIFSV
GLSDPQAALDATFSPFDALLVAAEIATGQTAFSLANGLNVGGLNASLQVQ
IIQPPVLGIGEAGIDPATKTWRTIARTAQVRLYLNIGLGTANLPLGLLGA
LVPVQVNLPLSMQIAPGQAWLQSANCTASPSTCASAIGVQTGIANLCVGD
TPANLSASLPFTCSTPATLVNVANLVTIQSLASLPADVPASETPTLTFYG
TTGGYQSTNSNGVGSVLGNALSGLGTSLQQTQISLIGISLPLDPIQAALD
SFLGAVLPPMLSGLDAAVVPLLQLLGVQIGESTIHDMSLTCGVSQLVY
>BTH_II0869 Protein of unknown function (DUF877) family
MNRDTERTTMEGEHLYSPKNDDAPPAPAPDSPASLLDELIEAARVKRDEE
AYPITRHGIEAFVAHLARPKRPIETVSQATIDDMIAEIDRKLCRQVDAIL
HHPDFQQLESTWRSLKFLVERTDFRENIKIQFLDVGKAALLDDFDDSPDI
TKSGLYQKVYAAEYGQFGGQPIGAIVANYTFGPGAQDVKLLQYVASTSAM
AHTPFIAAAGPAFFGIDSFGKLPNVKDLASLFEGPQFAKWNAFRESEDAR
YVGLTLPRFLLRLPYGANTTPVKRFNYDERVDGGDADFLWGNAAFAFATR
LTASFADYRWCANVIGPKGGGTVADLPLYAYEAMGEIQNKIPTDVLISER
REFELAEQGFIALTMRKHSDNAAFFSANSTQKPKFFGISKEGKDAELNYR
LGTQLPYIFVVNRLAHYIKVIQRENIGTWKERGDLEQELNQWIRQYVADM
DNPTEGVRSRRPLRQAEIFVSDVEGEPGWYRVDMKVRPHFKYMGASFTLS
LVGKLEKR
>BTH_II2144 membrane protein, putative
MHTLYLIAIVAEAMSGALMGMQRGMDRFGLALVGAVTALGGGTVRDVLLG
RYPLTWVAHPEYLLLTLAAATFASMTATHVARLKSLFTTVDALGLAAFSI
IGCDVAATVNGSPVVIVLAGAITGVCGGMLRDVLCNEMPLVLRKELYASI
ALLTGGLYVGMKALGVAEGLATVVALIAGFALRMLAVRRGWRLKAFHAAE
AG
>BTH_II0571 pectin degradation protein kdgF
MAGLNERGFGSWNRVERETLTERIERQVVSGDALTMAKLYLKKGAFVGTH
SHPNEQFTYILEGRLRFRYGEHLEHEVEVGPGEILHLPANVPHNALCLED
AIDLDVFTPVRADWLAPDGNRYFAGTPAAASPAPSASR
>BTH_II0265 Rhs element Vgr protein
MFTLDSLHGDDLKFHRLYGEEALGRMFDFRIEALADNHSLSLKELLGKPV
TVRIRQQDESERHLNGIVARAALVGRRAQRHYGYQLIVRPWLWLATRRSD
CRIFQNKTVPEIVQDVLVTYGFPIENHLTDTYAPRDYCVQYNETDAAFVS
RLMEFEGIYYYFKHAAQTHTLMLCDAMASHVALPGYEHIPFIARDRTAIA
DEEHIDSWLPAQEVSIGKHETSDYDYTKPRADLSAQKIDPRGHDHDGFAS
FEWPGGYRDDEPGAHYSRVRLEEQQAEHERALARTDVRGIAPGYLFTLEH
CPRADQNREYLIVRCQYRFQENAYATDSGNEAVVHESQVLVQPSSLPYRS
PRATPRPRTNGPQTATVVGPVGEEIWTDQYGRVKLQFRWDRYGQSDQNSS
CWVRVSSPWAGGGFGGVQIPRIGDEVVVDFLNGDPDQPIVTGRVYNGEKM
PPWGLPGSATQSGLLSRSSPGGTTEHSNAFRFEDKKGAEQLWMHAERNFD
AETEQDHTLSVGHDHSHSVGNNETMSVANDRQRSVGQNETVNIGKHRVAQ
IGGNETHGVAGNRTRQVGQNEAVTIGANREATIGGNHVETVAKDKTETIG
QGKTLNVTQHYQTNSKSMKTAVVQHHAEEIGSRTSTIKNAHVLNVGDSQS
VNVGASHTMSVRNNVHVGAGDEIALVCGHASITLKKDGTILINGVTVESS
ASGSHSVRGKTVTSSATGEHTVEGTILKLNP
>BTH_II1891 pentapeptide repeat family protein
MSKIRSAVPPPPLPEIVEGQRYVSAQRDVALADTLFVDCHFERVEWTGCR
LSNLRFVNCTFDANRFDRCELEKLSYESSRVREGAWTQSALQRVSFNECE
IDGGAWAGCLLKDVVCSQSKGGAWTFDAVRGAHVSLVAGEYAGVTLRGGH
WSDTSWIGSRLVDLRLESVGLENLIAGQSGFERAVLVECRGINVRWIDSR
IERMTVQGCELKQAAWSHSTWATGEIHASRLPIASFDHASVNGLTVTNSE
LPQAIFDSASVADSALQGVRAPRIALRDAWLTRVNLAGAQLQQLDARGVR
LERVDLRGADCRSGNLVGQLRQTWAAADTRDAVFEEATSADDRLWWQRVQ
PGARGV
>BTH_II0495 Protein of unknown function (DUF445) superfamily
MLDDKERELRNSKRRALALLLAAAGVFAATLFAPRGFWIDGVKAVAEASM
VGALADWFAVVALFRRVPIPLVSRHTEIIPQNKDKIADNLAAFVREKFLD
PASIVALIKRHDPAARLAQWLATPRNADVLGGYSARLVAFGLDMTDDARI
QTFVKDAFHALLDRIDLSQSAGAILDTLTKDGRHQALLDDGIAQIVEFLR
DPDNRASIATYIVDWLRYQFPKMEKLLPTNWLGEHGAELISNVVTRVLTQ
IAEDPEHRLRRGFDDAAARLVTRLKSDPAFIEKGEEIKRYLRDGEAFNRY
VKDMWDQLRAWLKADLARDGSIVRQRATALGGWLGERLAQSPQLRDSMNE
HVERAASEMAPEFAEFLTRHISDTVKNWDAREMSRQVELNIGKDLQYIRI
NGTLVGGLIGLGLYAVSSIARWAGALPY
>BTH_II1763 conserved hypothetical protein
MTTYRDTHATSFRIGDVVTLKTGGPRMTVTYAGPVVFDTGEWVICQWFDE
HGEFRQEMFPNETVVLEPRTISAGLARMRSLSVRGGMQA
>BTH_II0534 DGPF domain protein
MRFMIMVKANATSESGAMPDESLIAAMATYHEELAKAGVLLDASGLQPSL
KGWRVRYSGGRRAVVDGPFAETKELIAGYTLIQVRSRDEALEWTRRFPAP
FGEHEDGEIEVRQLFELDDFEPGDAVERFRELESKLG
>BTH_II1424 conserved hypothetical protein
MTAAMRIAAVSLQKSVRDRLEGLRVTGLYYGLAWASVILPIALLAIGLFN
ATLMPSEKGFYAMSFALALSGSVAVQKNTRDLKAAGRGRAETEIVADVAE
>BTH_II0009 YceI like family protein
MKPARWAERFACAIALVGMAASCTPLRVVTHTVSTTEAAVPAGRYTLDPH
HWSIVFDVDHFKYSRFTMRFDRASAQLDWRAGGLADSGVTASIDAASIDT
NVPLLDKLVAGSDALDAARAPRIRFDGTRFAHTSATQGTLTGNLTIRGAT
HPVTLAVTFNGYGRNPLTKQDTLGFSASGTFSRAQFGVTSWYPAVGDDVR
VRIEAEFVKQGEAPAT
>BTH_II0098 PAAR motif family
MRGVIRIDDSTSHGGRVVTGREGSTVMGRAVACVGDRCTCPMNGHEHCVI
VEGDEGVRIEGRAVAFDGHRTSCGATLISSIPTSGRV
>BTH_II1623 conserved hypothetical protein
MGYSGRLGRRGGGGHDAVRRPRLAPQCQRWPPSQAPRRRCEASGQCRAVS
SRPRIGHRQCAMPDLPFDRHGDAPAAVDRARMGDHQQQDARGLRRAVARE
GCRRARQVLVQHQREKERPGRGCPFGRRHAGQLRRRRPAMARSQGGMRSV
NRSRVSPCSPSRRPSLPRALPSIAPHVFAGARRRATVARRRARSGIVTNA
PPPSRRRECARPARAAAMPPAARKAGTRPRRAVPAGQPRRHPKAIRTPPE
APMKNTDSPENQAASELIDQRIAELGDWRGDTLSRMRRLIHEAAPDVVEE
WKWRGTPVWSRDGIVCTGESYKSVVKLTFAKGASVDDPAGLFNSSLDGNV
RRAIDIREGETLDAGAFKALVRAAIAVNQSSGKAKTRAKRAKPDAP
>BTH_II1470 membrane protein, putative
MLSHLPPRVRGLWLAAALAALLYGLSLGHAPYPGQPAAKAALGALLLAAA
LRHPPTRERVWLGAALAASALGDVLLALASWPPSFIAGLGAFLLAHLAYC
ALFAPWRAAPRGARAVALAALWIAAPALYAAFFPHLAALAAPVAVYVAVL
AVMASLALCAHTPGPQIAAGALVFVASDALIGIDRFLGAFAGVDYFIWFL
YAIAQLTIAFGVLQRKSK
>BTH_II0259 Protein of unknown function (DUF877) superfamily
MNEHAQTRADTRAAAQPAVARDEFAALLQKEFKPKTAEARESVERAVRTL
AQQALEHTAGMTTDAYGSVKQIIAEIDRKLSEQINLILHHQEFQTLEGAW
RGLHYLVTNTETDELLKIKALPASRNELARTLKRYKGVAWDQSPLFRKVY
EEEYGQFGGEPFGCLVGDFYFNHSPPDVEMLGELSKIAAAAHAPFIAGAS
PELMQMDSWQELANPRDLTKIFQNTEYAAWRSLRQSEDSRYVGLAMPRFL
ARLPYGARTNPVDEFDFEEDTDAASHDRYTWANSAYAMAANINRSFKLYG
WCSSIRGVESGGAVEGLPCHTFPTDDGGVDQKCPTEIAISDRREAELAKN
GFMPFVHRKNSDFAAFIGAQSLYQPAEYHDPDATANARLSGRLPYLFACC
RFAHYLKCIVRDKIGSFRERDDMERWLNDWIMNYVDGDPANSSQETKARK
PLAAAQVVVEEIDDNPGYYASKFFLRPHYQLEGLTVSLRLISKLPSAKAA
SE
>BTH_II1371 Protein of unknown function (DUF355) superfamily
MLQLLTVAIDKPETANFILGQTHFIKSVEDIHEALVGAVPGIRFGLAFCE
ASGKRLVRHSGTDGALTELACRNATAIGAGHCFVVFLGDGFYPLNVLNAI
KAVPEVCRIFCATANPTEIVVVQSDQGRGILGVVDGFAPLGVENDEDVRW
RKELLRNIGYKA
>BTH_II1479 Protein of unknown function (DUF1089) superfamily
MREVRWASLEHDGIEHLAFERHARGSVAESVVVGRAGGLAYGLAYRVVCD
ERWRAKHVIVKMMGGGTLELRADGEGRWRNAADAPLAALDGCIDIDIAAT
PYTNTLPIRRLGLARDERRLIDVVYLSIPDLTARRMQQAYRCIEPDRVYR
YESVASGFTARLEVDRDGLVIDYEALFKRLPSDAR
>BTH_II0089 Rhs element Vgr protein, putative
MTNLNDTLRNFASGAVDWNKRPVALHFGAAQGALGHLLALQHASVQEGLM
TGIGGRLTCVSTRRDIPPGVFLGMPVSIRLITDRGQPHMVNAIISDVQIG
QSDGELCVYQLTVCDALSLMDKRTNSRVFRKRSVIDVLATLFNEWQQRSP
ALARAFEFDLSGLRADRYPPRELTRQVNESDAHFVRRLLRREGITVFAKA
GPAKGEQPLQGDATVHTLVCCDEPMSLPQAPAGTVRLHPRDAGTEQRDTV
TLFALRRQLVPGKAGRPSWDYKKARIDESSVASSLDQGEAGNDLAKLLTD
IAIDIPHAGDSWSDHERLTRARMLAHEFEAERYDGVSSVRDLAVGAWITL
TGDPDWDRQLADKRQFVITSIDHDIWNNLPKGLNDRVHALFAASRNLVNA
PGALPAALANDADTRYENTFTCVRRGVPLAPAYDPQADLPPVHLLTGTIV
GAEGEEVFCDENGRVRVRVHGLDPADHAHAQGAGTNDNAGDSAPIRVASS
LAGAHFGASFLPRVGMEVLLGCLGGDPDRLVIIGVLGNGANPPATFSHAG
GLPGNRYLSGIKTKEIKGQRYNQLRLDDTPNQISAQLASEHAHSQLNLGY
LTQPRENGHGNDRGEGAELRTDAAAALRAAQGILLTTYARTQASGGQLDR
DELIRLLGECSELFKALGDYAGQHGGQAADTAGQHAVAAAFRNWTPGAGG
ADAPPDGGDRALMAFGAQAGSVNVTPKTHVTYAGENIDQVAQQHLQLMSG
QRLNATAGQGMQLFARGAGVQAVAGEGPMLLQAQADTLTANAQKGVKITT
NEHEVFVSAPRIRLVAEDGSYLEIGNGITLGTNGDIKLLSASHQWGGPST
AQAAKTAFDNQPTDQRFKLHYPGEDGDLPAAANKPFRITLNDGRVIEGKT
DASGLTDLVKDDAMRIAKIDYLKPKL
>BTH_II0127 lipoprotein, putative
MSYLKRFFRFVFSWQMLACIAVLLVSVAVWFVGPLLAFDELRPLAGVVVR
VAVIVLLVALLAFWLMRWPLSPVGVAALCLLIWHAGPLFAFGDHRPFGPA
WVRVLIVAVILFCYAVYGLYRLWQAVRTNDALLRRILDPSAGKPDAAARA
DIRAVNVAVSKAIGQLKRLRGGAFGWRRLLESGRYLYELPWYMMVGAPGA
GKTAAIARSGLKFPLADQMEASTERARGGTINCDWWFANEAVLIDTAGRY
ARHEVPGDEEATLANEAEWKGFLGLLRKHRPRAPVNGVVLSVSVEDLVGR
TPAERTAHAAALRARLGELHQELGIHFPVYVIVTKLDLLPGFPEYFQSLT
AEGRTQIWGFTLPYDAENRKSAVGALREHCADELKRLEMRIDAGLNNRLL
EEYENDRRKRLYALPQEFRSLSEALTDMLGLVFLDSRYDDAQLQNTLRGV
YFTSAEQTDQVMAADRETILQRLKRQLGRMLGGDAGAQTRGNGAMSGSRG
YFLRDVFQHVIVPEAHLVRPNVMWEVRFRLMRWAGHLLAVALLVWLASSL
TVSFDSNRGYLDAISDKTTALAARVNAYNKAPKPAGVAGVLDGSRDLPQY
GNLDLEAPGASFRYGLYVAPGIVDASDATYRNLLRRSLLPQIVRRVENAL
SAQIDAKNADEVYRTLTIYLMLYDAARHDAKAVKDWVMRDWERSDSAAEM
GGRNRMARHLDALFVDGQPFEPSGRQNAALVQRARLFLNANPAPRRLYER
AMAAIEKEAPENFTLARAVGLQGAGIFRLVDGSRFQRGVPGLYTYEGYHQ
VFSARLPEFLARAQSDDAWVMGSADSAARWGDAIRNTKIVAGRSALADDI
RRQYLTDYGNYWQQYLADIRPVSSGENGGSGTLAFDLATLRALAAPDSPL
VRLARAVVRETSLSVVDAREDASLTDTALSAVGRRSGTAKEVADGAQKLA
ARRPEQRLEKELVDNRFAALREVVTGQADTGSGPAMTDMPISSGGKALQL
DAILTLINEQYTRLVVADNALSSHSMPPALDIGTTLQMEAEKLPAPLRAV
LGGIATQAADKVGREVGSLLAMQVDSSVGKACRAAVDGKYPFARSSQEVD
IEDFNRLFAAGGLFDEFFQKALAAHVDTNSKPWRYKALNPGMPPIRGPSL
EPFERAAAIREVFFREPGAKRMAWKMDAKVASIDPEITEFIVDVDGQSQR
YVHGPVLPFSVNWPGPRGGAIAEITAKPRIRPDTSTITTTGPWALFRLIE
RGRLTGTTSASRLMLDFDFDGRRAALELRTNGQANPLTSGLLTNFRCPGS
LG
>BTH_II1201 2OG-Fe(II) oxygenase superfamily:Prolyl 4-hydroxylase, alpha subunit
MMLHIPGVLTKEQVAQCRDILDAADWADGNATSGAQSALAKRNRQLPEGS
PAARAIGDAIQDALARNALFFSAALPLKVFPPLFNRYAGGDAFGTHVDNA
IRLLRGTDFRVRSDLSATLFLEEPDAYDGGELCVEDTYGVHRAKLPAGDM
VLYPASSLHHVTPVTRGERVASFFWIQSMVRDDADRTLLFQLDTQIQQLT
AEKGGRDASVIALTGIYHNLLRRWADA
>BTH_II0858 conserved hypothetical protein
MSDRRIFMPAAAQARRPTQPAQACRSSRQLRPRRACAARACAATLAALAA
LAGCAAGVTEEQARADVRWDYAPDALRIDIDASPRLNEYLNAPHTLLLAV
FQSADARTFRQLADDPDRLRMTLAAGGPATDFIQTTRYVVEPGARVALSI
DRAQQARYVGIVAGYYDADSPRAARLFDVPLRIDKRGWFSSTYRAAPRTL
GLKLRLGAQSITDAREAPLNLPPPGARAWTTLDGGAKTLTLPAGDANGSE
NGGGGDENDASAHAPRKR
>BTH_II0028 conserved hypothetical protein
MKAFDRDASEPRIGLAYGPGMAEFAARNAHLVDYIEVPFEQLRFSPAVAE
LQQTIPFVLHCASLSVAGFVPPDDSTVDAIERTAVQTGTPWIGEHLAYIS
ADPVGEALGGTGEPTSLSYTLCPQLSDETVRRVVDNLAALRPHFPVPLIV
ENSPQYFPIPGSTMGMTDFIRAITDRCDVGLLLDLSHFLITAHNTGAEVH
RELARLPLERVVEVHLSGMSVQSGTAWDDHSLPASPILFELLERLLDVAR
PRALTFEYNWSPYFPLSVLTTHIERARQLMGLA
>BTH_II0134 ImpA-related N-terminal family
MNATTPAATLRYADLLAPVSQDAPCGPDLEYDPAFVMLQSAIAPKKDAQY
GEFVEAPQPANWAEAERDCRALLLRTKDIRLVVILTRCRIRQSGAQGLRD
GLALLNEMLARYGEALHPVPFFEGERDPVVYANAIATLADPDATLADIRE
IQLPKASGLQLQLRDIEKALAVTRVKDALAPESASRLLKEWWNRRDGTIV
ALAQAQCLMADLIASTRESLGDDAPDLSGIAKLLHPFTQAQLESPYSASA
AQPQGDAKPANGDTAHSRAADALAPAGDAATQTPAAMSIANPQPPMDRRG
ALAAIQATRLWFEQNEPSSPVIVLLRQSERMVGKRFSEIANAIPAELLAQ
WDALDV
>BTH_II1703 uncharacterized domain protein
MSHDDARGILDAEYRPPIRRGDLDGFGQGIIVAIIDGVFDQQLAVTPSEL
RAALARGARIFGASSMGALRAVEVPGVVGVGRIYEMFRDGVVDRDDEVAV
TFDAQSLTALCQPLVNIRHALERLAATGTLARPLAKRILRTAQLMPYFDR
TYPLILARVGLDDHRDAAQLAEMLASHDLKREDAITLLEYLRNVDAEPVG
SAPRMQSAITLPTNARDVPSPDEPLHLWEFGPPLPFRELLEFLAFTGGLR
AVALRAVAALASEDIPEITDASELQALFDRRMAQIGRTWHWLTEEEVTTS
LRGLGIGTDALQASVVSESIDELRAMVLLRNASAPFLQALRNHLFLDELM
LKREAARALSLRWLATRATSCGAAPSAHDRDSARRALCRQLDVRDFKGAV
RQLSAWGITPSRCEDFVTELALARRAWQADSVVPQPNSRRWRWLPASRKA
AGSRRFCMPAATAYTIATRLRNVVGITRVAMITGLGTLGIPNAQAFRPDG
QWSSTVGSGKSESAIGARIGAIMEEVEKWAQERYSQNLDRHVVCVSSYRG
LRRRAESAVDPATLDLPYDSQYSAKLVMPWVRGFDLAAGAPCLLPAAAAS
HMRLPLDIFYSPQGARKTVTTNGLASGMTLAEALTHALCEYVERHARTID
AIVNDNPGAPYAARSPVTDLDRAPASTRRLLRRIERAGYRLVARSIAVDI
AIPTFIATILLPEGHADGTLFGDGWQQASGWAAHPDPETALNMAILEASQ
TIMSHIAGAREDLTLAARSLGRHERTESRRRPALVPEFDGDAPRLPFDAI
RGLVSDDAAADVRWIVARLRDAGLTRIVMIDYSIAEIAPARVVRVIVPGL
ETTNPFHTGMRARIALLSDLLGVQYGKPRVGT
>BTH_II1732 hpaD, 3,4-dihydroxyphenylacetate 2,3-dioxygenase
MGKLSLAAKITHVPSMYLSELPGRHHGCRAEAIRGHQAIGERCRALGVDT
IVVADTHWLVNAGYHVNCNGHFAGVYTSNELPHFIRDMRYEYPGNPALGH
LIAASANERGIGTRAHEIDSLELEYGTLVPMRYMNADQHFKVVSIAGWCM
WHSLDESRRFGEALRHAIEASDANVAFFASGSLSHRFNDNGSPEEAIHMI
SREFYRQVDLRVVELWKQGDFATFCKMLPEYNEHCHGEGGMHDTAMLLGL
LGWDRYDKPVEIVTDYFASSGTGQINAIFPLP
>BTH_II0050 pcaC, 4-carboxymuconolactone decarboxylase
MNDEQRYEAGMNVRRAVLGDAHVDRSLENRTELTEDFQNLITRYAWGEIW
TRDGLPRHTRSLLTIAMMVALNRGEELALHLRAAKNNGVTRDEIKEVLLQ
TAIYCGVPAANSAFHFAQRIFGEEDAAS
>BTH_II2186 prpF, probable AcnD-accessory protein PrpF
MTRRRGSVRPSGERTRITGFARDRSRCPSARSDRHRTSRTMAHVPQIKIP
ATYLRGGTSKGVFFRLQDLPEAARQLGAARDALLSRVIGSPDPYGKQIDG
MGGASSSTSKTVIVAKSARPDHDVDYLFGQVSIDKPFVDWSGNCGNLSAA
VGPFAIAGGLIDPARVPHNGVATVRIWQANIGKTIVAHVPITDGAVQETG
DFELDGVTFPAAEVQLEFMDPAADEEGAGGAMFPTGNVVDDLDVPGVGTL
KATMINAGIPTIFVDAEAIGYTGTELQDAINSDAKALAMFETIRAHGALR
MGLIGSLDEIATRQHTPKVAFVAKPADYVASSGKRVAAADVDLLVRAMSM
GKLHHAMMGTAAVAIGTAAAIPGTLVNLAAGGGARREVRFGHPSGTLRVG
AEAKQDGGEWVVTKAIMSRSARVLMEGWVRVPGDAF