TitleGenColors Logo

Gene list

Applied filters:

COG category: General function prediction only
Gene type: CDS
Genomic element: chromosome

Number of genes found: 689

Free access
Sort by:

 



# Anabaena variabilis ATCC 29413, ATCC 29413

>Ava_4554 TPR repeat
MDWITLLRSLQSDFIKRLSSGCLLHCEIEGQYSELTVISGERLKTLRDFC
WLMAEKYKRVSPVRDVFISYLKGKLGEEVVKERLADLITEVDYEKRLGGD
GKIDFTLTANPAIGIEVKSRHGNIDRVRWSVSAEEVEKNAVVVCIFIKED
VNEAQSSYHLLLAGFLPTQMIKLKTGNISFGIEQLLYGGGLWGYLEQLQA
SSNYHQFQQSPPIYEYQPQPEFSTKINQSQSIKPALFTGIKNILSYRREE
DINIDYIKLGDECFAQGEYTASIKNYSQALQASSNNGELYYKRGLTYYQL
GDYEAAIADYSQAINLNFHDAKSYHKRGLALSQLAAYEAAIDDYNQAIRI
NPHAASIYKNRAEARSHLGDNQGAIEDYTQAIKINPQYADTYKNRGISRY
LLATQPGFTQAIKINPNDANAYKNRGNARADIGDYAGAIEDYNQAIQINP
KAADAYYNRGNARYDLGDEEGAIADYTQAIQINPSYADAYYNRGNVRAGI
KDKQGAIADFQKAADIYRKEGKLAELKDATERIVELEIEESIDILNF
>Ava_2147 WD-40 repeat
MDWISLLKAQQADFLLRVKKPKTYDLSLLESQVKGCHSEIMAFWGEPFAK
IQEVSRRQAEILAKNPPPIPPEYPEPPDWTIPFPIHFQQQAEDYFLREQI
VDRIIVERLGKLLKKLPQDTVKNMVLDDEGNLRGESKFTYLLADNPKISV
QVYVADGESFNGIKKDKIKWSVTQEDLKNHQVLIFLCLFYPSTGKLGYEK
QAVVAGFLPTSHIALTEPKLYFTPSHLLYAGGLSWYLESLTAKKEAKLDF
LPVVIERTIPETIQTVPSEHPRKEIIGDWECWQTLRGHTRGINCLSFSSG
CENGLPMLASGSRGETKLWDLSQGELIDTLSEYPWIVSGLVDEVNSLAFS
ADGQMLVSGGADSTIKIWHTGALDLIDILHKHNGIVRCAAFTPDGQMLAT
GGDDRRILFWDLMHRQVKAILSLDDTAAHSLVLSRDGQTLVTGSYRKIKV
WQTSGSWFGKNLKDAQPLHTLMGHGHIVRSLAMSKDGQLLISGSWDQTIK
IWHLATGRLIRTLKGHTDKVYAIALSPDEQIIASGSSDQTIKLWHLETGE
LLATFTGHTDIVTALTFTTSGEMLVSGSLDKTIKLWQRS
>Ava_0770 conserved hypothetical protein
MTLCDAGVLLCLVDRTQPQHNACKIAVIRLAKPLITTWSCLTEAMYLALH
RGGWQMQKQLGQLLLDKLLTVYEIQESDYSRLLALMEQYRDRPMDLADAT
LVLTAEKTGNRQILTLDSDFLFYRIGHQDTFEIISL
>Ava_4906 Short-chain dehydrogenase/reductase SDR
MKLLEGKVSLVTGATRGLGKGIAIGLGEAGAIVYITGRRLDSNSTSSNDV
SGSLNDTKRAVEEAGGICIPVQVDHSNDEQVRLLFERIQNEQNGQLDLLV
NNAYSGVQALTNAQGKPFWENAPNLWDACNNVGLRSHYIASVYAAQMMSK
RQQGLICTISSWGGMAYLFGAAYGTGKAGCDRLAADMAVELQPYNITSLS
IWPGIVGTELFSRLASEMSDNHTNDGKISALAERYNWETPLLTGRVIAKL
ASQTNVINRTGRVQVVAELAKQYRLVDQEGNLPVSLRSLRFLIPLVLPAL
RKHSWLIPDIKIPWSILLLSVLKSPKT
>Ava_5054 Serine/Threonine protein kinase
MQICQNPNCSNPFNPDGNRFCMSCGQSNFGKLLRNRYRVLGLLGEGGFSK
TYAAEDADRLDAPCVIKQFFPQIQGTGQRSKAAEFFKEEAFRLYELGENH
TQIPRLLAYFEQGSSLYLVQEFIKGLTLLQEVQQEPFNEEKIRQLLIDLL
PVLDFIHFHQVIHRDIKPENIIRRDGDGKLVLIDFGGAKQVTQTSIARQA
TAIYTIGYAPTEQMAGFACHASDLYALGVTCVRLLTQCLPQQNPYGHIDD
GLYDPMNGKWLWQEYLQDRGIKISENLSQILDKLLKHLPSERYQSAAEVL
HDLQVSTAIVEETQILPNTQTTLVPLAPATQRTKRPLPPLQTFEYEVVTV
DTAGRIVNRDRTNTQILVEQLNKDITLEMVSIPGGAYLMGSPNFEGDADE
RPQHQVAIAPFFMGKYPVTQAQWRAVAGLPKIKQALNPYPSKYKGQNRPV
ENVSWHEVLEFCARLSEKTGREYRLPSEAEWEYACRAGTTTSFHFGETIT
PDLANFSDGDIHNLEAKTRYRKETTDVGNFRVANAFGLYDMHGLVWEWCA
DPWHNNYNGAPTDGSVWEAGGDIYRRVLRGGSWNFAAELCRSASRSWNES
DGGLRICGFRVVFSPG
>Ava_4333 HAD-superfamily hydrolase, subfamily IA, variant 1
MPYSAILFDLDGTLTDPKLGITRCIQYALSELGYKPPDADELLWCIGPPI
KESFSRLLETSDNGLIDQAIALYRRRFSTIGLFENSLYPQIIDILQKIRF
AGYQTFVATSKPHIYAKQIIEHFDLSLLFDAVYGSELDGTRTVKGELIQH
ILITENLTPSTVVMVGDRQHDIIGAKLHNLTAIGVTYGYGTEEELKTHGA
DLIAHSPEEIKKLLVHSS
>Ava_4594 hypothetical protein
MNGFQNSETSGIVPQQPETIISLDDFKSLFYQLNAKPDTEIRLLPGKKTV
ELADISNINEQIQAKLRNHDVAASIGSINFILSNKKIKDYSTWAEFEREK
WNTINERIKTLTIHWDIAIKLPQYSLPQRHSIRLRIGSNIPPKDIFQLIF
TSDNIQEIMEAKTPSVCKIDFINNIIAIELLNIVSNWYDGLRDSPESSTL
EKILKKRGRIFSEIIRYASPIVVLMIVCQYSNYLLPILGIGEEISIENLQ
ACFIFLAAFFMVGLFFGFKLEAFIDKRIDEFEEFPSFSITKGDEKAIENF
EKSNKKLTAQIVNRILWILFSVIVSSSLKFILNHIIS
>Ava_2160 conserved hypothetical protein
MGSENLSQDTLAEHLLELAIKSGAEAAEVYQSRSLSRPVFFEANRLKQLE
TSQSEGTALRL
>Ava_0954 Zinc-containing alcohol dehydrogenase superfamily
MTTQGNPEVLQLQEVPNPSLPVGSTELLVRLVAAGINPIDTKLRKRGTFY
PEKMPAILGCDGAGIVEAVGAGVQSFGAGDAVYFCNGGLGGHQGNYAEYT
VVDERFVARKPASLTFAEAAAAPLVLITAWEALYERGRLEPGEKVLIHAG
AGGVGHVAIQLAKLKGATVATTVGSEDKAHFVKQLGADHVIFYKNTDFAQ
AALNWTNGEGVDLAFDTVGGDTFHKTFPAVRIYGDIVTILEPDANTVWKV
ARNRNLRIGLELMLTPMFLNLVEAQQHQAEILTECADWIDQGKLKIHVSH
KFPLQDAAKAHQLLESGSIAGKIVLLISDE
>Ava_1303 Rieske (2Fe-2S) region
MQPEFNFFQHWYPVSPIEDLDPKRPTPVTLLGIRLVIWKPKSSDSFRIFI
DQCPHRLAPLSEGRIDDKTGNLMCSYHGWQFDTEGVCTRIPQAENPEIVT
KNQKDFCVNSLPVRQENDLLWVWPDVNSQELAHQTPLPLSPQVDASKGFV
WTSFVRDLEYDWQTLVENVADPSHVPFAHHGVQGNREQGRARPISIMQST
PGLIEAVADGLVTAKITFEPPCRLEYAISVGNDGKQLGLVTYCLPVSPGK
SRIVAQFPRNFAKTLHRLTPRWWDHIKTRNLVLDGDMILLNQQEHLIQQR
QLSASWKTAYKMPTSADRLVIEFRKWFDKYCDGGLPWKEVGIPIPEAAAI
NDNRDVLLDRYKQHTRHCSSCRNTLKNIERLQLILLGYFALVISGVAVLP
DSLRVRLGLPLIITAILGLGLYSWLKFWLVPKFYFVDYVHAEK
>Ava_3243 Plasmid maintenance system antidote protein
MNNNRLPNIHPGEILKLDFLEPLNITAYRLSKDIGVTQTRISEILSGKRS
ITADTALRLSHYFGNTAQFWLNLQTQYDLRQALEENSEVYNQISKFLSDD
VA
>Ava_3514 GCN5-related N-acetyltransferase
MKIRLFHIQDAEQIALLFHQTVREVNIRDYSNNQVIAWAPDNIHFRDWAK
ICSERFTYVADDQGVIAGFGELELNGHIDCFYCHKNYLGMGVGRKIYSAI
ETKADELGISRLYTEASITAKSFFLRMGFSIIREQQVERRREYFINFAME
KFLISS
>Ava_2484 conserved hypothetical protein
MLTVNNSHRALEVTLQIPVRTYEIDFAGIVSNIVYIKWLEDLRLKFLEEH
FPIHQQIEQGYVPILTGTEIEYKRPIKLIDQVIGRLWASNLGRLKWTVQA
EILANNQLAAVAQQKGAFVNLQNGRPVRIPDELQQKYLNITKLNNE
>Ava_0653 Abortive infection protein
MQNEKPSNKQRLQQKTKLRLGMFVVLLLLSAIAVLLFYPRTSTVEHTPSS
NYDIHKQQDFNQSQFYPPNQTVNPDLYQPVDKWVGRLVLPNQQQIQSGSD
WVWMEIQHASPEAQNLVGKVVRLEWLNKPESQSYVQAVQQDISFTDETRE
SQAKGIIHPFRLNGRLQVGPLQSLAGARPKDDVVVTLDDGELVKGSDNQL
RLQIAHEPVMATGRFYGLVKILNPEPASSQYPASPFCPGTSSCPSDFFRV
RHYNRVSGDFNGVEETVRIPQQVIDTRNIPPSTPREIEKSPAGQAGWYIY
GAKNTQGIFVVQGIVPRSLLKIQPQHVVLGQQPGLTYIKNQNWQIAPQDK
GTIRTVLLDPSTNQDQLAASNWQEGDRAIVLHNFGGIGGKKSEGSTAYTI
TGHFAFGLAQVVRDPITKELQFAIEYQQIYANNSDGIIAGRHSWADYMGN
LQWGWAATRPISDVLIKFAPVTQDYNFDGIKLSPVTEFQRQLQIMMARYR
TGDGTGSATVTPATSCVQDASQALYIAIQVITQQVASNSAIQQWLSNHPD
DPQTERFGQLVRLSSDLQRQLTPLGIVRADWKSHADYLTGIGDGKETFRD
GVPPSVADRSPWAALTSWRTMVPRQAHDEIAALFFRHGAKLWFLRTNQIG
GENPDIAPVAPTLGLGEITIPFTNIAPIPILLNRLLASLVVPEIRDWIVV
GITVLIYSAIALPLGWRSGFLSWCFTSTNPLHQLAIALRVIITPAIVEEL
VFRVLLLPHPLEVINWYGWTLWAGLILLLFILYHPLNAKTLYQVGFPTFF
HPIFLTLTGLLGLCCTIAYALTGSLWAIVLIHWIVVLVWLLALGGKDKLL
ISH
>Ava_3336 Zinc-containing alcohol dehydrogenase superfamily
MKGLWLENNQLQLLTDIPIPEPAPEEALVRVLRAGICNTDLELLRGYYPY
TGILGHEFVGVVEQGPENLLNQRVVGEINAVCGYCRFCRRGQPTHCENRT
VLGIVNRHGAFAEYLCLPVKNLHPVPENVATEVATFTEPLAAALEIQQQV
ILSKDDRVLVVGDGKLGQLVAQTLALTGCELLVVGRHQEKLANLKVRGIQ
TGLVDAVQDRAFDIVVECTGNPEGFAIARRALRPRGTLVLKSTYAGNLSL
DASSLVVDEITLIGSRCGSFPPALELLATGKVDVEPLIHAHYPLSQGLLA
FEEAQRRGVLKVLLEIGN
>Ava_0294 Light-dependent protochlorophyllide reductase
MAQDRKSTVIVTGASSGVGLYAAKALAKRGWHVVMACRNLEKAEQAAQEV
GIPKDSYTIIHIDLGSLDSVRQFVNDFRATGKSLDALLCNAAIYMPLIKE
PLRSPEGYELTMTTNHLGHFLLCKLMLEDLQKSSAADKRLVILGTVTHNP
DELGGKIPPRPDLGNLEGFAQGFKPPISMIDGKKFEPVKAYKDSKVCNVL
TMRELHRRYHESTGITFTSLYPGCVAETPLFRNHYPLFQKIFPLFQKYIT
GGYVSQELAGERVADVIAAPEYKQSGAYWSWGNRQKKDGKSFVQKVSPQA
RDDEKAERLWDLSEKLVGLESQKPVALNS
>Ava_2856 TPR repeat
MAQEGGSVPKHVSLISLLVVCGLWSMPQAAHAQALIPHTLQLDPAKLEKQ
GLSLAQEAAQLGQFQQYELALARAKLASQLAPGNDKVWFLLGGLQLQTKN
FDGAIASLNRSKTINPKNADVLFALGSANFQQKKYQVAIEHYQAGLALKP
NEADGLFDLGNAYYMIGRLPDAIAQYNKAVAQDKKFWPAINNIGLISYEQ
GNVDEAIKRWQSAVAIDKQAAEPLLALAVALYTKGDRQQGISLGEAALRI
DPRYASIDFLKENLWGDRLLSDTKQFLELPRIQAALGQRESAPTPRQ
>Ava_3308 Serine/Threonine protein kinase
MTSILLNNRYQVIQVLGAGGFGETLLAEDTHMPSRRRCVIKQLKPVSNDP
QAYQSIQQRFEREAATLEFLGEHNNQIPRLYAYFSENGQFYLVQEWIHGQ
TLRQLLVSQGVQSEGIVKTILLSLLSVLDYVHSKGIIHRDIKPDNIILRD
VDQKPVLIDFGAVKETIRSVVSSPGYATRSLVIGTPGYMPSEQAVGRPVY
ATDIYSLGLTAIYLLTGKSPEELPTHPQTGEILWQNFAPHVSQKLVSVLN
QAIKPHAGDRYSTASKMLYALSSGWTAQPSIPPSTIPTQQTQALSISAIK
PQGEPIRWTEPNRQKSLFIFGGLIVGGLMAGVAMSTLTRQPQPQTPVVSN
PLPSPVSQIPTTAPNTQPVSPDSSPSSILPDSTEQPVNLDTPSQTETRQE
NPDPPAIFTPAPENTPITADTPTPQPSTTPEPQNQPVSAPPTPADSLPQR
KANTTRPSVPAFPTGTARSSVEAKLGKPNRDLRGVWGNTRAVVYRLVPNQ
VDLGYLFDRNSGVLRQTEVSFAQSVDPEVMQATLNGMLGGQSTAKIQQGL
QKIQQRQSDNFSFTAGGVKGQIVRQNCDLIYISVWESDLHDFVNPASAKG
C
>Ava_0012 Serine/Threonine protein kinase
MELDAFNKGAIPLINHPDCSALGYQVIRELGRNQEEGRITYLAHHHSKQQ
VVIKEFSFAHTYADWFGVTAYESEAQILQKLNHPRIPRYLDSFATQTAFY
LVQEYKHALPLSSKRSFQAEEIKQIAVSILEILVYLQQRIDPIIHRDIKP
ENILVDDQLNAYLVDFGLARVQDTKIALTSLLTGTPGFIPPEEQSGYSLS
LASDLYSVGATLVCLLANIRSVDIDKLINSKGYFDFQKLDSPIDLRFRSW
LMGMVEPKWQYRYANAADALAALQPIQVTGNATAMEILATVIKLKYQTTV
LCLAIIGVLAVAGATLILSQQGGTAQQLQEARKSEVL
>Ava_2990 Serine/Threonine protein kinase
MPWIAGQQLQGGKYVIEKVLGQGGFGITYKALQVELNRPVVIKIPNEFLS
HDPEYEKYVERFIKEGRILARLSQEPHPHIVGVIDLFQEGNTHCLVMEFV
EGENLFEAIKHRGALPESEIVRCISQIGEALVKVHQAGLVHRDAHPGNIM
LRKNGKAVLIDFGIAKELLPQTLSSTGNIGNRGFAPYEQMTRGSREPTVD
VYCLAATLYYAVTGQPPTNSLARKLENVPLKPPKQITPNISNQLNKAILK
GMALEAQDRPQSMQAWLAILEAPKPTPPYSVVPVHKKEAVSPKPKIKLQA
SPRKAVTKSHSIIPWGCLIGILFSSLFIGYLFAMSNAPFWAWAVTVAGTL
TWAVAVSGVVVGGAVAVAVVGAATVALVGAATVAGAGIWAWFWALAVAAF
WGGAGEKLQNYFSQFHTFLILVTTSFFGLGLGLLVHRVFNVL
>Ava_4813 WD-40 repeat
MTTSPEPQPANHSTLFSYQVGGSLPIDNPAYVERQADRELYERLKAGEYC
FVFNSRQMGKSSLRVRAMQKLQQDGVVCAVIDPQTRGTTLREDQWYAGTI
KRLIGDLHLQDKIDFPGWWKQLDGQSISVVERFYEFIDQILLPQTTQNIT
IFVEEVDNLLSLKFDTDGFFILIRSLYERRAEKPDYTRLTFAFLGVATPS
DLIRSRRSSSFNIGHAVEMSGFQLAEAEPLQQGLIGKVSDPQAVMQSVLT
WTGGQPFLTQKVLNLVVQQVNSSLSPQELVEQVVCAKVIDNWETQDVPPH
LKTIRDRILQSDEQGRGRLLGLYQQVLDQGGIAADESYEQMQLRLTGLVV
KRDGKLNVYNPIYAAVFNQQWVDRALADLRPSFYAEAMKAWQESDEQKQS
FLLRGQALEDAEAWAKGKRLSDEDDRFLRESREVEKQDIDRKLEAERLAR
ETAEQANQILLEAEQTAKGKVRVGSIILAGTLVVAGVASVWAGISVNDAN
VKVANTQKEAKELTDKANQRVKEANIQVANTKEEAKNRTKKADDDVRLAL
ENLGKITKQAKIDKETAAKSLIEAQVKQKEANRKVEQAKKDLAAAKAELK
NVDRQSQEKVAAAQAKVTDAEAKVAQAIQLREKAEKEAREAQDNTRLALQ
GNALEQSGIVAMQQFDSEELNSLIKSIRSARNLENLVKNGTSLENYPAFS
PLFALQKILDKIFEVNQLTGHQGWVRGIRFSPNGRLIVTSGSDGTVRIWD
YLGKQQIEFKAHWGSILSVNFSPDSKLIATASDDGMVRIWNLLGEMLSEY
KHQNVIRDVAFSPDSKFIVTGGEDGDINLWSLQEKQKIKNWMAEQGAIYS
LSISSDGQYIATAGKDRIAKLWNLVGQKLSEFKSPNGSFRSISFSPDGRL
LATAGDDSKARLWKLSGEQLAEFKGHVGWVRDVSFSPDGKLLATAGDDGK
VRLWHLSGKQLIEFKGHQGGVLSVRFSPNKKLLATTGTDSNAKVWSLAGK
QLNPDVLYSTRPFLKNIEGESSNECFSLYAPGLYPERVSSSLFEDNQVGR
ISFENRTSRIIKVKLYHPDVPGIAFGEYPVEAKSIWTFPPEGEDYGFGSD
WGIQADNSKICVLGRVSNWNLSSNSKIPHIFKTSSDRFPLGDSFENQNIV
NSVSFNPSGQILATAELNGMVRFWDLSKKELSRWKANNYGGIVNINFSSD
GKYVATAIPGKVIIWDLSGKPIVQYNNTYHVSFSYDNRYLATVLQASNTE
LSKVQIWNWQNPSQKQPIKEWPLSPSKDRDVGVYSMNFSQDGQKILIAGT
LQVNRYSIVDSPVQLRDISGKMLAEFKGHRGGVFSANFSPDQKQVLTGGM
DGTVRLWDLSGVQQSQWKAHKGWVRSVIFIDNQRIATVGDDGLVKLWSRS
GQQLAEFAGHQGKISSIAFRAVDQTIVTSGYDGTVRTWHIDNLSELLKKG
CDWLHDYLSTNPNVTESDRQMCGIVKH
>Ava_0666 TPR repeat
MSQPRNRWIVQLILALAVLAFVGVSVVPIIGALNDNTSPSNQNSASNQGS
SLASNQQSKLADEVRGYELVLQREPENQTALKGLLQARLQLLALKQGNVQ
GVIEPLEKLAKLNPNQSEYGVLLAQAKQQIGDKEGAATAYRSILDTKPGD
LKALQGMVVLLLDQQRPEAAVGLLQDTLTNANQANTIQPGSVDVVAVQVL
LGNVHAAQKRYPQAISAFDQAIQKDTKDFRPVLAKAMLLKQQGKASEAKP
LFDSALALAPAQYKDEINKAATASPTLTPTAPPVPTPESTPKQ
>Ava_4988 TPR repeat
MLKNFPSPHSIIFSQVNKLAQASFGVAIALTLASNPSWAGDPFRAEKPRN
IGDNTEQAFKAVFQRGDYPTAERHLNQAISSEANEPLAYAMKASLAYING
DLAGLSNYGKKTLETGQKLVATDELRGNIYTAVGHFLEGGAIISREGTLR
GAPQALGRLRQVYEHLDKAEAISPQDPELNLLKGYMDLMLSVSLPFANPD
EAINRLQRNAAPQYLTNRGIAIAYRDLKQYPQALDYANRAIKEASDNPEL
YYLKAQILQGQGSQEKSQQLIREAIANFDKALTKRSQLPISLVKQIESER
NNAVNNLNSARR
>Ava_2801 Short-chain dehydrogenase/reductase SDR
MTEQKRIAVITGSNRGLGYAISRKLAQIGLHVILTSRNEADGLAAKQQLS
AEGLDADYCVLDVTNDVSVQRFTKWLRETYSKVDILVNNAGINPTTKPEE
SSLLTVQLETMRVTWETNVLAVVRITQALIPLMQVENYGRIVNISTEMAS
LSSISDDYYPLAPSYRLSKVGVNGITAILAKELQGTNILVNAYSPGWMKT
DMGGDNAPFTAEEGAETAVYLATLPDGGVQGQFFAEMRKFGGPVQLQW
>Ava_0039 Peptidase C14, caspase catalytic subunit p20
MPPIGLSTSRTTHTKQTNIPKLWLILVGVNQYQDEHLPNLNYSAIDCQGL
SEALTEAASQFTQKIVNIYHDFAPQSPSLANVRHRLQDITSTVSPIDTIL
FYFSGHGVVDPKTQEVFLCLADTQKSNLQNTGLALQEILQLLGNSGVQNQ
LVWLDACHSGGMSLRGVTLQLVELLQQSAAKSKGFYALLSCDNNQQSWEF
PELGHGVFTYYLMRGLQGEAVDSQGLIYLDGLYRYVYHQTLQYIDKTNQQ
LRLINQQKRGKGDTQLYNEYPLQTPKRIVEGVGELILGKRLKKIASGVHH
GLGMVVEGLSNSKITLDISKLLGSTGDFAVEYLPATKTSAEDIKAAIARH
WQQPQAELTLLYLRGRIEESEAGEWLRLGEDICIKRSWLKQQLRQCTSQQ
VIILDCPLGTASLGDWIEDLQIESHHGQCLIACASPPEAPENFAQKFLDT
LMISAQGNGLSAAGAIAQLQLSLADSKTPLYVWLSGTQGIIEILTNNTHK
SQQTSGLDLGVCPYMGLNAFAEADAAYFYGRETVTQQLIHHLRDNSFLAV
IGASGSGKSSVVQAGLIPQLRQGKHIPNSEQWGIKTIRPGVNPVEALARK
LGEWRETHLVIEGILHQGVESFVYWLRSLPHRVTVLVIDQFEELFTLAPS
PDRELFLELLLGAVQYAGDRFKLIITLRADFIAPCLEVPALAEALQVASV
LVPPKLSLDDYRRVILNPAQQVGLQVEGELVEVLLRELNQSVGDLPLLEF
VLEQLWQQRTAGKLTLQSYQEQLGGIKGALERSCQGVYESLPPQLQECAK
WIFLSLTQLGEGTEDTRRRIYKSDLIVKKYPVGLVEQTLNALTTAKLVVI
NLETDIEAQGKSSSPASPASPASPTPFVTVEVAHEILIRHWSTLRWWLEE
NRDRLRKQRQINHACQLWQQSGKQADFLLQGARLAEAEDIYIHWTDELGA
DVQEFIGACLAERKHQQLQAKNRLKQAQRAVVALSVLGIASVSFGGLAYW
QGREAQFREIAALNSSSQANLLSHQQLAALIASLKAAQQVNNVIAVPNNL
KLATVTTLQQALLGMQERNRLEGHKDGVISISISGDGQTIASGGLDKTIK
LWSRDGRLFRTLNGHEDAVYSVSFSPDGQTIASGGSDKTIKLWQTSDGTL
LKTITGHEQTVNNVNFSPDGKTLASASSDHSIKLWDSTSGQLLMTLNGHS
AGVISVRFSPDGQTIASASEDKTVKLWHRQDGKLLKTLNGHQDWVNSLSF
SPDGKTLASASADKTIKLWRIADGKLVKTLKGHNDSVWDVNFSQDGKAIA
SASRDNTIKLWNRHGIELETFTGHSGGVYAVNFLPDGKTLASASLDNTIR
LWQRPLISPLEVLAGNSGVYALSFSPDGSIIATAGADGKIQLWHSQDGSL
LKTLPGNKAIYGISFTPQGDLIASANADKTVKIWRVRDGQLLKTLIGHDN
EVNKVNFSPDGKAIASASRDNTIKLWNVSDGKLKQILKGHTEEVFWVSFS
PDGKIIASASADKTIRLWDSVSGNLIKSLPAHNDLVYSVNFSPDGSMLAS
TSADKTVKLWRSQDGHLLHTFSGHSDVVYSSSFSPDGRYIASASEDKTVK
IWQLDGHLLTTLPQHQAGVMSAIFSPDGKTLISGSLDTTTKIWRFDSQQA
QTSQINTLVMSACNWLQDYLNTNPHVTTNEQKLCPS
>Ava_4367 Phenazine biosynthesis PhzC/PhzF protein
MGQIITQVDAFTDKPFGGNPAAVCVVPSPQPDIWMQNIAQEMNLSETAFL
VKQDDGFNLRWFTPTVEVPLCGHATLASAHVLWSEGHLSPDEIARFHTKS
GLLIAKRQGDWIELDFPVNHSQPITTPPELTEALGVSLKSVFQNSLGYLV
EVESEDVVRNLQPNFQLLKTLAVADVIVTSQTQPDSPYDFVSRFFAPGLG
INEDPVTGAAHCCLASYWRDRLGKDEFLAYQASSRGGVVKVSYGGGDRVF
LAGQAVTVLRGELI
>Ava_0558 Protein of unknown function UPF0118
MRRSSSLQSLLIYGLSGPVIALNVWLLSVLFRYFQSPFTILSLAAILAFL
LNYPVKFFERARITRTQAVVIVLLVTLTLFGILAVTLVPMLIDQTVQLLN
KIPDWLTASQANLEHFERFAKQRRLPLDLRVVSNQINASIQSVVQQLASG
AVGLAGTLLSGLLNFILVVVLAFYMLIYGDRVWYGLMNLLPSKIRLPLTK
SLQLNFQNFFLSQLLLGLFMIVALTPIFLILKVPFALLFAILIGISELIP
FIGATLGIGLVTILVLLQNWWLAVQVAIAAIIMQQIKDNLLGPKLLGNFI
GLNPIWIFVAILMGYEIAGLLGTLVAVPIAGTIKGTFDTLKGGKSDDFMS
TVTIDHDSPNNE
>Ava_3152 conserved hypothetical protein
MRVVLDVNVWVSGLLWRGVPGKIFDLAAANKITIYTSEPILADVEEILVR
KKFQARINTLNTSVKELLSIIKHRSVEY
>Ava_2381 Peptidase C14, caspase catalytic subunit p20
MVFMKRRTFVQRIGSILAVLGVAETEWLTMNNHYYQALAQPTPRKLALLI
GINQYRKSSSLSGCLTDVELQKELLVHRFGFQATDILTLTEEQASREFIE
AAFLDHLVKQAKPGDVVVFHFSGYGTQLPVESDTLQNALVTTDENQEAQD
SQIANYLLEDTLLLLLRSLPTDRVIAVLDTSYTVPAINQPAGLKIRARQE
SPGTRLTAAELDFRRQIKPQNPEFSPAVILSATSDPQQSAREILMSGFNA
GLFTYALTRQLWQSTPATTIRVSLSHAASSIHQLGSKQQPKLLTKKKNQL
GVVTVENLLLDSHAGAEGVILSIEEDGKTVQVWLGGLPAQVLENYGANSK
FTLTTGEQLTLRSLMGLTAKAQVSKFAETLPLQVGQLIQETVRVLPRNLN
LILALDTKLERIERVDATSAFASVGHISTVVAGEQPADYLFGKLPQTPSR
YGLFTLDGELIPNTDGGVGEAVKVAAQRLSSKLSTLLAAKLWRLTENQGS
SRLPVKVNLEVVNSISSQAVLQRETTRTITTETATKKSLPTPNPPLPIPI
IPIGSRIQYRIQNLSDRPIYFMLLGLKNNRTAIAFYPWQTPQEPNNPENQ
PQLKQVVIAPNETVTLPQTTPTSGWVTSGPAYECEHQLIFSTAPFNATLE
ILDNAKYSTADQQPIATLINPLEVGQALLKDLHNSSAVKTETTTTATDSY
VLDINNWASLNFSFQIA
>Ava_2528 Serine/Threonine protein kinase
MSYCLNPRCPKPENPHDVKFCLSCGSKLHLKERYRAIKPIGQGGFGRTFL
AVDEDKPSKPRCVIKQFYPQAQGTNTVQKAVELFTQEAIQLDELGKHPQI
PELLAYFTQDDRQYLVQEFIDGLNLAQELAHKGVFSETRTIQLLNDLLPV
LQFCHNRQVIHRDIKPENIILRNSDNKLVVVDFGASKSATNTALNQTGTS
IGSPEYVAPEQMRGRAIFASDIYSLGVTCINLLTGRSPFDSYDTNNDTWV
WQQYVKPQVSNYLSQIINKMTASVPARRYQTVEEVLKDLNQHSPVATTPA
KPSHQVPQNPPPQPVSKTQSQIDLELEELKTQFLPGGKSKPQNIQPQPAT
NNNSAKSAIDLELEELKAKYLGNNNG
>Ava_1723 conserved hypothetical protein
MSRNPDAPSSSSNLRTIAQTFRMTGWISFWIQLVLGVISGIIVLLFGIFS
QRAGSPNNNPGTGFGVFLAICGLVVLGGGIYLAFRYTRIGNQLLSSNPSN
RPRKVETVQVLRLGLWINLGGTLVTLLGAQAIVGTLVARSISPQAVTTQL
FDPTRIISGLDMLVVQANINTVSAHFAGLVSSLWLLNRINKS
>Ava_1384 Polysaccharide biosynthesis protein
MLSKLRLSNLSKLKSHSRLRAIIANTGWLFADRILRMGASLVVGVWIARY
LGAQQYGLFNYALAFVTLFTPVLTLGLDEIVVRHVVRESSNKEEILGTTF
WLKFLGGIASVLLAVGITLFLGEREFLKISLVAILAIAGIFRAADTIELW
FQSQVQSKYAVIAKNIAFLLNTLIKVALILTKAPLLAFAWVTLAEFAMNA
VGLAIVYQSKGFSFWSWRWNFQVAKTLLKESLPLIFSGFAIMIFMKIDQI
MLGQMIGDKEVGVYSAAVRISEVWYFIPGAIVPSVAPSIYAAKDQSDGVY
YQRLGQLFRLLTCIALAIAVPMTFLSDKIIMVLFGNGYAGAGAILAVHIW
TSIFVFLGFASSPWFIAEGLNHVTLGKTVFGAILNIILNFLLIPQYLGLG
AAIATIISQAAAAFLCNAFDRRTQKIFKIQLQSLLLFYKY
>Ava_4012 Creatininase
MHSFIPPERFFAYLTWTEIQEMPNKENVVIIQPVGAIEQHGPHLPLIVDA
AIGVGVLGKALSKLDASIPAYALPTLYYGKSNEHWHFPGTITLSTETLTA
IIMEVGESIYRAGFRKLVLMNSHGGQPQVMQMVARDLHVKYGDFAVFPLF
TWRVPHITKELLTPKEATQGMHAGDAETSIMLAILPDQVQLDKAVAEYPP
EQPEGSLLSWEGKLPVAWVTKDISKSGVIGDATTATREKGDRILESVSDG
WVQVIKEIYAFR
>Ava_4607 Hydrogenase expression/synthesis, HypA
MHELGITQNIVAIVNEYAHGAKVRRVLLEIGKLSAIMPDAIKFCFDICSQ
GTVLEGAVLEILEIPGLAKCRQCGAEIALEKPFGICNCGSVHLDLITGEE
LKIKEIEVEEVCV
>Ava_3094 Virulence-associated E
MSTTFRVTQQFSSANNFLKLVGEPPYLFQTFDDSKHKDRSLTRQFYGSLD
EHWDKLVEFNNRGAGIFVTVNMTSGNKRKAENITAVRALFIDCDNELPNE
FHLTPTVVVNSSNNKGHAYWLLDKASNDVANFTQHQQQLIKHYGSDPAVK
DLPRIMRLPGFNHMKGEPTLVTFIQTGKPYESMASITDGLVSNDFIQQLE
SFANKVKNAPEGTRNDTLNTAKYTLAGAYPNKLPEIDQRLTEVALENGLT
IGEIKPTLNSGDVGAQRPIKLVGSGKRSKSNMVREVLEQFFGDELQWDEM
KNKLRFRGQNVTAEKLHDICERELDIDLPFESFKRMASVKASDNPYHAVR
EYLQSLPTVDNPEPVLSALYQAMGITNKLHRLYIRRWLIAAVGRAMTPGC
KADCALVLQGKQGIGKTTFFNSLFGEFFQTLGEHKSDVDQLLSMARSWCI
EWGEIENAFSRKAVSAIKSFMSTTHDVYRRPYAAEPDNYPRHFIICGTTN
QSEFLTDSTGNRRFWVVNAENRIDTAAVKEMRDDVWSAVLKLYLDGEPSF
LNETEVVESAEDTSQYEQTHPWFAEVSAYMTHHNPCTLADIMKNALGFET
SRLNDKKAQREVSDILDKLGCTKKQQRLNGVKAIYWHKPTLDNVTTDDVR
VTTKDIPTSEYF
>Ava_1708 Phosphoesterase, RecJ-like
MDLILCHTTADFDTLGAAVGLTCLLPGSKIVLTGGAHPPVRDFLALHRDE
YPLIERRSVNPEKIRSLTIVDAQQRDRLGKPAEWLDLPQVQKITVYDHHT
GQTSDIPATQLHIASVGATTTLIAEELQRQQITLNPAQATVMALGIHVDT
GSLTYDQSTPRDALALAWLMVQGASLSVIANYRDPGLSPQLQRLLTEALE
KLEYLCLRGYTIAWVTLTTDGYVPGLSSLASQLIELTEIDALLLANEHPS
NKDDSRFTVIGRSQIPGLHLDQLFQPLGGGGHSQAASLNLRGVDTQDILQ
QLLKGIKAAIPHPPTARDLMSSPVRTIRPETTIAEAQRILLRYGHSGLSV
VNPQGQLVGIISRRDLDIALHHGFSHAPVKGYMTTDLKTITPDTTLPQIE
SLMVTYDIGRLPVLANEQLVGIVTRTDVLRELHQNIAVGGSGNLSQFRDM
GTELKTSLSHELRSRLTPQLWQLLTTASQAAEERGWHLYLVGGAVRDLLL
AEAAAGTLMITDIDLVVDGFHKSADVGAGVELAKALQEIYPAARLEIHGA
FQTAALLWHKDPELDSLWVDIATARTEFYPYPAANPEVEASSIRQDLYRR
DFTINALALRLTSPRAGDLLDFFGGLLDLKAKQIRVLHANSFIEDPTRIY
RGVRFAVRFGFAIEPRTEEYIRYAINSGVYDRTAQNNTKTPALQTRLKTE
LKHILEAPYWRSALQLLDDLGALHCIHPTLSLNSELIHQLRLLERCLRRF
DPQQNLIPWQMRLEAIIAHLAPMYRGKVAKNLQLQEDSIQHLQNLAVAQA
EIVQSLSEYQRPSQVVQLLRKYNLEILILIALQSQRPIRRQIWQYLTVWA
NVQPILNGNDLKKLGYKPGPQYRQMLDDLLAKTLDGNITNQVEAQEFLAQ
QYHK
>Ava_1793 Beta-lactamase-like
MLFRQLFDPETSTYTYLIADLETKTAAFVDPVLEQVERDQKLLTELDLTL
GYCLETHIHADHITGAGKLREKIGCENIVPFGANAACANKKMQPGDVLQF
GSIVIEAIATPGHTDSHLAYLVNKTHLLTGDSLLIRGCGRTDFQSGSAAA
LYDSITKNLWTLPDSTLVYPGHDYHGQTVSTIGEEKKFNLRLVGRSRSEF
IELMGNLNLPNPRKIMEAVPANQRCGDVAMTSS
>Ava_2971 WD-40 repeat
MQYQYQVGGSLPADAPTYVKRQADEDLYTGLKAGQFCYVLNSRQMGKSSL
RVQVMGRLQAEGFACAAVDITAIGTAEITPEQWYAGVIDTLVGYFNLYTD
FDLETWWNNNGLLSPVQRFSKFIETVLLPRITENIVIFIDEIDSVLSLDF
NLDDFFAVIRDCYNRRADHPEYHRITFALIGVSTPSDLIQDKGRTPFNIG
RAIDLTGFELAEAEPLAQGLAALGNPQEIIAAVLAWTGGQPFLTQKVCNL
LIADYLVETRFIASGENKEKSSIENLVATVVKNRIIENWEGQDEPEHLKT
IRDRIMRSGEQRTGRLLGLYQQILQQSELVADDSYDQMMLRLTGLVVRRD
GKLRIYNRIYAEVFQQEWCENILAGLRPYSDTFNAWVASNYQDESRLLRG
QALQEALAWRIGKNLTDVDDRFLDASQELQKREVEKSLALARKEQEILTA
ANKKARQRVFIASVLLIISVVAAVGLGVLAGRSNKQLADARTERDKIDRE
KQQKERELATAQQRVVDANKKVIDANKNLQDATANLKQQQLTAKQQLNAT
NQQLKQAQDKEKQARGQVEKAQNDLRQAREQQRQALVGLKTAEAQQKRAQ
DNLKKTEAEREIALTGTRLERSGVANINRFEFNQIGALLAAMRDGRELKS
LIDKQGIKQLKDYPAASPVLALQTILDNVRGMTVMAGHENWVNSATFSPD
GQRILTASSDKTARLWDLQGRQIAKFQGHESSVNSATFSPDGQRILTASS
DKTARLWDLQGRQIAKFQGHESSVISATFSPDGQRILTLSGDRTTRLWDL
QGRQIAELQGHEGWVRSATFSPDGQRILTASVDETARLWDLQGRQIAKFQ
GHKSWLFSATFSPDGQRILTASSDKTARLWDLQGRQIAKFQGHENSVISA
TFSPDGQRILTLSVDKTARLWDLQGRQIAELQGHEDWVNSATFSPDGQRI
LTASSDKTARLWDLQGRQIAELQGHEDWVNSATFSPDGQRILTASRDETA
RLWNLQGWQIAKFQGHENVVSSATFSPDGQRILTASPDKTARLWDLQGRQ
IAELQGHENVVSSATFSPDGQRILTASPDKTARLWDLQGRQIAELQGHKG
WLFSAIFSPDGQRILTASDDKTARLWDLQGRQIAELGHKGWLFSATFSPD
GQRILTASSDSTARLWNLQGREIAKFQGHKNLVISASFSPDGQRILTASS
DKTARLWELQGREIAKFQGHEGDVITAIFSPDGQRILTASRDKIARLWDL
QGREIAKFQGHEDWVNSAIFSPDGQRILTASRDKTARLWDLQGREIAKFQ
GHEDWVNSATFSPDGQRILTASRDKTARLWQVESLEQLLARGCGWLRNYL
IYAPNLSESDKQVCKKE
>Ava_0535 Rieske (2Fe-2S) region
MVNTNPTLEQILPGGSDPHSFDWHEAWYPVHYLEDLDKSKPTAFTLLGKD
IVIWWDQQSQTWQAFEDQCPHRLAPLSEGRINEEGLLECPYHGWSFAGDG
SCQRIPQQPEGGQAETSQRACVTSWLTTERQGMLFVYIGNPENAAKTKIP
VVEPLEESPDGWVIINTFRDVPYDALTLLENILDPSHLAYTHHKTVGNRK
NAAPLELEVVVSGKHGFKGVWHQSIKAGQSELSTTFVAPALMWHDINSER
GRILTVVYATPIRKGECRLFARFPFKFPSKLPGIFIKLRPRWYYHLGQNG
VLEDDQIFLHYQERYLQAKGGSPNFTKAFYLPTKADSFVFELRQWVNNYN
AEPFPGQAFSQAIPKEQLLERYYSHTIKCASCRNALAKIQKLRLWSGVIT
VISLVSTPLITLFFDASSIPAILIETVTPITFAVIWRGLSKLEKQFYEGR
IIPPRNLPNS
>Ava_4286 CobB/CobQ-like glutamine amidotransferase
MTLELTIGWLYPTLMSTYGDRGNVITIERRAQWRGYTVKVLPLDQNSTAE
DIKSVDVIVGGGAQDRQQEIVMRDLQGAKADAMREKIENGTPGVFTCGSP
QLLGHYYEPGLGQRIEGLGILDLVSIHPGENTKRCIGNLVIEVTASRLAK
DLEEMTGSKAYLVGFENHGGRTKLGKVEALGKVVYGLGNNGEDGTEGAFY
QNAIATYSHGPLLPKNPFVADWLIQTALRLKYQQPITLQPLDDSLAVQAR
EAMFKKLQVNPPKLSAVAKVSG
>Ava_3813 Phospholipase/Carboxylesterase
MRKPYKFKNQSGLLLNRVDFRSKSLSLQFTTFPPANSQTPAGLVVTLHGW
GANAEDVASLLPYFNLPDYQFVFPNAPYPYPYAPLGRSWYDLRQENMYEG
LAQSRELLKDFVLSLESSTGVPLSRTILSGFSQGGAMTFDVGSKLPLAGL
VVMSGYLHPEAISPDHTNIPPTLILHGTKDEVVPLQAAVKARTTVESLGV
PVQYQEFEAGHEINLEMLNVARNFIVNALV
>Ava_0752 sporulation-control protein Spo0M-like
MAVVFRLTADQLEVLLEIDKRASGWKGSPEEAFDYR
>Ava_1022 Exopolysaccharide synthesis, ExoD
MAIHLNSHQVRLSFSQEIKSLLQRLAEQHLTLGDILAETSERGFSLVIAL
LVLPFLFPMPPGATGPFGAACLILSVQMVLGRRSPWLPKKIAHYKFPRPF
AQFLLQNLRRVTKIVEKIARPRLSKIADNPLTWRINGFCISWLTILLISP
IPLTNPIPTVGILLLAIATIESDGLLICLSYVVTAVITAVFAFIGYGLWL
APSILPSIFK
>Ava_2104 WD-40 repeat
MNNLPNSHNLTPENEQSLQTLVRAITFSQGEFSLILLRCNYAALRKRITQ
QLHQISPIKIHEITLPETVKTLYTNINEQLGDEQPPALIVFGLESAKDID
TVLTAANRVREEFRKNFPFPILIWVNDQVLQKFIRLATDLENWATIIAFE
SSTNELISFIQKKTDEVFIGDLIPNPQICWELNRGIQDLQNRGRELNQVI
QANLEFVRGLHDYLYDNTDSALAHYQDSLDFWQKNHNLKRQGILLFNIGL
TYVRQAQKNQIDNQDCYQKARKYLQRCIAILEKGHFDNLIAKYINTLGEV
LRNLNAWPDLYNLAHKSLKLHKNYGQSLKIAQSYGFLAEVSLESFQWYEA
KKLAQKALQRLKNISNQKVPERSLYLFILARSQKELNENTDAISNLLAAK
NETCAYYNPRLYINILEMLSNVYFEQSQYLAAFQIKQEQLQIEQQYGFRA
FVGASYLNPQLQAINPAQLQVYRKGTIAQEIVASGREKDVQRLRERISST
EHKLTVIHGQSGVGKSSILQAGLIPALQEKPIAERDALPLLLRVYTDWVG
TLGQNLAQAVEEVRGYKLSHDLNSALAIREQLQKNSERHLLTVLIFDQFE
EFFFVYTDKASRKAFYEFLRFCLDIPFVKIVLSLREDYLHYLLELERLFN
LGSINNNILDKNIRYYLGNFTPADAKAIIQNLTEKTHFYLEPKLIDKLVE
DLSCQFGEVRPIELQIVGAQMQTEKINTLAKYCQFGTKEKLVERFLETVI
RNCGSENEQMARLVLYLLTDENGTRPLKTRTDLASDLKPEVKTLDLILEI
FVKSGLVLLLPEVPADRYQLVHDYLVPFIRQQQGNELLAELEQEREQRKQ
IEQELERVETILTQVNAELEKQQIVLREVQAGTTLEQEGVSALRQFDFAQ
LDSLMSAMRSGKALQALVKDGRSLAKYPATSPLLALQTILDNIQERNQFQ
GHQGWVRSVSFSPDGEYILTASDDCTARLWNLQGKQLISLQGHEDTIWSA
NFSPDGKYMATASSDRTARLWNFRGQQLAKIQGHQGYVRSVSFSSDGKYI
ATSSDDRTARLWNFSGQQLAQFSGHQGTVWCVSFSPDGKHIATAADDRIV
RLWNLKGKLLVRFPGHQDCVWDVSFSPDGQYVATASSDGTARLWNLAGEQ
ISRFRGHQDVVWSVRFSPNGKYIATASSDRTARVWNLNGQQLEQFPGHQD
YVRSVSFSPDGKYIATASSDRTVRLWYLNKQQFPPFRGHQSTVRSIDFSP
DGQQVVTASDDRTVRLWSIQGEELLQFLGHRGKVWSVSFSPDGKYIATTS
SDRTVRLWDVTGQMLQQFPGHQGTVWSVNFSPDGQHIATASSDLTARLWS
LDGQELMRFKGHDKWVRYVSFSCNGEHLATAADDCTARLWNLQGQQVGQF
LGHQSTVWSVNFSPDCQYLVTASEDHTAKLWTLDGQILTEFRGHQAPLKS
AVFSHNGQYIATSSDDRTVRLWNLNGQQIAQFKGHKGAVRSISISPDDQY
IATASDDRTVRLWPIENLDQLLRRGCNWLQDYLENNTHVTESDRRLCEIL
KY
>Ava_0722 ABC transporter-like
MLRLEHISKIYPTGEVLKDINWEVKPGDRIGLVGVNGAGKSTQLKIITGE
IEPTAGEIIRPASLHIAYLNQEFEVDPSRTVREEFWTVFKEANQVQLALT
QVQHDMQTATPEELDKLIDKLDRLQRQFEGLDGYGLEARIGKILPEMGFE
QEDGDRLVSAFSGGWQMRMSLGKILLQAPDILLLDEPTNHLDLETIEWLE
NYLKKLTTPMVIVSHDREFLDRLCTQIVETERGVSSTYLGNYTSYLEQKA
ESQLAQLSAYERQQKEIEKQQAFVDRFRASATRSTQAKSREKQLDKIERI
EAPVAGVRTLHFRFPPAPRSGREVVKIKDLTHTYDDKILFLGANLLIERG
DRIAFLGPNGAGKSTLLRIIMGLEPPTEGSVGIGEHGVIPGYFEQNQAEA
LDLKKTVMETIHDEVPDWKNEEVRTLLGKFLFTGDTVFKQVEALSGGEKA
RLALAKMLLRPANLIILDEPTNHLDIPAKEMLEEALRNYDGTAIVVSHDR
YFISQVANKIVEIRDGEFRVYLGDYHYYLDKIAEEKETARLEAIAAEKAA
KKAAKSAKSSAKKK
>Ava_1121 Glycosyl transferase, family 2
MRLSACITTRNRPEDLENCLRSLWDSQIKPHSVIVSDDSPSMEMQQQNQK
IVEQYPQTIYITGPRIGVCANRNNAVNAIPASATDLVAFIDDDICVEPEF
IGGAIAQYAKMSPEQSQHTILSGISYSPDGYVMAPGKLSFRGYFRSSDVP
ETIAIHASIFPRQFFEQEQWDENIFFGYEDAELCLRALKRGYKILNCPEL
RVLNAGGNGKSSLMESDIGKLTKYEISIEAARLYIGIKRYKDLFPNVIKL
IGFCCVYFLHMTAYLCKRGSLQAWPEIIRRSHIQKLWQPSQLNWG
>Ava_0895 UbiE/COQ5 methyltransferase
MQRQLEPEVMDSWEEASEYDAMDFTEVNNAFAEEAVACGPSEHGLVLDAG
TGTARIPVLICQKRPRWQLVAIDMAENMLQIATQHVQQSGLQEHIRLELV
DAKRLPYEDGIFDLVVSNSLVHHLPDPLPFFAEIKRVCKPQGGIFIRDLF
RPEDEATMNAVVASIGNEYDDYQKKLFRDSLHAALTLDEVNQLIITAGLT
GVEIYQSSDRHWTAKRSWTN
>Ava_2681 ABC-1
MFLTQTVPRQREIIEVVLRNGWDYMRRLLTGGKADEPQLPTPAVLKNILV
DLGPVYVKLGQLMSTRPDLLNAAYIEELSTLQDEVPPVPWTEIEILIRKQ
LKRPLEETFSKINPVPVAAGSIAQTHRATLIDGREVALKVQRPGIDLTIA
QDIALIQGIADLVARTDFGQNYEIKSIAEEFTKALEAELDFTREAGHTDL
LRRNLSRSRWFDPTQLVVAEIYWSLTTEKLLVMEWLDGVPILSASLNNNN
GKDPVAERKAVTTLLFRAFFQQLYVDGFFHADPHPGNIFYLSDGRVALLD
CGMVGRLDPRTQQILTEMLLAIVDLDAGRCAQLTLQLADSAQPVILSRLE
SDYDRMLRKYYNVSLTEMNFSQIFYEILQIARNNKIRLPSNMGLYAKTIA
NLEGVAQTFNPEVNLFDEIQPLITDLFRRQLLGDNPVRSLLRTALDLKSL
SLQSPRQVELLLDRVTSETLRWNLSLHGLDGVRRTMDDAANRLSFSILVG
SLIMGAAVISTKAQTSELSFLSSVLFAVASLLGLWLIVSILRSGRLK
>Ava_4445 conserved hypothetical protein
MSKSRTSAFWNYVAECVRLLYWSCFKPYTFERWLRDIHPELKPQDNPFTK
RIEFSTNPHLRRYAEQVSWLIAVVPIFAVLLAAPIYTLVTGQSFDWLISC
NFCLGWFLGRFTRNLLECSDKEKLVIWLFIIFIFIWILKFLPGMALSAKL
SLVSSVALGMTLNILIILLIGVAFSLFIGMPIGILLGLDFSVALLKKRVH
KEEIASKISKTLGAVVSISIGVFVGLLTSIMSCLVLGIAIGKKQGVEWSV
LISINVLVISVFSSLLRVCLWLPELLWIFLLFLYSKIDILFSMTEEKNLS
IDFLDDEVPNFYQIKTQAKYLSYLPHRFDELIILPLPFLEQMIAEAYQDN
SLTALETIDYLITSTNQQKVAVQAMLTIAADIFNSCQRLEDIVDIANQLT
WVPSPLPKELGFVLPPFLDISQSVRASQQATTFYRRYELLNPPISALNEL
KNTLAFGKNASLATKFGSISDRWLTILQLAQRTLEETARQSQEIRQVYIA
GNSLDPETAKYRFKGRIDIFREIETLTLSDQPPVLLLYGGRRTGKTSALK
YLPYKVTSDIIPLLIDVQGAASATTIKGFTENLAQQIIDTARRLPRKLHL
PNPDSSKLNEDPFPALQTWLAEIERSHSGKRFLLCLDEFERLSEVVNATN
SRAPLNFIRHLLQHQKQWILLFSGSHLLSELDAYWSDYLINTRALRMTYL
QESEVRELILKPVEDFTNIYEPEAVDKIIQITCCQPYLVQLVCYELVELL
NRDIRANRRQPNTAKATVADVQDIIPIVLERGDQYFRELWTSLEESDRNL
LRRLIDGETPTPQDSKIVKKLSRKEILTPEGNTFQVPLVERFVEYLLEEE
>Ava_3113 Alpha/beta hydrolase fold
MKDWWQETFPKGRQSLIISDVHGYPVQIAYGEKGTGRPLFLIHGMGSWSY
NWRYSVGPLSKFFRVICVDAKGFGFSDKPCLRQEKNGHQVIELERIIQCL
CDEPAIVVAESLGALVALALAQRNAELIGRLVVINAPIFTESLPHWAMSI
LAQTPIEILQTIDDLRLAYLFAPIVREVMAIERRKVLFDPSILTQEDVYW
ITYPFIEIPGTLVKVAEELQIAAREIENWQANKPNMLSEIQNKLNTIEAP
TLILWGDKDSWFPASHGKKLHQHLPNSKLQILDNCYHDASTGSAKVVNKE
ILQFLKETDFL
>Ava_2899 Protein of unknown function UPF0001
MISSINERITHIRASLPPSVRLIAVTKQMPTEVIRAAYAAGVRDFGENRI
QEAASKQAQLQDLPDITWHFIGHLQANKAKKALEQFQWIHSVDNLKLAQR
LDQLAQQLGVNPQVCLQVKILPDPNKSGWSVPELLADLPALNQCKTLQIQ
GLMTIPPFGLNDAKILHVFNSTSKLAEDIAEQNWDHIHMEQLSMGMSGDY
QLAVQAGATMVRLGTILFGQRT
>Ava_1895 Short-chain dehydrogenase/reductase SDR
MKGRQVLLTGGTGGLGVGVTPVVLAQGANLTITYRNLKEVERLKEVLPPA
DFARVTFLPANLEEESSVDNLITRIGRVDVLIHLVGGFSMGKIHEYSYYS
WKREFDINLNTTFLVCKYSLKSMLDHGYGRIITIGSKAAVEPSGGLAAYS
AAKAGVVAFTKAIADETKGTNITANVILPTIIDTPANRQAMGTENADKWV
KPESIGELICFLASEKAKDIRGAAIPIYGGV
>Ava_2732 Protein of unknown function UPF0118
MSGIEAKNLWHRLNNLALVRFLLLVAAGWAIVQLLAYFETVIVIFTFAAI
LAFLLSYPVQWLRRFLPHNIAVVVIFLISIVILGGLLITVGIAILSQGQQ
LIDSISAFLTSLLPFLERVEGLLRNRNLQIDLSVIQEQLRTQAVSTLVTS
LAIVQQFLTNFVTFILIAVVAFFMLLDGEKLWNFTLKIIPQKRRIRFTNI
MRRSFLGFFRGQLLLCLFLTSSTFIIFLLLQVPFALILSVIVGILDIIPG
IGATLGVGTITLIVLSQDVWLALKVLVACVVLQQIQDNLISPRIMQGALN
LNPVVVFFALLVGARVAGLLGVFISIPITGVIVSLFEIDEMKSEV
>Ava_3501 conserved hypothetical protein
MIIVSDTSPITNLAAIGQLDLLRQLYGSVIIPEAVYNEMASVNKIVPGAV
EVQTLSWIQTQTVMNSLQVTEIQENNESIHLGEAEAIILSLEMKADLLLM
DERRGRIVATNYGINVTGLLGVLLQAKKQGLIPVIKPLIDQLITQADFRV
SPQLYTVVLQASNEV
>Ava_3132 RNA-binding region RNP-1
MSVYVGNLSYDVTEDSLNAVFAEYGSVKRVQLPTDRETGRMRGFGFVEMG
SDAEETAAIEALDGAEWMGRDLKVNKAKPKEDRGSFGGGNRGGYGGGGGR
NRY
>Ava_3863 Protein of unknown function UPF0118
MKLGQWLGLLALVISLYILWEIRQLVLLLFTSVVLAVAINQLVLRLQLSG
IKRIWAVLLSVGIVVTFLVGALLLILPPFIEQFRQLLVLLPTGINQIQQG
INWLEERLVDSYLPEIPDIDRLLEQIQPWVTRLTQQAIALFSTSVTALLE
TLLVIVLTLMLLANPQPYRRVFIRFFPNFYRHRVDEILTRCATGLGDWTI
GALIEMLFIGTLSGLGLWILQVPLALAHAVLAGLLNFIPNIGPTLSVFLP
MAIAFLDAPWKAVAVLILYLVIQNVESYWLTPTIMAKQVALLPAVTLTAQ
IVFVTLFGALGLLLALPLAVVAKTWIQEVLFNDILDEWQPSNSSNLY
>Ava_1414 Aldo/keto reductase
MPTGLDCTQTMLYRRFGKTNLHLSVFSLGTMRYLADSENVQQTIATALAL
GINHIETARGYGKSEEYFGQAIKVGLSVARSQLYVTTKIPPTSDADSMRR
YIDESLERLNLDYLDCLGIHGLNTWEHLEWVQAKGGCMKAVEEAINDGRI
RHVGFSTHGSLEVIQAAINTDFFEFVNLHYYYFFQRNAPAIKLASEKDMG
IFIISPADKGGKLYTAPQTLQDLCQPLSPLELNYRFLLSDQRITTLSVGP
ATPEELVEPLQVADSCGELTSAEITIFQRLQNHQESVLETDKCSQCYACL
PCPENINIPEVLRLRNLAVAYNMTDYGQYRYGMFENAGHWFPGMKANRCT
ECGDCLPRCPEKLDIPNLLEDAHNRLNGRAGRRLWG
>Ava_5015 conserved hypothetical protein
MLKDAQGLVVTTDSAKAIAAINRFTQQMLGYGSDAETAILQAIAADPTCA
LAHAYAAAYYLTQENRKSWQQAQPYLRTAQQHFAKATAREQLYIQAISAW
ANQEIEVAIAIHEEITDKSPCDLISVQQGQYHYFYLGDKEKLWQIAQKVL
PSNPENHYLYGMAAFGLEQCHQLEAAENMAYQAIAINRYDPWAHHAIAHV
METQKRVDEGIAWMESFADTWENCNSMLYTHNWWHIALYYLQLENYREVL
NLYDTHIWRRANKQSPKDQVGAISLLLRLELHGVDVGNRWQGISPYLYSR
IDEHALPFQDLHYVYALAKAGHHDWVKQMLLSMQYHALSINPFQRRRWLE
ITLPAARGMVAHAQGDFHTTVAELQPVLSRLHEIGGSHAQRVLFGQVYQD
AVSSSQQQSWVYGITA
>Ava_3338 Short-chain dehydrogenase/reductase SDR
MEDFVTPPAFGEKVRERWTLAGRKALITGATKGIGLAIAQEFLALGAEVI
IVARNAEAIEQQINAWDSAGKVHGVTADVSTSEGRQIIHEYVSKTVGELD
ILVNNVGTNIRKKATDYTEEEFAGIFQINLTSIFELSRLFYPLLKTSKNS
SIVNIASVAGLISVRTGAPYGMTKAALVQLTRSLAVEWADDGIRVNAIAP
WFIQTPLTEPLLNNPETLSAVLSRTPMKRVGQPEEVASLTAFLCMPTASY
ITGQCIAVDGGFLAFGF
>Ava_3752 HAD-superfamily hydrolase subfamily IA, variant 3
MNTKCPSRDFTYQDWILTETRFNPEQLHSRSTVFTIGNGYLGTRGSLEEG
HARGLPATFIHGVYDDVPVVYTELANCPDWLPMIIAINGDRFRMDQGEIL
QYERKLDVSQGLLSRSLRWRSPSGSIIDIHFERFASLADHHILGQRCQIT
AHDGDCLVEIQASINGYAENQGFNHWEGIDQGKTEPGIWLQSRTRGTQIE
IGMAAKMTISGVEAALQVSIVPGYPTISASFLAKSQQTITVEKLVTVFTS
REVNKPVTAAQEKLAQLPDYTTLLTANKQAWDEVWQKSDIYIEGDPTAAF
AVRYNLFQLLIAAPYHDEKVSIPAKTLSGFGYRGHIFWDTEIFILPFFTF
TQPALARNLLSYRHHTINGARRKATHYGFKGAMYAWESGDTGDEVTPRWA
LPDNYYGEDVRIWCRDREIHNSADIAYAVWQYWQATSDDVWMRDYGAEII
LDAAIFWSSRVEYNSQGDRYEIRGVIGTDEYHEFVHNNTFTNRMVQWHLE
KALKVADWLRHTFPEGAKELEEKLQLTPELETHWQDIIKKICIFYDSSTG
LIEQFEGFFQLKDINLEDYEPRQRSMQAILGIETTNQHQVLKQPDVLMLL
YLMRLSAEFPYNEKALKSNWDYYAPRTDITYGSSLGPAIHGILASDLGKS
ATAYERFMQALMVDLEDSRGNTNDGIHGASAGGIWQAVIFGFGGIQLTEQ
GPIANPHLPPNWTRLKFQLHWRGQWYPFDLPGGVGIGDWGLGTGGVTSTQ
SPTPHTHSPDIRGFIFDLDGVLTDTAEYHYLGWQRLADEEGIPFNRKANE
ALRGVSRRESLMRIIGDRPYSEAQIQEMMERKNCYYVELIEHITPKDLLP
GAIALLDELRQAGIKLGIGSASKNAHTVIERLGLADKVDAIADGYSVQKP
KPAPDLFLFAAHQLGLEPKQCVVVEDAAAGVEAALAGGMWAVGLGPVERV
GAAHVVLPSLAGVTWTDLRTKLNEAAGV
>Ava_4370 Peptidase M16-like
MNQLSRSISRRLLATLMATVVIWWGWTPEIALAQTPPALQPSKTPTAKVP
TSIQPYLDRVIKDLTEFRLDNGMKFIVLERHQAPVVSFLTYADVGGVDEP
DGKTGVAHFLEHLAFKGTTRIGTQNYQAEKPLLERLEQLDTQIRAAKANG
KQDDVARLQATFKEVESQAGKLVKQNELGQIVEQSGGVGLNANTSTEATR
YFYSFPSNKLELWMSLESDRFLDPVIRREFYKEKDVILEERRMRIENSPI
GLMVEKFIDAAYKVHPYRRPVIGYDQDIRNLTPEDVQTFYNTHYVPSNIT
IAVVGDVKTAEVKQLAQTYFGRYKAAPKPQSKITPEPKQTQTREVTLELA
SQPWYLEGYHRPAVTHPDNAAYDIIASLLSSGRTSRLYKSLVEKERVALN
AQGFSGFPGDKYPNLMLFYALTAPGHTVDEVAVSLSKEIDKLKTEPVSAV
ELERVKTQARAGLLRSLDSNMGMAQQLLEYDVKTGSWRNLFKQLDEIVAV
TPADIQRVAKATFTPENRTIGKLLSKKA
>Ava_4319 Alpha/beta hydrolase fold
MTIENYQFNYSLTSNTDKPVILLLHGFMGNIDEFDAAIELLGDDFSYLKL
DLPGHGKTQVFGGDEYYSMANTAQGLINLLDKLEISKCFLVGYSMGGRLG
LYLALHFPERFYQVVLESASPGLATEAERLDRVKRDAQIARKLGRSLAKT
DFTAFLLNWYNQPIFGNIKNHSWFERMVESRLQNHPHELVKSLQFMGTGS
QPSLWEKLQKNQIPLLLLVGEHDEKFIDINIKMTKIAPASQLKIISNAAH
NVHLENTLEFVQQLKVFFTKSVPSDDQ
>Ava_4993 ABC transporter-like
MNKELFCIENLRVAYPQRSGEELQWAIDDVSFTLQPGERMGLVGESGCGK
STLGRAAMRLLPPSSLVEGRVTFQGKSVFDLTPNELRKFRGEAVALIFQD
PMTRLDPLMTIGNHCIETLQAHAPELSAKEAKEKALATLEKVKIPASRWG
QYPHEFSGGMRQRVAIALALLLNPKLIVADEPTTSLDVTVSAQILQELTR
LCSEENMALLLISHDLAMVAEYCSRIGVMYNGKMVEMGTTESVFKNPQHE
YTRSLLKAALHIQAVDEGSSEASQSPIPSPQSPILRITELKQYYTIEPNF
VERLFKAESQTIKAVDGINLELYPGEILGLVGESGCGKSTLSRTILQLIR
PTSGKVEFLGQDLTKLSRQEIRACRRQIQMVFQDPHACLNPAMTVGESIA
DPLLIHHLADAAKAKEQVLWMLQKVGLTPPEVYYQRYPSDLSGGQQQRVA
IARALITRPKMLICDEPVSMLDASVQSQVLDLMLQLKEEFELTYLFITHD
LWLARFLCDRIAVMHGGKIVELGETKQIFAHPQHPYTQTLLAAAPLLARA
>Ava_2288 Alpha/beta hydrolase fold
MLQFQPPGFGHKVVHTSLGAMVYYTQTAAPWWDDDSEDLPPLLFLHNFGG
GASAYEWSKVYPAFASTHRILAPDLIGWGESAHPVRDYEIRDYLTAIAEF
ISQTCQQPVKVVASSLTAAFTIRLAITQPDLFDSLYLVCPSGFDDFGQGA
GRRLPLPIINAPLLDNLIYALGAENEIAVRNFLQSFLFAQPQRVSQEMVD
AYLYSAQQPNAKFAALSFLKGDLYFDLSLYIQQLITPTVIFWGEKAQFTR
IELGQRLANLNPHIIRKFYAIADTGILPHLEQPEIIIGLLQPYVKV
>Ava_3358 Glycosyl transferase, family 2
MSEASNLLVSIILVNYNGADVIPNCLNSIDKFIPKDNCEIILVDNASQDN
SYELVAKDFPDVKIVKLPKNYGFGSGNNAGAKIAKGEFLLLLNTDTILTT
NILPHLIDLMRENPEVGVIGTKLLFPDETFQVSFAYTISLKGEYKSRKLH
KYAEDKSKLNSLEQEFNTIKEVDIVVGAALFIRADLFHSLGGFDEKFFIY
FEDADLCKRVQNQGYKILYTPQVSLIHIRGHSMKKNANATAMEYRRSQIY
YYKKHCPLWERIILRIYLLAKFVPEFLATRNPYSLEIIKLLRKF
>Ava_2809 Serine/Threonine protein kinase
MSGASLIGKTLRSRYYITDKIGEGGIGETYLAIDKDQPEDYQCVIKRLKP
QNTNQSTIMWLQRAFNREAVTLQRLGSHDQIPRLLAFFREDEEFFIAQEF
IHGINLRTEFSNSGKWSETQVISLLQNILEVLDFVHQNQVIHRDLKPENL
IKRSSDNKIVLIDFGAVKEIGTQIINNKGKKVSTSFIIGTPGYMPMEQLR
QNPMLCSDIYAVGMIAIEALTGLHSTKLLDSYTGQIVWRDRLKIREDLAE
ILDRMIAYNPHHRYQSAAECLQETLSLSQGLNHQRQTSISLSLSPICTII
RFGSFGRITTNEFIGKFLAITQNRKLVKTPGISLILASIVAIIVGKGIWS
QILINNIDAMADEISLPASRATKGHTPNINISKFLSYFINSSANLTKKNP
TDLPIIKPLVHNQSSSPVLSRLQTLRLENIQIHQVATPRTYNICNAPLQL
LQPNHSDRLNLDWQLDWTTKGNAHVGRLKMQGYSGKMRTISPDGKGGLRT
VEQTMQLYTSAKGYVLLGFNPMDVEKQQARRYKANNLIIRVEIDGSTTII
NCDDSGNISSAIAQEF
>Ava_2599 ABC transporter-like
MYLRLENISKRFNSFIANDNISLSVDAGKIHGILGENGAGKTTLMNIIGG
LYQPDAGEIYLQDQPVKITSPNQAIKLGIGMIYQHFMLVPQLTVTENIIL
GRENSWRLNLRQKQQEIAALSQAYGLEIDPTAKVEDLPVGTQQRVEILKV
LYRQAKLLILDEPTAVLTPTEVESLINILRQLAAAGNTIIFISHKLEEVI
NLCDTVTVLRRGKVVATTTTKDMTPQKLAELMVGREVVLQVNKSAFVPGK
VILSVENLQVADDRGILAVHNVSFQLLAGEILGIAGVDGNGQRELADAIA
NLRGILNGTIQLNSSSPQQKIGYIPEDRQKMGLVLQFTIAQNLILNVFKK
IPFCRHFLLKSSVIKHHAQVAMQEFDIRATGEDIQVSQLSGGNQQKVVLA
RELAGKPDLIVAMQPTRGLDVGATSAVHSRLLTERDRGAAILYISTELEE
VMAMSDRIAVIYRGKFVAILDAQTAIIEEIGLLMAGGTRRE
>Ava_0173 WD-40 repeat
MVRPYLILLFMGVVALPVNLGSGLRVNAADTVQVTPNPEPTTGFTNPRLL
HSLNAHSGRVKSLTFSPDSRTIFSGGAYNDGIIRLWNSTTGKRVGTINKA
QKNAVESLVISPDGQTLASSGSDNIINLWNLKNNQFTRSFVGHTASVMSL
AVSSDGKVLVSGALDGIRVWDLLQQRPLSTLVRFDNRIDALAMSSDGQTL
ASGDTKGVIKLWNLSTGKLIREFTAHSGTVTDIVFTPDGQNLISCSSDRT
IKVWHIPSEKLSRTLTGHNNWVNAIAINRDGKTLASAGRDGIKLWDLSTG
ELLNTLIGHSDWVSAIAFSPDGKTLASGGFDGRISIWGNPPVTVRK
>Ava_2812 conserved hypothetical protein
MRFRDYRLFTLGRVLLSVGSQMQTVAIGWELYERTNSAMVLGGVGLAQVL
PMIALTLVAGDIADRRSRKLTILLSVMLLALCSLGLAILSYTRGAVFLIY
GCLALIGVGRAFLKPASDAIMWQLIPVNAFTNAATWNSSSFQLASVAGPA
LGGLGIAVLGSATGVYILAAIAGFLCFFFMAAIKEKKVERVTEPISLQAL
AAGAKFIWENQLILAAITLDMFAVLLGGAVALLPVFAKDILQVGPVELGY
LQAAPSIGALIMAVTLAYLPPLRKAGTALLWSVIGFGVVTIIFGLSRWFW
LSLIMLTLSGALDTISVVIRHTLVQIRTPDQLRGRVAAINSVFISASNEL
GGFESGLTAALFGPVLSVVGGGIGTILVVVATAAIWPGIVKLGSLQEYE
>Ava_1439 Flavin reductase-like, FMN-binding
MVSMSTTGNAHTENVQHRLTVETVEIAPNTTAIRCLDWDRDRFDIEFGLQ
NGTTYNSYLIRGEQTVLVDTSHQKFRQLYLETLKGLINPKAIDYIIVSHT
EPDHSGLVEDVLQLAPRATVLASKIALQFLEGLVHDPFSKRIVKSGDRID
IGKGHEIEFVSAPNLHWPDTIFSYDRKTEVIYTCDAFGMHFCDNRTFDED
LEAIEADFRFYYDCLMGPNARSLLNAMKRMGELGKIKIIANGHGPLLYHH
LDVLTECYQSWSQRQAKAETTVGLFYVADYGYSNLLVQAIGEGIQKTGVA
VEMIDLSTAEIQEIQELAGRAAGLIIGMPPTTSVAAQAGISSLLSVVKDK
QAVGLFECFGGDDEPVDTIRRKFIDLGVKEAFPAIRIKDVPGASAYQLCT
EAGTDLGQLLTRERNIKQIKSLDVNMEKALGRISNGLYIVTTKKGDVSSA
MLASWVSQASLQPLGFTIAVAKDRAIDTLMQVGDRFVLNVLEEGNYQELK
KQFLKRLHPGADRFAGVRTQIAKNGSPILTDALAYMECEVQSSLECSDHY
ILYCTVEDGRVSKPDGLTAVRHRKVGNYY
>Ava_2161 Peptidase U62, modulator of DNA gyrase
MVERALALSELNQSEPVELVSNSKPSYPDLGEAVSVEVLVGWGKEAIAII
RDNYPDVLCNSDWECDVETTRLVNTQGLDCYYSDTTLSCYMSAEWVRGDD
FLSVSDGQTQRDYLDPEKLAYQILQRLVWAKENVPPPNGRVPVLFTSKAA
DMLWGTAQAALNGKRVLEAASPWAERVGKQVIAPSLTLYQDPQAGPYSCP
FDDEGTPTKSLVFIEKGILQNYYCDRTTGRQLGNSTTGNGFRPGLGSYPT
PGLFNFLIKPGSKSLKDLIQNMDDGLIVDQMLGGSGGISGDFSINIELGY
RVQKGQVIGRVKDTMVAGNVYTALKQVELGSDADWNGSCYTPSLIVEGLS
TTGRNN
>Ava_4402 conserved hypothetical protein
MEPEKLGRFKEYGELILQKLDFVPQSPSQQEDWVPASLDDCLLRLREAAQ
KTVELATSPVKIGVMGEFSSGKTLLLGSLIGYADALPISENPTTGNVTAI
HLIPHPGFTTTQVGNFTVEYLTREGVNECLRFMLGEANRRTIAAGLPAMQ
PAKLSSGKEILGWCEASWKSSNNLELRYILRELVLFIRAYSSYGEALCGG
RYEIDPDSAREGLQLAEQPLAIQTLGFEDLPPAHIRLPSPPQKLATKLLQ
NSFPLIRRVDIDVKISREIWDITDASEFTLLDFPGLGAANSGARDTFLSL
RELAEVQTILVLLNGKSPGSDRANKIFTMMQQQRPGQDLKDLILVGVGRF
DQLPLESEGGERLLDQLIDESRTPHLTADKVLQQLRVLQTTIDGASAFTT
NKDRIALLSPLLGLAELAKRSSTIKAGSPEFLANLDYPNYLERSKQLQQK
WGYLSDRLLESDPRSHLSRKLGYFAQDGGIAKLRELMQNHVATHGLKQLY
EDTSRAADNLRQQQDNLKGIIAEIHEQGIPTGDSQALIDLRTALENLDKT
YRNFQKDLGKEPLKDRRGTVVSDVVKDELTFRVLSWNHWTLLFNKANNGT
ITITESKGAAGKLFDRGNRTNTSIPTKSDDFYPAFEKTVKEVEEFARDRI
RQAVVDLLSKLSQQIAPERERLQALLNPEIEQDIEAKFGGEEADLFYQLL
LGSDPIQWQAAIISEINHQEKFLTPEIMFPLARQDEKHDIGQIFDWSPEK
AQTISKSSNHQMFVLRLRDEITASASLHLVQYVSEVNQRVNAELDGILDQ
IIPTLQNISKKDGLLRFIAAGDTQSSGAVPAWLQNLSEIADLAVKYP
>Ava_4114 Thiamine pyrophosphate enzyme
MPQLAPHIFDILYQKGVEHAFGIPGDFALTLFDALADSKIAPIVMTHEPC
VGFAADAYSRMRGLGLAVVTYSVGGLNMVNAVAGAYAEKSPLVILSGGPG
VREQKEHDLLHHKVKTFDTQRRVYEEVTLYATKLTDPKTADAKIHHALDY
ATTFKRPVYLEIPRDLVYAEITESEHLPPPIKRTDPDTLTEAIAETLEML
KRSHSPVILACVEVHRFGLQEQLLALAEKLGVPVCSTMLGKSVFPERHPQ
YIGIYNGEAGDLNVQKIVEESDCVLMLGVFMTDINLGMFTAHLNPGFTVY
ATSERLAIKHHEYPNVRFEDYITTLLDSPDLPHWDSSGIYTMKPRVTPSV
GKISMSGLLYELNQFIDSNTLLVTDVGDALFAADDIQTQQGTSFLCPAFY
ASMGFGVPGVIGAQLADPSRRAIALVGDGAFHMTGMELLTAQRLRLNPIV
IVINNGSFASLQAMGHQEAAFVQIPTMDYAQLANVLGGHGFVIHTSTQLQ
QALQTAQNSKTFSILDVHLSPDDVSPALQRLSALFTKSLKG
>Ava_0809 HAD-superfamily hydrolase, subfamily IA, variant 1
MERPKVIFVDAVGTLFGVKGSVGKVYSQIAQEFGVEVAPDIVDKAFMESF
KASPPPIFPDADAEDIPQREFEWWRRIALNTFESAGVLTQFADFSSFFGE
LYIHFGTAEPWVIYPDVVQSLSNWQHIGIELGVLSNFDSRLYSVLQSLGL
SHYFSSVTISTQVGAAKPDPKIFAIALEKHNSSPEEAWHIGDSIEEDYQG
AKAAGLRGVWINREKSYN
>Ava_0872 conserved hypothetical protein
MDNKSKPDWAGESLLSNFVNLLIQTKPIYGVMKQQARQVLIKTAEKNGIP
WRKNYEALQASPAKKLLAAVTNPHVIYPDYYKVPFHAYTEGNLCWDAAFE
TESATYAMALRVWPQENLTWEAAHDRLRGTFHDALETYAPQQVRDILDIG
CSVGISTLALHRYYQRRERYPVRTVGLDLSPYMLAVAKTRDVNSEISEWL
HARAENTGLSDKSFDLVTIQFVTHELPSYVSQEIFAEAKRLLRPGGYIAL
VDNNPRSPVIQNLPPVLFTLMKSTEPWSDDYYTFDIEAALQAVGFAPPVT
VPSDPRHRTIIARKPI
>Ava_0328 Beta-lactamase-like
MQVIRLAALSDNYIFLLHDSQKNIAAVVDPAEAEPVLKQLAQLNAELVAI
FNTHHHNDHVGGNQQLIQNFPQLKVYGGAEDKGRIPGQQVFLQPGDRVQF
TDRVAEVIFVPGHTRAHIAYYFPPQTADTPGELFCGDTLFAGGCGRLFEG
TPAQMVESLTKLRSLPENTRVWCAHEYTLKNLQFALSVDSENTELQKRFD
EVKTKRSQGIATVPSLLGVEKLTNPFLRWEQPSLQSAVNSNDPVQTFARI
RGLKDKF
>Ava_3278 GCN5-related N-acetyltransferase
MNLPLYKVLKNGSTVELDYIKPQEYEDVRTLLNFVINEGKTYPQKQPLSQ
PEFAAYWLSQDAFVVRLSGDDGTQKPQKILGSFYIKPNFPGWSSHICNAG
FIVQPGLRGQGIGRFMGESMLSLASQLGYEAVMFNLVFETNIPSITLWQS
LGFDIIGSIPDAAKLADGKVVKALMMYRGVGV
>Ava_4288 conserved hypothetical protein
MPTLPMPSFLSSYWESSFNLKQHIQEFLQLDAEILETKLEFGQQQMAELG
HKDFDWEQATAFYRDKVGEVYLFELGAWHLASHDYIGDTLRLIADHAHGC
VLDFGGGIGTHAIGAALCPHVEQVVYCDINPINFDFVKYRAEKLGLSEKI
IFCVEIPPQKTFDTIMSFDVLEHLPDPSQQLLEFHQILADEGKMILNWHF
FKGFNQEHPFHLDDPQAIDTFFKTIQSNCLEVFHPYHITARCYRKWN
>Ava_1377 Glycosyl transferase, family 2
MEQPTLAVIMTCHNRRNTTLACLQALYQQTNHFDVYLTDDGSTDGTAELI
KAEYPNVKILQGDGNLFWVGGMHLAFGEAIKNQYDYYLWLNDDTFLEADA
LEKLLQVHQNLAAQGYEQSIVVGSTQDPITKQATYGGAVKSKKWYSNKFE
FLEPTLDVQKCDAMYGNCVLIPHSVTLKVGNIDTAFIHSLGDLDYALRAR
KLGCHVWAAPGYIGTCNKNSFRNSWVDTNLTVLERLRKILQIKGFPLQPW
TTFCSRHSGPFWVFYWFLPYIRAIIGYKNLATSPTFSEDPKEPKPEV
>Ava_2235 ABC-1
MTVKTLPPNSRPIEGELQLKDTQALVVRSSGTVSPERPQVLVTNNLEPET
IVYDPVAVAEHYRHRPLQVLRRIFAVLGPTFSFVFGLWWDTKRGVVVKND
RRRAIQLKELLTQLGPAYIKIGQALSTRPDLVPPIYLEELTRLQDQLPAF
PNEIAYQFIQEELGQSPEEVYAELSAQPIAAASLGQVYKGKLKTGEEVAV
KVQRPDLRERITIDLYILRGLAAWVQKKVKRVRSDLVGILDELGDRIFEE
MDYIHEGENAERFFQLYGHIQDIYVPRIYWEYTNRRVLTMEWINGTKLTQ
TAEISAQGIDARYLIEVGVQCSLRQLLEHGFFHADPHPGNLLATPDGKLA
YLDFGMMSEIKPPQRYGLIEAIVHVVNRDFEGLAKDYVKLDFLSPETDLT
PIIPAFGKVFANAQGASVAEFNIKSITDELSELMYEYPFRVPPYYALIIR
SLVTLEGIAIYIDPNFKVLSEAYPYVAKRLLTDPAPELRISLRDLLFKDG
RFRWNRLENLLRNARKNQDYDLNLVVNQGVDFLSSERGSFIRDKLVDEFL
NGINALSKNVLHNFTYLLRERVGITAINETPAATVEQQQTLEHIKNILNI
LRETRGFDPSQLAPQVAQLAFNPGVQRLGQQVANQLVQKATVRLIRELLT
AEEVKSSEIEYPGRRL
>Ava_2782 Protein of unknown function UPF0054
MVQVELDVQDCFLESSPEAAQASGYIDSQVSSATWEDWFHRWLEILDSSL
PPAPSYEIGLRLTDDTEIQAINAQYRQQNKPTDVLAFAALEADLPQNPEM
VAEPLYLGDIVVSINTAQRQAGQQEHSLSTELAWLTAHGLLHLLGWDHPD
EESLIAMLQQQVVLLDAVGIKININY
>Ava_3943 CBS domain containing membrane protein
MRAKQIMTQDVATIRGSASVAEAVRLMRLKGLRALIVEPRHSADAYGIVT
VADIAGKVIAYGKDSENVRIYEIMSKPCITVDPDLDVEYVARLLSTTNLW
CAPVIKGELLGVISITDIVSKGDCIPKPKLTFLRKELHKAISDARGISAN
YGSDSKRAIEAWDLVDELEVEACFYGLPKPDKSARELFADKPELVTV
>Ava_4686 Predicted signal transduction protein containing EFhand domain
MTTEQELQSLFNTLDRDQDGKISINELFLSPGLSAVISSETNTNSPQELL
VQYDSDQDGSITFEELKKAVKKASNLT
>Ava_2045 ABC transporter-like
MRETVLEVRNLQVEFSGDDNAVKAVDGVSFQLHRGETLGIVGESGSGKSV
TSLAVMGLLQHPGRVSGGEILFCPQANANPINLSALSAEEMQLYRGGDIA
MIFQEPMSSLNPVYDIGFQLTEAILRHQNVSQTEAKQIAIAGLQEVKLLP
SDEQIKQQYIETWPQTNPNSPLDEFKLAQLVKQHKETMLERYPHQLSGGQ
LQRVMIAMAISCNPLLLIADEPTTALDVTVQATIIELLRELQQKREMALI
FITHDLGLISEIADQVAVMYKGKVVEYGAAEQIFSNPQHPYTKGLVACRP
TLNRRPHKLLTVSDYMSVEETSSGQLIIQAKEPAHPPEITSEEISARLEN
LEEKQPLLQIKNLKVGFPVKGWFGGTKRYQMAVNDVSFDVKPGETLGLVG
ESGCGKTTLGRTLLRLIEPISGQIIFDGQDITHFKGEPLQKLRREMQIVF
QNPFSSLDPRMKVGDAVMEPLLIHSVGKTTRQRRERVAELLERVGLSADA
MNRYPHQFSGGQRQRVCIARSLALNPKFIICDESVSALDVSVQAQVLNLL
KELQDEFQLTYIFISHDLSVVKFMSDRILVMNRGQIVEQGTAESIYREPK
EAYTQKLIASIPTGSPERVRSHHLKTS
>Ava_4102 Major facilitator superfamily MFS_1
MTQHPKNTGMQTFTIIWFGQMISLIGSQLTNFALGVWVYQRTGSVTQFAL
ISLFTSLPMILISPVAGTLVDQFPRRWMMLFSDLGAGISTGVIAILLATG
DLATWHIYVGAAISSCFGAFQWPAYTAATTLLVPPEKLARANGMLQVGEA
AGRLVAPMLGGILLLFLEIDGIIFIDFATFLFALSTLLLAPFPKQYIDRH
RAEKTPWLKEASSGLVYLVNRRGLFALLLFFAVNNFLVGIVQMLITPLVL
SFGSATDLGTIMTTGGIGMLVSSILVSTVRMPQYLALSIFTFMLLGGICI
TCAGFYQSILALALIAFLFFFGLPIINSSAQVIFQKKVPSSLQGRVFATI
GAIANASQPLAYTVAGPLADKIFEPLMAQNGLLAESMGKIIGVGQGRGIG
LMFIVMGILTVLATIIAYQYKPLRLVERQLPDAMNPSC
>Ava_2361 conserved hypothetical protein
MLSVLLLIVINAFFVTAEFSIVTVRRSRIHQLVQSGDIQAIAVESLQRNI
DRVLSTTQLGITLSSLAVGWIGESSIAVVMRWWIKSWPLPANVNNFVAHS
LSIPIAFFLIAYLQIVLGELCPKSVAMLYSEQLARFLGPAVKAIVRFFRP
FIWILNQSTSYLLRLFGVEYTGQSWRPPVTPEELQLIISTERESTGLQTA
ERELLNNIFEFGDVTAQDVMIPRNGIIALSKDANFQSLLQQMTATGHSRY
PVIGESLDDIRGIVYFRDLANPLAVGKLSLETQIQPWMRPARFVPEHTPL
SELLPMMQQEKPAMVIVVDEFGGTVGLVTIQDVIAEIIGNAGETSSSEDL
LIQMLDQETFLVQAQVNLEDLNEVLHLNLPLTKQYQTLAGFLLYQLQKMP
IKGEIFCYDNIEFTIVSVDGPRLHQIQLRRLG
>Ava_0574 conserved hypothetical protein
MKDQQQSDSKEFVGNLKNGIWLFGLSSWVFGITDRSIASFADGYLSALDL
TQLFTAATFFVAWLFLKPTSKV
>Ava_1673 Major facilitator superfamily MFS_1
MFPKLILLATLYICQFIPTTFFIQALPVFMRQQKMSLDLIGFLGLLILPS
GLKFLWSPFIDRYRLGKLGHYRGWIICFQLLLISTMLVTAFLDIQDNLNA
LLTCMFLASLFSASQDIATDALAVNLLEPQERGLGNAIQSGGNIFGAILG
GGVMLILLDKIGWRYSLITMSIFMLVNLVPILIYREKSQHQLENSTFFRS
YFQPFASFLSRPKALPWLFIVLLYMMGDSVTSLMIRPLLVDRGLSLPDIG
WILGIVSYSARIFSALIAGLVIVKLGRAKSLIIFGLIAALTTLLYVIPAI
GMSSLIVLYTVCIVVNSTQSMAYTALLSAMMDRCEKNTAATDYTIQVSVM
FLGGIAATVLSGMLASTMGYSFIFIISTVVSLLSVFLITNKYGVTS
>Ava_0887 GCN5-related N-acetyltransferase
MNANEIFLRPAEKQDAWVLSAVHIAAIKALPTTFYTRQELLAWRNHLSKP
DGSHILKRMKSETFWVAVEGNNIVGFTSYIVDELIALYVHPHYQGRGNGR
ALVEHFFHQATQQKVDKVITTASLYAEGFYLRLGFRAIEKSPHQLSKGVI
VPVIKMSKELIKIT
>Ava_3507 Protein prenyltransferase, alpha subunit
MLKQLWQWLKRSFGRLFGRKHSPVREQNKVEPPQRLTDAEYESLFLQLLA
EVNDGLTRGEAKGFLAAKHINEGDLVEWLRGFGERLFASAKPNDELVSRM
VRLGELSIGEVSDVAGDIGRRLGGGETNRRGAEDAEEEETSNQFNDAIEL
TSDEAEAWLNQGVALANLGQLEQAITSFDKAIEFKPDDDSAWYSRGVALC
NLGRFEQAIASYNRAIEFKHNFPEAWTNRGVILNSLKLYQEALTSFETAL
QINPNFPEVFNAWYGRGNTLFNLEKFEEAIASYDKAIEFKADDYSAWYNR
GVALDNLGQFEEAIASYDKAIEFKADDYSAWNYRGVALANLGRFEEAIAS
YDKAIEFKADDYSAWYNRGVALSNLGRFQEAITSYDKAIEFKADFYIAWM
NRGIVAGNVIVERIDFSTFPLPHAAAHNLALKFNNPDLNKRGYEGRLASY
EEGLKHCQQETHPEGWGKLHRAIGDSHYYQGRGNYNTRYFWRKAINSYKT
ALQTLTATNFPELHLEVLQDLIRVLLDLGEIAEATELQRQGTELLRRLLN
EPNRSERSKKQLALKFAWIQQLTVDLAVQSGDLLQAIELAEEGKNTCLRW
LLDGWSDEISSPNYSEIQQLLNPSTAIVYWHLSSYALHTFILKHNAPSPI
VLGNTECLTQAQRLRDFEAWVKKWNEQYANYPKDKDKQGEKDRTWRDNLP
EMLRNLSHILDINAVVSTIPDITQLILIPHRDLHRFPLHALFPPEFTISY
LPSAKIGSISVKKDNYNQKNLLSIEHPNSTGYPSLDFAEIESEAISQMFA
NPTRLHSEQATQKALINALPQSYNIFHFTGHGVYNFQNPALSFLALADED
KLTLADIHGFKLQSYQLVTLAACETAITGNHTITTEYVGLVSGFMGCGVA
HVVSTLWTVESAASALVMIQFYQLLQQGKPETIALAEATQWLRNVTNAEL
AQWYAAQLAKVPENQGLLYNCWSRHLNKLKNNPEPSKQPYNHPYFWAAFT
ITGNFSQ
>Ava_1308 von Willebrand factor, type A
MKVQLLRALSDTNIDATQLSSQRQLAISISAVAEQFEQNLPLNLCLILDQ
SGSMHGKPLKMVIAAVERLLDRLQPGDRISVVAFSGSATVIIPNQIVEDP
ESIKTQIRKKLQASGGTVIAEGLQQGITELMKGTRGAVSQAFLLTDGHGE
DSLKIWKWEIGPDDSRRCQEFAKKAAKINLTINTLGFGNNWNQDLLETIA
DAGGGTLAHIERPEQAVHHFNRLFARVQSVGLTNAYLILSLAPQVRLAEL
RPIAQVAPDIIELPVEPEADGSFIIRLGDLMKDENRVVLANLYLGKLPEG
QQVIGNVQVRYDNPSLNEEGLLSQIWPIYANVMQAYQADFDPHVHKSILT
LAKYRQTQLAETKLRQGDRSGAATMLQTAAKTALQIGDNNAATVLQTSAT
RLQAGEKLSNADLKKTRIVSKTVLQSF
>Ava_1542 Protein of unknown function DUF6, transmembrane
MPFTDGKGELAALVAAGLWAIASVLYGIVGQRIAPLQLNLIKGIVAIAFL
SLTIWLTGESLPISDHTQILLLCLSGVVGIALGDTAFLAAINYLGARRVL
LIGTLAPPITAIAANLVLQEQLNFRAWCGILLTIMGVAWVVTERVPDTGG
KRVEDSLLMRGLGFALLAAMTNAAGTVISRAAFAIGNVTPLWAALLRLSA
AELILVVGIWLSYKRQMSSYFYRESWRVILISCFASFCGTYLGIWLQQTA
IKLTAAGVASTLMQTSPLFVIPLSLCLGEKVSWRSLAGVIIAIAGIGLLF
YLK
>Ava_3324 PilT protein-like
MEWLAQLQGQIVGLDTAPLIYFIEENPNYLNVADAFFEALFRGEFSVVTS
VLTITEVLVYPLRQGNTILAQQYRDILLNSQGLTTIEVFPDIAETAAQLR
ANHNLRTPDAIQIATAIRGGASFFLTNDARLPSLPGLPVLVLEQLIV
>Ava_4970 Phosphoribosyltransferase
MLFKDRTVAGQVLAKKLADYANRSNVLVLALPRGGVPVGFEVARALNAPL
DVLVVRKLGVPDNEELAMGALAKLPGTGFASGGVRILNQSIVNDIQISDE
VIARVAVQEERELERRESMYRGDRPFPNLQGQTVILVDDGLATGATMWAA
IIAVRQQQPKEIVIAVPVAAPETCDEMQNKVEKIVCANTPSPFYSVGMWY
EKFPQTTDDEVRELLNKANNNHEPLLSGN
>Ava_2803 Serine/Threonine protein kinase and Signal Transduction Histidine Kinase (STHK) with GAF sensor
MLNIPGYTISEELYDGSRTLVYRAIRQADTLPVVIKLLKNPYPSFSELVQ
FRNQYTIAKNLNYPGIIATHSLEPLHNGYQLVMEDFGGISVKDYFVHNHY
VASLDKFLQIAIALCDILDILYRQRIIHKDIKPANILINPETTQVKLIDF
SIASLLPRETQTLINPNVLEGTLGYISPEQTGRMNRGIDHRTDFYSLGAT
FYELLTGKLPFPSEDAMELVHCHLAKAPTLVHEINLTIPSVLSDIVNKLM
AKNAEDRYQSALGLKYDLEKCLVQLQETGKIESFPIAQRDVCDRFIIPDK
LYGREAEVETLLQAFERVSTGNTEMMLVAGFSGIGKTAVVNEVHKPIVRQ
RGYFIKGKYDQFQRNIPFSAFVQAFRDLMGQLLTESDAQIQQWKSQILAA
VGENGQVIIEVIPELAGIIGQQPPIIELSGTAAQNRFNLLFQKFVQVFKT
QEHPLVMFLDDLQWADSASLNLMQLLMSESGGGYLLLIGAYRDNEVSDAH
PLMFSLAEIRKAQATINTITLAPLSQSSLNQLVADTLSCSSAIAQPLTQL
VYQKTKGNPFFSTQFIKAIYEDGLITFNGDEGNWQCDIVGVKESSLSDDV
VEFMALQLQKLPQTTQNILKFAACIGNKFDLGILAIIWEKSLTETATALW
KALQEGLILPQSEVYKFYVDREIQDEVKGSQTVTYKFLHDRVQQAAYSLI
PAEQKQYTHFQIGQLLLQGLSQGEQEERIFDIVNQLNMGLDVITSNEQKQ
ELAHLNLKAGQKAKLSAAYQAAYDYCTIGMSVLSPDAWQQDYPLMYSLHR
DASEAAYLCGKFDQAEALYAETLTYAQAPLDQAIVYRIQMTQYQLQGRNA
EAIAIQRQSLQMLGWTIPTQPEMIQAGLDAEIATVQQFLEQHTIESILAA
PKMVDPSIAEMLRILQILFYAAWLDGQSTLALLALAKMTTLSLQYGNSDM
SPFGYAGYGLIANGVFKDCATAYEFGEMAVQLCEQFDNADVRGMTNFIFA
ADVHSWSRPIREADTYYNNAYQYGMEAGNWLTVGFMMMQSGSDRLTYGKH
LDDLYAIAQNHAAFLHQIKSLDNLDALTAGVLQPIRHLLGLTKTLFTFDD
DDFSEAEYLQKYANAPYHLSWLYSVKIRHAYLFDQKSTYSDLIPQLSMIE
TTISSHAKVPSSVFYVALMHLALAETATEASERQHHWQALLPLETSLKRW
LKACPENIRHKFLLIQAEKARIKKQKTKAIELYEQAISQAQANQYGYEEA
LANELAAKFYLDWGKVKIAQVYMQEACYGYARWGAKAKAHHLEKTYPQLL
KPILQQQRINFNPLETITFRGATSSTHTTTTSSTNISEILDFTSVLKGAQ
AISGCIELDELIANLTRIILENSGAKKSVLILPLEETWQVKAITSVNQES
SSHTNIQTILSSQSIEDCQDIPPKIINYVKNTQKPLIIDNCQTDIPGVIG
EYMLEHQPKSVLCTPIIHQGHLVGILYLENELTSEVFNSEHLQVVNLLSS
QAAISLENARLYQKAQQALQDLQQAQLQIVQSEKMSALGNLVAGVAHEMN
NPLGFIAASLMQAQPIIADITEHLKLYQENLPDKIEKIADHAQEIDLEYI
LEDLPKMIESMTMACERLKNISTSLRTFSRADRDYKVPFNIHEGIESTIL
ILKHRLKANEQRPAIEVVTEYADSPMIECFPGQINQVFMNILANAIDALD
EANMGRSFAEIQANHNKIMIRTLLENQQVKITIADNGKGMSEEVKAKIFD
HLFTTKMVGKGTGLGLAIAQSIVVEKHGGTLTVNSTLGEGTEFVISLPIL
DESQN
>Ava_0419 TPR repeat
MYKHISFVLSVLLLGGGTATIPTIAQGQVLVVQANNAELKRLLEDGKRLV
DAGDYNGAIAVYQQAATMEPRNARIHSGIGYLHAQQGNFQAALASYRRAI
AINPNNSDFFYAVGYIKGNMGDTPGAKEAYRRAIQLNRNNVSAYVGLGIT
QSRMGDFQSANWAFEQAIKLDKNNAQTYEFMAAMYKQRRQTKQASNLLQK
ARDLYQRRNDADGVARVEAMLQQL
>Ava_3246 Peptidase C14, caspase catalytic subunit p20
MNALFSQGHACIVGVGCDLPNTVDDAVGLANILKDQERCAYSSEQVHLLT
KEQANREGILAALDQLAQSTTPDSTVIVYFSGHGYQVSSPIGEAYYLMPF
GYDQTKLHKTAISGAEFITKLQAISAKKLLVLLDCCHAGGLGDTSKLGYE
AQKAPLPPEAQALFNQGKGRVAIASSQADEKSFAGKPYSAFTLALIEALA
GKGTSQKDGYVRVADLAMYAREVVPRRTGDRQHPILNFEQADNFILAYYA
GGETEPKGLPFEGEPEIEPEPGAFNQPSTNNSVIQVVTQKNSKYNINTGM
GNTVINDSN
>Ava_4500 Zinc-containing alcohol dehydrogenase superfamily
MKAVCWHGTNDVRVETVPDPKILNPRDAIIKITSTAICGSDLHIYNGYIP
TMQSGDILGHEFMGEVVELGSAVKNVKVGDRVVVPFTISCGSCFFCQRDL
WSLCDNSNPNAWMVELQMGHSPAGLFGYSHLFGGYAGGQAEYARVPFADV
GLLKIPDNLPDEQVLFLTDIFPTGYMAAENCNIKPGDIVAVWGCGPVGQF
AIKSAYMLGAERVIAFDRIPERLQMAKEQCNAEVLNYEEVNIGEALKEMT
GGRGPDACIDAVGMEAHGTDLMAFYDQVKQAVRLETDRPTALRQVIVSAA
KGGHVSLAGVYGGFLDKIPMGSAMNKGLTFKMGQTHVHKYLRPLLERIQN
GEIDTSFVITHTLPLEQAPHGYEIFKHKKDNCIKVVLKPSGN
>Ava_0894 TPR-related region
MSNGEGNNSINNKVPVQVAKYPKQVRRNTITHEFDQGEPKESLVPPENDN
HSSEASKLLRQGIQQQQAGDLIAAIKSLQQSLEMFQLARDVKQQEQVLSL
LALIAYTSGDYRNVICYCQKCLSLTDTPDLSVRMQILSHLGNAYRHLNDY
NKAIEFLEACLQLTQTLQDKRSQVAALNNLGLVYKASGNFNQAIAYQEQS
LIIVEELKDSWGIEQVLKNLGNTWYALDNYPKAIAYYEKCVKIALSLNNP
RSAAQVLKNLGNACYAIGDYAKAIKYYEKRWQLARELKDKRSEEQSLGSL
GVACEALGDHSRAITYYEARLLLARSIKDQRIEEQALASLKIACYALGDY
AKAMQYERGTSTST
>Ava_0381 GTPase EngC
MRVFTTGQLLGTVVAVQANFYKVQLDQDVREPGSRGAGEEVHLDSPLPLY
PLSLLLCTRRTRLKKIGQQVMVGDRVVVEEPDWAGGRGAIAEVLSRQTQL
DRPPIANADQILLVFAVADPPLEPYQLSRFLVKAETTGLDVVLCLNKSDL
VSPEIQQQISDRLLAWGYQPLFISVEKQINIDQIAKYLSNKITVVAGPSG
VGKSSLINALIPDINLRVGEVSGKLARGRHTTRHVELFELPNGGLLADTP
GFNQPDVDCSPEELVHYFPEARERLAVASCRFNDCLHRDEPDCAVRGDWE
RYEHYLEFLADAIARQTQLYQQADPESTLKLKTKGKGQSQYEPKLESKKY
RRTSRRTQVQGLQDLYQEEE
>Ava_3584 Serine/Threonine protein kinase
MLQLLDRYVPLQTLGAGGFAQIYTVWDEKTQTEKVLKVLIEDSPKALELF
TQEAEVLVRLRHPGVPKVEADGHFQVNLSNPKPRQLPCLVMEKINGPTLE
EMLNKYPQGCPENLVLNWLTQGIKILQELHKHQIIHRDIKPSNLMLRTPS
PSVISQGGTGWEQLVLIDFGGAKQFNTGRQRQESSSTRLFSSGYSPPEQV
TGGHIGPSVDFYALGRTMIELLTGKYPPELEDPQTGVLHWRNRVTIRPEL
ADLLDEMVHEDVRSRPANAAMIQKRLTKINQPASGQNNFQQWLTQLANQL
ALLNQSTEQVFSNLGQTLGKILRWIAQTIVKIITACFSTIWAMLLTGLGA
SVGTIAGFILAYRTNLGDRLIEFIASQLPELIPNPESGFGAEIIVFAAAG
LGTAWGLTASGCFAQRRRFLVASFMGIISYAFGWLLWQIITSTADSGEGL
VGATSIAVFLLALSMSFRSHHIVYAMIGSFGTAIVVAILIVLGFPTTVLQ
FSNRHLWSELSLPIIFFSSIGILMSFWLGVSYYLIVPGLRFLGWR
>Ava_0206 conserved hypothetical protein
MKVAITGATGFVGTRLVQRLHKEGHQIIVLTRNTASARRHFPAQTFANVE
IVAYTPTTSGAWQDVIAGCDGVVNLAGEPIAEARWTPEHKREILNSRQLG
TQKIVEAIAKANPKPTVLVNASAIGYYGTSETTTFDENSPSGRDFLAQVC
QAWEAEAQKVKQSGVRLVILRLGIVLGLGGALGKMITPFKLYAGGPIGSG
RQWFSWIHIDDLVNLIVQALTNPQLEGVYNATAPHPVRMTDLSQTMGQVM
NRPSWLPVPAFALEALLGDGAIVVLEGQQVLPKRALEAGIKYQYQNLQPA
LQEILQ
>Ava_0817 Histidine triad (HIT) protein
MSSYNRMSETTETIFSKIIRREIPANIVYEDDLALAFTDVHPQAPVHILV
IPKQPLAKLSDADSHDHALLGHLLLTAKRVAQKAGLENGYRVVINNGNDG
GQTVYHLHLHILGGRLMAWPPG
>Ava_0881 Small GTP-binding protein domain
MIQRKICMVGAFATGKTSLVARFVYSIFSEKYQTTVGVKIDKKIVNLPEN
PVNLIIWDIYGEDELQKLQMSYMRGCSGYLLVVDGTRKNTLETAYRLQNS
LEANFGRIPFVLVMNKWDITDEWEVDPAEINSLINKGWNVVETSAKTGIG
VEEVFQILAQKIMET
>Ava_2403 Beta-lactamase-like
MPSNHATIEDIALNIARKRLENAVQITFLGTSSGVPTRARNVSSVALRLP
QRAELWLFDCGEGTQHQILRSDLKVSQLSRIFITHLHGDHIFGLMGLLAS
CGLAGNVQRVDIYGPSGLNEYIQSASRYSHTHFSYPIKIHTVRPGVIYED
DEFTVTCGLLHHRITAFGYRVAEKDRAGRFDIEKAKELQIPPGRVYGQLK
RGETVTLEDGRVINGAELCGPTEIGRKMAYCTDTIYCDGAVELAQDADVL
IHEATFAHQDSEMAFQRLHSTTTMAAQTALAAGVRRLLMTHFSPRYAPGN
TIELKDLLQEARAIFPRTDMAYDFMTYEVPRRREPILSSVSSSSV
>Ava_0011 conserved hypothetical protein
MNIGYFQKIPVMPLVLLWLAYALLGWYLAAHHIVWLVGAFIATIAIAVVR
KSISWLESIVSFGSRTLVVVIVLSASIALVATWSMLLSLFLIPIATTLLA
DLELRFAGFKKIDSFWILTVLAGWGLTVGEIVDILLLPSGRY
>Ava_3201 Putative esterase
MNQQKSSIFLLLSVLSLIGCNYWPISTAQVPTKTAAKLLSQANTDILATP
LTYQIETYKSQVMGGNRTYGVSLPPGYAQNSQQRYPVIFLLHGGHGEPIT
WFDKDRGEALKTLEKLYTTGKLQPSIIITPDGNDQRGSSPYWDPEYIDGS
NGKVSTAVGNELVKVVKSRYRTLPNPHFWAMGGLSSGGWGAVNIGLHNLN
NFSILFSHSGYFRDKSGPFNSPITFIKDVPTQAKKRLRIYLDTGSSDIDE
VKEGEKFSQELARLNIYHVFRQFPGSHTWQYWREHLTNSLTFVGEQFKAA
QAASKDKNLRHD
>Ava_4647 GTP-binding
MGLTDNYKLNLIQWYPGHIAKAEKNLKEQLSRVDVVFEVRDARIPLATHH
PQIDEWVGNKARILVLNRLDMIPPQVRSLWIDYFQNRGEVPYCTNAQHGQ
GVAGVAKAAQAAGVELNQRRRDRGMLPRAVRAVVIGFPNVGKSALINRLL
GKRVVESAARPGVTRQLRWVRISDQLELLDAPGVIPVKLGNQEAAVKLAI
CDDIGQASYDNQLVAAALIDIVNSLQEQAGELLPRSPLYARYELDPTPHT
GEGYLHALAEYRHKGDVERTARQLLTDYRKGLLGTLPLELPPM
>Ava_4415 Alpha/beta hydrolase fold
MHHIGLTRPYSWGIKNFELITVTEKLNIHIQGQGFPILGLHGHPGSGRSL
SVFTNHLSKRYQTIAPDLRGYGTSRFRGNFTMQDHLTDLEALLDRLQIEK
CLVLGWSLGGILAMELALRLPQRITGLILVATAARPRGNHPPISWQDNLY
TGVAGLLNYIKPGWRWNIETFGKRSLFRYLIQQHTPTAYNYIAREALPAY
LQTSPSATRALNSALRLGYNRLADLEQIHCPSLVLAGAQDKHITADSSLE
TAQHLKQSQWQCYPHTAHLFPWEIPQQVINDIDLWLANNSQVMSN
>Ava_4279 conserved hypothetical protein
MCRLLAYLGSPVSLEYLLYKPEHSLIVQSYQPREMTSGVVNADGFGVGWY
HSQKDTEPFTYKNTLPIWNDINLPSLSRYVESKCILAYVRSATAGQALDF
ANCQPFHYKQSLFIHNGYIENFRKTLHRKLRSTLTPDFYEHINGSTDSEH
IFALLLSQIQSNKHRSVESVLRNTLLMLWEMAKRHQVNASANVVFSDGHR
LIASRFASSSSPPSLYWLKDDLTFPNSVIIASEPLFAGNWTACPENSIIS
VGEDCDIKIESI
>Ava_2602 inner-membrane translocator
MITNKRFQLLLPILSPLIAIISALIVGAILILLAGANPITAYTALFQESL
STYFGFGNTLTKMTPLLFTSLGVLIALKASQFNIGGEGQIYLGALGSTLV
GLYVQGLPAIIHIPLALVAGFGFGAVWGWIPGYLKAVGGVNEVITTLLLN
YIAINLISYLVQNPLKAPGAPSPYSPLIAKSAQLPIILPQSLAHAGILLA
LLTAGLLWVLLGRSPLGYQISAVGFNPTAAHYARISVKNTIMLVMSLAGG
LAGLAGASEVMGLKYRLFEQVSPGYGFDAIAIAFLSRGSINGVVLTALFF
AALRSGANVMQRSAGVPVTVVYAIQGFTVLFIAISLAVETQIKTQSNAEI
>Ava_0348 TPR repeat
MRGRVFFYLSQKLYSEGRWQEAIAPFQKLIELNQGSVDIYWNLSQCYRNL
NLLEQYFQTLEQGIQNYPTDARLYFTLIIDLRRNGRTQEAIEYAEKACQY
LPNDYTFQILKYLTVPTIYNNPEEILLYRQRYTQGLQNLIAQTSLQTIED
KENALSGISRLTNFYLSYQAQNDVDLQRQYGKLVHEIMSANFPQWIVPLS
MPKLEPHQKIRIGYASHYLHSYSGTLWLTGWLKYCNHSNFEIYCYYTGNE
PDAITDKFREYSDYFYHIPYNLSAVCEQIITDKLHILVYPEIGMNPQNLL
MGALRLAPVQCVAWGHPVTTGLSTIDYFLSSDLMEAENAQEHYSEKLIRL
PNIGVSYPKPYIPLVLKTRADFGLSDDDILYLCCQAPFKYLPQYDFIFAE
IASRIPQAKFIFLRGTLLQERLQKAFGNLGLKFEDYCVFLNIPERLDYLM
INLLSDIYLDTFTWSGGNTTLEAIACNLPIVTCPGEFMRGRHSDSFLKML
GVTDTIAQNEGEYIDIAVKLGQNPAWRREISERISQRHDKLFDDQVCVTG
LEEFYKQVVELSS
>Ava_1890 Mutator MutT
MNTTTTPPHKIIGVAVIWNDQQQILIDRRRPGGVMGGLWEFPGGKIEPGE
TVEQCIQREIYEELGIFIEVGESLITIDHTYTHLRVTLTVHHCRLLKGIP
QPLECDEVRWVTVDELGDFTFPEANSEIIAALKRVGQLTTDN
>Ava_3635 Phycobilisome protein
MTVISQVILQADDELRYPSSGELKSISDFLQTGVQRTRIVATLAENEKKI
VQEATKQLWQKRPDFIAPGGNAYGERQRALCIRDFGWYLRLITYGVLAGD
IEPIEKIGIIGVREMYNSLGVPVPGMVEAINSLKKASLDLLSSEDAAAAA
PYFDYIIQAMS
>Ava_1487 GCN5-related N-acetyltransferase
MSYPQIQFRNRQSEVDLYQLQQLFNISAFWAKGRSIEDLGVAIANSDPVI
SVWDAERLIGFARATSDGIYRATIWDVVIHPDYQGNGLGSKLVETVLAHP
RMRWVERVYLMTTNRQEFYEKIGFHANNSTTMVLYNQSRISSLATTEVQL
QESLGG
>Ava_4069 HAD-superfamily hydrolase, subfamily IA, variant 1
MSYQAIIFDLDNTLLNFELCERQAILGALSDCAVSLDIYKITETIFLEVF
ESYNSQYWQKRDVLSPIEITELSYQSTLAHLNINTDKTNHLSQSFWHIFN
HSAVTESGVYELLTFLKRNYRLAVITNGFISAQVPRMQAAGIDHFFEEVV
VSEAIGFAKPSPEIFHYALSKLDLTPSQVLYVGDSLSHDYAGTTQVNIDF
CYYNRKNQALPKEVKPKFIVNQLLDLLELVK
>Ava_0434 Serine/Threonine protein kinase
MSLCINPQCSKPQNPDNVLFCQNCGSELLLEGRYRVVSVLGGGGFGKTFA
VNDTRTQTAKVLKVLINNHPKAVELFQREAEVLALLNYPGIPTVEANGYF
VYFPRNSQEPMHCLVMEKIEGLDLGQYLRQRDYRPIDQKLALQWFKEVMI
ILHQVHQQGLFHRDIKPSNIMLRADGRLVLIDFGTARSVTGTYIAKQAVG
QVTGVISAGYTPSEQINGQAVQQSDFFALGRTFIYLLTGKEPSDPTIYNY
LNDELRWRDGLLSSGSNSVTPTVGDRPNILPQFADLLDQMMERLPAQRPQ
NTHIILQRLADIEKSLQPPPPPPPKPKGWTRRRLIAVVGFSSLGLTGALA
LSRLLPKTTLIVSQEGGGDYKTISAAIENAQPGMRILVRPGLYQESLVLD
KALKIIGDGVKAEIIIESKDAGCLVVKTDQAEVRGLTFRSRVGTENKQYF
AVDISQGQVILADCDITSDSLSGIGVHGATANPVIQKCQIHDGKQSGIYL
YENSRGTIEDCDFFGNTTTEITVDAAQPIIRRCKIHQDKEGGILFRNQAQ
GIVEDCDIFNNNLSGIEIRDSSNPTIQKCRIHDNQQGDGILVHLNGRGTV
EDCNIFSNGFSGVEIRDRGNPVIRRCSINKNKYYGVYAYKNSTGTVENCD
LTGNIRSAINVDETSQLQRSGNVE
>Ava_3419 UbiE/COQ5 methyltransferase
MATRPKTIWESFLSPVVRFLIDEDKWRRYAQSIDWEKESDRFRRTDVIIP
SYYTSHNFHGIEGGYLNSSAAVTYDPITEYVLPPNETLVRQALIDAVKVQ
PRRILDLGCGTGSTTLMLKQAFSQADVIGLDLSPYMLVRAEDKARIGGLD
ISWRHGNAEKTSFPDASFDLVTAALLFHETPVEVSQAILQECFRLLVAGG
QVIILDGNQKSLRQLNWLNDIFEEPYIREYAADSVDARMGAAGFAKVRTE
DLWLIHQVTSGIKPILATDTNQAKERQYMAAIDNNNLEGLESPAFGIVA
>Ava_2341 Peptidase M16-like
MTSTLRKLPRLNAPKLHTLPNGLTIIVEQMPVEAVNLSLWIDVGSSVESD
AINGMAHFLEHMIFKGTERLASGEFERHIEERGAVTNAATSQDYTHYYIN
TAPQDFAKLAPLQIDVVLNASIPDEAFERERFVVLEEIKRSEDNPRRRTF
RRAMETAFAELPYRRPVLGPESVISQLTPQQMRDFHASWYQPQSITAVAV
GNLPEEQLIETIVEGFNQLKKTPPSPLPTPRPLNLEPAFTEIVRREFVDE
SLQQARLIMVWRVPGLNQLEQTYGLDVLAGILAHGRTSRLVQDLREERGL
VTSISVSNMSNRLQGTFYISAKCAVEDLQAVEEAIAQHIRKLQTELVTEK
EIARVRKRVANRFIFGNETPSDRAGLYGFYQSLVGDLEPAFNYPAHIQTQ
EAPDLLLAANQYLCPEAYGVVVMKPA
>Ava_1592 Serine/Threonine protein kinase
MNHHMIGKILQARYQIVQNLGSGVFGQTYIAVDINYPHQPKCVVKQLKVN
SFHSSQLDTIRLRFLTETETLKHLGQHPQIPNFIACFEENERFYLVQEYV
AGHALTAELPIAQNWGSLWREDEVITFLEDALSILQFVHSQGVIHCDVKP
ENLIRRAVNGKLVLIDFGSIQSVNFGIDEQLSIYQVPATSLGYIPPEQFI
GKTQINSDIYALGMIAIQALTGLEPLQLKIDPDSNEIIWRFADTPVSDYL
AAILSQMIRYNFQERFQSAAEVLRVLQQMKWETSLPQLLQTQQIEYRQEP
NSDQPSPLITGMKVGLAVNTLLMGLGTYSLLSTSPANTETEILYKATKEY
QDGDLKRAIALAKLIPSHSNVYPDAQATIDEWQQQWQTAAKQYSLAEQAL
LESRWSDVFVAASEVPNISYWQSKVKDVVEKANVNIEAQTQNLLAKAYDK
ARAKDFSSALEYLRQIPQESSAGALVQQKLAEYNQKKQIRAAYFLHQARK
QALAGNFNHAVNYLRKIPQGTPVYAQAQAKLNEYTQKLRPQIQKAKSLIA
HNSVIQVGNVQSEIQLPEVNIR
>Ava_4364 ABC transporter-like
MNAKILETENVTVSFDGFKALNQLNFSMDVGELRVVIGPNGAGKTTFLDV
ITGKVQPTIGRVLFKGKNLRSLREHQIARRGIGRKFQTPRVYLNLTPREN
LEITSNRNKNVFSTLFGRSQPTEENSIKGLLETIGLTPKADIPAALLSHG
EKQRLEIGMLVGQSPDLLLVDEPVAGLTDEETYNIGELLLTLAQSHSILV
IEHDMEFVRQIAKKVTVLHEGSVLCEGNFEEVQSDPRVVEVYLGQQ
>Ava_4509 Small GTP-binding protein domain
MSLVVYPNPVRNLQETHLNRARASLRQALSWYGYLRKSGHLSSNPELAGL
VKPEIEALNSTLNKLDSNVIRIAAFGLVSRGKSAVLNALLGSKILQTGPL
NGVTQWPRSVRWQPGGKVIVELIDTPGLDEIQGESRAQMARDVVRQADLI
LFVVSGDITRTEYQALLELRQAQKPLILVFNKIDLYPDTDQAAIYRNLQQ
LGAGNPQAKPLLPDEIVMVAAEPAPMEVRVEWPDGRVSYEWETPPPQVDE
LKQTILNILNREGRSLLALNALIQARDAEAAIAQKTIDIRQQEAEDIIWQ
FTKYKALAVGLNPIAFLDILGGTVADLALIRSLARLYGLPITSYEAGKIL
KTIFISSGGLLLGELGSSFLLGLGKSAAALTSGDNPTNVTAFAGSAIAQA
GIAGYGAYSVGKAAQVYLEKGCTWGQLGASTVITEILSQVDQNTILYRLQ
QELGMKY
>Ava_3200 Putative esterase
MKIPKFFMGIVGAIAILSATGYYYVFILGAPQLDPPQEEANTGLKFQLET
FNSQAMGTVRNYGVILPPGYDKNLQKRYPVIFLLHGGHDDARAYVDKYAI
LNILAELYKSKKLPLSIVITPDGNDNRGSSPLYDPDYFDGANGKVGTLIG
SELVQVVKSRYRTLESPQFWALGGLSSGGWGALNIGLRYLNNFHIFFSHS
GYFTDNSGPQNSPQQIVQQLSPEDKKRLHIYLDAGVNDTNLLASTKVFHQ
TLNKLGIANVFYSFPGGHGLSGADIGWNYFHKHLKDSLTYVGEQFNNSTR
K
>Ava_0689 GCN5-related N-acetyltransferase
MLIRPATPADVPAVLPMVAKICAVHESWDADKYGFLPQPEKRYQRWLTRL
ANQEGSVFLVAENRGQLVAFVAATVEQEIPIYRTKEFGFIHDIWVEPEYR
QQGIAKQIVELTIERFRQMGVEQIRLDTAAINEAARKLFISCGFRLSTME
MLITL
>Ava_2108 Rhomboid-like protein
MVPIRDNNPVTITPYVTYGLIAANVLAFLYEASLPPQALDGFLHLAAVVP
RELSLSFAGVSVHQPVPEWATLITSQFLHGGFLHLAGNMLFLWIFGNNVE
EKLGHARYLLFYLACGVLASLTQWYFSQDSSIPSLGASGAIAGVMGAYIL
RFPNAEILGVVPLGFFFPTFRVPAYFFLGFWFLQQSFYGLAGLQTRTNIG
MESGGIAYWAHAGGFIFGALLGPLLGLFSDKAKEESWYS
>Ava_4022 TPR repeat
MKFTSITYTLSAVLLLGFSIPLVFAQTPDSGLSADCQMPIPPNVNSVDHF
LGMGHFQQDCKKDLSAAVAAFTQAIKLNPQAEKPYYHRANAYAAMGNYQA
AVTDYTEVIRQNTGRFGLSSAAYWNRARAYEKLGEEQKAISDLTQLIGKN
TSSNADEYLLRANLYRDLGNKESAIADYKIAEKLLQQYTDGVFDTGMMDT
RYQQMLDQVRNELSSMGVAVTVPKTTTGNILRTIAKTEVERALNLAKLQS
QHPTVKNFDAQLQDLYKQLANTQPQVDQRVVKNLIANAAYEKIDTLKIER
SQLLTQYTLHSPVIGVIDSQTSELESLIRRNKF
>Ava_1403 GCN5-related N-acetyltransferase
MTIYHIRRGSTLERSQLVKFIQRTYQELFPQQKDFAHLAITVEQYFSKDT
PLWWVDFSSQESRAELGTMTRVESLNISSPLLPPTSSPVACLWMGNAIDQ
IKGDRHAHIFLLYVVPEHRRRGIATALMQYAENWAKQRGDRQIALQVFQS
NPPAINLYNHLGYQTQSLWMVKKIN
>Ava_3764 3-oxoacyl-(acyl-carrier-protein) reductase
MAILSENLRGQVAVVTGASRGIGRAIALELANYGATVVVNYASSSTAADE
VVAEITGAGGEAVALQADVSQVDQVDNLIDGAIDKFKHIDILVNNAGITR
DTLLLRMKPEDWQAVIDLNLTGVFLCTRAVSKLMLKQRSGRIINITSVAG
QMGNPGQANYSAAKAGVIGFTKTVAKELASRGITVNAVAPGFIATDMTSN
LKSEGILQYIPLGRYGQPEEIAGMVRFLAADPAAAYITGQVFNVDGGMVM
A
>Ava_2635 O-succinylbenzoic acid synthase
MLSKYKFSFRPIARKFVRSLVTSHGIWEVREAIILRLTDTTGKVGWGEIA
PISWFGSETLAQALYFCRQLPEEITQETIFSIPDDLPACQFGFESALEAM
GNGEDVTITNAQFSALLPAGEAALHQWRRLWEEGYRTFKWKIAVGEISEE
LRIFDLLTQNLPTSAKLRLDANGGLSYPEAHLWLQACDNLPVEVEFIEQP
LPVEQFSQMLELSEAYPTAIALDESVANLKQLATCYAQGWRDVFVVKPAI
AGSPSRLRQFCQQHKIDAVFSSVFETAIARKASLQLAAELSRDNRAVGFG
VNHFFYQEAENWLQSLWNNP
>Ava_4078 YeeE/YedE
MSNGVENTLTSKSQLLPPRPQKLVVAIALFIFTVGSVLLSKYGWRQSVLF
LIGGLLGVSLYNSSFGFASAYRKLLLNRDVRGIYAQLVMLAIATVLFAPV
LAAGKAFGQEVAGAIAPVSISGAIGAFIFGIGMQLGGACGCGTLYTIGGG
SYTMLITLITFCLGAFWASLTRYLWAGLPKAEPIVLGETLGWTGAVVLQL
GILLLLAGGLWLWSKNSKSASAEHPSPTRSGFLFGSWSVFTGAIALAVLN
WLTLLISGEPWRITWGFALWTAKIATMFGWNSSTSKFWDGDTALSNSVFA
DVTSVMNLGIILGALLAAALAGKLTPQTQVSPSKILATVIGGLIMGYGAF
TAFGCNVSAFFSGIASTSIHGWVWIVCALLGTAIGIKLRPLFSLPN
>Ava_3891 TPR repeat
MGEPQEEVRTQESPHSGTTIEKNMPRTQKNDNFVDKSFTVMADIILKILP
TNKKAKEAFVYYRDGMSAQAEGEYAEALEYYEEALTLEEDTNDRGYILYN
MGLIYASNGDHDKALELYHQAIELNPRLPQALNNIAVIYHYKGEKAKEDG
DHDGGEALFDQAADYWIRAIRMAPNNYIEAQNWLKTTGRMQIDVFF
>Ava_3255 TPR repeat
MQNQPPFIINSTYHREESKDIEFKSIASQHPVKKIVNHAEEYITGFLNAL
VEGQLYIGIDDAGKILGVKLSRSERDEIQKIIPNKLRNTDPIVSPSLYDI
AFYNVLNEEQKDIEDLFVVNISVLGVDESQFYRTSSREYFLYKTTGGSTY
LKTGTDCIKLNTTEIAQEIQKRKQKYLKQELDNIDDKLLQNPDNRSLLKE
KANIAKLMGDIEKMDESYKRLINLNPNNSRTRVDYASAHKSIGDLEGALD
ILEDALKVDGDNLSILKTKGEILLGSDSSDKVKEAFQTYETALKLNPEDY
TIITQIGIALRKLGKYKESIKVFNLALAKSPHYRAAKYEKRITYHKMFEG
GIRI
>Ava_0210 conserved hypothetical protein
MMRTFVIAKNVFQEVVRDRILYIIGFYVLLLAVAIRALPEFAASTEDKMF
LDFGLAAMSVISLIIAVFVGTGLINKEIDKRTILLLIAKPVSRGEIITGK
FFGLSSVLTVLVVSMTAIYLLFLQFGNIPHTTPSILIAVLFLCLQLSLIT
AVAITFGVFTSSLLAIALTFAVYLMGNITQNIVEFSRLSRNPAMEGISQI
LYLILPDLSRLDLKNDAVYGMQALPDTIALMGNAVYGFVYIAMLLAIAIS
LFSRREF
>Ava_2373 Protein of unknown function UPF0153
MANWQCVKQCGACCNLDPAERPEIEDYLTPDELELYFSMVGEGGWCINFD
HTTRECRIYSTRPRFCRVEPDIFQDMYGIESEELNDFAIECCRQQIEGVY
GDRSLEMLRFDKAVGI
>Ava_3681 transferase hexapeptide repeat
MIFWFCNQPPRVELFQCSWVPYSYSIVSTNSYWPSPDFSQAAFIAANAVV
MGSVKIAAGASIWYGAVVRADVESIDIGECTNIQDGAILHGDPGLPTVLE
DHVTVGHRAVIHSAHIERGSLIGIGAVILDGVRVGAGSIIGAGSIVTKNI
PPLSLVVGVPGKVLRPITPEEAADLIQHAERYKKLALVHAGKGSDIGFYP
QE
>Ava_2347 Zinc-containing alcohol dehydrogenase superfamily
MLAALLYGQEDLRLEQLADPSPEVGEVVIKVGAATTCGTDLKVWRRGGHA
KMLKLPTLFGHEAAGEIVAVGAGVTGWQIGDRVVANNSAPCMKCFFCQRQ
EYSLCPNLTWNNGTFAEYLKIPAPIVQHNLLQIPDELPLPLAAMTEPLAC
VLHGVARSQIKPQDKVVVLGDGAIGLMFVATLADNVEVLLWGGNDQRLEI
GQKLGAAKTFNYHQNPDIPGTVKDLTQGWGADVVIEATGVPKVWETALAC
ARPGAIVNLFGGCPRDTTITVNTEQMHYSELTLKGVFHNTPEYVRAALAL
IASRKIPFELLISEQRPLKDLEQVFDDMKARKVIKVAMVSS
>Ava_4978 Protein of unknown function DUF6, transmembrane
MHHSSGRWRLGLALSLLTVFLWGILPLALKVVLQALDVYTVIWFRFLVSF
ALLAVYLGMRGNLPKLKQLQANSWKLLAIATLMLASNYFLFMQGLALTSP
ANAEVIIQLATLLLGFGGLIFFQERYNFFQWLGVGILTLGYVLFFRSQLT
NVITSHDTYVYGSALVVLGAVVWAIYALAQKQLLQSLSSAQIMLVIYGGC
TLLFTPFTKFDAINQLSNLHLFTLLFCALNTVIAYGAFAESLEHWEASRV
SAVLALAPILTLLAVELISLIAPDLIPPEPLTTTGIVGACLVVSGSVAIA
LKKAS
>Ava_1177 Na+/solute symporter
MSVEIWTILIVGLSFVAYIYIGWQSRVKSSKDFYVAGQGIPSIANGAATA
ADWMSAASFISMAGLISFLGYDGSIYLMGWTGGYVLLALLLAPYLRKFGK
YTVPDFVGDRYYSNVARLVAVVAAIFVSLTYVAGQMRGVGIVFSRFLQVD
INTGVIIGMVIVGFFAVLGGMKGITWTQVAQYGVLIVAYLIPAIAIAWKL
TGNPIPQLAFTFSDVATKLNQIQVDLGFQEYTQPFVNKTMLDVLFTTIAL
MVGTAGLPHIIVRFYTVPSVRAARFSAGWALLFIAILYTTAPALSMFARY
NLITTLHNQTITEVQQLDWANKWEKTGLLKFEDKNNDGRLQLTPKKDTNE
ITIDRDIIVLSTPEVAKLAPWVIALVAAGGLAAALSTASGLLLVISSSVA
HDIYYRILDSSASEKKRVFVGRVMVGLSVVLAGYFGVNPPGFVSEVVAFA
FGLAAASFFPVIVLGIFDKRTNSEGAIAGMLSGLIFTIIYIIGVKFAGMQ
PWFFGVSPEGIGTLGMLINLVVTLVVSRLTPPPPAEIQAMVEDLRSPSIE
EEEVQPIGH
>Ava_3090 Protein of unknown function DUF1400
MGITGKSIKLFAGLVCTFSLTQFLATNTPVQAAETVVVRFGLFAESIPVA
DLQKAAETGEFPSSLNLFTRRLSEQQRRTLIGALRMRVPLNVVTISRLLN
TQIGTTILNDLSRAVVRKDQSGAKALRASLVLGSTAPQGLSILSFITAYP
SRSLEINLPQAFQVAGSLNNAFWRTQQFMLAISPQLDPAKPQISIPFDPS
QPGNAQVQVLKLNLNDQKRNRQIPADIYWSTSATQEKPVIIYSHGMGSVR
TDLHYLAEHLASHGYIFVALEHPGSNQANTDLATKGKVRLLEPQEFLNRP
QDVSFVLDVLEKLNQTTGNPLQGKLATNNTMVIGYSFGGGTALSLAGAEL
QIAGIRERCQNKLTILSLGETIQCVAQELPEKTYQLRDNRIKQAIALTPT
TSLMFGETGLTKVQIPTLIVAASADKTTPALTEQILGFSKIPSPKWLVGI
IGGTHLSVKDPSTTLDQVDKPNTPLTGGEIVGEQATDVRQFVKAIALAMV
AQLTPEAEKYAVFLTPDYAQLASTESFPFRIVTEIPPQAIPIGKQ
>Ava_1066 von Willebrand factor, type A
MMSDRDYTLIIDKSGSMSTPDQAGGRSRWEIAQESTLALARKCEQFDPDG
ITVYLFSGRFKRYDDVTSAKVAQIFLENDPAGTTNLAGVLQDALNNYFQR
KAAGKTKPNGETILVITDGEPDDRKAVFETIIHATRQMERDEELGISIIQ
VGSDAQATKFLKALDDQLQSVGAKFDICDTITLEDLEDMSLADVLMNAIT
D
>Ava_0027 RNA-binding region RNP-1
MSIYVGNLSYSVTQDDLTKVFSEYGSVTRVQLPTDRETGRVRGFGFVEME
SSAAEDAAIQALDGAEWMGRVLKVNKARPREEKGARSGGGNWSRNNGGY
>Ava_1816 Serine/Threonine protein kinase
MIMLGEIFNTRYEIQQLLGKKAGRRTLLAKNVVTGELVIVKLLAFSSDFE
WDDLKLFEREAETLKSLSHPHIPQYLDYFEVNSPTIKGFALVQSYISAQT
LEQYLQSGRSFSESDIKQIATAILEILIYLHGLYPPVIHRDIKPSNILLG
ERSGNHVGQVYLVDFGSVQTALGTEGGTRTVVGTYGYMPPEQFGGRTVTA
SDLYSLGATLIYLVTGTHPADLPQKDFRIQFESVANLSPGLTAWLKTMTE
PSVERRFSSAQQALKALESSHLLTHSQLVIGKPKWSKIQLTKDADSLEIL
IPPVGFQPSMVFMGLFAIAWNSFILFWTIGALSAPFPVNIPFALFSLPFW
GAGYFMASTFLLSLLGRNRLRLNREQITFTHELFGWKFHRPRPASKESIT
KLVYISRHLTKDSEGAKTQVPAQLEIWAGVQKYQLGTNTAIQSETELEWL
AQELSDWLNIPIQRE
>Ava_3590 HAD-superfamily hydrolase subfamily IIB
MYINPHVGAGHSPNLQRISQNTVAELSHVRLIATDMDGTLTQQGRFSAAL
LQTLEDLKAGGLTVLIVTGRSAGWMSGISSLMPVAGAIGENGGLFYPAES
EHPLLLTPIADVTEHRQNLALVFQELKLKFPQIEESGDNRFRVTDWTFDV
AGLTLEELKIIANLCQDMGWGFTYSNVQCHIKPQGQEKAIALLQVLQKYF
PDYSPTQVVTVGDSPNDESLFNQRYFPLSVGVANVLKYADQLQHQPSYVT
TAAEGEGFCELSSYILRKGN
>Ava_1245 conserved hypothetical protein
MNAISKVEFKHPSWQTATLFALGFWLSASLVLDWVIMPSLYLSGMMLQEG
FTTAGYAIFWIFNRLELLSAAVVLTGILVWSKTHTQGKVNMTVIAVMLLA
IALLNTYFFTPQMSAIGVNLNLFATESAIPATMNLLHGGYFILEVVKLLV
GGVLFNWCWQQQS
>Ava_0812 ChaB
MPYQQISELPQDIREQLPERAQQIFFAAFNAAQSDGLSEEGSADVAWNSV
RNEYKQGNNGQWQRKAEDSAIHHKSITTGGN
>Ava_0999 Major facilitator superfamily MFS_1
MNVSKSHILWVQVWVLAALQGAITLAWIIYNAYVPKLLVQFGFPPSLAVG
LLVVENALAVIMEPLMGGLSDQAQRWVGSRFLLISAGVILSATLLIAIPC
IVTFVPPTVVWRSLLPIVLVAWALSMTVFRSPAIALLAKYSMPAELPLAF
SVVVLTGGIIGAFRPIANKFILNLGPIFAFAIASFVLLAVTAALRLVNPP
HTPVTNPREITQLPNRELSLILGTGFGVAWGSRLVMDILGKILPVQLQIT
NNDWLMVWVGLAIAVASLPAAWFTMKVGDRQAMLIGIGLTTLSLLIMVCF
NIPIPFLLTLIVGFSVIINGTIPFCLRLVSQPWEGLGIGMYFGGFALAMS
LFGAIFPQPQQITPVVGLIGLILAFLLAGGCIAASGETNP
>Ava_5005 hypothetical protein
MPKDWYEWHDLYNTEPKLQQRLEIVREYIAYSLNALPDGAIRIVSVCAGD
GRDLLGTLKNHPRINDVYARLVELNPQLVERGRATIESLGLAKQIELING
DATLATNYVGAVPADIVIVCGVFGNLAEEAELNRLLDNLSFLSKPGAFVI
WTRGHSNGIPYSDNVRKILSASGFEEISFNLTATGDMGVGLHRYIGENLP
EPKEQQLFVFSGVPRAAR
>Ava_2851 Serine/Threonine protein kinase with WD40 repeats
MICCLNPDCLNPLNADGKKSCQSCGTTLVPLLRNRFRVVRVLSDEGGFGR
TYLSEDKDKLNEPCVIKQLAPKFQGTWSQKKAVELFAEEAKRLQELGEHP
QIPTLMAYFEQDNCLYLVQQFINGQNLLKELQQRKNYRPGEIQAILLDLL
PVLKFIHDRGVIHRDIKPENIIRCRTDGRLNLIDFGSSKQLTAKVQNFGT
SIGSHGYSPLEQIRDGKAYAASDLFALGATCFHLITSVSPFQLWMEHGYS
WVSNWRQYLRSPLTPELDFVLDKLLQKDLKHRYTSADEVIKDITPKQPLA
LPAAGQTSGKIPVPQTTYLSKSPKKYSLLRSVVLLSALVLLFGFSESWFS
QYLRIYSSLSARLTQNKAGEVVLAQPHKTTLRTISLANTLPSDENAFVSL
AISPNGQIIASCGSDRSIKIWQLATGEDISTLNGHSRKVNAVVFSPDGKT
LVSGSDDNTIKIWNLKTGQVIRTITGHSDAVHTLAISPNGKTLVSGSDDN
TVKVWNLNTGRLINTLTGHTFWVRSVAISPDGVNIASGSFDKTVKIWNLE
TGNLTHTLAGNGETVTSIAFSPDGNTLASASRDRTIKIWKVGAGTRVRTL
KGSTETITSIAFSPDGNTLASASRDQTIKLWNLETGEEIRTLEGHENTVT
TVAFTPDGANLVSGSEDNTMRIWRIGN
>Ava_4557 Thiopurine S-methyltransferase
MLEQQKENLRFHIQQLANEAVQKSEPLAWFEVVYAEAQGDTTQIPWAKLT
PHPYLQEWLTNHQPFPSGQKALVIGCGLGDDAEALSKLGFAVTAFDISPT
AIAWCGERFPNSNVNYIVADLLAIPPQWHLSFDFVFECRNIQALPLNVRA
EVITSVASVVAPDGTLLLINRVRETEAEPSGPPWPLSESELKQLENLGLQ
PIEQLVFLESEQVDVKQVRIEYRRCHMQS
>Ava_0542 TPR repeat
MNEQRLQAYYQLIQTLLDCPSGEEPEILAANTELLDADFVQVVGLAAEHF
AQQGEENRANWLRNLATYLTTPETPPITEADIETYSPFILEVLRATAESN
GNPEVIYLLLAANTDKLNRIFAELLRRWATNTLAEAEPDTATSIANVIGN
FSNLIQQFPLGSKANNMEIAITGYEIIQTIYTRPAYPEKWATTQNNLATA
YSDRILGNRGENLEEAIAAYSAALEVYTRTDFPVDWAGTQNNLAIAYRNR
ILGNRGENLEKAIAAYSAALEVYTRTDFPEKWATIQNNLATAYLYRILGN
RGENLEKAIAVYSAALEVYTRTDFPEKWAMTQNNLAIAYSDRILGNRGEN
LEQAIAAYSAALEVYTRTDFPQKWAMTQNNLGNAYRNRILGNRGENLEQA
IAAYSAALEVYTRTDFPQKWAMTQNNLGNAYRNRILGNRGENLEQAIAAY
SAALEVYTRTDFPQNHAETLFNLGILYQEEKQFNLAYDTFAQAIATVEAL
RGEINAGDNLGEEGKRKQAEEWNKLYRRMIEVCLALGKDTEAIEYIERSK
TRYLVELLSKADSINQKNLPDIDSKIRFAEIQNLLDDETVIIQWYIFTDC
FRAFIITKNHQPIIWQSASENLDNLEEWTDNYLQIYGEDKQKWRYQLNEQ
LTKLTQILHLNEIISLISSQYKKLIVIPHRYLHLFPLHAVPLANKHSSQP
EYLFDRFPHGVSYAPSNQLLRFTQRRLQKLANLELNPFSNLFAIQNPTND
LAFTDIEVETIAADFQPQQILKHHQATKAALTATPTNETLSNSQWLHFSC
HGYFNFRSPLKSGLQLADAVTSNIPSTINSSRYLRIDNETAIDLDKCLTL
EDIFQLNLNNCRLVCLSACETGFIDYTNSSDEYIGLASGFIRAGATNMIS
SLWAVSDFHTALLMIKFYENLPLYQYNVSLALNHTQTWLRRATQSQIIDW
VQSKTNMQNTQQQKIIGFLQQYKPEQQPFKRPEFWAAFSAISPV
>Ava_4137 TPR repeat
MLNFTKYAVSLGGLTCFLITTNLTVSAQTNQQLVQISKLNLEVDSSLRQS
SIIDSSNPKYRLHQGRSSFESGRFAEAVKFWEAAYIGFKNQGDILNEAWS
LSYLSLAYQNLGDWPNAEVSITHSLNILKHLNVQKQGNSAILAVALNTLG
NLKLSTGQAQAALKAWQDAESAYAVAGDEVGKLGSQINQAQAMRALGLYR
RGQQLLVNLHQKLQNQPDSIIKAKALQSLGIAFQLVGDVQQSQEILQQSL
KISKDLNSATDISNILFNLGNTARDLQQIEVAFDYYQQVVAQTSNSQLRV
DAQLNQISLYLQTRRNKPSEALLSEIKWQLEKLPASRMSIYAAINFVRNL
TKWSKLDGNESISYRESAQILARASLQAQMLGDLRAQAYALNELGKLYFE
KQQLGDALKLSQQALQITQEINATDISYQAAWQVGRILKKQGDHQGAIAA
YDNSVKTLKSLRSDLVAINRDVQFSFQESVEPIYRELVDILLESPQPSQG
NLKLARETIEALQLAELDNFFREACLNAKPEQIDQIDKQAAVIYPIILGD
RLEVILSIPGKPLSSYRTVLSSREMEDIIKQTRQSLNPIFSNEERLNVSQ
KLYDWLIRPLESELSKSGVKTLAFVLDGSLRNIPMAVLHDGKQYLLEKYS
LALSPGMQLMPARSLKRENLKLMTAALSESRQGFKALPAVKSEVTEISEE
VSSKLLLNENFTDTNLKQAIESTPFSVLHLATHGQFSSQSDDTFILSWNE
KINVKQLSEFLQARNESQSTPVELMVLSACQTAKGDNRAILGLAGVAVRS
GARSTLATLWSVKDESTAKFMVEFYKHLRQPGISKAEALRQTQLTFLQNA
DFQHPFYWSAFVLVGNWL
>Ava_2994 Peptidase S33, proline iminopeptidase 1
MRELYPLIEPYKEGKLKVSQLHTIHFEESGNPQGKPIVLLHGGPGGGCPP
VYRQYFHHEKWRLVMFDQRGCGKSQPHAELRENTTWDLVSDIEKLREHLG
IEKWVVFGGSWGSTLSLAYSQTHPERCLGLILRGIFLLRQKELRWFYQEG
ASYIFPDAWEEYLQPIPVDERDDLLTAYYQRLTSPDSQVRQEAARAWSIW
EASTSRLFPDTQLKQTFAEDKFAEAFARIECHYFINKGFLNSDHQLLLNV
DCIRHIPSVIVQGRYDVVCPMTSAWELHRAWPEAEFIVVPDAGHSMSEVG
IRSALIEATDRFADAG
>Ava_1011 Short-chain dehydrogenase/reductase SDR
MPTALITGASGGIGKAFAQELAARQTNLVLVARSQDKLHQLAQELQQQHK
IQVDVIAKDLTETDAVADVFDITKSQGLTIDCLINNAGFGDYGDFAESDR
TRQIKIVQLNVLALVDLTHRFLPLMRQSRSGSIINVASIAGFQPIPYLSV
YAASKAFIVSFSEALWAENRQYGVRVLVTCPGPTETDFFTEANFPQALAE
TTNKVMSSEEVVLKSLKALENWEPTVIISDTSTQLRSNVARLVPRKTLLS
LLAKHFKA
>Ava_3020 ABC transporter-like
MSVITLQSVKKDFGIKEILKDASFSLDATDKVGLIGTNGSGKSTLLKMIA
GIESIDSGQILCTSGSKIIYLPQQPDLDENLTVLEQIFADSGEQMTLVRE
YEELSDKLAHYPEDSQIMSRLSVVMQRMDATNAWELETNAKIILSKLGIT
DFDAPIGSLSGGYRKRIALATALLAEPDVLLMDEPTNHLDALSVEWLQSY
LNRFRGALFLITHDRYFLDKVTNRIIEIDRGDVYSYAGNYSYYLEKKALA
EESAISTQRKHQGVLRRELEWLKRGPKARSTKQKARIQRVQAMRETEFKQ
TQGKVDISTVSRRIGKKVIDLHNISKAYNDRILIKDFTYEFSPEDRIGII
GGNGAGKSTLMNIITGRIEPDAGKVEIGSTIHIGYFDQHSEELLTAVDEN
QRVIDYIKEEGEFVQIADGTKITASQMLERFLFPGNQQYAPIHKLSGGEK
RRLFLLRILISAPNVLILDEPTNDLDVQTLAVLEDYLEDFSGSVIVVSHD
RYFLDRTVDTIFALEEGGNLRQYPGNYSVYLDYKKAEEAQQATLNTKEKT
KNSPETKVTSQTKDVETKKRRRLSNWEKREFTELEGKIAQLEDEKTQAEQ
ALTHVPPGNYTQVQKLYEQIEALKQAIDVATERWLELAEIESSESG
>Ava_3010 conserved hypothetical protein
MMLTEKLEQLKALFTEMEQALIAYSGGVDSTLVAKIAYDVLGDRALAVTA
VSPSLLPEELEDAKIQAATIGIPHKVVQTHEMDNPNYTSNPVNRCYFCKS
ELHDTLKPLAVEMDYPYVVDGVNADDLHDYRPGIQAAKERGARSPLAEVG
VTKLEVRQLSQQLGLPWWEKPAQPCLSSRFPYGEEITIAKLQRVGRAEIY
LRKLGWQNLRVRSEEDTARIELPPEKIKDFVLTTDLPSVVNAFQELGFIY
VTLDLEGYRSGKLNQVINQANTPIKV
>Ava_0865 conserved hypothetical protein
MNKNTVKVVKNESFPPGDYVSSGFSTIQPDSCFPNMILGNRYDSLWFYLR
RNIAHNFYVDQRRQEVGFVSRDEAHILYNTALKFQGKKALEIGCWMGWSA
CHLALGGVELDVIDPMLAEQLFNESVTESLKLAGVKESVNLIPGYSPQKV
EEIANQFQRKWSLIFIDGHHEAPAPLNDAIICEQLAEPDALILFHDLASP
DVGQGLDYLKEKGWNTMVYQTMQIMGVAWRGNVEPVIHQPDANIDWQLPP
HLQDYVVSGVSQTVGKDRLGEILKTLQPYTLLSKAQLFSLYSHVKQAYAY
LFWLPQMLIRRSLK
>Ava_1624 WD-40 repeat
MKYQVGGSLPSDDPNYVIRQADKQLYASLKAGDFCYVLNSRQMGKSSLLH
RTSNHLTKEGHICIYIDVTRLGSEDTTTEQWYKGIIISIFYSLNLSEKVN
FQQWWKMQSGLSPIQKLNQFVEEVLLPHNQSGRIFIFIDEIDSLLSLNFS
VSDFFAWIRHCYNQRAHDPKFQRLGFAVFGVVSPSELITDKRRTPFNIGT
AIQLYGFTLDEATPLLKGLEEVISQPQAVLQKIIDWTSGQPFLTQKLCQL
VVQTAWNTPNRKIDLPPATAGYWVEQLVQKQIIQYWEAKDEPEHLRTIRD
RLLFNEKRAGRLLGIYQQMLQAESGNLPVEIDDSQEQKELLLSGLVEKQQ
GYLKIKNPIYRYVFHHEWVIRQLDNLRPYSQIFNVWVASGYQDESRLLRG
QALKDAQDWSQGKSLSDLDYRFLAASQEYEQREIQTALEAARVKEVEARL
VQEKKTAKLQRLLLIAMCVKFLLSSSLAMGIYILYRQAKNSEFQAKNSEI
RALISSSEGMFASNRRLDALIEAIKAKDRLKKLENPNADITNQANNVLRQ
AVYGADESNRFWGHTAAVMAVDVSPDSSLIASASIDRTIKLWRRDGTKIT
TLKGHQGAVRSVRFSPDGQMVASASEDGTIKLWKLNGTLLKTFKGHTASV
WGVAFSRDGQFLASASWDTTVRLWKRDGTLLNTFRDSKEAFWGVAFSPDG
QIVAAANLDGTVKLWQRQGSGWQEAKPLQPLKSHTAWVVGVAFSPDGQTL
ASSSEDKTVKLWRRDPADGSYRLDKTLKQTTGIAGVAFSADGQTIASASL
DKTIKLWNIDGTELRTLRGHSASVWGVTFSPDGSFIASAGAENVIRLWQS
QNPMQKSVTAHYGGIWSIAITSDSSTVGTASHDNTARLWSRQGGLVKTFT
QEKGGIIAISFSADGKLVALPTYNETVLLKKPDGSDVASYKNTQGKITAA
VLSPDGQAIAIANVHKVAQIWRRNQPTSQVLKGHQAEVWQVAFSPNSKIV
ASASGDSTVKLWTLDGKLLTTLAGHSSVVWSVAFSPDNKMVATGSGDNTV
KLWTIDGKLLRTFTGHTAAIWGVAFSPDGKILASGSVDATVKLWKMDGTE
LTTLTGHTAAIRKIAISRDGTILASGGDDNTLILWNLPQILSLDALTYGC
NLVKDYLQTNTGLEEGDRHICNHLEN
>Ava_1151 conserved hypothetical protein
MTSKQAQAVAQQLGDIPVNDEKIQAEIQRLNRKSFIPLEQVQMLHDWLDG
KRQSRQSGRVVGESRTGKTMGCDAYRLRHKPKQEPGRPPTVPVAYIQIPQ
ECGAKELFGVLLEHLKYQMTKGTVAEIRDRTLRVLKGCGVEMLIIDEADR
LKPKTFAEVPDIFDKLEIAVILVGTDRLDAVIKRDEQVYNRFRACHRFGK
FSGDEFKKIVDIWEKKVLQLPVASNLSSKTMLKTLGETTGGYIGLLDMIL
RESAIRALKKGLRKVDLATLKEVTEEYK
>Ava_0977 GCN5-related N-acetyltransferase
MNSYYQDFLIRNWQKGDRTKAAAVISYVLSEYGLGWEPKGADRDVLQVEE
CYLATGGEFWVIEHQSQIVGTGAYYPVNRGEKAVEIRKMYLLPSVRGLGL
GKYLLQQLEAAIASRGFQEIWIETASVLVEAVKLYENHGYQPATGVETAR
CDRVYVKFLN
>Ava_3608 conserved hypothetical protein
MAKNETNAGLKIIPLGGLHEIGKNTCVFEYDDEIILLDAGLAFPTEAMHG
VNIVLPDMTYLRENRHKIKGMVVTHGHEDHIGGIAFHLKQFDIPVIYGPR
LAMAMLEGKLEEAGVRDRTELRKVLPRDVVRIGKSFFVEYIRNTHSIADS
FTVAIHTPLGVVIHTGDFKFDHTPVDGEKFDLQRLAEHGEKGVLCLLSDS
TNSEVPGFTPSEASVFPNLDRVFSQAEGRLFVTTFASSVHRINMILQLAK
KHNRVVTVVGRSMLNLIAHARNLGYIKCEDNLLQPLHMVRNLPDDNVLVL
TTGSQGETMAAMTRIANKEHPHIKIRQGDTVVFSANPIPGNTIAVVNTID
KLMQQGAKVVYGRDQGIHVSGHGCQEDQKLMIALTRPKFFVPVHGEHRML
VKHSQTAQKSGIPAENMVIIQNGDIIELTEDSIRVAGKVQSGIELVDTTS
SGMVSAKVLQERQRMAEEGLVTIAAAIDWQGKLLAKPEIHLRGVVTSVER
SLLQKWVQQRIEEILSVRWSEFATSEGEQPEIDWGGLQGTLERELQRSIR
RELQCQPSVTLLMQIPDEPPVKVADGRRRRTRTAAQVAS
>Ava_0525 Abortive infection protein
MKDAPAITVVMAFFIAWLVCWLPIAAVSLKLLNWQPIKPLQPEQKLPLMI
SLYLLAPLVLWGVIWLTNSSFANYGIVGNFALLSSLALGFGLGVLSIVIL
FTCQLWLGWCYFKESNIKQIASISLPILLIALLVGGIEELVFRGFLLTEL
ERDYSVWVAGILSSLIFAVLHLVWEQRETLPQLPGLWLLGMMLVLARISD
RGNLGIAWGLHSALVWAIATIDTAQLVTYTGKVSEWWTGKNKKPLAGVTG
IICVLGTTLILWLVCDSISLILF
>Ava_4081 Dienelactone hydrolase
MDRTLTHQPQEYAVSVSVGEVKLKGNLVIPNGATGIVLFAHGSGSSRYSP
RNRYVAEVLQQAGLATLLIDLLTQEEEEIDLRTRHLRFDIGLLASRLVGA
TDWLTHNPDTQHLKVGYFGASTGGGAALVAAAERPETVQAVVSRGGRPDL
APSALPHVKAPTLLIVGGYDLPVIAMNEDALEQLQTSKRLVIIPRASHLF
EEPGALTAVAQLASEWFMHYLR
>Ava_1057 conserved hypothetical protein
MNSILICLILGLVAGIVSGMTGIGGGIIILPALIFLLGFSQQQAQGTTLA
LLVPPIDLLAAWVYYKQGYVDIKVAALICLGFILGGWLGAKVGTDLPTGT
LSKIFAVLMIISALKVLFTNPAEGI
>Ava_5039 Lipase, class 2
METRNLQRNPILLVHGITDTEAVFDQMAAYLRQMGWTVYTLNLVPNNGEA
PLNVLAQQVADYVAANITPEQPFDLVGFSMGGIVSRYYVQRLGGISHVQR
FVTISSPHYGTVIAYASQRPGCVQMRPNSLFLQDLNHDVQMLEELNFTSI
WTPYDLMIVPTHSSKMPVGKELSIPVALHSWMLRDVRSIEAVAAALAEPI
NCDRRFEYIPNSSIHN
>Ava_2240 GCN5-related N-acetyltransferase
MTIKSIYKFNFPSCCLEYLRRFYLDNINIQLALTSWFFDPSSHKPVTADT
EPSFRQVHIRAATPADLTSIAQIIAESFHSQNGFWGWAFPLLRLGIYEDF
KHRLLSPAPHHLCLVAVETTNDGVDQLLGTVEVGVRFSDYWTQTGKSFPY
LSNLAVHPQYRRHGVASKLLVRCEQVSQEWGFQNLYLHVLENNYQARQLY
FKLGYQVHKIDSHWNSFLFRRSQHILLHKQINITSTG
>Ava_0776 conserved hypothetical protein
MKLPKINLPLLIISLGIIFSLYLLSRVPEDIYFSGDAGLKALLAKQFSSG
KLNFDLDLSVPSWVRNLWDNGLYPFEPPFSYKISNRYYITFPFTFPLITA
PFHALFGYRGFYIVPLLSTWIIWFNFYRICQFFKISVLTTSIGITTLIFA
SPLTMYSAMYWEHTLAVSLAFAGLVIILSKGEEGFTQKDALVSGILIGLS
VWFRPEFLALVAILIPLALSSYKIKLGKISIINQHKILFIISLLATVSCF
FIINKLIYNHPLGAHALQVVEEFSLQERLSKAHRIFGRLWKNFREYFPIV
YFTIIFTGLSVFYKSIKLTAAMKKIVLISIPFIFLVPILLPSDGGKQWGP
RFLLILIPLLSLLAAFLLESTLSIRQFGIKYISTGIFTALFMIGVYANTF
MGINYSYFTGNTEAIDIRNFLRQDDHKIVAVAHQYISQSFEAAFKNKLFF
LTKNLDDTSKLGLALNEQGYDNFVYVCASYDPCFSSPTIPSQINISATDK
LLRVQLKEVKKNKKYIIQEAEIVQVKDN
>Ava_4728 Endonuclease/exonuclease/phosphatase
MTATTLKPGDIAIAGYNTTNPDSIRVVVLVDIGAGTTFNITDNGWQSSGS
FRTGEGVLTYTAPQDISAGTVLTWTNGSSINSPGFNSNNPSNFALNASGD
SLIIYTGTLASPTLIYSLSSGNWTNATSASTSAEPTATNGGTLETGKTTV
AITTNGTNGYYSGSTVGTQAQLLAAISNPANWTASSTITDIANWRSSFTL
SQPLPNIQITEYMYQGTNGEFMEFTNLGTTAVDFTGWSYDDSGRTAGTVS
LSAFGIVQPGESVILTEASEADFRAAWGLSANVKVIGGLTRNLGRSDEIN
LYDNNGQLIDRLTYGDETFTGTIRTQTRSGWTEPGNLGAVTINTDWQLST
VDDAQNSRTSTGGDIGNPGFYNINNTPLPGITITQSGSSTDVTEGGVTDS
YAIVLKTQPTADVTINITVDNQVTTSSPTLIFTPQNWNIAQTITVTAVND
DVIEGTHTSTIAHNVSSSDTNYNGIAIANININITDNDAPPNTNVNVQIT
EYMYTGANGEFVEFTNLGTTAVDFTGWSFDDNTRIAGSFNLSAFGIVQPG
ESVILTETAAETFRTAWNLPTSVKIIGNSNQGLGRTDEINLYDSTGQLID
RLTYNDEGFTGTIRTQNASGWTTAANLDGFEITTNWQLSAINDGQNSRLS
TGNDVGNPGTYIPNPVSTVGAPKIEVNPSTTDLLDGQNLPVPLPKIGAGA
ISGVINDPTDPARTLGINFTLSDTDTPVENLTITVTSSNQAVVPDANLTL
TGTGAERNLKINPVGVGLANITVTVSDGTLSSAYIINYAASAGSVSPSTR
FLTGTSDASSAIAIDANYMFVADDEDQTIRLYDRRNSGLPLASFDFTSLL
GLSGSSEVDIEASTRIGNTIYWLGSHSNNSNGSDSPNRERIFATQISGTG
ASATLTFQGYYQFLEDDLIAWDNNNGHGLGAGFLGLAGSAANGVSPEIRN
GFNIEGLTVAPDGNTAYVSFRAPNQPTSDRTNALIIPVTNFTNILNTTGG
TSGATSFGAPIFLDLGGRGIRSIERNSSNQYLIIAGPPGGATGVAPNDFR
LYTWTGNATDAPVLRIADLTALNTNGSFESIVEVPDSLTNNTQIQLLVDN
GDTVWYNNGTISKDLAQNNFQKFRSEVITLGSPVLTLPVGAISLKTPYSQ
EFNNLISSGSETWTDSSTIAGWYTARTGTGTTIVASTGSNTAGNLYSFGL
DSSDRALGSIGSGNAAAGTFYWGSRFFNDTGNVVNQLYVNYYGEQWRSGG
TTSNPQTVDFQYQTGATSLTGGTWVDVNNLDFTSLINNTTAGALNGNASA
NRNLISGTISGLSLNPGEEIWLRWVDIDHPGTDHGLSIDDVKVSTAPLPS
ITLIESGGSTNVTEGGATDTYTIVLNTQPTANVTVTINPDSQTTTSVNTL
TFTPANWNTPQTVTITAVNDDLVEGTHTSVITHTVTSDDTSYNGINIGSV
TATITDNDVALTITKIHQIQGSGTTFNSAFGSIRTIEGVVVAAFPGGSGL
NGFFVQEEDADADNDSTTSEGIFVFDPTGQFSGSVGDKVRVTGSVSEFST
NNGVSSLTQLSSVSSIINLGADILPTVSNIQLPVTTVADLERYEGMRVNI
SAGSGDLTVTEHFQLGRFGQVVLSATGTSNQLGTDGRLEQYTQFNDPSVT
GYAAYLDEIAKRRIILDDGSSTQNPATIIFGRGGEPLSATNTLRGGDTVA
SITGILDHRFEGYRVQTSTGVNFTPANPRPTTTPDVGGTLKVASFNVLNY
FNGDGTGSGFTSPEQRGAENLTEFNRQRDKTIAAILGLNADVVGLIEIEN
DGYGANSAIQDLVNGLNAVAGAGTYAFINPGLSQLGTDAIAVGLIYKPNS
VTPIGTAATVADGFGQGAFDNNNRKPLAQTFRQNSTGEQFTAVINHFKSK
GSSFGNPGDADAGDGQGLSNGTRTRASQDLAAWLATNPTGTTDTDYLILG
DLNAYAQEDPIRALENAGYNNLLPNTTYSYVFDGQWGALDHALANASLTA
QLSGAVKWHINADEPNVLDYNTNFKSVGQQTSLYSPDAFRSSDHDPVIVG
LNLNTAPIAVNDIATTNENTAVNINVLTNDSDANGDALQLSLVSNPVNGI
AVVNDNGTPGNFADDFIIYTPNTGYVGSDSFTYGISDGKGGTATASVSLT
INASGGIIGTPDNDILTGTNRNDLIRGLGGNDLLIGGNGNDTLYGDRGKD
ILLGGNGNDTLYGGDGNDTLIGGNGDDLLVGGKENDLLIGGNGRDRFYLS
DTRTGEFDIITDFKVGQDTIFISKTEFGLSQALGTLNPGLFRLGSNATAA
GDRFIYNNTTGQLFFDEDGIGGAAKIQIGLLSNKPAITSSSITIAA
>Ava_3051 Lytic transglycosylase, catalytic
MLKKLQKKQISIIAGAALFAFSAGAMVSAPEIGKSLGQWLNVTQGQPKDL
SEGSRAKSDVFPLISQSPVERAAKLEELSQNSRSPDRERARYLLASDYID
TKQGQKALELLTGLEKSYPVLAPYILLKQAQAQDLLGEDGKASDLRQQVL
KQYPKEAAVVKAIYLIAQPKLQDTAIAQFPSHPLTWEIIRKRLSENPNQP
QLQLILAQHAYTQPGIVGVLDELGKQTNLQPQDWEVIGTAYWENNQFLKA
ANAYAKAPKTARNLYRTARGWQVGGKNREQAISTYKQLVQQFPDARETGL
GLVRLAEMTKTNKDALPYLNQVIAKFPEQASQALVKKAEILTALKDEKAA
QQTWQQLITKYAKSDEAAEYRWKNALEKAKARDYTSAWKWAQPIVINNPN
SILAPRAGFWLGKWAAAVGKQQEAQTAYEYVISQFPYSYYAWRSANLLGL
NVGNFDNVRQLTPEVIPYQRPIPPTGSPAFQELYLLGQDRDAWLQWETEY
LNKQEPTVAEQFTEGLMRLARGEYLSGINLISKLEDRETPEEQAEYQALS
KQITYWQARYPFPYLKEIEKWSSERELNPLLVTALMRQESRFEAKIKSVV
GATGLMQVMPDTAKWIASKIPLDIKTINLENPNDNVMLGTWYLDHTHEQY
GNNSMLAIASYNAGPGSVARWLKTLPNQDPDEFVEAIPFNETRDYVRQVF
GNYWNYLRLYNPEIASIVAKYSAEQPKLPGN
>Ava_2432 conserved hypothetical protein
MNNLLPHSRKITRKSARIKSYEPRKCRIALYSHDTMGLGHKRRNLLIAQT
LGFSPLQTDILMISGIQDASSSPTPPGVDCLTLPALHKNIDGEYQARKLD
LSLQEIITLRSQVILATIKTFKPDIFIVDNVPRGAVRELDPTLKYLRREG
NTRCILGLRDILDEPASVSRDWKRAANEEAIQTYYDQVWVYGDRNIYDLA
KEYHLQPKTAAKFRYTGYLDQRNRLKYLNSDIVQSFKSLNLPSERLVLCL
VGGGQDGAQLAETFAHAELPPGMNGIILTGPFMPREVRQKLHNYAAQRDN
LRVLEYLAEPTMLLHQAERVIAMGGYNTTCELLSFGKRSLILPRVKPRKE
QLIRAERLKKLGLIDFLHPDKLTSAALTNWLNLDIQPPPVRKFVDLKGLT
HIPQFVHEILMSTHQAPPQAKAS
>Ava_1359 von Willebrand factor, type A
MILKGSWLNTRLVAVLGVLLLTACSSNPNSTDNFTGLKIKVLVGSALGDF
CNQAAKNFNATQPKLDNGNALRVECEAQGSGDVVTKLLGLTTQLKNGTLQ
PDGADFPTIISLDGDIYHSQLIYRINQVFPGQNYIPEITDAPLLANSPMV
FMAQADVAGGLQKVPDAYKALVTAKTHRDIDPASPSLTVNYVHTAPTRSN
SGLQTLVAQYTSVSGKRPEELTIADVQTFQPQIQQIQSKITRYGVSTNSL
AQAMVKNGPFWASVGSVYESSVIAANSSLQPGQERYQAVYPKTTFTSNMR
AIVPNAPWVSADEKAGAEKFITYWRSPDTQKIAPDLGLRPGTPGVALGAK
FSPEFGVVAQAKYDSLRPPKPEVVDAMLKSWQEASKKPSLVVVVVDSSGS
MEGNKLPAVQNTLQNYIKNLGKKEQIALIDFDSEIREPVLVDGTPQGRDR
GVQFISGLRADGGTKLYDAAIQARNWLQKNRRQGAINAVLILTDGEDSGS
KISLDNLSAELQKSGFSTDQRIGFFTVGYGEEGEFNPDALKKIAELNGGY
YSKGDPETISRLMSDLQVEF
>Ava_2089 Small GTP-binding protein domain
MVRLKPWQWVVLAMPIASIMIFLLVSAGMQIHTWGISWIWAVFTLVFVAW
RWLLVKWTQPAIRQIEAALAEVKEELKSAVEDTTPSVSSDKTQQIEAALQ
EILTIAQGDRPIWEDWTTFWQRCQDLVRAIALIYYPQVQYPLLNIYVPQA
YGLIRGTVDDLDQWMQKLSPALNQVTIGQAYQAYEVYRKLEPSARKVWRA
WNWAQWLLNPVAAAANRATKGTTNQANQQLLVNLGQLLREAALNNLCRQA
IALYSGTTVNISTATISTPTLPKTKTQTLENILAQAQPVEKVAQKPVNIL
LAGRTGAGKSSLINTIFQSNLAEVDVLPSTAEIQNYHWQTQDGETLNLLD
TPGYEQVKRGDLRDLVLEYATKADLLLLVTPVLDPALQMDVDFLQEIKTT
VADIPAIAVVTQVDRLRPIREWQPPYDWELGNRPKEIAIREATEYRAKLL
GDFCNLVIPVVTGDTKTGRLAWGIEALSLGLIQAIAPAKQLRLARFLRNL
EARTVAAAKIIDHYTLQMATTQGLTALLKSPVLQFISTISTGSPTLAYLL
AEQIPVEQLPIVIGKLQMAYDLFSLLKTEESSTLNFDLLSLWPLLLENPT
TPDRNAWAFGHAVVEYWTQNLTVEQLRQRFDYYLQQV
>Ava_2533 conserved hypothetical protein
MQQVAADLEIDFKSEKYKDAYSRINAIVIEGEQEAYENYIQLSQLLPDDK
EDLIRLSKMESRHKKGFEACGRNLQVSPDIEFAKEFFAGLHGNFQKAAAE
GKVVTCLLIQSLIIECFAIAAYNIYIPVADDFARKITEGVVKDEYSHLNF
GEVWLQKNFAQSKAELEEANRHNLPIVWKMLNQVADDAAVLAMEKEALVE
DFMIQYGEALSNIGFTTRDIMRMSAYGLTAA
>Ava_4476 Predicted signal transduction protein containing Nacht domain
MMLDWLAVWGVTQAVGFAFKSIFEDLAKDAAKDWAKDLLKAVPNNILQKL
QKEDIETAAGKALKEFLQLMQQELEDADLDEAELQRYNQPLTTFIHNQSL
QHLLGLPFQPDCQVIDYQSLVTAWYADNLLPLPPAFDWERLAKRYLKKVK
AIIRESDKLRPIFDSQHLEEARDSLQQMAGIPTEFDLLGYQEGLRERYGN
LKLDSLDTTGYAYNELKLWRMFIAQNVREVHQVLPQVHELPKEHLKRLRE
SNQIEDISLDELTYYKQVYIEQPTFSILDIINNRQNYQYIVILGDPGSGK
STLLQFLALNWAETPLGNAIYQPLPLLIELRTYMRRRENNECSNFIDFFH
KSSGIVHHLNQHKLHEQLKTGNALVMFDGLDEVFEPGKREDIITDIHRFT
NQYPDVRVIVTSRVIGYKPQRLRDAEFRHFMLQDLEPEQIQDFIHRWHEL
TFCDEGDRRRKKERLHRAIDTSHAIAELAGNPLLLTMMAILNRNQELPRD
RATLYEQASRVLLHQWDVERALVEDYRLDPKTIDYKDKQAMLRQVAYRMQ
TSEKGLAGNLISTGDLEKILIRYLKNIEFEQPVIVARVMINQLRTRNFML
SYLGADYYAFVHRTFLEYFCAWEFVWQFKETQTLSIKDLNYEVFGKHWQD
ETWHEVLRLITGMIEPRFVCEILDYLMAQNGETEKFINLFLAAKCLAEVR
NRLLVASTATKLLHQIKALTKYDLGYYYQPYLDEEETQLVQEIRTKAVTS
VATTWKDDPETLTWLKQLATADEREYVRSVALQELARSFKDDPNTLPWLK
KCATTDSDGTVRQAAVQELAKVFKDDPDTLPIVKQRATIDENEYVRQVAV
QELARVFKDNPNTLPWLKQRTIEDNSGAVRQVAVQELARGFKDSPDTLPW
LKQCTTAHDWTVRQAALQEIAKVFKDDYDTLSILKQSAAHDENEYVRQAA
VQELARGFKDDPDTIPILKQSAIADKSFDVRQVAVQELARGFKDDSQTLS
WLKQYATIDGDKYVRRAALQELARGFKDDPDTLPILKLRAIVDTHADVRR
AAVQELARGFKDDPDTLLILKQRATTDDNESVRRAAVQELARGFKDDPDT
LPILKKRATSDKYANVRQAALQELARGFKDDPDTLPILKKRAMTDQHLDV
RHTALQELTRSFKDDPSIFEVFYNCAVNDPFTREYNFQINPRQLALETII
EQYPHHPQTLQLLRDRATNDPDEQVREFASKKLQQVL
>Ava_0654 Major facilitator superfamily MFS_1
MVNSVNTRPEILWRQVWGLAALLAAIIFSWMAYKFYQPRILEELEFVGLV
RWLGIWQGLLIAVIEPMIGGLSDRIQQRLGSRLPMISLGVVLAGVIFVAV
SLLVQQNLPIGIRWVVPMLMTVWVIAIIIFRAPAIALLTQFAPKSELPQA
NAVLVFVLGLIGAISPVLNTLLDNMGASITFLVGAIALILAAYILQLFTP
KHLLHISSFNLEPTAKTPVQMFILIFIIGLGTGIELNLLLSIFPQELQIQ
LPNLTVEFIASAILLVSAIVSVPLGDWTTQLGANKSMLLGLGAMTAFMGL
ALLNDSDKLAIAFILAFGISFSLVFISMIPLVLSKVHPSRAGLGTGLYFG
GSAGGTAIVSFLIKELGNTSIGAFLLAEFAFVLVGVCILLSRRFRILPD
>Ava_0130 Peptidase M48, Ste24p
MKRISKSLLLTLNWILLSVGTTLLILLTQPIAPVPAQEPPAPTEQTNTPP
PENPNDQKLRDALKRSSASEPPLSPEEIARQQKLIEADKLYLAGQIAEAE
KIYREVKTPFTQTSASKERKPAILNPIELSPAGKVYWREAEAGIAKKLQT
RTLVPLQLLVEQYPEFIPGHIRYAEVLTQYDRQTEALDILERASSLYPNQ
PELTQARVSALAESKKWMEASLAARQFAILNPNNPQAPEFTQLAEKNLQR
YQSYVREEIRGNLISNVITGALGYAVTGSLLGPFSALDSTILLLQGEKSI
GESVAKQAKKQLPLIIDDDILAYINDIGQKLAKVAGRNEFKYEFFVIPEE
ELNAFALPGGKIFINAGAIAKTTSEAELAGLVAHELSHVVLSHGFQLVTQ
GNLISNVTQYLPFGGTIGQLFAMNYSRDMERQADILGTRLIVASGYAADG
LRNLMVTLDKQKKYDIPTWLSSHPGGNERVSYLENLISRNSYNRYAYEGV
QRHAEIKVKVNKLLKQKKEEEEKKHTTE
>Ava_4052 Dinitrogenase iron-molybdenum cofactor biosynthesis
MKVAFTTSDGTHIDTHFGVAKEIDVYEVTKDGFNFVETLTFEGDLEEATH
EDKITPKMEAILDCKIVYVKAIGKPVGNKLIKQGITLVRAQEYDKIPDIL
HVLVQSLNGDAPPLLRKALSEENKELVYAYDEE
>Ava_4071 GCN5-related N-acetyltransferase
MVNKVALVEEVTIKQANIKDAERITTLGEQLGYSLTIQQVEQRLNKIQND
AEHIVYVATLANEYVIGWAHAHICDLLIMPKQAILLGLVVDKDYHHHGIG
RYLMQYIEQWAVLKECDGVMLRSNIKRKEAYLFYEKIGYINIKQSLAFYK
QLI
>Ava_3787 hypothetical protein
MTKRNILNDRGQKSRWPNWLPYPSCWLKSFVLMLFVRVVTFVGENLVRFG
YNFAKFISSPELFAIFIILALLSPIAVISFTHHYLHLLLGRFFAEIQAPE
VGDVKGLVPTLMSWWEGLYGWVVIALSTLVAALLCTFILPIFNLSYTNPL
EIYTQFERQIIVIFGIFWLITGALIYQIDYLVRHRLISVYSSNQKSS
>Ava_0307 FHA domain containing protein
MTGKNARHNAFLRLVSGNGAAFGSESRYSLTSKEVVIGRDPSCQVVLDAM
MYRMVSRRHAVVRPVASSVDSKFSWVLCDLNSANGTYLNGQRLYGCQELH
AGDRISLGADGPQFLFEYAVAPQPTIITNQVAPLPSAKHPPKPDSVSFTQ
LFPIISTGKDLTRKAYLVPGILTVVFVVLMFATVGRPQANQVIVATYIAL
AAYYFIYQLCGKPKPWWVLVGAALSTTLILLSPLLNLFIFIFREVLPGSV
PPIDQPVSSLTELFVGYFFGAGLMEELLKALPILGVYLIALGLPPAWRER
VGVWEPLDGILLGTASAVGFTLLETLGQYVPAASLQAGTEVGLQVLIARI
LGLPAGHMAYSGYLGYFIGLAALKPRHAKQILAVGYLSAAALHALWNTAG
HSNNLLLVVVGVLSYAFLMAAILKARALSPTRSQNFATRFLDPK
>Ava_3300 RNA-binding region RNP-1
MSVRLYIGNLPKEEIDRQELQAVFAAEGDAVTTKLIKDRKTGKCRGFGFL
TVNNDEQADQIIEKYNGQMFKETPIKLEKALPRTKGDEGEEQATPKPATT
SGGHAAPNTNKEGSRRDKGAKKSRRGGGNRENTTTTTTDSDAIRPDPRWA
SELEKLKQILAAQATN
>Ava_0409 Protein of unknown function UPF0118
MQTLKLLNWWQTLTPIARIGAIALFAPLLVLNGWAISAIFDYFHSLIVIL
VGASVLAFLLNYPVSWMAHRGARREQVAILVFLLALSILLALGVTLFPLA
LTQAQQLVARIPELIDSGRSQLMMLNEKAETVGLPINLDALVVQINDRVK
GQLQAIAGQVLNLAVVTFTSLLDFLLTMVLTFYLLQHGGELWQSLVEWLP
TKFRAPFSQTVRLSFQNFFITQLILSTCMASALIPAFLWLKVPFGLLFGL
TIGIMALVPFGGSVGIALTTSLVALQDFSMGVRVLIAAVIVQQILENLIA
PRILGNFTGLNPVWILISVLTGARVGGLLGVIVAVPTAVIIKTALTALRP
SSVNNETEVTDTTEITATIVQKENPKNTLSISEPTTP
>Ava_2162 Cl-channel, voltage gated
MYFRSWLQPRRGLAIAEACVIGVLAALSAVFLKVCSGLLGAWRVHSSHVL
PAWVVLPIIGLGFGYLAGLMVQRLAPEAAGSGIPQVKATLANIPMKLSWR
VAVIKLLSAIIALGSGITLGRQGPTVQVGAGLASGMSRLVPTSPDHRRQM
IAAGAGAGLAAAFNAPIAGVLFIIEELLQDLSGLTLGTAIIASFIGGVVS
RLLGGRSLNLNIELLSYSSRFTFPEIPFFLLLGVLAGLLGALFNRGLIFS
LQFYRSLHISLPLRVGLAGLVSGIVVSLLPESFRDNAGLREYVITGDLNP
SFAAIAFVAQFTLTLVAFGSGAPGGLFAPSLILGSALGHLVGVLEYQITG
DGSPVTYALAGMGGFFSAVSKVPITAIVIIFEMTTDFNLVLPLMIVSVTA
YLVADKVVPGSLYEKLLLLNGITLTKQMSVEGILSQMTAKDVMQQRVETL
DADITLEEAKQAFASSHHRGFPVVEDNKLVGIITQSDLTKSLSRNLENHP
HLREIMTANPMTVTPIHTLSNVLYLLDRYQISRLPVVDGQKLIGIITRAD
IIRVEADRLNCENPNPGPQPEPSYVVYQTRSPSTGRGRILVPVANPETAA
TLLKMAAAIARDRHYEIECVQVMLVSRHSSPSETTVRTAKSRRLLRQAEV
LAKKWRIPLHTQIRVAHDPAHAILETIKDRHIDLILMGWKGSTSTPGRIF
GNVVDTIIRQATCDVVLVKLGTSPIPNPPFNRWLVPMAGGPNARIAIKLL
PALVTLSDDPQIRLTRVFKPWEFRPDMTVLEQAIRQLMRRRQLSSTVIAA
PVQADSVVEGVIKLVKTEGYDVVVLGASREGLLQQAIQGNIPEAIASGVD
ITVILVRGAIES
>Ava_3720 conserved hypothetical protein
MKIEEVSTLEFSFNQLLETLLVKKLENEQFTVKLSSERSQFTRFNQAKVR
QTGFVADGWIELTLMTDQRSSFRQFPFTGNWEKDWQSAYQALQELRDELP
LLPVDPYLVLPSGNNTSRETHIGNILPAEAVVTNVLEQVRELDFTGIYAG
GLVIRAYGDSSGQKHWFATDSFTLDYSLFISSGQAVKGTFAGSNWDQEAY
SVKINEGKQQLKLLANRAKEVPKGQYKTYFAPAAVADLLSMLSWGAVSEA
DIQQGNSALAILSRQDKQLSPKFSLRENFQLGLVPRFNQLGEMAAPELSI
IDKGILVNTLVNSRTAKEYQKIANGANSSETLRAPEVIPGNLTYEQILPS
LDTGLYVSNLHYLNWSDRPTGRITGMTRYACFWIENGEIIAPIENLRFDE
SLYRFWGENLIDFTNIQEFIPEVGTYESRQLGGSLVPGMLVNDFTYTL
>Ava_0346 TPR repeat
MISLSTPNHYHQRAEKYFFTGNYIQAASLYEEAISIEPDIKSYYWYLGLI
LLLQGQEVEAQTTWFMAMMDGEAEQVEEWNTELVTILKTEAERQEEREEY
SVAEKIRRNIKEISPQDIHNLLHLVLLSIKLEIYTGNDLQELKVIDILQG
NSTVEVSLDLLTQLVKATLDYAPVHPSTLNFLETCLPYFKKDPKDLLIIL
MPSAIEIAYSKRLPGIAAQMCELYLKLSPNNTEIIGQLSSFYQNSNQFKK
GIETARLYYSLVEEFADKVFANRQVLRGLISAGGHWEESCSVNKIQESLI
ASLIENQPTYLDNVRVSRLFNANFYSAYLQDNPRKNRNIQNCLLQVCQAN
IDNYAKEQVEKYIYGHLERRKQERTNNKIRIGYLSYCLKSHSVGWLARWL
FEHHDRSKFEINAYFINTNPDLDPLHEWYLSKVDKAYKSTNTSELAEQIY
QDEIDILIDLDSITLDISCEVIGMKPAPVQVTWLGWDASGSPSVDYFIAD
PYVLPESAQEYYKEKIWRLPQSYIAVDGFEVSVPTVTREELDIPHDAVVY
LCGQRGFKRHPDITRLQLKIIKEVPNSYFLIKGISDEDSIKLFFDGLADE
EGVDTSRLRFLPIVMSESIHRANLDIADIVLDTYPYNGATTTLETLWMCI
PMVTRVGEQFAARNSYTMMMNAGITEGIAWTDDEYIEWGVRLGKDEALRQ
QIAWKLRQSRKTSPLWNGKQFTREMEKAFTQMWEIYTSS
>Ava_3224 Fe-S metabolism associated SufE
MSSSLDSLPPALAKLVQRFQRATDPKRRYEQLIWYAQKLPEFPETGKVPE
NKVPGCVSQVYVTAHLNDGHVAYEGDSDSQLTKGLLAFLIEGLNGLTPTE
IVQLTPDFIQATGLNVSLTPSRANGFYNIFKTMQKKALECKLEP
>Ava_2004 TPR repeat
MNSHSFLASGDQQNNCYQFIAKTKLDPQENGARGDESGFGDGYLRSCALR
SAQQGNYSEAIALLNQLINRHPDNAVDYNNRGLIYFQCGHTQKAIQDYNT
ALHLNPDLASAYNNRANYYAACGQLAAALADYDRAIDLNPRHVRAWINRG
ITLRDLGEYDQAIENFDIALVFGQLEGHIWAERGRTYHLWGDWNCAIADY
RRAETQLPSLHGRRDVPGYRLRLQMENWLNELLPSA
>Ava_5030 Alpha/beta hydrolase fold
MSTITTQDGTNIYYKDWGVGKPVVFSHGWPLNADSWEAQMLFLADHGFRC
IAHDRRGHGRSSQSWEGNEMDTYADDLAALMEILDLNGATVIGFSTGGGE
VARYIGRYGTARVAKAALISAVPPLMLKTENNPNGLPIEVFDGLRAGSLA
DRSQLYKDLASGPFFGFNRPGAKVSQGMIDWFWLQGMQAGHKNTFDCIKA
FSETDFTEDLKKFDVPTLIIHGDDDQIVPIGAAAIAAAKLVKNATLKIYP
GAPHGLTNTHQDQLNTDLLSFLKE
>Ava_2519 Serine/Threonine protein kinase
MIGYILRRRYKIIKQLGSGGFGETYLAEYPQDLPVSPQYKCVVKRLTRPQ
TPDLDTEERFKREAAILFRLGKEHNQIPELYDFFEENREFYLVQEYIDGH
DLSYEMEQGKPWSEADVIQLLQEILEVLDFVHQNNVIHRDIKPLNLMRRY
SDNKIVLIDFGIVKEISALGVNAQGKISSTVPIGTRGYMPSEQFHGHPKL
CSDIYAVGMTAIQALTGLPPQELHIDPDKLEVIWREKAQVSNILADILTK
MVRYHSKLRYTDASAVLEALKQTQFSLQTSANSPKLKVSNQSQLLLLLLK
PIQIADKYGYIDQSGQVIIQPQFNEAHNFAEELACVQFDNKKYGYIDLSG
RVVIEAQFHEAWSFSEGLAVVQIGDKYGYIDKNGTLVIQAQFDYAHDFRN
GRAMVEIGKQKYHINKMGNFIKIIGMLFQNQMK
>Ava_4859 Sugar fermentation stimulation protein
MIDWLYSYPPLYPGILLKRYKRFFADVQLASGEVVTAHCPNTGPMTGVST
LGSVVQLSKSANPKRKLAYTLELIQVHDNEPTWVGVNTALPNQIVKLALA
KYLFPELGSYNHIKSEVVYGVDKKSRVDFFLTGSDTERPIYLEVKNTTWA
KGTLALFPDTETTRGQKHLRELMTLLPQTRSVMLYFINRGDCTEFAPGDS
TDPIYGKLLREAIALGLEVLPCRFDVTPEGIRYLGLAKLVI
>Ava_2719 conserved hypothetical protein
MEISYHHIKYPLRPPTIIQDFPILETDRYILKLAETEEELASIFCLRFEV
FNVELGLGLADSNLTKMDQDEFDTVCHHLMLISKLTGKTIGTYRMQTYKM
ASQGLGFDAADIFELKTIPESVLKVSVEVGRACIAKEYRSFQSLLLLWKG
LADYLILNCSKYFFGCASLLTQCSWEAACAYHYFEKHNLIHKDILVFPHA
QFYVDIPHKSSDVCRVDIPNILQAYLNVGARICSLPAIDREFKTIDFLTI
ANIKEFTRWNYPNILDK
>Ava_1802 Flavin reductase-like, FMN-binding
MVALTEKTEKRLTIQTAEIAQDTTAIRSLDWERDRFDIEFGLQNGTTYNS
FLIRGEQIALVDTSHEKFRKLYFDTLTGLINPTDINYLIVSHTEPDHSGL
VKDLLQMAPDITVVASKVAIQFLEDLVHQPFKRKIVKNGDRLDLGNGHEF
EFVIAPNLHWPDTIFSFDHKTQTLYTCDAFGMHYCSDIVFDEDLKTIEPD
FHYYYECLMGPNARSVLSALKRMGELPSVKMIATGHGPLLYHNVEELTGR
YRTWSQNQSKAETSIGVFYVSEYGYSDRLAQGIINGISKTGVGVEVVDLG
SAVDLQELRELVGRCAGLVIGMSPAASAASIQGALSTILGSANEKQAVGI
FETGGGDDEPIDPLLSKFRNLGLTTAFPAIRIKQTPTENTYKLCEEAGTD
LGQWVTRDRSIKAMKSLGADLDKALGRLSGGLYIITAKKGDVSSAMLASW
VNQASFKPLGFSIAVAKDRAIESLMQVGDRFVLNVLEEGNYQPLMRHFLK
RFAPGADRFEGVKTQPAENGAPILGDALAYMECEVVSRMDCGDHWAVYST
VYAGRVSKPESLTAVHHRKVGNHY
>Ava_1042 Methyltransferase FkbM
MKTPVTFLIFNRPDTTEKVFQAIRQAKPPKLLVVADGPRSDRPGEKEKCA
AARAIINGVDWECEVLKNYSDVNLGCKKRVSSGLDWVFDTVEESIILEDD
CLPNLSFFQFCEELLERYREDNRIAVISGQNVQFGHKRTHYSYYFSRYNH
CWGWASWRRAWQNFDYDMQLWPLIKENGWLKDILKDEIAVKYWTKIFQDT
YDDKTNSWAYRWTFSCWTQNHLSILSNINLISNIGFAREATNTKKDVSIF
SKIPTEEIYFPLKHPPFIVQDIESDHFTQRTLYCPPLINRINTKIKNIYS
DILGVYYLIFMFLKQLILKGIKKSSNFICRALGRDNITFTKIFPRNDLVE
LGTKYGGWVIPKNLLDSGSIVYCVGCGEDISFDLSLIDKVGCNILGFDPT
PRAIQYVKKVTINNTKYHFNEVGVWDKDDVLKFYAPKNSDHVSHSLLNLQ
RTSKYFEAKVKRLSTIMQEHNHKKIDLLKLDIEGAEYKVINSIIEDELDI
KIICVEYDEYFNPLDSSYKLRIKQSINKLVNAGFSLVCTQGNGNYTFVKR
>Ava_0758 ParB-like nuclease
MVRVQEIPLNQIRRPLPRGNDPYKVQALMESIAAIGQQEPIDVLEVDGQY
YGFSGCHRYEACQRLGKETILARVRKAPRSVLKMHLA
>Ava_2243 Protein of unknown function UPF0227
MTQYIYLHGFASSPQSAKAQDIRQRFAQIHTQLTIPDLNAGEFSQLTITR
QIQQIAATFPDDSVPITLIGSSLGGLTAAHLGQRYLQVQRLVLLAPAFGF
LSHWLPKMGEGAVQSWQQNGYYPVYHYGEGTSLPLSYNFVTDAAQYQEDH
IQRPIPTLILHGKHDEVIPITASRDFAHSRPWVELIELDSNHALGNVMEE
IWQAISLFCQLPESGKI
>Ava_1514 conserved hypothetical protein
MKAPWAVWGILVLAIFFLIGSITAPYPKIADYSSVLLASSIRSFFVAVAG
AFLFFIMIAWFRVFLDTLLVISAALLARIDFQSDGFTEWQAFIATLIVSI
AGVGLGAFTNMLLTQKIMLGL
>Ava_2534 conserved hypothetical protein
MFGLIGHLTSLEHAQAVAQELGYPEYADQGLDFWCSAPPQIVDHIKVTSI
TGEIIEGRYVESCFLPEMLASRRIKAATRKVLNAMAHAQKHGIDITALGG
FSSIIFENFKLEQFSQVRNVTLEFERFTTGNTHTAYIICRQVEQASQQLG
IELSQATVAICGATGDIGSAVTRWLDAKTDVKELLLIARNQERLQELQSE
LGRGKIMSLDEALPQADIVVWVASMPKGVEINPQVLKQPCLLIDGGYPKN
LGTKVQYPGVYVLNGGIVEHSLDIDWKIMKIVNMDVPARQLFACFAESML
LEFEKLYTNFSWGRNQITVDKMEQIGQASVKHGFRPLLV
>Ava_0826 GCN5-related N-acetyltransferase
MANFAENDIIYVRELGIDDIALVYHLGEELFTSDLYPYLYRTWDEWEVIG
LYNTDPEYCLVAETDGELAGFILGTIITKASWTYGYILWLGVNPKFQRQG
VADKLVDKVVARMIEDGARFMLVDTDPTNTPAVKFFNRKGFGNIRQHIFL
SMNLSKHPYYGRLIDYEHQKAERAGYRRSRPAIRARKADGVANEVVINPL
INENQISDEQSPL
>Ava_4976 conserved hypothetical protein
MEIADFLGLIHPAIAVIFVFPLIGIVSNFAWQTRQRRFQTLAGGKSKIPP
IVGTEHRRIGEWLSSAVVGISLVGLGYAIGKNIIKNQLWNKNLTQVIFIL
IMFVLTIASLVFLYQAKQKLWRGIFATLTGVGLVVIGCQDGVFRRTNEWY
WSHYYIGIAASLLMIFSVAIVQDIYQDKTHRWRIIHTILNTIALLLFIGQ
GFTGTRDLLEIPLSWQEPYVYQCNFAQKTCPNPPSQESK
>Ava_3596 Serine/Threonine protein kinase and Signal Transduction Histidine Kinase (STHK) with GAF sensor
MNTYVDLAIAIPGYFVLEEIYHGSKTVVYRAVREVDQQPVVIKLLKREYP
TFSELLQFRNQYAIAKNLKIPGIVKLYSLEAYRNSYAMVMEDFGGISLRD
YAQQQLLSLTEILTITIQLADILHHLYQQRIIHKDIKPANILINPETKQV
KLIDFSIASLLPRETQSIISPNILEGTLAYLSPEQTGRMNRGIDYRSDFY
SLGVSLFELLTGQLPFTSDDPMELVHCHIAKQPDQFKLPSSFPKEDYGNG
FPNGIQTAKLPNGETIPQVVGEIVMKLMAKNAEDRYQNALGLKYDLEKCL
AELQETGKVEYFTIGSRDVCDRFLIPDKLYGREAEIETLLAAFTRVSLGA
TELMLVAGFSGIGKTAVIKEVHKPIVRQRGYFIQGKFDQFQRNLPFSAFV
QAFRNLMGQLLTESDTQIQRWRGKILEAVGDNGQVIIEVIPELERIIGKQ
PPAPKLSGTALQNRFNLLFKKFTQVFTSAEHPLVIFLDDLQWADSGSLKL
IQLLMNDTGYLLLIGAYRDNEVNPAHPLMLTVNEIGKDNCTIKKIDLAPL
NQSQVNTLVAETLKCSEEIAQSLALLVFQKTQGNPFFTTQFLKALYHDNL
IQFNGDLGCWQCDIAQVNHQALTDDVVAFMAFQLQRLPRATQEILQLAAC
IGNQFDLETLAIVWESSPTKTAACLWKALEEGLIIPQSEVYKFFVGQEQE
IDHAQTTEIFTYKFLHDRVQQAAYCLIPEAERAIAHHNIGQLLLQKISPA
AREEHIFEIVNQLNYGITLITQQSQRDELAQLNLNAAQKARAATAYQASR
EYADTSIILLGETAWQRQYQLTLTIHELAAEIALLCGDVEQMNQLIETVV
NRAKCPLDRVQVYQVKIQALTSRNELLEAIATGTSLLQELGVSLPDHPTP
EDVQQAREEINRLIGDRHLEEFINLPKMTDPEKLAIMQISSSLIPACYMT
GSLLYLLIVPLQVKLSVQFGNSLFSAHGYVSYAFQLSTTWQDMALAQQLG
QIAYQLASEPEAKNVRAATFIVLGGYFYHCTAHLQDTLPIIQEGCIAGLE
TGNWEFFGYNVQNFGMNAYWSGESLSEFASQISTYYQQLLEFNLVTSANH
CFVYWESALTLLGKPTDEISLRQDAYGEKIVTEAVAANDLWRLFQFYLYR
LVLNFLLADPRAKQDAETARQYLTGCMGSVAEPIFYFYDSLIALAELHSS
PADSDSQWQRIRENQAILDKWANHAPMNYLHKWQLVTAEIHRLLEQKTEA
MEFYEMAIKGAKENKYLRDEALANELAAKFYLDWEKEKVAAVYMQEAYYC
YARWGAKAKTNDLEKRYPQLLQPILQERQFKFNAQETIAVSGTSSSTLIS
AVGSASISDTLDFASILKAAQVISTSLELDELITNLTEIILENSGAKKSA
LLLPQEDIWQIKAMTLSNLQPNSSESTQTILESQPLETCEDIPKNIIYYV
KNTQQTVVIDNLQTDIPGLIGEYMLQNQPQSVFCTPMMNQGHLVGILYLE
NRLTRGVFTSDRLEVIRLLSAQAAVSLEKARLYQESQTTAQQLKQSLKQQ
KILFDVVNQMRQSLDLNAIFCVVTQNIRRILDVDRVGIYQFHLDVNYEYG
EFVAEDVSPAFPSALAVKVQDHCFGENYANLYKQGRICAITDVQSSEVLD
CHRQILAQFHVRASLVVPIMQEEELWGLLCIHQCDRPRQWEPLEMQFAQQ
VGAQMGIALKQTDLLIQTQKQATQLEHTLQHLQQTQLQLVQNEKMSALGN
LVAGVAHEMNNPLGFIAVSLQHTQPTFADIVEHLKLYQESLPHPGDKILH
HAAQIDLDYTLEDLPKLIEAMVMACDRLKNISTSLRTFSRADKDYKIPFN
IHKGIDSTILILKHRLKSNDKRPAIEVITKYGNLPHIECFPGQLNQVFMN
ILANAIDALEDANNGLSYAEIAANTNRIIIRTTQVNNHVKISIADNGIGM
SEELKQNIFEHLFTTKGVEKGTGLGLAIARQIVVEKHGGTIEVNSQLGQG
TEFVIILPITAERANKY
>Ava_0460 Dinitrogenase iron-molybdenum cofactor biosynthesis
MKIAFATSDRINVDAHFGWAQEIDVYEISDGGYEFIETLSFNKETPKPSE
DTIEKGEGGCKHGKSDCKKAKKEEEQKPKVQNGESDDKVAQKIAALSDCK
IVYVASIGGIAAAKLIKKGVMPVKPRSKKEDIIYLLNRLVQTLKGNPPPW
LRKALRPNQESLGELESV
>Ava_3781 Pirin-like
MAIVQLIEPEVKDLGGFVARRSLPYPHRQMVGPFIFFDHLGPSVLPPNKG
IDVRPHPHINLATLTYLFDGSIMHRDSLGIVQEIQPGAVNWMTAGKGIVH
SERSPDFDRHNETTIHGIQTWIALPVEYEETDPWFTHYPAETLPTWNDNN
VTIKLIAGEAHGHTSPVKVFSPILYLDGVLSANGHFTIPTDYSERAVYSV
TEGISINDEPLEAYRLAILESGHEVKVSATDAARCIVIGGEPLGTRYKWW
NFVSSRPERIEKAKADWRDCRFATVPGETESIPLPEVVTEANPL
>Ava_2428 Short-chain dehydrogenase/reductase SDR
MAEKQTLQPPQQQKTPGTESKMQPKPQADDARYLGSGKLKDKVALITGGD
SGIGRAVAIAYAKEGADVAFVYLSEHGDAEETKNLVEEQGRRAVSIAGDI
TDEAFCQRAIQQTVDEFGKLDILINNAAEQHPQESIEDITKEQLERTFST
NIFSMFYLTKAALKHLKQGSAIINTTSVTAYKGSSHLLDYSATKGAIVAF
TRSLSQNLISKGIRVNAVAPGPIWTPLIPSTFPTEKVETFGKQVPMQRAG
QPEEVAPSYVFLASDDSSYMSGQVLHPNGGEVVNG
>Ava_4855 Serine/Threonine protein kinase with WD40 repeats
MLCCLNPNCSMPQNPDGKMYCQRCNTQLIPLLRGHYRIIKVLSDEGGFGR
TYLSEDIDKLNELCVVKQFAPKVQENSAMKKAVELFKQEAQRLQHLGEHH
QIPTLLAYFEQDNYLFLVQQFINGNNLLQELRQGVVYNESTIVEFLLDLL
PVLKYIHERGVIHRDIKPQNIIRRQSDGGLVLIDFGAAKQLKATMQTQLG
TTIGSLGYTPIEQMQYGKAYPASDLFSLGATCFHLLTGINPSNLFVEQGY
SWVESWQQYWNTLDSDRKEGEYLVKVLNKLLETDIQRRYQSADEVMNDLI
KQRSLLSRLKTTIPKSALFSTSWSASTSLTASTTKKQARKSLNGKFKQQL
LINTMSALLGLVGVGHLQSLPQLITKFSEISTQPYTLKGHASDVNSVAFS
PNGEFLASGSDDKTIKVWNLKTKQKIHTLPGHSGWVWAIAFSPDGKTLVS
AGADKTIKLWNLATGTEIRTLKGHSQGVASVAFSPDGKTLASGSLDKTIK
LWNLATGKEIRTLSEHSNVVANVAFSPDGKTLASGSWDKTIKLWNLTTNK
VFRTLEGHSDLVMSVVFNPDGKTLASASKDKTIRLWNLAAGKTIRTLKGH
SDKVNSVVYVPRNSTVLASGSNDNTIKLWNLTTGEIIRTLKRDSGYIYSV
AISPDGRNLASGGSAENIIKIWPMSW
>Ava_1214 NADPH-dependent FMN reductase
MARILAIAGSPSHPSRTYGILEHTAKLLQEEGLHVDILSVRDLPAEDLVF
GKYDSPALEQPKDLLAKADGVIIATPIYKAAYTGVLKAFLDLLPQKSLTG
KVVLPIATGGTIAHLLSVEYALKPILGELGARHILATVYAVDKQIERQTD
GSVQLDAEIDQRLKEVLKDLVKSVKHSAIATQELVNAN
>Ava_0755 Homoserine dehydrogenase, NAD-binding
MAEATIRIGIVGTGYAAKLRAEAFLEDKRSHLVAIAGSTLERTQALAQDY
QAEAITGWQQLVEREDIDLVVICTINRDHGAIARAALTAGKHVIVEYPLS
VDLTEAEELIALAKAQQKLLHVEHIELLGGLHQALKQNLDKVGHLFYVRY
STVNPQNPAPRKWTYNHEMFGFPLIGALSRLHRLTDLFGKVFTVNCHQRY
WEIEPEYYQTCFCMTQLCFTSGLLAQVIYGKGESLWQPERKFEVHGDNGA
LIFDGDTGFFIQSGESTAIEVGSRRGLFAKDTSMVLDHLLDGTPLYVTPE
ESLYTLRVADAAQRAAQMGITIFLTDKLN
>Ava_4266 Peptidase S15
MFDVLGKQTASMYTRDGVRLDADIYRPDGDGEFPVLLMRQPYGRAIASTV
VYAHPTWYAAQGYIVVIQDVRGRGTSQGEFKLFANEIADGEDTVNWAASI
PGSNGQVGMYGFSYQGMTQLYTAIAQPPALKTICPAMIGYDLYTDWAYEG
GAFCLQTNLAWAIQLATETARLRGDKEKYQALFTASRNLPLNNPEILQQL
APESFYHEWVTHPQPDSYWQELSPKTHFQGVDLPMLHIGGWFDTYLRGTL
HLYQDMTARATTPQHLLVGPWAHIPWGRKVGEVDFGVNAVSPVDEVQVRW
FDYFLKDIDTGLLDELPVSLFEMGSNMWLQFPIITISNQKIYFLSTTGLA
SIREDSGTLIPNPHYPLSPEGTPPSPVPSSPDVLVHDPWRPVPSVGGHAA
IPAGVFERSHIDCRSDVLTYTSAPLIADLHLLGDAIVEIVCSADQPSYDL
CAVLSQVYPDGKVYNFTQGYLHCPDGMNHTTRRIQLQTTCIRIPQGNSLR
LSLSAACYPAYAMNSGNSPVLDSDSLFHAQIITISVTCRDNNSSQILLPI
TQE
>Ava_4682 WD-40 repeat
MLIDFGLARQFISGTVQQHTQSFTPGYAPPEQYAPIEERGEYIDVYALAA
TLYSLLTGQLPMPAPARLQNFTLQPPKDLNPSVSDRVNEAIMKGMALNYK
FRPQSVQEWLDLLGAGIVVPTQPVISSSNTYLSAKISPTQPITSVSQISS
SWECIHVIPAVSGKIAFSPKENILASVSSGGWDSNIKLWEALTGREIYSL
TGHSWSVYAITFSNDGQILASGGGDGNIKLWEVVSGQEIRTLTGHSWAIY
AVTFSSNRVVLASGSGDKTIKLWDLATGQEISTLTGHAESINSLAFSNNE
LTLASGSVDKTIKLWDLETGKEIYTLTGHSGTVNSICLSNDGQILASGSV
DKTIKLWDLETGKEICTLIGHLESIESVTISSDGQILASASVDKTVKIWE
MATGKEVFTLSHSSSVNSIAFSPDGNLLAAGDSGGNIKIWRRS
>Ava_0417 Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase
MKIAIAQINPIIGDLTGNAQKILEMAQQAIKKGARLLLTPELSLCGYPPR
DLLLNPIFVEAMSVTLQQLARDLPVNLAVLVGTVELNCQAHTTGGKPLFN
STALLENGKVKQMFHKRLLPTYDVFDEHRYFEAGQQANYFALDNINIGVT
ICEDLWNDEEFWGKRSYTANPISDLAILGVDLIVNLSASPYSLGKQSFRE
AMLRHSAVRFQQPVIYANQVGGNDDLIFDGRSFALNRQGEVMCRAKGFAS
DLITVDFDESQRDLQLSSVAPIYESEDEEIWQALVLGVRDYAQKCRFSQV
VLGLSGGIDSALVATIATEALGKENVFGVLMPSPYSSEHSISDALALADN
LGIKTQILPIGDLMQSFDHSLVELFAGTEFGLAEENIQSRIRGNLLMAIA
NKFGYLLLSTGNKSEMAVGYCTLYGDMNGGLAVIADVPKTRVYSLCKWLN
SHQSPVIPENILTKAPSAELKPGQVDQDSLPPYEILDDILQRLIHNHQSV
GEIVAAGHDPMVVDKVIQMVARAEFKRRQAPPGLKITDRAFGTGWRMPIA
SNWNAIKSNYRLSCTF
>Ava_2221 phenylalanyl-tRNA synthetase, beta subunit
MRISLNWLRELVEIKLSPEELAHILTMAGFEVEDIEDRRTWANGVVVGRV
LERQPHPNADKLSVCQVDVGATETLNIVCGAPNVRADIYVPVATVGAYLP
NIDLKIKPAKLRGVPSQGMICSLKELGLPSEVDGIYIFPQENLPLGSDVR
PLLGLDDVILDVTATANRADALSMVGIAREVAALTDAKLSIPKPGEAFVS
ESVGKLGLKIADTQACPAYIGTVIEQVKIASSPEWLQQRLRSAGVRPISN
VVDITNYVLLEWGQPLHAFDQERLKSVANADNLTIGVRCANQGETLKTLD
GQTRNLTTQNLLITANDQPVALAGVMGGEETEVHEGTQSLILEAALFDSV
AIRRSSRSVGLRSEASGRYERGVNRAELEIANRRALSLISELADGVIVHQ
EIADTRPDPTTWSRSIFLRLDRVNEVLGPINVGETGELEAKDVERTLTAL
GCELTPAGEGTWNVSVPPYRYRDLEREIDLIEEVARLYGYDNFCDTLPDK
AEAGYLPVDQELVRKLRAALRAEGLTELIHYSLVKPGEDRQIVLSNPLFV
EYSALRTDLLAGLIDAFQYNLEQGNGSLNGFEIGRIFWQEEDGLQEKDAI
AGIIGGDTSLGKWSKGSKDQPLTWFEAKGILESVFQQLGILVEYQPDCRD
ERLHPGRTASLWIGGNRLGIFGQLHPQLRRDKDLPESVYVFQLDLDVLLD
ALDKDEILVPAFKPYSTYPASDRDIAFFAPVKISVSEIEKAINKAGKGLL
ESVELFDEYRGENVPQGQRSLAFRLVYRASDRTLTDTEVEPVHNKVREAL
VEKFGVNLRS
>Ava_1856 Glycosyl hydrolase, BNR repeat
MVIVKSWQKIFALLVVLLLCIGCSKVPSTSYNPWAVVSLPTEAKLLDIAF
TENPQHGFLVGSNATLLETNDGGNNWQPLNLALDDDRYRFDSVSFAGKEG
WIAGEPSLLLHTTDEGRSWSRIPLSEKLPGNPIAIQALGTDIAEMATDVG
AIYKTTDGGKNWKAQVEAAVGVVRNLERSEDGKYVAVSAKGSFYSTWEPG
QNAWVPHNRNSSRRVENMGFSQDGLWLLARGGQVQFSDPTKPDEWLDAQT
PELATSWGLLDMAYRTPNEVWIGGGSGNLLVSTDGGKTWEKDRDVEEVAA
NFYKVVFLKPDQGFVIGDRGVLLKYQPEAAKTATTEPAA
>Ava_0154 Zinc-containing alcohol dehydrogenase superfamily
METNVKAQVFRGVNQLSYEEIPLPTLEPDEVLVQVQVVGLCQSDIKKIRY
PLYEPPRIFGHETAGTIAAVGSQVKGWQVGQRVAVMHHIPCMRCAYCMND
NFSMCDVYKNISTTAGFNASGGGFAEYVKVPGHIVENGGLIPIPDEISFE
EASFVEPTNCCLKAVKKAQIAPGQTVLVTGAGPIGLMFVMLVKYFGAKAI
ATDLLPSRIEKALNVGAEAAFDARDPDLPAKIHALTNGLGVDVTLLAVPS
EKAFFQALDCTRKGGKILFFAEFPDELEIPINPNILYRREIDLMGSYSSS
YRLQNLSADIVFNRRIDVQALISDRYPLKDLSAAVDQAIAPTPETYKILI
YPKIGD
>Ava_5026 Protein of unknown function DUF81
MDFFLLPLFSFLVGIVVGLTGIGGASLITPMLIFAFQVPPAVAVSSDVVA
ATLMKVIGSVKHWQQQTVDREVVKWLALGSVPGSLLGVGILHLIKLKAEH
SLNDIMLHLLGVTILSVTILALMQMLLLTFFPQLQLPELPKFDLTTQLGR
LQTVIVGAFLGCVVGLTSVASGSLFALVLIAFFRLDARKLVGTDISQAAI
LLIFTAVGHLTLGTVDWNLVLPIWLGSVPGVLIGAKVCQIAPQKPLRFMV
YFILMLVSLKLVY
>Ava_5061 Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase
MKSYLAAAIQMTSVPDLHKNLAQAEELIDLAVRRGAELVGLPENFSFMGE
EQDKLAQAEAIARESEIFIKTMAQRYQVTLLGGSFPVPVSDTGRVYNTTI
LVSPSGEELARYNKVHLFDVNVPDGNTYRESSTVVAGQQLPPVHFSENLG
NIGVSICYDVRFPELYRHLSDKGTDIIFIPAAFTAFTGKDHWQVLLQARA
IENTAYVIAPAQTGNNYGRRLTHGHAVVIDPWGTILADAGDKPGIAIAEI
NPSRLEQVRRQMPSLQHRVFS
>Ava_2119 Magnesium chelatase, ChlI subunit
MREKIDALTQNLNRTIVGKTEAIRLVLVALLGGGHALLEDVPGVGKTLLA
KSLARSLDGTFQRLQCTPDLLPTDITGTNIWNPKSGEFSFMPGPVFANVV
LADEINRATPRTQSALLEVMEEQQVTVDGVSRTVPHPFFVIATQNPVEYQ
GTFPLPEAQMDRFMLSLSLGYPGVDEELEMLQNLQHGIKVGDLQPCLTLA
EVAELREICSQVKVETVLQQYILELVRSTRQDEEITLGVSPRGTVALHKA
TQALAFLLGRDYAIPDDVKFLAPHVLCHRLIARGGRSARSIVDRLLRSLP
IP
>Ava_2821 Sucrose-phosphate phosphatase
MKPFLFVTDLDHTLVGNDAALAKLSQILTHHREEYGTKIVYATGRSPILY
KELQVEKNLIEPDGLVLSVGTEIYLDGSSHPDSDWSEILTEGWNREIVLS
VTKKFPELVLQPDSEQRPFKVSFFLHQEASFKVIPQLEAELEKYELNIKL
IYSSGIDLDIVPLNSDKGQAMQFLRQKWEFAAERTVVCGDSGNDIALFAV
GNERGIIVGNARPELLQWHSEYPADHRYLAKNFCAGGIIEGLQFFGFLE
>Ava_3253 DEAD/DEAH box helicase-like
MKNNSNYLDPIAAVEQPRQDFIRYLLTAYPLRDPHLRYGLKQQLEVPGTV
WQHPYLEGSQPYRPANSVNALVSQGVLHPEMASLFTPNQRPLYEHQENAV
KAVIQQKQNIVVATGTGSGKTECFLIPMLDMLLKEEANLAIAGVRALILY
PMNALVNDQVKRLRKLLCSQETIKIRFGFYTSRTENDRHKAEEALAEELQ
AYESEELWELFTQKEKSSLNLSTRERLIDEAIAKIQKIQAISRQEIWEKP
PHILVTNYSMLEHMLIRPVERNKVFATSASTFKMLVVDEAHTYNGSTGSE
VSMLIERLKVAVEQEKPGKIRCIATSASLGDASVNNEILEFATKFFGESF
NQVIRGDRVKAIERLGEPYLLSAELTHEEILAYLSILELPKIDDDLSIWF
DRLSGFVPTEHLKAAETKAQGDVHQFLWYALKQHPIVHRLIEFLSRQPQP
WEKITQSTEFWCVNLPFKSDGTIDDTETKFALAHLLQLGTLARENSESLP
LLPVRIHLLFRSLEGMYACINPQCPGGVCDPNYEDKPRRYGRLYLNEKRT
CDDCNSPVLELGSCYQCGQAYAFTQINGVSKLEPLPRSNQGLRENDKIYT
LTSGLLDSITEEEEIGEEEQESSPTNTLIIRQRDGWIGLPTSVTFTNKSS
EPNEFYLAWHRPRDKDAKDLSGCYLPRCAACGTRPIRAQAINRFVAYTDE
PLEAMIDSLFDLLPESQEDQGSASKRKLLTFSDGRQDAAFFASDYQRTHT
ETVYRQIIWQAFQELNPDDDIVSINQLINKVKQQFLETSIPHPDRDSRKN
YLSYYPDDEESLENYRDCQDSAEARAKELLLREFALPFNRRSTLEAYALF
ACHIELRDERLIELVAREFEISNAEAKIFLIVLTDIIRRTGIVSIDGASR
YFPETGGVEGVRPEMVDIQGRSKNYLFLEKSPDEQKKFKDSPSFIPKWKQ
KTGEVSQVQNRLGWYYLQLFGDNLPNRDKFVALFKQLEQLRLLEKAKNGH
QLNWTRLNIIKTQHDWYQCDRCQQIIHVPGLSDVQKSQTKLNIFGCPAFK
CTGQLQPYYAEKIEQVRNEHYQQYIIKNRLPLPLRSQEHTAQLGVTELER
RENRFRRGQINLLSCSTTLEMGVDIGELQAVVLRNFPPHVSNYQQRAGRA
GRRTDGVAITLMYGQRRPHDRFYFEQPEQLIAGSNQIPKLDADNFQIQQR
HIRAELLAEFLKTKERGAEKVTIAEFFSLPVHKFDSLDTPPPTAIVSELQ
EWLHSDDAHNLAQLWINKLKISYAATDILNQFVAAILVFQQAQLADWNDL
VLLLKDIQDSITAETDRTKRKGIEKRRDGIEAELEKIGKRRLHDELVQAS
ILPIYGFPIDVVRLLTGESDEFKSSQGKHRLERDRRLALGEYAPSQDIVV
DDRVYQSVGILRPSDLEQKYYWVCKNCNHFESSISEQIIEQCSVCGCKPT
PAAAGKMKLYKVPKAFTTDWSVTPKVTPYTKPQRQLTSQVFLAKSGEDQE
QLESEFYKLTVNKGGILFLANQGSLGRGKGFSKEGFAICQYCGRDLSELV
QKQREANNAKGGSKKASSKASSTAHKHPITAKDCSGTGYPHIHLGHEFRS
DLLKIEFQKIANLKPLFGDVVHYGDGGTVASVTEQIYHDTDGMAFWRSLT
YALIAAAAQVIDVPRSELDGLFTPLPNQLAQIIIYDNVPGGAGYSKRIAD
HFFQVLEKALEITASCSCDTSCYDCLRTYSNQPFHAELNRKIVANFLTDL
VVKPDPELQAFAPNANRVKLSQMATRVPAICRMAGKESMIYLPSLIDTFN
LNNGSPLPWLKLLKDAVYSMQRSNIALELILNQLPEPNTAASDATRRNLQ
LLQKHLQQWVDQELLKLYQTSVNDVPILCLSTQQQNRIALQLHQQEWFET
RSGEGVDTVFQRLQNLRSQARPVEAIELEDPNTTVIFPDRTWSNLTITQL
RQRLGIDRVFSGGGVKIVYRDRYLNEKGGKILADLLQGDGINENSSVTIW
VLEDYKGELGSQRKANLEAALTQIQRMGVSTTVKVQPWYERSYFPHSREM
EVLTANGQRYKIMFDKGMDFLELKATGVYSVTESTYVVINKQD
>Ava_1544 Small GTP-binding protein domain
MPLSRIVTLIVGLIVILGLALWLIDSLSRLYWQLSYSPLLGNLLLLLLIV
LIGCLVAAFVYYVMVLQAGEQRSRRNRRRVTAAQIPAAKSDAASTTLQAV
RQQVSQIQDEVARQALLSRSKEIEANLARGEIQVVVFGTGSAGKTSLVNA
IMGRMVGKVDAPMGTTQVGETYCLRLKGLERKILITDTPGILEAGVAGTE
REQLARALATEADLLLFVVDNDLRRSEYEPLKSLAEIGKRSLLVLNKTDL
YTDEDKEAILARLRQRVLGFIATNDVVAIAANPEAAQLETGEPFQPEPDI
VPLLRRMAAILRAEGEDLVADNILLQSLRLGEEARKLIDAQRRRQADKIV
ERFQWIGAGVVSVTPLPVVDLLATAAVNAQMVVEIGRVYGCELNMERGRE
LALSLAKTIASLGIVKGAIQLLTTALQLNVATFVIGKAIQGVTAAYLTRI
AGKSFIEYFRHDQDWGDGGMTEVVQKQFQMNRRDEFIKAFVQEAIARVVK
PLTDKSEVVEHDEQTNS
>Ava_2362 ExsB
MKAVILLSGGLDSSTILYQAKADGCECYSISFDYQQRHRRELHSAFLVAQ
TAGIVQHQVVNFDLRLWGGSALTDDKIDLPQERSVDAMSQNIPVTYVPAR
NTIFLSFALAYAEAIAAERVYIGVNALDYSGYPDCRPDYIEAMQEVFRLG
TKQGREGQPINIVAPLINLKKTAIIQLGNQLGVPWNLTWSCYNGGDVACG
VCDSCRLRLAAFAELGLEDPLPYLKGV
>Ava_0242 Glucokinase regulatory-like protein
MANLQERGHLLTEQVNPLSQNLDQLSSLELVELFNSEDRKTIEAVAAAKV
QIATAIEQTADRLRQGGRLFYVGAGTSGRLGVLDAAECPPTFCTPPELVQ
GIIAGGAGALVRSSEDLEDRAEDGDAAIAQRHITQLDVVVGITAGGTTPF
VQGAINSARQRGALTIFIACVPAEQVSFTADIDIRLLTGPEILAGSTRLK
AGTVTKLTLNILSTGVMVKLGKVYGNRMVDVAVTNQKLRDRALRILEDLT
GLSREAAGFLLERSGKWVKLALVMHWTGLDKDAGDRLLSAHQGNLREAVA
SYKNQGN
>Ava_1977 conserved hypothetical protein
MLLKPDQRLKLDDTDDNLFYAYPRFVTHVDEGFIQQLTDLYRERLQPNTR
ILDMMSSWVSHLPEEIKFAHVEGHGLNAEELGRNPQLNHYFVQNLNENPG
LPLPDQEFDAVLNCVSVQYLQYPEAVFSEIHRILKPGGVAIISFSNRMFY
QKAIQAWRDASEPGRLELVKRYFASVPGFSTPEIIARKSTAPNFLQWLGA
PGGDPFYAAIAHREGFA
>Ava_4668 UbiE/COQ5 methyltransferase
MMTNDFLKNKKFIFDRWANSYDWLFPSIFYQAIHKRLLEFVDLPQPANIL
DLGCGTGRLLERLANKFPKLRGTGLDLSSNMLRQARLSNRHHPRLIFLEG
KAESLPFGDGQFDAVFNTISFLHYREPEQVLQEVSRVLSPGGRFYLVDFT
TTREKLPEILPISSQGIRFYSPHQREVLGSSAGLLCVNHHYLLGPVLLTI
FAKPS
>Ava_0118 Protein of unknown function UPF0079
MTKIFLADKESTLNLGILLGETLTAGSVILLEGDLGAGKTTLVQGLGKGL
SITEPIVSPTFTLINEYIEGRIPLYHLDLYRLEPQEVLSLNLEIYWEGIE
VIPGIVAIEWSERMPYKPSTYINVLLTYGDEGSRQAEITPFNCTISDLIA
TK
>Ava_3860 ATPase-like
MLKRIYIDNFRCLVNFELNIDAINLILGGNGSGKSTVFDALRRLQSFIIG
SHPIEMVFPIFECTRWQNLSIQRFEIELSGNQGNYKYELAVEHYQSKSRI
HYERLWFEQQPLIKYEKGEVEIFDDYHSPSPKYPFDFSQSVLSLLQPRND
NTKLTWFKYRMERFIIVQIIPILMNNSSENEEKILYSKMENFVSWYRYIS
QDQGKTAELISILKNVLDGFSSFIIERFSEKLYTLKLRFISSENHKNEYY
FSELSDGQKALLSLYTLLYCTEDEDYTLCIDEPENFLALPEIQPWLTQLY
DFCSEQKMQALLISHHPELINYLLASPIGYWFERQSNAPVRVKKISSEVA
ENTGLPISELIARGWLHESA
>Ava_3714 Major facilitator superfamily MFS_1
MTTTASKANFKNYILVTLAYWGFTITDGALRMLVLLYFNKIGYTPLEIAS
LFLFYEIFGIVTNFLGGWIGSQLGLKVTLYTGIGLQIFSLVMLAYLNPSW
AQWLAVMYVMVAQAFSGIAKDLTKMSSKSAIRLVVPQDDQSSLFKWVAIL
TGSKNALKGVGFFVGSALLGLFGFVNSLLVMAGGLLLLMFTGLMLPKGMG
KIKKKVKFSQLFSKSPEINILSAARFFLFGSRDVWFVVALPVFLREVLGW
SFYQVGGFLAVWVIGYGIIQSSAPSLIRRFGSGRPPQSKTIQFWTLTLTA
VPAAIALALQFGIPANFAIVGGLLVFGVVFAFNSAVHSYLVLAFTDDDKV
ALNVGFYYMANSGGRLAGTVLSGLVYQWLGLVGCLWTSAFLVLGAAIVSL
RLPDPQPSKAIAWKAGDGD
>Ava_4642 RNA-binding region RNP-1
MSIYVGNLSYEVTQDTLSAVFAEYGTVKRVQLPTDRETGQPRGFGFVEMG
SEAEEAAAIEALDGAEWMGRDLKVNKAKPREDRGPSGGNRGGYGGGGGRN
RY
>Ava_2766 conserved hypothetical protein
MSSITVEILIILVLIIANGIFSMSEMAIVSARKVRLQQLANQGDPKARAA
LKLAESPNNFLSTVQIGISLIGILTGAFGGATIASRLAVYVRLVPFLSPY
SEPVSFGIVVLIITYLSLIIGELVPKRLALNSPERIAAVVAIPMRALATL
ASPAVHLLSASTDMVLRTMGITPSLEPQVTEEEIKILIEQGTEAGTFEEA
EQDMVERVFRLGDRPVSSFMTPRPDIVWLDLEDTTEENRQKMSENGYSRY
PVCQGGLDNVLGIIPVTDLLARSLRGDSLDLTVGLRQPVFVPESTRGLKV
LELFKQTITHMALVVDEYGVIQGLVTLNDIMSEIVGDVPSADGEEEPQAV
QREDGSWLLDGMLAVEDFFELFDLEEWDFEERGSYQTLGGFVITHLGRIP
AAADHFEWQGMRIEVMDMDGNRVDKVLVVPTTNKVADTQTAD
>Ava_2183 Peptidase C14, caspase catalytic subunit p20
MSRDALVVGINKYSFAGLSGLKSPAKDAEAIAKRLSEAGNFHVKRSPQFL
DPFEDHAPRVAANQELKLADLEKALEELFYPEGRSIPDTALFYFSGHGLR
KEGRIKEGYLATSDTNPDVGNWGLRLKWLRELLQESPVRQQIIWLDCCYS
GELLNFTEADPGDRGNTRDRCFIAASREFEPAYENVGGASSLLTSAILRG
FDLADVSNQWVTNETLVAFLREELKTAPQKFISNNLGCINIIFRAETTKK
QAVQPVTPTANQPNQTAKLAEQLRAWFTALRYDFEQYEINNETYFEWIIN
VINPVGRKRYNRILVRGITGEAEMKDVAALRQSVNAQRTDIGWLITNRRV
SRAAKDEVSKQENQDLFCYTLDELLDRDADFSCYLNWLEDEIKQRGIDRT
YVPLACTKEEFDPISKQKMGMSRYAQADGWIDGYVDRWLDDPAKEHLSIL
GEFGTGKTWFGLHYAWTTLQRYRDAQRRGVERPRLPLVIPLRDYAKAVSV
ESLFSEFFFRKHEIPLPGYSAFEQLNRMGKVLLIFDGFDEMAARVNRQEM
INNFWELAKVVVPGAKVILTCRTEHFPEAKEGRALLSAELQASTINLPLE
TPRFEVLELEKFDDEQIRQVFGFRAEAETVEQVMNNSTLRDLARRPVMTE
LIIEALPEIEAGKPVDISRVYLYAVRHKMERDIKAERTFTSLADKLYFLC
ELSWEMLSQDQMSMNYKEFPDQIRRLFGSVVEEEKDLDHWHYDMMGQTML
IRNAEGDYSPAHRSLLEFFAAYKLVAELGVLAPDFTELAQAQSCLDKTVP
ARDYTWSEYFQRQCQEDGQLLPIAPIKSFTNASLERLCNYLGTSPLAKAL
LDLAVPMLDEQVFQQILLRLIQATRGKTQAQVGYIGGNLVKLLIERNPYS
LENCDLSNTVIPDVNFANASFRWVNLMGTNLTNSVFAPVLGVVYSVAFSP
DGKKLVIGDSKGTIQVWETFSGRVLLFLQGHENGVKSVAFSPDGGRIVSG
SNDNTIRLWDVNGQPIGQPFRGHEGGVNSVAFSPDGGRIVSGSNDNTIRL
WDVNGQPIGQPFRGHEGGVNSVAFSPDGGRIVSGSNDNTIRLWDVNGQPI
GQPFRGHEGGVNSVAFSPDGGRIVSGSYDNTVRLWDVNGQPIGQPFRGHE
GGVNSVAFSPDGGRIVSGSNDNTIRLWDMNGQPIGQPFRGHEDMVYSVAF
SPDGGRIVSGSYDKTIRLWDMNGQPIGQPFRGHEDMVLSVAFSPDGGRIV
SGSYDNTVRLWEANGQSIGQPFRGHENLVNSVAFSPDGGRIVSGSNDNTI
RLWDVNGQPIGQPFRGHEGRVYSVAFSPDGGRIVSGSNDNTIRLWDVNGQ
PIGQPFRGHENLVYSVAFSPDGGRIVSGSWDNTIRLWDVNGQPIGRPFRG
HENVVYSVAFSPDGGRIVSGSWDNTIRLWDVNGQSIGQPFRGHEDWVRSV
AFSPDGGRIVSGSDDKTLRLWDVNGQPIGQPFRGHEDLVRSVAFSPDGER
IVSGSYDETIRIWDAATGDCLRVISYKLCAGLNITGVTGLTSAQRVALKL
MGAIDNL
>Ava_3473 TPR repeat
MLEHLQLTDILATVAVLVSITFLIYFAGKTLVTSNFFQKGVNLYQQKDYP
GAEAAFRKVIAINSTNDVVRLFLGDVLREQGKVAEATELFQEVINRSPKN
PQGYLRLATILMQQEKQAEAKTNLQKAQELLQKQRQPETAKRVAKLLEQM
NTKTI
>Ava_0318 ABC-1
MRVNTLKDTVNFPEEEADKGVGQYQPGQLKRYNPDAIARYYRFRPWLAWG
RLFKITWSFAMFVLGLKWDEWQNQVEQNKGKRATQLRQLLTRLGPTFIKV
GQALSTRPDLIRKDFLEELIKLQDQLPAFDNAIAYHIIESELDRPISEVY
SELSPAPIAAASLGQVYRGRLISGEEVAVKVQRPHLRPTLTLDLYLMRWA
ASWLAPWLPLNLGHDLTLIVDEFGIKLFEEIDYLNEGRNAEKFAHNFRND
SQVKVPSIYWRFTTSRVLTLEWINGFKLTDTESIRKAGLDPEAIISIGVT
SGLQQLLEYGFFHADPHPGNLFAMPDGRMAYIDFGMMDQLEEKSKESLVD
ALVHLVNKDYSDLALDFVNLGFLTPDTNIVPIIPALETVLGNAIGKNVSD
FNFKTITDQFSELMYEYPFRVPAKFALIIRSLVTQEGIALSLNPNFKIVD
VGYPYVARRLLTGESPELRRRLLNVLFKDGRFQWQRLENLIAIARSDNNF
DVLPTARMGLQFLLSDEGQFLRRQLVLALTEDDRLHTEEVQRLWSLVKDD
LQPNRILNVAIGLLTDLSREGVAAILPKATSFGFFDEAQSKN
>Ava_4310 Peptidase M16-like
MTQTVKPSLAQSPIHRTVLDNGIVLLVAENPAADIIAGRIFIRAGSCYEK
REQAGLAHLLAALMTKGCEGLSSLEIAEQVESVGASLSADTSTDYFLVSL
KTVTSDFPEILALAGRILRSPTFPETQIELERRLALQDIRSQKEQPFTLA
FEQMRQVMYQNHPYAMSVLGDETTLNSITRADLVEYHQTYFRPDNLVISV
AGRITLQEVVALVEQVFGDWQTPSVAPAVVNLPKISVNPQHRLKPVQTQQ
SIVMLGYLGPSVSSPDYASLKLLSTYLGNGLSSRLFVELREKRGLAYEVS
AFYPTRLYPASFVVYMGTAPENTSIALEGLRTEVELLCSKEVSTTNLQAA
KNKILGQYALGKQTNGQIAQIYGWYEILGLGIDFDREFQELIAAVTAQDA
LTSAQQYLQQPYVSLVGQEEAINRAIN
>Ava_2184 Peptidase C14, caspase catalytic subunit p20
MSRDALVVGINTYNFPQLPRLRSPAKDAEGIAQRLSEGDYFRVWRLPEYL
DPFENNAPRVAQNQEVILRDLKTALERLFLPEGNNYPDTALFYFSGHGIR
STGRIKEGFLATSDTNPDDDKWGLSLQWLRRVLEESPIRQQVIWLDCCYS
GELLNFDEANPGDKGKARDRCFIAASREFEESWTDPNSDYSVLTKVLLKG
LEPQRQQDGLVDNYRLIDFINQNLKQEKQRPVFYNSGLPITLVKSKTTVT
QNTDNSLQPNKNPYRGLAAFDFKPEDIQFFYGRTALTDELLEKLWQQNFL
AVLGASGSGKSSVVRAGLLNAIQQGERRDTANWHILPVITPGNDPLASLA
AAFINPNEPDGRKSRLCQTYSRELQEEGAKALAKLVADYQPNPVVLVIDQ
FEEVFTLCNDSQERQKFFACLMDVLTSPNETENNTAALRVVITMRADFLG
KCLEQDYAGLAENIKNHIATVTPLNQEELRDAIIKPAELVGLSVEEALVR
KMIADVQGSPASLPLLQYALTELWELWHQDWQQRGTTAGKTLTLTNYVQI
GEVKGALEKQADKVYNTLSEKEQAVAKRIFLELTQLGEGTEDTRRRVLKS
ELINDQHPEELLDQVICKLADARLVVTNNIVLNVTTVVRTEKMGDTTSQP
VEVVIDVAHEALIRHWTSLRLWLDENRDALRTERKLQTAAKDWQDHQQDS
AYLWLGARLAEAQEYEQKYFRLGRLVPLSKKFIEASKTEQQRLEREKDEQ
IQALNQALTESQLREQAIRILELLQIQPQDGVESAIQAIEENLKQLPANI
LTVVQNNLLQAMAIVNLPNIIQGHESGVNSVAFSPDGQRIVSGSGDKTLR
LWDVNGQPIGQPLIGHEGAVKSVAFSPDGQRIVSGSGDKTLRLWNVNGQP
IGQPLIGHEGEVKSVAFSPDGQRIVSGSWDNTLRLWNVNGQPIGQPLIGH
EGAVNSVAFSPDGQCIVSGSWDNTLRLWDVNGQPIGQPLIGHESGVYSVA
FSPDGQRIVSGSGDNTLRLWDVNGQSIGQPLIGHESGVYSVAFSPDGQRI
VSGSWDNTLRLWDVNGQSIGQPLIGHESGVYSVAFSPDGQRIVSGSWDNT
LRLWDVNGQPIGQPLMGHKAAVISVAFSPDGQRIVSGSADNKLKLWRGSW
QSWLKVCCDRLAYYLISHNPENNIMEGSCAVCAKYAWSSAEIAKILQRQA
YDLAFQGKVELAITKFQQAIEYDPSLKLDPETEAHKWSRR
>Ava_1679 RNA-binding region RNP-1
MTIYVGNLSYRATEADLKAVFADYGEVKRVVLPTDRETGRMRGFAFVEMN
EDAQEDAAITELDGAEWMGRQLRVNKAKPREDDRRGSWGKKQDY
>Ava_1352 von Willebrand factor, type A
MTQTIERQAGGLYAQTPEQQQIAFPLKHTEVQAKIAGNISRVEVTQSFEN
PFTTTLEAVYIFPLPDEAAVDDMLIRIGDKTIKGSIKKRQEAQQIYEQAK
EQGRTAGLLEQERDNIFTQSLANIQPGEQIDVIIRYSESLKFTAGNYEFV
FPMVVAPRYVPGIPIEGNAVGVGSATAPMTQNQDTDIVPDGSRLNAPILP
SGMRSPHDINVTIEIDAGVEVQNIQSPSHQVQISYAEKRVLVKLAGGDTI
PNKDLILRYQVAGESTQATVLSQADERGGHFALYLIPALQYRQNQIVPKD
VVFLIDTSGSQMGAPLMQCQELMRRFINGLNPDDTFSIIDFSDTTRQLSP
VPLANNSQNRTRAINYINRLTANGGTEMLRGIRAVLNFPVTDSGRLRSIV
LLTDGYIGNENQILAEVQQHLQAGNRLYSFGAGSSVNRFLLNRIAELGRG
IAQIIRHDEPTDEVVDKFYRQINNPVLANINLQWEGDGNAPIIYPATPPD
LFAEQPLVLFGKKPDARGGKLHITGIVAGGTRYQNTIKLDFEETGNPAIA
QLWGRSRIKELMNKMVSGDTKLGVEGVTDTALTYQLLSQYTAFVAVSDDV
RVDSRQGSVSVQVPVEMAEGMSYQGIFGSAVTAAAPPPIMANMVSPAPEF
LQRKRSLASPSAPQAESGFSELPSLTELSEDAKIELSFSDLDYTFREPPP
KPRQQASSGWNKYFEDIDSSQEITPVGILQVVNVIGLNYQMIVILTRYLE
SLPLHTDCSGDLVLELQINKGRVRQVLLDEEASTFKEQSVIDIIRRSLIS
WQPPQMLTATVSLTLRLQA
>Ava_3036 Serine/Threonine protein kinase
MNRFSGNITNLHQYNLQQNQLAQMCGSKELFCDRYQILRILGRGGFGITF
LAGDAVLPGNPLCVIKQLCPKVTSAKSWQNACQRFQKEAKTLAKLGSHSQ
IPMLLNYFEGDGELYLVQEYVRGYTLSQEVRHHGIKTEAEVKRFLLELLP
ILQYLYQNQVIHRDIKPQNIVRCADDGRIVLIDFGAVKEKLSDVGLSSFS
QNAHTNFVGTTGFAPPEQFSLRPVYASDIYALGMTCIYLLTGKAPLEMEN
NSKTGEICWHKYVKVSDDFAKIIRKMVKFSLDERFQKPQDVIEALNKERD
LPNFSDCLTSQPLSNKRQTKVETSEKYVPSVTRTAMAIREWKAKLQQKQL
QRSLQHFSSV
>Ava_2427 O-methyltransferase, family 3
MQAIGSDKLQINLMTTRTLGITQNLYDYLLSISLREPEILTKLRQETALQ
PMGRMQIAPEQGQFMALLVQLLGAKKTLEVGVFTGYSSLIVALALPPDGK
LVACDVSEEFTAIAQRYWQQAGVTHKIDFHLAPALETLDKLLVAGEAETF
DFAFIDADKSNYDNYYERSLQLIRSGGVIAIDNVLWSGKVADPEIQDNRT
KKIRAFNHKLLQDQRITLSLIPIGDGLTLVRKN
>Ava_2192 conserved hypothetical protein
MSISNRERVGRALELLRDGLYPFVEREMRSVFGDKWLVAATSFVPEDHTL
RRTVQQILKQDESAILKLMSGQWRDVFKKTLGNAERSLVGELIDTRNSWA
HNKPFSTDDAYRALDSVSRLLSSISAPEADEVDKQKQELLRVRFTEQARR
ETRRATLQPVEGNPTGGLKPWREIVTPHPDVASGRYQEAEFAADLWQVYL
DEGSDEYRIPTEFFSRTFLTEGLKQLLTNALVRLAGNGGDPVIELQTNFG
GGKTHAMLALYHLFMGVPVSKLPGLEPVLAAANVSPPAKVNTAVLVGNKI
SPGEAQTKKDGTVIRTLWGELAWQLGGREAYEMIRQADETSTNPGDNLRL
LFNRYAPCLILIDEWVAYARQLHEINDLPGGSFDTHFTFAQTLSESAKNA
KKTLLVVSIPASDNEIGGDRGKAALSRLKNAIGRVESPWRPASADESFEI
VRRRLFQPIIEESGFVARDAVIRAFGEMYQNQSQEFPSECREASYKRRLE
NAYPIHPELFDRLYNDWSTLDKFQRTRGVLRLMAKVIHSLWEGQDKSLLI
MPASVPMDDGQVQTELTRYLEDHWVPVIEKDVDGDNSLPLACDRQNPNLG
RYSACRRVARTIYLGSAPTLRAANRGVEDNRIKLGCVQPGESVATFGDAL
RRLTDQATHLYIDNTRYWFSTQPSVTRLAQDRAEQVQQDQDKVWDEIIRR
LRADKQRGEFTAVHIAPDSSADVPDEMNARLVVLGPQHPHKNKENNTPAL
TRGNEILHNKGASPRYSKNMLLFLAPDKAKLELLEQSVCQYLAWNSIVSD
KEALNLDVFQSNQATTKQQQTDKDVKSLIQEAYIWLIVPSQPDPQGEIEW
QEIRLQKQDSPILQASRKAVHEEHLIPNYAASRLRLEALDPYLWRDVNHL
DLKKLWEYLAYYLYLPRLKNQQVLLQAIAEGVASLLLNDNFAYATGYDES
KGRYLGLKAAEHITVTLSSQNLIVKPEIAQRQMEADAAAITKLPIPGVKE
GTGSYKTEREKTIDTVITPDDSSNGKVTVIIPPKRFYGSVKLDALRLQRD
VPQLANEVIQHFSSLMDAEVEITLEIQVKAPEGIPENVIRTVTENCRVLK
FSHQEFEPE
>Ava_4094 Peptidase M20D, amidohydrolase
MAHVLTSEKVFPRMVELRRYLHSHPELAFEEKKTASIVIDELKRLGIPFW
YGGVGSGIIGKLINAGQRAPTIALRADMDALPGQENTGLPFASLHPGKMH
ACGHDGHMAMVLGAAALLKENPPPGNVVFIFQPAEEKGAGAKVMIQSGAL
EGVNAIFGGHVTRHYQVGEIMVAKGVITAQSDGFTIRVKGRGGHGARPHE
AVDAVVVAGLLIMAVQTLVSREINPAYPSVVTIGKVEAGSAGNVIAEEAI
LEGTIRTTNLDVQNHIIDGLKRIATAVGELHNARVEIEIRHGYPPVINTG
KETEIARRAIVDILGSKGLVTMDYPSMGAEDFSFYLLHVPGCYVRFGACQ
QGCENIPLHSPSFDFDEEALKVGAAFFDRVVREAIAEYADNS
>Ava_0911 Ribosomal-protein-alanine acetyltransferase
MISSKLKLQSLTSKDLSAILELDQACFGGLWTMEGYQRELDSPNSDLLGL
FSPSSPIKLLGMGCFWSIVGEAHITILAVHPQYHHQGLGQALLYFLLRTA
SDRGLERATLEVRASNEAAISLYQKFGFQTAGRRRAYYQDNGEDALILWL
PDLQYPQFQETLAHWKTIIQDGLQKYSWSIVNSR
>Ava_1371 Flavin reductase-like, FMN-binding
MVALTEDTQANANRGRLTVQTVEIAAETTAIRCLDWDRERFDIEFGLRNG
TTYNSFLIQGEKIALVDTSHRKFEKLYLEIVAGLIDPNTIDYLIISHTEP
DHSGLVKDILQLAPSITIVGAKVAIQFLENMVHQPFKSLQVKSGERLDLG
NGHNLEFVSAPNLHWPDTILTYDHKTGILYTCDVFGMHYCDDQTYDENFF
AIEEDFKYYYDCLMGPNARSVLAALKRIENLTINTVATGHGPLLQNHISE
WLGKYQNWSLEQAKTETLVALFYAEDYGYSEHLVHTLGHGCTKTGVAVEL
IDLNTAEPQEVRELVAQASGLVIAMPSQYSLTAQAALNTILAAVHHKQAI
GLLESGGGEDEPVFPLRNKFQELGLVEAFPPILIKEAPTQTTEQLCEEAG
TDIGQWLTRDRTIKQIKSINTDLEKALGRISTGLYIITTKKGEIQGAMFA
SWVTQASLNPLGVAIAVSKERAIESLMHVGDRFVLNVLEEDNYQGLMKHF
LKRFAPGADRFAGIKTYPATDGSPILAESLAYTECEITSRIDCGDHWIIY
STVQVGRVANVHAMTAVHHRKVGNHY
>Ava_3070 GCN5-related N-acetyltransferase
MPTLKTLQTGDEVLLENFLMQHVDTSMFLRSNSREGGLINQGERFQGTYV
AAIVDKNIVAIAAHYWNGMIIVQAPVYLAEVVQQTVEQSGLAVSGIAGPA
TQVAAAKQVLGLADRPTQMDEQEKLLSLALQDLQVPPALASGQVQCRLPD
VQELELLSEWSAAFHVEALGKVATPDLLSACRAEIAARQSTAKHWLLIAE
NTPVAYTAFNARLPDIVQIGGVWTPPKLRGKGYAKSVVAGSLLAAKSQGV
ERAILFTSQENYAAQAAYQGIGFRSTGEEFGLVLYNL
>Ava_1199 GCN5-related N-acetyltransferase
MTVGNDLTVRFAELADSETLFGLIKALAEYEKLTHAVTGNVLALQEHLFG
SQRYVEAILAESAGQAVGFALFFHNYSTFLTKPGIYLEDLFVLPEYRRQG
IGKALLTKLAQIAVERDCGRLEWSVLDWNESAQAFYRHMGATILDDWRIC
RVTEEAISQLAEGK
>Ava_1995 Serine/Threonine protein kinase
MQPPILIGTVLQNRYRIVQILGQGGFGRTYLAEDQRRFNELCAIKELIPT
ATGTVAWEKAQELFHREAAILYQIENPQIPKFREKFEQDQRLFLVQDYVA
GKTYWDILTERQATGKAFTELEVLQLMRSLLPVLEHIHGRGIIHRDISPD
NIILRDSDSQPVLIDFGVVKELATRLQSPEMTTPVTSVGKLGYSPSEQMQ
TGRAYPSSDLYALAVTAIVLLTGKEASELFDENQLSWTWQRWVRVHPVLA
QIINRMLSHAPGDRFQTAAEVAQALQAIDQPNVGVSLPNVSRVQTIAVGR
RPDFAPPPPAPNKSEPVIPPSRSSSVLDSPLALGAIGTAVVILAGVGSWA
LVSSIRSHSSSQPGETPLPQTFPSPVVTGGTTFSTETPEPVVLNKRLNLG
NSNTATVADTIQANQIIRYSFFGNAGDKVTTFIDQGEGVSLTIVGRNQQP
IDSSAQQVTNYVGTLPNTERYTIQLTLASGVAASDYSLNVAIEKAIPATP
TPTPTPTETPTPTLTPTETPTPTSTPTETPTPTPTPTETPTPTLTPTETP
TSVPEPQSLP
>Ava_4751 2-nitropropane dioxygenase, NPD
MLTQGTSIPTLNPPDSIAFDHTSIQQKLLNLHQPCYVFLKNNQIGISHQP
IEGIQAQMMVNAILPQKLGDRSFLDFHGVNYAYAAGAMAGGIASEILVIT
LGKAGFLASFGSAGLVPQRVEKAILQIQQALPTGPYAFNLIHSPSEEALE
QGAVELYLKHGVRTVEASAYMDLTPHIVRYRAAGLHLNSQGEIQTHNKVI
AKISRSEVAQKFLQPAPLKFLKSLVEQGYITPLQAALAEKVPLADDITVE
ADSGGHTDNRSLVCLLPSIIALRDEIQQQYHYEHPVRVGAAGGIGTPEAA
LAAFMMGASYIVTGSINHSCIEAGTSDRTKQLLAQASMADMVMAPCADMF
EMGVKVQLLKRGSLFPMRAQKLYDFYKTYGGIEEIPPEIRQTLETQIFRK
NLDQVWDETVAYFSQRDPEQIARANNNPKRKMALIFRWYLGLSSRWSTSG
DEQRQMDYQIWCGPAMGSFNEWVRGSYLELPENRTVVDVATHIMQGAAYL
YRLQNLKLQGLNLPSQYCYYRPFHI
>Ava_2109 Serine/Threonine protein kinase with Chase2 sensor
MAEEPTEKVNKKNGSTANIASNKTTKATLTAAARQSQRLVRLGNILTVAL
TMGAALLTTSGLSLVQLLENQAISAFFQIRGPLIPPEDIVILAIDDQSIS
VPEQYYRTYPQKYAYLEPLSKFPFKRVAYAQVVEKLMQAGAKSVALDVVF
DLPGSYGNEDDRQLQAVLQKYSGKVTLAAQYETSQSHQGFFTQLRLPYEK
FRNDEASIGTVNFPVEVDGKIHRLGSELTKILGEVDALTDKIPSFDQAVL
ATAKVKYPRSPGERIYFWGPAGTFATIPFWHVLDPQNWNTYLQQGQVFQN
KIVIIGATAQLANDYHAVAVSSAWLSSEKMSGVEIHANAIATLMEGKAIA
QGIPSQSLQALFVLVLVGGTSFIITRSKSGLQRFILSLAVASGWGSISYI
SFVYAQLLFPTAIPMLAIAFCGISYLGTEVAKEKIRTRQIVDIFQKYKTS
PVVQEIISQQQELQDLLQQRNVAVSGKILSGRYKILKVLGYGGFSETYIA
EDTQRPGNPQCVVKQLKPVNTQAKGLQLARRLFNLEAQSLEKLGSYQQIP
QLLAYFEQEAEFYLVQEYIIGHPLNQELPSGKSISEAETVAIIREILEIL
VFVHENGVIHRDIKPSNIIRRHSDHKLVLIDFGAVKEISLPQANNQEPLP
FTIGIGTKGYAPSEQCFGRPQYNSDIYAVGMIGIKALTGIAPHDLPRDEN
EEIKWRDKALVSQGFAQILSKMVREDFKQRYQSASAVLAALNKLANTPDK
NFGQQNDSSMNTIISIDESDFPTAHWP
>Ava_1009 TPR repeat
MIIKLISVVLSLLVLFGWGTPVMAQSPQPSITQEQIAQGEEWKNQAFKAT
NKGDFVTAEKYWTKIIDNFPTNAGAWSNRGNSRVSQNKLQAALTDFNKAI
ELAPNVTDPYLNRGTALEGLGKWSEAIADYNHVLDLDPNDAMAYNNRGNA
KAGLGKWSEAIADYKKSFEIAPNFAFARANYAIALYETGQKDEAIREMRN
IVRKYPNFADVRAALTAIYWVNGQQGEAESNWVAAYGLDTRYKDMNWVKN
IRRWPPSMVSALDKFLQIK
>Ava_1078 Cyclopropane-fatty-acyl-phospholipid synthase
MNWLFSTLVFFLTLLAAGIALYLITARRYQSSNSVANSYDQWTEDGILEF
YWGEHIHLGHYGSPPQRKDFLAAKSDFVHEMVRWGGLDKLPPGTTLLDVG
CGIGGSSRILARDYGFAVTGITISPQQVQRAQELTPQELNAQFLVDDAMA
LSSPDGSFDVVWSIEAGPHMPDKAIFAKELMRVLKPGGIMVLADWNQRDD
RQKPLNIWEKPVMQQLLDQWSHPAFSSIEGFSELLAATGLVEGEVITADW
TKQTLPSWLDSIWQGIVRPEGLVRFGLSGFIKSLREVPTLLLMRLAFGMG
LCRFGMFRALRANTVTPPAEQTTGIKVAQR
>Ava_4124 Phosphoribosyltransferase
MWQKFYNRTEAGKLLAARLTEYANRPDVLVLGLPRGGVPVAFEVAKALDA
PLDVCLVRKLGVPGHKELAMGAIATGGVRVLNENVVDWLRIPQATIDQVA
AIEMRELERRNIAYRGNRPLPKVKNHTIILVDDGIATGATIRAAIATLKQ
QQPRELVVAVPVAAASTCEELQAEVDKIVCVMMPEDLYAIGIWYENFGQT
TDAEVCELLTRQKLLVANDL
>Ava_2008 putative transmembrane permease
MKYQARFTSPWAYIPILYFASGVPYVIINTVSVIFYKKLGIANTQIAFWT
SFLYLPWVIKMFWGPIVDVYSTKRQWILYTQFAMCACLAVVAFCLQLPNF
FFISLAALTIGAFISANYDIATDGFYLLALNPDQQAFFVGVRSLFYRLAV
IFGSGFLVVFAGQLETYLNDIPLSWTLAIGSATVILAMIYVSDRLILPFP
ESDNPRELQASSKIPFLKIITSYFQQPKILAILAFILLYRFGEAMLLKIA
SLFLLDKPELGGLGLTTSDVGLVYGTFGVISLICGGILGGLAIAKYGLKK
CLLPMALALNLPDLFYVYMAYNQPPLTLVYPLVSLEQFGYGFGFTAFSVY
LMYISQGEYKTSHFAISTGIMALGMMLPGIVSGYLQNSLGYPLFFVLVCV
LTIPGMISLFFIPLPEETNHQTS
>Ava_2526 conserved hypothetical protein
MLANILALVVGLGSLAIYIAAFFFPEIHRKNDFIWSGVGLFYALVLWIFA
PKISGGLLLGHVASVALLVWFGWQTLSLRRQVTPLAQQTTVPSPEVIQST
IQEQAAKFSLQEKVAQLQQDIGSTFSGLKNKVQQTGNQKVTTAPVQTTTK
PSVEIVDKTEKSTEETVTTTETSSPTTTTEPPAAVLSTDTDSTTEIVPEV
IPPNPPAPELVEAAQPHPESEDKEPIPVEEIAPDAVLAPPAEAPPESLPP
T
>Ava_3912 Nitrogenase cofactor biosynthesis protein NifB
MTPPVTGSSVTESTPTKAKSGGCGCDTSTTVEMDEKLQERIAKHPCYSEE
AHHHYARMHVAVAPACNIQCNYCNRKYDCANESRPGVVSELLTPEEAAHK
VLVIAGKIPQMTVLGIAGPGDPLANPEKTFRTFELIADKAPDIKLCLSTN
GLMLPEYVDRIKQLNIDHVTITLNTIDPEIGAQIYSWVHYKRRRYRGAEG
ARILLEKQMEGLQALREADILCKVNSVMIPGINDQHLVEVNKMIREQGAF
LHNIMPLISAPEHGTHFGLTGQRGPSQKELKSVQDQCSGNMKMMRHCRQC
RADAVGLLGEDRSQEFTKDKFLEMAPEYDFDKRQEVHEGIEKFRVELKVA
KEKVLAGKEKTASNPKILVAIATKGGGLVNQHFGHAKEFQVYEVDGSEVR
FVSHRKVDHYCQGGYGEEATFDNIVKTIADCKAVLVSKIGESPKEKLLQA
GIQTVEAYDVIEKVALEFYEQWNKG
>Ava_4274 Auxin Efflux Carrier
MTETLFQAYTPLILWIGLGLLIFRFLPTWLPQFLGRGLYWVGVPLELVAL
ARKGSQNELGGAGSTPILASLITFLALLLGLVAALLVWWGWQQISPHLFQ
PDLSESVSRPWLDSSTKGSFLLAAVLGNTGFVGLAIAPFLIHADSLNWVV
LFSITHNVIGPYGIGVLIASYFSHTKSNNRWWMQLWDLLTVPPLWAFLIG
TLTQSIKLPDFVESGLQGSVDIVIACAFLLTGIRLAQLQRWQSFQLALIP
TVLRVVITPLLVGLVTTVFLGLSGDRRLAMVLMSGMPSAFAGLILAEEYN
LNRDLIASSIMLSTLLLLLILPLWIQVFG
>Ava_4784 conserved hypothetical protein
MSALPFVRVRQHVNPLAQKYLTPANPLEWEKVYSSPHQPLHLDIGCARGR
FVLQMAQVEPRWNFLGLEIREPLVIEANQFRSQLGLSNLHYLYCNANNSL
QPLLSSLPTGILQRVTIQFPDPWFKTRHAKRRVVQPELVQDIANYLAVGG
VVFLQSDMEFVAVEMCDRFAANPAFKKVGSGEWLSENPLPVATERETTTQ
NRGEPVYRALFERIS
>Ava_3543 conserved hypothetical protein
MAQKLPIDSNINSPNQVSIFRTAKPSSEFYTVFSEAEVLGIVQALESRWE
IPLKYSYKGQGAKIWDSFYLKYIVPTWYRTSNVEIDLLKSNFAYLNGDYQ
QCEKVNIIDVGAGNSYPVKAFISRLQKIDKIDKYIALDISDELLKVSSKN
FKKWFPLIDYTNYAIDIENNKIPENIFKNQLENTANIFLHLGVTIANHHD
RHQVFQNFRDSMAQNDLLVFTNEIGSNSQWDGIARGGCKYHADQVYGWLT
NNLGMEAEDCELVRRYDPAKDSIVANLKLRQNYTINFSYLGIDKSLEISA
GEEITIWRHHKFEIPTLIQEIEQAGLKLVHYSTNKYSSHLMVICETASK
>Ava_1980 Serine/Threonine protein kinase and Signal Transduction Histidine Kinase (STHK) with GAF sensor
MIVLSDPIFPLSGYRITEQLYFGSKTIVYRGLRKQDQKPVVIKLMRNEYP
TFQEIAQFRNQYTITKDLDIPGIVKPLSLETYRNSYALVMEDFGGLSLKD
WGQISKDSNEYGTVLKRFFHIAIAIASTLESLHRSRIIHKDIKPANILIN
PTTLEIRVTDFSIATLLPREVQVLTNHNVLEGTLAYLSPEQTGRMNRGID
YRSDFYSLGVTFFELLTGQLPFMTIEPMELVYCHIAKQPPKASDINPKIP
TILSDIVSKLMAKNAEDRYQSAYGLAYDLEICHKQWQETGNITSFELATK
DVSDRFLIPEKLYGRQHEVATLLAAFERATQGTTEMILVTGFSGIGKTAV
VNEVHKPIVRQQSYFIKGKFDQFQRDIPLSGLVQAFRDLIGQILLETDAQ
IQRWRSKILSALGEQAQVIIDVIPELAEVIGQQSEVSELSGSSAQNRFNL
LFQRFIQVFSSKHHPLVIFLDDLQWADTASLKFIQLLMNQTNLLNKNPAA
PPVETKENLWIPVFNGEPWTGSELQRNNEEEHSLFLIGAYRDNEVSQAHP
LSLTIKEIEKVRGNITRISLSPLNQTNLNALISDTLRCQEEITIPLSQMV
FVKTQGNPFFATQFIKSLHQDGVIKFNFDVAYWQYDIAKIQELSLTDDVV
EFMGQQIEKLPTTTQNILKLAACIGNEFDLKTLAIAHEKSVSGTASDLWA
ALHEGIILPQTELYKFFQEDNNSVAIFGNKKPYKSRINHCRQVPTYKFIH
DRVQQAAYSLIPEQLRKQIHLKIGLLLLDKIPVAEREDKVFQLVNQLNVA
VELITHQSKRDELAQMNLIAGRKALASTAYTAAIKYLTTGIELLAGDSWE
NQYQLTLALYETAAEAAYLAGNFERMESLIQVVLLQAKTLLEQVKIYEIK
IQAYGAQNQAIEAVNTALTFLKLLGVEFPENPSQSDFQLAMAGITTNLNG
KLIENLINLPEMRDKKSLAAIQILSSASGLVYQAVPQLFPLIVLKQVELS
LKDGNAHLSAYAYVLYGLMLCGIIGDIESGYQFGKLALRLTDKFNSEELK
AKITEIFYATISHWKEHTKEILSPLLEGYSAALQTGDLEFASYCLYTHSY
ASYFIGTELTGLASEMASYSHAISQIKQERVFNWHAIYRQTVLNLIQANE
NPCDLIGEAYNEKIMRHVHQISNDGTGLVYLHFNKLFLCYVFGEFKQAIK
NSYIAENYLESGVGNHVFSLFYFYDSLARLAVYNNLIESEKKEVLDKVQA
NQNKIQHWVKYAPMNYLHKFYLVEAEQYRVKEKYLEAIDYYDRSIYLAKE
NEYINEEALANELAARFYLQWGKEKIAQTYLTDAYYGYSRWGAKAKVDDL
AKHYPQLLAPILQQEKLTLQASDRITSATYQSIPDRSIQQTVIGSKTSIS
DSLDLASVIKASQALSGEIELEQLLSTLMQVVMENAGASKSALILTASDT
SELVVTTVSHSTNSASVFTSFPANILDSSFELPVSLIKYVKRNGEIFVID
NAKTVDFLASDRYILREQPKSLLCIPIINQGKLLGILYLENNLTTGAFTR
DRVEVLKLLTTQAAISIENAMLYKNLAQANQNLEDYNHRLEEKVEARTQE
LNDKNHCLQQTLQELQRTQTQLIQSEKMSSLGQMIAGIAHEINNPINFIH
GNIAHASEYVQDLLNLVTIYQRECPNYSDILAEEYAQIDIDFLAEDLPKI
LNSMRVGSSRIRNIVLSLRNFSRLDESDMKPVDIHEGIDNTLMILHHRLK
ENSERPEITLIKEYAQLPLVNCYAGQLNQVFMNIISNAIDALEESVVSRQ
LSIGKTNDMDNMTTAHPTIRICTELLGSTTLRIRIADNGFGMTEAVQQKI
FDPFYTTKPVGSGTGLGLSISYQVVVEKHKGQLTCYSALGQGTEFVIEIP
I
>Ava_0920 Peptidases M20 and M28
MNLKARLHNHLIQVARERDPYLATAGHFFVQEYIRQEFAQWGSVEIHTFQ
VGNKSFNNLILNLPSQSIGKKQELPPILIGAHYDGVPGTSGADDNATGVV
VLLELARKFAAAPAKYPLRLVAFDMEEYGLLGSTDYAGLLRQQQQPLRLM
MSLEMLGYRDCTPGSQRYPAPLEKFYPNTGDFIALIGNLRTIPDLIGMSR
HIRKAGISSQWLPVPNRGLIVPQTRLSDHAPFWDAGYPAIMVTDTAFLRN
PHYHKPSDAIATLDLDFLTGVCEGLEISIRRL
>Ava_2296 conserved hypothetical protein
MYLTWLDSNSWLLELSNQRILIDPWLVDALSFGNLDWLFKGYRPQERTIP
ENIDLILLSQGLEDHAHPPTLKQLNHNIPVVASPNAAKVVQALGYKSVTT
LAHGESFTFNNQIEIRAFPGSPIGPTVVENSYLVKELATSLTLYYEPHGY
HSPQLKQFAPVDVVITPTVDLALPLLGPIIKGYKSALEVAQWLEPQVMLP
TAAGGDVIFEGLLTKVLKTEGSVADLRLLFKKNNLLTQVLEPNPGDRLEL
QLAKRTSGTPK
>Ava_4165 DEAD/DEAH box helicase-like
MSESYKITLKSVYSQTVPTPDGVQIPENWSLSWHQAATLQALRNPNIDVV
FNTAMTGDGKSLAAYLEVLQGEFSAIGLYPTNELARDQEIQIQGYIEVFK
PENQPRVVRLSGADLEVYAENEGLKKGQAIGTLTSQREALLTNPDIFHYL
HRGAYIIRSDSPDKLWGKIDKDFDLFIFDEFHVFAAPQIASVINTMLLIR
CTNRRKKFLFLSATPDSNLIDRLKLAGFRCQEINPIAQEKYQFPDNPELE
QQLKTQGWRQVARTISLDFIPLEPSFKASEVWLKENSNLILEQFQQYPGS
KGAIILNSIAAVKRLTPFFQEILQPHGLQVGENTGLSGKAEKERSLSADL
VLGTSTIDVGVDFKINFLIFESSDAGNFIQRLGRLGRHDGYEKDGAKIKF
QNFTAFALVPNFLVERLFSGDTPPLEVNKICDRPTFHKIIADKYRQINDF
RGYYKRWGAVQSFYLCWQLGDRTIKQQYAQSREQLLKACEEVFDTSLKSV
AGRVAGWAKDWQALSGKQGNPIGDDAASFRGSSPLQCGLYDLTEENEADR
FKTYDLPGILGNLEIEVWTEAAFTRTLKETAARTGQPIAKGRFAHCLAFM
KLRSYREERVNWKFTYSGDLQPIADAWKVQVLTGVEIWQPENYWIGEINK
RLKKEGLVCYVLRRPVAEVRMRLRLPMHFQIYPISDRYSFHDTTAPYSIA
FGHSALLLDTLAYTFKSKGDEIWIA
>Ava_1480 Alpha/beta hydrolase fold
MATIEILGFPHAYELTAPTSYPDALVFIHGWLNSRGYWQPVTSRLSVDFQ
CLSYDLRGFGESQSQLETDFHRGSSSASLSAQSLQEINLSFDSLYTPAAY
AQDLAALLEQLNITSAWLVGHSLGGTIALWAAAQIPQRIKGVICINAGGG
IYLKEAFEQFRMAGQRFLQVRPRWLSQLPLIDLLFTRASVSRPLDRQWAR
QRVIDFVVADPEAALGALLDSTTEDEVNRLPKLVSQLKQPVYFLAGTDDK
VMEPKYVRHLASFHPLFQYCGDNVIEIPDCGHLAMLEQPDAVASHIKSLV
IGHG
>Ava_4391 Serine/Threonine protein kinase
MLAGKILQGGKYTLIQEVGRGGFGITFKATHHYLGHEVVMKTINERLRQH
PDFAKFERQFQDEARRLATCIHPNIVRVSDFFVEDGLPYMVMEYIPGETL
GDAFVLPAIPLPEETAIHYIRQIGAALQVVHNNGLLHRDVKPDNIILRQG
TQEVVLIDFGIAREFNSGVRQTHTGLVSEGYAPIEQYLTQAPRTPATDVY
GLAATLYALLTGQVPLPALLRDREQMPAPRELQPHLSAAVNQAVMRGMAV
ESRFRPPSVTEWLQLLPGSGINLPIHALPTHAVPTIDLSVHLQEKFSGAK
TSGSVKKPSPLKNIALMAQKSGTSQLFLGVIIALVAAAAGFSATSLFSRS
WTQPSAKPLFEERPMVPVGGKSQVDSPQKQDNTRQNSQNYSASETTPVST
SRRRRRNLTNQQTESPNPNSNSPEQSPQLNTGELQPRNYEQPGLSPNTSS
TPSLVEKLREIRSSRTAKPANTTDTSPPSSTQNSTTPVEPPSNSIVVPVQ
PTEPKNSDSSVVVPTVEVKQNSSTESQIPAPSQKNEKFPENTQINN
>Ava_2911 conserved hypothetical protein
MSLQQQIRTRNTDQLLGIGILGLLALLYAPVLLHWLNGWLYKTISTEHEY
FSHGIIGIPFAAYLSWGNRKQWQRLPNKTHPLGATLLVIGAVFYLTGVTE
WVNLSFPAILAGLCLWFKGFPGLKLQGFPLLLLFLATPTAMPYLIAPYTF
PLQSFIAGTAGFILNQFGMDAVVDGINIYVSGRIVEVAPYCAGLKMLFTT
IYVCLMLLYWTDNLSSRRTTVWFLSTAVLISVIANIIRNTILSFFHGTGQ
EAAFKWLHDSWGGDLYSALMLLSLIPILNRMSDYFSAPLESEIEGESQLI
SPEITD
>Ava_0646 Beta-lactamase-like
MAHVNLRRPQNTNGDFYVDTTCIDCDTCRWMTPEVFHRVDEMSAVYHQPT
NETERLAALQALLSCPTSSIGTVEKPKDIKVAQESFPLLVAENIYHCGYH
SEKSYGAASYLIQLPEGNILVDSPRFTPPLVKRIEEIGGIKFMYLTHQDD
VADHQKFAEHFQCQRILHIDEISEGTRNLEIQLTGSEPFTLQADVLIIPV
PGHTKGHTVLLYKNKFLFTGDHLAWSETRQQLTAFHDVCWYSWAEQIKSM
RRLADYSFEWVLPGHGRRFHADVDTMRQQMHKCIELMTDERQLLTFNS
>Ava_0945 ATPase
MRIKQISVSGLFGIFDHVIPLNMDERITIIHGPNGFGKTAILRILNSFFN
SRYSELIDIPFNIFRLEFNNNSSIEIIKYTKELENVDDSDIILKFYENDS
KPVSVSLKPLYPYNTYKAVNTFRSARFDIEELEALVRTFQVSNEGTTSLE
RVKYNLIDVPPNPKLKSQEEPKWLENIKKYIRIRLIESQRLLNLSPNGSS
NKYSMISTVSAYSDELAKLMQDKFQEYGKVSQSLDRTFPIRVVKQQPSAD
ITDNQLRQNLNELEATRSRLIEVGLLDNDEDSEFQIQPQDIDESTKNALS
VYIEDVEKKLSVFDDIASKIDLLKKIINNKFAYSYKEINFSKEKGFIFTT
LYNSSSSNSKNLSPTDLSSGEQHELVLLYELLFKVQPNSLVLIDEPEISL
HVGWQVQFLKDLQEITKLADLDILMATHSPDIIQDRWDLTVELKRPEK
>Ava_1087 Putative esterase
MNNLKLISEYQSFGGKLGFYSHPSFTCNGEMRFAVYQPPQAAEKPLPILY
FLSGLTCTEENFMAKSGAQRYAAEYGLILVAPDTSPRNTGIAGEDDEWDF
GTGAGFYVDATEEPWRSHYQMYSYIVQELPALIAANFPIQAEKQGIFGHS
MGGHGALVCALRNPHIFKSVSAFAPIVAPIGCPWGQKAFSRYLGNNQASW
RAYDASELVKQLGYHSQILIDQGTSDKFLTEQLLTDVFAQACKAVNQPLN
LRYQTGYDHSYYFIASFIEDHIRHHALA
>Ava_3223 Alpha/beta hydrolase fold
MSTNVLSTSDPTGFGGVVHEFLWQWEGQPLRVVYETLGQGSPLLLLPAFS
SVSTRLEMGEMARLLAPRFQVVAVDWPGFGESSRPSLDYRPEIYQRFLED
FVQAVFSTPITVLAAGHAASYVLLLAQKQPDAFSKIVLVAPTWRGPLPTM
GASPQVAGIVRGLVRSPIVGQILYKLNTTPSFLNFMYRRHVFTDADRLTP
AFIDKKWQTTQKPGARFASAAFVTGNIDAVHNQSDFLGLVQSLSVPLMVV
IGASSPPKSREEMDAVAAIPGMQSAVVPGSLGLHEEYPAAIFAAIEGFLF
>Ava_2569 Virulence-associated E
MKSINPILHQNSDYAQAETFLSLLCGDSETAVTWQTFDDSDSEQKDRTLA
QCWHDTLASSWNKLQRLNNKGAGVFVTVQETDGTGERKTANVVQVRALFI
DCDNGIPDSWHLQPSLIVSTSADKCHAYWLLDESVQASADEFRGWQERLI
AHYGSDKAVKDLARVMRLPGLLHKKGEPKLVTIFEASGTRYAVDEIMDGL
PEVGKPAKVNQPKPNYSDANFREKLEHFANKLKQAKEGERNSTLNSTKYT
LAGLFPDKLEDIDNCLFTVATECLGLSEEETRATLASANTGASKPITLTS
NGGKRSKSNLMRDTLEQLFGNSLQWDEMQNKLRFRGIHMSIEKLHDICER
ELDLDLPFDNFRRIASVIAQDNPYHAVRSYLQSLTAAEDPEPILSALYQA
MGITNRLHRLYVRRWLVSAVARALNPGCKADCALVLQGKQGIGKTTFFSS
LFGESFQTLGEHKSDVDQLLAMTRSWCIEWGEIENAFSKKAVSAIKSFMS
IERDTYRRPYASEPDTYPRHFVICGTTNQSEFLTDSTGNRRFWVVNLDQR
VDTKAVEDMRDDVWSAVLALFLAGERWHLEGEEVEKAAEDTAQYEQDNPW
TEKITAYTARHNPCTVADIMENALGFDVSKLNDKKAQGDVTAILRQLGYT
KEQKRLNGVKARYWYRPTLENTSTDDVRVTRDNIPAEENIEYSEDLY
>Ava_1105 TPR repeat
MKVLRYLPQEEIISLSVSLGRGGEACIYAVPSAGDCVAKIYHKPTVAHAS
KLRAMLANPPENPTASLGHISIAWPQELLWGADESERVIGFLMPRIRGMR
PIIDFYNPRTRRQHCPLFNYQYLLRTARNLAAAFAALHNSGYSVGDVNES
NILVSDTALVTLVDTDSFQVCDPDNDLVYRCPVGKPEFTPPELQNKIFAH
HDRQATHDLFGLGVLIFQLLMEGTHPFSGIYQGIPEPPPYEARIASGHFT
YSKKRQVPYLPTPIAPPWEILHPSLQALFIRCFEDGHNEPQLRPNAQAWL
SAIAEAEDSLTTCTVNSQHHYSNHLHSCPWCERALRLGGRDPFPSVQAIE
NREHLRPRIPTKRRYGHGNQPVNLPQPVMPMYQSNWHSPTPSFSPYRNRW
KGKFYPVVFCLLGFGVLGYLDVVTKFTSPLVSRNNYAQQALMPKQANSNT
ALSFAEYYQQGHAAYQVRDYKQAVDNFTHAIQQEPTNAKALVNRGNARYN
LKDYEGALADYTVALQINPNEIKAFVNRGNSRLMLAEYSNDPDQQYRLAI
ADFNHALKLNEKEAEAYIRRGIVRSQMAKYSSDTIKDYQEAIADFDQALK
LNPAKTEAYFQRASVRYLIAQYTGDSTKEYDQAIADFDQALKINDKLAKV
YLKRGMVRYELAQITSNKSDANNAKALADLQLAAKLSLEQEDTESYQQAL
SSICIIEESKCNALFQSSTMRGYASTDLTAKQ
>Ava_4977 GCN5-related N-acetyltransferase
MVEPMTPRFKYTKASQENIQQLGNILEQCFVMSFGDSEIYVKGIGLENFR
VIYREQKVAGGLAILPMGQWWGGQRVPMAGIAAVGIAPEYRGDGAAIALI
QHTLQEISEQDIPISVLYPATQRLYRKAGYEQAGSSCVWEIPTDSIQIQH
ASLPLEPVVLKNNPIFHELYQQQAQLTHGYLDRHPAIWQGLNRTLDTETL
YSYLIGDKDKPQGYIIFTQERTRDGSILRIRDWVTLSNPAVQSFWTFIAN
HRSQIDKVTWKSSVIDALTLLLPEQSATIRSQDRWMLRIVNVCKALEARG
YPLGVEAELHLEVQDDLLATNQGKFILSVANGKSEVTKGGKGELQLDIKG
LASLYTSLFTPRQLQLTGKLQATETALLKATQIFAGESPWMIDFF
>Ava_0254 von Willebrand factor, type A
MVKTSYEFDQPILPAGFSLKANILLRFRAEIPESPRRNLNLSLVIDRSGS
MAGAALHHALKAAESVVDQLEPKDILSVVVYDDAVDTVVSPQPVTDKPAL
KKSIRQVRAGGITNLSGGWLKGCEYVKHQLDPQKINRVLLLTDGHANMGI
QDPKILTATSAQKAEEGITTTTLGFAQGFNEDLLIGMARAANGNFYFIQS
IDEAAEVFSIELDSLRAVVGQNLKVTLELADGITLVDTLSLAKVSQNEAG
QPVITLGELYEGEDKLLGLSLMISSAQVGNLPVMKLHYSADVVQNDVIQR
VSGTTDVIAKVGTVEESALASSSHIILDLSRLTIAKAKETALELAEHGQH
QAAEKTLRDLVQYLRDQGLNENFEIAEEIDQLEYFAGRIAQQALGNAGRK
ELRDQSYQTMTRNRGDLVGRGVTAGDEVHAMPVVNEIGTGVELACVREGG
KLRIKVISDGYDQTKNVQFPRSIRAEGARYIVEGLELSSNGSFYRVVGKV
SRFAKPGETDIFVAPRQSRSTNTSKASKAPATAADLPTTDTIDHGVLIQC
VKDGSKLRARVVSDGYEPDWNMRFPRSIREEGMLYVVEEVKTAPDGKSYI
ASGEIKRFLQPNITN
>Ava_4862 Serine/Threonine protein kinase
MNQSAFTSPHNTGLLANRYQLQKLIGIGGMGEVFLATDVLLGGAPVAIKF
LTQTVSDPKIQKDFAREALMSAALSQKSLHIVRAYDYGVSDTGKPFYVME
YLNGKSLKDLIPLPLNQFVHLTRQICLGLQCAHQGIQIEGKVYPLVHRDI
KPANILVIPDPILGQLVKILDFGIAKFLNHTVTLSTNRGFHGTLPYCSPE
QLDGEKLDGRSDIYSLGVIMFEMLTGAKPWQPETDLFGAWYKAHNFEKPR
TIADVKPQLKIPQQLNDLIMACLEKKASDRPQNVGEILRTIDGLEQSDCA
GLPTNISPPSYPKLISASDVIKALEQQCKELTWPQDKPKKEIVFPLPLNT
HQKNLSTLWLMLPKKEIQQRANSKIYNRFIFVTSPHPMLLWVTLLYNQEQ
EPKWLPCYLDMQHPLNRTLVLSLADNETYPLICFTLEPPHRCIQVLSSNI
ETSQRQKLKIWVEQSQKLPPTSQPLASKNLLRQQYKQIQAHMLQHLSSTR
RLEKLAKG
>Ava_1678 TPR repeat
MIPKTYRFIVPSCLGISTILMLLNTGVNQKAIANTSSQNISPVIVSQAQT
TTDPLVELGITRQELEKALAESDEAIKLNPNDVDAYLGRAMIRHFSKDYA
GAISDYEQAIKLRPRPKDVSVYSNSGKAHAELGDHKSAIAKYNTALKIDP
NGVFSLFIYNDRGLSYLALGDTKSAIADFNQAIQLAPESADSYYNRGLAQ
RRLGQKQAAIADFQKAAKFYQANNKTEEYKRTLKQLEELQ
>Ava_1235 Creatininase
MLLSLSTWQEVETYLQKSTGIILPIGSTEQHGPTGLIGTDAICAEAIARG
VGEATGAIVAPTINVGMALHHTAFPGTISLRPSTLILLVRDYVTSLAKAG
FTKFYFINGHGGNIATLKAAFSETYAYLEDLQIPSAHKVQCQVANWFMCG
SVYQLAKELYGDQEGSHATPSEVAVTQYVYPEAIKQAPLSPEVARGHRIY
SATDFRQHYPDGRMGSNPALATPEHGKQFYDLAVKELSNGYLEFLNADTV
S
>Ava_2978 hypothetical protein
MNTDLIKIIITNNKNLLIAVLITILGTWLRVYNYAVVPDENFTFDEYAFA
WSGMSLIQNSVPTSWSYLSAYDTDKLTRIEWLGKKNLYLVTPWFDHPPLF
GLIVGGFAILLGANTFWECTTQLIRIPSLIFSSISIFIFYFINHKLFNTK
IATITTLIFATDPLFVYLSRLAVSENLLILLLLGSIFCFLEYLNKSRTMY
FYILVALTSLAPLVKVTGLSIVAALSLMFVYKKKLRDGLIVMTAGIFAFS
LYFVYGWFYDFKLFISVLKAHSQRFNSILILKEMIFSGNLPFIDAWFILG
WMTLPYAMKNLVSNYKIQLIYLPLITYLFMSIFSGAQSHFYCWYTIPLYP
FLLISSGYFIGDFIKKPSLLNSSVIIIIIFSWCLNYGLGHPWSNFYILNL
KGFKYIFIALISIVYAPIVIQEVLRTKKLLFLNRVIAILVFSSCIVANIL
ITYNLKQILQSNLIE
>Ava_2742 conserved hypothetical protein
MPLPTVIVPGYLESAIAYQQLATSLQELGFPTVTVPLRRRDWLPLIGGRS
VAPIIQQLDKTVKQALQEHNATQVNLIGHSAGGWISRIYLGEKPYAARGK
AQTSLWNAHPLVATLVTLGTPHISQERWTRWNLDFVNNNYPGAFYQNVRY
VCVAGKTVFGERRRGGWLAYSSYQLTCGTGNTWGDGITPIAAAHLIGAEN
LVIEGVRHSPRNPGIWYGSPEPLKAWTQYLG
>Ava_3436 Single-stranded nucleic acid binding R3H
MSNNPMQRGQQWLQSLLELTGVSAEIHGSVETAQSHNEDSQETDGYWLTI
DATNLNPQQIQTLIGADGSVLDAIQYLVNSTLNINQPQEGQASYTVELNG
YRVKRQAEIQQIAETAAEQVRSTGQEVEIKSLSSAERRLVHTFLKDFGDL
ETFSRGKEPHRHLVVRPAVNEL
>Ava_4992 conserved hypothetical protein
MAIYFIDSSALVKRYVNEIGSSWVLGLFEPALSNEVFIAAITGVEIVAAV
TRRSRGGSISFVDAKLVCNQFRKDLQTEYQVVEITENVIISAMSLAETYG
LRGYDATQLATGLAVNALGIANGLSSVTFISADNELNLAASSEGLVIENP
NTHL
>Ava_0219 Serine/Threonine protein kinase
MQVYCSKQHANNGGNRFCTHCGEPLPLAVKQVVDNRYRIIRQLGQGGFGR
TYLAEDIKKSHKTCVLKEFAPQVEHKEDLQKAKELFEREANVLKKLQHSQ
IPRFHGSLQAKIGTKDFFFLVQDYVEGDNYLQLLEQRQTQGKTFTEEEVI
TLLRQILPVLIYIHSQNTVHRDISPDNLILRRSDNLPVLIDFGGVKQLPA
SQGFWSTKLAGNNTLLGKKGYAPEEQLRQGKVFINSDLYSLAVTALVLLT
GKEPQKLYDSYQGVWRWGKEINASIQLESVLKRMLAYKPGDRYQNAAQIL
TDLPSPSSLTKPPTTHLTKIKTMVVSPGRKVTVLAGKLHNKTQAVSQKLP
LPVWLRPFAVSFGGTALVVLTGAGTWALVNSVVRGVSSITIPSISLPELP
SLPNPLAQPVSDKNKTSSSEVLSRRQQLEIPETFFIQITDSLFYAQKPEL
KGRSLTSKPEDAPLRDQWHAVAGEFLNKIEQANLSTAARRKLGSYTQKDY
ENWRRQARSGQLGNYTASQLNKDTNEKFDQLFPGQQRGKLNQQTFGQVWY
AIAADQVSKAQSK
>Ava_2135 conserved hypothetical protein
MPPITQQSTPTPGLNLVLFSIPQSFLLEIGTASILLLLTTGQVTVKALES
LGQASEELFRGDRLPILPFPDDDESNN
>Ava_4068 GCN5-related N-acetyltransferase
MLELIKPTYNLYSEFIDMVAEYQKHGESRLFDQNNLTLIQEDFSAYIQYL
ENNSKGIDLKPGFVPATTFWLVAQSKIILGESQLRHWLIPTLERRGGHIG
YMIRPLQRRKGYGTQILTKTLEKARNMGLSRVLLTCNKDNVASASVIQKN
QGQLTSEEFVENSDIIISRYWIDL
>Ava_3104 Ankyrin
MKIHDFAIQGNIAGVRQQLAKGVDINCLDESSQTLLMCAVSSPNASLEMV
QFLVEIGADINAIGGTTFNRTVLELAIQSGNIEIIKYLLEVGVNIHAPGA
DSCDILIDAIYSQNIPSGENLISILRLLIAKGADLGKNQYGKSALTAAAS
ECRFDVVEFLLAVGADREQLQWTELMYAIVFGSVEEVKQLIDAGADLEVW
DFCNRTPWILSVQIGEIEKAQLLLPSAADRNKYLTSEEPELMYAIKNNHT
ELLKWLIAQGFDVEATDHYGTTPLIAAVERGATDCVRILLEAGADPSKSG
NYGQKPINCVSNLEILKMLIDAGEDLSDIHDDMQQLLTGLSNDKEISNID
QEQYLSGKYRRFGKANPEIMAVEFWHAMVRSIATAWTARNTFNDADNAFG
DKAVWCFQRLGRTITQLPDGRIIQIAGEHEDYYDPDFCIYNDVVVFQNDG
KFTIFGYPQEIFPPTDFHSATLVGEYIYIIGNLGYDNQVIYDETPVYRLN
CHTFKIEKIETNGEKPGWISRHKACYKEPFHIYITGGKRFSKVGDKTDYI
DNKNSYILDLKNMYWSRKLEQFSN
>Ava_4660 CBS domain containing membrane protein
MLKASDVMTKDVATIRSSATVAEAVKLMRARDWRALIVDRRHEQDAYGII
SESDIVYKVIAYGRDPYKIRVYEIMSKPCIAVNPDLGLEYVARLFADYGL
HRAPVIQGELVGIISLTDILAQSDFLEQPYTILLEQQLQDEIKKARAVCT
QKGINSEECAAAWDVIEEMQAEMAHQRAEKVSKIAFDDYCDEYPEALEA
>Ava_2826 Protein of unknown function DUF938
MNTPQDPRKYAPATQRNREPILEVLLQVLPASGTILEIASGTGEHAIFFA
PRLQPRKWLPSDPNPELRASITAWTAQFPSDNLYPPVELDASQPIWSVEK
DAILNDAPIAAIVNINMIHISPWSACLGLLAGAGRILPPGGILYLYGPYK
QGGEHTAPSNAAFDESLRSQNPEWGVRNLEDVIAAAKQQNLQLHKTYQMP
ANNLSVVFQR
>Ava_4894 TPR repeat
MDSSSINSLLEDLKHSDALVREQATKKLWRIWFQQKGMYGLEKIDQSQKL
LDAGEITEAEVMLTQLIQEQPDFAEAWNRRAFLYYSMGEYQKSLADCQMV
IQINPVHFGALHGIGLCYAALGKYAKAIKAFKRALEIQPYSLVNQKLILE
CTFRLS
>Ava_4780 Short-chain dehydrogenase/reductase SDR
MVKLILITGVSRGLGYALTERFIQEGHTIIGCARSQVTVEKLSHKFGSPH
NFAAVDVADEAQVKAWAELILKEYEPPDLLINNAAIANQPAPLWEVPSTD
FSQLIDINIKGVVHIIRHFVPAMVQKRHGIIINLSSGWGRSTSPLVAPYC
ASKWAIEGLTRALAQELPAGMAAIPLNPGIIHTDMLDICFGEEAVNYPLV
SQWVLKAAPFILQLKPTDNGVPLTVPS
>Ava_3439 Beta-lactamase-like
MCPLPQQPSHITKSPRDVLDGIFAFPPNRDTLGGTSYLIVGNEGNILIDC
PALDQTNLDFLRSHGGVKWLFLTHRGAIGRTAEIQQNYGCEVMIQEQEAY
LLPGLTVTTFTQEFDLTPTAKAIWTPGHSPGSSCLYYNYQGGVLFSGRHL
LPNQQGEPVPLRTAKTFHWPRQIKSLQTILEAFTPETLQYICPGANTGFL
RGKRSIDTAYQRLASLDLKALLQTQPLI
>Ava_0823 Cyclopropane-fatty-acyl-phospholipid synthase
MTNLGTAKNFWDATLYQDKHSFVWQYGEDLLQLLNPQPGEFILDLGCGTG
QLTEKIAQSGAEVLGTDNAATMIEKARQNYPHLHFDVADARNFRVDKPLD
AVFSNAMLHWVKEPEAAIASIHQALKSGGRFVAEFGGKGNIKYILEALYN
ALETLGIHNPQALNPWYFPSIGEYVNILEKQGFDVTYAALFNRPTTLAEG
EFGMANWIQMFASAFLVGLTPDQQVQLIRKVEATLQDKLYHQESWTADYR
RIRIVSIKAQ
>Ava_1499 Alpha/beta hydrolase fold
MICFPPYNPPVFLRNGVVTTVYTTLWGKRYWQNTTPHLEPQYHKTVLMGG
QNVPIFCLIAIPENAHSTIVGTYGITGDLENEWFLRLLGRKAYAQGFAVV
LFDWRAHGKTAELSPTLTSDGLFEGEDFVRIAAGAAAMGCPSKFWFTGFS
LGGQLALWGLKAAADLTNGTEYFGLKYSDIGGGAVICPNLDATRSLSYLV
RHPVGKYLEQSIAKNLKKLAWRIHDHHPGTLDSEAIERANTIWDFDNELV
IKRLGFPSVETYYAASSPLQLLPEISKPSLIVYAADDPLFHPELVPELQA
TCDSNPKIDLLMTRYGGHMGYLSSKKGQIQAQDADPWWAWNRVLQWLEYI
RKF
>Ava_0479 conserved hypothetical protein
MSNNRSGTFIGGMMLGATIGALAGLLAAPRTGRETRKLLKKSADAIPELA
EDLSTSVQIQADRLSTNALRNWDETLDRLRDAIAAGMDASQRESQVLKRQ
QATPDADSLAQELENP
>Ava_1638 ABC transporter-like
MKIVLENIHKSYGKRVIVNRVNLSVAQGEIVGLLGPNGAGKTTTFYIATG
LEKPNQGRVWLDSLDITGLPMHKRARLGIGYLAQEASVFRQLSVQDNILL
VFEQTNVPRWEWAKRLNTLLREFRLEKVAKSKGIQLSGGERRRTELARAL
AAGGEGPKFLFLDEPFAGVDPIAVFEIQQIVAQLRDRGMGILITDHNVRE
TLAITDRAYILREGQILAYGNADELYNNPLVRQYYLGDNFQV
>Ava_1059 Peptidase M20D, amidohydrolase
MLTLIKDLATKLAPRLIEIRRHIHSHPELSGQEYQTAAFVAGVLSSSGLR
VQEGVGKTGVVGELQVTEKNHSFLAIRTDMDALPIQERTGLEYASRADGV
MHACGHDIHTTVGLGAAMVLSQMTEELGGNVRFLFQPAEEIAQGASWMVA
DGAIKDVSAILGIHVFPSIPAGSIGVRYGALTAAADDLEIVIIGESGHGA
RPHEAIDAIWIASQVITSLQQAISRTQNPLRPVVLTIGKITGGRAPNVIA
DKVQLLGTVRSLHPETRAQLPNWIERIVANVCHSYGASYQVNYRQGVPGV
YNDYGLTQLFQSAGEEAWTSDRVQVLPEPSLGAEDFSVYLEHVPGSMFRL
GVGYPERIINHPLHHPEFEVDESAIVTGVVTMAYAAYKYLRG
>Ava_3458 Alpha/beta hydrolase fold
MSTITTKDGTQIYYKDWGIGQPIVFSHGWPLSADAWESQMFFLASHGYRC
IAHDRRGHGRSSQPWHGNDMDTYADDLAELFEALDIQDAVMIGHSTGGGE
VARFIGRHGTKRVSKAVLIGAVPPLMLKTEVNPGGLPIEVFDGFRAAFLA
DRSQFFLDVASGPFFGFNRPDAKVSQGLIYSWWMQGMMAGHKNAYDCIKA
FSETDFTEDLKKFDVPTLIIHGDDDQIVPIGASALLSAKLIKNSTLKIYP
GGSHSLGDTSKEQLNADLLEFVKS
>Ava_1107 FAD dependent oxidoreductase
MYDFTIVGGGIVGLSTGMALGKRYPQARILVLEKESQWAFHQTGNNSGVI
HSGIYYKPGSFKAKFCRDGRDSMVKFCQEYGIDHEVCGKVIVATNGQELP
RLENLYQRGLENGIEVQKISPEEVKEIEPHVKCVAGIRVFSTGIVNYKQV
CLKYVELIQQQGGDLRLNTKVLKICPSGKNHVLETNKGNFETRFVINCAG
LHSDRIAKLGGVQPSAKIVPFRGEYYELTPEKRYLVKTLIYPVPNPEFPF
LGVHFTRMIDGSVHAGPNAVLSLKREGYKKTDFDLRDFAEVMTYPGFWKL
AGKHADEGIQEIIRSFSKAAFTRSLQNLIPEVQAEDLVPTHAGVRAQALM
DDGKLVDDFYIVPGENSIHVCNAPSPAATSSLEIGKAIAAQIPQQSHLEN
AVIA
>Ava_1577 Amidohydrolase
MSFTIQNVLLATDDGYITTDVQILGDKITAIAPNLDVVGTVIDGTHKLLL
PGFVNAHTHSSEMWQRGLISIFPLELWLAELYDFAPLDTEKVYLSALGTA
VETLLSGGTSVVDHLVLIPGEELETIASAVRAYQEVGIRAFIAPLIQDQS
LSAGLPAGESTQTHEPFFRSTAATLELVEEAVRQFHHPDAGVSILVAPTG
IQLCTDALFTGCIELSDRYNLCRHSHLLETKAQEKLAQEKYGCSAVTHLQ
RIGYLGDRTSLAHCVWLNDTDIDILAQTQSTVVHNPLSNLRLGSGIAPIL
KYRQAGVNVTFGCDGASSNDSQDLLEAIKIGSILHNVTDFDYRSWISPRQ
AVEMASLGAAKGLNLADQLGSITVGKKADLILYDLTNLSLVPRTDPIGLL
VLGRPTNVVDSAWVNGRQIIANRQVTTIDVEKLRTELFHLSEWATNRQSQ
TVEQFEVHYRTVMGLI
>Ava_1575 UbiE/COQ5 methyltransferase
MATIFRDLSYRYQWLYDSISRVAALTVGGEARFRQLALQGLTIEKNTSVL
DVCCGSGQATQLLVKYSQNVTGLDASPLSLRRARQNVPEANYVEAFAEKM
PFPDKQFDVVHTSAALHEMEPQQLREIIQEVYRVLKPGGIFTLVDFHTPT
NPIFWPGLTVFLLLFETETAWQLLKTDLAGLLTEIGFEVSKSTLYAGGSL
QVIQAKK
>Ava_2391 hypothetical protein
MSDSIISQNDIDYLKMIRKNIIEFMKYTSVNYAHQPGILLDIAPQIHSGA
KPYFSEYIKVETFDIDIHSGCTYIGDICQYNEFLKINTFDYIVCTEVLEH
TLHPFKAVDEILRILKPNGKLFLSVPFNFRIHGPLPDCWRFTEHGLRVLL
ERFSILELNAIETPERQLMPIHYTVVAQKLI
>Ava_4912 Haloacid dehalogenase-like hydrolase
MLRLITDFDGPIMDVSERYYRVYLFCLQKTQHPGQPVRQLSKAEFWQMKR
QHVPEKEIALISGLDAVQAQEFSQLRRQTVHTEPYFQYDIPIPGALDVLL
KVQQVGVDLVVMTMRRVWELDYAFQKYDLGQFFPENRCYCLSNDYVKTRD
IDDKPLLMQRALAELPPAADTWMVGDTEADITAAKQHGVKVIAVESGIRD
RTQLQQYHPDLIVQNLSAAVDIILESSVVKI
>Ava_1861 kinesin light chain-like
MSAVHHVSLLGDNHHSVAESLNNLVLLYYSQGKYNQSEPLYLQALDILER
SLGANHPHTVTCRKNLANLGNSLQQEQ
>Ava_0859 Protein of unknown function DUF990
MQRYLKVLKLFWSAAIAAEIEYRINFFIATLSSIGNLAGSLFGLFLFYRN
GYTFSGWSWEAALVVLGIFTLLQGFSATFLAPNLNRIVRHVQEGTLDFVL
LKPIRSQFWLSTHTISPWGVPDLIFGGVIIGYAGKRLGLGITNYLPSIIP
LFCGLIILYSLWFILGATSIWFIKIYNATEVLRGLLEAGRYPMVAYPAAY
RFFFTFIVPVTFLTTVPAEVILGRVEIPWLIGALVLATVLFWASSKFWSF
ALRFYTSASS
>Ava_3074 HAD-superfamily hydrolase subfamily IA, variant 3
MLANIRAAIFDMDGLLFDTESIARWAWQQALASHGYIMSDNFYSEFVGRD
LSWREKILKQRYGNDFPFEAIKRHRIEIGDRRELQEGLPMKPGALNLLCQ
LNSLGIIIALGTGTSRSRTIRRLSNAGILPYFTTIVTSEDVPQGKPAPDI
YLEVSRRINVTPVQCVVFEDSCVGVEAAFSAGMYPIMVPDIEQPSPEIRC
LTYKILDSLEQASEFLEQRLEA
>Ava_1773 probable penicillin amidase
MKSFRKLKLYQGLKTTVILLVVLSILLWGFLSYTVQRSLPLENGAIALPS
IKSEVTIKRDQWGIPHIYATNSHDLFMAQGYIHAQDRFWQMDAEMKADLQ
AQT
>Ava_2683 putative ABC-2 type transport system permease protein
MKKTIRKAWTLLTVYYAYMVEYRAELILWVLSGSLPIILMGAWIKAAQGG
SFGLSPVDFARYFLTVFIVRQISVVWVIWEFEKEVVEGKLSPKLLQPLDP
VWHHVASHLSERVARIPFAILLIGLFFILYPQALWFPTVSQLLLFTVAVS
LAFVLRFVIQYTFAMFAFWTERANAIENFWFLFYLFLSGLIAPLDVFPPQ
IKAIVLFTPFPYLIDFPASLLVGLPVDVVQGFLSLLAWIFMFWVVNRLLW
RAGLKKYSGMGA
>Ava_0251 conserved hypothetical protein
MSSMPSQCHNLKDQVESILQLLQEELSLRSQDITSVQTSLGKAISPQFEI
VFAGAFSAGKSMLINALLERELLYSAEGHATGTECKIEYAELDKERVVLT
FLSEVEIREQANSLCQQLGFTTAVNINQAEVVSLLHQGCEAIIQQEGGES
KSERAKQAKALIFLLQGYEANRQHIHTVNNATYSMEHFNFSNLKEAAGYA
RRGSNSAVLKQIEYFCNHPLLQDGNVIIDTPGIDAPVEKDAQLTYAKIQH
PDTSAVVCVLKPASAGDMTKEETELLEKMRQNGGIRDRVFYVFNRIDETW
YNTQLRQRLDDLINTQFRDTSRVYKTSGLLGFYGSQIKQTSTQDRFGLDS
IFGESAKYVNGNEETPQFVYAFNNYCVSSGKLASSNFRISLNGFETPNQN
YVRILSEQGTPLINQLIQDSGIEEFRTAVTRYLTEEKRPQLFKNLADDLE
DICINLKKHYQTVHRDLDSQPREIEMMKAQELQYLNQQLQQVGKDYSLHI
TEEVNHVINHACDRFETDFRQLQSRMIRRLDELLDTFSVADAYRRATLSH
PRNATAPLLAILVEAFYYLANQLEDILVDSSQELVTNLFQRLMEKIRKSE
YYRQLYRLLGNDAGIEQELKALEKRVTQSLVDAASVECDRFVRESPSFYD
ENTFSIYQFRQTLLQTSQSYDSESMVEAEPAIRQLLKLDFEPKVSHTIRK
TFRQTINQILKTQLLPMAKQQGDGILQQYPQARLYLESTLQQEAEQKINN
NNRLLGVVEGKVDVYNTAIANINNCLQAMQLYDYLLPLINLGELTVVDQI
AANNGVVVSDGLLDGVTQV
>Ava_1637 permease YjgP/YjgQ
MVAKKLSSFYSFHSLLPFTIMDRYLTSELLPTFLFGVGAFSSIGVTIDAV
FELVRRVVESGLPVSIAVQVFLLKLPNFIVLAFPMSTLLATLMTYSRLSS
ESELIALRGCGVSVYRMVMTAVMLSFVVTGLTFLFNEQIAPAANYQATLT
LDKALKSDKPKLKQQNIFYPEYRDIQEKDGTKNRILTRLFYADQFDGKQM
KGLTIIDRSTDGLNQIVVAESAEWNGAQSIWDFYNGTIYLVAPDRSYRNI
LRFEKQQLKLPRTPLSLAEQSRDYGEMNIAQALDQLNIERLGGDRQKIRK
LEVRIQQKFALPFVCVVFGLVGAAMGTIPQRTGKGTSFGISVIVIFSYYL
LGFITGALGQAGVFSPFIGAWLPNFIGLGVGIFLLVRVAQR
>Ava_3208 permease
MNQLNNGFTIFLSLLVEAMPFLLLGVLFSSLLLFFVDERKLVEKMPRNPL
LGALVGSMVGFLFPVCECGNVPVARRLLMQGVPTPVAVGFLLAAPTINPI
VIWATWTAFREQPEIVVLRVIFSLLIATIIGFVFSFQADLEPIVQPAIAR
YLKFNPPAKPETKRRGKYSTTKDTNTPSMLRSGTYILGGRAGGVPTRLDA
NLAPTNEASSNNKPLRDKLRLLLDNSIQELRELGAVMVIGSAIAAAIQVL
APRELILSLGSGPITSILVMLVLAAVVSICSTVDSFFALSFAATFSSGSL
LAFLVFGPMIDIKSVGLMLSIFKPKTIFYLFALAGLLTFLFTLFINLHVI
>Ava_2443 NADPH-dependent FMN reductase
MVKIVGIAGSLRPNSYTQLALRVAAQRLEALGAEVEIIDLREWQLPFCNG
GKDYSDYPDVQRLRDTVSNADGLILATPEYHGSVSGVIKNALDLMSFDEL
SGKVTGLISILGGQSNSNALNDLRLIVRWVHGWVIPEQIAIGQAYSAFSP
EGKLLDEKLSQRFDQFAQSLVENTRKLRGVN
>Ava_4720 Glutamine amidotransferase, class-II
MCQLLGMNCNVPTDICFSFEGFSARGGKTDHHSDGWGIAFFEGKGCRIFL
DAKPSIDSPIADFVRCYPIHSTHVIAHIRKATQGEVALENCHPFRRELWG
RYWVFAHNGNLPDFQPPIQSFYQAVGHTDSEKAFCLILETLRQSFPEGKP
SLEKLYPVLQQVTKTLAAIGVFNYLLSDGEHFFTHCSTNLSYIVRQAPFA
AAHLIDQDMTVDFNELTTPSDRVAVIATTPLTDNEVWTPIQPGELLVFQD
GLPLRHIFN
>Ava_1339 WD-40 repeat
MTDIQAWAFLVGCNQYLDYRTIVAPSFMCESKTSSLLAKAAGGDLTEIGR
AYYREIHNSKVGDLTLVFRVIEATSENTGIEGQGVLKDSFGREIDLIEGI
VFKGLLEDVVVEEEELEVIHEKLIYCYREFWDCMSPQRAVSLTDLTRPTV
NNVEDELKNLKLIRLKPYIEKKEQSSIGDKFAARKNTKWVSLSTEEFPDE
INSLAVSKDGSYVAIRYARTILVRNWNNKQTNIIKSERVLFGFDNTPIAI
SNNNQLIATAMVEGIDQNVVKVWNVKTGEERIIGEHLFNGLHRVKAVDFT
PDSNIVASAGGDKNIKLWDVISERLELGTLIGHESEIRCIAISPDGKTLA
SGDGHGCIKLWDLVTRKNTRTITRKKYYEKPVNSLAFSPDSKFIVSGSDE
CDVTLLDGKTGKKILKFGEHSEPVNLVIFSPNGQMIASASDDCTIKLWDV
QEKTEIAELKGHTKAVTSVSFSPDSQTLVSGSKDRTIRLWESSITTTGKS
TGWG
>Ava_0906 conserved hypothetical protein
MQAILLSSEEVAKRAKELYDNNIRQQVETEENIGKMVIIDIETGEYAVDK
TGIESAKYLRSKNRFARLFGIRIGYKVAASFSGEMERDYQ
>Ava_4627 conserved hypothetical protein
MPTNYPTKIAILWIVFLLGTLFHTQLGLMPLFHGLSVIESQKATNINEIS
GIMWLMLGFFVLPMLATIGNIFIENKRYRLVHFGLTIIYSVLNFIHLVLD
LLLPQIIWYQITLMVLLFLIGLLLNLVAFQWMKSPPKNNHSPERLTSTY
>Ava_1833 conserved hypothetical protein
MTASNPTILALDFDGVICDGLIEYFEVAWRTYCQLWSPADDIPPDDLALR
FYRLRPVIETGWEMPVLIKALVDGNSDDQILQEWTSITPKILLDDKLQAK
EIATKLDALRDQWIANDLDGWLSLHRFYQGVIEKLKITVASEVKLYIVTT
KEGRFVEQLLHQEGVDLPRDSIFGKEVKRPKYEILRELIQAADHKPVSLW
FVEDRIKTLQLVQQQTDLEDVKLFLADWGYNTQSERKAAQNDPRIQLLSL
SQFAKDFPGWV
>Ava_3614 Oxidoreductase, molybdopterin binding
MLGKFFQKPDQENSDRVPPGQHLAKGFPVLTYGAAPKVSLEGWEFRVWGL
VKPTVFTWSDFMNLPHHEFTADFHCVTRWSKLDVKWTGIKVTDFMSLIEV
DSKAAHIMEHCYGGYTTNIAIADFVREENFFAFKLFGEDLPSEHGGPMRL
VVPHLYAWKSAKWINGLEFLDKEELGFWERNGYHRRGEPWEEERYS
>Ava_0901 TPR repeat
MTESLPLRDRYLALIDEIVQMTLQGKISSVEMVYQMLQKGILAGTGEVFE
LVLSDRLSTSQAQVDSEQDELKKAKANRSLRAIKTIQSQWQRYAEQNKAT
EAIASAVQEITTASASDRLATFLRIIDPNLKNPLNLSQLQQLGKSLQQFA
QFAPDIQQISQGITRGVAAWQRLQEHLVSWMYESNQALGFGGVPGETGPW
ATWAKKVNSELPQALFRSLAVEQSAIPFVEQHNNITLADWVEIALIFQYL
QRGLVNWFDQQAYNIQAGSKLSISTFLTFAVLWSQLASGFQGKAEYSNGC
SQIMLQTLRTFAQRPYFPLYGGIFASFSGSYLRDALDYLDEPLRKVERTQ
EKARILTLLGYSRRALGQYQRSIKFHEQALEIARNAGDRPCEIANLNHLS
RTYVQEQDYAEAINNSQRALMLSRQAGDKTGETNALVNLGYSEVMQAQKL
EQAEPETYERAISYLEQGLKLSEKSGDIQSKALCVSSLGIAYLVIGQPQD
AIKYLEDGFKTAQISGDLYLQGRNLAYLAEACYQLQNLEKAVYTGCLGMY
LLEQIASLEWRQAAGLLTILQGQIGLEAFQNLLQQSRSRIISIIGVDGYD
YIPKLLAQYQEDI
>Ava_1989 tRNA modification GTPase TrmE
MTQLLAITGTIAAIATAIVPQQGSVGIVRVSGSQAIAIAQTLFHAPGKQV
WESHRILYGYIRHPQTRQIVDEALLLLMKAPRSYTREDVVEFHCHGGIMA
VQQVLQLCLEGGARLAQPGEFTLRAFLNGRLDLTQAESIADLVGARSPQA
AQTALAGLQGKLAHPIRQLRANCLDILAEIEARIDFEEDLPPLDDEKIIS
DIENIAAEISQLLATKDKGELLRTGLKVAIVGRPNVGKSSLLNAWSQSDR
AIVTDLPGTTRDVVESQLVVGGIPVQVLDTAGIRETSDQVEKIGVERSRQ
AANTADLVLLTIDAATGWTTGDQEIYEQVKHRPLILVMNKIDLVDKKLIT
SLEYPKNITQIVHTAAAQKQGIDALETAILEIVQTGKVKAADMDLAINQR
QAAALTQAKISLEQVQATITQQLPLDFWTIDLRGAIQALGEITGEEVTES
VLDRIFSRFCIGK
>Ava_3420 HAD-superfamily hydrolase subfamily IA, variant 3
MSLKAILFDFNGVIINDERIHLQLIDEILIQENLQPQKVQERQASLGRSD
RACFQELLANRGRVVSDDSLTQLLNSKAQAYVLEVEKLEKLPIYPGLEDL
IYQARSRNLHLGIVSGAVRQEIELVLNRAKLAEYFTVIVAGDDITTSKPK
PDGYLLAVERLNQEYPELNLLPQECLAIEDTPAGIAAAKRAQMQVVGVAN
TYPFHMLQRCCNWTVDYLSDLELERVQKIFSGGEFQAIVSEC
>Ava_1314 Peptidase M, neutral zinc metallopeptidases, zinc-binding site
MNLLESLNRLTKNHLGIGIVDAWLQAPQTREASQKAAQLAQTKHLREAVT
IAEKALSCWSRKPGFWERLICQILLGKLVNQLTQQLQEWRKQVGTVDKQL
ASAKTLLKQDTGDPWETTNLTNIITIYQRCSKIIHDERILQAIQQCQQEL
QKRQQFQELVKQAQSLVENLFFKNAIATYQQAEELYSTQLLTQAIATATQ
QVPQEEAYDASLQRVRQGETEGKLRGAIVLLESALAKFPRTDGHELLHKL
KSLVQGRELFRRGLAAEKIGDFPTAISLYTNAESLLPENTNCRIRLGLVT
IKTQAWETALSYLEDLPGEQAAYLRGFVYAQQENLQIAYREWQGLSGAQI
SEQREILKILSQRQRLFSLQNIEELVKAENLEQAKIASREFTQKFGFHQL
VEENLQEHIQPRLEAAVWQRANWQLISQQAENYWIAEPNIITLHNWAVAN
YYHAQQDSNQIINLIISLSTALANLHKDINLQDVPWLGTQSVDFQLVFNQ
VKNRIETFIDTIKDKNIEDYLKFRDCWRLETLALDLMDQPPIKGIKMNDI
FLTPGCYNHYLKFDPSNNFNSIEPHQKILHCLYTNWGLAVAACVAGDSQR
AIKLKPINLATSDLEIFAHNFVAYHEGCYHLQQQKWREAIHPLNQAKSEI
LANQEWQIELDRLCSLQRQNISEDHEHLVFAQFWYDILNSTKARAYLGEY
KAERIREQLASKQISKQKALKELERIKLIDADNPIVIDLIQRIEVALAVE
VIEEFLNKNNLEGAVKFAKQNQYLKVKNIVTEICVDILVDGFKSGKLGFE
EIYDLGRWAYELSPDEPNIQEIYLISQELHEIHHLIKRDRYEEAVRRAKY
SEYDAIHSYVGDYLMMTLIRGMKSETLSTHLVHQLGRWVYQLCPHDPDYQ
EIYRRLNIR
>Ava_3910 conserved hypothetical protein
MKLQKILLSVVTGVSVVSLGLAGCTPQQTDLEAQTETNAPTLTGQTETQA
QAPTTQPQERPADVPYVPTPQVVVDAMLQVAQVGKNDVLYDLGSGDGRIV
NTAAQKFGTRGTGIDINPERIQEANENAQKAGVSDRVKFVQQDLFKTDFS
DATVVTLYLLPDINLKLRPILLKQLKPGTRIVSHAFDMGEWKPEKTLQVD
GRTIYYWVVPEQVPANLRQ
>Ava_2725 Alpha/beta hydrolase fold
MPDVELKPCFLTPRRVRPEYPLFVYLPGMDGTGQLLRSQTAGLEIGFDVR
CLAIPRQDLTSWDVLTNNVLDLIHAELEKSSQRAVYLCGESFGGCLAMKV
AIKSPHLFKRLILINSASAFKLRPWLDGLSQLVQLVPECLYDVGALGLLP
FLASLQRISRNIRQELLKTMRYVPPETVLWRLSLLREFDVSDEQLRSLTQ
ATLLIAGGSDRLLPSVSEATRLANIISHSQKVILPNSGHACLLEQDVNLY
EILQVNNFLEIKSPKISHLKIPQQKI
>Ava_2677 Virulence factor MVIN-like
MTNQEQKPSRSFAGIAGIVAAATLISKVFGLVRQQAIAAAFGVGAAATAY
SYAYIIPGFLLVLLGGVNGPLHSAVVSVLARRKREEAAPLVETVTTLVGG
VLLLVTVAQIFLADNIVDLVGHGLEAKTRAIAIQQIQIMAPMALFSGLIG
IGFGTLNAANQYWLLSISPLLSSVAVVFGIGIMALQLGKDIIKPEYAFIG
GMVLAWGTLAGAILQWLVQLIVQWRLGLGSLRLRFDFKSPGVQEVIKIMT
PATVSSGMMPINVATDLYFASPIPGAAAGFNYANLLVQTPLGIISNIILL
PLLPIFAKLAEPENWPDLKLRIRQGLLLTAVTMLPLGALLISLSVPIVQV
VYERGAFKQEATQLVSSLLVAYGIGMFVYLGRDVLVRVFYALGDGQTPFR
ISIFNIFLNVVLDWFFVKPFGAPGLVLATVSVNCSSMLMLLFLLDRRLNG
LPWREWGLPILGLAGGSVVAGIASFATLAASQQLLGKEGLLIQILQLCIS
GFVGIAVFAAIASLMKIPEVNSFVVRMRQRFLKK
>Ava_1170 Metallophosphoesterase
MNLKRRQFLFLSSLSAFGTGLLAWKFAHKYYQSSDLAIASPPKKDLLLRF
VSVADTGTGARGQYAVAKAMTLYHKQNPYNLVVLAGDNIYNNGEIEKVNA
VFERPYQDLLKQGVKFQACLGNHDIRTDNGDPQVRYPSFNMNGRRYYTFR
RDRVQFFALDTNNNADWQNQLTWLEKELSSSNAPWKIVFGHHPIYSSGVY
GSNQAFIKTFTPLFQKYGVQLYINGHEHSYERTRAIDGTTYLICGAGAGN
RPVGRSKWTEYSTSDLSFATYEVYPDRIELNAITTNNRVFDRGIIRRVGV
>Ava_3483 TPR repeat
MSAGMGINSRKRSLVDGNRALNLFTDRHELTRVFAAYLHDEPAEKILSFS
GDGGNGKSLLLKFLRTKCCKRFGADAWQKLKTKTAAEIADYIESADNDQC
DLVPAILQDFGLQPNGDDQPQDPFYGLLMLRRSLSRAATELGYRLRFPLY
DFACVWYLKQKNRLTREKLAELFPSEEMDLLIEIVNAVSDTSWGTIGKAV
FGIFNKHLGENLLLHWQKRGLKKEDIEEIRGMDAETELMNELPRYLAQDL
SAAMSQEKAPPRIVLFFDTHEAFWGGQRQQTGILYFQRDEWLRYFLAELD
LKAGIVAVIAGRETPRWAQADNFQIPQKYIDIQLVNHLSSADADVYLQRA
EIGDQALRQSAIAYSSVTANQVHPLLLGLSADVILQAQEHLTPEDFPKQE
ATLNKAKYLMNLLLKYTDREFGYAVHALSACRSFNFEIYRLLAEELHFST
TKPAFDILTEFSFVWDVEKLGENWYRIHDLLRRLNYENSNEITQQAHVVL
EKHYRQQGQVAEAIYHANRLDWRRGVDEWEEVFEQALELSRYAQCRSLLE
VRSELVINSDFQIGRVSQSEGDYFAQLAKYQEAQTEYLEAVAAYNRELSI
TPDDTATLNNKGLALESLGNLQTQLAQHTQAIQSYTSAIAAYDQALNLAP
NYTQTINNKGLVLKNLGDLQTKLAQHPQAIQSYTSVIAAYDQALNLAPDY
INALNNKGVALQSLGNLQTKLAQHPQAIQSYTSAIAAYDQALNLAPDYIN
ALNNKGVALQSLGNLQTKLTQHTQAIQSYTSAITTYDQALNLAPDDTYAL
NNKGNALQSLGNLQTKLAQHPQAIQSYTSAIATYDQALNLAPDDTYALNN
KGSVLKNLGDLQIKLTQHSEAIESYTSAIAAYDQALNLAPNYTYALNNKG
FALQSLGNLQTKLAQHSEAIESYTSAIAAYDQALNLAPNYTYALNNKGNA
LAKLGDLQTKLAQHTQAIQSYTSAIAAYDQALNLAPRYINALNNKGLALQ
GWGKLLLQLSQKPEAVNHLQAALAVFNSSLAIVSGDESVRNLRDELQEFL
DNLT
>Ava_5038 Alpha/beta hydrolase fold
MSITEHKITVNTLEWFYRESEPVGRSDLLPVLLLHGIPSQSYSWRNIIPA
LAAQGTRAIAPDWIGSGFSAKPEKHEFAYTTEAYITALAGFIQALEIERF
SLVVQGFLGSVGLQYALKHPEQIANIAILNAPISTDAKLPWKIKQMGLPF
IGDMMTQDPLLIDRTLEGGSRYRIEDKDLDVYRKPFLKTSAVGRALLNTI
RNLQLPAAMTEIESGFKQWQQPILVQWGMIDPWLPVEVAQKFVETAPNAE
LIKLNNVGHYPQEHYDKTILEDLLPFVRRAES
>Ava_4106 Serine/Threonine protein kinase
MNNVLEMITEIEPGTLIYGRYQIQKLLGKGGFGRTYLALDNQRFDEPCVL
KEFVPTASQEKNVCKSKELFEREAKVLYKLKHPQVPQFLAWFTDSDRTFI
VQEYIDGRTYSEILFERVSETGQPFSEIEVRTWLTDVLPVLDYLHDRKII
HRDISLENIMLPHHQSKPVLIDFGAVKENVTQLMSPDSINFYNSIHTSVV
GKYGYSPPEQLRLGISYPSSDIYALGVCAVVLLTGKMPHLLLDESLNWQW
RSQVNIADDLAAIIDRMLIESPTARFQSAKEIILKLNKLHNNSPTVTQVE
FKITSPIEAIKTRQQEKETNKALEELLILQNLERTLRQYHDKLPKPIYLN
LDLPEYMEKHTTPASESSGCAFKKTSKIAAKIINIFTRRVNRHIVRKSEA
TNVNYIQINTHDFLEKTSINRNSQILEVIKKEFTNFIGPIANLIMNKVLV
TFPDCSANQLIEILAASIPDKITAERFQNDTRKLIISNLYIVKTNE
>Ava_3797 conserved hypothetical protein
MIGLNCKPLYIENAENQLYNPTLNIKTTMTYLETAAQFYSEVAQTPQVGL
CCVQSTPLQLPGLKIPLSMQEMNYGCGTTVHPTELGKQPTVLYVGVGGGL
EALQFAYFSRRPGAVIAVDPVAAMREAAARNLEMALTDNPWFESSFVEIR
EGDAFNLPVADGSVDIVAQNCLFNIFEPEDLTRALKEAYRVLKCGGRLQM
SDPIATSPIPAHLQQDERLRAMCLSGALTYEEYTQRIIDAGFGQIEIRAR
RPYRLLDSQTYNLETHLLLESLDSVSFKVPIPEDGACIFTGKTAIYAGCE
DFFDDKAGHILQRGIPATVCDKTAAKLATVKPKEVIITDSTWHYNGGGCC
>Ava_0397 Thioesterase superfamily
MPFTYHRTIHFQDTDAAGVVYFANILSICHEGYEASLRTSGISLKEFFTN
PNMAFPIVHASVDFLRPLFCGDQVIISLVPQKIGAEKFEINYEIYLADVL
VAKAVTRHVCIDANTRSKQELSTEIIQWLDGYRKETEEVERRKAREVV
>Ava_1257 conserved hypothetical protein
MPLFTNIRWRTIVSLAIVLPTGLLYSHYRNSVWWLNQEVGGIFYEIFWCL
FAFLFIPTRRAVWQIPLWVLVITCLLEVMQLWNPPFLNWVRSFWWGRMLL
GTAFTWADFPYYFIGSGLGWLWLRLIVRDAKLK
>Ava_0958 Oxidoreductase, molybdopterin binding
MGLIHIRRPQLTRRQFLHLSGISSVSLLLGGCGTPALEDLVGTVSQPLNQ
KLEKLIFNPQKLVPEFSPNEIQPEALIVNSFRSTPIIDVDKYRLIVDGEV
NHPLNMSMAEIQNLPLTSMIIRHICVEGWAAIVQWGGVQLREIIALAQPK
ENVQYVYFKSADGYYESWDIASALHPQTLLAYEKNGESLPIDNGAPLRLA
APIKLGYKQSKWVTQITLASHLSIFKGYWEDQGYEWFAGI
>Ava_3602 Alpha/beta hydrolase fold
MPYINVRGVEHYYEWVKQPSGDLVKPVMVFIHGWAGSARYWISTANALSD
QFDCLLYDLRGFGRSQGKPTVAQASESVVGADSTQEKSQAIQELTYEIEE
YAEDLVVLLDELKLQRVYVNAHSMGASVATMFFNRYPQRVERGILTCSGI
FEYDEKAFAAFHKFGGYVVKFRPKWLGKIPLVDRMFMARFLHRPIPKSER
KAFLEDFLVADYDTALGTIFTSVSKAQAEVMPQEFAKLQVPTLLVAGEYD
QIIPAKMGRQAASLNDKVEFVLIPNTAHFPMLEDAPTYLRRVREFLQVAT
PELQTS
>Ava_2965 Short-chain dehydrogenase/reductase SDR
MNIRGKVALVTGASRGIGRAIALELAQQGIHRLILVARDRQKLREVAQEV
EAMGVQATTLAIDLTQATEVNIAIAQLWRNYGPIHLLVNCAGVAYQSSFL
QSKLPQVQEELSVNLLGMYTLTSLIAKRMASQRQGTIVNVSSLMGKVAAP
TMATYSATKFAILGFTQALRRELAEYNIQVKALLPSLTDTDMVRDLKLFR
WVIPMTPQEVAKALITGLEKDAPEILVGWQSHLAVWCQRLAPWLLELVLK
IATPPAIRQQQSDENLSFWAKIQRFGDFFFSGNKFPFVFARKT
>Ava_0993 Mov34/MPN/PAD-1
MNQPTIKLLPQHQQTILSHAESVYPEECCGLIMGYVANKAKIVVEVIPTA
NAWETEADNFTQEINQTNITSPASTLKRRYAIAPQVMLQVQRQARDKSLN
IIGIYHSHPDHHAVPSECDRLYAWPGYSYIIVSVQKGIASGILSWSLDDH
HQFQSEIIDNITLNT
>Ava_2598 Basic membrane lipoprotein
MQQDFSRRKFVSYGAATFTTTLLLKACSSNQTSTPTASSGQEKFKIAIAL
PGAITDQAWNQSGYEGLNLAKQKLNAEVAYVEQVAQTDQTEALTDFARKG
YNLIFAHGGQFDAAIEQVAPQFPNTFFVGVNGNTKAENIASLRIDHLQGS
YLCGIIGASVTKSNKLAYIAGQEFPATQEELRGFELGAKSVNPKIQVIST
FTGDWSDVAKAKEATLALISSGVDVIYQWFDSASPAVLQTASDKGVYAFG
NTKDQLDIAPKAVLTSVVKRLDIAIAYLAELAQQKQLKGQIYTIGLERED
ILSLGKFGVAVPETIKQNTLKIKQEIVDQTITFVTCQEAGKNTRCIKKA
>Ava_3080 conserved hypothetical protein
MMSVPNITREILADDPFPRFDEWLDQVEQVLEQNTALLMLDEFEALDSAI
NRGRFDEEDVLGMLRHLIQHRPRFKMLLAGSHTIEEYQRWASYLINVQVV
HISYLKEEEARQLIEHPVKYFTLRYEPEAVERVLQLTRCHPFLIQLLCGE
IIVLKNEQDPSIRRLTSLEDVEAAIPEALQSGGFFFADIQNNQVDANGRE
VLRLIAAQGEGEIVSKFTLSEQFPDVWERTIALLLQQELIEEVADSYRFQ
VELIRRCFV
>Ava_2565 TPR repeat
MIFPIKKNPWLYLCLSILSFCLAVTITPAKASIPLQTTLNVSLSTPQTTN
GLEQGRNLYHAGRFAEAATTWQTAAQRYHIQGDRTNEALSLSYLSLAQQE
LNQWDIARQSIEQSVKILQTAQPSVDAIIWAQILNTQANLQLRTGKAENA
LEIWQQAQKYYEQAGDNVGSLGSQINQAQALQSLGFYRRSKQQLEILTQK
LQEMPDSEVKVSGLRSLGLTLHAIGDNKSQLILEQSLATANKIDAKTHLS
SILASLGKVASDFQDPEVALNYFEDAEKVATNPNESLQARLARFKLLIDY
DKLEYAVPLAPQLQQQLSELPPSHSSLYAAINFVATLNRLEKPDQVLPIK
DLAQLMALTVKSAQQIEDSPAQAYALYQWGQLYRRTKQWTEAKEVAQQSL
NIARQLQADDIIAQSAWQVGQLFKQQGDRPKAITAYTEAVKSLKALRGDM
VAVNPEVQFSFRESVEPVYRELVGLLLEEQPTQTALMQARELIESLQIAE
LDNFFREACLDKAQQIDKVDPTATVIYPIILSDRLAVILSQAGQPLRYYV
TRKSQADIEQTLDNLLVALNPVSNSQDRVRLSQQVYDWLIRPAEAEQAFK
NTKTLVFVLDGKLRNIPIAALFDGHQYLIEKYAVALSPGLQLIAAQSLEQ
NKIKAIIGGISESRSGFAALPAVESEVKQISQTIASSMLLNQKFTSQALA
DRIKSSYADVVHLATHGQFSSRIEDTFLLTWDGQVNVKELSELLKNRSGD
SSKAIELLVLSACDTATGDDRAVLGLAGLAVKSGARSTIATLWPVKDKAA
EMLMTRFYDQLRKPKITKAEALRQAQINLIHQTDFQDPFFWSAFVLVGNW
L
>Ava_3305 Peptidase M48, Ste24p
MANYRVWRRRWFYPLISVVVAVSLCLGTPLVGRAIDLRPLLLQGVQVLQL
SNISDRQEVDLGNQMNQQLRSGEVSISRNAEITRYVDQIGQRLVATSDRP
NLPFTFQVVENDAVNAFATLGGFVYIHTGLLKTADNEAELASVIAHEIGH
IGGRHLVKQMRQRALASGLASATGLDRNTAVGLGVELALNRPRSRQDEFD
ADTRGLRTLTRAGYAPSAMVSFMQKLSKGGGSVPAFLSTHPATGDRITSL
RRAINSQGSSGRDGLDNAAYRARIRSIL
>Ava_3275 Pentapeptide repeat
MSLINLDLVISAVTSIANPVIKEKILRSETVIKLLQQFNLDPEHPPADFS
GVYAYTLVEYGVGKPKAFLELFRQEAIKQAFRKALGHNNPSILLSEVDAF
LDSYTLGDEIRSLELDVRREVAAFATVFIEVAKRSRTPADVLMSQQIGSL
HKRIASIQEQLERLPTLEGIRTEIARLAAQNYPALAGATTENQCRAIALA
QQMRGWFETLGYRLEKYEIWAEDYFEWIINVPVRRSYDRILVRGVAGEVR
LSDVMALRQSVNQQKTDEGWLVSNRRIARAARDEVKKEENRHLDCFTFDE
LIDLDADFTGYLDWLEAEIKRRKIDQKYVPLACTKEEIDPVTKRRIGVSR
YEAEDGWIDGYIDLWLDDPAKEHISILGEFGTGKTWFVFHYAWTALQRYK
DAQKRGVERPRLPLVITLRDFAKALNVENVLAGFFFTQHNIRLNSEVFDQ
LNRMGKLLLIFDGFDEMAAKIDRQQMINNFWELAKVVVPGSKVILTCRTE
HFPESKEGRALLNAELQASTNKLTGETPQFEVLELEKFNDEQIRQVLLYQ
AEAATVEQVMDNSQLLDLARRPVMTDLILEALPDIESGKPIDMSRVYLYA
VRRKMERDIKAERTFTSLADKLYFLCELSWEMLSTDQMSLNYRLFPERIR
RLFDSVVQEEKDLDHWHYDMMGQTMLVRNADGDYTPAHRSLLEFFVAYKF
AAELGALAEDFTALAQAQSGLDSGATPVDYTWFGYFSRQWNDIIAPLKAF
TRESFEKLRETFGKAPLTKAVIDLLVPMLSNNESLISVIERTRGQSEDAV
GYIGGNAATLVLRIDPLGLEGKDLSGTVIKGADFTNVNLQNVNFFAANLV
NCAFTRTLGAVFSVAFNSDCKLLATGDGNGIVRLLDAATCKEILICKGHG
SIIPCVAFSPSAQILASGSYDQTIKLWSIQTGECLKILQGHVSGIRSIAF
SPSGAILASSGNDNIIRLWNIDTGESLKTLHGHRDHVYSVAFDPSGMILV
SGSGDQTIRIWDINSGKCLKILEGHTNAIRSIALNSTGEIIASSSSDHTI
GLWDIKTGKCLNILRGHTDNVMSVVFNNSDRIIASGGADHTVRLWDVQSG
ECLNVIQGHTNVVRSVAFNSSGQTLASGSYDKTLKIWDINTYECLTTVQG
HTNWISSVAFNPSGRTFASGGNDATIIWDANTGKCLKTLQIHTAWVFSVA
FSSCGKMLASSSADAKVRLWNIDTGECLKILNGHTYWVFSVAFSADGKLL
ASSGSDKTLKVWSIETGQCLTTIHANQGTVHSVAFNPVNRTLANGGFDSQ
VKLWDVNTGECLKILQGHSGTIRSVDFHPGGKILASGSADCTIRLWDVDT
SECVKILQGHSKVVQSIAFSSDGQILATGSEDFTIKLWNIFTGECFQTLW
GHTTWVLSVAFSPDCKTLISGSQDETIKVWDIKTGDCIKTLRSDRFYERM
NITRVKGLISSEIATLKSLGAIEE
>Ava_4503 Serine/Threonine protein kinase and Signal Transduction Histidine Kinase (STHK) with GAF sensor
MSATVDTTVLLSGYQLIEQLYHGSKTLVYRGIRRKESVAQPVVIKLLQRD
YPSFSELLQFRNQYTIAKNLNIPGVVRPTSLEPYGNSYALVMEDFGGVSL
GTYSQTHSLSLVDVLAIALQLADILHELYQHRVIHKDIKPANILIHPESQ
QVKLIDFSIASLLPKENEEIKHPNVLEGTLAYIAPEQTGRMNRGVDYRTD
FYSLGVTLFELLTGQLPFAGNDPLELVHCHIAKPAPRIDEINSEIPEAIA
QIVAKLMSKNAEDRYQSALGLKHDLAICLEQLNQTGQIEQFIIGQRDICD
RFTIPEKLYGRETEVQTLLDAFARVSNGTSELLLVAGFSGIGKTAVVNEV
HKPIVRQRGYFIKGKFDQYNRNIPFSAFVQAFRDLMGQLLSESDIQLKAW
KASILEALGDHAHVIIDVIPELECILGIQPPVAELSGTAAQNRFNLLFQK
FTQVFTTKEHPLVMFLDDLQWADSASLKLMQLLMSEREQGYLLLIGAYRD
NEVFPGHPLMLTLNEVSKVGATINRITLASLSRTSLNQLVADMLKCAEVL
AQPLAGLVYQKTQGNPFFATQFLKTLHQEQFITFNIQAGHWQCDITQIRK
AALTDDVVEFMAKQLQKLPDETQNILKLAACIGNQFDLNTLAIVSEQLPQ
QVASDLWKGLQEGLILPISETYKFFQDNNLNDTNTDGISVTYKFLHDRVQ
QAAYSLIPQEEKQVVHLTIGRHLLNHTSAEQLEEQIFDIVNQLNIGCTLI
SDTLEQARLVELNLRAGRKAIATTAYDTATYCLKYACNLLPTDSWQTQYD
LALACYSSAAEAAYLNTDLVQMETLAEIVLQQARSLLDAAKVYEIKIDAY
TSLGEFSKALTTGLNFLQSFGIEFPTQPNKQDFADFLRQTRQQLAGRSPD
ELLNLPVMTSPQYQALLRMLVQISGATYLASPALYPLICFQQVQLSVAHG
NIPASSFGYVVYGLILCGVVGDIKTGYEFGQLALQLVDKFNNQEYAAKVL
YITSKFTVHWVKNAATTLQPLQEAYTLGLKVGALTYAGYSGYTYAFHAYF
VGKELTALETECQAYSIGLANIEQQAFLGYLHIVHQTVANLLGKCENPTK
LVGTIFDEGVTLANLEKANDKTGLWHFHLCKVTLTYLFECHDQTLDHCQQ
AKLNAGGGSGMLNVPILYFYRALACFTQLSAHSQTDELWALIADAQEKLQ
NWAIYAPMNCQHRYDLICAEEQRVLGHPQQAIDFYDRAISLAQENGYIAE
QAIANELTAKFYLGWGKEKIAAVYMQEAYYCYARWGAKAKTNHLEHRYPH
LLRPILQSVSQPLNVWESLTSTITPDISLHTSHSSLGNSTNINNAIDIAS
MIKASQSLSATLQLDELLHQLTHIILQNSGGDRCALICPNSEGEWQVVAI
ATPETTQLCSELLDGNPNLPVKLIQYVKNTQQIVMIDNLKTDLPVTDEYL
IRQQPKSLLCLPIISHGQLIGIVSLKNRSTSGVFTSDRIFLLNFLCTQAA
ISLENARLYQKAQTYAQKLEQSQLQIVQSEKMASLGNLVAGVAHEINNPI
GFLNGSISNAQEYVQDLLDYIALYQQYHPKAAAPVKTKAKNIDLEYLCED
LPKLLNSMQGASDRIQSISTSLRTFSRADSEYKVMANLHEGIDSTLLILK
YRLKANEYRPAIPVITEYSELPAIKCFPGQLNQVFMNILANAIDMFDEMA
KSQSYKEIEAHPQKITIRTTMEANQVIISIADNGKGMSEDVKVRIFDHLF
TTKGVGKGTGLGLAIARQIVVEKHGGSLEVQSQLGKGSDFCIQLPLL
>Ava_3785 Protein of unknown function DUF81
MNILEFSLLVWLGSFSAGFIGALTGLGGGVVIVPLLTSVFGVDIRYAVGA
SLVSVIATSLGAASTYIKKGYTNLRLGMFLEVSTTIGAIAGAIIATFVSV
KALTIVLAIVLMYSAYLSQRPRLEQVEDDIADPIANYLQLNSTYPTTNGL
MPYHVHAVPAGFSIMLVAGVLSGLLGIGSGGFKVLAMDQAMRLPFKVSTT
TSNFMIGVTAAASAGVYLARGYIDPGLSMPVMLGVLPGAFLGARVLIGAK
TQILRIVFSLVLVVMALKMVYNSLIGGL
>Ava_2896 Small GTP-binding protein domain
MGLPIVAIIGRPNVGKSTLVNRLAGEQTAIVHDEPGVTRDRTYLPAYWSD
REFQVVDTGGLVFNDDTEFLPLIRQQALAALHEASAAIFVVNGQTGPNSA
DEEIAEWLRQQPVPVFLAVNKCESPDQGSIQASEFWELGLGEPYPISAIH
GNGTGELLDELIKHLPPTTELEENNEIKIAIIGRPNVGKSSLLNAFAGEE
RVIVSPISGTTRDAIDTFIERNGQNYRLIDTAGIRKKKSIDYGTEFFSIN
RAFKAIRRADVVLLVIDALDGVTEQDQKLAGRILDEGKACVVVVNKWDAV
EKDSYTIYDYEKNLEARLHFTEWADTIYVSAVTGQRVEKILELVTKANEE
HKRRVSTSVINEVLEDAVSWHSPPTSRGGRQGRIYYGTQVSTQPPTIALF
VNEAKRFNDNYRRYIERQFRQQLGFKGTPIRLLWRSKKVRDVESGSANRA
TRV
>Ava_0020 TPR repeat
MKNLIFIATTFITTISLTNIAQAANQEHIRQLLATKQCQNCNLSGAGLVM
ADLTGANLSGANLAGANLSRANLSGADLRGANLSGAGLFGVNLSEAKLGG
ANLAGADLRSAFLSNAEFTGAYLQGTNFQGALGIPLQIATPDEFYALGVA
EAQKGNQQQAINYFNQAIASKPEYAGAYLARGIARYQLFDRQGATQDAQT
AEKLFTSQDNATGIQTAQAFIKELQTPYTEKVSAGKPSFFDFLGSLGSVL
LQFLPF
>Ava_4901 Pyridoxamine 5'-phosphate oxidase-related, FMN-binding
MTISTNRNQQVQQLKELIADMDYAMLTTVDDDGSLHSRPMYFNGDIDAEG
TLWFFTSASSHKVLEIEHRQQVNVNFSSPNQQRFISISATAELVKERQKI
QARWKSELETWFPQGLDEPDIALLQVNIKRVDYWDDPSNFYAKSMSGLAQ
NI
>Ava_3461 conserved hypothetical protein
MKKILFICSQNRLRSPTAEVVFAEYKGLETDSAGLDHYAEVPVSSEAIEW
ADIIFVMEQLHKQKLAKNFQPFLKNKRVICLDIPDEFEYMEPALIEILKK
KVLPLLGTY
>Ava_3229 conserved hypothetical protein
MNNPIIFALLILTTVGLNTLAQTLLKLGAGQNPLNLYLIGGICCYGLSTI
FYVLVLGKLNLSIAYPLVIGLTIVLTTIAGAVVLREKVLVSQWIGIGLLL
SGISAIALAKPS
>Ava_1642 Protein of unknown function DUF58
MKIIKPITDWLEIRASAPAYTGWVLVGVAVCFFGAAINTMAGWLYAISGV
SFALLGIAAVLPPRSLTGLSVTRYPIQPVSAGDDLTLELEIHNPTKQSVS
LLQVEDLLPFVLGKPIKQGVETIASQDSYRWVHYQPTQNRGIYRWHTVEL
GTGAPLGLFWCRRQRQCDATAIVYPTVLPLTTCPLVDELGQEESQRSEYR
GQPLQTASSGLVRSLRPYRIGDPTRLIHWRTSARYGELRVRELEMVTSGQ
EIIIALDSAGNWSAENFEQAVITAASLYFYAQRQQLQVQIWTASTGLIKG
ERLVLETLAATKFLEEATNCEPPNSPWIWLTQNSLNLSSLPQGSRWVLWQ
DISSHKEQVVINKDYPGIILEQEKPLQLQLQQPLHL
>Ava_0656 Alpha/beta hydrolase fold
MFQAPGFEQRSIITSLGKMAYYTATGSLWQDNKKTDDQETLVFLHGFGGG
SSAYEWSKVYPAFASGYRVLAPDLIGWGESEHPERNYLIEDYLTTIREFF
ERTCTGPVTAIASSLTAAFTIRVAIAHPDLFKSLILVTPAGLSDFGENYS
RSIFAQIVSIPLLDRLLYSTGVASSAGIRSFLEQRQFAQANRVYDEIVEA
YLKSAQQPNAEYAALSFVRGDLCFDLSLYIQQLNTPTAIIWGQKSQFTGP
EIGRRLSEQNPQAIRVFQELENVGLTPQLELPAVTIGLIEKFLPLLNSH
>Ava_3968 Short-chain dehydrogenase/reductase SDR
MSFRNKTIVLTGASAGIGRTLAISLSQQDANLVLAARNSEALEQTMTACT
NYPGKVIAVPTDVTQAEACQQLIEIAIATFGQIDILINNAGIGMLTRFDE
VTDISIFEQVMQVNYLGAVYCTHYALPYLKASQGQLVAISSICGKTGVPT
RTGYVASKHAMQGFFDTLRIELHSTGVDVLVVSPGFVATDIRQRALGADG
KPLGKSPRDETQGNMSVDECVRQIIWAMERRKREHIMTLKGKAIPWAKLI
APGFVDRIVAATIRKTTST
>Ava_4390 conserved hypothetical protein
MSQKSSQSHLSTSQNPYIDNINRSDKWQQRVAQVAYRFNQQYQNQTLELP
AEVQEMPIYREWVSGILAGKIASPFWEIAQPQKNQHCLDIGCGVSFLIYP
WRDWQAFFYGQEVSNVARDTLNSRGSQLNSKLFKGVELGAAHQLKYTSAY
FDLAIATGFSCYFPLEYWTSVLQAVKRVLKPGGHFVFDILNAAHPQAEDW
AVLETYLGAEVFLEPVAEWEKIINAAGAKVVGRQPGEFFDLYKLRF
>Ava_1307 von Willebrand factor, type A
MKVNLQPVLNDANLDAQQPSSQRQLAISISAGAEPQDRTVPLNLCLILDH
SGSMNGRPLEIVKQAAIRLVDRLKTGDRLSVVAFDHRAKVLVPNQVIDNP
EQIKKQINRLAADGGTAIDEGLRLGIEELAKGKKETISQAFLLTDGENEH
GDNNRCLKFAQLAAGYNLTLNTLGFGDNWNQDVLEKIADAGLGSLSYIQK
AEQAVDEFGRLFSRIQTVGLTNAYLLLSLAPNVRLAELKPIAQVAPDTIE
LPLQQETDGRFAVRLGDLMKDVERVILTNIYLGQLPEGKQPIANVQIRYD
NPAQDQTGLFTPNIPVYANVVRAYQPAINPQVQQSILALAKYRQTQLAEA
KLQQGDRVGAATMLQTAAKTALQMGDTGAATVLQTSATQLQAGQDLSESD
RKKTRIVSKTVLQDTPPK
>Ava_3729 TPR-related region
MTQSCNRVKVWEERRNEANIYYKQGKFSEYLDLVTENLQLARAIPDRARE
GHTLNDIGLAYLGCWQPQKALDCFHQALAVAVEIGNIQAEATALSNLGST
CSRLGKLTQALEYFEKAAQIFRELQDTQGEVSTLNDMALIYIRSGKPKRS
LLLQNQILAMRRLLGDLSGEATTLNGIGFAYSVLGEFEKALDYLQQALPI
QKAVKNLAGEAITLNNIASIYLDLGQPKQALLLYHQVLLTRQSMNDFPGE
ATTLNNIGFTYSKLSSHRKALKFYKQALVIYQQLEDSLGEISTLLNMGNL
YVTTKRKKLALLCYRNAQTLAEKISDQTIFDKVKQFMDAI
>Ava_4324 Aldo/keto reductase
MLYRRFGRTELQMPVFSCGGMRYQYKWQDVSYSDIPADQQANLEATINRA
VEVGINHIETARGYGSSEMQLGKILPQFPREKLIVQTKLAPVADAKEFRK
TFEQSLSYLQLDYVDLLGLHGINTAELLEYSVRPGGCLDVVRQLQAEGKV
RFVGFSTHGSTDVIVQAINTNQFDYVNLHWYYINQWNWPAIEAATRHDMG
VFIISPTDKGGRLYNPPKKLVDLCTPLSPMVFNDLFCLSHSQVHTLSLGA
AKPQDFDEHLQTLDLINRASEILPPILARLEDAAIASLGEDWVKTWETNL
PQWEQTPGQVNIRVILWLLNLATTYDLVDYAKMRYNLLGNASHWFPGNKA
DKLNEIDLQPCLRYSPHADKIPQFLAKAHQLLAGEELQRLSQS
>Ava_2603 inner-membrane translocator
MNHLNFLSDYLIASVNLAIPLAFAALGGMYSERSGVLNIALEGMLLTGAF
TSAVTTLYTGNPWIGVFCALIAGGLVGLLHAFLCVTLYVNQLVSGLAINL
VAAGLTSFLARLVFHGSSTQRLPGIEPIIIPGLANIPILGALLFQQDIFV
YLLIISVIVSNYILFHTSLGLTLRAVGEYPKAAATAGVSVSKVQYAAVFI
SGCLASLGGAYLTLVQIKFFTENMSAGKGFIAIAALIFGRWHPLGITLAC
FLFGATEALQLRIQALGANIPYQFLAMLPYAIAFFALVGLAGKSKPPQGN
GVTYSPEHSQDI
>Ava_3459 Short-chain dehydrogenase/reductase SDR
MTKLDGKIAVVTGASKGIGASIAKHLAAEGASVVVNYASSQEGANRVVDE
IVSTGGQAIAVQANVAKKAEIEHLFAQTQQAFGKLDILVNNAGIYEFSPL
EDITEEHFHKQFDLNVLGLILTSQQAVKHFGSAGGSIINISSIVSTLAPA
NASIYSATKAAVDAVTKSLAKELGSRNIRVNSINPGMVDTEGTHTAGITE
SEGRQQTEAQTPLGRIGQPQDIAPAVVFLASSDSGWITGETLYITGGLR
>Ava_3310 Serine/Threonine protein kinase
MEQLLDNRYRVIKTLGSGGFGETFLAEDSQMPSNRRCVVKQLRPIHNNPQ
IYQLVQERFQREAAILEDLGSYSGQIPTLYAYFQSNTQFYVVQEWVEGDT
LTAKLKQQGVLSESAVRDILINLLPVLEYVHSKRIIHRDIKPDNIILRHR
DGKPVLIDFGAVRESMGTVVNSQGNPTSSIVIGTPGYMPSEQAAGRPVYS
SDLYSLGLTAIYLLTGKQPQELETDPHSGEIIWHKYALNISPTLAAVIDR
AIAYHPRERFITAREMLDALQLGVVSPPTVPYQQPQSSPTIPYQQPPVVT
TPPFATQTNTVAVSPGTTPTPQPINHNNSNRGILMGSLIAGGLIGASVVI
GFALTRPNQPVTQTTSLPSETTISNNATPTVEPSPTATPETPINQNVTQN
TTPQASVRFPTNSRPFTTPIDSQPRNNTEPNTSVPQETTPPEPQITTPPE
QTDRPSPEEAVQNYYETINEGEYSTAWNLLAPSFQNNRRLHPRGYDSYLD
WWGGQVESVDVQQVNLVEANADTATVNAQLRYFMKSGRQSSSSVRFSLVW
DADNTRWVVSGAR
>Ava_2163 Protein of unknown function DUF897
MNSSLILSNILNPPVLFFFLGMLAIFCKSDLDIPQPLPKLFSLYLLLAIG
FKGGYEIEESGINPEIALTLLAAIFMASAVPIYSFFVLRIKLDSYNAAAI
AATYGSISAVTFITAQSFLKILNISSGGHMVAALALMESPAIIVGIVLVR
LFTQNKDEEKGEFSWGEVLREAFLNGSVFLLVGSVLVGILTGERGWEKLH
PFTQDIFYGVLAFFLLDMGMVAARRIKDLRRTGSFLIAFSIFMPVANAIF
GIILAKFIGISQGNALLFAVLCASASYIAVPAAMRITVPEANPSLYVSMA
LALTFPFNIIVGIPLYFNIIKLIGI
>Ava_1889 Predicted signal transduction protein containing CBS domains
MPKTVADVMSHNPVVVKPETPLQEAIKILAERRISGLPVVDNDGKLLGII
SETDLMWQETGVTPPAYIMFLDSVIYLQNPAVYERDLHKALGQTVGEVMS
KNPVTVSPEKSVKQAAQLMHDRNVHRLPVLDDAGQVIGILTRGDIIRAMA
NS
>Ava_1998 DRTGG
MPKSAKYLLIGSTETYSGKSATVLGLSHQLQQKGLDIAYGKPLGNSLSTS
EGTLVEEDVQFITHSLNLSANRIAPTILALDEASMQKRLFGEDTTDYSQT
LVEQYLQVPRGDLVVLEGPGDLAEGYLFDLSLLQVADVLDASVLLVSRYS
SLISVESLLSAKQRVGDRLIGVVINDIPPAQLETVENFVRPYLEQQGIAV
MAMLPKNDLLRSVSVGELVKQLNAEVLCRSDRLDLMVESLAIGAMNVNAA
VKYFRKRRNMAVVTGGDRVEIQQAALETSTQCLILTGQLPPPQFILSRAE
ELEIPILSVDLDTLTTVEIVDRTFGQVRVHEPIKVQCIRQLMAENFDSDR
LLSKLGLTPATALP
>Ava_1501 Protein of unknown function DUF839
MAFSRRKFFTIAGATAAGTLAASPLEGLLARAAFGQTTPTTTGYGPLFPG
ADGLLAVPNGFQYRVISRAGDRMIDGNTVPGAHDGMAAFPGPRGTTILVC
NHELSNNSGTQVIVPQGYGYDSASRGGTTNLVIGANRQLIKQFATLGGTI
RNCAGGPTPWGSWITCEENVSTPTSTDGSTKFHGYNFEVPASATGPVQAV
PLKAMGRFNHEAIAVDPATGIVYQTEDSGDSLFYRFIPNVPGQLALGGKL
QALRIGGTPGSSNSVNTATGFNTGQTWTNIDWVDIPEPDPTTDNVRVQGR
NLGAARFARGEGIWYGNGELYFCSTSGGPGSRGQVFRYVPATNTLQLFVQ
SPSASDLRAPDNICVAPFGDLIICEDGAAPNFLRGVTPEGKLYNLVRNNF
QGGSSEFAGACFSPDGQTLFVNMQGPGITFAIWGPWNSRV
>Ava_3934 Dinitrogenase iron-molybdenum cofactor biosynthesis
MKIAFTTSDRIHINSHFGSAKEIDVYEINAEGYQFLETLNFEGELKEDGN
EDKVAPKLAALADCAIVYVVAIGGTAAAKLIKKGVTPVKARSEEEKISEL
LNKLVETLKGNPPPWLRKALQPKTRNFADEIEDEATV
>Ava_3581 Auxin Efflux Carrier
MINLLELYVKLVSLVLVGFILGRKLPSTVPTRLGQFLFWVGVPISIVSFL
RQTDLSGQIWIAPAIAYLAILLGAFFAWLAIKGQTYFSNTTPEKPTQGSL
LLAAMIGNTGYLGFPITLAMVGKEYFAWALFYDLLGSFPGAYALGVALGA
HFGTNLHNHNQFTQVTRAIIINPALWSFGFGLLFRQIAIPEILEYGLDKF
AWSMVALSLVLIGMRLSKLNSLANLPQVGISLGIKMLIVPLILGYTLPLF
GLTGPAAKVIILQTAMPPAFACLVIAETFNLDHNLAVTTIAVGAMLLLLT
LPMWLWLF
>Ava_2655 conserved hypothetical protein
MTSSASFETFESLQADLVELIDRLPTLKNRQLIYNALATIVRLADSDIER
LDWKILSAALADMERGFQLFYDYRHVRKVTIFGSARLLPQAPEYQMAVEF
GRAVTRMGFMVMTGGGGGIMQAGHEGAGRENSFGLNIHLPFEQQANPIIE
GDPKLIHFKYFFTRKLFLLKESDAIALFPGGFGTQDEAFECMTLSQTGKF
GPVPVVLIDHPGGDYWQSWSEYINQHLVKTGLVNPEDPSLYTVTDSLEVA
CDAITRFYQVYHSCRYVGDRLVIRLTQELSDAEVEQLNAEFSDILVQGKI
ERSESLPQENQDETVGLPRLVLSFNQRDLGRLYQMIAVINQMGIPATKEQ
VHPERK
>Ava_4919 ChaB
MLYKSNEDLPLEVRSQLSQSHQDLYRAAFNSAIHWYGEVAKAHHVALSAV
RMQSAMDRTVVLQG
>Ava_1620 Patatin
MINLANTQTVLKFDGIDDYIDFGKNDIGGVFAQGSSCFTVSGWINPHKLT
EKSTSYGTRNVFFARSSDRYSDNFEFGISETGSLDIFIDETISKGIRTFG
NGELTIGQWHFFAIVFNSGQITVYLDDHEYNDSLRGSSLNKATSSVTLGA
TLHKQVYFTGQLANISVWNYPCTQVQIKTHHCGLIVGDEPGLVAYWKLDE
GQGTTVKNKAGKSYQGNFRGNPSWDLAQIPFAAPLSSQDDIQEDVQFEIG
IIAETSISTLTTDLLAATVPLVSNNEDQTIEIQYPEINSEKSEIIANLIN
LPSHEEASKTDQTEVLVNSQQLQTFIQAESPETMNTKSRPRYKILSIDGG
GIRGIIPALLLAEIERRTQEPIFSLFDLIAGTSSGGILALGLTKPRLNSS
EELPLAEYTAEDLVQLFLEYGVEIFYEPLFERLLGPLEDIFLQPKYPSTS
KEEILRQYLGKTPLVNNLKEVFVTSYDIEQRIPVFFTNQLEKQQIESKNS
HNLCGNVSLLDAALATSATPTYFAPHRIVSPENSAIAYTLIDGGVFANNP
AHLAILEAQISSKRKAQTVLNQEDILVVSLGTGSPTSAYPYKEVKNWGLL
QWGRPLLNIVFDGGSGVVSGELEQLFEPSDKEAKSFYYRFQTLLDAELEA
IDNTKLQNTRQLQAIAHKLISEKSQQIDELCELLLG
>Ava_1652 O-methyltransferase, family 3
MSAQTIGLDEQLYDYLLTNSVREPEILWKLRQETANHPNGRMQISPEQGQ
FMRLLVQLLGAKKTLEVGVFTGYSSLSVALALPDDGKIVACDVSEEFTAI
ARRYWQQAGVADKIDLRLAPALLTLDALLADGQAGTFDFAFIDADKENYD
GYYERALQLVRPGGLIAIDNVLWSGRVADPQIQDESTRIIRALNQKLHDD
ERVTLSLVPIGDGLTLALKRGYTD
>Ava_0113 HAD-superfamily hydrolase subfamily IA, variant 3
MKNLCIIFDLDGTLVDSERLCNQAFIDLLLFINESIDSLIYRYRGRKLAL
ILADIEIRYGVKLPVDFEVIYRQKVNELFEFYLQPIPGVPEMLETLEYPI
CVASSAPMAKIRTALNVTNISHYFGDSLFSSYDVGSWKPDPGLFLYAANK
MGFPPEFCVVIEDSDVGIQAAHSAGIYALKYSTEEEAEERTNVFSNMKFL
KKRLDNIYAIKCDRS
>Ava_2275 NUDIX hydrolase
MRGKVSKELNQSGVIPYRERNGKIEILLITTRDRQSWVIPKGGIVNGMTP
PDSAAKEAWEEAGVIGQVDVNELGTYKYRKRGKVYRVKMYLLPVEMISNN
YPEANKRYRRWLDANQAIKLIKKDSLKRILKGFIQTKSHACSSSLEFSQ
>Ava_0153 Serine/Threonine protein kinase
MLGQLLDGRYKITQVLGAGGFGKTYIAEDIKLYNNLCVVKQLQPTANDPI
TLQVARRLFASEAELLHKLGTHDQIPQLLAHFEEHQEFFLVQQFIDGHPL
SDELTPGKRLSEAYTIALLKNILQPLAFVHQNNVIHRDIKPPNLIRRKSD
GKVVLIDFGAVKQIGTQVVNGEGVTKMTVSIGTAGYMPSEQSRGSPRLSS
DVYAVGMIGIQALTGLMPHQLQEDIQTAEIIWRELVQVSPNLADVLDRMV
RYDFRQRYQSAVEALAAVQNLGNAYAPTQTSPSPGSKPTLPVKNHVNTPP
VAEPPLYSPPPAVPAQYRQEVPKNLIAPPPPTTAKSSKVGISFYLQWVLV
NIVGLVGGGIVGAIAYEIFEPLGIIISSQAVAATMGLTIGFIQALVLRRQ
IPVSEKQWVLGTTLGVCISVLVCGDYPEYSLLWIGPIVGSIQWFILKPYV
KQAVWWILVNTLFGWIGGIISGAVLVFLLKNLKDF
>Ava_2685 Alpha/beta hydrolase fold
MPVRQTLSTSDIQLSYLEWNQGKEPLLLLHGLGDHALVWSSLGDDLAAGY
HIVAPDMRGHGESSKPDKDYSFESAIADLEALMNHLGWSSAHIVSHSWTG
KLAVIWARQNPQRLRSMVLVDPIFIWKMPNLFRLTFPLLYRFLSFLKGMG
PFTSYEQAEQEAQQLNQFQGWSPLQQQVFQAGLEQKPDGSWGSKFTIAAR
DGIFDAVLRVPGFTIPLDTPALFVQSEQGLNRLEWQIKPYKTYLKNFRLC
QIPGNHWPFLTAPQTFNQTIAAFLAECQENKSHD
>Ava_3856 ATP-grasp enzyme-like
MAQSLPLSSAPATPSLPSQTKIAAIIQNICTLALLLLALPINATIVFISL
LVFRPQKVKAANPQTILISGGKMTKALQLARSFHAAGHRVVLVETHKYWL
TGHRFSQAVDKFYTVPAPQDNPQAYIQALVDIVKQENIDVYIPVTSPVGS
YYDSLAKPELSHYCEVFHFDADITQMLDDKFALTQKARSLGLSVPKSFKI
TSPEQVINFDFSGETRKYILKSIPYDSVRRLDLTKLPCATPEETAAFVRS
LPITPEKPWIMQEFIPGKEFCTHSTVRNGELRLHCCCESSAFQVNYENVN
NPQITEWVQHFVKELKLTGQISFDFIQAEDGTVYAIECNPRTHSAITTFY
DHPQVAEAYLSQAPTTETIQPLTTSKPTYWTYHEVWRLTGIRSFTQLQRW
LGNIWRGTDAIYQPDDPLPFLMVHHWQIPLLLLNNLRRLKGWTRIDFNIG
KLVELGGD
>Ava_3079 Peptidase C14, caspase catalytic subunit p20
MPRAVATSSLNQSKKAVVSKLWLLLVGVNQYHDKQLPSLRYSAVDCQVLA
EALICATQEQFSQQEVNIFHDFAAELPILSHIRHSLQQITASAQHTDTIL
FYFSGHGMLQANTQQAYLCLADTQNDDLENTGLSVQELLQYLSNSGVQNQ
LIWLDACHSGGMTLRSLNSTPHLLDILQQRAAKIKGFYALLSCDTDQQSW
EFPELGHGVFTYYLMRGLRGNAADNQGIISADGLYRYVYYQTLQYIDKIN
QQLRLINQQKRGKGDTELYSEYPLQTPKRIVEGIGEIIIGRMQAIAELIP
PRKALVIEGIIGSQTALDFSKVLGQAGTFELEYLTHSATTTAQNIRETIK
DFLRSQNQLKQYTSEAPATVLLYLRGRLEETPTGEAALVLFHDIWLSRSW
LRQQLRRSCVTQQIIILDCPVRTHSLSLQDWVEDLQLSSDTGQCIIAAAC
PQNNPEIFAQALIATLKTASAPVGLSVAAWINQLQIHLAAQTPINPSIPL
HIWLSGTQGVMEIIPSSNSAYNNQKPTILDLKICPYRGLRAFSEEDAQYF
YGRESLTQQLISQLAHKSFLAVVGASGSGKSSVVQAGLIAQLRPGKQLPG
SDSWLIKTIRPGVRPLEALARRLGEVKAGGVEEQHSPGEKIYLVPQSPLR
GSPVAYGGNPQDRAGSPVPNPQSPILHLEAMLYQGVEGFVYWLRSRPEPM
VVLVVDQFEELFTLAPSEDRQRFLELLLGAVEYASDKFKVVITVRADFIS
ACLEVPALAHLLQQSNILVPPNLSDDDYRRVIVEPAEQVGLKVEAGLVEV
LLRELNHSAGDLPLLEFVLEQLWEHRQAGELTLSAYQQQIGGIKGALERK
AQEVYESLDSQAQECARWIFLSLTQLGEGTEDTRRRVLKSELVVKKYADD
LIERTLLALTSAKLVVVNLEEESKVEAGQSRTSSSLSTPYYPLSLEGTPP
SPITIEVAHEILIRHWSTLRWWLEENRSRLRSQRQIEQAAALWKHHHAQP
DFLLQGIRLAEAEEIYIKYTDELSQDVQNFIAACLAARQQQQLEQKKRLR
QAQRAVAIISILGIAATSFGGFAYVQKRAAQLREIAALNASSEALLLSNQ
QLEAIIASVKAGKELKQVFAPEKDVQIATVATFQQAIANTQEINRLQSHA
QQVNAVSFSPDGKVLASASDDRTVKLWDIHGQLITTIAASQKRVTAIAVS
RNGKYFAIANADYTIKLYAFDTSCLTLKSLQKCIQLIKTFPGHTNIVTDV
VFSPDSKTIASSSLDKTIKIWRFDGSIINTWNAHNSWVNSIDFRPDGKII
VSGGEDNLVQLWQVTNGQLIKTLAGHKERITSVKFSPDSKILASASGDKT
IKFWHTEGKFLKTIAAHNQQVNSINFSSDSKILVSAGADSTIKVWKIDGT
LIKTIPGRGEQIRDVTFSPDNKFIASASNDKTVRIWQLNYQESKTSNVNS
ISFNPDGTTFASAGWDGNITIWQREKLARSSLSKIQTNQNIITTISYSHD
GKTIATASADNTIKLWNSKTQQLIKTLTGHKDRVTSLSFHPDNQTIASGS
ADKTIKIWQINNGQLLRTLTGHNDEVISIDYSPDGQFLASGSADNTVKIW
QTDGTLIKNLTGHGLAIASVKFSPDSQTLASASWDNTIKLWQVTDGKLIN
NLSAHTDGVTSLSFSPDGEILASGSADNTIKLWNLPHATLLKTLLGHPGK
INTLAFSPDGKTLLSGGEDAGVMVWNLDLDDLMQQGCDRITDYLQHNSNV
SAGDRPICQN
>Ava_1103 von Willebrand factor, type A
MNDTLTLDEVVEFAENPEPRCPCVLLLDTSGSMQGAAIEALNQGLLSLKD
ELMKNSIAARRVEIAIITFDSHINVIQDFVTADQFNPPILTAQGLTSMGA
GIHKALDMVQERKSLYRANGVAYYRPWVFMITDGEPQGELDHLVEQAALR
LQGDEVNKRVAFFSVGVENANMTRLNQIAVRTPLKLKGLNFIEMFVWLSA
SMSAVSHSQIDEQVALPPIGWGSI
>Ava_0285 Cobalamin synthesis protein/P47K
MMADVITDSVPVTVLTGYLGAGKTTLLNHILTYEHGKKVAVIVNEFGEVG
IDNQLVIDADEEIFEMNNGCICCTVRGDLIRIIGNLMKRRDKFDHLVIET
TGLADPAPVIQTFFVDEDMQSQLSLDAVVTLVDAKHIWQHWDADEAQEQI
AFADVILLNKTDLVTLSELDELEKRIRSMNAIAKIYRTRNSELAMDALLG
VKAFDLDRALEIDPNFLGEDAHEHDDTVSSVALVQEGELDGEKLNAWISE
LLRTQGPDIFRMKGILNIAGEDNRFVFQGVHMIFDGRPDRLWKPNEKRKN
ELVFIGRNLDEAQLKQDFLACFA
>Ava_1974 conserved hypothetical protein
MNKKTWLQIPAFRSRNYRLFFAGQGISLIGTWMTQLATVWLVYSLTNSPL
MLGVVGFTSQIPSFFLAPFGGVFVDRFSRYRTLIGTQVLAMFQSLTLAAL
MFTGVIQIWHIIALSLLQGMINALDAPARQAFVPELVQRREDIANAIAIN
STMINGARLIGPAIGGLLISWVGVKYCFLIDGLSYIAVIASLLAMKVKPW
TVTRIDGNPLQQVKEGFIYAFSFPPIRAILLLSTLVSLMGLQNTILVPIF
AETILKGGAESLGFIMAASGLGALSGGIYLASKKTILGIGKLIAIAPAIL
GFGLIAFAISRYLPLSLFTMLFVGLGTILQIAASNTFLQTIVEEDKRGRL
MSLYTMSFLGMIPVGNLLGGALANRIGAPNTLIIDGIACIIGSILFQREL
PKLRKLIMPIYEQKGIVTVENKSA
>Ava_4030 conserved hypothetical protein
MIVISDTSAITNLAAIDQLRLLPLLYKQVIIPEAVYRELVDIDPPVPGTV
EVQTATWLEVKLIANREFVERLQSEVRLDPGESEAITLALELQADLLLID
ERRGRAEADRLGIKITGLLGILVEAKRKNLILAVQPLMDALIATSEFRVS
SALYNQILIIVNET
>Ava_1444 TPR repeat
MALTHPTKMSTVLREIAYCEHQIGEVDRASNYYEQALNLCPAEDERELGA
IYDYLGMLKDDKGEVDQAIALYNQSLEIKERIGNVQGKAATLHQLGILHA
DKGEVAQAIALYHQSLEITEHIGNVQGKAITLWCLGHLAEQQGEYAKAIS
YLQPALEILQRLQSPHAEDVRVSLERVMGNS
>Ava_4422 Pirin-like
MSQNTINNLIHDRNARGRSQTGWLDSYHTFSFSSFYDPNRMGFRSLRVIN
DDRIAPGAGFPTHGHRDMEILTYVLSGAVEHKDSLGTGSVIRPGDVQIMS
AGTGIQHSEFNHSRTEALHLLQIWILPDEQGLAPRYQQKAFTPEEKRGQL
RLVAAKDGRDGAVTIHQNVDIYASILKPGDVVNYHVKGDRYAWLQIAQGV
ATLNGEELRAGDGVQINTEEQLKISTSVGTELLLFDLA
>Ava_2764 Peptidase U62, modulator of DNA gyrase
MPNINEIATYAQDNAQKLGIQKFDIYGSTVDETSVQVDQGEPKQVKASNR
SGVTVRVWNEDNTIGITSTTDVDPKGLELALKTAYEASFFGVKENVPDFS
PEATVPLANTLNEKAPQAPVSELIEKLLVAEKELLAAHPAIKGVPYNGLA
QRDIDRFYLNSQGALRTESVSLSSVYLYSKTEEEGKKPRSAGAYRINRSL
ENLDINGCIQETAEKTISHLNYDKIKTGKYRVIFSPEAFLSLLGAFSNLF
NAQNILDKQSLSTPDDLGKQIASPLLSVYDDALHPANIGAESFDGEGTPT
RQIRLIENGILTGFLHSAGTAKRLNTQPTGNASIGAKVSVSPNFYHVFAS
GNSEEKFSLETADNVILIDDLHALHAGVKSLQGSFSLPFDGWLINKGVKT
SIESATVAGDFLELLKSIIYVEPEVALTPGGVAPKIWVEELSITGE
>Ava_2117 Beta-lactamase-like
MSRIENQFTVQFWGVRGSIPCPGPHTVRYGGNTPCVEMQVAGKRLIFDGG
TGLHVLGQSLLRQMPIEAYLFFTHSHWDHMQGFPFFVPGFVKGNNFHIYG
AIAPDGSTVEQRLNDQMLHPNFPVPLQIMQANLHFHDVQPGLPIHINDIT
VETAALNHPGEAVGYRINWRGGAAVYITDTEHFPDKLDENVLKLAKNADI
LIYDSTYTDEEYHSAKSPRIGWGHSTWQEAVKIAKAANVKTLVIFHHDPA
HDDDFLDQIGAQAFAQFSGAIMAREGMVLQVPVSAPLSESFPVSNVSA
>Ava_3799 Radical SAM
MTTTPSTTINTKIIPFNSQLNAPLTKQEINVLQINLGKRCNLACTHCHVE
ASPKRTEELSAEICEQLIEVIHRFPQIKIVDLTGGAPEMNYGFQPLVAAA
RQHNKQVIVRSNLTIYFVEGFKYLPEYFAANKIRIVASMPCYLADNVDKM
RGVGVFDDSIKALQWLNQVGYGIDPSLVLDLVYNPQLPTSEKFSLTPEQS
KLEQDYKTFLQKHFRIEFNNLFTITNLPVGRTKLYLERKKLETSYLHFLE
LHFNPHTVGNLMCRNQLSVDYLGNIYDCDFNQMMNLPAKTHNGETLTVAK
LIEAGSLDLINEIQTANYCYGCTAGCGSSCGGALV
>Ava_1960 Peptidases M20 and M28
MMKKRIWLTLLVLVVVVIVGSRSSAFFEQRSSPEIVESAPVATPPPQPEL
QEVKDELQVSVDKLLAHIQQLNFQRYTKTERSRTRTYIINELRKSGWTPK
LEKFSGGVNVFAERPGTDNTGDAILVAAHYDTVVGSPGADDNASGVAVIL
EIARLFASHPTPRTLQLAFFDLEEAGLVGSKAFVTNTQRLEKLRGVIVMD
MVGYACYTAGCQQYPPGLPVTPPSDKGDFLVAVGDIENLSLLKAFNHADT
KNLPSVLTIPIPLKGLLTPDTLRSDHAPFWYQGIGAVLVTDTANLRTPHY
HQPTDTPSNIEQAFFVGAAQIVVNAVNTLVNSQ
>Ava_3611 Peptidase C14, caspase catalytic subunit p20
MANYWAIAIGVNQYQLFQPLRCAQADAEALKDFLVHQAGFVNQRCLLMTD
TSPPIDDRDTYPTKENILLLLEDLAAACWQPEDHLWFFFSGYGVNYNGRD
YLMPTEGDPKLVEETGIEVRSLMQSLQLANLNVLLLFDINRASASFGDTP
VGKEIIELAEELQLATILSCQPDQFSQESRELGHGIFTGALLSALRSGYG
SNLGELEKYLSVLTPELSQHYWRPTQNPVAFIPFDEQEILPPVATTNNPE
VELSAAVPSTLEPSEAEPLIFSEESFAVALTAPSLGEPPRTTSKPPKFGK
WEDSPKFETNGNGKSPTITLDRQPFPINTDFPQPYQPPIPDEEETGGRFI
PNAPQAYISRLPSNKPEPPLWRQFLLWGGGTMVVVALISIILLRNQARVL
RARQQLPTATSDSQIIKTPSTPRPVAKLAPLNRPKNQSTAQIAPISESKK
RNQAVLDLAKMSLRQTQASDLSLAIATAKKIKPGEPLYEQAQENIKIWSR
MILDLAEGRAKQRQYTNAIAAAKLIPKDEALYPQAQTTIAQWRSEAKQFL
ANQTLIDAANALIKQGQASTYNRAIEVAKRVPQGQPGFDLAQISINQWSQ
KILDLAKNRADQENFSAAIATATLVPEGTTVYEDAQEAIQKWEARKKSQ
>Ava_2037 Alpha/beta hydrolase fold
MTVHWRERVGNQRDWVWRGWRTRYTYIRPSQEGQEKTPLILLHGFGASIG
HWRHNLEVLGESHTVYALDMLGFGGSEKAPANYSIELWVEQVYDFWRAFI
RQPVVLIGNSNGSLISLAAAAAHPDMVKGIVMMSLPDPSLEQEMIPPFLR
PVVTTIKNIVASPIFLKPVFYFVRRPSILRRWAGIAYANPAAITDELVDI
LAGPPQDRGSARAFSALFKAAIGVNFSPSVKAILPTLQIPMLLIWGNKDR
FVPPILANQFAQYNEKLQLLNLDDVGHCPHDECPEQVNKAILAWMDKSLG
DY
>Ava_0651 Protein of unknown function DUF861
MEIKIEHQPSPEILQKLGVFQWGIWQKEVSKFPWTYDTQETCYFLEGDVI
VTPHGGQPVQMGKGDLVTFPVGMSCIWEIKSGVKKHYSFN
>Ava_2222 PilT protein-like
MLDAIIIFSFILAAAGIGFYSTELLPNGTLDRVTNLEALRLTVAVFAAII
GGAVGLSFQTTYRRLEAQVRELPLEVILTRAIGLVIGLLLANLMLAPLFL
LPIPTDFSFIKPLVAVVGSIILSVTGMNLADTHGRGLLRFINPNTVETMV
AEGTLKPANTKVLDTSCIIDGRIEALLETGFLEGQIIVPQFVLLELQQVA
DASKDQKRVRGRRGLEILNRIKEAYPDRILINPSDYEDIATVDAKLVRFA
QEINGTLLTNDYNLSKVASVQKVPVLNVNDLVNAVRPSYLPGDNLDLKIL
KEGKEPTQGIGYLDDGTMVVVEEGRAYVGGELRVVVTSALQTSAGRMIFA
KPQASALA
>Ava_4663 conserved hypothetical protein
MFQYRFKWSFSSDPNLSDRLFELIEVIFPGLNDLVERGRKLGASWESAST
PFIRFHDDVAITHVGVLEIPMVIMGQRVTVGGIHGVATRPEFRRKGYYRE
VIEEVLEYCDQIYETLILTTPEPEYHLPFGFRVVEEYIFHLKCSSKGNVN
GWRILDFSDNQDLALLHRLLETRAPVSHVVGVVNEKPVFFVNEGSRDLYY
AEDLDLIACIKIENNRLHIFDLVATKICSLKEILGRTSEVIEEVKIYFSP
DLLDVDNVQAFPYKLEDTVLMIRGQFAAVGEKFMLPRSARC
>Ava_3124 Cl-channel, voltage gated
MLFLAVLIGGGTGMGVVTFHYLIELIHHLMLENLMGAIGVWGAWTLALVP
ILGGLIVGLMRWRTQDFGPGLSSLIAASEGSEIKGQLRPVTKMLAAAVSL
GSGASLGPEGPSVEIGANFGMLLSLILQVSQERQRLLLGAGAAAGLAAGF
NAPIAGVFFALEVVMGATSFATSAVSVVLLAAVVAALIAQIGLGAQPAFA
LPVYQVRSPLELPLYLGLGLGASLVSVAYKQSINWGKACFVGSIPGFQFL
GKIPQPIHPIIGGFMIGIVALKFPQILGIGYGTVQAMLQDVKFSLDLLLI
LLVLKLLMTAISAGSGFVGGLFAPAMFLGASLGSAYAKVLTLIAPGIGEY
MAAPPAYAMVGMAAVLAGSVRAPLTSILMLFELTRDYRIVLPLMAAVGLS
VWLVERIKPNTNSHTNLQQIGLSIPKDQKVEILQQILVEDAMLACPKKLP
ATLGILEAAREMISDRTRSALVIDDAEQLVGIISLEDLNRTLSLWQNYPN
SASEIQSNLTNQSIIDICTKEILYAWRDEPLSEALDRMEVRGLHQLPVVA
RDNHDHILGLLDKEQIALTCNLAVTRQTLYRLVSSHQSLVSEKINN
>Ava_0184 Major facilitator superfamily MFS_1
MDSVPIETATSLNSEILQITPPETTLTLANQSNIRIPKEAIRTSLKASTV
DSVFAAVYSLGTGGILLSNFLVELDASPIVFGMLSSIPMLVNLIQPLGAY
LSELTTSRFRYSMCIFGTARFLWLILFIGVILFTRGNVDSQQLEVLTLSI
LLITNLLGGLGSASWLSWLAMIVPRRLRGRYFGLRNSAASLTNLICVPLA
GLLVSHWYGGTIQGYGVVLLISILFGFLSLGCQYFKIDVNPQLQHEVVQY
PQKNEVSNTSASTLQLTIIPPQPESITIWKNSNFLMFLVYFGLWMLAVNI
SAPFFNLYMLDTLNLDVSWVTIYGSLLAGANLLMLILWGKLADKIGNRRI
LIYIGILVALTPLLWLGIGINTLDIWLWLPLLHILLGGTAAAIDLCNNNM
QLGIAPLRNQSIYFAIASAVAGVSGAVGTTIGSFITQFTQYGGLLAVFVL
STALRLLALIPLFFIQEPGK
>Ava_3582 Phosphoribosyltransferase
MPDLYVSWSEYHYKIEQLAAHIYQSGWEFNQIVCLARGGLRVGDIISRIY
HKPLAILATSSYSGAGKQERGYLTVSRHLTMTTESLGSHILLIDDLVDSG
ITLQETIPWLKQYSDSPIEEIRTAVVWYKACSAIAPNYYVDYLPDNPWIH
QPFEHYEHINPADLAARVGQYC
>Ava_0128 RNA-binding region RNP-1
MSIYVGNLSYEVTQEDISNVFAEYGSVKRVVLPTDRETGRLRGFAFVEMG
SDAEETAAIEGLDGAEWMGRDLKVNKAKPKEDRGSFGGGNRGGYGGGGGR
SRY
>Ava_0398 Histidine triad (HIT) protein
MKQQKNQFSHLTAIERTYLSFPAQFLINQNLLQGQILDFGCGFGNDVKLL
QHKGFDIRGYDPYYFPQYPENKFDTIICLYVLNVLFPEDQANILMDIAYL
LKPGGKAYYVVRRDIKREGFREHYIHKKPTYQCLVKLPFRSIHLDESREI
YEYTHYNNQRHSSNYCIFCNPHKNLKLLTESATAYAIFDGYPISKGHTLV
IPKRHVSDYFELPQKEQSACWLMVNKAQEFLKAEFSPDGFNIGMNINRAA
GQNIMHASIHIIPRYQGDAIGAKSGIRNVIPQRK
>Ava_2394 conserved hypothetical protein
MLDLTATAQWHQRAEKFFLSGNYIQAANIYEEAIAAEPEIKLHYCYLGLM
LLLLGQDAEAKETWLLAVKEEESEQFEAFKTELFKTLQTEAERREDIEEY
SVAKKIRLILRELYPHDIHNLLHLVQLGIKLETYQGENLHNLGVIEILKS
EPKVEVNLESLMTTVKSVLDYAPAHPSTLDLVEACLPYCQNNPPVLLIIV
LSSLAEVLQSERLPLVAAKLCELCLELSPNNPKFVLHLADFYQDAGQYKK
SIESAKLFYSLVQRLVNKVYATCLILRGLMSANGYCEESLSVYQRHELLV
KSLIEENPTNLDSSTTIRVSTTNFFAPYIEDNPRKNRHIQNQLLQVCQVN
INNYSQEQIEKYKRGHIKRKRQNRTNEKINIGYLSSCLKIHSVGWLARWL
FQYLNQDKFNINAYFVNTNPKSDPLHEWYLKQVGKVYKSNNSFEIAEQIY
QDEIDILVDLDSITLNLICEVIGLKPAPIQVTWLGWDASGSPAVDYFIAD
PYVLPEYAQEYYQEKIWRLPQTYIAVDGFEVSVPTIRREQLDIPRDAIVY
FSGQRGFKRHPHTTRLQLKIIKEVPNSYFLIKGVSEEEGIKKFFEQLAEE
EGVELSKLRFLPIDSTEPIHRANLEIVDIVLDTYPYNGATTTLETLWMCI
PMVTRVGEQFAARNSYTMMMNAGIPEGIAWTDEEYVDWGVRLGKDEVLRQ
QIVWKLKQSRKTSPLWNGKQFTREMENAYEQMWLRYLEEA
>Ava_3295 TPR repeat
MRLPFYKYRIVALLSLILLGECLTPANATIPAIPKLLAQYSLPTAPTLLN
QGLQAIQAGRIQDAIAAFQSAIQLDPNLAAAHYNLGLALRQTGQLQPAAD
AFYRATQSDPNFALAFANLGGSLLEGNNLQQANDYLQRALELEPRLGFAH
YNLGLVRQQQQNWEGAIASFQKAVELSKNAPEPHYYLGLCYLQLGKLDEA
KNAFNQAIKINPRYSEAHYNLGVILFNQGNSQEALIAFRNSAEANPNYPN
AYYGAGLVFTQLNQYSEAAKVFNHARNLYNTQGNPQWAKNSEQLLQQVQN
LNSVPRGGNNN
>Ava_0974 RNA-binding region RNP-1
MSIYVGNLSYDVTEESLNAVFAEYGSVKRVQLPVDRETGRVRGFGFVEMG
SDAEETAAIEALDGAEWMGRDLKVNKAKPREDRGGSRGSFGGNRSNNNFR
NRY
>Ava_4681 Serine/Threonine protein kinase
MVWNPGRHLFGSRYIIERKLGEGGIGITYLAKNPQGKLRVIKTLREEILN
HPTWIPHQSRLKQDFKEEALRLALCRHPHIVEVENVFDDGDLPCMAMEYI
EGEDLGKRITEKGALPEAEALQYIRQIGDALMLVHDKGLLHRDLKPSNIM
MRAGKPGQKQC
>Ava_2515 Protein of unknown function DUF901
MPRSPIPAVLLVDGYNIIGAWPCLKKTRDKNGLEAARGQLVEAMTSYSSY
QGYDTQIVFDAHYQNTCSNKEIITELVSVYYTDFGQTADTYIEKICASLR
PEVSQARISRVIVATSDRAQQLTVQGYGAEWLSAYQLCGEVETTVCRMRQ
KYQSRKQSKSRFLASTIDPQARQKLAELRMGI
>Ava_2218 Peptidase M20D, amidohydrolase
MVSTFPNSASVDLSRVRLAIRSLQPQLVEWRRRLHQKPELAFQEKITAAF
VSSKLQAWGIEHQTSIAQTGIVATIKGEKPSTQVLAIRADMDALPIQELN
EVPYCSQHNGVMHACGHDGHTAIALGTAYYLQQHRQNFAGTVKIIFQPAE
EGPGGAKPMIEAGVLKNPDVDAIIGLHLWNNLPLGTVGVRSGPLMAAVEL
FDCTIFGKGGHGAIPHQTVDSVVVAAQIVTALQTIVARNVNPIDSAVVTV
GALHGGTTHNVIADTATMKGTVRYFNPAFQGFFPQRIEQVIAGICQSHGA
KYDFKYTELYPPVINDQAIAQLVRSVAAEVIETPIGIVPECQTMGGEDMS
FFLQEVSGCYFFLGSANPDKDLAYPHHHPRFDFDETALAMGVEIFVRCVE
KFFNE
>Ava_4143 Amidohydrolase 2
MTEYSRLKSSRSAAIKAKLNYPIIDTDVHTNDFTPALEDYIAQYGGSKLV
DELRKAESSRLNSKSNGKDWYQQTPEERQYNRTIRSPWWARVTRNTLDLA
TYTLPELFYERQAEQGSDYSVLFPNNVLAPAGASKENRQALQRAVNHYHA
DLYRKYSDRLTVVAGIPMGTPEEAVEELEFAVKTLGLKVANIPGGVKRPI
KAIADKYPADQYPEVAKYASYIDFYGLDSEYDYDPFWAKAVELGVPITTH
YGSQGWTGRSSISNYMNNHIGHFADGSQAFAKALFFGGVTKRFPQLRVAM
LEGGADWGAHVYIHLVDRFSKRSLKGLQNYNPDLTNFDELYALFERFGSE
FLQAHPLSKEELKKTVLGSSFNRHSRSPIGSELEDFAAAGIETIEDIRDR
WVNSFFFGSESDDRTIAAAFNDKANPLGVKINAIYSSDVGHWDVPDLTDP
LAESWDLVQEGVISEADFKAYVFANPYKFYTQANPDFFKGTAVESKVPAP
QVDKSLVVA
>Ava_0369 Exopolysaccharide synthesis, ExoD
MAKLSNELQRHFFEEERPEKVTLADILLLAGERIFGFLLVILSLPSALPV
PAPGYSTPFGVLIFLLAVQLIAGAKSPWLPQKMMNHPIELQTVQKFLKAG
IPWLKRIEAIARPRLSYICTTLAGRITIGIAIALMAISMMIPIPGTNTLP
AMGIFVTGFGLLEDDGAISLGGLVLCVMGAILTTSILMALAWGGSSLLDI
IKTWLGR
>Ava_3378 FAD-dependent pyridine nucleotide-disulphide oxidoreductase
MAHIVIVGAGLGGLPTAYELRHILPKQHQVTVISETPDFTFIPSLPWVAM
GLTSLESIQVSLQPRLKQKGINWILGRVDYLNPQDQKISFGEQSITYDYL
IIATGAELALDAVAGLGPDGYTQSVCNPHHAIKAFQAWQNFLLAPGPLVV
GALPKTSCLGPAYEFTLLADYVLRKKGLREQVSITFVTPEPYAGHLGIGG
MANSAELVTKFMAERGVEVIENAAVTAIEPNQIHLGNGRVLPFAYSMLLP
PFRGPRFVRQVPGLSNQDGFIPVLPTYRHPEYASIYAVGVVVEIKPPEVT
HIPLGVPKTGQMTEAMGMAVAHNIAIELGVFSAPPVTPTLDAICFADFGN
SGILFLANPVLPDVATGKRRRAVALSGTWVTWAKALFERYFLAKMRFGTA
VPWFEKLALKLLGLSLVAPLAVKRIIKEE
>Ava_4252 Dinitrogenase iron-molybdenum cofactor biosynthesis
MKIAFATNDQVHINAHFGWANKIDVYEVSPDGYQFLNTLRFEGDLKEDGN
EDKLVPKIEALADCTIVYVSAIGGNAAARLIKKRITPIKARSEDDKITDL
LEQLVKTLKGSPLPWLRKALQQKSSSFVEEEVGV
>Ava_2682 ABC transporter-like
MSIITVENLSKSYPVAVKEPGIGGTITHLFRRTYRSIQAVQDVSFEIAPG
EVVGFLGPNGAGKTTTLKMLTGLIHPSHGVVRVAGHVPFLRQEAFLQKIT
LVMGQKQQLLWDLPALDSLRINAAVYDISDKEFQRRVGELTEMLSLEGKL
TQPVRKLSLGERMKAELLAALLHRPHVLFLDEPTLGLDVNAQVAVRDFLR
EYNQRYQATVLLTSHYMADITALCERVLLIHQGKLMYDGSLDGLLESFAP
YREVRVELAQPLPLETLKSYGDVQLLEGRAVCFIVQQEVLTRTVSKILTE
LEVIDLTVTEPPVEEVIGRVFQAGVV
>Ava_2640 putative hydrolase
MFTNFEQTIVDTTAAQINLVKGGQGAPLLLLHGYPQTHVMWHKIAPLLAE
NFTVVATDLRGYGDSSTPTSTPNHINYSKRVMAQDQVEVMSKLGYEEFYV
VGHDRGARVAHRLALDYPHRVKKLALLDIAPTHKMYRTTDQEFATAYYHW
FFLIQPDNLPETLIGANPEYYLRNCLEKWGKDFSAFHPQRLFIK
>Ava_1155 metal-dependent hydrolase-like
MITPTCDKPNWKDAQELKNAVREWCDRIQVTVKQVQLRPMKRKWASISHT
GRLTLNTGLLNLPTELGEFVIVHELVHLSVPNHGKLFKCLMSAYLPDWQE
KEQKLQIYQNM
>Ava_3867 Serine/Threonine protein kinase with WD40 repeats
MTILNLLQNRYRLINLMSRGGFCQTYFAVDEGISPPVYCVVQKFSGNGKI
SDIFGQKAKYIKNLGQNPQLPTLLAYFQDQDNYYLVQEFIAGASLAQVVQ
EEGAFPETQIWQLLTNLLPILKWMSDRQLIHGNVKPANIIRRPTPTSTNL
VLVDFGHVQIVSRTDQVSDLQGVGSAEYAAPEQIKGKAVFASDLYSLGVT
CIHLLTMVSPFDLFDIPNDCWVWQQYLTTKISDRLSKILNKLIQKSVDQR
FQSADAVMQVMGIEGKILHYPPPLSPWQCLNTLTGDYCTNSLAISPDGNT
LASGGDDKIIRLWELNTQKLVASFSGHSQAVTSVTFSPQGEILATASDDK
TVKLWHLPTSREVFTLNGHTKPVKSVSFSPNGQILASGSWDKQVKLWDVT
TGKEISALKAHQLQVSAVAFSPQEEILASASFDRTIRLWQITQNHPRYTL
LKTLSGHTRAVLAIAFSPDGKILATGSDDNTIKLWDINTGQLIDTLLVHS
WSVVAVTFTADNKTLISASWDKTIKLWKVSTTEEIVTLASHLDSVCAIAV
NPVAQMIASSSRDKTIKLWQLVIQQN
>Ava_0145 Peptidase M61
MTEATAPRIDIGVQDTVPTINYLVAMPQPETHLFEVSLQIVNYSSPILDL
RMPVWTPGSYLVREYAKNLQDFTAFAGDKILPWRKISKNHWQINKGDVSE
VTVRYRIFANELSVRTNHLDATHGYFNGAALFFRLPGWENLPICVTVIPP
YPQWRVTTPLPTIGEQSNTFYAADFDTLVDSPFEIGEHQLYQFEVLGKPH
ELAIWGQGNCQVQQLISDTQKIIQVEAQMFGGLPYERYVFLLHLFAQAYG
GLEHKNSCSLIYQRFGFRSQDKYERFIQLLAHEFFHLWNVKRIRPQALEV
FNYDQENYTPSLWFCEGTTSYYDLLIPLRAGIYDAKTYLNYWSKEISRLL
TTPGRKVQTLSESSFDAWIKLYRPDTNTGNSQVSYYLKGEMISLLLDLLI
RARYRNQRSLDDVMRQMWQKFGQAEIGYTPEQLQAVIESVAGVDLTDFFA
RYIDGTEELPFNQYLEPFGLHLLAEREEEPYLGVKVNTENGREMIKFVEV
GSPAQMAGIDAGDELLAIAGIKITAHQLSDRLKDYQANDTIQVTVFHQDE
LRTYPVTLGTPRPTKYQLLAIQNPNATQQENFAAWLGAAISTVS
>Ava_3221 conserved hypothetical protein
MSVNDTGHSDNSAWNRQVYHRLKLALSLGLRRQLFLAVCDDLHLRNQVAA
RLHSTLAYPVGQVLYQPSDGQEASTSAYPRLVTLRLNLNEPNPIAQINQW
LANYPPPIVGASKDNPGKPLPIPTFQLVGVEQLTKQTVASQRLFLHYLRL
SEQFFAAQESSRFLESSVLLWVSRPWLSAIQQSAPQFWRWRTGVFVFAGE
PTPTTQNSGHPERFSNSRSVKLGNLEQSVLDELSMEGEVRSPSTTEFNFG
DEVDLPSELPGNHPQPKEETPPLISELPPTQQKPQELTVRSNAANSSSLS
SLSHVSQELTELVLATINTKISSEAEDDWQPQELLLEIEELHSQSVSGEI
LAAAYHQLGNLYRLRIERGQSTLEDLMVAIIAYQESISYDENSPQLPDIL
NDLGTLYWMLYRTPPNLEEGQTYIEQGIEFYELALKIISPETHPDTYARV
HNNLGTAYGDLARFANPAENWQQAVVSYDEALRHRTVELDPLKYAACQNN
LGTAYWHLAQYNQPVVHLKKAIASYKQSLAHYNPQDEPLKYGMIQNNVGT
AYWNLAQYEQPGENLQLAIDVYREALKYRTAAMVPNACAATHNNLGTAYW
HLANLPQTTKDIRQKLLTLCINAYEEAIALAHSLSSVSLSFDLFATHNNL
GLAHYQLVTDNYFSGDKGRRSHHLEAALDNHLQALNGLSKQPEAYQATFA
YVVKTIRAFHNELGFQGQNLALSKVPGHLLPEILPKL
>Ava_4758 Short-chain dehydrogenase/reductase SDR
MPTTALIVGAGSGLSASIARLFAKEGFKVALAARQIDKLEQLSSEIDAVS
FVADAVKPDEVKQLFIDVDHKLGSPGVVVYNPSWRVRGSLIDLDPADVAK
TLEVSAYGGFLVAQEAAKRMLQQGSGAIFFTGASASVKGYPQSAPFAMGK
FALRGLAQSIARELAPKNIHVAHFVIDGAIRSSQRQDPADNPDSTLDPDA
IAQTYLNILRQPRSAWTWEVELRPWVERF
>Ava_4748 conserved hypothetical protein
MSLPAPDFWNVPNTPYQTCLLTDDGSSTTTEVAQCLSDRGWQVIVLSFPN
SLVPKRPVLPAAARRVVLTNLSEEHLQAQLAGIFQTYGLIGTFIHLHPIS
QYLYHQPNTLVNPDKAILKQVFLLAKHLKSSLTQAAGQGRSSFLTLAHLD
GEFGLSGQQDFSAVSGGLFGLTKTLNLEWPAVFCRSLDISPDLDAATTAQ
IILAELHDPNALIQEVGYTKKGRVTLTCELADLGV
>Ava_4667 Short-chain dehydrogenase/reductase SDR
MQNKVVVIVGASGGIGSALADKLASVGAKLVLAARDSSRLAALANDLPGE
VLTIPTDITDASQVENLIQKTVAEFGQIDVLVNAAGVGILKPYNSVEPAD
LDKMLDVNLKGSFYTTQAAAEEMQKRKSGHICNVVGILGKHSMPMAAAYS
ASKFGVVGFSKCLAEELKRFGIKFTLFYFGGVDSPFWDNVSLKVDRKKML
STETAANAIFFALSAEPQAVPLEINIQPDSHLFF
>Ava_1984 RNA-binding region RNP-1
MSIYIGNLSYQVTEEDLKLAFAEYGKVSRVQLPTDRETGRPRGFAFVEME
TEAQETAAIEALDGAEWMGRDLKVNKAKPREERSSPRGGGGSWGNNNRGG
GGGGNRRSY
>Ava_3183 Lipolytic enzyme, G-D-S-L
MKTKFIAASFITLTLISPLKASAANFTGVYVFGDSLSDTGNTFQLSGGLF
DPSNAIPPSPPYEQGRFSNGSIWVDYVGDKLGLTPTPITNLVPTLDFSTP
LTTPFPTQGINFAIGGANSGEGNSFGFPLPGVLQQVSAFQALLQVNQQSF
DPNALYAVSGGANDYLFPPQFSDPSRPEPYTNISQAVSNLAAIGAKNILV
FNLPDLGRIPASGTNGRNPAALTQATQDFNSNLAKNLDEIRKNQQVNIIE
IDIYSLVNRVLATPNEFGFENVTNSCLSQFLNCSNNPSTYFFWDDVHPST
DGHKLVAESVLAATTPESSSTIGVLFLGALGAMSSIKRLQRKSIKSAIQ
>Ava_4718 conserved hypothetical protein
MEKLSAIKKPDINVADAWEQYWNKTLVNSTPVLWDANVERAVVVDLPRFE
LLFNPELPLIDFACGNGTQTKFLSQFFPRVIGLDVSKSALEIAAKENTAA
NISYRLLDGLVPEQAAQIHSEIGDANIYMRTGFHHIPVEKRELLGQSLRI
LLGKQGAMYLIELGTGCIDFFNSLLEKYGQLPYELLLVMEHGIRPGIFTA
EDIELYFPDFEILSQGEGLFQSIHKLPDGNYATPPAFWAVIKHR
>Ava_1598 Radical SAM
MTVAESTCALVSTQPSIGKPTGGIATYSPAYTIVPTYECFNRCTYCNFRT
DPGESSWMSLSAAEDIFKRLQNEQVCEILILSGEVHPNSPKRQVWFQRIY
DLCKLALTLGFLPHTNAGPLSFAEMQELKSVNVSMGLMLEQLTPKLLETV
HRHAPSKLPELRLQQLEWAGELQIPFTTGLLLGIGETNDDCWETLEAISK
LHQRYHHIQEVILQPHSPGNQQTFNAPAFNPHQLPEVIAKARQILPSDIT
IQIPPNLVKDERWLLACVEAGARDLGGIGPKDEVNPDYPHLQAEELREIL
QPAGWDLMPRLPVYPQFDGWLSGELQASVRRWRELVIGNW
>Ava_1556 WD-40 repeat
MTNDLVNNDRSVQQLAWAIDASVGQFKLILARCNYASLRDRLIKRLQEIC
QAEIYVLEVQQSDKTLYTAIRDEFGAEILGCVMVVGLEKVQNLAVMLSSA
NQVREEFRQHFHFPLVLWIDDEIYKQLIQVAPDLESWAVTRNFAIDQQQL
RDFITETAHQYLHEQTKLTLHESLKLETELLAAQRDLFIDGQIDNSELAA
NLLSLLGLTNKNINKISAALEYYHQAGSLWQKLDLIAPQEFIFLETAFCY
YIQSYKYQDINHISWNNTRRYIQEYIDFVNQIQRADLVAKSLYKLATILR
NLKAWEQLEKLSQQALVIHQANNNHREIARDYACLAEVALAQENWRQANQ
LINQALDFISADKSENLLSAPELSLYRFIQGCSQYNLGQIPQAIANLEDA
INRLNPQEDVRLYLHILSYLHNFYFTQKKYLAAYEIKQQSLSIEQQFGLR
AFIGAGRLEARKQTHLETLSTTSLQDNIALEIAASGRQLDVERLIERVGR
PDYKLIVIHGQSGVGKSSLINAGLVPALKKKAIGVQDNLVVVMRVYTNWL
QDLTRILIKDGGQREGEPQHSSSEPQRSPSEPQHSSSEPTVPLSSSFLSS
LLITKLRENEQHNLRTVLVFDQFEEFFFVYPELGQRKQFFEFLGECLHVL
SVKVILSLRVDYIHYLLECNDLPEFQIIGNDILSNNVLYKLGNFSPNDTK
SIIQRLTANTSFNLEPNLIDALVEDLAGELGEVRPIELQIVGAQLQTENI
TTLEKYRRFGTKQDLVQHYLNEVIHDCGEENQQLAEFLLYSLTDERGTRP
LKTRGELERDFQQYFSTYDVESARISPTPRQKQGKITPKTSQQAVRKEQI
LNLQQFDLVLDIFVKSGLVVLLPEKPTDRYQLVHDYLAAFIRQQQQQKLK
QVMAELDKERAQRKLSEAKLYRFLQRALFSSVAVGIGIVVLTALAVQLAN
EAKKQTQNAEISAGINEINALNNSSEAFFVSKQYPDALIEALKAANKLKG
TPWERENSFATIQTAATLQRAIYLQPNEYKENRATEVNTLAGHENWVSSV
AFAPQKRQLASGSGDKTVKIWDINSGKTLKTLSGHSDSVISIAYSPDGQQ
LASGSGDKTIKIWDINSGKTLKTLSGHSDSVINIAYSPNKQQLASASDDK
TVKIWDINSGKSLKTLSGHSHAVRSVTYSPDGKRLASASRDKTIKIWDIN
SGQLLKTLSGHSDGVISIAYSPDGKHLASASSDKTIKIWDISNGQLLKTL
SSHDQPVYSIAYSPNGQQLVSVSGDKTIKIWDVSSSQLLKTLSGHSNSVY
SIAYSPDGKQLASASGDKTIKIWDVSISKPLKILSGHSDSVISIAYSPSE
KQLASGSGDNIIKIWDVSTGQTLKTLSGHSDWVRSITYSPNGKQLASGSG
DKTIKIWDVSTGQPVKTLLGHKDRVISVAYSPDGQQLASASGDTTIKIWD
VNSGQLLKTLTGHSSWVRSVTYSPDGKQLASASDDKTIKIWDISSGKLLK
TLSGHQDSVKSVAYSPDGKQLAAASDNIKIWDVSSGKPLKTLTGHSNWVR
SVAYSPDGQQLASASRDNTIKIWDVSSGQVLKTLTGHSDWVRSIIYSPDG
KQLASASGDKTIIFWDLDFDNLLHTGCNLLNNYLIAHRQVLEELPSCQTS
GR
>Ava_1752 HAD-superfamily hydrolase subfamily IIB
MIKLLVLDIDGTISGESNAVSPYVKEAIAAVQARGIPVAIATGRMYRSAL
RFHQDINSTLPLAAYQGAWIQDPSDQKIHQHLPVDRKTAEQLLDYFEQPQ
WRSLLSIHFYINDQLYVREVTQETATYAQRSGITPIAVGDLRQTLTNAPT
KILALCDDTEVINNLLGSLRLQYTPAELYLTTSVATFFEATNPFVNKGNA
VRYLAEELLGIQSHEVMCIGDNFNDLEMLEYAGIGIAMGNAPTGVQAIAQ
WVAPTVEEDGAAVAIEKFLLS
>Ava_0746 conserved hypothetical protein
MTPTHRESYKIEKLKLTFFEFPPALKSLNFSSFVIGESLSFFGSWMTQIA
LVWLVYQLTNSAMLVGVAGFTNQAVGLFVTPLAGVLLDRWNLRYVLLTTQ
TVSIILSSTLTFLTLNNHINVTWIILIGMLQGTVKAFDLPARQVIIPRLV
ETKADVYSAMASHSFMINTAKFVSPMIGGILIARSGAASCFLVDAISYLP
FISAILTIEVKSLPNSSSHKKTPIWEHLKEGFVFAYEFVPIKYLLILQIL
VCFMAMTYVNLTPIFAKEILNGNAETLGFLMTASALGSIVSGLYLISRKK
ALGLVEIIARSAILLGLSLIMFSRSTVWEFSLIFIFLIGMNNTLTLAALS
NFMQLIITDENKRGRVTSIFTTGFLGILPIGNLFFGALASQVGVANALLF
GGICCLIGGCIFARQISTIKRIVKPIYESVA
>Ava_1771 Rhomboid-like protein
MFPLYDENPTRITPYFTYGLIGMNVLVFLHEVSLSNAQLNQFFSQYAVVP
QELTSNLAGEWPTLFTSQFLHGGWWHLISNMVFLWVFGNNIEERLGHFKY
LIFYLACGALAALCQWFIGMSSTIPSLGASGAISGVLGAYLIRFPQARVT
TLVFLGFFVTTISVPALVIIGIFFVQNVISGLVSLQAAANMSVQTGGVAY
WAHIGGFVFGIILAPIFGLFRRD
>Ava_1517 conserved hypothetical protein
MKRIEVEKRTILVGLAGSHGYGLNRPDSDLDFRGVFIAPKRYYLGFDHIE
QKDAGWDEPGIFPLLDGNKDTVIYELRKILQLLSGANPNVLELLWLNEYP
VLRDVGQHLINHRKLFLSKKVKHTYSGYAFAQIKKMETHRKWLLNPPEKK
PLPSDFGIEDEAPLIKDDLNAFLEYLYTLIRGRIEFLEEAEQLYKLLTAD
IDFKGVLKQYTLPDESLEYTQNLTNSRKDFIRLLQKSQNYQIALREWKAY
LSWQENRNPARAEMERKSGFDLKHGMHCIRLLRSGLEILKTGEITVDRRV
AGDVEDLKAILKGEYSYQQVMEMANDLVAQMDAAYEQSTIPHKPDLEAIN
SLCMELVEMQGWGE
>Ava_1700 Serine/Threonine protein kinase
MTIQLLNDRYQVIRTLGAGGFGETYLAEDTYMPSKRRCVVKQLRPIQNNP
QIYQLVQERFQREAAILEELGGATEQIPALYAYFSADGQFYLVQEWVEGD
TLTARLQQQGLFTESAIQELLVNLLPVLEYVHSKHIVHRDIKPDNIILRH
RDGKPVLIDFGAVRESMGTVVNSQGNPTSSIVIGTPGYMPSEQAAGRPVY
SSDLYSLGLTAIYLLTGRQPQTLDIDSQTGEIMWRQYASQINPVLASVLD
KAIAYHPRDRYATARAMLDNLQSISSPIPPTQPYFAPPPVVSAPPQPTVS
VAPPANVTGNNQKGILFGSLIAGGLIGASVIVGLAFTRTPQPVAENNNTS
PVNSITETPVSNATPEVTQSPVTTQTIVPSPQQQVDPSPVPSITSAPIDV
NINNNNYLWLSQQAVTDADLDGKDGYTLDIMRNTVFARHGRRFDNPGLQD
YFNNQPWYNPIYSPKEFPLKLLTRLEQRNVDYIAAYQKRYNLRHFK
>Ava_2145 conserved hypothetical protein
MQTVQALLTAAKVDSAESPYDTIHIKVLYPAKNTDSNLEKNWGMLPVDAA
QAPFPVVIFFNGFNCDAQKYQWLGIELAQTGLVVVMFNWVGESLPGLVSF
TPGFEIEKRKREFYGTVPTASALPVILTKLAQLQSAGILSGTLDLEKIIL
GGHSEGGRVAMENANPNFFPQVVASFAYGSHTAGAVMLGYKPNTILPLPD
TLPRLVIGGTHDGVIANSSAHFGLNTKDATTPVIRTFQEAIMGGRENSYL
LLLRGANHFSIVDIVDSTLALPFKDIPATQPQEHFRLLLATIIHLFIDAH
VRHQPEAFQKLEQLLIVTNPLIQSFERK
>Ava_0286 WD-40 repeat
MLKRHQEARGFSPPAPSATPAPPALFSHLFQQPMNLTSSKNKEFEEHYSG
MLAEYVTAIAWSPQGNTLAATSANGEVVLWQDGELTTLQAGNGQSVDCLA
FSPDGKFLAVGGQDGRVKIWQEQELIATLENAPAWVDKLAWSHTSNQLAF
SLGRYVQVWDADTREVVTTLNFADSSPLSIDWRIDGQYLAIGGNKGIKIW
HAQDWDEEPYILNMPTVSVTMAWSPDGKFLASGNMDRSVTVLEWNNPDPW
VMRGFPGKIRQLAWSEATTKLGAPILASSSVEGIVVWEKLEDENLGWEAR
VLTNHVGVINAIAFAPKSFLLASAATDGWLCLWNKAKEVSQILTGVADGF
STLAWHPQGKLIAAGGEKGELIIWAKILRGQGFGRS
>Ava_4835 conserved hypothetical protein
MTVIASPYQKYQFLKKVLKNIPVISDYIRYHWSFYRTLTACRGVYRNFPE
ALRATPKGAKAGYNQPEISKHPSAAKLTAAREANEFNSQDYPILVWLASA
FRNSSTVFDLGGNVGHAYYTSKKFLQYPHNLQWLVCDIPEVVNAGEELAK
KGNNPGLSFTVDFSQAENSDILITCGTLQYIEPSLAEMLNKLKKKPRHIL
INQVPFYDGETFITLQNIGYAFCPYKIQNRHEFIASLSAIGYELIDNWKL
ERSCAIPFHPERLVRNYQGFYLRLQ
>Ava_2913 conserved hypothetical protein
MPSGRTHDRITLWALPLVAGLTFWQTRSGNVTLLVAGGFMFGGLMFGPDL
DIYSRQYQRWGFLRWIWLPYQKSLRHRSFLSHGPLIGTTLRVVYLSSLLA
ILTVVILAVADKLWNVTVTWQDVEVTVGRSLTSYSMEFCALFLGLELGAM
SHSLSDWGGSAYKRFRKQGVRGIFPSSKIKKRKVTSRRSTGGRKNQKR
>Ava_4332 ThiJ/PfpI
MTTAQPTSSLSIGIVLFPNVTQLDFTAPYEVFNRLPNTKLYLLSETLEPI
RSDGGLTFLPDTTFAESPLVDLLFVPGGSGIDVKLEDKKFLAFLKTQGEQ
ARYVTSVCTGALLLAAAGLLQGYRATTHWLYLDLLELLGVEVVKQRVVID
RNRITGGGVTAGIDFGLVIAGELFTEAIAQSIQLAIEYNPQPPFESGSPE
TAPEYIINSVKATSKNRLDSRRQIIQKIVAESSISISQK
>Ava_2545 ATPase-like
MLKQLILENWKSFRYAELPLDPLTVLIGTNASGKSNVVEALEFLQRIARG
ENVEAALAGDKTLVSIRGGVEWAARKQETGFTLQTLIQGEDETQDYLYTV
QIQTIPEVRVIQEQITFENINQDNVVYNKKHLTVKNPIFPTKSGLENLEL
HVDINDLMQFFPPDQNILLGDKDFLETLNNVLLPFRNKACKFVVASLKNI
FILNPIASNMRNYSRLADDLESDASNIAGVLAALPDDLKLEIELTLSTYI
KDLPEGDIKKVWAEKVGRFGTDAMLYCQEEWKPGHSTDIDARSMSDGTLR
FLAILTALLTRPEGSQIVIEEIDNGLHPSRAKLLVKILREIGSKRNIDIL
LTTHNPALLDALGPDTVPFVVVAHRDAETGESKLTLLENIDNFSKLFASY
SLGEMTTKGAIERSLSHSE
>Ava_0775 conserved hypothetical protein
MNYKTVDVILERALLGDDISPQEGIVLLTQTDSGAIASIRHTADKLRQQQ
AGDTVTYVINRNINFTNICEQHCSFCAFRRNDGDADAYWLDWAGILEKSH
DAVQRGATEICMQGGLHPQAQIDGKSLPYYLKLLETIKQEYPQIHLHAFS
PQEVQFIARMDGLEYAGVISALQNAGVNSLPGTAAEVLDDQVRRVLCPEK
INTATWLEIISTAHKLGLHTTSTILSGHIETPEQQIGHLEKLRSLQQIAT
NQKYPARITEFIVLPFVGQEAPKSLRRRVGRDQPVLADALLLGAVARIYL
GNWIANHQPSWVKLGLAGATEALNWGCNDIGGTLMEEHITTMAGAVGGTC
MEVETLQNAIASIGRPYQQRDTLYQPVESAKQLANTVIGNG
>Ava_1023 conserved hypothetical protein
MSLRALLLVNRHARQGQARLLEAINHLKKFNFQLIEESTEHPKHLSQVIH
KYKYQVDLVIVGGGDGTLNAVVDALVETQLPLGILPLGTANDLARTLGIS
NSLPEACRTIAEGELRRIDLGWVNGKHFFNVASLGLSVKITRRLTKEFKR
RWGIFAYAVTAMQVIWESRPFSAEIHSKDRVFRIKTVQIAVGNGRYYGGG
MAIVPDASIDDQRLDLYSLEISHWWEIIPLLPAMRNGRHIHRQNVRALNG
QEFEVYTRKPRAINTDGEITTYTPATFRVIPKAVAVLVPPV
>Ava_0054 TPR repeat
MNKTTRVYYPDFVTGRIRKIKGKLFKFLRLALCFLLALICTVNSPVLARV
VTSTDTAQVTSTISLVEQGKALYDTGRFAEAAQILQQVAQEYQQKGENLK
LAATLSNLSLAYQQIGEWQQAQQAITDSLNLLEGKEQNPQVLAQSLDIQG
RLQLAMGKPEAALSTWQRTAKIYQQTKNFHGEVRSQINQAQAWRTQGFYR
RAVKVLLEVRQKLQSQPDSLEKAVGLRSLGDALMVAGNLMDSRTTLEQSL
EVAKRLAIPGEIAASYFSLGNNTRAKLQISEAINYYQQTVATSPSPLTKI
QAQINHLSLLLENEKFTEVATLIPAIQAQINQLPPSHAAIYAEINFAQSL
MKLTSSDNQGRNKNVPSAPSTQDVAKLLATTIQQARSLDDKRAEAYAMLS
LGNLYEQNQQLPEAKSLTQQALILAQASNASDIIYRLQWQLGRLLWAQKD
MSGAIAAYDTAVESLKSLRSDLVAVNQDVQFNFRDSVEPIYRQSVELLLS
SQPEKVNDKILDKARQRIEALQLAELDNFFREACLQGETVLLDQVVDQDN
PTAAILYPIILPKQIQVIVKIPHKPLQLYSTNVEQREVDEVLSELRKNLV
RPTANKNVKTQSQQVYNWLIKPIESELAASGVKTLVFVPDGLLRNLPLGA
LYDGQQYLIEKYAIALSVGLQLLDPKPLQRGDLRALTAGLTQPPPNYPNF
APLPAIKSEVDLIASAGVSITSLFDQQFTREALEKEVNTEPFNVVHLATH
GQFSSRAENTFILAADGPINVTQFDTLLRSREQIRPSAVELLVLSACQTA
AGDSRAALGLAGAAVRAGARSTVASLWQIDDESTALLVGEFYRELKNNNI
TKAEALRRAQLKLLQHPNYSAPSYWSAYVLIGNWL
>Ava_0084 Serine/Threonine protein kinase
MLGTLLVGRYQIRQMLGGGGFGKTYIALDTQRPGQPKCVVKHFQPVTHNP
EFMETARRLFTSEAETLERLGYHDQIPRLLAYFEEHQEFFLVQEFIDGHS
LKAEMPPNQPWTEKKVIALLQQALDILQFIHLHKVIHRDIKPENMLRRLH
DGKLVLIDFGAVKEVQTQISAISGQTEMTVAIGTPGYMSLEQFRGKPRLN
SDIYSLGIVCIQALTGVHPRELAEDPVSGEILWQNSAEVSPMLASVLSKM
VLNNFKLRYQSATEVLEALQQNFHLQAETQITVAPSTLTQQPQLSLSQQQ
RLGLHSSNSSILSVEQYNYLENVLTTFVGPIAATLLRRTASASSYQELID
SLALHLTANQQTEFKKKVEPLLEQPTIKYENTSKSVPEKTQPTTNSGIND
TLIRECERELLDLIGPIAVFLIQKAKKSTSATTSRAEFIKIISDEIPDAQ
KAWQFQQRLLP
>Ava_1572 Short-chain dehydrogenase/reductase SDR
MNKTHKQTALITGAASGIGYELVCIFAENGYNLVLVDRTKEKLEEIATKF
QDKFGIYVKPIVRDLSKTTSPEEIFQELEQANIKVDVLVNNAGFGTYGLF
NDTNLADELEMLQVNLVCTTHLTKLFLKNMVQQGEGKILNVSSAAAFQPG
PLMAVYFATKAYVLSFSEALANELDGTGVTVTVLCPGTTQSAFHQRTGMA
DSKLVKGKRMMDAATVADIGYRGLMQGKTIVIPGLINKIMAKSIRFIPRK
LVTKIVRNMQENK
>Ava_3398 General substrate transporter
MKAFNTFDASLRLNLLILFTAGLLFWSSTATFLPTLPLYIEDVGGSKQEI
GIVMGGFAIGLLVFRPMLGRMADQNGRKLLLLIGTIVATIAPFGYLAFKS
IPLLMLVRVFHGISIAAFTTGYSALIADLAPIAIRGEIISYMSLTAPIGL
AIGPALGGYLQASIGYPILFLIASELAFVGLLGTIQVSNPPVPQGRQATE
KDSNFWQLLSSPRVRVPTLVMLLIGIAIGAVHIFLPLFIKSTGVEFNAGL
FFTIAAIGSFSLRVFAGKASDRFGRGLFITFGIMAYMLSSFLLWQANSAI
SFAIAAIAEGCGGGTMISMITTMMADRSLPQERGRIFSICIAGLDLGIAI
AAPILGFIAEATGYRSMFAYTTALTFLALLIFLTRSSKNLSNSLRFALGR
GQDVYSLHNSN
>Ava_4772 conserved hypothetical protein
MNRSACLIFNPVAGQGDPEVDLAAIRAILEPEMALDIYLTTEELGADKLA
QEAVERGVETIIASGGDGTLSAAAVAVAGTDIPLGIISRGTANAFAVALG
IPDTIDAACRTILQGVTRNVDVAYCNDLPMILLTGIGFEAETVERADREA
KKRFGIMAYVLAGFQELRELESFDVEIETEDKIIKTSASAVTVANAAPPT
SVLAQGPAGIIYDDGLLDLTIVAPNSKAGAIAATYHLFQSASSGNAAERD
DIGYLRAKQFKITADPPQKVVIDGEVVGKTPVDIKCLPAALKIFVPSAPE
EELVEKLEGLPNLTIELKEPAED
>Ava_3000 transferase hexapeptide repeat
MNHQRYSAKSQRFQELLAITVFGNIPTLLLGPKLRNLVYRMIFAHIGSPV
YIQHGVEFTNASNIEIGNSVHLFKGVRLDAKGHPNNRIYLADGVAIERNV
DIGCLENTCIHIDVETFIASDVCISGPGDITIGKRCMIAAHSGIYANNHN
FTDPILPIKYQGVTCKGIVIEDDCWLGHGVTVLDGVTIGKGSVIGAGAVV
TKDIPPFSVAVGAPARVIKSRVAQDLVTSGD
>Ava_0229 Tetratricopeptide TPR_3
MAYWQGRKTEYAQIQQWLSDDDTFLIGIEGIGGTGKSTLATKIYDEIAGF
PKRFWADVSNGASFSDLAREVLTEFGFYVPEQETQLVEALVRCLRSGQFL
LIIDNLESLLQPDRQWGSLFYGDFFQTWVESGGNSKVLVTTRERPELKGF
EWLSLKGLQVDEGVALLTALGIKGDLGEFVKLVDGHPLLLRLVADLLKEE
YSQDPNLSRLADLGLGNLQQLLTDAQVVGVHRRENVGMVLVLDASFNRLS
ELQKALLLNISVYRVAVDSAAAVAMLPGSSAPEIERELRNIVKRSLLVEK
LNGKRQFGFQPVVLEYVRYKAGDQSEAHQRAINYYLLNFKQKPWQTKDDI
KEYLEVFYHWFQLENYDSAFDILKFCDYFLSLRGYYTVQVDLYEQLVQAW
NKTGKSENRNYRATLTSLGNAYNSLGYYPQAIFFFQQSLTVSRQVGDLFW
ENASLTGLGNAYILQGQYHSAIEFYEQSLTISREISDHKTTEGKSLANLG
NIYLCLKQYQAAIEFYKKSWEIFRKIGDRDAESKSLGNLGLVYLSLGQYQ
RAIKFFQQSLVISNDRNVKCSFLSNLGLAHYYLEQYQRAIEFYQQSLEIA
KEIGDIRGEAIAWFNLGLTLENVNRESDALGAYRNARELYQKMGLDANVQ
NCNDAIERLSQPQTPVVTRRGFWAWLRRLWRWLHSWFRR
>Ava_4356 conserved hypothetical protein
MQPFLEAPESLSVDCEISPTLKRAWVRIAFESADKGKVFGRGGRNIQAIR
TVISAAAELAGQSVYLDIYGSTTPGREGMFVEEDQQERTPLPISRERSGN
APPRPVVKPRTR
>Ava_1048 Metallophosphoesterase
MVLNFRFAVVSDLHIALPHTIWDHPSRFHLVEVGIPAFENAINHLTKLNL
DFLLLPGDLTQHGEPENHRWLEERLSKLPFPTYVVPGNHDVPVVMANEQS
IACADFPQYYRQFGYDNTDQLYYTQQLLPGVRLIGLNSNFFNEQGQQMGR
LDAQQFQWLEEVLAAAVDELVLVMVHHNVVEHLPHQSRHPMASRYMLENA
PELVRLLQRYGVKLVFTGHLHVQDVACAEGVYDITTGSLVSYPHPYRVLE
FHRDQQGQEWLQILSHRVESVPEFPNLQQLSRDWMGDRSFPFLVKLLTLS
PLNLPLAQAQELAPSLRDFWATIADGDAVLDYPPNSRNKYAAIFKPIVRS
PTLVPPP
>Ava_1615 Zinc-containing alcohol dehydrogenase superfamily
MPKAAVMSAPNQPVVVQQLPDPILEKGGIIVETLYSEVCGTDVHLLHGRL
EGVPYPIIPGHFSVGRVVETGGAVSDVNSNLIQPGAIATFLDVHETCYNC
WYCLVAKASTRCPQRKVYGVTYSAKEGLLGGWSELIYLKPGVKVLTLPAE
VSPKQFIAGGCALPTALHAIDRAQIQIGDVVVVQGCGPVGLSVAILALLS
GAGKVIVIDKFESRLAVAKSFGVDETLAIKADDPRQHIERVLELTNGHGA
DVIIEATGIPIAVKEGLNMTRNGGRYVIVGHYTNTGEILINPHLEINLKH
IDIRGTWGIDFSHFYRMIELLKRHSDPKKNIAWENLISRSYTLNEINQAL
TDVERGSVLKAVIQPNLS
>Ava_0943 Short-chain dehydrogenase/reductase SDR
MDLYSSIFRCFCIVEAQVNSYMTNTVIITGASQGIGKATALLFARQNYNV
VLAARQPDRLEAIATEIRELGQEAIAIPTDVKDATQVNNMIQKAIAHFGQ
VDVLINNAGIFCLGSVENFSLEDRHQIIDTNLWGYIHTIYAILPYFLQRC
AGTIVNVSSIGGLEPIPYHVPYTASKYAITGLTKSLHAELSPKGIHVSGI
YPSFISTQLMERAIFRGKDEEIAQARTELVGKAIQMPVLEKPEDVAKAIW
SAVKNKRSDVVVGSANFWKAAYQLTPSLIQSLVRRVFGMEERK
>Ava_2212 TPR repeat
MLTRLFQWLKTFDKHPVDDPQTDYFKGVKEQVAEEPPELTNADLELLFNQ
LLQGVNQGRGQQWAIRYLQRMEDRISVERWIDWLLVFGEKLLMSPAPNHH
LGRQMVTLGELGIGKVGELSYDIGIRLLDREFIEINEVNEEIEVSSTTVS
QTSDHLPDSPGQELIRNLGDLLWDSTEPETATHQTQSEIDHLDEDTITSL
QELSWEYHAVPTGSVDEDTIKNLQELSWEYHITPSELVDDHIIENPPELS
WQEPIAPTDVVDATTNLQELTWEYQQAEFTAPPLLMPAQEDVISNLSELV
LDYREQNSAAIVSTQSNWDQSLVNLDPNVAYTLDELMVRLDQSTNLVQQL
AANLIVQNNQPPAIINEHSNAFVKAQELFYQGLQLAKTGDLSGAIANYEQ
AIQLNPNSYEYWFNRGLTLFHLERFVEAIASYDQAIEIKPDYYKAWYNRG
GTLGQLGLYEEAVASLKQAITIQPDMPGAWSSKGWAELKLGQIGEAIASY
DEALLLSPEDQENWYYRGIALGVDEQYEAAIDSYDKALEIQPDFHEVWID
RGVVLFNLKQWSEAIASWDQALSIQADFYLAWYNRGVALENLGHREEAIA
SYKQAIAIKPDFHLAWYNQAVALFYLERFLEAIVCYDNALQIKLDYWEAW
IGRGTAIGNLNETETPLNLLTTIAANNSTLKQAGYEGKLASYQEGLKHIR
PDTHPEGWGRLHLAIANTYYEQGKKYSTSRNYWRKAVTEYNQALLTLTSA
GFPQLHLEVLQSLVKALLGLGQTTQAQEFQQRGSRLLQQLLNEPTRTAEI
KKQLALKFAGFGQLAVDVAVEYGDLVEAWEIAEQGKNACLTWLLCGWNAD
IQPLNYRAVQQLLNPHTAIIYWHISPVALHTFIVKDQAPSPILVFTPVQD
LGAINATPLQDLPLPEAMRRLIAFENWLEDWQQNYQDYHQQAQDQQSKSN
HPWRLEMEHKLLQLKEILEIDTIQQELEDITQLILVPHRDLFRVPLHSLF
QMSESSEAIPNADIAYLPNIQIALSAKNANISQFVNQTLLSVEYPDSTAY
PTLRFANLEAEVVSQMFNNRQRLQGGEATKNNFEDAFFSNYNVLHFTGQA
INRLTDPQRSELALAGEDKFTLAEICETNLSTYNLVTLSTCETATNTNQV
TTSEYVDLVSAFFYKGVSQVVSTLWTVESSASALVIIEFYRRLQLVQSAS
TALAEATTWLRELTASELTQWYENLLNNLDPDELKLRAYLATHLYRISKM
PADKNLYSHPYYWAAFTVTGKPNTTR
>Ava_0353 Serine/Threonine protein kinase with FHA domain modulation
MVTLTLLEPQQKTPLKQWCFENSSVIRIGRAADNHVILSDNLVSRHHLEI
RQVSSGGGGSWQVLSKGTNGTFLNGVLVIQDALPNNALLQLAQGGPILQF
QVQGIIPPEWQSISGSDSPLSGVGSSAEKISPELSNNPGSLTCTHEGNSP
QNLFCIHCGQPLSVIQTIRHYQVLRTLGQGGMGTTYLAWDAAGVIVGHPK
LLVLKQMNADMARIAKAQELFEREAYTLKSLNHSGIPKYYDFFVEGGKKY
LAMELVHGQDLEKRIYATGPVIPSQAIAWMIQTCDILDYLHSQEQPLIHR
DIKPANLMVKTANNQIAVLDFGAVKEIGTTPGTRIGAEGYCAPEQERGQP
LTQSDLYAIGPTIIFLLTGENPFKFYRQKGRGFRFDVSKIPTVTPKLRDV
IERTTEPLPRDRFQSAKELAAALAACK
>Ava_1584 2-nitropropane dioxygenase, NPD
MNSSVKTRILQPLRIGKHIARHPIIQGAMAVRVSGAKLAGAVANAGGVGV
IASLGLGLDSPYFDKRKKRSFFTANRLALIDELAKARSISPDGVIGVNIL
VATKDYSVLAQTAAAQGADIIITGAGLPLALPEYTAAYPDVALVPSVANL
EAAQLICETWQSRYHRLPDALIVENCQQVGGHFTQCEQVNIPGFSLELVI
RQLRDYCQNHLGARIPLIVSGGISDRTDIDRMIAIGADGVQIGTRFVTTV
ECDADQRYKDYHLQAQPEDIVTVPSPVGKPSRALHNLFTEQVMSNSSNLE
KRCIANCLETCLCRDHGKTYCLLQALAQAARGDVENGLIFSGANAGYTQD
IISVPELMTELTQAHTIEVSSSDKY
>Ava_4553 conserved hypothetical protein
MTNTTLNFQNASGHEVLAAVGKKYLRPGGRIATEQLCRWANFQPSETVLE
LASSFGYSAISLAQRYGVKVVGVEKNPDSVVRARENVRVAGLENQIEIIE
GDIFHLDVIPGKFDYVLAEAILTMQSPLGKAKLLAEIHNRLKPGGKFLSH
ELLANDKEEQIRADLARVIRVNSTPLSETNWIAACETAGLKVEKHQIGSM
NLLNLWRMFQDEGIFNTIQILVNILTHKSIRERVLAMRHIFNKYRHELGY
IILCAVAQEDN
>Ava_4923 Serine/Threonine protein kinase
MSYCINPTCPNPENVAYSQRCEACGSRLLLRDRYRVLKPLGQGGFGATFL
AHDQILPGEPSCVIKQLRPSGTAPHVLQMARELFEREAKTLGTIGNHPQV
PRLLDYFEEQEQFYLVQEYISGSTLQQEVKLNGTLSEAGVKQFLSEILPL
LQYIHEHKVIHRDIKPANLIRRSQDARMVLIDFGAVKNQVSQAITNQSAN
TALTAYAIGTPGFAPPEQMAMRPVYASDIYALGVTCVYLLTGKTPKDLDY
NPTTGEVMWEHLVQVSDHLIGVLRKMLEVSVRSRYQSATDVLKALEIEPY
LESLAQGLLVKSEKEQTSQRPENSAVLSSSSPVAATSVGGVAQVAAAIRA
RRAKDAAAKLAILPNSNSNGNGLPTQQSKAGRRLDSQTLIKAYLKGRRDF
ALHNLNFLNLQGVDLSETNFHSAQLQSTNLQGANLHNSDFGRASLTRANL
KDANLSKAYFNHADLEGADLRGADLSHAYLSNANLRGTNLCGANLTGAKI
TDEQLALAKTNWMTIRPNGKRGLL
>Ava_2228 HAD-superfamily hydrolase, subfamily IA, variant 1
MTQKVIIFDFDGTIADTVDALVSIANRLAVEFGYVQITPEQLTLLRNFSS
REIIKYSGVSLIKIPFLVKKVKSELKNKIHELKPIPGIKEALLELKEHDY
KLGIITSNSRENVTNFLSINELDSLFDFIYSGVTIFGKTTIINNVLRQKQ
FKPQAVIYVGDETRDIEASKKANIKVIAVTWGFNSPEILAKQNPDFLIHQ
PRELLEVIKNSQ
>Ava_0846 Polysaccharide biosynthesis protein
MLISKFKQSLSNKFIRNAGALGAAELANRIFRLGTTITLARMFSPQDYGL
MAVVYTVFDFATVFTFRGGIGAKIVQADEQDVKTICDTSYWLNWILCIAI
FLLQCIAAFPIAQFYKNQQLVLPICTVGLVYLMFPLFLVNSAIIERENRL
KITALCNVIQSLLSNIIIVVFALMGMGVWAIVWSMVLTTPVWIIITWRNH
SWRPPKLFKLDKYKEVISFGADLLAIELLGKLRGNIDYLIIGGFLSIEAL
GIYYFAFNAGLGISMSVINTFGSALFPYLCEVRSNLSHLKERYFSSLKKA
YFVITPLILLQTCLAPIYVPIIFGQKWSSAIPVLMLICLSALSLQFGRAT
FLLLNAMGKTRLTLYWNLIYTILFSTGLLISVHGGIIYVAIAVVICQLFI
APIFNIWVVNRTFYKKQFITL
>Ava_0976 conserved hypothetical protein
MPTWNQRFQGLLDLFLQSNCPLCQRPTSTELCPNCTKQLQKCRHTHPDGL
WKQPIPVFSWGLYGGTLKRAIALMKYDNQPQIARPLGQWLGETWLLHSLH
QTTQIVVVPIPLHPSKQKQRGYNQAALIAQSFCQTTGLKLKLNGLARVRE
TTAQFGLSVSERENNLAEAFAIGQDFRHSCPKTPVLLIDDIYTTGATVKS
AVQILRQNEITVLGLAATASTVKDRYIKN
>Ava_0450 Cupin region
MQGRDWLLTGDGQYQVCKSARSWDLLQENYRLYRFLTEMEDVLNQVSDES
TRLPEIRMLVRRLIVNSYWVRSQYLDPSPTTGTSVLLLYDELGFPLTVQT
VTFAPGTLSTIHNHGTWGVVAVLKGQEKNTVWRCTKTLDSQDKIEATGEI
ILSPGDMISFTPDAIHSVQAIGDEPTVTFNIYGETDPKQRFEFDAVTHNA
KKF
>Ava_0900 conserved hypothetical protein
MQGLISLDLVDLVMAVGLMAIAIGLSAWEGLGLELNLAIATGRTILQLLV
LGYVLDFIFALDNPGAVLAILGVILTITAIVARNRISQKIPYVLPLVWGA
IFISTALTVFYTTFLIIQPDRWYEPRYVIPLAGMVLGNAMNAAAIAGERL
VSSMNSFSTEIETHLSLGATPAQAVSQYRKEAIRAALLPTLNQMMLVGMV
AIPGITTGQLLAGINALDAVSYEIVIIFMVAIANLLTTVLLTKGLCRQFF
NSAAQLVR
>Ava_3806 HAD-superfamily hydrolase subfamily IIIA
MTWHKFLQPDLILAGSVLNLTPDIIQHHQLKGLVLDVDETLVPITVGSAS
PELREWVEQIRSVTALWLVSNNMSEARIGGIARSLNLPYYLGAAKPSRRK
IRAALQEMNLPVEQVGMVGDRLFTDVLAGNRLGMFTILVEPIIHPDAALR
SHPIRNFEVWFSEILGASINPEHTKSYKN
>Ava_2660 D-isomer specific 2-hydroxyacid dehydrogenase, catalytic region
MKVAVFSTKAYDRQFLEAANAPKQHDLVFFEPRLNQDTAILAAGFPAVCV
FVHDQVDAATLEILASRGTRLIVLRCAGFNNVDLKAANKLGVNVVRVPAY
SPYGVAEHAVGLILSLNRKIHRAYNRVREGNFALDGLLGFNINGRTVGII
GTGKIGLILGQIMKGFGCRLLAYDVYHNPEMLALGGEYVELPELFANSDI
ISLHCPLMPQTHHLINAEAIEQVKPGVMLINTSRGALIHTQAVIEGLKTG
KIGSLGVDVYEQESELFFEDLSGEIIQDDIFQRLTTFPNVLITGHQAFFT
EDALRNIAETTLNNIADIEQGRSCPNEIRYQPEVEAKVLVS
>Ava_2578 Metallophosphoesterase
MNEKLPISIAQITDIHLLASESQRLQGISTIESFLAVMKRLEELRPELDL
LLMTGDLSEDGTPESYENLQHYLNSLQIAAYWLPGNHDCAIAMDKILNLG
MVSRRKSFQRGNWNFILLNSSIPDCVYGYLSATTLDWLDSELRMLPNNPT
LIALHHPPVSVNSAWIDGSCLRNSQELFAVIDRYPQVKLVLFGHIHQEFR
HQRHNVHYLGSPSTCYQFQSQSPIFAINQELPGFRLLKLYADGTWTTKIE
RVPYSLPIESAVTVSY
>Ava_2226 conserved hypothetical protein
MMIYRIWHLIVIIWLIGGCWLLDGTGVAIASPTTSSVYEQRSIHSPDGIG
KYYMGREIAKFMGHTGAGWLERPSREAEEQPSKLINALNLKPDDVVADIG
AGTGYISLQIAPLLTTGKVFAVDIQPEMLEILEFFKQEKNIANIEPILAT
ANNPNLPPASVDLALMVDAYHEFEYPQEVMQGIVQALKPGGKVVLVEYRG
ENPFIMIKRLHKMTQKQVRQEMAAVGLTWRETQNLLPQQHLMIFEKAVRN
S
>Ava_4825 Tetratricopeptide TPR_3
MRLPIIPLTLILALASPSLAQAPTPTAEEQITQAVILNSNGESLIYKDFF
GVGELQAALENFQQALAIFKKYGAKAGEANSLVNIGYVYFRKGEYGKALE
YFQSSLDIRRKTRDRQNEWIPLSYIGEVYVNLGQYPQALEYYQPALAIIK
ELKAANPKDSSYATSEKTLLADIGAVYFRMGQYTKALDFYQKTLAMQKAD
DDKIGGIQTLNNIGVVYVNLGNYKQALDSYQQGLANLQECCSNYIGTKAA
IINNLASTNFSLGQYKKSLELAEESANIYSRINHDAEKATKQEIKLLYDY
LGQNSQALQQVASRANVGDAFGKDSFQFQGRALNMNNIGQIYLSLGKYDQ
ALKLYQQALNIYQENSYKPGIAVTLNNIAKVQSSLGKYLQAIELNQQALT
IYQEVGDRTGEGVTMSNLGQIYQKQGQQEKASGLYQQALAMHRQVSDKVS
EAATLKLLADTLSAQNQPQLAIAFYKQSVNLTESIRQSLRTIPADIQKSY
TETVAERYRRLADLLLKQNRPSEAQQVLDLLKIQEANDFIGNRRSQPQTT
TAVVNTGQRGVNTEPQLSQKLPLQPQEQQISQKYSAIQDQAIALGQELTN
LRQTPANARTATQEKRIAELVKLEQTITAEFNKFTKTPAVVALVQQLSAN
SGQENLSLRQLNSLRDNLRQLNKKAVLLYPLVLDDRLELVVVTADTPPIH
RPVPVKPAELNQVINEFRQAIVVPYKDSKIPANKLYNWLIKPIENDLKQA
NAQAIIYAPDSKLRYIPLAALYDGKNWLIEHYIINNITAASLTKLNSKPQ
ASLPTLAAAFTKGDYKVAVGERQEVFSGLQFAKVEVDNLAKTIKGTKILL
DNDFSPQVTIPQMNDYKIVHLATHGMLVSGDPESSFILFGNGDRVTIKDI
ENWSLPNVDLVVLSACQTGLGNQLGNGQEILGLGYQIQLTGAKASIASLW
AVSDGGTQALMDGFYNVLKTGNLTKSEALRTAQLSLLTGNNQFNHPYYWA
SFILIGNGL
>Ava_0283 Peptidase U62, modulator of DNA gyrase
MPTILADAQNLLSDLIARYSSRVDYLMIRLEEAEGTDILLRGDKVETLSE
GISIGGHVRACYKGGWGLSCFNQLATIQDRIEEAIAAARMVGDEETILAP
MDSVQAICRLPLTGTDPRKVPLAKKKELCDRYTDLLKSVDHRITTTSVRY
GDSSQKIILATSEGTLIQQSWVDMEMRFAATAKNGETVQTGRETTGSRKA
YEDLINLDTQVKNAAQRAVAALSLPSVKGNTYTVVIDPVLTGLFVHEAFG
HLSEADMAYENPDLLDVMTIGRRFGPKELQIFDGAAPEGHRGSYFYDDEG
TPATTTQLIKDGVLVGRLHSRETAGKLEETPTGNARCLNYHFNPIVRMTN
TWIERGKTPIADLFTGIKEGVYARNWLGGMTNGEMFTFSAGEAWMIRNGN
IAEPVKDVTLSGNVFQTLADIEAIGDDFYWDESGGCGKGGQNGLPVGCGG
PSLRIRDVVVGGEM
>Ava_1555 Putative cyclase
MSSLKTIAYSRVIHLSHIIDPDIPQWPGDPPVEFTTVAQLPDDGYYLRRF
AIGEHSATHINAPNSFYQAGIGIDQYPAQSLFVSAVVIDIQAAAAVNADY
RLTIDDIFAWEGEHGEIPRGNVVLLHSGWQNKWSDKNAFLNQDAEGIMHF
PGFGSDATQFLLDERQIAGVGIDTHGVDPGQDTSFATNRLVLAQQGIVLE
NLTNLHQLPAKGSNLAIAILRLRGGSGSPVGVLALVP
>Ava_3857 O-methyltransferase, family 3
MTNVIVQPTARPVTPLGILTKQLEAIVQEVKQHPDLPGELIANIHQAWRL
AAGIDPYLEECTTPESPELAALAKTTATEAWGEHFHGGTTVRPLEQEMLS
GHIEGQTLKMFVHMTKAKKVLEIGMFTGYSALAMAEALPEDGLLVACEVD
PYAAEIGQKAFQQSPHGGKIRVELDAALATLDKLAEAGESFDLVFIDADK
KEYVAYFHKLLGSSLLAPDGFICVDNTLLQGEVYLPAEERSVNGEAIAQF
NHTVAIDPRVEQVLLPLRDGLTIIRRIQP
>Ava_4371 Peptidase M16-like
MRRAATDMHRRKQFKIRNPKFTMGKGKSFILALVAIFAFLAVTFNFSLTA
TAAAKHYTELQFAPLPEVKLPKYERFVLQNGLVVYLMEDRELPLIGGTAL
VRTGSRWEPADKVGLASFTGGVMRTGGTKEHSPDDLNEILEQRAASVEVN
IGEAAGSASFEALSEDVETVFGLFAEVLRSPVFAQAKLDLAKTQAKGGIS
RRNDDPDDIANREFRKLIYGKDSPYGRITEYATVNAIAREDLVQFHQQYF
HPNNMILGIVGDFDSKKMRSLIQAKLGNWARNPKFTKPTLPAVSPANTGG
VFFVNQPQLTQSSILVGHLGGKFDNPDYAALDVLNGVLNGFGGRLFNEVR
SRQGLAYSVYGYWSPRFDYPGMFMAGGQTRSDATVQFVKALQAEIKRIQS
QPVTAEELARAKESTLNSFVFNFQDPSQTLSRLMRYEYYGYPADFLFRYQ
KAVAATTIADVQRVAKQYLKPDNLVTLVVGNQTAIQPPLTQLAAQVTPID
VTIPSPPQQAQN
>Ava_3381 ABC-1
MSFLPGDSVITQRYAQDMETNYSDKAYRWNRENYSSKRRFVDIWSFVLTL
MFKLWRYNKSWSYPGGVTEAKQAARRKAQAVWIRNTLLDLGPTFIKVGQL
FSTRADIFPGEYVEELAKLQDKVPAFGYEQVEKIVEQELGKKIPELFHSF
EPIPLAAASLGQVHKAVLHSGESVVVKVQRPGLKKLFEIDLRILKGIARY
FQSHPKWGRGRDWMGIYEECCRILWEEIDYLNEGRNADTFRRNFRGYEWV
KVPKVYWRYASPRVLTLEYLPGIKISQYEAIEAAGLDRKVLARQGAQAYL
LQLLNNGFFHADPHPGNIAVSADGALIFYDFGMMGQIKSNIREGLMQTLF
GIAQKDGDRVVQSLIDLGAIAPVDDMGPVRRSVQYMLDNFMDKPFENQSV
AAISDDLYEIAYNQPFRFPATFTFVMRAFSTLEGVGKGLDPEFNFMEVAQ
PYAMQLMTDMNGSDSNSFLNELSRQAVQVSTTAFGLPRRLEDTLEKLERG
DMRLRVRSIESERLLRRQGNIQIGIGYALIISGFTLSATILLVNHYVWLA
LLAGLIAAAVSVMLIRLLLRLDRYDRMY
>Ava_3156 Tetratricopeptide TPR_4
MTQQHLFPSTQAKSYQLFGFYLLILTSLLHPVAASAADITQQLHRPFNSS
IGRQSRDDADSLLQMAEQQYAAGYADKAIDSGLQALDIYHLIGDLRAKGL
TYNLLAKAYIELGRFKEGEDALRRRLAIARDTKDFQSQIFALNNLGTLLL
QAGEATAAGQTIQEAYTIADNVKNIDGEGLSLSNLGLVSARLGDYNKAIK
LYETALIFRRRTGDAPGEMNTLNNLGDAYLAAGNYQDTIGTYGAAMRIAK
TIGDRSNQLRAIDGLVTAHSAVGRYERAFDLLQQRLTIAQELQNLREELK
SFESYAKLYEQLGNYPTARNFYERAIILSQTLEDNKQEVFLRDRLTQMLR
SQKSIAK
>Ava_3970 Short-chain dehydrogenase/reductase SDR
MSFIDEINRANVLIVGASQGIGLGFVKKLLQDDRIAKIYATYRQKDSASE
LITLEKEYSPRLTCLSLDITDELQIAEILQQINTETNRLHLVINCVGILH
EGNLQPEKSLRQLNSENLLRYFHINSIGAVLLAKHLLPCFQHNEPSVFAS
ISAKLGSIGDNQLGGWYGYRASKAALNMFMRTVAVEYGRSCPKTTVVTLH
PGTTDTRLSRPFQKNVAAEKLFSIERTVDQLLAVIAQLQKGDSGKFFSWD
GTQLPW
>Ava_0555 conserved hypothetical protein
MSTESLELAKTRYQAGKFAFENGQYREAVENLEKASALVARNSRLGGEVE
IWLVTAYEAAGRTEDAIALCQQLRRHPHSETSQQARRLLYILQAPRLKRP
SNWMTEIPDLGALSDNEAKTRVMAKPRKPTEKKAPAEVEFVDLSQVNTKD
NRFIWLALIMIGLTISYLVWLGF
>Ava_2233 Protein of unknown function UPF0118
MNLGQWIGLIAIVLSLYILWQIKEVLLLMFAAVVLATTLNRLAKRFQRSG
VKRGLAVVLAVVIFFAIVIGFFWLIVPPFAEQFNELTYRVPQGFERFNGW
INELRTRIPAQLVPYIPDLNSLIQQAQPFINRVLGSSFAIVSGSLEVVLK
VLLVLVLTGMLLADPIAYRKVFVRLFPSFYRRRVDGILDKCEVSLEGWVT
GAVIAMGVVGLMSVVGLSILGVKAALALGVLAGFLNLIPNLGPTLSVVPA
MAIAFLDSPWKVVAVLILYFIIQQTESNFITPIVMAQQVSLLPAVTLISQ
LFFVTFFGFLGLFLALPLTVVAKIWVQEVLIKDVLDEWRHDHSQESELVM
VSHYPEGDDPWSDDQPANQPADDTVSQED
>Ava_1800 Short-chain dehydrogenase/reductase SDR
MSLNGKHVLVTGGSRGIGRAITLKLAEHGVKVAIHYHKNDEAAENTLVQV
RERGGEGFIVQADILQPQDINRMFSQVKDKFGSLDIFVNCARPDLATFYH
SPMKMTLDHWRAAIDSQAQAFLLGTQQAASLMRDGGRIIAITYSPSGRTG
SWQPWMAMGTAKAALESLCRYFAVALGSRGITVNGISPGVIFGEPNPLEG
GVLRSLPTAAQEATQNWHESGWTPMKRVGTPDDIANAAMLLCMQEASFIT
GQIIHVDGGASIMDSMCPLEIQQG
>Ava_4522 Rieske (2Fe-2S) region
MNWIKVIGQDELPPNGRKVVKVEQRNILLLNHNNQVFAVENSCPHLKLPL
QKGKITDDGAIVCPFHRSAFDLTSGNPTEWTPFPPGIGKVMGMISKEKGL
SVFPTRVEEGSIWVSI
>Ava_2834 Major facilitator superfamily MFS_1
MKHRLPLISGALFLCVFLSIFNEVLLSPFYPQFFRKVFGVTDLAYTGYYI
FVCRLTVVLCAPVWGVLSRRFEVKHLLFVGQLGAAFMTALMGTSTSAEQF
LIYTILLLLCKSSYLLVYPLIIQLGGEEKRTAIAGTYQAVFHSAIIIATI
VGAFMVNINTPLIIFYGIAAADLLQLAVCAYILRGVSTKEAGGQGAGGKG
DGGRGENQLVAPNQLGYIITIGVVILTFQLANNLVRPYFTEYVTAEPLNV
DLFTGSLLFLIPSVMVIAALPYIRQACRPERLQTIYLASLSLLIVSLGLQ
GLSSNLPLLILARIVYGFFLAVTQAALELQIFNQSTAKHLHFNYSLAISF
ANIGHLGAPLLASWLVNTHSLASPFIAATIICCLNLLFFRYVPKTGGRRQ
AAEGF
>Ava_0007 NUDIX hydrolase
MTKHGEIRVIVLGLVQNGNRIFVSEGYDPVKQSTFYRALGGGVDFGETSL
AALKREFQEEIQAELTNIRYLGCIENLFTFNARKGHEIIQLYQCDFTDPK
FYQLESLVFYESEHHKHRAIWIDIDRFKSGELILVPEEFFNYL
>Ava_0839 Polysaccharide biosynthesis protein
MLMKKIKQLLSSQFIRNVGWLGTAELINRIFRLGTTVTLARMFSSQDYGA
MALIYTIFEFANVFTLRGGIGAKIIQADEKDLNTICNTSFWLNLILGISV
FLCQCIAAFPIAQFYGNQQLVFPICTLAVVYLIYPIYMINSALIERDNHL
KITALCNAIQAFVGNVITVVFALLGMGIWAIVLPIVFSTPVWVIVTWMNN
SWRPPTKFSLDKWREVTSFGKNILAVEFLNKIRNNLDYLIVGKFLGTEQL
GIYFFAFNAGSGISINVINTFMSALFPYICAVRENLAEFKQRYVNSLKKI
LSFVIPIILAQAILAPIYVPIVFGEKWTTAVPILILICLSVIPRVFGWAN
SLLLNAVDKTNINLNINLLFTIFFAAAVLLTVQWGVFWVAVGVLLSHVLF
IPASVLWSYRYIFHNRTHTLSKV
>Ava_3546 conserved hypothetical protein
MTNLELSGSQSYLNDESTANLSSLAGKFIVQFWGVRGLIPTPSSNTSRYG
GNTACVEMQVAGKRLIFDGGTGLRILGKAWQHLQQPLEAHLFFTNSQSNR
IQGFPFFAPAFIAGNCFHIYGGAASNGASIKQSLCDQMLQPHFPYPLQVM
QSELQFYHLTPDSELKLDDVTIRTALINQTQRSIGYRVTWQEYSVAYVTD
LNMSAEQGEREQVLQIIQDVDLLIANATYTPPTSHEHESAELLWQAAVEM
AQKAGVRKLIISHHHPDDHDDFLDQVEKEVQSTFSNASLACEGLVLPVF
>Ava_2116 Survival protein SurE
MKLLISNDDGISALGIRTLANALAEVGHDVTVVCPDRERSATGHGLTLHQ
PIRAEIVESIFHPAIKAWACDGTPSDCVKLALWALLDSPPDLVLSGINQG
ANLGTEILYSGTVSAAMEGMIEGIPSIAFSLTSHISRNFQPAAKFATILV
EQLAAKPIPDLMLLNVNIPPVEWEEIAGVKLTRQGVRRYVDVFDKRTDPR
GKTYYWLTGEVLEEVEPPEGLNLPQNVPIDVHAVKDNYISITPLQYNLTY
ATALDKLSNWNFPMS
>Ava_1843 Photosystem II manganese-stabilizing protein PsbO
MRYRALIVAFLAVCLGLLTACSDAPASSTRDILTYEQIRGTGLANKCPQL
TETSRGSIPLDSSKSYVIKELCLEPTNFFVKEEPANKRQAAEFVAGKLLT
RYTSTIDQVSGDLKFNDDSSLTFVEKDGLDFQAITVQLPGGERVPFLFTI
KNLVAQTQPGLSSLNTSTDFEGTFKVPSYRGAAFLDPKGRGVVSGYDNAV
ALPAQADDEDLTRTNVKRAEILNGKISLQIAKVDSSSGEIAGTFESEQPS
DTDLGADEPKEVKIRGIFYARVE
>Ava_4708 Alpha/beta hydrolase fold
MFPSFLPAAVGQLTESESIALAKTIQTQAIATPLSNQPITTAYVRQGSGG
TPILLIHGFDSSVLEFRRLLPLLGKENETWAVDLLGFGFTQRLAGIKFSP
VAIRTHLYSFWKTLINQPVILVGASMGGAAAIDFTLTYPEAVQKLVLIDS
AGLRGGSPLSKFMFPPLDYLAAQFLRSPKVRDRVSRAAYKNPNLATVDAL
CCGALHLEMPSWPEALIAFTKSGGYTAFRFKQLAEIISPTLILWGDADRI
LGTEDGKRFKRAIPHSQLIWIQDCGHIPHLEQPGITAQHILSFCS
>Ava_4183 Tetratricopeptide TPR_4
MVRFKEFLWKQRQKFRYGILAFVMALLCIFVSPVVARDRGSPGLTASVVA
SDFSRDLENPSPNLSRQLLQVGKAAQRTGSPARREALKFPPSLVGKGVRG
LGFALAFPHNVKSQVVASVPQEEEAKNLYTAGRFAEAVQLLQQVLPAYQT
SGDTVAQAVILSNLALNLHQLGRIGEATEAIEKAIAILQTSPNSPQRLSI
WAQVWDVKGSLELAQGKAEAALSAWEKSALLYQQLKDHNKATLAQVNQAQ
ALQTLGLYRREITLLQNLTTSLAKQPDSLSKAASFRNLGEALILAGDLKQ
AEVNLQTSLKIAQQLQSPEAIASAYLSLGNLAYLPGNSTQQKNIALNYYQ
QASVLSTTATTKISSQLNRLRLLIDLEQWSNAQALYSEIQSQIANLPLGR
SAIYAQVNLAQSLLKLTKKFPNPSPETIGKILAVARQQAEQLQDVRAGSF
VLGNLGHLYELSHQWSIAQDLTRKALNLSQSIHAPDIDYRWQWQLGRVLC
QGKRQCEQQGDMQGAIAAYTEAFNTLQNIRTDLIATNKNVQFSFRETVEP
VYRRLVDLLLQSSAPSQDNLIQARNVIEALQVAKLQNYLQQTCQDTRLEL
DQVIDSKDQTSAIIYAIILDDRVEVILKRPNQKLDHYATKPQEKIEEVVK
RLYQNLQEVGSYEEVQQDGQQVYSWLIQPIEKKLKESKIKNLVFVLDGFL
RKIPMASLYDGQQYLVEKYAVSLALGLQVREPKPLNRSKMQVLAASLTEP
PNKPGFSGFSRLANVNQEVREIKNTGIAVHWITDKEFTTKNFNQKLNTSN
FDVIHLATHGLFSADRENTFVVTADGKLKIDDFDQLFIDQQQNSTHKIEL
LILSACQTATGNDQAVMGIAGATVQAGASSAIASLWDLDDEASVPFIKEF
YQHLGQPNISRAEALRLAQQALLKNPKYDHPRYWAPYVLIGSWL
>Ava_4630 HAD-superfamily hydrolase subfamily IA, variant 3
MVTAILFDLDGTIVNTDPIHYQAWQQMLWKYNIEIDEKFYKSRISGRLNP
EIVKDILPELSSAAGREFADEKEALFRQLASHLQPLNGFAELIAWTELHQ
LKRALVTNAPRLNAEFMLEVLGITESFHQIVLADDCVAGKPDPAPYQVAL
GKLGISAEKAIALEDSPSGIRAAVGAGIRTIGIASTHDPDVLQEVGSFMA
IHDFTDLHLWTLLNSLIEPDLIRST
>Ava_1043 Methyltransferase FkbM
MKDYLYELRRNIYESFGNSKYSKPGLNNLDKKLENYLNFRNGFFIEVGAN
DGYRQSNTYYLEKFLGWHGILVEGIPSLYKECKRIRNNSSVYNCALVAPD
FPDSFVQMHYANLMSVVENSLKNKESQYEHIQRGLELQKIQQSYGIQVPA
KTLESILDKFAELPQIDFLSLDVEGYELDVLKGINLVKYQPKYILVEARF
FEEINLYLSFNYDLIDQLSHHDYLYRLKSANKIQ
>Ava_3384 Serine/Threonine protein kinase
MSDPNIGRLLGKRYQLQELIGTGAMGRVYRAKDVLLGGVPVAVKFLALSM
QNEKMRLQERFEREAKTCALLGQKSIHIVRVMDYGVDENKIPFYVMEYLQ
GHSLNQIIRQQNISLPRFVSMARQICLGLQCAHNGIPVDGTVYPIIHRDI
KPSNILVIQDPSFGELVKVLDFGIAKLLQSNSDQTKFYLGTLAYSSPEQM
EGKELNNRSDIYSLGVMMFEMLTGKMPLVAPTHSFGSWYKTHHHQPPRTF
SEVAPDLAIPQEIESLVMNCLAKAAKDRPSSISVILQVLESVEQQRTKRL
LKQENYDSRQSTATLTINVVSEQKIKPDVLLSAPQDNTIYNTSWPQNKPI
ADIVFPQPINVSGEVLPALWVMLPEKEIFKRIICSRYNQFLFIAAPHPML
LWITVIHNRKYGAKWLPYYLDLKTGIGQEIASLLVKTGGYRLLFFARESP
NRCSHVLSASVASAQRQRLQQWIMTAKTFVSCVDPQMSKNLLKGEYEKLK
PQILAKLAAIDTDSPIDLSG
>Ava_4839 Polysaccharide biosynthesis protein
MLVNKLTQGITTFVLTAAIARNLGASSLGQYLLALSYYYIFVNLISQGFK
TLFTREIAREPELTSEYLVSGTLLQFVFSIIGYLALVVVVFLLPYNADTS
FICYVMGLTIVPFALSNITEAIFQAQEKMHLIAISTVPVYILRLLVMVWA
MQLNYGVEYIAGILLFSECLILVIEWLLLTKIIKPKWHIKQDFIWKTIKA
ARTFVAIEGIGIIAGKLNLLILSILGSEILVGIYGAVIQLLQPFILVSSS
IALAAFPSMSKAVALGKNQQRKVAEDVINILLCMSLPFFIGILLFSHELL
SLVYKDQGITAATTTLKLVSFALISNSFSGIFGYLLIANGFEKFNLLEAV
LTTLVGGISGIVLIGQHQLLGAALMSLFMSYTNLSFFGYIIYRRLFSLNL
RRVLIRPLLITTFMTAVFFLLKTTNLNFVWILIFAFSVYIMLISLLAIRQ
FGGFNSVYKKILSKG
>Ava_1541 WD-40 repeat
MTPLNLGQLVYTSFAIAGLRTVASKEVPPEIQQAFLEKVVYQYWDAYNPP
SAGYRAAYLLQVTSEDNLFGWLYNDGLDDFGRSDVPYFLCYYLPGKLHPS
QLENIFICLCTGPITIVDRQNLPVSLESLVTQNLWSYQPTRAGVKIPRQV
IEESQITLKQEELLNLFVCASQDEIKDTSIADRNLLIVSKQKIRLLDTPT
TPNQAITQQELPLAVRIAENDHQMPAAKTHLTDNQKAVFPFPGKLTLMLG
TIATVVSLLTLNYFLKIAPFAGTVQKVTAPIPTPNPPVDQQSTTLTKTLF
GHTDSVWSVALTKDGQTLMSASEDKTIKVWNLDTAKVTTTLQGHTDTVRA
IALTPDDQTLISGSADKTIKIWNLQTFKLKRTMSSLSGGIWSLAISSDGQ
TLVTVHENGSIQIWNFPTGQLLRTIKGHQGRVFSVAMSPDGETFATGGID
KNIKIWNLYTGECLRTIAEHQDAVRALVFSHDGKMLVSSSWDQTIKIWQM
PTGKLLHTLLGHTSRVVTLSLGIAEQTLVSGSLDNKLKIWNLQTGKLLET
LSGHSDWILAIATNPAKQILVSSAKDKTIRVWQPQIIGR
>Ava_2187 conserved hypothetical protein
MTDAKPLDFIQEPTREIQAHIERLSRAPYLELNQVKSCHTWMYELVISRM
TGLLVGESRSGKTVTCKAFRNNYNNLRQGQEQRIKPVVYIQISKNCGSRE
LFVKILKALNKPSNGTIADLRERTLDSLEIHQVEMLIIDEANHLKIETFS
DVRHIYDEDSLKISVLLVGTTSRLLAVVKRDEQVVNRFLEKFEIDKLEEN
QFKQMIQVWERDVLRLPEESKLASGESFKLLKQSTNKLIGRLDMILRKAA
IRSLLRGYKKVDQGVLKEIITATKF
>Ava_3102 Alpha/beta hydrolase fold
MQATTVPSTTPIPGQYWQWRGHKIYYVRAGEKRPQRPPLLLVHGFGASTD
HWRKNITGLCDDFEVFAIDLLGFGRSAKPKLQYGGDLWRDQLHDFISEVI
GQKTVLAGNSLGGYACLCVAAQRPESAAGVVLLNSAGPFSETQTTPEPEA
LQSQIQPPKQSSPLQKIFGDSVKWIFQQPLAQFLLFQYVRQGWVIRQTLE
KVYLDKSAITNQLVEEIARPAYDVGALDVFVSVFSSPQGEKVDVLLKQLT
CPLLMLWGEADPWINARERSQKFRLYYPELTEYFLKAGHCPHDEVPNQVN
PLLQEWVLSIAR
>Ava_3359 hypothetical protein
MRVDDKLAMSNQILQQQIAYYRARANEYDQWFYRIGRYDRGNELNRRWFK
EVGIVKQGLQQIGQADKILELACGTGIWTQELLKIGQKITAIDASEEVIE
INRRKLNSPKVEYHQIDLFAWEPQAEYDLVFFSFWLSHVPPELLKPFLLK
VYKSVRVGGRLFIVDSRFEPTSTANNHILNDDGSIYKNRKLNDGQEFQIV
KVFYQQDELQKHLTQVGFKSDVKVTDHYFIYANNTKFYN
>Ava_0107 conserved hypothetical protein
MKIWHRKFGESIGNITYHHLPALRWRNFRLFFGGQLLSMSGTFMTQQLTI
PWLVYDLTKSAWLLGVAGFVQFLPTLLLIPFSGILSDRWSRRDLLMLVQI
LGISVSLALTILTFTNWITFPILLVLSALNGLLKGLDMPVRHTIVTETVD
DRADWSNAIALNSVMLSSSLVLGPAMGGILIATLGVKYCFLYDTLSYIPA
IFTLLAMQLPVRPMQSLSGISNTLQKLGEGFEYVSKFQPIRAILLMLALH
GLVGMSHIALMPVVAAKILNGDATTMAHLSTSAPIGSLLACLYLSIRRGI
AGLERLIVAAQISIGISLICFSLSRQIELSIIILVFIGCFAILQITSSNM
IIQTLVAEDKRGRVMSFYALAMVGTMPFGNLFAGTLADNFGATNALIVCG
SLSISGALWFSSQLPAVSRWIAR
>Ava_1341 conserved hypothetical protein
MTNEEIQQEIERLRQPDILNLEQVKRFSSWLDERRKLRKPGRAVGESGLG
KTTASLFYTYQNRAAKIPNQNPVAPVLYVELIGSSCSPSLLFKTIIETLK
FKAKGGTENQLRERAWYLIKQCKVEMLIIDEAHRLQFKTLTDVTDLSDKV
KIIPILVGTSSRLDALISKNEQVGGRFAAYFSFEQLSGANFIKTVKIWEQ
QILKLPEPSNLAENQEIITILQAKTAGQIRLLDQILRDAAVKALESGVNK
IDKSLLNSIEGDYSLVAS
>Ava_1237 Biopterin transport-related protein BT1
MLIDSSGLSKVKDSVRKQIFFGNEPSAELIAILSVYFVQGILGLARLAVS
FFLKDELLLSPVQVSALMGIVALPWMVKPLFGFVSDGLPIFGYRRRPYLV
LSGILGAISWISLATIVHTSWAATVAIALGSLSVAVSDVIVDSLVVERAR
EESVSHVGSLQALCWGASAFGGLITAYFSGLLLEHFTTRTVFLITASFPL
IVSGVAWLIAESPVSKDGSDNTNLLSVKQQLQQLRQAFTQKSIWLPTAFV
FIWQATPTADSAFFFFSTNELHFEPEFLGRVRLVTSLASLIGVWIFQRFL
KSVSFRLIFGWSTVISAVLGMTMLLLVTHTNRALGIDDRWFSLGDSLILT
VMGQIAYMPVLVLSARLCPPGVEATLFALLMSVFNLAGMVSYEVGAIIMH
WLGITETNFDLLWLLVLITNLSTLLPLPFLNWLPADDAEIETQALAPASA
NSQVSNLVSELRLREAEPKIVE
>Ava_3068 conserved hypothetical protein
MKSLHRPDLYSWSTFNPARNIDFNGFAWIRPEGNILIDPVALSNHDWKHL
ESLGGVVWIVLTNSDHVRSAKEIADQTYTKIAGPVAEKENFPIYCDRWLS
DGDELVPGLKVMELQGSKTPGELALLLEETTLITGDLVRAYRAGGLEILP
DEKLMNKQKVVASVRRLAALEKVEAVLVGDGWSVFRDGRDRLKELVATLA
>Ava_4338 ComEC/Rec2-related protein
MNYTSREVSRILPSDAFARSLMIQTSGVIICLGYILGLLFTAVPGSGVWI
LVLGIVCAVFLRRGAKPQRLLQKSANDVASNSLLLTPLPRVWLIAGLVGL
LASFYFQWRVPQPTATDISQFVSSDSNNQEQLVIVRGEIASNPRLTRSQR
GQFWLQVSQLDEVRNQKDTKETQKGASGKLYVTVPILKATGLHPGQEIAV
TGVLYKPKAASNPGAFDFKNYLEREGTFAGLMGRQINILDEERQWGWWQI
RERIVRSQVMGLDIPEGPLVSAMVLGSKAVDLPYDVRDLFVQAGLAHALA
ASGFQTSLILSVILQLTKRAKKVTQVSLGGLALIIFLSLTGFQPAVLRAV
IMGFAALVGLALDRKVKQLGSLLLAATILLVFNPLWIWDLGFQLSFLATL
GLVVSVPAITSFLDWVPPAIASLISVPLAATIWTLPLQLFVFGVVPAYSL
LLNIITTPLISIISIGGIISAMFALILPGAGSFVAGFLHYPTDWLIRLVE
IFSQLPGNSLVLGSISTWQLLAIYGLVFLVWLVTWWQKRWWFAGLIAIGL
VLFPAWHSASTLSRVTVLEAGAEPVVVVQDRGTVTLINSGDEGTGRFTIL
PFLQQQGVNQVDWAIATDFQRNNNDAWLEVLQRLGIKNFYAYATNKENSL
ADQAIPQILQKQKGIYQLLPVGQTINLGSTVAQLINEQPMMQLQVLGQSW
LLVGDVEAKEVERIMKAGGWPSPQVLWCNAESLKDLVMMLKPQVAIASSG
SLETTVLSELSKTSTKVFVTAQDGAIQWMPNGEFESFTQVSESKSSVL
>Ava_3202 Alpha/beta hydrolase fold
MPNHDALAGGKVHLRRGVDLQVSHSPGVYPPLVFIHGGTGNRFNFRMQYE
FAQAQGWEVLAYDLGGNGQSSRYSRYSIGRHCRDLARLLVRFGVESPVLC
CHSYGVPIGLEFVQHHPVSAIIAIAGGTHDLAPWWEIPLMKFMAGGGRYL
LHLPGVQKINKLLSTSYSHSVMERFFVENPPPTDFDAYKGLEIFWDYNFF
IRHPLPKNLHIPALVITADKDPTFTANMGDELANHFDDGTHLHFAKGGHL
VMAESAELVNEAIANWLIQKLTIN
>Ava_0244 Major facilitator superfamily MFS_1
MPQKSAALRFVILLGFVSLCADATYEGARSITGAYLQVLGASGTVVGLVA
GFGELIGYGLRLVIGYLSDQSRKYWGITTLGYILNTAVVPLLALTGRWEA
AAGLMMAERTGKAVRTPPRDALLSHGASQIGRGFGFGLHEAMDQTGAVMG
PLAVAAMVYFQGEYRHAFTILIVPAVLGLVVLLVLQFLYPNPSDFEEETE
EQKQQEGLPRIFWIYLGAVAVIAAGYADFPLIAFHFQKSGIATGETIPLF
YALAMGVDAIAALIFGYVFDRLGISILIIAAFISCLFAPLVFLGDTNLAL
LGIAFWGIGMGAQESILKAAIAGMVPKHKRATAYGIFSTGYGLSWFLGSA
LMGILYDHSITTLIVFSSATQLLAIPFFTWVKLKADDSPKTITNQAS
>Ava_3525 heterocyst differentiation protein
MSQEFHISVTPVGQNDYLVRTEEVAPGVPLAEELVTWPVADWLAAAGHLM
NDPLKSVLQGDAFLSMGRESAIARNSVNLVALGQQLYNALFQGTLRDSLI
TAQGIAQNHQQVLRLRLGLKDTRLARLPWEVMHAGDRPLATGPYIAFSRY
QSGISPTSRVPSANRLKLPEDGVVRVLMILASPTDQASLDLLKQESIRLQ
AELHRQLPRSIEGGNYLPEIDLTLLNQPGREEVTQALEQGRYHVLHYSGH
SNLGPNGGEIYLVSSRTGLTETLCGDDLAGLLVNNNIQMAVFNSCWGTYT
ASFDNSGDTGERNLTDSLVKRGIQSVLAMSERIPDEVALTLTQLFYRNLS
QGYPVDLCVSRVRQGLISAYGSHQLYWALPILYLQPEFDGFLSPKLSAAT
SVGSLDEYSSSLAANTASTYSGVLDDGEMSLPIEDMMPSGLVHDSSGVDW
LGEETWGDLVDEIEYDDPSYAEDSAFVSDIFRQLDQQIIGDEETEVPPEV
RQPLPDSHLERPIATAPREDFSRIVPPATHHTAQNLNQDLENFRLLASRN
RVRRQRWQIFGMIGVGAIAIILIFSWWQSRQTSVVRDIPPIPTPSLPVET
QPPTDLRQMPTGMVTAIATEKLNQGDLEPGLAAVEELLNRNALQPAQTAL
QLIPANQEKQASVNFLRGRLAWQSIQTGDKKYSIDDARRYWERAVKANPK
SLSYINALGFAYYAEGNINRANNSWFQAISLGLKQVNTADAAEVSPKADV
PIEALPAYAGLALGMYKNARNRNFPPDKQAQYINEALKLRQTVLEKDPIN
FQLDELSKNWLWTENSLRDWRSLLQQKSPKQSRGR
>Ava_1219 Radical SAM
MAVNLQQAIDIGKYLVTQRLLGRKRFPLVLMLEPLFRCNLACSGCGKIQH
PTEILKQNLTPEQCFAAVDECGAPVVSIPGGEPLLHPQIDEIVRGLVERK
KYVYLCTNGLLLEKSLDKFQPSPYLTFSVHLDGLREWHDKCVDRKGVFDI
AVKAIKAAKARGFRVTTNTTIFEGCDPQEMQEFFDFLETLNTDGMMISPG
YSYDWAPDQDHFLKREQTRALFREILTPYKTGKKNWNFNHNPLFLDFLVG
EKDYECTPWGSPSYSVLGWQKPCYLLNEGHYTSFKQLLAETDWNKYGRAS
GNPKCADCMVHCGYEPTAAMDAMQPQNITRSLGSVFGR
>Ava_3944 Ankyrin
MVGNVHHYLFHANSSELQQMMNRKSPFNLSAATKEWLIGKGYNPEDLEQP
GENGDTALMKATREGINSVVKELIDLGVDINARNNDNNNALWFACFGNHY
DLIHLLLAARINIDNQNDNGATVLMYAASAGKTAVVKLLLQYHPNLYLKN
LDDFQAIDFASNVEVLRLLKNATK
>Ava_3658 DEAD/DEAH box helicase-like
MTNAFNRLAPFIQEYIYHHQWTELRPVQTAACEVVFDTDAHLLIAAATAA
GKTEAAFLPVLTLLYNNPASTIGALYIGPIKALINDQFARLNDLLRLADI
PVYHWHGDVAQSRKNKLLQNPQGILQITPESLESLLVNKHKEILRLFGDL
RFVVIDEIHAFMGSERGCQIICQLQRLANLTKTQPRRIGLSATLGDYSMA
EEWLRLGTDKQVITPKVEAGKRQIKLALEHFYIANDVDESEATAYEKYIF
NLSQSRKCLIFANNRTQTESIIASLRQIANQQAQPDIYHVHHGSISASLR
QVAENAMREPHNPAVTAATLTLELGIDIGHLERVIQLESPLSVASFLQRL
GRAGRRGEAADMRFICAEDEISAEEPLPEQIPWQLLQCIAIIQLYLEERW
IEPIKPIKYPLSLLYHQTMSILVAVGEISPADLAKQVLSLPPFAAISQED
FKLLLRYLIDIGHIQHTEQNKLILGLAGERIVGKFQFYAVFADNKEYTVK
QGTKEIGSIATPPVVGKQFALAGVTWEVTEIDFKKRVVIVKQVAGKATSY
WRGGSGTIHTKVLQKMRLILLEDKEYSYLQKNALQRLRKVRELVKEVGLD
KQYILQLEKGRCCIFPWMGTVAYRTLERLLNNFCRESLEISSIGGVNPYY
LTIKLGKGKFKNLQPEIVSLCEQRIMSEDLISSAEAPEMQKYDEFIPHPL
LRKAFAYDYLDMDEMRKVIGNW
>Ava_4865 conserved hypothetical protein
MLINAEHLLQYQRCKRRPILDIHADRSQRDAPSELLLKLQQDKYAHQQSI
LAKFVYHQPEYPKRNWEAGQAATIELMQRGVEYIYQGVLLANYNQLIDTD
DTKYTCLSRPDLLIKQPGQSKFGDWVYEPVDIELGKRPKQEYQVVTAFHT
QILAKVQEVQPQNAWLMLRGKETPYSVDLFKWIPQMQGILREFIQDLESD
TAPEVFISRQKCNLCLWYSQCYAIAQSEQHLSLLPGVTPIRYTQLQILNI
TSVESLAHTSPTILENLVGFDSQVAAKLVVQAQSVLQRQPLILPFPPPKG
NLIFNSPVEIYFDIEAQPDLNLDYLLGVLVINKQTNTEQFYSFVAEKPSE
EELIWRQFLDLVWQYPEAPIYHFCVYELDTVKRLAKLYRTPYTSVNPVLY
RFVDIYEHLTQSVALPIESYALKAIARWMGFEWRDKEATGAKCIYWYDQW
LETGDRTLLEIIIRYNEDDCRATRKVKDWLVEFVQDKYHLRRVG
>Ava_2763 Peptidase U62, modulator of DNA gyrase
MLTSTLLLSNQLPNLQYSSTPERFDETWEAPLATLLGLGRAAGADFIELF
LERRNYISSLAEDDTITSISPSLATGAGVRVFRGKADCYVSTNDLSFSGL
KAALEKGLSILGLQLPAPNAFIPEINLELLRDYATKRGKDAWLPLCSSIR
EMGEILLDGTAYLKQKANHIQSRRASYFRDWQEVLVAASDGTFARDIRLT
QSVGFNLLCADGANRASIGERAGNTSDANFLRTWDYQQAAEQISESAGKM
LYADYVESGTYPIIMANHFGGVIFHEACGHLLETTQIERNTTPFADKKGE
KIAHESLTAWDEGRADNAFGTIDMDDEGMPAQRTLLIEKGVLKNFLADRT
GSSRTGHPRTGSGRRQNYTFAAASRMRNTYIATGDYTVEELFASVDKGIY
CKKMGGGSVGATGQFNFSVDEAYLIENGKITKPLKGATLIGEAKEIMNKI
SMCSQDLELAPGFCGSVSGSIYTTVGQPHIKVDSITVGGR
>Ava_4231 putative ABC-2 type transport system permease protein
MGIVLSNIIAIYRRELQSYFVSPLAYAIASVFWFIAGLFLVMILLGPNGI
LVYVASLDVQGQQLGVPVPPIDVPAEFIQAFLDRLGWLLLFILPVLSMGL
YAEERKRGTLELLATSPVTNWAVAVGKLLGVLTFFITLVVPLLLFEAIAL
SSSNPPMPATIPLLAHLGLILLAAAILSLGMFISSLTDSTILSAVFTFAL
VLLLLFVDVIAKGIGGPVGEIIGHLSLLKHYNTFIQGIFDTSALILFASY
IFLGIFLTAQSIDALRFQRN
>Ava_2596 Short-chain dehydrogenase/reductase SDR
MTTLTGKTVLLTGASRGLGVYIARALAKEQATVVCVSRSQSGLAQTCNVV
KAAGGKAIAIPFDVRNISQLSALVQQAQDIVGPIDVLINNAGIEINAAFA
NYSLAEIQSIFNTNLLAVMELTRLLLPSMMERGSGRIVNIASLAGKKGVA
FNSVYSASKAGLIMWTDAMRQELVGTGVNISVVCPGYVSQTGMTVDTRVS
APKLAGISTPKSVANAVVKAIKKKTSEVVVNQNPITESLTKFMLAIGQIS
PTSVDRIYRWLGVVDFNQKRAENRVKDGYVAVESHRS
>Ava_1413 conserved hypothetical protein
MLQLLILIVFVICVSSSASKIMLKSIRIENFRGFHSFELQQLGRVNLLVG
KNNTGKTSILEAIQLLCSRNKVDLLAQKMTSRSEFCFDEKSNNSLELDVR
HLFYGHEIQLETRFAITATHDNSSEELVVSIKAHNIHIPEGSRVLDAYQN
DEFIKALLERDSFLEISWNGTEKEPLILLPLSSKGGLPIDYVRRFRGDNK
NSASKVQFVTSSFLETERMIELFDQIVLTPEEKLVEQALNIIDSKIKRIA
PMGSIKFRGSLDGSRGGFFVLLSDINQRVPIGSMGDGVWRMLGLALATVC
AKDGYLFVDEIDTGLHFTAMSDMWRMIWETAKRLNVQVFATTHNSDCWMS
LASIVQQEDAAEDGIRIHRIEKGKEKSVVFTEPQIVIAAEREIEVR
>Ava_3529 Protein of unknown function DUF6, transmembrane
MQLKLSASKFPIAPLLLIAPFFLWGTAMVAMKGVIPHTTPFFMAGVRLLP
AGVLILIAAALSGRPQPNSWQGWLWIALFGLVDGTLFQGFLAEGLVRTSA
GLGSVMIDSQPLAVALLSLWLFQERIGLWGWLGLGLGVTGISLIGLPDEW
IFSLLGTGAEVTIGNWQNLFASGEWLMLLAALSMAVGTVLIRFVTRYVDP
VTATGWHMIIGGLPLWGISAVVESQQWENLVGSEWLALAYATVFGSAIAY
GLFFYFASSGSLTSLSSLTFLTPIFALLFGHLLLSEVLSTLQWVGVFLTL
ISIYLINQRDNLAGQNKKSVMEEISAKSQPMLAEATTKKLNPLAVKVRES
EPETLP
>Ava_3437 Protein of unknown function DUF177
MDAIFIPQLTKAPERTEEVQVKEFLPGLETLTPVRGLVRVQHYGNYLQVS
AKAEAIITCTCNRCLQQYNQRLAVDTQEVIWLDEAVEQEQNLPLEREVAM
EDLLETLSPNGYFYPSEWLYEQMCLALPQRQLCDINCPGILTDNSAESSD
QGIDNRWSGLKALKKHLPG
>Ava_2106 expressed protein
MLFANSLENQLQQWNEVISKQPNNPNAYIRRGMVQFQLAKIEESIDDFDT
AEKLDNRIKPYLWQRGLSYYYADRFAEGAQQFEIDLTVNSQDVEETVWRY
LCMARLVGVTEARNNLLTVKNDPRLIMRSVYDLFAGHCTPDNVFNIGQTE
GNKGKFYSHLYLGLYYEAENNLPLAQEYIVKAADKYKLDDYMWYLAQVHK
KLRGWM
>Ava_2536 Short-chain dehydrogenase/reductase SDR
MSLEQKRRALITGASSGIGKATALAFAKAGIDVALVSRSGDKLTPVVEAT
KQTGVVAKAYTVDLADVSQVEAKIQAIAIDFGDIDILVNNAGVAYTATLS
ETPLADWQQVINLNLTSVFECIKGILPRMRARHTGTIINVSSIAAKQSFP
HWGVYSVSKAGLMALSQTLAQEERVHGIRVSAICPGAVNTGLWDTETVKA
DFDRSKMLTPEIVAQSILYTALLPPQAVIDELTLMPSAGAL
>Ava_2021 conserved hypothetical protein
MTKLILLIGLPGSGKSNLAKQLVAQCPQMQLISTDAIRGQLFGSEAIQGS
WLLIWREIERQLQQTVITNKIALFDATNAQRRHRRELIALARELGFTDIT
GVWIKTPVWLCLARNKKRPRQVPEDVILRMHRQLRDAPPTLQEGLENLIV
HENISSPIQGSLLPISSIDKYGNCPDPVSVNHT
>Ava_0220 Serine/Threonine protein kinase
MTKLYCSKGHENYPGTRFCLKCGEKLVDNFFTQGIQPGLTLSDRYLIVRQ
VGQGGFGRTYLAEDINRFRELCILKEFSPQVQTAYVLQKAEELFQREAKV
LHQLQHPQIPRFREIFRVNLAGKEYLFLVQDYVEGETYSGLLNQRIQQGL
RFTEAEIRQLLQQILPVLDYIHSLGVIHRDISPDNLILRSVDKLPVLIDF
GGVKQVVAVVASQYYQPGVVASPPAATLLGKVGFAPPEQMQTGNVSPHSD
LYALAVTAVVLLTGKQPQELLDTYNLSWNWRREISLSPIFGQVLDKMLAA
RPGDRYQSAQQILHALNPAPVNYPPTQPPIPTPLPTPPPVSTPDTIAVAP
SLPTPDISPAPQRKNWLIPTAAFLLVGTVGLVWLGMSNSDRNSISVDPTP
TPTETQPTNPLDKYSPAERQRKQTLNDRRQQLGIDFNFYINVVNQIFWDR
NPSERGRTLSDGAEDENLRTEWDAVAAELLDKLKPLSNRARRQLGTYTAA
ERDRWKVEVNQINVGSRSLYDLGDAAFFRAFPEQKGKNFIEQPIGQVWQA
FVSDKLSAIVAGTVFQKIVFDPGATSKTVSGTLQPAGGRVFIADLAQNQS
LELQLQANSKVLLSVYSPSGKTIFLEDSEKRSLSTELPESGFYEFVVVST
ASTSADYQLTLTVENPTPPPEPTPTPSETTTPTPTPSETPTETPTPTPTT
AP
>Ava_1863 Short-chain dehydrogenase/reductase SDR
MNGLKGKNTLITGASSGIGQAIAIRLAQEGCNIAINYRKSPSGAEETEEM
ALQKACKNVENCGVKSLLVQGDVSKESDVIEMVNTVVEKFGSLDILINNA
GIQTECPSHEITAEDFDRVIGVNLRGSYLCARETIKHLLTQNRRGVIINI
SSVHEIIPRPMYVSYSISKGGMENMTKTLALEYAHRGIRVNSVAPGATIT
PINEAWTDDPEKKAVVESHIPMGRAGTSEEMAAAVAFLASDEAAYITGQT
LFVDAGLTLYADFREPWSA
>Ava_2605 UbiE/COQ5 methyltransferase
MGFYSQVILPRLLDWSLSDPTLATYRQELLTDVTGEVLEIGFGTGLNLAY
YPSHIHEITTVDVNPGMNTLAQKRIDDSGIKVQQLLLSGENLPMADNTFD
SVVSTWTLCSIANVEQALQEVYRVLKPGGKFFFLEHGLSNKPNVQVWQNR
LTPIQKILADGCHLNRNIQKLVEKSFNHVELKRFTPENFPDLMAHFYKGC
ATKKAI
>Ava_3740 Pyridoxamine 5'-phosphate oxidase-related, FMN-binding
MLDIDEMGSKEIHELLQRIEYGHLGCASEGHPYVVPMHYYFENPNLYIFT
TEGMKTKYIDANPEVCLQVEEIHNLSHWRSVIIMGRAERLTEPQEFDHAM
QLIKAHNPKLSPALNRTWIDVWGRANVIVIYRISPSEMTGRTTEGVSSQP
>Ava_0149 UbiE/COQ5 methyltransferase
MSNNLTKLTYQTLQQGKNYFGLAHKTLSAQLKDIVYPTLRQTKPIPREVI
TKLQDRFNQLLEIDWLEAEKEVYPASLLFDNPWEDFFRYYPLVWLDLPQI
WERVQQKKYQDFSSGIETEGYPSYYVQNFHHQSDGYLSDLSANLYDLQVE
ILFGGTADPMRRRILSPLKDRLKVFDSVSPRQIRILDVACGTGRTLKLIR
AILPQASLFGVDLSPAYLRKANELLSQISGELPQLLQANAEELPYVDNYF
HAVTSVFLFHELPATVRQTVIEQCFRVTKPGGVFIICDSIQMSDSPEMAV
LMDNFSDTFHEPYYKHYSTDNLVERLEKVGFENIEVQVHFMSKYFIAHKP
F
>Ava_3803 GCN5-related N-acetyltransferase
MAAQINMTSLLPRNLSVVIRPVYYRDLDGIERISQESFAAHTPQGASSIA
NRMQWLRRWYGLLKFLSWFPNPLQYRFCAYVAEQGRMLLGMIQVSPFNRT
RSTWRVDRVILDRAVDKQGIGSQLLRHCFEGILEARTWLLEVNVNDTDAL
ALYRQNGFQRLAEMTYWEIDPELLSELAQAEPDLPNLLPVSNADAQLLYQ
LDTASMPPLVRQVFDRNTRDFKTSLFGALRDAVKQWVTKIEVVSGYVFEP
QRKAAIGYFQLQLDRKGETPHVATLTVHPAYTWLYPELLSQLARIAQDFP
QQGLQLASSDYQPEREEYLERIGAKRIEHTLIMSRSVWHKLRESKFVSLE
GIQWTDVLQGLQPARKPIPGGMSWVHTRQQSSPDIPVPSSSEPMAFGIKD
VPNQPDSEEGEIGE
>Ava_1591 TPR repeat
MGSANNQGIGWWFLPKLTCYSLTLLLSAVLPSDAGGVTLRKQGLQIAQQP
ETTQQDAARAAAKRVFQEGMQLEQQGTGESLRQAIAKYQEALKLWQQVDD
KLSEASTLNNIGLVYGSLGEKPEALKYYNQALPILRAVGNRGQEAATLNN
IGLVYGSLGEKEEALKYLTQALPIGRAVGDRKEEAVTLSYIGGIYNSLGD
KESALKYLNQALSILRVVRDKGMEASTLINIGRVYSDLGDKEAALKYLNQ
ALPIGRAVGDKKIEARTLNDVGTLYSALGDKEAALKYLNQALPIIRAVGE
SRREATILHNIGGVYNSLGDKEEALKYYNQALPIRRAVGDTEGEIVTLSN
MAALQHSRGNLQKAQTHIQSAIRIIEDLRAKIANEELRAAYVASVRAYYE
FYTYLLMELQKN
>Ava_2293 ATPase associated with various cellular activities
MSETYPILIHLGEKLNRVIVGQSQLIQQLLVGLLSGGHIILEGVPGTGKT
LLVKVLARLIQADFHRVQLTPDVLPSDITGTNIFDLNNRNFTLKKGPVFT
EVLLADEINRTPPKTQAALLEAMEEMQVTLDGESLPLPDLFWVIATQNPL
EFEGTYPLPEAQLDRFLFKLVVDYPDQTAEKQMLLNRQAGFAARRSDINS
LQPIATVTDILEARQAVKAVKVSESIIDYLLALVRTSRQYPDLALGASPR
AAGAWLQTSQAVAWLEGRDFVTPDDIKAVASPLLRHRLILKPEAMLDGLQ
IDAVIAAVVNQVAVPR
>Ava_0110 sodium symporter
MNEILVIIDKLALFTFIVFTMLGAGLGLTIKQIWEPLRSPRLVILSLLTN
FILVPSFVYLLVQIVPLSEALKDGLLIMALASGPPALPKLAQIVKGNIAF
SVGLMMLLMLGTIFYMPLVLPLVVQGVQINSWDIGKPLLLMMISPLVIGL
FIKAKFAAIAPVIQPILFKLSSTGLFLGLVVRLIIHTNDIIGLLKTGAIF
ICAVFIIFSFSVGYLLGGPGIDTQRVLGVGTAQRNFAAALLVGSSNFDDP
NVVSIIMVTSILMMITVLIVGPKFIELDQPKDGDIKQLEISG
>Ava_2888 Short-chain dehydrogenase/reductase SDR
MISLKNQIVLITGASSGIGNACARIFAGAGAKLILAARRLERLQQLADEL
NQDFGVETHLLQLDVRDRSHVESAITSLPPAWSAIDILINNAGLSRGLDK
LYEGDFQDWEEMIDTNIKGLLYLTRYVVPGMVNRGRGHVINLGSIAGHQT
YPGGNVYCGTKAAVKAISEGLKQDLLGTPVRVTSVDPGMVETEFSEVRFH
GDTERAKKVYQGVNPLTPEDVADVIFFCATRSPHVNINEVILMPVDQASA
TLVNRKT
>Ava_1420 sodium symporter
MQANVFTNVILPLALAIIMLGMGLSLQIEDFKRVTKYPKAVSIGSITQLI
LLPIIGFLVAKAVPMQPEIAVGLVILSLCPGGPSSNMITYLAKGDVALSV
TLTVVSSMVTIFTIPIFANLALQHFLGQTAAIALPIGSTMLQIFLITIVP
IGLGMYIKRIFPATALRLEKVTNRLAIAFLALIILILVIREWNRIPGFIV
QVGVGVVILNILAMLSGWYMSKFFQLNIPQQICVAIEVGIQNGTLAIAIT
AGLLKNQDMAIPAAIYSLFMNLTAFVAIYYGRKLSAINSVSNNLVGKV
>Ava_1799 Alpha/beta hydrolase fold
MDTLLRNARRKLSQGLLFWREAGEGIPVVLLHGAWYESSQWVEVMESLSQ
NFHCFAPDLLGFGESEKPNIHYSIDLQVECIAEFFQALKLEKVYLLGDSL
GAWIAASYALKYPEQVYGLVLLAPEGVQIEGQSQNCQKMRRLSKRSPLLF
KIMRSLSFLTKIFGLDKKIEQDWQTRQKLLQNPTAAQLLFQRQQPEIEAE
LLQSYLSKLEIPVLVLQGGKDKPDALARSQFYTKLIPQAELKIIAHGESN
LPESCVGIVAGEIREFITNSPSYPC
>Ava_4492 TPR repeat
MNADYFFNQGLSDNLSREYQRTVIGGTKPTNRKVDAETYCKRGITFSREL
KDYQGAIAFFNLAVEINPNYAQAYYHRGNARYCLADFTAAIADYDQALQI
NPTFAEYYYCRGNAYLAQGDYDQAIANYISTIEFDPLLASNINEDIANAY
YYRGLHNSDHGNYQEAIIDLQQALQWHPYFAAAYSIRGNIYYKLGEYRQA
IADHERAVQLDPNLAEAYQNRGNAYYALGAYQKAIADYNRTLEINPHQVG
AYYNRGLISFYLNEYQQAFADFNQVLSFNSKDAQAYYQRGLIYEAWQDYQ
SALADYNQALQLNPELAVVYGVRANIHRHLGDYPSALADGNRLLQLQPNF
AAGYCDRGTSRRCLGDYRGAITDYNQALQINPNIAEAYYGRAIAHEALQD
LIGAIADYTHSIRISPDFAPAYCNRGNARRQLGDTKGALADYNQALTINT
QLSEAYYNRGSLHYDQQNYRSAIADYTQALELQPESARYYSDRAHARYAL
QDYQGAVADYTQSIAINPGYAEDWYNRGRSYLLLGYLEEALADLNQALKF
QPHWASAYLLRADILRNRGDYQAAITDFQKSADLYSQEGNTQNYQQILEI
IAGLK
>Ava_0806 Protein of unknown function DUF477
MTHILQQVFRMKKLFIRLILPFLTIILAASLSSSPALASGVYQIPNLTAG
DSTWVLDQGDVISRINEGAISSSLENLAKQTGKEVRFVTIHRLDYGETPE
SFGQALFEKWFPNKEAQANQILLVLDTVTNGTAIITGDEVKPLLTDAIAN
SVAEETLAAPLRDGNKYNQAFLDASDRLVAVLSGQPDPGPPQIVDKVQVE
GTFKKAGETDKGNATAWVVGLLIAATIIPMATYYIYLAVQPSSEG
>Ava_2739 Metallophosphoesterase
MHRFLSGPLSVEQLTVNIAGLSASLQGKKLVHLSDFHYDGLRLSEAMLEE
AIAVSNQIQPDLVLLTGDYVTTTPQPIHQLTKKLQKLQSHAGIYAILGNH
DLYQKNSKTEITQALTKAGIQVLWNEIAYPLGTELPIVGIVDYYYREFNP
DLLLKQLEPTTPRIVLCHNPDTAAMLQAWRVDLQLSGHTHGGQIVIPGLG
AVLPYHKKIVRRVPRKIRRLFPLLFKDYFIVRHPEWLQGLHRLGKNQLYV
NRGLGTYLPGRLFCPPELTVITLEAE
>Ava_1178 TPR repeat
MKRWLLEQGRVKLKRVLTIGVLTALSAITSVSCSNNKEVLVTEIGVNPPS
RRTTNNSQAGQFYVQGQRQHAQGDSQGAIASYDKAIGLDPDYGAAYRGRG
LAYFDLGDKQKAIADYNEAIRLSPNDAEAFNSRGNARASLGDNAGAITDY
NEAIRLSPNYAEAYNNRGNARSVQGDKSGSIEDFNQAIRLNPKYAIAYNN
RGNARASQGDSQGAISDYNQAIRLNSNFGPAYNNRGNARAAQGDKQGALE
DLQKAADIFQRQNNNDLYQQAMNNIKELGQ
>Ava_0393 Protein of unknown function DUF815
MDNPVMSAITSLYPQVQFLQRQAASLLLYQSVLQTDVGIAFQELLQAIRY
ADADARGSLQAYGNYFHALATRKQNWEDYLVSQILIAENPFSKVTQQQDF
KDLPPALISAVKHDLQVLQSLYECNSAVLSQWVQAVAHLPISPVVWYLEQ
DDMGSETALSLHHLEHWADAVEELATHYQKFGIGLFAQYRALRWQDGEFV
GIPYPDPVKLGTLVGYESQQEALLKNTEFLLSGEKALHVLLYGSRGAGKS
SLVKALLNEYSHRQLRLLEVSKADLKDLPKIVEQLRGVPQKFIIFVDDLS
FEEDDDAFKALKVVLEGNLTARPHNVVVYATSNRRHLIREFFADRPSLKN
NEEVHAGDTMQEKLSFSDRFGLTLTFESADQSTYLKIVRHLATQAGISIS
PEELEYQALQWATRHNGRSGRTAQQFIDFLKADTSVFATNNSILDTQS
>Ava_0013 Alpha/beta hydrolase fold
MTLKTQPLAATAIANFDKLVWNWRNYKIQYTVMGTGQPLVLVHGFGASIG
HWRKNIPVLANAGYQVFAIDLLGFGGSDKAVIDYSVDVWVELLKDFWTAH
IQQPAVFVGNSIGALLSLIVLAKHPEITSGGVLINSAGGLSHRPHELNPP
LRIVMATFNRVVRSPITGKFVFNRIRQKAQIRRTLYQVYRDRTAVTDELV
DLLYTPSCDPGAQQVFASILTAPPGPTPEELLPQIERPLLVIWGADDPWT
PITGAKIYEQAQESGQDITIIPIPGAGHCPHDEVPNVVNAQIIDWLAQRK
>Ava_2247 hypothetical protein
MGNLDKKRIETVLFITTSSISGATTSAIPSPGGEIPKQFILTASDILMYT
SIWKIYFEEDLSSKSLLGILVDLGLVTVGAIGTAYIVSKASTAIIKEITN
WTGPLGWGVTAAIAGSLSGIFGVAWALHCDRLYSERKQRA
>Ava_1803 Flavin reductase-like, FMN-binding
MSETKPRDVQVLPIGTDTTVMRSRSWTRLRFEIEYALAKGTTANSYLIQG
NKLALIDPPGETFTQIYLDALQKRLDVTEIDYVILGHVNPNRAATLKALL
EIAPQITFVCSNPGAINLRGVLENPDLPILIMRGEETLDLGKGHDLQFIP
TPNPRYADELCTYDPQTEIIYTDKLFGAHICGDQVFDEGWEAINEDRRYY
YDCLMAPHARQVETALDKLADFPARVYATGHGPLVRYGLIELTHAYREWS
QQQTSADLTVALIYASAYGNTATLAQAIARGITKAGVAVESINCEFADPE
EISAAVEKSAGFVMGSPTLGGHAPTPVQTALGIVLSTATNNKLAGVFGSF
GWSGEAVDLIEGKLKDAGYRFGFDSIRVKFKPNEVTLQMCEEAGTDFAQA
LKKARKVRTQSVPATNVEQAVGRIVGSLCVVTAKQGEVSSAMLASWVAQA
SFNPPGLTIAVAKDRAVETLTHTGNKFVLNVLKEGNHLGLMKHFLKPFSP
AQDRFADVATAEAENGSPILKDALAYLECSVQNRMESGDHWLVYATVENG
KVLNQDGVTAVHHRKSGNHY
>Ava_1504 HAD-superfamily hydrolase, subfamily IA, variant 1
MGLNFVTTIKCRDIAFTNIQAILFDKNGTLENSEVYLRSLGQKAARIIDA
QVPGIGEPLLMAFGINGDTLDPAGLMSVASRRETEIATAAYIAETGKGWF
ESLKIARQALDDAEKYIGVTPAPLFTGALEVLQSLSQAGLKLGIVSAATT
SEVKNFVAQHNLSSYIQAQVGVDNGPSKPDPILFLQACQALGVEPEATLM
VGDAVGDMQMARNAQAAGCIGITWVNKPDNVQGADVVINRLDEIQILES
>Ava_3992 hypothetical protein
MITLGVERSAAAATFNPAPIFDSAAHYATTIPRSDGGVDATDIYYPVVSN
TDKSSLPIALFLQGALVDKSDYSNFANTVARYGFVVVVPNHIRTAISPMG
AVTGLIAEQQQVNDVLTYMQSEKSQGVSPVANLLDPSILVLLGHSFGGAV
GIAAIQGNCFAVLCTEDFNRPDELKAGVFYGTNVRIGQTSGGLPIIDNDD
IPIALVQGNRDGVATPANAQETYASIQDPPKAFITIPGANHYGITNEDNL
IREPIRPTLEQNVAIETIARWSALFLRGTVLNDKGAFDYVFNSGDALDPN
VNVESVTKPIPEYTSVVSLLGLGVIGASSLLRQKQKLITK
>Ava_1622 ABC transporter-like
MQTPSVHQQKATNHLSVFTQFWEDVKVVAQPYWYPTKAEGRAFVDVIRSW
GMLILLISLIAGVVGLNAVNSYWNRYVLDIVIEQRDIDKYNSTLWLSSLI
VITIVILVTSLRYVRKKIILDWYKWLNSHILEKYLSNQAYYKINFKSDID
NPDQRLSQEIEPITSIALSFSTTFLEKVLEMGSALIILWTVSAEIAIYLI
IYTIIGNLIAVYLNQSLNKINQEEIQFKADFAYCLTHVRNHAESIAFFQG
EEEELNIIQRRFNNVIKTAERRLNWERGQDAFGRAYQSAIGVFSMFILTP
LFIQDKIDFGEINQVSFACFMFSNSLGELIAQFGASGRFSSYVKRLAEFS
DALRDVSKKTENLATIKVLEEKRLAFENVTLKTPNHEQVIVEDLSLTVQP
GEGLLIVGPSGRGKSSLLRAIAGLWNTGTGRLVRPPLKDILFLPQRPYII
LGTLRQQLLYPHTDRTMSDRQLEEILQQVNLQHLLTRVNSFDTEVPWENI
LSLGEQQRLAFARLLITHPSFTILDEATSALDLNNEGNLYQQLQATKTTF
ISVGHRESLFNYHQWVLELSQESGWQLVSIEDYRWQKGIKTVDISPKNNL
PSKKRNVISKL
>Ava_1482 GCN5-related N-acetyltransferase
MGFWKTWFSTPESATSRTQPLEEHTVDATGNSNPKSDASAATRNAERIVF
STERDIDLYELEELCDAVGWSRRPLRKVKKAIEHSFLVATMWQVRGNQRR
LIGFARATSDHAFNATIWDVVVHPDFQGKGLGKALMKYVLKKLRSEEISN
VTLFADPHVVDFYRTMGFMPDPEGIKGMFWYPH
>Ava_1948 Cobalamin synthesis protein/P47K
MNTLTAETTSIIPEIPKRGMPVTIITGFLGSGKTTLLNQILKNKHDLKVA
VLVNEFGDINIDSQLLVSVDQDMLELSNGCICCTINDGLVDAVYRVLERE
ERIDYLVIETTGVADPLPIILTFLGTELRDLTNLDSILTVVDAEAFEPTH
FESEVALKQLTYADIILLNKTDLATAEKIQALEDYIQTVKDSARILHTKY
GEVALPLILGVGLTSKDDYTTDDAEAPHEHNHEHHSQDHDHHEHHHLHEH
HSHHLDNDGFVAVSFQSDRPFDIHKFENFLTEQMPQNVFRAKGILWFSDS
ELRHIFQLSGPRYNLHADEWHTLPKNQLVFIGRKLETQQIYTQLNNCLI
>Ava_1586 Small GTP-binding protein domain
MQFIDQAQIEVEAGKGGDGIVAFRREKYVPAGGPSGGNGGRGGSVVFVAV
ENLQTLLDFRYKHIFKADDGGRGGPNNCTGASGKDLIVQVPCGTTIYDAE
TGDLLGDLTQPNQQLLIAAGGKGGLGNQYFLSNRNRAPEYSLPGLPGERK
LLRLELKLLAEVGIIGLPNAGKSTLISSLSAARPKIADYPFTTLIPNLGV
VRKPTGDGTVFADIPGLIEGAADGAGLGHDFLRHIERTRVLLHLIDATSD
DVIRDYNTIEQELQAYGRGLSERMQIVALNKIDAVDRETVDLDALATQLN
HLSHAPVFLISAVTRTGLEPMLQEVWGILDQVNALEEAEVFR
>Ava_0449 conserved hypothetical protein
MHTDTNKFSWFQVPEDIKHLLILAAQSWDNTSVSEQYIQQALAITGKNTD
ILVAAYRFFYYKNNYSLALQTTYQLLDKIRESEKFPDEWEELKPILVNRR
EESTIRLYLNAYAAAGLVLAKLGEIEQAKEISMRVKEIDDKHDFGAGILL
DVLTRPADEDD
>Ava_1552 Serine/Threonine protein kinase with TPR repeats
MLGNTLVGRYQIISHLGGGGFGETFVACDTHLPGMPKCVVKKLQPQANDQ
ATLEIARRLFDTEAQVLYKLGSCDRIPQLLAYFEENAEFYLVQEFIPGHD
LSKELTPGKVFTQDEVTILLQEILTILEFVHEQNVIHRDINPRNLLRRQD
GKLILIDFGAVKQITTQIVTPTGENKSTVIIGTPGYIPGEQAQGNPKFSS
DIYAVGIVAIQALTGLLPHQLEHDADTHEIIWQNHAQVSSEFARFIDKMV
CYDFRQRYASATTALQALTELTKPASDTIAITPTLPPTKFRFNHTKKSIF
IKLLLAILLISATGTASVFVVNSINSNNATELAKQGNTFFELQRYKDALS
AYKKAVDIRPDYAPAWYGKGKTLFRLKQYQDALTAYDKAIQIQPDYVEAW
SGRGFSLQSLQRYAEAIASFDKALQLNENYPEVWNARGEAFSNLKQYDRA
IKSYDKAIEFKSDAYESFYNKGLALQSLKEYNEAINAYNKAIEIKSDYER
AWYNLGNSLVNLNRYEDAFKAYDKAVQYKTDYAIAWLSKGNVLIILRRYP
EAIESFNQVIKFNPNSYQAWYGKGWSQHQNQRYAEAIESYKKAATIKPSN
YQVWYSLGNSQYILQQYQEAIASYNKAVRYQPKHIESWYSRGNALFSLKQ
YQDAIASYEQAIKHKPDYSQAINARDEAQRQLQAATPKPVVIPVMPTPEF
TNPSQED
>Ava_4611 conserved hypothetical protein
MSNLLVQLLLIGLAAGVAGGMFGIGGGAIMVPAMVLLMGLDQKFATGTSI
AAQILPIGLLGAAVYHRSGNINFKYAVIIAVGLLVGNLFGAMFANQPFIT
SETMKKLYGIFLLAIGFRYLLVR
>Ava_1987 Protein of unknown function UPF0011
MQTEPKPGALYIVGTPIGNLEDITFRAVRILQNVDLIAAEDTRHTGKLLQ
HLQVKTPQVSYHEHNRSSRIPELLEHLHSGKAIALVSDAGMPGISDPGYE
LVKACVEVAIPVVPIPGASAAITALSAAGLPTDKFVFEGFLPAKGQQRRE
HLEALQTESRTLIFYESPHRLRETLQDLAEVWGSDRQIVLARELTKLYEE
FWRGSIEEAIAHYQQKEPQGEYTLLVAGNPPSQTLLTEEQLKAELQQLIS
QGISRSQASRQLAKYTSLNRRQVYQIALSIVMNPE
>Ava_0350 conserved hypothetical protein
MAKSGEDYLEIRKRQIERKKRIFTIVSIISFVGSTAFAVVPSIQRAIQNP
PPVTPTTSAESSLQEQAKGYELVLQREPNNQTALEKLSLVRVQLKDFKGA
RAVLEKLVKLHPDRQDYQVILEDMKKKEKEPK
>Ava_5032 Pentapeptide repeat
MRGIVLSPHGWQRFQAAKQKAESQETWGKRFTQEDLSDRTRLSLNTLARI
FKRELGVDRQSLELLFQAFGMELTQTDYVTPITSGAESASHWTNPQQDWD
NAVDASVFYGREAELAQLWQWIVTERCRVVGLLGIGGIGKSTIAVKAALQ
MQTEFEIVVWRSLANAPSLDELLTSLLKFFMPLQGDDPIIPATLEEKLSL
LMQYLRSQQCLLILDNTETILHSEQAGQWRSGYEAYGQLLRTLGETPHQS
CCLLTSREKPQEMALMESAQGRVRSLSLSGLTPDDGRAIFREKGEFTASE
AQWQTLINHYGGNPLALKMVAAATQDVFNGSIAEFLAYISQGIFIFEDIR
NLLDRQFNRLSPGEQKILYWFAIYREPVAIAEIIHNVVGSAAGQSVPQQV
NSLLRRSLIEKIDGLFFLQPVVMEYVTERLIQQVCTEFATRQLDVLQSHS
LILVQAKDYIREIQLRLIMQPVVEWLLSCYRNLSEVEDRARQLLAQQRQP
SRHRAGYAVGNLINLLVQMKVDLRGSDFSELVVQQADLRQVNLAGVNFQN
ADLTKSIFSESLNSAMSIDISPDGETVAVGDSTGLIYLWQITTTKLLATF
EGHTSWVWSVAFSPDGHKLASSGSDTSIRLWDVQSGQCLRVLTEHTGCVW
SVNFSPDGQRLASGSDDQTVRVWNLQGDCLQVLKGHTKNVYSVHFSPDHQ
TLASGSKDESIRIWNVIDGNCLNVLQGHTEGVHCVRYSPDGQLLASGSFG
GSIRLWSGQLHTNAYQSKVLHGHTNWVWSMAFSPDGGILASGSDDGTLRL
WNVQDGQCINVLSGHTDDVLAIAIRGQLMVSASQDQTVRLWNLHGQSLKT
LRGCTSGIRSLSLSPNGKTLASRGQDETIHLWHLQFDGDLSSPLRPDKTW
QRVTDTTAGLTSWTSYLSFSPDSQTVATNGQDGSILIWNLQTESLSQWSG
HDAPVWTVMFNPSGKTLASGSHDQTVRLWDVQTHQCLQVLRGHQDGVRAI
AFGTDGQRLASGSSDQTIRLWEVQTGACLGVLQGHSGGVFTLAFTAHDQQ
LISGSFDQTIRLWDLQTRESIQILRGHTGGIWTIAISPDGKTLASGSGDQ
TVRLWNLQTGHCLQVLHEHRSWVTSVSFSSNGQFLLSGSDDRTIKVWDIG
TGRCIKTLIVDRLYEGMNIQRAKGLTNAQKATLKALGALV
>Ava_0204 TPR repeat
MTNTVDSLFDTGLERYKAGEPAADLIPVFKEVCDRAPKASAAWICLAWLY
LLEDKPNLALKAAQKAVKINPQDPQARVNLAVAMLETGQKGLRQHIDITQ
QLMLVNPDWRDEIQNSIADGLSRKPDWQSLAKVKGWLFNE
>Ava_0565 Abortive infection protein
MVEQQKQEPEIPYLTRTQVLVAMGVTAVLLWTVAKLWLRFGNFLLFEWHW
YPRDFLLGLGVGLIITVLSGLAYSFWKAYRRSADYYLELVLKPLAWPDLI
WLGLLPGLSEELLFRGVMLPALGLDHFAVIGSSLCFGILHLSGSQQWPYV
IWATIVGVILGYSALWSGNLLVPIVAHIMTNLVSSCLWKLRQS
>Ava_2580 TPR repeat
MLRRLSLFITTVIIWQLTSSVTLARSNDPQQADRFPPSPLEITKPDPLVP
TLRDKQQLTVPERQKLEAALDELNQQATAKLQAGDNLGAFEIWNRELRLR
RYLGAVTEVKALSRVGEIAWNQNDSQELLYITQRLQAIQKQVQLQNKNVQ
QSVDLPLLQALGEAYQKVRSLKGAITVYSEVLATVREKQDLVAQVNTLTT
IGELNLNWFDYPQAATAYEELLRLATSGGENQNQEFTYLQKLAYIYQQSK
QAQKSIDVLNKIKAIYSQNNNFTQLPTLQLAIAANYETLARENPALIEAA
FKNYQEAYITAWQLQQYPSAAEAVQKLISLYRSQGQLDEALQASEILLEI
QTRAVNFYGLMEAYDQIGKLYLERKDSSAALAAFQKGLELARQLKHQEAY
FTQQINSI
>Ava_3717 Cobalamin synthesis protein/P47K
MNAPKQGMPVTIITGFLGSGKTTLLNHILSNQQGLKTAVLVNEFGEIGID
NELIVSTDENMVELSNGCICCTINNDLVDAVYKVLEKEEKLDYLVVETTG
LADPLPVALTFLGTELRDLTRLDSIITVVDAANYSLDLFNSQAAYSQIAY
GDVILLNKTDLVDEASLNDLERKINEVKEGARILRTKRSQVPLPLILSVG
LFESDKYYDAADDHSHDHHDHDHDHDHSTCEHDHHDHEHDHSACSHDHHD
HDHSACGHDHHDHEHHHHHSDHLENDGFTSISFQSDQPFSIRKFQYFLDN
QLPTNIFRAKGIMWFDESPKRHIFHLCGKRFTLDDEEWKGTPKNQLVLIG
QNLDRETLLTQLENCVCLPSTSRGKGFGK
>Ava_0580 conserved hypothetical protein
MTIYSPDTINFNGVNGDRYWFDEPFPYTGVGDIDERLSGFPVAVFLPHHR
PRQETPLVIGLQGMCAPYGWNAFIVPTLTEMGIAVALFDPPLSGERSLVR
TSTTLVQNEIKPLIDRGIPFDTALFLSIFRTTARDISKIREFCGDRYGLT
DSRLALFGVSMGVLLSAYAFTANSLGDRLLGTIGHADLPSFAKSWGRPFL
PDLAASPLGGLAESLLQRLQPDLAPLFQLLRLTKNLKYQDEYAWDCNPMN
YIQQVDSHRRVRFLVGANDSIVKIKDARACAQKFPDGDCYVVPGMGHGTR
QSGVSFVDHVRYFLATQLGDWCG
>Ava_2975 TPR repeat
MKNQFVVSFLLKSALVFSPFLLCADISRGQATVSISQQKILAQALTNQER
EELTRLRAEKSDRAQIQADFEQAFSRTTVLLNIWLVILSLFPVAIIALFW
LLRRVAIREIVNRAMSQFEGVDKLETQLIIVKQDAENLIQDTKSINRLLE
REIDSLQQKIKIEQENLSLVTSELLQAKTDNLAQIATEIATFQSKVESLF
GEFSHRLTQSESDTQKLKDMTLENIVKIESLLEHQLAELQKEAEKHQKTV
LGDIDTAGFDFKNYLINLQAETQKYKNSIFDDLSRLQSELQGYLQQQKDI
QLGNIQEVANIFNNQVSELQLEAQKQKYSLDNNLNKLQTDTQIHKDEIIQ
RLEELKDLFQAQVAELQIQTQQELASYLSELKIDTENSKEKIIEELEKYE
SDFISQFSELQFNAQQQKLLILEKLERLETDFVNQLSELQLDAQRRKDII
LQELIENKLPVIVEAQPIHKAQSEVLVNENEQPQLSFDECIEQGDSLFSQ
KSYEEAIAYYEQAIKLQQNNAVARFKHGLTSARLKRFKDAIKSYHQAIKM
QPNYHQAWCDLGVAFGNIRRHQEAFAAFDKATQVKPDDTVAWSNRGLALI
ELEEYEEAIASFDKALELQPNSAKIWDKRGYTLVRLGRDDEAITSFNQAL
EIKPEYASAYYNKAVCYALQRDVESSLENLQQAITLNPKYKEDAATDIDF
DEIADDEKFKQLIATP
>Ava_0980 Short-chain dehydrogenase/reductase SDR
MSTTKKIAVVTGGNRGLGFEASRQLAKKGYLVVLTSRDEAKGKTAAGKLQ
AEGLDVVAYPLDVTSEKSSQQLTEFIRQEFGKVDILINNAAIYIDSQTGN
NSIFHTKIETLQQTIDTNVYGVLRVTQALIPLMQEQNYGRIVNVSSGAGQ
LTDMGSGIPTYRISKTALNALTRIFANELKGTNILVNSVCPGWVKTDMGG
QDAPRTPEEGVDTIVWLATLPDGGASGGFFRDRQSIDW
>Ava_0497 2-phosphosulfolactate phosphatase
MKLFVYHTPELTPKDQVPDCAIAVDVLRATSTIATVLSAGGEAVQVFSDL
DELIAVSETWPPQKRLRAGERGGGKVAGFELGNSPLDCTPELVEGRRLFI
STTNGTRALKRVQDSATVLTAAFINRAAVVQYLLEKQPETVWIVGSGWEG
SYSLEDTACAGAIAHSVVEKSQLPPEKLAGNDEVISAIALYSQWQDNLLG
LLHHASHGQRLLRLECHEDLKYCSQTDVLTVLPIQQEAGVFKTKN
>Ava_0948 NUDIX hydrolase
MPLGRELPQLLKQRLFYKGRKFDFEVNRLRLPNKAEGEWECIRHPGGALA
VPVTPEGKLVLVRQYRFAVQGRILEFPAGTLETTEDPLTTVKREIEEETG
YSAQKWDKLGEFFLAPGYSDEIIYAFLARDLEKLDTPPKQDDDEDIETVL
LTPEELERAILDGEPIDAKSITSFFLARPFLV
>Ava_3450 TPR repeat
MDNSLAVVYLSVLVGILGFAVVSVFRQLFKTRKRESALSRLRSKLSKDKG
TAQEYFELACIYSEKKVYTQAIPLFQKALKAAEEEGEENTAPIYNGLGYT
YFAQEQYDLAIRQYKEALKFKPDYVVALNNLGHAYERKKLTAQALQMYEE
ALKCDPNNATAKRRAESLRRLVTA
>Ava_0311 Short-chain dehydrogenase/reductase SDR
MDLQLRGKVALVTAASKGLGKATAWQFAHEGAKVVISARSELVEKAAAEI
ASETGAEVLAVRADVTQPSDIERVINTTVERFGGLDILVTNAGGPPSGTF
DETDLATWETAINLNLLSAVSLVKYALPHLRQSTAPAILTITSTSTKQPV
KNLVLSNSIRLGVIGLTKTLSQELGKDQIRVNSTLPGWTYTGRVEELINA
RMAKTGQTKDAEVANINAAVPLGRMGRPEEFANVAVFLCSPAASFVNGVM
LQVDGGLNAGTF
>Ava_1841 WD-40 repeat
MTNCELKTAIANNNEHSLQRLIRAINLSKGEFSLILVRCNYQQLREEMRD
NLRDLSKDINIREIYVQSSISALHTTITSQLFLDNPSVASDCLPSALMVF
GLESVIALEDLLTGINQSRDIYAASFPFPLILWLQDEVASSLSRLAPDFK
SWAATTIKFEMPQEDLIALITKQTESLFSKVLEVGAEKFISNDALDLAPK
SQHRHEIESARNDLLRSYNIKLEPGLEASLEFVLGRDKYANDQINSALAH
YQRSLSLWQQEEKKESIEKKYLELNYQQRLWQAIILFHLGLCYHRMADLH
QSANSGYWRHALLWFQQCLDVLEEKKRQDLVAKFILPACEMLYRLQLWED
LKKLAQKSLYLHKSYGNQAQIAQDYGFLAAVAGANNDWILAHELANTALD
ILEKVTGTRQHESWYLLLLARSQRHLGEYEESINNLEWARVVCELQYEPS
LYLEILEELRSLYFIERHEYLEAFKLKQEKIQIEHQYGLRAFIGASQLQA
QRYKINSVLEPQSIPFIPEEVAQEISASGRQQDVNRLIERITRADYKLTV
VHGPSGVGKSSLLKAGLLPAVKNKVIGERIPLPIVLSAYTDWTTSLGRGF
YNVLAPLDISSSLEFTPAILLEKFRSATARNHTIIIIFDQFEEFFFVSSF
QQKLEFYQFFSECLNIAYLKIIISIREDYLHYLLDFERWDKTQPDSFHDL
DVINKNILDKDIRYFLGKFSREDAVAIIRYLTQTSQYEVKDDLINELVQD
LAGETEEVHPIELQIVGAQLQAENITNLEKYKLCGGTKKLIERWLEEVIR
DCGQEHEDFSWKLLYELTDEKGTRPLKTKSDLMLALEKYLDNQSDFDSRW
ELILEILVGSGLILCWREELGERYQLVHDYLVEPIRQRNDYGIIAELEKI
KSEKTQAEVARKLSQEQLNSVLRRRLREARAAGVLLAVMGGTIASLWWQA
DMQKRTAELQTIRAETSETNLQISAIAASSEALFSSNQEFDALLESLRAW
QKLKQAKEVRPETRMRVVTALQQAVYGVTELNRLEGHSDIVWGVAFSPDG
QLLASGSTDRTIKLWRPDGTLLQTLEGHTSAVTSVSFSPDGQTIASTSLD
QTVRIWRKNPTTGEFAPEPAQSLRKHKDWVYSANFSPDGELLATASRDRT
IKIWDRDGNLIKTLKGHQGSVNWVSFSPDSQFIASASEDKTVKIWRRDGS
LVKTLSAHQEGVTVVTFSPDGKLLASADRDNVIQLWQWDSSNHNNPEVDI
YKTLKQHTSTVWSLSFSSDSKQLASASDDNTINLWSHTGNLIKTFKGHSD
AVVSVAFSPDTKILASGSYDKSVKLWSLEAPRLPILRGHEDRVLSVAWSP
DGQVLASSSRDRTVKLWRRQLNKGRLDAHLYKTLVGHTQMVHSVSIDPKG
EILASASEDKTVKLWRLDGTLLKTLSGHSDSVVSVSFSPDGHLLASASRD
HTIKLWNRDGSLLKTLVGHEARVNSVSFSPDGEVLASASDDKTIKLWRPD
GSLIKTFDPHDSWVLGVSFSPTEKLLASAGWDNTVRLWRQDGTLLQTLLR
GFSDSVNAVSFSPTGEILAAANWDSTVKLWSREGKLIKTLNGHEAPVLSV
SFSPDGQTLASASDDNTIILWNLHLDDLLTRGCGWVNNYLKHNNNVDERD
RLLCDDVSDRI
>Ava_3727 Pyridoxamine 5'-phosphate oxidase-related, FMN-binding
MTRKFGEIAFTPEVQAAQEERGSRQTYDRYIANGPANDTITPNIAKFIAQ
LEGFYLGTVSSNGYPYIQFRGGSPGFLKVLDEKTLGFADFSGNLQYITVG
NLSSNDKAFLFLMDYRHRQRIKIWGRAEYIEGESSLLEQVRVADYPAQVE
RAILFHVEATSENCPQHIPVRYSEREVKAMMAPLENRIAELEQQLSEQNS
SSKTGKQLPSRD
>Ava_1040 hypothetical protein
MDAIKQPLKYYFLWFWIFFLTSSVILSPRLSENGLLKFIRIDDILFPITL
ITIFVFISGIHPIKKVILAFVYLYVFNLIVLIITHYSGLNSVSILEKVLP
LVKNFQYLIYFGLCFVLGSKSKNFIRYQRVFISIYICFIPNFLHGLYQTV
TLNFTGYYGLGILNEISPALTGAVFYFATIICSTIIYLMPKINLKAIYLW
ICFSAINAIFTALSGSRSAWLALFSYLLVVGIYTFKNIISSNKINKRNIN
ITLTIVTSILLFFLILLLFRDFGLKLESLLTQVPSRYLNINLGQVQDEAR
VVNWSSVLSMYLEFVHQFPLLGLFGLGSGGIYEIFGQLLNAADSQLVYTI
VSGGLIGTLLYLNALFKLYMFAKKYTSKSNIKLTPIFVGLFWSFMVFSVS
QEVFNLSKTGGLFWISCGLILGIAYSDFNLKNQIANDI
>Ava_0626 Alpha/beta hydrolase fold
MNTSTSTKTWIWRGFSICYQTQGTTGPSVVLVHGFGASWSHWRKNIPILA
KNCRVYAIDLIGFGGSAKPQPDTEMAYTLETWGEQVADFCREVVGEPAFL
VGNSIGCIVVMQAAVSNPEISLSVALLNCSLRLLHDRKRETLPWSRRFGA
PMLQRVLSIKAIGQFFFRQIAQPKTVRKILLQAYVNSEAVTEELVDILTV
PASDPGAAAVFLAFTSYSSGPLPEDLLPLLPCPALIVWGTDDPWEPVDLG
RELANYPQVLKFIPLEGVGHCPQDEAPELVNPILQDWIWEKTRLLEAQEI
EHNSQTIG
>Ava_0034 Metal dependent phosphohydrolase, HD region
MAKSLKKKSWVDAVNLGWVHEQRSSVVLAIAIVSLTGVIGHKLYNQPKLK
IGTVAPQTIKAPRTASIEDKKRTEIERKAARKSLTPVLMVDVRTTAQIGQ
NLEKMLDQGNEIRISAGSFPFYDTSVLSLSSQHYLRSCSDAEWQMMLVAL
ENTGQKRLGLFLEKPSTRSLRKQENSQNLPNTETQTSLLFPPPTVSLGQE
SQVQGVSLDPGQPKESISSPKTQNPEVSTNTQFVQALAELATFRMTTANQ
NLPRLINQITQAREAYAQASVQILHVDTITTQTIYHETVLLELSDYEWTQ
TQKGIRQGAERILAQGIPAGLPQSVLQNAVTLQVQAFVPKSAESLATKLL
LAVLEPNLKKDEEQTREHAQKAASVVAPVMVEVKQGAVIVKKGKEITEWD
FEVLEHYQLISRENNWLALLKLGGLVTGGVCIFVLVETRSKCPLRQRDRL
LVLLLTLSTPGVLAMGVSYTTWGAVGLLLGSFYGRELSMTVICLLLFILP
MSMEISTIGLVAGAAGGILGSYIAHRLRSREELALLGGAIAITQGGVYLL
MKVLIGAAFGSSWYLILQEAGLFTLSGLAWSVVALGLSPYLEKLFDLVTP
IRLAELANPNRPLLKRLATETPGTFQHTLFVATLAEAAAKHLGCNVELVR
AGTLYHDIGKMHDPLGFIENQMGGPNKHETEIKDPWKSAEIIKKHVSEGL
VMARRHLLPTAIQAFIPEHQGTMLIAYFHHQAEQMAQTDPNIVVNEADFR
YDGPIPQSRETAIVMLADACEAALRSLKDVNTEQALTVLNNILRARWQDN
QLIDSGLTREEMSQIAEIFVEVWQQFHHKRIAYPKMKSGKG
>Ava_4752 Major facilitator superfamily MFS_1
MTNKTIDSPFAPGLPALYIVAFLSGMSLGLFNPFISTLMKQNNINDIVIG
ANSTLYFFIIAIGTPLVTKILSKIGLRKTMMLGFLLMGITAPLFPFTTQL
SAWFLIRAVMGLACCLYLISGQTAINYFCNDKNRGIVNGLDALCFSLGFG
IGPVMGAAFYNASPKTTFLLGSGLILSGIIVVYLGLPEKEIKFQIPRFQI
IKKLKLPLHGSFAYGFSVATLVSLYPLYLLEQNYGVERIGYIFGLFILGG
LISTVPVSHLADRIGKIKVLKYSVIVVIISVIGLSFIDDPNITPFLAFIS
GVGMSPIFPLSLALIGSRLAVDELSSGSALFTSIYSAGCTAGPILSAIVM
TLLGTQYIFVLMMVIFVLFFLSLSKQNKYNHSLLSVER
>Ava_0076 ATPase associated with various cellular activities
MTLTANNKKRAVLRVRPGQFVVTPAIEQVAIRALRYLTSGFAIHLRGPAG
TGKTTLAMHLANCLDRPIMLIFGDDEFKSSDLIGSESGYTHKKLLDNYIH
SVLKVEDEFKQNWVDSRLTLACREGFTLVYDEFNRSRPEVNNVLLSALEE
KILTLPPSSNQPEYLHVNPQFRAIFTSNPEEYCGVHSTQDALMDRLVTIN
MPEPDELTQTEILAQKTALNRADALLVVQLVKAFRSRTGGEKTSGLRSCL
MIAKVCAEHNILVAPESSDFREICADVLFNRTNWSASEATTIFLELLNHL
NLEEIEEFKNSITSEDTDAIADEDHNAINESGFPTIIDSQFGTLDSEVLE
QPGVEDSIPFEREIYLYLQQYKSAALALLQQEFELSRTVATNALNSLEQK
GLVSKNNHVYTIEEPNQP
>Ava_4328 Short-chain dehydrogenase/reductase SDR
MSVNSEQLKTLITVEPSSQSLIFRSAISMSLELNLTGKTAIVTGGSAGIG
LAVAKALYSEGVNVAIASRSQERLENAVSAIQSLPTPGAKVIAISADLTQ
AESVDRVVSSTLAQFGQIDILINNAGSARAGSFLDSTDDLFLDAWNLKLL
GYIRFVRAVVPHLRSQGDGRIVNIIGGAGRTPRPNFLAGGTSNAALLNFT
KGISKELAEYNIRINAISPGATATERAETLARQNAQARGITVEQAKAESL
QSIPLKRIAQPDEIAALALFLVSDLAASITGTEIIVDGGSTPGV
>Ava_2357 Alpha-2-macroglobulin-like
MIIRICIRCFLVLTLVLGTGGCNFFGINSAREPLPAVSPLTPPKLPDWIE
QISPIGQAQPLNQIRIRFKEALIPVESLDSPEQQQLLQKFALWPPLPGQF
RFLTPRMVGFQADKALPIATRLQVTLKAGLADLKNHRLNKDLSWTFNTPS
IDLTNLPGVNPMEKADAEPIDLQPKLQFTSNVELDLASVQEHLQLIPEGK
NEGLHFQVTLNKEENPLNNEEPLKKFDPSARNWIYNLRPQKNLEKATSYR
LVFSPGIRPAYGNLATEKEFASKLSTYSPLAFQKINFYGQPDAGGTYGRF
IKGSPQLEFNNILVADSAKANIQISPAPKDISRLLQVNDEDRIIGINPYA
LEPAKTYTITIGENLQDKFGQTLGKPVSLKYDTGDLAGDIWVPSDLNIFP
SGKDLRLNISTVNLPESKYKAAYRVVKPTDLVYFNYGNDLLTKPAEWQSF
QVSGKKNQSVDITVPLRERINAKTGMLAYGVQARTNKYQENGKDLWREPT
TYGLVELTNLGVFSQWFPESGLIRVNHLTDGAPVKAAVIEIYQSKLQAKS
RPEPVPCATGKTDENGTFRINRAELQQCTAGSQNSIKSPELLVIARENED
WAFTRTDEYSGVYGYGIDAGWQGNKPESRGVIFSDRQLYQQGEKAWFTGF
ADYLQNGVIQQDKNADYQITLVNPDGQKTSLGTQTTNEFGTFSLEMPINK
TQSLGYYTIQGKGKNGQEISGEFRVAEFKPPNFKVEVKLDKEFAYIGDDV
DINASSNYLFGAPVEGGEAKYFITRQQANFIPKGWEEFTFGRQWFWPEET
PTISSDVLQSNSQLNTNGKSSQTVKVAKDLPYPMTYRVDVQVADVSNLSV
ANSQSFTALPSNRLIGLKSNFIADAGKAFPIEVVVTKPTGEVIAGQRVRL
ELQQIKYSSVTQLVEGSETPKNQVEYKTVAQTEITSTSNSQSVNLTPTES
GAYRIRVNFSDAKNELSATDSQIWVTGGNAVFWGTRDKDVLEVKLDKKEY
KAGETATALIQSPYADAELYFAVIKDKPIYQQITKVQGNAPQIQFQVTPE
MLPNAAVEAVLVRQGKPISQVEVGSLDNLVKIGFTPFKVNLEDKYLKLQV
KPVQTSLEPGAEETIQLELKDNQGNPTKGQFTVMVINEAVLQLSGYRPPN
LVDTVYAEQPISTRFTDNRPDVILQPQDIAKPKGWGYGGGFSTGAANTRT
RTNFQPLAYYNGSVLTDANGNAQITFKLPDDLTTWRVMAVATDGNLRFGN
GDATFITTKPLLTNAILPQFVRPGDRILAGLSVTNNTGNRGNLSINGELS
GTVKFNSKNPTTTTLQTQAESATQAYRFPMVADSVGFGKVRFTTQLNGTA
DAFELPLQVKPLEITEQVVETGVSQKQIKIPLNVDKNIFPEAGGLDIQLA
STLIPEIKAPAKEVLTDNDLPFTEPSASQLIIATNLQTLAQKYGQTFAEF
NSSQQANLAVEKLRKLQISDGGFAAFPGQEKSDPWVSSYAAESLVKASQV
FPDLVDSGMLSRLKTYLQKVLANPGEYDFCKQQLCKRQLQLNALIALAEL
GDKRNTFLTDIYEQSNKFDLVTQIKLARYLSQFPEWQDESQQLLNKLQQN
IYETGRTAVVSLPPSWGWMSSPTAVQAQALRLFIAQQSQPKLIDKLLQSL
LALRRDGTWQTDYNNAQALTALVEYSQLQPTPPNFVATVQLAGKKLGENR
FAGYKNPSLQLNVPMNQLPRGRHDLTLQKSGNGTLHYLVAYNYRLQGNQP
GRFNGLSITREISQVNAEKVLRKTGIYALDQPLTLAPGQVFDIGLEIIAD
RPVDHLVIKDPLPAALEAVDASFQTTTAALQAKADSWELGFRNIYSDRII
AYADHLEPGVYSLHYLVRSVTPGTFSWPGAEVHLQYAPEEFGRTAEMKLI
VEETEK
>Ava_3668 transferase hexapeptide repeat
MEKTEKQKMLAGELYLADDPELAAESKRASRLLRMYNATTEEQPEQRRQI
LQELFAQIGEKITIVPPLHCDYGSNIYVGNGVYMNYGCVILDCNKVEIGD
NVLFAPYVQIYTAYHPTEPEIRLSGRELAAPIKIGNNVWIGGGVIICPGV
TIGDNTTIGAGSVVVKDIPANVVAVGNPCRIIRNLT
>Ava_1644 HTTM
MAANEGISRNILTKKLDDVFGLDLRSLAAFRIGISLILLTDLGIRFSDYI
AHYTDAGVMPRILLADISKPWHWSLHAISGEPIFQKVLFGIAAVMALLML
VGYRTRIATIASWVLLISLHNRNPALIFAADDVLRALMFWAMFLPLGASY
SIESALNTNTRKLPERIVSGATFALICQQCFIYIFSAAFKTKSPVWVDGS
AVYYSLSFDQYTTPLGHFLLNFPPLLTLFTYVTLVLEWVGPLLLFIPFRN
SFFRMCAVTTFILLHAGFGLTLNLGIFPFLSIFSWLTFLPSSFWNGLHKR
LQTPERQGLTIYFDADCGFCKKVVHLLRTLLILPGTPLLMAQEYPDIHTD
MQTYNSWVVEDWQGNRHFKFEGIAYIVSLSPIFRFLVPLLTWKPVMAVGT
KFYEIIASNRKIAGNFTKPFKFKPIQIRSSRILNILALLLLLYTFVWNIS
SYSPDAFKRKVWQSTEFIGRATRLDQSWSIFAPAPPRDDGWHVIPGRLQD
GTEVDIFRGGSPLSWDKPDLGLRSAIYQNMQWRTYFINLNRAIGKKLYPF
YGKYLCRTWNAQHTNGQSLKSFDIYFMSERTVPPGKQQNVEKKQTWQQSC
SQ
>Ava_4144 Phospholipase/Carboxylesterase
MSIERRSGLLPYLFSRAAEETNKPAPLVLFLHGARDRGTDINVLLKWGLP
RFVDESSPLPYFFVAPQLPEGQTWVDREADVIALLDNLIVSQSIDPSRVI
LSGFSLGTAGAWHIAAAHPGRFAGLVAVSGRVPKTLEANQLAALKEIPIQ
IFQGAKDEKLSVEDTQQIVDTLHGLGGTVDFTVIPEGDHFIADEVYTDSK
LQQWLISQSRRPASVVA
>Ava_3876 transferase hexapeptide repeat
MTNDKPFVDLRLYDQSWFDRGRSAWYILLWWLVQAIAFPLTPHPSSNIRC
WLLRLFGARIGRGVVVRPTARFTFPWKVTIGDYSWVGDDVVLYSLDEIHI
GQHCVISQKSYLCTGSHDIQDPTFRLKVAGITIGNGAWVATDCFIAPGVE
IGANAVIGARSSVFTNMPAGQVCWGSPCRPQHPRLKE
>Ava_1153 Serine/Threonine protein kinase
MQVIGTLTYKKIQQIGVGQGCNSQVFLIDEEQLGGLLVAKEVDKRRFHSS
QTYFQEAKIIFASQNPNVVPINYACETPNTITLVMPYFSKGSLADRIQQN
PLSLTEVIRMAQGVLNGLHHIHIKNYLHLDIKPSNIFFSNTDQPMIADFG
QSDSLDQNGIVHSPQMYFHTLPPESFPPNATATVESDIYLMGVTLYRAIN
GDQIFNSQKIPINSQLDFIILQDAIQRGKLPDQNYFLPHVTQSLRKVIRK
ALNVDPKKRYKSAIELADALGKIEIPLDWNTQISSNGDILWIASQGKGKP
YLGVELVKNLHDLWNVNVFTDNQGTRRKKLQYCRESLTFSDAETHLEKTF
ATLA
>Ava_2700 Protein of unknown function UPF0118
MQSVNKLPRWLTVGLAFPVAILNGWLLLQVVQYFQPLVNVVAAAILLAFV
LNYPIQFFQERGVKRNLAIGSVLLLAVVILVGLGVTLVPLIIEQLNELVN
ILPYWIDSGGQQIDAFQKWAATQQLPVNLSGLVTQILERVSSQLQSVTGR
ILGFAFDTIGVVVNVLLTVVLTIYMVLNGDRLWDGLYQWFPTHIGSKVRQ
ILREDFHNYFIGQATLGAILGLTITLAFVTLRVPLALLFGLAIGLFSLFP
FGTGIGISIVSLLVALQNFWLGGEVLGVAVAIDQVNSNFIAPRILGNLTG
LNPVWVVISLLLGAKLGGVLGLLIAIPTASFIKDIADSWRSGELNAINAL
DVEPKKVEIDVNVSSIELFTKKE
>Ava_0728 Major facilitator superfamily MFS_1
MQPSDLDRKILPLSPSHAKRQNRASDPTAPAHLSAVPYPVPNHKDSQESS
PQDVPTNEKNGISQNPQTDLPNELEKPSSEPLKILEFNGHGQSVPVVTTP
EATTDDSGSGGEANSGDIEHQGFLAVFKNPNFLALWGGQVFCQLADKVYL
VLMIALINTQFQASNQSISGWVSALMMAFTIPAVLFGSVAGVFVDRWSKK
AVLVATNAWRGILVLAIPLLLWLTHDWQPIGVLPVGFLIILGVTFLVSTL
TQFFAPAEQAAIPLVVKEQDLLSANSLYTTTMMASVIVGFAVGEPLLAVA
DGIWSQLGGNNGFGKELLVGGSYAIASIILLLLVTHEKTHAPETEFPHVF
SDLRDGLRYLKENYRVRNALLQLMILFSVFAALTVLAVRMAEIIPNIKAS
QFGFLLAAGGVGIAAGATILGQFGQRFSYTQLSLCGCLGMAASLVGLSIF
TTQLGLVLLLVTSLGIFGSLIGIPMQTAIQTETLPEMRGKVFGLQNNVIN
IALSLPLALAGVAETFVGLQAVFLGLAAIVFSGGILTWYNSRD
>Ava_1791 Protein of unknown function DUF81
MTWIIGHLLAVGIGISLGLLGGGGSVLALPVLVYVMGIAPKNAIAMTLVI
VGTVSLLGSISHWRAGNIQWKTAYIFGAATMLGAFFGARLATLPFITDTI
QMLLFGLLMLVASVIMIQRSMGTKTTYDELPYPPPVCKHCWLWLMSEGII
VGGLTGLVGVGGGFAIIPALVLLAKLPMKAAIGTSLFIIAMNAIAGFLGY
LGHITLDWNLIFSFILAASGGTLLGAYLTRFVPAARLQKSFGYFLLAVAA
FVLFQNRGVFSPKSAASAKVSLQKQAFIDDTF
>Ava_2845 Serine/Threonine protein kinase and Signal Transduction Histidine Kinase (STHK) with GAF and PAS/PAC sensor
MIATQVSIPGYIVHEQLYDGSRTFVYRSVKETDNKPVVIKLLKNHYPSFS
ELVQFRNQYTIAKNLNYPGIIRTYSLESFQNGYILVMEDFGGISLKDYFF
KNYVSSLDEFLQIAIALCNTLDILYKERIIHKDIKPSNILINPETKQVKL
IDFSIASLLPRETQTLVNPNVLEGTLAYISPEQTGRMNRGIDYRTDFYSL
GVTFYELLTRELPFPANEPMEVVHCHIAKTAPLAHEVNPQIPPILSAIVS
KLMAKNVEDRYQSALGLKYDLENCLSQLQATGKIESFKIAQRDVCDRFII
PDKLYGREAQIEMLLQAFDRVSLGATEMMLVAGFSGIGKTAVVNEVHKPI
VRQRGYFIKGKFEQFQRNIPFSAFVQAFRNLIGKLLTESNSQIQQWKVQI
LEALGENGQVIIDVIPELELIIGQQPPTKDLSGSAAQNRFNLLFQKFTHV
FTSKEHPLVIFLDDLQWADLASLNLMQILIADTEYLLLIGAYRDNEVSPS
HPLMFTLGEIKKLPSIINTITLTPLSQIRINQLVADTLKCTEALAWNLSQ
LVYQKTQGNPFFSTQFLKALHQENLIQFNFFEGYWQCDITAINQQSLTSD
VVEFMVFQLQRLSASTQDVLKLAACIGNYFDLSILAIVSGQTEVEIATDL
WQALQEGLILPTSNIYKFYQQDTLLDHQRITNDNGKRVFTYKFLHDRVQQ
AAYSLIPNNQKAVTHLKIGELLRQNLSQIEQEEKLFDIVGHLNRGQELIT
QLSEREALAKLNLQAGVKARNSTAYAAATEYLQLGIELLQTNCWQTQYEL
ALNLHIAAVEVAYLNGDFEEMEKIAAMVLRSAQTILDKVKIYTIQIAAQT
TQSQMLEAIAIGTDALSQLGVEFPREPDEAEIAKVLQAITNQLQSRQIGE
LVDLPVMTDPTSVAAMQLLGRLFSPIFVGIPGLLPILSAKMVSLSLQFGN
TPTSTVGYAIHGLVMCAVLQEVETGYSFGKLALSLLERFNLPEYKSMTLH
LFGDFIQHHRETLQATKLTLKDSYRIGMDTGDFLRAGYSIVGYSQVRLFS
GVELDICESELATYSAALAQVKQFSPQVYLNLTWQTVKNLREPVNQPDCL
IGSAYDETQMFPKHFQDRELTAISTAYIAKLMLAYLWGNYSPAISYIKQI
KLYLMAVSGLICVPIFHFYAALTYLAIFRTQSKPEQAEILAQIATHQTRL
HHSAQNAPMNHLHKWYLVEAEKQRVLGYYYEAGDRYDRAIAGAKANGYIQ
EEGLANELAAKFYLDWGREKAAIWYLQEAYYCYAKWGAKAKVADLERRYP
QLLAPILQQSRSTLSTNETIFTVGTVTKSSSSVSSSSSNVSDTLDLTAIL
KASQAISGEIELEKLLSSLLRIIIENAGADKCVLMLLQDNHLLIQGSITQ
GYKPIMLPSLPVEDSLEIPHKLIYKVKHSQHTVVLLDATADTTLANDPYI
IRQKPLSILCSPILYQGKLLGILYLENNLVRGAFTSDRVQLLNLLCTQVA
ISLENAQLYQRSQENAQQLERSLEKLRLSEARFQKLANNVPGIIYQIRIK
ADGSASIPYVSSGCQTLYEIAAEDFMSGKYSLRDFEHPDDRAKAFQIVVE
SAKNLTPFQHEWRIITPNGNIKWVKAASQPERGEDGEIVWDGILIEISDL
KQAELDLQQAQLQIIQSEKMSALGNLVAGVAHEMNNPLGFIAASLKQAKP
TFADILEHLKIYQETLPDKTEEILDHESEIDLEYTIEDLPKMLDSMTIAC
DRLKNISTSLRTFSRADRDYKVPFNLHEGIDSTILILKHRLKSNEQRPAI
EVITNYGNLPQLECFPGQLNQVFMNILVNAIDALDESNHGRNFEEIKSHP
NQITITTSIADKSVKISIADNGKGMSEEVKHKIFDHLFTTKGVGKGTGLG
LAIAKQIVVESHGGQLKCNSALGEGTEFIIEIPV
>Ava_4331 Protein of unknown function DUF6, transmembrane
MLTVLLSSFFLTIHNVTVRVLFSEHLVLGVFVLGGYVKPDLPNSFLLMFM
RMLLVVPLMATLALKLYPSAGKEIQDLFRRERFDVLIQAVGCGILMFVYI
AALYVAIGLIPTGIALTLFFTYPVFTALLAWKFFGDRPTLFRWLVMSMIL
IGGVLTIPQSSTSYSSNTVAIGIFASLSAGVIYAFYNIIAQKCLEKFHPV
PFTWISFTSTLLLSGVNLLIFSPSSSQLDWTPLWIGSIFSGVISFIGHIL
NNLGIRMIGATKASIVGSSSPALTALVAWITINESLNLIQSLGIAVVTLG
IALLSAEGFFHQRSPS
>Ava_2943 GCN5-related N-acetyltransferase
MPYLCLFINKFVMSDVIIKIAQLPEEFPVVATIRKIVFQEEQGVDAALEF
DGKDDICEHLIAYLDDQAVGTARLRYLDDKTVKIERLAVLSIARGQGIGQ
KIMEKALEFIAHKNITEAVVHAQTYIKSLYQKLNFTEVGEIFIEASIPHI
EMRKKI
>Ava_4244 Nitrogenase cofactor biosynthesis protein NifB
MTPPPTGLLTSATPQTKTQPKSNSCGCSSKTDTTTGLDEKIKARIEKHPC
YSEEAHHHYARMHVAVAPACNIQCNYCNRKYDCANESRPGVVSELLTPEE
AAHKVLVIAGKIPQMTVLGIAGPGDPLANPEKTFRTFELIAEKAPDIKLC
LSTNGLMLPDYVDRIKQLNIDHVTITINMVDPEIGTKIYPWVHYRRKRYK
GIEAAQILHEKQMEGLQALQEADILCKVNSVMIPGINDEHLVEVNQVIRS
KGAFLHNIMPLISAPEHGTHFGLTGQRGPTAKELKQIQDNCAGNMKMMRH
CRQCRADAVGLLGEDRSQEFTKEKFMEMAPEYNLEQRQTVHAGIEKSQKE
IQVVKEKVVETLQTTSFNNSPKILVAVATKGGGLVNQHFGHAKEFMIYEV
DGKSAKFVSHRKIDHYCQSGYGEEATLDNIIHAISDCQAVLVSKIGNCPQ
EQLLKAGLQTVEAYDVIEKVALEFYEKWILEARD
>Ava_0610 Protein of unknown function DUF6, transmembrane
MIKFRGRSRLIHRVSGQVYLWLAIFIFGASSAVTRKLTEIGARHFIDGRN
PISLCNVLFVGNLCALMVMLLIYNRQWNKATLAQLSKKDWLGLTAVAILS
GALAPGLIFQALSLTGVNNVILVGRLEPTLALALSVWLLKERIGFGEFIG
AIAAFTGVILTIILQPPGSDMMNMGGWQLGLGELLVALGSITLAISTIIG
KKYLTHIPLGIYSIFRTALGTVIFFFIALLLYGRDHFADVFSPFLWQWMF
LYGGVIVVVGQSFWIIGLKSSTVSTASLVASFTPIAGIMAAYFILGEVPT
MAQYLGGSLIMLGIFLSQVSKSRQTSHKSPIASTPTEQKVEMDIGFKGI
>Ava_1438 Flavin reductase-like, FMN-binding
MNLIQSIKDWAKSTRFLQFFDKPKTKSMSDSKPRDVQVLPIATNTKVLRA
RSWSRLRFEIEYALERGTTSNSYVIEGDKTAIIDPPVESFMGIYLEALQQ
TINLKKLDYVILGHFSPNRIPTFKALLELAPQITFVCSLPAAGDLRAAFP
DDNLNILAMRGKETLDLGKGHVLKFLPIPSPRWPAGLCTYDVQTQILYTD
KIFGAHICGDDVFDDNWESFKEDQRYYYNCLMAPHAIHVEAALEKISDLQ
VRMYAVGHGPLVRTSLIALTQAYADWSKAQKEREISVALLYASAYGNTAT
IARAIALGLTKGGVAVKSINCEFATPEEIQSNLEQVDAFLIGSPTIGGHA
PTPINTALGIVLKVGDNNKLAGVFGSYGWSGEALDMIEGKLRDAGYRFGL
ETLKVKFKPDDVTLKFCEEVGTDFAQTLKKAKKVRVPQQAATPVEQAVGR
IVGSVCVITAKQGDVSTGMLGSWVSQATFNPPGLTVAIAKERAIESLMYP
GGKFALNILPEGSHLEYMKHFRKNFAPGEDRFANFTTTEADNGCTVLADA
LAYVECSVDQRLECGDHWVVYATVDNGKLLKPDDVTAINHRKTGTHY
>Ava_3407 Cobalamin synthesis protein/P47K
MAQKIPVTVITGFLGSGKTSLIRHLLQNNAGRRIAVLVNEFGELGIDGDL
LKSCQVCPEDEDGGSNIFELTNGCLCCTVQEEFLPTMQELLKRRDSIDCI
VIETSGLALPKPLVKAFRWQEIRNAATVDAVVTVVDCAAVASGTFASDLE
AIAIQRQADDSLEHETPLQELFEDQLACADLVVLSKTDLVDAETKSQVEE
LVKQELPRVVKMVESDRGQLDPSILLGFQAAVEDNLDSRPSHHDTEEDHD
HDDDITSTHLILDRDFDPEKLQQQLQTLTNQQEIYRIKGFVAVPNKPMRL
VMQGVGNRFDKFYDRPWQPQEARQTRLVFIGRDLNSTEIESQLVAL
>Ava_1934 conserved hypothetical protein
MTQQKAPSQRTTVKRVPKRANYESETIYQILDEGLVCHVGFVADGQPVVI
PTAYGRIDDTLYIHGSPASRMLKTLQQGLDICVTVTLIDGLVLARSAFHH
SMNYRSVVVFGKATLVEDTEQKLAALKAFTEHVILGRWEEVRSPNRQELA
GTLVLSLPLTEASAKIRTGEPIDDEADYQIPVWAGQIPLKLTAATPINDS
RLDSSIELPVYVKKYTRPQKGAA
>Ava_4309 Peptidase M16-like
MFPASVFRLDNGLTFIHQEIPTTPVVVADVWVRAGAIREPEPWFGMAHFL
EHMIFKGTATLPPGTFDHQIENRGGVSNAATSYDYANYSLTTAAPYLGDT
LPYLADLLLNAAIPDDEFSRERDVVLEEIRACYDDPDWVGFQCLSQSIYQ
DHPYGRSVLGTEEELMQQSPEAMRRFHRAHYQPENMTVVIAGGIAQQPAW
ELVNRSFENFSEPVECPKINPKPKPIIKGIQRQELSLPRIEQARLLMAWV
VPGVEKLRTAYGLDLLSVVLAEGRTSRLVRDLREELQLVQGICSNFSLQC
ESSLFTVTAWLEPENLEQVEDLILSHLDDIQTSGVTEQEIARTRRLLCNE
YAFSTETPNQLTGLYGYYNTIAQAELAVTYPHQIQSFDTQELQQLAKQHL
SPQNYAVTILKPL
>Ava_1369 Flavin reductase-like, FMN-binding
MTIATSSRPRDVQVADIGENTLILRSRTWERLKFEVEYSRQRGTTANSYL
IQADKKALIDPPGESFTAIYLEQLAQYLDFTTLDYIILGHVNPNRRVTLQ
ELLSKAPQATLICSRPAANALKTAFPEWESRIQAVRFEDILDLGQGHQLT
FVTAPTPRWPDGLFTYDSATKILYTDKFFGAHICEDTLFDEDWRKLDAER
HYYFDCLHAPQAKQVEAALDKVVMLGARYYAPGHGPVVRYSLSRFTYDYR
QWCQGQKAQDLNVALLYTSAYGNTGILANAIAQGLVQNDVNVQSVNCELA
DTAEITRIVEACDGIIIGSPTLGGHAPTQIQTALGIVLSTAAKTKLAGVF
GSYGWSGEAIDLIESKLKDANYRLGFDTIRVRFSPTPEILQQCQAAGATF
AQTLKKNKKLRTPRQVIPEAKIDRTEQAVGRIIGSLCVVTTRDQESHKGI
LTSWVSQATFNPPGIMMAIAQEQNADLMSHIGDQFVLNILKEGRNVRRYF
SRQSTLGDNPFANLETKTADNGCLILTEALAYLECTVTNQLECGDRLLIY
AVVDKGEVLANDGVTAVEHRKSGSH
>Ava_0734 Protein of unknown function DUF1230
MMKSSLANCPVPVDQQPLNEYEELKTSWLFRDCALNWREYATKLIWIWSL
SWLVAGPVAAASFPPNKQLIHFLLCGAAGASVGVVLSLVRLYLGWLYVRD
RLYSMTVFYEESGWYDGQTWMKPQEVLTRDRLIVTYEIKPILQRLQFTFT
GLAGLFLIGTIVWHLF
>Ava_3513 NUDIX hydrolase
MSKLQEWQILNSKMVLDHPWCQVRQDEVKLPSGTIIDDYFVNVRPDIVLV
LPITDNREVIFVRQYRHGVGDFFLELPAGRFDPTQESAEDAGLRELQEET
GYIAQQLIKIAILYDNPSKDTNQIHLFLAENVVKVGEQNLDITEEIEIIL
IPLESVLEKITQGEISVAGTIAALFLGLNFITDK
>Ava_2485 4-oxalocrotonate tautomerase
MFEGRSVETKKQLLQDIIRKINEQLQISVYDIEITLLEIPKQNWGIRGVP
GDELNLSYKVEV
>Ava_4599 conserved hypothetical protein
MTPSITDSTVKAALAAIASESATTEEKIQMLIELAQGLQKKPKTPGDLWN
AVELYQQAIKMCGGDYSLWKARSQAGMAGALKSIPDVGAELLLQAKAGYE
EALPVLRQLAGAVEVAEAQMNFGLVLQSLVPFNLARITDSVAAYQEAMGV
FTSQDYPQEYAILANNIAIAYLSMSGGAEQQNLYAGLAVQTFESALKQIN
LIDHPREYAMLQNNLGNALQYLPSSHPVENNLRAIVAYDEALKVRTCQGT
PVEYANTIANKANALFNLPDDPEKPELGNSQNLLQARAYYQEAWEIFSQC
QQMEQAQAVAQALHEVGAEIRISHL
>Ava_2078 Competence-damaged protein
MSAEIICVGTELLLGDILNGNAQYLAQQLAQLGIPHYHQTVVGDNPDRIK
QVIEIAISRANILIFTGGLGPTPDDLTCETIADFFGSPLVESPEIIEDIT
QKFAQRGRVMTPSNRKQALIPQGADILPNPTGTAPGIIWEPRPDMTIFTF
PGVPSEMHRMWQETAVPFLKNQGWGQEIIYSRSLKFWGIGESALAEKVTA
YLNLPNPTVAPYAGKGEVRLRVSAKAPSEVAAEALIAPVEKQIKDIAGLD
FYGVNHDSLASVVGELLRSSGETLSVAESCTGGLLGQMLTEISGSSDYFW
GGVISYDNSVKAGLLGVNPEDLDKLGAVSDTVAEQMAIGVKTRLSTTWAL
SITGIAGPNGGTETKPVGLVYIGLAGPGDEVTSFKYNFGTMRDRSFIRHL
SACTALDLLRRRLLTR
>Ava_4070 putative MerR-family transcriptional regulator
MSIYNSIGKQYSQTRVPDIRIVNAIINLLNLPKGSVIADIGAGTGGYSVA
LANQGLFVYAVEPSIVMRQQAVVHPQVEWFTGYAENLALPDKSVDGVISI
LAIHHFSHLEKSFQEMQRIIRDGTIVLLTFDIRLAQRIWLYDYFPFLWED
ALRFLPLDEQINLLQENTKRRVEAIPFLLPHDLSDLFAAAAWRRPELYLK
AEVRAGISSFALANQDLVEKGLELLTADLNNGEWIRKYGEIHHLQEIDIG
YRFIYTTLDK
>Ava_2817 WD-40 repeat
MKQQKARRNRGVTLTLQGWDKLQAAKAKAEWEKNGGDSFSLEELSDRTRL
ALHTISRILARLEPVDKSSLQLAFAAFNLELSPSDYTRPTSTVEELETRQ
GNPQYDWAEAPDVSVFFGRSEELLQLRQWVLEERCRLVGLLGIGGIGKTT
LAVKLGLQIQAEFEVVVWRSLQNAPPVDEQVTNILQSLLSALQKEMVIPE
SFDGKLVKLMECLHSNRCLLILDNFETILSGGQAGQCRPGYEGYGQLLKR
IGEVPHISCLLFTSREKPREIVPLEGERTGAKSLPLKGLNTTEGQELFQQ
KGQFTGTEQEWQLLIEHYGGNPLALKMVAAGTQELFDGRIAPVLEYVEHG
ILIFEDIGDLLERQFYHLSPVETEVMYWLAINREPVSLAELVADIVTSSS
QRLVPSAINSLLQRSLIERSGKYFFLQPVVMEYTTQRLVQQIRQQLLGEK
SARLGLFQTHALIKATSKDFIRETQRQLIVQPLLEQLLLEMGSQEQLIIL
LQNILEQQRHQTAILTGYAGGNILNFLAHLQVNLREYDFSNLCMRQADLQ
RINLAGVNFQNTAFDQSAFATSLKNIFSLALSPDRKLLATGDQDGQIHLW
QMANRKNLLTFKGHECVVWTVAFSPDGQTLASGGHDGLIKLWDVQTGNCL
KTLAQHEGIVWSVRFSPDGQTLVSGSLDASIRLWDIRRGECLKILHGHTS
GVCSVRFNPDGSILASGSQDCDIRLWDLNTDKCIKVLQGHAGNVRAVCFS
PDGKTLASSSSDHSVRLWNVSKGTCIKTFHGHKNEVWSVCFSSDGQTIAT
GSYDSSVRLWDVQQGTCVKIFHGHTSDVFSVIFSSDRHIVSAAQDFSVRI
WNISKGVCVRTLQGHSCGAFSVSFNSVCPTGVDCMLATGSMDGLVRLWDV
ASGYCTKILQGHTNWVWSVSFSPDGSILASGSHDKSIKLWDVISGHCITT
LYGHNGGVTSVSFSPDGQTLASASRDKSVKLWDIHERKCVKTLEGHTGDI
WSVSFSPDGNTLATASADYLVKLWDVDEGKCITTLPGHTDGVWSLSFSPD
GKILATGSVDHSIRLWDTSNFTCLKVLQGHTSTIWSVSFSPNGSTLASAS
SDQTIRLWDMNNFTCVRVLDSHTSGGCAVSFNSVGNILVNTSQDEVIKLW
DVETFERIKTLKVDRLYEGMNIRGVTGLTAAQRSALLALGAVEGVG
>Ava_1809 probable phosphoribosylaminoimidazole carboxylase catalytic subunit
MTQPEALRSLLEAVANGKVTPDKALNSLKDLAYEPVGEFAKIDHHRHLRT
GFPEVIWGLGKTPEQIVQIMEVMRQRNPVVMATRIEPGVYTLLQPKIRDL
RYYELAKICAITPPTIEPRFAGIISILSAGTADLPVAEEAAVTAELSGFR
VQRLWDVGVAGIHRLLSNRHLIESASVLIVVAGMEGALPSVVAGLANCPV
IAVPTSVGYGASFGGLAPLLTMLNSCAAGVGVVNIDNGFGAAVLAGQILR
TAEKLRLASLES
>Ava_2684 Serine/Threonine protein kinase
MSLCINPVCPNPNHPHNDKNRFCQSCGSQLELLGRYRVTQLLSDTTGFAT
VYEAYEENTAKILKILKANLSRDPKAVELFRQEAVVLGQLNHSGIPKVDS
YFQHQTRNGVILHCIVMDKITGTNLEEWLQQQQNQPISQAQAITWLIQLA
EILDSVHGKLYLHLDIKPSNILLQPDGKLVLIHFGTAKNYANGTTTPTLL
SGYSAPEQMNGEATPQSDFFSLGRTFVFLLTGQHPLDMYDVHQNLLRWRN
YTKQISPLLLNFIDWLIAPESKNRPTNTQGILQRLRDIQQQVNWNEIPQS
EQQTQLKSHLPLQSPTKSFNKVPLIAFCAALIVSVGLLSLVAFMIGYPRF
TLLPPPGQSPQRKGKIAYFPYETGKDSQGRVAKFNIAVLSIEHKWLSGSN
FQIRNNDRVISLDVLKLRLEQEGIQEIMENPEEIISVGTASCEGSLSVAQ
RKAFERSQQIQNLVKKIFQNSPSVKNYRLLNLGEFQGSVCQSNRDLTAYQ
RSVIIIGLKRQSKAVVIDEALRNRLENKPFGDFQLDDYSLGSVDKFKTIP
SQL
>Ava_2701 NUDIX hydrolase
MSKQPIHVAIAILYQDNKFLMQLRDDVPNIPYPAHWALFGGHVEPGETPD
IAVKREILEEIGYILPPFWEFGCYADDAVVRHVFHAPLLLEFNQLVLNEG
WDMGLLTPEDIRKGRFYSANAGGEKPLGEIPQQILLEFMEKDLVNSH
>Ava_2476 BioY protein
MIAASNQLLWSMIGLLLTMGGTFLEAYGITLPWNWSQTGIQTFSLGVSYQ
VGAVLLVGCLGGKNAGALSQIAYLVMGLTLHPVFSDGGGNIGYVKASQFG
YLLGFIPGAWICGFFAFQARPKIESLTFSCICGLLSVHLCGIAYLMIRSF
FHWQGTDTIPLMQSILVYSWWVIPGQLAVVCAVSVVAYVLRHLMFY
>Ava_4863 Serine/Threonine protein kinase
MVNPKTEPDVYIGKFLNNRYLIIDLIGKGGMGRVYLAEDAAKGGKKVALK
ILMLNLVNQHISQRFAREIFIGAQLGRKSKNIVRVLSYGVTDEKTPFYVM
EYLQGKNLKQILKLKPLTIEKFLDICYQICVGLKCAHQGIILKGEIYPIV
HRDIKPENIFITENNKQNENVKILDFGIAKFLTERSGMTLTDSFIGSLPY
CSPEHMEGRKLLDVRSDIYSLGVLMFEMLTAKHPFQTQSNSFGNWYQAHR
FQAPPTLVEVNDQVKIPEELQDLVMCCLAKEVGDRPQNIEEIIQVLELVK
QQNTSSNSISNDIWETLPTVQLVPVTSITEKECLQKTWPKNKPVDLIGFP
HLLHTTQGLIPTFWAMLPKQEIAQFLSKNNSIKFINKMNVYPMILWVTLL
YDVQLSLIRWLSYFIDLQENRGQKIVKALAETGYYHLLFFAIEEPTNCAQ
VVTLILTAQQRQQLTDWLTFKQNIKSNELVPAEQARKILKTEYEKIKLEI
LESLAVNKPQEQMEIKSWLTKLIDKVLQIFKYQS
>Ava_3230 HAD-superfamily hydrolase subfamily IIIA
MTHIKTWLSKLHYSTSIEQPDKLPKQILTLAEIKIDWLKNVGIRGVILDL
DNTIISEDDRYLSPWAEDWIAEAKLAGLKLFILSNGKRRYRVKYWSHRLD
INAISPANKPFPSAFRKAITQMRLPSKNVLVIGDSLHTDIMGAKLSGCPC
IQVASLPHPPRWWEKLAGKWVQIPYPKGKELWEFEGAFGYKIFL
>Ava_2779 Twin-arginine translocation pathway signal
MKRRQLLGYAGAGLATAFVSTLGSNFQADAQSSGLSVQWLGHTSFLFTGG
GARILVNPFRTIGCTVRYRPPKVTADLVLISSQLLDEGAVDGLPGNPKLV
YEPGVYDFKGIKFQGISIAHDRKNGRQFGMNTAWKWTQGGVNILHLGGAA
APISIEQKILMGRPDVLLVPVGGSDKAYNAQEAKQAIEALNPKLVIPTHY
RTQAADPAACDIAPLDEFLTLMQGVTVRRSNSDSINISSGNLPETSTVQV
LSYKF
>Ava_0042 Radical SAM
MKLSHYHVVTQPFFDEIEERTKRVIFSSRTSNVRIIDEHSWHILASGDFA
QLPQYILFDLVDVELIVPDDENELQTILDYNNALAIDNDDLHLVVQPTAF
CQLGCHYCGQEHTSKMMTEDEQQKFIERTAKKLASKNFRSLSIGWFGAEP
LVGLPVMRTLTPKLQALAASFGCSYHAKVVTNGLALTHQVATEIVQELGV
NSVEITLDGTGEYHDVRRMQKNGLPTFEKIFANTVALAHRQDLDVQINIR
CNVDYQNYESVSLLLQKLAEAEIQDKINFYVAPIHSWGNDAHTRSLSKEE
FADWEITWLGEMIELGFKVGLLPERRPLVCMAVMPHSELVDAYGNIFNCT
EVSYVPTYGTPNEYAIDHLSGKQMPGKRERLASFNDKVRQGAYPCSTCPM
LPVCGGSCPKSWLEGIEPCPSAKHNIEQRLLLTYALSRIEEAETNEEALV
YA
>Ava_4482 alcohol dehydrogenase
MRAMILDAPRQPLRLTELPIPRPNSEQVLIRVHACAVCRTDLHIVDGELT
HPKLPLILGHQIVGTIEALGEKVDQFHLGQRVGVPWLGHTCAHCPYCLSG
RENLCDYAEFTGYNIDGGYADYTVADHHFCFPLDPTYPDLQAAPLLCGGL
IGYRAYNMTGDAEKLGFYGFGSSAHILIQLARYQGRKVFAFTRPGDIDGQ
EFARQLGANWAGDSDVLPPESLDAAIIFAPVGKLVPTALRAVAKGGVVVC
AGIHMSDIPSFPYSILWEERVLRSVANLTRQDGEEFLTIAPQIPIRTKIN
SFPLTQANEALDALRSGKIEGSAVLVMK
>Ava_2756 Major facilitator superfamily MFS_1
MFPTEPAAVNNGFGALLKNRGFMLLWIGQLISQLADKVFFVLMIALLEVY
PAPSGLPENSMYSTLMVAFTIPAILFGSAGGIFVDRLPKKLIMVGSDIVR
GLLTLCLPFLPREFLILLILTFAISSVTQFFAPAEQAAIPLLVKRENLMA
ANALFSSTMMGALIVGFAVGEPILSWSKSLMGETYGQELVVGGLYILSGL
LMQPIKFTEHKSHHDQLTSSHPWAEFTESIRYLKKNRLVLNAMLQLTTLY
CVFAALTVLTIRLAEEFGLKEKQFGFFLAAAGVGMVMGAGILGHWGDKFH
HKPLPLIGFLMMAMVLGVFTFTHNLALALGLCAVLGIGAALIGVPMQTLI
QQQTPPTMHGKVFGFQNHAVNIALSLPLAITGPLTDALGLRVVLMTMSAV
VVVVGVWAWKNTRRVLQDVI
>Ava_0588 conserved hypothetical protein
MKPDSQNRLKHFAALKSKYAATEYQDSSSCSPLYCILRKTDLGIELSDLE
FNWLRESKLDTTLKFIQKEQQRRQQELINLEVEFSQLKSKYKAKKHNLPW
QSSNLYCIVLKLESGNLLTDSESQWLRVNGLADTDTIAQEIKQFTHLKYK
YKAHQYQNVYPHDTLYKILKKLDLSERLNDDEYNWLINHQLWETVEIFKQ
QESAKEAEFSALKSKYQASKYQDKSLSDTLYKILLKIEAQGKLIDQEIKW
LKQQGFIETIAILQELEQAQEFAILKVKYKANQYSDSSPKSHLYKVLKII
EGGNYLSDQDINFLKKRKLTETIQIADEKYISTLKLKINLGELLNDLEII
WLKSHGRDDIINLAQQQHFAELKRTYGLVDPLLPIEPFYTIMLKLEKGER
LDPLLVIKLQEEEKLSRHGKIAIAHYKLEAEFYEQELQRTGNKWHIPTAS
SYWRKADEPKQSLKVTDLSLDKIKESKLKSAILVTRGGAFRDMSKLGDAE
NCARKAMEYHPDSYQPYTLMGAIFYDRHDYSQGDYWFEQAIQRGAKTEDI
DDEIKRVLRGTRDEIKRHEAAEYLLRKDARRYSWAKSYLIKVSR
>Ava_1551 Rhomboid-like protein
MIPISDNLHFRERPIINYWLIGINIVIFLWEIKLELSGELGAFINTWGLV
PSQISTAVNHALINPAALVVVFGRLFSLLFAIFLHGSFSQILGNILFLWV
FGKAVENILGHQRYLGFYLVAGVVTGLAQIIIEPNLTVPLIGANGAIAAV
LGAYIIKFPQVKIDTVLPLIIIYIPIELPASFYSLWWYIQQLFYGIGSLN
IPPTGVNQPSLAYWMQLVAMTIGAVYVRKKF
>Ava_2499 WD-40 repeat
MEPGLRLLIQLVLEFAPVFVNIIHKRSEESLTTARYQVPNLIQGCIETIN
NINDIDTSNKSLEQEKLWQQQLVAYQRETQAQIANQHRETALKLPEVNKI
LDSWPLRLYPSQILESHHSYGRKPLKVFIAPPKIQFDKFDNQAETYSEVE
LMLAEGLREFINKNYSLHNPTRPIEFLAGAWDSKRFHSESSIKALFGILK
TEPILILESENDGDYLNFRIAYWGLTQGTYYYKTISRLPYKEIIQNSAKN
RALEWKKIRDELLEIGVDVEEINEFGKDNVFNLAILEKAEKWQAQGIDIS
KLSLQYRINQQDIKQLYQVLITCHCLVTAWVADIYHLGNHDIPPLLPELL
PSLLKDAIDIQSLQTIATSYKQVYEALEKERLYWIPELSMQLAQSLSHLP
DNSWAQAEVDYSINAWLELRQVSLQHFTSPLEAMQSVIKIEDEEYIQKLR
KYFTAIGDRHSLEPVDKLLEAIATLKYKRSLEYPDLTKTITGHSGKVTSV
DISLDGEVLVSGCTDQTVNIWNLQTGKLIRTLTGDLGEVSSVAISPDGNF
LAVGSGIHPKSNVKIWHLKTGKLLHTLLGHQKPVNVVVISPDGQILASGS
NKIKIWNLQKGDRICTLWHSSAVHAVAISPDSTILASGSSDNKIRLWNPR
TGDPLRTLNSHDNEVKAIAISRDGQFLFSGSADTTIKIWHLLTGQILHTL
TGHSGDIKSLTTSPDGQFLVSSSTDTTIKIWRISTGELLHTLTGHSASVN
SVAISPDGTILASGSADQTIKIWQIDKI
>Ava_2084 Serine/Threonine protein kinase
MIGKLLDHRYQVIRVLAIGGFGQTYIAQDTRRPGNPTCVVKHLKPATSDP
RVFETAKRLFNSEAETLENLGHHDQIPRLLAYFDENQEFYLVQEFIDGHT
LTEELIPGNRWSESQVTHLLQEVLSILEFVHHQGVIHRDIKPDNIIRRTA
DNKLVLVDFGAVKQLRTQLVTVGGQPTATVAIGTPGYMPTEQGQGRPRPN
SDIYALGIIAIQALTGISPTELQEDPQTGELIWRHLVNVSDRLAVVLNKM
VRYHFKDRYQTATEALQALQNALNPVSVAVSSTPAKPFNSSYYQPAKPSS
PVSRQQTVAVAPTNPVVTKPGRKDSHKSDPLPLLIGIVLAGGAAALVANF
YPNVKNFAANVLGNNATSGNKCLAVVAGNSNIRSEPSSINTDTVLQTIGV
NTNFEVTGKRTKRGWVEIKFNSSRLAWAHSDVIINNQQWISCLRDKGIAL
KTVDDSTLIAARPVPKRQPKLDPFINSAPQPEDSPDSQPTEAAQPQTDTT
KVVEQARRKYESGDLVGAIAMLRSIPANASAGIKETSAMINQWQQDWQKA
DALFNDINKALEDGQWDKVLEYKNQPEKLPNIKYWRDKLEPIFKQAAENI
AKQALPKTENQGELNNSMEESPKNEEESTNETPSSSELPQGGY
>Ava_4785 Peptidase M50
MNGTIRVGNLFGIPFYIHPSWFLVLGLVTWSYGGGLSAEFPQLSGVMALG
LGLITALLLFASVVAHELGHSFVAIRQGINVNSITLFIFGGLASLEKESK
TPGGAFWVAIAGPLVSLLLCGIVTAIGVTTAVTGPLAAILGVLASVNLAL
ALFNLIPGLPLDGGNVLKAIVWKVTGNPYKGVTFASRVGQVFGWVAIASG
IFPILYFGSFANVWNLLIGFFLLQNAGNAAQFARVQEKLTGLTAADAVTT
DSPIVSAHLSLREFADDQIVQGQNWRRFLVTNNAGQLVGAIALDDLRNIP
TTSWTETQIQQVMRPIQSTTIKSSQPLLEVVQLLEQQKLSALPVILDNGV
LLGILEKAAIIQLLQNGTQPSPA
>Ava_0784 conserved hypothetical protein
MQEPEFAQTQSKEATVPEINSQTGTITKLQPPVQSQEEWRKYGEQVSDFL
ATLPDYVGNFFNQYKQPLVSVGLIVASIVAVKVLLAVLDSLNDIPLVAPT
FELIGIGYSAWFVYRYLLKASTRQELTHEITTLKSQVVGQEDS
>Ava_4517 Small GTP-binding protein
MSKEKTKLSQLENMTAELKVASIDNHISSLSGEVSIPQAPPEFKSGFIGI
IGRPNVGKSTLMNQLVGQKIAITSPVAQTTRNRLRGIVTTPEAQLIFVDT
PGIHKPHHQLGEVLVKNAKLAIESVDVVLFVVDGTVACGAGDRFIADLLI
HSKTPVILGINKVDQQPSDSQNIDDSYQQLASAYQWPTVKFSAKTGAELP
QLQELLVEHLEHGPYYYPPDLVTDQPERFIMGELIREQILLLTREEVPHS
VAIAIDLVEETPTITRVLATIHVERDSQKGILIGKGGSMLKSIGSAAREQ
IQKLIAGKVYLELFVKVQPKWRHSRVRLAELGYRVEE
>Ava_0798 Semialdehyde dehydrogenase, NAD-binding
MTKIAVIGVGRWGVHLLRNFLAHPQAEVVAIVDPHPERLTVVKQQFNLAE
SVLLTTQWSDLQTVPELTAVAIATPATTHYALIKDALAQGYHVLAEKPLT
LDPTECQELCQLAERRQLILMVDHTYLFHPAVEEGQTVIQAGKLGELRYG
YATRTHLGPVRQDVDALWDLAIHDIAIFNNWLGKAPVSVQATGTVWLQGE
GKEAGGRGQGAGEAGGELTARFSPQSPIPNPQSPVPSPQSPELADLVWVT
LTYPDGFKAYIHLCWLNNDKQRRLAVVGSLGTLIFDEMSPSSQLTLLHGE
FERQGNLFLPVNQSREVLELKAGEPLQRVCDRFITSVLQNTPPSISSGWV
GTELVKILSALTTSLQQSGQSVSLQ
>Ava_3139 4-hydroxybenzoyl-CoA thioesterase
MSDEQSNQSQLPPTNAIEVSSLRQFDSWFEYPVRVHPHHTDYAGIVWHGT
YLTWMEEARVECLRSIGIEFADLVALGCDLPVVELSIRYHRSVQLGMAVV
VKARMIDVTGVRINWDYAIVSTDGRQLFVTAKVTLVALDRDRGKIMRQLP
SNVKDALAKVSALHNN
>Ava_3684 Methyltransferase FkbM
MKQIKKFVKNVIQQIFRFMGFYLSPLHNTPFGVEWGQDINYLLDGKNLEL
VIDVGANIGQTVYEVLRYFPQSRIYCFEPVPSTFNRLNEEVGVFSNVYPY
NMALGDKPSTLSMIAEPFAQKNTLVFDVEKTKNNNIEVVDVKVDTLDQFC
LTNNIDKISLLKVDTEGYEMKVLKGAEQLLSSGCIDYILIECDFLKRADQ
PHGDFIEILKYLQSFQYNVVSFYTGGVDHLGWIWGDVLFRKISNDETVFA
MSPFPRQQVSIG
>Ava_3269 PilT protein-like
MIYLLDTNACIVYLNGRNLNLKRQIEQKLESDIAVCSPVKAELFYGALKS
NNPSRNLVLQKAFLNRFVSLAFDDQAAEIFGVIRARLTQLGTPIGPYDLQ
IAAIALANNLILVTHNVNEFSRVEGLRLEDWEV
>Ava_3995 Serine/Threonine protein kinase and Signal Transduction Histidine Kinase (STHK) with GAF sensor
MTSTVPQFPEISGYTITEQIYRGSRTAVYLAIQDNQRLPVVIKVLQREYA
TFGELVQFRNQYAIAKNLPITGIIPPLSLEPFGNGYALVMADWGGISLET
YIQQQPLDLGDILVVAIQLADILHELQQHRVIHKDIKPANILIHPESQQV
KLIDFSIASVLPKETQEIQNPNTLEGTLAYLSPEQTGRMNRGIDYRSDFY
ALGVTLYQLLTGRLPFLTDDPLELVHCHIAKIATPVHLVNTDVPPTLGAI
VAKLMAKNAEDRYQSALGLKHDLDECWSQWKQTGSIAEFELGQKDLCDRF
LIPEKLYGRETEVQTLLDAFERVAQGSSEMMLVAGFSGIGKTAVVNEVHK
PIVKQRGYFIKGKFDQFNRNIPFSAFVQGFQDLVGQLLSESDTQLQHWKT
QILSTLGENAQVIVELIPELERIIGVQPPTRELSGTAAQNRFNLLFQRFI
QVFTTPAHPLVMFVDDLQWADSASLNLIQVLMSESQTGCLLLLGAYRDNE
VFAAHPLMLTLNGMEKAGAKIHTITLQPLSFTSLNHLIADTLHTHALVVQ
PLTKLVMQKTLGNPFFATQFLKALHQDQLITFDPNAGHWQCDIVQVRDVA
LTDDVLELMVQQLQKMPEATQEILKLAACIGAQFDLTTLAIVSQQSQTEV
ATILWKALQEGLILPQSDLYKFYLGESQAIQHTPQETLNYRFLHDRVQQA
AYSLILDDQKQVTHLTIGKLLLENSNPSFQDSHLFAIVHQLNCGISMITE
LEQRYQYAQLNLQAGYKAKDSTAYNAALHYFDKGMQFLPTDSWDKNYNLT
LLIYESAAEVALLSCDFKQMDTLIQIVLKNTNDLLEQVKVYEIKLQAYQV
QNQQLQAIKIGREILQKLGVILPESITLSDIQRQVQHTLTKLSNYSLEDL
INLPITQDATATAAARIMTSLVPSIHQANPLLFPVVACEEVNLSLQYGNS
LFSAPGYADFSIIVSSVLNEIEVGYRFGQLALQLMEKFNESYVQSIVMFK
VAAFNQSNQQDIRNAIFLLNKSYKVAIETGDSVHALVSTSFRLFYSYLSG
DKPLPELLEEVEIYQSKFATSEHFLTWVHIISSSIHNFIELSDNPDCLGS
IDAENEKLSTLIQENDELALHLFYLSKLILSYSFSCFETAIQIADRGLQY
LKAGISMPSAPAYYYYDSLTRLKLYFNFQPSSQKEVLRKVDSNQKHLLVY
VNAAPMNYQHKYHLVEAVRFQRLGKQAEAIEFFDCAIAGAKANRFIQDEA
LANELAAEFYLNWGKDKFAAGYIQEAYYCYSRWGAKAKVTDLETRYPELL
RPILQQTMTSTDALTTLMTIAAPAVSVHYDTHHTSSSTGVNQVLDFAAIL
KASQVLSRTIQLDQLLQKLTQIILHNSGGDRCALLFPNETGEWQVRAIAT
PDDVQLIAEPFTNNPNIPVKLIQYVKNNQETVVIDDLKTDLPVIDDYLRQ
RQPKSILCLPLLNQGHLIGILYLKNRLTRGVFTSDRLLILNFLCIQAAIS
LENARLYQQAQTYARQLEQSQLQMIQSEKMASLGNLVAGVAHEINNPIGF
LNGSIKNGKEYVQDLLGHLALYQQHYPNPVEAIQDNAKDIDWEFLSEDLP
KLLDSMEGATNRINSISNSLRTFSRADTEYKISANLHEGLDSTLLILKYR
LKANEHRPAIQVIQDLGDLPTIKCFPGQLNQVFMNILANAIDMFDEMAQT
RLFKELEANPQKITIRTEVISNQVYIRIRDNGKGITQELQEKIFDHLFTT
KAVGKGTGLGLAIARQIVVEKHGGSLNVWSELGQGTEFTVQIPV
>Ava_1834 GTP-binding protein, HSR1-related
MCAYVNRRGQVIRVGVGTPRQTQIPPMELPRYGAERLSGIRCIATHLKTE
PPNEAALTAMAMQRLDALVVINITGTGFTRRGGGATGYVKEAYLAHLVPQ
DARALIPSAAIAPLNNGQSTSWTISPPLDLDDLGQQDFIDLVEGLEAEFR
REFIAEEVDADHDRVLIVGVMTDNMSLQQFHDTLVELARLVDTAGGDVLQ
TVQQKRSRIHPQTVIGEGKVQEVALTAQTLGCNLVVFDRDLSPSQIRNLE
AQIGLRVVDRTEVILDIFAQRAQSRAGKLQVELAQLEYMLPRLTGRGQAM
SRLGGGIGTRGPGETKLETERRAIQKRISRLQQEVDQLQAHRSRLRQRRQ
HREVPSVALVGYTNAGKSTLLNALTNAEVYTADQLFATLDPTTRRLVIPH
AETGEPQGILITDTVGFIHELPASLMDAFRATLEEVTEADALLHLVDLSH
PAWLSHIRAVREILAQMPVTPGPALVAFNKIDQVDSTTLALAQEEFPLAV
FISASQRLGLETLRLRLSLLIQYAVDSQ
>Ava_1350 Dynamin
MVNQVATDKFIQDLERVSQVRSKIAISLQKLSDTINQAELAGDTSSGKLS
LERDLEDISVASNNLKKGVFRLLVLGDMKRGKSTFLNALIGENLLPSDVN
PCTAVLTVLRYGAEKKVTIYFNDGKSPQTLDFPSFKYKYTIDPAEAKKLE
QEKKSAFPDVDYAVVEYPLSLLEKGIEIVDSPGLNDTEARNELSLGYVNN
CHAILFVMRASQPCTLGERRYLENYIKGRGLSVFFLINAWDQVKESLIDP
DDAEELRASEDRLRQVFKANLAEYCYVDGQNIYDERVFELSSIQALRRRL
KDSQADLTGTGFSEFMGSLNTFLTRERAIAELRQARTLARQAVNHTREAI
GRRLPLLDKDVDELKKRIDSVEPEFTKLNNIRDQFQKEIFTTRDTQARKV
SESFRSYVLNLGNTFETDFLRYQPELNLFDFLSNGKREAFNAALQKAFEQ
YITDKFAAWTLTAEKDINVAFKELSRSASQYGASYSQVTDQITEKLTGQK
VTVSPTTTTEDDKSPSWAKWAMGLLSLSRGNLAGVALAGAGFDWKNILLN
YFTVVGIGGIITAVTGVLLGPIGFALLGLGVGFLQADQARKELVKTAKKE
LVKYLPQVAHEQSQTVYDAVKECFDAYEREVSKRINDDITARKSELDNLL
KQKETREINREGELKRFKSLQEDVITQLQNIEAAYGNLLAYYG
>Ava_2294 Protein of unknown function DUF58
MVPAKRVYLLLILGLAIAPILSLLIGIPASIAIALLFDITVLGLMIVDSR
QVRSLRVEVQRQLPARLSIGRDNPVILSITSAKTEAVVQIRDYYPTGFGV
STLTVNTTIPSQGKEEIKYTVNPTQRGEFSWGNIQVRQLAPWGLAWDDWQ
IPQSVQVKVYPDLIGLRSLSIRLTLQSSGSMRQSRQLGIGTDFAELRNYR
TGDDLRLIDWKATARRVGVPLVRVLEPEQEQTLIILLDRGRLMTARVKGL
QRFDWGLNAALSLALAGLHRGDRVGVGVFDRLMHTWLPPQRGQHHLSKLI
DHLTPIQPVLLESDYLGAVTNVVRQQTRRALVVVITDLVDVTASTELLAA
LTRLAPRYLPFCVTLRDPQVDNLAHTFTEDVSQTYNRAVALDLLAQRQVA
FAQLKQKGVLVLDAPANQITDQLVDRYLQLKARNQL
>Ava_4974 TPR repeat
MNNEFYNQGLEKAKQRDYAGAIEEFSRALKLTPYFAEAYLQRGLAYYDSG
AILLAVSDYTEVIRINPESVEAYYCRSLARLALKNLPGALEDVEQAIRLN
INYAAAHHLRGTVRRKQGYIQDAIASFKQAAELYLQQKDQENCRLCLEKI
KQLQPPKQSTIQPTKSPIAPITSVNEYFTQLLDKAEQGDTREAIADLNWI
LQADPQDAQAYCCRGVVRCKMGNYREAIADFNQALQLNFQDAVVYRNRGK
ARSLLGDHRGAIADFNQAIQIQPEDTLGYVARGNTYRAMGNYLGAIQDYG
KALQINPHDAQAYYNRGIAYTFLEEMQNAVEDYQRAASIFCEQQDWENYQ
LTQDSLKQIQTSIPEYKQQKSNVLRQKLLRLVGGYWEIAQRLIQQQKDYQ
PGMSEEWYMQKVIDDLERDRGR
>Ava_4335 trans-aconitate 2-methyltransferase
MPDKWNPEQYERFQAERSRPFYDLVDLVQPQENLRILDLGCGTGKLTQYL
HDTLAAKETLGIDASEKMLSVASQFAGNRLRFEQGRIEDSPGEGKFDVVF
SNAALQWLTGHEALFEKLRDKLQPSGQLAVQIPTMDDEPVHQLAVETAKE
FSQELGGYTRRLEVLTPSAYAKLLYKLGFVQQQVKLQIYGHVLPSREAVV
EWYRGTLLTAYESQLDSQTYEQFVQRYQQKLFQVLPDERPFFFPYKRILM
WGKI
>Ava_2825 Rhodanese-like
MNQENTQIVAAFYKFVSLPDFTEKQVPLLAYCLAQDIKGTILLAKEGING
TIAGSRLSIDNVLSYLRADLRLQDLEHKESTADTPPFERMKVRLKKEIVT
LGLPEVDPNEQVGTYVTPEEWNELISDPEVIVIDTRNDYEVHIGTFQGAQ
NPQTNSFRDFPEYVRQNLDPNQHKKVAMFCTGGIRCEKASSFMLSQGFAE
VYHLKGGILKYLEQIPPEESLWQGECFVFDERIAVVHGLEPGTHELCFCC
GHPLAEEDKASPQYEEGISCSHCFDSLTEDKRTRQQEKWRQYQLKNSHSL
GNSKL
>Ava_0157 Protein of unknown function DUF6, transmembrane
MGRFEKRPDNDPRVRGELSRAAETALWAVVEDLESLQQNVLRSFQEEIKK
LQTEKDRLTDEVQQLIEEKEHLQEVRRITEQQVLIRQLSEALAKHICSQL
QSSLAKIANQTESQIAALKSAQSIGPAIENNEQVEKMLGSLDDNLTIAFN
SLQQELKNYQSNLSQQLSRMYNQQQQGETIVEELIDRLRGELTRAIQETS
PAKAQLSPPTVLQPSELQPPSSPAVVNLSPPTVLQFPEQQSPNPSQTSTP
VEETSTTKPSVSITPPEKSTPVPIVPPPQETRPEPQSVIPKVSPDSETKL
QSTQEKAAEPSSVINRELSASAAKSPPTPEKPPEPISTSKTKFSPSSEKP
PEPVSTSKTKFSPSSEKPPEPVSVLSPDSSASKASSPPPASVVRRGSTPS
SSRSRKSSNLSPVQVGFLLVVLSTVMTALYNVVLKGMFYKTSQLSAMLEV
AGLISPTLGNIMLILTLRLMVVVPLMILLAPMMYPQVWQDLQNLKQSLGN
NQSGSRSQPKRVVQLMFASGCFLFLSQVLIYLAIGQVPTGVAIALFFVYP
LINGVLSWLLFRDRPGVFRASAIGSIFCGEVLVFAGATSTGIGTTPLGSI
TAILAGAAFACYLILTRVCAAKVHPVSLSLINFVTMLGLSFIFLMIPLPE
NWSLVVDPSKLLEIVLSAFILGALTLLSYVFSHIGINKLGGLRSAIISAS
VPILTVIFAGLVLQETLNIAQIFGVLFVTFGAIAFGIGKIQNPNKPSNAE
G
>Ava_3562 ATPase-like
MVMKLITAKIENFKSLGNVELSFRNLTIIVGSNSSGKSNSLESLNFFKEL
LVSDSLSAERMQRLLRFGNQNICSTIVVEDDGQKAEYSVSITSNKKHIQI
ASENLKVNGNEVIKIINGEGEVRDEDGQNPQKYTSNPESIEDLALTSAGN
FGNKPVTKKLASYIREWKFYDINPKDIKKYSTFLEMVKIVRNTADNDIVP
SLDNDASEVQEILNYWAKNEINKLRDVSEELSNCLNISLDIGVEKDPIVK
VLEGDGKKMPLSSMSDGTLRLIAYLILLYQSESEMPTLIGIEEPERNFHP
GILKDVASIMKRLSKKTQVVFTTHSSQLLDCFSPEEITSDISVILLSNKG
ELGTKACLLDKLAENRDDLLEWMTDFGLGSAIYHSHLIEEILAI
>Ava_0005 probable methyltransferase
MLLNKAVLVNKINSNHLFSASSLNPTIDMSQLFPGEVFANTADFDTGIRQ
LLPRYDEILEVISRCLPLTSHRILDLGCGTGELSLKILQRCPNAQVIALD
YSPRMLEFAQHKIASSGYKERWTGLQADFGDWAINPETLNIGNEFDACVS
SLAIHHLYDDMKLRLFQRIAASLTPNGCFWNADPTLPESPTLAEIYQAAR
EQWVSEQGSNFTEVRAKVGNSSPQGYSNPDQLATLDTHLQMLTKSGFTTV
AVPWKYYGLAVFGGWVE
>Ava_4605 4-oxalocrotonate tautomerase
MPFVTVKIARGHSIEKKRHLVEAITNALVTALDTKPEWITVHIDEFEREN
WAVNGILHCDRHRGRHDETGR
>Ava_2594 2-nitropropane dioxygenase, NPD
MTTVDALINKYDNGLGFLAYTSYQNLAWKGSLDCISFDHTAIKNKLLALD
KPCYIVKVAGKIGLTNDGYLSAAELGTPGQVELLTSLPSIRIQQLGDPNF
LSFYGVQSAYMTGAMAGGIASEEMVIALGREKILGSFGAGGLPPERLEVA
INRIQQALPHGPYAFNLIHSPNDMAIERRAVDLYLKYEVTTVEASAFLDL
TANIVYYRVAGLSLNDANQIQIKNKVIAKISRREVASKFMQPAPARIIKE
LLEQGLITELQAKLAANVPMADDITVEADSGGHTDNRPLVCLLPSIIALR
DEIQRQYNYSQPIRVGAAGGIATPESALAAFIMGAAYVVTGSVNQACIES
GACDHTKQLLAQAEMADVIMAPAADMFEMGVKLQVLKRGTMFAMRAQKLY
ELYRTYDSIEAIPPAEREKLEKQVFRKTIAEVWEGTAAYLSQRNPEKLGK
AVNNPKLKMALIFRWYLGLSSRWSSAGEKGREVDYQIWCGPAMGGFNDWV
RGSYLAEANNRRVVDVAHHIMTGAAFLYRIQSLKIQGMQIPDYYCQYHPV
RY
>Ava_2521 hypothetical protein
MYFDWKKDKLYFFWILGTTIGLVAGLISALFSSVAIYNLPLVGASKGLIN
FLSSAIAGLILGFCQGIALAKLLKHQELSTNIRRLTIKWILATTIAFSLG
GQVTSLWTNLDGTLTPQMLVFKLSLILFLIGLAQGIVMQWSIKKISQWLL
IHILSILASVLVGVFLFFLTISIGGGLSGGGLGVLVYAFLIAPLLISVST
GIIYGSITILFLPNLLKNDI
>Ava_4803 Tetratricopeptide TPR_3
MVNLPGSKLLAQTPISRDLETASLYQQGVTRYNRSDWQGAENAFRQALQR
EPNLAMARAYLGNIYLMQNRLDVAVQEYGEAIRLNPNLGETYYNLGLALQ
QQGKKEGAITAYRQALVIDPRRVEAYYNLGLVLYEQGLLQEAIAAYQDAI
NLEPSKVNAHHNLAIALQQTGKMEEAIVAYREVLKLDPQNAAAYSNLGSL
MAMQGRPEEAIAAYTQAVRQDPKNALAYYNLGITLYNQGDLQKASNAFKR
AQEEYSQQGNLEQTEKTEQLMQQVAQKIEEQKLQQRQASTPKPTDNATNN
LLEKLTQLTAPKEQPANSGAVTVSDEQKQFPPIFPNTNSR
>Ava_0685 Alpha/beta hydrolase fold
MSVTQETWTHEYIITNGVRLHYVTQGTGRLMLMLHGFPECWYSWRHQIPE
FAQHYQVVAVDLRGYNDSDKPKEQSAYVMDELIKDVAGLIKELGHEKCIL
VGHDWGGAIAWSFAYAYPDMLEKLIILNLPHPAKFIQGLYTPQQLLRSWY
IFFFQIPALPELLLKSTDYQAIPNTIQTTAVNKNAFTPDDLNTYRNAAAK
PGALTAMLNYYRNIFSHSFFNKSWGVLNVPTLLIWGENDTALGKGLTYDT
STYVKDLQVKYIPACGHWVQQEKPELVNQYMRNFLMI
>Ava_2964 Protein of unknown function DUF1400
MKRLLKYLSLGLLSIGFAAKPGLSAERISLFYPPFGEFSLPVSSLETFAK
QGKIDGDLQFYAQRATPEQLTQLREFLQQRFDVTPTFISQITYSPIGEQV
LQRLGDIVQTDSRRNGFYALRSALILSAAKPQGFSVINVLRNFPSDNLRL
NFSEGVRIVDDLSRLTKNRDRVVASLQQGAVAQTVVSNENLSKLPDLRSP
GKFRWQIVNFTLNDTQRNRRLPVDLYLPQANTDTQGKPPFPLVVISHGIA
SDRYSFIYLAEHLASYGFAVAVLEHPGSNAKRFEQYFAGLASPPEPREFV
DRPLDIKVLLDELQRLEQSDPRLQGKLNFQQIGAIGQSFGGYTVLSLGGA
KINFNQLNQDCNPENSSLNISLLLQCEANQLLPQDYQLQDSRIKAVIAIN
PISSSIFGESGISQIKLPVMMVAGSQDIFAPPVPEQIRPFTWLPNPYKYL
VLIDNATHFTLIGDSPQGKNVLPVPSGLLGPDRTAAYSYLKALSVAFLQA
NLLNRPEYRSYLQPSYAQSISQAPLNLNIWQSLTAEQLMEIRE
>Ava_3130 conserved hypothetical protein
MTVAVNKAPGIASRLVNGILAIKPLANLAKHQARQMMIKRAQRIGVPWIN
EVKTLQARDWTEDLAQVQNPQITYPDYYLTSFHGYDEGDLSWQAAFELEV
AARTVHAGIWQDGKADGDAKLRQSYHDILKVRVPQPRDILDLGCSVGLST
FTLQETYPHAQVTGLDLSPYFLAVAKYRAQQSQAQINWIHATAESTGLPD
RAFDLVSLFLICHELPQLATQKIFAEVRRILRPGGHISIMDMNPQAEAYK
KMPPYILTLLKSTEPYMDQYFALDIEQTLVEAGFQTPTITKNSPRHRTVI
AQVRD
>Ava_1356 NB-ARC
MSNSLKASTTGLGIVDKARKRLGWTKTSTACWWQDAHTSRATLRRFWQGD
RIQQEIFIAICQAVGINDWQNIAEIPNADFEETSTQYLDWDDAPDLESFY
GRNQELAQLEEWILGDRCKLIIITGIAGIGKTALALALADLLQLKFDGLI
WKTLHSVPSLVPLLDGLLHTFKQTPVDDIPSDTAKLIHHLQQRRCLLILD
GLDESEKHYHQFIQQLSRAHHQSCIILTSREQPNIIESNTKTVRSLTLTG
LPKDDAVKLLQARGFTGKEIGLSPLIQLYRGNPLALKLVTPLIQSVFGGN
IAAFLSQNTIVIGDRLRAILKQQFEQLSDLEQNILYWLAIWQQPISFSRL
QTHLLISLDPATVLDAIVSLERRSLLEKWICSDAPAFTLQPLVMKIATDE
LVERATQEIIQVMQSRDIADFKVLRTHWLLRPGSDDIVGDRILHQLQEKL
WQIYGANLVQNLQQILLLLNDKSPLTIGYIACNITTIITKGV
>Ava_2173 inner-membrane translocator
MSQTLRPANKRPNEHPKSRQRQSINNLLQVAGILPILVIICILFSLLSPN
FPTAGNAVNILRQASINIVLATGMTFVILTGGIDLSVGSILAVSAVVTVL
VSLLPALGWAAVPVGLLTGLLLGLLNGALITFLDVPPFIVTLGSLTALRG
AAFLIANGTTVINRNINFAWIGNSYVGFLPWLVIIALLTVAVSWFVLRQT
VLGVQIYAVGGNERAARLTGIKVNRVLLFVYGVSGLLAGLAGIMSASRLY
SATGMLGQGYELDAIAAVILGGTSFTGGIGTIGGTLLGALIIAVLNNGLT
LLNMSFFWQLVVKGLVIIAAVMIDRLRRRSRR
>Ava_0119 Patatin
MSFKILSLDGGGIRGVITARILQEVERQIQQQQGKSLHEYFDLIAGTSTG
SILTAGIAAKKNSSELVQMYQEQGQQIFPIERKERYKKIPSFLQPLIEAF
SLPKYSHQGLINVLKNVLGDTRIKDVEGPIILILAYDTLYRNTTFFTNCH
PDLGDRWYDDCYLWEICTASAAAPTFFPPYKLEPVNKEKYGNWVFPHIDG
GIAANNPALAALSLVMRLSQSSVSSAIKQQHNLDGINLDDIAILSIGTGQ
TGEPYSFDQVQNWRGIDWAQHIIDIFMEPTSEVSSTICRQIMGGFNSQRY
LRLQFDLNERFQPNKVESYKDTRQVLKPGQRVNRFTQTPITEEMDDTRDD
ILRYLIDATSKFIDKGCTFYTRNDCGPQVKDAIASFIQAN
>Ava_1543 conserved hypothetical protein
MKYTFDIVGVSPVWQFFSHQQQIQEQSNHPGIEYLGSHKCTLDALIETVE
PLPLKWGWNTEQVLDTVVQFWMNNSESIRYWKTRLTDGGNENILVARLAD
ITALQAEFESLLGKNW
>Ava_2312 conserved hypothetical protein
MNNQQGSNRISSGVIAAVSAAVVTVAGGVAWITANSHNNPTPSNPSQRVQ
EPGQPTTKQPGNEQTASVYWLKPKGDSIDLVPQRVRVAAVQPNQVLEKAF
QNLLAGPTEGTDSTTIPKGTKLLGLKVENNDEVHVNLSEDFTSGGGSTSM
MGRVGQVVYTATSLNPNAKVYIEVNSKPLEVLGGEGVVLDQPLTRDSFKK
DYPL
>Ava_3510 conserved hypothetical protein
MTRRGDNSWANRVTGVWKKAADSVVQRLPVEQVSQKFVQWFSVSEAEVAE
ILEKVRQELPTTEAVLIGKPQAGKSSIVRGLTGVSAEIVGQGFCPHTQHT
QRYAYPSNDLPLLVFTDTVGLGDINQETQAVIQELIGDLQEKSSRARVLI
LTVKINDFATDTLRQIAQELRRKYPEIPCLLAVTCLHEVYPPGVEDHPDY
PPNYEEINRAFAAIQAAFTGLYDRSLLIDFTLEEDGYNPVFYGLDALRDN
LAELLPEAEAQAIYQLLDKPTSEKLGNIYRDAGRRYTLSFAVMAAALAAV
PLPFATMPVLTALQVSMVTLLGKLYGQTLTPSQAGGIVSAIAGGFLAQAI
GRELVKFIPGFGSAIAASWAAAYTWSLGETACVYFGDLMGGNKPDPQKIQ
SVMQEAFAKAKEQFKGIKS
>Ava_2363 Oxidoreductase-like
MSVAEPNSHTQRNQPRPIRVGVIGVGNMGQHHTRVLSSMKDVELVGVSDI
NVERGLETASKYKVRFFEDYCDLLPHVEAVCIAVPTRLHYAVGINCLLAG
IHVLIEKPIAASISEAESLVNAAAESQCILQVGHIERFNPAFRELSKVMK
TEEVLALEAHRMSPYSDRANDVSVVLDLMIHDIDLLLELAASPVTKLTAS
GTRALDSGYLDYVTATLGFANGIVATLTASKVTHRKIRRIVAHCKNSFTE
ADFLKNEILIHRQTSASPVNDHRHYRQDGLIEKVYTTNIQPLSAELEHFV
NCVHGGNQPSVGGEQALKALRLASLIEQMALEERVWNPLDWQSESRVQSL
TQSV
>Ava_3291 Small GTP-binding protein domain
MTSILPLPDPQPSDATNTDTSSLNWEEELDSAIFSFEDIQAELNYKQAQS
ALRNLVGNIDLSSQEKAGLESEIADLETMLGKLDSMVVQIAAFGMVGRGK
SSLLNALVGESVFETGPLHGVTRAAQRVNWRISEEAIGETERALRVTLPS
VGRSQVELIDTPGLDEIDGETRTALAEQIAKQADLILFVISGDMTKIEHE
ALSQLREAGKPIILVFNKVDQYPEADRMAIYQKIRDERVRELLTPLEIVM
AAASPLVKTAVRRPDGSRGVQVRTGNAQVAELQVKILEILQREGKALVAL
NTMLFADNVNEQLVQRKLMIREQNANQLIWKAVMTKAVAIALNPVTVVDI
LSSIVIDVSLILGLSKLYGIPMTEAGAVQLLQKIALSMGGIGVSELLANL
GLSGLKTLLGISATATAGVAIGPYISVALTQAGVAGVSSYAIGQVTTVYL
ANGATWGPDGPKAVINRILSTLDENSILNRIKDELQVKLRRE
>Ava_3056 TPR repeat
MQIRGVNNVIVGTGFVVSLTGQIVTCCHVVRDAGGEVAEGVEINVYFPKA
IKPEQKAHIATVTACFLDHDDDVVVLQLDTATLPDGIEPAILGMAEGSAG
NKFRSFGYRRLQNYQGLPAHGEIVDFAESPEALILHSDPVMLNSQHIDSG
MSGSAVLDIERNLVVGVIAETWDSGESEKDRDTSFAVDCRVLTFDPMSLP
LADALPTPQPTINANPTGEQPVSQPGIISNLNNAPAPLEEWVGRADFLKT
LNQDWVDSNCSIVGLIGFGGEGKSSLTRRWLEDLLQDSSLPQPTGVFWWG
FYEKNNVLEFLEAALEFLVPGIDPYKVTWAEKVNFIHGMLKSGRYLFILD
GLEVLQQEEGDDYGELTNLELRDFLREFAAGEHESFCLINSRAPLVDLID
FTTYTHRDVERLSQEEGRTLLRKVGVKGADNELNKIVENWDGYALVLSLL
GAYLVDVHNGNVKYIREIKPPTASEPRYERVQRVLRRYDEHLTQAEKEFL
TIFSAFRLAVPPSLFTSVFQGVLPGKLPRDRDQPRSTLDGLFLKFQRLYW
RFIDRLFPSQRRVAARLKRQLNAPLADLRGSTFNGMIRRLLNYRILRFYP
EANYYTMHPLISAHYSQRLENNLGVQAKEIHQRIADYYLRIAGPIPENPS
IESLAFPIEAVHHLCCAGSYDEAFNVFWERVLQGINFVLLDKLSAWDINL
ALMVEFFPNGDTSQEPQVGTLAKKCWILQNVAQSLMSVGRLAEALPLYER
VITIYPSTKHWLNISISYQNLAVLYESLGKLEASADADREALNLARRAGS
KEGKLSSLRWQGWVAHLQGDNEIASAFFQQAEALQPEIDSSSKLYLHFLN
GIIQVNYLRRVGSSDHARQVTETNLEIYQRNNCPATVSSYYRVLGDLDTD
AGQHDNAYEYYNEALKIARSISERAVLIEALLARGRWAARQGDVAAARSD
LEEALNYACTGGYRLSEADIRVALGWMHLANHNYPAATTEAEKARLMSDE
MGYYWGRVDADEVLQALKQNF
>Ava_4903 Putative ammonia monooxygenase
MNQSLSISPTQKETNNASPLIAKLQLIVKQIIVLTIEMLLALPLGWALTK
LHFGGISWIFGGIAAGTAVLQGCRVFYNYSPQPNRVARKVGMGLVGLTVG
ASNANSNLVSLVSGIPIFIFLTLFLLLSGIGIGYIYSRLSKTNILTAMLA
TVPGGVGVMSSIAADYNRNVTLVALVQAIRVTSVVLLIPFIARTSVGGYL
NSSTLPVKGVLIDFAASQLGLLAVVLVLTGIIVYLAITFKIPAGEFFGAL
VLGATFNSVLDYLPFVSHIHFHPPLLVNIIGQMLLGITIGEYWGEKPNFG
KKTVGYALMSVFMTLVAGAIAATLAMQLSSWDWLTCLLVTAPGGSAEMIL
VSLSLNHNVEVVTTGHLVRLIAINSSLPLWLFLFRHFDGLREAKATK
>Ava_0109 HAD-superfamily hydrolase subfamily IA, variant 3
MGQNQFELVIFDCDGVLVDSEPIINRIFAETLTEAGFPITYAEVTQKFIG
KSLKTCLEIIETSYNKPLPKNFMELCKEREMAPLEKEIKPVPGISEVLEQ
ITLPKCVASNNSHRHIQMVLKLTGLLDKFDGKIYSANDVLRPKPFPDVYL
YAAEQMNTNPEYCAVIEDSVPGVQAASAAGMTVFGYAYHSDVSQQAVVRR
CTALFEAGAKIVFNDMRQLSQLL
>Ava_0164 conserved hypothetical protein
MSFKSVNDILGVLEKQAKWQQQPFQQVCQFWAEVVGSAIAAQTQPLSLQR
DVLRVATSSAAWAQNLTFSRQTLLLKLNQKLSTPIVDLRFSTAGWQRHPQ
KEKPHSIGLHSQHPSYLGDVNEDQGQVTAVNQDVNQVFGHWAKTRQRRLQ
NLPLCPQCQCPTPPGELQRWGVCGFCSVKQLPKNM
>Ava_3719 Peptidase U62, modulator of DNA gyrase
MWSELTKAIASFKIPADWIGIRVVKEIAANHYVRDGLPQSNGKSTTVGAM
LEVLVNGSLGYAATNSLELPSLQAAAEIAYKQALAASQWGIYPFRETERP
KVVGEYNSPFLEPLDALSPGEINDLLVRICHTLKVDEKIVQTTASASTSE
KQSWFVSSNGSQVYQKFLSIGTHYGAIAQDGAIVQQRSNNGWQAHCYQGG
LELLKQDHLWQRVQQIGEQAIELLTAEECPSTRTHLVLAPDQMMLQIHES
VGHPLEIDRILGDERNYAGGSFVNTSDFGKLVYGSPLMNITFDPTVAGEY
ASYGFDDTGVVATREYVIKEGVLKRGLGSLESQARAKVPGVACARASSWN
RPPIDRMANLNLEPGNASFDEIIRDIEHGVYMESNRSWSIDDRRYKFQFG
CEYAKLIENGKLTKTLRNPNYRATTPEFWHSLVQISDDANWQIYGTPFCG
KGEPNQSIWVGHGSPVCVFADVEVFGGG
>Ava_1797 Peptidase M50
MQTNWRIGSLFGIPLFLDPLWFVILGLATLNFGVAYQEWGTATAWTAGLI
MALLLFGSVLLHELGHSLAARSQGIKVNSITLFLFGGIAAIEEESKTPGK
AFQVAIAGPLVSIGLFLLLRLGSTVVSDSSPVSLMVADLARINLIVALFN
LIPGLPLDGGQVLKAALWQITGDRFQAVHWAAKAGQILGYGAIALGFAVD
FFTRELVTGLWIVLLGWFGVRNANSYDRVTTLQETLLKVTAADAMTRDFR
VIDADQTLRSFADSYLLATTNPEVYFAASDGRYRGMVAIEDLRLVERSAW
ETQTLHSIAHPLTEIPTVAESTVIAEVINKLENEQLPRVTVLTPAGAVAG
IIDRGDIVSALAQKLGLRMTDAEIKRIKEEGSYPPGLQLGVIAKSIN
>Ava_3535 Serine/Threonine protein kinase and Signal Transduction Histidine Kinase (STHK) with GAF sensor
MATQQYSNPIITGYQISSQLYAGSKTRVYRAIREQDQLPVVIKVLASDYP
NFQELLQFRNQYTISKNLNVTGIIRPLSLETYGNGYILVMEDTGGIALRE
YIKTAPLPLVEFLAIAIQITSILQELHLNRVIHKDIKPANILIHPQTKQV
HLIDFSIASLLPKETQEIRSPNVLEGTLAYISPEQTGRMNRGIDYRSDFY
SLGVTLYELLMGELPFSSDDPMELVYCHIAKTPIALGHQQHIPLVLSDII
MKLMAKNAEDRYQSALGLKHDLETCLYQCKNNGKITAFEIGKRDMCDRFL
IPEKLYGRETEVKVLLQAFERVTKGKSEMMLVAGFSGIGKTAVVNEVHKP
ITRQQGYFIKGKFDQFNRNLPLSAFVQALRDLMGQLLSESDSQLSKWRTK
ILDTVGNNGQVLIEVIPELEIIIGKQHPAPELSGIAAQNRFNLLFQKFIA
IFSTPKHPLVMFLDDLQWADTGSLQLIKLLMEDQSYLLLLGAYRDNEVHT
AHPLILTVEELKKAGKTVNKITLASLTLCDTNHLVADTLHCQAERSHPLT
ELIERKTKGNPFFITQFLKALHEDQLITFNRHQGYWECDITQINELSLTD
DVVKFMAQKLQKFPSKTQDVLKLAACIGNSFDLNTLAIVLEKSAADTADA
LWKALQEGLILPQSEVYKFYLDHDRQDAHVNNSQNVEYRFLHDRVQQAAY
SLIPQDQKQATHYQIGQLLLKQISPVAREERIFELVNQLNYGVALITQQT
ERDELAQLNLTACRKARSATAYQAAHEYITVGLLLLGEKAWQHQYEITLH
LHEFAAEVASLRGDFAQMEQFINIVTAQAHTLIEQVNVYRIRIQAYISQN
KFAEAIAIAQKILQQLGVTFAEATTPAVIQQEIQEIRELIGDRAIADLVH
LPIITDEEKVAIVQIASSIMTVAYLSGSPLFPLITLLLVKISIQYGNTPA
TGYIYSTYGVLLCNVLQDVDTATEFGQLALQIVSKVDAKATQPQVLLVLA
LYILQRKSHVQEILPLLQKGYAIALEVGNLEFAGHHAHNFCSHSFWCGQH
LATLQEDARAYSNELEQLNQITTANYVRIYWQSTLNLLGFAVHPTILSGE
ALQEQEFIPLLIAANDGYGLYIFYLYKLMLCYLFGEIETAKSITIKIKDH
LMAAAGLFCEPIFYLYDSLIALAQLRQNSEEVSATLQYVAENQAKLQRWA
QYAPMNHQHKLDLVAAERYRVLGEKTSAIEYYDHAISGAKKSQYIQEEAL
SNELAAKFYLDWGKDKVAQIYMQEAYYCYARWGAKAKIDNLEKLYPQLLK
PILQRRQLNLNPLETIAPINRNTIAVPTDTAGTSSTSISDILDFTSVLKA
AQAISSTIELEELIVNLTKIILEISGAKKIVLILPQDNAWYVRAITFISY
ENKPEGKIQNILLLQPIDTCKHIPGTIINYVKNTLETLVIDNYQIDIPGL
IDQYLLKHQPKSVLCQPIIKQSNLLGILYLENQITSGVFTLERLQVINLL
SSQAAISLENACLYQKAQQALQDLQAAQLQIIQSEKMSALGNLVAGVAHE
MNNPLGFITATLKQAKPTIADIAEHLRLYQANFPNKSAEIINHAEEIDLD
YSLEDLPKMIDAMVMASDRLKNISTSLRTFSRADTDHKVSFNVHEGIDST
ILILKHRLKANEQRPAIEVITDYGNLPQIECFPGQLNQVFMNILANAIDA
LEATNKGQTLEEIKAHPHQITITTTVNNDQVKIIIADNGIGMDEQVKQKI
FDHLFTTKAVGKGTGLGLAIARQIVVEKHNGSLFVNSQLGAGTEFVITLP
IITR
>Ava_3599 FAD dependent oxidoreductase
MNDIVIIGAGIAGLVCAQQLSQAGYSVLVVEKSRGLGGRLATRRLHGTWA
DHGACYLKPKGELFTDFVELLRDRHILEIWTDEVYELQPNAAPRYVAPGG
MSAIAKFLAQNLNILLNQRVTEVNLTPENTWRLTLESSNEELRAKALVIA
IPAPQAAMLLTPLGGGVLGQEFLTNLSAVEFAPCISAIAGYPTSSHPLPN
WKAFNFIDDAVLGWIGLDSSKRHQPQQPVFVLQSSANFAQLHLESSDLQP
IGQQMLHHAAQTLELPWLGIPEWLQVHRWRYAFPHIVWHNQVLVAQAKLP
LVCCGDWCGGNLAEGAMLSGLAASVEINNYLNQLILPDVNFLKVFT
>Ava_0956 Protein of unknown function UPF0118
MNISLNQLLKWLIITLLFPLVCLNGWLVFAFFQYFKPLVTIFVLAILLAF
VLNYPVSSLQKRGIKRGNAVGLVFVLTVIILIASGVTLLPIAIEQFHEIA
KVLPLWIDSSQEKLQNLNDWAVGQHFKLNIGQILTRIIDKIPDELESVSN
QTFKIIVETVDSISEAIVTVVITFYLLVDGQRIWEGIFKKLPISLAQKLN
QSIQKNFQNYLIGQGTLALLMGTSQTLLFLAFQVQFGLLFGLGVGFLSLI
PFGDVVSLIVITLIVATHDFWLALKILAVAVVIDQIIDQAIAPRLLGSFT
GLRPIWVLVSLLVGTYIGGLLGLLIAVPVAGLIKDAVDGFPSLTTGIENV
LSTETTTEALSPESTSS
>Ava_4650 conserved hypothetical protein
MPAEESERSRLPFEPKKKRQKPAKAPSKPPVQLKEADKQDKKQLPYTKEE
MAIPQVVSQRMIRRVAAFCGIPTALGITTLVSSYLLTIYSDIQLAPIAVL
LVNMGFFGLGVLGITYGVLSASWDEERTGSLLGLGEFGTNWGRMVEGWRE
TRQKKV
>Ava_4416 GTP cyclohydrolase I
MSNSSPETVSQPSQEVKYGEREIAEGQLITFPNPRVGRRYDINITLPEFT
CKCPFSGYPDFATIYITYVPDERVVELKALKLYINSYRDRYISHEESANQ
ILDDFVAACDPLEATVKADFTPRGNVHTVVEVKHRK
>Ava_2629 Possible Transcriptional Regulator, Fis family
MMIAADVFQLPEDFITGVATKYGVTKTELDALVLALHDHSGAEIAAKLDI
SQPAVRKRLGESYRKLGIEGKGNKKINGLRQKLYEQYQLTYQHPVFSSED
WGEAVDVAGFRGRKEPLLELEQWIEGNGSNRCRLVAVLGMGGIGKTVLAA
MVAKKVQKEFDYLIWRSLRNAPSLGDILTQLLRFLANENENDLADTDNNK
IVRLIDVLRKKRCLVILDNVESVLRSGEGKNQEWAGEYQPGYENYGYLFK
KVAEASHESCLLLTSREKPKEVAALEGKNLPVKVLQLSSLNLAEAREILL
DKGCYCTDEQLDELVRRYSGNPLALKIVATTVYELFSNNISEFLAQIHQE
SAVYGDIRTLLKQQFQRLSDLEKKVMYSLGANREYVSFREIKDDWLTTES
PIKIMEALESLLRRSLIEKASPTLIEKASSTQGEKEAESSKFGLESVVME
YITAKFIENSVEEFSQKKKLDFINTYPLMKARSLDYIRQIQERLILEPVK
QKLLNIFGTELELHLRRMLGNLQKEPLPKKGYAAGNLINLLRQLQLDKFP
DESPIDLSGRDFSGLTIWQAYFKEVKLRETIFANSDLTGSVFTETMSSVV
SVRFSPDGKYFATGLMNGEIRLWQTTDNKQLRIYKGHTAWVWAFAFSPDS
RMLASGSADSTIKLWDVHTGECLKTLSKNANKVYSVAFSPDGRILASAGQ
DHTIKLWDIATGNCQQTLPGHDDWVWSVTFSPVTDDKPLLLASSSADQHI
KLWDVATGKCLKTLKGHTKEVHSVSFSPDGQTLASSGEDSTVRLWDVKTG
QCGQIFEGHSKKVYSVRFSPDGETLASCGEDRSVKLWDIQRGECTNTLWG
HSSQVWAIAFSPDGRTLISCSDDQTARLWDVITGNSLNILRGYTRDVYSV
AFSPDSQILASGRDDYTIGLWNLNTGECHPLRGHQGRIRSVAFHPDGQIL
ASGSADNTIKLWDISDTNHSRCIRTLTGHTNWVWTVVFSPDKHTLASSSE
DRTIRLWDKDTGDCLQKLKGHSHWVWTVAFSPDGRTLASGSADSEIKIWD
VASGECLQTLTDPLGMIWSVAFSLDGALLASASEDQTVKLWNLKTGECVH
TLTGHDKQVYSVAFSPNGQILASGSEDTTVKLWDISKGSCIDTLKHGHTA
AIRSVAFSPDGRLLASGSEDEKIQLWDMQNCSRLKTLKSPRLYENMDITD
ITGITDAEKASLKMLGAVEES
>Ava_5009 2OG-Fe(II) oxygenase
MTVLQLPIIDISGLTCQRNNSSDVVAQQIKQACQDYGFFYIVGHGVDEQL
QTQLEHLSQEFFAQDQETKLKISMALGGRAWRGYFPVGNELTSGRPDLKE
GIYFGSELKDNHPLVKAGTPMHGRNLFPSNIPQFRETVLEYIDSMTKLGH
ILMAGIALSLDLDKSYFAERYTKDPLILFRIFNYPPNLGSQSEWGVGEHT
DYGVLTILKQDNIGGLQVKSKSGWIDAPPIPGSFVCNIGDMLDRMTRGLY
RSTPHRVRNLSTSNRLSFPFFFDPNFNVEVKPIDLKDVVVKDDQSDRWDK
ASVHEFRGTYGDYLLKKVSKVFPELRQKVL
>Ava_2500 Rieske (2Fe-2S) region
MDANSPHIKATHKPKIFNNSERFIEGWYWVIPSDNLGINEVKSITLLGRK
LVIYRGQDYQVTICDAYCPHMGAHLGEGTVEGNELRCAFHHWKFNADGIC
VEIPCLDEPLSLKLKTWPTVEKYGIIWIWTGEIPRESVPFVPELEFQDYE
AVLGSSFVINCHPHLIMLNAIDTQYFQGVQVLDIGFEKQELNQNAIIFSK
YKRQYHDLPFKKVFRPLYKNPIYSICYWYGSTVTLTVGTDLRRCYLMLAL
RLIAGEQVEVQTIFFTKQRKGLLGWLFNRLMLWLSNSLVQELIRDERNIL
QNMQFNLKNPIKVDQSIVQLINHVEMQKPLMWKTWLLARSPETEVKETQT
KWRDELTND
>Ava_3678 Peptidase M16-like
MSALSIWYRYRFYVLLLSVSLVTVLFFSNNPAESQYKTTLSRVNSQATLV
ANNHHQFRVTENVHKTVLDNGLTVFIKEVPTVPVVSVQVWYKFGSSHEEP
GVNGIAHQLEHMMFKGTKSRPIQFGRLFSALGSDSNAFTSYDQTAYYGTV
ERDKLKALLVLEADRMQNALIDADKLASEKRVVISELQGYENSPEYRLNR
AVMQAVFPNHPYGLPVGGTKADVEKFPVEQVQKYYKNFYSPENAVLVIVG
DCQAEETLATVKEIFGGIPQRQQAKVNSQQSIVNSQQSTVKNPIVLREPG
AAGLLQVIYPLPPASHPDMPALEVVDYILTEGRNSRLYKALIESGLASEV
EASVGGLQRAGWYELLVTADPDQDIGKVDSVLNKAIANLARTGIKAEELA
RAKRQLEAAIILSNRTITDQAMQLGNDETTVGDYRFTDYYLSAIRQVTSA
DVVRVIQKYLPKSHRKVGIFQPTISTSTKEVGNKKPTQRTQENLTGDSSV
TSSEVMKYLPSLDTTTDNIQQKSQTRLPQQFTLANGLQVFLLPDKSTPTV
TLSGYVKAGTEFDPDGQAGLASLVADSLMSGTKTKNASTLAQVLDDRGVT
LDFAAYRNGMRIQGDSLAEDFPVLIRTLADGLKNSIFPKKELELNLQQAV
TSLKMELDDPGEVARRIFLQSVYPKKHPLHTFPTVESLRKIRRQDVIAFS
QKYYRPDTTVLVLMGDFEPQQVRSLIQSEFGDWPASGEPPSINYPQVSLP
KTTTRENPVLPGKTQAITYLGYAGIKRQDPRFYAALVLNQILGGDTLSSR
LGEQVRDRQGLTYGIYSDFQAEKDFGTFWIEMQTSPEDTNKAIASTQQVL
EQIHQQGVTASEVETAKRTLIGNYNVSLADPEELTNKILMNQVYGLEPSE
LHSYNQKIQQVTLNEVNQAASELLHPDQVVVVTAGPSMVARQGVR
>Ava_3027 Protein of unknown function DUF897
MDFVSLFVKDFIAQLQSPTLAFLIGGMIIAALGSELVIPESICTIIVFML
LTKIGLTGGIAIRNSNLTEMVLPMIFAVITGITIVFISRYTLAKLPKVKV
VDAIATGGLFGAVSGSTMAAGLTVLEEQKMAYEAWAGALYPFMDIPALVT
AIVIANIYLNKKKRKEAVYSTEQPVAAGDYPDQKDYPSSRQEYLSQQKGD
EDNRVKIWPIIEESLRGPALSAMLLGLALGLFTQPESVYKSFYDPAFRGL
LSILMLVMGMEAWSRIGELRKVAQWYVVYSVVAPFVHGLIAFGLGMIAHY
TMNFSMGGVVILAVIASSSSDISGPPTLRAGIPSANPSAYIGASTAVGTP
VAIGLCIPFFLGLAQAIGG
>Ava_1201 Oxidoreductase-like
MVSGLQANQGKIGIAIAGTGFGQKVHIPAFQAHHRTDIVAIYHRDIHKAK
AIAEANNIPHAFDTIVDIVNLPEVQAVSIATPPFLHYEMGKTVLQAGKHL
LLEKPVTLNVAEAQELYQLAQKQGVIATVDFEFRFVPAWQLFAELLSSGY
VGTPRLIRIDWLGSSRADTSRPWNWYSSKEKGGGALGSLGSHAFDYIYWL
FGSVRRLNAHLTTAIPQRVDPASGELKAVETDDTCLLSLELANGTPCQVT
ISAAVHASRTHWIEVYGDRGTLVIGSENQKDYIHGFRVWGSQPGQPLQEI
EIPPKLLFPQHYTDGRICAFLRVVDQWVQGIDQQKPVVPSLKEGIYSQLL
MDLSHQSHTTGSWVDVPNVEDYLI
>Ava_2804 conserved hypothetical protein
MSATPVTQLTPSSQPQQPCLPLLGASVTELTSWVQQQGQPAYRGKQLHDW
IYHKGVRSLTDISVFSKQWRAAVADVPIGRSTIHHRSVASDGTVKYLLQL
SDGEIVETVGIPTDKRLTVCVSTQVGCPMACDFCATGKGGYKRNLERHEI
VDQVLTVQEDFQQRVSHVVFMGMGEPLLNTENVLAALRSLNQDVGIGQRS
LTLSTVGIRDRISQLAEHHLQVTLAVSLHAPNQALREQLIPSARSYHIED
LLAECREYVAITGRRISFEYILLAGVNDLPEHALELSKHLRGFQNHVNLI
PYNPISEVDYKRPSGDRIQAFLTVLQQQHIAVSVRYSRGLEADAACGQLR
TKTSR
>Ava_3512 Mandelate racemase/muconate lactonizing enzyme
MQVEVILFTVNKRFPLTISRGTTAKTTNVWVKIVHNDIEGWGEASPFGVG
NYGQSTNVIKDYLQQIVPFLEPFSPLQRQEIEQVFIKYQVPSAVRAALDM
AMHDWLGKSVGLPLWQIWGLDRQAIVPTSVTIGIDTPEAARIRTRDWLDF
TDVRLLKVKLGSPQGIEADKKMLLAVQQEAPAQEFFVDANGGWSLSDAVE
MCNWLADLGIKYIEQPLVRGREEDLIKLKEKSPLPIFVDESCFNSKDIPR
LASYVDGINIKLMKSGGLTEALRMVHTAKACGLQVMFGCYSDSTLSNTAA
AQLAPLADYLDLDSHLNLSDDPFTGALLKEGRVLPNDLPGLGVKYSASAT
>Ava_1044 Polysaccharide biosynthesis protein
MLDKYKTQILFSFRQKFGSDKLAVIQNIAWLFIDRILRMGFGLVVGVWIA
RYLGVQQFGLFNYATAFVALFNPLTTLGLDNLVIRSIVREPEARYQILGT
AFLLKLAGGIGCVLLTVSSIFVLRQNDQLTIGIVAILATAGIFSAFDTID
IWFQSQVQSKYTVVAKNTAFIIIVLLKVTLIKMQAPLLAFAWAGLAEIGL
GAAGLILVYKAKGYSLWLWRWNLPLAKTFLKESWPLMLSGLSIMIYMKID
QIMLGEMVDASAVGLYSAATRISEVWYFIPMAIASSVTPSIFAAKEVSEE
LYYQRIKKLLRGLVLISIVIALPISFMSETIVTLLFGNGYTAAGAVLAIH
VWASLFVFMGVATSSWFVAEGLTHLSLRRTLIGAITNILLNLLLIPHYAG
VGAAIATVISYVIAAFLANLFHSKSHKIFWLQFQSILLFL
>Ava_4325 Signal Transduction Histidine Kinase
MNIPRLHLFDHNLQKPLVMMASNIPVAEAIAQMYQAQTSCVLVIAKHELS
GILTQTDVLRGIANQMMFADLTVGELMSQPVITVHETELENLPNILQRFH
QHQIRHLPVLDDQGQVQCVVTLEEVKTAQLEQEVVRREQMSQLLLQRDAQ
YQTSEAKFNDILNSAISTCIVSFRVFTNFDWEYIYHSVGCETVFGYPSQE
FFSNKHLWLSQVHPDDVETVIMPGFADIFAAKTFDFEYRFKHKDGSLRWI
AATHSSRYDPKADCWVVTATNIDISERKQAEAALRKSEERWQLAIAGTDE
AIWDWDILTNHTYRSDRWFEMLGYERHEMSSFDDEWSIRIHPDDYDRVMA
AQAAYLRREVPSYYTEYRLRCKNGSYKWFRSRAKGVWDEQGNPIRLVGSL
GDITGRKSIELDLIRSETQYRLLFENNPNPMWIFDPETLSFLAVNQAAIY
KYGYSQAEFLSMTVLDIRPREEVPAFFRSLNDFDIFSAVIYVGEAKHCTR
NGTLIDVEINSHLITWLGKPAKFVLAKDITEQKVAQHELQRIEAELRESK
HFIEQVISHSPQILYIFDPMVGSNVYLNRQSVDILGYTPEEIQQRGAQFF
LDVLHPDDLPLLERNLEYWQNAGDGEVLTTECRMKHQDGSWRWLRSREVV
FARDENNRPNKVLGTTQDISDSKLAELEIVHSRDLLAGIYNESADALFLV
DTTTLLTTDCNQRAVELFAASSKAELIGIEGQTLQKRQFTHDELTSIVAE
INHRGFWGQEIEYITKQGDSFWGNLAIKQICVAGQAINLVRVTDISQRKL
AEIAFYERETMLRSIGDNLPNGAVYQIVRELDGSDRFSYFSAGIERLMEV
RAEDVSADATLLYRQLCAEDIPGFVQAVEESYRHLSVFDIQLRVCTPSGM
WKWVQLRSTPRRLPDGRVAWDGLIVDVTDIKHTEETLRQSEALLAESQRV
ARLGNWDYDLATRKITWSQGLFELFQRDPALAAPSYEENLQLYHPEDQQK
LHQAVERAISTGASYKLILRAFKADGSMIYVEGLGHAQFNPQGQVIGLYG
TAQDITNRKQAQEALTKSEEQLRLTLEYTHIGNWDWNLHTNEVIWNYNHY
RLLGLDPETSTASYQAWREVVHPDDINRVEQSVANALEQHSNFEAEYRVI
RPDGSVLWLAGKGHGIYDETGNPVRMLGVIIDINDRKQTEQTLREKEHFL
SSIYDGVGNCIFVVDVVDDDFRFMGLNPAHEQLTGFRSHELQGKTPEQVL
SPILAASVRQHYQDCVEAGATITYEENFYLKNQDTWWITNLTPLRDENLN
IYRIIGSSINITEQKRAQQMLELQAVVTRNIAEGLCLVRASDGMIVYTNP
KFAQIFGYEVSELIGQHISIINYEDEHTSATEIYEAIAAAAMLHGETIYD
VQNVKKDGTPFWCRATASVFEHPEYGTVFVAVQQDITEQKQAEEKIKASL
KEKEVLLQEIHHRVKNNLGIVSSLLQMQCRRTQDPQANAILRDSQNRIAS
IALAHEKLYRSNDLANIDFAQYIPDLTTHLFDSYNVKSSCIQLSIQVEET
SLDLETAIPCGLIINELVSNALKYAFPDNREGEVQVRLYQQSDRTLTLIV
RDNGIGLPVEFDSKKTKTLGITLVQGLVKQLRGKLEIQSQSGTEFKISFK
TGRV
>Ava_3242 Plasmid maintenance system killer
MIVNFKSEETKLIFEGFTSSQYPPNIQKTALRKLLILDAATSINDLRVPP
GNRLEKLVGDRSGQYSIRINDQWRICFVWTDENNASEVEIVDYH
>Ava_4764 Serine/Threonine protein kinase with Chase2 sensor
MIIGISNKLRAAFIKDKSYRDTAVNKNWWQIILVTSLGVTALVWGVRELK
WLQSWELKAYDQMLRSRPAEPPDSRILLVTVTEEDLAKKGWNLADDTINR
LLTKLASYQPRVIALNLYRPEQTNLGAGLENPTNIISACLSSSMGRSEVP
PPPNFPPENIGYNDVISDIEEDQVVRRALVFSEANDSKCATQFSIAALAA
ITYLEQTGIHVDFEKHQFHLGKTAFPILTPNSGSYKGVDAKGYQILLNYR
HPNHLAPTFTLTEVLNNQVNPNLVKDRLVIIGTTASSIHPGLYTPYSAAK
GQPTRTPSVFIHTQIASQLLSTVLDGRPLIWYWPDWAELVWLWAWSLVGS
ILAWRLKHPLLVVVVLGIALTGLVIICAGLFLDAGWIPLIPPALTLVMSG
VSVIIYRTYRTQQQTKVIMLQVEKQQEVIEQLNILLNEATAIADTTAFYD
QHSHHDSVVVTPRKKADDLLLGGRYSISSVLGAGGFGRTYLAQDTQRPGN
PTCVVKKLMPARKDTKFLQVARRLFITEAEILESLGQHQQIPALLAYFEV
NEEFYLVEQYIAGHTLYEELPPVTGLQSEAFVIEMLKGVLEVLAFVHEHR
VIHRDVKPTNIIRSAEDNRLVLIDFGAVKLMQPPTDQKTELATVAIGTRG
YAPPEQFAGHPRLASDIYAVGMMGIQALTGILPQELPPDPETGNVMWRSH
ATVSEELALILDKMVRYHFSDRYQSAATVLQDLNRLSSLIQTSSIS
>Ava_3989 Major facilitator superfamily MFS_1
MHSNHVIYQLSPGFMTEFTSTKSNSPFSTGLPALYSIAFLSGISIGLFNP
FISTLMAQHQVDDLWIGANSTVYFLVIALGTPLVVKVLPKLGLRKTMMLG
LTMMGISAPLFTMTTSMPLWFIIRAVMGIACCLYLVSGNTALNHFCHEGN
RAIVNGLNALAFTFGFGIGPVIGSAFYNVSPKLSFLLGSALIFSGVIVVW
IALPDKAVVFQQSSRSRIFNKLKLPLQGAFAYGFAESTLVSLYPVYLLRQ
NYNIEQIGYTFAVFVVGGLLSTVPVTHIADKFGRLKVLFMSVFIVILSFL
SLSLIQNSTATQIFAFIAGASISPIFPLAMALIGAKLSRNELSSGSALFT
AIYSFGCTAGPIASSLAIKVFGDSYIFSLTIIIFAIFLVYLSIPNKNFRT
YLLNVARKIH
>Ava_2979 GCN5-related N-acetyltransferase
MSFNIMNIRCETATDYLAISEVNNLAFGQENESQLIDKIRISEFYIPELS
LIAEVNHTVVGHILFSYIELAGEEKIQVLGLAPMAVHPEFQRQGIGSALV
KAGLEKADARGESLVIVLGHPQFYSRFGFVPSVNYKIASPFPVPDDVFMV
KTLRSYQDKYQGKVIYPPAFSHV
>Ava_0152 hypothetical protein
MYLAGPVVNGCFCLFFSLVYTYYISHLQFPGYLSIVGFLVYVEFFLFAEN
LLPMDVNSYGRVISNDGKGFIDALTKTEQQFIQTILGLDRYINKEHLSGD
LFNNDLEILQVLYKAHGEFNKHNFSQVIDLLEPILDYPDILTRDKLYIID
TLASIIINHGEKQYLDKANKWSEQAMELASDIKTIQGTRGAILVELGRYS
EGKEILLPLTEVGNEAIDIAVSCCYIAKADYYLGNDDQVKDWLNKAEKIG
NVHLILQRVKQEINY
>Ava_4141 putative methyltransferase
MTQLKTLPIYDPTLFEGAAEVYAQYRTKYPPAVFDKLTEIFNLNGQGRLL
DLGTGPGLIAIPLSTKFQEVVAIDPDPEMLKEAQRQAATAGANNITWLEQ
GAELINPSLGVFKLATIGRAFHWMERQLVLERLYELLADDGAIALLNTGD
DPWKSPLPWKQAAIGVVKKWLGEERRTGQRGQGIRKPVDPPHEVVIANSK
FARQEVHEVTFEKSWTVSSYLGYLYTTAFSLKIFYGDKAEEFEADIKDAL
LAVEPSGHFTEELKATIQVVWKH
>Ava_0762 Serine/Threonine protein kinase
MAWVSGQQLQGGKYTIEQELGEGGFGITYRARDNYGRSVVIKTLNDQVQR
HSDFAKFQQDFLNEAIKLAKCSHPHIVQIHEVINEDSLWCIVMQYIDGEN
LASRVENQGVLPEAEALRYIHQIGEALTVVHNNGLLHRDIKPQNIMLRSG
KSEAVLIDFGIARQFSPNLTQKHTEYLSSGFAPIEQYEERAKRGAYTDVY
ALAATLYSLVTGEVPTMAPLRAIGTSLVEPKNISSHISDQVNQAILRGME
VKPENRPQSIQEWLALILLENNPANFPEMLGTWLGKFGSGQATLSITHQK
EDYFDGTLIHKHFWNGTAKVAIEGNVNHETNIVEIKEIRIISGYWRLAEN
QGSLSSDGKTISGIGKDSQGSYKWALQRIV
>Ava_0122 MOSC
MPYLAKILIYPIKSLDGVEIQQGRIINGGALEHDREFAIFDEHSRVVNGK
RNPSIHQLRSYFRISHREISLQFPGKDSEYVFHLDEERQTLAAILSDFFG
FAVTLAQNSQGGFPDDLQSPGPTVISTATLAEVASWFPGVTIDEMRRRMR
ANLEIDGVPAFWEDQLFSESGEVLSFQVGDVQFFGVNPCQRCVVPTRDSF
SGVAYPSFQKIFVQKRQATLPEWSATSRFNHFYRLSVNTKLPSSEAGKFI
SVGEEIKIIYKN
>Ava_2808 TPR repeat
MFNKLFKSLFNPWLVTLFTASFILCLFFGSFHRAAIVKAQTPDAYKLVNQ
GIQSYKKGDFYAAIQHWETALDFYQKNNDTPNIAIINENLARTYQQLGNK
SLTLSYWEKVKAYYHSQKDLPKVGRILTEIAQIYSNSGQTKKAISLLCGA
DTLICQTGSALQIAQEQQDKLGEVAALGSLGEAHRLQGNYDLSIKYLETV
NNSQNQTDNFAILNSLANAYAGRAQLWNLRAKSAKNHNSSKMNEFIQRSQ
DDYRIAIISLQKSIAVAAKENNKIAELRSHINFIKLAYQTVDSNILNPNQ
IETNIQQALALIEQTPDSINKVYAEIELANLPITNDEFTSSVNQCHQKTR
LPELQVKQLFHQAIQTAKKLQDARSTSYALGELGHLYECQKEYKSAWELT
NQAIWFADQNLQAKDSLYLWEWQAGRILAAQKQLNQALSFYERAYKTLEE
LRNDILTTNRDFQFDFRDVVEPVYRQLAQLQLELATSNNQDSVRHNQQLN
SALTTINSLRQAEIQNYFGNDCILAAFNNQPLYQVIEKDTAVISSIIFKD
KIGILLSLPNQQEYLHWVENKNQETLRQEISQFRNSLLAQQTINYSTKDA
ENIYDAIIIPIEKYLTEQKIKTLVFIQDGFLRDVPMAALYDKNESKYLIE
KYAVATTSSLQLTDLKPLSSQVNRALVLALSQESKIDNKVFPELAYFPIE
YGAIKKIFPESKKLENEEFAIQNLKREIQEKTYPIIHIATHAQFGIIPED
TFLVIGNNEKLTIDKLEAILRQSGNISNAVELLTLTACETATGDDRATLG
LAGVALQAGTKSALASLWPVDDDSTANLIAEFYDKLRNSGMSKAQALQAA
QLKLISAKQIPEINDKYDHPYYWSAFILIGNWL
>Ava_4427 Zinc-containing alcohol dehydrogenase superfamily
MADTVNQQIVLKSRPVGEPQESDFALVESPIPQLGEGEVLNRTIYLSLDP
YMRGRLSTNASYAASTELNSVIVGETVSQVIQSHHPDFQPGDFVLSNHGW
QTYAVAKGKTLRKLDPNQAPLSYYLGVLGMPGLTAYAALLDIGQPKAGET
VVISAASGAVGAVAGQIAKIKGARVVGIVGSDQKRDYIVKELGFNVGINR
RTQEIASALKEAAPDGIDVYFDNTAGEILEAVLQQINLGARIPLVGLISQ
YNASSPPPGPNLLPLLIKRALIKGFLVSDYQHRFSDFARDVTEWLQSGQL
KYKEDIVVGLENAPRAFIGLLRGENFGKLIVEVSQ
>Ava_2914 MazG
MESNHLAALQELIEVVAKLRSPDGGCPWDLAQTPQTLTPYVIEEAYEVVD
AIKSGDQEAIAEELGDLLLQVVLQAQIASEYGQFSLQDVAQGITQKLIRR
HPHVFGDVSVNSVDEVRQNWEQIKAAEKGEPSAESQKLSTKLSRYGRTLP
PLTAAMKISRKAAAVGFEWENIDGVWAKFHEELQEFQQALAEETPERQQA
ELGDLLFAVIQLARWHNLDPSEGLQGTNQRFVQRLQKMEAVVDRPLSDYS
LDELETLWQQAKAQLAKE
>Ava_3551 hypothetical protein
MTKDVKIYNQSSSEVDGNYLEASQQLQLDKQNYGADYIHLYVDDIEGDWL
ENWDWEEDLADYIEAFYHHCEQGNYQFAFDTLKACDDILNQPENYKKSLE
LYSYLVQKLESLEMAKLAEINNQDILAQAKQSVLNLKNNKSRENMSEEIT
FTITERKITRIEYIRKLLEKIHNGVIRKTWKFQGELEWTDCSDFYNLKGT
ASSGDFTVNFGVIAFTKDSYYLSIKKGSEPEETIVNEECDSELFNHDNRY
IDNVNFSYEEFKLIKEYLEKIEQANWQSKFSDLDALPE
>Ava_1827 Rieske (2Fe-2S) region
MSSLSQAVSIHDVRQLGINLNHWYVVARSQEVTNKPLGVTLWHQAIALYR
DSEGQIHALEDRCPHRQVKLSHGQVIGNELECAYHGWRLNHHGECAAVPY
LAENQKLPNCKIRHYPVKEQDGFIWLFPGDGEPSIEPMGLPEWDHLNYIA
SVAIINCQAHYSYLIENLMDMYHGHLHQDLQAWAQAELQDIEETDERVDA
HYQAQSYYKIDKIWSISQLFFPALRRLHPEPLDVSYIYPHWMSTLGKDFK
IYCLLCPINETQTKAYLIHFTSLNAFWRLHKLPVWFRRFVKDSLFGAAQK
FLDGLVVKDVQMIEEEQQAYLQNPQRRNYELNRALVSVQRLMKNQAS
>Ava_1354 conserved hypothetical protein
MIIVSDTSPINNLAAINHLNLLQQLYDTVIIPETVYQELTDPEFPVAGAT
EVKTFDWIQTRQVSDRNVVEALANDLDKGEAEAIALALEIKADQLLIDER
RGRLIADRLQLRYVGILGILVEAKSRGLISTVKPLMDALRNQAGFWIDAA
LYNRVLQLVGED
>Ava_2065 Twin-arginine translocation pathway signal
MKGRFHPKNNQTINPSSNESIRDVIDRMSMSRRKFIFTAASASVLTVVGE
VSIGGFLQSVQAAPIPKGTGFAGIGFKSIPPNLLNPATGKLEKDLVSVPE
GYTAKVLVAWGDPIAPGGPTWLADASQDAAAQEKQFGMHADGMHYFPISY
GNPVGRTVSSQARSLRSFLNQPINTGLLCVNHEYTHEEILHGSEGLTPVT
IQKVRKSQAAHGVSVVEITKNGNDWTYNRNSPYGRRVTANTQMRVSGPAA
GDVLLQSKKFNITPNGSVEIGTNDGYTAYGTLNNCANGYTPWGTYLTCEE
NWNGYFANPTLAANSTSDVESIPGIDKSDILVGQRRYGIPSQSSYRWPDV
DPRFNAQTNPLEPHLFGWVVEIDPYDPQSTPVKRTALGRFKHESAQVVID
DNNRAAFYMGDDERNEYIYKFVCAQPYNPGNRAANRDLLDNGILYVAKFN
DNGTGQWIPLVYGQNGLTPENGFRSQAEVLVKTRQAADRVGATMMDRPEW
TAVRPRIGGYKEIEVYCTLTNNNRRGSTPPSSNNPNGTTTAASARPPVDA
ANPRPDNLYGHIIRWREDGQRVTATTFKWDIFIEAGDKTRPEANLQGNIK
GDDLGAPDGLWFDDFGRLWIQTDQAGDGRGDWQKIGGNTMSCADPNTKQV
RRFLTSPTDCEVTGITSTPDGKAMFINIQHPGEGAPPSNPTQTSNWPYSQ
GYGPSGRPRSSTVVITRNDGGVIGGL
>Ava_2676 Phosphoesterase, RecJ-like
MQLNSSFKQSESFSLTKEPNSEEPEVDKEAEVTITRPSLPVSTGEGVGIY
LGQRNNSLAFQKSEELQKALLLHRHERQLIILQDFPDPDALSCAWAYQLI
AQQYDIKCEIIYAGTLSHQENIALVKLTGLPAQRWTPQTIKGKDLSCYQG
FILIDNQGTTSQLLTSVQQAGIPLVAVIDHHSLQTELKADFVDVRPYVRA
TATIFTQYLQAGLLGLDSSISQHVKCATALMHGLRSDTNRLMQAQEEDFM
AAAYLSRFYDAQLLNAILQANRSKRVMDVIERSLKNRIVQNNFSIAGVGY
LRYDDRDAIPQAADFLVTEENVHTAVVYGIVHDEDDELEVVIGSLRTTKL
TLDPDEFIKEAFGQDSTGRFFGGGRTGAGGFEIPMGFLSGGNENSAYARM
KWEVFDAQIKQKLLRLVNPKDNPIQSE
>Ava_0640 Photosystem I assembly BtpA
MNTQSPLPNPHYPTPSTNPQSLTIVKDVDLYQLFKTRTPIIGVVHLLPLP
TSPRWGGSLKAVIDRAEQEATALASGGVDGIIVENFFDAPFTKNQVDPAV
VSAMTVVVQRIQNLVTLPIGLNVLRNDGKSAMAIASCVRAQFIRVNVLTG
VMATDQGLIEGEAHELLRYRRELGSDVQILADVLVKHARPLSSPNLTVAV
KDTIERGLADAVILSGWATGSPPNQEDLELACDAANGTPVFIGSGASWEN
IATLLQAANGVIVSSSLKRHGRIEQPIDPIRVSQFVEAAHRSWNSKGESK
SVSSITIHS
>Ava_2989 Serine/Threonine protein kinase
MAWVAGGQLQGGKYTIERELGRGRFGITYLVTNRNSDRLVIKTLNDNLLQ
SLSQPQRARLENMFYSEALKLQKCHHQHIVKIIELFREAEYPCLVMEYLG
EDSLANLRPAILSEQDALRYIQQIGEALIVVHKNGLIHRDVRPENILLRK
RDGNLEAVLIDFGLALDFDYILTTSRTQETSAGFTPSELSTQGTIAKACS
DVYSLAATLYKLLTGRTPVDAVKRKLNGEHLVSPKEYNPQISDRTNRAIL
TGMQLDPKQRSQSMREWLDLLGLTQPETNVTSDTKSNWERNIQMWGIIVA
AIAAIGTLLSGIVGWIPIFKPSSPPSPSLVSPSQSPSQTP
>Ava_2950 Protein of unknown function DUF897
MDVSLIVSNILNPPVLFFFLGMLAVFVKSDLEIPAPIPKALSLYLLFAIG
FKGGVELIKSGVTQEVVFTLLAAMLMACFVPIYTFFILKLKLDTYDAAAI
AATYGSISAVTFITASTFLSELGITFDGYMVAALALMESPAIIVGLILVN
LFTVDEKREFAWSEVLQEAFLNSSVFLLVGSLFIGFLTGEHGWQVLEPFT
QGLFYGALTFFLLDMGLVAAKRIKDLQKTGFFLILFAILIPILNAGIGLL
IAKFIGMPEGDSLLFAVLSASASYIAVPAAMRLTVPEANPSLYVSTALAV
TFPFNIIVGIPLYQYGINLFWR
>Ava_2618 conserved hypothetical protein
MSDPQTVSNAVAKLYDTYPFPPEPILDEPPPGYNWRWNWLAAYSFCTGQK
PTKQDIRILDAGCGSGVGTEYLVHLNPQAQVVGIDLSAGTLAVAKERCQR
SGANRVEFHHLSLYDVEQLPGEFNLINCVGVLHHLPDPIRGIQALAKKLA
PGGLMHIFVYGELGRWEIQLMQKAIALLQNEKRGDYRDGVQVGRKIFASL
PENNRLVKREKERWAMENQRDECFADMYVHPQEVDYNIDTLFELIDASGL
EFIGFSNPSFWSLDRLLGKAPELIERSEELSDRQRYRLIELLDPEVTHYE
FFLGRPPIKKADWSSDDALQQAIPELNPCIDGFPGRCIFNHDYQIVNLSA
LEFEFLQKCDGNSTVAEILVSVQLSLDEVRSLIKQQLIMLIPDAKSYAS
>Ava_2891 WD-40 repeat
MVEGHPAVDIKVANERAFTSLWRAIALSHGNFSVALVYCNYRVLQEKILQ
RLDEMFAENPVQKVVLPPNTRSLYTTLHLNLLPQQQQPSALMVLGLESVE
EIDDLLRAINHIRDEFPKRHSFPMIFWVNEEVLQKVIRLAPDFASWAATP
IRFEMTTPELLQFLQQETDSLFARVLPKDMGQPQQPQFGDDYSTLEQVWE
HSNELHYAIAELHERGITLEPELNASLKFVFGLDDYVSDRIHHALNHFQQ
SLQIWQRLGDWGLGTGEQSISFPVRPTPLSSPILRQGVLQLYIGLCYCRL
AEQNQLDNRRHWETAKFYFQECLEILQVAGRPDIASEFIGQLAEVLEHLQ
GWDELQTVAETALELHHTYGSQIQLACDYGFLAQVALQQSRWVQASILAH
VSLLKLTEAQNHNDSDRHHCLFPLLLAQIYYLVLAKAQQNLGEPAVAQEY
LDKAAKELPAALENSTHQYDAHRYIRMLRTLRSLYFEAGRYLEAYRIRQK
RRSVEQQYGFRAFIGAGRLQPQRQATNPALMSPSGSSTVALEIAASGREQ
DINNLIGRISRADQKLIVIHGPSGVGKSSTVTAGLVPALQNRAIGDQIAM
PVVLQVYTDWVRELGKALTEAVAHISGDVSITPEILSTPTPAMDVGYARS
MAIADILGQLRQNANNHLITVLIFDQFEEFFFGYSDGVPPTVGDRQQKKE
FDQFLSQCLNISFVKIIFSIREDYLHRLLEFKHLSYLEAVNNNILDKQIR
YQLNNFSPEYAKVIIQKLTARSQVNLEPALIDAVVEDLSTELGEVRPIEL
QVVGAQIQDERINTLKQYQQYRPNKLIERYIKELIKECGPDNERAALLVL
YLLTDESNKRPFKTRAELAIELAELEDPEKLDLVLDILVNSGLVVLFPDI
PERYQLIHDYLVDLIRYLQQQESSLQAQLDQLRHKVQQSQTEIARLKSEL
SQKKQSKLTDTHLQQGLDLVTELRELRKREELTQLEIEQLRAELKEKELT
AQLAESQKQQRLSQAKLNRSLKIALAASCLAILGLSVSIITAVDSEIKTL
SVSSEALFASQKGLDAVKEGVKAARKLQRAIWVDPYTREQVQTALYQAVV
GVREYNRLDGHTAGVNSAVFSPDGSLIASASADNTINLWRNDGSLINTLS
KHTNVVNSVNFSPDGLLIASASQDKTVKLWNRVGQLVTTLQGHRDVVNNA
SFSPDGSLIASASSDKTVKLWSREGKLLKTLSGHNDAVLGIAWTPDGQTL
ASVGADKNINFWSRDGQPLKTWKGHDDAILGVAWSPNGEILATASFDKTI
KLWNRQGNLLKTLSGHTAGVTAVTFSPNGQTIASASIDATLKLWSPGGLL
LGTLKGHNSWVNSVSFSPDGRTFASGSRDKTVTLWRWDEVLLRNPNGDGN
DWVTSISFSPDGETLAAASRDQTVKILSRQGKLLNIFKGHTGSIWGVAWS
PNQQMIASASKDKTVKLWNRDGKLLHTLQGHQDAVLAVAWSSDSQVIASA
SKDKMVKIWSQDGQLLHILQGHTDAVNWVSFSPDGKILASVSDDTTVKLW
NRDGQLLHTLKEHSRRVNGVAWSPDGQIVASASIDGTVKLWNRDGSLLRN
LPGDGDSFISVSFSPDGKMLAANSDDKIRLWNQKGTLLMVLKGDKDELTS
VTFSPDSQILAAGGGNGKVIFQNLADIKLENLLVRGCDLLQDYLKTNLDV
TKSDRTLCPNTNNR
>Ava_4917 Major facilitator superfamily MFS_1
MKIALPSQLSAWLSPIHPQIWVLVIARFLSEVGSGFTVFYAPIFFVNQVG
LSATAVGFGLASAATSGFFGRILGGSLTDSPKWGRRRTLLLAMAVLAMGS
LVLATITNFTTLIIGNLIYGLGMGIYWPATEAIVADSSQVEHRREAFALT
RLADHLGLAIGSVLAGVLITIAQNYRWLFIVDAISFMVFFAVVYLAIKEP
GSSTTAPAQQQFTVWIAALSDRRFLTYIAVNVFFTIYISQIHNVLPIYLK
NFLNLGETGKGFDEATISALFAWHLVLAIICQMPVTSILKRSSHTLALTI
SAVFWAIGFSLIWITSTTSSHQLIWVILALAIFAIAGVSYTPSAASLVSD
LSPESQRGVYFSINSLCWAAGSFIGSPLGGWALDQPQAITQNLWLGFVLS
VAIALTILQYLNHVLGNGQ
>Ava_3671 Haloacid dehalogenase-like hydrolase
MQLVIFDIDGTLINSNSIDSDCFLSAFKLEFGFTNISSNWAKYTNITDSG
IAQEIFIEKLGRLPTQVELENIKKCFVNLLQESFHENPNLFSEILGSGQI
LAELRVNQDWCVAIATGGWYDSAILKLEKANLNIQGIPLASSDDGIARDD
IVNCAISKSKAIYQVREFQKIVFVGDGVWDVATAYNLNIDFLGRENKLDN
ILLHAGVENIVEDFSDSTKFFQLLKIAKIPR
>Ava_4006 hypothetical protein
MDCKALIYNGVRKQAITPIYSHSSAKWDMGKQAILHRRHFVNAPIKYPHS
LVKWDISQEPRIIEQIELPYGFKSVPVPSIVLLPDGERFAVTPSEVRLDF
GIKILCWDDFSVVQTVEVPHAPYVETSEEDECYGHGKCTHISWLGVTPCQ
RYLMIAEAWGDIYLVNLESGEQIRWLRRWRDYNAAFAIDPQMQFLIVNSV
DMDESHQFYRIDSILEGKLTYLGEYSGGARCHLGGLSFSPDGQRLVYTAY
RSPGVDLLSFRLERSILSTLSPQTQELLDQDWDKSEQYRAKMWGQFFRLY
HDHDGLNPWQSNIVWLDNNRLLCGVGQILAVLSAQTGEIIKTYDVDAVVN
TLVFDYTSSQAIVATKQGIKVVAIA
>Ava_4111 YbhB and YbcL
MSRRRDFLLQSFSIIGLVKLSAIACSSVGNKNTEIASTPSSKLTTERKSM
KLESVFGANSQIPAKYTCDGVDISPTLSWDEPPEETQSLALIVDDPDAPR
HTFVHWVIYDIPPTVRQLPEHITATKTLPSGGVQGKNDFGKLGYGGPCPP
SGTHRYFFKLYALDKNLGLPPGATKEQILQAMKGHVLATAELIGGYQRQP
>Ava_1546 Abortive infection protein
MTIKRLVLFFVLTPIAAFLAASSLFGSLQEPQFQSRLELYQTNIALQAQA
WKPEDSNDDSPQVIQEAILGEKPLENASKEYEKTRQSVQTNLTKTQNQLA
QLRSSSQNPAPPKPLPDVPPTTNTSRNDKEKQLQKSLQELQKFAAELDLK
LGILQAKQGQIPTAIKTWNELQKPSDTPELTKEIEQTAAVLSGLWSDPPR
LFPDAQQLIQHNLEGWFRSTALIQLYQLQQRQDALKEVQSTQQEAAAQAV
FKLAAIATVPSLAALVGTILLIFLIAQRLIKGKEALLSQNAELAWSTPWD
VETILSVFVVGFFFMGQIFVPSLLVLLPIPRPIVNVRLQAVSVLISYLLV
ASGALSVLYFSIKRFFPLPQYWFRFRLRDNWFLWGLGGYCAALPIVVIVS
LINQQLWQGQGGSNPLLQLALESQDFTALSIFYVTAAIAAPLFEEVLFRG
FLLPSLTRYVPVWGAIITSAVLFAVAHLSLSEILPLTALGIVLGVVYTRS
RNLLAPILLHSLWNSGTLLSLFLLGSSN
>Ava_0503 hypothetical protein
MKNLEQRKSWYAEVAINYTSQQRKNWYNDVADAYNRTRPRYPQQLVSRVV
ELAQLPQNAAILEVGCGPGTATTAFAQLGFSMVCLEPSQNSSQLAQQNCS
PYPNVEIINTSFEEWPLEPGKFDAVLAATSFHWVSPEIGYSKAADALKDH
CSLILLWNMTPQPEYEVYQRLHEVYQAQAPTLERYEEQEAQEKHLRRFGK
AVIDSGRFQDLVSEQLRCDVTYSIDDYLTLLSTLSPYIALDSQQRNSLFT
SLRETLEKHCGSSVEISYLSAFHVAKKI
>Ava_1056 Rieske (2Fe-2S) region
MTTETRLQGNSQNENLLDNATNEQSSKEENTFQWTKQWYPLAVVEFLDPS
RPHAMQLLGKDIVLWRDGSSQWRCFEDFCPHRLAPLSEGRVEADGTLLCA
YHAWRFDAQGNCVSIPQSKDEKTAAKNCESQKSCAVVYPTQERQGLLWVW
AEAGEQAKVESQLQTPRIVPELEDNSGKVIKSPWNFRDLPYGWDYFMENV
SDPAHVPVSHHGIIGDRYKDAKFYDMIPVRPISTQDGFAFEIQPTQGKTV
QGIHDFQPPCHMRIVSTSEDGGQLILALYATPTRPGWCRHIGCQVFVKNP
QGKKPQGLSFFGLPLPVWLVHVLASLFLHQDMVFLHYQEKIIAQKKNGKW
LNAVYTPNPQDKMVITLRQWLKNRAGGGIPWAEGYSSDIPPAEKDKQKLF
DVWTTHTQHCTVCQDALKNINRLTVLAYISAAICLFLAVILDARTVAMQA
ALGASIFTLPPVGFWLALGGAILLAVVGYQLKRFSRLFYVYEFEHARND
>Ava_3836 Survival protein SurE
MTIILTNDDGIDAPGIKALAQAVSGKNFIVAAPRDHQSGCGHQVTTTRPI
NLHRRSDSEYAIAGTPADCVRIAITQISRDVKFVLSGINAGGNLGVDAYI
SGTVAAVREAAMHGIAGVAISHYRKAKQNFDWELAAKWTAEVLEELLHRP
LEPGYFWNVNLPHLQPGETQPKLVFCQPCTKPLPANYRIDGNDFYYVGEY
GKRERTPGSDVDVCFTGNIAITQLRV
>Ava_4806 Cyclopropane-fatty-acyl-phospholipid synthase
MSATLYQQIQQFYDASSGLWEEIWGEHMHHGYYGVDGTEQKNRRQAQIDL
IEELLTWAGVQTAENILDVGCGIGGSSLYLAEKLNAKATGITLSPVQAAR
ATERAKEAGLSGRSQFLVANAQAMPFDDNSFDLVWSLESGEHMPDKTKFL
QECYRVLKPGGKLIMVTWCHRPTDETPLTADEQKHLEDIYRVYCLPYVIS
LPEYEAIARQLPLNNIRTADWSQSVAQFWNIVIDSAFTPQAIFGLLRAGW
TTIQGALSLGLMRRGYERGLIRFGLLCGDK
>Ava_2505 PilT protein-like
MYLIDTNHCSYLMEGLPSVAEHLRSLGQVQLATSVIVAGELRFMAHNSHQ
KAANLIKINAFLRRINLYGIDKETTEIYGDFKSEIIKQFGPKEKSQRKTT
KLTSIGISENDLWIAATALRHSLIIVSSDSDFVRMRQVRELALENWV
>Ava_4498 Zinc-containing alcohol dehydrogenase superfamily
MKAVCWYGANEVRVENVPDPKILNPRDAIIKITSTAICGSDLHIYGGYIP
TVQQGDIIGHEFMGEVVEVGSGVDNLKIGDRVVVPSTIGCGRCHYCEHDM
WSLCDNSNTKGWLEEKLYGNITSAIYGYSHLLGGYAGAQAEYIRVPFADV
GVVKVPPDLPDEMLLFISDAIPTGYMGAEMCDIQPGDTVAVWGCGAVGQF
AMISAYMMGAEKVIAIDRFPERLEMARKYAKAEVINYEEVNAGEALKEMT
GGRGPDACIDAVGLEAHGVGLEDFYDQTKQKLKLESDRPHVLREMMVACR
KGGTLSIMGVYGGFVDKIPFGAAFNKGLTFRMGQMHGQKYMNLLLQLILD
GKLDPSFVVSHQLPLEQAPFGYHIFQQKKDNCTKVVLKP
>Ava_3078 Peptidase C14, caspase catalytic subunit p20
MSPLGVATSHSTHTLATGKAKLWLLLVGVNQYQDERLPTLRYSAVDCQGL
AAALADATYRFPDKSEWVHHDFATQLPTLATVRNSLNKVTHQAQPEDTIL
FYFSGHGILETGSQQVILCLADTQTDDLLNTGLGLPELLQCLENSQAQTQ
LVWLDACHSGSLTFRGARSNHTPASLPNPTPQIVELLRQRAKQSKGFYAL
LSCDTNQQSWEFPELGHGVFTYYLMRGLRGEAADIQGLIDADGLYRYVYH
QTRQYIEQTNQQLRLINQQNRSRGNTQVYSEYPLQTPKRIVEGVGEVILG
VKPALVVSPDARKALIVEGLAINQTTLAFSQLLGTVGGFGIEYWPLAHTN
QDLQATISNCLQTRELETENQQNHFATVLLYLRGKLAQSSTGEPVLVLGE
NIQLSRSWLRQQLRRSLYSQQIIILDCPLDQHSHISLQDWVEDLQLGFEQ
GQCIIAAASSPDNPQQFLQILHSNLQANQEQPNLSAAAWINQLQLSSPLP
LHIWLSGTKGVIEIIPASTAAKGKQPNAIVDLGICPYRGLQAFQEEDIQY
FYGRETLTQQLIADLETKSFMAVVGASGSGKSSVVQAGLIAQLRRGQQLP
GSQQWWMKSLRPGENPLVSLSHCLVDSGTAKEKAYQQMQIEGMLYQGAQG
FAHWLHHRSEPMVVLVLDQFEELFTLAASEDRQRFIDTVLGALELSPDKF
KLILTLRADFIAPCLEIPTLAKLLQQSSVLLPPCLTQEEYRRIIIHPAEK
VGLTVDPELVEVLLQELHNSPGDLPLLEFVLEQLWEYRDKGVITLQAYQQ
YLGGIKGALERKAQGVYDTLDPEAQECTRWIFLSLTQLGEGTEDTRRRLL
KSELIVKKYSVALVERTLQVLTAAKLVVVNGDWEEAGGKRQGAGGRGQGE
NILLTTPSVTIEVAHEVIIRHWSTLRWWLEENRSRLRSHRQIEQSAALWQ
QNNQQPDFLLQGVRLAEAEEIYLNYTDELSWDVQHFIEACLHERRRKQHQ
EQSRLRQAQRAVSIISTLGLTAFGLAVFAYQQTQNAQIKEIQALNSLSEN
FLLSHKQLEALITSVQAGKEVQNISLGIPADTRTQTATTLQQAVYSTQER
NRLLHNAWVTSVSYSPDGEVIASGSVDNTIHLWRRDGKLLTTLTGHNDGV
NSVSFSPDGEIIASGSADSTIKLWQRNGKLITTLKGHDQGVKSVSFSPNG
EIIASGGSDNTINLWSRAGKLLLSLNGHSQGVNSVKFSPEGDTIASASDD
GTIRLWSLDGRPLITIPSHTKQVLSISFSPDGQTIASAGADNTVKLWSRN
GTLLKTLEGHNEAVWQVIFSPDGQLIATASADKTITLWSRDGNILGTFAG
HNHEVNSLSFSPDGNTLASGSDDNTVRLWTVNRTLPKTFYGHKGSVSYVK
FSNDGQKITSLSTDSTMKIWSLDGKLLQTLSSPLPDVTSVSFTPDNNIVA
LASPDHTIHLYNRDGILLRSLPGHNHWITSLSFSPDNQILASGSADKTIK
LWSVNGRLLKTLSGHNGWVTDIKFSADGKNIVSASADKTIKIWSLDGKLI
RTLQGHSASVWSVNFSPDGQTLASTSQDETIKLWNLDGELIYTLRGHGDV
VYNLSFSPDSKTIASASDDGTIKLWNVTHGTLLKTFQGHRGGVRSVSFSP
DGKILASGGHDTTIKVWNLEGIELQTLNLDELLNRACDRLHNYLTTNPNI
TTEEYQLCFGD
>Ava_1400 signal transducer ampG1
MREVQALRQAFQSRKMGALLLLGFASGLPLFLTSRTLQLWMQDAKVDLGK
ITLFGLLALPYSLKFLWSPLLDRFVPPLLGARRGWLICTQIGLTLAIAAL
ALQQPSQSDQVLQILAINCLIITFLSATQDIAGDAYRTDILNPLEAEPGA
SVWVLGYRIALFITSSLAIVLADYIPWNGVYLLMAVFMAGSILTTLWSPP
EPEIRNAAEKYAPISVKDVIFIVLITVLVAGLIGGVFVGYIALPVFYWLL
ASLIVAWIVSSLLLPIELLGEVTEDSPPQNLQAAIFLPFKEFFHRFGLTQ
ASVILIFIILYKLGDSLVGITANLFLREIAFTKTEIGAIQAGIGFIATTI
GVLAGGVIMTKIHLNRSLWIFGILQLLSNLGYYALAIAGKNYSLLVLAVN
IENFSAGLVTVATVAFLMNLCNHRFTTTQFALFSSLMAISRDVLSAPAGD
LAKATGWPAFFLLTLAAALPGLLLLPVVAPWNPKPVAINRPGLDDEDLWE
TK
>Ava_3177 TPR repeat
MKQKSQMETRFVKPLSCLTLSIITTIGILPPATAQETSPAKSACEQVLSN
AEQKPRTKQVQKLAQFADPQQERSQLIQQANALFSQGDLPGAEVNLCKFL
KKYSDDAFGHFQLGNVFFRGKKVEAAISAYQEAIRLKSRYAVAYNALGIV
YASQNRWSDAIAQYQKALEINPDYGDALTNVALVLWQTNKKDEALVSLEK
ALNIFKKQNRNEKANQVEQLLQRIKNSDDNLS
>Ava_2791 Serine/Threonine protein kinase
MAYCINPDCSQRENSDTSAVCQNCGTPLILQNHYRLRHLLQTDRRSYTEV
FEIEDLVNRDQPKVLKSLKEVTPQLERLFQQEASILQTLRHPGLPIGEAL
FPLVLNTGRQLWCFVMEKIPGEDLQTWLSHHQYVTSYKTALDWLKQLMQI
LQFVHQEKFFHRDIKPANIMVCPDGKLVLIDFGAARKVTQTIINREPVTI
INSLGYTAPEQRDGHAVVQSDFYALGRTVIYLLTGIDPTGDRVQDLLNWS
KYIQDPKTPKKFISLLQAMTDSYPHHRPPTAQAILAKIENIEQHSYPWPK
LLLGAACGLLLLFAGKSLYQEITLPRTCDNILNDYLSCGEESLTPSSFWG
NSQPPPAKQLGMEEYRHQNFEAAVKFLETAFQQESDPETLIYLNNSKIHQ
QFPVNQIHTIAVAVPLERRTDIGREILRGVAQAQTEALKQGRALRIIIAD
DSNREDSRTGNNARKIAQQLVKYPDLIAVLGHYSSEATKNSLPIYRQAGV
VLISATSTSQNLKDPFFFRTVPSDRIAAQKMVTYLLSELKQNQVAIFYSQ
GSEYAESLSQAVRESTKSLPLKVIDHQAAFNLASDRFNAITALNQAQTQG
AKAIVLIPDAGVGLYNAIPNALRVIQTNINQVWIVAGDSLYSSDSFKSEK
VFSSPEIQYTAWAVFWHPLNEMNSTFVKEVQNLWKIDLSSLLRNTDITWR
TTTSYDAMLVLSQAITQNPTRLGIQKTLSQPQFSVTGATGVIQFAGSDRQ
NGKITMVRFQRNCDDNGFVVIPSDRSLQCR
>Ava_0961 PHP-like
MVINFARTSASTELLKQVFQTIDAQSCPKLFNFHMHTVYSDGRLQPSVLM
EQAIAIGLQGFAITDHHTVGGYEAAQAWLENWKWNNPGVTTPHLWSGVEI
NANLLDVEVHILGYAFQPEHSSLKPYLQRKITTDREYQAGNVIEAIHAAG
GIAVLAHPARYRKSHFDLIPAAAEYGIDGVETYYAYNNPNPWKPSVIESE
QIQNLAQEYGIFNTCGTDTHGLSLLQRL
>Ava_0063 conserved hypothetical protein
MNKSLPTHFQESIYQELLPSLQIESVNPRNLIQVTYVPQPWLLLGKGNYA
AVVYHPDYPEFVVKIYAPRRPGYEEEVEVYHRLGSHPAFSECLYAQDGFL
VLKRLYGTTLYDCMHLGLPIPKQVIRDIDEALEYAQSRGLYPHDVHGRNV
IMHEGRGLVVDISDFLHQEPCSKWHNLKKAYYWLYLPVLRPLRLPVPYSA
LDWIRKCYRLGTLCKQLCRQLAPKLNCFQYLRRY
>Ava_1651 Glycine cleavage T protein (aminomethyl transferase)
MPTSAIDGKDTAAIQAATAEVAVYDRSTWGLIRVSDDDRLRFLHNQSTND
FQSLKPGQGCDTVMVTSTARTIDLVSSYVLDDAVILLVSPSRREFLLQWL
DRYIFFADKVQLTDITEETATFSIIGPGSDAVVEKLGAGGIIGQPQGNHI
TIDGGAIVAVGSGLASPGYTLILPVSQKQQVWQQIIDSGAVELSDRAWDT
LRILQGRPAPDSELTDDYNPLEVGLWQTISFNKGCYIGQETIARLNTYKG
VKQYLWGIRLNAPTEIGDTITIGDEKVGKLTSYTETPDGYFGLGYIRSKA
GGVGLKVQVGNSEGEVIAIPFVSHEYP
>Ava_1285 Peptidase S16, lon
MTSSSRIAVRELPLFPLPEVVLFPTRPLPLHIFEFRYRIMMNTILESDRR
FGVLMVDPVKGTIANVGCCAEIIHYQRLPDDRMKMLTLGQQRFRVLEYVR
EKPYRVGLVEWLEDHPPAKDLRPLATDVEQLLRDVVRLSAKITEQNIEIP
EELPDLPTELSYWVASNLYGVAGEQQSLLEMQDTAARLEREAEILTTTRN
HLAARSVLKDTFNPKL
>Ava_4314 Ankyrin
MTNNDVLLLKVAKSGDIKGLGALLAAGVGVDVCDRDGTTALMFAANLGYT
EIVRSLLDGGANVNLARKRYGLTALMLAASANQVDIVHLLISRGAAVNAT
NEDGSTALMAAAMKGYVEVARVLLAAGADVNITDKDDDTALKLAVKRGQA
DVVQLILQSGADANSEDEEGETLLMLAADSGHGDVVQVLLATGIDVNQQN
QDGGTALLAAVAAGNRAIAETLLDRGAEVNHQDQDGESALHLATVEGYVD
VVQLLLNQGANTQTKNKLGDTPLLVAALQGHDQIVETLLKYGANADGDNL
GETPLTLAASQGHTATVRILLDYGANANIRASDGKTALIKATEHNHPGVI
QLLLAKGANVNYQDSVGATALIWAASGGYNKVVQILLEGGADTNLKNRGG
YTALMIAEFNGFRSIVQILKQAGAQE
>Ava_3155 conserved hypothetical protein
MAKLMQIPHFSEANHPLVKSLFHHSDQELLDLFQHNPDAGKYFTVIFCRY
SPIVYTLIQHSARSPVQADYLFALTWRHIYYELGGLDLKNPLPGQEGLTL
QNWLINITAYCINEIQLPPTEAIHYSLQATSPPLWCYVEQALDQLPPMLR
LMVLMAQTFHWSETRIAAYLQAEGEKVTPAEVANFLQEGYRMLEDKLPAD
IRTIYLGEDLVKS
>Ava_2788 conserved hypothetical protein
MPSLKLSLTGLQIIKQARSEKGWTIDNPCWLEQASQVLEPGRNWENAEVF
AAGVSLATWKRFLKGDAIDASVFKAFCQVLGLNWQDLVERPPNSFIIGTT
QIPNIPLFFGRRYELTTLSQAIEQGTRLIAITGIGGMGKTALATKLVESR
SSHFSQTLWFSFHHNPPAKDKIPTLVPQTLMVFDGWDGILGGNRGGQYRP
EYEPYADFLRTVVQTTHTSCVIITSREQPEGLNILGAGGAVIFPLGGLME
GAIELLQHHQLTFNAQQWITLVNQYGGNPLFLNMAANFIHELFAGDVGEF
LASGTLVAGEFAPLVTQWLKQISTLEQILIKSLATKVQGFTRQEILLHLA
SRAANGDILAALLSLKRRGLIETMKDGELERFYLQPVILKCVQRLF
>Ava_2502 Fumarate reductase/succinate dehydrogenase flavoprotein-like
MLPSKIVVVGGGAAGFFGAIACARVNPQAEVTLIEASRQTLAKVSVSGGG
RCNVTHACFDAHELVQYYPRGGKALRGAFARFQAKDTVDWFATQGVRLKT
EADGRMFPITDSSETIVDCLMNAATAAGVEILTGVAVASIKQSQGNQFEI
FCRSGKIINCDRLLLATGSSRVGYQIAQELGHHIESPVPSLFTFNISEAK
LRALAGISVNPARLRLCADGASPLEQTGPLLITHWGLSGPAVLKLSAWGA
RLLHDKRYQATLLVNWLPDFSQEQVRQNILAIKNEWGKRAIALHRGVDLP
HRLWQYIIARVGITTDERWAGLSNKTLNLLVQELTQGQYLISGKGVFKEE
FVTCGGVNLKEINFKTMESKLVPNLYFAGEILDIDGVTGGFNFQSAWTTG
YLAGTAMGEHSAD
>Ava_1448 HAD-superfamily subfamily IA
MTTKAIAIFDIDGVIRDVGGSYRRALADTVEYFTNKAYRPTSLEIDELKS
EGIWNNDWEASQELISRYFAAQGTRREQLQLDYNNIVAFFQSRYRGPDPD
NWTGYICDEPLLLQPSYLEQLTQAGIAWGFFSGATRGSANYVLKQRLGLH
SPVLIAMEDAPGKPDPTGLFATISQLEDGLEEKSVILYVGDTVADMYTVS
KAREIKPHRTWIGVGILPPHVQETAARREAYAQTLVTAGAAVVLSNVEQL
NPAQIQELLQQLS
>Ava_1459 conserved hypothetical protein
MKTPSFNGKSKPPARRFSVRRMPLSRWHPGLLHLLDWGVTIGSVLLCLLL
LPTRLPGMELLGIGPNWLLIWVVAWSVKRSVWAGTFAGIVLGLLQDAITS
PHPSHAITLGLVGFLTGLLQKQRFIQEDFISIALIVFGMAILAETVFALL
LTLAGDRQTEYIWAYYQRVTLASAILSSLWAPVLYYPLNSWWQRMKMLES
>Ava_1906 Protein of unknown function DUF87
MTNDNPQQPLGSVIQGSLTEGLEVRLHPDISVEDMRVGKFLVVQGMRSRF
FCMLTDVALGTANARIIASPPSWEDTFLRDVLAGSGTYGTINLSPMLMFT
PESNESLTATDGKSVNPFVPSSTGLASFQPQTSTTMELLPVKTIPSHFSQ
VYEASVDDFRRVFGWEDDIQRKNFSIGKPLDMDVPVCIDLNRFVERSNGV
FGKSGTGKSFLTRLLLAGVIRKNAAVNLIFDMHSEYGWEAVAEGKNVNTV
KGLKQLFPGRVEVYTLDPESTKRRGVRDSQELYLSYEQIEVEDIKLCSRD
LGLSEAALDNANILYSEFGKSWIVQLLNMTNEEIEMFCDEKRGHKGSIMA
LQRKLLRLDGLKYMRAVCPQNYINKILQSLEAGKNVVIEFGSQSNMLSYM
LVTNMITRRIHEHYVRKADKFLQSKNPSDRPTPLMITIEEAHRFLDPAIV
QSTIFGTIARELRKYFVTLLVVDQRPSGIDNEVMSQIGTRITALLNDDKD
IDAIFTGVSGAGGLRSVLAKLDSKQQALILGHAVPMPVVVRTRPYDSTFY
AEIGDAAWEEKPDAEVFAAAELAKADLGF
>Ava_2738 Metallophosphoesterase
MHWFFTGRLSVDKITVKIANLSPSLQGIKLVQLSDFHYDGLRLSEEMLEE
AIAVTNEAEPDLILLTGDYVTDDPTPINQLALRLKYLQSRYGIFAVLGNH
DIHYSHSQTLITKALTSIGVNVLWNEIAYPLGHELPIVGLADYWSKEFHP
ASVMNKLDPTTPRIVLSHNPDTAKILEQWRVDLQLSGHTHGGHIVLPGIG
PVVYHYKKLLKKAPRKLRRWVPFLLGDCSKVVRYWEWAQGFHQVTNNQLY
VNRGLGTYRPGRLFCPPEVTVITLT