Gene list

Applied filters:

COG category: Unclassified
Gene type: CDS
Genomic element: chromosome

Number of genes found: 455

# Shigella flexneri 2a str. 2457T, 2457T

>S0912 host cell-killing modulation protein
MTALVTRKDLCEVRIRTGQTEVAVFVDYESRK
>S2566 IS4 orf
MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLP
LEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLG
SEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFP
RQTHAGNPALYPQVKMVCQMELKWHTEFGHLNRGDMLTSEQHRCSNEKKK
F
>S2647 hypothetical protein
MKQDIADRLEILEGQRAEAKQLRKQARRAHRNNEAELLTKYISFTNYCIY
ECYKEDAEDWLDSLPEQY
>S1498 IS91 orf
MNRATLRAVHAAGADTGKNGRKLPLRAAGYFRIPFVINTYKPEDQLFVSG
IHKEGNIQVCAALNGIVKIIKAQNFSVFFVKKIIRMKFHNIAGKFASLWC
GFAFTGFS
>S0337 hypothetical protein
MTQRPWSKLQREIYDLLTPTINLQIHCTRYPMRSQNGGSTDLPRYWITLD
KNVIWDYPKDFIAGNAGVRNFHGETCWYPYLTDICSISDLLREYIDTPKA
ELLTKQFTSDKWGLVNILRAADRRIGMRRLDQLRRKTHNIAALKIIARRS
E
>S2796 hypothetical protein
MPRTVTHNPDSPNNDDVLAASEKWDACKPPYTSAHMKICVAAAKIILAAS
GVARCSKYEKENYLRIDFSKAGKVTFYAEFPKKMGLKGKKLGEWPELVIQ
LAREKALGMADGGLRAESVHAALEMYRDDPKPK
>S0906 putative bacteriophage protein
MIYPTNTGKSGEHLRLTTLESVWIQGKLRMWGRWSYIGVKWHTEFGHLNR
GDMLTSEQHRCSNEKKKF
>S4801 putative phage tail fibre protein
MSRTLDLILLCRPVQDTVHLLMRITLQWDINKMSYFYSASTNGFYSTEFH
GTNIPDDAVEISESEWKTLINAQSVTKMITCGENGHPVIVDRPSPTPEQL
ALINDEKKSALIAEATNVIAPLQDAVDLGMATDDETKLYWHGKNIGCCLC
VLM
>S2145 hypothetical protein
MENGLPFYTNRSGKPIVSRDLFTCNKTLPPREVEPNFGAI
>S1169 hypothetical protein
MSSRVANLTVIVSSKRRRVSVARFSCGKTAQLSKKQTGYYSPEIFPSTGK
DCNPQPANCLKDQYVLRHCCVDDRSGKMGYSVKFLVLTRMDTETASLFHC
KPCYSKMTFTIYHPLTHSFFTSCW
>S1214 hypothetical bacteriophage protein
MKLKYSGLTASGNTHPKFTRGDIYRDQYGGMVTIKGVAERRITYRREGYE
YDCVMPVYQFRRDFTLVQAAPRSKPTSREKARANIQEIKKMLNVFRGKK
>S4813 hypothetical protein
MVYTVYCLGNLSKSATSNLEWVASNGLKPNQVFRLFQNCIKNNVLNNLIA
FGRSVTQTAS
>S0929 putative exodeoxyribonuclease VIII of prophage CP-933R
MSTKPLFLLRKAKKSSGEPDVVLWASDDFESTSTTLDYLLVKSGKKLSNY
FKAVATNFPVVNDLPPEGEIDFTWSERYQLSKDSMTWELKPGAAPDDVHH
QDNAQETKELAGGQEENAQADAHEDCQDCEVSVATLRFTQRLLHIFTYAA
GDLKYLHHATREQRKHITALEMDQENSYVQNLLLAIRGMAEPTTLDNAAL
LRLTDAIKAEVYWQ
>S1441 IS1294 orf
MLSAFTPRPLKRLFTANQCWTSFLDAGGLRDIEVEAVTKMLACGTRILGV
KEYICDKPECPHVRYVTNSCGSRACPSCGKKATDLWIATQLNRLPDCDWV
HLVFTLPDTQWPVFESNRWLLNDVCRLAVENLLYAARKRGQEPGIFCAIH
TYGRRLNWHPHVHVSVTCGGLNKHGQWKKLSFLKDAMRSRWMWNMRQRLL
KAWSEGLAMPESLSHITTESQRRSLVLKAGGKYWHVYMSKKTAGGRNTAR
YLGRYLKKPPIAASRLAHYNGGGA
>S0918 putative bacteriophage protein
MKIRHEHIESVLLALAAEKGQAWVANAITEEYLRQGGGELSLVPGKDWNN
QQNIYHRWLKGETKAQREKIQKLIPAVLAILPRELRHRLCIFDTLERRAL
LAAQEALSTAIDAHDDAVQAVYRKAHFSGGGSPGDSVVVH
>S1757 hypothetical protein
MKLKNTLLASALLSATAFSVNAATELTPEQAAAVKPFDRVVVTGRFNAIG
EAVKAVSRRADKEGAASFYVVDTSDFGNSGNWRVVADLYKADAEKAEETS
NRVINGVVELPKDQAVLIEPFDTVTVQGFYRSQPEVNDAITKAAKAKGAY
SFYIVRQIDANQGGNQRITAFIYKKDAKKRIVQSPDVIPADSEAGRAALA
AGGEAAKKVEIPGVATTASPSSEVGRFFETQSSKGGRYTVTLPDGTKVEE
LNKATAAMMVPFDSIKFSGNYGNMTEVSYQVAKRAAKKGAKYYHITRQWQ
KRGNNLTVSADLYK
>S2633 hypothetical protein
MKSLRLMLCAMPLMLTGCSTMSSVNWSAANPWNWFGSSTKVSEQGVGELT
ASTPLQEQAIADALDGDYRLRSGMKTTNGNVVRFFEVMKGDNVAMVINGD
QGTISRIDVLDSDIPADTGVKIGTPFSDLYSKAFGNCQKADGDDNRAVEC
KAEGSQHISYQFSGEWRGPEGLMPSDDTLKNWKVSKIIWRR
>S2783 hypothetical bacteriophage protein
MTFFFDDRDQAVPYTATADDVAPTGQQIWQELQSGKWGEIAPFTVTPEML
EAAREARRQEIEAWRTEQEAKPFTFEWNGRIWNAGPDSLGRLSPVVMLAK
SVTAQTHMAWSDADNQQVKLSMPELEELAESQKTALLTEAESVIRQPGRA
VRLNRETDESGEARGRASVFPESVRSVVYAM
>S0278 hypothetical protein
MDACLFHCKYPGITNAWIFTEEQHNHDGTGLSQVVLNAIFNLVCLLQVYV
QISYLSQQSSIIRYTAFTGP
>S1087 IS91
MLPRFADIFQQGNRWLNWLEKQPEGAVRPVVIESVTKIMACGTTLMGYTN
RCCSSPDCCHTKKVCFRCKSRSCPHCVVKAGAQWIQYLLSLVPYCPWQHI
VFTLPCQYWSLVFHNRWLLAKMSRIAADVILEICHQADVEPGIFTVIHTW
GRDQQWHPHIHLSTTAGGVTSGHTWKNLHFYARKVMSMWRYRITRLLSRK
YPYLVMPDALAAEGSSKREWNRFLDTHYRRGWNVNVSRVMDNATHVAVYF
GSYLKKPPVPMSRLEHYAGQDEIGLRYNSHRTKREEYLLMSGDESMERFS
WHVADKGFRMVRYYVFLSPAKRRLLEEVVYIITETVRKTAMQITWRGMYQ
RLLKVDPLKCVLCGSQMRFTGLKRGYRLAEQVLMHEPLARMRWCG
>S0231 lysis protein S
MKSMDKISTGIAYGTSSGSAGYWFLQWLDQVSPSQWAAIGVLGSLVLGFL
TYLTNLYFKIREDKRKAARGE
>S2126 hypothetical bacteriophage protein
MAMNVTEIVLKKPVTAHNEMLHVLELREPTYDEIEALGFPFIISGEGSIK
LDSQVALKYIPLLAGIPCSSAAQMAKLDIFKTSMQILRFFTQSETGSTSG
NDSTMLPGSGN
>S2261 hypothetical protein
MSWRLVYASAVGTSHISADLPCQDACQMQVAWLNDQQPLLVMFLADGAGS
VSQGGEGAMLAVNEAMAYMSQKVQGGELGLNDILATDIVLTVRQRLFAEA
EAKELAVRDFACTFLGLISSANGTLIMQIGDGGVVVDFGHGLQLPLTPMV
GEYANMTHFITDEDAVSRLETFTSTERVHKVAAFTDGIQRLALNMLDNSP
HVPFFTPFFNGLASATQEQLDLLPELLKQFLSSPAVNERTDDDKTLALAL
WLP
>S1994 hypothetical bacteriophage protein
MFALIQRGQIYTDRAGYPVEITRSTEHSVFFRRMDGRTGRVRIGEFSSLF
EHIDHLEYHKILAETEQEKHLKKLRAMQRK
>S0256 hypothetical protein
MADFTLSKSLFSGKYRNASSTPGNIAYALFVLFCFWAGAQLLNLLVHAPG
VYERLMQVQETGRPRVEIGLGVGTIFGLIPFLAGCLIFAVVALWLHWRHR
RQ
>S3197 hypothetical protein
MIHLFKTCMITAFILGLTWSAPLRAQDQRYISIRNTDTIWLPGNICAYQF
RLDNGGNGEGFGPLTITLQLKDKYGQTLVTRKMETEAFGDSNATRTTDAF
LETECVENVATTEIIKATEESNGHRVSLPLSVFDPQDYHPLLITVSGKNV
N
>S0731 putative S protein
MKSMDKISTGIAYGTSAGSAGYWFLQWLDQVRPSQWAAIGVLGSLLLGLL
TYLTNLYFKIREDKRKAARGE
>S0720 IS911 orfB
MCSGFIAAATDTGKTVLKNQTADGLYYAVRYLSYMASATVRPEQEASPQW
QPGEATRWDAGLLAGS
>S2462 hypothetical protein
MKKIALAGLAGMLLVSASVNAMSISGQAGKEYTNIGVGFGTESTGLALSG
NWTHNDDDGDVAGVGLGLNLPLGPLMATVGGKGVYTNPNYGDEGYAAAVG
GGLQWKIGNSFRLFGEYYYSPDSLSSGIQSYEEANAGARYTIMRPVSIEA
GYRYLNLSGKDGNRDNAVADGLYVGVNASF
>S2683 putative protein processing element
MPTIHVSIVSFSNSFAYSGGYMTEECGEIVFWTLRKKFVASSDEMPEHSS
QVMYYSLAIGHHVGVIDCLNVAFRCPLTEYEDWLALVEEEQARRKMLGVM
TFGEIVIDASHTALLTRAFAPLADDATSVWQARSIQFIHLLDEIVLEPAI
YLMARKIA
>S0907 putative bacteriophage protein
MVDLLKAARGQMCTVRIPGYCNHDPETSVLAHYRLAGTCGTAIKPHDMQA
AIACSSCHDLIDGRVKTSDYTKEELRLMHAEGVFRTQEIWRKEGYL
>S2092 hypothetical protein
MKTAKVYSDTAKREVSVDVDALLAAINEISESEVHRSQNDSEHVSVDGRE
YHTWRELADAFELDIHDFSVSEVNR
>S0690 hypothetical bacteriophage protein
MSTSVSGKNVVLTNAEFTGAGDADGNAEISWKI
>S2840 hypothetical protein
MRFSHRLFLLLILLLTGAPILAQEPSDVAKNVRMMVSGIVSYTRWPALSG
PPKLCIFSSSRFSTALQENAATSLPYLPVIIHTQQEAMISGCNGFYFGNE
SPTFQMELTEQYPSKALLLIAEQNTECIIGSAFCLIIHNNDVRFAVNLDA
SSRSGVKVNPDVLMLARKKNDG
>S0381 ISSfl3 orfC
MNNELPDDIELLKAMLRKQQSRLRQYACQVAGYEQERCTQMTLR
>S0336 hypothetical protein
MLQSRNDHLRQTALRNAHTPASLLTTLTEPQDRSLAINNPQLAADVKTAW
LKEDPSLLLFVEQPDLSQLRDLVKTGATRKIRSEARHRLEEKQ
>S3207 hypothetical protein
MSEITASRPEVVNGHTDVICSTSIRHILAVRKSTLLQIDTLIRQLAEVSA
MTESIRGKTALDWAMKQDFRCGCWLMEKPETAMKAITRNLDRELWRDLMQ
RSGMLSLMDAQTRDTWYRSLEYDNFPEISEANILSTFEQLHQNKDEVFER
GVINVFKGLSWDYKTNSPCKFGSKIIVNNLVRWDRWGFHLITGQQTDRLV
DLERMLHLFSGKPIPDNRENITIRLDGHIQSVQGKERYEDEMFIIKYFKK
GSAHITFKRLELIDRINDIIARYFPSVLSA
>S2332 putative phage tail fiber protein
MNTVADVMAGNDGEYCFHARTGKYGVYLKQDWRNEYNVGDIAVYEDSKPG
TLNDFLIAPDEGDLKPDVVKRFEEMVAQAQQSAGAAAGNAQQTAQDVAAA
ATARDDAQRFAEKARQDATVTAEDRKATAEDVTSTGANAAAAGQSAQDAA
GYARAAEQAKNDIDAALTGTLKTANHLSETAAAGEKAQQKSRDNLGLKSA
ATMEAQSDIYDRTKGRLAIPGAFGFGCAFLPEDVIRFDTKSDFLAWVRNA
LPGEYSVAGPYGIIIPDTRFEGVLSIRWTDARPETTEPRYRAKSLTFYGI
NGPIYHTRYCYWPISRLTGWVKINITTEDIIYRIVASSVRNRWGDPDIGG
LIIAAYQGEADGDKVIRLVRGQSYRGSRLGPVGISVPGTPTGTYIVSPQF
FITGCSEHSLPGSYCALSGVPDAHVSGAMPGLFIRTSRGMHRGN
>S0750 putative tail component of prophage CP-933K
MLSDMSATELGEWGDYFRMQSFSDVWMDAQFASLKALIVRMVSGSSDAAV
ADFSLLPEENGIPERTDEELMHLGEGISGGVRYGPDSQPGH
>S0238 putative packaging glycoprotein
MADENRLNSILCKFDADWMASDEARTEATNDLYFSRVSQWDDWLSNYTTL
QYRGQFDVVRPEVRKLVAEMRRNPVDVLFRPKDGANPDAADVLMGMYRTD
MRHNTAKIAVNVGVREQIESGVGAWRLVTQYEDNDPTSNNQVIRRLPIHE
ACSHVIWDANSKQMDKSDAKHCTVINALSRNGWKEFAEDYGIDPDTLPSF
QNPNDTWLFPWVSNDVVYVAEYYEVEEEKEKVFIYRDPLTGEPVSYYQQD
IKDVIDDLANRGFIKVAERKVKRRRVYKSIITCTQILKDREKIAGEHIPI
VPVYGEWSFAGDKECYEGVVRLTKDGQRLRNMIMSFNADIVARSPKKKPT
FFPEQIEGYEYMYGGNDDYPYYLQNRTDENGNDLPIGPISYMENPEVPQA
NAYMLEAATNAVKEVASLGVDAQAANGQVAFDTVNQLNMRADLETYVFQD
NLATAMRRDGEIYASMVNDIYDVPRHVTLTLEDGSEKDVQLYAQVVDYQS
GNVVTLNDIRGRYECYTDVGPSFQSMKEQNRAEIQELLTKVPQGTPEFQM
LMLQYFTLLDGKGVEMMREYANKQLVMMGLKKPETPEEMEMVQQAQQQPQ
QPSAEQIQAQGILLQGQAELLKAENQQAQIQVEAAKVEAQNQLNAAKIAE
IFNNMDLDKQAELREYLRLVGQFQQQRSKDARANAELLLKDADQTHSQRM
DFANLMRQVQIPSGGVAETPQ
>S1851 hypothetical protein
MNAYELQALRHIFAMTIDECATWIAQTGDSESWRQWENGKCAIPDCVVEQ
LLAMRQQRKKHLHAIIEKINNRIGNNTMRFFPDLTAFQQVYPDGNFIDWK
IYQSVAAELYAHDLERLC
>S1495 hypothetical protein
MRTTSFAKVAALCGLLALSGCASKITQPDKYNNYSDLKETTSATGKPVLR
WVDPSFDQSKYDSIVWNPITYYPVPKPSTQVGQKVLDKILNYTNTEMKEA
IAQRKPVVTTAGPRSLIFRGAITGVDTSKEGLQFYEVVPVALVVAGTQMA
TGHRTMDTRLYFEGELIDAATNKPVIKVVRQGEGKDLNNESTPMAFENIK
QVIDDMATDATMFDVNKK
>S0924 putative bacteriophage protein
MLNVAIENQNGWNYSAPAPHKTGAGIATPTMTTAHNRAQAVFLCVKHSHI
QIMVGRAGQPQGWPVSVVTGCSNPVRLTTHEIATSGGESFKLTIEAAIMA
TILTLSHPDATIENGRAVTTSVAVAEFFRKMHKTLFRRSKLSNARRSLTG
LILSPSPIPTPKAKNAQCTKSPKTASFSW
>S1832 hypothetical protein
MNHQDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHS
AAGQLVARTVFLSPPYSVAEEELSVLLESIKQNGDYADIACMTGSQDDYY
YSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAP
YYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGL
CEWFEVEQFQNP
>S0921 hypothetical bacteriophage protein
MNETELKHVIALLLEDAKRLQQLEPNAGTGARIWLAKEALESGDYDSEEA
FYKAEGRAGYSPGLGGV
>S0695 hypothetical bacteriophage protein
MDELISIDSRCPLLEKLKLELTTPHRDFDRNGRVMVESKKDLAKREIPSP
NVADAFIMAFAPIDTSLDIWEQLGRQA
>S2645 hypothetical protein
MDAFEVLANKGAELVAQRDKTANEGERALLNKQIKAIRMAQFKLISNEVI
ETIKPQIPQIVADAIKAAGLEKRIAGLKTNAEKMEMYKQSGAPNPDEYFS
PDVEFMAQVEERLGSCLTEEQRRYFDGVDSSAGIDLNSYFGREIEHFDAA
PSMPEPEPEMATDPEAEQRKRVFNAIGIRY
>S1995 putative bacteriophage protein
MNKAFELWVHQRYGNHYDLTRDVDGFYCREIVKRMFEVWCHCRGLSVV
>S2700 hypothetical protein
MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEA
QQSTLSVESPVQR
>S1957 putative host-nuclease inhibitor protein Gam
MSVKLRLPQSAYPIFDRIEEMAWARHYQQIVREEKETELADDLEKGLPQH
LFESLCIDHLQRHGANKQAISHAFDDDVEFQERMAEHIRYMAETIAHHQV
DIDLED
>S0212 hypothetical protein
MRESADSPLPDHARLAASAHLVRESRNPDGLATTLAH
>S1955 putative exodeoxyribonuclease VIII
MNTDKQVYPLYYEAKNDKVRKRLGIKGGFYWAEAKKLSIAISRGAVAIDD
AGYDEDDFKKPVRVNLPVVDDLPPEGVFDTEFCNRYEKGGEDGITMVFIA
SSPSVQDKPASTDNTNVNGEDMTEIEENMLLPVSGQELPIRWLAQHGSEK
PVTHVSRDELQALHIARDEELPAVTALAVSHKTSLLDPLEIRDLHKLVRD
TDRVFPNPGNSSLGLMTAFFEAYMDADYTDRGLLTKEWMKGNRVSRITRT
ASGANAGGGNLTDRGEGFVHDLTSLARDVATGVLARSMDVDIYNLHPAHA
KRVEEIIAENKPPFSVFRDKFITMPGGMDYSRAIVVASVKEAPIGIEVIP
AHVTEYLNKVLTETDHANPDPEIVDIACGRSSAPMPQRVTEEGKQDDEEK
TQPSGAMADEQATAETMEPDATEHHQNTQPLDAQSQVNSVDEKYQELRAE
LHEARKNIPPKNPVDADKLLAALR
>S2213 hypothetical protein
MNFNSRLESRYSYELMKKASEYSELYGDNLIQLGLEDGIYFYKGMAIGDV
FGLARYSDWTISNPECEVIPQDDLIEKMKSFNSSFIVISKRSYANFNPEK
YPKFKVLMDTPNGILIAIK
>S0709 putative bacteriophage protein
MFNLMLTSKKLTKTAINEALRRMKKAGLNKSELEAFLRDMINGKQKSWLA
HCTDAEALCIDRVISEVLAEHPGLISVLRQRYEGRGMTKCKMAELLNDAH
PKWSLRTCERRIEHWLKVAEFILYKPMVMAFGIEKKVIAF
>S0269 hypothetical protein
MLRYRRMVQLFFPLSPNVDIYNLWRKTRIKLIKGRYFSMLQIMLFLMHTR
YKYQKYKPDKYLINIQQSLNYSIYSSTTKRQKIITKSYK
>S2359 hypothetical protein
MDVQQFFVVAVFFLIPIFCFREAWKGWRAGAIDKRVKNAPEPVYVWRAKN
PGLFFAYMVAYIGFGILSIGMIVYLIFYR
>S2640 hypothetical protein
MSTLLSLIGVFRHTNNKFIRIAKQTGASLLPIFHPMMLSPGGANPDVDCI
TKAAKSRDKKPPLCSEGWKFGGAKRDRTADLLHAMQALSQLSYSPTMRLR
TKFAGCKIWWS
>S2646 hypothetical protein
MFGNYISTSPEKIIMLALRIMQGIAKTLAEHVLDLKHSPLSKQAMKRQTL
RLWAEYSLGTINKIIDMKSGPSNQSAEEMEFIRRLILIRRDIHSQLHSVG
IDINDGTGD
>S0559 putative homeobox protein
MKMTKLATLFLTATLSLASGAALAADSGAQTNNGQANAAADAGQVAPDAR
ENVAPNNVDNNGVNTGSGGTMLHSDGSSMNNDGMTKDEEHKNTMCKDGRC
PDINKKVQTGDGINNDVDTKTDGTTQ
>S1217 hypothetical bacteriophage protein
MSEPKFGEKFYKHNGRITILQISAATPGWWVETDEGSSPVASWELCVLSY
PDRDVYQDILPVISTDKGMKPVDIKNMGFQCVMLTEKMMEEMKKNNAGSL
H
>S2644 hypothetical protein
MHSLDSYFQRTTAPKSAAQERREEFHEKVMRSADYIADKFVETVRPLVDE
VADKLQSEMPEDMEGTAKRRLICELSRRFGVSISAFK
>S4273 hypothetical protein
MIRWMNEPLWPFIERKKSMRNLVKYVGIGLLVMGLAACDDKDTNATAQGS
VAESNATGNPVNLLDGKLSFSLPADMTDQSGKLGTQANNMHVWSDATGQK
AVIVIMGDDPKEDLAVLAKRLEDQQRSRDPQLQVVTNKAIELKGHKMQQL
DSIISAKGQTAYSSVILGNVGNQLLTMQITLPADDQQKAQTTAENIINTL
VIQ
>S0237 putative terminase large subunit
METSTRRAGGEQLKGQYADAGFMMLQEHATWPDGGNAVEPGITELRDMML
DGRFKVFNTCEPFFEEFRLYHRDENGKIVKLNDDVLSAVRYAYMMRRFAK
MMRDIKKPKEKKIPAPIRPIARRT
>S2715 hypothetical protein
MATGGAALAGKAVMGAAAGAAGGASALQAAFQKASASMETGGDMSSMGSV
VSSGGNGGGEAGTAGSSPFAQAAGFGDSGSSSSGGGFAKAAKLATGTASE
LAKGVGSQVKQGFQERVSETTGGKLAASIRESMEPKEASQSGQFEGNSLG
ADSGPDSNEVRS
>S2143 putative cell killing protein of prophage
MLNTCRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCE
VRIRTGQTEVAVFTAYEPEE
>S0230 hypothetical bacteriophage protein
MTQKYELIVKGIRNFENKVTVTLALRDKKRFDGEIFDLDISLDRVEGAAL
EFYEAAARRSIRQVFLDVAAGLCEGDELLPETRPCSEARYTIKINSSDNS
ITGC
>S0702 IS911 orfB
MCSGFIAAATDTGKTVLKNQTADGLYYAVRYLSYMASATVRPEQEASPQW
QPGEATRWDAGLLAGS
>S0734 endopeptidase
MSRVTAIISALVICIIVCLSWAVNHYRDNAITYKEQRDKNAGELKLANAT
ITDMQQRQRDADALDAKYTKELADAKAENDALRRKLDNGGRVLVKGKCPV
PSSAETSSASGMGNDATVELSPVAGRNVLGIRDG
>S3063 hypothetical protein
MASEDWQREHHCRTGIIFTAFAIEAMFIFYRKQVDPGYDKTQKECRKTMH
KNTLKLCGINNYMGTKPYQIIKECLEVRDAIAHGDSYTSSFNFSADHLDN
QDDIAKVSWR
>S0829 putative receptor
MFVDRQRIDLLNRLIDARVDLAAYVQLRKAKGYMSVSESNHLRDNFFKLN
RELHDKSLRLNLHLDQEEWSALHHAEEALATAAVCLMSGHHDCPTVITVN
ADKLENCLMSLTLSIQSLQKHAMLEKA
>S4823 hypothetical protein
MVKSHGTVSVDGKVSDADLTYLEEVANSTGQEVDKSRLTSQACARTALIT
DVGIALATELETAGQKWSLGFPPKFQRVDLFNYNVLVRNYDSSAFKGDRY
HNTKNGINADIGASTDLDDNWTLGLVAQNLISRSIETKEVNGITETFRIR
PQVTAGVSWHNAMFTTAFDVDLTPASGFTSDSNRQFAAIGTEFNAWKWAQ
LRAGYRQNLAGNDGSAFTAGVGISPFDVVHLDVAGLIGTDNTYGAVAQFQ
FTF
>S1244 hypothetical protein
MMMKKFALLAGLFVFAPMTWAQDYNIKNGLPSETYITCAEANEMAKTDSA
QVAEIVAVMGNASVASRDLKIEQSPELSAKVVEKLNQVCAKDPQMLLITA
IDDTMRAIGKK
>S2329 hypothetical bacteriophage protein
MLPQHSDIEIAWYASIQQEPNGWKTVTTQFYIQGFSEHIAPLQDAVDLEI
ATEEENSLLEAWKKYRVLLNRVDTSTAPDIEWPEEPDTM
>S0979 hypothetical protein
MEQLRAELSHLLGEKLSRIECVNEKADTALWALYDSQGNPMPLMARSFST
PGKARQLAWKTTMLARSGTVRMPTIYGVMTHEEHPGPDVLLLERMRGVSV
EAPARTPERWEQLKDQIVEALLAWHRQDSRGCVGAVDNTQENFWPSWYRQ
HVEVLWTTLNQFNNTGLTMQDKRILFRTRECLPALFEGFNDNCVLIHGNF
CLRSMLKDSRSDQLLAMVGPGLMLWAPREYELFRLMDNSLAEDLLWSYLQ
RAPVAESFIWRRWLYVLWDEVAQLVNTGRFSRRNFDLASKSLLPWLA
>S3341 hypothetical protein
MKNLPIGGVWKGKVKLHSNSPAQDYFANITLNTLDPNHIDVFFPEFAHAT
PRVQLDLHPTGSVNGSNYAQDLTMLDMCLYDGFNGNAISYEIMLKDEGRP
AAGRRDGYFSIYRQGGTTTDEGERIDYRVKMYNPETGGQIDVRNNENMVW
NSINLKRVRPVVLPGIRYAVMCVPTPLTLAVDKFSVMDKQAGYYMGKLSV
IFTPSLPTIN
>S2642 hypothetical protein
MRISEQQRQILDYLAKNESITARQAADLVYGGNVTRVQVDAARRSLMTMV
SNGLIRKQGRAFVAERHINHAYSQDVEELRRQILWVFSKQGQEYFLSEHP
NFAAKELTVSGLSCALPGPAVKVALLTSSYGTP
>S1114 hypothetical protein
MFRPFLDSLMLGSLFFPFIAIAGSTAQGGVIHFYGQIVEPACDVSTQSSP
VEMNCPQNGSIPGKTYSSKALMSGNVKNAQIASVKVQYLDKQKKLAVMNI
EYN
>S1551 hypothetical protein
MHANNNVKTAISRGHAMNDQMFVETLIITSSFFAIAVVLVLSVLLIERIG
>S3272 hypothetical protein
MLKQKIKTIFEALLYIMLTYWLIDSFFAFNKYDWMLESGGNICSIPSVSG
EDRILQAMIAAFFLLTPLIILILRKLFMREMFEFWLYVFSLVICLVCGWW
LFWGRFIFCY
>S1679 hypothetical protein
MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVE
GCEEKLAPLDLIRLGMN
>S3059 hypothetical protein
MNMTKNAFQFGIEPVRITDTDNIQVNEGLPTNADPQVYALQLAKTVKAML
NGVLKDAQDNIPFPVEVLPTRNSLPTPIIAHTLADRSVLVPVRGGKRPEV
VTAPSGTEITVEPIEQAILVSHQTKLWDQKSTTGFTQGTLQQDALNICDN
VIRTINSKMVDVLESSKLLKTVELPALTGSLTAKADAIMDALYENTESSF
GSEVSDYGIIAHESHLKALSRLAAKQGFGGEDAIVDMLGTDVAYYNGEDR
GVFMMAKRFTALSFGCFRHDGESITVVLSRDGDSQSHDLEILGKVFVVAE
AATTIKMGTGSATAVLPVVKRLSFTKEAN
>S1489 hypothetical protein
MADTVYLKYTPSDYSFNLGKNASGIVFNQTAPPEEGAEEKTINSSRGRQH
TDVYPALAGNTDTAMFH
>S1429 IS91 orf
MARSAKPRKRKPASQRSKLPRYVVKLHEDDFFDEEDAEVLRFDSFDDAVE
CCADLNIPFFVDAGNKKLVFWFVRVDDEGYPEIARCTEREFATILAGISA
GGMYCPECGTVHWPDGVAPPF
>S0319 hypothetical bacteriophage protein
MHRIDTKTAKKDKFGAGKNGFTRGNPQTGTPATDLDDDYFDMLQEELCSV
VEASGASLEKGRHDQLLTALRALLLSRKNPFGDIKSDGTVKTALENLGLG
EAAKRNVGTGANQIPDMSLFASINTVTAAAQKFPSGLILQCGQLNGAPNV
SSTYGMRFPMTFSRVIAVVVTLNVTGAAGQPTVSATSVQNTGFNITVSPG
SGYGSSADAYYIAMGY
>S2536 hypothetical protein
MIAEFESRILALIDGMVDHASDDELFASGYLRGHLTLAIAELESGDDHSA
QAVHTTVSQSLEKAIGAGELSPRDQALVTDMWENLFQQASQL
>S0908 hypothetical bacteriophage protein
MDETEFQQVYKSVLNVLWNWILFRKFSSPEQVENVAAQLLEFA
>S2790 hypothetical bacteriophage protein
MQNVAATVLAQYAASPRLNALINSFNAALSPDSFISDFYGLIWNIDTAEK
YGLDVWGKIVGVSRWLTVKDDFNYLGFSESRMDTPVMDDPCPFNQAPFYN
GKSDTRTVDLSDAVYRRLILMKAMSNITDCSVPDINLMLRFMFGKKRRAY
VLNNGGLRMSYVFESALSLAELAIIQSSGALPSPPGVYVSVVLKESRNEG
Q
>S1517 hypothetical protein
MCGIFSKEVLSKHVDVEYRFSAEPYIGASCSNVSVLSMLCLRAKKTI
>S1694 hypothetical protein
MVTPVSICNYISLPDDFPARNIAPQVKEVLKDFIDALSTIICDEEWRTSL
NTNSATKKIFNNLDNLSYIQRTSFRGNDTLYNEKVQFKLTYPVKNGRHKE
NIEFQVVINLSPIYLDNFRHDGEINIFCAPNPKPVTMGRVFQTGVERVLF
LFMNDFIEQFPMINLGAPIKRAHTPHIEPLPPDHHTAADYLRQFDLLVLN
FISRGNFVILPRLWNNSEVHRWFVNKDPNLITAILDITDSELKEDLLQSL
MDSLGSNKHVQPEVCICFLSLLAEQESPHFQDLFLFFANMLLHYHQFMNP
NESDLNDVLMPASLSDDKIIKHMARRTLKLFVKNETPPKVTHEDLVKNRP
RSPVRPPIPATAKTPDLPERH
>S2127 hypothetical bacteriophage protein
MAKIAGTCFFKVDGQQLSLTGGIEVPMNTNVRDDIVGMAGDVDYKETWRS
PYVKGTFKVPKNFPVDKITTSDQMTITAELANGMVYVLSAAWLHGEANHN
AEEGTADLEFHGEEGGYQ
>S1505 hypothetical protein
MKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSI
PALLNGEHSVLYRTRYDQQSDAWIMRLA
>S0005 hypothetical protein
MQSIVLALSLVLVAPMAAQAAEITLVPSVKLQIGDRDNRGYYWDGGHWRD
HGWWKQHYEWRGNRWHPHGPPPPPRHHKKAPHDHHGGHGPGKHHR
>S0763 hypothetical bacteriophage protein
MWLAENEGVKFWLNVLTELKKRGWSPYLFIKPVVCPPQHHFSALIEILSM
VVNSANPVR
>S2328 DNA-damage-inducible protein
MRVEICIAKEKITKMPNGAVDALKEELTRRISKRYDDVEVIVKATSNDGL
SVTRTADKDSAKTFVQETLKDTWESADEWFVH
>S0706 IS911 orfB
MCSGFIAAATDTGKTVLKNQTADGLYYAVRYLSYMASATVRPEQEASPQW
QPGEATRWDAGLLAGS
>S0738 putative head completion protein gp3
MVTVAELQALRQARLDLLTGKRVVSVQKDGRRIEYTAASLNELNRAINDA
ESVLGTTRRRRPLGVRL
>S0242 putative DNA stabilization protein
MNLTTKGDLVLAALRKLGVASNATLTDVEPQSMEDGVNDLEMMMAEWLGG
DASPGINVGYIFADADVAPDPGDEHGLSNNAINAVIFNLACRIAPDYALE
ASAKLITTARYGKERLVKLSAMDRAKAAKCKSGYPNRMPVGSGNQLAKWN
GWNYFHRKEPCDNGSE
>S2786 putative tail fiber protein
MAISDTGRNSADNVDGVYYRPLQKLINGTWYNVASI
>S3185 hypothetical protein
MNGIKTGLGITPGEHIISADSALSRNIRQCFCLSCHGRLILQTDAQGAWF
EHDLHALSEQQKADCVVLNPEKSHPYIEDMAMFLSPLPVVLEWHCVMCEQ
FFHGKKFCEACGTGIYCRAVCTKSVYSYQPDLFEDCGGAST
>S3968 putative transport protein
MKPTTLLLIFTFFAMPGIVYAESPFSSLQSAKEKTTVLQDLRKICTPHAS
LSDEAWEKLMLSDENNKQHIREAIVAMERNNQSNYWEALGKVECPDM
>S2643 hypothetical protein
MTNIERLERCADIMRRRWIYDPEAGLLFSRETKDVIKGSMTNKGHLVVTV
HEGKFKVTLTYQKACYVYTYGAYDETLYEVHHVNLNKQDNRIKNIRLMPK
EKHRRIHKNVRKLIRLGHTQWLQPQLIEQSL
>S0746 putative tail component of prophage CP-933K
MKGLENAIRNLNSLDTRMVPQTSAWAVNRVAAKIVSVATRQVAQNTVAGD
NQVKGIPLKTVRERVRLLKASPSGKMYARMRVNRGNLPAIKLGTAQVRLA
RSRHGSNSRHRGSVLKVGKYLFRDAFIQQLANGRWHVMRRIDGKNRYPID
VVKIPMSGPLTQAFEDARDRIIAAEMPKQLGYALKQQLRLHLSK
>S0920 hypothetical prophage protein
MNTIDLGNNESPVYGVFPNQDGTFTAITYTKSKTFKTEAGARRWLGRHSG
E
>S1621 hypothetical protein
MTYQQAGRIAVLKRILGWVIFIPALISTLISLLKFMNTRQENQEGINAVM
LDFAHVMIDMMQANTPFLNLFWYNSPTPNFNGGVNVMFWVIFILIFVGLA
LQDSGARMSRQARFLREGVEDQLILEKAKGEEGLTREQIESRIVVPHHTI
FLQFFSLYILPVICIAAGYVFFSLLGFI
>S0743 major capsid protein
MGLFTTRQLLGYTEQKVKFRALFLELFFRRTVNFHTEEVMLDKITGKTPV
AAYVSPVVEGKVLRHRGGETRVLRPGYVKPKHEFNYQQAVERLPGEDPAQ
LNGPAYRRLRIITDNLKQEEHAIVQVEEMQAVNAVLYGKYTMEGEQFETV
EVDFGRSVGNNITQAGGTEWSAQDRDTFDPTHDLDAYCDFASGTINIAIM
DGKVWRLLNGFKLFREKLDTRRGSNSQLETAVKDLGAVVSFKGYYGDLAI
VVAKTSYVAEDGTEKRYLPDGTLVLGNTAAEGIRCYGAIQDAQALSEGVV
ASSRYPKHWLTVGDPAREFTMTQSAPLMVLPDPDEFVVVQVK
>S0247 putative prophage DNA injection protein
MATWQGTNGGLLAGIGGVNSNAPSVNDIGNTLQLIRQNNDIERSGANNVG
LTALQGLSGIAGVFQQEKQAQRQKEFQQAYANAYASGDRGALRQLATQYP
DQIESVRKGMGFIDEEQRNSIGTLAAGARLASSSPEAMQSWLQNNAGELA
RVGVNPHDVAQMYQQNPRQFGEFVDHLGMNSLGPEKYFDLQDKMQGRQVT
MRGQDLDSQTAARNQAITMRGQDIQANLGQQRINLDAETNRINNENKRLD
RMLSAETNDLKRQEIQSRIAANNQQLQQKQQALNDGYKDGINTLTTSMFT
LNDIVSSPSLKSITGLRGVIPNVPGSQAADTQARLDTFKSQAYLTAVQAM
RGMGALSDAEGKKLDQAVGSLQNSQSEESFRRNAGVILNTLNQKRNEAVG
KYVQQNGIKRVEAPQASIDYLKQHPELSIDFINRYGYLPSLGQ
>S1427 IS91 orf
MPDALAAEGSSKREWNRFLDTHYRRGWNVNVSRVMDNATHVAVYFGSYLK
KPPVPMSRLEHYAGQDEIGLRYNSHRTKREEYLLMSGDEFMERFSWHVAD
KGFRMVRYYGFLSPAKRRLLEEVVYIITETVRKTAMQITWRGMYQRLLKV
DPLKCVLCGSQMRFTGLKRGYRLAEQVLMHEPLARMRWCG
>S2702 putative membrane protein
MKKVFLCAILASLSYPAIASSLQDQLSAVAEAEQQGKNEEQRQHDEWVAE
RNREIQQEKQRRANAQAAANKRAATAAANKKARQDKLDAEATADKKRDQS
YEDELRSLEIQKQKLALAKEEARVKRENEFIDQELKHKAAQTDVVQSEAD
ANRNMTEGGRDLMKSVGKAEENKSDSWFN
>S2550 putative fimbrial-like protein
MSKFVKTAIAAAMVMGVFTSTATIAAGNNGTARFYGTIEDSVCSIVPDDH
KLEVDMGDIGAEKLKNNGTTTPKSFQIRLQDCVFDTQETMTTTFTGTVSS
ANSGNYYTIFNTDTGAAFNNVSLAIGDSLGTSYKSGMGIDQKIVKDTSTN
KGKAKQTLNFNAWLVGAADAPDLGNFEANTTFQITYL
>S1007 homolog of Salmonella FimH protein
MQIIFGEKCVLLLRLFFAAVLMLWCAQTAAYIGQCHTTQGNPYIGVNFGV
KTLDEEENTAGVVKDKFYQWNESNDYYVSCDCDKDNVRSGRWAFAADSPL
VYLGDNWYKINDYLAAKVLLQVKGSSPTAVPFENVGTGADTRWHICDPGG
QRLGGQGASGNSGSFSLKILQPFVGSVVIPPMALARLFECYNIPAGDSCT
TTGTPVLVYYLSGTINSLGSCSVNAGETIEVDLGDVFAANFRVVGHKPLG
ARTAELAIPVRCNTGNAGLVNVNLSLTATSDPSYPQAIKTSRPGVGVVVT
DSQHNIISPAGGTLPLSIPDDADSIRE
>S0745 putative tail attachment protein
MDGFDNLFDAALAGVNEVILRDMGISAVITSGELEGTHLTGVFDDPESIS
FVAGGIRLEDSSPCLFVKTADISQLRRQDTLTIGDDSFFVDRITPDDGGC
CYIRLRRGSPAPQNNARMRRYDEGT
>S3081 hypothetical protein
MVLWQSDLRVSWRAQWLSLLIHGLVAAVILLMPWPLSYTPLWMVLLSLVV
FDCVRSQRCINARQGEIRLLMDGRLRWQGQEWSIVKAPWMIKSGMMLRLR
SDGGKRQHLWLAADSMDEAEWRDLRRILLQQETQR
>S3847 hypothetical protein
MNIAAVAGSPTGDMIEIGQHLRVRHFCYQTAKQRGDIGKIIRIAFAEVKF
RCDSQIPLQRQTATNVADMFMHAKNFLHYNDHWQWAIALLWSGMVSRHVI
PL
>S0499 hypothetical protein
MLTKYALVAVIVLCLTVLGFTLLAGDSLCEFTVKERNIEFRAVLAYEPKK
>S0219 putative cytoplasmic protein
MKPNITWGLQKIQAHGIADNTIYATEGTQIIFTVTSPWIPVFKVNDDVKI
ALTLTRHEEAWWIINQSTEYCCTVNDQIVEPHHRMRLNEGDLIEWGLSS
>S3018 hypothetical protein
MMKKTAAIISACMLTFALSACSGSNYVMHTNDGRTIVSDGKPQTDNDIGM
ISYKDANGNKQQINRTDVKEMVELEQ
>S0932 hypothetical bacteriophage protein
MSLAQCPGFLFAHREECTVEIKKIINPRYTESGAVDCDVFFDDRDQAVPY
TATADDVAPTGQRIWQELQSGKWGEIAPFTVTPEMLEAAREARRQEIEAW
RTEQEAKPFTFEWNGRIWNAGPNSLGRLYPVVMAAKSDIVRDVMTWGDAD
NQQVKLSMPELEELAAAMVQAQVDRNDEIYRRQREMKEELSGLDDLASIR
AFDVE
>S1946 hypothetical bacteriophage protein
MRFEICIAKEKMTKMPTGAVDALKEELTRRISKRYDDVEVIVKATSNDGL
SVTRTADKDSAKTFVQETLKDTWESADEWFVH
>S0744 putative DNA-packaging protein
MATKEENLKRLCELAERLGREPDVSGSAADIAQRVAELEEELGDAGEVAE
PDTYSPSKDDPTAHKGESVPEQPGIVTGDETGQVTVVALATLHTKCNGSV
IFVSPGTSFRVSAEVAASMAAGGLAKRQ
>S2784 putative tail fiber protein
MSVVISGALTDGAGIPMSGYHIILKSRVNTPEVVMNTVADVMTGNDGEYC
FHARTGKYGVYLKQDWRNEYNVGDIAVYEDSKPGTLNDFLIAPDEGDLKP
DVVKRFEEMVAQAQQSAGAAAGNAQQTAQDVAAAAGYARAAEQAKNDIDA
ALTGTLKTANHLSEIAAAGEKAQQKSRDNLGLKSAPTMEAQSDIYDRTKG
RLAIPGAFGFGCAFLPEDVIRFDTKSDFLAWVRNALPGEYSVAGPYGIII
PDTRFEGVLSIRWTDARPETTEPRYRAKSLTFYGINGPIYHTRYCYWPIS
RLTGWVKINITTEDIIYRIVASSVRNRWGDPDIGGLIIAAYQGEADGDKV
IRLVRGQSYRGSRLGPVGISVPSTPTGTYIVSPQFFITGCSEHSLPGSYC
ALSGVPDAHVSGAMPGLFIRTSRGMHRGN
>S1218 hypothetical bacteriophage protein
MTAQIAAYGQLVGDPLVKQSSKGTPEPPTENYADLRNAVNRTYADMGLSK
LAE
>S1225 hypothetical bacteriophage protein
MTEELITLEEVKLHCRIDGDDEDQLISGYIAASLEACQIHIGRRFDDGLA
FTPAIKIGCMMFIAHLYENRQIVADNAKTRVPMTIGALWTAYRDVGGY
>S2580 hypothetical protein
MKVNLILFSLFLLVSIMACNVFAFSISGGVSERSYKETEKTSAMTTTHST
KLQPSQAILFKMREDVPPLNLTEEITPTYPTKANYLIHPVR
>S2870 hypothetical protein
MNINHSPHDGLVIINKGNEEVEGTWPNKLQPGIYKNMGSNSVNIIINNTR
KIIPPCKVFTLRGGSLNINIPRRSALLLGKTGEPPNYLYL
>S0104 hypothetical protein
MLLHASVGRLQVVEDAVFLFQHTNKRGALGQLFQA
>S1697 hypothetical protein
MQSLDPLFARLSRSKFRSRFRLGMKERQYCLEKGAPVIEQHAADFVAKRL
APALPANDGKQTPMRGHPVFIAQHATATCCRGCLAKWHNIPQGVSLSEEQ
QRYIVAVIYHWLVV
>S1730 hypothetical protein
MTSDNKQEYTIHIFGQDITSQDGLFRLDEIRRVGIQIGKVEDNRSTEVAR
FMRTKAGRSLTNPENGFVKKINMGHLGTSWLADPVAAIEFGRWLDLGYGL
AVTRAFVSMTSSATVQSAVASAEDIKPETKEFIQRHGTGLYIAGREVSQA
EYSDYQAVTAGRCEWR
>S0246 putative DNA transfer protein
MLYAFKLGRKLRGEEPYCPEKGGKGGSSDKSAKYAAEAQKYAADLQNQQW
QTIMKNLAPFTPLAEQYVNQLQNLSSLEGQGQALNQYYNSKQYKDLAGQA
RYQSLAAAEATGGLGSTATSNQLATIAPTLGQSWLSNQMSNYNNLANVGL
GALQGQANAGQTYANNMSSIAQQSAALAAANANKPSSLQTAISGGTSGAI
AGAGLASLLGTSTPWGAGIGAGIGLLGSLF
>S1783 hypothetical protein
MTTTTPQRIGGWLLGPLAWLLVALLSTTLALLLYTAALSSPQTFQTLGGQ
ALTTQILWGVSFITAIAMWYYTLWLTIAFFKRRRCVPKHYIIWLLISVLL
AVKAFAFSPVEVGIAVRQLLFTLLATALIVPYFKRSSRVKATFVNP
>S0099 hypothetical protein
MAIFPKAMTGAKSQSSDICLMPHVGLIRRGQRRIRHLVQMSDAA
>S0241 hypothetical bacteriophage protein
MTHMIFRHGDMKKWKGVGYDFEIVKAEELQEYLDAGWFSHPDDLLKDVAE
PEPEEKQRKKPGRKPKAAADEPDNEG
>S1614 hypothetical protein
MRLVKPVMKKPLRQQNRQIISYVPRTEPAPPEHAIKMDSFRDVWMLRGKY
VAFVLMGESFLRSPAFTVPESAQRWANQIRQEGEVTE
>S2118 putative bacteriophage protein
MLPQHSDIEIAWYASIQQEPNGWKTVTTQFYIQGFSEHIAPLQDAVDLEI
ATEEENSLLEAWKKYRVLLNRVDTSTAPDIEWPEEPDTM
>S0742 capsid protein small subunit
MVTKTITEQRAEVRIFAGNDPAHTATGCSGISSATPALTPLMLDEDTGKL
VVWDGQKAGHVVGILVLPLEGTETVLTYYKSGTFATEAIHWPERVDTHKK
ANAFAGSALSHAALP
>S2595 hypothetical protein
MFRSLFLAAALMAFTPLAANAGEITLLPSIKLQIGDRDHYGNYWDGGHWR
DRDYWHRNYEWRKNRWWRHDNGYHRGWDKRKAYERGYREGWRDRDDHRGK
GRGHGHRH
>S2706 hypothetical protein
MKKPNQDDEPFFITEEIAAEMIAGGYEFELPPIPCTIRLRDVLERMTDAE
LALQPGEIADQERERCRRKPCSTS
>S3187 hypothetical protein
MQTYNTNPAYEMAPQLLEHFNQHLDSLFGVYSKLLPFRMDFAYRKNTLSY
RCACRYAMCAEMLQLINEVGEKLVGYAWVMEYTERKGLHIHFVGYLNGQS
HRSSYLVSRLMGDIWRRVTEGNGYYHWCRFNKNYPVNINHVIHYSDHKAV
NDLRYAISYLAKREQKECGIILKCSGLPEKSNRGRPRLDSPLPGICALV
>S2634 hypothetical protein
MKSTEFHPVHYDAHGRLRLPLLFWLVLLLQARTWVLFVIAGASREQGTAL
LNLFYPDHDNFWLGLIPGIPAVLAFLLSGRRASFPRTWHVLYFLLLLAQV
VLLCWQPWLWLNGESVSGIGLALVVADIVALIWLLTNRRLRACFNEEKE
>S0239 putative scaffolding protein
MDQTTDIQASEELTLPGNHAAASADGLVVDNANDSAGQEEGFEIVRKDDE
KPKQDPATNAEFARRRIERKRQRELEQQMEAVKRGELPEHLRVNPELPKQ
PDPNDYLSEDALAKYDYDQSRALAAFQQANSEWQIKAMDARSQAVAEQGR
KTQEFTQQSAQYVEAARKHYEAAEKLNIPDYQEKEDAFMQLVPPAVGADI
MRLFPEKSAALMYHLGANPEKTRQLLAMDGQSALIELTRLSERLTLKPRA
KPVSEAPLPDEPIQGHAVAANISAIEKQMEAAANKGDVETYRKLKAQLNK
GIR
>S1948 hypothetical bacteriophage protein
MNKPQLAESQKTALLTEAESVIRPPGCAVRLNRETDESGEARGRASVFPE
SVRSVVYAM
>S1661 putative lysis protein S of prophage CP-933V
MDQMEKITTGVSYTTSAVGTGYWFLQLLDRVSPSQWAAIGVLGSLLFGLL
TYLTEVYWQ
>S3292 IS4 orf
MTRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMR
DLASMGQLVKLPTRRGRAFPRVVKERPWKYPTAPKKSQSVA
>S0704 hypothetical bacteriophage protein
MPSSAETSSASGMGNDATVELSPVAGRNVLGIRDGIISDQTALRMLQEYI
RIQCLGG
>S1782 hypothetical protein
MTVQDYLLKFRKISSLESLEKLYDHLNYTLTDDQELINMYRAADHRRAEL
VSGGRLFDLGQVPKSVWHYVQ
>S0227 putative lipoprotein Rz1 precursor
MRKLKMMLFGASLIMVAGCSSKENALCHPQPKPPAPPAWAMMPPSNSLQL
LDETFSVSGTESSETRQH
>S1847 hypothetical protein
MTHIAFGHYLHSKRWYTVQVLSKSEDAMSTQLDPTQLAIEFLRRDQSNLS
PAQYLKRLKQLELEFADLLTLSSAELKEEIYFAWRLGVH
>S2705 putative transposase
MPHVAARTASRDRDTGRYQSHRPEQTLLYQIVDEYYPAFAALMAEQGKEL
PGYVQREFEEFLQCGRLEHGFLRVRCESCHAEHLVAFSCKRRGFCPSCGA
RRMAESAALLVDEVLPEQPMRQWVLSFPFQLRFLFASRPEIMGWVLGIVY
RVIATHLVKKAGHTHQVAKTGAVTLIQRFGSALNLNVHFHMLFLDGVYVE
QSHGSARFRWVKAPTSPELTQLTHTIAHRVGRYLERQGLLERDVENSYLA
SDAVDDDPMTPLLGHSITYRIAVGSQAGRKVFTLQTLPTSGDPFGDGIGK
VAGSSLHAGVAARADERKKLERLCRYISRPAVSEKRLSLTRGGNVRYQLK
TPYRDGTTHVIFEPLDFIARLAALVPKPRVNLTRFHGVFAPNSRHRALVT
PAKRGRGNKVRVADEPATPAQRRASMTWAQRLKRVFNIDIETCSGCGGAM
KVIACIEDPIVIKQILDHLKHKAETSGTRALPESRAPPAELLLGLFD
>S1903 hypothetical protein
MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRT
AIAHHGRKHKEDMEARHEQLTKGGTIL
>S1991 hypothetical bacteriophage protein
MNTVTINNKQFPVIEYRGQRVVTLAMIDEVHQRPEGTARAAFNRNREHFI
SGVDYAELGADVIRTDLPEGTFSKFAPSGIVLFESGYLMLTKPFNDALAW
QVQRELVNSYFRTRTPLTEIEMIAAIAADAVRQQKRLNHVEEQLETVTEA
VETIKRGNMRAGYVGYRQVVAKSGMSDTKCRNLVNAYQIPTDTHEFMTPD
GLLSRRAIVELEPFMAAFHQMMSEADPRGTRWYHPKMGLFQVIGWEDKA
>S1969 putative tail fiber protein
MSVVISGALTDGAGIPMSGYHIILKSRVNTPEVVMNTVADVMTGNDGEYC
FHARTGKYGVYLKQDWRNEYNVGDIAVYEDSKPGTLNDFLIAPDEGDLKP
DVVKRFEEMVAQAQQSAGAAAGNAQQTAQDVAAAAGYARAAEQAKNDIDA
ALTGTLKTANHLSETAAAGEKAQQKSRDNLGLKSAATMEAQSDIYDRTKG
RLAIPGAFGFGCAFLPEDVIRFDTKSDFLAWVRNALPGEYSVAGPYGIII
PDTRFEGVLSIRWTDARPETTEPRYRAKSLTFYGINGPIYHTRYRYWPIS
RLTDWVKINITTEDIIYRIVASSVRNRWGDPDIGGLIIAAYQGEADGDKV
IRLVRGQSYRGSRLGPVGISVPSTPTGTYIASPQFFITGCSEHSLPGSYC
ALSGVPDAHVSGAMPGLFIRTS
>S3351 hypothetical protein
MSSKVERERRKAQLLSQIQQQRLDISASSREWLEATGAYDRRWNMLLSLR
SWALVGSRVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG
>S3509 hypothetical protein
MVSRNHDFPVWMMLYNFLTKVFDESCPANRFSR
>S2884 hypothetical protein
MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIP
VNILIAWPYGMYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAIL
LVVGADWHQIMAAVSSNIVVSMLMGAVYGYFLDYCRRLFKVSRYQQVKA
>S2340 hypothetical protein
MSEWASEDINAPSLLTGNPALQPQTGGILNIPANHQRIFTLFFTCRSQTN
GDVVDQYAVEDRQQRLESRLLALKPSAQRNKQMCGECLGGVILTALFDPR
SQAM
>S3130 hypothetical lipoprotein
MNYQLYVRNYSRNTNYEIVATADGLDVLNGKQGSLNNNGYIVNAGDSLVI
KGFRKDKHTEAAFQFANVADSYAANSAQGDVRNTGVIGFAAFELQGPAQN
ALPPCSGQAFPADNNGYAPPPCRK
>S1440 IS1294 orf
MVRYFGFLANRVCGEKLPQVYRALGMDKPEPVAKVCYAQMVKQFLSRDPF
ECVLCGGRMVYRRAIAGLNVEGLKKNARDISLLRYMPA
>S3342 hypothetical protein
MDNRTLFNAQGGNFTFNFLPLGAIDGMRIGSTLSYLNQAQSQQGTPVMVL
LSRNSRVDAYRNEQLLGSFYLNSGSQFIDTSSFPPGSYSVALKVYENNQL
TRTELVPFTKTGGLTDGNAQWFLQAGKTTSQVSDDESSAYQLGVRLPLHP
QYELYAGLANADDVSAFELGNNWTADLGRVGNLAISASVFRNDDGGKGDM
QQANWSNPGWPTLGFYRTNSDGDACTTDSRESYNALSCYESISATVSLNF
VGWNMMLGYTCTQNNTDDSLRWDKQQSFENNYLRQTTAQSISETVQLSAS
RAIVMRDWILSTSVGVFHRNDNGGDNDDNGLYLSFSLSDTPTMDSNNNSH
STNVPTDYRYSEQDGDQTSWQLSHTFYNDSFSHKELGVTVGGLNTDTINS
AVNGRWDGQYGNVYATVSDSYDRKNHDHLSAFTGTYSSTLAVSRYGVNLG
ASGTDDLLGAVLVDVKGFSEQDEESQDLQLEARVAGSRTLQLGQSDSVLF
PYPGFQSGFVEVNDSSQGNQQGTTNIINGAGNRELMLLPGKLRYREVSAS
FNYNYIGRLLLPAAVKKFPIVGLNSAMLLVAEDGGFTLEINGSEKELYLL
SGQQFLKCPLSVVKKRASIRYSGDVTCSVVTYSQLPESIQVQAQLKQPKL
RGNVQTAQREVAP
>S2139 putative Q antiterminator of prophage
MNNQYLQFVREQLIIATADLSGATKGQLEAWQENAMFDTGRYRRKKIRYR
DEVTGKMITRDNPPIPGKQSLAKGTSIPLVSPVEFSTSSWRRAVLSLEEH
YKAWLLWCYSGSICWEYQIAITQWAWNEFNAQSGTRKIAGKTQERLKKLI
WLAAQAVKAELFGGEGYEYQELALLAGVTTKNWSKTFTRHWVAMKHIFHR
LDSEALLFVMRTRSKKRRHFQSKVLQK
>S0240 putative coat protein
MALNEGQLVTYALDEIIGTVQNLTPMASKVTKYPPPAESMQRSSNTVWMP
VEQEAPTQTGWDLTGNATGILELSVKCNMGDPDNDFFELRADDLRDERSY
RRRIQASAKKLANNIESAIAKQATEMGSLVVHDTRAIGPSTGLPGWDFVS
DAERLMFSRELNRDMGISYFLNPDDYRKAGRNLVDGDIFGRVPEEAYRNG
TIQRQIAGFDEILRSPKLPAVTKSTATGVTVSGAQKFKPQAYTLDTDGNK
ENVDNRVATVTVSSTTGFKRGDKISFIGVKFLSQMAKNVLTDDATFSITR
VIDGTHIEITPKPIALDDASLTKEEKAYANVNTSLADTTPVNVLNVATTT
ANVFWADDSIRLLSQPIPVTHELFAGMKTSSFSIPGIGVNGIFATQGDIN
TLSGKCRIAVWYSACAVRPEAIGVGLPNQTA
>S1954 putative bacteriophage protein
MITSYEATVVTTDDIVHEVNLEGKRIGYVIKTENKETPFTVVDIDGPSGN
VKTLNDGVKKMCLVHIGKNLPAEKKAEFLATLIAMKLKGEI
>S0585 hypothetical protein
MNALSGLQVITRRPDKRSASGVTKKCRKSLKTAPDTKTVFKGSFASHYVD
KSSQVNPGTHITVRGSIHGEVSIKKMLKGSRCRTAL
>S3246 hypothetical protein
MNNHFGKGLMAGLKATHADSAVNVTKFCADYKRGFVLGYSHRMYEKTGDR
QLSAWEAGILTRRYGLDKEMVMDFFRENNSCSTLRFFMAGYRLEN
>S2142 hypothetical bacteriophage protein
MPVFHNTRVLVEPEPKSMRNLPSGVVPAVRQPLVEDKTLLPFFSNARVIR
AAGGAGALSDWLLRHIKSCQWPHGDYHHSETVIHRYGTGAMVLCWHCDNQ
LRDQTSESLEQLAHQNLSAWMIDVIGHAISGTQERELSLAELSWWAVRNQ
VADALPEAVLRRSLGLRAEKIRSMYRESDIVPGEQTATSILKQRTKNLAP
LPHAHQQQNPPQEETVVSIAVDPESPAQYLQRQKPQREEMPVYTRWVKTQ
KCMTCGNQADDPHHIIGHGLGGMGTKADDLFVIPLCRKCHNELHAGVKDF
EEKHGSQLLLLIRFLMHARNSGVLKWKA
>S1516 hypothetical protein
MNEVVNSGVMNIASLVVSVVVLLIGLILWFFINRASSRTNEQIELLEALL
DQQKRQNALLRRLCEANEPEKADKKTIESQKSVDDEDIIRLVAER
>S2273 hypothetical protein
MPLLYLNTRECRWYLMGEGEMKKIAAISLISIFLISGCAVHNDETSIGKF
GLAYKSNIQRKLDNQYYTEAEASLARGRISGAENIVKNDAAHFCVTQGKK
MQIVDLKTEGAGLHGVARLTFKCGE
>S1909 IS1294 orfB
MPDALAAEGSSKREWNRFLDTHYRRGWNVNVSRVMDNATHVAVYFGSYLK
KPPVPMSRLEHYAGQDEIGLRYNSHRTKREEYLLMSGDEFMERFSWHVAD
KGFRMVIRGPESGEAAITGRCGVRHNGDSEKNGEANHKERDVSAVTEG
>S2129 hypothetical bacteriophage protein
MPDPQGGEIVYVGGTLLDLNRYELYYQFDFTAKYEITEEDTRQAEDVNAL
PDLSLLSIDVDYIDPGTGPDGDIEHHLEMRFPQN
>S0759 putative tail component of prophage CP-933K
MHMSLAQCPGFLFAHREECTVEIKKIINPRYTESGAVDCDVFFDDRDQAV
PYTATADDVAPTGQQIWQELQSGKWGEIAPFTVTPEMLEAAREARRQEIE
AWRTEQEAKPFTFEWNGRTWNADASSVARLSPVVMLAKSVAAQTHMVWSD
ADNQQVKLSMPELEELAAAMVQAQVDRNDEIYRRQREMKEELSSLDDLAS
IRAFDVK
>S2334 putative phage tail fiber protein
MAISDTGRNSADNVDGVYYRPLQKLINGTWYNVASI
>S1229 hypothetical bacteriophage protein
MTEAEILRLIRRVSGISQQADEQTTQPDSVTAENYARVVAEVMRRDGIQL
NDADMRDIRIRVLEMLAYNRRVELYREKEKITYHWKKPERLRR
>S4502 hypothetical protein
MSLKLLIILVLTPRVSTRLLAVRLGYLAVYSMNMSVRKLFRLPIFFVMIV
R
>S0917 putative bacteriophage protein
MASNWIKLEVITPDKPEIFRLAEILNIDPDAALGKVIRFWAWADQQMIDG
NADCNARGVTKSAIDRITFMSGFADALIQVGWLVENDGGLSLPNFERHNG
KSSKKRAVTNERVTKIRELKRKGNAASVTQTDQKALPVEEEEEDLNTDLP
LNPPRQKRASKKFEPEAIELPDWLPETLWHEWVRFRQALRKPIRTEQGDT
GTGKIPSAGFYT
>S4826 hypothetical protein
MNNEQMLIKCRDPRKCRFYAEYLNHKYPNEFPHFAPDNPQNCIRNVNLFL
KEKYCNVSPPPFGTFGTTEEEALNNEINELEIIAGQAILPEESFLWLKTD
ELATFFTWTTIYLSKEDHTKFPKINHNSSGIKNGIEVRKIRTVTDETLNT
YKKSNLPDNPSSHNERIKVIISYFDSCHSLINKNAKSIYLEELRQAWLKK
KSACQKFKWLLPDDDEMCQWFWTRLKQKQDEKHPVHNPVTRWFIPSSTKE
YYLASLVAFALWKEAPDTIELFRTRINHAWHQKKQKDKYRSDGKKAINIH
IRSNVKEMLDELRTVYGLRTGEFFEMLITEKYLEHKRRK
>S3196 hypothetical protein
MLQIVGALILLIAGFAILRLLFRALTSTASALAGFILLCLFGPALLAGYI
TERITRLFHIRWLAGVFLTIAGMIISFMWGLDGKHIALEAHTFDSVKFIL
TTALAAGLLALPVQIRTIQQNGLTPEDISKEINGYYCCFYTAFFLMACSA
YAPLIALQFDISPSLMWWGGLLYWLAALVTLLWAASQIQALKRLTSAISQ
TLEEQPVLNSKSWLSSLQNDYSLPETLTERIWLTLISQRISRGELREFEL
ADGNWLLNNAWYERNMAGFNEQLKENLSFTPDELKTLFRNRLNLSPEAND
DFLDRCLDGGDWYPFSEGRRFVSFHHVDELRICASCGLTEVHHAPENHKP
APEWYCSSLCHETETLCQDIYERSYTGFISDATANGLILMKLPETWSTNE
KMFASGGQGHGFAAERGNHIVDRVRLKNARILGDNNARNGADRLVSGTEI
QTKYCSTAARSVGAAFDGQNGQYRYMGNHGPMQLEVPRDQYAGAVETMKN
KIREGKVPGVTDPAEASRLIRRGHLTYTQARNITRFGTIESVTYDIAEGS
VVSLAAGGISFALTASVFWLSTGDRDAALQTAAVQAGKTFTRTLAVYVTT
QQLHRLTVVQGMLKHIDFSTASPTVRQALQKGTGAGNISALNKVMKGTLV
TSLALVAVTTSPDMIKMLRGRISGAQFIRNLAVASSGVAGGAVGSVAGGI
LFSPLGPFGALTGRVVGGVLGGMIASAVSGKIAGALVEEDRVKILAMIQE
QVTWLAGSFLLTGHEIENLNANLARVIDQNALEIIFAAGIQQRAATNMLI
KPLVVSIIRQRPVMEYDASHLGKMVNRLEEAFPPELPA
>S3115 hypothetical lipoprotein
MHPFTSLTLWALAACTTLILPAQTILPIYSAATFFCLIALKATRRRAKYV
VWLMFSLGAGLWLVHGGWLPEWLSGTPRSPERWSHAITLWLLILAIVSTS
QL
>S2043 hypothetical protein
MRLLILTLSLITLAGCTVTRQAHVSEVDAATGIVRLVYDQAFLQHAHTDR
YVSRGIADRACQQEGYTHAIPFGQPVGNCSLFAGSLCLNTEFTLSYQCHH
SAFPVFL
>S1088 IS91
MARSAKPRKRKPASQRSKLPRYVVKLHEDDFFDEEDAEVLRFDNFDDAVE
CCADLNIPFFVDAGNKKLVFWFVRVDDEGYPEIARCTEREFATILAGISA
GGMYCPECGTVHWPD
>S1949 hypothetical bacteriophage protein
MLMSLAQCPGFLFAHREECTVEIKKIINPRYTESGAVDCDVFFDDRDQAV
PYTATADDVAPTGQQIWQELQSGKWGEIAPFTVTPEMLEAAREARRQEIE
AWRTEQEAKPFTFEWNGRTWNADASSVARLSPVVMLAKSVAAQTHMVWSD
ADNQQVKLSMPELEELAAAMVQAQVDRNDEIYRRQREMKEELSSLDDLAS
IRAFDVK
>S1128 hypothetical bacteriophage protein
MENFIDTLENWRDQKQIISVICKDGIKIDDSIITSFKASKDVGISNGLRI
QLTFQEINFKAIVGQTDVSAATGRTSTTNDGGATSKKNTGNTTTSLGSPM
LTCKELFSYSASELSDEALKARVTCSKSVSVKNGESTFTA
>S3374 hypothetical protein
MHQKQKLIDAGLKLMATAKMPDALRLSGLRHLCNILNLRAFVGRIRRSRS
IRHE
>S0533 hypothetical protein
MLLPWLFPQTKRNNFAAKVNSENQEKLLILRKEDASASSGQSELSLVQMG
LKQVVASSFWLVFEFQHFKVGCNAARKFPVNGIANAQPQQCGTYRSHNGK
LSIAIGHFCRIHQRAHTHFAIAKIAEFNPAVHCHHVVWHLFWRTHLGAIQ
LCV
>S2558 hypothetical protein
MLPSISINNTSAAYPESINENNNDEINGLVQEFKNLFNGKEGISTCIKHL
LELIKNAIRVNDDPYRSNINNSSVTYIDIGSNDTDHITIGIDNQEPIELP
ANYKDKELVRTIINDNIVEKTHDINNKEMIFSALKEIYDGDPGFIFDKIS
HKLRHTVTEFDESGKSEPTDLFTWYGKDKKGDSLAIVIKNKNGNDYLSLG
YYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKAIMAESFATGS
DHQVVNELNGERLREPNEVFKRLGRAIRYNFQVDDAKFRRDNVKEIISTL
FANKVDVDHPENKYKDFKNLEDKVEKRLQNRQTKYQNEINQLSALGVNFD
DI
>S0232 hypothetical bacteriophage protein
MMNPCGYSYGYTLIYLWSSPLVTLERVTEVLTLRQWPQSSRLERGQGGRG
TDLCLLDTYRDFCEIFLIQGHNLPPLAPTDRKVKTCKHQSNSYCGHTADC
YYLSLPVEHLADQVDTEDELKQLL
>S1968 hypothetical phage protein
MEIKKIINPRYTESGAVDCDVFFDDRDQAVPYTATADDVAPTGQQIWQEL
QSGKWGEIAPFTVTPEMLEAAREARRQEIEAWRAEQEAKLFTFEWNGRIW
NAGPDSLGRLSPVVMLAKSVTAQTHMAWSDADNQQVKLSMPELEELAAAM
VQAQVDRNDEIYRRQRELKEELNSLKDLNSVRNFIVE
>S2701 putative outer membrane lipoprotein
MMKFKKCLLPVAMLASFTLAGCQSNADDHAADVYQTDQLNTKQETKTVNI
ISILPAKVAVDNSQNKRNAQAFGALIGAVAGGVIGHNVGSGSNSGTTAGA
VGGGAVGAAAGSMVNDKTLVEGVSLTYKEGTKVYTSTQEGKECQFTTGLA
VVITTTYNETRIQPNTKCPEKS
>S0286 putative cytoplasmic protein
MTGEQILATHRSGKTEVYQRQAGFITGPAKVLMLTLTTQRPFDDHTDQLW
TAWLTSFQPAKS
>S2144 hypothetical bacteriophage protein
MARVEGELFKIARASLEAEPIAWRYRYVKKGVMDSQGELWVGDWKYVPKK
EDCNDRPNYEIQALFTAPPVPVTSEELVKAVHFYEQLKRENPPASGNQIN
GLTMSVKRPAN
>S2341 putative membrane protein
MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG
>S1802 hypothetical protein
MKFMLNATGLPLQDLVFGASVYFPPFFKAFAFGFVIWLVVHRLLRGWIYA
GDIWHPLLMDLSLFAICVCLALAILIAW
>S2019 hypothetical protein
MNVVSNTQLLEQRIADFFTLSDEHKKARVLLDTLACSCPARIFGGMVRDL
GLYGVDGFSSDLDIVIGRSREELFQTLAELPVKQLRFNKFGGIRFRYHDF
EFDIWNLNETWAFQEKLIFCEDESSLLNEVA
>S1971 hypothetical bacteriophage protein
MYWYGHEMYYSPGSNTVSWRFCAPSGHGLSGMAISDTGRNSADNVDGVYY
RPLQKLINGTWYNVASI
>S3687 hypothetical protein
METCEFMQLKNCIICFIGHILCYTMKKQPNGGQISTLFQHHCAQQCAPAK
P
>S2215 hypothetical protein
MSLNILIIYFLGMVGQFNKIAIFLIFTVCWVLSIIKRQQFRWLAINNIEF
STLFVILFLVLIFVVTLLSSLRAPGDWDDTMYHLPLARSLVEHHAIVVEQ
YLRFPLFPQNADLLMALGLQLGDVRLAQFLANICFFVIACGLVGCSWEIT
KTYYPGIIATILLFTINPLKDHLGYAYIDLTLSLFCCSQYSYIYSLRKQ
>S1533 hypothetical protein
MQIKVIYSLIDNMVNFKDKNMPAVIDKALDFIGAMDVSAPTPSSMNESTA
KGIFKYLKELGVPASAADITARADQEGWNPGFTEKMVGWAKKMETGERSV
IKNPEYFSTYMQEELKALV
>S2596 hypothetical protein
MKSPYCHRSNYKLAIAIITVITGTVVTGEIVITGIAIMNGAKTAGGVMIM
AIIVAGISVKHMSVAIVKAGAIVTIIAEKAADMGTAIKRVLQWSTMPDAT
LARLIRPTNRLFNA
>S4818 hypothetical protein
MLERTMAFKHYDVVRAAPPSDLAEKLTHKLKEGWQPFGSPVAITPYTLMQ
AIAAEGDVVVSGATEPE
>S3252 hypothetical protein
MNALSGLQTHEDSTCCNRFCRPDERSASGNSTLLRFGGFFADQTYLQVTR
LMQRIHYLHQRLVIDGFVRSEEDGGVFLAFG
>S0243 putative packaged DNA stabilization protein
MPIQQLPLMKGVGKDFRNADYIDYLPVNMLATPKEILNSSGYLRSFPGIA
KRSDVNGVSRGVEYNMAQNAVYRVCGGKLYKGESEVGDVAGSGRVSMAHG
RTSQAVGVNGQLVEYRYDGTVKTVSNWPTDSGFTQYELGSVRDITRLRGR
YAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVC
FGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFI
SNPATGAPSVYIIDSGQVSPIASASIEKILRSYTADELADGVMESLRFDA
HELLIIHLPRHVLVYDASSIANGPQWCVLKTGLYDDVYRAIDFVYEGNQI
TCGDKLESVTGKLQFDISSQYDKQQEHLLFTPLFKADNARVFDLEVESST
GVAQYADRLFLSATTDCINYGREQMIEQNEPFVYDKRVLWKRVGRIRKNV
GFKLRVITKSPVTLSGAQIRIE
>S1958 putative bacteriophage protein
MGFAGDGKTYTASELAIGLVMLMRQRGIEAGNRPVMFLDTETGSDWVKPR
FDAENIELFTAKTRSFVDLLEAINEAESSGSVMIIDSISHFWTGLCDEYA
RRRNRKRGLEFSDWAWLKQEWRRFTDRFVNSQAHIIMCGRAGYEYDFFEG
DDGKRQLAKTGIKMKAETETGYEPSIRIQMEKQMNIETGQVWRTARILKD
RSTRIDGQVFSNPTFKNFLPHIESLNLGGEHPGIDTSRDNSELFANDGTP
TWLKEKRAKEIALDEIIELLNKHHGGTSNDAKRAKADLLEKTFTSRSWER
IKGMDWPTIKAGRNALWIELEGVEYAFPDPQTQNETQQAGYDENIPV
>S0550 hypothetical protein
MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH
>S1497 IS91 orf
MLPRFADIFQQGNRWLNWLEKQPEGSVRPVVTESVTKIMACGTTLMGYTQ
WCCSSPDCCHTKKVCFRCKSRSCPHCGVKAGAQWIQYLLSLVPDCPWQHI
VFTLPCQYWSLVFHNRWLLAEMSRIAADVILEICHQTDVEPGIFTVIHTW
GRDQQWHPHIHLSTTAGGVTSGHTWKNLHFYARKVMSMWRYRITRLLSRK
YPELVIPDELAVEGNSKRDWNRFLDTHYRRGWNVNVSRVMDNATHVAVYF
GSYLKKPPVPMSRLEHYAGQDEIGLRYNSHRTKREEYLLMSGDEFMERFS
WHVADKGFRMVRYYGFLSPAKRRLLEEVVYIITETVRKTAMQITWRGMYQ
RLLKVDPLKCVLCGSQMRFTGLKRGYRLAEQVLMHEPLARMRWCG
>S1904 hypothetical protein
MKKLDARQKMAHICARFIHMAGRPYMFLYQHMLVFYAVMAAIAFLITWFL
SHDKKRIRFLSAFLVGATWPMSFPVALLFSLF
>S1835 hypothetical protein
MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRI
QRSELEKQAMETVINALVK
>S0233 hypothetical bacteriophage protein
MTTSNTLSDYAVLVPERLEAILDRAAQRAGHEDISELLGCTVCSYAGDDT
GIYLLPKRFASISFRSTKDAKTVDVKVTRNSNTAGYDLELISVVDVLATG
SVKVKAGEFDVEKDASFPLIHVIRFTTPRVTINPE
>S0919 putative regulator of cell division encoded by prophage CP-933O
MLKIDAIAFFGSKTKLANAAGVRLASIAAWGELVPEGRAMRLQEASGGEL
QYDPKVYDEYRKAKRPGKVIHENQA
>S2871 hypothetical protein
MSLLCAFFRNVDIFRLWLRNEHRHVTLHLIRGSLLMNALTAVQNNAVDSG
QDYSGFTLIPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRF
RVGKILDDLCANQLQPLLLKTLLNRAEGALLINAVGIDDVAQADEMVKLA
TAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELHNDGTY
VEEITDYVLMMKIDDWEHLDHYFRHPLARRPMRFAAPPSKNVSKDVFHPV
FDVDQQGRPVMRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVG
KFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYATHHYQTHQ
>S1428 IS91 orf
MLPRFADIFQQGNRWLNWLEKQPEGAVRPVVIESVTKIMACGTTLMGYTQ
WCCSSPDCCHTKKVCFRCKSRSCPHCGVKAGAQWIQYLLSLVPYCPWQHI
VFTLPCQYWSLVFHNRWLLAEMSRIAADVILEICHQTDVEPGIFTVIHTW
GRDQQWHPHIHLSTTAGGVTSGHTWKNLHFYARKVMSMWRYRITRLLSRN
TLTW
>S3200 hypothetical protein
MKTLSQNTTSSACAPETDLQQLVATLVPDEQRISFWPQHFGLIPQWVTLE
PRVFGWMDRLCEDYCGGIWNLYTLNNGGAFMAPEPDDDDDETWVLFNAMN
GNRAEMSPEAAGIAACLMTYSHHACRTECYAMTVHYYRLRDYALQHPECS
AIMRIID
>S1504 hypothetical protein
MAGYLSWLFPRCKISPKLNGTAPHFGDEMFALVLFVCYLDGGCEDIVVDV
YNTEQQCLYSMSDQRIRHGSCFPIEDFIDGFWQPAQEYGDF
>S0757 putative membrane protein precursor
MRKLYAAILSAAICLAVSGAPAWASEHRSTLSAGYLHASTNAPGSDDLNG
INVKYRYEFTDTLGLVTSFSYANAEDEQKTHYSDTRWHEDSVRNRWFSVM
AGLSVRVNEWFSAYAMAGVAYSRVSTFSGDYLRVTDNKGKTHDVLTGSDD
NRHSNTSLAWGAGVQFNPTESVAIDLAYEGSGSGDWRTDGFIVGVGYKF
>S1245 hypothetical protein
MADMNRTTKGALLGAGVGLLTGNGVNGVLKGAAVGAGVSAVTEKGRDGKN
ARKGAKVGAAVGAVTGVLTGNGLEGAIKGAVIGGTGGAILGKMK
>S4474 hypothetical protein
MWAFARRALNVEDILCLPIQDCDKSHIWLLVKDDQRLE
>S1219 hypothetical bacteriophage protein
MPMKFDEILKQRDKYHADNMETMNITDYRAFLETGALIEKDHHGFVRCAL
SGEMLAVNPEQTDALIEFLKGIRD
>S4816 hypothetical protein
MVCNYRYKNRQCHCLSGGYMARSAKPRKRKPASQRSKLPRYVVKLHEDDF
FDEEDAEVLRFDSFDDAVECCADLNIPFFVDAGNKKLVFWFVRVDDEGYP
EIARCTEREFATILAGISAGGMYCPECGTVHWPDGVAPPF
>S2865 hypothetical protein
MEAVSGFMSRVITTQGEWERIKICYLHHKIIMFIAWFISIVIAYLRPFSS
VVRAGDS
>S0758 putative tail component encoded by cryptic prophage CP-933M
MAVKISGVLKDGAGKPVVNCAIELRARRTSPTVVAHVVATCVTDNNGAYV
IEAEPGYYEVALHCNGWQPTRVGDIDVAPTDAPGTLNAFLNAPKDGDLRP
EVMKRFEEMVAQAQQSAGAAAGNAQQTAQDVAAAATARDDAQRFAEKARQ
DATVTAEDRKATAEDVTSTGANAAAAGQSAQDAAGYARAAEQAKNDIDAA
LTGTLKMANHLSEIAAAGEKAQQKSRDNLGLKSAATMEAQSDIYDRTKGR
LAIPGAFGFGCAFLPEDVIRFDTKSDFLAWVRNALPGEYSVAGPYGIIIP
DTRFEGVLSIRWTDARPETTEPRYRAKSLTFYGINGPIYHTRYRYWPISR
LTG
>S1515 hypothetical protein
MKKFRWVVLVVVVLACLLLWAQVFNMMCDQDVQFFSGICAINQFIPW
>S0684 putative repressor protein
MKSANEINQWMTPKQITELDGMPGTIQGVHKRAKKEGWPKRSQEGRRGPG
VEYIPAIPAMQKIAIEEMNADTLKMFSLLINKLGENEVREIELNMAKYGL
SGLMQERPSISVDTLLATLGIDRQTLQTALALHKLPPETRQEILSMYGVH
KQEEPVAPLLEPQDAKKAV
>S0245 putative head assembly protein
MITFKPTRNIDLIEAVGNHPDIIAGSNNGDGYDYKPECRYFEVNVHGQFG
GIVYYQEIQPLTLDCHAMYLPEIRGFSKEIGLAFWRYILTNTTVQCVTSF
AARKFRHGQMYCAMIGLKRVGTIKKYFKGVDDVTFYSATREELIDFLNHG
R
>S0696 putative bacteriophage protein
MSGNIGANPAIIKLSWIEAAVDAHKTLNFEPSGRKRIGFDVADSGTDKCA
NVYRHGSVVFWADEWKAKEDELLKSCQRTYQAALEREADIVYDSIGVGAS
AGAKFSEINADRKSENAYARRVNYQRFNAGAGVHEPDDEYNGIPNKDFFA
NLKAQAWWLTVSEIRLTPLTTENSILWMS
>S4838 hypothetical protein
MTKLMQFVQRCYYMTNKKMYFILILVFTLLQVCFFALWKARDGSTTSLEC
TSTLTRNAKTDHSLYYSANLSVILKKDGSGSFTIVGLTDEDTPRKFSHSY
FFTYKIDSNGRISGNAKAKVSGLENQIKDENFRLNFLDASLTGKGNARLS
KFNNVYIFSIPGLIINTCAPI
>S1216 hypothetical bacteriophage protein
MTEHALKVAIRTIDRHAGEGYAKAHPELISAFMTTTAANFATLTEREIAE
AEQVTTINVKTGEVES
>S0700 putative bacteriophage protein
MAKPDWEAIESAYRAGVLSLRDIGEKYGVTEGAIRKRAKKLDWARSGGTQ
VCKNGTQKRKVRTSRKPAITGLTQKSTQLKTESTPDTKPIRGMRTDPPTN
PFQPGNQQALKHGGYARRLLLKDEVIEDAKALTLEDELFRLRANNLVAAE
NIGRWLVSLEDANGDQERKMLIENISAAEKAMMRNTVRIESIVGTLATVG
KIFADTAYRKAATDKVSLEADRLRRDAGIDDGNGERDLNDFYSDIQTDAE
SGFT
>S1246 hypothetical protein
MRKEFVDDNRVKVNSDGNFVNDLSGRRGIYQAGIKASFSSTLSGHFGVGY
SHGAGVESPWNAVAGVNWSF
>S1720 hypothetical protein
MKITLSKRIGLLAFLLPCALALSTTVHAETNKLVIESGDSAQSRQHAAME
KEQWNDTRNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTL
RCLDRRTGRVITP
>S2331 hypothetical bacteriophage protein
MTFFFDDRDQAVPYTATADDVAPTGQQIWQELQSGKWGEIAPFTVTPEML
EAAREARRQEIEAWRTEQEAKPFTFEWNGRIWNAGPDSLGRLSPVVMLAK
SVTAQTHMAWSDADNQQVKLSMPELEELAESQKTALLTEAESVIRQPGRA
VRLNRETDESGEARGRASVFPESVRSVVYAM
>S0747 putative tail component of prophage CP-933K
MNRHTQIRQSVLARLREQCGDSATFFDGLPAFIDAQELPAVAVWLSDAQY
TGKMTDEDDWQAVLHIAVFIRAQAPDSELDMWMESTIFPALNDVPALSGL
IDTLNPLGFNYQRDNEMATWAMAEITYQITYTN
>S1986 putative Q antiterminator encoded by prophage CP-933P
MNNQYLQFVREQLIIATADLSGATKGQLEAWQENAMFDTGRYRRKKIRYR
DEVTGKMITRDNPPIPGKQSLAKGASIPLVSQVAFSTSSWRRAVLSLEEH
YKAWLLWCYSGSICWEYQITITHWAWEEFKAHSGTRKIAEKTQERLKKLI
WLAAQAVKAELFGGEGYEYQELALLAGVTTKNWSKTFTGHWVVMKHIFHR
LDSEALLFVMRTRSEQKAAFSKQSIAKVD
>S0911 hypothetical bacteriophage protein
MAHIQLVKQTSSGLLLPATPESCDFLHQIKIGEWIHADFKRVRNYAFHKR
FFKLLQLGFDYWTPVGGAITPRERKLVSGFVDYLCESVGREHTPALSEAA
EQYLNTVATRRTRDTALLKSFEA
>S0023 hypothetical protein
MCRHSLRSDGAGFYQLAGCEYSFSAIKIAAGGQFLPVICSMAMKSHFFLI
SVLNRRLTLTAVQGILGRFSLF
>S2834 hypothetical protein
MSCRFFILLVVKLKRFSHYRSHQIWLALRYSSSKKTPLPAISHKKDSLTK
SDKIMRFPSHILTSGTVC
>S3255 hypothetical protein
MLVTFLLRKRKEKKAKVRQYANSNENDYQFDVVLILLCADFVTCVLEIHS
G
>S3871 hypothetical protein
MVYLRPYKETNLSLPDVNYKSLRRLPNLLIDPTTLDEWDKEPPLTDLTTD
YLYEGAQAWYPHYSWHSDGRNILYAGEVVQNPPGTPPVDVASFKAWGDFA
ADKHSLYFEGKRTDDNGGGNSLDIKTLHQVEFRPPWDPDLLGLILRDANF
LYINGHRLADPESFRVLAQKSWDQRGKFSTTFNPCIAVPFGPWDTLARTR
TKILLNGEQLDADPDTFSVVRWMPGSLLTWRDKNGLQRKVLDKENLAWDE
DLTKHCLDFSLLEKKVFWRKGPACKQEELPGLDPEQFQPISDAVAQHQDS
LYTIIETESGNRKLEIVKLDDPNLIINKRFNAGKRHGYLLTRAEG
>S1655 hypothetical protein
MHATTVKNKITQRDNYKEIMSVIVVVLLLTLTLIAIFSAIDQLSISEMGR
IARDLTHFIINSLQDWK
>S3191 hypothetical protein
MQQLENRCDLLLIQHQKWMTSVTRLIVAHGMGSPHLHGYHRLTLAHFFLP
EKGSVISVAPQGLYQVVNPGTPPFIPAIQEGLMTSIQTHEIMLLTHFNLG
GVLLSELHRLGENRLANRLNSLLRRFDDRDLYHTLIWLCWYDLMCAHSMQ
PWTEELKHKSHAELENWAVARKREKRELELMIDEYLLYAC
>S2791 putative bacteriophage protein
MYGGSPADIAAAIWRKAPPGIDMNGNTTFTVADKEYDPPYPEYVISWQTL
KPVSLHVSVTLKKSDYLPSDITRQVQQSVLDAFNGTDGGLRARVASVVSA
GRYYAGIYKTDPEHIDILGLTVSRDGSSWTTAVTFGIDEIPVLDVSDISV
KLQEA
>S1829 hypothetical protein
MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIRHALIEEGYTDWAESL
VEYAWKKWLADENFAHQEVSSMQKLATDPGEIPFCSQFARSDDHARIGCC
EDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSR
ISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLVQIASFGANARIANSG
DNVHIIASGENSTVVSTGVVDSIILGLGGSAALAYHDGERVRFAVAIEGE
NNIRTGVRYRLNEQHQFVEC
>S3114 putative inner membrane protein
MARSHFSSQALVLIVISIAINMIGGQLASMVKLPIFLDSIGTLISAVLLG
PVIGMLTGLLTNLLWGLLTDPIAAAFAPVAMVIGLVSGWLARAGWFRTLP
KVVVSGVIITLAVTVVAVPLRTALFGGVTGSGADLFVAWMHSMGQNLVES
VAITVIGANLVDKILTAVIVWLLLRQLPIRTTRHFPAMAAVR
>S1580 hypothetical protein
MSRALFAVVLAFPLIALANPHYRPDVEVNVPPEVFSSGGQSAQPCTQCCV
YQDQNYSEGAVIKAEGILLQCQRDDKTLSTNPLLWRRVKP
>S4817 hypothetical protein
MNNFVNNTQSTPSAITGDMVNSNAQLAMAVSELGRKVDALTAETKKPLSV
NITGEATVRPDAPSFFSFNQSANDQFTQDMLLSSSYPEEE
>S4808 hypothetical protein
MPARKVCQTFFRNALASTHQYRQNAIIDSAAALVGGASLSLTSIGRHLPG
PARVKDKIKRVDCLLGNKRLHNDIPLIFKNITSMMTNKLSWCVIAVDWSG
YPFQEYHVLRASLLCDGRSIPLMSQVFPSKKNNEAVEIAFLDALSGAISP
RTRVVIVTDAGFQSAWFRHIKSQGWDFIGRIRGVVKFRLDSDKDKWLDIK
MCRGSSEAKYLGTGTLARKKRSQCEGHFYLYKHSPKGRKSRRARGRPGLP
TTEKEQKAAGREPWLIFSNTAEFNAKKIMKLYSRRMQIEQNFRDEKSERF
GFGLRASRSRRGERFLVLSLLVTLASIVLWLLGYHFENKGFHLKYQANSL
KKRRVLSFLTLAENVLRFNPELLRRAQPEKILCRLASTYRSMVLAY
>S1723 hypothetical protein
MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLS
IVPNDQVDQPDSQVVGHCANDTHKILYNRTTSGNVSAPAQSTQDGAPAEP
Q
>S2581 hypothetical protein
MIYLWMFLALCIVCVSGYIGQVLNVVSAVSSFFGMVILAALIYYFTMWLT
GGNELVTGIFMFLAPACGLMIRFMVGYGRR
>S3199 hypothetical protein
MQLVSRFGYANQIRRDRPLTHEELMHHVPGIFGEEKHTSRSQNYTYIPTI
TVLESLQREGFQPFFACQTRVRDPGRRGYTKHMLRLRRAGEINGEHVPEI
ILLNSHDGTSSYQMLPGYFRFVCQNGCVCGQSLGEVRVPHRGNVVEKVIE
GAYEVVGVFDRIEEKRDAMQSLVLPPPARQALAQAALTYRYGDEHQPVTT
ADILTPRRREDYGKDLWSTYQTIQENMLKGGISGRSAKGKRIHTRAIHNI
DTDIKLNRALWVMAETLLESLR
>S1075 agp, periplasmic glucose-1-phosphatase
MNKTLIAATVAGIVLLASNAQAQTVPEGYQLQQVLMMSRHNLRAPLANNG
SVLEQSTPNKWPEWDVPGGQLTTKGGVLEVYMGHYMREWLAEQGIVKSGE
CPPPDTVYAYANSLQRTVATAQFFITGAFPGCDIPVHHQEKMGTMDPTFN
PMITDDSAAFSEQAVAAMEKELSKLQLTDSYQLLEKIVNYKDSPACKEKQ
QCSLVDGKNTFSAKYQQEPGVSGPLKVGNSLVDAFTLQYYEGFPMDQVAW
GEIKSDQQWKVLSKLKNGYQDSLFTSPEVARNVAKPLVSYIDKALVTDRA
SAPKITVLVGHDSNIASLLTALDFKPYQLHDQNERTPIGGKIVFQRWHDS
KANRDLMKIEYVYQSAEQLRNADALTLQAPAQRVTLELSGCPIDANGFCP
MDKFDSVLNEAVK
>S2464 ais, protein induced by aluminum
MKSKKYFIILLALAAIAGLGTHAAWSSNGLPRIDNKTLARLAQQHPVVVL
FRHAERCDRSTNQCLSDKTGITVKGTQDARELGNAFSADISDFDLYSSNT
VRTIQSATWFSAGKKLTVDKRLLQCGNEIYSAIKDLQSKAPDKNIVIFTH
NHCLTYIAKNKRDAIFKPDYLDGLVMHVEKGKVYLDGEFVNH
>S1048 appA, phosphoanhydride phosphorylase; pH 2.5 acid phosphatase
MKAILIPFLSLLIPLTPQSAFAQSEPELKLESMVIVSRHGVRAPTKATQL
MQDVTPDAWPTWPVKLGWLTPRGGELIAYLGHYQRQRLVADGLLAKKGCP
QSAQVAIIADVDERTRKTGEAFTAGLAPDCAITVHTQADTSSPDPLFNPL
KTGVCQLDNANVTDAILCRAGGSIADFTGHRQTVFRELERVLNFPQSNLC
LNREKQDESCSLTQALPSELKVSADNVSLTGAVSLASMLTEIFLLQQAQG
MPEPGWGRITDSHQWNTLLSLHNAQFYLLQRTPEVARSRATPLLDLIMAA
LTPHPPQKQAYGVTLPTSVLFIAGHDTNLANLGGALELNWTLPGQPDNTP
PGGELVFERWRRLSDNSQWIQVSLVFQTLQQMRDKTPLSLNTPPGEVKLT
LAGCEERNAQGMCSLAGFTQIVNEARIPACSL
>S0334 aroM, protein of aro operon
MSASLAILTIGIVPMQEVLPLLTEYIDEDNISHHSLLGKLSREEVMAEYA
PEAGEDTILTLLNDNQLAHVSRRKVERDLQGVVEVLDNQGYDVILLMSTA
NISSMTARNTIFLEPSRILPPLVSSIVEDHQVGVIVPVEEMLPVQAQKWQ
ILQKSPVFSLGNPIHDSEQKIIDAGKELLAKGADVIMLDCLGFHQRHRDL
LQKQLDVPVLLSNVLIARLAAELLV
>S1750 asr, acid shock protein
MKKQIEGMTMKKVLALVVAAAMGLSSAAFAAETATTPAPTATTTKAAPAK
TTHHKKQHKAAPAQKAQAAKKHHKNTKAEQKAPEQKAQAAKKHAGKHGHQ
QPAKPAAQPAA
>S0033 caiF, transcriptional regulator of cai operon
MKLHTIDISTILIWPCLIASKRVIARLLICETGVRMCEGYVEKPLYLLIA
EWMMAENRWVIAREISIHFDIEHSKAVNTLTYILSEVTEISCEVKMIPNK
LEGRGCQCQRLVKVVDIDEQIYARLRNNSREKLVGVRKTPRIPAVPLTEL
NREQKWQMMLSKSMRR
>S0681 crcA, hypothetical protein
MNVSKYVAIFSFVFIQLISVGKVFANADERMTTFRENIAQTWQQPEHYDL
YIPAITWHARFAYDKEKTDRYNERPWGGGFGLSRWDEKGNWHGLYAMAFK
DSWNKWEPIAGYGWESTWRPLADENFHLGLGFTAGVTARDNWNYIPLPVL
LPLASVGYGPVTFQMTYIPGTYNNGNVYFAWMRFQF
>S0309 crl, transcriptional regulator of cryptic csgA gene for curli surface fibers
MTLPSGHPKSRLIKKFTALGPYIREGKCEDNRFFFDCLAVCVNVKPAPEV
REFWGWWMELEAQESRFTYSYQFGLFDKAGDWKSVPVKDTEVVERLEHTL
REFHEKLRELLTTLNLKLEPANDFRDEPVKLTA
>S1108 csgB, minor curlin subunit precursor
MKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIG
QAGTNNSAQLRQGGSKLLAVVAQEGSSNRAKIDQTGDYNLAYIDQAGSAN
DASISQGAYGNTAMIIQKGSGNKANITQYGTQKTAVVVQRQSQMAIRVTQ
R
>S1113 csgC, putative curli production protein
MNALLLLAALSSQITFNTTQQGDMYTIIPEVTLTQSCLCRVQILSLREGS
SGQSQTKQEKTLSLPANQPIALTKLSLNISPEDRVKIVVTVSDGQALHLS
QQWTPSSEKS
>S1105 csgF, Curli production assembly/transport component CsgF precursor
MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNS
YKDPSYNDDFGIETPSALDNFTQAIQSQILGGLLSNINTGKPGRMVTNDY
IVDIANRDGQLQLNVTDRKTGQTSTIQVSGLQNNSTDF
>S1145 dinI, damage-inducible protein I
MRIEVTIAKTSPLPAGAIDALAGELSRRIQYAFPDNEGHVSVRYAAANNL
SVIGATKEDKQRISEILQETWESADDWFVSE
>S2532 div, cell division protein
MIQPISGPPPGQPPGQGDNLPSGAGNQPLSSQQRTSLESLMTKVTSLTQQ
QRAELWAGIRHDIGLSGDSPLLSRHFPAAEHNLAQRLLAAQKSHSARQLL
AQLGEYLRLGNHRQAVTDYIRHNFGQTPLNQLSPEQLKTILTLLQEGKMV
IPQPQQREATDRPLLPAEHNALKQLVTKLAAATGEPSKQIWQSMLELSGV
KDGELIPAKLFNHLVTWLQARQTLSQQNTPTLESLQMALKQPLDASELAA
LSAYIQQKYGLSAQSSLSSAQAEDILNQLYQRRVKGIDPRDMQPLLNPFP
PMMDTLQNMATRPALWILLVAIILMLVWLVR
>S4663 dnaT, primosomal protein I
MSSRVLTPDVVGIDALVHDHQTVLAKAEGGVVAVFANNAPAFYAVTPARL
AELLALEEKLARPGSDVALDDQLYQEPQAAPVAVPMGKFAMYPDWQPDAD
FIRLAALWGVALREPVTTEELASFIAYWQAEGKVFHHVQWQQKLARSLQI
GRASNGGLPKRDVNTVSEPDSQIPPGFRG
>S2091 dsrB, hypothetical protein
MKVNDRVTVKTDGGPRRPGVVLAVEEFSEGTMYLVALEDYPLGIWFFNEA
GHQDGIFVEKAE
>S4456 fimH, minor fimbrial subunit, D-mannose specific adhesin
MKRAITLFAVLLMGWSVNAWSFACKTANGTAIPIGGGSANVYVNLAPVVN
VGQNLVVDLSTQIFCHNDYPETITDYVTLQRGSAYGGVLSNFSGTVKYSG
SSYPFPTTSETPRVVYNSRTDKPWPVALYLTPVSSAGGVAIKAGSLIAVL
ILRQTNNYNSDDFQFVWNIYANNDVVVPTGGCDVSARDVTVTLPDYPGSV
PIPLTVYCAKSQNLGYYLSGTTADAGNSIFTNTASFSPAQGVGVQLTRNG
TIIPANNTVSLGAVGTSAVSLGLTANYARTGGQVTAGNVQSIIGVTFVYQ
>S2031 flhC, flagellar transcriptional activator FlhC
MSEKSIVQEARDIQLAMELITLGARLQMLESETQLSRGRLIKLYKELRGS
PPPKGMLPFSTDWFMTWEQNVHASMFCNAWQFLLKTGLCNGVDAVIKAYR
LYLEQCPQAEEGPLLALTRAWTLVRFVESGLLQLSSCNCCGGNFITHAHQ
PVGSFACSLCQPPSRAVKRRKLSQNPADIIPQLLDEQRVQAV
>S2060 fliZ, putative regulator of FliA
MPHFNFDYQEFLMMVQHLKRRPLSRYLKDFKHSQTHCAHCRKLLDRITLV
RDGKIVNKIEISRLDTLLDEKGWQTEQQSWAALCRFCGDLHCKTQSDFFD
IIGFKKFLFEQTEMSPGTVREYVVRLRRLGNHLHEQNISLDQLQDGFLDE
ILAPWLPTTSTNNYRIALRKYQHYQRQTCTGLVQKSSSLPASDIY
>S2609 flxA, hypothetical protein
MATISSTSIPSIQTQSSNRASQGSDVASQIARISQQIIKLTQQIKEIVDT
SGSAEDKQKQAELIQQQITLLETQLAQLQKQQAEKAQEKEQRLSLNVSLL
NPVEKTTHIDIYI
>S0076 fruL, fruR leader peptide
MRNLQPNMSRWAFFAKSVGTWNKSSCRS
>S0017 gef, membrane toxin
MLNTCRVPLTDRKVKEKRAMKQHKAMIVALIVICITAVVAALVTRKDLCE
VHIRTGQTEVAVFTAYESE
>S3294 glgS, glycogen biosynthesis protein GlgS
MEHSLNSLNNFDFLARSFARMHAEGRPVDILAVTGNMDEEHRTWFCARYA
WYCQQMMQARELELEH
>S4803 gtrII, putative glucosyl transferase II
MIKINLFKNANLLAFISCFAISIYCYWGWLYDGTLNIDGEFTNNFYQTIT
LGRWFHTFLRHYFLPEPFSLYITPLIALSFIIISAFIICRSLKLESYELL
IGMLVFITFPQISYQLEFLNQADTVGIAFLLAAISAIIFHSQKNRIVIFS
GIVLSILSMAIYQTFVTYIIAFVIGLQINSIIRNEKNIRESFYSSCLSLS
LIALSTLIYLLLTKAIKHYFSLESNEYISNYIQNASDIKWLVKSAIDNIY
NFYNNPPTGLNLYKWLLIPLLILMFTLTYKLKTRSIYLISSIIFIYILPV
IFIVVVGSGAPPRLFVLMPIVAVILFSCLSNFRSIKYLNCMFFLFIIFNG
VSTSKNLFLNDTLARQKDISLAKEISYTSQTKGISLNGKYIYIYGSNDSG
NMLSMSADTFGKSFFWWDGGNYFRMVAFMNYYGICNCKPANKEQIEKIYP
IVKSLPSWPNPDSIAEINGLVIIKLSEKKGWLPFNI
>S4223 hdeA, hypothetical protein
MKKVLGVILGGLLLLPVVSNAADAQKAADNKKPVNSWTCEDFLAVDESFQ
PTAVGFAEALNNKDKPEDAVLDVQGIATVTPAIVQACTQDKQANFKDKVK
GEWDKIKKDM
>S4224 hdeB, hypothetical protein
MGYKMNISSLRKAFIFMGAVAALSLVNAQSALAANESAKDMTCQEFIDLN
PKAMTPVAWWMLHEETVYKGGDTVTLNETDLTQIPKVIEYCKKNPQKNLY
TFKNQASNDLPN
>S0412 hha, hemolysin expression modulating protein
MSEKPLTKTDYLMRLRRCQTIDTLERVIEKNKYELSDNELAVFYSAADHR
LAELTMNKLYDKIPSSVWKFIR
>S1918 holE, DNA polymerase III, theta subunit
MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRS
WFRERLIAHRLASVNLSRLPYEPKLK
>S0011 htgA, positive regulator for sigma 32 heat shock promoters
MLPLTALTPFNAAPTGPPSPAPRSKPCPSTLIAAWVRKMRVSWLESKCDT
PFANNLSFISSGSSSSSSFTLASTACRNSCLCSSSIFFQVLRRNCSSNCC
SISNVDISLSAFSFNRFETSSKMARYNLPCPRSLLAILSPPKCCNSPAIS
CQLRRCCSGCPSIDLNSSLRISMLERRVLPFSLWVSNRAKFANCSSLQC
>S1045 hyaF, Hydrogenase-1 operon protein hyaF
MSETFFHLLGPGTQPNDDSFSMNPLPITCQVNDEPSMAALEQCAHSPQVI
ALLNELQHQLSERQPPLGEVLAVDLLNLNADDRHFINTLLGEGEVSVRIQ
QADDSKSEIQEAIFCGLWRVRRRRGEKLLEDKLEAGCAPLALWQAATQNL
LPTDSLLPPPIDGLMNGLPLAHELLAHVRNPDAQPHSINLTQLPISEADR
LFLSRLCGPGNIQIRTIGYGESYINATGLRHVWHLRCTDTLKGPLLESYE
ICPIPEVVLAAPEDLVDSAQRLSEVCQWLAEAAPT
>S3240 hybE, hydrogenase-2 component
MTEEIAGFQTSPKAQVQAAFEEIARRSMHDLSFLHPSMPVYVSDFTLFEG
QWTGCVITPWMLSAVIFPGPDQLWPLRKVSEKIGLQLPYGTMTFTVGELD
GVSQYLSCSLMSPLSHSMSIEEGQRLTDDCARMILSLPVTNPDVPHAGRR
ALLFGRRRGENA
>S2934 hycA, formate hydrogenlyase regulatory protein
MTIWEISEKADYIAQRHRRLQDQWHIYCNSLVQGITLSKARLHHAMSCAP
DKELCFVLFEHFRIYVTLADGFNSHTIEYYVETKDGEDKQRIAQAQLSID
GMIDGKVNIRDREQVLEHYLEKIAGVYDSLYTAIENNVPVNLSQLVKGQS
PAA
>S2929 hycF, probable iron-sulfur protein of hydrogenase 3 (part of FHL complex)
MFTFIKKVIKTGTATSSYPLEPIAQGQSANGAVHCVDCV
>S3918 ilvL, ilvGEDA operon leader peptide
MTALLRVISLVVISVVVIIIPPCGAALGRGKA
>S2452 inaA, pH-inducible stress response protein
MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMT
HHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRA
LLVTEDMAGFISIADWYARHAVSPYSDEVRQAMLKAVALAFKKMHSVNRQ
HGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEP
IPKADWEQVKAYYYAM
>S3979 ivbL, ilvB operon leader peptide
MTTSMFNAKLLPTAPSAAVVVVRVVVVVGNAP
>S3762 kdgT, 2-keto-3-deoxy-D-gluconate transport protein
MQIKRSIEKIPGGMMLVPLFLGALCHTFSPGAGKYFGSFTNGMITGTVPI
LAVWFFCMGASIKLSATGTVLRKSGTLVVTKIAVAWVVAAIASRIIPEHG
VEVGFFAGLSTLALVAAMDMTNGGLYASIMQQYGTKEEAGAFVLMSLESG
PLMTMIILGTAGIASFEPHVFVGAVLPFLVGFALGNLDPELREFFSKAVQ
TLIPFFAFALGNTIDLTVIAQTGLLGILLGVAVIIVTGIPLIIADKLIGG
GDGTAGIAASSSAGAAVATPVLIAEMVPAFKPMAPAATSLVATAVIVTSI
LVPILTSIWSRKVKARAAKIEILGTVK
>S0072 leuL, leu operon leader peptide
MTHIVRFIGLLLLNASSLRGRRVSGIQH
>S3563 malM, periplasmic protein of mal regulon
MKMNKSLIALCLSAGLLASAPGISLADVNYVPQNTSDAPAIPSAALQQLT
WTPVDQSKTQTTQLATGGQQLNVPGISGPVAAYSVPANIGELTLTLTSEV
NKQTSVFAPNVLILDQNMTPSAFFPSSYFTYQELGVMSADRLEGVMRLTP
ALGQQKLYVLVFTTEKDLQQTTQLLDPAKAYAKGVGNSIPDIPDPVARHT
SDGLLKLKVKTNSSSSVLVGPLFGSSAPAPVTVGNTAAPAVAAPAPVPVK
KSEPMLNDTESYFNTAIKNAVAKGDVDKALKLLDEAERLGSTSARSTFIS
SVKGKG
>S1689 marB, multiple antibiotic resistance protein
MKPLSSAIAAALILFSAQSVAEQTTQPVVTSCANVVVVPPSQEQPPFDLN
HMGTGSDKSDALGVPYYNQHAM
>S1370 osmB, osmotically inducible lipoprotein
MFVTSKKMTAAVLAITLAMSLSACSNWSKRDRNTAIGAGAGALGGAVLTD
GSTLGTLGGAAVGGVIGHQVGK
>S1604 osmE, activator of ntrL gene
MNKNMAGILSAAAVLTMLAGCTAYDRTKDQFVQPVVKDVKKGMSRAQVAQ
IAGKPSSEVSMIHARGTCQTYILGQRDGKAETYFVALDDTGHVINSGYQT
CAEYDTDPQAAK
>S4807 pheL, leader peptide of chorismate mutase-P-prephenate dehydratase
MKHIPFFFAFFFTFP
>S4806 pheM, phenylalanyl-tRNA synthetase (pheST) operon leader peptide
MNAAIFRFFFYFST
>S3031 ppdC, prepilin peptidase dependent protein C
MSASLKNQQGFSLPEVMVAMVLMVLMVLMVLMVMIVTALSGIQRTSMNSL
ASRNQYQQLWRHGWQQTQLRAISPPANWQVNRMQTSQAGCVSISVTLVSP
GGREGEMTRLHCPNCQ
>S0251 psiF, phosphate starvation-inducible protein
MKRDGAMKITLLVTLLFGLVFLTTVGAAERTLTPQQQRMTSCNQQATAQA
LKGDARKTYMSDCLKNSKSAPGEKSLTPQQQKMRECNNQATQQALKGDDR
NKFMSACLKKAA
>S1392 pspB, phage shock protein
MSALFLAIPLTIFVLFVLPIWLWLHYSNRSGRSELSQSEQQRLAQLVDEA
KRMRERIQALESILDAEHPNWRDR
>S1394 pspD, phage shock protein
MNTRWQQAGQKVKLGFKLAGKLVLLTALRYGPAGVAGWAIKSVARRPLKM
LLAVALEPLLSRAANKLAQRYKR
>S4505 pyrL, pyrBI operon leader peptide
MVQCVRHFVLPRLKKDAGLPFFFPLITHSQPLNRGAFFCLGVRR
>S0189 rcsF, exopolysaccharide synthesis regulator
MRALPICLVALMLSGCSMLSRSPVEPVQSTAPQPKAEPAKPKAPRATPVR
IYTNAEELVGKPFRDLGEVSGDSCQASNQDSPPSIPTARKQMQINASKMK
ANAVLLHSCEVTSGTPGCYRQAVCIGSALNITAK
>S1668 relF, prophage maintenance protein
MKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPE
E
>S1667 rem, hypothetical protein
MNIEELRKIFCEDGLYAVCVENGNIVSHYRIVCLQKNGAALINFVDARVT
DGFILRDGEFVTSLQALKEIGIKAGFSAFSEE
>S2720 repC, putative replication protein C
MKKPKHDLTHVRHDPAHCLAPGLFRSLKRGDRKRCKLDVTYTFGEDESMR
FVGFEPLGADDMRLLQGIVALGGPNGILLTPEPTSETGRQLRLFLEPRFE
AIEQDGLVVRESLTKLLSETGMTDSGDNIKALKASLLRMSNVTILVTKGR
RQAAFHLMSHAFDETDGRLWVALNPRIAEAILGHRPYARIDMAEVRVLQT
DPARLMHQRLCGWIDPGKSGRVELDTLCGYVWPDEANAEAMKKRRQTARK
ALAELAAVGWVVNEYAKGKWEIKRPGPTATAPVYRRNVPLLPS
>S3904 rhoL, rho operon leader peptide
MRSEQISGSSLNPSCRFSSAYSPVTRQRKDMSR
>S0218 sat2, putative cytoplasmic protein
MLSPEQDLNRHADPFPFNEEPSSQDLDSLRDLVTEAETLQDMVTGGLSID
AILNVLDATGEDETIWPVEKTPPDILHLLSPEYAPETAHNTVLPDLTRKE
HRIIGIDSHYRINPAQHGEIYHDKQ
>S4833 shiB, hypothetical protein
MDENALRFASYWRNSLADAESGKGCFERKDAQNFTHWHGIAAGRLDEAIV
NKFFKGEKDDVETVNVILRPKVYFRLLQHGKDRSAGTPDIVTPIVTPALL
SREGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTTHTDPTHVMW
TRP
>S4831 shiC, hypothetical protein
MMLFSIIIMLLSPKITHANEKVICQNSASFVTKEFICGTILVSLILSSLF
CYISISPSIFMHTYGLSTFSYSIIFSFSVLFFIIGNQLSKIEKLSNLWFL
LPNSQFYRITNKWFKNRICRIYYRFYSNNKRWYNQFPVSNRKRQRCWSRN
YFRSWFTNNFITVNSRNRHCRK
>S4830 shiD, colV-immunity protein
MKRWFKYPVTFDLHYTVSHRLIFAPARFLFLLVVTILNAYDHSILAPVWL
CFNSAFRADKQLLAGILLVLMSVPLFTSADFHIPSNRAYSLSLDYSIVRL
KFFLNNNRSTAGRSSPFFITHCTDNHNLISRN
>S4829 shiE, hypothetical protein
MPCFTAMRAEIALMSGSAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYT
KSVSWQHYPWSETKAAYRCFSNDKVSANKIMTPHKENITERAQQFKRVLV
LQDTTELNYSGQKEKQGVGPKKHKDERNLFLHPQLVISESGVCLGVYDDY
QWFRDELKTSKNTRQEICNDLLHKKHVSEKETWRWVEGYNKATELARQCP
DTHVLSISDREGDFYDLFERAAQTPGIKADWLVRMKFNNRATLNINGKRD
HRLLHERIMEITPQQLVEFTIPDGRGQQARTLPDMLSSDAGPLSYPLIYS
SKVMIISLFDSTVMPDKYRHNGK
>S2741 sseB, enhancer of serine sensitivity
MSETKNELEDLLEKAATEPAHRPAFFRTLLESTVWVPGTAAQGEAVVEDS
ALDLQHWEKEDGTSVIPFFTSLEALQQAVEDEQAFVVMPVRTLFEMTLGE
TLFLNAKLPTGKEFMPREISLLIGEEGNPLSSQEILEGGESLILSEVAEP
PAQMIDSLTTLFKTIKPVKRAFICSIKENEEAQPNLLIGIEADGDIEEII
QATGSVATDTLPGDEPIDICQVKKGEKGISHFITEHIAPFYERRWGGFLR
DFKQNRII
>S3001 syd, Syd protein
MDDLTAQALKDFTARYCDAWHEEHKSWPLSEELYGVPSPCIISTTEDAVY
WQPQPFTGEQNVNAVERAFDIVIQPTIHTFYTTQFAGDMHAQFGDIKLTL
LQTWSEDDFRRVQENLIGHLVTQKRLKLPPTLFIATLEEELEVISVCNLS
GEVCKETLGTRKRTNLASNLAEFLNQLKPLL
>S3371 tdcR, threonine dehydratase operon activator protein
MTGITIFYGDNIIRYVVNTKKGLRPYFKQLPDNYQAKFELNLMSKFSNFI
INKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFF
SSDRTSFCECNRFP
>S0001 thrL, thr operon leader peptide
MKRISTTITTTITITTGNGAG
>S4016 tnaL, tryptophanase leader peptide
MNILHVCVTSKWFNIDNKIVDHRP
>S2718 trbJ, putative mating pair formation protein
MKSKILAAKKTALAVALATGFITTTTAPVQAGIPVIDGGNLAQNIMTAIE
SVAQTLKQIEQYQTQLQQYENQLQNTMAPAAYIWDQAQTTINRLIAAQNT
LAYYENQLGSLDRYLAKFQDVAYYRSSPCFNGSGGCTPAEKAAMEENRRL
ASESQKKANDALFQTVADQQKALKDDARTLERLQGAAQGATGQLQAIGYA
NQLASQQANQLLQIRTMLTAQHNAEAARIAAELDAEARGDARAEQMRTWT
FRPSPADNY
>S4805 trpL, trp operon leader peptide
MKAIFVLKGWWRTS
>S1764 tus, DNA-binding protein
MARYDLVDRLNTTFRQMEQELATFAAHLEQHKLLVARVFSLPEVKKEDEH
NPLNRIEVKQHLGNDAQSLALRHFRHLFIQQQSENRSSKAAVRLPGVLCY
QVDNLSQAALVSHIQHINKLKTTFEHIVTVESELPTAARFEWVHRHLPGL
ITLNAYRTLTVLHDPATLRFGWANKHIIKNLHRDEVLAQLEKSLKSPRSV
APWTREEWQRKLEREYQDIAALPQNAKLKIKRPVKVQPIARVWYKGDQKQ
VQHACPTPLIALINRDNGAGVPDVGELLNYDADNVQHRYKPQAQPLRLII
PRLHLYVAD
>S1769 uidC, membrane-associated protein
MAVICLTAASGLTSAYAAQLADDEAGLRIRLKNELRRADKPSAGAGRDIY
AWVQGGLLDFNSGYYSNIVGVEGGAYYVYKLGARADMSTRWYLDGDKSFG
FALGAVKIKPSENSLLKLGRFGTDYSYGSLPYRIPLMVGSSQRTLPTVSE
GALGYWALTPNIDLWGMWRSRVFLWTDSTTGIRDEGVYNSQTGKYDKHRA
RSFLAASWYDDTSRYSLGASVQKDVSNQIQSILEKSIPLDPNYTLKGELL
GFYAQLEGLSRNTSQPNETALVSGQLTWNAPWGSVFGSGGYLRHAMNGAV
VDTDIGYPFSLSLDRNREGMQSWQLGANYRVTPQFTLTFAPIVTRGYESS
KRDVRIEGAGILGGMNYRVSEGPLQGMNFFLAADKGREKRDGSTLGDRLN
YWDVKMSIQYDFMLK
>S2229 wcaM, hypothetical protein
MPFKTLSRRTFLTASSALAFLHTPFARALPARQSVNINDYNPHDWIASFK
QAFSEGQTVVVPAGFVCDNINTGIFIPPGKTLHILGSLRGNGRGRFVLQD
GSQVTGGEGGGMHNITLDVRGSDCTIKGLAMSGFGPVMQIYIGGKNKRVM
RNLTIDNLTVSHANYAILRQGFHNQIIGANITNCKFSDLQGDAIEWNVAI
NDSDILISDHVIERINCTNGKINWGIGIGLAGSTYDNNYPEDQAVKNFVV
ANITGSDCRQLIHVENGKHFVIRNINARNITPDFSKKAGIDNATVAIYGC
DNFVIDNIEMINSAGMLIGYGVIKGKYLSIPQNFRVNNIQLDNTHLAYKL
RGIQISAGNAVSFVSLTNIEMKRASLELHNKPQHLFMRNINVMQESSVGP
ALSMNFDMRKDVRGVFMAKKETLLSLANVHAVNERGQSSVDIDRINHHIV
NVEKINFRLPERRE
>S3896 wecD, hypothetical protein
MQAKIAASNTGELDALQQLGFSLVEGEVDLALPVNNVSDSGAVVAQETDI
PALRQLASAAFAQSRFRAPWYAPDASGRFYAQWIENAVRGTFDHQCLILR
AASGDIRGYVSLRELNATDARIGLLAGRGAGAELMQTALNWAYARGKTTL
RVATQMGNTAALKRYIQSGANVESTAYWLYR
>S0013 yaaI, hypothetical protein
MKSVFTISASLAISLMLCCTAQANDHKLLGVIAMPRNETNDLALKLPVCR
IVKRIQLSADHGDLQLSGASVYFKAARSASQSLNIPSEIKEGQTTDWINI
NSDNDNKRCVSKITFSGHTVNSSDMATLKIIGDD
>S0096 yacA, hypothetical protein
MLWTSGFNDKICALNTFEFDRDGNNVSGILTRWRQFGKRYFWPHLLLGMV
AASLGLPALSNAAEPNAPAKATTRNHEPSAKVNFGQLALLEANTRRPNSN
YSVDYWHQHAIRTVIRHLSFAMAPQTLPVAEESLPLQAQHLALLDTLSAL
LTQEGTPSEKGYRIDYAHFTPQAKFSTPVWISQAQGIRAGPQRLT
>S0121 yacC, hypothetical protein
MSIVLPLTGRSSRRHNLIDNNGRRLARSVLTFIFFKPLVEAMKTFFRTVL
FGSLIAVCANSYALSESEAEDMADLTAVFVFLKNDCGYQNLSNGQIRRAL
VFFAQQNQWDLSNYDTFDMKALGEDSYRDLSGIGIPVAKKCKALARDSLS
LLAYVK
>S0131 yadD, hypothetical protein
MDAPSTTPHDAVFKQFLMHAETARDFLEIHLPVELRELCDLNTLHLESGS
FIEECLKGHSTDVLYSMQMQGNPGYLHVVIEHQSKPDKKMAFRMMRYSIA
AMHRHLEAGHDKLPLVVPILFYQGEATPYPLSMCWFDMFYSPELARRVYN
SPFPLVDITITPDDEIMQHRRIAILELLQKHIRQRDLMLLLEQLVTLIDE
GYTSGSQLVAMQNYMLQRGHTEQADLFYGVLRDRETGGKSMMTLAQWFEE
KGIEKGIQQGRQEERQEFALRLLSKGMSREDVAEMANLPLAEIDKVINLI
>S0302 yafO, hypothetical protein
MRVFKTKLIRLQLTAEELDALTADFISYKRDGVLPDIFGRDALYDDSFTW
PLIKFERVAHIHLANVNNPFPPQLRQFSRTNDEAHLVYCQGAFDEQAWLL
IAILKPEPHKLARDNNQMHKIGKMAEAFRMRF
>S0333 yaiA, hypothetical protein
MPTKPPYPREAYIVTIEKGKPGQTVTWYQLRADHPKPDSLISEHPTAQEA
MDAKKRYEDPDKE
>S0253 yaiB, hypothetical protein
MKNLIAELLFKLAQKEEESKELCAQVEALEIIVTAMLRNMAQNDQQRLID
QVEGALYEVKPDASIPDDDTELLRDYVKKLLKHPCQ
>S0257 yaiW, hypothetical protein
MSRVNHLSSLSLLAVLVLAGCSSQAPQPLKKGEKAIDVASVVRQKMPASV
KDRDAWAKDLATTFESQGLAPTLENVYSVLAVAQQESNYQADPAVPGLSK
IAWQEIDRRAERMHIPAFLVHTALKIKSPNGKSYSERLDSVRTEKQLSAI
FDDLISMVPMGQTLFGSLNPVRTGGPMQVSIAFAEQHTKGYPWKMDGTVR
QEVFSRRGGLWFGTYHLLNYPASYSAPIYRFADFNAGWYASRNAAFQNAV
SKASGVKLALDGDLIRYDSKEPGKTELATRKLAGKLGMSDSEIRRQLEKG
DSFSFEETALYKKVYQLAEAKTGKSLPREMLPGIQLESPKITRNLTTAWF
AKRVDERRARCMKQ
>S0357 yajI, hypothetical protein
MLSLLLPSTIRLIATRKGRPMNTNVFRLLLLGSLFSLSACVQQSEVRQMK
HSVSTLNQEMTQLNQETVKITQQNRLNAKSSSGVYLLPGAKTPARLESQI
GTLRMSLVNITPDTDGTTLTLRIQGESNDPLPAFSGTVEYGQIQGTIDNF
QEINVQNQLINAPASVLAPSDVDIPLQLKGISVDQLGFVRIHDIQPVMQ
>S0413 ybaJ, hypothetical protein
MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNE
LIEHIATFALNYKIKYNEDNKLIEQIDEYLDDTFMLFSSYGINMQALQKW
RKSGNRLFRCFVNATKENPASLSC
>S0418 ybaM, hypothetical protein
MSLENAPDDVKLAVDLIVLLEENQIPASTVLRALDIVKRDYEKKLTHDDE
AEK
>S0476 ybcH, hypothetical protein
MRKFIFVLLTLLLVSPFSFAMKGIIWQPQNRDSQVTDTQWQGLMSQLRFQ
GFDTLVLQWTRYGDAFTQPEQRTLLFKRAAAAQQTGLKLIVGLNADPEFF
MHQKQSSAALESYLNRLLAADLQQARLWSAAPGITPDGWYISAEIDDLNW
RSEAARQPLLTWLNNAQRLISDVSAKPVYISSFFAGNMSPDGYRQLLEQV
KATGVNVWVQDGSGVDKLTAEQRERYLQASADCQSSAPASGIVYELFVAG
KGKTFTAKPKPDAEIASLLAKRSSCGKDMLYFSLRYLPVAHGILEY
>S0714 ybcR, hypothetical bacteriophage protein
MYQMEKISTGIAYGTSAGSAGYWFLQWLDQ
>S0495 ybdJ, hypothetical protein
MKHPLETLTTAAGILLMAFLSCLLLPAPALGLTLAQKLVTMFHLMDLSQL
YTLLFCLWFLVLGAIEYFVLRFIWRRWFSLAD
>S0658 ybeR, hypothetical protein
MDMGSQKILFALSTPMEIRNECCLPSHSSPKMYLGTCFFDLSSSWGIDDR
DDLLRIIHRMIENGHAARLAGFYHRWFRYSPCEWRDYLAELNEQGQAYAQ
FVASTAECCGEGGIKAWDYVRMGFLSRMGVLNNWLSEEESLWIQSRIHLR
ALRYYSNWRQYFAGYTFGRQYWQSPEDDHLPLLREFLARKEYDDSGNDMF
YQLFASDDAYYPTLSWQPLADYPTCPETLKDMSDL
>S0655 ybeU, putative tRNA ligase
MNKEEQYLLFALSAPMEILNQSCKPAHDSPKMYTGIKEFDLSSSWGINNR
DDLIQTIYQMTDDGHANDLAGLYLTWQRSSPEEWKALIAGGSERGLIYTQ
FVAQTAMCCGEGGIKAWDYVRMGFLSRAGVLNNWLTEEESLWLQSRVYAR
AHHYYHSWMHYFSAYSLGRLYWQSSQCEDNASLREALTLYKYDSAGSRMF
EELAAGSDRFYATLPWQPLTVQPECPVTLKDVSDL
>S0654 ybeV, hypothetical protein
MKTCWQILEIESTTQIDIIRQAYLARLPLCHPETDPQGFKALRQAYEEAL
RLAVNPVEEADDEEKDATAEHEILRAFRTLLDSESDRFQPSAWQKFIQQL
NTWNMEDVDQLRWPLCAIAIEARYLSLNCASLLAERLNWHSFNDSEGMDE
EEREAFLEAIQAGDCFDFLSLLEYPVALQNQTVEYYFALERCCRYHPDYV
TAFLAMEGPWFIPDDAKLHRKLLRWYSSVQTGMVELIPVAKQWQAEEPES
EDARYYQCAQRLYCGEGESLLADLGAYWESYPSTQADNLLLQWSKRHCPD
YFALLVMVIEARSMVDAQGQPLKYVPGESARTRLLWAEILHSGKLSPLGQ
SFIESLFFKRKAWAWWKSRVGSETEEDSPLLDLYRVAEQVVLEAFPKQEM
LARLNTRLEGGDAHPLEAIITRMLLTKVKLEPEDEDVDEPTPENHEEKND
EGEKPQSITSIIKISLTVLVIGYALGKIAMLFS
>S0608 ybfA, hypothetical protein
MELYREYPAWLIFLRRTYAVAAGVLALPFMLFWKDRARFYSYLHRVWSKT
SDKPVWMDQAEKATGDFY
>S0619 ybfE, hypothetical protein
MYYGALSIRAEAWLIVSPEVTKIMAKEQTDRTTLDLFAHERRPGRPKTNP
LSRDEQLRINKRNQLKRDKVRGLKRVELKLNAEAVEALNELAESRNMSRS
ELIEEMLMQQLAALRSQGIV
>S0623 ybfM, hypothetical protein
MRTFSGKRSTLALAIAGVTAMSGFMAMPEARAEGFIDDSTLTGGIYYWQR
ERDRKDVTDGDKYKTNLSHSTWNANLDFQSGYAADMFGLDIAAFTAIEMA
ENGDSSHPNEIAFSKSNKAYDEDWSGDKSGISLYKAAAKFKYGPVWARAG
YIQPTGQTLLAPHWSFMPGTYQGAEAGANFDYGDAGALSFSYMWTNEYKA
PWHLEMDEFYQNDKTTKVDYLHSIGAKYDFKNNFVLEAAFGQAEGYIDQY
FAKASYKFDIAGSPLTTSYQFYGTRDKVDDRSVNDLYDGTAWLQALTFGY
RAADVVDLRLEGTWIKADSQQGYFLQRMTPTYASSNGRLDIWWDNRSDFN
ANGEKAVFFSAMYDLKNWNLPGFAIGASYVYAWDAKPATWQSNPDAYYDK
NRTIEESAYSLDAVYTIQDGRAKGTMFKLHFTEYDNHSDIPSWGGGYGNI
FQDERDVKFMVIAPFTIF
>S0793 ybiJ, hypothetical protein
MKTINTVVAAMALSTLSFGVFAAEPVTASQAQNMNKIGVVSADGASTLDA
LEAKLAEKAAAAGASGYSITSATNNNKLSGTAVIYK
>S0799 ybiM, hypothetical protein
MHVIDLSYFVKIYRSKNKHFSVNFLRHLHRAGDLHLSGTTRCIYEDVIMK
KCLTLLIATVLSGISLTAYAAQPMSNLDSGQLRPAGTVSATGASNLSDLE
DKLAEKAREQGAKGYVINSAGGNDQMLGTATIYK
>S0814 ybiU, hypothetical protein
MASTFTSDTLPADHKAAIRQMKHALRAQLGDVQQIFNQLSDDIATRVAEI
NALKAQGDAVWPVLSYADIKAGHVTAEQREQIKRRGCAVIKGHFPREQAL
GWDQSMLDYLNRNRFDEVYKGPGDNFFGTLSASRPEIYPIYWSQAQMQAR
QSEEMANAQSFLNRLWTFESDGKQWFNPDVSVIYPDRIRRLPPGTTSKGL
GAHTDSGALERWLLPAYQHVFANVFNGNLAKYDPWHAAHRTEVEEYTVDN
TTKCSVFRTFQGWTALSDMLPGQGLLHVVPIPEAMAYVLLRPLLDDVPED
ELCGVAPGRVLPVSEQWHPLLIEALTSIPKLEAGDSVWWHCDVIHSVAPV
ENQQGWGNVMYIPAAPMCEKNLAYAHKVKAALEKGASPGDFPREDYETNW
EGRFTLADLNIHGKRALGMDV
>S0846 ybjC, hypothetical protein
MRAIGKLPKGVLILEFIGMMLLAVALLSVSDSLSLPEPFSRPEVQILMIF
LGVLLMLPAAVVVILQVAKRLAPQLMNCPPQYSRSEREKDNDANH
>S0839 ybjH, hypothetical protein
MIMKNCLLLGALLMGFTGVAMAQSVTVDVPSGYKVVVVPDSVSVPQAVSV
ATVPQTVYVAPAPAPAYRPHPYVRHLASVGEGMVIEHQIDDHHH
>S0849 ybjN, putative sensory transduction regulator
MTSLVVPGLDTLRQWLDDLGMSFFECDNCQALHLPHMQNFDGVFDAKIDL
IDNTILFSAMAEVRPSAVLPLAADLSAINASSLTVKAFLDMQDDNLPKLV
VCQSLSVMQGVTYEQFAWFVRQSEEQISMVILEANAHQLLLPTDDEGQNN
VTENYFLH
>S0854 ybjO, hypothetical protein
MEDETLGFFKKTSSSHARLNVPALVQVAALAIIMIRGLDVLMIFNTLGVR
GIGEFIHRSVQTWSLTLVFLSSLVLVFIEIWCAFSLVKGRRWARWLYLLT
QITAASYLWAASLGYGYPELFSIPGESKREIFHSLMLQKLPDMLILMLLF
VPSTSRRFFQLQ
>S1012 ycbW, hypothetical protein
MLLRFYRVGERQMRIKPDDNWRWYYDEEHDRMMLDLANGMLFRSRFARKM
LTPDAFSPAGFCVDDAALYFSFEEKCRDFNLSKDQKAELVLNALVAIRYL
KPQMPKSWHFVSHGEMWVPMPGDAACVWLSDTHEQVNLLVVESGENAALC
LLAQPCVVIAGRAMQLGDAIKIMNDRLKPQVNVDSFSLEQAV
>S1072 yccD, hypothetical protein
MANVTVTFTITEFCLHTGISEEELNEIVGLGVVEPREIQETTWVFDDHAA
IVVQRAVRLRHELALDWPGIAVALTLMDDIAHLKQENRLLRQRLSRFVAH
P
>S1074 yccE, hypothetical protein
MGSNIHGISCTANNYLKQAWNDIKNEYEKNQTYSITLFENTLVCFMRLYN
ELRRKVNEEDTPCLECESLEKEFEEMQNDNDLSLFMRTLRTNDTQIYSGV
SGGITYTIQYVQDVDIVRVSLPGRGSESITDFKGYYWYGFMEYIENINAC
DDVFSEYCLDNENMSIQPEQINMPGISDLDTGIDLSGISFIQSEINKTYG
LKYAPVDGDGYCLLRAILVLKEHEYSWALGSHKTQKQVYEEFIKIVDKQT
IEALVDTAFYNLREDVKTLFGVDLQSDNKIQGVGSFMSWSFLFFKKQFID
SCLNDKKCILHLPEFIFNDNKNLLALDTDTSDRIKAVKNFLAVLSDSICS
LFIVNSNVASISLGNESFSTDEDLEYGYLTNTGNHYDVYLPPELFSQAYK
LKNKEMNAQLDYLNRYAT
>S1076 yccJ, hypothetical protein
MPTQEAKAHHVGEWASLRNTSPEIAEAIFEVAGYDEKMAEKIWEEGSDEV
LVKAFAKTDKDSLFWGEQTIERKNV
>S1078 ycdF, hypothetical protein
MISILAKKLLQDITKRGSVRVSSLSLVPRSENMAAAIRRVNSMQYDVYTQ
TEVSNGKPSRRFRQFCRRPRKSIRSR
>S1147 yceB, hypothetical protein
MNKFLFAAALIVSGLLVGCNQLTQYTITEQEINQSLAKHNNFSKDIGLPG
VADAHIVLTNLTSQIGREEPNKVTLTGDANLDMNSLFGSQKATMKLKLKA
LPVFDKEKGAIFLKEMEVVDATVQPEKMQTVMQTLLPYLNQALRNYFNQQ
PAYVLREDGSQGEAMAKKLAKGIEVKPGEIVIPFTD
>S1142 yceO, hypothetical protein
MRRLLHYLINNIREHLMLYLFLWGLLAIMDLIYVFYF
>S1144 yceP, hypothetical protein
MMEKNNEVIQTHPLVGWDISTVDSYDALMLRLHYQTPNKSEQEGTEVGQT
LWLTTDVARQFISILEAGIAKIESGDFPVNEYRRH
>S1196 ycfR, hypothetical protein
MSFANFAAVEVQSTPEGQQKVGTISANAGTNLGSLEEQLAQKADEMGAKS
FRITSVTGPNTLHGTAVIYK
>S1292 ychH, hypothetical protein
MKRKNASILGNVLMGLGLVVMVVGVGYSILNQLPQFNMPQYFAHGAVLSI
FVGAILWLAGARVGGHEQVCDRYWWVRHYDKRCRRSDNRRHS
>S1307 ychP, putative factor
MAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFAWGKVRDALSQQ
VNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQL
GLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLQDENLQRAGFGAEAW
GEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSV
SLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGE
NQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY
RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLT
PGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLT
LVEPFDALSNDELRWEP
>S1344 yciC, hypothetical protein
MSITAQSVYRDTGNFFRNQFMTILLVSLLCAFITVVLGHVFSPSDAQLAQ
LNDGVPVSGSSGLFDLVQNMSPEQQQILLQASAASTFSELIGNAILAGGV
ILIIQLVSAGQRVSALRAIGASAPILPKLFILIFLTTLLVQIGIMLVVVP
GIIMAILLALAPVMLVQDKMGIFASMRSSMRLTWANMRLVAPAVLSWLLA
KTLLLLFASSFAALTPEIGAVLANTLSNLISAILLIYLFRLYMLIRQ
>S1360 yciN, hypothetical protein
MNKETQPIDRETLLKEANKIIREHEDTLAGIEATGVTQRNGVLVFTGDYF
LDEQGLPTAKSTAVFNMFKHLAHVLSEKYHLVD
>S1463 ydbD, hypothetical protein
MAGNVCNQMKKEALFRPKPSPELVQELQMLDEGNVAAFEGRDIATFDLAI
MRTLPRLKGISANLRKQLINSNDEQTIESMARYMPDNEILELTDQQLGYQ
PVVLGLLDREPLSVEIMTRMSRLPDGVGPLNLALRENLPLDIVMTLAKRD
WDMIIQELYKDAWLLPESIIDGYIRSDDSSIRQVGAGGQLTYNQAMQLAN
DSSNNVVTSLAFKLAEMKHHGQLLRMTPQESDKVAAYLYQKFENDDDLIR
VLFLALPDNLQFNFVKRMEKKSPAYFCCRDMQVIHSDAALQRLLTRFNDP
EGWSNLAKNQYLSTSMKQKIWQRALSHRKNNPKADSDAYETSADMILSEL
ISHGEVDDQMLLNAAALIRLEDWDFLESALVSWDNLPAVVLKELQQNTPR
NDIWAKFFLRQENSSRAQVDEALRVYYALDPDALAQLDVLAKQPDRIWWS
TLAKSNLTFFKFGALSNRHTPSAALAAEIDPEWWIVAMNNPRFPVDVLKA
RLKRDPLLALELVNPELDLVRQLALNGKTRAIREQAMRKLDELY
>S1458 ydbH, hypothetical protein
MLGKYKAVLALLLLIILVPLTLLMTLGLWVPTLADIWLPLGTRIALDESP
RITRKGLIIPDLRYLVGDCQLAHITNASLSHPSRWLLNVGTVELDSACLA
KLPQTEQSPAAPKTLAQWQSMLPNTWINIDKLIFSPWQEWQGKLSLALTS
DIQQLRYQGEKVKFQGQLKGQQLTVSELDVVAFENQPPVKLVGEFTMPLV
PDGLPVSGHATATLNLPQEPSLVDAELDWQENSGQLIVLARDNGDPLLDL
PWQITRQQLTVSDGRWSWLYAGFPLSGRLGVKVDNWQAGLENALVSGRLN
VLTQGQAGKGNAVLNFGPGKLSMDNSQLPLQLTGEAKQADLILYARLPAQ
LSGSLSDPTLTFEPGALLRSKGRVIDSLDIDEIRWPLAGVKVTQRGVDGR
LQAILQAHENELGDFVLHMDGLANDFLPDAGRWQWRYWGKGSFTPMNATW
DVAGKGEWHDSTITLTDLSTGFDQLQYGTMTVEKPRLILDKPVVWVRDAQ
HPSFSGALSLDAGQTLFTGGSVLPPSTLKFSVDGRNPTYFLFKGDLHAGE
IGPVRVNGRWDGIRLRGNAWWPKQSLTVFQPLVPPDWKMNLRDGELYAQV
AFSAAPEQGFRAGGHGVLKGGSAWMPDNQVNGVDFVLPFRFADGAWHLGT
RGPVTLRIAEVINLVTAKNITADLQGRYPWTEEEPLLLTDVSVDVLGGNV
LMKQLRMPQHDPALLRLNNLSSSELVSAVNPKQFAMSGAFSGALPLWLNN
EKWIVKDGWLANSGLMTLRLDKDTADAVVKDNMTAGSAINWLRYMEISRS
STKINLDNLGLLTMQANITGTSRVDGKSGTVNLNYHHEENIFTLWRSLRF
GDNLQAWLEQNARLPGNDCPQGKECEEKQ
>S1479 ydcA, hypothetical protein
MKKLALILFMGTLVSFYADAGRKPCSGSKGGISHCTAGGKFVCNDGSISA
SKKTCTN
>S1684 ydeI, hypothetical protein
MKFQSIVLASFLVMPYALADDQGGLKQDAAPPPPHAIEDGYRGTDDAKKM
TVDFAKNMHDGASVSLRGNLISHKGVR
>S1830 ydhU, hypothetical protein
MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALL
RARGVKKSATDHGEKVYLYSKAVRLWHWSNALLFVLLLASGLINHFAMVG
ATAVKSLVAVHEVCGFLLLACWLGFVLINAVGDNGHHYRIRRQGWLERAA
KQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGL
LCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHET
FKSMVDGYHRH
>S1592 ydjY, hypothetical protein
MKALRCALFYNSVWRAWRCLFSSSSQNSTPDTNMWICLLRAAAHLLTQKM
KDRTMSQHYSVSWKKGLAALCLLAVAGLSGCDQKENAAAKVEYDGLSNSQ
PLRVDANNHTVTMLVQINGRFLTDDTRHGIVFKDGSNGHKSLFMAYAPPK
AFYEALKEAGGTPGENMTMDNKETTHVTGSKLDISVNWQGAAKAYSFDEV
IVDSNGKKLDMRFGGNLTAAEEKKTGCLVCLDSCPVGIVSNATYTYGAVE
KHGEVKFKGNASVLPADNTLATVTFKITE
>S1923 yebF, hypothetical protein
MKKRGAFLGRLLVSACASVFAANNETSKSVTFPKCEGLDAAGIAASVKRD
YQQNRVARWADDQKIVGQADPVAWVSLQDIQGKDDKWSVPLTVRGKSADI
HYQVCVDCKAGMAEYQRR
>S2054 yecF, hypothetical protein
MSTPDFSTAENNQELANEVSCLKAMLTLMLQAMGQADAGRVMLKMEKQLA
LIEDETQAAVFSKTVKQIKQAYRQ
>S2045 yecH, hypothetical protein
MDSIHGHEVLNMMIESGEQYTHASLEAAIKARFGEQARFHTCSAEGVTAG
ELVAFLAAKGKFIPSEEGFSTDQSKICRH
>S2067 yedD, hypothetical protein
MKKLAIAGALMLLAGCAEVENYNNVVKTPAPDWLAGYWQTKGPQRALVSP
EAIGSLIVTKEGDTLDCRQWQRGIAVPGKLTLMSDDLTNVTVKRELYEVE
RDGNTIEYDGMTMERVDRPTAECAAALDKAPLPTPLP
>S3202 yeeT, hypothetical protein
MKIITRGEAMRIHRQHPASRLFPFCTGKYRWHGSTDTYTGREVQDIPGVL
AVFAERRKDSFGPYVRLMSVTLN
>S3203 yeeU, putative structural protein
MSDTLPGTTPPDDNHDRPWWGLPCTVTPCFGARLVQEGNRLHYLADRAGI
RGRFSDVDAYHLDQAFPLLMKQLELMLTGGELNPRHQHTVTLYAKGLTCE
ADTLGSCGYVYLAVYPTPAAPATTV
>S3204 yeeV, hypothetical protein
MNTLPDTHVREASGCPSPITIWQTLLTRLLDQHYGLTLNDTPFADERVIE
QHIEAGISLCDAVNFLVEKYALVRTDQPGFSAGAPSQLINSIDILRARRA
TGLMTRDNYRTVNNITRGKHPEAKQ
>S3205 yeeW, hypothetical protein
MKLALTLEADSVNVQALNMGRIVVDVDGVNLSELINKVSENGYSLRVVDK
SDQHATSTPPPLTTLTCIRCSTAHITETDNAWLYSLSHQTNDDGETEWIH
FTGSGYLLRTDAWSYPVLRLKRLGLSRTFRRLVVTLTRRYGVSLIHLDAG
AECLPGFPTFDW
>S2301 yehE, hypothetical protein
MNKYWLSGIIFLAYGLASPAVSSETATLTINGRISPPTCSMAMVNSQPQQ
HCGQLTYNVDTRHQVSSPVKGVTTEVVVAGSDSKRRIVLNRYD
>S2314 yehM, hypothetical protein
MSEPLIVGIRHHSPACARLVKSLIESQRPRYVLIEGPADFNDRVDELFLA
HQLPVAIYSYCQYQDGAAPGRGAWTPFAEFSPEWQALQAARRIQAQTYFI
DLPCWAQSEEEDDSPDTQDESQTLLLRDTRMDNSDTLWDHLFEDESQQTA
LPSALARYFAQLRGDSPGDALNRQREAFMARWITWAMQQNNGDVLVVCGG
WHAPALANMWRECPQEINKPELSSLADAVTGCYLTPYSEKRLDVLAGYLS
GMPAPVWQNWCWQWGLQQAGEQLLKTVLTRLRQHKLPASTADMAAAHLHA
MALAQLRGHTLPLRTDWLDAIAGSLIKEALNAPLPWSYRGVIHPDTDPIL
LTLIDTLAGDGFGKLAPSTPQPPLPKDVTCELERTAISLPAELTLNRFNP
NGLAQSQVLHRLAILEIPGIVRQQGSTLTLAGNGEERWKLTRPLSQHAAL
IEAACFGATLLEAARHKLEADMLDAGGIGSITTCLSQAASAGLASFSQQL
LEQLTLLIAQENQFAEMGQALEVLYALWRLDEISGMQGAQILQTTLCAAI
DRTLWLCESNGRPDEKEFHAHLHSWQALCHILRDLHSGVQLPGVSLSAAV
ALLERHSQAIHVLALDRGATLGALMRLEHPNASAEAALTMLAQLSPAQSG
EALHGLLALARHQLACQPAFIAGFSSHLNQLSDADFTNALPDLRAAMAWL
PPRERGTLAHQVLEHYQLAQLPVSALQMPLHCPPQAIAHHQQLEQQALAS
LQHWGVFHV
>S2316 yehQ, hypothetical protein
MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENSALIAT
FSDGVRTQLANGQALKEAQCTYGASGMCRHRVMLVLSYQRLCATAQPTGK
EEEWDPAIWLEELATLPDATRKRAQALVAKSITIELFCAPGEIPSARLPM
SDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVQAKAQQAELTHLIWQ
MRSEHVTSSDDPFASEEGKTCRQYVQQLSQALWLSGISQPLIHYEAAFSR
AQQAAERCSWRWVSESLRQLRASVDAFHARASHYHAGECLRQLAALNSRL
NCAQEMARSDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDI
EHYGLRIWFTDPDTGSILHLSRSWPRSEQEDSPAATRRLFSFQAGALAGG
QIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALR
EYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGE
DNLLTLSLPASASAPYAIERMAALLQQTDDPVCLVSGFVSFVDGQLTLEP
QVMMTKTRAWALDAETAPVVVSLPSASVLPVPSTAHQLLMRCQALLIQLL
HNGWRYQEQSAINQAELLANDLTAVGFYRLAHVLAQFRNTESEARVEAMN
NGVLLCEQLFPMLQQQG
>S2397 yejG, hypothetical protein
MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPD
GDNAWSVMYKLSQALSDIEVPCSVLECEGEPCLFVNRQDEFAATCRLKNF
GVAIAEPFSNYNPF
>S2603 yfeC, hypothetical protein
MTPDELARLTGYSRQTINKWVRKEGWTTSPKPGVQGGKARLVHVNEQVRE
YIRNAERPEGQGEAPALSGDAPLEVLLVTLAKEMTPVEQKQFTSLLLREG
IIGLLQRLGIRDSK
>S2620 yfeK, hypothetical protein
MKKIICLAITLLMTLPAYAKLTVHEEARINAMLEGLAQKKDLIFVRNGDE
HTCDEAVSHLRLKLGNTRNRIDTAEQFIDKVASSSSITGKPYIVKIPGKS
DENAQPFLHALIAQTDKTVPAEGN
>S2774 yfhG, putative alpha helix protein
MRHIFQRLLPRRLWLAGLPCLALLGCVQNHNKPAIDTPAEEKIPVYQLAD
YLSTECSDIWALQGKSTETNPLYWLRAMDCADRLMPAQSRQQARQYDDGS
WQNTFKQGILLADAKITPYERRQLVARIEALSTEIPAQVRPLYQLWRDGQ
ALQLQLAEERQRYSKLQQSSDSELDTLRQQHHVLQQQLELTTRKLENLTD
IERQLSTRKPAGNFSPDTPHESEKPAPSTHEVTPDEP
>S2885 ygaC, hypothetical protein
MYLRPDEVARVLEKVGFTVDVVTQKAYGYRRGENYVYVNREARMGRTALV
IHPTLKERSSTLAEPASDIKTCDHYQQFPLYLAGERHEHYGIPHGFSSRV
ALERYLNGLFGEAS
>S2897 ygaH, hypothetical protein
MSYEVLLLGLLVGAANYCFRYLPLRLRVGNARPTKRGAVGILLDTIGIAS
ICALLVVSTAPEVMHDTRRFVPTLVGFAVLGASFYKTRSIIIPTLLSALA
YGLAWKVMAII
>S2941 ygbA, hypothetical protein
MSGKRISREKLTIKKMIDLYQAKCPQASAEPEHYEALFVYAQKRLDKCVF
GEEKPACKQCPVHCYQPAKREEMKQIMRWAGPRMLWRHPILTVRHLIDDK
RPVPELPEKYRPKKPRE
>S2965 ygbE, putative cytochrome oxidase subunit
MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNT
LFFFLYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGA
LFMWLLG
>S3032 ygdB, hypothetical protein
MLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHSWQ
TQPAVQCSQYAGTDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGKIV
FSPRGWSDFCPLKEGALCQLP
>S3156 yggM, putative alpha helix chain
MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAISL
KNLDAKGNQATWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFS
AMLTSKGTPASGWSVNFYSFQAAASDRGRVVDDIKTNNKYLIVNSEDFNY
RFSQLESALNTQKNSIPALEKEVKALDKQMVAAQKAADAYWGKDANGKQM
TREEAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEECYE
VPIQQKRDFDINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEI
NSKKVAILMKIDDINQANER
>S3281 ygiA, hypothetical protein
MTTTGLRPRLNVRQRKDTGYLPHSSPFSLQFRPAILYSDGYLPLVPEDKN
ETDKIHTPRIVPQKLERTPSDTSRSRGCHCFYAGWL
>S3358 yhaL, hypothetical protein
MSKKSTKKRQPAKPVVAKEPARTAKNFGYEEMLSELEAIVADAETRLAED
EATA
>S3493 yhcN, hypothetical protein
MGHETKAQLKDYVEVKIMKIKTTVAALSVLSVLSFGAFAADSIDAAQAQN
REAIGTVSVSGVASSPMDIREMLNKKAEEKGATAYQITEARSGDTWHATA
ELYK
>S3497 yhcR, hypothetical protein
MPFSAATDAENNNACSLYTTKVNMSLFPVIVVFGLSFPPIFFELLLSLAI
FWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLISRLFV
>S4382 yhfG, hypothetical protein
MKKLTDKQKSRLWELQRNRNFQASRRLEGVEMPLVTLTAAEALARLEELR
RHYER
>S4375 yhfL, hypothetical protein
MNKFIKVALVGAVLATLTACTGHIENRDKNCSYDYLLHPAISISKIIGGC
GPTAQ
>S4368 yhfS, hypothetical protein
MKTFPLQSLTLAEAQQKQFALVDTICRHFPGSEFLAGGDLGLTPSLNQPR
ITQRVEQVLADAFHAQAAALVQGAGTGAIRAGLAALLKPGQRLLVHDAPV
YPTTRVIIEQMGLTLITADFNDLSALKQVVDEQQPDAALVQHTRQQPQDS
YVLADVLATLRAAGVPALTDDNYAVMKVARIGCECGANVSTFSCFKLFGP
EGVGAVVGDADVISRIRATLYSGGSQIQGSQALEVLRGLVFAPLIHAVKA
GVSERLLALLNGGAVAEVKSAVIANAQSKVLIVEFHQLIAARVLEEAQKR
GALPYPVGAESKYEIPPLFYRLSGTFRQANPQLEHCAIRINPNRSGEETV
LRILRESIASI
>S4362 yhfY, hypothetical protein
MGNRNPVIKRKASDMETRLNLLCEAGVIDKDVCKGMMQVVNVLEKECHLP
VRSEQGTMAMTHMASALMRSRRGEEIEPLDNELLAELAQSSHWQAVVQLH
QELLKEFALEVNPCEEGYFLANLYGLWMAANEEV
>S4361 yhfZ, hypothetical protein
MDNKALLSHVDINNVVCAMPLPYTRLYEGLASGLKAQFDGIPFYYAHMRG
ADIRVECLLNGVYDMAVVSRLAAESYLSQNNLCIALELGPHTYVGEHQLI
CRKGESGNVKRVGLDSRSADQKIMTDVFFGDSDVERVDLSYHESLQRIVK
GDVDAVIWNVVAENELTMLGLEATPLTDDPRFLQATEAVVLTRVDDYPMQ
QLLRAVVDKHALLAHQQRVVSGEQEPSY
>S4333 yhgA, hypothetical protein
MSKKQSSTPHDALFKLFLRQPETARDFLAFHLPAPIHALCDMKTLKLESR
SFIDDDLRESYSDVLWSVKTEQGPGYIYCLIEHQSTSNKLIAFRMMRYAI
AAMQNHLDAGYKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPKLARQLY
ASAFPLIDVTVMPDDEIMQHRRMALLELIQKHIRQRDLMGLVEQMACLLS
SGYANDRQIKGLFNYILQTGDAVRFNDFIDGVAERSPKHKESLMTIAERL
RQEGEQSKALHIAKIMLESGVPLADIMRFTGLSEEELAAASRLAP
>S4342 yhgE, putative transport protein
MDNVELSPATRWGMIATGLLQGMIATGLLQGLVCYLLIAWLAGKNHSWIV
YGVPATVAFSSVLLFSVISFKQKRLWGWLALVFIATTGMSGWLKWQTDGM
NPWRAEKAIWDFGCYLLLMAMLLLPWIQQSLRIRNGSSRYSYFYQSVWHN
VLILLVIFLSNGLTWLVLLLWGELFKLVGITFFNTLFFATDWFMYLTFGL
VTALAVILARTQSRLIDSIQKLFTLIATGLLPLVSLLTLMFIITLPFTGL
SAISRHISAAGLLLTLAFLQLILMAIVRDPQKASLPWTGPLRCLIKTALL
VAPLYVFVATWGLWLRVAQYGWTVDRLQGALAELVLLVWSLGYFVSIVWR
NGQNPLVLQGKVNLAVSLLVLVILVLLNSPVLDSMRISVNSHMARYQSGK
NTPDQVTIYMLEQSGRYGRAALESLKSDAGFMKDPKRARDLLMALDGEQH
LQEQVSEKVFAENVLIAPGSVKPDATFWSALIQDRYNVMTCIEKDACVLV
EQDLNSDGQAERILFAFNDDRVIVYGFDSDRKEWDALDMSLLPNEITKEK
LLTAAKDGKLGTRPKAWRDLTVDGETLEINLSK
>S4334 yhgG, hypothetical protein
MASLIQVRDLLALRGRMEATQISQTLNTPQPMINAMLQQLESMGKAVRIQ
EEPDGCLSGSCKSCPEGKACLREWWALR
>S4297 yhhA, hypothetical protein
MKRLLILTALLPFVGFAQPINTLNNPNQPGYQIPSQQRMQTQMQTQQIQQ
KGMLNQQLKTQTQLQQQHLENQINNNSQRVLQSQPGERNPARQQMLPNTN
GGMLNSNRNPDSSLNQQHMLPERRNGDMLNQPSTPQPDIPLKTIGP
>S4278 yhhM, putative receptor
MSKPPLFFIVIIGLIVVAASFRFMQQRREKADNDMAPLQQKLVVVSNKRE
KPINDRRSRQQEVTPAGTSMRYEASFKPQSGGMEQTFRLDAQQYHALTVG
DKGTLSYKGTRFVSFVGEQ
>S4259 yhiJ, hypothetical protein
MKIGTVAGTNGSTTTIATNDMVQEHVTNFTKELFGYIANGIGDDISSIAR
TMLGEVVEKIDDWQIERFQQSIQDDKISFTIQTNHSEKYSMLSGMRAHIL
RRNNCYQFIVTINSKNYGCPLDNTDINWCSIVYLLNNMTVNDNANDVAVT
ESYKPIWNWEISQYNVFDIKFETIIKPQFADRTYFSNCSPVDPTSTRPTY
FGDTDGSVGAVLYALFATGHLGIMAEGENFLSQLLNIEDEVLNVLLRENF
NEQLDTNVNTIISILNRRDNVLESLQPYLVINKDAVTPCTFLGDQTGDRF
SNICGDQFIIDLLKRIMSINENVHVLAGNHETNCNGNYMQNFTRMKPLDE
DTYDGIKDYPVCFYDPKYKIMANHHGITFDDQRKRYIIGPITVSIDEMTN
ALDPVELAEIINKKHHAIINGKKFKTSRAISCRSFNRYFSVSTDYRPKLE
ALLACSQMLGINQVVAHNGNGGRERIGETGTVLGLNARDSKHAGRMFSMH
NCQINPGAGPEITTPWKSYQHEKNRNGLMPLIRRRTMLQL
>S4243 yhiO, hypothetical protein
MISTVALFWALCVVCIVNMARYFSSLRALLVVLRNCDPLLYQYVDGGGFF
TSHGQPNKQVRLVWYIYAQRYRDHHDDEFIRRCERVRRQFILTSALCGLV
VVSLIALMIWH
>S4203 yhjN, hypothetical protein
MAMGMSAFPSFMTQATPATQPLINAEPAVAAQTEQNPQVGQVMSGEQGAD
APIVAQNGPSRDVKLTFAQIAPPPGSMVLRGINPNGSIEFGMRSDEVVTK
AMLNFEYTPSPSLLPVQSQLKVYLNDELMGVLPVTKEQLGKKTLAQMPIN
PLFITDFNRVRLEFVGHYQDVCENPASTTLWLDVGRSSGLDLTYQTLNVK
NDLSHFPVPFFDPRDNRTNTLPMVFAGAPDVGLQQASAIVASWFGSRSGW
RGQNFPVLYNQLPDRNAIVFATNDKRPDFLRDHPAVKAPVIEMINHPQNP
YVKLLVVFGRDDKDLLQAAKGIAQGNILFRGESVVVNEVKPLLPRKPYDA
PNWVRTDRPVTFGELKTYEEQLQSSGLEPAAINVSLNLPPDLYLMRSTGI
DMDINYRYTMPPVKDSSRMDISLNNQFLQSFNLSSKQEANRLLLRIPVLQ
GLLDGKTDVSIPALKLGATNQLRFDFEYMNPMPGGSVDNCITFQPVQNHV
VIGDDSTIDFSKYYHFIPMPDLRAFANAGFPFSRMADLSQTITVMPKAPN
EAQMETLLNTVGFIGAQTGFPAINLTVTDDGSTIQGKDADIMIIGGIPDK
LKDDKQIDLLVQATESWVKTPMRQTPFPGIVPDESDRAAETQSTLTSFGA
MAAVIGFQSPYNDQRSVIALLADSPRGYEMLNDAVNDSGKRATMFGSVAV
IRESGINSLRVGDVYYVGHLPWFERLWYALANHPILLAVLATISVILLAW
VLWRLLRIISRRRLNPDNE
>S4200 yhjR, hypothetical protein
MNNNEPDTLPDPAIGYIFQNDIVALKQAFSLPDIDYADISQREQLAAALK
RWPLLAEFAQQK
>S4199 yhjS, putative protease
MRDIVDPVFSIGISSLWDELRHMPAGGVWWFNVDRHEDAISLANQTIASQ
AETAHVAVISMDSDPAKIFQLDDSQGPEKIKLFSMLNHEKGLYYLARDLQ
CSIDPHNYLFILVCANNAWQNIPAERLRSWLDKMNKWSRLNHCSLLVINP
GNNNDKQFSLLLEEYRSLFGLASLRFQGDQHLLDIAFWCNEKGVSARQQL
SVQQQNGIWTLVQSEEAEIQPRSDEKRILSNVAVLEGAPPLSEHWQLFNN
NEVLFNEARTAQAATVVFSLQQNAQIEPLARSIHTLRRQRGSAMKILVRE
NTASLRATDERLLLACGANMVIPWNAPLSRCLTMIESVQGQKFSRYVPED
ITTLLSMTQPLKLRGFQKWDVFCNAVNNMMNNPLLPAHGKGVLVALRPVP
GIRVEQALTLCRPNRTGDIMTIGGNRLVLFLSFCRINDLDTALNHIFPLP
TGDIFSNRMVWFEDDQISAELVQMRLLAPEQWGMPLPLTQSSKPVINAEH
DGRHWRRIPEPMRLLDDAVERSS
>S4198 yhjT, hypothetical protein
MMTISDIIEIIIVCALIFFPLGYLARHSLRRIRDTLRLFFAKPRYVKPAG
TLRRTEKARATKK
>S4197 yhjU, hypothetical protein
MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVF
AAFLLMPIPRYSLHRLRHWIALPIGFALFWHDIWLPGPESIMSQGSQVAG
FSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITVFVVAILLWLN
VLTLAGPSFSLWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAP
PTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVINICSLSWSDI
EAAGLMSHPLWSHFDIDFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQ
PANNDCYLFDNLSKLGFTQHLMMGHNGQFGGFLKEVRENGGMQTELMDQT
NLPVILLGFDGSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNH
YPGVSKTADYKARAQKFFDELDAFFTELEKSGRKVMVVVVPEHGGALKGD
RMQVSGLRDIPSPSITDVPVGVKFFGMKAPHQGAPIVIDQPSSFLAISDL
VVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLN
GGDWVPYPQ
>S4187 yhjY, putative lipase
MIIKKSGGRWQLSLLASVVISAFFLNTAYAWQQEYIVDTQPGHSTERYTW
DSDHQPDYNDILSQRIQSSQRALGLEVNLAEETPVDVTSSMSMGWNFPLY
EQVTTGPVAALHYDGTTTSMYNEFGDSTTTLTDPLWHASVSSLGWRVDSR
LGDLRPWAQISYNQQFGENIWKAQSGLSRMTATNQNGNWLDVTVGADMLL
NQNIAAYAALTQAENTTNNSDYLYTMGVSARF
>S4181 yiaF, hypothetical protein
MILPGRLRRKGILQACPGLSLSRQTRVCRCALFLGERSKKMATGKSCSRW
FAPLAALLMVVSLSGCFDKEGDQRKAFIDFLQNTVMRSGERLPTLTADQK
KQFGPFVSDYAILYGYSQQVNQAMDSGLRPVVDSVNAIRVPQDYVTQSGP
LREMNGSLGVLAQQLQNAKLQADAAHSALKQSDDLKPVFDQAFTKVVTTP
ADALQPLIPAAQTFTQQLVMVGDYIAQQGTQVSFVANGIQFPTSQQASEY
NKLIAPLPAQHQAFNQAWTTAVTATQ
>S4180 yiaG, hypothetical protein
MEYKDPMHELLSSLEQIVFKDETQKITLTHRTTSCTEIEQL
>S4114 yibD, putative regulator
MYKRLITNVRSVKVGYQALLWSFRLWQWRDKTRSHHRITRSAFNLR
>S4136 yibI, hypothetical protein
MFLNYFALGVLIFVFLVIFYGIIAIHDIPYLIAKKRNHPHADAIHTAGWV
SLFTLHVIWPFLWIWATLYQPERGWGMQSHVASQEKATEPEIAALSDRIS
RLEHQLAAEKKTDYSTFPEI
>S4132 yibL, hypothetical protein
MKEVEKNEIKRLSDRLDAIRHQQADLSLVEAADKYAELEKEKATLEAEIA
RLREVHSQKLSKEAQKLMKMPFQRAITKKEQADMGKLKKSVRGLVVVHPM
TALGREMGLEEMTGFSKTTF
>S4074 yicH, hypothetical protein
MKFIGKLLLYILIALLVVIAGLYFLLQTRWGAEHISAWVSENSDYHLAFG
AMDHRFSAPSHIVLENVTFGRDGQPAPLVAKSVDIALSSRQLTEPRHVDT
ILLENGTLNLTDQTAPLPFKADRLQLRDMAFNSPNSEWKLSAQRVNGGVV
PWSPEAGKVLGTKAQIQFSAGSLSLNDVPATNVLIEGSIDNDRVTLTNLG
ADIARGTLTGNAQRNADGSWQVENLRMADIRLQSEKSLTDFFAPLRSVPS
LQIGRLEVIDARLQGPDWAVTDLDLSLRNMTFSKDDWQTQEGKLSMNASE
FIYGSLHLFDPIINAEFSPQGVALRQFTSRWEGGMVRTSGNWLRDGKTLI
LDDAAIAGLEYTLPKNWQQLWMETTPGWLNSLQLKRFSASRNLIIDIDPD
FPWQLTALDGYGANLTLVTDHKWGVWSGSANLNAAAATFNRVDVRRPSLA
LTANSSTVNISELSAFTEKGILEATASVSQTPQRQTHISLNGRGVPVNIL
QQWGWPELPLTGDGNIQLTASGDIQANAPLKPTVSGQLHAVNAAKQQVTQ
TMNAGVVSSGEVTSTEPVR
>S3984 yidI, hypothetical protein
MGIIAQNKISSLGMLFGAIALMMGIIHFSFGPFSAPPPTLESIVADKTAE
IKRGLLAGIKGEKITTVEKKEDMDIDKILDQSGIALAIAALLCAFIGGMR
KENRWGIRGALVFGIGIVCSILLIFLIFSFLTGGSLV
>S4028 yieI, hypothetical protein
MSVSRRVIHHGLYFAVLGPLIGVLFLVLYIFFAKEPLVLLVIIQVLPLFL
LLSITTGAIPALLTGVMVACLPEKIGSQKNYRCLAGGIGGVVITEIYCAV
IVHIKGMASSELFENILSGDSLVVRIIPALLAGVVMSRIITRLPGLDISC
PETDSLS
>S3893 yifM, 4-alpha-l-fucosyltransferase
MTVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSC
PALSVQFFPGKKSLAEAVIAKAKANRQQRFFFHGQFNPTLWLALLSGGIK
PSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFA
KTHPKVRGELLYFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHVA
ALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELFSEENLQILSE
KLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQ
DMTEQHLPVLFTTDDLNEDIVREAQRQLASADKNTIAFFSPNYLQGWQRA
LAIAAGEVA
>S3859 yigG, hypothetical protein
MLTGDSHKDVKFMLRMFIPTSNGKISRHRYIFLFILINFIFAFLIIFFND
GEAGFLVIVSTIALHYLVINMNCQRLRDSGFIYIKIYVFGTLAVYIISII
TMIAEDFACSGNGSMIFLICYFSTFSMLMLAPTDSSKQ
>S3749 yiiQ, hypothetical protein
MKPGCTLFFLLCSALTVTTTAHAQTPDTATTAPYLLAGAPTFDLSISQFR
EDFNSQNPSLPLNEFRAIDSSPDKANLTRAASKINENLYASTALERGTLK
IKSIQMTWLPIQGPEQKAAKAKAQEYMAAVIRTLTPLMTKTQSQKKLQSL
LTAGKNKRYYTETEGALRYVVADNGEKGLTFAVEPIKLALSESLEGLNK
>S3732 yiiX, hypothetical protein
MKNRLLILSLLVSVPAFAWQPQTGDIIFQISRSSQSKAIQLATHSDYSHT
GMLVMRNKKPYVFEAVGPVKYTPLKQWIAHGEKGNYVVRRVEGGLSVEQQ
QKLTQTAKRYLGKPYDFSFSWSDDRQYCSEVVWKVYQNALGMRVGEQQKL
KEFDLSNPLVQAKLKERYGKNIPLEETVVSPQAVFDAPQLTTVAKEWPLF
SW
>S3662 yjaH, hypothetical protein
MNSFNEGVVSPLLSFWRRSLMLAGALFLTACSHNSSLPPFTASGFAEDQG
AVRIWRKDSGDNVHLLAVFSPWRSGDTTTREYRWQGDNLTLININVYSKP
PVNIRARFDDRGDLSFMQRESDGEKQQLSNDQIDLYRYRADQIRQISDAL
RQGRVVLRQGRWHAMEQTVTTCEGQTIKPDLDSQAIAHIERRQSRSSVDV
SVAWLEAPEGSQLLLVANSDFCRWQPNEKTF
>S3552 yjbE, hypothetical protein
MKKVLYGIFAISALAATSAWAAPVQVGEAAESAATSVSAGSSSATSVSTV
SSAVGVALAATGGGDGSNTGTTTTTTTSTQ
>S3553 yjbF, hypothetical protein
MVILPLWRRVVKRPALILICLLLQACSATTKELGNSLWDSLFGTPGVQLT
DDDIQNMPYASQYMQLNGGPQLFVVLAFAEDGQQKWVTQDQATLVTQHGR
LVKTLLGGDNLIEVNNLAADPLIKPAQIVDGATWTRTMGWTEYQQVRYAT
ARSVFKWDGTDTVKVGSDETLVRVLDEEVSTDQARWHNRYWIDSEGQIRQ
SEQYLGADYFPVKTTLIKAAKQ
>S3554 yjbG, hypothetical protein
MIKQTIVALLLSVGASSVFAAGTVKVFSNGSSEAKTLTGAEHLIDLVGQP
RLANSWWPGAVISEELATAAALRQQQALLTRLAEQGADSSTDDAAAINAL
RQQIQALKVTGRQKINLDPDIVRVAERGNPPLQGNYTLWVGPPPSTVTLF
GLISHPGNQPFTPGRDVASYLSDQSLLSGADRSYAWVVYPDGRTQKAPVA
YWNKRHVEPMPGSIIYVGLADSVWSETPDALNADILQTLTQRIPQ
>S3555 yjbH, hypothetical protein
MKKRHLLSLLALGISTACYGETYPAPIGPSQSDFGGVGLLQTPTARMARE
GELSLNYRDNDQYRYYSASVQLFPWLETTLRYTDVRTRQYSSVEAFSGDQ
TYKDKAFDLKLRLWEESYWLPQVAVGARDIGGTGLFDAEYLVASKAWGPF
DFTLGLGWGYLGTSGNVKNPLCSASDKYCYRDNSYKQAGSIDGSQMFHGP
ASLFGGVEYQTPWQPLRLKLEYEGNNYQQDFAGKLEQKSKFNVGAIYRVT
DWADVNLSYERGNTFMFGVTLRTNFNDLRPSYIDNARPQYQPQPQDAILQ
HSVVANQLTLLKYNAGLADPQIQAKGDTLYVTGEQVKYRDSREGIIRANR
IVMNDLPDGIKTIRITENRLNMPQVTTETDVASLKNHLAGEPLGHETTLA
HKRVEPVVPKSTEQGWYIDKSRFDFHIDPVLNQSVGGPENFYMYQLGVMG
TADLWLTDHLLTTGSLFANLANNYDKFNYTNPPQDSHLPRVRTHVREYVQ
NDVYVNNLQANYFQHLGNGFYGQVYGGYLETMFGGAGAEVLYRPLDSNWA
FGLDANYVKQRDWRSAKDMMKFTDYSVKTGHLTAYWTPSFAQDVLVKASV
GQYLAGDKGGTLEIAKRFDSGVVVGGYATITNVSKEEYGEGDFTKGVYVS
VPLDLFSSGPTRSRAAIGWTPLTRDGGQQLGRKFQLYDMTSDRSVNFR
>S3617 yjcZ, hypothetical protein
MLESVHPRFLVDLAQGDDARHPQAHQQQFRERLMQELLSRVQLQTWTNGG
MLNAPLSLRLTMVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPE
LPQQIAALYEWFSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNA
WSLPGRCFIVLEELRWGAFGDACRLGSPQAVVLLLGDLREKATQHLAESI
NAAPTTRHYYHQWFASSTVPTGGDHADFLSWLGKWTTADKQPVCWSVTQR
WQTVALGMPRLCSAQRLVGAMVEEIFSVNLA
>S4565 yjeI, hypothetical protein
MASSSLIMGNNMHVKYLAGIVGAALLMAGCSSSNELSAAGQSVRIVDEQP
GAECQLIGTATGKQSNWLSGQHGEEGGSMRGAANDLRNQAAAMGGNVIYG
ISSPSQGMLSSFVPTDSQIIGQVYKCPN
>S4528 yjfA, hypothetical protein
MITYPCLTSRRFQLALIHRRVVDKRTSMHSHTASESTGARIHRPWCARHQ
VRPAWRCQYDKLHRVPFRSPELRLDSGPGYTTGSYRY
>S4607 yjfM, hypothetical protein
MARKRKSRNNSKIGHGAISRIGRPNNPFEPRRNRYAQKYLTLALMGGAAF
FVLKGCSDSSDVDNDGDGTFYATVQDCIDDGNNADICARGWNNAKAAFYA
DVPKNMTQQNCQSKYENCYYDNVEQSWIPVVSGFLLSRVIRKDRDEPFVY
NSGGSSFASRPVWRNTSGDYSWRFGSGKKESYSSGGFTTRKASTVSRGGY
GRSSSARGHWGG
>S4613 yjfN, hypothetical protein
MGKRKMELTMKQLLASPSLQLVTYPASATAQSAEFASADCVTGLNEIGQI
SVSNISGDPQDVERIVALKADEQGASWYRIITMYEDQQPDNWRVQAILYA
>S4614 yjfO, hypothetical protein
MWVTLSSRFHSKIAYQVTIREHSPKVKLIRHYTMVSRKRNSVIYRFASLL
LVLMLSACSALQGTPQPAPPVTDHPQEIRRDQTQGLQRIGSVSTMVRGSP
DDALAEIRAKAVAAKADYYVVVMVDETIVTGQWYSQAILYRK
>S4624 yjfY, hypothetical protein
MFSRVLALLAVLLLSANTWAAIEINNHQARNMDDVQSLGVIYINHNFATE
SEARQALNEETDAQGATYYHVILMREPGSNGNMHASADIYR
>S4506 yjgG, hypothetical protein
MSNTLNHTSSRQIVRHYTHSQKRCKHLMQYFVSANGLFELKVKIYAFLFS
MILEGKCRSVSIIADISCFFLFHFHAIRNAFYSIHPTYRAEGENERLTLL
LIAQGYALSL
>S4468 yjhA, hypothetical protein
MESNTWNTIHDNKKENAALNDVQVEVNYAIKLDDQWTVRPGMLTHFSSNG
TRYGPYVKLSWDATKDLNFGIRYRYDWKAYRQQDLSGDMSRDNVHRWDGY
VTYHINSDFTFAWQTTLYSKQNDYRYANHKKWATENAFVLQYHMTPDITP
YIEYDYLDRQGVYNGRDNLSENSYRIGVSFKL
>S4470 yjhS, hypothetical protein
MQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSA
FTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPQNKFLG
VCWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFC
GDTTWYWKENFPHSYEAIYGNYQNNVLANIIFVDFQQQGERGLTNAPDED
PDDLSTGYYGSAYRSPENWTTALRSSHFSTAARRGIISDRFVEAILQFWR
ER
>S4448 yjiD, hypothetical protein
MRKMMRQSLQAVLPEISGNKTSLLRKSVCSDLLTLFNSPHSALPSLLVSG
MPEWQVHNPSDKHLQSWYCRQLRSALLFHEPRIAALQVNLKEAYCHTLAI
SLEIMLYHDDEPLTFDLVWDNGGWRSATLENVS
>S4683 yjjI, hypothetical protein
MPTSHENALQQRCQQIVTSPVLSPEQKRHFLALEAENNLPYPQLPAEARR
ALDEGVICDMFEGHAPYKPRYVLPDYARFLANGSEWLELEGAKDLDDALS
LLTILYHHVPSVTSMPVYLGQLDALLQPYVRILTQNEIDVRIKRFWRYLD
RTLPDAFMHANIGPSDSPITRAILRADAQVSPNLTFIYDPEITPDDLLLE
VAKNICECSKPHIANGPVHDKIFTKGGYGIVSCYNSLPLAGGGSTLVRLN
LKAIAERSESLDDFFTRTLPHYCQQQIAIIDARCEFLYQQSHFFENSFLV
KEGLINPERFVPMFGMYGLAEAVNLLCEKEGIAARYGKEAAANEVGYRIS
AQLAEFVANTPVKYGWQKRAMLHAQSGISSDIGTTPGARLPYGDEPDPIT
HLQTVAPHHAYYYSGISDILTLDETIKRNPQALVQLCLGAFKAGMREFTA
NVSGNDLVRVTGYMVRLSDLEKYRAEGSRTNTTWLGEEAARNTRILERQP
RVISHEQQMRFSQ
>S4705 yjjY, hypothetical protein
MTKVRNCVLDALSINVNNIISLVVGTFPQDPTVSKTAVILTILTAT
>S0291 ykfE, hypothetical protein
MGRISSGGMMFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQM
VQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWS
EKSNQMTGLFSAIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENH
PDGFNFK
>S0277 ykgH, hypothetical protein
MSEQIKQDIDLIEILFHLKKKIRVILFIMAICMAMVLLFLYINKDNTKVI
YSLKINQTTPGILVSCDSNNNFACQTTMTEDVIQRITTFFQTSPDIKNRE
IRLEWSGDKRDLPTAEEEISRVQASIIKWYASEYHNGRQVLDEIQTPSAI
NSELYTKMIYLTRNWSLYPNGDGCVTISSPEIKNKYPAAICLALGFFLSI
VISVMFCLVKKMVDEYQQNSGQ
>S0410 ylaC, hypothetical protein
MTEIQRLLTETIESLNTREKRDNKPRFSISFIRKHPGLFIGMYVAFFATL
AVMLQSETLSGSVWLLVVLFILLNGFFFFDVYPRYRYEDIDVLDFRVCYN
GEWYNTRFVPAALVEAILNSPRVADVHKEQLQKMIVRKGELSFYDIFTLA
RAESTS
>S1053 ymcA, hypothetical protein
MKKNSYLLSCLAIAVSSACHAEVLTYPDPLGSSQSDFGGTGLLQMPNARI
APEGEFSVNYRDNDQYRFYSTSVALFPWLEGTIRYTDVRTRKYSQWEDFS
GDQSYKDKSFDFKLRLWEEGYWLPQVAFGKRDIAGTGLFDGEYLVASKQA
GPFDFTLGMAWGYAGNAGNITNPFCRVSDKYCHRAESHDAGDISFSDIFR
GPASIFGGIEYQTPWNPLRLKLEYDGNNYQNDFAGKLPQASHFNVGAVYR
AASWADLNLSYERGNTLMFGFTLRTNFNDLRPALRDTPKPAWQPAPESEG
LQYTTVANQLTALKYNAGFDAPEIQLRDKTLYMSGQQYKYRDSREAVDRA
NRILVNNLPQGVEKISVTQKREHMAMVTTETDVASLRKQLAGTAPGKSEQ
LQQQRVEAEDLSAFGRCYRIREDRFSYSFNPTLSQSLGGPEDFYMFQLGL
MSSARYWFTDHLLLDDGIFTNIYNNYDKFKSSLLPADSTLPRVRTHIRDY
VRNDVYLNNLQANYFADLGNGFYGQVYGGYLETMYAGVGSELLYRPLDAS
WALGVDVNYVKQRDWDNMMRFTDYSTPTGFVTAYWNPPTLNGVLMKLSVG
QYLAKDKGATIDVAKLFDSGVAVGVWAAISNVSKDDYGEGGFSKGFYISI
PFDLMTIGPNRNRAVVSWTPLTRDGGQMLSRKYQLDPMTAEREVPVGQ
>S1054 ymcB, hypothetical protein
MNKLQSYFIASVLYVMTPHAFAQGTVTIYLPSEQQTLSVGPVENVVQLVT
QPQLRDRLWWPGALLTDSAAKAKALKDYQHVMAQLASWEAEADDDVAATI
KSVRQQLLNLNITGRLPVKLDPDFVRVDENSNPPLVGDYTLYTVQRPVTI
TLLGAVSGAGQLPWQAGRSVTDYLQDHPRLAGADKNNVMVITPEGETVVA
PVALWNKRHVEPPPGSQLWLGFSAHVLPEKYADLNDQIVSVLTQRVPD
>S1055 ymcC, putative regulator
MRPLILSIFALFLAGCTHSQQSMVDTFRASLFDNQDITVAAQQIQALPYS
TMYLRLNEGQRIFVVLGYIEQEQSKWLSQDNAMLVTHNGRLLKTVKLNNN
LLEVTNPGRAPLRNALAIKDGSRWTRDILWSEDNHFRSATLSSTFSFAGL
ETLNIAGHNVLCNVWQEEVTSTLPEKQWQNTFWVDSATGQVRQSRQMLGA
GVIPVEMTFLKPAP
>S1056 ymcD, hypothetical protein
MNKGKVMKHKLSAILMAFMLTTPAAFAAPEAANGTEATTGTTGTTTTTTG
ATTTAATTSGVAAGAVGTATVVGVATAVGVATLAVVAANDSGDGGSHNTS
TTTSTTR
>S1061 ymcE, suppressor of fabA and ts growth mutation
MRRWISQNNIRLPRGAFFISALFFFNAVCIVSDNLLIIESFGKMAYNISY
LTRVPGTNTLLACCCLSRPDEVNSEY
>S1117 ymdD, hypothetical protein
MNPVPAQREYFLDSIRAWLMLLGIPFHISLIYSSHTWHVNSAEPSLWLTL
FNDFIHSFRMQVFFVISGYFSYMLFLRYPLKKWWKVRVERVGIPMLTAIP
LLTLPQFIMLQYVKGKAESWPGLSLYDKYNTLAWELISHLWFLLVLVVMT
TLCVWIFKRIRNNLENSDKTNKKFSMVKLSVIFLCLGIGYAVIRRTIFIV
YPPILSNGMFNFIVMQTLFYLPFFILGALAFIFPHLKALFTTPSRGCTLA
AALAFVAYLLNQRYGSGDAWMYETESVITMVLGLWMVNVVFSFGHRLLNF
QSARVTYFVNASLFIYLVHHPLTLFFGAYITPHITSNWLGFLCGLIFVVG
IAIILYEIHLRIPLLKFLFSGKPVVKRENDKAPAR
>S1457 ynbE, hypothetical protein
MKILLAALTSSFMLVGCTPRIEVAAPKEPITINMNVKIEHEIIIKADKDV
EELLETRSDLF
>S1722 ynfC, hypothetical protein
MIENHLYSLVTVVKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFD
FDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVV
ALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASS
KQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLN
NQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY
>S2349 yohC, hypothetical protein
MSHVWGLFSHPDRKMQVINRENETISHHYTHHVLLMAAIPVICAFIGTTQ
IGWNFGDGTILKLSWFTGLALAVLFYGVMLAGVAVMGRVIWWMARNYPQR
PSLAHCMVFAGYVATPLFLSGLVALYPLVWLCALVGTVALFYTGYLLYLG
IPSFLNINKEEGLSFSSSTLAIGVLVLEVLLTLTVILWGYGYRLF
>S3134 yqgB, hypothetical protein
MLNNVMKKKPVAQSERQHTLLENPCAYGLLSQFQAATVVNCFTLNKII
>S3135 yqgC, hypothetical protein
MGITSAGMQSRDAECGERVFTRTVRQVKQQTTVHYFVSPPRPPVKTNPQA
KTLISTRLEVATRKKRRVLFI
>S3136 yqgD, hypothetical protein
MVADPTTTLQVKNTGSLSVNRYGWINIWMAILGQFFTRFPLFFESCLILL
KTWLEIFPDNAGILRIYLLQFSAIVGYKTRRAA
>S3261 yqhG, hypothetical protein
MKIILLFLAALASFTVHAQPPSQTVEQTVRQIYQNYKSDASTPYFGETGE
RAITSARIQQALTLNDNLTLPGNIGWLDYDPVCDCQDFGDLVLESVAITQ
TDVDHADAVVRFRIFKDDKEKTTQTLKMVAENGRWVIDDIVSNHGSVLQA
VNSENEKTLAALASLQKEQPESFVAELFEHIADYSWPWTWVVSDSYRQAV
NAFYKTTFKTANNPDEDMQIERQFIYDNPICFGEESLFSRVDEIRVLEKT
ADSARIHVRFTLTNGNNEEQELVLQRREGKWEIADFIRPNSGSLLKQIEA
KTAARLKQ
>S3347 yqjB, hypothetical protein
MSLRQLAWSGTVLLLVGTLLLAWSAVRQQESTLAIRAVHQGTTMPDGFSI
WHHLDAHGIPFKSITPKNDTLLITFDSSDQSAAAKAVLDRTLPHGYIIAQ
QDNNSQAMQWLTRLRDNSHRFG
>S3350 yqjE, hypothetical protein
MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQL
LLMLGLTMLFAAFGLMSLMVLIIWAVDPQYRLNAMIATTVVLLLLALIGG
IWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ
>S3465 yrbL, hypothetical protein
MIRLSEQSPLGTGRHRKCYAHPEDAQRCIKIVYHRGDGGDKEIRRELKYY
AHLGRRLKDWSGIPRYHGTVETDCGTGYVYDVIADFDGKPSITLTEFAEQ
CRYEEDIAQLRQLLKQLKRYLQDNRIVTMSLKPQNILCHRISESEVIPVV
CDNIGESTLIPLATWSKWCCLRKQERLWKRFIAQPALAIALQKDLQPRES
KTLALTSREA
>S3536 yrdB, hypothetical protein
MNQAIQFPDREEWDENKKCVCFPALVNGMQLTCAISGESLAYRFTGDTPE
QWLASFRQHRWDLEEEAENLIQEQSEDDQGWVWLP
>S4351 yrfB, hypothetical protein
MNMFFDWWFATSPRLRQFCWAFWLLMLVTLIFLSSTHHEERDALIRLRAS
HHQQWAALYRLVDTTPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELA
LKTLWEAVPSAFTRLAERNVSVSRFSLSVEGDDLLFTLQLEMPHEG
>S4349 yrfD, hypothetical protein
MTGDKEIITMTFKIWQIGLHLQQQEAAAVAIVRGAKECFLQRWWRLPLAH
DIIKDGRIVDAQQLAKTLLPWSRELPQRHHIMLAFPASRTLQRSFPRPSM
SLGEREQTAWLSGTMARELDMDPDSLRFDYSEDSLSPAYNVTAAQSKELA
TLLTLAERLRVHVSAITPDASALQRFLPFLPSHQQCLAWRDNEQWLWATR
YSWGRKLAVGMTSAKELAAALSVDPDSVAICGEGGFDPWEAVSVRQPPLP
PPGGDFAIALGLALGKAY
>S4346 yrfF, putative dehydrogenase
MSTIVIFLAALLACSLLAGWLIKVRSRRRQLPWTNAFADAQTRKLTPEER
SAVENYLESLTQVLQVPGPTEASAAPISLALNAESNNVMMLTHAITRYGI
STDDPNKWRYYLDSVEVHLPPFWEQYINDENTVELIHTDSLPLVISLNGH
TLQEYMQETRGYALQPVPSTQASIRGEESEQIELLNIRKETHEEYALSRP
RGLREALLIVASFLMFFFCLITPDVFVPWLAGGALLLLGAGLWGLFAPPA
KSSLREIHCLRGTPRRWGLFGENDQEQINNISLGIIDLVYPAHWQPYIAQ
DLGQQTDIDIYLDRHVVRQGRYLSLHDEVKNFPLQHWLRSTIIAAGSLLV
LFMLLFWIPLDMPLKFTLSWMKGAQTIEATSVKQLADAGVRVGDTLRISG
TGMCNIRTSGTWSAKTNSPFLPFDCSQIIWNDARSLPLPESELVNKATAL
TEAVNRQLHPKPEDESRVSASLRSAIQKSGMVLLDDFGDIVLKTADLCSA
KDDCVRLKNALVNLGNSKDWDALVKRANAGKLDGVNVLLRPVSAESLDNL
VATSTAPFITHETARAAQSLNSPAPGGFLIVSDEGSDFVDQPWPSASLYD
YPPQEQWNAFQKLAQMLMHTPFNAEGIVTKIFTDANGTQHIGLHPIPDRS
GLWRYLSTTLLLLTMLGSAIYNGVQAWRRYQRHRTRMMEIQAYYESCLNP
QLITPSESLIE