TitleGenColors Logo

Gene list

Applied filters:

COG category: Cell motility
Gene type: CDS
Genomic element: chromosome

Number of genes found: 48

Free access
Sort by:

 



# Methylococcus capsulatus str. Bath, Bath

>MCA1117 general secretion pathway protein H
MLLALCIGAILMAVAVPNVMPAIEAAQLGSAARDVASGLRFARGQALSRR
QEVRFTLDVPGHRYRVSTRPKVFGLPAGVGLRLFTAAGEVEGEGVGSIVF
FPDGSSTGGRVTLEAAGRKRLIDVNWLTGRISSSAEEADDS
>MCA2061 conserved hypothetical protein
MARRSLTLIGILVSPGSHALGVGELRLQSALNQTLKAEVPLVVSDEKLED
IKVMLASPEAFAQAGIERQQFLSSLRFQAERRADGSGVIRISSREPIREP
FLSFLMEVDWPDGRVVKEFTVLVDPPTPVLPDLGSPGRSERASQYPREPV
PRSLHAADHPAFPTAGRAGSAGEMSEFGPVRRRDTLSSIAKAVNANGDLT
PDQVKVGIYRANPDAFVGGNVDALRQGAVLNIPPHSVLAGLQPADAAREW
RELRRAARRGGSEDAPVSPPATSPPMPLSEGGEAAAHGSRLKLLAPGGKG
QVGGAGAREDIALELAESLRQENEEIRARLGALEQQLSALQKLLESQEQQ
IAAMQAVAPAAGAGTGVLPAATSPQADATPSPEPSPVAETGAAAPVTVVI
QPPAEATAPSEDSPANGSWFWGLGVGAITLAAALAWWANRKRWLFDLGLG
LPSSLDRLMTGSKESAATRTVSAAGVRSLEAYGNMVKGVDAVEPVDPIAE
ADAFLANGKHVQAEQLMRAAVAAHPERDEFHLKLLEVLYLEGKYQAFEEL
AEELSGWRETRPELWGEVSRLSLKLRLKSPVPADAPDGFPDAASVPALPP
LAGETAPEPPPPPAAAEQVPPEAVEAELSAGAAHGEEATTEDIPPLEFDL
AGLEPAARGGIVKEEPEIVHDAGNLIAYEPEPYAGSAGPRDESLEALLAE
LDGLGENAERSPAQETSALLSEPPLTIEIEAVPVQRSGGGAGEPGSSAAG
LTREDTDPYGDITDMDPLETKLDLSKAYVDMGDSGSARELLEEILAAGSE
RQRAEARVLMERLREGQSR
>MCA0033 DNA-damage-inducible protein D
MPKKTDIAIAGPSFEDLKQTNAHGAEYWSARELQPLLGYSQWRRFEQAIE
RAITSCKESGNSPEHHFAGAGKPITGGKGAVQVVDDYHLSRFACYLIAQN
GDPRKPEIAQAQKYFAIQTRRQELSDQAAADLERLELRKQTAEEFKALSG
AAQQAGVQSKMFGVFHDAGYKGLYGGLGRDAIKQRKAIPDKDNLLDRMNA
TELAANQFRMTQTRDKLARDGVRSQAQAIQTHEQVGREVRAAIQRIGGTL
PEDIPAAEHIKEVEKRLKSATPRLTLDERDAGGLTGGKPDKNETP
>MCA1563 conserved hypothetical protein
MTLNRKTLSGSALAVLAVLFVSLTLISNALFRGVRLDLTENSLYTLSDGS
RNILAKLDEPIHLRLYFSDKATAESNRPDVRGLRLYFDRVREMLEEMRSR
AGGKIELEVIDPLPYSEAEDRAAAAGLQGIPLGPSGEKVFLGLEGSNSTD
GRATIPFFEPGKEAFLEYDLVKLVHDLATPKKPVVGLVSSLPLAGGFDPQ
RGDMGAPWTVYEQLSQSFDVRSLDADSLKAVAPEIEVLVVVHPKRLGEDA
QYALDQFVLRGGRLMVFVDPHAENDGVGDPRDPMAAMVTPKASDLPVLFK
AWGVEYKPDEAVLDRARAVTVNTAPGAPPSRHPAILRLGKSDLNAADVIT
ANLDTIHLASVGHFRPTTDGGSRLTPLLQTSDQAMVVPADRIRFLGDPSS
LLQDYKPGGERLIVAGRLEGKFKTAFPQRGDPGHLAEAKGTGEILLIADT
DLLSDRLWVRSSNFFGRKLLTTFAGNGDLFINAVDNLAGSSDLISIRGRG
TSARPFTKVEELKAAADDRFRSKERELQQELGETERKLAELQRGKTGDDT
FVLSPAQRKELDDFLRRKLEIRKELREVRRQLDADIDALGARLKFIDIAL
VPLLLTLGTLGFIAWNARREKA
>MCA1118 general secretory pathway protein I
MAIALGVLLRVFGGATRSARIAEEYSRAVIAAESVLDDVGVETPLAPGVT
EGRFGDEFRWILRIAPVPIPALEQQVRPDNAPLAGSPLAGLKLFAVDASV
VWGEGEEPREITLSTLRLFNENPSGVPGRLPGGPFGPRM
>MCA0520 hypothetical protein
MVATMRKSGSGSGNYLTLTGLLSPIIRLIGGKFQKIYFQMVESAGRHKRD
ILVARVDSARGSLEEAKEQFQTALERFSALTRFDGGTLEDVYRQLKIEFD
YSASKALAVRDRIDAVQDVAEALFREWEDELDQYTNRSLRAASRQKLKLT
QQHYAQLMSAMRRAESKIDPVLRLFRDQVLFLKHNLNAQAIASLEGELAD
MSQGVAGLIQAMERSIDHAEVFMRTLSGPKALPAGL
>MCA0089 pilin-like protein
MGFSTVELLISMVLGLLVVGAIGSLFVQHKTNYRQNEQFALMQENGRFTL
NLLASDLALAGFWGGADCNDLSRCPSASAISVQNDCGTNWTTTTTSSSPA
THYLMAPSADSIPADWPCFNSNDYLKPGTNLLALRHAEGKPVYCQTGEDK
RNDRFAGLIYLRTSGSNGNLVKLPVSTATPPEPPDCPVSSSSAAYWRYVV
HVYYISRGYTARQCASKPTDPRCAPRLIRKTLGRQTGNPIIFESDDGGEI
AEGIEYFHIEFGIDDGDGNMTDESCDMDGIPERYLSTREDYAAITPDELD
CAISARVHVLVRSMEPDSSYTDDKTYTFGSVTLGPFGDHFRRKVFTTTVQ
LRNQQYH
>MCA0828 CheW domain protein
MGLRVPWGREGGPTKYRRRSGGRSRHAEKCAGRYDLWPMLLVPFQIGGEH
YALPAADVFTVTQVPRLRPLPQAPAWVAGVFRYRGAVLPAVDLTLLIAGR
PSRRLLSTRLLLVGHARGGEVGPALGLLAERVTGTETVDRDALRETGVGI
PGAEWLGEVAGLEQALLQLIRWQPLATEELARLCAEVERCG
>MCA0831 chemotaxis protein
MSDFALLDLFREEVEAHTGVLTSGLIALETGPADSGRIEPLMRAAHSLKG
AARVLGLDAAVGIAHAMEDALLAAKEGRLALTPARVDVLLEGVDWIVVLA
QVDEPDLPAWLATQADAAENLERRLRHLDQVAATAGGISVTAAATPVPAP
TAGSKGGGAEAALSDAETEAAAVPAPGPESAAAGAPDRAVKISAEVLNRI
ISLASESLVDTHRMEDTLDGLEATRRGLDRLAAVLAKEGPSGSALRLLEE
IRGRLGRTLEALDAHLRRAASTAESQFREVVAGRMRPFEEGARSFPRQVR
DLARELGKQVHFEIGGGATLVDRDILAKLEAPLGHLLRNALDHGLEQPEE
RRAAGKPERGSLRLEARHHAGLLLITVSDDGRGVDRERLRRKIVERNLTT
AALAMQMTDTELFDFLFLPGLSTRDVTSRVSGRGVGLDVVQSMVHAEGGS
VSVESRSGRGTVFRLQLPVTRSVVRALLVRLGGEPYAFPLARIGRLVALD
PAELKAIEGRSYFIADGANIALVAARELLGMDPPDAVADRVPAVVLTNQG
ESYAIEVDGFLGESELVVRPMDARFGKVPHVSAVSVAEDGMPVLILDVDD
LFRSIASMLSGGLAVGRRNRAAPERRRVRRILVVDDSITVRELERRLLEN
HGYEVDVAVDGMEGWNALALGRYDLVVSDIDMPRMNGIELVRRIRADARF
ERLPVMIVSYKDREEDRMAGLEAGADHYLAKSGFRDDTLIRAVQELIGEA
TV
>MCA1121 general secretion pathway protein L
MVDFSKPIELDVQAFLRWWAGELAFLVPDHFRRLLGGRASWIFLTWRGET
LDAVHRTANGARPLGSFTLDDVGREAWRRLLEAEPELAESRTVLRLLPDQ
CMTRVVKLPLATEENLLQVIAFELDRLTPFKPDQVYFGARLIERLKAAGQ
IRVSLAVVPREKLDLMLESMIAAGWRPDYVDVSEDPFKRSHQLLPERFRV
KKSRWNRWLNIGLGGLAVGLILALLIVPVWKKSQWVEQLEVEVRKAGKVA
KEVEALRQEAEQMVHEMGYLAQKKRKDPIVLDVLNELSKVMPDDTWLNGL
QYKEGHLVVQGQSPSASSLIARMETSEEFKNTSFVSPVTKDVSNGLERFQ
IASDAVNGRSSEASAQSENPGQ
>MCA0452 HAMP domain protein
MTSIRVVFPLIFALQIFVAVGLTGAIAYVSHMREAQDTAKEILELIGNMI
DDQLYRFLSQPIDLTQVYADSLRGRPDLPLDDLLEPRLVKKLSNGVWTQA
QMTRWANLSIANSAGQNLRFDRREYGKKVVKVSDIRRGGGVEWRLLENYG
KGGMPLEVQEAAYDPRTEAYYRDALAKGGLVVSPIHFSPLLQGSAPVVTV
AEPVYDAHGRPRAVVSSDIYLAGIGRYLQNLHLPNSSIAFLFDAEGHLIA
GSRGSFPGPTAGRLPLATASSDPIIRATAGYIAERYGFLAVPDAESFRYV
RGEQHNYVHTDHLLTDRQFAGLGLDWRLAVVIEESDLIENLITGIHSAAW
LSLGLLVLAVAVGSWTAAGMIRPILTLGKVAAALEKDELDARHLDVRQLE
RDTRRRNEFGELAHVFLRMVQEVRARHDLLEAQLEQLRVNIDEADTQAKI
RQIADTEFFADLKAKATRIREERKRAAEPSAQAFAEHDP
>MCA0087 conserved domain protein
MKIQPAIMVLSIAAPFCAGQADDTDIYFAGGGSLGGKPLVMFSLDYRSNL
GALVCQGTGEDDEHDDHRYDHQEDDDDEHGDDCHSLRTTVSPSSGTTYLP
PSGDVTHFDLIRGALRKAMDDLGTELATVRVGLMINHADSTTGQCGAGPS
GQCSNGAYVLRGFSTDKARFHDALARIPVPQGNLAHDFQGKELYFELFRY
LTGQAIFNGHKGWQDYGDSNRNDNLDVDNPGTSFDDRLLKWDSGIENGTN
YVSPLVDNCSKVFMINFMFQVSNKDNDSDAAIRQSKDSGGMNGLNPGTSN
SAFANVIRWMRDADLADGTYGSAGNLEDVQNVTSYFFVAPTNINTTTNGY
AVAGGTNNALPLSTDPTVLIASLKEVFKEVLSVSTTFVSAASPVNVFNRT
ESLNHAFFSIFQPEQAPRWNGNLKRLKLASDPTTHRLQVLDASSPPIEAI
SPIDGRIKHEALTYWTSASGFDVQTYADTGKGEVVGKDGRSVGRGGGGQR
IPGFLTNSPGDSNSTSGARKLYTEPDSLSNGTATALRALNADSTTAAALW
NTLRLNGAKSGEVWSDAASYGTASSDDQATAVKLLKFARGQDANDEDGDG
NYTEAHPWASPANQAKRKWLMADPLHSAAVPLNYGARSGYSRDVQDVRIL
VSGNDGFLHMFRNSASGGTTTETPSGVEDWAFIPRRLLGNLKPLSDNGAA
TPHVYGLDGTAAVYVNDMNRDGTVNGADQVIAFIGMRRGGKGYYALDITD
PDTPRMLWTIEKNDASGNFNELGYTFSDPTIAQLDWGEGRKPVLIFGGGY
GTRKDEASAANARNAATGTDDTEGNAIYIVNAKTGALVWKAVKGSGASSA
TTFQHAGLADSIPSKVTAVDTDDNGMVDRLYVGDTGGVLWRADLAYVQPD
GTTSYNEPAHWTLTPVLSVGRHAGQPDRRFFHRPDFVPSPAGSSRFDAVL
IGSGDREHPLDTNVQNQFYMFKDTYVASGSPPTGPVKTLADLDNLTSSTT
VHPDKSGWYIDIGSSASGEKVLSSPLTYNGSVYFSTYAPQGGSVSGQCGP
NEGESLTYVIRLADAAGVFNFNAATPANERFTVTGRGLPSDPVTVAVNGK
LYVTAGNLPANPDLSNPVGNSGINRLYWYEKE
>MCA1674 putative methyl-accepting chemotaxis protein
MRLRFTVRARLLLACVLPLSLAMVSMLGVVICNYYRDLVRESELRLRAEV
NEAGLRIDLANVTALTVPKTMALAQEAGLFGNRLVSLEFARRILEAFPEF
TGAYFGYEPDGDGHDRTASQTGEVPSDAVSREGRFLPYWFRDREDPLRIR
LSPLVNTESSYYYRGVKNRHEGKKEEEGVSLSGGISRLYRNSGNGAQGQR
FTMVTEPYDYEGKYMVEHAVPLLRDGRFLGIAGVDKALNDIDDTLRKMHH
HESGGFFLLSQRGRVIASTVDPSARARPIEDTRWLAVLEPAYLDTGKGTF
QLAPDPRDGAETYFVSTRLATGDWKLVLVASREEILAGLSDTIKGAVGIS
AFVLTSVLGLAFLALRTVSRRIEDAARVAARVAAGDLGSRLEVADSDEIG
DLLRAMEAMRQALADLIGRVKRSSEQLVALGDEITTAARHDESLAHEFGA
LTQQVAASVREISTTGDEMVRNVADVAGATAETAEVASAGRACLAETETA
MRRMSERMSSVVERFAAIGEQAKGINRIMNAMTDIAVQTNLLSLNASIEA
EKAGKQGAGFAVVASEIRRLADQSAVAALDVEAMVRDMQRAVAGGIQVLE
SFQEDMSTAGLSNVRRLADHFTAVLERIEPLVPGFGQVYEGMRAQSEGAR
QINDAMVALKDEAQVAAASSKHLSSMVVKLHTAVQELQAEISKFRL
>MCA0562 conserved hypothetical protein
MFKTWVVAAESSRARIFAIDNRLNPLKEIDDLVHAEGRAKEQDLTSDRPG
RMLNSSTEGRHDYEKPQDAKQHEAQVFAKRVADRLEQARVNGDCLSLILI
AAPEFLGLLRQALSGPTAKLVSKTLDKNLVQKTEQEIRSYIFS
>MCA2280 hypothetical protein
MPTHVRISFQLTRFVMKTAIKILALAIALAAQPVFAGRDPGERLEHMKKE
LKLTPEQTEQVRKILEEFEPQRKALHEQKRALHEQVRSRLKAVLSKEQGE
KLDKMAEERRARHHRGWDTEPGKSPSP
>MCA1114 general secretion pathway protein E
MTQAVSRKTEAEPLVPAELLQDPDGYGRFADALLRRGKVREMDLARAKRL
AAQADELRLPALLVKLGVVSERDVAETLAEASGLPLIGPTDYPDVSPLPE
GIASRFLKDRHAVGIAARADGFVVAVEDPFDAELIHALGLACGQPVHPVV
GLASEIDRALEQQIGSGRSVMGQIVENLGGDEDADEADVEHLKDLASEAP
VIRMVNLIMQRAVESRASDIHVEPFEQTLKVRFRIDGVLRDVEAPPVRST
AAVISRIKIMAKLNIAERRLPQDGRIKLQVQGKELDLRVSTVPTMYGESV
VIRLLHKESITFDFGTLGFDGSVLRRFLEILELPHGIILITGPTGSGKST
TLYTALHKINTPSRKIITVEDPVEYQLEGVNQIQVKPQIGLNFASALRSI
MRQDPDVIMIGEMRDLETARIAVQSALTGHLVLSTLHTNDAAGGVTRLLD
MGLEDYLITSTVNGILGQRLVRRLCQSCREPHPALEEVAEEMGLRRFQRD
GEVVLYRPVGCEQCGGTGFRGRLAILEFLVMSDEVRRLVMSHAQARQIEE
VALREGMHTMYDDGVRKALMGLTTVEEVLRVTSDS
>MCA0829 methyltransferase, CheR family
MRLNGDVGWLWDWLAAAGLEPRALGEAAVLCALKRRLDAAGCTAAAYPAR
LAADAEERARLLDAVLVPETWFFREPPAFEALAESAARHRLQRRSVPFHV
LSLGCSTGEEPWSIVIALREAGMRKGDFRVEALDISARAIEAARAGIYGG
RSFRSPGDGTWRSRYFDELGDGRYRVKDGLRGDVEYRVAHLGDPGWGGGR
RYQAVFCRNVLIYLRAELRARIMDQCRDALDPGGLLVLGHADGIGGPDRG
FRRHGVAGAFSWIRRDAAEAEPPPARSSRPLAAKAVRRDAPGEAKPSGPD
LLSGGRETGRAAPFDEGAVLGTARALADGGNYQAAERLCQSHLASHPHDP
EVHALLGIVMSAANRDDEALRYFRQALYLAPSHNESLLHLAALYERRGDE
ERARHFRNRSAAAEGEP
>MCA3076 conserved hypothetical protein
MALVAWLSTRYSAQFDWTAAHRHTLSEASVKVLDMLKDPVRVTAYARETK
SLRDQISDQVGRYSRRKKDLSLNFVNPDTQPDKVRELGISVDGELVIEYQ
GRSENVQEANETTITNALQRLARAKDQRIVFLEGHGERSPTERANHDLGQ
FGDELGRKGLTVSTVNLGVTPKIPDNTDVLVIAGPNANLLPGEVSLIGDW
VKQGGNLLWMIDSDNLKGLGPLAQQLGIQVLRGTVVDASTQLFGIDDPTF
ALVAEYPPHAITYGFQTMTLFPSALALDAVQDNHDFEREALLNTLSRSWT
ETGPIEGKIQFDADKGERQGPLHIAYVLTRELKPEPKGEEKKEGAEAGKP
REQRIVVIGDGDFLSNAFLGNGGNLNLGLNMINWLSQNDQFINIPAKTSP
DRDLQLTPLASGIIGFGFLIVLPVALIGSGVLIWWRRRQR
>MCA0091 conserved domain protein
MRRTAAFTLIELMIGIALAAVLLTVGIPGFRDLILDNRMAAQINSLVADL
SYARSEAVKRNSEVTVCKRNTAGTACDDSKNWTDGWIVVAGTTVLRVHDP
VSSSLTMTIKYTGSNRVVYGGKGFLSGVNNGTFIFCDSRGYTKARGLVLA
MTGRLRTTRDDDGNQIQEKGSNGVNLSESDCQ
>MCA0150 conserved domain protein
MKNLPSSEGPALPRSDVQLGQLLLHAGKLSEKDIEAIGVKQKEDGLRFGE
AALRLGLIGEDDLRQALARQFAHPCLPESDTSVDAELIAAFCPPGPEGEA
LRDLRSILMLRWFGGHRRLLALTSGRPGEGCGRLAANLAVVFSQLGERIL
LIDANLREPELHRLFKLCGDPGLSGVLAGRHSPEKAISSIPALGDLSVLP
AGAPPPNPQELLSRASFIRLLDAAAEQYDIVLLNTPPALQSADARIVATR
AAGCIIVVRRDATRLGDAAALRDQLSGAGVEVLGAVLTS
>MCA0088 hypothetical protein
MARKRSTQNGAALVVGLLFTIVLTMLGIAAIHSSSTSVRMARNAEARING
LQQAQAAADYVAAEAIDLNLDRMGNNVRCTASYPYGTCNGLSLPALPSPL
TTPPVHVRVRGSIAFGSSRREEGNKVFERVIESDYDGAPAGSSADATRAG
VVSAVRGTLVSSGSVRYDSETPDPTPTPAPTS
>MCA2888 putative type IV pilus biogenesis protein PilF
MRGWLALTALLGAIAACATPEQRPYATNDPYGELSTADVYVQKGVRYMAQ
GALEVALEDLKHAVELDPANSDAHDALAILYEKLGRTGEADLHFREALTL
NPENYSAYNNYGRFLCHTGHTEEALARFEVAYSTPLYPQPWIPLSNAGTC
LRRAGRTAEAEPYLRRALEKNPGYPPALLEMAHVSLETRQYLSTRAFLQR
YQAVAGDTPETLWLGVQTELALGDAAEARRLADRIGVDFPDSGEAVQARR
LFAQP
>MCA0644 hypothetical protein
MNENTASMAEILADLLKLSSDAMANVSEREQLSIAVESLVMSTEHARRAR
DQIAIALLELEEYRRQNRRLVETCRKWEAIANDLLARLVMVTRPPEHSRR
MH
>MCA1123 general secretion pathway protein D
MLKKPEYYPRKGAVVNPPSAGGGGYPGSATTSSAGGGATGKGGGRTSPRK
EGKYTLNFDDADLSEVTKVILGDTLKVNYVLSPKVTGKVSLQTTRPLTED
EMIPTLETLLRMNGAALIREGGMYKIEPDAQAAISASGPGVGLGMMEPGY
QLRVIPLRYISAAEMQKVLEPIMPPKAVLRMDETRNLVMVAGTAEELGGV
MEAVQIFDVDYMRGMSVALYPVKNADVPTIADELTKVLGLGGKGAMGNVL
RILPIERLNAILAVAPEMHLLHEVQDWIERLDRYNTTRTGNVHVYRCQHV
DAGELAHTLGGIFGGGAGRQGPSLAPGLTGMDVGSGISSAFGSASATAGG
FGGSSAGYGGSGGSFGGSGSSFGSSGSSGSSMGSTTGSMGTSGSSGGLGR
SGSGGLGGSGGGLGTSGGSFGGARSGAGGQGASMTQLGNNARVVADPANN
ALIVIAKPQDWKEIEAVIKALDVLPLQVLIDATIVEISLKNDLQYGLKWL
ISSGSSTEALGSPLAALVKDFINGAASASGGFSYALIANGGNVKVLLNLL
AQENLVNVISSPSLMVLNNQQAKINVGDQVPVLTGTTASATIGTTQFNQF
QQQQSGVTLQIRPRVNSGGLVSLEIFQAISTPSPVEVGSGQKTFQFANRE
IQSVVAVPNGQTLALGGLISDKRSETQTGLPYLNQIPMIGWMFGSTEIKP
ERTELVVLITPRVVEKKGDINSISNEFRRKLTNLYRGEPDAQPAAVEPST
P
>MCA1115 general secretion pathway protein F
MAPAGARPFAWLSLSRRRNRISQKQIGIFTRELLTLLQAGLPLDRALFVL
EDLTREDVALNGMIGRVIELVKGGSQLSAALERQEGVFSRFYLNLIRAGE
AGGALEDVLERLSDYLERSKELRDTVSTAMIYPAILLTMSVGSLLLLLTF
VVPQFEEMFETAGKELPVPTQIVVGIAELLKNYWWALFGVVLVLVAWGRY
ELAEPSRRLVWDRRFLGWPLFGDLVRKVEVANFARTLATLLGNGVPLLGG
LSIVKETLGNRAVAERVDIAMDNLKEGGALSGPLTDAGVFPTLALQMIKL
GEESGHLTEMLERVAVTYDKEIKITVQRILALLEPALIVGLGIMIGGIIV
SILMAIVSVNDLAF
>MCA1246 putative methyltransferase CheR
MDGEKADEGLIATHLVVLPWDEAEPARAFFEAISLDRRPAFFLPALPDTA
LAKLAEAPLAVTIAEPGAPVLPSHVYVPPSGKYLSIESERLRLDDAPASG
HPLDHFLHCLPTDRHGRTAVVRHPGASSREREILGMIERAGGLVCAVAPF
EPGADALESAAETIAQTVENWLRREEGDDSETLAAILELVRSRSGADLRA
YKVPPLRRRIQHRMGIARVEDTAGYLRLLESSEDELDRLAEDLLIGVTAF
FRDAEAFALLETTVVPAICGEGRADRPVRCWIAGCSTGEEAYSIAILLTE
GFERAGQRPRLQIFATDVDAEALEFARHGLYSEAALAGVSETRLERYFVR
EGEAYRISKQIRESIVFALHNLIGDPPFSRLDLVVCRNVLIYLNATTQEK
LLEIFHFVLNPGGHLFLGSSESLGEAARLFDEVSQPWRIFRRNDAAAAVR
PSLPLSAPATTARVAEGRILAADPKAHEIQERLFRELLERHAPTLIIANA
RHEVLYTSGNVQSYLELPAGEPTLELFRLVRPPLRTMLRCSLDRCVRERR
KVAAVATGANHAFPLPEAVRITVTPLADGEVFELFLIGLEEETPGDPGAS
ATTRTGDDGWLLQQLERELAATREDLRQTIEKSRHVNDELRASNEEIMAM
NEELQSANEELESSREELQSLNVELTATNAMLDAKVGELEATTNDLNNLL
VGTDLAILFLDCDLNIRRYTPACNRLMHLIPSDIGRPFADIVHHFEDDEL
FTDATDVLRGAEPKSREILDQQGAWYLRKILPYRSRDNCCVEGLVITFGE
VTSLKQAEHELIAQAAELRQQAELLKHAHVLARDLEDRIVFWNSYAEKFY
GWTKDEALGRTSHELLNSRFPLPLATIRQKVLADGHWKGELTHVDRQGRE
RTVATHWELYRDAEGVPRAIVEVNNDITERNSFEAQLRQSRHYLDYLAYY
DPLTQLPNRFRFQQRLEQAVSQALAGGKRLGLLFLDVDRFKLINDSLGHE
VGDSVLLEISRRLEERLGPGDTLARIGGDEFAVLMEDLDDIGYASRFALA
AIEAVKTPVVLDRHELFVSLSGGISLFPEDSRDADGLLRSADAAMFLAKE
KGRGNYEFFTPDLNRRANRLLSIETRLRKALENGELELAFQPQMSLQSGA
VTGAEALARWNSRELGRVPADEFIQVAEDSGLIVPLGKWVIRTVCQTIAG
WQRSGLKPVRIAVNISARQFREPNFVRLVADTLTETAVDPTLLELELTER
LLLQDVDTVVRTLLELKRTGVSFSMDDFGTGYSSLSYLRRLPLSHLKVAR
EFVPWGADDCNNLSISRAIVSLAKSLELSCTVEGIETEAQLDCFKTLGCD
QAQGYLISPPLPPGEFEDFLRRPLPFKP
>MCA0305 putative chaperone protein pmfD
MNLFAWAAGRHLPILAALLLLPFRLGAGQFDVSPTRIELTAAKPTAAVTL
KNESGDKLVIQNSIVSWTREGKEDRYAPTKDLIVTPPITTVPPGGSQVLR
VGLRRPVDPRRELAYRLFVQEVPPPPQTGFTGVQIALRLSLPLLVQPATP
ASPRIAWSGTKRPDGGLEITARNEGSALLSVDELSIRATAGKPQGQGPVS
IFPGGRQSWVFPGEVLGSESGTAHVRASTSAGVIEGDVDVEGP
>MCA1661 putative methyl-accepting chemotaxis protein
MFIALSKAEDVETYAYERIRAAVVLRNDRYLVSSINALQKYRREPGRYAR
ELQAFDGEIRPVLDFMIENLAFRNLIAFDPRGEVLYQSHPVQQLGNNLFE
GLLKESPLAKAVLHATTLLDAEISDFGAYPGGDKPVGYVVTPLFDSGGEL
VAIIALQLDTDEVFAPFLDYSGLGNTGQTIVGLLDHNEIRIISPLRNQPD
AAQKLRIQMGDSQGQGIQKAVLGNRDFGEIPGIFKEPVLASWTYLPSFRW
GLSVQMDVGEALALVDTQRLVNGGLLALLLIPTALAASLVSRTISRPVGE
AIAAAETVAAGDLTANLDSDRNDETGKLLRALGRMTSYLNSLVGQVQKSI
IELVSTSNTLTAMNRAQEEDISAFGATTNQIAAASREISATSEELLRTMA
DVTRMAGSTAEMANAGRNELTDMEQVMHTLAESTGSIADKLNAISARASD
ISAMTTTIKRVAEQTNLLSLNASIEAEKAGEFGLGFSVVAREIRRLADQT
AVATLDIETVVREMHAAVSIGVMEMDKFSEQVRRGVDETRRISQQFSRII
EQVGELTPRFDAVHEGMRSQSAGALQIRDAMVALSESAMQSARALEETNR
ASQRLETAIAELRKEIAIFRLS
>MCA1116 general secretion pathway protein G
MKHRRFVRSRSGFTLIELLVVLAIIGLLAGLIGPQVMKHLGESKSKTARL
QIEELASSLDMYKLDVGRYPTTDEGLNALIEQPSTARVWNGPYLRKKKVP
LDPWNNPFHYVSPGQHGKYDLWSLGQDNAEGGEGEDADILGWE
>MCA0090 putative type IV fimbrial biogenesis protein PilV
MNRSHPNGFTLIEVLVSVFIFAVALLGLAALQIAARKAAFESAQRSLAAA
IGQDVLERMRLNTEALSGYTGTLNANTTFSDSLDCAGPANLAVCDRRDFQ
RSLLGTAETTGTGTNVSKTGGLANPTLCVTSSNAGGSGQYTVTIVWRGRE
PALPEAGSENASDTCGNGQYGTNNEYRRIFRISTYICREGVSGGCI
>MCA0304 fimbrial biogenesis protein
MPIAGRSREEAELQYAASDSLSVTAVVRPPSPGGTAQPPEDREEHLLAVE
LNGETVSEGTLVLRGQDGSLYVPEEAIESWRMELPAANTLAHDGRDYRAL
ANPGIIEARIDDTRQTLVLRVEPSLLAGTVVEAGTRLAQPKAERASFAGF
LNYDFNIMHSRQGDFQGGALELGISNRYGLATSAFLARHSSGQGGSETQF
TRLYTTWIMDDPANMASLQLGDNIVRPWQRGYQARFAGIQYATNFGLQPR
FYRYPQPAVTGSLTEASSLDIFLNGALVDQQNLPAGPFQIRDLPAANGSG
EIRVVARDLLGREQVITQPFYVGSSMLREGLVDFSYEAGFLRRYYGTDSM
SYGAPFGSATYRRGLNSWLTGELHGQVVRDQVAVATGGSFLIGTYGVVDA
AAGISAGRGWGGLGGLGFLHQGETLSYGASSYFASQAFTQLGIEQGQFAP
KQLSDAFCGFTLTSGSSVSLRYGSANYFDRPHVDTYVANYTQNLWGGLTL
SLAAVYTRSDFEGAQVAAIFTLPLGERTYATANARATRTQGGPTQLEFDA
QYTHTPGWGPGWGYRLDAGTNDRQAAQLTAQTAFGLFSAEVAHLQGDFAE
RAFGSGGIGLIEGHPFLSRQIQDGFAIAKVGDYKDVRVYADNQLIGRTDE
EGYVVLPRLRAYDVNNVQVEGEDLPMDAQVGTLALPVTPYFRSGVVVDFP
IRRSRGATLTILLEDGQPIPAGATVTVEGVADVFPVGMDGEVYLTGLSPR
NEMKVEWNGKRCTIVADYPESAEILPDLGKFVCAGVHR
>MCA1819 OmpA domain protein
MVKKFATTVLTAGVLSGCVMQSTYDRDVGYERQLNEQLRTEVEADEVKIQ
QLRDRLRVTMEDELLFPEGGYQISKEGKASIDKIIPTLQSATNHRIEVEG
HTDNVPIGKHLAHRFSSNWELSAGRACEVVKYLQSKGIDPARMTAAGHGE
YQPVASNATAEGKAQNRRIDIDLVPLYTE
>MCA0830 CheW-like domain protein
MTEAGIQEGQCWNRIGIYGSRACSRLTEVIHCRSCDVYQVAGRALFDRPP
GEDSIDRWTEQLSAVPDDDAAERTPLVVFRLGAEWFALPAEWVREGAAPG
PWRRVPHRSGTEFLGIMNVRGELIPYFSLAAVLGIESEADSAAERVLVCG
GEGHRLVFPVGAILGLRRLVLDETSRPPATVGKAADAYTRWLVRHAVGSI
AVLDAERLSGALEALLR
>MCA2145 conserved hypothetical protein
MNGCGVRVPKFPKGTGLFPLPPAGRRLVAAALIPALLVSGCAGTGGYRQT
YGSSSQARLAQLSEAYNRTLAEGCVAGAAVGGLAGGLIGRDWKGAVAGLL
AGAALGCAAGSYMGNLQNNYAREEDRLNAVIIDIRRDNQSLAELIPVAQR
VVEDNKARVAELNQAIAAGRISRAQAAGQLADLDASRSQLQATLGSAKKR
LAEQHAAIAMNAHNADPQLANIAQEELARKEQQIQALESELNTLTQLRSV
DRVG
>MCA0199 hypothetical protein
MDRLIADLQKTIQALETLSTTKPALAEQSDELLDQLFQQKIDLVAASLNA
DAPTYQQALQAISTAAGKVEKLAKNPTDATDTFKSVENAIARLARLLDQV
VHTP
>MCA0832 protein-glutamate methylesterase
MRIAIVNADRQAIEILRGVIEGEAGLDVCWTARDGKEAVARCAADRPDLI
LMAMHLPAMDGVEATRHIMGTMPCPILVVTATIAGAFSRVYEAMAAGALD
AVETPRRVGDGEEIAGAGELLRKIATLGALMAAPLQGASAGKPAPRSLHP
SHGGAVPVVAIGSSTGGPHALGVVLSGLPADFPAAVLVVQHLDVRFADNL
AHWLSRQTPLDIRLLERPERIRSATVYLAAREAHMVLDGRMRLHYVEAGD
GTNHCPSVDRLFESLAILPGLTGCGVLLTGMGRDGAAGLLAMRRAGLMTV
AQDEDTSVIWGMPGAAVRQQAASAVLPLDRIAPAVVRYIRSLPDPARISR
GASDA
>MCA1510 fimbrial protein
MKALQKGFTLIELMIVVAIIGVLAAVALPAYQDYTVRAKVSELILAGSKF
RTDITEKCQLAGTCTGSGTSMTVAIGGKIATGSDVGDNGTIVIQGSTATD
SVGAAVSITLTPSWNSTLGSAVWSCTGAPARYVPGSCR
>MCA2096 pilB, type IV pilus assembly protein PilB
MAAVSGVQQLSGLAKSLVKAQLLSEADAQAYFEDARRKNTPFVSYLVANK
ILDGLQIAQVAHQEFGLPLLDLDAVVFDIPPTQWVGEKLIRRHHILPLLR
RGNHLFVAVSDPTNFLGLDEITFQTRLVTDCVLVEENKLSRCIEAVLDAA
DTSLKQLLEEDLDSLQITAGDDEGPHAEPEISGIDDAPIVRFVNRLLLDA
VRKGASDIHIEPYEKFLRIRFRHDGILHEVANPPPALGVKICARIKVMSR
MDIAEHRIPQDGRIKMTLSRSRAIDFRVNTCPTLFGEKIVLRILDPSSAQ
IGVEMLGFEPEQRDNFLRALEKPYGMILVTGPTGSGKTVTLYTGLNLLNR
VEANISTVEDPVEITVPGINQVNVNYKTGLTFAEALRAFLRQDPDIIMVG
EIRDLETAEIAVKAAQTGHLVLSTLHTNDAPQTLNRLMQMGIPAFNIASS
VLLIMAQRLARRLCPRCKREEKVPPEVLLASGFREEDIGNFTLFGPAGCD
QCVKGYKGRVGIYQVMPISEEINRIILEGGNVMALTEQAHAEGIADLRES
GLKKVKAGITSLEEIDRVTRD
>MCA2095 pilC, type IV pilin biogenesis protein PilC
MANQETALFVWEGLDRNGSRTRGEISSRSEITARTELRRQGIRVVKIKKK
PKPLFSGPRQKITPKHIAVFSRQLATMLSAGVPLVQAFDIVGRGHDNPAM
QDLLMSIKADVEAGTTLADALSKHPVHFDELFCNLVRAGEQAGVLETLLH
KVADYKEKTESLKGKIKKALAYPIAVVVVAVIVTSILLIFVVPTFEDLFK
SFGADLPAFTQMVINLSRWLRDWWYVVFGSLGVAAAAFVKARQRSPAFNH
LLDRLALGLPVVGAILRKAAVARFARTLSTMSAAGVPLVEALQSVAGATG
NIIYGEAVMRMREDVSTGQQLQMSMRQANLFPNMVMQMVAIGEESGALDS
MLSKVADFYEEEVDNAVDSLSSLLEPLIMVVLGVLIGGLVIAMYLPIFKL
GSVV
>MCA2094 pilD, leader peptidase PilD
MMLLPTGPAGVALCGSFGLIIGSFLNVVIHRLPLMMEQAWRRECGELLEP
GTGRPVGKAEYNLWRPGSQCPHCQAPVRFWQNVPVFSYLWLRGRCAACGT
AISWRYPLVETLTGVLFAMVAWHFGASWQALAACALTAGLIALAFIDLDH
LVLPDDITLPLLWLGLLLNAQGLFCSLETALYGAVAGYLLLWTVHRGFLL
LTGREGMGFGDFKLLAMLGAWMGWPMLPVIILFSSISGAVIGSVALAVGG
KGRDTPIPFGPFLAAGGWIALLWGNALNDAYWRWAAPGG
>MCA0086 pilE, type 4 fimbrial biogenesis protein PilE
MAEPNGKVSITPARRAAEGRGFSLLELMITVAIIGILATVAYPSYKEHIV
RTRRADGKAALLRAAAREEQYFMDNKTYTSDVTRLGFASNGKSDEGHYVI
SVTAADANGFTLQATPQSPHTDALCGNLTLNSLGVKGKSGSGSVADCWNW
>MCA0325 pilM, type 4 fimbrial biogenesis protein PilM
MNYQRCRFPVACCGFPSSVPFSGDMSAMWGLGRKKPLLLGIDISAAAVKL
LELSRKNGGYQVESYGVVPLPRNTVVDNTLAEPDNVSAAVRAVVKQSGTA
LRHAVVAVPGSVVITKRIALPANLEGDELEAQIELEADQYIPHAREEVSL
DFEVLGKNPKNPDLLDVLLVATRRENVEDRVSVLENAGLGVEIVDVESYA
IERAFELVRDQLPPLLRDRAVAVADVGAMATTLNVIHGGSIIYTREQGFG
GMQLTDEIQRRYGLTYEEAGLAKKEGSLPGNYAEEVLEPFKHALVQQISR
SFQFYLSSTAQRGFDAVVLAGGCAMIPGIDRYVEAALQVPTVIANPFRHM
SFSGRVRQERLSYDSSALMLACGLALRSFD
>MCA0326 pilN, type 4 fimbrial biogenesis protein PilN
MRPGTAEFRLMARINLLPWRAALRRERQKEFALWTGLGLALTAAMMLLIR
TQISAAMDTQNRRNQYLEGEIGALERQIAEIRDLEAKKSRLVAKMEIIQQ
LQLSRPEEVHLFDEIARTVPEGVSLRDLTQEERAITVNGFAQSNARVSAY
MRNLEGSEWLYDPVLEVIENKQDAQGRGKEREKGSRFTLRMKQGPRGRTG
EESLPGGKAAP
>MCA0327 pilO, type 4 fimbrial biogenesis protein PilO
MNLAKINWDLEYAGSWPTPVKLAVIAVLSTVLAGVWYYLDTRTQLAGLED
QQRREQELKETFEIKQKKAVNLEEYKLQLADIEKTFGDLLRQLPDRTQVP
DLLVDVSQTGLASGLEFELFKPGGEVSKEFYAELPIEIRVVGTYMEFGAF
VSGLASLPRIVTVHNVKIVSRSADGAGPKPGEAELVMTALVKTYRYLDDG
AVPSSPADRGRR
>MCA0328 pilP, type 4 fimbrial biogenesis protein PilP
MMVRAKYAWAPAVLVCLSGCAEDELADLRSYVADVKARQKITIEPLPEIR
TVSPFLFNPAGLHDPFKSIEKPDDAGTAAADNGIRPDLSRQKEQLESYEL
DTLRMVGTVRMSGIMWALVRAADGTIHRVRTGNHMGRNFGRITRISENGV
ELVEIVPDAQGGWLERITTMTLIEAGGDKK
>MCA1537 pilT, twitching motility protein PilT
MDIAELLAFSVKNKSSDLHLSSGLPPMIRVDGDIRRINVPPLEHKEVHAL
IYDIMNDKQRRDFEEFLETDFSFELPGVARFRVNAFNQDRGAAAVFRTIP
SKVLTLEELGCPKFFQDVTRQPRGLIVVTGPTGSGKSTTLAAMIDYINSN
DYSHILTIEDPIEFVHQSKKCLINQREVHRNTLGFNEALRSALREDPDVI
LVGEMRDLETIRLALTAAETGHLVFGTLHTSSAAKTIDRIVDVFPAAEKE
MIRSMLSESLQAVISQALVKKAGGGRTAAWEIMVGTPAIRNLIREAKVAQ
MYSTIQTGRKDGMQTLDQHLQELVEKGIVTRQVAREVAVNKAAF
>MCA1538 pilU, twitching motility protein PilU
MEFASLLRLMVLKKASDLFITAAKEPCMKLNGAIVPLSSTKLSTDQVRQL
VLGIMNQRQRDEFENTNECNFALSAAGLGRFRVSAFVQRNSPGMVLRRIE
TEIPTVEQLNLPSVLNDLVMTKRGLILFVGATGTGKSTSLAAMLKYRNEH
SSGHIITIEDPLEFVHPHAGCIVTQREVGIDTESYEVALKNTLRQAPDVI
LIGEIRTRETMQQAITFAETGHLCLSTLHANNANQALDRILHFFPEDMHP
QVFMDLSLNLRAIIAQQLVRRADGKGRYPAVEILINTPLVSDLIRQGEVH
KLKDVMKQSREQGMQTFDQALFELFKAGRIGYEDALYSADSKNEVRLMIK
LSEEGSIDKYAPKDDSIRIVDN
>MCA1994 pilZ, type 4 fimbrial biogenesis protein PilZ
MSEEQTGARQGILALTIKDKNALYAAYMPTIKNGGLFIPTTRSYRLGDEV
FMLLQLMDEPERIPISGRVVWITPKGAAGFRSAGIGVQFSDEDGGVTKAL
IESYLSGHLESDQPTHTM