GenColors

Gene list

Applied filters:

COG category: Function unknown
Organism: Rickettsia conorii str. Malish 7, Malish 7
Gene type: CDS

Number of genes found: 75

# Rickettsia conorii str. Malish 7, Malish 7

>RC0660 unknown
MPSSYKLRKKIWKSVYLLITVGILYIGYILIKSGYINEKNDINVTKKSLK
DNKNFDLKYNIILKDSIFEGVNKNLNAYKIKTERAIKESNNKYKLDIINA
IYNVNQDQTLIINAKEGFLDEESSILDLKNDVKLFFDEIIFNTNDARIDL
VNKNITGHSPAKLLYKNSSITSDSFNTKDENNIIIFKGNVSTIIDLSD
>RC1071 unknown
MQKEMKMKTNDTFDGFTIELFKDTDGDWLARFEELPNVSAFGNSPEKALQ
ELQQAWTLMKESYISHNQSIPLAPSRKEYSGQFNVRVDKRVHRALVLEAL
RAKISLNALVSQKLTLSVKQ
>RC0474 unknown
MIVIYNNIFKNFFTDMIKKIAAGVIICFSLLLFIMFGALSFVNYNSVTNN
FTSHLGIAKENIGKIKMNKFPLSYLVIETIREEGKLDLEQIKIHFSLWSL
IKFNPKINKIDILDAKFYSHSNVLNIYNHEELIKNFFKYKLQNINLNVTN
LSIINKQDYSILNFNNCILKKENALSSNYIFKTTSNYIGKISGSINKRDD
IVDFSLNIDNNDYGFKLSQIYKDSKLTSGSGEYQIKNLASVMYNILPDLN
HLFNKFNQHEAVNVKFNISNNEDAIELKDIVIASSFIIGHGFVNIAKNDN
ITTNVKLDFPKIDLSSLISPNAEVTFNTSSSNSRFIFANKLLKADVAINE
IILSNNEELKKIVFSSNLLKGTLKINEFSGNIKSGGEFKLTGNVTQNAVR
SMFDGQLYLKHNDINSLLNILGFNDVTIKEAIPFSLSSDLKLTLIDLFFK
NLLLKTDNLNLSGNFSSKFIAQTPRLDATLNISSLDLSSRTYPIISPLIE
FTKNLTKDMKALDYPSKFIPIRTIGYLANLDILIDSVKYNDHVFDKMNLL
AKIVPANIKISNLDFKTANSYLSTSWNLDASSVLPSLTVEIKDGNLTTDL
LSPAGMLNLRNKLINDYSLDKATLQVSGTLSTLLQNDLILKNVKFYVANN
NNLLQFNNIEAELLGGKFQGNGNILLEPYAINFVYALNTIDLNKVSALMP
KIFTASGGKISISGSLGTNGNTLQSQLYNLTTKSQFAINNIDVNNFAIDA
FIEKIDTADYKVQNLDKDINSAITTGQENIRGISGDIELQKGIALLKKVK
FATQYSSGAASVAVNIYNFDMDASSILSFYVPARLVKLNTSNTSSDKDSL
AHLNIKMQGSIFAPKKTFDSSELKKLLIPQTTEDTITTDNH
>RC1372 putative integral membrane protein
MASYYLWFKSFHLISAICWMAGLLYLPRIYVYHIKAKIGSELDSTLQVME
LKLLRFIMNPAMISTFIFGLINAHIYGFVALDTWFHIKMFAVLILVIFHG
LLARWRKDFANGKNVHSEKFYRIVNEIPAICMIVAVIMVIVKPFD
>RC0487 unknown
MKLLKIIFIITICINFPIFAENLESVTTTEDIENDVFIPLDENHPILNPN
DNINDSSEFKNYTNGKIIALNNITATSEEIDFKVGEEKYFGNIKIKLHKC
IKNLDPYNEDNYLLMTITEYKIDEDPNVLFQGWMISSSISLSTFEHPIYE
IFAKDCF
>RC0102 unknown
MPLSTSLLGKKSTYKDSYDVTLLFKIPRINNRNELGINSNNLPFYGVDVW
NTYELSCLNKNGKPWVGVGTFYIPTDSENIVESKSFKLYLNSFNNFVVES
VKELERIILQDLSNVTHAKVTGRIFPINTKVEFGVPSGKNIDDLDIVCNN
YGAPDNSLIEYEDVLVEEEINSHLLKSNCLVTGQPDWGTIVIKYKGKKLK
YDSFLKYLISFRNCNEFAEQCAERIFTDIKNAISPDFLSIYIVYARRGGI
DICPYRSTDKSYTLPSDKRFIRQ
>RC1110 unknown
MLPNLGIIAGRGSLPYLIAGNYTKQGGNCYIAAIKDEADIEQIKDFEYKI
LKIGMVGEAIKYFKEHKVKNIIFIGGVNRPNFKNLAVDKIGGLLLFKIVG
QKIRGDDSLLKIVADFFESYGFKVISSNEIYKNQQGNSNIITNTNPISSD
KNDIELGIKLLNHLSAFDIAQSVIVESGYILGIEAAEGTDNLITRCADLR
KNPHGGVLVKIAKLGQDNRLDMPTIGPNTIKNLAKYNYKGVAIQKNNVII
VEEELTIKLANKHKIFITKC
>RC0365 unknown
MYVVIELKTGKFKPEYAGKLNFYLNLMERTIKDNSDNPTIGLILCEEKQG
ITVEYAIEGIQKTNRSITI
>RC1034 unknown
MSKSQEQEQESIDNIRQQFDEEYKNLSWVQVGYLALGADTGNKQYDELNK
RYLDLNKDNLGIIQKVLSKQSGMFLVHCSLPGRLLKFYLILHWSLITSEK
REVLPLV
>RC0638 similarity to late-developmental spore coat protein
MYSSATPLAFGTYVPTVDALQTNTIIIKCILGTNYTVALNAGTAPAATTS
TRKMTGVVNTNSYLPYNLYSNAGRTQNWGHQSSDWVMGTGLDQTLTIYGK
ILQGANVPSDTYNDTITIIVAY
>RC0766 unknown
MKETKRFFYKNNRLNKGYAKTFSVNEPDNNFYRKKFEHILPPIDLISEYE
SIYPGTLQELMHMAQQEQAHRHAIDLKNLKIQARVAKLTRICLLIFGICL
VVSIFFKTF
>RC0681 unknown
MFIICSAPRFADSLLFFQLIFVYYFSFYFRVYNTTFERKEINMAGHSKFK
NIQHRKGAQDKKRAKVFTKLIREIVTAAKTGSSNNPENNPRLRNALTAAR
SQNLPKERIDKAINSANDSSNNENYTEIRYEGYAPNGIAIIVEALTDNKN
RTAAEVRSSFTKYGGSLGETGSVNYLFNHCGVIQYPINIASNKDILEAVI
EAGGHDIISDDTTHTIYTDIENFSKVLEFLTGKYGIPEDSYIGWIPLNTI
IIDDKEKAEKLLKLVEVLEESDDVQRVFGNYELSDDVYEIIQGEP
>RC1356 unknown
MIRKFLLTIFALWVGGFGYYLYLINSYKLNSNTTNAIIVFAGGGHKIETG
IAWLKAGYAPILFITGIESTEQLKILLKERNVIEQQVIFAPNKIMSEEDN
IKKVVDFIVTYNLTSIILVEHNYNMPFMLNKLEKAIPSSNNIYIVPYPVF
SKQKYDVLLKSYHRYLMSILV
>RC1253 unknown
MKKETEELKLFILECLSEKKAEDIEVIDLTEKHKLADYIIFASGRSTKNV
GAIAEYVALELKNNAGINSNIEGLGKSEWVLIDAGTILINIFYPEVREHF
KLEEIWKR
>RC0307 unknown
MSKDNKKNQDMSIEDILKSIKGVINERKNPIHENDSEDEDVLELTEIVNQ
DEEEKLISTKSAEEVGDIFKNFTDTIKDKKLDNNISSKNALEELVIEMLK
PELKAWLDKNLPVLVKELVEIEIKKLVQNSKR
>RC0768 unknown
MIILEDIIPDYVKDAEEVKITAGCDKNFITCCNKFNNAINFRDEPLIPKT
DFINLV
>RC0608 unknown
MKKQIIYPDFIARIFSTALDLSVFAFIAIPISQFCSFNLLWLFFNDYFLS
NNINLHNPNEMFNSVMSQEFYEYLKAGNFNKYILFNISIFATNILVIGSY
FITLWYYKGATLSKMFLRMKIVDAVTLNRPTLKQLIKRFLGYMTFPIGIF
FILFSSKKQALHDKIAGTVVIKS
>RC1154 unknown
MIKVKFMRSALITSILVAVAFLTSACNTMQGAGQDIQVAGKKLKDSAESN
KPQKGCGCPHSSAN
>RC0119 unknown
MRKVFKKFLKNNKYVLSIITILLYWYLRFVYFTSKQKFIFYDNGNKEKFL
NEQGVIFAFWHNMLALSPSMFIGHKNIYALISPHLDGKILNDLVGKFGCR
VIVGSTNKNPIGALRNIIGKLSQGANIIVTPDGPKGPVYKVNSGITEIAY
RYNTKLITIVSSTSRCFRLKSWDKLIIPLPFGTIKIIVGSPLELTNDKIQ
NHISLEQQLASLTESLKK
>RC0001 unknown
MTKLIIHLVSDSSVQTAKYTANSALAQFTSVKPKLYHWPMIRNLELLNEV
LSKIEYKHGIVLYTIADQELRKTLTKFCYELKIPCISVIGKIIKEMSVFS
GIEIEKEQNYNYKFDKTYFDTLNAIDYAIRHDDGQMLNELSEADIILIGP
SRTSKTPTSVFLAYNGLKAANIPYVYNCPFPDFIEKDIDQLVVGLVINPN
RLIEIREARLNLLQINENKSYTDFNIVQKECLEVRKICDQRNWPVIDVST
RSIEETAALIMRIYYNRKNKYNK
>RC1301 unknown
MDKFYNYNSSSHQALLSFKVKPNSKQNLISNFVIINNIPYLKLSIKAIPE
QGKANEEIINYLAKEWKLSRSNIEIIKGHTHSLKTILIKNINEDYLNLII
NSYIK
>RC0719 unknown
MRRKLYENSLAHSWKQAGIEEGRKKEKITMAKEMKKEGLSLETIMTITKL
DKKDIEKLK
>RC0510 unknown
MNQEERNLIMNKYILDSSALLALFNLETGSDKVEELLPLSIMSTVNIAEV
VAELDKKLNISFIQSKAMISASINKIVALDFDQAIEIGRLKKETEQFGLS
LGDRACISLGLITGYPIYTADKIWAKLQLNCKIVLIR
>RC0103 unknown
MNSQKTLPLLPKATAIWLIENTSLTFKQIADFCGIHEFEIKGMADGEVAQ
SIKGLNPIANGQLTLEEIERCSKDPNANLQISYSPAYELMKNQKKKRAKY
TPIARRQDKPDAIYWLLSNYPNIQDHQIIKLIGTTKITIDAIRTRSHWNM
NSIRPRDPVLLGICSQIDLNKIVESLKPPQNPTKES
>RC1173 unknown
MSSMLKQIRLESGKTLNQVSSDLKIRKKYLVALEEGDFDVLPGEVYVRGY
LKLYLDYLNVKDRNAEQIEATKQNETEKLLNNKRATVINYKRKKQLVLIS
IIMLSIIIVSHPFIINA
>RC0604 unknown
MSPQIIELLIFAVIAFYIINKLITTLGSTSEEEQTKQKSYFGEPIIKDVT
YSIVKSNKEEKNIPTAQDIKAFKDIIVEHNITAIVDGMEQVHKRLYSFDP
VKFINNAKTAFQMIIEAAYKKDAKELSELIDKRYLEEFEKITPSYGDFFD
SSALSAKYSEIYMFGNNIFIKLLFQGKNVVDKIEDLKEEWTFTRNANTKE
VDWFLSNIERV
>RC0950 unknown
MWFIELYPIIQLYVKVKNMSIQEEHHIKKISFVQSLLELLPFNEWNNKLL
EEAEEKCGFAKGYSLIVFPEGLSEIVGFLEEYLDNIMLESLKIIAEPSKI
REKISLAVKTRVKTVLPIIHSKNAAYFALNPIQGTEVAFRSCDAIWRYAG
DKSLDFNYYTKRSLLLSVYVSSILFYIQDESENYIETDKLIETAVENIVK
TFSQMKKLLAPSRIPIVRMFT
>RC1144 unknown
MQKKLTITIDEAVYYRLYSVIGERKISKFIEQLVKPYVINEKLGAAYKAM
AQDIKAEEKANEWVEGLIERDFYEKS
>RC0914 unknown
MNKLNKNSLQKKLFYRSKNRGCREMDYILSSFAEKYLSLMDETQLGSYSL
ILDQNDNDLYNWINNKSSAPSYLDAEIIDKLHKIAKI
>RC0366 unknown
MHNKPSKMVGQEMYYLGKLKVIYMKGQRKKGINNFSNTLPELQSDLARSI
IKEPYNLDFLDIQGKIIERDLENQLIDNIKNFLLELGQGFAFVGNQYHIE
LEGEDYYLDLVFYHIKLKCYVCCY
>RC0153 unknown
MSEVVVKEQLEQYISKIERLEQEKADLSQEVKDIFQDASSHGFDVKAMKS
ILKLKKLDKDKLAEQDAMLELYRDTLGI
>RC0364 unknown
MQKYGPMILFNGGAYDNKLLQEALDKNIITDYPKENFYIFTLPEYQTNIG
ADNLTLYIKSMKTVILT
>RC1171 unknown
MSIVTVTLNNKSFQLYCNNGDEEELLSLANKLNDKIAEIKLGSPTASFEL
LLVMASLNAQAEIVSLTEKLNKNGLQKNHPDEEKFAETLTTIAGYLENLA
RKMGK
>RC1320 unknown
MSEVFEIPNGESKVLLHCCCAPCVGPLMEKMIDTGIKFMLFFYNPNIHPK
KEYELRKNENIKFAEKHNIEFIDADYDPQNWFRRAKGMELEPERGIRCTM
CFDMRFERTALYAYENGFKVITSSLGISRWKDMNQINESGTRAASHYEGV
TYWTYNWRKDGGASRMYEIAKEEHFYKQEFCGCVYSFRDTNDWRVANNRP
KIEIGKEYY
>RC0818 unknown
MQTIEQQIANVIEESLTDMGFELVLVKFKGVNPKVVEILIDSLNSEKISV
EDCTKASRTISAILDVEDLIEAAYSLEVASSGLERPLVKFENYNRFLERE
VKIKLKELLNGKTRYQGKIIKAENNKIYLKCEEQEVLIDYDLIKNANLVL
TEEVFKKLLKQ
>RC0754 unknown
MLAIISSAKTLNFEKLAPKTELTIPMFLTLTNKLLSTLQSYSENQLSKIM
NISAKLALINKERFKDFDNQESKAAIFTYAGDVFNNIHIEKLTNHALNFL
QSHLLIISGLYGVLKPLDTIKPYRLEMATKLNEINLTNFWQDEVTNYINK
ILAKQENKYLLNLASQEYSSVINPNKLKYQLVNVHFKENRNGKLSRIGIN
AKKARGAMVKVIANNLIDSPELLKNFSYLGYAFSTKHSSDNELVFIKS
>RC1057 unknown
MLHSMSYLLPTIITTLLILLALIIWFYVKTQTLRTQLQFLSEQNLEISNN
NQLLNQEQIGYLQKIEQLQCKIEYQAQTIKDSEKIREESFSSAKAALFDL
GQDLSKQLIEIHKMENTAARELAEKNIATASGKFNSEFERLITMVGALNK
DIEQSKGTVDLIKQSLLSPIGAGLLAEITLENILKSSGLRPNLDFIMQYG
LTTLDSGKLRPDALIFLPSGNLMVIDSKASKFLVDKQDNNMSLNKTMNYH
LKSLANKEYAANILTNLNKKDQSFNNVMTLMFLPTEQAVEKVIAADPEFL
QKAWGCNIFPVGPSGLMNMLSFAKFQITDHRRSENYKVIIEEVRKLLSSI
GTMADYSQKIGNNLHNMVTNYDKFAASFNRNFMSRVKNIQKLGIDSGNKA
MPATLERYQIVSSKSEIIEVEAENPPQIAEKL
>RC0028 unknown
MASSRNDGDKLMNTEKLKDIKARIKDLTTPKFSNPKIRQEISPFTIAVDL
VSGTMLGVVIGIFTDKIFNSKPLFLIIFTIIGMIAGFNIIRKKVNNKK
>RC0697 unknown
MRCSSYCTSSEYKMSDLVTNLKKIGLEPQHFDDVLYIRKEINKDSDFIEI
FFFPFGCVTIWGGDEIQEKIVLSDTDLVPVNKLKEPVSDYIYFEYNTEVK
KTFIDEEKNKIILADKSVFVKLSISHALAQSVKLSVLEQSVSNLIVQTTP
IQQELARTGSVSLSKKEILQQIGILFNERYSISLHSDIFDTPEFFWRRPS
YEPLYLMTAEFQDIEIRQNIMNHRLNMIHELLNILSNDLNYKHSTKLEWI
IIILIGLEVVLSLSHTNLFLKIIGAL
>RC0659 unknown
MSLQSSIYRIRHLAKPAYREEFKGDIECSTAAYKEVLEDTSTDSTSKLPL
EAKFGKMSIIKLVLLLIISTIIYANDKNISNLHITSDSLIIDRTKQKAAY
LGNVIVYFDNAILRTKELYIFYKTIDEKQTIDHIVVPTKLTVERKINNEL
LLADSAKYFCDNKQLILLGNVILQRDDNVLKTNKLIYYVDIIKK
>RC0781 similarity to N-terminal of biotin-protein ligase
MKIKIYNDLGVSKESIKHCVHTLRLYAPKYNVDYITAQEIIDEKWVQNTL
LLILLGGRDLYYVQKLQGKGNANIKNYIKNGGNFLGICAGSYYSGNYVEF
AKGTNIEVISKRELKIFNGTVRGPLLAPYCYNSHKGARAAYLKINPTLNL
NIKDCYAFYNGGGYFIDAENTKDTEIIASYEDSQAAIIKCTYGDGTAILS
GVHLEYEPALIKNQSLNNIHKILKAHDTERIKLLNYILDFFGNTLIERNL
>RC1337 unknown
MVNFNQFLKQAQSMQKKMQEAQEQMANARYTGKAGGGLVEVIATGKGEVE
KISIDESLLKAEEKEMLEDLIKVAFNDAQQKCDEDSQNSLSGALNGMRLP
PGFKMPF
>RC0209 unknown
MDDKKDNRHLSKPAYREECTGDTERSTTAYMDILEDVSTGSTSKLPLEAK
FVKISNNISEKENLPKEKEIGGVKGLEPTRYGDWQHKGKVTDF
>RC1029 unknown
MSPKFMTFYEKYMTIVGTIGNFMFYVQAHKIFTCQSSASVSMPAFTISAI
ALCSWLIYGILIKNTPIIIANIVGFIGALLVLLTIIIY
>RC1109 unknown
MNIEYKKFVHEYMLEFVKKILTKIQHENLYWDQLIYISYRTDNPAVILPS
KVKQAYPKQITIVLQYQFENLIVNDTGFSLTVSFDGVKEIIYVPFDALIS
FVDFNNNYSLTFNQSLNIHENPQHEEAISNNKSDQTSSSSSPNVIMLDKF
RNSSKPS
>RC0210 unknown
MSIIFFIVHSTKYMKYNRIYINSRLAENSKIELASDHHVHYVKTVLRLKV
NDGLRLFNGTDGEFLAQINDIGKNNLSVRLKEQLKKPYTKSTLTLAVAII
KQDKLMLAINMATQLGITKIIPLITRWCQFRSVNIERLTKCVIEATEQSE
RLTPPIIEKAITIQDYLKKNNNLMLYANEHEKEENSILRISSSLSNSDIT
IIVGPEGGFTNDELELLASYKNTKSISLGSNILRAETAAITAIAQVRLLG
SHCEEIA
>RC0646 unknown
MINGVRKLNYNLYKDAGFTQILGNGTSSTVTFTDSYTLGLEPVTKNYTVY
ARLPSQPLAAASTFQDTITVTQPWLYNKKSLKSKFFSLGSNFVAYMD
>RC1251 unknown
MLYLKWKIMIKILITLIIVILSTTINADNKKLPIPRFVSIKSNEVNARSG
PTTKSAVEWVFVKKGEPVEITAEYKQWRQVRDINGEGGWIHSSVLSGKRS
VVITSDKEIELTKSADHKSRVIAKLMPKVRCSLKKCKEQFCQITCKDYTG
WISKKVIWGVYDDNDRY
>RC0770 unknown
MKYVNSKIVKILSQLTSQKYLIKNARLSSTEFEQFNSFFKARCGSNFALR
FRNYADYRGINEVIAKGDGNLNKFQLRKIYGNPIAPYERVITKPVNNSVM
LYINNVRTMGIVDYNDGIVNLPSPLGQDVILTTDFTFDVAVRLSIDSFEY
SYCRRFYSVIQHRVSGGDYMSIAIEEVITKLTNFLFSN
>RC0489 unknown
MILITPRKFILIKYVRKFIDWFEQTFVSRLNNRNKGAIVLVMQRLHTDDL
SGYLLNNSNSWHHLKILAISIQDYSFKLMNKEYQYLSGQVIRQLLKNLLI
V
>RC0291 unknown
MYIIRYTIQVQKDAKKIVQAGLKNKVEVLLNIVSTDPWKIYPPYEKLVGD
FSGCYSRRINIQHRLVYEVYKQEKVVKILRMYTYYE
>RC0670 unknown
MRKIFNCLYVALFRTIEDDGVEHSGYMSFMILLSIFPFLVFLLALTSFLG
ASELGQNFIQIFLESLPEQATESIEKRIRELLSAPPQSLMNLAIVGSIWT
ASSFVECLRTILNRVYQIKSPPPYIRRRLLSIIQFLIISALITFTMFLLV
VIPILFTKIPIILETIEKYKIILNFIRYFLILILLFLGASSLYYILPNVK
LNFIDVFPGALLTVILWIISGYLLSTYIVYYNQLNLMYGSLGSIIVTLIF
FYIINMIFIYGAEFNYLMKNYENIE
>RC0672 unknown
MEGLINGSVYYKIFDKTFNNSSHRYIKKNNSINETEIVENKKSITTYFKA
QDILILNQVHGNQIVNADESIIAVPEADGSITTKKNLILAVQSADCVPVL
LASGDGKIIGAAHAGWKGSINNIISNIVTKITEKGAKNLIAVIGPAIAQS
SYEVDDEYYKAFLSKDINNKQFFIHSIKENHYMFDLPAFVELKLKEAGVK
DIKNIAEDTYTNPLKYPSKRRSYHLQEPYNQNILSAIVMK
>RC1232 unknown
MDKHLKEYTESGKKNLIVVYILYLCGIVAPILPLIGVFFAYLNKDKGDNF
AISHYVFLFRTFCIGVLGWIVCFIFTFIMIGVVLYVILAVWYILRVAIGF
KYMIEDKAYPNPMTYWIK
>RC1044 unknown
MANRLAEENKDFENYINNIDTVYKKQKVNDSIWFNINDHVNKIPKQNKKK
K
>RC0314 unknown
MTKKIALLLLPFILISCNGLGPKRVKNIVELTPKLAIQTHEPIYLDSNAN
IYAFNANMLKNKQYSFARSKTITEPVFIGDMIYALDIRSNISAFSIEKNK
IIWSYNLSRHKKDNYIGGGILHHNGKLYVTYGSRLLVVLDAKSGYEIIRK
ELPDIIRIKPIVLNDNTVLVQTISNQTIALNAETLKTVWEHESLAEVLSA
SYFMTPIVQYDNVIVTYNSGQILALNITNGEVKWNFEFTNLNDRTAIPNF
DESSILCTPVHDNMNLYIATGLGKLIKLNVATGSVIWQVNAEDIQSMSLI
GNSLFVTNNARQIAAFNPETGKVKFVADLNDGQDPKKLKSAAFLVPFVGV
NNNNKRSLNVISVNGVLYSFDVDNNGLNMIPHVVKIIKNIRYYGLSANNN
LYFSTDSKIIFGSK
>RC0367 unknown
MSNTLSNESYTNLIKNLKQEISKARIRAHLAANKELIVLYWHIGNLILER
QNKEKWGSKVIQNISDDLRKEFPKMKGLSYQNLSYMRQFFAEYNNDQILQ
QAVGEIPWSHNIIILSKLNNINQRIWYAQQTIENGWSRNVLSWQIKSNLH
ERSAKKRYK
>RC0850 unknown
MTKNMRKQMLKVISIITIYLLLSSCSESTRDANGLLTDSQSTVIRNYIIS
QNSKNLKVNLKEKFGSNLKGVKLIGVQLINEDLSGIDLTSCEILRADFAG
SNLDKAILTNAIIQESNFADSVIKNISGYHSDFQGSIFHNITLQNTNFVQ
SNFSDTAFNKTTIINVNFENSKFSHVLWSNNTIDGVNFQKANLQNNSFKN
TNITNSIFYGTDLEKSIIKNTNFTNNYFESSNLSQTTLTAVIIKDSNFTQ
SIFNEVNFNNVQSNNSCFSYASFQDSTLQNISLTKCDLQNSTISSSVLKH
FKINNAILNNMSLNDNKFNTLSIKNSNANFVRINKTKGSNITLDNISYTN
NIFSNNDFKQFIVINTDLNSSEIINSNITNGQFNNINFSKSLIQNVNFSD
VKITLGNLNQVALINSTLTNTAVINSVLSNSQINNINYQAYSSFINTNVS
NNIILNSDNSSKILPNNIVINSVKDLQKITHLANMNLTNFDLSNLIFDRV
DFSNSIFKNANLTNTVIKNSILKEANFSAAILTKTDFSNSILTDSIFKSA
KIDQAGFNNSDLTNADFTETAIKDTSFDKAKTSGMKGVE
>RC0926 unknown
MPWSEVKEIAKEDYEYWFGRGKSFFIDCKYPLERGDFSKSAFELHQATAS
VYSNILLVFARYKPKLHDIRTLGGYCANL
>RC0808 unknown
MTNLNIYKCAEKDLNDYLGYIKDTPSVSLNDFIKNKYFAEDNDNIIITSL
ENMEINADLANANFQGTILTDAVFNNCDLTNTILCDSDLTNVKFNDCTFI
GTDFRGANLHYTDFNYKDYDNYKIPNLKDKIRDIKLSFSDLERLNQYIDK
DLEKEHIKEIVIDEATKTKKYILAGEDEKTLWDIKSKELKTKQEELETLK
QNLDNPGIATNLLNAFWHSAETIARNRQNELEKINKLQHEINKLETEVYA
LDNLRMFCGKGLDGIFEQLKDEEIQITLDPSYIIGSTATERDIPKEYIKL
TSAEFDLYLAEAAKQSNTKLSLTEFVRKQKNLSEDLNIVPDLSAINLSGK
TLTNLNLKNTLFASANLENVKISNCNLDFTNFEGANLQNAVFQNVTARNA
GFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKL
IIKDSEWTNSNLTGISLAYADMQRVQMQGVVLNNALLDQANIVSTNLENV
FMNKTHALEAKFKEQCNMQGITARNAYFSDAEFENILSLKEADLRETIMQ
RVKLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEAE
GLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAA
EAIFKEAKLKQANLKAANLAGINKEGADFDKAKINDATKMHDTKGEAKGN
LDHQDKDGKKTSVNVNEHAKLQDKIHAREKSGWFLKTGVGQLCTKLAKST
TSGISRVTNFLASKKFLVGLAVVAGLAVAAAPFVAMPVLLVTGTALTTKA
VILGAGLLAGGLVATGTYKLTQKPLRNLQKSFENLTSSIDKYISPPPKNI
DELVTAKQQARQKAETEKSKEREENLNNVNNNIDKAKEQDLLKQAQNNLN
QATPKVEIKEKKDKTVEKQQAKSNTFAAKFKPNTKGKGFATKIKDEKKSR
SIKEAYN
>RC0529 similarity to cell filamentation proteins (fic)
MNSKPSFQITNKILELSQDISYELGILAGSKFYSQPIKLRKNNQIKTIHS
SLAIEGNSLSVEQITDIINDKRVLAPEKDIVEVKNAIKLYNNLTIFNPFK
IESLLKAHEILMQGLVEDNGKWRKGNASIFKGTEIIYFAPTARRVSLLMQ
DLFEFIAQDKQISGIVKACIFHYEFEFIHPFSDGNGRIGRLWQQLLLMQA
NKIFEYISVESLIRNNQSEYYSVLSKCDKLGESTLFIEFMLDKIVAALRL
YSNNITYEANTPLSRMEFAKVNLIDQWFSRKDYITVHKNISTATASRDLL
YGLERKLLISKGDKNQTYYKFV
>RC0825 unknown
MSIKLDSYEQDIEDNFEKQQKIDDRSEIALLQKFAKAHLSTKRSVTVRVA
EHDIEAIKIKVSKHGLPYQTYLNMLIHLDPT
>RC0071 unknown
MSIESKIRQLIDQNGYITCDVLMQEVLNLNPTSYYKQVKSLANEGDFVTA
PEISQLFGEIIGLWCIREWQRIGCPKSLSLVELGPGRGLLMRDLLRTAKL
VPEFYKALSIELIEINKNFIAYQKANLQDINLPISHQSFVEDIPKKPTII
IANEFFDAIPIKQYIKVKELWYERIFVVQPVDERIKYDKISVNKQLQEYL
LCTHIEAKDGAVLEESYKSIEIIKFIAQHLKRLSGSGLIIDYGYDIAPNG
RTRYQYNQTLQAVKNHKYCPILENLGEADLSAHVDFYALKTVAKNSKINV
IDTISQRDFLIENGILLRKQTLQDKLNDRHLAKFAYREEFKGDTKRSTAA
YTLVREDASIGSTYKLPLEVEFGKMSEQAQIIERQVERLISPKQMGELFK
VLQIMN
>RC0312 unknown
MKLIVLLFTFLFSMFSFGESETIKGKPLKYAANNDFENRLDEQEQEIRRL
IGKVEVLQHKIDMLKQNSNISNQEENTEVLEAGDSKKQDVFDIALLKDMP
DNAPKKPIAVNKDIAPDKQAYDLALAAYKDNKLTEAKDKFKHFIQNYPNS
LLISNAYFWYGECFFKQKDYNRAAVNYLKGYKELPKGAKSSDGLLKLALS
LGELKKTQEACNMLAKFDKEFPTNRTAASKKMAEDAKIKFGCKNK
>RC1362 unknown
MYKSMEIVNSVDASVSCQGKEPPYDHPKVYLEIDKKKKEVICPYCSKKFK
LVTK
>RC0477 unknown
MRNKENISNIILDAEENALLESFENDEWQRIKNFEQEKHISQVAAANYLK
KDTRINIRISSSDLMRIKQKAAYEGLPYQTLISSILHKYSAGHG
>RC0858 unknown
MNVFLSKYVNGVDKKSRVSVPANYRAVLGKELFNGVIAYPSIRNNCIEVC
GISHIEKLRQMIETLDPYSEERDAFETMIFGEAVQLSFDGEGRVILPQSL
MKHAGIEEQACFVGKGVIFEIWQPQNFEKYLNAAQKIAHEKRLTLRNAH
>RC0740 unknown
MSQKLKIILKGFGMYAHKTIENGWSRNVLTLQIKSNLCARSGKSINNFGH
TLPAMQSDLAKSIIKNPYNLEFLDIKENILERELENKLIDNIKDFLLELG
QDFVFIGNPYL
>RC0488 unknown
MLHSKELLNSILRQDFHSFIIKVFNTINPGAEYYPSKHIRIITDYLNAVQ
SGDINRLIINIPPRSLLQSICVSVAWPAYLLVVNPTKRIMVASYSQILSI
KHSLDCQFILNSDWYTELFPSTILSKPHNQKSKFLTTANGFRFATLVGGS
ATGEGGDILIIDDPHNPTQIHSYKIRKKVYRLV
>RC0865 unknown
MLKTLLISFITLICAVNYADADQTAQNPNSTTSSSDQPDDLPAEAAVHFA
QPWARPTTNVQGKVSNSAMYFTLINSRSKSYNLVNISSDKISGIEIHQTI
NDQGVSKMVKVDYPFLIAGNINVDFKPGDMHIMLYDPKVDLNVGDEFKIT
FFFDDNTRKIVNVKVANDNPYNKTGN
>RC0093 hesB1, hesB protein
MTITITDRAFERVCELVKLEKDKNLVLRISVDSGGCSGLMYNYELVSKDN
IEQDDYVITKHNATIIIDPISQKFILDCTLDFIEELGSSYFNVSNPQAKA
KCGCGNSFSV
>RC0728 hesB2, hesB protein
MKNVISLTDSAAKQIKLLIEKRAKPTFGIRVGVKSGGCAGQTYYVEYADS
KNQFDEVVEEKGVRILIDPKALMYILGSEMDYVETKFKSQFTFTNPNEKA
SCGCGKSFRV
>RC1273 rompA, 190-kDa cell surface antigen
MANISPKLFQKAIQQGLKAALFTTSTAAIMLSSSGALGIAVSGVIATNNN
AAFSDNVGNNWNEITAAGVANGTPARGPQNNWAFTYGGDYTITADVADHI
ITAINVADTTPIGLNIAQNTVVGSIVTGGNLLPVTITAGKSLTLNGNNAD
AANHGFGAPADNYTGLGNIALGGANAALIIQSAAPAKITLAGNINGGGII
TVKTDAAINGTIGNTNALATVNVGAGIATLEGAIIKATTTKLTNAASVLT
LTNVNAVLTGAIDNTTGVDNVGVLNLNGALSQVTGNIGNTNALATISVGA
GKATLGGAVIKATTTKLTDNASAVTFTNPVVVTGAIDNTGNANNGIVTFT
GDSTVTGNIGNTNALATISVGAGKATLGGAIIKATTTKLTDNASAVTFTN
PVVVTGAIDNTGNANNGIVTFTGDSTVTGNIGNTNALATISVGAGKATLG
GAIIKATTTKLTDNASAVTFTNPVVVTGAIDNTGNANNGIVTFTGDSTVT
GNIGNTNALATISVGAGKATLGGAIIKATTTKLTDNASAVTFTNPVVVTG
AIDNTGNANNGIVTFTGDSTVTGNIGNTNALATISVGAGKATLGGAIIKA
TTTKLTDNASAVTFTNPVVVTGAIDNTGNANNGIVTFTGNSTVTGNIGNT
NALATVNVGAGIATLEGAVIKATTTKLTNAASVLTLTNVNAVLTGAIDNT
TGVDNVGVLNLNGALSQVTGNIGNTNALATISVGAGKATLGGAVIKATTT
KLTDNASAVTFTNPVVVTGAIDNTGNANNGIATFTGDSTVTGNIGNTNAL
ATVNVGAGLLRVQGGVVKSNTINLTDNASAVTFTNPVVVTGAIDNTGNAN
NGIVTFTGDSTVTGNIGNTNALATISVGAGKATLGGAIIKATTTKLTDNA
SAVTFTNPVVVTGAIDNTGNANNGIVTFTGDSTVTGNIGNTNALATVNVG
AGVTLQAGGSLDANNIDFGARSTLEFNGPLDGGGNAIPYYFKGAIANGNN
AILNVNTKLLTAYHLTIGTVAEINIGAGNLFAIDASAGDVTILNAQDIHF
RALDSALVLSNLTGVGVNNILLAADLVAPGVDEGTVVFDGGVNGLNIGSN
VAGAARNIGDVGGNKFNTLLIYNAVTITDDVNLEGIQNVLINNNADFTSS
TAFNAGTIQINDATYTIDANNGNLNIPAGNIKFAHADAQLILQNSSGNDR
TITLGANIDPDNDDEGIVILNSVTAGKKLTIAGGKTFGGAHKLQDIVFKG
EGDFGTAGTTFNTTNIVLDITGQLELGATTANVVLFKDAVQLTQTGNIGG
FLDFNAKNGTVTLNNNVNVAGTVKNTGGTNNGTLIVLGASNLNRVNGIAM
LKVGAGNVTIAKGGNVKIGEIQGTGTNTLTLPAHFKLTGSINKTGGQALK
LNFMNGGSVSGVVGTAANSVGDITTAGATSFASSVNAKGTATLGGTTSFA
HTFTNTGAVTLAKGSITSFAKNVTATSFVANSATINFGNSLAFNSNITGS
GTTLTLGANQVTYTGTGSFTDTLTLNTTFDGAAKSGGNILIKSGSTLDLS
GVSNLALVVTATNFDMNNISPDTKYTVISAETAGGLKPTPKENVKITINN
DNRFVDFTFDASTLTLFAEDIAAGVIDEDFAPGGPLANIPNAANIKKSLE
LMEDAPNGSDARQAFNNFGLMTPLQEADATTHLMQDVVKPSDTIAAVNNQ
VVASNISSNITALNARMDKVQAGNKGPVSSGDEDMDAKFGAWISPFVGNA
TQKMCNSISGYKSDTTGGTIGFDGFVSDDLVLGLAYTRADTDIKLKNNKT
GDKNKVESNIYSLYGLYSVPYENLFVEAIASYSDNKIRSKSRRVIATTLE
TVGYQTANGKYKSESYTGQLMAGYTYMMSENINLTPLAGLRYSTIKDKSY
KETGTTYQNLTVKGKNYNTFDGLLGAKVSSNINVNEIVLTPELYAMVDYA
FKNKVSAIDARLQGMTAPLPTNSFKQSKTSFDVGVGVTAKHKMMEYGINY
DTNIGSKYFAQQGSVKVRVNF
>RC1085 rompB, outer membrane protein B (cell surface antigen sca5)
MAQKPNFLKKLISAGLVTASTATIVASFAGSAMGAAIQQNRTTNAVATTV
DGVGFDQTAVPANVAVPLNAVITAGVNKGITLNTPAGSFNGLFLNTANNL
DVTVREDTTLGFITNVVNNANHFNLMLNAGKTLTITGQGITNVQAAATKN
ANNVVAQVNNGAAIDNNDLQGVGRIDCGAAASTLVFNLANPTTQKAPLIL
GDNAVIVNGANGTLNVTNGFIKVSSKSFATVNVINIGDGQGIMFNTDADN
VNTLNLQANGATITFNGTDGTGRLVLLSKNAAATDFNVTGSLGGNLKGII
EFNTVAVNGQLKANAGANAAVIGTNNGAGRAAGFVVSVDNGKVATIDGQV
YAKDMVIQSANAVGQVNFRHIVDVGTDGTTAFKTAASKVAITQNSNFGTT
DFGNLAAQIIVPNTMTLNGNFTGDASNPGNTAGVITFDANGTLASASADA
NVAVTNNITAIEASGAGVVQLSGTHAAELRLGNAGSVFKLADGTVINGKV
NQTALVGGALAAGTITLDGSATITGDIGNAGGAAALQGITLANDATKTLT
LGGANIIGANGGTINFQANGGTIKLTSTQNNIVVDFDLAIATDQTGVVDA
SSLTNAQTLTINGKIGTVGANNKTLGQFNIGSSKTVLSDGDVAINELVIG
NNGAVQFAHNTYLITRTTNAAGQGKIIFNPVVNNNTTLATGTNLGSATNP
LAEINFGSKGAANVDTVLNVGKGVNLYATNITTTDANVGSFIFNAGGTNI
VSGTVGGQQGNKFNTVALDNGTTVKFLGNATFNGNTTIAANSTLQIGGNY
TADFVASADGTGIVEFVNTGPITVTLNKQAAPVNALKQITVSGPGNVVIN
EIGNAGNYHGAVTDTIAFENSSLGAVVFLPRGIPFNDAGNRIPLTIKSTV
GNKTATGFDVPSVIVLGVDSVIADGQVIGDQNNIVGLGLGSDNDIIVNAT
TLYAGIGTINNNQGTVTLSGGIPNTPGTVYGLGTGIGASKFKQVTFTTDY
NNLGNIIATNATINDGVTVTTGGIAGIGFDGKITLGSVNGNGNVRFVDGI
LSHSTSMIGTTKANNGTVTYLGNAFVGNIGDSDTPVASVRFTGSDGGAGL
QGNIYSQVIDFGTYNLGISNSNVILGGGTTAINGKINLRTNTLTFASGTS
TWGNNTSIETTLTLANGNIGNIVILEGAQVNATTTGTTTIKVQDNANANF
SGTQTYTLIQGGARFNGTLGGPNFVVTGSNRFVNYGLIRAANQDYVITRT
NNAENVVTNDIANSSFGGAPGVGQNVTTFVNATNTAAYNNLLLAKNSANS
ANFVGAIVTDTSAAITNAQLDVAKDIQAQLGNRLGALRYLGTPETAEMAG
PEAGAIPAAVAAGDEAVDNVAYGIWAKPFYTDAHQSKKGGLAGYKAKTTG
VVIGLDTLANDNLMIGAAIGITKTDIKHQDYKKGDKTDVNGFSFSLYGAQ
QLVKNFFAQGSAIFSLNQVKNKSQRYFFDANGNMSKQIAAGHYDNMTFGG
NLTVGYDYNAMQGVLVTPMAGLSYLKSSDENYKETGTTVANKQVNSKFSD
RTDLIVGAKVAGSTMNITDLAVYPEVHAFVVHKVTGRLSKTQSVLDGQVT
PCISQPDRTAKTSYNLGLSASIRSDAKMEYGIGYDAQISSKYTAHQGTLK
VRVNF
>RC1113 surf1, surfeit locus protein 1
MKTNFLVFITFTILISLGFWQLSRLKEKKLFLASMQANLTSPAINLAEIQ
DGLPYHKVKITGQFLPNKDIYLYGRRSMSSEKDGYYLVTPFKTIEDKVIL
VARGWFSNRNKNIITQATNDRQHEIIGVTMPSEKTRIYLPANDIKNNVWL
TLNLKETSKVLGLDLENFYIIAEGKDISNLDILLPLAINHLAAIRNDHLE
YALTWFGLAISLIVIYVIYRRRYMAVDVIPRACSGIQKNN