Gene list
Applied filters:
COG category: Function unknown
Organism: Rickettsia conorii str. Malish 7, Malish 7
Gene type: CDS
Number of genes found: 75
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Rickettsia conorii str. Malish 7, Malish 7 >RC0660 unknown MPSSYKLRKKIWKSVYLLITVGILYIGYILIKSGYINEKNDINVTKKSLK DNKNFDLKYNIILKDSIFEGVNKNLNAYKIKTERAIKESNNKYKLDIINA IYNVNQDQTLIINAKEGFLDEESSILDLKNDVKLFFDEIIFNTNDARIDL VNKNITGHSPAKLLYKNSSITSDSFNTKDENNIIIFKGNVSTIIDLSD >RC1071 unknown MQKEMKMKTNDTFDGFTIELFKDTDGDWLARFEELPNVSAFGNSPEKALQ ELQQAWTLMKESYISHNQSIPLAPSRKEYSGQFNVRVDKRVHRALVLEAL RAKISLNALVSQKLTLSVKQ >RC0474 unknown MIVIYNNIFKNFFTDMIKKIAAGVIICFSLLLFIMFGALSFVNYNSVTNN FTSHLGIAKENIGKIKMNKFPLSYLVIETIREEGKLDLEQIKIHFSLWSL IKFNPKINKIDILDAKFYSHSNVLNIYNHEELIKNFFKYKLQNINLNVTN LSIINKQDYSILNFNNCILKKENALSSNYIFKTTSNYIGKISGSINKRDD IVDFSLNIDNNDYGFKLSQIYKDSKLTSGSGEYQIKNLASVMYNILPDLN HLFNKFNQHEAVNVKFNISNNEDAIELKDIVIASSFIIGHGFVNIAKNDN ITTNVKLDFPKIDLSSLISPNAEVTFNTSSSNSRFIFANKLLKADVAINE IILSNNEELKKIVFSSNLLKGTLKINEFSGNIKSGGEFKLTGNVTQNAVR SMFDGQLYLKHNDINSLLNILGFNDVTIKEAIPFSLSSDLKLTLIDLFFK NLLLKTDNLNLSGNFSSKFIAQTPRLDATLNISSLDLSSRTYPIISPLIE FTKNLTKDMKALDYPSKFIPIRTIGYLANLDILIDSVKYNDHVFDKMNLL AKIVPANIKISNLDFKTANSYLSTSWNLDASSVLPSLTVEIKDGNLTTDL LSPAGMLNLRNKLINDYSLDKATLQVSGTLSTLLQNDLILKNVKFYVANN NNLLQFNNIEAELLGGKFQGNGNILLEPYAINFVYALNTIDLNKVSALMP KIFTASGGKISISGSLGTNGNTLQSQLYNLTTKSQFAINNIDVNNFAIDA FIEKIDTADYKVQNLDKDINSAITTGQENIRGISGDIELQKGIALLKKVK FATQYSSGAASVAVNIYNFDMDASSILSFYVPARLVKLNTSNTSSDKDSL AHLNIKMQGSIFAPKKTFDSSELKKLLIPQTTEDTITTDNH >RC1372 putative integral membrane protein MASYYLWFKSFHLISAICWMAGLLYLPRIYVYHIKAKIGSELDSTLQVME LKLLRFIMNPAMISTFIFGLINAHIYGFVALDTWFHIKMFAVLILVIFHG LLARWRKDFANGKNVHSEKFYRIVNEIPAICMIVAVIMVIVKPFD >RC0487 unknown MKLLKIIFIITICINFPIFAENLESVTTTEDIENDVFIPLDENHPILNPN DNINDSSEFKNYTNGKIIALNNITATSEEIDFKVGEEKYFGNIKIKLHKC IKNLDPYNEDNYLLMTITEYKIDEDPNVLFQGWMISSSISLSTFEHPIYE IFAKDCF >RC0102 unknown MPLSTSLLGKKSTYKDSYDVTLLFKIPRINNRNELGINSNNLPFYGVDVW NTYELSCLNKNGKPWVGVGTFYIPTDSENIVESKSFKLYLNSFNNFVVES VKELERIILQDLSNVTHAKVTGRIFPINTKVEFGVPSGKNIDDLDIVCNN YGAPDNSLIEYEDVLVEEEINSHLLKSNCLVTGQPDWGTIVIKYKGKKLK YDSFLKYLISFRNCNEFAEQCAERIFTDIKNAISPDFLSIYIVYARRGGI DICPYRSTDKSYTLPSDKRFIRQ >RC1110 unknown MLPNLGIIAGRGSLPYLIAGNYTKQGGNCYIAAIKDEADIEQIKDFEYKI LKIGMVGEAIKYFKEHKVKNIIFIGGVNRPNFKNLAVDKIGGLLLFKIVG QKIRGDDSLLKIVADFFESYGFKVISSNEIYKNQQGNSNIITNTNPISSD KNDIELGIKLLNHLSAFDIAQSVIVESGYILGIEAAEGTDNLITRCADLR KNPHGGVLVKIAKLGQDNRLDMPTIGPNTIKNLAKYNYKGVAIQKNNVII VEEELTIKLANKHKIFITKC >RC0365 unknown MYVVIELKTGKFKPEYAGKLNFYLNLMERTIKDNSDNPTIGLILCEEKQG ITVEYAIEGIQKTNRSITI >RC1034 unknown MSKSQEQEQESIDNIRQQFDEEYKNLSWVQVGYLALGADTGNKQYDELNK RYLDLNKDNLGIIQKVLSKQSGMFLVHCSLPGRLLKFYLILHWSLITSEK REVLPLV >RC0638 similarity to late-developmental spore coat protein MYSSATPLAFGTYVPTVDALQTNTIIIKCILGTNYTVALNAGTAPAATTS TRKMTGVVNTNSYLPYNLYSNAGRTQNWGHQSSDWVMGTGLDQTLTIYGK ILQGANVPSDTYNDTITIIVAY >RC0766 unknown MKETKRFFYKNNRLNKGYAKTFSVNEPDNNFYRKKFEHILPPIDLISEYE SIYPGTLQELMHMAQQEQAHRHAIDLKNLKIQARVAKLTRICLLIFGICL VVSIFFKTF >RC0681 unknown MFIICSAPRFADSLLFFQLIFVYYFSFYFRVYNTTFERKEINMAGHSKFK NIQHRKGAQDKKRAKVFTKLIREIVTAAKTGSSNNPENNPRLRNALTAAR SQNLPKERIDKAINSANDSSNNENYTEIRYEGYAPNGIAIIVEALTDNKN RTAAEVRSSFTKYGGSLGETGSVNYLFNHCGVIQYPINIASNKDILEAVI EAGGHDIISDDTTHTIYTDIENFSKVLEFLTGKYGIPEDSYIGWIPLNTI IIDDKEKAEKLLKLVEVLEESDDVQRVFGNYELSDDVYEIIQGEP >RC1356 unknown MIRKFLLTIFALWVGGFGYYLYLINSYKLNSNTTNAIIVFAGGGHKIETG IAWLKAGYAPILFITGIESTEQLKILLKERNVIEQQVIFAPNKIMSEEDN IKKVVDFIVTYNLTSIILVEHNYNMPFMLNKLEKAIPSSNNIYIVPYPVF SKQKYDVLLKSYHRYLMSILV >RC1253 unknown MKKETEELKLFILECLSEKKAEDIEVIDLTEKHKLADYIIFASGRSTKNV GAIAEYVALELKNNAGINSNIEGLGKSEWVLIDAGTILINIFYPEVREHF KLEEIWKR >RC0307 unknown MSKDNKKNQDMSIEDILKSIKGVINERKNPIHENDSEDEDVLELTEIVNQ DEEEKLISTKSAEEVGDIFKNFTDTIKDKKLDNNISSKNALEELVIEMLK PELKAWLDKNLPVLVKELVEIEIKKLVQNSKR >RC0768 unknown MIILEDIIPDYVKDAEEVKITAGCDKNFITCCNKFNNAINFRDEPLIPKT DFINLV >RC0608 unknown MKKQIIYPDFIARIFSTALDLSVFAFIAIPISQFCSFNLLWLFFNDYFLS NNINLHNPNEMFNSVMSQEFYEYLKAGNFNKYILFNISIFATNILVIGSY FITLWYYKGATLSKMFLRMKIVDAVTLNRPTLKQLIKRFLGYMTFPIGIF FILFSSKKQALHDKIAGTVVIKS >RC1154 unknown MIKVKFMRSALITSILVAVAFLTSACNTMQGAGQDIQVAGKKLKDSAESN KPQKGCGCPHSSAN >RC0119 unknown MRKVFKKFLKNNKYVLSIITILLYWYLRFVYFTSKQKFIFYDNGNKEKFL NEQGVIFAFWHNMLALSPSMFIGHKNIYALISPHLDGKILNDLVGKFGCR VIVGSTNKNPIGALRNIIGKLSQGANIIVTPDGPKGPVYKVNSGITEIAY RYNTKLITIVSSTSRCFRLKSWDKLIIPLPFGTIKIIVGSPLELTNDKIQ NHISLEQQLASLTESLKK >RC0001 unknown MTKLIIHLVSDSSVQTAKYTANSALAQFTSVKPKLYHWPMIRNLELLNEV LSKIEYKHGIVLYTIADQELRKTLTKFCYELKIPCISVIGKIIKEMSVFS GIEIEKEQNYNYKFDKTYFDTLNAIDYAIRHDDGQMLNELSEADIILIGP SRTSKTPTSVFLAYNGLKAANIPYVYNCPFPDFIEKDIDQLVVGLVINPN RLIEIREARLNLLQINENKSYTDFNIVQKECLEVRKICDQRNWPVIDVST RSIEETAALIMRIYYNRKNKYNK >RC1301 unknown MDKFYNYNSSSHQALLSFKVKPNSKQNLISNFVIINNIPYLKLSIKAIPE QGKANEEIINYLAKEWKLSRSNIEIIKGHTHSLKTILIKNINEDYLNLII NSYIK >RC0719 unknown MRRKLYENSLAHSWKQAGIEEGRKKEKITMAKEMKKEGLSLETIMTITKL DKKDIEKLK >RC0510 unknown MNQEERNLIMNKYILDSSALLALFNLETGSDKVEELLPLSIMSTVNIAEV VAELDKKLNISFIQSKAMISASINKIVALDFDQAIEIGRLKKETEQFGLS LGDRACISLGLITGYPIYTADKIWAKLQLNCKIVLIR >RC0103 unknown MNSQKTLPLLPKATAIWLIENTSLTFKQIADFCGIHEFEIKGMADGEVAQ SIKGLNPIANGQLTLEEIERCSKDPNANLQISYSPAYELMKNQKKKRAKY TPIARRQDKPDAIYWLLSNYPNIQDHQIIKLIGTTKITIDAIRTRSHWNM NSIRPRDPVLLGICSQIDLNKIVESLKPPQNPTKES >RC1173 unknown MSSMLKQIRLESGKTLNQVSSDLKIRKKYLVALEEGDFDVLPGEVYVRGY LKLYLDYLNVKDRNAEQIEATKQNETEKLLNNKRATVINYKRKKQLVLIS IIMLSIIIVSHPFIINA >RC0604 unknown MSPQIIELLIFAVIAFYIINKLITTLGSTSEEEQTKQKSYFGEPIIKDVT YSIVKSNKEEKNIPTAQDIKAFKDIIVEHNITAIVDGMEQVHKRLYSFDP VKFINNAKTAFQMIIEAAYKKDAKELSELIDKRYLEEFEKITPSYGDFFD SSALSAKYSEIYMFGNNIFIKLLFQGKNVVDKIEDLKEEWTFTRNANTKE VDWFLSNIERV >RC0950 unknown MWFIELYPIIQLYVKVKNMSIQEEHHIKKISFVQSLLELLPFNEWNNKLL EEAEEKCGFAKGYSLIVFPEGLSEIVGFLEEYLDNIMLESLKIIAEPSKI REKISLAVKTRVKTVLPIIHSKNAAYFALNPIQGTEVAFRSCDAIWRYAG DKSLDFNYYTKRSLLLSVYVSSILFYIQDESENYIETDKLIETAVENIVK TFSQMKKLLAPSRIPIVRMFT >RC1144 unknown MQKKLTITIDEAVYYRLYSVIGERKISKFIEQLVKPYVINEKLGAAYKAM AQDIKAEEKANEWVEGLIERDFYEKS >RC0914 unknown MNKLNKNSLQKKLFYRSKNRGCREMDYILSSFAEKYLSLMDETQLGSYSL ILDQNDNDLYNWINNKSSAPSYLDAEIIDKLHKIAKI >RC0366 unknown MHNKPSKMVGQEMYYLGKLKVIYMKGQRKKGINNFSNTLPELQSDLARSI IKEPYNLDFLDIQGKIIERDLENQLIDNIKNFLLELGQGFAFVGNQYHIE LEGEDYYLDLVFYHIKLKCYVCCY >RC0153 unknown MSEVVVKEQLEQYISKIERLEQEKADLSQEVKDIFQDASSHGFDVKAMKS ILKLKKLDKDKLAEQDAMLELYRDTLGI >RC0364 unknown MQKYGPMILFNGGAYDNKLLQEALDKNIITDYPKENFYIFTLPEYQTNIG ADNLTLYIKSMKTVILT >RC1171 unknown MSIVTVTLNNKSFQLYCNNGDEEELLSLANKLNDKIAEIKLGSPTASFEL LLVMASLNAQAEIVSLTEKLNKNGLQKNHPDEEKFAETLTTIAGYLENLA RKMGK >RC1320 unknown MSEVFEIPNGESKVLLHCCCAPCVGPLMEKMIDTGIKFMLFFYNPNIHPK KEYELRKNENIKFAEKHNIEFIDADYDPQNWFRRAKGMELEPERGIRCTM CFDMRFERTALYAYENGFKVITSSLGISRWKDMNQINESGTRAASHYEGV TYWTYNWRKDGGASRMYEIAKEEHFYKQEFCGCVYSFRDTNDWRVANNRP KIEIGKEYY >RC0818 unknown MQTIEQQIANVIEESLTDMGFELVLVKFKGVNPKVVEILIDSLNSEKISV EDCTKASRTISAILDVEDLIEAAYSLEVASSGLERPLVKFENYNRFLERE VKIKLKELLNGKTRYQGKIIKAENNKIYLKCEEQEVLIDYDLIKNANLVL TEEVFKKLLKQ >RC0754 unknown MLAIISSAKTLNFEKLAPKTELTIPMFLTLTNKLLSTLQSYSENQLSKIM NISAKLALINKERFKDFDNQESKAAIFTYAGDVFNNIHIEKLTNHALNFL QSHLLIISGLYGVLKPLDTIKPYRLEMATKLNEINLTNFWQDEVTNYINK ILAKQENKYLLNLASQEYSSVINPNKLKYQLVNVHFKENRNGKLSRIGIN AKKARGAMVKVIANNLIDSPELLKNFSYLGYAFSTKHSSDNELVFIKS >RC1057 unknown MLHSMSYLLPTIITTLLILLALIIWFYVKTQTLRTQLQFLSEQNLEISNN NQLLNQEQIGYLQKIEQLQCKIEYQAQTIKDSEKIREESFSSAKAALFDL GQDLSKQLIEIHKMENTAARELAEKNIATASGKFNSEFERLITMVGALNK DIEQSKGTVDLIKQSLLSPIGAGLLAEITLENILKSSGLRPNLDFIMQYG LTTLDSGKLRPDALIFLPSGNLMVIDSKASKFLVDKQDNNMSLNKTMNYH LKSLANKEYAANILTNLNKKDQSFNNVMTLMFLPTEQAVEKVIAADPEFL QKAWGCNIFPVGPSGLMNMLSFAKFQITDHRRSENYKVIIEEVRKLLSSI GTMADYSQKIGNNLHNMVTNYDKFAASFNRNFMSRVKNIQKLGIDSGNKA MPATLERYQIVSSKSEIIEVEAENPPQIAEKL >RC0028 unknown MASSRNDGDKLMNTEKLKDIKARIKDLTTPKFSNPKIRQEISPFTIAVDL VSGTMLGVVIGIFTDKIFNSKPLFLIIFTIIGMIAGFNIIRKKVNNKK >RC0697 unknown MRCSSYCTSSEYKMSDLVTNLKKIGLEPQHFDDVLYIRKEINKDSDFIEI FFFPFGCVTIWGGDEIQEKIVLSDTDLVPVNKLKEPVSDYIYFEYNTEVK KTFIDEEKNKIILADKSVFVKLSISHALAQSVKLSVLEQSVSNLIVQTTP IQQELARTGSVSLSKKEILQQIGILFNERYSISLHSDIFDTPEFFWRRPS YEPLYLMTAEFQDIEIRQNIMNHRLNMIHELLNILSNDLNYKHSTKLEWI IIILIGLEVVLSLSHTNLFLKIIGAL >RC0659 unknown MSLQSSIYRIRHLAKPAYREEFKGDIECSTAAYKEVLEDTSTDSTSKLPL EAKFGKMSIIKLVLLLIISTIIYANDKNISNLHITSDSLIIDRTKQKAAY LGNVIVYFDNAILRTKELYIFYKTIDEKQTIDHIVVPTKLTVERKINNEL LLADSAKYFCDNKQLILLGNVILQRDDNVLKTNKLIYYVDIIKK >RC0781 similarity to N-terminal of biotin-protein ligase MKIKIYNDLGVSKESIKHCVHTLRLYAPKYNVDYITAQEIIDEKWVQNTL LLILLGGRDLYYVQKLQGKGNANIKNYIKNGGNFLGICAGSYYSGNYVEF AKGTNIEVISKRELKIFNGTVRGPLLAPYCYNSHKGARAAYLKINPTLNL NIKDCYAFYNGGGYFIDAENTKDTEIIASYEDSQAAIIKCTYGDGTAILS GVHLEYEPALIKNQSLNNIHKILKAHDTERIKLLNYILDFFGNTLIERNL >RC1337 unknown MVNFNQFLKQAQSMQKKMQEAQEQMANARYTGKAGGGLVEVIATGKGEVE KISIDESLLKAEEKEMLEDLIKVAFNDAQQKCDEDSQNSLSGALNGMRLP PGFKMPF >RC0209 unknown MDDKKDNRHLSKPAYREECTGDTERSTTAYMDILEDVSTGSTSKLPLEAK FVKISNNISEKENLPKEKEIGGVKGLEPTRYGDWQHKGKVTDF >RC1029 unknown MSPKFMTFYEKYMTIVGTIGNFMFYVQAHKIFTCQSSASVSMPAFTISAI ALCSWLIYGILIKNTPIIIANIVGFIGALLVLLTIIIY >RC1109 unknown MNIEYKKFVHEYMLEFVKKILTKIQHENLYWDQLIYISYRTDNPAVILPS KVKQAYPKQITIVLQYQFENLIVNDTGFSLTVSFDGVKEIIYVPFDALIS FVDFNNNYSLTFNQSLNIHENPQHEEAISNNKSDQTSSSSSPNVIMLDKF RNSSKPS >RC0210 unknown MSIIFFIVHSTKYMKYNRIYINSRLAENSKIELASDHHVHYVKTVLRLKV NDGLRLFNGTDGEFLAQINDIGKNNLSVRLKEQLKKPYTKSTLTLAVAII KQDKLMLAINMATQLGITKIIPLITRWCQFRSVNIERLTKCVIEATEQSE RLTPPIIEKAITIQDYLKKNNNLMLYANEHEKEENSILRISSSLSNSDIT IIVGPEGGFTNDELELLASYKNTKSISLGSNILRAETAAITAIAQVRLLG SHCEEIA >RC0646 unknown MINGVRKLNYNLYKDAGFTQILGNGTSSTVTFTDSYTLGLEPVTKNYTVY ARLPSQPLAAASTFQDTITVTQPWLYNKKSLKSKFFSLGSNFVAYMD >RC1251 unknown MLYLKWKIMIKILITLIIVILSTTINADNKKLPIPRFVSIKSNEVNARSG PTTKSAVEWVFVKKGEPVEITAEYKQWRQVRDINGEGGWIHSSVLSGKRS VVITSDKEIELTKSADHKSRVIAKLMPKVRCSLKKCKEQFCQITCKDYTG WISKKVIWGVYDDNDRY >RC0770 unknown MKYVNSKIVKILSQLTSQKYLIKNARLSSTEFEQFNSFFKARCGSNFALR FRNYADYRGINEVIAKGDGNLNKFQLRKIYGNPIAPYERVITKPVNNSVM LYINNVRTMGIVDYNDGIVNLPSPLGQDVILTTDFTFDVAVRLSIDSFEY SYCRRFYSVIQHRVSGGDYMSIAIEEVITKLTNFLFSN >RC0489 unknown MILITPRKFILIKYVRKFIDWFEQTFVSRLNNRNKGAIVLVMQRLHTDDL SGYLLNNSNSWHHLKILAISIQDYSFKLMNKEYQYLSGQVIRQLLKNLLI V >RC0291 unknown MYIIRYTIQVQKDAKKIVQAGLKNKVEVLLNIVSTDPWKIYPPYEKLVGD FSGCYSRRINIQHRLVYEVYKQEKVVKILRMYTYYE >RC0670 unknown MRKIFNCLYVALFRTIEDDGVEHSGYMSFMILLSIFPFLVFLLALTSFLG ASELGQNFIQIFLESLPEQATESIEKRIRELLSAPPQSLMNLAIVGSIWT ASSFVECLRTILNRVYQIKSPPPYIRRRLLSIIQFLIISALITFTMFLLV VIPILFTKIPIILETIEKYKIILNFIRYFLILILLFLGASSLYYILPNVK LNFIDVFPGALLTVILWIISGYLLSTYIVYYNQLNLMYGSLGSIIVTLIF FYIINMIFIYGAEFNYLMKNYENIE >RC0672 unknown MEGLINGSVYYKIFDKTFNNSSHRYIKKNNSINETEIVENKKSITTYFKA QDILILNQVHGNQIVNADESIIAVPEADGSITTKKNLILAVQSADCVPVL LASGDGKIIGAAHAGWKGSINNIISNIVTKITEKGAKNLIAVIGPAIAQS SYEVDDEYYKAFLSKDINNKQFFIHSIKENHYMFDLPAFVELKLKEAGVK DIKNIAEDTYTNPLKYPSKRRSYHLQEPYNQNILSAIVMK >RC1232 unknown MDKHLKEYTESGKKNLIVVYILYLCGIVAPILPLIGVFFAYLNKDKGDNF AISHYVFLFRTFCIGVLGWIVCFIFTFIMIGVVLYVILAVWYILRVAIGF KYMIEDKAYPNPMTYWIK >RC1044 unknown MANRLAEENKDFENYINNIDTVYKKQKVNDSIWFNINDHVNKIPKQNKKK K >RC0314 unknown MTKKIALLLLPFILISCNGLGPKRVKNIVELTPKLAIQTHEPIYLDSNAN IYAFNANMLKNKQYSFARSKTITEPVFIGDMIYALDIRSNISAFSIEKNK IIWSYNLSRHKKDNYIGGGILHHNGKLYVTYGSRLLVVLDAKSGYEIIRK ELPDIIRIKPIVLNDNTVLVQTISNQTIALNAETLKTVWEHESLAEVLSA SYFMTPIVQYDNVIVTYNSGQILALNITNGEVKWNFEFTNLNDRTAIPNF DESSILCTPVHDNMNLYIATGLGKLIKLNVATGSVIWQVNAEDIQSMSLI GNSLFVTNNARQIAAFNPETGKVKFVADLNDGQDPKKLKSAAFLVPFVGV NNNNKRSLNVISVNGVLYSFDVDNNGLNMIPHVVKIIKNIRYYGLSANNN LYFSTDSKIIFGSK >RC0367 unknown MSNTLSNESYTNLIKNLKQEISKARIRAHLAANKELIVLYWHIGNLILER QNKEKWGSKVIQNISDDLRKEFPKMKGLSYQNLSYMRQFFAEYNNDQILQ QAVGEIPWSHNIIILSKLNNINQRIWYAQQTIENGWSRNVLSWQIKSNLH ERSAKKRYK >RC0850 unknown MTKNMRKQMLKVISIITIYLLLSSCSESTRDANGLLTDSQSTVIRNYIIS QNSKNLKVNLKEKFGSNLKGVKLIGVQLINEDLSGIDLTSCEILRADFAG SNLDKAILTNAIIQESNFADSVIKNISGYHSDFQGSIFHNITLQNTNFVQ SNFSDTAFNKTTIINVNFENSKFSHVLWSNNTIDGVNFQKANLQNNSFKN TNITNSIFYGTDLEKSIIKNTNFTNNYFESSNLSQTTLTAVIIKDSNFTQ SIFNEVNFNNVQSNNSCFSYASFQDSTLQNISLTKCDLQNSTISSSVLKH FKINNAILNNMSLNDNKFNTLSIKNSNANFVRINKTKGSNITLDNISYTN NIFSNNDFKQFIVINTDLNSSEIINSNITNGQFNNINFSKSLIQNVNFSD VKITLGNLNQVALINSTLTNTAVINSVLSNSQINNINYQAYSSFINTNVS NNIILNSDNSSKILPNNIVINSVKDLQKITHLANMNLTNFDLSNLIFDRV DFSNSIFKNANLTNTVIKNSILKEANFSAAILTKTDFSNSILTDSIFKSA KIDQAGFNNSDLTNADFTETAIKDTSFDKAKTSGMKGVE >RC0926 unknown MPWSEVKEIAKEDYEYWFGRGKSFFIDCKYPLERGDFSKSAFELHQATAS VYSNILLVFARYKPKLHDIRTLGGYCANL >RC0808 unknown MTNLNIYKCAEKDLNDYLGYIKDTPSVSLNDFIKNKYFAEDNDNIIITSL ENMEINADLANANFQGTILTDAVFNNCDLTNTILCDSDLTNVKFNDCTFI GTDFRGANLHYTDFNYKDYDNYKIPNLKDKIRDIKLSFSDLERLNQYIDK DLEKEHIKEIVIDEATKTKKYILAGEDEKTLWDIKSKELKTKQEELETLK QNLDNPGIATNLLNAFWHSAETIARNRQNELEKINKLQHEINKLETEVYA LDNLRMFCGKGLDGIFEQLKDEEIQITLDPSYIIGSTATERDIPKEYIKL TSAEFDLYLAEAAKQSNTKLSLTEFVRKQKNLSEDLNIVPDLSAINLSGK TLTNLNLKNTLFASANLENVKISNCNLDFTNFEGANLQNAVFQNVTARNA GFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKL IIKDSEWTNSNLTGISLAYADMQRVQMQGVVLNNALLDQANIVSTNLENV FMNKTHALEAKFKEQCNMQGITARNAYFSDAEFENILSLKEADLRETIMQ RVKLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEAE GLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAA EAIFKEAKLKQANLKAANLAGINKEGADFDKAKINDATKMHDTKGEAKGN LDHQDKDGKKTSVNVNEHAKLQDKIHAREKSGWFLKTGVGQLCTKLAKST TSGISRVTNFLASKKFLVGLAVVAGLAVAAAPFVAMPVLLVTGTALTTKA VILGAGLLAGGLVATGTYKLTQKPLRNLQKSFENLTSSIDKYISPPPKNI DELVTAKQQARQKAETEKSKEREENLNNVNNNIDKAKEQDLLKQAQNNLN QATPKVEIKEKKDKTVEKQQAKSNTFAAKFKPNTKGKGFATKIKDEKKSR SIKEAYN >RC0529 similarity to cell filamentation proteins (fic) MNSKPSFQITNKILELSQDISYELGILAGSKFYSQPIKLRKNNQIKTIHS SLAIEGNSLSVEQITDIINDKRVLAPEKDIVEVKNAIKLYNNLTIFNPFK IESLLKAHEILMQGLVEDNGKWRKGNASIFKGTEIIYFAPTARRVSLLMQ DLFEFIAQDKQISGIVKACIFHYEFEFIHPFSDGNGRIGRLWQQLLLMQA NKIFEYISVESLIRNNQSEYYSVLSKCDKLGESTLFIEFMLDKIVAALRL YSNNITYEANTPLSRMEFAKVNLIDQWFSRKDYITVHKNISTATASRDLL YGLERKLLISKGDKNQTYYKFV >RC0825 unknown MSIKLDSYEQDIEDNFEKQQKIDDRSEIALLQKFAKAHLSTKRSVTVRVA EHDIEAIKIKVSKHGLPYQTYLNMLIHLDPT >RC0071 unknown MSIESKIRQLIDQNGYITCDVLMQEVLNLNPTSYYKQVKSLANEGDFVTA PEISQLFGEIIGLWCIREWQRIGCPKSLSLVELGPGRGLLMRDLLRTAKL VPEFYKALSIELIEINKNFIAYQKANLQDINLPISHQSFVEDIPKKPTII IANEFFDAIPIKQYIKVKELWYERIFVVQPVDERIKYDKISVNKQLQEYL LCTHIEAKDGAVLEESYKSIEIIKFIAQHLKRLSGSGLIIDYGYDIAPNG RTRYQYNQTLQAVKNHKYCPILENLGEADLSAHVDFYALKTVAKNSKINV IDTISQRDFLIENGILLRKQTLQDKLNDRHLAKFAYREEFKGDTKRSTAA YTLVREDASIGSTYKLPLEVEFGKMSEQAQIIERQVERLISPKQMGELFK VLQIMN >RC0312 unknown MKLIVLLFTFLFSMFSFGESETIKGKPLKYAANNDFENRLDEQEQEIRRL IGKVEVLQHKIDMLKQNSNISNQEENTEVLEAGDSKKQDVFDIALLKDMP DNAPKKPIAVNKDIAPDKQAYDLALAAYKDNKLTEAKDKFKHFIQNYPNS LLISNAYFWYGECFFKQKDYNRAAVNYLKGYKELPKGAKSSDGLLKLALS LGELKKTQEACNMLAKFDKEFPTNRTAASKKMAEDAKIKFGCKNK >RC1362 unknown MYKSMEIVNSVDASVSCQGKEPPYDHPKVYLEIDKKKKEVICPYCSKKFK LVTK >RC0477 unknown MRNKENISNIILDAEENALLESFENDEWQRIKNFEQEKHISQVAAANYLK KDTRINIRISSSDLMRIKQKAAYEGLPYQTLISSILHKYSAGHG >RC0858 unknown MNVFLSKYVNGVDKKSRVSVPANYRAVLGKELFNGVIAYPSIRNNCIEVC GISHIEKLRQMIETLDPYSEERDAFETMIFGEAVQLSFDGEGRVILPQSL MKHAGIEEQACFVGKGVIFEIWQPQNFEKYLNAAQKIAHEKRLTLRNAH >RC0740 unknown MSQKLKIILKGFGMYAHKTIENGWSRNVLTLQIKSNLCARSGKSINNFGH TLPAMQSDLAKSIIKNPYNLEFLDIKENILERELENKLIDNIKDFLLELG QDFVFIGNPYL >RC0488 unknown MLHSKELLNSILRQDFHSFIIKVFNTINPGAEYYPSKHIRIITDYLNAVQ SGDINRLIINIPPRSLLQSICVSVAWPAYLLVVNPTKRIMVASYSQILSI KHSLDCQFILNSDWYTELFPSTILSKPHNQKSKFLTTANGFRFATLVGGS ATGEGGDILIIDDPHNPTQIHSYKIRKKVYRLV >RC0865 unknown MLKTLLISFITLICAVNYADADQTAQNPNSTTSSSDQPDDLPAEAAVHFA QPWARPTTNVQGKVSNSAMYFTLINSRSKSYNLVNISSDKISGIEIHQTI NDQGVSKMVKVDYPFLIAGNINVDFKPGDMHIMLYDPKVDLNVGDEFKIT FFFDDNTRKIVNVKVANDNPYNKTGN >RC0093 hesB1, hesB protein MTITITDRAFERVCELVKLEKDKNLVLRISVDSGGCSGLMYNYELVSKDN IEQDDYVITKHNATIIIDPISQKFILDCTLDFIEELGSSYFNVSNPQAKA KCGCGNSFSV >RC0728 hesB2, hesB protein MKNVISLTDSAAKQIKLLIEKRAKPTFGIRVGVKSGGCAGQTYYVEYADS KNQFDEVVEEKGVRILIDPKALMYILGSEMDYVETKFKSQFTFTNPNEKA SCGCGKSFRV >RC1273 rompA, 190-KDa cell surface antigen MANISPKLFQKAIQQGLKAALFTTSTAAIMLSSSGALGIAVSGVIATNNN AAFSDNVGNNWNEITAAGVANGTPARGPQNNWAFTYGGDYTITADVADHI ITAINVADTTPIGLNIAQNTVVGSIVTGGNLLPVTITAGKSLTLNGNNAD AANHGFGAPADNYTGLGNIALGGANAALIIQSAAPAKITLAGNINGGGII TVKTDAAINGTIGNTNALATVNVGAGIATLEGAIIKATTTKLTNAASVLT LTNVNAVLTGAIDNTTGVDNVGVLNLNGALSQVTGNIGNTNALATISVGA GKATLGGAVIKATTTKLTDNASAVTFTNPVVVTGAIDNTGNANNGIVTFT GDSTVTGNIGNTNALATISVGAGKATLGGAIIKATTTKLTDNASAVTFTN PVVVTGAIDNTGNANNGIVTFTGDSTVTGNIGNTNALATISVGAGKATLG GAIIKATTTKLTDNASAVTFTNPVVVTGAIDNTGNANNGIVTFTGDSTVT GNIGNTNALATISVGAGKATLGGAIIKATTTKLTDNASAVTFTNPVVVTG AIDNTGNANNGIVTFTGDSTVTGNIGNTNALATISVGAGKATLGGAIIKA TTTKLTDNASAVTFTNPVVVTGAIDNTGNANNGIVTFTGNSTVTGNIGNT NALATVNVGAGIATLEGAVIKATTTKLTNAASVLTLTNVNAVLTGAIDNT TGVDNVGVLNLNGALSQVTGNIGNTNALATISVGAGKATLGGAVIKATTT KLTDNASAVTFTNPVVVTGAIDNTGNANNGIATFTGDSTVTGNIGNTNAL ATVNVGAGLLRVQGGVVKSNTINLTDNASAVTFTNPVVVTGAIDNTGNAN NGIVTFTGDSTVTGNIGNTNALATISVGAGKATLGGAIIKATTTKLTDNA SAVTFTNPVVVTGAIDNTGNANNGIVTFTGDSTVTGNIGNTNALATVNVG AGVTLQAGGSLDANNIDFGARSTLEFNGPLDGGGNAIPYYFKGAIANGNN AILNVNTKLLTAYHLTIGTVAEINIGAGNLFAIDASAGDVTILNAQDIHF RALDSALVLSNLTGVGVNNILLAADLVAPGVDEGTVVFDGGVNGLNIGSN VAGAARNIGDVGGNKFNTLLIYNAVTITDDVNLEGIQNVLINNNADFTSS TAFNAGTIQINDATYTIDANNGNLNIPAGNIKFAHADAQLILQNSSGNDR TITLGANIDPDNDDEGIVILNSVTAGKKLTIAGGKTFGGAHKLQDIVFKG EGDFGTAGTTFNTTNIVLDITGQLELGATTANVVLFKDAVQLTQTGNIGG FLDFNAKNGTVTLNNNVNVAGTVKNTGGTNNGTLIVLGASNLNRVNGIAM LKVGAGNVTIAKGGNVKIGEIQGTGTNTLTLPAHFKLTGSINKTGGQALK LNFMNGGSVSGVVGTAANSVGDITTAGATSFASSVNAKGTATLGGTTSFA HTFTNTGAVTLAKGSITSFAKNVTATSFVANSATINFGNSLAFNSNITGS GTTLTLGANQVTYTGTGSFTDTLTLNTTFDGAAKSGGNILIKSGSTLDLS GVSNLALVVTATNFDMNNISPDTKYTVISAETAGGLKPTPKENVKITINN DNRFVDFTFDASTLTLFAEDIAAGVIDEDFAPGGPLANIPNAANIKKSLE LMEDAPNGSDARQAFNNFGLMTPLQEADATTHLMQDVVKPSDTIAAVNNQ VVASNISSNITALNARMDKVQAGNKGPVSSGDEDMDAKFGAWISPFVGNA TQKMCNSISGYKSDTTGGTIGFDGFVSDDLVLGLAYTRADTDIKLKNNKT GDKNKVESNIYSLYGLYSVPYENLFVEAIASYSDNKIRSKSRRVIATTLE TVGYQTANGKYKSESYTGQLMAGYTYMMSENINLTPLAGLRYSTIKDKSY KETGTTYQNLTVKGKNYNTFDGLLGAKVSSNINVNEIVLTPELYAMVDYA FKNKVSAIDARLQGMTAPLPTNSFKQSKTSFDVGVGVTAKHKMMEYGINY DTNIGSKYFAQQGSVKVRVNF >RC1085 rompB, outer membrane protein B (cell surface antigen sca5) MAQKPNFLKKLISAGLVTASTATIVASFAGSAMGAAIQQNRTTNAVATTV DGVGFDQTAVPANVAVPLNAVITAGVNKGITLNTPAGSFNGLFLNTANNL DVTVREDTTLGFITNVVNNANHFNLMLNAGKTLTITGQGITNVQAAATKN ANNVVAQVNNGAAIDNNDLQGVGRIDCGAAASTLVFNLANPTTQKAPLIL GDNAVIVNGANGTLNVTNGFIKVSSKSFATVNVINIGDGQGIMFNTDADN VNTLNLQANGATITFNGTDGTGRLVLLSKNAAATDFNVTGSLGGNLKGII EFNTVAVNGQLKANAGANAAVIGTNNGAGRAAGFVVSVDNGKVATIDGQV YAKDMVIQSANAVGQVNFRHIVDVGTDGTTAFKTAASKVAITQNSNFGTT DFGNLAAQIIVPNTMTLNGNFTGDASNPGNTAGVITFDANGTLASASADA NVAVTNNITAIEASGAGVVQLSGTHAAELRLGNAGSVFKLADGTVINGKV NQTALVGGALAAGTITLDGSATITGDIGNAGGAAALQGITLANDATKTLT LGGANIIGANGGTINFQANGGTIKLTSTQNNIVVDFDLAIATDQTGVVDA SSLTNAQTLTINGKIGTVGANNKTLGQFNIGSSKTVLSDGDVAINELVIG NNGAVQFAHNTYLITRTTNAAGQGKIIFNPVVNNNTTLATGTNLGSATNP LAEINFGSKGAANVDTVLNVGKGVNLYATNITTTDANVGSFIFNAGGTNI VSGTVGGQQGNKFNTVALDNGTTVKFLGNATFNGNTTIAANSTLQIGGNY TADFVASADGTGIVEFVNTGPITVTLNKQAAPVNALKQITVSGPGNVVIN EIGNAGNYHGAVTDTIAFENSSLGAVVFLPRGIPFNDAGNRIPLTIKSTV GNKTATGFDVPSVIVLGVDSVIADGQVIGDQNNIVGLGLGSDNDIIVNAT TLYAGIGTINNNQGTVTLSGGIPNTPGTVYGLGTGIGASKFKQVTFTTDY NNLGNIIATNATINDGVTVTTGGIAGIGFDGKITLGSVNGNGNVRFVDGI LSHSTSMIGTTKANNGTVTYLGNAFVGNIGDSDTPVASVRFTGSDGGAGL QGNIYSQVIDFGTYNLGISNSNVILGGGTTAINGKINLRTNTLTFASGTS TWGNNTSIETTLTLANGNIGNIVILEGAQVNATTTGTTTIKVQDNANANF SGTQTYTLIQGGARFNGTLGGPNFVVTGSNRFVNYGLIRAANQDYVITRT NNAENVVTNDIANSSFGGAPGVGQNVTTFVNATNTAAYNNLLLAKNSANS ANFVGAIVTDTSAAITNAQLDVAKDIQAQLGNRLGALRYLGTPETAEMAG PEAGAIPAAVAAGDEAVDNVAYGIWAKPFYTDAHQSKKGGLAGYKAKTTG VVIGLDTLANDNLMIGAAIGITKTDIKHQDYKKGDKTDVNGFSFSLYGAQ QLVKNFFAQGSAIFSLNQVKNKSQRYFFDANGNMSKQIAAGHYDNMTFGG NLTVGYDYNAMQGVLVTPMAGLSYLKSSDENYKETGTTVANKQVNSKFSD RTDLIVGAKVAGSTMNITDLAVYPEVHAFVVHKVTGRLSKTQSVLDGQVT PCISQPDRTAKTSYNLGLSASIRSDAKMEYGIGYDAQISSKYTAHQGTLK VRVNF >RC1113 surf1, surfeit locus protein 1 MKTNFLVFITFTILISLGFWQLSRLKEKKLFLASMQANLTSPAINLAEIQ DGLPYHKVKITGQFLPNKDIYLYGRRSMSSEKDGYYLVTPFKTIEDKVIL VARGWFSNRNKNIITQATNDRQHEIIGVTMPSEKTRIYLPANDIKNNVWL TLNLKETSKVLGLDLENFYIIAEGKDISNLDILLPLAINHLAAIRNDHLE YALTWFGLAISLIVIYVIYRRRYMAVDVIPRACSGIQKNN