Gene list
Applied filters:
COG category: Function unknown
Gene type: CDS
Genomic element: chromosome
Number of genes found: 75
Show UniProt / TrEMBL protein name | View in Fasta format (DNA) | View as list | ||||
# Rickettsia conorii str. Malish 7, Malish 7 >RC1253 unknown MKKETEELKLFILECLSEKKAEDIEVIDLTEKHKLADYIIFASGRSTKNV GAIAEYVALELKNNAGINSNIEGLGKSEWVLIDAGTILINIFYPEVREHF KLEEIWKR >RC0307 unknown MSKDNKKNQDMSIEDILKSIKGVINERKNPIHENDSEDEDVLELTEIVNQ DEEEKLISTKSAEEVGDIFKNFTDTIKDKKLDNNISSKNALEELVIEMLK PELKAWLDKNLPVLVKELVEIEIKKLVQNSKR >RC0209 unknown MDDKKDNRHLSKPAYREECTGDTERSTTAYMDILEDVSTGSTSKLPLEAK FVKISNNISEKENLPKEKEIGGVKGLEPTRYGDWQHKGKVTDF >RC0719 unknown MRRKLYENSLAHSWKQAGIEEGRKKEKITMAKEMKKEGLSLETIMTITKL DKKDIEKLK >RC1232 unknown MDKHLKEYTESGKKNLIVVYILYLCGIVAPILPLIGVFFAYLNKDKGDNF AISHYVFLFRTFCIGVLGWIVCFIFTFIMIGVVLYVILAVWYILRVAIGF KYMIEDKAYPNPMTYWIK >RC1034 unknown MSKSQEQEQESIDNIRQQFDEEYKNLSWVQVGYLALGADTGNKQYDELNK RYLDLNKDNLGIIQKVLSKQSGMFLVHCSLPGRLLKFYLILHWSLITSEK REVLPLV >RC0119 unknown MRKVFKKFLKNNKYVLSIITILLYWYLRFVYFTSKQKFIFYDNGNKEKFL NEQGVIFAFWHNMLALSPSMFIGHKNIYALISPHLDGKILNDLVGKFGCR VIVGSTNKNPIGALRNIIGKLSQGANIIVTPDGPKGPVYKVNSGITEIAY RYNTKLITIVSSTSRCFRLKSWDKLIIPLPFGTIKIIVGSPLELTNDKIQ NHISLEQQLASLTESLKK >RC0001 unknown MTKLIIHLVSDSSVQTAKYTANSALAQFTSVKPKLYHWPMIRNLELLNEV LSKIEYKHGIVLYTIADQELRKTLTKFCYELKIPCISVIGKIIKEMSVFS GIEIEKEQNYNYKFDKTYFDTLNAIDYAIRHDDGQMLNELSEADIILIGP SRTSKTPTSVFLAYNGLKAANIPYVYNCPFPDFIEKDIDQLVVGLVINPN RLIEIREARLNLLQINENKSYTDFNIVQKECLEVRKICDQRNWPVIDVST RSIEETAALIMRIYYNRKNKYNK >RC0768 unknown MIILEDIIPDYVKDAEEVKITAGCDKNFITCCNKFNNAINFRDEPLIPKT DFINLV >RC0071 unknown MSIESKIRQLIDQNGYITCDVLMQEVLNLNPTSYYKQVKSLANEGDFVTA PEISQLFGEIIGLWCIREWQRIGCPKSLSLVELGPGRGLLMRDLLRTAKL VPEFYKALSIELIEINKNFIAYQKANLQDINLPISHQSFVEDIPKKPTII IANEFFDAIPIKQYIKVKELWYERIFVVQPVDERIKYDKISVNKQLQEYL LCTHIEAKDGAVLEESYKSIEIIKFIAQHLKRLSGSGLIIDYGYDIAPNG RTRYQYNQTLQAVKNHKYCPILENLGEADLSAHVDFYALKTVAKNSKINV IDTISQRDFLIENGILLRKQTLQDKLNDRHLAKFAYREEFKGDTKRSTAA YTLVREDASIGSTYKLPLEVEFGKMSEQAQIIERQVERLISPKQMGELFK VLQIMN >RC0367 unknown MSNTLSNESYTNLIKNLKQEISKARIRAHLAANKELIVLYWHIGNLILER QNKEKWGSKVIQNISDDLRKEFPKMKGLSYQNLSYMRQFFAEYNNDQILQ QAVGEIPWSHNIIILSKLNNINQRIWYAQQTIENGWSRNVLSWQIKSNLH ERSAKKRYK >RC1057 unknown MLHSMSYLLPTIITTLLILLALIIWFYVKTQTLRTQLQFLSEQNLEISNN NQLLNQEQIGYLQKIEQLQCKIEYQAQTIKDSEKIREESFSSAKAALFDL GQDLSKQLIEIHKMENTAARELAEKNIATASGKFNSEFERLITMVGALNK DIEQSKGTVDLIKQSLLSPIGAGLLAEITLENILKSSGLRPNLDFIMQYG LTTLDSGKLRPDALIFLPSGNLMVIDSKASKFLVDKQDNNMSLNKTMNYH LKSLANKEYAANILTNLNKKDQSFNNVMTLMFLPTEQAVEKVIAADPEFL QKAWGCNIFPVGPSGLMNMLSFAKFQITDHRRSENYKVIIEEVRKLLSSI GTMADYSQKIGNNLHNMVTNYDKFAASFNRNFMSRVKNIQKLGIDSGNKA MPATLERYQIVSSKSEIIEVEAENPPQIAEKL >RC0766 unknown MKETKRFFYKNNRLNKGYAKTFSVNEPDNNFYRKKFEHILPPIDLISEYE SIYPGTLQELMHMAQQEQAHRHAIDLKNLKIQARVAKLTRICLLIFGICL VVSIFFKTF >RC0825 unknown MSIKLDSYEQDIEDNFEKQQKIDDRSEIALLQKFAKAHLSTKRSVTVRVA EHDIEAIKIKVSKHGLPYQTYLNMLIHLDPT >RC0818 unknown MQTIEQQIANVIEESLTDMGFELVLVKFKGVNPKVVEILIDSLNSEKISV EDCTKASRTISAILDVEDLIEAAYSLEVASSGLERPLVKFENYNRFLERE VKIKLKELLNGKTRYQGKIIKAENNKIYLKCEEQEVLIDYDLIKNANLVL TEEVFKKLLKQ >RC0103 unknown MNSQKTLPLLPKATAIWLIENTSLTFKQIADFCGIHEFEIKGMADGEVAQ SIKGLNPIANGQLTLEEIERCSKDPNANLQISYSPAYELMKNQKKKRAKY TPIARRQDKPDAIYWLLSNYPNIQDHQIIKLIGTTKITIDAIRTRSHWNM NSIRPRDPVLLGICSQIDLNKIVESLKPPQNPTKES >RC0474 unknown MIVIYNNIFKNFFTDMIKKIAAGVIICFSLLLFIMFGALSFVNYNSVTNN FTSHLGIAKENIGKIKMNKFPLSYLVIETIREEGKLDLEQIKIHFSLWSL IKFNPKINKIDILDAKFYSHSNVLNIYNHEELIKNFFKYKLQNINLNVTN LSIINKQDYSILNFNNCILKKENALSSNYIFKTTSNYIGKISGSINKRDD IVDFSLNIDNNDYGFKLSQIYKDSKLTSGSGEYQIKNLASVMYNILPDLN HLFNKFNQHEAVNVKFNISNNEDAIELKDIVIASSFIIGHGFVNIAKNDN ITTNVKLDFPKIDLSSLISPNAEVTFNTSSSNSRFIFANKLLKADVAINE IILSNNEELKKIVFSSNLLKGTLKINEFSGNIKSGGEFKLTGNVTQNAVR SMFDGQLYLKHNDINSLLNILGFNDVTIKEAIPFSLSSDLKLTLIDLFFK NLLLKTDNLNLSGNFSSKFIAQTPRLDATLNISSLDLSSRTYPIISPLIE FTKNLTKDMKALDYPSKFIPIRTIGYLANLDILIDSVKYNDHVFDKMNLL AKIVPANIKISNLDFKTANSYLSTSWNLDASSVLPSLTVEIKDGNLTTDL LSPAGMLNLRNKLINDYSLDKATLQVSGTLSTLLQNDLILKNVKFYVANN NNLLQFNNIEAELLGGKFQGNGNILLEPYAINFVYALNTIDLNKVSALMP KIFTASGGKISISGSLGTNGNTLQSQLYNLTTKSQFAINNIDVNNFAIDA FIEKIDTADYKVQNLDKDINSAITTGQENIRGISGDIELQKGIALLKKVK FATQYSSGAASVAVNIYNFDMDASSILSFYVPARLVKLNTSNTSSDKDSL AHLNIKMQGSIFAPKKTFDSSELKKLLIPQTTEDTITTDNH >RC1144 unknown MQKKLTITIDEAVYYRLYSVIGERKISKFIEQLVKPYVINEKLGAAYKAM AQDIKAEEKANEWVEGLIERDFYEKS >RC0291 unknown MYIIRYTIQVQKDAKKIVQAGLKNKVEVLLNIVSTDPWKIYPPYEKLVGD FSGCYSRRINIQHRLVYEVYKQEKVVKILRMYTYYE >RC0604 unknown MSPQIIELLIFAVIAFYIINKLITTLGSTSEEEQTKQKSYFGEPIIKDVT YSIVKSNKEEKNIPTAQDIKAFKDIIVEHNITAIVDGMEQVHKRLYSFDP VKFINNAKTAFQMIIEAAYKKDAKELSELIDKRYLEEFEKITPSYGDFFD SSALSAKYSEIYMFGNNIFIKLLFQGKNVVDKIEDLKEEWTFTRNANTKE VDWFLSNIERV >RC1301 unknown MDKFYNYNSSSHQALLSFKVKPNSKQNLISNFVIINNIPYLKLSIKAIPE QGKANEEIINYLAKEWKLSRSNIEIIKGHTHSLKTILIKNINEDYLNLII NSYIK >RC0366 unknown MHNKPSKMVGQEMYYLGKLKVIYMKGQRKKGINNFSNTLPELQSDLARSI IKEPYNLDFLDIQGKIIERDLENQLIDNIKNFLLELGQGFAFVGNQYHIE LEGEDYYLDLVFYHIKLKCYVCCY >RC0312 unknown MKLIVLLFTFLFSMFSFGESETIKGKPLKYAANNDFENRLDEQEQEIRRL IGKVEVLQHKIDMLKQNSNISNQEENTEVLEAGDSKKQDVFDIALLKDMP DNAPKKPIAVNKDIAPDKQAYDLALAAYKDNKLTEAKDKFKHFIQNYPNS LLISNAYFWYGECFFKQKDYNRAAVNYLKGYKELPKGAKSSDGLLKLALS LGELKKTQEACNMLAKFDKEFPTNRTAASKKMAEDAKIKFGCKNK >RC0365 unknown MYVVIELKTGKFKPEYAGKLNFYLNLMERTIKDNSDNPTIGLILCEEKQG ITVEYAIEGIQKTNRSITI >RC0102 unknown MPLSTSLLGKKSTYKDSYDVTLLFKIPRINNRNELGINSNNLPFYGVDVW NTYELSCLNKNGKPWVGVGTFYIPTDSENIVESKSFKLYLNSFNNFVVES VKELERIILQDLSNVTHAKVTGRIFPINTKVEFGVPSGKNIDDLDIVCNN YGAPDNSLIEYEDVLVEEEINSHLLKSNCLVTGQPDWGTIVIKYKGKKLK YDSFLKYLISFRNCNEFAEQCAERIFTDIKNAISPDFLSIYIVYARRGGI DICPYRSTDKSYTLPSDKRFIRQ >RC0488 unknown MLHSKELLNSILRQDFHSFIIKVFNTINPGAEYYPSKHIRIITDYLNAVQ SGDINRLIINIPPRSLLQSICVSVAWPAYLLVVNPTKRIMVASYSQILSI KHSLDCQFILNSDWYTELFPSTILSKPHNQKSKFLTTANGFRFATLVGGS ATGEGGDILIIDDPHNPTQIHSYKIRKKVYRLV >RC0608 unknown MKKQIIYPDFIARIFSTALDLSVFAFIAIPISQFCSFNLLWLFFNDYFLS NNINLHNPNEMFNSVMSQEFYEYLKAGNFNKYILFNISIFATNILVIGSY FITLWYYKGATLSKMFLRMKIVDAVTLNRPTLKQLIKRFLGYMTFPIGIF FILFSSKKQALHDKIAGTVVIKS >RC0850 unknown MTKNMRKQMLKVISIITIYLLLSSCSESTRDANGLLTDSQSTVIRNYIIS QNSKNLKVNLKEKFGSNLKGVKLIGVQLINEDLSGIDLTSCEILRADFAG SNLDKAILTNAIIQESNFADSVIKNISGYHSDFQGSIFHNITLQNTNFVQ SNFSDTAFNKTTIINVNFENSKFSHVLWSNNTIDGVNFQKANLQNNSFKN TNITNSIFYGTDLEKSIIKNTNFTNNYFESSNLSQTTLTAVIIKDSNFTQ SIFNEVNFNNVQSNNSCFSYASFQDSTLQNISLTKCDLQNSTISSSVLKH FKINNAILNNMSLNDNKFNTLSIKNSNANFVRINKTKGSNITLDNISYTN NIFSNNDFKQFIVINTDLNSSEIINSNITNGQFNNINFSKSLIQNVNFSD VKITLGNLNQVALINSTLTNTAVINSVLSNSQINNINYQAYSSFINTNVS NNIILNSDNSSKILPNNIVINSVKDLQKITHLANMNLTNFDLSNLIFDRV DFSNSIFKNANLTNTVIKNSILKEANFSAAILTKTDFSNSILTDSIFKSA KIDQAGFNNSDLTNADFTETAIKDTSFDKAKTSGMKGVE >RC0477 unknown MRNKENISNIILDAEENALLESFENDEWQRIKNFEQEKHISQVAAANYLK KDTRINIRISSSDLMRIKQKAAYEGLPYQTLISSILHKYSAGHG >RC1044 unknown MANRLAEENKDFENYINNIDTVYKKQKVNDSIWFNINDHVNKIPKQNKKK K >RC0914 unknown MNKLNKNSLQKKLFYRSKNRGCREMDYILSSFAEKYLSLMDETQLGSYSL ILDQNDNDLYNWINNKSSAPSYLDAEIIDKLHKIAKI >RC1154 unknown MIKVKFMRSALITSILVAVAFLTSACNTMQGAGQDIQVAGKKLKDSAESN KPQKGCGCPHSSAN >RC1372 putative integral membrane protein MASYYLWFKSFHLISAICWMAGLLYLPRIYVYHIKAKIGSELDSTLQVME LKLLRFIMNPAMISTFIFGLINAHIYGFVALDTWFHIKMFAVLILVIFHG LLARWRKDFANGKNVHSEKFYRIVNEIPAICMIVAVIMVIVKPFD >RC1320 unknown MSEVFEIPNGESKVLLHCCCAPCVGPLMEKMIDTGIKFMLFFYNPNIHPK KEYELRKNENIKFAEKHNIEFIDADYDPQNWFRRAKGMELEPERGIRCTM CFDMRFERTALYAYENGFKVITSSLGISRWKDMNQINESGTRAASHYEGV TYWTYNWRKDGGASRMYEIAKEEHFYKQEFCGCVYSFRDTNDWRVANNRP KIEIGKEYY >RC0529 similarity to cell filamentation proteins (fic) MNSKPSFQITNKILELSQDISYELGILAGSKFYSQPIKLRKNNQIKTIHS SLAIEGNSLSVEQITDIINDKRVLAPEKDIVEVKNAIKLYNNLTIFNPFK IESLLKAHEILMQGLVEDNGKWRKGNASIFKGTEIIYFAPTARRVSLLMQ DLFEFIAQDKQISGIVKACIFHYEFEFIHPFSDGNGRIGRLWQQLLLMQA NKIFEYISVESLIRNNQSEYYSVLSKCDKLGESTLFIEFMLDKIVAALRL YSNNITYEANTPLSRMEFAKVNLIDQWFSRKDYITVHKNISTATASRDLL YGLERKLLISKGDKNQTYYKFV >RC0210 unknown MSIIFFIVHSTKYMKYNRIYINSRLAENSKIELASDHHVHYVKTVLRLKV NDGLRLFNGTDGEFLAQINDIGKNNLSVRLKEQLKKPYTKSTLTLAVAII KQDKLMLAINMATQLGITKIIPLITRWCQFRSVNIERLTKCVIEATEQSE RLTPPIIEKAITIQDYLKKNNNLMLYANEHEKEENSILRISSSLSNSDIT IIVGPEGGFTNDELELLASYKNTKSISLGSNILRAETAAITAIAQVRLLG SHCEEIA >RC1173 unknown MSSMLKQIRLESGKTLNQVSSDLKIRKKYLVALEEGDFDVLPGEVYVRGY LKLYLDYLNVKDRNAEQIEATKQNETEKLLNNKRATVINYKRKKQLVLIS IIMLSIIIVSHPFIINA >RC0660 unknown MPSSYKLRKKIWKSVYLLITVGILYIGYILIKSGYINEKNDINVTKKSLK DNKNFDLKYNIILKDSIFEGVNKNLNAYKIKTERAIKESNNKYKLDIINA IYNVNQDQTLIINAKEGFLDEESSILDLKNDVKLFFDEIIFNTNDARIDL VNKNITGHSPAKLLYKNSSITSDSFNTKDENNIIIFKGNVSTIIDLSD >RC1071 unknown MQKEMKMKTNDTFDGFTIELFKDTDGDWLARFEELPNVSAFGNSPEKALQ ELQQAWTLMKESYISHNQSIPLAPSRKEYSGQFNVRVDKRVHRALVLEAL RAKISLNALVSQKLTLSVKQ >RC1356 unknown MIRKFLLTIFALWVGGFGYYLYLINSYKLNSNTTNAIIVFAGGGHKIETG IAWLKAGYAPILFITGIESTEQLKILLKERNVIEQQVIFAPNKIMSEEDN IKKVVDFIVTYNLTSIILVEHNYNMPFMLNKLEKAIPSSNNIYIVPYPVF SKQKYDVLLKSYHRYLMSILV >RC1171 unknown MSIVTVTLNNKSFQLYCNNGDEEELLSLANKLNDKIAEIKLGSPTASFEL LLVMASLNAQAEIVSLTEKLNKNGLQKNHPDEEKFAETLTTIAGYLENLA RKMGK >RC0808 unknown MTNLNIYKCAEKDLNDYLGYIKDTPSVSLNDFIKNKYFAEDNDNIIITSL ENMEINADLANANFQGTILTDAVFNNCDLTNTILCDSDLTNVKFNDCTFI GTDFRGANLHYTDFNYKDYDNYKIPNLKDKIRDIKLSFSDLERLNQYIDK DLEKEHIKEIVIDEATKTKKYILAGEDEKTLWDIKSKELKTKQEELETLK QNLDNPGIATNLLNAFWHSAETIARNRQNELEKINKLQHEINKLETEVYA LDNLRMFCGKGLDGIFEQLKDEEIQITLDPSYIIGSTATERDIPKEYIKL TSAEFDLYLAEAAKQSNTKLSLTEFVRKQKNLSEDLNIVPDLSAINLSGK TLTNLNLKNTLFASANLENVKISNCNLDFTNFEGANLQNAVFQNVTARNA GFLFADLKKSKIENSDMSRAYMPKVDLSEVEVTNSKFNAVMMVNADAEKL IIKDSEWTNSNLTGISLAYADMQRVQMQGVVLNNALLDQANIVSTNLENV FMNKTHALEAKFKEQCNMQGITARNAYFSDAEFENILSLKEADLRETIMQ RVKLKNADLTKAQLDKANLEYADLTNATLTNATAQFAKLSNATLEKAEAE GLNISDAIAKNINAKEANFKNAIMQRADLTKANFTKAVLENADMQAVEAA EAIFKEAKLKQANLKAANLAGINKEGADFDKAKINDATKMHDTKGEAKGN LDHQDKDGKKTSVNVNEHAKLQDKIHAREKSGWFLKTGVGQLCTKLAKST TSGISRVTNFLASKKFLVGLAVVAGLAVAAAPFVAMPVLLVTGTALTTKA VILGAGLLAGGLVATGTYKLTQKPLRNLQKSFENLTSSIDKYISPPPKNI DELVTAKQQARQKAETEKSKEREENLNNVNNNIDKAKEQDLLKQAQNNLN QATPKVEIKEKKDKTVEKQQAKSNTFAAKFKPNTKGKGFATKIKDEKKSR SIKEAYN >RC1362 unknown MYKSMEIVNSVDASVSCQGKEPPYDHPKVYLEIDKKKKEVICPYCSKKFK LVTK >RC1109 unknown MNIEYKKFVHEYMLEFVKKILTKIQHENLYWDQLIYISYRTDNPAVILPS KVKQAYPKQITIVLQYQFENLIVNDTGFSLTVSFDGVKEIIYVPFDALIS FVDFNNNYSLTFNQSLNIHENPQHEEAISNNKSDQTSSSSSPNVIMLDKF RNSSKPS >RC0646 unknown MINGVRKLNYNLYKDAGFTQILGNGTSSTVTFTDSYTLGLEPVTKNYTVY ARLPSQPLAAASTFQDTITVTQPWLYNKKSLKSKFFSLGSNFVAYMD >RC0487 unknown MKLLKIIFIITICINFPIFAENLESVTTTEDIENDVFIPLDENHPILNPN DNINDSSEFKNYTNGKIIALNNITATSEEIDFKVGEEKYFGNIKIKLHKC IKNLDPYNEDNYLLMTITEYKIDEDPNVLFQGWMISSSISLSTFEHPIYE IFAKDCF >RC0028 unknown MASSRNDGDKLMNTEKLKDIKARIKDLTTPKFSNPKIRQEISPFTIAVDL VSGTMLGVVIGIFTDKIFNSKPLFLIIFTIIGMIAGFNIIRKKVNNKK >RC0740 unknown MSQKLKIILKGFGMYAHKTIENGWSRNVLTLQIKSNLCARSGKSINNFGH TLPAMQSDLAKSIIKNPYNLEFLDIKENILERELENKLIDNIKDFLLELG QDFVFIGNPYL >RC1110 unknown MLPNLGIIAGRGSLPYLIAGNYTKQGGNCYIAAIKDEADIEQIKDFEYKI LKIGMVGEAIKYFKEHKVKNIIFIGGVNRPNFKNLAVDKIGGLLLFKIVG QKIRGDDSLLKIVADFFESYGFKVISSNEIYKNQQGNSNIITNTNPISSD KNDIELGIKLLNHLSAFDIAQSVIVESGYILGIEAAEGTDNLITRCADLR KNPHGGVLVKIAKLGQDNRLDMPTIGPNTIKNLAKYNYKGVAIQKNNVII VEEELTIKLANKHKIFITKC >RC1337 unknown MVNFNQFLKQAQSMQKKMQEAQEQMANARYTGKAGGGLVEVIATGKGEVE KISIDESLLKAEEKEMLEDLIKVAFNDAQQKCDEDSQNSLSGALNGMRLP PGFKMPF >RC1251 unknown MLYLKWKIMIKILITLIIVILSTTINADNKKLPIPRFVSIKSNEVNARSG PTTKSAVEWVFVKKGEPVEITAEYKQWRQVRDINGEGGWIHSSVLSGKRS VVITSDKEIELTKSADHKSRVIAKLMPKVRCSLKKCKEQFCQITCKDYTG WISKKVIWGVYDDNDRY >RC0865 unknown MLKTLLISFITLICAVNYADADQTAQNPNSTTSSSDQPDDLPAEAAVHFA QPWARPTTNVQGKVSNSAMYFTLINSRSKSYNLVNISSDKISGIEIHQTI NDQGVSKMVKVDYPFLIAGNINVDFKPGDMHIMLYDPKVDLNVGDEFKIT FFFDDNTRKIVNVKVANDNPYNKTGN >RC0153 unknown MSEVVVKEQLEQYISKIERLEQEKADLSQEVKDIFQDASSHGFDVKAMKS ILKLKKLDKDKLAEQDAMLELYRDTLGI >RC0314 unknown MTKKIALLLLPFILISCNGLGPKRVKNIVELTPKLAIQTHEPIYLDSNAN IYAFNANMLKNKQYSFARSKTITEPVFIGDMIYALDIRSNISAFSIEKNK IIWSYNLSRHKKDNYIGGGILHHNGKLYVTYGSRLLVVLDAKSGYEIIRK ELPDIIRIKPIVLNDNTVLVQTISNQTIALNAETLKTVWEHESLAEVLSA SYFMTPIVQYDNVIVTYNSGQILALNITNGEVKWNFEFTNLNDRTAIPNF DESSILCTPVHDNMNLYIATGLGKLIKLNVATGSVIWQVNAEDIQSMSLI GNSLFVTNNARQIAAFNPETGKVKFVADLNDGQDPKKLKSAAFLVPFVGV NNNNKRSLNVISVNGVLYSFDVDNNGLNMIPHVVKIIKNIRYYGLSANNN LYFSTDSKIIFGSK >RC0697 unknown MRCSSYCTSSEYKMSDLVTNLKKIGLEPQHFDDVLYIRKEINKDSDFIEI FFFPFGCVTIWGGDEIQEKIVLSDTDLVPVNKLKEPVSDYIYFEYNTEVK KTFIDEEKNKIILADKSVFVKLSISHALAQSVKLSVLEQSVSNLIVQTTP IQQELARTGSVSLSKKEILQQIGILFNERYSISLHSDIFDTPEFFWRRPS YEPLYLMTAEFQDIEIRQNIMNHRLNMIHELLNILSNDLNYKHSTKLEWI IIILIGLEVVLSLSHTNLFLKIIGAL >RC0754 unknown MLAIISSAKTLNFEKLAPKTELTIPMFLTLTNKLLSTLQSYSENQLSKIM NISAKLALINKERFKDFDNQESKAAIFTYAGDVFNNIHIEKLTNHALNFL QSHLLIISGLYGVLKPLDTIKPYRLEMATKLNEINLTNFWQDEVTNYINK ILAKQENKYLLNLASQEYSSVINPNKLKYQLVNVHFKENRNGKLSRIGIN AKKARGAMVKVIANNLIDSPELLKNFSYLGYAFSTKHSSDNELVFIKS >RC0510 unknown MNQEERNLIMNKYILDSSALLALFNLETGSDKVEELLPLSIMSTVNIAEV VAELDKKLNISFIQSKAMISASINKIVALDFDQAIEIGRLKKETEQFGLS LGDRACISLGLITGYPIYTADKIWAKLQLNCKIVLIR >RC0950 unknown MWFIELYPIIQLYVKVKNMSIQEEHHIKKISFVQSLLELLPFNEWNNKLL EEAEEKCGFAKGYSLIVFPEGLSEIVGFLEEYLDNIMLESLKIIAEPSKI REKISLAVKTRVKTVLPIIHSKNAAYFALNPIQGTEVAFRSCDAIWRYAG DKSLDFNYYTKRSLLLSVYVSSILFYIQDESENYIETDKLIETAVENIVK TFSQMKKLLAPSRIPIVRMFT >RC0638 similarity to late-developmental spore coat protein MYSSATPLAFGTYVPTVDALQTNTIIIKCILGTNYTVALNAGTAPAATTS TRKMTGVVNTNSYLPYNLYSNAGRTQNWGHQSSDWVMGTGLDQTLTIYGK ILQGANVPSDTYNDTITIIVAY >RC0672 unknown MEGLINGSVYYKIFDKTFNNSSHRYIKKNNSINETEIVENKKSITTYFKA QDILILNQVHGNQIVNADESIIAVPEADGSITTKKNLILAVQSADCVPVL LASGDGKIIGAAHAGWKGSINNIISNIVTKITEKGAKNLIAVIGPAIAQS SYEVDDEYYKAFLSKDINNKQFFIHSIKENHYMFDLPAFVELKLKEAGVK DIKNIAEDTYTNPLKYPSKRRSYHLQEPYNQNILSAIVMK >RC0926 unknown MPWSEVKEIAKEDYEYWFGRGKSFFIDCKYPLERGDFSKSAFELHQATAS VYSNILLVFARYKPKLHDIRTLGGYCANL >RC0670 unknown MRKIFNCLYVALFRTIEDDGVEHSGYMSFMILLSIFPFLVFLLALTSFLG ASELGQNFIQIFLESLPEQATESIEKRIRELLSAPPQSLMNLAIVGSIWT ASSFVECLRTILNRVYQIKSPPPYIRRRLLSIIQFLIISALITFTMFLLV VIPILFTKIPIILETIEKYKIILNFIRYFLILILLFLGASSLYYILPNVK LNFIDVFPGALLTVILWIISGYLLSTYIVYYNQLNLMYGSLGSIIVTLIF FYIINMIFIYGAEFNYLMKNYENIE >RC0364 unknown MQKYGPMILFNGGAYDNKLLQEALDKNIITDYPKENFYIFTLPEYQTNIG ADNLTLYIKSMKTVILT >RC0659 unknown MSLQSSIYRIRHLAKPAYREEFKGDIECSTAAYKEVLEDTSTDSTSKLPL EAKFGKMSIIKLVLLLIISTIIYANDKNISNLHITSDSLIIDRTKQKAAY LGNVIVYFDNAILRTKELYIFYKTIDEKQTIDHIVVPTKLTVERKINNEL LLADSAKYFCDNKQLILLGNVILQRDDNVLKTNKLIYYVDIIKK >RC0781 similarity to N-terminal of biotin-protein ligase MKIKIYNDLGVSKESIKHCVHTLRLYAPKYNVDYITAQEIIDEKWVQNTL LLILLGGRDLYYVQKLQGKGNANIKNYIKNGGNFLGICAGSYYSGNYVEF AKGTNIEVISKRELKIFNGTVRGPLLAPYCYNSHKGARAAYLKINPTLNL NIKDCYAFYNGGGYFIDAENTKDTEIIASYEDSQAAIIKCTYGDGTAILS GVHLEYEPALIKNQSLNNIHKILKAHDTERIKLLNYILDFFGNTLIERNL >RC1029 unknown MSPKFMTFYEKYMTIVGTIGNFMFYVQAHKIFTCQSSASVSMPAFTISAI ALCSWLIYGILIKNTPIIIANIVGFIGALLVLLTIIIY >RC0681 unknown MFIICSAPRFADSLLFFQLIFVYYFSFYFRVYNTTFERKEINMAGHSKFK NIQHRKGAQDKKRAKVFTKLIREIVTAAKTGSSNNPENNPRLRNALTAAR SQNLPKERIDKAINSANDSSNNENYTEIRYEGYAPNGIAIIVEALTDNKN RTAAEVRSSFTKYGGSLGETGSVNYLFNHCGVIQYPINIASNKDILEAVI EAGGHDIISDDTTHTIYTDIENFSKVLEFLTGKYGIPEDSYIGWIPLNTI IIDDKEKAEKLLKLVEVLEESDDVQRVFGNYELSDDVYEIIQGEP >RC0770 unknown MKYVNSKIVKILSQLTSQKYLIKNARLSSTEFEQFNSFFKARCGSNFALR FRNYADYRGINEVIAKGDGNLNKFQLRKIYGNPIAPYERVITKPVNNSVM LYINNVRTMGIVDYNDGIVNLPSPLGQDVILTTDFTFDVAVRLSIDSFEY SYCRRFYSVIQHRVSGGDYMSIAIEEVITKLTNFLFSN >RC0489 unknown MILITPRKFILIKYVRKFIDWFEQTFVSRLNNRNKGAIVLVMQRLHTDDL SGYLLNNSNSWHHLKILAISIQDYSFKLMNKEYQYLSGQVIRQLLKNLLI V >RC0858 unknown MNVFLSKYVNGVDKKSRVSVPANYRAVLGKELFNGVIAYPSIRNNCIEVC GISHIEKLRQMIETLDPYSEERDAFETMIFGEAVQLSFDGEGRVILPQSL MKHAGIEEQACFVGKGVIFEIWQPQNFEKYLNAAQKIAHEKRLTLRNAH >RC0093 hesB1, hesB protein MTITITDRAFERVCELVKLEKDKNLVLRISVDSGGCSGLMYNYELVSKDN IEQDDYVITKHNATIIIDPISQKFILDCTLDFIEELGSSYFNVSNPQAKA KCGCGNSFSV >RC0728 hesB2, hesB protein MKNVISLTDSAAKQIKLLIEKRAKPTFGIRVGVKSGGCAGQTYYVEYADS KNQFDEVVEEKGVRILIDPKALMYILGSEMDYVETKFKSQFTFTNPNEKA SCGCGKSFRV >RC1273 rompA, 190-KDa cell surface antigen MANISPKLFQKAIQQGLKAALFTTSTAAIMLSSSGALGIAVSGVIATNNN AAFSDNVGNNWNEITAAGVANGTPARGPQNNWAFTYGGDYTITADVADHI ITAINVADTTPIGLNIAQNTVVGSIVTGGNLLPVTITAGKSLTLNGNNAD AANHGFGAPADNYTGLGNIALGGANAALIIQSAAPAKITLAGNINGGGII TVKTDAAINGTIGNTNALATVNVGAGIATLEGAIIKATTTKLTNAASVLT LTNVNAVLTGAIDNTTGVDNVGVLNLNGALSQVTGNIGNTNALATISVGA GKATLGGAVIKATTTKLTDNASAVTFTNPVVVTGAIDNTGNANNGIVTFT GDSTVTGNIGNTNALATISVGAGKATLGGAIIKATTTKLTDNASAVTFTN PVVVTGAIDNTGNANNGIVTFTGDSTVTGNIGNTNALATISVGAGKATLG GAIIKATTTKLTDNASAVTFTNPVVVTGAIDNTGNANNGIVTFTGDSTVT GNIGNTNALATISVGAGKATLGGAIIKATTTKLTDNASAVTFTNPVVVTG AIDNTGNANNGIVTFTGDSTVTGNIGNTNALATISVGAGKATLGGAIIKA TTTKLTDNASAVTFTNPVVVTGAIDNTGNANNGIVTFTGNSTVTGNIGNT NALATVNVGAGIATLEGAVIKATTTKLTNAASVLTLTNVNAVLTGAIDNT TGVDNVGVLNLNGALSQVTGNIGNTNALATISVGAGKATLGGAVIKATTT KLTDNASAVTFTNPVVVTGAIDNTGNANNGIATFTGDSTVTGNIGNTNAL ATVNVGAGLLRVQGGVVKSNTINLTDNASAVTFTNPVVVTGAIDNTGNAN NGIVTFTGDSTVTGNIGNTNALATISVGAGKATLGGAIIKATTTKLTDNA SAVTFTNPVVVTGAIDNTGNANNGIVTFTGDSTVTGNIGNTNALATVNVG AGVTLQAGGSLDANNIDFGARSTLEFNGPLDGGGNAIPYYFKGAIANGNN AILNVNTKLLTAYHLTIGTVAEINIGAGNLFAIDASAGDVTILNAQDIHF RALDSALVLSNLTGVGVNNILLAADLVAPGVDEGTVVFDGGVNGLNIGSN VAGAARNIGDVGGNKFNTLLIYNAVTITDDVNLEGIQNVLINNNADFTSS TAFNAGTIQINDATYTIDANNGNLNIPAGNIKFAHADAQLILQNSSGNDR TITLGANIDPDNDDEGIVILNSVTAGKKLTIAGGKTFGGAHKLQDIVFKG EGDFGTAGTTFNTTNIVLDITGQLELGATTANVVLFKDAVQLTQTGNIGG FLDFNAKNGTVTLNNNVNVAGTVKNTGGTNNGTLIVLGASNLNRVNGIAM LKVGAGNVTIAKGGNVKIGEIQGTGTNTLTLPAHFKLTGSINKTGGQALK LNFMNGGSVSGVVGTAANSVGDITTAGATSFASSVNAKGTATLGGTTSFA HTFTNTGAVTLAKGSITSFAKNVTATSFVANSATINFGNSLAFNSNITGS GTTLTLGANQVTYTGTGSFTDTLTLNTTFDGAAKSGGNILIKSGSTLDLS GVSNLALVVTATNFDMNNISPDTKYTVISAETAGGLKPTPKENVKITINN DNRFVDFTFDASTLTLFAEDIAAGVIDEDFAPGGPLANIPNAANIKKSLE LMEDAPNGSDARQAFNNFGLMTPLQEADATTHLMQDVVKPSDTIAAVNNQ VVASNISSNITALNARMDKVQAGNKGPVSSGDEDMDAKFGAWISPFVGNA TQKMCNSISGYKSDTTGGTIGFDGFVSDDLVLGLAYTRADTDIKLKNNKT GDKNKVESNIYSLYGLYSVPYENLFVEAIASYSDNKIRSKSRRVIATTLE TVGYQTANGKYKSESYTGQLMAGYTYMMSENINLTPLAGLRYSTIKDKSY KETGTTYQNLTVKGKNYNTFDGLLGAKVSSNINVNEIVLTPELYAMVDYA FKNKVSAIDARLQGMTAPLPTNSFKQSKTSFDVGVGVTAKHKMMEYGINY DTNIGSKYFAQQGSVKVRVNF >RC1085 rompB, outer membrane protein B (cell surface antigen sca5) MAQKPNFLKKLISAGLVTASTATIVASFAGSAMGAAIQQNRTTNAVATTV DGVGFDQTAVPANVAVPLNAVITAGVNKGITLNTPAGSFNGLFLNTANNL DVTVREDTTLGFITNVVNNANHFNLMLNAGKTLTITGQGITNVQAAATKN ANNVVAQVNNGAAIDNNDLQGVGRIDCGAAASTLVFNLANPTTQKAPLIL GDNAVIVNGANGTLNVTNGFIKVSSKSFATVNVINIGDGQGIMFNTDADN VNTLNLQANGATITFNGTDGTGRLVLLSKNAAATDFNVTGSLGGNLKGII EFNTVAVNGQLKANAGANAAVIGTNNGAGRAAGFVVSVDNGKVATIDGQV YAKDMVIQSANAVGQVNFRHIVDVGTDGTTAFKTAASKVAITQNSNFGTT DFGNLAAQIIVPNTMTLNGNFTGDASNPGNTAGVITFDANGTLASASADA NVAVTNNITAIEASGAGVVQLSGTHAAELRLGNAGSVFKLADGTVINGKV NQTALVGGALAAGTITLDGSATITGDIGNAGGAAALQGITLANDATKTLT LGGANIIGANGGTINFQANGGTIKLTSTQNNIVVDFDLAIATDQTGVVDA SSLTNAQTLTINGKIGTVGANNKTLGQFNIGSSKTVLSDGDVAINELVIG NNGAVQFAHNTYLITRTTNAAGQGKIIFNPVVNNNTTLATGTNLGSATNP LAEINFGSKGAANVDTVLNVGKGVNLYATNITTTDANVGSFIFNAGGTNI VSGTVGGQQGNKFNTVALDNGTTVKFLGNATFNGNTTIAANSTLQIGGNY TADFVASADGTGIVEFVNTGPITVTLNKQAAPVNALKQITVSGPGNVVIN EIGNAGNYHGAVTDTIAFENSSLGAVVFLPRGIPFNDAGNRIPLTIKSTV GNKTATGFDVPSVIVLGVDSVIADGQVIGDQNNIVGLGLGSDNDIIVNAT TLYAGIGTINNNQGTVTLSGGIPNTPGTVYGLGTGIGASKFKQVTFTTDY NNLGNIIATNATINDGVTVTTGGIAGIGFDGKITLGSVNGNGNVRFVDGI LSHSTSMIGTTKANNGTVTYLGNAFVGNIGDSDTPVASVRFTGSDGGAGL QGNIYSQVIDFGTYNLGISNSNVILGGGTTAINGKINLRTNTLTFASGTS TWGNNTSIETTLTLANGNIGNIVILEGAQVNATTTGTTTIKVQDNANANF SGTQTYTLIQGGARFNGTLGGPNFVVTGSNRFVNYGLIRAANQDYVITRT NNAENVVTNDIANSSFGGAPGVGQNVTTFVNATNTAAYNNLLLAKNSANS ANFVGAIVTDTSAAITNAQLDVAKDIQAQLGNRLGALRYLGTPETAEMAG PEAGAIPAAVAAGDEAVDNVAYGIWAKPFYTDAHQSKKGGLAGYKAKTTG VVIGLDTLANDNLMIGAAIGITKTDIKHQDYKKGDKTDVNGFSFSLYGAQ QLVKNFFAQGSAIFSLNQVKNKSQRYFFDANGNMSKQIAAGHYDNMTFGG NLTVGYDYNAMQGVLVTPMAGLSYLKSSDENYKETGTTVANKQVNSKFSD RTDLIVGAKVAGSTMNITDLAVYPEVHAFVVHKVTGRLSKTQSVLDGQVT PCISQPDRTAKTSYNLGLSASIRSDAKMEYGIGYDAQISSKYTAHQGTLK VRVNF >RC1113 surf1, surfeit locus protein 1 MKTNFLVFITFTILISLGFWQLSRLKEKKLFLASMQANLTSPAINLAEIQ DGLPYHKVKITGQFLPNKDIYLYGRRSMSSEKDGYYLVTPFKTIEDKVIL VARGWFSNRNKNIITQATNDRQHEIIGVTMPSEKTRIYLPANDIKNNVWL TLNLKETSKVLGLDLENFYIIAEGKDISNLDILLPLAINHLAAIRNDHLE YALTWFGLAISLIVIYVIYRRRYMAVDVIPRACSGIQKNN