GenColors

Gene list

Applied filters:

COG category: Function unknown
Organism: Rickettsia felis URRWXCal2, URRWXCal2; California 2
Gene type: CDS

Number of genes found: 114

# Rickettsia felis URRWXCal2, URRWXCal2; California 2

>RF_0993 WD40-like repeat
MYLRKNMTKKIALLLLPFILISCNGLGPKRVKNIVELTPKLAIQTNEPIY
LDSNANIYTFNANMLKNKQYSFARSKMITEPVFIGDMIYALDIRSNISAF
SIEKNKIIWSYNLSRHKKDNYIGGGILHHNGKLYVTYGSRLLVVLDAKSG
YEIIRKELPDIIRIKPIVLDDNTVLVQTISNQTIALNAETLKTIWEHESL
AEVLSASYFMTPIVQHDNVIVTYNSGQVLALNIKNGEVRWNFEFANLNDR
TAIPNFDESSILCTPVHDNMNLYIATGLGKLIKLNIATGSVIWQVNAEDI
QSMSLIGNSLFVTNNARQIAAFNPETGKVKFVADLNDGKDPKKLKSATFL
VPFVGVDNNNKRSLNVISVNGVLYSFDVDNNGLNMIPNIVKIIKNIRYYG
LMANNNLYFSTDRKIIFGSK
>RF_1043 Conserved hypothetical protein
MQRYLDPTNDSLFKKIFRDLERLKEFINAVLELPEGFRIKEIEFIPVEQV
PIIDKGKKSIFDLKVKDEAGSWYIIEMQKRNESDYLKRVQYYSAHSYVQQ
LTKGIKHKDLLPVIVISLIKRLKCLMTKYLV
>RF_pd41 Transposase
MNNKNIPKDYSEVSVADTKASNRHHEQVDLDSHNTSERPRHDALIRKALE
NPLVAKEFFEMHLPKEIKAMFSSHTLKMEKESFVEADLKHSISDILFSAK
FKDNTGYLWVLLEHQSTPDHFMAFRLFKYMTDIASRHLTLNPKSKHLPFV
YPLVFYNGKKKYNAPKNIWDLCQHKELMQDIWTKDHKVVNVHDIPDEELK
KKAWAGILQFFMKHIHERDLLKRWYEVADLLPEFAKLNIGIDYLELILTY
TLIKIEKSDKIELEKLLKSRLNTEQGEKLMTSLAHHWEQQGVEKGMQIGE
AKGMQIGRNEGKHEKTIEVAKNMLSNNYSIPEVSRITGLSIAEINDLLKS
>RF_0907 unknown
MSKYRTTSLSHVINEEKKLYYNALENNNKSLEITDWILYFVETIIKAQDY
TLRNIEFLINKTKFYDKFKNILNARQEKVVKRIFEEGVEGFKGGLSAKNY
ITITKTSKATATRDLQELVEMQAFIKTGELKGTRYSINLENFSQ
>RF_1240 HicB family
MIKNDLMNYKGYLGSVHFNASEELFFGKVEFIRDLISYEASDAKTLIKSF
QEAIDSYLEDCNIVGKIPDKPFKGSFNVRIEPELHKEVSLYAMQHGYTLN
GIVKKALNEFIKI
>RF_0695 unknown
MNNYGMDVELERHYLGKQKLVLSLDDEDIFSELDSNTLLTELEKYLIDLV
DSKKNSNIVICTYLQGYSIQDSDDLLIDSSLIVPRNKAWVSKIEDYLKEY
NNILFAVGNDHLFGEMGLLRLLMDEGYLINRLNDDLTENKFSIDDLNYYK
ASAIELFIRDENGKSLFNQEPGIKLDDQLYERVDSDNPFNEIYTKILGAT
SE
>RF_1332 Conserved hypothetical protein
MINNIMNKFYNYNSSSHQVLLNLKVKPNSKQNLISDFVIINNIPYLKLSI
KATPEQGKANEEIINYLAKEWKLSRKDIEIIKGHTNSLKTILIKNIDEDY
LNLIINSYIK
>RF_0908 unknown
MLCLQVDEALNTSEIEGEYLNRASVQSSIKRYFNIATDNRKASPAETGIS
ELLADMYYSYEQPLSHDCLFRWHKMLTNGRRDLGAIGKYRTHLEPMQVVL
GKYHEPTVHFEAPPSNIVRQEMDKFIK
>RF_1317 unknown
MQKTKQIQFFITDSGKSYIKDWLEKLDVETHSRIINRLVRLEYGVYGDYK
QIKGRLYELRFFFGKGYRIYFTEKDNKIILLLNAGSKDTQDKDIKKALEI
IEKVYK
>RF_0729 Conserved hypothetical protein
MKGLINGSVYYKIFDKTFNNSSHRYIKKNNSINETEIVENKKSITTYFKA
QDILILNQVHSNQIVNADESIIAVPEADGSITTKKNLVLAVQSADCVPVL
LASGDGKIIGAAHAGWKGSINNIISNIVTKMIEKGAKNLIAVIGPAIAQS
SYEVDDEYYKAFLSKDINNKQFFINSIKENHYMFDLPAFVELKLKEASVK
DIKNIAEDTYKNPLKYPSKRRSYHLQEPYNQNILSAIVIK
>RF_1210 unknown
MSIVTVTLNNKSFQLYCNNGDEEELLSLANKLNDKIAEIKLGSPTASFEL
LLVMASLNAQAEIANLTEKLNKNGFQKNHPDEEKFAETLTTIAGYLENLA
RKMGK
>RF_0701 Toxin of toxin-antitoxin system
MQNINSGLVLDTHVLLWSILQPEELSEQIKHKINLAQENSQLFLSSISLW
EIAMLNFKKRINVYEPIKDFLNSITNINGLSIKDISPEVAAESVSLMDDF
HGDPADRIIVATAKCLGATLLTRDQKILSWAKLGHIKSISI
>RF_1162 NT (nucleotidyltransferase) domain and HEPN (higher eukaryotes and prokaryotes nucleotide-binding) domain
MKTTLPERSLKIEARLNFIVQQILDIAQDKIAMIILYGSFARGDWVRDLP
NGYHSDTDILIILKKGKYKGHAALRLEDNIYKRLEKTGIIKNQIIPYDSE
ISIILESIDEVNRQLEKGRYFFTDIKKEGILLYDSSEFVLREAKELPWSE
VKEIAKEDYEQWYERGYGFLDGAYNFLEKQKYALAAFMLHQATESFYSTI
LLVFSRYKPKLHDIKKLGGKAENYNSELLQVFPIATPEQKECFELLQKAY
VDARYNKNYKITKEQLLYLIDRIEKLKQITEKICLEKINGI
>RF_0792 Probable toxin of toxin-antitoxin system
MKVIWSNKALEQLRFWKRTNPKITKRIQILIDNIVSTPYDGKGKPEPLKY
KLNGSWSRRIDQEHRLVYSVNLEIQLIEIESCKGHY
>RF_0929 unknown
MAIVLHGIHKKAELSPLEKIYTTITGQSQEEYQAKKVEKILTMEHPNYKV
IDDTKSYKSSLQTPTPVQIKQEKTKLVKQALNTPSKAPPKPARNFKTPNQ
TPNTSKDSSKQHER
>RF_0128 unknown
MQQRHLLRIRLKVYMIKKIIFGIAILLSLSCFANSTTSDGSKKDAAKTND
GTTQKIIDDFSAYAGTIKPEVRKEIQEYRVEIVDINKKKRELYNSLSKEA
QNFLAEQQKYKQKLSISKLPTEDDSPNNTANSKDNKDTDTK
>RF_0425 unknown
MSRLKNMSIKEEHHIKKISFVQSLLELLPFNEWNNKLLEEAEEKCGFAKG
YALIVFPEGLSEIIEFFESYLDNIMLESLKTIEEPAKIRDKISLAVKIRI
KTVLPIIHSKNAAYFALNPMQGTEVAFRSCDAIWRYTGDKSLDFNYYTKR
GLLLSVYVSSILFYIQDESENYIETDKFIETAVENIVKTFSQMKKLLDPS
NIPIVRMFT
>RF_0073 Uncharacterized protein
MRKAFKKFLKNNKYVLSIITILLYWYLRFVYFTSKQKFIFYDNGNKEKFL
NEQGVIFAFWHNMLALSPAMFIGHRNIYALISPHLDGKILNDLVGKFGCR
VIVGSTNKNPIGALRNIIGKLSQGANIIVTPDGPKGPVYKVNSGTTEIAY
RYNKKLIPIVSSTSRCFRLKSWDKLIIPLPFGIIKIIVGSPLELTNDKIQ
NHISLEKQLTSLTESLKK
>RF_0066 Conserved hypothetical integral membrane protein
MTKESRLLEKFTHLIGKTRIFFTVLIIVGNLIYQKFVYLNIFNFYILELS
IGAIFYPLTFLLTDLIAEFYGKERANFCVKLAIIFNIIVVLIISLMDKLE
ATNWSNVDNITFHKVFGSYHISFLASTFACYIAQLVDINIYLWIRKITKG
KYLWIRNNFSTAISLFIDTFIVIGIMSLFNIFPFDQLGQLVLNSYSFKLF
FTVFSTPIFYLAVWLISLFIKKG
>RF_0671 unknown
MKKQIIYPDFIARIFSTALDLSLFAFIAIPISQFCSFNLLWLFFNDYFLS
NNINLHNSNEMFNSVMSQEFYEYLKAGNFNKYILFNISIFATNILVIGSY
FVTLWYYKGATLSKMFLRMKIVDAVTLNRPTLKQLIKRFLGYMTFPIGIF
FILFSSKKQALHDKIAGTIVIKS
>RF_1388 unknown
MEIVNSVDVSVSCQGKEPPYDHPKVYLEIDKEKKEIVCPYCSKKFKLVTK
>RF_0485 unknown
MNLQDFRIHKRLNLNPSVIEEIYTKISEIEGIKNSWYITGQILPQTLDRL
TRSVIITSTGSSNRIEGNKLTDEQVENLYKNLSVKKFKTRDEQEIVGYLK
CLEFIFNNYEEISITESFILKLHSDMLVHSEKDARHKGNYKFGSNRVEAK
DHNGNVVGTIFDPTPPYLVKKEMQELIDWYNWTVDSKTKHPLIIIANFIF
EYLAIHPFQDRNGRTSRLLTNLLLLKHGYLFVQIISHEHIIEANKIDYYM
ALNKTQATWKTKSEDITAWLIFF
>RF_0668 unknown
MSPQIIELLIFAVIAFYIINKLITTLGSTSEEEQTKQKSYFGEPVIKDVT
YSTVKSNKEEKNIPTAQDIKAFKDIIVEHNITAVVDGMEQVHKHLYSFDP
IKFINNAKTAFQMIIEAAYKKDAKELSELIDKRYLEEFEKITPSYGDFFD
LSALSAKYSEIYMFGNNIFIKLLFQGKNVVDKIEDLKEEWTFTRNANTKE
VDWFLSNIERV
>RF_0237 unknown
MEFDRKNAIISFLKENEDQKFTSYGIAEWLVENYIEEARLKANASTNKRL
LEAKTQEEKDKEIIGIYAGEISSSKLSAERRQEPNIRIELKPKLKFYYSQ
NPDYYVNNEQATEPKINANKEDKITEEQVCDTLALYLQNKLKIHNMPIKH
NFSSNKQGTKGNKWLHPDWVGMEVLNEKWGLLIKDCAKYYAGKQARLWSF
EAKTKINRSNLREYFFQALSNSSWAHYGYLVAADLIESRQNDTRHELEML
CTRYGIGFILVDIENPEDSKISIPARERPEIDWNMANRLAEENKDFENYI
NNIDTFYKKQKVNDSTWFNINDHVNKIPKQNKKKK
>RF_1180 unknown
MSEVVVKEQLEQYISKIERLEQEKADLSQEVKDIFQDASSHGFDVKAMKS
ILKLKKLDKDKLAEQDAMLELYRDTLGI
>RF_0793 Uncharacterized phage-associated protein
MNTKQKEQALSCFDVANYFLVLVDREAGDIISQLKLQKLVYFAQGVHLAL
FNKPLFEEEIEAWQHGPVAPKLRIPFSSLKDGSIAASGEMDFDIYTEQQK
NLMYKIFSLYGEHSARYLRNLTHKHSIWREAIESSDNTITKEKIQEFFKA
NIVNDIKDYILLITEDDIQQMENAEDQWWMNYDSGVPAEDVTNEILEAKK
ELKEGKGIESHLV
>RF_0738 Biotin-protein ligase
MKIKIYNDLGVSKESIKHCVHNLKLYAPKYEVDYITAQDIIGGKWVQNTL
LLILLGGRDLYYVQKL
>RF_1398 Conserved hypothetical protein
MASYYLWFKSVHLISAICWMAGLLYLPRIYVYHTKAKIGSELDNTLQVME
LKLLRFIMNPAMISTIIFGLINAHIYGFVALDTWFHVKMFAVLILVIFHG
LLARWRKDLANGKNIHSEKFYRIVNEIPAICMVIAVIMVIVKPFD
>RF_0251 Predicted membrane protein
MKGTVYGDPKFTDFLYNEKNQDIIRIYQNNMELINGLIKKEIDGFISDRI
VGAVNILGRTMDRNILEVPLNIKTPLHLMFSKKTVSLNIVEQFNFAIDDF
LASNEYKKIIKTYIYHILLPKSIDSRWCHVIGLLGCLAFAFSGIILSSRK
NSTLFGTFLFAVLPSVSSCIMLDLIVNHDTGHLNFYFTPSYFYYIFVVVL
LGFTIIKLFSYYNKQIAEDNYLEQSLNNIVAICDSFGQATFIIIGVAMVI
IHKIEPLSFWGPFFAFITANCGAILRDFIMKENSIKRVPRGVSIEITVLW
GIAFSVLLDMYGSNPNYHTIKYSMIIVISGAFITSLLVYHFGFLEWRFRN
EKLEDIEKQT
>RF_p61 Transposase
MTNEEHNQRQQQTDVSTNGDANDTTAERPRHDELFKKVMSEPVAAREFLE
HYLPASFKNKINLNSIKIEKESFVTEDLRKRFSDVVYSVSLNKNNIKDST
TGSANNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCKRHKDANKNKSNA
AAEKDNKLPLICPIVVYANDKLYNAPRSFWELFEDCKTAKEMMGGEYLLV
DLQKQSDDEIEQKKHLGMIEYMLKHIKARDILNLWQSLLEKFESSIEIDK
ENGYIYIKWLLWYSDAKVSEDKQVELAKIIAKHLNKADQEGLMRTIADKY
IDEGVQKGMVQGMQIGRNEGMQIGEARGMQIGEAKRTMEVAKNMLNAGSD
ISFISKVTGLSISELNQLLKS
>RF_0708 Chitin binding domain
MKNEEQQNQPTPKHGRIVFPLYTMGKVCVDKKLINEEWKLNGFETGKGSD
ERFGDDVVGEPLPLDGHILNGGRTDDTDWVNATNEEIRAELKDPTFNWIN
FAIPLEGEKKLTIEWDYTASHKTRGYIVQMTKAGFTFENRLKHDDLEEIY
SELSDKFPYWDYTLAPSKTIEVAFPDRDPRYYVVEVDWIVADTPRKFVSS
FIVEYKDDVSPIGLNTNDGECL
>RF_1335 unknown
MSFFNRKTAIIKLLKTHAGKEFTASKIATWLVDTYPQEAKRKEEASNDKR
LLNAKSKVRKRKIIIMIYRNELNKLLTAIQIIEPNIKIIKKRNRAKYCYI
NNTDNTFNTAKVIKALEHNKKQELTAMEIAQLLLNAKST
>RF_1330 unknown
MKKSKSITTLNLNNNEITDKGAIAIAEALKINNTLQSLKLMDNKIGNEGI
KRIACALIENTTLTSLNLMDNNTTEEGINAIIDDLEKNYSLTYCNLSIKT
IPKLQEILERNLALEEENHKKAEILNAEGDVFYNDNEYKKAIGKYTEAIA
LNKQRLYINNKHRAEKGYEQQLKVELIELFKVQNLNIQIDDLLNTDKTSL
NICYQTISDEVAKLIADKLKGIETITTINFSGSQISTDGIIAIIEVLKTS
KYLEKFDFSCSNLGDQKISILAEILQVSTLTHIYLGENDIGIIGIKAIIE
ASKFNNNIVHLDLHNNNISNEGVKLIA
>RF_1112 unknown
MLTYHYTLRSSAYRLLALFQVDLRISTLHFAGVLIMDNKKDNISEEEKLP
KEKEIGGVKGLEPTRYGDWQHKGKVTDF
>RF_0773 Bacterial transcription activator, effector binding domain
MNKVITQLSEIKLVGITAHTSNAAEMNADTAKIGTTMQRFFADKLQDKIL
NRKTPEKIFAVYTNYENDATGEYTYFFGEEVTSFENIDKGFSTLTIPMQT
YAKFTSGPDQMPKVVIDMWQKIWNMNTAMLEGKRAYIADFEIYDERSSDL
NNAIVDIYIGIKNT
>RF_0179 unknown
MPGNRKTPHNSYTYFYFFRMNIEYKKFVNEYMLEFVKKILTKIQLENLYW
DQLIYISYRTDNPAVILPSKVKQAYPKQITIVLQYQFENLIVNDTGFSLT
VSFDGVKEIIYVPFDALISFVDSNNNYSLTFNQSLNIQENHQHEEEISSN
KSYKTSLSPNPNVIMLDKFRNSSKPKPS
>RF_1220 unknown
MYRNVYSTISNSIIKKPKILLLVLKIYLKYTKSIKKYINRKSIMNEITAN
TTELIKDISVLIDNAKVRVAIKVNSEMTMLYWNIGKRIQEEILKSTRAEY
GQEIVRTLANGLSSLYGKGFTYTALVRMNQFYQSFQGQQIVATVSQQLSW
SHIVELLPLKEQNQRDFYAYMSIQENWSVRHLRSNIDKMLYERTKLSQKP
NNEQVLSLLKMNSQLEPDLILKDPYILDFLELPDEHYESDLENAILQQIE
QFILELGIGFSFVARQKRMTIDNEHFYLDLLFFNRKLKRLVAVELKTGKF
KAEYKGQMELYLNWLKKNECFEAENSPIGIILCSEKSDAQVELLEMSASG
IHVARYWTELPPIEIFQKKIGEIVLQTKKIYENKNFRT
>RF_0366 unknown
MNKLNKNSLQKKLFYRSKNRGCKEMDYILGSFAEKYLSLMDEKKLGSYSL
ILDQNDNDLYNWINNKSSAPSYLDAEIIDKLRKIAKI
>RF_1302 Outer membrane protein rOmpA
MLEWMQYSPAAVAAGDEDRDAKFGAWISPFVGNATQKMRNNISGYKSDTT
GGTIGFDGLVNDDLALGLAYTRADTDIKLKNNKTGDKNKVESNIYSVYGL
YNVPYENLFVEAIASYSDNRIKSKSRRVIATALETVGYQTASGKYKSESY
TGQLMAGYTYMMPENINLTPLAGLRYSAIKDKGYKETDY
>RF_0135 unknown
MKKTIIFCWLFLSSISYAFAVDCNNAMTQGDMNYCAGEEYKKVDKKLNQI
YKEILKHISDEQEKVNLLKKSQNLWIKYRDADCEFRSSGVYGGSVYPMIL
LMCLTEKTEERIKEFEAMLKCEEGDSSCPFIIKTQNLD
>RF_0913 unknown
MLKTLLVGFITLICAVSYADDQTAQNPNSTNSSSDQPNDILPAEAAVNFA
QPWARPTTNVQGKVSNSAMYFTLINSRSKSYNLVNISSDKISGIEIHQTI
NNQGVSKMVKVDYPFSIAGNINVDFKPGGMHIMLYDPKVDLNAGDEFKIT
FFFDDNTTKTVNVKVANDNPYNKTGN
>RF_pd61 Transposase
MTNEEHNQRQQQTDVSTNGDANDTTAERPRHDELFKKVMSEPVAAREFLE
HYLPASFKNKINLNSIKIEKESFVTEDLRKRFSDVVYSVSLNKNNIKDST
TGSANNDKAYVYVLIEHQSSSDYWIAFRLWQYMLLLCKRHKDANKNKSNA
AAEKDNKLPLICPIVVYANDKLYNAPRSFWELFEDCKTAKEMMGGEYLLV
DLQKQSDDEIEQKKHLGMIEYMLKHIKARDILNLWQSLLEKFESSIEIDK
ENGYIYIKWLLWYSDAKVSEDKQVELAKIIAKHLNKADQEGLMRTIADKY
IDEGVQKGMVQGMQIGRNEGMQIGEARGMQIGEAKRTMEVAKNMLNAGSD
ISFISKVTGLSISELNQLLKS
>RF_0759 unknown
MKETKRFFNKKNNRLNKGYAKTFSVNEPDNNFYRKKFEHILPPIDLISEY
ESIYPGTLQELIHMAQKEQAHKHAIDLKNLKIQERVAKLTRICLLIFGIC
LVVSIFFKTF
>RF_1284 Iojap-related protein
MKKEAEELKLFILECLNEKKAEDIEVIDLTGKHKLADYIIFASGRSTKNV
GAIAEYVALELKNNAGINSNIEGLGKSEWVLIDAGTILINIFYPEAREHF
KLEEIWKR
>RF_0883 unknown
MREVVQELKKREDLKRPLEQTVLPSIDDFKKDESPKRQKFDLNPKDLLLQ
KIKNNDPEIVRVELSYEEDSSFLSIVIKTLEKNTVVKEIDLYGSNLSNDD
IKNLSKIVKLNKISHLNLNSTSLDSEKIKILINALADNTSLEYLSLNFIS
LKIEDLRILIDAIKNHPNLSSLALGSSEVGNKGAAIISELFHSKLPLKSL
DIGQINITAEGIEIIANNISSAKLMSLNLGYNNLRNEGVIKLLHALMANQ
YIKELILNETRISEKSAEVIESFLTSTPIDTLGLNKNNLGDGIKYIASAL
KLNPWLKTLNLNDNNLTHNNIKELTTALTENKALETLLISNNIEIGDLGF
EAIANMLQKNNSLKKLALMNIKLGDLGVAYLSTVFLY
>RF_0707 unknown
MQIRTLLLLLQKHCCRDRFFRHCEEKLKILTKQSHAKVLRLPRSFQSLAM
TMMLIIPGNATALLATCNPSVPTVNFGTFNAGAVRTTSVTASVTCSALLS
LLVSYTVTFSTGSAGIYNPRNMVNGVRKLNYNLYKDAAFTQILGNGTSST
VTYSYLLGLGPVTKNYTVYARLPSQPLAAAGTFQDTITVTVTQ
>RF_1263 unknown
MQQCYVARHKEVNKNMDKHLKEYTESGKKNLIVVYILYLCGVVAPPLPLI
GVVFAYINKDKADNFAASHYVFLFRTFCFAAIGWVICFTDIWIVLDFALA
IWYILRIVIGFKYMIEDQAYPNPMTYWIK
>RF_0561 unknown
MFIVNLKYTLGLDLSSIEKFITEHRKFLDKYYDKGYFILSGPIHPRIGGI
ILANIEKLNQLKDILKEDPFYINDIAEYEITEFTPTKWHKNLNIFIKKYE
>RF_0225 Uncharacterized protein
MPYLLPITAAALLILLALIIWFYVKTKTFKTQLQFLSEQNLEISNNNQLL
NQEKIGCLQKIEQLKCKVEYQEQTIKDSEKIREESFSSAKAALFDLGKDL
SKQLIEIHKMENNAARELAEKNIATASGKFNSEFERLITMVGALNKDIEQ
SKGTVDLIKQSLLSPIGAGLLAEITLENILKSSGLRPNLDFIMQYGLTTT
DSGKLRPDALIFLPSGNLMVIDSKASKFLVDEQDNSGNLSKTMNYHLKSL
ANKDYAENILTNLNKKDQNFNNVITLMFLPTEQAVEKVIAADPEFLQKAW
SCNIFPVGPAGLMNMLSFAKFQITDHRRSENYKVIIEEVRKLLSSIGTMA
DYSQKIGNNLHNMVTNYDKFAASFNRNFMSRVKNIHKLGIDSGNKAMPAA
LERYQIVSSKSEIIEVEAENPPQIEEKL
>RF_1111 Conserved hypothetical protein
MKYNRIYINSHLAENSKIELANDHVHYVKTVLRLKVNDGLRIFNGTDGEF
LAQITYIGKHKLSVRLKEQLKKPYTESALTLAAAIIKQDKLMLAINMATQ
LGITKIIPLITRRCQFRTINIERLTKCVIEATEQSERLTPPIIEKAITIQ
DYLKQNNNLMLYANEHEKEENSILRILSSLSNSDITIIIGPEGGLTDDEL
GLLASYKNTKSVSLGNNILRAETAAITAIAQVRLLSRHCR
>RF_0001 Uncharacterized protein
MTKLIIHLVSDSSVQTAKYAANSAFAQFTSIKPKLYHWPMIRNLELLNEV
LSKIESKHGLVLYTIADQELRKALTKFCYELKIPCISVIGKIIKEISVFS
GIEIEKEQNYNYKFDKTYFDTLNAIDYAIRHDDGQMLNELSEADIILIGP
SRTSKTPTSVFLAYNGLKAANIPYVYNCPFPDFIEKDIDLLVVGLVINPN
RLIEIREARLNLLQINENKSYTDFNIVQKECLEVRKICDQRNWPVIDVST
RSIEETAALIMRIYYNRKNKYNK
>RF_1382 Uncharacterized conserved protein
MIRKFLLTILAIFALWVGGLGYYLYLINSYKLNSNTTNAVIVFAGGGHKI
ETGIALLKAGYAPILFITGIESTEQLKNLLKERNVIEQQVILAPNKIMSE
EDNIKKAVDFIVTYNLTSIILVEHNYNMPFMLNKLEKAIPSSNNIYIVPY
LVFSKQKYDVLLKSYHRYLMSIVN
>RF_1041 Conserved hypothetical protein
MEDAEEIALSENFEKGKAEGKAELIQMMLKQGKNVQQIIEFTGLSKEEIE
QLKAEIENSKAS
>RF_0216 unknown
MKMKTNDAFDGFTIELFKDTDGDWLARFEELPNVSAFGNSPEKALQELQQ
AWTLMKESYISHNQSIPLAPSRKEYSGQFNVRVDKRVHRALVLEALRAKI
SLNALVSQKLTLSVNNEKSSYSS
>RF_0940 NT (nucleotidyltransferase) domain and HEPN (higher eukaryotes and prokaryotes nucleotide-binding) domain
MKTTLPERSLKIQARLNFIVQQILDIAQDKIAMIILYGSFARGDWVRDLP
NGYHSDTDILIILKKGKYKGYTALRLEDNIYKRLEKTGVIKPQIIPYDSE
ISIILESIDEVNRQLEKGRYFYTDIKKEGILLYDSKEFTLSEAKELPWSE
MKEIAKDYYEEWFDFGVGFLIDSKNNLERKSLRHSAFYLHQATASFYSSI
LLVFSNYKPKLHDIKKLGSRAANYNSELLQVFPIVTPEQKECFKLLQKAY
VDARYDKNYKITKEQLLYLIERVEKLKEITERICLKRIG
>RF_0311 unknown
MVNIMSIKLDPYEQDIEDNFEKQQKIDDPALIALLQKAAKAHLNNKRSIT
IRVAEHDIEAIKIKASKHGLPYQTYLNMLIHSDATKL
>RF_1280 NT (nucleotidyltransferase) domain and HEPN (higher eukaryotes and prokaryotes nucleotide-binding) domain
MKTTLPERSLKIQARLNFIVQQILDIAQDKIAMIILYGSFARGDWVRDLP
NGYHSDTDILIILKKGKYKGYTALRLVDNIYKRLEKTGVINPKQIIPYDS
LISIILESIDEVNRQLEIGRYFFTDIKKEGILLYDSGEFTLSEAKDLPWS
EMKEIAKDYYEYWFGRGKGFLKGATTYLNDSEYALSAFSLHQATESLYST
ILLVFSNYKPKLHNLQKLGSMVGNYDSELWEVFP
>RF_0134 Uncharacterized conserved protein
MSIESKIRQSIEQNGYITCDVLMQEVLQSNPTSYYKQVKSLASEGDFVTA
PEISQLFGEIIGLWCIKEWQRIGCPKSLSLVELGPGRGLLMRDLLRTAKL
VPEFYKALSIELIEINQNFIAHQKANLQDINLPISHRSFVEDIPKKPTII
IANEFFDAMPIKQYIKVKELWYERIFVVQPVDGRIKYDKISVNKQLQEYL
LRTHIEAKDGAVLEESYTSIEIIKFIAQHLKKLSGSCLIIDYGYDLAPGN
RTRYQYNPTLQAVKNHKYCPILENLGEADLSAHVDFYTLKTVAKNSKINV
IDTISQRDFLIENGILLRKQTLQDKLNNRHLFKFAYREEFKGDTETLATA
AYTLVREDTSLGSTSKLPLEVEFEKMSEQAGIIEKQVERLISSKQMGKLF
KVLQIMN
>RF_1215 unknown
MVLLIKGVIIMAPVLNINFKCVMPIFGNEMVSSLPGNVSHIIHKLSGTIF
SSVYSTVEGTFKKGYMVNNAQCNWKNIEQSFSESPNDFSILHQECNNGLN
TPINNIFTDAESNPHILLQQTVTPHQNGTANILGQLYKVTNDYMFSQGDI
TETNGADFLGLNECTKKEAISALQNYNTYKKQLTSIETLTNILQDTLNHL
GISQDNAQEVGVKIFDVLKIPVKFTTDNLRVILNAFKITLDNAQEAGVKT
FDVLKIPVKFTTDNLKIILEAFNVTLDDISPTEIGKNILDLLSKPFTETT
TSTTERTNDDTNDGGYSGAYIAGVALGTLAIGTLLGYAAKYGWDWYKGRN
IKNENLGIEGENIKLEGENIKLEGENIKLQKDNLNFKTLIEFQAILDEII
EIGNLTDILAILSKTNQDTINLHDQDYNISPLKLKIQGLDKAITKLNSVS
NICTNINSLGNTLHKICELLNGKDTFISIKSFVNLIKELRKTDIKSEEHK
NCLEKIFETLLGIDETSSGEYTYIPEEELQNHEMPLLADVSSNSSGIGV
>RF_0334 Conserved hypothetical protein
MFRPIYTKSFEKDIKLAIKRGKDLNKLKMLIELLVNKTALPIKYRDHQLI
GNFIGRRECHIEPNWLLIYKIDNECIIFERTGTHSDIFK
>RF_0168 unknown
MHSSFEWDEEKNNINIEKHNVSFYEAQKAFLDIKRIILEDIDHSITEKRY
FCLGQVDGNILTVRFMFRGNHIRIFGAGYWRKGKRIYEQENKIH
>RF_0991 unknown
MKLIILLFTFLFSMFSFGESETIKGKPLKYAANNDFENRLDEQEQEIRRL
IGKVEVLQHKIDMLTQNSNIPNQEENTEVLEAGDSKKQDVFDIALLKDMP
DNAPKKPIAVNKDIAPDKQAYDLALAAYKDNKLTEAKDKFKNFIQKYPNS
SLISNAYFWYGECFFKQKDYNGAAVNYLKGYKESPKGAKSSDGLLKLALS
LGELKKTTEACNMLAKLDKEFPTNRTAASKKMAEDAKIKFGCKNK
>RF_0200 unknown
MKRILITLGATATLLLTPSIFADSPTKDETKVNEVQTEEVATSSTKTITE
NLQELKDNLTKLAQTGADKFNQTLSDTYDQIAQSVAEIKKNVKDQKDKEG
EELQKSIDDVKAKMEDYKKAGSKKQEEIRQHLVDKLEELNKNINEYNKEK
ANS
>RF_0230 Transposase
MVKKLKHDSLVKTIMTDPIAAQEFLEYYLPDDFKNLVDLSKITVEQESYI
EESLNKRYSDIIYKISTHNKKEAFVYVLVETQSTIDYWAALRLWKYTLLL
CERHKRGKDKLTLVYNLVIYNGKEIYNAPSNLWSLFTDSVMSKKLMTEDY
QLVDLQAMSDDEIVKKKHLGMLEYMIKHIHMRDMIKLWEKFLTEFKHIII
LDKEKGYIYLRSFLWYTDTKLSKQKQPELVQVFTKHLSSKDKESIMKTIA
DTYRDEGRQEGVVQGKEIGKAEGEHNKAIIIAKEMFTQGFKIPVIAKVTG
LQETFVRSVVKSH
>RF_0896 Uncharacterized low-complexity protein
MKKNMRKQMLKVISIITIYLLLSSCSESTRDANGLLTDSQSTVIRNYIIS
QNSKNLKVNLKEKFGSNLKGVKLIGVKLINEDLSGIDLTSCEILRADFAG
SNLEKAILTNAIIQESNFADSVIKNISGYNSDFQGSIFNNITLQNTNFVQ
SNFSDTAFNKTIIINVNFENSKFSHVLWSDNTIDGVNFQKTNLKNNSFKN
TNITNSIFYGTDLEKSVINNTNFTNNYFESSDLSQTKLTAVIIKDSNFTQ
SIFNEVNFNNVQSNNSFFSYASFQDSTLQNINLTKCDLQNSTISSSVLNH
FKIDNAILNNMSLNDNKFNNLSIKNSNANFVRINKTKGSNITLDNISYTN
NIFSNNDFKQFIVINTDLNSSEIINSNITNGQFNNVNFSKSLIQNVNFSD
VKITLGNLNQVALINSNLTNTTVINSVLSNSQINNINYQTYSGFINTNVS
NNIILNNDNSSKIPPNNIVISSVKDLQKITNLTNINLTNLDLSSLVFNGT
DFSNSIFKNANLTNTVIKNSILKEANFSAAILTKTDFSNSILIDSIFKSA
KIDQANFNNSDLTNTDFTEATIKDTSFDKAKTNGMKGVE
>RF_0737 Biotin-protein ligase
MLAPYYYNSNKGARAAYLKINPTLNLNIKDCYAFYNGGGYFVDAENTKNT
EIIASYEDNKPALIKNQSLNNIRKILKAYDTERIKLLNYILDFFINSLIE
RNL
>RF_0873 Uncharacterized protein
MQTIEQQITNVIEESLTDMGFELVLVKFKGVNPKVVEILIDSLNGEKISV
EDCTKASRTISAILDVEDLIEDAYSLEVASSGLERPLVKFENYNRFLGRE
VKIKLKELLNGKTRYQGKIIKAKNNKIYLKCEEQEVLIDYDLIKNANLVL
TEEVFKKLLKQ
>RF_0769 unknown
MLVIISSAKTLNFEKLALKTELTTSIFPNLTNQLLSTLQSYSENQLSEIM
NISTKLAHINKERFKDFNNQESKAAIFAYAGDVFNNIHVEKLTNHELNFL
QSHLLIISGLYGALKPLDAIKPYRLEMTTKLNEINLTSFWQDEITDYINK
VLAKHENKYLLNLASQEYSSVVNPNKLKYQLVNIHFKENRDGKLSTIGIN
AKKARGAMVNVIANNLIDSPELLKNFPYLGYEFSPKHSSDSELVFIKE
>RF_1348 Uncharacterized protein
MSEIFEVPDGESKVLLHCCCAPCVSPLMEKMIDTGIKFTLFFYNPNIHPK
KEYELRKNENIKFAEKHNIEFIDADYDPQNWFRRAKGMEFEPERGKRCTM
CFDMRFERTALYAYENGFKVITSSLGISRWKDMNQINESGIRAASHYEGI
TYWTYNWRKDGGASRMYEIAKEENFYKQEFCGCVYSFRDTNDWRVANNRP
KIEIGKEYY
>RF_0057 unknown
MPLSTSLLGKKSTYKDSYDATLLFKIPRINNRNELGINSNNLPFYGVDIW
NTYELSCLNKNGKPWVGVGTFYIPTDSENIVESKSFKLYLNSFNNFVVES
VEELERIILQDLSNVTHAKVTGRIFPINTKIEFSIPSGKNIDDLDIVCNN
YGPPDNSLIEYEDVLVEEEINSNLLKSNCLVTGQPDWGTIVIKYKGKKLK
HDSFLKYLISFRNCNEFAEQCAERIFTDIKNAINPDFLSIYIVYTRRGGI
DICPYRFTNKSYTLPSDKRFIRQ
>RF_0703 unknown
MKNNFKILRNIIGFIILTISTGNSFASSANAFFLVSATVLPSCIVTATPL
AFGTYVPTADSLQTNTLTITCTLGTNYTVSLNAGTAPAATTSTRKMTGLV
NTTSYLPYNLYSNAARTQNWGNQSSDWVPGTGTGLPQTLTIYGKIPQGAN
VPSDTYNDTITVTVAY
>RF_1365 Conserved hypothetical protein
MVNFNQFLKQAQSMQKKMQEAQEQMANARYTGKAGGGLVEVIATGKGEVE
KITIDESLLKPEEKEMLEDLIKVAFNDAKQKCDEDSQNSLSGALNGMSLP
PGFKMPF
>RF_0178 unknown
MLPNLGIIAGRGSLPYLIASNYTKQGGKCYIAAIKDEADIEQIKDFEYKI
LKIGMIGEAIKYFKDNEVQNIIFIGGVNRPNFKNLSVDKIGGLLLFKIVG
QKIRGDDNLLKIVAAFFESYGFKVISSNEIYKNQQDNSNIITDITLTNSD
KNDIELGVKLLNHLSSFDIAQSVIVENGYILGIEAAEGTDNLIARCADLR
KKPYGGVLVKIPKLGQDNRLDMPTIGPDTIKNLAKYNYKGVAIQKNNVII
VEEELTIKLANEHKIFITKC
>RF_0490 Probable toxin of toxin-antitoxin system
MYTLYYTHDAKKDFKKIIKSNHKEICLQLLDLITQNPFQTPPPYKKLLGE
HVGMYSRRINIQHRLFYEVDEKNKRIKILRMWNHYYDN
>RF_0986 unknown
MSKENKKNQDMSVEDILKSIKGVINERKNPIYENDSEDEDILELTEIVNQ
DEEEKLISTKSAEAVGDIFKNFTDTIKDKKLDNNISSKNALEELVIEMLK
PELKAWLDKNLPVLVKELVEIEIKKLVQNSKR
>RF_1110 Uncharacterized protein
MLINTSSNPLITVIHLLSSIGAINWGLVGLFNFNLVTLLFGSFPIIVTIL
YIIIGFCGVYSFLCLGKLFCKPGIEKAK
>RF_0473 Cell surface antigen-like protein Sca7
MRKKSRILKSFLATASILGLSLIILVNEIYAAAVSQMVGDVNLNNPNTNF
NPPFVDGNTIEVINSGNITVSNSGTYNIGAIRADQEVTTISVGSGSTDLA
INFTIGSMSGVVKSLNITRFNNATNINITLNGSAGGGNIHPVNDYSALQQ
VSFTQMGGGIVQIVLI
>RF_p41 Transposase
MNNKNIPKDYSEVSVADTKASNRHHEQVDLDSHNTSERPRHDALIRKALE
NPLVAKEFFEMHLPKEIKAMFSSHTLKMEKESFVEADLKHSISDILFSAK
FKDNTGYLWVLLEHQSTPDHFMAFRLFKYMTDIASRHLTLNPKSKHLPFV
YPLVFYNGKKKYNAPKNIWDLCQHKELMQDIWTKDHKVVNVHDIPDEELK
KKAWAGILQFFMKHIHERDLLKRWYEVADLLPEFAKLNIGIDYLELILTY
TLIKIEKSDKIELEKLLKSRLNTEQGEKLMTSLAHHWEQQGVEKGMQIGE
AKGMQIGRNEGKHEKTIEVAKNMLSNNYSIPEVSRITGLSIAEINDLLKS
>RF_1282 unknown
MPYLKWKIMIKVFIILITIILSTTINADNKKLPIPRFVSIKSNEVNARSG
PTTKSAVEWLFVKKGEPVEITAEYEQWRQVRDINGEGGWIHSSVLSGKRS
VVITSDKEIELTKSADHKSRVIAKLMPKVRCGLKKCKEQFCQITCKNYTG
WISKKVIWGVYDDNDRY
>RF_0799 unknown
MAGHSKFKNIQHRKGAQDKKRAKVFTKLIREIVTAVKTGSSNIPENNPRL
RNALTAARSQNLPKERIDKAINSADDANTENYTEIRYEGYAPNGIAIIVE
ALTDNKNRTAAEVRSGFTKYGGSLGETGSVNYLFKHCGVIQYPTNIASNE
DIFEAAIEAGGDDIVSDEIFHTIYTDIENFSKVLEFLTGKYGIPEDSYIG
WIPLNTIIIDDKEKAEKLLKLVEILEESDDVQRVFGNYELSDDVYEIIQG
EE
>RF_0810 Uncharacterized conserved protein
MPNYYTLNLYKLNLIIMRCSSYCTSSEYKMSDLVTNLKKIGLEPQHFDDV
LYIRKEINKDSDFIEIFFFPFGCVTIWGGDEIQEKIVLSDTDLVTVNKLK
EPVSDYIYFEYNTEVEKTFIDEEKNKIILADKSVFVKLSISHALAQSVKL
SVLEQSVSNLIVQTTPIQQELARTGSVSLSKKEILQQIGILFNERYSISL
HSDIFDTPEFFWRRPSYEPLYLMTAEFQDIEIRQNIMNHRLNMIHELLDI
LSNDLNYKHSTKLEWIIIILIGLEVVLSLSHTNLFLKIIGAL
>RF_1216 unknown
MAIVDGFLTSPTTKVKLRWLKLEQFKFNQSGVNMSDQDTKTISKLIIEFT
LNKNKKNLRAEIADLLERPITKSDIERIEQRLAELEKIFSNDL
>RF_1206 PIN domain containing protein
MSKIIFDASALIALFAKEDDYQLIKKYMRDGVISSVNIAEVYKYCIEKQG
LTEETAKILIKLLDIKIIDFCPEQALISANIINKTKIYGLSLGDRACIAL
AIFKNYPILTCDKIWQKLDLGIKFIMAR
>RF_0835 unknown
MPSKKQLIPNNNIYELPTTLVSNIIYQIESAKSQVASYTNSTLVMLYWHI
GSLINQEILNNKRAEYGEQILSQITKRLTLLYGNGFANLSRMVKFSKLYP
YQEIVVTVSQQLSWLHIISFVIWLILCHSRVGGNPVKSIKNLLKLFFWIP
AYAGMTSSRFSESCNKTHIIKLIHI
>RF_0911 Probable toxin of toxin-antitoxin system
MEMIYTILYTKKAIEDIQNLKAAKLADKAENLCKSLTINSMPLNSKKLYR
DLVGKRSIRINLQHRLVYEILEDKKIIKILSMWGHYE
>RF_0755 Conserved hypothetical protein
MNFHDIRMPEFIESFAVGKTEFSTSHAITKSGREARHLDRNYGCQKYLIK
NARLSSSEFEQFNSFFKARRGSNFAFRFRDYADYKVTNGMIAKGDGNLNK
FQLKNIYSDSIAPYERVITKPVNNSVILYINNVRAMGIVDYNDGIVTLPS
PLGQDVVLTADFTFDVAVKFSIDSFEYSYCNDGSIELSNIELVEVAILV
>RF_0446 unknown
MTINNNVYINQLKPTAKAAMLIRKPIDQVFEAFINPNITSKFWFTKSSDR
LEVKKQITWTWEMYSFSAQINVQEIEKNKKILIEWDTYKIPTLVEWQFTS
ISSEETFVTITNTGFIGDGDEVIEQAISSTEGFTLVLAGAKAFLEHNIIL
NLISDRFPKK
>RF_1212 Uncharacterized protein
MSSMLKQIRLDSGKTLNQVSSDLKIRKKYLVALEEGDFDILPGEVYVKGY
LKLYLDYLNVKDRNAEQIEATK
>RF_0313 unknown
MKRILQKIASIIAPPPKLTVSLHMVTFLWQSSKNIDLKMEVKSFSNRLVT
LVLPMPFVISTGDEFTIIAGCDKASRTCIVKFNNIINFRGEP
>RF_0719 unknown
MPSSYKRRKKIWKSVYFLIIVGILYIGYILIKSGYINEENDINVTKKSLK
DTKNFDLKYNIILKDSIFEGENKNLNAYKIKTERAIKESDNKYKLDIINA
IYNVNQDQTLIINAKEGFLDEESNILDLKNDVKLFFDEIIFNTNDARIDL
VNKNITGNSSAKLLYKNSSITSDSFNTRDENNIIIFKGNVSTIIDLSDY
>RF_0031 unknown
MVKLRDDRGEIDPRLLADGDKLMNTEKLKDIKARIKDLKTSKISNSKIQQ
EISPFTIAVDLVSGTMIGVVIGIFTDKFFNSKPLFLIIFTIIGMIAGFNI
IRQKVNNKK
>RF_0562 Conserved hypothetical protein
MTRILLLLLRFYQYFISPLLGNNCRFHPTCSEYAKEAITTHGTLKGLWLT
FKRIIKCQPFCNGGYDAVPLSIKNSKLFNKKI
>RF_0249 unknown
MNVIACVDISARHELRSRRLHGNLMKYPEIASSKLTVSPRNDEKIRNKHM
SPKFMKFYEKYMTIVGTIGNFMFYVQAHKIFTCKSSASVSMPAFTISAIA
LCSWLIYGILIKNTPIIIANIVGFIGALLVLLTIIIY
>RF_0058 unknown
MNSQKTLPLLPKATAMWLIENTSLTFKQIADFCGIHEFEIKGMADGEVAQ
SIKGLNPIANGQLTLEEIERCSKDPNANLQISYRPADELMKNQKKQRAKY
TPIARRQDKPDAIYWLLSNYPNIQDHQIIKLIGTTKTTIDAIRTRSHWNM
NSIRPRDPVLLGICSQIDLNKIVESLKPTQNPIKES
>RF_0670 Predicted membrane protein
MNQFEAYSLLFVDSFVSNLVIGFQNELIFHSMKMFGGYNSLIMLLVAICA
SLSGNAVNYIFGRIVLNIFYASKNEQNILRHKNLTKLYYKYEIFIIFLMA
FPFWGCFISLFSGFFKTKFLKFLGIGCLAKACYYALTLYIL
>RF_0220 unknown
MDYLFIKTIHIISSTILFGTGIGTAFFMWWTNKTGDLNAKAYAARTTVIA
DLLFTTPTVIIQPVSGIILVNMLGYNYSDLWLILTYIGYIIAGSCWLPVV
WIQIQLRNMAFEALKQRTPLPEKYYKLFKLWFYLGWPAFISLIIIFFLMV
FKPL
>RF_1002 unknown
MSNISINESYTNLIKNLKQEISKARVRAHLAVNKELIVLYWNIGKLILER
QNKEKWGSKVIQNISNDLRKEFPGMKGLSYQNISYMRQFAEEYNDEQILQ
QPVGEIPWGHNIVIFSKLKNINQRIWYAQQTIENGWSRNVLSLQIKSNLY
ERSAKGINNFSNTLPELQSDLARSIIKDPYNLEFLDIQGKIIERDLENKL
IDNIKNFLLELGQGFAFVGNQYHIELEREDYYLDLVFYHIKLKCYVVIEL
KIGKFKPEYAGKLNFYLNLMDRKIKDNSDNPTIGLILCEEKQGITVEYAI
EGIQKPIGVSQFKLTETLPKKLEKFLPTPQELAKLKSE
>RF_0718 Uncharacterized protein
MSLQSSIYRIIKLVVFLTVSISIYANDKNISNLHITSDTLIIDRIKQKAE
YLGNVVVYFDNAILRTKELYIFYKTIDDKQTIDYIVIPTKLTVERKINNE
LLLADSAKYFFDDKQLILLGNVILQRDDNVLKTNKLIYYVDIVKK
>RF_0864 Uncharacterized low-complexity protein
MTNLNIYKCEEKDLNDYLGYIKDNPSVSLNDFIKNKYFAEDNDKIIITSL
ENMEINADLVNANFQGTILTDAVFNNCDLTNTILCDSDLTNVKFNDCTFI
GTDFRGANLHYTDFNYKDYDYDNYKIPNLKDKIRDIKLSFSDLERLNKYI
DKDLEKERIKEVVIDETTNKKKYILATEDEETLHTVKSKELKTKQEELET
LKQNLDNPGIATNLLNAFWNSAETIAQNRQNELEKINKLQHEVNKLETEI
YALDNLRMFCGNGLADIFEQLKNEKIQIKLDPSYIIGSTAKERDIPKEYI
KLTSAEFDLYLAEAAKQSDTKLSLTEFVRKQKNLSEDLNIVPDLSEINLL
GRTLTNLNLKNTLFVAANLENVNISNCNLDFANFEGANLQNAVFQNVNAR
NAGFLFADLKNSKIENSDMSRAYMPKVDLSEAEVTNSKFNAVMMVNADAE
KLIIKDSEWKNSNLTGISLAYADMQRVQMQGVVLNNALLDQANIVSTDLE
NAFMNNARALEAKFKEQCNMQGITARNAYFSDAEFENILSLKEADLREAI
MQRVKLKNADLTKAKLDKANLEYADLTNATLTNATAQFAKLSNATLEKAE
AEGLNISDAIAKNINAQEANFKNAIMQRADLTKADFTKAVLENADMQAVE
AAEAIFKEANLKQANLKAANLAGINKEGADFDKAKINDATKMHDTKGEAK
GNLDHQDKDGKKTSVNVNEHAKLQDKIHAREKSGWFLKTGVGQFCTKIAK
TTTSGISSVTNFLASKKFLVGFAVVAGLAVAAAPFVAMPVLLVTGSALAT
KAVILGAGILAGGLVATGTYKLTQKPLRNLQKSFENLTSSIDKYISPPPE
NIDELVTEKQQARQKAETEKSKEREENLNNVNNNIDKAKEQDILKQAQNN
LNQETPKVEIKEKKDKTVEKQQTEPGKFVAKFKPNTKGKGFVEKIKDKKK
TNSTKEAYN
>RF_0757 unknown
MLKESLRILDLDKENGYYNGGQIIFGENRFNSKILSNFGDLIILEDIIPD
YAKDTEEVKIIAGCDKNFISCCNKFNNAINFRGEPLIPKKDFINLV
>RF_0471 Phage portal protein
MIKNYWKKFWKNSTTKSQNFIELNDIAYGNLKRVKVGIEAYRENVIVYRC
INLIAQSAGHVPWKVLKSRTGEVISELPVHYLLKRPNPEKAGVDFFSELI
ASKLLFGNSYILSTLDSYPKEIYLLPALATELVIEHDNLVAYRYKSSKGD
RIYKIDHIAKMSRVLHLKNYHPLDQHYGLSCLEAASLPIDLHQQSFYWNH
SLLQNGARPSGALIVKDSNGYLSDEQFERLQAQLSEKFSGNSNAGKPLLL
EEGLGWQEMSINPKDMDFIESKNSAAREIALAFGVPPQLLGINGDNTYSN
MQEARLALWEETLIPLLDKIADSVSNWFSYLFKEDIIIDFDRDSISALTE
KRENLWAKISNANFMTLNEKRAFVGLPPIINGDRL
>RF_0357 unknown
MPWSEMKEIAKDYYEEWFRSGCGFLIDCKYPLERGELNKSAFYLHQATES
FYSSILLVFSNYKPKLHDIEELGGRAANYNSELWEVFPQANEEQKECFEL
LKKAYVDARYDKN
>RF_0183 unknown
MKESNSNSAKEDIEGIKKDIESLVSRLRNLKGKSGDILDEQLGNLSSVME
HYKDKGIDKGKANVVDLCESTRDHPLRNLAYAFGAGVLLAILMK
>RF_0569 unknown
MKLLRIVFITIICTYSSIFAENLESVTTTEDIENDVFIPLDENHPVLNSN
DNINDSSEFKSYTNGKIIALNKITATSEEINFKVGEEKYFGNIKIKLHKC
IKNLDPYNEDNYLLMTITEYKIDEDPNLLFQGWMISSSISLSTFEHPIYE
IFAKDCF
>RF_0909 unknown
MIWNWQHKDWPNFKYNQKHILDLEKNFVKNSGILLGAAKYLSEADQNNLI
VMLASR
>RF_0111 iscA1, Iron-sulfur cluster assembly accessory protein
MPRRGSASPRNDDLVASTQRHLEKIFIILYNLQDFYKVILVTITITDRAF
ERVYELVELEKDKNLVLRVSVDSGGCSGLMYNYELVSKDNIEQDDYVITK
HNATIIIDPISQKFMLDCTLDFIEELGSSYFNVSNPQAKAKCGCGNSFSV
>RF_0843 iscA2, Iron-sulfur cluster assembly accessory protein
MKNVISLTDSAAKQVKLLIEKRAKPTFGIRVGVKAGGCAGQTYYVEYADS
KNQFDEVVEEKGVRILIDPKALMYILGSEMDYVETKFKSQFTFTNPNEKA
NCGCGKSFNV
>RF_0905 mraZ, MraZ protein
MNVFLSKYVNGVDKKSRVSVPANYRAVLGKELFNGVIAYPSIRNDCIEVC
GISHIEKLRQMIETLDPYSEERDAFETMIFGEAVQLSFDGEGRVILPQSL
MKHAGIEEQACFVGKGVIFEIWQPQNFEKYLNSAQKIAHEKRLTLRNAN
>RF_0727 rbn, Ribonuclease BN, putative
MRKIFNCLYVALFRTIEDDGVEHSGYMSFMILLSIFPFLVFLLALTSFLG
ASELGQNFIQIFLESLPEQATESIEKRIRELLSAPPQSLMNLAIVGSIWT
ASSFVECLRTILNRVYQIKSPPPYIRRRLLSIIQFLIISALITFTMFLLV
VIPILFTKIPIILETIEKYKIILNFIRYSLILILLFLGASSLYYILPNVK
LNFIDVFPGALLTVILWVISGYLLSTYIVYYNQLNLMYGSLGSIIVTLIF
FYIINMIFIYGAEFNYLMKNYENIE
>RF_0205 rompB, Outer membrane protein B
MAQKPNFLKKLISAGLVTASTATIVAGFAGSAMGAATQQNRTTVGAATTV
DGAGFDQTAAPANLAVAPNAVITANANNGINFNTPAGSFNGLFLDTANNL
AATVSEDTTLGFITNAANNGNFFNFTLGAGKTLTITGQGITAGQAAATKN
AQNAVAQVNGGNAIANNDLSGVGTIDFGAAPSTLVFNLTNPTTQRAPLIL
GDNAVIANGANGTLNVTNGFIQVSDETFATIKTINIGDGQGFIFNTDATA
GNALNLQVGGATINFNGTDGTGRLVLLSNAAGGGATDFNVTGSLGGNLKG
IIEFNTTAVAGQLIANAGPANAVIGTNNGAGRAAGFVVSVANGNAATVAG
QVYAKDMVIQSTNAGGQVNFGHIVDVGTDGTTAFKTAATTVAITQNSNFG
AVDFGNTASQITVPDTKVLTGNFTGDASNNGNTAGVITFAANGTLASGNA
DANVAVTNKITAIEAAGVGVVQLSGTHTAELRLGNAGSQFKLADGTIING
NVNQTVLVGNAALANGAIQLDGSATITGDIGNGAGNAAPIQGITLANDAS
KTLTLGGANIIGANAGGTIDFQANGGTVKLTSTQNNILVDFDLAITTDKT
GVVDASSLINAQTLTISGNIGTIAANNKTLGQFNIGSSKTALNSGDVAIN
ELVIGNNGSVQLAHNTYLITKTTNAANQGKIIFNPVVNDNTTLAAGTNLG
SEANPLAEINFGSKGVNGDTILNVGQGVNLYATNITTTDANVGSFSFTVG
GTNIVSGTVGGQQGNKFNTVELDNGTTAKFLGNAIFNGETTIEANSILQI
GGNYTADKVESADGTGIVEFVNTTPITVTLNKQAGPVDDLKQITVSGRGN
VVINEIGNAGNDHGAATDTISFENVSLGAALFLPNGIPLDGLTIKSTVGN
ETATGNFDVPRLIVSGVDSVIADGQAIGDQDNIVGLGLGSDNSITVNATK
LYAGIGSVNNNQGTVTLSGGIPNTPGTIYGLGIENGSPKLKQVTFTTDYN
NLGSIIATNATINDGVTVTTGGVAGTDFDGKITLGSVNGNANVRFVDGTF
SDSTSMIVTTKANNGTVTYLGSALVGNIGSSDTPVASVKFIGSDDGAGLQ
GNIYSQVTDFGTYDLSVLNSNVILGGGTTAINGEIDLLTNTLTFASGTST
WGSNTSIETTLTVANGNIGHIVIAENAQVNATTTGTTTINVQDNANANFS
DTQTYTLIQGGARFNGTLGGPNFAVTGSNRFVNYGLIRAANQDYVITRTN
NAANVVTNDIANSPFASAPGVGQNVTTFVNSTNTAAYNNLLLAKNSADSA
NFVGAITTDTSAAITNAQLDVAKDIQAQLGNRLGALRYLGTPETAEMAGP
EAGAIPAAVAAGDEAVDNVAYGIWAKPFYTDAHQSKKGGLAGYKAKTTGV
VIGLDTLANDNLMIGAAIGITKTDIKHQDYKKGDKTDVNGFTFSLYGAQQ
LVENFFAQGSAIFSLNQVKNKSQRYFFDANGKMNKQIAAGNYDNMTFGGN
LMVGYDYNAMQGVLVTPMAGLSYLKSSDENYKETGTTVANKQVNSKFSDR
TDLIVGVKVAGGTMNITDLAVYPEAHAFVVHKVNGRLSKTQSQLDGQVTP
FISQPDRTAKTSYNIGLSASIRPDAKMEYGIGYDFNAASKYTAHQGTLKV
RVNF
>RF_1290 sca13, Cell surface antigen-like protein Sca13
MKFNNAGSILNITSNGNVTATMNGDLDPGAAASGIIQLSSTGGLLTITGG
NNLGINGGNTLNSMILAGNGDITITPAINIATPLSTGIAGNLTLTTVNGP
VNFTNTTTLAVTNITGKTDFVNNAGIIKLNNNGTVNVLGAATLGPVTGIT
ALNLNGAGVVTINGASSATNLTINNAGVVAKAAGGFTGNAAVGAGQLTTN
TIGNVTIGTGTYTGDISGTVTFSGTGILNTTQVGGKADFVNNAGTINLAS
NGQLGAVTSTGGVSGTVNVLGAGTLGPVTGITALNLNGAGAVTINGASSA
TNLTINNAGVVATAAGGFTGNAAVGAGQLTTDTTGNVTVGTGTYTGDITG
TATFTDAGVITTDQIGGQTDFAGKAGTVNVNDSGNLGVVTSSNAVNGNLV
FLGGGNVNGAISNINTISIKGVAGNIVNFTQNVSAASLTFTNGATANLSG
SLGTANNAVAVDFGAGGGVLEFSGTNSAGYLLNSPITNGNTGTLNVYTTG
TTLTATDSSIGTVQTINIGQGNAAAAFTVDVSNKALTLESAINLNNTNSI
FGLTTSKNQQVTFTNNVDGFVNVGNAGGIVNLSSTSNAPPNKNVLTLQGN
VGNETLGTKANPLAVINVSGNVGVVGTNAKLGTNGLDVSNTAVLNIAAGG
VFVDASLTSASINAINIGDAKAGPAVYALDAINVDFDLDAPGVAFKNDSS
VLKLMTTSTDVNTPSTIYLTGNIDPGAANFGIVELNAENANTTLVINGVK
NKGVNPILSLGTANHPLQQIKFSGLGTIEVTAINTKSIDVSVPQLAIGVV
NADAFFSGATKLGVVQISGNMDFQNNAGTAIFASDTKNNIPALITGNITS
TGGQPNGTVVFMDNGTIGTVGGNNTITNLAMLQAGADNSTVTINSGGNMS
ITEVQGTGTGNILFTQPTTLTGTITGNGVNLMFTGPSSVSGDIGSSTSKV
GDITVSTDLLNCAGGVNAAYVILINGGDIKFNGAINILDISEGSSFFPFA
LAINPEDSLVIFDDSNAITYSGSIGTKGMVDVVQIDGGDVTIQGTVKTGS
VSFTTTQSATLTLANTSVVGGATTTGNKIHTLAITGDLTTGTSPFGSDSN
HLKTIQLNSDGKFTIDSQDVYSSVTTKTNNEGTAIFNADNGFTDDLGGEN
LNLKLVQFSSNKGTVKGDTYAKDITIDAGKSAVFTGYNSRSLDIAAATVG
GVKVPRATTKFNYKTKIVSENFKGSSSDSSAEYTNAALVQAPINGGSHKF
DDDVWLQKAVTGANVITFAPKKTAFIASNLGANTIVADQATMMFTGDNTK
VNVGGNISGSNITFDLGNNQVTYTGSATPTGELIINVFYDTINAGQTGNA
NSGNIVLASGSTFDLSNVSSIKVFLTAQNNPSAIGEGSAYPVISAAGGSI
IVGNAANLPFNVTANEGGFVRWQITGNSFVLLPIEPTGDVTVDNIIGKII
KAPPGSDAAKVANVLVNTPIDQRAAVIKHLLPLIERPSNEIHRVTTPLTP
MGPLVGPSVGGGFNPIQPPPTSGSVVNNIYVPPTTPGGYDVTPSNPAGYN
TPTTPVTGTGGFGPNGPGTGGNAPSGSVTGGVFTPNGPSVVAPSGPATGG
SVGGFGPNGPGTGGTTGGNIGTGGGSPSVGTGGFGPNGPGTGGTTGGNIG
TGGGSPSVGTGGASGSNVGTGGTSYNGPASGGSPAGYVPSTSATSGGSVG
TGGTSYNGPASTGGVGNSGTTPSGTGSNVGTGSNSPAGGNSVYSPSTGST
RNNVGGYTPSVGQGGTSSNGSVTGGSVGTGGTSYGGAGSNGGVTNGGSSP
YGNSGNPTTTSISPANNSNNIGGSSESGMPAIGSSRDNSSGTGSGGNSGS
GISRSNANDGFNAAHDNTSTGTQEVGKRLKTLKDAAEDGANPAAEDATSR
KRGAAAVGGGDCELDEGNANNSVYGVWVSPYYGKAVQKAFNGLSGYKAKS
TGGSIGVDTVINDNIVLGAAYTRVDTKLRYQNAKSGNTTKVGTNMGSIYG
LYYFVNNWFVEGVTTYSRSDIKTNELRNIIGGYETAHGKYHSTSYSGQVV
GGYNYLWKETSFAPMAGLRFTKIKDSGYQEYGTSFQNLTIQKRQYNKVEG
ILGGEIKTTFYKDEFIIRPQVHAFINYDFKGKTPAIIADLNGLNEPLPVP
TPKPTKMLYDLGAGVVVKKGRTEYGVHYGLNLAKKYHAQSGTLRLKVNF
>RF_0725 sca4, Cell surface antigen
MSKDSDNPGYESGYESDTEEKKQEQAVPAQPISSTANKDGNPDTSEFDPL
ANKEYTEEQKQKLEQEQKEYFSQTTPQELEADDGFSFTPASSTQSTPSIS
SLSGGISSDSQTSDPITKAVRETIIQPQKDEIAEQILKDLAALADRDLAE
QKRKEIEEEKDKTLSAFFGNPANREFIDKALENPELKKKLESIEIAGYKN
VLSTYSAANGYQGGFKPVQWENQISASDLRATVVRNDAGDELCTLNETTV
KTKPFTVAKQDGTQVQINSYREIDFPIKLDKADGSMHLSMVALKADGTKP
SKDKAVYFTAHYEEGPNGKPQLKEISSPKPLKFAGDGPDAVAYIEHGGEI
YTLAVTRGKYKEMMREVELNQGQSVDLSQTIAEDLTKVQGRSQETPQPII
TPNQELKSSIETPTTTQVPPITPANQPLQPETSQMPQPQQVNPNLLNAAT
ALSTSMQDLLNYVNAGLTKEKDGNKQIDLINEAATAILNNEKSDIAEKQA
NIIALTENTVNNNDLTPDTKVAGVNAVLETIKNDQNTPDLEKSKMLEATV
AIALNSENLEPKQKQQMLEKAVDVGLSLKDDASRVTAIDGITDAVIKSNL
STEDKGTMLIAVGDKVNASELSNAEKQKLLGSVLKKGVEAQVLSPEQQQL
MQQNLDKITAEQTKNAQITEVQGILANPAFNTIAKTEAIQNVTTKVLDSP
IKAEIKGETLESITKVVAESPLNGQDKADIVKGMGEAIASHKTMAPTEKI
STIESVEKGVAESITDLEDKKLMTKGLVEGIYEGKANPEITSEKTKAVSR
GIDKSTAIPEDKQALKDAANEAALDRETQNLTEGLKRQNLGEPKPRDDIY
NKAQDVADALKNVITPVLDAHPEKREVSEEEEVVKKTSSILNDISKLAIE
KVNNFRAMLSPDGNLKTLEEKKAESTKKVDELVKEFGTKSSTEEQQSFIK
ANLIDDKTLSKEIRLQTINKLLQEQAQKRAEAIENPNVKTEDVRVVSGVN
IKDNIKIMGALMNARDSIIQSENLNKSTPIKRESSFPPR
>RF_0175 surf1, Surfeit locus protein 1
MKTNLVVLITFTILISLGFWQLSRLKEKKLFLASMQANLTSPAINLAEIQ
DSLPYHKVKITGQFLPNKDIYLYGRRSMSSGKDGYYLVTPFKTIEDKVIL
VARGWFSNRNKIIITQATNDRQHEIIGVTMPSEKTRSYLPANDIKNNVWL
TLDLKEASQTLELNLEDFYIIAEGKDISNLDILLPLSINHLAAIRNDHLE
YALTWFGLAISLIVIYVIYRRNVISV
>RF_0094 vapB2, Antitoxin of toxin-antitoxin system VapB
MNKAKIFMNGQSQAVRLPKEFRFSVKEVSVIPLGKGIVLQPLPNSWKDVF
QEMAEISSDDIFPEGRKDLPPQKRKYFE