Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Frankia sp. CcI3, CcI3
Gene type: CDS

Number of genes found: 391

# Frankia sp. CcI3, CcI3

>Francci3_3926 transcription-repair coupling factor
MTLAPLLDALIARPGGDPALGRAIGAVGEPVLDLAGPAALRPFAAAALAV
GADRPVLAVVATGREAEDLASALGSLLGPDVVTVFPSWETLPHERLSPRA
DTVGQRLAVLRRLAHPASTGRPPLRIVVASIRAVLQPQVAGLGELAPVTL
AEGDTADLDGVVTRLVDIAYHRVDLVERRGEIAVRGGILDVFPPTEEHPL
RVEFFGDEVEDIRRFSVADQRALPEDDGATGGGRVLFAPPCSELLLTAQV
RARAADLATRHPELLDLLDKIADGIPVEGMEALAPVLVDAMSLLLDDLPA
GTHVLVCDPERVRSRASELVRTSQEFLDASWSVAALGGGAPIDLGAAAYR
SIIEVRERAEDLGLPWWSVTPFMSTPAGSDREPAGTGLTGAAYDGTEYGD
NYGDNYGDNTVVASLRPAPIYHGDTAAVIADVKGWLAESWRVLLVTEGHG
PAQRLVEMLRDADLGAGLAEEATLTPGVAIVTCGRLATGFTSDTLRLAVL
TESDIAGARGVSTKDMRRMPSRRRKGIDPLALLPGDMVVHDAHGVGRYVE
MVTRTVAGAKREYLLLEYARGDRLYVPTDQLEQITRYVGGDAPSLDRIGG
ADWAKRKSRARRAVKEIAGELIRLYSARMAAPGHAFAPDSPWQRELEDAF
PFRETPDQLAAIDEVKADMEKPVPMDRVICGDVGYGKTEIAVRAAFKAVQ
DGKQVAVLVPTTLLVQQHFQTFSERYAAFPVVVKAMSRFNSPAEHKAVQE
GLATGSVDVVIGTHRLLSGENRFKDLGLVIVDEEQRFGVEHKEQLKKMRT
AVDVLTMSATPIPRTLEMSITGIRELSTIDTPPEERHPVLTSVAAYDARQ
VAAAIRRELLREGQVFFIHNRVETIDRAAARLRDLVPEARIATAHGQLHE
DALEQVMVSFWEKKFDVLVCTTIVESGLDISNANTLIVERADVFGLSQLH
QLRGRVGRGRDRAYAYFLYPPDKPLTETAHDRLATIAQHNDLGAGMAVAM
KDLEIRGAGNLLGGEQSGHIASVGFDMYVRMVGEAVAEYRREGVEEPPEV
KVELPVDASLPHDYVPSERLRLDAYRRLAGAATDADIDEVRGELVDRFGP
VPEPVENLLAVAGLRVLARRFGVTEIITAGRQIRFAPLELRESQTLRLTR
LYRGAVVKPAVRTVLVPAPTETGRIGSRPLRDRALLVWVGQLLHAVAGDS
VAAAAASI
>Francci3_1800 Endonuclease/exonuclease/phosphatase
MNPQQSLFPNPPTPVAPTSTTELRLVTFNAQHAAPDRARRQAAWLGQQPD
ADLVVLTEVGHGPGRHALLDALAQHGYTHLHAPRPDQPDYSTVIASRHTP
LEPVPAGIDVLAHRAPAVDLWVGGQRLRLVGLYVPSRGPAARRNENKRAF
QHAVTRALPGVLAGFDGPVVVTGDLNVVEPGHVPHYPVFGRWEHDFYRSF
TDAGLTDAYRTLHPDAPGHSWYGRGGNGYRFDHTFLTTDRLDLTRCDYVH
APRHHGLSDHAAMAVAAVVAPRQLQP
>Francci3_2540 transposase, IS4
MAERKPYPSDLSDEAWDLIRPVLTAWKARHPSASGHEGGYDMREIVNAVL
YQARTGCQWRYLPHDLPPTSAVYYYFGQWRDDGTTETIHDLLRWLVREHH
RRKADPSAVVLDSQTVRSSTNAPKGTTGLDPGKKSPGRKRGIAVDTLGLL
IAVVVVAASVHDNTIGTALLDRVAAAAPPVRKAWVDAGFKTTVVEHGAGL
GIDVEVVGREPGARGFTPLPKRWRVEQTLGTLMLHRRLARDYEAKPASAV
AMIHWSMVEVMARRLTGAATPTWRDPPV
>Francci3_3350 phage integrase
MFKRCGCRDPQTGKDYGARCPKLPRSDHGSWLYIHDVPPGPNGRRRRTTG
GPFETETEAEVALTGSLAELHGGGRPDDTGLTVARYLDDWLDGKASLAAS
TRRSYEEHVRLYLKPGLGHLRLADLRDHHIEQLYAAMRLIGRDLHGRKRS
PLLVRLLEVRKDDPNHRRPLSASRLRRVHATTMSALNSAVKRKKLGVNPA
EHVELPKARKPRPLVWTAPRIAAWERTGKRPSPVMVWTPQQAGAFLDFTV
ADRLYPLWHLIAHRALRRGEAVALGWTEIDIDDDSIYILDNLPGSVSGAD
DNLDLDGDEYTEPKSDAGYRTVSLDPATKNVLTTWQARQDEERAAYGKAW
VDSGRMFTHPDGSRLTPNGVSQRFERLITRFATIRHEHAEHSWTVDQLAA
RHFMPEDAIQTALAFGPLPPIRLHDLRHTAASLTYRATKDLKVVSELLGH
SSVHFTGDVYTSVFADADRAAAKAAADIVPRRHPPGQVEPDSLDDPTPPP
ANGPGLDL
>Francci3_2147 phage integrase
MFEDSDHRVIEAIRLPLWGSVVASDGVVPWRLVDGLGEPVEPVEVFLRDF
VAQGRSANSVRSYALALLRWWRFLVAVGVAWDRVSPAEVRDFVLWLGQAT
KPVAAARVGSRATVGQVNPVTRKRHLGDGYGPATVRHSNAVLRTFYEFWL
ERGEGPLVNPVVLARAGRGRAHAHHNPLEAFRPEGRLRYNPRLPKRKPRA
MPDERWDALFAAMGRNRDRAILALGVSTGARAAELLGMRGADLDWGGQLV
RVVRKGTRAEQWLPASPEAFVWLRLYLADVGGGVLGAGDPVWRTVYRRGG
VHEPLNYEALRAVFRRANRRLGANWTMHDLRHTCAIRMVRDGRLSLRDAQ
TILGHAHLSTTQLYLEQDDEEVFARVREHLAARERPRPAPPAPAALGYDP
ADLAVLFGNPR
>Francci3_3307 zinc finger, CHC2-type
MPTIDRDAVLDATDLTALATEICGEPQGRGRAARWHCPNPAHPDEHPSMG
IYQGHHGRWRWKCHACGDGGTAIDLLMTGTGMTVREALLDLADRAGITPA
QHHTPHRPRSTAHPTPTPPAEQLTPVDPAIGTLVTAAADLLWRPIGSGAR
RYLHNRGLTDRLLAANRVGFDPGPRHLPRPDGLPRAGPGIVFPVLHPTDR
RPVYYQLRYLNPELRRRYDQPAHDLAPNPKLAHLTTDRPAHPNLTVVCEG
FPDALTAAHSGLAAVALLGTAHASPASAPTLARRLTEERPGSAFIICFDD
DTAHDPGKLPAGQNAAARLADELAQHQRLVLNIQTPAGIKDLNAWWQQDP
GAVHTVFRDLATLLELPHPAIGPPTDSPRPTGPALPEGP
>Francci3_4225 putative transposase
MIPEKADQKANRKKRGSLGGRPVSHDATLYKDRNTVERSINKIKEWRGLA
TRYDKTPESYAAGLHLRGSILWLRSLPTP
>Francci3_2868 Phosphoesterase PHP-like
MLKEVFRQQMPAVAITDHGYMYGAYDFHKQATAAGVKPIIGCEAYVAPES
RPLKQRVRR
>Francci3_2347 transposase Tn3
MPVEFLTDEQVAAYGRFDGEPVRAELERFFFLDDADRELVVRRRGEHSRL
GFAVQLGTVRFLGTFLSDPLEVPWVVVDYLAGQLDVADSSVVKRYTERLP
TQHEHAREIRSVYGYRDFADPQAATELRAFLASRAWTQAEGPVALFGQAT
AWLRRSRVLLPGVSVLARLVVSVRSEATDRVHQELAVAAEQADPQLAGRL
RSLLEVAPGARVSELERLRRGPTRTSGPGLERALSRAGEVIVVGAGAADV
SGVPANRLATLARYGLAAKAPLLRELAEPRRTATLLATVRGLEAAAVDDA
LDLFDVLMATRLLNPARRAAQQERLAVLPKLEKASVTLAGAASALLGLLA
GTAGEALDVAAAWAVVEQVAPRERVLEAAALVEELVPGDAGLETVMRAEL
ARRYGTVRPFLLLLAEALPLAATGDGQPVLDAVRLLPNLVGRRRIRPDEV
DIDLVPPVWRRAVLANPTLPAGTVDRDGYVLCVLEQLHRALRRRDVFAVR
SQRWSDPRARLLAGEGWDRVRGEVLAGLGLAAPVETHLRELAGSLDAAWR
QTVDRLAEAGPDSPVRVEPAADGRMRLAVSRLEALGEPDSLVALRATTAA
MLPRVDLPDLLLEVHAWTGFLGAYTHLGGSGSRMENLHISLAALLVAEAC
NVGLTPVTKPSEPALSRDRLSHVDQNYLRADTHAAANAALITAQAQIGLA
QAWGGGLVASVDRLRFVVPVRTINAGPSPRYFGLKRGVTWLNAVNDQVAG
IGAVVVPGTVRDSLYILDTLLTLDAGPKPEMVATDTASYSDIVFGLFRLL
GYRFSPRIADLSDQRLWRATLPGDAEGDYGPLNAIARHRVNLARVGVVLW
NTRYLDAALTALRTAGHDVHDLDVARLSPLADRHINMLGRYAFAAPPTGT
GLRPLRDPGENDQEES
>Francci3_0151 transposase, IS4
MVRTHRYPSDLTDAEWALVEAVLPPVSKDGRPGAHSRRDIVDAILYVTHN
GIVWRALPAGFPPWQTVYGFFDRWKKKGVTAGIHDALRGRVRLARGREAE
PTAGVVDSQSVKGAQTVGADSGGYDAGEKVNGRKRFVIVDTLGLLLTVLV
VPANVQDRDGGRRLLIDHYFTHHRCRHLFADGGFAGQPVAWARTIMRTTV
EIVRKKPGQKTFEALPKRWVVERTLVWLTAHRRLAHDYERHPATSASFIH
>Francci3_4149 Integrase
MIVLAVRWYLRYALSYRDVEELLAERSLEVDHVTIYRWVQRFTPLLVDAA
RPCRHRPGDRWFVDETYVKVAGRWTYLYRAVDQHGQVIDVLASTRRDQAA
ARRFFTSALSHGRRPVEVTTDKAPVYPRILDELVPEACHVDAARENNRIE
SDHSRLKAGLRPMRGLKRLRSAQTISAGHALVQNIRRGHYELGTDTDPHA
RLTAAFTELTLAI
>Francci3_0671 hypothetical protein
MGTLGAGFVRVSTGSQDETSQVKILTEEASQRGITIVKWFTLHGYSASHG
AQEPALREAIADIQRRDYTTLMVTESSRLDRRDDSDAQAEILLSIRSAGG
DIVSVAEPQFGKTDFAGRIVTLVAQHANAEKSKTVKATTYRGVSMIIANG
AHHGPLPSFWATRGERYAKQAYCADPESVRDIYGRVANGDSLLSIGRAYD
LHSTSIRTLIQFAANHTGVEECRYQGDPQSLFVQVDGVRRGAGGWGLGPV
VV
>Francci3_4086 hypothetical protein
MGGSRPFGWKDDKRTLDLTEARILREGARRILAGVPVIDLVNEWNAAGVR
GTRGKKWTKSSVLKVYRNPRICGLRGRGVEEPNVNGHVAKYMQVVTRKER
TPDGRTIEVPVKGQWKAIIGVRRWEQVIAKIGGRTYAQQGHNSRRYLLSG
VVACGRCGRSMFGSPPYRERKHAIYRCPAPTQGGCGKVSRHGPHTDDHIL
AALFNKIELETASAVVEVAPWEGEAALAEVQESITATRAAWTSVPRRISP
KDYFPTMEELREQEEALLKGRNDHLVATANAHARPADVRAEWDTYSLARQ
RAIIKEHLIAVVVHPAGRGRRFDPDLLDPVWREET
>Francci3_3223 transposase
MYWRPVFHILEDAIGECWLLNARHMHNVPGRKTDAADAAWIAELVEYGLV
RPSFVPPQPIRQLRDLTRYRKAQIEERTREVQRLDKVLQDAGIKLSSVSS
SILTVSGRAILEAMIAGTTNPEVLSELARGRLRAKIPALREALNGFFTGH
HGLIIGEILAKLDYLDEAIDRLSTEIDRVIAPFEAKVDLLDTIPGVDRRM
AECLLAEIGPDMSVFPTAGHLASWAGRCPGQHESAGRSKGGKTRKGSK
>Francci3_2963 putative transposase, IS891/IS1136/IS1341
MRRSYKFLLRPTAHQQIALTAMLDDHRALYNAALQERRDAYRHPSKTTVR
YGGQSAQLKDIRGFDADQARWSFSSQQATLRRLNLAFAAFFRRVKAGETP
GYPRFKGAGWFDTVTWPANGDGCLWDSQPENPKTMFVRLQGVGHVKVHQH
RPVAGRVKTLSVRREGARWYLVLSCDDVPAEPLEPTGAVVGVDLGVASLA
STSNGEHYGNPRFLERAAGRLANAQRDLARKKRGSKRRRKAATRVANQSR
AVARQRVDLANKTARELVADHDLIAVEKLNVKGMVRRAKPKPDPDQPGAF
LPNGQAAKSGLNRSILDAGWGVFLNALRAKAESAGRVVVEVNPRHTSQRC
AECGHVAPENRPSQATFRCVECGHAAHADVNAAINILGAGLALQVAQAS
>Francci3_1869 hypothetical protein
MVRVPGRGGNGRVSVVGMVCYRPGWRPRLLYRARTWRGGGGRRGIGWEDC
RDLLAAAHAQLPGGRLVLVWDRVNIHRQTEMTTFLREHAGWVSAVLLPAY
APELNPAEGVWSQLKRTAVVNLAARALDEVCQAVKHGLKRMQYRPPLLLG
FLAETGLVWEELWST
>Francci3_4145 transposase IS66
MRLLADRDATIAAQAATIAEQARLVERLVGQVEQLTDRVRELDRQLGRDS
TNSSWPSSSDSPYTKKKAKPRSSRTSMGRPRGKQPGATGATRQMVDDPDE
IHTIDPSLCADCGFPLAGAARLTTRRHQIFDPPPPPRPYVIEYRIVTRVC
PCCAATTEGLTPVPLAGRLVWGPRMLARAVWLVCAHHLPIRRAAAVLTVM
VGATVSAGWAGGVRARAARLLENTFLPHVRALIAAAPVAHADETTARADG
ALRYVHVAATDYLTALHTGDRTAETIDAGGIWPAFTGVLMRDGYQGYTHL
TRALHAWCGAHTLRDLRSIHDGDRGGQVWADAMATTLLDAHHAACDARDA
GANALAPEAVALIRNHYRGALARGETDNHGDRSSLAHDARTLIRRMRREE
DMILRFVVDLTVPFSNNQAERDVRPVKVQQRTSGGCWRTLAGLVDFAVVQ
SYLSTATKWGLDTLDVLERLFTTGPWLPPAAEPG
>Francci3_4504 UvrD/REP helicase
MLLDGPAARIPATAASTAGTSTAGTSTAHPPTTASSTRDADPLPEPSALG
QAVLLGMEIDGVQRQIQHLTDVLKQLRRHRQHWEAGGRAEQSVVRVLVGM
DDAGWHVLPDRRWPGTRRANIDVIVVGPGGVFVVDVKNWREITVERGRLW
RGDADADDDVRKLLDQTVAVEEVLVEAGLPPTEVVPLLVLARRKNVRAQL
DRVTVLGEQDLTRDLARRGARLAPDLVEQLLDRLARGCPPMPRAAAVTTV
TVAEAPTSRSVRRPVARRAGPSSNGTAATESPAAIERPTQAYCPAPAGRP
TSSEPLSPPEQSVPPDQDALLSREELWQELLDAAAREPIETWMTWLHPTQ
ARLAGRQWSGPARVRGAAGTGKTVIALHRAKYLAARGERVLFTSLVRTLG
PVYRALLTRMAPEHVDRIEFATVHAVATRCLREHGLADRHQQEAADTCFW
RAWGQVGQFSVLPGLGLAPGYWKDEIATVVKGRGLTDFEQYAKLARVGRS
TPLQPTHRRAVWELYERYEQLRTERGVLDRDDVLLLARDLVRESSDTRYD
AVIVDEVQDLTCVGLQLLHAFVGDKPDGLLVVGDGQQSIYPGGFTLAEAG
VAVVGRSTVLGRNYRNREKILRYAQAVVADDSFDDLDGVQERGRREVDVD
RPGGEIREVTVSGTVAQDAALCEHLVEFHDNRNVRYGDMAVLVPTNAAAA
RWLRVLTERGIPAVSLKEYDGTSCEAVKVGTYHRVKSLDFAHVCIPDRNL
FPKPRRPSESADAFRERTQLERRQMYVAITRARDSVWAGIREVPDSDHSI
PAEIGHSSR
>Francci3_1144 phage integrase
MPSTTPGSSNGRRKERRKRANGQGSIYQRGDGLWVGAAYVLMPDGTTKRR
PVYGKSEEIVRGKLTELQANSDQGIPADATGWTVERFLTVWLAQTVKPNR
QPNTYVTYEKAVRLYLIPGLGKKRLNRLSGADVRQFIRRTETTCRCCEMG
RDKARPEQERRCCAVGQCCNRRPSKRQVQVVHAVLRNALQAAVREELIRR
NVAKLVQVSTPRYNVDRGLTVEQAHRLLDEAAGNRLYALLVLALFLGMRR
GELLGLQWSDIDTDRETLTIRHTLQRVGGELRLLPPKTEDSERTLSLLGL
VADALTEHRKLQQAERDAAGDRWVTTDHVFTTKIGTAIEPDNLRRFWMPL
RQAAGLDGVVFHGLRHTCVTLLLDLGVPPHIVRDIAGHSAIEVTMTIYAH
ASMGEKRRALGRLDGHLTGPRP
>Francci3_0044 serine/threonine protein kinase
MGGQGRRITRNPTKGVIAGQDGSRRTGGGTADARRGSGRIWTLLIDKSRV
EAALPGYSVEGDLGRGGYGLVLAGQHRLIGRKVAIKILLDTSDDPDLRTR
FLSEARVLAELDHPHIVRIHDYVEHEGTCLLVMELLSGGTLKQRMSSGPV
SAETTCSIGLAAAAALATAHGHGVLHRDIKPDNIMFAGDGLLKVTDFGIA
KIFDGAETTASAILGTPRYMAPEQIMGTRLFPSTDLYALAGVLYEMIANR
PLFGRQMAVQPLTHHHLTIMPEPLTMVPPPVSAVILRALAKDPSIRFADA
ADFALELARASSRAFGPTWLSRSDVKVRIDDEIREAALATSTTPRPPAAG
RPGFPGSPGAPGSPGFPGSPGAPGSPGFPGSPGAPGSPAGHPMGGYPPPG
GPGWGGFPPGNTPPPRSTPPPRSTPPPRSLGPGYGGPDAPGGPGAPGGPG
GQTYRPGPGGPVHGMAGVPPAATRQAGHQSSPNARNRTPLIIGAVAFVVI
VAITVGIVAAVTNSGGGSRGGDRGGGTARLATAYRGTALSVQGLSPYSVD
VDPDGSLLVSSLATDRIQKITPAGAVSDLAGTGAGGISGDGGPATAAQLD
GPGSTARDKAGNIYIGDAKNNRIRKISPAGIITTIAGTGDAGYGGDGGPA
TAAKINSAEKVTTGPDGSVYLSDYENHRIRKISPQGIITTYVGTGVAGYT
GDGGPATAAKINGPNDLQMTDDGTLYFADLASDTIQKVTPDGIITTVAGT
GEGGFSGDGGPATRARLNVPSLTVGPDGRTLYLADYRNHRIRRVDPNGVI
TTIAGTGGEGSGGDGGPATAAQFKNPSSVAVDGSGALYIADNGNDRVRRI
DPNGTITTVAQPG
>Francci3_2137 putative alkylated DNA repair protein
MEIAAFQASLLDDAPAIEVGPLGTSVRRVGLARGAWVDVRPRWIVGADVL
FERLRDRVPWRAEQRTMYDRVVDIPRLLAFYDERASLPDPALGAAKRVLD
EHYAAELGEGFATAGLCLYRDGRDSVAWHGDRVGPGGFRDTMVAIAVLGA
PRALLLRPRGGGGPAIRHDLGHGDLLVMGGSCQRTWDHAVPKTARPVGPR
ISVQFRPRGVR
>Francci3_3654 hypothetical protein
MLGAGVWPNGGQFEAAVSTGAEVFTDPRLRTARFERDAFGAPVQVTGESA
VVFFGAATGSAFALRCPTRPLTGGAERYAAVTNHLRRHRVLSAFPKAEWV
AQGVRVGEKWWPVVVMDQVQGTTLRTYVRQHLASPEALRDLAAGWKLLLG
EMSKAEIAHGDLQHGNVFVTEQGSVRLVDLDSVWVPAVAHLPPDEHGHRH
YQHPRRQSTGYWDRRVDTFPGLVIYLSVLAVAADPALWEEFHRDENLILS
ASDLAQPDHTPIWSRLAASRDGEVRRLVPLLRRFCLGPPAVDVDLPTLLR
DADSALPDAPVSEAVAPSLGAGSSWWEPSQASVPTKPIDPSQGWATPPTP
PSPSGAPATPGTPATPAAGRNLWDATTRTSPNTGPSAGQGGTAAGAGDAS
GSSRAGATSAAGDGARRASPSGSVPNRGSGSRTAALVLAALAILALILII
LTVG
>Francci3_2673 serine/threonine protein kinase
MPWPMPFPGGGSVPGDRAGRDGGTVAARWMIDRRYELEAVPLAKGGMGEV
WVGRDVKLDREIAVKFVRFPDGQPDQELIRRFIRESRITARLLHPGVPAV
YDAGTHEERPYLVMQRVHGLSVADLVAEQGALPVGWAAAIAAQVASVLAV
AHRASLVHRDLKPANLMLEPDGTVKVLDFGLAVALDRTNLSKITVTGQHL
GSPAYMAPEQVLAGLSTPATDIYALGATLFEMLTGQRLFAGASSYTVMNK
QVEEVPARVRSLRADVPRGLDELVAAMLSKAPEDRPCGVEKVYAALIGSV
VDLGPLPGAVDDAVDNPVRMYAAVAGRVVPRGAARNTAASTRGSAAGPPG
ESGGGVGRSELRRARDEARTLARASRYSQAAGVLAAAVDRAGRALGHDSA
DVLALRLDLADHLFEGGDFRRAVDGYRTLLVDLVHRDAPDSERVLRCRLR
EATCRTLIGETGQALEQLRGLLEDEIRVFGADDAQVFELRRQIGLLQLGA
GHRASATETLSALLADLLRVWGMEHPMVPVVRELLAGAVG
>Francci3_2680 Transposase and inactivated derivatives-like
MLSCDDVPAAVVEPTGAVVGLDVGVASLVTTSNGDHYGNPRFLERSAERL
ADAQRDLSRKKWGSKRRKKAVARVAARSRAVARQRVDLANKTALELVRDH
DLIAVEKLNIKSMVKRAAPKPDPDRPGAFLPNGQAAKTGLNTSILDAGWG
VFFNVPRAKAESAGRTVVEVNARRTWVTWVASANQRSTSTACSRLVSFRC
HRPPSATQPPTVRDAHAELSIVARKGSLR
>Francci3_2121 transposase, IS4
MPAAPVLPPAPVLDRLAAVGAGNQPPSPAGLLAVFNQLPDPRKPRGRRHS
LAAVLTLATCAVLAGARSFTAIGEWSADAGQAVAGLLGVSRVPEESTFRR
VLAALDADALDTALGAWAAAATTPPAGTRRRLAVDGKTLRGSRTPDSPGR
HLLAALDHTSGVVLGQVAVDAKSNEIPALPVLLADLDLTDVIVTADALHT
QRQTASWLVSRHAHYILTVKANQPALYAQLAALPWRRVKTAARTVERGHG
RRERRTVKTTEVRAGLLFPHAVQAVQVTRRRQPLADGPATTEIVYLVTSL
PTHQASPTLLATYAREHWLVENRLHWVRDVTFGEDLSQVRTGHAPQVMAS
LRNLAIAILRLTGATNIAQAIRHHARRPERPLETIKSLAC
>Francci3_1095 phage integrase
MPGRTIRLDGESGVPLAPAVFLWLLAEEELDASPMERMEPPKIGERVKPL
LVLDQLSALVAVCKGKDFTSRRDEALVRVFADSGGRRSEVAGLTVADVDL
GRKRLLVTGKGDRQRHIAIGAQTALSLNRYLRRGHHRHATLPALWLAGRG
TAMTPSGVYQVVRRRGREAGIDVHPHLFRHALADAWLFAGGGEQTLADHM
GWSTTQMVRRYGAAGRSRRAQNEHERLGLGDACRLISPRRVS
>Francci3_2721 Recombinase
MLRTIGYRRISDDREGLEAGVTRQDEDIREHAARRDDIDLIDVLTDNDLS
ATKDYRPEFERILVMAAAGEIDAVISWTSNRFFRSAKDRVRVLDLFEAKG
IRIIPVRGSEADPTTADGYLMVDLMGAIDRAEAKRTAERVSRAAKQRAEQ
GRHHGGRRAFGYGPVVATDHLGRPVRDFHAVIPEETATIRRIAADILAGV
PLGSIARALNAAGVPTVTKAHRAACVNNVKGRRAWVDCPCPYRAWDATTL
KELMVNPRLIGMRGYQTRRSRRSTGAIAVMGEAQWPAILDVETWEQVRAI
LTDAARRTNTDAERVRRVLAGFVYCASCGHKLTGNGATGTRLYCQRRDGR
CAAKVRIGESFLVGLVGLAVRERLDVLVLQPAATADPVAAELATLEARKS
ALADRWASGKMEDDAYDDAHRAIGRQIREAQTRMYTSARRRATLPVDGAA
GWDALGEDIAAKRVILTQLIDGVIVGGNGTVEGDDVQRVRIVWRDLNDPR
>Francci3_1950 transposase, IS4
MSVVSPMLWLECQQEQVEVSTAAQVAGLVAVLRRVPDWRDPRGVRYELAP
VLALWVAGNIAGHDTTVAVWEWACALPVGVLAGLGFPRRVPSERTIRRIV
EEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGARSRPPQGS
VRQEAVVEAVRHDTGTALGHQRVVAGDEIASVRRLVNRVCDHNTLVTTDC
LHAHEPLARAIRAKGGHWLFSIKGNQPTVRAKLAGLPWDEFGNQHVTREK
AHGRIEERALKALTPSAPSLVGFRGTRQVVKLARTTRRKKTTTSPAATST
EEFYLVTSLSTDQASPAQLARWARGHWTVEAIHHVRDRTMDEDRHTIRTK
NAALNWAIARDTTISALRLAGYKNIRQARRATIRDPGLVLQIIALTSQNR
L
>Francci3_1235 transposase, IS4
MPVGALSRDAAGVPGAGEVSREGIWERLDRVTDPRSTRGRVYSWLCLAAV
WLCSLTAAGHHRVSAVRAWLARTSGAERARLRLPWDPFAGWRLPSTATIH
CFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEPSAAPVDPGHGCQ
PVESAVALDGKTSRHAKRADGSKVHLVGVASHGDGRLLAQVEVEAKTNET
AVFRRLLRPLDLTNVLVTADALHTVRANLDTRSRRSVRTAGAGRKDGRLP
GRASMPSPASASPRPGPRCSRPLAHRNRQQAAQDLPTRPREGPALQAPRH
RLPGNLGLPAHPPRDQRPEVSGRNRRRDRPRPDQVPAHRNDHPRPGRHRP
GFSPLMTSITVCCGHSPPTSPGQRGTQHNAPPTIRFTNPLNPQLDRLSVS
GIGARVLTTTLGWS
>Francci3_1874 conserved hypothetical protein
MKATEIVQASLAERGLTPGEHYLDAGYSSADLIHDAATRGITMVTPALLD
HSPQAKANAGFDKAAFRIDWKARQVRCPQGHTSTAWYPVRQHERDAIVVA
FARTDCHPCPTQKACTTSTRRTRYLSLRPRELHETLATARAEQATDTWKT
RYALRAGIEGTINQALDVTGLRNARYRGLPKTRLQHAFSATAVNVLRLDA
HWNPDQPPFTPRASRLTRLSQQRTTPTAN
>Francci3_0144 transposase IS66
MLWCVAVVESGVCGVAGGEALADVLVENGRLRVELAELGTEHARLLARET
AAAAELEALRAELATLRRMLFGRSSERASGGPPACAGAGDSGDGRDGAAG
EGAGGSTGRPRGPGARSGRRSYDHLARDEVDGDFEGGGYTCPSCGASFTP
WGEHVVEQLDWVVSVRLRVTRRRRYRRGCRCGGPVTVTAPGPSKAIGKGL
FTHRFLALLVVERYVAGRSQNSLVTGLSRQGAEISPATLTGACAQVGDLL
APLAEKIAGRSRGAWHLHADETTWRVFTPTGGDGSARWWLWVFLGPDSVC
FVMDPTRSAAVLADHVGIDAETGQLTGDDGAGGPRRLVISSDFYAVYASA
GAKADGIVNLFCWVHVRRYFLRAGDANPAQLGIWARQWRERIAALYAAHA
ELAAAWQAAITAPSPPAERRLATAYTGWDRAIDAIDTARREHMASPGLPE
PARRALATMDREWTGLVAHRNHPMIGLDNNPAERIIRKPVITRRNTGGSR
TDDAARRAATIFTVTATADLHGLNPLTYLADYLDACGRAGARAPTGTDLD
RFLPWAANPDDLARWRKPPG
>Francci3_4110 Integrase
MRRRRACPPVPVSEFAGFRFPPEVIVLAVRWYLRYVLSYRDVEELLAERG
LEVDHVTVYRWVQRFTPLLVEAARPCRHRPGGRWFVDETYVKVSGRWTYV
YRAVDQYGQVIDVLASTRRDQATARRFFVRALTYGRRPDEVTTDKAAVYP
RVLDELVPEACHIDAARENNRIEADHGRLTARLRPMRGLKRLRSVQTISA
GHALVQNIRRGHYELGIDTRPQIRLAAAFTELAAAV
>Francci3_0777 hypothetical protein
MLYLAGQILAYVLVAMLIGAALAWVFLIAPLRRQARNADADAARNRAGSG
ETPAGTGPDLDESRPGAVPSADRAAAANDVNRVNAGPAGTGDADTGVPDS
GARGTGTGQAAELIARLRRQRDDVALEKADLTARLAVAEQRAAESERRIA
EAERHAVIAGARVEEIETALRARAYAVTPGAAQALPAGSGAAGSGAAGSG
AAGPVPAGSPGAGGDPVGPSAAQLAHEAELLRRQLVEAEGRAAKFSSRLA
MARTEAEDAQRQVATMTTRLDRHQAEWAAERLSLLGRIAKSEALLGQASS
DEEAGAEHPEAAPTAEAAPTAEAAPTAEAVGVAAAGGVVPSNLAVTPNDA
GPSKSALPAHGLALSPEPNVSGATVQASGVATKNRPPALGGPPSSGQGSG
GGVKSTGGRSTGGTNGASGQPGTGSRVVLEPAPRWNGLLDPVRSGGDNLK
EIVGVGPVIEARLRALGITTFSQLAEMGDTDVERLAHRLDGFGDRIVSDD
WVGQAQDLQARHYGGVY
>Francci3_0878 phage integrase
MGPDPTQPGGPDQAAAAEGPGRAAHEDPHLDERRTPDLPRRHPRRTDLRP
APHLATTGVRRGEALGLAWSAVDLDAGRISIRRTLVNVTTSDTGRVPVFS
DPKTVRGTRVIALDTATVTALRDLRESKLKELALLGREPEADLVFTHWDG
RAMHPERNSRAFLRRVRRLGLPVIRLHNLRHTWATLALASNVHPKVVSER
LGHASITITWRSTATFCPACTPMPRKSSPGSSSALAGRPTRTRRTGPTAV
ATRPTRTTPMAWVMTMRNRVRLRQTARPPARGLLVRPLRSRPGRDQSVTK
RAQTIRGGTVGEHLTCENAAVQRVHLLGRYSNTPEVLTDLQTVWAVVAET
PGQGETQELPGLTGSGQVPRRHAIVDRLPASDIETLISLYLAGSTARALA
ARYSISLTAVKTLLRKRGIRRNRRSAEPS
>Francci3_3158 DNA repair protein RecN
MLEEIRIRGLGVIDDAALDLAPGLTVVSGETGAGKTMIVQGLGLLTGGRA
DYGLVRPGVDRAFVEGRLVIGAESAVAARVREVGGDLDEDPGGAVLVVGR
TLTAEGRSRAQVAGRSVPASVLAEIAEELIAVHGQSEAQRLLKPSTQRDA
LDRFAGSAVAGPLARYGGVYRELTRVSRQLAEITDRVREREQEAELLRIG
LDEVERIAPSPGEDVTLDAELTKLEHAETLVRAARTAHAALMSDPATGSD
EPGAVDLVAAAQRVIAGESALDAELAALGTRLTEVGMLLTDVAADLASYA
EGIDADPVRLADAQARKAALTSLTRAHGTGIDGVLAWADQAGRRLLELDG
AGDSVEALTARRDSLTAELARLAEEVSEARTKAAARFGAAVAAELAGLAM
PRARVEAAVSQRDDPAGLPVGLRVVAFGPFGVDDVELRLIPHPGAPPRPV
QKGASGGELSRVMLAIEVVLAAADTGSTMVFDEVDAGVGGRAAVEIGRRL
ARLARTHQVICITHLPQVAAFADRHLVVHKADDGSVTRSGIVTLDDAGRV
RELSRMLAGQEESPLARGHAEELLAAAEADKALP
>Francci3_2531 response regulator receiver protein
MLTYPGVVDLPESTLTFLAGLLAEDRAQRRTWRKLPPPEQALLVLVHLRK
GERYEQLAEGFQVSVGTVHNYIREAVRLLATHGRTLLAAVWIFAWTQSNF
LILDGTVVRTNRVRAHNKLYYSGKHKYHGINLQGLTDPYGRLIWISEGLP
GSVHDLTAARMHDILDLIDRSELYLYADKGYVGGEGDRLLVPIKKPKNND
LPDRDKEANRTHATTRSQGERGFAVLKNWHIFDRFRGCPRRVGTFAQAAL
VLATEGL
>Francci3_1687 hypothetical protein
MRSDAERWRRGRAATVWPTGLLALAGQDGAGDGPRGGAAADMAVPAAVAA
HAIALYSWPGDTVCDPDCADGAVLVEAVLARRHAVGIAADRRAWQTARHA
LTAAKARGAPGDGTILDRLPDGWSWTGLGPVDLLLTALGPPADSAGPHGL
RGDRLATRLAGYRDLARPAGRLVVVAAYHVADGLDLASRIVAAGRKAGWQ
PVQRAVALTAVPYTRALDAAPATGRAWPAHHDVLLFRSRGQPARPPRPPE
LPPSSPAPAADPPAAAANRAA
>Francci3_0118 putative IS630 family transposase
MTRPCPTLVSGMRYADGGGLTAQGRARREVVRVQAADLFAAGVDPVEVAG
RLRVSTKSAYQWKRLWQAGGTAALASRGPSGASCRLSDSQRDRLRVELDR
GPAAHGWPDQRWTLARVTLLIGRLFRTRYTLRGTSYLLHRMGYSPQVPAR
RATERDEEKITAWRAETWAKVRG
>Francci3_3266 putative transposase, IS891/IS1136/IS1341
MRRSYKFLLRPTAHQQIALTAMLDDHRALYNAALQERRDAYRHPSKTTVR
YGGQSAQLKDIRGFDADQARWSFSSQQATLRRLNLAFAAFFRRVKAGETP
GYPRFKGAGWFDTVTWPANGDGCLWDSQPENPKTMFVRLQGVGHVKVHQH
RPVAGRVKTLSVRREGARWYLVLSCDDVPAEPLEPTGAVVGVDLGVASLA
STSNGEHYGNPRFLERAAGRLANAQRDLARKKRGSKRRRKAATRVANQSR
AVARQRVDLANKTARELVADHDLIAVEKLNVKGMVRRAKPKPDPDQPGAF
LPNGQAAKSGLNRSILDAGWGVFLNALRAKAESAGRVVVEVNPRHTSQRC
AECGHVAPENRPSQATFRCVECGHAAHADVNAAINILGAGLALQVAQAS
>Francci3_2792 response regulator receiver protein
MDGENAQVGTRHAESGDARRHPSGSVAHGRSTASTTTRTTVKTVGMAAGT
SAGRPAAMEAELASVHPGPAAPVCEAAGRIARSEETIEAEVIRLLPGALA
VVRTPAGTEEVGIALVDAHPGDAVLVHAGEAIAVL
>Francci3_2024 putative transposase
MGASPAGSVNTTVDSTERLGRHRWKGERTISWLDGYRRLTIRYERDGRNF
LGFLTLAATITCRKKLPPPT
>Francci3_4211 transposase, IS4
MTHRVDSYPADHGHSGSSLSRDHAPSPPNLRQSRPLDRLLRGAAGPDGKT
PHLLAAVTHATGAVLAEHQIGAKTNEVPAFVPLLRELHTYHPPTGHVITA
DAAHTNRAHAEAIVTELGAHFVFTVKNNTPALAVDCHQVTDWTKAPIGHT
TEGKAHGRLEKRTIQLAEASEAIRARHPHARTVARIRRRTTRTVTRGTPR
RRVTRRTTTTTTVHVITSLAPGEVTAAELATYVRDHWTVENRVHRVRDVT
FREDASRVRTGPPPRVLATFRNLAIGLLRQAGHPRISPTPRRLRHDPALL
TAILGLENPA
>Francci3_3390 phage integrase
MARPSLDLGVGGKIFYSATAKGSRARCFYRDHDGVRREVERGGTSKAAAT
RALKLALRDRLRVAVGDGDITPETTMKVLGEAWFAEQQKKDRSPNTLAAY
RTTLDRHVYPALGGVKARQVTVGTADRFFSAVTTKSGPGAARIARTVLSG
MCAMAARLDAMDRNVVRDAGQITRPEPKPVSKALGAAQLRQLRALLTYDE
RARRRDIPDLVDMLIATGARIGEVCGIVWDAVDLDAGTVEIRSTVVRITG
QGLINKPRPKSKAGHRLLLLPAWAVAMLRTRHHGQNSDEVVFPAQMGGLR
DPSNTQADIRDAVNDAGFPGLTSHLFGRRSVATLLDGDGHTPRQIADVLG
HANPSITLSTYMGRKVSNPGAAETLAVLAI
>Francci3_3647 DNA ligase, NAD-dependent
MSATAGTADESGVASAAASADERARAASLARELDEHAYRYYVLDSPTISD
AEYDRLMAELAALEERHPDLRTPDSPTQKVAGSYSTLFTPVEHLERMLSL
ENVFDDDEFHQWAARVARESEVDAWLCELKIDGLAVDLVYEDGYLVRAAT
RGDGRTGEDITPNIRTLASVPVRLRGPRVPGLLEVRGEVFFPTAKFAELN
AGLVAVGGKPFANPRNAAAGSLRQKDPRVTATRPLEMIVHGLGAQRGFEV
TSQSAAYARFAELGLPVATHFEVLATVPGVLDYVHRWGDARHDVVHEIDG
VVVKVDSFALQRRLGSTSKSPRWAVAYKYPPEEVTTKLRDIRVNVGRTGR
VTPFGELEPVLVAGSTVGLATLHNIDEVGRKGVLIGDTVVLRKAGDVIPE
IVGPVVDLREGSERAFVMPTRCPECGTELVRPEGEVDIRCPNTVSCPAQL
RESIFHFASRGAMDIDGLGYETATALLEAGRVRDIGDIFHLTPESFEGLR
GFAQKKIDQILRGVEAARHRPLWRLLVGLSIRHVGPTAARALARELRSLE
AIAATSAEDLAAVEGVGPKIAGAVLDWFADERHRDILARIAAGGARLADV
GAEEGPRPLDGVTVVITGTLTDWSRDSAKEAVEARGGKVTGSVSRKTTAV
VVGADPGASKYDKARSLRIPMLDEAGFAVLLAQGVDAASKLAVPADGPEK
AETPVE
>Francci3_2883 hypothetical protein
MGEHRWCGYPAGRVRTGECGRRWSATAPGGGPGLLYRARTWRGGSGRKGI
GWQDCRDLLAAAHAQLPGGRMILVWDGVNIHRQAEMAAFLQEHADGVSVV
ALPAYAPELNPAEGCGHRSSGPRWCTWLPAASTPSRSSYRGVATTV
>Francci3_4274 phage integrase
MNESKQKRTRANGEGSIYPYRNGFAAYVWVTTPDGKSRRKYVYGQTREIV
HEKWIKLHSTARQGPMPTRSVTVAVFVARWLSEVVEPNLAPLTYSTYETL
ARLYIVPGLGAKRLDRLTVRDVQKWVNGLQRACQCCAQGKDARRPERRRR
CCALGRCCGQTISARTLKDVRGVLRSALTHAGREELVSKNVAGLVKVPKV
RARRRKAWTTDEARIFLESARGDRYYAAYVLIVVLGFRKGEALGLPDVTD
DGPEELAVEWQLQRVRGQLLHRETKTAGSDATLPLPQICRTAIAERRRLR
AEDRKAAGAAWQESGLFTTGRFGTAVEPRTFDRAFALRVQKAGVPRITVH
DARRTCASLLVDLDVHPRVIMRILRHANIDVTMEIYAQASSTATREALNR
LGESLDR
>Francci3_4436 serine/threonine protein kinase
MTDVTEMYAPGAGPQIIGGRYELGEGIGYGGMAEVFRGRDIRLGREVAVK
TLRPDLARDPTFLARFRREAQSSAALNHPAIVSVYDTGEDLINGAQIPYI
VMEFIEGRTLRDALQTEGRFTERRAMEITSDVCAALDYSHRMGIIHRDIK
PANVMLSPDGSVKVMDFGIARATTATSSTMTATAAVIGTAQYLSPEQARG
ARVDARSDVYTTGVLLYELLTGSPPFRGDNPVAVAYQHVREDPLPPSAHD
RDISPEADAIVLKAMEKDADDRYGTAGEMRDDLERALAGRRVHAMMGGGG
AGGAATTLGPAAPGTAILSRGDRAGYPTRDRYRDDDYRNDDQTTYRDVPY
DDGRFGTGPFDRYDVTGAIDRRAVPNQERSQTWKYILAGLGIVIVFVAVV
LLATTFLSTGDNTASTKKILVPTGLVGQNEQQARARLQSAGFGGEVVTKL
VDSPDLAGRVTAVDPAEGKEVAAKGVITLSIGKAADDVQIPADLIGKTQA
EAEAALRALGLVPNPMAYGNPTSQKVGTVDSTDPQAGSAVKRGSTVTLNV
VSPNVNVPDVRNRTYDNAAAELGRYGLRAAQQPVANNNVAPGTVVDQSPS
GGNVPRGSTITLSVAQQIQTTAPPTPTETPTGPTPTTEPTSTSPSPTGNP
TPGTPATTKPGGLGGLLGGRNGGGGSGGGGSGGGGSGGGGT
>Francci3_1080 hypothetical protein
MRAAVIVWSSSTTTSGARCGRRILEAMTKSALPRPRLSTSPFKAPVTPPC
KRFAVGDRVTHDMYGLGQVTGVEAEVAVLVDFGSQQERILSPFSKMSRL
>Francci3_2343 hypothetical protein
MLTPTGAVCLLGGRGSGKTTALLHLLTGVAGPVALVANSLAFLNPAGPVQ
VHALPTTIGLRAPTIALFPALRDLTRATGTLADEENRTYLPAATVAAAFT
VARAAGGPVTAFIDMAFREARPAVWRRLDTRQGTTALTAARLPDGLLDDP
HEHARLTADHARGHRRRLRECAKSVAAARCESGTDTPTVLGRGVTDLVAQ
AAR
>Francci3_1531 hypothetical protein
MAVWEAGLDRGPGSVRRGGGAPTAGFATKPLLALDMIRRFWLAHRELGWV
AGDEVYGANAELRDWLEEQMIPFSSNEIRHVLALFGQTAIPAAMITWWSD
WRRRHQARARLYHFQTRIRANQSAVARAATT
>Francci3_1882 transposase
MGLCDRPRSGRPRRISELERAELTRRGLTGDISASSVRRILAEHPVKPWR
YQSWIFPRDPEFTAKATVVLDLYQGQPLGPNDRVISVDAKPSIQARARIH
PTAPPAPGRVIRVEHEYERHGALALLALLAALDVHTGQITATTPPTSGIA
PFMALLGQIMAQDRYKKADRVFVIVDNH
>Francci3_1935 hypothetical protein
MMSYEQVLQVSDPLERAALADDLMWADHPRRLDLRTARGVAIREALEAGR
SPDDVARRLVVTVADLTWMAAPAASAVA
>Francci3_2264 Resolvase-like
MVGAVRDARAARLDGVEAERLFTDKASGKDADRPKLDEMLAFVREGDTVL
VHSMDRLARNLDDLRRIVRTLTAKGVRVEFVKEGLTFIGEDSPMATLLLS
VMGAFAEFERALILERRREGIAAAKQRGAYTGRKPALTPGQARDLVARAA
AGEQKSDLAREFGVSRETVYSYLRATETAAAAG
>Francci3_3600 formamidopyrimidine-DNA glycosylase
MPELPEVEVVRRGLERGVVGRVIASVDVHHPRAVRRHLAGAADFSALLVG
RRITAARRRGKYLWLVLQPPVDHAACAPVVPEEPPEEESAAVLAEMSPPA
LPPGHPAQGDALIAHLGMSGQLLVVPPATPDQKHLRIRFVFTDGGRELRF
VDQRTFGGLAVATGEADLPAPVAHIARDPLDPAFDERLVTERMRRRRTGV
KRALLDQTLVSGVGNIYADEALWAAKLHYARPTETLTRAEVGRLLGCVRT
VMIAALEVGGTSFDRLYVSADGVSGLFERSLQVYGRAGRPCTRCGDAVRR
DAFMNRSSFTCPTCQPHPRRARW
>Francci3_1104 transposase IS66
MRVVRRTTRRRRYRRACACYGPATVMAPGEPRALGKGLWTNRSLALLLVE
RYGAGRSLNSLVAGLARHGAVLAAPTLVGACAQAGVLLAPLVEAIRQRSQ
ASWHLHADETSWKVFTPNGGGKPQRWWLWVSLGEDTVCFVIDQTRASSVL
TGHLGLTEQADGTLTAPGGGERVLSSDFYAVYVSVGRRRAGLVNLYRVAH
LRRYFPRARLSNPVQLEYWEKAWLDRFRALYTAHRELATAWARARDTPGP
DADTRLAEAYTAWDGAIEAIDTARREQQASPGLQPAAKDALATLEREWDG
VVAHRDYPMVDLDNNAAERAPRRPVVTRKNAYGSRTDDAAALAAAVWTVL
GTAEKHGLNTLTYLTAYLDACGRAGGKPPQGTDLDRFLPWLASPDDLATW
KQPPG
>Francci3_0116 site-specific recombinase XerD-like
MTGKGRKERMIKLGYNAARAIDRYLRLRGKHSYAHSPKLWLGINNRQPMT
ANGIYQMISRRGDEAGVVVHPHKFRHHFSHSWLDKGGNEGDLMELNGWTS
PQMLRRYGASARNVRARRTYDRIMNGE
>Francci3_0505 transposase IS66
MLRCVTVVESGAGAAASGEVAEGAALLAENAWLRARVAELLTDIAGLVAR
EATREAEVVELRLQLEALQAELATLRRMLFGRSSERECGGSPAVGSPDGG
DGCGDGARGEAAGSAGRRRGPGARSGRRSYDHLSRDEVDCDFEGGGYGCL
SCGQPFTPWGEHVVEQLDWLVTVRVRVSRRRRYRRGCRCGGSLTVTAPGP
SKAIGKGLFTHRFLAMLIVERYVAGRSQNSLVTGLARHGAQLSPATLTGA
CAQVAGLLAPLAEQIVGRSRGSWHLHADETTWRVFTPTGGGGPARWWLWV
FLGPDSVCFVMDATRSTAVLAEHVGLDPDSGQLTDDADGGPRRLVLSSDF
YTVYVSAGRRADGLVNLYCWAHARRYFVRAGDANPAQLGIWARQWVERIR
ALYTAHGELAAAWHTAAAAPSPATEKRLAAAYAGWDTAITVIDTVRREQT
ASPGLQEPARKALATLDREWDGLVAHRDYPMIGMDNNPAERAIRGPVVTR
RNAGGSRTEDTARHAATIFTVTATAAMHNLNLLTYLENYLDACGRAGGKP
PTGADLDRFLPWAASPEDLTTWQQPPG
>Francci3_3433 NUDIX hydrolase
MPLQAAIRLDVRLLVRIDDRILLARPPGEAWHVLPGGPVAAGESTDDALE
RQVGRLAGPRTISRQFIGAVEHDGTITGHSPESATDHVLSIMFAGFWPSD
IPTPSRWGEHTLVPVNINVLLATRLRPLSMAEVVRRWLAEGWPLWRGLDP
AVGNRRLPSLASLRAQLFARREELRSLTFRDAAVAICALVTAADGRIDPA
EREGLLGFIATDPVMSQFPEQDVERLFDEHLSRLTADFAAGKQAALADIA
KVRGRVTEAAAVVRIGQVIGLVDGEFVASERAVVREAALALGLNTAEFAL
>Francci3_2571 Recombinase
MSASTCPPARPASHDEHPVGSSARNRRSGAGSPGQPGTRVTRLPARLPVL
RTDPLRGRPDLPAPGLPVPACRIVVTLKARQSLDTSKRVRRKHLAMAQAG
ITVGGNRAFGWLADKETKDEPAAALLVAGADQILAGVGLHTICRQWNDLG
IASAMGKKWQKPVLRNIYLSPRIVGYRVYGPTSVPLEKRYVVDADGQPVK
GQQQPILDLDVWEAVVAKLRDPSRVSKHVHIGGRKYLLSGIICCGFRGRH
LMGGYDRRWGKHHYACKAVTAGGCGKVGVTGRHVDDLVSELVLAYLAGRD
VEAEVGRWPRAGELAKAEAKIAKLMGAYDRDELPGPYVFPRVREQEQSIL
HLRAEQAEWLRAHTGPKVTNLAEGWPSLELEQRWEIISTVIEAVVLKAAD
GPTNRFDPERVEVVWRP
>Francci3_2733 transposase IS3/IS911
MGATRRGFTEEYKEQAVAFVIDGSRPVAEVARNIGVHEMTLGKWVKKAKD
VESGDRHWLRRLGWARDLPVMTVFAPACPTWSATTRRSRT
>Francci3_1180 exodeoxyribonuclease III
MGLPSTDVRIATWNINSAKARQARLIEWLDRAEPDVVCLQETKLADDAFL
ELFDEDLFRRGYRVAHHGDGRWNGVAILSRDTLDDVARGLPGDPGFPGPE
PRAIAATCGGIRIWSLYVPNGRTIDDPHYAYKLAWLAALREVVAEATTQG
AVMTLGDFNIAPTDVDVWDITQFEGATHVTPAERAALAELVDAGLIDVLR
ARWPDDVVYTYWDYRQLCFPKNLGMRIDLALATADVAGRVRAVWVDRAAR
KGVGTSDHAPVIVDLDTAPDGDIGPMVPPPSSGSSAGQSAPGRTSSTGTG
PRRR
>Francci3_1959 transposase IS116/IS110/IS902
MERQVARCAGLDVHKDEIVACARISDPGGPGRVELHTFGTTTRELLALRD
WLTGLGVTRVGMESTGVFWKAPFHILEDAVAECWLLNARHLRNVPGRKTD
AADAAWIAELVEYGLVRPSFVPPQPIRELRDLTRYRRAQIDERTREAQRL
DKVLQDAGIKLSSVASDVLGKSGRAILDALVAGTTDPVVLAELAKGQLRK
KIPALQEALTAFFTGHHAIIIGEILSKLDYLDEAIDRLSTEIDRVIAPFA
DEVALLDTIPGVDRRMAECLIAEIGVDMTVFGSAERLASWAGRCPGQHES
AGKSKGGRTRKGSKWLRIYLHDAARAASRTKNSYLNAQYHRIKARRGPAK
ARVAVEHSILVAAFHMLDRGEPYHDLGADYFTRRRDPNRHAQRLISQLDA
LGYDAVITRRTDQPTDTKAA
>Francci3_4216 transposase IS116/IS110/IS902
MDLLEAAGPGAERRETRMLFVGDDWAQDHHDVEVQDETGRRLAKGRLPEG
VAGIARLHALIGRHLAEDAGPEQVVVGIETDRGPWVRALVAAGYQVIAVN
PLQAARYRERYSTSGAKSDAGDAHSLADMVRTDRHQLRPVAGDSDTAEAV
KIVARAHQNLIWDRTRQTQRLRSALLEFFPAALAAFDDLDTPDALELLAK
APSPAEAARLTVAQISAALRHARRRKIPERAAAIRAALRAEQLPVTPAAT
TAYAAVVRAQAGLLAALNGEIARLEEQVADHFDQHPDAKILLSQPGLGPV
LAARVLAEFGDDPTRYADAKARKNYAGTSPITRASGKKKTVLARYARNNR
LADALHQQALSALSASPGARSYYDAIRARGTSHHAALRQLGNRLVGILHG
CLKTHTPYSEATAWTQKATLDVAA
>Francci3_2683 Resolvase-like
MNLKEWAESQGVAYVTARRWYAAGKLPVPARRVGGLILVGEPDQPTGDGL
TAVYARVSSADQRPDLDRQVARVTAWATGQNLPVDKVVTEVGSALNGHRR
KFLALLRDPDVATIVVEYRDRFARFGAEYVEAALSAQGRRLLVVDPGEVD
DDLVGDVTEILTSLCARLYGRRAAVNRATRAVAAATEAAE
>Francci3_1370 DNA recombination protein, RuvA
MIASLAGTVTALTPLSAVIEVGGVGLLVHCSPSTLSRLRVGESASLATTL
IVRETELTLYGFADADARDVFEILQSAAGVGPKLAQAVLGVHDPDTVRRA
VAEEDLAVLTKVPGIGRKGAQRIVLDLRDRLGPPGDGAPLPGPRLTSGPE
PMPADAVGVAATVREALVGLGYSGREADAAVSRALVVLAAPVGEGAAAGG
EQAAPGKGDVPGKEGAPGSRAGADSGTGGAPPDTATLLRASLAVLRR
>Francci3_3213 Holliday junction resolvase YqgF
MSGSGSRPGDSRPGDSRPGDSRPGVRIGVDVGSVRVGVAASDPGGVLAVP
VTTLARDRRGNADIDQLVLIVRERQAVEVVVGLPRQMSGQEGRAVRLVRQ
YAEVLAERIAPVPVRFVDERLTTVAAHRRMAERGVRSRARRSLVDQEAAV
QILQHDLDSRRGSAAPGVIGCAAPAAGPDGVVRAPRDGPRAPDGVVPPSD
ER
>Francci3_3990 transposase, IS4
MPVGALSRDAAGVVGAGAVSREGIWQRLHAMGEHRGRRGRVYPLAVLAAV
WLCALTAAGHDRVTAVIEWLAGTTEEERRRLRLPYDPFDGYRLPSESTIR
RFLNDTDDARLARALLDPPLADPAPPKPASPEAAGEAVRAVYALDGKTSR
GAKRADGRQVQLVGVADQATGRLVNQHEVDSKSNETKAFRPVLEPLDLAC
DLLTFDALHTVRDHLDWLVTDKKAHYLAVVKGNQPTLRAFLAALPWADVP
VADTTHDHGHGRDETRTLKAATVEHAEFPHARQAVRIQRWRREKGRKPSR
ETVYGITDLAFEQASAGFLADAARGQWIIENRQHHVRDVTFGEDASRSRT
RRGPVNLAIFRATVAHAVRAAGHRYVPAGRRACKTATAALDLHGFP
>Francci3_2073 transposase IS66
MLRCVTVVESGAGAAASGEVAEGAALLAENAWLRARVAELLTDIAGLVAR
EATREAEVVELRLQLEALQAELATLRRMLFGRSSERECGGSPAVGSPDGG
DGCGDGARGEAAGSAGRRRGPGARSGRRSYDHLSRDEVDCDFEGGGYGCL
SCGQPFTPWGEHVVEQLDWLVTVRVRVSRRRRYRRGCRCGGSLTVTAPGP
SKAIGKGLFTHRFLAMLIVERYVAGRSQNSLVTGLARHGAQLSPATLTGA
CAQVAGLLAPLAEQIVGRSRGSWHLHADETTWRVFTPTGGGGPARWWLWV
FLGPDSVCFVMDATRSTAVLAEHVGLDPDSGQLTDDADGGPRRLVLSSDF
YTVYVSAGRRADGLVNLYCWAHARRYFVRAGDANPAQLGIWARQWVERIR
ALYTAHGELAAAWHTAAAAPSPATEKRLAAAYAGWDTAITVIDTVRREQT
ASPGLQEPARKALATLDREWDGLVAHRDYPMIGMDNNPAERAIRGPVVTR
RNAGGSRTEDTARHAATIFTVTATAAMHNLNLLTYLENYLDACGRAGGKP
PTGADLDRFLPWAASPEDLTTWQQPPG
>Francci3_2708 transposase, IS4
MPVGALSRDAAGVVGAGAVSREGIWQRLHAMGEHRGRRGRVYPLAVLAAV
WLCALTAAGHDRVTAVIEWLAGTTEEERRRLRLPYDPFDGYRLPSESTIR
RFLNDTDDARLARALLDPPLADPAPPKPASPEAAGEAVRAVYALDGKTSR
GAKRADGRQVQLVGVADQATGRLVNQHEVDSKSNETKAFRPVLEPLDLAC
DLLTFDALHTVRDHLDWLVTDKKAHYLAVVKGNQPTLRAFLAALPWADVP
VADTTHDHGHGRDETRTLKAATVEHAEFPHARQAVRIQRWRREKGRKPSR
ETVYGITDLAFEQASAGFLADAARGQWIIENRQHHVRDVTFGEDASRSRT
RRGPANLAIFRATVAHAVRAAGHRYVPAGRRACKTATAALDLHGFP
>Francci3_4227 transposase, IS4
MVVLTVVFMRVAVVGMELGEMGRVRPVIERFAGEMFADLPRRDQRGKGEL
YVQRLLTDGKRKSMVPMAARLGVDPQQLQQFVTSSTWDYRQVRRRLTGWA
AGFCDPVALVVDDTGFPEGRAASPGVARMYSGTLGKVGNCQIGVSVHAVT
DWASAAVAWRLFLPTCWDDTTLTDPTEVAAARARERAAIPDKARHREKWR
LVLDMIDELAGWGMPVRPVVADAGYGDAAAFRQGLTDRNIPYVLAVKPTA
TAYPADAVPVTAPYPGNSRRPTPAYPDPPRDLKSLVMAAGRRTGRSVTWR
HGTHRTPANPTAGMRSRFLALRVRPAGRNITRNPDRSLPVCWLLAEWPVG
QPEPTDYWLSTLPTGIPLRDLVRLAKIRWRIEHDYRELKDGLGLDHFEGR
TFAGWHRHVTLVRVAQALCTQLRRTPKVPAPA
>Francci3_1726 transposase IS3/IS911
MPAPHPPEFRRRAVELARRGDTPIAALAKQLTISESCLRNWIAQADADDN
GGENRLTSVEKRELAQLRRDKKRLEMENEILKRAAAYFARENILPN
>Francci3_0296 insertion element conserved hypothetical protein
MTDTGSQLTDWISLGVLTSFVSRDAVDGAIEATGRGARRSDTTIPPRVAV
YFVMALALFADDDYEAVACRLAATLDDLDVVGPRWEPTSGGLTKARQRLG
SAPLAELFCQVAGPVADLDTVGAFLGPWRLMSIDGLEWDVPASRENVAAF
GLPAGRDGAPGALPKVRAVTVSECASHAPVLAAFGPAGGAKPASEQAPAR
TLYPRLAERWLLLADRNFYSWTDWCTAADTGAALLWRVKASLRLPPLRAL
SDGSYLTVLVNPKIGGKARDALVAAARAGEVLDPAKARYARLVEYDVPDR
DGDGKHEIIGLLTTICDPREATATALAGAYRQRWEHEIGNKQLKTYLRGP
GKVLRSKHPDTVYQEIYGYLLTHHAISALTCQAATAAGIDPDRIKFKRAV
RIIRDRVVTDPAFSP
>Francci3_4047 serine/threonine protein kinase with WD40 repeats
MAISRRNLPGLTGTDLQTIGPYTVERKLVDARTGPVFLARNGEARPVLVK
TITAPFGRDAEFRRRLRVDLDNIRRLAPSCLAAILDLDTGARPPYVVAEF
IDAPTLAATVAGGAALSGPDTYRLAVGLATALAALHELEIFLGDLKPINV
VLSGQGVRLVDFGLFRAMNAVSINNPGGPPSGIGTLAFITPEQALGQTAT
VASDVFTWGGMLLFAATGRPPFGAGTPRVLLQRAVYAEPDLSVFCPELRE
LVAAAMRKDPKRRPAAAELLEQLMAYPTRSEAEPAVEPTRRLALPAGVIE
TLVPVQTRRTVESETKPEVVGTSDLAAPAIGAIHGLEIVLETVTVLETVT
VLGTGPAAEITAVPPAAEITAVPPAAAVALGAVRTGAEPRMRPSSPGPSS
PAPSSPGPSAALAGVPAPLPTAAARARASYVQEGTGNVSALSSPSSPRDH
LDRGWLRRVLSIGVAVSVLVLSTVVIVDTLRERSAAATSREAARGAMRLL
DREPDLAGQLAVSAYRMAPTSAAAEALVNASIRQIGPATGAIRDLLITPD
GRYLIIVGDFGGSVWNIIGPGRVRYITDLPAVAAAMGDRSPLAAGVAPAA
AVKGLLAVALIPASGPATGVRASTIMVTAGTDGVIRLWRLAQPGSTDDGD
LQSAINTGQRVSLLAELRGHTGAVASVAVSGDGRTLASAGADRVVRLWDI
GHPQNPRALAELPQPAEVTSLAFTPDGDSLAVGGVGHLSVWDVTAAGQPR
RRAQLTAPATVRKLLVSPDGRWLAVASTSDGGSLTEIYGLDSPRGLHRLT
AIASRPGQAGSIALSADGRVLAVSTPAGQVTLWDMRSPSRPVQRATLPVG
TAPTATVFGPLGHEGVLAVVAGDAVRLWQLDLLAAEDEICARAEGRINRE
QWRTYLGHRHYDPPCD
>Francci3_1106 transposase, IS4
MKVECGRARLRPPARRRDRPVPRHPLGTGPAAGTRAWPALATRPDDAWRC
TCPNAACVLSPTRQPGDPLFDHRRCRHLFTDAGFSGTLVGWAKDVLATTV
EVVKKKPGQKTFEALPRRWVVERTFAWTTAHRRLARDYERRTAHSESFFR
WALIRTMARRIVRGTPVPRWRPGSTPDPQ
>Francci3_0959 conserved hypothetical protein
MRGVELTLEEREAISRGLAEGLSHRMIAARLDRNQSVISREVARNGGNAG
YRAADAQKRADERRRRPKAFKLETHVRLHDAVAERLSADFSPEQVSNRLK
KDFPDDPEMRVSHETIYQTLFVQARGELNTRLKLAPRSGRAERRPRGSTR
PKQARIAGMVGISEHSAEAADRAVPGH
>Francci3_0950 conserved hypothetical protein
MAGTDTEAVLAELLARHGRTYADEIGADVPADTAEAMFKMIVFALLASAR
IRTSIAVAACRSLMDAGWTDAAAMAEATWEDRTRVLNSSGYARYDESTSR
MLEAACRSLMDTYDGDVRRLRDAAEHSPDRERELLQKIKGIGPVGADIFL
REAQAGWDELVPYLDERTRRTAGALGLPTDPARLAALVGRQDFPRLVAAL
VRARLEHDVTDLREAASRP
>Francci3_0672 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_0110 putative DNA helicase
MVHSAHGGTASRAERQLTHTRVRPAQVRPEPAPGVGDTAGGEPGGQKDVE
RPLFEAFTDQDLVDLGVVPALLPLIRRIVDEDELLGLAEVAPQLTSDVLL
ALHDGKSVEEVLEHVTVPVKADSPVDPEDFAAAVARPATQVTSDDAALQA
VLGESFARWQVFLHPTQRQLVDRVYGGPARVSGGPGTGKTIVALHRAGHL
AARLPPGDDKPILLTTFNRNLAADLRARFLDLVGPDLVDRVDIVNIDKLA
SRVVGEAGASRRRRTIDDDAAVREWAAMLDEVGERRWDAEFLAAEWAEVV
LGQVLRSRTDYFKARRPGRGRPLSRADRDAVWQLVERFTKRLDDAGVWTW
RQVAEYAARTEQDRAAAVVSAAGQPTASGSLPRYRYRHVVVDEAQDLNPA
HWKMLRAMVAPGRDDIFLVGDTHQRIYDNYVTLGSLGVNIRGRSAKLTLS
YRTTRQILWTALRLLAGETYDDLDGGDDNLAGYRSLLSGGEPVLGEAVTW
ADELKLISGQIRVWESAGDGSTSAGDGSTAVCVPTRRMVEDVVTHLEASG
ISAIEIGPDGPKRAGGVHVGTMHRFKGLEYRRIIIGGASAGLVPRGVVER
YQDVDPKRYQQERARDRSLLFVAATRARDDLAIFWHGRPSPFLAAAWTRR
RAVGSSLRN
>Francci3_2875 serine/threonine protein kinase
MLTPLVDDDPRHVGPFTIHNRIGAGGMGTVYLGFNADGRAAAVKVPDARF
ADDPEFRERFRREVAAARRVHGRAVAAVLDADPEATSPWLATEYVEGTSL
ADAVLRHGRLEERLLHGFSVGLADALIAIHAAGVVHRDLKPSNILLAWDG
PKVIDFGIARASGIPSHTRTGILIGTLAWMAPEQLRGERAGPPADVFAWG
ACVTYAATGHPPFASEQSDVLTRMREDRPPDIAGVPVKLAPLVRAALGRR
PEERPSAAELVRSLVSDSAVRTPADADRAAALALTPWQARPPLSAGPGNG
SNSNGAPLDPDRPDQPTHPLANRPTPPPGDDQSWRTVPVPRRGADGSVQD
PADGGAAGPPRHPADGGLLGMLLGRLPASWRRQAPGAHGALPVVLLAGIV
GLVGLVAALIGAGGSDPQVEPAAPAPSFGPAAPGGPANQPPAGAPLWTPG
TRPGPARPGPASPPPVGDEQPMTPTDPGTATPVPTTASAQPSATPTSRPS
QPTGASPTVAPTPSPSPGASTTATSCPTGAPVGGATTAPPC
>Francci3_2704 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_2084 ISRSO5-transposase protein
MPWGRGWPGRTGRVSRRATAITLTCDVRAVLAGRARSLTVPRRDWLRAAI
VLAAADGASNTTIATDLGVCEDTVGKWRGRFAREGLAGLVDRPRSGRPAR
FTAVQVAEVKALACTRPADVGAPLERWSNAELARHAAREGIVAGVSASSV
GRWLARDAIRPWQHRSWIFPRDPAFVAKASRVLDLYARIWDGAPLGENDY
VISADEKSQLQALRRCHPTAPAQARHGPRVEFEYQRGGTLAYFAAYDVHH
AHVIGRIEPTTGIAPFHRLVDQVMTAEPYASARRVFWVVDNGSSHNGQTS
IRRMSAAHPTATLVHLPVHASWLNQVEIYFSILQRKAIERGDFADLDALG
DRVMGFQDLYNQTAAPFDWKYTRADLLKTASGLALAA
>Francci3_4006 transposase, IS4
MAERKPYPSDLSDEAWDLIRPVLTAWKARHPSASGHEGGYDMREIVNAVL
YQARTGCQWRYLPHDLPPTSAVYYYFGQWRDDGTTETIHDLLRWLVREHH
RRKADPSAVVLDSQTVRSSTNAPKGTTGLDPGKKSPGRKRGIAVDTLGLL
IAVVVVAASVHDNTIGTALLDRVAAAAPPVRKAWVDAGFKTTVVEHGAGL
GIDVEVVGREPGARGFTPLPKRWRVEQTLGTLMLHRRLARDYEAKPASAV
AMIHWSMVEVMARRLTGAATPTWRDPPV
>Francci3_3623 serine/threonine protein kinase
MRKLGSRYVLHEILGQGTTGQVWRGAAVSDGAPVAIKVLRPELADDPEIV
ERFLREWDLLIDLDSPDLVAVRDLVNEPDVLAIVMDLVDGPDLRTHLREF
GPRPVEEAVRLVVGTLWALDSVHAAGVVHRDVKPENILIDTSDPAHPIVR
LTDFGIAQMINGSSRTSVTGPIGTPLYMAPELSTGAPPTPAADVYSAGVV
LYELLAGSPPFDSPNPAELLQAHREKQPQPIQGVPAPVWGVLAGMLAKSP
RGRPVSAADAAEDLVEALEASRWGGGYASRPGSHAEHDNRIQLSALAGAS
AARSADHSPATGTQRVVRSAAVAAAGGAGCVVGAGGAVGAAAWSDLEHTQ
IAGSPLTTTTQAGGTQAGGREGGVERTRMAPATGGRGGWDGDAPTGMQPA
LRVPARSDAPAADTNTVMSAIPANHQPGPPIGGAAAASRAAADRRRRSRI
AAGAGLVVALVAGAGGWALASSGGTGAELSADGPGGVSAAAASGGSGFGT
SGAAPGVAVGVNPTSLSGGGATPRPSAKGSPSPGAGTAPAASPTGATPTN
SPTARSSPSPSPTDDGTATVPNTEGSSFTTAENTLKSAGFTNLSKAFGCY
GAGAVDTVAHQSPKSGKIAKTATINLQVEDCAQVPGVIGMTESDAKWQLT
LAGFTSTAVNGSCSNSETSKVSAYSPTGQRPRHSTTVTLTLACTKPPASA
PAPTPTSTSTSTART
>Francci3_0650 serine/threonine protein kinase
MLTPLMEEDPRQIGPYRLQNRIGAGGMGTVYLGFAPDRRPVAVKVAAEDL
AEDEEFRSRFEREVRAALRVRGTAVAAVLDADTEAAAPWMVTEYVEGTSL
AEAVRARGRLEDHLVRGLAVGLADALVAIHAAGVVHRDLKPSNILLAWDG
PKVIDFGIAHLTDSATLTRTGHVIGTLAWMSPEQMRGEPSDASADVFAWA
SCVTYAATGRHPFHAETPDLLAVRVQRDSPDLYALPGYLLNQVARALSKA
PRERPDAASLLAALVGREVRGVTEADAAAGDVLERTWTGSSPAFGVPAPF
PAPVSSPSPNRSGPRVPGAGRVPARPAPGAAGSAGGWAPDVQGPGPAPGL
WPVGPPPPASAFRPAVPLPPAAPLRPVAPPAVAPPAVAPPAVAPPPSSGA
VSPGGWADQAPAGRRGTTAADVASDPGRSEVTYRSSDPTPPARPPDPSAF
PPAGRPAAPRVGRWTDARRAEASRRRHWNSAAMLSVLLAVLWIFGIGSAA
AVVVGLVARVRIRHRDERGATVAAFGIGLGWLGIGLTMILIVVILKFV
>Francci3_1778 Transposase and inactivated derivatives-like
MITLAVRWYLRFALSYRDVAELLAERGLEIDHVTVYRWVRRFTPLLIDAA
RPCRHTPDDRWFVDETSVKVAGRWTCLYRAVDQVGQVIDVVAGEKRDLAA
ARRFFTRALSHGRRPVEVTTDRAASSPRVLDEQLPAAPHIDDQYANNPIE
ADHGRITARLQPMRSPRGTERVSLVKLAGDLQVDHPVARRPPRMSSITAH
GTVRRPDPRFLLWAARCWPEVTASGRRGCCQGSRARSASATRRRVCWSVR
>Francci3_2572 hypothetical protein
MQLRCQHLIRALRAVLMLAPNIQKWAGQLIELFREANGLVVAARAAGCTR
LDQDVIDGLRARFDRDVEVGRLANMSRPWKDGKNHPGLVLARRLAAKADQ
VWLFLTDFKIPWTNNAAEQSIRLPKRHQAVSGYWHTPTTLAGYLRVRSYL
VSTRDHGIRPIDAIRMLLASRPWLPTPRAALAEPDGLAVAT
>Francci3_4224 hypothetical protein
MSRGEALGGLWACSRGWRQRGPVGTGLPRSHPREAPLPVQSRKPAEVSHP
AGEAQEALAVLATAGGPVAECFEQIPDPRDPRGVRHRLPVVLSLCAAVLC
GESGLAGVAAWVAAEGPGDPTGEGCRWPRPGIAVDGAVKFSALSQIRW
>Francci3_4213 transposase, IS4
MVDSQSVKGAQTVGADSRGYDAGKKVNGRKRFVIVDTLGLLLTVLVVPAN
VQDRDGGRRLLIDHYFTHHRCRHLFADGGFAGQPVAWARTIMRTTVEIVR
KKPGQKTFEALPKRWVVERTLVWLTAHRRLARDYERHPATSASFIHWAMI
RTMVRRLVRGNPVPRRQPRDTTER
>Francci3_3313 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_0149 methylated-DNA--protein-cysteine methyltransferase
MPVRRAPLHAAVTHRHVDTAVGRLFLVRTQRGVVRVAFRDQDADFVLDSV
TRQVGPAAEARCGQLDDVYRQIVEYLDGDRREFAVPLDTSLVGPAQREVL
RALCAVPFGAAVTVGELAVAAGRPAEAAAVGLEVAANPLPILIPCHRVLR
VGPDAAVYPGGCHVRHQLRALEARG
>Francci3_2811 serine/threonine protein kinase
MNLPAPRVTALPRRFPAPGAHRDPLRLPHLVGPPPGSTPPLLDAPSVPTG
LRECAGCGAPVARPGHDEQDEQDESVALEGSCAGCGHRYSFTVKLRPGER
VGRYTVHGVIAHGGLGWVYAATDDNLGGDGVRAWVVLKGLLDAANPEARR
IAEGERRILTTVSHPGIVKILDYVTHHGEDYIVMEYVPGVSLAGLAEVGV
AGPDGRSAPPSAADVIRYLLRVLPALGHLHRLGLVYCDLKPENVMVTAED
VKLIDLGGARRLDDRVSGYLSTPGYRAPELDDDGERPAGIARTAPTVTTD
IFAVARTLARLVLGRFPGFLGAYRHALPPRRAHAPLRDFESLDRLLRRAT
ATDPDQRFQSTTELADELVGVLYEIVARTEGPVPPLASRWFDAVGHPTGE
AGPAGTPAWWEVLPDLRIDQDDPWAPGLTAGSDEEPASLATRLAAIVPRT
TEVRLSLARAQIRAGQLHEAAGTLDDAAVEQPREWRVDWYRGLHALVGGR
PAVAAAAFDRVYSRVPGELAPRLALATALADVATTADTADTADTADTAPE
RDAARHRAAALFDVVSTIDPSTTSAAFGLARCRTDPTDKIDAYLRVPPSS
AAYTASRIRMIGVLVGLAARPDRAASGSAALHRAATILADPRLDLGERRR
AELHRDLFTAALALVMTYPAAYPAADAPRPPTLLSRVMVERDLRFGLAET
YREMARLAGDRASRVRLVDQANRIRPRTLL
>Francci3_4506 hypothetical protein
MGRHRRAHPGTDVAAMLAPLGRRLLVVDPAEVDDDLVRDVTEILTSLCAR
PHASLTLSELTFTCQSCGLVGDRDLNAAVNLSNLVAASTSETVNARGADR
KTPSAGRAAGKREPGTAPAGQTGSASPQGEAA
>Francci3_3104 DNA polymerase III, epsilon subunit
MRPDPAPDRPRPPRQGRLDDLGRPLADVTFVVFDLETTGTSPGRDEITEI
GAVRVRGGRILAEMATLVRPGVGIPPMVSVLTGITDVMVATAPPVTQVLP
TFLEFARGAVLVAHNAPFDLGFLRAAVELCGYPVPVWEYLDTLRIARRVV
TRDESPDCRLTSLASLFRSPVEPRHRALADARATVDVLHGLFERLGNAGV
TTLEDLHDYSSRVSPAQRRKRHLADGLPTGPGVYIFRDADERALYVGTSR
SVRSRVRTYFTASEPRTRMAAMVALAERVDAIGCAHALEAEVRELRLIAE
YKPPYNRRSRFPERSVYLKLTDEPFPRLSRVRAARDDATYLGPFGSVRAA
DAAAEALLAAVPLRQCSGRLSPRVRRSACTLADLGRCGAPCDGREDVASY
GRHVAAARAAITGDPGRVIAASTRRIDRLAAERRYEEAAVQRDRMIAFVR
AAARAQRLSALTGVAELVAAAPTAEAGWDLAVVRHGRLVSAASVPPGVDP
RPWVDAAVASAETVRPRPGPAPCASVEETERIARWLGGPGVRLVRLEGEW
SWPAAGAIRAAAGFGAAPGRSVRAYDGDGWFPSA
>Francci3_2318 RNA-directed DNA polymerase
MREVRHAHSTCEPAEQSRESGCGGGGGKGVAKGNTASETRSGRRAGSGVS
NPLVRVRQAARRDRKMRFTALLHHVDLARLEAAFRAVRPKAAPGVDGVTW
EEYEQDLQRNLVGLHDRIHSGRYRASPSRRVYIAKADGRRRPLGIATLED
KIVQRAVVEVLNAVFEEDFVNFSYGFRPGRSQHMALDALAVGIQRKRVNW
VLDLDIKEFFSSLNHQWLVRFLEYRIADKRLLRLIEKWLRAGVVENGEWA
ETTVGSPQGASASPLLANVYLHHVFDLWAQWWRNHNAHGDVIVVRFADDA
IVGFQAEDDARRFLADLRERFAKFGLELHPDKTRLIEFGWFAAVNRSRRG
EGKPETFSFLGFTHLCATGKKGFFWVRRVTDKRRMAANLREINTEAKRRR
HQPIPDQGQWLRSFDATTQGRSPAR
>Francci3_3734 phage integrase-like SAM-like
MDSWTVLGVDDAPVEPVERYLAYLSDIERSPNTVKAYAHDLKDHWVFLGW
RGLDWREVRLEDIGEFVAWLRLPAAGRDGRVAVLSSVEPAVSASTVNRKL
SALAAFYAYQVRHGVELGELLTTWGSPGRRGGWKPFLHHVGKGRPQPRRV
IALKTAKKLPRVLAAAEVQAVLDSCTRLRDRFLFAVLYDTGMRVGEALGL
RHDDIDAAACEVTVVARDNDNGARSKSRGRRMVPVSAGLVRLYADYLHGE
YGDLDSDYVFVNLFAEPRGQALSYPASYDLVKRLRKRTRIDFDPHWYRHT
YATRLLRDGVPLEVVSTLLGLRS
>Francci3_0117 putative IS630 family transposase
MAGRDLGEGTRLAATTGAWIVFEDEAGQSLRPPKARTWAPRGHTPTVRVS
GKGSGRVSMAALVCYRPGQRPRLFYRVLTHHGRKGERRSFSEDDYATLLV
AAHHQPRAPIILCWDNLNTHRSAAMRRFLTRHAHWLTVIPLPAYAPDLNP
VEGVWAHVKRDLGNHVRVTVDQLTATIKTLLKRVQYRPDLIAGFLGQTEL
IIDPEPP
>Francci3_3265 transposase IS200-like
MSRTVQVGAGGAYDLGYHVVWCPKYRRAVLVGPVRDRLDGLIREKCAEHD
WLIVALEIEPDHVHLFVKAHPKHAPSYIANQLKGFTSHVLRGEFAHLRSR
LPTLWSRSYFVATVGAVSAETVRRYIDTQNERPWRTGVPR
>Francci3_3267 transposase, IS605 OrfB
MLEGGVPRWPGSACPGVEELHRLPVREAEGPAGRFPRFKKRGRARDSFRY
TTGAYGPAGDRQVKLPRVGRVKVHEPMGALTGRLVDGSARLLGATVSRTA
GRWFVAFTVEADRDVPGKPSTRQRRGGPVGVDLGVRHLAVLSTGETVENP
RPLARSLRELRRASRAYARSTPGSAGRRRHAATLGRLHARVAYQRRDGLH
KLTARLAKTHDTIVVEDLHVAGMVRNRRLARAVSDTGMAEVRRQLAYKTL
WYGSTLVVADRWYPSSKTCSDCGWRNPGLTLSERIFACQSCGLVGGRDLN
AAVNLSNLVAASRSETVNARGADRRTPSAGRAAGKREPGTARAGRTGSAS
PQGEAA
>Francci3_2053 phage integrase-like SAM-like
MHVQRVALPGSRVDSWTVLGVDDAPVEPVERYLAYLSDIERSPNTVKAYA
HDLKDYWVFLGWRGLDWREVRLEDIGEFVAWLRLSPAGRDGRVAVLPSVE
PAVSASTVNRKLSALAAFYAYQVRHGVDLGELLTTWGPRSG
>Francci3_2136 serine/threonine protein kinase
MVGHAGLVVAGRYRLQDRLGAGGMGAVWRATDQMLRVDVALKEVSIPVDS
TPGEWTERIARARREGMNAARLRGHPGIVSVHDVVEDGGLPWIVMDLIIP
ARSVADRLRGSGGLRPDETASIGAAVADALAFAHAKGVVHRDIKPGNILL
AESGRALVTDFGIAAHNDDSRMTAAGVVGTIAYVAPERLGGQPADGRSDV
FSLGVTLYQMVEGRLPFQADTTAGLLSAILFEPPRPTVLAGPLRPVLDAM
LEKDPVVRLDAAAAARALASLAAGGPASPVLVPAQVPTPTPAQLPPPAQV
PTPPLPRVPTVSRPATALLPPVGEPEAGGSRWPDSGMPGTPGLSGAPGRP
SGRVWLVAGTLTAVLAILAGVFLARGGVGAHPGDGSLTGTASATVAATGP
EDGVPRVSPSAAASGAGSGAASGAGTSAGTGARTSSATAAGSPRPSQELT
VVPSFSAAAPGVSQYSADYYATLTRSRQDGFLLRVGFDATGRSDLRDPRT
SCVLVSSGSRELRLFPVQADVPVSSSGRYSGTLTFSLALPGSYRFRYSCQ
ADYSAALLGTVTMPSVAVSVYDDNYFVNVLEVRVGAGRTVVFFAAAGAAD
LRVPVTSCLRLESGIRRPVVELKTSRKSPTGATYIGTMRFEGTPPATLVY
SCSDYSPVNL
>Francci3_2150 putative IS630 family transposase
MSARTPSASGGAGSPARAWRGWLTGAARAGHGPRVEFEYERGGTVAYFAA
YDVQRAQVIGRIEDTTGIVPFGRLAAQVMNSEPYASARRVFWIVDNGSSH
RGQASIDRMRAAHPTATLVHLPVHASWLNQVEVYFSILQRKAIERGDLAD
FADLDALAARVMGFQTLYNQTATRFDWKYTRADLLKTASRLNLAA
>Francci3_0687 DNA-3-methyladenine glycosylase I
MSGGPLSGGPLSGGPPGAEPDLLGSGISGSGTSGPVRPASDVPVRPASDV
PVRPASDVPVRPASDVPVRPASDVVVGADGCPRCPWGLSTPEYVAYHDYE
WGRPVRDTVGLFERLTLEAFQSGLSWLTILRKRSAFRAAFAGFDPAAVAA
FGSADVDRLLADAGIVRNRRKIEATIVNARAVLTLDHPLEELIWSFQPEP
TGRAAPATLADVPAKTPESVALAKGLRAAGFVFVGPTTAYALMQACGLVD
DHLSGCWVAGGVG
>Francci3_0130 transposase IS116/IS110/IS902
MDVLIDRVAGLDVHRDTVVAAVRVGGRGGGRRGEVRTFATTGAGLTRLAG
WLSEQRVSLVGMESTPDYWRPFYYLLEARGLTVWLVNARDVKNVPGRPKT
DKLDAIWLAKLNERGMLRPSFVPPPEIREIRNLTRLRLDLTAECTRHRLR
VEKLLEDALIKLSTVLSDIFGVSGRAMLDALVAGERDPKKLAALARGRVK
ATQAELATALTGQFTEHHGYLLSVLLAQIDGLDRRIAELTERIDTAIAAL
PAPAHAAADAARGGETGPDGDATTGTGQGGGGAAARPGLDILDRLDEIPG
IARHAAQVIIAEIGTDMAQFPTSGHLNSWAKLTPQTIQSGAKSRTGRTGK
GNPYLRGALGEAAMAAAKTKTFLGSRYRRLVKRRGHLKALVAVARSILTI
VWHLLNDPTARFHDLGVDYHASLQSRERRKRNALRELKSLNLSAQEITAL
LAAA
>Francci3_4210 Integrase
MTVEDRELISRELSRNRSARFIAKALGRHHSTISREIERNGGESAYRAVD
AQARCDAMRKRPKERKLVASAALHDAVNAALVEKWSPKQISERLEKDFPD
DESMRVSHETIYECLYLQARGELRTQLTIALRKGRARRVNRSRTAVARGR
IVDIVNISERPKEAEDRAVPGFWEGDLILGKGNKSQIATLVERTTRFVML
VRIPYDRNAEKVAYLLARKMETLPDFMKKSVTWDQGKEMARHAKFTVATG
MPVYFCDPHSPWQRGSNENTNGLLRQYFPKGTDLSLHTQAELDKLAEQLN
GRPRQTLGWAKPVEVFNDLLANHASL
>Francci3_2707 transposase, IS4
MAERKPYPSDLSDEAWDLIRPVLTAWKARHPSASGHEGGYDMREIVNAVL
YQARTGCQWRYLPHDLPPTSAVYYYFGQWRDDGTTETIHDLLRWLVREHH
RRKADPSAVVLDSQTVRSSTNAPKGTTGLDPGKKSPGRKRGIAVDTLGLL
IAVVVVAASVHDNTIGTALLDRVAAAAPPVRKAWVDAGFKTTVVEHGAGL
GIDVEVVGREPGARGFTPLPKRWRVEQTLGTLMLHRRLARDYEAKPASAV
AMIHWSMVEVMARRLTGAATPTWRDPPV
>Francci3_2893 hypothetical protein
MPFAADRWMTEYPAWLPAADGVRRVFALPGGPVLAEVHGGKLTLAALGDE
AAEPLADVFGLPEGAASEVPELAKELAGLGLVGRFRNPSLWEALATAILR
QVIRAGQSKKLYRALCAAHGEQVALPDGGAFGLFPSPEKILELDDEQYGE
LGLAFKRPALRAAATAYLAHGENWNRLPLAALVDELQSVQRIGPWTAGAA
VADYTNTWDLYPYADLAVRTWAGRAAPAHHWFNNERAFGAQWRHLAGEHL
STLTILTLAWGSYRGDIG
>Francci3_4024 hypothetical protein
MCVCGGGGLGGLAALAAGPGRGRGRRAGRRDRTADRRGEGRRVRRDPAEG
RAEGREALRAVRVDAAVHAVHAGRRDKASFWVFLLGRMLGVVVHDRYALY
DAEEFVGFLHQLCVSHLLRDLQDAVETYPEAVWPVQLQQALRGLVHQANL
ARAAVLAEVPAALRDPLVAEYRGAVRVGLRDVPAAEKGAKQPVGRCLLEC
LRDRQDDMLRFVFDLDVWPTNNQSEGDLRPFKTQQKISGRLTSAAVAACR
LQIASYLSTTRKHSVSALHALRLAFRGTPWMPPPAVAPT
>Francci3_2373 Recombinase
MGMGETGTGPGQARIRRQARYGRKRASREDWAGQSAAIYCRISHVADEDQ
TGVDRQERICREVVRRLGLRVAHVFVDNNRSAWKRDRKRKGWDRLLEVAR
AGEVQHVVAYHPDRLMRQPKDLEELLAISDDRDITLHGQANQRDLSDPDD
RFFLRIEVAHACRSSDDTSRRMKDAMVDRANDGKPHPGKRRFGYTPDGMS
IVEAEAEVARDIAARYLDGATSIQIAAVLNEQGKVTASGRPWDEFSVLAV
LDSHHAAGIRVFRGEEIGQGIWPAIFDPGTWAEIRDRRSYRAAAHAATRT
PARFFLLRGIVTCKRCGTRMAGTGGHVSPGYLCSRKYRTDEQKCSRRINA
PALEAFVTDAAVDLLTRLDPSGQEAAATLTDADQAAIEEDNAELAELKAM
WNAREIKSREYREMRRTVEERIKKVQRKTVVRPAVEVLAGLTGPHARAAW
DALVEAEEYERMNAVLRFLFAAVVIDESRVPRGQFDYSRIHIDQNPL
>Francci3_0111 putative transposase, IS891/IS1136/IS1341
MRRPFPQEARGAGQTYGWQSSCSRGRPPGVRHGSPACRPGHRPCHPPSSS
GRCSAAGREAGPRPRFGEHLPLKHADGPGGECRPVALRTRIRKVGDVELA
WSRGLPSVPSSATVIREADGRYYVSFVVDVTDEPYPVVPAEVGVDLGLDR
LVTMSTGEIVANPRPPRSKRRHLARAQRSLARARKGSADRRKAVRRVAVL
HRKVRETRRDHHHKLAARLVRDNQVVYVEEMAATGQTPVPWDKVVPWDKV
KADLGLV
>Francci3_2712 transposase, IS4
MPVGALSRDAAGVVGAGAVSREGIWQRLHAMGEHRGRRGRVYPLAVLAAV
WLCALTAAGHDRVTAVIEWLAGTTEEERRRLRLPYDPFDGYRLPSESTIR
RFLNDTDDARLARALLDPPLADPAPPKPASPEAAGEAVRAVYALDGKTSR
GAKRADGRQVQLVGVADQATGRLVNQHEVDSKSNETKAFRPVLEPLDLAC
DLLTFDALHTVRDHLDWLVTDKKAHYLAVVKGNQPTLRAFLAALPWADVP
VADTTHDHGHGRDETRTLKAATVEHAEFPHARQAVRIQRWRREKGRKPSR
ETVYGITDLAFEQASAGFLADAARGQWIIENRQHHVRDVTFGEDASRSRT
RRGPANLAIFRATVAHAVRAAGHRYVPAGRRACKTATAALDLHGFP
>Francci3_2424 serine/threonine protein kinase
MAVGTVRPAADATGRPLPGLTDDDLSVGDGRGPDPAAFGGRSPSPTPHLN
PDLNHDLNHDLNRGACVNAVIPCPEPDCGGVVEDGSRTAPSRLGAGLVEV
PEPDVPEPDVPDPASMLLADPQIPERRRACAGCGAPVGRARGRRPARPEG
FCPGCGQPFSFRPTLHAGDRVGPYEIAGALAHGGQGWIYLARDPSVAEGS
WVVLKGLLDSGDREAQAAAIAERRFLASVDHPAIVRIFTFVEHAGTSYIV
MEYIGGTSLREVLCRRRADAGRPDPLPVTQAVAYLLAALPAFAYLHRNGL
VFGDFTPDNVMLGRETPRLIDLDAIRRIDDGAVAGAGCGTLGYQAPEVPT
TGPSVASDLFAVGRTLATLILDFRGNTSTYLHTMPPAADHPVLARHESLY
RFLLKATAPDPDRRFTGAEEMHDELLGVLREIVAAERGTPAPAPAPSRRF
TGDLHPTGEGGLTGEGGLTGEGGLTGEGGLTGEGGLTALPPWSVLPRLRA
DPDDPAADPLTALPDLAPEQLAELLGAMGTTSVGARLHLADLRMRLGRTA
AAREMLAEIEAEDPFEWRVDWQRGLLALADGDTAAARGAFDRVYDEVPGE
LAPKLALARTAEATGDLPRAQQLYDLVSRTDDAFTGAAFGLARVRIAAGN
RDGAVAAYRRVPPASAAHVDAQIRLARVLGTVTVAGVPNRAGIMAASDVL
AGLDLDSGRRAALTRDLLTTALDLVAAGTLPVDPGVTVAGAALREADLRF
GLERAYRELARLAATTDERYALVDLANRVRPRTLV
>Francci3_4534 serine/threonine protein kinase
MVDGSNRPQAGDVDNDVNPNAPGVRGATPPSGTAILAESVLDRRYRLLSA
LTTRGPVTLWRGDDKVLARPVAVRIVEHGPASAGGGGPVTDPAQEQAARR
LLTAAINSGRLVHPGAASTYDATTTTAESRRISYVVSEWVDGKTLRQLST
DGPLRPEQAGAVVLAAARVIAAAHERGIHHGDLNPGDVIVSSHGTVKVID
LEIGGVLAELDGSLPATELDGHGRQPDQSGIGGDDGSGPGTADRAGGGSD
ATAVNPDLIAAADVRALGGLLYAGLTGYWPLGGDHGLPTAPTSGGRLRTP
RQVASSVPRDLDAITMATLGDDRVGAPITTAADLVEELESINPVDAVLDT
GLMSLSDAPPSTEAMDVDAFASSDYLAPGNYPAQGRYADTAGYPGPAQAR
DARYARYDTRVEGPRGSRAGGTGYGRGGYDDRGGYDDRGGYDDRGGYDDR
GYPAGGGGYPAGSDRYPPGSGPKRRGGPSLGRAVPWIALVVVVAIAVVAV
IALRDDGGKNNTPKPGTSTTVPATPTGTILKPSSTDSFDPLAPAGEPKTE
KPADVGKAFDGNTSTEWTTSNYTSATFGNLKSGVGLRMAFPQKILPTSVT
VTVGSLGPVSFELRSGDRLSDSLDGFDRVVAQKSGASGTVVLPMSSDVNP
SQYWVLWLTSLPSNAGQYRGSIAEITFRS
>Francci3_0099 Excisionase/Xis, DNA-binding
MTTRTPDAEPLLTPAEVATMFRVDPKTVTRWAKAGKLTSIRTLGGHRRYR
EAEVRALLKGVPSIGSDI
>Francci3_1877 transposase, IS4
MAASAVRVRGVGTHGAPSVPGVDPGGCLASTAPQDLGPARCGRTGRLVIG
DRGRGQPASEKGGSLTGPNPVDRGKPGSKIHQPERRRFLRRRGIAVRIAR
RGVDSTERLGRHRWKVERTLAWLGGYRRLSPRYERNGYNFLGFLCLAAAI
TCWKKLPRST
>Francci3_0388 hypothetical protein
MGLDWDPSGPVFTTLDGDPLHPAAVSAEFQDQIARAGVPPIRLHDARHVA
ATLMHGGGADLKVIQETLGHSSHEVTVNTYTSVLEELAREAAEGAARLVP
RTPPTRAAHTPRTPDRSQEDRIGEKPQVKQGALGNAPQTPPPARLRITHA
IILSATRAQWC
>Francci3_2526 HhH-GPD
MPTTTPPVGTTPTTQPDQRPGGGCDAVGYGVGVGLHLSQIAEADELLTSD
PLALLIGMVLDQQIPLERAFAAPHELTRRLGQPLDAEELAKYDPDALGAI
FSQVPALHRFPGSMAKRVQAMCQLIVDTYDGDAARVWTTAADGKELLRRV
AALPGFGQQKAKIFVALLGKQLGVTPPGWREASAPFGDEGTFRSIADITD
VATRDQVREYKKAMKAAAKAQAG
>Francci3_3385 transposase, IS4
MIFRGARWGCTRRIMAGWGWLGADGGRMAHAHRHRYPSDLSDSQWALVGP
LLPPPAAVCRPEKHDRRDLVDAILYVVRSGCAWRALPSDYPPWQTVYYYF
ALWHDLGVTERVHDVLREQARRAEGRDVEPSAGIIDSQSVKGADTVPASS
RGYDAGKKVNGRKRFIAVDTMGLLLAVLVVPASTHDTASGRQLLLDSFFA
GRRLRLVFADAGFAGVFMDWAARILTLTLQVVRKPAGQKGFSVLPRRWVV
ERTWSWITGYRRHARDYERRPDHAESLIRWAMIATMVRRIDRRTPAQRPG
PRPLQRII
>Francci3_1369 crossover junction endodeoxyribonuclease RuvC
MRVLGVDPGLTRCGLGVVDGGLGIRAHLVEVGVVRTPATAEVAERLCAVS
DGIDAWLDRTRPEAVAVEKVFSQANMRTVMGTAQAGAVAIVLAARRGLPV
GLYTPSEVKAAVTGSGRADKAQVSFMITRLLGLTEAPRPADAADALALAL
CHLWRGPALARFRSAAPGGPTR
>Francci3_2684 transposase IS200-like
MTAGAGGVYDLGYHVVWCPKYRRPVLTGGVRERLDALLREKCAEHDWPVI
ALEVEPDHVHLFVKAHPKHSPSYVANQLKGFTSHVLRAEFPHLRSRLPTL
WSRSYFVATVGAVSAGTVTRYIETQNERPWRKAAAREADVQVPAAPDGAS
GRVVDRDAERPPRALQRGVAGASGRLRAPVEDQGRVRGPSRPS
>Francci3_3410 serine/threonine protein kinase
MSTVASDLEIVERLRYGASFASVPRTWGRGRYRCVRHLAEGGQGYVELAR
DEWSNALVVVKGAWWGGRDHDINPEYARTQYEKRSIDVEDAVAVQAALGE
ITHGVPALVDVVYGPSPTRHDHNALVTGGNEREVERYNREAFIVMQFIGD
IGQMVPTTLDSRVTESGPLSARQVVELADQISATLEAMHTIRPQRLYQHE
ERIRGYWVHGDVKPENILVAGDPPRYSLVDLSTAAIVEPSAKVMPTTATP
GYAPPGAEPLSPQYDLHCLAATLLFALTGDRPDDWLGGATEARSAADAAG
SRADAEARDEKLRQLRGELAARRVHPMLIRLITDCLPADPRFRLGTATVL
RAEIAAVRTALVAREVLSDEEPQP
>Francci3_3619 histone-like DNA-binding protein
MNKSQLVDRIAQQIEGGRSNATKAVDAVFDAIQEAVASGEKVTITGFGVF
ERVERAARTGRNPATGEPVHVAASVVPKFRPGSEFKSSVSASND
>Francci3_3799 UvrD/REP helicase
MNPRVEQPGAGVHVTIPRQPFLPGFDGDVDGDPAGGVAGPRLDPDDLRDL
LEVPYTDEQIAAATAPLEPGVIIAGAGSGKTSVMAARVVWLVATGQVRPD
QVLGLTFTTKAAAELSGRVRLALRKASAGAGPGGPAADGEVDGEPTVATY
HAFAGRLVVDNALRLGLEPDLQLISGAARYQLAARVARSHGGKVEALTRS
LGALVGELVALDAEMSEHLVDPADLVAFDGALLAEIDAALRRAEQRRGTR
GVRRELRRCAAAARGRRELAALVAEYRAARRERDVLDFGDQVTCAARLAE
TMPEVGRAERARAAVVLLDEYQDTSVAQRRMLTGLFGGGHPVTAVGDPCQ
AIYGWRGASVANLDHFPTHFPGADGTPAAVYELSVNQRSGGRLLRLANTV
AAQLRARHRVVELRPRPDVADQGEAVVALHVSWADEVAWIAGQLRRVVDA
GTAPGDIAVLVRARGDIPALFTAMQAAGLPVEVVGLTGLLIVPEVAEIVA
MLEVLDSPTANAALVRLLTGPRVRLGPRDLAALGRHAREAARVPDPVGDP
DLPGRSDPLAEAVADVDPADVVALSDVLDDPGPQMSAPGRARVRRLAAEI
AALRGHVGEGLLDLLHRVVATIGLDVELTATEVAVRARRQENVAAFLDVA
AGFTDPDGTNSLPAFLGFLRAAREHERGLDVAGPSGADAIALMTMHRSKG
LEWEVVAVPNLTSKVFPDLTVRDQWTTSPGVLPIPLRGDADDLPAFTVCA
EKAALDAFRADARQYAEREERRLAYVAVTRAKSLLFASGHWWGPTQKTPR
GASVFLDELAEHARTGGGLVDVWAPEPAERTNPALATPERFAWPIPYEPE
PYARRLAAAEGVMARLAALSVPGASAEPAGEPFVDGPAGMTAAERVLLAE
LDREARLLLAEERAARLARTDVALPASLTASQIVRLRADPEAFARELVRP
LPRRPVAAARRGTRFHAWVEEIFDYRALIDTEDLPGAADAELTDDDLRSL
QQAFLRTAYGARRPFAIEAPFELRLAGRIVRGRIDAVYDLGGGRWEVVDW
KTGRSDADDLQLGIYRLAWARLRGVDPSAVDAAFLYVRTGAVVRPPTLSE
EELADLLASPSTGPAAGQARGSARSTDSAR
>Francci3_4220 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_2605 DSH-like
MSSTPEILVEFAAGLPFGLDPFQFEAVAALAAGEGVLVAAPTGAGKTVVG
EFAAHLALRTGTRCFYTTPIKALSNQKYADLVSRYGAVSVGLLTGDTSRN
GDAPIVVMTTEVLRNMLYAGPVDNGRLDDLGYVVMDEVHYLADRQRGAVW
EEVIIHLPAQVRLVSLSATVSNAEEFAEWLVTVRGHTRVIVSDHRPVPLW
QHVLADRTLYDLFLEDTAGTPPPGGPEALRATLDGDLVRSRWNVGVEAGF
EVAGTRRATIGDRDRGGDRDRGGDRDRGGDRDRGRGRGGNRNGERGRATQ
GGRGGDRGRNGGAGQAPVAAIPAARGGPTAVGGRVVNPDLLRLAREESRA
LSGGSPAVGRGRPAPGARRRTWVPGRPEVVERLDRDGLLPAIVFVFSRAG
CDAAVTSCVRAGLRLVGPAEQQRIRTLVRERTAGIPETDLAVLGYWAWLE
GLERGIASHHAGMLPTFKEIVEELFVQGLVRVVFATETLALGINMPARTV
VLERLTKFNGESRVDITPGEYTQLTGRAGRRGIDVEGHAVVLWQPGLDPL
ALAGLASTRTYPLRSSFRPSYNMAVNLVGRLGRERARTVLESSFAQFQAD
RAVVGLARAVQRNTEAIDAKREALSCDKGDIGEYDRLRREIAEREAALSR
EGSARRRAESAAALARLRTGDIVRVPAGRRSGLAVVLDADAAANAADGPR
PVVLTADRQVRRLSLTDFPIAVEPLGRVRVPRSFNPRSPQSRRDLASSLR
AADVNPDAPPGRRARVRSAAADDAELARLRRALRAHPVHDCPRREEHLRS
AEQVNRLVKETAAISRKVEGRTNTVAKTFDRVRAALEDLGYLDGDRVTAA
GRVLARIYSEQDLLVAECLRAGIWDDLTPPALAAAVSTLVFEPRGDDAGV
PALPGGAALRDCLAEMVRLSERLAEAEQAHRLAFLRPPELGFVAVAHDWA
AGRTLERVLTDSSVELTAGDFVRWMRQLIDILDQIAQVAPMVQADPGTPD
GARVRRTARAAMDAVRRGVVAYAMSV
>Francci3_0576 Uracil-DNA glycosylase superfamily
MSDAAGARFAAAPTRRRPSRDALLAAYGTTVPDLVAPATVVLLCGINPSL
ESGATGFHFGTPSNRLWPVLHLAGFTPRRLHPSETDELVRRGISITNLVH
RSTARADEISDDEVRAGVPRLTELVERVAPRWVAFLGLAAYRIGFGRRGA
KVGRQAETLGSAGVWLLPNPSGLNAHYQLPDLVRVYGELRETAYS
>Francci3_3919 Uracil-DNA glycosylase superfamily
MRSGGPVRPGSGWPGDLAAADTPVAATPGEVRALAADATGTRELDARMSV
CRACPRLVTWREEVAAVRRAAFAEQAYWGRPVPSFGPGDARILVIGLAPA
AHGGNRTGRIFTGDRSGDWLFASLHRVGLAALPTSVSAGDGQRLTATRIV
AVVRCAPPANKPTTTERDTCRPWLVRDLELVRSTLRVIVVLGGFAWSALW
PALVQAGFPVPPRRVSFGHGVRVELPVSGVSVVGCYHPSQQNTFTGRVTE
VMLDTIFGTAAELAEVVERAG
>Francci3_0387 hypothetical protein
MKVKLDEGLPVSLAERLAKHGIDADTVLAENLSGRSDPEVLAAAVAENRI
VFTLDRGCGNIRAYPPGSLHYHLVTYRGPRREETCALNWPDVDLDQKKIL
IQWQLVVVGWKVQRTRPKTDAGVRELLLDDLTVEEF
>Francci3_4283 NUDIX hydrolase
MVDHKAALIARLRWSPADGGRRDGVTARHPSAPPAAVRDASTVVLLRDAP
GNQGIEAYLLRRAATMAFAGGMYAFPGGRVDPADMGPDVPWAGPSVAEVM
HALDADPALARALVCAAVRETFEECGILLAGAVAGEDGRLGGGPEALSDQ
VRAAERLALERHELGLSALLRKYSLVLRADLLAPWARWVTPEIEPQRYDT
RFFVAALPTGQHPGQPSSEADRMQWIRPADALERHRAGTMDMLPPTAFTL
AELSEYADVAAVLAAAHARDLSPIMPRILVSGGEARLLLPHDEAYDDPGN
HDDLRDAGPADPAWQ
>Francci3_3191 primosomal protein N'
MTEHPGTLPGLSPSEVGRAEVGRAEVGRAAAGRAAAGRAAAGRAAAGPGG
ARSGVSRSAGSGSPRPLAARLPVARVAVDTALAHLDRPFDYLVPADLADL
AVPGSRVRVRFAGRLVDGFVLERRESSDHSGQLAPLARSISPEPVLSEPV
AKLARAVADRYAGTLADVLRLAVPPRHGRAEAASPRPEVVSPGPAAVSPG
LEAVPPEPGEWGRYPTGPSLLTALAAGRSPRAVWWAPPGPAWPEMIAAAA
AATAVSGRGVLVVVPDHRDVDRVEAAMVAAVGRETTVALRADLGPARRYR
AFLAVSRGQARAVVGTRAAMFAPVAELGLVVVWDDGDDLHAEPLAPYPHA
RDVAVLRSRLQGCALVIGGFCPSTDAEALVRTGWAHPVVPPREQVRALAP
RVEATGSDLEHARDPAARAARLPSLAVRTARDALAAGLPVLVQVPRRGYQ
PSLACADCRRPARCPHCQGPLGRAAGGGALSCRWCGRPVAAAVWHCPSCS
GPRLRASVTGDRRTAEELGRAFPGVPVRTSGRGGVLATVPAEPALVISTP
GAEPVADGGYGAVLLLDGWALLGRPDLRAGEETLRRWTAAAALARPAARG
GRVIVAADAGLGPVQALVRWDPGTFVAREADERAELGFPPATRMASVAGT
AAAVAELLAAVELPDGADVLGPVPLPPAPSARAVPDGANQGDSSAVGAGR
RGPDGATSGPEEIERVLIRVRRERGAALAASLRSARGVLATRRTTGRLRI
QLDPLILG
>Francci3_2344 phage integrase
MTTSATAPPTGRGRRGGERPPDRYAELFAAYRAELDHVRLAEHSRRAYAS
RVAGFLRWLDTDGAAADPTGADPLSDAHARDFAVRDYRAHLLTVAKKRPA
TINAHLTALDHFFTWRHLGRAVVDRADLPEQAPRALTDDEARRVLRAAER
LPSLRDRTIIELLYNTGVRCAELVALDLDDVPVTARTGAVHVRAGKGRDG
GKPRTIPANPRARRALTDWKPARAAWPNAEADPALFLNRYGRRLSTRTVD
QVVADLGRSVGIDDLTPHVLRHTFATAMLRRGADLVLVAELLGHARTDTT
RVYTKPTEADRLRAVELLPGDS
>Francci3_2057 transposase IS66
MSVLSVTDDVTEVAYWRGRAERAEECAEKAEARVGQLQLRVEELSEQVAV
LSRMLFGRSSEKTGPSSAVDEKPEDRQDSGGGDAGRPARQRGQRPGSRGH
GRRDYSHLQTREEIHDVPEVDRACPGCGVAFTPLGTDDSEQVDWQVVITR
IVHRRRRYRRCCTCPGPRTVTAPVPPKPIPKGRFTAGFLARLLYEKYVLG
LPLHRIARALAAAGLGVAEGTLCGALKDVHGLLGGLDEQIVARNAAAGHV
HADETTWRVFERVEGKDGTRWWLWVFVAADTVVFRMDPTRSAAPVEKHFG
IDRAAGALSDGCRLVVSSDFYTVYQSLGRVDGVDPLWCWAHIRRYFIRAG
DAHPQLRYWADQWVARIGMLYLAHRALAAEQPTTGGYREAAGAFEAALRA
IDTARRAEAAIHSLHPAAKKVLATLDREWDGLARHQDFPDLDLDNNAAER
ALRTPVVGRKNYYGAHAEWAAHLAARVWTIVATAERNGREPLAFLTGYLN
ACATAGGKAPAGPALEPFLTWQTTTQTGSPPSTDPPQDGPPDGPEP
>Francci3_2061 putative IS630 family transposase
MLDAARGYSNARIARRLCVTEDTVRTWRGRFARRREAGLVDLPRSGRPRR
ISEAERAEVVALACQLPAETQVPLARWSCPELAAELLSRGLVDAISASSV
RRILAEHPIKPWRYQSWIFARGPGFAAKAKVILDLYEGFYQEEPLGPEDR
IVSIDAKPSIQARARIHPTTPPAPGRIIRVEHEYERHGALALLPALDVQT
GRIAAVLTPPTTGIAPFMELMGQVMAQDRYRTAKRVFVIVDNGSDHRGQA
SINRLRAAHPNRILIHTPTHASWLNQVEIFFSLVQRKVVSPCDFASLDVL
ADTLTAFVDRYNVTATPFKWKYTAADLERHLARLDDDTAPAVAGSVARLP
VPPPDTNEPFQSLFV
>Francci3_1710 DNA polymerase III, beta chain
MTVTAPPATIAGATAVVPYRDLLAALTTVGTVLDRKSRPPWNAALITADL
DGRLTVTGASPAATVSVRLPGAARSAGQFLVDGWALTQMCKALVRGEHRR
DTDELPVLLDGSFPPAPTVTIGDYTVPLTGLPVEDHPGACRPAPTIATVD
RAAWRAAVDRVLPAVGRDDTLRVLTGLYVSFAPGLATVAGTDRYRLAVDV
LPATVTGPAVQELLLPAKILAACAETWTGPSVTIGRRRAESVHDVDRVTF
TCGQTTVSLVETGGAYPPWRRLLGSLDPQHTATFDRATVAAHVARVLAIL
TAHPTTRRPIMTMTLTLTPDGLRVAPLLPEHGARVSAPTLPATTSLTDGT
VRWAFNAAYLRAALAALPGDTVLFSGQADVAKPVLLTSPEQGASIPPYRH
LLMPICID
>Francci3_2194 transposase
MDGAGGVEGVLGAPVGETPVISRTGDRKSVLMLSAVSAQGKLHFMLQQGG
SVDSKVFIEFCRKLLQDDAGKVFLIVDNVSYHDSKAVRRFVAGTDGRLRI
FFLSPYASDTNPDEWVWNNVKTAQIGRKMMGVDPSE
>Francci3_3880 exodeoxyribonuclease VII, large subunit
MGRGCGRGYGARVALTSSPEQPLPVRSVSRALGDWINRLGRVWVEGQVTG
IVRRPGMGTVFLTLRDPVADVSLRVTCPRAVCDAVGPALVDGARVVVWGK
PSFHPGRGTLALAVVEIRPLGAGALLARLEQLKATLAAEGLFAASRKRRP
PFLPAAVGLITGRASAAERDVVENARRRWPAVRIVLREVAVQGPLAVAQV
IDALRELDARAEVEVIVIARGGGSLEDLLPFSDEALLRAVVAACTPVVSA
IGHEQDTPLLDFVADVRASTPTDAAKRVVPDVGEQAIAVAQLRARARRVV
EHRLDREERWLADTRGRPVLATPTRDVDRRADDVSALLARSRRCVRHVID
VTGNDLAATRARMRALSPAATLDRGYAVVQRTVDGAVVRGPAEVAAGDGL
AIRVAGGTLRATVVDG
>Francci3_2062 response regulator receiver protein
MLTYPGVVDLPESTLTFLAGLLAEDRAQRRTWRKLPPPEQALLVLVHLRK
GERYEQLAEGFQVSVGTVHNYIREAVRLLATHGRTLLAAVWIFAWTQSNF
LILDGTVVRTNRVRAHNKLYYSGKHKYHGINLQGLTDPYGRLIWISEGLP
GSVHDLTAARMHDILDLIDRSELYLYADKGYVGGEGDRLLVPIKKPKNND
LPDRDKEANRTHATTRSQGERGFAVLKNWHIFDRFRGCPRRVGTFAQAAL
VLATEGL
>Francci3_2025 transposase, IS4
MPPVPRMPVRACLAWPLRVHIDHGCVGEQPRRAAGPGRVVGAGGTAAAPV
RGPLAGRWDCPDRGSGGVYRGGLRADLGVCLAASAVRVRGVGTHGAPSVP
GVDPGGCLASTAPQDLGSARCGRTGRLVIGDRGRGQPASEKGGSLTGPNP
VDRGKPGSKIHVLTDAGGLPLVVAVSPANPHDSGAFVPLVASIPAIRSRR
GPRRRHPAKLRADKAYDQPERRRFLRRRGIAVRIARRGVDSTERLGRHRW
KVERTLAWLGGYRRLTIRYERNGYNFLGFLCLAAAITCWKKLPHST
>Francci3_1960 site-specific integrase-resolvase-like
MHDSALNNVLWQAIVGHALLRGTQVATIVVEYRDRFGRFGVEYVDADLAA
SGRRLLVVDPAGVDEDPAGVDDDLVGDVTEILTSLCFRLYGREAAADRVR
RAVAAAVEEAREGPPGVSGSLLTRNDAQLADLRRYASAGRFVYKRNL
>Francci3_1320 nucleic acid binding, OB-fold, tRNA/helicase-type
MGESRGWLGRKLHRLTAGTSELDAEDLQAASAAAGAAPMSGCRDRDEICV
AGTIQAVTVRARSGAPALEVDIYDGSGTVTLVFLGRRDIPGLRAGASVKA
SGRVTVQENRPIIFNPRYELLPVPSASA
>Francci3_1967 ISRSO5-transposase protein
MPWGRGWPGRTGRVSRRATAITLTCDVRAVLAGRARSLTVPRRDWLRAAI
VLAAADGASNTTIATDLGVCEDTVGKWRGRFAREGLAGLVDRPRSGRPAR
FTAVQVAEVKALACTRPADVGAPLERWSNAELARHAAREGIVAGVSASSV
GRWLARDAIRPWQHRSWIFPRDPAFVAKASRVLDLYARIWDGAPLGENDY
VISADEKSQLQALRRCHPTAPAQARHGPRVEFEYQRGGTLAYFAAYDVHH
AHVIGRIEPTTGIAPFHRLVDQVMTAEPYASARRVFWVVDNGSSHNGQTS
IRRMSAAHPTATLVHLPVHASWLNQVEIYFSILQRKAIERGDFADLDALG
DRVMGFQDLYNQTAAPFDWKYTRADLLKTASGLALAA
>Francci3_1757 hypothetical protein
MLVAFRAPYAGEFLTAATPAARARETSQQSAPPSPTPYAAPVTRSSRPAG
APTPTDALTHHGFHCGKPGQIANTPKPWARDGTPAVIAHLARVLKPPTIG
QFWQLVDQARDESWSHEVYLAAVLQRQVADRAIPRTPSCGSAPPLSPGQD
VGGLQPRPSAPAGREPARKRSSP
>Francci3_3965 TatD-related deoxyribonuclease
MARNSGRSGDPPPPPDPLPVAVVDSHCHLDLMGTEVPAAIAAARAVGVTR
AVTVGIDLPTSRWQAEVAAAHPEIYAAVAIHPNEAARGVTEETFAAIAEL
ARADRVRAVGETGLDYFRTPPEAHAVQQESFRRHIAIAKETGRALMIHDR
DAHDDTLRILAEEGAPEKVVFHAFSGDTAMAKICADAGYVMSFAGNVTFS
NAANLREAAAVAPADLILVETDAPFLTPTPWRGRPGGPYLIPLTLRVLAE
TRGVGVAELGTHIAANAERVFGPW
>Francci3_4435 serine/threonine protein kinase
MNNSPGNPISPSQPFAGAASPNTVLDNRYRLNGRIAAGGMGEVWRGLDLT
LGRPVAVKLLRPEYASDESFLVRFRGEARHAARLSHPGVASVYDYGEVAT
ADDYPTAYLVMELVEGEPLSAALHREKRLSPERTLDILGQAADALQAAHA
LGVVHRDVKPGNLLLRPDGAVKVTDFGIARAVDAAPLTATGIMMGTAYYV
SPEQASGRPVTPASDVYSLGVVAYECLAGRRPFDDRNPIVVVMAHQQDTP
PPLPTDIPYQVRALVDSAMAKDPARRPSSAGAFARSAAGIRRSLWSPEPA
RWPGAPDATARHPGPGAVLPPAGAAARPPRSTSRTRPPSWVAAPPPSGHT
SGPGRTGASPDNAQAVPDGPEAMLGGPDVPATALHSPPFGGPDTAYGGPE
SAYGGSAYGTSDRGGPRDPGTAYHPGPPPWARTPGGPPGPSGPDSPDDLR
PGWEGRAARRHRPKAASRPALPLPLLIILLILTVVIVAFATNRLFSGSNS
ASNSATTPPRVVYPTAIGDEIVSGNAGSAISGAGGATAGEYS
>Francci3_0407 phage integrase
MARRQPPGRGLMAEGKGSRRGNGEGSIYRYRNGWAGQISMPDGTRPTFYG
KTKDDVRGKIIAALRAVQDGVPLVTTRMTVAEYLDQWVTIVLPSRVVAGT
LAESTAESYADQVRLHITPILGRHELRKLSTAHVRHWMTRKLTEPSSAGL
AQCAAWEAEAARKAAERAKEKKTRPARRATAQKKAPAPSERKPVKPLSAR
SVRYLLVILTAALNDAVREELLTRNVAALVQPPRKADAPIRALTEEEARR
VVAVALVDRMSALWLVLLALGLRKGEALALRWDALDLDAGTVAVVRKQRR
RKAGIDPETGRQRWELVEEEALKTKGSKAVLALPDMVVTMLREHRWRQDD
AKADLSAKADQAEKDLAALAAQNLPEGVSGTPADLKVIRWEDPGLVFTTA
LGRKVDPRNVNRWWDAVCERAEVRHTRVHDLRHTAATMLFRAGVDLNEIR
ALLRHTRLGTTADIYVDVLADVRRGTARSMDDILTRLRNPTTATESEADE
GAA
>Francci3_3744 UvrD/REP helicase
MRSRFYADVHIHSRYSRACSRDCDLEHLAWWAARKGIAVVGTGDFTHPAW
SQEIATKLVPAEPGLFRLRSDLEHEVLRTLPASCRTATRFMISSEISTIY
KRGDRTRKVHHLLYAPDREAAGRITAALARIGNLAADGRPILGLDSRDLL
EITLGGGAGCYLVPAHVWTPWFAVLGSKSGFDAVEDCYGDLADEVFALET
GLSADPEMFWRISGLDRYRLVSNSDAHSPPMLGREATAFTCDLDYFSIEA
ALRGGDGFAGTVEFFPEEGKYHLDGHRKCGVVLTPDQTREVGGRCPTCGG
GLTVGVLNRVEALADRRPGHRPVTAPDVTSLVPLPEVVGEILGVGPKSKA
VAGQVTSLVSRLGPELDILGDVPLTDIAGVGSPELVEAISRLRRGEVIRQ
AGFDGEYGVIRLFEPRELARDGGTLFDLGSGAAGGQDRIDAGPSLDEALA
ARARPAPDAAVVPGDAAVADSQLFTPVSGDTPSVLDGLDPEQRLAASHLS
GPLLVLAGPGTGKTRTLVHAIAHRVAEHGVPAGECLAVTFTRRAAGELAE
RLAGLLGDAAGRVLATTFHGLGLTIIREQHAKLARGPQVQVADDAVRVEL
IAAALHGEGDARTRRRVAAGVAELKRHRALGQAIRDHDLVGALARYDAAL
RDRDMVDLDDLITLPLTLLRSSPDLAEHYQRRWRHVWVDEYQDTDELQYR
LLGLLCPPTANLCVIGDPDQAIYSFRGADVRFFLRFEQDYPSARPVALTR
GYRSTRTIVRTALDVIAPTSLVPDRTLTAVRGAEGDGPVLLRRYRSEAEE
AIAVVDTIDAALGGTSFHALDSGVDGSVDAGFSFADIAVLYRTARQAEPI
MEALATRGFPFQRRSHLPLADAPAVADLLALLQDLTTTDPSGPGVPRPVS
GLLRDAAARATDLAEARRSELGAVPSDGFPGFSGGRVPTEAELRLAVELL
APAAAAAGNDLAGFLTSVTLAAEVDGLDPRADRISLLTLHASKGLEYGLV
IIVGCEDGLLPMRFGPAGEAPGGGIPGGGVNGTADGTADAGTKDAEAEER
RLFFVGVTRARHRLVLTSAASRRRAGSVVTTRPSPFLADIRPALLSSVPA
EGVPAEGGRRRSRRPAPGKQLRLL
>Francci3_2080 transposase, IS4
MAERKPYPSDLSDEAWDLIRPVLTAWKARHPSASGHEGGYDMREIVNAVL
YQARTGCQWRYLPHDLPPTSAVYYYFGQWRDDGTTETIHDLLRWLVREHH
RRKADPSAVVLDSQTVRSSTNAPKGTTGLDPGKKSPGRKRGIAVDTLGLL
IAVVVVAASVHDNTIGTALLDRVAAAAPPVRKAWVDAGFKTTVVEHGAGL
GIDVEVVGREPGARGFTPLPKRWRVEQTLGTLMLRRRLARDYEAKPASAV
AMIHWSMVEVMARRLTGAATPTWRDPPV
>Francci3_1282 DNA primase
MGGRIRDADVALVRERSPIADVVAEHVQLRPAGRNELVGLCPFHDEKSPS
FYVNAATGLFHCFGCQQGGDVYSFVREIDGLTFREAAERLARRAGVALTY
EGGGTTDRTVSSQRQRLLETHRAAAEFYAEQLESQASARAGWDFLAARGF
DLTTAARFEVGYAPTGWDTLTRHLLARRFTGEELVLGGLAKQGARGGLID
RFHHRLMWPIRDLAGEVVGFGGRRLAEDDAPNAGPKYLNTPDTPLFKKAN
LLYGADLARRDIARRYQVVVVEGYTDVMACHLAGVTTAVATCGTAFGSEH
VAVVRRLLMDSDERRGEVIFTFDGDAAGQKAALRAFEHEERFATQTFVAV
EPSGRDPCELWMSGGDGAVRDLVASRVPLVEFAIRGVLDRYDLNTTEGRL
AALDAAAPLVNRLKDAALRHRYAVNLDRWLGFLDERFVVGRVAEHRSRQG
GERAGARSAPMRRRGAGADDAAVIVEREALKLAVQYPGLAGPMFDTLGPE
MFTVPAHRAVRDLIAEAGGVCAGLARSGEVTTWVTELAAAASEEELRRFV
TALAVERVLCDHDVDDRYVDEQMSRVQELHVTRRITDLKSRVQRLNGVTD
AEALRRAFGELIALEQHKHQLRERGVGAA
>Francci3_4008 putative IS1648 transposase
MTRGDLTDGEWELIEPHLPLGASGPIPDLRSYFNAVMWRFRTGSPWRDVP
NSYGSWSTIYDRFRMWARDGVFQTLMDAMITEAAARDDVDLSLVSVDSTI
ARAHHHAAGMAVDPDLLEDIEKALTEEKGLQKPGKTTP
>Francci3_3437 transposase IS116/IS110/IS902
MQVLFPRCAGLDVHRDTVAAAVRIQTGSGKAVTEVRTFTTTGGSLGLLAD
WLTECRVTIVGMESTGVYWKPVFHLLEDRFECWLLNATHVRNVPGRKTDV
ADAAWISDLVAHGLVRASFVPPKPQRDLRDLTRARRIVVEEKTREIQRLE
KLMQDAGVKLTSVASKLLGVSGRAILEKMIEGEQSLEYLADQARGRLRSK
IPQLQEARAGTFRSGHHGFLAAQLLARIDLCDEQIDELDHRIEVMIAPFR
ETVDRIRTITGVGEVTATVLLAEVGLDMSRFPTAGHLASWAGICPGNNTS
GGKRLSGRTRHGNKWLRTALTEAAHAAARSKDTYLASHHAQVRGRRGVLK
AIGATRHDILIAYWHIIANKTVYQDLGGDWHARRRRDPERRRKNLVGELE
KLGYTVTITPAA
>Francci3_3745 superfamily I DNA and RNA helicases and helicase subunits-like
MTTQTAGPERHAALADRCHRLVRFLHEVAAARSGRIRTIEEHPLTVWLAD
IPAGVPLSEHAGAGEVLLTAPAATVLPPPPPPPAILAGQIVLPDAAGADP
DQPPVWNGAAAEQTAAEQTAAAAAYQAWLPRWQLWATRARVDAERRNLHN
TLHAVAARVAQEGDALELVLATGLLTWQPPGGPTVREHLLTTRLTCDIDP
VTSDIRVRVPAGATTRLADGPLLDGIAGFRRERTNALHERLRGRPSAPLG
PDDIRLLRDWLALALDVPTDGFEPDLAPPAPDSASRPRVTFAPAFVLRTR
DQAALLTYYERMLDVLRGDGPPPLGLVQLVETLDAADRLAWLEAEGATSG
AELGADPLLPLPASPEQARVLERLRHDNGVVVEGPPGTGKTHTIANLVCA
LLAEGQRVLVTSQKSQALRVLREKLPPDLQRLCVSFTDTEEARDAPVDQA
GNAGADLDGRPDAVFDPAPAGAEDEADELGGIGVGSPELAASVSALAAEK
ATYHPEALDRRIVRAAARRQEAIRARDVLADRARALRLAEHTVHPDSEVA
AGWGGTRASIAARVRAEADASRWLPRPVPRSAAATTTARTTATGSGSVGP
APVPPPLTDAEALELHALLRSGQVRSGQVRSGQDVGADRSAGPVVEMSRF
LPAERFADLVATARAADEAVRLARAAVRPATRGLLDTLAAVDPAVVSLDP
LQQAVASLSRALEELDGRGQRSNRRHWVPAALDDALAGRDRVLWAKVAAA
TPMIAEATRHVTELGLHAVALPTSASPDGSASAADAGSPADPGFDPAALL
ARGEALREYLDAGGELRRRFRPKVQKEADTLLTRTLVDGVAPTTPELLDL
VLVRLRAEVAVEAATRGWVLVGIAPRPTDPFEVRLSRLVEAQEAVTEIQA
IVAARNRVVAILEDIVPGRAPAMDSAERVRDLAAVMAALRPMAAATDAER
ALGELREGYAGLATAAGTPSEVAELAGAVAARNVAAYTRTRAALATTMAA
REVALRRDELLGRLRAAHPDLADLLVSTAGATRPNGAEIWPARLAAFTTA
WSWAVAASWLTATAPGAATAPGGPAGPPVDLDAELDAAEDMVAAATAELA
GARAWKHCLERMGAEQSAALRAYADAVAASGTFGGRHAERFRQAAREAME
IAQTAVPAWVMPIREVLSTIPAVRDSFDVVIVDEGSQAGLDSLFLLWLAP
RVIVVGDDRQCTPPVEVTDELDRVFDRLEALLPEVPAWLRVGFTPRSSLF
TLLRTRFGEVIRLREHFRCMPEIIEWSSAMFYRDAPLLPLRQFGADRLPP
LVARFVPHGFTTDAGLGPRNPAEADALVAQVLSCAEDPRYARLSFGVVVP
QGTAQAALIRDQLADRMSAAEHQRRRLRVGVPADFQGDERDVVFLSLVVA
PDSRITPLTRLEYQRRFNVAASRARDQVWLFHSVPVTNLDPRDLRHNYLS
YVLSTTHPAAGGRGEEPPALAEVPTNRPHPAFGSLFEQRVFREIVGRGYL
VTPQVEINGRRVDLVVSGGRARLAVECDGDVGSGPDEIAHEFARERELRR
AGWRFWRVRQSEFELDADAALASLWPRLARAGIAPVNTGARGLRDEAATG
LPRVEHSTVRASADEALHRWRPIALSDLEGLDDAPPDADPRGSARDSAPA
LRRGTGTAPGAGGGDCSPRAERVGQAGGPVTPSGSNGAGRTSGS
>Francci3_2127 putative transposase
MARPVRARRLTDEEGQRLQAIARRGKHGAIRVRRALIIMASASGTPVPAI
ARLVAADEDTVRGVIHRFNEIGLDALDPRWAGGRPRLISSDDEVFAVATA
RTRPERLGQPFTHWSLRKLAAYLADNPDRTVTIGRERLRQLLHHHRISFP
RTRTWKESTDPDKNRKLDRIEYLTGTCPNRCFAFDQFGPLLKRPGFVRAA
PTQTGTTRTRQPSAPPTLQRQGRR
>Francci3_4212 transposase, IS204/IS1001/IS1096/IS1165
MKVAFSGLSPLVVEEVVDDGELVRLRARTPDATASGPSCGAETSRVHGYH
LRTVADLPLDERRVVVVVRVRRLVCPTRGCRQTFREQLPGVLDRYQRRTS
RLARQVGVVVRELAGRAGARVLSALGVVVSRHTALRALLRLPLPARRVPR
VLGVDFALRRRRYATVLIDAETRQRVDILPGRLAGSFEAWLRAHPGVEIV
CRDGSGAYAEAVRRVLPDAVQVGDRWHVWRVRREALVRREALRIEGGARP
PRWAVAAVRWELSAVRPGRRRGGREAALRACPGS
>Francci3_0683 serine/threonine protein kinase
MAGESPDNPGSARAVTLVGGRYRLDGVIGRGGFGVVHRATDELLQRVVAV
KEVRLPLGENADERGLTRERVLREARAAGRLHHPGAVAVLDVIDDGELPW
IIMEYVDGRSLATIINDRGPLPVEETCRIGISLAYALEAAHRLGVVHRDV
KPSNVLVTADGRARLTDFGIAVSQGDPRLTSTGMVMGSPAYLPPERARGD
AGSAAGDRWGLGATLFTTVEGYPPFTGGDPISVLAALVQGRRQPFRLAGP
LIPVIDDLMAPHEASRPPLTVVRRRLREIVERDAVRRPSRPSRPTRPRSS
TRTTSPAARAIPTTPAESATEAVPSGDVTGFPETEVRSETSAVSGISAVP
APVAALTTPDTETRQDTETRQEATPSRTPRSRPTGPLERMHSIDRMRALG
GIPARARLIGRGHGNGLTGGDVELANLFATAAPASPPSTARLRPAPDGPA
GIEPAGIGPAEDPPRDPGAGDRRRVTIAVATLVALVAAAIIGIVIVVTGG
PDDRSRDTSANQTGGISSPTGAGGSALADPAAARDPVARETTPVTGPPGW
VSYVDPTAGWSVAYPSDWQRREGPGGPGNVDFVDPATRSFLRVGSVRQAN
TSAIGDWQRNEAGFRQSVRDYRQIRLAPSDGGDGTNQADWEFGYRGSDGV
MVHVLNRGAIRNGHGYALYWHTREDLWLQDQPLMRQLFATFRPGP
>Francci3_4168 DNA polymerase III, alpha subunit
MPTDSFVHLHVHTEYSMLDGAAKTGLLFKEAAKLGMPAVGMTDHGNMFGA
YEFYQGAKSAGVKPIIGIEAYLAPESRHHKRPVLWGERSQRDVDPAGEGG
DVSGGGAYTHMTMLAANAAGLRNLFRLSSIASIEGYYRKPRMDHELVSQY
SEGIIATTGCPSGEVQTRLRLGQFDKALAAAATYQEVFGADNFFLELMDH
GLPIERSVRQGLLDIGDKLGLRPLATNDSHYVTQDQAGSHEVLLCVGTGK
KLDDPTRFKFDGSGYYLKSSEEMRNLWDSEVPGACDSSLLIAERVESYDD
VFKFVDRMPRYPVPAGETQLSWLRKEIDRGLTWRFPAGVPADVVERVDYE
VGVIDKMGFPAYFLVVADICKFARDRGIGLGPGRGSATGSMIAYILGITE
LNPIEHALIFERFLNPERISPPDIDLDFDERRRGEVIRYITEQYGEDRVA
QINTFGTIKAKAAIKDSCRVLGYDYALGDKISKAMPPDVMGQGIPLAGIF
DPNHERYGEAAEVRAQYETDTKVRKVIDTARGLEGLTRGTGVHAAGVILC
SEPLLDVLPIHRRDNDGAIITGFPFPQCEEMGLLKMDCLGLRNLTVIGDA
IEAVKRNRNVDIDLSTLPLEDAKAFELLARGDTLGVFQLDGGPMRNLLRL
MAPTKFGDIAAVLALYRPGPMAANSHIEYADRKNGRKEILPIHPELAEAL
EPILGETYHLVVYQEQVMAIARELAGYSLGGADLLRRAMGKKKKEILDKE
FARFSAGMKERGYTDAAVQALWDVLVPFSGYGFNKSHTAGYGVVSFWTAY
LKANYPAEFMAALLTSVGDDKDKMAVYLAETRRMGIQVLPPDVNESDLRF
GAVGDSIRFGLGAVRNVGENVVASIAAARRRKGAYESFADFLQKVDIGVC
NKRTIDSLIKAGAFDSLGHHRRVLVNVHENAIDAVIITKRAEAIGQFDLF
GDGGAGEEEESPGLGLDLDLSGPEWPKKELLAQERDMLGLYVSSHPLEGA
ERALDRHRDTRIVDLAEANDGTTVQIAGIISKIDRRINKNTAKAWAIVTV
EDLDASVEVLFFPQSYEVHSYALATDAVISVRGRINEREGSVSLFAQDLT
VVDVATHVNGPPVVITLPSHKITPPLVDDLKLVLTTHPGTTPVHLRLEGP
QNTHLLLLELQVQASSSLLGDLKALLGALLHWAEAGRGGAVSGGRSDGQG
EFGEGSRQPVPEVGIDAQFVVAAADVLHERVPGADDLCGAEAFQAAHRP
>Francci3_3532 phage integrase
MFEDSDHRVIEAIRLPLWGSVVASDGVVPWRLVDGLGEPVEPVEVFLRDF
VAQGRSANSVRSYALALLRWWRFLVAVGVAWDRVSPAEVRDFVLWLGQAT
KPVAAARVGSRATVGQVNPVTRKRHLGDGYGPATVRHSNAVLRTFYEFWL
ERGEGPLVNPVVLARAGRGRAHAHHNPLEAFRPEGRLRYNPRLPKRKPRA
MPDERWDALFAAMGRNRDRAILALGVSTGARAAELLGMRGADLDWGGQLV
RVVRKGTRAEQWLPASPEAFVWLRLYLADVGGGVLGAGDPVWRTVYRRGG
VHEPLNYEALRAVFRRANRRLGANWTMHDLRHTCAIRMVRDGRLSLRDAQ
TILGHAHLSTTQLYLEQDDEEVFARVREHLAARERPRPAPPAPAALGYDP
ADLAVLFGNPR
>Francci3_3052 phage integrase
MIVETGERGALGRHLAANGGPADPAELERSHVEAFLADYAATHAPATVSL
VFRALQQFGQWLAEEEDLDRSPLDRMRPPVVPEQPVPVLTDDQIRTLLAG
CACRDLVSRRDEAIIRLFMDTGARRAEIANLTVEDVDFTQDVIHIVAKGR
RARAVPFGTGQRPREAGPAHHARRMGSMKIRLEGSDRARRGHGCDQGVVP
GGGPSVQRRPDAPRSAASGWRPRRGGGPGRVKPVRHAPPGHGGGPVCVRR
PWMLTFPAPRRSTSATRGFFRSSFALLSQRGKAGRPRVF
>Francci3_4189 putative transposase
MIPEKADQKANRKKRGSLGGRPVSHDATLYKDRNTVERSINKIKEWRGLA
TRYDKTPESYAAGLHLRGSILWLRSLPTP
>Francci3_0268 recombination protein RecR
MYEGIVQDLIDELGRLPGIGPKSAQRIAFHLLAADPVDVRRLATALTEVK
EKVQFCRSCFNVAQSELCRICSDPRRDPSSICVVEEPKDVVAIERTREFR
GRYHVLGGAINPIGGVGPDDLHIRELVARLADGTVTELILATDPNTEGEV
TASYLARQIAPMGLKVTRLASGLPMGGDLEWADEVTLGRAFEGRRVVSA
>Francci3_0877 DNA integration/recombination/inversion protein
MSAGRQGSVRKDASGRWFFVVDITAAGGPRRQARRRGFATKKAAQAALTG
FLGKLAAGTYVEPSRLTVREFIETRWLPAVEGELRPSTLASCRRNLRLHV
LSRLGGVRLQLLDTATLQAL
>Francci3_1870 phage integrase-like SAM-like
MVVVRRRVVVLPSTPMLLIFFSSRGWESWDVEAAPLIPERMPVLVDDDLR
FEDGPGCVRPAAVVNRWLRELPACGVPAPRSWAAYARAVKDWVEFLAGHG
IDVLGPREQLKLALGKYAEDRSAGPVERRLAASTWSQHISIVSMFYRWAM
AEGYASAEPFTYRTAKAMVGGMVRDVRVNLAMRRTPKPHVTIKYLEADFA
TLLLNALAGLDRDGQADAGYAGRELARNRAVVGLALATGLRLQEFSSLLV
YEIPTPPSRPGALPVPSGVAKGRKFRTSWISVEALRVVHDYVTLDRAAAV
EGASWCPPARWGPALRVSEPDMRGGRINGVRRDWDALTPGERRRLVAPDG
GSCLLAVRSDGGPFTAWATVLERASDRIRARVELRFPHVHPHRLRHSFAM
ATLERLVDGHYRRAAELVAAGGDGGPDAALALYLSKADPLLVLRDLMGHS
SVTTTEAYLRRLDTTRIFGEAYARAGAAAGLADDPAADTEVAAEFTDEPG
EDD
>Francci3_1864 transposase IS66
MSVLSVTDDVTEVAYWRGRAERAEECAEKAEARVGQLQLRVEELSEQVAV
LSRMLFGRSSEKTGPSSAVDEKPEDRQDSGGGDAGRPARQRGQRPGSRGH
GRRDYSHLQTREEIHDVPEVDRACPGCGVAFTPLGTDDSEQVDWQVVITR
IVHRRRRYRRCCTCPGPRTVTAPVPPKPIPKGRFTAGFLARLLYEKYVLG
LPLHRIARALAADGLGVAEGTLCGALKDVHGLLGGLDEQIVARNAAAGHV
HADETTWRVFERVEGKDGTRWWLWVFVAADTVVFRMDPTRSAAPVEKHFG
IDRAAGALSDGRRLVVSSDFYTVYQSLGRVDGVDPLWCWAHIRRYFIRAG
DAHPQLRYWADQWVARIGMLYLAHRALAAEQPTTGGYREAAGAFEAALRA
IDTARRAEAAIHSLHPAAKKVLATLDREWDGLARHQDFPDLDLDNNAAER
ALRTPVVGRKNYYGAHAEWAAHLAARVWTIVATAERNGREPLAFLTGYLN
ACATAGGKAPAGPALEPFLTWQTTTQTGSPPSTDPPQDGPPDGPEP
>Francci3_1380 type III restriction enzyme, res subunit
MRAGPLDRSPRPRRPVPPARPLRAWQRAALEIYRSRSSSGARDFMAVATP
GAGKTTFALQVAADLLTAGEISRITVVAPTEHLKRQWAVAAAEVGVDLDP
DFRNSAGATSSDYTGVAVTYAQVAAHPALHRARTAARRTLIVLDEIHHAG
DALSWGEAIREAFTPAARRLALTGTPFRSDVNPIPFVTYLPGPDGVMRSI
ADSSYGYAEALRDGVVRPVLFLAYSGEMTWRTSAGAELSARLGEPLTNEQ
TAAAWRTALDPGGDWMPAVLASADTRLSQVRAGGMPDAGGLVIATDHAAA
RAYAALLTRITGTSPVLILSDDPTASTKIDDFRRSADRWMVAVRMVSEGV
DVPRLAVGVYATSASTPLFFAQAVGRFVRGRGRGETASVFLPSVPSLLAL
AGEMEVQRDHALEKSPRDPDAFDDDALRDANRRKDIPDKPDTLFTALGSS
AHLDRVIFDGGEFGTPAAPGSIEEEDFLGLPGLLEPDQVAVLLRQREAAQ
QAAQQAATRRAAARAADSPVVPPAREGVQPAVEAATRPVHEVIGELRKEL
NRLVGAHFHRTGKPHGMIHAELRRTCGGPPSGQATAAQLQARIDTIRRWA
G
>Francci3_2681 transposase, IS605 OrfB
MSRTAGRWFVSFTVEVDRDVPERPSRRQRAGGPVGVDLGVKHLAVLSTGQ
TVPNPKHYQRAERRLRRASRAHARSKPGSAGRRQRAAQLATIHVRVANQR
HNGLHKLTTRLARSHDTVVVEDLHVAGMVRNRRLARAVADAGMAEVRRQL
AYKTRWYGSTLVVADRWYPSSKTCSGCGWRNPSLTLSERTFTCQSCGLVL
DRDHNAAINLHHLVAASTPETENARGADQKTRASGQVAGKREPGTATAGQ
TGGASPRGEAA
>Francci3_3814 hypothetical protein
MSDGATGAHRPAPPAFFLHAVAAGGLRGLPPPVPDDVSGPAAAAQGAALG
AHLLRRQRAQRAATLVGGGPIEQRVPQRAAALLRLLWVDEGFRDGEQSYR
WRERGARMAHDYVAHLDPAAEPRGVERSVAARTEALAFSGRVDRLDDRDG
ELVVVDYKTGRRPVTDDDARSSPALALYALAAGRMLRQPCRRVELHHVPS
GRVAAAEHTEASIARHVRRAESVAADAVRATEALRAGEDPQRAFPPSPGP
LCSWCDFRSHCPEGQAAGPERQPWDALGDEGDED
>Francci3_2891 hypothetical protein
MNGRPPSSAECQSARTVYGYISSEHADEAEIDRLHDQLTAHAQAEGLALA
EIFVDRSIPPGRIVRPGLTVLLEAVMRSEAAGVLVASLDHLSPLPAVRQA
IEVEIEVLGARVLTVTPAVPSPRST
>Francci3_3388 phage integrase
MFKRCGCRDPQTGKDYGARCPKLPRSDHGSWLYIHDVPPGPNGRRRRTTG
GPFETETEAEVALTGSLAELHGGGRPDDTGLTVARYLDDWLDGKASLAAS
TRRSYEEHVRLYLKPGLGHLRLADLRDHHIEQLYAAMRLIGRDLHGRKRS
PLLVRLLEVRKDDPNHRRPLSASRLRRVHATTMSALNSAVKRKKLGVNPA
EHVELPKARKPRPLVWTAPRIAAWERTGKRPSPVMVWTPQQAGAFLDFTV
ADRLYPLWHLIAHRALRRGEAVALGWTEIDIDDDSIYILDNLPGSVSGAD
DNLDLDGDEYTEPKSDAGYRTVSLDPATKNVLTTWQARQDEERAAYGKAW
VDSGRMFTHPDGSRLTPNGVSQRFERLITRFATIRHEHAEHSWTVDQLAA
RHFMPEDAIQTALAFGPLPPIRLHDLRHTAASLTYRATKDLKVVSELLGH
SSVHFTGDVYTSVFADADRAAAKAAADIVPRRHPPGQVEPDSLDDPTPPP
ANGPGLDL
>Francci3_3256 serine/threonine protein kinase with WD40 repeats
MSAREGQVPAGGGGRGFADRAATVAVTTPDGSVPWDVANSRATPLLDSDP
VTVGTYQLVARLGQGGMGIVYLGRSRDGRPVAVKVVRTDLARQPEFLARF
RREAEVAQRVARFCTAEVLEVDVEADRPYLVTEFIDGPTLADAIAANGPM
AEADLERLAISVAAALTAIHAAGMIHRDLKPSNVLLSRLGPRVIDFGIAR
AMDSTTSLTTSTGLIGTPAFMAPEQARGALVTAAADIFAWGGVVTFAGTG
IGPFGKATTPVLLYRAVHEAPKIDGLPDGLRSIVAKTMSKEPADRPTASD
LYQSLLGLSNVDEAVRGEAGPTAVTVVTDPPVKPPARMDAGASRTPSDSV
GSVGSANPAMPPPSDDILSPLDLSPLDRPAPGRGYPEGGEPKQQQRQRRG
RWQRRTVVSLVASLCAVAVAVTATLMFVTHKDPASVPESIEVSRRLASEA
AAASTTQPQLARQLALAAYRVEPTEEAQRSLIENFGGVEINTLVGHTERV
IGVRFSPNGRMLVTGSDDSTVRLWDISNPMATRPLAVIPGDSGTFLQGGF
SADGRLLSTSTDDHIVRLWDISVPERPRSLAVLRDVNNSVAFAVGGKTMA
TAGTDNTAKLWDITDPHQPRLAATLVGHTDWVNTVAFSPDGRMLATGSHD
RTIRLWDVSTPTRPTPLAILNGHENAVLGLAFRPDGRILGSTSADRSARL
WDIATPTEPKMLYTFTDRSDWVSSVAFRVDLHLMGTTARKAARLWDISDP
RNPRPVGGLLGHTATVRRVQFSPDGTLAATSGDDNTARLWDIDPAHIAKR
ACTDPTAVMSEAEWERHAPQIPYPSRCP
>Francci3_2052 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_3431 Resolvase-like
MASYDVRVNLKEWAAANGVGYTTARRWYRDGLLPVPARKVGGLVLVDEST
VPAGRPVAVVYARVSSADQKPDLDRQVARIVTWAASQNLAVGRVVTEVGS
ALNGHRRKFLGLLRDPAVATIVVEHRDRFARFGAEYVEAALSAQGRRLLV
VDPGEVDDDLVGDVTEILTSLCARLYGRRAAANRAARAVAAATEAGE
>Francci3_3419 serine/threonine protein kinase with WD40 repeats
MSHPPGRSGPADGPADRAASEPTEVGPYRLVRRLGAGGMGTVYLGENAAG
GLVAVKLIRADLARLAEFRSRLKQEADNARRVARFCTAAVLDVDITADPP
YLVTEYVDGPTLSEAVGTRGPLTPAELHQLAVSMTTALMAIHRAGLVHRD
LKPSNILLSRLGPKVIDFGIARALDSATVLSGDRQLGTPAFMAPEQALGE
QVTSAADVFAWGGVLIFAGTGRYPFGNGPAPSVLYRTVNDPPTLDGFEDS
LRPLVSDAMRKAAAERPTAEKLYARLLDLRVEAPMAVKGLSLSEVTALIR
PLNTSPGGASGTSASDADLAGPITPVTGHQPIGPPPPANLISFPGPGIPG
APARSGPSSVPGPSSVPGPSSVPGPTSSPGLGSSSSGSEASPSGPDARWL
PGAGSSSRSRDFADALVTVWTEDEHSQDPGRGASPSSSSSSSQRSSSSQR
SSSSSRRSSSAAPPGDRARNRRPLIIVALVAAVTVLVTTVAIVSTRGGGS
RTEMSVPEAVAERALLLQDGDTGLARRLALAAYRAEPHSARTRSAMIALF
GAGITPTTIPVGTGALLALAVSPDGHWIAAGSNNGTVTLWEVVGRTELVR
RTSVSVPSRSWIESLAFNRDGGLLAAGHSDGTIRLWNLHDPDQMVRWSTI
QAHTDAVQSVAFSPDSNTLGSASADGIVALWDVTDPARPKQRVRADGQTG
GVRSMAFAPNGTLLAFAGEDGTVHLWNIRDAARPTAGGILRGHSRGVRSV
VFTGDGGVLVSGGVDATVRLWEVRYPDNPARGVATGSLGGIQSVAFEPGA
DVVASAGDDETVRLTDISRLDTPILLTQWHGHTQPISAIAFVSGTGVVVS
AGHDGTLRLWDAEPGRLADTACADPANRITAGEWSTAFRDMGYRAPCG
>Francci3_3156 NUDIX hydrolase
MTATPHRYEVTESTLAYQGRVISVRRDQVRMPDGDISQRDVVVHPGAVGV
VALDEADRVVMVHQYRHPVGGPLWELPAGILDVPGEPASSAAARELAEEA
GLRADRYDLLVDVWASPGMTDEAYRLFLARGLHEIPAAERYVPVHEEAEM
GLARIDFTDAVERVLRGEITNAMAVIGLLATARARAENYAGLRPPDAPWP
ARPAHRG
>Francci3_1875 Transposase and inactivated derivatives-like
MSLQPKGLPEIPAQTVAVARAAFPRGTLAMRLRDRLAEVLVDEPFTGAFG
RRGAPGLPPAVLSLVTVLQFTENLTDRQAAAMAVRAIDWKYALGAELTDT
GFDPTVLSRFRARLADHGLERVVFDRLLQACVDQGLIGGGRRARTDSTHV
ISAVRDLNRTELAGESVRAVLEALAVAAPGWLADTVGIDELAHRYGERVN
GWTMPSSKTARDRLAVVFGQDALALCRAVAASTAPGWLGEIPAVAFLREM
LVQTYYLSTDPVHRSPGTGGDRKAGRRQTGCPAWTSSLGLPLRPRCAVGG
QGRRPVLVRLQGPPHRNLRPSPRSRSRSRRVAVTADHERSHHPGERPGRE
GHRDRPGQPR
>Francci3_2621 ABC transporter related
MSTAMSTAMSTATRTDPRSPDPRSPDPRSPDPRSPAPHVADRSDVIRVHG
ARENNLRDVSIEIPKRRLTVFTGVSGSGKSSLVFGTIAAESQRLINETYS
AFVQGFMPTLARPEVDVLEGLTTAIIVDQQRMGADARSTVGTATDANAML
RILFSRLGEPHIGSPNAFSFNVPSVRAAGAVTVERGAGKTKTVKQTYTRA
GGMCPRCEGRGTVSEIDLTQLFDDSKSLAEGAITIPGYKVDGWWTVGIFI
ESGFLDPNKPIRQYTKKELRDFLYKEPTKVKVNGVNLTYEGLVPKVQKSF
LAKDPDALQPHIRAFVDRAVTFTACPDCGGTRLSEAARSSRIKGINIADA
CAMQISDLAEWVRGLDEPSVAPLLATLRRTLDSFVEIGLGYLSLDRPSGT
LSGGEAQRVKMIRHLGSSLTDVTYVFDEPTTGLHPHDIRRMNDLLLRLRD
KGNTVLVVEHEPETIAIADHVVDLGPGAGTAGGTVCFEGTVEGLRVSGTR
TGRHLDDRAALKEAVRTPTGRLAIRGATAHNLRGVDVDIPLGVLVVVTGV
AGSGKSSLVHGSIPHLSGPAGAGVVSIDQGAIRGSRRSNPATYTGLLDPI
RKAFAKAGGVKPALFSANSEGACPTCNGVGVIYTDLAMMAGVATTCEECE
GKRFEASVLEHHLGDRDISEVLAMSVTEAEEFFGAGEARTPAAHAILNRL
ADVGLGYLSLGQPLTTLSGGERQRLKLATHLAEKGGVYVLDEPTTGLHLA
DVEQLLGLLDRLVDAGKSVIVIEHHQAVMAHADWIIDLGPGAGHDGGRIV
FEGTPADLVAARSTLTGEHLAAYVGT
>Francci3_4112 putative IS630 family transposase
MAGRDLGEGTRLAATTGAWIVFEDEAGQSLRPPKARTWAPRGHTPTVRVS
GKGSGRVSMAALVCYRPGQRPRLFYRVLTHHGRKGERRSFSEDDYATLLV
AAHHQPRAPIILCWDNLNTHRSAAMRRFLTRHAHWLTVIPLPAYAPDLNP
VEGVWAHVKRDLGNHVRVTVDQLTATIKTLLKRVQYRPDLIAGFLGQTEL
IIDPEPP
>Francci3_3732 transposase, IS204/IS1001/IS1096/IS1165
MSDSPTWPAGRACSARSEGRTADDVAYWLASQTPSWRDRITHVAIDMCTV
FVAAVRRYLPGATLVVDHFHVVKLANDAVTEARRRVTTQLRGRRGRDTDP
EWKIRNLLTRNRENLTDRQLTKLWNTLIDLGEPGQTILTAWIAKEELRAL
LALARTHPARTVIAYRLTRFYTWCADAAVPELERLATTISTWWTCIEAFP
HSGITNAASEGHNRVVKLDARNAFGYRNPENQRLRTRCATTRRTRRCLNP
A
>Francci3_4193 transposase
MTRWILTHPEHLNEDETLASKTILTRCPDLQKTADRVTAFAQMLTGRHGD
RLNGWVDADDLSELHRFTRGLLRDHDAVLNGLTLPHSSGQVEGAVNRIKM
IKRQMYGRASFDLLRKRVLLAT
>Francci3_3802 methylated-DNA-(protein)-cysteine S-methyltransferase
MSPRPRARRDAPPASVSAGLGTRISLGGAPGPHALEVLDAVARIPPGRVM
TYGDVAEYVGAGSGRTVGAVLSRFGDEVPWHRVIRATGEPNPAAPVEALR
RLVADRTPLRPGGDQVDLAAARWDGSPA
>Francci3_3338 hypothetical protein
MIATVKVLTLRARDGETVARAARAVVAYVEGGQPGAVAPLRRYYGEGLVP
GWARGSAAYLVGLDAGRPVAGEALERLLRGEHAVTGRPLLTALGSAGRAS
PPVEGQRSAGPGGGLLTLAQAARRAGVSAAYLRALAVRTAAMATAERSAS
RGGDADAGGSAQGADRAVHERTGAAGMRRGPSAGSVPGDEGRAVDNGVGE
GRGPWLAAVREAGTGRWLVSATEVDRFCAARVPPAVVLGYDVTCSAPKSV
SLLWAFGDEEIRRDVAAAMDAGVEAVLGYLERHATVGTVAGRNRPGVGVA
AVSYPHEVSRSDEAHLHVHSIVVNATAVPDLDEQGRPVADEQGRGRVDWR
ALDGEVFLSHVKTAGYVGAAALRHELSRRRGLAWGPVRNGVAELAAFPAQ
LLAAFSTRHGEVQAEYAQLVADGLTPGGVTEAAAQRGSRAAKKVLADAQV
RRIQHERLTAAGWTPQRVRALAAPASRNRAPVDGEDLAGLCDLLTGPAGL
TEHDSTFDRRAVVRRVAAWAADRLPADEVDRLTDQVLADRRIVLLGHSAA
RARQQPEPVYTTQELLEVEDTLLALCRQGRVEAGAQPRILVDPATLEAHL
AAAQQRPSSDGPGGVGGEDGGGQGNGGQGSGGPSGPALSAEQITLVRRLL
TSGDLVRPVVGPAGSGKTEAMRLLTRIVHAGGGQVFAAAHGGRQTEELTG
RIGVAGRVVSGWLTLLDHTEDPGRVWPAGSVVIVDEATQVSTRDAARLAR
YASRTGTVLILLGDPAQLGAVGAGGWYAHLVASTPDVPALGSLHRQTGAA
LAPVRAALGALRAEGGASARKALELLAADGRIRLFDSREALLAQVVNDWY
TERTAPHPRGATDPDSATDPGSVDGAGSTEGDGWTAAGPGRRRSGGTVRP
RPAAALHMMAERTRDVEILARAARDRLAADGTLTGPVLTVAGRDFQAGDE
VITLTQTGHTLIPAGKPASAYIRTGTLGRVTAVHLDPDHPDRQALTVRFP
RKGSVRVPWEYLTHRFTDGRTGGLGYAYAITAAKAEGSSLPTARAVAPDD
TSRAGLYVMLSRARTDLAAYVIRRADLEADLDEEDWLPVLRDPTGPLERF
ADHLAQSRTERLASEYDPLAHAAHRLRRTHTLAGLARLPAPPPSGGAHRA
AGAPPAPPAPPAVLRRAELAAEAALRTAAVANPPADLVARIGPRPAAAGA
DRALWDRAVGALAVHHARYRPAVPPHDPGPPLSSGEPAGTLRARWMLHHD
QATRLARTWADVLPRRARARFHSRAEQIPRARAIAGLHALLDNGHQPADL
LVALTREDQSSVRTGAAVLDHRVTDLCQQQGLHPTDYLLPPPRPARDEWN
ELVGLLDTCEIHHLARHPTAQLAAERRHLRDAQGATVPRARPHGEGSRAS
TVEARTGRQDRLRLIEEALDRQIAHAVFRAGIDPADYLTGLLGARSSAGL
DATGWDSRVEAVEGFRHRDLGLPYGTPATTDGETDPLRRAVGDRPTDPAL
AEGYRGIRALIREHTPTLDL
>Francci3_4000 transposase IS66
MLRCVTVVESGAGAAASGEVAEGAALLAENAWLRARVAELLTDIAGLVAR
EATREAEVVELRLQLEALQAELATLRRMLFGRSSERECGGSPAVGSPDGG
DGCGDGARGEAAGSAGRRRGPGARSGRRSYDHLSRDEVDCDFEGGGYGCL
SCGQPFTPWGEHVVEQLDWLVTVRVRVSRRRRYRRGCRCGGSLTVTAPGP
SKAIGKGLFTHRFLAMLIVERYVAGRSQNSLVTGLARHGAQLSPATLTGA
CAQVAGLLAPLAEQIVGRSRGSWHLHADETTWRVFTPTGGGGPARWWLWV
FLGPDSVCFVMDATRSTAVLAEHVGLDPDSGQLTDDADGGPRRLVLSSDF
YTVYVSAGRRADGLVNLYCWAHARRYFVRAGDANPAQLGIWARQWVERIR
ALYTAHGELAAAWHTAAAAPSPATEKRLAAAYAGWDTAITVIDTVRREQT
ASPGLQEPARKALATLDREWDGLVAHRDYPMIGMDNNPAERAIRGPVVTR
RNAGGSRTEDTARHAATIFTVTATAAMHNLNLLTYLENYLDACGRAGGKP
PTGADLDRFLPWAASPEDLTTWQQPPG
>Francci3_1776 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_0191 putative IS630 family transposase
MAVHKRLGRPVVLVWDNLNRHLCAEMAAFIAANTLWLTVVALPSYAPDLN
PVEGLWSVLKGGQIANRAFDDVDLTLKPS
>Francci3_1962 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_3588 Ribonuclease H
MRHYGVALRRDAGLFGYERALNRHGLGPVAGVDEAGRGACAGPLVIAAVI
LAPEARQRLARLADSKLLTEQIRESVFEDVMAAAAAWSTVVISAAEIDRV
GLHVANITGMRRAVARLSARPGYVLTDGFAVAGFGVESLAVVKGDRVVAC
VAAASVVAKVTRDRIMRALHTRYAEYDFAQHKGYVTAAHAAALARCGPCD
EHRMSYVNVAAHAATTREARSLRLEDRVLVTSRHGVTETV
>Francci3_2098 hypothetical protein
MAGTVRALSFTRHTGTGDSAEQRDVFTSACAARGWSAGGAVREFGSGLRA
WWAALRLVRAGRYDVVVVDSIDRLAETESGQLAALEMLRCAGVRLLVARD
GTDTADPVGAELVASLLTV
>Francci3_4218 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_4290 NUDIX hydrolase
MTELPANADGAFDTDGTTVPPPPAWLRALAEKAGQALVTSRTHPGHAGAD
ARPAAVLILFGDDGPEGPDILLLERAAELRSHASQPAFPGGATDATDESR
VHTALREAEEEVGLDPAGVEVLAVASPLYLHASRYLVTPVIGWWHTPCAV
VPVDPAETSSVARVPLAELADPANRVMLRHRTGLAGPAFRVRGMLVWGFT
AGILDVLLRLGEWERPWNDTELIDYPVARATTSGTPVVPAQTTSPPAPSH
PPAPSHPPAPSHPPAPSHPPLQSGPLDKSPPLDKSSPPAAPASPCDERHS
>Francci3_3344 CRISPR-associated protein TM1801
MAFNLDPEKKHDMVLLFDVTDGNPNGDPDNGNRPRTDDETGHGLVTDVAI
KRKVRDTIGLAAEAEGLDLTRYQIFVEAGHALNTRLEESYLVKGLELGKK
IDDAKAAKAREWLANRYVDIRLFGAVLSTGKTQSLGQIRGPIQVGMARSL
DPVLPVDHAITRVTQTTQADIDKGERTEMGGKWTVPYGLYRAEIHYSAPR
GRQTGVSAADLDLFLCTLVNMFDHDRSATRGEMATRGLYVFSHHNAFGVA
PAHTLSARITARKISAGEPRSFGDYKIDVDDADLPDDVALTRVLG
>Francci3_4191 transposase, IS204/IS1001/IS1096/IS1165
MQEESGLPPPTDDVLTRGTHRVKRLRRSDLGVVLPHLAGLVVDRVDRSGD
RLRLRARVRGAAAACPGCGVLSERVHGRYRRRLVDAAIAGAHVEIDLLIR
RFRCLAVSCPRVTFAEQVVGLTHPHARFTPLADRLIEAIGLALAGRAGAR
LAGRMGLPAGRNTLLRRVRALPDPQVGAVTVLGVDDFALRRGRVYGTVLV
DLDSRWPVDLLPDREAATFAGWLGIPVRG
>Francci3_0496 Integrase
MDTARGVLAEFSHASTPTQDLVLAAIEARLEQEHGPGVVRLPGRTRARAL
LRELSRGTSAFGGAKGRREIAGRPVAPYGKLRAHRPGEYLLVDTTRLDVF
AMERVTLRWVQAELTVAMDRYDRCITGLRLTPVSTKAVDAAAVLFESIRP
LPEPAAGWVDVRPPYHGVPGRVVVDVERLVDAGGVPLLPSVAAETLVVDH
GRIYLSEHLLSVCQRLGISVQPARVAQATDKAAVERFFRTLREQLLVALP
GYKGPDVHHRGADVEEQAFYFLDELEELIRQWVADCYHRQPHGGLVVPEV
PGLAVSPLEMFAHGVARAGHLQVPARADLVFDFLAVEWRTIQHYGVEIGG
LRYDGPALSPYRNRTSPHTGVHAGKWPIRVDADDVSRVYFQDPADQRWHV
LRWEHADALGGPFSADALAYARQLATATDRFPDTRRALARLLERWDAGLA
GNRAERRMAVRLSERRLRLVGDTAVPDEPAPAVASPDQDRSAEETAGDDD
RDDELGAPFPGEDDFYADAMEIV
>Francci3_3701 NUDIX hydrolase
MHSVSVAGVTLNEKGLILCIRRRDIGAWQIPGGVLERGETLHTGLRREVE
EETGAVVEPVRLTGVYLNMPLGVVAMVFLCHHPTGVIASDTAEATEVSWL
SIDEVRTRFVPAFAIRVADAVAGRLEPFIRTHDGVSVLPSDAAE
>Francci3_2921 transposase, IS605 OrfB
MSYDGHVNLKEWAESQGVAYVTARRWYAAGKLPVPARRVGGLILVGEPDQ
PTGDGLTAAHARSKPGSAGRRQRAAQLAAIHVRVANQRHNGLHKLTTRLA
RSHDTVVVEDLHAAGMVRNRRLARAVADAGMAEVRRQLAYKTRWYGSTLV
VADRWYPSSKTCSGCGWRNPSLTLSERTFTCQSCGLVLDRDHNAAINLHH
LVAASTPETVNARGADRKTRASGRVAGKREPGTATAGQTGGASPRGEAA
>Francci3_1203 phage integrase
MAARRRFGSIRRRESGRYQVRYPGPDGQQRTAPETFARKSEAERYLTLIE
GQILRGEWIDPERGKVTLTDYAARWIVERPNLRPRTIGLYSGLLTRHITP
YIGGIPIGKLTTPIIREWRTKLLESGVSVGTTAKAYRLLRAVLMTAVRED
ELIRTNPCRIPGADQENAPERPVLTVSQVFALAEKLGGRYRALVLVTTFA
SLRWGEVAALQRRDLDTDDGIVHIRQSLVEIGGQGVVLGPPKSRAGVRTV
SLPAVILPWLRIHLAEYVADDPAAFVFTGPKGGFLRRGNFRKLVGWSDAV
AAIGMPNLHFHDLRHTGNTLASRTGASLRDLMTRMGHDSPRAALIYQHAS
TEADAAIADALSAVLAAQQGSVPPPAPRPDDTPEDGAAGALVPA
>Francci3_3321 hypothetical protein
MDVMGSSLAPRLVPDDLWKLVEPLLPRFETRPQGGGTAPVEDRAVFTAVV
YMLTASPREATTCAEHNSQAA
>Francci3_1552 Helicase RecD/TraA
MIGREPFRGAVLDAVLERITYANEETGYTVARVDTGRGGDLVTVVGALLG
AQPGESLRMRGRWGSHPQYGRQFQVEDYTTVLPATVQGVQRYLGSGLIKG
IGPRLAERIVEHFGVAALDVIEKEPERLIEVPKLGPKRTRAIAEAWEEQK
AIKEVMVFLQGVGVSTSLAVRIYKQYGDASIGVVRNEPYRLATDVWGIGF
RTADTIAKAVGIPHDSPQRIKAGLLFTLSEATDGGGHCFLPEPRLISDAV
QILQVDTGDVIECLGELVAEDGVVREEVPSEDPATPTAAIYLVPFHRAEI
SVAGQLRRLLNTGADRLAAFASVDWDRALGWLRERTGVELAPAQRDAVQL
ALTSRVAVLTGGPGCGKSFTVRSIVTLALARQAKVILAAPTGRAAKRLTE
LTGHEASTVHRLLELRPGGDAAFDRDRPLDADLIVVDEASMVDLLLANKL
AKAVPPGAHLLLVGDVDQLPSVGAGQVLRDLLAEGTPIPHVRLTQVFRAA
TESGVVTNAHRINRGDYPLVRGLPDFFLFAAEEAEEAAKITAEVVARRIP
AKFGLDPRRDVQVLTPMHRGPAGAGALNTLLQEALTPSRPNVAERRFGSR
TFRVGDKVTQVRNNYDKGASGVFNGTLGVVTALDVENQTLTIRTDEDEDI
DYDFGELDELTHAYAVTIHRSQGSEYPAVVIPLTTSAWMMLQRNLLYTAV
TRAKKLVVLVGSRRAIGQAVRAAGHGQRHTALDHRLRSG
>Francci3_3989 Recombinase
MPRSTARRTTAKRTKQAVAAQPEIVRVGIYLRRSTDDENQPYTIEAQEER
LRSYVDSQPNWIVALRFADDASGATTERKDLQRALAAARNGLIDVLLVYR
VDRLSRSLRDTVDLLEELEQAGVVFRSATEPFDTATPIGRMLLQILAMFA
QFERDMIIDRVIAGMERKAAKGLWKGGRRPFGYQVDKIAKKLIIDVAEAT
IVRLIFDLYVRDRLGTRAIASVLNNRGLRTTVGGPWSGHKILRMLDNRAY
LGELTFREITVEGTHEPIIDEETFDAAQKILTERSEETSRRASNPSDYYL
TGRMRCPQCGTALIGTRATGRNHTYRYYTCHTRNRYNRHECDAPRLDADA
VDYAVLTALAGFYRDHQQLIADAVLKAQRSHRDARSEHTAELSTIETELT
LTDQAIDRYLGAFERGTLDDETLATRLEALRTKQKQLRQRQAELTEEIDH
EPVMPARSSLRAVTRHIETIIETGDDLGRKALIEALVAEVKITGPDRLTP
IFKVLGPDAPRDVTNVDTGDISQPKLTQPATSPAPHRGAAAVLPATTPPK
GAVRAMPTLVEVRGLEPLASSVRGRRSTRLSYTPWKHLKLTGPRSPCDDH
PVRMDRHLRPAVS
>Francci3_3240 transposase IS116/IS110/IS902
MDLLEAAGPGAERRETRMLFVGDDWAQDHHDVEVQDETGRRLAKGRLPEG
VAGIARLHALIGRHLAEDAGPEQVVVGIETDRGPWVRALVAAGYQVIAVN
PLQAARYRERYSTSGAKSDAGDAHSLADMVRTDRHQLRPVAGDSDTAEAV
KIVARAHQNLIWDRTRQTQRLRSALLEFFPAALAAFDDLDTPDALELLAK
APSPAEAARLTVAQISAALRHARRRKIPERAAAIRAALRAEQLPVTPAAT
TAYAAVVRAQAGLLAALNGEIARLEEQVADHFDQHPDAKILLSQPGLGPV
LAARVLAEFGDDPTRYADAKARKNYAGTSPITRASGKKKTVLARYARNNR
LADALHQQALSALSASPGARSYYDAIRARGTSHHAALRQLGNRLVGILHG
CLKTHTPYSEATAWTQKATLDVAA
>Francci3_4093 NUDIX hydrolase
MPVTNDDIAKTIRAHLDAHPEDAESLAPLLEAARGSDAPLASRTTTPGHV
TCGVVAVTSDRQVLQIRHRSLNRWLLPGGHIEPDDASLLDAALRELAEET
GIPRAWANPALERPVDVDAHVIPPNPAKREPEHIHYDLRFLLAIDPPAGN
ANVALQLEEVADYRWAPLTELPGRLSDRTRANLLARP
>Francci3_2088 Recombinase
MNGLSKITASHRSRVAVVYLRQSTLVQVRDNTASTVRQYGLVDTAVELGW
DRENVRVIDADLGVSGTFGADREGFRDLVAQVCLGEIGAIFGLEVSRLAR
SSADFARLLELARLTDALLVDADGVYDLADINDRLLLGLKGSMSECELHL
LTGRLQGAKRAAAERGELRFPLPVGYVYDDEGVCVIDPDQEVQGAIRDVF
AAFAAGGSAFQVVAAFVRRRFPLRAYGGIWAGQLRWGRLTHSRALGVLSN
PCYAGTYVYGRYATCRTVRPDGTVHTGVRLRPREQWPVVRHGHHESYISW
EEYVAIEARLSANCTHQGARPAREGLALCQGIMFCGSCGRPMTTRYYPQG
RAAYGCSSSRADHEATPTCRSIRADVVDDAVAGLLLATVSPGQIEQALAA
ADEVTSRHTRAHRAAELAVERARYDADRAERALGAVEPENRLVARTLETR
WEARLSALAEAEAALAAVRERRPALPDRDGLRALAADLPGLWHNPHTRDR
DRKRLLRTLISDVTLLPETDRARARLGVRWHTGATDELALRRPATSPQVR
RTPAPARQLIARLGPDHSDAEIVTALADAGLTTATGRPYDEAAVAWVRHA
FHIPGRCPFRDGEISVDQVAATLGITANAVYYWLTHDRLAGRKDLSGRWC
IPWNAQVEAACRRQIDASGHLIPRTPGPREVLPGDVTVHEAATRLGVPDD
VVYYRIRIGQLTARRTPSGCLSLPWNPQAEAACRTRIAHPGSGPTGPGSR
PLPHAATAHGDISVQQAATRLGIPTAQVYYWIRRGYLTAYRLNAGRVAIP
WNDQVETACLHRAARTVKVNSTTQTITAR
>Francci3_2367 transposase Tn3
MWLIRALVAVDVPDDDTGQESVLALIKSVPGNVSLDSMLTEIRKMRAVRA
VDLPDGLFADVAPRVVAAWRTRAAVESPSHLRDHPDPLMLTLLAALLHHR
HGEITDTLVELLISTVHRIGARADRKVTEELINAFKRVTGKENILFAIAE
ASLSSPDEPVREVVFPAVAGGEQTLRELVAEFKTKGPVYRRTVRTTLKAS
YSNHYRRGLIALLDVLEFRSNNDTHRPVLDALDLIRRYADTRLTYLPVGE
TVPTHKGLLGDWNELVFTDPAKGPKRVVRGVYEICTFQALCDSLRCKEIW
VVGAEKWRNPDEDLPADFEAQRATHYAALRKPLDPTAFIDQLREEMRTEL
AALDTALPKLPWLEIAERGRNGPIRLTDLDAAPEPRNLRRLKAEVRTRWG
TVPLIDMLKEAVLRTGCLATATTMAGRGDLAPEVLAERLMLAIYAYGTNT
GIRAVAGNAQNGHSEDDLRYVRRRYLTAELARTVAVEIANATFAARAQQV
WGAGSTAVASDSTHFGAFDQNIFTEWHSRYGGRGVLIYWHVERKSMAIHS
QLISCTASEVAAMVEGAIRHGTAMEVEGTYVDSHGQSEIGFGITRLLGFD
LLPRIKRINKVRLYRPAAGEPDAYPRLGPALTRPIRWDVVAEQYDQMIKY
ATAIRTGTASAEAILRRFTRTNAIHPTYQAMIEVGRAQRTLFVARYLRDR
DLQREINEGLNVVESWNRANSVIFFGKGGDIATNRRDEQELSVLCLRVLQ
AALVYVNTLMVQDVLADDDWAEQLTDADLRGLTPLFWTHVAPYGEVKLDM
TSRLTLSVSAAAPLV
>Francci3_0155 hypothetical protein
MFQHQPCGHQRGPLMGRRNYDHLTRDEVDCDFPGGGPARFWLWVFLGEDS
VCFVMDATRSSAVLADHVGLDPDTGQLSDTPGGAARRLVLSSDFYTVYAS
AGRRADGLVNLYCWAQARRHFVRAGDANPAQLGIWARQRVDRIRDLYTAH
DALAAAWHTPATAPSPRAERALAAAYAGWDTAIGVIDTVRQEQMTSPGLQ
EPAKKARATMDREWDGLTAHRGYPMIGLDNNPAERAIRGPVVTRRNAGGS
RTEDAARDAATIFTVTATTTLHGLNLLTYLESYLDACGRTDDTALTGTDL
ERFLPWAASPDDLEAWKQPPS
>Francci3_2380 hypothetical protein
MERPLAYGYFSLTAEDDDEVSRLNRQVKDYADAESYVLAEIYVDRNMPPG
RLIRPALTTLLDCLRHDDRCSVIVPSADHISPWPLIRKAIAMEIQLLGAK
LVVASNTAHEPPGVAAWRAERDTGQQL
>Francci3_0727 serine/threonine protein kinase with WD40 repeats
MRRKRETGNAGTVAPPSASPARPDPASEANPVGPETPSGPAARSGRGTSS
APGFGPVGGSGLTGPSIEPLDSGDPTELGQFTLLGRLGEGGMGTVFLGRG
RPDVAEHAGRLVAVKVIRPDLARVPEFRARFRREADIARRVARFCTAEVL
GVVDPPDGRPYLVTEYIDGLTLAQTVAADGPLRSADLERVAVSVAAALTA
IHGAGLVHRDLKPSNVLLSALGPRVIDFGIARALDAPTMLSQEIQRIGTP
AFMAPEQANGEPVTAAADVFAWGGLVTYAGTGSFPFGDGPTPVQLYRVVH
REPLLDGLAPALRPIVEEAMRKDPATRPSAQELFLRLVGMGPTTHPDPEV
TRVIRAGVSMPAPPQRTDPDRCAPGQMSAPGGSASSGSVGAGPPVDLSAR
DRGRWNWRRVALLAAPLLAVLLIAALIPFALTTGSDRRPSREETAARVAL
AAEAVRNSDANLAARLSLAAYRISPVREARAALRTSFAAATATVLDGHTQ
SALGVDISRDGRLLASTGADNLVQLWDISARSHPVKLATLARHTSWTLDA
AFSPDGRLLATVSYDRSVILWDLGDPRHPVELSVILGHNGWVLDAAFSPD
GKVLATSGYDNTARLWDVTDPRRPSQLSVLDRHTSWVNEVAFSPNGHLLA
TASADRTARLWDVTDPRRPRPLAAITAHTDYVWAVAFSPDGRRLATGAYD
GTARIWDITNPSRPAATASFPADEKWVFDVAFSPDGRTLATAGWDTTVHL
WDVTEPGRPPAIGTITGHGDWVQALAWTPDSHSIATASDDYTVRISRIGD
ADLIAAACADPSKQITDAEWQRHISDVPYQPVC
>Francci3_4164 hypothetical protein
MEFPSLLPLPEWPDAEPALWYGRMSDVDNDARIPDQFARGQRYARLTGEY
WIARAWADDGISAWREDVVRPEFEGFLTTLRTGKHRVVVAWEESRITRDP
VVGAEFGKIMQRVSGRLIVTDGEKATTYDFRRQRDRDAWHGAVGKSVSDS
GLKSELVKRKLDAKREAREFLGGPVGFGWSQTISRKGKKIVTEWSVNEEQ
ARWLREASQRIREGEAVLKVADDFYDRGLRIPHRRTRPDDTMKTGTLTRA
SLTAMLRNPRIAGLFATGNVHRGWTVVGPMANFPAILTEEEWRETCAALE
AVKTRKGTGTAVKHVFAGYYVCHKCKRSLIRNSPRAYALWRHRLGKSREH
VECDQSFHINAADADDLMTRLVDAYLVRRDWEKAGEVADAEELKAERTEK
EQELADLPRAITAKEISLRLGGQVEAQIEARLREIDAELARRARLVTVLD
GQEALRLWRNGTLTEKRRVLSTIIERIIAIPGKDLPLRDRLDPQWRNPSP
A
>Francci3_4113 putative IS630 family transposase
MTRPCPTLVSGMRYADGGGLTAQGRARREVVRVQAADLFAAGVDPVEVAG
RLRVSTKSAYQWKRLWQAGGTAALASRGPSGASCRLSDSQRDRLRVELDR
GPAAHGWPDQRWTLARVTLLIGRLFRTRYTLRGTSYLLHRMGYSPQVPAR
RATERDEEKITAWRAETWAKVRG
>Francci3_2940 protein of unknown function DUF1524 RloF
MKANETTLGELLQGQCQYVVPLYQRPYSWERANLRQLWADITSVAAAGPA
ATHFLGSLVLAPSPSTTPAGVAIWLVVDGQQRLTALSILLCAIRDHVRDD
DQMLAAKIDDLYLMNRYAAGTERYTLLPTRADRTAWTALVERSPEAGGRA
GIGDAYQFFRKELAALRDADDPLDAALIEQAVVGQLAIVEIAAHVDDDVY
RIFESLNHTGRRLTQADLLRNYLFMRLPTRADRVYDWRWFPLQELLGDRL
EDLVWLDLVLRGDDRATKETVYQSQRQYLQTLPDEDAIEQWISELHAKAL
LFSRILDPDREEDPVLRQALHRLRRWGADVVRPIVLHMLIAHANDRLDAT
ETAAALRVVESYLVRRMLVGIASANSNRILMSLVKELGDQTPTAAAVTRV
LSGPRKKFPTDQPVKEAVLLNPFYWTGRGPQRTYVLRSLEEDYEHLEPVD
WGVKLTVEHILPQSLSRPGWKAVLDADAHEGETRDELHRRLVHTLGNLTL
TAYNPKLADHEFTEKKKLLADSGLAMNREIAGQDRWGREEIQNRGRALAE
RIVKIWPGPDESVVPPPQDQRWTLMSRVLAAIPPGRWTSYSDVAEVIGSH
AVAVGAKLASARISNAHRVLLLNGSVSPDFRWPDPERTDDPREILTAEGV
ILSRSGRALARQRMTAAELAAAADLETPEESQGAAGTKERLWAAPAARSA
VGLAVSLWPSERRAAAPTDDANSGLSYRDRVRGCLLGGALGDALGAAIEF
QSLDEIRREYGTRVTRKVCLCR
>Francci3_3389 hypothetical protein
MIHQQRGRPHQPRTVKIGQTAGRPGHPHDRGGAAEGHQELVRIDRHRGTL
TSHGHSPQPAEEARKMVAVATSPARHHQGPLLISTATWPLQKARSGGVGR
VGLEPTTDGLAIPLIRYPSLLAKTC
>Francci3_2342 NUDIX hydrolase
MGGRAAFQVLVLPYRQTGQGTEYALFRRADAAYWQGVAGGGEAGESPAQA
ARRETAEEAGLVGEREFIVLDARATIPVVYVTGEFTWGPDVLVIPEYAFG
VRAEDAEVTLSDEHTEFGWFGLDDAVKVVQWDSNRTALWELDHRLRHGIG
HRAA
>Francci3_1089 transposase, IS4
MSAAIGRSYATAADRRQAQEHGPDGRPPRGGPSAAAAVRHQLDLGLPPGP
AAADGLGGRVLRPGRAGRGRHRLSQGRAASPGVARMYSGTLGKVGNCQIG
VSVHAVTDWASAAVAWRLFLPTCWDDTTLTDPTEVAAARARRERAAIPDK
ARHREKWRLALDMIDELAGWGMPVRPVVADAGYGDAAAFRQGLTDRNIPY
VLAVKPTATAYPADAVPVTAPYPGNSRRPTPAYPDPPRDLKSLVMAAGRR
AGRSVTWRHGTHRTPANPTAGMRSRFLALRVRPAGRNITRNPDRSLPVCW
LLAEWPVGQPEPTDYWLSTLPTGIPLRDLVRLAKIRWRIEHDYRELKDGL
GLDHFEGRTFAGWHRHVTLVRVAQALCTQLRRTPKVPAPA
>Francci3_4122 transposase IS116/IS110/IS902
MQVLFPRCAGLDVHRDTVAAAVRIQTGSGKAVTEVRTFTTTGGSLGLLAD
WLTECRVTIVGMESTGVYWKPVFHLLEDRFECWLLNATHVRNVPGRKTDV
ADAAWISDLVAHGLVRASFVPPKPQRDLRDLTRARRIVVEEKTREIQRLE
KLMQDAGVKLTSVASKLLGVSGRAILEKMIEGEQSLEYLADQARGRLRSK
IPQLQEARAGTFRSGHHGFLAAQLLARIDLCDEQIDELDHRIEVMIAPFR
ETVDRIRTITGVGEVTATVLLAEVGLDMSRFPTAGHLASWAGICPGNNTS
GGKRLSGRTRHGNKWLRTALTEAAHAAARSKDTYLASHHAQVRGRRGVLK
AIGATRHDILIAYWHIIANKTVYQDLGGDWHARRRRDPERRRKNLVGELE
KLGYTVTITPAA
>Francci3_2467 transposase, IS4
MSVVSPMLWLECQQEQVEVSTAAQVAGLVAVLRRVPDWRDPRGVRYELAP
VLALWVAGNIAGHDTTVAVWEWACALPVGVLAGLGFPRRVPSERTIRRIV
EEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGARSRPPQGS
VRQEAVVEAVRHDTGTALGHQRVVAGDEIASVRRLVNRVCDHNTLVTTDC
LHAHEPLARAIRAKGGHWLFSIKGNQPTVRAKLAGLPWDEFGNQHVTREK
AHGRIEERALKALTPSAPSLVGFRGTRQVVKLARTTRRKKTTTSPAATST
EEFYLVTSLSTDQASPAQLARWARGHWTVEAIHHVRDRTMDEDRHTIRTK
NAALNWAIARDTTISALRLAGYKNIRQARRATIRDPGLVLQIIALTSQNR
L
>Francci3_1237 putative transposase
MAQRVRARRLTGEESRQVQAIVRRGTHSSVRARRATIIMASAGGTTVPAI
ARLVAADEGTVRDVIHRFNEIGLACLDPRWAGGRPRRISCEDEAFIIAVA
RTRPTKLGRPFTHWSLRKLADYLGDNPDRVVTVGRERLRQLLGHHRISFQ
RTRTWKEFTDPDKEGNSTGSST
>Francci3_2260 DEAD/DEAH box helicase-like
MTLTDHLPPTSGTGHDGERAAHPKDPHPKDPDSKNPDLTGPDDGPGGGPD
PDALYDAFTRWVAEQGLELYPAQSEALIEIVSGSNVILATPTGSGKSLVA
AGAHFAALAARRRTFYTAPIKALVSEKFFALCAMFGAAQVGMMTGDASVN
DTAPIICCTAEVLANIALRDGVAADVGQVVMDEFHYYADPDRGWAWQVPL
LELPHTQFILMSATLGDVSLFEADLTRRTGRETTVVRSAQRPVPLFYRFV
TTPMHETIGELLETHQAPVYVVHFTQAQALERAQALMSVNVSTRAEKDAI
ARTIGNFRFTAGFGKVLSRLVRHGIGVHHAGMLPKYRRLVEQLAQAGLLK
VICGTDTLGVGINVPIRTVVFTSLSKYDGSRVRLLSAREFHQIAGRAGRA
GYDPVGNVVVQAPEHVIENERALAKAGDDPKKRRKVVRKKPPEGSVGYGL
ATFERLIAAEPEPLTSQFTVNHAMLLNVINRPGDAFASMRHLLTDNHSDR
ATQRRLIHRAIAIYRALLAAGVVERLDTPDELGRGVRVTVDLQLDFALNQ
PLSPFALAALELLDPASPSYALDVLSVLEATLDNPRQVLHAQLSKAKGEA
VAAMKAEGVDYDRRMELLEEVSYPQPLAELLGAAFGLYRRGHPWVDDYEL
APKSVARELYERAMTFAEYVAFYGLARSEGLLLRYLADAYKALRQTVPED
ARTEEVVDLTEWLGELVRQTDSSLLDEWERLSNPAGDGAGATAGTTASGG
GVDERPPAVTGNIRAFRVLVRNALFRRVELAALRRYDTLGELDGGYDGGF
DADGWRAALEPYFAEHGEIGTGPDARGPGLLILDEKPGTPGTWTARQIFD
DPAGDHDWGFSAEIDLTASDEAGVAVVRVTAVDRF
>Francci3_2799 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_2408 SNF2-related
MLSVSWQGPGTAGLRTAVSERSYARGVHYAQQSAVTRIDWDPDENTLQGR
VRGSGGEVYATNAQFSGDGPSSWTFQYGSCTCPVGADCKHVVALVLTATT
GATATASAATGAAAPGPSARRSGSSPRSARPRPAQPRPSRSAAWTEDLGE
LLGSSPAVGGGHGGAAGTTPLAVELSFTADPPAHGGGSELGLRVTARLVQ
PGRNGWVNAIGWDALNAAYQTREYQQPQVRLLQEFFAVYRAHEERYGYSY
YSSYSSGGAKRLDLCAFESRQLWSMLDEAEAVGLRFVHARKKLGDVSRYG
SAELCLNVTQDESTQSLVVAPLLHLGGTATDAAVFSFIGRDGHGVVYVDR
AEVLTSDDHRDWHVRLARLAQPVPPRLQRMAVGNRSLRVPRSEESRFRAE
FYPRLRRLAPVVSSDGSFEPPEVPPPALVLHASYGDDHELDLSWGWAYEV
GGERIHVPLYATEPDGTEPDGTMLPAATPPAAEPDGLRDPRQEQDALRDV
LAALDLPSGTAALLAGDQGSGLRPRVRLRGVDTMRATTELLPLLADRPGV
AVEVSGDPAAYREAGDSLLIGLSTDDVVGETDWFDLGVTVTVEGRQVPFT
DVFLALSQGQSHLLLADGAYFSLQKPELQSLRRLIEEARALQDSPGGPLR
ISRFQVGLWEELAGLGVVERQARSWRQQVTGLLELGAEGQLEQEPPATLA
ATLRSYQLDGFRWLAFLWKYRLGGVLADDMGLGKTLQTLALVCHARQADP
ELAPFLVVAPTSVVSNWAAEAARFAPGLRTVAIRDTQRRRGEALDEAVSG
ADLVITSYTLFRLEHEEYASLAWSGLILDEAQMIKNHQAKAYRCARLLPA
PFKLAVTGTPMENNLMELWSLLSVAAPGLFPNPIRFRDYYARPVEKQNDV
ELLAQLRRRIRPLMMRRTKEQVAPELPPKQEQVLEVDLHPRHRRLYQTYL
QRERQKVLGLVDDMNRNRFTILRSLTLLRQLSLHAGLVDDHHADLPCAKI
DALFEQLTDVVDSGHRALVFSQFTGFLGKVRERLSALGVEHCYLDGRTRD
RSTVLERFKTGSAPVFLVSLKAGGFGLNLTEADYCFLLDPWWNPATEEQA
VDRTHRIGQSRNVMVYRLVARDTIEEKVMAMKDRKARLFSSVMDDGDVFS
STLDADDIRELFA
>Francci3_4425 serine/threonine protein kinase
MDSGRDHSRAVLTVADNAARMGPGRSEPGDPLNPHEPRSVGPYVLLSRLG
VGGMGTVYLARNRAGRRVAIKVIRPDLAADEEFRRRFRAEVEAARKVAPF
CTAEVLDADPNAPAPYLVTEFIDGVRLDMQVDKGPLTSSTLTGLAVGVAT
ALTAIHSAGLVHRDLKPSNVMLSLSGPRVIDFGIAQALEGAKAKPTAWGF
GSAGWMAPEQVHGQPIGPEADVFAWGILIAYAGTGRHPFGDGTDIDLGMR
IVGSAPDLRGLPQPLVGLVSAALAKHPDDRPSARDLLLRLIAQPPSRGTG
AIGEAMGQAEELLSRTGEVPRAAPTRIGPATSAPARQVHWNAGAPAARAG
QGPGQPRQAPPGQSGAHSPRGRPGPRPAQLGERMGGQVPGAPPGPWSGQA
PDRHGPVPGEPGAPYPSFAQPPPARGLDRRRLVWIAVIVLGALIAIAVVV
SLPGKSGQNGSAEQSAATTVPSRSTGPSTAPGPASGLNRPVTDGQLEFIV
DSFRCGQTKLGDGLLARHTDGQFCLAHVTVRNIGDSTRTLVTSSQFLWDG
GGGRHSADFLARFQLSNEDLWDSINPGDLRAGTMVFDIPRDATAQELELH
DNPVSAGARVPVP
>Francci3_0101 hypothetical protein
MRPGVWRPPVEPSAAEQTVIRLVRRAKLFVFLRRYRHELFDEAFQTELAE
VYRDSPKGQPPVPPAQLALALILQAYTGISDDEVIEATVMDRRWQLVLDC
LDTDRPPFAKGTLVGFRTRLINAGWDRRLIARTIEIAMVSGGFGPRALRA
ALDSSPLWGAGRVEDTPNMGHALRKALGVIAAGQGWGLDEGTAVLARRAG
APVLAGSSLKAALDADWDDPGERDHALAVVLAALEAVETFIAAQPAPVGA
AVAVARQVRDQDVETTPAGVARLRRGVAKNLTELHIDRAYLPSTLVRDRD
ENLQVFCKAWRVRNTTGRYDETAFTLNFDHGQLTCPAGVVMPFTPGRTVR
FPAATCAACPLREQCTTSTRGRSVSIHPDETLLAELRERQATPAGRARLR
ERTAVEHTLAHVGHWQGRRARYHGQRKNLFYLRRTAVVHNLHVIARQRND
QQAA
>Francci3_0392 transposase, IS4
MPAAPVLPPAPVLDRLAAVGAGNQPPSPAGLLAVFNQLPDPRKPRGRRHS
LAAVLTLATCAVLAGARSFTAIGEWSADAGQAVAGLLGVSRVPEESTFRR
VLAALDADALDTALGAWAAAATTPPAGTRRRLAVDGKTLRGSRTPDSPGR
HLLAALDHTSGVVLGQVAVDAKSNEIPALPVLLADLDLTDVIVTADALHT
QRQTASWLVSRHAHYILTVKANQPALYAQLAALPWRRVKTAARTVERGHG
RRERRTVKTTEVRAGLLFPHAVQAVQVTRRRQPLADGPATTEIVYLVTSL
PTHQASPTLLATYAREHWLVENRLHWVRDVTFGEDLSQVRTGHAPQVMAS
LRNLAIAILRLTGATNIAQAIRHHARRPERPLETIKSLAC
>Francci3_3079 hypothetical protein
MPATRQNSRNRHTDDHQDPVSGSGIGTRARDAGVLGRASTTARPPEKADQ
TANRKKRGSAGGRPVRHDAALYKDRNTVERGISKIKEWRGLATRYDKTPE
SYAAGLHLRGSILWLRSLPTP
>Francci3_2775 transposase, IS605 OrfB
MLEGGVPRWPGSACPGVEELHRLPVREAEGPAGRFPRFKKRGRARDSFRY
TTGAYGPAGDRQVKLPRVGRVKVHEPMGALTGRLVDGSARLLGATVSRTA
GRWFVAFTVEADRDVPGKPSTRQRRGGPVGVDLGVRHLAVLSTGETVENP
RPLARSLRELRRASRAYARSTPGSAGRRRHAATLGRLHARVAYQRRDGLH
KLTARLAKTHDTIVVEDLHVAGMVRNRRLARAVSDTGMAEVRRQLAYKTL
WYGSTLVVADRWYPSSKTCSDCGWRNPGLTLSDRIFACQSCGLVGGRDLN
AAVNLSNLVAASRSETVNARGADRRTPSAGRAAGKREPGTARAGRTGSAS
PQGEAA
>Francci3_0003 DNA polymerase III, beta subunit
MKFRVERDEFTEAVAWTARTLPSRPTTQLQVLSGLLLDATGPILKIAAYD
YEVAAQCTVHATVSEEGRGLVNGKLLAEITRALPAAPVDLGIDGTRLVIT
CGNARFALPMLPVDDYPALPAMPPITGHIEGSAFAAAVSQVAIAAGRDDT
LPVLTGVRIEIEGDTLTLAATDRYRLAVRTLKWRPSETAAGPDEDGVTGV
DGPPPTPVTVALVPARTLLDTAKSLSGSGVEVSIALGTGPSGETLAGFAG
STRQTTTRLLDGSFPPYRKLLPDSSPLIAQLEIAPLQEAVKRVALVAAKT
APVQLTFSPDHLVLEAGTGGEAQATETLPVTYDGPELSVAFNPSYLLDAL
GALESDVVRIGFASAEDPAVAANKPAILTGKADDDGEVPDYRYLLMPIRL
HG
>Francci3_3606 conserved hypothetical protein 95
MTRIISGTAGGRRLVVPPGTTTRPTSERAREGLFNTLSTCLDLRGARIAD
LYAGSGAVGLEALSRGATHALLVDRDPVVIRTLRRNVTALGLSGAKIAQA
AVERVVQNTSDNPYDVVFLDPPYAMRDSELGEVLSKLLAAAWLTADGVCV
VERSHRSGPVAWPDGLCALRDRRYGEGALWYGIRS
>Francci3_0006 DNA gyrase, B subunit
MCVALRRRQGAPCVRYGGLPGRLVDTSAQNEGTRRVAYDASSIKVLEGLD
AVRKRPGMYIGSTGERGLHHLVYEVVDNAVDEALAGYCDTITVTLLADGG
VRVTDNGRGIPVGMHPTEKRPAVEVVLTTLHAGGKFDGKSYAVSGGLHGV
GVSVVNALSTRLDVEIHLDGHVWFQPYVATRPDKPLAKTGTTRRTGTSVT
FWADPTIFETTEYKYETLSRRLQEMAFLNKGLSITLVDERDEERVAVTYK
YANGLVDFVGHLNATKDTIHRSVISLESKGVGIEAELAMQWNGGYTESVY
TFANTINTHEGGTHEEGFRAALTSAVNAYAKDQNLLKPVKAGAKNSDERL
SGDDIREGLTAIISVKLAQPQFEGQTKTKLGNTEAKKFVQEMCYSALKDW
FEVNRTEARAIVSKALDAQRARIAARQARDLTRRKGLLGGTGLPGKLADC
QYTDPERCELYIVEGDSAGGSAKGGRDSKFQAILPLRGKILNVEKARIDR
VLKNTEVQALIQALGTGIHDDFDIAKLRYHKIVLMADADVDGQHIRTLLL
TLLFRFMRPLVEAGHVFLAQPPLYKIKWGREDWEYAYSDRERDGLVARGV
ESGRKLPKDAIQRFKGLGEMNATELWDTTMDPDRRILLQVTLDDAAVADE
LFSVLMGEDVDARRSFIQRNAKDVRFLDI
>Francci3_4288 hypothetical protein
MPATVDAAAETPLAKTRRARRIVRLLGDIHPDARIALHFDNALELLVATV
LSA
>Francci3_3430 putative transposase, IS891/IS1136/IS1341
MTTLQAYRFALDPNQAQLAGIRRHAGASRFAYNWGLARVKAAHAQRDAEQ
SYGLTGDLLTPVPWTLPALRLAWNAVKRDIAPWWDECSKEAFRAGLDQLA
RGLKNFTDSRQGKRKGRRVGFPRFKKRGKARDSFRYTTGAYGPADETYVK
LPRIGRVKVHEPMGALTGRLADGRARLFGVTVSRTADRWFVSFTVEVDRD
VPERPSRRQRAGGPVGVDLGVKHLAVLSTGQTVPNPKHYQRAERRLRRAS
RAHARSKPGSAGRRQRAAQLATIHVRVANQRHNGLHKLTTRLARSHDTVV
VEDLHVAGMVRNRRLARAVADAGMAEVRRQLAYKTRWYGSTLVVADRWYP
SSKTCSGCGWRNPSLTLSERTFTCQSCGLVLDRDHNAAINLHHLVAASTP
ETENARGADRKTRASGRVAGKREPGTATAGQTGGASPRGEAA
>Francci3_4260 HhH-GPD
MAGMTSTASTASTAGPASPAGPASPAGISSASGPLADLVLGWFAVCGRDL
PWRRPLTSPWAIMVSEVMLQQTPVSRVLPVWEAWLDRWPTPAALAAEPAG
EAVRAWGRLGYPRRALRLHQAATVVVERHDGEIPQHLDDLLALPGIGTYT
ARAVAAFAFRQRHPVVDVNVRRLFARAVEGRADPPATVSRRDLVEIAELL
PPDTETAARASAAFMELGALVCVARAPRCAACPLLGRCAWVSAGSPPSAG
PARRPQGYAGTDRQVRGRLLAVLRDASPAVGQEILDQVWDDPVQRTRALA
GLIDDGLVVRVAPGVYALPS
>Francci3_3800 UvrD/REP helicase
MVVLPAPAPRPGYRLIAAPVAPPRPLVLDAAQRAVVEHGGGPLLVLAGPG
TGKTATLVEAVAARIEAGADPRSILVLTFSRRAAGELRERITARLGAEGG
AGGGPGAWTFHAWCLALLRAHERPAPPGGLRLLSGPEQDSRLRDLIEGSR
EDGRPVWPEPLVGCLRTRGFTEEVRALLARAREVGLEPVALAQLARRTGR
PDWAALAEFYEMYLDVFGFEGAVDYTDLVHRAVVVAESAEGGAWLRGRYR
HVFVDEYQDTDPAQERLLEAVAGGGGNLVVLGDPDQSIYAFRGAEVAGLL
GFPARFPRLDGEPAPIVALRRCRRMAPAPLAASRHVARRIPAAGLPVAAI
RAHRDLVGRADAGAGQVQARTFPGTGAEAESVADLLRREHLENGVAWDAM
AVLVRTAERIGRLRRVLAAAGVPVSADGDDLPVAQEPAAALLLLALRCAE
DPAGALTVDAARTLLTSPLGGADPAGLRALGRALRTLERDAGSEHPAPSA
ELLRAAVAEPDHWLATIPDDLAGPVRRVGGLLRTAGQALRDSSGAPQDAL
WALWSASEWPARLRHASAAGGAAGRAADRDLDAVVALFDAVTRLGQRRGP
GGGVASLVAELTRQQIAGDVLKPAGESVRRRGVRLLTAHRAKGLEWEVVV
VCGVQDGTWPDLRERHSLLGAEQLDAPSRGGLRPPLTRQDLLADERRLFY
VALTRARRRLVVTAVNSPEDDGSLPSRFLEELGVAVEHVPGRPARPLTLV
GLVATLRRLATEPDSSPVMRSAAQARLAALAAARDQAGRPLVPAAHPDTW
WGLLDPTTSDVPVVPVAGPIRLSGSSLSSIGACSLRWFLEHEAYAVTPAS
TAQGFGKVVHALADEVTTGRTPADLAALDARLDTVWRQLDFDARWRSDQE
RAAAREALARFLDWHAAERGRRVIDAEVRFSCDLRVAGRDVQLRGFIDRL
ELDEAGRVHVIDFKTGRTAVAPAELATHPQLGSYQLAVRAGALDDVLAAA
APDQPGADQPRAVPGGAELVQLRRDAGAAAAGPRLPGERPGPPEVQAQSA
LPPHGATWMDEVLDAAVRTIDAEAFRPTPGDHCTLCTFQTSCPARPEGRQ
VVE
>Francci3_1783 response regulator receiver protein
MLTYPGVVDLPESTLTFLAGLLAEDRAQRRTWRKLPPPEQALLVLVHLRK
GERYEQLAEGFQVSVGTVHNYIREAVRLLATHGRTLLAAVWIFAWTQSNF
LILDGTVVRTNRVRAHNKLYYSGKHKYHGINLQGLTDPYGRLIWISEGLP
GSVHDLTAARMHDILDLIDRSELYLYADKGYVGGEGDRLLVPIKKPKNND
LPDRDKEANRTHATTRSQGERGFAVLKNWHIFDRFRGCPRRVGTFAQAAL
VLATEGL
>Francci3_0286 hypothetical protein
MTSPDPKSRKTAILPDRGDLIAELAGRLEALDALLNRLEDAERQAADASE
HLIRTRRWQEDTVRTIQEERARMRQRQHALDELADHARAAVEALAHHRSL
PREVHELAVELQVLDAAGFLTRRGSRSR
>Francci3_1966 transposase, IS605 OrfB
MSRTACRWFVSFTVEVEREVPTGPSRRQRAAVTVGVDLGVRHLAVLSTGE
TVANPKPLARSLRKLRRDGPHKLTTRLATSHETIVVEDLPVAGMVRNRRL
ARAVSDTGMAEVRRQLAYKTLWYGSTLVVADRWYPSSKTCSGCGGRNPNL
TLPDRIFTCPSCGLVADRDANAAVNLRHLVAASTSETINARGADRKTHPG
GRVAVKREPGTAMAGKSVTRTTEVASGGKYGGTRRDTEGVPVGVKSFTRH
GRARHVIPVAIALPGLP
>Francci3_2351 Resolvase-like
MRVGYVRVSSADQNTVRQLDGVDVERVFVDKASGKDSDRPKLGEMIAFVR
DGDTVLVHSMDRLARNLDDLRSTVRMLTGKGVRVEFVKEGLTFTGEDSPM
ATLLLSVMGAFAEFERALILERQREGIAAAKQRGVYTGRKPALTPDQARD
LRERAAGGARKSDLAKAFGISRETVYTYLRAARPAGCGSGSPPSDSRDRA
DGAVTCRYGAACPGSSATPGSSMQPSRTVTGATPA
>Francci3_2433 serine/threonine protein kinase
MPGDAHGVRMQPPRATDPGSLGRHRVLGRLGAGGMGVVYLAEGPLGQVAV
KLIRPEYADDPQFRARFHREVQACFRVGGAHTARLVDFELEAERPWLATE
FVDAPDLAAQVAAAGPLSTGEQIILAAGLAEALASIHAAGLIHRDLKPSN
VLWTADGPKVIDFGIAAAAEARPLTAVGGVVGTPGWLSPEQATGGEVTAA
SDVFGWGALVCFAATGQPPFGSGSADAVASRIAAAEFQIDFDRLAPELHA
PVREALDRQPPQRPTALDLCERLVGHARGARIPVTRVLPDAARTPTPPAA
PQIPAAPQIPASSEPRARRRRRWLLVGAGTVVILALAGALAAVLVIGGDN
SSADNSASGRQSFTADAPWRLAIIDQIEGTDNGCTITVTSDSTGEQKAIK
EVYGAKTFQVPLVGGFHWRANDPGCSVTARSGSGTAVLPFAQQAGTGDTD
AFEVASGLVRVEVVDFAGSDDCGLQLHDAADGREISFGTAARGGPPLLLD
PSGRTQVYLANLTCGVRVSAAPAAPG
>Francci3_1371 Holliday junction DNA helicase RuvB
MSDDGLVSAAASPEERAFEAGLRPRTLAEFVGQRKVREQLTIMLEGARAR
GRPPDHVLLSGPPGLGKTSLAMIMAQELEVPLRMTSGPAIERAGDLVAIL
TALSPGEVLFLDEIHRIARPAEELLYAAMEDFRVDVILGKGPGATAIPLD
VSPFTLVGATTRSGLLTGPLRDRFGFTAHLDFYDADELARVLTRSAGLLG
VTLTAEGAAEVAGRSRGTPRIANRLLRRVRDYAEVRADGVVTREIAQAAL
RIYDVDGLGLDRLDRAVLEALVTRFGGGPVGLTTLAVSVGEEPETVEDVA
EPFLLRAGLLIRTARGRMATPAAFEHLGLDPVTDPLGRTQVSLFTEGE
>Francci3_0393 Recombinase
MRPAITLWSDNRDAAYGSGVVRKLRTIGYRRISDDREGLEAGVTRQDEDI
RELAERRDDVDLIDVLTDNDLSATKDYRPEFERILTMAQAGQIDAVIAWT
SNRFLRNRPDRMRVIELFKARNVRLIPVRGTTMDFTSADGRMMADIIFSI
DAGEAERTAERVARAAQQRAEQGRHHGGPRAYGYGPIVGTDHAGKPVRDF
YALVPDEAAVIRRIAANILAGVPLAAIARTLNAEAVPTVTKAHRPICPNN
VKGRRAWTDCPCPYRGWNATTLKEMMVNPRLIGMRGYQTRSTRRSTGTIA
VMGQAVWPAILDVDTWEQIRAILTDASRRTNTNAQKLRRMLAGFVYCASC
GHKLTGNGTNGTRLYCQRRDGRCPAKVRIGEAFLVGLVSRAVRGRLDELV
LQPVAMDDPAAAELATLEARKSALADRWASGKMDDDAYDDAHKAIVRQIR
AAEARMHTSARRRATLPVEGAVGWDGLGEDIAAKRVILTQLIDGVIVGGN
GTVEGDDVQRVRIVWRDLNDPR
>Francci3_4190 hypothetical protein
MTRGDLTDGEWELIEPYLPLGASGLIPDLRSYVNAVMWRFRTGSPWRDVP
ERYGSWSTIYDRFRLWAQDGVFQTLMDAMITEAAARDDVDLSLASVDSTV
ARAHHHAAGMVVDPDLLVDLEKALTEEKGLQKPGKTTV
>Francci3_3792 UvrD/REP helicase
MMSGPPALAAQAARVRSGNDLLAALDPEQRAAAGAPLGPVCILAGAGTGK
TRTITHRVAHMVAQGGVGPGQILAVTFTARAAGELRGRLRAMGVDGVQAR
TFHSAALKQLLYFWPSVAGVALPKVEKSKIPYVVRAAARLHLRPDRAELR
DLTSEVEWAKTTLVTPADYARMAAASGRETAAPPETVARLYAAYEEVKRD
AGVIDMEDLLLLTAAMIEEHSWVAAEVRARYRHFVVDEYQDVNDLQQRVL
DAWLGERDSLCVVGDPHQTIYSFTGASPEHLLGFPRRFPDAAVVRLIRDY
RSTPQVVGLANTLMRAAPAARLVAARPDGPAPVWLECDSEPAEAAAVASR
IRRLLDQGVPASEIAVLYRTNAQSEAYEAAIGGAGIAYLLRGGEKFFERT
EVVAAMRLLRAAVRSADTDRPEGLTAAVADILAALGWRADQPTVGSGAER
EQWENLAALHRLAGDLAARSPEADLEAFVAELGDRATHEHQPTVQGVTLS
TLHTAKGLEWDAVFLVGLTEGTLPLVHARTAEQIAEERRLLYVGVTRARR
HLVLSWALSRAEGGRRSRRPSRFLDDLRPMRRESAAGVRHGGRGEGTRPV
AGAEETGPGRSRSRSAVRCRTCGRSLTGVHARVGRCEGCPGDVDAALFER
LRVWRSARAKEQGAPAFVVFTDATLQAIAESRPGTVAELVRLPGIGQAKL
DRYGEEVLALVAGRTPPVGSPPVVESLGSDTPPDAS
>Francci3_2533 transposase IS66
MSVLSVTDDVTEVAYWRGRAERAEECAEKAEARVGQLQLRVEELSEQVAV
LSRMLFGRSSEKTGPSSAVDEKPEDRQDSGGGDAGRPARQRGQRPGSRGH
GRRDYSHLQTREEIHDVPEVDRACPGCGVAFTPLGTDDSEQVDWQVVITR
IVHRRRRYRRCCTCPGPRTVTAPVPPKPIPKGRFTAGFLARLLYEKYVLG
LPLHRIARALAAAGLGVAEGTLCGALKDVHGLLGGLDEQIVARNAAAGHV
HADETTWRVFERVEGKDGTRWWLWVFVAADTVVFRMDPTRSAAPVEKHFG
IDRAAGALSDGCRLVVSSDFYTVYQSLGRVDGVDPLWCWAHIRRYFIRAG
DAHPQLRYWADQWVARIGMLYLAHRALAAEQPTTGGYREAAGAFEAALRA
IDTARRAEAAIHSLHPAAKKVLATLDREWDGLARHQDFPDLDLDNNAAER
ALRTPVVGRKNYYGAHAEWAAHLAARVWTIVATAERNGREPLAFLTGYLN
ACATAGGKAPAGPALEPFLTWQTTTQTGSPPSTDPPQDGPPDGPEP
>Francci3_2108 hypothetical protein
MVGAEFGKIMQRIGGRLIVTDGEKATTYDFRRQRDRDSWHGAVGKSVSDS
GLKSELVKRKLDAKREALEFLGGPVGFGWSQTITRSGKKIVTVWSVDEEQ
ARWLREAARRIREGEAVLKVSDDFYDRGLRIPHRRTHPGDTMKSGSLTRA
SLSAMLRNPRIAGLFATGNVHTGWTVKGPMANFPAILTEEEWRETCAALE
AVTTRKGTGTAVKHTFAGYYVCHKCRRSLVRNSPRAYALWRHRLGKSREH
FECDQSFHINAADADDLMTRLVDAYLRRRDWEKTGDVADGDELKAERTEK
ERELADLPRAIAAKEISLRLGGQLEAQYETRLREIDAELARRARLVTVLD
GAEALRLWRGGTLTEKRRVLSTIMVKIIVVPGKDLPLRERLDPQWRYPGP
A
>Francci3_1446 serine/threonine protein kinase
MPAMSLRSGDPERIARFTLTARLGSGGMGVVYLGIDDETGGPVALKVIRS
DFTADPEFRSRFRREVAAARAVDGACTARLVDADPDAEDPWMATEHIHGQ
SLAEAIADRGALAMPVVMALATGLAEALKSIHDAGIVHRDLKPGNVILSE
DGPKVIDFGIAAAVDATAATRTGVLLGSPGYMAPEQVTGRGEIGPPADVF
AWGLTVLFAASGRPPFGAGRPDALLYRVVHDEADTGDVPPALRPAVRAAL
LKEPMARPSAHALLRVLIGSTGDPGRETRRILRDAWLAPPVATRVRSLLP
AVEEDTHDAKTQIRLDAPESAGAGTNGAGTADRAPLGTGGPAGESGQVGE
SGQVGESGQVGELGRVRGSAPGEDDGPAGGRPTGDDHDVAASAEVASAGA
KRPLSGRRARARRHPRRTAGLAALAATLAGLAALSASEAASDLGPRAPGT
AASSRATGGELEPGPHDVPPTPSSRDDVPARSGRTPGPLGLPSPSVIVPP
AGLSSGRPIPGFRGTVGQLTQAKAFTDFVAAHDTQIIFLDISTLAEGNEG
AFYPGPDFGGDNRPNFTLFDACGALGPGEPPGFEPGKECFGSTYRLAEMA
HSGASFGYVQGSYRLRGYFWVDLVPGMHQGFANINLRAVDVRDLPR
>Francci3_0023 CRISPR-associated protein Cas1
MWWLSNPQDLHRVEDRVSTLYVEKCHVDRDDNAVVLVNKERTVRVPAAFV
ATVLLGPGTRITSAAVRLLADSGTALCWVGDRGVRMYAAGLGPSRGAGLL
MRQAYLVTRTSERLDVARRMYAKRFPDDDVTTATMQQLRGREGARIKKIY
RDHATRTGVTWNKRVYTHGDPFADSDDINRLLSAGHSCLYGICHAAIVGI
GASPALGFVHTGAATSFVLDIADLYKADYTIPLAFDLAAAGLTDERDIRT
AFRDKVADGHLMARIIHDIKDLLIEEGTRDNDEDALHLWDELDGHVPGGV
NWAADLADQTDDTTILGVTGPDTDQPPPPW
>Francci3_0815 NUDIX hydrolase
MLRSAMSSKPPATPRQRPDHGTEASLNRPPMARPYAAAGVLFFDEEDRIL
LVEPSYKPGWDIPGGFVEPGESPYSACVREVAEELGIAPPIGGLLAIDWA
PCLNDGWLDSEMLAFVFDGGVLPASWRERIRLDMDEIINCAFVSVDEVGG
LLPSPHARRVRAAAALRGVPQRSGYLEFGNQIPESVGGSVSVASLGEPDA
RGAGAAQVLAGPPACGAVMMSSLANEVMEA
>Francci3_4107 Integrase
MRRRRACPPVPVSEFAGFRFPPEVIVLAVRWYLRYALSYRDVEELLAERG
LEVDHVTVYRWVRRFTPLLVDAARPCRHRPGDRWFVDETYVKVSGRWTYV
YRAVDQHGQVIDVLASTHRDQAAARRFFARALTHGRRPVEVTTDKAAVYP
RVLDELLPGACHVDAARENNRIETDHGRLKARIRPMRGLKRLRSVQTVSA
GHALVQNIRRGHYELGIDSSPHLRLAAAFTELAAAL
>Francci3_3695 protein of unknown function DUF91
MRLVIARCSVDYVGRLTAHLPSAVRLVLVKADGSVSIHADGRAYKPLNWM
SPPCVIAEADGVWRVTNRAAEQLVITLEQVLHDSTHDLGVDPGLRKDGVE
AHLQVLLADRPDAIAPGLTLVRREYETGIGPVDLLCRDADGSTVAVEIKR
RGEIDGVEQLTRYLVRLEADPALPHPVRGILAAQSITPQARLLAADRGLG
CAVVDYDELRGLEPSIPTLF
>Francci3_4192 putative IS6 family transposase
MICRDRAGAYADGARTSAPDAVQVADRFHLWSNLAGYVETTVARHRSCLA
EPPSTEQAVDGPRADLDGAVAAARDAQFEQRAFVRHARERYAAVQELKAA
GVGIKPIAARLGLARGTVRKYYRATSVDDVLAKARDERGSILRPWEPYLT
GRVNAGITNGSQLFGEIRAQGYPGSRAVVLAYLRPLRAGGGTAAPAAR
>Francci3_2363 Resolvase-like
MIAVVTAAFEPAKFLAAEPAVLTHIRIGYARVSTGGQKLERQIDALTAAG
CRRIFAEKQSGRDTDRPQLTACLAFAQPGDTLVVPALDRLSRSLQDLITT
VGDLRRRGVGFTSLHENLDTTTPGGRLVFHVFAALAEFIRELIVTGTREG
LAAARERGRVGGRPTVATPEIIRAARDMLPNPDASVTSIAKLLGVSPGTL
YNHIPDLRELRAAGRTRHQLDAAPPQAS
>Francci3_3436 transposase IS116/IS110/IS902
MQVLFPRCAGLDVHRDTVAAAVRIQTGSGKAVTEVRTFTTTGGSLGLLAD
WLTECRVTIVGMESTGVYWKPVFHLLEDRFECWLLNATHVRNVPGRKTDV
ADAAWISDLVAHGLVRASFVPPKPQRDLRDLTRARRIVVEEKTREIQRLE
KLMQDAGVKLTSVASKLLGVSGRAILEKMIEGEQSLEYLADQARGRLRSK
IPQLQEALAGTFRSGHHGFLAAQLLARIDLCDEQIDELDHRIEVMIAPFR
ETVDRIRTITGVGEVTATVLLAEVGLDMSRFPTAGHLASWAGICPGNNTS
GGKRLSGRTRHGNKWLRTALTEAAHAAARSKDTYLASHHAQVRGRRGVLK
AIGATRHDILIAYWHIIANKTVYQDLGGDWHARRRRDPERRRKNLVGELE
KLGYTVTITPAA
>Francci3_1626 excinuclease ABC, B subunit
MLEAVSTTFERPTIDIERTRAPFQVVSDFSPSGDQPAAIDELARRVGAGE
SDVVLLGATGTGKSATTAWLVERLQRPTLVMAPNKTLAAQLANEFRELLP
HNAVEYFVSYYDYYQPEAYIAQTDTYIEKDSSINEEVERLRHSATMNLLT
RRDVVVVASVSCIYGLGTPQEYIDRMVRLRVGDEIERDLLLRRFVDVQYT
RNDLAFTRGTFRVRGDTVEIFPVYEELAVRVEMFGDEIERLTYLHPLTGE
VVSEAEEIYVFPATHYVAGPERMERAIAGIEAELAERLATMERQGRLLEA
QRLRMRTTYDIEMMRQVGFCSGIENYSRHIDGREAGSPPHTLLDYFPDDF
LLVIDESHNTVPQIGGMYEGDMSRKRNLVEHGFRLPSAMDNRPLRWEEFL
ERIGQTVYLSATPGPYELGRSVGVVEQIIRPTGLLDPEVVLKPTKGQIDD
LVHEIRLRAERDERVLVTTLTKKMAEDLTDYLLELGIRVRYLHSEVDTLR
RVELLTELRRGEFDVLVGINLLREGLDLPEVSLVSILDADKEGFLRSDKS
LIQTIGRAARNVSGQVHMYADAITPSMRRAIDETNRRREKQIAYNTERGL
DPQPLRKKVVDILDDMVRQSADGELIGGGGRSQSRGKAPVPGMKSRAGRE
GAVGRYAAELAGLPSHELAQLIRQLDDQMHEAAKELQFELAARLRDEIAE
LKKELRGMGAAGVQ
>Francci3_2845 DNA helicase, putative
MSARDQVRGRSLRLLDYLAALAAERRGAPRRQLAEYVPAPLLPADVPSHP
GVRLGPTTQRESWLEVARVPQPAAPSWPPPLDRYLAGTVVSPDAPPTIPA
HLLDAEAVEQTGEREPPGVPGEERDAPPGAVALLDAWVRQVWRPWAPEAR
AARDARALYQRLFDLRLRLQREAATVELAWGHSVLCWRVGDATIIHPLLV
TRMVLTVDPDSGVIRLHPDGPVSLETEPLHGLGLPGLDEMSALRDRLRTA
PADPWDAPAQAELHRQILAPLGLDARATTGPVPPPGPSPVLVDTWAVYVR
PRPALQAQFYAELRTALAERDLLPEAIAAVVADDRLVAAALGGDDRADGT
LPEGAHADRGRGRRGHDGSPAGLGERLLMPLPTNADQERIARQLAGARGV
TVQGPPGTGKSHTIANMICHLVAHGQRVLVTAQDEQALAVLRDKIPRELR
DLTVAVLGSSRADLDELRAAIVEISGAVSEVDPARETAAVKALAEELDAA
RATARALELRMIDLLAGEAREFELPHGRERAPEVASWLAERAEELGFVPD
RLDPTRALPLGPAELAELYRTAREITPADARAAAGNLPGDLPDGASLPGA
VALGRLHDELDEIRAGLADLEQAGLSVDALDALDVDGADGADGVRALVED
TRAAADRLRRLSVGWLTTARAQCAASGEQARFWADQATALAAEVAELRRL
TTLTFGRSIELPAGDPRVQLRLLADLRGRFAAGRGIPRLGGRELRDLHDA
VRVDGLAARTGEDVAIVEAEVRRRQTLAAAAARQAAIAAALGGEAIEPAA
PDVLTRLDAVAGGLADAVDWERRAAPELSARLRAVFPAGHPAGHPTGHPV
TTTPSDPDTLSHLAGLLATATGRRREKEIAGALAEVERALAEGGRSPRAG
RVWADLHEALTRRDLATWTALLDESARLAALRPGVERRDRLAARLRAVAP
LWTDAILTGQGDPSSCGEAVRASEAWRWRQAQTWLDDLHSDGDLTTLGRQ
LADASRHVRALVLETARRSARLGLAIRLGDSQRRALTGWVQALDRIGKGT
GKYAPRWRAEARAHMRAAMGAVPVWIMPTHRVMESFDPGADDLFDVVIVD
ESSQCDLLALGVLSLARKAVVVGDDAQTSPEAVGIERAKVHALIDAHLPD
VAQRSLLDVEASLYDTAARVFPRTIVLKEHFRCLPDIIGFSNRFYDQQIL
PLREDPELALGAPLRPVRVPRGARSQTRFGDANPAEAQALVERVLACCAD
PAYDGMSMGVVTLLGAGQPRLIEHTLVERLGEREFSRRDLRVGDPYQFQG
DERDVIFISVVADDNRSAATRRRDQQRVNVAASRARNQMWVFHSVDPATL
RDDDVRRQLIEYMYAGQTGRLDADLADRCESEFERAVLRELLARGYRVRP
QHPVGRYRIDLVVEGVATTRATAEQAVTEGATAGGVGRRGPRLAVECDGD
RFHGPDQWEADLRRQRILERLGWTFFRIRGSEFYRHPRPTLEALVRRLDQ
LGIHPVPVPAPAPAPAGSPTEGP
>Francci3_0122 transposase, IS4
MPVGALSRDAAGVVGAGAVSREGIWQRLHAMGEHRGRRGRVYPLAVLAAV
WLCALTAAGHDRVTAVIEWLAGTTEEERRRLRLPYDPFDGYRLPSESTIR
RFLNDTDDARLARALLDPPLADPAPPKPASPEAAGEAVRAVYALDGKTSR
GAKRADGRQVQLVGVADQATGRLVNQHEVDSKSNETKAFRPVLEPLDLAC
DLLTFDALHTVRDHLDWLVTDKKAHYLAVVKGNQPTLRAFLAALPWADVP
VADTTHDHGHGRDETRTLKAATVEHAEFPHARQAVRIQRWRREKGRKPSR
ETVYGITDLAFEQASAGFLADAARGQWIIENRQHHVRDVTFGEDASRSRT
RRGPANLAIFRATVAHAVRAAGHRYVPAGRRACKTATAALDLHGFP
>Francci3_0001 chromosomal replication initiator protein DnaA
MSNLRADSVAGLPFGDEPSGDPDLAAVWSQAVAGVADGTLSAQQRAWLRL
TRPLGLVQDTALLAAPNEFTKDLLDSRLRPFLSTALSTAYGREIRVAVTV
EHLPDPEPMSGPIRIVRPVDARGDTTPGQGSGPASGSALNAGTGSGSTGA
AAAPVPPTSPGSSAVPVPAPAPAPVPPAPAALVNGELPFPDATEGTPPVR
VSAGLGRDAAPHETEPAQARLNPRYIFETFVIGDSNRFPHAAAVAVAEAP
AKAYNPLFIYGDSGLGKTHLLHAIGHYALKLYPNMRVKYVSSEEFTNDFI
NSIRDDRQQAFQRRYRDIDVLLVDDIQFLENKERTQEEFFHTFNVLHDGE
KQIVISSDRSPKQLSALEDRLRSRFEWGLMTDITPPDLETRIAILSKKAA
TERLPVPPDVLEYIATHIERNIRELEGALIRVAAFASLNKSHVDRTLAEI
VLRDLIPDAGNPDITAAAIMNATAAYFGVSMEDLCGTSRSRVLVTARQIA
MYLCRELTDLSLPKIGQHFGGRDHTTVMHADRKIRGLMAERRAIYNQVTE
LTNRIRLQARQA
>Francci3_0682 serine/threonine protein kinase
MAKSASVPPDTSRHGTPRYVAQRYRLDTPIGRGGAGVVWRGEDELLQRPV
AIKEILVPMAGAQNERDAIRARVLREARALARLHSPAIVSVYDVVEEHQR
HWIIMELVDADSLGDVIRNQGPLPFDQVAAIGLALTDALAAAHSAGVLHR
DVKPGNVLLGRDGRVRLTDFGIAATEGDVTLTGTGALVGSPAYIAPERVR
GSSGTPASDLWGLGATLYSAVEGQPPFEGPETYAVLTAVVEGRRRAFRLA
GPLRNLLSDLMDRPAEERPDVTEIRRRLIPIARNANPVPASVMHARPRTD
EETDASSVDVGLENLDDPAEPAAGAGPAADSRSGSSGGVAGTAGVQAGEA
ASAAVGDQGAVPGPANRPASAAAPPVPADRDVTVDPLLNPLGGAAVAAAG
AAGSRTGRREPDRAGPSNRVIPPDRTTPQNRAAPPGRTRPDPDHDAPTSI
GLNSVAGPSVDDDVTRIGAVTPVSATSDLDLFDDDNHAAGLLPLGEARRR
AEPARHDGHGRGPGDPQRRRQRTIAVVAGAAMLVMGAAIALTVGLTSGGS
TPGPNDVTAQQTTPPPIATIASSPPAVTSRSAASEVVAPDSPRYTPRPRP
SSSATPTTTPTTETPTPTPQPSTQVQTSPPPTQTPPPRVTPSPSRTAPAS
PGGTTPPVVSTTAPAPTGTTTFVP
>Francci3_2037 transposase, IS4
MAERKPYPSDLSDEAWDLIRPVLTAWKARHPSASGHEGGYDMREIVNAVL
YQARTGCQWRYLPHDLPPTSAVYYYFGQWRDDGTTETIHDLLRWLVREHH
RRKADPSAVVLDSQTVRSSTNAPKGTTGLDPGKKSPGRKRGIAVDTLGLL
IAVVVVAASVHDNTIGTALLDRVAAAAPPVRKAWVDAGFKTTVVEHGAGL
GIDVEVVGREPGARGFTPLPKRWRVEQTLGTLMLHRRLARDYEAKPASAV
AMIHWSMVEVMARRLTGAATPTWRDPPV
>Francci3_0007 DNA gyrase, A subunit
MVDVLPPPPGDRIEPIGIEVEMQRSYLDYAMSVIVGRALPEVRDGLKPVH
RRVLYAMYDGGYRPDRGYFKCSRVVGDVMGNYHPHGDTAIYDTLVRLAQG
WSLRYPLVDGNGNFGSPGNDPPAAMRYTEARMAPLAMEMLRDIDQETVDF
APNYDGRSQEPLVLPSRFPNLLVNGAGGIAVGMATNIPPHNLREVAKGVR
WALDHPDASDSELLEALIGLIKGPDFPTSGLIVGRNGIEEAYRTGRGSIR
MRAVVNVEENKGRTQLVVTELPYQVNPDNLAEKIAELVRDNRVTGIADVR
DETSARIGQRLVIDLKRDAVAKVVLNNLYKHTQLQDTFGVNMLAIVDGVP
RTLRLDQMVRYYVEHQIDVIVRRTRYQLRKARERLHVLDGLLIALDHLDE
VINLIRNAESADVARGQLMERFSLSEIQATAILDMQLRRLAALERQRIID
EAAELRAKISDLEAILASPTRQRQIIGEELAEVVEKFGDERRTRLVPFEG
DMSIEDLIAQEDVVVTVTRGGYAKRTKTDLYRSQRRGGKGVQGAALREDD
IVEHFFVTTTHHWLLFFTNKGRVYRAKAHELPEQARSAKGQHVANILAFG
QDERIAEVIAVKDYEAAPYLVLATKRGLCKKTALHDFDSNRAGGLVAINL
RDDDELIAARLVAPGDDLLLVSRNAQSIRFHADDEQLRPMGRATSGVIGM
RFDAEDELLSMDVVVPGTTADLLVATSGGYAKRTPLAEYPVQGRGGKGVL
TAKIVSTRGGLVGALVVDPDDQLYAIASNGGVLRTVAKDVRRAQRQTMGV
RLIDLESGVQVVGVARNADAEDTDARIDAGPQES
>Francci3_1108 transposase, IS4
MSVGALSRAAAVVVGAGAVSRAGIWERLAAIPDHRSARGLVYPLPVLAAV
WLCAVTAAGHDRVAAVTEWLAATSWTERVRLRLPWNPWDGHLLPDEATIR
RFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAGDQAVPVRAY
AVDGKTSRGAKRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEFRAL
LAPLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLRAFLA
ALPWTEIPTADLTRDRGHGREETRTLKVATVTHLDFPHAAQAIRIRRWRR
QKGQPASHETIYAITDATADQASPALLADLARGQWHIEVKQHYVRDVTFG
EDSSTSRTGRGPAVLALFRATVADTLRRAGHRSVPACRRAHKTATAALDL
HGFP
>Francci3_1684 Resolvase-like
MARATSRRKSANRTPQPAVDPLDTVRVGIYLRRSTDDEHQPYSIEAQEER
LRSYIDSQPGWAIALRFSDDASGATTERDDLQRALSAARHGLIDVLLVYR
VDRLSRNLRDTVTLLEELDQAGVVFRSATEPFDTATPMGRMLLQMLAMFA
QFERDTIIDRVIAGMERKAAKGLWMGGNRPFGYQVDRANWKLLVDEKEAP
VVRLIFNLYVKERVGTRAIAKTLNERGHRTTTGGPWSGHQVLRVLDNRIY
LGELTFREITVTDTHKPIIEAAQFAEAEKILTIRSDGHTHRAASDSDYYL
TGRMRCPQCAKAMLGSNAGGRNRTYRYYTCFTRLRYSRDRCDAPRLDADA
LDQAVLTALAAFYRDHQQLISDAVHHARQRHHDAHADRTGELATVQADLT
QTDQAIDRYLSAFERGTLDEETLATRLATLRTKQKQLRRRQTELTAQIDD
EPVMPPRATLSKIAGHIDTIIEVGTDLQRKALVEALIHEVKILGPGRLQP
VFKVPRPEPSETAAAALPATTPPKGAVRTMPNMVERVGLEPTTDGNTVAT
TTSIPSARCRTRRRGVTLFPWTHRRSISPASAGSPTATIPGRTKIPSSSG
SSTAGVRPGHRRRWFISTRPARPAPVGPRSRPRPRTWSARAHSARTPPRW
RQAAGSTPRGPGSSA
>Francci3_1873 transposase, IS4
MVTDLDTGLVPAVGLTPANVPEATVTNTISDDLAAQNLRLTELHIDRAYL
SSTLVRDRDPNLEIFCKAWRVRNATGRYAKTAFTLDFDRGLLTCPHQVTM
PFAPGRTVRFPAKVCAACPLRERCTTSATGRSVTIHPDEALLTELRERQH
TPAGRARLRERTKVEHTLAHIGHWQGRRARYHGQRKNLFDLRRTAVVHNL
HVIARQKHDQQAA
>Francci3_2729 helicase-like
MSVRSEMGTEEGRHGATGPGIGRPASCQTGRAVHQGGYQVPSSTTERAPS
ARRVRQQAMGGTGPGPGAPASDERPDIGALVEVRGQKWVVADIDGPAAGG
ARWIAADVGAADKADAAGPDAVGDGAAPSTSTLVTLQSVEDGRYGHTVEV
IWEVEPGRRVLPSGSLPDVTRGGFDPPGRLAAFLDAVRWSAVTSADARRL
QAPFRSGVAVEDYQLEPVARALAAPRVNLLLADDVGLGKTIEAGLVAEEL
LLRHRARRIMIVCPAGLTLKWRDEMAEKFGRDFTIIDAERCAAVRRSHGS
AANPFEVYPLTIVSLPWLRGPKAQRLLDEVLPGDGQPTYPRTFDLLILDE
AHHIAPAAPRQVYAVDSQQTKLIRRLAPHFTHRLFLSATPHNGYLASFTA
LLEILDDQRFARGVEPDRMAVDEVVVRRLKSSIENADGTPRFPRRRSIDM
PVVYPDGEREIHGLLKEFAALRRARLDSPRGRKATDLVTLLLKKRLFSSP
AAFGHTVGVYRETLVSRHGRPVHASRLADEAEPGMPGPGMPEPWMEDYFD
DVATLDDEQLADAEDDALGRIGPMQLDPTDGEIELLDRMRRWADAHEAEP
DAKARALLDYLNAVCRPDGRHWFDERVVVFTEYRDTQIWLAGLLRQEGLA
GERLGLLHGGLSVDEREQLRLAFQAEPSEHPMRVLLATDAASEGIDLQNH
CHRLVNYDIPFNPNKLEQRIGRIDRYGQRRSPEIRHFVGSGWSGSVDSFE
ADLEFLSRVVKKVAQMKADLGSVNAVLSDRVQRRMLGERIDLGPDDLDVG
PGGSVAAGGTVPVAENVREQVRRLRANLDDTVRELGITPAAVRRVVGTAL
SLARQQALTPYVDEKEHAEGLFTVPTLTGSWQRASAGLTEKLRREDLPPG
TPPRQRPITFDSAMAKDRDDVVLAHLGHRLVDMSTRLLRGAVSNADVGLH
RVTAVRSDDPRCEEVLIGAFSRFVLVGADGVRLHEEVLYAGGWAPATGRF
RRLENLTVLGDLVGTALAAGRPVSGETRERLAERWPAVRPGVLSALEWRT
RTRQESLTRLLGQRRAAEENRITTILDQFAATLRGAIEDPENDEDALFSR
FEIARSREELAQYRKDRQAWQDRLDRLDQERRRELDVIAQRYRDPRQHQF
PVAVVLVVPAAEEAR
>Francci3_2865 helicase-like
MPTRELAQQVNDALEPLTKALNLRLAPVYGGTSISRQISALRRGVHLVVA
TPGRLTDLVERGACVLDGIEITVLDEADFMCDLGFLPAVKALLDATPADG
QRLLFSATLDREVEVLVRDYLPDPVLVAVDSEVSQVTTLARHALEAVDRA
AQVALVTALAGGSGRTLVFVRTQRDADWVAESLSRAGVPAEPLHGGMPQG
ARTRALAGFTDGFYRVLVATDVAARGIHVDGIRLVVHLGPPEDAKTYQHR
GGRTARAGADGADVLVTLPAERGKVRGLLRAVGIEARPIPTTADDPIVLE
LAGPPAPRISKDEIPLARTGGGRGPRGGGTPRGNEHRPARREGHRFGRPA
TGHGRPPRRGAAPGTASTAPAPTA
>Francci3_2961 putative integrase
MVAQTTYSLRHACLSGWLAAGVPPTRVAQWAGHTVQMLLTIYAQCIDGDE
EICRARIAAALSADSA
>Francci3_3153 Tyrosine recombinase XerD
MGVTDGDRVFRLRPPAASASGVGGSSDHRPSAPRPGPDDLAGPDDLAGPD
DLAGPDDLAGPDDLAGPDDPPRPADHRAGSQLSQLRGTPAGVIERYLHHL
EGERGLARNSVLAYRRDLRRYCDHLSACGLPSLDAVGEAEVAGFAAVLRT
GDDTHPPLAAASVARMLVAVRSLHRFAADEGDVPEDVSRPVRPPTPPRRL
PKALSIDQVVAVLAAAAGAPPPGSPPPSGRPQPSGRPQPVEPAEAVRRLR
ATALLELLYGTGARISEAVGLDVDDLDLEAASVRLHGKGGRDRIVPLGRY
AIAAVADYLRVGRPTLVAPRSGPAVFLSRRGNRLSRQSAWSVLRTAALAA
GVEGVSPHVLRHSFATHLLDGGADVRVVQELLGHASVSTTQIYTLVTVDR
LREVYAASHPRALGAALGSIGHHSRTWGPRTDRCVTGPCMTDP
>Francci3_3167 DNA-3-methyladenine glycosylase
MAPTDGAAPDGVDFYDRPVLAVAPALLGATVWHGPVAVRITEVEAYGGLD
DPASHAYRGPTPRAAVMFGPPGRAYVYLSYGVHWCLNVVCGPVGSASAVL
LRSGEVVAGRDLVAGRFPRLVEADLARGPGRLGRALAVTGALSGTTITGP
GPVTVALAGGRGIRPPGPPGISGGRVRRGPRAGIRVATEWPWRFWLAGEA
TVSGPRPPRRPR
>Francci3_4226 hypothetical protein
MTRGDLTDGEWELIEPYLPLGASGLIPDLRSYVNAVMWRFRTGSPWRDVP
ERYGSWSTIYDRFRLWAQDGVFQTLMDAMITEAAARDDVDLSLASVDSTV
ARAHHHAAGMVVDPDLLVDLEKALTEEKGLQKPGKTTV
>Francci3_2078 transposase IS66
MLRCVTVVESGAGAAASGEVAEGAALLAENAWLRARVAELLTDIAGLVAR
EATREAEVVELRLQLEALQAELATLRRMLFGRSSERECGGSPAVGSPDGG
DGCGDGARGEAAGSAGRRRGPGARSGRRSYDHLSRDEVDCDFEGGGYGCL
SCGQPFTPWGEHVVEQLDWLVTVRVRVSRRRRYRRGCRCGGSLTVTAPGP
SKAIGKGLFTHRFLAMLIVERYVAGRSQNSLVTGLARHGAQLSPATLTGA
CAQVAGLLAPLAEQIVGRSRGSWHLHADETTWRVFTPTGGGGPARWWLWV
FLGPDSVCFVMDATRSTAVLAEHVGLDPDSGQLTDDADGGPRRLVLSSDF
YTVYVSAGRRADGLVNLYCWAHARRYFVRAGDANPAQLGIWARQWVERIR
ALYTAHGELAAAWHTAAAAPSPATEKRLAAAYAGWDTAITVIDTVRREQT
ASPGLQEPARKALATLDREWDGLVAHRDYPMIGMDNNPAERAIRGPVVTR
RNAGGSRTEDTARHAATIFTVTATAAMHNLNLLTYLENYLDACGRAGGKP
PTGADLDRFLPWAASPEDLTTWQQPPG
>Francci3_3998 transposase, IS4
MREIVNAVLYQARTGCQWRYLPHDLPPTSAVYYYFGQWRDDGTTETIHDL
LRWLVREHHRRKADPSAVVLDSQTVRSSTNAPKGTTGLDPGKKSPGRKRG
IAVDTLGLLIAVVVVAASVHDNTIGTALLDRVAAAAPPVRKAWVDAGFKT
TVVEHGAGLGIDVEVVGREPGARGFTPLPKRWRVEQTLGTLMLHRRLARD
YEAKPASAVAMIHWSMVEVMARRLTGAATPTWRDPPV
>Francci3_2874 serine/threonine protein kinase
MLTPLTTDDPRRIGPYRLANRIGAGGMGVVYLGFGTDGRPAAVKVPSAGL
ADDPEFRSRFRHEVDAARRVRGSAVAAVLDADLTGQRPWMATEYVEGRNL
ADAVATRGQLDDRLVQGLAVGLADALVAIHAAGVVHRDLKPANILLTWDG
PKVIDFGIARAGDNTSHTRTGMLIGTLVWMAPEQLRGERAGPAADIFAWG
ACVTFAAAGRPPFRGERAEAIGLQILTAEPNLDGLPASLVGVVRAALDKE
PARRPAATELLARLVGHDVRSPAESDRASETALARWWSLPPTPPDGEGPH
GGYRDAGYGPDPAHHRASAHHRASAHHRASAPPTVSHGADGWEDDGWRVG
SGRGGSGGGGRRGAVVAVAALLAVLLSGGLAAALVLNRSDDTPTITGSTP
DPLATAGTAAGTAGTAAGTTGPLQAAGTTGPLQAAGTTGPLQAAAPPATT
GPASARARASASASPTPVATTVATTVATLSAADAAATVRRKGYEPDMSTY
AADRRLNVVIGTGQPAGDTPRQLAFVFADGEYRGTDTKAPSARITLQRQR
NDHEVVLRYATYDPRDPVDAPSGHADVRFRWTGTIFSTLDKIPPSDPNVS
GSRR
>Francci3_3474 hypothetical protein
MVWNKRGNTGGGASSGSGGTKRGGSGRTPLNDGERSTTLTCPRCQGEGQI
GNPAAEKDEDGTFTGRDLIKCPRCGGSGTVKSK
>Francci3_0738 DNA polymerase III, beta subunit
MRFSVERDAFADATGWAARHLPNRPGPARNVLTGILLAAGHGSPASPGSA
AVPLPSGSAASPGSAAVPLPSGSAASPGSAAPLGAATASVLTICAYDTEV
AVRAPVAASVDEPGRVLVPGRLLTDIVRSLPATTVDVVVDGDRLVLRCGS
VRFTVPLMDPGEYPQLPSFPGPIGEVDTLGFASAVGQVAPAAGRDETMPV
LTAVRMEISGRGLALVATDRYRLAVRGIPWQPTVDDPWALAHVPAKVLAE
VARTPTSASRIVIGLDLDDPSGARLGLAAGGRQTIVRLIEGTFPNYRKLL
PEAAALVVTAQTAALAAAVRRVAVVATRTGPLRFTFTHNQVVAEAGDGGG
AQASETIPVEYAGPELSVLFNPSYLLDGLGAVEDDQATIGFVDDDPVEAA
AKPAVLTGKDSGRGAVDEGAYRYLLMPIRHGGG
>Francci3_1643 serine/threonine protein kinase
MTVDGVGIGPLVAGRYRLVERIGSGGMGTVWRAHDDVLRVEVAIKEIRVS
ADLDDDERAAGVETAMREARNAARLRGNPHVVTVHDAVEHDGLPWIVMEL
VRAGTLATTVNRDGPLPPERAIQVGLAVLDALVAGQRMGVLHRDVKPSNI
LLADDGRVLLTDFGIATHAADPTLTGGIGSGGTPAYMAPERLLGGPATLA
GDLFALGATLYFAVEGVSPFQRDTLPTTIGAVLHADPPPFLRGGRLSAAI
AGLLAKNPASRLRAEGAQALLTWAASHPADSAPASLVSPASLVSPVSPAS
PASPAASLPPSPAPDAVIRRRHGSRPVPFPPAWPGTARPPGAVGWVRSWR
APRLMAGAMILLVLLAAVAAGAYQLFGADDGGDGDRRRASPPTVATRDPA
GVGSQPGAEAGEALPPGMIGSWSGSVTQAFVHFNAELVLRGGRIGEVIGT
SAYPESGCAGELVLRGVSGASVRLEERLTRVGALCFAATWLDLVLHGDGT
LDCSYPATEISSAGQATMRRSAPPMSPSSPAPPG
>Francci3_1225 putative exonuclease
MRPHHLTLAAFGAFPGTVEIDFDVLGSGGLLLLCGETGGGKTTLLDAVGF
ALFGRVPGMRGEVSGPPDLRSHHAAASLRPEVTLEFTVAAGRFRITRGPA
WDKPKRGGGTTRAHPTARLERFDGAGGWETVATRMEDVGHEIDLLVGMSD
KQFFQVIMLPQGRFADFLQADHGAREKLLKRLFHVNRFEYAEQWLLDQAK
IAAERLALARAELDRVTARVSQVAAVDEPEDPTADAGWASDLARTAAAAA
ISADEAAGAAAARRTAAEEALDQARDLARRVGQRRVLAARQEELAAQAPR
IELLATELDAARRAAVVAPALGEVHRRAAEVHRAEAAERDARDRLGRHPS
GLLPGEPAPSGPLPEEGTGRPAEAPAEAPAEELARLARLAHTETGRLGTL
ADTLAAAERDAEEAAGADQDTAAYTRSAAELAEAISVTLPRARVAAEARV
EVARRAGAALPGLVERARWAKELASAVREGRQVRTVADEAERDASAARTH
ASDLRQQRFDAITAELAAALVHDTPCPVCGALEHPDPAETRADHVSKDAE
TAAGQEADRLADAATRASRAVAHWESRVRALHADLVGPTDPDHSANPDHS
ANPDHSANPDHSANPDDRVEAAFAEIRALPVATLLGGSGAPVADRLDELA
AVLTHAVRARTRTAKKLAAAEAALREVHENEKETAARHSAARTAAQAARE
RAADARDRAARRLAGVPAELCDPDALAARRRAVTALAADHEAAQAAALAA
EQARAEHIRAGTAALDQARQAGFSDLDDAAEAVRDSDWMRRAADEARAHR
DELVAVGARLAGEDLAVDPDTEVPLADHETAVTDAREVHESALATAARAR
ERAERLASLVTEFTEKLTTLDPLREAADELRGLADLAAGRGANTERMPLS
SFVLAARLEEVAAAASHRLAAMSSGRFTLVHDAGESRDKRRRAGLGLLVD
DAWTGRRRDTATLSGGETFQAALSLALGLADVVTAEAGGRRMDALFIDEG
FGTLDPDSLDEVMTVLDELRSGGRLVGVVSHVTELRQRIPNQIRVVKGVG
GSRVETTS
>Francci3_1272 DNA repair protein RecO
MYCHTYPPAMDRDVLRVPQENIRPVPAICRATSLHRMNSGPCNHGRVPVY
RDEGVVLRTAPLAEADRIITVLTRRTGRVRAVAKGVRKTSSRFGSRLEPG
TYVDLLLHSGRALDTVTQADIISPYGATIAVDYPRYTAAAVMLETAERLT
SEERQPALRLFLLLVGGLRTLAGDERPPALVLDAFLLRALAVSGYGMALD
HCARCGGPGLHLSLSVPGGGVVCPQCRPHGAASVSAGAVRLLADLLRGDW
DGALVSDARARREAGGIAAAYLQWHLERGLRALPYLERA
>Francci3_4081 putative transposase
MRRRRACPPVPTSEFAGFRFPPEVIVLAVRWYLRYALSYRDVEEPLADRG
LEVDHVTVYRWVRRFTPLLVDAARPYRHTPGDRWFVDETYVKVAGRWTYL
YRAVDQSGQVIDVLASEKRDLAAARRFFTRALSHGRRPVEVTTDRAAFYP
RVLDEQLPAAHHVDDQYANNPLEADHGRFTARLRPMRGLKRLRYVQIIGS
GHAFVQNIRRGHYELGVDADPGTG
>Francci3_3583 phage integrase
MSPGAVSRTPRRLDENHRCCHGRLMVLRGESAEEPADGRLLPRQRSRATC
GSDARPVEIVETPESDWDTAVAGFLRHLIAELARSPETVRAYAADLANLR
AHAERMGCSVLADLDLALLRSWLASMRAAGAAPASLARRASMARVFSSYA
ARHGFLDTDVAARLVGNRTVRRVPEVLTAAAARQLLENPSPDVSPPGTSQ
PSGLPDSTADSERVVRRAVELRDALVLELLYGSGIRVSELCGLDLGHIDD
DRRLLRVLGKGGKERSVPFGVPAVAALRSWRSTGRPRLMTARSGRALLLG
LRGGRLDPRTVRRILTTRVAAGVAPAGLTPHGLRHSAATHMLEGGADLRS
VQEFLGHASLATTQIYTHVTPERLRAAFEQAHPRA
>Francci3_2051 transposase IS66
MLRCVTVVESGAGAAASGEVAEGAALLAENAWLRARVAELLTDIAGLVAR
EATREAEVVELRLQLEALQAELATLRRMLFGRSSERECGGSPAVGSPDGG
DGCGDGARGEAAGSAGRRRGPGARSGRRSYDHLSRDEVDCDFEGGGYGCL
SCGQPFTPWGEHVVEQLDWLVTVRVRVSRRRRYRRGCRCGGSLTVTAPGP
SKAIGKGLFTHRFLAMLIVERYVAGRSQNSLVTGLARHGAQLSPATLTGA
CAQVAGLLAPLAEQIVGRSRGSWHLHADETTWRVFTPTGGGGPARWWLWV
FLGPDSVCFVMDATRSTAVLAEHVGLDPDSGQLTDDADGGPRRLVLSSDF
YTVYVSAGRRADGLVNLYCWAHARRYFVRAGDANPAQLGIWARQWVERIR
ALYTAHGELAAAWHTAAAAPSPATEKRLAAAYAGWDTAITVIDTVRREQT
ASPGLQEPARKALATLDREWDGLVAHRDYPMIGMDNNPAERAIRGPVVTR
RNAGGSRTEDTARHAATIFTVTATAAMHNLNLLTYLENYLDACGRAGGKP
PTGADLDRFLPWAASPEDLTTWQQPPG
>Francci3_2522 transposase IS66
MSVLSVTDDVTEVAYWRGRAERAEECAEKAEARVGQLQLRVEELSEQVAV
LSRMLFGRSSEKTGPSSAVDEKPEDRQDSGGGDAGRPARQRGQRPGSRGH
GRRDYSHLQTREEIHDVPEVDRACPGCGVAFTPLGTDDSEQVDWQVVITR
IVHRRRRYRRCCTCPGPRTVTAPVPPKPIPKGRFTAGFLARLLYEKYVLG
LPLHRIARALAAAGLGVAEGTLCGALKDVHGLLGGLDEQIVARNAAAGHV
HADETTWRVFERVEGKDGTRWWLWVFVAADTVVFRMDPTRSAAPVEKHFG
IDRAAGALSDGCRLVVSSDFYTVYQSLGRVDGVDPLWCWAHIRRYFIRAG
DAHPQLRYWADQWVARIGMLYLAHRALAAEQPTTGGYREAAGAFEAALRA
IDTARRAEAAIHSLHPAAKKVLATLDREWDGLARHQDFPDLDLDNNAAER
ALRTPVVGRKNYYGAHAEWAAHLAARVWTIVATAERNGREPLAFLTGYLN
ACATAGGKAPAGPALEPFLTWQTTTQTGSPPSTDPPQDGPPDGPEP
>Francci3_2634 putative RecB family exonuclease
MTSTQRSPAIPGSVASAAPRPLIGSLSPSRAADFVNCPLRYRFRVVDRLP
EPPSEAATRGTVVHAVLEKLFDLPASRRTLQAARELVEPAWDTVRAREPA
VEALFGGRDELAAWLESARELLAGYFALEDPSRLAPVARELYVEHVLDSG
LRLRGYLDRLDEATTAQGPALRVVDYKTGRSPGPAFESSAMFQMKFYALL
LWRIRGVIPRELRLYYLGDRTWLRATPDEADLRAAERRIEALWAAIDRAH
RTGDWRATPNRLCSWCDHQARCPAFGGTPPPLPQQTSAVPLDETAPTVCE
ADG
>Francci3_2130 transposase, IS4
MPAVDSERLTDRISIGVLARIVPRDLVDEALVETKRQERRTRLLPARVMV
YFTMAMCLFFDDDYEEVMRKLAGALRWLGNWKGDWQVPTSGAISQARIRL
GAAPLKLLFERVAVPVAGRGTKGAWLRSRRLTAIDGFFLDAADTPENVAR
FGRHTNGHKASALPQVHVVALAECGTHAIVAAAVGPRASDERTLAATLFD
ACEPGMLLTADRNFYGRDLWHQALDTGADLLWRVRSNLALPVIQPLPDGS
YLSIVINPKLSGKRREQLIVDARAGRDFPEDYATPVRVIEYTVPDRTGAG
TGELICLITNILDQTDISAVELATGYHERWEIERIFDEVKTHQRGEARSV
SIGQCTTYESGHRSGLVGTWPTDLEVRCAARC
>Francci3_2330 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_1631 excinuclease ABC, A subunit
MADRLVVRGAREHNLRDVDLDLPRDGLIVFTGMSGSGKSSLAFDTIFAEG
QRRYVESLSAYARQFLGQMDKPDVDFIEGLSPAVSIDQKSTSRNPRSTVG
TITEVYDYLRLLFARAGRPHCPKCGRLISRQTPQQIVDRLLELPEGTRFQ
VLAPVVRGRKGEYVDLFAELQSSGFARARVDGTVVPLTDPPKLEKQRKHT
IEVVVDRLAIKSSAKRRLTDSVETALKLGGGLVLVDFVDRDADDPERERM
YSEHLACLYDDLSFEELEPRSFSFNSPYGACPECTGIGTRKEVDPELVVP
DPTLSLAQGAVAPWSGGHNKEYFERLLTALAEDLSFRMDTPWEGLPERAR
KAVLYGSGDTEIHVGYTNRYGRKRSYHTSFEGVIGFLGRRHREAESDSSR
ERYEGYMRDIPCPACRGARLKPESLAVTLGGRSIAEVSGLAIGECAVFLR
GLDLTERERTIAGRVLKEVDARLTFLLDVGLDYLSLDRAAGTLAGGEAQR
IRLATQIGSGLVGVLYVLDEPSIGLHQRDNRRLIQTLLRLRDLGNTLIVV
EHDEDTIRASDWVVDIGPGAGEHGGRVVVSGPVEKLLASEESMTGAYLSG
RRRIPVPDIRRVPAKGRALTVHGARQHNLRDVTVSFPLGCFVAVTGVSGS
GKSTLVNDILAAVLANKLNGARQVPGRHRTVSGLDNLDKAVRVDQSPIGR
TPRSNPATYTGVFDHIRRLFAETTEAKVRGYLPGRFSFNVKGGRCEACSG
DGTLKIEMNFLPDVYVPCEVCHGDRYNRETLEVHYKGRNIAEVLDMPIEE
AAEFFAAVPAIARHLRTLNDVGLGYVRLGQSAPTLSGGEAQRVKLASELQ
RRSTGRTVYVLDEPTTGLHFEDIRKLLGVLGRLVDAGNTVIVIEHNLDVI
KTADWIIDLGPEGGTGGGRVVAQGSPEAVAAVEESHTAVFLREILSDRVA
EVGPLPPSQVRSRSARTASSSCSVCS
>Francci3_1018 ISRSO5-transposase protein
MPWGRGWPGRTGRVSRRATAITLTCDVRAVLAGRARSLTVPRRDWLRAAI
VLAAADGASNTTIATDLGVCEDTVGKWRGRFAREGLAGLVDRPRSGRPAR
FTAVQVAEVKALACTRPADVGAPLERWSNAELARHAAREGIVAGVSASSV
GRWLARDAIRPWQHRSWIFPRDPAFVAKASRVLDLYARIWDGAPLGENDY
VISADEKSQLQALRRCHPTAPAQARHGPRVEFEYQRGGTLAYFAAYDVHH
AHVIGRIEPTTGIAPFHRLVDQVMTAEPYASARRVFWVVDNGSSHNGQTS
IRRMSAAHPTATLVHLPVHASWLNQVEIYFSILQRKAIERGDFADLDALG
DRVMGFQDLYNQTAAPFDWKYTRADLLKTASGLALAA
>Francci3_0958 hypothetical protein
MGTHRYRSGLGDAGWVVAEAVLPPGHRGRHPVRGYVARNGIAWRALPAGF
PAWRAVCGFFDRWKNKGVTARAQGASRGGPGPRGSWTILKTTVETTRENP
GRKTSGALPGRGVVERTLAWLTAHRRLAHGYGRHPAASESFINGLVRAGP
R
>Francci3_3217 AAA ATPase, central region
MSLFDADLSDGAGAGAGASDASEAGTASRSASGFAALRDRRGGGPGVDRA
APLAARLRPRTLDEVVGQRHLLGPGSPLRRLVEGGATTSVVLWGPPGTGK
TTLAHIVSRATGRRFRELSAVTAGVKDVRAVIDEARETLSTSGARTVLFI
DEVHRFTRTQQDALLPSVERGWVTLVAATTENPFFSVVSPLLSRSLLLTL
EPLTDDDIRGVLTRALRDPRGYDGRLSVSDEAREHVVRLAGGDARRALTT
LEAGAEALLEAHECESGPGGKAGAGADAGADAEAFVNPRPRLDLDLLERA
VNRAAVRYDRSGDQHYDVISAFIKSMRGGDVDAALHYLARMLEAGEDPRF
VARRMIILASEDIGMADPGALGVAVAASQALEFVGLPEARLALAQAVIHL
ALAPKSNAVIRAIDAATSDVRAGRAGPVPAHLRDAHYRGARRVGHGAGYR
YPHEAPGNVVRQQYPPDGLTSTDYYQPSGNGFERRAADRVHELRGVVRGD
PPPSPGSSPPPSPPPSSPS
>Francci3_2551 Recombinase
MYEGALEDLKRGATKTGKPLDGLIVSDVDRLTRDPRHLEDAIDVVVQYGR
PVIDISGTLDLLTDNGRSVARIVVALKNQQSADTSRRVRTAHRELAKAGV
PVGGYRPFGWEPDKRTIRKAEADMIVVGADEILAGVGTHTLCRRWNELGI
LTTRGHQWQRQVMKNMYLSPRLAGYRVHGPTTVPLEQRYACTVDGQPVMG
LQGHILDVEVWEKVVAKLRDPARTGNQNIHIGGRKYLLSGIIRCAYCGAR
LTGGWDKGWKKHHYSCRPVTAGGCGSVAVTGHHVDELVTNLVLAYLANRD
VEAESGPWPKAAEAEIAEIMAAWRETKRGGTRALQMVEELEGDVAKLRGE
RNDWLRAHSGPQLTNVASSWPRLEVEQRRDIIATVIEAVVLSKADGPKNR
FDPDRITVIWRP
>Francci3_4067 transposase, IS4
MPVGALSRDAAGVVGAGAVSREGIWQRLHAMGEHRGRRGRVYPLAVLAAV
WLCALTAAGHDRVTAVIEWLAGTTEEERRRLRLPYDPFDGYRLPSESTIR
RFLNDTDDARLARALLDPPLADPAPPKPASPEAAGEAVRAVYALDGKTSR
GAKRADGRQVQLVGVADQATGRLVNQHEVDSKSNETKAFRPVLEPLDLAC
DLLTFDALHTVRDHLDWLVTDKKAHYLAVVKGNQPTLRAFLAALPWADVP
VADTTHDHGHGRDETRTLKAATVEHAEFPHARQAVRIQRWRREKGRKPSR
ETVYGITDLAFEQASAGFLADAARGQWIIENRQHHVRDVTFGEDASRSRT
RRGPANLAIFRATVAHAVRAAGHRYVPAGRRACKTATAALDLHGFP
>Francci3_4009 transposase, IS4
MHLATDRRCRPLSIILTPGQAADSPRFLPVLKKIKVRGPVGRPRTRPDAV
AGDKAYSSRANRAHLRTRKIQAVIPEKADQTANRKKRGSAGGRPVSHDAT
LYKDRNTVERGINKIKEWRGLATRYDKTPVGTDPNCRHLTCPCQGRHRSS
IPGGQSLLSEPEGRAEGCEPVLAGLVVRQGAFDQSPEDWCVVSFVSVGEF
VDEDVVDETDRELHGRPMDVDSPGGAERAPSVAEVAHVEAGDVDTHAAGP
GTDAGW
>Francci3_1856 Excisionase/Xis, DNA-binding
MTSPTRPEPLHDETYLPGEDTGEIIDFLAALRDRGRQTADPRPRLTGPDG
HSVELPEPMFNVLLQVAAAMKAGLAVTVAPHHLTMSTQEAADLLRISRTT
LVRLLESGAIPFEKPSRHRKVRLDDLLEYRRRQRHAADLAFADMVADTER
LGLYDVSPDETRAALKAARKKTEG
>Francci3_4151 actinorhodin polyketide synthase bifunctional cyclase/dehydratase
MTLAPFLRREHEIRVGAPAGFVYRLLAGLEHWPSIFAPFVHAERLGTDGE
LERVGMWTTSGTRVERWVAFRRCQQEDLRIWFRVEQPPPPLESMERAWTV
VPVSDGECLLRLAHELRVTAGVSADIGAVVRQVDVVAEAETAAVRAAAER
AVSAPDFLLTLHDDVRVDGPADRVYDFLYAVDRWPDRLPHVTRVDVQQDE
PDVQLVEIDTAERRGGVLTTRTARVGRPPRGIVYKQLRLPPLASSHHVRW
DIEDGGDHTVVRSIQTVVINGPGIAELLGASTSLEQARSFIRTELGSKVR
LILDQAGQHVADAAG
>Francci3_4518 replicative DNA helicase
MSVTEISRGSGGSAEFDRTPPHDLPAEQSVLGGMLLSKDAIADVVEVLRT
GDFYRPAHGLVYEVIGDLYGRGEPADVISVAAELSRRDLLERVGGPAYMH
TLISSVPTAANAGYYARIVAEKAVLRRLAEAGTRIVQLAYGAAPDVSDVV
DRAQAAVYEVTERRTNEDYLPLGELLNPALEEIESIQGHDGSLTGVPTGF
VDLDELTNGLHGGQLWIVAARPAVGKSTLGLDFARAASIKNGMASVIFSL
EMSRMEITMRLLSAEARVSLQNIRTGRLTDDDWGRLARRIGEVAEAPLFI
DDSPHLTMMEVRAKARRLKQRNELRLVILDYLQLMSSPKKVENRQQEVSE
ISRSLKLLAKELDVPVVAISQLNRASEQRADKRPQVSDLRESGSLEQDAD
AVILLYREDTVEKESARAGEADLIIAKHRNGPTGTVTVAFQGHYSRFVDM
AN
>Francci3_2646 Exodeoxyribonuclease V
MPPQRRPGYHATQRATTAAERFVVTLPGMPDEYTTASTTEPDAVGPQARA
DPQPADPEEVFAAFCGAGLWPGLGRTTAGRLPAAGITRPDHVDVGRLGTV
EGVTGPRARRLADSFRAVAGTYAVVELLVAADLPARLARGVTDLLGPASA
DLLRADPWMLLTAADTEIAQADRFARRRGLQRDDPRRGPAVLTHLLGRAA
SRAGDTAGPVQAVLRAAAREGVADPSAALTVALDDGRIITVGDRIALERY
AMAEQSVADGIERLTATAEPLRAGAPRARRRPDDGPGYDGPGYDGLGSVD
DDDPAGRAVSAWAATAPPPTALTFDDDKDDDKDGGDESPLGADGTEGDTN
RDAAAAGRGGPANDGGPANDEVVTGLDELQLAAARAALESGVSVLTGGPG
TGKSRTVAAVVRLAWAAEAVVALAAPTGRAAKRLEELCGAPASTLHRLLQ
AQGRGSGFARGEHNPIDADLVVVDEASMLDAELAAALLDACADGTHLMFV
GDPAQLPSIGPGQILADLLESGEVPVTELRRLYRQADGGAIATMAAAVRG
GELPPPGPGREVVVVPSGSSGEAAHRTVQLVTNSIPRALGIPVADIQVVT
PVHAGPAGTTALNTALKGTLNPGGGAVSGFDIGDRVVATANHIDVGFANG
EIGTVVALGERGALRVAFPGGVVEVPAGVLGDLRHGWAVTVHRAQGSEWP
AVVAVFPPEAGRMLTRPLIYTALTRAQAHLSIVAVNGPAIRNAVRSSGGR
RRTTLLPALLAGQTAGGFDDPDEDPDEDPDGGPDGRPDGGSDARRAGGAT
AGEAARVRDGDERMTV
>Francci3_1621 DNA polymerase I
MSVTTSSPTSSPSGSRSAASATGATGATGATAATAAGPAVSSPSPAPSSP
AKSTPATPRLLLLDGHSLAYRAFYALPVENFSTTTGQPTNAVYGFTSMLI
NVLRDERPTHVAVAWDLPTPTFRHTQYAEYKAGRGETPADFVGQVSLIHQ
VCDALAVPGVSAPGYEADDVIATLATLGAAEGMDVLVVTGDRDALQLVDE
RVTVLMTRKGISDMVRFTPDEVQAKYGLSPVQYPDFAALRGDPSDNLPSV
PGVGEKTATKWIQQFGSLAELVDHADEIGGKTGASLRAHLSEVIRNRSLT
ELSRDVPLDVVPAGLRMRPWDREAVHQLFDTLQFRVLRERLYAALAIAPP
PADEGFEIELTVLGPGEVARWLAEHAHRVGRTGLHARGTWGRGTGVLAGL
ALAAAGGAAAWIDPTLLTPADVAALGAWLADPNQPKAAHDVKGPMLALTE
LDLPLAGVTSDTALAAYLALPGQRSFDLADLVARYLHRDLSADPVPGGQQ
LTLDGSGEADQAHADAVRARACLELADALDADLERRSAATLLRDIELPLV
TVLAGMERAGIAVDSEHLTELQKHYGGEVSAVAAQAHEIVGRPFNLGSPK
QLQQILFDELGLPKTKKIKTGYTTDADALAWLAVQSDHPLLPVLLRHRDV
ARLKTVVDSLIPMIDDIGRIHTTFNQTIAATGRLSSADPNLQNIPIRTAE
GRQIRRAFVVGAGYETLLTADYSQIEMRIMAHLSGDEGLIEAFGSGEDLH
TFVAAEAFGLPVSEVDPELRRRIKAMSYGLAYGLSAFGLAGQLGIAPDEA
REHMDAYFARFGGVRDFLRGVVERARKDGYTETILGRRRYLPDLTSDNSQ
RRQMAERMALNAPIQGSAADIIKIAMLGVDRALCAGGYASRLLLQVHDEL
VLEIAPGEHDAVERLVRAEMTSAYTMSVPLDVSVGAGCTWDDAAH
>Francci3_1189 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_4124 Recombinase
MIRPDYAELLADLRSGFLDGAIVWDLDRLTRDLRDLEDAIEVVELYGRPI
VGPGVDLTTEYGKANARAQAVAANKASADTSRRVRRSHKQRAERGVPVGG
SRPFGWKDDKRTLDLTEAAILREGARRILAGVPVVDLVNEWNAAGVRGTR
GKKWTKSSVLKVYRNPRICGLRSRGVEEPNINGQVAKYMQVVTRKERTPD
GRTIEVPVKGQWKAIIGVRRWDQVIAKIGDRTYAQQGHNSRRYLLSGVVA
CGRCGRSMFGSPPYRERKHAIYRCPAPTQGGCGKVSRHGPHTDDHILAAL
FHKIELETASAVVDVAPWTGEAALAEVQESITETRAAWTSVPRRISPKDY
FPTMEDLRAQEEILLRERNDHLVATANAHARPADVRAEWDGYSLARQRAI
IKEHLIAVVVHPAGRGRRFDPDLLDPVWREET
>Francci3_2869 DNA polymerase III, alpha subunit
MERLREVLGAPPGRTEVHLRLQTASRTTVFRFDDAHTVQRTPALMGDLKA
LLGASAVA
>Francci3_2087 hypothetical protein
MGGDGASVGDHGGRSYGSARIGESVSGPAGGLWSRENLSVALRRVEANRG
APGVDGMTTAELRPWLVVHWPVVREALDAGSYRPAPVRQVMIPKPGGGQR
MLGVPTVRA
>Francci3_3584 DNA processing protein DprA, putative
MSSTTPPDARGADPASAPGRVAPGDSDWSDPERLARVALARVFGPEHRRV
AVEVRRRGAFEVWNALRAAHPSVDPVRDLDAAWRAGARLVCPQDAEWPLE
LDALDRLRDAGDGSMIGTPLALWVRGPLNLSELPPRAVTVVGCRTATSYG
LHLAGEIAFAMAEQGWAVVSGAAFGIDAAAHRGALAAAGPTVAVLAGGVD
VPYPTAHVELLEEIARTGAVVSEVSPGTPPYRRRFLTRNRIIAALSRGTV
LVEAGHRSGALNTVAHTRRLGRPVMVVPGPVTSAMSAGCHRLLRDFREQT
VLVTGAEDIREEIASIGSLVQRPASGNGPRDGLSEAVRELLDAMPARAAV
GVSVLARRTGLRPEAVLAMLGPLAVEGLVENVAGGYRLTDLGRAPSNPSH
PATSGRRSGTQPGAAHGGADRSPTRSTADPGLDGENPDGENPDGET
>Francci3_4046 conserved hypothetical protein
MTSETDGTAVPANGAVDRTASVQRIWLDDRSWVDVTRGWLREADQLYETL
HAEIPWRQGTMWRYERHVTEPRMSAWIPRGRPVAFPALLDAYRTLRRTYG
VEFDGFGLSLYRDGADGVAFHRDREMRWLDDTVIAILTLGARRPFLIKSR
HLPPGRRILNDPEASGARDLSPAGGDLIVLGGRAQADWLHAVPRVPEHVA
GRISVQWRWTSRTGRPEQGPGYGAARHFSR
>Francci3_2439 hypothetical protein
MRASSVLCDAELLTTVFPQLAAVLVQRVVDEGRRVRVVARTRADPVPCPR
CGTRTERVHGYHRRWLTDLLRKANTLVVTARAAGNSRLDQKTIDALRARY
DADVRIGELTNLSRPWTDEKNHPGLVLARRLAAKADQVWLFTTNFTIPWT
NNPSEQAIRLPKRHQAVSGYWHTPTTLAAYLRVRSYLVSARDHGLTAVDA
IRLALAGTPWMPARRAASPTHALAA
>Francci3_0143 response regulator receiver protein
MLTYPGVVDLPESTLTFLAGLLAEDRAQRRTWRKLPPPEQALLVLVHLRK
GERYEQLAEGFQVSVGTVHNYIREAVRLLATHGRTLLAAVWIFAWTQSNF
LILDGTVVRTNRVRAHNKLYYSGKHKYHGINLQGLTDPYGRLIWISEGLP
GSVHDLTAARMHDILDLIDRSELYLYADKGYVGGEGDRLLVPIKKPKNND
LPDRDKEANRTHATTRSQGERGFAVLKNWHIFDRFRGCPRRVGTFAQAAL
VLATEGL
>Francci3_0647 ATP-dependent DNA helicase PcrA
MSGFFGDSHGGGSGIAGSVGTDVHGGAHVPYDLFGPAVLAPPVGGDEPGG
FSSPRLDAEGLLDGLNLQQRAAVVHIGAPLLVVAGAGSGKTRVLTHRIAY
LLAARGVRPGEMLAITFTNKAANEMRERVSALVGPRARAMWVSTFHSACV
RILRSEAKRLGFGSTFSIYDAADAQRLITLVTRDLDLDPKRHPARGLAAQ
ISNLKNELIDWEAARDRATNHLERTVAEVYAAYQQRLAQANALDFDDLIM
MTVNLLQSFPDVAEHYRRRFRHVLVDEYQDTNHAQYVLVRELVGQPAYFG
GGSAPVEPAEPAAGPGTWASGAVPPAELCVVGDADQSIYAFRGANIRNIV
EFEEDFPNAAVILLEQNYRSTQTILSAANAMIARNNQRKAKRLWSDAGDG
EKIVGFAADNEHDEAAFVAEEVDRLSDAKLARPADVAVFYRTNAQSRVFE
EIFVRVGLPYRVVGGVRFYERKEIRDLLAYLRVLANPSDTVNLRRILNVP
RRGIGERAEAALAGFADVERIGFAQALERVDEVPGIAARSAKAVREFTAL
LAELRAVAADNPVAVVEGVLERTGYLSELLAEDTVESQGRVENLQELVGV
VQEFARRLPEGTLAEFLEQVSLVADADQVPTDDGSDGVVTLMTLHSAKGL
EFPVVFLTGLEDGVFPHLRTLGDPTQLEEERRLAYVGVTRARIRLYLTRA
VMRSAWGQPAYNPPSRFLGEVPDTLVDWRRLADPPPAPAGTSWGSSGGFG
SGATGRGAAGAPAARKPLPNSPFSGRARPAARAVLELRQGDRVTHDSFGL
GVVVATGGVGDSAEATIDFGENTGTKRLLLRYAPVEKL
>Francci3_2776 putative resolvase
MWCGENGVTVGRVVTEVGSVLNGHRRKFLGLLRDPDVSTIVVEHRDRFAR
VGAEYVEAALSAQGRRLLVVDSAEVDDDLVRDVTEILTSLCARLYGRRAA
ASRAARAVAAAMETDG
>Francci3_4214 insertion element conserved hypothetical protein
MTDTGSQLTDWISLGVLTSFVSRDAVDGAIEATGRGARRSDTTIPPRVAV
YFVMALALFADDDYEAVACRLAATLDDLDVVGPRWEPTSGGLTKARQRLG
SAPLAELFCQVAGPVADLDTVGAFLGPWRLMSIDGLEWDVPASRENVAAF
GLPAGRDGAPGALPKVRAVTVSECASHAPVLAAFGPAGGAKSASEQALAR
TLYPRLAERWLLLADRNFYSWTDWCTAADTGAALLWRVKASLRLPPLRAL
SDGSYLTVLVNPKIGGKARDALVAAARAGEVLDPAKARYARLVEYDVPDR
DGDGKHEIIGLLTTICDPREATATALAGAYRHRWEHEIGNKQLKTYLRGP
GKVLRSKHPDTVYQEIYGYLLTHHAISALTCQAATAAGIDPDRIKFKRAV
RIIRDRVVTDPAFSP
>Francci3_3268 hypothetical protein
MIWRRGRWRGFALDPNTVRLAALRRHAGAERFAYNWGLVRVKAAFAQREA
EQSYGLTGDLLTPVSWTLPALRLAWNAAKHKLAPWWARCSKEAFRAGLDQ
LARGLKNFTDSR
>Francci3_4219 hypothetical protein
MVHGTREHVDALHGQVSRVLAGMGLALSPAKTRIVHLADGFDFLGFRIVW
KRKRGTDNWHVYPFIADEPVASLKRKIRSLTRKLSHLDYRIALIRINQIP
RGWAAYFQHAVAKQGDPQSLFVQVDGVRRGAGGWGRWWCEDPGPAAVSSL
>Francci3_3866 NUDIX hydrolase
MNADVEGLDALPSTHGRRRGTPEDRDGTDSPVVKGRLVVAVALLDDDRRV
LAARRREPHPYAGMWEFPGGKVEPGEHELDALVRECREELDVEIEVGPPL
GEVGLSSPGWVLRVWLGRVTRQQPRLVEHDELRWLGVAELDDVRWMPADG
PLVAELRRVLSTPGSLF
>Francci3_2431 NUDIX hydrolase
MGKRQAVVAVLLRAGRVLVIRRGPQARRPGYWAPLSGRIEPGESQAAALV
REVREEVGLAVTPLAKVWECDTDDGSYQLHWWTAEVGSDEELILDPGEVS
DARWVTPHEFTRLELTFAGHHEFFERVLPTLG
>Francci3_4082 DNA polymerase III, alpha subunit
MPTDSFVHLHVHTEYSMLDGAAKTGLLFKEAAKLGMPAVGMTDHGNMFGA
YEFYQGAKSAGVKPIIGIEAYLAPESRHHKRPVLWGERSQRDVDPAGEGG
DVSGGGAYTHMTMLAANAAGLRNLFRLSSIASIEGYYRKPRMDHELVSQY
SEGIIATTGCPSGEVQTRLRLGQFDKALAAAATYQEVFGADNFFLELMDH
GLPIERSVRQGLLDIGDKLGLRPLATNDSHYVTQDQAGSHEVLLCVGTGK
KLDDPTRFKFDGSGYYLKSSEEMRNLWDSEVPGACDSSLLIAERVESYDD
VFKFVDRMPRYPVPAGETQLSWLRKEIDRGLTWRFPAGVPADVVERVDYE
VGVIDKMGFPAYFLVVADICKFARDRGIGLGPGRGSATGSMIAYILGITE
LNPIEHALIFERFLNPERISPPDIDLDFDERRRGEVIRYITEQYGEDRVA
QINTFGTIKAKAAIKDSCRVLGYDYALGDKISKAMPPDVMGQGIPLAGIF
DPNHERYGEAAEVRAQYETDTKVRKVIDTARGLEGLTRGTGVHAAGVILC
SEPLLDVLPIHRRDNDGAIITGFPFPQCEEMGLLKMDCLGLRNLTVIGDA
IEAVKRNRNVDIDLSTLPLEDAKAFELLARGDTLGVFQLDGGPMRNLLRL
MAPTKFGDIAAVLALYRPGPMAANSHIEYADRKNGRKEILPIHPELAEAL
EPILGETYHLVVYQEQVMAIARELAGYSLGGADLLRRAMGKKKKEILDKE
FARFSAGMKERGYTDAAVQALWDVLVPFSGYGFNKSHTAGYGVVSFWTAY
LKANYPAEFMAALLTSVGDDKDKMAVYLAETRRMGIQVLPPDVNESDLRF
GAVGDSIRFGLGAVRNVGENVVASIAAARRRKGAYESFADFLQKVDIGVC
NKRTIDSLIKAGAFDSLGHHRRVLVNVHENAIDAVIITKRAEAIGQFDLF
GDGGAGEEEESPGLGLDLDLSGPEWPKKELLAQERDMLGLYVSSHPLEGA
ERALDRHRDTRIVDLAEANDGTTVQIAGIISKIDRRINKNTAKAWAIVTV
EDLDASVEVLFFPQSYEVHSYALATDAVISVRGRINEREGSVSLFAQDLT
VVDVATHVNGPPVVITLPSHKITPPLVDDLKLVLTTHPGTTPVHLRLEGP
QNTHLLLLELQVQASSSLLGDLKALLGARAVTV
>Francci3_1922 transposase IS66
MSVLSVTDDVTEVAYWRGRAERAEECAEKAEARVGQLQLRVEELSEQVAV
LSRMLFGRSSEKTGPSSAVDEKPEDRQDSGGGDAGRPARQRGQRPGSRGH
GRRDYSHLQTREEIHDVPEVDRACPGCGVAFTPLGTDDSEQVDWQVVITR
IVHRRRRYRRCCTCPGPRTVTAPVPPKPIPKGRFTAGFLARLLYEKYVLG
LPLHRIARALAAAGLGVAEGTLCGALKDVHGLLGGLDEQIVARNAAAGHV
HADETTWRVFERVEGKDGTRWWLWVFVAADTVVFRMDPTRSAAPVEKHFG
IDRAAGALSDGRRLVVSSDFYTVYQSLGRVDGVDPLWCWAHIRRYFIRAG
DAHPQLRYWADQWVARIGMLYLAHRALAAEQPTTGGYREAAGAFEAALRA
IDTARRAEAAIHSLHPAAKKVLATLDREWDGLARHQDFPDLDLDNNAAER
ALRTPVVGRKNYYGAHAEWAAHLAARVWTIVATAERNGREPLAFLTGYLN
ACATAGGKAPAGPALEPFLTWQTTTQTGSPPSTDPPQDGPPDGPEP
>Francci3_0360 serine/threonine protein kinase
MLRPLNDTDPRVMGPYRLHNRIGAGGMGVVYLGFGPDDQPVAVKVPHEVH
ASDPEFRARFRSEVSAARHVRADTVARVIRAEVDGPKPWLATEYVAGPTL
RAAVQEGGPLTGRPLDGLAIGLAAALEAIHAASVVHRDLKPANIVMSWAG
PKVIDFGVARSADYTGYTQAGELVGTVVWMAPEQINGQQAGSAADVFAWG
CCVVFAATGRRPFRGEAPEIVALHISSTEPELDGVPERLLGPVRQALTKN
PGHRPSAGELVRLLTQRLSPEETADASNDVPPVTGAEPNREQTRPTPNPG
PPATPPPTPSPVPTPSFPAVGPAHPSRPSPSSSRPSVLTALLALTGLAAT
VGVWAAETERTGHAVVGPVAAAIIGLICGQMIFLSDRRIGVLTTVAAAVS
GSGIGLLLARVLDVDEPNRVLLSVAGALIVATAFAGAMAPSRPAPGDRPG
TDGPGEPVGSHRLLEPTHPVSRTNTGGEAGTDAAGATLRLHGAAAPGQHL
VPERPDPS
>Francci3_2183 transposase, IS4
MGHDGSEGLSPEGWLPDRVTVGVLTRVYPPELVDRVLAVTDTAEVRRRLL
PSWLVVYFVLALWLFRGRNCGYVQVLARLTSGLHFQRRAAVLAAGGAGGA
GWSLPASPSLGEARARIGSDPVRMLFEHAAGPVGVEGQAGVFLHGLRLVQ
IDGSTCDLPDTQANRAFFPGPSNAGGPAPFPKVRWVIAAEAATGALLGAS
FGPWSTGEPALARDLLGQLGPGMLTLADRNFLSHRLAGEVLATGAHLLWR
AKATFTLAPVHVLDDGSYLAELTPPRGSEGPPLTMRVIEYTVHSTTAGGD
ESSSELFCLVTDLLDPEEWSMLDLARAYPTRWGCETVIGHHKTDLGEGRP
VLRSKDPEGVAQEMWALFAVHQALARLIGVAADTTGTPPDRISFRRALTA
ASDSIGTAAFPP
>Francci3_0300 transposase, IS4
MRRSLRVLGAHGGEVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVA
AGEKSVEEIAAWAAHAPTQVLTALGARVHPVTGQPQAPSVDTMIRVLSAV
DSSALARAVGMFAAARARQARGGGRRVVAVDGKTLRGAAGPEGRAPHLLA
VAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGVVVTADALHT
TRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPIGHSAEGRA
HGRFERRTIQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARVTR
TIPSTVTVHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDA
SRVRTGPLPRIMTTLRNLIIGLIRLAGHNRIAPTIRRIRHDNALLLAILT
LDNPADLHQ
>Francci3_4185 serine/threonine protein kinase
MRLTGPEPERFITDRVAEALPAYEVTAMLGHGGHAVVLAGRHRRLGRTVA
IKVLSTSAADGGAHGRFLAEARLLAGLDHPHVVRIYDYVESGDLCLLVME
RLAGGTVRARAANGLTVDVVCAIGLATAAALECAHAGGILHRDIKPDNIL
FSADGLLKVTDFGIAKLIGASGAAPSTLVGTPVYMAPEQFDGRPPGPACD
LYALGIVLYELLSGRPPFVRALSMAQLMDHHLRVAPQPLDAAPPAIAAVV
DRALRKEPGARPPSARAFALDLAAATAHALGAGWLTRCPLPLRLDDDVRD
AARGSPPARTPPARTPPARTPPARTPPARTPPARTPPARTPPARTPPART
PPARTPPARTPPARTPPARTPPDTPPSAVPPPATASPRRWSAGVIRRRSL
NRTARRDLRPADITSAPLRVDYPYSVVAAPDGAVYVSQRLRHRVLRIERD
GRTVHVAGSGKSGPHGDGGPAVNAELDNPCGLALGPDGSLFIADSFNNRI
RRVAPDGRIVTVAGSGRHGPPAGPAARHAASLNLAHPHGVYVDAAGLVYV
ANTGGHQVIRIDPDLRAAPLAGAGVPGLSGDHGPAQFAQLRRPHDVTAPP
GRNVYLADTDNHLLRAVDADGIISTAAGMFYGASPDDGAPARIADVGRPH
SLAPTPSGGLLVTDPDRGRVRLVTHDRLVRIFADARTGLRRPLGVTVHSD
GTAFVVDTAQHLIHRLPVA
>Francci3_2952 serine/threonine protein kinase
MDDERWTQCTPSEYAWERAALSYLKAQLPTGDPYRAWANAEFLGSDGSVN
EVDLLLVLPAGIVVLEIKSWSGVLVGDAGTWRQSHRAPVDNPVIGANRKA
RKLKSLLVSRPAMRGRRVPWVEGAVFLSDARLEIRLVPEGRAHVFGRHDQ
TQLPSFLDFARQPGRVDGELSNALARAVDQAGIRPSQRARTVGSLRLELP
AFQEGVGWQDFLARNQRFPDDRPRRVRIYLASGVESVHQRDQLVRAAERE
YLALRGIDYPGIAAPIDFVEHDLGPALVFPHDPELIRLDHFLQEHERELS
FDDRLALLRVLAETMAYAHRRVLTHRGLSPRCVWVRRRSDRFALQITDWQ
TASRGSESTSGIPSTTGPATSTGGYVELADDAAAAYFAPEWSWGTAKGVT
LDVFGVGAIAYRIFTGKPPAASSGELSRRLSEHNRLLLAEQVDAVSEELN
KLVARATEADPEQRTRDMAAFLLDLDLVRERLAAIAEEAPVVVDPLEAEA
GAELEGGFSVLGRLGRGSTALALLVERDGGRAVLKVSLDRDRDPRLVAEA
ETLRTLSEHPGVVQLLSDGVIDVGPRRALLITSAGERTLAQELRERGRLQ
PEWLQGWGDDLLEIVQHLQRAGLAHRDIKPDNLGVAERGGRGKKRQLVLF
DFSLAKEPLEAIEAGTRPYLDPFLGRGERRRWDQAAERYAAAVTLYEMAT
ARRPEYGTHGGHPSFVDADVAIEPELFDRSYATGLTEFFRRALHRDVRQR
FDTAEDMRRAWNAILAEPASVVPQPPSARISRDTPIGAAGLSRPVLSVFE
RLSIDTVGGALDLSPAQVVWLPGIGTKTRQQLRADLDRLAGQVTSSPAER
PTEPTLLDRVAAGLIPRATDRAVAEALLGLGEQGGNAWTSVRDAAKALDR
EQRTVRQAVARFERHWLALDGMRELRDTIVKVVESVGGVASARHCAAALL
DFQGSTVEEPLRSRLAEAVVRAAIDAELADDSPRDDTTHDTTHDASGGSE
MGGDPRLVYSREPNGILVAAGPARAGDGPSTADRLDWASRLGGAADDLAG
ADPLPAPARVIETLRAVRAPGDTDPSLIFPERLLDVAMIASENAAVTPRL
EVYPRGLDAGRALRLAAGALYGRTELTPEQVAERIIVRFPHAGALPGRPE
LDALLDAAGVPLCWDDEKKRYVTRRIEVTGLTSLVTSRGTRPRWSTGAGA
AWTTMGWRRVSDEVLAADDRLTRSLVDGGWLVLSVPPRRLARAERCLAAQ
DVTVVDAEQALLAGMREFCAQHRVQWSIVLAADAADRSSRDWANLSRVAQ
AGLAGVRASIEAAGPAVLITNAGVVARYDPALVVLDELRASVRMTTETSP
VRTVWLLVPWADVDKQPLLDGGAPVPQFGNQGLALSEEWIVRHESRLADG
AEGGAA
>Francci3_3586 protein of unknown function UPF0102
MRAKDALGRFGEDVAARHLAAVGAEILDRNWRCREGELDLVVQDGESLVF
CEVKTRSGTRYGSAAEAVVGRKAARIRRLAARWLAEHPHASSLVRFDVLL
VSRPSTGPVRVEHIRGAF
>Francci3_3525 recA protein
MAGLDREKALDNALAQIDKQFGKGSVMRLGDDTRPPVQAIPTGSIALDVA
LGIGGLPRGRIIEIYGPESSGKTTVALHAVANAQAAGGIAAFIDAEHALD
PEYAGKLGVDTDGLLVSQPDTGEQALEIADMLVRSGALDIIVIDSVAALV
PRAEIEGEMGDSHVGLQARLMSQALRKMTAALANSGTTAIFINQLREKIG
VMFGSPETTTGGKALKFYASVRLDVRRIETLKDGTEAVGNRTRVKVVKNK
VAPPFRTAEFDIVYGGGISREGSLIDMGVEHGIIRKSGAWYTYDGDQLGQ
GKENARSFLRDNPDLANEIEKKIKEKLGILPSLESDAVAPVPVDL
>Francci3_2356 DNA primase catalytic core-like
MVSGPGPVEPQVAAHLSAAAAQAGSRDGWARWLALAGQARTRANPVRDAA
LLVDPLTRLFHARGYSLAAGERGQVRVDRVHRTVTHPPTFLADLGIMQDL
AHAAAHLMLHPDLGPQVDCVGRDRAEALSVAYLVAAHAGVPLPVPDALPD
PRQWAAGADPAGVVREATRRVWSAADRLGHALAPGPRPGAQQLREARRRA
NTPRTEGLATRARTLRVHADSSLPPPPRQASAEHARLYDAIRAAARWYTL
HLTGAADGPAGRMLAERGLADVASDPRWMVGYAPPGWTGLVDQLHALGFT
EQELLDAGLAARTRRGTLVDRFRHRLLFGLRDPDGRIVGFIGRALDGQTP
KYLNSPTTVLFDKSRLLFGLAEQRDLLAAGARPVVVEGPTDVLAVAVAAR
QTGRALAAVAACGTAFTADHARVLGSATPGRDGITVAHDPDPAGLRAAAR
AYQVLRDLDTRLHHADLPAGADPAGLLATAGPDALRAALSDPARLRPLAA
AVIDQRLAWLDDDHRRFLEHQFTALRAVAPIIATEDPALVGGLVSYTAGR
IGLDASDVIGAVFDAVGDLSGAALARLHSGTGNRSAAEISAAAALPGAGG
PRAAAALAAAYPSSTRPVLAALSLPPRPASVDTAADQSRAR
>Francci3_0075 phage SPO1 DNA polymerase-related protein
MQPVHIPPRADLPTLAAAAADCQGCDLAQLPGTRTVFGVGPAHSWLALVG
EQPGDIEDRRGVPFVGPAGKLLDRALGEVGLDRAEVYTTNAVKHFRYRTG
NGPRRIHQTPDLRHVTACRPWLSAELNLVRPAIVVILGATAGMSLLGPSF
RVTKMRGRLLPGPAGSGAQLLATLHPSAVLRADPARRDEVYAGFVNDLRI
AVAARPTMDPSASVDA
>Francci3_3982 hypothetical protein
MVVIGVTGHRGLSGGQAEYTAGEMRRVLAPLAAAGLTGISCLAEGADTIF
ADVVLELGGRLIVMIPAAGYRDFQPATHRRNYDRLLGRATEIRSQDRAKP
DLPSLMSASRLLVDASDRVLAVWDGLPARGFGGTADVVSYARWRRKPLQI
IWPDGAVRAGATSVAGSGERQR
>Francci3_1746 hypothetical protein
MPANIRSRVLALTRATPPAETGLTHWSSRELAKYVTRTTGVSISWHFVAK
LWRENHLQPWRQGTFKLSRDPEFADNVMDIVGPYLPADGRGGAQLRREDA
GPSTGPYPAVAADHVRRHREAYPRLRPPRHHQPLRDNRSRNRQGLR
>Francci3_2529 Recombinase
MTNRYRERLIDPKDLETMPEARKRAVLSGLDINNRQTQLKDCRTFVESRG
GVLLDAPYDEPDTSAWKKRRIRQPDGTIIYRVVRPKYEEVLRDLRRGVAR
NGERLDGIVVADVDRLTRDQRDLEDAIEVVTQYGRPIIDISGSLDLLTDN
GRDVARIVVTLKARQSLDTSKRVRRKHLAMAQAGITVGGNRAFGWLADKE
TKDEPAAALLVAGADQILAGVGLHTICRQWNDLGIASAMGKKWQKPVLRN
IYLSPRIVGYRVYGPTSVPLEKRYVVDADGQPVKGQQQPILDLDVWEAVV
AKLRDPSRVSKHVHIGGRKYLLSGIICCGFRGRHLMGGYDRRWGKHHYAC
KAVTAGGCGKVGVTGRHVDDLVSELVLAYLAGRDVEAEVGRWPRAGELAK
AEAKIAKLMGAYDRDELPGPYVFPRVREQEQSILHLRAEQAEWLRAHTGP
KVTNLAEGWPSLELEQRWEIISTVIEAVVLKAADGPTNRFDPERVEVVWR
P
>Francci3_4521 single-strand binding protein
MAGETVITVVGNLANDPELRFTPNGAAVASFTVASTPRTLDRQTNEWKDG
EALFLRCSIWRQAAEHVAESLQKGARVIVTGRLKQRSFETREGEKRTVIE
LDVDEIGPSLRYATAKVVKAARGGGGGGYGGGGGYGAPAGAPPGAPAGVP
AGVPAGGGYGGGSGGGAPIDDPWSQPAGGYSDEPPF
>Francci3_2143 hypothetical protein
MTARSSTARGSSSNGGPDPSETPAQRRYRNWGDLLQELRVVQTGVQLLTA
FLLAIPFQQRFTTLSNEQKSLYLVIVLLTVSATGLLIMPVSLHRAVFRRK
EKETLIRVANRLAQVGLALFAVAVSGVVLLIFDVVRGAPAGWIAGGCTFV
VLVVLWAVVPAAIRLAAGGWASGP
>Francci3_4338 phage integrase
MKRRDLPGAPTRCIRRRPPGRHRLHQRNSHPPLAVRPDRLRHPTRQAEIR
RRGPCHLPRPRHPRRPESLPHPPTRQPARRRNHPAEQRPRVHPPRRHSHP
PRTPHQPLPHPHPGSRPTPITIRGLRHGAATLALAATLALAATLALAAGA
DLKAVQELLGHSTIMITADTYTHILPDLAAEIARNTARLIPRTRSPQEYP
RHTTTID
>Francci3_1086 helicase-like
MLLEGLKPGLRVDGLIPAEVVTVIAAQWHGSDALELTYKTAAGGLGQQVV
FRKDQDKLSVAQTGSRAFDANASDFKLVAEAQRISLAGLFDPMLAVATSD
VQPLPHQIRAVYGELLPRTPLRFLLADDPGAGKTIMAGLYLKELLLRDDV
KQCLIVAPGGLVEQWQDELFFKFGLRFDLLTNQLMDSQVNLNVFETNPLL
IARMDQLSRNEQLRAQLRETEWDLIVVDEAHRMGAHYFGGKLEKTKRFQL
GELLGEITRHLLLMTATPHSGKEEDFQLFLTLLDRDRFEGRQKKTTDTSG
IMRRLVKEELLTFEGKNLFPERVAETVPYELTALEYALYADVTHYVREGM
NRADRLGGKRKNTVGFALTVLQRRLASSPEAIYKSLVRRSERLEGKRQEI
LNGTYRESEPTVDLGEIDEDGYNAEEIEELEEDLLDAATAAQTVEELNAE
LIELAELVKTAKIVRDAGTDRKWTELSNILQDEVLTADANGWPRKLIIFT
EHRDTLDYLRARIGSLLGKPDAVRAIHGGVRRGERRQITEEFTKNRDVQI
LLATNAAGEGLNLQAAHLMVNYDLPWNPNRIEQRFGRIHRIGQDKVCRLW
NLVASNTREGEVFTRLLAKLDQMRQDYGGKVFDVLGEAFTETPLRALLLD
AIQYGERPDVKAKMHEVIDASVGDGLKDLLDERALASEHLAEADLQRLRA
AMDEARAKRLQPHYVELAFKAAFTRLGGRIARRERGRYEIANVPQHLRSA
GRGPIATKYDRATFDLAHCDHGCKVVRWLIEFRPVHRMSCFPSVMLRNAR
EYPRTVGTVPARHLDPRSGRARRGGPSWR
>Francci3_0402 transposase, IS4
MAASAVRVRGVGTHGAPSVPGVDPGGCLASTAPQDLGPARCGRTGRLVIG
DRGRGQPAGKKGGSLTGPNPVDRGKPGSKIHVLTDAGGLPLVVAVSPANP
HDSGAFVPLVASIPAIRSRRGPRRRHPATLRAGKAYDQPERRRFLRRRGI
AVRIARRGVDSTERLGRHRWKVERTLAWLGGYRRLSPRYERNGYNFLGFL
CLAAAITCWKKLPHST
>Francci3_3002 serine/threonine protein kinase with WD40 repeats
MTGTTPPPLPGDAVRPLQLTDPRRLGVYQVIGRLGQGGMGTVFLGRAPDG
SAVAIKMIRPELAQRPEFRARFAREAESARRVRRFTTAAVLDADPYGPQP
YLVTEFVEGPTLSRRVSVRGPLRPADLEQLAVSVTTALSAIHAAGIVHRD
LTPGNVLLSPVGPKVIDFGLAREFNADTDLSHNVRHAIGTPGYMSPEQIL
DAPITSAVDIFAWGAVIIFAATGHAPFGTGRIDAILYRIVNEPPRLDGVG
GELRNLVEIAMAKDPAARPSAEELRTALIGGGTLPARPEPGSIPSGPPGA
GPTGRARRWARRRPSGAAGSAPRSEPPPRGTGSTGSTGGTAANVADMPVT
QLSPPPVASPPPIPARTPPPIPARTSPPRGQPGSVPPHPAGTPHPPAPSP
SPAPSRLRWSRTALLVAGLAIAITAATVLIIVPRGGGPSPVSAADRASIS
SRLAADAAAQRARQPDLAGRLSLAAYRIAPTEAARAAVLASFAQSTAARI
PAGPAAFSDIALSPDGTTLAGTDDTGSLHLWKVDAAGRPTATTGGSANDH
AHGVVFDRSGTRLATGGETDAGRLWDIADPARPRPLSTLDPQATPVHRLA
LSSSAHLLVTAGEDWSVGLWDVADPARPVSIQLLIGRAGPVTDVALRPDG
AVLAIAGAGGPVQLWNVRDPRRPVQTASVPGHTGAVNTVAFSPDGRRLAT
GGDDRILQVSDVGDPDHPRVLRRLSGHTAPVAAVAFTTDDHLVSADGGGA
VAYWDLSAPTPPMTPLGVLDAPARAVAGTGTETVALTTDKGSVLLGTLDP
ARLRRLACAKPGAALSPAEWSRLVPRLPYTDSCSG
>Francci3_4289 putative Endonuclease III
MFARYRTAAGYAGADRAELEDMLRPTGFFRAKANSLIGIGAALTERFDGE
VPRSLAALVTLPGVGRKTANVVLGHAFDMPGITVDTHVGRLSRRFGLTTQ
TDPVKVESDLAALIEQRDWTIASDRMIFHGRRICHSRRPACGACGLARLC
PSFGLGPTEPAEAARLVRGTRSDVETLR
>Francci3_0147 methylated-DNA--protein-cysteine methyltransferase
MTSDDLRAPDPFAAMPDLDARLSRLRQRLAAEAEREGLLDVAYRSVDSPV
GRLLLAATPQGLVRVAYEVEDHDAVLASLSERISPRVLRANGRLDPVAVQ
LDAYFAGARSRFTLSLDLRLATGFRLSVLEHLRDIPYGSTASYAAVAALA
GSPRAVRAVGTACATNPLPVILPCHRVIYSDGRIGRYLGGEQAKQSLLRL
EGAL
>Francci3_4120 transposase, IS4
MSNEPVDCEQLDISSLLAMLGEIPDPRKAKGAIYSLRYILSTSLVSTMTG
AKRLSEIGRWAARIPQPLLARLGAPYDHFLGRYRVPSEKTIRRVLQVIDV
AALDARIGCWLFAQTTWENGEHITIAVDGKVMRGAWVDEKTQVKLLAAMI
HGRGLVIGQIRIPDDTNEITQVENLLDQLPEMPGHPTVTLDAAHTQDDTA
KDIVKHGMDYVMTVKGNRPTLKRQTFERVLPLLQEPAHHEVQERGHGRIK
NWQTWTTKADGIEFPHVNKAAIIRRDEFDLTGVRLTREYALALTSATGTH
ATASYFHSHVRGQWGIENEIHYIRDTAWREDDDPIYTGNTNQAFASLRNL
TIGILRLNGRRKIKETLEDVAADRHAALDLLVTACSGSKR
>Francci3_4096 putative IS6 family transposase
MQRFTPLLVDAARPCRHLPGDRWFVDETYVKVAGRWTYLYRAVDQHGQVI
DVLASARRDQAAARRFFTAALSHGRRPVEVTTDKAAVYPRILDELVPEAC
HVDAARENNRIESDHSRLKARLRPMRGLKRLRSVQTISAGHAFVQNIRRG
HYELGIDTDSHARLTAAFTELTLAI
>Francci3_1958 Integrase
MKKACELTGRSRATFYRRHRPPASVPPRPVPKPHRERVQPRALSARERER
VLEVLNSERFRDASPAHVWATLLDEGGYLASWRTFYRILAAAGQTGERRA
QASHPSHVKPELLVCAPNQVWTWDITKLKGRSKHEYYYLYVILDVYSRYV
VGWLLAPNESGELAKELVSQTCEKYGVDTSGLTVHADRGSSMTSKTLALL
LADLDIVRSHSRPRVSNDNPFSESQCKTLKYRPEFPDRFDSIEQARRFCR
GFFTWYNTAHKHSGIGYLSPAAVYFGLAGQAHAARARVLDAAYAAHPERF
VNRRPVPPPLPKPAGINTKPDDQAEPVTDGAPERSEVIPRQRSKQGSNAG
>Francci3_3263 Integrase
MVDISERPKEAEDRAVPGFWEGDLIIGKGNRSQIATLVERTTRFVMLVRI
PSDRTAERVAYLLAKKMGTLPEFLRNSVTWDQGKEMARHAEFTVRTGIPV
YFCDPHSPWQRGSNENTNGLLHQYFPKGTDPSLHTQDELNKLAAQLNGRP
RQTLGWLKPIEVFNELLESHASLWPFDSTRQVVRQANGA
>Francci3_1109 transposase, IS4
MAERKPYPSDLSDEAWDLIRPVLTAWKARHPSASGHEGGYDMREIVNAVL
YQARTGCQWRYLPHDLPPTSAVYYYFGQWRDDGTTETIHDLLRWLVREHH
RRKADPSAVVLDSQTVRSSTNAPKGTTGLDPGKKSPGRKRGIAVDTLGLL
IAVVVVAASVHDNTIGTALLDRVAAAAPPVRKAWVDAGFKTTVVEHGAGL
GIDVEVVGREPGARGFTPLPKRWRVEQTLGTLMLHRRLARDYEAKPASAV
AMIHWSMVEVMARRLTGAATPTWRDPPV
>Francci3_3347 protein of unknown function DUF196
MDLLVTYDVDTTTPDGNRRLRKVAKICEGHGIRVQKSVFEIVCTEPQRVL
LEHKISQVIDETLDSIRIYQMPQHTLDNVHHLGAGVQPVHRADHII
>Francci3_1119 transposase, IS4
MPAAPVLPPAPVLDRLAAVGAGNQPPSPAGLLAVFNQLPDPRKPRGRRHS
LAAVLTLATCAVLAGARSFTAIGEWSADAGQAVAGLLGVSRVPEESTFRR
VLAALDADALDTALGAWAAAATTPPAGTRRRLAVDGKTLRGSRTPDSPGR
HLLAALDHTSGVVLGQVAVDAKSNEIPALPVLLADLDLTDVIVTADALHT
QRQTASWLVSRHAHYILTVKANQPALYAQLAALPWRRVKTAARTVERGHG
RRERRTVKTTEVRAGLLFPHAVQAVQVTRRRQPLADGPATTEIVYLVTSL
PTHQASPTLLATYAREHWLVENRLHWVRDVTFGEDLSQVRTGHAPQVMAS
LRNLAIAILRLTGATNIAQAIRHHARRPERPLETIKSLAC
>Francci3_0531 serine/threonine protein kinase
MTERSARRVSPIGGYPAEAPHPDDPTTIGVYQVVGRLGAGGMGTVFLAQD
AAEKFVAIKVIRSDLAADPEFRARFRDEVAAARRVAPFCTAQVLDADPDA
RRPYLVTEYIDGVRLDQAVTESGPLPLSTLQGVAVGVASALTAIHRAGIV
HRDLKPSNVMLSYSGPRVIDFGIARTLDMTKGRTQTGLVLGSVGWMAPEQ
MEGAALGPAVDVFAWGLLIGYAATGGHPYGHGTYLEMSEKILTGQPDLRA
MPPDLTPIVRSALARDPRDRPSTENLLLTLLGERGRAGDARDAATALLDG
TWPRGRSTAGAGAAALAGGAGAGWPDPTHVATPYEQRRWWENEPRPLGGD
RWWNDSARDRPPPAGPPVPPPALAPPPAADRAAGPRHGVPRARRRRRGGF
VQYDDEADGNGHRSGFDNSGEGGHPPPAPDARPRHGGQPRVDSPHAGQPY
PDQAHAGRPYAGRPYAGQPYAGRAAEAAPSPQAPARPRPVAQQRPPSAPA
GPAPAPAPRVAEARPAPRGRAPAPYAESVPRPRPRRRFRLRIPFKRTILF
VLVVLLLLSAADQIALMVDEQRQRLWNKVVTSVKDDTHDQLDSLWNRVRD
TFGGSSGGSSNG
>Francci3_2126 hypothetical protein
MRTTVEIVRKKPGQKTFEALPKRWVVERTLVWLTAHRRLARDYERHPATS
ASFIHWAMIRTMVRRLVRGNPVPRWQPRDTTER
>Francci3_0065 hypothetical protein
MVGECLALEEEVPVIFRLVGDGRPYPEHGLGQRDWAAIPPRQVRLDQLVT
TKRVLALDVLLDEDSTFYGDLFPHVVEWDGALYLEDGLHRAVRAALQQRT
SIHARVYVLTGPALTGPAQSSAAR
>Francci3_1259 DNA polymerase III, delta subunit
MLTCVSSSALPPLVLVTGDEDLLVGRAVRGVLDAARARDPDVEIVDRPAG
ELSEADLIDLGATSMFGGGRVMVVRGAQDLAEDLRDALLEYIARPLDDVV
LVVVHSGVVKNRKLADVLKTAGARVVPAAKITKPRERHDFVVAEVRHLGG
RVSDGAVRALLDAVGGDLRELAAVCDQLVADTDGLIDEAAVARFHRGRAE
ASGFAVADAIIGGDLPQALTLLRQSLEGGTAAVLVSSAVAGGLRDLARVA
SAGPGSKWDLARALGMPDWKVEKAQRSARAWSDRGLGQALRAAAEADAGV
KGAAVDAGYALEKLLREVAVARLVGRARAAGSRG
>Francci3_1226 nuclease SbcCD, D subunit
MITGTARKARSSDPHDDHLDGIAYRARMRALHTSDWHLGRGLYGHDLMPA
QAAFVDHLVDVVRSEGVDVVLIAGDVHDRAIPPVGALELFDEALSRLRDA
GARVVVISGNHDAARRLGDKAGLLDPRVRIRTDPAAVGDPVVVEDPAGAV
RVYAIPYLEPSAANSQLPEPAQAPSGSDVPAAGIPAATMHRAMHAVRADL
ARYPDARSVVVAHAWVTGGAASESERDISVGGVGNVPARLFEGITYTALG
HLHRPQAIAPSVRYSGSPLAYSFSESGDAKASLLVEIGPTGLGNVTRIGV
PARRRMTLLRGSLADLLTDPAHAPHEADFVSAVLTDPVRPMDAMARLQHR
FPFALRLAHEPETEPDEILSFGRRTRGRSELEIAEAFVAHVRSAPSARER
ALLAEALGAARRAEEEVA
>Francci3_1145 single-strand binding protein
MLETTITLVGNLVDDPDHRMTANGASICTFRLASTPRRFDRSESRWVDGT
TLFLRVSCWRQLADNVAASLVRGDRALVYGRLRQRSFETGEGERRVTYEI
DADAVGTELTWHAARSERLARRPGSAVVAAAATDVPSDPSALVGSALVGS
EHGALAGFPVAGADADADAAGAASAATAMASMAEGPPEGSGRLQGAGAAP
GEPRPSAVTVGDSPWQSFPSSHDLS
>Francci3_1728 putative transposase
MVVGWGSGSGWLLAVTPEEMAVVRPVVEEFAARMFVDLPRRDQRAKGELY
LRGLLLDGKRKSVQPMAERLGVDHQQLQQFLRNSTWDYAEVRARLAGWAV
GFVRPEALVIDDTGFVKDGAASPGVARMYSGTLGKVGNCQIGVSVHAATD
WASAALDWRLFLPASWDDTGLDDPDEAAVVRARREKAGVPDGARHREKWR
LALDMLDEIAGWGVPARPVLADAGYGDCAQFRQGLTDRGLVYTVGVTPTA
TAHPGAAEPVTAPYTGSGRPPLPAYPDPPATLKALVLAAGRGALRWVTWR
RGTHKTPGNTAALMRSRFLALRVRPAAKALTRDDDRSLPACWLLAEWPPG
KNEPTDYWLSTLPPDIPLRRLVRLAKLRWRVEHDYRELKDGLGLDHYEGR
SFDGWHRHVTLTCLAQALCTQLRRTPKAPAPA
>Francci3_4155 transposase, IS4
MPVGALSRDAAGVVGAGAVSREGIWQRLHAMGEHRGRRGRVYPLAVLAAV
WLCALTAAGHDRVTAVIEWLAGTTEEERRRLRLPYDPFDGYRLPSESTIR
RFLNDTDDARLARALLDPPLADPAPPKPASPEAAGEAVRAVYALDGKTSR
GAKRADGRQVQLVGVADQATGRLVNQHEVDSKSNETKAFRPVLEPLDLAC
DLLTFDALHTVRDHLDWLVTDKKAHYLAVVKGNQPTLRAFLAALPWADVP
VADTTHDHGHGRDETRTLKAATVEHAEFPHARQAVRIQRWRREKGRKPSR
ETVYGITDLAFEQASAGFLADAARGQWIIENRQHHVRDVTFGEDASRSRT
RRGPANLAIFRATVAHAVRAAGHRYVPAGRRACKTATAALDLHGFP
>Francci3_2481 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_1257 helix-hairpin-helix DNA-binding, class 1
MVESTPGPGYPLSAPSPRFPVHAFPPETSVSSRWPSLDLLPSDDATVPLR
PPDPPADGSAGGSRRVRDATFTEDEQEIGGDESDLFGIREGVFLDEDRAA
GGWVADELAAEGADLGRLVAGRTDRRRPVHSRLVHPRAGREMDDGEDEGG
RGPDRGPSAGRPAPGVPRGDLVGVIRRRLPATLHGIVVAPAARAALVLAL
VALAAALMTAWFSWQHRPVPLASSEVDTPRTGESAAAGNARSDHSDHGAT
SATDTSVTSAPTARAARTGASGEVVVDVAGRVARPGVVRLPAGARVVDAI
ERAGGVLPGTDTTGLALARLLVDGEQVLVDGKPGPARPGTAAGQPAGTGL
GSAGSTAATGPIDLNAATAEELDGLPGVGPVLARRIVEWRTAHGPFRSPE
QLAEVTGVGDKRLADLLPLLKV
>Francci3_2885 NUDIX hydrolase
MNIEQEWEASYAGRDAAEFAHLAEGNARQARKRVSADALIRDEAGRLLLV
DPTYKPDWDLPGGMAEANEPPRDALRRELKEELGLDPQVGDLLCVDWVSP
HGPWDDLLAFVFDGGALTQQQAQGLRSVDPELAAVRFCSPEEAAQLLRPY
VWRRVHVALAVLGGGGVRYLQDGHS
>Francci3_0114 phage integrase
MRLPPACRRYPSRSDHASADAKNDQLGLSTTKWPWGSIYQRESDGMWVGA
AYVLMPDGTQRRRPVYGKTADIVREKLTKMQAQSDQGIPAEATGWTMERF
LTYWLSDIVTPACKPRTVQGYEVIVRNYLIPAIGKKRLNKLNGVDVRNLL
KRVRGTCLCCLHGTDRRRPVKQRRCCAVGRCCHQAPSARLVQQVHSVLRN
VLGAAVREELVGRNVAKLAKVSGPTYKVHRGLSADQASHLLKAAAHDRLY
ALYVLALYLGLRRGEILGLRWEDIDFEDETLAVRHSLQRVGGHLRVVAPK
TRTSERTLPLLPLIAKVLREHQARQDAERETADVNWRETGFVFTTAIGTP
IEPDNLRRSWLPLCGVLGLEGVRFHDIRHTCVTLLLNAGVPPHVVREIAG
HSAIDVTMEIYAHASLDDKRAALQKLVDELA
>Francci3_0277 transposase IS66
MSVLSVTDDVTEVAYWRGRAERAEECAEKAEARVGQLQLRVEELSEQVAV
LSRMLFGRSSEKTGPSSAVDEKPEDRQDSGGGDAGRPARQRGQRPGSRGH
GRRDYSHLQTREEIHDVPEVDRACPGCGVAFTPLGTDDSEQVDWQVVITR
IVHRRRRYRRCCTCPGPRTVTAPVPPKPIPKGRFTAGFLARLLYEKYVLG
LPLHRIARALAAAGLGVAEGTLCGALKDVHGLLGGLDEQIVARNAAAGHV
HADETTWRVFERVEGKDGTRWWLWVFVAADTVVFRMDPTRSAAPVEKHFG
IDRAAGALSDGRRLVVSSDFYTVYQSLGRVDGVDPLWCWAHIRRYFIRAG
DAHPQLRYWADQWVARIGMLYLAHRALAAEQPTTGGYREAAGAFEAALRA
IDTARRAEAAIHSLHPAAKKVLATLDREWDGLARHQDFPDLDLDNNAAER
ALRTPVVGRKNYYGAHAEWAAHLAARVWTIVATAERNGREPLAFLTGYLN
ACATAGGKAPAGPALEPFLTWQTTTQTGSPPSTDPPQDGPPDGPEP
>Francci3_4167 putative transposase
MMRRRRVRPPVPTSEFAGFRFPPEVIILAVRWYLRYALSYRDVEELLAER
GIDVDHVTVYRWVRRFTPLLIDAARPCRHTPGDRWFVDETYVKVAGRWTY
LYRAVDQSGQVIDVLASEKRDLAAARRFFTRALSHGRRPVEVTTDRAAFY
PRVLDEQLPAAHHVDDQYANNPIEADHGRFKARLRPMRGLKRLRSAQIIG
SGHAFVQNIRRGHYELGVDADLRNRLTAAFTELTLAI
>Francci3_1016 transposase IS116/IS110/IS902
MLFVGDDWAQDHHDVEVQDETGRRLAKGRLPEGVAGIARLHALIGRHLAE
DAGPEQVVVGIETDRGPWVRALVAAGYQVIAVNPLQAARYRERYSTSGAK
SDAGDAHSLADMVRTDRHQLRPVAGDSDTAEAVKIVARAHQNLIWDRTRQ
TQRLRSALLEFFPAALAAFDDLDTPDALELLAKAPSPAEAARLTVAQISA
ALRHARRRKIPERAAAIRAALRAEQLPVTPAATTAYAAVVRAQAGLLAAL
NGEIARLEEQVADHFDQHPDAKILLSQPGLGPVLAARVLAEFGDDPTRYA
DAKARKNYAGTSPITRASGKKKTVLARYARNNRLADALHQQALSALSASP
GARSYYDAIRARGTSHHAALRQLGNRLVGILHGCLKTHTPYSEATAWTQK
ATLDVAA
>Francci3_3607 DEAD/DEAH box helicase-like
MLRCTIGDVVDLDTPLKALVGARAAALLADGLELRLVGDLLGHLPRRYHE
RGELTDLADLVVGETVTVQARVEKTERRPMRGTRKSMVRVTVTDGRHSLS
LTFFNQSWRERDLRPGTTALFAGKVDEFRGQRQLTNPEVQLLEPEESGEA
VSGAPFANALVPIYPASAKVPSWTLARCVRLALDSLDPVEDPLPDDLRSR
YRLPALAEAFRLVHQPANRGEIASGRRRLTWDEALVLQVALAQRRREVEA
LATTPRPQRPDGLLAAFDADLPFPLTTGQREVGKTIAAEIGRPVPMHRLL
QGEVGSGKTLVALRAMLTVVDTGGQAVLLAPTETLAAQHSRSLREMLGDL
ARAGELGADHRATRVTLVTGSMGGRARREALAEIADGSTGLVVGTHALLH
DEVIFNDLGLIVVDEQHRFGVEQRDALRARGRRPPHLLVMTATPIPRTVA
MTVFGDLEVSELTELPAGRSPIGTFVVPASQRSWTDRMWGRIRDEVAAGH
QAYVVCPRIGAGGEDGGGEDDPAESSAAGPDSRASRPAATVCEVLPLLVD
GEFADLRVEPLHGRLAPGQREATMNRFAAGEVDVLVATTVIEVGVNVPNA
TVMVVLDADRFGVSQLHQLRGRVGRGTAPGWCLLHTAAEPGSPAWERLSA
VAATSDGAKLARLDLAQRREGDVLGAAQSGGRRSLKLLELLRDEDLIRDA
REEAGRLVDDDPGLDRHPALRRLLDAVLDDTSVAFLEKG
>Francci3_1868 putative IS630 family transposase
MRYATGGGLTPAEQVERERVRRAAAGRFAAGASQAEVAREFRVTPKTASR
WHHAWEAEGESGLRSAGPGSRCRLDDRDLARLEAVLRRGPGPYGWEDQRW
TLARVRQVIAAEFGVDYTLAGVWLLLDRQGWSCQLPARRAVERDDAAVDA
WAQEEWPAVKRPRRPRTPGSVSSTRQDRG
>Francci3_4166 Integrase
MRSRMRRALIGYPDGGSGDFPRPMCLATCCGRRGAQPWGSLRRTQQRLTI
YRWVQRFTPLLVDAARPCRHRPGDRWFVDETYVKVAGRWTYLYRAVDQHG
QVIDVLASTRRDQAAARRFFTSALSHGRRPVEVTTDKAPVYPRILDELVP
EACHVDAARENNRIESDHSRLKARLRPMRGLKRLRSAQTISAGHALVQNI
RRGHYELGTDTDPHARLTAAFTELTLAI
>Francci3_0910 Recombinase
MINQRPPAIRGEQAIAYYRVSSSGQVNTDYDPEGISLPAQRVACKQRARE
LGVVLVDEYIDPGKSGKTIDQRPAFQEMIARIKADRHIKHVFVYALSRFA
RNRYDDAIMMMTLERLGVQLHSATEKNLDTTPAGQAMHGMIAVFNEYQVR
VSGEDIKYKMGQKAKKGGTLGVAPLGYLNVREQFEGREVRTVALDPERAP
FVVMAFELYATGKFNFHTLRDALTEAGFRTRPTKQWASRPISINKIGEML
RDRYYLGYVRYDGEEYEGRHEPLISQELFDRVQRVLYAERRAGTRHRTHD
HYLKGLVWCDRCRRRLIIMPGKSKSGVRYFYYICQGRLDHQCDLPYMAVS
RVERAIEDFYVNVRLTPDFRATVQAHLDEMMASTSDASRRLRARYERQLK
ELDVREDGLLDLVGDPDWPKEKLTAKIRAVREERTRLEDRLAESDRPLDT
GHEVLATVLRLLEDPQTLYRRAGVRARKVLNMAIFTKLHVDVQGEPVVTS
DDLKEPFAATVSAHRAWSLTEAVDGVLADRERQVAPARQSGAPQGDDAAL
DDLSDRDLLITALSGGCSSTGVLVREGGLEPPRPKAADPKSAASAIPPLP
RHLQSSDRLTWGALRLPPGYAPHPHQDRGQPRTTVDNRGQQWAAVDDRGQ
PRVAMEAAPVRPRTFVVSHSRGLPVSAGFGGRQGGQISTVQRAHAAGRRG
SSRGRRDDNRERAVVVSLTRTAIRPAAGLFRAGLARTTGRRFI
>Francci3_3983 NUDIX hydrolase
MTPLASKPTATTSYGDPVAGPVRINARALLVAGDRVLLANERGQKSFHLP
GGRVEPGETVQAALRRLLNEQAGIELDYLTFVGGIESISIERGGRTGVLD
VVFAADRSWGADFGSRLNDLDIVSVSLGSLLDLELRPADLRRLVPAWLTN
TRPAWYGSASR
>Francci3_0359 serine/threonine protein kinase
MATVPVGNRMVAGRYRLVARLGAGAMGTVWRAFDSVLETEAALKEIEFAG
GVAEAERADRVERALREARHAAKLRGHPHVVTILDVLLENGLPWIVMELV
PSRSLFEVVRSDGPLPVAEVARIGTAVLDALVAARAHGIVHRDVKPSNVL
IGTDGRVVLTDFGIATGDGDPTLTVTGVLGTPLYMAPERLNNQPATFEAD
LFSLGGTLYFAVEGRPPFERDTFGAMLAAILLQPPAQAHRAGELAAVLDG
LLEKDPGRRMTPARAHELLARAAQADPPRRAAHVDELSWHPGPTARAAPS
QVPGQAPSQVPGQAPSQAPSGHVDPGAPQTGRPAAPTELTAVEQDGAVLL
RWEPSTTQGASYRVSRVLVDPTAPGGRRERSLGITTATELFDAGVPRGVP
HWHEVVTTVSGESGQLRSVPVRTPTRTLFPPVTALRASMIDDAVALSWRP
VPGQGCIVIERTFAETSPLSGAKRRFRGSDGYFLDQDTAPGAIYRYQVWV
AGADASDSLVAPSGAAEVLVRVVARPRAVVDLEARSTLGGTVLRWTTVPG
AIVRIYVTEVPEQAGLVGTGPFGPADHEVGVGSLEGRARLVGESRRGRLV
DRNPAGVVVYTPVTITEDRAVIGAAVTHHAP
>Francci3_3981 serine/threonine protein kinase
MTARLPRHVAPLAVGEPTSLGPYQLLGRLDAGITGIVYLSRAVGGAQVAV
RALHPGDALRADMRARFAAAIDRARVLAGAHTCGVLDAAPAAERPYMVTE
FVGGPTLARELTERGPLSSDEIDRLARTLLETIDAAHRTVGALGDLTATD
VVLSVTGPRLVDLGVAAALRPLLLSPAAGSAGGTAPGTPHIVAGRSGHLW
PPPSSPGYVPGYGYGPAAGGWEGPAADIVAWAGIVDLAATGKPSGSVEPA
ARAGRRRADRSEHTAPIGEGVRRALARVRQADGASLPDTTELLALLAGSK
PRSTRAAWGRGRQAATAHGGRAGAATTGADPRPEPTDPRPPTRTAPPGKA
PLRTATGGRHSTGRRAPSPTGSVPDPMAARALPPLPTLPPPPTLPPPPTL
PPTPTPTLPPPPTLPPTPTLPPPERREASPRPDSGRADSGRAASVRSTSA
RRAAEPRVERLPADPLVRARRSRMRPVAAFAVAAVAAVAAAATAAGLNSS
PDDVTTGARAPLHESQPDDGGTVDTPSSGRSGSREGRPPVRTPAAPRPAA
SGSPATEELAVDEPGSGGPSAPGGGSVLGPTEEEQQNDLAAEDFALNTLR
LAAGGRSAPPGRSSGGPASAASPPAHQAPPVLAAAPPVPTRPSAAAAGVA
AGPIEARTPTTRTPTTGTPATGGRTTRAATTASAASVTSTATKPPTRPAT
TRPATTRPPATTAPARRSLAPVQVDVVTLAPR
>Francci3_4115 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_0004 DNA replication and repair protein RecF
MHLTHLSLVDFRSYPALDLTLGPGVATFVGGNGQGKTNVIEAISYVATLA
SHRVAGDAPLVRDGASRAVIRARIVRGDRAALVEIEIVPGKANRARLNRA
PVARPRDIVGLLCTVLFAPEDLALVKGDPAQRRQFLDELLIARTPRMAAV
LADYDRVLKQRSTLLRTAGTARRAGGQGDLRTLDVWDGYLAAHGAEVLAA
RLALVDALRPAVAAAYEAVAGAESATALDYRSSVTLPDILHASGPPGPPG
QPEQPGAGRPDPAAPDRTMLAEAIRADLEAARPREVERGMTLVGPHRDDL
LLSINGLPARGYASHGESWSLALALKLASFDLLRADDREPVLLLDDVFAE
LDTRRRGRLAELVASAEQVLVTAAVETDVPTELTGVRYAVAGGEVQHAH
>Francci3_2884 transposase IS66
MIRVVAELSASSWESRLAAAEARIEELAVGQAASVAANERLVSVNERLRR
VVEDSAARHEVELGAVRAERDRAVRRVEELELEFAELRRRLSMDSTNSSV
PPSKEPVGAREKRKAERWASSERVRSKDRKPGGQPGHPGAGLAREETPDR
SLSADPPSECSSCQADLSDAGVLADGWAQVWDLLEPVLEKVEWVLPRRRC
ACCAKVTTAVVPGVRHAAAGAVSYGPRLHGAAVLLASEGNVPVERAAMVI
GALLGVPVSAGFVARANARLAQDLKAAGFDEAMKAALRAEPVLCGDESPV
NVLRKDRDEATGTALSGTPHLLVVRTPAPGLVWYAAISSRSSGAIDATGV
LTGWHGYLTRDDYAGWHQYDPTLAGVQLCCAHLIRALRAVLILAPKVQKW
AGRLIDLLREANGLVVAARAAGNSRLGQAAIDALRARYDADVRIGELTNM
SRPWADEKNHPGLVLARRLAAKADQVWLFTTDFKIPWTNNPSEQAIRLPK
RHQAVSGYWHTPTMAAYLRVRSYLVSARDHGLTAVDAIRLALAGTPWMPA
RRAASPTHALAT
>Francci3_2197 RNA-directed DNA polymerase
MSAIAAIPAASGSRVDPVRVLQHTLYRAAKADPGRRFHALSDKVHRGDVL
ERAWALVRANRGAPGIDRTTIAQVEEYGVSRLLDELGGELREGGYRPLPV
RRVEIPKPGDKGASRPLSIPAVRDRVVQTAVKIVLEPVFEADMAGCSFGF
RPRRSAHDALQVLVDACGKGRRWVVETDIADCFTAIPFEGLMQTIKERVC
DQAMLRLVRAVLRAGVMRDGEVRRPVSGMSQGGPLSPLLCNVYLNRLDRE
WDDGDGMMVRFADDIVVMCWSEDQAVRAMDCLVRLLADLGLETKAAKTRI
VHLQVGGDGLEFLGFHHRLVLSRGDRGRRRVAFLARWPSDRAMQHARDRI
REITGRSRLLLNPEAVVRELNLFLRGWAGYFRYGYSARRLAMIRRFAQVR
LARFVRARHRRSMGFGWRVLIRSRPVDLGLVSLSGIVVTPRAGKLWRERP
NAGGERRR
>Francci3_4083 Transposase and inactivated derivatives-like
MRRRRACPPVPTSEFAGFRFPPEVIILAVRWYLRYALSYRDVEELLAERG
IDVDHVTVYRWVRRFTPLLIDAARPCRHTPGDRWFVDETYVKVAGRWTYL
YRAVDQSGQVIDVLASEKRDLAAARRFFTRALSHGRRPVEVTTDRAAFYP
RVLDEQLPAAHHVDDQYANNPIEADHGRFTARLRPMRGLKRLRSAQTIGS
GHTFVQNIRRGHYELGVDADLRNRLTAAFTELTLAI
>Francci3_3733 hypothetical protein
MFLTQPRGHGATGLDDHTNDATRLLDIDGLTVSRVEMLADRTRQVWLATA
DETARACPGCGVFARRVKGVVTTRPRDLRYGPSPLRLVWVKRRWYCQEPL
CAKRSFTESLPAVPARARLTTRLREQAGRLVVDGICATVVASARHLALSW
PTVMDAARDVAAPLTDTASPPVDVLGIDETRRGRPRWRPADRVPTPASPE
PAGGGPAVPAPAETVEPDPAGGEPAKTRVLADRWHVGFTDLAGGQGMLGQ
VGGPHRRRRRVLAGQPDTVLARPDHPRGDRHVHGVRRRGPPLPAGRHPGR
RPLPRRETRERRRHRGPPPGHHPATGPPRPRHRPRMEDP
>Francci3_3809 DEAD/DEAH box helicase-like
MSPRPPCSRRRLVTCAPRVAIEGTGAHGFSGRGRRAAHRERVQRRHRSEK
TRSVPLSVTAEVPTDDLTPYEPQTTPSTPGSPAAPTFAELGVRAETVSAL
TEAGIVHAFPIQELTLPLALARNDIIGQARTGTGKTLAFGVPVVQTVLAA
KEGADGRPQALVVVPTRELCVQVTADVTRAGARRGLRVLSVYGGRAYEPQ
LSALRAGVDIVVGTPGRLLDLARQHVLDLAGVGTLVLDEADEMLDLGFLP
DVERIMSQLPTERQTMLFSATMPGPVISLARRFMKRPVHVRAEQPDEGRT
VPTTRQHVFRAHALDKMEVLARVLQAGGRGLAMVFVRTRRTADKVAEDLA
KRGFAAAAVHGDLGQGQREQALRAFRSGKVDVLVATDVAARGIDINGVTH
VVNYQCPEDENVYLHRIGRTGRAGESGVAITFVDWDDLPRWTLVNKALAL
PFDGPVETYSTSPHLYEALGIPAGAKGTLPHAARTRAGLAAEDIEDLGQS
GRGGRRGSRTGRDQDRSEPAAVPTRTRARRRTRGGGAAAAGAGLAIAADP
ADPADPVDEDGRKAGAPVVDGAGQTGLVEFTGTAPLTDTDTDTARVVSAL
ASETGVEAEESPRRRRRRRGNRGRGTGTMREAGDGTEADADAPPRAESA
>Francci3_2124 transposase IS116/IS110/IS902
MDVVIDRVAGLDVRRDTVVAAVRVGGRGGGRRGEVRTFATTGAGLTRLAG
WLSEQRVSLVGMESTGVYWKPVFHLLEDRFECWLLNATHVRNVPGRKTDV
ADAAWISDLVAHGLVRASFVPPKPQRDLRDLTRARRIVVEEKTREIQRLE
KLMQDAGVKLTSVASKLLGVSGRAILEKMIEGEQSLEYLADQARGRLRSK
IPQLQEALAGTFRSGHHGFLAAQLLARIDLCDEQIDELDHRIEVMIAPFR
ETVDRIRTITGVGEVTATVLLAEVGLDMSRFPTAGHLASWAGICPGNNTS
GGKRLSGRTRHGNKWLRTALTEAAHAAARSKDTYLASHHAQVRGRRGVLK
AIGATRHDILIAYWHIIANKTVYQDLGGDWHARRRRDPERRRKNLVGELE
KLGYTVTITPAA
>Francci3_3448 ATP-dependent helicase HrpA
MSPRDAHRLGRRLAESRRTREPAARQRALQAIAAEVDRASLRLERRRASV
PTLDYPDILPVTQRKDEILAAIRDHQVVVVAGETGSGKTTQLPKICLELG
RGVRAMIGHTQPRRIAARTVADRIAEELRTPAPQMGGVVGYQTRFTDQVH
ENTLVKLMTDGILLAEISSDRQLRRYDTLIIDEAHERSLNIDFILGYLRS
LLPRRPDLKIVITSATIETARFSAHFAGAPVIEVSGRTYPVEVRYRPLVP
AASGPAGGPASGPASGPADARRTGRENGEAERDQTQAISEAVDELCAEGP
GDILVFLSGEREIRDTAEALTREQRPNTEIVPLYARLSAGEQHRVFQPHT
GRRVVLATNVAETSLTVPGIHYVIDPGTARISRYSHRTKVQRLPIEPISQ
ASANQRKGRCGRTADGICIRLYSEEDFAGRPEFTDPEILRTNLASVILRM
ADLGLGEMATFGFLDPPDPRQISDGELLLAELGAFDATASDPRHRITPLG
RRLAQIPVDPRLARMVLAADEQGCLREVLVIAAALAIQDPRERPVEHQQA
ADARHARFADPTSDFLAYLNLWNYLRDARGELSANQFRRMCRTEFLNYLR
IREWQDVHGQLAAVVRGLGLTPRDDSSGAADPRTVHRALLTGLLSHIGRY
DPERREYAGARGGRFALWPGSVLARRSNRAERTGPTATSANPAAALADPG
DPAGEDAPKRPSGPPAWVMAAELVETSRLWGRTAARIDPDWIEPLAAHLV
HRTYSEPRWSRRQGAVLADEKVTLYGVTIVASRPVQYSRIDPVLCRELFL
RHALVEGDWQTRHTFFHANRELLADVEELEHRARRRDIVVDDETLFHFYD
ERVPADIVSARHFDAWWRKARRTTPDLLDFPRSMLVTADATGITEADYPD
VWQAGDLALPLSYQFEPGSAADGVTVHIPLAVLNQVGAEGFEWQVPGLRE
ELVTALIRALPKAVRRSFVPAPNYAKAVLANITPRQAPLLTAVEHELRRM
GGPEIPRDSWSLAGVPDHLRFTFRVEDAGGRVLAEGKDLDAIKERLRPRT
REAVAAAADGLERAGLRAWGDLGTLPKVVELRRGGNVAGGHVVKAFPALV
DEGGSVAVRAADTEAEQRQLMWAGTRRLVLLGVPSPVRGLNARLSNAAKL
ALSHNPHRDAADLLDDCVRAAADRLIAAAGGPAWDEAGFTALLAAVRAGL
PEAAFEVVREVQQVLGLAHAVDLALRELRAPAVAASVADARDQLISLIYR
GFVTDTGADRLADLVRYLTALERRLERLPRDPGRDRLNTATVGRVQDAYR
ELLATVPAGREPAPEIRRLRWMIEELRVSLFAQSLRTPYPVSEERVYRAI
DAILG
>Francci3_4400 helicase-like
MTDGPLIVQSDKTLLLEVEHPSAVEARSAIAPFAELERSPEHVHTYRVTP
LALWNARAAGHDAEQVVDALVRFSRYAVPHALLVDIADTMDRYGRLRLEN
DPAHGLVLRALDRVVLVEVARAKKVVPMLGSRIDDDTIAVHPSERGRLKQ
VLLKLGWPAEDLAGYVDGEAHRIDLRQDGWELRDYQKGAVAGFWEGGSGV
VVLPAGAGKTIVGAAAMAQAGATTLILVTNTVAGRQWRHELLRRTSLTEE
EIGEYSGERKEIRPVTIATYQVMTARRKGEYLHLELFGARDWGLIVYDEV
HLLPAPVFRMTADIQSRRRLGLTATLVREDGREGDVFSLIGPKRYDAPWR
EIEAQGWIAPAQCTEVRVTLTEDERMAYAVAEPEERYRMCATARSKRAVV
ERLVRQHSDDRVLVIGAYLDQLDELGELLDAPVIQGSTRNRERERLFEAF
RTGEITTLVVSKVANVSIDLPEAGVAVQVSGTFGSRQEEAQRLGRVLRPK
ADGRTAHFYTVVARDTLDQEYAAHRQRFLAEQGYAYTIVDADDVC
>Francci3_1633 excinuclease ABC, C subunit
MADPASYRPAPGSIPETPGVYRFRDEHGRVLYVGKAKNLRARLANYFAEL
HTLHPRTQHMVSSASSVDWTVVSTEVEALQLEYTWIKQFDPRFNVRYRDD
KSYPSLAVTLHEEFPRLQVMRGPKRKGVRYFGPYAHAWAIRETLDLLLRV
FPARTCSGGVFKRAAQVGRPCLLGYIDRCSAPCVGRVDAATHREIVEDFC
DFMSGQTGRYLRRLEREMQQAAQAQEYERAARLRDDIGALRRAVEKQAVV
LPDGTDADVIAFAEDELEAAVAVFYVRGGRVRGQRGWVVDKLDEVTTADL
VEQFLTQEYLDGTGAASTGTAGSTVPTTTAGSQGEGIPREILVPALPPDV
EAVTELLGAARGSRVEVRVPRRGDKRTLLETVERNAKQAFALHRTKRASD
LTARSRALAELQEALELPDAPLRIECFDVSNTQGTNVVASMVVFEDGLPR
KSEYRRFAIRGIAGREGAGREGAGDDVASMYETIHRRFSRYLAERSRIAD
IADIAELGDIAGAPGAAQPIDPGTGRPRRFAYPPNLVVVDGGAPQVAAAA
RALDELGIDAGPGGVALCGLAKRLEEVWLPDTADPVILPRTSEALYLLQR
VRDEAHRFAIAYHRQKRSTAMVASALDEVPGLGDTRRKALLRHFGSVAKI
RAASAEQIAQVSGIGPRTAAAIVTALARSAPGRADAPAPVVDPRTGEILD
TETVS
>Francci3_2345 DNA methylase N-4/N-6
MSSSVVAGGATSVVMGSPVGQAAQVLRGLPDASVHCVVTSPPYFGLRDYG
EPGQIGLEPTPAAYVARLAEVFTEVRRVLHPDGTCWLNLGDSYAGKANGG
PSVGLTRRADRAELIPPRRNTTAAAPYKSLLGIPWRVAFALQDAGWTVRN
AIVWAKTNAMPESVTDRFASRTETLFLLTRSARYHFDLDPVRETPVDPTG
GAEWAQRRKQGVPGRRGRNPESSVTAADRDFAAHQAGRNPGDVWQIPVAN
FPGAHFAVFPPEIPRRAILTGCPPGGVVLDPFSGSATTGMVALQLGRRYV
GIDLNPDYHRLALRTRLLERPLPGIDQPAS
>Francci3_2058 hypothetical protein
MWRRTGRRCWPSWRPCAGRRSAPRRGREAAATAASNAGPCRPPRSAPGSA
SPTRSSPSGSSATAGPWPGGPATAETVHAVTSLPTHHASPRLLAELAQAH
WAIENRLHWVRDVTYDEDRHRARTGNAPQVMTSLRNLAITILRLTGAKNI
AKALRHHARHPERPLETIKKAGC
>Francci3_0881 transposase IS66
MLRCVTVVESGAGAAASGEVAEGAALLAENAWLRARVAELLTDIAGLVAR
EATREAEVVELRLQLEALQAELATLRRMLFGRSSERECGGSPAVGSPDGG
DGCGDGARGEAAGSAGRRRGPGARSGRRSYDHLSRDEVDCDFEGGGYGCL
SCGQPFTPWGEHVVEQLDWLVTVRVRVSRRRRYRRGCRCGGSLTVTAPGP
SKAIGKGLFTHRFLAMLIVERYVAGRSQNSLVTGLARHGAQLSPATLTGA
CAQVAGLLAPLAEQIVGRSRGSWHLHADETTWRVFTPTGGGGPARWWLWV
FLGPDSVCFVMDATRSTAVLAEHVGLDPDSGQLTDDADGGPRRLVLSSDF
YTVYVSAGRRADGLVNLYCWAHARRYFVRAGDANPAQLGIWARQWVERIR
ALYTAHGELAAAWHTAAAAPSPATEKRLAAAYAGWDTAITVIDTVRREQT
ASPGLQEPARKALATLDREWDGLVAHRDYPMIGMDNNPAERAIRGPVVTR
RNAGGSRTEDTARHAATIFTVTATAAMHNLNLLTYLENYLDACGRAGGKP
PTGADLDRFLPWAASPEDLTTWQQPPG
>Francci3_4223 insertion element IS466S transposase
MAACPEMTALAGLVRSFAAMLTPADGNAERLTAWISQARTEDLPHLHAFT
RGLELDRDAVNAALTCAYHNGGTEGVNTKTKLIKRQMYGRAGFALLRHRI
LLG
>Francci3_4305 DNA topoisomerase
MPARTKTTTRATARSSAPVAEPTEAGLPPETGSSEASADVPAGATSRTAN
GRTEAGHSANGSGGATGAHGSRATGSGPGNRLVIVESPAKAKTIAGYLGP
GWQVESSIGHIRDLPRSAADVPAAHKGKPWARLGVDVDNDFEPLYIVTPD
KKPQVSKLKALVKDASELYLATDEDREGEAIAWHLLQTLKPTVPVKRMVF
HEITPQAIQRAVDNPRDIDKNLVNAQETRRILDRLYGYEVSPVLWKKVMP
RLSAGRVQSVATRILVERERARMRFHSAEYWNIEGLFGATVARQAWSGAD
GTPGSGPDGPGAGSIRGDGVEKTPLPATLIALNGNRIATGRDFSPTGQLV
SSGVTRLDEATARSLAERLADAAFTVRSVETKPYRRSPYPPFMTSTLQQE
AGRKLRFSSQRTMQVAQRLYENGYITYMRTDSTNLSKTALTAARAQAASL
YGPEYVPARPRTYAKKVKNAQEAHEAIRPAGDHFRTPGEVRGELDVDSYR
LYELIWQRTVASQMADARGTSATIRLGATSSAGEDAEFSASGKVITFPGF
LRAYVEGADDPDAELEDRERRLPDVRQGDPLTTRSLTPRGHTTSPPARFT
EASLVKTLEELGIGRPSTYASIIGTIQDRGYVWKKGSALVPSFVAFAVVG
LLEDHFTRLVDYRFTATMEDDLDDIAAGTAASTDWLTRFYFGTGDGTDPA
ADGLKHLVNERLGEIDAREVNSIPLGETDDGTLLVVRVGRYGPYVQHGER
RASVPDDLAPDELTVDKALGLLAAPSGDRMLGIDPASGATITAKAGRFGP
YVTTDTDPPRTSSLLRGMSLETLTLDDAVRLLTLPRILGAGDDGEEVTAQ
NGRYGPYVKKGAESRSLESEDQLFTVTLDEALALLSQPKARGRRQAAQSP
PLRELGTDPASGKPMVVREGRFGPYVTDGETNASLRKGDTVETITDERAA
ELLADRRARGPATAKRPARGTAKAGTAKTGPKTTKAKPDTAKSGTAKSGT
AKTGTAKTGTAKTGTARSKTARTVTDDGGGSDGSDDSGSSSSSGTRRTD
>Francci3_4169 hypothetical protein
MRSMVDPSTYPRTQRRDEARPASWSATCRASTPPRFTPLLADAARPCRHQ
PGDRWYVDDTHVKIASRWTYLYRAVDQHGRVIDVLTSTPVPAATRRLPAG
SSTGRRRMGAGVWAPPGRGHYRQNPGLPADPRRAAPEACHSGAENAATSD
FAALRDISRNGPIRNVSSGNRAVRGPIRRPTPRTPRRTGDLAVCPLRRIS
PGHRPRTTSGIPQAQAGTGAVRHASIKVTSSFTRSDRALTARFTTRMR
>Francci3_4307 DNA-directed DNA polymerase
MTVWETLVGQDAVVAALSAAASAASAGLASGGGAPAAARAGMTHSWLFTG
PAGAGHQDAARAFAASLTCENEPPGCGECTGCHTVLTDTATDLRTVQPEG
LSLGVKDVRALVRDAASAPTAGRWRILLITDADRLTEAAANALLKALEEP
ADRAVFLLCAPSVEDVLPTIRSRCRTVALRLPSATDVRDALVREGVNEQV
AETAARASQGHLALARRLASDERVRANRAAVLRIPARLGRVGDCLAAAAD
LVSAANADAEAANAERDEVETEALKTALGVGATAVGGKSRSARAPKAGAK
GRTMVVRGAAGALKELERTQKSRGRRTVLDALDRALVDLAGLYRDVLVQQ
LGAGVDAIHPDHCVEAAGYAAACTPEQTLRRLEAVLATRASLARNPGLVP
LLAVESLTLALRDT
>Francci3_2275 5'-3' exonuclease
MCGEGGTVFVGERAGADGRLMLVDTPSLYFRSFFGVPRSVRAPDGTPVNA
VRGLLDVLARQIGEVRPKRVVACFDADWRPAFRVALLASYKAHRVADVGA
GEAGGGGQVERVDGDLLAQLPIIDEVLDAFGIARAEAAGFEADDVIATLA
TRHRGEVDILTGDRDLFQLVDDAAGVRVRYAVEKFAVVDEAAVTARYGVP
GRAYADFAVLRGDPSDGLPGVAGIGAKTAAALLTRFGSLAAVLAAVDADG
GAGLPAGARRRLRGARDYVRAAVGVVAVVRDVPVGAFADLLPAAPVDPAA
VGALAERWGLTGSCTRLVRALAEAAGG
>Francci3_1001 reverse transcriptase
MTGYAQRSSPDRVLTRPDRPALRIIQRCSLGSPISPVLANLFMHYAFDAW
LARTYPGVAFERYADDAIVHCDSRSRARHVLAALDDRMGQVGLRPHPTKT
RIVYCKDSNRRGSHEHTSFTFLGYTFRARKARSRHGKFFTNFLPAISKEA
QKKVSGQVRRWRLHLRTGHTLDELARKINPIVRGWAQYYGAFYRSALLPL
LQRINAYLVRWLRKKYKRLRPFHKALACWQRITRQRPGLFAHWAWTSNAW
R
>Francci3_1872 putative IS630 family transposase
MLDAARGYSNARIARRLCVTEDTVRTWRGRFARRREAGLVDLPRSGRPRR
ISEAERAEVVALACQLPAETQVPLARWSCPELAAELLSRGLVDAISASSV
RRILAEHPIKPWRYQSWIFARGPGFAAKAKVILDLYEGFYQEEPLGPEDR
IVSIDAKPSIQARARIHPTTPPAPGRIIRVEHEYERHGALALLPALDVQT
GRIAAVLTPPTTGIAPFMELMGQVMAQDRYRTAKRVFVIVDNGSDHRGQA
SINRLRAAHPNRILIHTPTHASWLNQVEIFFSLVQRKVVSPCDFASLDVL
ADTLTAFVDRYNVTATPFKWKYTAADLERHLARLDDDTAPAVAGSVARLP
VPPPDTNDHGSRVESEPSARALAQAA
>Francci3_2962 transposase IS200-like
MTSGGDCRLQATGESGTDRNGYTGHAGIINGMSRTVQVGAGGAYDLGYHV
VWCPKYRRAVLVGPVRDRLDGLIREKCAEHDWLIVALEIEPDHVHLFVKA
HPKHAPSYIANQLKGFTSHVLRGEFAHLRSRLPTLWSRSYFVATVGAVSA
ETVRRYIDTQNERPWRTGVPR
>Francci3_3730 RNA-directed DNA polymerase
MQYEVTTPTVADRIAQTVAAVRLEATVEPIFHPGSYGYRPGRSALDAVAA
CRQRCWKTDWVIDLDIQEFFDSVPWDLVVKAVEAHTSDRWVVLYVQRWLK
APLQLPDGTLRARDRGTPQGSSISPVLANLFMHYAFDAWLARTYPGVAFE
RYADDAIVHCDSRSRARHVLAALDDRMGQVGLRPHPTKTRIVYCKDSNRR
GSHEHTSFTFLGYTFRARKARSRHGKFFTNFLPAISKEAQKKVSGQVRRW
RLHLRTGHTLDELARKINPIVRGWAQYYGAFYRSALLPLLQRINAYLVRW
LRKKYKRLRPFHKALACWQRITRQRPGLFAHWAWTSNAWR
>Francci3_0953 ISRSO5-transposase protein
MGTGKVFGECYPTRTGDDFLEFIKKAVEPHRDKEIHIILDNLSTHTTPDA
MKWLEENPRITFHFTPKGSSWINQIENWFGIITKQPIRRGTFSSVKVLIK
QIRDYIEHWNSTAEPFVWTATAGEILAKIELVETNIKKLVANNG
>Francci3_0270 DNA polymerase III, subunits gamma and tau
MSTALYNRYRPATFAQVVGQEHVTDALRKALRTGRLHHAYLFSGPRGCGK
TSSARILAASLNCVQGPTPEPCGVCDECVGIRTGASMDVTEIDAASHGLV
DDARDLRERAFFAPASARFKVFVVDEAHMVTAAAFNALLKVVEEPPPYLK
FVFATTEPDKVIPTIRSRTHHYAFRLVPPGVLREHLASVCAREGVVVDPA
VLPLIVRAGAGSVRDSLSVLDQLLAGADDDGLRYDRAVALLGMTDGVLLD
ETVDALADRDGAALFTVVDKVVSAGHDPRRFATDLLDRFRDLIVMAAVPD
ADARGLLESFSPDQIDRMRAQTTRMGPAELSRTADVLHNGLVEMRGTASP
RLILELIMARALLPTASTDPAALLTRLERLERRAVLGGEEPEPGPPAPLP
VGRAPAPVHPPAADPEPMPLAVRASARTGPVSRPAAGPQPGSPGPAFTSG
DRPSGPAGVPGTAGTPGTGRIDAAGLRMLWDELLALAGRHSRKTHAILKD
HATVADVRGDEVVLAFATPTMGRMFGQGNNAELFCTVLAERLGGTWRVTV
APAGGSGRGSGRPAAGPGPGGGGYSGPPGPSKASGFAGSGGAPGPTGQAG
ASRSPAPAPAPARAAGPAGQSSGPSWGPRLDPPVGAENRAVSPGSPRAMP
GQMRGPAGLSPDTGNGYGGSTDGTEGGTTDPGRGPDGYGDPREHGDPREH
GDPREHGDSHGAGHGSTGPGPGPEGGPGPEGDRPWSATGRDGSSPGPANG
VAAAASAPPAGDEPGPRREDEPSLDDEAVPSGPGGARGGEEAAVSLLRSG
LGATIIEQISSG
>Francci3_4121 hypothetical protein
MFCSTNAIESVNARIRRRAVRARGHFPNETAALKCIYMAVMSLDPTGQGR
KPGLRRGSSPL
>Francci3_2317 hypothetical protein
MGMESTGVYWKPVFRLLEDQFECWLLNAAHVRNVPGRKTDVADAAWLADL
VAHGLVRPSFVPPKPFRDLRDLTRARRTVVEEKTREVQRLEKLMQDAGVK
LTSVASQPLGVSGRAILEKMIAGERNPEYLAELARGRLRGKIPALKVQPA
GQPGQHLHIERVSCGRSGGHRRSWTRPACGQKKGKPDRSEPGRPRQTGLE
DPCPDRRGRAAAGRGGLPGEPARQRGLRPAGGQHPGDPFPTWPAAAPPGH
AACGQGLRPAGTTAFPAPARDRGADRPPWCGQYRTPRPAPVESRENAGLA
RWLPPPEPAL
>Francci3_2941 transposase, IS4
MPVQSRMPVTPTDLGQAGSGQLVRMRRSLRVLGAHGGEVQGLADVLAGVP
DPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTALGA
RVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRR
VVAVDGKTLRGAAGPEGRAPHLLAVAEHGTGVVLAEHEVGAKTNEVTAFA
PLLRELHSHDPLDGVVVTADALHTTRAHADLIVTELGAHFVFTVKANTPA
LSVDCHQATDWTKIPIGHSAEGRAHGRFERRTIQLAQASEAIRARYPHAR
TVARIRRHVRRTVTTGTGRARVTRTIPSTVTVHVLTSLTLDAVTPADLAG
YARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRNLIIGLIRLA
GHNRIAPTIRRIRHDNALLLAILTLDNPADLHQ
>Francci3_1727 Integrase
MDVAVACRVLGVSRSGYYDWIGRPPSLREQENTLLAKQIERIHLESRGTY
GWPRVHAELALGLGVPVNHKRVARLMREAGLQGVYRRRARRGPVAEATAE
DLVNRQFAVDAPDRLWLTDITEHPTGDGKLYCAAVMDAYSRRIIGWSIAH
HIRTELVLDALGMAILRRRPPEKQTILHSDHGTQYTSWAFGNRLRIAGLL
PSMGTVGDCYDNSMMESFWGTLQLEVLDRHTWENRDELANAIFEWIECWY
NPKRRHSSLGMLSPIDYEAAHLPRSSPDDDR
>Francci3_0115 transposase, IS4
MEFSTLHVRQGRPCPPMPSSPVAAVVDELAAQLPDRINQPRLITEGDQTA
LLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLP
ADARAGLGLTGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAG
HQPPQRRRRVRRVLAVDGKAMRATRHGTHPVHLLGVLDHARGVVLAQVDV
DEKTNEIPLFSTVLDQIPDLTDVLITVDAMHAQTAHADHLHARGAHLLVT
VKRNQPTVHTRLKTLPWKDVPVGHTTTGRGHGRIETRTLKAVTVPAGLGF
PHAAQAIGDHPHLPSDQHEQEEDRGQAPPAT
>Francci3_2554 reverse transcriptase
MLHCASERQAHQVRQAVEDRMAEVGLRLHPTKTRIVYCKDANRRLGHEHT
AFTFLGYTFRARAARGRNGRLFASFQPAISRQALTALGRQVRHWRLHRRS
NVTLADLARTINPIVRGWMAYYGAFYRSALSVLLTRINSYLVRWIRKKYK
RLRPMRKASAAWQRAVTGSPGLFAHWRWVPTFR
>Francci3_3345 CRISPR-associated protein Cas4
MNPSDWWPGGDAHAGRLPISALEHHAYCPRQAALIHLDNVFADNIETMRG
NVAHAHVHEPSPAVPTPQGNRIRQVTGLQVWSDRLGLYGVCDVVEISSTS
AIPIEHKVGPYVPGGPADLQAGAQALCLRDMLTLDVPYAEVFSHTDRRRH
RVDLTDTLATRIITAAERMRDILTTAALPEAVADRRCRRCSLHHDCLPEL
ANAAAGTHDLFTPRPAGHWHG
>Francci3_3346 CRISPR-associated protein Cas1
MAELLNTLYATTPGTSLHLDGDAVRIWHPDNDKGRRLLPLVRVDHIVVFG
GVTITDDLLQRCATDRRSVTWLTGNGRFRARVEGPTGGNPHLRIAQHDHF
RDDERRLTLAMSYIAGKLQNSRQLLLRAARDATGTRQTALRDTAAHLADA
LPTLRDTTNVAEAMGVEGQAARRYIATWPHLLTPHATVTAPAGRTSRPAT
DPVNAALSFGYGILRIAVHGALDHVGLDPHIGYLHGIRPGKPALALDLME
EFRALLVDRLVFTAFNQRQLTDADFEHHPGGSCQLTESGRKNYLTLWSQA
RARTWPHTLLTHDTPAATLPLLQARILARHLRGDIPRYIPWSPT
>Francci3_0677 ATP-dependent DNA helicase, RecQ family
MFPAGLADPAEPGPGGLRERAEELLRALAGPAAALREDQWQAIDALVTHR
RRVLLVQRTGWGKSAVYFLATALLRGAAGRAAPTVIVSPLLALIRNQAAA
AARLGIRAGEIHSGNVTEWDEVYAALRSGELDVLLVGPERLNNPTFRDDY
LPELAATTGLLVIDEAHCISDWGHDFRPDYRRLRTLVAGLRPRVPVLATT
ATANTRVVEDVSEQLGTSAPAGPASAGTLVLRGSLDRESLRLAVVSLPQA
EHRFGWLADHLGELPGSGIVYTLTKAAAEELTIFLRGQGHAVATYHGGIE
AAERIVAEEDLLANRVKALIATSALGMGFDKPDLGFVVHIGAPSSPISYY
QQIGRAGRALDTAEVVLLPAAEDRDIWRYFADTSFPPEHVVRRVLGVLAG
ADRPLSTQALLASVDLGPSRLDQMLKVLDTDGTVRRVRGGWVSTGEPWLY
DEERYRRVAAARSREQQAMLDYIVTTGCRMEFLRRQLDDPHAAACGRCDR
CTGRSWSSAVSESSRTRARDELRRAGVPVEPRRMWPTGMGRLGVDLAGRI
PASAAAEPGRALARLSDVGWGNRLRPMFTSATLAVPGAAVPGAAVPGAAV
PGAAVPGAAVPGALDGPLPGDLVEAVVATLAGWGWQRRPVAVVSIASRSR
PRLIASLAERISTIGRLPLLGQLDRVAGGPHTGQVHNSAHRLAGLWDAFT
VSPALAAALGELGGPVLLVDDRIVTGWTMTVAARLLRQAGAPSVLPFALA
VDTA
>Francci3_3395 serine/threonine protein kinase with WD40 repeats
MPQPATALRIDDPTHIGVYRLRGRLGVGGMGVVYLAEDPHHHPVAVKVIR
VEFAADPEFRARFRHEAEAARRVPRFCTAAFLDADPNAERPYLVTDYVPG
PTLAQAARRPLRGAELEQVAVHIAVALTVIHGAGVVHRDLKPSNVILSPT
GARVIDFGIARAHGMTMFHIDEQIGTPAYTAPENIDGAPPDPAADIFAWG
GVVLYAATGQPPFGDRSSELLLHRIRYDHPTHLNQLHGRLHDTVTAAMAK
APEQRPTAEQLYRMLTSTQPPTPPPAAPTRHGRQGPQPAPPPPAIPPHYA
RPIPPARRTRITILATATAVVLAAAVLLITDRPRHTPTTANRPDRHGAAA
ILADQATTDDPDLAVRLAVAAYRLNPDPAARIALLAAAARGLPPLATFRH
SGKILSVAISPDGHTLATGSADHTARLWNPTHPDQPVATLGHDDGVNHVA
FNPAGTLLATTSDDTTIRLWNITDPHHPDQVNTLTLHTGGTPYGAAFNPA
GTLLAISTSTGAVLLIDITDPRATSTLATFTPHSYIAGNVAFSPDGHTLA
TASLDGTARLWDVTDPRTPRPLATLAPGPTFDATFSPDGTMLATAQQDGT
TLLWTLTTPTQPQPAATIPETGMTTTAVFAPDGTTLATASTDGTAHLWDL
TNPRTPRPLATLTGHTGPVETLAFDGTMLATAGDDTTARLWDLNPTSLTR
RACTTPTGRLSEDEWHRYLPTFPYQPPCP
>Francci3_0120 response regulator receiver protein
MLTYPGVVDLPESTLTFLAGLLAEDRAQRRTWRKLPPPEQALLVLVHLRK
GERYEQLAEGFQVSVGTVHNYIREAVRLLATHGRTLLAAVWIFAWTQSNF
LILDGTVVRTNRVRAHNKLYYSGKHKYHGINLQGLTDPYGRLIWISEGLP
GSVHDLTAARMHDILDLIDRSELYLYADKGYVGGEGDRLLVPIKKPKNND
LPDRDKEANRTHATTRSQGERGFAVLKNWHIFDRFRGSRVSQFS
>Francci3_1763 putative transposase
MVVGWGSGSGWLLAVTPEEMAVVRPVVEEFAARMFVDLPRRDQRAKGELY
LRGLLLDGKRKSVQPMAERLGVDHQQLQQLQQFLRNSTWDYAEVRARLAG
WAVGFVRPEALVIDDTGFVKDGAASPGVARMYSGTLGKVGNCQIGVSVHA
ATDWASAALDWRLFLPASWDDTGLDDPDEAAVVRARREKAGVPDGARHRE
KWRLALDMLDEIAGWGVPARPVLADAGYGDCAQFRQGLTDRGLVYTVGVT
PTATAHPGAAEPVTAPYTGSGRPPLPAYPDPPATLKALVLAAGRGALRWV
TWRRGTHKTPGNTAALMRSRFLALRVRPAAKALTRDDDRSLPACWLLAEW
PPGKNEPTDYWLSTLPPDIPLRRLVRLAKLRWRVEHDYRELKDGLGLDHY
EGRSFDGWHRHVTLTCLAQALCTQLRRTPKAPAPA