TitleGenColors Logo

Gene list

Applied filters:

COG category: Replication, recombination and repair
Organism: Thermosynechococcus elongatus BP-1, BP-1
Gene type: CDS

Number of genes found: 196

Free access
Sort by:

 



# Thermosynechococcus elongatus BP-1, BP-1

>gid:519778  apcA  allophycocyanin alpha subunit
MSVVTKSIVNADAEARYLSPGELDRIKNFVSTGERRLRIAQTLTENRERI
VKQAGDQLFQKRPDVVSPGGNAYGEEMTATCLRDLDYYLRLVTYGIVAGD
VTPIEEIGLVGVREMYNSLGTPIPAVAEGIRAMKNVACSLLSAEDAAEAG
SYFDFVIGAMQ
>gid:519421  dnaA  chromosomal replication initiator protein
MISSAEDLWHQILERLQLLLSRPTFETWIKTATVQSFDGQTLTICTPNPF
ARNWLQKYYLKTIADVAREIVGKPVEIELAIAQGAEHDTSIRPTSRGEVE
ATPPAARHRGSELNPKYVFSRYVVGPNNRMAHAACLAVAESPGREFNPLF
LCGGVGLGKTHLMQAIGHYRLEIDPNAKIFYVSTEQFTNDLIAAIRKDSM
QSFREHYRAVDVMLVDDIQFIEGKEYTQEEFFHTFNTLHEAGKQVVLASD
RPPSQIPRLQERLCSRFSMGLIADIQPPDLETRMAILQKKAEYENIRLPR
EVIEYIAASYTSNIRELEGALIRAVAYISISGLPMTVENIAPVLSPTVRK
IETSPEIILKVVSEALNVPISDLKGNSRRREISMARQVGMYLMRQYTGLS
LPKIGEEFGGKDHTTVMYSCDKVAELQRTDPEMGSLLRQLSDRINLASRS
TEP
>gid:521264  dnaB  replicative DNA helicase
MVQEPSFQPDSQLIPPQNLEAEEWILGGILLDPEAINRVVDILPVEAFYL
SAHREIYRAAVSLHSRSNPTDLLCVTAWLQDQGLLERVGGHRKLAELVER
TVSAINIDRYALLVKEKYLRRKLIEAGTHVVKLGYDNSLPLDVILDQAEQ
QIFSVTQDRIQQGLTHTNEILIHTFTELDKRVEGNIQPGLFCNFYDLDNI
TQGFQRSDLIIIAGRPSMGKTAIALQIARRIAEIHNLAVAIYSLEMSKEQ
LVQRLLASEARIDTNYLRAGRISQHQWEPLSRALGILSQLPIYIDDTPNP
TLGQIRSTARRLHAEHPNGLGLILIDYLQLMGGEETSEGRVQELSKITRS
LKGLARELNVPVIALSQLSRSVESRQNKRPLMSDLRESGSIEQDADLVIL
LYRDEYYNPDTPDRGICELLIAKHRNGPVGTVKLLFDPQYTRFENLARD
>gid:520894  dnaE  DNA polymerase III alpha subunit
MSFVGLHIHSDYSLLDGASQLPDLVARAMELGMPAIALTDHGVMYGAIEL
LKLCRGKPLKPIIGNEMYVINGDITKQERKPRFHQVVLAKNKQGYHNLCK
LTTISHLQGFQGKGIFARPCINKELLAQYREGLIVTSACLGGEIPQAILQ
GKPELARSVAAWYQQTFGEDFYLEIQDHGSQEDRVVNVELVRIGRELGIK
IIATNDSHFISCHDVEAHDALLCIQTNKLLSDEKRLRYSGTEYLKSAAEM
ARLFRDHLPDEVIQEALANTLEVADKIEAYNIFREPQSPEFPVPPGHTAD
TYLEQVAWQGLLERFQLSDRQQLEATYRQRLEYELKMLQQMGFSNYFLVV
WDYIKYARDHNIPVGPGRGSAAGSLVAYALRITNIDPVHHGLLFERFLNP
ERKSMPDIDTDFCIDRREEVIQYVTQKYGSDRVAQIITFNRMTSKAVLKD
VARVLDIPYSQADQMAKLIPVVRGKPVKLAVMISEETPSPEFKEKYDSDP
VVRRWIDMAMRIEGTNKTYGVHAAGVVIASEPLDQLVPLQRNNDGAVITQ
YFMEDLESLGLLKMDFLGLKNLTMLQKTQELIEKNHGQRIDLDALPLDDA
KTYQLLAEGKLEGIFQLESSGMHQIVRELKPSNLEDISSVLALYRPGPLD
AGLIPKFINRKHGREPIQYQHELLKPILSETYGVLCYQEQIMRMAQDLAG
YSLGQADLLRRAMGKKKKEEMEKHEALFIEGAAKNGVPSAIAQELFKQML
DFAEYCLSGETAVMTVEYGAVPIRRLVQERLSCHVYSLDGQGHLYTQPIA
QWHFQGFRPVYEYQLEDGSTICATPDHRFMTTRGQMLPIEQIFQEGLELW
QVAIAPRQALLQGLKPAVQMSG
>gid:520907  dnaE  DNA polymerase III alpha subunit
MKIVGRRLMGWQAVYDIGLAADHNFVLANGAIAANCFNKSHSTAYGYVTY
QTAFLKANFPVEYMAALLTANSGDQDKVQRYIATCLSMGIEVLPPDVNRS
DIDFTPVGNKILFGLSAVRNVGQGVIEAILQARAEGGAFQSLADFCERVP
RAGGDSRILNRRALESLIACGAMDSLHPQRNRNQLMQDLPLVLEWALARA
KDRAVGQVNLFDMLAGDSSNESKGSYDPAPSAPPVDDLPDTEKLRQEKDL
LGFYVSNHPLKDIHRPAAMIAPISLANLEQHTGQGVVSVIALLTGLKSIT
TKRGERMAIVQLEDLTGQAEAVVFPKAYERIHGQLQLDHRLLLWGTLEMR
DDRPQLLIEDAEPLEEVKLVLVDLPVEQAGDIQAQSRLSEVLKQQAGEMP
KVPVVVKVTDGYHAQYVRLGPQFRVEDADRARHALTSAGFQAHASELLSL
QL
>gid:520144  dnaG  DNA primase
MDIPRLHPDTIEAVRQAVNIVDVVAEHVVLRKRGREYVGCCPFHEEKTPS
FTVSPLKGFYYCFGCGAGGNAIKFLMELQKRSFAEVVLDLAQRHQIPVRT
LDSEQKQALQARLSLQEQLYEILAIATSFYEHALRQPIGQAAQDYLQRQR
HLKEETIQQFRLGYAPAGWQVLYTYLVEQKGYPAALVEQAGLIMPQRTGD
RYYDRFRDRLIIPIMDTQGRVVGFGGRTLSNEEPKYLNSPETPIFNKGQL
LFGLDKARQAIAQTDQAIVVEGYFDVMALHQAGFPQTVASLGTALSQAQI
KLLLRYTESKQIILNFDADKAGQRAADRAIGEAADLVYRGDLRLRILTLP
AGKDAADFFTVDEATLAARRAAYQTLLDQAPLWLDWQIQTLVQQYDLEQG
DQFQQASQALSELLAKLPNATLRSHYIHHCAELLGRHDSRLTLHLEEALR
QQVRGYRWPGRSQKWQRPADYNLRQAAEEQLLRLYLHRGEYRPLIRQTLQ
ARDIEFSLSHHRWLWRQILAVEEEHCGGRPDPDDPYNAEWPLHSPTEAGT
QLTDLDLVSHLFDRLDEAEDASLITPLLHLDETTASGLDRPKLVIEAAAA
ALERITIEKECRYLLQSWQEIVALILSESPATEDLRRYLPLINSNTEIDD
IADVIPERLQVLARLRRKYYQNRQYLQQLDQQRCPSLALIRDYS
>gid:521198  dnaN  DNA polymerase III beta subunit
MKVVCSQSVLSSKLAPLSRVAPSNPSHPILANILLQAEGGRLGLSVFDLS
LGMQIWLDAAVKVPGAITVSAKLFSEMVSRMPNRDIEITAEDTRVILDYG
SGFFEIQGMSAEEFPALPTLEDVTPITLTAEALRRGLQGSLFAASTDENK
QILSGLHVTFERDRLEFAATDGHRLAVTVTEQPVPAEPLSPITIPAKSLK
DLERLMAKQDGMVALRCDPTQVVFDIGNDARITSRLLEGQYPNYRQLIPK
TFARQVTVERSALADALERVAILAAQKNNVVKINIDTEAQELKLSAEAPQ
LGSGEENLPAQISGESMVVAFNVKYLLEGLKVMNSADVQLQLNGETQPAI
LLPLGEAQMKYLVMPIQIRS
>gid:519335  dnaX  DNA polymerase III delta prime subunit
MNWFRPVIGQPTAIQLLTYALDRRQIAPAYLFVGPEGVGRALTARCFLQA
ILNEAKDLSNHPDVLWMEPTYSVQGTLYTRRQLLAADKEIPRSAPQIRLE
QIRQLSRHLSQPPMRAPRSLVVLTQAETMNEAAANALLKTLEEPGRATLI
LIAPSPSALLNTIVSRCQKIPFYPLSRQAVEQVLRQVAPPDFWHQVTPAL
LDLGAGSPGAILQAWQTWQEIPEAFRHLGEQLTAPLPLQTALELARDISQ
SLDVERQLWLLSLMQQQIWQKRDLPRCVLVLQQLERARQYLQQYVQPRLV
WEVLLMQLGTV
>gid:520397  fpg  formamidopyrimidine-DNA glycosylase
MPELPEVETVRRGLELVTLKQPIVDVEVLLARSIALPKEPQAFIEHLRDR
AIEQWQRRGKYLLATLDDGSRLVIHLRMSGQLLWLTTPQPPCPHTRVRWF
FPTRAELRFVDQRTFGRCWWLPPDCRVAEAIPALATLAPEPLSEAFTVAF
LAARLAHCRRSIKTALLDQSIVAGMGNIYADESLFLSGLHPTQSAHTLTP
EQVQRLHGVICQVLREGIAAGGTTIRTFMSPAGVNGHYGGQAWVYGRKGE
ACRVCGTTIERLRLAGRSSHYCPQCQPLSSAIGK
>gid:520196  gyrA  DNA gyrase A subunit
MSFAADSSRIIPTELREEISRSYLEYAMSVIVGRALPDARDGLKPVHRRI
LYAMYELGLTSDRPFRKCARVVGEVLGKYHPHGDSAVYDALVRMAQDFSM
RHPLIEGHGNFGSIDNDPPAAMRYTECRLQALATEALLQDIEQETVDFVD
NFDGSQQEPLVLPARIPQLLLNGASGIAVGMATNIPPHNLGELVDGLVAL
IHHPQMSDRELMRYIPGPDFPTGGHILGQGGIEEAYTTGRGSMTLRAVAT
IETLEAPGRQPREAIIITELPYQTNKAALMEKIAELVNEKKIEGIADLRD
ESDRDGIRVVIELKRDAHPRVVLNNLYKQTPLQVNFGANMLAIVNGEPQL
LTLKRSLEVFLRFREEAIARRTRYALRKAEERDHLLQGLLVALANLDAVI
QLIRSASDTALARQQLMQTYALSEAQADAILQMQLRRLTALEAEKIEREH
AELQRQIADYRDILAHRQRVLEIIEKEVTELKAKFATPRRSRIVQADGEI
SDIDLIANDKSVILVTQQGYIKRMPVDTFEAQSRDGRGRKGAEIKEDDAV
EHFFSCNDHDRILFFSDRGLVYALAAYQIPSGSRQARGTPIVQLLPIPRE
EKITSVIAVQEFSEDEYLVMLTRKGFIKKTPLAAFSHIRSNGLIAISLEE
GDQLRWVRRTREQDTIIIGSRQGMAIHFRASHDQLRPLGRATRGVKSMNL
RPGDELVGMDILPAAIANRFATPSEDDSGEIEDSEETVAHSEGPWVLVIT
TNGYGKRVPVQQFRLQNRAGMGITATKFKAKSNEDQLAALRIVNAGDELM
IVTSRGIIIRQKVMDISSQSRSATGVRLQRLDEDDAIVTAAVLPPGSMEA
AAD
>gid:520448  gyrA  DNA gyrase A subunit
MIVPKQLDLLTSGQVIPTPLHSEMQRSYLEYAMSVIVGRALPDVRDGLKP
VQRRILYAMYELGLTPDRPFRKCARVVGDVLGKYHPHGDQAVYEALVRLV
QPFSSRYPLLAGHGNFGSIDGDPPAAMRYTETRLAAIAHRMLLQNISEAI
VDFAPNFDSSQEEPLVLPAQLPILLLNGSTGIAVGMATNIPPHNLGEVVD
ALIALIDRPHLSLGELLTYLPGPDFPTGGVIVEGAGLLRAYRTGRGTITL
RGVATLEDMAPGRGRHRRQGIVVTELPYQVNKAAWIEKVADLVNQGRLEG
IADLRDESDREGLRVVIELKRDAPAAQLLEQLYHLTALQVTYGINLMALV
GNQPRQLSLKEILSEFLQFREQTLLRQYRHDLATAQARQHVVTGLLISLN
AVEQVIEIVRSAADARAAQSELMIRLGLSDRQASALIAMPLRRLTQQERQ
ELEREQAVLQGRIAHLQTLIHQRPERLKALKKELRQLKKEFGDPRLTQIL
RETPAPEAVPDCGGDSLWLEMSYRGDLPLQTWRPWSEPLPLGFSAQTSDR
LWVFTESGKVYPLSVERLPVTSHREGDRDQPLLTLLPESAQGETLLSVIP
SQATADKLILLTRQGRIKVIPSSSLQNLRGRGLQLTKLKAGDTLGWVLPA
QGDHLVLATSRGRVFHFFLPDIPVLGRLHQGQAAVRLSRQEILVGAIALS
EHENVILVTASGLGKQVPLSLIEVVPLGHLGQLAIPLGQKLDRLAGIGRG
SRPLALVTTQERAWITHGEEIPILNREVMGGAIAPLDPGEQVQAVVALA
>gid:519466  gyrB  DNA gyrase subunit B
MTAEYSSAQLRVLKGLEPVRTRPGMYIGSTGPKGLHHLVYEVVDNSVDEA
LAGYCSTILVQINADGSVTVTDDGRGIPTDIHPDTGVSGVETVMTVLHAG
GKFGDGGYKVSGGLHGVGVSVVNALSEWLEVTVWRNGFVHHQRYERGVPT
TPLEKTPDTEERRGTSVTFFPDRQIFTETIEFDAKVLMTRLRELAYLNAG
IRIEFKDLRVTPPLSETYCYEGGIREYVRYMVQEKEPLHPEIIYIQGEKD
DVQVEAALQWCADAYSENLLGFANNIRTIDGGTHMEGLKAVLTRTLNALG
RKRNKLKENNANLAGENIREGLTAIISVKVPNPEFEGQTKTKLGNTEVRG
IVDAIVGAALTEYLDFHPNVTDTILEKAIQAFNAAEAARRAREMVRRKSV
LESSTLPGKLADCSSRDPAVSEIFIVEGDSAGGSAKQGRDRRFQAILPLR
GKILNIEKTDDAKIYKNNEVQSLITALGLGVKGEEFDPAKLRYHKIILMT
DADVDGSHIRTLLLTFFYRYQQELVNQGFVYIACPPLYKVERGRQHFYCY
SDRELQQQIASFPENANYTIQRFKGLGEMMPEQLWETTMNPETRILKRVE
IEDAAEADRIFTILMGDRVAPRREFIETYGPQLALENLDI
>gid:519678  lig  DNA ligase
MVLKEPPAERIQQLRRLLQRASYAYYALDQPIMEDEVYDQLYRELQELEA
AYPEYITPDSPTQRIGEAPVSQFESVSHRIPLYSLENAFTFADMVAWQER
WQRYWRTLRQEEPLPPAEYVCELKMDGVALALTYENGLLVRGATRGDGQR
GEDVTSNVRTIRPIPLRLALDNPPPVVEVRGEAFLPLERFHQLNQERQAQ
GEPPFANPRNAAAGTLRQLDPRIVAQRQLDFFAYALHLPEGGSVPLGENQ
AGEPQSQRQVLYALQHLGFRVNPHHADCPDLEAVKAYYDRWQTARHQLPY
LTDGIVVKLNDLKLQQTLGFTQKFPRGSIAWKYEPEQAITDVLAITVQVG
RTGALTPVAELAPVQLAGTTVSRATLHNADYIAELDLHIGDKVVIHKAGE
IIPEIVRVFPELRPPTARPFTMPTACPECHQPVVRPANEAVSRCGNPRCP
AIVRGQIRHWASRDALDIQGLGEKLVQQLVTKELVRTPADLYRLTAAQLL
SLERMGQKSADKLLVAIANSKQQPWPRVLYGLGIRHVGSVNAQLLADRFK
SVEELATATVADLCGVDGIGEEIAQAVQEWFQDPDHQSLIADLQALGLQL
AAALHPAQKALTTEKSLNGKRFVITGTLPTLTREQAKALIQKHGGHVSES
VSRQTDYLVVGEKAGSKLRRAQELGIPCINETELIQMCR
>gid:521317  mutL  DNA mismatch repair protein
MNHGLTVVPLPLEMQRAIAAAETLDSLATVAQELVENALDAGASRIHLHW
HPSAWHLEVTDNGEGIRGADLTQVALPYTSSKLPASGQLADITTLGFRGQ
ALHSLAQMAQLTICSRHREAESGWQVSYDAHGQVRSQRPLGMAVGTRVIA
EHIFQDWPQRQQGANPKQLQQRLQQIALCFPQVAWYLLKEGKRWCHWPAV
ASLSDRLLQLIPQLHPQDLRQMRDAQIELVLAPPDRHHRPRPDGLGVAVN
GRWVELHSDPSWQQVILEAFGRGLPRQRFPLCIAHLHLPPGAIDWSAEPQ
KRRIYLREPEQWQALLVERIGQLLACPATPLRDASSYQVLKAAEPAARYR
TLLSTNGAQLSLPPLKVVGQLHNTYIIVEHAEGIWLIEQHIAHERVLYEQ
IETDWQAVELEQPVLLKSLTETQVQRFQDWGLGIAPFGVQLWAVRRVPGL
LRDRPDLVAALIELSQVTDLSAAKVAVACRSAIRNGTPLTLAEMQTLVDQ
WYRCRQPHTCPHGRPICLQLQESSLARFFRRHWVVGKSHGI
>gid:521323  mutS  DNA mismatch repair protein
MSSATVSGSSVFSRWRTLVVTTSQMSALDKSLGRLEWPRLCQQLATFAST
KRGMRQLQGGDILGGTQAASQVLLAQTAEVIALETVHQVRLDFSQVTDIE
PALARLDHQGCLQGTELLAIAHLLSTARQQRRQIEEHGQLSELQQLVAGV
RTYPEVTQEIYRCITDQGQVSDRASPELAQIRQQQRQCRAQIQQQLQQIL
QQRAGAIQEPVVTQRRDRYVLAVKATHKDQIVGIVHDLSASGATLYIEPQ
ETIDLQNRLQQLAHQEAEVERAICQALSDQLATISDDLWYLLDVLTTLDV
AVARARYSLWLQGNPPQFVSQTRLHLKALRHPLLVWQEHHEQGQPVVPID
IELQPATKVVTITGPNTGGKTATLKTLGLAALMAKAGLYVPAAAPVELPW
FTGIWADIGDEQSLTQNLSTFSSHICNIRDILTELEVTGGNTLVLLDEVG
AGTDPSEGTALAIALLRYLAEHASLTFATTHYGELKALKYQDSRFENASV
EFDEETLAPTYRLLWGIPGQSNALAIAQRLGLYPSIVEEAKALLSKDSNS
VNEMIMGLVAQRQAQEAKTTAAATLLRDTEALYQEIATRAQELRQRQQQL
RQQQEEQVRTALHQAQQEIAKIIAQLQRANSPEQVQAAQTALAQIENNYL
PPPQPAGFIPQPGDRVRLRQWQQVGEVLSVSQQGDIVVQVGAVKFTVPPH
AVESLQGEPVHLPSKPKPSSAPSPPTARTTVLAIRTEDRTLDLRGKRTHE
AEPLLEEFLNRQQGTVWIIHGHGSGALRRFVHQFLDQHPSVQSYHLAPPE
EGGRGVTIAQL
>gid:521309  pcrA  ATP-dependent helicase
MGSDFLTGLNPSQRQAVEHYSGPLLVVAGAGSGKTRTLTYRIAHLIRHHQ
VAPEHILAVTFTNKAAREMKERIETLFSQEMAQQLYGRDWLDLSPAEQRR
VRSRVYHTYTQPLWIGTFHSLCARLLRLEIEAYQHPQGYRWTRHFTIFDE
SDVQSLIKQIVTGELNLDERRYDPRAIRYKISHAKNRGLSPDQLAQEQRS
PAGRVAAEVYRCYEAALAKNNALDFDDLILRTVHLLQQRPERLDYWHQQF
QHILVDEYQDTNRTQYDFIRLLATNGTPPQEFRNWGNRSIFVVGDVDQSI
YSFRCADFTILMNFQQDFGDRLPDQQTRTMIKLEENYRSVANILQAANYL
IEHNSERIDKVLRPTKAPGPPIHCECCDTETDEAYFITQQIKALGSQSLE
PQWGRFAILYRTNAQSRPFEEALVRANIPYTVVGGLKFYERKEVKDILAY
LRLLQNPQDTVSLRRIINIPRRGIGKTSLDRLSDAAQTLGISLWDLIADT
ESMTPLAGRASRAVQQFVTLITRLRSLVDDIELPELVKTVIEETGYRREL
ENEGTDESLERLQNLMELVNAAQQFSEDNEGASLSDFLNSSALASDLDTL
QEGEGVVSLMTLHAAKGLEFPVVFLVGMEQGLFPNFRSLNDPMALEEERR
LCYVGITRAQERLFLTFAQSRRLYGGREDTMPSQFLTELPPELLTGNVQR
RPPRTAVTTVSPSRLSKAAASPWRVGDRILHPVYGEGEITHVFDTGPKLS
LAIRFPRRGQKVVDPRLTPLERL
>gid:519242  phrA  DNA photolyase
MSLRLFWHRRDLRLNDNLGLAAAYTRTPKVVGLFCFDPAILSASDIAAVR
VAYLVGCLQALQEAYRRLGGSFLIFRGDPRQILPQVAKGLGAVAVHWHED
VEPYGRERDRAVAAALKEKGIAVETAWDQLLHPPEAIQTKQGQPYTVYSP
FWRNWSSLAKPEPVSAPRHLEPLTEIEQATAGTLGAICLPTAKDLGFHWS
GDLILAPGEAAAQAQLETFIDQHIQDYGEQRNYPAQPGTSLLSPALKFGV
IGIRRVWTATQAAMAAARSEEAQRSIRTWQQELAWREFYQHALYWFPHLA
ERPHREGFATFPWLNNEAHFAAWCEGRTGYPIVDAAMRQLNETGWMHNRC
RMIVASFLTKDLIINPQWGERYFMQKLIDGDLAANNGGWQWSASSGMDPK
PLRIFNPASQAQKFDPEAEYIRRWLPEVRSLDTIDLVTGNISPLERHRCG
YPLPIVDHRQQQQRFKQLYQEHFRSPIPQE
>gid:520166  polA  DNA polymerase I
MSAPHLLLIDGHSLAFRAYYAFSKGRDGGLRTSTGIPTSVCFGFLKSLLE
ILEQQQSDHVAIAFDLGEPTFRHAADENYKAGRAETPQDFITDIANLQAL
LRALRLPLLSQPGYEADDVIGTVAHCWRSQGWPVSIVSGDRDLFQLIDRE
GQVQVLYLGSTLGQRKQGLEAFDALKVKETMGVWPEQIVDYKALCGDASD
RIPGIKGIGPKTAVQLLSQYPTLEDIYAHIHEIRPPSLRTKLINGEADAR
HSRTLAQIVHDVPLELPLEELHLSPFDWPTLDHLLDQLEFRALRLQLQDW
HLRLGGILPDLETDSEETWFFAPTDTPPPLNVQLIETPEQLQWLMSQLET
CTDANHPVAWDTETTALNPRDAALVGLGCCWGAAPDQVAYLPLGHKEGQN
LPLEETLAALRPILEGDRYPKVLQNAKFDRLVLRFQGIQLRGVVFDTMLA
SYVINPEASHNLKDLCQRYLPLQAQSYRTLVGKDQTLADLSPATVAQYCG
LDVHTTYLLKEKLEADLTPRLRQLLLEVELPLEPILAEMEATGIRIDSDY
LRQLSQELEQQLAALQQQAWDAVGQPFNLASPKQLSELLFGTLGLDTKKT
HKTKLGYSTDAATLEKLRGDHPVIDLILSHRTLAKLKSTYVDVLPTLVRP
DTGRVHTEFNQAVTATGRLSSSSPNLQNIPIRTEFSRQIRRAFIPEAGWL
LVAADYSQIELRILAHLSQEPALLAAYQEGADVHRLTAQFLLEKTDISSS
ERRLGKMINFGVIYGMGPQRFAREAGVSVAEAKVFIQRFYNRYPRVFDYL
RQMERLALSQGYVETILGRRRYFAFESRELQSLRGKPLDVLADVDPSKLK
MSNYERGLLRAAANAPIQGSSADIIKCAMVKLAPLLPPEKARLLLQVHDE
LVLEMTPEAWEHLQTTIPEVMSTAVPLSVPLAVDIYAAANWLEAN
>gid:520557  psaD  photosystem I subunit II
MTTLTGQPPLYGGSTGGLLSAADTEEKYAITWTSPKEQVFEMPTAGAAVM
REGENLVYFARKEQCLALAAQQLRPRKINDYKIYRIFPDGETVLIHPKDG
VFPEKVNKGREAVNSVPRSIGQNPNPSQLKFTGKKPYDP
>gid:521259  psbU  photosystem II 12 kDa extrinsic protein
MQRLGRWLALAYFVGVSLLGWINWSAPTLAATASTEEELVNVVDEKLGTA
YGEKIDLNNTNIAAFIQYRGLYPTLAKLIVKNAPYESVEDVLNIPGLTER
QKQILRENLEHFTVTEVETALVEGGDRYNNGLYK
>gid:520540  radC  DNA repair protein
MMSYSLRVADLPAGDRPREKLLSQGARYLSSAELLAILLGTGQGAGKLSA
VGLGQFILKQLGERTGDSTDAVSALRDITPEELMAIPGVGPAKATTILAA
VELGKRVFQSRPGEQTIIDSPALAAAVLAADLMWQATERFAVLLLDVRHR
LLGSHVITVGTATETLAHPREIFREAVRRNASRLIIAHNHPSGNLSPSQA
DLDLTKQILQAGQLMEIPVLDHLILGNGDYQSLREITPLWQQVPQGDGSA
>gid:520928  recA  recombination protein
MSDTTLSLLSPEKRKALEFAISQIERSFGKGSIMRLGDATSMKVETISSG
ALTLDLALGGGLPKGRIIEIYGPESSGKTTLALHAIAEVQKAGGVAAFVD
AEHALDPTYAAALRVDIQNLLVAQPDTGEAALEIVDQLVRSTAVDIIVVD
SVAALVPRAEIEGDMGESHVGLQARLMSQALRKINGNISKTGCTVIFLNQ
LRQKIGLTYGNPETTTGGVALKFYASVRLDIRRVQTLKKGTEEFGIRAKV
KVAKNKVAPPFRIAEFDIIFGKGIANLGCILDMAEEVGVITRKGAWYSYN
GENLAQGRDNTINYMEENPSFAQEIEQQVRQRFDQRVALSANTAAYTEEE
IGEGE
>gid:521295  recF  DNA repair and genetic recombination protein
MFLKSLHLRHFRNYSEQSVTFAAPKTILVGDNAQGKSNLLEAVEWLATLQ
SHRTHRDRDLIQQGHESAQIEATLERQGVPLDLAVSLRPSSGRVLRVNGC
TVKRTADFLGQLNAVEFSCLDLELVRGTPAIRRNWLDRILLQLEPLYSQL
LQTYQKALRQRNALLKQAGSQGWDEALWQAWNQQLVINGTRIIRRRQRLI
ERLAPLAQDWHRVLSGDRETLTLSYESHVPLGDGTSEAIVAAFSEALATR
RAIEFLQKTSLVGPHRDDVGFCLNAQSARQFASQGQQRTLVLALKLAELA
LVESVVGDTPLLLLDDVLAELDLQRQGILLEVMGDRYQTLMTTTHLAPFA
APWRQQAQILKVTAGTIASVSDTAAQTSD
>gid:519554  recG  DNA recombinase
MSLDWPRLQRALAIETERGFCNLQGNTYRFNEYLHQALTQAPLQQLPPQV
RDRWQETAAAYARYDDLSPSQRQHLVVTTRQLLHDVHQQLTRSAPRPRPL
DQGLKTLHGIGEQLRCVVGDRTAAQLAKLGLYTVADLIYYFPRDHIDYAR
QVPIRQLQAGETVTLVGQVRRCKCFTSPRNAKLTILEIILQDRTGQIRLT
RFYAGARYAQRGWQEQQKRLYAPQTLVAASGLVKQTKYGLTLEEPELEVL
EHPGAEIDSLTIGRIVPIYPLTEGVSPDVIRRAVARVLPLVQGYPDPLPQ
ALCQHHQLIPLDTALRYIHFPPDQTQLSLARRRLIFDEFFYLQLGLLQRR
RQQQQQVSVPLKPQGELIEQFYQRLPFQLTGAQQRVVAEILADLERPIPM
NRLLQGDVGSGKTVVAVIALLAAVQSGYQGALMAPTEVLAEQHYRKLFEW
LTPLHVPVELLTGSVRTAKRRQILDQLATGELPVLVGTHALIQEGVRFQR
LGLVVIDEQHRFGVAQRAKLQAKGVLPHVLTMTATPIPRTLALTLHGDLE
VSQIDELPPGRRPIQTTILGRGDRCCAYDLIRREVAQGRQAYIILPLVEE
SEKLDLKSAIAEHQRLQTEIFPNFRVGLLHGRMASSEKDATIQAFAQGEL
DILVATTVVEVGVDVPNATVMLIEHAERFGLSQLHQLRGRVGRGAQQSYC
LLLLNSARNDAAKQRLNVLAQSQDGFFIAEMDLRLRGPGEVLGTRQSGLP
DFALASLVEDQDCLEAARSAASELIAQDPELRNYPLLQAVLQQRRDRLLE
TMMN
>gid:518981  recJ  single-strand-DNA-specific exonuclease
MQRPWQIYSADPAPPDFIEAVKQLHPEAGAITAQLLWQRGYRDSLRQVPP
FLDWRYYESASPLEFPEMSAALGRLQQALDQQEKVVIWGDFDTDGVTATA
VLWEGLKPLLGTQLVDFYIPNRHHDSHGLSPHGLQQLQQKGVALIITCDT
GSTNGPEIVLARQLGMDVIVTDHHTLDPTPLGAVALINPRQLPWDHPLRH
LSGVGVAYKFLEAVYDRWPEKTQGYPLENLLDLVAIGLIADWVELRGECR
YLAQRGLDQLGKRQTLRPGIAALLGHVSKHRAWQREISFTVAPRLNAVSR
VEGDVRPLITLLTTQDRHEARQLAERIATLNTERQRRQKEVTRMAHAKAA
QLDLSASRVILLTDDNWPLSLLGLVANDIVKTYGRPALLLQTNPETGMAV
GSARSDGSVDLYEAFHSQRHLLEGLGGHPYAVGFRLKMAHIPLLAAALNQ
FLAQQEGTASATPKPLVIDLEVTLGQLNGQLLKELEVLAPFDSTHHPYPR
LLVRNVELTQLRDNNSHGGRRYVSMVLHDRQHQHTFPAKWWDHQIGDCPQ
GACDLVIELEQWQSSLSAVIKELRPSETNVIQEASFQPLMDYRDRPPAAS
VKGLRVETCPTSRQSWRQWIQRAKREQQPLILAYAPPPEQDALKVWREFL
TLCQQATQQGKPLEREGLQKRLGIEATTLNYALQVLETLGVKVLSTPEEF
WCQWPPQLSSSPPTGLALERFTAAIAEENFRRRYFAAAPLQALQETY
>gid:519620  recQ  ATP-dependent DNA helicase
MPDADLKPIQAALQQYWGYSELRSPQAEVMRALLQRRDALVVLPTGAGKS
LCFQLPAVLQGGLTLVVSPLLALMENQITELRQRGLAAAAYHSELPSSQR
RQILSQLRDYRLLYVSPESLFSNPLWQSLCSAQVQLNGLIVDEAHCLVQW
GDRFRPAYRRLGTVRPALRQCKPQQSPLAIAAFTATADPHAQRTLIEVLG
LEQPVQIIHSPYRANLHLAVRSVWSRGYRRHCLQQFLKQQGRTCGLIYAR
TRHDCETIATWLQEQGHRTSPYHGGLPAAQRRQIERDWLQDKLPFVVCTN
AFGMGVNKPDVRWICHYQPPLQLSEYLQEVGRAGRDGEAAQALVLVSDRW
GLDREDQQRWSFFQHQSQDTYNRAMALQTQLPLQGNLQQLRQHFPEVELT
LALLHQQGALRWQDPFHYCRQPLAQVPPPPKDPQEQLMQKFLYHRGCRWQ
FLLQAFGFATEARGFHCGHCDRCRPPHRSRKIP
>gid:521045  recR  recombination protein
MSSVYTRPLARLIEQLQRLPGIGPKTAQRLALHLIKRPEADIQALAQALL
EAKQQVGLCSVCFHLSAEPVCEICASPQRDNHTICVVADSRDVIAIEKTR
EYHGKYHVLGGLISPLEGITPEHLHIQPLIQRASQPQVEEVILAINPSIE
GETTTLYVGQLLRPFVKVTRIAFGLPVGGDLDYADEMTLARALAGRREIE
WQ
>gid:519482  rhnB  ribonuclease H
MEPLSLIYEQAYWQQGYSRVVGVDEVGRGCLAGPVVAAAVILPVDCVPLP
EVRDSKQLTARQRSRLFAQIYHQAIAIGIGSASVAEIDQVNILQATYRAM
ARALGRVAPWDHALIDGKLTKTAPFERVTAIIGGDRHSYSIACASIIAKV
RRDRFMARLARRYPQYGWERNVGYGTPEHRQALDQYGLTPWHRRSFLKSL
LPSEAHLCNAIPAE
>gid:519110  rnhA  ribonuclease H
MTHIRAIYTDGACEGNPGPGGWGVVIYFTDGSVHELGGHHPATTNNRMEL
QAAIEALKAWRQLAPGSAIALYTDSEYVLRGITEWIHHWKRRGWKTAAKK
PVLNQDLWQELDALNDPLVQWHHVRGHRGDVGNERCDLIARRLSRGQPIA
LRTLPPPLG
>gid:519350  ruvA  Holliday junction DNA helicase
MFVYLRGTVVGHQSEGGHRCALILEVNGVGYRLLVTSHLLQQYPPRPEVV
QIFTHLSIREDQMLLYGFASAAERDLFLRLIRVNGVGPQMALSLLDTLPL
PELVQAIVSGNTRRLSRAPGVGHKTAERIALELKAALSAWRQEMGFTTAS
SGLPSEAVREELELTLLALGYSDREIAAALTAVGQTTTLPNNSDPEAWLR
EAIAWLSANT
>gid:521099  ruvB  Holliday junction DNA helicase
MMAIISSQNSPPERSRPPSVLSNQPTPEEQHSLAEDSLRPHSLQDYIGQQ
ELKEVLHIAIQAAKARQEPLDHLLLYGPPGLGKTTIALILAAEMGVNCKV
TSAPALERPRDIVGLLVNLQAGDILFIDEIHRLSRMTEELLYPAMEDFYL
DLTVGKQQTARPRRLKLNRFTLVGATTRAGALTSPLRDRFGLVQRLRFYH
PEELQQIVQRGAALLQTPITPEAALEIGRRSRGTPRIALRLLKRVRDYAA
VKHDGRITLDVARAALELLHVDPAGLDGSDRRLLRVMIESYQGGPVGIET
LAAATGEDVQTIEEVYEPYLLQMGYLQRTPRGRVATPRAWQHLGYTAPEN
QLPGLSSPYDTLS
>gid:519961  ruvC  Holliday juction resolvase
MRILGLDPGLATLGYGCIEVYRDTCQVRDFGVITTSADLPTGDRLQSLYN
DLHTLIPILQPDLVALERLFFYRMSHTIGVAQARGVILLVLSQRHCPLLE
LTPPQVKQALTGYGNATKIEVQRAVQRELHLCTLPQPDDAADALAIALTA
SRHCGHING
>gid:520319  ssb  single-stranded DNA-binding protein
MGLNVVHLVGRVGGDPEVRYFESGSVKCRLTLAVNRPSKDDQPDWFNLEI
WGKTAQVAADYVRKGTLLGIKGSLKFDRWQDRNTGVDRSSPIILVERMDI
LSSKRDTDPNAVPAGYVPEI
>gid:520011  tatD  
MLVDTHVHLNFPEYAPDLEAVAERWRSAGVVRLVHSCVEPREFPVIQRLS
AQFPELFMAVGLHPLDTQQWQPQLKEKIASLATSEPKVVAIGETGLDFYK
ATNREQQEAAFWAQLEVAQALDLPVILHCREAAAAARDLLQQFVRDRGPV
RGVMHCWGGTPEETAWFLELGFYISFSGTVTFKNAKQIHASAQMVPSDRL
LIETDCPFLAPVPKRGEKRNEPSYVRFVAEAVARLRQCDLGELEQQTTLN
ACHLFRLPLPTEQSTAPFS
>gid:518864  tll0060  
MVRFLHVADVHLGYNKYRQDNPSRMLDFFRAFDSALETYAIQAQVDFVLI
AGDLFEERMITPGILNQAEYVLDKVRSAGIPVLAIEGNHDNCPYGVKSNW
LRYLCEKDYLYLLEPDETGTLQPWDPETARGGYVDLPCGVRVIGSQWYGA
SAPRAIQQLARQIQALPPAAGATILLFHHGLEGQVCRYQGALRYNELLPL
RQAGVDYLALGHIHRHYAVEDWIFNPGSIEANSIQENQQQNPRGVLLVNL
DQGPPQAELKRDYWQRPIHRYTLTLTPSDTVSDVESQLQQLVQRHRSDMA
EAIVEVTLKGEVGFERGELSVRSLQGQLQEAAAAFIFRLGFAATAVAYQT
YRDGTAASPPRAQIEEQVFTDLLASVAEYRDRAQPLAKALMRMKEDLLQP
NANIPDLYQWLASLSLESEGWEDQGRRG
>gid:518919  tll0114  maturase; reverse transcriptase
METRQMAVEQTTGAVTNQTETSWHSIDWAKANREVKRLQVRIAKAVKEGR
WGKVKALQWLLTHSFYGKALAVKRVTDNSGSKTPGVDGITWSTQEQKAQA
IKSLRRRGYKPQPLRRVYIPKASGKQRPLGIPTTKDRAMQALYALALEPV
AETTADRNSYGFRQGRCTADAAGQCFTVLGRSDCAKYILDADITGCFDNI
SHEWLLDNIPLDKEVLRKWLKSGFVWKQQLFPTHAGTPQGGVISPMLANM
TLDGMEELLKKHLRKQKVNLIRYAGDFVVTGESKETLEKVTTVIQEFLKE
RGLTLSEEKTKVVHIEEGFDFLGWNIRKYGEKLLIKPAKKNIKAFHKKIR
DALKELRTATQEAVIDTLNPIIKGWANYHRNQVSKRIFNRADDNIWHKLW
RWAKRRHPNKPARWTKNKYFIKIGNRHWVFGTWKKDKEGRLRSRYLIKAG
DTRIQRHVKIKADANPFLPEWAEYFEERKKLKEAPAQYRRIRRELWKKQG
GICPVCGGEIEQDMLTEIHHILPKHKGGSDDLDNLVLIHANCHKQVHSRD
GQHSRFLLKEGL
>gid:518933  tll0128  
MEIQRLTLKNFKTHRDRTFEFMPGVNVICGENGAGKTSLFEAIAWVLFDA
RSGYGSGFHKAIIRRGTQRAEAIVQFISAADGRSYIVRRNTQTGYSIVDP
QVGELGLPLREDVHAWLQEHLGIRSAFPLRDLFEQIIGIPQGMMTADFLK
PPAQRRQIFEPILQVSDYRQAFDNALALVNFSQEQVASLERQLALQNQEL
ATRSQYEQQATALAAELERDRQRCEELRQECQALAAKKQEYEAAVETLNR
LQQTCERLEAQLRQQEELCRDRQRQLTAARESQARCQQLQSDYDRYRQQE
ACYQELEQQLRERATLEHKIRTLEEQQQKLATELARIESQRQAIATIASQ
LAALEPQIADADALDAEIAPLEVRYQQAQQADQELRHLKQQAATLQTRLE
EIDQQVQALEQQRPIAATLSAKQAQREQLQAQLNHAAAAHALAATLDPIL
NTAQRQAAVSDALVTAAITGLTAAQQFSLVAAAIQEGLGALQQLQQNYHW
LLDQLTALRHTLTDPAALPALNQRLEELEAEIALAAAAERSLLRAQALEE
ERCRLQTELRQLGDRQQALEPLSQELADLEGRVEKLRQERANLGQPHAQR
QLLLEQQAAAPQLEADYERLCSERASLQARLTPLYRERHHYATLEARRQQ
LSDELARGRSSYDTYLQHQQQAAQVETYEAALRAATQEANALKGDLAKAQ
AEYKAQAQGVDLAALAAVRDRYESLEREYQRCLGAIPEKEKQYQNCLATL
QHLDDVAAAKETTLQHLAAARNHHHLIETARNIFRRSGPRVSEAYLHTVS
AEADRLLRELLNRPDVALQWTSDYEIQVNEGGYWRPFKSLSGGEQMCAAL
AVRLALLRVLVNTDIAFFDEPTTNMDQVRRQQLAESLSNLKSFHQLFVIS
HDETFEALTEHTIHLERSLL
>gid:518961  tll0155  
MIKDTANRYFLSFVVEIDPVDQPASNPSIGIDLGIKAFATLSTGEKIDGP
DYSKLDKKIRRKQRKLARQVKGSQRREKTRLQIAKLHSRISDIRRDFLHK
LSSRVVIENQVITLEDLNVSGMVKNRKLARAISLQGWREFRVLVEAKCTK
LNREFIVIDRWEPTSQTCSTCGFKWGKIDLSVRSVVCLNCGTEHDRDENA
ARNIEKVGIGHCHDYKWTQRESKTTSVASPSEASRIIAL
>gid:518962  tll0156  
MKARYRYRFYPTDQQRQRLAQLFGCVRVVWNDALEICKKSDKLPKTSELQ
KLVITQGKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFDSIKGRRKGKK
INPPRFKKKTERQSARFTTYGFSIKGEEVYLAKIGNVRPI
>gid:519012  tll0205  
MEKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTKWKKQDDLQFLNEVSSVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPLNIRWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRDLTLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIVIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAAGNILAVGHTVTVCGAGVRPDRHTSGGQLRRSRKSQK
>gid:519030  tll0222  
MQKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTQWKKQDDLQFLNEVSCVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPLNIPWSRRLPDGVEPSTV
TIRLNPAGQWYINLRFDDPRELTLQPVDPSVGLDVGISSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIIIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAARNILAVGHTVTVCGAGVRPDRHTSGGQLRRSRKSQK
>gid:519049  tll0240  
MRTPDYRRHDISDRVWERLEPHLPGRRGSWGGVAKDNRQFINAVFWILRT
GAPWRDLPPEYGDWKNVHRRFCRWRDKGVWEKVLEQLIDEPDYEWLIIDA
THVKVHPHATGAKGGNQDMGRTKGGSIPRYIWPWMRMVCRCEWLSQQVPL
RIVAKLQP
>gid:519066  tll0256  
MSSHLRKGRHSVTDLKIHLVCVTKYRRPVLSAEGLELIEKSFREVAKKMD
FQILEFNGEEDHVHALIEYPPKLSVSQIVNALKGVSSRRYGKAALPKPHE
ESLWSPSYFAASVGGALLEVLKEYMRNQKS
>gid:519123  tll0312  
MNLLPLSPTVSHLLSTLRPGQREISEWQGGMLAVSAVPGAGKSHGMAVGA
AIAIAREKLHQQRQLVVVTYSRSAAANIKVRIRQYLREMGLPRNGFSVQT
LHSLALKIATSHPTAGLRSRGENLMSEHEQRRLCVTCVKEWARSHPNLLE
RLIQGRDTSPLSNVEHEGRKSALLTDILVKLAQTVISSARSMALTPHDLR
QLSQQLRSPGAAEAEPYPFLEIGADLLELYQHHLAQGEQIDYDEMILAAV
RLLEGDRQYRHEWQQRVYAVFEDEAQDSTPLQSQLLHLLAADHTTGQVNF
VRVGDPNQAINSTFTAADPLFFNEFCDKCAQQQAFYEMTQAGRSTPLIIR
AANYLVRWANYALKDQEPPFREQFIDCVSPTGLESGANPAPWGQGVEIAR
PKTVVETVKHLAQRISQVLAAHPEASVAVLVRTNRQAEFVADVLRSPTDF
NLDSDLCAQGIPLLDVAGIERRSQVPKELLDILYFLHCPYSPAAVKAALT
VLQERKRIPPQNLDRLAAQPEVFLYPGPLDPPDEEPVLKARHYCHCLLNA
RLELSLFPLITYCAQELGYDAAELATVDRLIWELSQQEPTQLWERIYPRW
QELVAADRFQAVEMEDLHSRLVRSGQVTIMTMHRAKGLDWDAVFVPFLEE
RTLPGQSWVAANAKFLSPEVDFIDVVRSQLRAYGHHQPLPNWQTASKKAT
AAKIAEEYRLLYVAMTRAKRLLWLAAAQQAPFNWQNFNWRGFDQLQLHDS
PPCPFIKDLERQLKAHAAPTPGDR
>gid:519198  tll0382  
MPRIHANARTTPRIRREIQQAPASISHRELARRYGIHRHTVAKWRKRATV
EDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDDLLVLARTFLNPNLSR
SALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQFKAYAPGFIHIDVKY
LPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAESATRFLANVLANAPF
VVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLCREKRITHRLIQPRHP
QTNGMVERFNGRIAEILRAERFVSAADLQETLTRYLWAYNHRIPQRVLGH
MTPIEKLRWWQTERPDLFVSRVDNVTGLDT
>gid:519203  tll0387  
MTVIKDAANRYFLSFVVEIDPVDQPASNPSIGIDLGIKAFATLSTGEKID
GPEYSKLDRKIRRKQRKLARQVKGSQRREKTRLQIAKLHSRISDIRRDFL
HKVSTRVVIENQVITLEDLNVSGMVKNRKLARAISLQGWREFRVLVEAKC
NKLNREFIVIDRWEPTSRTCSTCGFKWGKIDLSVRSVVCLNCGTEHDRDE
NAARNIEKVGIGHCHDYKWTQRGSKTTSVASPSEASRIIAL
>gid:519217  tll0400  
MSSHLRKGRHSVTDLKIHLVCVTKYCRPVLSAEGLELIEKSFREVAKKMD
FQILEFNGEEDHVHALIEYPPKLSLSQIVNALKGVSSRRYGKAALPKPHE
ESLWSPSYFAASVGGAPLEVLKEYMRNQKKPS
>gid:519232  tll0415  primosomal replication factor Y
MLMAALRSQSTQLHPLGFTKGVCNMQQLNGGRAFLPCLKAEVSSAEYDEG
PAFASVLVDCPGATAAYTYQIPRGWRVQGGDVVEVPFGSQVVRGIVLEVL
AVLPPSVDPQGLRSLLEVVDQQLFPKDYWALLEQIATYYCTPLIQVVRTA
LPPGVLGRSQRRVRLRPPQGIPPLSEQGQHLLRFLQTKGSGDYSVRYLQQ
QLPKVQRALKELERLGLVETYLAAAASQQPKRQQAVVLLNSEGKTLTQRQ
RQILRYLQQQGRDCWLQEVLKATGTTPQTLHRLAAKGYIAIVQQQHCRTE
QGIAVTPDRPKTLTPAQATALHVISKHLDCAQTFLLHGVTGSGKTEVYLQ
VIAQCLGRGRSALLLVPEIGLTPQLTDRVRARFGERLLVYHSGLSEGERY
DTWRLTLMPEPRVIIGTRSAVLLPLVGLGLIILDEEHDSGYKQDQPQPCY
HARTVAQWRSRQQQCPLILGTATPALSTWQAAQEGQIHLLSLPQRIHAAP
LPPITIVDMRQELHRGNRSMLSRPLQEALENLQGQQAILFVPRRGHSTFV
SCRSCGTVIYCPHCSVSLTGHLFGEEMEVLRCHYCNYTQRVPQRCPSCGS
PYLKPFGGGTQRVVRELNRLFPQLRVLRFDSDTTQRKGAHRQLLTQFAAG
EADVMVGTQMLTKGIDLPQVALVGILAADSLLHLPDYQAAERTFQILTQV
AGRSGRGAHPGQVILQTYVPDHPVITAVKAYDWDSFATRELSSRAPLGYP
PYAQLVLLRLSSPDPEDVAATAQAIAQHLQRLDAVSQGDWEVLGPAPAAI
AKIAGRYRWQILLKGQSLQTCGLSQALIHLKAECPRSTRLSIDVDPLNFL
>gid:519240  tll0423  
MTKIPPEVLERHCFCRSRKFTFEVNQYRLPHGSVSILGTVRHPGGALAVP
VNPEGNLILVKQYRFATEEYLLEFPAGTVEDHENPFATIEREIEEETGYR
AHHWQKLGEFYIAPGYSDEVIYAYLATQLEKLEVPPPQDTDEHIEVVEFS
PSDLAAAIHRGQVKDAKTVTSFYLALPYLPLA
>gid:519259  tll0442  
MRTPDYRRHDISDRVWERLEPHLPGRRGSWGGVAKDNRQFINAVFWILRT
GAPWRDLPPEYGDWKNVHRRFCRWRDKGVWEKVLEQLIDEPDYEWLIIDA
THVKVHPHATGAKGGNQDMGRTKGGSIPRYIWPWMRMVCRCEWLSQQVPL
RIVAKLQP
>gid:519285  tll0468  
MTVIKDAANRYFLSFVVEIDPVDQPASNPSIGIDLGIKAFATLSTGEKID
GPEYSKLDRKIRRKQRKLARQVKGSQRREKTRLQIAKLHSRISDIRRDFL
HKVSTRVVIENQVITLEDLNVSGMVKNRKLARAISLQGWREFRVLVEAKC
KRLNREFIVIDRWEPTSQTCSKCGFKWGKIDLSVRSVVCLNCGTEHDRDE
NAARNIEKVGIGHCHDYKWTQRESKTTSVASPSEASRIIAL
>gid:519292  tll0475  
MLVPRSESIERWSVAMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRASVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIPIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHENKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGRHPFDQLC
REKRIAHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDS
>gid:519295  tll0478  
MLVPRPESIEQWSVTKQERESLGGSPKSTTRQFKAYAPGFIPIDVKYLPQ
MPDEEQRRYLFVAIDRATRWVYLAIHENKSAESATRFLANVLANAPFVVR
TVLTDNGKEFTDRFSSAGERQPTGRHPFDQLCREKRIAHRLIQPRHPQTN
GMVERFNGRIAQILRAERFVSAADLQETLTRYLWAYNHRIPQRVLGHMTP
IEKLRWWQTERPDLFVSRVDNVTGLDT
>gid:519329  tll0512  
MQKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTQWKKQDDLQFLNEVSCVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPPNILWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRDLTLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIIIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAAGNILAVGHTVTVCGAGVRPDRHTSGGQLRRNRKSQK
>gid:519363  tll0545  
MRTPDYRRHDISDRVWERLEPHLPGRRGSWGGVAKDNRQFINAVFWILRT
GAPWRDLPPEYGDWKNVHRRFCRWRDKGVWEKVLEQLIDEPDYEWLIIDA
THVKVHPHATGAKGGNQDMGRTKGGSIPRYIWPWMRMVCRCEWLSQQVPL
RIVAKLQP
>gid:519370  tll0552  
MMTDSAKDLLAAAAVERRLGNRDAYVPYLRQVFADVVGEDDTVSETYGGL
GAAQERLANIQPLRYGKRRNFLTGEVTGLSAYLRHGVISLAAVRDRVRQL
VDDPSEAEALLQQLAWRDYWQRLYAHWGDRLWQDIEPYKTGWSAVDYATD
LPAALVAAETGLACMDAFSSDLQRTGYLHNHARLWLAAYVVHWCRVRWQA
GAAWFLQHLLDGDPASNNLSWQWVASTFSHKPYFFNRQNLERYSDGKYCR
QCPMSDRCPFDATYEELAARLFPHKT
>gid:519384  tll0566  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDS
>gid:519422  tll0604  
MQKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAVRTEAWYERKERLDY
VQTSALLTKWKKQDDLQFLNEVSSVPLQQALRHLQSAFSNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDAKVFLAKCNEPLNIRWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRELKLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHKLTTRLIRENQAIIIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAARNILAVGHTVTVCGAGVRPDRHTSGGQLRRSRKSQK
>gid:519506  tll0687  
MTAKYSCVTHRIIAIMRGMQKAFSYRFYPTTEQESLLRRTLGCVRLVYNR
ALAVRTEAWYERKERLDYVQTSALLTQWKKQDDLQFLNEVSSVPLQQALR
HLQSAFSNFFAGRAKYPNFKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEP
LNIRWSRRLPDGVEPSTVTIRLNPAGQWYISLRFDDPRELKLQPVDPSVG
LDVGMSSLITLSTGEKIANPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKA
RLKVAKIHQKISDSRKDHLHQLTTRLIRENQTIIIESLAVKNMVKNRQLA
RSISDAGWGELVRQLEYKAQWYGRTLVKIDRWFPSSKRCGQCGHIVERLP
LSVREWDCPKCGAHHDRDINAARNILAVGHTVTVCGAGVRPDRHTSGGQL
RRSRKSQK
>gid:519540  tll0721  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDTY
>gid:519582  tll0762  
MARQCVVIIAYEYIDPLWQPLPDPQVWGQEIDHWVLDVEASRPQLRYWLQ
RLVPGYWLLQQLSALGQTVMEVSDRLRQLETAGMTVIALAEGYVSDRPPA
GDQLLSLWDQVKQQLHRQTLCHNHARHRLKHRPPPGRAPYGYRRGKEHYV
IDRAAAVVVKDFVEHFLLYGSLSAAVRFIAHTHHKRISVATGRRWLTHPV
YRGHLYYQGKTVIPQTHAPLITPDEAAQVDRLLRRQRSLPRRSASAPHPL
AGLVICRQCQQRFGRTQVQPYRQPSQYAYLRPLHCPLSPKCRSIPYNAAL
AAVIDQIGERLPPAIAQLCLPSSTLTAEIAAIDKQLQQLTALEGQGLLDA
ETAQLRRYKLAGERARIEAQQAQLPPSNLLQLAVTLSQPQFWYQLSAAEQ
RFYLREFLQAIQVTSRPPQPWSVHLQFIWESAIGSAP
>gid:519750  tll0929  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDR
>gid:519773  tll0952  transcription-repair coupling factor
MSLMAIPRSWGKLPLTAELLSKLQHQRELVLTGMPRLVKGLVATTLAQES
RQSLCVITSTLEEGGRWAAQLELMGWDAVFFYPTSEASPYDLFDLEGEMV
WGQLQVLVESDRPNIAIVTTERALQPHLPPPEQFRAACLTLQVGQAYSLG
EVATTLAALGYERVSLVETEGQWSRRGDIVDIFPVSAELPVRLQWFGDEI
ESIREFDPASQRSLHTEGDRLDVLEQVTLTPISFTPLIAQALRAADHAHL
IPEDQEGLRRYLGLAFPEPASLLDYLPAQTLIAVDEPPLCVAHGDRWYEH
THTYWQSLETPPPPIHRPWSASAQALERFQRLHLYELASEGLGLNLSARA
VPAIPHQFGRLAATLREERDKGYTVWLVSAQPSRSVALLQEHDCPAQFVP
NPKDFPAIDKLQQQRLPIALKASGLAEISGFILPTFRTVLVSDREFFGQH
NLVNLGYVRKRRRAAAKQVDLNKLQPGDYVVHRQHGIGQFLRLETLTINN
ETREYLVLQYADGILRVAADQLNSLSRYRTQEDRAPQLNKLTGNTWERTK
ARVRKAIKKVAVDLLQLYAQRAQQRGFAFPPDTPWQREMEDSFPYQPTPD
QLKAIQEVKADMESDRPMDRLVCGDVGFGKTEVAIRAIFKAVMAGKQVAV
LAPTTILTQQHYHTLKERFAPYPIQVGLLNRFRSERERQDLLQKLKIGEI
DVVVGTHQLLSNSVKFRDLGLLVVDEEQRFGVNQKEKIKALKTQVDVLTL
SATPIPRTLYMALSGVREMSLITTPPPSRRPIQTHLAPYDPETVRSAIRQ
ELDRGGQVFYVVPRVEGIEAVAAKLQGMVVGARILIAHGQMAEGELESTM
LGFSNGEADILVCTTIIESGLDIPRVNTILVEDAQRFGLAQLYQLRGRVG
RAGIQAHAWLFYPRQEVLTDAARQRLRAIQEFTQLGSGYQLAIRDMEIRG
VGNLLGAQQHGQLDSVGFDLYVELLEEAIAEIRGQEIPTVDDTQIDLNVT
AFIPADYMPDLAQKMAAYRAVSAATTKEDLMQLAAEWSDRYGALPKSVQQ
LLRVVELKQLARQCGISRIRPEGKQHVILETSMAEPAWKLLLEQLPTHLQ
SRFVYSQGKITVRGLGTQPVEKQLEQLIDWFSQMKTTVATAP
>gid:519785  tll0964  
MQKAFSYRFYPTTEQESLLRKTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTKWKKQDDLQFLNQVSSVPLQQALRHLQSAFSNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPLNIRWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRELTLQPVDPSVGLDVGISSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWHKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIVIEWLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDQWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
VNAAGNILAVGHTVTVCGAGVRPDRHTSKGQLRRSRKSQK
>gid:519900  tll1077  adenine glycosylase
MPTVQDSGGGYPLPALRFALLNWYQQQGRDLPWRHSRDPYAIWVSEIMLQ
QTQVATVIPYYQRWLATFPTLPDLAAAELETVLKLWQGLGYYARARHLHR
AAQQIMTHHAGEFPRSYEAVVALPGIGRSTAGAILSAAFNQPQPILDGNV
KRVLARLYGLTVPPKQAEAQLWQWSAQLLCPQSPRDFNQALMDLGATICT
PRHPLCHACPWQHHCLAHRHQLTHEIPRKMSRSPLPHKKIGVAVIWNATG
QILIDRRPPTGLLGGLWEFPGGKIEPNETVQECIQREIREELGIEIRVGE
HLIDIDHAYTHFRVTLHVYYCQHLSGTPQPLECDAIRWVTPEELEQFPFP
KANTAIIQAIHERGRPTA
>gid:519907  tll1084  
MELSQAHLHTLQTCPRRYQYRYLESLMLPEARQLTQTAAQKRGRDFHRLL
QQHFQGLDVSPILEVQPELKSWFAAFQATPPPMIDGQGEAEHARSLGWQE
FTLVGIYDYVIFGQGQAQILDWKTHAHPPAPETLIHHWQTRLYCYLLAAT
SPYSPSQISMTYWFPRGEKGPSSYTFSYSQAMHQETHQVLGQCLNQLRQW
LRAYEQGQDLPQVPEAERAQYCDGCPFWERCQPSSVSMLDPFLAVFKDLG
VTCQVQINNRRQAT
>gid:519933  tll1108  
MTDDPTVTDASDPTSEDLVTLKAQRDALKAEIQALDAQFHRLVHDRLKSL
EERQQSLQLTIEQLERRKERIEQELRRNFVGASQELAIRVQGFKEFLVKS
MQELAATVEEMELLPPAPAVAETAPSPAETAATPPKLILDEGFQEEADRI
RRLLEQYRSSPNYYGPPWQLRRTFEQVHAERVESWFFDLGGRGALRSLPS
RLQNILVASAIISILRDFYGEMLRVLVLADSPERLGDWRRGLQDCLGITR
QDFGPDQGVALFESADALAFRADRLEQEDYIPLILIDDSQPQVSLSLLQY
PLLLGFAPEPQLRPSRSSDFFE
>gid:519965  tll1141  
MSSHLRKGRHSVTDAEGLELIEKSFREVAKKMDFQILEFNGEEDHVHALI
EYPPKLSVSQIVNALKGVSSRRYGKAALPKPHEESLWSPSYFAASVGGAP
LEVLKEYIRNQKKPS
>gid:519976  tll1152  
MQKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTQWKKQDDLQFLNEVSSVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPLNIRWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRDLTLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIVIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAAGNILAVGHTVTVCGAGVRPDRHTSKGQLRRSRKSQK
>gid:519986  tll1162  
MTAKYSCVTHRIIAIMSGVEKAFSYRFYPTTEQESLLRRTLGCVRLVYNR
ALAARTEAWYERKERLDYIQTSALLTQWKKQDDLQFLNEVSCVPLQQALR
HLQSAFSNFFAGRAKYPNFKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEP
LNIPWSRRLPDGVEPSTVTIRLNPAGQWYISLRFDDPRELTLQPVDPSVG
LDVGISSLITLSTGEKIANPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKA
RLKVAKIHQKISDARKDHLHQLTTRLIRENQTIVIESLAVKNMVKNRQLA
RSISDAGWGELVRQLEYKAQWYGRTLVKIDRWFPSSKRCGQCGHIVEWLP
LSVREWDCPKCGAHHDRDINAAGNILAVGHTVTVCGAGVRPDRHTSGGQL
RRSRKSQK
>gid:519999  tll1175  
MSSHLRKRRHSVTDLKIHLVCVTKYCRPVLSAEGLELIEKSFREVAKKMD
FQILEFNGEEDHVHALIEYPPKLSLSQIVNALKGVSSRRYGKAALPKPHE
ESLWSPSYFAASVGGAPLEVLKEYMRNQKS
>gid:520030  tll1205  serine/threonine protein kinase
MLLGSCWRIARGVETVPYPWPKWLPIPVLATTVGVAAVVIGVRQLGLLQP
SELSLYDFYVRSQPPRDPDSRILTVLVTEQDIQAQKTWPLPDATVAELLT
RLNAASPRAIGLDIFRDLPQPPGHDELQNMIRINPRIIPVCKSPGETVEQ
PEVPPPPSIPTAQVPERVGFADIPLDSDGVIRRNLFVINNPKAQRCTTPF
SFALMLALRYLEQDPSQTIKLDDKGLTLNDVHFPLLTSDAGGYVAVDDRG
MQTLLKYRSRTAVGPTVSLTDVLTGRVSEDLIRDRVVLIGVSAPSIKDTF
LTPYSGSDAVTQQMPGVVVHGQMVSQFLSAALEGEGPIWYWPVPLVLGWI
LLWAGVGSILGWWLRQPLWLGGAVVGGIIVLFGGGFGIFVMASGWIPVVP
PAIALVLSSVGMVGYVSYQAQQEQKSFREKVQEQEKALALLRSLMAEDTA
ALNAALREPTEITGKPRSLLSGRYKIQKVLGSGGFGRTYLAQDTQRPGNP
LCVVKHLRPGRTDERFMQVARRLFNTEAEILEKLGRHDQIPLLLAYVEEN
REFYLVQEFIDGVSLSEELKRKHTEAEAIQLLREILEVLNFVHSHYVIHR
DIKPDNIIRRASDRKLVLIDFGAVKQIQPQEAERTQGSTVVIGTMGYAPP
EQLSGQPTLSSDIYAVGMIVLQALTGIKPRDLPRDPRTGEVDWQQCVTVS
EGLAVVLNKMVKFNFSDRYPSAKEALQDARRLGTPAASVS
>gid:520068  tll1243  
MTTVSSIGVVPLPISPLRIMGVLPQQRWLLPPIDPECHDALRRALGCHPS
LAEIYLRRGLTTPTAVRAFLEPETLELPPPNAVFPDLDLAVELLQRAIAR
GDKMTICGDYDADGMTSTALLLRALRHLGADIDYEIPSRMHEGYGINERI
VQECYDRGVKLILTVDNGIAALHPILKARELGLTVIVTDHHDVPPQLPPA
HAILNPKLVPPTSPYHTLAGVGMAYILAVSLAQRLGNWRPLVRPLRELCT
LGTIADLAPLTGVNRRWVKQGLQTLPTSSLVGVQALMQVAGCLPEQDASL
KPTAVGFRLGPRINAIGRIGDPQVVIELLTTDDPDRAQELAALCEATNRR
RQDLCAAIEAEAIAHLEETDFDPQQEWVLVIVQPNWHHGVIGIVASRLVE
RYGVPVFIGTYENETIIRGSIRSIPEFHVFEALEATKDLLLKYGGHKAAG
GFSLRAEHLAAWRDRLRAFAQTCLKPEDLRPLVTLDAEISFEQLTWDFYT
QVEQLQPFGSENPQPMFCSRGVKILEQTPMGQQGEHLKLTLEQNGQQMTA
KAWRWGPFLPLPTRVDIAYHLTAHTWHGETQLELELKGVKPSLAWYVPAP
PSHPIPSWQPLPPLTTLLPQLRDPVLLYGYGRPEVPPNLTTAAIHYDRPR
HRCPTLILWSLPPSSTHLRWLLAIAQPKTVYVGYQRPAIAPLTTLILSIQ
NELKAKEETQVNLLALSQAYWIAPCTLVAILRHLGYSCEEFAPTLSITEE
LARLQRWYQLQAKDLARLAQLWGTA
>gid:520070  tll1245  
MRTPDYRRHDISDRVWERLEPHLPGRRGSWGGVAKDNRQFINAVFWILRT
GAPWRDLPPEYGDWKNVHRRFCRWRDKGVWEKVLEQLIDEPDYEWLIIDA
THVKVHPHATGAKGGNQDMGRTKGGSIPRYIWPWMRMVCRCEWLSQQVPL
RIVAKLQP
>gid:520201  tll1374  
MQKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
IQTSALLTQWKKQDDLQFLNEVSCVPLQQALRHLQSAFSNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPPNILWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRELTLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIIIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAAGNILAVGHTVTVCGAGVRPDRHTSGGQLRRSRKSQK
>gid:520206  tll1379  
MSSHLRKGRHSVTDLKIHLVCVTKYRRPVLSAEGLELIEKSFREVAKKMD
FQILEFNGEEDHVHALIEYPPKLSVSQIVNALKGVSSRRYGKAALPKPHE
ESLWSPSYFAASVGGAPLEVLKEYIRNQKKPS
>gid:520309  tll1480  Type III restriction-modification enzyme helicase subunit
MPFEITQQNASALQPLFKPWEEPTRHRLPNPQSGQPAIIAPGRRPSKVPL
VRAIRAEVDAWRRGGYAGVSETSHTLLHHWFESEHLVKNEAGDLIPFRYH
WAQREAIETFIYLYELRRVRNVAELLFEFGDQQLADLAFGIPPEQDRWAR
YCAKIATGAGKTKIMSLAIVWSYFHSLYEPNSDLARHFVVIAPNLTVYER
LKDDFENCAIFYADPLIPEEWRPDFQMQVVLQDEPGGATTTGALYLTNIH
RLYPSRDNGGEASEEEVSAIFGPPVVRGRALDTGESLRARITAHPRLMVL
NDEAHHLHDPDLAWNRAIDALHEESLQRGQRGLCLQLDFTATPKHNDGSL
FRHIVVDFPLGEAVDAGIVKVPVLGESDELVVRGDKKTPAHERYGMHLQL
GYQRYARTYEELGRVRKPVLFVMTEDAQAANEVADYLDSDAFPLLKGRVL
NIHTRLKGRIKTVTRGGRTYQEFVENETAMKADDLRALREMSRELDSPDS
KFRCVVSVMMLREGWDVRNVTTIVPLRPYSARSGILPEQTLGRGLRRMFP
LGEMPEIVTVIEHPAFRRLYEDELAQEGLDIALLPVREVFKQTVTIFVDH
EHKPVADLDIEIPQVSDAVETTSELQGLTFEEIRAAFQSKFKPLPIGRKK
EGAIEYKERHLFTDEIVATMKLDAGLLNNAWSAPSYFAQMLGRACRISNP
HQVLAPLVERFIAEVLFERPVDLYSGEVDHRMRDMDVMEHIRATFTPLIL
EKTVRQKKRQRISRGQRLSTWKPYQATSTAQRPALPAERTLFNLVPCDND
LEQAFTDFLETAQDVVAFAKNAGPQKLMLDYLKPDGQRAFYVPDFFVRTA
GGDHYLVELKGRQDELVPLKASAAVEWCKTASGKEARWHYLYVPYHLFQQ
GAATTIAELARACEPSLKALLDEWKTKQLALPLEEATTQSASEALFQRVL
QEAGIAQPPAEIADLMRQAVILLDHAVRSRYPTYAHAFQPLLGPLDEFAL
HILEKFLKPAIPANKEASILFFSPDLNHLAQREQALLERNQRYLRDNLVF
GRSIQRLGTLLFCLHYAHQGGMGADGIWKAVSDTFSSPGFRDLYELLERV
NEFRNTRVAHVETPLNDADEARGAMTTWLQALVQLHELATSPL
>gid:520311  tll1482  Adenine specific DNA methylase
MSNGVSWGPDNPHPLSRMKTELVWEGKYDEYGNRRPVKLPTLPLPLQRIE
TIDEPRDREKAQQLSIFNEAAFHQQAHRDDFRNMLIWGDNKLVMAALLEQ
FRGKIDLIYIDPPFDVGADFTMQVQIGEEGEAVQKEQSILEAVAYRDTWG
KGTDSYLHMMYERLTLMRELLSERGSIYVHCDWRMNAFLRQVLDDIFGRD
RFLNHIIWAYKTGGIPENVGFSKKHDDILIYTKSDTPVFNQLLQKSYVPT
LPEPTTISGKQLGVQRDEVCELCGVGRPGQKYRNVIMRDVWDDIQSIFRN
DQQTTGFDTQKPEALLERIIKASSNEGDLVADFFCGSGTTLAVAEKLGRR
WIGVDLGRYAIHTTRKRLIQVQRELHAADQPYRSI
>gid:520315  tll1486  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDTYVKRE
TTELVLVGWGKASVLTSASGTFIGTVGYSAPEQQEGKPEPASDLFALGAT
MVYLLTGYEPETYFRWGLREYRLYAEDIPHLHPLMVDLINCLTHPDPRER
YPNAAEVKRRLQEIATITPTTPPVSSSAS
>gid:520413  tll1582  
MSLPAPTLAELPLHPYIDGSGRVAPDLKGAIGLYAIFDAAGSVQYIGYSR
DMRLSLLQHLVRCPQGCSGYKAIAIERPDRPWLETVKQQWLAELGTVPLG
NDRDRRQWENAIDVKEQMTEAERSAWASADSLTQPKLLKQVARRVEAAIL
EQLRQRSLQEPLVFNAKLKESGRLDLK
>gid:520466  tll1635  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDN
>gid:520472  tll1641  endonuclease III
MAITRRLCAKQQRALEILTRLKRLYPHATCSLNFENPLQLLVATILSAQC
TDERVNQVTPALFARYRDAEDFAAADLAELEQYIKSTGFYRNKARHIQGA
CRRIVEVYGGQVPKVMEDLLSLPGVARKTANVVLAHGYGILGGVTVDTHV
KRLSRRLGLTQETDPVKIERDLMRLIPQPDWENWSIRLIYHGRAVCQARQ
PQCESCELIDLCATGRKLIPKP
>gid:520570  tll1737  
MRHVGDRGEAVVAAWLQTQQCQILAQNWSCPWGELDIIACDPGGVVLFVE
VKTRGSYNWDRDGLDAISPSKQRKLILAAQAFLESQPQWQEHPCRFDVAL
VRHQRGAYHLHHYLAQAFTLDSIK
>gid:520618  tll1784  
MAGYVLVLAVIILGGAIATVGDRLGSKVGKARLSWFNLRPRQTAVLITIL
TGSLISASTLAILFALSRELRDGVLRIDTIRRQQAAAEQELAQTRAQKDE
IEAELAQSQIELANIRQRLSQTNRVLEQAVNRQTLTEAELKQLQHRYTQA
QKNLENFEAQGARLRREIQRLQRERQAIQGRLEEVAGQKAALETAIRTAQ
QRLAEVEAQKDRLRAEIDRIQDQLAAANQQQQVLRNQQRSLQQEIAALEA
SRQRLEENVNILLLGLRRGTIAIRTGQVLASAVIQNVKDPDKATQVIEEL
LREARRNAIVLNSPQNLQPTDQVIQITTADVHRLRSQISDGQSYVVRILA
AANYLQGESNILVVPQVARNQQVFREGENLATISLDPSQMTDEQILQRLD
QLFTVSNQRAIASGVLPDPVTGTVGSFRQIELVKFVLELKDHQGTIDISA
VTPTSVYTAGPLTLSLVARQNQRVILRSG
>gid:520704  tll1869  
MPRRRKQFPCGHQGYGQVCHRCAQLAEAKALEAAQREQARLARQEWQASF
ASDVVELRGLPKGIVLKARQVIAQLLAGADYRQFKGKRLNHDRRVISIPL
SYDYRLICYDTGDRIEPRSVLSHEEYNVKKPLA
>gid:520734  tll1899  
MSSLGKQGQESHRIHQLQRLCRKYGPDLASVIAYRDRIQAELAALKDATT
SQECLEAEVAQRRQVFEQASEQLHQLRQGAAERLQQDLLAHLGPLGLPQA
RFTVQLTTTEASSSGSDEITFLWSANPGQPLQPLGETASGGEMKWLALPV
M
>gid:520765  tll1929  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDIYAPVL
RYETYLNNHQDWFHRCAHPMKAEPVGKHGYILTIGRYGSYGYEVEPKIGL
HLLPQEQGVYRIETIPVPEQPFLNYKVDFQAAMALVPMRATPETDRDLLA
LNILEYTRVNWELDLRVEMYFPRFIYRLPHGLIQGTGNRVLAQIVRQVSY
RLTAKVQDDFHKTIGLDLGKRWRRKRFHAERVRTLTPAPAPDRPDEPEPP
AA
>gid:520780  tll1944  DNA modification methyltransferase
MLSYPPHLVRGYIEDFGLNEGSVILDPFCGTGTTLVESKRQGIPSLGMEA
NPFAHFATSVKTDWRVDPDLLYSHSWEVAELALDILRQQGIEDSVPFAAD
IADLPLRQLSPEQNQLILAGSISPVPLHKVLLLLDCLEKYRIEKVYRHQL
LAIAHTLVFAVSNLRFGPEVGVAKAKVDAPVIRAWLAKIGEMVEDLRRVQ
DQEDTPAQVYLADARGPDRVLPPQSIDAVITSPPYPNEKDYTRTTRLESV
ILGFIKTKADLQTLKKGLIRSNTRNVYKGDDDDRWIQDHPKIQAIADAIE
RRRIQLGKTSGFEKLYSRVTKLYFGGMARHLAALRSLLRPNAQLAYVVGD
QASYLRVMIPTGQLLADIAQALGYEFVRTDLFRTRFASATKEQLREEVVI
LRWKGSSSQFRPTEGR
>gid:520792  tll1956  
MSEPLQFSLFDSPTEAEPAPATPLDPATYDQIPLRAEVPIPAGTYRNLEA
LAVHCQQCQRCGLAATRTHVVVSRGNPAAKLMIIGEGPGQAEDESGRPFV
GKAGQLLDKILASVNLDSERDAYICNIVKCRPPGNRVPTPIEAAACIPYL
LEQIRLVNPRIILLAGATAVSGLLKDHRGITKIRGQWIEWQGRWCMPIFH
PAYLLRNNSREPGSPKWLTWQDIQAVRDRLRQLDS
>gid:520841  tll2004  
MEKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTKWKKQDDLQFLNEVSSVPLQQALRHLQSAFSNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPPNILWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRDLTLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIIIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDQWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
VNAAQNILAVGHTVTVCGAGVRPDRHTSGGQLRRSRKSQK
>gid:520869  tll2031  
MSSHLRKGRQSVTDLKIHLVCVTKYRRPVLSAEGLELIEKSFREVAMKMD
FQILEFNGEEDHVHALIEYPPKLSVSQIVNALKGVSSRRYGKAALPKPHE
ESLWSPSYFAASVGGALLEVLKEYMRNQKKPS
>gid:520902  tll2064  mutator MutT protein
MSATTMVPVALAILYQGDRVLMQLRDDYPHILYPGHWGLFGGHLEPEEVP
LEGVRREVYEEIGYCPPHLTFFGEYGDPQVHRYIFTGPLTCELRTLVLNE
GQGMDLVPYASVVAGVHYAQTLGEDRPLGAIHQRILLDFFAQFRKESTL
>gid:520909  tll2071  
MNDTPLNLADELLVRDQLVQQLSEELYQLMVQHPELFARFYQARKAEAAN
AEALRLLQAQVQQVEAQIAAYQEQILAYQQQAQNREAEMNNLKAQVMALS
DRNEMLERVIQEMPEVYRQKFSERLSQVKLKIESLEKENSQLRAELRNLQ
TLLAAQVRQQQQQGLASLQPARIGLIPSFNT
>gid:520944  tll2104  
MLVPRPESIEQWSVTKQERESLGGSPKSTTRQFKAYAPGFIPIDVKYLPQ
MPDEEQRRYLFVAIDRATRWVYLAIHENKSAESATRFLANVLANAPFVVR
TVLTDNGKEFTDRFSSAGERQPTGRHPFDQLCREKRIAHRLIQPRHPQTN
GMVERFNGRIAQILRAERFVSAADLQETLTRYLWAYNHRIPQRVLGHMTP
IEKLRWWQTERPDLFVSRVDNVTGLDT
>gid:520978  tll2137  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDTYATTA
>gid:520988  tll2147  N-methylpurine-DNA glycosirase
MVAEELLGCILVRQQANGQLYRGRIVETEAYMAGDPACHGYRRQTARNAP
MFAAPGTIYVYQIYGIHHCLNIASDRPNFASAVLIRALEMLSPPLPPSSA
AGPGKLCRVLGIDRRLSGLMLGKESGLWLERPPQPLTDPVVQTTRIGIAQ
GQEIPWRWYVQGNPAVSRYC
>gid:521063  tll2222  serine/threonine protein kinase
MSMSSLGLLSVGTTLQGGKYHLDALLSQGGFGVTYRATHTLLHQPVVLKT
LNLQQEPPKRIHDLGERFIQEAQRLAQFNHPHIVRVSDCFIEGGRPFIVM
DYIPGRTLAQVIQEEGPLPEKTALHYIRQVASALELVHEHGLLHRDVKPD
NIMLREGTDQVVLIDFGIAREYTTGVTETNTGLVSAGYAPVEQYLPRHQW
TPATDVYALAATLYALLAGRPPVASILRDRVPLEDLRQFQPNLSQRTIDA
IEAGMALDVRERPQTVRAWLQLLMGQSVARQTTATVAVMPQHRSTVFAST
QPEGTAVVASPRQRNFSPWLWLLGTAVFGSLAGIGLGLFLRNQPVDTVTP
PPPRPQEQEFPPTLPRVVPPVEVTPSPEPTPNPAPEPTPSPESTEPATPS
PAPQEASPPQPTPPPSSSEPSNATPSPEPGVSQSSPPPAPVESPVPTPPP
SPAAETPPPPTNSQPTSEPPTPPPQPTPPPSSSEPATP
>gid:521078  tll2237  
MSSHLRKGRHSVTDLKIHLVCVTKYRRPVLSAEGLELIEKSFREVAKKMD
FQILEFNGEEDHVHALIEYPPKLSVSQIVNALKGVSSRRYGKAALPKPHE
ESLWSPSYFAASVGGAPLEVLKEYIRNQKKPS
>gid:521114  tll2267  DNA-binding protein HU
MIPMNKAELVDAVFNRAHSANNVTKKQVEAIISATVEEIMEAVAKGEKVT
LVGFGAFERRERKAREGRNPKTKEKMQIPATNVPAFSAGKLFKEKVAPPA
AAEPAAKGKKK
>gid:521145  tll2297  
MKMDTGALTFQQKIYWLVKQIPLGRVATYGQIAALCGWPRHGRYVGYALY
RVAPNSDIPWHRVVNAQGKISYAPQRRGTDELQRWRLAAEGIIFTAGDRI
NLRRYQWQPPATVYQQLIWPEGAATDSAKAKVG
>gid:521170  tll2321  
MSSHLRKGRHSVTDLKIHLVCVTKCRRPVLSAEGLELIEKSFREVAMKMD
FQVLEFNGEEDHVHALIEYPPKLSVSQIVNPLKGVSSRRYGKAALPKPHE
ESLWSPSYFAASVGGAPLEVLKEYIRNQKKPS
>gid:521178  tll2329  
MISVLGLDLGRKRIGVAGCDRLGQLATGITTIYRRNFASDVAQLRRICQE
RGVEKLIVGLPYTLDGQLGSQARQVQHLAEKIGAALNLPVEYIDERLTSF
QAEEILKQRRRSPRHHKDLVDQIAAALILQQWLDARSQTAKATLAAGDPQ
L
>gid:521235  tll2385  
MKMKFNHHRYRVAQFYDENQTEFCLATNLKHLRDEEVSQLYRHRWAIENL
WKFLKMHLSLDRLITKSLKEVVNQIYMVLITYLILELVDAPKYFGRKLLD
KLRYFQLELSGRCSIVHWRFDWQPEQLVTYNVKFCCRIQHFCLKRLPAAQ
>gid:521248  tll2398  
MSSHLRKGRHSVTDLKIHLVCVTKYRRPVLSAEGLELIEKSFREVAKKMD
FQILEFNGEEDHVHALIEYPPKLSVSQIVNALKGVSSRRYGKAALPKPHE
ESLWSPSYFAASVGGALLEVLKEYIRNQKKPS
>gid:521267  tll2417  
MPMSYVVRKWAAGRVWQMLSYKAQKLAMKTTLQNEAYTSQECPVCLHRQK
VKGKNYHCTNCGFKYHRDGVGSINIRRKYLNLGSVVGVMAPPFGVPFKPH
IQRSPVPGVSVGQ
>gid:521281  tll2431  
MTAKYSCVTHRIIAIMRGVEKAFSYRFYPTTEQESLLRKTLGCVRLVYNR
ALAARTEAWYERKERLDYVQTSALLTQWKKQDDLQFLNEVSCVPLQQALR
HLQSAFTNFFAGRAKYPNFKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEP
LNIRWSRRLPDGVEPSTVTIRLNPAGQWYISLRFDDPRDLTLQPVDPSVG
LDVGMSSLITLSTGEKIANPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKA
RLKVAKIHQKISDSRKDHLHQLTTRLIRENQTIIIESLAVKNMVKNRQLA
RSISDAGWGELVRQLEYKAQWYGRTLVKIDQWFPSSKRCGQCGHIVERLP
LSVREWDCPKCGAHHDRDVNAAQNILAVGHTVTVCGAGVRPDRHTSGGQL
RRSRKSQK
>gid:518835  tlr0031  reverse transcriptase
MPPCYPTMTVDQTTGAVTNQTEISWHSINWAKANREVKRLQVRIAKAVKE
GRWGKVKALQWLLTHSFYGKALAVKRVTDNSGSKTPGVDGITWSTQEQKT
QAIKSLRRRGYKPQPLRRVYIPKANGKQRPLGIPTMKDRAMQALYALALE
PVAETTADRNSYGFRRGRCTADAAGQCFLALARAKSAEHVLDADISGCFD
NISHEWLLANTPLDKGILRKWLKSGFVWKQQLFPTHAGTPQGGVISPVLA
NITLDGMEELLAKHLRGQKVNLIRYADDFVVTGKDEETLEKARNLIQEFL
KERGLTLSPEKTKIVHIEEGFDFLGWNIRKYNGVLLIKPAKKNVKAFLKK
IRDTLRELRTATQEIVIDTLNPIIRGWANYHKGQVSKETFNRVDFATWHK
LWRWARRRHPNKPAQWVKDKYFIKNGSRDWVFGMVMKDKNGELRTKRLIK
TSDTRIQRHVKIKADANPFLPEWAEYFEKRKKLKKAPAQYRRIRRELWKK
QGGICPVCGGEIEQDMLTDIHHILPKHKGGSDDLDNLVLIHANCHKQVHS
RDGQHSRSLLKEGL
>gid:518940  tlr0134  
MTAKYSCVTHRIIAIMRGVEKAFSYRFYPTTEQESLLRKTLGCVRLVYNR
ALAARTEAWYERKERLDYVQTSALLTQWKKQDDLQFLNEVSCVPLQQALR
HLQSAFTNFFAGRAKYPNFKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEP
LNIPWSRRLPDGVEPSTVTIRLNPAGQWYISLRFDDPRELTLQPVDPSVG
LDVGMSSLITLSTGEKIANPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKA
RLKVAKIHQKISDSRKDHLHQLTTRLIRENQTIIIESLAVKNMVKNRQLA
RSISDAGWGELVRQLEYKAQWYGRTLVKIDRWFPSSKRCGQCGHIVEWLP
LSVREWDCPKCGAHHDRDINAAGNILAVGHTVTVCGAGVRPDRHTSGGQL
RRNRKSQK
>gid:518963  tlr0157  
MSSHLRKGRHSVTDAEGLELIEKWFREVAKKMDFQILEFNGEEDYVHALI
EYPPKLSLSQIVNALKGVSSRGYGKAALPKPHEESLWSPSYFAASVGGAP
LEVLKEYMRNQKKPS
>gid:519001  tlr0194  
MDNLSQRYLRLCQAYSQLAERYTKLDIDHMTLREKLVPFLMAFKYYKQMT
EQLTAEKEAMQRELNELRDRYQLLVSQNGNVAIHEELLSALAEAEEQMEL
IEETLKEQEADPDPNLLPIEKQLLEEYTRGSGDFQVLLPQSLSHSGVTA
>gid:519022  tlr0215  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDT
>gid:519056  tlr0247  
MISENAVVVMDRGFASWKFLEQLSERNCLFVVRLKNNMRMKFNDHRYRVV
EFYDENQREFCLATNLKHLSDEEVSQLYRHRWAIENLWKFLKMHLSLDRL
IAKSLKGMVNQIYMFLIVYLILELVDVPKYFGRKLLDKLRYLQLELSRGC
SVVHWCFDWQPEQLIT
>gid:519067  tlr0257  
MKARYRYRFYPTDQQRQRLAQLFGCVRVVWNDALAICKQSDSLPKTSELQ
KLVITQGKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFDSIKGRRKGKK
INPPRFKKKTERQSARFTTYGFSIKGEEVYLAKIGNVRPIWSRKLPSKPS
SVTVIKDAANRYFLSFVVEIDPVHQPASNPSIGIDLGIKAFATLSTGEKI
DGPDYSKLDKKIRRKQRKLARQVKGSQRREKTRLQIAKLHSRISDIRRDF
LHKVSTRVVIENQVIALEDLNVSGMVKNRKLARAISLQGWREFRVLVEAK
CNKLNRDFIVIDRWEPTSQTCSKCGFKWGKVDLSVRSVVCLNCGAEHDRD
ENAARNIEKVGIGHCHDYKWTQRGSKTTSVASPSEASRIIAL
>gid:519084  tlr0273  
MQKAFSYRFYPTTEQESLLRKTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTQWKKQDDLQFLNEVSCVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPLNIPWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRELTLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIIIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVEWLPLSVREWDCPKCGAHHDRD
INAAGNILAVGHTVTVCGAGVRPDRHTSGGQLRRNRKSQK
>gid:519120  tlr0308  reverse transcriptase
MPPCYPTMTVDQTTGAVTNQTETSWHSINWTKANREVKRLQVRIAKAVKE
GRWGKVKALQWLLTHSFYGKALAVKRVTDNSGSRTPGVDGITWSTQEQKT
QAIKSLRRRGYKPQPLRRVYIPKANGKQRPLGIPTMKDRAMQALYALALE
PVAETTADRNSYGFRRGRCTADAAGQCFLALAKAKSAEHVLDADISGCFD
NISHEWLLANTPLDKGILRKWLKSGFVWKQQLFPTHAGTPQGGVISPVLA
NITLDGMEELLAKHLRGQKVNLIRYADDFVVTGKDEETLEKARNLIQEFL
KERGLTLSPEKTKIVHIEEGFDFLGWNIRKYNGVLLIKPAKKNVKAFLKK
IRDTLRELRTATQEIVIDTLNPIIRGWANYHKGQVSKETFNRVDFATWHK
LWRWARRRHPNKPAQWVKDKYFIKNGSRDWVFGMVMKDKNGELRTKRLIK
TSDTRIQRHVKIKADANPFLPEWAEYFEKRKKLKKAPAQYRRIRRELWKK
QGGICPVCGGEIEQDMLTDIHHILPKHKGGSDDLDNLVLIHANCHKQVHS
RDGQHSRSLLKEGL
>gid:519124  tlr0313  
MQLIYRGAKYKTSEQHIPLVESGAKGLYRGAPWVGHKPAETVPQPNHVLC
WRGVTYQTNGQPVATTAPRVSAAPSVVPQRKRDSWVEAHRRAILETLERR
LQVARSQGNQELISLLEEEWQQFA
>gid:519162  tlr0350  
MSERSPALADFLAQLPFELDAFQAAAIAALDAGRSVVVCAPTGSGKTLIG
EYAIHRALTRQQRVFYTTPLKALSNQKWRDFQQQFGAAQVGLLTGDISIN
RDAPILVMTTEIFRNMLYGTPIGEVGTSLAGVEVVVLDECHYMNDRQRGT
VWEESIIYCPKEIQLVALSATIANGEQLTDWIQSVHGDAELIYSDWRPIP
LHFYFCNGKGLFPLLDGQRKRLNPKLHGQPELRRRGSKRDFLSIRYVVSQ
LQQRDMLPAIYFIFSRRGCDQAVQEVLGMNLLTKAEQQALAERVEAFLAQ
HQDIVAPEMIAPLYQGIAAHHAGVLPVVKTLVETLFQEGLIKLVFATETL
AAGINMPARTTVISTLSKRTDSGHRLLTASEFLQMAGRAGRRGMDTVGHV
VTLQTPFEGAHEAAFLATAAPDPLMSQFTPSYGMVLNLLQRHTLEEAREL
VERSFGQYLATLQLTPQRQAIAQLEMELQTVQQRLAGIDRQQLAQYQKLR
ERLRQDQRLLKILEQQAEQERTQALLPLMMAVPPGTWLHIKSPLREHPPL
AAVLCQPVAGSGQLPHWLCLAADGRLRVVGIDDILGVYPDRPPCDPLPPL
PEAMKLRRGESYPCHGADHYLGCLPNLPPVLAAPEVAAQAAKIADLEAKL
SQLQGSLPQNVHSLLRLVRREERLQTELRDRQQKLHQQSQRHWEQFLALI
AALQDFGGLNDLTPTPLGEMAAALRGENELWLALALASGELNDLPPHLLA
AAVAALVTETPRSDSWCNYPIPSEVEERLAALSPIRRRLFQVQRRYQIIF
PLWYEWDLIGLVEQWALGTPWHELCAQTNLDAGDIVRLLRRTLDFLSQIP
HAPHTSPQLRQSAQQARYLLDRFPVNDLLEGVELEVATL
>gid:519175  tlr0361  
MQAPNLVYRKFQAARDAAQGFKGQWQQFSKRFIDCLEQTQKAGQVKADFL
QQCKADIAALHSSLEEALQALNARLDSPLLTLATTGTTSSGKSTLVNFLC
GAELVPVAVSEMSAGTVTLEDGDVPSLTIEATPGALWECGTWENISAATI
YDKLEEAMVQYLDTKDTNPSLAPPKANITYPIQFLRQHRHQLGLPFGTRV
QLLDLPGFAYVGDERNAQIIREQCRQALCLVTYNSAETDAKKTEQLLEEV
VIQVKNLGGSPARMLFVLNRIDVFQEDRDPQKSEQRFVSKTVGQIKAKLK
EKLPEHEEAIETLKVLKLSARAALYSQLIQQSVGEERAKICERAEEQHKA
LIGEDLLEELPRNPAKWTDHDCQRVAESLWANSYGQELFKELVEQIQQHF
AELVIPQIVVDFNKGAANKVLEWCTQEVDAICNSIEEKYRQECERLEETG
VQLSEKLQQAGKNLKVPFEEMKELITSHSPITNHSIVLLESIMFEFIGNK
EQTECEPEVDDNDDDEWSDKIRQWRDKIQQWHSQVDDEWRDKIKLDDEIL
PLIYTWRREVFISIEQLLEQVYLSIQGGKVDFSADVNLKQLPPPDQKYVQ
ELEETLREFIEFIKAIGYNCADVAKQGMTIRQETQGDFKIIYFQSIIKNI
QKRLSEVIEVLVKSVLAREQGPIYDVVSLLFRNHLEYLQESAQRIAPELG
IQFPMSTLSQVQKELNWDISLEKAFSPEAKEATWTAIETRKTRPWWLLRL
IEIETKVTVQKSGLELEVPSMEKLCEDWIAACKERVEEYLVPAYLDWFIE
QTDLLTEQANEYQQAVLERYRSKLSECRQALSIDYETRKQFWEPMRTSVA
ELEQERPSWDKIKQSENAGVTSIDLNAETSSV
>gid:519176  tlr0362  
MQAPNLVYRKFQAARDAAQGFKGQWQQFSKRFIDCLEQTQKAGQVKADFL
QQCKADIAALHSSLEEALQALNARLDSPLLTLATTGTTSSGKSTLVNFLC
GAELVPVAVSEMSAGTVTLEDGDVPSLTIEATPGALWECGTWENISAATI
YDKLEEAMVQYLDTKDTNPSLAPPKANITYPIQFLRQHRHQLGLPFGTRV
QLLDLPGFAYVGDERNAQIIREQCRQALCLVTYNSAETDAKKTEQLLEEV
VIQVKNLGGSPARMLFVLNRIDVFQEDRDPQKSEQRFVSKTVGQIKAKLK
EKLPEHEEAIETLKVLKLSARAALYSQLIQQSVGEERAKICERAEEQHKA
LIGEDLLEELPRNPAKWTDHDCQRVAESLWANSYGQELFKELVEQIQQHF
AELVIPQIVVDFNKGAANKVLEWCTQEVDAICNSIEEKYRQECERLEETG
VQLSEKLQQAGKNLKVPFEEMKELITSHSPITNHSIVLLESIMFEFIGNK
EQTECEPEVDDNDDDEWSDKIRQWRDKIQQWHSQVDDEWRDKIKLDDEIL
PLIYTWRREVFISIEQLLEQVYLSIQGGKVDFSADVNLKQLPPPDQKYVQ
ELEETLREFIEFIKAIGYNCADVAKQGMTIRQETQGDFKIIYFQSIIKNI
QKRLSEVIEVLVKSVLAREQGPIYDVVSLLFRNHLEYLQESAQRIAPELG
IQFPMSTLSQVQKELNWDIFVEKAFSPKKATWTTTERRKAGPRSLLCLIE
IETKVNVKKSGFVLEVPSMVKLCEDWSAACKEWVDEYLVPAYLDWLLEQT
DLLTEKANEYQQAVLERYRSKLSECHQALSIDYETHKQFWELMRTSVAEL
EQARPSWDKIK
>gid:519205  tlr0389  
MSSHLRKGRHSVTDLKIHLVCVTKYCRPVLSAEGLELIEKSFREVAKKMD
FQILEFNGEEDHVHALIEYPPKLSLSQIVNALKGVSSRRYGKAALPKPHE
ESLWSPSYFAASVGGALLEVLKEYMRNQKS
>gid:519218  tlr0401  
MKARYRYRFYPTDQQRQRLAQLFGCVRVVWNDALAICKQSDSLPKTSELQ
KLVITQGKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFDSIKGRRKGKK
INPPRFKKKTERQSARFTTYGFSIKGEEVYLAKIGNVRPIWSRKLPSKPS
SVTVIKDAANRYFLSFVVEIDPVHQPASNPSIGIDLGIKAFATLSTGEKI
DGPDYSKLDKKIRRKQRKLARQVKGSQRREKTRLQIAKLHSRISDIRRDF
LHKVSTRVVIENQVITLEDLNVSGMVKNRKLARAISLQGWREFRVLVEAK
CNKLNREFIVIDRWEPTSQTCSKCGFKWGKVDLSVRSVVCLNCGTEHDRD
ENAARNIEKVGIGHCHDYKWTQRGSKTTSVASPSEASRIIAL
>gid:519223  tlr0406  
MQKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTQWKKQDDLQFLNEVSSVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPLNIRWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRELTLQPVDPSVGLDVGISSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIVIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAAGNILAVGHTVTVCGAGVRPDRHTSGGQLRRSRKSQK
>gid:519262  tlr0445  serine/threonine protein kinase
MSEPHRDTKIGQVLMNRYQLTELIGKGSMGRVYRAEDILLGGVPVAVKFL
SQTLLNDRMKTRFAQEARAGALLGQKSMHVVRVLDYGMNNEEIPFYVMEF
LEGENLSDLLLEEPLPLSRFLRIARHMCLGLQVAHEGIIIEGQKCPIIHR
DIKPSNVLVIQDGTMGELAKLLDFGIAKFLGDVPEKGQTSSFMGTLAYCS
PEQIEGRELDHRSDIYSLGITMYELLTGKMPIQAESHSIGSWFKAHHFQK
PIPFNVASPGLHLPPALEELIMACMAKSPSDRPQNVAEIIKVLTALEDQF
GSTRVTQPGVEVGAKVSQPLERQSQPPVALATVEEVCWQSVWPADKPVAE
IVFPMPLYAQRESAASLWVMLPRSEIDRRMLNLRYNQFLFTTSPHPMILW
ITAIYDPQQGPRWLPCYLDMKLPRNQELCLLLSETGYYPLLFFSLEDPQH
CINVRTFTIATFQRQLLRDWLQTSKNLPSSAPAVVSRNLLKAEFENYKPQ
ILAKLENVKMATVID
>gid:519287  tlr0470  
MSSHLRKGRHSVTDLKIHLVCVTKYCRPVLSAEGLELIEKSFREVAKKMD
FQILEFNGEEDHVHALIEYPPKLSLSQIVNALKGVSSRRYGKAALPKPHE
ESLWSPSYFAASVGGALLEVLKEYMRNQKS
>gid:519339  tlr0522  reverse transcriptase
MPPCYPTMTVDQTTGAVTNQTETSWHSINWTKANREVKRLQVRIAKAVKE
GRWGKVKALQWLLTHSFYGKALAVKRVTDNSGSRTPGVDGITWSTQEQKT
QAIKSLRRRGYKPQPLRRVYIPKANGKQRPLGIPTMKDRAMQALYALALE
PVAETTADRNSYGFRRGRCTADAAGQCFLALAKAKSAEHVLDADISGCFD
NISHEWLLANTPLDKGILRKWLKSGFVWKQQLFPTHAGTPQGGVISPVLA
NITLDGMEELLAKHLRGQKVNLIRYADDFVVTGKDEETLEKARNLIQEFL
KERGLTLSPEKTKIVHIEEGFDFLGWNIRKYNGVLLIKPAKKNVKAFLKK
IRDTLRELRTATQEIVIDTLNPIIRGWANYHKGQVSKETFNRVDFATWHK
LWRWARRRHPNKPAQWVKDKYFIKNGSRDWVFGMVMKDKNGELRTKRLIK
TSDTRIQRHVKIKADANPFLPEWAEYFEKRKKLKKAPAQYRRIRRELWKK
QGGICPVCGGEIEQDMLTDIHHILPKHKGGSDDLDNLVLIHANCHKQVHS
RDGQHSRSLLKEGL
>gid:519428  tlr0610  
MTYKRLSDSERQRVFQQYTAGETIKALAGEFGVSENTIRRVIKQMEEEAA
APEAPLLAAATTDDMPEEEDLEDVETAEAVVLDEDDYSDAADDDLEDEEE
LEGDIPLPDLTDVEVVQSVEVIPLSEAVLPRPCYVVVDRRAELLTRPLEA
FRGLGALSSEEAQQSTLPIFDSRPVALRYSQQNQRLYKASDVQWRKTVVK
VPDGEMLRKVRSYLQSKGITRLLYHGRVYALD
>gid:519437  tlr0620  reverse transcriptase
METRQMAVEQTTGAVTNQTETSWHSIDWAKANREVKRLQVRIAKAVKEGR
WGKVKALQWLLTHSFYGKALAVKRVTDNSGSKTPGVDGITWSTQEQKAQA
IKSLRRRGYKPQPLRRVYIPKANGKQRPLGIPTMKDRAMQALYALALEPV
AETTADRNSYGFRRGRCIADAATQCHITLAKTDRAQYVLDADIAGCFDNI
SHEWLLANIPLDKRILRKWLKSGFVWKQQLFPIHAGTPQGGVISPMLANM
TLDGMEELLNKFPRAHKVKLIRYADDFVVTGETKEVLYIAGAVIQAFLKE
RGLTLSKEKTKIVHIEEGFDFLGWNIRKYDGKLLIKPAKKNVKAFLKKIR
DTLRELRTAPQEIVIDTLNPIIRGWTNYHKNQASKETFVGVDHLIWQKLW
RWARRRHPSKSVRWVKSKYFIQIGNRKWMFGIWTKDKNGDPWAKHLIKAS
EIRIQRRGKIKADANPFLPEWAEYFEQRKKLKEAPAQYRRTRRELWKKQG
GICPVCGGEIEQDMLTEIHHILPKHKGGTDDLDNLVLIHTNCHKQVHNRD
GQHSRFLLKEGL
>gid:519470  tlr0651  
MMAMGRTKYFLGFDPGQDKCGLALALGQEVVRHEVVASDRAIACVQEWHQ
AYPFEQVIIGNQTTSRRWQQQLQAALAVEIIPVDERHSTLEARDRYWQLF
PPRGWQRLIPKGLRVPPRPIDDIVAIVLLERYCGYPLLLSQRQ
>gid:519572  tlr0752  
MRTPDYRRHDISDRVWERLEPHLPGRRGSWGGVAKDNRQFINAVFWILRT
GAPWRDLPPEYGDWKNVHRRFCRWRDKGVWEKVLEQLIDEPDYEWLIIDA
THVKVHPHATGAKGGNQDMGRTKGGSIPRYIWPWMRMVCRCEWLSQQVPL
RIVAKLQP
>gid:519594  tlr0774  
MRTPDYRRHDISDRVWERLEPHLPGRRGSWGGVAKDNRQFINAVFWILRT
GAPWRDLPPEYGDWKNVHRRFCRWRDKGVWEKVLEQLIDEPDYEWLIIDA
THVKVHPHATGAKGGNQDMGRTKGGSIPRYIWPWMRMVCRCEWLSQQVPL
RIVAKLQP
>gid:519797  tlr0976  
MATVSLLPNSRGLPSDLTINDQEVHRGGEGSIFFTVDAQYVVKIYHRPAP
DKQQLLQHVLDLGRNLGEDEQFLAWPLGIVDQLNGQTKVGVVTRRVPASH
VPLYKLIYSPLDAMEQFHQGRTWLEYLKMARGTAAAVRTIHGKGMAHADI
HLKNFLADPISGEVVLIDLDGLVVKGFLPPQVKGMVGFIAPEVMMGKAQP
DQSSDRHSLAVLVLWILLLRNVMLTQTCYDPDDEKHDDELGYGQYACFSE
HPNDRRNWIPRIGTPFFRRGALSYRTLTPKLQELTERALIHGLHDPPQRP
QALEWERALAEAYDLLISCPTCRQSFFYPYWLQPPRRRQCPFCGASVQPP
FPAVLQLMESRARGTYVPVRMIVLYHGLPIFADIAEPGRLPPFTRRGTPL
LGQVVWDAKAGVHRLLNNSDTPWQMLAGGTGMVRRGASVVLRPGLLLSFG
DGRRLVRVIE
>gid:519819  tlr0998  reverse transcriptase
METRQMAVDQTTGAVTNQTETSWHSIDWAKANREVKRLQVRIAKAVKEGR
WGKVKALQWLLTHSFYGKALAVKRVTDNSGSRTPGVDGITWSTQEQKTQA
IKSLRRRGYKPQPLRRVYIPKANGKQRPLGIPTMKDRAMQALYALALDSA
VTSL
>gid:519820  tlr0999  
MRTPDYRRHDISDRVWERLEPHLPGRRGSWGGVAKDNRQFINAVFWILRT
GAPWRDLPPEYGDWKNVHRRFCRWRDKGVWEKVLEQLIDEPDYEWLIIDA
THVKVHPHATGAKGGNQDMGRTKGGSIPRYIWPWMRMVCRCEWLSQQVPL
RIVAKLQP
>gid:519822  tlr1001  reverse transcriptase
MLANMTLDGMEGLLHKYLKKYKVNLIRYADDFVVTGESRETLCIAAAIIQ
KFLKERGLTLSPEKTKIVHIEEGFDFLGWNIRKYDGKLLIKPAKKNVKAF
LKKIRDTLRELRTAPQEIVIDTLNPIIRGWTNYHKNQASKETFVGVDHLI
WQKLWRWARRRHPSKSVRWVKSKYFIQIGNRKWMFGIWTKDKNGDPWAKH
LIKASEIRIQRRGKIKADANPFLPEWAEYFEQRKKLKEAPAQYRRTRREL
WKKQGGICPVCGGEIEQDMLTEIHHILPKHKGGTDDLDNLVLIHTNCHKQ
VHSRDGQHSRFLLKEGL
>gid:519839  tlr1017  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDN
>gid:519876  tlr1053  
MPLRISGQRSLKTLPGHRTRPTSAKVRQAIFNIWQGRLQGCCWLDLCAGT
GAMGAEALLRGAQWVVGIEQSPQACQIIRENWSALAQPHQRTLVLRGDVR
QKLPQVPMPQFDLIYFDPPYNSDLYLPVLQLLWQEERLNPKGEIAVECRT
KNPPDLEGILAIGWQQQRTKTYGSTMVIFLSR
>gid:519923  tlr1098  serine/threonine protein kinase
MTELMGQILGDRYELLRELGSHPGRRTYLALDRWHYEQVVLKLLLFGGGI
TWDDFKLFEREVAVLQTLDHPAIPRYLDYFEINTPDLKGFALVQTYIEAK
SLALWQAEGRVFSEGDLRLLADRLLDILIYLHSRQPPVIHRDIKPTNILL
GDRSGHDLGKVYLVDFGAVQTAIAGSTRTVVGTYGYMPPEQFGGKTLPAS
DLYALGMTLIYLATATPPDELPQKELRVQFQPLVSLSPPFIRWLEKMVAP
ALEERFDSATAARHALKQLDRSAEDASLITKTPPYGSRLQVRESPQGLLV
HIPPVGLNASLFFLIPFAVAWNGFLVVWYSLAIASGFWLMAVFATGHLAV
GLGLIGGILYRLLGWQVLMVSPKDLRFYEHILGFRWRHAKVLPLEVIEEI
ELQPRGTCRHQEDDRQQLWSKLVIRAGSQEIELTENAPRVTSRDLEWLAY
TLGRYLGRRVTQARQDCRST
>gid:519938  tlr1113  
MKARYRYRFYPTDQQRQRLAQLFGCVRVVWNDALAICKQSDSLPKTSELQ
KLVITQAKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFDAIKGRRKGKK
INPPRFKKKTERQSARFTTYGFSIKGEEVYLAKIGNVRPIWSRKLPSKPS
SVTVIKDNANRYFLSFVVEIDPVDQPASNPSIGIDLGIKAFATLSTGQKI
DGPDYSKLDKKIRRKQRKLARQVKGSKRREKTRLQIAKLHSRISDIRRDF
LHKLSTRVVIENQVIALEDLNVSGMVKNRKLARAISLQGWREFRVLVEAK
CNKLNREFIVIDRWEPTSQTCSTCGFKWGKVDLSVRSVVCLNCGTEHDRD
ENAARNIEKVGIGHCHDYKWTQRESKTTSVASPDEASRIIAL
>gid:519940  tlr1115  
MLVPRSESIERWSVAMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRASVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIPIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHENKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGRHPFDQLC
REKRIAHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDT
>gid:519943  tlr1118  
MPPCYPTMTVDQTTGAVTNQTEISWHSINWAKANREVKRLQVRIAKAVKE
GRWGKVKALQWLLTHSFYGKALAVKRVTDNSGSKTPGVDGITWSTQEQKT
QAIKSLRRRGYKPQPLRRVYIPKANGKQRPLGIPTMRDRAMQALYALALE
PVAETTADRNSYGFRRGRSTADAAGQCFITLARADSATYVPDADISGCFD
NISHEWLLANIPLDKEILRKWLKSGFVWKQQLFPTHAGTPQGGVISPILA
NMALDGMEELLNEALQKSQSPSH
>gid:519951  tlr1127  
MPPRYPTKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERK
ERLDYVQTSALLTQWKKQDDLQFLNEVSCVPLQQALRHLQSAFTNFFAGR
AKYPNFKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPLNIPWSRRLPDGV
EPSTVTIRLNPAGQWYINLRFDDPRELTLQPVDPSVGLDVGISSLITLST
GEKIANPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISD
SRKDHLHQLTTRLIRENQTIIIESLAVKNMVKNRQLARSISDAGWGELVR
QLEYKAQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGA
HHDRDINAAGNILAVGHTVTVCGAGVRPDRHTSGGQLRRSRKSQK
>gid:519956  tlr1132  DNA polymerase III gamma and tau subunits
MTYEPLHHKYRPQRFSDLVGQGAIATTLTQALLKERIAPAYLFCGPRGTG
KTSSARILAKSLNCLRSSKPTPDPCGQCEVCRQVANGTSLDVIEIDAASH
TGVDNIRELIEKAQFAPVQCRYKVYVIDECHMLSVSAFNALLKTLEEPPP
QVVFILATTDPQRVLPTIISRCQRFDFRRIPLGEMVAHLQNIADKEQIDI
EPEALTLVAQLSQGGLRDAESLLDQLSLYPERITVEQVWQLTGAVPEQDL
RQLLGAIARRDAVSVLEHTRQLLERGRDPLTILQNLASLYRDLLLAKTVP
QRRDLVALTAETWQALIALAEAWSVEEILHAQEHLRTCEGQIKQSNQPRL
WLEISILGLMTERVAQPPPPAVMPAARPQPTTMTPPRTEAEVAVAAKPAE
PLPQDSALSVNHPHIQATAAAAPEPALFPEPVANPEQVWQQALQVLQQMQ
MSTYALLSQHAHLRHLGDREVRIGIVNQTLLNLTKNNHAKIEAAFSQVLG
RPTKVALEVAPLPAKTTPTPKAKEPPQSSVVIPLPPREAPPTPPGEPVPV
PSPPAPTTAPDPVRESAEKLARFFNGVVIRLPDEGESPDMPMAEAPEEEE
LEF
>gid:519966  tlr1142  
MKARYRYRFYPTDQQRQRLAQLFGCVRVVWNDALAICKQSDSLPKTSELQ
KRVITQAKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFNSIKGRRKGKK
INSPRFKKKTERQSARFTTYGFSIKGEEVYLAKIGKIRPIWSRKLPSKPS
SVTVIKDAANRYFLSFVVEIDPVHQPASHPSIGIDLGIKAFATLSTGEKI
DGPEYSKLDKKIRRKQRKLARQVKGSKRREKTRLQIAKLHSRISDIRRDF
LHKVSTRVVIENQVIALEDLNVSGMVKNRKLARAISLQGWREFRALVEAK
CNKLNREFIVIDRWEPTSQTCSKCGFKWGKIDLSVRSVVCINCGAEHDRD
ENAARNIEKVGIGHCHDYKWTQRESKTTSVASPDEASRIIAL
>gid:519985  tlr1161  reverse transcriptase
MPPCYPTMTVDQTTGAVTNQTETSWHSINWTKANREVKRLQVRIAKAVKE
GRWGKVKALQWLLTHSFYGKALAVKRVTDNSGSRTPGVDGITWSTQEQKT
QAIKSLRRRGYKPQPLRRVYIPKANGKQRPLGIPTMKDRAMQALYALALE
PVAETTADRNSYGFRRGRCTADAAGQCFLALARAKSAEHVLDADISGCFD
NISHEWLLANTPLDKGILRKWLKSGFVWKQQLFPTHAGTPQGGVISPVLA
NITLDGMEELLAKHLRGQKVNLIRYADDFVVTGKDEETLEKARNLIQEFL
KERGLTLSPEKTKIVHIEEGFDFLGWNIRKYNGVLLIKPAKKNVKAFLKK
IRDTLRELRTATQEIVIDTLNPIIRGWANYHKGQVSKETFNRVDFATWHK
LWRWARRRHPNKPAQWVKDKYFIKNGSRDWVFGMVMKDKNGELRTKRLIK
TSDTRIQRHVKIKADANPFLPEWAEYFEKRKKLKKAPAQYRRIRRELWKK
QGGICPVCGGEIEQDMLTDIHHILPKHKGGSDDLDNLVLIHANCHKQVHS
RDGQHSRSLLKEGL
>gid:520000  tlr1176  
MKAIYRYRFYPTDQQRQRLAQLFGCVRVVWNDALEICKKSDKLPKTSELQ
KLVITQAKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFDSIKGRRKGKK
VNPPRFKKKTERQSARFTTCGFSIKGEEVYLAKIGNIRPIWSRKLPSKPS
SVTVIKDAANRYFLSFVVEIDPVDQPATNPSIGIDLGIKAFATLSTGEKI
DGPDYSKLDKKIRRKQRKLARQVKGSKRREKTRLQIAKLHSRISDIRRDF
LHKLSTRVVIESQVITLEDLNVSGMVKNRKLARAISLQGWREFRVLVEAK
CKKLNREFIVIDRWEPTSRTCSTCGFKWGKVDLSVRSVVCINCGAEHDRD
ENAARNIEKVGIGHCHDYKWTQRESKTPSVASPSEASRIIAL
>gid:520065  tlr1240  
MIEAEVHRQLRAFLRSSGDRRWPHHLTLARLVARALRLRRGCLLQVSQRA
VLQHRYGLSYLLPLLLYPEPALLVVPQERLTRLLHQEIPELLSFLAVTKP
IQHSHCPQPVFEGVLVMNLEDWCRQATPYPNVVTIIDGIEALPAIAQQQM
TCTITTSDWEHLKLAIPSAIGAIRQVYAQLVQHLFQRPQNPYGDYLLTPT
EQQPLLELLHQWPQSLPPQWSQLRHYLNRDDTVIWGRRHPTAGYFTLHSH
PLNLRPYFQDLWCQAPFVLIGSGPDTDPPLAYVQQELGIPAQTTIKFASD
RHSEAITLAIAEDLPLPNTPEFAPAVLRRLYDLLGTIGHQRAVILVSDVP
LREQLATQLAALYGSRVQVETTTLETNTILVTGETFWLRHGAQLPCPALL
VLTTLPFPSPEKAVVSARIEWHKRQKQDWFRQYLLPECLTVLDRALAPVR
QDDTLVAILDRRLTERSYGREILQSLSPYNRVRDRAQSPR
>gid:520085  tlr1260  
MVTLAMAATTGRVVLNHSTHIEGLIPVLEKLAKVAGISTLTPGVIAPVKG
KSPHLHLRVSVPIKGGFKVIARRGKTVQEVFVITQLSEAELKAAIDAVLA
SK
>gid:520092  tlr1267  
MNSESPITEHLPPEVRSWLYAYQQEHQLASPEAAIVDIVCKFYTQPNHLS
ERVANLERRVNALSREVIHLRQQLPENYDRLREQLAAVRLSHSGILHNLR
DRLEALESAVFSGGPSAADAEADS
>gid:520153  tlr1326  serine/threonine protein kinase
MSYCLNPNCPKPDNPDGLDICQACGSKLLLNDQYRAIKVLGRGGFGTTFL
AVDTKLPGNPTCVIKQLRPAATAPHILTMARELFLREATTLGKVGNHPQL
PRLLGYLENENEFYLIQEYVGGLTLQQEVKRFGPKSEEEVKQVLQEVLPI
LDYLHKNGVIHRDIKPANLIRRDIDKKLVLIDFGAVKDKVTQAMVENAPE
LSTFTSFAVGTPMYAPPEQMAMRPIYASDIYALGVTCVYLLTGKSPKEIE
RDPRTGEWHWQRHVKNISPQFAALLNKMLQDAVKDRYQTAMDVLADLQLL
ESNQGAAVAVPTTSTEDDDLSSALSTPPSQGRPRVPGAPKRSSYASSVSQ
NAQAARVRQSRMTGTQLGGPPVVGTGPGQHLRPKVTPRMTAAQLLTAYKR
GERDFIDVDLSNVVLRNADLSGANFANANFTNADLKGCILANAVLREANF
QGANLHDANLCGAYLVQANFERANLKGAKLHDASISGANFTSADISGADF
SMVSGFVQGQLDRAKKNWFTKLPK
>gid:520180  tlr1353  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDSYIEAA
AIRRGLDIRTVDPQAVVAI
>gid:520207  tlr1380  
MKARYRYRFYPTDQQRQRLAQLFGCVRVVWNDALAICKQSDSLPKTSELQ
KLVITQGKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFDSIKGRRKGKK
INPPRFKKKTERQSARFTTYGFSIKGEEVYLAKIGNVRPIWSRKLPSKPS
SVTVIKDAANRYFLSFVVEIDPVHQPASNPSIGIDLGIKAFATLSTGEKI
DGPDYSKLDKKIRRKQRKLARQVKGSQRREKTRLQIAKLHSRISDIRRDF
LHKVSTRVVIENQVITLEDLNVSGMVKNRKLARAISLQGWREFRVLVEAK
CTKLNREFIVIDRWEPTSRTCSTCGFKWGKIDLSVRSVVCLNCGTEHDRD
ENAARNIEKVGIGHCHDYKWTQRGSKTTSVASPSEASRIIAL
>gid:520259  tlr1431  SMF protein
MLLKRLWEHFGDLKTAWQAPIAAIGQVEGIGPKLQGAIAHYRKQCQPLTL
YEHHCQLNPDHWVLSDPRYPPLLREIPDPPPLLYYGGNRILEILRQPTPT
VAIVGTRDPSEYAERWAHKLGYALAQAGLTVVSGMAAGIDGAAHRGCLEA
GGATLAVLGTGVDMIYPQENRPLYYQIHERGGFLSEYPRGVGADRSQFPR
RNRIIAGLCRAVIIAEAPYKSGALITARYAAEYGRDVYVLPGHLDNHRAK
GALSLVNQGAHLILGEEALLEQLLGHTPPLDPTPFSAPSPTLSLPQQQIL
EAIQQLSQGSQSVAFDDIVQRVCLDTGVVMAELIHLELMGYVEQQPGNRY
RKTGC
>gid:520289  tlr1460  serine/threonine protein kinase
MIGKTLGGRYKLIRVLGAGNFGQTFLAEDVHRPIRAKCVVKYLRPARRDA
AFLPLARSLFQREAQILERLGTHDQIPRLLAYFEEDNEFYLVQDFIEGQV
LRQELLLGSGWPEARVIELLHDALGILSFVHQCGVIHRDIKPDNLIRRQF
DQRLVLIDFGAVKEMGVSIGGGTLVGDANPQTIAIGTPGYMAPEQAQGRP
RPASDLYSLGMVAVQALTGLSPLQLSQNPYGCWCWQATEPVSDRLVQFVN
KLIHPSPYERFATAADALAGLQQVEVPKSFWFRLGKFLRTPLQDLWRSPK
TGEVLPPPLPPVSTLSPQASEPKPVSSPPAATTALPNQGRRIFISHSSHG
TDLQLASALQEALQKAGHQPFMASQSIRLGEAWAQRIDQELKQCDFFVLL
LSQTAAQSEMVLEEVRTVKQLQAQRGDRRPLILPVRVNFPLDMPLNYELR
GYLHRLQQRFWRSPADTDTLLQEILDLVNATTAPVATRSDAPQETIAEPD
QRNTFVADGPPVPVAEPELPEGQVEVASAFYIERPPIEQRCRETLLQPGS
LIRIKAPRQMGKTSLMARLLYRATQEGYGTVPLSFQLADAQVFSDLEKLL
RWLCASVGRRLGLENRLNEYWDDIFGSKYNCTAYFEEYLLPNCQKLGTQF
ESRPLVLGLDEVDRVFEYPAVASDFFGLLRAWHEEGKNRDIWRNLRLIVV
HGTEVYIPLDINQSPFNVGLAIDLPEFTQEQIAELCRLHGLSLSEGQLQD
LQHLVGGHPYLIRLSLYHLARQDLTWQELMANAASETGLYRDHLRRQWWH
LQQQEDLVPAFRKILQADNGSVVDSTIGFKLHSLGLITLEGNLAKVRCDL
YRRYFGDRLH
>gid:520318  tlr1489  
MEKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTQWKKQDDLQFLNEVSSVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPLNIRWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRDLKLQPVNQAVGLDVGISSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWHKARLKVAKIHQKISDARKDH
LHQLTTRLIRENQTIIIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAARNILAVGHTVTVCGAGVRPDRHTSGGQLRRSRKSQK
>gid:520370  tlr1540  serine/threonine protein kinase
MLPSDVGRMLRVVRLNLPIARSVGQDRTETAFSIFQRQRKSPNCAMSPIS
IPDPLKRALPVVMAAAIATVGVLALRVGGFLQPLELRVYDQWLRWRSTPA
TSQRLLIVEITEADIQTLKQYPVPDEVLIQAVNELQEYQPRVIGIDIFRD
FPVPDRFKLPTNGLPSLGRVMMTQPNTVIVCKAGSEGDPGIAPPAGLLND
QVGFADIPIDDDSVVRRAILATQPEANDRCSTPQSFALALARLFLGVNPQ
AVTENRLELGTARFQSLTRNWGGYNNLDAAGFQILINYARPTQPYETVTL
SEVLRGEVLPSKVRDRAVLIGLTGSSSNDKFLIPITLPEYTNRLTPGVVV
QAAILEDLLAAALDHRSPMGTWPQEAIALWILAWSGLGALIIAKGHRWVM
VPLWVAGGLGLSGLTFVLFLQGTWIPLVAPLISFGSGAVLMLGYRALSSA
ESMSTPSTPTAVPNGTSATPLEFITEGISQSTTAESAPILEISEVLPTEV
SRLKTDTNATVLQPETAIPFTPIEPTPPPGDKSSTAPTLLTAIVDQISPT
EIPETPPQPINERETFLLVHPDDAATSATELPKTPEPTPVDLPETQVGVP
APPPTVPSGIPSPLPPDLPETQLPVPEALRTMPQTPSVPSLDLPETQLTL
PEPPTTPELPSTPPTQFTTSADLPETQLSVPESVRPGEVISNPVTHGHIP
ETLPEIPATPPQFATRADLPESQLSTPEAPKGGQPERQPSHPDSLTSMPA
DATPIPPRTVPSPPSPTTETATPMGTEPKTTAISDITAEPSEPLPQTVGG
RYRILSQLGQGGFGRTFLAADLHLPDHPICVVKQLVPSRKDERFLAIARR
LFQREAETLAQLGQHQRIPRLLAYFEEGGYFYLTQEYVDGESLKEEFEKK
ITLSQGEAIAILKSILEILQYVHQFGVVHRDIKPANIIRRRSDQQLFLID
FGAVRHVQPEDLLRHGKYTISIGTRGYAPSEQMAGRPVIASDIYSLGMVI
VEGLTGLAPMDLPSDPDSGDLIWQPGRHLSPQFVAIINKMIKYNFRDRYQ
SAREVLTDLAKAGL
>gid:520404  tlr1573  
MRENFCIMPVYFYWGEDQFQLEQAVKNLRQQVVDPLWESFNFEKFAGEDA
VAVQAALAQAMTPPLGGGDRLVWLMNTTLGQRCSAELLADLELTLPQIPP
NCHLLLTSTQKPDSRSKAVKLLQQHGNIQEFNPIAPWKTAEIEQQIREAA
QRYQLHLPPEGLQLLAEAVGNDSRQLRNELEKLALFAGDRPLTAEDITTL
VHATQQNSLTLASTLLRGDTAAALTQLEDLLLQNEPPLRLLATLTKQFRT
WLWVKLLSHERDNQRIAKLAEVGNPKRVYFLQKEVNAVALPSLQACLQIL
LATEYQLKVGAEPVATLRQGMIQLSLTCSRRSWGDRSGGGDN
>gid:520409  tlr1578  
MATATILKPRPFLKWAGGKSQLLSQMAPYLPRQCRCYAEPFCGSAALYWY
LFGQAQQGQFQFQQAWLSDRNPELINCYQIVRDRVEDLIQQLTEYRQQHS
EAFYYHIRSWDQRQLDPLTRAARLIYLNKTCFNGLYRVNRAGQFNVPMGR
YRNPQIFDPEALRQASIALQDVILSVADFQEVLTWATAGDFIYFDPPYYP
LSKTASFTSYTDQPFGEAEQIALANVVAELAQRGCYVMLSNAWVEPMLQL
YRSWRCIELKASRVINSNRHKRGKVSELLVVTYRC
>gid:520411  tlr1580  
MLVPRSESVQRWSITMPRIHANARTTPRIRREIQQAPASISHRELARRYG
IHRHTVAKWRKRATVEDKSTRPHRLQTTLTEAQELVIVEVRKLLLLPLDD
LLVLARTFLNPNLSRSALDRCLRRHGVSNLRILQKERESLEGSPKSTTRQ
FKAYAPGFIHIDVKYLPQMPDEEQRRYLFVAIDRATRWVYLAIHEEKSAE
SATRFLANVLANAPFVVRTVLTDNGKEFTDRFSSAGERQPTGHHPFDQLC
REKRITHRLIQPRHPQTNGMVERFNGRIAEILRAERFVSAADLQETLTRY
LWAYNHRIPQRVLGHMTPIEKLRWWQTERPDLFVSRVDNVTGLDS
>gid:520444  tlr1613  
MAIFHGTWLPEPAPQFFIWAEEWRSLAQAITPWAPPAIPVYPYATQRKTP
LRKTARPSATYVALPAQIQGHQLLPPPLAEVQGELLFLWQVPGWSIPASE
VLEQLHQLSLHGQDSGSIGDDLRYWLHVSRWLLDLIVRGQYLPTPEGWRI
LLTHGGDRDRLRHFSQLMPDLCRCYQADGTALQLPPHAADLLADFLQHTL
QGYLHTALADLELPKVGLAKEHGHWLAFLKTGQTPELPPPLIERLHRWQE
PYREQLHLRPQWRLALQLVPPDTADGDWHLAFGLQTEGETDTMLRAAEIW
QCTQEALLYQGQVLWQPQETLLRGLGLASRIYRPLDRSLQERSPVALTLH
TTEVYAFLQSAIAPLEQQGVAIILPPSLRRNSAQHRLGLKIIATLPPPAT
NGLTIDSLMQFQWQLQLGQHPLSEADFDQLRRQGTPLVYLNGEWVLLRPQ
EVKAAQEFLQSPPKTQLSLAETLRIATGDTVTVAKLPILGLDTNDALQTL
LDGLTGKQSLDPVPTPQEFCGELRPYQARGVAWLSFLERWRLGACLADDM
GLGKTIQLLAFLLHLKETGRAYRPTLLICPTSVLGNWLRECQKFAPTLRA
YVHHGSDRPKGKAFLKKVETHDLILTSYALLQRDRTTLQQVLWQHLVLDE
AQNIKNANTQQSQAARELSAQFRIALTGTPLENRLLELWSIMDFLHPGYL
GHRTYFQHRYVRPIERYGDTTSLNALRTYVQPFILRRLKTDRSIIQDLPE
KQEMLVYCGLTLEQMQLYTAVVEDSLAAIENSQGIQRRGNILATLTKLKQ
ICNHPAQYLKQEDYAPDRSGKLQRLIEMLQALQEVGDRALVFTQFAEFGT
HLKTYLEKALQQEVFFLSGRTPKAQRELMVERFQHDPEAPRVFILSLKAG
GVGLNLTRANHVFHYDRWWNPAVENQASDRVFRIGQARNVQIHKFICTGT
LEEKIHEQIEQKKALAEMIVGSGEHWLTELNLDQLRQLLTLDKERLITL
>gid:520471  tlr1640  
MGDLSVPRPFLKWAGGKTQLADALLEHKPVYFNTYHEPFVGSGAIFFRLY
RENQVRRAILSDINAELIDTYLAIRDRVAEVIVLLSEFPHSEDFYYEIRA
KDPWKLSLSERAARMIYLNKTGYNGLYRVNRQGKFNVPFGRYKAPKYLDK
DNLLAVSHALRNVEILCAPFDTVTERAKPGDWVYFDPPYVPISQTSNFTS
YYADGFGLQDQERLRDICITLSQNNVYITVSNSDTAIVRSLYKFPHFAID
EVLANRTINCNGARRGKITELVITNYPVNQAVQLHLTQSRLLSGAVKEIL
KNGG
>gid:520619  tlr1785  exodeoxyribonuclease III
MPMLPPIRARANHLDCNQPMPKIATWNVNSIRTRLDHVCQWLDSTGVDYL
CLQETKVTDAEFPRQPFLDRGYHVYCSGQKAYNGVAILSRQPLAGVEAGF
APQLPSHSELDTQKRLIRAQFAPDVILVNVYIPNGGEYDSEKYHYKLHWL
KTLYVYLEQLTTQGEVILCGDFNIAPEDKDLFDASDRATKVGATDAERNL
LAAIRDLGFHDAFRQFTEAPGHYSWWDYRSGAFRRNHGWRIDHLYVTPGV
KARACNCHIDIAPRRLPKPSDHAPVILEIE
>gid:520691  tlr1856  
MMVAIASLNLALVLFDLTYIPLRDFWLLGKVGIPLIQRVIYLPQPLPITR
LYDPIKGIQPHRVTQQYITTVNQLRHTITTEGVDAPATQALLTQLRQQSK
TMVDTNPFALANKSGTLEQIKNRMRQRVFGSRDASSKEAFERFWSPEYIN
ERDWRSNLNWFEKHIIPLIATNYWRAYGEDGNFLNRFGVLDLPFNLLFFV
EFLGRTFWLSRQYGGCGWRDAMLWRWYDGLLFFPFWWLAPTWAWLRVIPV
MIRLDQAGLICLDTLERQLSQGLVGAIATDISEVVVLQVLTQLQKAIRRG
QFSHWLPRASSLQVNSVNSRVNDIDELGELAALVVQVILYRVVPQLRPEV
EALVAYSVDQVLRQAPPYRRLLESPTLAALPLQWRQSLAKEISDRLFLLL
EQASKNQGNPPREGTLLVSQLAQKFGQTLSAELQEEQVGKVIQDLLVVWL
EELKINFVRQTHLEGIETVLEETRTLQERLAQNRLKP
>gid:520713  tlr1878  DNA-methyltransferase
MMASGDICEVVRQVPDEAMTLIFTSPPYNLGKAYETPVAIEDYLQSQSAV
IAELYRVLRPEGSLCWQVGNFVQRGEVYPLDILFYPLFKRLGLKLRNRII
WKFGHGLHATKRFSGRYETILWFTKSDEYIFNLDAVRIPAKYPGKRHFKG
PNKGKPSGNPLGKNPSDVWEVVVQDWQELVWDIPNVKSNHPEKTLHPCQF
PIELVERCVLALSHEGDWVFDPYMGVGSSLLAALMHNRRAMGCEKEPAYV
NIARQRIQAYENGTLPYRPLGRPVYSPTGQEKIVQIPEEWQQ
>gid:520714  tlr1879  
MATVNWRSLDELVRIQSKWVTLIAEKWLTDTGETLEYWRVEKADSVIVLP
LQGDDLICLPPTFRVGVQRATVDFPGGRVLPNQTPRAMVPQILRRELGIT
SDAIQAITPICEQPWLVNSAFSNQKVWGFVAELALGARVPQAGGKFRLGS
PDISRLLGQLECLQCRAVLLEWLYQCRPV
>gid:520767  tlr1931  
MRTPDYRRHDISDRVWERLEPHLPGRRGSWGGVAKDNRQFINAVFWILRT
GAPWRDLPPEYGDWKNVHRRFCRWRDKGVWEKVLEQLIDEPDYEWLIIDA
THVKVHPHATGAKGGNQDMGRTKGGSIPRYIWPWMRMVCRCEWLSQQVPL
RIVAKLQP
>gid:520814  tlr1977  
MAMKILHVSDIHLGSGLSHGHINPATGLNTRLEDFITALATCIDRALREP
VDLVLFGGDAFPDATPPPLVHEAFASQFRRLADARIPTVLLVGNHDQHAQ
GQGGASLSLYRTLGVPGFIVGDRLATHRIETRQGSVQVITLPWLTRSTLL
TRPETSGLSLAEVHQLLLERLRLALEGEIRQLDPALPTVLLAHAMVDTAQ
YGSERYLSAGKGFTIPLSFLARPCFDYVALGHVHRHQVLCHDPPVVYPGS
IERVDFGEEGEEKGYVLVNLVKGKTAFQFCPLPTRPFRTIRVDLTAVELD
PQAALLAAIASVDITEAVVRVIYQLRPDQIPLINLHELQKALESAHSVSL
LPQLANSEPIARLPEVALEQCLDPSHALQLYLDHRPDLEPLRQDLLAALQ
TLEGNPEQESQDTPQRPRSPEPVIQQLKLLS
>gid:520870  tlr2032  
MKARYRYRFYPTDQQRQRLAQLFGCVRVVWNDALEICKKSDKLPKTSELQ
KLVITQAKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFDSIKGRRKGKK
VNPPRFKKKTERQSARFTTCGFSIKGEEVYLAKIGNIRPIWSRKLPSKPS
SVTVIKDAANRYFLSFVVEIDPVDQPATNPSIGIDLGIKAFATLSTGEKI
DGPDYSKLDKKIRRKQRKLARQVKGSKRREKTRLQIAKLHSRISDIRRDF
LHKLSTRVVIESQVITLEDLNVSGMVKNRKLARAISLQGWREFRVLVEAK
CKKLNREFIVIDRWEPTSQTCSKCGFKWGKIDLSVRSVVCINCGAEHDRD
ENAARNIEKVGIGHCHDYKWTQRGSKTTSVASPSEASRIIAL
>gid:520871  tlr2033  excinuclease ABC subunit B
MTSTPFRIHAPFEPTGDQPQAIQKLVAGVQAGHRFQTLLGATGTGKTHTI
ARVIEALGRPTLVLAHNKTLAAQLCNELRSFFPENAVEYFISYYDYYQPE
AYIPVTDTYIEKSASINEEIDMLRHSATRSLFERRDVIVVASISCIYGLG
IPAEYLKAAIPLEVGRETELRQLLRQLATIQYTRNDVELGRGRFRVRGDV
LEIGPAYEDRIIRVEFFGDEIEAIRYVDPLTGETLQSVERLNIYPAKHFV
TPAERLEAACVAIEAELQAQVANLEAQNKLLEAQRLSQRTRYDLEMLRQV
GYCNGVENYSRHLAGRAAGEPPECLIDYFPENWLLVVDESHVTVPQIRGM
YNGDQARKKVLIDHGFRLPSAADNRPLKPEEFWQKVQQCIFVSATPGDWE
LAVSTQVVEQIIRPTGVVDPEIFVRPTQGQVDDLYGEIRLRCDRQERVLV
TTLTKRMAEDLTEYFQERGVRVRYLHSEINAIERIEILEALRQGDFDVLI
GVNLLREGLDLPEVSLVAILDADKEGFLRAERSLIQTIGRAARHVRGQAI
LYADTLTESMQKAIQETERRRAIQLAYNQAHGIIPQPIVKKTSNAILAFL
DVSRRLNAESVPVLSSQTLQDLSLEDIPPLIQDLEAKMKAAAQELAFEEA
ARYRDQIKRLRDRLVGHP
>gid:520903  tlr2065  
MRFIYSFSGVNCPMFYRLAEQHRQFIRDLVLNLQALAIALENRGYMASCY
TCGSELNSASFMVSLADNHLIRFLVSDYGITWTEMRDDRELMKLEGAEAI
NQLQELANLLKEVRVPTAV
>gid:521020  tlr2179  
MQKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTKWKKQDDLQFLNEVSSVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPLNIRWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRDLTLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIVIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAAGNILAVGHTVTVCGAGVRPDRHTSRGQLRRSRKSQK
>gid:521023  tlr2182  
MQKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTQWKKQDDLQFLNEVSCVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPPNILWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRDLTLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKVQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIVIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAAGNILAVGHTVTVCGAGVRPDRHTSGGQLRRSRKSQK
>gid:521040  tlr2199  
MQKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTQWKKQDDLQFLNEVSCVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPPNILWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRDLTLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIIIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAAGNILAVGHTVTVCGAGVRPDRHTSRGQLRRSRKSQK
>gid:521075  tlr2234  
MGRTYRTIGINLKAMPLGESDRLLSVFSRDRGLLKLVAPHSRGSRSRLGG
RVDLFVVNDLFISPGRNLDRILQAETVATYQGLHHQLTTLTAAQYLGEVV
LYQIHPQQPQPELFDWFCATLDQLQGASSRAALAILVRGLCGILRLGGIA
PEWYQCHESGCKIAVPTADTDWRLGFSFAGGGVFRIRADHPVGNESVAGA
GGDRQLTASEVRLGQWLAMPTTQFLARDEFLTQAEAYPLSVWLSLERVLR
QYLQFHLEQPLRVPPLLDSCFSPVAVSQP
>gid:521079  tlr2238  
MKARYRYRFYPTDQQRQRLAQLFGCVRVVWNDALAICKQSDSLPKTSELQ
KLVITQGKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFDSIKGRRKGKK
INPPRFKKKTERQSARFTTYGFSIKGEEVYLAKIGNVRPIWSRKLPSKPS
SVTVIKDAANRYFLSFVVEIDPVHQPASNPSIGIDLGIKAFATLSTGEKI
DGPDYSKLDKKIRRKQRKLARQVKGSQRREKTRLQIAKLHSRISDIRRDF
LHKVSTRVVIENQVITLEDLNVSGMVKNRKLARAISLQGWREFRVLVEAK
CTKLNREFIVIDRWEPTSRTCSTCGFKWGKVDLSVRSVVCINCGAEHDRD
ENAARNIEKVGIGHCHDYKWTQRGSKTTSVASPDEASRIIAL
>gid:521081  tlr2240  DNA mismatch repair protein
MSNDRPLTHSEAESSALRLGRNPGLQVRHDEVERSLLTPMLQHYAELKDA
YPHALLLYRVGDFYETFFQDACTVARELELVLTGKEGGKEVGRVAMAGIP
HHALERYCRTLIEKGYAIAICDQVEDPAQAQGLVKREVTQVFTPGTVLDT
ELLQPRRNNFLAAVVLSGNHWGLAYADVSTGEFCTTQGSDRADLVAELNR
LQPAEILLPTEAPDINRVLRPGEGKDQLPPELPPQWCYTLRSPEDFQAAA
ARQRLCQRFQVKSLEGFGCEHLPLALRAAGGLVAYLDETHRQQPVPLQNL
STYSLQQYLFLDPQTRRNLELTQTVRDGSFQGSLLWAIDRTATAMGGRLL
RRWLLQPLLDIEEITARQDAIAELMANSSLRQSLHRHLQEIYDLERLAGR
AGSGTANARDLAALRDSFRTLVSLAAVVANTSSPYLQALAQLPPVIEQLA
DTLSAALVDQPPTSLSEGGILRPGAYPELDQQRQQIEQDQQWILNLEAQE
RQRTGISTLKVGYTKVFGYYLSVSRAKLNQVPDDYIRKQTLTNEERFITA
ELKEREARLLAAQSHLFELEYQYFVQLREQVAAQASTIREIAAAVAAVDA
LLGLAEVALYQGYCRPQLTRDRQLCIRGGRHPVVEQTLPAGFFVPNDTQL
GTGADLMVLTGPNASGKSCYLRQVGLIQLLAQMGSYVPATSATLGICDRI
FTRVGAVDDLATGQSTFMVEMNETANILNHAGDRSLVLLDEIGRGTATFD
GLAIAWSVAEYLATILKSRTIFATHYHELNQLATLLPNVANYQVVVKELP
NEIIFLHQVKPGGADRSYGIEAGRLAGLPAVVIQRAREVMCQIEKHSRIT
VGLRKSSMGDPPTAPEINQGELPF
>gid:521152  tlr2304  serine/threonine protein kinase
MDLAGQGQYGQVYLAVNRESGDLVAIKVLSERQLLTRGFLRELNFLLTLQ
HPHVVGCQAIDYIRVHHSPQVSRSLVMDYCAGGTLRSLLEQEQALPLTTA
LRLTLDILSALAYAHHRGILHCDLKPENILLEVTATGWQAKVSDFGVARL
IEDVKGSGQTGSPAYMAPERFYGQTMPASDLYAVGILLYEMIVGDRPFHG
TPAELMAAHLSRPYTLPEGLPFLVRSIIAKALDKLPQRRYKSAAEMTMAV
QLALEVIEAEHHQQPLLFSRSPAAVWLGEPQGFALPRLPQQVASTATGFY
GVVGDRLWAWQARETIPEATAQWPLPPLPIQLYGSEVSAWLRINALPPQL
FCIHDQLIPLDGHLPSTLNPLTIDARGYWWAETQIDSAAAELQLHVQLIN
GQGRRTLRCQWHGRTFVDTLLLNRRYGAVISRSQDPETAHPITHFQLFDR
RGHWRLYTPLPAHLTLLTPAHHQPWQVAAFEDHPQQPLLVLLHLRPWRIQ
RLGLRFRPQFLCATPWGYVVAGRHSLNLVTHSGEIVGTAESEQEIDGLGF
TGDRHLWLIRRQGSGCVIQTWDITTWDIHLIL
>gid:521171  tlr2322  
MKARYRYRFYLTDQQRQRLAQLFGCVRVVWNDALAICKQSDSLPKTSELQ
KLVITQAKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFDSIKGRRKGKK
INPPRFKKKTERQSARFTTYGFSIKGEEVYLAKIGNIRPIWSRQLPSEPS
SVTVIKDAANRYFLSFVVEINPVDQPASNPSIGIDLGIKAFATLSTGEKI
DGPDYSKLDKKIRRKQRKVARQVKGSKRREKTRLQIAKLHSRISDIRRDF
LHKLSTRVVIESQVITLEDLNVSGMVKNRKLARAISLQGWREFRVLVEAK
CKKLNREFIVIDRWEPTSQTCSKCGFKWGKIDLSVRSVVCINCGAEHDRD
ENAARNIEKVGIGHCHDYKWTQRGSKTTSVASPDEASRIIAL
>gid:521226  tlr2376  
MQKAFSYRFYPTTEQESLLRRTLGCVRLVYNRALAARTEAWYERKERLDY
VQTSALLTQWKKQDDLQFLNEVSCVPLQQALRHLQSAFTNFFAGRAKYPN
FKKKRNGGSAEFTKSAFRWKDGKVFLAKCNEPPNILWSRRLPDGVEPSTV
TIRLNPAGQWYISLRFDDPRDLTLQPVDPSVGLDVGMSSLITLSTGEKIA
NPKHFNRYYKRLRKAQRSLSRKQKGSRNWDKARLKVAKIHQKISDSRKDH
LHQLTTRLIRENQTIIIESLAVKNMVKNRQLARSISDAGWGELVRQLEYK
AQWYGRTLVKIDRWFPSSKRCGQCGHIVERLPLSVREWDCPKCGAHHDRD
INAAGNILAVGHTVTVCGAGVRPDRHTSGGQLRRNRKSQK
>gid:521249  tlr2399  
MKARYRYRFYPTDQQREHLAQLFGCVRVVWNDALEICKKSDKLPKTSELQ
KLVITQGKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFDSIKGRRKGKK
INPPRFKKKTERQSARFTTYGFSIKGEEVYLAKIGNVRPIWSRKLPSEPS
SVTVIKDAANRYFLSFVVEIDPVHQPATNPSIGIDLGIKAFATLSTGEKI
NGPDYSKLDKKIRRKQRKLARQVKGSKRREKTRLQIAKLHGRISDIRRDF
LHKLSTRVVIENQVIALEDLNVSGMVKNRKLARAISLQGWREFRVLVEAK
CNKLNREFIVIDRWEPTSQTCSKCGFKWGKIDLSVRSVVCINCGAEHDRD
ENAARNIEKVGIGHCHDYKWTQRGSKTTSVASPSEASRIIAL
>gid:521283  tlr2432  serine/threonine protein kinase
MLVYCTRPHCPRPQNNLPELDQPNQRYSRIRDRFCITCGMPLILRGHYVA
ERVLGRGGFGAAYLARDLDTPGWRYCVIKQFLPNVSDPQSLQKAQELFER
EAKVLEELGQHAQIPDLLAFFREEVAGFNSSSEESYFYLVQEFIDGETLE
DELAQQGCFSEEEVRQVLRELLPVLQYVHERGSIHRDIKLSNIMRQHPSK
TKFPGQGRLYLLDFGAVKQVSQTSMESRSTGIYTAHYAPPEQIRGEQVFP
SSDLYALAVTCIVLLTGKDPEKLFDAYNNRWNWHSYVPSVSQQLQQILDR
MLQPAPSDRYQSAAQVLADLNASPTPAPAPPPPPPPLTPPLPPQPVTTPL
PPSQVSSPPSPAAPIPSPKMAPPAKAKPPRQPPAPLPAFKILMGAGFTGF
EMTALGVMIFSLMTTWQFPLGVSAGLIGGVFALLVFMQFKGWIEHWEQLI
IATISGAALFFFPFLQAGLGGLTALLMCGLVGLGCMVVGNLFLLVYNILA
RFL
>gid:520716  topA  DNA topoisomerase I
MSTLVIVESPAKARTIRKFLPPDYRVEASMGHVRDLPRSAADVPPEFRGE
EWATLGVNVAAGFEPLYIVPKEKQKVIKELKAALKTADELLLATDEDREG
ESISWHLLQLLEPKVPVRRMVFHEITEEAIQEALHNCRDVNQQLVRAQET
RRILDRLVGYTLSPLLWRKIAPHLSAGRVQSVAVRLLVQRERERLAFRKG
QFWDLKATLDQRGTLFPARLVSVGGQRLATGNDFDPTTGQLRNPDAVLLL
DEAAANALRDRLLTETWTVTEQEERQQTRKPAPPFTTSTLQQEANRKLHL
SAQETMRIAQKLYEEGYITYMRTDSVHLSDQAIAAARSCVEAMYGKAFLS
PQPRQYTTKTKGAQEAHEAIRPAGSQFRTPQETGLRDRELELYELIWKRT
VASQMADARVTLLTVSITAGDALFRAHGKRIDFPGFFRAYVEGSDDPDAA
LESQEVMLPVMQVGDILRCQALESVRHETQPPPRYTEASLVKALEQAGIG
RPSTYATIISTIQDREYAIRRGNALEPTFTAFAVTALLEKYFPDLVDINF
TARMEQTLDDISTGEVQWQPYLESFYLGENGLEQQVKERERTIEATEARA
IALPELNAEVVVGRFGPFVVYQNGNGSEPIKASLPQDATPGSLTREQVEQ
LIRQKLEGPDKLGVHPETGEPIFLLTGRFGPYVQLGEATEANPKPKRASL
PKGVSPDEVTLDLAITLLSLPRTLGVHPETGKLIQANQGRFGPYIVHDPE
GEKDYRSLKGEDDVYTITLERALELLATPKSSRARAKKQVLAVVGTHPED
GKLVQIFDGPYGPYVNHGKVNASLPEGVTPETMTLEQALLLLAEKAPKKT
RRSATATAPKPKSTIKRTSKKTATQTANTSRRKSTET
>gid:519048  tsl0239  
MQPVIPSRRHRRQRRECDKQLYRLRHLVENAFLHLKRWRGIATRYAKNTA
SFLAAVHIRCIAIWASIS
>gid:519204  tsl0388  
MKARYRYRFYPTDQQRQRLAQLFGCVRVVWNDALEICKKSDKLPKTSELQ
KLVITQGKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFD
>gid:519258  tsl0441  
MQPVIPSRRHRRQRRECDKQLYRLRHLVENAFLHLKRWRGIATRYAKNTA
SFLAAVHIRCIAIWASIS
>gid:519286  tsl0469  
MKARYRYRFYPTDQQRQRLAQLFGCVRVVWNDALEICKKSDKLPKTSELQ
KLVITQGKKTPERQWLSDVSNVPLQQSVADLGVAYKNFFD
>gid:519362  tsl0544  
MQPVIPSRRHRRQRRECDKQLYRLRHLVENAFLHLKRWRGIATRYAKNTA
SFLAAVHIRCIAIWASIS
>gid:520069  tsl1244  
MQPVIPSRRHRRQRRECDKQLYRLRHLVENAFLHLKRWRGIATRYAKNTA
SFLAAVHIRCIAIWASIS
>gid:519573  tsr0753  
MQPVIPSRRHRRQRRECDKQLYRLRHLVENAFLHLKRWRGIATRYAKNTA
SFLAAVHIRCIAIWASIS
>gid:519595  tsr0775  
MQPVIPSRRHRRQRRECDKQLYRLRHLVENAFLHLKRWRGIATRYAKNTA
SFLAAVHIRCIAIWASIS
>gid:519821  tsr1000  
MQPVIPSRRHRRQRRECDKQLYRLRHLVENAFLHLKRWRGIATRYAKNTA
SFLAAVHIRCIAIWASIS
>gid:520175  tsr1348  
MAERCNGRIAKILRAERFVSAADLQETLTRYLWACNHRIPQRALGHMTPI
ERLRTWQMEGPELFSSQVDNVAGLDS
>gid:520768  tsr1932  
MQPVIPSRRHRRQRRECDKQLYRLRHLVENAFLHLKRWRGIATRYAKNTA
SFLAAVHIRCIAIWASIS
>gid:521130  tsr2282  
MDIEDLQVRNMPRSATDTKNAPGRNIHAKFGLNQSIFDQGWFEFRHQLDD
KLAWQRG
>gid:518991  uvrA  excinuclease ABC subunit A
MPGGEIRIRGARQHNLKNIDLDLPRDRLIVMTGVSGSGKSSLAFDTIFAE
GQRRYVESLSAYARQFLGQLDKPDVDAIEGLSPAISIDQKSTSHNPRSTV
GTVTEIYDYLRLLYGRAGEPHCPHCDRSIRPQTIDEMVDQVMQLPLQSRF
QVLAPIVKGKKGTHKKLLSSLASEGFVRVRIDGEVRDLSEAIELDKNHAH
RIEIVVDRLVLKPGIEERLGDSLRTALHHGNGTAMVSVVSRETGAEGQTL
LFSENFACPEHGAVMDELSPRLFSFNSPYGACPECHGLGYLRKFSPELIV
PNPELPVYLAIAPWAEKEHDYYLALLWGVAEAFGFDIHTPWYRLTELQRE
ILLYGTDTPILIPSDSRYRKRDYYRQFQGVIPILERQYRETTSDAYRQKL
EEYQVNQTCPACQGQRLKPAALAVRLGQYRLTDLTRVSIRECLARLQSLQ
LTPRQQQIAALALREVTGRLQFLIDVGLDYLTLDRSAATLSGGEAQRIRL
ATQIGSGLTGVLYVLDEPSIGLHQRDNDRLLQTLFRLRDLGNTLIVVEHD
EDTIRAADYIVDIGPGAGIHGGQIVAQGSLEAILNHPDSLTGAYLSGRKR
IETPSDRRPGNGKSLILKKVSRNNLKDITVEIPLGKLVCLTGVSGSGKST
LMHEVLYPALQHHLGYNVPLPKELGHIEGLNAIDKVIVIDQSPIGRTPRS
NPATYIGVFDVIREVFSQTVEAKARGYKPGQFSFNIKGGRCEACGGQGVN
VIEMNFLPDVYVQCEVCKGTRYNRDTLQVKYKGCSIADVLDMTAETALTF
FENIPKAVSKLQTLVDVGLGYLKLGQSAPTLSGGEAQRLKLAAELSRRAT
GKTLYLIDEPTTGLSFYDVHKLLEVLQRLVDKGNSILVIEHNLDVIRCAD
WIIDLGPEGGDRGGEVVAVGTPEAVALMPQSYTGQYLARVFNRAAANSQ
>gid:520586  uvrC  excinuclease ABC subunit C
MLQPLIQDRDRLEQVLRQLPLAPGVYFLKDKTDQILYIGKSKRLRSRVRS
YFREPAQLGPRLERMVYQVAEIEFIVTDTEAEALALEANLIKQHQPHFNV
LLKDDKKYPYVCITWSEPYPRIFITRKRQFGNGGDPRSDSAKDRYYGPYV
DSFRLRQTLALVKRLFPLRQRPRPLFRDRPCLNYDIGRCPGVCQGLISPQ
EYRQTLQRVAMIFQGRTGELVAQLQAQMAQAAADLNFELAARLRDQIRGL
EHLGVDQKVSLPDDTVSRDAIALAVGDRHAAIQLFQIRAGRLVGRLAFVA
DAQSGSAGTILQRVLEEHYAQVEDVEIPSEILLQHPLPEPDFLRTYLSEK
KGRAVTLTVPQRQAKAELIAMVQRNAELELARLQQASDRTQTALEDLAQL
LGLDTLPHRIEGYDISHIQGSDAVASRVVFIDGLPAQQHYRHYNIRNPEV
KLGHSDDFASLAEILRRRFAPYLEGQGETPDDWPDIILIDGGKGQLSAVV
QVLEPLLEELTLVSLAKQREDIFLPHQRQPLPADPEQPGVQLLRRVRDEA
HRFALNFHRQKRAQRQRRSHLDQIPGLGYQRQKELLATFRSIDYIRMATP
EQLAQVNGVGPRLAARIYRYFHPDAD
>gid:519100  ycf41  
MNSCVLFAEVIQGPELRYTQENQMAVATMVVQFASLRSEEPPMSLRTVAW
GNLGQKMHAECKVGDRLILEGRLKMDTIDRPEGFKEKRAELVVSRFYTVD
GNIVDHPAQPAATVPVTPRVAPPPPEFEDISLDEVPF