TitleGenColors Logo

Gene list

Applied filters:

COG category: Cell motility
Organism: Mycobacterium tuberculosis H37Rv, H37Rv
Gene type: CDS

Number of genes found: 20

Free access
Sort by:

 



# Mycobacterium tuberculosis H37Rv, H37Rv

>Rv3690 PROBABLE CONSERVED MEMBRANE PROTEIN
MPSIDIDREAAHQAAQRELDKPIYPKDSLTKELTDWIDEQLYRILEKGSS
IPGGWFTITVLLILLMIAVTAAVQIARRTMRTNRGGDYQLFDAGQLTAAQ
HRSTAESYAAEGNWAAAIRHRLQAVARELEETGMLNPAAGRTANELASDA
GEVLPHLAGELTQAATAFNDVTYGERPGTQGAYQMIADLDDHLRSRSPAV
VSAVQHPAVFDSWAQVR
>Rv2743c POSSIBLE CONSERVED TRANSMEMBRANE ALANINE RICH PROTEIN
MAVKAGQRRPWRSLLQRGVDTAGDLADLVAQKISVAIDPRARLLRRRRRA
LRWGLVFTAGCLLWGLVTALLAAWGWFTSLLVITGTIAVTQAIPATLLLL
RYRWLRSEPLPVRRPASVRRLPPPGSAARPAMSALGASERGFFSLLGVME
RGAMLPADEIRDLTAAANQTSAAMVATAAEVVSMERAVQCSAASRSYLVP
TINAFTAQLSTGVRQYNEMVTAAAQLVSSANGAGGAGPGQQRYREELAGA
TDRLVAWAQAFDELGGLPRR
>Rv3900c CONSERVED HYPOTHETICAL ALANINE RICH PROTEIN
MVAADLPPGRWSAVLVGPWWPAPSAALRAAAQHWATWAMQKQELARNLIS
QHDLLLRNQGRTAEDLIGRYLRGAKSEVTKAEKYEIKKGAFNTAADAIDY
LRSRLTGIAGEGNKEIDDVLASKKPLPEQLAEIQAIQTRCNADAANASRD
AVDKVMTAMQEILEAEDIGDDPRTWARANGFNVDDAPPPRLIRENDLAAL
TGPGARGGSFGSVEGAGDLASPQSVGAGGFSGSGVQAACSQPAPRAIGAS
SRHASAGPVPPAPVVTTPAAATPPVIATGPRWRCPAGRCRRRPSDRAYRL
RRLGNRLRPGW
>Rv2551c CONSERVED HYPOTHETICAL PROTEIN
MLAAAVLAWMGVLCVCDVRQRRLPNWLTLPGAGVILLFAGLAGRGVPALA
GAAALAGVYLLVHLALPAAMGAGDVKLAIGLGGLTGCFGVEVWFLAALAA
PLLTAVCGVMVTPWGVRTLPHGPSMCVASLGAVGLALLG
>Rv3769 HYPOTHETICAL PROTEIN
MTTLKELGARVAALEANQADYRAVLAAVNPPGANQREIATTVREHTGRLD
RVTTKVGQLAAKSDDTNARVRSLEEGQAEIKDLLLRALDK
>Rv2082 CONSERVED HYPOTHETICAL PROTEIN
MAGDLPPGRWSALLVGAWWPARPDAPMAGVTYWRKAAQLKRNEANDLRNE
RSLLAVNQGRTADDLLERYWRGEQRLATIAHQCEVKSDQSEQVADAVNYL
RDRLTEIAQSGNQQINQILAGKGPIEAKVAAVNAVIEQSNAMADHVGATA
MSNIIDATQRVFDETIGGDAHTWLRDHGVSLDTPARPRPVTAEDMTSMTA
NSPAGSPFGAAPSAPSHSTTTSGPPTAPTPTSPFGTAPMVLSSSSTSSGP
PTAPTPTSPFGTAPMPPGPPPPGTVSPPLPPSAPAVGVGGPSVPAAGMPP
AAAAATAPLSPQSLGQSFTTGMTTGTPAAAGAQALSAGALHAATEPLPPP
APPPTTPTVTTPTVATATTAGIPHIPDSAPTPSPAPIAPPTTDNASAMTP
IAPMVANGPPASPAPPAAAPAGPLPAYGADLRPPVTTPPATPPTPTGPIS
GAAVTPSSPAAGGSLMSPVVNKSTAPATTQAQPSNPTPPLASATAAATTG
AAAGDTSRRAAEQQRLRRILDTVARQEPGLSWAAGLRDNGQTTLLVTDLA
SGWIPPHIRLPAHITLLEPAPRRRHATVTDLLGTTTVAAAHHPHGYLSQP
DPDTPALTGDRTARIAPTIDELGPTLVETVRRHDTLPPIAQAVVVAATRN
YGVPDNETDLLHHKTTEIHQAVLTTYPNHDIATVVDWMLLAAINALIAGD
QSGANYHLAWAIAAISTRRSR
>Rv0990c HYPOTHETICAL PROTEIN
MAESSLNPSLVSRISAFLRPDWTRTVRARRFAAAGLVMLAGVAALRSNPE
DDRSEVVVAAHDLRPGTALTPGDVRLEKRSATTLPDGSQADLDAVVGSTL
ASPTRRGEVLTDVRLLGSRLAESTAGPDARIVPLHLADSALVDLVRVGDV
VDVLAAPVTDSPAALRLLATDAIVVLVSAQQKAQAADSDRVVLVALPARL
ANTVAGAALGQTVTLTLH
>Rv3657c POSSIBLE CONSERVED ALANINE RICH MEMBRANE PROTEIN
MALWLGAGPSVVRARAGRPPRAHRPHQGLLLGRTDVADPLAVAASLDVLA
VCLAAGMAVSTAAAATAAVAPPRLARVLRRAADLLALGADPNIAWSRPPD
LPPGTHDAQTDAVLRLARRSAASGAALADGIVELAVQVRHDAAQAAAAAA
ERAGVLIAGPLGLCFLPAFLCVGIVPLVVGLAGDVLQFGLV
>Rv1548c PPE21, PPE FAMILY PROTEIN
MNFSVLPPEINSALMFAGAGPGPMLAAASAWTGLAGDLGSAAASFSAVTS
QLATGSWQGPASAAMTGVAASYARWLTTAAAQAEQAAGQAQAAVSAFEAA
LAATVHPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAA
MLGYHGEASAVALSLTPFTPSPSAAATPGGAVIIAGFPFLDLGNVTIGGF
NLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGG
NTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGG
NTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFG
NKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIG
FFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQ
LSTGWFNSATTSTGWFNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLF
NSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLF
NSVNQPVQNSGWLHTGTNNSGYANAGTFNSGFDNNARDEHAEFVTGNSGL
ANVGNYNAGIINVGDHLSGFRNSVPTITGTANISGFVNAGTSISGFFNFG
SLMSGFANFDDEVSGYLNGDSRASGWIH
>Rv1753c PPE24, PPE FAMILY PROTEIN
MNFSVLPPEINSALIFAGAGPEPMAAAATAWDGLAMELASAAASFGSVTS
GLVGGAWQGASSSAMAAAAAPYAAWLAAAAVQAEQTAAQAAAMIAEFEAV
KTAVVQPMLVAANRADLVSLVMSNLFGQNAPAIAAIEATYEQMWAADVSA
MSAYHAGASAIASALSPFSKPLQNLAGLPAWLASGAPAAAMTAAAGIPAL
AGGPTAINLGIANVGGGNVGNANNGLANIGNANLGNYNFGSGNFGNSNIG
SASLGNNNIGFGNLGSNNVGVGNLGNLNTGFANTGLGNFGFGNTGNNNIG
IGLTGNNQIGIGGLNSGTGNFGLFNSGSGNVGFFNSGNGNFGIGNSGNFN
TGGWNSGHGNTGFFNAGSFNTGMLDVGNANTGSLNTGSYNMGDFNPGSSN
TGTFNTGNANTGFLNAGNINTGVFNIGHMNNGLFNTGDMNNGVFYRGVGQ
GSLQFSITTPDLTLPPLQIPGISVPAFSLPAITLPSLNIPAATTPANITV
GAFSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLNIPAATTPANI
TVGAFSLPGLTLPSLNIPAATTPANITVGAFSLPGLTLPSLNIPAATTPA
NITVGAFSLPGLTLPSLNIPAATTPANITVSGFQLPPLSIPSVAIPPVTV
PPITVGAFNLPPLQIPEVTIPQLTIPAGITIGGFSLPAIHTQPITVGQIG
VGQFGLPSIGWDVFLSTPRITVPAFGIPFTLQFQTNVPALQPPGGGLSTF
TNGALIFGEFDLPQLVVHPYTLTGPIVIGSFFLPAFNIPGIDVPAINVDG
FTLPQITTPAITTPEFAIPPIGVGGFTLPQITTQEIITPELTINSIGVGG
FTLPQITTPPITTPPLTIDPINLTGFTLPQITTPPITTPPLTIDPINLTG
FTLPQITTPPITTPPLTIEPIGVGGFTTPPLTVPGIHLPSTTIGAFAIPG
GPGYFNSSTAPSSGFFNSGAGGNSGFGNNGSGLSGWFNTNPAGLLGGSGY
QNFGGLSSGFSNLGSGVSGFANRGILPFSVASVVSGFANIGTNLAGFFQG
TTS
>Rv1918c PPE35, PPE FAMILY PROTEIN
MHYSVLPPEINSALIFAGAGSGPMLAAASAWDGLATELASAAVSFGSVTA
GLVGGSWQGRSSVAMAAAAAPYAGWLAAAATQAEQAATQAQVMVAEFEAV
RLAMVQPALVAANRSGLISLVISNLFGQNAPAIAAAEAAYEEMWALDVSA
MAAYHSGASAVAVALPAFALPLRLPAGLAAGPAAVVTALTTAVGMPTFAG
RAIAASLGLANVGGGNLGNANNGLGNIGNANLGNNNLGSGNFGSFNIGSA
NLGGNNIGIGNAGANNFGLANLGNLNTGFANAGIGNFGIANTGNNNIGNG
LTGNNQIGIGGLNSGNGNVGLFNAGSANIGFFNSGNGNFGIGNSGNFSTG
LFNPGHGNTGFLNAGSFNTGMFDVGNANTGSFNVGHYNFGAFNPGPSNTG
TFNTGGANTGWFNTGSINTGAFNIGDMNNGLFNTGDMNNGVFYRGVGQGS
LQFAITSPDLTLPSLEIPGISVPAFSLPAITLPSLTIPAVTTPANVTVGA
FDLPGLTVPSLTIPAAMTPANITVGAFDLPGLTVPSLTIPATTTPANITV
GAFNLPQLSIPSVTVPPITIPAGTALGAFNLPTLSIPSVTVPPITIPAGT
TVGGFTLPTIHTPLISTPQISIGGFSTPGIATQANSGVINLPTFSLNGIT
ITNLVVFIPNNITALQTNMPGVFPQIGGFANTPPAFINTGTITVGGGQIN
GVGFSIGAINVTPFTLPNVVIQPWSLGGISVDGFTLPEISTQEFTTPALT
ISPIGVGALSLPDITTQQFTTPELTIDPITLGGFTLPQLSIPAITTPAFT
IDPIALGGFTLPQIMTPEITTPPFAIDPIGLSGFTLPQVNIPEITTPEFT
IQPVGLAAFTTPALTIASIHLPSTTMGGFAIPAGPGYFNSSATPSLGFFN
AGIGGNSGFGNSGSGLSGWFNTSPVGLLAGSGYQNYGGLISGFSNLGSGI
SGFANTGTLPFAVTSLVSGLANIGNNLSGLFFQSTTP
>Rv2353c PPE39, PPE FAMILY PROTEIN
MPGRFRNFGSQNLGSGNIGSTNVGSGNIGSTNVGSGNIGDTNFGNGNNGN
FNFGSGNTGSNNIGFGNTGSGNFGFGNTGNNNIGIGLTGDGQIGIGGLNS
GSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNG
GSTNTGLANAGAGNTGFFDAGNYNFGSLNAGNINSSFGNSGDGNSGFLNA
GDVNSGVGNAGDVNTGLGNSGNINTGGFNPGTLNTGFFSAMTQAGPNSGF
FNAGTGNSGFGHNDPAGSGNSGIQNSGFGNSGYVNTSTTSMFGGNSGVLN
TGYGNSGFYNAAVNNTGIFVTGVMSSGFFNFGTGNSGLLVSGNGLSGFFK
NLFG
>Rv0304c PPE5, PPE FAMILY PROTEIN
MNLVSTTSGMSGFLNVGALGSGVANVGNTISGIYNVGTSDLSTPAVNSGL
ANIGTNIAGLLRDGAGTAAINLGLANHGNLNVGFASLGGFNFGGATIGHN
NVGIGNTGIFDVGLANLGSYNIGFGNLGDDNLGFGNFGSYNIGFGNVGND
NLGFANAGGGNIGFANTGSNNVGFGNTGSNNVGIGLTGNGQIGFGSFNSG
SGNIGLFNSGSNNIGFFNSGSGNFGIANSGSFNTGIGNTGNTNTGLFNSG
DVNTGAFNPGSFNTGSFNTGSFNTGGFNPGNTNTGYLNIGNYNTGIANTG
DVDTGAFITGNYSNGLFLSGDYQGLVGLNLVIDMPLPISLGVNIPIDIPI
TASAGNITLMGVTIPPTGDIVLSSIAGQRAHFGPITIPNITVVGPTTTVA
IGGPNTAITITGGGAIRIPLISIPAAPGFGNSTTNPSSGFFNTGAGGASG
FGNFGGANSGFWNLASATSGASGLLNVGALGSGLANVGTTVSGFYNTSTS
DLATPAFNSGLANISTSIAGLLRDSTGTMVLNLGLANHGTLNVGIANLGD
YNIGFANLGSANFGSANIGGNNIGGANTGIFDIGLANLGSYNIGFGNFGD
DNLGFGNLGSYNVGFGNLGNDNLGFANTGSNNIGFANTGSNNIGIGLTGD
GQIGFGSLNSGSGNIGLFNSGSGNIGFFNSGNGNVGIGNTGTANFGLGNT
GSTNTGFFNSGDVNTGIGNTGSFNTGSFNPGDSNTGDFNPGSYNTGLGNT
GDVDTGAFISGSYSNGFLWSGNYQGLIGLHAALAIPEIALTFGVDIPIHI
PINIDAGVVTLQGFSIVAAENNIDFTPIIIPTINITLPTAAITVGGPTTS
IGITASAGIGSITIPIIDIPATSGFGNSTTSPSSGFFNSGAGSASGFLNV
VAGASGISGYLNVGALGSGVTNVGHTVSGFYNASALDLVTPAFASGLMRD
GMGTMTLNLGLANLGSNNAGFGNTGIFDVGVANLGNYNIGFGNFGDDNLG
FANLGSYNIGVANTGSNNIGFANTGSNNIGIGLTGTGQIGIGALNSGSGN
IGLFNSGDGNIGFFNSGTGNFGIGNTGTGNFGIGNSGSTSTGLFNSGDGN
TGGFNPGNFNTGNFNTGSFNTGGFNAGNTNTGHFNTGNYNTGIANTGDVS
TGAFISGNYSNGILWRGDYQGLIGYSYALTIPEIPAHLDVNIPIDIPITG
SFTDLVVDNFTIPIIGFESFAFSFHIHTEPDIGPIIVPSFVLSVPTFAIA
VGGPTTAINISATAGLGPITIPIIDIPAAPGIGNSTTSPSSGFFNTGAGT
ASGFGNVGGNTSGLWNLASAASGVSGLLNVGALGSGVANVGNTISGIYNT
SPLDLGTPAFGSGLANIAGLLQGGAGTTILDLAGLGNLNVGLANLGGSNF
GIGNTGIFNVGFANVGNHNIGLANLGNYSVGFANSGNYHIGIANTGSANI
GFANTGSGNIGIGLTGTGQIGFGSFNSGSHNIGLFNSGDGNVGFFNSGTG
NVGIGNTGTANFGIANSGSFNTGLGNTGSTNTGLFNPGNVNTGVGNTGSI
NTGSINTGSFNTGSTNTGSFNLGDHNTGSFNSGDYNTGYFNAGDYNTGVA
NTGNVNTGAFISGNYSNGFFWRGDYQGLIGLSTTITIPEIPYRYDLSVPI
DIPITGTVVATTPNSFTIPGFQIRVLLGPAAVLVNEMIGPITIDVNQVIA
IDSPIQQTISMVGTGGFGPIPIGISIGGTPGFGNSTTGPSSGFFHTGAGH
VSGFGNFGAGNMSGSGNFGAGNSGFFNAGGLGNSGLLNFGALQSGLANLG
NTISGVYNTSTLDLATPAFGSGIANIGANLAGLFLDNTGNLTLNFGVANQ
GGLNAGIGNLGSVNIGFVNTGDSNLGIGNLGDLNFGGVNIGGNNIGIANT
GIFDIGLANLGSYNIGLANLGDDNLGFGNAGSYNIGFANFGSDNLGFANT
GSYNIGFANTGNNNIGVGLTGNGQIGIGSLNSGSNNIGLFNSGSGNIGFF
NSGTGNVGIFNTGTGNFGLANSGGFNTGIGNAGSTNTGVFNPGDLNTGSF
NPGSFNTGGFNPGSGNTGYLNTGDYNTGVANTGDVDTGAFITGSYSNGFL
VSGDYQGLIGLPLLGIPVTPGYFNLTGGPSSGFFNSGAGSVSGFVNSGAG
LSGYLNTGALGSGVANVGNTISGWLNASALDLATPGFLSGIGNFGTNLAG
FFRG
>Rv3343c PPE54, PPE FAMILY PROTEIN
MSFVVMPPEINSLLIYTGAGPGPLLAAAAAWDELAAELGSAAAAFGSVTS
GLVGGIWQGPSSVAMAAAAAPYAGWLSAAAASAESAAGQARAVVGVFEAA
LAETVDPFVIAANRSRLVSLALSNLFGQNTPAIAAAEFDYELMWAQDVAA
MLGYHTGASAAAEALAPFGSPLASLAAAAEPAKSLAVNLGLANVGLFNAG
SGNVGSYNVGAGNVGSYNVGGGNIGGNNVGLGNVGWGNFGLGNSGLTPGL
MGLGNIGFGNAGSYNFGLANMGVGNIGFANTGSGNFGIGLTGDNLTGFGG
FNTGSGNVGLFNSGTGNVGFFNSGTGNWGVFNSGSYNTGIGNSGIASTGL
FNAGGFNTGVVNAGSYNTGSFNAGEANTGGFNPGSVNTGWLNTGDINTGV
ANSGDVNTGAFISGNYSNGVLWRGDYQGLLGFSSGANVLPVIPLSLDING
GVGAITIEPIHILPDIPININETLYLGPLVVPPINVPAISLGVGIPNISI
GPIKINPITLWPAQNFNQTITLAWPVSSITIPQIQQVALSPSPIPTTLIG
PIHINTGFSIPVTFSYSTPALTLFPVGLSIPTGGPLTLTLGVTAGTEAFT
IPGFSIPEQPLPLAINVIGHINALSTPAITIDNIPLNLHAIGGVGPVDIV
GGNVPASPGFGNSTTAPSSGFFNTGAGGVSGFGNVGAHTSGWFNQSTQAM
QVLPGTVSGYFNSGTLMSGIGNVGTQLSGMLSGGALGGNNFGLGNIGFDN
VGFGNAGSSNFGLANMGIGNIGLANTGNGNIGIGLSGDNLTGFGGFNSGS
ENVGLFNSGTGNVGFFNSGTGNLGVFNSGSHNTGFFLTGNNINVLAPFTP
GTLFTISEIPIDLQVIGGIGPIHVQPIDIPAFDIQITGGFIGIREFTLPE
ITIPAIPIHVTGTVGLEGFHVNPAFVLFGQTAMAEITADPVVLPDPFITI
DHYGPPLGPPGAKFPSGSFYLSISDLQINGPIIGSYGGPGTIPGPFGATF
NLSTSSLALFPAGLTVPDQTPVTVNLTGGLDSITLFPGGLAFPENPVVSL
TNFSVGTGGFTVFPQGFTVDRIPVDLHTTLSIGPFPFRWDYIPPTPANGP
IPAVPGGFGLTSGLFPFHFTLNGGIGPISIPTTTVVDALNPLLTVTGNLE
VGPFTVPDIPIPAINFGLDGNVNVSFNAPATTLLSGLGITGSIDISGIQI
TNIQTQPAQLFMSVGQTLFLFDFRDGIELNPIVIPGSSIPITMAGLSIPL
PTVSESIPLNFSFGSPASTVKSMILHEILPIDVSINLEDAVFIPATVLPA
IPLNVDVTIPVGPINIPIITEPGSGNSTTTTSDPFSGLAVPGLGVGLLGL
FDGSIANNLISGFNSAVGIVGPNVGLSNLGGGNVGLGNVGDFNLGAGNVG
GFNVGGGNIGGNNVGLGNVGFGNVGLANSGLTPGLMGLGNIGFGNAGSYN
FGLANMGVGNIGFANTGSGNFGIGLTGDNLTGFGGFNTGSGNVGLFNSGT
GNVGFFNSGTGNWGVFNSGSYNTGIGNSGIASTGLFNAGGFNTGVVNAGS
YNTGSFNAGQANTGGFNPGSVNTGWLNTGDINTGVANSGDVNTGAFISGN
YSNGAFWRGDYQGLLGFSYRPAVLPQTPFLDLTLTGGLGSVVIPAIDIPA
IRPEFSANVAIDSFTVPSIPIPQIDLAATTVSVGLGPITVPHLDIPRVPV
TLNYLFGSQPGGPLKIGPITGLFNTPIGLTPLALSQIVIGASSSQGTITA
FLANLPFSTPVVTIDEIPLLASITGHSEPVDIFPGGLTIPAMNPLSINLS
GGTGAVTIPAITIGEIPFDLVAHSTLGPVHILIDLPAVPGFGNTTGAPSS
GFFNSGAGGVSGFGNVGAMVSGGWNQAPSALLGGGSGVFNAGTLHSGVLN
FGSGMSGLFNTSVLGLGAPALVSGLGSVGQQLSGLLASGTALHQGLVLNF
GLADVGLGNVGLGNVGDFNLGAGNVGGFNVGGGNIGGNNVGLGNVGWGNF
GLGNSGLTPGLMGLGNIGFGNAGSYNFGLANMGVGNIGFANTGSGNFGIG
LTGDNLTGFGGFNTGSGNVGLFNSGTGNVGFFNSGTGNWGVFNSGSYNTG
IGNSGIASTGLFNAGGFNTGVVNAGSYNTGSFNAGQANTGGFNPGSVNTG
WLNTGDINTGVANSGDVNTGAFISGNYSNGAFWRGDYQGLLGFSYTSTII
PEFTVANIHASGGAGPIIVPSIQFPAIPLDLSATGHIGGFTIPPVSISPI
TVRIDPVFDLGPITVQDITIPALGLDPATGVTVGPIFSSGSIIDPFSLTL
LGFINVNVPAIQTAPSEILPFTVLLSSLGVTHLTPEITIPGFHIPVDPIH
VELPLSVTIGPFVSPEITIPQLPLGLALSGATPAFAFPLEITIDRIPVVL
DVNALLGPINAGLVIPPVPGFGNTTAVPSSGFFNIGGGGGLSGFHNLGAG
MSGVLNAISDPLLGSASGFANFGTQLSGILNRGADISGVYNTGALGLITS
ALVSGFGNVGQQLAGLIYTGTGP
>Rv3347c PPE55, PPE FAMILY PROTEIN
MNFPVLPPEINSVLMYSGAGSSPLLAAAAAWDGLAEELGSAAVSFGQVTS
GLTAGVWQGAAAAAMAAAAAPYAGWLGSVAAQAVAVAGQARAAVAAFEAA
LAATVDPAAVAVNRMAMRALAMSNLLGQNAAAIAAVEAEYELMWAADVAA
MAGYHSGASAAAAALPAFSPPAQALGGGVGAFLNALFAGPAKMLRLNAGL
GNVGNYNVGLGNVGIFNLGAANVGAQNLGAANAGSGNFGFGNIGNANFGF
GNSGLGLPPGMGNIGLGNAGSSNYGLANLGVGNIGFANTGSNNIGIGLTG
DNLTGIGGLNSGTGNLGLFNSGTGNIGFFNSGTGNFGVFNSGSYNTGVGN
AGTASTGLFNVGGFNTGVANVGSYNTGSFNAGNTNTGGFNPGNVNTGWLN
TGNTNTGIANSGNVNTGAFISGNFSNGVLWRGDYEGLWGLSGGSTIPAIP
IGLELNGGVGPITVLPIQILPTIPLNIHQTFSLGPLVVPDIVIPAFGGGT
AIPISVGPITISPITLFPAQNFNTTFPVGPFFGLGVVNISGIEIKDLAGN
VTLQLGNLNIDTRINQSFPVTVNWSTPAVTIFPNGISIPNNPLALLASAS
IGTLGFTIPGFTIPAAPLPLTIDIDGQIDGFSTPPITIDRIPLNLGASVT
VGPILINGVNIPATPGFGNTTTAPSSGFFNSGDGGVSGFGNFGAGSSGWW
NQAQTEVAGAGSGFANFGSLGSGVLNFGSGVSGLYNTGGLPPGTPAVVSG
IGNVGEQLSGLSSAGTALNQSLIINLGLADVGSVNVGFGNVGDFNLGAAN
IGDLNVGLGNVGGGNVGFGNIGDANFGLGNAGLAAGLAGVGNIGLGNAGS
GNVGFGNMGVGNIGFGNTGTNNLGIGLTGDNQTGIGGLNSGAGNIGLFNS
GTGNVGLFNSGTGNFGLFNSGSFNTGIGNGGTGSTGLFNAGNFNTGVANP
GSYNTGSFNVGDTNTGGFNPGSINTGWFNTGNANTGVANSGNVDTGALMS
GNFSNGILWRGNFEGLFGLNVGITIPEFPIHWTSTGGIGPIIIPDTTILP
PIHLGLTGQANYGFAVPDIPIPAIHIDFDGAADAGFTAPATTLLSALGIT
GQFRFGPITVSNVQLNPFNVNLKLQFLHDAFPNEFPDPTISVQIQVAIPL
TSATLGGLALPLQQTIDAIELPAISFSQSIPIDIPPIDIPASTINGISMS
EVVPIDVSVDIPAVTITGTRIDPIPLNFDVLSSAGPINISIIDIPALPGF
GNSTELPSSGFFNTGGGGGSGIANFGAGVSGLLNQASSPMVGTLSGLGNA
GSLASGVLNSGVDISGMFNVSTLGSAPAVISGFGNLGNHVSGVSIDGLLA
MLTSGGSGGSGQPSIIDAAIAELRHLNPLNIVNLGNVGSYNLGFANVGDV
NLGAGNLGNLNLGGGNLGGQNLGLGNLGDGNVGFGNLGHGNVGFGNSGLG
ALPGIGNIGLGNAGSNNVGFGNMGLGNIGFGNTGTNNLGIGLTGDNQTGF
GGLNSGAGNLGLFNSGTGNIGFFNTGTGNWGLFNSGSYNTGIGNSGTGST
GLFNAGSFNTGLANAGSYNTGSLNAGNTNTGGFNPGNVNTGWFNAGHTNT
GGFNTGNVNTGAFNSGSFNNGALWTGDHHGLVGFSYSIEITGSTLVDINE
TLNLGPVHIDQIDIPGMSLFDIHELVNIGPFRIEPIDVPAVVLDIHETMV
IPPIVFLPSMTIGGQTYTIPLDTPPAPAPPPFRLPLLFVNALGDNWIVGA
SNSTGMSGGFVTAPTQGILIHTGPSSATTGSLALTLPTVTIPTITTSPIP
LKIDVSGGLPAFTLFPGGLNIPQNAIPLTIDASGVLDPITIFPGGFTIDP
LPLSLALNISVPDSSVPIIIVPPTPGFGNATATPSSGFFNSGAGGVSGFG
NFGAGSSGWWNQAHAALAGAGSGVLNVGTLNSGVLNVGSGISGLYNTAIV
GLGTPALVSGAGNVGQQLSGVLAAGTALTQSPIINLGLADVGNYNLGLGN
VGDFNLGAANLGDLNLGLGNIGNANVGFGNIGHGNVGFGNSGLGAALGIG
NIGLGNAGSTNVGLANMGVGNIGFANTGTNNLGIGLTGDNQTGIGGLNSG
AGNIGLFNSGTGNIGFFNSGTGNWGLFNSGSFNTGIGNSGTGSTGLFNAG
GFTTGLANAGSYNTGSFNVGDTNTGGFNPGSINTGWFNTGNANTGIANSG
NVDTGALMSGNFSNGILWRGNYEGLFSYSYSLDVPRITILDAHFTGAFGP
VVVPPIPVLAINAHLTGNAAMGAFTIPQIDIPALNPNVTGSVGFGPIAVP
SVTIPALTAARAVLDMAASVGATSEIEPFIVWTSSGAIGPTWYSVGRIYN
AGDLFVGGNIISGIPTLSTTGPVHAVFNAASQAFNTPALNIHQIPLGFQV
PGSIDAITLFPGGLTFPANSLLNLDVFVGTPGATIPAITFPEIPANADGE
LYVIAGDIPLINIPPTPGIGNTTTVPSSGFFNTGAGGGSGFGNFGANMSG
WWNQAHTALAGAGSGIANVGTLHSGVLNLGSGLSGIYNTSTLPLGTPALV
SGLGNVGDHLSGLLASNVGQNPITIVNIGLANVGNGNVGLGNIGNLNLGA
ANIGDVNLGFGNIGDVNLGFGNIGGGNVGFGNIGDANFGFGNSGLAAGLA
GMGNIGLGNAGSGNVGWANMGLGNIGFGNTGTNNLGIGLTGDNQSGIGGL
NSGTGNIGLFNSGTGNIGFFNSGTANFGLFNSGSYNTGIGNSGVASTGLV
NAGGFNTGVANAGSYNTGSFNAGDTNTGGFNPGSTNTGWFNTGNANTGVA
NAGNVNTGALITGNFSNGILWRGNYEGLAGFSFGYPIPLFPAVGADVTGD
IGPATIIPPIHIPSIPLGFAAIGHIGPISIPNIAIPSIHLGIDPTFDVGP
ITVDPITLTIPGLSLDAAVSEIRMTSGSSSGFKVRPSFSFFAVGPDGMPG
GEVSILQPFTVAPINLNPTTLHFPGFTIPTGPIHIGLPLSLTIPGFTIPG
GTLIPQLPLGLGLSGGTPPFDLPTVVIDRIPVELHASTTIGPVSLPIFGF
GGAPGFGNDTTAPSSGFFNTGGGGGSGFSNSGSGMSGVLNAISDPLLGSA
SGFANFGTQLSGILNRGAGISGVYNTGTLGLVTSAFVSGFMNVGQQLSGL
LFAGTGP
>Rv3350c PPE56, PPE FAMILY PROTEIN
MEFPVLPPEINSVLMYSGAGSSPLLAAAAAWDGLAEELGSAAVSFGQVTS
GLTAGVWQGAAAAAMAAAAAPYAGWLGSVAAAAEAVAGQARVVVGVFEAA
LAATVDPALVAANRARLVALAVSNLLGQNTPAIAAAEAEYELMWAADVAA
MAGYHSGASAAAAALPAFSPPAQALGGGVGAFLTALFASPAKALSLNAGL
GNVGNYNVGLGNVGVFNLGAGNVGGQNLGFGNAGGTNVGFGNLGNGNVGF
GNSGLGAGLAGLGNIGLGNAGSSNYGFANLGVGNIGFGNTGTNNVGVGLT
GNHLTGIGGLNSGTGNIGLFNSGTGNVGFFNSGTGNFGVFNSGNYNTGVG
NAGTASTGLFNAGNFNTGVVNVGSYNTGSFNAGDTNTGGFNPGGVNTGWL
NTGNTNTGIANSGNVNTGAFISGNFNNGVLWVGDYQGLFGVSAGSSIPAI
PIGLVLNGDIGPITIQPIPILPTIPLSIHQTVNLGPLVVPDIVIPAFGGG
IGIPINIGPLTITPITLFAQQTFVNQLPFPTFSLGKITIPQIQTFDSNGQ
LVSFIGPIVIDTTIPGPTNPQIDLTIRWDTPPITLFPNGISAPDNPLGLL
VSVSISNPGFTIPGFSVPAQPLPLSIDIEGQIDGFSTPPITIDRIPLTVG
GGVTIGPITIQGLHIPAAPGVGNTTTAPSSGFFNSGAGGVSGFGNVGAGS
SGWWNQAPSALLGAGSGVGNVGTLGSGVLNLGSGISGFYNTSVLPFGTPA
AVSGIGNLGQQLSGVSAAGTTLRSMLAGNLGLANVGNFNTGFGNVGDVNL
GAANIGGHNLGLGNVGDGNLGLGNIGHGNLGFANLGLTAGAAGVGNVGFG
NAGINNYGLANMGVGNIGFANTGTGNIGIGLVGDHRTGIGGLNSGIGNIG
LFNSGTGNVGFFNSGTGNFGIGNSGRFNTGIGNSGTASTGLFNAGSFSTG
IANTGDYNTGSFNAGDTNTGGFNPGGINTGWFNTGHANTGLANAGTFGTG
AFMTGDYSNGLLWRGGYEGLVGVRVGPTISQFPVTVHAIGGVGPLHVAPV
PVPAVHVEITDATVGLGPFTVPPISIPSLPIASITGSVDLAANTISPIRA
LDPLAGSIGLFLEPFRLSDPFITIDAFQVVAGVLFLENIIVPGLTVSGQI
LVTPTPIPLTLNLDTTPWTLFPNGFTIPAQTPVTVGMEVANDGFTFFPGG
LTFPRASAGVTGLSVGLDAFTLLPDGFTLDTVPATFDGTILIGDIPIPII
DVPAVPGFGNTTTAPSSGFFNTGGGGGSGFANVGAGTSGWWNQGHDVLAG
AGSGVANAGTLSSGVLNVGSGISGWYNTSTLGAGTPAVVSGIGNLGQQLS
GFLANGTVLNRSPIVNIGWADVGAFNTGLGNVGDLNWGAANIGAQNLGLG
NLGSGNVGFGNIGAGNVGFANSGPAVGLAGLGNVGLSNAGSNNWGLANLG
VGNIGLANTGTGNIGIGLVGDYQTGIGGLNSGSGNIGLFNSGTGNVGFFN
TGTGNFGLFNSGSFNTGIGNSGTGSTGLFNAGNFNTGIANPGSYNTGSFN
VGDTNTGGFNPGDINTGWFNTGIMNTGTRNTGALMSGTDSNGMLWRGDHE
GLFGLSYGITIPQFPIRITTTGGIGPIVIPDTTILPPLHLQITGDADYSF
TVPDIPIPAIHIGINGVVTVGFTAPEATLLSALKNNGSFISFGPITLSNI
DIPPMDFTLGLPVLGPITGQLGPIHLEPIVVAGIGVPLEIEPIPLDAISL
SESIPIRIPVDIPASVIDGISMSEVVPIDASVDIPAVTITGTTISAIPLG
FDIRTSAGPLNIPIIDIPAAPGFGNSTQMPSSGFFNTGAGGGSGIGNLGA
GVSGLLNQAGAGSLVGTLSGLGNAGTLASGVLNSGTAISGLFNVSTLDAT
TPAVISGFSNLGDHMSGVSIDGLIAILTFPPAESVFDQIIDAAIAELQHL
DIGNALALGNVGGVNLGLANVGEFNLGAGNVGNINVGAGNLGGSNLGLGN
VGTGNLGFGNIGAGNFGFGNAGLTAGAGGLGNVGLGNAGSGSWGLANVGV
GNIGLANTGTGNIGIGLTGDYRTGIGGLNSGTGNLGLFNSGTGNIGFFNT
GTGNFGLFNSGSYSTGVGNAGTASTGLFNAGNFNTGLANAGSYNTGSLNV
GSFNTGGVNPGTVNTGWFNTGHTNTGLFNTGNVNTGAFNSGSFNNGALWT
GDYHGLVGFSFSIDIAGSTLLDLNETLNLGPIHIEQIDIPGMSLFDVHEI
VEIGPFTIPQVDVPAIPLEIHESIHMDPIVLVPATTIPAQTRTIPLDIPA
SPGSTMTLPLISMRFEGEDWILGSTAAIPNFGDPFPAPTQGITIHTGPGP
GTTGELKISIPGFEIPQIATTRFLLDVNISGGLPAFTLFAGGLTIPTNAI
PLTIDASGALDPITIFPGGYTIDPLPLHLALNLTVPDSSIPIIDVPPTPG
FGNTTATPSSGFFNSGAGGVSGFGNVGSNLSGWWNQAASALAGSGSGVLN
VGTLGSGVLNVGSGVSGIYNTSVLPLGTPAVLSGLGNVGHQLSGVSAAGT
ALNQIPILNIGLADVGNFNVGFGNVGDVNLGAANLGAQNLGLGNVGTGNL
GFANVGHGNIGFGNSGLTAGAAGLGNTGFGNAGSANYGFANQGVRNIGLA
NTGTGNIGIGLVGDNLTGIGGLNSGAGNIGLFNSGTGNIGFFNSGTGNFG
IGNSGSFNTGIGNSGTGSTGLFNAGSFNTGVANAGSYNTGSFNAGDTNTG
GFNPGTINTGWFNTGHTNTGIANSGNVGTGAFMSGNFSNGLLWRGDHEGL
FSLFYSLDVPRITIVDAHLDGGFGPVVLPPIPVPAVNAHLTGNVAMGAFT
IPQIDIPALTPNITGSAAFRIVVGSVRIPPVSVIVEQIINASVGAEMRID
PFEMWTQGTNGLGITFYSFGSADGSPYATGPLVFGAGTSDGSHLTISASS
GAFTTPQLETGPITLGFQVPGSVNAITLFPGGLTFPATSLLNLDVTAGAG
GVDIPAITWPEIAASADGSVYVLASSIPLINIPPTPGIGNSTITPSSGFF
NAGAGGGSGFGNFGAGTSGWWNQAHTALAGAGSGFANVGTLHSGVLNLGS
GVSGIYNTSTLGVGTPALVSGLGNVGHQLSGLLSGGSAVNPVTVLNIGLA
NVGSHNAGFGNVGEVNLGAANLGAHNLGFGNIGAGNLGFGNIGHGNVGVG
NSGLTAGVPGLGNVGLGNAGGNNWGLANVGVGNIGLANTGTGNIGIGLTG
DYQTGIGGLNSGAGNLGLFNSGAGNVGFFNTGTGNFGLFNSGSFNTGVGN
SGTGSTGLFNAGSFNTGVANAGSYNTGSFNVGDTNTGGFNPGSINTGWLN
AGNANTGVANAGNVNTGAFVTGNFSNGILWRGDYQGLAGFAVGYTLPLFP
AVGADVSGGIGPITVLPPIHIPPIPVGFAAVGGIGPIAIPDISVPSIHLG
LDPAVHVGSITVNPITVRTPPVLVSYSQGAVTSTSGPTSEIWVKPSFFPG
IRIAPSSGGGATSTQGAYFVGPISIPSGTVTFPGFTIPLDPIDIGLPVSL
TIPGFTIPGGTLIPTLPLGLALSNGIPPVDIPAIVLDRILLDLHADTTIG
PINVPIAGFGGAPGFGNSTTLPSSGFFNTGAGGGSGFSNTGAGMSGLLNA
MSDPLLGSASGFANFGTQLSGILNRGAGISGVYNTGALGVVTAAVVSGFG
NVGQQLSGLLFTGVGP
>Rv3429 PPE59, PPE FAMILY PROTEIN
MHPMIPAEYISNIIYEGPGADSLSAAAEQLRLMYNSANMTAKSLTDRLGE
LQENWKGSSSDLMADAAGRYLDWLTKHSRQILETAYVIDFLAYVYEETRH
KVVPPATIANNREEVHRLIASNVAGVNTPAIAGLDAQYQQYRAQNIAVMN
DYQSTARFILAYLPRWQEPPQIYGGGGG
>Rv0305c PPE6, PPE FAMILY PROTEIN
MDFVVSAPEVNSLRMYLGAGSGPMLAAAAAWDGLADELAVAASWFGSVTS
GLADAAWRGPAAVAMARAVAPYLGWLISATAQAEQAAAQARVAVATFEAA
RAATVHPAIVAANRAVLVSLVSSNLLGFNAPAIAATEAAYERMWAQDVAA
MVGYHAGASAAVSALMPFTQQLKKLAGLSERLTSAAAAAAGPPSAAGFNL
GLANVGANNVGNGNVGVFNVGFGNLGSYNLGFANLGSDNLGLANLGGHNI
GFANTGSNNVGFGNTGSNNVGIGLTGNGQIGFGSFNSGSHNIGLFNSGSG
NVGLFNSGTGNFGIGNSGTGNFGLGNTGSTNTGWFNTGDVNTGGFNPGSY
NTGNFNTGNYNTGSFNAGNYNTGYFNTGDYNTGVANTGNVNTGAFIAGNY
SNGVLWRGDYQGLIGADIALEIPAIPINAQLFSMPIHQVMVMPGSVMTIP
GMRLPFTSIVPFVVYYGPVELPQSTLTLPTVTITVGGPTTTIDGNLTGMV
GGVSIPLIKIPAAPGFGNSTTSPSSGFFNAGAGTASGFGNFGGGASGFWN
LASATSGLSGFGNVGALGSGVANVGNTISGLYNTSTSNLATPAFNSGLLH
HSVGTMTLNFGLANVGGNNVGGANAGIFNVGLANLGDYNIGFGNLGGDNL
GFAHAGSYNIGFANTGSNNLGFANTGDNNIGFANIGSNNIGIGLTGSGQI
GFGSLNSGSHNIGLFNSGDGNIGLFNSGSGNFGIGNAGTGNWGIGNSGAG
NFGIGNAGSTNTGLFNSGDLNTGSLNPGSYNTGSVNTGSVNTGGFNAGNY
NTGYFNTGDLQHRHGEHRQYQHRRFHLRQPQQRPSVAGRQPGSDRPRHRR
RHSRNPDCERRREYPDSHTDHRQLHGHRIQRARSSTEHSRHCYFFRTRRY
RPLHRPSDTDNRSHTCGHGGWTHYRDQYRRHCGRRRHQHPDYPYSSDSRL
RQLDRRTVVGLLQ
>Rv3533c PPE62, PPE FAMILY PROTEIN
MNYAVLPPELNSLRMFTGAGSAPMLAAAVAWDGLAAELGSAASSFGSVTS
DLASQAWQGPAAAAMAAAAAPYAGWLSAAAARAAGAAAQAKAVASAFEAA
RAATVHPLLVAANRNAFAQLVMSNWFGLNAPLIAAVEGAYEQMWAADVAA
MVGYHSGASAAAEQLVPFQQALQQLPNLGIGNIGNANLGGGNTGDLNTGN
GNIGNTNLGSGNRGDANLGSGNIGNSNVGGGNVGNGNFGSGNGRAGLPGS
GNVGNGNLGNSNLGSGNTGNSNVGFGNTGNNNVGTGNAGSGNIGAGNTGS
SNWGFGNNGIGNIGFGNTGNGNIGFGLTGNNQVGIGGLNSGSGNIGLFNS
GTNNVGFFNSGNGNLGIGNSSDANVGIGNSGATVGPFVAGHNTGFGNSGS
LNTGMGNAGGVNTGFGNGGAINLGFGNSGQLNAGSFNAGSINTGNFNSGQ
GNTGDFNAGVRNTGWSNSGLTNTGAFNAGSLNTGFGAVGTGSGPNSGFGN
AGTNNSGFFNTGVGSSGFQNGGSNNSGLQNAVGTVIAAGFGNTGAQTVGI
ANSGVLNSGFFNSGVHNSGGFNSENQRSGFGN
>Rv0355c PPE8, PPE FAMILY PROTEIN
MSFAVLPPEINSARLYVGAGLAPMLDAAAAWDGLADELGSAAASFSAVTA
GLAGSSWLGAASTAMTGAAAPYLGWLSAAAAQAQQAATQTRLAAAAFEAA
LAATVHPAIISANRALFVSLVVSNLLGQNAPAIAATEAAYEQMWAQDVAA
MFGYHAGASAAVSALTPFGQALPTVAGGGALVSAAAAQVTTRVFRNLGLA
NVGEGNVGNGNVGNFNLGSANIGNGNIGSGNIGSSNIGFGNVGPGLTAAL
NNIGFGNTGSNNIGFGNTGSNNIGFGNTGDGNRGIGLTGSGLLGFGGLNS
GTGNIGLFNSGTGNVGIGNSGTGNWGIGNSGNSYNTGFGNSGDANTGFFN
SGIANTGVGNAGNYNTGSYNPGNSNTGGFNMGQYNTGYLNSGNYNTGLAN
SGNVNTGAFITGNFNNGFLWRGDHQGLIFGSPGFFNSTSAPSSGFFNSGA
GSASGFLNSGANNSGFFNSSSGAIGNSGLANAGVLVSGVINSGNTVSGLF
NMSLVAITTPALISGFFNTGSNMSGFFGGPPVFNLGLANRGVVNILGNAN
IGNYNILGSGNVGDFNILGSGNLGSQNILGSGNVGSFNIGSGNIGVFNVG
SGSLGNYNIGSGNLGIYNIGFGNVGDYNVGFGNAGDFNQGFANTGNNNIG
FANTGNNNIGIGLSGDNQQGFNIASGWNSGTGNSGLFNSGTNNVGIFNAG
TGNVGIANSGTGNWGIGNPGTDNTGILNAGSYNTGILNAGDFNTGFYNTG
SYNTGGFNVGNTNTGNFNVGDTNTGSYNPGDTNTGFFNPGNVNTGAFDTG
DFNNGFLVAGDNQGQIAIDLSVTTPFIPINEQMVIDVHNVMTFGGNMITV
TEASTVFPQTFYLSGLFFFGPVNLSASTLTVPTITLTIGGPTVTVPISIV
GALESRTITFLKIDPAPGIGNSTTNPSSGFFNSGTGGTSGFQNVGGGSSG
VWNSGLSSAIGNSGFQNLGSLQSGWANLGNSVSGFFNTSTVNLSTPANVS
GLNNIGTNLSGVFRGPTGTIFNAGLANLGQLNIGSANLGDFNLGSGNVGS
FNVFSGNQGSYNIGPANLGNYNIGFANLGNYNIGFGNAGDFNQGFANTGN
NNIGFANTGNNNIGIGLSGDNQQGFNFAGGWNSGTANIGLFNSGTNNVGI
GNSGTGNWGIGNSGSGNTGIGNTGSTNTGFFNTGIVNTGVANAGSYNTGW
YNTGDTNTGIANLGDFNTGFYNTGNFSTGFANQGDIATGAFITGDMGNGA
FWRGDQQGLFSAGYRVHVPEIPAHVTVEVPVNIPITASFTNTVYSGITLE
QINFGFTIDIAGIPLLAGAISKAVLPPITGTGPAITVNIGDPGGSTAIRI
PATASVGPFDVTFVNIAATTGFFNATTDPSSGFFNGGPGTVSGIANIGAN
ISGFQNVANSATSGFNNYGSLQSGLANLGDTVSGVFNTGIGAPANVSGMF
NIGSNLAGFFHDQATGMSMFNLGLGNIGQFNVGFSNVGDSNAGLANIGSF
NLGSGNLGSFNVFGGNQGSYNIGPANLGNYNIGLGNLGSYNFGFGNAGDF
NLGFANTGNNNIGFANTGNNNIGIGLSGDNQQGFNFAGGWNSGSGNSGLF
NSGTNNIGLFNSGTGNIGIGNSGTGNWGIANTGDTNTGIFNTGDVNTGLL
NAGNVNTGIFNTGHYNTGSFNAGSFNTAGFNPGSYNTGYLNTGSYNTGLA
NSGDVNTGGFITGNYSNGFWWRGDYQGLAGISQTITVPDTAVPVKLHVPI
FLDIPVTGTLGTFTVHGFRFPEITGDIFLIGIPFNAATLDAFSFPNISIV
LPNIGINLGSGPDPLIDIAGTGGLLPIKIPLIDIPAAPGFGNSTTTPSSG
FFNAGTGTVSGVGNVGSNSSGFFNLTSGSSGISGVQNFGELISGGFNFGN
TVSGLVNASTLGLSMPANLSGGGNVGATVAGFVNNTQILNLGFGNVGSGN
VGHGNIGDSNVGLGNLGNANVGHGNIGSFNVFSGNRGSYNIGPANLGNYN
IGLGNLGSYNFGFGNAGDFNLGFANSGSNNIGFANTGNNNIGIGLSGHNQ
QGFGSWNSGTANTGLFNSGTNNIGLFNSGTGNIGIGNSGIGNTGIGNPGV
GNTGLGNSGTGNWGLWNPGTGNMGVANVGTYNTGGYNVGSTNTGIANVGI
ANTGSYNTGSTNTGSFNDGDFNTGFYNTGDYNTGFYNTGDVNTGAFIGGN
FSNGAFWQSDHQGQWGAHYAITVPQIPLLNFSLNIPVNIPIHLDFGTLAV
NGFQIPAITLRALGVTHFSVGPIIVPRIAGTLPVIDINIGDPGGSSSIPI
TITSGAGPVVIPLLDIPPAPGFGNSTTGPSSGFFNSGTGSSSGFGNVGAN
NSGFWNTAFAGIGNSGLQNFGSLQSGWANLGNTVSGFYNTSAADFATPAN
LSGLSNVGADLTGVLRGPNGSTFNAGLANLGQFNVGSANLGSANLGSANL
GSANLGNSNVGFGNIGNANIGGANIGDFNVGIANTGPGLTAAVNNIGIGN
TGNYNIGVGNTGNYNIGFGNTGNNNIGIGLSGDNQIGFGPLNAGIANMGL
FNLGDNNFGMANAGNFNQGIANTGNNNIGLFNTGNNNVGIWLTGDGLSGF
SSLNSGAGNTGFFNSGTANTGLFNSGTGNTGLFNSGTGNVGIGNMGTGGF
GVGLSGDSQVGIGGTNSGSFNIGLFNSGTGNVGIGNSGTGNVGIGNTGTG
NTGIGNSGNYNTGLLNAGLVNTGIANPGNHNTGLFNIGTFNTGIANPGHY
NTGSYNTGSYNTGMANAGDYGTGAFITGSMNNGLLWRADRQGLLAANYTI
TIERPAAFLNVDIPVNIPITGDITNVSIPAITFPRIDASGSVDIGILSGT
VLAPVGPITLHGGDASAPLDTPIEIDFGPSPAINLNIGKPDGSTVINIVG
GAGAGPISIPIIDLRPAPGFFNATTGPSSGFLNWGAGSASGLLNFGNNSG
LYNFATSSMGNSGFQNYGSLQSGWANLGNSISGIYNTGLGAPANVSGLLN
IGTNLAGWLQNGPTETTFSVGLANLGFWNLGSANIGNYNLGSANIGVYNL
GSANIGDFNLGSANIGDFNLGSANIGSSNIGFGNVGPGLTAAIGNIGFGN
TGNGNIGIGNTGTGNIGFGNTGNGNIGIGLTGDTMTGFGGWNSGTGNIGL
FNSGTGNIGFGNSGTGNWGIGNSGDYNTGIGNTGSTNSGFFNTGLVNTGI
GNSGDYNTGLFNAGNTNTGSFNPGDYNTGGFNPGNYNTGYFNPGNSNTGI
ANSGDVNTGAFNSGNYSNGFFWRGDYQGLGGFAYQSAVSEIPWSYDRFQH