Supplementary file 2. Annotation of Mtb diagnostic missense SNPs.
#ID - position of the corresponding amino acid residue in the pseudo-alignment in Supplementary file 1.fa;
#Locus - location of the SNP on the chromosome of Mtb H37Rv (NC_000962.3);
#Alleles - allelic state in Mtb H37Rv | [alternative allelic states];
#Tag - locus tag of the corresponding CDS as in Mtb H37Rv (NC_000962.3);
#Gene - name of the gene containing the corresponding missense mutation;
#Codon - number of the codon affected by the corresponding missense mutation;
#Annotation - protein product annotation of the corresponding CDS as in Mtb H37Rv (NC_000962.3).
ID Locus Alleles Tag Gene Codon Annotation
1 1 L | M,V Rv0001 dnaA 1 Chromosomal replication initiator protein DnaA
2 2 L | *,S,W,X Rv0001 dnaA 1 Chromosomal replication initiator protein DnaA
3 3 L | F Rv0001 dnaA 1 Chromosomal replication initiator protein DnaA
4 4 T | A,I,K,M,R,S Rv0001 dnaA 2 Chromosomal replication initiator protein DnaA
5 7 D | *,E,G,I,L,N,Q,R,S,T,V,Y Rv0001 dnaA 3 Chromosomal replication initiator protein DnaA
6 8 D | *,E,G,I,L,Q,R,S,T,V,Y Rv0001 dnaA 3 Chromosomal replication initiator protein DnaA
7 2386 G | A Rv0002 dnaN 112 DNA polymerase III (beta chain) DnaN (DNA nucleotidyltransferase)
8 3446 A | V Rv0003 recF 56 DNA replication and repair protein RecF (single-strand DNA binding protein)
9 4013 I | T Rv0003 recF 245 DNA replication and repair protein RecF (single-strand DNA binding protein)
10 4600 V | G Rv0004 . 56 Rv0004, (MTCY10H4.02), len: 187 aa. Conserved hypothetical protein (see Salazar et al., 1996). Belongs to superfamily DUF721; this family contains several actinomycete proteins of unknown function.
11 6432 A | V Rv0005 gyrB 398 DNA gyrase (subunit B) GyrB (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
12 6446 A | S Rv0005 gyrB 403 DNA gyrase (subunit B) GyrB (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
13 6738 T | I Rv0005 gyrB 500 DNA gyrase (subunit B) GyrB (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
14 6749 A | T Rv0005 gyrB 504 DNA gyrase (subunit B) GyrB (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
15 6750 A | V,X Rv0005 gyrB 504 DNA gyrase (subunit B) GyrB (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
16 7362 E | Q Rv0006 gyrA 21 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
17 7510 H | R Rv0006 gyrA 70 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
18 7521 A | S Rv0006 gyrA 74 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
19 7564 G | A Rv0006 gyrA 88 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
20 7570 A | V Rv0006 gyrA 90 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
21 7572 S | P Rv0006 gyrA 91 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
22 7581 D | H,N,Y Rv0006 gyrA 94 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
23 7582 D | A,G Rv0006 gyrA 94 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
24 7585 S | T Rv0006 gyrA 95 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
25 7705 T | S Rv0006 gyrA 135 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
26 7932 D | Y Rv0006 gyrA 211 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
27 8040 G | S Rv0006 gyrA 247 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
28 8175 R | G Rv0006 gyrA 292 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
29 8227 R | Q Rv0006 gyrA 309 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
30 8428 R | L Rv0006 gyrA 376 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
31 8476 R | L Rv0006 gyrA 392 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
32 8634 A | T Rv0006 gyrA 445 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
33 8793 I | V Rv0006 gyrA 498 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
34 8806 D | A Rv0006 gyrA 502 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
35 9034 R | Q Rv0006 gyrA 578 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
36 9202 K | N Rv0006 gyrA 634 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
37 9304 G | D Rv0006 gyrA 668 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
38 9348 F | V Rv0006 gyrA 683 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
39 9432 L | M Rv0006 gyrA 711 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
40 9487 G | R Rv0006 gyrA 729 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
41 9796 G | A Rv0006 gyrA 832 DNA gyrase (subunit A) GyrA (DNA topoisomerase (ATP-hydrolysing)) (DNA topoisomerase II) (type II DNA topoisomerase)
42 10032 A | V Rv0007 . 40 Rv0007, (MTCY10H4.05), len: 304 aa. Possible conserved membrane protein. A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al.,2004).
43 10109 H | D Rv0007 . 66 Rv0007, (MTCY10H4.05), len: 304 aa. Possible conserved membrane protein. A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al.,2004).
44 10785 D | G Rv0007 . 291 Rv0007, (MTCY10H4.05), len: 304 aa. Possible conserved membrane protein. A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al.,2004).
45 11879 S | P Rv0008c . 145 Rv0008c, (MTCY10H4.07c), len: 145 aa. Possible membrane protein.
46 14251 D | N Rv0012 . 55 Rv0012, (MTCY10H4.12), len: 262 aa. Probable conserved membrane protein. Belongs to superfamily DUF881. Contains probable N-terminal signal sequence.
47 14785 C | R Rv0012 . 233 Rv0012, (MTCY10H4.12), len: 262 aa. Probable conserved membrane protein. Belongs to superfamily DUF881. Contains probable N-terminal signal sequence.
48 15117 I | M Rv0013 trpG 68 Possible anthranilate synthase component II TrpG (glutamine amidotransferase)
49 15517 R | G Rv0013 trpG 202 Possible anthranilate synthase component II TrpG (glutamine amidotransferase)
50 15880 A | P,T Rv0014c pknB 531 Transmembrane serine/threonine-protein kinase B PknB (protein kinase B) (STPK B)
51 16119 R | L Rv0014c pknB 451 Transmembrane serine/threonine-protein kinase B PknB (protein kinase B) (STPK B)
52 17608 N | K Rv0015c pknA 386 Transmembrane serine/threonine-protein kinase A PknA (protein kinase A) (STPK A)
53 19427 P | S Rv0016c pbpA 270 Probable penicillin-binding protein PbpA
54 20259 A | G Rv0017c rodA 461 Probable cell division protein RodA
55 20368 S | A Rv0017c rodA 425 Probable cell division protein RodA
56 21795 P | S Rv0018c pstP 463 Phosphoserine/threonine phosphatase PstP
57 24292 V | I Rv0020c fhaA 385 Conserved protein with FHA domain, FhaA
58 24493 Y | D Rv0020c fhaA 318 Conserved protein with FHA domain, FhaA
59 24698 Y | * Rv0020c fhaA 250 Conserved protein with FHA domain, FhaA
60 24707 Q | H,P Rv0020c fhaA 247 Conserved protein with FHA domain, FhaA
61 24716 Y | A,H,T Rv0020c fhaA 244 Conserved protein with FHA domain, FhaA
62 24738 G | V Rv0020c fhaA 236 Conserved protein with FHA domain, FhaA
63 26309 A | G Rv0021c . 192 Rv0021c, (MTCY10H4.21c), len: 322 aa. Conserved hypothetical protein, similar to various proteins.
64 26741 G | A,R Rv0021c . 48 Rv0021c, (MTCY10H4.21c), len: 322 aa. Conserved hypothetical protein, similar to various proteins.
65 27952 K | E Rv0023 . 120 Rv0023, (MTCY10H4.23), len: 256 aa. Possible transcriptional regulator. Contains probable helix-turn helix motif from aa 19 to 40 (Score 1615, +4.69 SD).
66 29402 A | V Rv0025 . 53 Rv0025, (MTCY10H4.25), len: 120 aa. Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis e.g. Rv0739 (268 aa), FASTA score: (37.6% identity in 101 aa overlap), and Rv0026 FASTA score: (35.4% identity in 113 aa overlap); etc.
67 29551 A | E Rv0025 . 103 Rv0025, (MTCY10H4.25), len: 120 aa. Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis e.g. Rv0739 (268 aa), FASTA score: (37.6% identity in 101 aa overlap), and Rv0026 FASTA score: (35.4% identity in 113 aa overlap); etc.
68 29993 A | V Rv0026 . 91 Rv0026, (MTCY10H4.26), len: 448 aa. Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis: Rv0025 FASTA score: (35.4% identity in 113 aa overlap) and Rv0739 (268 aa), FASTA score: (32.4% identity in 142 aa overlap).
69 30688 S | A Rv0026 . 323 Rv0026, (MTCY10H4.26), len: 448 aa. Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis: Rv0025 FASTA score: (35.4% identity in 113 aa overlap) and Rv0739 (268 aa), FASTA score: (32.4% identity in 142 aa overlap).
70 31258 D | Y Rv0027 . 24 Rv0027, (MTCY10H4.27), len: 105 aa. Conserved hypothetical unknown protein.
71 31518 T | I Rv0028 . 2 Rv0028, (MTCY10H4.28), len: 101 aa. Conserved hypothetical unknown protein.
72 32385 M | I,T Rv0029 . 110 Rv0029, (MTCY10H4.29), len: 365 aa. Conserved hypothetical protein, showing some similarity to other proteins from Mycobacterium tuberculosis e.g. C-terminal region of Rv2082; Rv3899c.
73 33456 H | R Rv0030 . 78 Rv0030, (MTCY10H4.30), len: 109 aa. Conserved hypothetical unknown protein.
74 33551 * | G Rv0030 . 110 Rv0030, (MTCY10H4.30), len: 109 aa. Conserved hypothetical unknown protein.
75 35892 D | G Rv0032 bioF2 533 Possible 8-amino-7-oxononanoate synthase BioF2 (AONS) (8-amino-7-ketopelargonate synthase) (7-keto-8-amino-pelargonic acid synthetase) (7-KAP synthetase) (L-alanine--pimelyl CoA ligase)
76 36008 D | H Rv0032 bioF2 572 Possible 8-amino-7-oxononanoate synthase BioF2 (AONS) (8-amino-7-ketopelargonate synthase) (7-keto-8-amino-pelargonic acid synthetase) (7-KAP synthetase) (L-alanine--pimelyl CoA ligase)
77 37326 L | P Rv0035 fadD34 23 Probable fatty-acid-CoA ligase FadD34 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase)
78 37763 D | H Rv0035 fadD34 169 Probable fatty-acid-CoA ligase FadD34 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase)
79 37884 H | P Rv0035 fadD34 209 Probable fatty-acid-CoA ligase FadD34 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase)
80 39474 L | S Rv0036c . 119 Rv0036c, (MTCY10H4.36c), len: 257 aa. Conserved protein, highly similar to CAB95889.1|AL359988 conserved hypothetical protein from Streptomyces (276 aa). Also some similarity to Rv3099c|MTCY164_10 (283 aa), FASTA scores: E(): 3.3e-05, (25.9% identity in 205 aa overlap).
81 39478 K | *,Q Rv0036c . 118 Rv0036c, (MTCY10H4.36c), len: 257 aa. Conserved protein, highly similar to CAB95889.1|AL359988 conserved hypothetical protein from Streptomyces (276 aa). Also some similarity to Rv3099c|MTCY164_10 (283 aa), FASTA scores: E(): 3.3e-05, (25.9% identity in 205 aa overlap).
82 40324 A | G,V Rv0037c . 294 Rv0037c, (MTCY10H4.37c), len: 441 aa. Probable conserved integral membrane protein, member of major facilitator superfamily (MFS) possibly involved in transport of macrolide.
83 40842 V | I,L Rv0037c . 121 Rv0037c, (MTCY10H4.37c), len: 441 aa. Probable conserved integral membrane protein, member of major facilitator superfamily (MFS) possibly involved in transport of macrolide.
84 42281 C | F Rv0039c . 24 Rv0039c, (MTCY21D4.02c, MTCY10H4.39c), len: 115 aa. Possible conserved transmembrane protein. A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al., 2004).
85 42967 I | M Rv0040c mtc28 134 Secreted proline rich protein Mtc28 (proline rich 28 kDa antigen)
86 43041 H | Y Rv0040c mtc28 109 Secreted proline rich protein Mtc28 (proline rich 28 kDa antigen)
87 43722 P | L Rv0041 leuS 54 Probable leucyl-tRNA synthetase LeuS (leucine--tRNA ligase) (LEURS)
88 44734 I | M Rv0041 leuS 391 Probable leucyl-tRNA synthetase LeuS (leucine--tRNA ligase) (LEURS)
89 44768 R | G Rv0041 leuS 403 Probable leucyl-tRNA synthetase LeuS (leucine--tRNA ligase) (LEURS)
90 45750 E | A Rv0041 leuS 730 Probable leucyl-tRNA synthetase LeuS (leucine--tRNA ligase) (LEURS)
91 47035 G | R Rv0042c . 58 Rv0042c, (MTCY21D4.05c), len: 208 aa. Possible transcriptional regulatory protein, MarR-family. Some similarity to Mycobacterium tuberculosis proteins Rv2327,Rv0880, and Rv1404.
92 48280 D | N Rv0044c . 250 Rv0044c, (MTCY21D4.07c), len: 264 aa. Possible oxidoreductase, highly similar to AAD32732.1|MmcI|AF127374| F420-dependent H4MPT reductase from Streptomyces lavendulae (264 aa). Also similar to Mycobacterium tuberculosis proteins e.g. Rv1855c, Rv0953c, Rv0791c, Rv0132c, etc.
93 48557 T | A,P,R Rv0044c . 158 Rv0044c, (MTCY21D4.07c), len: 264 aa. Possible oxidoreductase, highly similar to AAD32732.1|MmcI|AF127374| F420-dependent H4MPT reductase from Streptomyces lavendulae (264 aa). Also similar to Mycobacterium tuberculosis proteins e.g. Rv1855c, Rv0953c, Rv0791c, Rv0132c, etc.
94 48577 G | C,F Rv0044c . 151 Rv0044c, (MTCY21D4.07c), len: 264 aa. Possible oxidoreductase, highly similar to AAD32732.1|MmcI|AF127374| F420-dependent H4MPT reductase from Streptomyces lavendulae (264 aa). Also similar to Mycobacterium tuberculosis proteins e.g. Rv1855c, Rv0953c, Rv0791c, Rv0132c, etc.
95 48721 H | N Rv0044c . 103 Rv0044c, (MTCY21D4.07c), len: 264 aa. Possible oxidoreductase, highly similar to AAD32732.1|MmcI|AF127374| F420-dependent H4MPT reductase from Streptomyces lavendulae (264 aa). Also similar to Mycobacterium tuberculosis proteins e.g. Rv1855c, Rv0953c, Rv0791c, Rv0132c, etc.
96 48767 Y | * Rv0044c . 88 Rv0044c, (MTCY21D4.07c), len: 264 aa. Possible oxidoreductase, highly similar to AAD32732.1|MmcI|AF127374| F420-dependent H4MPT reductase from Streptomyces lavendulae (264 aa). Also similar to Mycobacterium tuberculosis proteins e.g. Rv1855c, Rv0953c, Rv0791c, Rv0132c, etc.
97 49360 V | I Rv0045c . 194 Rv0045c, (MTCY21D4.08c), len: 298 aa. Possible hydrolase, showing similarity with others. Also similar to Mycobacterium tuberculosis proteins Rv3473c, Rv1123c,Rv1938, Rv3617, Rv3670, etc.
98 49692 G | A,E Rv0045c . 83 Rv0045c, (MTCY21D4.08c), len: 298 aa. Possible hydrolase, showing similarity with others. Also similar to Mycobacterium tuberculosis proteins Rv3473c, Rv1123c,Rv1938, Rv3617, Rv3670, etc.
99 50557 R | G Rv0046c ino1 190 myo-inositol-1-phosphate synthase Ino1 (inositol 1-phosphate synthetase) (D-glucose 6-phosphate cycloaldolase) (glucose 6-phosphate cyclase) (glucocycloaldolase)
100 51949 V | A Rv0048c . 250 Rv0048c, MTCY21D4.11c, len: 289 aa. Possible membrane protein.
101 52000 L | N,T Rv0048c . 233 Rv0048c, MTCY21D4.11c, len: 289 aa. Possible membrane protein.
102 54842 A | S Rv0050 ponA1 394 Probable bifunctional penicillin-binding protein 1A/1B PonA1 (murein polymerase) (PBP1): penicillin-insensitive transglycosylase (peptidoglycan TGASE) + penicillin-sensitive transpeptidase (DD-transpeptidase)
103 55553 P | A,R,S Rv0050 ponA1 631 Probable bifunctional penicillin-binding protein 1A/1B Pona1 (murein polymerase) (PBP1): penicillin-insensitive transglycosylase (peptidoglycan TGASE) + penicillin-sensitive transpeptidase (DD-transpeptidase)
104 55558 E | A,P,S,T,X Rv0050 ponA1 632 Probable bifunctional penicillin-binding protein 1A/1B Pona1 (murein polymerase) (PBP1): penicillin-insensitive transglycosylase (peptidoglycan TGASE) + penicillin-sensitive transpeptidase (DD-transpeptidase)
105 57298 R | G Rv0051 . 535 Rv0051, (MTCY21D4.14), len:560 aa. Predicted to be in the GT-C superfamily of glycosyltransferases (See Liu and Mushegian, 2003). Probable conserved transmembrane protein.
106 57828 E | G Rv0052 . 140 Rv0052, (MTCY21D4.15), len: 187 aa. Conserved protein, similar to others including Rv1930c from Mycobacterium tuberculosis (174 aa). May be a membrane protein.
107 58961 F | V Rv0054 ssb 126 Single-strand binding protein Ssb (helix-destabilizing protein)
108 61975 N | T Rv0058 dnaB 527 Probable replicative DNA helicase DnaB
109 62049 R | G Rv0058 dnaB 552 Probable replicative DNA helicase DnaB
110 63771 P | L Rv0059 . 191 Rv0059, (MTV030.02), len: 230 aa. Hypothetical unknown protein.
111 64935 V | I Rv0060 . 343 Rv0060, (MTV030.03), len: 352 aa. Conserved hypothetical protein.
112 65037 P | R Rv0061c . 105 Rv0061c, len: 112 aa. Conserved hypothetical protein supported by RNA-seq data. Similar to MMAR_3839, 76% identity in 112 aa overlap. Replaces questionable ORF Rv0061 (MTV030.04).
113 65083 P | S Rv0061c . 90 Rv0061c, len: 112 aa. Conserved hypothetical protein supported by RNA-seq data. Similar to MMAR_3839, 76% identity in 112 aa overlap. Replaces questionable ORF Rv0061 (MTV030.04).
114 65089 G | S Rv0061c . 88 Rv0061c, len: 112 aa. Conserved hypothetical protein supported by RNA-seq data. Similar to MMAR_3839, 76% identity in 112 aa overlap. Replaces questionable ORF Rv0061 (MTV030.04).
115 65150 Q | *,W Rv0061c . 67 Hypothetical protein
116 66285 A | G Rv0062 celA1 245 Possible cellulase CelA1 (endoglucanase) (endo-1,4-beta-glucanase) (FI-cmcase) (carboxymethyl cellulase)
117 66632 P | S Rv0062 celA1 361 Possible cellulase CelA1 (endoglucanase) (endo-1,4-beta-glucanase) (FI-cmcase) (carboxymethyl cellulase)
118 68340 M | K,L Rv0063 . 473 Rv0063, (MTV030.06), len: 479 aa. Possible oxidoreductase, similar to many. Similar to Mycobacterium tuberculosis proteins e.g. Rv3107c, Rv1257c, etc. Contains PS00862 Oxygen oxidoreductases covalent FAD-binding site.
119 69989 G | D Rv0064 . 457 Rv0064, (MTV030.07), len: 979 aa. Probable conserved transmembrane protein, similar to many. Contains probable coiled-coil domain from aa 948 to 976.
120 70267 V | F Rv0064 . 550 Rv0064, (MTV030.07), len: 979 aa. Probable conserved transmembrane protein, similar to many. Contains probable coiled-coil domain from aa 948 to 976.
121 70912 N | D Rv0064 . 765 Rv0064, (MTV030.07), len: 979 aa. Probable conserved transmembrane protein, similar to many. Contains probable coiled-coil domain from aa 948 to 976.
122 71336 R | P Rv0064 . 906 Rv0064, (MTV030.07), len: 979 aa. Probable conserved transmembrane protein, similar to many. Contains probable coiled-coil domain from aa 948 to 976.
123 71366 P | R Rv0064 . 916 Rv0064, (MTV030.07), len: 979 aa. Probable conserved transmembrane protein, similar to many. Contains probable coiled-coil domain from aa 948 to 976.
124 71443 G | W Rv0064 . 942 Rv0064, (MTV030.07), len: 979 aa. Probable conserved transmembrane protein, similar to many. Contains probable coiled-coil domain from aa 948 to 976.
125 71449 D | N Rv0064 . 944 Rv0064, (MTV030.07), len: 979 aa. Probable conserved transmembrane protein, similar to many. Contains probable coiled-coil domain from aa 948 to 976.
126 72545 E | A Rv0066c icd2 656 Probable isocitrate dehydrogenase [NADP] Icd2 (oxalosuccinate decarboxylase) (IDH) (NADP+-specific ICDH) (IDP)
127 72549 G | S Rv0066c icd2 655 Probable isocitrate dehydrogenase [NADP] Icd2 (oxalosuccinate decarboxylase) (IDH) (NADP+-specific ICDH) (IDP)
128 75304 T | A Rv0068 . 2 Rv0068, (MTV030.11), len: 303 aa. Probable oxidoreductase, similar to many.
129 75875 R | H Rv0068 . 192 Rv0068, (MTV030.11), len: 303 aa. Probable oxidoreductase, similar to many.
130 75940 V | L Rv0068 . 214 Rv0068, (MTV030.11), len: 303 aa. Probable oxidoreductase, similar to many.
131 76530 P | T Rv0069c sdaA 365 Probable L-serine dehydratase SdaA (L-serine deaminase) (SDH) (L-SD)
132 79506 S | D,Y Rv0071 . 7 Rv0071, (MTV030.14), len: 235 aa. Possible maturase,similar to many proteins of the group II intron maturase family. Contains 5 VDP repeats at N-terminus, these are also found in two Streptococcus plasmid hypothetical proteins Q52246|X17092 and Q54942|X66468.
133 79531 P | A,R,S Rv0071 . 16 Rv0071, (MTV030.14), len: 235 aa. Possible maturase,similar to many proteins of the group II intron maturase family. Contains 5 VDP repeats at N-terminus, these are also found in two Streptococcus plasmid hypothetical proteins Q52246|X17092 and Q54942|X66468.
134 79538 D | A,G,T Rv0071 . 18 Rv0071, (MTV030.14), len: 235 aa. Possible maturase,similar to many proteins of the group II intron maturase family. Contains 5 VDP repeats at N-terminus, these are also found in two Streptococcus plasmid hypothetical proteins Q52246|X17092 and Q54942|X66468.
135 79549 P | A,L,R Rv0071 . 22 Rv0071, (MTV030.14), len: 235 aa. Possible maturase,similar to many proteins of the group II intron maturase family. Contains 5 VDP repeats at N-terminus, these are also found in two Streptococcus plasmid hypothetical proteins Q52246|X17092 and Q54942|X66468.
136 79560 A | L,P,R Rv0071 . 25 Rv0071, (MTV030.14), len: 235 aa. Possible maturase,similar to many proteins of the group II intron maturase family. Contains 5 VDP repeats at N-terminus, these are also found in two Streptococcus plasmid hypothetical proteins Q52246|X17092 and Q54942|X66468.
137 79565 V | D,G,T Rv0071 . 27 Rv0071, (MTV030.14), len: 235 aa. Possible maturase,similar to many proteins of the group II intron maturase family. Contains 5 VDP repeats at N-terminus, these are also found in two Streptococcus plasmid hypothetical proteins Q52246|X17092 and Q54942|X66468.
138 81135 I | S Rv0072 . 171 Rv0072, (MTV030.16), len: 349 aa. Probable glutamine-transport transmembrane protein ABC-transporter (see citation below). Note that supposed act with near ORF Rv0073|MTV030.17 ATP-binding protein ABC-transporter.
139 83033 L | V Rv0074 . 96 Rv0074, (MTV030.18), len: 411 aa. Conserved protein,similar to many.
140 83288 A | P Rv0074 . 181 Rv0074, (MTV030.18), len: 411 aa. Conserved protein,similar to many.
141 84174 Y | S Rv0075 . 60 Rv0075, (MTV030.19), len: 390 aa. Probable aminotransferase, similar to many class-II pyridoxal-phosphate-dependent aminotransferases (MALY/PATB subfamily). Also similar to other proteins from Mycobacterium tuberculosis e.g. Rv2294, Rv0858c, etc.
142 84629 H | D Rv0075 . 212 Rv0075, (MTV030.19), len: 390 aa. Probable aminotransferase, similar to many class-II pyridoxal-phosphate-dependent aminotransferases (MALY/PATB subfamily). Also similar to other proteins from Mycobacterium tuberculosis e.g. Rv2294, Rv0858c, etc.
143 85328 E | K,Q Rv0076c . 82 Rv0076c, (MTV030.20c), len: 129 aa. Probable membrane protein, with membrane-spanning domain at C-terminus.
144 85752 A | P Rv0077c . 239 Rv0077c, (MTV030.21c), len: 276 aa. Possible oxidoreductase, weakly similar to others from Streptomyces. Also similar to MTCY05A6_35 and MTCY1A11_10 from Mycobacterium tuberculosis. And shows some similarity in part with AAL17935.1|AY054120 putative epoxide hydrolase from Mycobacterium smegmatis ...
145 86133 P | A Rv0077c . 112 Rv0077c, (MTV030.21c), len: 276 aa. Possible oxidoreductase, weakly similar to others from Streptomyces. Also similar to MTCY05A6_35 and MTCY1A11_10 from Mycobacterium tuberculosis. And shows some similarity in part with AAL17935.1|AY054120 putative epoxide hydrolase from Mycobacterium smegmatis ...
146 91502 G | D Rv0083 . 368 Rv0083, (MTV030.27, MTCY251.01), len: 640 aa. Probable oxidoreductase, showing some similarity to other various oxidoreductases. Nucleotide position 91071 in the genome sequence has been corrected, T:C resulting in I224I.
147 91805 A | G Rv0083 . 469 Rv0083, (MTV030.27, MTCY251.01), len: 640 aa. Probable oxidoreductase, showing some similarity to other various oxidoreductases. Nucleotide position 91071 in the genome sequence has been corrected, T:C resulting in I224I.
148 93987 L | V Rv0086 hycQ 13 Possible hydrogenase HycQ
149 94598 L | F Rv0086 hycQ 216 Possible hydrogenase HycQ
150 94821 V | M Rv0086 hycQ 291 Possible hydrogenase HycQ
151 96977 R | K Rv0088 . 17 Rv0088, (MTCY251.06), len: 224 aa. Possible polyketide cyclase/dehydrase. Belongs to the SRPBCC ligand-binding domain superfamily. Predicted to be an outer membrane protein (See Song et al., 2008).
152 98774 L | R Rv0090 . 99 Rv0090, (MTCY251.08), len: 256 aa. Possible membrane protein. Contains IPR014511 Protein of unknown function DUF2068, transmembrane, subgroup.
153 101351 V | I Rv0092 ctpA 257 Cation transporter P-type ATPase a CtpA
154 101706 A | V Rv0092 ctpA 375 Cation transporter P-type ATPase a CtpA
155 103602 G | A Rv0093c . 21 Rv0093c, (MTCY251.12c), len: 282 aa. Probable conserved membrane protein, equivalent only to CAC30943.1|AL583924 probable integral membrane protein from Mycobacterium leprae (237 aa). A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al.,2004).
156 103743 H | Q,R Rv0094c . 308 Rv0094c, (MTCY251.13c), len: 317 aa. Member of 13E12 repeat family, showing some similarity to U15187|MLU15187_7 from Mycobacterium leprae (94 aa), FASTA score: (49.4% identity in 79 aa overlap).
157 103744 L | R,T,X Rv0094c . 307 Rv0094c, (MTCY251.13c), len: 317 aa. Member of 13E12 repeat family, showing some similarity to U15187|MLU15187_7 from Mycobacterium leprae (94 aa), FASTA score: (49.4% identity in 79 aa overlap).
158 103745 L | C,H,R,T,Y Rv0094c . 307 Rv0094c, (MTCY251.13c), len: 317 aa. Member of 13E12 repeat family, showing some similarity to U15187|MLU15187_7 from Mycobacterium leprae (94 aa), FASTA score: (49.4% identity in 79 aa overlap).
159 103879 G | D Rv0094c . 262 Rv0094c, (MTCY251.13c), len: 317 aa. Member of 13E12 repeat family, showing some similarity to U15187|MLU15187_7 from Mycobacterium leprae (94 aa), FASTA score: (49.4% identity in 79 aa overlap).
160 103930 A | G Rv0094c . 245 Rv0094c, (MTCY251.13c), len: 317 aa. Member of 13E12 repeat family, showing some similarity to U15187|MLU15187_7 from Mycobacterium leprae (94 aa), FASTA score: (49.4% identity in 79 aa overlap).
161 104824 S | I Rv0095c . 131 Rv0095c, (MTCY251.14c), len: 136 aa. Member of 13E12 repeat, also partially similar to AF0418|AF041819_8 from Mycobacterium bovis BCG (222 aa), FASTA score: (89.6% identity in 96 aa overlap).
162 104942 Q | E Rv0095c . 92 Rv0095c, (MTCY251.14c), len: 136 aa. Member of 13E12 repeat, also partially similar to AF0418|AF041819_8 from Mycobacterium bovis BCG (222 aa), FASTA score: (89.6% identity in 96 aa overlap).
163 104943 H | Q Rv0095c . 92 Rv0095c, (MTCY251.14c), len: 136 aa. Member of 13E12 repeat, also partially similar to AF0418|AF041819_8 from Mycobacterium bovis BCG (222 aa), FASTA score: (89.6% identity in 96 aa overlap).
164 104962 A | V Rv0095c . 85 Rv0095c, (MTCY251.14c), len: 136 aa. Member of 13E12 repeat, also partially similar to AF0418|AF041819_8 from Mycobacterium bovis BCG (222 aa), FASTA score: (89.6% identity in 96 aa overlap).
165 105007 S | T Rv0095c . 70 Rv0095c, (MTCY251.14c), len: 136 aa. Member of 13E12 repeat, also partially similar to AF0418|AF041819_8 from Mycobacterium bovis BCG (222 aa), FASTA score: (89.6% identity in 96 aa overlap).
166 112212 P | T Rv0101 nrp 738 Probable peptide synthetase Nrp (peptide synthase)
167 113142 A | T Rv0101 nrp 1048 Probable peptide synthetase Nrp (peptide synthase)
168 113897 I | N Rv0101 nrp 1299 Probable peptide synthetase Nrp (peptide synthase)
169 114561 R | W Rv0101 nrp 1521 Probable peptide synthetase Nrp (peptide synthase)
170 117351 A | T Rv0101 nrp 2451 Probable peptide synthetase Nrp (peptide synthase)
171 118429 P | Q Rv0102 . 239 Rv0102, (MTCY251.21), len: 661 aa. Probable conserved integral membrane protein, highly similar to P53525|Y102_MYCLE|ML1998|NP_302349.1|NC_002677 possible membrane protein from Mycobacterium leprae (659 aa), FASTA scores: opt: 3107, E(): 0, (70.2% identity in 662 aa overlap). Also similar to othe...
172 120085 M | V Rv0103c ctpB 697 Probable cation-transporter P-type ATPase B CtpB
173 121962 E | * Rv0103c ctpB 71 Probable cation-transporter P-type ATPase B CtpB
174 122109 L | S Rv0103c ctpB 22 Probable cation-transporter P-type ATPase B CtpB
175 123454 Q | * Rv0104 . 380 Rv0104, (MTCY251.23), len: 504 aa. Conserved hypothetical protein, showing weak similarity with other cAMP-dependent protein kinases e.g. AAC37564.1|M65066 cAMP-dependent protein kinase RI-beta regulatory subunit from Homo sapiens (380 aa); etc.
176 123520 Y | H Rv0104 . 402 Rv0104, (MTCY251.23), len: 504 aa. Conserved hypothetical protein, showing weak similarity with other cAMP-dependent protein kinases e.g. AAC37564.1|M65066 cAMP-dependent protein kinase RI-beta regulatory subunit from Homo sapiens (380 aa); etc.
177 123745 G | R Rv0104 . 477 Rv0104, (MTCY251.23), len: 504 aa. Conserved hypothetical protein, showing weak similarity with other cAMP-dependent protein kinases e.g. AAC37564.1|M65066 cAMP-dependent protein kinase RI-beta regulatory subunit from Homo sapiens (380 aa); etc.
178 125560 E | G Rv0106 . 396 Rv0106, (MTCY251.25), len: 398 aa. Conserved hypothetical protein, similar to others e.g. AL049841|SCE9_33 from Streptomyces coelicolor (370 aa),FASTA scores: opt: 282, E(): 2.5e-11, (32.0% identity in 381 aa overlap); etc. Some similarity to P94400 homologue to nitrile hydratase region from Baci...
179 125824 V | L Rv0107c ctpI 1573 Probable cation-transporter ATPase I CtpI
180 126319 G | V Rv0107c ctpI 1408 Probable cation-transporter ATPase I CtpI
181 127277 A | P,R Rv0107c ctpI 1089 Probable cation-transporter ATPase I CtpI
182 127985 T | H,P Rv0107c ctpI 853 Probable cation-transporter ATPase I CtpI
183 128027 V | I Rv0107c ctpI 839 Probable cation-transporter ATPase I CtpI
184 128243 T | P Rv0107c ctpI 767 Probable cation-transporter ATPase I CtpI
185 129134 P | A,S Rv0107c ctpI 470 Probable cation-transporter ATPase I CtpI
186 131904 S | A Rv0109 . 175 PE-PGRS family protein PE_PGRS1
187 132099 G | F Rv0109 . 240 PE-PGRS family protein PE_PGRS1
188 132175 A | G Rv0109 . 265 PE-PGRS family protein PE_PGRS1
189 132417 R | G Rv0109 . 346 PE-PGRS family protein PE_PGRS1
190 132856 T | N Rv0109 . 492 PE-PGRS family protein PE_PGRS1
191 137598 A | S Rv0113 gmhA 94 Probable sedoheptulose-7-phosphate isomerase GmhA (phosphoheptose isomerase)
192 139087 V | G Rv0115 hddA 192 Possible D-alpha-D-heptose-7-phosphate kinase HddA
193 140944 T | P Rv0116c ldtA 27 Probable L,D-transpeptidase LdtA
194 141404 K | E Rv0117 oxyS 69 Oxidative stress response regulatory protein OxyS
195 141623 D | N Rv0117 oxyS 142 Oxidative stress response regulatory protein OxyS
196 142843 V | E Rv0118c oxcA 345 Probable oxalyl-CoA decarboxylase OxcA
197 143025 R | A Rv0118c oxcA 285 Probable oxalyl-CoA decarboxylase OxcA
198 143207 S | G Rv0118c oxcA 224 Probable oxalyl-CoA decarboxylase OxcA
199 144406 R | W Rv0119 fadD7 120 Probable fatty-acid-CoA ligase FadD7 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase)
200 146087 N | S Rv0120c fusA2 562 Probable elongation factor G FusA2 (EF-G)
201 146610 A | S Rv0120c fusA2 388 Probable elongation factor G FusA2 (EF-G)
202 147311 R | L Rv0120c fusA2 154 Probable elongation factor G FusA2 (EF-G)
203 148012 A | S Rv0121c . 111 Rv0121c, (MTCI418B.03c), len: 144 aa. Conserved protein, showing some similarity with others proteins from Mycobacterium tuberculosis e.g. Rv1155, Rv1875, Rv2074,etc.
204 149717 Y | C Rv0124 . 62 PE-PGRS family protein PE_PGRS2
205 150271 G | H,R Rv0124 . 247 PE-PGRS family protein PE_PGRS2
206 150691 S | A Rv0124 . 387 PE-PGRS family protein PE_PGRS2
207 150890 D | A,E,G,N,Q,V Rv0124 . 453 PE-PGRS family protein PE_PGRS2
208 150897 A | G,R Rv0124 . 455 PE-PGRS family protein PE_PGRS2
209 154283 S | P Rv0127 mak 18 Maltokinase Mak
210 156601 A | P,R Rv0129c fbpC 334 Secreted antigen 85-C FbpC (85C) (antigen 85 complex C) (AG58C) (mycolyl transferase 85C) (fibronectin-binding protein C)
211 157129 G | S Rv0129c fbpC 158 Secreted antigen 85-C FbpC (85C) (antigen 85 complex C) (AG58C) (mycolyl transferase 85C) (fibronectin-binding protein C)
212 158073 Q | P Rv0130 htdZ 76 Probable 3-hydroxyl-thioester dehydratase
213 158806 R | G,K,Q,X Rv0131c fadE1 285 Probable acyl-CoA dehydrogenase FadE1
214 160604 A | G Rv0132c fgd2 60 Putative F420-dependent glucose-6-phosphate dehydrogenase Fgd2
215 162581 G | S Rv0134 ephF 271 Possible epoxide hydrolase EphF (epoxide hydratase) (arene-oxide hydratase)
216 163940 K | R Rv0136 cyp138 192 Probable cytochrome P450 138 Cyp138
217 164518 T | A,P Rv0136 cyp138 385 Probable cytochrome P450 138 Cyp138
218 164942 A | E Rv0137c msrA 107 Probable peptide methionine sulfoxide reductase MsrA (protein-methionine-S-oxide reductase) (peptide met(O) reductase)
219 164946 M | A,I,K,R Rv0137c msrA 106 Probable peptide methionine sulfoxide reductase MsrA (protein-methionine-S-oxide reductase) (peptide met(O) reductase)
220 165471 G | A Rv0138 . 50 Rv0138, (MTCI5.12), len: 167 aa. Conserved hypothetical protein, showing weak similarity to Q10827|YT10_MYCTU hypothetical 17.0 KDA protein from Mycobacterium tuberculosis (147 aa), FASTA scores: opt: 131, E(): 0.047, (31.15% identity in 106 aa overlap).
221 166565 I | V Rv0139 . 247 Rv0139, (MTCI5.13), len: 340 aa. Possible oxidoreductase, similar to others e.g. O34285|HPNA HPNA protein from Zymomonas mobilis (337 aa), FASTA scores: opt: 507, E (): 5.8e-27, (31.1% identity in 328 aa overlap); TRE_STRGR|P29782 dtdp-glucose 4,6-dehydratase (328 aa),FASTA scores: opt: 254, E():...
222 167438 E | K,L,S,V Rv0141c . 82 Rv0141c, (MTCI5.15c), len: 136 aa. Unknown protein.
223 170671 A | T Rv0144 . 130 Rv0144, (MTCI5.18), len: 280 aa. Probable transcriptional regulator, possibly TetR family. Has region similar to others e.g. Q59431|UIDR_ECOLI|GUSR|B1618|Z2623|ECS2326 UID operon repressor (GUS operon) from Escherichia coli strains K12 and O157:H7 (196 aa), FASTA scores: opt: 214, E(): 1.1e-06,(2...
224 172492 Y | * Rv0146 . 94 Rv0146, (MTCI5.20), len: 310 aa. Possible S-adenosylmethionine-dependent methyltransferase (see Grana et al., 2007), highly similar to others e.g. AC30975.1|AL583924 conserved hypothetical protein from Mycobacterium leprae (304 aa); and several Mycobacterium tuberculosis proteins e.g. Rv0726c, Rv...
225 173091 M | P Rv0146 . 294 Rv0146, (MTCI5.20), len: 310 aa. Possible S-adenosylmethionine-dependent methyltransferase (see Grana et al., 2007), highly similar to others e.g. AC30975.1|AL583924 conserved hypothetical protein from Mycobacterium leprae (304 aa); and several Mycobacterium tuberculosis proteins e.g. Rv0726c, Rv...
226 173668 V | G Rv0147 . 144 Rv0147, (MTCI5.21), len: 506 aa. Probable aldehyde dehydrogenase (NAD+) dependent, similar to others e.g. DHAP_RAT|P11883 aldehyde dehydrogenase (dimeric NADP-preferring) (452 aa), FASTA scores: opt: 1291, E(): 0,(43.9% identity in 453 aa overlap). Also similar to several Mycobacterium tuberculos...
227 178205 G | R Rv0151c . 369 PE family protein PE1
228 178946 A | T Rv0151c . 122 PE family protein PE1
229 179227 V | G Rv0151c . 28 PE family protein PE1
230 179231 W | G Rv0151c . 27 PE family protein PE1
231 179636 L | F Rv0152c . 421 PE family protein PE2
232 180025 G | E Rv0152c . 291 PE family protein PE2
233 180805 R | T Rv0152c . 
31 PE family protein PE2 234 182434 H | Q Rv0154c fadE2 256 Probable acyl-CoA dehydrogenase FadE2 235 184309 A | S Rv0155 pntAa 230 Probable NAD(P) transhydrogenase (subunit alpha) PntAa [first part; catalytic part] (pyridine nucleotide transhydrogenase subunit alpha) (nicotinamide nucleotide transhydrogenase subunit alpha) 236 184409 G | D Rv0155 pntAa 263 Probable NAD(P) transhydrogenase (subunit alpha) PntAa [first part; catalytic part] (pyridine nucleotide transhydrogenase subunit alpha) (nicotinamide nucleotide transhydrogenase subunit alpha) 237 187690 T | P Rv0159c . 384 PE family protein PE3 238 188317 S | P Rv0159c . 175 PE family protein PE3 239 188800 T | A Rv0159c . 14 PE family protein PE3 240 189850 F | S Rv0160c . 197 PE family protein PE4 241 189866 L | P,R Rv0160c . 192 PE family protein PE4 242 189948 A | K,N Rv0160c . 164 PE family protein PE4 243 191462 A | P Rv0161 . 286 Rv0161, (MTCI28.01, MTV032.04), len: 449 aa. Possible oxidoreductase, similar to hypothetical proteins and various oxidoreductases e.g. AIP2_YEAST|P46681 actin interacting protein 2 (530 aa), FASTA scores: opt: 356, E (): 0, (33.3% identity in 357 aa overlap); DLD1_YEAST|P32891 d-lactate dehydrog... 244 191750 P | S Rv0161 . 382 Rv0161, (MTCI28.01, MTV032.04), len: 449 aa. Possible oxidoreductase, similar to hypothetical proteins and various oxidoreductases e.g. AIP2_YEAST|P46681 actin interacting protein 2 (530 aa), FASTA scores: opt: 356, E (): 0, (33.3% identity in 357 aa overlap); DLD1_YEAST|P32891 d-lactate dehydrog... 
245 192342 V | G Rv0162c adhE1 265 Probable zinc-type alcohol dehydrogenase (E subunit) AdhE1
246 192637 N | Y Rv0162c adhE1 167 Probable zinc-type alcohol dehydrogenase (E subunit) AdhE1
247 194305 G | A,P Rv0165c mce1R 171 Probable transcriptional regulatory protein Mce1R (probably GntR-family)
248 194786 R | S Rv0165c mce1R 11 Probable transcriptional regulatory protein Mce1R (probably GntR-family)
249 196400 G | R Rv0166 fadD5 470 Probable fatty-acid-CoA ligase FadD5 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase)
250 198401 G | C Rv0168 yrbE1B 248 Conserved integral membrane protein YrbE1B
251 199470 S | A Rv0169 mce1A 313 Mce-family protein Mce1A
252 200430 I | T Rv0170 mce1B 179 Mce-family protein Mce1B
253 201949 G | R Rv0171 mce1C 340 Mce-family protein Mce1C
254 202087 T | P Rv0171 mce1C 386 Mce-family protein Mce1C
255 202675 I | T Rv0172 mce1D 67 Mce-family protein Mce1D
256 204840 G | D Rv0173 lprK 259 Possible Mce-family lipoprotein LprK (Mce-family lipoprotein Mce1E)
257 205182 N | S Rv0173 lprK 373 Possible Mce-family lipoprotein LprK (Mce-family lipoprotein Mce1E)
258 206339 L | P Rv0174 mce1F 370 Mce-family protein Mce1F
259 206852 A | E Rv0175 . 13 Rv0175, (MTCI28.15), len: 213 aa. Probable conserved Mce-associated membrane protein, equivalent, but longer in N-terminus, to CAC32127.1|AL583926 possible membrane protein from Mycobacterium leprae (182 aa). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv...
260 207079 R | P Rv0175 . 89 Rv0175, (MTCI28.15), len: 213 aa. Probable conserved Mce-associated membrane protein, equivalent, but longer in N-terminus, to CAC32127.1|AL583926 possible membrane protein from Mycobacterium leprae (182 aa). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv...
261 207451 K | N Rv0175 . 213 Rv0175, (MTCI28.15), len: 213 aa. Probable conserved Mce-associated membrane protein, equivalent, but longer in N-terminus, to CAC32127.1|AL583926 possible membrane protein from Mycobacterium leprae (182 aa). Also similar to mce-associated proteins from Mycobacterium tuberculosis e.g. Rv1363c, Rv...
262 208158 P | L Rv0176 . 236 Rv0176, (MTCI28.16), len: 322 aa. Probable conserved Mce-associated transmembrane protein. Contains short region of similarity to PRA_MYCLE|P41484 proline-rich antigen (36 kDa antigen) from Mycobacterium leprae (249 aa) (outside the proline-rich region), FASTA scores: opt: 165, E(): 2.9e-05, (40....
263 209844 G | A,R Rv0179c lprO 324 Possible lipoprotein LprO
264 211596 F | G,L,V Rv0180c . 219 Rv0180c, (MTCI28.20c), len: 452 aa. Probable conserved transmembrane protein, equivalent to CAC32132.1|AL583926 probable conserved membrane protein from Mycobacterium leprae (465 aa). Shows some similarity with others membrane proteins e.g. AL096849|SCI11_29 from Streptomyces coelicolor (354 aa),...
265 213281 G | D Rv0182c sigG 287 Probable alternative RNA polymerase sigma factor SigG (RNA polymerase ECF type sigma factor)
266 213335 R | P Rv0182c sigG 269 Probable alternative RNA polymerase sigma factor SigG (RNA polymerase ECF type sigma factor)
267 213587 A | G Rv0182c sigG 185 Probable alternative RNA polymerase sigma factor SigG (RNA polymerase ECF type sigma factor)
268 214091 V | F Rv0182c sigG 17 Probable alternative RNA polymerase sigma factor SigG (RNA polymerase ECF type sigma factor)
269 214093 V | C Rv0182c sigG 17 Probable alternative RNA polymerase sigma factor SigG (RNA polymerase ECF type sigma factor)
270 220883 I | H,P,V Rv0189c ilvD 281 Probable dihydroxy-acid dehydratase IlvD (dad)
271 223916 G | D Rv0192 . 118 Rv0192, (MTCI28.31), len: 366 aa. Conserved hypothetical protein. Has Gly- Arg-rich region followed by highly Pro-rich repetitive region near N-terminus. Similar in C-terminus to other hypothetical proteins e.g. Q49706|B1496_F2_81|U00013 from Mycobacterium leprae (271 aa), FASTA scores: opt: 375,...
272 223942 S | P Rv0192 . 127 Rv0192, (MTCI28.31), len: 366 aa. Conserved hypothetical protein. Has Gly- Arg-rich region followed by highly Pro-rich repetitive region near N-terminus. Similar in C-terminus to other hypothetical proteins e.g. Q49706|B1496_F2_81|U00013 from Mycobacterium leprae (271 aa), FASTA scores: opt: 375,...
273 224338 P | S Rv0192 . 259 Rv0192, (MTCI28.31), len: 366 aa. Conserved hypothetical protein. Has Gly- Arg-rich region followed by highly Pro-rich repetitive region near N-terminus. Similar in C-terminus to other hypothetical proteins e.g. Q49706|B1496_F2_81|U00013 from Mycobacterium leprae (271 aa), FASTA scores: opt: 375,...
274 224422 Q | E Rv0192 . 287 Rv0192, (MTCI28.31), len: 366 aa. Conserved hypothetical protein. Has Gly- Arg-rich region followed by highly Pro-rich repetitive region near N-terminus. Similar in C-terminus to other hypothetical proteins e.g. Q49706|B1496_F2_81|U00013 from Mycobacterium leprae (271 aa), FASTA scores: opt: 375,...
275 224600 V | A Rv0192 . 346 Rv0192, (MTCI28.31), len: 366 aa. Conserved hypothetical protein. Has Gly- Arg-rich region followed by highly Pro-rich repetitive region near N-terminus. Similar in C-terminus to other hypothetical proteins e.g. Q49706|B1496_F2_81|U00013 from Mycobacterium leprae (271 aa), FASTA scores: opt: 375,...
276 225085 D | H Rv0193c . 496 Rv0193c, (MTV033.01c-MTCI28.32), len: 615 aa. Hypothetical unknown protein.
277 230170 P | L,R Rv0194 . 1098 Rv0194, (MTV033.02), len: 1194 aa. Probable multidrug efflux pump (See Danilchanka et al., 2008),highly similar to many e.g. U62129|STU62129_2|T30293 ABC transport protein homolog from Salmonella typhi (1218 aa),FASTA scores: opt: 1116, E(): 0, (36.3% identity in 1209 aa overlap); CAB66302.1|AL13...
278 230921 S | L Rv0195 . 8 Rv0195, (MTV033.03), len: 211 aa. Possible two-component response regulator, luxR family, similar to many e.g. U00008|ECOHU49_15 regulatory protein narP from Escherichia coli strain K12 (225 aa), FASTA scores: opt: 232, E(): 7.3e-09, (29.2% identity in 219 aa overlap). Start chosen by similarity....
279 232574 G | V Rv0197 . 115 Rv0197, (MTV033.05), len: 762 aa. Possible oxidoreductase, similar to others e.g. 9948789|AAG06102.1|AE004699_7|B83307 probable molybdopterin oxidoreductase from Pseudomonas aeruginosa strain PAO1 (769 aa); 5441785|CAB46809.1|AL096811|T36812 probable dehydrogenase from Streptomyces coelicolor (74...
280 234477 Y | * Rv0197 . 749 Rv0197, (MTV033.05), len: 762 aa. Possible oxidoreductase, similar to others e.g. 9948789|AAG06102.1|AE004699_7|B83307 probable molybdopterin oxidoreductase from Pseudomonas aeruginosa strain PAO1 (769 aa); 5441785|CAB46809.1|AL096811|T36812 probable dehydrogenase from Streptomyces coelicolor (74...
281 234496 P | A,C,G,L,R,V Rv0197 . 756 Rv0197, (MTV033.05), len: 762 aa. Possible oxidoreductase, similar to others e.g. 9948789|AAG06102.1|AE004699_7|B83307 probable molybdopterin oxidoreductase from Pseudomonas aeruginosa strain PAO1 (769 aa); 5441785|CAB46809.1|AL096811|T36812 probable dehydrogenase from Streptomyces coelicolor (74...
282 234500 A | F,I,L,P,R,S,V Rv0197 . 757 Rv0197, (MTV033.05), len: 762 aa. Possible oxidoreductase, similar to others e.g. 9948789|AAG06102.1|AE004699_7|B83307 probable molybdopterin oxidoreductase from Pseudomonas aeruginosa strain PAO1 (769 aa); 5441785|CAB46809.1|AL096811|T36812 probable dehydrogenase from Streptomyces coelicolor (74...
283 234964 W | S Rv0198c zmp1 515 Probable zinc metalloprotease Zmp1
284 236431 G | V Rv0198c zmp1 26 Probable zinc metalloprotease Zmp1
285 238535 T | A,H,P Rv0202c mmpL11 920 Probable conserved transmembrane transport protein MmpL11
286 238814 G | P,R Rv0202c mmpL11 827 Probable conserved transmembrane transport protein MmpL11
287 239733 D | E Rv0202c mmpL11 521 Probable conserved transmembrane transport protein MmpL11
288 243592 L | P Rv0205 . 70 Rv0205, (MTV033.13), len: 367 aa. Possible conserved transmembrane protein, similar to hypothetical proteins from many bacteria e.g. AL0209|SC4H8_6 from Streptomyces coelicolor (402 aa), FASTA scores: opt: 436, E(): 1.7e-21,(27.2% identity in 349 aa overlap); Z99117|BSUB0014_221 from Bacillus sub...
289 247637 S | L Rv0207c . 159 Rv0207c, (MTCY08D5.02c), len: 242 aa. Conserved hypothetical protein, equivalent to Z95398|MLCL622_19 from Mycobacterium leprae (261 aa), FASTA scores: E(): 0, (60.8 identity in 199 aa overlap). A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al.,2004).
290 248176 A | G Rv0208c . 244 Rv0208c, (MTCY08D5.03c), len: 263 aa. Hypothetical methyltransferase, equivalent to Z95398|MLCL622_20 from Mycobacterium leprae (279 aa), FASTA score: (64.2% identity in 246 aa overlaps). Also similar to others e.g. 10178368|CAC08407.1|AL392177|Q9F305|MT04_STRCO|SCD17A.03c hypothetical methlytran...
291 249522 V | A Rv0209 . 162 Rv0209, (MTCY08D5.04), len: 361 aa. Hypothetical unknown protein.
292 251256 W | * Rv0210 . 379 Rv0210, (MTCY08D5.05), len: 492 aa. Hypothetical unknown protein. Possibly membrane protein; has hydrophobic stretches around aa 333 - 381.
293 251299 D | N Rv0210 . 394 Rv0210, (MTCY08D5.05), len: 492 aa. Hypothetical unknown protein. Possibly membrane protein; has hydrophobic stretches around aa 333 - 381.
294 251575 A | T Rv0210 . 486 Rv0210, (MTCY08D5.05), len: 492 aa. Hypothetical unknown protein. Possibly membrane protein; has hydrophobic stretches around aa 333 - 381.
295 252083 N | T Rv0211 pckA 101 Probable iron-regulated phosphoenolpyruvate carboxykinase [GTP] PckA (phosphoenolpyruvate carboxylase) (PEPCK)(pep carboxykinase)
296 254705 Y | H Rv0213c . 416 Rv0213c, (MTCY08D5.08c), len: 437 aa. Possible methyltransferase, weakly similar to others methyltransferases e.g. AF127374_30|LINA from Streptomyces lavendulae (611 aa), FASTA scores: opt: 400, E(): 8.1e-19,(27.3% identity in 388 aa overlap); Q50258 fortimicin kl1 methyltransferase (553 aa), FAS...
297 255373 D | G Rv0213c . 193 Rv0213c, (MTCY08D5.08c), len: 437 aa. Possible methyltransferase, weakly similar to others methyltransferases e.g. AF127374_30|LINA from Streptomyces lavendulae (611 aa), FASTA scores: opt: 400, E(): 8.1e-19,(27.3% identity in 388 aa overlap); Q50258 fortimicin kl1 methyltransferase (553 aa), FAS...
298 256105 M | I Rv0214 fadD4 14 Probable fatty-acid-CoA ligase FadD4 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase)
299 256182 G | A Rv0214 fadD4 40 Probable fatty-acid-CoA ligase FadD4 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase)
300 256830 T | I Rv0214 fadD4 256 Probable fatty-acid-CoA ligase FadD4 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase)
301 258666 V | G Rv0215c fadE3 64 Probable acyl-CoA dehydrogenase FadE3
302 258673 S | A Rv0215c fadE3 62 Probable acyl-CoA dehydrogenase FadE3
303 259031 H | P Rv0216 . 40 Rv0216, (MTCY08D5.11), len: 337 aa. Double hotdog R-specific hydratase of unknown function, shows no activity for crotonyl-CoA, equivalent to Z95398|MLCL622_22 from Mycobacterium leprae (339 aa), FASTA scores: E(): 0, (73.7 identity in 338 aa overlap). Shows structural similarity to six others in...
304 259435 I | T Rv0216 . 175 Rv0216, (MTCY08D5.11), len: 337 aa. Double hotdog R-specific hydratase of unknown function, shows no activity for crotonyl-CoA, equivalent to Z95398|MLCL622_22 from Mycobacterium leprae (339 aa), FASTA scores: E(): 0, (73.7 identity in 338 aa overlap). Shows structural similarity to six others in...
305 261461 T | P Rv0218 . 180 Rv0218, (MTCY08D5.13), len: 442 aa. Probable conserved transmembrane protein, some similarity with sulfite oxidases e.g. SUOX_HUMAN|P51687 sulfite oxidase precursor (488 aa), FASTA scores: opt: 153, E(): 0.0087,(28.6% identity in 161 aa overlap); and with some nitrate reductases e.g. NIA_FUSOX|P3...
306 261869 C | R Rv0218 . 316 Rv0218, (MTCY08D5.13), len: 442 aa. Probable conserved transmembrane protein, some similarity with sulfite oxidases e.g. SUOX_HUMAN|P51687 sulfite oxidase precursor (488 aa), FASTA scores: opt: 153, E(): 0.0087,(28.6% identity in 161 aa overlap); and with some nitrate reductases e.g. NIA_FUSOX|P3...
307 262023 V | G Rv0218 . 367 Rv0218, (MTCY08D5.13), len: 442 aa. Probable conserved transmembrane protein, some similarity with sulfite oxidases e.g. SUOX_HUMAN|P51687 sulfite oxidase precursor (488 aa), FASTA scores: opt: 153, E(): 0.0087,(28.6% identity in 161 aa overlap); and with some nitrate reductases e.g. NIA_FUSOX|P3...
308 262494 V | F Rv0219 . 81 Rv0219, (MTCY08D5.14), len: 182 aa. Probable conserved transmembrane protein, showing similarity with CAB76992.1|AL159178 putative lipoprotein from Streptomyces coelicolor (163 aa).
309 263149 E | A Rv0220 lipC 113 Probable esterase LipC
310 264129 M | I Rv0221 . 21 Rv0221, (MTCY08D5.16), len: 469 aa. Possible triacylglycerol synthase (See Daniel et al., 2004), similar to other proteins from Mycobacterium tuberculosis e.g. Q50680|Rv2285|MT2343|MTCY339.25c 47.7 kDa protein (445 aa),FASTA scores: opt: 455, E(): 8.1e-23, (26.7% identity in 461 aa overlap); Rv37...
311 265886 A | V Rv0222 echA1 127 Probable enoyl-CoA hydratase EchA1 (enoyl hydrase) (unsaturated acyl-CoA hydratase) (crotonase)
312 271658 E | D Rv0227c . 395 Rv0227c, (MTCY08D5.22c), len: 421 aa. Possible conserved membrane protein, equivalent to AL022486|MLCB1883_4 from Mycobacterium leprae (448 aa),FASTA scores: opt: 2148, E(): 0, (76.6% identity in 423 aa overlap). A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al.,...
313 273925 A | T Rv0228 . 291 Rv0228, (MTCY08D5.23), len: 407 aa. Probable integral membrane acyltransferase, equivalent to 3063875|CAA18555.1|AL022486|T44870 acyltransferase from Mycobacterium leprae (384 aa), FASTA scores: opt: 2004,E(): 0, (79.3% identity in 381 aa overlap). Also similar to others e.g. Q11064 probable acyl...
314 278681 H | D Rv0233 nrdB 33 Ribonucleoside-diphosphate reductase (beta chain) NrdB (ribonucleotide reductase small chain)
315 279177 I | A Rv0233 nrdB 198 Ribonucleoside-diphosphate reductase (beta chain) NrdB (ribonucleotide reductase small chain)
316 279516 A | V Rv0233 nrdB 311 Ribonucleoside-diphosphate reductase (beta chain) NrdB (ribonucleotide reductase small chain)
317 280860 R | H Rv0234c gabD1 94 Succinate-semialdehyde dehydrogenase [NADP+] dependent (SSDH) GabD1
318 282188 R | C Rv0235c . 143 Rv0235c, (MTCY08D5.31c), len: 482 aa. Probable conserved transmembrane protein, highly similar to AL133278|CAB61913.1|SCM11_2 putative integral membrane protein from Streptomyces coelicolor (470 aa), FASTA scores: opt: 2116, E(): 0, (61.8% identity in 474 aa overlap); and similar to hypothetical ...
319 283313 T | I Rv0236c aftD 1180 Possible arabinofuranosyltransferase AftD
320 283610 A | V Rv0236c aftD 1081 Possible arabinofuranosyltransferase AftD
321 287747 T | P Rv0237 lpqI 188 Probable conserved lipoprotein LpqI
322 290062 P | R Rv0241c htdX 198 Probable 3-hydroxyacyl-thioester dehydratase HtdX
323 290374 H | R Rv0241c htdX 94 Probable 3-hydroxyacyl-thioester dehydratase HtdX
324 291607 E | D Rv0242c fabG4 142 Probable 3-oxoacyl-[acyl-carrier protein] reductase FabG4 (3-ketoacyl-acyl carrier protein reductase)
325 292547 V | G Rv0243 fadA2 126 Probable acetyl-CoA acyltransferase FadA2 (3-ketoacyl-CoA thiolase) (beta-ketothiolase)
326 296312 S | F Rv0245 . 103 Rv0245, (MTV034.11), len: 162 aa. Possible oxidoreductase, equivalent to AL022486|MLCB1883_17|T44882 probable oxidoreductase from Mycobacterium leprae (162 aa),FASTA scores: opt: 860, E(): 0, (83.4% identity in 157 aa overlap). Also similar to several hypothetical proteins and various oxidoreduct...
327 299395 S | T Rv0248c . 470 Rv0248c, (MTV034.14c), len: 646 aa. Probable succinate dehydrogenase, flavoprotein subunit, highly similar to flavoprotein subunit of various succinate dehydrogenases e.g. M88696|RIRSDHA_1 flavoprotein from Rickettsia prowazekii (596 aa), FASTA scores: opt: 651,E(): 0, (34.6 % identity in 598 aa ...
328 299952 L | S Rv0248c . 285 Rv0248c, (MTV034.14c), len: 646 aa. Probable succinate dehydrogenase, flavoprotein subunit, highly similar to flavoprotein subunit of various succinate dehydrogenases e.g. M88696|RIRSDHA_1 flavoprotein from Rickettsia prowazekii (596 aa), FASTA scores: opt: 651,E(): 0, (34.6 % identity in 598 aa ...
329 300193 A | V Rv0248c . 204 Rv0248c, (MTV034.14c), len: 646 aa. Probable succinate dehydrogenase, flavoprotein subunit, highly similar to flavoprotein subunit of various succinate dehydrogenases e.g. M88696|RIRSDHA_1 flavoprotein from Rickettsia prowazekii (596 aa), FASTA scores: opt: 651,E(): 0, (34.6 % identity in 598 aa ...
330 302541 A | T Rv0251c hsp 38 Heat shock protein Hsp (heat-stress-induced ribosome-binding protein A)
331 302881 S | G Rv0252 nirB 6 Probable nitrite reductase [NAD(P)H] large subunit [FAD flavoprotein] NirB
332 305188 V | L,M Rv0252 nirB 775 Probable nitrite reductase [NAD(P)H] large subunit [FAD flavoprotein] NirB
333 305940 P | S Rv0254c cobU 137 Probable bifunctional cobalamin biosynthesis protein CobU: cobinamide kinase + cobinamide phosphate guanylyltransferase
334 306418 G | R Rv0255c cobQ1 481 Probable cobyric acid synthase CobQ1
335 306639 E | G Rv0255c cobQ1 407 Probable cobyric acid synthase CobQ1
336 308260 G | V Rv0256c . 430 PPE family protein PPE2
337 308602 K | F,L Rv0256c . 316 PPE family protein PPE2
338 309913 R | L Rv0257 . 72 Rv0257, len: 124 aa. Hypothetical protein,orthologue of ML1828A conserved hypothetical protein from Mycobacterium leprae. Replaced Rv0257c (older annotation). A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al., 2004). Predicted to be an outer membrane protein (See...
339 310973 A | V Rv0259c . 182 Rv0259c, (MTCY06A4.03c), len: 247 aa. Conserved hypothetical protein, showing some similarity to Rv2393|Z81368|MTCY253_28 from Mycobacterium tuberculosis (281 aa), FASTA scores: E(): 9.5e-16, (33.6 % identity in 235 aa overlap). Also some similarity with CAC33938.1|AL589708 putative secreted prot...
340 312871 F | G,V Rv0261c narK3 433 Probable integral membrane nitrite extrusion protein NarK3 (nitrite facilitator)
341 317718 V | G Rv0266c oplA 1146 Probable 5-oxoprolinase OplA (5-oxo-L-prolinase) (pyroglutamase) (5-OPASE)
342 318687 T | K,N Rv0266c oplA 823 Probable 5-oxoprolinase OplA (5-oxo-L-prolinase) (pyroglutamase) (5-OPASE)
343 319600 R | C Rv0266c oplA 519 Probable 5-oxoprolinase OplA (5-oxo-L-prolinase) (pyroglutamase) (5-OPASE)
344 321755 H | P Rv0267 narU 142 Probable integral membrane nitrite extrusion protein NarU (nitrite facilitator)
345 331633 R | H Rv0275c . 9 Rv0275c, (MTV035.03c), len: 241 aa. Possible transcriptional regulator, TetR family, similar to others e.g. Q9RJE7|SCF81.04c putative TetR-family transcriptional regulator from Streptomyces coelicolor (219 aa); Q9FBI8|SCP8.33c putative TetR-family transcriptional regulator from Streptomyces coeli...
346 332869 V | L,M Rv0277c vapC25 90 Possible toxin VapC25 Contains PIN domain
347 333059 L | F Rv0277c vapC25 27 Possible toxin VapC25 Contains PIN domain
348 333612 D | V Rv0278c . 900 PE-PGRS family protein PE_PGRS3
349 333637 W | G,R Rv0278c . 892 PE-PGRS family protein PE_PGRS3
350 333892 R | G Rv0278c . 807 PE-PGRS family protein PE_PGRS3
351 334198 F | V Rv0278c . 705 PE-PGRS family protein PE_PGRS3
352 334331 D | E Rv0278c . 661 PE-PGRS family protein PE_PGRS3
353 334546 A | P Rv0278c . 589 PE-PGRS family protein PE_PGRS3
354 334654 F | I Rv0278c . 553 PE-PGRS family protein PE_PGRS3
355 334657 L | A,G Rv0278c . 552 PE-PGRS family protein PE_PGRS3
356 334677 D | A Rv0278c . 545 PE-PGRS family protein PE_PGRS3
357 335134 F | I Rv0278c . 393 PE-PGRS family protein PE_PGRS3
358 335365 L | R,V Rv0278c . 316 PE-PGRS family protein PE_PGRS3
359 335391 V | A,P,Q Rv0278c . 307 PE-PGRS family protein PE_PGRS3
360 335461 A | G,S Rv0278c . 284 PE-PGRS family protein PE_PGRS3
361 335682 V | A,G,P Rv0278c . 210 PE-PGRS family protein PE_PGRS3
362 335695 A | G,P,Q,R,T Rv0278c . 206 PE-PGRS family protein PE_PGRS3
363 335699 G | A,L,P,Q,R,W Rv0278c . 205 PE-PGRS family protein PE_PGRS3
364 335701 G | A,F,H,L,M,P,R Rv0278c . 204 PE-PGRS family protein PE_PGRS3
365 335704 A | F,G,P,R,V Rv0278c . 203 PE-PGRS family protein PE_PGRS3
366 335714 A | C,F,G,L,P,R,S,T,W Rv0278c . 200 PE-PGRS family protein PE_PGRS3
367 335822 G | A,L Rv0278c . 164 PE-PGRS family protein PE_PGRS3
368 335956 A | T Rv0278c . 119 PE-PGRS family protein PE_PGRS3
369 336053 Y | * Rv0278c . 87 PE-PGRS family protein PE_PGRS3
370 336081 V | A Rv0278c . 77 PE-PGRS family protein PE_PGRS3
371 336193 M | L Rv0278c . 40 PE-PGRS family protein PE_PGRS3
372 336590 I | M Rv0279c . 829 PE-PGRS family protein PE_PGRS4
373 336680 N | H,K,R,S,V Rv0279c . 799 PE-PGRS family protein PE_PGRS4
374 336718 F | G,V Rv0279c . 786 PE-PGRS family protein PE_PGRS4
375 336873 V | G Rv0279c . 734 PE-PGRS family protein PE_PGRS4
376 337420 S | A Rv0279c . 552 PE-PGRS family protein PE_PGRS4
377 337534 W | G Rv0279c . 514 PE-PGRS family protein PE_PGRS4
378 338021 C | A,E,K Rv0279c . 352 PE-PGRS family protein PE_PGRS4
379 338346 A | G,S,V Rv0279c . 243 PE-PGRS family protein PE_PGRS4
380 338530 A | S Rv0279c . 182 PE-PGRS family protein PE_PGRS4
381 338538 T | A,I Rv0279c . 179 PE-PGRS family protein PE_PGRS4
382 338844 V | A Rv0279c . 77 PE-PGRS family protein PE_PGRS4
383 340132 E | K Rv0280 . 257 PPE family protein PPE3
384 340372 S | P Rv0280 . 337 PPE family protein PPE3
385 340672 R | W Rv0280 . 437 PPE family protein PPE3
386 340894 P | S Rv0280 . 511 PPE family protein PPE3
387 342146 E | A Rv0282 eccA3 6 ESX conserved component EccA3 ESX-3 type VII secretion system protein
388 342161 G | D Rv0282 eccA3 11 ESX conserved component EccA3 ESX-3 type VII secretion system protein
389 346275 P | R Rv0284 eccC3 214 ESX conserved component EccC3 ESX-3 type VII secretion system protein Possible membrane protein
390 347421 V | A Rv0284 eccC3 596 ESX conserved component EccC3 ESX-3 type VII secretion system protein Possible membrane protein
391 347456 P | T Rv0284 eccC3 608 ESX conserved component EccC3 ESX-3 type VII secretion system protein Possible membrane protein
392 349132 M | I Rv0284 eccC3 1166 ESX conserved component EccC3 ESX-3 type VII secretion system protein Possible membrane protein
393 350983 P | L Rv0286 . 350 PPE family protein PPE4
394 353309 S | N Rv0290 eccD3 76 ESX conserved component EccD3 ESX-3 type VII secretion system protein Probable transmembrane protein
395 353365 A | T Rv0290 eccD3 95 ESX conserved component EccD3 ESX-3 type VII secretion system protein Probable transmembrane protein
396 353380 P | A Rv0290 eccD3 100 ESX conserved component EccD3 ESX-3 type VII secretion system protein Probable transmembrane protein
397 353735 V | A Rv0290 eccD3 218 ESX conserved component EccD3 ESX-3 type VII secretion system protein Probable transmembrane protein
398 353990 P | R Rv0290 eccD3 303 ESX conserved component EccD3 ESX-3 type VII secretion system protein Probable transmembrane protein
399 356528 N | D Rv0292 eccE3 217 ESX conserved component EccE3 ESX-3 type VII secretion system protein Probable transmembrane protein
400 359118 Y | L,P Rv0295c . 211 Rv0295c, (MTV035.23c), len: 267 aa. Conserved protein, showing weak similarity with CAC46877.1|AL591790 conserved hypothetical protein from Sinorhizobium meliloti (213 aa); and NP_104818.1|14023999|BAB50604.1|AP00300 Protein with weak similarity to NodH from Mesorhizobium loti (257 aa). Predicted...
401 360035 P | R Rv0296c . 374 Rv0296c, (MTCY63.01c, MTV035.24c), len: 465 aa. Probable sulfatase, possibly an aryl-/steryl-sulfatase or a sulfamidase (sulfohydrolase) (sulphamidase). Similar to various hydrolases e.g. AAG41945.1|AF304053_1|AF304053 heparan N-sulfatase from Mus musculus (502 aa); NP_061292.1|6851181|AAF29460.1...
402 361032 A | P,S Rv0296c . 42 Rv0296c, (MTCY63.01c, MTV035.24c), len: 465 aa. Probable sulfatase, possibly an aryl-/steryl-sulfatase or a sulfamidase (sulfohydrolase) (sulphamidase). Similar to various hydrolases e.g. AAG41945.1|AF304053_1|AF304053 heparan N-sulfatase from Mus musculus (502 aa); NP_061292.1|6851181|AAF29460.1...
403 361805 A | T Rv0297 . 158 PE-PGRS family protein PE_PGRS5
404 361820 N | Y Rv0297 . 163 PE-PGRS family protein PE_PGRS5
405 361839 D | A,G Rv0297 . 169 PE-PGRS family protein PE_PGRS5
406 362999 K | E Rv0297 . 556 PE-PGRS family protein PE_PGRS5
407 366830 S | R Rv0304c . 1979 PPE family protein PPE5
408 366917 T | A Rv0304c . 1950 PPE family protein PPE5
409 366947 F | I,T Rv0304c . 1940 PPE family protein PPE5
410 368060 S | H,L Rv0304c . 1569 PPE family protein PPE5
411 368087 F | I,Q Rv0304c . 1560 PPE family protein PPE5
412 368102 I | F,Q Rv0304c . 1555 PPE family protein PPE5
413 368427 S | G Rv0304c . 1447 PPE family protein PPE5
414 370764 L | F Rv0304c . 668 PPE family protein PPE5
415 370857 K | N Rv0304c . 637 PPE family protein PPE5
416 371561 G | R Rv0304c . 402 PPE family protein PPE5
417 372646 D | A Rv0304c . 40 PPE family protein PPE5
418 373239 F | L Rv0305c . 825 PPE family protein PPE6
419 373329 F | A,V Rv0305c . 795 PPE family protein PPE6
420 373353 T | G,P,S Rv0305c . 787 PPE family protein PPE6
421 375435 A | R Rv0305c . 93 PPE family protein PPE6
422 382086 S | T Rv0312 . 511 Rv0312, (MTCY63.17), len: 620 aa. Conserved hypothetical protein with highly Pro-, Thr-rich C-terminus. Similar to Pro-,Thr-rich region in Rv2264c|AL021925|MTV022_14 from Mycobacterium tuberculosis (592 aa), FASTA scores: opt: 1075, E(): 0, (38.9% identity in 627 aa overlap). Also some similarity...
423 383716 P | S Rv0315 . 39 Rv0315, (MTCY63.20), len: 294 aa. Possible beta-1,3-glucanase precursor (has hydrophobic stretch in its N-terminal part), similar to others e.g. Q51333|AAC44371.1 beta-1,3-glucanase II a from Oerskovia xanthineolytica (306 aa), FASTA scores: opt: 76, E(): 3e-14, (34.1% identity in 302 aa overlap)...
424 384608 A | D Rv0316 . 25 Rv0316, (MTCY63.21), len: 204 aa. Possible muconolactone isomerase, showing weak similarity with some muconolactone isomerases e.g. O33947|CTC1_ACILW muconolactone delta-isomerase 1 (MIASE 1)(96 aa), FASTA scores: opt: 179, E(): 3.9e-05, (32.6% identity in 92 aa overlap).
425 386432 G | A Rv0318c . 223 Rv0318c, (MTCY63.23c), len: 264 aa. Probable conserved integral membrane protein, with some similarity to C-terminus of GUFA_MYXXA|Q06916 (254 aa), FASTA scores: opt: 157, E (): 0.0032, (28.3% identity in 198 aa overlap). Also similar to O26573 conserved protein from Methanobacterium thermoauto (...
426 390828 S | G Rv0323c . 142 Rv0323c, (MTCY63.28c), len: 223 aa. Conserved hypothetical protein, similar to others e.g. YPJG_BACSU|P42981 hypothetical 24.8 kDa protein from Bacillus subtilis (224 aa), FASTA scores: opt: 182, E(): 1.3e-05, (27.5% identity in 211 aa overlap). Also some similarity to MLU15183_8 from Mycobacteri...
427 391788 A | V Rv0324 . 146 Rv0324, (MTCY63.29), len: 226 aa. Possible transcriptional regulator, arsR family, with its N-terminus similar to the N-terminus of other DNA-binding proteins e.g. P30346|MERR_STRLI probable mercury resistance operon from Streptomyces lividans (125 aa), FASTA scores: opt: 154, E(): 0.002, (32.2% ...
428 392261 * | Q Rv0325 . 75 Rv0325, (MTCY63.30), len: 74 aa. Hypothetical unknown protein. This region is a possible MT-complex-specific genomic island (See Becq et al.,2007).
429 392329 Q | H Rv0326 . 19 Rv0326, (MTCY63.31), len: 151 aa. Hypothetical unknown protein. This region is a possible MT-complex-specific genomic island (See Becq et al.,2007).
430 392773 V | P,R Rv0327c . 425 Possible cytochrome P450 135A1 Cyp135A1
431 393391 S | P Rv0327c . 219 Possible cytochrome P450 135A1 Cyp135A1
432 393642 R | X Rv0327c . 135 Possible cytochrome P450 135A1 Cyp135A1
433 393941 Q | I,M Rv0327c . 35 Possible cytochrome P450 135A1 Cyp135A1
434 394874 V | F Rv0329c . 150 Rv0329c, (MTCY63.34c), len: 208 aa. Conserved hypothetical protein, showing some similarity with others hypothetical proteins and methyltransferases e.g. MitM|AF127374_14 methyltransferase from Streptomyces lavendulae (283 aa), FASTA scores: opt: 242, E(): 1.8e-08,(37.2% identity in 145 aa overla...
435 396667 V | G Rv0331 . 156 Rv0331, (MTCY63.36), len: 388 aa. Possible dehydrogenase/reductase, similar to various dehydrogenases/reductases e.g. NP_103779.1|14022957|BAB49565.1|AP002999 flavoprotein reductase from Mesorhizobium loti (377 aa); NP_147681.1 predicted NAD(FAD)-dependent dehydrogenase from Aeropyrum pernix (381...
436 397275 A | P Rv0331 . 359 Rv0331, (MTCY63.36), len: 388 aa. Possible dehydrogenase/reductase, similar to various dehydrogenases/reductases e.g. NP_103779.1|14022957|BAB49565.1|AP002999 flavoprotein reductase from Mesorhizobium loti (377 aa); NP_147681.1 predicted NAD(FAD)-dependent dehydrogenase from Aeropyrum pernix (381...
437 398558 V | G Rv0333 . 102 Rv0333, (MTCY63.38), len: 124 aa. Unknown protein.
438 403472 N | K Rv0338c . 791 Rv0338c, (MTCY279.05c), len: 882 aa. Probable iron-sulphur-binding reductase, possibly membrane-bound,equivalent to CAC32018.1|AL583925 probable iron-sulphur-binding reductase from Mycobacterium leprae (880 aa). Also highly similar to others e.g. T36608|5019323|CAB44376.1|AL078610 probable iron-s...
439 403980 A | V Rv0338c . 621 Rv0338c, (MTCY279.05c), len: 882 aa. Probable iron-sulphur-binding reductase, possibly membrane-bound,equivalent to CAC32018.1|AL583925 probable iron-sulphur-binding reductase from Mycobacterium leprae (880 aa). Also highly similar to others e.g. T36608|5019323|CAB44376.1|AL078610 probable iron-s...
440 410935 V | G Rv0342 iniA 33 Isoniazid inductible gene protein IniA
441 413864 R | C Rv0343 iniC 370 Isoniazid inductible gene protein IniC
442 415863 Q | P Rv0346c ansP2 368 Possible L-asparagine permease AnsP2 (L-asparagine transport protein)
443 415915 T | P Rv0346c ansP2 351 Possible L-asparagine permease AnsP2 (L-asparagine transport protein)
444 417647 V | G Rv0347 . 115 Rv0347, (MTCY13E10.07), len: 328 aa (alternative start possible). Probable conserved membrane protein,similar to Rv0831c|AL022004|MTV043_23 from Mycobacterium tuberculosis (271 aa), FASTA scores: E(): 9.6e-21, (33.1% identity in 266 aa overlap). This region is a possible MT-complex-specific genom...
445 420405 R | C Rv0350 dnaK 191 Probable chaperone protein DnaK (heat shock protein 70) (heat shock 70 kDa protein) (HSP70)
446 423014 G | D Rv0352 dnaJ1 188 Probable chaperone protein DnaJ1
447 424322 G | *,K,R Rv0354c . 125 PPE family protein PPE7
448 425223 S | G,T Rv0355c . 3153 PPE family protein PPE8
449 425341 G | A,C,P Rv0355c . 3114 PPE family protein PPE8
450 425343 T | A,H,P,R,S Rv0355c . 3113 PPE family protein PPE8
451 427083 D | N Rv0355c . 2533 PPE family protein PPE8
452 427098 I | *,F,K,L,N,Q,R,S,V Rv0355c . 2528 PPE family protein PPE8
453 427110 N | G,T Rv0355c . 2524 PPE family protein PPE8
454 427325 S | N,R Rv0355c . 2452 PPE family protein PPE8
455 427338 A | P,Q,S Rv0355c . 2448 PPE family protein PPE8
456 427356 S | A,G,H,P,Q,R Rv0355c . 2442 PPE family protein PPE8
457 427587 S | L,P,T Rv0355c . 2365 PPE family protein PPE8
458 429746 V | D Rv0355c . 1645 PPE family protein PPE8
459 431242 N | K Rv0355c . 1147 PPE family protein PPE8
460 433104 F | D,E,V Rv0355c .
526 PPE family protein PPE8 461 433981 S | H,R Rv0355c . 234 PPE family protein PPE8 462 433986 I | L,Q,T Rv0355c . 232 PPE family protein PPE8 463 434472 A | C,P,R Rv0355c . 70 PPE family protein PPE8 464 435689 Q | E Rv0357c purA 361 Probable adenylosuccinate synthetase PurA (imp--aspartate ligase) (ADSS) (ampsase) 465 437380 I | T Rv0358 . 174 Rv0358, (MTCY13E10.20), len: 215 aa. Conserved protein, highly similar to ML0281|AL023514|MLCB4_14 conserved hypothetical protein from Mycobacterium leprae (229 aa), FASTA scores: opt: 852, E(): 0, (62.9% identity in 229 aa overlap). A core mycobacterial gene; conserved in mycobacterial strains (... 466 441823 M | I Rv0363c fba 160 Probable fructose-bisphosphate aldolase Fba 467 443050 D | A Rv0364 . 219 Rv0364, (MTCY13E10.26), len: 227 aa. Possible conserved transmembrane protein, equivalent to O69601|Y364_MYCLE|ML0287|CAA18951.1|AL023514|AL023514|MLCB 4_19 hypothetical 24.3 KDA protein from Mycobacterium leprae (222 aa), FASTA scores: opt: 1027, E(): 0, (66.1% identity in 227 aa overlap). Shows... 468 443554 L | F Rv0365c . 215 Rv0365c, (MTCY13E10.27c), len: 376 aa (start uncertain). Conserved protein (see citation below), very similar to G388212|CAA35191.1, a truncated ORF immediately upstream of the Corynebacterium glutamicum fda gene encoding fructose-1,6-biphosphate aldolase (304 aa), FASTA scores: E(): 7.1e-19, (42... 469 444261 F | M,Q Rv0366c . 186 Rv0366c, (MTV036.01c), len: 197 aa. Conserved hypothetical protein, showing weak similarity to HI1395|P44173|YD95_HAEIN hypothetical protein from Haemophilus influenzae (140 aa), FASTA scores: opt: 152,E(): 0.0015, (27.0% identity in 126 aa overlap). Contains PS00017 ATP/GTP-binding site motif A ... 470 446437 V | A Rv0368c . 30 Rv0368c, (MTV036.03c), len: 403 aa. 
Conserved hypothetical protein, showing some similarity to AJ224684|BJAJ4684_4 cooxS protein from Bradyrhizobium japonicum (422 aa), FASTA scores: opt: 341, E(): 4.3e-13,(27.4% identity in 387 aa overlap); Rv2425c|MTCY428_22 hypothetical protein from Mycobacter... 471 447551 T | A Rv0370c . 165 Rv0370c, (MTV036.05c), len: 298 aa. Possible oxidoreductase, similar to many hypothetical proteins, but also similar to ORF4|X82447|OCCOXMSL4_4 Protein of coxMSL gene cluster from Pseudomonas/Oligotropha carboxidovorans (295 aa), FASTA scores: opt: 851, E(): 0, (48.2% identity in 282 aa overlap);... 472 450112 G | R Rv0373c . 565 Rv0373c, (MTV036.08c), len: 799 aa. Probable carbon monoxide dehydrogenase, large chain, highly similar to others e.g. AAD00363.1| U80806|CUTL carbon monoxide dehydrogenase large subunit CutL protein from Hydrogenophaga pseudoflava (803 aa); S49124|509391|CAA54902.1|X77931|1094915|2107180C|CUTA c... 473 453055 N | H,P Rv0375c . 34 Rv0375c, (MTV036.10c), len: 286 aa. Probable carbon monoxide dehydrogenase, medium chain, similar to others e.g. AAD00361.1|U80806|CUTM carbon monoxide dehydrogenase middle subunit from Hydrogenophaga pseudoflava (287 aa); S49122|509389|CAA54900.1|X77931|CUTB carbon-monoxide dehydrogenase medium ... 474 455749 G | D Rv0378 . 38 Rv0378, (MTV036.13), len: 73 aa. Conserved hypothetical gly-rich protein, showing some similarity to Mycobacterium tuberculosis PE_PGRS family; also similar to MTCY06H11_16|Z85982 hypothetical glycine-rich 88.5 KD protein (1011 aa), FASTA scores: opt: 237, E(): 0.0032,(58.7% identity in 63 aa ove... 475 456299 R | P Rv0380c . 174 Rv0380c, (MTV036.15c), len: 183 aa. Possible RNA methyltransferase, equivalent to CAC32002.1|AL583925 possible RNA methyltransferase from Mycobacterium leprae (182 aa). Also some similarity with others methyltransferases e.g. P19396|TRMH_ECOLI|78514|JV0043 tRNA (guanosine-2'-O-)-methyltransferase... 476 458517 R | A,F,L,P,T Rv0383c . 267 Rv0383c, (MTV036.18c), len: 284 aa. 
Possible conserved secreted protein, with hydrophobic stretch in N-terminus and Pro-rich C-terminus. Equivalent to CAC32006.1|AL583925 possible secreted protein from Mycobacterium leprae (286 aa). A core mycobacterial gene; conserved in mycobacterial strains (S... 477 466797 S | A Rv0387c . 204 Rv0387c, (MTV036.22c), len: 244 aa. Conserved hypothetical protein, showing some similarity to MTCI237.20c, and M17282|HUMEL20_1 Human elastin gene, exon 1, Elastin (687 aa), FASTA scores: opt: 193, E(): 0.35,(34.4% identity in 189 aa overlap). 478 466808 A | G Rv0387c . 200 Rv0387c, (MTV036.22c), len: 244 aa. Conserved hypothetical protein, showing some similarity to MTCI237.20c, and M17282|HUMEL20_1 Human elastin gene, exon 1, Elastin (687 aa), FASTA scores: opt: 193, E(): 0.35,(34.4% identity in 189 aa overlap). 479 467500 P | N,R,S,T Rv0388c . 168 PPE family protein PPE9 480 467516 H | Q Rv0388c . 163 PPE family protein PPE9 481 467638 Q | K Rv0388c . 122 PPE family protein PPE9 482 470289 T | P Rv0391 metZ 94 Probable O-succinylhomoserine sulfhydrylase MetZ (OSH sulfhydrylase) 483 471666 M | T Rv0392c ndhA 325 Probable membrane NADH dehydrogenase NdhA 484 478236 D | E Rv0399c lpqK 108 Possible conserved lipoprotein LpqK 485 481609 K | I,Y Rv0402c mmpL1 542 Probable conserved transmembrane transport protein MmpL1 486 483287 E | A,G,P,R Rv0403c mmpS1 124 Probable conserved membrane protein MmpS1 487 483294 M | F,H,P,R,S,V Rv0403c mmpS1 122 Probable conserved membrane protein MmpS1 488 484005 R | L Rv0404 fadD30 10 Fatty-acid-AMP ligase FadD30 (fatty-acid-AMP synthetase) (fatty-acid-AMP synthase) 489 484395 P | L Rv0404 fadD30 140 Fatty-acid-AMP ligase FadD30 (fatty-acid-AMP synthetase) (fatty-acid-AMP synthase) 490 484596 P | L Rv0404 fadD30 207 Fatty-acid-AMP ligase FadD30 (fatty-acid-AMP synthetase) (fatty-acid-AMP synthase) 491 486458 S | C,R Rv0405 pks6 243 Probable membrane bound polyketide synthase Pks6 492 488038 V | I Rv0405 pks6 770 Probable membrane bound 
polyketide synthase Pks6 493 491014 T | P Rv0407 fgd1 78 F420-dependent glucose-6-phosphate dehydrogenase Fgd1 494 494101 H | P Rv0409 ackA 84 Probable acetate kinase AckA (acetokinase) 495 498015 V | I Rv0411c glnH 96 Probable glutamine-binding lipoprotein GlnH (GLNBP) 496 498557 D | Y Rv0412c . 355 Rv0412c, (MTCY22G10.08c), len: 439 aa. Possible conserved membrane protein, equivalent to AL035159|MLCB1450_16|T44737 probable membrane protein from Mycobacterium leprae (403 aa), FASTA scores: opt: 2027,E(): 0, (80.4% identity in 403 aa overlap). Also some similarity with CAB71201.1|AL138538 put... 497 502589 S | C Rv0417 thiG 75 Probable thiamin biosynthesis protein ThiG (thiazole biosynthesis protein) 498 506386 I | T Rv0419 lpqM 434 Possible lipoprotein peptidase LpqM 499 509199 G | A,R Rv0423c thiC 343 Probable thiamine biosynthesis protein ThiC 500 510998 D | R Rv0425c ctpH 1442 Possible metal cation transporting P-type ATPase CtpH 501 511381 P | R Rv0425c ctpH 1314 Possible metal cation transporting P-type ATPase CtpH 502 513257 M | V Rv0425c ctpH 689 Possible metal cation transporting P-type ATPase CtpH 503 515527 P | L Rv0426c . 97 Rv0426c, (MTCY22G10.23c), len: 147 aa. Possible transmembrane protein; has potential transmembrane domains aa 19-41, and aa 61-83. 504 516206 G | E Rv0427c xthA 230 Probable exodeoxyribonuclease III protein XthA (exonuclease III) (EXO III) (AP endonuclease VI) 505 517338 E | K Rv0428c . 156 Rv0428c, (MTCY22G10.25c), len: 302 aa. Probable acetyltransferase. Contains GNAT (Gcn5-related N-acetyltransferase) domain in C-terminal part. See Vetting et al. 2005. 506 517358 D | G Rv0428c . 149 Rv0428c, (MTCY22G10.25c), len: 302 aa. Probable acetyltransferase. Contains GNAT (Gcn5-related N-acetyltransferase) domain in C-terminal part. See Vetting et al. 2005. 507 522423 T | K,N,P Rv0435c . 704 Rv0435c, (MTCY22G10.32c), len: 728 aa. Putative conserved ATPase, similar to others e.g. 
SAV_SULAC|Q07590 sav protein involved in cell division from sulfolobus acidocaldarius (780 aa), FASTA scores: opt: 897, E(): 0,(34.5% identity in 693 aa overlap); NP_148637.1|7435761|B72479 transitional endop... 508 524891 G | V Rv0436c pssA 167 Probable CDP-diacylglycerol--serine O-phosphatidyltransferase PssA (PS synthase) (phosphatidylserine synthase) 509 531556 I | T Rv0442c . 220 PPE family protein PPE10 510 531775 W | S Rv0442c . 147 PPE family protein PPE10 511 531901 A | E Rv0442c . 105 PPE family protein PPE10 512 532003 V | A Rv0442c . 71 PPE family protein PPE10 513 535544 T | P Rv0447c ufaA1 322 Probable cyclopropane-fatty-acyl-phospholipid synthase UfaA1 (cyclopropane fatty acid synthase) (CFA synthase) 514 543826 D | A Rv0453 . 218 PPE family protein PPE11 515 544297 V | A Rv0453 . 375 PPE family protein PPE11 516 545515 F | L,P,V Rv0455c . 103 Rv0455c, (MTV037.19c), len: 148 aa. Conserved protein, equivalent to CAC31896.1|AL583925 possible secreted protein from Mycobacterium leprae (153 aa). Has hydrophobic stretch at N-terminus. A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al., 2004). Predicted to be... 517 547120 R | C,F Rv0456A mazF1 15 Possible toxin MazF1 518 548326 T | A Rv0457c . 428 Rv0457c, (MTCI429A.01, MTV038.01c), len: 673 aa. Probable peptidase, similar to many e.g. NP_102851.1|14022026|BAB48637.1 probable endopeptidase from Mesorhizobium loti (687 aa); Y4NA_RHISN|P55577 probable peptidase (726 aa), FASTA scores: opt: 1126, E(): 0, (40.9% identity in 491 aa overlap). Al... 519 549537 A | V Rv0457c . 24 Rv0457c, (MTCI429A.01, MTV038.01c), len: 673 aa. Probable peptidase, similar to many e.g. NP_102851.1|14022026|BAB48637.1 probable endopeptidase from Mesorhizobium loti (687 aa); Y4NA_RHISN|P55577 probable peptidase (726 aa), FASTA scores: opt: 1126, E(): 0, (40.9% identity in 491 aa overlap). Al... 520 551340 D | G Rv0459 . 48 Rv0459, (MTV038.03), len: 163 aa. 
Conserved hypothetical protein, highly similar to other hypothetical proteins. Note that highly similar to products of unidentified ORFs in Xanthobacter autotrophicus, AF029733_2 (139 aa), and Rhodococcus erythropolis, REREUTP BC_1 (186 aa). Like MTV038.03, these... 521 552501 Y | C,S Rv0461 . 159 Rv0461, (MTV038.05), len: 174 aa (start uncertain). Probable transmembrane protein. Nucleotide position 552085 in the genome sequence has been corrected, A:G resulting in Q20Q. 522 552506 C | D,H,N,Y Rv0461 . 161 Rv0461, (MTV038.05), len: 174 aa (start uncertain). Probable transmembrane protein. Nucleotide position 552085 in the genome sequence has been corrected, A:G resulting in Q20Q. 523 555991 C | R Rv0465c . 106 Probable transcriptional regulatory protein 524 556478 L | F,V Rv0466 . 7 Rv0466, (MTV038.10), len: 264 aa. Conserved protein,equivalent to CAC31980.1|AL583925 conserved hypothetical protein from Mycobacterium leprae (264 aa). Similar to Rv2001|Z74025|MTCY39.17c hypothetical 28.7 KDA protein from Mycobacterium tuberculosis (250 aa), FASTA scores: opt: 592, E(): 0, (38.... 525 559746 Y | * Rv0468 fadB2 284 3-hydroxybutyryl-CoA dehydrogenase FadB2 (beta-hydroxybutyryl-CoA dehydrogenase) (BHBD) 526 562064 L | V Rv0470A . 71 Rv0470A, len: 146 aa. Hypothetical unknown protein. GC plot suggests CDS for Cys-rich protein, could possibly be continuation of Rv0471c but no frameshift found to allow this. Sequence same in Mycobacterium bovis and Mycobacterium tuberculosis strain CDC1551. Weak hits to Cys-rich region (aa 258-... 527 562602 T | H,P Rv0471c . 38 Rv0471c, (MTV038.15c), len: 162 aa. Hypothetical unknown protein. 528 568217 A | T Rv0479c . 251 Rv0479c, (MTCY20G9.04c), len: 348 aa. Probable conserved membrane protein, equivalent to CAC31967.1|AL583925 possible secreted protein from Mycobacterium leprae (254 aa); and C-terminus highly similar to AAF74996.1|AF143402_1|AF143402 putative multicopper oxidase from Mycobacterium avium (149 aa)... 
529 568415 Q | K Rv0479c . 185 Rv0479c, (MTCY20G9.04c), len: 348 aa. Probable conserved membrane protein, equivalent to CAC31967.1|AL583925 possible secreted protein from Mycobacterium leprae (254 aa); and C-terminus highly similar to AAF74996.1|AF143402_1|AF143402 putative multicopper oxidase from Mycobacterium avium (149 aa)...
530 568802 P | A Rv0479c . 56 Rv0479c, (MTCY20G9.04c), len: 348 aa. Probable conserved membrane protein, equivalent to CAC31967.1|AL583925 possible secreted protein from Mycobacterium leprae (254 aa); and C-terminus highly similar to AAF74996.1|AF143402_1|AF143402 putative multicopper oxidase from Mycobacterium avium (149 aa)...
531 574149 V | L Rv0485 . 56 Rv0485, (MTCY20G9.11), len: 438 aa. Possible transcriptional repressor, member of the NAGC/XYLR repressor family; similar to several e.g. D87820_3|O32446|D82254 NAGC N-acetylglucosamine repressor from Vibrio cholerae (404 aa), FASTA scores: opt: 378, E(): 1.2e-17, (26.9% identity in 350 aa overla...
532 575907 A | V Rv0486 mshA 187 Glycosyltransferase MshA
533 583948 H | R Rv0493c . 248 Rv0493c, (MTCY20G9.19), len: 329 aa. Conserved protein, showing some similarity to U00018_33|B2168_F2_93 from Mycobacterium leprae (167 aa), FASTA scores: opt: 166,E(): 0.00077, (35.9% identity in 131 aa overlap).
534 584045 A | P Rv0493c . 216 Rv0493c, (MTCY20G9.19), len: 329 aa. Conserved protein, showing some similarity to U00018_33|B2168_F2_93 from Mycobacterium leprae (167 aa), FASTA scores: opt: 166,E(): 0.00077, (35.9% identity in 131 aa overlap).
535 584171 S | G Rv0493c . 174 Rv0493c, (MTCY20G9.19), len: 329 aa. Conserved protein, showing some similarity to U00018_33|B2168_F2_93 from Mycobacterium leprae (167 aa), FASTA scores: opt: 166,E(): 0.00077, (35.9% identity in 131 aa overlap).
536 585579 A | P,R Rv0495c . 246 Rv0495c, (MTCY20G9.21c), len: 296 aa. Conserved hypothetical protein, highly similar to S72915|B2168_F1_37 hypothetical protein from Mycobacterium leprae (323 aa),FASTA scores: opt: 1615, E(): 0, (82.7% identity in 271 aa overlap); and P54579|Y495_MYCLE|ML243|13094009|CAC31952.1|AL583925 conserve...
537 595501 A | V Rv0505c serB1 362 Possible phosphoserine phosphatase SerB1 (PSP) (O-phosphoserine phosphohydrolase) (pspase)
538 595770 M | A,L Rv0505c serB1 273 Possible phosphoserine phosphatase SerB1 (PSP) (O-phosphoserine phosphohydrolase) (pspase)
539 598475 R | H Rv0507 mmpL2 426 Probable conserved transmembrane transport protein MmpL2
540 599801 G | E Rv0507 mmpL2 868 Probable conserved transmembrane transport protein MmpL2
541 601257 A | P Rv0509 hemA 273 Probable glutamyl-tRNA reductase HemA (GLUTR)
542 610456 V | G Rv0518 . 90 Rv0518, (MTCY20G10.08), len: 231 aa. Possible exported protein; has hydrophobic N-terminus.
543 611977 G | D Rv0519c . 33 Rv0519c, (MTCY20G10.09c), len: 300 aa. Possible conserved membrane protein, with hydrophobic region near N-terminus. Could be a lipase. Similar to Rv0774c|MTCY369.19c|A70708 from Mycobacterium tuberculosis (312 aa), FASTA scores: opt: 1092, E(): 0, (57.9% identity in 299 aa overlap). Contains PS0...
544 614588 T | A Rv0523c . 45 Rv0523c, (MTCY25D10.02), len: 131 aa. Conserved protein, showing some similarity to M. tuberculosis proteins Rv1598c|MTCY336.06; and Rv1871c|MTCY336_06|O06592 (136 aa), FASTA scores: opt: 197, E(): 5e-08, (38.4% identity in 99 aa overlap).
545 615211 V | G Rv0524 hemL 126 Probable glutamate-1-semialdehyde 2,1-aminomutase HemL (GSA) (glutamate-1-semialdehyde aminotransferase) (GSA-at)
546 617740 V | G Rv0527 ccdA 83 Possible cytochrome C-type biogenesis protein CcdA
547 621598 P | L Rv0530 . 231 Rv0530, (MTCY25D10.09), len: 405 aa. Conserved protein, similar in part to other hypothetical proteins e.g. AL031231|SC3C3_3|CAA20252.1 from Streptomyces coelicolor (1083 aa), FASTA scores: opt: 870, E(): 0,(39.5% identity in 443 aa overlap); etc. Also similar to Mycobacterium tuberculosis protei...
548 623021 V | L Rv0532 . 77 PE-PGRS family protein PE_PGRS6
549 623225 S | A Rv0532 . 145 PE-PGRS family protein PE_PGRS6
550 623289 T | A,G,R Rv0532 . 166 PE-PGRS family protein PE_PGRS6
551 623472 D | G Rv0532 . 227 PE-PGRS family protein PE_PGRS6
552 623960 G | R Rv0532 . 390 PE-PGRS family protein PE_PGRS6
553 624025 S | R Rv0532 . 411 PE-PGRS family protein PE_PGRS6
554 624769 G | S Rv0533c fabH 238 3-oxoacyl-[acyl-carrier-protein] synthase III FabH (beta-ketoacyl-ACP synthase III) (KAS III)
555 625590 G | R Rv0534c menA 284 1,4-dihydroxy-2-naphthoate octaprenyltransferase MenA (DHNA-octaprenyltransferase)
556 627485 V | I Rv0536 galE3 80 Probable UDP-glucose 4-epimerase GalE3 (galactowaldenase) (UDP-galactose 4-epimerase) (uridine diphosphate galactose 4-epimerase) (uridine diphospho-galactose 4-epimerase)
557 630722 R | P Rv0538 . 228 Rv0538, (MTCY25D10.17), len: 548 aa. Possible conserved membrane protein. Middle region highly similar to AAB63811.1|AF009829|MBE4863a|O32850 unknown protein from Mycobacterium bovis (295 aa) possible transmembrane protein with a repetitive proline, threonine-rich region at C-terminus.
558 630949 R | G Rv0538 . 304 Rv0538, (MTCY25D10.17), len: 548 aa. Possible conserved membrane protein. Middle region highly similar to AAB63811.1|AF009829|MBE4863a|O32850 unknown protein from Mycobacterium bovis (295 aa) possible transmembrane protein with a repetitive proline, threonine-rich region at C-terminus.
559 631333 P | A Rv0538 . 432 Rv0538, (MTCY25D10.17), len: 548 aa. Possible conserved membrane protein. Middle region highly similar to AAB63811.1|AF009829|MBE4863a|O32850 unknown protein from Mycobacterium bovis (295 aa) possible transmembrane protein with a repetitive proline, threonine-rich region at C-terminus.
560 631346 V | A Rv0538 . 436 Rv0538, (MTCY25D10.17), len: 548 aa. Possible conserved membrane protein. Middle region highly similar to AAB63811.1|AF009829|MBE4863a|O32850 unknown protein from Mycobacterium bovis (295 aa) possible transmembrane protein with a repetitive proline, threonine-rich region at C-terminus.
561 636921 S | A,P Rv0545c pitA 182 Probable low-affinity inorganic phosphate transporter integral membrane protein PitA
562 637283 T | P Rv0545c pitA 61 Probable low-affinity inorganic phosphate transporter integral membrane protein PitA
563 637319 P | S Rv0545c pitA 49 Probable low-affinity inorganic phosphate transporter integral membrane protein PitA
564 639474 G | A,R Rv0548c menB 162 Naphthoate synthase MenB (dihydroxynaphthoic acid synthetase) (DHNA synthetase)
565 644159 V | G Rv0552 . 424 Rv0552, (MTCY25D10.31), len: 534 aa. Conserved protein, similar to others from several organisms. Also shows some similarity with regulatory proteins e.g. AEPA_ERWCA|Q06555 exoenzymes regulatory protein aepA [Precursor] from Erwinia carotovora (465 aa), FASTA scores: opt: 278, E(): 7.6e-11, (23.0...
566 645031 V | G Rv0553 menC 181 Probable muconate cycloisomerase MenC (cis,cis-muconate lactonizing enzyme) (MLE)
567 648002 L | R Rv0556 . 15 Rv0556, (MTCY25D10.35), len: 171 aa. Probable conserved transmembrane protein, equivalent to NP_302479.1|NC_002677 putative membrane protein from Mycobacterium leprae (175 aa). A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse et al.,2004).
568 648990 R | P Rv0557 mgtA 152 Mannosyltransferase MgtA
569 652638 F | L,P Rv0561c . 40 Rv0561c, (MTCY25D10.40c), len: 408 aa. Possible oxidoreductase, highly similar (except in first 30 aa) to NP_302482.1|NC_002677 putative FAD-linked oxidoreductase from Mycobacterium leprae (408 aa). Also similar to T34627 probable electron transfer oxidoreductase from Streptomyces coelicolor (430...
570 656190 D | E Rv0565c . 428 Rv0565c, (MTV039.03c), len: 486 aa. Probable monoxygenase, highly similar to NP_301173.1|NC_002677 putative monooxygenase from Mycobacterium leprae (494 aa). Also highly similar to others e.g. NP_421371.1|NC_002696 monooxygenase (flavin-binding family) from Caulobacter crescentus (498 aa); C-term...
571 656264 A | D,E,G,R,X Rv0565c . 403 Rv0565c, (MTV039.03c), len: 486 aa. Probable monoxygenase, highly similar to NP_301173.1|NC_002677 putative monooxygenase from Mycobacterium leprae (494 aa). Also highly similar to others e.g. NP_421371.1|NC_002696 monooxygenase (flavin-binding family) from Caulobacter crescentus (498 aa); C-term...
572 656341 M | G,S Rv0565c . 377 Rv0565c, (MTV039.03c), len: 486 aa. Probable monoxygenase, highly similar to NP_301173.1|NC_002677 putative monooxygenase from Mycobacterium leprae (494 aa). Also highly similar to others e.g. NP_421371.1|NC_002696 monooxygenase (flavin-binding family) from Caulobacter crescentus (498 aa); C-term...
573 657142 R | H Rv0565c . 110 Probable monooxygenase
574 657295 R | H Rv0565c . 59 Rv0565c, (MTV039.03c), len: 486 aa. Probable monoxygenase, highly similar to NP_301173.1|NC_002677 putative monooxygenase from Mycobacterium leprae (494 aa). Also highly similar to others e.g. NP_421371.1|NC_002696 monooxygenase (flavin-binding family) from Caulobacter crescentus (498 aa); C-term...
575 665293 F | L Rv0572c . 31 Rv0572c, (MTV039.10c), len: 113 aa. Hypothetical unknown protein.
576 669033 D | V Rv0575c . 238 Rv0575c, (MTV039.13c), len: 388 aa. Possible oxidoreductase, similar to many diverse oxidoreductases and monooxygenases e.g. AL109974|SCF34_5|T36404 probable monooxygenase from Streptomyces coelicolor (407 aa), FASTA scores: opt: 786, E(): 0, (38.7% identity in 398 aa overlap); P96555|AB000564 sa...
577 669551 Y | * Rv0575c . 66 Rv0575c, (MTV039.13c), len: 388 aa. Possible oxidoreductase, similar to many diverse oxidoreductases and monooxygenases e.g. AL109974|SCF34_5|T36404 probable monooxygenase from Streptomyces coelicolor (407 aa), FASTA scores: opt: 786, E(): 0, (38.7% identity in 398 aa overlap); P96555|AB000564 sa...
578 670545 R | H Rv0576 . 233 Probable transcriptional regulatory protein (possibly ArsR-family)
579 672517 T | A Rv0578c . 1134 PE-PGRS family protein PE_PGRS7
580 672524 G | A,P Rv0578c . 1132 PE-PGRS family protein PE_PGRS7
581 672534 T | N Rv0578c . 1128 PE-PGRS family protein PE_PGRS7
582 672628 T | A,G Rv0578c . 1097 PE-PGRS family protein PE_PGRS7
583 672725 G | A,K,R,V Rv0578c . 1065 PE-PGRS family protein PE_PGRS7
584 672756 V | G Rv0578c . 1054 PE-PGRS family protein PE_PGRS7
585 673314 A | G,P Rv0578c . 868 PE-PGRS family protein PE_PGRS7
586 673318 N | A,H,P Rv0578c . 867 PE-PGRS family protein PE_PGRS7
587 673362 V | G Rv0578c . 852 PE-PGRS family protein PE_PGRS7
588 673976 F | L Rv0578c . 648 PE-PGRS family protein PE_PGRS7
589 674242 N | D Rv0578c . 559 PE-PGRS family protein PE_PGRS7
590 674980 T | A Rv0578c . 313 PE-PGRS family protein PE_PGRS7
591 675361 W | G Rv0578c . 186 PE-PGRS family protein PE_PGRS7
592 681965 Y | D Rv0585c . 770 Rv0585c, (MTV039.23c, MTCY19H5.37), len: 795 aa. Probable conserved integral membrane protein. C-terminus similar to CAB88984.1|AL353864 putative integral membrane protein from Streptomyces coelicolor (299 aa); and C-terminal region of CAC01311.1|AL390968 putative integral membrane protein from S...
593 686972 F | S Rv0589 mce2A 51 Mce-family protein Mce2A
594 694306 N | A Rv0594 mce2F 357 Mce-family protein Mce2F
595 700776 S | P Rv0604 lpqO 180 Probable conserved lipoprotein LpqO
596 706536 S | R Rv0612 . 71 Rv0612, (MTCY19H5.09c), len: 201 aa. Conserved hypothetical protein, highly similar, but in part, to downstream ORF Rv0609A conserved hypothetical protein from Mycobacterium tuberculosis (75 aa); and showing weak similarity with other hypothetical proteins from Mycobacterium tuberculosis. Note th...
597 706871 A | G Rv0612 . 183 Rv0612, (MTCY19H5.09c), len: 201 aa. Conserved hypothetical protein, highly similar, but in part, to downstream ORF Rv0609A conserved hypothetical protein from Mycobacterium tuberculosis (75 aa); and showing weak similarity with other hypothetical proteins from Mycobacterium tuberculosis. Note th...
598 707052 T | P Rv0613c . 822 Rv0613c, (MTCY19H5.08), len: 855 aa. Unknown protein. Contains a very short region with strong similarity to several preprotein translocases e.g. P47847|SECA_LISMO preprotein translocase seca subunit (836 aa), FASTA scores: opt: 138, E(): 0.18, (38.6% identity in 70 aa overlap, and 72.7% identity...
599 707066 A | G Rv0613c . 817 Rv0613c, (MTCY19H5.08), len: 855 aa. Unknown protein. Contains a very short region with strong similarity to several preprotein translocases e.g. P47847|SECA_LISMO preprotein translocase seca subunit (836 aa), FASTA scores: opt: 138, E(): 0.18, (38.6% identity in 70 aa overlap, and 72.7% identity...
600 707334 T | A Rv0613c . 728 hypothetical protein
601 710322 L | R Rv0614 . 323 Rv0614, (MTCY19H5.07c), len: 330 aa. Conserved hypothetical protein, similar in part to Mycobacterium tuberculosis hypothetical proteins e.g. YY16_MYCTU|Q10685|Rv2077c|MT2137|MTCY49.16c conserved hypothetical protein (323 aa), FASTA scores: opt: 200, E(): 0.00016, (28.3% identity in 269 aa overla...
602 712693 T | A Rv0619 galTb 174 Probable galactose-1-phosphate uridylyltransferase GalTb [second part]
603 713310 C | R Rv0620 galK 199 Probable galactokinase GalK (galactose kinase)
604 722583 V | F Rv0630c recB 811 Probable exonuclease V (beta chain) RecB (exodeoxyribonuclease V beta chain)(exodeoxyribonuclease V polypeptide) (chi-specific endonuclease)
605 722852 P | L Rv0630c recB 721 Probable exonuclease V (beta chain) RecB (exodeoxyribonuclease V beta chain)(exodeoxyribonuclease V polypeptide) (chi-specific endonuclease)
606 725220 A | P,R Rv0631c recC 1030 Probable exonuclease V (gamma chain) RecC (exodeoxyribonuclease V gamma chain)(exodeoxyribonuclease V polypeptide)
607 725428 R | H Rv0631c recC 960 Probable exonuclease V (gamma chain) RecC (exodeoxyribonuclease V gamma chain)(exodeoxyribonuclease V polypeptide)
608 727321 G | E Rv0631c recC 329 Probable exonuclease V (gamma chain) RecC (exodeoxyribonuclease V gamma chain)(exodeoxyribonuclease V polypeptide)
609 728617 G | A,R Rv0632c echA3 221 Probable enoyl-CoA hydratase EchA3 (enoyl hydrase) (unsaturated acyl-CoA hydratase) (crotonase)
610 738902 D | N Rv0644c mmaA2 87 Methoxy mycolic acid synthase 2 MmaA2 (methyl mycolic acid synthase 2) (MMA2) (hydroxy mycolic acid synthase)
611 741620 L | R Rv0647c . 333 Rv0647c, (MTCY20H10.28c), len: 488 aa. Conserved protein, equivalent to NP_302277.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (448 aa). Also showing similarity to a variety of hypothetical ABC1-like proteins or conserved hypothetical proteins e.g. D90908_28|P73627 ABC1-li...
612 747833 A | G Rv0650 . 266 Rv0650, (MTCY20H10.31), len: 302 aa. Possible sugar kinase, highly similar to others e.g. CAB95296.1|AL359779 putative sugar kinase from Streptomyces coelicolor (317 aa); NP_406512.1|NC_003143 putative sugar kinase from Yersinia pestis (290 aa); NP_229269.1|NC_000853 glucokinase from Thermotoga m...
613 756216 D | G Rv0663 atsD 27 Possible arylsulfatase AtsD (aryl-sulfate sulphohydrolase) (arylsulphatase)
614 760151 E | G Rv0667 rpoB 115 DNA-directed RNA polymerase (beta chain) RpoB (transcriptase beta chain) (RNA polymerase beta subunit)
615 760314 V | F Rv0667 rpoB 170 DNA-directed RNA polymerase (beta chain) RpoB (transcriptase beta chain) (RNA polymerase beta subunit)
616 761109 D | W,Y Rv0667 rpoB 435 DNA-directed RNA polymerase (beta chain) RpoB (transcriptase beta chain) (RNA polymerase beta subunit)
617 761110 D | G,V,W Rv0667 rpoB 435 DNA-directed RNA polymerase (beta chain) RpoB (transcriptase beta chain) (RNA polymerase beta subunit)
618 761140 H | L,R Rv0667 rpoB 445 DNA-directed RNA polymerase (beta chain) RpoB (transcriptase beta chain) (RNA polymerase beta subunit)
619 761155 S | *,L,W Rv0667 rpoB 450 DNA-directed RNA polymerase (beta chain) RpoB (transcriptase beta chain) (RNA polymerase beta subunit)
620 763884 A | V Rv0668 rpoC 172 DNA-directed RNA polymerase (beta' chain) RpoC (transcriptase beta' chain) (RNA polymerase beta' subunit)
621 764819 W | G Rv0668 rpoC 484 DNA-directed RNA polymerase (beta' chain) RpoC (transcriptase beta' chain) (RNA polymerase beta' subunit)
622 764918 V | L Rv0668 rpoC 517 DNA-directed RNA polymerase (beta' chain) RpoC (transcriptase beta' chain) (RNA polymerase beta' subunit)
623 771731 D | A Rv0672 fadE8 83 Probable acyl-CoA dehydrogenase FadE8
624 773431 Q | H Rv0673 echA4 103 Possible enoyl-CoA hydratase EchA4 (enoyl hydrase) (unsaturated acyl-CoA hydratase) (crotonase)
625 779773 T | H,P Rv0679c . 90 Rv0679c, (MTV040.07c), len: 165 aa. Conserved Thr-rich protein, similar in part to neighboring ORF Rv0680c (124 aa), FASTA score: (35.1% identity in 131 aa overlap); and Rv0314c (220 aa). Predicted to be an outer membrane protein (See Song et al., 2008).
626 791249 A | T Rv0691c . 140 Probable transcriptional regulatory protein
627 793231 P | S Rv0693 pqqE 359 Probable coenzyme PQQ synthesis protein E PqqE (coenzyme PQQ synthesis protein III)
628 806061 E | G Rv0710 rpsQ 102 30S ribosomal protein S17 RpsQ
629 815128 L | M Rv0722 rpmD 46 50S ribosomal protein L30 RpmD
630 816354 A | G Rv0724 sppA 231 Possible protease IV SppA (endopeptidase IV) (signal peptide peptidase)
631 819026 L | A Rv0726c . 206 Rv0726c, (MTCY210.45c), len: 367 aa. Possible S-adenosylmethionine-dependent methyltransferase (see Grana et al., 2007), highly similar to other proteins from Mycobacterium tuberculosis e.g. Q10552|Y893_MYCTU|Rv0893c|MT0917|MTCY31.21c (325 aa), FASTA scores: opt: 646, E(): 0, (38.3% identity in 3...
632 829698 D | E Rv0737 . 164 Rv0737, (MTV041.11), len: 165 aa. Possible transcriptional regulator, similar to others e.g. BAB69161.1|AB070937 regulator protein from Streptomyces avermitilis (169 aa); NP_419731.1|NC_002696 transcriptional regulator MarR family from Caulobacter crescentus (148 aa) (homology only at C-terminus)...
633 836164 S | L Rv0746 . 155 PE-PGRS family protein PE_PGRS9
634 837292 A | G Rv0746 . 531 PE-PGRS family protein PE_PGRS9
635 839144 S | A Rv0747 . 232 PE-PGRS family protein PE_PGRS10
636 839817 I | A Rv0747 . 456 PE-PGRS family protein PE_PGRS10
637 840115 S | R Rv0747 . 555 PE-PGRS family protein PE_PGRS10
638 841634 R | P Rv0749 vapC31 136 Possible toxin VapC31 Contains PIN domain
639 842057 V | F Rv0750 . 9 Rv0750, (MTV041.24), len: 81 aa. Conserved hypothetical protein, showing almost perfect overlap with C-terminus of Rv0740|MTV041_14 conserved hypothetical protein from Mycobacterium tuberculosis (175 aa), FASTA scores: (93.8% identity in 81 aa overlap). Possible duplication. This region is a poss...
640 842472 A | P,S Rv0751c mmsB 254 Probable 3-hydroxyisobutyrate dehydrogenase MmsB (hibadh)
641 842867 V | G Rv0751c mmsB 122 Probable 3-hydroxyisobutyrate dehydrogenase MmsB (hibadh)
642 846948 W | G Rv0754 . 264 PE-PGRS family protein PE_PGRS11
643 859239 T | A,I,P,S Rv0766c cyp123 279 Probable cytochrome P450 123 Cyp123
644 859498 M | C,P,R Rv0766c cyp123 192 Probable cytochrome P450 123 Cyp123
645 867536 A | P,R Rv0774c . 273 Rv0774c, (MTCY369.19c), len: 303 aa. Possible conserved exported protein with hydrophobic region near N-terminus, highly similar, except in N-terminus, to Rv0519c|Z97831|MTY20G10.09c|O33364 hypothetical protein from Mycobacterium tuberculosis (300 aa), FASTA scores: opt: 1092, E(): 0, (57.9% iden...
646 877833 N | T Rv0783c emrB 203 Possible multidrug resistance integral membrane efflux protein EmrB
647 878006 G | C,V Rv0783c emrB 146 Possible multidrug resistance integral membrane efflux protein EmrB
648 885120 T | L,N,S,X Rv0791c . 241 Rv0791c, (MTV042.01c, MTCY369.35c), len: 347 aa. Conserved protein, similar (except in N-terminus) to others e.g. CAC44585.1|AL596162 conserved hypothetical protein from Streptomyces coelicolor (307 aa); NP_252643.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (364 aa); etc. Also so...
649 886040 S | L,P Rv0792c . 203 Rv0792c, (MTV042.02c), len: 269 aa. Probable transcriptional regulator, GntR-family, similar to many others of GntR family e.g. BSUB0018_189|Z99121 from Bacillus subtilis (243 aa), FASTA scores: opt: 367, E(): 1.5e-17, (32.1% identity in 246 aa overlap); P31453|YIDP_ECOLI from Escherichia coli (2...
650 888125 P | S Rv0794c . 171 Rv0794c, (MTV042.04c), len: 499 aa. Probable oxidoreductase, possibly dihydrolipoamide dehydrogenase or mercuric reductase. Highly similar to CAB62675.1|AL133422 probable oxidoreductase from Streptomyces coelicolor (477 aa); and similar to various oxidoreductases e.g. P08663|MERA_STAAU mercuric r...
651 891668 G | R Rv0798c cfp29 201 29 KDa antigen CFP29
652 893733 L | R Rv0800 pepC 139 Probable aminopeptidase PepC
653 905084 G | H,P Rv0810c . 2 Rv0810c, (MTV043.02c), len: 60 aa.
Conserved hypothetical protein, with its N-terminus highly similar to NP_302445.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (62 aa); and AL118514|SCD25_24 hypothetical protein from Streptomyces coelicolor (84 aa),FASTA scores: opt: 180, ... 654 906857 I | M Rv0812 . 145 Rv0812, (MTV043.04), len: 289 aa. Probable amino acid aminotransferase, similar to other amino acid aminotransferases, generally class-IV of pyridoxal-phosphate-dependent aminotransferases, and especially ILVE proteins and PABC proteins e.g. B76065.1|AL157953 putative aminotransferase from Strept... 655 907869 E | D Rv0813c . 51 Rv0813c, (MTV043.05c), len: 226 aa. Conserved protein, highly similar to U15182|MLU15182_16 hypothetical protein from Mycobacterium leprae (242 aa), FASTA scores: opt: 1191, E(): 0, (78.3% identity in 226 aa overlap); and NP_302442.1|NC_002677 conserved hypothetical protein from Mycobacterium lep... 656 909724 L | V Rv0816c thiX 104 Probable thioredoxin ThiX 657 914508 F | G,I,V Rv0822c . 602 Rv0822c, (MTV043.14c), len: 684 aa. Conserved protein, highly similar in the region between aa 370 - 580 to U2266O|U15182|MLU15182_30 hypothetical protein from Mycobacterium leprae (222 aa), FASTA scores: opt: 819, E(): 0, (60.6% identity in 221 aa overlap). More extended similarity to Rv3267|Z92... 658 915207 T | H,P Rv0822c . 369 Rv0822c, (MTV043.14c), len: 684 aa. Conserved protein, highly similar in the region between aa 370 - 580 to U2266O|U15182|MLU15182_30 hypothetical protein from Mycobacterium leprae (222 aa), FASTA scores: opt: 819, E(): 0, (60.6% identity in 221 aa overlap). More extended similarity to Rv3267|Z92... 659 916681 A | G,L Rv0823c . 323 Rv0823c, (MTV043.15c), len: 389 aa. Possible transcriptional regulator (resembles nitrogen regulation protein), equivalent (but longer 24 aa in N-terminus) to MLU15182_31|U15182|NtrB NtrB protein from Mycobacterium leprae (384 aa), FASTA scores: opt: 2070, E(): 0, (82.3% identity in 384 aa overla... 
660 916684 R | L Rv0823c . 322 Rv0823c, (MTV043.15c), len: 389 aa. Possible transcriptional regulator (resembles nitrogen regulation protein), equivalent (but longer 24 aa in N-terminus) to MLU15182_31|U15182|NtrB NtrB protein from Mycobacterium leprae (384 aa), FASTA scores: opt: 2070, E(): 0, (82.3% identity in 384 aa overla... 661 916688 L | G,P,R,S Rv0823c . 320 Rv0823c, (MTV043.15c), len: 389 aa. Possible transcriptional regulator (resembles nitrogen regulation protein), equivalent (but longer 24 aa in N-terminus) to MLU15182_31|U15182|NtrB NtrB protein from Mycobacterium leprae (384 aa), FASTA scores: opt: 2070, E(): 0, (82.3% identity in 384 aa overla... 662 916833 T | P Rv0823c . 272 Rv0823c, (MTV043.15c), len: 389 aa. Possible transcriptional regulator (resembles nitrogen regulation protein), equivalent (but longer 24 aa in N-terminus) to MLU15182_31|U15182|NtrB NtrB protein from Mycobacterium leprae (384 aa), FASTA scores: opt: 2070, E(): 0, (82.3% identity in 384 aa overla... 663 919351 F | C Rv0825c . 68 hypothetical protein 664 925643 F | V Rv0833 . 95 PE-PGRS family protein PE_PGRS13 665 926756 S | A,G Rv0833 . 466 PE-PGRS family protein PE_PGRS13 666 926766 A | G Rv0833 . 469 PE-PGRS family protein PE_PGRS13 667 926825 G | R Rv0833 . 489 PE-PGRS family protein PE_PGRS13 668 927045 V | A,G Rv0833 . 562 PE-PGRS family protein PE_PGRS13 669 927413 N | D Rv0833 . 685 PE-PGRS family protein PE_PGRS13 670 928165 G | A,T Rv0834c . 774 PE-PGRS family protein PE_PGRS14 671 928228 A | D Rv0834c . 753 PE-PGRS family protein PE_PGRS14 672 929120 G | A,P,V,W Rv0834c . 456 PE-PGRS family protein PE_PGRS14 673 929705 T | A Rv0834c . 261 PE-PGRS family protein PE_PGRS14 674 932280 * | W Rv0836c . 218 Hypothetical protein 675 933598 P | L Rv0837c . 145 Rv0837c, (MTV043.30c), len: 342 aa. Hypothetical unknown protein. This region is a possible MT-complex-specific genomic island (See Becq et al.,2007). 676 941722 I | S Rv0845 . 178 Rv0845, (MTV043.38), len: 425 aa. 
Possible two-component sensor kinase, with its C-terminus similar to C-terminal part of others e.g. NP_294951.1|NC_001263 two-component sensor histidine kinase from Deinococcus radiodurans (469 aa); CAC32293.1|AL583943 putative two component system histidine kina... 677 947649 R | A,G,W Rv0851c . 274 Rv0851c, (MTV043.44c), len: 275 aa. Probable short-chain dehydrogenase/reductase, similar to many e.g. Q01198|LIGD_PSEPA C alpha-dehydrogenase (SDR family) from Pseudomonas paucimobilis (Sphingomonas paucimobilis) (305 aa); D11473|PSELIG_1 C alpha-dehydrogenase from P. paucimobilis (305 aa), FAST... 678 954586 V | G Rv0858c dapC 112 Probable N-succinyldiaminopimelate aminotransferase DapC (DAP-at) 679 955127 K | N Rv0859 fadA 17 Possible acyl-CoA thiolase FadA 680 956002 K | N Rv0859 fadA 309 Possible acyl-CoA thiolase FadA 681 960367 L | A,S Rv0862c . 749 hypothetical protein 682 964925 V | A,R Rv0867c rpfA 204 Possible resuscitation-promoting factor RpfA 683 966972 N | D,P Rv0869c moaA2 126 Probable molybdenum cofactor biosynthesis protein A2 MoaA2 684 968424 * | N,P,R,T,Y Rv0872c . 608 PE-PGRS family protein PE_PGRS15 685 968426 * | D,G,P,Q,R Rv0872c . 607 PE-PGRS family protein PE_PGRS15 686 968428 G | A,D,E,N,P,R,V Rv0872c . 606 PE-PGRS family protein PE_PGRS15 687 968429 G | *,A,C,E,L,N,P,Q,R,T,Y Rv0872c . 606 PE-PGRS family protein PE_PGRS15 688 968430 G | *,A,E,H,L,N,P,Q,R,S,Y Rv0872c . 606 PE-PGRS family protein PE_PGRS15 689 968431 P | *,A,E,G,H,L,Q,S,T,Y Rv0872c . 605 PE-PGRS family protein PE_PGRS15 690 968432 P | *,A,E,I,L,N,S,T,Y Rv0872c . 605 PE-PGRS family protein PE_PGRS15 691 968435 Q | A,G,K,N,P Rv0872c . 604 PE-PGRS family protein PE_PGRS15 692 968440 A | D,G,K,P,S,V,W Rv0872c . 602 PE-PGRS family protein PE_PGRS15 693 969740 G | P,R,W Rv0872c . 169 PE-PGRS family protein PE_PGRS15 694 970823 L | V Rv0873 fadE10 107 Probable acyl-CoA dehydrogenase FadE10 695 972713 R | A,G Rv0874c . 332 Rv0874c, (MTCY31.02c), len: 386 aa. 
Conserved hypothetical protein, highly similar in part to SPU62616_1 hypothetical protein from Synechococcus sp. (280 aa), FASTA scores: E(): 6.3e-26, (35.2% identity in 264 aa overlap); SYCSLLLH_102 from Synechocystis sp. (447 aa), FASTA scores: E(): 1.1e-18, ... 696 976896 K | H,N,P,Q,S,T Rv0878c . 437 PPE family protein PPE13 697 976897 Q | K,N,P,T Rv0878c . 436 PPE family protein PPE13 698 976899 H | A,K,L,P,Q,S,T Rv0878c . 436 PPE family protein PPE13 699 985124 D | A Rv0886 fprB 441 Probable NADPH:adrenodoxin oxidoreductase FprB (adrenodoxin reductase) (AR) (ferredoxin-NADP(+) reductase) 700 988604 E | Q Rv0888 . 458 Rv0888, (MTCY31.16), len: 490 aa. Probable exported protein. Equivalent to AAK45157.1 from Mycobacterium tuberculosis strain CDC1551 (507 aa) but shorter 17 aa. Contains possible N-terminal signal sequence. Predicted to be an outer membrane protein (See Song et al., 2008). 701 989536 R | L Rv0889c citA 109 Probable citrate synthase II CitA 702 991896 E | G Rv0890c . 234 Probable transcriptional regulatory protein (probably LuxR-family) 703 993346 V | G Rv0891c . 37 Possible transcriptional regulatory protein 704 998556 T | P Rv0895 . 259 Rv0895, (MTCY31.23), len: 505 aa. Possible triacylglycerol synthase (See Daniel et al., 2004); member of family with: Rv3740c, Rv3734c, Rv1425, Rv1760, etc. Shows some similarity with NP_301898.1|NC_002677 conserved membrane protein from Mycobacterium leprae (491 aa). This region is a possible MT... 705 1001013 F | S Rv0897c . 468 Rv0897c, (MTCY31.25c), len: 535 aa. Possible oxidoreductase, similar to various oxidoreductases from diverse organisms e.g. CAB94055.1|AL358672 putative oxidoreductase from Streptomyces coelicolor (540 aa); NP_147877.1|NC_000854 phytoene dehydrogenase from Aeropyrum pernix (549 aa); Q01671|CRTD_R... 706 1014815 H | Q Rv0909 . 45 hypothetical protein 707 1020072 N | D Rv0915c . 420 PPE family protein PPE14 708 1024346 S | G Rv0918 . 46 hypothetical protein 709 1034205 I | A Rv0927c . 
143 Rv0927c, (MTCY21C12.21c), len: 263 aa. Probable short-chain dehydrogenase/reductase, similar to various dehydrogenases/reductases, notably 7-alpha-hydroxysteroid dehydrogenases and glucose 1-dehydrogenases e.g. P25529|HDHA_ECOLI 7-alpha-hydroxysteroid dehydrogenase from Escherichia coli (255 aa),... 710 1037105 F | V Rv0930 pstA1 36 Probable phosphate-transport integral membrane ABC transporter PstA1 711 1039568 V | A Rv0931c pknD 116 Transmembrane serine/threonine-protein kinase D PknD (protein kinase D) (STPK D) 712 1039946 A | V Rv0932c pstS2 368 Periplasmic phosphate-binding lipoprotein PstS2 (PBP-2) (PstS2) 713 1042125 R | L Rv0934 pstS1 4 Periplasmic phosphate-binding lipoprotein PstS1 (PBP-1) (PstS1) 714 1043136 T | I Rv0934 pstS1 341 Periplasmic phosphate-binding lipoprotein PstS1 (PBP-1) (PstS1) 715 1049574 P | L Rv0939 . 388 Rv0939, (MTCY10D7.35c), len: 644 aa. Possible bifunctional enzyme, including 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase activity, and cyclase/dehydrase activity. N-terminal part similar to many isomerases e.g. NP_343861.1|NC_002754 2-hydroxyhepta-2,4-diene-1,7-dioate isomerase (hpcE-1) from Su... 716 1055151 L | M Rv0946c pgi 512 Probable glucose-6-phosphate isomerase Pgi (GPI) (phosphoglucose isomerase) (phosphohexose isomerase) (phi) 717 1061110 R | K,M,T Rv0950c . 182 Rv0950c, (MTCY10D7.24), len: 332 aa. Conserved hypothetical protein, highly similar to AL035500|MLCL373.02c|T45433 hypothetical protein from Mycobacterium leprae (343 aa), FASTA scores: opt: 1500,E(): 0, (71.0% identity in 331 aa overlap). C-terminus highly similar to part of various proteins e.g... 
718 1062745 V | A Rv0951 sucC 261 Probable succinyl-CoA synthetase (beta chain) SucC (SCS-beta) 719 1069151 V | G Rv0957 purH 316 Probable bifunctional purine biosynthesis protein PurH: phosphoribosylaminoimidazolecarboxamide formyltransferase (AICAR transformylase) (5'-phosphoribosyl-5-aminoimidazole-4-carboxamide formyltransferase) + inosinemonophosphate cyclohydrolase (imp cyclohydrolase) (inosinicase) (imp synthetase) (... 720 1075945 S | C,H,P Rv0963c . 52 Rv0963c, (MTCCY10D7.11), len: 266 aa. Conserved hypothetical protein, similar in part to other conserved hypothetical proteins from Mycobacterium tuberculosis e.g. Rv2797c|MTCY16B7.46 (562 aa), FASTA scores: E(): 1.2e-23,(39.0% identity in 254 aa overlap); Rv2542 (403 aa); Rv2079 (656 aa). Also s... 721 1075946 A | D,G Rv0963c . 51 Rv0963c, (MTCCY10D7.11), len: 266 aa. Conserved hypothetical protein, similar in part to other conserved hypothetical proteins from Mycobacterium tuberculosis e.g. Rv2797c|MTCY16B7.46 (562 aa), FASTA scores: E(): 1.2e-23,(39.0% identity in 254 aa overlap); Rv2542 (403 aa); Rv2079 (656 aa). Also s... 722 1075947 A | *,D,G,L,R,T Rv0963c . 51 Rv0963c, (MTCCY10D7.11), len: 266 aa. Conserved hypothetical protein, similar in part to other conserved hypothetical proteins from Mycobacterium tuberculosis e.g. Rv2797c|MTCY16B7.46 (562 aa), FASTA scores: E(): 1.2e-23,(39.0% identity in 254 aa overlap); Rv2542 (403 aa); Rv2079 (656 aa). Also s... 723 1077685 R | A,G Rv0966c . 51 Rv0966c, (MTCY10D7.08), len: 200 aa. Conserved protein, equivalent to AL035500|MLCL373_12 conserved hypothetical protein from Mycobacterium leprae (200 aa),FASTA scores: opt: 1080, E(): 0, (79.5% identity in 200 aa overlap). Also highly similar to SCE6.30c|CAB88834.1|AL353832 hypothetical protein... 
724 1079718 R | C Rv0969 ctpV 326 Probable metal cation transporter P-type ATPase CtpV 725 1080192 D | N Rv0969 ctpV 484 Probable metal cation transporter P-type ATPase CtpV 726 1081882 S | P Rv0971c echA7 235 Probable enoyl-CoA hydratase EchA7 (enoyl hydrase) (unsaturated acyl-CoA hydratase) (crotonase) 727 1086648 R | L Rv0974c accD2 233 Probable acetyl-/propionyl-CoA carboxylase (beta subunit) AccD2 728 1089917 G | P,R Rv0976c . 87 Rv0976c, (MTV044.04c), len: 560 aa. Conserved hypothetical protein, highly similar to others e.g. CAB95890.1|AL359988 conserved hypothetical protein from Streptomyces coelicolor (558 aa); P_251576.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (600 aa); etc. N-terminal part highly s... 729 1090026 D | K,P Rv0976c . 51 Rv0976c, (MTV044.04c), len: 560 aa. Conserved hypothetical protein, highly similar to others e.g. CAB95890.1|AL359988 conserved hypothetical protein from Streptomyces coelicolor (558 aa); P_251576.1|NC_002516 hypothetical protein from Pseudomonas aeruginosa (600 aa); etc. N-terminal part highly s... 730 1091820 R | L Rv0977 . 483 PE-PGRS family protein PE_PGRS16 731 1093465 P | H,L,X Rv0978c . 298 PE-PGRS family protein PE_PGRS17 732 1094059 V | A,E Rv0978c . 100 PE-PGRS family protein PE_PGRS17 733 1095731 T | A Rv0980c . 241 PE-PGRS family protein PE_PGRS18 734 1096078 D | A Rv0980c . 125 PE-PGRS family protein PE_PGRS18 735 1096079 D | H Rv0980c . 125 PE-PGRS family protein PE_PGRS18 736 1097023 G | S Rv0981 mprA 68 Mycobacterial persistence regulator MRPA (two component response transcriptional regulatory protein) 737 1100746 V | G Rv0984 moaB2 96 Possible pterin-4-alpha-carbinolamine dehydratase MoaB2 (PHS) (4-alpha-hydroxy-tetrahydropterin dehydratase) (pterin-4-a-carbinolamine dehydratase) (phenylalanine hydroxylase-stimulating protein) (PHS) (pterin carbinolamine dehydratase) (PCD) 738 1104690 F | V Rv0987 . 
717 Probable adhesion component transport transmembrane protein ABC transporter 739 1106422 I | V Rv0989c grcC2 321 Probable polyprenyl-diphosphate synthase GrcC2 (polyprenyl pyrophosphate synthetase) 740 1106962 V | I Rv0989c grcC2 141 Probable polyprenyl-diphosphate synthase GrcC2 (polyprenyl pyrophosphate synthetase) 741 1107708 I | R,V Rv0990c . 131 Rv0990c, (MTCI237.04c), len: 218 aa. Hypothetical unknown protein. 742 1108076 R | S,W Rv0990c . 9 Rv0990c, (MTCI237.04c), len: 218 aa. Hypothetical unknown protein. 743 1111678 G | S Rv0995 rimJ 23 Ribosomal-protein-alanine acetyltransferase RimJ (acetylating enzyme for N-terminal of ribosomal protein S5) 744 1116934 V | G Rv1000c . 72 Rv1000c, len: 205 aa. Conserved hypothetical protein, equivalent to ML0190|NP_301263.1|NC_002677 conserved hypothetical protein from Mycobacterium leprae (205 aa). Also highly similar to SC5F8.12c|CAB93740.1|AL357613 hypothetical protein from Streptomyces coelicolor (210 aa), FASTA scores: E(): 2... 745 1118939 H | P,R Rv1002c . 334 Rv1002c, (MTCI237.17c), len: 503 aa. Conserved membrane protein. Predicted to be in the GT-C superfamily of glycosyltransferases (See Liu and Mushegian, 2003). Similar to AL132674|SCE87.05 hypothetical protein from Streptomyces coelicolor (591 aa), FASTA scores: opt: 666,E(): 0, (39.0% identity i... 746 1130261 R | H Rv1011 ispE 24 Probable 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase IspE (CMK) (4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol kinase) 747 1142220 A | G Rv1020 mfd 1085 Probable transcription-repair coupling factor Mfd (TRCF) 748 1143764 V | G Rv1022 lpqU 10 Probable conserved lipoprotein LpqU 749 1148930 G | S Rv1027c kdpE 60 Probable transcriptional regulatory protein KdpE 750 1157771 S | C Rv1032c trcS 62 Two component sensor histidine kinase TrcS 751 1164809 E | D Rv1041c . 210 Rv1041c, (MTCY10G2.08), len: 287 aa. Probable is like-2 transposase, overlaps MTCY10G2.07. 
Similar to Q00430|X53945 insertion element IS869 hypothetical protein from Agrobacterium tumefaciens (186 aa), FASTA scores: opt: 173, E(): 0.00016, (40.9% identity in 176 aa overlap). Similar to Rv1150, C-... 752 1166610 H | P,R Rv1043c . 66 Rv1043c, (MTCY10G2.06), len: 341 aa. Conserved hypothetical protein similar to AL096872|SC5F7.08 putative lipoate-protein ligase from Streptomyces coelicolor (362 aa), FASTA scores: opt: 206, E(): 1.4e-05, (30.3% identity in 201 aa overlap). Weak similarity to P39668|YYXA_BACSU hypothetical prote... 753 1168712 L | V Rv1046c . 173 Rv1046c, (MTCY10G2.03), len: 174 aa. Hypothetical unknown protein. Start changed since first submission (-65 aa). This region is a possible MT-complex-specific genomic island (See Becq et al., 2007). 754 1168717 E | G,K,V Rv1046c . 171 Rv1046c, (MTCY10G2.03), len: 174 aa. Hypothetical unknown protein. Start changed since first submission (-65 aa). This region is a possible MT-complex-specific genomic island (See Becq et al., 2007). 755 1174655 D | N Rv1051c . 16 Rv1051c, (MTV017.04c), len: 251 aa. Conserved hypothetical protein, similar to LLU36837|U36837.1 protein encoded by Lactococcus lactis plasmid pNP40 (298 aa), FASTA scores: opt: 194, E(): 3.5e-06, (30.3% identity in 155 aa overlap). Contains possible helix-turn-helix motif at aa 197-218 (Score 10... 756 1175947 D | E Rv1052 . 75 Rv1052, (MTV017.05), len: 129 aa. Hypothetical unknown protein. This region is a possible MT-complex-specific genomic island (See Becq et al.,2007). 757 1177157 G | D Rv1054 . 77 Rv1054, (MTV017.07), len: 104 aa. Probable integrase (fragment), similar to Rv2309c|MTCY3G12_25|Z79702 hypothetical protein (shows similarity to integrases) from Mycobacterium tuberculosis (151 aa), FASTA scores: opt: 273, E(): 8.8e-13, (64.7% identity in 68 aa overlap); and to L39071|MSGINT_1 in... 758 1177984 D | E Rv1056 . 119 Rv1056, (MTV017.09), len: 254 aa. 
Conserved protein,some similarity in C-terminal region of Rv0140|MTCI5.14|Z92770 Mycobacterium tuberculosis (126 aa),FASTA scores: opt: 254, E(): 1.2e-10, (43.4% identity in 106 aa overlap); and to Rv1670. C-terminal region is similar to AL035569|SC8D9.02 hypothe... 759 1180285 V | G Rv1057 . 297 Rv1057, (MTV017.10), len: 393 aa. Conserved hypothetical protein, some similarity to X84710|MMSAG_1 surface antigen of Methanosarcina mazeii (491 aa), FASTA scores: opt: 363, E():6.2e-15, (31.3% identity in 294 aa overlap). Regulated by MprA (Rv0981) under physiological conditions and environment... 760 1183947 V | G Rv1060 . 147 Rv1060, (MTV017.13), len: 157 aa. Unknown protein. 761 1185154 F | C Rv1062 . 91 Rv1062, (MTV017.15), len: 285 aa. Conserved hypothetical protein, some similarity to AL079356|SC6G9_10 hypothetical protein in Streptomyces coelicolor (289 aa),FASTA scores: opt: 556, E(): 1.2e-27, (39.0% identity in 287 aa overlap), and Z99111|BSUB0008_176 Bacillus subtilis (260aa), FASTA scores... 762 1188510 A | G,K Rv1067c . 639 PE-PGRS family protein PE_PGRS19 763 1188517 G | A,R,T Rv1067c . 637 PE-PGRS family protein PE_PGRS19 764 1188518 K | G,N,R,T Rv1067c . 636 PE-PGRS family protein PE_PGRS19 765 1188519 K | D,R,T,V Rv1067c . 636 PE-PGRS family protein PE_PGRS19 766 1189137 A | G,N,Q,R Rv1067c . 430 PE-PGRS family protein PE_PGRS19 767 1189138 A | G,N,Q Rv1067c . 430 PE-PGRS family protein PE_PGRS19 768 1189579 Q | G Rv1067c . 283 PE-PGRS family protein PE_PGRS19 769 1189580 G | E,P Rv1067c . 282 PE-PGRS family protein PE_PGRS19 770 1189588 A | G,P Rv1067c . 280 PE-PGRS family protein PE_PGRS19 771 1189591 G | P,R Rv1067c . 279 PE-PGRS family protein PE_PGRS19 772 1189985 T | I Rv1067c . 147 PE-PGRS family protein PE_PGRS19 773 1191000 S | A,R Rv1068c . 384 PE-PGRS family protein PE_PGRS20 774 1191351 N | C,H Rv1068c . 267 PE-PGRS family protein PE_PGRS20 775 1191362 G | A,L,R,S Rv1068c . 263 PE-PGRS family protein PE_PGRS20 776 1191366 G | A,R Rv1068c . 
262 PE-PGRS family protein PE_PGRS20 777 1191374 G | H,P,R,X Rv1068c . 259 PE-PGRS family protein PE_PGRS20 778 1191463 G | A Rv1068c . 229 PE-PGRS family protein PE_PGRS20 779 1212382 L | G,V Rv1087 . 275 PE-PGRS family protein PE_PGRS21 780 1213685 D | A,G,N,P,R,S,W Rv1087 . 709 PE-PGRS family protein PE_PGRS21 781 1217973 A | G Rv1091 . 502 PE-PGRS family protein PE_PGRS22 782 1218780 V | G Rv1091 . 771 PE-PGRS family protein PE_PGRS22 783 1219828 H | P Rv1092c coaA 120 Probable pantothenate kinase CoaA (pantothenic acid kinase) 784 1227830 D | E Rv1099c glpX 286 Fructose 1,6-bisphosphatase GlpX 785 1228116 I | T Rv1099c glpX 190 Fructose 1,6-bisphosphatase GlpX 786 1232568 Y | G Rv1105 . 86 Rv1105, (MTV017.58), len: 171 aa. Possible para-nitrobenzyl esterase (fragment; possibly second part) . Similar to C-terminal domain of many e.g. P71048 para-nitrobenzyl esterase from Bacillus subtilis (489 aa),FASTA scores: opt: 248, E(): 2.7e-10, (32.3% identity in 167 aa overlap). Gene may be ... 787 1233039 D | E Rv1106c . 307 Rv1106c, (MTV017.59c), len: 370 aa. 3-beta-hydroxysteroid dehydrogenase (see Yang et al.,2007). Equivalent to AL049491|MLCB1222_7 Mycobacterium leprae (376 aa) (75.5% identity in 375 aa overlap). Highly similar to Q03704 NAD(P)-dependent cholesterol dehydrogenase from Nocardia sp. (364 aa), FASTA... 788 1234262 A | G Rv1108c xseA 400 Probable exodeoxyribonuclease VII (large subunit) XseA (exonuclease VII large subunit) 789 1242967 V | A Rv1119c . 16 Rv1119c, (MTCY22G8.08c), len: 49 aa. Hypothetical unknown protein. 790 1246656 V | I Rv1123c bpoB 133 Possible peroxidase BpoB (non-haem peroxidase) 791 1250155 G | W Rv1127c ppdK 417 Probable pyruvate, phosphate dikinase PpdK 792 1252054 L | A,P,R,W Rv1128c . 307 Rv1128c, (MTCY22G8.17c), len: 451 aa. Conserved hypothetical protein, in REP13E12 degenerate repeat, highly similar to several Mycobacterium tuberculosis proteins in REP13E12 repeats e.g. Rv1148c, Rv1945, Rv3467, etc. 
793 1252879 T | H,P Rv1128c . 32 Rv1128c, (MTCY22G8.17c), len: 451 aa. Conserved hypothetical protein, in REP13E12 degenerate repeat, highly similar to several Mycobacterium tuberculosis proteins in REP13E12 repeats e.g. Rv1148c, Rv1945, Rv3467, etc. 794 1254144 T | A,P Rv1129c . 131 Rv1129c, (MTCY22G8.18c), len: 486 aa. Possible transcriptional regulator protein, similar to Rv0465c|MTV038.09c Mycobacterium tuberculosis (474 aa),FASTA scores: E(): 0, (47.4% identity in 468 aa overlap). Helix turn helix motif present from aa 32-53. 795 1254562 D | G Rv1130 prpD 3 Possible methylcitrate dehydratase PrpD 796 1274127 A | V Rv1146 . 258 Probable conserved transmembrane transport protein MmpL13b 797 1281118 T | A Rv1154c . 123 Hypothetical protein 798 1285001 T | P Rv1159 pimE 4 Mannosyltransferase PimE 799 1299305 P | L Rv1168c . 167 PPE family protein PPE17 800 1299338 A | G Rv1168c . 156 PPE family protein PPE17 801 1301909 D | G Rv1172c . 258 PE family protein PE12 802 1302014 G | * Rv1172c . 223 PE family protein PE12 803 1307140 G | R Rv1175c fadH 363 Probable NADPH dependent 2,4-dienoyl-CoA reductase FadH (2,4-dienoyl coenzyme A reductase) (4-enoyl-CoA reductase) 804 1307811 V | G Rv1175c fadH 139 Probable NADPH dependent 2,4-dienoyl-CoA reductase FadH (2,4-dienoyl coenzyme A reductase) (4-enoyl-CoA reductase) 805 1313247 V | G Rv1179c . 18 Rv1179c, MTV005.15c, len: 939 aa. Unknown protein. 806 1320614 L | V Rv1182 papA3 194 Probable conserved polyketide synthase associated protein PapA3 807 1337672 G | *,R,W Rv1194c . 281 Rv1194c, (MTCI364.06c), len: 421 aa. Conserved protein, highly similar to Q50018 possible transcriptional activator from Mycobacterium leprae (517 aa), FASTA scores: opt: 1960, E(): 0, (69.8% identity in 421 aa overlap). Also similar to Mycobacterium tuberculosis Rv2370c|MTCY27.10,(62.0% identity... 808 1339823 T | A Rv1196 . 159 PPE family protein PPE18 809 1340183 Q | E Rv1196 . 
279 PPE family protein PPE18 810 1341073 L | S Rv1198 esxL 23 Putative ESAT-6 like protein EsxL (ESAT-6 like protein 4) 811 1341102 R | C,S Rv1198 esxL 33 Putative ESAT-6 like protein EsxL (ESAT-6 like protein 4) 812 1347345 A | V Rv1204c . 427 Rv1204c, (MTCI364.16c), len: 562 aa. Conserved hypothetical protein, some similarity to Q55103 CHO-ORF2 from streptomyces SP. (642 aa), FASTA scores: opt: 215,E(): 3.6e-06, (26.4% identity in 576 aa overlap). Contains PS00017 ATP/GTP-binding site motif A. 813 1368322 G | P,Q Rv1225c . 134 Rv1225c, (MTCI61.08c), len: 276 aa. Conserved hypothetical protein, some similarity to other hypothetical proteins e.g. AE001078|AE001078_2 Archaeoglobus fulgidus (265 aa), FASTA scores: opt: 339, E(): 5.1e-15, (27.1% identity in 262 aa overlap), and to NAGD_ECOLI|P15302 nagd protein from Escheri... 814 1379834 E | *,L Rv1236 sugA 303 Probable sugar-transport integral membrane protein ABC transporter SugA 815 1382628 K | E Rv1239c corA 139 Possible magnesium and cobalt transport transmembrane protein CorA 816 1385073 T | A,E,P Rv1243c . 536 PE-PGRS family protein PE_PGRS23 817 1385225 D | N,T Rv1243c . 485 PE-PGRS family protein PE_PGRS23 818 1385251 D | A,G Rv1243c . 476 PE-PGRS family protein PE_PGRS23 819 1385255 G | A,P,R Rv1243c . 475 PE-PGRS family protein PE_PGRS23 820 1401033 S | L Rv1253 deaD 355 Probable cold-shock DeaD-box protein A homolog DeaD (ATP-dependent RNA helicase dead homolog) 821 1415443 D | A,G Rv1266c pknH 133 Probable transmembrane serine/threonine-protein kinase H PknH (protein kinase H) (STPK H) 822 1418863 A | G,P,V,W Rv1269c . 31 Rv1269c, (MTCY50.13), len: 124 aa. Conserved probable exported protein with putative N-terminal signal sequence. Similar to Mycobacterium tuberculosis protein Rv1813c|Y0DU_MYCTU|Q50620 hypothetical protein cy1a11.30 (137 aa), FASTA scores: E(): 9e-21, (41.6% identity in 137 aa overlap). 823 1418865 P | A,G,L,R,S,W Rv1269c . 30 Rv1269c, (MTCY50.13), len: 124 aa. 
Conserved probable exported protein with putative N-terminal signal sequence. Similar to Mycobacterium tuberculosis protein Rv1813c|Y0DU_MYCTU|Q50620 hypothetical protein cy1a11.30 (137 aa), FASTA scores: E(): 9e-21, (41.6% identity in 137 aa overlap). 824 1418868 A | G,L,P,Q,R,S,T,W Rv1269c . 29 Rv1269c, (MTCY50.13), len: 124 aa. Conserved probable exported protein with putative N-terminal signal sequence. Similar to Mycobacterium tuberculosis protein Rv1813c|Y0DU_MYCTU|Q50620 hypothetical protein cy1a11.30 (137 aa), FASTA scores: E(): 9e-21, (41.6% identity in 137 aa overlap). 825 1418870 A | G,H,L,P,S,T,V Rv1269c . 29 Rv1269c, (MTCY50.13), len: 124 aa. Conserved probable exported protein with putative N-terminal signal sequence. Similar to Mycobacterium tuberculosis protein Rv1813c|Y0DU_MYCTU|Q50620 hypothetical protein cy1a11.30 (137 aa), FASTA scores: E(): 9e-21, (41.6% identity in 137 aa overlap). 826 1419608 I | M,N,T,V Rv1270c lprA 48 Possible lipoprotein LprA 827 1436342 A | G,V Rv1283c oppB 259 Probable oligopeptide-transport integral membrane protein ABC transporter OppB 828 1437046 C | G Rv1283c oppB 25 Probable oligopeptide-transport integral membrane protein ABC transporter OppB 829 1446543 K | N Rv1292 argS 55 Probable arginyl-tRNA synthetase ArgS (ARGRS) (arginine--tRNA ligase) 830 1446733 A | T Rv1292 argS 119 Probable arginyl-tRNA synthetase ArgS (ARGRS) (arginine--tRNA ligase) 831 1480538 P | S Rv1318c . 96 Rv1318c, (MTCY130.03c), len: 541 aa. Possible adenylate cyclase. Some similarity at the C-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores, opt: 270, E(): 2.5e-11, (28.8% identity in 184 aa overlap); similar to other Mycobacterium tuberculosis putative... 832 1480713 R | W Rv1318c . 38 Rv1318c, (MTCY130.03c), len: 541 aa. Possible adenylate cyclase. 
Some similarity at the C-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores, opt: 270, E(): 2.5e-11, (28.8% identity in 184 aa overlap); similar to other Mycobacterium tuberculosis putative... 833 1480929 H | Y Rv1319c . 525 Rv1319c, (MTCY130.04c), len: 535 aa. Possible adenylate cyclase. Some similarity at the C-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores: opt: 254, E(): 2.4e-10, (33.3% identity in 144 aa overlap); similar to other Mycobacterium tuberculosis putative... 834 1480958 S | C,F Rv1319c . 515 Rv1319c, (MTCY130.04c), len: 535 aa. Possible adenylate cyclase. Some similarity at the C-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores: opt: 254, E(): 2.4e-10, (33.3% identity in 144 aa overlap); similar to other Mycobacterium tuberculosis putative... 835 1481135 A | P Rv1319c . 456 Rv1319c, (MTCY130.04c), len: 535 aa. Possible adenylate cyclase. Some similarity at the C-terminus to CYAA_RHIME|P19485 adenylate cyclase from Rhizobium meliloti (193 aa), FASTA scores: opt: 254, E(): 2.4e-10, (33.3% identity in 144 aa overlap); similar to other Mycobacterium tuberculosis putative... 836 1488381 G | R Rv1325c . 529 PE-PGRS family protein PE_PGRS24 837 1489254 A | G,T Rv1325c . 238 PE-PGRS family protein PE_PGRS24 838 1489273 P | A,C,G,Q,R,S Rv1325c . 232 PE-PGRS family protein PE_PGRS24 839 1489278 G | L,P,R,S Rv1325c . 230 PE-PGRS family protein PE_PGRS24 840 1492194 D | G Rv1326c glgB 40 1,4-alpha-glucan branching enzyme GlgB (glycogen branching enzyme) 841 1497581 S | L,P Rv1329c dinG 537 Probable ATP-dependent helicase DinG 842 1504334 E | G Rv1336 cysM 314 Cysteine synthase B CysM (CSASE B) (O-phosphoserine sulfhydrylase B) (O-phosphoserine (thiol)-lyase B) 843 1505034 A | T Rv1337 . 227 Rv1337, (MTCY130.22), len: 240 aa. Probable integral membrane protein. 
Highly similar to P53426 hypothetical protein B1549_C3_240 from M.leprae (251); and P74553|D90916 hypothetical protein from Synechocystis sp. (198 aa), FASTA scores: E(): 2.3e-25, (43.6% identity in 181 aa overlap). 844 1520500 L | A,G,S Rv1354c . 460 Rv1354c, (MTCY02B10.18c), len: 623 aa. Conserved hypothetical protein, similar to many hypothetical proteins e.g. the C-terminus of G1001455 Synechocystis sp. (1244 aa), FASTA scores: opt: 933, E(): 0, (36.8% identity in 462 aa overlap); also similar to Rv1357c|MTCY02B10.21c (34.0% identity in 25... 845 1540235 V | R Rv1367c . 138 Rv1367c, (MTCY02B12.01c,MTCY02B10.31c), len: 377 aa. Conserved protein. Some similarity to penicillin binding proteins e.g. PBPE_BACSU|P32959 penicillin-binding protein 4* (pbp 4*) from Bacillus subtilis (451 aa), FASTA scores: E(): 6.9e-06, (23.6% identity in 373 aa overlap). Similar to AL031107... 846 1541798 A | V Rv1368 lprF 260 Probable conserved lipoprotein LprF 847 1547125 T | A Rv1374c . 136 Rv1374c, (MTCY02B12.08c), len: 152 aa. Hypothetical unknown protein. 848 1561939 E | D Rv1387 . 57 PPE family protein PPE20 849 1563003 G | D Rv1387 . 412 PPE family protein PPE20 850 1571363 I | A,P Rv1395 . 106 Rv1395, (MTCY21B4.12), len: 344 aa. Transcriptional regulatory protein (see citation below), similar to many e.g. URER_PROMI|Q02458 urease operon transcriptional activator from Proteus mirabilis (293 aa), FASTA scores: E():1.5e-08, (41.7% identity in 84 aa overlap); YHIX_ECOLI|P37639 hypothetical... 851 1573160 V | G Rv1396c . 233 PE-PGRS family protein PE_PGRS25 852 1573268 V | G Rv1396c . 197 PE-PGRS family protein PE_PGRS25 853 1594906 V | I Rv1420 uvrC 289 Probable excinuclease ABC (subunit C-nuclease) UvrC 854 1598503 Q | R Rv1423 whiA 200 Probable transcriptional regulatory protein WhiA 855 1599440 Y | F,N,S Rv1424c . 72 Rv1424c, (MTCY21B4.42c,MTCY493.30), len: 253 aa. 
Possible membrane protein, contains PS00402 Binding-protein-dependent transport systems inner membrane comp signature. This region is a possible MT-complex-specific genomic island (See Becq et al.,2007). 856 1601528 C | Y Rv1426c lipO 265 Probable esterase LipO 857 1602027 T | H,P Rv1426c lipO 99 Probable esterase LipO 858 1612585 M | I,R Rv1435c . 123 Rv1435c, (MTCY493.19), len: 202 aa. Probable conserved Pro-, Gly-, Val-rich secreted protein (see citation below) with a N-terminal signal sequence. Similar at C-terminus to AF017099|AF017099_1 Mycobacterium tuberculosis pGB1 (87 aa), FASTA scores: opt: 550, E(): 2.3e-17, (97.7% identity in 86 aa... 859 1612595 G | A Rv1435c . 119 Rv1435c, (MTCY493.19), len: 202 aa. Probable conserved Pro-, Gly-, Val-rich secreted protein (see citation below) with a N-terminal signal sequence. Similar at C-terminus to AF017099|AF017099_1 Mycobacterium tuberculosis pGB1 (87 aa), FASTA scores: opt: 550, E(): 2.3e-17, (97.7% identity in 86 aa... 860 1612606 I | M Rv1435c . 116 Rv1435c, (MTCY493.19), len: 202 aa. Probable conserved Pro-, Gly-, Val-rich secreted protein (see citation below) with a N-terminal signal sequence. Similar at C-terminus to AF017099|AF017099_1 Mycobacterium tuberculosis pGB1 (87 aa), FASTA scores: opt: 550, E(): 2.3e-17, (97.7% identity in 86 aa... 861 1612618 A | P,R Rv1435c . 112 Rv1435c, (MTCY493.19), len: 202 aa. Probable conserved Pro-, Gly-, Val-rich secreted protein (see citation below) with a N-terminal signal sequence. Similar at C-terminus to AF017099|AF017099_1 Mycobacterium tuberculosis pGB1 (87 aa), FASTA scores: opt: 550, E(): 2.3e-17, (97.7% identity in 86 aa... 862 1618628 G | A,H,Q,R,T Rv1441c . 353 PE-PGRS family protein PE_PGRS26 863 1618642 D | A,Q Rv1441c . 348 PE-PGRS family protein PE_PGRS26 864 1619173 A | G Rv1441c . 
171 PE-PGRS family protein PE_PGRS26 865 1625327 N | D,G,H,L,R Rv1446c opcA 14 Putative OXPP cycle protein OpcA 866 1626857 V | I Rv1447c zwf2 36 Probable glucose-6-phosphate 1-dehydrogenase Zwf2 (G6PD) 867 1631318 G | A,P,R Rv1450c . 1104 PE-PGRS family protein PE_PGRS27 868 1631413 T | N Rv1450c . 1072 PE-PGRS family protein PE_PGRS27 869 1632025 S | T Rv1450c . 868 PE-PGRS family protein PE_PGRS27 870 1632510 N | A,G,V Rv1450c . 707 PE-PGRS family protein PE_PGRS27 871 1632518 N | D,G,H,P,R Rv1450c . 704 PE-PGRS family protein PE_PGRS27 872 1632540 G | A,P,R,T Rv1450c . 697 PE-PGRS family protein PE_PGRS27 873 1632542 G | A,P,R,V Rv1450c . 696 PE-PGRS family protein PE_PGRS27 874 1632748 A | G Rv1450c . 627 PE-PGRS family protein PE_PGRS27 875 1633527 G | A,V Rv1450c . 368 PE-PGRS family protein PE_PGRS27 876 1636161 A | G,K,R Rv1452c . 690 PE-PGRS family protein PE_PGRS28 877 1636166 G | N Rv1452c . 689 PE-PGRS family protein PE_PGRS28 878 1636463 T | A,G,P Rv1452c . 590 PE-PGRS family protein PE_PGRS28 879 1637146 D | H Rv1452c . 362 PE-PGRS family protein PE_PGRS28 880 1637207 V | G,X Rv1452c . 342 PE-PGRS family protein PE_PGRS28 881 1639333 T | I Rv1453 . 318 Rv1453, (MTCY493.01c), len: 421 aa. Possible transcriptional activator, similar to Q50018 putative transcriptional activator trx from Mycobacterium leprae (517 aa), FASTA scores: opt: 1719, E(): 0, (54.0% identity in 500 aa overlap). Also highly similar to Mycobacterium tuberculosis proteins Rv23... 882 1644248 P | D,H Rv1458c . 5 Rv1458c, (MTV007.05c), len: 313 aa. Possible unidentified antibiotic-transport ATP-binding protein ABC transporter (see citation below), equivalent to Z99125|MLCL536.31 from Mycobacterium leprae (315 aa), FASTA scores: opt: 1812, E(): 0, (88.0% identity in 308 aa overlap). Similar to AF027770|AF0... 
883 1655179 P | L,R,W,X Rv1467c fadE15 109 Probable acyl-CoA dehydrogenase FadE15 884 1655202 I | L Rv1467c fadE15 101 Probable acyl-CoA dehydrogenase FadE15 885 1655650 G | R Rv1468c . 358 PE-PGRS family protein PE_PGRS29 886 1655875 F | V Rv1468c . 283 PE-PGRS family protein PE_PGRS29 887 1664834 D | R Rv1475c acn 405 Probable iron-regulated aconitate hydratase Acn (citrate hydro-lyase) (aconitase) 888 1670818 G | S Rv1480 . 136 Rv1480, (MTV007.27,MTCY227.01), len: 317 aa. Conserved protein, last 110 aa residues correspond to first 110 aa of YS01_MYCAV|O07394 hypothetical 18.7 kDa Mycobacterium avium protein MAV169 (169 aa), FASTA scores: opt: 642, E(): 0, (84.2% identity in 114 aa overlap). Also similar to Mycobacterium... 889 1688649 A | P Rv1497 lipL 237 Probable esterase LipL 890 1692574 A | T Rv1501 . 229 Rv1501, (MTCY277.23), len: 273 aa. Conserved hypothetical protein, some similarity to O06374|Rv3633|MTCY15C10.19C hypothetical protein from Mycobacterium tuberculosis, FASTA scores: E(): 3.9e-10,(27.5% identity in 280 aa overlap). 891 1693561 Y | C Rv1502 . 213 Hypothetical protein 892 1694495 S | N Rv1503c . 17 Rv1503c, (MTCY277.25c), len: 182 aa. Conserved hypothetical protein, similar to C-terminal region of P27833|RFFA_ECOLI lipopolysaccharide biosynthesis protein from Escherichia coli (376 aa), FASTA scores: opt: 565,E(): 0, (49.4% identity in 170 aa overlap); Rv1503c and Rv1504c are both similar to... 893 1695796 A | T Rv1505c . 
51 hypothetical protein 894 1703770 M | V Rv1511 gmdA 233 GDP-D-mannose dehydratase GmdA (GDP-mannose 4,6 dehydratase) (GMD) 895 1712824 T | A Rv1521 fadD25 175 Probable fatty-acid-AMP ligase FadD25 (fatty-acid-AMP synthetase) (fatty-acid-AMP synthase) 896 1715939 N | A,F,P,Q Rv1522c mmpL12 559 Probable conserved transmembrane transport protein MmpL12 897 1715969 Q | *,H,P Rv1522c mmpL12 549 Probable conserved transmembrane transport protein MmpL12 898 1717378 E | F,L,R Rv1522c mmpL12 79 Probable conserved transmembrane transport protein MmpL12 899 1725954 T | S Rv1527c pks5 819 Probable polyketide synthase Pks5 900 1725966 H | P Rv1527c pks5 815 Probable polyketide synthase Pks5 901 1726802 A | P,Q Rv1527c pks5 537 Probable polyketide synthase Pks5 902 1736595 Y | S Rv1536 ileS 26 Isoleucyl-tRNA synthetase IleS 903 1739390 T | A Rv1536 ileS 958 Isoleucyl-tRNA synthetase IleS 904 1744023 T | H,P Rv1541c lprI 117 Possible lipoprotein LprI 905 1753526 C | P,S Rv1549 . 6 Possible fatty-acid-CoA ligase FadD11.1 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase) 906 1760292 M | V Rv1554 frdC 40 Probable fumarate reductase [membrane anchor subunit] FrdC (fumarate dehydrogenase) (fumaric hydrogenase) 907 1767572 V | L Rv1563c treY 621 Maltooligosyltrehalose synthase TreY 908 1767878 P | A Rv1563c treY 519 Maltooligosyltrehalose synthase TreY 909 1769293 I | K,N,S Rv1563c treY 47 Maltooligosyltrehalose synthase TreY 910 1779274 G | H,L,P,T Rv1572c . 9 Rv1572c, (MTCY336.31B), len: 34 aa. Partial ORF,part of REP13E12 repeat element; 3' end of Rv1587c (MTCY336.17) after phage-like element (see citation below). Similar to C-terminal ends of other REP13E12 repeat elements e.g. Rv1148, Rv1945, Rv3467, etc. Length extended since first submission (+7 ... 911 1779278 H | D,L,N,P,Q,R,S,Y Rv1572c . 8 Rv1572c, (MTCY336.31B), len: 34 aa. Partial ORF,part of REP13E12 repeat element; 3' end of Rv1587c (MTCY336.17) after phage-like element (see citation below). 
Similar to C-terminal ends of other REP13E12 repeat elements e.g. Rv1148, Rv1945, Rv3467, etc. Length extended since first submission (+7 ... 912 1779280 V | D,H,K,N,P,Q,R,S,T,X Rv1572c . 7 Rv1572c, (MTCY336.31B), len: 34 aa. Partial ORF,part of REP13E12 repeat element; 3' end of Rv1587c (MTCY336.17) after phage-like element (see citation below). Similar to C-terminal ends of other REP13E12 repeat elements e.g. Rv1148, Rv1945, Rv3467, etc. Length extended since first submission (+7 ... 913 1784238 S | L Rv1581c . 22 Rv1581c, (MTCY336.23), len: 131 aa. Probable phiRv1 phage protein (see citation below). This region is a possible MT-complex-specific genomic island (See Becq et al., 2007). 914 1789671 A | T Rv1588c . 56 Rv1588c, (MTCY336.16), len: 222 aa. Partial REP13E12 repeat protein (see citation below), nearly identical to ORF's in other Rep13E12 repeats, including Rv0095c|MTCY251.14c|Y05E_MYCTU|Q10891 hypothetical 15.4 kd protein cy251.14 from Mycobacterium tuberculosis (136 aa),FASTA results: opt: 613, E(... 915 1789813 L | R Rv1588c . 9 Rv1588c, (MTCY336.16), len: 222 aa. Partial REP13E12 repeat protein (see citation below), nearly identical to ORF's in other Rep13E12 repeats, including Rv0095c|MTCY251.14c|Y05E_MYCTU|Q10891 hypothetical 15.4 kd protein cy251.14 from Mycobacterium tuberculosis (136 aa),FASTA results: opt: 613, E(... 916 1792732 T | P Rv1592c . 337 Rv1592c, (MTCY336.12), len: 446 aa. Conserved hypothetical protein, some similarity to Q49629|B1170_F1_46 from Mycobacterium leprae (132 aa), FASTA results: opt: 332, E(): 4.5e-14, (56.3% identity in 87 aa overlap). Nearly identical to truncated Mycobacterium bovis BCG protein (148 aa) AF041819|A... 917 1794377 T | H,P Rv1593c . 111 Rv1593c, (MTCY336.11), len: 236 aa. Conserved protein, highly similar to Q49628|B1170_F1_44 from Mycobacterium leprae (286 aa), FASTA scores: opt: 1304, E (): 0, (85.4% identity in 233 aa overlap); similar to several putative DNA hydrolases e.g. 
Q9S233|SCI51.07C from Streptomyces coelicolor (239 ... 918 1803265 S | N Rv1602 hisH 201 Probable amidotransferase HisH 919 1804409 P | Q Rv1604 impA 124 Probable inositol-monophosphatase ImpA (imp) 920 1808795 D | A Rv1609 trpE 298 Anthranilate synthase component I TrpE (glutamine amidotransferase) 921 1817856 E | A Rv1618 tesB1 81 Probable acyl-CoA thioesterase II TesB1 922 1819619 A | T Rv1619 . 349 Rv1619, (MTCY01B2.11), len: 484 aa. Conserved membrane protein. Some similarity to N-terminus of P94974|Rv1640c|MTCY06H11.04c probable lysyl-tRNA synthetase 2 from Mycobacterium tuberculosis (1172 aa), FASTA scores: E(): 1.4e-16, (28.0% identity in 410 aa overlap); and similar in part to O69916| ... 923 1820436 F | L,P,V Rv1620c cydC 420 Probable 'component linked with the assembly of cytochrome' transport transmembrane ATP-binding protein ABC transporter CydC 924 1822330 V | G Rv1621c cydD 315 Probable 'component linked with the assembly of cytochrome' transport transmembrane ATP-binding protein ABC transporter CydD 925 1835569 A | G Rv1631 coaE 186 Probable dephospho-CoA kinase CoaE (dephosphocoenzyme a kinase) 926 1839759 G | R Rv1634 . 198 Possible drug efflux membrane protein 927 1843567 V | A,I,P Rv1637c . 43 Rv1637c, (MTCY01B2.29c,MTCY06H11.01c), len: 264 aa. Conserved protein, some similarity to others e.g. P05446|GLO2_RHOBL probable hydroxyacylglutathione hydrolase (255 aa), FASTA scores: opt: 252, E(): 2e-09, (39.0% identity in 146 aa overlap). Also similar to Q9Z505|AL035591|SCC54.20 putative hyd... 928 1843636 Q | *,D,E,H,I,M,P Rv1637c . 20 Rv1637c, (MTCY01B2.29c,MTCY06H11.01c), len: 264 aa. Conserved protein, some similarity to others e.g. P05446|GLO2_RHOBL probable hydroxyacylglutathione hydrolase (255 aa), FASTA scores: opt: 252, E(): 2e-09, (39.0% identity in 146 aa overlap). Also similar to Q9Z505|AL035591|SCC54.20 putative hyd... 929 1849191 I | V Rv1640c lysX 949 Lysyl-tRNA synthetase 2 LysX 930 1857117 A | G Rv1647 . 
115 Rv1647, (MTCY06H11.12), len: 316 aa. Adenylate cyclase, some similarity to other Mycobacterium tuberculosis proteins e.g. Q11055|Rv1264|YC64_MYCTU 42.2 kDa protein (397 aa), FASTA scores: opt: 197, E(): 9.4e-06,(27.1% identity in 181 aa overlap) and Q10400|Rv2212|YM12_MYCTU (378 aa). Belongs to a... 931 1857806 S | G Rv1648 . 26 Rv1648, (MTCY06H11.13), len: 268 aa. Probable transmembrane protein, some similarity to Rv3434c|MTCY77.06C (237 aa), FASTA scores: E(): 0.00039,(31.4% identity in 194 aa overlap). 932 1860392 A | V Rv1650 pheT 212 Probable phenylalanyl-tRNA synthetase, beta chain PheT 933 1861042 I | V Rv1650 pheT 429 Probable phenylalanyl-tRNA synthetase, beta chain PheT 934 1861804 G | C Rv1650 pheT 683 Probable phenylalanyl-tRNA synthetase, beta chain PheT 935 1863684 T | P Rv1651c . 567 PE-PGRS family protein PE_PGRS30 936 1864134 S | D,H,R,X Rv1651c . 417 PE-PGRS family protein PE_PGRS30 937 1865976 P | L Rv1652 argC 134 Probable N-acetyl-gamma-glutamyl-phosphate reductase ArgC 938 1867838 S | N Rv1653 argJ 403 Probable glutamate N-acetyltransferase ArgJ 939 1873954 R | L Rv1659 argH 439 Probable argininosuccinate lyase ArgH 940 1877744 E | A Rv1661 pks7 814 Probable polyketide synthase Pks7 941 1884260 E | K Rv1662 pks8 853 Probable polyketide synthase Pks8 942 1894296 G | P,R Rv1668c . 350 Rv1668c, (MTV047.04c), len: 372 aa. Probable first part of macrolide-transport ATP-binding protein ABC transporter (see citation below), similar to many ATP-binding proteins ABC transporter e.g. X80735|SEABCT_1|Q54072 Saccharopolyspora erythraea ertX gene (481 aa), FASTA scores: opt: 938, E(): 0,... 943 1894298 S | A,G,P,V Rv1668c . 349 Rv1668c, (MTV047.04c), len: 372 aa. Probable first part of macrolide-transport ATP-binding protein ABC transporter (see citation below), similar to many ATP-binding proteins ABC transporter e.g. X80735|SEABCT_1|Q54072 Saccharopolyspora erythraea ertX gene (481 aa), FASTA scores: opt: 938, E(): 0,...
944 1894301 P | A,D,G,K,L,Q,R,T Rv1668c . 348 Rv1668c, (MTV047.04c), len: 372 aa. Probable first part of macrolide-transport ATP-binding protein ABC transporter (see citation below), similar to many ATP-binding proteins ABC transporter e.g. X80735|SEABCT_1|Q54072 Saccharopolyspora erythraea ertX gene (481 aa), FASTA scores: opt: 938, E(): 0,... 945 1912228 T | K,S Rv1687c . 232 Rv1687c, (MTCI125.09c), len: 255 aa. Probable conserved ATP-binding protein ABC transporter (see citation below), similar to many ABC-type transporters e.g. P55476|NODI_RHISN nodulation ATP-binding protein I from Rhizobium sp. (343 aa), FASTA scores: opt: 479, E(): 3.7e-23, (34.6% identity in 243... 946 1912322 H | R Rv1687c . 200 Rv1687c, (MTCI125.09c), len: 255 aa. Probable conserved ATP-binding protein ABC transporter (see citation below), similar to many ABC-type transporters e.g. P55476|NODI_RHISN nodulation ATP-binding protein I from Rhizobium sp. (343 aa), FASTA scores: opt: 479, E(): 3.7e-23, (34.6% identity in 243... 947 1931718 L | V Rv1705c . 313 PPE family protein PPE22 948 1935437 P | S Rv1707 . 186 Rv1707, (MTCI125.29), len: 486 aa. Probable conserved transmembrane protein, possibly involved in transport of sulfate, similar to several hypothetical proteins belonging to the sulfate permease family e.g. P40877|YCHM_ECOLI hypothetical 58.4 kDa protein in pth-prsa intergenic region from Escheri... 949 1938311 V | I Rv1710 scpB 56 Possible segregation and condensation protein ScpB 950 1940549 A | T Rv1713 engA 88 Probable GTP-binding protein EngA 951 1945575 I | T Rv1718 . 256 Rv1718, (MTCY04C12.03), len: 272 aa. Conserved hypothetical protein, similar to O29058|AF1210|AE001021 Hypothetical protein from Archaeoglobus (313 aa), FASTA scores: opt: 301, E(): 8e-23, (31.6% identity in 301 aa overlap). 952 1947061 G | D,E,R,V Rv1720c vapC12 120 Possible toxin VapC12 953 1955913 A | G,K,L,Q,R,T Rv1730c . 445 Rv1730c, (MTCY04C12.15c), len: 517 aa.
Possible penicillin-binding protein, similar to others e.g. PBP4_NOCLA|Q06317 penicillin-binding protein 4 (pbp-4) from Nocardia lactamdurans (381 aa), FASTA scores: opt: 643,E(): 3.8e-32, (33.8% identity in 370 aa overlap); etc. Also similar to other Mycoba... 954 1955919 F | E,K,L,R Rv1730c . 443 Rv1730c, (MTCY04C12.15c), len: 517 aa. Possible penicillin-binding protein, similar to others e.g. PBP4_NOCLA|Q06317 penicillin-binding protein 4 (pbp-4) from Nocardia lactamdurans (381 aa), FASTA scores: opt: 643,E(): 3.8e-32, (33.8% identity in 370 aa overlap); etc. Also similar to other Mycoba... 955 1963674 S | R Rv1736c narX 172 Probable nitrate reductase NarX 956 1967237 R | L Rv1739c . 134 Rv1739c, (MTCY04C12.24c, MTCY28.01), len: 560 aa. Probable sulphate-transport transmembrane protein ABC transporter, similar to several e.g. P53392|G607186 high affinity sulphate transporter from Stylosanthes hamata (662 aa), FASTA scores: opt: 382, E(): 1.6e-16, (28.0% identity in 564 aa overlap... 957 1978607 V | I Rv1750c fadD1 321 Possible fatty-acid-CoA ligase FadD1 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase) 958 1979026 S | N Rv1750c fadD1 181 Possible fatty-acid-CoA ligase FadD1 (fatty-acid-CoA synthetase) (fatty-acid-CoA synthase) 959 1982770 T | S Rv1753c . 669 PPE family protein PPE24 960 1983433 V | G Rv1753c . 448 PPE family protein PPE24 961 1986637 R | F,G,K,N,Q,S,T,V Rv1754c . 12 Rv1754c, (MTCY28.17c), len: 563 aa. Conserved protein, has proline-rich central region. Some similarity in central region to other Mycobacterium tuberculosis proline-rich proteins e.g. O06555|Rv1157c|MTCI65.24c (371 aa), (32.5% identity in 191 aa overlap). Contains PS00017 ATP/GTP-binding site mo... 
962 1987008 P | G,H,L,R,X Rv1755c plcD 230 Probable phospholipase C 4 (fragment) PlcD 963 1987120 G | A,W Rv1755c plcD 193 Probable phospholipase C 4 (fragment) PlcD 964 1989064 D | A,E,F,G,P Rv1758 cut1 8 Probable cutinase Cut1 965 1989888 A | G Rv1759c wag22 897 PE-PGRS family protein Wag22 966 1990678 N | D Rv1759c wag22 634 PE-PGRS family protein Wag22 967 1991734 L | V Rv1759c wag22 282 PE-PGRS family protein Wag22 968 1992027 A | G Rv1759c wag22 184 PE-PGRS family protein Wag22 969 1992331 G | P,R Rv1759c wag22 83 PE-PGRS family protein Wag22 970 2002709 E | D Rv1769 . 28 Rv1769, (MTCY28.35), len: 414 aa. Conserved protein,similar to O88066|SCI35.31|AL031541 hypothetical protein from Streptomyces coelicolor (402 aa), FASTA scores: opt: 1341, E(): 0, (53.8% identity in 398 aa overlap). 971 2009854 A | V Rv1775 . 228 Rv1775, (MTCY25C11.02), unknown, len: 272 aa. Conserved hypothetical protein, similar to O28806|AF1466 conserved hypothetical protein from Archaeoglobus fulgidus (255 aa), FASTA scores: opt: 364, E(): 1e-17, (29.2% identity in 267 aa overlap). 972 2047033 S | E,I,K,P Rv1804c . 106 Rv1804c, (MTV049.26c), len: 108 aa. Conserved protein, similar to several hypothetical Mycobacterium tuberculosis proteins that may be exported (hydrophobic stretch at N-terminus) e.g. O07222|Rv1810|MTCY16F9.04C|Z96073 (118 aa), FASTA scores: opt: 361, E(): 2.3e-19, (53.5% identity in 101 aa over... 973 2049097 V | L Rv1807 . 234 PPE family protein PPE31 974 2050822 G | A Rv1808 . 301 PPE family protein PPE32 975 2053987 R | H Rv1811 mgtC 182 Possible Mg2+ transport P-type ATPase C MgtC 976 2054240 T | P Rv1812c . 374 Rv1812c, (MTCY16F9.02), len: 400 aa. Probable dehydrogenase, similar to other dehydrogenases/oxidases e.g. AE001947|AE001947_10 NADH dehydrogenase II of Deinococcus radiodurans (379 aa), FASTA scores: opt: 404,E(): 3.4e-18, (26.4% identity in 363 aa overlap) and DHNA_HAEIN|P44856 nadh dehydrogena... 977 2057591 P | S Rv1815 . 
22 Rv1815, (MTCY1A11.28c), len: 221 aa. Conserved protein, similar to G473456 hypothetical protein from Mycobacterium fortuitum (255 aa), FASTA scores: opt: 182,E(): 3.2e-05, (29.6% identity in 230 aa overlap). Alternative nucleotide at position 2057774 (a-T; I83F) has been observed. Predicted to be... 978 2061433 L | A,G,R Rv1818c . 415 PE-PGRS family protein PE_PGRS33 979 2061434 G | A,P Rv1818c . 414 PE-PGRS family protein PE_PGRS33 980 2061436 G | A,P,R Rv1818c . 414 PE-PGRS family protein PE_PGRS33 981 2061437 G | A,P,R Rv1818c . 413 PE-PGRS family protein PE_PGRS33 982 2061438 G | A,P,R Rv1818c . 413 PE-PGRS family protein PE_PGRS33 983 2061439 G | A,P,R,S,V Rv1818c . 413 PE-PGRS family protein PE_PGRS33 984 2061440 A | G,P,R Rv1818c . 412 PE-PGRS family protein PE_PGRS33 985 2061441 A | C,F,G,H,P,R,V Rv1818c . 412 PE-PGRS family protein PE_PGRS33 986 2061447 G | A,C,L,P,R Rv1818c . 410 PE-PGRS family protein PE_PGRS33 987 2061453 G | A,C,F,P,R Rv1818c . 408 PE-PGRS family protein PE_PGRS33 988 2061600 S | A Rv1818c . 359 PE-PGRS family protein PE_PGRS33 989 2085274 K | E,G Rv1837c glcB 570 Malate synthase G GlcB 990 2086672 G | S Rv1837c glcB 104 Malate synthase G GlcB 991 2088513 F | G,V Rv1840c . 336 PE-PGRS family protein PE_PGRS34 992 2089104 G | A,N Rv1840c . 139 PE-PGRS family protein PE_PGRS34 993 2094915 A | D,H,R,S Rv1844c gnd1 92 Probable 6-phosphogluconate dehydrogenase Gnd1 994 2098195 V | M Rv1850 ureC 79 Urease alpha subunit UreC (urea amidohydrolase) 995 2100004 V | G Rv1851 ureF 104 Urease accessory protein UreF 996 2104217 T | H,P Rv1856c . 203 Rv1856c, (MTCY359.17), len: 225 aa. Possible oxidoreductase. Equivalent to MLCB1788.11c|AL008609 oxidoreductase from Mycobacterium leprae (224 aa), FASTA scores: opt: 1211, E(): 0; (80.4% identity in 224 aa overlap). Some similarity to dehydrogenases of short-chain dehydrogenase/reductase family ... 997 2115075 R | G Rv1866 . 646 Rv1866, (MTCY359.07c), len: 778 aa. 
Conserved protein, N-terminal region similar to fatty acyl-CoA racemases e.g. Rv0855, Rv1143, and C-terminal region (from aa 370) similar to L-carnitine dehydratases, racemases, and Rv3272|MTCY71.12 Mycobacterium tuberculosis (394 aa), FASTA score: opt: 472, E(... 998 2132031 V | G Rv1881c lppE 100 Possible conserved lipoprotein LppE 999 2133465 G | C,V Rv1883c . 77 Rv1883c, (MTCY180.35), len: 153 aa. Conserved hypothetical protein, some similarity to hypothetical proteins e.g. Rv2778c|AL008967|MTV002.43 from Mycobacterium tuberculosis (156 aa), FASTA score: opt: 212, E(): 3.1e-08,(34.4% identity in 151 aa overlap). Also similar to U75434|SAU75434_3 Nsh-OrfB... 1000 2133467 P | M,S,T,V Rv1883c . 76 Rv1883c, (MTCY180.35), len: 153 aa. Conserved hypothetical protein, some similarity to hypothetical proteins e.g. Rv2778c|AL008967|MTV002.43 from Mycobacterium tuberculosis (156 aa), FASTA score: opt: 212, E(): 3.1e-08,(34.4% identity in 151 aa overlap). Also similar to U75434|SAU75434_3 Nsh-OrfB... 1001 2133468 P | *,C,M,R,S,V,W Rv1883c . 76 Rv1883c, (MTCY180.35), len: 153 aa. Conserved hypothetical protein, some similarity to hypothetical proteins e.g. Rv2778c|AL008967|MTV002.43 from Mycobacterium tuberculosis (156 aa), FASTA score: opt: 212, E(): 3.1e-08,(34.4% identity in 151 aa overlap). Also similar to U75434|SAU75434_3 Nsh-OrfB... 1002 2133469 E | A,C,D,G,M,S,V Rv1883c . 75 Rv1883c, (MTCY180.35), len: 153 aa. Conserved hypothetical protein, some similarity to hypothetical proteins e.g. Rv2778c|AL008967|MTV002.43 from Mycobacterium tuberculosis (156 aa), FASTA score: opt: 212, E(): 3.1e-08,(34.4% identity in 151 aa overlap). Also similar to U75434|SAU75434_3 Nsh-OrfB... 1003 2133472 C | A,D,E,F,L,R,T,V,Y Rv1883c . 74 Rv1883c, (MTCY180.35), len: 153 aa. Conserved hypothetical protein, some similarity to hypothetical proteins e.g. 
Rv2778c|AL008967|MTV002.43 from Mycobacterium tuberculosis (156 aa), FASTA score: opt: 212, E(): 3.1e-08,(34.4% identity in 151 aa overlap). Also similar to U75434|SAU75434_3 Nsh-OrfB... 1004 2133473 C | A,D,E,G,I,K,P,R,S Rv1883c . 74 Rv1883c, (MTCY180.35), len: 153 aa. Conserved hypothetical protein, some similarity to hypothetical proteins e.g. Rv2778c|AL008967|MTV002.43 from Mycobacterium tuberculosis (156 aa), FASTA score: opt: 212, E(): 3.1e-08,(34.4% identity in 151 aa overlap). Also similar to U75434|SAU75434_3 Nsh-OrfB... 1005 2133474 * | A,C,E,G,P,R,S Rv1883c . 74 Rv1883c, (MTCY180.35), len: 153 aa. Conserved hypothetical protein, some similarity to hypothetical proteins e.g. Rv2778c|AL008967|MTV002.43 from Mycobacterium tuberculosis (156 aa), FASTA score: opt: 212, E(): 3.1e-08,(34.4% identity in 151 aa overlap). Also similar to U75434|SAU75434_3 Nsh-OrfB... 1006 2133479 T | *,A,D,E,G,R,V Rv1883c . 72 Rv1883c, (MTCY180.35), len: 153 aa. Conserved hypothetical protein, some similarity to hypothetical proteins e.g. Rv2778c|AL008967|MTV002.43 from Mycobacterium tuberculosis (156 aa), FASTA score: opt: 212, E(): 3.1e-08,(34.4% identity in 151 aa overlap). Also similar to U75434|SAU75434_3 Nsh-OrfB... 1007 2134298 Y | C Rv1885c . 192 Rv1885c, (MTCY180.33), len: 199 aa. Chorismate mutase, AroQ class (See Prakash et al., 2005, Sasso et al.,2005), some similarity to P42517|CHMU_ERWHE monofunctional chorismate mutase (181 aa), FASTA score: opt: 181, E(): 0.00017, (28.6% identity in 133 aa overlap). Contains N-terminal signal sequ... 1008 2153739 D | E,L Rv1907c . 49 Rv1907c, (MTCY180.11), len: 215 aa. Hypothetical unknown protein. Similar to Q50763 Ethyl methane sulphonate resistance protein from Mycobacterium tuberculosis (168 aa), FASTA scores: opt: 638, E(): 0, (69.7% identity in 152 aa overlap). Downstream of a cloned katG gene (EMBL:mtkatg). Differences... 
1009 2155168 S | N,T Rv1908c katG 315 Catalase-peroxidase-peroxynitritase T KatG 1010 2161656 F | V Rv1916 aceAb 31 Probable isocitrate lyase AceAb [second part] (isocitrase) (isocitratase) (Icl) 1011 2161902 D | N Rv1916 aceAb 113 Probable isocitrate lyase AceAb [second part] (isocitrase) (isocitratase) (Icl) 1012 2163375 N | D,F Rv1917c . 1313 PPE family protein PPE34 1013 2163494 A | G Rv1917c . 1273 PPE family protein PPE34 1014 2164337 N | K,M,P Rv1917c . 992 PPE family protein PPE34 1015 2166863 V | A Rv1917c . 150 PPE family protein PPE34 1016 2172380 E | A Rv1920 . 253 Probable membrane protein 1017 2173033 W | R Rv1921c lppF 255 Probable conserved lipoprotein LppF 1018 2173273 Y | D Rv1921c lppF 175 Probable conserved lipoprotein LppF 1019 2175390 G | D Rv1923 lipD 73 Probable lipase LipD 1020 2180650 S | G,H,I,R Rv1928c . 190 Rv1928c, (MTCY09F9.36), len: 255 aa. Probable short-chain dehydrogenase/reductase, highly similar to others e.g. NP_228109.1|NC_000853 oxidoreductase (short chain dehydrogenase/reductase family) from Thermotoga maritima (257 aa); T41116 short chain dehydrogenase from Schizosaccharomyces pombe (26... 1021 2180813 N | A,E,K,L,P,R,W Rv1928c . 136 Rv1928c, (MTCY09F9.36), len: 255 aa. Probable short-chain dehydrogenase/reductase, highly similar to others e.g. NP_228109.1|NC_000853 oxidoreductase (short chain dehydrogenase/reductase family) from Thermotoga maritima (257 aa); T41116 short chain dehydrogenase from Schizosaccharomyces pombe (26... 1022 2180817 M | A,C,E,K,L,N,R,S,T Rv1928c . 134 Rv1928c, (MTCY09F9.36), len: 255 aa. Probable short-chain dehydrogenase/reductase, highly similar to others e.g. NP_228109.1|NC_000853 oxidoreductase (short chain dehydrogenase/reductase family) from Thermotoga maritima (257 aa); T41116 short chain dehydrogenase from Schizosaccharomyces pombe (26... 1023 2180818 M | A,E,G,K,L,R,T,W Rv1928c . 134 Rv1928c, (MTCY09F9.36), len: 255 aa. 
Probable short-chain dehydrogenase/reductase, highly similar to others e.g. NP_228109.1|NC_000853 oxidoreductase (short chain dehydrogenase/reductase family) from Thermotoga maritima (257 aa); T41116 short chain dehydrogenase from Schizosaccharomyces pombe (26... 1024 2180824 K | G,L,P,Q,R,W Rv1928c . 132 Rv1928c, (MTCY09F9.36), len: 255 aa. Probable short-chain dehydrogenase/reductase, highly similar to others e.g. NP_228109.1|NC_000853 oxidoreductase (short chain dehydrogenase/reductase family) from Thermotoga maritima (257 aa); T41116 short chain dehydrogenase from Schizosaccharomyces pombe (26... 1025 2181055 I | A,D,R Rv1928c . 55 Rv1928c, (MTCY09F9.36), len: 255 aa. Probable short-chain dehydrogenase/reductase, highly similar to others e.g. NP_228109.1|NC_000853 oxidoreductase (short chain dehydrogenase/reductase family) from Thermotoga maritima (257 aa); T41116 short chain dehydrogenase from Schizosaccharomyces pombe (26... 1026 2185938 A | G Rv1934c fadE17 84 Probable acyl-CoA dehydrogenase FadE17 1027 2188045 E | G,X Rv1936 . 221 Rv1936, (MTCY09F9.28c), len: 369 aa. Possible monooxygenase, similar to LXA2_PHOLU|P23146 alkanal monooxygenase alpha chain (362 aa), FASTA scores: opt: 196,E(): 6.3e-06, (22.3% identity in 373 aa overlap). Also similar to many other Mycobacterium tuberculosis hypothetical oxidoreductases and mon... 1028 2195395 N | K Rv1944c . 181 Rv1944c, (MTCY09F9.20), len: 196 aa. Conserved protein, similar to C-terminal part of SCE20.29|AL136058|CAB65585.1 hypothetical protein from Streptomyces coelicolor (338 aa), blastp scores, Identities = 37/131 (28%), Positives = 51/131 (38%). 1029 2199471 L | E,F,S Rv1949c . 189 Rv1949c, (MTCY09F9.15), len: 319 aa. Conserved hypothetical protein, partial ORF. Rv1949c and Rv1950c|MTCY09F9.14 are similar but frameshifted with respect to Rv2077c|MTCY49.16C|Q10685 hypothetical 33.3 kd protein (323 aa), FASTA scores: opt: 459, E(): 2.8e-16,(54.8% identity in 157 aa overlap). ... 
1030 2199476 F | E,G,H Rv1949c . 187 Rv1949c, (MTCY09F9.15), len: 319 aa. Conserved hypothetical protein, partial ORF. Rv1949c and Rv1950c|MTCY09F9.14 are similar but frameshifted with respect to Rv2077c|MTCY49.16C|Q10685 hypothetical 33.3 kd protein (323 aa), FASTA scores: opt: 459, E(): 2.8e-16,(54.8% identity in 157 aa overlap). ... 1031 2213713 Q | K Rv1969 mce3D 287 Mce-family protein Mce3D 1032 2217407 T | R Rv1973 . 82 Rv1973, (MTV051.11), len: 160 aa. Possible conserved Mce-associated membrane protein. Probably part of mce3 operon. Similar to several other proteins from Mycobacterium tuberculosis e.g. Rv1362c|Z75555|MTCY02B10.26C (220 aa), FASTA scores: opt: 378, E(): 2.8e-19, (50.0% identity in 128 aa overlap... 1033 2218604 P | S Rv1975 . 185 Rv1975, (MTV051.13), len: 221 aa. Conserved hypothetical protein, showing some similarity to AJ251435 hypothetical protein from Mycobacterium avium subsp. paratuberculosis (193 aa). Predicted to be an outer membrane protein (See Song et al., 2008). 1034 2222109 F | L,S Rv1979c . 353 Rv1979c, (MTCY39.40-MTV051.17c), len: 481 aa. Possible permease, APC family possibly involved in transport of amino acid, showing some similarity to other permeases. Also similar to MTCY39.19 from Mycobacterium tuberculosis (28.2% identity in 277 aa overlap). Contains PS00599 Aminotransferases cl... 1035 2227994 I | D,K,L,T,Y Rv1984c cfp21 190 Probable cutinase precursor CFP21 1036 2229864 L | F Rv1985c . 14 Rv1985c, (MTCY39.34), len: 303 aa. Probable transcriptional regulatory protein, LysR family member. Similar to many regulatory proteins, especially ICIA_ECOLI|P24194 chromosome initiation inhibitor from Escherichia coli (297 aa), FASTA scores: opt: 520, E(): 1.1e-28, (35.8% identity in 285 aa ove... 1037 2230045 C | Y Rv1986 . 12 Rv1986, (MTCY39.33c), len: 199 aa. 
Probable conserved integral membrane protein, LysE family possibly involved in transport of Lysine, similar to P11667|YGGA_ECOLI hypothetical 23.2 kDa protein in sbm-fba intergenic region (211 aa), FASTA scores: opt: 379, E(): 1.5e-19, (37.3% identity in 185 aa ... 1038 2235163 S | L,P Rv1992c ctpG 715 Probable metal cation transporter P-type ATPase G CtpG 1039 2239350 A | V Rv1996 . 116 Rv1996, (MTCY39.23c), len: 317 aa. Universal stress protein family protein. Similar to several Mycobacterium tuberculosis hypothetical proteins e.g. Rv2005c|Q10851|YK05_MYCTU (295 aa), FASTA scores: opt: 775,E(): 0, (50.3% identity in 316 aa overlap); Rv2026c (294 aa) (47.9% identity in 311 aa ov... 1040 2267949 T | D,E,G,H,R Rv2023c . 54 Rv2023c, (MTV018.10c), len: 119 aa. Hypothetical protein, alternative upstream start possible. 1041 2268725 C | A,L,Q,T,V,X,Y Rv2024c . 506 Rv2024c, (MTV018.11c), len: 515 aa. Conserved hypothetical protein. Identical to N-terminal part of much larger hypothetical protein, RvD1-Rv2024c' (1606 aa), from Mycobacterium bovis BCG: CAB44655.1|Y18605|13881753|AAK46361.1|AE007059 so probably truncated. Part of RvD1 chromosomal deletion region. 1042 2270101 W | * Rv2024c . 47 Rv2024c, (MTV018.11c), len: 515 aa. Conserved hypothetical protein. Identical to N-terminal part of much larger hypothetical protein, RvD1-Rv2024c' (1606 aa), from Mycobacterium bovis BCG: CAB44655.1|Y18605|13881753|AAK46361.1|AE007059 so probably truncated. Part of RvD1 chromosomal deletion region. 1043 2271498 A | V Rv2025c . 84 Rv2025c, (MTV018.12c), len: 332 aa. Conserved transmembrane protein, involved in transport of metal ions,contains IPR002524 Cation efflux protein domain. 1044 2274480 N | T Rv2027c dosT 10 Two component sensor histidine kinase DosT 1045 2282787 C | Y Rv2037c . 312 Conserved transmembrane protein 1046 2285251 V | F Rv2039c . 131 Probable sugar-transport integral membrane protein ABC transporter 1047 2287726 F | L,P,V Rv2041c . 
41 Rv2041c, (MTV018.28c), len: 439 aa. Probable sugar-binding lipoprotein component of sugar transport system, similar to many. Contains signal sequence and appropriately positioned PS00013 Prokaryotic membrane lipoprotein lipid attachment site. 1048 2288718 M | T,W Rv2043c pncA 175 Pyrazinamidase/nicotinamidase PncA (PZase) 1049 2288791 L | G,P Rv2043c pncA 151 Pyrazinamidase/nicotinamidase PncA (PZase) 1050 2288830 C | R Rv2043c pncA 138 Pyrazinamidase/nicotinamidase PncA (PZase) 1051 2288851 V | F,G Rv2043c pncA 131 Pyrazinamidase/nicotinamidase PncA (PZase) 1052 2289231 L | S,W Rv2043c pncA 4 Pyrazinamidase/nicotinamidase PncA (PZase) 1053 2300547 H | N Rv2048c pks12 2147 Polyketide synthase Pks12 1054 2304351 V | A,G Rv2048c pks12 879 Polyketide synthase Pks12 1055 2306452 S | G,L,N,P,T Rv2048c pks12 179 Polyketide synthase Pks12 1056 2306720 H | D,R Rv2048c pks12 90 Polyketide synthase Pks12 1057 2311875 A | T Rv2052c . 215 Rv2052c, (MTV018.39c), len: 534 aa. Conserved protein, similar to many. Contains IPR013108 Amidohydrolase 3 domain. 1058 2313206 G | S Rv2054 . 28 Rv2054, (MTCY63A.06c), len: 237 aa. Conserved protein, similar to many. Contains IPR002925 Dienelactone hydrolase domain. 1059 2328543 P | I,M Rv2071c cobM 145 Precorrin-3 methylase CobM (precorrin-4 C11-methyltransferase) 1060 2332093 G | R Rv2075c . 263 Rv2075c, (MTCY49.14c), len: 487 aa. Possibly exported or envelope protein; has potential signal peptide at N-terminus and hydrophobic stretch around residue 430. Predicted to be an outer membrane protein (See Song et al.,2008). 1061 2335075 E | G Rv2078 . 6 hypothetical protein 1062 2338188 H | G Rv2081c . 107 Rv2081c, (MTCY49.20c), len: 146 aa. Conserved transmembrane protein, similar to many. Hydrophobic stretch from aa 32-54. 1063 2338194 V | A,C,F,G,L,W Rv2081c . 105 Rv2081c, (MTCY49.20c), len: 146 aa. Conserved transmembrane protein, similar to many. Hydrophobic stretch from aa 32-54. 1064 2338200 G | V Rv2081c . 
103 Rv2081c, (MTCY49.20c), len: 146 aa. Conserved transmembrane protein, similar to many. Hydrophobic stretch from aa 32-54. 1065 2339493 T | S Rv2082 . 262 Rv2082, (MTCY49.21), len: 721 aa. Conserved hypothetical protein. Similar to Mycobacterium tuberculosis Rv0029, and to Rv3899c and Rv3900c which may be frameshifted. 1066 2339957 A | G,P,X Rv2082 . 417 Rv2082, (MTCY49.21), len: 721 aa. Conserved hypothetical protein. Similar to Mycobacterium tuberculosis Rv0029, and to Rv3899c and Rv3900c which may be frameshifted. 1067 2339981 P | S Rv2082 . 425 Rv2082, (MTCY49.21), len: 721 aa. Conserved hypothetical protein. Similar to Mycobacterium tuberculosis Rv0029, and to Rv3899c and Rv3900c which may be frameshifted. 1068 2346295 W | G,R,V Rv2089c pepE 344 Dipeptidase PepE 1069 2361392 S | P Rv2101 helZ 385 Probable helicase HelZ 1070 2372436 P | L,R Rv2112c dop 45 Deamidase of pup Dop 1071 2372460 C | A,F,G,R,Y Rv2112c dop 37 Deamidase of pup Dop 1072 2372503 V | A,C,G,R,S,W Rv2112c dop 23 Deamidase of pup Dop 1073 2377331 D | H Rv2117 . 62 Rv2117, (MTCY261.13), len: 97 aa. Conserved hypothetical protein, similar to many. 1074 2383235 F | G,L,V Rv2124c metH 945 5-methyltetrahydrofolate--homocysteine methyltransferase MetH (methionine synthase, vitamin-B12 dependent isozyme) (ms) 1075 2383922 L | A,M Rv2124c metH 716 5-methyltetrahydrofolate--homocysteine methyltransferase MetH (methionine synthase, vitamin-B12 dependent isozyme) (ms) 1076 2383923 L | A,R,V Rv2124c metH 716 5-methyltetrahydrofolate--homocysteine methyltransferase MetH (methionine synthase, vitamin-B12 dependent isozyme) (ms) 1077 2387449 D | A,G,V Rv2126c . 175 PE-PGRS family protein PE_PGRS37 1078 2395687 S | P,R Rv2135c . 109 Rv2135c, (MTCY270.33), len: 236 aa. Conserved protein. Function: unknown but equivalent to hypothetical Mycobacterium leprae protein, Q49773. 
FASTA best: Q49773 B2126_C1_148 opt: 1183, E() : 0; (74.8% identity in 250 aa overlap), also similar in C-terminus to PMG2_ECOLI P36942 probable phosphogly... 1079 2397760 A | G Rv2138 lppL 144 Probable conserved lipoprotein LppL 1080 2399734 G | S Rv2139 pyrD 339 Probable dihydroorotate dehydrogenase PyrD 1081 2401319 H | P Rv2141c . 135 Rv2141c, (MTCY270.27), len: 448 aa. Conserved protein. Shows some similarity to conserved hypothetical proteins and to acetylornithine deacetylase and succinyl-diaminopimelate desuccinylase and contains ArgE/dapE/ACY1/CPG2/yscS family signature 1 (PS00758). FASTA best: CBPS_YEAST P27614 carboxype... 1082 2406843 V | A,L,P,R Rv2147c . 1 Rv2147c, (MTCY270.21), len: 241 aa. Conserved hypothetical protein, similar to conserved hypothetical proteins in Mycobacterium leprae ML0920 (210 aa) and Streptomyces coelicolor. FASTA scores: emb|CAC31301.1| (AL583920) hypothetical protein ML0920 hypothetical protein (210 aa) opt: 1242, E(): 5.... 1083 2413246 V | L Rv2153c murG 36 Probable UDP-N-acetylglucosamine-N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol-N-acetylglucosamine transferase MurG 1084 2413615 D | E,T Rv2154c ftsW 437 FtsW-like protein FtsW 1085 2416167 S | F,L Rv2155c murD 76 Probable UDP-N-acetylmuramoylalanine-D-glutamate ligase MurD 1086 2421973 P | R Rv2160A . 111 Rv2160A, len: 211 aa. Conserved hypothetical protein, possibly a TetR-family transcriptional regulator,similar to N-terminal half of AL512667_12|Q9AD73|SCK31.01c putative TetR-family transcriptional regulator from Streptomyces coelicolor (200 aa), FASTA scores: opt: 285,E(): 1.4e-08, (51.042% ide... 1087 2423808 A | D,G Rv2162c . 344 PE-PGRS family protein PE_PGRS38 1088 2435457 L | R Rv2173 idsA2 204 Probable geranylgeranyl pyrophosphate synthetase IdsA2 (ggppsase) (GGPP synthetase) (geranylgeranyl diphosphate synthase) 1089 2461586 G | R Rv2197c . 188 Rv2197c, (MTCY190.08c), len: 214 aa. 
Probable conserved transmembrane protein, equivalent to ML0878 conserved hypothetical protein (212 aa) of Mycobacterium leprae. FASTA scores: opt: 858; 62.559% identity in 211 aa overlap CAC31259.1|(AL583920). A core mycobacterial gene; conserved in mycobacter... 1090 2466764 G | R Rv2201 asnB 590 Probable asparagine synthetase AsnB 1091 2511155 A | P,R Rv2238c ahpE 8 Probable peroxiredoxin AhpE 1092 2520761 E | K Rv2247 accD6 7 Acetyl/propionyl-CoA carboxylase (beta subunit) AccD6 1093 2534559 C | L,R Rv2262c . 332 Rv2262c, (MTV022.12c), len: 360 aa. Conserved hypothetical protein, with function unknown but some similarity to N-terminal 70% of P23930|P77703|LNT_ECOLI|cute|B0657 apolipoprotein N-acyltransferase from Escherichia coli strain K12 (512 aa), FASTA scores: opt: 239, E(): 1.6e-07, (30.4% identity i... 1094 2534560 C | L,R,S Rv2262c . 332 Rv2262c, (MTV022.12c), len: 360 aa. Conserved hypothetical protein, with function unknown but some similarity to N-terminal 70% of P23930|P77703|LNT_ECOLI|cute|B0657 apolipoprotein N-acyltransferase from Escherichia coli strain K12 (512 aa), FASTA scores: opt: 239, E(): 1.6e-07, (30.4% identity i... 1095 2534561 R | P Rv2262c . 331 Rv2262c, (MTV022.12c), len: 360 aa. Conserved hypothetical protein, with function unknown but some similarity to N-terminal 70% of P23930|P77703|LNT_ECOLI|cute|B0657 apolipoprotein N-acyltransferase from Escherichia coli strain K12 (512 aa), FASTA scores: opt: 239, E(): 1.6e-07, (30.4% identity i... 1096 2534564 L | P,R Rv2262c . 330 Rv2262c, (MTV022.12c), len: 360 aa. Conserved hypothetical protein, with function unknown but some similarity to N-terminal 70% of P23930|P77703|LNT_ECOLI|cute|B0657 apolipoprotein N-acyltransferase from Escherichia coli strain K12 (512 aa), FASTA scores: opt: 239, E(): 1.6e-07, (30.4% identity i... 1097 2534780 W | F,L,S Rv2262c . 258 Rv2262c, (MTV022.12c), len: 360 aa. 
Conserved hypothetical protein, with function unknown but some similarity to N-terminal 70% of P23930|P77703|LNT_ECOLI|cute|B0657 apolipoprotein N-acyltransferase from Escherichia coli strain K12 (512 aa), FASTA scores: opt: 239, E(): 1.6e-07, (30.4% identity i... 1098 2538210 E | D Rv2264c . 48 Rv2264c, (MTV022.14c), len: 592 aa. Conserved hypothetical Pro-rich protein, similar to hypothetical proteins Rv0312 (MTCY63.17, 620 aa and Rv0350) that has highly Pro-, Thr-rich C-terminus. Contains PS00343 Gram-positive cocci surface proteins 'anchoring' hexapeptide. FASTA scores: Z96800|MTCY63... 1099 2540441 L | P Rv2266 cyp124 113 Probable cytochrome P450 124 Cyp124 1100 2542964 T | N Rv2268c cyp128 438 Probable cytochrome P450 128 Cyp128 1101 2545215 A | D Rv2270 lppN 173 Probable lipoprotein LppN 1102 2548842 K | N Rv2276 cyp121 365 Cytochrome P450 121 Cyp121 1103 2550013 T | A,I,K,M,N,R,S Rv2277c . 6 Rv2277c, (MTCY339.33), len: 301 aa. Possible glycerolphosphodiesterase, similar to e.g. UGPQ_ECOLI P10908 glycerophosphoryldiester phosphodiesterase (cytosolic) (247 aa), FASTA scores, opt: 149, E(): 0.0061,(27.2% identity in 195 aa overlap). Start of protein uncertain, encoded by neighbouring IS... 1104 2551576 A | G Rv2280 . 6 Rv2280, (MTCY339.30c), len: 459 aa. Probable dehydrogenase. Similar to D-lactate dehydrogenase (cytochrome) precursor e.g. G1061264 (587 aa), FASTA scores, opt: 645,E(): 1.3e-31, (28.0% identity in 478 aa overlap), similar to MTCY50.25, 36.5% identity in 447 aa overlap 1105 2555381 G | P,R Rv2282c . 166 Rv2282c, (MTCY339.28), len: 312 aa. Probable transcriptional regulator, lysR family, similar to others e.g. YC30_CYAPA|P48271 hypothetical transcriptional regulator YCF30 (324 aa), FASTA scores: opt: 292, E(): 4e-12, (27.6% identity in 286 aa overlap); etc. Also similar to Rv0377|MTCY39.34 from M... 1106 2562644 A | S Rv2290 lppO 16 Probable conserved lipoprotein LppO 1107 2564930 C | G,R,V Rv2293c . 35 Rv2293c, (MTCY339.17), len: 246 aa. 
Conserved hypothetical protein; some similarity to hypothetical protein (299 aa) AAK24237.1| (AE005897) belonging to phosphorylase family [Caulobacter crescentus] (33% identity in 131 aa overlap). Possible lipoprotein: signal peptide at N-terminus 1108 2566396 R | L Rv2294 . 357 Rv2294, (MTCY339.16c), len: 407 aa. Probable aminotransferase, similar to others in M. tuberculosis e.g. MTV030_19, also similar to PATB_BACSU|Q08432 putative aminotransferase b from Bacillus subtilis (387 aa), FASTA scores: opt: 563, E(): 2.8e-29, (31.4% identity in 408 aa overlap); and to MALY_... 1109 2592476 H | R Rv2319c . 84 Rv2319c, (MTCY3G12.15), len: 292 aa. Universal stress protein family protein. 1110 2603290 V | *,L,R Rv2329c narK1 58 Probable nitrite extrusion protein 1 NarK1 (nitrite facilitator 1) 1111 2604160 L | C,H,V,W Rv2330c lppP 22 Probable lipoprotein LppP 1112 2604165 L | *,C,H,I,W Rv2330c lppP 20 Probable lipoprotein LppP 1113 2612019 L | A,G Rv2337c . 324 Rv2337c, (MTCY98.06c), len: 372 aa. Hypothetical unknown protein, sharing some similarity with Q9RI33|SCJ12.27c hypothetical 37.2 KDA protein from Streptomyces coelicolor (335 aa), blast scores: 134 and 46,(28% and 33% identity, 52% and 44% positive); FASTA scores: opt: 176, E(): 0.00042, (31.95%... 1114 2612448 F | L,N,P,R,T,V Rv2337c . 181 Rv2337c, (MTCY98.06c), len: 372 aa. Hypothetical unknown protein, sharing some similarity with Q9RI33|SCJ12.27c hypothetical 37.2 KDA protein from Streptomyces coelicolor (335 aa), blast scores: 134 and 46,(28% and 33% identity, 52% and 44% positive); FASTA scores: opt: 176, E(): 0.00042, (31.95%... 1115 2626513 T | P,S,X Rv2347c esxP 3 Putative ESAT-6 like protein EsxP (ESAT-6 like protein 7) 1116 2637954 S | D,G Rv2356c . 528 PPE family protein PPE40 1117 2638716 G | I,N,Q,R Rv2356c . 274 PPE family protein PPE40 1118 2638997 S | L Rv2356c . 180 PPE family protein PPE40 1119 2647913 T | K,N,P,S Rv2366c . 152 Rv2366c, (MTCY27.14), len: 435 aa. 
Probable conserved transmembrane protein, highly similar to Q9L2L3|SCC117.07 putative membrane protein from Streptomyces coelicolor (358 aa), FASTA scores: opt: 1159,E(): 5.5e-64, (53.0% identity in 353 aa overlap); and similar to hypothetical proteins and hemol... 1120 2650720 D | A,G Rv2370c . 280 Rv2370c, (MTCY27.10), len: 437 aa. Conserved hypothetical protein, member of family proteins from Mycobacterium tuberculosis with Rv1453|MTCY493_01c|O06807 conserved hypothetical protein from Mycobacterium tuberculosis (432 aa), FASTA scores: opt: 1943, E(): 9.4e-115, (69.9% identity in 409 aa ov... 1121 2685720 G | R Rv2391 sirA 348 Ferredoxin-dependent sulfite reductase SirA 1122 2688225 M | I Rv2394 ggtB 72 Probable gamma-glutamyltranspeptidase precursor GgtB (gamma-glutamyltransferase) (glutamyl transpeptidase) 1123 2695378 G | A Rv2398c cysW 141 Probable sulfate-transport integral membrane protein ABC transporter CysW 1124 2704884 H | A,L,P,Q,R,S Rv2407 . 63 Rv2407, (MTCY253.13c), len: 273 aa. Conserved hypothetical protein, highly similar (but longer at N-terminus) to AAK46775|MT2479 putative arylsulfatase from Mycobacterium tuberculosis strain CDC1551 (224 aa) FASTA scores: opt: 1433, E(): 2.5e-81, (96.43% identity in 224 aa overlap); O33130|MLCL53... 1125 2704886 G | A,D,E,L,P,R,S,T Rv2407 . 64 Rv2407, (MTCY253.13c), len: 273 aa. Conserved hypothetical protein 1126 2715958 F | K,L Rv2417c . 120 Rv2417c, (MTCY253.03), len: 280 aa. Conserved protein, highly similar to Q9RDL7|SCC123.07c hypothetical 29.2 KDA protein from Streptomyces coelicolor (281 aa),FASTA scores: opt: 579, E(): 3.6e-27, (38.3% identity in 274 aa overlap). Also some similarity with DEGV proteins or hypothetical proteins... 
1127 2725258 A | P,R,T Rv2427c proA 74 Probable gamma-glutamyl phosphate reductase protein ProA (GPR) (glutamate-5-semialdehyde dehydrogenase) (glutamyl-gamma-semialdehyde dehydrogenase) 1128 2727037 M | L Rv2429 ahpD 78 Alkyl hydroperoxide reductase D protein AhpD (alkyl hydroperoxidase D) 1129 2729620 A | *,G,S,T,V Rv2434c . 314 Rv2434c, (MTCY428.12), len: 481 aa. Probable conserved transmembrane protein, with some similarity to BAB48444|MLR0973 probable integral membrane protein from Rhizobium loti (410 aa), FASTA scores: opt: 298, E(): 4.1e-11, (27.25% identity in 389 aa overlap); and also similarity with other hypothe... 1130 2738274 R | S Rv2440c obg 471 Probable GTP1/Obg-family GTP-binding protein Obg 1131 2751670 R | L,P,S Rv2450c rpfE 171 Probable resuscitation-promoting factor RpfE 1132 2751963 N | T Rv2450c rpfE 73 Probable resuscitation-promoting factor RpfE 1133 2752122 T | R Rv2450c rpfE 20 Probable resuscitation-promoting factor RpfE 1134 2760152 Y | C Rv2458 mmuM 125 Probable homocysteine S-methyltransferase MmuM (S-methylmethionine-homocysteine methyltransferase) (cysteine methyltransferase) 1135 2766211 T | N Rv2463 lipP 186 Probable esterase/lipase LipP 1136 2768308 F | L,P,V Rv2466c . 193 Rv2466c, (MTV008.22c), len: 207 aa. Conserved protein (see citation below), equivalent to Q9CBY0|ML1485 hypothetical protein from Mycobacterium leprae (207 aa),FASTA scores: opt: 1154, E(): 1.1e-67, (80.6% identity in 206 aa overlap). Also highly similar to Q9L201|SC8E4A.04c hypothetical protein ... 
1137 2773380 H | T Rv2470 glbO 68 Globin (oxygen-binding protein) GlbO 1138 2773644 M | I Rv2471 aglA 27 Probable alpha-glucosidase AglA (maltase) (glucoinvertase) (glucosidosucrase) (maltase-glucoamylase) (lysosomal alpha-glucosidase) (acid maltase) 1139 2786952 C | R Rv2482c plsB2 778 Probable glycerol-3-phosphate acyltransferase PlsB2 (GPAT) 1140 2788298 D | G Rv2482c plsB2 329 Probable glycerol-3-phosphate acyltransferase PlsB2 (GPAT) 1141 2790458 C | G Rv2483c plsC 189 Possible transmembrane phospholipid biosynthesis bifunctional enzyme PlsC 1142 2793444 N | S Rv2485c lipQ 182 Probable carboxylesterase LipQ 1143 2793706 L | F Rv2485c lipQ 95 Probable carboxylesterase LipQ 1144 2795470 S | N Rv2487c . 639 PE-PGRS family protein PE_PGRS42 1145 2795910 E | A,D,H Rv2487c . 493 PE-PGRS family protein PE_PGRS42 1146 2796215 A | C,G,P,R Rv2487c . 391 PE-PGRS family protein PE_PGRS42 1147 2798756 G | P,R,W Rv2488c . 709 Rv2488c, (MTV008.44c), len: 1137 aa. Probable transcriptional regulatory protein, belonging to luxR family, similar to many in Mycobacterium tuberculosis e.g. AAK44621|MT0399 from strain CDC1551 (1092 aa) FASTA scores: opt: 3767, E(): 1.8e-211, (56.75% identity in 1093 aa overlap); O53720|Rv0386|... 1148 2801426 D | G Rv2490c . 1604 PE-PGRS family protein PE_PGRS43 1149 2801730 W | G Rv2490c . 1503 PE-PGRS family protein PE_PGRS43 1150 2802266 V | G Rv2490c . 1324 PE-PGRS family protein PE_PGRS43 1151 2804166 N | A,V,X Rv2490c . 691 PE-PGRS family protein PE_PGRS43 1152 2804784 L | V Rv2490c . 485 PE-PGRS family protein PE_PGRS43 1153 2805290 A | G Rv2490c . 
316 PE-PGRS family protein PE_PGRS43 1154 2809621 T | A Rv2495c bkdC 107 Probable branched-chain keto acid dehydrogenase E2 component BkdC 1155 2817158 G | I,M Rv2502c accD1 439 Probable acetyl-/propionyl-CoA carboxylase (beta subunit) AccD1 1156 2817502 F | V Rv2502c accD1 325 Probable acetyl-/propionyl-CoA carboxylase (beta subunit) AccD1 1157 2819180 V | L,M Rv2504c scoA 231 Probable succinyl-CoA:3-ketoacid-coenzyme A transferase (alpha subunit) ScoA (3-oxo acid:CoA transferase) (OXCT A) (succinyl-CoA:3-oxoacid-coenzyme A transferase) 1158 2819440 V | C,L,W Rv2504c scoA 144 Probable succinyl-CoA:3-ketoacid-coenzyme A transferase (alpha subunit) ScoA (3-oxo acid:CoA transferase) (OXCT A) (succinyl-CoA:3-oxoacid-coenzyme A transferase) 1159 2833329 A | V Rv2516c . 62 Rv2516c, (MTV009.01c), len: 267 aa. Hypothetical unknown protein. Contains probable helix-turn-helix motif at aa 98 to 119 (Score 1743, +5.12 SD). C-terminus extended since first submission (+ 18 aa). 1160 2838897 A | I,M Rv2522c . 215 hypothetical protein 1161 2841022 C | R Rv2524c fas 2771 Probable fatty acid synthase Fas (fatty acid synthetase) 1162 2851768 F | I,L,P Rv2528c mrr 302 Probable restriction system protein Mrr 1163 2856314 F | I,V Rv2531c . 490 Rv2531c, (MTCY159.25), len: 947 aa. Probable amino acid decarboxylase, equivalent to Q9CCR8|adi|ML0524 putative amino acid decarboxylase from Mycobacterium leprae (950 aa), FASTA scores: opt: 5426, E(): 0, (86.45% identity in 951 aa overlap). Also similar to other amino acid decarboxylases (but l... 
1164 2861535 R | A,G Rv2537c aroD 20 3-dehydroquinate dehydratase AroD (AROQ) (3-dehydroquinase) (type II dhqase) 1165 2866744 D | N Rv2543 lppA 93 Probable conserved lipoprotein LppA 1166 2866745 D | A Rv2543 lppA 93 Probable conserved lipoprotein LppA 1167 2866749 D | E Rv2543 lppA 94 Probable conserved lipoprotein LppA 1168 2866805 A | G Rv2543 lppA 113 Probable conserved lipoprotein LppA 1169 2866876 I | V Rv2543 lppA 137 Probable conserved lipoprotein LppA 1170 2866880 A | V Rv2543 lppA 138 Probable conserved lipoprotein LppA 1171 2866882 A | T Rv2543 lppA 139 Probable conserved lipoprotein LppA 1172 2867230 G | D Rv2544 lppB 36 Probable conserved lipoprotein LppB 1173 2867298 H | N Rv2544 lppB 59 Probable conserved lipoprotein LppB 1174 2867347 Q | R Rv2544 lppB 75 Probable conserved lipoprotein LppB 1175 2867405 D | E Rv2544 lppB 94 Probable conserved lipoprotein LppB 1176 2867532 I | V Rv2544 lppB 137 Probable conserved lipoprotein LppB 1177 2867536 A | V Rv2544 lppB 138 Probable conserved lipoprotein LppB 1178 2867538 A | T Rv2544 lppB 139 Probable conserved lipoprotein LppB 1179 2867575 V | A Rv2544 lppB 151 Probable conserved lipoprotein LppB 1180 2876239 F | G,I,V Rv2555c alaS 83 Probable alanyl-tRNA synthetase AlaS (alanine--tRNA ligase) (alanine translase) (ALARS) 1181 2876703 A | G Rv2556c . 88 Rv2556c, (MTCY09C4.12), len: 129 aa. Conserved hypothetical protein, highly similar to others e.g. Q9EWY5|2SCG38.34 conserved hypothetical protein from Streptomyces coelicolor (140 aa), FASTA scores: opt: 488,E(): 8.2e-26, (58.8% identity in 131 aa overlap); Q9L9G4|NOVD NOVD protein from Streptom... 1182 2880702 V | L Rv2560 . 210 Rv2560, (MTCY9C4.08c), len: 325 aa. Probable transmembrane protein, pro-, gly-rich protein. 1183 2898136 A | T Rv2573 . 32 Rv2573, (MTCY227.28c), len: 246 aa. Conserved hypothetical protein, similar to various proteins e.g. 
Q9ABG6|CC0261 hypothetical protein from Caulobacter crescentus (290 aa), FASTA scores: opt: 516, E(): 5.8e-26,(40.1% identity in 237 aa overlap); Q99R37|SA2393 hypothetical protein (similar to 2-d... 1184 2913087 A | T Rv2586c secF 309 Probable protein-export membrane protein SecF 1185 2935649 N | L,R,S,T Rv2608 . 202 PPE family protein PPE42 1186 2944220 N | D Rv2615c . 256 PE-PGRS family protein PE_PGRS45 1187 2947246 W | C,G Rv2619c . 69 Rv2619c, (MTCY01A10.14), len: 117 aa. Conserved protein, highly similar to Q9L0F3|SCD31.14 hypothetical 11.6 KDA protein from Streptomyces coelicolor (110 aa),FASTA scores: opt: 407, E(): 2.3e-21, (55.95% identity in 109 aa overlap). Also similarity with other short bacterial hypothetical protein... 1188 2949033 A | V Rv2622 . 133 Rv2622, (MTCY01A10.10c), len: 273 aa. Possible methyltransferase, similar in part to others e.g. AAK75664|SP1578 putative methyltransferase from Streptococcus pneumoniae (252 aa), FASTA scores: opt: 406,E(): 6.6e-18, (32.65% identity in 251 aa overlap); Q9F8B8 methyltransferase from Streptococcus... 1189 2950630 L | F,G,P,R,V Rv2624c . 227 Rv2624c, (MTCY01A10.08), len: 272 aa. Universal stress protein family protein, similar to several Streptomyces proteins e.g. Q9RIY5|SCJ1.29c hypothetical 30.1 KDA protein from Streptomyces coelicolor (283 aa),FASTA scores: opt: 260, E(): 5e-09, (32.05% identity in 290 aa overlap). Also similar to... 1190 2954439 R | G Rv2627c . 104 Rv2627c, (MTCY01A10.05), len: 413 aa. Conserved protein. Some similarity in C-terminal part of O53697|Rv0293c|MTV035.21c hypothetical 44.0 KDA protein from Mycobacterium tuberculosis (400 aa), FASTA scores: opt: 392, E(): 1.9e-17, (31.1% identity in 299 aa overlap). Alternative nucleotide at posi... 1191 2955957 D | A Rv2629 . 64 Rv2629, (MTCY01A10.03c), len: 374 aa. 
Conserved protein, similar to Q9ZC00|SC1E6.22c hypothetical 40.7 KDA protein from Streptomyces coelicolor (373 aa), FASTA scores: opt: 425, E(): 2.5e-18, (30.2% identity in 371 aa overlap). Predicted possible vaccine candidate (See Zvi et al., 2008). 1192 2961011 D | E Rv2634c . 478 PE-PGRS family protein PE_PGRS46 1193 2961721 T | D,P Rv2634c . 241 PE-PGRS family protein PE_PGRS46 1194 2965278 R | A,G Rv2639c . 28 Rv2639c, (MTCY441.09c), len: 110 aa. Probable conserved integral membrane protein, highly similar to many bacterial hypothetical or membrane proteins e.g. Q9X889|YE14_STRCO|SCE15.14 potential integral membrane protein from Streptomyces coelicolor (112 aa), FASTA scores: opt: 597, E(): 3.1e-31, (7... 1195 2967747 G | C Rv2643 arsC 280 Probable arsenic-transport integral membrane protein ArsC 1196 2982959 G | X Rv2665 . 87 Rv2665, (MTCY441.34), len: 93 aa. Hypothetical arg-rich protein, showing some similarity to N-terminus of P71640|Rv2811|MTCY16B7.32c hypothetical 21.1 KDA protein from Mycobacterium tuberculosis (202 aa), FASTA scores: opt: 157, E(): 0.0011, (37.5% identity in 72 aa overlap); and also to part of ... 1197 2983095 T | A Rv2666 . 9 Probable transposase for insertion sequence element IS1081 (fragment) 1198 2984740 H | R Rv2668 . 3 Possible exported alanine and valine rich protein 1199 2988630 H | D Rv2672 . 317 Possible secreted protease 1200 2994455 A | *,T,W Rv2678c hemE 203 Probable uroporphyrinogen decarboxylase HemE (uroporphyrinogen III decarboxylase) (URO-D) (UPD) 1201 3017465 I | T Rv2702 ppgK 203 Polyphosphate glucokinase PpgK (polyphosphate-glucose phosphotransferase) 1202 3027798 V | A Rv2714 . 245 Conserved alanine and leucine rich protein 1203 3031168 Y | H Rv2719c . 
124 Possible conserved membrane protein 1204 3048917 A | G Rv2736c recX 57 Regulatory protein RecX 1205 3062139 M | T Rv2748c ftsK 123 Possible cell division transmembrane protein FtsK 1206 3074408 T | A,P Rv2764c thyA 22 Probable thymidylate synthase ThyA (ts) (TSASE) 1207 3083640 P | G,T Rv2776c . 222 Rv2776c, (MTV002.41c), len: 309 aa. Probable oxidoreductase, similar to other oxidoreductases e.g. Q9KZ15|SC10B7.17 putative iron-sulfur oxidoreductase from Streptomyces coelicolor (364 aa), FASTA scores: opt: 846,E(): 1.2e-45, (46.75% identity in 308 aa overlap); O88034|SC5A7.28c iron-sulfur oxi... 1208 3097404 P | L Rv2788 sirR 149 Probable transcriptional repressor SirR 1209 3105899 W | L Rv2797c . 470 Rv2797c, (MTCY16B7.46), len: 562 aa. Conserved hypothetical ala-rich protein. C-terminus highly similar to several mycobacterial proteins e.g. AAK46927|MT2616 hypothetical 28.0 KDA protein from Mycobacterium tuberculosis strain CDC1551 (265 aa), FASTA scores: opt: 535, E(): 4.6e-22, (42.95% ident... 1210 3106766 L | P,R Rv2797c . 181 Rv2797c, (MTCY16B7.46), len: 562 aa. Conserved hypothetical ala-rich protein. C-terminus highly similar to several mycobacterial proteins e.g. AAK46927|MT2616 hypothetical 28.0 KDA protein from Mycobacterium tuberculosis strain CDC1551 (265 aa), FASTA scores: opt: 535, E(): 4.6e-22, (42.95% ident... 1211 3107360 A | G,V Rv2798c . 93 Rv2798c, (MTCY16B7.45), len: 108 aa. Conserved hypothetical ala-rich protein, similar to P71545|Y965_MYCTU|Rv0965c|MT0993|MTCY10D7.09 hypothetical 14.5 KDA protein from Mycobacterium tuberculosis (139 aa),FASTA scores: opt: 198, E(): 8e-07, (38.9% identity in 90 aa overlap). 1212 3126525 K | *,E,H,N,P Rv2819c . 282 Rv2819c, (MTCY16B7.23), len: 375 aa. Hypothetical unknown protein (see citations below). This region is a possible MT-complex-specific genomic island (See Becq et al., 2007). 1213 3128984 K | P Rv2822c . 122 Rv2822c, (MTCY16B7.20), len: 124 aa. Hypothetical unknown protein. 
This region is a possible MT-complex-specific genomic island (See Becq et al.,2007). 1214 3131465 A | Q Rv2823c . 104 Rv2823c, (MTCY16B7.19), len: 809 aa. Conserved protein, similar in part to others e.g. Q9X2D1|TM1811Thermotoga maritima (717 aa), FASTA scores: opt: 401, E(): 3.6e-18, (27.15% identity in 773 aa overlap); O27154|MTH1082 conserved hypothetical protein from Methanothermobacter thermautotrophicus (8... 1215 3131468 I | L,Y Rv2823c . 103 Rv2823c, (MTCY16B7.19), len: 809 aa. Conserved protein, similar in part to others e.g. Q9X2D1|TM1811Thermotoga maritima (717 aa), FASTA scores: opt: 401, E(): 3.6e-18, (27.15% identity in 773 aa overlap); O27154|MTH1082 conserved hypothetical protein from Methanothermobacter thermautotrophicus (8... 1216 3131470 N | A,H,I,Q,R,T Rv2823c . 102 Rv2823c, (MTCY16B7.19), len: 809 aa. Conserved protein, similar in part to others e.g. Q9X2D1|TM1811Thermotoga maritima (717 aa), FASTA scores: opt: 401, E(): 3.6e-18, (27.15% identity in 773 aa overlap); O27154|MTH1082 conserved hypothetical protein from Methanothermobacter thermautotrophicus (8... 1217 3133055 S | W Rv2825c . 162 hypothetical protein 1218 3141550 F | G,I,L,V Rv2835c ugpA 225 Probable Sn-glycerol-3-phosphate transport integral membrane protein ABC transporter UgpA 1219 3159999 L | C Rv2850c . 19 Rv2850c, (MTCY24A1.07), len: 629 aa. Possible magnesium-chelatase, highly similar (but with gaps) to magnesium-chelatases from notably photosynthetic organisms involved in chlorophyll biosynthesis e.g. Q9RJ18|SCI8.35c putative chelatase from Streptomyces coelicolor (672 aa),FASTA scores: opt: 194... 1220 3160000 L | C,G,R,V Rv2850c . 19 Rv2850c, (MTCY24A1.07), len: 629 aa. Possible magnesium-chelatase, highly similar (but with gaps) to magnesium-chelatases from notably photosynthetic organisms involved in chlorophyll biosynthesis e.g. Q9RJ18|SCI8.35c putative chelatase from Streptomyces coelicolor (672 aa),FASTA scores: opt: 194... 
1221 3162021 A | G Rv2852c mqo 14 Probable malate:quinone oxidoreductase Mqo (malate dehydrogenase [acceptor]) 1222 3163778 E | A Rv2853 . 504 PE-PGRS family protein PE_PGRS48 1223 3170693 T | H,P Rv2858c aldC 11 Probable aldehyde dehydrogenase AldC 1224 3178971 D | E Rv2867c . 115 Rv2867c, (MTV003.13c), len: 284 aa. Probable acetyltransferase. Contains GNAT (Gcn5-related N-acetyltransferase) domain in C-terminal part. See Vetting et al. 2005. Similar to others e.g. Q9KYR8|SC5H4.21 hypothetical 31.3 KDA protein from Streptomyces coelicolor (287 aa), FASTA scores: opt: 798, ... 1225 3186091 N | K Rv2874 dipZ 415 Possible integral membrane C-type cytochrome biogenesis protein DipZ 1226 3192198 I | A,V Rv2882c frr 2 Ribosome recycling factor Frr (ribosome releasing factor) (RRF) 1227 3196088 I | M Rv2886c . 116 Rv2886c, (MTCY274.17c), len: 295 aa. Probable resolvase for IS1539. Contains PS00213 Lipocalin signature. 1228 3197093 Y | C Rv2888c amiC 398 Probable amidase AmiC (aminohydrolase) 1229 3197221 L | F,V,W,X Rv2888c amiC 356 Probable amidase AmiC (aminohydrolase) 1230 3198692 I | K,N,S Rv2889c tsf 139 Probable elongation factor Tsf (EF-ts) 1231 3201110 V | G Rv2892c . 304 PPE family protein PPE45 1232 3204810 D | E Rv2895c viuB 142 Possible mycobactin utilization protein ViuB 1233 3207393 T | H,P Rv2897c . 184 Rv2897c, (MTCY274.28c), len: 503 aa. Conserved hypothetical protein, possibly Mg-chelatase, highly similar to hypothetical proteins and chelatases e.g. Q9RTV0|DR1656 mg(2+) chelatase family protein from Deinococcus radiodurans (519 aa), FASTA scores: opt: 1333, E(): 3.6e-68, (46.55% identity in 5... 1234 3209156 A | V Rv2899c fdhD 84 Possible FdhD protein homolog 1235 3221002 S | R Rv2913c . 233 Rv2913c, (MTCY338.01c, MTCY274.45c), len: 611 aa. Possible D-amino acid aminohydrolase, similar (principally in N-terminus) to D-amino acid aminohydrolases e.g. 
Q9V2D3|NDAD|PAB0090 D-aminoacylase (aspartate, glutamate etc) from Pyrococcus abyssi (526 aa), FASTA scores: opt: 336, E(): 2.2e-13, (27...
1236 3223038 A | F,L,S Rv2914c pknI 163 Probable transmembrane serine/threonine-protein kinase I PknI (protein kinase I) (STPK I) (phosphorylase B kinase kinase) (hydroxyalkyl-protein kinase)
1237 3232007 H | P Rv2920c amt 167 Probable ammonium-transport integral membrane protein Amt
1238 3232407 F | I,N,V Rv2920c amt 34 Probable ammonium-transport integral membrane protein Amt
1239 3233940 A | G Rv2921c ftsY 67 Probable cell division protein FtsY (SRP receptor) (signal recognition particle receptor)
1240 3238123 T | A,G,H,L,R,S,V,W Rv2923c . 126 Rv2923c, (MTCY338.12c), len: 137 aa. Conserved protein, showing similarity with other hypothetical proteins e.g. P24246|YHFA_ECOLI|B3356|Z4717|ECS4207 from Escherichia coli strains K12 and O157:H7 (134 aa), FASTA scores: opt: 110, E(): 1.9, (25.9% identity in 135 aa overlap); etc.
1241 3244126 V | I Rv2930 fadD26 144 Fatty-acid-AMP ligase FadD26 (fatty-acid-AMP synthetase) (fatty-acid-AMP synthase)
1242 3248301 T | N,P Rv2931 ppsA 953 Phenolpthiocerol synthesis type-I polyketide synthase PpsA
1243 3248303 T | N Rv2931 ppsA 953 Phenolpthiocerol synthesis type-I polyketide synthase PpsA
1244 3254880 R | L Rv2932 ppsB 1270 Phenolpthiocerol synthesis type-I polyketide synthase PpsB
1245 3266769 I | L Rv2934 ppsD 1508 Phenolpthiocerol synthesis type-I polyketide synthase PpsD
1246 3275013 V | G Rv2939 papA5 22 Possible conserved polyketide synthase associated protein PapA5
1247 3276861 E | G Rv2940c mas 1952 Probable multifunctional mycocerosic acid synthase membrane-associated Mas
1248 3277456 T | P Rv2940c mas 1754 Probable multifunctional mycocerosic acid synthase membrane-associated Mas
1249 3293046 E | G Rv2946c pks1 1103 Probable polyketide synthase Pks1
1250 3296373 G | A,D,H,L,P,Q,R,V Rv2947c pks15 490 Probable polyketide synthase Pks15
1251 3296374 G | A,D,H,L,P,Q,R,S,V Rv2947c pks15 490 Probable polyketide synthase Pks15
1252 3297518 W | *,F,L Rv2947c pks15 108 Probable polyketide synthase Pks15
1253 3299852 Q | * Rv2948c fadD22 35 P-hydroxybenzoyl-AMP ligase FadD22
1254 3300461 D | G Rv2949c . 37 Rv2949c, (MTCY349.41), len: 199 aa. Chorismate pyruvate lyase, equivalent to Q9CD83|ML0133 hypothetical protein from Mycobacterium leprae (210 aa), FASTA scores: opt: 797, E(): 7.4e-47, (62.55% identity in 195 aa overlap). Equivalent to AAK47348 from Mycobacterium tuberculosis strain CDC1551 (212...
1255 3304966 G | R Rv2952 . 176 Possible methyltransferase (methylase)
1256 3311579 L | I,P Rv2958c . 141 Rv2958c, (MTCY349.30), len: 428 aa. Possible glycosyl transferase (see citation below), highly similar to Q9CD88|ML0128 putative glycosyl transferase from Mycobacterium leprae (435 aa), FASTA scores: opt: 2116,E(): 5.8e-126, (75.05% identity in 417 aa overlap); and Q9CD91|ML0125 putative glycosyl...
1257 3311580 L | I,P,S,T,V Rv2958c . 141 Rv2958c, (MTCY349.30), len: 428 aa. Possible glycosyl transferase (see citation below), highly similar to Q9CD88|ML0128 putative glycosyl transferase from Mycobacterium leprae (435 aa), FASTA scores: opt: 2116,E(): 5.8e-126, (75.05% identity in 417 aa overlap); and Q9CD91|ML0125 putative glycosyl...
1258 3328608 G | P,R Rv2973c recG 447 Probable ATP-dependent DNA helicase RecG
1259 3331366 A | R,W Rv2975c . 83 Rv2975c, (MTCY349.12), len: 84 aa. Conserved hypothetical protein, similar to N-terminus of others e.g. Q9ZBR4|SC7A1.09 hypothetical 59.5 KDA protein from Streptomyces coelicolor (589 aa), FASTA scores: opt: 141,E(): 0.0019, (41.25% identity in 80 aa overlap); Q98R49|MYPU_1610 hypothetical protei...
1260 3335708 P | A,R Rv2979c . 14 Rv2979c, (MTCY349.08), len: 194 aa. Probable resolvase for IS1538, with low level matches to transposon resolvases; highly similar from aa 101 to YX1C_MYCTU|Q10831 from Mycobacterium tuberculosis (295 aa), FASTA scores: opt: 809, E(): 0, (69.1% identity in 194 aa overlap). Contains PS00397 Site-s...
1261 3342165 V | L Rv2985 mutT1 1 Possible hydrolase MutT1
1262 3345047 I | D,E Rv2988c leuC 343 Probable 3-isopropylmalate dehydratase (large subunit) LeuC (isopropylmalate isomerase) (alpha-IPM isomerase) (IPMI)
1263 3346634 A | V Rv2989 . 163 Rv2989, (MTV012.03), len: 233 aa. Probable transcriptional regulator (ala-rich protein), highly similar to O86533|SC1C2.33c putative transcriptional regulator from Streptomyces coelicolor (238 aa), FASTA scores: opt: 711, E(): 2.3e-38, (53.05% identity in 230 aa overlap); and similar to others e....
1264 3352919 F | L Rv2995c leuB 184 Probable 3-isopropylmalate dehydrogenase LeuB (beta-IPM dehydrogenase) (IMDH) (3-IPM-DH)
1265 3358235 M | L Rv2999 lppY 212 Probable conserved lipoprotein LppY
1266 3369382 A | V Rv3010c pfkA 158 Probable 6-phosphofructokinase PfkA (phosphohexokinase) (phosphofructokinase)
1267 3369613 A | G,V Rv3010c pfkA 81 Probable 6-phosphofructokinase PfkA (phosphohexokinase) (phosphofructokinase)
1268 3375165 F | C Rv3015c . 167 Rv3015c, (MTV012.29c), len: 337 aa. Conserved hypothetical protein, equivalent to Q9CBR6|ML1706 hypothetical protein from Mycobacterium leprae (337 aa),FASTA scores: opt: 1703, E(): 3.1e-92, (78.05% identity in 337 aa overlap); and (but longer 47 aa) O33101|MLCB637.09 hypothetical 30.0 KDA protei...
1269 3384277 S | D,H,P Rv3025c iscS 264 Cysteine desulfurase IscS (NIFS protein homolog) (nitrogenase metalloclusters biosynthesis protein NIFS)
1270 3402510 R | H Rv3042c serB2 218 Probable phosphoserine phosphatase SerB2 (PSP) (O-phosphoserine phosphohydrolase) (pspase)
1271 3406045 A | T Rv3044 fecB 304 Probable FEIII-dicitrate-binding periplasmic lipoprotein FecB
1272 3406798 A | T Rv3045 adhC 172 Probable NADP-dependent alcohol dehydrogenase AdhC
1273 3425148 S | C,G,L,R,T Rv3061c fadE22 94 Probable acyl-CoA dehydrogenase FadE22
1274 3425315 D | A,G,R Rv3061c fadE22 38 Probable acyl-CoA dehydrogenase FadE22
1275 3440265 D | A Rv3077 . 242 Rv3077, (MTCY22D7.04c), len: 603 aa. Possible hydrolase, with some similarity to variety of hydrolases (aryl- and steryl sulfatases principaly) e.g. Q45087|PEHA phosphonate monoester hydrolase from Burkholderia caryophylli (514 aa), FASTA scores: opt: 239, E(): 7.2e-07,(23.95% identity in 413 aa ...
1276 3441002 A | S Rv3077 . 488 Rv3077, (MTCY22D7.04c), len: 603 aa. Possible hydrolase, with some similarity to variety of hydrolases (aryl- and steryl sulfatases principaly) e.g. Q45087|PEHA phosphonate monoester hydrolase from Burkholderia caryophylli (514 aa), FASTA scores: opt: 239, E(): 7.2e-07,(23.95% identity in 413 aa ...
1277 3462150 R | A,P Rv3093c . 206 Rv3093c, (MTCY164.04c), len: 334 aa. Hypothetical oxidoreductase, with some similarity with various oxidoreductases e.g. Q58929|mer|MJ1534 N5,N10-methylene tetrahydromethanopterin reductase from Methanococcus jannaschii (331 aa), FASTA scores: opt: 300, E(): 1.1e-10,(24.1% identity in 324 aa over...
1278 3462152 P | A,R Rv3093c . 205 Rv3093c, (MTCY164.04c), len: 334 aa. Hypothetical oxidoreductase, with some similarity with various oxidoreductases e.g. Q58929|mer|MJ1534 N5,N10-methylene tetrahydromethanopterin reductase from Methanococcus jannaschii (331 aa), FASTA scores: opt: 300, E(): 1.1e-10,(24.1% identity in 324 aa over...
1279 3462157 A | G,P Rv3093c . 203 Rv3093c, (MTCY164.04c), len: 334 aa. Hypothetical oxidoreductase, with some similarity with various oxidoreductases e.g. Q58929|mer|MJ1534 N5,N10-methylene tetrahydromethanopterin reductase from Methanococcus jannaschii (331 aa), FASTA scores: opt: 300, E(): 1.1e-10,(24.1% identity in 324 aa over...
1280 3468854 P | A,C,S Rv3099c . 138 Rv3099c, (MTCY164.10c), len: 283 aa. Conserved protein, some similarity with hypothetical proteins e.g. Q9XA69|SCGD3.09 from Streptomyces coelicolor (274 aa),FASTA scores: opt: 384, E(): 1.8e-17, (32.7% identity in 269 aa overlap); and P71606|Y036_MYCTU|Rv0036c from Mycobacterium tuberculosis str...
1281 3476058 N | H,P Rv3107c agpS 302 Possible alkyldihydroxyacetonephosphate synthase AgpS (alkyl-DHAP synthase) (alkylglycerone-phosphate synthase)
1282 3480373 Q | G,L,R,V Rv3113 . 100 Rv3113, (MTCY164.23), len: 222 aa. Possible phosphatase, with weak similarity to other phosphatases e.g. Q9KYY0|SCE33.02c from Streptomyces coelicolor (223 aa), FASTA scores: opt: 368, E(): 1.2e-16, (32.9% identity in 222 aa overlap); and Q55039|GPH_SYNP7|CBBZ phosphoglycolate phosphatase from Sy...
1283 3480474 G | E Rv3113 . 134 Rv3113, (MTCY164.23), len: 222 aa. Possible phosphatase, with weak similarity to other phosphatases e.g. Q9KYY0|SCE33.02c from Streptomyces coelicolor (223 aa), FASTA scores: opt: 368, E(): 1.2e-16, (32.9% identity in 222 aa overlap); and Q55039|GPH_SYNP7|CBBZ phosphoglycolate phosphatase from Sy...
1284 3491589 G | A,P,S Rv3125c . 22 PPE family protein PPE49
1285 3501339 D | A,R Rv3135 . 2 PPE family protein PPE50
1286 3518555 T | A Rv3151 nuoG 604 Probable NADH dehydrogenase I (chain G) NuoG (NADH-ubiquinone oxidoreductase chain G)
1287 3538601 V | L Rv3170 aofH 33 Probable flavin-containing monoamine oxidase AofH (amine oxidase) (MAO)
1288 3542018 P | K,N,S Rv3173c . 10 Rv3173c, (MTV014.17c), len: 200 aa. Probable transcriptional regulatory protein TetR family, similar to several bacterial putative regulatory proteins e.g. Q9EWI2|SC7H9.14 from Streptomyces coelicolor (195 aa),FASTA scores: opt: 319, E(): 1.7e-13, (34.55% identity in 195 aa overlap); O85695|3SCF6...
1289 3543469 R | C Rv3175 . 204 Rv3175, (MTV014.19), len: 495 aa. Possible amidase ,similar to others e.g. Q9F6D0|ZHUL enantiomer selective amidase from Streptomyces sp. R1128 (507 aa), FASTA scores: opt: 1328 ,E(): 7.5e-69, (44.5% identity in 492 aa overlap); BAB51815|MLR5350 probable amidase from Rhizobium loti (Mesorhizobium...
1290 3550012 R | E,G,V Rv3181c . 45 Rv3181c, (MTV014.25c), len: 150 aa. Conserved protein, with some similarity to other mycobacterium proteins e.g. Q50718|YY07_MYCTU|Rv3407|MT3515|MTCY78.21c (99 aa), FASTA scores: opt: 123, E(): 0.25, (33.7% identity in 89 aa overlap); and O50412|Rv3385c|MTV004.43c (102 aa),FASTA scores: opt: 123,...
1291 3550929 S | *,C,F,R Rv3183 . 72 Rv3183, (MTV014.27), len: 109 aa. Possible transcriptional regulator, similar to others e.g. Q9S1D9|YPPCP1.08c from Yersinia pestis (99 aa), FASTA scores: opt: 119, E(): 0.47, (40.55% identity in 74 aa overlap); Q9X153|TM1330 from Thermotoga maritima (111 aa),FASTA scores: opt: 115, E(): 0.91, (4...
1292 3576231 Q | R Rv3201c . 269 Probable ATP-dependent DNA helicase
1293 3576706 V | I Rv3201c . 111 Rv3201c, (MTV014.45c), len: 1101 aa. Probable ATP-dependent DNA helicase, similar to others e.g. Q9FCK4|2SC3B6.08 from Streptomyces coelicolor (1222 aa),FASTA scores: opt: 1209, E(): 5.4e-63, (38.45% identity in 1199 aa overlap); P71561|PCRA_MYCTU|CRA|IVRD|Rv0949|MT0976|MTCY10D7.25c from Mycobact...
1294 3581047 G | V Rv3203 lipV 137 Possible lipase LipV
1295 3582019 A | D,P,R,V Rv3205c . 163 Rv3205c, (MTCY07D11.21), len: 292 aa. Conserved protein, highly similar to Q9CCG7|ML0818 hypothetical protein from Mycobacterium leprae (297 aa), FASTA scores: opt: 1745, E(): 9.1e-98, (87.3% identity in 291 aa overlap). A core mycobacterial gene; conserved in mycobacterial strains (See Marmiesse...
1296 3597249 G | R Rv3220c . 96 Rv3220c, (MTCY07D11.06), len: 501 aa. Probable sensor (probably histidine kinase), equivalent to Q9CCH8|ML0803 putative two-component system sensor kinase from Mycobacterium leprae (500 aa). Similar to others e.g. Q9F3M1|2SC7G11.01 putative histidine kinase (fragment) from Streptomyces coelicolor...
1297 3618006 V | P Rv3240c secA1 843 Probable preprotein translocase SecA1 1 subunit
1298 3624486 D | G Rv3244c lpqB 142 Probable conserved lipoprotein LpqB
1299 3625065 M | L Rv3245c mtrB 517 Two component sensory transduction histidine kinase MtrB
1300 3626562 P | S Rv3245c mtrB 18 Two component sensory transduction histidine kinase MtrB
1301 3631941 D | E,R Rv3252c alkB 17 Probable transmembrane alkane 1-monooxygenase AlkB (alkane 1-hydroxylase) (lauric acid omega-hydroxylase) (omega-hydroxylase) (fatty acid omega-hydroxylase) (alkane hydroxylase-rubredoxin)
1302 3645990 G | R Rv3265c wbbL1 299 dTDP-RHA, a-D-Glcnac-diphosphoryl polyprenol,a-3-L-rhamnosyl transferase WbbL1 (alpha-L-rhamnose-(1->3)-alpha-D-Glcnac(1->P)-P- decaprenyl)
1303 3658224 N | H,P Rv3275c purE 139 Probable phosphoribosylaminoimidazole carboxylase catalytic subunit PurE (air carboxylase) (AIRC)
1304 3661382 D | N Rv3279c birA 211 Possible bifunctional protein BirA, biotin operon repressor + biotin--[acetyl-CoA-carboxylase] synthetase (biotin--protein ligase)
1305 3663793 E | A,D,P Rv3281 accE5 35 Probable bifunctional protein acetyl-/propionyl-coenzyme A carboxylase (epsilon chain) AccE5
1306 3671532 A | G Rv3290c lat 88 Probable L-lysine-epsilon aminotransferase Lat (L-lysine aminotransferase) (lysine 6-aminotransferase)
1307 3682715 S | A,G,V Rv3298c lpqC 104 Possible esterase lipoprotein LpqC
1308 3687372 L | S Rv3301c phoY1 69 Probable phosphate-transport system transcriptional regulatory protein PhoU homolog 1 PhoY1
1309 3690016 L | S Rv3303c lpdA 308 NAD(P)H quinone reductase LpdA
1310 3703480 P | S Rv3315c cdd 129 Probable cytidine deaminase Cdd (cytidine aminohydrolase) (cytidine nucleoside deaminase)
1311 3704261 L | I Rv3316 sdhC 54 Probable succinate dehydrogenase (cytochrome B-556 subunit) SdhC (succinic dehydrogenase) (fumarate reductase) (fumarate dehydrogenase) (fumaric hydrogenase)
1312 3704596 V | L Rv3317 sdhD 54 Probable succinate dehydrogenase (hydrophobic membrane anchor subunit) SdhD (succinic dehydrogenase) (fumarate reductase) (fumarate dehydrogenase) (fumaric hydrogenase)
1313 3735673 W | C Rv3343c . 422 PPE family protein PPE54
1314 3737502 G | C,F,L,R,V Rv3344c . 313 PE-PGRS family protein PE_PGRS49
1315 3737735 P | L Rv3344c . 235 PE-PGRS family protein PE_PGRS49
1316 3738024 G | A,H,I,Q,R Rv3344c . 139 PE-PGRS family protein PE_PGRS49
1317 3738028 N | A,D,G,H,M,R,W Rv3344c . 138 PE-PGRS family protein PE_PGRS49
1318 3738226 G | S Rv3344c . 72 PE-PGRS family protein PE_PGRS49
1319 3738230 A | E,G,N Rv3344c . 70 PE-PGRS family protein PE_PGRS49
1320 3738369 G | L,P,R Rv3344c . 24 PE-PGRS family protein PE_PGRS49
1321 3738520 G | A,E,Q,R,S,V Rv3345c . 1419 PE-PGRS family protein PE_PGRS50
1322 3738543 A | G,Q,S,V,W Rv3345c . 1411 PE-PGRS family protein PE_PGRS50
1323 3738730 S | A,C,L,R Rv3345c . 1349 PE-PGRS family protein PE_PGRS50
1324 3738880 N | D Rv3345c . 1299 PE-PGRS family protein PE_PGRS50
1325 3739291 Y | F,G,L,R Rv3345c . 1162 PE-PGRS family protein PE_PGRS50
1326 3739978 N | A,H,L,Y Rv3345c . 933 PE-PGRS family protein PE_PGRS50
1327 3740707 G | P,Q,R,T Rv3345c . 690 PE-PGRS family protein PE_PGRS50
1328 3740904 I | N Rv3345c . 624 PE-PGRS family protein PE_PGRS50
1329 3741255 G | A Rv3345c . 507 PE-PGRS family protein PE_PGRS50
1330 3741286 G | A,R,S,V Rv3345c . 497 PE-PGRS family protein PE_PGRS50
1331 3748730 N | A,H Rv3347c . 1486 PPE family protein PPE55
1332 3751744 F | V Rv3347c . 481 PPE family protein PPE55
1333 3756082 N | A,S,Y Rv3350c . 3674 PPE family protein PPE56
1334 3756261 F | L Rv3350c . 3615 PPE family protein PPE56
1335 3760073 I | V Rv3350c . 2344 PPE family protein PPE56
1336 3760770 S | E,G,P,R Rv3350c . 2112 PPE family protein PPE56
1337 3761020 G | A,R Rv3350c . 2028 PPE family protein PPE56
1338 3764955 G | C,V Rv3350c . 717 PPE family protein PPE56
1339 3765069 S | D,P Rv3350c . 679 PPE family protein PPE56
1340 3766912 A | G,P,R Rv3350c . 64 PPE family protein PPE56
1341 3766918 A | G,P,R,W Rv3350c . 62 PPE family protein PPE56
1342 3768339 G | A,R Rv3352c . 86 Rv3352c, (MTV004.09c), len: 123 aa. Possible oxidoreductase, similar to part of several oxidoreductases (and hypothetical proteins) from diverse organisms e.g. Q9KYD6|SCD72A.20 putative lipoprotein (fragment) from Streptomyces coelicolor (403 aa), FASTA scores: opt: 348,E(): 7.9e-15, (51.0% ident...
1343 3774712 T | A,E,Q,X Rv3364c . 55 Rv3364c, (MTV004.21c), len: 130 aa. Conserved protein, highly similar to others from Streptomyces coelicolor e.g. O86524|SC1C2.24c (137 aa), FASTA scores: opt: 466, E(): 1.3e-22, (58.6% identity in 116 aa overlap); O86521|SC1C2.20c (140 aa), FASTA scores: opt: 445, E(): 2.7e-21, (56.9% identity i...
1344 3779701 A | G,V,W Rv3367 . 378 PE-PGRS family protein PE_PGRS51
1345 3779788 Q | H Rv3367 . 407 PE-PGRS family protein PE_PGRS51
1346 3779978 G | R Rv3367 . 471 PE-PGRS family protein PE_PGRS51
1347 3784011 A | T Rv3370c dnaE2 244 Probable DNA polymerase III (alpha chain) DnaE2 (DNA nucleotidyltransferase)
1348 3789989 R | G Rv3375 amiD 457 Probable amidase AmiD (acylamidase) (acylase)
1349 3792893 L | H,P,R Rv3378c . 119 Rv3378c, (MTV004.36c), len: 296 aa. Diterpene synthase. Note that this ORF and the downstream ORF MTV004.35c have a significantly lower GC bias than the rest of the genome. This region is a possible MT-complex-specific genomic island (See Becq et al., 2007). Cofactor: Mg2+.
1350 3797822 A | E,G,P Rv3383c idsB 223 Possible polyprenyl synthetase IdsB (polyprenyl transferase) (polyprenyl diphosphate synthase)
1351 3802445 H | D Rv3388 . 265 PE-PGRS family protein PE_PGRS52
1352 3808103 N | S Rv3392c cmaA1 112 Cyclopropane-fatty-acyl-phospholipid synthase 1 CmaA1 (cyclopropane fatty acid synthase) (CFA synthase) (cyclopropane mycolic acid synthase 1)
1353 3811629 A | V Rv3395c . 3 hypothetical protein
1354 3823194 N | D Rv3403c . 224 Rv3403c, (MTCY78.25), len: 533 aa. Hypothetical unknown protein, but some weak similarity to Q9KJP2 hypothetical 54.9 KDA protein from Myxococcus xanthus (504 aa), FASTA scores: opt: 157, E(): 0.011, (24.1% identity in 548 aa overlap).
1355 3826501 R | C Rv3407 vapB47 84 Possible antitoxin VapB47
1356 3830220 G | L,R,W Rv3411c guaB2 434 Probable inosine-5'-monophosphate dehydrogenase GuaB2 (imp dehydrogenase) (inosinic acid dehydrogenase) (inosinate dehydrogenase) (imp oxidoreductase) (inosine-5'-monophosphate oxidoreductase) (IMPDH) (IMPD)
1357 3847564 L | V Rv3429 . 134 PPE family protein PPE59
1358 3851084 I | T Rv3432c gadB 224 Probable glutamate decarboxylase GadB
1359 3853547 A | C,G,L,W Rv3434c . 128 Rv3434c, (MTCY77.06c), len: 237 aa. Possible conserved transmembrane protein, showing some similarity with Q9CGH7|YLDB hypothetical protein from Lactococcus lactis (subsp. lactis) (Streptococcus lactis) (258 aa),FASTA scores: opt: 248, E(): 1.6e-09, (28.8% identity in 198 aa overlap); and P94983|...
1360 3861914 A | T Rv3442c rpsI 13 30S ribosomal protein S9 RpsI
1361 3863173 M | *,L,R,Y Rv3445c esxU 31 ESAT-6 like protein EsxU
1362 3863175 M | A,C,Y Rv3445c esxU 31 ESAT-6 like protein EsxU
1363 3868001 R | A,D,T Rv3447c eccC4 80 ESX conserved component EccC4 ESX-4 type VII secretion system protein Probable membrane protein
1364 3868185 D | E,R Rv3447c eccC4 19 ESX conserved component EccC4 ESX-4 type VII secretion system protein Probable membrane protein
1365 3879288 R | K,N Rv3459c rpsK 135 30S ribosomal protein S11 RpsK
1366 3894732 R | G Rv3478 . 103 PE family protein PPE60
1367 3899451 N | H,I,L,P,R Rv3480c . 318 Rv3480c, (MTCY13E12.33c), len: 497 aa. Possible triacylglycerol synthase (See Daniel et al., 2004), similar to many from Mycobacterium tuberculosis strains H37Rv and CDC1551 e.g. O69701|Y1D4_MYCTU|Rv3734c|MT3839|MTV025.082c (454 aa), FASTA scores: opt: 520, E(): 2e-23, (39.95% identity in 488 aa ...
1368 3905379 A | C,P,W Rv3485c . 63 Rv3485c, (MTCY13E12.38c), len: 314 aa. Probable short-chain dehydrogenase/reductase, similar, but longer 41 aa, to P71824|Rv0769|MTCY369.14 putative short-chain type dehydrogenase/reductase CY369.14 from Mycobacterium tuberculosis (248 aa), FASTA scores: opt: 462, E(): 1.8e-19, (34.0% identity in...
1369 3906644 L | V Rv3487c lipF 122 Probable esterase/lipase LipF
1370 3914781 Y | S Rv3496c mce4D 369 Mce-family protein Mce4D
1371 3916386 R | S Rv3497c mce4C 191 Mce-family protein Mce4C
1372 3917338 F | K,N,V Rv3498c mce4B 221 Mce-family protein Mce4B
1373 3928283 N | T Rv3507 . 572 PE-PGRS family protein PE_PGRS53
1374 3930448 D | N Rv3507 . 1294 PE-PGRS family protein PE_PGRS53
1375 3934699 S | N Rv3508 . 1232 PE-PGRS family protein PE_PGRS54
1376 3936340 G | D,I,M,S Rv3508 . 1779 PE-PGRS family protein PE_PGRS54
1377 3937953 P | T Rv3509c ilvX 158 Probable acetohydroxyacid synthase IlvX (acetolactate synthase)
1378 3940514 H | D Rv3511 . 300 PE-PGRS family protein PE_PGRS55
1379 3943782 G | R Rv3512 . 687 PE-PGRS family protein PE_PGRS56
1380 3948629 T | A,P Rv3514 . 946 PE-PGRS family protein PE_PGRS57
1381 3948738 A | G Rv3514 . 982 PE-PGRS family protein PE_PGRS57
1382 3968006 L | F Rv3531c . 313 Hypothetical protein
1383 3972229 L | A Rv3533c . 76 PPE family protein PPE62
1384 3976595 R | S Rv3537 kstD 409 Probable dehydrogenase
1385 3981395 F | G,I,L,V Rv3542c . 196 Rv3542c, (MTCY03C7.14), len: 311 aa. Hypothetical protein, showing some similarity to other e.g. Q58947|MJ1552 from Methanococcus jannaschii (141 aa) FASTA scores: opt: 177, E(): 0.00065, (46.65% identity in 60 aa overlap); BAB59276|TVG0142586 from Thermoplasma volcanium (135 aa), FASTA scores: o...
1386 3989920 D | N Rv3551 . 9 Rv3551, (MTCY03C7.05c), len: 292 aa. Possible CoA-transferase, alpha subunit, similar in part to other CoA-transferases e.g. Q59111|GCTA_ACIFE|GCTA glutaconate CoA-transferase subunit A (GCT large subunit) from Acidaminococcus fermentans (319 aa) FASTA scores: opt: 247,E(): 6.3e-09, (27.35% ident...
1387 3995255 K | E Rv3555c . 149 Rv3555c, (MTCY06G11.02c), len: 289 aa. Conserved protein, highly similar to others from Mycobacterium tuberculosis e.g. O53562|AL022022|Rv3517|MTV023.24 (279 aa), FASTA scores: opt: 874, E(): 8.3e-48, (49.45% identity in 275 aa overlap); P71763|Rv1482c|MTCY277.03c (339 aa),FASTA scores: opt: 755,...
1388 4005114 W | S Rv3563 fadE32 275 Probable acyl-CoA dehydrogenase FadE32
1389 4022917 S | I,K Rv3580c cysS1 296 Cysteinyl-tRNA synthetase 1 CysS1 (cysteine--tRNA ligase 1) (CYSRS 1) (cysteine translase)
1390 4031277 R | P,Q Rv3589 mutY 262 Probable adenine glycosylase MutY
1391 4031926 G | *,P,R,X Rv3590c . 412 PE-PGRS family protein PE_PGRS58
1392 4032218 A | V Rv3590c . 314 PE-PGRS family protein PE_PGRS58
1393 4036969 A | D,G Rv3595c . 361 PE-PGRS family protein PE_PGRS59
1394 4036989 M | A,L Rv3595c . 355 PE-PGRS family protein PE_PGRS59
1395 4049647 A | P Rv3608c folP1 112 Dihydropteroate synthase 1 FolP (DHPS 1) (dihydropteroate pyrophosphorylase 1) (dihydropteroate diphosphorylase 1)
1396 4051823 D | G Rv3610c ftsH 354 Membrane-bound protease FtsH (cell division protein)
1397 4055801 T | I Rv3616c espA 192 ESX-1 secretion-associated protein A, EspA
1398 4058711 L | S Rv3618 . 5 Possible monooxygenase
1399 4069292 A | T Rv3630 . 40 Probable conserved integral membrane protein
1400 4069743 T | M Rv3630 . 190 Rv3630, (MTCY15C10.22c), len: 431 aa. Probable conserved integral membrane, highly similar to P71789|YF10_MYCTU|Rv1510|MTCY277.32 hypothetical 44.3 KDA protein from Mycobacterium tuberculosis (432 aa) FASTA scores: opt: 1940, E(): 2.3e-103, (70.75% identity in 424 aa overlap). Note that N-termina...
1401 4079553 V | A,G,L Rv3640c . 66 Rv3640c, (MTCY15C10.12), len: 409 aa. Probable transposase, highly similar to others e.g. Q48882 transposase from Mycobacterium avium (411 aa) FASTA scores: opt: 1574, E(): 6.2e-93, (59.75% identity in 400 aa overlap); Q9AKV5 putative transposase (fragment) from Mycobacterium paratuberculosis (39...
1402 4089058 L | P Rv3649 . 93 Probable helicase
1403 4094352 A | G,R Rv3653 . 138 PE-PGRS family-related protein PE_PGRS61
1404 4095002 S | A,C,F,L,R,W Rv3655c . 100 Rv3655c, (MTV025.003c), len: 125 aa. Hypothetical protein, with similarity to Q9X917|SCH5.15c hypothetical 15.2 KDA protein from Streptomyces coelicolor (150 aa) FASTA scores: opt: 211, E(): 7.7e-07, (39.65% identity in 111 aa overlap). Equivalent to AAK48119 from Mycobacterium tuberculosis strai...
1405 4096820 R | A Rv3658c . 41 Rv3658c, (MTV025.006c), len: 266 aa. Probable conserved transmembrane protein, similar to Q9X920|SCH5.18c putative integral membrane protein from Streptomyces coelicolor (321 aa), FASTA scores: opt: 335, E(): 4.1e-13,(38.05% identity in 247 aa overlap).
1406 4102632 F | L,S Rv3663c dppD 350 Probable dipeptide-transport ATP-binding protein ABC transporter DppD
1407 4106154 E | K Rv3666c dppA 311 Probable periplasmic dipeptide-binding lipoprotein DppA
1408 4106691 F | L,P,V Rv3666c dppA 132 Probable periplasmic dipeptide-binding lipoprotein DppA
1409 4131581 L | V Rv3689 . 409 Rv3689, (MTV025.037), len: 451 aa. Probable conserved transmembrane protein, with Proline rich N-terminus, similar to Q9KYW6|SCE33.17 putative integral membrane protein from Streptomyces coelicolor (462 aa) FASTA scores: opt: 730, E(): 2.7e-21, (38.1% identity in 412 aa overlap).
1410 4133316 T | S Rv3691 . 267 hypothetical protein
1411 4140143 G | R Rv3697c vapC48 34 Possible toxin VapC48 Contains PIN domain
1412 4156099 V | L Rv3711c dnaQ 211 Probable DNA polymerase III (epsilon subunit) DnaQ
1413 4162339 T | A Rv3719 . 12 Rv3719, (MTV025.067), len: 470 aa. Conserved protein, equivalent to O69516|ML2333|MLCB2407.17c hypothetical 51.8 KDA protein from Mycobacterium leprae (459 aa), FASTA scores: opt: 2593, E(): 7.8e-161, (82.75% identity in 458 aa overlap). Also some similarity to Q9CU63|5830417J06RIK hypothetical p...
1414 4182695 R | H Rv3731 ligC 313 Possible ATP-dependent DNA ligase LigC (polydeoxyribonucleotide synthase [ATP]) (polynucleotide ligase [ATP]) (sealase) (DNA repair protein) (DNA joinase)
1415 4187063 G | R Rv3736 . 144 Transcriptional regulatory protein (probably AraC/XylS-family)
1416 4187817 D | G Rv3737 . 40 Probable conserved transmembrane protein
1417 4189721 T | N Rv3738c . 171 PPE family protein PPE66
1418 4194544 W | F,L,S Rv3743c ctpJ 277 Probable cation transporter P-type ATPase CtpJ
1419 4197409 E | X Rv3748 . 58 Rv3748, (MTV025.096), len: 119 aa. Hypothetical protein, highly similar to upstream ORF O69714|Rv3747|MTV025.095 conserved hypothetical protein (127 aa), FASTA scores: opt: 496, E(): 2.5e-28, (64.4% identity in 118 aa overlap).
1420 4206010 K | I,N Rv3761c fadE36 303 Possible acyl-CoA dehydrogenase FadE36
1421 4206013 F | C,K,L Rv3761c fadE36 302 Possible acyl-CoA dehydrogenase FadE36
1422 4223172 V | A Rv3777 . 160 Probable oxidoreductase
1423 4230033 V | A Rv3783 rfbD 259 Probable O-antigen/lipopolysaccharide transport integral membrane protein ABC transporter RfbD
1424 4239298 A | V Rv3792 aftA 456 Arabinofuranosyltransferase AftA
1425 4247431 M | I Rv3795 embB 306 Integral membrane indolylacetylinositol arabinosyltransferase EmbB (arabinosylindolylacetylinositol synthase)
1426 4263501 T | P Rv3802c . 289 Rv3802c, (MTV026.07c), len: 336 aa. Probable conserved membrane protein, with a N-terminal signal sequence followed by Pro-rich region. Equivalent to Q9CDB3|ML0099 hypothetical protein from Mycobacterium leprae (336 aa) FASTA scores: opt: 1759, E(): 1.1e-85,(75.5% identity in 335 aa overlap). A c...
1427 4270177 A | G Rv3807c . 54 Rv3807c, (MTV026.12), len: 165 aa. Possible conserved transmembrane protein, equivalent to Q9CDB6|ML0094 putative membrane protein from Mycobacterium leprae (192 aa), FASTA scores: opt: 714, E(): 2.4e-38,(72.85% identity in 151 aa overlap). Also highly similar to Q9KZA3|SC5G8.11 putative integral...
1428 4270285 A | V Rv3807c . 18 Rv3807c, (MTV026.12), len: 165 aa. Possible conserved transmembrane protein, equivalent to Q9CDB6|ML0094 putative membrane protein from Mycobacterium leprae (192 aa), FASTA scores: opt: 714, E(): 2.4e-38,(72.85% identity in 151 aa overlap). Also highly similar to Q9KZA3|SC5G8.11 putative integral...
1429 4275935 M | V Rv3811 . 380 hypothetical protein
1430 4284429 P | L Rv3820c papA2 466 Possible conserved polyketide synthase associated protein PapA2
1431 4302036 T | A Rv3827c . 252 Possible transposase
1432 4305064 S | F,L,M Rv3830c . 208 Rv3830c, (MTCY01A6.39), len: 209 aa. Probable transcriptional regulator TetR family, similar to others e.g. P39885|TCMR_STRGA tetracenomycin C transcriptional repressor from Streptomyces glaucescens (226 aa) FASTA scores: opt: 255, E(): 6.1e-10, (33.65% identity in 202 aa overlap); Q9RDR0|SC4A7.0...
1433 4305066 V | F,G Rv3830c . 207 Rv3830c, (MTCY01A6.39), len: 209 aa. Probable transcriptional regulator TetR family, similar to others e.g. P39885|TCMR_STRGA tetracenomycin C transcriptional repressor from Streptomyces glaucescens (226 aa) FASTA scores: opt: 255, E(): 6.1e-10, (33.65% identity in 202 aa overlap); Q9RDR0|SC4A7.0...
1434 4307621 R | P Rv3833 . 252 Rv3833, (MTCY01A6.36c), len: 263 aa. Probable transcriptional regulator belonging to araC family, similar to others e.g. Q9KYN4|SC9H11.05 putative AraC-family transcriptional regulator from Streptomyces coelicolor (289 aa), FASTA scores: opt: 754, E(): 1.2e-42, (50.45% identity in 232 aa overlap)...
1435 4311131 D | I Rv3837c . 193 Rv3837c, (MTCY01A6.32), len: 232 aa. Probable phosphoglycerate mutase, equivalent to Q9CDC3|ML0079 putative phosphoglycerate mutase from Mycobacterium leprae (231 aa), FASTA scores: opt: 1116, E(): 7.3e-66, (71.55% identity in 232 aa overlap). Also similar to others e.g. Q9ZAX0|PGM 2,3-PDG depend...
1436 4315145 K | E,P,Q Rv3842c glpQ1 140 Probable glycerophosphoryl diester phosphodiesterase GlpQ1 (glycerophosphodiester phosphodiesterase)
1437 4326464 I | N,T Rv3854c ethA 337 Monooxygenase EthA
1438 4326831 Q | * Rv3854c ethA 215 Monooxygenase EthA
1439 4355184 E | K Rv3877 eccD1 60 ESX conserved component EccD1 ESX-1 type VII secretion system protein Probable transmembrane protein
1440 4358396 S | A,G,L,P,X Rv3879c espK 463 ESX-1 secretion-associated protein EspK Alanine and proline rich protein
1441 4365541 I | K,N,S Rv3884c eccA2 433 ESX conserved component EccA2 ESX-2 type VII secretion system protein Probable CbxX/CfqX family protein
1442 4369612 P | A Rv3886c mycP2 187 Probable alanine and proline rich membrane-anchored mycosin MycP2 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-2)
1443 4369618 P | A Rv3886c mycP2 185 Probable alanine and proline rich membrane-anchored mycosin MycP2 (serine protease) (subtilisin-like protease) (subtilase-like) (mycosin-2)
1444 4373029 Q | R Rv3889c espG2 201 ESX-2 secretion-associated protein EspG2
1445 4374228 T | A Rv3891c esxD 49 Possible ESAT-6 like protein EsxD
1446 4375056 Y | D,L,P Rv3892c . 210 PPE family protein PPE69
1447 4375673 P | R Rv3892c . 4 PPE family protein PPE69
1448 4380295 T | I,K,N,S Rv3894c eccC2 53 ESX conserved component EccC2 ESX-2 type VII secretion system protein Possible membrane protein
1449 4383142 C | A,G,L,P,R,W Rv3897c . 167 Rv3897c, (MTCY15F10.15), len: 210 aa. Conserved hypothetical protein, highly similar in part to Q10691|YK83_MYCTU|Rv2083|MT2145|MTCY49.22 hypothetical 30.8 KDA protein from Mycobacterium tuberculosis (314 aa) FASTA scores: opt: 815, E(): 4.7e-26, (73.05% identity in 167 aa overlap). Similarity to...
1450 4383600 G | A,H,L Rv3897c . 14 Rv3897c, (MTCY15F10.15), len: 210 aa. Conserved hypothetical protein, highly similar in part to Q10691|YK83_MYCTU|Rv2083|MT2145|MTCY49.22 hypothetical 30.8 KDA protein from Mycobacterium tuberculosis (314 aa) FASTA scores: opt: 815, E(): 4.7e-26, (73.05% identity in 167 aa overlap). Similarity to...
1451 4384671 W | G Rv3899c . 237 Rv3899c, (MTCY15F10.13), len: 410 aa. Conserved hypothetical protein, similar in part to proteins from Mycobacterium tuberculosis strains H37Rv and CDC1551. Region between aa 29-80 is strictly identical to P96909 hypothetical 15.1 KDA protein (fragment) (143 aa) FASTA scores: opt: 562, E(): 4e-16...
1452 4394509 D | X Rv3909 . 106 Rv3909, (MTCY15F10.02c), len: 802 aa. Conserved protein, equivalent to Q9CCY0|ML2699 putative secreted protein from Mycobacterium leprae (797 aa) FASTA scores: opt: 3777, E(): 8.8e-206, (72.35% identity in 803 aa overlap). Note that the N-terminal end is highly similar to Q50196|L222-ORF7 (286 aa...
1453 4395107 T | P Rv3909 . 306 Rv3909, (MTCY15F10.02c), len: 802 aa. Conserved protein, equivalent to Q9CCY0|ML2699 putative secreted protein from Mycobacterium leprae (797 aa) FASTA scores: opt: 3777, E(): 8.8e-206, (72.35% identity in 803 aa overlap). Note that the N-terminal end is highly similar to Q50196|L222-ORF7 (286 aa...
1454 4398047 V | D Rv3910 . 484 Rv3910, (MTCY15F10.01c.MTV028.01), len: 1184 aa. Probable conserved transmembrane protein (hydrophobic domain ~50-550), equivalent to Q9CCX9|ML2700 possible conserved membrane protein from Mycobacterium leprae (1206 aa), FASTA scores: opt: 5554, E(): 0, (75.15% identity in 1182 aa overlap); and h...
1455 4401897 D | G Rv3913 trxB2 57 Probable thioredoxin reductase TrxB2 (TRXR) (TR)
1456 4402376 V | M Rv3913 trxB2 217 Probable thioredoxin reductase TrxB2 (TRXR) (TR)
1457 4406116 A | S Rv3917c parB 126 Probable chromosome partitioning protein ParB
1458 4407790 A | E,G,V Rv3919c gid 138 Probable glucose-inhibited division protein B Gid
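
Every record in this table follows the column order given in the legend (ID, Locus, Alleles, Tag, Gene, Codon, Annotation), with the H37Rv residue to the left of `|` and the alternative states to the right. The following is a minimal Python sketch of how a single record could be split into those fields. The regular expression, the function name `parse_record`, and the choice to map a `.` gene name to `None` are illustrative assumptions, not part of the supplementary file, and the locus-tag pattern may need loosening for tags outside the `RvNNNN[c]` form.

```python
import re

# Assumed layout of one record, inferred from the column legend:
# ID Locus Ref | Alt1,Alt2,... Tag Gene Codon Annotation
RECORD = re.compile(
    r"^(?P<id>\d+) (?P<locus>\d+) "
    r"(?P<ref>[A-Z*]) \| (?P<alts>[A-Z*](?:,[A-Z*])*) "
    r"(?P<tag>Rv\d{4}[A-Za-z]?) (?P<gene>\S+) "
    r"(?P<codon>\d+) (?P<annotation>.*)$"
)


def parse_record(text):
    """Split one table record into a dict of typed fields.

    Returns None when the text does not match the assumed layout.
    """
    match = RECORD.match(text.strip())
    if match is None:
        return None
    rec = match.groupdict()
    # Numeric columns become integers.
    for key in ("id", "locus", "codon"):
        rec[key] = int(rec[key])
    # Alternative allelic states become a list, e.g. ["N", "T"].
    rec["alts"] = rec["alts"].split(",")
    # "." marks a CDS without a gene name.
    if rec["gene"] == ".":
        rec["gene"] = None
    return rec


if __name__ == "__main__":
    example = "1437 4326464 I | N,T Rv3854c ethA 337 Monooxygenase EthA"
    print(parse_record(example))
```

Annotation text is kept verbatim, including truncation marks such as `...`, so the sketch does not attempt to interpret it.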