Rule 94: (14, lift 40.0) [hom( A ),e_val_gt( A ,2e-37),keyword( A ,plasmid)] = 0 [hom( A ),psi_iter_lteq( A ,7),classification( A ,bangiophyceae)] = 1 [hom( A ),species( A ,cyanophora_paradoxa),mol_wt_lteq( A ,32892)] = 1 -> class 'Ribosomal proteins - synthesis modificationRiboso' [0.938] Evaluation on test data (712 items): ecoli3228 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplF' '50S ribosomal subunit protein L6' ecoli3798 - 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'fdoI' 'formate dehydrogenase cytochrome B556 (FDO) subunit' ecoli3264 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsG' '30S ribosomal subunit protein S7 initiates assembly' ecoli3243 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplC' '50S ribosomal subunit protein L3' ecoli3238 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplV' '50S ribosomal subunit protein L22' ecoli4092 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsR' '30S ribosomal subunit protein S18' ecoli3882 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplA' '50S ribosomal subunit protein L1 regulates synthesis of L1 and L11' ecoli3231 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplE' '50S ribosomal subunit protein L5' ecoli169 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsB' '30S ribosomal subunit protein S2' ecoli3229 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsH' '30S ribosomal subunit protein S8 and regulator' ecoli23 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsT' '30S ribosomal subunit protein S20' ecoli1447 - 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'fdnI' 'formate dehydrogenase-N nitrate-inducible cytochrome B556(Fdn) gamma subunit' ecoli2556 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsP' '30S ribosomal subunit protein S16' ecoli3323 - 1,6,1 Cell processes Adaptation Adaptations, atypical conditions 'yrfH' 'heat shock protein 15 DNA/RNA binding' ecoli2553 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplS' '50S ribosomal subunit protein L19' ecoli3244 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsJ' '30S ribosomal subunit protein S10' Test Accuracy: 13/16 (81.25%) Test Frequency class 'Ribosomal proteins - synthesis modificationRiboso': 23/712 (3.23%) Test Significance: dev(17.65) ; prob(2.131753E-17) Application to new data (2167 items): Frequency rule on new data: 0/2167 (0.00%) Evaluation on training data (939 items): ecoli3559 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmB' '50S ribosomal subunit protein L28' ecoli4090 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsF' '30S ribosomal subunit protein S6' ecoli3881 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplK' '50S ribosomal subunit protein L11' ecoli3220 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsK' '30S ribosomal subunit protein S11' ecoli3240 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplB' '50S ribosomal subunit protein L2' ecoli1684 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplT' '50S ribosomal subunit protein L20 and regulator' ecoli3234 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsQ' '30S ribosomal subunit protein S17' ecoli3230 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsN' '30S ribosomal subunit protein S14' ecoli3558 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmG' '50S ribosomal subunit protein L33' ecoli3624 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmH' '50S ribosomal subunit protein L34' ecoli3265 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsL' '30S ribosomal subunit protein S12' ecoli3161 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplM' '50S ribosomal subunit protein L13' ecoli3222 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmJ' '50S ribosomal subunit protein X' ecoli3233 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplN' '50S ribosomal subunit protein L14' Training Accuracy: 14/14 (100.00%) Training Frequency class 'Ribosomal proteins - synthesis modificationRiboso': 22/939 (2.34%) Training Significance: dev(24.16) ; prob(1.501756E-23) Evaluation on validation data (471 items): ecoli3237 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsC' '30S ribosomal subunit protein S3' ecoli3226 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsE' '30S ribosomal subunit protein S5' ecoli3239 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsS' '30S ribosomal subunit protein S19' ecoli3236 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplP' '50S ribosomal subunit protein L16' ecoli1685 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmI' '50S ribosomal subunit protein A' ecoli3227 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplR' '50S ribosomal subunit protein L18' ecoli3160 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsI' '30S ribosomal subunit protein S9' ecoli3221 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsM' '30S ribosomal subunit protein S13' Validation Accuracy: 8/8 (100.00%) Validation Frequency class 'Ribosomal proteins - synthesis modificationRiboso': 12/471 (2.55%) Validation Significance: dev(17.49) ; prob(1.775345E-13) ------------------ Rule 35: (4/1, lift 104.3) [hom( A ),species( A ,mycoplasma_hyorhinis)] = 1 [hom( A ),species( A ,streptomyces_lividans),mol_wt_lteq( A ,55220)] = 1 [hom( A ),species( A ,mycoplasma_genitalium),mol_wt_lteq( A ,55220)] = 1 [hom( A ),species( A ,leishmania_tarentolae__sauroleishmania_tarentolae_),mol_wt_gt( A ,55220)] = 0 [hom( A ),species( A ,cyanophora_paradoxa),mol_wt_lteq( A ,32892)] = 0 -> class 'Chemotaxis and mobility' [0.667] Evaluation on test data (712 items): ecoli1905 - 4,1,5 Structural elements Cell envelop Surface structures 'fliH' 'flagellar biosynthesis; export of flagellar proteins?' ecoli1853 1,1,1 Cell processes Chemotaxis, motility Chemotaxis and mobility 'tap' 'methyl-accepting chemotaxis protein IV peptide sensor receptor (2nd module)' ecoli2173 - 4,1,3 Structural elements Cell envelop Outer membrane constituents 'ompC' 'outer membrane protein 1b (ib;c) (2nd module)' ecoli3005 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'air' 'aerotaxis sensor receptor flavoprotein(2nd module)' Test Accuracy: 1/4 (25.00%) Test Frequency class 'Chemotaxis and mobility': 4/712 (0.56%) Test Significance: dev(6.54) ; prob(2.228325E-02) Application to new data (2167 items): ecoli3976 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjcR' 'paral putative membrane protein' ecoli523 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0532' 'paral putative outer membrane protein (2nd module)' ecoli3410 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yhiI' 'paral putative membrane protein' ecoli255 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0263' 'paral putative membrane component of transport system' ecoli1076 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ycfF' 'orf' Frequency rule on new data: 5/2167 (0.23%) Evaluation on training data (939 items): ecoli3526 - 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'lctR' 'transcriptional regulator(1st module)' ecoli1392 1,1,1 Cell processes Chemotaxis, motility Chemotaxis and mobility 'trg' 'methyl-accepting chemotaxis protein III ribose and galactose sensor receptor(2nd module)' ecoli4244 1,1,1 Cell processes Chemotaxis, motility Chemotaxis and mobility 'tsr' 'methyl-accepting chemotaxis protein I serine sensor receptor(2nd module)' ecoli1854 1,1,1 Cell processes Chemotaxis, motility Chemotaxis and mobility 'tar' 'methyl-accepting chemotaxis protein II aspartate sensor receptor (2nd module)' Training Accuracy: 3/4 (75.00%) Training Frequency class 'Chemotaxis and mobility': 6/939 (0.64%) Training Significance: dev(18.66) ; prob(1.038558E-06) Evaluation on validation data (471 items): ecoli1856 1,1,1 Cell processes Chemotaxis, motility Chemotaxis and mobility 'cheA' 'sensory transducer kinase between chemo- signal receptors and CheB and CheY(2nd module)' Validation Accuracy: 1/1 (100.00%) Validation Frequency class 'Chemotaxis and mobility': 2/471 (0.42%) Validation Significance: dev(15.31) ; prob(4.246285E-03) ------------------ Rule 69: (2, lift 22.0) [hom( A ),e_val_gt( A ,6e-14),classification( A ,aculeata)] = 1 [hom( A ),e_val_lteq( A ,0.0006),classification( A ,lactococcus)] = 1 [hom( A ),e_val_lteq( A ,0.0006),keyword( A ,repeat)] = 0 [hom( A ),species( A ,leishmania_tarentolae__sauroleishmania_tarentolae_),mol_wt_gt( A ,55220)] = 1 -> class 'Mechanism not stated' [0.750] Evaluation on test data (712 items): ecoli3967 - 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'nrfD' 'putative nitrate reductase formate dependent also paral putative STP family of transport protein' ecoli4248 - 1,6,2 Cell processes Adaptation Osmotic adaptation 'mdoB' 'phosphoglycerol transferase I add phosphoglycerols to OPG backbone' Test Accuracy: 0/2 (0.00%) Test Frequency class 'Mechanism not stated': 19/712 (2.67%) Test Significance: dev(-0.23) ; prob(1.000000E+00) Application to new data (2167 items): ecoli1766 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1798' 'paral putative transport protein' ecoli2275 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'dedA' 'orf' Frequency rule on new data: 2/2167 (0.09%) Evaluation on training data (939 items): ecoli4178 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'fecC' 'ABC superfamily (membrane) citrate-dependent iron(III) transport protein (2nd module)' ecoli2443 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b2492' 'membrane protein formate transporter of hyf operon (formate channel 2)' Training Accuracy: 2/2 (100.00%) Training Frequency class 'Mechanism not stated': 32/939 (3.41%) Training Significance: dev(7.53) ; prob(1.161365E-03) Evaluation on validation data (471 items): ecoli1867 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b1899' 'split high-affinity L-arabinose transport system; membrane protein fragment 1' Validation Accuracy: 1/1 (100.00%) Validation Frequency class 'Mechanism not stated': 14/471 (2.97%) Validation Significance: dev(5.71) ; prob(2.972399E-02) ------------------ Rule 104: (4, lift 30.1) [hom( A ),classification( A ,amphibia)] = 1 [hom( A ),classification( A ,enterobacter)] = 0 [hom( A ),species( A ,mycoplasma_hyorhinis)] = 1 [hom( A ),e_val_lteq( A ,6e-14),classification( A ,asteridae)] = 0 [hom( A ),species( A ,mycoplasma_genitalium),mol_wt_lteq( A ,55220)] = 0 -> class 'Surface structures' [0.833] Evaluation on test data (712 items): ecoli167 - 3,1,8 Metabolism of small molecules Amino acid biosynthesis Glutamine 'glnD' 'uridylyltransferase acts on PII (GlnB) and GlnK(2nd module)' ecoli139 4,1,5 Structural elements Cell envelop Surface structures 'htrE' 'probable outer membrane porin protein involved in fimbrial assembly (2nd module)' Test Accuracy: 1/2 (50.00%) Test Frequency class 'Surface structures': 22/712 (3.09%) Test Significance: dev(3.83) ; prob(6.084301E-02) Application to new data (2167 items): ecoli3077 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yraK' 'paral putative fimbrial-like protein' ecoli1940 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1978' 'paral putative factor(2nd module)' ecoli1625 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1655' 'orf' ecoli2500 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2549' 'orf' ecoli524 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0533' 'involved in fimbrial asembly(2nd module)' ecoli2297 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2339' 'paral putative fimbrial-like protein' ecoli3076 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yraJ' 'paral putative outer membrane protein (2nd module)' ecoli2070 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yehD' 'paral putative fimbrial-like protein' ecoli2977 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ygiL' 'paral putative fimbrial-like protein' ecoli2295 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2337' 'paral putative outer membrane protein' ecoli3146 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yhcD' 'paral putative outer membrane protein (2nd module)' ecoli220 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yafL' 'orf' ecoli1474 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1503' 'paral putative fimbrial-like protein' ecoli2456 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2505' 'orf' ecoli141 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yadN' 'paral putative fimbrial-like protein' Frequency rule on new data: 15/2167 (0.69%) Evaluation on training data (939 items): ecoli1888 4,1,5 Structural elements Cell envelop Surface structures 'fliC' 'flagellar biosynthesis; flagellin filament structural protein(2nd module)' ecoli1049 4,1,5 Structural elements Cell envelop Surface structures 'flgE' 'flagellar biosynthesis hook protein(1st module)' ecoli4208 4,1,5 Structural elements Cell envelop Surface structures 'fimG' 'fimbrial morphology' ecoli4209 4,1,5 Structural elements Cell envelop Surface structures 'fimH' 'minor fimbrial subunit D-mannose specific adhesin N-terminal binds carbohydrate c-terminal binds periplasmic chaperone' Training Accuracy: 4/4 (100.00%) Training Frequency class 'Surface structures': 26/939 (2.77%) Training Significance: dev(11.85) ; prob(5.878020E-07) Evaluation on validation data (471 items): ecoli1889 4,1,5 Structural elements Cell envelop Surface structures 'fliD' 'flagellar biosynthesis; filament capping protein; enables filament assembly' ecoli1056 4,1,5 Structural elements Cell envelop Surface structures 'flgL' 'flagellar biosynthesis; hook-filament junction protein C-terminal involved in chaperone (probably FlgN) binding' Validation Accuracy: 2/2 (100.00%) Validation Frequency class 'Surface structures': 12/471 (2.55%) Validation Significance: dev(8.75) ; prob(6.491136E-04) ------------------ Rule 83: (3, lift 25.9) [hom( A ),classification( A ,klebsiella)] = 1 [hom( A ),e_val_lteq( A ,0.0006),classification( A ,escherichia)] = 0 -> class 'Pool multipurpose conversions of intermed metm' [0.800] Evaluation on test data (712 items): Test Accuracy: 0/0 (0.00%) Test Frequency class 'Pool multipurpose conversions of intermed metm': 20/712 (2.67%) Test Significance: dev(0.00) ; prob(1.000000E+00) Application to new data (2167 items): ecoli1093 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1120' 'orf' ecoli1017 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1044' 'orf' ecoli991 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1017' 'putative cytochrome' ecoli1693 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1725' 'orf' ecoli2480 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2529' 'orf believed to be involved in Fe-S protein formation or repair' ecoli603 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0613' 'orf (probable modifier of citrate lyase protein)' Frequency rule on new data: 6/2167 (0.28%) Evaluation on training data (939 items): ecoli607 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'b0617' 'citrate lyase acyl carrier protein (gamma chain)' ecoli606 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'b0616' 'citrate lyase beta chain (acyl lyase subunit)' ecoli605 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'b0615' 'citrate lyase alpha chain' Training Accuracy: 3/3 (100.00%) Training Frequency class 'Pool multipurpose conversions of intermed metm': 29/939 (3.09%) Training Significance: dev(9.70) ; prob(2.945759E-05) Evaluation on validation data (471 items): ecoli608 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'b0618' 'citrate lyase synthetase (citrate (pro-3S)-lyase ligase' Validation Accuracy: 1/1 (100.00%) Validation Frequency class 'Pool multipurpose conversions of intermed metm': 16/471 (3.40%) Validation Significance: dev(5.33) ; prob(3.397028E-02) ------------------ Rule 107: (11, lift 39.4) [hom( A ),species( A ,mycoplasma_hyorhinis)] = 0 [hom( A ),e_val_lteq( A ,0.0006),species( A ,bacillus_subtilis)] = 0 [hom( A ),psi_iter_gt( A ,7),classification( A ,burkholderia_group)] = 1 [hom( A ),psi_iter_gt( A ,5),species( A ,rhizobium_leguminosarum)] = 0 [hom( A ),species( A ,cyanophora_paradoxa),mol_wt_lteq( A ,32892)] = 0 -> class 'Transposon-related functions' [0.923] Evaluation on test data (712 items): ecoli1375 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'tra8_2' 'transposase 2 for IS30' ecoli1989 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_7' 'IS5 protein' ecoli3479 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi5A' 'IS5 protein' ecoli3480 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi5B' 'IS150 transposase' ecoli4173 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'tra8_3' 'transposase for IS30' ecoli1374 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi21_2' 'IS21 protein 2' ecoli946 - 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'hyaA' 'hydrogenase-1 small subunit' ecoli2979 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi22_5' 'IS2 protein' ecoli1953 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_6' 'IS5 protein' ecoli2034 - 1,5,33 Cell processes Transport/binding proteins RNDfamily 'b2075' 'RND family of transport protein paral putative outer membrane receptor' Test Accuracy: 8/10 (80.00%) Test Frequency class 'Transposon-related functions': 13/712 (1.83%) Test Significance: dev(18.46) ; prob(5.379083E-13) Application to new data (2167 items): ecoli1721 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1753' 'orf' ecoli1000 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'tra5_3' 'putative transposase for IS3' ecoli162 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yaeG' 'orf' ecoli365 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0373' 'orf' ecoli290 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0298' 'orf' ecoli2047 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2088' 'orf' ecoli291 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0299' 'putative IS tranposase' ecoli531 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'tra5_2' 'putative transposase for insertion sequence IS3' ecoli530 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0540' 'orf' ecoli2933 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2999' 'orf' ecoli4161 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b4272' 'orf' ecoli364 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'tra5_1' 'putative transposase 5 of IS3' ecoli259 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yagA' 'orf' ecoli2048 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'tra5_4' 'putative transposase for IS3' ecoli1001 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1027' 'orf' ecoli1548 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1578' 'orf' ecoli2272 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'dedD' 'paral putative lipoprotein(1st module)' ecoli4174 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b4285' 'putative transposase' ecoli4162 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b4273' 'putative transposase' Frequency rule on new data: 19/2167 (0.88%) Evaluation on training data (939 items): ecoli251 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_1' 'IS5 protein 1' ecoli1373 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi22_2' 'IS22 protein 2' ecoli248 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'tra8_1' 'transposase1 for IS30' ecoli1955 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi22_3' 'IS2 protein' ecoli1956 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi21_3' 'IS2 protein' ecoli3428 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_11' 'IS5 protein 11' ecoli2150 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_8' 'IS5 protein' ecoli542 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_2' 'IS 5 protein' ecoli1341 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_5' 'IS5 protein' ecoli353 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi22_1' 'IS22 protein 1' ecoli2916 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_9' 'IS5 protein' Training Accuracy: 11/11 (100.00%) Training Frequency class 'Transposon-related functions': 22/939 (2.34%) Training Significance: dev(21.41) ; prob(1.167691E-18) Evaluation on validation data (471 items): ecoli1302 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_4' 'IS5 protein' ecoli352 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi21_1' 'IS21 protein 1' ecoli2798 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi21_4' 'IS2 protein' ecoli3148 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_10' 'IS5 protein 10' ecoli948 - 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'hyaC' 'probable Ni/Fe-hydrogenase 1 b-type cytochrome subunit' ecoli2797 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi22_4' 'IS2 protein' ecoli2978 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi21_5' 'IS2 protein' Validation Accuracy: 6/7 (85.71%) Validation Frequency class 'Transposon-related functions': 14/471 (2.97%) Validation Significance: dev(12.89) ; prob(4.684207E-09) ------------------ Rule 13: (5, lift 50.3) [hom( A ),classification( A ,acanthopterygii)] = 1 [hom( A ),e_val_lteq( A ,6e-14),classification( A ,zea)] = 0 [hom( A ),e_val_lteq( A ,2e-37),classification( A ,caenorhabditis)] = 1 [hom( A ),e_val_lteq( A ,2e-37),classification( A ,eurotiales)] = 0 [hom( A ),species( A ,cyanophora_paradoxa),mol_wt_lteq( A ,32892)] = 0 [hom( A ),mol_wt_lteq( A ,43194),classification( A ,leishmania)] = 0 -> class 'Amino acyl tRNA syn; tRNA modificn' [0.857] Evaluation on test data (712 items): ecoli2465 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'hisS' 'histidine tRNA synthetase' ecoli144 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'yadB' 'paral putative tRNA synthetase (1st module)' ecoli3026 - 1,5,21 Cell processes Transport/binding proteins MFS family 'exuT' 'MFS family of transport protein transport of hexuronates (2nd module)' Test Accuracy: 2/3 (66.67%) Test Frequency class 'Amino acyl tRNA syn; tRNA modificn': 17/712 (2.39%) Test Significance: dev(7.29) ; prob(1.683025E-03) Application to new data (2167 items): ecoli4077 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'aidB' 'adaptive response(2nd module)' ecoli1343 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1372' 'putative membrane protein' ecoli1940 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1978' 'paral putative factor(2nd module)' Frequency rule on new data: 3/2167 (0.14%) Evaluation on training data (939 items): ecoli1682 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'pheS' 'phenylalanine tRNA synthetase alpha-subunit' ecoli1607 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'tyrS' 'tyrosine tRNA synthetase' ecoli1681 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'pheT' 'phenylalanine tRNA synthetase beta-subunit' ecoli517 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'cysS' 'cysteine tRNA synthetase(2nd module)' ecoli905 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'asnS' 'asparagine tRNA synthetase (3rd module)' Training Accuracy: 5/5 (100.00%) Training Frequency class 'Amino acyl tRNA syn; tRNA modificn': 16/939 (1.70%) Training Significance: dev(16.98) ; prob(1.436389E-09) Evaluation on validation data (471 items): ecoli1687 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'thrS' 'threonine tRNA synthetase(2nd module)' ecoli1889 - 4,1,5 Structural elements Cell envelop Surface structures 'fliD' 'flagellar biosynthesis; filament capping protein; enables filament assembly' ecoli194 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'proS' 'proline tRNA synthetase(1st module)' ecoli2355 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'gltX' 'glutamate tRNA synthetase catalytic subunit(1st module)' ecoli1834 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'aspS' 'aspartate tRNA synthetase(2nd module)' ecoli4048 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'yjeA' 'paral putative tRNA synthetase' Validation Accuracy: 5/6 (83.33%) Validation Frequency class 'Amino acyl tRNA syn; tRNA modificn': 11/471 (2.34%) Validation Significance: dev(13.14) ; prob(4.071431E-08) ------------------ Rule 100: (3, lift 75.1) [hom( A ),classification( A ,acanthopterygii)] = 1 [hom( A ),classification( A ,mesocricetus)] = 0 [hom( A ),species( A ,leishmania_tarentolae__sauroleishmania_tarentolae_),mol_wt_gt( A ,55220)] = 1 [hom( A ),species( A ,bacillus_stearothermophilus),mol_wt_lteq( A ,43194)] = 1 -> class 'Sugar-specific PTS system' [0.800] Evaluation on test data (712 items): ecoli3409 - 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b3486' 'ABC superfamily (membrance) paral putative membrane component of transport system (3rd module)' ecoli1467 - 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yddA' 'ABC superfamily (atp_bind) paral putative ATP-binding module (2nd module)' ecoli3961 - 1,5,35 Cell processes Transport/binding proteins SSS family 'yjcG' 'SSS family transport protein' ecoli3643 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'bglF' 'Sugar Specific PTS family beta-glucosides enzyme II cryptic (2nd module eiia (ei interaction)?)' Test Accuracy: 1/4 (25.00%) Test Frequency class 'Sugar-specific PTS system': 14/712 (1.97%) Test Significance: dev(3.32) ; prob(7.636216E-02) Application to new data (2167 items): Frequency rule on new data: 0/2167 (0.00%) Evaluation on training data (939 items): ecoli1705 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'celB' 'PTS family sugar specific enzyme II for cellobiose arbutin and salicin' ecoli2343 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'b2386' 'Sugar Specific paral putative membrane component of transport system' ecoli3855 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'frwC' 'PTS system fructose-like IIC component first module overlaps second(2nd module)' Training Accuracy: 3/3 (100.00%) Training Frequency class 'Sugar-specific PTS system': 10/939 (1.06%) Training Significance: dev(16.69) ; prob(1.207823E-06) Evaluation on validation data (471 items): ecoli1074 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'ptsG' 'Sugar Specific PTS family glucose-specific IIBCcomponent (3rd module hydrophilic second phosphorylation domain) mutant form transports D-ribose' ecoli4130 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'treB' 'PTS family enzyme II trehalose specific (maltose may be transported)' ecoli3223 - 1,2,1 Cell processes Chromosome replication Chromosome replication 'prlA' 'protein secretion inner membrane preprotein translocase SecY subunit interacts with SecE (1st module)' ecoli1591 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'malX' 'Sugar Specific PTS family maltose and glucose-specific ii abc (2nd module hydrophilic second phosphorylation domain)' ecoli2380 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'b2429' 'Sugar Specific paral putative PTS system enzyme II' Validation Accuracy: 4/5 (80.00%) Validation Frequency class 'Sugar-specific PTS system': 15/471 (3.18%) Validation Significance: dev(9.78) ; prob(4.979610E-06) ------------------ Rule 66: (23/2, lift 35.9) [hom( A ),classification( A ,ascaridida)] = 1 [hom( A ),e_val_lteq( A ,3e-06),species( A ,streptococcus_mutans)] = 0 [hom( A ),species( A ,leishmania_tarentolae__sauroleishmania_tarentolae_),mol_wt_gt( A ,55220)] = 1 [hom( A ),species( A ,bacillus_stearothermophilus),mol_wt_lteq( A ,43194)] = 1 -> class 'MFS family' [0.880] Evaluation on test data (712 items): ecoli335 1,5,21 Cell processes Transport/binding proteins MFS family 'lacY' 'MFS family of transport protein galactoside permease (M protein)(1st module)' ecoli2711 1,5,21 Cell processes Transport/binding proteins MFS family 'b2771' 'MFS family of transport protein (3rd module (function unknown)' ecoli470 1,5,21 Cell processes Transport/binding proteins MFS family 'fsr' 'MFS family of transport protein fosmidomycin resistance protein(2nd module)' ecoli818 - 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b0842' 'transmembrane multidrug/chloramphenicol efflux transporter (2nd module)' ecoli333 - 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'cynX' 'cyanate transport' ecoli1499 - 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'ydeA' 'ABC superfamily (membrane)putative membrane component of ABC transport system appears to facilitate arabinose export contributes to control of arabinose regulon' ecoli3961 - 1,5,35 Cell processes Transport/binding proteins SSS family 'yjcG' 'SSS family transport protein' ecoli2324 1,5,21 Cell processes Transport/binding proteins MFS family 'emrY' 'MFS family of transport protein multidrug resistance protein y (2nd module)' ecoli3587 1,5,21 Cell processes Transport/binding proteins MFS family 'uhpT' 'MFS family of transport protein hexose phosphate transport protein (2nd module)' ecoli2538 1,5,21 Cell processes Transport/binding proteins MFS family 'kgtP' 'MFS family of transport protein alpha-ketoglutarate permease(1st module)' ecoli3580 1,5,21 Cell processes Transport/binding proteins MFS family 'yicK' 'MFS family of transport protein two-module paral putative transport protein (2nd module)' ecoli2198 - 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'glpT' 'sn-glycerol-3-phosphate permease' ecoli1514 1,5,21 Cell processes Transport/binding proteins MFS family 'b1543' 'MFS family of transport protein (1st module)' ecoli2357 - 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'xapB' 'xanthosine permease' ecoli3594 1,5,21 Cell processes Transport/binding proteins MFS family 'emrD' 'MFS family of transport protein 2-module integral membrane pump; multidrug resistance (2nd module)' ecoli45 1,5,21 Cell processes Transport/binding proteins MFS family 'yaaU' 'MFS family transport protein' ecoli3026 1,5,21 Cell processes Transport/binding proteins MFS family 'exuT' 'MFS family of transport protein transport of hexuronates (2nd module)' ecoli2141 1,5,21 Cell processes Transport/binding proteins MFS family 'bcr' 'MFS family of transport protein bicyclomycin resistance protein; transmembrane protein (2nd module)' ecoli2487 1,5,21 Cell processes Transport/binding proteins MFS family 'b2536' 'MFS family of transport protein (1st module)' Test Accuracy: 13/19 (68.42%) Test Frequency class 'MFS family': 14/712 (1.97%) Test Significance: dev(20.86) ; prob(1.595516E-18) Application to new data (2167 items): ecoli1604 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1634' 'orf' ecoli821 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0845' 'paral putative transport protein (2nd module bind phosphorylated sugar? )' ecoli3780 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yihN' 'orf' ecoli1221 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'kch' 'putative potassium channel protein' ecoli3419 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhiP' 'orf' ecoli2626 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2681' 'orf' ecoli1658 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1690' 'paral putative MFS family of transport protein' ecoli4221 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjiJ' 'paral putative transport protein (2nd module)' ecoli1505 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ydeF' 'paral putative transport protein (1st module)' ecoli1743 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1775' 'paral putative transport protein (1st module)' ecoli2715 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2775' 'orf' ecoli2204 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2246' 'putative transport protein' ecoli1943 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1981' 'shikimate and dehydroshikimate permease (2nd module)' ecoli873 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ycaD' 'paral putative transport protein (1st module)' ecoli4245 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b4356' 'paral putative transport protein cryptic orf joins former yjiZ and yjjL' ecoli1038 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1065' 'orf' ecoli1759 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1791' 'putative amino acid/amine transport protein (3rd module)' ecoli2729 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2789' 'paral putative membrane component of transport system (2nd module)' Frequency rule on new data: 18/2167 (0.83%) Evaluation on training data (939 items): ecoli4024 - 1,5,31 Cell processes Transport/binding proteins POT family 'b4130' 'POT family of transport protein paral putative transport protein (3rd module)' ecoli2057 1,5,21 Cell processes Transport/binding proteins MFS family 'b2098' 'MFS family of transport protein (2nd module)' ecoli1659 1,5,21 Cell processes Transport/binding proteins MFS family 'b1691' 'MFS family of transport protein' ecoli3396 1,5,21 Cell processes Transport/binding proteins MFS family 'yhhS' 'MFS family of transport protein (2nd module)' ecoli1566 1,5,21 Cell processes Transport/binding proteins MFS family 'b1596' 'MFS familty transport protein (2nd module)' ecoli3154 1,5,21 Cell processes Transport/binding proteins MFS family 'nanT' 'MFS family of transport protein sialic acid transporter cryptic in K12?(1st module)' ecoli2631 1,5,21 Cell processes Transport/binding proteins MFS family 'emrB' 'MFS family of transport protein multidrug resistance; probably membrane translocase(1st module)' ecoli1026 1,5,21 Cell processes Transport/binding proteins MFS family 'yceE' 'MFS family of transport protein (2nd module)' ecoli425 1,5,21 Cell processes Transport/binding proteins MFS family 'ampG' 'MFS family of transport protein ampicillin resistance (1st module)' ecoli4226 1,5,21 Cell processes Transport/binding proteins MFS family 'yjiO' 'MFS family of transport protein (1st module)' ecoli2036 1,5,21 Cell processes Transport/binding proteins MFS family 'b2077' 'MFS family of transport protein (1st module)' ecoli3675 1,5,21 Cell processes Transport/binding proteins MFS family 'yieO' 'MFS family of tranport protein (1st mdule)' ecoli345 1,5,21 Cell processes Transport/binding proteins MFS family 'b0353' 'MFS family transport protein (2nd module function unknown)' ecoli3059 1,5,21 Cell processes Transport/binding proteins MFS family 'yhaU' 'MFS family of transport protein (D)-glucarate or galactarate transporter (1st module)' ecoli2129 1,5,21 Cell processes Transport/binding proteins MFS family 'yeiO' 'MFS family proton-coupled sugar efflux pump transport selective monosaccharides and disaccharides narrower substr. specificity than SetA(2nd module)' ecoli2899 1,5,21 Cell processes Transport/binding proteins MFS family 'nupG' 'MFS family of transport protein transport of nucleosides (2nd module)' ecoli388 1,5,21 Cell processes Transport/binding proteins MFS family 'araJ' 'MFS family of transport protein involved in either transport or processing of arabinose polymers (2nd module function unknown)' ecoli3287 1,5,21 Cell processes Transport/binding proteins MFS family 'yhfC' 'MFS family of transport protein paral putative transport protein' ecoli2878 1,5,21 Cell processes Transport/binding proteins MFS family 'galP' 'MFS family of transport protein galactose-proton symport of transport system (2nd module)' ecoli2741 1,5,21 Cell processes Transport/binding proteins MFS family 'fucP' 'MFS family of transport protein fucose permease(1st module)' ecoli3469 - 1,4,3 Cell processes Protection responses Drug/analog sensitivity 'yhjX' 'putative resistance protein' ecoli3583 1,5,21 Cell processes Transport/binding proteins MFS family 'yicM' 'MFS family of tranport protein (1st mdule)' ecoli1440 1,5,21 Cell processes Transport/binding proteins MFS family 'narU' 'MFS family of transport protein nitrate sensor-transmitter protein anaerobic respiratory path(1st module)' Training Accuracy: 21/23 (91.30%) Training Frequency class 'MFS family': 23/939 (2.45%) Training Significance: dev(27.57) ; prob(3.571731E-32) Evaluation on validation data (471 items): ecoli419 1,5,21 Cell processes Transport/binding proteins MFS family 'b0427' 'MFS family transport protein' ecoli2778 1,5,21 Cell processes Transport/binding proteins MFS family 'araE' 'MFS family of transport protein low-affinity L-arabinose transport system proton symport protein(1st module)' ecoli1196 - 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'narK' 'nitrite extrusion protein(2nd module)' ecoli1627 1,5,21 Cell processes Transport/binding proteins MFS family 'b1657' 'MFS family of transport protein (2nd module)' ecoli2772 - 1,4,3 Cell processes Protection responses Drug/analog sensitivity 'ygeD' 'putative resistance proteins' ecoli1630 1,5,21 Cell processes Transport/binding proteins MFS family 'ydhC' 'MFS family transport protein (2nd module)' ecoli70 1,5,21 Cell processes Transport/binding proteins MFS family 'yabM' 'MFS family of transport protein proton-coupled beta-galactosidase/sugar efflux pump ? role in lactose metabolism (2nd module)' ecoli4168 1,5,21 Cell processes Transport/binding proteins MFS family 'yjhB' 'MFS family of tranport protein (1st module)' ecoli1796 1,5,21 Cell processes Transport/binding proteins MFS family 'b1828' 'MFS family of transport protein (2nd module)' ecoli3223 - 1,2,1 Cell processes Chromosome replication Chromosome replication 'prlA' 'protein secretion inner membrane preprotein translocase SecY subunit interacts with SecE (1st module)' ecoli3925 1,5,21 Cell processes Transport/binding proteins MFS family 'xylE' 'MFS family of tranport protein xylose-proton symport (2nd module)' ecoli3612 - 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'yidT' 'D-galactonate transport' ecoli3588 1,5,21 Cell processes Transport/binding proteins MFS family 'uhpC' 'regulator of uhpT (1st module)' ecoli4005 1,5,21 Cell processes Transport/binding proteins MFS family 'proP' 'MFS family of tranport protein low-affinity constitutive transport system; proline permease II transports proline and betaine under conditions of hyperosmolarity(2nd module)' ecoli3631 1,5,21 Cell processes Transport/binding proteins MFS family 'yidY' 'MFS family of tranport protein (1st mdule)' ecoli2280 1,5,21 Cell processes Transport/binding proteins MFS family 'b2322' 'MFS family of transport protein paral putative (2nd module)' ecoli1737 1,5,21 Cell processes Transport/binding proteins MFS family 'ydjE' 'MFS family of transport protein (1st module)' Validation Accuracy: 13/17 (76.47%) Validation Frequency class 'MFS family': 14/471 (2.97%) Validation Significance: dev(17.84) ; prob(2.982296E-17) ------------------ Rule 2: (25/2, lift 34.8) [hom( A ),classification( A ,ascaridida)] = 0 [hom( A ),species( A ,leishmania_tarentolae__sauroleishmania_tarentolae_),mol_wt_gt( A ,55220)] = 1 -> class 'ABC superfamily (atp_bind)' [0.889] Evaluation on test data (712 items): ecoli3646 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'pstB' 'ABC superfamily (atp_bind) ATP-binding component of high-affinity phosphate-specific ABC transport system' ecoli924 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0949' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli151 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'fhuC' 'ABC superfamily (atp_bind) ATP-binding component of hydroxymate-dependent iron transport' ecoli3275 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yheS' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli805 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0829' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli785 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'glnQ' 'ABC superfamily (atp_bind) ATP-binding component of glutamine high-affinity ABC transport system(2nd module)' ecoli4176 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'fecE' 'ABC superfamily (atp_bind) ATP-binding component of citrate-dependent iron(III) transport protein' ecoli2622 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'proV' 'ABC superfamily (atp_bind) ATP-binding component of transport system for glycine betaine and proline(1st module)' ecoli3125 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b3195' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli796 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0820' 'paral putative ATP-binding component of transport system' ecoli4248 - 1,6,2 Cell processes Adaptation Osmotic adaptation 'mdoB' 'phosphoglycerol transferase I add phosphoglycerols to OPG backbone' ecoli1089 - 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1116' 'ABC superfamily (membrane) paral putative membrane component of ABC transport system' ecoli486 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ybbA' 'ABC superfamily (atp_bind) putative' ecoli3201 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yhdZ' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli2264 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'hisP' 'ABC superfamily (atp_bind) ATP-binding component of histidine ABC transport system' ecoli578 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'fepC' 'ABC superfamily (atp_bind) ATP-binding component of ferric enterobactin transport(2nd module)' ecoli1261 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'sapF' 'ABC superfamily (atp_bind) ATP-binding protein of peptide ABC transport system(2nd module)' ecoli358 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0366' 'ABC superfamily (atp_bind) ATP-binding component of a taurine transport system' ecoli3981 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yjcW' 'ABC superfamily (atp_bind) ATP-binding component of allose transport system (2nd module)' ecoli3805 - 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'frvB' 'PTS family fructose-like enzyme IIBC component(2nd module)' ecoli4277 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yjjK' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system (2nd module)' ecoli1826 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1858' 'ABC superfamily (atp_bind) ATP-binding component of a high affinity Zn transport system(1st module)' ecoli840 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'artP' 'ABC superfamily (atp&memb) ATP-binding component of 3rd arginine transport system(2nd module)' ecoli2373 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'cysA' 'ABC superfamily (atp_bind) ATP-binding component of sulfate permease A protein of ABC transport; chromate resistance (1st module)' ecoli3131 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yhbG' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli127 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yadG' 'ABC superfamily (atp_bind) ATP-binding component of transport protein (1st module)' ecoli1455 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1484' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli3952 - 2,2,3 Macromolecule metabolism Macromolecule synthesis, modification DNA - replication, repair, restraction/modification 'uvrA' 'excision nuclease subunit (3rd module prob. DNA binding)' ecoli770 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0794' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli254 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yagC' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' Test Accuracy: 26/30 (86.67%) Test Frequency class 'ABC superfamily (atp_bind)': 27/712 (3.79%) Test Significance: dev(23.76) ; prob(2.655764E-33) Application to new data (2167 items): ecoli2585 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2639' 'putative pump protein' ecoli4153 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjgR' 'orf' ecoli2858 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yggA' 'orf' ecoli2088 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yehX' 'paral putative ATP-binding component of transport system' ecoli103 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yacE' 'putative DNA repair protein' ecoli1484 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1513' 'paral putative ATP-binding component of transport system (2nd module)' ecoli2863 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yggC' 'paral putative kinase' ecoli2498 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2547' 'paral putative ATP-binding component of transport system' ecoli2769 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2832' 'putative transport protein' ecoli1454 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1483' 'paral putative ATP-binding component of transport system' Frequency rule on new data: 10/2167 (0.46%) Evaluation on training data (939 items): ecoli855 - 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0879' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli1099 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'potA' 'ABC superfamily (atp_bind) ATP-binding component of spermidine/putrescine ABC transport system (1st module)' ecoli3489 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'xylG' 'ABC superfamily (atp_bind) ATP-binding component of D-xylose ABC transport system(2nd module)' ecoli3991 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'phnK' 'ABC superfamily (atp_bind) ATP-binding component of phosphonate ABC transportbelieved to be part of carbon-phosphorus (C-P) lyase in phosphonate metabolism' ecoli3386 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ftsE' 'ABC superfamily (atp_bind) ATP-binding component of a membrane-associated complex involved in cell division' ecoli66 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yabJ' 'ABC superfamily (atp_bind) ATP-binding component of thiamine ABC transport system' ecoli3403 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'nikE' 'ABC superfamily (stp_bind) ATP-binding component of nickel ABC transport system probably couples energy to transport system' ecoli1650 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1682' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli3373 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ugpC' 'ABC superfamily (atp_bind) ATP-binding component of sn-glycerol 3-phosphate ABC transport system (1st module)' ecoli3670 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'rbsA' 'ABC superfamily (atp_bind) ATP-binding component of d-ribose high-affinity transport system (2nd module)' ecoli1090 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1117' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli908 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ycbE' 'paral putative ATP-binding component of transport system' ecoli1262 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'sapD' 'ABC superfamily (atp_bind) ATP-binding protein of peptide transport system(2nd module) affects potassium transport;' ecoli3463 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'dppF' 'ABC superfamily (atp_bind) ATP-binding component of a dipeptide transport system(1st module)' ecoli2139 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yejF' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli3378 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'livG' 'ABC superfamily (atp_bind) ATP-binding component of high-affinity branched-chain amino acid ABC transport system' ecoli1218 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'oppF' 'ABC superfamily (atp_bind) ATP-binding protein of oligopeptide ABC transport system' ecoli2159 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ccmA' 'ABC superfamily (atp_bind) ATP-binding component of heme exporter A heme exporter protein A cytochrome c-type biogenesis protein' ecoli2108 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'mglA' 'ABC superfamily (atp_bind) ATP-binding component of methyl-galactoside transport and galactose taxis (2nd module)' ecoli3402 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'nikD' 'ABC superfamily (atp_bind) ATP-binding component of nickel ABC transport system probably couples energy to transport system(2nd module)' ecoli1882 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1917' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli481 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0490' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli380 - 3,1,5 Metabolism of small molecules Amino acid biosynthesis Chorismate 'aroL' 'shikimate kinase II' ecoli736 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'modF' 'ABC superfamily (atp_bind) ATP-binding component of molybdenum transport system (2nd module)' ecoli440 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'mdlA' 'ATP-binding component of a transport system (2nd module)' Training Accuracy: 23/25 (92.00%) Training Frequency class 'ABC superfamily (atp_bind)': 24/939 (2.56%) Training Significance: dev(28.34) ; prob(6.747322E-35) Evaluation on validation data (471 items): ecoli3929 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'malK' 'ABC superfamily (atp_bind) ATP-binding component of transport system for maltose phenotypic repressor of mal operon(1st module)' ecoli441 - 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'mdlB' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli4119 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ytfS' 'ABC superfamily (atp_bind) putative ATP-binding component of a transport system' ecoli3990 - 3,3,13 Metabolism of small molecules Central intermediary metabolism Phosphorus compounds 'phnL' 'ABC superfamily (atp_bind) ATP-binding component of phosphonate ABC transport believed to be part of carbon-phosphorus (C-P) lyase in phosphonate metabolism' ecoli1074 - 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'ptsG' 'Sugar Specific PTS family glucose-specific IIBCcomponent (3rd module hydrophilic second phosphorylation domain) mutant form transports D-ribose' ecoli1289 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1318' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli2271 - 5,1,1 Extrachromosomal Laterally acquirred elements Colicin-related functions 'cvpA' 'membrane protein required for colicin V production' ecoli3464 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'dppD' 'ABC superfamily (atp_bind) ATP-binding component of dipeptide tABCransport system(2nd module)' ecoli642 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'gltL' 'ABC superfamily (atp_bind) ATP-binding protein of glutamate/aspartate transport system' ecoli3377 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'livF' 'ABC superfamily (atp_bind) ATP-binding component of leucine ABC transport system' ecoli1677 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'btuD' 'ABC superfamily (atp_bind) ATP-binding component of vitamin B12 ABC transport system(2nd module)' ecoli199 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'abc' 'ABC superfamily (atp_bind) ATP-binding component of ABC transport system(1st module)' ecoli1868 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'araG' 'ABC superfamily (atp_bind) ATP-binding component of high-affinity l-arabinose transport system (2nd module)' ecoli741 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'modC' 'ABC superfamily (atp_bind) ATP-binding component of molybdate ABC transport (1st module)' ecoli1724 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1756' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli831 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'potG' 'ABC superfamily (atp_bind) ATP-binding component of putrescine ABC transport system(1st module)' ecoli4000 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'phnC' 'ABC superfamily (atp_bind) ATP-binding component of phosphonate ABC transport system' ecoli1412 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1441' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli2169 - 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yojI' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli727 - 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'pnuC' 'required for NMN transport' ecoli1217 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'oppD' 'ABC superfamily (atp_bind) ATP-binding protein of oligopeptide ABC transport system' Validation Accuracy: 15/21 (71.43%) Validation Frequency class 'ABC superfamily (atp_bind)': 17/471 (3.61%) Validation Significance: dev(16.66) ; prob(1.000326E-17) ------------------ Rule 3: (6/1, lift 20.1) [hom( A ),classification( A ,amphibia)] = 0 [hom( A ),species( A ,mycoplasma_hyorhinis)] = 1 [hom( A ),species( A ,leishmania_tarentolae__sauroleishmania_tarentolae_),mol_wt_gt( A ,55220)] = 0 -> class 'ABC superfamily (membrane)' [0.750] Evaluation on test data (712 items): ecoli3927 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'malF' 'ABC superfamily (membrane) membrane component of maltose ABC transport system (2nd module)' ecoli2374 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'cysW' 'ABC superfamily (membrane) membrane component of sulfate ABC transport system; permease W protein' ecoli808 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0832' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli3200 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yhdY' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli1020 - 1,6,2 Cell processes Adaptation Osmotic adaptation 'b1047' 'membrane protein required for succinyl substitution of glucan backbone of OPG (osmoregulated periplasmic glucan) possible succinyl transferase' ecoli2266 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'hisQ' 'ABC superfamily (membrane)histidine transport system' ecoli3825 - 3,5,5 Metabolism of small molecules Energy metabolism, carbon Glycolysis 'tpiA' 'triosephosphate isomerase' Test Accuracy: 5/7 (71.43%) Test Frequency class 'ABC superfamily (membrane)': 26/712 (3.65%) Test Significance: dev(9.56) ; prob(1.281905E-06) Application to new data (2167 items): ecoli3932 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjbI' 'orf' ecoli3997 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b4103' 'orf' ecoli562 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0572' 'orf' ecoli701 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0718' 'paral putative outer membrane protein (2nd module)' ecoli2134 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2175' 'suppresses thermosensitivity of prc mutants at low osmolality; in turn suppressed by multicopy expression of PBP 7' ecoli2087 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yehW' 'paral putative membrane component of transport system' ecoli3147 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhcE' 'orf' ecoli255 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0263' 'paral putative membrane component of transport system' ecoli1076 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ycfF' 'orf' ecoli1442 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1471' 'putative glycoportein' Frequency rule on new data: 10/2167 (0.46%) Evaluation on training data (939 items): ecoli1456 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1485' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli1091 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1118' 'ABC superfamily (membrane) paral putative membrane component of ABC transport system' ecoli643 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'gltK' 'ABC superfamily (membrane) glutamate/aspartate transport (1st module)' ecoli3401 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'nikC' 'ABC superfamily (membrane) membrane component in nickel transport system probably forms heterodimeric pore with NikB(1st module)' ecoli722 - 5,1,1 Extrachromosomal Laterally acquirred elements Colicin-related functions 'tolA' 'membrane spanning protein required for outer membrane integrity' ecoli1413 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1442' 'ABC superfamily (membrane) paral putative membrane component of transport system' Training Accuracy: 5/6 (83.33%) Training Frequency class 'ABC superfamily (membrane)': 35/939 (3.73%) Training Significance: dev(10.29) ; prob(4.182729E-07) Evaluation on validation data (471 items): ecoli838 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'artQ' 'ABC superfamily (membrane) membrane component of 3rd arginine ABC transport system' ecoli1055 - 4,1,5 Structural elements Cell envelop Surface structures 'flgK' 'flagellar biosynthesis hook-filament junction protein 1 C-terminal involved in chaperone (probably FlgN) binding(2nd module)' ecoli832 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'potH' 'ABC superfamily (membrane) membrane component of putrescine ABC transport system(2nd module)' Validation Accuracy: 2/3 (66.67%) Validation Frequency class 'ABC superfamily (membrane)': 19/471 (4.03%) Validation Significance: dev(5.51) ; prob(4.684941E-03) ------------------ Rule 95: (10/1, lift 35.6) [hom( A ),classification( A ,acinetobacter)] = 0 [hom( A ),classification( A ,archiascomycetesschizosaccharomycetales)] = 0 [hom( A ),classification( A ,bacillariophyta)] = 1 [hom( A ),psi_iter_gt( A ,5),species( A ,rhizobium_leguminosarum)] = 0 [hom( A ),species( A ,leishmania_tarentolae__sauroleishmania_tarentolae_),mol_wt_gt( A ,55220)] = 0 -> class 'Ribosomal proteins - synthesis modificationRiboso' [0.833] Evaluation on test data (712 items): ecoli3264 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsG' '30S ribosomal subunit protein S7 initiates assembly' ecoli3238 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplV' '50S ribosomal subunit protein L22' ecoli3571 - 2,2,11 Macromolecule metabolism Macromolecule synthesis, modification RNA synthesis, modification, DNA transcription 'rpoZ' 'RNA polymerase omega subunit' ecoli4092 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsR' '30S ribosomal subunit protein S18' ecoli3115 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmA' '50S ribosomal subunit protein L27' ecoli3882 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplA' '50S ribosomal subunit protein L1 regulates synthesis of L1 and L11' ecoli3116 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplU' '50S ribosomal subunit protein L21' ecoli3656 - 3,4,0 Metabolism of small molecules Degradation of small molecules ATP-proton motive force interconversion 'atpH' 'membrane-bound ATP synthase F1 sector delta-subunit' ecoli3842 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmE' '50S ribosomal subunit protein L31' ecoli169 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsB' '30S ribosomal subunit protein S2' ecoli3229 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsH' '30S ribosomal subunit protein S8 and regulator' ecoli23 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsT' '30S ribosomal subunit protein S20' ecoli1649 - 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'b1681' 'required for stability of iron-sulfur component of FhuF regulated by Fur repressor(2nd module)' ecoli2556 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsP' '30S ribosomal subunit protein S16' ecoli2266 - 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'hisQ' 'ABC superfamily (membrane)histidine transport system' ecoli3654 - 3,4,0 Metabolism of small molecules Degradation of small molecules ATP-proton motive force interconversion 'atpG' 'membrane-bound ATP synthase F1 sector gamma-subunit' ecoli2553 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplS' '50S ribosomal subunit protein L19' ecoli3244 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsJ' '30S ribosomal subunit protein S10' Test Accuracy: 13/18 (72.22%) Test Frequency class 'Ribosomal proteins - synthesis modificationRiboso': 23/712 (3.23%) Test Significance: dev(16.56) ; prob(3.068954E-16) Application to new data (2167 items): ecoli2801 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2865' 'orf (2nd module)' ecoli1876 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yecA' 'orf' ecoli1600 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1630' 'orf' ecoli506 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0515' 'orf' ecoli276 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yagR' 'orf' ecoli1848 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1880' 'putative part of export apparatus for flagellar proteins' ecoli288 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0296' 'putative ribosomal protein' Frequency rule on new data: 7/2167 (0.32%) Evaluation on training data (939 items): ecoli3105 - 1,2,1 Cell processes Chromosome replication Chromosome replication 'secG' 'protein export - membrane protein' ecoli4090 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsF' '30S ribosomal subunit protein S6' ecoli3881 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplK' '50S ribosomal subunit protein L11' ecoli3230 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsN' '30S ribosomal subunit protein S14' ecoli3558 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmG' '50S ribosomal subunit protein L33' ecoli3235 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmC' '50S ribosomal subunit protein L29' ecoli3624 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmH' '50S ribosomal subunit protein L34' ecoli1062 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmF' '50S ribosomal subunit protein L32' ecoli3161 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplM' '50S ribosomal subunit protein L13' ecoli3233 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplN' '50S ribosomal subunit protein L14' Training Accuracy: 9/10 (90.00%) Training Frequency class 'Ribosomal proteins - synthesis modificationRiboso': 22/939 (2.34%) Training Significance: dev(18.33) ; prob(2.082372E-14) Evaluation on validation data (471 items): ecoli1651 - 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'b1683' 'paral putative factor for stability of iron-sulfur component of FhuF (2nd module)' ecoli3239 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsS' '30S ribosomal subunit protein S19' ecoli3221 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsM' '30S ribosomal subunit protein S13' Validation Accuracy: 2/3 (66.67%) Validation Frequency class 'Ribosomal proteins - synthesis modificationRiboso': 12/471 (2.55%) Validation Significance: dev(7.05) ; prob(1.897727E-03) ------------------ Rule 108: (5/1, lift 30.5) [hom( A ),classification( A ,methanococcaceae)] = 0 [hom( A ),mol_wt_lteq( A ,32892)] = 1 [hom( A ),e_val_gt( A ,0.0006),classification( A ,bacteria)] = 0 [hom( A ),e_val_lteq( A ,2e-37),classification( A ,haemophilus)] = 0 [hom( A ),psi_iter_lteq( A ,5),classification( A ,kinetoplastida)] = 0 [hom( A ),species( A ,saccharomyces_cerevisiae__baker_s_yeast_),keyword( A ,repeat)] = 0 [hom( A ),species( A ,gallus_gallus__chicken_),mol_wt_lteq( A ,55220)] = 0 [hom( A ),species( A ,bacillus_stearothermophilus),mol_wt_lteq( A ,32892)] = 0 [hom( A ),mol_wt_gt( A ,77359),classification( A ,cricetinae)] = 0 [hom( A ),mol_wt_lteq( A ,77359),mol_wt_gt( A ,55220)] = 1 -> class 'Transposon-related functions' [0.714] Evaluation on test data (712 items): ecoli545 - 5,1,2 Extrachromosomal Laterally acquirred elements Phage-related functions and prophages 'b0555' 'bacteriophage lambda lysozyme homolog' ecoli22 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'insA_1' 'IS1 protein InsA' ecoli4126 - 3,5,3 Metabolism of small molecules Energy metabolism, carbon Electron transport 'cybC' 'cytochrome b(562)' ecoli4167 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi41' 'IS4 protein' ecoli3603 - 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'glvB' 'Sugar Specific family of transport protein PTS system arbutin-like IIB component' ecoli3245 - 2,2,3 Macromolecule metabolism Macromolecule synthesis, modification DNA - replication, repair, restraction/modification 'pinO' 'calcium-binding protein required for initiation of chromosome replication' Test Accuracy: 2/6 (33.33%) Test Frequency class 'Transposon-related functions': 13/712 (1.83%) Test Significance: dev(5.76) ; prob(4.762032E-03) Application to new data (2167 items): ecoli2722 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'chpA' 'probable growth inhibitor A PemK-like autoregulated' ecoli892 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0917' 'orf' ecoli4015 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjdF' 'orf' ecoli4105 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b4215' 'orf' ecoli4075 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjfM' 'orf' ecoli1752 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1784' 'orf' ecoli4085 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ptxA' 'paral putative sugar specific-family of transport protein' ecoli3746 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yigP' 'orf' ecoli3900 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjaI' 'zinc resistance-associated protein precursor' ecoli3365 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhhZ' 'orf' ecoli535 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0545' 'orf' ecoli3726 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b3814' 'orf' ecoli1130 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1157' 'putative tail fiber protein' ecoli1506 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ydeH' 'orf(2nd module)' ecoli2616 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ygaC' 'orf' ecoli1636 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1668' 'orf' ecoli3638 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yieJ' 'orf' ecoli244 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yafZ' 'orf' ecoli2579 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2633' 'orf' ecoli3216 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhdN' 'orf' ecoli872 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ycaC' 'orf' ecoli501 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0510' 'orf' ecoli241 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0249' 'orf' ecoli1034 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'dinI' 'damage-inducible protein I; multicopy suppresses phenotype of cold-sensitive dinD filamentous mutation' ecoli402 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yajD' 'orf' ecoli4137 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjgG' 'orf' ecoli3826 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yiiQ' 'orf' ecoli1577 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1607' 'orf' ecoli4115 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'chpB' 'probable growth inhibitor B PemK-like autoregulated' ecoli3412 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhiK' 'orf' ecoli3617 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yidX' 'putative replicase' ecoli56 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yabP' 'orf' ecoli271 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0279' 'orf' ecoli212 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0219' 'orf' ecoli1653 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1685' 'orf' ecoli3306 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhfZ' 'orf' ecoli3920 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjbE' 'orf' ecoli1513 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1542' 'paral putative oxidoreductase (2nd module)' ecoli137 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yadL' 'orf' ecoli1835 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yecD' 'orf' ecoli3137 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b3207' 'orf' ecoli374 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yaiB' 'orf' ecoli3517 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yibJ' 'orf(1st module)' ecoli496 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0505' 'orf' ecoli50 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'apaG' 'orf' ecoli1525 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1554' 'paral putative lysozyme' ecoli3087 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhbQ' 'orf' ecoli1657 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1689' 'orf' ecoli4236 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjiW' 'orf' ecoli1407 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1436' 'orf' Frequency rule on new data: 50/2167 (2.31%) Evaluation on training data (939 items): ecoli646 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_3' 'IS5 protein' ecoli17 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi82_1' 'homolog IS186 and IS421 protein' ecoli3002 - 2,2,3 Macromolecule metabolism Macromolecule synthesis, modification DNA - replication, repair, restraction/modification 'b3068' 'G/U mismatch specific DNA glycosylase' ecoli267 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'insA_3' 'IS1 protein InsA' ecoli1862 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'insA_5' 'IS1 protein InsA' Training Accuracy: 4/5 (80.00%) Training Frequency class 'Transposon-related functions': 22/939 (2.34%) Training Significance: dev(11.48) ; prob(1.478363E-06) Evaluation on validation data (471 items): ecoli257 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'insA_2' 'IS1 protein InsA 2' ecoli3936 - 3,6,1 Metabolism of small molecules Fatty acid biosynthesis Fatty acid and phosphatidic acid biosynth 'dgkA' 'diacylglycerol kinase' ecoli962 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'insB_4' 'IS1 protein InsB' ecoli1427 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'rhsE' 'rhsE protein in rhs element' ecoli2430 - 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'gcvR' 'transcriptional regulation of gcv operon' ecoli4183 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'insA_7' 'IS1 protein InsA' ecoli3689 - 2,2,9 Macromolecule metabolism Macromolecule synthesis, modification Protein modufication 'ppiC' 'peptidyl-prolyl cis-trans isomerase C (rotamase C)' ecoli3367 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'insA_6' 'IS1 protein InsA' Validation Accuracy: 5/8 (62.50%) Validation Frequency class 'Transposon-related functions': 14/471 (2.97%) Validation Significance: dev(9.91) ; prob(1.186887E-06) ------------------ Rule 97: (2, lift 352.1) [hom( A ),classification( A ,bangiaceae)] = 0 [hom( A ),species( A ,leishmania_tarentolae__sauroleishmania_tarentolae_),mol_wt_gt( A ,55220)] = 1 [hom( A ),species( A ,klebsiella_pneumoniae),mol_wt_lteq( A ,32892)] = 0 [hom( A ),species( A ,gallus_gallus__chicken_),mol_wt_gt( A ,77359)] = 0 [hom( A ),mol_wt_lteq( A ,43194),classification( A ,lactobacillaceae)] = 1 -> class 'STP family' [0.750] Evaluation on test data (712 items): ecoli3462 1,5,36 Cell processes Transport/binding proteins STP family 'yhjV' 'STP family of transport protein' ecoli3043 1,5,36 Cell processes Transport/binding proteins STP family 'yhaO' 'STP family of transport protein (1st module)' Test Accuracy: 2/2 (100.00%) Test Frequency class 'STP family': 2/712 (0.28%) Test Significance: dev(26.65) ; prob(7.890418E-06) Application to new data (2167 items): ecoli1562 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1592' 'orf' ecoli3083 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yraQ' 'orf' ecoli2349 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2392' 'putative transport system permease(1st module)' Frequency rule on new data: 3/2167 (0.14%) Evaluation on training data (939 items): ecoli2736 1,5,36 Cell processes Transport/binding proteins STP family 'sdaC' 'STP family of transport protein serine transporter' ecoli3049 1,5,36 Cell processes Transport/binding proteins STP family 'tdcC' 'STP family of transport protein anaerobically inducible L-threonine/ L-serine permease' Training Accuracy: 2/2 (100.00%) Training Frequency class 'STP family': 2/939 (0.21%) Training Significance: dev(30.61) ; prob(4.536582E-06) Evaluation on validation data (471 items): ecoli2623 - 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'proW' 'ABC superfamily (membrane) membrane component of high-affinity ABC transport system for glycine betaine and proline (2nd module)' ecoli2782 1,5,36 Cell processes Transport/binding proteins STP family 'b2845' 'STP family of transport protein' Validation Accuracy: 1/2 (50.00%) Validation Frequency class 'STP family': 1/471 (0.21%) Validation Significance: dev(15.30) ; prob(4.237269E-03) ------------------ Rule 56: (7, lift 27.8) [hom( A ),classification( A ,alveolata)] = 0 [hom( A ),classification( A ,hymenomycetes)] = 0 [hom( A ),species( A ,rhizobium_leguminosarum__biovar_viciae_)] = 0 [hom( A ),e_val_gt( A ,6e-14),classification( A ,burkholderia_group)] = 0 [hom( A ),e_val_gt( A ,2e-37),classification( A ,salmonella)] = 1 [hom( A ),psi_iter_gt( A ,5),species( A ,rhizobium_leguminosarum)] = 1 [hom( A ),species( A ,cyanophora_paradoxa),mol_wt_lteq( A ,32892)] = 0 -> class 'Global regulatory functions' [0.889] Evaluation on test data (712 items): ecoli3589 - 1,5,21 Cell processes Transport/binding proteins MFS family 'uhpB' 'sensor histidine protein kinase phosphorylates UhpA(2nd module)' ecoli4287 6,1,1 Global functions Global regulatory functions Global regulatory functions 'arcA' 'negative response regulator of genes in aerobic pathways (sensors ArcB and CpxA)(1st module)' ecoli2829 - 4,1,5 Structural elements Cell envelop Surface structures 'dsbC' 'protein disulfide isomerase II(1st module)' ecoli2037 6,1,1 Global functions Global regulatory functions Global regulatory functions 'baeS' 'suppresses envZ and phoR/creC mutations(2nd module)' ecoli4006 - 2,2,11 Macromolecule metabolism Macromolecule synthesis, modification RNA synthesis, modification, DNA transcription 'basS' 'sensor protein for basR(2nd module)' Test Accuracy: 2/5 (40.00%) Test Frequency class 'Global regulatory functions': 16/712 (2.25%) Test Significance: dev(5.70) ; prob(4.826709E-03) Application to new data (2167 items): ecoli2177 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'atoS' 'sensor protein AtoS for response regulator AtoC(2nd module)' ecoli2635 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2690' 'orf' ecoli2507 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2556' 'paral putative sensor/kinase in regulatory system' ecoli2710 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2770' 'orf' ecoli3724 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yigB' 'orf' ecoli2960 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b3026' 'paral putative sensor/kinase in regulatory system' ecoli2337 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2380' 'putative sensor protein(1st module)' ecoli652 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0662' 'orf' ecoli3514 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b3592' 'paral putative S-transferase(1st module)' ecoli1932 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1968' 'paral putative sensor/kinase in regulatory system' ecoli609 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0619' 'sensory kinase in two component regulatory system with DpiA(2nd module)' Frequency rule on new data: 11/2167 (0.51%) Evaluation on training data (939 items): ecoli3327 6,1,1 Global functions Global regulatory functions Global regulatory functions 'envZ' 'protein histidine kinase/phosphatase sensor for ompR modulates expression of ompF and ompC (sensor)(2nd module)' ecoli1579 6,1,1 Global functions Global regulatory functions Global regulatory functions 'rstB' 'sensor histidine protein kinase (RstA regulator)(2nd module)' ecoli4284 6,1,1 Global functions Global regulatory functions Global regulatory functions 'creB' 'catabolic regulation response regulator(1st module)' ecoli4285 6,1,1 Global functions Global regulatory functions Global regulatory functions 'creC' 'catabolite repression sensor kinase for PhoB; alternative sensor for pho regulon(2nd module)' ecoli3817 6,1,1 Global functions Global regulatory functions Global regulatory functions 'cpxA' 'membrane sensor in 2-component cpxAR signal transduction system(2nd module)' ecoli1102 6,1,1 Global functions Global regulatory functions Global regulatory functions 'phoQ' 'periplasmic sensor protein in two component system with PhoP ligand is Mg+(2nd module)' ecoli3159 6,1,1 Global functions Global regulatory functions Global regulatory functions 'sspA' 'regulator of transcription; stringent starvation protein A(1st module)' Training Accuracy: 7/7 (100.00%) Training Frequency class 'Global regulatory functions': 30/939 (3.19%) Training Significance: dev(14.56) ; prob(3.397736E-11) Evaluation on validation data (471 items): ecoli2766 - 1,5,17 Cell processes Transport/binding proteins General PTS family 'ptsP' 'General PTS family (3rd module npr interaction homol to ei enzymes?)' ecoli677 6,1,1 Global functions Global regulatory functions Global regulatory functions 'kdpE' 'cytoplasmic response regulator of kdp operon (transcriptional effector) (2nd module DNA binding)' ecoli392 6,1,1 Global functions Global regulatory functions Global regulatory functions 'phoR' 'positive and negative sensor protein for pho regulon (and asr gene)(2nd module)' ecoli3775 - 3,1,8 Metabolism of small molecules Amino acid biosynthesis Glutamine 'glnL' 'bifunctional protein kinase/phosphatase nitrogen regulator II NRII(2nd module)' Validation Accuracy: 2/4 (50.00%) Validation Frequency class 'Global regulatory functions': 8/471 (1.70%) Validation Significance: dev(7.48) ; prob(1.672667E-03) ------------------