Rule 23: (3, lift 8.4) [hom( A ),classification( A ,arthropoda)] = 1 [hom( A ),species( A ,helicobacter_pylori__campylobacter_pylori_),mol_wt_gt( A ,55220)] = 0 [hom( A ),mol_wt_gt( A ,55220),classification( A ,helicobacter_group)] = 1 [hom( A ),keyword( A ,repeat),classification( A ,embryophyta)] = 0 -> class 'Energy metabolism carbon' [0.800] Evaluation on test data (712 items): ecoli1439 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'narZ' 'nitrate reductase 2 alpha subunit(1st module)' ecoli2241 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'nuoG' 'NADH dehydrogenase I chain G' ecoli1840 - 3,3,0 Metabolism of small molecules Central intermediary metabolism other 'bisZ' 'biotin sulfoxide reductase 2(1st module)' ecoli2164 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'napA' 'periplasmic nitrate reductase in complex with NapB(1st module)' Test Accuracy: 3/4 (75.00%) Test Frequency class 'Energy metabolism carbon': 70/712 (9.83%) Test Significance: dev(4.38) ; prob(3.520861E-03) Application to new data (2167 items): ecoli1557 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1587' 'paral putative reductase 2(1st module)' ecoli1472 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1501' '3rd module of a paral putative reductase (3rd module)' ecoli3724 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yigB' 'orf' ecoli1558 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1588' 'paral putative reductase 2(1st module)' Frequency rule on new data: 4/2167 (0.18%) Evaluation on training data (939 items): ecoli869 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'dmsA' 'anaerobic dimethyl sulfoxide reductase subunit A(1st module)' ecoli706 3,5,8 Metabolism of small molecules Energy metabolism, carbon TCA cycle 'sdhA' 'succinate dehydrogenase flavoprotein subunit EC 1.3.5.1' ecoli1197 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'narG' 'nitrate reductase 1 alpha subunit(1st module)' Training Accuracy: 3/3 (100.00%) Training Frequency class 'Energy metabolism carbon': 89/939 (9.48%) Training Significance: dev(5.35) ; prob(8.514776E-04) Evaluation on validation data (471 items): ecoli1445 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'fdnG' 'formate dehydrogenase-N nitrate-inducible alpha subunit(1st module)' ecoli971 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'torA' 'trimethylamine N-oxide reductase subunit' ecoli3973 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'fdhF' 'selenopolypeptide subunit of formate dehydrogenase H (part of formate hydrogen-lyase complex: FHL complex) EC 1.2.1.2 CONSISTS OF TWO SEPARABLE ENZYMATIC ACTIVITIES: A FORMATE DEHYDROGENASE COMPONENT(1st module)' Validation Accuracy: 3/3 (100.00%) Validation Frequency class 'Energy metabolism carbon': 44/471 (9.34%) Validation Significance: dev(5.40) ; prob(8.152584E-04) ------------------ Rule 40: (12, lift 37.9) [hom( A ),e_val_gt( A ,0.0006),classification( A ,artiodactyla)] = 0 [hom( A ),psi_iter_lteq( A ,7),classification( A ,solanaceae)] = 1 [hom( A ),mol_wt_lteq( A ,32892),classification( A ,bangiaceae)] = 1 [hom( A ),keyword( A ,inner_membrane),classification( A ,epsilon_subdivision)] = 0 -> class 'Ribosome constituents' [0.929] Evaluation on test data (712 items): ecoli3798 - 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'fdoI' 'formate dehydrogenase cytochrome B556 (FDO) subunit' ecoli3264 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsG' '30S ribosomal subunit protein S7 initiates assembly' ecoli3232 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplX' '50S ribosomal subunit protein L24' ecoli3238 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplV' '50S ribosomal subunit protein L22' ecoli4092 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsR' '30S ribosomal subunit protein S18' ecoli3115 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmA' '50S ribosomal subunit protein L27' ecoli3656 - 3,4,0 Metabolism of small molecules Degradation of small molecules ATP-proton motive force interconversion 'atpH' 'membrane-bound ATP synthase F1 sector delta-subunit' ecoli3241 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplW' '50S ribosomal subunit protein L23' ecoli3231 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplE' '50S ribosomal subunit protein L5' ecoli169 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsB' '30S ribosomal subunit protein S2' ecoli3229 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsH' '30S ribosomal subunit protein S8 and regulator' ecoli2274 - 3,6,1 Metabolism of small molecules Fatty acid biosynthesis Fatty acid and phosphatidic acid biosynth 'accD' 'acetylCoA carboxylase carboxytransferase component beta subunit' ecoli1447 - 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'fdnI' 'formate dehydrogenase-N nitrate-inducible cytochrome B556(Fdn) gamma subunit' ecoli2556 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsP' '30S ribosomal subunit protein S16' ecoli3219 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsD' '30S ribosomal subunit protein S4' ecoli3323 - 1,6,1 Cell processes Adaptation Adaptations, atypical conditions 'yrfH' 'heat shock protein 15 DNA/RNA binding' Test Accuracy: 11/16 (68.75%) Test Frequency class 'Ribosome constituents': 24/712 (3.37%) Test Significance: dev(14.49) ; prob(2.383432E-13) Application to new data (2167 items): ecoli2338 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2381' 'paral putative regulator protein' ecoli483 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0492' 'paral putative thioredoxin protein' ecoli2810 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2874' 'orf' ecoli1771 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1803' 'paral putative oxidoreductase(1st module)' ecoli315 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0323' 'orf' ecoli1624 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ydhD' 'orf' Frequency rule on new data: 6/2167 (0.28%) Evaluation on training data (939 items): ecoli3559 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmB' '50S ribosomal subunit protein L28' ecoli4090 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsF' '30S ribosomal subunit protein S6' ecoli3220 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsK' '30S ribosomal subunit protein S11' ecoli3240 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplB' '50S ribosomal subunit protein L2' ecoli1684 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplT' '50S ribosomal subunit protein L20 and regulator' ecoli4093 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplI' '50S ribosomal subunit protein L9' ecoli3230 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsN' '30S ribosomal subunit protein S14' ecoli3558 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmG' '50S ribosomal subunit protein L33' ecoli1062 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmF' '50S ribosomal subunit protein L32' ecoli3265 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsL' '30S ribosomal subunit protein S12' ecoli3222 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmJ' '50S ribosomal subunit protein X' ecoli3233 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplN' '50S ribosomal subunit protein L14' Training Accuracy: 12/12 (100.00%) Training Frequency class 'Ribosome constituents': 23/939 (2.45%) Training Significance: dev(21.86) ; prob(4.663875E-20) Evaluation on validation data (471 items): ecoli3237 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsC' '30S ribosomal subunit protein S3' ecoli3239 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsS' '30S ribosomal subunit protein S19' ecoli3236 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplP' '50S ribosomal subunit protein L16' ecoli3221 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsM' '30S ribosomal subunit protein S13' Validation Accuracy: 4/4 (100.00%) Validation Frequency class 'Ribosome constituents': 15/471 (3.18%) Validation Significance: dev(11.03) ; prob(1.028683E-06) ------------------ Rule 42: (85/6, lift 4.6) [hom( A ),classification( A ,carnivora)] = 1 [hom( A ),classification( A ,corynebacteriaceae)] = 1 [hom( A ),keyword( A ,transmembrane),classification( A ,kinetoplastida)] = 1 -> class 'Transport/binding proteins' [0.920] Evaluation on test data (712 items): ecoli335 1,5,21 Cell processes Transport/binding proteins MFS family 'lacY' 'MFS family of transport protein galactoside permease (M protein)(1st module)' ecoli3338 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'gntT' 'high-affinity gluconate permease in GNT-I system' ecoli2711 1,5,21 Cell processes Transport/binding proteins MFS family 'b2771' 'MFS family of transport protein (3rd module (function unknown)' ecoli1485 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1514' 'ABC superfamily (membrane)paral putative membrane component of ABC transport system' ecoli1307 1,5,43 Cell processes Transport/binding proteins ArAAP family 'ydaH' 'ArAAP family p-aminobenzoyl-glutamate utilization paral putative pump protein (transport)(1st module)' ecoli746 1,5,11 Cell processes Transport/binding proteins DASS family 'b0770' 'DASS family of transport protein' ecoli3374 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'ugpE' 'ABC superfamily (membrane)sn-glycerol 3-phosphate transport system integral membrane protein(1st module)' ecoli112 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'aroP' 'APC family of transport protein aromatic amino acid transport protein' ecoli4210 1,5,19 Cell processes Transport/binding proteins GntP family 'gntP' 'GntP family of transport protein high affinity gluconate transporter/gluconate permease in gnt-iii system (2nd module)' ecoli818 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b0842' 'transmembrane multidrug/chloramphenicol efflux transporter (2nd module)' ecoli4120 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'ytfT' 'ABC superfamily (membrane)paral putative membrane component' ecoli333 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'cynX' 'cyanate transport' ecoli1499 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'ydeA' 'ABC superfamily (membrane)putative membrane component of ABC transport system appears to facilitate arabinose export contributes to control of arabinose regulon' ecoli3462 1,5,36 Cell processes Transport/binding proteins STP family 'yhjV' 'STP family of transport protein' ecoli47 1,5,9 Cell processes Transport/binding proteins CPA2 family 'kefC' 'CPA2 family k+ efflux antiporter glutathione-regulated (2nd module)' ecoli3409 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b3486' 'ABC superfamily (membrance) paral putative membrane component of transport system (3rd module)' ecoli2921 1,5,32 Cell processes Transport/binding proteins PiT family 'pitB' 'PiT family low-affinity phosphate transport(1st module)' ecoli3525 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'lldP' 'L-lactate permease(1st module)' ecoli581 1,5,31 Cell processes Transport/binding proteins POT family 'ybdA' 'paral putative POT family of transport protein (1st module)' ecoli769 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0793' 'putative membrane component of ABC transport system(2nd module)' ecoli3980 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yjcV' 'ABC superfamily (membrane) membrane component of allose ABC transport system(1st module)' ecoli1467 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yddA' 'ABC superfamily (atp_bind) paral putative ATP-binding module (2nd module)' ecoli252 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'ykfD' 'APC family of transport protein S-methylmethionine permease' ecoli3961 1,5,35 Cell processes Transport/binding proteins SSS family 'yjcG' 'SSS family transport protein' ecoli3643 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'bglF' 'Sugar Specific PTS family beta-glucosides enzyme II cryptic (2nd module eiia (ei interaction)?)' ecoli2324 1,5,21 Cell processes Transport/binding proteins MFS family 'emrY' 'MFS family of transport protein multidrug resistance protein y (2nd module)' ecoli662 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'nagE' 'Sugar Specific PTS family n-acetylglucosamine-specific enzyme IIABC (3rd module)' ecoli3587 1,5,21 Cell processes Transport/binding proteins MFS family 'uhpT' 'MFS family of transport protein hexose phosphate transport protein (2nd module)' ecoli3293 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'yhfM' 'APC family paral putative amino-acid transport protein (2nd module)' ecoli2655 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'ascF' 'PTS family enzyme II ABC (asc) cryptic transports specific beta-glucosides' ecoli4017 1,5,12 Cell processes Transport/binding proteins Dcu family 'dcuB' 'Dcu family anaerobic C4-dicarboxylate transporter' ecoli602 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'b0612' 'citrate carrier transport citrate trading for succinate export' ecoli3380 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'livH' 'ABC superfamily (membrane) membrane component of high-affinity branched-chain amino acid ABC transport system(2nd module)' ecoli2538 1,5,21 Cell processes Transport/binding proteins MFS family 'kgtP' 'MFS family of transport protein alpha-ketoglutarate permease(1st module)' ecoli3580 1,5,21 Cell processes Transport/binding proteins MFS family 'yicK' 'MFS family of transport protein two-module paral putative transport protein (2nd module)' ecoli2198 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'glpT' 'sn-glycerol-3-phosphate permease' ecoli566 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'pheP' 'phenylalanine-specific transport system' ecoli4041 1,5,34 Cell processes Transport/binding proteins SMR family 'sugE' 'SMR family of transport protein' ecoli3579 1,5,16 Cell processes Transport/binding proteins GPH family 'yicJ' 'GPH family paral putative transport protein' ecoli1827 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'yebI' 'inner memrane component of a high affininty Zn transport system' ecoli3043 1,5,36 Cell processes Transport/binding proteins STP family 'yhaO' 'STP family of transport protein (1st module)' ecoli1164 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b1191' 'PUTATIVE NA(+)/H(+) EXCHANGER according to SwissProt version 38 the orf starts at a methionine 42 aa upstream of b 1191 start' ecoli1514 1,5,21 Cell processes Transport/binding proteins MFS family 'b1543' 'MFS family of transport protein (1st module)' ecoli3188 1,5,35 Cell processes Transport/binding proteins SSS family 'panF' 'SSS family transport protein sodium/pantothenate symporter(1st module)' ecoli1633 1,5,20 Cell processes Transport/binding proteins MATE family 'ydhE' 'MATE family of transport protein(2nd module)' ecoli2357 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'xapB' 'xanthosine permease' ecoli1097 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'potC' 'ABC superfamily (membrane) membrane component of spermidine/putrescine ABC transport system(2nd module)' ecoli3604 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'glvC' 'PTS family arbutin-like IIC component' ecoli67 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yabK' 'ABC superfamily (membrane) membrane component of thiamine ABC transport system(1st module)' ecoli198 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yaeE' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli2497 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b2546' 'ABC superfamily (membrane)paral putative membrane component of ABC transport system (2nd module)' ecoli3594 1,5,21 Cell processes Transport/binding proteins MFS family 'emrD' 'MFS family of transport protein 2-module integral membrane pump; multidrug resistance (2nd module)' ecoli3836 - 3,2,8 Metabolism of small molecules Biosynthesis of cofactors, carriers Menaquinone, ubiquinone 'menA' '14-dihydroxy-2-naphthoate --> dimethylmenaquinone' ecoli45 1,5,21 Cell processes Transport/binding proteins MFS family 'yaaU' 'MFS family transport protein' ecoli4155 1,5,19 Cell processes Transport/binding proteins GntP family 'yjgT' 'GntP family l-idonate transporter (2nd module)' ecoli2137 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yejB' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli3026 1,5,21 Cell processes Transport/binding proteins MFS family 'exuT' 'MFS family of transport protein transport of hexuronates (2nd module)' ecoli1875 1,5,43 Cell processes Transport/binding proteins ArAAP family 'tyrP' 'ArAAP family tyrosine-specific transport system' ecoli328 1,5,25 Cell processes Transport/binding proteins NCS1 family 'codB' 'NCS1 family transport protein cytosine permease/transport(2nd module)' ecoli3971 1,5,10 Cell processes Transport/binding proteins DAACS family 'gltP' 'DAACS family of transport protein glutamate-aspartate symport protein' ecoli2141 1,5,21 Cell processes Transport/binding proteins MFS family 'bcr' 'MFS family of transport protein bicyclomycin resistance protein; transmembrane protein (2nd module)' ecoli2487 1,5,21 Cell processes Transport/binding proteins MFS family 'b2536' 'MFS family of transport protein (1st module)' ecoli3959 1,5,8 Cell processes Transport/binding proteins CPA1 family 'yjcE' 'CPA1 family PUTATIVE NA(+)/H(+) EXCHANGER YJCE(1st module)' ecoli807 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0831' 'paral putative membrane component of transport system' Test Accuracy: 62/64 (96.88%) Test Frequency class 'Transport/binding proteins': 151/712 (21.21%) Test Significance: dev(14.81) ; prob(2.207893E-39) Application to new data (2167 items): ecoli1604 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1634' 'orf' ecoli2823 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2887' 'putative oxidoreductase fe-s subunit (2nd module)' ecoli3170 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhcP' 'orf' ecoli487 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0496' 'putative oxidoreductase' ecoli821 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0845' 'paral putative transport protein (2nd module bind phosphorylated sugar? )' ecoli1349 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1378' 'paral putative oxidoreductase (2nd module)' ecoli3606 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yidE' 'paral putative transport protein(1st module)' ecoli1404 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1433' 'putative membrane transport protein' ecoli4009 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b4115' 'putative amino acid/amine transport protein cryptic' ecoli1575 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1605' 'putative arginine/ornithine antiporter' ecoli1562 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1592' 'orf' ecoli2419 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yffG' 'paral putative oxidoreductase (2nd module)' ecoli4034 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjeH' 'putative transport' ecoli3849 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yijE' 'orf (2nd module)' ecoli3780 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yihN' 'orf' ecoli2529 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yfiK' 'paral putative transport protein' ecoli3419 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhiP' 'orf' ecoli2626 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2681' 'orf' ecoli3357 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b3434' 'orf' ecoli2821 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2885' 'orf' ecoli1766 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1798' 'paral putative transport protein' ecoli3739 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yigM' 'paral putative transport protein (2nd module)' ecoli980 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1006' 'putative transport protein(2nd module)' ecoli2943 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yghB' 'orf' ecoli2997 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ygjE' 'paral putative DASS family of transport protein' ecoli3585 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yicO' 'orf (2nd module)' ecoli2585 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2639' 'putative pump protein' ecoli1658 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1690' 'paral putative MFS family of transport protein' ecoli4100 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ytfF' 'orf (1st module)' ecoli3114 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhbE' 'orf' ecoli1923 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yedA' 'orf (1st module)' ecoli2558 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2611' 'orf' ecoli1042 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'mviN' 'putative virulence factor' ecoli1504 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ydeD' 'orf' ecoli4221 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjiJ' 'paral putative transport protein (2nd module)' ecoli2250 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2292' 'putative transport protein' ecoli3600 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yidK' 'putative cotransporter' ecoli3731 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'rarD' 'chloramphenicol resistance' ecoli1505 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ydeF' 'paral putative transport protein (1st module)' ecoli502 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0511' 'orf' ecoli1743 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1775' 'paral putative transport protein (1st module)' ecoli3706 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yifJ' 'putative cytochrome' ecoli3083 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yraQ' 'orf' ecoli2715 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2775' 'orf' ecoli3322 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yrfG' 'orf' ecoli2858 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yggA' 'orf' ecoli4083 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjfS' 'orf' ecoli789 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ybiF' 'orf' ecoli2305 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yfdC' 'putative transport' ecoli3576 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yicE' 'putative transport protein' ecoli2346 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2389' 'orf' ecoli2204 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2246' 'putative transport protein' ecoli3126 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yrbG' 'orf' ecoli320 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0328' 'paral putative transport protein' ecoli2285 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yfcA' 'putative structural protein' ecoli1486 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1515' 'paral putative membrane component of ABC transport system' ecoli1444 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yddG' 'orf' ecoli7 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yaaJ' 'putative inner membrane transport protein' ecoli504 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0513' 'putative transport(2nd module)' ecoli4243 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjiY' 'putative carbon starvation protein' ecoli2769 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2832' 'putative transport protein' ecoli1943 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1981' 'shikimate and dehydroshikimate permease (2nd module)' ecoli1697 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1729' 'part of a kinase(1st module paral putative tdomain shared with transporter)' ecoli2322 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'dsdX' 'transport system permease (serine?)' ecoli2275 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'dedA' 'orf' ecoli873 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ycaD' 'paral putative transport protein (1st module)' ecoli4245 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b4356' 'paral putative transport protein cryptic orf joins former yjiZ and yjjL' ecoli1038 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1065' 'orf' ecoli2361 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yfeH' 'putative cytochrome oxidase' ecoli1759 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1791' 'putative amino acid/amine transport protein (3rd module)' ecoli4049 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjeM' 'paral putative amino-acid transport protein' ecoli262 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yagG' 'paral putative transport protein' ecoli2729 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2789' 'paral putative membrane component of transport system (2nd module)' ecoli2256 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2298' 'putative S-transferase' ecoli794 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0818' 'orf' ecoli2906 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2972' 'paral putative peptidase' ecoli3958 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjcD' 'orf (2nd module)' Frequency rule on new data: 77/2167 (3.55%) Evaluation on training data (939 items): ecoli4029 - 3,3,19 Metabolism of small molecules Central intermediary metabolism Sulfur metabolism 'dsbD' 'thiol:disulfide interchange protein N-term.(1st module)' ecoli4024 1,5,31 Cell processes Transport/binding proteins POT family 'b4130' 'POT family of transport protein paral putative transport protein (3rd module)' ecoli2057 1,5,21 Cell processes Transport/binding proteins MFS family 'b2098' 'MFS family of transport protein (2nd module)' ecoli3359 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'gntU_1' 'split gene low-affinity gluconate transport permease protein in GNT-I system first part of fragment 1(1st module)' ecoli3425 1,5,4 Cell processes Transport/binding proteins Ars family 'arsB' 'Ars family arsenical pump membrane protein(2nd module)' ecoli1263 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'sapC' 'ABC superfamily (membrane) membrane component of peptide ABC transport system(2nd module)' ecoli1723 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1755' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli153 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'fhuB' 'ABC superfamily (membrane) split gene C-term module hydroxamate-dependent iron uptake (2nd module iron uptake )' ecoli1659 1,5,21 Cell processes Transport/binding proteins MFS family 'b1691' 'MFS family of transport protein' ecoli3396 1,5,21 Cell processes Transport/binding proteins MFS family 'yhhS' 'MFS family of transport protein (2nd module)' ecoli1566 1,5,21 Cell processes Transport/binding proteins MFS family 'b1596' 'MFS familty transport protein (2nd module)' ecoli3154 1,5,21 Cell processes Transport/binding proteins MFS family 'nanT' 'MFS family of transport protein sialic acid transporter cryptic in K12?(1st module)' ecoli2736 1,5,36 Cell processes Transport/binding proteins STP family 'sdaC' 'STP family of transport protein serine transporter' ecoli3783 1,5,16 Cell processes Transport/binding proteins GPH family 'yihP' 'GPH family paral putative transport protein' ecoli2680 1,5,19 Cell processes Transport/binding proteins GntP family 'b2740' 'GntP family of transport protein function unknown (3rd module)' ecoli3782 1,5,16 Cell processes Transport/binding proteins GPH family 'b3876' 'GPH family paral putative transport protein (2nd module)' ecoli1267 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'b1296' 'APC family paral putative amino-acid transport protein' ecoli2631 1,5,21 Cell processes Transport/binding proteins MFS family 'emrB' 'MFS family of transport protein multidrug resistance; probably membrane translocase(1st module)' ecoli3049 1,5,36 Cell processes Transport/binding proteins STP family 'tdcC' 'STP family of transport protein anaerobically inducible L-threonine/ L-serine permease' ecoli1264 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'sapB' 'ABC superfamily (membrane) membrane component of peptide ABC transport system' ecoli1026 1,5,21 Cell processes Transport/binding proteins MFS family 'yceE' 'MFS family of transport protein (2nd module)' ecoli1586 1,5,16 Cell processes Transport/binding proteins GPH family 'uidB' 'GPH family glucuronide permease' ecoli425 1,5,21 Cell processes Transport/binding proteins MFS family 'ampG' 'MFS family of transport protein ampicillin resistance (1st module)' ecoli2115 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'lysP' 'APC family lysine-specific permease (2nd module)' ecoli4226 1,5,21 Cell processes Transport/binding proteins MFS family 'yjiO' 'MFS family of transport protein (1st module)' ecoli1179 1,5,38 Cell processes Transport/binding proteins SulP family 'ychM' 'SulP family transport protein (1st module)' ecoli2036 1,5,21 Cell processes Transport/binding proteins MFS family 'b2077' 'MFS family of transport protein (1st module)' ecoli675 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'potE' 'APC family putrescine-lyase antiporter' ecoli2608 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'gabP' 'transport permease protein of gamma-aminobutyrate' ecoli3157 1,5,13 Cell processes Transport/binding proteins DcuC 'yhcL' 'DcuC family of transport protein (2nd module)' ecoli1159 1,5,27 Cell processes Transport/binding proteins NhaA family 'nhaB' 'NhaB family of transport protein Na+/H+ antiporter regulator of intracellular pH(1st module)' ecoli3675 1,5,21 Cell processes Transport/binding proteins MFS family 'yieO' 'MFS family of tranport protein (1st mdule)' ecoli3466 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'dppB' 'ABC superfamily (membrane) membrane component of dipeptide ABC transport system; permease protein 1(2nd module)' ecoli3671 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'rbsC' 'ABC superfamily (membrane) ABC superfamily of transport protein D-ribose high-affinity ABC transport system(1st module ATP-binding subunit)' ecoli1705 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'celB' 'PTS family sugar specific enzyme II for cellobiose arbutin and salicin' ecoli345 1,5,21 Cell processes Transport/binding proteins MFS family 'b0353' 'MFS family transport protein (2nd module function unknown)' ecoli394 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'b0402' 'APC family of transport protein proline permease transport protein' ecoli3059 1,5,21 Cell processes Transport/binding proteins MFS family 'yhaU' 'MFS family of transport protein (D)-glucarate or galactarate transporter (1st module)' ecoli2129 1,5,21 Cell processes Transport/binding proteins MFS family 'yeiO' 'MFS family proton-coupled sugar efflux pump transport selective monosaccharides and disaccharides narrower substr. specificity than SetA(2nd module)' ecoli4026 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'cadB' 'APC family transport of lysine/cadaverine(1st module)' ecoli2448 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'uraA' 'uracil transport' ecoli2899 1,5,21 Cell processes Transport/binding proteins MFS family 'nupG' 'MFS family of transport protein transport of nucleosides (2nd module)' ecoli3273 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'kefB' 'K+ efflux; NEM-activable K+/H+ antiporter' ecoli3379 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'livM' 'ABC superfamily (membrane) membrane component of high-affinity branched-chain amino acid ABC transport system (2nd module)' ecoli3022 1,5,10 Cell processes Transport/binding proteins DAACS family 'ygjU' 'DAACS family Na+/serine (threonine) symporter' ecoli3011 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'ygjI' 'APC family paral putative amino-acid transport protein' ecoli2035 1,5,33 Cell processes Transport/binding proteins RNDfamily 'b2076' 'RND family of transport protein paral putative outer membrane receptor' ecoli3630 1,5,43 Cell processes Transport/binding proteins ArAAP family 'tnaB' 'ArAAP family low affinity tryptophan permease' ecoli3668 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'kup' 'low affinity potassium transport system' ecoli4121 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yjfF' 'ABC superfamily (membrane) ABC superfamily of transport protein (1st module membrance component)' ecoli393 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'brnQ' 'branched chain; mutants valine and o-methylthreonine resistant glyclyvaline sensitive; transport system I for Ile Leu and Val' ecoli833 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'potI' 'ABC superfamily (membrane) membrane component of putrescine ABC transport system(2nd module)' ecoli1570 1,5,34 Cell processes Transport/binding proteins SMR family 'b1600' 'SMR family of transport protein' ecoli1414 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1443' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli388 1,5,21 Cell processes Transport/binding proteins MFS family 'araJ' 'MFS family of transport protein involved in either transport or processing of arabinose polymers (2nd module function unknown)' ecoli1098 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'potB' 'ABC superfamily (membrane) membrane component of spermidine/putrescine ABC transport system(2nd module)' ecoli4178 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'fecC' 'ABC superfamily (membrane) citrate-dependent iron(III) transport protein (2nd module)' ecoli1463 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'xasA' 'APC family acid sensitivity protein putative glutamate:gamma-aminobutyric acid antiporter (GadC)(2nd module)' ecoli2343 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'b2386' 'Sugar Specific paral putative membrane component of transport system' ecoli2138 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yejE' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli469 1,5,9 Cell processes Transport/binding proteins CPA2 family 'ybaL' 'CPA2 family transport protein' ecoli3287 1,5,21 Cell processes Transport/binding proteins MFS family 'yhfC' 'MFS family of transport protein paral putative transport protein' ecoli2375 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'cysU' 'ABC superfamily (membrane) membrane component of sulfate thiosulfate ABC transport system (2nd module)' ecoli874 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'b0899' 'APC family paral putative amino-acid transport protein' ecoli40 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'caiT' 'probable carnitine transporter' ecoli2878 1,5,21 Cell processes Transport/binding proteins MFS family 'galP' 'MFS family of transport protein galactose-proton symport of transport system (2nd module)' ecoli2005 - 1,2,1 Cell processes Chromosome replication Chromosome replication 'b2046' 'probable export protein /export to periplasm in colanic acid gene cluster' ecoli2741 1,5,21 Cell processes Transport/binding proteins MFS family 'fucP' 'MFS family of transport protein fucose permease(1st module)' ecoli3711 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'yifK' 'APC family paral putative amino-acid transport protein' ecoli1283 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1312' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli2181 - 3,4,4 Metabolism of small molecules Degradation of small molecules Fatty acids 'atoB' 'short chain fatty acids transporter' ecoli3934 - 3,2,8 Metabolism of small molecules Biosynthesis of cofactors, carriers Menaquinone, ubiquinone 'ubiA' 'p-hydroxybenzoate: octaprenyltransferase(1st module)' ecoli3469 - 1,4,3 Cell processes Protection responses Drug/analog sensitivity 'yhjX' 'putative resistance protein' ecoli19 1,5,27 Cell processes Transport/binding proteins NhaA family 'nhaA' 'NhaA family of transport protein Na+/H antiporter pH dependent(1st module)' ecoli3583 1,5,21 Cell processes Transport/binding proteins MFS family 'yicM' 'MFS family of tranport protein (1st mdule)' ecoli3855 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'frwC' 'PTS system fructose-like IIC component first module overlaps second(2nd module)' ecoli1215 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'oppB' 'ABC superfamily (membrane) membrane component of oligopeptide ABC transport system(2nd module)' ecoli1440 1,5,21 Cell processes Transport/binding proteins MFS family 'narU' 'MFS family of transport protein nitrate sensor-transmitter protein anaerobic respiratory path(1st module)' ecoli2107 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'mglC' 'ABC superfamily (membrane) membrane component of methyl-galactoside ABC transport system and galactose taxis(1st module)' ecoli1679 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'btuC' 'ABC superfamily (membrane) membrane component of vitamin B12 ABC transport system' ecoli440 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'mdlA' 'ATP-binding component of a transport system (2nd module)' ecoli477 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'b0486' 'APC family of transport protein amino-acid transport protein' ecoli851 1,5,22 Cell processes Transport/binding proteins MIP family 'aqpZ' 'MIP family transmembrane water channel; aquaporin Z' ecoli4185 1,5,19 Cell processes Transport/binding proteins GntP family 'yjhF' 'GntP family of transport protein (1st module)' ecoli359 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0367' 'ABC superfamily (membrane) membrane component of taurine ABC transport system' Training Accuracy: 79/85 (92.94%) Training Frequency class 'Transport/binding proteins': 187/939 (19.91%) Training Significance: dev(16.86) ; prob(5.072241E-48) Evaluation on validation data (471 items): ecoli4014 1,5,16 Cell processes Transport/binding proteins GPH family 'melB' 'GPH family melibiose permease II' ecoli2868 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'cmtA' 'PTS family mannitol-specific enzyme II component cryptic' ecoli837 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'artM' 'arginine 3rd transport system permease protein' ecoli1883 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yecC' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli3521 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'mtlA' 'Sugar Specific PTS family mannitol-specific enzyme IIABC components (3rd module eii a domain phosphoryl by p-hpr 491-637)' ecoli419 1,5,21 Cell processes Transport/binding proteins MFS family 'b0427' 'MFS family transport protein' ecoli1216 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'oppC' 'ABC superfamily (membrane)homolog of Salmonella oligopeptide transport permease protein(2nd module)' ecoli3416 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'pitA' 'low-affinity phosphate transport' ecoli3926 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'malG' 'ABC superfamily (membrane) membrane component of maltose ABC transport system (2nd module)' ecoli3375 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'ugpA' 'ABC superfamily (membrane) sn-glycerol 3-phosphate integral membrane protein ABC transport system' ecoli2778 1,5,21 Cell processes Transport/binding proteins MFS family 'araE' 'MFS family of transport protein low-affinity L-arabinose transport system proton symport protein(1st module)' ecoli4098 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'cycA' 'APC family transport of D-alanine D-serine and glycine (2nd module)' ecoli2623 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'proW' 'ABC superfamily (membrane) membrane component of high-affinity ABC transport system for glycine betaine and proline (2nd module)' ecoli1196 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'narK' 'nitrite extrusion protein(2nd module)' ecoli2089 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yehY' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli1627 1,5,21 Cell processes Transport/binding proteins MFS family 'b1657' 'MFS family of transport protein (2nd module)' ecoli2772 - 1,4,3 Cell processes Protection responses Drug/analog sensitivity 'ygeD' 'putative resistance proteins' ecoli3196 1,5,33 Cell processes Transport/binding proteins RNDfamily 'acrF' 'RND family of transport protein acriflavin resistance protein F multidrug efflux (?encodes lipoprotein with signal peptide; osmotcially remedial envelope defect)' ecoli1630 1,5,21 Cell processes Transport/binding proteins MFS family 'ydhC' 'MFS family transport protein (2nd module)' ecoli70 1,5,21 Cell processes Transport/binding proteins MFS family 'yabM' 'MFS family of transport protein proton-coupled beta-galactosidase/sugar efflux pump ? role in lactose metabolism (2nd module)' ecoli4130 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'treB' 'PTS family enzyme II trehalose specific (maltose may be transported)' ecoli2158 - 1,2,1 Cell processes Chromosome replication Chromosome replication 'ccmB' 'heme exporter protein B cytochrome c-type biogenesis protein' ecoli4168 1,5,21 Cell processes Transport/binding proteins MFS family 'yjhB' 'MFS family of tranport protein (1st module)' ecoli1973 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'yeeF' 'APC family paral putative amino-acid transport protein' ecoli3938 1,5,20 Cell processes Transport/binding proteins MATE family 'dinF' 'MATE family of transport protein; also DNA-damage-inducible protein F;induced by UV and mitomycin C; SOS lexA regulon(2nd module)' ecoli1282 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1311' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli1796 1,5,21 Cell processes Transport/binding proteins MFS family 'b1828' 'MFS family of transport protein (2nd module)' ecoli1424 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b1453' 'L-asparagine permease (2nd module)' ecoli3659 1,5,24 Cell processes Transport/binding proteins Membrane-bound ATP synthase 'atpB' 'membrane-bound ATP synthase F0 sector subunit a' ecoli3490 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'xylH' 'ABC superfamily (membrane)d-xylose transport permease (2nd module might interact with atp hydrolysing subunit )' ecoli3465 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'dppC' 'ABC superfamily (membrane) membrane component of dipeptide ABC transport system; permease protein 2 (2nd module)' ecoli2782 1,5,36 Cell processes Transport/binding proteins STP family 'b2845' 'STP family of transport protein' ecoli3223 - 1,2,1 Cell processes Chromosome replication Chromosome replication 'prlA' 'protein secretion inner membrane preprotein translocase SecY subunit interacts with SecE (1st module)' ecoli3925 1,5,21 Cell processes Transport/binding proteins MFS family 'xylE' 'MFS family of tranport protein xylose-proton symport (2nd module)' ecoli3612 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'yidT' 'D-galactonate transport' ecoli2909 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b2975' 'LctP transporter L-lactate permease homologue' ecoli3588 1,5,21 Cell processes Transport/binding proteins MFS family 'uhpC' 'regulator of uhpT (1st module)' ecoli1867 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b1899' 'split high-affinity L-arabinose transport system; membrane protein fragment 1' ecoli1996 - 4,1,4 Structural elements Cell envelop Surface polysaccharides & antigens 'rfbX' 'hydroponic protein o-antigen (3rd module)' ecoli4005 1,5,21 Cell processes Transport/binding proteins MFS family 'proP' 'MFS family of tranport protein low-affinity constitutive transport system; proline permease II transports proline and betaine under conditions of hyperosmolarity(2nd module)' ecoli1591 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'malX' 'Sugar Specific PTS family maltose and glucose-specific ii abc (2nd module hydrophilic second phosphorylation domain)' ecoli3631 1,5,21 Cell processes Transport/binding proteins MFS family 'yidY' 'MFS family of tranport protein (1st mdule)' ecoli1877 - 2,2,7 Macromolecule metabolism Macromolecule synthesis, modification Phospholipids 'pgsA' 'phosphatidylglycerophosphate synthetase = CDP-12-diacyl-sn-glycero-3-phosphate phosphatidyl transferase' ecoli3093 1,5,43 Cell processes Transport/binding proteins ArAAP family 'mtr' 'ArAAP family tryptophan-specific transport protein' ecoli2280 1,5,21 Cell processes Transport/binding proteins MFS family 'b2322' 'MFS family of transport protein paral putative (2nd module)' ecoli2380 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'b2429' 'Sugar Specific paral putative PTS system enzyme II' ecoli4031 1,5,12 Cell processes Transport/binding proteins Dcu family 'dcuA' 'Dcu family anaerobic dicarboxylate transport' ecoli2818 1,5,13 Cell processes Transport/binding proteins DcuC 'b2882' 'DcuC family paral putative transport protein' ecoli1189 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'chaA' 'sodium-calcium/proton antiporter' ecoli692 1,5,31 Cell processes Transport/binding proteins POT family 'b0709' 'POT family of transport protein (1st module)' ecoli3451 1,5,10 Cell processes Transport/binding proteins DAACS family 'dctA' 'DAACS family of transport protein uptake of C4-dicarboxylic acids' ecoli2824 1,5,26 Cell processes Transport/binding proteins NCS2 family 'b2888' 'NCS2 family paral putative transport protein' ecoli909 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0934' 'ABC superfamily (membrane) probable membrane component of transport system' ecoli1737 1,5,21 Cell processes Transport/binding proteins MFS family 'ydjE' 'MFS family of transport protein (1st module)' Validation Accuracy: 49/54 (90.74%) Validation Frequency class 'Transport/binding proteins': 104/471 (22.08%) Validation Significance: dev(12.16) ; prob(6.529304E-27) ------------------ Rule 41: (40/1, lift 4.8) [hom( A ),classification( A ,rhizobium)] = 1 [hom( A ),e_val_lteq( A ,3e-06),classification( A ,solanum)] = 0 [hom( A ),species( A ,mycoplasma_hyorhinis),mol_wt_lteq( A ,77359)] = 1 -> class 'Transport/binding proteins' [0.952] Evaluation on test data (712 items): ecoli3646 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'pstB' 'ABC superfamily (atp_bind) ATP-binding component of high-affinity phosphate-specific ABC transport system' ecoli924 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0949' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli151 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'fhuC' 'ABC superfamily (atp_bind) ATP-binding component of hydroxymate-dependent iron transport' ecoli3927 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'malF' 'ABC superfamily (membrane) membrane component of maltose ABC transport system (2nd module)' ecoli889 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'msbA' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli2374 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'cysW' 'ABC superfamily (membrane) membrane component of sulfate ABC transport system; permease W protein' ecoli3275 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yheS' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli805 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0829' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli785 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'glnQ' 'ABC superfamily (atp_bind) ATP-binding component of glutamine high-affinity ABC transport system(2nd module)' ecoli581 1,5,31 Cell processes Transport/binding proteins POT family 'ybdA' 'paral putative POT family of transport protein (1st module)' ecoli4176 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'fecE' 'ABC superfamily (atp_bind) ATP-binding component of citrate-dependent iron(III) transport protein' ecoli1467 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yddA' 'ABC superfamily (atp_bind) paral putative ATP-binding module (2nd module)' ecoli3200 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yhdY' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli2622 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'proV' 'ABC superfamily (atp_bind) ATP-binding component of transport system for glycine betaine and proline(1st module)' ecoli3125 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b3195' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli662 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'nagE' 'Sugar Specific PTS family n-acetylglucosamine-specific enzyme IIABC (3rd module)' ecoli796 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0820' 'paral putative ATP-binding component of transport system' ecoli644 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'gltJ' 'ABC superfamily (membrane) glutamate/aspartate transport system permease' ecoli3998 1,5,29 Cell processes Transport/binding proteins Outer membrane channel 'phnE' 'membrane channel protein component of Pn transporter' ecoli602 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'b0612' 'citrate carrier transport citrate trading for succinate export' ecoli486 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ybbA' 'ABC superfamily (atp_bind) putative' ecoli2198 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'glpT' 'sn-glycerol-3-phosphate permease' ecoli566 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'pheP' 'phenylalanine-specific transport system' ecoli1249 - 2,2,7 Macromolecule metabolism Macromolecule synthesis, modification Phospholipids 'pgpB' 'non-essential phosphatidylglycerophosphate phosphatase membrane bound' ecoli3201 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yhdZ' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli1020 - 1,6,2 Cell processes Adaptation Osmotic adaptation 'b1047' 'membrane protein required for succinyl substitution of glucan backbone of OPG (osmoregulated periplasmic glucan) possible succinyl transferase' ecoli2264 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'hisP' 'ABC superfamily (atp_bind) ATP-binding component of histidine ABC transport system' ecoli578 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'fepC' 'ABC superfamily (atp_bind) ATP-binding component of ferric enterobactin transport(2nd module)' ecoli786 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'glnP' 'glutamine high-affinity transport system; membrane component(1st module)' ecoli1261 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'sapF' 'ABC superfamily (atp_bind) ATP-binding protein of peptide ABC transport system(2nd module)' ecoli358 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0366' 'ABC superfamily (atp_bind) ATP-binding component of a taurine transport system' ecoli2266 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'hisQ' 'ABC superfamily (membrane)histidine transport system' ecoli3825 - 3,5,5 Metabolism of small molecules Energy metabolism, carbon Glycolysis 'tpiA' 'triosephosphate isomerase' ecoli3981 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yjcW' 'ABC superfamily (atp_bind) ATP-binding component of allose transport system (2nd module)' ecoli198 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yaeE' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli4277 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yjjK' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system (2nd module)' ecoli1826 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1858' 'ABC superfamily (atp_bind) ATP-binding component of a high affinity Zn transport system(1st module)' ecoli840 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'artP' 'ABC superfamily (atp&memb) ATP-binding component of 3rd arginine transport system(2nd module)' ecoli2373 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'cysA' 'ABC superfamily (atp_bind) ATP-binding component of sulfate permease A protein of ABC transport; chromate resistance (1st module)' ecoli3131 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yhbG' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli127 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yadG' 'ABC superfamily (atp_bind) ATP-binding component of transport protein (1st module)' ecoli1455 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1484' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli3952 - 2,2,3 Macromolecule metabolism Macromolecule synthesis, modification DNA - replication, repair, restraction/modification 'uvrA' 'excision nuclease subunit (3rd module prob. DNA binding)' ecoli770 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0794' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli254 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yagC' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' Test Accuracy: 40/45 (88.89%) Test Frequency class 'Transport/binding proteins': 151/712 (21.21%) Test Significance: dev(11.11) ; prob(4.402152E-22) Application to new data (2167 items): ecoli3028 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b3095' 'orf' ecoli1803 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1835' 'paral putative rRNA methyltransferase (2nd module)' ecoli3248 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yheF' 'orf (2nd module)' ecoli1611 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1641' 'orf' ecoli2088 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yehX' 'paral putative ATP-binding component of transport system' ecoli103 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yacE' 'putative DNA repair protein' ecoli1484 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1513' 'paral putative ATP-binding component of transport system (2nd module)' ecoli2087 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yehW' 'paral putative membrane component of transport system' ecoli3453 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yhjL' 'paral putative reductase' ecoli2863 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yggC' 'paral putative kinase' ecoli2498 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2547' 'paral putative ATP-binding component of transport system' ecoli1454 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1483' 'paral putative ATP-binding component of transport system' ecoli2308 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2351' 'orf' ecoli794 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0818' 'orf' ecoli255 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0263' 'paral putative membrane component of transport system' ecoli1442 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1471' 'putative glycoportein' ecoli2212 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2254' 'orf(1st module)' Frequency rule on new data: 17/2167 (0.78%) Evaluation on training data (939 items): ecoli855 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0879' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli740 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'modB' 'ABC superfamily (membrane) membrane component of molybdate ABC transport system (2nd module)' ecoli1099 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'potA' 'ABC superfamily (atp_bind) ATP-binding component of spermidine/putrescine ABC transport system (1st module)' ecoli3489 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'xylG' 'ABC superfamily (atp_bind) ATP-binding component of D-xylose ABC transport system(2nd module)' ecoli1456 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1485' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli2265 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'hisM' 'ABC superfamily (membrane)histidine transport membrane protein m (2nd module transport function )' ecoli3991 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'phnK' 'ABC superfamily (atp_bind) ATP-binding component of phosphonate ABC transportbelieved to be part of carbon-phosphorus (C-P) lyase in phosphonate metabolism' ecoli3386 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ftsE' 'ABC superfamily (atp_bind) ATP-binding component of a membrane-associated complex involved in cell division' ecoli66 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yabJ' 'ABC superfamily (atp_bind) ATP-binding component of thiamine ABC transport system' ecoli3403 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'nikE' 'ABC superfamily (stp_bind) ATP-binding component of nickel ABC transport system probably couples energy to transport system' ecoli1650 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1682' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli3782 1,5,16 Cell processes Transport/binding proteins GPH family 'b3876' 'GPH family paral putative transport protein (2nd module)' ecoli3373 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ugpC' 'ABC superfamily (atp_bind) ATP-binding component of sn-glycerol 3-phosphate ABC transport system (1st module)' ecoli3670 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'rbsA' 'ABC superfamily (atp_bind) ATP-binding component of d-ribose high-affinity transport system (2nd module)' ecoli1091 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1118' 'ABC superfamily (membrane) paral putative membrane component of ABC transport system' ecoli1090 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1117' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli675 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'potE' 'APC family putrescine-lyase antiporter' ecoli1348 1,5,29 Cell processes Transport/binding proteins Outer membrane channel 'b1377' 'outer membrane protein n non-specific porin (2nd module)' ecoli643 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'gltK' 'ABC superfamily (membrane) glutamate/aspartate transport (1st module)' ecoli3199 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yhdX' 'ABC superfamily (membrane)paral putative membrane component of transport system' ecoli908 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ycbE' 'paral putative ATP-binding component of transport system' ecoli1262 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'sapD' 'ABC superfamily (atp_bind) ATP-binding protein of peptide transport system(2nd module) affects potassium transport;' ecoli3463 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'dppF' 'ABC superfamily (atp_bind) ATP-binding component of a dipeptide transport system(1st module)' ecoli2139 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yejF' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli861 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'cydC' 'ABC superfamily (atp&memb) ATP-binding and membrane components of cytochrome-related ABC transport(2nd module)' ecoli3378 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'livG' 'ABC superfamily (atp_bind) ATP-binding component of high-affinity branched-chain amino acid ABC transport system' ecoli3648 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'pstC' 'ABC superfamily (membrane) membrane component of high-affinity phosphate-specific ABC transport system (2nd module)' ecoli1218 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'oppF' 'ABC superfamily (atp_bind) ATP-binding protein of oligopeptide ABC transport system' ecoli862 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'cydD' 'ABC superfamily (atp&memb) ATP-binding and membrane components of cytochrome-related ABC transport Zn sensitive(2nd module)' ecoli3401 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'nikC' 'ABC superfamily (membrane) membrane component in nickel transport system probably forms heterodimeric pore with NikB(1st module)' ecoli2159 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ccmA' 'ABC superfamily (atp_bind) ATP-binding component of heme exporter A heme exporter protein A cytochrome c-type biogenesis protein' ecoli2108 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'mglA' 'ABC superfamily (atp_bind) ATP-binding component of methyl-galactoside transport and galactose taxis (2nd module)' ecoli3402 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'nikD' 'ABC superfamily (atp_bind) ATP-binding component of nickel ABC transport system probably couples energy to transport system(2nd module)' ecoli1413 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1442' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli1882 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1917' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli481 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0490' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli736 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'modF' 'ABC superfamily (atp_bind) ATP-binding component of molybdenum transport system (2nd module)' ecoli543 - 5,1,2 Extrachromosomal Laterally acquirred elements Phage-related functions and prophages 'nmpC' 'outer membrane porin protein; at locus of qsr prophage' ecoli440 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'mdlA' 'ATP-binding component of a transport system (2nd module)' ecoli359 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0367' 'ABC superfamily (membrane) membrane component of taurine ABC transport system' Training Accuracy: 39/40 (97.50%) Training Frequency class 'Transport/binding proteins': 187/939 (19.91%) Training Significance: dev(12.29) ; prob(1.500266E-26) Evaluation on validation data (471 items): ecoli3929 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'malK' 'ABC superfamily (atp_bind) ATP-binding component of transport system for maltose phenotypic repressor of mal operon(1st module)' ecoli441 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'mdlB' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli4119 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ytfS' 'ABC superfamily (atp_bind) putative ATP-binding component of a transport system' ecoli3990 - 3,3,13 Metabolism of small molecules Central intermediary metabolism Phosphorus compounds 'phnL' 'ABC superfamily (atp_bind) ATP-binding component of phosphonate ABC transport believed to be part of carbon-phosphorus (C-P) lyase in phosphonate metabolism' ecoli1216 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'oppC' 'ABC superfamily (membrane)homolog of Salmonella oligopeptide transport permease protein(2nd module)' ecoli1289 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1318' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli4168 1,5,21 Cell processes Transport/binding proteins MFS family 'yjhB' 'MFS family of tranport protein (1st module)' ecoli3464 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'dppD' 'ABC superfamily (atp_bind) ATP-binding component of dipeptide tABCransport system(2nd module)' ecoli838 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'artQ' 'ABC superfamily (membrane) membrane component of 3rd arginine ABC transport system' ecoli642 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'gltL' 'ABC superfamily (atp_bind) ATP-binding protein of glutamate/aspartate transport system' ecoli3377 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'livF' 'ABC superfamily (atp_bind) ATP-binding component of leucine ABC transport system' ecoli1677 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'btuD' 'ABC superfamily (atp_bind) ATP-binding component of vitamin B12 ABC transport system(2nd module)' ecoli932 - 4,1,3 Structural elements Cell envelop Outer membrane constituents 'ompA' 'outer membrane protein 3a (II*;G;d)(2nd module)' ecoli199 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'abc' 'ABC superfamily (atp_bind) ATP-binding component of ABC transport system(1st module)' ecoli1868 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'araG' 'ABC superfamily (atp_bind) ATP-binding component of high-affinity l-arabinose transport system (2nd module)' ecoli741 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'modC' 'ABC superfamily (atp_bind) ATP-binding component of molybdate ABC transport (1st module)' ecoli1724 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1756' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli831 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'potG' 'ABC superfamily (atp_bind) ATP-binding component of putrescine ABC transport system(1st module)' ecoli4000 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'phnC' 'ABC superfamily (atp_bind) ATP-binding component of phosphonate ABC transport system' ecoli1412 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1441' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli2169 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yojI' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli1217 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'oppD' 'ABC superfamily (atp_bind) ATP-binding protein of oligopeptide ABC transport system' ecoli832 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'potH' 'ABC superfamily (membrane) membrane component of putrescine ABC transport system(2nd module)' ecoli692 1,5,31 Cell processes Transport/binding proteins POT family 'b0709' 'POT family of transport protein (1st module)' ecoli234 - 4,1,3 Structural elements Cell envelop Outer membrane constituents 'phoE' 'outer membrane pore protein e (eicnmpab) (2nd module)' ecoli909 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0934' 'ABC superfamily (membrane) probable membrane component of transport system' ecoli1737 1,5,21 Cell processes Transport/binding proteins MFS family 'ydjE' 'MFS family of transport protein (1st module)' Validation Accuracy: 24/27 (88.89%) Validation Frequency class 'Transport/binding proteins': 104/471 (22.08%) Validation Significance: dev(8.37) ; prob(2.496662E-13) ------------------ Rule 39: (14, lift 38.3) [hom( A ),e_val_lteq( A ,3e-06),classification( A ,pulmonata)] = 0 [hom( A ),psi_iter_lteq( A ,7),classification( A ,salmonella)] = 0 [hom( A ),psi_iter_lteq( A ,5),classification( A ,actinobacteria)] = 1 [hom( A ),mol_wt_lteq( A ,32892),classification( A ,stramenopiles)] = 1 -> class 'Ribosome constituents' [0.938] Evaluation on test data (712 items): ecoli3228 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplF' '50S ribosomal subunit protein L6' ecoli3243 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplC' '50S ribosomal subunit protein L3' ecoli3238 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplV' '50S ribosomal subunit protein L22' ecoli4092 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsR' '30S ribosomal subunit protein S18' ecoli3115 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmA' '50S ribosomal subunit protein L27' ecoli3882 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplA' '50S ribosomal subunit protein L1 regulates synthesis of L1 and L11' ecoli3656 - 3,4,0 Metabolism of small molecules Degradation of small molecules ATP-proton motive force interconversion 'atpH' 'membrane-bound ATP synthase F1 sector delta-subunit' ecoli4086 - 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'yjfV' 'probable hexulose-6-phosphate synthase' ecoli3241 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplW' '50S ribosomal subunit protein L23' ecoli3842 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmE' '50S ribosomal subunit protein L31' ecoli169 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsB' '30S ribosomal subunit protein S2' ecoli3229 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsH' '30S ribosomal subunit protein S8 and regulator' ecoli1623 - 2,2,3 Macromolecule metabolism Macromolecule synthesis, modification DNA - replication, repair, restraction/modification 'lhr' 'member of ATP-dependent helicase superfamily II (2nd module)' ecoli3652 - 1,5,24 Cell processes Transport/binding proteins Membrane-bound ATP synthase 'atpC' 'membrane-bound ATP synthase F1 sector epsilon-subunit' ecoli3970 - 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'nrfG' 'part of formate-dependent nitrite reductase complex' ecoli3244 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsJ' '30S ribosomal subunit protein S10' Test Accuracy: 11/16 (68.75%) Test Frequency class 'Ribosome constituents': 24/712 (3.37%) Test Significance: dev(14.49) ; prob(2.383432E-13) Application to new data (2167 items): ecoli1625 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1655' 'orf' ecoli3261 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yheB' 'putative enzyme' ecoli1518 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1547' 'orf' ecoli3153 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b3223' 'ManNAc-6P epimerase' ecoli322 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0330' 'regulator for prp operon(2nd module)' ecoli961 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0987' 'orf' ecoli288 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0296' 'putative ribosomal protein' Frequency rule on new data: 7/2167 (0.32%) Evaluation on training data (939 items): ecoli4090 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsF' '30S ribosomal subunit protein S6' ecoli3881 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplK' '50S ribosomal subunit protein L11' ecoli3220 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsK' '30S ribosomal subunit protein S11' ecoli3240 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplB' '50S ribosomal subunit protein L2' ecoli1684 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplT' '50S ribosomal subunit protein L20 and regulator' ecoli3234 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsQ' '30S ribosomal subunit protein S17' ecoli3230 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsN' '30S ribosomal subunit protein S14' ecoli3558 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmG' '50S ribosomal subunit protein L33' ecoli3235 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmC' '50S ribosomal subunit protein L29' ecoli3624 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmH' '50S ribosomal subunit protein L34' ecoli3265 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsL' '30S ribosomal subunit protein S12' ecoli3161 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplM' '50S ribosomal subunit protein L13' ecoli3222 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpmJ' '50S ribosomal subunit protein X' ecoli3233 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplN' '50S ribosomal subunit protein L14' Training Accuracy: 14/14 (100.00%) Training Frequency class 'Ribosome constituents': 23/939 (2.45%) Training Significance: dev(23.61) ; prob(2.798153E-23) Evaluation on validation data (471 items): ecoli3242 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplD' '50S ribosomal subunit protein L4 regulates expression of S10 operon' ecoli1289 - 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1318' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli3237 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsC' '30S ribosomal subunit protein S3' ecoli3226 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsE' '30S ribosomal subunit protein S5' ecoli3239 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsS' '30S ribosomal subunit protein S19' ecoli3236 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rplP' '50S ribosomal subunit protein L16' ecoli3160 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsI' '30S ribosomal subunit protein S9' ecoli3221 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsM' '30S ribosomal subunit protein S13' Validation Accuracy: 7/8 (87.50%) Validation Frequency class 'Ribosome constituents': 15/471 (3.18%) Validation Significance: dev(13.58) ; prob(2.573513E-10) ------------------ Rule 43: (123/10, lift 4.6) [hom( A ),classification( A ,salmonella)] = 1 [hom( A ),species( A ,paracoccus_denitrificans)] = 1 [hom( A ),e_val_gt( A ,0.0006),mol_wt_gt( A ,55220)] = 1 [hom( A ),e_val_gt( A ,2e-37),classification( A ,rhodobacter)] = 1 [hom( A ),mol_wt_lteq( A ,55220),classification( A ,streptococcaceae)] = 1 [hom( A ),keyword( A ,inner_membrane),classification( A ,epsilon_subdivision)] = 1 -> class 'Transport/binding proteins' [0.912] Evaluation on test data (712 items): ecoli3646 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'pstB' 'ABC superfamily (atp_bind) ATP-binding component of high-affinity phosphate-specific ABC transport system' ecoli924 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0949' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli335 1,5,21 Cell processes Transport/binding proteins MFS family 'lacY' 'MFS family of transport protein galactoside permease (M protein)(1st module)' ecoli151 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'fhuC' 'ABC superfamily (atp_bind) ATP-binding component of hydroxymate-dependent iron transport' ecoli3338 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'gntT' 'high-affinity gluconate permease in GNT-I system' ecoli3967 - 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'nrfD' 'putative nitrate reductase formate dependent also paral putative STP family of transport protein' ecoli2711 1,5,21 Cell processes Transport/binding proteins MFS family 'b2771' 'MFS family of transport protein (3rd module (function unknown)' ecoli2240 - 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'nuoH' 'NADH dehydrogenase I chain H' ecoli1485 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1514' 'ABC superfamily (membrane)paral putative membrane component of ABC transport system' ecoli889 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'msbA' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli1307 1,5,43 Cell processes Transport/binding proteins ArAAP family 'ydaH' 'ArAAP family p-aminobenzoyl-glutamate utilization paral putative pump protein (transport)(1st module)' ecoli470 1,5,21 Cell processes Transport/binding proteins MFS family 'fsr' 'MFS family of transport protein fosmidomycin resistance protein(2nd module)' ecoli3275 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yheS' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli746 1,5,11 Cell processes Transport/binding proteins DASS family 'b0770' 'DASS family of transport protein' ecoli3374 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'ugpE' 'ABC superfamily (membrane)sn-glycerol 3-phosphate transport system integral membrane protein(1st module)' ecoli175 - 3,6,1 Metabolism of small molecules Fatty acid biosynthesis Fatty acid and phosphatidic acid biosynth 'cdsA' 'CDP-diglyceride synthase(2nd module)' ecoli112 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'aroP' 'APC family of transport protein aromatic amino acid transport protein' ecoli4210 1,5,19 Cell processes Transport/binding proteins GntP family 'gntP' 'GntP family of transport protein high affinity gluconate transporter/gluconate permease in gnt-iii system (2nd module)' ecoli818 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b0842' 'transmembrane multidrug/chloramphenicol efflux transporter (2nd module)' ecoli4120 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'ytfT' 'ABC superfamily (membrane)paral putative membrane component' ecoli333 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'cynX' 'cyanate transport' ecoli1499 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'ydeA' 'ABC superfamily (membrane)putative membrane component of ABC transport system appears to facilitate arabinose export contributes to control of arabinose regulon' ecoli785 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'glnQ' 'ABC superfamily (atp_bind) ATP-binding component of glutamine high-affinity ABC transport system(2nd module)' ecoli3462 1,5,36 Cell processes Transport/binding proteins STP family 'yhjV' 'STP family of transport protein' ecoli47 1,5,9 Cell processes Transport/binding proteins CPA2 family 'kefC' 'CPA2 family k+ efflux antiporter glutathione-regulated (2nd module)' ecoli3409 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b3486' 'ABC superfamily (membrance) paral putative membrane component of transport system (3rd module)' ecoli2921 1,5,32 Cell processes Transport/binding proteins PiT family 'pitB' 'PiT family low-affinity phosphate transport(1st module)' ecoli3525 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'lldP' 'L-lactate permease(1st module)' ecoli581 1,5,31 Cell processes Transport/binding proteins POT family 'ybdA' 'paral putative POT family of transport protein (1st module)' ecoli769 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0793' 'putative membrane component of ABC transport system(2nd module)' ecoli3980 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yjcV' 'ABC superfamily (membrane) membrane component of allose ABC transport system(1st module)' ecoli4176 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'fecE' 'ABC superfamily (atp_bind) ATP-binding component of citrate-dependent iron(III) transport protein' ecoli1467 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yddA' 'ABC superfamily (atp_bind) paral putative ATP-binding module (2nd module)' ecoli3200 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yhdY' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli252 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'ykfD' 'APC family of transport protein S-methylmethionine permease' ecoli2622 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'proV' 'ABC superfamily (atp_bind) ATP-binding component of transport system for glycine betaine and proline(1st module)' ecoli3961 1,5,35 Cell processes Transport/binding proteins SSS family 'yjcG' 'SSS family transport protein' ecoli3643 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'bglF' 'Sugar Specific PTS family beta-glucosides enzyme II cryptic (2nd module eiia (ei interaction)?)' ecoli1380 - 3,3,11 Metabolism of small molecules Central intermediary metabolism Nucleotide interconversions 'b1409' 'putative phosphatidate cytidiltransferase(2nd module)' ecoli3125 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b3195' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli2324 1,5,21 Cell processes Transport/binding proteins MFS family 'emrY' 'MFS family of transport protein multidrug resistance protein y (2nd module)' ecoli662 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'nagE' 'Sugar Specific PTS family n-acetylglucosamine-specific enzyme IIABC (3rd module)' ecoli796 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0820' 'paral putative ATP-binding component of transport system' ecoli3587 1,5,21 Cell processes Transport/binding proteins MFS family 'uhpT' 'MFS family of transport protein hexose phosphate transport protein (2nd module)' ecoli644 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'gltJ' 'ABC superfamily (membrane) glutamate/aspartate transport system permease' ecoli3293 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'yhfM' 'APC family paral putative amino-acid transport protein (2nd module)' ecoli3998 1,5,29 Cell processes Transport/binding proteins Outer membrane channel 'phnE' 'membrane channel protein component of Pn transporter' ecoli2655 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'ascF' 'PTS family enzyme II ABC (asc) cryptic transports specific beta-glucosides' ecoli4017 1,5,12 Cell processes Transport/binding proteins Dcu family 'dcuB' 'Dcu family anaerobic C4-dicarboxylate transporter' ecoli602 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'b0612' 'citrate carrier transport citrate trading for succinate export' ecoli3380 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'livH' 'ABC superfamily (membrane) membrane component of high-affinity branched-chain amino acid ABC transport system(2nd module)' ecoli2662 - 3,5,4 Metabolism of small molecules Energy metabolism, carbon Fermentation 'hycD' 'membrane-spanning protein of hydrogenase 3 (part of FHL complex)' ecoli486 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ybbA' 'ABC superfamily (atp_bind) putative' ecoli2538 1,5,21 Cell processes Transport/binding proteins MFS family 'kgtP' 'MFS family of transport protein alpha-ketoglutarate permease(1st module)' ecoli3580 1,5,21 Cell processes Transport/binding proteins MFS family 'yicK' 'MFS family of transport protein two-module paral putative transport protein (2nd module)' ecoli2198 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'glpT' 'sn-glycerol-3-phosphate permease' ecoli566 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'pheP' 'phenylalanine-specific transport system' ecoli3201 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yhdZ' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli3579 1,5,16 Cell processes Transport/binding proteins GPH family 'yicJ' 'GPH family paral putative transport protein' ecoli1827 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'yebI' 'inner memrane component of a high affininty Zn transport system' ecoli2264 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'hisP' 'ABC superfamily (atp_bind) ATP-binding component of histidine ABC transport system' ecoli578 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'fepC' 'ABC superfamily (atp_bind) ATP-binding component of ferric enterobactin transport(2nd module)' ecoli786 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'glnP' 'glutamine high-affinity transport system; membrane component(1st module)' ecoli3043 1,5,36 Cell processes Transport/binding proteins STP family 'yhaO' 'STP family of transport protein (1st module)' ecoli1164 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b1191' 'PUTATIVE NA(+)/H(+) EXCHANGER according to SwissProt version 38 the orf starts at a methionine 42 aa upstream of b 1191 start' ecoli1514 1,5,21 Cell processes Transport/binding proteins MFS family 'b1543' 'MFS family of transport protein (1st module)' ecoli3188 1,5,35 Cell processes Transport/binding proteins SSS family 'panF' 'SSS family transport protein sodium/pantothenate symporter(1st module)' ecoli1633 1,5,20 Cell processes Transport/binding proteins MATE family 'ydhE' 'MATE family of transport protein(2nd module)' ecoli2357 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'xapB' 'xanthosine permease' ecoli2421 - 1,4,3 Cell processes Protection responses Drug/analog sensitivity 'acrD' 'sensitivity to acriflavine integral membrane protein possible efflux pump' ecoli1261 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'sapF' 'ABC superfamily (atp_bind) ATP-binding protein of peptide ABC transport system(2nd module)' ecoli358 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0366' 'ABC superfamily (atp_bind) ATP-binding component of a taurine transport system' ecoli1097 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'potC' 'ABC superfamily (membrane) membrane component of spermidine/putrescine ABC transport system(2nd module)' ecoli3604 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'glvC' 'PTS family arbutin-like IIC component' ecoli67 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yabK' 'ABC superfamily (membrane) membrane component of thiamine ABC transport system(1st module)' ecoli3981 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yjcW' 'ABC superfamily (atp_bind) ATP-binding component of allose transport system (2nd module)' ecoli2497 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b2546' 'ABC superfamily (membrane)paral putative membrane component of ABC transport system (2nd module)' ecoli3594 1,5,21 Cell processes Transport/binding proteins MFS family 'emrD' 'MFS family of transport protein 2-module integral membrane pump; multidrug resistance (2nd module)' ecoli4277 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yjjK' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system (2nd module)' ecoli2034 1,5,33 Cell processes Transport/binding proteins RNDfamily 'b2075' 'RND family of transport protein paral putative outer membrane receptor' ecoli45 1,5,21 Cell processes Transport/binding proteins MFS family 'yaaU' 'MFS family transport protein' ecoli1826 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1858' 'ABC superfamily (atp_bind) ATP-binding component of a high affinity Zn transport system(1st module)' ecoli840 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'artP' 'ABC superfamily (atp&memb) ATP-binding component of 3rd arginine transport system(2nd module)' ecoli4155 1,5,19 Cell processes Transport/binding proteins GntP family 'yjgT' 'GntP family l-idonate transporter (2nd module)' ecoli2373 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'cysA' 'ABC superfamily (atp_bind) ATP-binding component of sulfate permease A protein of ABC transport; chromate resistance (1st module)' ecoli2137 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yejB' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli3026 1,5,21 Cell processes Transport/binding proteins MFS family 'exuT' 'MFS family of transport protein transport of hexuronates (2nd module)' ecoli1875 1,5,43 Cell processes Transport/binding proteins ArAAP family 'tyrP' 'ArAAP family tyrosine-specific transport system' ecoli328 1,5,25 Cell processes Transport/binding proteins NCS1 family 'codB' 'NCS1 family transport protein cytosine permease/transport(2nd module)' ecoli3971 1,5,10 Cell processes Transport/binding proteins DAACS family 'gltP' 'DAACS family of transport protein glutamate-aspartate symport protein' ecoli3131 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yhbG' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli768 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0792' 'ABC superfamily (membrane)putative membrane component of ABC transport system(2nd module)' ecoli1022 - 1,6,2 Cell processes Adaptation Osmotic adaptation 'mdoH' 'membrane glycosyltransferase; synthesis of membrane-derived oligosaccharide (MDO)/synthesis of OPGs (osmoregulated periplasmic glucans)(2nd module)' ecoli2141 1,5,21 Cell processes Transport/binding proteins MFS family 'bcr' 'MFS family of transport protein bicyclomycin resistance protein; transmembrane protein (2nd module)' ecoli127 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yadG' 'ABC superfamily (atp_bind) ATP-binding component of transport protein (1st module)' ecoli2487 1,5,21 Cell processes Transport/binding proteins MFS family 'b2536' 'MFS family of transport protein (1st module)' ecoli3959 1,5,8 Cell processes Transport/binding proteins CPA1 family 'yjcE' 'CPA1 family PUTATIVE NA(+)/H(+) EXCHANGER YJCE(1st module)' ecoli1455 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1484' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli420 - 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'cyoE' 'protohaeme IX farnesyltransferase (haeme O biosynthesis)' ecoli3952 - 2,2,3 Macromolecule metabolism Macromolecule synthesis, modification DNA - replication, repair, restraction/modification 'uvrA' 'excision nuclease subunit (3rd module prob. DNA binding)' ecoli770 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0794' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli807 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0831' 'paral putative membrane component of transport system' ecoli254 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yagC' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' Test Accuracy: 93/103 (90.29%) Test Frequency class 'Transport/binding proteins': 151/712 (21.21%) Test Significance: dev(17.15) ; prob(5.180516E-51) Application to new data (2167 items): ecoli1604 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1634' 'orf' ecoli3445 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhjD' 'orf' ecoli3170 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhcP' 'orf' ecoli487 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0496' 'putative oxidoreductase' ecoli2634 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2689' 'orf' ecoli821 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0845' 'paral putative transport protein (2nd module bind phosphorylated sugar? )' ecoli3606 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yidE' 'paral putative transport protein(1st module)' ecoli2215 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2257' 'orf(2nd module)' ecoli3581 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yicL' 'putative permease transporter' ecoli3028 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b3095' 'orf' ecoli1404 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1433' 'putative membrane transport protein' ecoli4009 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b4115' 'putative amino acid/amine transport protein cryptic' ecoli1575 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1605' 'putative arginine/ornithine antiporter' ecoli1562 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1592' 'orf' ecoli3468 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhjW' 'orf (2nd module)' ecoli2329 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2372' 'putative receptor protein' ecoli4034 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjeH' 'putative transport' ecoli3437 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yhiV' 'paral putative membrane component of transport system (3rd module)' ecoli3849 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yijE' 'orf (2nd module)' ecoli3780 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yihN' 'orf' ecoli1221 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'kch' 'putative potassium channel protein' ecoli155 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yadQ' 'putative channel transporter' ecoli2147 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yejM' 'putative sulfatase' ecoli2095 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yohD' 'orf' ecoli3419 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhiP' 'orf' ecoli2626 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2681' 'orf' ecoli3357 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b3434' 'orf' ecoli2821 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2885' 'orf' ecoli1766 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1798' 'paral putative transport protein' ecoli3739 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yigM' 'paral putative transport protein (2nd module)' ecoli980 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1006' 'putative transport protein(2nd module)' ecoli2943 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yghB' 'orf' ecoli2997 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ygjE' 'paral putative DASS family of transport protein' ecoli3585 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yicO' 'orf (2nd module)' ecoli2585 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2639' 'putative pump protein' ecoli1658 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1690' 'paral putative MFS family of transport protein' ecoli4100 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ytfF' 'orf (1st module)' ecoli1720 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1752' 'orf' ecoli3114 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhbE' 'orf' ecoli1923 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yedA' 'orf (1st module)' ecoli2558 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2611' 'orf' ecoli1042 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'mviN' 'putative virulence factor' ecoli1504 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ydeD' 'orf' ecoli4221 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjiJ' 'paral putative transport protein (2nd module)' ecoli2250 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2292' 'putative transport protein' ecoli3600 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yidK' 'putative cotransporter' ecoli3731 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'rarD' 'chloramphenicol resistance' ecoli945 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yccA' 'putative carrier/transport protein(2nd module)' ecoli1505 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ydeF' 'paral putative transport protein (1st module)' ecoli502 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0511' 'orf' ecoli1743 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1775' 'paral putative transport protein (1st module)' ecoli1718 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1750' 'orf' ecoli784 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0808' 'pral putative transport protein (2nd module)' ecoli3083 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yraQ' 'orf' ecoli2715 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2775' 'orf' ecoli2858 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yggA' 'orf' ecoli4083 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjfS' 'orf' ecoli789 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ybiF' 'orf' ecoli2088 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yehX' 'paral putative ATP-binding component of transport system' ecoli2305 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yfdC' 'putative transport' ecoli3576 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yicE' 'putative transport protein' ecoli103 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yacE' 'putative DNA repair protein' ecoli2346 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2389' 'orf' ecoli2204 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2246' 'putative transport protein' ecoli3126 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yrbG' 'orf' ecoli320 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0328' 'paral putative transport protein' ecoli2285 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yfcA' 'putative structural protein' ecoli3021 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ygjT' 'orf' ecoli1486 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1515' 'paral putative membrane component of ABC transport system' ecoli1571 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1601' 'paral putative transport protein (2nd module)' ecoli2863 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yggC' 'paral putative kinase' ecoli2349 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2392' 'putative transport system permease(1st module)' ecoli791 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0815' 'orf (2nd module)' ecoli1444 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yddG' 'orf' ecoli2498 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2547' 'paral putative ATP-binding component of transport system' ecoli7 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yaaJ' 'putative inner membrane transport protein' ecoli504 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0513' 'putative transport(2nd module)' ecoli4243 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjiY' 'putative carbon starvation protein' ecoli1943 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1981' 'shikimate and dehydroshikimate permease (2nd module)' ecoli1697 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1729' 'part of a kinase(1st module paral putative tdomain shared with transporter)' ecoli1454 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1483' 'paral putative ATP-binding component of transport system' ecoli2322 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'dsdX' 'transport system permease (serine?)' ecoli1753 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1785' 'orf(2nd module)' ecoli2275 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'dedA' 'orf' ecoli873 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ycaD' 'paral putative transport protein (1st module)' ecoli4245 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b4356' 'paral putative transport protein cryptic orf joins former yjiZ and yjjL' ecoli1038 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1065' 'orf' ecoli2361 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yfeH' 'putative cytochrome oxidase' ecoli1759 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1791' 'putative amino acid/amine transport protein (3rd module)' ecoli4049 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjeM' 'paral putative amino-acid transport protein' ecoli262 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yagG' 'paral putative transport protein' ecoli2729 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2789' 'paral putative membrane component of transport system (2nd module)' ecoli2256 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2298' 'putative S-transferase' ecoli794 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0818' 'orf' ecoli3958 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjcD' 'orf (2nd module)' Frequency rule on new data: 95/2167 (4.38%) Evaluation on training data (939 items): ecoli4029 - 3,3,19 Metabolism of small molecules Central intermediary metabolism Sulfur metabolism 'dsbD' 'thiol:disulfide interchange protein N-term.(1st module)' ecoli4024 1,5,31 Cell processes Transport/binding proteins POT family 'b4130' 'POT family of transport protein paral putative transport protein (3rd module)' ecoli2057 1,5,21 Cell processes Transport/binding proteins MFS family 'b2098' 'MFS family of transport protein (2nd module)' ecoli3359 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'gntU_1' 'split gene low-affinity gluconate transport permease protein in GNT-I system first part of fragment 1(1st module)' ecoli855 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b0879' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli3425 1,5,4 Cell processes Transport/binding proteins Ars family 'arsB' 'Ars family arsenical pump membrane protein(2nd module)' ecoli128 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yadH' 'ABC superfamily (membrane) paral putative ABC superfamily (membrane)' ecoli1099 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'potA' 'ABC superfamily (atp_bind) ATP-binding component of spermidine/putrescine ABC transport system (1st module)' ecoli3489 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'xylG' 'ABC superfamily (atp_bind) ATP-binding component of D-xylose ABC transport system(2nd module)' ecoli1723 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1755' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli3647 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'pstA' 'ABC superfamily (membrane) membrane component of high-affinity phosphate-specific ABC transport system (2nd module)' ecoli2265 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'hisM' 'ABC superfamily (membrane)histidine transport membrane protein m (2nd module transport function )' ecoli3991 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'phnK' 'ABC superfamily (atp_bind) ATP-binding component of phosphonate ABC transportbelieved to be part of carbon-phosphorus (C-P) lyase in phosphonate metabolism' ecoli87 - 4,1,2 Structural elements Cell envelop Murein sacculus, peptidoglycan 'mraY' 'phospho-N-acetylmuramoyl-pentapeptide transferase essential in cell wall growth' ecoli153 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'fhuB' 'ABC superfamily (membrane) split gene C-term module hydroxamate-dependent iron uptake (2nd module iron uptake )' ecoli1659 1,5,21 Cell processes Transport/binding proteins MFS family 'b1691' 'MFS family of transport protein' ecoli3396 1,5,21 Cell processes Transport/binding proteins MFS family 'yhhS' 'MFS family of transport protein (2nd module)' ecoli1566 1,5,21 Cell processes Transport/binding proteins MFS family 'b1596' 'MFS familty transport protein (2nd module)' ecoli3154 1,5,21 Cell processes Transport/binding proteins MFS family 'nanT' 'MFS family of transport protein sialic acid transporter cryptic in K12?(1st module)' ecoli3386 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ftsE' 'ABC superfamily (atp_bind) ATP-binding component of a membrane-associated complex involved in cell division' ecoli66 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yabJ' 'ABC superfamily (atp_bind) ATP-binding component of thiamine ABC transport system' ecoli2736 1,5,36 Cell processes Transport/binding proteins STP family 'sdaC' 'STP family of transport protein serine transporter' ecoli3403 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'nikE' 'ABC superfamily (stp_bind) ATP-binding component of nickel ABC transport system probably couples energy to transport system' ecoli3783 1,5,16 Cell processes Transport/binding proteins GPH family 'yihP' 'GPH family paral putative transport protein' ecoli1650 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1682' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli2680 1,5,19 Cell processes Transport/binding proteins GntP family 'b2740' 'GntP family of transport protein function unknown (3rd module)' ecoli3782 1,5,16 Cell processes Transport/binding proteins GPH family 'b3876' 'GPH family paral putative transport protein (2nd module)' ecoli1267 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'b1296' 'APC family paral putative amino-acid transport protein' ecoli2631 1,5,21 Cell processes Transport/binding proteins MFS family 'emrB' 'MFS family of transport protein multidrug resistance; probably membrane translocase(1st module)' ecoli3049 1,5,36 Cell processes Transport/binding proteins STP family 'tdcC' 'STP family of transport protein anaerobically inducible L-threonine/ L-serine permease' ecoli3373 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ugpC' 'ABC superfamily (atp_bind) ATP-binding component of sn-glycerol 3-phosphate ABC transport system (1st module)' ecoli1264 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'sapB' 'ABC superfamily (membrane) membrane component of peptide ABC transport system' ecoli3670 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'rbsA' 'ABC superfamily (atp_bind) ATP-binding component of d-ribose high-affinity transport system (2nd module)' ecoli1026 1,5,21 Cell processes Transport/binding proteins MFS family 'yceE' 'MFS family of transport protein (2nd module)' ecoli1586 1,5,16 Cell processes Transport/binding proteins GPH family 'uidB' 'GPH family glucuronide permease' ecoli425 1,5,21 Cell processes Transport/binding proteins MFS family 'ampG' 'MFS family of transport protein ampicillin resistance (1st module)' ecoli2115 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'lysP' 'APC family lysine-specific permease (2nd module)' ecoli1090 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1117' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli4226 1,5,21 Cell processes Transport/binding proteins MFS family 'yjiO' 'MFS family of transport protein (1st module)' ecoli1179 1,5,38 Cell processes Transport/binding proteins SulP family 'ychM' 'SulP family transport protein (1st module)' ecoli2036 1,5,21 Cell processes Transport/binding proteins MFS family 'b2077' 'MFS family of transport protein (1st module)' ecoli89 - 1,7,1 Cell processes Cell division Cell division 'ftsW' 'cytoplasmic membrane required for PBP2 expression' ecoli675 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'potE' 'APC family putrescine-lyase antiporter' ecoli2608 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'gabP' 'transport permease protein of gamma-aminobutyrate' ecoli3157 1,5,13 Cell processes Transport/binding proteins DcuC 'yhcL' 'DcuC family of transport protein (2nd module)' ecoli643 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'gltK' 'ABC superfamily (membrane) glutamate/aspartate transport (1st module)' ecoli369 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'sbmA' 'ABC superfamily (atp&memb) sensitivity to microcin B17; methylmalonyl-CoA mutase (mcm); ATP-binding and membrane component' ecoli1159 1,5,27 Cell processes Transport/binding proteins NhaA family 'nhaB' 'NhaB family of transport protein Na+/H+ antiporter regulator of intracellular pH(1st module)' ecoli3675 1,5,21 Cell processes Transport/binding proteins MFS family 'yieO' 'MFS family of tranport protein (1st mdule)' ecoli908 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ycbE' 'paral putative ATP-binding component of transport system' ecoli3466 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'dppB' 'ABC superfamily (membrane) membrane component of dipeptide ABC transport system; permease protein 1(2nd module)' ecoli1262 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'sapD' 'ABC superfamily (atp_bind) ATP-binding protein of peptide transport system(2nd module) affects potassium transport;' ecoli3671 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'rbsC' 'ABC superfamily (membrane) ABC superfamily of transport protein D-ribose high-affinity ABC transport system(1st module ATP-binding subunit)' ecoli1705 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'celB' 'PTS family sugar specific enzyme II for cellobiose arbutin and salicin' ecoli453 1,5,33 Cell processes Transport/binding proteins RNDfamily 'acrB' 'RND family of transport protein acridine efflux pump(2nd module)' ecoli345 1,5,21 Cell processes Transport/binding proteins MFS family 'b0353' 'MFS family transport protein (2nd module function unknown)' ecoli394 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'b0402' 'APC family of transport protein proline permease transport protein' ecoli3059 1,5,21 Cell processes Transport/binding proteins MFS family 'yhaU' 'MFS family of transport protein (D)-glucarate or galactarate transporter (1st module)' ecoli3463 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'dppF' 'ABC superfamily (atp_bind) ATP-binding component of a dipeptide transport system(1st module)' ecoli2129 1,5,21 Cell processes Transport/binding proteins MFS family 'yeiO' 'MFS family proton-coupled sugar efflux pump transport selective monosaccharides and disaccharides narrower substr. specificity than SetA(2nd module)' ecoli4026 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'cadB' 'APC family transport of lysine/cadaverine(1st module)' ecoli989 1,5,35 Cell processes Transport/binding proteins SSS family 'putP' 'SSS family transport protein major sodium/proline symporter' ecoli2139 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'yejF' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli2448 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'uraA' 'uracil transport' ecoli3761 1,5,39 Cell processes Transport/binding proteins Trk system 'trkH' 'Trk system potassium uptake requires TrkE' ecoli861 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'cydC' 'ABC superfamily (atp&memb) ATP-binding and membrane components of cytochrome-related ABC transport(2nd module)' ecoli2899 1,5,21 Cell processes Transport/binding proteins MFS family 'nupG' 'MFS family of transport protein transport of nucleosides (2nd module)' ecoli3273 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'kefB' 'K+ efflux; NEM-activable K+/H+ antiporter' ecoli3379 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'livM' 'ABC superfamily (membrane) membrane component of high-affinity branched-chain amino acid ABC transport system (2nd module)' ecoli3022 1,5,10 Cell processes Transport/binding proteins DAACS family 'ygjU' 'DAACS family Na+/serine (threonine) symporter' ecoli3011 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'ygjI' 'APC family paral putative amino-acid transport protein' ecoli2035 1,5,33 Cell processes Transport/binding proteins RNDfamily 'b2076' 'RND family of transport protein paral putative outer membrane receptor' ecoli3792 - 2,2,1 Macromolecule metabolism Macromolecule synthesis, modification Amino acyl tRNA syn; tRNA modification 'yihY' 'tRNA processing exoribonuclease BN' ecoli3630 1,5,43 Cell processes Transport/binding proteins ArAAP family 'tnaB' 'ArAAP family low affinity tryptophan permease' ecoli3378 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'livG' 'ABC superfamily (atp_bind) ATP-binding component of high-affinity branched-chain amino acid ABC transport system' ecoli3668 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'kup' 'low affinity potassium transport system' ecoli4121 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yjfF' 'ABC superfamily (membrane) ABC superfamily of transport protein (1st module membrance component)' ecoli3648 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'pstC' 'ABC superfamily (membrane) membrane component of high-affinity phosphate-specific ABC transport system (2nd module)' ecoli393 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'brnQ' 'branched chain; mutants valine and o-methylthreonine resistant glyclyvaline sensitive; transport system I for Ile Leu and Val' ecoli1218 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'oppF' 'ABC superfamily (atp_bind) ATP-binding protein of oligopeptide ABC transport system' ecoli1570 1,5,34 Cell processes Transport/binding proteins SMR family 'b1600' 'SMR family of transport protein' ecoli1414 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1443' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli862 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'cydD' 'ABC superfamily (atp&memb) ATP-binding and membrane components of cytochrome-related ABC transport Zn sensitive(2nd module)' ecoli388 1,5,21 Cell processes Transport/binding proteins MFS family 'araJ' 'MFS family of transport protein involved in either transport or processing of arabinose polymers (2nd module function unknown)' ecoli1098 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'potB' 'ABC superfamily (membrane) membrane component of spermidine/putrescine ABC transport system(2nd module)' ecoli4178 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'fecC' 'ABC superfamily (membrane) citrate-dependent iron(III) transport protein (2nd module)' ecoli1463 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'xasA' 'APC family acid sensitivity protein putative glutamate:gamma-aminobutyric acid antiporter (GadC)(2nd module)' ecoli2343 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'b2386' 'Sugar Specific paral putative membrane component of transport system' ecoli1786 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'manY' 'Sugar Specific PTS Sugar specific-family of transport protein; enzyme IIC mannose-specific(1st module)' ecoli2138 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yejE' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli469 1,5,9 Cell processes Transport/binding proteins CPA2 family 'ybaL' 'CPA2 family transport protein' ecoli3287 1,5,21 Cell processes Transport/binding proteins MFS family 'yhfC' 'MFS family of transport protein paral putative transport protein' ecoli2375 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'cysU' 'ABC superfamily (membrane) membrane component of sulfate thiosulfate ABC transport system (2nd module)' ecoli874 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'b0899' 'APC family paral putative amino-acid transport protein' ecoli40 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'caiT' 'probable carnitine transporter' ecoli2878 1,5,21 Cell processes Transport/binding proteins MFS family 'galP' 'MFS family of transport protein galactose-proton symport of transport system (2nd module)' ecoli2005 - 1,2,1 Cell processes Chromosome replication Chromosome replication 'b2046' 'probable export protein /export to periplasm in colanic acid gene cluster' ecoli2159 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ccmA' 'ABC superfamily (atp_bind) ATP-binding component of heme exporter A heme exporter protein A cytochrome c-type biogenesis protein' ecoli2108 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'mglA' 'ABC superfamily (atp_bind) ATP-binding component of methyl-galactoside transport and galactose taxis (2nd module)' ecoli2741 1,5,21 Cell processes Transport/binding proteins MFS family 'fucP' 'MFS family of transport protein fucose permease(1st module)' ecoli3711 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'yifK' 'APC family paral putative amino-acid transport protein' ecoli3402 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'nikD' 'ABC superfamily (atp_bind) ATP-binding component of nickel ABC transport system probably couples energy to transport system(2nd module)' ecoli1283 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1312' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli2181 - 3,4,4 Metabolism of small molecules Degradation of small molecules Fatty acids 'atoB' 'short chain fatty acids transporter' ecoli1882 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1917' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli3408 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yhhJ' 'ABC superfamily (membrane)paral putative transport system membrane (2nd module)' ecoli3934 - 3,2,8 Metabolism of small molecules Biosynthesis of cofactors, carriers Menaquinone, ubiquinone 'ubiA' 'p-hydroxybenzoate: octaprenyltransferase(1st module)' ecoli481 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b0490' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli3469 - 1,4,3 Cell processes Protection responses Drug/analog sensitivity 'yhjX' 'putative resistance protein' ecoli19 1,5,27 Cell processes Transport/binding proteins NhaA family 'nhaA' 'NhaA family of transport protein Na+/H antiporter pH dependent(1st module)' ecoli3583 1,5,21 Cell processes Transport/binding proteins MFS family 'yicM' 'MFS family of tranport protein (1st mdule)' ecoli3855 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'frwC' 'PTS system fructose-like IIC component first module overlaps second(2nd module)' ecoli736 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'modF' 'ABC superfamily (atp_bind) ATP-binding component of molybdenum transport system (2nd module)' ecoli1215 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'oppB' 'ABC superfamily (membrane) membrane component of oligopeptide ABC transport system(2nd module)' ecoli1440 1,5,21 Cell processes Transport/binding proteins MFS family 'narU' 'MFS family of transport protein nitrate sensor-transmitter protein anaerobic respiratory path(1st module)' ecoli2443 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b2492' 'membrane protein formate transporter of hyf operon (formate channel 2)' ecoli1334 1,5,39 Cell processes Transport/binding proteins Trk system 'trkG' 'Trk system potassium uptake' ecoli2107 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'mglC' 'ABC superfamily (membrane) membrane component of methyl-galactoside ABC transport system and galactose taxis(1st module)' ecoli1679 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'btuC' 'ABC superfamily (membrane) membrane component of vitamin B12 ABC transport system' ecoli440 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'mdlA' 'ATP-binding component of a transport system (2nd module)' ecoli477 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'b0486' 'APC family of transport protein amino-acid transport protein' ecoli3698 - 3,3,18 Metabolism of small molecules Central intermediary metabolism Sugar-nucleotide biosynthesis, conversions 'rfe' 'synthesis of enterobacterial common antigen (ECA): UDP-GlcNAc:undecaprenylphosphate GlcNAc-1-phosphate transferase; synt' ecoli4185 1,5,19 Cell processes Transport/binding proteins GntP family 'yjhF' 'GntP family of transport protein (1st module)' Training Accuracy: 113/123 (91.87%) Training Frequency class 'Transport/binding proteins': 187/939 (19.91%) Training Significance: dev(19.98) ; prob(1.066869E-66) Evaluation on validation data (471 items): ecoli4014 1,5,16 Cell processes Transport/binding proteins GPH family 'melB' 'GPH family melibiose permease II' ecoli3929 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'malK' 'ABC superfamily (atp_bind) ATP-binding component of transport system for maltose phenotypic repressor of mal operon(1st module)' ecoli2868 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'cmtA' 'PTS family mannitol-specific enzyme II component cryptic' ecoli837 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'artM' 'arginine 3rd transport system permease protein' ecoli1883 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yecC' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli441 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'mdlB' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli3521 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'mtlA' 'Sugar Specific PTS family mannitol-specific enzyme IIABC components (3rd module eii a domain phosphoryl by p-hpr 491-637)' ecoli419 1,5,21 Cell processes Transport/binding proteins MFS family 'b0427' 'MFS family transport protein' ecoli4119 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'ytfS' 'ABC superfamily (atp_bind) putative ATP-binding component of a transport system' ecoli3990 - 3,3,13 Metabolism of small molecules Central intermediary metabolism Phosphorus compounds 'phnL' 'ABC superfamily (atp_bind) ATP-binding component of phosphonate ABC transport believed to be part of carbon-phosphorus (C-P) lyase in phosphonate metabolism' ecoli1074 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'ptsG' 'Sugar Specific PTS family glucose-specific IIBCcomponent (3rd module hydrophilic second phosphorylation domain) mutant form transports D-ribose' ecoli2237 - 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'nuoK' 'NADH dehydrogenase I chain K' ecoli1216 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'oppC' 'ABC superfamily (membrane)homolog of Salmonella oligopeptide transport permease protein(2nd module)' ecoli3416 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'pitA' 'low-affinity phosphate transport' ecoli3926 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'malG' 'ABC superfamily (membrane) membrane component of maltose ABC transport system (2nd module)' ecoli3375 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'ugpA' 'ABC superfamily (membrane) sn-glycerol 3-phosphate integral membrane protein ABC transport system' ecoli2778 1,5,21 Cell processes Transport/binding proteins MFS family 'araE' 'MFS family of transport protein low-affinity L-arabinose transport system proton symport protein(1st module)' ecoli1289 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1318' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli4098 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'cycA' 'APC family transport of D-alanine D-serine and glycine (2nd module)' ecoli2623 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'proW' 'ABC superfamily (membrane) membrane component of high-affinity ABC transport system for glycine betaine and proline (2nd module)' ecoli1196 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'narK' 'nitrite extrusion protein(2nd module)' ecoli443 - 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'amtB' 'probable ammonium transporter' ecoli2089 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yehY' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli1627 1,5,21 Cell processes Transport/binding proteins MFS family 'b1657' 'MFS family of transport protein (2nd module)' ecoli2772 - 1,4,3 Cell processes Protection responses Drug/analog sensitivity 'ygeD' 'putative resistance proteins' ecoli1630 1,5,21 Cell processes Transport/binding proteins MFS family 'ydhC' 'MFS family transport protein (2nd module)' ecoli70 1,5,21 Cell processes Transport/binding proteins MFS family 'yabM' 'MFS family of transport protein proton-coupled beta-galactosidase/sugar efflux pump ? role in lactose metabolism (2nd module)' ecoli4130 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'treB' 'PTS family enzyme II trehalose specific (maltose may be transported)' ecoli2158 - 1,2,1 Cell processes Chromosome replication Chromosome replication 'ccmB' 'heme exporter protein B cytochrome c-type biogenesis protein' ecoli4168 1,5,21 Cell processes Transport/binding proteins MFS family 'yjhB' 'MFS family of tranport protein (1st module)' ecoli1973 1,5,42 Cell processes Transport/binding proteins APC family of transport protein 'yeeF' 'APC family paral putative amino-acid transport protein' ecoli2929 - 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'hybB' 'probable cytochrome Ni/Fe component of hydrogenase-2(1st module)' ecoli3938 1,5,20 Cell processes Transport/binding proteins MATE family 'dinF' 'MATE family of transport protein; also DNA-damage-inducible protein F;induced by UV and mitomycin C; SOS lexA regulon(2nd module)' ecoli1282 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1311' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli3464 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'dppD' 'ABC superfamily (atp_bind) ATP-binding component of dipeptide tABCransport system(2nd module)' ecoli838 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'artQ' 'ABC superfamily (membrane) membrane component of 3rd arginine ABC transport system' ecoli1796 1,5,21 Cell processes Transport/binding proteins MFS family 'b1828' 'MFS family of transport protein (2nd module)' ecoli1734 - 2,1,4 Macromolecule metabolism Macromolecule degradation Degradation of proteins, peptides, glyco 'sppA' 'protease IV a signal peptide peptidase(2nd module)' ecoli642 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'gltL' 'ABC superfamily (atp_bind) ATP-binding protein of glutamate/aspartate transport system' ecoli1424 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b1453' 'L-asparagine permease (2nd module)' ecoli3377 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'livF' 'ABC superfamily (atp_bind) ATP-binding component of leucine ABC transport system' ecoli1677 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'btuD' 'ABC superfamily (atp_bind) ATP-binding component of vitamin B12 ABC transport system(2nd module)' ecoli3659 1,5,24 Cell processes Transport/binding proteins Membrane-bound ATP synthase 'atpB' 'membrane-bound ATP synthase F0 sector subunit a' ecoli3490 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'xylH' 'ABC superfamily (membrane)d-xylose transport permease (2nd module might interact with atp hydrolysing subunit )' ecoli3465 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'dppC' 'ABC superfamily (membrane) membrane component of dipeptide ABC transport system; permease protein 2 (2nd module)' ecoli3223 - 1,2,1 Cell processes Chromosome replication Chromosome replication 'prlA' 'protein secretion inner membrane preprotein translocase SecY subunit interacts with SecE (1st module)' ecoli3925 1,5,21 Cell processes Transport/binding proteins MFS family 'xylE' 'MFS family of tranport protein xylose-proton symport (2nd module)' ecoli3612 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'yidT' 'D-galactonate transport' ecoli2909 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b2975' 'LctP transporter L-lactate permease homologue' ecoli199 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'abc' 'ABC superfamily (atp_bind) ATP-binding component of ABC transport system(1st module)' ecoli3588 1,5,21 Cell processes Transport/binding proteins MFS family 'uhpC' 'regulator of uhpT (1st module)' ecoli4177 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'fecD' 'ABC superfamily (membrane) membrane component of citrate-dependent ABC transport system of iron' ecoli1868 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'araG' 'ABC superfamily (atp_bind) ATP-binding component of high-affinity l-arabinose transport system (2nd module)' ecoli1867 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'b1899' 'split high-affinity L-arabinose transport system; membrane protein fragment 1' ecoli1996 - 4,1,4 Structural elements Cell envelop Surface polysaccharides & antigens 'rfbX' 'hydroponic protein o-antigen (3rd module)' ecoli4005 1,5,21 Cell processes Transport/binding proteins MFS family 'proP' 'MFS family of tranport protein low-affinity constitutive transport system; proline permease II transports proline and betaine under conditions of hyperosmolarity(2nd module)' ecoli741 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'modC' 'ABC superfamily (atp_bind) ATP-binding component of molybdate ABC transport (1st module)' ecoli1591 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'malX' 'Sugar Specific PTS family maltose and glucose-specific ii abc (2nd module hydrophilic second phosphorylation domain)' ecoli3631 1,5,21 Cell processes Transport/binding proteins MFS family 'yidY' 'MFS family of tranport protein (1st mdule)' ecoli3093 1,5,43 Cell processes Transport/binding proteins ArAAP family 'mtr' 'ArAAP family tryptophan-specific transport protein' ecoli1724 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1756' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli831 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'potG' 'ABC superfamily (atp_bind) ATP-binding component of putrescine ABC transport system(1st module)' ecoli2280 1,5,21 Cell processes Transport/binding proteins MFS family 'b2322' 'MFS family of transport protein paral putative (2nd module)' ecoli4000 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'phnC' 'ABC superfamily (atp_bind) ATP-binding component of phosphonate ABC transport system' ecoli1412 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'b1441' 'ABC superfamily (atp_bind) paral putative ATP-binding component of transport system' ecoli2380 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'b2429' 'Sugar Specific paral putative PTS system enzyme II' ecoli4031 1,5,12 Cell processes Transport/binding proteins Dcu family 'dcuA' 'Dcu family anaerobic dicarboxylate transport' ecoli2169 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'yojI' 'ABC superfamily (atp&memb) paral putative ATP-binding component of transport system' ecoli2818 1,5,13 Cell processes Transport/binding proteins DcuC 'b2882' 'DcuC family paral putative transport protein' ecoli1217 1,5,1 Cell processes Transport/binding proteins ABC superfamily (atp_bind) 'oppD' 'ABC superfamily (atp_bind) ATP-binding protein of oligopeptide ABC transport system' ecoli1189 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'chaA' 'sodium-calcium/proton antiporter' ecoli3709 - 3,3,18 Metabolism of small molecules Central intermediary metabolism Sugar-nucleotide biosynthesis, conversions 'rffT' 'synthesis of enterobacterial common antigen (ECA): TDP-Fuc4NAc:lipidII transferase(1st module)' ecoli692 1,5,31 Cell processes Transport/binding proteins POT family 'b0709' 'POT family of transport protein (1st module)' ecoli3451 1,5,10 Cell processes Transport/binding proteins DAACS family 'dctA' 'DAACS family of transport protein uptake of C4-dicarboxylic acids' ecoli2824 1,5,26 Cell processes Transport/binding proteins NCS2 family 'b2888' 'NCS2 family paral putative transport protein' ecoli1737 1,5,21 Cell processes Transport/binding proteins MFS family 'ydjE' 'MFS family of transport protein (1st module)' Validation Accuracy: 66/76 (86.84%) Validation Frequency class 'Transport/binding proteins': 104/471 (22.08%) Validation Significance: dev(13.61) ; prob(3.990802E-33) ------------------ Rule 45: (13/1, lift 4.4) [hom( A ),classification( A ,bos)] = 0 [hom( A ),species( A ,bacillus_licheniformis)] = 0 [hom( A ),e_val_lteq( A ,0.0006),keyword( A ,transmembrane)] = 1 [hom( A ),species( A ,salmonella_typhimurium),keyword( A ,transmembrane)] = 1 [hom( A ),species( A ,mycoplasma_hyorhinis),mol_wt_lteq( A ,77359)] = 0 [hom( A ),species( A ,halobacterium_halobium),mol_wt_gt( A ,55220)] = 0 [hom( A ),mol_wt_gt( A ,43194),classification( A ,chroococcales)] = 0 -> class 'Transport/binding proteins' [0.867] Evaluation on test data (712 items): ecoli1913 - 4,1,5 Structural elements Cell envelop Surface structures 'fliP' 'flagellar biosynthesis' ecoli2765 - 2,2,7 Macromolecule metabolism Macromolecule synthesis, modification Phospholipids 'lgt' 'phosphatidylglycerol-prolipoprotein diacylglyceryl transferase; a major membrane phospholipid posttranslational lipid modification oflipoproteins' ecoli2364 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'cysZ' 'required for sulfate transport' ecoli222 - 4,1,5 Structural elements Cell envelop Surface structures 'fhiA' 'flagellar biosynthesis paral putative transport protein' ecoli1457 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'b1486' 'ABC superfamily (membrane) paral putative membrane component of transport system' ecoli2120 1,5,7 Cell processes Transport/binding proteins CNT family 'yeiJ' 'CNT family of transport protein' ecoli2123 1,5,7 Cell processes Transport/binding proteins CNT family 'yeiM' 'CNT family of transport protein' ecoli3603 1,5,37 Cell processes Transport/binding proteins Sugar-specific PTS system 'glvB' 'Sugar Specific family of transport protein PTS system arbutin-like IIB component' Test Accuracy: 5/8 (62.50%) Test Frequency class 'Transport/binding proteins': 151/712 (21.21%) Test Significance: dev(2.86) ; prob(1.345962E-02) Application to new data (2167 items): ecoli2859 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yggB' 'involved in stability of MscS mechanosensitive channel paral putative transport protein' ecoli1340 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1369' 'orf' ecoli1597 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1627' 'orf' ecoli850 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0874' 'putative surface protein' ecoli1930 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1966' 'putative outer membrane protein' ecoli1213 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ychE' 'orf' ecoli1763 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1795' 'orf' ecoli4253 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjjP' 'paral putative membrane protein' ecoli823 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0847' 'putative transport protein' ecoli1342 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1371' 'orf' ecoli1500 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ydeB' 'orf' ecoli309 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0317' 'orf' ecoli1168 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1195' 'orf' ecoli2974 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ygiE' 'orf' ecoli1186 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1213' 'orf' ecoli1789 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1821' 'orf' ecoli1972 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yeeE' 'paral putative membrane component of transport system' ecoli3431 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yhiD' 'putative transport ATPase' ecoli1313 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1342' 'orf' ecoli3568 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yicG' 'orf' ecoli1848 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1880' 'putative part of export apparatus for flagellar proteins' ecoli4164 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjgX' 'orf' ecoli1894 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yedE' 'paral putative membrane component of transport system' ecoli286 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0294' 'orf' ecoli1443 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1472' 'putative outer membrane porin protein' ecoli3635 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yieG' 'orf (2nd module)' ecoli4147 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b4257' 'orf joins former yjgN and yjgO' ecoli1928 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1964' 'putative outer membrane protein' ecoli10 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yaaH' 'orf' Frequency rule on new data: 29/2167 (1.34%) Evaluation on training data (939 items): ecoli3590 1,5,21 Cell processes Transport/binding proteins MFS family 'uhpA' 'response regulator positive activator of uhpT transcription (sensor uhpB)(1st module)' ecoli3928 1,5,3 Cell processes Transport/binding proteins ABC superfamily (peri_perm) 'malE' 'ABC superfamily (bind_prot) periplasmic maltose-binding protein; substrate recognition for transport and chemotaxis with chaperone properties(2nd module)' ecoli1866 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'araH' 'high-affinity L-arabinose transport system; membrane protein fragment 2' ecoli2940 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'exbB' 'uptake of enterochelin; tonB-dependent uptake of B colicins' ecoli577 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'fepE' 'ferric enterobactin (enterochelin) transport' ecoli1281 1,5,3 Cell processes Transport/binding proteins ABC superfamily (peri_perm) 'b1310' 'ABC superfamily (peri_perm) ABC super family transport protein peri_perm subunit' ecoli68 1,5,16 Cell processes Transport/binding proteins GPH family 'tbpA' 'ABC superfamily (periplasmic) periplasmic thiamine binding component of thiamine transport protein' ecoli2376 1,5,3 Cell processes Transport/binding proteins ABC superfamily (peri_perm) 'cysP' 'thiosulfate binding protein' ecoli1096 1,5,3 Cell processes Transport/binding proteins ABC superfamily (peri_perm) 'potD' 'ABC superfamily (peri_perm) spermidine/putrescine binding periplasmic ABC transport protein' ecoli4255 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'yjjR' '2-component transcriptional regulator' ecoli830 1,5,3 Cell processes Transport/binding proteins ABC superfamily (peri_perm) 'potF' 'ABC superfamily (peri_perm) periplasmic putrescine-binding protein ABC transport system(1st module)' ecoli4180 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'fecA' 'outer membrane receptor; citrate-dependent iron transport outer membrane receptor(1st module)' ecoli3728 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'corA' 'Mg2+ transport system I' Training Accuracy: 12/13 (92.31%) Training Frequency class 'Transport/binding proteins': 187/939 (19.91%) Training Significance: dev(6.54) ; prob(4.128922E-08) Evaluation on validation data (471 items): ecoli588 - 6,1,1 Global functions Global regulatory functions Global regulatory functions 'b0598' 'carbon starvation protein' ecoli3376 1,5,3 Cell processes Transport/binding proteins ABC superfamily (peri_perm) 'ugpB' 'ABC superfamily (peri_perm) ABC super family transport protein peri_perm subunit sn-glycerol 3-phosphate periplasmic binding protein of ABC transport system(2nd module)' ecoli403 1,5,29 Cell processes Transport/binding proteins Outer membrane channel 'tsx' 'nucleoside channel; receptor of phage T6 and colicin K' ecoli3823 1,5,3 Cell processes Transport/binding proteins ABC superfamily (peri_perm) 'sbp' 'periplasmic sulfate-binding protein' Validation Accuracy: 3/4 (75.00%) Validation Frequency class 'Transport/binding proteins': 104/471 (22.08%) Validation Significance: dev(2.55) ; prob(3.355386E-02) ------------------ Rule 15: (7, lift 10.6) [hom( A ),classification( A ,chytridiomycetes)] = 1 [hom( A ),e_val_gt( A ,2e-37),classification( A ,nematocera)] = 0 [hom( A ),species( A ,bacillus_subtilis),mol_wt_gt( A ,77359)] = 0 [hom( A ),mol_wt_gt( A ,55220),classification( A ,archaea)] = 1 [hom( A ),keyword( A ,inner_membrane),classification( A ,epsilon_subdivision)] = 0 -> class 'Degradation of small molecules' [0.889] Evaluation on test data (712 items): ecoli3642 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'bglB' 'phospho-beta-glucosidase B; cryptic' ecoli3486 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'xylB' 'xylulokinase(2nd module)' ecoli790 - 4,1,3 Structural elements Cell envelop Outer membrane constituents 'ompX' 'outer membrane protease receptor for phage OX2' ecoli3832 - 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'glpK' 'glycerol kinase EC 2.7.1.30(2nd module)' Test Accuracy: 2/4 (50.00%) Test Frequency class 'Degradation of small molecules': 79/712 (11.10%) Test Significance: dev(2.48) ; prob(6.339307E-02) Application to new data (2167 items): ecoli2148 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yejO' 'orf' ecoli1482 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1511' 'paral putative kinase' ecoli1634 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1664' 'possible enzyme(2nd module)' ecoli1142 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1169' 'orf' ecoli3474 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yiaD' 'paral putative membrance protein (2nd module)' ecoli2028 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yegD' 'paral putative heatshock protein (Hsp70)' ecoli2716 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ygcE' 'paral putative kinase' Frequency rule on new data: 7/2167 (0.32%) Evaluation on training data (939 items): ecoli3613 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'yidU' 'multimodular 2-oxo-3-deoxygalactonate 6-phosphate aldolase and galactonate dehydratase (2nd module galactonate dehydratase)' ecoli2654 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'ascG' 'ascBF operon repressor(2nd module)' ecoli2405 3,4,1 Metabolism of small molecules Degradation of small molecules Amines 'eutJ' 'paral putative heatshock protein (Hsp70)' ecoli2837 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'bglA' '6-phospho-beta-glucosidase A; cryptic' ecoli2743 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'fucK' 'L-fuculokinase' ecoli3810 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'rhaB' 'rhamnulokinase' ecoli3502 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'lyxK' 'L-xylulose kinase cryptic (entire)(2nd module)' Training Accuracy: 7/7 (100.00%) Training Frequency class 'Degradation of small molecules': 79/939 (8.41%) Training Significance: dev(8.73) ; prob(2.983531E-08) Evaluation on validation data (471 items): ecoli63 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'araB' 'L-ribulokinase(2nd module)' ecoli3009 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'ebgA' 'evolved beta-D-galactosidase alpha subunit; cryptic gene(1st module)' ecoli2656 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'ascB' '6-phospho-beta-glucosidase; cryptic' ecoli2774 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'galR' 'repressor of galETK operon' ecoli2719 - 3,5,5 Metabolism of small molecules Energy metabolism, carbon Glycolysis 'eno' 'enolase EC 4.2.1.11' ecoli336 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'lacZ' 'beta-D-galactosidase(1st module)' ecoli3181 - 1,7,1 Cell processes Cell division Cell division 'mreB' 'split gene mecillinam resistance; cell shape affects division versus elongation; fragment 1 (2nd module)' Validation Accuracy: 5/7 (71.43%) Validation Frequency class 'Degradation of small molecules': 34/471 (7.22%) Validation Significance: dev(6.56) ; prob(3.543483E-05) ------------------ Rule 37: (3, lift 44.2) [hom( A ),classification( A ,chytridiomycetes)] = 0 [hom( A ),e_val_gt( A ,2e-37),classification( A ,muridae)] = 0 [hom( A ),e_val_lteq( A ,6e-14),classification( A ,chondrichthyes)] = 0 [hom( A ),mol_wt_gt( A ,43194),classification( A ,nematocera)] = 1 -> class 'Nucleotide biosynthesis' [0.800] Evaluation on test data (712 items): ecoli2670 - 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'hypE' 'plays structural role in maturation of all 3 hydrogenases' ecoli2765 - 2,2,7 Macromolecule metabolism Macromolecule synthesis, modification Phospholipids 'lgt' 'phosphatidylglycerol-prolipoprotein diacylglyceryl transferase; a major membrane phospholipid posttranslational lipid modification oflipoproteins' Test Accuracy: 0/2 (0.00%) Test Frequency class 'Nucleotide biosynthesis': 12/712 (1.69%) Test Significance: dev(-0.19) ; prob(1.000000E+00) Application to new data (2167 items): ecoli892 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0917' 'orf' ecoli640 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0650' 'paral putative heatshock protein (Hsp70)' ecoli4253 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yjjP' 'paral putative membrane protein' ecoli3391 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b3468' 'putative enzyme' ecoli250 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ykfC' 'orf' ecoli3087 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhbQ' 'orf' Frequency rule on new data: 6/2167 (0.28%) Evaluation on training data (939 items): ecoli2451 3,7,1 Metabolism of small molecules Nucleotide biosynthesis Purine ribonucleotide biosynthesis 'purN' 'phosphoribosylglycinamide formyltransferase 1' ecoli2508 3,7,1 Metabolism of small molecules Nucleotide biosynthesis Purine ribonucleotide biosynthesis 'purL' 'phosphoribosylformyl-glycineamide synthetase = FGAM synthetase' ecoli1203 3,7,1 Metabolism of small molecules Nucleotide biosynthesis Purine ribonucleotide biosynthesis 'purU' 'formyltetrahydrofolate hydrolase (activated by methionine inhibited by glycine)' Training Accuracy: 3/3 (100.00%) Training Frequency class 'Nucleotide biosynthesis': 17/939 (1.81%) Training Significance: dev(12.76) ; prob(5.934033E-06) Evaluation on validation data (471 items): ecoli409 - 3,2,14 Metabolism of small molecules Biosynthesis of cofactors, carriers Thiamin 'b0417' 'thiamin-monophosphate kinase' ecoli2450 3,7,1 Metabolism of small molecules Nucleotide biosynthesis Purine ribonucleotide biosynthesis 'purM' 'phosphoribosylaminoimidazole synthetase = AIR synthetase' Validation Accuracy: 1/2 (50.00%) Validation Frequency class 'Nucleotide biosynthesis': 5/471 (1.06%) Validation Significance: dev(6.75) ; prob(2.100604E-02) ------------------ Rule 31: (8/1, lift 18.8) [hom( A ),classification( A ,embryophyta)] = 0 [hom( A ),mol_wt_lteq( A ,43194),classification( A ,protacanthopterygii)] = 1 -> class 'Laterally acquirred elements' [0.800] Evaluation on test data (712 items): ecoli1989 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_7' 'IS5 protein' ecoli572 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi81_2' 'IS186 protein' ecoli4167 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi41' 'IS4 protein' ecoli1953 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_6' 'IS5 protein' ecoli3165 - 2,1,4 Macromolecule metabolism Macromolecule degradation Degradation of proteins, peptides, glyco 'hhoB' 'periplasmic serine endoprotease(2nd module)' Test Accuracy: 4/5 (80.00%) Test Frequency class 'Laterally acquirred elements': 25/712 (3.51%) Test Significance: dev(9.29) ; prob(7.386457E-06) Application to new data (2167 items): ecoli2791 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2854' 'orf' ecoli2351 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2394' 'putative transposase' ecoli1568 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1598' 'orf' Frequency rule on new data: 3/2167 (0.14%) Evaluation on training data (939 items): ecoli161 - 2,1,4 Macromolecule metabolism Macromolecule degradation Degradation of proteins, peptides, glyco 'htrA' 'periplasmic serine protease Do; heat shock protein HtrA(2nd module)' ecoli251 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_1' 'IS5 protein 1' ecoli3428 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_11' 'IS5 protein 11' ecoli2150 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_8' 'IS5 protein' ecoli542 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_2' 'IS 5 protein' ecoli16 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi81_1' 'homolog IS186 transposase' ecoli1341 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_5' 'IS5 protein' ecoli2916 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_9' 'IS5 protein' Training Accuracy: 7/8 (87.50%) Training Frequency class 'Laterally acquirred elements': 40/939 (4.26%) Training Significance: dev(11.66) ; prob(1.960440E-09) Evaluation on validation data (471 items): ecoli1302 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_4' 'IS5 protein' ecoli4042 - 2,2,6 Macromolecule metabolism Macromolecule synthesis, modification Lipoprotein 'blc' 'outer membrane lipoprotein (lipocalin)' ecoli3823 - 1,5,3 Cell processes Transport/binding proteins ABC superfamily (peri_perm) 'sbp' 'periplasmic sulfate-binding protein' ecoli3148 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi52_10' 'IS5 protein 10' Validation Accuracy: 2/4 (50.00%) Validation Frequency class 'Laterally acquirred elements': 26/471 (5.52%) Validation Significance: dev(3.90) ; prob(1.632053E-02) ------------------ Rule 18: (17/2, lift 8.9) [hom( A ),classification( A ,carnivora)] = 1 [hom( A ),classification( A ,corynebacteriaceae)] = 0 [hom( A ),keyword( A ,transmembrane),classification( A ,kinetoplastida)] = 1 [hom( A ),keyword( A ,inner_membrane),classification( A ,epsilon_subdivision)] = 0 -> class 'Energy metabolism carbon' [0.842] Evaluation on test data (712 items): ecoli2241 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'nuoG' 'NADH dehydrogenase I chain G' ecoli680 - 1,5,30 Cell processes Transport/binding proteins P-type ATPase family 'kdpB' 'P-type ATPase familyATPase of high-affinity potassium transport system B chain' ecoli4248 - 1,6,2 Cell processes Adaptation Osmotic adaptation 'mdoB' 'phosphoglycerol transferase I add phosphoglycerols to OPG backbone' ecoli2246 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'nuoA' 'NADH dehydrogenase I chain A' ecoli4132 - 1,5,30 Cell processes Transport/binding proteins P-type ATPase family 'mgtA' 'P-type ATPase familyMg2+ transport ATPase P-type 1(2nd module)' ecoli1249 - 2,2,7 Macromolecule metabolism Macromolecule synthesis, modification Phospholipids 'pgpB' 'non-essential phosphatidylglycerophosphate phosphatase membrane bound' ecoli2435 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'b2484' 'hydrogenase 4 membrane subunit(1st module)' ecoli2201 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'glpC' 'sn-glycerol-3-phosphate dehydrogenase (anaerobic) K-small subunit(2nd module)' ecoli3308 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'gph' 'phosphoglycolate phosphatase' ecoli2660 3,5,4 Metabolism of small molecules Energy metabolism, carbon Fermentation 'hycF' 'probable iron-sulfur protein of hydrogenase 3 (part of FHL complex)' ecoli2664 3,5,4 Metabolism of small molecules Energy metabolism, carbon Fermentation 'hycB' 'probable small subunit of hydrogenase-3 iron-sulfur protein (part of formate hydrogenlyase (FHL) complex)' ecoli3392 - 1,5,30 Cell processes Transport/binding proteins P-type ATPase family 'b3469' 'P-type ATPase familyzinc-transporting ATPase(2nd module)' Test Accuracy: 6/12 (50.00%) Test Frequency class 'Energy metabolism carbon': 70/712 (9.83%) Test Significance: dev(4.67) ; prob(4.933563E-04) Application to new data (2167 items): ecoli2022 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2063' 'orf (2nd module)' ecoli2251 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2293' 'orf' ecoli299 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ykgF' 'orf' ecoli798 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0822' 'orf' ecoli1642 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1674' 'paral putative oxidoreductase' ecoli3495 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b3573' 'paral putative oxidoreductase' ecoli2835 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2899' 'putative oxidoreductase' ecoli1919 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1955' 'orf' ecoli438 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'cof' 'complements deletion mutant for growth on succinate(1st module)' ecoli1559 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1589' 'paral putative oxidoreductase Fe-S subunit' ecoli3618 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yidA' 'orf' ecoli2513 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2562' 'orf' ecoli1639 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1671' 'paral putative oxidoreductase (2nd module)' ecoli4260 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjjG' 'orf' ecoli3791 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yihX' 'paral putative enzyme' ecoli820 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0844' 'orf' ecoli3861 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yijP' 'orf (2nd module)' ecoli3128 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yrbI' 'orf formerly yrbI and yrbJ' ecoli1695 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1727' 'orf' ecoli2352 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yfeA' 'orf(2nd module)' ecoli966 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0992' 'orf' ecoli3456 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b3533' 'putative cellulose synthase' ecoli817 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0841' 'orf' ecoli1379 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1408' 'probable enzyme' ecoli888 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ycaI' 'orf' ecoli2133 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2174' 'orf' ecoli2106 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yeiA' 'paral putative dihydro-orotate oxidase(2nd module)' ecoli1655 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1687' 'paral putative oxidase(1st module)' ecoli999 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1025' 'orf(2nd module) homologue of Yersinia pestis hmsT involved in haemin uptake/storage ?cryptic' ecoli1784 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1816' 'orf (2nd module)' ecoli2822 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2886' 'paral putative oxidoreductase' ecoli996 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1022' 'orf(1st module) homologue of Yersinia pestis hmsR involved in haemin uptake/storage ?cryptic' ecoli355 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0363' 'polysaccharide metabolism(1st module)' ecoli2309 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2352' 'paral putative ligase(1st module)' ecoli1288 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1317' 'orf' ecoli200 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yaeD' 'paral putative enzyme' ecoli3394 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhhQ' 'orf(1st module)' ecoli3636 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b3715' 'orf' ecoli2212 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2254' 'orf(1st module)' Frequency rule on new data: 39/2167 (1.80%) Evaluation on training data (939 items): ecoli3799 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'fdoH' 'formate dehydrogenase-o fe-s subunit (2nd module)' ecoli3966 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'nrfC' 'putative nitrite reductase; formate-dependent Fe-S centers(1st module)' ecoli1446 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'fdnH' 'formate dehydrogenase-N nitrate-inducible Fe-S beta subunit' ecoli1198 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'narH' 'nitrate reductase 1 beta subunit(2nd module)' ecoli870 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'dmsB' 'anaerobic dimethyl sulfoxide reductase subunit B' ecoli2238 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'nuoJ' 'NADH dehydrogenase I chain J(1st module)' ecoli2653 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'hydN' 'involved in electron transport from formate to hydrogen Fe-S centers' ecoli2163 3,5,3 Metabolism of small molecules Energy metabolism, carbon Electron transport 'napG' 'ferredoxin-type protein: electron transfer' ecoli2930 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'hybA' 'thought to be hydrogenase-2 small subunit now identified as hybO(2nd module)' ecoli2162 3,5,3 Metabolism of small molecules Energy metabolism, carbon Electron transport 'napH' 'ferredoxin-type protein: electron transfer(1st module)' ecoli422 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'cyoC' 'cytochrome o ubiquinol oxidase subunit III' ecoli2235 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'nuoM' 'NADH dehydrogenase I chain M(2nd module)' ecoli1865 - 1,6,2 Cell processes Adaptation Osmotic adaptation 'otsB' 'trehalose-6-phosphate phophatase biosynthetic' ecoli2166 3,5,3 Metabolism of small molecules Energy metabolism, carbon Electron transport 'napF' 'ferredoxin-type protein: electron transfer' ecoli2432 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'yffE' 'hydrogenase 4 Fe-S subunit' ecoli2912 - 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'b2978' 'glycolate oxidase Fe-S subunit(1st module) previously thought to be two genes glcE and glcF' ecoli4046 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'frdB' 'fumarate reductase anaerobic Fe-S protein subunit EC 1.3.99.1' Training Accuracy: 15/17 (88.24%) Training Frequency class 'Energy metabolism carbon': 89/939 (9.48%) Training Significance: dev(11.09) ; prob(5.053516E-14) Evaluation on validation data (471 items): ecoli475 - 1,5,4 Cell processes Transport/binding proteins Ars family 'b0484' 'Ars family of transport protein' ecoli2157 - 1,5,2 Cell processes Transport/binding proteins ABC superfamily (membrane) 'ccmC' 'ABC superfamily (membrane)heme exporter protein C necessary for incorporation of heme into CcmE (cytochrome c-type biogenesis protein)' ecoli1598 - 3,3,15 Metabolism of small molecules Central intermediary metabolism Pool, multipurpose conversions of intermed. met'm 'b1628' 'PUTATIVE FERREDOXIN-LIKE PROTEIN' ecoli2239 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'nuoI' 'NADH dehydrogenase I chain I' ecoli423 3,5,1 Metabolism of small molecules Energy metabolism, carbon Aerobic respiration 'cyoB' 'cytochrome o ubiquinol oxidase subunit I' ecoli2439 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'b2488' 'hydrogenase 4 Fe-S subunit' ecoli1438 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'narY' 'nitrate reductase 2 beta subunit(1st module)' ecoli2887 - 1,4,3 Cell processes Protection responses Drug/analog sensitivity 'b2952' 'putative resistance protein' Validation Accuracy: 4/8 (50.00%) Validation Frequency class 'Energy metabolism carbon': 44/471 (9.34%) Validation Significance: dev(3.95) ; prob(3.601248E-03) ------------------ Rule 32: (7/1, lift 18.3) [hom( A ),classification( A ,chytridiomycetes)] = 0 [hom( A ),classification( A ,liliopsida)] = 0 [hom( A ),e_val_gt( A ,6e-14),classification( A ,shigella)] = 1 [hom( A ),mol_wt_gt( A ,77359),classification( A ,cricetinae)] = 0 [hom( A ),mol_wt_gt( A ,77359),classification( A ,mollicutes)] = 0 [hom( A ),mol_wt_gt( A ,43194),classification( A ,nematocera)] = 0 -> class 'Laterally acquirred elements' [0.778] Evaluation on test data (712 items): ecoli1502 - 1,4,3 Cell processes Protection responses Drug/analog sensitivity 'marA' 'multiple antibiotic resistance; transcriptional activator of defense systems (act as monomer does not have a dimerization domain)' ecoli3027 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'exuR' 'negative regulator of exu regulon exuT uxaAC and uxuB(1st module)' ecoli1375 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'tra8_2' 'transposase 2 for IS30' ecoli3937 - 6,1,1 Global functions Global regulatory functions Global regulatory functions 'lexA' 'regulator for SOS(lexA) regulon' ecoli2914 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'glcC' 'transcriptional activator for glc operon(1st module)' ecoli1160 - 3,4,4 Metabolism of small molecules Degradation of small molecules Fatty acids 'fadR' 'negative regulator for fad regulon and positive activator of fabA(1st module)' ecoli3480 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi5B' 'IS150 transposase' ecoli4173 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'tra8_3' 'transposase for IS30' ecoli3545 - 2,2,5 Macromolecule metabolism Macromolecule synthesis, modification Lipopolysaccharide 'rfaK' 'lipopolysaccharide core biosynthesis; probably hexose transferase' ecoli4282 - 2,2,3 Macromolecule metabolism Macromolecule synthesis, modification DNA - replication, repair, restraction/modification 'rob' 'right origin-binding protein(1st module)' ecoli36 - 3,4,1 Metabolism of small molecules Degradation of small molecules Amines 'caiD' 'Canitine racemase' ecoli1374 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi21_2' 'IS21 protein 2' ecoli1910 - 4,1,5 Structural elements Cell envelop Surface structures 'fliM' 'flagellar biosynthesis component of motor switch and energizing enabling rotation and determining its direction' ecoli2979 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi22_5' 'IS2 protein' ecoli4182 - 6,1,1 Global functions Global regulatory functions Global regulatory functions 'fecI' 'sigma factor in two component regulatory system wtih FecR FecR interacts wtih the periplasmic iron binding FecA' ecoli4131 - 1,6,2 Cell processes Adaptation Osmotic adaptation 'treR' 'repressor of treABC' ecoli1735 - 3,4,2 Metabolism of small molecules Degradation of small molecules Amino acids 'ansA' 'cytoplasmic L-asparaginase I' Test Accuracy: 5/17 (29.41%) Test Frequency class 'Laterally acquirred elements': 25/712 (3.51%) Test Significance: dev(5.80) ; prob(2.316556E-04) Application to new data (2167 items): ecoli365 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0373' 'orf' ecoli290 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0298' 'orf' ecoli1763 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1795' 'orf' ecoli3860 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yijO' 'paral putative regulator' ecoli2101 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yohK' 'putative seritonin transporter' ecoli4246 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjjM' 'orf' ecoli3485 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yiaB' 'orf' ecoli1615 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1645' 'orf' ecoli2047 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2088' 'orf' ecoli1990 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yefJ' 'putative creatinase' ecoli2014 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2055' 'PUTATIVE ACETYL TRANSFERASE in colanic acid biosynthesis (1st module)' ecoli292 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ykgA' 'paral putative regulator' ecoli1226 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yciC' 'orf' ecoli2471 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2520' 'orf' ecoli913 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0938' 'paral putative fimbrial-like protein' ecoli4010 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'adiY' 'paral putative regulator' ecoli530 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0540' 'orf' ecoli1523 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1552' 'cold shock-like protein' ecoli2509 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yfhD' 'paral putative periplasmic binding protein of transport system (1st module)' ecoli4161 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b4272' 'orf' ecoli1511 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1540' 'paral putative regulator protein' ecoli259 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yagA' 'orf' ecoli2396 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2445' 'orf' ecoli1946 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1985' 'paral putative transport protein (3rd module)' ecoli1001 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1027' 'orf' ecoli2018 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2059' 'putative glycosyl transferase in colanic acid biosynthesis (1st module)' ecoli3996 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'phnF' 'paral putative regulator protein' ecoli1516 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1545' 'orf' ecoli1548 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1578' 'orf' ecoli1443 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1472' 'putative outer membrane porin protein' ecoli1336 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1365' 'orf' ecoli106 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'hofC' 'orf' ecoli2388 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yfeG' 'paral putative regulator' ecoli1345 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1374' 'putative transposon resolvase' ecoli2308 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2351' 'orf' ecoli1928 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1964' 'putative outer membrane protein' ecoli3491 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'xylR' 'xylose operon regulatory protein (2nd module)' ecoli4162 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b4273' 'putative transposase' Frequency rule on new data: 38/2167 (1.75%) Evaluation on training data (939 items): ecoli1373 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi22_2' 'IS22 protein 2' ecoli248 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'tra8_1' 'transposase1 for IS30' ecoli1955 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi22_3' 'IS2 protein' ecoli1956 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi21_3' 'IS2 protein' ecoli1131 5,1,2 Extrachromosomal Laterally acquirred elements Phage-related functions and prophages 'pin' 'inversion of adjacent DNA; at locus of e14 element' ecoli3956 - 6,1,1 Global functions Global regulatory functions Global regulatory functions 'soxS' 'regulation of superoxide response regulon' ecoli353 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi22_1' 'IS22 protein 1' Training Accuracy: 6/7 (85.71%) Training Frequency class 'Laterally acquirred elements': 40/939 (4.26%) Training Significance: dev(10.67) ; prob(4.030049E-08) Evaluation on validation data (471 items): ecoli1200 - 3,5,2 Metabolism of small molecules Energy metabolism, carbon Anaerobic respiration 'narI' 'nitrate reductase 1 cytochrome b(NR) gamma subunit' ecoli2171 - 2,2,3 Macromolecule metabolism Macromolecule synthesis, modification DNA - replication, repair, restraction/modification 'ada' 'O6-methylguanine-DNA methyltransferase; transcription activator/repressor(1st module)' ecoli352 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi21_1' 'IS21 protein 1' ecoli2798 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi21_4' 'IS2 protein' ecoli556 - 4,1,3 Structural elements Cell envelop Outer membrane constituents 'envY' 'envelope protein; thermoregulation of porin biosynthesis(2nd module)' ecoli1528 - 1,6,1 Cell processes Adaptation Adaptations, atypical conditions 'cspB' 'cold shock protein; may affect transcription' ecoli2797 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi22_4' 'IS2 protein' ecoli2978 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'yi21_5' 'IS2 protein' Validation Accuracy: 4/8 (50.00%) Validation Frequency class 'Laterally acquirred elements': 26/471 (5.52%) Validation Significance: dev(5.51) ; prob(5.179218E-04) ------------------ Rule 8: (9/1, lift 13.5) [hom( A ),classification( A ,rhizobium)] = 0 [hom( A ),e_val_lteq( A ,0.0006),species( A ,oenothera_bertiana__bertero_s_evening_primrose_)] = 0 [hom( A ),e_val_lteq( A ,3e-06),classification( A ,solanum)] = 0 [hom( A ),species( A ,mycoplasma_hyorhinis),mol_wt_lteq( A ,77359)] = 1 [hom( A ),keyword( A ,inner_membrane),classification( A ,epsilon_subdivision)] = 0 -> class 'Cell envelop' [0.818] Evaluation on test data (712 items): ecoli1075 - 1,5,23 Cell processes Transport/binding proteins Mechanism not stated 'fhuE' 'outer membrane receptor for Fe(III)-coprogen Fe(III)-ferrioxamine B and Fe(III)-rhodotrulic acid uptake' ecoli3641 - 1,5,29 Cell processes Transport/binding proteins Outer membrane channel 'yieC' '(bglH) cryptic carbohydrate-specific outer membrane porin(2nd module)' ecoli2173 4,1,3 Structural elements Cell envelop Outer membrane constituents 'ompC' 'outer membrane protein 1b (ib;c) (2nd module)' ecoli139 4,1,5 Structural elements Cell envelop Surface structures 'htrE' 'probable outer membrane porin protein involved in fimbrial assembly (2nd module)' ecoli2114 4,1,3 Structural elements Cell envelop Outer membrane constituents 'cirA' 'outer membrane receptor for iron-regulated colicin I receptor; porin; requires tonB gene product(1st module)' Test Accuracy: 3/5 (60.00%) Test Frequency class 'Cell envelop': 40/712 (5.62%) Test Significance: dev(5.28) ; prob(1.627065E-03) Application to new data (2167 items): ecoli2593 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2647' 'orf' ecoli2191 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yfaL' 'orf' ecoli2148 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yejO' 'orf' ecoli2068 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yehB' 'paral putative outer membrane protein (2nd module)' ecoli2296 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2338' 'paral putative outer membrane protein (2nd module)' ecoli1175 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1202' 'orf' ecoli3932 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjbI' 'orf' ecoli3077 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yraK' 'paral putative fimbrial-like protein' ecoli1343 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1372' 'putative membrane protein' ecoli915 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0940' 'paral putative outer membrane protein (2nd module)' ecoli1940 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1978' 'paral putative factor(2nd module)' ecoli916 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0941' 'paral putative fimbrial-like protein' ecoli1634 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1664' 'possible enzyme(2nd module)' ecoli3997 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b4103' 'orf' ecoli1142 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1169' 'orf' ecoli781 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0805' 'paral putative outer membrane receptor' ecoli2718 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2778' 'orf' ecoli523 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0532' 'paral putative outer membrane protein (2nd module)' ecoli524 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0533' 'involved in fimbrial asembly(2nd module)' ecoli701 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0718' 'paral putative outer membrane protein (2nd module)' ecoli3076 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yraJ' 'paral putative outer membrane protein (2nd module)' ecoli2134 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2175' 'suppresses thermosensitivity of prc mutants at low osmolality; in turn suppressed by multicopy expression of PBP 7' ecoli817 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0841' 'orf' ecoli366 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b0374' 'flagellar protein; similar to 3rd module of ATP-binding components of transporters (2nd module)' ecoli4110 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ytfM' 'orf' ecoli1476 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b1505' 'paral putative outer membrane protein (2nd module)' ecoli2070 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yehD' 'paral putative fimbrial-like protein' ecoli3960 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjcF' 'orf' ecoli1518 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1547' 'orf' ecoli1480 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1509' 'orf' ecoli283 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yagX' 'paral putative enzyme' ecoli4143 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yjgL' 'orf' ecoli2977 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'ygiL' 'paral putative fimbrial-like protein' ecoli363 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b0371' 'orf' ecoli2295 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b2337' 'paral putative outer membrane protein' ecoli3146 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yhcD' 'paral putative outer membrane protein (2nd module)' ecoli1145 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1172' 'orf' ecoli1143 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b1170' 'orf' ecoli3147 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'yhcE' 'orf' ecoli1376 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ydbA_2' 'split orf fragment 2 (2nd module)' ecoli2980 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'b3046' 'paral putative outer membrane protein (2nd module)' ecoli2456 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2505' 'orf' ecoli1481 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ydeK' 'orf' ecoli141 - 7,0,0 Miscellaneous Some information, but not classifiable Not classified (included putative assignments) 'yadN' 'paral putative fimbrial-like protein' ecoli1372 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ydbA_1' 'split orf fragment 1 (2nd module)' ecoli2307 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'b2350' 'orf' ecoli1076 - 0,0,0 Open reading frames Unknown proteins, no known homologs Unknown function 'ycfF' 'orf' Frequency rule on new data: 47/2167 (2.17%) Evaluation on training data (939 items): ecoli1888 4,1,5 Structural elements Cell envelop Surface structures 'fliC' 'flagellar biosynthesis; flagellin filament structural protein(2nd module)' ecoli4206 4,1,3 Structural elements Cell envelop Outer membrane constituents 'fimD' 'outer membrane protein; export and assembly of type 1 fimbriae split fragment 1 (2nd module)' ecoli1049 4,1,5 Structural elements Cell envelop Surface structures 'flgE' 'flagellar biosynthesis hook protein(1st module)' ecoli4208 4,1,5 Structural elements Cell envelop Surface structures 'fimG' 'fimbrial morphology' ecoli3405 - 5,1,4 Extrachromosomal Laterally acquirred elements Transposon-related functions 'rhsB' 'rhsB protein in rhs element' ecoli150 4,1,3 Structural elements Cell envelop Outer membrane constituents 'fhuA' 'outer membrane protein receptor for ferrichrome colicin M and phages T1 T5 and phi80(1st module)' ecoli4209 4,1,5 Structural elements Cell envelop Surface structures 'fimH' 'minor fimbrial subunit D-mannose specific adhesin N-terminal binds carbohydrate c-terminal binds periplasmic chaperone' ecoli904 4,1,3 Structural elements Cell envelop Outer membrane constituents 'ompF' 'outer membrane protein 1a (ia;b;f) (2nd module)' ecoli1959 4,1,3 Structural elements Cell envelop Outer membrane constituents 'b2000' 'phase-variable outer membrane associated fluffing protein(2nd module)' Training Accuracy: 8/9 (88.89%) Training Frequency class 'Cell envelop': 57/939 (6.07%) Training Significance: dev(10.41) ; prob(1.569737E-09) Evaluation on validation data (471 items): ecoli60 - 2,2,3 Macromolecule metabolism Macromolecule synthesis, modification DNA - replication, repair, restraction/modification 'polB' 'DNA polymerase II and and 3 --> 5 exonuclease cross-link DNA repair (new pwy)' ecoli3885 - 2,2,11 Macromolecule metabolism Macromolecule synthesis, modification RNA synthesis, modification, DNA transcription 'rpoB' 'RNA polymerase beta subunit' ecoli1889 4,1,5 Structural elements Cell envelop Surface structures 'fliD' 'flagellar biosynthesis; filament capping protein; enables filament assembly' ecoli3644 - 3,4,3 Metabolism of small molecules Degradation of small molecules Carbon compounds 'bglG' 'positive regulation (transcriptional antiterminator ) of bgl operon' ecoli1015 4,1,5 Structural elements Cell envelop Surface structures 'csgA' 'curlin major subunit coiled surface structures; cryptic' ecoli1055 4,1,5 Structural elements Cell envelop Surface structures 'flgK' 'flagellar biosynthesis hook-filament junction protein 1 C-terminal involved in chaperone (probably FlgN) binding(2nd module)' ecoli1056 4,1,5 Structural elements Cell envelop Surface structures 'flgL' 'flagellar biosynthesis; hook-filament junction protein C-terminal involved in chaperone (probably FlgN) binding' ecoli3097 - 4,2,2 Structural elements Ribosome constituents Ribosomal proteins - synthesis, modificationRiboso 'rpsO' '30S ribosomal subunit protein S15' Validation Accuracy: 4/8 (50.00%) Validation Frequency class 'Cell envelop': 30/471 (6.37%) Validation Significance: dev(5.05) ; prob(8.854625E-04) ------------------