The BERBL library

This is an implementation of Drugowitsch's Bayesian Learning Classifier System [1, 2]. BERBL stands for Bayesian Evolutionary Rule-based Learner.

Note that the documentation for this library is still a work in progress. If you have any questions, feel free to open an issue.

Usage examples

You can use the defaults everywhere except for the population size and the number of iterations to run:

import numpy as np  # type: ignore
from berbl import BERBL
from berbl.search.operators.drugowitsch import DefaultToolbox
from sklearn.compose import TransformedTargetRegressor  # type: ignore
from sklearn.metrics import mean_absolute_error  # type: ignore
from sklearn.pipeline import make_pipeline  # type: ignore
from sklearn.preprocessing import StandardScaler  # type: ignore
from sklearn.utils import check_random_state  # type: ignore

# Generate some data for the identity function.
X = np.arange(300).reshape((-1, 1))
y = np.arange(300).reshape((-1, 1))

random_state = check_random_state(2)

# Initialize the toolbox. Let's assume that a population size of 10 suffices
# for this task.
toolbox = DefaultToolbox(n=10, random_state=random_state)

# Instantiate BERBL.
regressor = BERBL(toolbox=toolbox, n_iter=20)

# Next, just some standard scikit-learn stuff: create a pipeline that
# standardizes both inputs and targets.
pipe = make_pipeline(
    StandardScaler(),
    TransformedTargetRegressor(regressor=regressor,
                               transformer=StandardScaler()))

# Fit the pipeline.
estimator = pipe.fit(X, y)

# Make predictions (here, on the training data for simplicity's sake).
y_pred = estimator.predict(X)

# Get some metrics.
print("MAE on training data: ", mean_absolute_error(y, y_pred))

You can also override certain operators of the evolutionary algorithm, similarly to the DEAP API:

import numpy as np  # type: ignore
from berbl import BERBL
from berbl.search.operators.drugowitsch import DefaultToolbox
from sklearn.compose import TransformedTargetRegressor  # type: ignore
from sklearn.metrics import mean_absolute_error  # type: ignore
from sklearn.pipeline import make_pipeline  # type: ignore
from sklearn.preprocessing import StandardScaler  # type: ignore
from sklearn.utils import check_random_state  # type: ignore

# Generate some data for the identity function.
X = np.arange(300).reshape((-1, 1))
y = np.arange(300).reshape((-1, 1))

random_state = check_random_state(2)

# Initialize the toolbox. Let's assume that a population size of 10 suffices
# for this task.
toolbox = DefaultToolbox(n=10, random_state=random_state)


# Let's assume we want to use a custom mutate operator that does nothing
# (i.e. disable mutation, albeit in a slightly roundabout way).
def custom_mutate(genotype, random_state):
    return genotype


# Override mutation operator in toolbox with our custom one.
toolbox.register("mutate", custom_mutate)

# Instantiate BERBL.
regressor = BERBL(toolbox=toolbox, n_iter=20)

# Next, just some standard scikit-learn stuff: create a pipeline that
# standardizes both inputs and targets.
pipe = make_pipeline(
    StandardScaler(),
    TransformedTargetRegressor(regressor=regressor,
                               transformer=StandardScaler()))

# Fit the pipeline.
estimator = pipe.fit(X, y)

# Make predictions (here, on the training data for simplicity's sake).
y_pred = estimator.predict(X)

# Get some metrics.
print("MAE on training data: ", mean_absolute_error(y, y_pred))

Note on nomenclature

In the implementation we try to avoid the overloaded term classifier and instead use rule. A rule consists of a matching function and a (local) submodel. In addition, each rule has an associated mixing weight that comes into play when rules overlap (i.e. when an input is matched by the matching functions of more than one rule).
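The interplay of matching, submodels, and mixing weights can be sketched as follows. This is a toy illustration only, not BERBL's actual API: the `Rule` class, interval matching, and linear submodels here are made up for demonstration.

```python
import numpy as np  # type: ignore


# Toy rule: an interval matching function plus a linear submodel and a
# mixing weight. (Illustrative only; BERBL's internal representation differs.)
class Rule:
    def __init__(self, lower, upper, slope, intercept, mixing_weight):
        self.lower, self.upper = lower, upper
        self.slope, self.intercept = slope, intercept
        self.mixing_weight = mixing_weight

    def matches(self, x):
        return self.lower <= x <= self.upper

    def predict(self, x):
        return self.slope * x + self.intercept


def mixed_prediction(rules, x):
    # Only rules whose matching function matches x participate; their
    # submodel outputs are blended via normalized mixing weights.
    matching = [r for r in rules if r.matches(x)]
    total = sum(r.mixing_weight for r in matching)
    return sum(r.mixing_weight / total * r.predict(x) for r in matching)


rules = [
    Rule(0.0, 6.0, 1.0, 0.0, mixing_weight=1.0),   # y = x on [0, 6]
    Rule(4.0, 10.0, 0.0, 5.0, mixing_weight=3.0),  # y = 5 on [4, 10]
]

print(mixed_prediction(rules, 2.0))  # only the first rule matches -> 2.0
print(mixed_prediction(rules, 5.0))  # both match -> 0.25*5 + 0.75*5 = 5.0
```

For inputs matched by a single rule, the prediction is simply that rule's submodel output; in overlap regions, the mixing weights determine how much each submodel contributes.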


  1. Jan Drugowitsch. 2007. Learning Classifier Systems from first principles.

  2. Jan Drugowitsch. 2008. Design and Analysis of Learning Classifier Systems: A Probabilistic Approach.