Visualizing the probabilistic predictions of a VotingClassifier
Plot the class probabilities predicted on a toy dataset by three
different classifiers and averaged by the VotingClassifier.
First, three linear classifiers are initialized. Two are spline models with
interaction terms, one using constant extrapolation and the other using periodic
extrapolation. The third classifier uses a Nystroem kernel approximation
with the default "rbf" kernel.
In the first part of this example, these three classifiers are used to
demonstrate soft voting with a weighted average using VotingClassifier.
We set weights=[2, 1, 3], meaning the constant extrapolation spline
model's predictions are weighted twice as much as the periodic spline model's,
and the Nystroem model's predictions are weighted three times as much as the
periodic spline model's.
The second part demonstrates how soft predictions can be converted into hard predictions.
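With voting="soft", the ensemble's class probabilities are simply the weighted
average of the members' predicted probabilities. A minimal sketch of that
arithmetic, using made-up probabilities for a single sample:

import numpy as np

# Hypothetical positive-class probabilities from the three models
# (illustration only; not part of the original example).
p_constant, p_periodic, p_nystroem = 0.9, 0.6, 0.8
p_soft = np.average([p_constant, p_periodic, p_nystroem], weights=[2, 1, 3])
print(p_soft)  # (2 * 0.9 + 1 * 0.6 + 3 * 0.8) / 6 = 0.8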
# Authors: The scikit-learn developers
# SPDX-License-Identifier: BSD-3-Clause
We first generate a noisy XOR dataset, which is a binary classification task.
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
from matplotlib.colors import ListedColormap

n_samples = 500
rng = np.random.default_rng(0)
feature_names = ["Feature #0", "Feature #1"]
common_scatter_plot_params = dict(
    cmap=ListedColormap(["tab:red", "tab:blue"]),
    edgecolor="white",
    linewidth=1,
)

# Uniformly distributed points in [-1, 1] x [-1, 1].
xor = pd.DataFrame(
    np.random.RandomState(0).uniform(low=-1, high=1, size=(n_samples, 2)),
    columns=feature_names,
)
# The class is the XOR of the (noisy) signs of the two features.
noise = rng.normal(loc=0, scale=0.1, size=(n_samples, 2))
target_xor = np.logical_xor(
    xor["Feature #0"] + noise[:, 0] > 0, xor["Feature #1"] + noise[:, 1] > 0
)

X = xor[feature_names]
y = target_xor.astype(np.int32)

fig, ax = plt.subplots()
ax.scatter(X["Feature #0"], X["Feature #1"], c=y, **common_scatter_plot_params)
ax.set_title("The XOR dataset")
plt.show()
Because the XOR dataset is not linearly separable, tree-based models would often be preferred. However, appropriate feature engineering combined with a linear model can yield effective results, with the added benefit of producing better-calibrated probabilities for samples located in the noisy transition regions, as the sketch below suggests.
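A quick way to see this (a sketch, not part of the original example): logistic
regression on the raw features scores near chance on XOR, while adding the
single interaction feature Feature #0 * Feature #1 makes the problem close to
linearly separable.

from sklearn.linear_model import LogisticRegression

raw = LogisticRegression().fit(X, y)

# Augment the features with the pairwise product, whose sign tracks the XOR.
with_interaction = X.copy()
with_interaction["Feature #0 * #1"] = X["Feature #0"] * X["Feature #1"]
engineered = LogisticRegression().fit(with_interaction, y)

print(f"raw features accuracy:     {raw.score(X, y):.2f}")  # roughly 0.5
print(f"with interaction feature:  {engineered.score(with_interaction, y):.2f}")  # close to 1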
We define and fit the models on the whole dataset.
from sklearn.ensemble import VotingClassifier
from sklearn.kernel_approximation import Nystroem
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures, SplineTransformer, StandardScaler

# Spline features with constant extrapolation, plus interaction terms.
clf1 = make_pipeline(
    SplineTransformer(degree=2, n_knots=2),
    PolynomialFeatures(interaction_only=True),
    LogisticRegression(C=10),
)
# Spline features with periodic extrapolation, plus interaction terms.
clf2 = make_pipeline(
    SplineTransformer(
        degree=2,
        n_knots=4,
        extrapolation="periodic",
        include_bias=True,
    ),
    PolynomialFeatures(interaction_only=True),
    LogisticRegression(C=10),
)
# Nystroem approximation of the default "rbf" kernel.
clf3 = make_pipeline(
    StandardScaler(),
    Nystroem(gamma=2, random_state=0),
    LogisticRegression(C=10),
)

weights = [2, 1, 3]
eclf = VotingClassifier(
    estimators=[
        ("constant splines model", clf1),
        ("periodic splines model", clf2),
        ("nystroem model", clf3),
    ],
    voting="soft",
    weights=weights,
)

clf1.fit(X, y)
clf2.fit(X, y)
clf3.fit(X, y)
eclf.fit(X, y)
VotingClassifier(estimators=[('constant splines model',
Pipeline(steps=[('splinetransformer',
SplineTransformer(degree=2,
n_knots=2)),
('polynomialfeatures',
PolynomialFeatures(interaction_only=True)),
('logisticregression',
LogisticRegression(C=10))])),
('periodic splines model',
Pipeline(steps=[('splinetransformer',
SplineTransformer(degree=2,
extrapolation='periodic',
n_knots=4)),
('polynomialfeatures',
PolynomialFeatures(interaction_only=True)),
('logisticregression',
LogisticRegression(C=10))])),
('nystroem model',
Pipeline(steps=[('standardscaler',
StandardScaler()),
('nystroem',
Nystroem(gamma=2,
random_state=0)),
('logisticregression',
LogisticRegression(C=10))]))],
                 voting='soft', weights=[2, 1, 3])
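As a sanity check (not part of the original example), the soft-voting
probabilities should equal the weighted average of the individual pipelines'
predict_proba outputs; this holds here because every step is deterministic
given the fixed random_state:

import numpy as np

# Weighted average of the three members' probabilities, weights=[2, 1, 3].
manual_average = np.average(
    [clf.predict_proba(X) for clf in (clf1, clf2, clf3)],
    axis=0,
    weights=weights,
)
assert np.allclose(manual_average, eclf.predict_proba(X))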