Module ensemble (2.25.0)

Ensemble models. This module is styled after scikit-learn's ensemble module: https://scikit-learn.org/stable/modules/ensemble.html

Classes

RandomForestClassifier

RandomForestClassifier(
 n_estimators: int = 100,
 *,
 tree_method: typing.Literal["auto", "exact", "approx", "hist"] = "auto",
 min_tree_child_weight: int = 1,
 colsample_bytree: float = 1.0,
 colsample_bylevel: float = 1.0,
 colsample_bynode: float = 0.8,
 gamma: float = 0.0,
 max_depth: int = 15,
 subsample: float = 0.8,
 reg_alpha: float = 0.0,
 reg_lambda: float = 1.0,
 tol: float = 0.01,
 enable_global_explain: bool = False,
 xgboost_version: typing.Literal["0.9", "1.1"] = "0.9"
)

A random forest classifier.

A random forest is a meta estimator that fits a number of decision tree classifiers on various sub-samples of the dataset and uses averaging to improve the predictive accuracy and control over-fitting.

Parameters
Name Description
n_estimators Optional[int]

Number of parallel trees constructed during each iteration. Default to 100. Minimum value is 2.

tree_method Optional[str]

Specify which tree method to use. Default to "auto". If this parameter is set to default, XGBoost will choose the most conservative option available. Possible values: "exact", "approx", "hist".

min_child_weight Optional[float]

Minimum sum of instance weight(hessian) needed in a child. Default to 1.

colsample_bytree Optional[float]

Subsample ratio of columns when constructing each tree. Default to 1.0. The value should be between 0 and 1.

colsample_bylevel Optional[float]

Subsample ratio of columns for each level. Default to 1.0. The value should be between 0 and 1.

colsample_bynode Optional[float]

Subsample ratio of columns for each split. Default to 0.8. The value should be between 0 and 1.

gamma Optional[float]

(min_split_loss) Minimum loss reduction required to make a further partition on a leaf node of the tree. Default to 0.0.

max_depth Optional[int]

Maximum tree depth for base learners. Default to 15. The value should be greater than 0 and less than 1.

subsample Optional[float]

Subsample ratio of the training instance. Default to 0.8. The value should be greater than 0 and less than 1.

reg_alpha Optional[float]

L1 regularization term on weights (xgb's alpha). Default to 0.0.

reg_lambda Optional[float]

L2 regularization term on weights (xgb's lambda). Default to 1.0.

tol Optional[float]

Minimum relative loss improvement necessary to continue training. Default to 0.01.

enable_global_explain Optional[bool]

Whether to compute global explanations using explainable AI to evaluate global feature importance to the model. Default to False.

xgboost_version Optional[str]

Specifies the Xgboost version for model training. Default to "0.9". Possible values: "0.9", "1.1".ß

RandomForestRegressor

RandomForestRegressor(
 n_estimators: int = 100,
 *,
 tree_method: typing.Literal["auto", "exact", "approx", "hist"] = "auto",
 min_tree_child_weight: int = 1,
 colsample_bytree: float = 1.0,
 colsample_bylevel: float = 1.0,
 colsample_bynode: float = 0.8,
 gamma: float = 0.0,
 max_depth: int = 15,
 subsample: float = 0.8,
 reg_alpha: float = 0.0,
 reg_lambda: float = 1.0,
 tol: float = 0.01,
 enable_global_explain: bool = False,
 xgboost_version: typing.Literal["0.9", "1.1"] = "0.9"
)

A random forest regressor.

A random forest is a meta estimator that fits a number of classifying decision trees on various sub-samples of the dataset and uses averaging to improve the predictive accuracy and control over-fitting.

Parameters
Name Description
n_estimators Optional[int]

Number of parallel trees constructed during each iteration. Default to 100. Minimum value is 2.

tree_method Optional[str]

Specify which tree method to use. Default to "auto". If this parameter is set to default, XGBoost will choose the most conservative option available. Possible values: "exact", "approx", "hist".

min_child_weight Optional[float]

Minimum sum of instance weight(hessian) needed in a child. Default to 1.

colsample_bytree Optional[float]

Subsample ratio of columns when constructing each tree. Default to 1.0. The value should be between 0 and 1.

colsample_bylevel Optional[float]

Subsample ratio of columns for each level. Default to 1.0. The value should be between 0 and 1.

colsample_bynode Optional[float]

Subsample ratio of columns for each split. Default to 0.8. The value should be between 0 and 1.

gamma Optional[float]

(min_split_loss) Minimum loss reduction required to make a further partition on a leaf node of the tree. Default to 0.0.

max_depth Optional[int]

Maximum tree depth for base learners. Default to 15. The value should be greater than 0 and less than 1.

reg_alpha Optional[float]

L1 regularization term on weights (xgb's alpha). Default to 0.0.

reg_lambda Optional[float]

L2 regularization term on weights (xgb's lambda). Default to 1.0.

tol Optional[float]

Minimum relative loss improvement necessary to continue training. Default to 0.01.

enable_global_explain Optional[bool]

Whether to compute global explanations using explainable AI to evaluate global feature importance to the model. Default to False.

xgboost_version Optional[str]

Specifies the Xgboost version for model training. Default to "0.9". Possible values: "0.9", "1.1".

XGBClassifier

XGBClassifier(
 n_estimators: int = 1,
 *,
 booster: typing.Literal["gbtree", "dart"] = "gbtree",
 dart_normalized_type: typing.Literal["tree", "forest"] = "tree",
 tree_method: typing.Literal["auto", "exact", "approx", "hist"] = "auto",
 min_tree_child_weight: int = 1,
 colsample_bytree: float = 1.0,
 colsample_bylevel: float = 1.0,
 colsample_bynode: float = 1.0,
 gamma: float = 0.0,
 max_depth: int = 6,
 subsample: float = 1.0,
 reg_alpha: float = 0.0,
 reg_lambda: float = 1.0,
 learning_rate: float = 0.3,
 max_iterations: int = 20,
 tol: float = 0.01,
 enable_global_explain: bool = False,
 xgboost_version: typing.Literal["0.9", "1.1"] = "0.9"
)

XGBoost classifier model.

Parameters
Name Description
n_estimators Optional[int]

Number of parallel trees constructed during each iteration. Default to 1.

booster Optional[str]

Specify which booster to use: gbtree or dart. Default to "gbtree".

dart_normalized_type Optional[str]

Type of normalization algorithm for DART booster. Possible values: "TREE", "FOREST". Default to "TREE".

tree_method Optional[str]

Specify which tree method to use. Default to "auto". If this parameter is set to default, XGBoost will choose the most conservative option available. Possible values: "exact", "approx", "hist".

min_child_weight Optional[float]

Minimum sum of instance weight(hessian) needed in a child. Default to 1.

colsample_bytree Optional[float]

Subsample ratio of columns when constructing each tree. Default to 1.0.

colsample_bylevel Optional[float]

Subsample ratio of columns for each level. Default to 1.0.

colsample_bynode Optional[float]

Subsample ratio of columns for each split. Default to 1.0.

gamma Optional[float]

(min_split_loss) Minimum loss reduction required to make a further partition on a leaf node of the tree. Default to 0.0.

max_depth Optional[int]

Maximum tree depth for base learners. Default to 6.

subsample Optional[float]

Subsample ratio of the training instance. Default to 1.0.

reg_alpha Optional[float]

L1 regularization term on weights (xgb's alpha). Default to 0.0.

reg_lambda Optional[float]

L2 regularization term on weights (xgb's lambda). Default to 1.0.

learning_rate Optional[float]

Boosting learning rate (xgb's "eta"). Default to 0.3.

max_iterations Optional[int]

Maximum number of rounds for boosting. Default to 20.

tol Optional[float]

Minimum relative loss improvement necessary to continue training. Default to 0.01.

enable_global_explain Optional[bool]

Whether to compute global explanations using explainable AI to evaluate global feature importance to the model. Default to False.

xgboost_version Optional[str]

Specifies the Xgboost version for model training. Default to "0.9". Possible values: "0.9", "1.1".

XGBRegressor

XGBRegressor(
 n_estimators: int = 1,
 *,
 booster: typing.Literal["gbtree", "dart"] = "gbtree",
 dart_normalized_type: typing.Literal["tree", "forest"] = "tree",
 tree_method: typing.Literal["auto", "exact", "approx", "hist"] = "auto",
 min_tree_child_weight: int = 1,
 colsample_bytree: float = 1.0,
 colsample_bylevel: float = 1.0,
 colsample_bynode: float = 1.0,
 gamma: float = 0.0,
 max_depth: int = 6,
 subsample: float = 1.0,
 reg_alpha: float = 0.0,
 reg_lambda: float = 1.0,
 learning_rate: float = 0.3,
 max_iterations: int = 20,
 tol: float = 0.01,
 enable_global_explain: bool = False,
 xgboost_version: typing.Literal["0.9", "1.1"] = "0.9"
)

XGBoost regression model.

Parameters
Name Description
n_estimators Optional[int]

Number of parallel trees constructed during each iteration. Default to 1.

booster Optional[str]

Specify which booster to use: gbtree or dart. Default to "gbtree".

dart_normalized_type Optional[str]

Type of normalization algorithm for DART booster. Possible values: "TREE", "FOREST". Default to "TREE".

tree_method Optional[str]

Specify which tree method to use. Default to "auto". If this parameter is set to default, XGBoost will choose the most conservative option available. Possible values: "exact", "approx", "hist".

min_child_weight Optional[float]

Minimum sum of instance weight(hessian) needed in a child. Default to 1.

colsample_bytree Optional[float]

Subsample ratio of columns when constructing each tree. Default to 1.0.

colsample_bylevel Optional[float]

Subsample ratio of columns for each level. Default to 1.0.

colsample_bynode Optional[float]

Subsample ratio of columns for each split. Default to 1.0.

gamma Optional[float]

(min_split_loss) Minimum loss reduction required to make a further partition on a leaf node of the tree. Default to 0.0.

max_depth Optional[int]

Maximum tree depth for base learners. Default to 6.

subsample Optional[float]

Subsample ratio of the training instance. Default to 1.0.

reg_alpha Optional[float]

L1 regularization term on weights (xgb's alpha). Default to 0.0.

reg_lambda Optional[float]

L2 regularization term on weights (xgb's lambda). Default to 1.0.

learning_rate Optional[float]

Boosting learning rate (xgb's "eta"). Default to 0.3.

max_iterations Optional[int]

Maximum number of rounds for boosting. Default to 20.

tol Optional[float]

Minimum relative loss improvement necessary to continue training. Default to 0.01.

enable_global_explain Optional[bool]

Whether to compute global explanations using explainable AI to evaluate global feature importance to the model. Default to False.

xgboost_version Optional[str]

Specifies the Xgboost version for model training. Default to "0.9". Possible values: "0.9", "1.1".

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025年10月14日 UTC.