paddlets.ensemble.stacking_ensemble

class StackingEnsembleBase(estimators: List[Tuple[object, dict]], final_learner: Optional[Callable] = None, verbose: bool = False)[source]

Bases: EnsembleBase

The StackingEnsembleBase Class.

Parameters

estimators (List[Tuple[object, dict]]) – A list of tuple (class,params) consisting of several paddlets models
final_learner (Callable) – The final learner on stack level 2, should be a sklearn-like regressor, set to GradientBoostingRegressor(max_depth=5) by default.
verbose (bool) – Turn on Verbose mode,set to true by default.

fit(train_tsdataset: TSDataset, valid_tsdataset: Optional[TSDataset] = None) → None[source]

Parameters

train_tsdataset (TSDataset) – Train dataset.
valid_tsdataset (TSDataset, optional) – Valid dataset.

predict(tsdataset: TSDataset) → TSDataset[source]

Parameters: tsdataset (TSDataset) – Dataset to predict.

class StackingEnsembleForecaster(in_chunk_len: int, out_chunk_len: int, skip_chunk_len: int, estimators: List[Tuple[object, dict]], final_learner: Optional[Callable] = None, use_backtest: bool = True, resampling_strategy: str = 'cv', split_ratio: Union[str, float] = 0.1, k_fold: Union[str, int] = 3, verbose: bool = False)[source]

Bases: StackingEnsembleBase, BaseModel

The StackingEnsembleForecaster Class.

Parameters

in_chunk_len (int) – The size of the loopback window, i.e., the number of time steps feed to the model.
out_chunk_len (int) – The size of the forecasting horizon, i.e., the number of time steps output by the model.
skip_chunk_len (int) – Optional, the number of time steps between in_chunk and out_chunk for a single sample. The skip chunk is neither used as a feature (i.e. X) nor a label (i.e. Y) for a single sample. By default, it will NOT skip any time steps.
estimators (List[Tuple[object, dict]]) – A list of tuple (class,params) consisting of several paddlets Forecasting models.
final_learner (Callable) – The final learner on stack level 2, should be a sklearn-like regressor, set to GradientBoostingRegressor(max_depth=5) by default.
resampling_strategy (str) – A string of resampling strategies.Supported resampling strategy are “cv”, “holdout”.
split_ratio (Union[str, float]) – The proportion of the dataset included in the validation split for holdout.The split_ratio should be in the range of (0, 1).
k_fold (Union[str, int]) – Number of folds for cv.The k_fold should be in the range of (0, 10], defaults to 3.
use_backtest (bool) – If use backtest on predictions.
verbose (bool) – Turn on Verbose mode,set to true by default.

fit(train_tsdataset: TSDataset, valid_tsdataset: Optional[TSDataset] = None) → None[source]

Parameters

train_tsdataset (TSDataset) – Train dataset.
valid_tsdataset (TSDataset, optional) – Valid dataset.

predict(tsdataset: TSDataset) → TSDataset[source]

Predict

Parameters: tsdataset (TSDataset) – Dataset to predict.

save(path: str, ensemble_file_name: str = 'paddlets-stacking-forecaster-partial.pkl') → None[source]

Save the ensemble model to a directory.

Parameters

path (str) – Output directory path.
ensemble_file_name (str) – Name of ensemble object. This file contains meta information of ensemble model.

static load(path: str, ensemble_file_name: str = 'paddlets-stacking-forecaster-partial.pkl') → StackingEnsembleForecaster[source]

Load the ensemble model from a directory.

Parameters

path (str) – Input directory path.
ensemble_file_name (str) – Name of ensemble object. This file contains meta information of ensemble.

Returns

The loaded ensemble model.