simpleml.pipelines.projected_splits
Module for dataset projection into pipelines. Defines the transfer objects returned from pipelines.
Module Contents
Classes
IdentityProjectedDatasetSplit | Straight passthrough variety of projection (i.e. projected split == dataset split)
IndexBasedProjectedDatasetSplit | Index based subset. Compatible with dataset splits that support indexing
ProjectedDatasetSplit | Transfer object to pass dataset splits through pipelines
Attributes
- class simpleml.pipelines.projected_splits.IdentityProjectedDatasetSplit(dataset, split)[source]
Bases: ProjectedDatasetSplit
Straight passthrough variety of projection (i.e. projected split == dataset split)
- Parameters
dataset (simpleml.datasets.base_dataset.Dataset) –
split (Optional[str]) –
- apply_projection(self, dataset_split)[source]
Identity projection: returns the dataset split unchanged.
- Parameters
dataset_split (simpleml.datasets.dataset_splits.Split) –
- Return type
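As a rough illustration of the identity projection, here is a minimal sketch that does not use simpleml itself. `identity_projection` and the `SimpleNamespace` split with `X` and `y` attributes are hypothetical stand-ins, and returning the very same object (rather than a copy) is an assumption.

```python
from types import SimpleNamespace


def identity_projection(dataset_split):
    """Hypothetical stand-in: hand the split back without modification."""
    return dataset_split


# Toy split object; the real Split class is not used here
split = SimpleNamespace(X=[[1, 2], [3, 4]], y=[0, 1])
projected = identity_projection(split)
print(projected is split)  # True
```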
- class simpleml.pipelines.projected_splits.IndexBasedProjectedDatasetSplit(indices, **kwargs)[source]
Bases: ProjectedDatasetSplit
Index based subset. Compatible with dataset splits that support indexing
- apply_projection(self, dataset_split)[source]
Returns the subset of the dataset split selected by the indices.
- Parameters
dataset_split (simpleml.datasets.dataset_splits.Split) –
- Return type
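The idea of an index based subset can be sketched with plain Python lists. Everything below is an assumption made for illustration: `project_by_indices` is a hypothetical helper, the split is a `SimpleNamespace` with `X` and `y` attributes, and indexing is positional. The actual class may index differently, for example by label on a DataFrame.

```python
from types import SimpleNamespace


def project_by_indices(dataset_split, indices):
    """Hypothetical helper: build a new split holding only the selected rows."""
    return SimpleNamespace(
        X=[dataset_split.X[i] for i in indices],
        y=[dataset_split.y[i] for i in indices],
    )


# Toy split object; the real Split class is not used here
split = SimpleNamespace(X=[[1, 2], [3, 4], [5, 6]], y=[0, 1, 0])
subset = project_by_indices(split, indices=[0, 2])
print(subset.X)  # [[1, 2], [5, 6]]
print(subset.y)  # [0, 0]
```

The original split is left untouched; the projection returns a new object, in line with the base class description.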
- class simpleml.pipelines.projected_splits.ProjectedDatasetSplit(dataset, split)[source]
Transfer object to pass dataset splits through pipelines.
Contains a reference to the dataset and the internal logic to project the split. The dataset is referenced on each call to avoid mutability issues.
Wraps the normal Split object but delegates behavior to it, so the two can be used interchangeably.
- Parameters
dataset (simpleml.datasets.base_dataset.Dataset) –
split (Optional[str]) –
- abstract apply_projection(self, dataset_split)[source]
Main method to apply projection logic on the dataset split. Returns a new Split with the data subset.
- Parameters
dataset_split (simpleml.datasets.dataset_splits.Split) –
- Return type
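The pattern described for the base class (hold a dataset reference, re-read the split on every access, delegate to the projected result) might look roughly like the sketch below. It is not the simpleml implementation: `ToyDataset`, its `get_split` accessor, `ToyProjectedSplit`, the `projected_split` property and `FirstRowProjection` are all hypothetical names introduced for illustration.

```python
from abc import ABC, abstractmethod
from types import SimpleNamespace


class ToyDataset:
    """Hypothetical stand-in for a dataset; not the simpleml Dataset class."""

    def __init__(self, splits):
        self._splits = splits

    def get_split(self, split):  # hypothetical accessor name
        return self._splits[split]


class ToyProjectedSplit(ABC):
    """Holds a dataset reference and re-reads the split on every access."""

    def __init__(self, dataset, split=None):
        self.dataset = dataset
        self.split = split

    @abstractmethod
    def apply_projection(self, dataset_split):
        """Return a new split object containing the projected data."""

    @property
    def projected_split(self):
        # Fetch from the dataset each time instead of caching a copy
        return self.apply_projection(self.dataset.get_split(self.split))

    def __getattr__(self, name):
        # Reached only when normal lookup fails. Dunder lookups are refused
        # so that copy/pickle machinery does not trigger a projection.
        if name.startswith("__"):
            raise AttributeError(name)
        return getattr(self.projected_split, name)


class FirstRowProjection(ToyProjectedSplit):
    """Hypothetical subclass that keeps only the first row."""

    def apply_projection(self, dataset_split):
        return SimpleNamespace(X=dataset_split.X[:1], y=dataset_split.y[:1])


dataset = ToyDataset({"TRAIN": SimpleNamespace(X=[[1, 2], [3, 4]], y=[0, 1])})
projected = FirstRowProjection(dataset, "TRAIN")
print(projected.X)  # [[1, 2]]
```

Because the split is looked up on each attribute access, later changes to the dataset are visible through the projected object, at the cost of recomputing the projection each time.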