simpleml.pipelines.validation_split_mixins module¶
Module for different split methods for cross validation
- No Split – Just use all the data
- Explicit Split – dataset class defines the split
- Percentage – random split support for train, validation, test
- Chronological – time based split support for train, validation, test
- KFold
-
class
simpleml.pipelines.validation_split_mixins.
ChronologicalSplitMixin
(**kwargs)[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
-
class
simpleml.pipelines.validation_split_mixins.
ExplicitSplitMixin
[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
-
class
simpleml.pipelines.validation_split_mixins.
KFoldSplitMixin
[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
TBD on how to implement this. KFold requires K models and unique datasets so may be easier to wrap a parallelized implementation that internally creates K new Pipeline and Model objects
-
class
simpleml.pipelines.validation_split_mixins.
NoSplitMixin
[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
-
class
simpleml.pipelines.validation_split_mixins.
RandomSplitMixin
(train_size, test_size=None, validation_size=0.0, random_state=123, shuffle=True, **kwargs)[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
Class to randomly split dataset into different sets