simpleml.pipelines.validation_split_mixins¶
Module for different split methods for cross validation
No Split – Just use all the data
Explicit Split – dataset class defines the split
Percentage – random split support for train, validation, test
Chronological – time based split support for train, validation, test
KFold
Module Contents¶
Classes¶
TBD on how to implement this. KFold requires K models and unique datasets |
|
Class to randomly split dataset into different sets |
|
Container class for splits |
|
Explicit instantiation of a defaultdict returning split objects |
|
-
class
simpleml.pipelines.validation_split_mixins.ChronologicalSplitMixin(**kwargs)[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
-
class
simpleml.pipelines.validation_split_mixins.ExplicitSplitMixin[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
-
class
simpleml.pipelines.validation_split_mixins.KFoldSplitMixin[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixinTBD on how to implement this. KFold requires K models and unique datasets so may be easier to wrap a parallelized implementation that internally creates K new Pipeline and Model objects
-
class
simpleml.pipelines.validation_split_mixins.NoSplitMixin[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
-
class
simpleml.pipelines.validation_split_mixins.RandomSplitMixin(train_size, test_size=None, validation_size=0.0, random_state=123, shuffle=True, **kwargs)[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixinClass to randomly split dataset into different sets
Set splitting params: By default validation is 0.0 because it is only used for hyperparameter tuning
-
class
simpleml.pipelines.validation_split_mixins.Split[source]¶ Bases:
dictContainer class for splits
Initialize self. See help(type(self)) for accurate signature.
-
__getattr__(self, attr)[source]¶ Default attribute processor (Used in combination with __getitem__ to enable ** syntax)
-
-
class
simpleml.pipelines.validation_split_mixins.SplitContainer(default_factory=Split, **kwargs)[source]¶ Bases:
collections.defaultdictExplicit instantiation of a defaultdict returning split objects
Initialize self. See help(type(self)) for accurate signature.