simpleml.pipelines.validation_split_mixins
¶
Module for different split methods for cross validation
No Split – Just use all the data
Explicit Split – dataset class defines the split
Percentage – random split support for train, validation, test
Chronological – time based split support for train, validation, test
KFold
Module Contents¶
Classes¶
TBD on how to implement this. KFold requires K models and unique datasets |
|
Class to randomly split dataset into different sets |
|
Container class for splits |
|
Explicit instantiation of a defaultdict returning split objects |
|
-
class
simpleml.pipelines.validation_split_mixins.
ChronologicalSplitMixin
(**kwargs)[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
-
class
simpleml.pipelines.validation_split_mixins.
ExplicitSplitMixin
[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
-
class
simpleml.pipelines.validation_split_mixins.
KFoldSplitMixin
[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
TBD on how to implement this. KFold requires K models and unique datasets so may be easier to wrap a parallelized implementation that internally creates K new Pipeline and Model objects
-
class
simpleml.pipelines.validation_split_mixins.
NoSplitMixin
[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
-
class
simpleml.pipelines.validation_split_mixins.
RandomSplitMixin
(train_size, test_size=None, validation_size=0.0, random_state=123, shuffle=True, **kwargs)[source]¶ Bases:
simpleml.pipelines.validation_split_mixins.SplitMixin
Class to randomly split dataset into different sets
Set splitting params: By default validation is 0.0 because it is only used for hyperparameter tuning
-
class
simpleml.pipelines.validation_split_mixins.
Split
[source]¶ Bases:
dict
Container class for splits
Initialize self. See help(type(self)) for accurate signature.
-
__getattr__
(self, attr)[source]¶ Default attribute processor (Used in combination with __getitem__ to enable ** syntax)
-
-
class
simpleml.pipelines.validation_split_mixins.
SplitContainer
(default_factory=Split, **kwargs)[source]¶ Bases:
collections.defaultdict
Explicit instantiation of a defaultdict returning split objects
Initialize self. See help(type(self)) for accurate signature.