simpleml.datasets.pandas.pipeline

Pipeline derived datasets

Module Contents

Classes

PandasPipelineDataset

Pandas dataset class that generates the dataframe as the output of the

Attributes

__author__

simpleml.datasets.pandas.pipeline.__author__ = Elisha Yadgaran[source]
class simpleml.datasets.pandas.pipeline.PandasPipelineDataset(squeeze_return=False, **kwargs)[source]

Bases: simpleml.datasets.pandas.base.BasePandasDataset

Pandas dataset class that generates the dataframe as the output of the linked pipeline

Parameters

squeeze_return (bool) – boolean flag whether to run dataframe.squeeze() on return from self.get() calls. Particularly necessary to align input types with different libraries (e.g. sklearn y with single label)

build_dataframe(self)[source]

Transform raw dataset via dataset pipeline for production ready dataset

Return type

None