piper.factory.make_null_dates¶
-
piper.factory.make_null_dates(df: pandas.core.frame.DataFrame, cols: Union[str, List[str]] = ['effective', 'expired'], null_values_percent: float = 0.2, seed: int = 42) → pandas.core.frame.DataFrame[source]¶ Generate ‘random’ null, pd.NaT values from supplied dataframe
%%piper sample_data() >> make_null_dates(cols=['dates', 'order_dates']) >> head(6) dates order_dates countries regions ids values_1 values_2 0 2020-01-01 00:00:00 2020-01-07 00:00:00 Italy East A 311 26 1 2020-01-02 00:00:00 2020-01-08 00:00:00 Portugal South D 150 375 2 NaT 2020-01-09 00:00:00 Spain East A 396 88 3 2020-01-04 00:00:00 2020-01-10 00:00:00 Italy East B 319 233 4 2020-01-05 00:00:00 2020-01-11 00:00:00 Italy East D 261 187 5 2020-01-06 00:00:00 NaT Switzerland North D 155 253
- Parameters
df – pandas DataFrame
cols – column(s) within dataframe to generate random null values for
null_value_percent – % number of rows to generate null values
- Returns
- Return type
A pandas DataFrame