piper.factory.bad_quality_orders

piper.factory.bad_quality_orders()[source]

Generate sample data cleaning orders dataset

Contains the following problems:
  • column names too long

  • invalid numeric data

  • invalid/uneven character data

  • invalid date data

   Gropuing cde_      Order_NBR  This column name is too long      Second column          Quantity  Price
0  A100                23899001  First                        row  Scally, Aidan                14  1,23
1  A101                23899002  SECOnd   Row                      McAllister, Eoin            103  4,32
2  A101                23899003  Thrid        Row                  Tarpey, Mike                  1  3  4,32
3  A102                23899004  fOuRth        ROW                 Denton,        Alan          13  49
Returns

Return type

pandas DataFrame