Table 2.
Argument | Description |
---|---|
A positive number specifying the time of prediction; is the observation period to consider when processing time-dependent data. The unit of could be minutes, hours, days, etc., and must be the same as the unit of . | |
A positive number specifying the temporal granularity (eg, hourly vs daily) at which to resample the time-dependent data. The unit of must be the same as the unit of . | |
A value between 0 and 1 specifying the threshold for the pre-filter step. | |
A value between 0 and 1 specifying the threshold for the post-filter step. | |
A positive number specifying the threshold, in terms of the average number of measurements per time window, at which we deem a variable “frequent” (for which summary statistics will be calculated). | |
A set of statistics functions (eg, min, max, mean). Each function takes as input 1 or more recordings within a time bin and outputs a single summary statistic. These functions are only applicable to “frequent” variables, as determined by . | |
discretize | A Boolean flag (default value: True) specifying whether features with numerical values are kept as raw values or discretized into binary features. |
discretization_encoding | A string specifying how numerical values are encoded into binary features after discretization. Possible values are: “one-hot” (default) and “ordinal.” This argument is ignored and should not be used when discretize=False. |
Note: FIDDLE: Flexible Data-Driven Pipeline.