The anomaly detection pipeline via ADBox can be easily customize by creating a use case. In this context, a use case is a sequence of actions to be performed and the characteristics of the desired outcome. Examples of “informal” use cases are:
These informal use cases can be translated into real action by using a provided YAML template, as explained in the following section.
The YAML file for detector training and prediction includes parameters to configure the training and prediction processes. Below is a guide explaining the purpose of each parameter, its default value, and format.
Represents the data source index where the training data should be fetched from.
default
which will be processed as {current_year}-*-*
(fetches all data for the current year).YYYY-MM-DD
or with wildcards (*
).Specifies if the given input features include categorical features.
False
(default as features are numerical).True
or False
).List of columns used as features to train the detector.
data.cpu_usage_%
, data.memory_usage_%
.Specifies if the column values should be aggregated.
True
.True
or False
).aggregation
is True
.
Zero
."Linear"
, "Previous"
, "Subsequent"
, "Zero"
or "Fixed"
.fill_na_method
is Fixed
.
0
.fill_na_method
is Fixed
).1min
."1min"
, "5min"
, "1s"
, "1hour"
, etc.).data.cpu_usage_%
: ["average", "max"]
.data.memory_usage_%
: ["average", "max"]
."average"
, "max"
, "min"
, "count"
, or "sum"
.10
.30
.default
which is converted asdetector_<current timestamp>
.default
. ("historical"
run mode)."historical"
, "batch"
, "realtime"
).
default
(current day).YYYY-MM-DD
or with wildcards (*
).default
(most recently trained detector).default
(start of the current date).YYYY-MM-DDTHH:MM:SSZ
.default
(current timestamp of the current date).YYYY-MM-DDTHH:MM:SSZ
.batch
run mode.10
.training:
index_date: "default"
categorical_features: false
columns:
- "data.cpu_usage_%"
- "data.memory_usage_%"
aggregation: true
aggregation_config:
fill_na_method: "Zero"
granularity: "1min"
features:
data.cpu_usage_%:
- "average"
- "max"
data.memory_usage_%:
- "average"
- "max"
train_config:
window_size: 10
epochs: 30
display_name: "default"
prediction:
run_mode: "default"
index_date: "default"
detector_id: "default"
start_time: "default"
end_time: "default"
batch_size: 10