Amazon web services 基于历史数据,CloudWatch警报返回正常状态

Amazon web services 基于历史数据,CloudWatch警报返回正常状态,amazon-web-services,amazon-cloudwatch,Amazon Web Services,Amazon Cloudwatch,我有一个CloudWatch警报,它在60分钟的时间内检查所有数据点,如果其中5个数据点出现问题,则CloudWatch警报将进入警报状态。 这可以正常工作,CloudWatch的输出显示哪些数据点出现了问题: Message = {"AlarmName":"Test Alarm","AlarmDescription":"Test Alarm","AWSAccountId":"xxxxx

我有一个CloudWatch警报,它在60分钟的时间内检查所有数据点,如果其中5个数据点出现问题,则CloudWatch警报将进入警报状态。 这可以正常工作,CloudWatch的输出显示哪些数据点出现了问题:

Message = {"AlarmName":"Test Alarm","AlarmDescription":"Test Alarm","AWSAccountId":"xxxxxxxxxx","NewStateValue":"ALARM","NewStateReason":"Threshold Crossed: 5 out of the last 5 datapoints were greater than or equal to the threshold (1.0). The most recent datapoints which crossed the threshold: [1.0 (02/08/20 10:11:00), 4.0 (02/08/20 09:56:00), 2.0 (02/08/20 09:41:00), 2.0 (02/08/20 09:26:00), 1.0 (02/08/20 09:11:00)] (minimum 5 datapoints for OK -> ALARM transition).","StateChangeTime":"2020-08-02T10:26:48.505+0000","Region":"Asia Pacific (Sydney)","AlarmArn":"xxxxxxx","OldStateValue":"OK","Trigger":{"MetricName":"Problem","Namespace":"xxxxx","StatisticType":"Statistic","Statistic":"MAXIMUM","Unit":null,"Dimensions":[{"value":"default","name":"environment"}],"Period":900,"EvaluationPeriods":5,"ComparisonOperator":"GreaterThanOrEqualToThreshold","Threshold":1.0,"TreatMissingData":"- TreatMissingData: missing","EvaluateLowSampleCountPercentile":""}}
我所面临的问题是何时转换回OK状态。从转换回OK时的CloudWatch输出来看,它显示了以下内容:

{"AlarmName":"Test Alarm","AlarmDescription":"Test Alarm","AWSAccountId":"xxxxxxxxx","NewStateValue":"OK","NewStateReason":"Threshold Crossed: 1 out of the last 5 datapoints [0.0 (02/08/20 09:27:00)] was not greater than or equal to the threshold (1.0) (minimum 1 datapoint for ALARM -> OK transition).","StateChangeTime":"2020-08-02T10:27:48.489+0000","Region":"Asia Pacific (Sydney)","AlarmArn":"xxxxxxxx","OldStateValue":"ALARM","Trigger":{"MetricName":"Problem","Namespace":"xxxxxxx","StatisticType":"Statistic","Statistic":"MAXIMUM","Unit":null,"Dimensions":[{"value":"default","name":"environment"}],"Period":900,"EvaluationPeriods":5,"ComparisonOperator":"GreaterThanOrEqualToThreshold","Threshold":1.0,"TreatMissingData":"- TreatMissingData: missing","EvaluateLowSampleCountPercentile":""}}
我在这里最突出的一点是:

Threshold Crossed: 1 out of the last 5 datapoints [0.0 (02/08/20 09:27:00)] was not greater than or equal to the threshold (1.0)
我理解这意味着,如果最后5个数据点中的任何一个未超过阈值,则将报警移回正常状态。这不是我想要的闹钟的表现


我想做的是,仅当新数据点未超过问题阈值时,才将其移回正常状态。

如果我没有弄错,您的警报将仅在数据点上查看60分钟。因此,一旦此窗口完成,新窗口将开始,如果您不将旧数据点带入新窗口,警报将切换到OK,因为新时间窗口的所有新数据点都不会超过阈值

我假设CloudWatch使用滚动计时器,并检查该滚动小时内的标准是否匹配,这很好。根据CloudWatch如何表达其消息:在最后5个数据点中有1个不大于或等于阈值,我认为您是错误的。