Event
"Predictive Analytics for Data Center Thermal Risk and Reliability Management" 11AM-12PM, October 20th, 2015 - Single Webinar on 2015 October 20
Location: Live Webinar
Date: 20, October 2015 To 20, October 2015
choose this option to view just a single speaker at coolingzone 2015

"predictive analytics for data center thermal risk and reliability management"
dr. cliff federspiel, ph.d., ceo | vigilent
technical presentation
real-time operational data in combination with analytics has the power to reveal previously unseen and unknown weaknesses in a data center. specifically, high temperature events can trigger expensive, emergency truck rolls or sla violations.
this presentation will describe a means of precisely determining the functional cooling redundancy of a facility, show statistical analysis that correlates poor redundancy with high temperature events, and share tools that identify and suggest remediation for rooms – and facilities – at risk. use of cooling analytics is an important aspect of operational excellence, and is a competitive differentiator for datacenter operators.
by combining industry standards for design redundancy and equipment performance with data from a properly monitored data center, it is possible to determine a “score” or metric that identifies low/poor redundancy. a low score can be caused by three things: underperforming or non-performing equipment, operator issues, and it problems. equipment problems are flagged when cooling units aren’t responding to commands or are underperforming according to ashrae 90.1 or corporate standards. operator issues usually involve manual overrides – both known and unknown - that often reduce cooling capacity in a complex environment. it problems are identified as overloaded rooms, or rooms in which it load is present but not accomplishing its purpose, i.e. decommissioned but still-powered it equipment. individually and together, these factors contribute to compromised redundancy, lower reliability, and higher risk.
the presentation will describe the science behind the metrics, stepping through a statistical analysis based on data from hundreds of facilities. the analysis projects that rooms with scores of 0-2 are likely to experience a high temperature event, while rooms with scores below zero (no redundancy) are significantly more likely to experience a high temperature event. such events result in expensive, emergency truck rolls and painful middle-of-the-night calls. a tool showing the scoring and identifying specific issues that contribute to the score, along with methods to remediate those issues, will be demonstrated.

click here to return back to the coolingzone-2015 summit main page