Is there a way to get the "physical health state" of an Alpha (PWS, DS10) and Itanium (RX2620): CPU and memory temperature, fan state, power state etc, by a program, to monitor and signalling of "extreme" situations BEFORE they occur?
DS10 has RMC in snoop mode, which allows me to display a status - when it does react, which is not always the case - but that can only be done via console.
getting system health state
-
Topic author - Valued Contributor
- Posts: 86
- Joined: Fri Jul 12, 2019 1:59 pm
- Reputation: 0
- Location: Netherlands
- Status: Offline
- Contact:
Re: getting system health state
For my DS10L, I can get CPU temperature examining f$getsyi("TEMPERATURE_VECTOR"). The last 2 digits are a hex encoding of the temperature in Celsius. I haven't found any other CPUs that this works. Since the 10L is a 1U box, keeping the thing cool is a problem.
-
- Member
- Posts: 6
- Joined: Thu Feb 18, 2021 9:50 am
- Reputation: 0
- Status: Offline
Re: getting system health state
This script used to work on my DS20E Alpha's, but not anymore on my current Integrity servers. Currently using CockpitMgr which does all the HW monitoring.
I added an extra check to the "env_check" script:
I added an extra check to the "env_check" script:
Code: Select all
---
$ activecpu_cnt = f$getsyi("ACTIVECPU_CNT")
$ availcpu_cnt = f$getsyi("AVAILCPU_CNT")
----
$ gosub cpu_check
----
$cpu_check:
$ if availcpu_cnt .gt. 1
$ then
$ if activecpu_cnt .lt. availcpu_cnt
$ then write sys$output -
"CPU is BAD : avail ''availcpu_cnt' / active ''activecpu_cnt'"
$ else write sys$output "CPUs are Good"
$ endif
$ else
$ write sys$output "CPU is Good"
$ endif
$ return
----
-
Topic author - Valued Contributor
- Posts: 86
- Joined: Fri Jul 12, 2019 1:59 pm
- Reputation: 0
- Location: Netherlands
- Status: Offline
- Contact:
Re: getting system health state
Thanks - great script, gives the information I needed. It would be nice if Cockpit manager was available for community members in some form
Last edited by willemgrooters on Wed Jan 05, 2022 11:41 am, edited 1 time in total.
-
- Active Contributor
- Posts: 46
- Joined: Fri Jun 28, 2019 8:45 am
- Reputation: 0
- Location: Reading, UK
- Status: Offline
- Contact:
Re: getting system health state
What I'm doing using DCL and Kermit scripting to connect to the iLo and do a PS command to get the power supply status and temp then checking the output. Works for Alpha and I64.
Ian Miller
[ personal opinion only. usual disclaimers apply. Do not taunt happy fun ball ].
[ personal opinion only. usual disclaimers apply. Do not taunt happy fun ball ].