Health Management

Health Controller

First of all the installation flag must be properly set:

installHealthController="true"

Then, the following variables can be configured in order to configure the Health Controller:

Variable Description Default Value Required
hcEnable Specifies if health monitoring is enabled. "true" No
hcControlCycleLength Specifies the time between consecutive health checks to determine if a health policy condition is breached. The value is specified in minutes and ranges from 1 to 60 minutes. Longer control cycles reduce the health monitoring load. 5 No
hcMaxConsecutiveRestarts Specifies the number of attempts to revive an application server after a restart decision is made. 3 No
hcRestartTimeout Specifies how long to wait for a server to stop before explicitly checking its state and attempting another start. 5 No
hcMinRestartInterval Controls the minimum amount of time that must pass between consecutive restarts of an application server instance. 0 No
hcMinRestartIntervalUnits Indicates the units to use with the hcMinRestartInterval variable. Valid values: 2 (minutes), 3 (hours), or 4 (days). 2 No
hcProperties Custom properties. They must be set in the form as an example: hcProperties=[[['name', 'com.ibm.ws.xd.hmm.controller.ControlConfig.approvalTimeOutMinutes'],['value', '40']]] hcProperties.append([['name', 'com.ibm.ws.xd.hmm.controller.ControlConfig'],['value', '41']]) hcProperties.append([['name', 'com.ibm.ws.xd.hmm.controller'],['value', '42']]) hcProperties.append([['name', 'com.ibm.ws.xd.hmm'],['value', '43']]) [] No

Health Policies

First of all the installation flag must be properly set:

installHealthPolicies="true"

Then, the following variables can be configured in order to create Health Policies, where "_n" is an incrementing number starting by 1 ("_1"):

Variable Description Default Value Required
hpName_n Specifies a name for the health policy that is unique in the cell. Yes
hpDescription_n Specifies a description for the health policy. "" No
hpReactionMode_n Specifies if runtime tasks are created before taking actions on a health policy, or if actions are automatic. Valid values: "SUPERVISE" and "AUTOMATIC". "SUPERVISE" No
hpCondType_n Specifies the condition type. The value can be AgeCondition, WorkloadCondition, MemoryLeakAlgorithm, MemoryCondition, ResponseCondition, StuckRequestCondition, StormDrainCondition, or GCPercentageCondition. Each value has different parameters that you must specify. Yes
hpCondParams_n AgeCondition: maxAge and ageUnits -0 (milliseconds), 1 (seconds), 2 (minutes), 3 (hours), or 4 (days)- WorkloadCondition: totalRequests MemoryLeakAlgorithm: level (FAST, NORMAL, or CONSERVATIVE) MemoryCondition: timeOverThreshold, timeUnits -0 (milliseconds), 1 (seconds), 2 (minutes), 3 (hours), or 4 (days)- and memoryUsed ResponseCondition: responseTime and responseTimeUnits -0 (milliseconds), 1 (seconds), 2 (minutes), 3 (hours), or 4 (days)- StuckRequestCondition: timeoutPercent StormDrainCondition: level (NORMAL or CONSERVATIVE) GCPercentageCondition: garbageCollectionPercent, samplingPeriod, samplingUnits -0 (milliseconds), 1 (seconds), 2 (minutes), 3 (hours), or 4 (days)- Yes
hpMembers_n Specifies the health policy members as a list of [memberName, memberType]. Values for memberName can be: server:!:node (application server), clusterName (static and dynamic cluster), or cellName (cell)] Values for memberType can be: CELL, CLUSTER, DYNAMIC_CLUSTER, SERVER, or ODR] Example: [["Server01_clone1:!:rafaelruiz-MVNode01","SERVER"],["ClusterClient","CLUSTER"],["rafaelruiz-MVCell01","CELL"],["Serv1:!:rafaelruiz-MVNode01","ODR"]]
hpActions_n Specifies the health policy actions as a list of [actionType, stepNumber]. Values for memberName can be: server:!:node (application server), clusterName (static and dynamic cluster), or cellName (cell)] Example: [["RESTART",1],["THREADDUMP",2],["HEAPDUMP",3],["SENDSNMPTRAP",4]]