CPU Subsystem

CPU Subsystem Analysis

The CPU is the highest level of measurement in z/VM.

CPU response time is a function of processor speed and number of CPUs. If there are issues with the CPU utilization, it will be one of the first things to be seen.
It is key to know the baseline numbers of your environment to be able to see when abnormalities appear.
Utilization numbers alone can be misleading, especially in a Linux environment. zVPS measures impacts in CPU seconds from the hardware.
It measures the impacts of LPAR(s), z/VM virtual machines, Linux processes, zVSE jobs/partitions, etc. This gives a very accurate measurement of the system.
In a z/VM MP environment, adding processors can reduce queuing time and increase availability but it also costs money.
For example, Linux should not have multiple processors when the workload doesn't need it. An extra virtual processor may provide a few milliseconds improvement in performance,
but can result in spin locks, which consume the processor(s) unnecessarily. Also, if the share of the server doesn't change, extra unneeded vCPUs will make all the vCPUs run slower.
Minimizing polling is another action that can reduce CPU requirements. Linux hertz time is just one example of polling within a Linux server. This can and should be corrected using the timer patch.
Note that WAS, Domino, SAP and some other applications have since implemented polling.
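
The point above about unneeded vCPUs slowing every vCPU down can be illustrated with simple arithmetic. This is a minimal sketch, assuming the server's entitlement is divided equally among its vCPUs. The function name and the 80% figure are hypothetical and are not taken from zVPS.

```python
def per_vcpu_entitlement(server_entitlement_pct: float, vcpus: int) -> float:
    """Return the slice of a fixed server entitlement that each vCPU receives.

    Assumption: the entitlement is split equally among the vCPUs.
    """
    if vcpus < 1:
        raise ValueError("a server needs at least one vCPU")
    return server_entitlement_pct / vcpus


# Hypothetical server entitled to 80% of one processor; the share never changes.
for n in (1, 2, 4):
    print(f"{n} vCPU(s): {per_vcpu_entitlement(80.0, n):.1f}% each")
```

With the share held constant, going from 1 to 4 vCPUs cuts what each vCPU can use to a quarter, which is why a workload that cannot drive all of its vCPUs in parallel ends up running slower.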
Watch for master processor bottlenecks:

Pay attention to which processor is the master processor.
The master runs processes that need synchronization and consistency or code that was not written for multi-processing. These include RACF, Spooling, IUCV, Paging, CP commands, the CP monitor and linemode commands like screen scraping.
If the master processor is overloaded, move the workload or add another PHYSICAL (not virtual) processor.
Watch ESACPUU (system/user overhead), ESAXACT (simulation wait), ESALPAR (master engine with weighting) and ESAPLDV.
IF THE MASTER PROCESSOR IS OVERLOADED - ADDING ANOTHER ENGINE WILL NOT HELP - IT WILL ADD TO THE PROBLEM.
A second LPAR will need to be added so there is a second master processor.

Affinity processing is the concept that because all information for an instruction needs to be in L1 cache before it can execute, a virtual CPU will try first to be dispatched on the same thread/CPU to reduce the need to move data into L1 cache. However, with the way z/VM server systems tend to poll, this doesn't tend to work.
CPU overcommit refers to allocating more vCPUs to servers than the number of available physical CPU cores on the LPAR. The overcommit ratio should not exceed 2:1.
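
The overcommit guideline above can be checked with a quick calculation. This is a minimal sketch. The function names and the sample counts are hypothetical, and the vCPU and core counts would have to come from your own configuration data.

```python
MAX_OVERCOMMIT_RATIO = 2.0  # the 2:1 guideline stated in the text


def overcommit_ratio(total_vcpus: int, physical_cores: int) -> float:
    """Ratio of vCPUs allocated to servers versus physical cores on the LPAR."""
    if physical_cores < 1:
        raise ValueError("the LPAR needs at least one physical core")
    return total_vcpus / physical_cores


def exceeds_guideline(total_vcpus: int, physical_cores: int) -> bool:
    """True when the ratio is above the 2:1 guideline."""
    return overcommit_ratio(total_vcpus, physical_cores) > MAX_OVERCOMMIT_RATIO


# Hypothetical LPAR with 6 physical cores.
for vcpus in (10, 12, 14):
    ratio = overcommit_ratio(vcpus, 6)
    flag = "exceeds 2:1" if exceeds_guideline(vcpus, 6) else "within 2:1"
    print(f"{vcpus} vCPUs on 6 cores: {ratio:.2f}:1 ({flag})")
```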
If seeing available CPU capacity (ESALPARS/ESACPUU) AND CPU wait (on ESAXACT) - use the SYSCONTROL command below.
What is right for your environment depends on reviewing the current environment using the zVPS tools/information to gain an understanding of what improvements or corrections are possible.
Again, it is incredibly helpful to know your baseline environment; abnormalities then become more obvious and easier to find.

Some clarification on CPU naming:

The machine is equipped with physical cores - not engines, not processors, not CPUs, not IFLs, not CPs.
Cores, whether physical or logical, come in different types: CP, IFL, etc. Logical cores and physical cores have a percent-busy metric called core utilization.
For a logical core - this is the percent of time the logical core is dispatched on a physical core.
For a physical core, this is the percent of time the physical core has a logical core dispatched upon it.
Contained within a core, either physical or logical, are instruction execution units called processors. Physical cores contain physical processors. Logical cores contain logical processors.
An IFL core-type can have either one or two processors contained in the core, depending upon the SMT level. SMT-2 has two processors in the core.
Logical processors have a percent-busy metric called processor utilization. This is the percent of elapsed time the processor has a non-wait PSW loaded (this hasn't changed).
Synonymous with processor utilization is processor busy, processor load, CPU utilization, CPU load and CPU busy.
Keep in mind, you pay for physical cores (not threads or logical cores/processors - which gets confusing with SMT).

Presentations about the CPU environment and utilization:
Processor Configuration and Analysis Intro
Processor Advanced Topics

Helpful system settings:

SET SRM DSPSlice minslice - This can be useful for systems with few processors and CPU intensive workloads. For Linux workloads, the default of 5 (ms) may be too high. Start by setting it to 3 (ms) then go down as far as 1 (ms). This helps servers running online transactions. Note that when turning on SMT, the dispatch time slice default goes to 10 and should be lowered if running servers that do online transactions.
Check/Update SYSCONTROL:

q syscontrol - (with a default MODLEVEL of 1):
DISPATCH THDAFFINITY       ON
DISPATCH PREEMPTLOCAL     OFF
DISPATCH TSEARLY           50
DISPATCH INCHIPBUSY     50000  <-- Delay for steal on chip
DISPATCH INCHIPDELAY    50000
DISPATCH INNODEBUSY    100000  <-- Delay for steal on node
DISPATCH INNODEDELAY   100000
DISPATCH INSYSBUSY     200000  <-- Delay for steal on system
DISPATCH INSYSDELAY    200000

SET SYSCONTROL DISPATCH MODLEVEL 0 - This can be useful for when there is available capacity but also CPU wait.
It will allow available capacity to be used immediately instead of waiting for the dispatch delay. This also alleviates "steal time" for Linux. (Set it back to 1 to return to the original default setting).

q syscontrol - (with MODLEVEL 0):
DISPATCH THDAFFINITY      OFF
DISPATCH PREEMPTLOCAL      ON
DISPATCH TSEARLY            0
DISPATCH INCHIPBUSY         0
DISPATCH INCHIPDELAY        0
DISPATCH INNODEBUSY    100000
DISPATCH INNODEDELAY   100000
DISPATCH INSYSBUSY     200000
DISPATCH INSYSDELAY    200000

To turn all affinity delays off, issue the following command: CP SET SYSCONTROL DISPATCH STEALBARRIER INCHIPBUSY 0 INCHIPDELAY 0 INNODEBUSY 0 INNODEDELAY 0 INSYSBUSY 0 INSYSDELAY 0

q syscontrol - (with all settings at 0):
DISPATCH THDAFFINITY      OFF
DISPATCH PREEMPTLOCAL      ON
DISPATCH TSEARLY            0
DISPATCH INCHIPBUSY         0
DISPATCH INCHIPDELAY        0
DISPATCH INNODEBUSY         0
DISPATCH INNODEDELAY        0
DISPATCH INSYSBUSY          0
DISPATCH INSYSDELAY         0
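
To see exactly which settings change between two of the listings above, the output can be parsed and compared. This is a minimal sketch, assuming the output has the three-column "DISPATCH name value" layout shown in this document. The helper names are hypothetical, and the sample text is copied from the MODLEVEL 1 and MODLEVEL 0 listings.

```python
def parse_syscontrol(text: str) -> dict:
    """Map each DISPATCH setting name to its value (kept as a string)."""
    settings = {}
    for line in text.splitlines():
        # Drop trailing annotations such as "<-- Delay for steal on chip".
        fields = line.split("<--")[0].split()
        if len(fields) == 3 and fields[0] == "DISPATCH":
            settings[fields[1]] = fields[2]
    return settings


def changed_settings(before: dict, after: dict) -> dict:
    """Return {name: (old, new)} for settings whose value differs."""
    return {
        name: (before[name], after[name])
        for name in before
        if name in after and before[name] != after[name]
    }


MODLEVEL_1 = """\
DISPATCH THDAFFINITY       ON
DISPATCH PREEMPTLOCAL     OFF
DISPATCH TSEARLY           50
DISPATCH INCHIPBUSY     50000  <-- Delay for steal on chip
DISPATCH INCHIPDELAY    50000
DISPATCH INNODEBUSY    100000  <-- Delay for steal on node
DISPATCH INNODEDELAY   100000
DISPATCH INSYSBUSY     200000  <-- Delay for steal on system
DISPATCH INSYSDELAY    200000
"""

MODLEVEL_0 = """\
DISPATCH THDAFFINITY      OFF
DISPATCH PREEMPTLOCAL      ON
DISPATCH TSEARLY            0
DISPATCH INCHIPBUSY         0
DISPATCH INCHIPDELAY        0
DISPATCH INNODEBUSY    100000
DISPATCH INNODEDELAY   100000
DISPATCH INSYSBUSY     200000
DISPATCH INSYSDELAY    200000
"""

diff = changed_settings(parse_syscontrol(MODLEVEL_1), parse_syscontrol(MODLEVEL_0))
for name, (old, new) in diff.items():
    print(f"{name}: {old} -> {new}")
```

In these listings, MODLEVEL 0 changes only the thread affinity, local preemption, TSEARLY and in-chip values. The node and system delays stay at their defaults until they are set to 0 explicitly.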

SET SRM POLARization VERTical - Vertical polarization can have better performance and is required if using SMT (Simultaneous MultiThreading).

Settings that are no longer relevant/useful:

SET SRM DSPBUF | LDUBUF | STORBUF
SET SRM IABIAS
SET SRM MAXWSS

Understanding how to view CPU utilization with SMT

When SMT is active, there are x vCPUs and x*2 threads. If viewing from a hardware perspective (ESALPARx/ESAUSP5), the numbers shown are the number of vCPUs. If viewing from a z/VM perspective (ESACPUx), the numbers shown are the number of threads.

For example, the pictures below show a system with 7 vCPUs and thus 14 threads. ESAUSP5 is showing the percentage of CPU used as 682.9 (out of 700 - for 7 vCPUs) but ESACPUU shows it as 1220 (out of 1400 - for 14 threads).
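
The two figures in this example can be put on the same scale by dividing each by its own maximum. This is a minimal sketch using only the numbers quoted above. The function name is hypothetical.

```python
def utilization_fraction(busy_pct: float, units: int) -> float:
    """Convert a summed percentage (each unit contributes up to 100) to a 0-1 fraction."""
    if units < 1:
        raise ValueError("need at least one vCPU or thread")
    return busy_pct / (units * 100.0)


vcpus = 7
threads = vcpus * 2  # SMT-2: two threads per vCPU

hardware_view = utilization_fraction(682.9, vcpus)   # ESAUSP5 figure, out of 700
zvm_view = utilization_fraction(1220.0, threads)     # ESACPUU figure, out of 1400

print(f"Hardware view: {hardware_view:.1%} of {vcpus} vCPUs")
print(f"z/VM view:     {zvm_view:.1%} of {threads} threads")
```

The two results differ (roughly 98% versus 87%) because they measure different things. This is why a figure should always be read against the maximum for the screen it came from.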

Helpful ESAMON screens/ESAMAP reports:

ESAMAIN - System overview - shows current total CPU processor utilization
ESACPUU - CPU Utilization Analysis (Part1) - shows current CPU processor utilization details
ESACPUA - CPU Utilization Analysis (Part2) - shows more current CPU processor utilization details
ESAMFC - Processor Cache Analysis - See Understanding MFC Data
ESADIAG - Diagnose code rate - shows information on which diagnose codes are being used at what rate
ESAPLDV - Processor Local Dispatch Vector Activity - shows z/VM Dispatcher information
ESAIUER - IUCV error analysis - shows errors in inter-system communication
ESALCK - Spin lock activity - shows where spin locks are happening
ESATOPU - Top Users Resource Use - shows Top users for last 30 minutes

Using zVPS to find information for solving issues with the CPU utilization:

Use zVPS real time monitoring and daily reports to see how efficiently the environment is running. What is the total CPU utilization?
How is that broken down by LPARs/IFLs? What are the users consuming? What else might be happening? Here are some places to start:

ESAMAIN - System overview information:

Processor Utilization Total - This is the same as CPU Busy. This column is highlighted when it passes a threshold, which the administrator can change. Since this LPAR/system has 6 CPUs/engines, the total percentage available is 600% (so over 100% is not necessarily a problem). However, other screens will show which engine types are being used and whether they are close to their utilization capacity. It is a good idea to set up thresholds to see when utilization numbers are higher than expected. In this example with 6 CPUs, 500% might be flagged as yellow and 550% as red, depending on your installation.
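
The threshold example above can be expressed as a small classification. This is a minimal sketch that reuses the 500% and 550% values from the text. The function name and the "normal" label are hypothetical, and the thresholds on a real system are whatever the administrator has configured.

```python
def classify_total_utilization(total_pct: float,
                               yellow_at: float = 500.0,
                               red_at: float = 550.0) -> str:
    """Classify a total utilization figure for the 6-CPU example (600% maximum)."""
    if total_pct >= red_at:
        return "red"
    if total_pct >= yellow_at:
        return "yellow"
    return "normal"


cpus = 6
capacity_pct = cpus * 100  # 600% available in total

for sample in (130.0, 512.5, 575.0):
    status = classify_total_utilization(sample)
    print(f"{sample:.1f}% of {capacity_pct}%: {status}")
```

Note that 130% is classified as normal here: with 6 CPUs it is well under a quarter of the available capacity.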

SMT Prort Ratio now shows on ESAMAIN. This shows the thread to core ratio. A number of ~0.5 is very good. If it starts to climb, SMT may be providing capacity, but also could be impacting response times.

ESACPUU - Shows information for each CPU/engine on the box.

CPU Type/ID - This shows the CPUs/engines (IFLs) on the LPAR/system.

Total util - This shows the CPU utilization for each of the engines. This can be helpful to see if all engines are relatively equal in utilization. The ESACPUU report will show totals for each 15 minute increment in the day, which is good for trending.

Overhd User/Syst - This shows the CPU overhead, which can be attributed to user functions or system functions. If this is high, there could be issues. High User overhead signifies high master processor simulation. Check ESAUSP2 for specific user data that correlates to the problem time. High Syst overhead (along with low Emulation time) usually means there is a master processor bottleneck. If it is not a master processor bottleneck, check the SYSCONTROL settings above and set them to 0.

CPU Wait Idle - This shows the amount of time the CPU was idle (no work to run). If this is high, there might be too many vCPUs assigned to this LPAR.

Page Write - This shows page writes to disk. These are done on the Master processor. Watch for high numbers which can show up as Sim wait on ESAXACT.

CPU Wait Steal - This shows the amount of time the CPU was waiting to be dispatched - also known as suspended - neither running, in a wait state nor parked.

Vertical Park Secs - This shows the number of seconds a CPU was parked. This can be used to determine if the CPU has been parked multiple times, which can cause overhead in the PR/SM hypervisor. Look at ESALPAR to see how the LPAR is defined.

Look for large fluctuations in numbers in the other columns.

ESACPUA - Shows similar information as ESACPUU.

CPU Type/ID - This shows the CPUs/engines (IFL) on the LPAR/system.

Total util - This shows the CPU utilization for each of the engines. This can be helpful to see if all engines are relatively equal in utilization.

Internal Diagnose and User Diag/sec - This shows the number of internal diagnose instructions executed per second and the number of calls to user diagnose codes. If the first diagnose number is over 5000 and the second is over 1000, it is likely caused by a Linux server with too many virtual processors defined. ESAUSRD shows user diagnose calls. ESASRVC shows how many virtual processors are defined to each server.
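
The rule of thumb above can be written as a simple check. This is a minimal sketch using the two limits from the text. The function name and the sample rates are hypothetical, and a match is only a hint to go look at ESAUSRD and ESASRVC.

```python
INTERNAL_DIAG_LIMIT = 5000  # internal diagnose instructions per second
USER_DIAG_LIMIT = 1000      # user diagnose calls per second


def suggests_excess_vcpus(internal_per_sec: float, user_per_sec: float) -> bool:
    """True when both rates are over their limits, as described in the text."""
    return internal_per_sec > INTERNAL_DIAG_LIMIT and user_per_sec > USER_DIAG_LIMIT


# Hypothetical samples: (internal diagnose/sec, user diagnose/sec)
for internal, user in ((1200, 300), (6400, 800), (7800, 2100)):
    hint = "check vCPU counts" if suggests_excess_vcpus(internal, user) else "ok"
    print(f"internal={internal}/sec user={user}/sec: {hint}")
```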

Look for large fluctuations in numbers in the other columns, especially the overhead columns.

ESADIAG - Shows Diagnose rates.

Diag Code - A diagnose code of 44 shows an older version of Linux that uses spin locks - which can cause a performance issue. It is better to use diagnose code 9C instead. However, too many 9C calls can show that a server has too many vCPUs. Use ESAUSRD to see what machines are using which diagnose codes.

ESAPLDV - Shows Processor Local Dispatch Vector.

If the Master Processor (found in the ESAHDR report) is constrained, it will show up as high Simulation wait on ESAXACT. The ESAPLDVC report shows when a VMDBK is moved "To Master". This happens when the z/VM Dispatcher finds something that needs to run on the Master Processor (examples above). Watch for large fluctuations in these numbers. Also see Master Processor Issue for more information on this issue.

ESAIUER - Shows IUCV errors.

IUCV failures - Since many IUCV services run "master only" (meaning they use only the one designated master processor), any IUCV errors can cause performance issues.

ESALCK - Shows spin lock activity.

CPU% - Locking doesn't tend to have issues. However, if system utilization is high, check for out of control spin locks. If either the exclusive or shared CPU% is over 10, there are spin lock issues. This is most likely an issue that needs to be sent to IBM.
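
The spin lock guideline above amounts to a single comparison. This is a minimal sketch using the limit of 10 from the text. The function name and the sample values are hypothetical.

```python
SPIN_LOCK_CPU_LIMIT = 10.0  # CPU% limit from the text


def has_spin_lock_issue(exclusive_cpu_pct: float, shared_cpu_pct: float) -> bool:
    """True when either the exclusive or the shared CPU% is over the limit."""
    return (exclusive_cpu_pct > SPIN_LOCK_CPU_LIMIT
            or shared_cpu_pct > SPIN_LOCK_CPU_LIMIT)


# Hypothetical samples: (exclusive CPU%, shared CPU%)
for exclusive, shared in ((0.4, 0.1), (12.3, 0.2), (1.0, 15.8)):
    verdict = "spin lock issue" if has_spin_lock_issue(exclusive, shared) else "ok"
    print(f"exclusive={exclusive}% shared={shared}%: {verdict}")
```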

ESATOPU - Shows CPU utilization by user - top users first.

CPU Time - This shows how much CPU the top user for any given minute is utilizing. This picture shows that ZADMIN was utilizing more of the system than normal from 14:51 to 15:06. On this system, a systems programmer was running a trace on that machine. (This wouldn't happen with this ID on your system, but it shows how one user can cause the CPU utilization to go up tremendously).

This is a fast way to show a possible abusive user.

Conclusions:

Looking at CPU utilization is one of the quickest ways to find processing issues.

Just like on the freeway:

Sometimes it is just one car (out of control user/system)
Sometimes additional lanes or further tuning can be needed (CPU utilization stays high or spikes frequently)
Sometimes the lanes are clogged by an accident (not a CPU problem, but other hardware issues).

The best thing to do is to know your current environment and what is normal/abnormal. If CPU is not the issue, continue the search in other parts of the system.