Cheetah - Agile & Fast Performance Enhancements: IBM - Informix Dynamic Server
Cheetah - Agile & Fast Performance Enhancements: IBM - Informix Dynamic Server
Agenda
► Non-Blocking Checkpoints
► Automatic Checkpoints
What is a checkpoint?
► A checkpoint is a point in time where
cached data (bufferpool) is flushed to
disk to create a consistency point for
fast recovery, backups, HDR…
What is an LRU?
► The LRU are queues used to manage the bufferpool
Non-Blocking Checkpoints
►Exceptions….
■ Checkpoint running short on resources
• Physical log 75%
• At least one checkpoint per logical log space.
■ Admin, archive checkpoints
Interval checkpoint
Recommendations
Automatic Checkpoints
Automatic Checkpoints
►If potential transaction blocking detected
Automatic Checkpoints
►Default is always on
►onmode –wm AUTO_CKPTS=0 … turn
off
►onmode –wm AUTO_CKPTS=1 … turn
on
23:28:26 Performance Advisory: The current size of the physical log buffer is smaller
than recommended.
23:28:26 Results: Transaction performance might not be optimal.
23:28:26 Action: For better performance, increase the physical log buffer size to 128.
13:25:54 Performance Advisory: Based on the current workload, the physical log might
be too small to accommodate the time it takes to flush the buffer pool.
13:25:54 Results: The server might block transactions during checkpoints.
13:25:54 Action: If transactions are blocked during the checkpoint, increase
the size of the physical log to at least 14000 KB.
13:25:54 Performance Advisory: The physical log is too small for automatic checkpoints.
13:25:54 Results: Automatic checkpoints are disabled.
13:25:54 Action: To enable automatic checkpoints, increase the physical log to
at least 14000 KB.
onstat –g ckp
IBM Informix Dynamic Server Version 11.10.FB7TL -- On-Line -- Up 01:03:54 -- 39936 Kbytes
AUTO_CKPTS=Off RTO_SERVER_RESTART=Off
Max Plog Max Llog Max Dskflush Avg Dskflush Avg Dirty Blocked
pages/sec pages/sec Time pages/sec pages/sec Time
200 200 1 405 10 1
The server is blocking transactions because the physical log is too small.
Based on the current workload, to prevent the server from blocking future
transactions,
increase the size of the physical log to 14000 KB.
Based on the current workload, the logical log space might be too small
to accommodate the time it takes to flush the buffer pool. The server might
block
transactions during checkpoints. If the server blocks transactions,
increase the size of the logical log space to at least 14000 KB.
onstat –g ckp
AUTO_CKPTS On/Off Displays if automatic checkpoints feature is on or off
RTO_SERVER_RESTART Seconds Displays the RTO policy. 0=RTO policy is off.
Estimated recovery time Seconds This is the estimated time it would take the IDS server to perform fast recovery.
Interval Number Checkpoint interval id
Clock Time Wall clock time This is the wall clock time that the checkpoint occurred
Trigger Text There are several events that can trigger a checkpoint. The most common are RTO, Plog
or Llog (running out of logical log resources).
LSN Log position Log position of checkpoint
Total Time Seconds Total checkpoint duration from request time to checkpoint completion
Flush Time Seconds Time to flush bufferpools
Block Time Seconds Transaction blocking time
# Waits Number Number of transactions that blocked waiting for checkpoint
Ckpt Time Seconds amount of time it takes for all transactions to recognize a checkpoint has been requested
Wait Time Seconds Average time thread waited for checkpoint
Long Time Seconds Longest amount of time a transaction waited for checkpoint
# Dirty Buffers Number Number of buffers flushed to disk during checkpoint processing
Dskflu/Sec Number Number of buffers flushed to disk per sec during checkpoint processing
Plog Total Pages Number Total number of pages physically logged during the checkpoint interval
Plog Avg/Sec Number Average rate of physical log activity during the checkpoint interval
Llog Total Pages Number Total number of pages logically logged during the checkpoint interval
Llog Avg/Sec Number Average rate of logical log activity during the checkpoint interval
Slide 28 © 2007 IBM Corporation
IBM Informix Dynamic Server
Onconfig parameter
► New onconfig parameter
► RTO_SERVER_RESTART
■ Amount of time in seconds that Dynamic Server has to recover
from a problem after you restart Dynamic Server and bring the
server into online or quiescent mode.
■ Seed the logical recovery pages in physical log
■ Valid values are 60 – 1800
■ Default is 0 (disabled)
RTO
► Facts about RTO_SERVER_RESTART
■ Allows users to set target fast recovery time.
■ RTO_SERVER_RESTART and CKPTINTVL mutually exclusive.
■ If turned off, the system will use the CKPTINTVL to trigger
checkpoints (the old style).
■ Valid values 60 - 1800 seconds (1–30 minutes).
■ Automatically adjust the checkpoint frequency to meet the RTO
policy.
■ The server will fine tune with each fast recovery to improve the
predictability.
■ This parameter can be updated with onmode –wf and –wm.
■ RTO_SERVER_RESTART=0 (off) is the default.
CLEANERS
► Initial setting will be 1 cleaner thread per AIO VP
► Value adjusted in conjunction with changes to the number
of AIO VPs.
Limitations
Summary
Questions