--- layout: default title: Metrics Reference parent: Performance Analyzer nav_order: 3 redirect_from: - /monitoring-plugins/pa/reference/ --- # Metrics reference This page contains all Performance Analyzer metrics. All metrics support the `avg`, `sum`, `min`, and `max` aggregations, although certain metrics measure only one thing, making the choice of aggregation irrelevant. For information on dimensions, see the [dimensions reference](#dimensions-reference). This list is extensive. We recommend using Ctrl/Cmd + F to find what you're looking for. {: .tip }
Metric | Dimensions | Description |
---|---|---|
CPU_Utilization | ShardID, IndexName, Operation, ShardRole | CPU usage ratio. CPU time (in milliseconds) used by the associated thread(s) in the past five seconds, divided by 5000 milliseconds. |
Paging_MajfltRate | The number of major faults per second in the past five seconds. A major fault requires the process to load a memory page from disk. | |
Paging_MinfltRate | The number of minor faults per second in the past five seconds. A minor fault does not requires the process to load a memory page from disk. | |
Paging_RSS | The number of pages the process has in real memory---the pages that count towards text, data, or stack space. This number does not include pages that have not been demand-loaded in or swapped out. | |
Sched_Runtime | Time (seconds) spent executing on the CPU per context switch. | |
Sched_Waittime | Time (seconds) spent waiting on a run queue per context switch. | |
Sched_CtxRate | Number of times run on the CPU per second in the past five seconds. | |
Heap_AllocRate | An approximation of the heap memory allocated, in bytes, per second in the past five seconds | |
IO_ReadThroughput | Number of bytes read per second in the last five seconds. | |
IO_WriteThroughput | Number of bytes written per second in the last five seconds. | |
IO_TotThroughput | Number of bytes read or written per second in the last five seconds. | |
IO_ReadSyscallRate | Read system calls per second in the last five seconds. | |
IO_WriteSyscallRate | Write system calls per second in the last five seconds. | |
IO_TotalSyscallRate | Read and write system calls per second in the last five seconds. | |
Thread_Blocked_Time | Average time (seconds) that the associated thread(s) blocked to enter or reenter a monitor. | |
Thread_Blocked_Event | The total number of times that the associated thread(s) blocked to enter or reenter a monitor (i.e. the number of times a thread has been in the blocked state). | |
Thread_Waited_Time | Average time (seconds) that the associated thread(s) waited to enter or reenter a monitor in WAITING or TIMED_WAITING state. | |
Thread_Waited_Event | The total number of times that the associated thread(s) waited to enter or reenter a monitor (i.e. the number of times a thread has been in the WAITING or TIMED_WAITING state). | |
ShardEvents | The total number of events executed on a shard in the past five seconds. | |
ShardBulkDocs | The total number of documents indexed in the past five seconds. | |
Indexing_ThrottleTime | ShardID, IndexName | Time (milliseconds) that the index has been under merge throttling control in the past five seconds. |
Cache_Query_Hit | The number of successful lookups in the query cache in the past five seconds. | |
Cache_Query_Miss | The number of lookups in the query cache that failed to retrieve a `DocIdSet` in the past five seconds. `DocIdSet` is a set of document IDs in Lucene. | |
Cache_Query_Size | Query cache memory size in bytes. | |
Cache_FieldData_Eviction | The number of times OpenSearch has evicted data from the fielddata heap space (occurs when the heap space is full) in the past five seconds. | |
Cache_FieldData_Size | Fielddata memory size in bytes. | |
Cache_Request_Hit | The number of successful lookups in the shard request cache in the past five seconds. | |
Cache_Request_Miss | The number of lookups in the request cache that failed to retrieve the results of search requests in the past five seconds. | |
Cache_Request_Eviction | The number of times OpenSearch evicts data from shard request cache (occurs when the request cache is full) in the past five seconds. | |
Cache_Request_Size | Shard request cache memory size in bytes. | |
Refresh_Event | The total number of refreshes executed in the past five seconds. | |
Refresh_Time | The total time (milliseconds) spent executing refreshes in the past five seconds | |
Flush_Event | The total number of flushes executed in the past five seconds. | |
Flush_Time | The total time (milliseconds) spent executing flushes in the past five seconds. | |
Merge_Event | The total number of merges executed in the past five seconds. | |
Merge_Time | The total time (milliseconds) spent executing merges in the past five seconds. | |
Merge_CurrentEvent | The current number of merges executing. | |
Indexing_Buffer | Index buffer memory size in bytes. | |
Segments_Total | The number of segments. | |
IndexWriter_Memory | Estimated memory usage by the index writer in bytes. | |
Bitset_Memory | Estimated memory usage for the cached bit sets in bytes. | |
VersionMap_Memory | Estimated memory usage of the version map in bytes. | |
Shard_Size_In_Bytes | Estimated disk usage of the shard in bytes. | |
Indexing_Pressure_Current_Limits | ShardID, IndexName, IndexingStage | Total heap size (in bytes) that is available for utilization by a shard of an index in a particular indexing stage (Coordinating, Primary or Replica). |
Indexing_Pressure_Current_Bytes | Total heap size (in bytes) occupied by a shard of an index in a particular indexing stage (Coordinating, Primary or Replica). | |
Indexing_Pressure_Last_Successful_Timestamp | Timestamp of a request that was successful for a shard of an index in a particular indexing stage (Coordinating, Primary or Replica). | |
Indexing_Pressure_Rejection_Count | Total rejections performed by OpenSearch for a shard of an index in a particular indexing stage (Coordinating, Primary or Replica). | |
Indexing_Pressure_Average_Window_Throughput | Average throughput of the last n requests (The value of n is determined by `shard_indexing_pressure.secondary_parameter.throughput.request_size_window` setting) for a shard of an index in a particular indexing stage (Coordinating, Primary or Replica). | |
Latency | Operation, Exception, Indices, HTTPRespCode, ShardID, IndexName, ShardRole | Latency (milliseconds) of a request. |
GC_Collection_Event | MemType | The number of garbage collections that have occurred in the past five seconds. |
GC_Collection_Time | The approximate accumulated time (milliseconds) of all garbage collections that have occurred in the past five seconds. | |
Heap_Committed | The amount of memory (bytes) that is committed for the JVM to use. | |
Heap_Init | The amount of memory (bytes) that the JVM initially requests from the operating system for memory management. | |
Heap_Max | The maximum amount of memory (bytes) that can be used for memory management. | |
Heap_Used | The amount of used memory in bytes. | |
Disk_Utilization | DiskName | Disk utilization rate: percentage of disk time spent reading and writing by the OpenSearch process in the past five seconds. |
Disk_WaitTime | Average duration (milliseconds) of read and write operations in the past five seconds. | |
Disk_ServiceRate | Service rate: MB read or written per second in the past five seconds. This metric assumes that each disk sector stores 512 bytes. | |
Net_TCP_NumFlows | DestAddr | Number of samples collected. Performance Analyzer collects one sample every five seconds. |
Net_TCP_TxQ | Average number of TCP packets in the send buffer. | |
Net_TCP_RxQ | Average number of TCP packets in the receive buffer. | |
Net_TCP_Lost | Average number of unrecovered recurring timeouts. This number is reset when the recovery finishes or `SND.UNA` is advanced. `SND.UNA` is the sequence number of the first byte of data that has been sent, but not yet acknowledged. | |
Net_TCP_SendCWND | Average size (bytes) of the sending congestion window. | |
Net_TCP_SSThresh | Average size (bytes) of the slow start size threshold. | |
Net_PacketRate4 | Direction | The total number of IPv4 datagrams transmitted/received from/by interfaces per second, including those transmitted or received in error. |
Net_PacketDropRate4 | The total number of IPv4 datagrams transmitted or received in error per second. | |
Net_PacketRate6 | The total number of IPv6 datagrams transmitted or received from or by interfaces per second, including those transmitted or received in error. | |
Net_PacketDropRate6 | The total number of IPv6 datagrams transmitted or received in error per second. | |
Net_Throughput | The number of bits transmitted or received per second by all network interfaces. | |
ThreadPool_QueueSize | ThreadPoolType | The size of the task queue. |
ThreadPool_RejectedReqs | The number of rejected executions. | |
ThreadPool_TotalThreads | The current number of threads in the pool. | |
ThreadPool_ActiveThreads | The approximate number of threads that are actively executing tasks. | |
ThreadPool_QueueLatency | The latency of the task queue. | |
ThreadPool_QueueCapacity | The current capacity of the task queue. | |
Master_PendingQueueSize | Master_PendingTaskType | The current number of pending tasks in the cluster state update thread. Each node has a cluster state update thread that submits cluster state update tasks (create index, update mapping, allocate shard, fail shard, etc.). |
HTTP_RequestDocs | Operation, Exception, Indices, HTTPRespCode | The number of items in the request (only for `_bulk` request type). |
HTTP_TotalRequests | The number of finished requests in the past five seconds. | |
CB_EstimatedSize | CBType | The current number of estimated bytes. |
CB_TrippedEvents | The number of times the circuit breaker has tripped. | |
CB_ConfiguredSize | The limit (bytes) for how much memory operations can use. | |
Master_Task_Queue_Time | MasterTaskInsertOrder, MasterTaskPriority, MasterTaskType, MasterTaskMetadata | The time (milliseconds) that a master task spent in the queue. |
Master_Task_Run_Time | The time (milliseconds) that a master task has been executed. | |
Cache_MaxSize | CacheType | The max size of the cache in bytes. |
AdmissionControl_RejectionCount (WIP) | ControllerName | Total rejections performed by a Controller of Admission Control. |
AdmissionControl_CurrentValue (WIP) | Current value for Controller of Admission Control. | |
AdmissionControl_ThresholdValue (WIP) | Threshold value for Controller of Admission Control. | |
Data_RetryingPendingTasksCount (WIP) | NodeID | Number of throttled pending tasks on which data node is actively performing retries. It will be an absolute metric at that point of time. |
Master_ThrottledPendingTasksCount (WIP) | Sum of total pending tasks which got throttled by node (master node). It is a cumulative metric so look at the max aggregation. | |
Election_Term (WIP) | N/A | Monotonically increasing number with every master election. |
PublishClusterState_Latency (WIP) | The time taken by quorum of nodes to publish new cluster state. This metric is available for current master. | |
PublishClusterState_Failure (WIP) | The number of times publish new cluster state action failed on master node. | |
ClusterApplierService_Latency (WIP) | The time taken by each node to apply cluster state sent by master. | |
ClusterApplierService_Failure (WIP) | The number of times apply cluster state action failed on each node. | |
Shard_State (WIP) | IndexName, NodeName, ShardType, ShardID | The state of each shard - whether it is STARTED, UNASSIGNED, RELOCATING etc. |
LeaderCheck_Latency (WIP) | WIP | WIP |
FollowerCheck_Failure (WIP) | ||
LeaderCheck_Failure (WIP) | ||
FollowerCheck_Latency (WIP) |