AIStore Observability: Logs
AIStore Observability: Logs
AIStore (AIS) provides comprehensive logging that captures system operations, performance metrics, and error conditions.
Scope. How to configure, collect, and read AIS logs.
AIS logs are the cluster’s ground truth: every proxy or target writes a chronological stream of events, warnings, and periodic performance snapshots. Well‑rotated logs let operators:
- Reconstruct incidents (root‑cause analysis)
- Correlate client symptoms with internal state changes
- Spot long‑running jobs without polling the control plane
Table of Contents
- Configuring Logging
- Severity & Verbosity
- Log Format and Structure
- Log File Layout & Rotation
- Accessing Logs
- Common Log Patterns
- Key Performance Metrics
- Troubleshooting Checklist
- Operational Tips
- Related Documentation
Configuring Logging
Show current values:
The new value propagates to every node within a second.
Example (Development Defaults)
Example (Production Configuration)
In production environments, settings are typically adjusted for higher retention and less frequent statistics collection:
At startup, AIS logs some of these settings:
Severity & Verbosity
AIS prepends every line with a severity prefix and—in the case of informational messages—an internal numeric level.
Severity Prefixes
Numeric Levels for I Lines
Tip. Temporarily crank a node:
Per‑module Overrides (log.modules)
log.modules lets you boost just a subset of subsystems to level 5 without flooding the whole cluster.
Log Format and Structure
AIS logs follow a consistent format:
Common prefixes:
config:– effective runtime configurationx-<n>:– extended (batch) action lifecyclenvmeXnY:– per‑disk I/O snapshotkvstats:– cluster‑wide key‑value metrics (see below)
Log File Layout & Rotation
File names include the node ID plus a sequence number (target‑A43c.log.3). Rotation is triggered by max_size; retention is enforced by max_total.
AIS implements automatic log rotation as indicated by the header:
When logs are rotated, new log files are created and old ones are typically compressed or archived according to the retention policy.
Accessing Logs
Via CLI
The AIS CLI provides commands to view and collect logs:
Directly in Kubernetes
In Kubernetes deployments, access logs using kubectl:
Common Log Patterns
Startup Sequence
The startup sequence provides important information about the AIS node configuration:
Operation Logs
AIS logs details about operations such as list, put, get:
Performance Metrics
AIS regularly logs performance metrics in two formats:
- Disk-specific performance:
- Comprehensive key-value statistics (at regular intervals defined by
stats_time):
Kubernetes-specific Information
In Kubernetes deployments, AIS logs include pod and cluster-specific details:
Key Performance Metrics
The key-value statistics contain valuable operational metrics:
Troubleshooting Checklist
- Scan for E & W lines around the timeframe.
- Look for spikes in
err.<n>.ncounters. - Watch disk
util> 80% or sustainedread.bpsplateaus. - Temporarily raise
log.levelorlog.moduleson a single node to capture more detail.
For advanced log analysis, consider forwarding logs to external systems for aggregation and visualization.
Operational Tips
- Keep
log.level=3in production; raise to4or5only while debugging. Lower to2or below if you truly need silence. - Raise
stats_time(≥ 60s) if logs get noisy on busy systems. - Ship rotated logs off‑host weekly.
- Always attach
ais cluster download-logstarball to GitHub issues.