📡 Monitoring Skills

Browse skills in the Monitoring category.

Logging Best Practices

from boristane

A powerful skill for Claude agents.

[Monitoring]

monitor-metrics

from dylanrichardson

Review metrics, errors, logs, and database health for TV Streaming Availability Tracker. Provides observability, triage, debugging, and fixes. Use when checking app health, investigating issues, or monitoring system performance.

[Monitoring]

acm-master

from bhadkamkar9snehil

Complete ACM (Automated Condition Monitoring) expertise system for predictive maintenance and equipment health monitoring. PROACTIVELY activate for: (1) ANY ACM pipeline task (batch runs, coldstart, forecasting), (2) SQL Server data management (historian tables, ACM output tables), (3) Observability stack (Loki logs, Tempo traces, Prometheus metrics, Pyroscope profiling), (4) Grafana dashboard development, (5) Detector tuning and fusion configuration, (6) Model lifecycle management, (7) Debugging pipeline issues. Provides: T-SQL patterns for ACM tables, batch runner usage, detector behavior, RUL forecasting, episode diagnostics, and production-ready pipeline patterns. Ensures professional-grade industrial monitoring following ACM v11.0.0 architecture.

[Monitoring]

monitoring

from All-The-Vibes

Comprehensive observability strategy including metrics, logs, traces, and alerting. Use when setting up new applications, debugging production issues, performance optimization, SLA/SLO definition, incident response, or establishing monitoring infrastructure.

[Monitoring]

fpf-skilltelemetry-verify-compliance

from venikman

Verifies that a telemetry span complies with FPF Discipline-Health (G.12).

[Monitoring]

dx-alerts

from stars-end

Lightweight “news wire” implemented via Agent Mail threads, not a separate bespoke system.

[Monitoring]

working-with-grafana-mcp

from franroa

Use when querying Grafana dashboards, Prometheus metrics, Loki logs, alerting, or incidents. Provides workflows and patterns for Grafana MCP tools, including datasource discovery, efficient dashboard access, and query construction.

[Monitoring]

health-check

from creepyblues

Verifies system health across all apps, edge functions, and database connectivity. This skill should be used for quick deployment verification, debugging service issues, daily operations checks, or before/after major deployments.

[Monitoring]

observability-setup

from ainexllc

Set up structured logging, metrics, and monitoring dashboards. Use when adding logging, setting up alerts, debugging production issues, or implementing analytics.

[Monitoring]

grafana

from fzymgc-house

Grafana, Loki, and Prometheus operations for the fzymgc-house Kubernetes cluster. Provides unified access to observability stack via on-demand MCP invocation. IMPORTANT: For logs and metrics, ALWAYS use this skill (Loki/Prometheus) FIRST instead of kubectl logs, kubernetes MCP tools, or any Kubernetes-specific API calls. Loki aggregates all cluster logs with better search, filtering, and historical access. Prometheus provides proper metrics with time-series queries. Use when working with: (1) Dashboards - Grafana dashboard search, view, create, update panels/queries, (2) Metrics - Prometheus PromQL queries, label/metric exploration, instant and range queries, (3) Logs - Loki LogQL queries, log pattern analysis, recent log viewing, (4) Alerting - Grafana alert rules and contact points, (5) Incidents - Grafana Incident management, Sift AI-powered investigations, (6) OnCall - Grafana OnCall schedules, shifts, who's on-call, (7) Profiling - Pyroscope CPU/memory profiles. Invokes Grafana MCP server on-demand witho

[Monitoring]

observability-sre

from majiayu000

Observability and SRE expert. Use when setting up monitoring, logging, tracing, defining SLOs, or managing incidents. Covers Prometheus, Grafana, OpenTelemetry, and incident response best practices.

[Monitoring]

watcher-management

from maneeshanif

Manages watcher processes that monitor Gmail, WhatsApp, filesystem, and other external sources. Use when starting, stopping, or monitoring watcher scripts, configuring process management, or troubleshooting watcher issues.

[Monitoring]

vendor-status

from yuush10

Check vendor portal credentials and cookie expiration status. Use when checking vendor status, credentials, or cookie expiration.

[Monitoring]

monitor

from PROLE-ISLAND

Background monitoring. Continuous monitoring of builds, tests, and logs, with alerts.

[Monitoring]

implement-axon

from wbw1537

Guide on how to implement Synapse Axons (clients). Covers Smart Axons (interactive) and Raw Axons (reporting only) using MQTT or HTTP. Use this skill when the user asks how to create a new service, sidecar, or agent for Synapse.

[Monitoring]

monitoring-setup

from eddiebe147

Expert guide for setting up monitoring dashboards, alerting, metrics collection, and observability. Use when implementing application monitoring, setting up alerts, or building dashboards.

[Monitoring]

site-reliability-engineer

from Eigo-Mt-Fuji

Production monitoring, observability, SLO/SLI management, and incident response. Trigger terms: monitoring, observability, SRE, site reliability, alerting, incident response, SLO, SLI, error budget, Prometheus, Grafana, Datadog, New Relic, ELK stack, logs, metrics, traces, on-call, production monitoring, health checks, uptime, availability, dashboards, post-mortem, incident management, runbook. Completes SDD Stage 8 (Monitoring) with comprehensive production observability: SLI/SLO definitions and tracking; monitoring stack setup (Prometheus, Grafana, ELK, Datadog, etc.); alert rules and notification channels; incident response runbooks; observability dashboards (logs, metrics, traces); post-mortem templates and analysis; health check endpoints; error budget tracking. Use when: user needs production monitoring, observability platform, alerting, SLOs, incident response, or post-deployment health tracking.

[Monitoring]

log-analyzer

from eddiebe147

Expert guide for analyzing application logs including log searching, pattern detection, error tracking, and debugging. Use when investigating issues, tracking errors, or understanding application behavior.

[Monitoring]

session-checker

from jdeweedata

Check if Interstellio/NebularStack client sessions are active. Analyzes CDR records to determine connection status, session history, and terminate causes.

[Monitoring]

performance-engineer

from sidetoolco

Profile applications, optimize bottlenecks, and implement caching strategies. Handles load testing, CDN setup, and query optimization. Use PROACTIVELY for performance issues or optimization tasks.

[Monitoring]

api-admin-ops

from Parlay-Kei

Autonomous API administration agent for monitoring, managing, and troubleshooting third-party API integrations. Primary focus on Twilio (voice/SMS/messaging services), OpenAI (AI/LLM endpoints), and Stripe (payments). Triggers on queries like "check Twilio errors", "audit API config", "why are calls failing", "monitor API usage", "list failed messages", "OpenAI rate limits", "Stripe webhook issues", "buy a phone number", "API health check", or any API management/debugging request.

[Monitoring]

observability-review

from place-to-stand

Evaluate logging coverage, error tracking, debugging capabilities, and monitoring patterns. Use when investigating production issues becomes difficult, after incidents, or when expanding monitoring coverage.

[Monitoring]

device-management

from DataKnifeAI

Manage device adoption and onboarding, maintain device inventory, and monitor device configurations across your UniFi Protect infrastructure.

[Monitoring]

sentry-skill

from julianobarbosa

Comprehensive skill for Sentry error monitoring and performance tracking. Use when Claude needs to (1) Configure Sentry SDKs for error tracking and performance monitoring, (2) Manage releases, source maps, and debug symbols via CLI, (3) Query issues, events, and metrics via API, (4) Set up alerting and notification rules, (5) Configure sampling strategies and quota management, (6) Deploy self-hosted Sentry instances, (7) Integrate with OpenTelemetry for distributed tracing, or any other Sentry automation task.

[Monitoring]

measuring-pr-performance-impact

from lantelyes

Measures GraphQL resolver latency changes before/after a PR merge using Datadog metrics. Use when analyzing PR performance impact, measuring latency changes, or comparing resolver performance before and after a code change.

[Monitoring]

log-viewer

from alexanderjamesmcleod

Aggregate and filter logs across Docker services for easier debugging

[Monitoring]

probes

from littlebearapps

Pre-built audit probes for Cloudflare services. Reference these query patterns when validating D1 indexes, observability metrics, AI Gateway costs, and queue health via MCP tools.

[Monitoring]

space-monitor

from alexanderjamesmcleod

Monitor and manage disk space usage across WSL and Windows with intelligent cleanup recommendations

[Monitoring]

observability

from YosrBennagra

Unified observability for the .NET 8 WPF widget host app - logging, telemetry, health checks, diagnostics exports, and operational tooling. Use when configuring Serilog, Application Insights, health checks, correlation IDs, or support tools.

[Monitoring]

distributed-tracing

from doanchienthangdev

Comprehensive distributed tracing with Jaeger, Zipkin, OpenTelemetry, correlation IDs, and span design.

[Monitoring]

observability

from doanchienthangdev

Production observability with structured logging, metrics collection, distributed tracing, and alerting

[Monitoring]

instance-actors

from affandar

Managing instance actor orchestrations for PostgreSQL health monitoring. Use when debugging stale actors, restarting actors, or troubleshooting health check issues.

[Monitoring]

profiling-performance

from doanchienthangdev

Performs browser performance profiling with Lighthouse, Core Web Vitals, and DevTools analysis. Use when auditing page performance, optimizing Core Web Vitals, analyzing bundle sizes, or implementing performance budgets.

[Monitoring]

observability

from bigdegenenergy

Observability patterns including logging, metrics, tracing, and alerting. Auto-triggers when implementing monitoring, debugging production issues, or setting up alerts.

[Monitoring]

application-metrics

from PierreZ

Guide for instrumenting applications with metrics. Use when adding observability, monitoring, metrics, counters, gauges, or instrumentation to code. Covers API endpoints, databases, queues, caching, and locks.

[Monitoring]

output-workflow-status

from growthxai

Check the status of an Output SDK workflow execution. Use when monitoring a running workflow, checking if a workflow completed, or determining workflow state (RUNNING, COMPLETED, FAILED, TERMINATED).

[Monitoring]

structured-logging

from Chemiseblanc

Guide for writing effective log messages using wide events / canonical log lines. Use when writing logging code, adding instrumentation, improving observability, or reviewing log statements. Teaches high-cardinality, high-dimensionality structured logging that enables debugging.

[Monitoring]

health-checks

from Mcafee123

Configure health check endpoints for affolterNET.Web.Api. Use when setting up /health endpoints, Kubernetes probes, or monitoring integration.

[Monitoring]

monitoring-logging

from miles990

Application monitoring, logging systems, and alerting

[Monitoring]

error-tracking-integrator

from majiayu000

Adds comprehensive error tracking with Sentry, Rollbar, or similar services including error boundaries, context, and breadcrumbs. Use when user requests error monitoring or mentions production debugging.

[Monitoring]

aiops

from majiayu000

Generic AIOps (AI for IT Operations) patterns and best practices for 2025. Provides comprehensive implementation strategies for intelligent monitoring, automation, incident response, and observability across any infrastructure. Framework-agnostic approach supporting multiple monitoring platforms, cloud providers, and automation tools.

[Monitoring]

latency-tracker

from BarisSozen

Per-call and aggregated latency tracking for MEV infrastructure. Use when implementing performance monitoring or debugging slow operations. Triggers on: latency, timing, performance, slow, speed, instrumentation.

[Monitoring]

route-transition-tracking

from majiayu000

Measure time from navigation to page fully loaded and interactive. Use when tracking SPA navigation, route changes, or slow page transitions.

[Monitoring]

add-sensor

from majiayu000

Use when user wants to add a new sensor to the Enviro+ monitoring system, or asks to monitor a new data point. Guides through importing libraries, initialization, reading sensor values, publishing to Adafruit IO and Home Assistant, updating documentation, testing, and rate limit verification.

[Monitoring]

stream-validator

from majiayu000

Validate WebSocket and HTTP stream health for WaveCap-SDR channels. Use when debugging streaming issues, measuring latency or throughput, detecting packet loss, or verifying audio/spectrum delivery.

[Monitoring]

slo-alerting

from majiayu000

Define SLIs, SLOs, and implement burn-rate alerting

[Monitoring]

azure-dashboard-creator

from majiayu000

Create Azure DevOps dashboards with widgets and metrics. Use when visualizing project metrics or creating team dashboards.

[Monitoring]

error-logger

from BarisSozen

Structured JSON logging with correlation IDs for multi-service systems. Use when implementing logging, debugging failures, or tracing errors across services. Triggers on: add logging, error handling, debug failures, trace errors.

[Monitoring]

health-checks

from majiayu000

Implement liveness, readiness, and dependency health checks

[Monitoring]

user-journey-tracking

from majiayu000

Track user journeys with intent context and friction signals. Use when instrumenting funnels or multi-step flows.

[Monitoring]