
Company News
/Security News
Socket Selected for OpenAI's Cybersecurity Grant Program
Socket is an initial recipient of OpenAI's Cybersecurity Grant Program, which commits $10M in API credits to defenders securing open source software.
holmesgpt
Advanced tools
Installation |
Docs |
Open-source AI agent for investigating production incidents and finding root causes. Works with any stack — Kubernetes, VMs, cloud providers, databases, and SaaS platforms. We are a Cloud Native Computing Foundation sandbox project. Originally created by Robusta.Dev, with major contributions from Microsoft.
Most AI agents are great at troubleshooting problems, but still need a human to notice something is wrong and trigger an investigation. Operator mode fixes that — HolmesGPT runs in the background 24/7, spots problems before your customers notice, and messages you in Slack with the fix. Connect the GitHub integration and it can even open PRs to fix what it finds.
While the operator itself runs in Kubernetes, health checks can query any data source Holmes is connected to — VMs, cloud services, databases, SaaS platforms, and more.
HolmesGPT uses an agentic loop to query live observability data from multiple sources and identify root causes.

HolmesGPT integrates with popular observability and cloud platforms. The following data sources ("toolsets") are built-in. Add your own.
| Data Source | Notes |
|---|---|
| Azure Kubernetes Service cluster and node health diagnostics | |
| Get status, history and manifests and more of apps, projects and clusters | |
AWS | RDS events, instances, slow query logs, and more (MCP) |
Azure | Azure resources and diagnostics (MCP) |
Azure SQL | Database health, performance, connections, and slow queries |
Confluence | Private runbooks and documentation |
Confluence (MCP) | Private runbooks and documentation (MCP) |
| Retrieve logs for any resource | |
Datadog | Query logs, metrics, and traces |
Docker | Get images, logs, events, history and more |
| Query logs, cluster health, shard and index diagnostics | |
| Google Cloud Platform resources (MCP) | |
GitHub | Repositories, issues, and pull requests (MCP) |
| Build status, pipeline logs, and job history (MCP) | |
| Query and analyze dashboard configurations and panels | |
Helm | Release status, chart metadata, and values |
| Public runbooks, community docs etc | |
Kafka | Fetch metadata, list consumers and topics or find lagging consumer groups |
| Pod logs, K8s events, and resource status (kubectl describe) | |
| Apply fixes like scaling, rollbacks, and resource edits (MCP) | |
| Query logs for Kubernetes resources or any query | |
| MariaDB database queries and diagnostics (MCP) | |
| Query data, diagnose performance, inspect schemas, find slow operations | |
| Cluster health, slow queries, and performance diagnostics | |
NewRelic | Investigate alerts, query tracing data |
| Projects, routes, builds, security context constraints, and deployment configs | |
| Workflow orchestration monitoring, flow runs, and worker health (MCP) | |
| Investigate alerts, query metrics and generate PromQL queries | |
RabbitMQ | Partitions, memory/disk alerts, troubleshoot split-brain scenarios and more |
Robusta | Multi-cluster monitoring, historical change data, runbooks, PromQL graphs and more |
| Query tables and incident records | |
| Error tracking, issues, and performance monitoring (MCP) | |
Slab | Team knowledge base and runbooks on demand |
| Splunk | Log search and analysis (MCP) |
| PostgreSQL, MySQL, ClickHouse, MariaDB, SQL Server, SQLite | |
Tempo | Fetch trace info, debug issues like high latency in application |
See the full list of built-in toolsets for additional integrations including Cilium, KubeVela, Notion, and more.
HolmesGPT can fetch alerts/tickets to investigate from external systems, then write the analysis back to the source or Slack.
| Integration | Status | Notes |
|---|---|---|
| Slack | ✅ | Demo. Available via Robusta |
| Microsoft Teams | ✅ | Available via Robusta |
| Prometheus/AlertManager | ✅ | Robusta or HolmesGPT CLI |
| PagerDuty | ✅ | HolmesGPT CLI only |
| OpsGenie | ✅ | HolmesGPT CLI only |
| Jira | ✅ | HolmesGPT CLI only |
| GitHub | ✅ | HolmesGPT CLI only |
Read the installation documentation to learn how to install HolmesGPT.
Read the LLM Providers documentation to learn how to set up your LLM API key.
See the walkthrough documentation for usage guides, including:
By design, HolmesGPT has read-only access and respects RBAC permissions. It is safe to run in production environments.
Distributed under the Apache 2.0 License. See LICENSE for more information.
Join our community to discuss the HolmesGPT roadmap and share feedback:
If you have any questions, feel free to message us on HolmesGPT Slack Channel
Please read our CONTRIBUTING.md for guidelines and instructions.
For help, contact us on Slack or ask DeepWiki AI your questions.
Please make sure to follow the CNCF code of conduct - details here.
FAQs
Unknown package
We found that holmesgpt demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 2 open source maintainers collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Company News
/Security News
Socket is an initial recipient of OpenAI's Cybersecurity Grant Program, which commits $10M in API credits to defenders securing open source software.

Security News
Socket CEO Feross Aboukhadijeh joins 10 Minutes or Less, a podcast by Ali Rohde, to discuss the recent surge in open source supply chain attacks.

Research
/Security News
Campaign of 108 extensions harvests identities, steals sessions, and adds backdoors to browsers, all tied to the same C2 infrastructure.