Telemetry and metrics

Virtual MCP Server (vMCP) is instrumented with OpenTelemetry. You can export traces and metrics to monitor backend operations and workflow executions.

Telemetry types

vMCP supports two types of telemetry:

Traces: Track requests across vMCP and its backends, showing the full path of tool calls, resource reads, and workflow executions
Metrics: Counters and histograms for backend request rates, error rates, and latency distributions

For general ToolHive observability concepts including trace structure and metrics, see the observability overview.

Enable telemetry

Configure telemetry in the VirtualMCPServer resource using the `spec.config.telemetry` field:

apiVersion: toolhive.stacklok.dev/v1alpha1
kind: VirtualMCPServer
metadata:
  name: my-vmcp
  namespace: toolhive-system
spec:
  config:
    groupRef: my-group
    telemetry:
      endpoint: 'otel-collector:4318'
      serviceName: 'my-vmcp'
      insecure: true
      tracingEnabled: true
      samplingRate: '0.05'
      metricsEnabled: true
      enablePrometheusMetricsPath: true
  incomingAuth:
    type: anonymous

Configuration options

Field	Description	Default
`endpoint`	OTLP collector endpoint (hostname:port)	-
`serviceName`	Service name in traces and metrics	VirtualMCPServer name
`tracingEnabled`	Enable tracing	`false`
`metricsEnabled`	Enable OTLP metrics export	`false`
`samplingRate`	Trace sampling rate (0.0-1.0)	`"0.05"`
`insecure`	Use HTTP instead of HTTPS	`false`
`enablePrometheusMetricsPath`	Expose `/metrics` endpoint	`false`
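
To illustrate how these fields combine, here is a minimal sketch of a telemetry block for a development environment that records every trace. It omits `serviceName`, so the VirtualMCPServer name is used. The collector address is the same assumed `otel-collector:4318` used in the example above; adjust it to match your own collector Service.

```yaml
# Sketch of a development-oriented telemetry block (values are illustrative).
spec:
  config:
    telemetry:
      endpoint: 'otel-collector:4318' # assumed collector address (hostname:port)
      insecure: true                  # plain HTTP; suitable only on a trusted network
      tracingEnabled: true
      samplingRate: '1.0'             # sample every trace; a quoted string like the default "0.05"
      metricsEnabled: true
```

For production traffic, a lower sampling rate such as the default keeps trace volume manageable.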

Export to observability backends

Export to Jaeger via OpenTelemetry Collector

Deploy an OpenTelemetry Collector configured to export to Jaeger:

otel-collector-config.yaml
receivers:
  otlp:
    protocols:
      http:
        endpoint: 0.0.0.0:4318

processors:
  batch:
    timeout: 10s
    send_batch_size: 1024

exporters:
  otlp/jaeger:
    endpoint: jaeger:4317
    tls:
      insecure: true

service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [batch]
      exporters: [otlp/jaeger]

Then configure vMCP to send telemetry to the collector:

spec:
  config:
    telemetry:
      endpoint: 'otel-collector:4318'
      serviceName: 'production-vmcp'
      tracingEnabled: true
      metricsEnabled: true
      insecure: true
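
The collector configuration above defines only a traces pipeline, so metrics pushed with `metricsEnabled: true` also need a metrics pipeline in the collector. The following is a minimal sketch that assumes your collector build includes the `debug` exporter, which writes received telemetry to the collector's own log. The exporter choice is an assumption for illustration; replace it with the exporter for your metrics backend.

```yaml
# Sketch: otel-collector-config.yaml exporters and service sections
# extended with a metrics pipeline. The debug exporter is a placeholder.
exporters:
  otlp/jaeger:
    endpoint: jaeger:4317
    tls:
      insecure: true
  debug:
    verbosity: basic # log a summary of received metrics

service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [batch]
      exporters: [otlp/jaeger]
    metrics:
      receivers: [otlp]
      processors: [batch]
      exporters: [debug]
```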

Metrics collection

vMCP supports two methods for collecting metrics:

Push via OpenTelemetry: Set `metricsEnabled: true` to push metrics to your OTel Collector via OTLP
Pull via Prometheus: Set `enablePrometheusMetricsPath: true` to expose a `/metrics` endpoint on the vMCP service port (4483) for Prometheus to scrape
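
For the pull method, a Prometheus scrape job could look like the sketch below. The target hostname is hypothetical: it assumes a Service named `my-vmcp` in the `toolhive-system` namespace, which may not match the Service name in your cluster. Only the port (4483) and the `/metrics` path come from this guide.

```yaml
# prometheus.yml (sketch)
scrape_configs:
  - job_name: 'vmcp'
    metrics_path: /metrics
    scrape_interval: 30s
    static_configs:
      # Hypothetical Service DNS name; replace with your vMCP Service.
      - targets: ['my-vmcp.toolhive-system.svc.cluster.local:4483']
```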

Backend metrics

These metrics track requests to individual MCP server backends:

Metric	Type	Description
`toolhive_vmcp_backends_discovered`	Gauge	Number of backends discovered
`toolhive_vmcp_backend_requests`	Counter	Total requests per backend
`toolhive_vmcp_backend_errors`	Counter	Total errors per backend
`toolhive_vmcp_backend_requests_duration`	Histogram	Duration of backend requests
`mcp.client.operation.duration`	Histogram	MCP client operation duration (OTel semantic convention)
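
As one way to use these metrics, the sketch below defines a Prometheus recording rule for the overall backend error ratio. It assumes the counters appear in Prometheus under exactly the names listed above. Exposed series names can differ (for example, counters may carry a `_total` suffix), so check the output of the `/metrics` endpoint before relying on it. The rule and group names are hypothetical.

```yaml
# vmcp-rules.yaml (sketch; metric names assumed to match the table above)
groups:
  - name: vmcp-backend
    rules:
      # Fraction of backend requests that failed over the last 5 minutes.
      - record: vmcp:backend_error_ratio:rate5m
        expr: |
          sum(rate(toolhive_vmcp_backend_errors[5m]))
          /
          sum(rate(toolhive_vmcp_backend_requests[5m]))
```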

Workflow metrics

These metrics track workflow execution across backends:

Metric	Type	Description
`toolhive_vmcp_workflow_executions`	Counter	Total workflow executions
`toolhive_vmcp_workflow_errors`	Counter	Total workflow execution errors
`toolhive_vmcp_workflow_duration`	Histogram	Duration of workflow executions

Optimizer metrics

When the vMCP optimizer is enabled, these metrics track tool-finding and tool-calling performance:

Metric	Type	Description
`toolhive_vmcp_optimizer_find_tool_requests`	Counter	Total FindTool calls
`toolhive_vmcp_optimizer_find_tool_errors`	Counter	Total FindTool errors
`toolhive_vmcp_optimizer_find_tool_duration`	Histogram	Duration of FindTool calls
`toolhive_vmcp_optimizer_find_tool_results`	Histogram	Number of tools returned per call
`toolhive_vmcp_optimizer_token_savings_percent`	Histogram	Token savings percentage per call
`toolhive_vmcp_optimizer_call_tool_requests`	Counter	Total CallTool calls
`toolhive_vmcp_optimizer_call_tool_errors`	Counter	Total CallTool errors
`toolhive_vmcp_optimizer_call_tool_not_found`	Counter	CallTool calls where tool was not found
`toolhive_vmcp_optimizer_call_tool_duration`	Histogram	Duration of CallTool calls

Distributed tracing

vMCP creates client-side spans for backend operations with the following span names:

`tools/call <tool_name>` - Tool calls to backends
`resources/read` - Resource reads from backends
`prompts/get <prompt_name>` - Prompt retrieval from backends
`list_capabilities` - Backend capability discovery

Each span includes attributes for the target backend (`target.workload_id`, `target.workload_name`, `target.base_url`) and the relevant MCP attributes (`mcp.method.name`, `gen_ai.tool.name`, `mcp.resource.uri`).
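
To make the attribute layout concrete, here is an illustrative tool-call span written as YAML. The attribute names come from the description above. Every value, including the tool name `fetch` and the backend URL, is hypothetical, and the layout is not an export format.

```yaml
# Illustrative only: all values are hypothetical.
span:
  name: 'tools/call fetch'
  kind: client
  attributes:
    target.workload_id: 'backend-1'
    target.workload_name: 'fetch-server'
    target.base_url: 'http://fetch-server:8080'
    mcp.method.name: 'tools/call'
    gen_ai.tool.name: 'fetch'
```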

Related information

Observability concepts - Overview of ToolHive's observability architecture
Kubernetes telemetry guide - Telemetry for MCPServer resources
OpenTelemetry tutorial - Set up a local observability stack
