Skip to content

Conversation

@hhoikoo
Copy link
Member

@hhoikoo hhoikoo commented Feb 11, 2026

resolves #8774 (BA-4377)

Overview

Raises Tempo's max_bytes_per_trace from the default 5 MB to 50 MB so that large OTel GraphQL traces are ingested instead of being rejected with TRACE_TOO_LARGE.

Problem Statement

  • With OTel distributed tracing enabled in the Manager (BA-4330) and per-resolver spans in GraphQL middleware (BA-4376), complex queries produce traces exceeding Tempo's 5 MB default limit
  • Tempo rejects these with TRACE_TOO_LARGE and drops the entire trace, making it impossible to debug the queries that need investigation most
  • The 50 MB limit provides generous headroom given that BA-4376's leaf-type span suppression already eliminates ~92% of resolver spans

Checklist: (if applicable)

  • Milestone metadata specifying the target backport version
  • Mention to the original issue
  • Installer updates including:
    • Fixtures for db schema changes
    • New mandatory config options
  • Update of end-to-end CLI integration tests in ai.backend.test
  • API server-client counterparts (e.g., manager API -> client SDK)
  • Test case(s) to:
    • Demonstrate the difference of before/after
    • Demonstrate the flow of abstract/conceptual models with a concrete implementation
  • Documentation
    • Contents in the docs directory
    • docstrings in public interfaces and type annotations

Raise Tempo's max_bytes_per_trace from default 5 MB to 50 MB
in tempo-config.yaml. This prevents TRACE_TOO_LARGE rejections
for OTel GraphQL traces from Manager that exceed the default
limit.

The configuration change applies to the global overrides section
and includes an inline comment documenting the rationale.
Copilot AI review requested due to automatic review settings February 11, 2026 10:53
@github-actions github-actions bot added the size:XS ~10 LoC label Feb 11, 2026
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates the local Grafana Tempo configuration to allow ingesting significantly larger traces, preventing OTel GraphQL traces from being dropped with TRACE_TOO_LARGE.

Changes:

  • Increase Tempo per-trace ingestion limit by setting overrides.defaults.global.max_bytes_per_trace to 50,000,000 bytes (50 MB).

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@hhoikoo hhoikoo marked this pull request as draft February 11, 2026 11:04
Add towncrier changelog entry documenting the increase of
Tempo's max_bytes_per_trace configuration from 5 MB to 50 MB.
This change accommodates larger OpenTelemetry GraphQL traces
that were previously being truncated.
@hhoikoo hhoikoo marked this pull request as ready for review February 12, 2026 01:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size:XS ~10 LoC

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Increase Tempo max_bytes_per_trace to accommodate OTel GraphQL traces

1 participant