CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Project Overview

This is preset-cli, a command-line interface for interacting with Preset (https://preset.io/) workspaces. The CLI allows users to:

Run SQL queries against analytical databases in workspaces
Pull/push resources (databases, datasets, charts, dashboards) using git-like workflow
Sync from dbt Core/Cloud projects to Preset workspaces
Manage users, teams, and workspace permissions
Handle authentication via API tokens or JWT

The project provides three main CLI entry points:

preset-cli: Legacy CLI for interacting with Preset workspaces
superset-cli: Legacy CLI for standalone Superset instances
sup: Modern, git-like CLI with beautiful UX (NEW - primary development focus)

Architecture & Code Structure

Legacy CLI Structure

src/preset_cli/
├── api/clients/          # API client implementations
│   ├── preset.py        # Preset API client
│   └── superset.py      # Superset API client
├── auth/                # Authentication modules
│   ├── preset.py        # Preset-specific auth
│   ├── superset.py      # Superset auth
│   ├── jwt.py           # JWT token handling
│   └── token.py         # Token management
├── cli/                 # CLI command implementations
│   ├── main.py          # Main preset-cli entry point
│   ├── superset/        # Superset-specific commands
│   │   ├── main.py      # superset-cli entry point
│   │   ├── export.py    # Export resources
│   │   ├── sql.py       # SQL execution
│   │   └── sync/        # Synchronization commands
│   │       ├── native/  # Native YAML sync
│   │       └── dbt/     # dbt project sync
│   └── export_users.py  # User export functionality
└── lib.py              # Shared utilities

Modern sup CLI Structure (NEW)

src/sup/
├── main.py                    # Main sup entry point with beautiful branding
├── commands/                  # Entity-focused command modules
│   ├── workspace.py          # Workspace management + cross-workspace config
│   ├── database.py           # Database operations
│   ├── dataset.py            # Dataset management
│   ├── chart.py              # Chart pull/push with dependency management
│   ├── dashboard.py          # Dashboard operations
│   ├── query.py              # Saved query discovery
│   ├── user.py               # User management
│   ├── sql.py                # Direct SQL execution
│   ├── sync.py               # Multi-asset sync operations (NEW)
│   └── config.py             # Configuration management
├── clients/                   # sup-specific client wrappers
│   ├── preset.py             # Wrapped PresetClient with sup UX
│   └── superset.py           # Wrapped SupersetClient with sup UX
├── config/                    # Modern Pydantic configuration
│   ├── settings.py           # Type-safe config models
│   ├── sync.py               # Sync configuration models (NEW)
│   └── paths.py              # Config file path resolution
├── filters/                   # Universal filtering system
│   ├── base.py               # UniversalFilters for all entities
│   └── chart.py              # Chart-specific filters
├── output/                    # Beautiful Rich output system
│   ├── styles.py             # Emerald green Preset branding
│   ├── formatters.py         # Output format handlers
│   ├── tables.py             # Rich table formatting
│   └── spinners.py           # Loading indicators
└── auth/                      # Authentication wrappers
    └── preset.py             # sup-compatible auth

Legacy CLI Components

API Clients: PresetClient and SupersetClient handle REST API interactions
Authentication: Supports JWT tokens, API token/secret pairs, and credential storage
CLI Commands: Built using Click framework with hierarchical command structure
Templating: Uses Jinja2 for parameterized YAML configuration files
Sync Operations: Bidirectional sync between filesystems and workspaces

Modern sup CLI Components (NEW)

Entity Commands: Chart, dashboard, dataset, database, user, workspace, query
Universal Filtering: Consistent --mine, --name, --ids, --limit patterns across all entities
Pull/Push Operations: Git-like asset lifecycle with dependency management
Cross-Workspace Support: Enterprise target-workspace-id for multi-instance sync
Beautiful UX: Rich tables, emerald branding, spinners, progress feedback
Configuration: Type-safe Pydantic models with YAML persistence
Output Formats: Rich tables, JSON, YAML, CSV, porcelain for automation

sup CLI Development (Current Focus)

sup Commands (Production Ready)

# Core workflow
sup workspace list                            # Beautiful Rich tables
sup workspace use 123                         # Set source workspace
sup workspace set-target 456                  # Set push target for cross-workspace sync
sup workspace show                            # Display source + target context

# SQL execution with multiple formats
sup sql "SELECT * FROM users LIMIT 5"        # Rich table output
sup sql "SELECT COUNT(*) FROM sales" --json  # JSON for automation

# Chart lifecycle management (COMPLETE PATTERN - PRODUCTION)
sup chart list --mine --limit 10              # Universal filtering
sup chart pull --mine                         # Pull charts + dependencies to ./assets/
sup chart push --workspace-id=456 --force     # Push to target workspace

# Dashboard lifecycle management (STUB - follows chart pattern)
sup dashboard list --mine                     # Universal filtering
sup dashboard pull --mine                     # Pull dashboards + dependencies (not implemented)
sup dashboard push                            # Push dashboards to workspace (not implemented)

# Dataset lifecycle management (STUB - follows chart pattern)
sup dataset list --search="sales"             # Universal filtering
sup dataset pull --mine                       # Pull datasets + dependencies (not implemented)
sup dataset push                              # Push datasets to workspace (not implemented)

# User management (READ-ONLY)
sup user list --limit 50                      # List users in workspace
# Note: User pull/push intentionally not implemented (security/RBAC concerns)

# Multi-asset sync workflows (NEW - WORKING)
sup sync create ./my_sync --source 123 --targets 456,789  # Create sync config
sup sync run ./my_sync --dry-run                          # Preview operations
sup sync run ./my_sync --pull-only                        # Pull from source
sup sync validate ./my_sync                               # Validate config

# Configuration management
sup config show                               # Display all current settings
sup config set target-workspace-id 789        # Set cross-workspace target

sup Development Status - Pull/Push Terminology

Terminology Standard: All sup commands use pull/push (not import/export) for git-like UX consistency

Entity Pull/Push Status:

✅ chart: Full pull/push implementation with dependency management (PRODUCTION)
🔨 dashboard: Pull command stub added, push not implemented
🔨 dataset: Pull command stub added, push not implemented
🔨 database: No pull/push commands (managed via config files)
❌ user: Intentionally no pull/push (security/RBAC concerns, list-only)
❌ query: No pull/push (discovery-only entity)

Additional Features:

✅ Enterprise Features: Cross-workspace sync, target configuration, safety confirmations
✅ Production Tested: Live integration with real Preset workspaces
✅ Multi-Asset Sync Framework: YAML-based sync configs with automatic dependency resolution
✅ Sync Pull Implementation: Full working implementation using legacy export_resource
🎯 Next: Dashboard/dataset pull implementation, sync push for multi-target workflows

Multi-Asset Sync Implementation (NEW)

Sync Architecture Decisions

Key architectural choices made during sync implementation:

Always Overwrite in Sync: Removed overwrite option from sync config schema
- Sync operations are opinionated: local should match remote
- Users have git for safety (git stash, git diff, --dry-run)
- Eliminates sync config complexity and user confusion
Multi-Asset Type Support: Single sync config can handle multiple asset types
- Each export_zip() call includes dependencies automatically
- Later asset exports overwrite earlier dependency files (beneficial)
- Gets most up-to-date dependency state during sync operation
Atomic Operations via Legacy CLI:
- export_resource() from preset_cli.cli.superset.export provides atomic export
- client.export_zip(resource_name, ids) handles batching and dependencies
- Reuses battle-tested export logic with proper error handling

Sync Configuration Schema

# sync_config.yml - Multi-target sync configuration
source:
  workspace_id: 187
  assets:
    dashboards:
      selection: ids
      ids: [254]
      include_dependencies: true
    charts:
      selection: all
      include_dependencies: true
    # NOTE: No overwrite option - always true in sync operations

target_defaults:
  overwrite: false                    # For push operations
  jinja_context:
    company: Default Company
    region: us-east-1

targets:
  - workspace_id: 456
    name: production
    jinja_context:
      environment: production

Sync Implementation Status

✅ Sync Config Models: Pydantic models with validation
✅ Sync Pull: Working implementation using export_resource()
✅ Multi-Asset Support: Single config handles databases, datasets, charts, dashboards
✅ Dependency Resolution: Automatic via existing export_zip logic
✅ Dry Run: Preview operations without execution
🎯 Sync Push: Connect to existing native() import function
🎯 Jinja Templating: Environment-specific asset customization

dbt Integration Entity Distribution

How dbt capabilities map to sup entities:

sup database pull: dbt profiles → Superset database connections
sup dataset pull: dbt models → Superset datasets (schema, metrics, metadata)
sup chart pull: Superset charts → dbt exposures (usage tracking)
sup dashboard pull: Superset dashboards → dbt exposures (business context)

Required sup config keys for dbt integration:

# dbt Core
dbt-profiles-dir, dbt-project-dir

# dbt Cloud
dbt-cloud-account-id, dbt-cloud-project-id, dbt-cloud-job-id, dbt-cloud-api-token

Legacy CLI Development Commands

Environment Setup

# Activate virtual environment first
source .venv/bin/activate

# Using uv (preferred for fastest installation)
uv pip install -e '.[testing]'

# Or using make (which uses uv)
make install

Testing

# Run all tests with coverage
make test

# Run pytest directly with coverage
pytest --cov=src/preset_cli -vv tests/ --doctest-modules src/preset_cli

Code Quality

# Run pre-commit hooks (linting, formatting, etc.)
make check

# Spell check documentation and code
make spellcheck

Requirements Management

# Update requirements.txt from requirements.in
make requirements.txt

# Update dev-requirements.txt from dev-requirements.in
make dev-requirements.txt

Clean Up

# Remove virtual environment
make clean

CLI Usage Patterns

The CLI uses a common pattern:

Authentication (API tokens, JWT, or interactive prompts)
Workspace/team selection (interactive or via --workspaces/--teams flags)
Command execution with resource-specific options

Example command structure:

preset-cli --workspaces=https://workspace.preset.io/ superset [command] [options]

Key Configuration Files

setup.cfg: Main package configuration, dependencies, and pytest settings
Makefile: Development workflow automation
pyproject.toml: Build system configuration
.pre-commit-config.yaml: Code quality hooks
tox.ini: Testing environments configuration

Authentication Flow

Check for JWT token in environment (PRESET_JWT_TOKEN)
Check for API credentials in environment (PRESET_API_TOKEN, PRESET_API_SECRET)
Look for stored credentials in system-dependent location
Prompt user interactively and optionally store credentials

Testing Framework

Uses pytest with coverage reporting
Includes doctests in source modules
Mock objects for API interactions
Test coverage target configured in setup.cfg

Dependencies

Key dependencies include:

Click for CLI framework
PyYAML for configuration parsing
Jinja2 for templating
SQLAlchemy for database operations
Requests for HTTP clients
Rich for terminal formatting

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md

Project Overview

Architecture & Code Structure

Legacy CLI Structure

Modern sup CLI Structure (NEW)

Legacy CLI Components

Modern sup CLI Components (NEW)

sup CLI Development (Current Focus)

sup Commands (Production Ready)

sup Development Status - Pull/Push Terminology

Multi-Asset Sync Implementation (NEW)

Sync Architecture Decisions

Sync Configuration Schema

Sync Implementation Status

dbt Integration Entity Distribution

Legacy CLI Development Commands

Environment Setup

Testing

Code Quality

Requirements Management

Clean Up

CLI Usage Patterns

Key Configuration Files

Authentication Flow

Testing Framework

Dependencies

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md

Project Overview

Architecture & Code Structure

Legacy CLI Structure

Modern sup CLI Structure (NEW)

Legacy CLI Components

Modern sup CLI Components (NEW)

sup CLI Development (Current Focus)

sup Commands (Production Ready)

sup Development Status - Pull/Push Terminology

Multi-Asset Sync Implementation (NEW)

Sync Architecture Decisions

Sync Configuration Schema

Sync Implementation Status

dbt Integration Entity Distribution

Legacy CLI Development Commands

Environment Setup

Testing

Code Quality

Requirements Management

Clean Up

CLI Usage Patterns

Key Configuration Files

Authentication Flow

Testing Framework

Dependencies