vLLora CLI

Use vLLora from the terminal and from scripts. The CLI is designed for fast iteration, local reproduction, and automation: query traces, export data, or check recent failures without opening a browser or IDE.

The CLI is not "MCP in a terminal." It is built for workflows outside agents and editors, where you want direct command-line access to trace data.

Quick Start

The core workflow has three steps: find a trace, inspect its run, then inspect an individual LLM call.

Find a trace

vllora traces list --last-n-minutes 60 --limit 20

+--------------------------------------+----------------------+------------------+--------+---------------+---------------------+--------------------------------------+--------------------------------------+
| Trace ID                             | Span ID              | Operation        | Status | Duration (ms) | Start Time          | Run ID                               | Thread ID                            |
+--------------------------------------+----------------------+------------------+--------+---------------+---------------------+--------------------------------------+--------------------------------------+
| a7838793-6421-43b9-9dcb-0bc08fc6ab6f | 13919283956904092872 | openai           | ✓ OK   | 14312         | 2025-12-23 05:04:38 | 4ea18f79-4c4c-4d2c-b628-20d510af7181 | 7510b431-109c-42b2-a858-f05c29a4f952 |
+--------------------------------------+----------------------+------------------+--------+---------------+---------------------+--------------------------------------+--------------------------------------+
| a7838793-6421-43b9-9dcb-0bc08fc6ab6f | 314675728497877876   | run              | ✓ OK   | 14320         | 2025-12-23 05:04:38 | 4ea18f79-4c4c-4d2c-b628-20d510af7181 | 7510b431-109c-42b2-a858-f05c29a4f952 |
+--------------------------------------+----------------------+------------------+--------+---------------+---------------------+--------------------------------------+--------------------------------------+
... truncated ...

Inspect the run

vllora traces run-info --run-id <run-id>

Run Overview:
+--------------+--------------------------------------+
| Field        | Value                                |
+--------------+--------------------------------------+
| Run ID       | 4ea18f79-4c4c-4d2c-b628-20d510af7181 |
| Status       | ok                                   |
| Start Time   | 2025-12-23T05:02:52.801745+00:00     |
| Duration     | 120114 ms                            |
| Root Span ID | 10384579106551160164                 |
+--------------+--------------------------------------+

LLM Calls (18):
+----------------------+----------+--------------+----------+-------+
| Span ID              | Provider | Model        | Messages | Tools |
+----------------------+----------+--------------+----------+-------+
| 12495210593948314377 | openai   | gpt-4.1-mini | 30       | 0     |
+----------------------+----------+--------------+----------+-------+
... truncated ...

Inspect an LLM call

vllora traces call-info --span-id <span-id> --output json

{
  "span_id": "12495210593948314377",
  "trace_id": "40c1a59d-5d10-47c5-8e68-65dcf7a31668",
  "run_id": "4ea18f79-4c4c-4d2c-b628-20d510af7181",
  "thread_id": "7510b431-109c-42b2-a858-f05c29a4f952",
  "duration_ms": 1515,
  "costs": "0.0016456000245213508",
  "raw_request": "{\"messages\":[{\"role\":\"system\",\"content\":\"...\"},{\"role\":\"user\",\"content\":[{\"type\":\"text\",\"text\":\"Plan a 5-day trip to Tokyo in April\"}]}],\"model\":\"gpt-4.1-mini\",\"stream\":false,\"temperature\":0.7,\"tool_choice\":\"auto\",\"tools\":[...]}",
  "raw_response": "{\"id\":\"chatcmpl_...\",\"choices\":[{\"index\":0,\"message\":{\"role\":\"assistant\",\"tool_calls\":[{\"id\":\"call_...\",\"type\":\"function\",\"function\":{\"name\":\"research_destination\",\"arguments\":\"{\\\"destination\\\":\\\"Tokyo\\\"}\"}}]},\"finish_reason\":\"tool_calls\"}],\"model\":\"gpt-4.1-mini-2025-04-14\",\"usage\":{\"prompt_tokens\":3910,\"completion_tokens\":51,\"total_tokens\":3961}}"
}
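
In the output above, `raw_request` and `raw_response` are JSON documents stored as strings, so they need a second decoding pass before fields such as token usage can be read. Below is a minimal Python sketch, assuming the JSON has the shape shown above. The helper name `decode_call_info` is hypothetical and not part of vLLora.

```python
import json


def decode_call_info(text: str) -> dict:
    """Parse call-info JSON and decode its string-encoded payloads.

    ASSUMPTION: the field names (raw_request, raw_response) match the
    sample output shown above. This helper is illustrative only.
    """
    info = json.loads(text)
    for key in ("raw_request", "raw_response"):
        value = info.get(key)
        if isinstance(value, str):
            try:
                info[key] = json.loads(value)
            except json.JSONDecodeError:
                # Keep the original string if the payload is not valid JSON.
                pass
    return info
```

Pass it the text printed by `vllora traces call-info --span-id <span-id> --output json`. For a response like the one above, `decode_call_info(text)["raw_response"]["usage"]` then yields the token counts as a dictionary.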

Commands

`traces list`

Search and list traces, filtered by run, thread, operation name, free text, or time window.

vllora traces list [OPTIONS]

Options:

--limit <LIMIT> - Limit number of results (default: 20)
--offset <OFFSET> - Offset for pagination (default: 0)
--run-id <RUN_ID> - Filter by run ID
--thread-id <THREAD_ID> - Filter by thread ID
--operation-name <OPERATION_NAME> - Filter by operation name: run, agent, task, tools, openai, anthropic, bedrock, gemini, model_call
--text <TEXT> - Text search query
--last-n-minutes <LAST_N_MINUTES> - Filter traces from last N minutes
--sort-by <SORT_BY> - Sort by field (default: start_time)
--sort-order <SORT_ORDER> - Sort order: asc or desc (default: desc)
--output <OUTPUT> - Output format: table or json (default: table)

Example:

vllora traces list --last-n-minutes 60 --limit 20
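
The `--limit` and `--offset` options make it possible to page through a larger result set. The Python sketch below builds the command from the flags documented above and walks pages until one comes back short. It assumes that `--output json` prints a JSON array of rows, which this page does not show. `build_list_args`, `iter_pages`, and `run_cli` are hypothetical helper names.

```python
import json
import subprocess
from typing import Callable, Iterator, List, Optional


def build_list_args(limit: int = 20, offset: int = 0,
                    last_n_minutes: Optional[int] = None,
                    operation_name: Optional[str] = None,
                    text: Optional[str] = None) -> List[str]:
    """Build argv for `vllora traces list` from documented flags only."""
    args = ["vllora", "traces", "list",
            "--limit", str(limit), "--offset", str(offset),
            "--output", "json"]
    if last_n_minutes is not None:
        args += ["--last-n-minutes", str(last_n_minutes)]
    if operation_name is not None:
        args += ["--operation-name", operation_name]
    if text is not None:
        args += ["--text", text]
    return args


def run_cli(args: List[str]) -> str:
    """Run the CLI and return stdout; raises CalledProcessError on failure."""
    return subprocess.run(args, check=True, capture_output=True, text=True).stdout


def iter_pages(page_size: int = 20,
               run: Callable[[List[str]], str] = run_cli,
               **filters) -> Iterator[list]:
    """Yield one page of rows at a time until the results run out."""
    offset = 0
    while True:
        page = json.loads(run(build_list_args(limit=page_size, offset=offset, **filters)))
        # ASSUMPTION: the JSON output is an array; stop on anything else.
        if not isinstance(page, list) or not page:
            return
        yield page
        if len(page) < page_size:
            return  # a short page means there is nothing left to fetch
        offset += page_size
```

The `run` parameter exists so the paging logic can be exercised with a stand-in function instead of the real binary.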

`traces call-info`

Get detailed LLM call information for a span.

vllora traces call-info --span-id <SPAN_ID> [OPTIONS]

Options:

--span-id <SPAN_ID> - Span ID (required)
--output <OUTPUT> - Output format: table or json (default: table)

Example:

vllora traces call-info --span-id 12495210593948314377 --output json
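
Because `call-info` takes one span ID at a time, totaling cost or latency across several calls means invoking it once per span and combining the results. The sketch below assumes the `costs` and `duration_ms` fields shown in the Quick Start output. `fetch_call_info` and `summarize_calls` are hypothetical names, and the unit of `costs` is whatever vLLora reports.

```python
import json
import subprocess
from decimal import Decimal
from typing import Callable, Iterable, List


def run_cli(args: List[str]) -> str:
    """Run the CLI and return stdout; raises CalledProcessError on failure."""
    return subprocess.run(args, check=True, capture_output=True, text=True).stdout


def fetch_call_info(span_id: str,
                    run: Callable[[List[str]], str] = run_cli) -> dict:
    """Fetch one LLM call as a dict using the documented flags."""
    return json.loads(run(["vllora", "traces", "call-info",
                           "--span-id", span_id, "--output", "json"]))


def summarize_calls(call_infos: Iterable[dict]) -> dict:
    """Total cost and duration over call-info dicts.

    ASSUMPTION: `costs` is a numeric string and `duration_ms` an integer,
    as in the sample output. Decimal avoids float rounding on small costs.
    """
    total_cost = Decimal("0")
    total_ms = 0
    count = 0
    for info in call_infos:
        total_cost += Decimal(str(info.get("costs", "0")))
        total_ms += int(info.get("duration_ms", 0))
        count += 1
    return {"calls": count, "total_cost": total_cost, "total_duration_ms": total_ms}
```

A typical use would be `summarize_calls(fetch_call_info(s) for s in span_ids)`, with the span IDs taken from the `run-info` table.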

`traces run-info`

Get an overview of a run and its spans.

vllora traces run-info --run-id <RUN_ID> [OPTIONS]

Options:

--run-id <RUN_ID> - Run ID (required)
--output <OUTPUT> - Output format: table or json (default: table)

Example:

vllora traces run-info --run-id 4ea18f79-4c4c-4d2c-b628-20d510af7181

`traces overview`

Get aggregated stats for recent LLM and tool calls.

vllora traces overview --last-n-minutes <LAST_N_MINUTES> [OPTIONS]

Options:

--last-n-minutes <LAST_N_MINUTES> - Number of minutes in the past to include (required)
--output <OUTPUT> - Output format: table or json (default: table)

Example:

vllora traces overview --last-n-minutes 60

When to Use CLI vs Other Methods

The CLI is ideal for:

Terminal workflows - Quick checks without leaving your terminal
Scripts and automation - Monitoring and reporting. Use `--output json` with shell redirection to export, for example: `vllora traces list --last-n-minutes 60 --output json > traces.json`
Local reproduction - Exporting trace data for debugging
Bulk operations - Processing many traces at once
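
For scheduled exports, a script can do the same job as the shell redirection while also checking that the output is valid JSON and never leaving a half-written file behind. This is a sketch under those goals, not a vLLora feature. `export_traces` and `run_cli` are hypothetical names, and only flags documented on this page are used.

```python
import json
import os
import subprocess
import tempfile
from pathlib import Path
from typing import Callable, List


def run_cli(args: List[str]) -> str:
    """Run the CLI and return stdout; raises CalledProcessError on failure."""
    return subprocess.run(args, check=True, capture_output=True, text=True).stdout


def export_traces(path: str, last_n_minutes: int = 60, limit: int = 20,
                  run: Callable[[List[str]], str] = run_cli) -> int:
    """Export recent traces to `path`; returns the number of rows written."""
    args = ["vllora", "traces", "list",
            "--last-n-minutes", str(last_n_minutes),
            "--limit", str(limit), "--output", "json"]
    data = json.loads(run(args))  # fails fast if the output is not JSON
    target = Path(path)
    # Write to a temp file in the same directory, then rename it into
    # place, so readers never observe a partially written export.
    with tempfile.NamedTemporaryFile("w", dir=target.parent, suffix=".tmp",
                                     delete=False, encoding="utf-8") as tmp:
        json.dump(data, tmp, indent=2)
    os.replace(tmp.name, target)
    # ASSUMPTION: the JSON output is an array of rows.
    return len(data) if isinstance(data, list) else 1
```

Calling `export_traces("traces.json")` from a cron job or CI step would then mirror the redirection example above.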

For visual exploration and deep dives, use the Web UI. For debugging from coding agents or IDE tools, use the MCP Server.
