Execution Metrics

Overview

When DataFusion executes a query it compiles the logical plan into a tree of physical plan operators (e.g. FilterExec, ProjectionExec, HashAggregateExec). Each operator can record runtime statistics while it runs. These statistics are called execution metrics.

Typical metrics include:

output_rows – number of rows produced by the operator
elapsed_compute – total CPU time (nanoseconds) spent inside the operator
spill_count – number of times the operator spilled data to disk
spilled_bytes – total bytes written to disk during spills
spilled_rows – total rows written to disk during spills

Metrics are collected per-partition: DataFusion may execute each operator in parallel across several partitions. The convenience properties on :py:class:`~datafusion.MetricsSet` (e.g. output_rows, elapsed_compute) automatically sum the named metric across all partitions, giving a single aggregate value for the operator as a whole. You can also access the raw per-partition :py:class:`~datafusion.Metric` objects via :py:meth:`~datafusion.MetricsSet.metrics`.

When Are Metrics Available?

Metrics are populated only after the DataFrame has been executed. Execution is triggered by any of the terminal operations:

:py:meth:`~datafusion.DataFrame.collect`
:py:meth:`~datafusion.DataFrame.collect_partitioned`
:py:meth:`~datafusion.DataFrame.execute_stream`
:py:meth:`~datafusion.DataFrame.execute_stream_partitioned`

Calling :py:meth:`~datafusion.ExecutionPlan.collect_metrics` before execution will return entries with empty (or None) metric sets because the operators have not run yet.

Reading the Physical Plan Tree

:py:meth:`~datafusion.DataFrame.execution_plan` returns the root :py:class:`~datafusion.ExecutionPlan` node of the physical plan tree. The tree mirrors the operator pipeline: the root is typically a projection or coalescing node; its children are filters, aggregates, scans, etc.

The operator_name string returned by :py:meth:`~datafusion.ExecutionPlan.collect_metrics` is the display name of the node, for example "FilterExec: column1@0 > 1". This is the same string you would see when calling plan.display().

Available Metrics

The following metrics are directly accessible as properties on :py:class:`~datafusion.MetricsSet`:

Property	Description
`output_rows`	Number of rows emitted by the operator (summed across partitions).
`elapsed_compute`	CPU time in nanoseconds spent inside the operator's execute loop (summed across partitions).
`spill_count`	Number of spill-to-disk events due to memory pressure (summed across partitions).
`spilled_bytes`	Total bytes written to disk during spills (summed across partitions).
`spilled_rows`	Total rows written to disk during spills (summed across partitions).

Any metric not listed above can be accessed via :py:meth:`~datafusion.MetricsSet.sum_by_name`, or by iterating over the raw :py:class:`~datafusion.Metric` objects returned by :py:meth:`~datafusion.MetricsSet.metrics`.

Labels

A :py:class:`~datafusion.Metric` may carry labels: key/value pairs that provide additional context. For example, some operators tag their output metrics with an output_type label to distinguish between intermediate and final output:

for metric in metrics_set.metrics():
    print(metric.name, metric.labels())
# output_rows  {'output_type': 'final'}

Labels are operator-specific; most metrics have no labels.

End-to-End Example

from datafusion import SessionContext

ctx = SessionContext()
ctx.sql("CREATE TABLE sales AS VALUES (1, 100), (2, 200), (3, 50)")

df = ctx.sql("SELECT * FROM sales WHERE column1 > 1")

# Execute the query — this populates the metrics
results = df.collect()

# Retrieve the physical plan with metrics
plan = df.execution_plan()

# Walk every operator and print its metrics
for operator_name, ms in plan.collect_metrics():
    if ms.output_rows is not None:
        print(f"{operator_name}")
        print(f"  output_rows    = {ms.output_rows}")
        print(f"  elapsed_compute = {ms.elapsed_compute} ns")

# Access raw per-partition metrics
for operator_name, ms in plan.collect_metrics():
    for metric in ms.metrics():
        print(
            f"  partition={metric.partition}  "
            f"{metric.name}={metric.value}  "
            f"labels={metric.labels()}"
        )

API Reference

:py:class:`datafusion.ExecutionPlan` — physical plan node
:py:meth:`datafusion.ExecutionPlan.collect_metrics` — walk the tree and return (operator_name, MetricsSet) pairs
:py:meth:`datafusion.ExecutionPlan.metrics` — return the :py:class:`~datafusion.MetricsSet` for a single node
:py:class:`datafusion.MetricsSet` — aggregated metrics for one operator
:py:class:`datafusion.Metric` — a single per-partition metric value

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Execution Metrics

Overview

When Are Metrics Available?

Reading the Physical Plan Tree

Available Metrics

Labels

End-to-End Example

API Reference

FilesExpand file tree

execution-metrics.rst

Latest commit

History

execution-metrics.rst

File metadata and controls

Execution Metrics

Overview

When Are Metrics Available?

Reading the Physical Plan Tree

Available Metrics

Labels

End-to-End Example

API Reference