chore: add actor-level retry for infra errors by SantiagoPittella · Pull Request #2062 · 0xMiden/node

SantiagoPittella · 2026-05-07T17:35:24Z

closes #2052.

The NTX builder was permanently dropping notes after a run of unrelated infrastructure failures because every error type counted against the per-note max_note_attempts cap.

This PR classifies NtxError into infrastructure vs. intrinsic. Infra failures now retry the same candidate after a exponential backoff (using backon) without touching per-note state. Intrinsic failures keep the existing behaviour.

This PR also introduces a dependency for backoff, backon, and note that I didn't used the retry feature (probably the most important part) because we use a tokio::select! that needs to interleave several other things around each retry. Making use of retry will require further "plumbing" changes.

…ding

Mirko-von-Leipzig · 2026-05-12T13:58:08Z

+                            },
+                            ExecutionOutcome::IntrinsicFailure => {
+                                self.reset_infra_backoff();
+                                self.mode = ActorMode::NoViableNotes;


I think this is incorrect? A failure doesn't mean that there are notes to consume?

Mirko-von-Leipzig · 2026-05-12T14:19:43Z

There are three different backoff variations I can think of

Actor based (this impl)

Request based i.e. each request to the prover has its own backoff which resets next tx

Client based i.e. shared by all requests and all actors

I'm actually unsure which we should be doing.. It sort of makes sense to me that it would be client based, but also maybe not? But if one considers a service that rate limits us, then it doesn't care about the distinction, beyond the entire ntxb should slow down.

However, maybe per request makes the most sense.. otherwise we share backoffs for unrelated infrastructure e.g. prover, block-producer, validator..

@SantiagoPittella I think it might be more correct to have retries at the request level itself. At that stage, it doesn't really make sense to have an error kind like this, since you'd be managing different errors at each request point.

The current intrinsic errors would also be invoked at a specific point only, so either an is_intrinsic() -> bool or an inline match make sense to me.

We should go over the list to ensure we are doing the correct thing. Do we want to reject prover errors? Maybe? If its caused by a bug.. a problem becomes what if the bug is in the prover, and it gets fixed? Then the note won't be retried though it would now pass. I'm unsure what the correct answer is.

Hey, sorry for the late response. I'm changing to a per-request retry

With this approach I did a better use of the retry crate that you mentioend in the feature

chore: add actor-level retry for infra errors

e57d2a7

SantiagoPittella requested review from Mirko-von-Leipzig and kkovaacs May 7, 2026 17:35

Merge branch 'main' into santiagopittella-fix-ntx-builder-note-discar…

1651506

…ding

Mirko-von-Leipzig reviewed May 12, 2026

View reviewed changes

per-request approach

6ce6ed4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: add actor-level retry for infra errors#2062

chore: add actor-level retry for infra errors#2062
SantiagoPittella wants to merge 3 commits into
mainfrom
santiagopittella-fix-ntx-builder-note-discarding

SantiagoPittella commented May 7, 2026 •

edited

Loading

Uh oh!

Mirko-von-Leipzig May 12, 2026

Uh oh!

Mirko-von-Leipzig May 12, 2026

Uh oh!

Mirko-von-Leipzig May 12, 2026

Uh oh!

Mirko-von-Leipzig May 13, 2026

Uh oh!

SantiagoPittella May 14, 2026

Uh oh!

SantiagoPittella May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

SantiagoPittella commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Mirko-von-Leipzig May 12, 2026

Choose a reason for hiding this comment

Uh oh!

Mirko-von-Leipzig May 12, 2026

Choose a reason for hiding this comment

Uh oh!

Mirko-von-Leipzig May 12, 2026

Choose a reason for hiding this comment

Uh oh!

Mirko-von-Leipzig May 13, 2026

Choose a reason for hiding this comment

Uh oh!

SantiagoPittella May 14, 2026

Choose a reason for hiding this comment

Uh oh!

SantiagoPittella May 14, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SantiagoPittella commented May 7, 2026 •

edited

Loading