Server-driven delay

The problem

When a server rate-limits you (429) or is briefly unavailable (503), it often tells you exactly how long to wait with a Retry-After header. Backing off on your own curve ignores that — you retry too early (and get rejected again) or too late (and waste time).

The solution

getDelay derives the next wait from the error. Return a number of milliseconds to override the computed backoff, or null/undefined to keep it. The backoff curve keeps advancing underneath, so you can fall back to it whenever the hint is missing.

await retry(callApi, {
  times: 5,
  getDelay: (error, { attempt, computedDelay }) => {
    const retryAfter = (error as { retryAfterMs?: number }).retryAfterMs;
    return retryAfter ?? computedDelay; // honour the server, else back off
  },
});

The callback receives the error plus a context of { attempt, computedDelay } — the attempt that just failed and the delay the library would have used.

Try it

The simulated server returns 429 with a Retry-After hint for the first couple of attempts; getDelay waits exactly that long instead of using the backoff.

attempt timelineidle

failed
timed out
bailed
pending
succeeded

The gaps match the server's Retry-After hint, not the backoff curve.

retryAfterMsserver-hinted waittimesretries after the first failurefailUntilattempts that fail before successinitialDelayTimems before the first retry

Press Run to start.

Server-driven delay ​

The problem ​

The solution ​

Try it ​

Server-driven delay

The problem

The solution

Try it