Data Guard 26ai – #26: Fast-Start Failover Lag Histogram

This post is part of a blog series.

Oracle Database 26ai introduces V$FS_LAG_HISTOGRAM, a new view that exposes the distribution of Fast-Start Failover (FSFO) lag.

When FSFO is configured (including observe-only mode), it samples the lag every minute, and puts it into buckets to count how often each range occurs. The buckets can be queried from this new view:

Where:

  • LAG_TIME: upper bound of the bucket
  • LAG_COUNT: number of occurrences
  • LAST_UPDATE_TIME: most recent occurrence

So:

  • 5 → 0–5 seconds
  • 10 → 5–10 seconds
  • 15 → 10–15 seconds

This view is particularly useful to tune FastStartFailoverLagLimit in Maximum Performance mode (ASYNC replication).

Why this matters:

Instead of relying on point-in-time lag or implementing external monitoring (which is still a good idea), you can analyze the actual distribution and choose a value that reflects your RPO: failover will not occur if lag exceeds this limit.

A slide with an example select from the view, redundant text with the blog post.

How to set the FastStartFailoverLagLimit based on the buckets

Use the histogram to understand the typical and peak lag behavior, then set FastStartFailoverLagLimit:

  • Slightly above your normal/daily peak lag (if any)
  • In line with your acceptable data-loss window (RPO)
  • Less than or equal to FastStartFailoverThreshold

With proper setting, failovers are not unnecessarily prevented, while still respecting your data protection requirements.

The following two tabs change content below.

Ludovico

Principal Product Manager at Oracle
Ludovico is a member of the Oracle Database High Availability (HA), Scalability & Maximum Availability Architecture (MAA) Product Management team in Oracle. He focuses on Oracle Data Guard, Flashback technologies, and Cloud MAA.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.