Explainer: Layout Instability Metric
Overview
Many websites suffer from layout instability: DOM elements shift around
as content loads asynchronously.
We propose a way for the user agent to measure layout instability during a
browsing session and compute "layout shift scores", which would be exposed by a
new interface in the Performance API.
Layout Shift Score
For each animation frame (a.k.a. "rendering update"), the user agent
computes a layout shift (LS) score approximating the severity of visible
layout instability in the document during that frame. An animation frame with
no layout instability has an LS score of 0. Higher LS scores correspond to
greater instability.
The LS score is based on a set of shifting nodes and two intermediate values,
the impact fraction and the distance fraction.
Shifting Nodes
A shifting node is a DOM node whose visual representation starts in a
different location than it did in the previous animation frame for a reason
other than a transform change or scrolling.
"Starts" refers here to the node's flow-relative offset, for example its
top left corner in a horizontal left-to-right writing mode.
The visual representation of a node is the space occupied by its
box fragments (for elements) or line boxes (for text nodes).
Note that:
- A node that changes in size (for example, by having children appended),
  but starts at the same offset, is not a shifting node.
- A node whose start location changes two or more times during the same
  animation frame (for example, from forced synchronous layouts), but is
  ultimately painted at the same location as in the previous frame, is not
  a shifting node.
Transform Changes
Changing an element's transform affects its visual representation. However, because
- transform changes don't reflow surrounding content,
- transform changes are a common target of fluid animations, and
- animated transform changes are easily rendered with hardware-accelerated
  compositing on a separate thread from the browser's layout and script
  execution tasks,
the layout instability metric doesn't treat transform-changing elements, or
their descendants, as shifting nodes (unless their layout is affected in some
other way at the same time).
Scrolling
For a node to be a shifting node, its start location must change relative to
the document origin, the viewport, and every containing scrollable area. This
ensures that
- scrolling the document doesn't produce a layout shift for an ordinary element
  (though this changes its location relative to the viewport);
- scrolling with a position: fixed element doesn't produce a layout shift
  (though this changes the fixed element's location relative to the document
  origin); and
- scrolling an overflow: scroll container doesn't produce a layout shift
  (though this changes the locations of descendant elements
  relative to both the viewport and the document origin).
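The scrolling rule above can be sketched as a check across reference frames. This is only an illustration: the function name startChangedEverywhere and the shape of its input (one previous and one current start offset per reference frame) are assumptions made for this example, not part of the proposed API.

```javascript
// Hypothetical sketch: `frames` maps a reference-frame name to the node's
// start offset before and after the animation frame.
function startChangedEverywhere(frames) {
  return Object.values(frames).every(
    ({ prev, curr }) => prev.x !== curr.x || prev.y !== curr.y
  );
}

// Scrolling the document by 100px: the element moves relative to the
// viewport but not relative to the document origin, so it is not a shift.
const scrolled = startChangedEverywhere({
  document: { prev: { x: 0, y: 300 }, curr: { x: 0, y: 300 } },
  viewport: { prev: { x: 0, y: 300 }, curr: { x: 0, y: 200 } },
});

// Content inserted above pushes the element down by 50px with no scrolling:
// its start changes in every reference frame, so it is a shift.
const pushed = startChangedEverywhere({
  document: { prev: { x: 0, y: 300 }, curr: { x: 0, y: 350 } },
  viewport: { prev: { x: 0, y: 300 }, curr: { x: 0, y: 350 } },
});

console.log(scrolled, pushed); // false true
```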
Impact Fraction
The impact region of an animation frame is the geometric union of the
previous-frame and current-frame visual representations, intersected with the
viewport, of all shifting nodes in that frame.
The impact fraction of an animation frame is the fraction of the viewport that
is occupied by the impact region.
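Example: An element that occupies half the viewport shifts by a distance equal
to half its height and stays within the viewport. The union of its previous and
current positions covers three quarters of the viewport, so the impact fraction
for this animation frame is 0.75.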
Distance Fraction
The move distance of a shifting node is the distance it has moved on
the horizontal or vertical axis (whichever is greater), relative to the viewport.
The distance fraction of an animation frame is the greatest move distance
of any shifting node in that frame, divided by the width or height
(whichever is greater) of the viewport.
Example: The most-shifted element moved a distance equal to one quarter of the
viewport's larger dimension. The distance fraction for this animation frame is 0.25.
The intent of incorporating the distance fraction into the LS score calculation
is to avoid overly penalizing cases where large elements shift by small
distances.
LS Score Calculation
The layout shift (LS) score is equal to the impact fraction multiplied by the
distance fraction.
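As a worked illustration, the two examples above can be combined in a short sketch. It handles only a single shifting node (the general case needs the union over all shifting nodes), and the helper names and the {x, y, width, height} rect shape are assumptions for this example rather than anything defined by the proposal.

```javascript
// Rects are {x, y, width, height} in viewport coordinates (assumed shape).
function intersect(a, b) {
  const x = Math.max(a.x, b.x);
  const y = Math.max(a.y, b.y);
  const right = Math.min(a.x + a.width, b.x + b.width);
  const bottom = Math.min(a.y + a.height, b.y + b.height);
  return { x, y, width: Math.max(0, right - x), height: Math.max(0, bottom - y) };
}

const area = (r) => r.width * r.height;

// LS score for one shifting node, given its previous and current rects.
function layoutShiftScore(prev, curr, viewport) {
  const p = intersect(prev, viewport);
  const c = intersect(curr, viewport);
  // Area of the union = sum of both areas minus their overlap.
  const impactArea = area(p) + area(c) - area(intersect(p, c));
  const impactFraction = impactArea / area(viewport);
  const moveDistance = Math.max(
    Math.abs(curr.x - prev.x),
    Math.abs(curr.y - prev.y)
  );
  const distanceFraction =
    moveDistance / Math.max(viewport.width, viewport.height);
  return { impactFraction, distanceFraction, score: impactFraction * distanceFraction };
}

// An element covering the top half of a 1000x1000 viewport moves down 250px.
const result = layoutShiftScore(
  { x: 0, y: 0, width: 1000, height: 500 },
  { x: 0, y: 250, width: 1000, height: 500 },
  { x: 0, y: 0, width: 1000, height: 1000 }
);
console.log(result); // impactFraction 0.75, distanceFraction 0.25, score 0.1875
```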
Performance API
Animation frames with non-zero LS scores will notify a registered
PerformanceObserver. The observer's callback receives one or more
LayoutShift entries:
interface LayoutShift : PerformanceEntry {
  double value;
  boolean hadRecentInput;
  DOMHighResTimeStamp lastInputTime;
  sequence<LayoutShiftAttribution> sources;
};
The entry's value attribute is the LS score. Its entryType attribute is
"layout-shift".
The hadRecentInput and lastInputTime attributes are described in
Recent Input Exclusion.
The sources attribute is described in Source Attribution.
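A minimal sketch of observing these entries follows. The helper summarizeLayoutShift is a hypothetical name introduced here, and the feature check is an assumption about how a page might guard against environments that do not support the "layout-shift" entry type.

```javascript
// Copy the attributes described above into a plain object for logging.
function summarizeLayoutShift(entry) {
  return {
    score: entry.value,
    hadRecentInput: entry.hadRecentInput,
    lastInputTime: entry.lastInputTime,
    sourceCount: entry.sources ? entry.sources.length : 0,
  };
}

// Register only where the entry type is supported, so the snippet is a
// no-op elsewhere.
const supported =
  typeof PerformanceObserver !== "undefined" &&
  (PerformanceObserver.supportedEntryTypes || []).includes("layout-shift");

if (supported) {
  new PerformanceObserver((list) => {
    for (const entry of list.getEntries()) {
      console.log(summarizeLayoutShift(entry));
    }
  }).observe({ type: "layout-shift", buffered: true });
}
```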
Cumulative Scores
The user agent can compute a document cumulative layout shift (DCLS) score
as the sum of the document's LS scores for each animation frame that has occurred
during the browsing session. The DCLS score is 0 when the document begins
loading, and grows whenever layout instability occurs. The DCLS score does not
account for layout instability inside descendant browsing contexts, such as
those created by <iframe> elements.
The user agent can compute a cumulative layout shift (CLS) score for a
top-level browsing context by adding the LS scores of the top-level browsing
context to the weighted LS scores of its descendant browsing contexts. In
performing this aggregation, the LS score of a layout shift in an <iframe>
should be weighted by the fraction of the top-level viewport the <iframe>
occupies at the time the layout shift occurs.
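The weighting described above could look roughly like the sketch below. The function name and the input shapes (a list of top-level LS scores, plus subframe shifts that each carry a viewportFraction) are assumptions for illustration; the proposal does not define how subframe scores are collected.

```javascript
// Hypothetical inputs: `topLevelScores` is an array of LS scores from the
// top-level document; `frameShifts` holds shifts reported by subframes,
// each with the fraction of the top-level viewport its <iframe> occupied
// when the shift occurred.
function cumulativeLayoutShift(topLevelScores, frameShifts) {
  const own = topLevelScores.reduce((sum, score) => sum + score, 0);
  const weighted = frameShifts.reduce(
    (sum, { score, viewportFraction }) => sum + score * viewportFraction,
    0
  );
  return own + weighted;
}

const cls = cumulativeLayoutShift(
  [0.25, 0.125],
  [{ score: 0.5, viewportFraction: 0.25 }]
);
console.log(cls); // 0.5
```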
The DCLS and CLS scores are not directly exposed by the Performance API,
but we hope to make it easy for developers to construct these from the LS scores.
Recent Input Exclusion
In calculating DCLS and CLS scores, developers and user agents may wish to
exclude LS scores from animation frames that occur shortly after
UI events such as taps, key presses, and mouse clicks. This allows the page
to modify its layout in response to the event.
To facilitate this exclusion, the LayoutShift entry has attributes
indicating when such input last occurred, and whether it should be considered
"recent" for the purpose of the exclusion.
The hadRecentInput attribute is true when the last input occurred within
the past 500 ms. It should be treated as a hint to ignore the layout shift in
calculating the DCLS and CLS scores. This threshold was chosen to allow the
page to make asynchronous rendering updates as a result of the input, as long
as they occur without excessive delay. Developers wishing to implement a
different threshold can do so by examining the lastInputTime attribute.
Events caused by pointer movement or scrolling do not count as "input" for the
purpose of the recent input exclusion and the input-related attributes on the
LayoutShift entry.
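A custom threshold based on lastInputTime might be sketched as follows. The function name hadInputWithin is hypothetical, and the sketch assumes that lastInputTime is 0 when no qualifying input has occurred and that it shares a time origin with the entry's startTime.

```javascript
// Returns true when the shift should be excluded under a custom threshold.
// Assumes lastInputTime is 0 when no qualifying input has occurred, and that
// startTime and lastInputTime share the same time origin.
function hadInputWithin(entry, thresholdMs) {
  if (!entry.lastInputTime) return false;
  return entry.startTime - entry.lastInputTime < thresholdMs;
}

// Mock entry: the shift happened 300 ms after the last input.
const sampleEntry = { startTime: 1300, lastInputTime: 1000, value: 0.1 };
console.log(hadInputWithin(sampleEntry, 500)); // true
console.log(hadInputWithin(sampleEntry, 200)); // false
```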
Source Attribution
NOTE: The sources attribute is currently only available in Chrome 84+ with
"Experimental Web Platform features" enabled (chrome://flags).
On a complex website, it can be difficult to understand the cause of a high
CLS score given only the numeric values in the value attribute of the
LayoutShift entries.
To aid that effort, the sources attribute connects the LayoutShift
back to the specific DOM elements that experienced the shift. This gives
the developer more insight into the causes of layout instability on their site.
The sources attribute is an array of up to 5 LayoutShiftAttribution objects:
interface LayoutShiftAttribution {
  Node node;
  DOMRect previousRect;
  DOMRect currentRect;
};
Each attribution contains a reference to a shifted DOM node
along with rects that describe its visual representation in the viewport
before and after the shift.
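One way a developer might consume these attributions is sketched below. The helper describeSources is a hypothetical name, the guard for a missing node is a defensive assumption, and the mock entry only stands in for a real LayoutShift.

```javascript
// Summarize each attribution as the node's name and how far it moved.
function describeSources(entry) {
  return (entry.sources || []).map(({ node, previousRect, currentRect }) => ({
    // Defensive guard (an assumption): the node reference may be missing,
    // for example if the node is no longer in the document.
    node: node ? node.nodeName : "(unavailable)",
    dx: currentRect.x - previousRect.x,
    dy: currentRect.y - previousRect.y,
  }));
}

// Mock entry standing in for a real LayoutShift.
const mockEntry = {
  sources: [
    {
      node: { nodeName: "IMG" },
      previousRect: { x: 0, y: 100, width: 300, height: 200 },
      currentRect: { x: 0, y: 180, width: 300, height: 200 },
    },
  ],
};
const described = describeSources(mockEntry);
console.log(described); // one item: IMG moved by dx 0, dy 80
```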
Prioritization by Impact
Many nodes may shift in a single animation frame, but the user agent
selects no more than 5 to attribute in sources, and tries to avoid
redundancy. The method of selection follows these principles:
- If two nodes have shifted, and one fully contains the other (visually), only
  the larger node is attributed. This means, for example, that if a container
  node shifts, we would not generally need to attribute all of its descendants,
  even though they too have shifted.
- If, after the elimination described above, there are still more than 5 shifted
  nodes eligible for attribution, they are prioritized by the size of their
  contribution to the impact region. That is, nodes
  occupying a greater area within the viewport are preferred.
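The two principles above can be illustrated with a small sketch. It is not the user agent's actual algorithm: the candidate shape ({ name, rect, impactArea }), the function names, and the tie-break for identical rects are all assumptions made for this example.

```javascript
const MAX_SOURCES = 5;

// True when rect `outer` fully contains rect `inner`.
function contains(outer, inner) {
  return (
    outer.x <= inner.x &&
    outer.y <= inner.y &&
    outer.x + outer.width >= inner.x + inner.width &&
    outer.y + outer.height >= inner.y + inner.height
  );
}

// Hypothetical candidate shape: { name, rect, impactArea }.
function selectSources(candidates) {
  // Principle 1: drop any candidate fully contained by another one.
  const survivors = candidates.filter((c, i) =>
    !candidates.some((other, j) =>
      j !== i &&
      contains(other.rect, c.rect) &&
      // For identical rects, keep only the earlier candidate.
      (!contains(c.rect, other.rect) || j < i)
    )
  );
  // Principle 2: prefer larger contributions to the impact region.
  return survivors
    .sort((a, b) => b.impactArea - a.impactArea)
    .slice(0, MAX_SOURCES);
}

const selected = selectSources([
  { name: "child", rect: { x: 10, y: 10, width: 20, height: 20 }, impactArea: 400 },
  { name: "sidebar", rect: { x: 200, y: 0, width: 50, height: 50 }, impactArea: 2500 },
  { name: "container", rect: { x: 0, y: 0, width: 100, height: 100 }, impactArea: 10000 },
]);
console.log(selected.map((c) => c.name)); // [ 'container', 'sidebar' ]
```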
We limit the number of attributions to 5 for the following reasons:
- In a large DOM, many nodes may shift at once, and it may be infeasible
  for user agents to report the full set of shifted nodes in a performant way.
- Receiving the full set of shifted nodes may be cumbersome for developers,
  and could encourage them to write non-performant code to examine such a set.
- Given the hierarchical nature of the DOM, surfacing a small number of
  high-level shifted elements is usually sufficient to understand the cause of
  layout instability. Limiting to 5 with prioritization improves the
  signal-to-noise ratio of the report.
Caveat: Causality
It is possible that the true "root cause" of instability will be only
indirectly related to the DOM element that experiences a layout shift.
For example, if a newly inserted element shifts content below it,
the sources attribute will report only the shifted elements,
and not the inserted element.
We do not believe it is feasible for the user agent to understand
causes of instability at the level of indirection necessary
for a meaningful "root cause" attribution. However, we expect that
the more straightforward reporting of shifted elements in sources
will nevertheless be of significant value to developers
who are attempting to diagnose an occurrence of layout instability.
Specification
The updates to the Layout Instability API specification to incorporate
and explain the sources attribute are tracked in issue #11.
Computing DCLS with the API
The developer can compute the DCLS score by summing the LS scores:
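// Accumulate the document cumulative layout shift (DCLS) score.
addEventListener("load", () => {
  let DCLS = 0;
  new PerformanceObserver((list) => {
    list.getEntries().forEach((entry) => {
      // Skip shifts that closely follow user input.
      if (entry.hadRecentInput)
        return;
      DCLS += entry.value;
    });
  // buffered: true also delivers shifts that occurred before registration.
  }).observe({type: "layout-shift", buffered: true});
});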
By passing buffered: true to observe, the observer is immediately notified
of any layout shifts that occurred before it was registered. (Layout shift
entries are not available from the Performance Timeline through
getEntriesByType.)
A "final" DCLS score for the user's session can be reported by listening to the
visibilitychange event and using the value of DCLS at that time.
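A minimal sketch of such reporting follows. The helper createDclsReporter and the "/analytics" endpoint are hypothetical, and sending the score only once, the first time the page becomes hidden, is a simplifying assumption; a page that becomes visible again may accumulate further shifts.

```javascript
// Accumulates LS scores and hands the total to `send` at most once.
function createDclsReporter(send) {
  let total = 0;
  let sent = false;
  return {
    add(entry) {
      if (!entry.hadRecentInput) total += entry.value;
    },
    flush() {
      if (sent) return;
      sent = true;
      send(total);
    },
  };
}

// Browser wiring; skipped where `document` is not available.
if (typeof document !== "undefined" && typeof PerformanceObserver !== "undefined") {
  const reporter = createDclsReporter((score) =>
    // "/analytics" is a placeholder endpoint for this example.
    navigator.sendBeacon("/analytics", JSON.stringify({ dcls: score }))
  );
  new PerformanceObserver((list) => {
    list.getEntries().forEach((entry) => reporter.add(entry));
  }).observe({ type: "layout-shift", buffered: true });
  document.addEventListener("visibilitychange", () => {
    if (document.visibilityState === "hidden") reporter.flush();
  });
}
```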
A demo page illustrating the use of this code can be viewed in Chrome 76+ with
the command-line flag --enable-blink-features=LayoutInstabilityAPI, or in
Chrome 73-75 with the command-line flag --enable-blink-features=LayoutJankAPI.
Limitations
The presence of "layout instability" as defined by this metric correlates
imperfectly with the user experience of "jumpy" websites.
It's possible for a website to seem jumpy, but score well on CLS. For example,
rebuilding the DOM with entirely new elements does not trigger a layout shift.
Conversely, it's possible for a website to provide a smooth user experience, but
score poorly on CLS. For example, an image carousel that animates a layout
property such as left will produce a layout shift on every frame of the
animation. (Carousel authors should use transform instead, which avoids the
layout shift, and also enables off-thread accelerated compositing.)
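As a rough illustration of the carousel advice, the sketch below slides a track using transform rather than left. The helper slideKeyframes, the ".carousel-track" selector, and the timing values are placeholders chosen for this example.

```javascript
// Builds keyframes that slide left by `offsetPx` using transform only.
function slideKeyframes(offsetPx) {
  return [
    { transform: "translateX(0px)" },
    { transform: `translateX(${-offsetPx}px)` },
  ];
}

// Browser usage; ".carousel-track" is a placeholder selector.
if (typeof document !== "undefined") {
  const track = document.querySelector(".carousel-track");
  if (track && track.animate) {
    track.animate(slideKeyframes(400), {
      duration: 300,
      easing: "ease-out",
      fill: "forwards",
    });
  }
}
```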
The metric tries to make some allowances (transform changes,
recent input) for visual updates that are not likely
to negatively impact the user experience. But these are in essence heuristics,
and not guaranteed to work well in every case.
Precision, Variance, and Evolution
We provide a reasonably precise method of computing scores for layout instability,
but the score remains an approximation of the user experience.
We expect developers to use the score as a signal, and not to depend on its exact
numeric value in a way that would make the correctness of their page sensitive
to minor deviations in the score.
The user agent may trade off precision for efficiency in the computation of
LS scores. It is intended that the LS score have a correspondence to the
perceptual severity of the instability, but not that all user agents produce
exactly the same LS scores for a given page.
We expect the definition of the layout instability metric to evolve over time;
it should not be considered "frozen" merely because a spec has been produced.
We hope that such evolution can occur with sufficient cooperation between
implementers, so that browsers do not vary so significantly that developers
must choose between optimizing for one implementation over another.
Privacy and Security
Layout instability bears an indirect relationship to resource timing, as slow
resources could cause intermediate layouts that would not otherwise be
performed. Resource timing information can be used by malicious websites for
statistical fingerprinting.
The layout instability API only reports layout shifts in the current browsing
context (frame). It does not directly provide the CLS score incorporating
subframes. Developers can implement such aggregation manually, but browsing
contexts with different origins would need to cooperate to share LS scores.
Terminology
The "layout instability metric" was previously called the "layout stability
metric".
"Layout instability" and "layout shift" were previously referred to as
"layout jank". The impact region was previously referred to as the "jank
region". The LS score was previously referred to as the "jank fraction".
The DCLS score and CLS score were previously referred to as
"(aggregate) jank score".
The LayoutShift interface was previously implemented as PerformanceLayoutJank.
Its "value" attribute was previously named "fraction", and its entryType was
previously "layoutJank".
The layout instability API is an extension of the web performance API, but it is
not related to the speed or timing of layout computation.