Attention Heatmap

A visualization showing which parts of an image or video frame attract viewer attention.

An attention heatmap is a visualization that shows which regions of an image or video frame are likely to attract viewer attention. Areas that draw the eye are highlighted in warm colors (red, orange, yellow), while areas that are ignored appear in cool colors (blue, green) or are uncolored.

Attention heatmaps are generated by two main methods:

  1. Eye-tracking studies: Actual measurements of where participants look
  2. Saliency models: Computer vision models that predict fixation based on visual features like contrast, color, edges, and motion

Traditional saliency models use bottom-up visual processing rules to predict attention. More advanced models incorporate top-down factors like face detection and semantic understanding.

For video creators, attention heatmaps reveal whether viewers are looking at the most important elements of each frame — the speaker's face, the product, the text overlay. Misaligned attention (viewers drawn to background clutter rather than the subject) is a common but invisible engagement killer.

Unlike single-image heatmaps, video attention heatmaps must account for motion, temporal context, and how attention shifts between frames. VidCognition integrates attention heatmap predictions into its frame-by-frame analysis, showing creators exactly where attention is being directed at each moment — and flagging frames where attention is split or misdirected.