Gestalt theory, originating from psychology, emphasizes that humans perceive visual elements as organized patterns or wholes rather than just a collection of individual parts.
In data visualization, applying Gestalt principles can enhance the clarity and effectiveness of visual representations.
Proximity: Elements that are close to each other are perceived as related. In data visualization, grouping related data points or categories can help viewers quickly understand relationships.
Similarity: Elements that look similar are perceived as part of the same group. Using consistent colors, shapes, or sizes for related data can reinforce connections.
Continuity: The eye is drawn to continuous lines and patterns. Designing visualizations with smooth transitions and flows can guide viewers through the data more intuitively.
Closure: The mind tends to fill in missing information to create a complete image. Visualizations that suggest shapes or patterns can engage viewers and encourage them to explore the data further.
Figure-Ground: This principle involves distinguishing an object (figure) from its background (ground). Effective use of contrast and spacing can help highlight important data points.
By leveraging these Gestalt principles, data visualizations can become more intuitive, making it easier for viewers to interpret and derive insights from complex datasets.
A scatter plot that uses color coding to group related data points, applying the principle of similarity.
import matplotlib.pyplot as plt
import numpy as np
# Sample data
x = np.random.rand(50)
y = np.random.rand(50)
colors = np.random.choice(['red', 'blue'], size=50)
plt.scatter(x, y, c=colors)
plt.title('Scatter Plot with Similarity Principle')
plt.show()
import matplotlib.pyplot as plt
# Sample data
categories = ['A', 'B', 'C', 'D']
values1 = [5, 7, 3, 4]
values2 = [6, 8, 4, 5]
x = np.arange(len(categories))
width = 0.35
plt.bar(x - width/2, values1, width, label='Group 1')
plt.bar(x + width/2, values2, width, label='Group 2')
plt.xticks(x, categories)
plt.title('Bar Chart with Proximity Principle')
plt.legend()
plt.show()
import matplotlib.pyplot as plt
# Sample data
x = np.linspace(0, 10, 100)
y = np.sin(x)
plt.plot(x, y)
plt.title('Line Graph with Continuity Principle')
plt.show()
import matplotlib.pyplot as plt
# Sample data
sizes = [30, 20, 25, 15]
labels = ['A', 'B', 'C', 'D']
plt.pie(sizes, labels=labels, startangle=90)
plt.title('Pie Chart with Closure Principle')
plt.show()
import seaborn as sns
import numpy as np
import matplotlib.pyplot as plt
# Sample data
data = np.random.rand(10, 10)
sns.heatmap(data, cmap='coolwarm')
plt.title('Heatmap with Figure-Ground Principle')
plt.show()
tSNE and UMAP visualizations often leverage Gestalt principles to help users identify clusters and patterns in high-dimensional data.
Also makes interpretation of tSNE plots difficult when these principles are not effectively applied.
Dashboards and data reporting tools use these principles to create intuitive layouts that facilitate quick comprehension of key metrics.
Interactive visualizations (plotly) often incorporate Gestalt principles to enhance user engagement and understanding, such as through hover effects that highlight related data points.
Gestalt principles are part of a broader set of visual perception principles that influence how we interpret visual information. Other important principles include:
Preattentive Processing: Certain visual properties (like color, shape, and size) are processed rapidly by the human brain before conscious attention is directed. Effective data visualizations leverage preattentive attributes to highlight important information quickly.
Color Theory: The use of color can significantly impact the readability and interpretability of data visualizations. Understanding color relationships (complementary, analogous, etc.) helps in creating visually appealing and effective graphics.
Visual Hierarchy: Establishing a clear visual hierarchy through size, color, and placement helps guide the viewer’s eye to the most important elements first.
Balance and Alignment: Proper alignment and balance in design contribute to a cohesive and organized appearance, making it easier for viewers to process information.
Contrast: Using contrast effectively can help differentiate between different data points or categories, enhancing clarity.
Use blue in large regions
Use red and green in the centers of attention
🧩🚀 Concept edges of retina not sensitive to red and green
Use black, white and yellow in peripheral regions
Visual variables are attributes that can be manipulated to represent data in visualizations. Common visual variables include:
Position: The location of an element on a chart or graph, often used to represent quantitative data.
Size: The dimensions of an element, which can indicate magnitude or importance.
Shape: Different shapes can represent different categories or types of data.
Color: Color can convey information about categories, values, or intensity.
Orientation: The angle or direction of an element can provide additional context or meaning.
Texture: Patterns or textures can differentiate between areas in a visualization.
Glyphs are visual representations that combine multiple visual variables to encode complex data. Examples of glyphs include:
Chernoff Faces: Use facial features (like eyes, nose, mouth) to represent multivariate data.
Star Plots: Use radial lines extending from a central point to represent multiple variables.
Glyph Maps: Combine position with other visual variables to represent data on geographical maps.
Change blindness is a phenomenon where significant changes in a visual scene go unnoticed by the observer.
This occurs because our attention is limited, and we often focus on specific elements while ignoring others.
In data visualization, change blindness can impact how effectively information is communicated.
Jacques Bertin — Semiology of Graphics (excerpt PDF) link
⚠️ NOTE Depth perception in 2D visualizations can be enhanced using retinal variables.
apparent movement of objects when the observer changes position
a decrease in the size of object
a decrease in the intensity of color
a change in the texture or pattern of an object
a change in contrast between an object and its background
a change in orientation or shape of an object
🤔❓Retinal variables are visual properties that can be manipulated to convey information in data visualizations. They include:
Color Hue: Different colors can represent different categories or types of data.
Color Intensity: The brightness or saturation of a color can indicate magnitude or importance.
Size: The dimensions of an element can represent quantitative values.
Shape: Different shapes can be used to differentiate between categories.
Orientation: The angle or direction of an element can provide additional context.
Texture/Pattern: Patterns can be used to distinguish between areas in a visualization.
Page 7 of PDF here - Semiology of Graphics by Jacques Bertin
Associative visualizations use visual elements that are closely related to the data being represented, making it easier for viewers to understand the information. Examples include using icons or images that directly relate to the data.
Dissociative visualizations use abstract or unrelated visual elements, which may require viewers to interpret the data without direct visual cues. Examples include using geometric shapes or colors that do not have an inherent connection to the data.
PDF pg 10 - Semiology of Graphics by Jacques Bertin
In data visualization, the vibratory effect is that annoying optical illusion where a dense collection of points or lines seems to shimmer, shake, or “vibrate” on the screen. It’s essentially the visual equivalent of static noise.
Here is the quick breakdown of why it happens and why it matters:
The Cause: It usually occurs when points are placed in a high-density, repetitive, or strictly geometric grid. When the spacing between points nears the limits of the eye’s resolution, the brain struggles to process the gaps, leading to a perceived “flicker.”
🤔❓ The Chartjunk Factor: Edward Tufte famously categorized this as a form of unnecessary visual clutter. It creates Moire patterns that distract the viewer from the actual data trends.

page 14 of PDF here - Semiology of Graphics by Jacques Bertin
Orientation of symbols matter.
Limit the number of categories of orientation
See diagram below

Does this remind you of an ancient art/writing form?
🎉 🥳 Here is a fun image (AI generated of course) of what data visualization may have looked like in ancient Egypt!

There are infinite shapes
Shape variation is associative
Shape variation can be used to reveal similar elements
Shape variation is not selective i.e. cannot be used to answer where is a similar shape in a different region
Concept 🧩🚀 The meaning of a symbol becomes familiar to us only through habit.
John Stasko — Visual Perception (CS7450 lecture notes, Georgia Tech)
link
Jacques Bertin — Semiology of Graphics (excerpt PDF)
link
Alberto Cairo — The Functional Art (sample pages)
link
Preattentive properties & Gestalt perception (lecture slides)
link
Gestalt Principles in Data Visualization (student paper)
link
Exploring the use of Gestalt’s principles (ResearchGate / case study)
link
Interaction Design Foundation — What are the Gestalt Principles?
link
Principles of Data Visualization — Artificium (PDF)
link
Lecture: Pre-attentive attributes & Gestalt (PDF)
link
The Gestalt Principles — Data Action Lab (handout PDF)
link