visualization_lecture

🎮🛠️ Exercise on interactive visualizations and visual storytelling (Superintelligence)

We show how to create interactive visualizations
How to bridge the gap between abstract technical concepts and data-driven storytelling? Below is a structured exercise designed for a Python-based data visualization class.
Reading Michale Nielsen superintelligence risk
Blog post on superintelligence
Paper and resources co-created with students. Link forthcoming.
Game for paperclip optimization problem in AGI

Interested in learning more? See the Cambridge AI Safety Hub

Exercise: The Velocity Gap – A Narrative Data Storytelling Challenge

Objective

In this exercise, you will use Python to model a speculative future. You will generate synthetic data representing the growth of AI benefits versus AI harms, constrained by “Institutional Inertia.” Your goal is to create a compelling visualization and a 300-word narrative that explains the “Velocity Gap” to a non-technical audience.

🎮💡 Activity

Add one slide here
Create a narrative around AGI and superintelligence using the code below.

Interactive Python simulator

Open the following code in a Google Colab notebook

or in a standalone python script.

This uses ipywidgets.

python -m venv venv_viz
source activate venv_viz
pip install matplotlib numpy ipywidgets plotly

%matplotlib inline
import matplotlib.pyplot as plt
import numpy as np
from ipywidgets import interact, widgets

def plot_velocity_gap(harm_growth, benefit_ceiling, inst_speed, policy_lag):
    time = np.linspace(0, 25, 250)
    
    # Models
    y_harm = 0.5 * np.exp(harm_growth * time)
    midpoint = 10 + policy_lag
    y_benefit = benefit_ceiling / (1 + np.exp(-inst_speed * (time - midpoint)))
    
    # Calculate Risk Score (Area between curves)
    risk_score = np.trapz(np.maximum(0, y_harm - y_benefit), time)
    
    # Plotting
    plt.figure(figsize=(10, 6))
    plt.plot(time, y_harm, color='red', lw=2, label='Harmful AI Potential')
    plt.plot(time, y_benefit, color='blue', lw=2, label='Realized AI Benefits')
    
    # Fill the gap
    plt.fill_between(time, y_benefit, y_harm, where=(y_harm > y_benefit), 
                     color='red', alpha=0.1, label='The Velocity Gap')
    
    # Formatting
    plt.title(f"AI Velocity Gap | Cumulative Risk: {risk_score:.2f}", fontsize=14)
    plt.xlabel("Years from AGI Emergence")
    plt.ylabel("Impact Magnitude")
    plt.ylim(0, min(max(y_harm)*1.1, 300))
    plt.grid(True, linestyle='--', alpha=0.6)
    plt.legend()
    
    plt.show()

# Interactive Sliders
interact(
    plot_velocity_gap,
    harm_growth = widgets.FloatSlider(value=0.25, min=0.1, max=0.4, step=0.01),
    benefit_ceiling = widgets.IntSlider(value=50, min=10, max=100, step=5),
    inst_speed = widgets.FloatSlider(value=0.4, min=0.1, max=1.0, step=0.05),
    policy_lag = widgets.IntSlider(value=0, min=-5, max=10, step=1)
);

Part 1: The Synthetic Data Generator

Use the following Python script to generate your dataset. This script simulates two trajectories:

“Harmful AI Potential”: Exponential growth driven by rapid, unregulated code deployment.
“Realized AI Benefits”: Logistic (S-curve) growth, representing the friction of policy, safety trials, and human consensus.

import pandas as pd
import numpy as np

def generate_ai_narrative_data(years=20, seed=42):
    np.random.seed(seed)
    time = np.linspace(0, years, 100)
    
    # Scenario A: Exponential Harm (unregulated)
    # Grows at 30% annually
    harm_trajectory = 0.5 * np.exp(0.25 * time) + np.random.normal(0, 1, 100).cumsum() * 0.2
    
    # Scenario B: Sluggish Benefits (Institutional Friction)
    # Logistic growth: starts strong, but hits the 'Consensus Ceiling'
    L = 15 # Maximum realized benefit
    k = 0.4 # Growth rate
    x0 = 10 # Midpoint of adoption
    benefit_trajectory = L / (1 + np.exp(-k * (time - x0))) + np.random.normal(0, 0.2, 100)
    
    df = pd.DataFrame({
        'Year': 2024 + time,
        'Harmful_Potential': np.maximum(0, harm_trajectory),
        'Realized_Benefits': np.maximum(0, benefit_trajectory)
    })
    return df

# Students: Start your analysis here
df = generate_ai_narrative_data()
print(df.head())

Part 2: The Scenarios

Choose one of the following “Institutional Environments” to model. Adjust the parameters in the code (or manually perturb the data) to reflect your chosen story:

Scenario 1: “The Great Stagnation” – Policy becomes so gridlocked that the Realized_Benefits curve plateaus early (at $L=5$), while Harmful_Potential accelerates.
Scenario 2: “The Alignment Sprint” – A global treaty is signed in year 10. The Harmful_Potential curve should show a sudden “kink” or drop, while Realized_Benefits continues its slow climb.
Scenario 3: “The Double-Edged Sword” – Both curves are identical for 10 years, then diverge sharply as an ASI (Artificial Superintelligence) is reached.

Part 3: The Narrative Visual Task

Your submission must include a single, publication-quality plot created with Matplotlib, Seaborn, or Plotly that adheres to the following storytelling principles:

Visual Hierarchy: Use color to distinguish between “Benefit” (calm, stable) and “Harm” (urgent, alarming).
Annotation as Narrative: Do not just plot lines. Add at least three text annotations to the plot that mark “The Consensus Crisis,” “The Policy Lag,” or “The Velocity Gap.”
The “So What?” Factor: Use a title that is a statement, not a description. (e.g., “Why Policy Inertia Makes AI Risks Grow Faster Than its Rewards” instead of “AI Growth Plot”).

Part 4: The Written Story (300 Words)

Write a short “news from the future” article (dated 2040) based on your plot.

How did the “Velocity Gap” manifest?
What specific institution (e.g., the FDA, UN, or Patent Office) was the bottleneck?
What was the consequence of the harm curve outpacing the benefit curve?

Evaluation Rubric

Criteria	Excellent (5/5)	Developing (3/5)
Technical Execution	Clean, bug-free Python code; effective use of libraries.	Code runs but has redundant steps or poor formatting.
Data Storytelling	Annotations and colors guide the eye to the “Velocity Gap.”	Plot is technically correct but lacks context or narrative.
Insight & Narrative	The story explains why the curves diverge based on Michael Nielsen’s theories.	The story is generic and doesn’t connect to the data.
Aesthetics	Professional styling (no default settings), clear labels, and high contrast.	Default Matplotlib colors; overlapping text or unreadable labels.

Interactive notebook

This is a complete Python structure designed to be copied directly into a Jupyter Notebook or Google Colab. It uses Plotly to create an interactive experience where students can hover over data points to see the “Institutional Bottlenecks” and “Unregulated Risks” at specific moments in time.

Notebook: The AI Velocity Gap Interactive Story

Overview

This notebook explores the “Velocity Gap” — the divergence between the exponential growth of AI risks and the linear/logistic growth of AI benefits. You will generate synthetic data, visualize it interactively, and annotate the “friction points” where human institutions struggle to keep pace.

Step 1: Setup and Data Generation

We will generate a dataset spanning from 2024 to 2044.

Harmful Potential: Modeled as an exponential curve.
Realized Benefits: Modeled as a Logistic (S-curve) to simulate initial excitement followed by a “Consensus Plateau.”

import pandas as pd
import numpy as np
import plotly.graph_objects as go

def generate_velocity_data(years=20):
    np.random.seed(42)
    time = np.linspace(0, years, 200)
    
    # 1. Exponential Harm (Speed of Code)
    harm = 0.8 * np.exp(0.22 * time) + np.random.normal(0, 0.5, 200).cumsum() * 0.1
    
    # 2. Logistic Benefits (Institutional Speed)
    # L = max benefit, k = growth rate, x0 = midpoint
    L, k, x0 = 18, 0.35, 10
    benefits = L / (1 + np.exp(-k * (time - x0))) + np.random.normal(0, 0.1, 200)
    
    df = pd.DataFrame({
        'Year': 2024 + time,
        'Harmful_Potential': np.maximum(0, harm),
        'Realized_Benefits': np.maximum(0, benefits),
        'Gap': np.maximum(0, harm - benefits)
    })
    
    # Adding 'Event' labels for interactivity
    df['Event'] = ""
    df.loc[30, 'Event'] = "First Major AI-Driven Bank Run"
    df.loc[100, 'Event'] = "UN Global Consensus Summit (Deadlocked)"
    df.loc[150, 'Event'] = "Institutional Stagnation Peak"
    
    return df

df = generate_velocity_data()
df.head()

Step 2: Create the Interactive Plot

In this step, we use graph_objects to create a dual-layered story. Hover over the lines to see the widening “Velocity Gap.”

# Create the figure
fig = go.Figure()

# Add the Harmful Potential Trace
fig.add_trace(go.Scatter(
    x=df['Year'], y=df['Harmful_Potential'],
    mode='lines',
    name='Harmful AI Potential',
    line=dict(color='#ef4444', width=4),
    hovertemplate='<b>Year %{x:.1f}</b><br>Harm Level: %{y:.2f}<extra></extra>'
))

# Add the Realized Benefits Trace
fig.add_trace(go.Scatter(
    x=df['Year'], y=df['Realized_Benefits'],
    mode='lines',
    name='Realized AI Benefits (Institutional)',
    line=dict(color='#3b82f6', width=4),
    fill='tonexty', # Fills the "Gap" between the two lines
    fillcolor='rgba(239, 68, 68, 0.1)', 
    hovertemplate='<b>Year %{x:.1f}</b><br>Benefit Level: %{y:.2f}<extra></extra>'
))

# Add markers for specific "Historical Events"
events = df[df['Event'] != ""]
fig.add_trace(go.Scatter(
    x=events['Year'], y=events['Harmful_Potential'],
    mode='markers+text',
    name='Critical Milestones',
    text=events['Event'],
    textposition="top left",
    marker=dict(color='black', size=10, symbol='x')
))

# Update Layout for Storytelling
fig.update_layout(
    title={
        'text': "<b>The Velocity Gap: Why AI Risk Outpaces Policy</b><br><span style='font-size:14px; color:gray'>Exponential technical harms vs. Logistic institutional benefits</span>",
        'y':0.95, 'x':0.5, 'xanchor': 'center', 'yanchor': 'top'
    },
    xaxis_title="Timeline (Years)",
    yaxis_title="Impact Magnitude",
    hovermode="x unified",
    template="plotly_white",
    legend=dict(orientation="h", yanchor="bottom", y=1.02, xanchor="right", x=1),
    shapes=[
        # Highlight the "Divergence Point"
        dict(type="rect", xref="x", yref="paper",
             x0=2034, y0=0, x1=2044, y1=1,
             fillcolor="LightSalmon", opacity=0.1, layer="below", line_width=0),
    ]
)

# Add a narrative annotation
fig.add_annotation(
    x=2038, y=35,
    text="<b>The Velocity Gap</b><br>Risk grows permissionless;<br>Benefits require consensus.",
    showarrow=True, arrowhead=2,
    ax=40, ay=-40,
    bordercolor="#c7c7c7", borderwidth=2, borderpad=4, bgcolor="#ffffff", opacity=0.8
)

fig.show()

Step 3: Critical Analysis (Discussion Questions)

The Divergence Point: Around the year 2034, the red line surpasses the blue line. Based on Michael Nielsen’s essay, what specific “social technologies” could we invent to move the blue line upward?
The Shape of the Curve: Why is a Logistic Curve (S-curve) a more realistic model for AI benefits than a straight line? (Hint: Think about FDA approvals or international treaties).
The Shaded Area: The shaded red area represents the “Unmanaged Risk.” If you were a policymaker, what is the maximum “Gap” you would allow before pausing AI development?

Student Task: Narrative Extension

Modify the code above to simulate a “Policy Breakthrough” Scenario.

Change the benefit_trajectory parameters so that the curve doesn’t plateau at 18, but continues to grow linearly after year 10.
Add a new annotation to the plot marking where “Global Consensus” was reached.
Export your plot as an HTML file and write a 200-word summary explaining how your new data changes the future outcome.

How to use this in class:

Live Coding: Run the data generation block first and ask students to guess what the plot will look like.
Parameter Tweak: Have students change the k (growth rate) in the logistic function to see how “faster bureaucracy” changes the outcome.
Visualization Critique: Discuss why we used a “fill” between the lines (to visually represent the cost of the gap).

🎮💡 Activity

Now read this blog on how we are all connected to Ramanujan
Implement the core formula probabilty = e^(x * a / b)

where

x = number of molecules breathed by us

a = number of molecules breathed by Ramanujan during his lifetime

b = Number of molecules in the Earth’s atmosphere

Design an interactive simulation that varies x, a and b and looks at how it changes the probability that we breathe a molecule inhaled by Ramanujan.
🤔❓How can you innovatively visualize a probability?

Play around with this interactive Google Colab notebook

Interactive math exercise

ChatGPT has a feature to allow you to interactively learn math exercises
Design an interactive notebook in Python using the ipywidgets package to understand the Pythagoras theorem
Solution is shown below

# Interactive Pythagoras Notebook (ipywidgets)
# -------------------------------------------------
# Run this file in a Jupyter environment (JupyterLab / Jupyter Notebook)
# Requires: numpy, matplotlib, ipywidgets, IPython. Optional: sympy
#
# Features:
# - Interactive sliders for legs a and b
# - Live plotting of the right triangle and squares on each side
# - Numeric and symbolic checks that a^2 + b^2 = c^2
# - Short exercises and an auto-generated quiz with answer checking

import math
import numpy as np
import matplotlib.pyplot as plt
from matplotlib.patches import Polygon
from matplotlib.collections import PatchCollection
from ipywidgets import FloatSlider, IntSlider, Checkbox, Button, Output, HBox, VBox, Label, Layout, interactive_output, Dropdown, BoundedFloatText, FloatText
from IPython.display import display, Markdown, HTML

# ---------- Helper geometry utilities ----------

def hypotenuse(a, b):
    return math.hypot(a, b)


def centroid_triangle(p1, p2, p3):
    return ((p1[0]+p2[0]+p3[0])/3.0, (p1[1]+p2[1]+p3[1])/3.0)


def square_coords(p1, p2, outward_point):
    """
    Given segment p1->p2, return coordinates of the square built on that segment.
    outward_point is a point inside the triangle (so we can choose the square direction away from the triangle).
    """
    p1 = np.array(p1, dtype=float)
    p2 = np.array(p2, dtype=float)
    v = p2 - p1
    L = np.linalg.norm(v)
    if L == 0:
        return [p1.tolist()]
    # perpendicular vector of same length
    perp = np.array([-v[1], v[0]])
    perp = perp / np.linalg.norm(perp) * L
    midpoint = (p1 + p2) / 2.0
    # choose perp direction so it points away from triangle centroid
    dir_test = outward_point - midpoint
    if np.dot(perp, dir_test) > 0:
        perp = -perp
    q1 = p1 + perp
    q2 = p2 + perp
    # square coords (p1 -> p2 -> q2 -> q1)
    return [tuple(p1.tolist()), tuple(p2.tolist()), tuple(q2.tolist()), tuple(q1.tolist())]


# ---------- Plotting function ----------

def plot_triangle_and_squares(a, b, show_squares=True, annotate=True, figsize=(6,6)):
    """Plot a right triangle with legs a (x-axis) and b (y-axis) and the squares on each side."""
    # points
    A = (0.0, 0.0)       # right angle at origin
    B = (a, 0.0)         # point on x-axis
    C = (0.0, b)         # point on y-axis
    c = hypotenuse(a, b)

    # prepare figure
    fig, ax = plt.subplots(figsize=figsize)
    ax.set_aspect('equal', adjustable='box')

    # triangle
    tri = Polygon([A, B, C], closed=True)
    ax.add_patch(Polygon([A, B, C], closed=True, fill=True, alpha=0.15, edgecolor='black'))

    patches = []
    colors = []

    centroid = np.array(centroid_triangle(A,B,C))

    if show_squares:
        # square on AB (length a)
        sq_ab = square_coords(A, B, outward_point=centroid)
        patches.append(Polygon(sq_ab, closed=True))
        colors.append(0.6)

        # square on AC (length b)
        sq_ac = square_coords(A, C, outward_point=centroid)
        patches.append(Polygon(sq_ac, closed=True))
        colors.append(0.4)

        # square on BC (hypotenuse length c)
        sq_bc = square_coords(B, C, outward_point=centroid)
        patches.append(Polygon(sq_bc, closed=True))
        colors.append(0.2)

        p = PatchCollection(patches, alpha=0.18, edgecolor='black')
        ax.add_collection(p)

    # plot points and edges
    xs = [A[0], B[0], C[0], A[0]]
    ys = [A[1], B[1], C[1], A[1]]
    ax.plot(xs, ys, '-k')
    ax.scatter([A[0],B[0],C[0]],[A[1],B[1],C[1]], zorder=20)
    ax.text(A[0], A[1], ' A (right angle)', fontsize=9, va='top', ha='left')
    ax.text(B[0], B[1], f' B ({a:.2f},0)', fontsize=9, va='bottom', ha='center')
    ax.text(C[0], C[1], f' C (0,{b:.2f})', fontsize=9, va='center', ha='right')

    # annotation for lengths
    mid_ab = ((A[0]+B[0])/2.0, (A[1]+B[1])/2.0)
    mid_ac = ((A[0]+C[0])/2.0, (A[1]+C[1])/2.0)
    mid_bc = ((B[0]+C[0])/2.0, (B[1]+C[1])/2.0)
    ax.text(mid_ab[0], mid_ab[1]-0.05*max(a,b,1), f'a = {a:.2f}', ha='center')
    ax.text(mid_ac[0]-0.05*max(a,b,1), mid_ac[1], f'b = {b:.2f}', va='center', rotation=90)
    ax.text(mid_bc[0]+0.02*max(a,b,1), mid_bc[1]+0.02*max(a,b,1), f'c = {c:.2f}')

    # axes limits
    pad = max(a,b)*0.6 + 0.5
    ax.set_xlim(-pad, max(a,b)+pad)
    ax.set_ylim(-pad, max(a,b)+pad)
    ax.set_xlabel('x')
    ax.set_ylabel('y')
    ax.set_title('Right triangle with legs a and b; squares on each side')
    ax.grid(True, alpha=0.3)

    if annotate:
        # Area texts: squares areas
        ax.text(0.02, 0.98, f'a^2 = {a**2:.3f}\nb^2 = {b**2:.3f}\nc^2 = {c**2:.3f}\na^2 + b^2 = {(a**2+b**2):.3f}', transform=ax.transAxes,
                fontsize=10, va='top', bbox=dict(boxstyle='round', alpha=0.12))

    plt.show()


# ---------- Interactive widgets and layout ----------

# Widget definitions
a_slider = FloatSlider(value=3.0, min=0.1, max=12.0, step=0.1, description='a (leg):', continuous_update=True, layout=Layout(width='380px'))
b_slider = FloatSlider(value=4.0, min=0.1, max=12.0, step=0.1, description='b (leg):', continuous_update=True, layout=Layout(width='380px'))
show_squares_chk = Checkbox(value=True, description='Show squares')
annotate_chk = Checkbox(value=True, description='Show numeric annotation')

out_plot = Output()


def _update_plot(a, b, show_squares, annotate):
    out_plot.clear_output(wait=True)
    with out_plot:
        if a <= 0 or b <= 0:
            display(Markdown('**Please choose positive values for a and b.**'))
            return
        plot_triangle_and_squares(a, b, show_squares=show_squares, annotate=annotate)

# wire interactive output
widgets_to_link = {'a': a_slider, 'b': b_slider, 'show_squares': show_squares_chk, 'annotate': annotate_chk}
interactive_plot = interactive_output(_update_plot, widgets_to_link)

# Top instructions
instructions = Label(
    value="""
    # Pythagoras theorem — interactive exploration

    Move the sliders for **a** and **b** (the legs of a right triangle).
    - The notebook draws the triangle and the squares built on each of the three sides.
    - Compare the area of the square on the hypotenuse (c^2) with the sum of the areas of the squares on the legs (a^2 + b^2).

    Use the \"+Quiz\" section below to test your understanding with short exercises.
    """
)

controls = VBox([HBox([a_slider, b_slider]), HBox([show_squares_chk, annotate_chk])])

# Display UI
ui = VBox([instructions, controls, out_plot])

# initial render
_update_plot(a_slider.value, b_slider.value, show_squares_chk.value, annotate_chk.value)

# display interactive UI (so users can re-render by changing widgets)
display(ui)

# connect event handlers so that moving sliders updates the plot interactively
for w in [a_slider, b_slider, show_squares_chk, annotate_chk]:
    w.observe(lambda change: _update_plot(a_slider.value, b_slider.value, show_squares_chk.value, annotate_chk.value), names='value')


# ---------- Symbolic check (optional) ----------
try:
    import sympy as sp
    has_sympy = True
except Exception:
    has_sympy = False

if has_sympy:
    display(Markdown('**Symbolic demonstration (using SymPy)**'))
    a, b = sp.symbols('a b', positive=True)
    c = sp.sqrt(a**2 + b**2)
    expr = sp.simplify(c**2 - (a**2 + b**2))
    display(Markdown(f'`simplify(c**2 - (a**2 + b^2))` = `{expr}` (identically 0)'))
else:
    display(Markdown('*SymPy not detected — symbolic verification skipped (optional).*'))


# ---------- Small exercises & quiz ----------

quiz_out = Output()

# Widgets for exercise: compute hypotenuse
exercise_a = FloatText(value=3.0, description='a:')
exercise_b = FloatText(value=4.0, description='b:')
answer_input = FloatText(value=0.0, description='c (your answer):')
check_btn = Button(description='Check answer', button_style='primary')


def _check_answer(btn=None):
    quiz_out.clear_output()
    a = float(exercise_a.value)
    b = float(exercise_b.value)
    true_c = hypotenuse(a, b)
    user_c = float(answer_input.value)
    tol = 1e-2
    with quiz_out:
        if abs(user_c - true_c) <= tol:
            display(Markdown(f':white_check_mark: Correct! True c = {true_c:.5f}'))
        else:
            display(Markdown(f':x: Not quite. True c = **{true_c:.5f}**. Your answer = {user_c:.5f}'))

check_btn.on_click(_check_answer)

# Random quiz generator
rand_btn = Button(description='New random problem', button_style='info')

import random

def _new_problem(btn=None):
    a_val = round(random.uniform(1.0, 10.0), 2)
    b_val = round(random.uniform(1.0, 10.0), 2)
    exercise_a.value = a_val
    exercise_b.value = b_val
    answer_input.value = 0.0
    _update_plot(a_val, b_val, show_squares_chk.value, annotate_chk.value)

rand_btn.on_click(_new_problem)

# assemble quiz UI
quiz_box = VBox([
    Label(value='## Quick quiz — compute the hypotenuse'), # Changed Markdown to Label
    HBox([exercise_a, exercise_b, answer_input]),
    HBox([check_btn, rand_btn]),
    quiz_out
])

# display quiz
display(quiz_box)

# ---------- Extension ideas for students ----------

display(Markdown('''
---
### Extension ideas (for a class exercise or a short project)

1. **Proof by rearrangement**: create an interactive demonstration that divides the square of side (a+b) into pieces and rearranges them to show equality of areas.
2. **Generalize to Euclidean geometry**: demonstrate using vectors why the dot product leads to the theorem (u·u = |u|^2, and orthogonality implies cross terms vanish).
3. **Data-collection exercise**: measure (with a ruler) multiple right triangles and build a scatter plot of a^2 + b^2 vs c^2; compute residuals and discuss measurement error.
4. **3D generalization**: explore distance formula in 3D and compare to the Pythagorean result.

'''))

# End of notebook
display(Markdown('**Notebook ready — change sliders above or use the quiz to explore Pythagoras.**'))

This site is open source. Improve this page.