Data visualization is the graphical representation of information and data. By using visual elements like charts, graphs, and maps, data visualization tools provide an accessible way to see and understand trends, outliers, and patterns in data.
n a world drowning in data, the ability to process information is common, but the ability to communicate it is a superpower. For a Computer Science professional, data visualization is the bridge between back-end complexity and front-end decision-making.
This module moves beyond simply “making charts.” We will explore the cognitive science of how humans perceive visual information and the technical frameworks required to build scalable, interactive, and honest representations of data. You will learn to stop viewing visualization as an “afterthought” and start seeing it as a critical component of the software development lifecycle.
Course: Visualization in Data Science (Lecture Series)
Purpose. This introduction collects the mathematical foundations, practical examples, code resources and project ideas you’ll need for a lecture series that teaches how to think about, design, and implement effective visualisations in data science.
By the end of the course students should be able to:
Define the Strategic Purpose: Articulate the specific objectives of a visualization (e.g., exploratory vs. explanatory) and how it serves the broader goals of a project.
Architect Visual Solutions: Select optimal visualization techniques by analyzing the dimensionality and complexity of data structures (multivariate, temporal, geospatial, etc.).
Design for the Audience: Tailor visual complexity and interactivity based on the technical literacy and needs of the end-user.
Implement Aesthetic Precision: Develop visualizations that are not only functionally accurate but also aesthetically professional, utilizing principles of color theory, typography, and visual hierarchy.
Integrity & Ethics: A visualization can lie just as easily as a bug can crash a program. We will study “Lie Factors” and how to avoid misleading your audience through truncated axes or improper scaling.
Lecture + reading: Mathematics for data science & visualization (notes & proofs).
Teaching resources on the mathematics of dimensionality reduction: https://www.researchgate.net/publication/375186575_Everything_you_wanted_to_know_about_the_mathematics_of_dimensionality_reduction_and_visualization_but_were_afraid_to_ask_Teaching_resources_and_activities
Activities: Visualise principal components on simple 2D/3D datasets. —
Tufte & data-ink: short reading and applied exercise.
Activities: produce a Tufte-style one-page analysis of a dataset using RMarkdown; critique a student-submitted plot for ‘data-ink’ waste.
Motivation & intuition: resources to help students visualise higher dimensions and intuitive strategies.
Activity: interactive visual exercises and mental-model building (embedding spheres, hypercubes) and mapping to dimensionality reduction outputs.
Cartographic projections: compare distortion trade-offs when mapping a sphere to a plane.
Activity: small project building choropleth maps and experimenting with different projections and colour schemes.
Classic and modern case studies:
Activity: reproduce Minard, then redesign it for another historical dataset; present differences in communication and fidelity.
How to mislead & how to avoid it: curated reads and examples.
Activity: group exercise — find an example that misleads and rewrite it to improve accuracy and clarity.
Code repos & examples:
Labs:
Broken charts & critique: (a modern take on chart failure modes)
Activity: peer review a visualization and provide a short written critique (1 page) plus a revised figure.


(Use these to demonstrate cluster structure vs. heatmap summaries; ensure you note provenance and licensing if reused.)
Great talks and recorded lectures (playlist): https://www.youtube.com/watch?v=iipVlV4I_Vg&list=PLB2SCq-tZtVnXalwtfVPcjwy0xJbu-btN&index=4