visualization_lecture

Introduction to Visualization in Data Science

Data visualization is the graphical representation of information and data. By using visual elements like charts, graphs, and maps, data visualization tools provide an accessible way to see and understand trends, outliers, and patterns in data.

n a world drowning in data, the ability to process information is common, but the ability to communicate it is a superpower. For a Computer Science professional, data visualization is the bridge between back-end complexity and front-end decision-making.

Summary

This module moves beyond simply “making charts.” We will explore the cognitive science of how humans perceive visual information and the technical frameworks required to build scalable, interactive, and honest representations of data. You will learn to stop viewing visualization as an “afterthought” and start seeing it as a critical component of the software development lifecycle.

Course: Visualization in Data Science (Lecture Series)

Purpose. This introduction collects the mathematical foundations, practical examples, code resources and project ideas you’ll need for a lecture series that teaches how to think about, design, and implement effective visualisations in data science.


Learning outcomes

By the end of the course students should be able to:


Suggested prerequisites


1. Mathematical foundations & dimensionality reduction

Activities: Visualise principal components on simple 2D/3D datasets. —

2. Perception, design principles & Tufte

Activities: produce a Tufte-style one-page analysis of a dataset using RMarkdown; critique a student-submitted plot for ‘data-ink’ waste.


3. High-dimensional thinking & dimensional imagination

Activity: interactive visual exercises and mental-model building (embedding spheres, hypercubes) and mapping to dimensionality reduction outputs.


4. Projections, maps & spatial visualization

Activity: small project building choropleth maps and experimenting with different projections and colour schemes.


5. Multivariate visualisation & case-studies

Activity: reproduce Minard, then redesign it for another historical dataset; present differences in communication and fidelity.


6. Good / bad visualisation — ethics & pitfalls

Activity: group exercise — find an example that misleads and rewrite it to improve accuracy and clarity.


7. Practical coding & statistical modelling labs (R/Python)

Labs:


8. Modern alternatives & troubleshooting

Activity: peer review a visualization and provide a short written critique (1 page) plus a revised figure.


Example datasets & illustrations

Example tSNE

Example heatmaps

(Use these to demonstrate cluster structure vs. heatmap summaries; ensure you note provenance and licensing if reused.)


Suggested assessment & deliverables


Further reading & talks