Track 12

Advanced Unsupervised and Manifold Workflows

This track turns unsupervised geometry into a route: inspect whether the structure is compact, curved, noisy, or graph-like, then choose the method that matches that shape instead of trusting the prettiest plot.

Open The Clustering Topic Run Fast Examples Use The Study Plan

Primary Goal

Match Geometry To Method

The real skill is deciding when centroid, density, graph, or manifold thinking is the right lens for the data.

Best For

After Basic Clustering

Use this when KMeans versus DBSCAN is no longer enough and you need stronger judgment about non-convex shapes, noise, and embedding claims.

Exit Rule

Leave With One Defensible Method Call

You are done when you can justify a geometry-matching method with stability and non-visual evidence, not only with one nice view.

Use This Track When¶

compact centroid assumptions already feel too blunt
the data may contain curved groups, local connectivity, or meaningful noise
you need to compare KMeans, spectral clustering, DBSCAN, and agglomerative clustering honestly
you want to use Isomap, MDS, or t-SNE as inspection tools without overclaiming

What This Track Is Training¶

This track trains one route:

inspect the geometry first
decide whether the data wants centroid, density, or graph structure
check whether the result is stable across seeds
use low-dimensional views as inspection aids, not as proof
keep one non-visual check that would still matter if the plot looked worse

First Session¶

Use this order:

Clustering and Low-Dimensional Views
run academy/.venv/bin/python academy/examples/unsupervised-manifold-recipes/spectral_clustering_demo.py
run academy/.venv/bin/python academy/examples/unsupervised-manifold-recipes/manifold_inspection_demo.py
write one short note on where local connectivity beats the centroid assumption

Full Track Loop¶

For the complete workflow:

read Clustering and Low-Dimensional Views
run academy/.venv/bin/python academy/examples/unsupervised-manifold-recipes/spectral_clustering_demo.py
run academy/.venv/bin/python academy/examples/unsupervised-manifold-recipes/manifold_inspection_demo.py
run academy/.venv/bin/python academy/labs/advanced-unsupervised-and-manifold-workflows/src/advanced_unsupervised_workflow.py
finish the matching exercises in academy/exercises/advanced-unsupervised-and-manifold-workflows/
keep one note with the chosen method, one stability check, and one claim you refuse to make from the embedding alone

What To Inspect¶

By the end of the track, the learner should have inspected:

whether the geometry looks compact, curved, fragmented, or noisy
how spectral clustering changes the answer relative to KMeans
how much noise DBSCAN leaves unlabeled and whether that is useful
whether labels remain stable across seeds using a check such as pairwise ARI
whether the embedding preserves enough neighborhood structure to support inspection
one non-visual summary that is stronger than the prettiest scatter plot

Common Failure Modes¶

forcing KMeans onto data that is obviously not centroid-shaped
treating a good-looking embedding as proof of global structure
choosing a method from one lucky seed without checking stability
reading DBSCAN noise as failure when the refusal to assign is the honest answer
forgetting to ask whether a simpler geometric assumption already fits well enough

Exit Standard¶

Before leaving this track, the learner should be able to:

explain why the chosen method matches the data geometry better than the alternatives
name one stability check that matters more than one polished run
use an embedding for inspection without claiming it proves the full structure
say when density, graph, or centroid thinking is the most honest first move

That is enough to move into harder multimodal or representation-heavy workflows without treating every plot as truth.