Track 01
Python, NumPy, Pandas, Visualization
This first track turns data handling into a real workflow: inspect the shape, inspect the table, inspect one slice, inspect one plot, then build one feature matrix without losing row alignment.
Primary Goal
Inspect Before Modeling
The student should leave this track able to see data problems early instead of discovering them later through a broken model.
Best For
First-Time Learners
This is the default first track when arrays, tables, grouped summaries, and plotting are not yet mechanical.
Exit Rule
Leave With One Clean Baseline Table
You are done when the data path feels stable enough that the first model is no longer the scary part.
Use This Track When¶
- you are learning AI for the first time
- table inspection is still slow
- grouped summaries and counts do not yet feel automatic
- feature construction still risks row misalignment
What This Track Is Training¶
This track trains one durable habit:
- inspect before you transform
That means the learner should be able to move through this order without rushing:
- shapes
- schema
- missingness
- slice counts
- one useful plot
- one stable feature matrix
First Session¶
Use this order:
- Array Shapes and Axis Operations
- Table Inspection
- run
academy/.venv/bin/python academy/examples/numpy-shape-sanity/shape_sanity.py - run
academy/.venv/bin/python academy/examples/course-support-inspection/quick_inspection.py - write one short note on the first suspicious slice or feature risk
Full Track Loop¶
For the complete workflow:
- read the tooling topics in order
- run one short example after each core topic
- run
academy/.venv/bin/python academy/labs/python-numpy-pandas-visualization/src/course_support_eda.py - run
academy/.venv/bin/python academy/labs/python-numpy-pandas-visualization/src/feature_matrix_walkthrough.py - finish Exercise Sets 1 and 2 in
academy/exercises/python-numpy-pandas-visualization/before moving to the tiny logistic baseline in Set 3 - keep one note with the baseline table, slice summary, and first feature-matrix checks
What To Inspect¶
By the end of the track, the learner should have inspected:
- row and column counts
- missing-value rates
- target balance or label counts
- at least one grouped summary with counts
- one plot that changes the next action
- one feature matrix whose row alignment is still explicit
Common Failure Modes¶
- moving to modeling before the schema is understood
- trusting a slice rate without its count
- using plots decoratively instead of diagnostically
- building derived features without checking row alignment
- converting to a matrix too early and losing the meaning of columns
Exit Standard¶
Before leaving this track, the learner should be able to:
- explain what the table contains
- name the first risky slice
- build one clean feature matrix
- say what should be inspected before the first baseline model
That is enough to move safely into scikit-learn Validation and Tuning.