Course Outline
Module 1: Essential Python for ML Workflows
• Course introduction and environment configuration
Aligning goals and establishing a reproducible Python ML workspace
• Python language fundamentals (accelerated track)
Reviewing syntax, control flow, functions, and patterns prevalent in ML codebases
• Data structures for ML
Utilising lists, dictionaries, sets, and tuples for features, labels, and metadata
• Comprehensions and functional programming tools
Implementing transformations via comprehensions and higher-order functions
• Object-oriented Python for ML developers
Working with classes, methods, composition, and practical design decisions
• dataclasses and lightweight modelling
Using typed containers for configuration, examples, and results
• Decorators and context managers
Applying patterns for timing, caching, logging, and resource-safe execution
• File and path management
Handling datasets robustly and managing serialization formats
• Exceptions and defensive programming
Writing ML scripts that fail safely and transparently
• Modules, packages, and project structure
Organising reusable ML codebases effectively
• Typing and code quality
Implementing type hints, documentation, and lint-friendly structures
Module 2: Numerical Python, SciPy, and Data Handling
• NumPy foundations for vectorised computing
Executing efficient array operations and performance-conscious coding
• Indexing, slicing, broadcasting, and shapes
Ensuring safe array manipulation and shape reasoning
• Linear algebra essentials with NumPy and SciPy
Performing stable matrix operations and decompositions common in ML
• SciPy deep dive
Exploring statistics, optimisation, curve fitting, and sparse matrices
• Pandas for tabular ML data
Cleaning, joining, aggregating, and preparing datasets
• scikit-learn deep dive
Mastering the estimator interface, pipelines, and reproducible workflows
• Visualisation essentials
Creating diagnostic plots for data exploration and model behaviour analysis
Module 3: Programming Patterns for Building ML Applications
• Transitioning from notebooks to maintainable projects
Refactoring exploratory code into structured packages
• Configuration management
Externalising parameters and validating startup conditions
• Logging, warnings, and observability
Implementing structured logging for debuggable ML systems
• Reusable components with OOP and composition
Designing extensible transformers and predictors
• Practical design patterns
Applying Pipeline, Factory/Registry, Strategy, and Adapter patterns
• Data validation and schema checks
Preventing silent data issues
• Performance and profiling
Identifying bottlenecks and applying optimisation techniques
• Model I/O and inference interfaces
Ensuring safe persistence and clean prediction interfaces
• End-to-end mini build
Constructing a production-style ML pipeline with configuration and logging
Module 4: Statistical Learning for Tabular, Text, and Image Data
• Evaluation foundations
Managing train/validation splits, honest cross-validation, and business-aligned metrics
• Advanced tabular ML
Working with regularised GLMs, tree ensembles, and leakage-free preprocessing
• Calibration and uncertainty
Using Platt scaling, isotonic regression, bootstrap, and conformal prediction
• Classical NLP methods
Navigating tokenisation trade-offs, TF-IDF, linear models, and Naive Bayes
• Topic modelling
Understanding LDA fundamentals and practical limitations
• Classical computer vision
Utilising HOG, PCA, and feature-based pipelines
• Error analysis
Detecting bias, label noise, and spurious correlations
• Hands-on labs
Building a leakage-proof tabular pipeline
Comparing and interpreting text baselines
Establishing classical vision baselines with structured failure analysis
Module 5: Neural Networks for Tabular, Text, and Image Data
• Training loop mastery
Writing clean PyTorch loops with automatic mixed precision (AMP), gradient clipping, and reproducibility features
• Optimisation and regularisation
Managing initialisation, normalisation, optimisers, and schedulers
• Mixed precision and scaling
Implementing gradient accumulation and checkpointing strategies
• Tabular neural networks
Using categorical embeddings, feature crosses, and ablation studies
• Text neural networks
Working with embeddings, CNNs, BiLSTMs/GRUs, and sequence handling
• Vision neural networks
Understanding CNN fundamentals and ResNet-style architectures
• Hands-on labs
Building a reusable training framework
Comparing tabular NNs vs boosting
Conducting CNN experiments with augmentation and scheduling
Module 6: Advanced Neural Architectures
• Transfer learning strategies
Applying freeze/unfreeze patterns and discriminative learning rates
• Transformer architectures for text
Exploring self-attention internals and fine-tuning approaches
• Vision backbones and dense prediction
Utilising ResNet, EfficientNet, Vision Transformers, and U-Net concepts
• Advanced tabular architectures
Working with TabTransformer, FT-Transformer, and Deep & Cross networks
• Time series considerations
Handling temporal splits and detecting covariate shift
• Parameter-efficient fine-tuning (PEFT) and efficiency techniques
Navigating LoRA, distillation, and quantisation trade-offs
• Hands-on labs
Fine-tuning a pretrained text transformer
Fine-tuning a pretrained vision model
Comparing tabular transformers vs GBDT
Module 7: Generative AI Systems
• Prompting fundamentals
Mastering structured prompting and controlled generation
• LLM foundations
Understanding tokenisation, instruction tuning, and hallucination mitigation
• Retrieval-Augmented Generation (RAG)
Implementing chunking, embeddings, hybrid search, and evaluation metrics
• Fine-tuning strategies
Applying LoRA and QLoRA with strict data quality controls
• Diffusion models
Grasping latent diffusion intuition and practical adaptation
• Synthetic tabular data
Using CTGAN and addressing privacy considerations
• Hands-on labs
Building a production-style RAG mini-application
Validating structured output with schema enforcement
Optional diffusion experimentation
Module 8: AI Agents and MCP
• Agent loop design
Implementing observe, plan, act, reflect, and persist cycles
• Agent architectures
Designing ReAct, plan-and-execute, and multi-agent coordination systems
• Memory management
Using episodic, semantic, and scratchpad approaches
• Tool integration and safety
Establishing tool contracts, sandboxing, and defences against prompt injection
• Evaluation frameworks
Creating replayable traces, task suites, and regression testing mechanisms
• Model Context Protocol (MCP) and protocol-based interoperability
Designing MCP servers with secure tool exposure
• Hands-on labs
Building an agent from scratch
Exposing tools via an MCP-style server
Creating an evaluation harness with safety constraints
Requirements
Participants are expected to possess a practical understanding of Python programming.
This programme is designed for technical professionals at intermediate to advanced levels.
Testimonials (1)
The ML ecosystem: not only MLflow but also Optuna, Hyperopt, Docker, and docker-compose