Get in Touch

Course Outline

Foundations of Safe and Fair AI

  • Core concepts: safety, bias, fairness, transparency.
  • Bias typologies: dataset, representation, and algorithmic bias.
  • Overview of regulatory frameworks (e.g., EU AI Act, GDPR).

Bias in Fine-Tuned Models

  • Mechanisms through which fine-tuning introduces or amplifies bias.
  • Case studies and real-world failure examples.
  • Methods for identifying bias in datasets and model predictions.

Techniques for Bias Mitigation

  • Data-level strategies (rebalancing, augmentation).
  • Training-stage strategies (regularization, adversarial debiasing).
  • Post-processing strategies (output filtering, calibration).

Model Safety and Robustness

  • Detection of unsafe or harmful outputs.
  • Handling adversarial inputs.
  • Red teaming and stress testing of fine-tuned models.

Auditing and Monitoring AI Systems

  • Evaluation metrics for bias and fairness (e.g., demographic parity).
  • Explainability tools and transparency frameworks.
  • Ongoing monitoring and governance practices.

Toolkits and Hands-On Practice

  • Leveraging open-source libraries (e.g., Fairlearn, Transformers, CheckList).
  • Practical session: Detecting and mitigating bias in a fine-tuned model.
  • Generating safe outputs via prompt design and constraints.

Enterprise Use Cases and Compliance Readiness

  • Best practices for integrating safety into LLM workflows.
  • Documentation standards and model cards for compliance.
  • Preparation for audits and external reviews.

Summary and Next Steps

Requirements

  • A foundational understanding of machine learning models and training methodologies.
  • Practical experience with fine-tuning techniques and Large Language Models (LLMs).
  • Familiarity with Python programming and Natural Language Processing (NLP) concepts.

Audience

  • AI compliance teams.
  • ML engineers.
 14 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories