Multimodal LLM Workflows in Vertex AI Training Course
Vertex AI offers robust tools for developing multimodal LLM workflows that seamlessly integrate text, audio, and image data into a unified pipeline. With support for long context windows and Gemini API parameters, it facilitates advanced applications in planning, reasoning, and cross-modal intelligence.
This instructor-led, live training (available both online and on-site) is designed for intermediate to advanced-level practitioners who aim to design, build, and optimize multimodal AI workflows using Vertex AI.
By the end of this training, participants will be able to:
- Utilize Gemini models for handling multimodal inputs and outputs.
- Implement long-context workflows for sophisticated reasoning tasks.
- Design pipelines that combine text, audio, and image analysis.
- Optimize Gemini API parameters to enhance performance and cost efficiency.
Format of the Course
- Interactive lectures and discussions.
- Hands-on labs focusing on multimodal workflows.
- Project-based exercises for practical application of multimodal use cases.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction to Multimodal LLMs in Vertex AI
- Overview of multimodal capabilities in Vertex AI
- Gemini models and supported modalities
- Use cases in enterprise and research
Setting Up the Development Environment
- Configuring Vertex AI for multimodal workflows
- Working with datasets across modalities
- Hands-on lab: environment setup and dataset preparation
Long Context Windows and Advanced Reasoning
- Understanding long-context workflows
- Use cases in planning and decision-making
- Hands-on lab: implementing long-context analysis
Cross-Modal Workflow Design
- Combining text, audio, and image analysis
- Chaining multimodal steps in pipelines
- Hands-on lab: designing a multimodal pipeline
Working with Gemini API Parameters
- Configuring multimodal inputs and outputs
- Optimizing inference and efficiency
- Hands-on lab: tuning Gemini API parameters
Advanced Applications and Integrations
- Interactive multimodal agents and assistants
- Integrating external APIs and tools
- Hands-on lab: building a multimodal application
Evaluation and Iteration
- Testing multimodal performance
- Metrics for accuracy, alignment, and drift
- Hands-on lab: evaluating multimodal workflows
Summary and Next Steps
Requirements
- Proficiency in Python programming
- Experience with machine learning model development
- Familiarity with multimodal data (text, audio, image)
Audience
- AI researchers
- Advanced developers
- ML scientists
Open Training Courses require 5+ participants.
Multimodal LLM Workflows in Vertex AI Training Course - Booking
Multimodal LLM Workflows in Vertex AI Training Course - Enquiry
Multimodal LLM Workflows in Vertex AI - Consultancy Enquiry
Consultancy Enquiry
Upcoming Courses
Related Courses
Advanced LangGraph: Optimization, Debugging, and Monitoring Complex Graphs
35 HoursLangGraph is a framework designed for constructing stateful, multi-actor LLM applications through composable graphs that maintain persistent state and offer control over execution.
This instructor-led, live training (available online or on-site) is targeted at advanced-level AI platform engineers, DevOps professionals specializing in AI, and ML architects who aim to optimize, debug, monitor, and operate production-grade LangGraph systems.
By the end of this training, participants will be able to:
- Design and optimize complex LangGraph topologies for improved speed, cost efficiency, and scalability.
- Implement reliability features such as retries, timeouts, idempotency, and checkpoint-based recovery.
- Debug and trace graph executions, examine state, and systematically reproduce production issues.
- Instrument graphs with logs, metrics, and traces, deploy them to production, and monitor SLAs and costs.
Format of the Course
- Interactive lecture and discussion sessions.
- Plenty of exercises and practical activities.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building Coding Agents with Devstral: From Agent Design to Tooling
14 HoursDevstral is an open-source framework designed for building and running coding agents that can interact with codebases, developer tools, and APIs to enhance engineering productivity.
This instructor-led, live training (available both online and onsite) is aimed at intermediate to advanced ML engineers, developer-tooling teams, and SREs who want to design, implement, and optimize coding agents using Devstral.
By the end of this training, participants will be able to:
- Set up and configure Devstral for developing coding agents.
- Create agentic workflows for exploring and modifying codebases.
- Integrate coding agents with developer tools and APIs.
- Implement best practices for secure and efficient agent deployment.
Format of the Course
- Interactive lecture and discussion.
- Plenty of exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Open-Source Model Ops: Self-Hosting, Fine-Tuning and Governance with Devstral & Mistral Models
14 HoursDevstral and Mistral models are open-source AI technologies designed for flexible deployment, fine-tuning, and scalable integration.
This instructor-led, live training (online or onsite) is aimed at intermediate to advanced ML engineers, platform teams, and research engineers who wish to self-host, fine-tune, and manage Mistral and Devstral models in production environments.
By the end of this training, participants will be able to:
- Set up and configure self-hosted environments for Mistral and Devstral models.
- Apply fine-tuning techniques to optimize performance for specific domains.
- Implement versioning, monitoring, and lifecycle management.
- Ensure the security, compliance, and responsible use of open-source models.
Format of the Course
- Interactive lecture and discussion.
- Hands-on exercises in self-hosting and fine-tuning.
- Live-lab implementation of governance and monitoring pipelines.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
LangGraph Applications in Finance
35 HoursLangGraph is a framework designed for constructing stateful, multi-actor LLM applications using composable graphs with persistent state and controlled execution.
This instructor-led, live training (available online or on-site) is targeted at intermediate to advanced professionals who aim to design, implement, and manage LangGraph-based financial solutions while ensuring proper governance, observability, and compliance.
By the end of this training, participants will be able to:
- Create finance-specific LangGraph workflows that align with regulatory and audit requirements.
- Incorporate financial data standards and ontologies into graph state and tools.
- Implement reliability, safety, and human-in-the-loop controls for critical processes.
- Deploy, monitor, and optimize LangGraph systems to ensure performance, cost-effectiveness, and service level agreements (SLAs).
Format of the Course
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
LangGraph Foundations: Graph-Based LLM Prompting and Chaining
14 HoursLangGraph is a framework designed for creating graph-structured LLM applications that support planning, branching, tool integration, memory management, and controllable execution.
This instructor-led, live training (available online or on-site) is aimed at beginner-level developers, prompt engineers, and data practitioners who wish to design and build reliable, multi-step LLM workflows using LangGraph.
By the end of this training, participants will be able to:
- Understand key LangGraph concepts (nodes, edges, state) and know when to apply them.
- Create prompt chains that can branch, call external tools, and maintain memory.
- Incorporate retrieval mechanisms and external APIs into graph workflows.
- Test, debug, and evaluate LangGraph applications for reliability and safety.
Format of the Course
- Interactive lecture and facilitated discussion.
- Guided labs and code walkthroughs in a sandbox environment.
- Scenario-based exercises focused on design, testing, and evaluation.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
LangGraph in Healthcare: Workflow Orchestration for Regulated Environments
35 HoursLangGraph enables stateful, multi-actor workflows powered by LLMs with precise control over execution paths and state persistence. In healthcare, these capabilities are essential for compliance, interoperability, and building decision-support systems that align with medical workflows.
This instructor-led, live training (online or onsite) is aimed at intermediate to advanced-level professionals who wish to design, implement, and manage LangGraph-based healthcare solutions while addressing regulatory, ethical, and operational challenges.
By the end of this training, participants will be able to:
- Design healthcare-specific LangGraph workflows with a focus on compliance and auditability.
- Integrate LangGraph applications with medical ontologies and standards (FHIR, SNOMED CT, ICD).
- Apply best practices for reliability, traceability, and explainability in sensitive environments.
- Deploy, monitor, and validate LangGraph applications in healthcare production settings.
Format of the Course
- Interactive lecture and discussion.
- Hands-on exercises with real-world case studies.
- Implementation practice in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
LangGraph for Legal Applications
35 HoursLangGraph is a framework designed for creating stateful, multi-actor LLM applications as composable graphs that maintain persistent state and offer precise control over execution.
This instructor-led, live training (available both online and on-site) is targeted at intermediate to advanced professionals who aim to design, implement, and operate LangGraph-based legal solutions with the required compliance, traceability, and governance controls.
By the end of this training, participants will be able to:
- Create legal-specific LangGraph workflows that ensure auditability and compliance.
- Integrate legal ontologies and document standards into the graph state and processing.
- Implement guardrails, human-in-the-loop approvals, and traceable decision paths.
- Deploy, monitor, and maintain LangGraph services in a production environment with observability and cost controls.
Format of the Course
- Interactive lectures and discussions.
- Numerous exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building Dynamic Workflows with LangGraph and LLM Agents
14 HoursLangGraph is a framework designed for creating graph-structured workflows with large language models (LLMs). It supports branching, tool integration, memory management, and controllable execution processes.
This instructor-led, live training (available online or on-site) is targeted at intermediate-level engineers and product teams who aim to integrate LangGraph’s graph logic with LLM agent loops. This will enable them to develop dynamic, context-aware applications such as customer support agents, decision trees, and information retrieval systems.
By the end of this training, participants will be able to:
- Create graph-based workflows that effectively coordinate LLM agents, tools, and memory.
- Implement conditional routing, retries, and fallback mechanisms to ensure robust execution.
- Incorporate retrieval, APIs, and structured outputs into agent loops for enhanced functionality.
- Evaluate, monitor, and fortify agent behavior to ensure reliability and safety.
Format of the Course
- Interactive lectures and facilitated discussions.
- Guided labs and code walkthroughs in a secure sandbox environment.
- Scenario-based design exercises and peer reviews to enhance practical skills.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
LangGraph for Marketing Automation
14 HoursLangGraph is a graph-based orchestration framework that enables conditional, multi-step workflows involving language models and tools, making it ideal for automating and personalizing content pipelines.
This instructor-led, live training (available online or on-site) is designed for intermediate-level marketers, content strategists, and automation developers who want to implement dynamic, branching email campaigns and content generation pipelines using LangGraph.
By the end of this training, participants will be able to:
- Design graph-structured content and email workflows that incorporate conditional logic.
- Integrate language models, APIs, and data sources for automated personalization.
- Manage state, memory, and context across multi-step campaigns.
- Evaluate, monitor, and optimize the performance and delivery outcomes of workflows.
Format of the Course
- Interactive lectures and group discussions.
- Hands-on labs for implementing email workflows and content pipelines.
- Scenario-based exercises focusing on personalization, segmentation, and branching logic.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Le Chat Enterprise: Private ChatOps, Integrations & Admin Controls
14 HoursLe Chat Enterprise is a private ChatOps solution that offers secure, customizable, and governed conversational AI capabilities for organizations, supporting role-based access control (RBAC), single sign-on (SSO), connectors, and enterprise application integrations.
This instructor-led, live training (available online or on-site) is designed for intermediate-level product managers, IT leaders, solution engineers, and security/compliance teams who are looking to deploy, configure, and manage Le Chat Enterprise in enterprise environments.
By the end of this training, participants will be able to:
- Set up and configure Le Chat Enterprise for secure deployments.
- Implement RBAC, SSO, and compliance-driven controls.
- Integrate Le Chat with enterprise applications and data stores.
- Design and implement governance and admin playbooks for ChatOps.
Format of the Course
- Interactive lecture and discussion.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Cost-Effective LLM Architectures: Mistral at Scale (Performance / Cost Engineering)
14 HoursMistral is a high-performance series of large language models designed for cost-effective deployment at scale.
This instructor-led, live training (available online or on-site) is targeted at advanced-level infrastructure engineers, cloud architects, and MLOps leaders who aim to design, deploy, and optimize Mistral-based architectures to achieve maximum throughput with minimal costs.
By the end of this training, participants will be able to:
- Implement scalable deployment patterns for Mistral Medium 3.
- Apply batching, quantization, and efficient serving techniques.
- Optimize inference costs while preserving performance.
- Design production-ready serving architectures for enterprise-level workloads.
Format of the Course
- Interactive lectures and discussions.
- Plenty of exercises and practical sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Productizing Conversational Assistants with Mistral Connectors & Integrations
14 HoursMistral AI is an open AI platform that enables teams to develop and integrate conversational assistants into enterprise and customer-facing workflows.
This instructor-led, live training (online or on-site) is designed for beginner to intermediate-level product managers, full-stack developers, and integration engineers who want to design, integrate, and commercialize conversational assistants using Mistral connectors and integrations.
By the end of this training, participants will be able to:
- Integrate Mistral conversational models with enterprise and SaaS connectors.
- Implement retrieval-augmented generation (RAG) for contextually accurate responses.
- Create UX patterns for internal and external chat assistants.
- Deploy assistants into product workflows for practical use cases.
Format of the Course
- Interactive lectures and discussions.
- Practical integration exercises.
- Live-lab development of conversational assistants.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Enterprise-Grade Deployments with Mistral Medium 3
14 HoursMistral Medium 3 is a high-performance, multimodal large language model designed for deployment in enterprise environments across various industries.
This instructor-led, live training (available online or on-site) is tailored for intermediate to advanced AI/ML engineers, platform architects, and MLOps teams who aim to deploy, optimize, and secure Mistral Medium 3 for enterprise use cases.
By the end of this training, participants will be able to:
- Deploy Mistral Medium 3 using both API and self-hosted options.
- Optimize inference performance and manage costs effectively.
- Implement multimodal use cases with Mistral Medium 3.
- Apply best practices for security and compliance in enterprise environments.
Format of the Course
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Mistral for Responsible AI: Privacy, Data Residency & Enterprise Controls
14 HoursMistral AI is an open and enterprise-ready AI platform that offers features for secure, compliant, and responsible AI deployment.
This instructor-led, live training (online or onsite) is designed for intermediate-level compliance leads, security architects, and legal/ops stakeholders who want to implement responsible AI practices using Mistral by leveraging privacy, data residency, and enterprise control mechanisms.
By the end of this training, participants will be able to:
- Implement privacy-preserving techniques in their Mistral deployments.
- Apply data residency strategies to meet regulatory requirements.
- Set up enterprise-grade controls such as Role-Based Access Control (RBAC), Single Sign-On (SSO), and audit logs.
- Evaluate vendor and deployment options for compliance alignment.
Format of the Course
- Interactive lectures and discussions.
- Compliance-focused case studies and exercises.
- Hands-on implementation of enterprise AI controls.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Multimodal Applications with Mistral Models (Vision, OCR, & Document Understanding)
14 HoursMistral models are open-source AI technologies that now support multimodal workflows, enabling both language and vision tasks for enterprise and research applications.
This instructor-led, live training (online or onsite) is designed for intermediate-level ML researchers, applied engineers, and product teams who want to build multimodal applications using Mistral models, including OCR and document understanding pipelines.
By the end of this training, participants will be able to:
- Set up and configure Mistral models for multimodal tasks.
- Implement OCR workflows and integrate them with NLP pipelines.
- Design document understanding applications for enterprise use cases.
- Develop vision-text search and assistive UI functionalities.
Format of the Course
- Interactive lecture and discussion.
- Hands-on coding exercises.
- Live-lab implementation of multimodal pipelines.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.