When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 4 modules in this course
Learn to deploy ML models to production using the Sovereign Rust Stackāa pure Rust implementation with zero Python runtime dependencies. This hands-on course teaches you to work with three critical model formats (GGUF, SafeTensors, APR), implement MLOps pipelines with CI/CD and observability, and deploy models across GPU, CPU, WebAssembly, and edge targets.
Through real-world projects including a Python-to-Rust transpiler (Depyler), browser-based speech recognition (Whisper.apr), and LLM inference benchmarking (Qwen), you'll master format conversion, cryptographic model signing, and performance optimization. The course culminates in a capstone project deploying Qwen2.5-Coder across all three formats with benchmarking.
What makes this course unique: instead of relying on Python frameworks, you'll build with production-grade Rust tooling that compiles to native binaries and WebAssembly. Learn to run sub-millisecond inference in browsers, bundle models into executables, and achieve 2x performance gains over standard tools.
Ideal for ML engineers and software developers ready to move beyond notebooks into production deployment.
Understanding ML model formats and the Sovereign AI Stack. Learn GGUF, SafeTensors, and APR formats for different deployment targets.
What's included
6 videos8 readings1 assignment
Show info about module content
6 videosā¢Total 21 minutes
Course Introductionā¢3 minutes
Hugging Face Model Publishingā¢4 minutes
Model Types on Hugging Faceā¢3 minutes
APR Format Deep Diveā¢4 minutes
Model Format Comparisonā¢3 minutes
Why Trace Models ā¢4 minutes
8 readingsā¢Total 8 minutes
Introduction to Course and Course Resourcesā¢1 minute
Meet your instructorsā¢1 minute
Key Conceptsā¢1 minute
Reflectionā¢1 minute
Key Termsā¢1 minute
Reflectionā¢1 minute
Key Termsā¢1 minute
Reflectionā¢1 minute
1 assignmentā¢Total 5 minutes
Quiz: Model Formatā¢5 minutes
MLOps Foundations
Module 2ā¢1 hour to complete
Module details
Production infrastructure for ML systems. This module covers the essential MLOps practices needed to deploy and maintain ML models in production environments. Learn how to implement CI/CD pipelines specifically designed for ML workflows, set up comprehensive observability with logs, metrics, and traces, apply cryptographic model signing for supply chain security, and choose optimal deployment patterns based on your infrastructure requirements.
What's included
8 videos6 readings1 assignment
Show info about module content
8 videosā¢Total 24 minutes
Model Registry Architectureā¢3 minutes
CI/CD Pipeline for MLā¢4 minutes
Model Observability Stackā¢3 minutes
Model Signing & Securityā¢3 minutes
Binary Deployment Patternsā¢3 minutes
Inference Server Architectureā¢3 minutes
Corpus Management & DataOpsā¢3 minutes
Cost-Performance Decision Matrixā¢3 minutes
6 readingsā¢Total 60 minutes
Key Conceptsā¢10 minutes
Reflectionā¢10 minutes
Key Termsā¢10 minutes
Reflectionā¢10 minutes
Key Termsā¢10 minutes
Reflectionā¢10 minutes
1 assignmentā¢Total 5 minutes
Quiz: MLOps Foundationsā¢5 minutes
Project Showcase
Module 3ā¢2 hours to complete
Module details
Real-world projects built with the Sovereign AI Stack. This module demonstrates practical applications through three production projects: Depyler (a Python-to-Rust transpiler with self-improving ML), Whisper.apr (speech-to-text in browser and CLI), and the APR ecosystem tools. Learn how to build self-improving systems using compiler-in-the-loop training, deploy speech recognition to resource-constrained environments, and leverage the full APR toolchain for model conversion and inference.
What's included
11 videos6 readings1 assignment
Show info about module content
11 videosā¢Total 43 minutes
Four Projects, One Stackā¢5 minutes
Depyler Deep Diveā¢5 minutes
Depyler Oracle Trainingā¢3 minutes
Depyler Single-Shot Compileā¢3 minutes
Whisper.apr Overviewā¢5 minutes
Whisper Code Walkthroughā¢4 minutes
Whisper Demoā¢3 minutes
APR Format Rosetta Stoneā¢3 minutes
APR Hub & Spoke Architectureā¢3 minutes
APR Chat Demoā¢3 minutes
Course Conclusionā¢3 minutes
6 readingsā¢Total 60 minutes
Key Termsā¢10 minutes
Reflectionā¢10 minutes
Key Conceptsā¢10 minutes
Reflectionā¢10 minutes
Key Conceptsā¢10 minutes
Reflectionā¢10 minutes
1 assignmentā¢Total 5 minutes
Quiz: Project Showcaseā¢5 minutes
Capstone Project
Module 4ā¢1 hour to complete
Module details
Final project deploying Qwen2.5-Coder-0.5B across all three model formats. Students demonstrate mastery of format conversion, CLI deployment, server deployment, and performance benchmarking.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you canāt afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, youāll find a link to apply on the description page.