Get in Touch

Course Outline

Introduction to TinyML and Embedded AI

  • Key characteristics of TinyML model deployment
  • Constraints within microcontroller environments
  • Overview of embedded AI toolchains

Foundations of Model Optimization

  • Understanding computational bottlenecks
  • Identifying memory-intensive operations
  • Conducting baseline performance profiling

Quantization Techniques

  • Post-training quantization strategies
  • Quantization-aware training methods
  • Evaluating the trade-off between accuracy and resource usage

Pruning and Compression

  • Structured and unstructured pruning techniques
  • Weight sharing and model sparsity
  • Compression algorithms designed for lightweight inference

Hardware-Aware Optimization

  • Deploying models on ARM Cortex-M systems
  • Optimizing for DSP and accelerator extensions
  • Considerations for memory mapping and dataflow

Benchmarking and Validation

  • Analysis of latency and throughput
  • Measurement of power and energy consumption
  • Testing for accuracy and robustness

Deployment Workflows and Tools

  • Using TensorFlow Lite Micro for embedded deployment
  • Integrating TinyML models with Edge Impulse pipelines
  • Testing and debugging on actual hardware

Advanced Optimization Strategies

  • Neural architecture search for TinyML applications
  • Hybrid approaches combining quantization and pruning
  • Model distillation for embedded inference

Summary and Next Steps

Requirements

  • A solid understanding of machine learning workflows
  • Experience working with embedded systems or microcontroller development
  • Familiarity with Python programming

Target Audience

  • AI researchers
  • Embedded ML engineers
  • Professionals developing inference systems with constrained resources
 21 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories