You're seeing this page as if you were . The main menu is still yours, though. Exit from immersion
Michael WagnerMW

Michael Wagner

Edge AI Engineer | Generative AI & Embedded Systems

€960/day
München, DE
3-7 years

Average response time: 1 hour

Freelancer profile translated to English.
Back to original language

About Michael

Edge AI Engineer | Generative AI & Embedded Systems

Specialist for Edge AI & High-Performance Inference. I combine 6 years of embedded experience (C, Robotics) with deep expertise in Generative AI (LLM Optimization, Agents). I ensure that modern AI models run performantly and reliably outside the cloud.

Focus Areas:Generative AI (LLMs, Agents), Edge AI / On-Device Inference, Embedded Software Engineering, Robotics & Sensor Fusion.


Generative AI & LLM Engineering:
  • Inference Optimization: vLLM, llama.cpp, TensorRT-LLM, Quantization (GGUF, AWQ, GPTQ, int8), Speculative Decoding (EAGLE), Model Pruning.
  • Frameworks & Tools: PyTorch, Hugging Face (Transformers, PEFT, TRL), LangChain, LangGraph, LlamaIndex.
  • Agentic AI: Development of autonomous agents, Function Calling, MCP, Multi-Agent Systems.
  • RAG: Building Retrieval-Augmented Generation Pipelines, Vector Databases (Pinecone, ChromaDB), Embeddings.


Embedded Systems & C/C++:
  • Core: C/C++, Embedded Linux (Yocto/Buildroot), RTOS.
  • Robotics: Inverse Kinematics, Sensor Fusion (IMU), Control Engineering, ROS/ROS2, dlib.
  • Communication: CAN Bus (J1939, CANopen), SPI, I2C, MQTT, TCP/IP.


Software Architecture & DevOps:
  • Languages: Python & C (Expert), C++, TypeScript/JavaScript.
  • Infrastructure: Docker, Kubernetes (K8s), AWS (EC2, S3, Lambda), NVIDIA GPU Containers.
  • CI/CD: GitLab CI, GitHub Actions, CMake, Make.
  • Web/Backend: FastAPI, Flask, Next.js, Supabase, PostgreSQL, GraphQL.


Methods & Soft Skills:
  • Requirements analysis, Mentoring.
  • Languages: German (Native), English (Business Fluent).
  • German

    Native or bilingual

  • English

    Fluent

Can work on-site
München (up to 50km)

Experience

  • Internal R&D
    LLM Inference Optimization & Fine-Tuning
    December 2025 - January 2026 (1 month)
    - Goal:Evaluation and implementation of SOTA techniques for accelerating LLM inference on hardware-constrained systems.
    - Performance:Application of int8 quantization (via `llmcompressor`) to Qwen models. Increase in throughput by 50% (>5000 tokens/s) with consistent accuracy (GSM8K).
    - Advanced AI:Investigation of Speculative Decoding (training an EAGLE draft model) and execution of Fine-Tuning (SFT & LoRA).
    - Tech Stack:Python, vLLM, Hugging Face (PEFT, TRL), Kubernetes, Docker, NVIDIA Dynamo
    Python vLLM Hugging Face Kubernetes NVIDIA Dynamo
  • Proof of Concept (PoC)
    Deployment of a Local LLM (Edge AI)
    December 2025 - January 2026 (1 month)
    - Task:Replacing a cloud solution with a local LLM (Privacy & Latency).
    - Solution:Custom build of `llama.cpp` with CPU-specific optimizations. Benchmarking of GGUF quantizations.
    - Integration:Connection to Open WebUI via API as a drop-in replacement.
    - Tech Stack:Linux, Docker, CMake, Open WebUI, Python, llama.cpp
    Python llama.cpp Docker Quantization GGUF
  • Proof of Concept (PoC)
    Automation of Customer Orders
    December 2025 - January 2026 (1 month)
    - Task:Mapping and automation of an order process.
    - Solution:Modeling in BPMN (Camunda) and automation using multiple Python workers (inventory check, invoice, delivery).
    - Tech Stack:Linux, Camunda 7, Python, PostgreSQL, Docker
    Python Camunda Docker PostgreSQL

Recommendations

Be the first to recommend Michael

Help this freelancer shine by sharing your experience working together.

These freelancer profiles also match your criteria

AgathaA

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

2

BaptisteB

Baptiste Duhen

Fullstack developer

4.6

(4)

5

AmedA

Amed Hamou

Senior Lead Developer

4

(2)

7

AudreyA

Audrey Champion

Web developer

4.3

(3)

4

Education

  • M.Sc. Robotics, Cognition, Intelligence
    Technical University of Munich
    2018
    Vereint interdisziplinäre Kenntnisse aus den Bereichen Robotik, künstliche Intelligenz, Maschinelles Lernen und Kognitive Systeme. Ziel ist es, intelligente, autonome Systeme zu verstehen und zu entwickeln.

Skill set

Categories