Logo Canva Austria GmbH.

Senior Machine Learning Engineer

New

Job

  • Level
    Senior
  • Job Field
    Software, Data
  • Employment Type
    Full Time
  • Contract Type
    Permanent employment
  • Location
    Vienna
  • Working Model
    Onsite
  • Job Summary

    In this role, you will develop robust ML services, integrate models into our monorepo, automate CI/CD pipelines, and ensure service reliability through monitoring and optimization.

    Job Technologies

    Your role in the team

    You’ll be the bridge between research and production.

    Partnering closely with researchers, you’ll ensure experimental code is production ready, integrate models into our monorepo, build shared libraries and services, and create the tooling and processes that let multiple model variants ship safely and quickly.

    Your work shortens the research-to-user loop, reduces duplication, and ensures our ML features are reliable, observable, and easy for other teams to adopt.

    At the moment, this role is focused on:

    • Research-to-Production Pipeline: Hardening experimental models (containerization, tests, CI/CD), making them deployable for real users.
    • Library development: Converting experiments into well-factored libraries with clear APIs, dependency hygiene, and versioning—so teams can import rather than copy-paste.
    • Multi-Variant & Parallel Execution: Enabling multiple model variants to run in parallel (for canaries, A/B tests, and rollbacks) across our image-generation and related codebases.
    • Developer Experience & Documentation: Creating templates, examples, and guidance; offering supportive, high-signal communication so others can adopt libraries confidently.
    • Reliability, Observability & Cost: Instrumenting services with metrics/logging/tracing, setting SLIs/SLOs, and optimising inference performance and spend.

    Primary Responsibilities:

    • Productionise research models: refactor, test, containerise, and integrate them into the monorepo for scalable reuse.
    • Build and maintain inference services, SDKs, and shared libraries that standardise pre/post-processing and interfaces across variants.
    • Create multi-variant runners and rollout frameworks (feature flags, canaries, A/B testing, automated rollbacks).
    • Establish CI/CD workflows, artifact management, and reproducible builds for ML services and model assets.
    • Add robust observability (dashboards, alerts) and reliability practices (load tests, chaos/resiliency checks).
    • Optimize inference (batching, caching, quantization/compilation, hardware utilization) to reduce latency and cost.
    • Partner with researchers and product engineers via code reviews, pair sessions, and clear documentation to accelerate adoption.
    • Drive good engineering hygiene in the research codebase: testing strategy, dependency management, and de-duplication across multiple model variants.

    This text has been machine translated. Show original

    Our expectations of you

    Qualifications

    You’re probably a match if you:

    • Have strong software engineering fundamentals and excellent Python skills; you’re comfortable turning notebooks and prototypes into production-grade services.
    • Have shipped ML systems in production (containers, APIs, CI/CD), ideally within a monorepo environment.
    • Can read research code and refactor it into clean abstractions with stable, well-documented interfaces.
    • Understand service reliability and observability (metrics, tracing, logging) and how they apply to ML systems.
    • Communicate clearly and empathetically—especially when guiding others to adopt libraries and best practices.

    Nice to Have:

    • Familiarity with model-serving/optimization tooling (e.g., ONNX, TorchScript, Triton, quantization).
    • Background with multimodal/image generation stacks or LLM-adjacent tooling (not the core focus, but helpful).
    • Knowledge of MLOps practices (model registries, artifact stores, dependency/version management).

    Experience

    • Bring cloud experience (AWS a plus) without needing to be a deep specialist.
    • Java experience (or JVM ecosystem) for services that integrate with ML components.
    • Experience with experimentation platforms (feature flags, A/B testing) and safe rollout strategies.

    This text has been machine translated. Show original

    Benefits

    Health, Fitness & Fun

    Work-Life-Integration

    Food & Drink

    More net

    Job Locations

    Map of company locations
    • Location Vienna

      Location Vienna

      Ungargasse 37

      1030 Wien

      Austria

    Topics that you deal with on the job

    This is your employer

    Canva Austria GmbH.

    Canva Austria GmbH.

    Wien

    By making complicated tech simple, we strive to enable individuals and businesses of all sizes to benefit from the recent advances in Visual AI. Our tools simplify and accelerate workflows, foster creativity, and enable others to create new products.

    Description

  • Company Size
    50-249 Employees
  • Founding year
    2018
  • Language
    English
  • Company Type
    Startup
  • Working Model
    Hybrid, Onsite
  • Industry
    Internet, IT, Telecommunication
  • Logo Canva Austria GmbH.

    Senior Machine Learning Engineer

    Location
    Vienna
    Working Model
    Onsite

    More Jobs