Audio ML Engineer

SQRD.tech

On-site | Warsaw, Poland
Experience: 5+ years in ML/AI

Requirements:

5+ years of experience in ML/AI engineering with a focus on Generative AI (LLMs, TTS) and voice-based conversational systems.
Strong background in API/service engineering and streaming implementations.
Proven experience with hardware/software validation and compatibility checks (GPU, OS, dependencies, audio I/O, codecs).
Expertise in stress testing, latency consistency, and quality assurance for audio pipelines.
Experience with model release management: shipping, rollbacks, integrations, and end-to-end impact assessment.

Nice to Have:

Performance optimization: load balancing, dynamic batching, caching.
Knowledge of observability and reliability systems (logging/tracing, health checks, alerting, dashboards).

Key Tech Areas: Machine Learning, AI, APIs, Audio Streaming (ASR, TTS, VAD, Diarization)

Responsibilities:

Model Delivery & Integration

Own end-to-end delivery of audio ML models, bridging the Applied Science and Engineering teams.
Architect and implement streaming audio services (ASR, TTS, VAD, Diarization).
Validate compatibility across hardware/software setups in development and production.
Ensure smooth integration with other system components while preserving audio quality and performance.

Testing & Quality Assurance

Perform stress, soak, and performance testing for audio services.
Maintain audio quality, low latency, and stability under heavy load.

Model Release & Lifecycle Management

Manage the release of new model versions and architectures.
Provide rollback strategies and monitor end-to-end impact.

Performance & Observability (Optional)

Optimize performance with batching, caching, and load management.
Build observability tools: structured logging, tracing, health monitoring, alerting, and dashboards.

Project:

This role is for an Audio ML Engineer responsible for delivering, testing, and scaling audio-based AI systems (speech recognition, synthesis, and conversational AI).

Required languages: English (B2, Upper Intermediate)

Published 1 October

Countries where we consider candidates: Poland

ML / AI

Employment: Full-time
Domain: Other
Outstaff
Office: Poland