Senior Platform Engineer
Innovecs is a global digital services company with a presence in the US, the UK, the EU, Israel, Australia, and Ukraine. Specializing in software solutions, the Innovecs team has experience in Supply Chain, Healthtech, Collaboration Tech, and Gaming.
For the fifth year in a row, Innovecs is included in the Inc. 5000, the list of fastest-growing private companies in the US, and a ranking of the best global outsourcing service providers by IAOP. Recently, Innovecs was honored with the prestigious Global Good Awards for the Employee Engagement & Wellbeing, won gold at the Employer Brand Management Awards, and was included in the Global Top 100 Inspiring Workplaces Ranking.
Our value to you:
- Flexible hours and remote-first mode
- Competitive compensation
- Complete Hardware/Software setup โ anything you need for work
- Open-door culture, transparent communication, and top management at a handshake distance
- Health insurance, vacation, sick leaves, holidays, paid maternity/paternity leave
- Access to our learning & development center: workshops, webinars, training platform, and edutainment events
- Virtual team buildings and social activities to celebrate the Innovecs lifestyle
Requirements:
- Deep expertise in platform engineering with a strong grounding in SRE principles (SLIs/SLOs, incident response, automation, reliability).
- Proven experience with cloud infrastructure (preferably Azure), container orchestration, infrastructure-as-code (Terraform and Terragrunt), and automation.
- Demonstrated ability to resolve complex reliability and infrastructure challenges independently and proactively.
- High sense of ownership, reliability, and willingness to go above and beyond for the team and organization.
- Excellent communication and collaboration skills, able to work effectively across engineering and product teams.
- Trusted advisor and role model for platform and SRE excellence, setting standards for reliability, automation, and operational maturity.
Responsibilities:
- Architect, implement, and manage cloud-native infrastructure (Kubernetes, Terraform, Terragrunt, Docker, Azure) to support scalable and reliable platforms.
- Champion SRE principles: drive service reliability, availability, and performance through automation, monitoring, and incident management.
- Lead platform automation, CI/CD, and infrastructure-as-code initiatives to improve deployment velocity and system consistency.
- Own observability, monitoring, and alerting using Datadog and related tools, ensuring actionable insights and rapid incident response.
- Collaborate with software engineers to ensure robust integration between application and platform layers, and to improve developer experience (DevX).
- Provide expert support for critical incidents and production issues, demonstrating high ownership and urgency (including out-of-hours support when required).
- Mentor and guide engineers in platform, SRE, and DevOps best practices, fostering a culture of reliability and continuous improvement.
- Identify and address gaps in tooling, automation, and platform reliability to proactively improve engineering outcomes.
Required languages
| English | B2 - Upper Intermediate |
| Ukrainian | Native |
๐
$4000-6000
Average salary range of similar jobs in
analytics โ
Loading...