Senior Site Reliability Engineer (HERE Maps) (offline)

Why we rock?

- Data-centric development. We build reusable components that run complex data pipelines at scale through data management, processing and distribution services and APIs.
- Visualised location intelligence. The maps rendering service we are working on is one of the key Platform's client-facing features which helps businesses to make sense of location data by empowering 2D and 3D rendering capabilities of modern web browsers.
- The way of working. Fresh setup, minimum to none legacy processes and technologies, a good chance to start over with a clean slate.
- Best practices. Platform possesses strong background in continuous delivery approaches, automated testing, and employs the best DevOps practices to ensure the Platforms reliability at scale.
- Self-fulfillment. Stand at the roots of the Platform that will redefine how society thinks about location data and boost your professional value by mastering edge data management techniques.

Our team is working on the Open Location Platform (OLP) which provides the next generation of location based services intelligence. With every connected IoT device or sensor capable of generating and sharing location data, the Platform helps to make better use of that data and transform it into useful services for people and organizations all in real-time. The Platform is meant to become the go-to destination for location services, supporting not only autonomous vehicles but smart cities and intelligent transportation systems too.

We are looking for a strong Site Reliability Engineer to join the high-profile SRE team which supports existing Service Delivery Platform (SDP) and works on development of the new Delivery Platform (DP) for HERE. OLP Delivery Platform is the Kubernetes-based delivery platform that is used to deploy services in OLP. SDP is a common and supported way of getting from code to a customer-facing production grade service. SDP provides a standardized and pre-integrated tooling platform that supports a standardized way of working that is built around the notion of Continuous Integration (CI) and Continuous Delivery (CD).

Responsibilities:

- Stabilize and improve production grade services
- NOC on-call support: provide timely resolution of customers' issues and quickly react on incidents
- Support DP users: respond to queries, provide guidance, fix issues
- Technical guidance: provide technical guidance to Service teams to on-board to DP technical topics include AWS, Kubernetes clusters, Docker, Event Management, Workflow, etc.,
- Defects triaging and resolution
- Cluster updates: new software update or bug fixes, build automation around software deployment
- New cluster creation: build automation around cluster creation, based on multi-region needs for new cluster creation
- Development: contribute to cluster development work, attend Sprint ceremonies, pick up backlog items
- Write documentation to technical solutions and incident post-mortem (lessons learnt).

Requirements:
- 5+ years of experience as a Site Reliability Engineer or DevOps Engineer with production grade systems;
- Good understanding of TCP/IP networking;
- Working proficiency in configuring Linux-based operating systems;
- Solid experience with AWS or similar cloud platform;
- Hands-on programming in Python, experience of commercial software development with it;
- Hands-on experience with OSS tools stack for configuration management, delivery and ops: Docker, Kubernetes, Prometheus, Terraform, Puppet, Git;
- Upper-intermediate level of English, or higher;

As a plus:
- Experience with one or more of the following: Consul, Go language, Prometheus, Grafana, Fluentd

About Intellias

Intellias - cмартова та комфортна компанія з дружніми відносинами в командах, а також гнучким та ефективним менеджментом та особливими умовами для кожного працівника.

Подробиці за посиланням: http://www.intellias.ua/about-us

Company website:
http://www.intellias.com/

The job ad is no longer active

Look at the current jobs Sysadmin Kyiv→