Site Reliability Lead Engineer (offline)

Responsibilities

Monitor and maintain Moon Active production infrastructure in order to maximize uptime
Handle follow-ups and retrospective for production related incidents and tasks
Recruit, mentor and train SREs
Regular pro-active review and tuning of monitoring systems based on business needs and production incidents
Regular review and updating of operational delivery processes for the SRE Team
Act as 1st and 2nd tier infrastructure and application support
Gathering information from different sources and then cross-referencing it in order to attain a resolution to production incidents
Work in cooperation with different teams such as: Product, R&D, DevOps and Support teams to escalate, troubleshoot and resolve complex issues
Ensure proper documentation is provided for all supported SRE activities and standards
Conduct Proof Of Concepts for new SRE tools and technologies

Requirements

Experience in advanced support of web/internet applications as NOC/SOC/SRE engineer
Understanding of Business KPIs
Comfortable briefing and reporting to senior executives and clients
Familiar with Cloud (i.e. AWS, Google) network architectures and the world of the Internet
Previous experience with Logging, Monitoring and Management systems (e.g. AWS CloudWatch, Google StackDriver, SignalFX, DataDog, ELK)
Experience with SQL (preferably Google BigQuery)
Good analytical and technical troubleshooting skills
High English proficiency (verbal and written)
Previous managerial experience

What's in it for you

Competitive salary with yearly performance reviews
Comfortable, centrally-located office with sport and recreation areas, bus shuttle to subway stations
Vacation: 20 business days, unlimited sick leaves
Flexible work schedule
Kitchen with healthy snacks
Medical insurance, gym and swimming pool
Corporate events (Happy hours on Fridays, team buildings, and parties)
Company presents for birthdays and work anniversaries
Education expenses coverage
English classes
Corporate trainings
Relocation assistance for non-local candidates
Bicycle/Car parking

About Ciklum International

Ciklum (www.ciklum.com) is a leading global product engineering and digital services company, serving Fortune 500 and fast-growing organisations.

Headquartered in the UK, Ciklum has 4,000+ software developers, designers, product managers and data scientists around the world building tailored digital solutions that leverage emerging technologies. Ciklum specialises in enabling digital transformation for some of the largest household names in the digital economy.

The Company empowers its clients and people to exceed their potential and pursue the extraordinary.

Join one of the top 10 employers in Ukraine, according to Forbes.
Boost your skills and make a difference with cutting-edge projects, skilled colleagues and the latest tech stacks.

Company website:
https://www.ciklum.com/

DOU company page:
https://jobs.dou.ua/companies/ciklum/

The job ad is no longer active
Job unpublished on 24 October 2020

Look at the current jobs Sysadmin Kyiv→