Site Reliability Engineer (SRE) (offline)

Moon Active is one of the world's fastest-growing mobile game companies, providing entertainment for millions of active users across the Universe.

Our goal is to develop top quality casual games and connect people, friends and players from all over the world. Our latest game, Coin Master, is a top grossing game in every country it was officially launched.
We follow our belief that reaching success comes from setting high standards and striving to be the best at all we do:
-Stunning art
-Fun gameplay
-Marketing expertise
-Data science
-Advanced technology

Moon Active, a mobile games start-up with millions of daily active players worldwide is looking for an experienced Site Reliability Engineer (SRE) to join our team and to help create awesome games at a company that puts quality at the forefront.

Working within our SRE team, the Site Reliability Engineer (SRE) will support production activities to sustain our platforms & tools, troubleshoot, develop, maintain and document technical solutions related to Moon Active’s production infrastructure.

This position requires hands-on technical work as well as good analytical skills. You will train other SRE and act as the first level of contact for technical escalations.

Responsibilities:
• Monitor and maintain Moon Active production infrastructure in order to maximize uptime;
• Handle follow-ups and retrospective for production-related incidents and tasks;
• Work along-side with the NOC Team Lead on mentoring and training of Junior SRE;
• Regular pro-active review, tuning, and automation of monitoring systems based on business needs and production incidents;
• Regular review and updating of operational delivery processes for the Monitoring department;
• Act as 1st and 2nd tier infrastructure and application support;
• Gathering information from different sources and then cross-referencing it in order to attain a resolution to production incidents;
• Work in cooperation with different teams such as: Product, R&D, DevOps and Support teams to escalate, troubleshoot and resolve complex issues;
• Ensure proper documentation is provided for all supported activities and standards;
• Conduct Proof Of Concepts for new tools and technologies.

Requirements:
• Experience in advanced support of web/internet applications as NOC/SOC/SRE;
• Understanding of Business KPIs;
• Comfortable briefing and reporting to senior executives and clients;
• Familiar with cloud (i.e. AWS, Google) network architectures and the world of the Internet;
• Previous experience with Logging, Monitoring and Management systems (e.g. AWS CloudWatch, Google StackDriver, SignalFX, DataDog, ELK);
• Experience with SQL (preferably Google Big Query);
• Good analytical and technical troubleshooting skills;;
• High English proficiency (verbal and written);
• Previous managerial experience.

What's in it for you:
• A challenging function, with a lot of responsibility in an unique dynamic environment;
• Work with the latest technologies;
• Work with skilled and professional teammates. Collaborating together to create awesome games served to tens of millions of players;
• State of the art, cool, centrally located offices with warm atmosphere which creates really good working conditions;
• Competitive salary.

About Ciklum

Ciklum is a top-five global Software Engineering and Solutions Company. Our 3,000+ IT professionals are located in the offices and delivery centres in Ukraine, Belarus, Poland and Spain.

We are looking forward to seeing you as a part of our team!

Company website:
https://jobs.ciklum.com/

The job ad is no longer active

Look at the current jobs Sysadmin Kyiv→