SRE, Prod Support Engineer in Azure IRC211155

Description:

The client is a global technology-enabled services company dedicated to advancing the world of veterinary medicine and empowering veterinary practices. They are bringing together products, services, and technology into a single platform that connects the customers to the solutions and insights they need to work best. There are a set of long-standing veterinary software systems, with more than 20,000 veterinary practices in the U.S., Canada, UK/Europe, and Australia.

The project is a Next-generation Cloud platform that emerges from all existing software systems to provide a truly global open platform.

– Technical Solution

GL team works on the migration of the existing legacy system approaching rebuild strategy and develops a completely new solution for practice management and operations including microservices, Front-end and integration layers.

– Architecture

Event-driven approaching Microservices implementation.

– Technical Challenges

Integration with existing customer’s systems and services.

Strangling of the existing legacy system which is a subject of migration.

Large scale project with several development teams across several geographies.

The process is SAFe (Scaled Agile Framework) with good support of internal tools and well-established Scrum events. A lot of verbal and written communication with US team members.

Requirements:

MUST HAVE

Experience in establishing monitoring and alerting

Practical experience of CI/CD implementation.

Experience with Azure DevOps (former VSTS).

Experience with Azure Cloud Solutions.

Experience with Azure PaaS Services configuration.

Experience with configuring Azure Security.

Experience with SVC (Git).

Experience in Powershell and shell scripting.

Intermediate spoken and written English.

NICE TO HAVE

Basic knowledge of C# programming language and C# Scripting (.CSX).

Responsibilities:

Implement and maintain monitoring and alerting systems to ensure early detection and resolution of issues.

Monitor and analyze system performance, identifying bottlenecks and areas for optimization.

Develop and maintain incident response and resolution processes, ensuring minimal impact on service availability.

Establishes KPIs and delivers reports on the overall health and performance of the system.

Collaborate with development teams to integrate SRE principles into the software development lifecycle.

Automate infrastructure provisioning, configuration management, and deployment processes.

Conduct system capacity planning and scalability assessments.

Tunes the system and configures platform components to maximize performance and efficiency. Applies patches to comply with security standards and best practices. Establishes a maintenance schedule, policies and procedures then communicates this to the larger team and stakeholders.

Provides required support and follows established procedures to report and address failures and outages.

Researches and identifies the root cause of issues and works to resolve them within established service-level agreements.

Provides after-action reports to document root causes and corrective actions taken

Participate in post-incident reviews and root cause analysis to drive system improvements.

Actively participate in architecture and design reviews to provide input on scalability, reliability, and performance aspects of systems.

Contribute to the development and improvement of internal tools, frameworks, and processes that support SRE initiatives.

Stay informed about industry trends, emerging technologies, and best practices related to SRE and incorporate them into the team’s work.

Configuring CI/CD process via Tuning Azure DevOps (VSTS).

Create, update, maintain, and troubleshoot configurations for all environments.

Collaborate with the US DevOps team on building reusable components for project needs.

Regular meetings with the customer’s team (EST time zone).

About GlobalLogic

GlobalLogic, a Hitachi Group Company, is a leader in digital engineering. We put people first. As part of our team, you will grow, be challenged, and expand your skill set working alongside highly experienced and talented people.

In Ukraine, GlobalLogic is:
- one of the TOP-3 largest IT companies
- 6,000+ professionals
- 90%+ of our projects involve complex R&D
- fully autonomous offices are located in Kyiv, Kharkiv, Lviv, and Mykolaiv, along with 10 temporary mini-offices across Ukraine

What is GlobalLogic in numbers:
- 29,000+ engineers
- 20+ countries
- 500+ active clients
- 50+ product engineering centers
- Headquartered in Silicon Valley

Company website:
https://bit.ly/GlobalLogic-Ukraine

DOU company page:
https://jobs.dou.ua/companies/globallogic/
Job posted on 22 April 2024
98 views    7 applications

To apply for this and other jobs on Djinni login or signup.