DevOps / SRE Offline
Domain: Telecom
Project:
A messaging platform that accepts, processes, and delivers messages between mobile messengers, social networks, mobile providers, and enterprises. It processes and delivers more than 600 billion messages per year and provides a single endpoint for message processing. It integrates with large enterprises and banks and provides a secure and reliable messaging service. Supported channels: SMS, MMS, RCS, WhatsApp, Facebook, WeChat, etc. Software performance is a key point of the project, as 80% of nearly 2 billion daily messages are delivered within 4 hours.
What to do:
🔹Assisting the teams with release builds, environments, escalations, and remediation efforts supporting enterprise messaging applications.
🔹Working with the team to improve the speed and success rate of feature development in the sustaining development release cycles.
🔹Reducing production MTTR by working with engineers to:
- Remediate “performance agent” monitoring gaps
- Audit LNMS and remove zombie graphs (dead, no traffic)
- Remediate Log Aggregation gaps
🔹Delivering QoL automation such as service start, stop, and health check scripts
🔹Delivering Operations Release validation automation
🔹Supporting Release deployments in the NSX Lab environment
🔹Assisting with manual deployments
🔹Assisting with compiling documentation and providing useful feedback to the support organization (documenting error reports and monitoring performance metrics)
Requirements:
🔹Red Hat Linux
🔹Networking concepts
🔹Experience working with containerization solutions, K8s.
🔹Provisioning and deployment through Terraform, Helm.
🔹AWS cloud desirable
🔹Strong Ansible and scripting experience.
🔹Good understanding of CI/CD best practices (Jenkins, GitLab)
🔹Strong knowledge of Prometheus, Grafana, ELK
🔹Strong troubleshooting skills.
🔹Upper-Intermediate English level
What we offer:
🔹100% remote
🔹good referral program
🔹paid vacation and sick leave
🔹national holidays off
🔹compensation for education
🔹medical insurance
Open to Ukrainian candidates only