We are seeking an experienced Senior SRE who is passionate about the security, performance, and reliability of applications hosted in our global multi-cloud and on-prem data centers.

Challenges you will solve:
Participate in all stages of infrastructure provisioning, primarily providing the staging and production support.
Assist in implementation of security best practices and initiatives at all levels of the systems infrastructure.
Adhere with SRE (Site Reliability Engineering) principles/pillars on incident management and service level objectives.
Work closely with DevOps engineers to apply/improve the automation scripts and system designs shared by DevOps to improve systems efficiency in production environment.
Ensure maximum uptime and stability of cloud and on-premises environments, especially in staging and production environments.
Apply the latest OS and security patches ensuring the compatibility of underlying running application.
Lead on conducting in the disaster recovery/business continuity (DRBC) routine exercises.
Handle help desk & JIRA tickets and mitigate any production issues.
Ensure accurate knowledge base documentation in a timely manner.
About You:
Strong knowledge of secure web app deployments in AWS (4+ years).
Advanced experience as a Linux or Windows server administrator.
The ability to work with little supervision; must be self-driven and motivated.
Experience with continuous integration/continuous delivery (CI/CD) — Jenkins and Git.
Experience with containerized microservices delivered with Docker, Kubernetes (Kops, AWS EKS), or OpenShift 4.x.
Manage & optimize unified logging system and APM (Application Performance Management) monitoring tools, constantly reduce the MTTR (Mean Time to Recovery).
Strong experience with hybrid infrastructure systems monitoring and proactive incident management.
Strong scripting skills using Shell and Python or Go (a plus).
Some knowledge of web application programming languages (such as JavaScript, NodeJS, Java, etc.).
Ability to proactively triage on troubleshooting urgent production issues under high time pressure with precision.
Experience in working collaboratively with various applications development teams throughout the organization to resolve mission critical problems.
Excellent written and oral communication skills necessary to produce and process technical documents.
Excellent problem-solving and analytical skills and the ability to translate business requirements into information systems solutions.
Experience with IT security.
Someone who is a team player.
Familiarity/experience with the DevOps process.
Professional IT certifications, such as Red Hat Certified Engineer/Windows Server, and AWS certifications (a huge plus).
Relevant work experience (8+ years), either in software development or IT infrastructure.
Master’s degree in technology related, engineering or computer science (a plus).
Participate in a weekly on-call rotation (~every 3-4 weeks) as needed.
Provide mission critical production support in case of an outage during off business hours if necessary.
We Believe in Equal Opportunity

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, gender identity or any other characteristic protected by law.

About Talent500

We are on a mission to build the largest global remote workforce
Fast-growth businesses hire, build, and manage global teams through us. We’ve built talent hubs around the world and the largest Ai-enabled global talent platform. And through this we’ve democratized access to opportunities for professionals everywhere.
The war for talent is heating up. As an organization you want to attract and retain the best talent. And yet, challenges include a sudden surge in demand, the great resignation, reopening economies, a candidate revolution and employee burnout.

This is where we’ve repeatedly helped global tech majors and Fortune500 enterprises stand apart. At Talent500, we are passionate about delivering exceptional results. And we strive to invigorate the industry while doing this innovatively. We hand-pick opportunities from the world’s top-tier companies who are focussed on result-based outcomes. These then powered by a top-tier, location-independent tech workforce.

The future of work is not just remote - it is ‘global’. And we’re the partner to take you there.

Company website:

DOU company page:

Job posted on 11 May 2022
6 views    0 applications

To apply for this and other jobs on Djinni login or signup.
  • home_work Full Remote
  • shopping_basket Product