Production Support Engineer ID59491
Important: after confirming your application on this platform, youβll receive an email with the next step: completing your application on our internal site, LaunchPod. So keep an eye on your inbox and donβt miss this step β without it, the process canβt move forward.
Why join us
If youβre looking for a place to grow, make an impact, and work with people who care, weβd love to meet you! :)
About the role
We are looking for a Production Support Engineer to monitor and support production systems across a multi-account AWS environment, serving as the front line of a tiered support model for a fintech platform. You will triage incidents, execute runbooks, manage SLA performance, and coordinate with engineering, help desk, and security partners. The role includes on-call rotation and structured post-incident review with a focus on continuous operational improvement.
What you will do
β Monitor production systems and respond to alerts across infrastructure, application, and data layers;
β Perform first-level triage on incidents and support requests; escalate to developers with thorough context and diagnostics;
β Execute patching, operational tasks, and documented runbooks;
β Participate in on-call rotation and support scheduled deployments as needed;
β Conduct post-incident reviews and feed lessons back into runbooks and playbooks;
β Identify recurring issues and systemic risks before they escalate;
β Improve documentation and monitoring coverage between active support activities;
β Contribute to operational reporting and SLA dashboards;
β Manage and track SLA performance across all supported services; surface risks proactively;
β Coordinate with Help Desk / Deskside Support partner for production tasks affecting employees;
β Escalate security incidents and vulnerabilities to the vCISO partner per documented procedures.
Must haves
β 3+ years in production support, SRE, NOC, or operations engineering;
β Hands-on AWS experience with EC2/ECS, networking (VPC, security groups, ACLs), and IAM;
β Operational proficiency with PostgreSQL and / or Amazon RDS;
β Incident triage across infrastructure and application layers;
β Track record managing SLAs in a ticketed support environment such as Jira;
β Strong written communication for escalation and post-incident reporting;
β Upper-intermediate English level.
Nice to haves
β Experience with structured incident response such as ITIL or NIST;
β Familiarity with Datadog, CloudWatch, or comparable observability platforms;
β Exposure to AWS data services including Glue, S3, Athena, and EventBridge;
β Basic IaC familiarity with CloudFormation, SAM, or Terraform;
β Background in financial services or regulated environments;
β AWS certification such as SysOps Administrator or Solutions Architect;
β Experience with scripting/automation to reduce manual toil.
Perks and benefits
β Professional growth: Accelerate your professional journey with mentorship, TechTalks, and personalized growth roadmaps
β Competitive compensation: We match your ever-growing skills, talent, and contributions with competitive USD-based compensation and budgets for education, fitness, and team activities
β A selection of exciting projects: Join projects with modern solutions development and top-tier clients that include Fortune 500 enterprises and leading product brands
β Flextime: Tailor your schedule for an optimal work-life balance, by having the options of working from home and going to the office β whatever makes you the happiest and most productive.
Meet Our Recruitment Process
Asynchronous stage β An automated, self-paced track that helps us move faster and give you quicker feedback:
β Short online form to confirm basic requirements
β 30β60 minute skills assessment
β 5-minute introduction video
Synchronous stage β Live interviews
β Technical interview with our engineering team (scheduled at your convenience)
β Final interview with your future teammates
If itβs a matchβyouβll get an offer!
Required languages
| English | B2 - Upper Intermediate |