Site Reliability Engineer Job at Xsolla, Raleigh, NC

U0EwLzRyM0RNZGZ2emlvU1RCNDBjcTgxR3c9PQ==
  • Xsolla
  • Raleigh, NC

Job Description

ABOUT US

At Xsolla, we believe that great games begin as ideas, driven by the curiosity, dedication, and grit of creators around the world. Our mission is to empower these visionaries by providing the support and resources they need to bring their games to life. We are committed to leveling the playing field, ensuring that every creator has the opportunity to share their passion with the world. 

Headquartered in Los Angeles, with offices in Berlin, Seoul, and beyond, we partner with industry leaders like Valve, Twitch, and Ubisoft to clear the paths for innovation in gaming. Our global reach spans over 200 geographies, offering more than 700 payment methods in 130+ currencies.

Longevity Opportunity Vision Enjoy the game!

Requirements

  • Proven experience as a Site Reliability Engineer, or similar Software Engineering role in a large-scale production environment (3+ years). 6+ years
  • overall in IT area (as Ops or Developer).
  • Proficiency in scripting languages such as Python, Bash. Strong understanding of Go and PHP will be a plus.
  • Deep knowledge of monitoring systems such as Datadog, Prometheus, Grafana.
  • Good understanding of continuous integration/continuous delivery processes and platforms (Gitlab preferred). Experience with Helm.
  • Experience with Docker, Kubernetes, or other container orchestration systems.
  • Familiarity with infrastructure automation tools like Terraform.
  • Experience with automation, system administration, and system hardening.
  • Experience with Linux-based infrastructures, Linux/Unix administration.
  • Demonstrated problem-solving skills, particularly debugging and troubleshooting complex software systems. Ability to work under pressure.
  • Excellent communication skills with a capacity to articulate and solve complex technical problems
  • Xsolla Technology Stack: Ubuntu, Kubernetes, Gitlab, Terraform, Terragrunt, Puppet, Nginx, Google Cloud Platform, Datadog, Prometheus, Grafana,
  • ELK, Zabbix and Harbor.

Responsibilities

  • Ensure high reliability and availability and meet SLAs, SLOs, and SLIs.
  • Monitor the system for issues and respond to incidents, ensuring quick resolution to maintain high system availability.
  • Drive incident resolution and process improvements to minimize downtime and increase operational transparency.
  • Ensure all key services are measured, monitored and raising alerts when needed.
  • Develop comprehensive monitoring solutions to provide full visibility to the different platform components using tools and services like
  • Kubernetes, Datadog, Prometheus, Grafana and others.
  • Support services before they go live through activities such as capacity planning, monitoring setup, logging, and production readiness reviews.
  • Engage in service capacity planning and demand forecasting, performance analysis, and system tuning.
  • Collaborate with the development teams to enhance the product's operational stability.
  • Build and drive the automation systems that maintain system health

Education

  • IT professional certifications are not required, but it will be a plus
  • Certified Kubernetes Administrator or Developer
  • HashiCorp Certifications
  • GCP Certifications

$120,000 - $150,000 a year

Benefits:

We are passionate about fostering a supportive environment for our team, so we prioritize the physical, mental, and emotional well-being of our employees and their families through a comprehensive Benefits Program. This includes 100% company-paid medical, dental, and vision plans, unlimited Flexible Time Off, and a personalized career roadmap for each employee. By investing in professional development through training and educational opportunities, we ensure that our team thrives both personally and professionally. Together, we’re not just building a business; we’re cultivating a community that values creativity, collaboration, and the transformative power of play.

By submitting the following job application form, you consent to Xsolla processing your data for career-related inquiries and potential employment opportunities. We process your data in accordance with this Xsolla Privacy Notice for Job Applicants . Please direct any inquiries regarding your data privacy to careers@xsolla.com.

Job Tags

Remote job, Full time, Flexible hours,

Similar Jobs

Empire Today LLC

Dedicated Carpet Installer Job at Empire Today LLC

 ...Are you looking for flooring installation jobs? Stop hunting for work - Empire Today has work for flooring installers right now! We are looking for experienced contractors looking to grow their installation business. Daily Work!! Empire can provide 12 installations... 

Arfa Solutions, Llc

Sr. Site Reliability Engineer Job at Arfa Solutions, Llc

 ...reads it and can understand WHATS WRONG) Performance on the site, jump in front end JavaScript (lighthouse - performance) what is...  ...processes Collaborate with cross-functional teams to ensure the reliability, security, and performance of systems Identify and resolve... 

Cubic Defense

Senior Software Engineer- embedded C++ Job at Cubic Defense

Join to apply for the Senior Software Engineer - Embedded C++ role at Cubic Defense .Company DetailsWhen you join Cubic, you become part of a company that creates and delivers technology solutions in transportation to make peoples lives easier by simplifying their... 

Teamlogic It

Onsite IT Technician Level ll supporting Green Bay and Fox Valley Job at Teamlogic It

 ...offer competitive wages, PTO, career opportunities, all within a supportive work environment and ongoing training. We encourage, support...  ...managing VLANs and routers ~ Experience with RMM and PSA software ~ Experience setting up, configuring firewalls such as SonicWALL... 

Lucid Staffing Solutions

Travel Labor & Delivery and Nursery Registered Nurse - $2,038 per week Job at Lucid Staffing Solutions

Lucid Staffing Solutions is seeking a travel nurse RN Labor and Delivery for a travel nursing job in Richlands, Virginia. Job Description & Requirements ~ Specialty: Labor and Delivery ~ Discipline: RN ~ Start Date: 07/21/2025~ Duration: 13 weeks ~36 hours...