Senior Site Reliability Engineer
5 months ago
At AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox Live, PlayStation Network, and EA Origin. We are backed by top investors including Softbank, Sony Interactive Entertainment, Galaxy Interactive, NetEase, and Krafton. Our latest Series B funding has firmly solidified our place as a top player in the gaming industry. AccelByte's talent has decades of experience building and shipping some of the largest game and distribution platforms in the world.
We believe that the best companies empower employees to make decisions, obsess about the best user experience, and are not afraid to make and learn from their mistakes. Our culture is based on humility, openness to feedback, drive, and collaboration, which we feel results in the best performing teams. As a company that values diversity, inclusion, and employee growth, our employees have opportunities to work with and learn from teams all over the world. We offer competitive salaries, a full range of health benefits, social activities, career growth opportunities, and an amazing team. Come join us
**Position Summary**:
As a **Senior Site Reliability Engineer**, you design, implement, and maintain infrastructure and operational systems that accomplish a given goal. You discover requirements and guide other engineers collaborating in an area and do exemplary work on complicated problems. You optimize performance, drive efficiency, and ensure the reliability of critical infrastructure.
**Essential Functions/ Responsibilities**:
The **Senior Site Reliability Engineer **is accountable for the following functions and responsibilities:
- Review, provide feedback, and mentor coworkers on changes to maintain reliability.
- Design and develop infrastructure and operational tasks with scalability and stability in mind.
- Contributing in automating solutions to optimize tasks, improve efficiency, and reduce manual effort.
- Design, implement, and maintain scalable infrastructure and deployment frameworks using K8s and CNCF projects.
- Direct a secure, cost-effective, and scalable cloud platform.
- Initiate and conduct thorough investigations of operational incidents and proactively prevent future issues and designing resilient approaches based on insights from operational incidents for long-term mitigations.
- Collaborate with stakeholders to deliver cost-effective, excellent infrastructure solutions and identify areas for improvement.
- Communicate directly with clients, understanding their needs and providing exceptional support.
- The ability to train and mentor less experienced engineers and set the direction for other engineers.
- Model standards for engineering excellence
- Discover requirements by working with PMs and stakeholders
- Perform other duties as assigned
**Qualifications/Experience Required**:
- Specializes in operations and reliability automation.
- 5+ years of professional infrastructure and operational engineering experience with Linux administration.
- Proven track record of infrastructure as code, configuration management, and package management.
- Collaborative completion of infrastructure or operational projects.
- Familiar with Nomad.
- Eagerness to learn new languages, technologies, and containerization principles (e.g., Docker, Kubernetes).
- Practical knowledge of networking, storage, and container technologies.
- Robust knowledge and experience in cloud computing (preferred AWS/GCP).
- Proven experience with automation, CI/CD, and GitOps tools.
- Experience with monitoring and alerting tools (e.g., Prometheus, Grafana, ELK/EFK, Splunk, Datadog, OpsGenie, PagerDuty).
- Software development and scripting experience with Bash, Python, and/or Golang.
- Proficiency in written and verbal English language for remote work.
- Flexibility to adjust work routines/schedules to meet company and customer needs.
- Previous professional infrastructure or operational experience preferred.
- Experience at a AAA game studio or software product company preferred.
- Experience working with cloud platforms or web products preferred.
- Experience in a multinational technology startup is a big plus.
-
Senior Site Reliability Engineer
6 months ago
Jakarta, Indonesia DKatalis Full time**Site Reliability Engineer**: **About DKatalis** DKatalis is a financial technology company with multiple offices in the APAC region. In our quest to build a better financial world, one of our key goals is to create an ecosystem linked financial services business. DKatalis is built and backed by experienced and successful entrepreneurs, bankers, and...
-
Senior Site Reliability Engineer
5 months ago
Jakarta, Indonesia AccelByte Full timeAt AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...
-
Site Reliability Engineer
5 months ago
Jakarta, Indonesia Pro Sigmaka Full timeWe established at 2012. With experience in several industry sectors, a broad portfolio and technology platform as well as bringing a dedicated and highly qualified team, enabling the talent we provide to provide fast and responsive services, making it the best choice for companies that want to increase the usability of their businesses. OUR SERVICES -...
-
Site Reliability Engineer
3 weeks ago
Jakarta, Indonesia PT. Amalura Multi Dimensi Full timeManage and optimize cloud infrastructure (AWS, GCP, Azure). - Administer Linux system, ensuring stability and security. - Implement observability (e. g, OpenTelemtry, HoneyComb, Sentry) to monitor performance. - Optimize content delivery networks (e. g., Akamai) to enhance user experience. - Design monitoring, alerting, and incident response procedure for...
-
Site Reliability Engineer
7 months ago
Jakarta, Indonesia PT Tiga Daya Digital Indonesia (Eksad Technology) Full timeTiga Daya Digital Indonesia, a susidiary company of Triputra Group and DCI Group To be IT partner to enable client growth rapidly. Eksad Providing Services High Quality Based on Strong Experience in the industry and technology. Building the right IT Service Solution to enable it Partners in speeding up business development based on digital technology by...
-
Site Reliability Engineer
5 months ago
Jakarta, Indonesia Digital Muda Solutions Full timeDeskripsi: - Menjaga ketersediaan, kehandalan, dan performa sistem dengan fokus pada infrastruktur teknis, keamanan, dan skala pengguna. - Berkolaborasi dengan tim pengembangan dan operasi untuk merancang, menguji,dan menerapkan praktik terbaik dalam infrastruktur teknologi, serta melakukan perbaikan dan peningkatan sesuai kebutuhan. - Memastikan integrasi...
-
Site Reliability Engineer
1 month ago
Jakarta, Indonesia Paymentology Full timePaymentology is the first truly global issuer-processor, giving banks and fintechs the technology, team and experience to rapidly issue and process Mastercard, Visa and UnionPay cards across more than 50 countries, at scale. Our advanced, multi-cloud platform, offering both shared and dedicated processing instances, vast global presence and richer,...
-
Site Reliability Engineer
7 months ago
Jakarta, Indonesia PT Salva Teknologi Digital Full timeSite Reliability Engineer (Junior) - Applicants should have sufficient qualification and relevant experiences in the respective fields "Waspada terhadap Modus Penipuan pada saat proses interview. Perusahaan tidak akan memungut biaya apapun dalam melakukan proses interview. Mohon segera melaporkan ke kami, jika pada saat Anda diundang untuk interview dan...
-
Site Reliability Engineer
2 months ago
Jakarta, Indonesia Hukumonline.com Full timeManage and optimize cloud infrastructure on AWS, GCP, and Azure. - Administer and maintain Linux-based systems, ensuring their stability and security. - Implement and maintain observability solutions, including OpenTelemetry, HoneyComb, and Sentry, to monitor system performance and diagnose issues. - Configure and optimize content delivery networks, with a...
-
Site Reliability Engineer
5 months ago
Jakarta, Indonesia AccelByte Full timeAt AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...
-
Site Reliability Engineer
5 months ago
Jakarta, Indonesia AccelByte Full timeAt AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...
-
Site Reliability Engineer
5 months ago
Jakarta, Indonesia PT Astra Digital Mobil (mobbi) Full timeJob Description: - Maintain system availability, reliability and performance by focusing on technical infrastructure, security and user scale. - Collaborate with development and operations teams to design, test, and implement best practices in technology infrastructure, and make fixes and improvements as needed. - Conduct in-depth analysis of incidents and...
-
Site Reliability Engineer(DevOps)
5 months ago
Jakarta, Indonesia Digital Muda Solutions Full timeDeskripsi: - Menjaga ketersediaan, kehandalan, dan performa sistem dengan fokus pada infrastruktur teknis, keamanan, dan skala pengguna. - Berkolaborasi dengan tim pengembangan dan operasi untuk merancang, menguji,dan menerapkan praktik terbaik dalam infrastruktur teknologi, serta melakukan perbaikan dan peningkatan sesuai kebutuhan. - Memastikan integrasi...
-
Senior Mechanical Site Engineers
5 months ago
Jakarta, Indonesia Bureau Veritas Full timeBureau Veritas is a world leader in the verification, assessment and risk analysis of Quality, Environment, Health and Safety and Social Accountability (QHSE-SA). The Group provides inspection and auditing, compliance verification and certification services to support organisations of all sizes, belonging to all sectors, both public and private, from...
-
Site Reliability Engineer
5 months ago
Jakarta, Indonesia AccelByte Full timeAt AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...
-
Site Reliability Engineer Manager
6 months ago
Jakarta, Indonesia AccelByte Full timeAt AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...
-
Site Reliability Engineer for Fita
3 months ago
Jakarta, Indonesia PT Telkomsel Ekosistem Digital Full timeFita is a health-tech platform that brings together a community of fitness enthusiasts and expert coaches. Our mission is to empower Indonesians of all fitness levels to achieve their goals, maintain a healthy lifestyle, and build lasting habits through personalized virtual coaching sessions. **What you will do but not limited to**: - Manage infrastructure...
-
Site Reliability Engineer
2 months ago
Jakarta, Indonesia Flip Full time**About Flip** Rafi, Luqman, and Anjar, who were college friends in Universitas Indonesia, started Flip as a project in 2015 to transfer payments to each other at a fraction of what banks would charge them. They are pioneers in the Indonesian market, with their technology now helping millions of Indonesians, both individuals and businesses, carry out...
-
Senior Site Reliability Engineer
2 months ago
Jakarta, Indonesia Shopee Full timeDepartment Engineering and Technology- LevelExperienced (Individual Contributor)- LocationIndonesia - JakartaThe Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our...
-
Senior Site Reliability Engineer
5 months ago
Jakarta, Indonesia Flip Full time**About Flip** Rafi, Luqman, and Anjar, who were college friends in Universitas Indonesia, started Flip as a project in 2015 to transfer payments to each other at a fraction of what banks would charge them. They are pioneers in the Indonesian market, with their technology now helping millions of Indonesians, both individuals and businesses, carry out...