Senior Site Reliability Engineer
4 weeks ago
At AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox Live, PlayStation Network, and EA Origin. We are backed by top investors including Softbank, Sony Interactive Entertainment, Galaxy Interactive, NetEase, and Krafton. Our latest Series B funding has firmly solidified our place as a top player in the gaming industry. AccelByte's talent has decades of experience building and shipping some of the largest game and distribution platforms in the world.
We believe that the best companies empower employees to make decisions, obsess about the best user experience, and are not afraid to make and learn from their mistakes. Our culture is based on humility, openness to feedback, drive, and collaboration, which we feel results in the best performing teams. As a company that values diversity, inclusion, and employee growth, our employees have opportunities to work with and learn from teams all over the world. We offer competitive salaries, a full range of health benefits, social activities, career growth opportunities, and an amazing team. Come join us
**Position Summary**:
As a **Senior Site Reliability Engineer**, you design, implement, and maintain infrastructure and operational systems that accomplish a given goal. You discover requirements and guide other engineers collaborating in an area and do exemplary work on complicated problems. You optimize performance, drive efficiency, and ensure the reliability of critical infrastructure.
**Essential Functions/ Responsibilities**:
The **Senior Site Reliability Engineer **is accountable for the following functions and responsibilities:
- Review, provide feedback, and mentor coworkers on changes to maintain reliability.
- Design and develop infrastructure and operational tasks with scalability and stability in mind.
- Contributing in automating solutions to optimize tasks, improve efficiency, and reduce manual effort.
- Design, implement, and maintain scalable infrastructure and deployment frameworks using K8s and CNCF projects.
- Direct a secure, cost-effective, and scalable cloud platform.
- Initiate and conduct thorough investigations of operational incidents and proactively prevent future issues and designing resilient approaches based on insights from operational incidents for long-term mitigations.
- Collaborate with stakeholders to deliver cost-effective, excellent infrastructure solutions and identify areas for improvement.
- Communicate directly with clients, understanding their needs and providing exceptional support.
- The ability to train and mentor less experienced engineers and set the direction for other engineers.
- Model standards for engineering excellence
- Discover requirements by working with PMs and stakeholders
- Perform other duties as assigned
**Qualifications/Experience Required**:
- Specializes in operations and reliability automation.
- 5+ years of professional infrastructure and operational engineering experience with Linux administration.
- Proven track record of infrastructure as code, configuration management, and package management.
- Collaborative completion of infrastructure or operational projects.
- Familiar with Nomad.
- Eagerness to learn new languages, technologies, and containerization principles (e.g., Docker, Kubernetes).
- Practical knowledge of networking, storage, and container technologies.
- Robust knowledge and experience in cloud computing (preferred AWS/GCP).
- Proven experience with automation, CI/CD, and GitOps tools.
- Experience with monitoring and alerting tools (e.g., Prometheus, Grafana, ELK/EFK, Splunk, Datadog, OpsGenie, PagerDuty).
- Software development and scripting experience with Bash, Python, and/or Golang.
- Proficiency in written and verbal English language for remote work.
- Flexibility to adjust work routines/schedules to meet company and customer needs.
- Previous professional infrastructure or operational experience preferred.
- Experience at a AAA game studio or software product company preferred.
- Experience working with cloud platforms or web products preferred.
- Experience in a multinational technology startup is a big plus.
-
Senior Site Reliability Engineer
1 week ago
Jakarta, Indonesia DKatalis Full time**Site Reliability Engineer**: **About DKatalis** DKatalis is a financial technology company with multiple offices in the APAC region. In our quest to build a better financial world, one of our key goals is to create an ecosystem linked financial services business. DKatalis is built and backed by experienced and successful entrepreneurs, bankers, and...
-
Senior Site Reliability Engineer
4 weeks ago
Jakarta, Indonesia AccelByte Full timeAt AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...
-
Site Reliability Engineer
3 weeks ago
Jakarta, Indonesia Abhidi Solution Private Limited Full time**Responsibilities**: - Administer production related jobs - Address production issue - Improve system reliability through configuration or code changes - System monitoring and improve system observability - Remove toil and automate whenever possible - Problem solving, including troubleshoot a production issue **Skills**: - Experience with cloud...
-
Site Reliability Engineer
2 weeks ago
Jakarta, Indonesia PT Midas Daya Teknologi Full timeJob Description: Works with project engineering to ensure the reliability and maintainability of new and modified software. - The reliability engineer is responsible for adhering to the life cycle software management process throughout the entire life cycle. - Also responsible for end-to-end site reliability including service offerings, in particular...
-
Site Reliability Engineer
4 weeks ago
Jakarta, Indonesia AccelByte Full timeAt AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...
-
Site Reliability Engineer
4 weeks ago
Jakarta, Indonesia Pro Sigmaka Full timeWe established at 2012. With experience in several industry sectors, a broad portfolio and technology platform as well as bringing a dedicated and highly qualified team, enabling the talent we provide to provide fast and responsive services, making it the best choice for companies that want to increase the usability of their businesses. OUR SERVICES -...
-
Site Reliability Engineer
3 weeks ago
Jakarta, Indonesia Ajaib Full timeCompany Description **Job Description**: - Perform day-to-day operations to support developers and DevOps. - Create end-to-end monitoring, logging, and alerting system. - Provide technical assistance to improve system performance, capacity, reliability and scalability - Perform root cause analysis of reliability issues. - Document every action so your...
-
Site Reliability Engineer
2 weeks ago
Jakarta, Indonesia Beyondsoft (Malaysia) Sdn. Bhd Full timeCOMPANY DESCRIPTION Beyondsoft (listed by the Shenzhen Stock Exchange, stock code 002649) is a global provider of IT consulting, product and solution services. Relying on strong R&D and innovation capabilities, the company widely adopts emerging technologies based on big data and mobile internet, including big data management platform, enterprise risk...
-
Site Reliability Engineer
4 weeks ago
Jakarta, Indonesia PT Tiga Daya Digital Indonesia (Eksad Technology) Full timeTiga Daya Digital Indonesia, a susidiary company of Triputra Group and DCI Group To be IT partner to enable client growth rapidly. Eksad Providing Services High Quality Based on Strong Experience in the industry and technology. Building the right IT Service Solution to enable it Partners in speeding up business development based on digital technology by...
-
Sr. Site Reliability Engineer
6 days ago
Jakarta, Indonesia Shipper Full time**What is Shipper** Shipper is a growing technology company based in Jakarta. We provide well-rounded logistics solutions for businesses of all sizes. Today, we offer several services including First-Mile Pickup and Delivery, Fulfillment/Warehouse Management, and Cross-Border shipping services. We are financially supported by eminent investors, including...
-
Site Reliability Engineer Manager
1 week ago
Jakarta, Indonesia Shipper Full time**What is Shipper** Shipper is a growing technology company based in Jakarta. We provide well-rounded logistics solutions for businesses of all sizes. Today, we offer several services including First-Mile Pickup and Delivery, Fulfillment/Warehouse Management, and Cross-Border shipping services. We are financially supported by eminent investors, including...
-
Site Reliability Engineer
1 week ago
Jakarta, Indonesia Global Tiket Network Full timeWe think you also hate when travel app is giving you a headache, right? A slight misinformation can ruin the trip. - That is exactly what we are tackling as t-fam! Making sure that our 17+ million users have the best experience in crafting their own adventure. LI-Hybrid Catch the sunrise on the top of Padar Island and see fascinating views of the boundless...
-
Site Reliability Engineer
3 weeks ago
Jakarta, Indonesia Digital Muda Solutions Full timeDeskripsi: - Menjaga ketersediaan, kehandalan, dan performa sistem dengan fokus pada infrastruktur teknis, keamanan, dan skala pengguna. - Berkolaborasi dengan tim pengembangan dan operasi untuk merancang, menguji,dan menerapkan praktik terbaik dalam infrastruktur teknologi, serta melakukan perbaikan dan peningkatan sesuai kebutuhan. - Memastikan integrasi...
-
Site Reliability Engineer
3 weeks ago
Jakarta, Indonesia Catalyst Tech Full timeAt Catalyst, People are the heartbeat for our company. We believe that good quality people will have a positive impact to our business. We are looking for a **Site Reliability Engineer / DevOps** to join our growing team. If you are passionate about being part of the team, building some of the most critical products, Working alongside teams in the industry...
-
Site Reliability Engineer
2 weeks ago
Jakarta, Indonesia Paymentology Full timePaymentology is the first truly global issuer-processor, giving banks and fintechs the technology, team and experience to rapidly issue and process Mastercard, Visa and UnionPay cards across more than 50 countries, at scale. Our advanced, multi-cloud platform, offering both shared and dedicated processing instances, vast global presence and richer,...
-
Site Reliability Engineer
6 days ago
Jakarta, Indonesia Hukumonline.com Full timeManage and optimize cloud infrastructure on AWS, GCP, and Azure. - Administer and maintain Linux-based systems, ensuring their stability and security. - Implement and maintain observability solutions, including OpenTelemetry, HoneyComb, and Sentry, to monitor system performance and diagnose issues. - Configure and optimize content delivery networks, with a...
-
Site Reliability Engineer
3 weeks ago
Jakarta, Indonesia AccelByte Full timeAt AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...
-
Site Reliability Engineer
3 weeks ago
Jakarta, Indonesia AccelByte Full timeAt AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...
-
Site Reliability Engineer
4 weeks ago
Jakarta, Indonesia AccelByte Full timeAt AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...
-
Site Reliability Engineer
4 weeks ago
Jakarta, Indonesia PT Astra Digital Mobil (mobbi) Full timeJob Description: - Maintain system availability, reliability and performance by focusing on technical infrastructure, security and user scale. - Collaborate with development and operations teams to design, test, and implement best practices in technology infrastructure, and make fixes and improvements as needed. - Conduct in-depth analysis of incidents and...