Site Reliability Engineer
3 days ago
About the Role
The Site Reliability Engineering (SRE) team architects, builds, and maintains the rock-solid infrastructure that applications rely on. We work closely with development teams to ensure scalability, reliability, and efficiency. This collaboration empowers us to deliver exceptional customer experiences while enabling developers to focus on building great features.
What You Will Do- Deploy, automate, maintain, and manage various cloud-based and on-premises production systems.
- Understanding the high-level overview of our architecture, and possessing the ability to systematically document new and existing requirements to ensure a smooth project delivery without miscommunication.
- Work closely with the Information security and infrastructure team in ensuring that we are adopting security best practices.
- Ensuring the availability, performance, scalability, and security of productions systems.
- Troubleshoot and resolve system issues across platform and application domains.
- Suggest architectural improvements and recommend process optimizations.
- Evaluate new technologies to enhance the infrastructure stack.
- Ensuring system security policies are properly remediated.
- Drive and implement automated provisioning and scaling of servers, along with testing and compliance checks using automation tools.
- Handle operational tasks, including on-call duties, alerts, and incident management.
- At least 2 years of engineering experience.
- Bachelor's or Master's degree in a relevant field (e.g., IT, Computer Science) or a proven track record in DevOps.
- A strong willingness to continuously upgrade skills and stay up-to-date with the latest DevOps trends.
- Experience with cloud-native tools (e.g., Kubernetes, Docker, Nginx, OpenTelemetry) is a plus.
- Experience managing cloud servers (AWS, GCP).
- A desire to transition into engineering management is a valued addition.
- Experience with on-premises physical servers, databases, and storage solutions (MySQL, PostgreSQL, Redis) is a plus, as well as familiarity with Infrastructure as Code (IaC) tools (Terraform, Pulumi).
-
Site Reliability Engineer
1 week ago
Jakarta, Jakarta, Indonesia AVOWS TECHNOLOGIES PRIVATE LIMITED Full timeAbout the RoleWe are looking for an experienced Site Reliability Engineerto design, implement, and manage our cloud-based infrastructure onGoogle Cloud Platform (GCP)from the ground up. The ideal candidate will ensure our systems are highly available, reliable, scalable, and efficient while collaborating closely with software engineers to deliver robust...
-
Site Reliability Engineer
3 days ago
Jakarta, Jakarta, Indonesia Fazz Full timeAbout the RoleThe Site Reliability Engineering (SRE) team architects, builds, and maintains the rock-solid infrastructure that applications rely on. We work closely with development teams to ensure scalability, reliability, and efficiency. This collaboration empowers us to deliver exceptional customer experiences while enabling developers to focus on...
-
IT Site Reliability Engineer
7 days ago
Jakarta, Jakarta, Indonesia PT Bumi Amartha Teknologi Mandiri Full timeMonitoring:SREs monitor software systems to ensure their reliability, performance, and availability for 24x7 by shiftingMonitoring incident from Jira tickets from L2 Team and follow up to related teamsTrace the problems with related logs and documentsMonitor the log files to manage infrastructureSupport and respond issued by using email, chat, ticket...
-
IT Site Reliability Engineer
7 days ago
Jakarta, Jakarta, Indonesia Bumi Amartha Teknologi Mandiri Full timeCompany DescriptionPT. Bumi Amartha Teknologi Mandiri, widely known as Amartek, is a dynamic system integrator founded in 2018, committed to delivering high-value IT solutions globally. As a full-stack technology partner, Amartek specializes in domains such as data & analytics, integration & automation, outcome-based services, and talent augmentation. With...
-
System Reliability Engineer
1 day ago
Jakarta, Jakarta, Indonesia PT. Akhdani Reka Solusi Full timeDesign, implement, and maintain reliable IT infrastructure, Big Data, and cloud platforms.Monitor systems and applications, ensuring observability, performance, and capacity planning.Automate workflows and processes to improve operational efficiency.Manage incidents, preventive and corrective maintenance, patching, backups, and troubleshooting of...
-
System Reliability Engineer
1 day ago
Jakarta, Jakarta, Indonesia PT Akhdani Reka Solusi Full timeDesign, implement, and maintain reliable IT infrastructure, Big Data, and cloud platforms.Monitor systems and applications, ensuring observability, performance, and capacity planning.Automate workflows and processes to improve operational efficiency.Manage incidents, preventive and corrective maintenance, patching, backups, and troubleshooting of...
-
Site Reliability Engineer
1 day ago
Jakarta, Jakarta, Indonesia SawitPRO Full timeCompany DescriptionSawitPRO is a cutting-edge green digital agritech startup located in the heart of Jakarta. Our mission is to be the best innovator that improves the lives of everyone in the palm oil industry. Through our integrated end-to-end agri-platform, we create a win-win-win scenario for people, planet, and prosperity.We provide integrated apps that...
-
Senior Site Reliability Engineer
7 days ago
Jakarta, Jakarta, Indonesia PT. Alto Network Full timeCOMPANY DESCRIPTION ALTO Network is a leading payment infrastructure provider as well as the pioneer in payment solution by always bringing the most innovative and impactful technology to connect merchants or financial institutions with their customers to grow their businesses nationwide and beyond.DESIGNATION : Senior Site Reliability...
-
Site Reliability Engineer
7 days ago
Jakarta, Jakarta, Indonesia PT Tiga Daya Digital Indonesia (Eksad Technology) Full timeMonitoring: SREs monitor software systems to ensure their reliability, performance, and availability for 24x7 by shiftingMonitoring incident from Jira tickets from L2 Team and follow up to related teamsTrace the problems with related logs and documentsMonitor the log files to manage infrastructureSupport and respond issued by using email, chat, ticket...
-
DevOps / Site Reliability Engineer (SRE)
1 day ago
Jakarta, Jakarta, Indonesia PT Selaras Kinerja Muda Full timeJob SummaryWe are looking for a DevOps/SRE professional to improve system reliability, scalability, and deployment efficiency.Key responsibilitiesBuild and manage CI/CD pipelinesAutomate infrastructure and deployment processesMonitor system performance and reliabilityImplement incident response and recovery proceduresCollaborate closely with development...