Site Reliability Engineer for Fita

3 weeks ago


Jakarta, Indonesia PT Telkomsel Ekosistem Digital Full time

Fita is a health-tech platform that brings together a community of fitness enthusiasts and expert coaches. Our mission is to empower Indonesians of all fitness levels to achieve their goals, maintain a healthy lifestyle, and build lasting habits through personalized virtual coaching sessions.

**What you will do but not limited to**:

- Manage infrastructure on GCP
- Participate in the entire software development process including design, development, delivery, monitoring, and improvement
- Provide technical assistance to improve system performance, capacity, reliability, scalability, and security
- Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
- Closely collaborating with other engineering and product teams to ensure that expected system behavior is understood and monitoring exists to detect anomalies
- Participate in continuous improvement and execution of quality and timely major incident root cause analysis and blameless post-mortem activities to ensure we take action to avoid similar problems in the future

**Expected KPI**
- Maintain uptime for critical infrastructure components
- Make sure all infrastructure components monitored (Redis, PgSQL, etc) - by node count
- Make sure 80% of alerts have Time To Acknowledge below 30m in office hours
- Make sure 80% of infra provisioning / configuration requests handled below 3h in average in office hours

**Requirements**:

- Medior to Senior Level with working experience 3-7 years at tech company or startup preferably
- Ready to work in October 2024
- Strong background in Linux/Unix systems and scripting languages (e.g., Python, Bash).
- Experience with cloud platforms (AWS, GCP, Azure) and containerization (Docker, Kubernetes).
- Hands-on experience with monitoring tools (Prometheus, Grafana, Splunk, etc.).
- Understanding of CI/CD pipelines and DevOps practices.
- Ability to communicate effectively and work collaboratively in a team-oriented environment.
- Strong problem-solving skills and a proactive attitude toward operational challenges.



  • Jakarta, Indonesia Abhidi Solution Private Limited Full time

    **Responsibilities**: - Administer production related jobs - Address production issue - Improve system reliability through configuration or code changes - System monitoring and improve system observability - Remove toil and automate whenever possible - Problem solving, including troubleshoot a production issue **Skills**: - Experience with cloud...


  • Jakarta, Indonesia PT Midas Daya Teknologi Full time

    Job Description: Works with project engineering to ensure the reliability and maintainability of new and modified software. - The reliability engineer is responsible for adhering to the life cycle software management process throughout the entire life cycle. - Also responsible for end-to-end site reliability including service offerings, in particular...


  • Jakarta, Indonesia AccelByte Full time

    At AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...


  • Jakarta, Indonesia Pro Sigmaka Full time

    We established at 2012. With experience in several industry sectors, a broad portfolio and technology platform as well as bringing a dedicated and highly qualified team, enabling the talent we provide to provide fast and responsive services, making it the best choice for companies that want to increase the usability of their businesses. OUR SERVICES -...


  • Jakarta, Indonesia Ajaib Full time

    Company Description **Job Description**: - Perform day-to-day operations to support developers and DevOps. - Create end-to-end monitoring, logging, and alerting system. - Provide technical assistance to improve system performance, capacity, reliability and scalability - Perform root cause analysis of reliability issues. - Document every action so your...


  • Jakarta, Indonesia PT. Amalura Multi Dimensi Full time

    Manage and optimize cloud infrastructure (AWS, GCP, Azure). - Administer Linux system, ensuring stability and security. - Implement observability (e. g, OpenTelemtry, HoneyComb, Sentry) to monitor performance. - Optimize content delivery networks (e. g., Akamai) to enhance user experience. - Design monitoring, alerting, and incident response procedure for...


  • Jakarta, Indonesia Beyondsoft (Malaysia) Sdn. Bhd Full time

    COMPANY DESCRIPTION Beyondsoft (listed by the Shenzhen Stock Exchange, stock code 002649) is a global provider of IT consulting, product and solution services. Relying on strong R&D and innovation capabilities, the company widely adopts emerging technologies based on big data and mobile internet, including big data management platform, enterprise risk...


  • Jakarta, Indonesia PT Tiga Daya Digital Indonesia (Eksad Technology) Full time

    Tiga Daya Digital Indonesia, a susidiary company of Triputra Group and DCI Group To be IT partner to enable client growth rapidly. Eksad Providing Services High Quality Based on Strong Experience in the industry and technology. Building the right IT Service Solution to enable it Partners in speeding up business development based on digital technology by...


  • Jakarta, Indonesia Shipper Full time

    **What is Shipper** Shipper is a growing technology company based in Jakarta. We provide well-rounded logistics solutions for businesses of all sizes. Today, we offer several services including First-Mile Pickup and Delivery, Fulfillment/Warehouse Management, and Cross-Border shipping services. We are financially supported by eminent investors, including...


  • Jakarta, Indonesia Shipper Full time

    **What is Shipper** Shipper is a growing technology company based in Jakarta. We provide well-rounded logistics solutions for businesses of all sizes. Today, we offer several services including First-Mile Pickup and Delivery, Fulfillment/Warehouse Management, and Cross-Border shipping services. We are financially supported by eminent investors, including...


  • Jakarta, Indonesia Global Tiket Network Full time

    We think you also hate when travel app is giving you a headache, right? A slight misinformation can ruin the trip. - That is exactly what we are tackling as t-fam! Making sure that our 17+ million users have the best experience in crafting their own adventure. LI-Hybrid Catch the sunrise on the top of Padar Island and see fascinating views of the boundless...


  • Jakarta, Indonesia Amartha Full time

    Amartha is embarking on an exciting new journey and is in need of experienced engineers to work with senior management, existing engineers, and product in shaping the next wave of innovative product offerings, ensuring Amartha leapfrogs into the next phase of its journey! Job Description: As a Site Reliability Engineer (SRE) you will combines software and...


  • Jakarta, Indonesia Digital Muda Solutions Full time

    Deskripsi: - Menjaga ketersediaan, kehandalan, dan performa sistem dengan fokus pada infrastruktur teknis, keamanan, dan skala pengguna. - Berkolaborasi dengan tim pengembangan dan operasi untuk merancang, menguji,dan menerapkan praktik terbaik dalam infrastruktur teknologi, serta melakukan perbaikan dan peningkatan sesuai kebutuhan. - Memastikan integrasi...


  • Jakarta, Indonesia Catalyst Tech Full time

    At Catalyst, People are the heartbeat for our company. We believe that good quality people will have a positive impact to our business. We are looking for a **Site Reliability Engineer / DevOps** to join our growing team. If you are passionate about being part of the team, building some of the most critical products, Working alongside teams in the industry...


  • Jakarta, Indonesia PT Salva Teknologi Digital Full time

    Site Reliability Engineer (Junior) - Applicants should have sufficient qualification and relevant experiences in the respective fields "Waspada terhadap Modus Penipuan pada saat proses interview. Perusahaan tidak akan memungut biaya apapun dalam melakukan proses interview. Mohon segera melaporkan ke kami, jika pada saat Anda diundang untuk interview dan...


  • Jakarta, Indonesia Paymentology Full time

    Paymentology is the first truly global issuer-processor, giving banks and fintechs the technology, team and experience to rapidly issue and process Mastercard, Visa and UnionPay cards across more than 50 countries, at scale. Our advanced, multi-cloud platform, offering both shared and dedicated processing instances, vast global presence and richer,...


  • Jakarta, Indonesia Hukumonline.com Full time

    Manage and optimize cloud infrastructure on AWS, GCP, and Azure. - Administer and maintain Linux-based systems, ensuring their stability and security. - Implement and maintain observability solutions, including OpenTelemetry, HoneyComb, and Sentry, to monitor system performance and diagnose issues. - Configure and optimize content delivery networks, with a...


  • Jakarta, Indonesia AccelByte Full time

    At AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...


  • Jakarta, Indonesia AccelByte Full time

    At AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...


  • Jakarta, Indonesia AccelByte Full time

    At AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox...