Site Reliability Engineer - Athena Core - Associate

Singapore, Singapore
11 Sep 2022
26 Sep 2022
Job Function
Industry Sector
Finance - General
Employment Type
Full Time
As a Site Reliability Engineer (SRE), you'll help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimizing existing systems, building infrastructure and reducing work through automation. You'll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks. In this environment, you'll take the lead on relevant projects, supported by an organization that provides the support and mentorship you need to learn and grow. As an SRE, you'll be focused on running better production applications and systems.

You will be part of a dynamic team responsible for our Corporate & Investment Bank strategic platform that various CIB teams build their applications on. You'll put your experience to work across the board, in areas like change management, incident management, problem management, impact identification, management communication, risk and controls, client relationship management, service improvement, discovery/gathering/documenting of business needs, data and requirements.

  • Develop, test and debug automated tasks including apps, systems and infrastructure
  • Automate manual operational work by improving products or software
  • Troubleshoot priority incidents and facilitate blameless post-incident reviews
  • Work with development and infrastructure teams throughout the product life cycle to ensure deployments meet business and controls requirements
  • Analyze historical data, such as incidents and usage patterns, to predict issues and take proactive steps to implement improvements
  • Build and drive adoption for greater self-healing and resiliency patterns
  • Lead and participate in performance tests to identify bottlenecks, optimization opportunities and capacity demands
  • Split time between operational work and engineering work
  • Work schedule follows a model of four weekdays and one weekend day
  • Bachelor's degree in Computer Science, Information Technology or a related discipline
  • Minimum 5 years of experience managing a large Unix platform in an enterprise environment
  • Proficient in Linux systems provisioning and configuration management tools such as Puppet, Ansible and Terraform. Advanced Linux knowledge is required
  • Proficient in one or more programming languages such as Python, Java or Go, covering design, coding, testing and software delivery
  • Proficient in the development of automated tools, systems and services in multiple technology domains
  • Proficient in one or more infrastructure components such as networking, cloud services, orchestration tools, containerization, compute and storage systems
  • Proficient in making service-level changes to a system and troubleshooting its components
  • Design, implement and contribute to performance monitoring and capacity management tools
