Site Reliability Engineer- Athena Core - Associate

Employer: J.P.Morgan
Location: Singapore, Singapore
Salary: Competitive
Closing date: Sep 26, 2022

Job Function: Other
Industry Sector: Finance - General
Employment Type: Full Time
Education: Bachelors

As a Site Reliability Engineer (SRE), you'll help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimizing existing systems, building infrastructure and reducing work through automation. You'll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks. In this environment, you'll take the lead on relevant projects, supported by an organization that provides the support and mentorship you need to learn and grow. As an SRE, you'll be focused on running better production applications and systems.

You will be part of a dynamic team responsible for our Corporate & Investment Bank strategic platform that various CIB teams build their applications on. You'll put your experience to work across the board, in areas like change management, incident management, problem management, impact identification, management communication, risk and controls, client relationship management, service improvement, discovery/gathering/documenting of business needs, data and requirements.

Responsibilities:

Develop, test and debug automated tasks including apps, systems and infrastructure
Automate manual operational work by improving products or software
Troubleshoot priority incidents and facilitate blameless post evaluations
Work with development & infrastructure teams throughout the product life cycle ensuring the deployments meets business & controls requirements
Perform analytics on past data, such as incidents and usage patterns for predicting issues and take proactive steps to implement improvements
Build and drive adoption for greater self-healing and resiliency patterns
Lead and participate in performance tests, and identify bottlenecks and opportunities for optimization and capacity demands
Split time between operational work and engineering work
Work schedule involves weekday (4 days) and weekend (1 day) model

Qualifications:

Bachelor's Degree in Computer sciences, Information technology or related disciplines
Min 5 years of experience managing a large Unix platform in an enterprise environment
Proficient in Linux systems provisioning and configuration management tools such as Puppet, Ansible and Terraform. Advanced Linux knowledge is required
Proficient in at least one or more software languages such as Python, Java, Go with respect to designing, coding, testing and software delivery
Proficient in the development of automated tools, systems and services in multiple technology domains
Proficient knowledge of one or more infrastructure components such as networking, cloud services, orchestration tools, containerization, compute and storage systems
Proficient in service-level changes to a system and troubleshooting components
Design and implement and contribute to performance monitoring and capacity management tools

Send job

Sign in to create job alerts

Sign in or create an account to start creating job alerts and receive personalised job recommendations straight to your inbox.

Create alert