High Performance Computing Grid Reliability Engineer, Associate

Morgan Stanley
Singapore, Singapore
15 Nov 2022
15 Dec 2022
Job Function
Industry Sector
Finance - General
Employment Type
Full Time
Morgan Stanley

Morgan Stanley is a leading global financial services firm providing a wide range of investment banking, securities, investment management and wealth management services. The Firm's employees serve clients worldwide, including corporations, governments and individuals, from more than 747 offices in 42 countries.

At Morgan Stanley, Technology works as a strategic partner with Morgan Stanley business units and the world's leading technology companies to redefine how we do business in ever more global, complex, and dynamic financial markets. Morgan Stanley's sizeable investment in technology results in quantitative trading systems, cutting-edge modelling and simulation software, comprehensive risk and security systems, and robust client-relationship capabilities, plus the worldwide infrastructure that forms the backbone of these systems and tools. Our insights, applications and infrastructure give a competitive edge to clients' businesses and to our own.

Technology & Reliability and Production Engineering

The mission of Technology is to provide a highly reliable and commercial technology platform, which supports the Firm's strategy, delivered by an innovative, world-class team of professionals. Within Technology, Reliability and Production Engineering (RPE) provides global services for Institutional Securities and Support Services applications. Consolidated support functions include Plant Management/Engineering, Capacity Management, and Grid Management.

Plant Management

RPE includes a horizontal Plant Management (PLM), Tools and Engineering practice area that complements its direct production activities. Plant Management is organized as an agile fleet, with squads covering areas such as operational plant management, grid computing, platform engineering, capacity management and production tooling functions. Plant Management operates globally from New York, London, Montreal, Toronto, Bengaluru, Singapore, Shanghai and Tokyo.

Grid Computing

Within Plant Management, the Grid Computing Squad is responsible for designing, building and operating Morgan Stanley's grid compute systems for pricing and risk calculations. We provide reliable, efficient and innovative grid computing platforms, and partner with our internal customers to use them effectively to meet their business goals.

Members of our squad are highly motivated problem-solvers who can multitask and work under time pressure. We are team players but can be self-sufficient when required. We take ownership, hold ourselves and others accountable, and are receptive to constructive feedback.

Key expectations from this role
  • Share our squad's responsibility to design, build and operate Morgan Stanley's grid compute systems through a combination of support and project tasks
  • Use Reliability Engineering principles to improve grid compute platform stability and efficiency
  • Manage user escalations with technical & operational acumen
  • Apply understanding of ITIL Production Management principles (Incident Management, Problem Management, etc.)
  • Participate fully as a member of our globally distributed squad by engaging in all Agile ceremonies (backlog grooming, planning, daily stand-ups, retrospectives)