Site Reliability Engineer, Engineering Site Reliability Engineer, Engineering …

Goldman Sachs
in San Francisco, CA, United States
Permanent, Full time
Be the first to apply
Competitive
Goldman Sachs
in San Francisco, CA, United States
Permanent, Full time
Be the first to apply
Competitive
Site Reliability Engineer, Engineering

WHAT WE DO


At Goldman Sachs, our Engineers don’t just make things – we make things possible. Change the world by connecting people and capital with ideas. Solve the most challenging and pressing engineering problems for our clients. Join our engineering teams that build massively scalable software and systems, architect low latency infrastructure solutions, proactively guard against cyber threats, and leverage machine learning alongside financial engineering to continuously turn data into action. Create new businesses, transform finance, and explore a world of opportunity at the speed of markets.

Engineering, which is comprised of our Technology Division and global strategists groups, is at the critical center of our business, and our dynamic environment requires innovative strategic thinking and immediate, real solutions. Want to push the limit of digital possibilities? Start here.

WHO WE LOOK FOR


Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. At Goldman Sachs, SRE is responsible for the availability and reliability of our firm's most critical platform services, and ensures they meet the requirements of our internal and external users. We look for engineers who are motivated to collaborate with our businesses to build and run sustainable production systems, which can evolve and adapt to changes in our fast-paced, global business environment.

RESPONSIBILITIES AND QUALIFICATIONS

HOW YOU WILL FULFILL YOUR POTENTIAL

  • Balance feature development velocity and reliability with well-defined SLOs
  • Run the Production environment by monitoring availability and taking a holistic view of system health
  • Drive incident management process and support a blameless post-mortems culture
  • Partner with development teams to improve services via rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation and uplifts

SKILLS AND EXPERIENCE WE ARE LOOKING FOR

BASIC QUALIFICATIONS

  • BS degree in Computer Science or related technical field involving coding and / or systems engineering
  • Proficiency in one or more of the following: Go, Python, C, C++, Java, Perl, Ruby or shell scripting
  • Experience with algorithms, data structures and software design
  • Experience with UNIX operating systems internals and / or networking

PREFERRED QUALIFICATIONS

  • Experience with distributed systems design, maintenance, and troubleshooting
  • Hands-on experience with debugging and optimizing code, as well as automation
  • Strong interpersonal skills, drive, and ownership
  • Coding beyond simple scripts
  • Solving novel problems from first principles
Close
Loading...