1 day old

Sr Site Reliability Engineer

Cambridge, MA 02139
  • Job Code

The IBM Cognitive Applications team is seeking innovative and enthusiastic software engineering professionals to join our team building the next generation multi-cloud marketplace platform. If you love working with cutting edge technologies, dynamic and high performing teams, then this is the right next step in advancing your career. The team will be challenged to design, develop, and operate cloud applications and services leveraging a wide variety of open source, machine learning, and commercial technologies aimed at dramatically expanding the reach of the IBM Cloud to millions of enterprise developers. Our focus on delivering performance and predictability for our customers' most demanding workloads, at global scale and with leadership, efficiency, resiliency and security is the cornerstone of our development model and will result in maintaining IBMs leadership position for years to come.

Your Role and Responsibilities
As a Sr Site Reliability Engineer, you will be a member of a team responsible for the overall health and availability of the production platform. You will provide consultancy to the product teams on effective DevOps processes, use of tooling, and ongoing support activities. On a day to day basis your focus will be on bridging the gap between our end users and partners, and the development and delivery teams regarding the production availability of the platform. This role will require strong problem solving and time management skills to handle these complex situations effectively.

In this role your responsibilities will include:
  • Providing squads with the tools and processes needed for each squad to be able to do their own operations.
  • Ensuring that components and solutions excel at performance, reliability and web scalability.
  • Monitor availability, track outages, and provide Root Cause Analysis & postmortem solutions.
  • Maintain compliance in all required areas.
  • Ensure consistent monitoring, logging, and alerting with tools such as New Relic, Logentries, and PagerDuty.
  • Ensure consistent support across all components.
  • Ensure that all squads are able to effectively leverage our CI/CD pipeline.
  • Provide defect tracking and broken link tracking.
  • Complete coding, testing, defect fixes, and production support, using agile CI/CD methodologies, for base tooling within the solution.
  • Analyzing current tools and processes with the goal of enacting improvements geared towards making components and processes more efficient for the entire team.
  • Planning iterations and representing accomplishments at team scrums.
  • Presenting individual and team status during weekly playbacks or on management calls.
  • Managing risks and resolving issues that affect scope, schedule, and quality.
  • Coaching, enabling and collaborating with other team members and development squads.
  • Maintain Platform & Trial environments including tuning, upgrades and end to end testing.
  • Install, manage OpenShift clusters.

Required Technical and Professional Expertise
From a technical and professional expertise perspective, you will succeed in this job if you have 3+ years of experience in the following:
  • Software Development Applied Knowledge.
  • Experience with operational practices of production support.
  • Experience with user analytics and monitoring, back-end systems communication.
  • Experience with microservices architecture, containers, Kubernetes, automation, systems engineering, load balancing, High Availability/Disaster Recovery.
  • Source control management experience with Git.
  • Experience with DevOps Technology like Jenkins/Travis CI
  • Experience with cloud-native application development with JavaScript, HTML, CSS, SQL (MySQL, PostgreSQL, or DB2, etc) or NoSQL Databases (MongoDB, Cloudant, or Cassandra, etc.), Akamai/CDM, Ansible, Chef, Terraform.

Preferred Technical and Professional Expertise
The following technical and professional expertise is preferred and will greatly enhance your success on the team:
  • Bachelors Degree (Computer Science or related), or equivalent.
  • Experience with Kubernetes, Docker, OpenShift and container development paradigms
  • Knowledge of the IBM Cloud offering and marketplace ecosystem
  • Infrastructure Operations, Network Production Support experience
  • Knowledge of security architecture, vulnerability management, penetration testing, web application firewalls, secure routing.

About Business Unit
At IBM Cognitive Applications, we build open applications that unlock the power of data for clients, partners, and developers. Running on top of IBM's unique Hybrid, Multi-cloud and AI infrastructures, these applications work across horizontal domains and bring our technology to life for end users. Cognitive Applications unit includes: Watson Customer Engagement, Watson IoT, Watson Media and Weather, Talent & Collaboration, Digital Growth & Commerce, and IBM Developer teams.

Your Life @ IBM
What matters to you when youre looking for your next career challenge?

Maybe you want to get involved in work that really changes the world? What about somewhere with incredible and diverse career and development opportunities where you can truly discover your passion? Are you looking for a culture of openness, collaboration and trust where everyone has a voice? What about all of these? If so, then IBM could be your next career challenge. Join us, not to do something better, but to attempt things you never thought possible.

Impact. Inclusion. Infinite Experiences. Do your best work ever.

About IBM
IBMs greatest invention is the IBMer. We believe that progress is made through progressive thinking, progressive leadership, progressive policy and progressive action. IBMers believe that the application of intelligence, reason and science can improve business, society and the human condition. Restlessly reinventing since 1911, we are the largest technology and consulting employer in the world, with more than 380,000 IBMers serving clients in 170 countries.

Location Statement
For additional information about location requirements, please discuss with the recruiter following submission of your application.

Being You @ IBM
IBM is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.



  • Computers Software and Hardware
Posted: 2019-12-13 Expires: 2020-01-12

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Sr Site Reliability Engineer

Cambridge, MA 02139

Join us to start saving your Favorite Jobs!

Sign In Create Account
Powered ByCareerCast