5 days old

Leader - Site Reliability Engineer - Data Center

Cisco Systems Inc.
San Jose, CA 95113

What you'll do

This is an exciting opportunity to join a fast-paced team focused on innovation and customer success, to design, build, deliver on the site reliability functions for multiple service offerings. In this high impact role, you will have the freedom to embrace and extend cutting edge OSS tools and technologies working with a very talented group of engineers.


Who you'll work with

Data Center Networking group at Cisco is working on building next generation Cloud Networking products and services at scale towards our as-a-service transformation strategy. Our industry leading award-winning products and services are used by thousands of companies across the globe in a variety of on-prem, hybrid and multi-cloud environments.


Who you are

You are someone who thrives in a dynamic start-up like environment. You have 10+ years of related experience and are looking for an opportunity to shine, a team where you can thrive, one that recognizes your capabilities and rewards you accordingly.


Qualifications

Bachelor's in Computer Science or equivalent with 10 years experience in designing, analyzing, and troubleshooting large-scale distributed systems

5+ years of SRE experience in managing and operating large scale projects in public clouds (AWS/Azure/GCP)

Experience in managing many Kubernetes clusters in large scale production environments

Programming experience in any of the languages: Java/Python/Go

In-depth hands-on knowledge in Linux kernel, networking, storage, security and Kubernetes based deployments

Systematic problem-solving approach with strong communication skills and a sense of ownership and drive

Experience with deployment tools like Spinnaker, Terraform, Ansible

Experience with monitoring tools such as Prometheus, Grafana, ELK Stack, Datadog. Cloudwatch, Stackdriver

Experience with running and optimizing databases such as MongoDB, CockroachDB, InfluxDB

Experience in troubleshooting and root cause of complex issues across micro-service architecture

Experience in supporting on-call rotation and resolution of issues in 24x7 multi/hybrid cloud environment


Preferred skills

Design and implement SaaS delivery frameworks based on Kubernetes

Experience in automated deployments using GitOps methods based on ArgoCD or similar tools

Experience in developing SRE metrics and defining SLO working with marketing/business leads

Lead automation efforts to streamline global deployment effort

Experience with observability and monitoring tools: Thanos, Cortex, Open Telemetry, Hubble, Falcon

Experience with distributed storage support in Kubernetes using Rook, Ceph, GlusterFS, Minio

Experience in using Gitlab for deployment of multiple clusters and multi-cloud environments


Why Cisco

At Cisco, each person brings their rare talents to work as a team and make a difference.
Yes, our technology changes the way the world works, lives, plays and learns, but our edge comes from our people.
  We connect everything people, process, data and things and we use those connections to change our world for the better
   We innovate everywhere - From launching a new era of networking that adapts, learns and protects, to building Cisco Services that accelerate businesses and business results Our technology powers entertainment, retail, healthcare, education and more from Smart Cities to your everyday devices.
  We benefit everyone - We do all of this while striving for a culture that empowers every person to be the difference, at work and in our communities.
Colorful hair? Dont care. Tattoos? Show off your ink. Like polka dots? Thats cool. Pop culture geek? Many of us are. Be you, with us! #WeAreCisco


*LI-IS1

Categories

Posted: 2021-02-19 Expires: 2021-03-21

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Leader - Site Reliability Engineer - Data Center

Cisco Systems Inc.
San Jose, CA 95113

Join us to start saving your Favorite Jobs!

Sign In Create Account
Powered ByCareerCast