Your ImpactWe are seeking a highly skilled and experienced Dev Ops / Site Reliability Engineer to join our team, focusing on the development and support of Observability capabilities for workloads across Cisco IT Datacenter and Cloud environments.Responsibilities include:Reshaping how we manage alerts, metrics, and logs by introducing deep learning and Gen AI to enhance reliability services.Taking ownership and responsibility for reliability, scalability, automation, and other issues related to uptime and availability of our monitoring solutions.Minimum QualificationsBachelor's degree in computer science, computer engineering, related field, or 5+ years of relevant experience.Understanding of lifecycle IT processes including architecture, design, implementation, and operations.Understanding of security including OS hardening, firewalls, iptables, and working with Infosec.Understanding of network basics like routers and switches.Experience with software development tools like Git Hub and Jenkins.Python, Shell, Go, or similar programming experience.Software development lifecycle experience including design, development, testing, packaging, deployment, upgrade, and support.Open‐source development experience.Familiarity with Agile software development.Leadership in building and maintaining SRE technologies.Experience with public cloud like AWS, GCP, or Azure.QA and testing experience of your code and the entire platform.Preferred QualificationsExperience with tool suites such as Splunk Cloud, Splunk Observability Cloud, Elastic, Prometheus/Thanos, and Grafana.Experience with Thousand Eyes, Zabbix, App Dynamics, or similar.Experience with Java Script (Node.js or React).Experience with implementing AI/ML & LLM-based Agentic Observability use cases.Experience with infrastructure or application performance monitoring solutions and testing in a diverse and complex infrastructure.Experience with on‐premises cloud technologies using VMware or Open Stack.Experience with container technologies like Open Shift, Kubernetes, and Docker.Experience with building and maintaining Red Hat or Cent OS Linux.Experience with configuration automation using Ansible.Behavioral CompetenciesWorking with geographically distributed teams.Self‐motivated and willing to help where help is needed.Able to build relationships, be culturally sensitive, have goal alignment, and have learning agility.