Site Reliability Engineer

Location: Remote
Type: Full-time or contract

What is the opportunity?
Job description:

  • Responsible for taking code and functionality and making them work in public and private cloud environments
  • Responsible for responding to support escalations which involve troubleshooting complex technical problems and resolving data/configuration issues within defined service level objectives
  • Take part in both internal and customer projects and operations
  • Responsible for developing software, tools, and scripts to automate deployment, management, and monitoring of production systems in all environments
  • Provide strategic and thought leadership among peers on complex projects
  • Collaboration with cloud engineers in understanding new cloud technologies, assessing impact to security services operations, and proposing solutions to existing business problems
  • Collaboration in the software development lifecycle to develop detailed enhancement/bug definitions, write functional requirements, translate the requirements into solution designs, and navigate the functional requirements through to Production deployments
  • Proactively look for ways to create efficiencies within operations as they pertain to the tools and technology used by Optiva to support its customer base
  • Manage, participate in, or directly work on any additional projects, assignments, or initiatives assigned by management
  • Create/maintain documentation for operational procedures
  • Document and perform system upgrades and application updates, and define monitoring requirements based on customer needs
  • Participate in an on-call rotation in a distributed worldwide team


What do you need to succeed?

  • 5+ years of related experience with cloud environments
  • 3+ years of DevOps experience
  • Bachelor’s or Master’s degree in a technical field such as Computer Science or Information Technology Engineering, or equivalent work experience
  • Strong experience with the agile software development methodology and collaboration with internal teams to deliver software and configuration artifacts
  • Understanding of SRE principles
  • Strong background in Bash scripting in addition to one year of experience in Python
  • GitHub/Jenkins/Helm/Ansible/Terraform experience
  • Experience with Docker or similar container solution
  • Experience with orchestration tooling such as Kubernetes and Docker Swarm
  • Experience working with GCP/Azure/AWS APIs
  • 3+ years deploying public cloud infrastructures preferred
  • 3+ years of operational experience with industry-leading “big data” services technologies
  • Experience deploying distributed, service-oriented applications
  • Experience with Java build tools including Gradle
  • 3+ years of Telco experience (5G, IoT, AI, Big Data, ML)
  • Have a software-centric mindset
  • Be comfortable with coding
  • Relish change and frequent releases
  • Don’t fear complexity and scale
  • Understand the full software stack
  • View problems as an opportunity to improve
  • Embrace automation over manual effort
  • Translate the technical into business language
  • See challenges from a business perspective
  • Be prepared to move on with tasks


Apply via email to:
