JOB OVERVIEW
What if you could use your technology skills to develop a product that impacts the way communities’ hospitals, homes, sports stadiums, and schools across the world are built? Construction impacts the lives of nearly everyone in the world, and yet it’s also one of the world’s least digitized industries, not to mention one of the most dangerous. That’s why we’re looking for an experienced Staff Site Reliability Engineer to join our client’s journey to revolutionize a historically underserved industry.
As a Staff Site Reliability Engineer, you’re given the unique opportunity to drive the next generation of our application platform initiatives in a global SaaS infrastructure. You’ll work side-by-side with our Product, Security, and Development teams to automate and roll out new standardized service platforms for product code. Backed by the might of our teams, we’ll provide you with the tools and resources needed to achieve extraordinary results whose impact extends beyond the boundaries of traditional engineering roles.
These positions could be in our client’s Carpinteria, CA headquarters, New York City, or Austin, TX office. Remote candidates will be considered based on the level of experience and with the expectation of occasional travel to these offices. We’re looking for people to join our team immediately.
What you’ll do:

  • Drive deployment excellence and product quality through a software-defined approach to operations and infrastructure
  • Identify opportunities for differentiating open-source initiatives, and lead the development of new open-source SRE/Infrastructure platform tools (e.g. Envoy, Helm)
  • Serve as a champion for idempotent infrastructure-as-code by taking ownership in the end-to-end configuration, technical dependencies, and overall success of the SaaS environment
  • Ensure services are designed and delivered to be mission critical with focus on security, resiliency, scale, and performance
  • Educate and drive global adoption of automation and orchestration principles, and create an eagerness to automate, wherever and whenever the possibility arises
  • Lead reviews of site reliability processes such as testing, CI/CD, and release management. Provide unwavering support and collaboration for the software/QA engineers on projects
  • Ensure new and existing products support automated deployment and remote execution-based remediation scripts
  • Lead the improvement of testing functionality, operability, deployment, and performance for application or infrastructure changes
  • Mentor and coach junior site reliability engineers, and be a driver for change and DevOps adoption across the broader organization

What we’re looking for:

  • BS or MS degree in Management Information Systems or a related discipline; Technical Certifications are a plus
  • 8+ years of combined experience as a Software Engineer and DevOps Engineer, with coding experience in an object-oriented language
  • 5+ years of experience supporting production in a SaaS multi-tenant environment
  • Strong experience documenting and driving process improvements
  • Demonstrated experience leading automated infrastructure/application systems deployment and configuration
  • Expert with AWS services (certified SysOps Administrator or Solutions Architect preferred)
  • Experience leading small & large initiatives with the ability to course-correct as needed
  • Experience working with teams, providing mentorship and guidance to improve the overall reliability of the ecosystem
  • Ability to consistently evaluate current technical approaches so that they remain best-in-class in the industry
  • Substantial experience with the following technologies is preferred:
    • AWS
    • Infrastructure/cloud automation tooling (e.g. CloudFormation, Terraform, Packer)
    • Service Mesh/Discovery Tooling (e.g. Consul, Envoy, Istio)
    • Continuous Integration (e.g. Spinnaker)
    • Containers and Container Management (Docker, Kubernetes)
    • Configuration and Security Management (e.g. Puppet, Chef, Ansible, Salt, Vault, KMS)
    • Networking protocol knowledge (e.g., TCP/IP, UDP, IPSEC, HTTP, HTTPS, routing protocols)
