Manager, Cloud Infrastructure and Operations at Spiceworks
Austin, TX, US

Who are you: We are looking for a hands-on technology professional to oversee all aspects of our infrastructure operations. We are looking for someone who is not afraid to roll-up their sleeves and work as part of a highly collaborative team, but also has the ability to see and manage the big picture.

You'll support the development organization in build vs. buy decisions; interact with the architecture, security and IT/helpdesk teams to provide timely communication and assistance; and most importantly, manage the day-to-day operations of a small team of SRE and automation engineers, both local and remote.

Position reports to the Director of IT/Operations.

Who we are: Spiceworks has been connecting the IT industry since 2006. We’ve grown into a global IT marketplace, powered by millions of tech buyers and sellers who rely on each other to get their jobs done. This tight-knit community trusts Spiceworks to connect them with the right insights, tools, and experts when they need it most. 

Why Spiceworks? No one joins us just because they want a job. Spiceworkers join us to wake up every day inspired to know they make a difference. How? By interacting directly with the people who use technology to transform their organizations, communities, and the world, every day. We’re passionate about helping tech professionals and tech brands drive their businesses forward by connecting them to the right resources at the right time. 

In short, we’re building the biggest IT marketplace on the planet. We’re going to keep changing this industry for the better (and have a blast doing it)! 

Your day-to-day

  • Oversee the operations of our infrastructure team managing an extensive containerized AWS environment
  • Participate in architecting and building systems for maximum performance, reliability and scalability
  • Work with the engineering teams on product design, decisions and troubleshooting
  • Maintain and improve the development squads' ability to rapidly and safely deploy code and configuration
  • Manage and participate in on-call rotation, as an escalation path
  • Maintain and test the disaster recovery plan
  • Define and report on metrics relating to SLAs and uptime

Qualifications: What does it take to do this job?

  • 5+ years of experience in managing production-critical infrastructures and DevOps environments
  • 2+ years of experience managing SRE and automation engineers
  • Extensive knowledge of the Amazon Web Services ecosystem
  • Kubernetes deployment and management experience - ECS, EKR and/or KOPS deployments
  • Is a strong self-starter, operationally-focused, has a holistic data perspective, is a problem-solver
  • Is knowledgeable in network, firewall and security best practices
  • Experience with infrastructure automation and monitoring tools
  • Some prior experience with cloud migrations