Staff Site Reliability Engineer at People.ai
Toronto, CA
People.ai accelerates enterprise growth through the power of AI. With the industry’s only Revenue Intelligence System, People.ai frees all customer-facing teams, including sales, marketing, and customer success, from manual data entry by automatically capturing all contact and customer activity data, dynamically updating CRM and other systems of record, and providing actionable intelligence across management tools to realize the full selling capacity of the enterprise. Some of the world’s best brands are leveraging People.ai to transform their business, including Lyft, New Relic, Okta, Tanium, and Zoom.
 
At People.ai, we believe that people enrich the world around them in countless ways. We believe that the more time they spend applying their creativity, resourcefulness and critical thinking to activities that matter most in their professional life, the more effective a professional they become. We're developing a deep understanding of the professional world, mapping people, companies, and the information that flows between them through natural language processing and machine learning. Our team is a diverse, outspoken group of creatives and critical thinkers, hyper-focused on driving enterprise growth. We embrace different. We applaud non-traditional career paths. We're inspired by people who have made processes their own. 
 
 
The Staff Site Reliability Engineer will provide technical vision for the People.ai cloud infrastructure, evangelize and implement cloud security best practices, and identify inefficiencies in engineering processes and resolve them through automation.

Responsibilities

    • Review the current automation strategy for production and partner with stakeholders to propose and implement a holistic approach to automating the environment, one that is scalable, reliable, and able to support People.ai’s growth over the next few years.
    • Review People.ai’s monitoring, logging, automation, and observability solutions, and work with the Platform team and stakeholders to propose a unified roadmap for a comprehensive solution that can be deployed in an automated, self-service model.
    • Work with the Information Security team at People.ai to improve security posture and implement a vision for data isolation, data access inside and outside the company, and network and application security.

Requirements

    • Proven experience with Kubernetes, Docker containers, and AWS.
    • Experience automating operations with Terraform and Ansible.
    • Experience with Python and shell scripting.
    • Proven track record of partnering with different engineering teams and completing company-wide initiatives in areas such as automation, security, migrations, and monitoring.
    • Deep understanding of networking (VPC, ACL rules, routes) and operating systems (Ubuntu, Debian, CentOS), and experience troubleshooting a variety of production issues.
    • Experience with CI/CD tools such as CircleCI and Jenkins.

Nice to Have

    • Experience with RDS, Riak, and MongoDB.
    • Experience implementing API Gateways.

Founded in 2016 and based in San Francisco, the company is backed by ICONIQ Capital, Andreessen Horowitz, Lightspeed Venture Partners, Y Combinator, and others. People.ai was recognized as a winner of the 2019 Bay Area Best Places To Work, an awards program presented by the San Francisco Business Times and the Silicon Valley Business Journal.