Data Engineer at Smule
San Francisco, CA, US
Music is more than just listening — music also includes creating, sharing, discovering, participating and connecting. Music is the original social network, with the power to break down barriers, touch souls, and bring people from all over the world together.

When we started in 2008, Smule was just a company with a fun name and a big dream. We wanted to bring music back to its roots and empower anyone to join in. Today, we’re a vibrant, global community of music lovers where millions of people across the world come together each day to share their passion for music, make new friends, cheer each other on, and simply have fun.

Powered by a family of award-winning apps, the Smule community of 50M monthly active users plays and sings over 20 million songs a day on their mobile phones, uploading over 2M of those songs to the growing Smule network. On a typical day, Smule stores over 35 terabytes of user-generated content in the cloud. Smule has the largest social graph for music on the internet.

If you are passionate about data, are not afraid of dealing with billions of rows and terabytes of data, and find this sort of thing super fun, come and join us!

Responsibilities

  • Develop data expertise, be a data steward evangelist, and own data pipelines, data accuracy and data integrity.
  • Support existing processes running in production. Define and manage SLAs for all data sets.
  • Design, build and launch new data models, data pipelines, and data extraction, transformation and loading processes in production.
  • Assist in building big data infrastructure to efficiently move massive amounts of data into the data warehouse and other data platforms.
  • Partner with product managers, engineers, analysts and other internal stakeholders to understand data business requirements and build efficient and scalable solutions.

Minimum Qualifications

  • Bachelor's degree in Computer Science or related technical fields
  • 2+ years of experience in custom ETL design, implementation and maintenance
  • 2+ years of experience with Python or Java
  • Strong SQL experience
  • Experience in schema design and dimensional data modeling in SQL and NoSQL database management systems and data 
  • Experience with large-scale data processing on MapReduce or MPP systems and related tools such as Spark, Hadoop, Airflow, and Vertica
  • Effective communication with stakeholders

Preferred Qualifications

  • Experience with Tableau, MySQL, Presto, Parquet and shell scripting
  • Experience working with remote teams
  • Experience in a consumer web or mobile company