29 days old

Data Science Engineer, Machine Learning Platform - PlayStation

San Mateo, California

We seek a Platform Engineer to support our transitioning to the cloud. In this role you will deliver tools for our new machine learning, analytics and big data cloud-based platform hosted on AWS and based on EMR Permanent and Transitory Clusters, S3 storage and in house software and open source tools. This will empower our global teams to quickly leverage advanced Machine Learning for a variety of problems. If this is you, please apply! 

Responsibilities:

  • Collaborate globally with data and cloud engineers to build a Machine Learning AWS-based platform.
  • Collaborate with data scientists to make sure new platform meets requirements and conforms to best practice.
  • Perform complex application programming activities, coding, testing, implementation and documentation of solution.
  • Troubleshoot and debug services in all stages of the development cycles, from development to production.
  • Document new and existing projects to improve community understanding and contribution.

Requirements

Required:

  • Strong experience in designing, deploying and operating highly available, scalable and fault tolerant systems using Amazon Web Services (EMR Clusters, S3, ELBs, EC2, EBS).
  • Strong working knowledge of deploying and configuring Apache Spark clusters, ideally on EMR clusters.
  • Strong proficiency in Python.
  • Detailed knowledge/understanding of more than one version control system, including git.
  • Knowledge of large open source projects and how they operate preferably Airflow.
  • Working knowledge of unix-like environments; shell scripting and system level knowledge.
  • Practical exposure to Continuous Integration/Continuous Delivery tools like Jenkins to merge development with testing through pipelines.

 

Nice-to-Have:

  • Big-Data Cloud Scalability.
  • Hive metastore and Hadoop.
  • JDBC/ODBC, SQL query processing, and distributed query engines.
  • Configuration Management tools like Ansible and Terraform.
  • Docker container infrastructure.
  • Monitoring and logging tools like Splunk.
  • Jupyterhub deployment and Apache Livy integration.
  • Visualization tools such as Tableau.

Categories

Posted: 2019-07-19 Expires: 2019-09-17

Featured Jobs

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Data Science Engineer, Machine Learning Platform - PlayStation

PlayStation
San Mateo, California

Join us to start saving your Favorite Jobs!

Sign In Create Account
Powered ByCareerCast