Autopilot - Deep Learning Infrastructure Intern (Fall 2020) - Tesla

Palo Alto, California

As a Software Engineering Intern within Autopilot, you will work on reinforcing, optimizing and scaling our neural network training infrastructure.

At the core of our self-driving capabilities are several neural networks that the Deep Learning team designs and trains on large amounts of data. Running training jobs robustly at scale, whether for production models or quick experiments, and completing them in the shortest possible time is critical to our mission.

Responsibilities

  • Write robust Python code in our machine learning training repository, applying software best practices, to support machine learning scientists in tasks such as fetching training data, preprocessing it, and orchestrating training runs.
  • Integrate the training software into our continuous integration cluster to support metrics persistence across experiments, weekly and nightly neural network builds, and other unit and throughput tests.
  • Profile the performance of the training software in our training cluster, identify bottlenecks in and between CPU and GPU code execution, and optimize its throughput and scalability within and across nodes to ultimately reduce convergence time.
  • Coordinate with the team managing the hardware cluster to maintain high availability and job throughput for machine learning workloads.

Posted: 2020-05-08 Expires: 2020-07-07