Job ID : 43689

DevOps Engineer

Cerebras Systems - Computer Science
JOB POSTING INFORMATION
Position Type: Professional Experience Year Co-op (PEY Co-op: 12-16 months)
Job Title: DevOps Engineer
Job Location: Toronto
Job Location Type: Flexible
If working on site, can you provide a copy of your COVID-19 safety protocols?: No
Number of Positions: 1
Salary: $42.00 hourly for 40.0 hours per week
Start Date: 05/06/2024
End Date: 04/25/2025
Job Function: Information Technology (IT)
Job Description: Cerebras Systems has pioneered a groundbreaking chip and system that revolutionizes deep learning applications. Our system empowers ML researchers to achieve unprecedented speeds in training and inference workloads, propelling AI innovation to new horizons.

The Condor Galaxy 1 (CG-1), unveiled in a recent announcement, stands as a testament to Cerebras' commitment to pushing the boundaries of AI computing. With a staggering 4 ExaFLOP processing power, 54 million cores, and 64-node architecture, the CG-1 is the first of nine powerful supercomputers to be built and operated through an exclusive partnership between Cerebras and G42. This strategic collaboration aims to redefine the possibilities of AI by creating a network of interconnected supercomputers that will collectively deliver a mind-boggling 36 ExaFLOPS of AI compute power upon completion in 2024.

Cerebras is building a team of exceptional people to work together on big problems. Join us!

About The Role
You will develop and maintain the infrastructure required to build, test, operate, simulate, and evaluate the Cerebras software stack. As a DevOps engineer, you will be responsible for designing efficient, scalable workflows for automating processes in the cloud and in our datacenter. You will work closely with the development teams to monitor the quality and performance of the software test that runs on the Wafer Scale Engine (WSE), the world’s largest and fastest AI computer.
Job Requirements: Requirements
  • Enrolled in the University of Toronto's PEY program with a degree in Computer Science, Computer Engineering, or other related disciplines.
  • Experience in software development environments.
  • Experience building tools, libraries and automation framework for internal customers. 
  • Experience building services on top of AWS or other cloud platforms at scale.
  • Proficient in Python, shell scripting, Makefiles.
  • Strong end-to-end triage, debug, and troubleshooting skills.
  • Experience developing fixtures, hooks, and plugins in pytest framework is desired.
  • Experience with Jenkins and other CI/CD platforms and regressions is desired.
  • Expertise with GitHub Actions and GitHub webhooks is desired.
  • UI experience highly desirable.
  • Familiar with Docker, Kubernetes and container technology in general.
Preferred Disciplines:
Computer Engineering
Computer Science
Engineering Science (Biomedical)
Engineering Science (Electrical and Computer)
Engineering Science (Infrastructure)
Engineering Science (Machine Intelligence)
Engineering Science (Robotics)
All Co-op programs: No
Targeted Co-op Programs:
Targeted Programs
Professional Experience Year Co-op (12 - 16 months)
APPLICATION INFORMATION
Application Deadline: Nov 1, 2023 11:59 PM
Application Receipt Procedure: Online via system
Additional Application Information: Please apply with both resume & transcript. Lacking transcript will disqualify you from being considered. 
Note that applications will be considered on a rolling basis. Apply as early as possible. 
U of T Job Coordinator: Yasmine Abdelhady
ORGANIZATION INFORMATION
Organization: Cerebras Systems
Division: Computer Science
Website: https://cerebras.net/
ADDITIONAL INFORMATION
Length of Workterm: FLEXIBLE PEY Co-op: 12-16 months (range)




© 2023 University of Toronto - Orbis Career / Co-op Portal Professional v3