Course Overview
In this workshop, we’ll explore how GPUs can improve data pipelines and how advanced data engineering tools and techniques can significantly accelerate their performance. Faster pipelines produce fresher dashboards and machine learning (ML) models, so users have the most current information at their fingertips.
Who Should Attend
Experienced Python Developers
Course Objectives
- How data moves within a computer, and how to strike the right balance between CPU, DRAM, disk storage, and GPUs.
- How different file formats can be read and manipulated by hardware.
- How to scale an ETL pipeline with multiple GPUs using NVTabular.
- How to build an interactive Plotly dashboard where users can filter on millions of data points in less than a second.
Course Outline
- Course Introduction
- Data on the Hardware Level
- ETL with NVTabular
- Data Visualization
- Final Project: Data Detective