Wednesday, January 25, 2023 - 11am EST
1-hour session
In this lesson, we’ll learn best practices for working with larger-than-memory datasets. We’ll use the Uber/Lyft dataset to:
By the end, we’ll understand the comparative advantages of the Parquet file format and how to work efficiently with big data.
Wednesday, May 24, 2023 - 11am EDT
1-hour session
Learn best practices for larger-than-memory dataframes. Investigate Uber/Lyft data and learn to do the following:
Tune Parquet storage, build features, and explore a challenging dataset with Pandas and Dask.