Coiled Blog

Accelerating Data Science, Python and Open Source

RSS Feed

Enterprise Dask Support

Matthew Rocklin
August 16, 2022
Along with the Cloud SaaS product, Coiled sells enterprise support for Dask. This article will be a more informal description of how we operate and why.
Category:
Dask

Writing JSON from Dask DataFrames with to_json

Matthew Powers
June 15, 2022
This blog post demonstrates how to output the contents of a Dask DataFrame to JSON files.
Category:
Dask
PyData
Python

Speed up a pandas query 10x with these 6 Dask DataFrame tricks

Matthew Powers
February 14, 2022
This post demonstrates how to speed up a pandas query to run 10 times faster with Dask using six performance optimizations.
Category:
Dask
PyData
Python

Reading CSV files into Dask DataFrames with read_csv

Matthew Powers
February 9, 2022
This blog post explains how to read one or multiple CSV files into a Dask DataFrame with read_csv.
Category:
Dask
PyData
Python

How to Merge Dask DataFrames

Richard Pelgrim
February 1, 2022
This post demonstrates how to merge Dask DataFrames and discusses important considerations when making large joins.
Category:
Dask
DataFrame

Parquet to CSV: Convert Many Parquet Files to a Single CSV using Python

Richard Pelgrim
January 10, 2022
This post demonstrates how to read multiple Parquet files and write them into a single CSV file with just two lines of code.
Category:
Dask
Scalable Computing

Parallelize pandas apply() and map() with Dask DataFrame

Pavithra Eswaramoorthy
November 22, 2021
With Dask’s map_partitions(), you can work on each partition of your Dask DataFrame, which is a pandas DataFrame, while leveraging parallelism for various custom workflows.
Category:
Dask
DataFrame

Dask vs Spark | Dask as a Spark Replacement

Matthew Rocklin
October 4, 2021
This article discusses the problems users looking for a Spark/Databricks replacement face, the relative strengths of Dask/Coiled for large-scale ETL processing, and also the current shortcomings.
Category:
Dask
PyData
Python

Level up your Dask using Coiled

Coiled makes it easy to scale Dask maturely in the cloud