With his characteristic mix of genius and humour, David Beazley’s keynote wove a fascinating narrative about the history of Python and the values and strategies baked into its DNA. He shared some priceless anecdotes about accidentally programming a supercomputer nicknamed ‘The Beast’ with 666 lines of Assembler code and the magical things that can happen when your office-trailer gets struck by lightning. Watch the whole keynote here.
If you’re a PyData history buff like us, we recommend checking out Matthew Rocklin’s video on the history of Dask. For more on the current and future state of all things high-performance and distributed computing, check out this blog to access a candid conversation between Matt and Peter Wang from Anaconda on The State of Distributed Computing.
Turning a highly technical topic into an engaging presentation is not an easy feat. We think Francesco Tisiot did a phenomenal job with his talk “Get to know Apache Kafka with Jupyter Notebooks”. Besides a clear explanation of how Apache Kafka works and how to apply it, we also learned some fascinating facts about pizza etiquette in Italy.
With all the flash and hype around Big Data, AI, and ML, it’s sometimes easy to forget that some of the more ‘boring’ down-to-earth stuff like data formats and compression algorithms is actually just as crucial to your data science success. While tuning your ML model’s hyperparameters is important, spending time optimizing the quality and format of your data input can also lead to massive performance gains.
There were a lot of sessions about (interactive) visualisation this year. We especially enjoyed Nicolas Kruchten’s (Plotly) high-level overview of why visualisation matters and the 4 different levels of interactivity. For a brutally honest, on-the-spot comparison of competing visualisation libraries in Python, we recommend taking an hour to watch the Python Dashboarding Shootout and Showdown.
And of course, this list wouldn't be complete without us mentioning the strong Dask presence at this first-ever PyData Global, both from Dask maintainers as well as other presenters from across the business and academic worlds.
Thanks for reading! You can learn more about our product Coiled Cloud, which provides hosted Dask clusters, docker-less managed software, and one-click deployments, and give it a spin for free today, when you click below.