Description

Book Synopsis

Large datasets tend to be distributed, non-uniform, and prone to change. Dask simplifies the process of ingesting, filtering, and transforming data, reducing or eliminating the need for a heavyweight framework like Spark.

Data Science at Scale with Python and Dask teaches readers how to build distributed data projects that can handle huge amounts of data. The book introduces Dask Data Frames and teaches helpful code patterns to streamline the reader’s analysis.

Key Features

  • Working with large structured datasets
  • Writing DataFrames
  • Cleaningand visualizing DataFrames
  • Machine learning with Dask-ML
  • Working with Bags and Arrays

Written for data engineers and scientists with experience using Python. Knowledge of the PyData stack (Pandas, NumPy, and Scikit-learn) will be helpful. No experience with low-level parallelism is required.

About the technology

Dask is a self-contained, easily extendible library designed to query, stream, filter, and consolidate huge datasets.

Jesse Daniel has five years of experience writing applications in Python, including three years working with in the PyData stack (Pandas, NumPy, SciPy, Scikit-Learn). Jesse joined the faculty of the University of Denver in 2016 as an adjunct professor of business information and analytics, where he currently teaches a Python for Data Science course.

Data Science at Scale with Python and Dask

    Product form

    £37.99

    Includes FREE delivery

    RRP £39.99 – you save £2.00 (5%)

    Order before 4pm today for delivery by Wed 1 Jul 2026.

    A Paperback / softback by Jesse Daniel

      Trusted by thousands of customers. See 2,385+ Customer Reviews

      View other formats and editions of Data Science at Scale with Python and Dask by Jesse Daniel

      Publisher: Manning Publications
      Publication Date: 11/10/2019
      ISBN13: 9781617295607, 978-1617295607
      ISBN10: 1617295604

      Description

      Book Synopsis

      Large datasets tend to be distributed, non-uniform, and prone to change. Dask simplifies the process of ingesting, filtering, and transforming data, reducing or eliminating the need for a heavyweight framework like Spark.

      Data Science at Scale with Python and Dask teaches readers how to build distributed data projects that can handle huge amounts of data. The book introduces Dask Data Frames and teaches helpful code patterns to streamline the reader’s analysis.

      Key Features

      • Working with large structured datasets
      • Writing DataFrames
      • Cleaningand visualizing DataFrames
      • Machine learning with Dask-ML
      • Working with Bags and Arrays

      Written for data engineers and scientists with experience using Python. Knowledge of the PyData stack (Pandas, NumPy, and Scikit-learn) will be helpful. No experience with low-level parallelism is required.

      About the technology

      Dask is a self-contained, easily extendible library designed to query, stream, filter, and consolidate huge datasets.

      Jesse Daniel has five years of experience writing applications in Python, including three years working with in the PyData stack (Pandas, NumPy, SciPy, Scikit-Learn). Jesse joined the faculty of the University of Denver in 2016 as an adjunct professor of business information and analytics, where he currently teaches a Python for Data Science course.

      Recently viewed products

      © 2026 Book Curl

        • American Express
        • Apple Pay
        • Diners Club
        • Discover
        • Google Pay
        • Maestro
        • Mastercard
        • PayPal
        • Shop Pay
        • Union Pay
        • Visa

        Login

        Forgot your password?

        Don't have an account yet?
        Create account