Bookbot

Data Science with Python and Dask

Hodnotenie knihy

Viac o knihe

Dask is a native parallel analytics tool that integrates seamlessly with existing libraries like Pandas, NumPy, and Scikit-Learn, allowing you to work with large datasets using familiar tools. This guide shows how to leverage Dask for data projects without altering your workflow. The print book purchase includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications, with registration instructions provided inside. Efficient data pipelines are crucial for successful data science projects. Dask offers a flexible library for parallel computing in Python, enabling intuitive workflows for ingesting and analyzing large, distributed datasets. It features dynamic task scheduling and parallel collections that enhance the capabilities of NumPy, Pandas, and Scikit-Learn, allowing users to scale their code from a single laptop to a cluster of hundreds of machines effortlessly. The book teaches you to build scalable projects capable of handling massive datasets. You'll explore the Dask framework, analyze the NYC Parking Ticket database, and use DataFrames for streamlined processes. You'll also create machine learning models with Dask-ML, develop interactive visualizations, and build clusters using AWS and Docker. This resource is intended for data scientists and developers familiar with Python and the PyData stack. The author, Jesse Daniel, is an experienced Python developer and educator, leading a team of data scienti

Nákup knihy

Data Science with Python and Dask, Jesse C. Daniel

Jazyk
Rok vydania
2019
product-detail.submit-box.info.binding
(mäkká)
Akonáhle sa objaví, pošleme e-mail.

Platobné metódy

3,6
Veľmi dobrá
13 Hodnotenie

Tu nám chýba tvoja recenzia

Titul
Data Science with Python and Dask
Jazyk
anglicky
Rok vydania
2019
Väzba
mäkká
Počet strán
296
ISBN10
1617295604
ISBN13
9781617295607
Série
Hodnotenie
3,6 z 5
Anotácia
Dask is a native parallel analytics tool that integrates seamlessly with existing libraries like Pandas, NumPy, and Scikit-Learn, allowing you to work with large datasets using familiar tools. This guide shows how to leverage Dask for data projects without altering your workflow. The print book purchase includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications, with registration instructions provided inside. Efficient data pipelines are crucial for successful data science projects. Dask offers a flexible library for parallel computing in Python, enabling intuitive workflows for ingesting and analyzing large, distributed datasets. It features dynamic task scheduling and parallel collections that enhance the capabilities of NumPy, Pandas, and Scikit-Learn, allowing users to scale their code from a single laptop to a cluster of hundreds of machines effortlessly. The book teaches you to build scalable projects capable of handling massive datasets. You'll explore the Dask framework, analyze the NYC Parking Ticket database, and use DataFrames for streamlined processes. You'll also create machine learning models with Dask-ML, develop interactive visualizations, and build clusters using AWS and Docker. This resource is intended for data scientists and developers familiar with Python and the PyData stack. The author, Jesse Daniel, is an experienced Python developer and educator, leading a team of data scienti