Bookbot

Python Data Cleaning Cookbook

Modern Techniques and Python Tools to Detect and Remove Dirty Data and Extract Key Insights

Hodnotenie knihy

Viac o knihe

Discover how to detail your data, identify issues, and solve them with effective techniques. Key features include mastering various data cleaning methods to uncover insights, manipulating data to meet business needs, and validating large volumes of data to diagnose problems before analysis. This guide emphasizes the importance of clean data for accurate insights, illustrating tools and techniques for data handling with Python. You'll start by understanding data shape through routine practices applicable to most sources, then learn to manipulate data into a useful format. The book covers filtering and summarizing data to enhance comprehension and address identified issues. Key tasks include managing missing values, validating errors, removing duplicates, monitoring large datasets, and handling outliers and invalid dates. You'll also explore supervised learning and Naive Bayes analysis for detecting unexpected values and classification errors, along with generating visualizations for exploratory data analysis (EDA). By the end, you'll have the skills to clean data effectively and diagnose problems. Learn to read and analyze data from various sources, summarize attributes, filter relevant data, tackle messy issues, improve productivity with method chaining, and use visualizations for insights. This resource is ideal for anyone aiming to manage poor data using Python tools, requiring only a basic understanding of Python prog

Nákup knihy

Python Data Cleaning Cookbook, Michael Walker

Jazyk
Rok vydania
2020
product-detail.submit-box.info.binding
(mäkká)
Akonáhle sa objaví, pošleme e-mail.

Platobné metódy

4,5
Veľmi dobrá
6 Hodnotenie

Tu nám chýba tvoja recenzia

Titul
Python Data Cleaning Cookbook
Podtitul
Modern Techniques and Python Tools to Detect and Remove Dirty Data and Extract Key Insights
Jazyk
anglicky
Rok vydania
2020
Väzba
mäkká
Počet strán
436
ISBN10
1800565666
ISBN13
9781800565661
Série
Hodnotenie
4,5 z 5
Anotácia
Discover how to detail your data, identify issues, and solve them with effective techniques. Key features include mastering various data cleaning methods to uncover insights, manipulating data to meet business needs, and validating large volumes of data to diagnose problems before analysis. This guide emphasizes the importance of clean data for accurate insights, illustrating tools and techniques for data handling with Python. You'll start by understanding data shape through routine practices applicable to most sources, then learn to manipulate data into a useful format. The book covers filtering and summarizing data to enhance comprehension and address identified issues. Key tasks include managing missing values, validating errors, removing duplicates, monitoring large datasets, and handling outliers and invalid dates. You'll also explore supervised learning and Naive Bayes analysis for detecting unexpected values and classification errors, along with generating visualizations for exploratory data analysis (EDA). By the end, you'll have the skills to clean data effectively and diagnose problems. Learn to read and analyze data from various sources, summarize attributes, filter relevant data, tackle messy issues, improve productivity with method chaining, and use visualizations for insights. This resource is ideal for anyone aiming to manage poor data using Python tools, requiring only a basic understanding of Python prog