Statistics for Data Science and Analytics
eBook - ePub

Statistics for Data Science and Analytics

  1. English
  2. ePUB (mobile friendly)
  3. Only available on web
eBook - ePub

Statistics for Data Science and Analytics

Book details
Table of contents
Citations

About This Book

Introductory statistics textbook with a focus on data science topics such as prediction, correlation, and data exploration

Statistics for Data Science and Analytics is a comprehensive guide to statistical analysis using Python, presenting important topics useful for data science such as prediction, correlation, and data exploration. The authors provide an introduction to statistical science and big data, as well as an overview of Python data structures and operations.

A range of statistical techniques are presented with their implementation in Python, including hypothesis testing, probability, exploratory data analysis, categorical variables, surveys and sampling, A/B testing, and correlation. The text introduces binary classification, a foundational element of machine learning, validation of statistical models by applying them to holdout data, and probability and inference via the easy-to-understand method of resampling and the bootstrap instead of using a myriad of "kitchen sink" formulas. Regression is taught both as a tool for explanation and for prediction.

This book is informed by the authors' experience designing and teaching both introductory statistics and machine learning at Statistics.com. Each chapter includes practical examples, explanations of the underlying concepts, and Python code snippets to help readers apply the techniques themselves.

Statistics for Data Science and Analytics includes information on sample topics such as:

  • Int, float, and string data types, numerical operations, manipulating strings, converting data types, and advanced data structures like lists, dictionaries, and sets
  • Experiment design via randomizing, blinding, and before-after pairing, as well as proportions and percents when handling binary data
  • Specialized Python packages like numpy, scipy, pandas, scikit-learn and statsmodels ā€”the workhorses of data scienceā€”and how to get the most value from them
  • Statistical versus practical significance, random number generators, functions for code reuse, and binomial and normal probability distributions

Written by and for data science instructors, Statistics for Data Science and Analytics is an excellent learning resource for data science instructors prescribing a required intro stats course for their programs, as well as other students and professionals seeking to transition to the data science field.

Frequently asked questions

Simply head over to the account section in settings and click on ā€œCancel Subscriptionā€ - itā€™s as simple as that. After you cancel, your membership will stay active for the remainder of the time youā€™ve paid for. Learn more here.
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
Both plans give you full access to the library and all of Perlegoā€™s features. The only differences are the price and subscription period: With the annual plan youā€™ll save around 30% compared to 12 months on the monthly plan.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, weā€™ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes, you can access Statistics for Data Science and Analytics by Peter C. Bruce,Peter Gedeck,Janet Dobbins in PDF and/or ePUB format, as well as other popular books in Computer Science & Programming in Python. We have over one million books available in our catalogue for you to explore.

Information

Publisher
Wiley
Year
2024
ISBN
9781394253814
Edition
1

Table of contents

  1. Cover
  2. Table of Contents
  3. Title Page
  4. Copyright
  5. Dedication
  6. About the Authors
  7. Acknowledgments
  8. About the Companion Website
  9. Introduction
  10. 1 Statistics and Data Science
  11. 2 Designing and Carrying Out a Statistical Study
  12. 3 Exploring and Displaying the Data
  13. 4 Accounting for Chanceā€”Statistical Inference
  14. 5 Probability
  15. 6 Categorical Variables
  16. 7 Surveys and Sampling
  17. 8 More than Two Samples or Categories
  18. 9 Correlation
  19. 10 Regression
  20. 11 Multiple Linear Regression
  21. 12 Predicting Binary Outcomes
  22. Index
  23. End User License Agreement