If you're making the switch to Data Science, you might have come from a programming route or from science. It would be all too easy to learn a few new skills in data handling and machine learning and neglect statistics.
After all, many of the new breed of Data Scientists are declaring stats to be dead.
They are wrong, and it would be a big mistake to ignore statistics.
Traditionally stats was used mainly for hypothesis testing, but in these days of Data Science, Big Data and the Internet of Things it's being used just as much for making discoveries and formulating new hypotheses.
If you don't know where to start your educational journey with stats, the 3 books in this blog post will help you make your first steps.
Disclosure: the three books in this post link you to the listed book at your local Amazon store.
We may earn an affiliate commission for purchases you make when using the links to books on this page.
In this post - the 3rd in a series of 8 in which we bring you 21 Inspirational Books for All Aspiring Data Scientists, we highlight 3 books to introduce you to the subject of statistics in Data Science:
3 Must-Read Statistics Books for Aspiring Data Scientists @eelrekab @chi2innovations #statistics #datascience
Once considered tedious, the field of statistics is rapidly evolving into a discipline Hal Varian, chief economist at Google, has actually called “sexy”. From batting averages and political polls to game shows and medical research, the real-world application of statistics continues to grow by leaps and bounds. How can we catch schools that cheat on standardized tests? How does Netflix know which movies you’ll like? What is causing the rising incidence of autism? As best-selling author Charles Wheelan shows us in Naked Statistics, the right data and a few well-chosen statistical tools can help us answer these questions and more.
For those who slept through Stats 101, this book is a lifesaver. Wheelan strips away the arcane and technical details and focuses on the underlying intuition that drives statistical analysis. He clarifies key concepts such as inference, correlation, and regression analysis, reveals how biased or careless parties can manipulate or misrepresent data, and shows us how brilliant and creative researchers are exploiting the valuable data from natural experiments to tackle thorny questions.
And in Wheelan’s trademark style, there’s not a dull page in sight. You’ll encounter clever Schlitz Beer marketers leveraging basic probability, an International Sausage Festival illuminating the tenets of the central limit theorem, and a head-scratching choice from the famous game show Let’s Make a Deal – and you’ll come away with insights each time. With the wit, accessibility, and sheer fun that turned Naked Economics into a bestseller, Wheelan defies the odds yet again by bringing another essential, formerly unglamorous discipline to life.
Statistical methods are a key part of of data science, yet very few data scientists have any formal statistics training. Courses and books on basic statistics rarely cover the topic from a data science perspective. This practical guide explains how to apply various statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what’s important and what’s not.
Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you’re familiar with the R programming language, and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.
With this book, you’ll learn:
Scientific progress depends on good research, and good research needs good statistics. But statistical analysis is tricky to get right, even for the best and brightest of us. You’d be surprised how many scientists are doing it wrong.
Statistics Done Wrong is a pithy, essential guide to statistical blunders in modern science that will show you how to keep your research blunder-free. You’ll examine embarrassing errors and omissions in recent research, learn about the misconceptions and scientific politics that allow these mistakes to happen, and begin your quest to reform the way you and your peers do statistics.
You’ll find advice on:
All 8 posts in the series: