Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python 2nd Edition

4.6 out of 5 stars 343 ratings
ISBN-13: 978-1492072942
ISBN-10: 149207294X
Why is ISBN important?
ISBN
This bar-code number lets you verify that you're getting exactly the right version or edition of a book. The 13-digit and 10-digit formats both work.
Scan an ISBN with your phone
Use the Amazon App to scan ISBNs and compare prices.
Share <Embed>
Loading your book clubs
There was a problem loading your book clubs. Please try again.
Not in a club? Learn more
Amazon book clubs early access

Join or create book clubs

Choose books together

Track your books
Bring your club to Amazon Book Clubs, start a new book club and invite your friends to join, or find a club that’s right for you for free.
Used: Very Good | Details
Sold by Vynavylend
Condition: Used: Very Good
Comment: Item without any highlights or writing. Get a super customer care by Amazon customer service. Hassle free return, fast shipping & Fulfilled by Amazon !
Access codes and supplements are not guaranteed with used items.
Get Fast, Free Shipping with Amazon Prime
FREE delivery Sunday, January 30 if you spend $25 on items shipped by Amazon
Or fastest delivery Wednesday, January 26. Order within 2 hrs 10 mins
Only 17 left in stock - order soon.
Available at a lower price from other sellers that may not offer free Prime shipping.
List Price: $69.99 Details
Save: $44.99 (64%)
FREE delivery Sunday, January 30
Or fastest delivery Thursday, January 27. Order within 3 hrs 40 mins
Practical+Statistics+for+... has been added to your Cart
Available at a lower price from other sellers that may not offer free Prime shipping.

Book picks to kick off your year of reading

Frequently bought together

  • Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python
  • +
  • Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Syst
  • +
  • Ace the Data Science Interview: 201 Real Interview Questions Asked By FAANG, Tech Startups, & Wall Street
Total price:
To see our price, add these items to your cart.
Some of these items ship sooner than the others.
Choose items to buy together.

From the brand


From the Publisher

Editorial Reviews

About the Author

Peter Bruce is the Founder and Chief Academic Officer of the Institute for Statistics Education at Statistics.com, which offers about 80 courses in statistics and analytics, roughly half of which are aimed at data scientists. He has authored or co-authored several books in statistics and analytics, and he earned his Bachelor’s degree at Princeton, and Masters degrees at Harvard and the University of Maryland.

^

Andrew Bruce, Principal Research Scientist at Amazon, has over 30 years of experience in statistics and data science in academia, government and business. The co-author of Applied Wavelet Analysis with S-PLUS, he earned his bachelor’s degree at Princeton, and PhD in statistics at the University of Washington

^

Peter Gedeck, Senior Data Scientist at Collaborative Drug Discovery, specializes in the development of machine learning algorithms to predict biological and physicochemical properties of drug candidates. Co-author of Data Mining for Business Analytics, he earned PhD’s in Chemistry from the University of Erlangen-Nürnberg in Germany and Mathematics from Fernuniversität Hagen, Germany.


Celebrate Lunar New Year with Amazon Gift Cards.

Product details

  • Publisher ‏ : ‎ O'Reilly Media; 2nd edition (June 2, 2020)
  • Language ‏ : ‎ English
  • Paperback ‏ : ‎ 368 pages
  • ISBN-10 ‏ : ‎ 149207294X
  • ISBN-13 ‏ : ‎ 978-1492072942
  • Item Weight ‏ : ‎ 1.3 pounds
  • Dimensions ‏ : ‎ 7 x 0.9 x 9.1 inches
  • Customer Reviews:
    4.6 out of 5 stars 343 ratings

About the author

Follow authors to get new release updates, plus improved recommendations.
Brief content visible, double tap to read full content.
Full content visible, double tap to read brief content.

Dr. Peter Gedeck holds a Ph.D. in chemistry. He worked for twenty years as a computational chemist in drug discovery at Novartis in the United Kingdom, Switzerland, and Singapore. His research interests include the application of statistical and machine learning methods to problems in drug discovery. He is a scientist in the research informatics team at Collaborative Drug Discovery, which offers the pharmaceutical industry cloud-based software to manage the huge amount of data involved in the drug discovery process.

Peter’s specialty is the development of machine learning algorithms to predict biological and physicochemical properties of drug candidates. His scientific work is published in more than 50 peer reviewed articles.

Peter also teaches at University of Virginia's School of Data Science and gives a series of courses on Predictive Analytics at Statistics.com.

Customer reviews

4.6 out of 5 stars
4.6 out of 5
343 global ratings

Top reviews from the United States

Reviewed in the United States on June 23, 2020
Verified Purchase
34 people found this helpful
Report abuse
Reviewed in the United States on December 6, 2020
Verified Purchase
18 people found this helpful
Report abuse
Reviewed in the United States on February 28, 2021
Verified Purchase
16 people found this helpful
Report abuse
Reviewed in the United States on February 1, 2021
Verified Purchase
7 people found this helpful
Report abuse
Reviewed in the United States on June 30, 2020
Verified Purchase
6 people found this helpful
Report abuse
Reviewed in the United States on June 5, 2020
Verified Purchase
4 people found this helpful
Report abuse
Reviewed in the United States on November 17, 2021
Verified Purchase
One person found this helpful
Report abuse
Reviewed in the United States on July 13, 2021
Verified Purchase
Customer image
1.0 out of 5 stars BAD PRINT
By Harry on July 13, 2021
This was supposed to be a new book. Seller should have caught this. On page 114-115 it looks like the publishing page cutter got crimped or something so that the book pages were as seen on the photos. Don't have time to send back and get another one. Disappointment.
Images in this review
Customer image Customer image Customer image Customer image
Customer imageCustomer imageCustomer imageCustomer image
3 people found this helpful
Report abuse

Top reviews from other countries

Chris
5.0 out of 5 stars Excellent book for aspiring data scientists
Reviewed in the United Kingdom on November 3, 2021
Verified Purchase
Amazon Customer
3.0 out of 5 stars Good
Reviewed in the United Kingdom on March 18, 2021
Verified Purchase
Laszlo Molnar
5.0 out of 5 stars Good explanations of complicated issues
Reviewed in the United Kingdom on February 24, 2021
Verified Purchase
Franco Arda
5.0 out of 5 stars Deep knowledge of data science
Reviewed in Germany on July 14, 2020
Verified Purchase
Customer image
5.0 out of 5 stars Deep knowledge of data science
Reviewed in Germany on July 14, 2020
In my view, this book’s strength is the deep knowledge of the authors added by the ability to explain key points in a few sentences.

I love the frequent question and answer to “Is it important for Data Scientists?” Data Science is such a wide and deep topic, that any pointers are extremely welcome.

Who is this book for? I believe it’s for intermediate to advanced Data Scientists. There’s so much “wisdom” that any reader should find value in the book.

The code snippets are in Python and R. Sometimes those snippets are enough (e.g. power analysis). Sometimes the reader needs different sources to dig deeper (e.g. bootstrapping where I highly recommend infer in R). I believe this “compressed” approach is smart. Data science is too wide and deep and we must be able to dig deeper on our own.

In other words, for a beginner, the code is often not enough to learn a new concept. Experienced Data Scientists should be able to judge from the code snippet if it’s enough.

+++ Personal highlights: +++

One of the best explanations on effect size I’ve ever seen (page 135).

Sometimes, the statistics community uses different terms than the machine learning community. The authors seem to understand both (page 143).

For example, in the last 10 years or so, we’ve seen a trend in statistics that favors data and simulations over classical probability theory and complex tests. But why would we use permutations in a hypothesis test? On page 139, the authors explain in succinctly in two sentences.

In fact, the authors have a deep knowledge of resampling and how to use simulations over classical tests.

The authors don’t try to confuse you. I’ve seen new books which used two pages to explain recall and then two pages to explain sensitivity. In this book, they don’t do it. Recall is the same as sensitivity (page 223).

Another example is “Power and Sample Size.” In only four pages, the reader probably gets a good idea of the four moving parts: sample size, effect size, significance level and power. This stuff is hard and explaining it well is even harder.

When cluster algorithms tend to give the same results and when not.

Funny: “…regression comes with a baggage that is more relevant to its traditional role …”(page 161).

Why would a Data Scientist care about heteroskedasticity? Page 183.

Kudos
Images in this review
Customer image
Customer image
4 people found this helpful
Report abuse
Komal Diwe
3.0 out of 5 stars Blank & White print..!
Reviewed in India on February 26, 2021
Verified Purchase
Customer image
3.0 out of 5 stars Blank & White print..!
Reviewed in India on February 26, 2021
At this huge price was expecting color print but got greyscale edition this disappointed me.
Book content is awesome but color print was expected.😢😢
Images in this review
Customer image
Customer image
9 people found this helpful
Report abuse