Language

- Python 2
- Python 3

Reading time

- Approximately 22 days

What you will learn

- Numerical Programming and Data Mining

Author

- Allen B. Downey

Published

- 5 years ago

Packages you will be introduced to

- numpy
- matplotlib
- pandas
- statsmodel
- scipy

If you know how to program, you have the skills to turn data into knowledge, using tools of probability and statistics. This concise introduction shows you how to perform statistical analysis computationally, rather than mathematically, with programs written in Python.

By working with a single case study throughout this thoroughly revised book, you’ll learn the entire process of exploratory data analysis—from collecting data and generating statistics to identifying patterns and testing hypotheses. You’ll explore distributions, rules of probability, visualization, and many other tools and concepts.

New chapters on regression, time series analysis, survival analysis, and analytic methods will enrich your discoveries.

- Develop an understanding of probability and statistics by writing and testing code
- Run experiments to test statistical behavior, such as generating samples from several distributions
- Use simulations to understand concepts that are hard to grasp mathematically
- Import data from most sources with Python, rather than rely on data that’s cleaned and formatted for statistics tools
- Use statistical inference to answer questions about real-world data

Read this book? Comment on this book's
GitHub issue page and share what you liked and what you didn't like
about it. Your GitHub comment will
show up as a review here.
See an example.