Language
  • Python 2
  • Python 3
Reading time
  • Approximately 22 days
What you will learn
  • Numerical Programming and Data Mining
Author
  • Allen B. Downey
Published
  • 9 years, 4 months ago
Packages you will be introduced to
  • numpy
  • matplotlib
  • pandas
  • statsmodel
  • scipy

If you know how to program, you have the skills to turn data into knowledge, using tools of probability and statistics. This concise introduction shows you how to perform statistical analysis computationally, rather than mathematically, with programs written in Python.

By working with a single case study throughout this thoroughly revised book, you’ll learn the entire process of exploratory data analysis—from collecting data and generating statistics to identifying patterns and testing hypotheses. You’ll explore distributions, rules of probability, visualization, and many other tools and concepts.

New chapters on regression, time series analysis, survival analysis, and analytic methods will enrich your discoveries.

  • Develop an understanding of probability and statistics by writing and testing code
  • Run experiments to test statistical behavior, such as generating samples from several distributions
  • Use simulations to understand concepts that are hard to grasp mathematically
  • Import data from most sources with Python, rather than rely on data that’s cleaned and formatted for statistics tools
  • Use statistical inference to answer questions about real-world data
The author Allen B. Downey has the following credentials.

  • Professor at Franklin W. Olin College of Engineering
  • Professor at Harvard University, one of the best universities in the world
  • Works/Worked at Google