Exploratory data analysis
After obtaining a dataset, it is vitally important to understand the characteristics of the existing data. Sometimes the most effective way to grasp the data is through summary statistics or other numerical measures. Often, however, it is a picture that tells a thousand words. Knowing how to best convey the underlying meaning in a dataset is a hugely important aspect of communicating results.
- Categorical data is the focus of Chapter 4 Exploring categorical data. Both numerical and graphical summaries are presented as ways to convey information about categorical data.
- Numerical data is the focus of Chapter 5 Exploring numerical data. Both numerical and graphical summaries are presented as ways to convey information about numerical data.
- Chapter 6 Applications: Explore does a deep dive into important considerations when creating a visualization.
While our book is software agnostic, one of the best ways to become familiar with numerical and graphical summaries is to practice working with different datasets using statistical software. For example, if you are interested in using R, you might try working through some of the chapters in R for Data Science (https://r4ds.hadley.nz), specifically the parts Whole game and Visualize.