Daily Dose of Data Science


Raincloud Plots: The Hidden Gem of Data Visualisation

...that very few are talking about.

Avi Chawla
Jul 11, 2023

Visualizing data distributions using box plots and histograms can be misleading at times.

This is because:

  • It is possible to get the same box plot with entirely different data.

    • For instance, consider the illustration below from one of my previous posts: Use Box Plots With Caution! They May Be Misleading.

    • We get the same box plot with three different datasets.

  • Altering the number of bins changes the shape of a histogram.

    • An earlier post covers this in more detail.
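Both caveats above can be checked numerically. The snippet below is a minimal sketch using only the standard library; the three small datasets and the helper names `five_number_summary` and `histogram_counts` are made up for illustration and are not taken from the earlier posts.

```python
from statistics import quantiles


def five_number_summary(data):
    """Return (min, Q1, median, Q3, max), the values a box plot is drawn from."""
    ordered = sorted(data)
    q1, median, q3 = quantiles(ordered, n=4, method="inclusive")
    return ordered[0], q1, median, q3, ordered[-1]


def histogram_counts(data, bins):
    """Count the points falling into `bins` equal-width bins over the data range."""
    low, high = min(data), max(data)
    width = (high - low) / bins
    counts = [0] * bins
    for x in data:
        # The maximum value belongs to the last bin, hence the clamp.
        index = min(int((x - low) / width), bins - 1)
        counts[index] += 1
    return counts


# Three illustrative datasets with clearly different shapes (assumed data).
evenly_spread = [0, 1, 2, 3, 4, 5, 6, 7, 8]
two_clusters = [0, 2, 2, 2, 4, 6, 6, 6, 8]
centre_heavy = [0, 0, 2, 4, 4, 4, 6, 8, 8]

for name, data in [
    ("evenly_spread", evenly_spread),
    ("two_clusters", two_clusters),
    ("centre_heavy", centre_heavy),
]:
    print(name, five_number_summary(data))

# The same data binned two ways.
for bins in (2, 3):
    print(bins, "bins:", histogram_counts(two_clusters, bins))
```

All three datasets share the summary (0, 2, 4, 6, 8), so their box plots would look identical. With 2 bins, `two_clusters` gives counts of [4, 5] and looks roughly flat; with 3 bins it gives [4, 1, 4] and looks bimodal.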

Thus, to avoid misleading conclusions, it is recommended to plot the underlying data distribution itself, not just a summary of it.

Here, jitter (strip) plots and KDE plots are immensely helpful.

One way is to draw them separately and analyze them together, as shown below. But this is quite tedious.

Box, strip, and KDE plots drawn separately

Instead, try Raincloud plots.

They provide a concise way to combine and visualize three different types of plots together.

These include:

  • Box plots for summary statistics (median, quartiles, and range).

  • Strip plots for a view of the individual data points.

  • KDE plots for the estimated probability distribution of the data.

Raincloud plot: box, strip, and KDE plots at once
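To make the three layers concrete, here is a rough sketch that assembles a raincloud by hand with NumPy and Matplotlib. It is not the PtitPrince API: the `raincloud` and `gaussian_kde_curve` helpers, the synthetic bimodal sample, the rule-of-thumb bandwidth, and all sizes and offsets are assumptions chosen for illustration. As a simplification, the whiskers span the full data range instead of using the usual 1.5 × IQR rule.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend; drop this line in a notebook
import matplotlib.pyplot as plt
import numpy as np
from matplotlib.patches import Rectangle


def gaussian_kde_curve(values, grid, bandwidth):
    """Evaluate a Gaussian kernel density estimate of `values` on `grid`."""
    z = (grid[:, None] - values[None, :]) / bandwidth
    kernel = np.exp(-0.5 * z**2) / np.sqrt(2 * np.pi)
    return kernel.sum(axis=1) / (len(values) * bandwidth)


def raincloud(ax, values, position=0.0, seed=0):
    """Draw one horizontal raincloud around y = `position` (hypothetical helper)."""
    values = np.asarray(values, dtype=float)
    rng = np.random.default_rng(seed)

    # Cloud: the KDE drawn as a filled half-violin above the baseline.
    # Bandwidth from the normal-reference rule of thumb (an assumption).
    bandwidth = 1.06 * values.std(ddof=1) * len(values) ** (-1 / 5)
    grid = np.linspace(values.min() - 3 * bandwidth,
                       values.max() + 3 * bandwidth, 200)
    density = gaussian_kde_curve(values, grid, bandwidth)
    density = 0.4 * density / density.max()  # rescale so the peak is 0.4 high
    ax.fill_between(grid, position + 0.05, position + 0.05 + density, alpha=0.6)

    # Box: quartile box and median on the baseline; whiskers go from min to max.
    q1, median, q3 = np.percentile(values, [25, 50, 75])
    ax.hlines(position, values.min(), values.max(), color="black", linewidth=1)
    ax.add_patch(Rectangle((q1, position - 0.03), q3 - q1, 0.06,
                           facecolor="white", edgecolor="black", zorder=3))
    ax.vlines(median, position - 0.03, position + 0.03, color="black", zorder=4)

    # Rain: the raw points, jittered vertically below the box.
    jitter = rng.uniform(-0.30, -0.08, size=len(values))
    ax.scatter(values, position + jitter, s=8, alpha=0.5)
    return grid, density


# Synthetic bimodal sample (assumed data, not from the post).
rng = np.random.default_rng(42)
sample = np.concatenate([rng.normal(0, 1, 150), rng.normal(5, 0.8, 100)])

fig, ax = plt.subplots(figsize=(7, 3))
grid, density = raincloud(ax, sample)
ax.set_yticks([])
ax.set_xlabel("value")
```

Call `fig.savefig(...)` or `plt.show()` to view the result. The two modes show up in both the cloud and the rain, while the box alone would hide them.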

Overall, Raincloud plots are an excellent choice for data visualization.

With Raincloud plots, you can:

  • Combine multiple plots to prevent incorrect/misleading conclusions

  • Reduce clutter and enhance clarity

  • Improve comparisons between groups

  • Capture different aspects of the data through a single plot

You can use the PtitPrince library to create Raincloud plots in Python: GitHub.

R users can use the Raincloud Plots library: GitHub.

P.S. If the name “Raincloud plot” isn’t obvious yet, it comes from the visual appearance of the plot:

The origin of the name “Raincloud plot”

👉 Over to you: What are some other hidden gems of data visualization?

