Daily Dose of Data Science

Share this post

Automated EDA Tools That Let You Avoid Manual EDA Tasks

www.blog.dailydoseofds.com

Discover more from Daily Dose of Data Science

High-quality insights on Data Science and Python, along with best practices — shared daily. Get a 550+ Page Data Science PDF Guide and 450+ Practice Questions Notebook, FREE.
Over 36,000 subscribers
Continue reading
Sign in

Automated EDA Tools That Let You Avoid Manual EDA Tasks

8 automated EDA tools in a single frame.

Avi Chawla
Jul 9, 2023
43
Share this post

Automated EDA Tools That Let You Avoid Manual EDA Tasks

www.blog.dailydoseofds.com
Share

EDA is a vital step in all data science projects.

It is important because examining and understanding the data directly aids the modeling stage.

EDA overview

By uncovering hidden insights and patterns, one can make informed decisions about subsequent steps in the project.

Despite its importance, it is often a time-consuming and tedious task.

The above visual summarizes 8 powerful EDA tools, that automate many redundant steps of EDA and help you profile your data in quick time.

  • SweetViz

    SweetViz
    • Creates a variety of data visualizations.

    • Covers information about missing values, data statistics, etc.

    • Integrates with Jupyter Notebook.

    • Get started: GitHub.

  • Pandas-profiling

    Pandas-profiling
    • Covers info about missing values, data statistics, correlation, etc.

    • Produces data alerts.

    • Plots data feature interactions.

    • Get started: GitHub.

  • DataPrep

    DataPrep
    • Produces interactive visualizations.

    • Typically faster than other common tools.

    • Supports Pandas and Dask DataFrames.

    • Covers info about missing values, data statistics, correlation, etc.

    • Plots data feature interactions.

    • Get started: GitHub.

  • AutoViz

    AutoViz
    • Supports CSV, TXT, and JSON.

    • Interactive Bokeh charts.

    • Covers info about missing values, data statistics, correlation, etc.

    • Presents data cleaning suggestions.

    • Get started: GitHub.

  • D-Tale

    D-Tale
    • Allows you to run many common Pandas operations with no code.

    • Exports code of analysis.

    • Integrates with Jupyter Notebook.

    • Covers info about missing values, data statistics, correlation, etc.

    • Highlights duplicates, outliers, etc.

    • Get started: GitHub.

  • dabl

    dabl
    • Primarily provides visualizations.

    • Covers a wide range of plots:

      • Target distribution.

      • Scatter pair plots.

      • Histograms.

    • Get started: GitHub.

  • QuickDA

    QuickDA
    • Get an overview report of the dataset.

    • Covers info about missing values, data statistics, correlation, etc.

    • Produces data alerts.

    • Plots data feature interactions.

    • Get started: GitHub.

  • Lux

    Lux
    • Integrates with Jupyter Notebook.

    • Provides visualization recommendations.

    • Supports EDA on a subset of columns.

    • Get started: GitHub.

👉 Over to you: What are some other automated EDA tools that you are aware of?

👉 If you liked this post, don’t forget to leave a like ❤️. It helps more people discover this newsletter on Substack and tells me that you appreciate reading these daily insights. The button is located towards the bottom of this email.


👉 Read what others are saying about this post on LinkedIn and Twitter.

👉 Tell the world what makes this newsletter special for you by leaving a review here :)

Review Daily Dose of Data Science

👉 If you love reading this newsletter, feel free to share it with friends!

Share Daily Dose of Data Science

👉 Sponsor the Daily Dose of Data Science Newsletter. More info here: Sponsorship details.


Find the code for my tips here: GitHub.

I like to explore, experiment and write about data science concepts and tools. You can read my articles on Medium. Also, you can connect with me on LinkedIn and Twitter.

43
Share this post

Automated EDA Tools That Let You Avoid Manual EDA Tasks

www.blog.dailydoseofds.com
Share
Previous
Next
Comments
Top
New
Community

No posts

Ready for more?

© 2023 Avi Chawla
Privacy ∙ Terms ∙ Collection notice
Start WritingGet the app
Substack is the home for great writing