Data Analysis

Don't print raw DataFrames.
Embedding a size component in a heatmap.
...in a single frame
Supercharge the describe method in Pandas. Skimpy is a lightweight tool for summarizing Pandas dataframes. In a single line of code, it generates a…
Summarytools is a simple EDA tool that gives a richer summary than the describe() method. In a single line of code, it generates a standardized and…
Please watch a video version of this post for better understanding: Link. This is indeed one of the coolest and most useful Jupyter notebook-based data…
Most steps in a data analysis task stay the same across projects. Yet, manually digging into the data is tedious and time-consuming, which inhibits…
If you want to analyze your dataframe in a GUI-based application, try Pandas GUI. It provides an elegant GUI for viewing, filtering, sorting, describing…
The preliminary steps of any typical EDA task are often the same. Yet, across projects, we tend to write the same code to carry out these tasks. This…
The value_counts() method is commonly used to analyze categorical columns, but it has many limitations. For instance, if one wants to view…
Grouping, pivoting, and plotting are fundamental to almost every tabular data analysis task. Yet, across projects, we often write the same code over and…
Jupyter is a web-based IDE. Thus, whenever you print/display a DataFrame in Jupyter, it is rendered using HTML and CSS. This means you can style your…
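To make the HTML-and-CSS point concrete, here is a minimal sketch and not the post's own code. It assumes only that pandas is installed. The sample data, the "report" class name, and the CSS rule are made up for illustration.

```python
import pandas as pd

# Hypothetical sample data, for illustration only.
df = pd.DataFrame({"city": ["Oslo", "Lima"], "temp_c": [4.5, 19.0]})

# Jupyter looks for a _repr_html_ method on the object it displays;
# pandas implements it to return an HTML <table> as a string.
default_html = df._repr_html_()

# to_html lets us attach our own CSS class to the table...
table_html = df.to_html(classes="report", index=False)

# ...which an ordinary CSS rule can then target.
css = "<style>.report td { text-align: right; }</style>"
styled_html = css + table_html
```

In a notebook, passing styled_html to IPython.display.HTML would render the table with the rule applied. pandas also ships a Styler API (df.style) for the same purpose.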