Memory Optimization

Out-of-core learning with sklearn
By default, Pandas always assigns the highest memory datatype to its columns. For instance, an integer-valued column always gets the int64 datatype…
If your data has NaN-valued columns, Pandas provides a datatype specifically for representing them - called the Sparse datatype. This is especially…
The CSV file format is widely used to save Pandas DataFrames. But are you aware of its limitations? To name a few, 1. The CSV does not store the…
If you use large static iterables in Python, a list may not be an optimal choice, especially in memory-constrained applications. A list stores the…
If your data has categorical columns, you should not represent them as int/string data type. Rather, Pandas provides an optimized data type specifically…