Discover more from Daily Dose of Data Science
Define the Correct DataType for Categorical Columns
If your data has categorical columns, you should not represent them as int/string data type.
Rather, Pandas provides an optimized data type specifically for categorical columns. This is especially handy when you are working with large data-driven projects.
The snippet compares the memory usage of string and categorical data types in Pandas.
Thanks for reading Daily Dose of Data Science! Subscribe for free to learn something new and insightful about Python and Data Science every day. Also, get a Free Data Science PDF (250+ pages) with 200+ tips.