

Discover more from Daily Dose of Data Science
High-quality insights on Data Science and Python, along with best practices — shared daily.
Get a free 550+ page data science PDF guide and 450+ practice questions notebook.
Over 49,000 subscribers
Continue reading
Don't Create Conditional Columns in Pandas with Apply
While creating conditional columns in Pandas, we tend to use the 𝐚𝐩𝐩𝐥𝐲() method almost all the time.
However, 𝐚𝐩𝐩𝐥𝐲() in Pandas is nothing but a glorified for-loop. As a result, it misses the whole point of vectorization.
Instead, you should use the 𝐧𝐩.𝐰𝐡𝐞𝐫𝐞() method to create conditional columns. It does the same job but is extremely fast.
The condition is passed as the first argument. This is followed by the result if the condition evaluates to True (second argument) and False (third argument).
Read more here: NumPy docs.
I like to explore, experiment and write about data science concepts and tools. You could connect with me on LinkedIn.