Daily Dose of Data Science
Subscribe
Sign in
Home
Archive
About
Run-Time Optimization
New
Top
Community
Improve Python Run-time Without Changing A Single Line of Code
Switching to a smarter implementation of CPython.
Avi Chawla
May 20
16
Share this post
Improve Python Run-time Without Changing A Single Line of Code
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
Most Sklearn Users Don't Know This About Its LinearRegression Implementation
Always review what a specific implementation is hiding underneath.
May 17
6
Share this post
Most Sklearn Users Don't Know This About Its LinearRegression Implementation
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
The Most Common Misconception About Inplace Operations in Pandas
...and here's what happens in reality.
Avi Chawla
Apr 27
17
2
Share this post
The Most Common Misconception About Inplace Operations in Pandas
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
Avoid This Costly Mistake When Indexing A DataFrame
Row-then-column is not the same as Column-then-row.
Avi Chawla
Apr 22
17
2
Share this post
Avoid This Costly Mistake When Indexing A DataFrame
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
Breathing KMeans: A Better and Faster Alternative to KMeans
Addressing limitations of KMeans.
Avi Chawla
Apr 19
38
5
Share this post
Breathing KMeans: A Better and Faster Alternative to KMeans
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
Why You Should Avoid Appending Rows To A DataFrame
...and what happens if you do.
Avi Chawla
Mar 30
8
1
Share this post
Why You Should Avoid Appending Rows To A DataFrame
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
Probably The Fastest Way To Execute Your Python Code
So the best way to speedup python is by not using python?
Avi Chawla
Mar 27
22
2
Share this post
Probably The Fastest Way To Execute Your Python Code
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
If You Are Not Able To Code A Vectorized Approach, Try This.
A simple tweak to improve iteration run-time over a DataFrame.
Avi Chawla
Mar 12
9
Share this post
If You Are Not Able To Code A Vectorized Approach, Try This.
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
Why Are We Typically Advised To Never Iterate Over A DataFrame?
The core reason which very few told you about.
Avi Chawla
Mar 11
7
Share this post
Why Are We Typically Advised To Never Iterate Over A DataFrame?
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
This Small Tweak Can Significantly Boost The Run-time of KMeans
KMeans++: KMeans with a smarter centroid initialization approach.
Avi Chawla
Mar 9
8
Share this post
This Small Tweak Can Significantly Boost The Run-time of KMeans
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
Speedup NumPy Methods 25x With Bottleneck
NumPy's methods are already highly optimized for performance. Yer, here's how you can further speed them up. Bottleneck provides a suite of optimized…
Avi Chawla
Feb 18
1
Share this post
Speedup NumPy Methods 25x With Bottleneck
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
Speed-up Parquet I/O of Pandas by 5x
Dataframes are often stored in parquet files and read using Pandas' 𝐫𝐞𝐚𝐝_𝐩𝐚𝐫𝐪𝐮𝐞𝐭() method. Rather than using Pandas, which relies on a…
Avi Chawla
Feb 13
6
Share this post
Speed-up Parquet I/O of Pandas by 5x
www.blog.dailydoseofds.com
Copy link
Linkedin
Facebook
Email
Notes
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts