The 201 Buckets Problem, Part 2: How Bad Estimates Backfire As Your Data Grows


In the last post, I talked about how we don’t get accurate estimates because SQL Server’s statistics only have up to 201 buckets in the histogram. It didn’t matter much in that post, though, because we were using the small StackOverflow2010 database.

But what happens as our data grows? Let’s move to a newer Stack Overflow database, the 2018-06 one that I use for my Mastering training classes. We’ll create the same index, giving us the same statistics, and then look at the histogram to see how Miami grew over time:

USE StackOverflow; GO CREATE INDEX Location ON dbo.Users(Location); GO SELECT