Welcome to DU! The truly grassroots left-of-center political community where regular people, not algorithms, drive the discussions and set the standards. Join the community: Create a free account Support DU (and get rid of ads!): Become a Star Member Latest Breaking News General Discussion The DU Lounge All Forums Issue Forums Culture Forums Alliance Forums Region Forums Support Forums Help & Search

Jim__

(14,088 posts)
Wed Dec 8, 2021, 06:00 PM Dec 2021

The paradox of big data spoils vaccination surveys

Turns out that big data is often poor quality data.

From phys.org:





When Delphi-Facebook and the U.S. Census Bureau provided near-real time estimates of COVID-19 vaccine uptake last spring, their weekly surveys drew on responses from as many as 250,000 people.

The large data sets provided statistically tiny margins of error, a key measure of a poll's accuracy, and raised confidence that the numbers were correct. But when the Centers for Disease Control and Prevention later provided figures of actual reported vaccination rates, the two polls were off—by a lot. By the end of May, the Delphi-Facebook study overestimated vaccine uptake by 17 percentage points—70 percent versus 53 percent, according to the CDC—and the Census Bureau's Household Pulse Survey did the same by 14 percentage points.

A comparative analysis by statisticians and political scientists from Harvard, Oxford, and Stanford universities concludes that the surveys fell victim to the "Big Data Paradox," the mathematical tendency of big data sets to minimize one type of error—that due to small sample size—but to magnify another that tends to get lesser attention: errors due to systematic biases that make the surveyed sample a poor representation of the larger population.

The "Big Data Paradox" was identified and coined by one of the study's authors, Harvard statistician Xiao-Li Meng, the Whipple V.N. Jones Professor of Statistics, in his 2018 analysis of polling during the 2016 presidential election. Famous for predicting a Hillary Clinton presidency, those election polls were skewed by what is termed "nonresponse bias," which in this case was the tendency of Trump voters to either not respond or define themselves as "undecided."

more ...




1 replies = new reply since forum marked as read
Highlight: NoneDon't highlight anything 5 newestHighlight 5 most recent replies
The paradox of big data spoils vaccination surveys (Original Post) Jim__ Dec 2021 OP
Multiply By Really Poor State Healthcare Systems modrepub Dec 2021 #1

modrepub

(3,503 posts)
1. Multiply By Really Poor State Healthcare Systems
Wed Dec 8, 2021, 08:04 PM
Dec 2021

My state's Health Department computers are hopelessly outdated. The state legislature sees no need to have an accurate accounting system. That would only emphasize problems; no data, no problem.

It's scary that COVID is a relatively benign virus with a death rate of only a few percent. Imagine our already porous healthcare system trying to function with something that was highly contagious and very deadly. I can't imagine how damaging the Black Death was during the middle ages. If something like that broke out today, I think our society would be shattered and sent back centuries.

Latest Discussions»Issue Forums»Editorials & Other Articles»The paradox of big data s...