
Coyotl

(15,262 posts)
Sat Jan 5, 2013, 10:10 AM

170 BILLION tweets = Library Of Congress Twitter Archive Nears Finish, Remains Unusable

... it takes the system over 24 hours to execute a search of a single keyword.

Library Of Congress Twitter Archive Nears Finish, Remains Unusable
http://idealab.talkingpointsmemo.com/2013/01/library-of-congress-twitter-archive-nearly-done-just-unusable.php
Carl Franzen, January 4, 2013, 12:16 PM


Almost four years after the project was first announced, the Library of Congress on Friday announced that it expects by the end of January to finish a research archive of all the tweets publicly posted on Twitter since the service launched in 2006. The archive will remain unusable for the foreseeable future, however, due to technical challenges the agency said it encountered during the course of the project.

Specifically, the Library of Congress (LOC) wrote in a white paper (PDF) published online Friday that to date it has amassed an archive of 170 billion tweets and that it has almost completed its initial objectives, which include creating a chronological archive of tweets between 2006 and 2010 in addition to a separate archive of every tweet since then.

“This month, all those objectives will be completed,” the LOC’s white paper states.

But the LOC is still struggling with “technology challenges to making the archive accessible to researchers and policymakers,” specifically the fact that currently, with the archive of just all of the older tweets, it takes the system over 24 hours to execute a search of a single keyword. ..........
170 BILLION tweets = Library Of Congress Twitter Archive Nears Finish, Remains Unusable (Original Post) Coyotl Jan 2013 OP
Will Facebook posting also be archived??? n/t patricia92243 Jan 2013 #1
Of whose Tweets? Every Twitter account user...? Earth_First Jan 2013 #2
It's a shame no one is recording every World of Warcraft move jsr Jan 2013 #3
LOL Aerows Jan 2013 #4

Earth_First

(14,910 posts)
2. Of whose Tweets? Every Twitter account user...?
Sat Jan 5, 2013, 10:56 AM

Talk about creepy.

Was this made possible by an NSA arts grant?

jsr

(7,712 posts)
3. It's a shame no one is recording every World of Warcraft move
Sat Jan 5, 2013, 11:03 AM

and archiving them for future historians.

 

Aerows

(39,961 posts)
4. LOL
Sat Jan 5, 2013, 11:06 AM

"Raid strategies of the poor and lacking a life". (Note I used to raid in WoW so I have some knowledge about this characterization).
