Library of Congress Compiling Huge Twitter Archive
The U.S. Library of Congress has completed its archive of every Twitter post during the first four years following the site's launch. Unfortunately, making that archive useful has proven extremely difficult.
Back in April 2010, the Library of Congress signed a deal to archive Twitter posts, known as 'tweets.' The Library's Gayle Osterberg says those tweets are an important and valid research source.
"As society turns to social media as a primary method of communication and creative expression, social media is supplementing, and in some cases supplanting, letters, journals, serial publications and other sources routinely collected by research libraries," Osterberg said. (Source: loc.gov)
Under the terms of the deal, Twitter agreed to provide all tweets, both past and present, to the Library for archiving. The Library would then be allowed to make any tweet more than six months old available to legitimate researchers.
However, the deal does not allow the Library of Congress to make the Twitter archive available online for download by the general public.
170 Billion Tweets Already Archived
Collating and organizing those tweets by date has turned out to be a major technological challenge. The Library has only just completed archiving those first four years of tweets, totaling approximately 170 billion messages.
That leaves the Library archive nearly three years behind the current activity on Twitter. Furthermore, the number of tweets posted daily has more than tripled since the Library of Congress project began.
Osterberg says the Library has already received more than 400 requests for access to the archive. Those interested in the Twitter archive include researchers investigating topics ranging from vaccination of diseases to stock market activity.
Unfortunately, the Library hasn't been able to fulfill any of these requests, as it's still trying to figure out how to make the data available in a useful manner.
Officials have taken the approach of breaking the archive into individual files, each covering one hour's worth of Twitter posts from around the world.
Single Search Could Take 24 Hours
The problem now is that simply searching through each file, one at a time, would be extremely time consuming. Officials believe that, as things stand, a single search could require about 24 hours to complete. (Source: digitaltrends.com)
The most practical solution would be to provide hundreds or even thousands of machines for use by searchers. That way, a single search request could be done on multiple archive files simultaneously.
Insiders believe such a tactic could help speed up the research process. Sadly, the Library claims that strategy "is cost-prohibitive and impractical for a public institution."
Officials say they are now looking at ways to partner with private firms, as a way to use outside technology or financing to make archive access faster and more efficient.
Most popular articles
- Which Processor is Better: Intel or AMD? - Explained
- How to Prevent Ransomware in 2018 - 10 Steps
- 5 Best Anti Ransomware Software Free
- How to Fix: Computer / Network Infected with Ransomware (10 Steps)
- How to Fix: Your Computer is Infected, Call This Number (Scam)
- Scammed by Informatico Experts? Here's What to Do
- Scammed by Smart PC Experts? Here's What to Do
- Scammed by Right PC Experts? Here's What to Do
- Scammed by PC / Web Network Experts? Here's What to Do
- How to Fix: Windows Update Won't Update
- Explained: Do I need a VPN? Are VPNs Safe for Online Banking?
- Explained: VPN vs Proxy; What's the Difference?
- Explained: Difference Between VPN Server and VPN (Service)
- Forgot Password? How to: Reset Any Password: Windows Vista, 7, 8, 10
- How to: Use a Firewall to Block Full Screen Ads on Android
- Explained: Absolute Best way to Limit Data on Android
- Explained: Difference Between Dark Web, Deep Net, Darknet and More
- Explained: If I Reset Windows 10 will it Remove Malware?
My name is Dennis Faas and I am a senior systems administrator and IT technical analyst specializing in cyber crimes (sextortion / blackmail / tech support scams) with over 30 years experience; I also run this website! If you need technical assistance , I can help. Click here to email me now; optionally, you can review my resume here. You can also read how I can fix your computer over the Internet (also includes user reviews).
We are BBB Accredited
We are BBB accredited (A+ rating), celebrating 21 years of excellence! Click to view our rating on the BBB.