Processing and visualizing the data in tweets
Author(s)
Marcus, Adam; Bernstein, Michael S.; Badar, Osama; Karger, David R.; Madden, Samuel R.; Miller, Robert C.; ... Show more Show less
Downloadtwitter.pdf (450.1Kb)
OPEN_ACCESS_POLICY
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
Microblogs such as Twitter provide a valuable stream of diverse user-generated data. While the data extracted from Twitter is generally timely and accurate, the process by which developers extract structured data from the tweet stream is ad-hoc and requires reimplementation of common data manipulation primitives. In this paper, we present two systems for querying and extracting structure from Twitter-embedded data. The first, TweeQL, provides a streaming SQL-like interface to the Twitter API, making common tweet processing tasks simpler. The second, TwitInfo, shows how end-users can interact with and understand aggregated data from the tweet stream, in addition to showcasing the power of the TweeQL language. Together these systems show the richness of content that can be extracted from Twitter.
Date issued
2011-12Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer ScienceJournal
ACM SIGMOD Record
Publisher
Association for Computing Machinery
Citation
Marcus, Adam, Michael S. Bernstein, Osama Badar, David R. Karger, Samuel Madden, and Robert C. Miller. Processing and Visualizing the Data in Tweets. ACM SIGMOD Record 40, no. 4 (January 11, 2012): 21.
Version: Author's final manuscript
ISSN
01635808
1943-5835