Research: A Study of “Churn” in Tweets and Real-Time Search Queries (Extended Version)
Applicability: “A Study of “Churn” in Tweets and Real-Time Search Queries (Extended Version)” offers unique insight into the temporal dynamics of term distribution which may hold implications the design of search systems. As the growing importance of real-time search brings with it several information retrieval challenges; this paper frames one such challenge, that of rapid changes to term distributions, particularly for queries.
Abstract: The real-time nature of Twitter means that term distributions in tweets and in search queries change rapidly: the most frequent terms in one hour may look very different from those in the next. Informally, we call this phenomenon “churn”. Our interest in analyzing churn stems from the perspective of real-time search. Nearly all ranking functions, machine-learned or otherwise, depend on term statistics such as term frequency, document frequency, as well as query frequencies. In the real-time context, how do we compute these statistics, considering that the underlying distributions change rapidly? In this paper, we present an analysis of tweet and query churn on Twitter, as a first step to answering this question. Analyses reveal interesting insights on the temporal dynamics of term distributions on Twitter and hold implications for the design of search systems.
Analysis: Summarized analysis from this paper includes observations on:
Authors: Prepared by Jimmy Lin and Gilad Misne of Twitter, Inc., “A Study of “Churn” in Tweets and Real-Time Search Queries (Extended Version)” is a prepared paper submitted and accepted by the 6th International AAAI Conference on Weblogs and Social Media (ICWSM 2012).
This entry was posted on Tuesday, June 5th, 2012 at 2:39 pm. It is filed under chronology, discover and tagged with research, social media. You can follow any responses to this entry through the RSS 2.0 feed.
Comments are closed.
Providing timely articles, expert insight, and industry research, the Weekly eDiscovery News Update is the trusted source for relevant eDiscovery, corporate risk and vendor news and views for legal and technology professionals. Sign up today.
Taken from a combination of public market sizing estimations as shared in leading electronic discovery reports, publications and posts over time, the following eDiscovery Market Size Mashup shares general worldwide market sizing considerations for both the software and service areas of the electronic discovery market for the years between 2012 and 2017.
The gathering and use of information to help achieve personal and professional objectives has been a task executed by individuals and organizations from the beginning of time. However, with the advent of tools and technologies that can greatly accelerate this gathering and use of information, it is increasingly important that one considers not only the positive things that can be accomplished from the greater understanding derived from increased information access, but also considers the potential dark side usage of this increased information access.
Just as there are many tasks in electronic discovery, many times there are multiple technologies and platforms involved in the complete electronic discovery process. When there are multiple technologies and platforms involved, data must be transferred from disparate technologies and/or platforms to other disparate technologies and/or platforms. This data transfer can be considered a risk factor that affects the overall electronic discovery process.
In today’s “sound-bite” environment in which professional organizations compete for client attention through a variety of conduits and communications, it is increasingly important for marketing and sales leaders to consider and coordinate the use of all communications and communications tools in order to maximize impact and influence on potential clients.
Beginning in early 2012 the topic of Technology-Assisted Review moved from expert-led explanations to mainstream mentions in legal community articles, opinions, surveys and reports. Provided for your research, review and consideration are a compilation of key headlines and links from online sources on the topic of Technology-Assisted Review from February, 2012, until now.
Updated: 9/16/2013 – Provided for your consideration and use are the in-progress results of the One-Question Provider Implementation Survey launched by ComplexDiscovery on 3/3/13. The results consist of survey answers harvested directly from the online survey form as completed by provider representatives.
Updated 7/23/2013: Provided for your consideration and use are the in-progress results of the Predictive Coding and Provider Survey launched by ComplexDiscovery on 2/10/13. The in-progress results consist of survey answers harvested directly from the online survey form as completed by provider representatives.
Based on a website review of leading providers in the electronic discovery arena, the following list provides a quick, non-all inclusive reference of firms that appear to have developed “technology assisted review” technology (one form of this being “predictive coding”) for their own and/or partner offerings.