What to track on the Twitter streaming API?: a knapsack bandits approach to dynamically update the search terms

Kumar, S and Carley, K M (2019) What to track on the Twitter streaming API?: a knapsack bandits approach to dynamically update the search terms. In: ASONAM '19: International Conference on Advances in Social Networks Analysis and Mining.

Full text not available from this repository. (Request a copy)

Abstract

We use Twitter streaming API for many purposes like monitoring brands and discovering events. Because Twitter Streaming API only allows tracking words (commonly called 'search-terms'), the data collection goal needs to be formulated in terms of search terms. Twitter limits the number of search terms that can be tracked using the API, and the number of tweets retrieved per search-term depends on the terms being tracked. Therefore it's crucial to use a small set of highly relevant terms for tracking.

Because social media is very dynamic and conversations evolve fast, the search terms that are relevant now might be less useful in as short of time as an hour. Manual monitoring of such discussions to update the search terms is cumbersome, error-prone and expensive. Can we have an algorithm to update the search terms based on the goals of the dataset collection? Taking inspiration from the knapsack bandits problem that effectively handle exploration (new search terms to explore) and exploitation (keep using useful search terms) when resources (network bandwidth, disk capacity or number of search terms) are constrained, we propose a new approach to dynamically update the search terms based on the goals of the data collection.

Item Type: Conference or Workshop Item (Paper)
Additional Information: The research article was published by the author with the affiliation of Carnegie Mellon University
Subjects: Information Systems
Date Deposited: 10 Sep 2023 18:12
Last Modified: 10 Sep 2023 18:12
URI: https://eprints.exchange.isb.edu/id/eprint/2135

Actions (login required)

View Item
View Item