Determining the Accuracy of Crowdsourced Tweet Verification for Auroral Research

Authors

  • Nathan A. Case NASA Goddard Space Flight Center and New Mexico Consortium, US, and Department of Physics, Lancaster University https://orcid.org/0000-0003-0692-1778
  • Elizabeth A. MacDonald NASA Goddard Space Flight Center and New Mexico Consortium
  • Sean McCloat New Mexico Consortium and University of North Dakota
  • Nick Lalone College of Information Sciences and Technology, Pennsylvania State University
  • Andrea Tapia College of Information Sciences and Technology, Pennsylvania State University

DOI:

https://doi.org/10.5334/cstp.52

Keywords:

twitter, crowdsourcing, aurora, sightings, citizen science

Abstract

The Aurorasaurus project harnesses volunteer crowdsourcing to identify sightings of an aurora (the “northern/southern lights”) posted by citizen scientists on Twitter. Previous studies have demonstrated that aurora sightings can be mined from Twitter with the caveat that there is a large background level of non-sighting tweets, especially during periods of low auroral activity. Aurorasaurus attempts to mitigate this, and thus increase the quality of its Twitter sighting data, by using volunteers to sift through a pre-filtered list of geolocated tweets to verify real-time aurora sightings. In this study, the current implementation of this crowdsourced verification system, including the process of geolocating tweets, is described and its accuracy (which, overall, is found to be 68.4%) is determined. The findings suggest that citizen science volunteers are able to accurately filter out unrelated, spam-like, Twitter data but struggle when filtering out somewhat related, yet undesired, data. The citizen scientists particularly struggle with determining the real-time nature of the sightings, so care must be taken when relying on crowdsourced identification.

Downloads

Published

2016-12-21

Issue

Section

Case Studies