Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ CardiffĀ 
WelshClear Cookie - decide language by browser settings

Knowing the Tweeters: Deriving sociologically relevant demographics from Twitter

Sloan, Luke, Morgan, Jeffrey, Housley, William, Williams, Matthew Leighton, Edwards, Adam Michael, Burnap, Peter and Rana, Omer Farooq 2013. Knowing the Tweeters: Deriving sociologically relevant demographics from Twitter. Sociological Research Online 18 (3) , 7. 10.5153/sro.3001

[img]
Preview
PDF - Submitted Pre-Print Version
Download (21MB) | Preview

Abstract

A perennial criticism regarding the use of social media in social science research is the lack of demographic information associated with naturally occurring mediated data such as that produced by Twitter. However the fact that demographics information is not explicit does not mean that it is not implicitly present. Utilising the Cardiff Online Social Media ObServatory (COSMOS) this paper suggests various techniques for establishing or estimating demographic data from a sample of more than 113 million Twitter users collected during July 2012. We discuss in detail the methods that can be used for identifying gender and language and illustrate that the proportion of males and females using Twitter in the UK reflects the gender balance observed in the 2011 Census. We also expand on the three types of geographical information that can be derived from Tweets either directly or by proxy and how spatial information can be used to link social media with official curated data. Whilst we make no grand claims about the representative nature of Twitter users in relation to the wider UK population, the derivation of demographic data demonstrates the potential of new social media (NSM) for the social sciences. We consider this paper a clarion call and hope that other researchers test the methods we suggest and develop them further.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Cardiff Centre for Crime, Law and Justice (CCLJ)
Computer Science & Informatics
Social Sciences (Includes Criminology and Education)
Subjects: H Social Sciences > HT Communities. Classes. Races
Q Science > QA Mathematics > QA76 Computer software
Uncontrolled Keywords: New Social Media, Demographics, Twitter, Social Media Analytics, Social Science, Sampling
Publisher: Sociological Research Online
ISSN: 1360-7804
Funders: ESRC
Last Modified: 18 Dec 2016 04:04
URI: http://orca-mwe.cf.ac.uk/id/eprint/49152

Citation Data

Cited 12 times in Google Scholar. View in Google Scholar

Cited 20 times in Scopus. View in Scopus. Powered By ScopusĀ® Data

Cited 6 times in Web of Science. View in Web of Science.

Actions (repository staff only)

Edit Item Edit Item

Full Text Downloads from ORCA for this publication

Top Downloads of this item by Country

Monthly Full Text Downloads of this item

More statistics for this item...