Understanding public perception of coronavirus disease 2019 (COVID-19) social distancing on Twitter

Sameh N. Saleh; Christoph U. Lehmann; Samuel A. McDonald; Mujeeb A. Basit; Richard J. Medford

doi:10.1017/ice.2020.406

Understanding public perception of coronavirus disease 2019 (COVID-19) social distancing on Twitter

Part of: Highly Cited Papers SARS-CoV-2/COVID-19

Published online by Cambridge University Press: 06 August 2020

Sameh N. Saleh

Christoph U. Lehmann ,

Samuel A. McDonald ,

Mujeeb A. Basit and

Richard J. Medford

Show author details

Sameh N. Saleh*: Affiliation:
Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas Clinical Informatics Center, University of Texas Southwestern Medical Center, Dallas, Texas
Christoph U. Lehmann: Affiliation:
Clinical Informatics Center, University of Texas Southwestern Medical Center, Dallas, Texas Department of Pediatrics, University of Texas Southwestern Medical Center, Dallas, Texas
Samuel A. McDonald: Affiliation:
Clinical Informatics Center, University of Texas Southwestern Medical Center, Dallas, Texas Department of Emergency Medicine, University of Texas Southwestern Medical Center, Dallas, Texas
Mujeeb A. Basit: Affiliation:
Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas Clinical Informatics Center, University of Texas Southwestern Medical Center, Dallas, Texas
Richard J. Medford: Affiliation:
Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas Clinical Informatics Center, University of Texas Southwestern Medical Center, Dallas, Texas
*: Author for correspondence: Sameh N. Saleh, E-mail: sameh.n.saleh@gmail.com

Article contents

Abstract
Objective:
Design:
Methods:
Results:
Conclusions:
Methods
Results
Discussion
Financial support
Conflicts of interest
Supplementary material
References

Rights & Permissions

Abstract

Objective:

Social distancing policies are key in curtailing severe acute respiratory coronavirus virus 2 (SARS-CoV-2) spread, but their effectiveness is heavily contingent on public understanding and collective adherence. We studied public perception of social distancing through organic, large-scale discussion on Twitter.

Design:

Retrospective cross-sectional study.

Methods:

Between March 27 and April 10, 2020, we retrieved English-only tweets matching two trending social distancing hashtags, #socialdistancing and #stayathome. We analyzed the tweets using natural language processing and machine-learning models, and we conducted a sentiment analysis to identify emotions and polarity. We evaluated the subjectivity of tweets and estimated the frequency of discussion of social distancing rules. We then identified clusters of discussion using topic modeling and associated sentiments.

Results:

We studied a sample of 574,903 tweets. For both hashtags, polarity was positive (mean, 0.148; SD, 0.290); only 15% of tweets had negative polarity. Tweets were more likely to be objective (median, 0.40; IQR, 0–0.6) with ~30% of tweets labeled as completely objective (labeled as 0 in range from 0 to 1). Approximately half of tweets (50.4%) primarily expressed joy and one-fifth expressed fear and surprise. Each correlated well with topic clusters identified by frequency including leisure and community support (ie, joy), concerns about food insecurity and quarantine effects (ie, fear), and unpredictability of coronavirus disease 2019 (COVID-19) and its implications (ie, surprise).

Conclusions:

Considering the positive sentiment, preponderance of objective tweets, and topics supporting coping mechanisms, we concluded that Twitter users generally supported social distancing in the early stages of their implementation.

Type: Original Article
Information: Infection Control & Hospital Epidemiology , Volume 42 , Issue 2 , February 2021 , pp. 131 - 138

DOI: https://doi.org/10.1017/ice.2020.406 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: © 2020 by The Society for Healthcare Epidemiology of America. All rights reserved.

On March 11, 2020, the World Health Organization (WHO) declared the novel coronavirus (COVID-19) outbreak a pandemic and emphasized the need for global governmental commitment to control the threat, citing then 118,319 confirmed cases and 4,292 deaths worldwide.¹ To contain severe acute respiratory coronavirus virus 2 (SARS-CoV-2), countries closed their international borders.^{Reference Ratcliffe2} Despite travel restrictions, global cases continued to increase³ requiring enactment of key community mitigation, which garnered significant public attention.^{Reference Glanz, Carey and Holder4,Reference Coppins5} These mitigation strategies, named nonpharmaceutical interventions (NPIs), are approaches outside medications, therapies, and vaccines to prevent further spread of SARS-CoV-2 and to reduce the strain on the healthcare system. NPIs fall under 3 main categories: personal, environmental, and community. Personal NPIs refer to behaviors like staying home when sick, coughing or sneezing in a tissue or elbow, wearing a mask, and washing hands with soap and water or using hand sanitizer. Environmental NPIs refer to appropriate surface cleaning of high-throughput areas and commonly used objects. Community NPIs refer to social distancing and closure of areas where large gatherings may occur, such as schools, businesses, parks, and sporting events. Used previously for other viral outbreaks such as influenza,^{Reference Ahmed, Zviedrite and Uzicanin6} social distancing or physical distancing refers to increasing the space between individuals and avoidance of larger gatherings in an attempt to reduce viral transmission.⁷ This community NPI has been a main components of effectively fighting the COVID-19 pandemic.^{Reference Ngonghala, Iboi and Eikenberry8–Reference Qamar10}

Managing and changing public opinion and behavior are vital for social distancing to successfully slow transmission of COVID-19, preserve hospital resources, and prevent exceeding the healthcare system’s capacity.¹¹ To affect public opinion, one must first examine and understand it. Social media, specifically its microblogging platform Twitter, serves as an ideal medium to provide this understanding. Twitter has >145 million daily active users¹² and allows individuals to post, repost, like, and comment on ‘tweets’ of up to 280 characters. Analysis of Twitter has been used previously within the healthcare realm to understand public sentiment and opinion on topics ranging from diabetes,^{Reference Gabarron, Dorronzoro, Rivera-Romero and Wynn13} cancer therapy,^{Reference Zhang, Hall and Bastola14} and novel healthcare policies such as the Affordable Care Act.^{Reference Davis, Zheng, Liu and Levy15} Within the field of emerging infectious diseases, Twitter analysis has been used to study public opinion and sentiment on measles,^{Reference Mollema, Harmsen and Broekhuizen16} influenza,^{Reference Chew and Eysenbach17} and Zika virus outbreaks.^{Reference Mamidi, Miller, Banerjee, Romine and Sheth18}

We hypothesized that performing sentiment, emotion, and content analysis of tweets related to social distancing on Twitter during the COVID-19 pandemic could provide valuable insight into the public’s beliefs and opinions on this policy. We further hypothesized that the knowledge gained could prove valuable for public health communication as well as dissemination and refinement of information strategies.

Methods

Data collection and processing

From March 27 to April 10, 2020, we extracted daily relevant samples¹⁹ of English-only tweets related to social distancing and created a 2-week cross-sectional data set of social media activity. We used the rtweet package^{Reference Kearney20} to access Twitter’s application programming interface (API) via RStudio version 1.2.1335 (R Foundation for Statistical Computing, Vienna, Austria). The hashtags #socialdistancing and #stayathome, which were the top trending social distancing hashtags at the time of data extraction, were used to identify tweets related to social distancing. We used 15 of the 89 collected tweet metadata variables in our analysis (Table S1 online). We cleaned the tweets by removing characters and words of no or little analytical value and transforming text to its root form. We used Python version 3.6.1 software (Python Software Foundation, Wilmington, DE) for all data processing and analyses. Further details are discussed in Appendix A (online). Institutional review board approval was not required because this study used only publicly available data.

Sentiment and emotion analysis

We used Python’s TextBlob library^{Reference Loria21} to perform sentiment analysis for all tweets through natural language processing and text analysis to identify and classify emotions (positive, negative, or neutral) and content topics. TextBlob applies the AFINN sentiment lexicon^{Reference Nielsen22} from a polarity scale of −1 (most negative) to 1 (most positive). We visualized the polarity distribution using bins for strongly negative (−1 to −0.51), negative (−0.5 to −0.01), neutral (0), positive (0.01 to 0.5), and strongly positive (0.51 to 1). We used a recurrent neural network model developed by Colneric and Demsar to label the primary emotion for each tweet based on Ekman’s emotional classification (anger, disgust, fear, joy, sadness, or surprise).^{Reference Colneric and Demsar23} Using χ² testing and Bonferroni correction to adjust for multiple comparisons, we compared the proportion of each sentiment polarity and emotion for each hashtag. We evaluated changes in effect size between hashtags using the absolute difference in percentage points.

Subjectivity analysis

We used Python’s TextBlob library to perform subjectivity analysis and labeled each tweet from a range of 0 (objective) to 1 (subjective). Objective tweets relay factual information, whereas subjective tweets typically communicate an opinion or belief. For the 2 hashtags #stayathome and #socialdistancing, we visualized sentiment using a histogram of values and compared the median sentiment between hashtags using the Mann-Whitney U test. Through terminology matching, we used key words present in social distancing rules (eg, “stay at least 6 feet [2 meters] from other people” or “avoid large gatherings”) to identify tweets with potentially objective information about these rules (Table S2 online).⁷ We manually reviewed 5% of the resulting tweet subset to verify what percentage of these tweets truly included information about social distancing rules and extrapolated prevalence for the full subset of tweets.

Topic modeling

To understand the major topics being discussed in our tweet sample, we applied an unsupervised machine-learning algorithm called Latent Dirichlet Allocation (LDA)^{Reference Blei, Ng and Jordan24} using the gensim Python library.^{Reference Rehurek and Sojka25} LDA is a commonly used topic-modeling approach to identify clusters of documents (in our case, tweets) by a representative set of words. The most highly weighted words in each cluster provide insight into the content of each topic. LDA requires users to input the number of expected topics. To determine the optimal number of topics, we trained multiple LDA models using different numbers of topics ranging from 4 to 50 and computed a topic coherence score (produced by comparing semantic similarity of a topic’s most highly weighted words) for each LDA model. Selecting the LDA model with the highest score, we ultimately chose 10 topics for the final model. An author without access or insight into the topic model initially labeled the topics using the 20 most frequently used terms ordered by weight. All authors then reached consensus on the topic labels. We identified the prevalence of topics by labeling tweets according to their most dominant topic. We identified example tweets whose content pertained >99% to 1 specific topic (Table 2, last column).

Results

We extracted 1,352,082 tweets during the 2-week period. After removal of repeat and non-English tweets, 574,903 tweets across 347,142 users (range, 1–836 tweets per user; mean, 1.6 tweets per user) were included in the analysis (Table 1). Of those tweets, 98.3% were unique. The hashtag #socialdistancing was included in 264,254 tweets and #stayathome was included in 332,075 tweets; 21,453 tweets contained both hashtags. Twitter for iPhone was the most commonly used platform (31%), followed by Twitter for Android (27.5%). Moreover, <50% of tweets had media (image or video) and more than one-third had a hyperlink. The median user had >3,000 posts and >400 followers at the time of tweeting. Also, 5% of accounts were verified, signified by a blue badge next to a user’s profile name indicating that an account of public interest is authentic.

Table 1. Characteristics of Tweets and Twitter Users^a

^a Median (IQR) is presented for numerical variables. N (%) is presented for categorical variables.

^b n = 574,903; 564,886 unique.

^c n = 264,254 (46.0%).

^d n = 332,075 (57.8%).

Table 2. Topic Clusters Identified by Topic Modeling^a

^a Words contributing to the model are shown in decreasing order of weighting. The topics are labeled manually based on these words. The number of tweets primarily with that topic, mean sentiment, mean subjectivity, and sample tweet are also included.

Word frequency

Our tweet data set contained 13,962,279 words and 93,337,108 characters. The 200 most frequently used words associated with each hashtag before processing are illustrated in Fig. 1. After processing, for both #socialdistancing and #stayathome, the most common word was ‘day’ (20,637 and 28,798 times, respectively). The next 19 most frequent words for #socialdistancing were ‘practice’ (13,988 times), ‘today’ (13,868 times), ‘quarantine’ (13,661 times), ‘coronalockdown coronavirus’ (13,659 times), ‘lockdown’ (12,262 times), ‘help’ (11,064 times), ‘see’ (10,492 times), ‘good’ (10,347 times), ‘new’ (10,204 times), ‘listen’ (9,682 times), ‘staysafe’ (9,609 times), ‘great’ (8,641 times), ‘pandemic’ (8,386 times), ‘way’ (8,243 times), ‘love’ (7,815 times), ‘walk’ (7754 times), ‘say’ (7,167 times), ‘everyone’ (7,141 times), and ‘family’ (6,953 times). For #stayathome the next most frequent words were ‘staysafe’ (21,312 times), ‘lockdown’ (20,839 times), ‘today’ (16,104 times), ‘good’ (13,972 times), ‘quarantine’ (13,758 times), ‘new’ (13,424 times), ‘help’ (13,388 times), ‘see’ (12,545 times), ‘love’ (11,757 times), ‘order’ (10,956 times), ‘everyone’ (9,956 times), ‘say’ (9,836 times), ‘pandemic’ (9,426 times), ‘thank’ (9,280 times), ‘week’ (9,086 times), ‘family’ (9,061 times), ‘life’ (8,793 times), ‘watch’ (8,286 times), and ‘want’ (8,185 times).

Fig. 1. Word cloud of top 200 words.

Sentiment polarity analysis

There was net positive sentiment polarity toward both #socialdistancing and #stayathome, with mean polarity scores of 0.150 (standard deviation [SD], 0.292) and 0.144 (SD, 0.287) respectively. Positive and neutral tweets accounting for 52.2% and 33.1% of tweets, respectively (Fig. 2). Moreover, <15% of tweets were negative and <2% were strongly negative. Although statistical differences between polarity categories were detected due to the large sample sizes, the differences in effect sizes were minimal (Fig. 2). Neutral and positive tweets had the largest absolute differences. Compared to #stayathome, #socialdistancing had 3.6% fewer neutral tweets and 3.2% more positive tweets.

Fig. 2. Sentiment analysis for all tweets and stratified by tweets with the hashtag #socialdistancing and #stayathome. Comparison between the two hashtags is done using χ² testing. Bonferroni correction was used to define statistical significance at a threshold of P = .01 (0.05/n, where n = 5 since 5 comparisons were completed).

Subjectivity analysis

Tweets tended to be more objective in nature and ~30% demonstrated near or complete objectivity (Fig. 3). The median subjectivity scores were similar for #socialdistancing (0.4; interquartile range [IQR], 0–0.59) and #stayathome (0.4; IQR, 0–0.6; P = .13). We matched 6,417 tweets that included key words related to social distancing rules and manually reviewed 320 of them. Of the 320 tweets, 249 were confirmed to be related to social distancing rules, yielding a rate of 77.6%. Extrapolating this to all social distancing tweets, we estimate that 4,980 (1.1% of all) tweets referenced social distancing rules.

Fig. 3. Subjectivity analysis for all tweets. Complete objectivity are defined as 0 and complete subjectivity are defined as 1.

Emotion analysis

Joy was the predominant emotion expressed in >50% of tweets with topics ranging from enjoying recreational activities, connecting with family members, and working from home. Examples:

If you are lucky enough to have even a small garden, now is the time to spend sprucing it up. Our spring gardening feature has helpful advice and new ideas to try, to help you make the most of it and #stayathome

and

To bridge the #socialdistancing gap between me & my #grandchildren, I am reading them stories #daily via #zoom & was so pleased to learn about #savewithstories

Fear was the second most common emotion, present in over 20% of tweets. Example:

We need to take the #homequarantine very, very strictly and seriously. if we don’t treat it like a matter of life n [sic] death, it shall definitely become one.. think of it as an army of terrorists outside your door guns.what would you do? !! #stayhome #corona #socialdistancing.

Surprise was the next most prevalent emotion, and tweets included themes of prolonged policy interventions and discovery of novel talents. Examples:

To save lives, #SocialDistancing must continue longer than we expect.

and

I played golf with my wife today. Odd, I didn’t even know she could play. #SocialDistancing, #familytime”

The least common emotions found in tweets were sadness, disgust, and anger (Fig. 4). We detected statistical differences in all emotions between #stayathome and #socialdistancing tweets. The largest differences in effect size were joy (#stayathome with 6.6% more) and fear (#socialdistancing with 11.9% more).

Fig. 4. Emotion analysis for all tweets and stratified by tweets with the hashtag #socialdistancing and #stayathome. Comparison between the two hashtags is done using χ² testing. Bonferroni correction was used to define statistical significance at a threshold of P = .008 (0.05/n, where n = 6 since 6 comparisons were completed).

Topic modeling

We identified and subjectively labeled the 10 main tweet topics. Table 2 displays the mean topic sentiment polarity and subjectivity score, key words, and example tweets. “Public opinion and values”, “media and entertainment”, and “quarantine measures and effects” emerged as the three most prevalent topics in 88,993, 71,589, and 65,229 tweets, respectively. Discussion of “spring and good sentiments” had the highest mean polarity of 0.26. “Public opinion and values” and “quarantine measures and effects” had the lowest mean polarity of 0.07. Mean subjectivity scores for all topics ranged from 0.33 to 0.43, with “public opinion and values” having the highest subjectivity score.

Discussion

Understanding the beliefs, attitudes, and thoughts of individuals and populations can aid public health organizations (eg, the WHO) and government institutions to identify public perception and gaps in communication and knowledge. We analyzed Twitter activity around the 2 most common social distancing trending hashtags at the study time to understand emotions, sentiment polarity, subjectivity, and topics discussed related to this NPI. Tweets predominantly showed positive sentiment polarity. Tweets were primarily linked to emotions of joy (~50%), fear, and surprise. Anger and disgust were the 2 least common emotions expressed. Analyzing key words, we demonstrated that tweets were primarily objective in nature and were used to disseminate public health information. We identified and labeled 10 main topics demonstrating insight into the thoughts and perceptions of the public.

Social media data and channels provide a rich platform to perform public sentiment analysis and have already been used to examine COVID-19 perceptions. One study leveraged social media to distribute a survey to nearly 9,000 individuals in the United States.^{Reference Nelson, Simard and Oluyomi26} Another large study surveyed 6,000 participants in the United Kingdom and the United States.^{Reference Geldsetzer27} Despite the robust combined sample size of 15,000 participants, there were inherent limitations to the design. These studies utilize nonprobability sampling like convenience and snowball sampling that are plagued by significant selection bias as well as potential reporting bias, making them prone to sampling error. Through probability sampling from the Twitter API, we analyzed nearly 575,000 English tweets across 350,000 users, providing a broader understanding of public perception that is likely more representational of the population. Using a machine-learning approach, we also explored topics and perceptions without introducing predefined researcher notions, thus limiting the risk of biases inherent to the question design.

Recent public opinion polls from a similar time period have shown that the overwhelming majority of US citizens favored the continuation of social distancing measures.^{Reference Epstein28,Reference Kirzinger, Kearney, Hamel and Brodie29} The positive attitude is clearly reflected in the sentiments found in the analyzed tweet sample. Most tweets were either positive or neutral in nature. As public sentiment shifts, we would expect this to be reflected in tweet sentiment as well. For government and public health officials, tweet sentiments may be an important measure to determine the public willingness to continue distancing, which in turn could inform future infection prediction models and social distancing policies.

Many tweets tend to express an opinion; however, tweets associated with #socialdistancing and #stayathome were predominately objective suggesting that these hashtags were used to transmit objective information potentially serving an important public health function. Combined with the large volume of tweets and the finding that 1.1% described social distancing rules, Twitter has the potential to fulfill an important educational function for public health messages.

Joy, fear, and surprise were the dominant emotions for the early phase of social distancing. This correlated well with the topics we discovered, which included leisure activities, community support, and messages of hope (ie, joy), concerns about food insecurity, spreading of the infection, effects of the quarantine (ie, fear), unpredictability of COVID and its unforeseen implications (ie, surprise). As time progresses and the effects of social distancing become more prominent, we would anticipate that other themes such as loss of income, unemployment, inflation, and financial burden would increase in frequency.

The topics we discovered can be aggregated into 4 larger domains. Activities that can be performed during social distancing included 3 topics: media and entertainment, activities, and music and media sharing. Tweets concerning the actual rationale and effect of the social distancing included 3 topics: public opinions and values, quarantine measures and effects, and quarantine and isolation. Two of these were the most prevalent topics. One domain covered the logistics of staying at home falling under a single topic: supplies, food, and orders. The last domain pertained to messages of support and cheering up others: thank healthcare and reduce spread, community support and businesses, spring and good sentiments.

Our study has several limitations. First, we used social media data and specifically Twitter for our analysis. Although there are >300 million monthly active Twitter users, our methodology likely introduced some sampling bias to those with internet and technology access. Second, we used 2 noncomprehensive trending hashtags to identify the most relevant social distancing tweets. We may have missed alternative terminology or key words such as “self isolation” and “corona lockdown,” which appeared as weighted terms in our topic modeling. However, given that these 2 hashtags were the top-trending social distancing hashtags, we expect that these were representative of social distancing during the study period. We recognize that the study period serves as an initial snapshot, rather than a complete evolution, of public perception towards social distancing and that sentiment and topics likely have changed over time. A longitudinal analysis will be a part of future directions. Third, despite analyzing a large number of tweets, we used only a subset of tweets during this time frame, which may have resulted in selection bias. Having analyzed only English tweets, our conclusions may not be generalizable to non-English speaking populations. Since most tweets do not have geolocation, we are also limited in making conclusions based on geographic areas or countries. Fourth, a 2017 study^{Reference Varol, Ferrara, Davis, Menczer and Flammini30} found that between 9% and 15% of all twitter accounts are bots, which may have affected our analysis. We used the Twitter bot analyzer Botometer^{Reference Davis, Varol, Ferrara, Flammini and Menczer31} to analyze a random sample of 3,900 users in our dataset. We found that 90% of users have a <20% chance of being a bot. Figure S1 shows the complete probability distribution. Excluding the remaining 10% of users did not change sentiment, emotion, or subjectivity analysis. Finally, we recognize the risk of labeling bias through assignment of topic themes to weighted terms. We attempted to prevent this by having 2 authors perform the topic modeling and 1 author independently perform the labeling task.

In the early phases of social distancing, we were able to successfully obtain and analyze a representative subset of tweets related to this topic. Performing sentiment, emotion, and content analysis of tweets provided valuable insight into the public’s beliefs and opinions on social distancing. Tweets were predominately objective with joy, fear, and surprise as leading emotions. Tweets contained social distancing instructions in >1% of tweets. In the early phases of social distancing, tweets were skewed toward leisure activities and discussion of rationale and effect of social distancing. As social distancing progresses and then is lifted, we anticipate sentiment and topics to change. Although “attitude is only one antecedent of behavior,” the positive emotions, the preponderance of objective tweets, and the topics supporting coping mechanisms led us to conclude that Twitter users generally supported the social distancing measure. Analyzing tweets about nonpharmaceutical interventions such as social distancing based on content, sentiment, and emotion may prove valuable for public health communication, knowledge dissemination, as well as adjustment of mitigation policies in the future. Future research to implement this analysis in real-time using the Twitter Streaming API³² could augment directed messaging based on user interest and emotion.

Acknowledgments

Financial support

No financial support was provided relevant to this article.

Conflicts of interest

All authors report no conflicts of interest relevant to this article.

Supplementary material

To view supplementary material for this article, please visit https://doi.org/10.1017/ice.2020.406

References

Coronavirus disease 2019 (COVID-19) situation report – 52. World Health Organization website. https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200312-sitrep-52-covid-19.pdf?sfvrsn=e2bfc9c0_4. Published March 12, 2020. Accessed August 4, 2020.Google Scholar

Ratcliffe, R. Coronavirus: travellers race home amid worldwide border closures and flight warnings. The Guardian website. https://www.theguardian.com/world/2020/mar/18/coronavirus-travellers-race-home-amid-worldwide-border-closures-and-flight-warnings. Published March 18, 2020. Accessed April 20, 2020.Google Scholar

Center for Systems Science and Engineering. COVID-19 dashboard. Johns Hopkins University website. https://coronavirus.jhu.edu/map.html. Accessed April 20, 2020.Google Scholar

Glanz, J, Carey, B, Holder, J, et al. Where America didn’t stay home even as the virus spread. The New York Times website. https://www.nytimes.com/interactive/2020/04/02/us/coronavirus-social-distancing.html. Published April 2, 2020. Accessed April 20, 2020.Google Scholar

Coppins, M. The social-distancing culture war has begun. The Atlantic website. https://www.theatlantic.com/politics/archive/2020/03/social-distancing-culture/609019/. Published March 30, 2020. Accessed April 20, 2020,Google Scholar

Ahmed, F, Zviedrite, N, Uzicanin, A. Effectiveness of workplace social distancing measures in reducing influenza transmission: a systematic review. BMC Public Health 2018;18:518.CrossRef Google Scholar PubMed

Coronavirus disease 2019 (COVID-19) social distancing. Centers for Disease Control and Prevention website. https://www.cdc.gov/coronavirus/2019-ncov/prevent-getting-sick/social-distancing.html. Accessed April 20, 2020.Google Scholar

Ngonghala, CN, Iboi, E, Eikenberry, S, et al. Mathematical assessment of the impact of non-pharmaceutical interventions on curtailing the 2019 novel coronavirus. Mathemat Biosci 2020;325:108364. doi: 10.1016/j.mbs.2020.108364.CrossRef Google Scholar PubMed

Cowling, BJ, Ali, ST, Ng, TWY, et al. Impact assessment of non-pharmaceutical interventions against coronavirus disease 2019 and influenza in Hong Kong: an observational study. Lancet Pub Health 2020;5:e279–e288.CrossRef Google Scholar

Qamar, MA. COVID-19: a look into the modern age pandemic. J Public Health (Berlin) 2020 May 11 [Epub ahead of print]. doi: 10.1007/s10389-020-01294-z.CrossRef Google Scholar

COVID-19 strategy update. World Health Organization website. https://www.who.int/docs/default-source/coronaviruse/covid-strategy-update-14april2020.pdf?sfvrsn=29da3ba0_6. Accessed April 20, 2020.Google Scholar

Q1 2019 earnings report. Twitter website. https://investor.twitterinc.com/financial-information/quarterly-results/default.aspx. Accessed April 20, 2020.Google Scholar

Gabarron, E, Dorronzoro, E, Rivera-Romero, O, Wynn, R. Diabetes on Twitter: a sentiment analysis. J Diabetes Sci Technol 2019;13:439–444.CrossRef Google Scholar PubMed

Zhang, L, Hall, M, Bastola, D. Utilizing Twitter data for analysis of chemotherapy. Int J Med Informat 2018;120:92–100.CrossRef Google Scholar PubMed

Davis, MA, Zheng, K, Liu, Y, Levy, H. Public response to Obamacare on Twitter. J Med Internet Res 2017;19(5):e167.10.2196/jmir.6946CrossRef Google Scholar PubMed

Mollema, L, Harmsen, IA, Broekhuizen, E, et al. Disease detection or public opinion reflection? Content analysis of tweets, other social media, and online newspapers during the measles outbreak in the Netherlands in 2013. J Med Internet Res 2015;17(5):e128.CrossRef Google Scholar

Chew, C, Eysenbach, G. Pandemics in the age of Twitter: content analysis of Tweets during the 2009 H1N1 outbreak. PLoS One 2010;5(11):e14118. doi: 10.1371/journal.pone.0014118.CrossRef Google Scholar PubMed

Mamidi, R, Miller, M, Banerjee, T, Romine, W, Sheth, A. Identifying key topics bearing negative sentiment on twitter: insights concerning the 2015–2016 Zika epidemic. JMIR Public Health Surveill 2019;5(2):e11036. doi: 10.2196/11036.Google Scholar PubMed

Search tweets—standard search API. Twitter website. https://developer.twitter.com/en/docs/tweets/search/api-reference/get-search-tweets. Accessed February 4, 2020.Google Scholar

Kearney, MW. Package “rtweet.” CRAN R project website. https://CRAN.R-project.org/package=rtweet. Accessed April 10, 2020.Google Scholar

Loria, S. TextBlob: simplified text processing. TextBlob website. https://textblob.readthedocs.io/en/dev/. Accessed April 10, 2020.Google Scholar

Nielsen, FÅ. A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. arxiv 2011:1103.2903.Google Scholar

Colneric, N, Demsar, J. Emotion recognition on Twitter: comparative study and training a unison model. IEEE Trans Affective Comput 2019. doi: 10.1109/TAFFC.2018.2807817.CrossRef Google Scholar

Blei, DM, Ng, AY, Jordan, MI. Latent Dirichlet allocation. J Mach Learning Res 2003;3:993–1022.Google Scholar

Rehurek, R, Sojka, P. Parallelized latent Dirichlet allocation. gensim website. https://radimrehurek.com/gensim/models/ldamulticore.html. Accessed January 30, 2020.Google Scholar

Nelson, LM, Simard, JF, Oluyomi, A, et al. US public concerns about the COVID-19 pandemic from results of a survey given via social media. JAMA Intern Med 2020;180:1020–1022.CrossRef Google Scholar PubMed

Geldsetzer, P. Knowledge and perceptions of COVID-19 among the general public in the United States and the United Kingdom: a cross-sectional online survey. Ann Intern Med 2020 Mar 20 [Epub ahead of print]. doi: 10.7326/M20-091228.CrossRef Google Scholar

Epstein, K. Just 14% of Americans support ending social distancing in order to reopen the economy, according to a new poll. Business Insider website. https://www.businessinsider.com/poll-most-americans-support-coronavirus-social-distancing-measures-2020-4. Published April 22, 2020. Accessed August 4, 2020.Google Scholar

Kirzinger, A, Kearney, A, Hamel, L, Brodie, M. KFF health tracking poll—early April 2020: the impact of coronavirus on life in America. Kaiser Family Foundation website. https://www.kff.org/coronavirus-covid-19/report/kff-health-tracking-poll-early-april-2020/. Published April 2020. Accessed August 4, 2020.Google Scholar

Varol, O, Ferrara, E, Davis, CA, Menczer, F, Flammini, A. Online human-bot interactions: detection, estimation, and characterization. 2017 Mar 9. arxiv 2020:1703.03107v2.Google Scholar

Davis, CA, Varol, O, Ferrara, E, Flammini, A, Menczer, F. BotOrNot: a system to evaluate social bots. In: Proceedings of the 25th International Conference Companion on World Wide Web—WWW‘16 Companion. Association for Computing Machinery website. http://dl.acm.org/citation.cfm?doid=2872518.2889302. Accessed July 18, 2020.Google Scholar

Filter real-time Tweets. Twitter website. https://developer.twitter.com/en/docs/tweets/filter-realtime/overview. Accessed February 4, 2020.Google Scholar

Table 1. Characteristics of Tweets and Twitter Usersa

Table 2. Topic Clusters Identified by Topic Modelinga

Fig. 1. Word cloud of top 200 words.

Fig. 2. Sentiment analysis for all tweets and stratified by tweets with the hashtag #socialdistancing and #stayathome. Comparison between the two hashtags is done using χ2 testing. Bonferroni correction was used to define statistical significance at a threshold of P = .01 (0.05/n, where n = 5 since 5 comparisons were completed).

Fig. 3. Subjectivity analysis for all tweets. Complete objectivity are defined as 0 and complete subjectivity are defined as 1.

Fig. 4. Emotion analysis for all tweets and stratified by tweets with the hashtag #socialdistancing and #stayathome. Comparison between the two hashtags is done using χ2 testing. Bonferroni correction was used to define statistical significance at a threshold of P = .008 (0.05/n, where n = 6 since 6 comparisons were completed).

Saleh et al. Supplementary Materials

File 22 KB

Article contents

Understanding public perception of coronavirus disease 2019 (COVID-19) social distancing on Twitter

Abstract

Methods

Data collection and processing

Sentiment and emotion analysis

Subjectivity analysis

Topic modeling

Results

Word frequency

Sentiment polarity analysis

Subjectivity analysis

Emotion analysis

Topic modeling

Discussion

Acknowledgments

Financial support

Conflicts of interest

Supplementary material

References

Saleh et al. Supplementary Materials

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests