Institutional-and-news-media-tweet-dataset-for-COVID-19-social-science-research

Open access data repository for institutional/news media tweet dataset in the time of COVID-19 pandemic

A pre-print with detailed information is available at: https://arxiv.org/abs/2004.01791


#UPDATE EVERY THURSDAY

News media and government/international organization tweets from different countries (e.g. US, UK, China, Spain, France, Germany). Feel free to share this repo.

Data were collected using the Twitter REST API.

Data collection started on March 12, 2020. On the first run I collected the most recent 3200 tweets (the official API limit) from each target account; since then I have updated the data weekly on my PC.

##V1.3: updated with data from April 16 to April 22.

  • Newly added: BR_tweets (Brazilian government, president, news media)
  • Attention: @French_Gov posted no tweets during 0416-0422
  • Attention: @BorisJohnson posted no tweets during 0416-0422

##V1.2: updated with data from April 9 to April 15.

  • Newly added: EU_leadership (@BorisJohnson, @EmmanuelMacron, @GiuseppeconteIT, @sanchezcastejon)
  • Newly added: election_us (@BernieSanders, @JoeBiden, @realDonaldTrump, @POTUS)
  • Newly added: national_gov_foreign_office (a major update to the previous gov file; it includes 14 European/US/Chinese government/foreign office accounts)
  • Minor change: @globaltimesnews moved from ADDITIONAL_news_tweet_id to CHINA_news_tweet_id.
  • Minor change: @spiegelonline stopped tweeting on 20200108, so it was removed from my collection query; its tweet IDs were saved in V1.0.

##V1.1: updated with data from April 2 to April 8.

##First online: April 2, 2020


IMPORTANT:

Data were crawled by Twitter account user name (the same as the txt file name). Some of the accounts may have been inactive for a long time (for example, @SanidadPublicaEs stopped tweeting in 2014 but was reactivated when COVID-19 became a global crisis).

I did NOT remove the historical data from before the coronavirus outbreak. If you have any questions, please contact me (see email below).
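
Because the pre-outbreak history is kept, you may want to narrow an ID list to a date range before hydrating. Below is a minimal sketch, assuming the IDs are standard Twitter snowflake IDs, whose upper bits encode a millisecond timestamp offset by 1288834974657. The function names and the cutoff date are hypothetical and not part of this repository. IDs issued before snowflake IDs were introduced (late 2010) decode to a date around November 2010, so any 2020 cutoff still drops them.

```python
from datetime import datetime, timezone
from typing import Iterable, List

# Offset (in milliseconds since the Unix epoch) used by Twitter snowflake IDs.
TWITTER_EPOCH_MS = 1288834974657


def tweet_id_to_datetime(tweet_id: int) -> datetime:
    """Decode the creation time embedded in a snowflake tweet ID (UTC)."""
    ms = (tweet_id >> 22) + TWITTER_EPOCH_MS
    return datetime.fromtimestamp(ms / 1000, tz=timezone.utc)


def keep_since(tweet_ids: Iterable[int], cutoff: datetime) -> List[int]:
    """Keep only the IDs created at or after `cutoff` (timezone-aware)."""
    return [t for t in tweet_ids if tweet_id_to_datetime(t) >= cutoff]


# Hypothetical cutoff; choose whatever start date suits your study.
EXAMPLE_CUTOFF = datetime(2020, 1, 1, tzinfo=timezone.utc)
```

Filtering on the ID alone avoids spending API requests on tweets outside the period of interest.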


How to Hydrate

Two tools are recommended: Hydrator (https://github.com/DocNow/hydrator)

or twarc (https://github.com/DocNow/twarc).

Please follow the instructions in the respective repository.
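
For scripted hydration, the steps can be sketched roughly as follows. This is a minimal sketch, not the author's pipeline: it assumes each txt file holds one tweet ID per line, and the helper names (`read_tweet_ids`, `hydrate_to_jsonl`) are hypothetical. The actual lookup is passed in as a callable, so any hydration client can be plugged in.

```python
import json
from pathlib import Path
from typing import Callable, Iterable, List


def read_tweet_ids(path: str) -> List[str]:
    """Read tweet IDs from a text file, assuming one ID per line.

    Blank lines, non-numeric lines, and duplicates are skipped.
    """
    ids, seen = [], set()
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        tid = line.strip()
        if tid.isdigit() and tid not in seen:
            seen.add(tid)
            ids.append(tid)
    return ids


def hydrate_to_jsonl(
    ids: Iterable[str],
    hydrate: Callable[[Iterable[str]], Iterable[dict]],
    out_path: str,
) -> int:
    """Write each hydrated tweet as one JSON line; return how many were written."""
    count = 0
    with open(out_path, "w", encoding="utf-8") as out:
        for tweet in hydrate(ids):
            out.write(json.dumps(tweet) + "\n")
            count += 1
    return count


# Possible usage with twarc (v1), assuming you have valid API credentials;
# the file names and credential variables are placeholders:
#
#   from twarc import Twarc
#   t = Twarc(consumer_key, consumer_secret, access_token, access_token_secret)
#   ids = read_tweet_ids("some_account.txt")
#   hydrate_to_jsonl(ids, t.hydrate, "some_account.jsonl")
```

The number of tweets written can be lower than the number of IDs read, since deleted or protected tweets are not returned on hydration.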


Contact me

Jingyuan Yu
jingyuan[dot]yu[at]e-campus[dot]uab[dot]cat


License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
