• Home
  • 2021 Datasets

2021 Qualitative Datasets

DATASET 1 (in 2020 list): COVID-19 TWEETS DATASET (BY RABINDRA LAMSAL)
Link: https://ieee-dataport.org/open-access/corona-virus-covid-19-tweets-dataset
Format: CSV | Crawled: Mar, 2020 – present
Access: Free, but you will need to create a free IEEE account to download
Main variables: Tweet ids; sentiment score (positive, negative, neutral)
IMPORTANT NOTE: This .csv dataset contains tweet IDs. To obtain the text of the tweets and other identifying information, you will need to download and use a hydrator to retrieve the information. You can find one example of a hydrator here.

DATASET 2 (in 2020 list): NEWS MEDIA AND GOVERNMENT/INTERNATIONAL ORGANIZATION TWEETS
(BY JINGYUAN YU)
Link: https://github.com/narcisoyu/Institional-and-news-media-tweet-dataset-for-COVID-19-social-science-research
Format: txt | Crawled: Mar 2020 – present
Access: Free
Example variables: Created_at; Hashtags; In_reply_to; Tweet id; User_screen_name;
IMPORTANT NOTE: This .txt dataset contains tweet IDs. To obtain the text of the tweets and other identifying information, you will need to download and use a hydrator to retrieve the information. You can find one example of a hydrator here.

DATASET 3: THREADED POSTS ABOUT COVID-19 VACCINE NEWS RETRIEVED FROM THE REDDIT DISCUSSION FORUM ON THE CORONAVIRUS (BY XING HAN LU)
Link: https://www.kaggle.com/xhlulu/covid19-vaccine-news-reddit-discussions
Format: CSV | Size: 20 MB | Crawled: Nov 2020 – Jan 2021
Access: Free, but you will need to create a free Kaggle account to download
Main variables: Post ID; Post author; Post date; Post score; Post link; Comment ID; Comment author; Comment date; Comment parent ID; Comment score; Comment text

DATASET 4: INTERVIEWS ON THE PERSONAL IMPACT OF COVID-19 (UNIVERSITY OF ALABAMA-BIRMINGHAM)
Link: https://uab.contentdm.oclc.org/digital/collection/COVID19/search
Format: PDF interview transcripts | Size:  430KB (5 files) | Collected: Oct/Nov 2020; CSV dataset file (https://drive.google.com/file/d/1eBN5pM0DVYne0yubRPKav2MBJJMzjhfL/view?usp=sharing) | Size: 72KB
Access: Free
Main variables: Interview ID; Participant name; Utterance

DATASET 5: COVID-19 PRESS BRIEFINGS BY THE WORLD HEALTH ORGANIZATION (JAN 2020 – MAR 2021)
Link: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/media-resources/press-briefings#
Format: Video; Transcript
Access: Free, available on WHO website
Main variables: Date; Speaker; Utterance

DATASET 6: STUDENT REFLECTIONS ON RETURNING TO NORMAL LIFE AFTER COVID-19 (NEW YORK TIMES)
Link: https://www.nytimes.com/2021/03/23/learning/are-you-nervous-about-returning-to-normal-life.html
Description: Responses by high school students to the prompt “Are you nervous about returning to normal life?” (March 2021)
Format: Text
Access: Free, available on the Learning Network section of the NY Times website
Main variables: Name; Location; Date; Comment text

DATASET 7: VIDEO DIARY OF A TRAUMA SURGEON DURING THE COVID-19 PANDEMIC (WIRED)
Link: https://www.wired.com/video/watch/covid-doctor-diary
Description: Trauma surgeon reflecting on their work at a Los Angeles hospital (Mar – May 2020)
Format: Video (23:20); Transcript
Access: Free, available on the WIRED website
Main variables: Date; Utterance