Strategies and challenges for constructing and collecting visual corpora from image-based social media platforms
DOI:
https://doi.org/10.24434/j.scoms.2024.01.3881Keywords:
visual corpus, social media, data collection, InstagramAbstract
Visual elements play an important role within the multimodal nature of social media (Pearce et al., 2020). A growing body of research has focused on the analysis of still and moving images from different social media platforms from various perspectives of communication and media studies (Hautea, Parks, Takahashi, & Zeng, 2021; Li & Xie, 2020; Veum & Undrum, 2018). Although the aforementioned studies describe visual data collection, their principal focus does not rely on this collection, but on data analysis. Little attention has been paid to the challenges of collecting visual datasets (Highfield & Leaver, 2016). In this paper, I propose a methodological overview of several strategies for collecting large corpora of visual data from image-based social media platforms. Provided with exemplary publications, I review five strategies for collecting visual corpora: hashtag-based, account-based, metadata-based, random sampling, and mixed approach. Lastly, I present a case study with my own mixed approach to the collection of visual data from Instagram. Considering the usage, advantages and limitations of each strategy, the article will contribute to the developing science of social media research. I believe that a literature analysis of visual data collection strategies and a provided case study can help researchers optimize visual data collection from image-based social media.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Yuliya Samofalova
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
The electronic contributions in the Internet are distributed under the "Creative Commons Attribution – ShareAlike" - License (CC BY-SA 4.0). Under the new license, anyone can use, distribute, and modify the content if they credit the original author and share any new creations under the same or compatible conditions. Here are the key features of the CC-BY-SA license:
- CC (Creative Commons): A globally recognized framework for licensing creative works.
- BY (Attribution): Requires acknowledgment of the author whenever the work is used.
- SA (ShareAlike): Ensures that adaptations of the work are distributed under the same or compatible licensing terms.