Hostname: page-component-745bb68f8f-grxwn Total loading time: 0 Render date: 2025-01-13T11:39:06.045Z Has data issue: false hasContentIssue false

A survey of United States adult privacy perspectives and willingness to share real-world data

Published online by Cambridge University Press:  01 February 2023

Rachele M. Hendricks-Sturrup*
Affiliation:
Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, MA, USA Duke-Robert J. Margolis, MD, Center for Health Policy, Washington, DC, USA Department of Interdisciplinary Health Studies, Ohio University, Athens, OH, USA
Christine Y. Lu
Affiliation:
Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, MA, USA
*
Address for correspondence: R.M. Hendricks-Sturrup, DHSc, MSc, MA, Duke-Robert J. Margolis, MD, Center for Health Policy, Washington, DC, USA. Email: rachele.hendricks.sturrup@duke.edu
Rights & Permissions [Opens in a new window]

Abstract

Objective:

Real-world data privacy is a complex yet underexplored topic. To date, few studies have reported adult perspectives around real-world data privacy and willingness to share real-world data with researchers.

Methods:

Relevant survey items were identified in the literature, adapted and pilot tested among a small convenience sample, and finalized for distribution. The survey was distributed electronically in April 2021 among adults (≥18 years of age) registered in ResearchMatch (www.researchmatch.org). Microsoft Excel was used to assess descriptive statistics across demographical items and four privacy-related items.

Results:

Of 402 completed responses received, half of respondents (∼50%) expressed willingness to share their prescription history data and music streaming data with researchers and unwillingness to share real-world data from several other sources. Most (53–93%) of participants expressed concern with five statements reflecting the sharing and use of their digital data online. Most participants (71–75%) agreed with four statements focused on individual measures taken to protect their personal privacy and disagreed (77–85%) with two statements centered on not being concerned about sharing or 3rd party access to their personal data online.

Conclusions:

Our observations indicate an important yet unmet need to further explore and address real-world data privacy concerns among US adults engaging as prospective research participants.

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press on behalf of The Association for Clinical and Translational Science

Introduction

In the 21st century and perhaps beyond, the spectrum and concept of health data have and will continue to grow. This is equally true for the spectrum of ways in which health data can be generated, collected, stored, processed, and shared as evidence across entities and systems. The “patient-consumer spectrum” is a conceptual term that has emerged, being defined as the “phenomenon in which healthcare is rapidly transitioning from a periodic activity in fixed, traditional health care settings to an around-the-clock activity that involves the generation, use, and integration of data reflecting many aspects of individuals’ lives and behaviors” [1].

Like data along the patient-consumer spectrum, real-world data, often overlapping in definition with the term “big data,” are “data relating to patient health status and/or the delivery of health care routinely collected from a variety of sources” [2,Reference Price and Cohen3]. This comprises data from a wide variety of sources that include but are not limited to electronic health record (EHR) data, hospital or insurance company’s administrative and claims data, patient-generated data (e.g., data generated by in-home or self-monitoring devices such as wearables and fitness trackers), consumer-generated data, and laboratory data. Thus, given the largely unregulated nature of most real-world data sources, medical product regulators like the US Food and Drug Administration (FDA) have begun to suggest that clinical trial sponsors should consult with privacy experts to help them identify and address data privacy and security concerns when accessing data [4].

Health science and clinical research traditionally involves the collection of both survey and medical data. Yet, given that today billions of real-world data points can be generated from multiple and diverse sources by a single person, real-world data can be shared passively with researchers to understand clinical outcomes and health behavior in unprecedented ways, either before, following, or absent an intervention [Reference Cerrada, Min and Constantin5Reference Nickels, Edwards and Poole7]. Moreover, sophisticated algorithms and data analytics procedures can be implemented to identify and recruit individuals who meet specific inclusion/exclusion criteria for a research study [Reference Chapman, Domínguez, Fairweather, Delaney and Curcin8Reference Melzer, Maiwald, Prokosch and Ganslandt10]. Analytical datasets comprised of real-world data are also augmenting existing clinical trial methodologies, such as through the use and development of external control arms for cancer and/or rare disease drug trials [Reference Rahman, Ventz and McDunn11Reference Gross13]. Despite these scientific advancements demonstrating the clinical and investigational utility of real-world data, the literature is scant in studies exploring adult perspectives around real-world data privacy in relation to their willingness to share their real-world data with researchers [Reference Grande, Luna Marti and Feuerstein-Simon1417]. This study seeks to build on this nascent body of research.

Methods

Survey Development and Validation

A survey was developed and made available online using Qualtrics software. Parts of the survey contained four demographical items, three privacy threshold items, and two items focused on willingness to share data with researchers. Privacy-related survey items published by Seltzer et al. and validated and published by Doherty et al., as well as demographic-related survey items validated and published by Zhu et al. were selected, adapted, and re-validated among a convenience sample of five individuals who identify as both patients and health consumers, for bias, relevance, and cognition [Reference Seltzer, Goldshear and Guntuku1517]. Based on the pilot participants’ feedback, the survey questions were refined to improve item quality and clarity, and overall instrument clarity, appropriateness, and relevance.

A recent systematic review and a survey study each determined that age, income level, and education level are the strongest predictors of online or digital footprint activity [Reference Kontos, Blake, Chou and Prestin18,Reference Hinds and Joinson19]. Therefore, we applied Zhu et al.’s demographic data collection convention; demographic data collected included age, education level, duration of using online medical websites (years), and annual frequency of getting ill [Reference Zhu, Shen and Xu16]. The final full survey consisted of 4 demographic items and 12 closed and 2 open-ended items focused on participants’ privacy concerns and perspectives (see full survey in Supplement).

Survey Population

Adult survey participants were identified and contacted online using ResearchMatch, a “disease-neutral, Web-based recruitment registry to help match individuals who wish to participate in clinical research studies with researchers actively searching for volunteers throughout the US.” Populations in ResearchMatch live within the USA and Puerto Rico, are of all ages and races/ethnicities, and consist of healthy volunteers as well as those living with medical conditions. Individuals within the ResearchMatch database sign up to become volunteers through the ResearchMatch platform to support research studies.

Survey Distribution

Access to the ResearchMatch platform for this study was provided through Ohio University. At the start of the study, a total of 148,090 participants were registered in ResearchMatch. In April 2021, the electronic survey was administered to adults (≥18 years of age) registered in the ResearchMatch database who agreed to be contacted to engage in the survey after receiving an informational electronic invitation letter via the ResearchMatch platform. As participants agreed to participate in the survey, they received an email correspondence with details about the study and link to the electronic survey.

A total of 13 searches were conducted in ResearchMatch to contact 19,499 ResearchMatch volunteers. Individuals were invited to participate in the survey, regardless of health status, race, gender, or any other mutable or immutable characteristics. Participants’ personal contact details were received only after participants agreed to participate in the survey. Participant email addresses were deleted or destroyed to prevent reidentification at the conclusion of the study.

Survey Incentives and Completion Reminders

Financial incentives have been used in health research contexts to motivate participation and engagement (e.g., completing health risk assessments, health surveys, etc.) [20]. We offered individuals who completed the survey a random chance to receive a $250 (a total of two gift cards available), $100 (a total of four gift cards available), $50 (a total of six gift cards available), or $25 gift card (a total of 12 gift cards available). A random selection tool developed and deployed using Microsoft Excel to randomly select email addresses of survey participants and deliver the gift card incentives.

Survey participants were informed that their participation was entirely voluntary. No survey questions were mandatory, and participants were informed that they may skip any question(s) at any time. Survey participants were welcomed to contact the research team at any time with any questions or concerns about the study. Reminders were sent up to three times to participants who began but had yet to complete and submit the survey within the study timeframe.

Data Analysis

The present analysis centers on a sample of ResearchMatch participants who completed the electronic survey. This analysis is part of a larger study examining US adult privacy-related experiences and motivations to share real-world data with researchers [Reference Hendricks-Sturrup, Zhang and Lu21]. An online Qualtrics software tool [22] was used to calculate an ideal survey sample size (n = 384) based on the total ResearchMatch population (95% CI: 5% margin of error). Survey responses were assessed using descriptive statistics in Microsoft Excel.

Ethics Review, Oversight, and Approval

ResearchMatch is a registry and collaborative project that is maintained at Vanderbilt University and overseen by the Vanderbilt University Institutional Review Board. The present study was reviewed and approved by the Ohio University Institutional Review Board under protocol #20-E-457. ResearchMatch participants’ completion of the survey implied their consent to engage in the survey.

Results

Overall Assessment

A total of 598 volunteers agreed to receive direct invitations to participate in the survey. Following receipt of email invitations, 470 participants initiated the survey and 402 completed and submitted the survey (86% completion rate among those who initiated the survey). Three participants who initiated but did not complete and submit the survey cited reasons for their non-completion, which were that the participant either did no't understand the way electronics affect health or did no't understand the nature of the survey questions. One participant noted that their age (>55) could be a factor as to why the participant did not understand the nature of the survey questions.

Participant Characteristics

Table 1 summarizes demographic characteristics among all survey participants/respondents who completed or submitted the survey. Nearly all (99%) of the survey participants were over the age of 21. Most participants (87%) held either some college/associates/trade school, a bachelors’ degree, or masters’ degree. Just over half of participants (56%) reported to have used online medical websites for 7 years or more. Most participants (94%) reported an annual frequency of getting ill of six occurrences or less.

Table 1. Summary of survey respondents’ demographics (excluding non-responses)

Participant Willingness to Share Data with Researchers

Participants were asked to indicate their willingness to share diverse types of digital data with researchers. Table 2 presents a summary of participants’ responses, indicating that nearly half of respondents expressed willingness to share their prescription history data and music streaming data with researchers. For all other data types, less than half of the participants were willing to share such data with researchers. Most participants indicated unwillingness to share data from eight sources: email history, text message and phone call data, Google search history, online purchase history, tax records and income history, credit card statements, electronic medical records, and geolocation. Most participants did not use the following data sources: Twitter, Snapchat, and Yelp reviews and ratings.

Table 2. Summary of ResearchMatch participants’ willingness to share digital data with health researchers (excluding non-responses)

Online Data Sharing and Use

Participants were asked whether they were concerned with five statements reflecting the sharing and use of their digital data online (Fig. 1). The five statements centered on concern about 1) data use beyond what is stated within an online company’s or website’s privacy policy, 2) internet users defrauding or abusing personal information, 3) data sharing among companies and websites without expressed data subject consent, 4) inappropriate disclosure of data among online friends, and 5) false representation or identity among online users. Most (53–93%) of participants expressed concern with each of the five statements.

Fig. 1. Participants’ concerns about online data use and sharing.

Sharing Personal Information Online and Personal Privacy

Participants were asked whether they agreed or disagreed with six statements focused on sharing personal information online (Fig. 2). Most participants (71–75%) either agreed or strongly agreed to 1) regularly using anti-virus/phishing/spamming software, programs, or tools, or exercising options to protect the privacy of their data (e.g., restricting app data permissions, etc.); 2) sharing minimal personal information about themselves online due to privacy concerns feeling uncomfortable when others have access to their personal information; 3) being a generally private person in everyday life, and 4) feeling uncomfortable when others have access to their personal information. Most participants (77–85%) either disagreed or strongly disagreed with the ideas that 1) it does not bother them that a history of their online activities may be available to 3rd parties online and 2) there is no need to be concerned about sharing personal information online.

Fig. 2. Participants’ concerns about revealing personal information online and personal privacy.

Discussion

This study is one of the first studies to explore the real-world data privacy perspectives of a national sample of US adults. Many participants have taken at some point measures to protect their personal data privacy and expressed concern about sharing personal data or 3rd party access to their personal data online. Participants were generally willing to share their prescription history data and music streaming data with researchers but were unwilling to share real-world data from several other sources. In practice, these negative experiences and limited data sharing preferences might function as a rate-limiting factor to successfully engaging prospective adult participants in potentially valuable, data-driven research. For instance, recent findings from our related study show that 1) privacy-related experiences may shape research participants’ willingness to share real-world data and 2) age range and education level may shape individuals’ willingness to share certain real-world data sources with researchers [Reference Hendricks-Sturrup, Zhang and Lu21]. Yet, researchers may feel encouraged by further observations showing that negative privacy-related experiences are not associated with unwillingness to share most real-world data sources [Reference Hendricks-Sturrup, Zhang and Lu21].

There are limitations to note for this study. Given the relatively low proportion of adults under age 20 engaged in this study, the findings may not be generalizable to adults in this general age demographic. Future work may explore the internal validity of our findings among this age group. Also, given that our study sample derived solely from ResearchMatch, a population that is likely or already somewhat engaged in data sharing for research, future work should explore whether our present findings might align with or differ from members of the general population who are not as engaged in research.

Our findings are partially consistent with observations in Seltzer et al., who conducted very similar survey among their local patient cohort [Reference Seltzer, Goldshear and Guntuku15]. Interestingly, most participants in the present study and that of Seltzer et al. expressed willingness to share prescription history and music streaming data [Reference Seltzer, Goldshear and Guntuku15]. Although, a relatively greater proportion of participants expressed concerns about internet users defrauding or abusing their personal information and online companies and websites either sharing information with other parties without explicit consent and/or using information for purposes not explicitly stated in their privacy policies. Given that Seltzer et al.’s study was completed prior to the COVID-19 pandemic, the relatively greater proportion of participants’ privacy concerns might be related to the COVID-19 experience [Reference Seltzer, Goldshear and Guntuku15]. Recent studies show that individuals’ data privacy concerns were likely heightened during the COVID-19 pandemic, as privacy laws and regulations became relaxed to accommodate digital public health surveillance and remote health care [Reference Grande, Mitra and Marti23,24].

Our findings likely indicate an unmet need to address participants’ privacy concerns about sharing their real-world data for research [Reference Grande, Luna Marti and Feuerstein-Simon14,Reference Seltzer, Goldshear and Guntuku15]. Immediate efforts to address these privacy concerns should include, as recommended during a recent National Academies of Science, Engineering, and Medicine expert convening [24]:

  1. 1. Ensuring privacy protections through comprehensive privacy laws and regulations.

  2. 2. Improving enforcements of third-party privacy policies and data use consent.

  3. 3. Facilitating education and engagement among industry and their third-party business associates, data subjects, and other key stakeholders in data governance and stewardship.

  4. 4. Implementing privacy-by-design or privacy enhancing technologies.

  5. 5. Curating and maintaining trustworthy data-sharing environments.

  6. 6. Addressing structural incentives that discourage electronic data sharing.

Like the FDA, industry and non-industry stakeholders, including but not limited to those engaged in the Personalized Medicine Coalition [25] and the Duke-Margolis Center for Health Policy’s Real-World Evidence Collaborative [26], have recently and continue to raise unresolved privacy concerns and other ethical considerations that follow or accompany acquisitions and uses of real-world data. Therefore, it would be both important and mission critical to engage research participants, including those engaged in ResearchMatch, in real-world evidence policy discussions, particularly in light of FDA’s guidance on use of real-world data in regulatory submissions for investigational new drug applications, and biologics license applications [27].

Conclusion

US adults engaging as prospective participants in health research hold significant concerns about the privacy of their real-world data. Our study identified not only this but also an apparent need to balance such individuals’ real-world data privacy concerns with their impetus to engage in health research by sharing specific sources of their real-world data. Moving forward, opportunities to engage US adults as prospective health research participants in policy-related efforts intended to increase the acceptability of real-world evidence in regulatory submissions should not be missed.

Supplementary Material

To view supplementary material for this article, please visit https://doi.org/10.1017/cts.2023.4.

Acknowledgments

C.Y.L. was supported in part by an Ebert Career Development Award at Harvard Pilgrim Health Care Institute and Harvard Medical School.

Disclosures

C.Y.L. received consulting fees from the Center for Genomic Medicine, Massachusetts General Hospital and the Maine Cancer Genomics Initiative, the Jackson Laboratory, for unrelated work. R.H-S. reports employment with the National Alliance Against Disparities in Patient Health for unrelated work.

References

FPF Staff. FPF Presents @ RightsCon 2020: “Frontiers in health data privacy: navigating blurred expectations across the patient-consumer spectrum” - Future of Privacy Forum. (https://fpf.org/blog/fpf-presents-rightscon-2020-frontiers-in-health-data-privacy-navigating-blurred-expectations-across-the-patient-consumer-spectrum/)Google Scholar
Margolis Center for Health Policy. A Framework for Regulatory Use of Real-World Evidence. (https://healthpolicy.duke.edu/publications/framework-regulatory-use-real-world-evidence)Google Scholar
Price, WN, Cohen, IG. Privacy in the age of medical big data. Nat Med 2019; 25(1): 3743. DOI: 10.1038/s41591-018-0272-7.CrossRefGoogle ScholarPubMed
Research C for DE. Considerations for the Use of Real-World Data and Real-World Evidence To Support Regulatory Decision-Making for Drug and Biological Products. U.S. Food and Drug Administration. 2021. (https://www.fda.gov/regulatory-information/search-fda-guidance-documents/considerations-use-real-world-data-and-real-world-evidence-support-regulatory-decision-making-drug)Google Scholar
Cerrada, CJ, Min, JS, Constantin, L, et al. A prospective real-world study exploring associations between passively collected tracker data and headache burden among individuals with tension-type headache and migraine. Pain Ther. 2022; 11(1): 153170. DOI: 10.1007/s40122-021-00336-y.CrossRefGoogle ScholarPubMed
Senerchia, CM, Ohrt, TL, Payne, PN, et al. Using passive extraction of real-world data from eConsent, electronic patient reported outcomes (ePRO) and electronic health record (EHR) data loaded to an electronic data capture (EDC) system for a multi-center, prospective, observational study in diabetic patients. Contemp Clin Trials Commun 2022; 28: 100920. DOI: 10.1016/j.conctc.2022.100920.CrossRefGoogle Scholar
Nickels, S, Edwards, MD, Poole, SF, et al. Toward a mobile platform for real-world digital measurement of depression: user-centered design, data quality, and behavioral and clinical modeling. JMIR Mental Health 2021; 8(8): e27589. DOI: 10.2196/27589.CrossRefGoogle Scholar
Chapman, M, Domínguez, J, Fairweather, E, Delaney, BC, Curcin, V. Using computable phenotypes in point-of-care clinical trial recruitment. Public Health Inf 2021, 560564. DOI: 10.3233/SHTI210233.Google ScholarPubMed
Evans, SR, Paraoan, D, Perlmutter, J, Raman, SR, Sheehan, JJ, Hallinan, ZP. Real-world data for planning eligibility criteria and enhancing recruitment: recommendations from the clinical trials transformation initiative. Ther Innov Regul Sci 2021; 55(3): 545552. DOI: 10.1007/s43441-020-00248-7.CrossRefGoogle ScholarPubMed
Melzer, G, Maiwald, T, Prokosch, HU, Ganslandt, T. Leveraging real-world data for the selection of relevant eligibility criteria for the implementation of electronic recruitment support in clinical trials. Appl Clin Inf 2021; 12(01): 017026. DOI: 10.1055/s-0040-1721010.Google ScholarPubMed
Rahman, R, Ventz, S, McDunn, J, et al. Leveraging external data in the design and analysis of clinical trials in neuro-oncology. Lancet Oncol 2021; 22(10): e456e465. DOI: 10.1016/S1470-2045(21)00488-5.CrossRefGoogle ScholarPubMed
Gray, CM, Grimson, F, Layton, D, Pocock, S, Kim, J. A framework for methodological choice and evidence assessment for studies using external comparators from real-world data. Drug Saf. 2020; 43(7): 623633. DOI: 10.1007/s40264-020-00944-1.CrossRefGoogle ScholarPubMed
Gross, AM. Using real world data to support regulatory approval of drugs in rare diseases: a review of opportunities, limitations & a case example. Curr Probl Cancer 2021; 45(4): 100769. DOI: 10.1016/j.currproblcancer.2021.100769.CrossRefGoogle ScholarPubMed
Grande, D, Luna Marti, X, Feuerstein-Simon, R, et al. Health policy and privacy challenges associated with digital technology. JAMA Netw Open 2020; 3(7): e208285. DOI: 10.1001/jamanetworkopen.2020.8285.CrossRefGoogle ScholarPubMed
Seltzer, E, Goldshear, J, Guntuku, SC, et al. Patients’ willingness to share digital health and non-health data for research: a cross-sectional study. BMC Med Inf Decis Making 2019; 19(1): 157. DOI: 10.1186/s12911-019-0886-9.CrossRefGoogle ScholarPubMed
Zhu, P, Shen, J, Xu, M. Patients’ willingness to share information in online patient communities: questionnaire study. J Med Intern Res 2020; 22(4): e16546. DOI: 10.2196/16546.Google ScholarPubMed
Doherty C, Lang DM, Lang M. An Exploratory Survey of the Effects of Perceived Control and Perceived Risk on Information Privacy. In: 9th Annual Symposium on Information Assurance. 2014. (https://www.semanticscholar.org/paper/An-Exploratory-Survey-of-the-Effects-of-Perceived-Doherty-Lang/36f7f631953c90cf0c4ef7477f6fc20319d5b0b5)Google Scholar
Kontos, E, Blake, KD, Chou, WYS, Prestin, A. Predictors of eHealth usage: insights on the digital divide from the health information national trends survey 2012. J Med Intern Res 2014; 16(7): e3117. DOI: 10.2196/jmir.3117.Google ScholarPubMed
Hinds, J, Joinson, AN. What demographic attributes do our digital footprints reveal? A systematic review. PLOS ONE 2018; 13(11): e0207112. DOI: 10.1371/journal.pone.0207112.CrossRefGoogle ScholarPubMed
PsycNET. Anticipated regret and health behavior: A meta-analysis. (https://doi.apa.org/doiLanding?doi=10.1037%2Fhea0000294)Google Scholar
Hendricks-Sturrup, RM, Zhang, F, Lu, CY. A survey of research participants’ privacy-related experiences and willingness to share real-world data with researchers. J Personal Med 2022; 12(11): 1922. DOI: 10.3390/jpm12111922.CrossRefGoogle ScholarPubMed
Qualtrics. Sample Size Calculator & Complete Guide in 2022. Qualtrics. 2020. (https://www.qualtrics.com/blog/calculating-sample-size/)Google Scholar
Grande, D, Mitra, N, Marti, XL, et al. Consumer views on using digital data for COVID-19 control in the United States. JAMA Netw Open 2021; 4(5): e2110918. DOI: 10.1001/jamanetworkopen.2021.10918.CrossRefGoogle ScholarPubMed
Health Data Sharing Special Publication. National Academy of Medicine. (https://nam.edu/health-data-sharing-special-publication/)Google Scholar
Personalized Medicine Coalition. Advocates for precision medicine : Health Data Working Group. 2022. (https://www.personalizedmedicinecoalition.org/Members/Information_Management_Working_Group)Google Scholar
Real-World Evidence Collaborative. Margolis Center for Health Policy. 2022. (https://healthpolicy.duke.edu/projects/real-world-evidence-collaborative)Google Scholar
Figure 0

Table 1. Summary of survey respondents’ demographics (excluding non-responses)

Figure 1

Table 2. Summary of ResearchMatch participants’ willingness to share digital data with health researchers (excluding non-responses)

Figure 2

Fig. 1. Participants’ concerns about online data use and sharing.

Figure 3

Fig. 2. Participants’ concerns about revealing personal information online and personal privacy.

Supplementary material: PDF

Hendricks-Sturrup and Lu supplementary material

Hendricks-Sturrup and Lu supplementary material

Download Hendricks-Sturrup and Lu supplementary material(PDF)
PDF 1.8 MB