Hostname: page-component-cd9895bd7-p9bg8 Total loading time: 0 Render date: 2024-12-26T03:25:56.968Z Has data issue: false hasContentIssue false

Race, ethnicity, and considerations for data collection and analysis in research studies

Published online by Cambridge University Press:  29 October 2024

Sima Sharghi*
Affiliation:
Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, NY, USA
Shokoufeh Khalatbari
Affiliation:
The Michigan Institute for Clinical and Health Research, University of Michigan, Ann Arbor, MI, USA
Amy Laird
Affiliation:
Oregon Clinical and Translational Research Institute, Oregon Health and Science University, Portland, OR, USA
Jodi Lapidus
Affiliation:
Oregon Clinical and Translational Research Institute, Oregon Health and Science University, Portland, OR, USA
Felicity T. Enders
Affiliation:
Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
Jareen Meinzen-Derr
Affiliation:
Department of Pediatrics, Cincinnati Children’s Hospital Medical Center, University of Cincinnati College of Medicine, Cincinnati, OH, USA
Amanda L. Tapia
Affiliation:
Department of Quantitative Health Sciences, Division of Clinical Trials and Biostatistics, Mayo Clinic, Rochester, MN, USA
Jody D. Ciolino
Affiliation:
Department of Preventive Medicine – Biostatistics, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
*
Corresponding author: S. Sharghi; Email: sima_sharghi@urmc.rochester.edu
Rights & Permissions [Opens in a new window]

Abstract

Research studies involving human subjects require collection of and reporting on demographic data related to race and ethnicity. However, existing practices lack standardized guidelines, leading to misrepresentation and biased inferences and conclusions for underrepresented populations in research studies. For instance, sometimes there is a misconception that self-reported racial or ethnic identity may be treated as a biological variable with underlying genetic implications, overlooking its role as a social construct reflecting lived experiences of specific populations. In this manuscript, we use the We All Count data equity framework, which organizes data projects across seven stages: Funding, Motivation, Project Design, Data Collection, Analysis, Reporting, and Communication. Focusing on data collection and analysis, we use examples – both real and hypothetical – to review common practice and provide critiques and alternative recommendations. Through these examples and recommendations, we hope to provide the reader with some ideas and a starting point as they consider embedding a lens of justice, equity, diversity, and inclusivity from research conception to dissemination of findings.

Type
Review Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Association for Clinical and Translational Science

Introduction

Research studies involving human subjects typically carry a requirement to collect and report on demographic data pertaining to race and ethnicity. However, there is a lot of variability and minimal guidance on best practices for doing so. Common practices can propagate issues and misunderstanding around race and ethnicity. Race and ethnicity are not reliable indicators of genetic differences and their use in genetic research and medical practice can lead to inaccuracies and reinforce social biases.

The conduct of a study, from conception of the research question to dissemination of findings, involves many decisions that should be considered through a lens of justice, equity, and inclusivity. Data equity [1] frameworks offer principles and practices to approach decisions through this lens. In this manuscript, we consider the We All Count (WAC) data equity framework [2] (see supplemental file for a list of abbreviated words), which organizes a data project into seven stages: Funding, Motivation, Project Design, Data Collection and Sourcing, Analysis, Reporting, and Communication and Distribution. We focus on data collection and analysis, and we provide examples of typical practice, illustrate why they may or may not be optimal, and provide considerations and recommendations for alternative practical approaches. We unpack issues with current practices in collecting, analyzing, and reporting race and ethnicity data through many of our own experiences and lessons learned in conducting research studies.

We assert that self-reported race and ethnicity should not be used as a marker or surrogate for genetics, as they represent a social construct rather than a biological one, reflecting lived experiences [Reference Collins3,Reference Mersha and Beck4]. There is a need to collect and report on race and ethnicity in research studies, but researchers should use these variables in analyses purposefully with caution. In the sections to follow, we (1) set the context for collection and analysis through considerations of research goals and audiences, (2) provide examples of and critique data collection instruments for collecting race and ethnicity, (3) unpack meanings and inferences when using race and ethnicity variables in analyses, (4) touch on reporting and downstream implications, and conclude with some general recommendations for researchers to consider.

Background and considerations

The study objectives and specific hypotheses should determine data collection and analysis plans. Starting from study inception, the role of race and ethnicity should be considered carefully. Specific questions the researcher may want to ask themselves as they plan their studies include: (1) Does race or ethnicity play a direct role in the study objectives? (2) How might we define race and ethnicity in the population of interest? (3) What is the hypothesized distribution of race and ethnicity in the study population of interest? (4) What is the best way to ascertain these data from the population of interest? (5) How should these variables be used in analyses to address the research question(s)? The response to these questions will help shape data collection and analysis, and as the manuscript progresses, we aim to provide the researcher with options as a starting point for collection and analyses as they consider these questions.

Research question considerations

Beginning with the research question, if the answer to question (1) above points to a direct role for race and ethnicity in the research to accomplish study objectives, the research question should reflect this role: whether it is characterizing an association where race or ethnicity may be a potential confounder or an effect modifier, revealing health disparities or sources of structural racism, or something else. For instance, a study titled “Racial Inequities in Access to Ventricular Assist Device (VAD) and Transplant Persist After Consideration for Preferences for Care” by Cascino, T. M et al. [Reference Cascino, Colvin and Lanfear5] goes beyond descriptive summaries, and it intends to explore a hypothesized systematic bias within heart failure (HF) patients. Aside from some relevant demographic and clinical data, patients’ preferences and desire for receiving advanced therapies were also recorded to examine bias among clinicians toward patients who identify as Black. As a result of careful hypothesis generation, collection of appropriate data, and sound analyses, the authors conclude: “Among patients receiving care by advanced HF cardiologists at VAD centers, there is less utilization of VAD and transplant for Black patients even after adjusting for HF severity, quality of life, and social determinants of health, despite similar care preferences. This residual inequity may be a consequence of structural racism and discrimination or provider bias impacting decision-making.”

In contrast, other studies may not have an aim of evaluating disparities using race or ethnicity directly. For example, “Efficacy and Safety of a Quadruple Ultra-low-dose Treatment for Hypertension (QUARTET USA)” sought to evaluate efficacy of a therapy in a hypertensive population [Reference Baldridge, Huffman and Lazar6Reference Sanuade, Jacobson and Quintana8]. Although race and ethnicity are not central to the study’s main aims, collecting and analyzing these demographic factors is important for providing contextual information in dissemination.

Audience considerations

Aside from considering the goal and the hypothesis of the research collecting these data, it is important to consider the audience for dissemination of the work. The audience may include the funders, regulatory bodies, researchers, clinicians, participants, or the public. Consider participant perspectives first. Transparency in data use, analysis, and reporting is key to building trust and collaboration. There are excellent resources on equitable and inclusive data practices that can be helpful for deciding how to frame research and research findings appropriate for a given audience [9,10].

Funder

As stated in WAC framework [2], in the funding stage we need to map out the relationship between data, resources, and authority within the project: “The power structures involved in a data project need to be revealed, evaluated and sometimes altered in order to get the equity that everyone – not just stakeholders or data workers, but funders too – is looking for.” Thus, in the WAC framework, the design, conduct, analysis, and reporting of a project must cater to and make sense for all those involved, including the funder. We begin this discussion with the funder as one “audience” that the researcher must consider.

We use the National Institutes of Health (NIH) as one example of a common funder among US-based human subjects research studies. The NIH requires both (a) an inclusion of minoritized individuals and underserved population statement (with justification for any exclusions as applicable) and (b) an enrollment table that summarizes enrollment projections overall and racial and ethnic categories. Thus, the principal investigator must collect and report on race and ethnicity, according to the categories set forth by the funder. Refer to Table 1 for an example “Table Shell.” At minimum, regardless of other plans for reporting and goals of the research, the investigator conducting NIH-sponsored research would need to collect these data.

Table 1. Example table shell for National Institutes of Health-funded human subjects’ studies

One concern with this table is that the “Asian” category is broad. According to the Census Bureau, an Asian includes “a person having origins in any of the original peoples of the Far East, Southeast Asia, or the Indian subcontinent including, for example, Cambodia, China, India, Japan, Korea, Malaysia, Pakistan, the Philippine Islands, Thailand, and Vietnam.” We note that countries like Iran, Iraq, Afghanistan, and Armenia do not fit that definition and thus it is difficult to understand where study participants with origins from these countries may identify among these categories required by the funder [Reference Lee and Ramakrishnan11]. We note that there are recent updates forthcoming to the US census and future federal forms whereby “Middle Eastern/North African (MENA)” has been added and further ethnicity and race have been combined into a single set of data elements, which we view as a positive step toward accurate representation of research study participants’ identities [12].

Based on a Pew Research report [13], “Six origin groups – Chinese (24%), Indian (21%), Filipino (19%), Vietnamese (10%), Korean (9%) and Japanese (7%) – accounted for 85% of all Asian Americans as of 2019.” This report depicts the rest of the countries that account for “Asian” categories in research studies in the US and they include 2% or less of the following: Pakistani, Thai, Cambodian, Hmong, Laotian, Taiwanese, Bangladeshi, Nepalese, Burmese, Indonesian, Sri Lankan, Malaysian, Mongolian, Bhutanese, and Okinawan. Thus, even among the Asian categories minoritized individuals exist, adding to the diversity of the lived experiences among those of Asian descent. Unfortunately, by default, this concern is overlooked in most research studies since the tendency is often to use the NIH-specific categories as outlined in Table 1.

The funder’s requirements for racial and ethnic data should align with research goals. NIH guidance mandates de-identification but doesn’t specify collection methods, whether self-reported or from sources like Electronic Health Record (HER). 14While this practice of reporting on racial minority groups in clinical trials has been commonplace since 1993, (“In 1993, the United States (US) Congress passed the National Institutes of Health Revitalization Act as part of an effort to improve enrollment of minority groups in clinical trials.” [Reference Turner, Steinberg, Weeks, Rodriguez and Cullen15]) the challenges of doing so persist to this day.

Clinical research community

When we consider describing a participant cohort or understanding generalizability of the study findings, we often think of the typical “Table 1” of a research article[Reference Hayes-Larson, Kezios, Mooney and Lovasi16]. In fact, standard reporting guidelines for randomized clinical trials [Reference Kane, Wang and Garrard17] (RCT) and observational studies [Reference Von Elm, Altman and Egger18] stress the importance of providing tabulated clinical and demographic characteristics for the study participants. The general reasoning is to help the general research community – those who would need to use and interpret the studies – determine both generalizability and gaps. From the table of the participants, we can start to understand the general characteristics of the study population. Participant race and ethnicity are important variables in this regard, since the lack of diversity may result in limitations on generalizability of the study findings [Reference Hayes-Larson, Kezios, Mooney and Lovasi16].

Figure 1 provides examples of several tables from one of the author’s collaborative portfolios [Reference Persell, Peprah and Lipiszko19Reference Stika, Wisner and George23]. We use these examples to illustrate the study-to-study variability in reports of racial and ethnic data. There is no single or perfect way to present these data, and each of the tables presented here carries its own pros and cons. Multiple considerations and discussions were involved in the development of these tables, and they underwent multiple iterations between coauthors, collaborators, reviewers, and editors prior to their eventual publication in the form illustrated here.

Figure 1. Examples of “Table 1” from several publications.

Here we outline some observations in typical practices (Figure 1).

  1. a. Panel (a) summarizes data from participants in a clinical trial on hypertension. As defined by the U.S. Census Bureau [24], race is categorized as American Indian or Alaska Native, Asian, Black or African American, Native Hawaiian or Other Pacific Islander, or White. However, in this table, the breakdown between unknown and other is not clear. Even though this categorization might be necessary for reporting purposes, the audience could benefit from knowing the exact race of the study participants.[Reference Liu, Ding, Belouali, Bai, Raja and Kharrazi25] For example, the authors could include in the text or a footnote, “those listed as “other” included XXX participants self-reporting YYY, etc.”

  2. b. Panel (b) summarizes data from study participants for a postpartum depression preventive intervention. It is difficult to understand the demographic profile of the study participants from the table alone. Race and ethnicity could be broken down further, as it is argued that individuals of certain races tend to receive less mental health interventions [Reference Liu, Ding, Belouali, Bai, Raja and Kharrazi25]. Additionally, the definition of “minority” or the identities included in the “minority” category should be outlined in the manuscript or table footnote.

  3. c. Panel (c) summarizes data from study participants of a prospective study, comparing physical therapy and usual care for the patients who have visited an emergency care unit. Again, race and ethnicity categories could be expanded or fully described to help the audience understand the participant population more clearly. We do note here that race and ethnicity are not separated into different variables.

  4. d. Panel (d) illustrates participant characteristics in a cohort study evaluating changes in sertraline plasma concentrations across pregnancy and postpartum. Even though there are only six people in the category of “Asian, Black, or multiple,” it may be beneficial to certain audiences [Reference Ng, Norman and Naing26] to better understand this breakdown. In this case, the race variable was used for descriptive purposes only (i.e., to describe the study population), and due to the low cell counts and potential lack of anonymity that may be introduced in reporting these low cell counts in a table like this, the study team decided to report the self-reported racial identities in this way so that the reader would know that there were some participants identifying with these specific categories.

Practices in summarizing race and ethnicity data can exacerbate the marginalization of certain populations. For example, summarizations may be overly simplified (e.g., complex racial and ethnic identities can be grouped into broad categories that do not account for the unique lived experiences of smaller or mixed-race identities) or marginalized groups may develop a feeling of invisibility if they are lumped into larger categories. Additionally, the “White” or “Non-Hispanic White” category is frequently listed first and used as the category to which other categories are compared, reinforcing damaging notions about belonging. In fact, we see this occur in some examples in Figure 1. In line with other guidelines and recommendations, we suggest alphabetical ordering for both data collection and reporting [Reference Flanagin, Frey and Christiansen27]. Refer to other reporting guidelines and discussions in Section 5.1. Considering the many audiences of research studies and appropriate reporting for those audiences is often helpful in determining how to set up data collection tools.

Data collection

Collecting race and ethnicity data from study participants may carry potential pitfalls, such as generating feelings of confusion, exclusion, and marginalization. Participants may wonder why the information is needed and how it will be used, which can diminish their interest and investment in the study. The WAC [2] Data Equity Framework provides important considerations for data collection and sourcing to mitigate these issues.

From a study participant’s perspective, providing information on race and ethnicity can be problematic. The participant is the world’s expert on their identity. The framing of a question regarding these constructs can force the participant to misrepresent their own identity. We use Figure 2 to illustrate example data collection tools that may be used to solicit this information from a participant via a series of questions [Reference Harris, Taylor, Thielke, Payne, Gonzalez and Conde28,Reference Harris, Taylor and Minor29]. There are benefits and drawbacks to each of the options presented. For instance, Example A ensures consistent and concise data collection that would likely be required at minimum for a funder (Refer to Table 1); however, it requires the participant to select exactly one of the available options for race and ethnicity. A participant who does not identify with the race options listed must select from the available options, and someone who identifies with more than one option can register only if they identify with more than one option without the opportunity to list which ones.

Figure 2. Examples of data collection form segments to solicit race and ethnicity information from a study participant.

Other concerns of the participants might be: “What is the difference between race and ethnicity? They seem to be asking the same thing from my perspective.” Or, “These categories feel very general. How can you group people of Indian, Chinese, Japanese, Korean, Thai, Indonesian (and many other) descent into just one broad category?” Possible considerations to address these concerns may be (1) Consider whether it is necessary to break the two constructs of race and ethnicity apart; refer to the aforementioned updates at the federal level on this idea [12] – researchers could consider using a set of fields as in Example B; (2) Ask a question that does not specifically state “ethnicity” in the label…“Do you consider yourself Hispanic?” (as in Example D or E); (3) Consider collecting data on a more granular level noting that it may be possible to collapse into larger, “required” categories later. This is preferred over risking participant difficulty with answering questions.

Identifying race and ethnicity can be particularly problematic for people who identify as American Indian/Alaska Native. Since most people who identify as American Indian/Alaska Native also identify with another racial group, American Indian/Alaska Native identity is often not registered when information on race is collected, resulting in a severe undercount of American Indian/Alaska Native people [30]. Moreover, many who identify as American Indian/Alaska Native also identify as Hispanic; a recent movement to treat Hispanic identity as a racial group rather than an ethnic one would mean that some who identify as American Indian/Alaska Native would be separated from that identity in research studies [30]. The way the US government currently handles race and ethnicity data separates more than three-quarters of people who identify as American Indian/Alaska Native from this identity. Such erasure of this identity exacerbates marginalization of these populations in research.

While pitfalls in collecting race and ethnicity data abound, they can be avoided. In a recent publication from results of an RCT, “Effect of Ivermectin vs Placebo on Time to Sustained Recovery in Outpatients with Mild to Moderate COVID-19: a Randomized Clinical Trial” [Reference Naggie, Boulware and Lindsell31] the authors explained the reason for its collection: participants were asked about race and ethnicity “due to the disparity in the burden of COVID-19 infection carried by marginalized communities based on race and ethnicity.” They also stated how these data were collected: “Participants were asked about ethnicity separately from race and were able to select any combination of race designations, including the option to not report any designation.” The demographic summary of the study population gave an authentic reflection of the race and ethnicity data participants provided. It separated ethnicity and race, reported each race designation as an independent binary variable, and presented race designations alphabetically rather than by number who marked each designation. The authors presented each race designation in the table without collapsing designations and used the opportunity to acknowledge the lack of representation in these designations as a limitation of the study: “while the inclusion criteria allow for a broad study population, this study failed to achieve the level of representation desired for underrepresented populations in terms of racial and ethnic diversity.”

Since collecting racial and ethnic information on study participants is necessary to understand inclusivity and generalizability of research, it is important for study personnel to collect this information accurately and to report it faithfully. Here we reiterate some existing recommendations [Reference Ciolino JDaK32].

  1. 1. Clearly state rationale for collecting race and ethnicity information. Understanding why race and ethnicity information is needed for the study and how it will be used conveys respect for the participant and builds trust. This rationale could look like: “Collecting information on race and ethnicity helps us understand our study participants’ background. It will help people reading the study results understand whether the study may or may not apply to them or their population of interest.”

  2. 2. If providing a list of options:

  3. 3. Allow participants to select all that apply. Participants who identify with more than one racial identity should be given the opportunity to enumerate these.

  4. 4. Allow participants to not respond. When participants are allowed to select “choose not to answer” for these questions or to skip the question(s) altogether, a sense of autonomy is instilled in the participants.

  5. 5. Consider including an “other [specify]” option. Participants who do not identify with any of the listed options should have the opportunity to register a different identity. Note we recommend specification of “other” even though this may provide inconsistencies with data entry and open up opportunities for error but may be worth the effort.

  6. 6. Consider including an “unsure” option. Participants who are unsure of what the question is asking or how to respond should be given the opportunity to register their uncertainty.

  7. 7. Plan to acknowledge limitations to any approaches/results interpretations in any pieces of dissemination.

When participants understand the importance of providing demographic information and can represent their identity accurately, they are more likely to participate. Careful consideration of racial and ethnic data collection in study design ensures participants feel seen and accurately represented.

In summary, race and ethnicity should always be collected in human research studies to allow for assessment generalizability. The reason for “why” and “how” these data are collected should be clear and communicated to the participants. Further, collecting and monitoring these data allows us to uncover potential systemic biases in enrollment to ensure the researchers do not unknowingly exclude participants of a certain background.

Analysis

As aforementioned, it is important to consider the audience and the goal of the research in mind in every stage of a research, from inception to dissemination. With respect to race and ethnicity, researchers should ask themselves “What is the ultimate message we want to convey with respect to these data?” [Reference Ciolino JDaK33] “Are we adjusting or controlling for potential confounding effects, and if so, why might confounding by self-identified race or ethnicity be present in our study?” “Are we concerned about heterogeneity of effects, in particular health disparities by self-identified race and/or ethnicity?” These questions are especially important as researchers look to choose the right statistical analysis strategies. Consider the following examples.

Hypothetical example – controlling for ethnicity versus heterogeneity of effect

To illustrate the statistical meaning behind “controlling for race or ethnicity” we use a hypothetical example. Consider a research study in which study participants are randomized into one of two interventional arms: Arm 1 = cognitive behavioral therapy (CBT); Arm 2 = treatment with antidepressant. In the hypothetical example, participants are followed for 2 years and have study visits every 6 months. The primary outcome of interest is depressive symptom severity score, and the primary aim is to evaluate intervention effects on depressive symptom scores over time. The overarching question is whether treatment with CBT is more/less efficacious than treatment with antidepressants in reducing depressive symptom scores over time. The general frequentist hypothesis framework for statistical analyses would involve the following hypotheses: H0: mean depression score for patients treated with CBT = mean depression score for patients treated with antidepressants, versus H1: mean scores in the two patient groups are not equal.

To compare depressive symptom scores over time, one may use a longitudinal model akin to the following: Y = intercept + time + study arm + error. For simplicity, we do not present all the details of the statistical analyses, variance components, and assumptions on this model, but we use a simplified and heuristic model to outline concepts. The primary statistical hypothesis test focuses on the regression coefficient for the “study arm” variable, treated as an indicator or a factor in the analytic model.

There are times, however, when researchers may need to “control” or adjust for racial or ethnic variables in this model. This would likely be a secondary or exploratory analysis, or the purpose may be to increase precision in estimating the main effect. To determine whether adjustment is merited, one should always go back to the research hypothesis and determine if it makes scientific sense to perform additional analyses controlling for race or ethnicity. In this example, assume we have reason to believe self-identified ethnicity does influence outcome (in this case depression scores), and it is scientifically justified to consider adjusting for ethnicity.

Refer to Figure 3 for a visual depiction of the overarching hypothesis to be tested (A) and for a visual of the underlying assumption when “controlling” for ethnicity (B). We note that in this example, we are excluding the baseline measurement period from the visual, as in theory, the mean outcome value would be the same at baseline across both study arms. In this scenario lies an assumption of an inherent difference in depressive symptom scores for patients of different self-identified ethnic backgrounds, regardless of study arm. The hypothetical model becomes Y = intercept + time + ethnicity + study arm + error, where ethnicity would be treated as a factor or categorical variable within the model. The primary interest in this scenario (B) is still the study arm coefficient estimate and the associated hypothesis test. However, the inclusion of the ethnicity variable would only increase precision of this estimate if the assumption in Figure 3b holds.

Figure 3. Visual of hypothetical example involving controlling for ethnicity versus heterogeneity of treatment effect.

Now if one were to desire to evaluate whether the intervention has varying effects for different ethnic identities (i.e., examine heterogeneity effect), we might respecify the model to something similar to the following (Figure 3c): Y = intercept + time + (ethnicity)*study arm + error. The inclusion of the interaction term between ethnicity and study arm seeks to evaluate evidence of a differing study arm effect within groups of participants self-identifying as specific ethnic categories. In our hypothetical illustration (Figure 3c), those identifying with ethnic category #1 tend to respond better with intervention from arm #2, while those identifying with ethnic category #2 tend to respond better with intervention from arm #1.

Taken together, when considering racial or ethnic variables in analyses we recommend the reader (1) revisit their research strategy and the overall study goals, hypotheses, and design; (2) review the role that self-identified race and ethnicity would play in the analyses and inference and determine the underlying scientific premise and statistical hypotheses of interest, (3) review the distribution of the categorical variable in question prior to conducting the analyses. We illustrate the reasoning for #3 with another example to follow. The reader should keep in mind that even if analyses are a part of the study goals, they may not be possible as group counts may be too low.

Another example (based on a real study) – controlling for race

This example is based on a real study, but details have been removed or modified slightly to retain anonymity. Assume we have a study like the one in Section 4.1, and we would like to control for race in analyses. We pre-specify this plan to adjust for self-identified race in analysis. However, of 88 participants enrolled in our study, four participants self-identify as Asian; one as Black; five as multiple races; and 78 (89%) self-identify as White. Statistically, the largest subgroup provides the most “information” in the model. Larger subgroups tend to result in increased precision and more stable model estimates. If we leave these categories as is and put “race” as a four-category factor into our statistical model, we will most likely end up with nonsensical parameter estimates and confidence intervals (model instability issues) due to low cell counts. Anonymity may also suffer in reporting (refer to Figure 1d for an example of this scenario). Now suppose we decide to collapse these categories to the following: Asian/Black/multiple races: 10 (11%) and White: 78 (89%). If we include a two-level categorical variable in our analytic model, the model convergence or stability issues subside; however, referring back to the underlying assumption in the model (for example, see Figure 3b), adjusting this new variable implies that there is reason to believe that those identifying in the broad category of either Asian, Black, or multiple races would have inherently different depressive symptom scores than those identifying as White. It also implies that those falling into one of those broad groups have depressive symptom score behavior that would be more like one another than they would be to those of the larger subgroup. Referring back to the recommendation on considering adjustment in analyses, the underlying scientific premise of such an analysis would not be justified, and we should not adjust for race in this example. Ultimately, while it may be acceptable practice to “collapse” categories for descriptive purposes (especially due to anonymity issues and perhaps with some caveats), it would very seldom be appropriate to collapse racial categories for analyses.

Similarly, Naggie et al.[Reference Naggie, Boulware and Lindsell31] found that structural racism and resource access likely affect COVID-19 symptom duration. Despite broad inclusion criteria, their study on ivermectin lacked racial and ethnic diversity, revealing barriers that prevent minority participation.

Using race and ethnicity in observational studies

In observational studies, race and ethnicity are often crucial for adjusting potential confounders, which are more prone to bias than in randomized controlled trials (RCTs). These variables often correlate with social and economic factors, potentially confounding key exposure-outcome relationships.

For example, if we were to use observational data to evaluate use of CBT versus treatment with antidepressants for depression using observational data rather than in the RCT from Section 4.1, we would likely need to control for several socioeconomic variables that might be related to both exposure (CBT vs. antidepressants) and outcome (depressive symptom scores). Some of these variables such as health literacy, income, education level, access to care, insurance status, etc. may not be measurable within a dataset or there may be inherently missing data on these variables. While there would not necessarily be an underlying scientific premise that race or ethnicity itself would be one such confounder, this variable may be related to these other important confounders.

It is common practice in observational studies to estimate a propensity score (PS) for exposure and control for that PS in primary analyses evaluating exposure and outcome relationships. PS estimation relies heavily on the types of covariates included in the model. While earlier suggestions advocated including all measured covariates in PS modeling [Reference Rosenbaum and Rubin34], new empirical simulation-based studies recommend a more selective approach. For instance, Brookhart et al. (2006) [Reference Brookhart, Schneeweiss, Rothman, Glynn, Avorn and Stürmer35] argue that only variables related to both exposure and outcome should be included since this increases precision without adding bias. Recent methodologies advocate for careful consideration of which covariates are included – race and ethnicity being prime examples. When deciding whether to include self-identified race or ethnicity as covariates in a PS model, investigators must consider what these variables represent within their analysis. Are they serving as proxies for other influential factors such as insurance status, area deprivation index, or income? Or do they capture additional unmeasured effects essential for understanding exposure-outcome relationships? If self-identified race or ethnicity has direct relevance to the outcome under study, their inclusion can enhance model accuracy; however, if their relevance is indirect – acting through other variables like socioeconomic status – they may not need inclusion unless residual confounding is suspected. Thus, include race and ethnicity in PS models based on their specific role. If they act as proxies for other factors, include those instead. If they capture unique aspects of the exposure-outcome relationship, their inclusion is justified for a thorough analysis.

General considerations for analysis

We emphasize that these hypothetical examples focus on the social construct of self-reported identity, not biological or genetic relevance. They illustrate key considerations for researchers using these variables in analyses.

Aside from the recommendations provided above ([1] revisiting the goals, [2] understanding the scientific rationale, and [3] considering whether inclusion of race or ethnicity is viable in analyses), we list additional recommendations – many of which can already be located in an existing recommendations document – when conducting analyses outlined below [Reference Ciolino JDaK33].

First, when working with race or ethnicity variables in statistical models, small categories can lead to computational challenges, imprecise estimates, and model instability. Privacy concerns may arise when reporting or analyzing these small categories. In such cases, collapsing categories can help maintain anonymity and facilitate analyses. However, it is essential to align categories with the overall goals and consider contingency plans for analyses or reporting.

Additionally, the aforementioned document [Reference Ciolino JDaK33] includes several suggestions and options for researchers to consider with respect to parameterization of race and ethnicity variables in analytic models (assuming there is justification for including them based on overarching goals). They include (1) ensuring that racial and ethnic variables are included as a factor variable (i.e., not an ordinal or continuous variable), and (2) considering use of indicator variables (a series of yes/no or 1/0 variables depending on how the data were collected) for inclusion in models.

When choosing a reference category for categorical variables, avoid defaulting to the largest group, often “White” and “Non-Hispanic” in US studies. Consider using the first or last alphabetical category instead. Be mindful of the inferential implications when analyzing racial and ethnic variables.

Other considerations and suggestions

Reporting

Throughout the present manuscript, we focus on unbiased and equitable data collection and sound analyses. However, it is important to acknowledge the importance of carrying these ideas through to the report as the end result of this data collection and analysis is eventual report and dissemination of research findings.

Summarizing race and ethnicity data can be difficult when the data collection process is unclear. Often, summaries don’t explain how data were gathered, which can lead to biases, especially if the data are sourced rather than self-reported. This lack of clarity can distort results, reflecting the choices and biases of the data handlers, rather than accurately representing the study population.

We refer the reader to the (The Journal of the American Medical Association) JAMA guidance on reporting race and ethnicity and to additional guidance on using inclusive language in reporting [Reference Flanagin, Frey and Christiansen27,36]. We highlight some specifics here noting that the overarching goal of the JAMA guidance is to “…provide recommendations and suggestions that encourage fairness, equity, consistency, and clarity in use and reporting of race and ethnicity in medical and science journals [Reference Flanagin, Frey and Christiansen27].”

We list some general ideas presented in the JAMA guidance in Table 2.

Table 2. Recommendations from JAMA guidance on reporting [Reference Flanagin, Frey and Christiansen27]

Impact of practices when handling race and ethnicity data

While the present manuscript focuses on issues around race and ethnicity in research, the eventual downstream effects of clinical research findings are changes in clinical practice. These can range from using predictive algorithms to understand patients’ risks and prognoses through optimal treatment strategies and precision medicine based on evidence from such studies. Recent literature [Reference Chen, Wang and Williamson37] has illustrated the potentially devastating effects of algorithmic bias in medicine and race-adjusted tools, from triaging illnesses to quality of care received, and ultimately health outcomes for patients belonging to minority populations.

Examples of race-adjusted tools used are ubiquitous, across a multitude of fields and subspecialties, including risk calculators in cardiology, estimated Glomerular Filtration Rate as a measure of kidney function, Vaginal Birth After Cesarean risk calculation, urology risk calculators, and cancer survival prediction rates [Reference Vyas, Eisenstein and Jones38,Reference Cerdeña, Plaisime and Tsai39]. These risk calculators or scores consider race as a covariate to control for potential biological differences across self-reported race subgroups. Using race-adjusted scores has caused the over-consideration of race in referrals, treatment plans, and evaluations [Reference Tandon, Johnson and Diebold20]. To reduce these downstream effects, race-adjusted tools should be avoided as they use race as a biological construct rather than a social construct, yet we know that race is a very poor surrogate for genetics/biology[Reference Collins3,Reference Mersha and Beck4].

Shifting the paradigm away from race-based medicine and algorithms starts with shifting the paradigm in research. Considering and implementing the recommendations we put forth for precision and consistency in data collection, analysis, and reporting in research serve as one step in this process.

Summary and discussion: what can researchers do?

This manuscript aims to start a dialog on collecting and analyzing race and ethnicity data in human research, offering initial options for researchers. It advocates for reducing bias in clinical and research practices and highlights the importance of recruiting diverse participants, despite challenges like cultural barriers and distrust, especially among non-US citizens.

We theorize that by asking themselves the following questions and thinking about them from the outset, researchers will be in a better position to continue to enforce a paradigm shift toward more equitable, diverse, and inclusive research when considering collection and analysis of race and ethnicity data:

  1. 1. Why are we collecting these data? Are they necessary to achieve the study objectives?

  2. 2. Does the way we are collecting these data allow us to accomplish the study aims?

  3. 3. Are there other sets of variables that would better address these aims, or are there additional variables to supplement race and ethnicity that would provide a clearer picture?

  4. 4. Do inferences in analyses make sense in the context of the study aims?

In all cases, researchers should have contingency plans and be adaptive (e.g., when analyses are no longer feasible based on the distribution of data) and be transparent. The daunting endeavor of reducing structural racism in medicine and in research cannot be solved with one study or one single group of investigators or clinicians, but it is a collective effort to move against an embedded set of processes. We hope that this manuscript and recommendations provided here help to continue the forward progress of recent work in this arena.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/cts.2024.632.

Acknowledgments

We would like to acknowledge the Northwestern University Department of Preventive Medicine’s working group on Response to Structural Racism in Research and its Diversity, Equity, and Inclusion Committee for laying much of the foundational work for this manuscript.

We would also like to acknowledge the Justice, Equity, Diversity, and Inclusivity working group within the Biostatistics, Epidemiology, and Research Design Special Interest Group of the Association for Clinical and Translational Science for their support and collaboration on this project. Current membership includes: Susan Perkins, PhD (supported in part by: UM1TR004402); Gina-Maria Pomann, PhD; Alexandra L. Hanlon, PhD; Alicia J. Lozano, MS; Ann Brearly, PhD; Sally Thurston, PhD; Sandy Taylor, PhD; Shari Messinger, PhD; Orrin B. Myers, PhD; Sima Sharghi, PhD; Shokoufeh Khalatbari, MS; Amy Laird, PhD; Jodi Lapidus, PhD; Felicity T. Enders, PhD, MPH; Jareen Meinzen-Derr, PhD MPH; Amanda L. Tapia, DrPH; Jody D. Ciolino, PhD We would also like to acknowledge Clemontina A. Davenport, PhD for her review and comments on the manuscript.

Author contributions

Sharghi, Sima; University of Rochester Medical Center, Biostatistics and Computational Biology Drafting, literature review research, organizing, editing, and finalizing the manuscript. Khalatbari, Shokoufeh; University of Michigan, MICHR: Editing the manuscript, critical feedback. Laird, Amy; OHSU: Editing the manuscript, critical feedback. Lapidus, Jodi; Oregon Health & Science University, School of Public Health: Editing the manuscript, critical feedback. Enders, Felicity; Mayo Clinic, Department of Health Sciences Research, Division of Biomedical Statistics and Informatics: Critical feedback, advice on existing and relevant literature. Meinzen-Derr, Jareen; Cincinnati Children’s Hospital Medical Center, Pediatrics, Biostatistics and Epidemiology: Critical feedback, advice on existing and relevant literature. Tapia, Amanda; Mayo Clinic: Review and Critical feedback. Ciolino, Jody; Northwestern University, Preventive Medicine Critical feedback, advice on relevant and existing literature, suggestions on the relevant tables and figures, overseeing the authors’ contributions and editing.

Funding statement

The project described in this publication was supported in part by the following awards. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health: UL1TR002001 from the National Center for Advancing Translational Sciences (University of Rochester); T32ES007271 (University of Rochester); UL1TR002240 (University of Michigan); UL1TR002240 (Oregon Health and Science University); UL1TR002377 (Mayo Clinic); UL1TR001425 and R01DC018550 (University of Cincinnati College of Medicine); UL1TR001422 (Northwestern University).

Competing interests

None.

References

What is Data Equity and Why Does it Matter?, https://data.org/resources/what-is-data-equity-and-why-does-it-matter/. Accessed March 01, 2023.Google Scholar
THE DATA EQUITY FRAMEWORK, https://weallcount.com/the-data-process/. Accessed March 01,2023.Google Scholar
Collins, FS. What we do and don’t know about ‘race’, ‘ethnicity’, genetics and health at the dawn of the genome era. Nat Genet. 2004;36(S11):S135. doi: 10.1038/ng1436.CrossRefGoogle Scholar
Mersha, TB, Beck, AF. The social, economic, political, and genetic value of race and ethnicity in 2020. Hum Genomics. 2020;14(1):37. doi: 10.1186/s40246-020-00284-2.CrossRefGoogle ScholarPubMed
Cascino, TM, Colvin, M, Lanfear, DE, et al. Racial inequities in access to VAD and transplant persist after consideration for preferences for care: a report from the registry evaluation for vital information for VADs in ambulatory life (REVIVAL). Circ Heart Fail. 2023;16(1):e009745.CrossRefGoogle Scholar
Baldridge, AS, Huffman, MD, Lazar, D, et al. Efficacy and safety of a quadruple ultra-low-dose treatment for hypertension (QUARTET USA): rationale and design for a randomized controlled trial. Am Heart J. 2022;254:183193.CrossRefGoogle ScholarPubMed
Huffman, MD, Baldridge, AS, Lazar, D, et al. Efficacy and safety of a four-drug, quarter-dose treatment for hypertension: the QUARTET USA randomized trial. Hypertens Res. 2024;47(6):110.CrossRefGoogle ScholarPubMed
Sanuade, OA, Jacobson, TA, Quintana, A, et al. Process evaluation of a double-blind randomized controlled trial to assess the efficacy and safety of a quadruple Ultra-low-dose treatment for hypertension within a federally qualified health center network (QUARTET USA). J Am Heart Assoc. 2024;13(1):e032236.CrossRefGoogle Scholar
The Do No Harm Project, https://www.urban.org/projects/do-no-harm-project. Accessed March 01,2023.Google Scholar
Six Ways to Bring Empathy into your Data, https://nightingaledvs.com/six-ways-to-bring-empathy-into-your-data/. Accessed Febreury 20, 2023.Google Scholar
Lee, J, Ramakrishnan, K. Who counts as asian. Ethnic Racial Stud. 2020;43(10):17331756.CrossRefGoogle Scholar
Revisions to OMB’s Statistical Policy Directive No. 15: Standards for Maintaining, Collecting, and Presenting Federal Data on Race and Ethnicity from the Office of Information and Regulatory Affairs, Office of Management and Budget, Executive Office of the President https://www.federalregister.gov/documents/2024/03/29/2024-06469/revisions-to-ombs-statistical-policy-directive-no-15-standards-for-maintaining-collecting-and. Accessed June 20, 2024.Google Scholar
Key facts about Asian origin groups in the U.S. Pew Research Center, Washington, D.C., https://www.pewresearch.org/short-reads/2021/04/29/key-facts-about-asian-origin-groups-in-the-u-s/. Accessed April 29, 2023.Google Scholar
What Are the Requirements for Reporting on Human Study Participant Age, Sex/Gender, and Race/Ethnicity?, https://nexus.od.nih.gov/all/2021/04/29/what-are-the-requirements-for-reporting-on-human-study-participant-age-sex-gender-and-race-ethnicity/. Accessed April 20, 2023.Google Scholar
Turner, BE, Steinberg, JR, Weeks, BT, Rodriguez, F, Cullen, MR. Race/ethnicity reporting and representation in US clinical trials: a cohort study. The Lancet Regional Health-Americas. 2022;11:100252.CrossRefGoogle Scholar
Hayes-Larson, E, Kezios, KL, Mooney, SJ, Lovasi, G. Who is in this study, anyway? Guidelines for a useful table 1. J Clin Epidemiol. 2019;114:125132.CrossRefGoogle ScholarPubMed
Kane, RL, Wang, J, Garrard, J. Reporting in randomized clinical trials improved after adoption of the CONSORT statement. J Clin Epidemiol. 2007;60(3):241249.CrossRefGoogle ScholarPubMed
Von Elm, E, Altman, DG, Egger, M, et al. The strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. Int J Surg. 2014;12(12):14951499.CrossRefGoogle ScholarPubMed
Persell, SD, Peprah, YA, Lipiszko, D, et al. Effect of home blood pressure monitoring via a smartphone hypertension coaching application or tracking application on adults with uncontrolled hypertension: a randomized clinical trial. JAMA Netw Open. 2020;3(3):e200255e200255.CrossRefGoogle ScholarPubMed
Tandon, SD, Johnson, JK, Diebold, A, et al. Comparing the effectiveness of home visiting paraprofessionals and mental health professionals delivering a postpartum depression preventive intervention: a cluster-randomized non-inferiority clinical trial. Arch Women’s Mental Health. 2021;24(4):629640.CrossRefGoogle ScholarPubMed
Abrams, LS, Dornig, K, Curran, L. Barriers to service use for postpartum depression symptoms among low-income ethnic minority mothers in the United States. Qual Health Res. 2009;19(4):535551.CrossRefGoogle ScholarPubMed
Kim, HS, Ciolino, JD, Lancki, N, et al. A prospective observational study of emergency department-initiated physical therapy for acute low back pain. Phys Ther. 2021;101(3):pzaa219.CrossRefGoogle ScholarPubMed
Stika, CS, Wisner, KL, George, AL Jr, et al. Changes in sertraline plasma concentrations across pregnancy and postpartum. Clin Pharmacol Therap. 2022;112(6):12801290.CrossRefGoogle ScholarPubMed
Liu, S, Ding, X, Belouali, A, Bai, H, Raja, K, Kharrazi, H. Assessing the racial and socioeconomic disparities in postpartum depression using population-level hospital discharge data: longitudinal retrospective study. JMIR Pediatr Parent. 2022;5(4):e38879.CrossRefGoogle ScholarPubMed
Ng, CH, Norman, TR, Naing, KO, et al. A comparative study of sertraline dosages, plasma concentrations, efficacy and adverse reactions in Chinese versus Caucasian patients. Int Clin Psychopharmacol. 2006;21(2):8792.Google Scholar
Flanagin, A, Frey, T, Christiansen, SL, AMA Manual of Style Committee. Updated guidance on the reporting of race and ethnicity in medical and science journals. JAMA. 2021;326(7):621627. doi: 10.1001/jama.2021.13304.CrossRefGoogle ScholarPubMed
Harris, PA, Taylor, R, Thielke, R, Payne, J, Gonzalez, N, Conde, JG. Research electronic data capture (REDCap)—a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform. 2009;42(2):377381.CrossRefGoogle ScholarPubMed
Harris, PA, Taylor, R, Minor, BL, et al. The REDCap consortium: building an international community of software platform partners. J Biomed Inform. 2019;95:103208.CrossRefGoogle ScholarPubMed
Naggie, S, Boulware, DR, Lindsell, CJ, et al. Effect of ivermectin vs placebo on time to sustained recovery in outpatients with mild to moderate COVID-19: a randomized clinical trial. JAMA. 2022;328(16):15951603.CrossRefGoogle ScholarPubMed
Ciolino JDaK, KN. Analysis Considerations for Race, Ethnicity, and Gender Variables. DigitalHub. Galter Health Sciences Library & Learning Center; 2022. doi: 10.18131/g3-5sa2-wb83.Google Scholar
Ciolino JDaK, KN. Considerations for Capturing of Race, Ethnicity, and Gender Data. DigitalHub. Galter Health Sciences Library & Learning Center; 2022. doi: 10.18131/g3-j4wr-gf57.Google Scholar
Rosenbaum, PR, Rubin, DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):4155.CrossRefGoogle Scholar
Brookhart, MA, Schneeweiss, S, Rothman, KJ, Glynn, RJ, Avorn, J, Stürmer, T. Variable selection for propensity score models. Am J Epidemiol. 2006;163(12):11491156. doi: 10.1093/aje/kwj149.CrossRefGoogle ScholarPubMed
Inclusive Language for Reporting Demographic and Clinical Characteristics. https://jamanetwork.com/pages/inclusive-language. Accessed April 01, 2023.Google Scholar
Chen, RJ, Wang, JJ, Williamson, DF, et al. Algorithmic fairness in artificial intelligence for medicine and healthcare. Nat Biomed Eng. 2023;7(6):719742.CrossRefGoogle Scholar
Vyas, DA, Eisenstein, LG, Jones, DS. Hidden in plain sight—reconsidering the use of race correction in clinical algorithms. Mass Medical Soc. 2020;383(9):874882.Google ScholarPubMed
Cerdeña, JP, Plaisime, MV, Tsai, J. From race-based to race-conscious medicine: how anti-racist uprisings call us to act. Lancet. 2020;396(10257):11251128.CrossRefGoogle ScholarPubMed
Figure 0

Table 1. Example table shell for National Institutes of Health-funded human subjects’ studies

Figure 1

Figure 1. Examples of “Table 1” from several publications.

Figure 2

Figure 2. Examples of data collection form segments to solicit race and ethnicity information from a study participant.

Figure 3

Figure 3. Visual of hypothetical example involving controlling for ethnicity versus heterogeneity of treatment effect.

Figure 4

Table 2. Recommendations from JAMA guidance on reporting [27]

Supplementary material: File

Sharghi et al. supplementary material

Sharghi et al. supplementary material
Download Sharghi et al. supplementary material(File)
File 14.3 KB