Beyond Traditional Conceptions of Policing and Crime Control: New Metrics to Evaluate Police Performance and Improve Police Legitimacy

Dennis P. Rosenbaum

doi:10.1017/9781009612784

1 Introduction

History is replete with efforts to reform American policing, yet this institution continues to suffer from a “crisis of legitimacy.” This Element begins by defining police legitimacy and the various ways that legitimacy has been jeopardized by police practices. The costs associated with the current model of policing are summarized, including the adverse impact on community members, police officers, and public safety. Some constitutionally protected populations are particularly vulnerable to police mistreatment, including people of color, the LGBTQ+ community, persons with mental illness, victims of crime, the homeless, and youth.

This review is followed by an analysis of reform efforts and why they have been ineffective in changing street-level policing and increasing public trust in the police. Weak implementation has contributed to this failure, but the primary barrier to success has been the overreliance on enforcement statistics to evaluate police performance. When evaluating the police, our society needs to “measure what matters” to the public, namely, how they are treated by the police. Giving voice to service recipients is critical for improving service and reducing any biases in treatment.

Hence, this Element offers a new, evidence-based approach to police innovation that should yield significant changes in police culture and improvements in police-community relations. New systems of measurement are described that focus on procedural justice and effective communication skills during police-public encounters. These reforms will require the routine collection, analysis, and utilization of data, along with advanced data analytics and artificial intelligence, to assess differences in policing across different types of calls for service, geographic areas, police units, supervisors, individual officers, and community members. Thus, new approaches to analyzing new datasets are proposed to advance our knowledge of policing and our understanding of police-public encounters. Bringing together two camps in criminology, this framework offers a clear link between process and outcome metrics, between officers’ treatment of the public and public safety.

Policing scholars have been outstanding in measuring specific constructs, like procedural justice, but the collection of good data will have absolutely no impact on police behavior or police legitimacy unless those data are applied to police organizational practices. Translational criminology requires that we translate evidence into practice. Hence, this Element proposes evidence-based organizational changes that use new data to incentivize “good policing.”

Finally, this Element proposes “big science” to test the feasibility of this model prior to full-scale implementation. A proof-of-concept demonstration, with rigorous evaluations in multiple cities, will provide a road map for institutionalizing a national system of performance measurement and accountability. This level of innovation, with strong implementation fidelity, should increase procedural justice, increase police legitimacy, strengthen police-community relations, and increase public safety throughout the United States.

2 Police Legitimacy

Over a century ago, Max Weber conceived of legitimacy as something that people with power and authority must earn from those who are expected to follow them (Waeraas, 2009). In the policing field, legitimacy is judged, to a large extent, by whether the public believes that the police will act in the best interest of the community and whether they feel obligated to obey legal authorities (Tyler, 2006, 2014; Tyler & Jackson, 2013). Thus, police authority is not defined entirely by the badge, gun, or their legal powers but is also based on the public’s perception of them and is authorized by the “consent of the governed.” Police legitimacy is defined by the hearts and minds of the public.

Along these lines, the President’s Task Force on 21st Century Policing (2015) opened its report by underscoring the importance of public trust: “Trust between law enforcement agencies and the people they protect and serve is essential in a democracy. It is key to the stability of our communities, the integrity of our criminal justice system, and the safe and effective delivery of police services” (p. 1).

3 The Cost of Modern Policing

Emerging from the southern slave patrols in 1704 (Dulaney, 2015; Reichel, 2022), American policing has an ugly history of brutality and political corruption that repeatedly undermined the legitimacy of the institution. Officers who worked for city police in the nineteenth century had unlimited authority to administer “street justice” and were part of a corrupt system of government (Walker, 1977). After the Civil War, the police were called upon to enforce “Jim Crow” laws for nearly 100 years. As a result, police brutality, deviance, and racial bias have been the subject of critical analysis for more than a century (Johnson, 2003; Kappeler et al., 1998; Nelson, 2000).

Essentially, American policing failed to follow some of the basic principles of modern policing as articulated by Sir Robert Peel when the London Metropolitan Police Force was created in 1829. In terms of legitimacy, the Peelian principles emphasized the importance of “policing by consent” and gaining the public’s respect by minimizing use of force and arrests and maximizing crime prevention. That has not happened. Consequently, we have seen a rapid growth in publications that criticize police militarization and police violence, such as Rise of the Warrior Cop (Balko, 2014), Our Enemies in Blue (Williams, 2015), Police State (Spence, 2015), and many others. Public dissatisfaction with police performance has also resulted in calls to “defund” and even “abolish” the police among social justice advocates (Purnell, 2021).

The costs of police misconduct and biased enforcement against vulnerable members have been significant, including the adverse effects of investigatory stops, use of force, arrests, and general mistreatment of the public. This has resulted in anti-police movements, as well as negative effects on public safety, officer safety, and taxpayers.

To deter future crime, we want the police to enforce the law, but as Lum and Nagin (2017, p. 344) point out, “arrests are costly to all involved – the police who make them, the public who must pay for the ensuing punishment, and the individual who must endure that punishment.” Local and county law enforcement cost taxpayers roughly $100 billion per year (Buehler, 2021). However, that figure does not include the cost of human life or police misconduct lawsuits. In the United States, there are more than 1,000 fatal police shootings each year (Washington Post, 2024), and lawsuits cost taxpayers billions of dollars in misconduct settlements (Alexander et al., 2022).

Aggressive policing and misconduct are not only financially expensive – they are also costly in terms of police legitimacy and public safety. As discussed in this Element, when the public does not view the police as legitimate authority figures who “do the right thing,” a number of adverse consequences occur, and these effects can undermine public safety.

3.1 Bias Against Vulnerable Groups

I am most concerned about the unconstitutional treatment of the vulnerable classes. The 14th Amendment of the US Constitution guarantees “due process” and “equal protection” of the law for all groups, including those defined by race, color, sex, gender identity, age, religion, mental health, disability, national origin, sexual orientation, and other at-risk groups. Clearly, not all Americans are treated equally by the police. Today, police critics are insisting on fairer and more respectful treatment of vulnerable, constitutionally protected classes. Mistreatment has been highlighted recently in our national dialogue, so we must understand better how these vulnerable groups are being treated and what can be done to improve the police services they receive.

The history of racism in America is well documented in classic works such as The Souls of Black Folk (Du Bois, 1903), The Nature of Prejudice (Allport, 1954), and The Warmth of Other Suns (Wilkerson, 2011). Racial bias during enforcement is nothing new and is also well documented. The National Advisory Commission on Civil Disorders (1968) – called the Kerner Commission – attributed the national race riots of 1967 to white racism and blamed police mistreatment of Black Americans as the spark that started the riots. The Commission recommended that agencies “[r]eview police operations in the ghetto to ensure proper conduct by police officers, and eliminate abrasive practices” (p. 8). In the context of racial inequality and poverty and growing frustration and anger over discrimination, white officers making arrests and engaging in brutality against Black community members was enough to trigger numerous protests and riots (Evans, 2021). The reports from multiple commissions, beginning with the 1919 Chicago riots, all reached a similar conclusion.

The problem of racial discrimination by the police has continued in recent decades. Racial profiling and violations of civil liberties during the drug war in the 1980s and 1990s contributed significantly to the problem of disproportionate arrest and incarceration of Black community members (Rosenbaum, 1993). The incidents of extreme police violence against Black people that have resulted in protests over the past sixty years do not capture the widespread nature of the problem today. National data collected through the Police-Public Contact Survey have consistently revealed that Blacks and Hispanics are more likely than whites to experience either the threat of force or actual force during their contact with the police, more likely to receive enforcement actions, and more likely to feel that the officer engaged in misconduct, such as slurs, bias, or sexual misbehavior (Harrell & Davis, 2020; Tapp & Davis, 2022).

A meta-analysis of twenty-seven separate datasets shows clearly that suspects of color have been arrested at higher rates than white suspects (Kochel et al., 2011). Furthermore, Black and Hispanic community members are shot and killed by the police at a much higher rate than white community members (Washington Post, 2024). Even when guns are not used, an analysis of 1,000 deaths shows that Black persons, who comprise 12 percent of the US population, represented one-third of all deaths involving physical force by the police across many cities (Dunklin et al., 2024).
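
The size of the disparity implied by the last two figures can be made explicit with a simple representation ratio: a group's share of events divided by its share of the population. The short Python sketch below is only an illustration of that arithmetic. The function name is hypothetical, and the calculation is a crude descriptive ratio that does not adjust for differences in exposure to police contact; it is not the method used in the cited analysis.

```python
def representation_ratio(share_of_events: float, share_of_population: float) -> float:
    """Return how over- or under-represented a group is among events.

    A value of 1.0 means the group's share of events equals its share of
    the population; values above 1.0 indicate over-representation.
    """
    if not 0 < share_of_population <= 1:
        raise ValueError("share_of_population must be in (0, 1]")
    if not 0 <= share_of_events <= 1:
        raise ValueError("share_of_events must be in [0, 1]")
    return share_of_events / share_of_population


# Figures quoted in the text: one-third of deaths, 12 percent of the population.
ratio = representation_ratio(1 / 3, 0.12)
print(f"Representation ratio: {ratio:.2f}")
```

On these figures, the share of deaths is a little under three times the population share.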

Some have argued that the racial disparities in enforcement are largely the result of the concentration of crime in low-income neighborhoods where people of color are overrepresented rather than expressions of racial prejudice by the police (Engel & Swartz, 2014). Taking a longitudinal look at the criminal histories of roughly 1,000 youth over time, Sampson and Neil (2024) conclude that persistent racial disparities in arrest rates can be attributed to “concentrated and cumulative forms of neighborhood disadvantages and advantages that are spatially concentrated” (p. 198). Clearly, higher crime rates in underresourced communities have resulted in more concentrated policing and arrests, regardless of whether the arrestees have a history of criminality or the police are expressing racial bias. However, for this Element, my point is that the methods of aggressive policing are real and enforcement is problematic regardless of the cause.

There are long-term consequences of enforcement. In addition to death or injury, police enforcement over time has resulted in the mass incarceration of Black people (Alexander, 2011). Critics have also argued that the criminal justice system, including arrest and sentencing disparities, reinforces a caste system of racial control that produces second-class citizens with fewer rights (Alexander, 2011; Lerman & Weaver, 2014). There is also an extensive literature on the numerous collateral consequences of involvement in the justice system, especially the adverse effects of monetary sanctions, which are most detrimental to disadvantaged populations and can contribute to re-offending as they accumulate (see Ostermann et al., 2024, for a review).

Police traffic stops offer the best example of how police enforcement has adverse effects on vulnerable populations. American law enforcement has placed enormous emphasis on “stop and frisk” as a tool for fighting crime (Meares, 2014; White & Fradella, 2016), but racial disparities in police decision-making during traffic stops have been documented repeatedly across dozens of agencies. Black drivers are stopped more often, stopped more often for nonsafety reasons (“pretextual stops”), searched more often, and subjected to force more often than white drivers (Graham, 2024; Graham et al., 2024; Langton & Durose, 2013; Pierson et al., 2020). Young Black and Hispanic males are particularly at risk of citations, searches, arrests, and use of force (Engel & Calnon, 2004). We must also recognize that traffic stops can be costly in other ways, such as the price of fines, fees, court summons, and incarceration, which can contribute to poverty (Alabama Appleseed Center for Law & Justice, 2023). Thus, the Center for Policing Equity has emphasized that the effects of police stops are compounding as “disparities at each step increase the risks of harm at subsequent decision points” (Graham, 2024, p. 1).

As a psychologist, I would like to underscore the fact that investigatory stops are embarrassing and upsetting for the drivers, who are often questioned and searched in front of family members, friends, or bystanders. As Epp and his colleagues (2014) document, drivers are often asked where they are going, why they need to go there, and what they are carrying in their glove compartment or trunk. Flashlights run through the windows in every conceivable section; people are asked to get out; personal items are strewn on the ground nearby. The psychological impact can be enormous, resulting in resentment, loss of trust, and growing anger toward the police, especially in communities of color.

In general, we now know from research that contact with the police can have adverse mental health consequences, especially for people of color, youth, and transgender populations (DeVylder et al., 2017). Police enforcement actions can have a range of adverse effects on youth, including increased school absenteeism (Geller & Mark, 2022), increased recidivism (Klein, 1986), and reduced options for gainful employment (Bushway, 1996). These effects are especially large for young Black males, as they tend to be heavily policed, and thus hold very negative views of the police (Gau & Brunson, 2010; Geller, 2021; Suddler, 2019). The negative perceptions in the Black community are the result of both direct contact with the police (Tapp & Davis, 2022) and vicarious experience, such as conversations among friends and family (Rosenbaum et al., 2005).

Poor communities of color have long been the focus of police intrusion in large cities, as police seek to reduce crime. Unfortunately, we have known for half a century that police work often involves officers seeking to confirm their prejudicial suspicions about criminality (Rubinstein, 1973) and create a case of probable cause and enforcement using their own set of rules (Skolnick, 1975). Police are usually interacting with complete strangers, and in these settings, human beings feel a need to make snap judgments based on limited information – judgments that are often inaccurate (Gladwell, 2005). In general, people tend to attribute certain attitudes, beliefs, and behavioral traits to strangers (Baum et al., 1985), and this prejudice can lead to discrimination. For decades, psychologists have studied how we enhance our self-identity and preserve the status quo by using social stereotypes that emphasize the positive attributes of our “ingroup” while devaluing the attributes of the “outgroup” (Allport, 1954; McGarty et al., 2002; Tajfel & Turner, 1979). Today, we distinguish between “explicit” and “implicit” bias, with the understanding that all humans suffer from implicit bias that can lead to discrimination, and the police are no exception (see Fridell, 2008, 2017).

Today, with the increased polarization of political attitudes (Edsall, 2024), I am concerned about the national rise in expressions of prejudice toward outgroups and how this trend might affect police-community interactions. For example, we see a nationwide effort to abolish diversity, equity, and inclusion (D.E.I.) programs in education, as well as a broader anti-“wokeism” movement – organized efforts that run counter to social justice for all vulnerable groups (see Confessore, 2024). With the rise of homophobia, transphobia, antisemitism, xenophobia (around immigration), and other forms of prejudice, we must remember that police officers are susceptible to these attitudes as well. A study of 220,000 officers from 98 of the 100 largest local agencies found that police officers tend to be more conservative than the civilians they serve (Ba et al., 2023). Thus, those interested in changing police culture should be prepared for some backlash, such as the “Blue Lives Matter” movement, as well as the possibility of additional forms of discrimination.

Thus, to better understand the needs of all vulnerable and constitutionally protected classes, we need to consistently create space for them to voice their concerns and be prepared to hear about their interactions with the police. Furthermore, the community wants evidence that progress is being made to reduce these problems, as too much ambiguity remains about how to measure progress. Here, I will argue that, in the interest of impartial policing and transparency, cities should collect and publicize data on the quality of police services, with breakdowns by the demographic characteristics of the community members being served.
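
As a rough illustration of the kind of demographic breakdown proposed above, the following Python sketch averages contact-survey ratings within each group and reports the number of responses alongside each mean, so that small samples are visible to the reader. Everything in it is a hypothetical placeholder: the group labels, the 1 to 5 rating scale, the records, and the function name are assumptions made for the example, not data or procedures from any study cited in this Element.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical survey records: (group label, rating of officer respect, 1-5).
responses = [
    ("group_a", 4),
    ("group_a", 5),
    ("group_b", 2),
    ("group_b", 3),
    ("group_b", 4),
]


def rating_summary_by_group(records):
    """Return {group: (number of responses, mean rating)}."""
    ratings = defaultdict(list)
    for group, rating in records:
        ratings[group].append(rating)
    return {group: (len(vals), mean(vals)) for group, vals in ratings.items()}


summary = rating_summary_by_group(responses)
for group, (n, avg) in sorted(summary.items()):
    print(f"{group}: n={n}, mean rating={avg:.2f}")
```

A published version of such a table would also need safeguards that this sketch omits, such as suppressing cells with very few respondents to protect confidentiality.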

3.2 The Cost to Police Officers

My criticism here is focused largely on the failure of police reform, not on the majority of police officers who are simply trying to do a difficult job within their existing organizational structure. After four decades of observation, I fully understand that police work can be very difficult, and it is made more difficult today because of the public outcry and calls for greater police accountability. On the job, officers can face violent and traumatic incidents that are hard to forget, from homicides to traffic fatalities to suicides. As a group, police officers are more likely to commit suicide than die on the job (Stanton, 2022), and they have a suicide rate higher than civilians (McAward, 2022). The national trend to introduce officer wellness programs (U.S. Department of Justice, 2024) is a clear indication of the stressfulness of police work. Hopefully, the new management model proposed here will help to reduce some of the stress of police work.

4 The Ineffectiveness of Police Reform

From crime commissions to consent decrees, the United States has sought to increase police accountability and transparency for nearly a century. Some improvements have been noted (Skogan & Frydl, 2004), but the general mistreatment of the public remains problematic. We have summarized the problem in one sentence: “American policing progressed from an unregulated politicized entity that eventually enforced ‘Jim Crow’ laws to an organized, quasi-military bureaucracy focused on law enforcement” (Rosenbaum & McCarty, 2017, p. 71).

The reforms led by August Vollmer and O. W. Wilson in the twentieth century were designed to insulate the police from long-standing corruption and improve effectiveness in fighting crime. I have summarized the professional model of policing in this way: “Uniforms, military ranks, written policies and procedures, centralized control and command, highly trained specialized units (including detectives), motorized patrols, and modern technology were expected to eliminate corruption, professionalize the police, and above all, prevent crime through rapid response, random patrol, and forensic investigations” (Rosenbaum, 2007, p. 14). This professional model seems to have reduced corruption, but has been ineffective in fighting crime or building police legitimacy. Within this framework, police organizations have used symbols, dress, and ceremonies to maintain their legitimacy and promote their image and values as fair, professional crime fighters (Crank & Langworthy, 1992; Gau, 2014). However, as Gau (2014) notes, these images and values “are decoupled from task performance,” and thus are essentially myths created to ensure police legitimacy.

In fact, this detached, quasi-military model of policing has contributed directly to the crisis of legitimacy that continues to fester. From Rodney King in 1991 to Tyre Nichols in 2023, the beatings and deaths have continued and calls for police reform have intensified (e.g., Balko, 2014; Morrison et al., 2023). The typical requests for more “accountability” and “transparency” will be insufficient. I am pleased that the Police Executive Research Forum (PERF) has established new guidelines for police use of force, especially when restraining persons who are having a medical, mental, or drug crisis (Seewer & Dunklin, 2024), but I remain skeptical that these guidelines will significantly change behavior on the streets without changes to the police culture and performance standards.

For decades, policing scholars have associated excessive force and mistreatment of the public with this militaristic culture and the war-like mentality used to combat crime and drugs (e.g., Skolnick, 1975; Skolnick & Fyfe, 1993). Yes, there are a few rotten apples, but mistreatment of the public by a large segment of the police force is associated with the social norms within the police culture. Psychological and physical training to be a warrior with a “bulletproof mind” is important for surviving in battle (Grossman & Christensen, 2008), but the job of police officers is very different. Most police officers never fire their weapon during their entire career and attempts to kill them are extremely rare. Yet the training for new police recruits in the United States exposes them to videos showing officers being ambushed or killed during traffic stops (Kirkpatrick et al., 2021), reinforcing the mindset of being hypervigilant and suspicious of everyone they encounter. Academy evaluations with pretests and posttests have shown that recruits develop a significantly greater desire to use physical force to solve problems and have less respect toward the community after six months of training (Rosenbaum & Lawrence, 2017). Another pre–post study found that recruits become more inclined to believe that police misconduct and the code of silence are less serious problems after academy training (Schuck & Rabe-Hemp, 2021). Thus, from the start, the socialization process strengthens their cultural identity as tough warriors who must be on guard against hostile community members, demand obedience, use force to solve problems, and maintain secrecy within the group. Too often, this socialization process undermines, rather than builds, community trust.

Thus, police reform must directly confront this culture and style of policing to achieve widespread public trust. In this Element, I argue that the focus and priorities of police organizations must change. Lum and Nagin (2017) make a compelling case that policing in a democratic society should be built on two basic principles. The first is that crime prevention should be prioritized over arrests, consistent with Peelian principles. The second is that “citizens’ views about the police and their tactics for preventing crime and disorder matter independently of police effectiveness” (p. 339). The authors contend that these two principles are equally important and “neither has standing to trump the other.” Consistent with Lum and Nagin’s conclusion, I am calling on reformers and police researchers to acknowledge the importance of public perceptions of the police and do something about it to advance knowledge and “evidence-based policing” (cf. Sherman, 2013). My position is also consistent with Klose’s (2024) revised definition of evidence-based policing that includes “community values, preferences, and circumstances” in the decision-making process.

Since the 1980s, there have been many reforms and innovations in policing (Rosenbaum, 2007). Most of these reforms have been ineffective, and some have even been harmful. I will give some attention to reforms that are relevant to the organizational and research agenda proposed here, seeking to give more attention to policing processes rather than policing outcomes, although the two can be, and should be, connected.

Since the death of George Floyd, there have been nearly 300 state laws that seek to reform the police in various ways, including more civilian oversight, limitations on use of force, anti-bias training, and alternatives to arrest in mental health cases (Monnay, 2022). There has been considerable pushback by law enforcement personnel against these changes, and the general focus of these reforms is the misuse of force, not other police-public interactions.

4.1 Police Accountability and Public Safety

Individual cities have tried to control behavior through discipline and even firing officers for misconduct, but with little success. For most agencies, less than 10 percent of the misconduct complaints are investigated (Kelly & Nichols, 2020). The cases investigated by Internal Affairs are usually cleared and police reports often lack important details. When deaths are involved, officers are usually exonerated by reports from medical examiners and coroners (Dunklin et al., 2024). When punishments are recommended, strong union contracts allow appeals to arbitration, which rarely result in sanctions. In one New York Times review, roughly half of the fired officers were reinstated (Barker et al., 2021). Even most police officers agree that accountability systems are not working. In our national survey of more than 13,000 police officers in 89 large agencies, only 1 in 5 officers felt that “officers who do a poor job are held accountable” (Cordner, 2017). With qualified immunity, police officers are not financially responsible for their misconduct (Yancey-Bragg, 2023). Furthermore, thousands of officers who have been fired by one agency are rehired by another, as decertification systems are rarely in place (Stecklow, 2024).

Although ineffective, this entire process, including public outcry, leaves many officers feeling mistreated, which affects their work-related attitudes and behavior. Our national survey found that when officers are upset about organizational injustice, such as unfair discipline, they are less committed to organizational goals, express less job satisfaction, and are less willing to comply with agency rules (Rosenbaum & McCarty, 2017). In the larger context of anti-police sentiment in the media and public protests, many police officers have become demoralized (Bello, 2014). In a national survey, police officers expressed anger toward the public and fear of being unfairly disciplined or fired (Pew Research Center, 2017). These sentiments resulted in significant “de-policing” after the George Floyd protests in 2020 (Nix et al., 2024). Today, many police prefer to “lay low” and avoid unnecessary risks, which can reduce public safety. For example, in New Jersey, a huge drop in traffic-related citations by the state police (due to criticism of their agency) was followed by a dramatic increase in traffic accidents (Tully, 2024). Thus, we can see how police behavior and public safety can be negatively influenced when reforms are not handled properly. Later, I will revisit the issue of police effectiveness in fighting crime.

The complaints filed by community members provide insight into police-public interactions and how these encounters can escalate. One of the most common complaints is that the officer was discourteous, including using offensive language (e.g., Seron et al., 2006; Terrill & Reisig, 2003). But as Seron and colleagues (2006) note after reviewing complaints against the New York Police, “alleged misconduct often escalates into behavior that is asserted as having been abusive or unnecessarily forceful” (p. 928). This might include verbally threatening civilians or using force ranging from pushing to beating. Research has shown that police respond in a punitive manner when civilians fail to comply or are judged to be disrespectful of police authority (Klinger, 1997; Mastrofski et al., 2002; Worden & Shepard, 1996). The real problem is that, aside from these few studies, we know very little about what transpires during daily police-public encounters in American cities. Here I propose to fix this problem.

After negative media coverage of deadly force incidents, many police agencies have tried to increase accountability by changing their use of force policies, conducting more thorough internal investigations of these incidents, creating early intervention systems (EIS) to identify officers at risk, and offering new training on when force is appropriate. While these reforms have been helpful in some ways (see Walker & Archbold, 2020), they have serious limitations. There are many reasons why these internal changes have had limited impact on street-level policing and public trust, including internal resistance to change, a lack of expertise needed to analyze data, deficiencies in data quality, lack of objectivity when evaluating other officers, weak training, and administrative turnover.

Early intervention systems (EIS) are now popular and have potential for identifying and intervening with officers who are at risk of misconduct (Walker & Archbold, 2020). However, we do not have a standard set of metrics to define “at risk,” and those currently used are not good predictors of problematic behaviors (Russek & Fitzpatrick, 2021; Worden et al., 2014). As La Vigne (2022) notes, even if an EIS uses fancy algorithms to identify patterns, relying on existing police records will lead to “garbage in, garbage out.” My own experience as a consent decree monitor suggests that EIS personnel are forced to rely on their own judgment or “rules of thumb” rather than “predictive analytics.” Also, supervisors are often free to do whatever they want to satisfy the requirements of an EIS intervention. Often, they do very little beyond having a brief talk with the officer.

I want to point out that focusing on the behavior of a few “rotten apples” doesn’t help us understand the problematic behavior patterns of a larger segment of the police force as shown in contact surveys. To be clear, excessive force incidents are extremely rare, and minor force incidents often go unreported due to the “code of silence” or they are reported with justification from the officer’s perspective. Furthermore, the civilians who file complaints are not representative of the much larger group of unhappy service recipients who are afraid of police retaliation. Thus, a much broader set of performance metrics with predictive validity is needed, including metrics that give greater voice to the community and metrics that rely on indisputable video data.

Existing training is also problematic for police reform. More than a half century ago, following the crisis of legitimacy in the 1960s, psychologists were critical of the standard police training programs and proposed an approach that would focus on “basic skills of being helpful to another” (Danish & Ferguson, 1973, p. 499), but little progress followed. Today, a few hours of classroom training on de-escalation and proper communication is a drop in the bucket. A nationwide survey found that recruits in state and local law enforcement training academies receive an average of 168 hours of training on firearms, self-defense, and use of force, compared to only 10 hours on communication and 8 hours on ethics and integrity (Reaves, 2016). In addition to a stronger “dosage” of training on these topics, we need new community-based metrics to assess the impact of training on job performance.
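The imbalance in academy hours can be expressed as simple ratios. The following is a minimal Python sketch that uses only the three averages quoted above from Reaves (2016); the variable names are illustrative assumptions and are not drawn from the survey itself.

```python
# Average academy training hours as quoted in the text (Reaves, 2016).
force_hours = 168          # firearms, self-defense, and use of force combined
communication_hours = 10
ethics_hours = 8           # ethics and integrity

# Hours of force-related training per hour of each other topic.
force_per_communication = force_hours / communication_hours
force_per_ethics = force_hours / ethics_hours
force_per_both = force_hours / (communication_hours + ethics_hours)

print(f"Force-related hours per communication hour: {force_per_communication:.1f}")
print(f"Force-related hours per ethics hour: {force_per_ethics:.1f}")
print(f"Force-related hours per combined hour: {force_per_both:.1f}")
```

On these figures, recruits receive roughly seventeen hours of force-related training for every hour on communication, which is one way to quantify the weak “dosage” described above.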

Performance evaluations conducted by supervisors are also problematic and lack credibility. Researchers have recommended ways to improve performance evaluations (e.g., Oettmeier & Wycoff, 1997), but the reality is that police work is largely unsupervised. Supervisors are rarely present during police encounters with the public, and with new BWC data, a few agencies are requiring supervisors to review only a small and unrepresentative sample of footage. Also, performance is difficult to evaluate with existing police records, which have little to do with the quality of police service overall and are based on selective community complaints and use of force incidents. Even when agencies require supervisors to evaluate the quality of police service by individual officers, the results are not credible without observational data to validate these judgments. Furthermore, for decades, the police culture has encouraged supervisors to be loyal to their officers (Engel, 2001) and protect them from criticism by headquarters (Skolnick & Fyfe, 1993). For example, after reviewing more than 2,000 ratings of performance in one large police department, I discovered that supervisors never once gave a rating of “Needs Improvement” on any of the dozens of performance metrics. How are officers supposed to improve their performance if they don’t receive any helpful feedback? Anecdotally, I have asked many officers today how they are evaluated, and the typical response is “Whether I can stay out of trouble.”

External civilian oversight and auditing have been recommended for years (Mastrofski, 1999; Reiss, 1971), and have potential, but they are usually opposed by the police and lack critical data needed to accurately evaluate police performance (Walker & Archbold, 2020). Also, the composition and expertise of these groups are sometimes questionable. Nevertheless, civilian oversight is needed in some capacity, as articulated in this publication.

Beginning in the 1990s, consent decrees by the US Department of Justice have required that police agencies correct a “pattern or practice” of unconstitutional policing against protected groups. While these legal agreements have resulted in some improvements to policies and training regarding the use of force in more than forty agencies involved (US Department of Justice, 2017; Lawrence & Cole, 2019; Walker, 2017), thousands of police agencies without consent decrees are not required to make these changes. Sometimes, the force training involves teaching officers to include in their force reports that “I feared for my life.” These consent decrees are not only very costly but they are also slow to get started and typically drag on for years because of failure to comply fully with the terms of the agreement (Ravani, 2022) or because of local opposition and national politics (Lynch & Goudsward, 2024). Reaching beyond the focus of consent decrees, here I will emphasize the need for additional changes in accountability and performance assessments that have a better chance of influencing police behavior on the street and thus enhancing police legitimacy.

4.2 Key Innovations in Policing

Various innovations have been proposed to fight crime more effectively, to enhance police legitimacy, or both. Most police work since the 1960s has been reactive, namely, responding to calls, taking reports, and conducting investigations of crimes already committed. However, since the 1980s, innovation in policing has transitioned to more proactive attempts to prevent crime and engage the community. Here I take a quick look at some of the main strategies and why they have largely failed to change the style of policing or the police culture.

4.2.1 Community Policing

Many reform efforts were introduced in the 1980s to address the legitimacy problem through community engagement and community policing (Cordner, 1997; Green & Mastrofski, 1988; Rosenbaum, 1988, 1994; Skogan & Hartnett, 1997). Community policing remains very popular, but the obstacles to full-scale implementation have been numerous (see Fridell & Wycoff, 2004; Mirzer, 1996; Skogan, 2003). Community policing initiatives were conceived primarily as community relations programs to demonstrate police interest in community concerns, but they were never fully integrated into the daily practice of policing. Furthermore, observational studies in Chicago (Skogan & Hartnett, 1997), Seattle (Lyons, 1999), and Los Angeles (Gascon & Roussell, 2019) have documented how these programs can be insensitive to many of the concerns raised by community members. Despite this popularity, neither the structure nor the function of American police organizations has changed significantly as a result of these initiatives (Maguire, 1997; Mastrofski, 2019).

4.2.2 Problem-Oriented Policing

Also emerging in the 1980s was problem-oriented policing (POP), pioneered by Herman Goldstein (1979, 1990). Goldstein argued that the time had come for police to focus on solving crime-related problems and addressing their causes, rather than being obsessed with incident-driven, law enforcement actions such as making arrests. Eck and Spelman (1987) articulated a four-step process to implement POP called the SARA model (Scanning to identify the problem, Analysis to collect information about the problem, Response to implement solutions, and Assessment to evaluate effectiveness). Problem-oriented policing was a great idea, but the management structure did not change to support this approach to policing. As Braga and Weisburd (2019) note, “it seems unrealistic to expect line-level officers to conduct in-depth problem-oriented policing” (p. 198) without the support of researchers and administrators. Furthermore, POP has focused on reducing crime and disorder problems, but rarely other problems that concern the community. Nevertheless, the most rigorous evaluations suggest that intensive POP with full organizational support can produce short-term reductions in crime and disorder (Hinkle et al., 2020; Weisburd & Majmundar, 2018). In the present context, however, POP has failed to address the problem of public anger and distrust of the police, and police agencies have failed to adequately seek or respond to community feedback on this issue. Even so, I see the SARA model as very applicable to the organizational changes outlined in Section 7.2.3, with the goal of increasing police legitimacy.

4.2.3 Information Technology and Outcome-Based Policing

Policing in the twenty-first century can be viewed as “the Information Technology era” (Rosenbaum, 2007) because of the focus on geo-based crime-fighting and the surveillance of suspects with the aid of computers. Multiple technologies are being used to predict and fight crime, including body-worn cameras (BWCs), data mining systems, heat sensors, biometrics, nonlethal weapons, GPS tracking, electronic monitoring, telecommunication systems, Internet searches, facial recognition software, drones, and data analytics with artificial intelligence.

The public has complained about secrecy and the lack of transparency in police work for many years. Body-worn cameras represent a significant attempt to address this concern using technology (White, 2014), allowing us to see what happens during police-public encounters. However, a large body of research has failed to identify stable effects of BWCs on the behavior of police officers or community members, with the exception of a reduction in community complaints (Lum et al., 2019, 2020). Furthermore, the original BWC goals of transparency and accountability have not been achieved at the organizational level. Videos tend to be reviewed by Internal Affairs only when an incident involves a serious public complaint, high level of use of force, or an officer-involved shooting – estimated at less than 1 percent of all encounters. For the other 99 percent, the millions of hours of BWC footage remain untouched.

However, BWC data have changed the ball game by introducing a totally new source of evidence about what transpired during police-public encounters. As I will stress in this publication, BWC data have great potential for identifying patterns of behavior that contribute to, or undermine, public trust and confidence in the police. Such data can also be used to understand why force or violence occurs in some face-to-face interactions, but not in others.

One of the original pillars of the information technology revolution was CompStat, introduced in the New York Police Department in 1994 (Bratton, 1999; Walsh, 2001). With computers used to analyze crime patterns, New York’s CompStat model became extremely popular. Arguably, this was the first time that police managers were held accountable for performance on the street, but unfortunately, it encouraged an increase in traditional methods of aggressive, reactive policing to improve the crime statistics in each district. It also fostered a punitive environment within the organization, which promoted dishonesty in reporting performance data. Most importantly, CompStat systems failed to gauge in any meaningful way the quality of policing – only the volume of law enforcement activities and crime statistics for large geographic areas.

Given the calls for greater transparency and community voice in policing, the Vera Institute and the National Policing Institute sought to upgrade the CompStat model with a national conference in 2016 and a series of papers to follow in 2018 (Shah et al., 2018). Acknowledging the original criticism of CompStat (e.g., Willis et al., 2010), the recommendations for “CompStat 2.0” sought to incorporate community policing (with regular meetings between the police and community stakeholders), incorporate problem-oriented policing (with a broader focus on identifying and addressing local problems), and expand performance measures beyond crime statistics to problem-solving. Unfortunately, the quality of police services and the treatment of individual community members by the police were not given much attention.

Since the 1990s, other police innovations have competed for dominance, including broken windows policing, hot spots policing, pulling levers policing, predictive policing, and specialty unit policing, with most of these aided by advances in information technology to target geo-based hot spots of crime and disorder or individuals (see Weisburd & Braga, 2006). Controlled experiments have taught us that hot spots policing is one of the few policing strategies that is effective in reducing crime (Braga et al., 2019; Weisburd & Braga, 2019; Weisburd & Eck, 2017), at least for short periods of time. However, I have argued that this wave of geo-targeted policing strategies tends to be aggressive and inequitable as practiced by most police agencies (Rosenbaum, 1993, 2007, 2019). A recent National Academies (2025) workshop on the use of predictive policing approaches cautioned police regarding the potential risk of disparate impact and violations of civil liberties, and the need for a more holistic approach that gives attention to community trust as well as crime reduction.

Because of the focus on aggressive enforcement, these “outcome-based performance models” are more likely to undermine public trust in the police, especially in minority and marginalized communities, than “process-based policing models” (Tyler, 2005). Hence, this Element focuses on advancing knowledge and practice of process-based policing, and, where appropriate, linking it to outcome-based policing models, as Weisburd and his colleagues have done (Weisburd et al., 2022).

4.2.4 Traffic Stops and Public Safety

Since the late 1990s, American police organizations have believed that stopping, questioning, and frisking (SQF) individuals is the most effective method of achieving their primary goal of reducing crime. The reasoning is that if weapons, drugs, and other contraband can be recovered and criminals can be stopped and arrested, crime will fall (for details on these tactics, see Remsberg, 1995). However, the widespread adoption of this proactive enforcement strategy stems, in large part, from a misinterpretation of research on hot spots policing, where studies found that such police interventions can be effective in reducing crime when applied to very small geographic areas or “hot spots” (Weisburd & Braga, 2019; Weisburd & Eck, 2017). The success of the pioneering Kansas City Gun Experiment (Sherman & Weisburd, 1995), with four officers intensely producing stops, citations, and arrests on specific street segments of one district, received widespread attention. As a result, many police organizations dramatically increased stops in entire neighborhoods, precincts, or large areas – a bad translation of evidence-based policing and one that leads to many good people feeling harassed by intrusive police tactics. In the United States, we have reached a point where police make roughly 50,000 traffic stops per day or 20 million per year (Baumgartner et al., 2018).

Many of these stops are “pretextual stops” for minor violations, such as a broken taillight or tinted windows, which are employed to ask additional questions and make observations to justify a search of the vehicle or person for evidence of criminal activity (Graham, 2024). These investigatory stops require “reasonable suspicion” that the person has committed or is in the process of committing a crime. However, research clearly indicates that (1) these stops are more common and include more enforcement against Black drivers (Graham, 2024; Graham et al., 2024; Langton & Durose, 2013; Pierson et al., 2020); (2) “reasonable suspicion” is often lacking before the search is conducted (Skogan, 2023); (3) hit rates for finding contraband are very low and officers are less likely to seize contraband from Black drivers than white drivers (e.g., Engel & Calnon, 2004; Lofstrom et al., 2022; Pierson et al., 2020; Weiss & Rosenbaum, 2009); (4) mental and physical health effects on community members can be substantial (as noted earlier); and, perhaps most importantly, (5) traffic stops are ineffective at controlling crime, whether conducted in the United States (McCann, 2023) or England and Wales (Tiratelli et al., 2018), and thus waste substantial police resources that could be devoted to other crime-fighting methods.

Gladwell (2019) illustrates this last point very clearly using the North Carolina State Highway Patrol as an example. Learning of this new approach to fighting crime, the agency increased its traffic stops from 400,000 to 800,000 over seven years. Gladwell reports that 400,000 searches resulted in only 17 cases where guns or drugs were recovered, raising the question, “[i]s it really worth alienating and stigmatizing 399,983 [drivers] … to find 17 bad apples?” (p. 337). Thus, these pretextual stops rarely produce anything that could be helpful in solving crimes and most are irrelevant to traffic safety. Officers’ time would be much better spent on stops for speeding, drunk driving, or running traffic lights, or on more effective crime prevention work. These stops can also be deadly: unarmed drivers who are not being pursued for a violent crime are killed by police at a rate of more than one per week (Kirkpatrick et al., 2021).
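The hit rate implied by the figures Gladwell reports can be worked out directly. The following is a minimal Python sketch using only the two numbers quoted above (400,000 searches and 17 recoveries); the variable names are illustrative assumptions.

```python
# Figures as quoted in the text from Gladwell (2019).
searches = 400_000
recoveries = 17  # cases where guns or drugs were recovered

hit_rate = recoveries / searches        # proportion of searches with a recovery
per_100k = hit_rate * 100_000           # recoveries per 100,000 searches
no_recovery = searches - recoveries     # searches that found nothing

print(f"Hit rate: {hit_rate:.5%}")
print(f"Recoveries per 100,000 searches: {per_100k:.2f}")
print(f"Searches with no recovery: {no_recovery:,}")
```

On these figures, the hit rate is about four recoveries per 100,000 searches, and the count of searches with no recovery matches the 399,983 in the quotation.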

The damage to police image can be consequential. The national contact survey shows that Black and Hispanic drivers are less likely than whites to believe that the reason for their stop was legitimate (Langton & Durose, 2013). This perceived lack of fairness can place an upper limit on the level of cooperation and compliance that can be expected from the public, which, in turn, has serious consequences for public safety. Consistent harassment and perceived lack of fairness can contribute to legal cynicism in the community (Kirk & Papachristos, 2011), which can undermine public safety. Thus, we need to systematically measure how vulnerable groups feel about their encounters.

Pedestrian stops are also controversial. These proactive crime control programs have stimulated an important debate among academics about the costs and benefits of the common policing tactic of SQF when applied to pedestrians (Ratcliffe et al., 2024). Some argue it should be used only to reduce gun violence in high-crime neighborhoods, but these tend to be neighborhoods of color (Kegler et al., 2023). Weisburd and his colleagues (Petersen et al., 2023; Weisburd et al., 2023) conducted a Campbell Collaboration systematic review of pedestrian stops. While these SQFs reduced crime (roughly 13 percent), they also were associated with mental and physical harm to the community members who were stopped, as well as more negative views of the police and increased delinquency. Consequently, the authors of the review concluded that “existing scientific evidence does not support the widespread use of SQFs as a proactive policing strategy” (p. 2). The concerns about SQFs, including adverse legitimacy effects, not only suggest the need for more dialogue with the community about SQF tactics (see Sherman, 2023) but also underscore the need for new systems of measurement that will allow researchers to monitor the impact of policing practices.

4.2.5 Responding to Persons with Mental Illness

In response to police killing or mistreating persons suffering from mental illness, many police departments have introduced training programs for officers. Crisis Intervention Teams (CIT), based on the 1988 “Memphis Model,” have spread rapidly across the United States, with some good police training to de-escalate tension with individuals facing a mental health crisis (Watson et al., 2019). However, management has inadequate mechanisms in place to ensure that the behaviors being recommended in training have translated into street-level behaviors. Even the Memphis Police Department has been accused by the US Department of Justice of unlawfully discriminating against people with behavioral health disabilities (Department of Justice, 2024).

Given the proven danger of armed police responding to mental health crises, we have witnessed a national movement to create community responder programs that employ unarmed public officials to handle nonviolent mental health calls for service (Beckett et al., 2021). I am supportive of these programs, but there is much work to be done to decide which calls they are qualified to handle and how they will interact with the police. One famous policing scholar argued that the primary function of the police is to handle situations that require “coercive force” (Bittner, 1970), that is, when someone needs to be restrained with force or is judged to be a threat to someone else’s safety. However, these are rare calls for service, and the unarmed community responders are not prepared to handle all remaining nonforce incidents. Thus, we still need to more directly address the problem of how to produce a kinder, gentler, and more empathetic response from police officers.

4.3 Summary of Past Reforms

We have witnessed dozens of strategies to reform police organizations over the past century, including one of my favorites, community policing. Unfortunately, these interventions have achieved little success at changing police culture or police behavior on the streets. Despite all the new attention on managing use of force, the number of fatal police shootings continues to increase, reaching a record high of more than 1,200 deaths in 2023 (Washington Post, 2024). Also, the 378 police officers shot in the line of duty in 2023 is the highest number in history (National Fraternal Order of Police, 2024).

To make matters worse, the public is both critical of aggressive crime-fighting and, at the same time, asking for more enforcement. This happened in the 1980s with drug enforcement (Rosenbaum, 1993), and is happening again with cities and states returning to “tough on crime” policies and increasing police powers (Hernandez, 2024), despite overall declines in crime. This includes recriminalizing drug activity in public places, returning to pretextual traffic stops, and arresting the growing homeless population for camping in public spaces (e.g., Hernandez, 2024; VanSickle, 2024; Woolington & Lewis, 2018).

Unfortunately, the growth in rigid bureaucratic controls and external criticism of the police seems to have increased police solidarity and their psychological distance from the communities they serve. Research on group dynamics reminds us that external threat leads to internal group cohesion (Lang et al., 2021). Today, we see “de-policing” to “stay out of trouble,” along with negative attitudes toward the community being served.

Here I make the argument that progress in police organizations has been restricted by failure to explore new measures of performance and new methods of accountability that are less punitive, grounded in social science knowledge, and give attention to the community’s voice when evaluating police services.

5 Measuring What Matters to the Community

What do we mean by “good policing”? The police mission is “to serve and protect” and the public would like a better definition of “to serve.” To achieve marked improvements in public satisfaction with police performance, others have joined me in making the argument that performance metrics must be expanded to incorporate new measures of the quality of policing as defined by the community (Engel & Eck, 2015; Langworthy, 1999; Lum & Nagin, 2017; Mastrofski, 1999; McCarthy & Rosenbaum, 2015; Moore et al., 2002). For many years, the limitations of traditional measures of performance have been well documented in the scholarly literature (Alpert & Moore, 1993; Blumstein, 1999; Goldstein, 1990; Grant & Terry, 2005; Maguire, 2004a; Masterson & Stevens, 2002; Moore & Poethig, 1999; Moore et al., 2002; Reisig, 1999; Skolnick & Fyfe, 1993; Sparrow, 2015; White, 2007). Official statistics, such as reported crime rates, clearance rates, citations, arrests, and other enforcement activities, not only suffer from inaccuracy, but more importantly, they provide a very incomplete picture of police work. Within the framework of policing in a democratic society, police scholars and practitioners have called for greater attention to policing processes rather than policing outcomes (Mastrofski, 1999; Moore et al., 2002). Specifically, traditional measures fail to capture what is most important to the public, namely, how they are treated by the police. Thus, policing scholars have called upon the field to “measure what matters” to the community (Dunworth et al., 2000; Langworthy, 1999; Masterson & Stevens, 2002; Mastrofski, 1999; Rosenbaum, 2004). What matters most is procedural justice.

5.1 Procedural Justice

In the pursuit of evidence-based policing, we should use public satisfaction with police encounters, and the sense of procedural justice that drives this satisfaction, as core metrics to define “good policing.” The pioneering work of Tom Tyler and his colleagues (Lind & Tyler, 1988; Tyler, 1990; Tyler & Huo, 2002) has paved the way for numerous studies in many countries showing that people’s judgments about the police and police legitimacy are heavily influenced by their sense of whether the process is fair and the officer’s behavior is appropriate (Bradford, 2014; Bradford et al., 2014; Hough et al., 2013; Mazerolle et al., 2014; Quattlebaum et al., 2018; Tankebe, 2013; Tyler, 2004; Tyler & Fagan, 2008). Although community residents want safer streets and less violence, most importantly, they want a police force that is fair and sensitive to their needs (Rosenbaum et al., 2005; Skogan & Frydl, 2004; Tyler, 2005; Weitzer & Tuch, 2005).

When evaluating police performance from the public’s perspective, the police response to crime incidents should be placed in the larger context of daily police work. In ten cities examined by the New York Times, serious violent crime comprised only about 1 percent of the calls for service (Asher & Horwitz, 2021). In nine cities, the Vera Institute of Justice found that less than 3 percent of all 911 emergency calls involved violent crime situations and only 37 percent involved any type of criminal activity (Dholakia, 2022). Hence, the detached crime-fighting model has little relevance to the vast majority of calls for service, yet the treatment of the public during these noncrime situations remains very important to their assessment of police legitimacy and public cooperation. For example, the traffic stop is the most common method of police-public contact and, as described earlier, it is a setting where multiple problems can develop regarding fair and respectful treatment.

Some policing scholars have debated whether there is a causal link between procedural justice and police legitimacy, given that most studies are correlational and do not involve experimental designs (e.g., Nagin & Telep, 2017). I acknowledge this issue, but simply point out that procedural justice, legitimacy, and many other human perceptions have been strongly linked in hundreds of studies (for a review, see Tyler & Nobo, 2022), and that causality has been demonstrated in one randomized trial (Jonathan-Zamir et al., 2024). Also, researchers have used a “subjective causality approach” by conducting in-depth interviews of protestors and concluded that the fairness of treatment influences their judgments of police legitimacy and not the reverse (Perry et al., 2023).

Another debated issue is whether procedural justice should be given priority over fighting crime. I want to emphasize that this is not a “zero-sum game” or trade-off. We don’t need to pick between procedural justice and public safety, and others agree with me (Engel & Eck, 2015). The police should continue to fight crime but do so in a manner that is less harmful to the community. In addition, we have reason to believe that procedural justice and crime control are connected. We know from many studies that when the police act in a procedurally just manner, the public is more likely to view the police as legitimate and trustworthy (Donner et al., 2015), less cynical about the law (Gau, 2015), more likely to obey the law (Tyler, 1990), and more likely to cooperate with the police (Maguire & Lowrey, 2017; Murphy & Cherney, 2012; Rosenbaum et al., 2017; Tyler, 2004, 2006). This also means the community is more likely to comply with police requests (Walters & Bolger, 2018), thus avoiding the need to use force, and is more likely to cooperate by reporting criminal activity (Bolger & Walters, 2019).

Perhaps the most compelling evidence of the linkage between procedural justice and crime comes from a study by Weisburd and his colleagues (2022) in three cities where officers were randomly assigned to hot spots with or without five days of procedural justice training. Officers in the training condition treated community members more respectfully, made fewer arrests, and were less often perceived as engaging in harassment and violence than officers in the control group. Most importantly, there was a significant 14 percent relative reduction in crime incidents in the experimental hot spots where officers received the procedural justice training.

Procedurally just behaviors by the police should contribute to more complete and effective investigations of crime due to increased cooperation, thus contributing to deterrence. Also, when the public views the police as trustworthy, they are more likely to exhibit community engagement and collective efficacy (Tyler & Meares, 2021). A community’s “collective efficacy” and “willingness to intervene” are associated with lower levels of neighborhood violence and less fear of crime (Sampson et al., 1997). For these reasons, I have repeatedly encouraged the “co-production” of public safety by engaging community members in crime prevention and multiagency partnerships (Rosenbaum, 1986, 2002; Rosenbaum et al., 1998; Schuck & Rosenbaum, 2006).

The police could use more help from the community in fighting crime. Clearance rates are low and have continued to decline over the ten-year period beginning in 2013 (Gramlich, 2024). Police now clear only 52 percent of homicides, 41 percent of aggravated assaults, and 23 percent of robberies. Clearance rates for rape have declined dramatically from 41 percent to 26 percent during this period. If public trust in the police were higher, we could expect greater public cooperation in solving crime and a weakening of the “No Snitch” culture present in many high-crime neighborhoods.

For the purposes of this Element, the key components of procedural justice need to be defined (for an introduction, see President’s Task Force on 21st Century Policing, 2015; Rosenbaum et al., 2017; Yale Law School Justice Collaboratory, 2023). Essentially, procedural justice research shows that the public’s perceptions of police are strongly influenced by the quality of their encounter with the police. Tyler and other policing scholars have identified four key components of procedural justice when interacting with the police:

(1) Dignity and Respect: Are community members treated with dignity and respect by the officers?
(2) Voice: Are community members given a voice or chance to express their concerns? Are they allowed to participate in the decision-making process?
(3) Neutrality: Are the officers neutral or unbiased in their decisions, guided by consistent and transparent reasons? Are decisions based on the facts and legal procedures, rather than personal biases about race/ethnicity, gender, age, sexual orientation, mental health, religion, or other factors that are not legally relevant?
(4) Trustworthiness: Are the officers conveying trustworthy motives and showing concern about the well-being of those who are affected by their decisions?

These components can be combined into a single Procedural Justice (PJ) Index, as we did under the National Police Research Platform with fifty-eight agencies. As you can see from Figures 1 and 2, the larger the agency, the lower the procedural justice ratings received by officers and the lower the overall public satisfaction with the encounter. Thus, the problem of procedural injustice is greater in larger, urban areas with larger crime problems and more diversity. Across all agencies, procedural justice reported by white residents on four key measures was roughly 12 percentage points higher than that reported by Black residents and 8 percentage points higher than that reported by Hispanic residents.

A bar chart with five vertical bars, each representing police agencies of different sizes. See long description.

Figure 1 Overall procedural justice index.

Figure 1 Long description

The bars and the data depicted are as follows: 1. Agencies with fewer than 100 sworn officers: 3.43. 2. Agencies with 100 to 199 sworn officers: 3.25. 3. Agencies with 200 to 599 sworn officers: 3.25. 4. Agencies with 600 to 1999 sworn officers: 3.13. 5. Agencies with more than 2000 sworn officers: 2.93. The height of each bar reflects the overall procedural justice index score for agencies within that group, where multiple survey questions were combined to produce a procedural justice index. The bars get smaller or stay level as the agency size increases, thus indicating that the procedural justice ratings received by officers decrease as agency size increases.

A bar chart with five vertical bars, each representing police agencies of different sizes. See long description.

Figure 2 Overall satisfaction by agency size. (% satisfied and very satisfied).

Figure 2 Long description

The bars and the data depicted are as follows: 1. Agencies with fewer than 100 sworn officers: 86.6 percent. 2. Agencies with 100 to 199 sworn officers: 79.4 percent. 3. Agencies with 200 to 599 sworn officers: 81.0 percent. 4. Agencies with 600 to 1999 sworn officers: 74.4 percent. 5. Agencies with more than 2000 sworn officers: 64.8 percent. The height of each bar reflects the percentage of respondents in that group of agencies who were satisfied or very satisfied with the encounter. The bars generally get smaller as the agency size increases (the 200 to 599 group is slightly higher than the 100 to 199 group), thus indicating that overall public satisfaction with the encounter tends to decrease as agency size increases.
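To make the idea of a combined index concrete, the following is a minimal sketch of how ratings on the four components might be averaged into one score per encounter and then per agency. The item names, the use of an unweighted mean, and the example ratings are assumptions for illustration; the text does not specify how the Platform's index was scored. The group means are copied from the Figure 1 long description.

```python
from statistics import mean

# Hypothetical item names, one per procedural justice component.
COMPONENTS = ("respect", "voice", "neutrality", "trustworthiness")


def pj_index(ratings: dict) -> float:
    """Unweighted mean of the four component ratings for one encounter."""
    missing = [c for c in COMPONENTS if c not in ratings]
    if missing:
        raise ValueError(f"missing component ratings: {missing}")
    return mean(ratings[c] for c in COMPONENTS)


def agency_index(encounters: list) -> float:
    """Average of the encounter-level index across surveyed encounters."""
    return mean(pj_index(e) for e in encounters)


def is_non_increasing(values: list) -> bool:
    """True when each value is less than or equal to the one before it."""
    return all(b <= a for a, b in zip(values, values[1:]))


# Group means as reported in the Figure 1 long description,
# ordered from the smallest to the largest agencies.
FIGURE_1 = {
    "<100": 3.43,
    "100-199": 3.25,
    "200-599": 3.25,
    "600-1999": 3.13,
    ">2000": 2.93,
}

# Made-up ratings for a single encounter (illustration only).
example = {"respect": 4, "voice": 3, "neutrality": 4, "trustworthiness": 3}
print(pj_index(example))
print(is_non_increasing(list(FIGURE_1.values())))
```

An unweighted mean treats the four components as equally important; an agency could instead weight them, but that choice would need its own justification.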

Next, I will review research in the social sciences that underscores the importance of each of these components, and how interpersonal communication skills are the key to successful procedural justice in practice. The systems proposed here should be able to measure the main components of procedural justice by capturing relevant information from verbal and nonverbal behavior. The importance of this interpersonal communication should be clear – it can lead to the escalation of conflict and use of force or to mutual respect, trust, and cooperation. After reviewing the scientific literature on police programs seeking to improve police legitimacy, Mazerolle and colleagues (2013a) concluded that frontline procedurally just communication with the public “is important for promoting citizen satisfaction, confidence, compliance and cooperation with the police.”

5.1.1 Respectful Treatment

All human beings have a strong psychological need to be treated with dignity and respect. Regardless of the circumstances, no one wants to be embarrassed, humiliated, demeaned, or dismissed, especially in the presence of others. We know from social psychology that this motivation stems from our desire to maintain a positive social identity and self-esteem (Lind & Tyler, 1988). Interactions that are not respectful and threaten our self-identity will not be received positively.

Respectful treatment can be understood in the context of interpersonal communication, where each action usually causes a reaction. Communication is often received with positive or negative emotional reactions. Social psychologists have identified the social norms of positive and negative reciprocity (Gouldner, 1960). According to the positive reciprocity norm, you are expected to respond to a positive action with another positive action. In other words, when someone says or does something nice to you, usually there is a feeling of gratitude, and you feel a need to repay them, which then contributes to equity in relationships, cooperation, and other positive outcomes (Chen et al., 2009). Conversely, if the person responds to you in a negative or hostile way, you are inclined to respond in the same way – which is called negative reciprocity.

I have observed both positive and negative reciprocity in police-public communication, and the latter can easily lead to escalation of tensions. Many officers are trained to believe that their legal authority and power do not require them to play by these social rules. However, the adverse effects of disrespect are clear – research shows that when someone is disrespected by the police, they typically respond in a negative manner, exhibiting disobedience and resistance (Terrill & Reisig, 2003).

Similarly, holding stereotypes can often lead to disrespectful treatment by the police. I have already discussed racial bias in enforcement. No one wants to be questioned, searched, and treated as if they are a suspected criminal, which happens too often in high-crime neighborhoods where the vast majority of residents are law-abiding community members. This well-documented reality may help to explain the results of a meta-analysis by Bolger and colleagues (2021), which found that satisfaction with the police is lower for people of color, younger people, and those who have been victims of crime.

Interpersonal communication skills are critical for giving and receiving respect. Police officers routinely interview victims, suspects, and bystanders to gather information, but there are good and bad methods of interviewing, and procedural justice is essential. Considerable scientific knowledge regarding effective interviewing techniques can be applied to police work (Stewart & Cash, 2008).

One study offers some insight regarding effective police communication styles when interacting with suspects. Foster and colleagues (2024) analyzed 438 body-worn camera (BWC) and dashcam video recordings from two police agencies. They found that when patrol officers “presented a positive tenor/demeanor or employed noncoercive verbal tactics,” suspects were significantly more likely to comply with their requests. In contrast, officers’ use of “coercive verbal tactics or accusatory language” did not improve compliance levels. Furthermore, such negative communication may escalate the tension and lead to use of force, but will certainly lower procedural justice evaluations.

When evaluating police-public interactions, I also emphasize the importance of basic conversational etiquette for gaining respect. Etiquette is a set of rules or practices endorsed by society regarding appropriate behavior in interpersonal settings (e.g., Muja, 2014; Sia, 2023). All social interactions should have a beginning, where you identify yourself and greet the person; a middle, where you listen and wait your turn to talk (“turn taking”); and an ending, where you close the conversation by saying something like “thank you” or “take care.” Differential power dynamics can also cause officers to skip over some of the social etiquette requirements that are normally expected in social exchanges, thus undermining procedural justice.

Importantly, when you realize that something is not going right in a conversation (e.g., the other person has been offended), you engage in “conversation repair” (Albert & De Ruiter, 2018), such as “I’m sorry I interrupted you – go ahead and finish what you were saying about your neighbor” or “I’m sorry I offended you – I didn’t mean to imply that you were at fault.” Conversation repair requires emotional intelligence to recognize that a problem has occurred and know how to fix it.

While interpersonal communication styles can easily influence respect and other aspects of procedural justice, organizational research has found that verbal and nonverbal communication can be easily misunderstood or distorted, and can even contradict each other (Clampitt, 1991; Vasu et al., 1998). This communication breakdown can have serious consequences for these encounters. Sometimes the words used by the sender can mean something different to the receiver. Body-worn camera data and survey data can help us determine if and when miscommunication occurs.

Also, nonverbal communication can send a message or attitude (Bolton, 1985; Leathers, 1992), such as folded arms, frowning, head shaking, and eye rolling, or conversely, smiling, eye contact, a positive tone of voice, and head nodding. Whether it is facial expressions or body language or tone of voice, the person is sending a message. Again, all of this remains unmeasured at this point.

Over many years, Paul Ekman and Wallace Friesen created a taxonomy of facial expressions, finding forty-three distinct muscular movements (Gladwell, 2005). The basic point here is that our face can express our emotions, whether it be anger, fear, sadness, surprise, disgust, happiness, or other emotions. During police-public contacts, if officers become better at quickly identifying and understanding emotional expressions, they can strengthen their communication skills and manage their own emotions. Of course, we need to reliably measure some of these facial expressions and how they are interpreted before police training.

5.1.2 Voice

The second core principle of procedural justice is voice. Whether you look at feminist theory (e.g., Abrams, 1999) or personal efficacy research in psychology (Bandura, 1997, 2006), you will see that human beings seek to be autonomous agents who are self-governing and self-efficacious. In other words, we want to be in control of ourselves (make decisions on our own) and be in control of our immediate environment. In fact, those who feel efficacious are more likely to achieve their goals (Meichenbaum, 1984; Schunk & Zimmerman, 1994). Supporting this work, social psychological research on Brehm’s reactance theory indicates that people will react negatively against threats to their individual freedom to make decisions (Brehm & Brehm, 2013; The Decision Lab, 2024).

Listening skills are essential for effective communication and for giving voice to others, but most of us are poor listeners. Too often, we are waiting for our chance to talk and even interrupt the person talking to make our point or take some action. As Vasu and colleagues (1998) note, the literature distinguishes between the “deliberative listener” who is listening for information, and the “active listener” who is seeking to understand the person and their feelings. We should do both. The latter is where empathy comes in. Officers must be motivated to listen for both information and feelings, and when doing so, avoid passing judgment regarding the individual without all the facts.

Officers, like all of us, must learn how to have “difficult conversations” (Stone et al., 2023). Experts have offered strategies for having such conversations in a way that causes less stress and helps to resolve interpersonal differences and reach positive outcomes. These strategies include controlling your emotions and not being defensive when being attacked or accused of something, and listening carefully to the message that is being communicated behind the words. For example, officers should keep their cool when someone is angry about receiving a ticket (“Why are you wasting your time stopping me for no reason?”) or blames the officer for a negative experience (“What took you so long to get here?”).

All of this work should be taken into consideration as we seek to minimize any unnecessary, nonnegotiable commands by police officers to civilians. Officers must be careful that their actions do not suppress this human desire for agency and control unless absolutely necessary. Police are trained to believe they have the ultimate authority to enforce the law, and thus, civilians who challenge that authority are not always well received. Hence, new measures should capture whether officers engaged in active listening (i.e., giving voice) during the interaction, even when they receive negative information.

5.1.3 Neutrality

The perception of police neutrality is absolutely essential for gaining legitimacy, as the public today is very sensitive to any type of police bias. Eck and Rosenbaum (1994) posited that the public evaluates police performance using “three Es” – effectiveness, efficiency, and equity. Somehow, equity has been largely ignored as a defining feature of evidence-based policing.

Consent decrees have investigated a range of unconstitutional policing practices in American cities that pertain to police neutrality, including bias in use of force, stops, searches, and arrests, as well as racist and sexist language (e.g., see D’Souza et al., 2019). Today, the LGBTQ+ community, religious communities, and mental health communities, for example, have grown increasingly concerned about mistreatment by the police. Thus, the time has come to better measure the public’s experience with the police, including these vulnerable groups. For example, as noted in Section 3.1, not only do police show bias in traffic stops and searches but also people of color are more fearful, resentful, and humiliated by these encounters, and feel discriminated against.

I should note that neutrality is closely linked to other elements in the procedural justice model. In other words, recipients of police service are less likely to feel that the police are being discriminatory or unfair if they feel they have a voice in the process, feel they are treated with dignity and respect, and feel the officer was genuinely concerned for their welfare (Skogan & Frydl, 2004).

5.1.4 Trustworthiness through Empathy

Police trustworthiness is the fourth key component of procedural justice. Our research on police contacts in fifty-three large cities has identified a core communication skill that is linked to voice and trustworthiness, namely, empathy (Rosenbaum et al., 2017). As we describe it, “[p]eople respond positively when others can understand their point of view, feelings, and circumstances, and respond with compassion, reassurance, and comfort rather than indifference, criticism, or blaming” (p. 115).

Empathy can be viewed as a three-step psychological process (Kanov et al., 2004): (1) noticing that the other person is suffering; (2) feeling their experience by imagining their pain; and (3) responding or acting to ease or eliminate their suffering. Beyond feelings, empathy involves cognition. Various studies on “perspective taking” and “emotional intelligence” describe a cognitive skill set that allows you to recognize other people’s mental states, recognize the hidden meaning behind what they are saying, anticipate their actions, be aware of your own emotions, and be aware of your own impact on someone. Researchers have mapped out multiple developmental stages of perspective taking (Elfers et al., 2008). Thus, social dynamics can get very complicated, but they are important for understanding interpersonal communication. Perspective taking is part of the empathy process and has shown benefits. For example, several experiments have shown that perspective taking can prevent automatic racial bias (Todd et al., 2011). Other experiments have shown that empathy traits can lead to more prosocial helping behavior (Xiao et al., 2021).

The third component of compassionate communication involves responding. The police need to go beyond noticing and feeling to doing something. Showing compassion requires action, such as emotional support, advice, or practical assistance. This means doing whatever you can do (given your time, resources, and skill) to alleviate the person’s pain or suffering. This might include both nonverbal responses (e.g., nodding your head in agreement, facial expressions, body orientation, or simply showing up on time) and verbal responses (e.g., expressing sympathy and providing advice), all while remaining neutral and objective as a professional.

Clearly, empathy and perspective taking stem from emotional intelligence. Officers will benefit greatly from being able to determine how their empathetic (or nonempathetic) behavior has affected the person with whom they are interacting. Empathy is extremely important when responding to victims of personal and property crime, as they care deeply about how they are treated by the police (Elliott et al., 2012). Too often, victims of violence in the United States experience negative, unsupportive reactions from professionals. Sexual assault victims in particular receive a range of negative responses (Ullman, 2000), including police failure to properly investigate (Southall, 2022), which have been shown to inhibit their psychological recovery and reduce the likelihood of future disclosure or reporting to authorities (Ahrens, 2006; Lorenz, 2017; Starzynski et al., 2005; Ullman, 1999). For example, millions of rape kits used to test for DNA have gone untested in the United States because of doubts about the victim’s report (Nadolny et al., 2024), leaving offenders on the streets and victims upset. In England and Wales, of the nearly 2,000 victims of rape and sexual assault surveyed in the first half of 2023, “75% of respondents said their mental health had worsened as a result of their police experience, 55% of respondents said their physical health decreased, 54% said their trust in police declined” (Hohl et al., 2024, p. 45). Many survivors did not feel their reports were taken seriously by the police. They felt they were not listened to, were blamed for what happened, and were not treated with kindness and understanding. As a result, many expressed unwillingness to report future sexual offenses. In 2022, charges were brought in only 2.1 percent of all reported rape cases in England and Wales (Hohl & Stanko, 2024). Thus, the link between procedural justice and future offending is very clear.

Related to trustworthiness and empathy is the importance of police providing explanations for their decisions and actions. Victims and bystanders want to know why the police are asking certain questions, making certain demands, or making certain decisions. To show empathy and compassion, police will need to slow down, listen to people’s anger, fears, and concerns, and answer their questions. Too often police feel they need to run from call to call and do not have time to “chat” with anyone, a belief that is simply not true in most cities. Thus, another communication skill in short supply is patience.

5.2 Summary of Potential New Measures

In sum, police-public encounters are much more complex than suggested by enforcement statistics on traffic citations, arrests, and use of force. Procedural justice is a defining feature of police-public interactions. The officer’s respectfulness, fairness and impartiality, emotional and informational support, and other communication skills will determine whether community residents are satisfied with their encounter, will work with the police in the future, and will obey the law themselves. Therefore, procedural justice should be measured and used by police agencies in a manner that will increase both police legitimacy and public safety.

6 Building and Testing New Systems of Measurement

In support of translational criminology, this Element seeks to develop strategies that move us from “evidence to action” (La Vigne, 2024) – strategies that lead to real change in police organizations. The time has come to build and test new systems of measurement to enhance procedural justice during police-public encounters, and thus create police organizations that employ what Rahr and Rice (2015) call “guardians” rather than “warriors.” My proposal springs from a basic principle of organizational management: If you measure something, you can manage it and hold employees accountable for achieving it. Conversely, if you fail to measure something, then management is sending the message to employees that it doesn’t matter and officers can ignore it as they go about their work. Thus, the time has come to “measure what matters to the public” – procedural justice – and build these data into new management systems. Two new data systems are proposed here: contact surveys and BWCs.

6.1 Contact Surveys

For many years, policing scholars have been promoting community surveys as a method of evaluating public satisfaction with the police (e.g., Brown & Benedict, 2002; Decker, 1981; Fridell & Wycoff, 2004), and now such surveys are widely employed. However, there are several major limitations to previous and ongoing community surveys: (1) the findings cannot be disaggregated to small geographic areas – they are typically citywide or national, (2) they are one-time “snapshots” of community perceptions, not ongoing data collection systems, (3) they cannot be linked to officer performance evaluations or accountability systems, and (4) they only capture the general public’s perceptions of the police, which, as others have pointed out, tend to be “stereotypical views about law enforcement rather than lived experiences” (Carmichael et al., 2021, p. 99). To be clear, results from the national Police-Public Contact Survey indicate that only 21 percent of Americans have contact with the police in a given year (Tapp & Davis, 2022). In other words, eight out of ten community survey respondents have no “lived experience” with the police. Hence, the time has come to give voice to the one in five Americans who have experiential knowledge of actual police behavior.

The Police-Public Contact Survey introduced by the Bureau of Justice Statistics is a good start, but this is a national survey that is only administered every few years, and only asks community members about police contacts within the last year. As the President’s Task Force (2015) noted, “these surveys do not reflect what is happening every day at the local level when police interact with members of the communities they serve” (p. 24). Clearly, we need to capture the public’s experience quickly – within the first week or two – not a year later. Research indicates that the passage of time has an adverse effect on human recall (Baddeley et al., 2009; Barrouillet & Camos, 2012). Memory will simply decay over time and can be biased by what researchers call “retroactive interference,” where new information interferes with the mental retrieval of old information.

The direct benefits of an ongoing Contact Survey Program are numerous. First, community members who are stopped by the police or call the police for help will be given a voice and empowered in a relationship that has, historically, been perceived as one-sided. This measurement system will also incentivize a new definition of good policing, encouraging the police to “serve and protect” in a procedurally just manner. The contact survey data will help police agencies identify specific skills and circumstances where programmatic intervention will strengthen procedural justice during police-public encounters.

6.1.1 Contact Survey Content and Methods

Contact surveys need to be institutionalized at the local level, as a few small cities are beginning to do now (see Davies, 2022), but with methods and measures that meet social science standards and can be integrated into a national system of data collection. We learned a great deal from introducing a contact survey in fifty-three large cities as part of the National Police Research Platform (Rosenbaum et al., 2017), funded by the National Institute of Justice. Similar to the platform project, I would recommend that the proposed contact survey and associated methodologies be the result of meetings with a small group of community leaders, police chiefs/sheriffs, AI experts, and policing scholars with knowledge of procedural justice research. To maximize response rates, I would recommend a relatively short survey, that is, no more than fifteen to twenty questions.

We asked police chiefs to mail letters to community members who have had a recent police contact (i.e., in the past two weeks), but mailing letters is a time-consuming and expensive approach. Now, I would propose that officers be required to distribute business cards with a survey invitation and QR code that provides a link to the survey, to be completed either online or by telephone (we did both successfully). Importantly, an independent agency will need to follow up with text messages to ensure a reasonable response rate. Finally, the city will need a public education campaign, using social media, billboards, and flyers, to make the community aware of this new program and encourage their participation.

Based on the research cited above, I have suggested several relevant topics for the survey in the Online Appendix. Procedural justice should be the main focus, but consideration should be given to related topics, such as threats and use of force, de-escalation, empathy/compassion, social etiquette, willingness to work with the police, trust in the police, and overall satisfaction with the experience.

6.1.2 Linking the Survey Data to Other Agency Data

To avoid a lengthy survey and to provide more context about the incident, the police records division will need to routinely download certain data elements connected with police reports and create a separate database that can be linked to the survey responses. These data elements would include the incident number, demographic information, and other information needed for analysis (see Section 7.1.1).
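As a minimal sketch of this linkage, the example below joins survey responses to incident records on a shared incident number. The record fields, class names, and sample values are hypothetical and chosen only for illustration; an actual agency extract would follow its own records schema and privacy rules.

```python
from dataclasses import dataclass


@dataclass
class IncidentRecord:
    """Hypothetical extract from the records division."""
    incident_number: str
    call_type: str
    district: str


@dataclass
class SurveyResponse:
    """Hypothetical contact survey response."""
    incident_number: str
    pj_score: float
    satisfied: bool


def link_responses(responses, records):
    """Attach incident context to each survey response.

    Returns (linked, unmatched). `linked` is a list of
    (response, record) pairs; `unmatched` holds responses whose
    incident number does not appear in the records extract.
    """
    by_number = {r.incident_number: r for r in records}
    linked, unmatched = [], []
    for resp in responses:
        record = by_number.get(resp.incident_number)
        if record is None:
            unmatched.append(resp)
        else:
            linked.append((resp, record))
    return linked, unmatched


# Made-up data for illustration only.
records = [
    IncidentRecord("2024-0001", "traffic stop", "District 1"),
    IncidentRecord("2024-0002", "burglary report", "District 3"),
]
responses = [
    SurveyResponse("2024-0001", 3.5, True),
    SurveyResponse("2024-0099", 2.0, False),  # no matching record
]

linked, unmatched = link_responses(responses, records)
for resp, rec in linked:
    print(rec.call_type, rec.district, resp.pj_score)
print(len(unmatched), "response(s) could not be linked")
```

Keeping unmatched responses in a separate list, rather than silently dropping them, lets analysts monitor how often the linkage fails.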

6.2 Body-Worn Cameras

I envision BWC data as a major source of police reforms in the future, but BWC data have yet to be used for meaningful organizational changes. Here I am proposing that such data be used for measuring the quality of police services and changing incentive structures. For example, when BWC footage of police stops from one California agency was hand-coded using words spoken, the authors found that officers used less respectful language when talking to people of color than when talking to white drivers (Voigt et al., 2017). From this work, we can begin to see the richness and value of BWC data. However, the hand-coding of BWC data by human beings is far too time-consuming and, therefore, must be replaced by BWC data analytics to have widespread utility.

Software built with artificial intelligence (AI) is now being developed and field tested so that cities can engage in more systematic and proactive reviews of police-community interactions captured with BWCs (e.g., Farooq, 2024; Polis Solutions, 2024). The most significant project to date is what I will call “the Dallas Project.” Funded by the National Institute of Justice, this project involved a collaboration between the National Policing Institute, Polis Solutions, the Caruth Policing Institute at the University of North Texas at Dallas, and the Dallas Police Department (Dolly et al., 2024). This initiative was able to field test software developed by Polis Solutions called TrustStat to “analyze verbal and non-verbal cues from BWC footage, identifying behaviors aligned with procedural justice principles.” Human coders were trained to rate the same videos to assess the degree of correspondence between AI and human perceptions of procedural justice during these police-public encounters. The results indicate a high degree of correlation, suggesting that AI software has strong potential to help us understand the interpersonal dynamics that occur during a vast array of police-public encounters that currently go unmeasured.

This opens the door for police agencies to look far beyond the anecdotal investigations of a few “bad officers” and identify patterns of behavior for all officers. Of course, the Dallas Project has limitations, as acknowledged by the authors. First, the sample of videos was limited to those from the Dallas Police Department and did not include force incidents. Second, the sample of human coders was limited to a handful of community members, university faculty, and police personnel from Dallas. Finally, and most importantly, the project did not collect data from those individuals who interacted with the officers in these videos to gain their perceptions of what transpired and their views of procedural justice. As the authors note, “[w]hat may seem like a procedurally just interaction may be objectionable or offensive to the community member experiencing it.” Thus, while the Dallas Project is extremely important and promising, more work is needed to refine the measures, test external validity with a diversity of videos and coders, and connect AI results with contact survey data.

6.3 Analysis of New Data

Beginning with BWC data, we need to deconstruct and categorize what is being communicated and how it is being communicated both verbally and nonverbally. For example, did the officers introduce themselves and ask relevant questions? Did they listen carefully without interrupting? How often did they use commanding or demeaning language, raise their voice, show anger, seem rushed, indifferent, impatient, or blame the civilian for what happened? How often did they show patience, speak in a soft tone, show kindness, respect, empathy, and concern for the person’s welfare? How often did they answer questions, offer individual guidance, or provide information about services? Nonverbally, did the officer use their head, face, or arms to communicate support, indifference, or negativity?

While the BWC data will be more extensive than the contact survey data, they should overlap on key dimensions of procedural justice to allow testing for convergent validity. Also, we can expect that combining these complementary datasets for the same incidents will allow researchers to make significant advances in our knowledge of the antecedents and consequences of procedural justice. When one dataset suggests a pattern, the other dataset can be used to provide new insights or explanations. For example, when an officer uses specific language (based on BWC data), does it influence the community members’ perceptions of that officer and level of procedural justice (based on contact survey data)?
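To make the idea of convergent validity concrete, the following is a minimal sketch, assuming that each dataset yields one score per incident on the same dimension (here, respect) and that the two datasets share an incident number. The dictionary layout, the incident numbers, and the scores are all hypothetical, and a Pearson correlation is only one of several statistics that could serve this purpose.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation between two equal-length lists of scores."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one measure is constant")
    return cov / sqrt(var_x * var_y)

def convergent_validity(bwc_scores, survey_scores):
    """Correlate the two measures over incidents present in both datasets.

    Each argument maps a (hypothetical) incident number to a respect score.
    """
    shared = sorted(set(bwc_scores) & set(survey_scores))
    return pearson_r([bwc_scores[i] for i in shared],
                     [survey_scores[i] for i in shared])

# Made-up scores for illustration only; incident "A9" has no survey response.
bwc = {"A1": 1.0, "A2": 2.0, "A3": 3.0, "A4": 4.0, "A9": 2.5}
survey = {"A1": 2.0, "A2": 4.0, "A3": 6.0, "A4": 8.0}
r = convergent_validity(bwc, survey)
```

Only incidents that appear in both datasets enter the calculation, which mirrors the expectation that survey responses will cover a subset of recorded encounters.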

Conversational analysis can look at the tone, pitch, intonation, and interruptions that occur in conversations (e.g., Shon, 2024), and this can be captured with sophisticated AI analysis of BWC data. Since 2017, when Google created the “transformer” architecture for language translation, we have witnessed a revolution in AI technology (Toews, 2023), leading to generative AI like ChatGPT. We now have superfast semiconductors that are beginning to solve many of society’s problems (60 Minutes, 2024). For the current proposal, thousands of words and behaviors can now be analyzed in seconds by AI to identify patterns and extract meaning. How these words are received and interpreted by community members will require a contact survey that can inform these AI algorithms.

Applications of AI in the policing field can build upon the impressive use of AI to evaluate physical- and mental health-related interventions and provide feedback to improve diagnosis, treatment, and bedside manners (Bedmutha et al., 2024; Bickman et al., 2016; Love-Koh et al., 2018; Ryan et al., 2019). Ryan and colleagues (2019) describe how AI could be used to measure the meaning of words, turn taking, and the tone and style of communication.

For policing, the analysis of BWC recordings and contact survey data should eventually produce a reliable PJ Index and independent subcomponents that determine the level of procedural justice and related communication skills exhibited by police officers during their encounters. Subindexes within the overall PJ Index might include voice, respect, fairness, emotional control, empathy, and perhaps other dimensions such as general competence. Once we have established reliable and valid indicators of procedural-justice-related behaviors, we can begin to examine their relationship to specific outcomes, as shown in Figure 3. For example, is disrespect by the officer a stronger predictor of civilian resistance than lack of empathy by the officer?

An illustration showing two boxes depicting how procedural justice behaviors by police officers (or their absence) may have certain effects during police-civilian interactions. See long description.

Figure 3 Linking behaviors and outcomes.

Figure 3 Long description

The box on the left lists procedural justice-related behaviors, such as respect, voice, fair-unbiased, trustworthy, empathy, emotional control, social etiquette, helpful, competent, and suspiciousness. An arrow from the box on the left leads to the box on the right. The box on the right lists the possible outcomes such as civilian resistance, citation, search, escalation, arrest, use of force, civilian satisfaction, victim recovery, and civilian cooperation legitimacy.

However, reality is more complex, as reflected in Figure 4. Interpersonal communication can be extremely complex, so we need to expand our chart here to consider both the context in which the encounter occurred and the demographics of the participants. We need to consider the individual characteristics of both the officer and the civilian involved – their race, age, ethnicity, gender identity, sexual orientation, disability, language proficiency, and other characteristics. I expect that AI analysis of BWC data will allow us to identify a large set of characteristics and behaviors that can be linked to procedural justice, and some of these perceptions and actions may be driven by prejudice and stereotypes.

This Figure illustrates the complexity of police-public interactions. An illustration showing four boxes labeled 1. Individual characteristics, 2. PJ behaviors, 3. Context, and 4. Outcomes. See long description.

Figure 4 The complexity of interpersonal communication: AI and BWC data.

Figure 4 Long description

An arrow from individual characteristics leads to PJ behaviors. An arrow from PJ behaviors leads to outcomes. An arrow from context leads to outcomes. An arrow from individual characteristics leads to outcomes. An arrow from context leads to PJ behaviors.

As one example of demographic effects, we collected procedural justice data through a contact survey in Chicago. With the help of Jon Maskaly, I looked at three demographic characteristics (age, race/ethnicity, and gender identity) of 3,646 people who were stopped by the Chicago police in 2014 and 2015, and matched them with the characteristics of the officers who stopped them. I hypothesized that the more matching that occurred between the demographics of the officer and the community member, the higher the procedural justice scores assigned to the officer. The results support this matching hypothesis. As shown in Figure 5, as the number of demographic matches increased (0, 1, 2, 3), community members reported more satisfaction with the encounter, more trust of the officer, more respect from the officer, and perceived less officer bias against them.

The bar chart contains 16 horizontal bars showing the number of demographic matches between police officers and the civilians they stop (4 levels) and the civilians’ ratings (4 types) associated with each level of matching.

Figure 5 Procedural justice scores by number of civilian/officer demographic matches on age, race, gender.

Figure 5 Long description

The horizontal axis represents satisfaction, trust, respect, and unbiased. The data depicted is as follows: 1. Satisfaction: no matches: 2.30; 1 match, any: 2.36; 2 matches, any: 2.69; and fully matched: 2.73. 2. Trust: no matches: 2.67; 1 match, any: 2.78; 2 matches, any: 2.96; and fully matched: 2.96. 3. Respect: no matches: 2.62; 1 match, any: 2.60; 2 matches, any: 2.81; and fully matched: 2.88. 4. Unbiased: no matches: 2.43; 1 match, any: 2.43; 2 matches, any: 2.67; and fully matched: 2.70.
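The matching analysis can be sketched in a few lines. This is an illustration under stated assumptions, not the procedure used for the Chicago data: the field names are hypothetical, age is assumed to be compared by age group, and the encounters and scores below are made up.

```python
from collections import defaultdict

# Assumed field names; the actual coding of age, race/ethnicity, and gender may differ.
MATCH_FIELDS = ("age_group", "race_ethnicity", "gender")

def count_matches(officer, civilian):
    """Number of demographic fields (0 to 3) on which officer and civilian agree."""
    return sum(officer[f] == civilian[f] for f in MATCH_FIELDS)

def mean_score_by_matches(encounters):
    """Average survey score for each observed number of demographic matches."""
    totals = defaultdict(lambda: [0.0, 0])  # match count -> [sum of scores, n]
    for enc in encounters:
        n = count_matches(enc["officer"], enc["civilian"])
        totals[n][0] += enc["score"]
        totals[n][1] += 1
    return {n: total / count for n, (total, count) in sorted(totals.items())}

# Made-up encounters for illustration only.
officer = {"age_group": "30-39", "race_ethnicity": "White", "gender": "M"}
encounters = [
    {"officer": officer, "score": 2.0,
     "civilian": {"age_group": "18-29", "race_ethnicity": "Black", "gender": "F"}},
    {"officer": officer, "score": 3.0,
     "civilian": {"age_group": "30-39", "race_ethnicity": "White", "gender": "M"}},
    {"officer": officer, "score": 2.5,
     "civilian": {"age_group": "30-39", "race_ethnicity": "Black", "gender": "M"}},
    {"officer": officer, "score": 2.4,
     "civilian": {"age_group": "18-29", "race_ethnicity": "Hispanic", "gender": "F"}},
]
means = mean_score_by_matches(encounters)
```

The same grouping could be repeated for each rating (satisfaction, trust, respect, and perceived bias) to produce a breakdown of the kind shown in Figure 5.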

However, not all matches are the same. Female-female matches outperformed male-male matches across multiple dimensions of procedural justice. My guess is that too much testosterone is at work when the males are paired. Male officers are more inclined to experience the “masculinity threat” (Richardson & Goff, 2014), and therefore feel the need to respond with authority to restore their status as the “Alpha male.”

No doubt, reality is even more complex than these findings suggest. The lower procedural justice scores that result from age mismatches in general could be due to many factors. For example, the use of certain words might be miscommunicated across different generations (Twenge, 2023). Using slang, a Gen X civilian (born 1965–79) might say to a new Gen Z officer (born 1995–2012), “You seem like a headbanger.” The younger officer might respond defensively, “I would never harm anyone unless they deserved it,” not realizing the older civilian is referring to heavy metal music lovers.

In terms of context, there are many variables to consider. For example, what is the level of poverty in the neighborhood where the interaction occurred? What is the level of crime and disorder? Did the encounter occur in a hot spot of crime? Are the police responding to a domestic violence incident or making a traffic stop? How long did the encounter last?

In the future, after extensive training with large datasets, AI should be able to distinguish between individual officers in their style of communication and interaction with other people, indicating levels of emotional intelligence, procedural justice, empathy, and other interpersonal skills that are more or less effective in building trust with different types of community members. For example, imagine that Officer A does much better when interacting with younger people than with older people. Officer B does well with most groups, except when someone questions his authority, and then his ego is quickly bruised and he tends to threaten the person with punishment. Officer C receives high procedural justice scores in general, except when entering a crime hot spot, where he quickly becomes an aggressive warrior who is suspicious of everyone he encounters.

The specific communication skills that lead to these outcomes should be determined. Some actions or words may cause tensions to escalate and even result in use of force or other enforcement actions, while other actions lead to de-escalation. In addition, we need to look at how various police behaviors are perceived by the community members in different settings. Some actions may have little impact, while others produce negative sentiment and perceived injustice.

Thus, our knowledge should expand exponentially with the analysis of new data from BWCs and contact surveys, and this information can be used to help officers adjust to different settings and different civilians. The number of statistical interactions between personal and contextual variables could be very large and very helpful. Beyond individual officers, we will learn which procedural justice behaviors are the strongest predictors of outcomes in general, and therefore should be given more attention throughout the organization.

6.3.1 Hot Spots of Procedural Injustice and Hot Individuals

These analyses may identify what I will call “hot spots of procedural injustice,” that is, small geographic areas where procedural injustice by the police is higher than in surrounding areas. Many years ago I recommended the use of surveys to study small geographic areas (Rosenbaum & Lavrakas, 1995). With large enough samples, the contact survey dataset should generate new research findings on the factors that contribute to hot spots of procedural injustice, which, in turn, could advance our understanding of geo-based crime patterns. We know that police-community relations are especially strained in low-income neighborhoods of color, where hot spots of crime are more likely to occur, but we have yet to determine whether there is variability in procedural justice within or between these areas, and if so, what the causes and effects are. What are the degree and conditions of overlap between hot spots of crime and hot spots of procedural injustice? The police should have more success in solving crime if they have a procedurally just relationship with community members who frequent hot spots.

Beyond geography, these new datasets should be used to identify “hot officers” and “hot civilians.” To begin with, each officer should have their own PJ Index and subindex scores that can be compared to other officers. Hot officers could be defined as those officers whose PJ scores are at least two standard deviations below the mean for all officers serving in similar situations. This information should remain confidential but can be used for training and coaching. Hot civilians could be defined as community members whose behavior involves the initiation of, or negative response to, procedural injustice during interactions with the police (e.g., disrespectful language or noncompliance). Understanding these patterns is important. In other words, what specific behaviors by civilians cause what specific types of officers to engage in what specific types of procedural injustice? Here too, these findings can be used to train all officers in how to recognize these general patterns and respond appropriately.

We must also be able to analyze the data to identify “Exemplary Officers,” defined as those officers whose PJ Index scores are at least two standard deviations above the mean for all officers serving in similar situations. We must acknowledge and reward their exceptional performance and use it as a model for other officers to follow.
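The two-standard-deviation rule for hot and exemplary officers can be illustrated with a short sketch. The record layout and the comparison-group label are hypothetical, each officer is assumed to belong to one comparison group, and the sample standard deviation is my choice here rather than a settled part of the proposal. The scores are made up.

```python
from collections import defaultdict
from statistics import mean, stdev

def flag_officers(records, cutoff=2.0):
    """Label officers 'hot', 'exemplary', or 'typical' within their comparison group.

    records: iterable of (officer_id, comparison_group, pj_index) tuples.
    Officers in a group of one are left unlabeled, since no standard
    deviation can be computed for them.
    """
    groups = defaultdict(list)
    for officer_id, group, score in records:
        groups[group].append((officer_id, score))

    labels = {}
    for members in groups.values():
        scores = [score for _, score in members]
        if len(scores) < 2:
            continue
        mu, sd = mean(scores), stdev(scores)
        for officer_id, score in members:
            z = (score - mu) / sd if sd > 0 else 0.0
            if z <= -cutoff:
                labels[officer_id] = "hot"
            elif z >= cutoff:
                labels[officer_id] = "exemplary"
            else:
                labels[officer_id] = "typical"
    return labels

# Made-up PJ Index scores for ten officers working similar assignments.
records = [(f"P{i}", "patrol-evening", 3.0) for i in range(8)]
records += [("P8", "patrol-evening", 1.0), ("P9", "patrol-evening", 5.0)]
labels = flag_officers(records)
```

One practical consequence of this rule is that comparison groups must be reasonably large: with fewer than six officers in a group, no score can fall two sample standard deviations from the group mean, so no one would ever be flagged.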

Finally, the analysis should produce aggregate PJ Index scores as the foundation for organizational change. Procedural Justice Indexes should be generated for all supervisors (i.e., the average of all scores for employees who report to them), thus allowing the identification of hot supervisors and exemplary supervisors. In addition, PJ Indexes should also be calculated for the city as a whole, police districts, beats, shifts, and special units, for internal changes discussed in Section 7. Finally, given the widespread concern about explicit or implicit police bias, the analysis should, to the extent possible, include breakdowns by constitutionally protected classes of service recipients.
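As a minimal sketch of these aggregate scores, the function below averages officer-level PJ Index scores by any grouping field. The field names (`supervisor`, `district`, `pj_index`) and the scores are hypothetical, and a simple unweighted mean of officer scores is assumed.

```python
from collections import defaultdict
from statistics import mean

def aggregate_pj_index(officer_scores, key):
    """Mean officer PJ Index for each value of `key` (e.g., supervisor or district)."""
    buckets = defaultdict(list)
    for row in officer_scores:
        buckets[row[key]].append(row["pj_index"])
    return {group: mean(scores) for group, scores in buckets.items()}

# Made-up officer-level scores for illustration only.
officer_scores = [
    {"officer": "P1", "supervisor": "Sgt. A", "district": 5, "pj_index": 3.0},
    {"officer": "P2", "supervisor": "Sgt. A", "district": 5, "pj_index": 2.0},
    {"officer": "P3", "supervisor": "Sgt. B", "district": 7, "pj_index": 4.0},
]
by_supervisor = aggregate_pj_index(officer_scores, "supervisor")
by_district = aggregate_pj_index(officer_scores, "district")
```

An unweighted mean treats every officer equally regardless of how many encounters were scored; weighting by the number of encounters is an alternative that an agency might prefer.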

6.4 Measurement Conclusion

The collection and analysis of new procedural justice data from BWCs and contact surveys is the first large step in changing police culture and service, but it is only the beginning. We must be able to translate these data patterns into evidence-based policing that results in significant improvements in police-public interactions.

As a psychologist, I cannot emphasize enough that human behavior is shaped by incentives and disincentives. Today, police officers are expected to be good public servants, but there are no systems in place to support and reward such behavior. The crime-fighting behavior of “warriors” is rewarded more than the helpful and respectful behavior of “guardians.” Typically, officers are rewarded for increased productivity and aggressiveness in terms of the number of citations written, arrests made, guns and drugs recovered, and crimes investigated, not the manner in which community members are treated. Many police agencies still have informal quotas for the number of stops or tickets written in a given time period, regardless of the justification for enforcement (Fielding, 2022; Ossei-Owusu, 2016), although many laws now prohibit such practices because they can result in racial profiling.

Police officers themselves would agree with my critique of current reward structures. Our national survey found that only one-third of the officers felt that “employees who consistently do a good job are rewarded,” and two-thirds felt that “the department is more interested in measuring activity than the quality of work” (see Cordner, 2017). If we care about the quality of police work, then the time has come to change the structure of incentives and accountability.

7 Using Data to Change Modern Policing

The collection, analysis, and reporting of new performance data is a waste of time if a system is not created for police organizations to use these data for management purposes. The entire chain of command will need to be involved in this process. I am proposing a coordinated approach that involves both internal and external strategies of reform. Here, I will provide a brief summary of the changes needed to alter street-level behavior, with a focus on new data collection, accountability, and performance evaluation systems. Without proper incentive structures, management will continue to face a crisis of public confidence and will be unable to monitor or shape the performance of its officers on the street.

7.1 External Changes

7.1.1 Police Performance Agency (PPA) and Community Involvement

Police agencies have repeatedly experienced difficulty introducing and sustaining major organizational reforms and being objective in self-assessment. Hence, the new data systems should be managed by a unit of government that is completely independent of the police department to gain public trust, protect confidentiality, and ensure strong implementation of the data collection system. Let’s call this the Police Performance Agency (PPA). The PPA, working closely with community leaders, police personnel, a survey laboratory, and a company specializing in using AI to analyze BWC data, would have the primary responsibility for overseeing the collection, analysis, and reporting of findings. The PPA would oversee the work of a survey laboratory involving contact survey data and the work of an AI company involving BWC data. Dashboards for police management would be created. The PPA would also work with an audit unit within the police agency to ensure that all employees, from police officers to managers, are following new policies regarding this system and using data dashboards properly. The PPA must also conduct an ongoing audit to ensure that officers are distributing contact survey business cards for all targeted incidents involving a police report, and not simply handing them out selectively after positive encounters. Telephone calls, text messages, or emails can be used to contact potential survey respondents to ask whether they were handed a business card by the officer.

The community has a very important role as a contact survey respondent, providing information about the quality of their recent contact with the police. They can also assist the PPA in creating a public awareness campaign to inform the community of the contact survey program, its importance, and how community members can participate with their identity protected. Finally, certain community members should be invited to participate in police training to share their lived experience, which can help to create empathy and understanding among officers.

To create the dashboards with the relevant breakdowns, the PPA would need to receive a database from the department’s record management division with information about the police-public contacts being analyzed, for example, incident number, date, characteristics of the incident (traffic stop, traffic accident, or crime incident), unit and supervisor involved, any action taken (search, warning, citation, arrest, and/or use of force), the location (precinct, district, beat, and block), time of day, and the demographics of both the community member and officer involved (race/ethnicity, age, gender, unit, etc.). Some agencies have created good “Metadata” for BWC systems (White et al., 2023) that include some of these data points and more, such as the exact location and length of the encounter. These “tags” will also be needed for the survey dataset. After protecting the identity of the community member and the officer, these downloads should be provided to the independent agency on a monthly basis to allow for the generation of reports on PJ Index performance.
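One way the monthly download might be prepared is sketched below. Everything specific here is an assumption for illustration: the field names, the use of a keyed hash to replace the officer's badge number with a stable code, and the single survey item being linked. A keyed hash is pseudonymization rather than full anonymization, so whoever holds the key can still link codes back to officers.

```python
import hashlib
import hmac

# Hypothetical names for fields that identify a person directly.
IDENTIFYING_FIELDS = ("officer_badge", "civilian_name")

def pseudonym(value, secret_key):
    """Keyed hash, so the same officer always maps to the same opaque code."""
    digest = hmac.new(secret_key, value.encode("utf-8"), hashlib.sha256)
    return digest.hexdigest()[:12]

def prepare_export(contact_records, survey_responses, secret_key):
    """Link records-division rows to survey responses by incident number,
    dropping directly identifying fields along the way."""
    surveys = {s["incident_number"]: s for s in survey_responses}
    export = []
    for rec in contact_records:
        row = {k: v for k, v in rec.items() if k not in IDENTIFYING_FIELDS}
        row["officer_code"] = pseudonym(rec["officer_badge"], secret_key)
        survey = surveys.get(rec["incident_number"])
        # Most contacts will have no survey response; keep them with a missing value.
        row["survey_respect"] = survey["respect"] if survey else None
        export.append(row)
    return export

# Made-up records for illustration only.
key = b"demo-key-not-for-production"
contacts = [
    {"incident_number": "2024-001", "incident_type": "traffic stop", "district": 5,
     "officer_badge": "1234", "civilian_name": "Jane Doe"},
    {"incident_number": "2024-002", "incident_type": "crime incident", "district": 5,
     "officer_badge": "1234", "civilian_name": "John Roe"},
]
responses = [{"incident_number": "2024-001", "respect": 3}]
export = prepare_export(contacts, responses, key)
```

Because the survey rows are linked by incident number, the survey itself does not need to ask about the incident or the respondent's demographics.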

The full database will serve as the basis for reports and feedback regarding specific precincts, districts, units, supervisors, and individual officers. Importantly, the use of police records and metadata not only provides the context for examining different patterns of procedural justice but also helps to keep the contact survey very short (i.e., without demographic or incident information), thus increasing community response rates.

Also, community feedback and education should be one by-product of this new measurement system. Frankly, the escalation of tension or negative sentiment during police-public encounters can be the result of actions taken by the police officer and/or the community member. Data from the BWCs and contact surveys could teach us a great deal about the behavior patterns of community members as well as officers, and the dynamics of this exchange. Similar to the analysis of police behavior, this information on civilian behavior should be used to educate the general public and police officers about ways they might be able to prevent the escalation of conflict and remain safe.

7.2 Internal Changes

To improve the quality of police services and police legitimacy, the new measurement system must be translated into organizational changes that are focused on improving officers’ communication skills and management accountability for these skills. Other policing scholars have also been critical of past changes and have suggested a variety of evidence-based organizational changes to make policing more effective (Lum, 2021; Lum & Nagin, 2017; Skolnick & Fyfe, 1993). The focus here is on how two new data systems can be used to change police culture and strengthen procedural justice during routine encounters with the public. The key to successful reform is to change performance evaluations and accountability systems for personnel at all levels of the organization. Here I briefly describe some of these evidence-based changes that can help advance the science of performance measurement and evaluation, and thus help agencies become “learning laboratories” (Maguire, 2004b).

7.2.1 Performance Evaluations

Performance evaluation systems should be completely redesigned. New dashboards should be built using data from contact surveys and BWCs. While some traditional metrics should be retained (e.g., internal and external complaints), special attention should be given to the quality of service delivered and procedural justice.

Procedural Justice Index scores, computed outside the agency, should be used thoughtfully by supervisors to evaluate individual officers. For decades, agencies have recorded officer-level data on many variables, including the number of stops, citations, arrests, force applications, and citizen complaints, so there is no compelling reason why they cannot retain and use data on the quality of service delivered and procedural justice in particular.

7.2.2 Supervision with Good Data

Anyone who studies police organizations understands that first-line supervisors play a critical role in organizational change – they communicate the norms and values of management, administer rewards and punishments, and protect their employees from harm (Van Maanen, 1983). Organizational research in different settings has taught us that supervisors need to communicate effectively with their employees to create effective working conditions and job satisfaction (Katz & Kahn, 1966; Pincus, 1986). This holds true for police organizations as well.

In the present context, police supervisors should be required to use the new dashboard data on procedural justice to identify at-risk employees as well as exemplary employees. I see enormous potential for EIS in the future, as they represent a proactive, data-driven system to examine behavioral patterns and predict the performance of individual officers (Walker & Milligan, 2006; Walker et al., 2000). Of course, criminologists love to predict human behavior, as the entire field of criminology was built on successfully predicting delinquency and geographic crime patterns.

Using reliable and valid PJ Indexes, supervisors should identify and intervene with those individuals who score significantly above or below the mean of comparable officers. For consistently high-scoring officers, their supervisor should acknowledge such achievement with verbal praise, and consider certificates, awards, and pay raises. For consistently low-scoring officers, supervisors need to intervene with additional supervision, coaching, and training. Again, I want to remind police administrators that rewarding desired behaviors and discouraging undesirable behaviors is the basic process by which behavior change is achieved, and it needs to occur frequently. Hence, the typical annual performance evaluation should be supplemented by more frequent observations and meetings with officers who will benefit from assistance or praise.

Police organizations that follow procedural justice principles (what I call “Organizational Justice”) can increase job satisfaction and rule compliance among their employees (Rosenbaum & McCarty, 2017). Also, research indicates that when supervisors talk through a recent officer-civilian encounter using procedural justice principles, they can reduce incidents of arrest and use of force (Owens et al., 2015).

7.2.3 Accountability Unit and Meetings

Building on the CompStat model, police departments should hold regular meetings, where a powerful, internal accountability unit and senior management share statistics on procedural justice with all command personnel. I have called them “RespectStat meetings” (McCarthy & Rosenbaum, 2015), but the name is not important. As suggested earlier, data from the contact survey and BWC systems should produce quantitative scores on the PJ Index and subindexes for each precinct/district, shift, unit, type of incident/call, demographics of the officer, and demographics of the community member. The results will likely indicate differences in performance across these variables, raising the question of what is contributing to these differences and what can be done to enhance performance in certain areas. These accountability meetings should put immediate pressure on commanders and supervisors to increase their scores before the next meeting.

Leaders should create a problem-solving culture to aid in this process. Using a problem-oriented policing (POP) framework with the SARA model, managers should be taught to identify specific patterns and problems in procedural justice performance. Then, they must be taught to propose, implement, and evaluate new methods for improving officers’ performance on these indicators. Commanders and middle managers should encourage this type of problem-solving with brainstorming sessions. De-policing that reduces public safety should not be allowed as an acceptable solution.

Everyone from first-line supervisors to the Chief of Police should have certain performance standards or goals built into a strategic plan for their procedural justice program. When they achieve those goals (e.g., 80 percent of the officers under their supervision achieved PJ Index scores above a certain level), these supervisors should be acknowledged and rewarded in some way.
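A goal of this kind is easy to check once officer-level scores exist. The sketch below assumes a hypothetical target score and the 80 percent share used in the example above; the scores are made up, and “above” is read here as strictly greater than the target.

```python
def share_above_target(pj_scores, target):
    """Fraction of officers whose PJ Index is strictly above the target score."""
    if not pj_scores:
        raise ValueError("no scores supplied")
    return sum(score > target for score in pj_scores) / len(pj_scores)

def goal_met(pj_scores, target, required_share=0.80):
    """True when at least `required_share` of officers exceed the target."""
    return share_above_target(pj_scores, target) >= required_share

# Made-up scores for five officers reporting to one supervisor.
scores = [3.1, 2.9, 3.4, 3.6, 3.0]
share = share_above_target(scores, target=2.95)
met = goal_met(scores, target=2.95)
```

In this illustration four of the five officers exceed the target, so the share is 0.8 and the goal is met; raising the target to 3.0 would drop the share to 0.6.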

I was impressed with the MAX program (Management Analytics for Excellence) that was introduced in the New Orleans Police Department in 2016 (Morgan et al., 2017). Using dashboards, visual maps, report cards, and monthly meetings, MAX sought to hold administrators accountable for changes in specific performance metrics relevant to the consent decree. As with CompStat, the department held monthly MAX meetings, but unlike CompStat, the metrics reached far beyond crime statistics to include data on stops, searches, use of force, and even procedural justice. The role of supervisors was given special attention. For procedural justice, every month two team members reviewed BWC footage from seven police stops in each district and scored the officer (yes or no) on several dimensions, such as whether they introduced themselves, explained their actions, and listened to the subject. When searches and use of force were involved, the reviewers also judged whether officers were “reasonably professional and courteous.”

The MAX system represents a significant change in police accountability, requiring that managers do something to improve metrics other than crime and enforcement statistics. In New Orleans, scores improved quickly, presumably because of monthly downward pressure in the chain of command to move from “Red” to “Green” before the next meeting. However, given that the entire process is internally controlled under pressure from a consent monitor, the process of change (e.g., feedback to individual officers) and the validity of the metrics remain uncertain. Also, the small samples of BWC footage will not be representative of the department overall or even the few individuals selected. The scoring system is also unlikely to meet scientific standards of reliability. Nevertheless, this is a laudable first initiative.

Clearly, if the MAX system were able to use AI to analyze BWC footage, that would dramatically reduce the workload of reviewers and improve the reliability and validity of these accountability metrics. Furthermore, adding contact survey data to this process would be a significant enhancement of this data-driven approach. We encourage law enforcement agencies to draw on this framework to build a new service-oriented accountability system using new procedural justice data from contact surveys and BWCs that is analyzed with sophisticated AI algorithms.

This type of accountability system should reach beyond commanders to mid-level managers and front-line supervisors. Supervisors should be given training and coaching on how to interpret and use the PJ Indexes to enhance officers’ performance on the job. However, the agency must exercise caution when comparing officers or supervisors who are assigned to precincts or shifts with vastly different levels of crime and poverty. At a minimum, organizational units should be compared against themselves and should be expected to show improvement in their scores over time.

With enough data, sophisticated breakdowns of the data are likely to reveal interesting patterns that deserve attention. Specific patterns could be discussed at these department-wide meetings if they occur above the individual level to protect confidentiality. Imagine, for example, that an agency is having a problem with low PJ Index scores with detectives, but only those working the evening shift and only when interviewing sexual assault victims and only when they show a lack of empathy. Or perhaps there is a PJ Index problem in District 5, but only for Patrol Officers who are using disrespectful language, but only during traffic stops and only with drivers under twenty-five years of age. Handed these findings, the job of management is to engage in problem-solving sessions where the SARA model is used to fully understand the nature of the problem and respond appropriately. This might include additional coaching, training and supervision, or even reassignments to provide additional support. Managers at all levels should be expected to discuss actions taken and progress made on PJ Indexes at future PJ meetings.

If enough procedural justice data is collected, it can be used to assess variability across small geographic areas. Just as criminologists have identified hot spots of crime, we should be able to identify hot spots of procedural injustice. As noted earlier, given the overrepresentation of minorities near hot spots of crime, and given the tendency of officers to respond more aggressively in areas where they hold prejudicial suspicions of criminality, there is a good chance that PJ Index scores will be lower in these areas. District commanders and other supervisors should be made aware of these hot spots of procedural injustice and use the SARA model as part of their brainstorming sessions.

Hence, this new data system should lead to a reduction in the number of procedural injustice hot spots, which in turn should lead to greater trust in, and cooperation with, the police. This cooperation should help to reduce hot spots of crime. However, agencies may also discover hot spots of procedural injustice that are outside the hot spots of crime. These micro places should be addressed as well, as they might contribute to legal cynicism and increase the chances that additional hot spots of crime will emerge.
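To make the distinction between these two kinds of hot spots concrete, the following sketch flags micro places with low average PJ Index scores and then separates those that overlap with known crime hot spots from those that do not. The place identifiers, threshold, and minimum number of surveys are hypothetical, and a simple cutoff stands in for the spatial statistics that a real hot spot analysis would likely require.

```python
from statistics import mean

# Hypothetical PJ Index scores keyed by micro place (e.g., a street segment ID).
pj_scores = {
    "seg-101": [3.0, 4.0, 2.0],
    "seg-102": [8.0, 7.0, 9.0],
    "seg-103": [4.0, 3.0, 3.5],
    "seg-104": [2.0],  # too few surveys to judge
}
# Assumed to come from the agency's existing crime analysis.
crime_hot_spots = {"seg-101", "seg-102"}

def injustice_hot_spots(scores_by_place, threshold=5.0, min_n=3):
    """Return places with enough surveys and a mean score below the threshold."""
    return {
        place
        for place, scores in scores_by_place.items()
        if len(scores) >= min_n and mean(scores) < threshold
    }

pj_hot = injustice_hot_spots(pj_scores)
overlapping = pj_hot & crime_hot_spots  # low PJ scores inside crime hot spots
outside = pj_hot - crime_hot_spots      # low PJ scores outside crime hot spots

print("Overlapping with crime hot spots:", sorted(overlapping))
print("Outside crime hot spots:", sorted(outside))
```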

Agencies should continue accountability for crime reduction, but should focus more on the prevention of crime than on reactive enforcement, especially strategies that engage the community to build public trust. For example, the Portland Police Bureau in Oregon has developed an app that allows officers to keep track of all public meetings they attend and the number of people in attendance.

Other individual-level performance metrics can be included in accountability meetings, such as the number of internal and external complaints against officers, applications of force, sick days taken, and poorly written police reports, to give a few examples (for a sophisticated look at force applications, see Jonathan-Zamir et al., 2024). Again, means and standard deviations should be calculated with appropriate comparisons. My point here is that procedural justice is not meant to replace all other forms of accountability.
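One way to read "means and standard deviations with appropriate comparisons" is to compare each officer only with peers in a similar assignment. The sketch below does this for complaint counts using a standardized score; the group names, officer labels, counts, and cutoff are hypothetical, and the population standard deviation is one of several reasonable choices.

```python
from statistics import mean, pstdev

# Hypothetical complaint counts per officer, grouped by comparable assignment.
complaints = {
    "Evening patrol": {"A": 1, "B": 2, "C": 1, "D": 8},
    "Day patrol": {"E": 0, "F": 1, "G": 0, "H": 1},
}

def flag_outliers(groups, z_cutoff=1.5):
    """Flag officers whose count is well above the mean of their own peer group."""
    flagged = {}
    for group, counts in groups.items():
        values = list(counts.values())
        group_mean = mean(values)
        group_sd = pstdev(values)
        if group_sd == 0:
            continue  # no variation within the group, so nothing to flag
        for officer, n in counts.items():
            z = (n - group_mean) / group_sd
            if z > z_cutoff:
                flagged[officer] = round(z, 2)
    return flagged

flagged_officers = flag_outliers(complaints)
print(flagged_officers)
```

A flag of this kind is a prompt for a supervisory conversation, not a finding of misconduct; with small peer groups, a single unusual count can dominate both the mean and the standard deviation.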

The internal audit and evaluation unit should be responsible for working with the PPA to prepare the PJ Index scores and other data for accountability meetings. They would have other responsibilities, such as monitoring the data-driven dashboards to ensure that they are user-friendly and contain data points that are essential for high-quality supervision. They should also be responsible for the quality of data and reports that are included in the EIS program to avoid the “garbage in, garbage out” problem. The audit and evaluation unit would also audit compliance with new policies, including the proper use and activation of BWCs.

Clearly, senior management will need to introduce new incentives that value procedural justice as a central goal of the organization. The performance of mid-level managers and first-line supervisors should be evaluated based on the quality and quantity of problem-solving meetings, employee interventions that focus on improving procedural justice, and aggregate PJ Index scores of their employees. Similar to officers, managers should be rewarded for performing these tasks well and for improving procedural justice outcomes.

7.2.4 Officer Training

Training will be needed at all levels of the organization to introduce new policies and practices designed to enhance procedural justice and legitimacy, while changing the police culture. To achieve officer “buy in,” officers must be reassured that this new system and training will make their job easier and safer and is designed to be supportive rather than punitive.

Lately, we see police organizations changing their policies and linking them to new training on use of force, de-escalation, and peer intervention to stop/prevent harmful behavior by other officers. This is a good start, but these trainings tend to be limited in hours, rarely involve scenario-based practice sessions, and are not evaluated to determine their effectiveness in changing street-level behavior. Furthermore, these policies and trainings tend to have a narrow focus on the misuse of force, which is a small piece of the problem. The larger problem is the police culture that negatively affects daily encounters and the image of the police, which may lead to preventable force incidents.

Trainers must address officers’ skepticism about being nice to people while enforcing the law. Skeptical officers often tell me, “When I give someone a ticket, I’m never going to get a good rating.” We tested that hypothesis in three cities, where we measured the officer’s procedural justice or what I call “Carside Manners” after the officer gave the driver a ticket (Rosenbaum et al., 2015). As shown in Figure 6, there are massive differences in driver satisfaction depending on whether or not the officer treated them in a procedurally just manner. Across all ticketed drivers, 42 percent were satisfied (dotted horizontal line). When the officer listened to the driver or was polite, about 60–62 percent of drivers were satisfied with the encounter. However, when the officer did not listen or was not polite, the share of satisfied drivers dropped to about 5–9 percent. So “Carside Manners” (procedural justice) do matter when issuing a ticket! Similarly, a randomized trial using laboratory videos by Maguire and colleagues (2023) found that procedural injustice during traffic stops works to increase anger, while procedural justice decreases anger.

Figure 6 “Carside manners” when ticketing (% very satisfied and somewhat satisfied).

Figure 6 long description: Bar graph of driver satisfaction after receiving a ticket. Officer listened: 62.0 percent; did not listen: 8.5 percent. Officer polite: 59.8 percent; not polite: 4.9 percent. A dotted horizontal line shows that the average level of satisfaction among all drivers who are given a ticket is 42 percent.
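The size of these gaps can be computed directly from the Figure 6 values. The short calculation below uses only the percentages reported for the figure; the variable names are illustrative.

```python
# Percent of ticketed drivers who were very or somewhat satisfied (Figure 6).
satisfied = {
    "listened": 62.0,
    "did not listen": 8.5,
    "polite": 59.8,
    "not polite": 4.9,
}
overall_average = 42.0

# Gaps in percentage points between procedurally just and unjust treatment.
listening_gap = round(satisfied["listened"] - satisfied["did not listen"], 1)
politeness_gap = round(satisfied["polite"] - satisfied["not polite"], 1)

# How far each condition sits above or below the overall average.
relative_to_average = {
    condition: round(value - overall_average, 1)
    for condition, value in satisfied.items()
}

print(f"Listening gap: {listening_gap} points")
print(f"Politeness gap: {politeness_gap} points")
print(relative_to_average)
```

Both gaps exceed 50 percentage points, which is the basis for describing the differences as massive.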

As I have stated several times, we know that crime is concentrated in low-income neighborhoods where people of color and other vulnerable groups are overrepresented. History tells us that local residents often view the police as white male intruders who are suspicious of everyone (Skolnick & Fyfe, 1993). Hence, procedural justice training and supervision will need to give special attention to issues of bias. The Black community is not the only group at risk. The prevalence of hatred and intimidation toward immigrants, Hispanics, Jews, Muslims, the homeless, and the LGBTQ+ community has become especially problematic in the United States. Again, data from the new system should be able to direct management to specific bias problems where additional training is needed and the types of interactions that need attention.

In terms of evidence-based policing, I will note that procedural justice training has shown considerable promise for improving police attitudes and behavior (Antrobus et al., 2019; Mazerolle et al., 2012; Rosenbaum & Lawrence, 2017; Sahin et al., 2017; Weisburd et al., 2022; Wheller et al., 2013; Wood et al., 2020). However, the organization must be ready for such training; otherwise, a backlash can be expected. For example, while the procedural justice training for roadside checks was effective in the Australian “Queensland Community Engagement Trial” (Mazerolle et al., 2013b), the same type of training had the opposite effect in the “Scottish Community Engagement Trial” (MacQueen & Bradford, 2015) because officers were unhappy with ongoing reforms and this experiment was not well explained to them in advance (MacQueen & Bradford, 2017).

From my perspective, the most effective procedural justice training requires adult learning methods and lots of practice time with feedback sessions, similar to how officers are trained to use weapons, but more individualized. Cameras can be used to provide immediate feedback in practice sessions and to illustrate desired behaviors exhibited by exemplary officers in the field. In Chicago, we videotaped police recruits facing off with difficult role players, reviewed the recording, and then gave them feedback (Rosenbaum & Lawrence, 2017). Watching BWC footage of your own performance can also be helpful (Deck et al., 2024). Videotaped feedback has been used to enhance communication skills in various fields, from medicine (Roter et al., 2004) to martial arts (BenitezSantiago & Miltenberger, 2015). Professional athletes regularly observe videos of themselves to improve their performance. Training can also be enhanced by using virtual reality headsets to strengthen officers’ skills (Bridges, 2024), although this work should be expanded beyond use of force decisions to building interpersonal communication skills in general, such as how to interact with victims of crime. In any event, both the quality and quantity of adult learning methods should be increased.

However, we must acknowledge the potential restrictions on building a more empathetic and emotionally intelligent workforce. Police face traumatic incidents, from dead bodies to people being shot, and this can sometimes result in post-traumatic stress disorder (PTSD), which in turn can affect brain function and decision-making (Covey et al., 2023). Long-term exposure to trauma over their career can negatively impact their physical and mental well-being (Craddock & Telesco, 2022). Also, there is the real possibility that officers, after years of exposure to intense situations and trauma, will experience “compassion fatigue,” which is familiar to therapists and widespread in the healthcare industry (Cavanagh et al., 2020). Certainly, we know that some officers become hardened and cynical over time.

In the context of creating a wellness culture (and not just wellness training), agencies may be able to use instruments developed in the healthcare field to identify officers at risk of compassion fatigue, such as the Professional Quality of Life Scale or other metrics (Craddock & Telesco, 2022; Sinclair et al., 2017), and provide early intervention, such as therapy and rotating them out of high-crime, high-stress assignments.

If officers have difficulty showing empathy, they could, at a minimum, practice specific scripts or linguistic formulas to speak in an empathetic manner. In the medical field, one “bedside manners” study found that the AI language model ChatGPT was able to show significantly more “empathy” than medical doctors when answering patients’ questions by using the right words (Ayers et al., 2023). Down the road, we should consider having officers interact with a chat application on the job, such as an AI chatbot, to improve their procedural justice skills appropriate for a given situation. Once we have trained AI on BWC and contact survey data, the information might be programmed into an AI chatbot wristwatch. These watches could give officers immediate feedback. For example, during the encounter, the watch might suggest (via earbuds) specific language that is responsive to the community member’s actions or words (e.g., “I’m sorry that happened to you. Can you tell me more?”). After the encounter, the watch might score the officer’s procedural justice performance on different dimensions based on the words they used (e.g., “Your Respect score for this event was 9.0. Your Empathy score was only 3.0. Keep up the good work on Respect and try to improve your Empathy.”). Officers should be reminded that empathy is a skill, not an unchangeable trait, and when people are reminded of this, research shows they work harder to develop it (Zaki, 2017).
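The post-encounter feedback message described above could be assembled from dimension scores in a very simple way. The sketch below is purely illustrative: it assumes that some upstream AI model has already produced a score for each procedural justice dimension, and the function name, dimension labels, and 0–10 scale are hypothetical.

```python
def feedback_message(scores):
    """Build a short feedback message from per-dimension scores (assumed 0-10 scale)."""
    strongest = max(scores, key=scores.get)
    weakest = min(scores, key=scores.get)
    return (
        f"Your {strongest} score for this event was {scores[strongest]:.1f}. "
        f"Your {weakest} score was only {scores[weakest]:.1f}. "
        f"Keep up the good work on {strongest} and try to improve your {weakest}."
    )

# Hypothetical scores for one encounter, as might be produced by an AI model.
example_scores = {"Respect": 9.0, "Empathy": 3.0, "Voice": 6.5}
message = feedback_message(example_scores)
print(message)
```

The sketch always praises the highest-scoring dimension and coaches on the lowest, which keeps the message supportive rather than punitive; a working system would need to handle ties and encounters where every score is high.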

7.2.5 Recruitment and Selection

After finding that “reform training” on community policing and problem-solving had limited effects, Haar (2000) noted that “[t]he best predictors of attitude change were by far the attitudes that recruits brought with them to the academy” (p. v). Taking this a step further, cities should pay more attention to the recruitment and selection of individuals who are most capable of engaging in procedurally just behaviors on the job. Given the current retention and staffing problems in law enforcement, this is a good time to stop and reflect on recruiting and hiring practices.

First, for recruitment, agencies should move away from the warrior ads with SWAT teams dropping from helicopters and begin to recruit individuals with service skills and a service orientation. Second, for selection, they should employ social science tools to measure applicants’ attitudes and dispositions toward community-oriented policing styles, procedural justice, and interpersonal communication skills.

Psychological testing should reach beyond the traditional efforts to identify candidates with deficiencies such as proneness to violence, drug abuse, and dishonesty. Police psychologists have developed their own profiles of officers who have excessive force problems. The first profile listed is “Officers with personality disorders such as lack of empathy for others, and antisocial, narcissistic, and abusive tendencies” (Scrivner, 1994, p. iii). They also listed “impulsiveness,” “low tolerance for frustration,” and “dominant, heavy-handed patrol style that is particularly sensitive to challenge and provocation.” Certainly, some individuals should not be police officers because of personality disorders. For example, multiple studies have shown that individuals with a narcissistic personality disorder feel superior to others, are self-centered, antagonistic, overly sensitive to criticism, and lack empathy for others (Orth et al., 2024). These traits would undermine the ability of police officers to provide compassionate services and de-escalate conflictual encounters.

On a more positive note, I recommend that government agencies seek out individuals who have strong interpersonal communication skills appropriate for the very difficult job of being a police officer. As I always say to my students, “[i]f you don’t like talking to people, don’t be a cop.” Police interact with all types of people facing all types of problems, so officers must be able to adapt and respond appropriately. I offer several specific recommendations.

First, I think psychological testing should give more attention to emotional intelligence. This includes the capacity to read emotions and show empathy. There is a sizeable scholarly literature on emotional intelligence. Clearly, some individuals score higher on emotional intelligence than others, meaning they can perform well in social settings because they are aware of their own and other people’s emotions, they understand the complexity of emotions, and are skilled at managing their own and other people’s emotions to improve performance and achieve desired goals (Mayer et al., 2000). The field of measurement has evolved significantly (Petrides et al., 2016), so an emotional intelligence test would help with hiring decisions.

Second, because of problems with bias and discrimination that have defined nearly every crisis of police legitimacy in American history, the testing process should include the measurement of stereotypes and prejudice toward vulnerable groups.

Third, to change the police culture and reduce discrimination, the process of recruitment and selection should make diversity a higher priority. A sophisticated analysis of 2.9 million encounters involving Chicago Police patrol officers (Ba et al., 2021) found that Black and Hispanic officers made significantly fewer stops, arrests, and applications of force than white officers, especially for Black civilians. This analysis controlled for enforcement opportunities (i.e., beat, shift, month, and type of assignment), and the differences were attributed to fewer discretionary stops and arrests by Black and Hispanic officers for “petty crimes,” including drug offenses. Black civilians appear to be less afraid of Black officers than white officers (Pickett et al., 2024). However, national data indicate that Black civilians do not attribute more procedural justice to Black officers than they do to white or Hispanic officers (Li & Sun, 2023). These nonsignificant differences may be due to the fact that Black and Hispanic officers are trapped between two cultures and identify with the “blue” police culture as much or more than their own racial/ethnic culture (Carbado & Richardson, 2018; Weitzer, 2000). New training and new performance metrics proposed here should minimize the adverse effects of socialization into the predominately white police culture. In any event, greater diversity in the police force should help to tackle racism and restricted promotional opportunities (cf. Forman, 2017) that have limited the ability of minority officers to challenge the powerful police culture and express their racial identity.

Fourth, to reduce hypermasculinity in the police culture, the recruitment process should give special attention to female applicants, since they are more likely to interact with the public in the manner described in this Element. In the controlled Chicago study by Ba and colleagues (2021), female officers used force less often than male officers. Reviewing the literature, Lonsway and colleagues (2003, p. 2) point out that “women officers rely on a style of policing that uses less physical force, are better at defusing and de-escalating potentially violent confrontations with citizens, and are less likely to become involved in problems with use of excessive force.”

Unfortunately, the United States has been unsuccessful at hiring more women (only 12 percent of the workforce), unlike Australia, Canada, and the UK. Historically, the police culture has not been supportive of female officers (Roman, 2020; Stepler, 2017), but work is underway to lower the barriers to female success in this male-dominated environment (Bumpers et al., 2024). My hope is that when female officers reach a critical mass, they will be able to change the hypermasculine culture.

7.2.6 Leadership

Police leaders are the key to successful reform. However, police leaders face many challenges (Haberfeld, 2006; Police Executive Research Forum, 2009). Too often they are involved in crisis management around police misconduct rather than engaged in strategic planning. For the mission described herein, police chiefs will need the skills and fortitude to introduce these changes and respond effectively to inevitable resistance. A new generation of police executives must be willing to take some risk with innovation, restore a clear mission regarding public trust and equitable police services, and clearly communicate the evidence-based methods that are needed to achieve this goal.

In summary, police chiefs and management will need to (1) revise policies and procedures to reflect this new procedural justice mission; (2) communicate the importance of this mission to all employees; (3) revise performance evaluation systems to give more attention to procedural justice during encounters with the public and strengthen the coaching skills of supervisors on this topic; (4) introduce new accountability meetings for commanders and supervisors with brainstorming sessions that focus on improving problematic PJ Index scores; (5) introduce new models of training for all employees with opportunities to practice procedural justice and receive supportive feedback; and (6) give more attention to interpersonal communication skills in the recruitment and selection of new officers.

Police chiefs should seek input from managers when making these organizational changes and fully engage them in the process. Chiefs can learn from the private sector regarding the creation of a corporate culture of “respectful” leadership (Meshanko, 2013), or “servant” leadership (Sipe & Frick, 2009) instead of “top-down” or “iron-fist” leadership. Also, agencies should look at the long history of research and theory on organizational behavior as applied to public management (Vasu et al., 1998), with the growing ethical obligation to the fair and just treatment of all employees. When teaching, I often showed a picture of a father spanking his son while saying, “This will teach you not to hit people!” On my desk, I have a poster that reads, “The beatings will continue until morale improves.” In contrast, police managers should live by example and be role models of the behavior they expect from their employees. As Mahatma Gandhi once said, “Be the change you wish to see in the world.”

Finally, I would be naive if I did not acknowledge the importance of external support from local government officials and community leaders. Many police chiefs in the past have failed at reforms because of the numerous political and organizational obstacles they face (Skolnick & Fyfe, 1993), and we should be aware of these challenges. Clearly, a strong working relationship with the external oversight agency involved in data collection is also essential.

7.3 Organizational Change Summary

In summary, internal changes will be needed at all levels of the organization to dramatically increase procedural justice during police-public encounters. The identity of the police should focus on guardianship and meeting the needs of persons seeking their help. Crime-fighting should continue, but “command and control” behaviors should be restricted to those rare situations where the person is persistently nonresponsive to less aggressive approaches or represents an immediate threat to self or others. The public as a whole should not be fearful of the police or view them with negative sentiment.

With the support of community leaders and government officials, police leaders must be willing to introduce difficult changes and respond to inevitable employee pushback and political interference. The benefits to all officers should be emphasized. Sophisticated AI analytics for BWC video data and contact survey data should be used to identify trends in procedural justice at the individual, supervisory, and unit levels. Supervisory interventions should be driven by a comprehensive problem-solving process that seeks to reward procedurally just behaviors and provide guidance in areas where performance could be improved. Work environments with punitive or weak supervision should be replaced by coaching, training, and positive reinforcement. Outside oversight is needed to collect and analyze the new data, and audit the police role in high-quality data collection. When we begin to “measure what matters” to the community and translate this data into organizational changes, we can expect that police behavior, police legitimacy, and public safety will improve.

I should note that accountability programs using contact survey data could easily be extended beyond direct police services to other components of our public safety system. This framework could be used to evaluate civilian units that respond to mental health crises (Beckett et al., 2021), as well as emergency call taker systems, like 911 (Goodier & Lum, 2023; Neusteter et al., 2019), where callers could be asked whether their call was handled in a procedurally just manner.

8 Research Agenda

Before widespread implementation of this new system, I would recommend some additional research and experimentation. Here are several research tasks to consider. The AI project in Dallas could be used as a starting point and the elements further refined.

(1) Identify several large police departments that are open to innovation and currently use BWCs, and invite them to participate as demonstration sites.
(2) Engage knowledgeable stakeholders (police, community, and policing scholars) to identify the procedural justice variables that should be measured with AI and the contact survey. Stakeholders should review and analyze BWC footage from a variety of complaint incidents to help refine the operational definition of procedural justice concepts like “disrespectful” or “resistance.” They may identify multiple types of resistance and cooperation. These descriptions of behavioral patterns will help to refine and focus the AI training and possibly improve the wording on the contact survey. This research may also result in the refinement of important variables not traditionally measured in procedural justice research, such as officers’ competence in answering questions, providing assistance, or showing compassion.
(3) Use BWC footage from participating police departments (and other agencies) to refine the AI algorithms via AI training on variables that emerge from this analysis.
(4) Drawing on existing procedural justice research and identified BWC variables, create a short contact survey that focuses on procedural justice (see Online Appendix for my suggestions).
(5) Create a PPA, independent of the participating police agencies, which will manage the data collection, analysis, and reporting of the survey findings. Skills in survey research, data analysis, auditing, and dashboards will be needed.
(6) Field test the contact survey methodology in these test sites. This would include developing a plan and policy for collecting the data by requiring officers to distribute business cards to community members with a QR code that gives them access to the online contact survey via their phone.
(7) Develop training for officers and supervisors around the new policy, emphasizing the importance of procedural justice in the new system of accountability and performance evaluation, and the benefits to them.
(8) Create an internal auditing unit (or use an existing unit) to ensure that all employees, from police officers to managers, are following the new policies and that supervisors are using data dashboards properly for EIS and interventions.
(9) Prepare for, and introduce, PJ meetings with commanders to begin the accountability process.

Clearly, such a new program would need to be implemented in stages, for example, policy development, training, data analysis, performance evaluations, and accountability meetings. Undoubtedly, agencies will face challenges and resistance when implementing something so new and different, but police leaders will need to be reasonable, transparent, bold, persistent, and procedurally just to overcome these challenges.

9 Summary and Conclusion

Nationwide, we are witnessing a new “crisis of legitimacy” with American policing and the problem is much deeper than the fatal shootings we see in the media. Community members are disappointed with the police for other reasons. Too often, constitutionally protected and vulnerable populations have been mistreated, especially people of color, the LGBTQ+ community, persons with mental illness, victims of crime, the homeless, and youth. In addition, our society has become more openly hostile toward other groups, such as immigrants, Jews, and Muslims, and expects the police to be tough on crime. Also, there are many internal obstacles to police reform and evidence-based policing (Lum & Koper, 2017).

In this Element, I have argued that most police reforms have failed because we continue to use the wrong metrics to evaluate police performance. Public officials and police agencies have yet to systematically measure what matters to the public, namely, how people are treated by the police. A large body of scientific evidence indicates that treating people in a procedurally just manner will improve police-community relations in many ways and should improve public safety.

Hence, I have proposed the systematic collection and analysis of new procedural justice data from BWCs and contact surveys. To change police culture and street-level behavior, however, police organizations must use these data to identify problems in police-community interactions, propose solutions, implement those solutions, and evaluate their effectiveness. Thus, a POP approach should be used to translate this knowledge into evidence-based practice – new policy, training, supervision, and accountability at all levels of the organization.

This new management accountability system should incentivize respectful and empathetic communication. In other words, police performance evaluations, awards, and promotions should be driven not by the number of stops, searches, citations, and arrests performed by an officer, but rather by how the officer treats community members during their interactions. Punitive enforcement accountability should be replaced with positive service accountability.

These reforms, if carefully implemented, should benefit both the police and the communities they serve (Rosenbaum, 2024). The benefits to community members should include (1) more positive, less conflictual, and more satisfying encounters with the police; (2) greater trust and confidence in the police and support for their decision-making; (3) less cynicism about the law and greater willingness to obey the law; (4) greater willingness to work with the police to create effective crime prevention partnerships; (5) less desire to file complaints and lawsuits; (6) increased public safety as a result of these changes; and (7) fewer taxpayer dollars spent on lawsuits against police officers.

For police officers, I envision many benefits that should lower police resistance to this type of innovation and improve their effectiveness in fighting crime. By learning stronger procedural justice, de-escalation, and communication skills, police officers should experience (1) more successful criminal investigations as a result of more cooperative witnesses, victims, bystanders, callers, and suspects, thus producing greater justice and deterrence; (2) more cooperative community members and fewer use of force incidents; (3) fewer misconduct complaints and investigations, thus reducing officers’ fear of punishment; (4) fewer officer safety and mental health problems; (5) greater job satisfaction and less cynicism toward their agency and the community; (6) less “de-policing,” resulting in more crime prevention and increased public safety; and (7) greater agency legitimacy, resulting in fewer protests, negative social media, lawsuits, and defunding campaigns.

9.1 Big Science

In the future, I envision a national system of procedural justice measurement in which thousands of agencies participate. This should become a cost-effective system where cities share the costs and benefits of this program. The costs include data collection and auditing functions, while the benefits would include implementation integrity, data analysis needed for accountability dashboards, and eventually, increased public trust and legitimacy. Another big benefit for science at large is the knowledge generated from conducting intercity comparisons and using city and police demographics as the unit of analysis. As I pointed out earlier, procedural justice varies by agency size. The questions are why this occurs and what we can learn from these differences.
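As a rough illustration of the kind of intercity comparison described above, the following Python sketch summarizes the group means reported in Figure 1 (the procedural justice index as perceived by officers, by agency size). The function names, the band labels, and the `band_means` helper are assumptions introduced here for illustration; they are not part of any existing measurement system, and a real analysis would work from agency-level records rather than five published group means.

```python
from statistics import mean

# Group means reported in Figure 1 (procedural justice index by agency size).
# Band labels paraphrase the figure; bands are ordered from smallest to largest.
FIGURE_1_INDEX = [
    ("<100 sworn", 3.43),
    ("100-199 sworn", 3.25),
    ("200-599 sworn", 3.25),
    ("600-1999 sworn", 3.13),
    ("2000+ sworn", 2.93),
]


def size_gap(bands):
    """Difference between the smallest-agency and largest-agency group means."""
    return round(bands[0][1] - bands[-1][1], 2)


def is_non_increasing(bands):
    """True if the index never rises as agency size increases."""
    scores = [score for _, score in bands]
    return all(a >= b for a, b in zip(scores, scores[1:]))


def band_means(agency_scores):
    """Hypothetical helper: average agency-level scores within each size band.

    agency_scores is a list of (band_label, score) pairs, one per agency.
    """
    grouped = {}
    for band, score in agency_scores:
        grouped.setdefault(band, []).append(score)
    return {band: round(mean(scores), 2) for band, scores in grouped.items()}


if __name__ == "__main__":
    print("Gap between smallest and largest agencies:", size_gap(FIGURE_1_INDEX))
    print("Index never rises with size:", is_non_increasing(FIGURE_1_INDEX))
```

With agency-level data from many cities, a helper like `band_means` could feed the same two summaries, which is one simple way to begin asking why the index differs across agency sizes.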

In the meantime, we need to see a proof-of-concept demonstration and rigorous evaluation in multiple cities. This should involve a genuine partnership among police leaders, policing scholars, and community leaders, with funding from the government and foundations. The result should be a standardized, validated, cost-effective set of metrics and analytics, along with specific, evidence-based organizational changes needed to translate this knowledge into efficient, effective, and equitable policing.

Weisburd and his colleagues (2024) have called for the introduction of “Big Science” with large-scale investments to address the SQF problem. I agree with this recommendation, but would expand the national research agenda to include police-community interactions in general. Big Science should be used to investigate new ways of measuring the effects of police-community interactions and translating this knowledge into practice. After the systems have been developed and field tested for implementation integrity, a large-scale demonstration and evaluation should be funded across multiple cities, including randomized trials to test different methods of implementation. This foundational work is essential for advancing our understanding of police-community encounters and introducing evidence-based policing that will improve procedural justice, police legitimacy, and public safety. Then, we should be ready for a national rollout to all law enforcement agencies in the United States. Welcome to Big Science in the field of policing!

David Weisburd
George Mason University, Virginia
Hebrew University of Jerusalem

Advisory Board

Professor Catrien Bijleveld, VU University Amsterdam
Professor Francis Cullen, University of Cincinnati
Professor Manuel Eisner, Cambridge University
Professor Elizabeth Groff, Temple University
Professor Cynthia Lum, George Mason University
Professor Lorraine Mazerolle, University of Queensland
Professor Daniel Nagin, Carnegie Mellon University
Professor Ojmarrh Mitchell, University of California, Irvine
Professor Alex Piquero, University of Miami
Professor Richard Rosenfeld, University of Missouri

About the Series

Elements in Criminology seeks to identify key contributions in theory and empirical research that help to identify, enable, and stake out advances in contemporary criminology. The series focuses on radical new ways of understanding and framing criminology, whether of place, communities, persons, or situations. The relevance of criminology for preventing and controlling crime is also a key focus of this series.

Long descriptions

Figure 1 Long description

The bars and the data depicted are as follows: 1. Agencies with less than 100 sworn officers: 3.43. 2. Agencies with 100 to 199 sworn officers: 3.25. 3. Agencies with 200 to 599 sworn officers: 3.25. 4. Agencies with 600 to 1999 sworn officers: 3.13. 5. Agencies with more than 2000 sworn officers: 2.93. The height of each bar reflects the overall procedural justice index score for agencies within that group, where multiple survey questions were combined to produce a procedural justice index. The bars get smaller as the agency size increases, thus indicating that procedural justice within law enforcement agencies, as perceived by officers, decreases as agency size increases. This is also referred to as organizational justice (Rosenbaum and McCarty, 2017).

Figure 2 Long description

The bars and the data depicted are as follows: 1. Agencies with less than 100 sworn officers: 86.6 percent. 2. Agencies with 100 to 199 sworn officers: 79.4 percent. 3. Agencies with 200 to 599 sworn officers: 81.0 percent. 4. Agencies with 600 to 1999 sworn officers: 74.4 percent. 5. Agencies with more than 2000 sworn officers: 64.8 percent. The height of each bar reflects the level of employee job satisfaction for agencies in that group. The bars get smaller as the agency size increases, thus indicating that job satisfaction among officers within these agencies decreases as agency size increases.

Figure 3 Long description

The box on the left lists procedural justice-related behaviors, such as respect, voice, fair-unbiased, trustworthy, empathy, emotional control, social etiquette, helpful, competent, and suspiciousness. An arrow from the box on the left leads to the box on the right. The box on the right lists the possible outcomes, such as civilian resistance, citation, search, escalation, arrest, use of force, civilian satisfaction, victim recovery, civilian cooperation, and legitimacy.

Figure 4 Long description

An arrow from individual characteristics leads to PJ behaviors. An arrow from PJ behaviors leads to outcomes. An arrow from context leads to outcomes. An arrow from individual characteristics leads to outcomes. An arrow from context leads to PJ behaviors.

Figure 5 Long description

The horizontal axis represents satisfaction, trust, respect, and unbiased. The data depicted is as follows: 1. Satisfaction: no matches: 2.30; 1 match, any: 2.36; 2 matches, any: 2.69; and fully matched: 2.73. 2. Trust: no matches: 2.67; 1 match, any: 2.78; 2 matches, any: 2.96; and fully matched: 2.96. 3. Respect: no matches: 2.62; 1 match, any: 2.60; 2 matches, any: 2.81; and fully matched: 2.88. 4. Unbiased: no matches: 2.43; 1 match, any: 2.43; 2 matches, any: 2.67; and fully matched: 2.70.

Figure 6 Long description

The data depicted is as follows: 1. Officer listened: 62.0 percent. Did not listen: 8.5 percent. 2. Officer polite: 59.8 percent. Not polite: 4.9 percent. A dotted horizontal line shows that the average level of satisfaction among all drivers who are given a ticket is 42 percent.
