1. Introduction
In recent years, major new data collection efforts have made it possible to understand Canadian municipal elections and voting behaviour in unprecedented detail. However, despite the exciting progress that these new data sources have enabled, municipal elections scholars have thus far been constrained by the absence of a crucial data source: a comprehensive database of municipal election results.
In this research note, we introduce the Canadian Municipal Elections Database (CMED), the most comprehensive database of municipal election results ever constructed in Canada and one of the largest databases of municipal election results anywhere in the world. The database contains more than 24,000 unique elections across nearly 2,000 municipalities; for six cities, it also contains results stretching back to the nineteenth century. The database is publicly available on Dataverse in easily downloadable formats and is actively maintained by the Canadian Municipal Barometer partnership, with new election results added regularly.Footnote 2
We begin by describing the importance of municipal election results for political science research and outline the construction and content of the CMED. We then illustrate the potential of the database by estimating an important quantity that has never before been accessible to the Canadian political science community: the relationship between gender and electoral outcomes in municipal elections. We conclude by describing other important questions for which the CMED will be immediately relevant, as well as our plans to expand the database with both historical and contemporary elections.
2. Municipal Elections in Political Science
Municipal elections research is growing rapidly (Lucas and McGregor, Reference Lucas, Michael McGregor, Lucas and Michael McGregorforthcoming). One reason for this growth has been a recognition among political scientists that the sheer number of municipal elections, combined with interesting institutional variation across municipalities, makes municipal elections an excellent laboratory for important political science questions. For example, the varying presence and absence of political parties make it possible to better understand how parties structure representation, responsiveness and competition (Breux and Couture, Reference Breux, Couture, Breux and Couture2018; Lucas, Reference Lucas2020b; Tausanovitch and Warshaw, Reference Tausanovitch and Warshaw2014). Similarly, variation in ward versus at-large elections can help us understand how electoral institutions shape representational priorities (Koop and Kraemer, Reference Koop and Kraemer2016; Welch and Bledsoe, Reference Welch and Bledsoe1990). Municipal election results have also enabled research on the causes and consequences of electoral reform (Lucas, Reference Lucas2020a; Santucci, Reference Santucci2017) and the spatial character of electoral cleavages (Doering et al., Reference Doering, Silver and Taylor2020). While scholars must of course be aware of important differences between municipal and regional or national elections, these studies, among many others, highlight how the abundance and variability of municipal elections make it possible to explore research questions that are more difficult to study in elections at other scales.
A second reason for increased interest in municipal elections is a growing recognition of the importance of municipal politics and policy. Canadian municipalities today are large, complex, sophisticated organizations, with responsibility for issues that shape the day-to-day experiences of their residents. This is all the more true in big cities, which not only address bread-and-butter issues of municipal policy but also play a central role in many of the most pressing issues of our era, from immigrant settlement to climate change (Lucas and Smith, Reference Lucas and Smith2019). Several Canadian municipal election studies, both single-case and comparative, focus on municipal elections not only for their larger lessons but also for their importance to the lives of millions of Canadians.
To enable this work, researchers need access to large, comparable and comprehensive municipal election results datasets. Outside Quebec, such studies have been minimal and sporadic. To our knowledge, past studies have been limited to one study of incumbency advantage (Lucas, Reference Lucas2020b), one survey-based study (Kushner and Siegel, Reference Kushner and Siegel1997) and one dataset—closest to the one we develop here—of election outcomes in large municipalities across Canada (Breux et al., Reference Breux, Couture and Koop2017). These datasets have been limited in scope, and most importantly, they have not been publicly available in accessible formats for other researchers.
3. The Canadian Municipal Elections Database
The Canadian Municipal Elections Database (CMED) is a large, public, actively maintained database of Canadian municipal election results. The database currently contains 24,153 unique elections from 1,843 municipalities, with 93,695 distinct observations, each of which represents a single candidate running in a single race. It includes data on votes received, votes cast, position sought, incumbency status, candidate gender, election magnitude (the number of individuals elected in each district), and candidate name for every candidate in every race; in some cases, it also includes additional information such as political party.
The CMED is built from four sources, summarized in Table 1. The first is the Canadian Municipal Barometer elections dataset, which consists of complete election results for the most recent election in every municipality above 9,000 population in Canada—442 municipalities in total. We created this dataset using central databases of election results (Manitoba, New Brunswick, Newfoundland and Labrador, Ontario), downloadable election results datasets (Alberta, British Columbia, Quebec) and official election results reported on municipal websites or provided to us by municipal clerks (Nova Scotia, Saskatchewan, Prince Edward Island). We then cleaned and standardized variables (such as names and vote totals) and calculated other variables (district magnitude and total votes). In some provinces (British Columbia, Alberta, Quebec, New Brunswick), candidates’ gender was included in official results; in all other cases, we manually coded candidate gender using candidate names, and for all ambiguous names, we verified candidate gender using candidates’ self-descriptions and pronoun use on their websites and by searching for photographs of candidates in online news coverage of the relevant elections. To confirm these codes, we then used a statistical inference package (Blevins et al., Reference Blevins and Mullen2015) to infer the probability of each candidate's gender based on their first name, and we manually double-checked all candidates for whom the probability indicated substantial ambiguity (40–60% probability of a female name) along with any individuals whose probability score did not agree with our manual code (for example, candidates coded as female whose female probability was below 50%). Finally, for all candidates who were successfully elected, we visited the websites of each of the 442 municipalities in the dataset to verify gender using official photographs.
The second data source is the Canadian Urban Democracy dataset, part of a larger project on the historical development of urban democratic institutions in Canada. This dataset contains complete election results for six of Canada's largest cities—Calgary, Edmonton, Montreal, Toronto, Vancouver and Winnipeg—for every election back to the nineteenth century. It was constructed by manually digitizing election results based on archival paper records, which we photographed in each city's municipal archives, as well as microfilm newspaper records when archival results were unavailable. We focused our attention on securing official election results, rather than results from newspapers or other sources, even when doing so required travel to archives to photograph results, because the city clerks who prepared the official results had a professional obligation to report results clearly and accurately. Given the historical character of the results, we acknowledge that we have no way of verifying the quality of the counting procedures or record-keeping from which we draw the results, except to note that available archival correspondence and newspaper coverage rarely indicate concerns about the quality of the official results or question the final vote counts. In Montreal, candidates’ political party affiliations were provided as part of the official results, but in other cities we added each candidate's political party affiliation using secondary sources and inspection of microfilm newspaper coverage of hundreds of historical elections. We then manually coded candidates’ gender using candidate names and, for more recent elections, by searching for candidates online using both Google searches and the Internet Archive's Wayback Machine. We once again used a gender inference package to identify ambiguous names and manually double-checked those names using microfilm newspaper coverage and online searches. In many cases, historical coding of candidates’ gender was aided by gendered reporting conventions, in which all female candidates were referred to as “Miss” or “Mrs.” in official election results. This data source includes 5,519 distinct elections across the six cities.
The remaining elections in the CMED are drawn from publicly available official results in British Columbia and Quebec. For British Columbia, we include results for 1,757 elections from CivicInfo BC, which provides results for British Columbia local elections since 2004. For Quebec, we include official government-reported results from 14,113 elections, which includes all municipal elections since 2005. The British Columbia results required little manipulation to add to the database. In Quebec, available government data was not sufficient to distinguish ward and at-large elections, which is necessary to construct our district magnitude variable. We therefore manually coded each of the more than 1,000 municipalities in Quebec for at-large or ward elections for each election year in the dataset. We did so with archival provincial government records using the Internet Archive's Wayback Machine.
We used a variety of strategies to identify and correct errors in the data, which we describe in more detail in the supplementary materials. For example, we checked hand-coded total vote counts with machine-calculated totals to identify and correct typographical errors. We also used machine-calculated vote shares to identify errors—flagging, checking and fixing cases with logically impossible vote shares (for example, above 100%) or extreme values (0% vote share and above 90% vote share). Finally, we checked vote share distributions for each district magnitude to visually identify and manually check outliers. Our Dataverse repository provides clear contact information so that researchers can contact us to correct any remaining errors and update the dataset.
Table 2 provides a summary of the available data in the CMED. As the table illustrates, the dataset is strongest for very large cities and most comprehensive for the most recent election, for which we have data for every municipality in Canada with a population of at least 9,000 and for all municipalities in Quebec and British Columbia. We emphasize that the results in the dataset are not samples but rather complete election results for the municipalities listed in the table.
4. Municipal Elections and Women's Electoral Performance
To illustrate the value of the CMED for political science research, we now briefly analyze an important question in current research: the presence and size of gender differences in electoral performance. This question has been explored in past research in a variety of contexts. In the United States, studies of the US House of Representatives and Senate have suggested that women do not receive fewer votes than men in recent elections (Darcy and Slavin, Reference Darcy and Slavin1977; Smith and Fox, Reference Smith and Fox2001). In Canada, a recent study of federal elections (Sevi et al., Reference Sevi, Arel-Bundock and Blais2019) finds that the gender gap in vote share has decreased over time and is statistically indistinguishable from zero today; this study aligns with past findings at the federal level (Black and Erickson, Reference Black and Erickson2003; Hunter and Denton, Reference Hunter and Denton2008) and in federal ridings in Quebec (Tremblay, Reference Tremblay1995).
Because of the lack of election results data, Canadian studies of gender and municipal representation have focused on women's presence as a proportion of city councils. This work has generally found that women are not better represented at the municipal level than in provincial or federal legislatures (Tolley, Reference Tolley2011; Ogilvie et al., Reference Ogilvie, Tulk and Plana2020). Recent studies have deepened this work by exploring the predictors of women's municipal representation across institutions, population sizes and municipal positions (Breux et al., Reference Breux, Couture and Koop2019; Gavan-Koop and Smith, Reference Gavan-Koop and Smith2008; Sullivan, Reference Sullivan2019).
Our analysis provides a first look at women's electoral performance in Canadian municipal elections; we restrict our focus to elections after 2000, allowing us to bracket the interesting but separate question of the history of the gender gap in women's electoral performance over time. We focus on a dichotomous outcome variable in our analysis—elected (1) or not (0)—for two reasons. First, variation in municipal district magnitude makes vote-share-based analysis less informative at the municipal scale. Second, our substantive interest is in who actually wins municipal elections; focusing on winners and losers is a more stringent and substantively meaningful test.
Our estimate of the relationship between gender and electoral performance could potentially be contaminated by temporal and spatial heterogeneity across elections. For instance, if women are more likely to run for office in recent elections, and if, for any reason, all candidates are more likely to win in recent elections than in the past (for example, if municipal elections are becoming less competitive over time), an estimate of gender and electoral performance that did not account for this time trend would be biased. The same could be true if competition varied across municipalities in ways that were also related to women's likelihood to run. We therefore include fixed effects (FEs) for election year and municipality to eliminate unobserved temporal and spatial heterogeneity from our estimate; the resulting coefficient can be interpreted as an estimate of the average difference in electoral performance between women and men within municipalities and election years. In addition, because incumbent success rates are high in Canadian municipalities, we control for incumbent status. For ease of interpretation, and because the linear probability model may be preferable to maximum likelihood estimators in models containing a large number of fixed effects, we fit an ordinary least squares (OLS) model in the main text. Our findings are robust to alternative specifications and estimators; we show in the supplementary materials that our results are substantively identical with a variety of alternative specifications as well as a logistic regression model.
Table 3 summarizes our analysis. Overall, women are 6 percentage points more likely than men to win municipal elections.Footnote 3 We show in the supplementary materials that this relationship is substantively large and statistically significant across municipal population sizes and regions; we also show that the relationship may be more pronounced among council candidates when compared with candidates for mayor. In addition to the results in Table 1, to enable direct comparison with previous studies at the federal level, we estimate 3.2 per cent higher vote share for women than men in municipal single-member plurality elections; this estimate is very similar to recent federal estimates (Sevi et al., Reference Sevi, Arel-Bundock and Blais2019).Footnote 4
Note: Standard errors in parentheses.
* p < .1; **p < .05; *** p < .01.
5. Conclusion
This research note has introduced the CMED and provided the first ever pan-Canadian estimate of gender differences in municipal electoral performance. This is just one example of the many questions that researchers might explore using the CMED. More in-depth research on gender and election outcomes would, of course, be a valuable extension to the findings we report here; for instance, preliminary analysis suggests that women are more likely than men to win municipal elections only in council elections, rather than mayoral elections.Footnote 5 This possibility, among others, is worthy of further research. Others may wish to use the database to explore patterns of incumbent success and incumbency advantage, electoral competitiveness, municipal turnout or challenger entry—these questions, and many others, are more tractable now that the CMED is available. Combined with municipal elections datasets in the United States (for example, Benedictis-Kessner, Reference Benedictis-Kessner2017; Benedictis-Kessner and Warshaw, Reference Benedictis-Kessner and Warshaw2020; Ferreira and Gyourko, Reference Ferreira and Gyourko2009), the CMED will also enable scholars to explore comparative municipal elections research.
We also hope that this research note will serve as a catalyst for the further growth of the CMED across space (more municipalities) and time (more historical results). By clarifying the value of accessible election results data, we hope to persuade municipal officials, provincial governments and municipal associations to make election results available in complete and downloadable formats, thus reducing the need for manual data entry and enabling more widespread use of the data. We also hope that the CMED will inspire municipal archivists to make election results publicly available online, reducing the need for costly trips to municipal archives and enabling innovative digitization efforts using crowd-sourcing methods (Sumner et al., Reference Sumner, Farris and Holman2020) or university-based research teams.
Supplementary material
To view supplementary material for this article, please visit https://doi.org/10.1017/S000842392000102X
Acknowledgments
We are grateful to the Social Sciences and Humanities Research Council (SSHRC) for financial support and to Luke Battcock, Alexandra Champagne, Amber Chan, Mark Grosjean, Makda Habtegergesa, Sonia Kelly, Andrew Klain, Jessica Weber and Saba Zanab for research assistance. This article is dedicated to the memory of Mark Grosjean (August 21, 1995–March 12, 2018).