
Appendix 1 - Technical and methodological

Published online by Cambridge University Press:  26 April 2020

Summary

Three corpora form the primary engine for the text-mining used in this book. The first is the ‘East Anglian corpus’, which is composed of election-per-election subsamples of constituency Conservative and Liberal speech for the years 1880 to 1910. It contains approximately a million words. The speeches were taken from the Norfolk and Suffolk press, and each subsample contains equal word counts for each party and for each of the region's sixteen constituencies. The second is the ‘National Speaker corpus’. This is composed of all the extra-parliamentary orations of frontbench politicians (i.e. the leading lights of the main parties, who often held cabinet or shadow cabinet level positions) delivered during election campaigns and reported in The Times. It is similarly subdivided by party and general election year, and contains approximately 1.5 million words. The third is the ‘Constituencies corpus’, which is approximately 1.8 million words in size. It contains subsamples of approximately 75,000–100,000 words per party, per election. Speeches in this corpus were selected according to the digital availability of newspapers through the British Newspaper Archive.

The book also makes use of several special supplementary corpora. The most important are the ‘Liberal Unionist corpus’ and the ‘Labour corpus’. However, others are occasionally employed for in-depth analyses of specific topics: for example, Chapter 2 uses two corpora for East Anglia for 1835 and 1874, and Chapter 4 employs a ‘Pro-Boer corpus’. The Liberal Unionist and Labour corpora are introduced below, but other special supplementary corpora are introduced individually in the main text when they are utilised. In all corpora (main and supplementary) the numerical results generated from each subsection (for example, East Anglian Conservative speeches in 1895, national Liberal speeches in 1900, constituencies Conservatives in 1885 or Liberal Unionists in 1886) are weighted to ratios of 50,000 words per election subsample to enable direct like-for-like comparisons.
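The weighting described above can be illustrated with a short sketch. It assumes the weighting is a simple proportional rescaling of a raw count to a 50,000-word base; the function name `weight_to_base` and the figures in the usage lines are hypothetical and are not taken from the book.

```python
def weight_to_base(raw_count: int, subsample_words: int, base: int = 50_000) -> float:
    """Rescale a raw frequency to its equivalent in a subsample of `base` words.

    Assumption: the weighting is a plain proportional rescaling.
    """
    if subsample_words <= 0:
        raise ValueError("subsample_words must be positive")
    return raw_count * base / subsample_words


# Hypothetical figures, for illustration only:
# a term occurring 120 times in a 60,000-word subsample,
# and 90 times in a 40,000-word subsample.
larger_subsample = weight_to_base(120, 60_000)   # 100.0 per 50,000 words
smaller_subsample = weight_to_base(90, 40_000)   # 112.5 per 50,000 words
```

On these invented figures the raw counts suggest the term is more common in the first subsample, whereas the weighted values show it is relatively more frequent in the second, which is the kind of like-for-like comparison the weighting is meant to enable.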

All corpora are machine-readable text files. They were interrogated primarily with AntConc (a free, simple and powerful corpus analysis program), but other software, such as MALLET and Google Ngram, was occasionally employed.

Anatomy: East Anglian corpus

The East Anglian corpus contains approximately a million words of speech from 1880 to 1910, digitally scanned from newspapers. It is subdivided between the two parties and nine general elections, so it has eighteen subsections. It was compiled according to strict criteria, with each Norfolk and Suffolk constituency equally represented for each party at each election.
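The eighteen subsections follow from crossing the two parties with the nine general elections. The sketch below enumerates them. The election list is taken from the general historical record (the general elections held between 1880 and 1910) and is not quoted from the appendix, and the label format is purely illustrative.

```python
from itertools import product

PARTIES = ("Conservative", "Liberal")

# Assumption: the nine general elections held between 1880 and 1910,
# with the two 1910 contests listed separately.
ELECTIONS = (
    "1880", "1885", "1886", "1892", "1895", "1900", "1906",
    "1910 (January)", "1910 (December)",
)

# One subsection per party per election: 2 x 9 = 18.
SUBSECTIONS = [
    f"{party} {election}" for election, party in product(ELECTIONS, PARTIES)
]
```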

Type: Chapter
Information: The War of Words: The Language of British Elections, 1880–1914, pp. 241–247
Publisher: Boydell & Brewer
Print publication year: 2020
