Stylometry: Quantitative Investigation Into the Characteristics of an Author's Style

2496 Words5 Pages

Stylometry is a quantitative investigation into the characteristics of an author’s style. Lann (1995) defines the term as a technique “to grasp the often elusive character of an author's style, or at least part of it, by quantifying some of its features” (1995:271). Matthews and Merriam (1993) agree claiming “Stylometry attempts to capture quantitatively the essence of an individual’s use of language” (1993:203). To put it simply, stylometric analysis is an approach to the investigation of characteristics within literary works through numerical quantitative methods. The relationship between quantitative aspects and literary phenomena is very old. Numerous studies have attempted to explain the stylistic and linguistic properties of authors in terms of quantitative methods and these have been more developed with the availability of computational methods since these methods are accepted by many as more accurate than non-computational ones. Many scholars (Jockers et al., 2008; Altintas et al., 2007; Burrows, 2007; Burrows, 2005; Paton and Can, 2004; Burrows, 2003; Holmes, 1998; Holmes and Forsyth, 1995; Burrows, 1987) agree that the development of computational methods has enhanced the efficiency and accuracy of stylometric studies since computer systems have capacities for analyzing large quantities of data. In turn, Stylometry is often met by objections from many critics. They argue that the computational approach of Stylometry can never give results that can be universally accepted as definitive. (Delcourt, 1992; Smith, 1992; Smith, 1985). Holmes (1998; 1994) argues that there are two main problems about Stylometry that inhibit its acceptance within humanities scholarship. First, there is no consensus as to correct methodology o... ... middle of paper ... ...variate analysis techniques including PCA, factor analysis, discriminant analysis, and cluster analysis have been successfully used (Burrows, 2007; Burrows, 2003; Holmes, 1998)  The use of rare words The main assumption behind this testing is that the use of rare words is a good indication for determining the author of a given text. The basic argument is that the use of rare words enables one writer to be distinguished from another. Morton (1986) explains “The once occurring words convey many of the elements thought to show excellence in writing, the range of a writer's interests, the precision of his observation, the imaginative power of his comparisons, they demonstrate his command of rhythm and of alternations” (1986: 1). To put it simply, rare words are quite noticeable which makes it easier and accurate to use them as an indicator for determining authors.

Open Document