An Elementary Formal Categorization of a Corpus of Spelling Errors

Smolenaars, Anton J.

doi:10.1007/978-3-642-83943-6_13

Anton J. Smolenaars²

Part of the book series: Recent Research in Psychology ((PSYCHOLOGY))

120 Accesses
1 Citations

Abstract

If we take the complete mastery of orthography to be the productive knowledge of all spelling rules that exist within a language, then the orthography of an individual word can be identified with some subset of those rules. Accordingly, relations between words’ orthographies can be framed in terms of formal relations between those subsets and partitioned into four categories:

identity: two words’ subsets contain the same elements, i.e. the two orthographies contain the same spelling rules
exclusion: two words do not have any rule in common
inclusion: word a’s orthography contains all the rules of that of word b, plus some more
overlap: two words have some — but not all — rules in common. This category naturally is accompanied by some measure for the amount of overlap

The paper describes an algorithm to infer this partition from (co)variations of misspellings of words. The need for such an inference arises from (i) the want of a basis for a taxonomy of spelling errors which is less superficial or ad hoc than the common ones, and from (ii) the requirement to have an independent test for the psychological validity of recently developed software that converts ’sounds’ to orthography for Dutch words. Eventually the inference is the first phase of a broad heuristic to detect the underlying cognitive processes that govern the behavior of persons who are in the process of building up the orthography of their mother’s tongue, say children of about ten. These processes are not necessarily homomorph to the ’institutional’ ones, i.e. the rules from textbooks or teachers.

The kernel of the algorithm consists of 3 independent criteria. The first criterion effects a relation between the scope of the deficit as to the orthography of some word and the likelihood of it distribution of spellings. The second one connects the relative commonality (’covariance’) of two spelling distributions to inclusion and overlap. The third criterion, heterogeneity, focuses on the amount of difference between spellings.

Each triple of words effects three pairs; allocation to one of the four categories of two of them puts constraints on the allocation of the third one. For instance: if a includes b and b and c have overlap, then c cannot include a, nor can c and a exclude each other. These ’triple constraints’ are utilized to handle the stochastic aspect of the inference, which in itself is rather opaque: the algorithm has one free ’bias’-parameter which is assumed to be optimal when the number of violations of the ’triple constraints’ is minimal.

The results of the application of the algorithm to real data are given.

It is indicated how this elementary categorization could be a precursor to a taxonomy of spelling errors, how it could reveal differences between ’normals’ and ’dyslectics’, and how it leads towards the notion of ’effective extent of errors’: a way of counting the number of independent sources of error.

As the algorithm is purely formal, in the sense that no assumptions are involved that are specific to spelling processes, it can be generalized to all domains where incorrect cognitive performances can be discretely and finitely differentiated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bosch, K. van den, Elshout, J.J. and Langereis, M.P. (1986) Spellingstrategieen bij tweede en vijfde klas basisschoolleerlingen’ [Spelling strategies of 4th and 7th grade elementary school pupils,]. Unpublished manuscript, Psychological Laboratory, University of Amsterdam.
Google Scholar
Tversky, A (1977) Features of similarity. Psychological Review, 84, 327–352.
Google Scholar
Verheyden, H. (1984). Een programma ter sturing van het spellingsproces [A program for coaching the spelling process] In H.J.Breimer and E.J.W.M. van Hees (Eds.) Technologie in het onderwijs Onderwijs Research Dagen 1984 (pp. 63–68 ), Lisse, The Netherlands: Swets and Zeitlinger.
Google Scholar
Verheyden,H. (1985). De diagnose van spellingfouten’ [The diagnosis of spelling errors]. In P.W.Verhagen and B.J.Wielinga (Eds.). Media in het onderwijs Onderwijs Research Dagen 1985 (pp 43–50 ) Lisse, The Netherlands: Swets and Zeitlinger.
Google Scholar

Download references

Author information

Authors and Affiliations

Psychological Laboratory, University of Amsterdam, Weesperplein 8, 1018 XA, Amsterdam, The Netherlands
Dr Anton J. Smolenaars

Authors

Dr Anton J. Smolenaars
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Psychology, University of Nijmegen, P.O. Box 9104, 6500 HE, Nijmegen, The Netherlands
Edward E. Roskam

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Smolenaars, A.J. (1989). An Elementary Formal Categorization of a Corpus of Spelling Errors. In: Roskam, E.E. (eds) Mathematical Psychology in Progress. Recent Research in Psychology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-83943-6_13

Download citation

DOI: https://doi.org/10.1007/978-3-642-83943-6_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-51686-6
Online ISBN: 978-3-642-83943-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics