Comparing Manual Text Patterns and Machine Learning for Classification of E-Mails for Automatic Answering by a Government Agency

Dalianis, Hercules; Sjöbergh, Jonas; Sneiders, Eriks

doi:10.1007/978-3-642-19437-5_19

Hercules Dalianis¹⁷,
Jonas Sjöbergh¹⁸ &
Eriks Sneiders¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6609))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

1305 Accesses
6 Citations

Abstract

E-mails to government institutions as well as to large companies may contain a large proportion of queries that can be answered in a uniform way. We analysed and manually annotated 4,404 e-mails from citizens to the Swedish Social Insurance Agency, and compared two methods for detecting answerable e-mails: manually-created text patterns (rule-based) and machine learning-based methods. We found that the text pattern-based method gave much higher precision at 89 percent than the machine learning-based method that gave only 63 percent precision. The recall was slightly higher (66 percent) for the machine learning-based methods than for the text patterns (47 percent). We also found that 23 percent of the total e-mail flow was processed by the automatic e-mail answering system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Busemann, S., Schmeier, S., Arens, R.G.: Message classification in the call center. In: Proceedings of the Sixth Conference on Applied Natural Language Processing, Seattle, Washington, pp. 158–165. ACL (2000)
Google Scholar
Scheffer, T.: E-mail answering assistance by semi-supervised text classification. Intelligent Data Analysis 8(5), 481–493 (2004)
Google Scholar
Lapalme, G., Kosseim, L.: Mercure: Towards an automatic e-mail follow-up system. IEEE Computational Intelligence Bulletin 2(1), 14–18 (2003)
Google Scholar
Sneiders, E.: Automated E-mail Answering by Text Pattern Matching. In: Loftsson, H., Rögnvaldsson, E., Helgadóttir, S. (eds.) IceTAL 2010. LNCS, vol. 6233, pp. 381–392. Springer, Heidelberg (2010)
Chapter Google Scholar
Dalianis, H., Rosell, M., Sneiders, E.: Clustering E-mails for the swedish social insurance agency – what part of the E-mail thread gives the best quality? In: Loftsson, H., Rögnvaldsson, E., Helgadóttir, S. (eds.) IceTAL 2010. LNCS, vol. 6233, pp. 115–120. Springer, Heidelberg (2010)
Chapter Google Scholar
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA Data Mining Software: An Update. SIGKDD Explorations 11( 1) (2005)
Google Scholar
Cohn, D.A., Zoubin, G., Michael, I.J.: Active learning with statistical models. Journal of Artificial Intelligence Research 4, 129–145 (1996)
Google Scholar
Lampert, A., Dale, R., Paris, C.: Detecting Emails Containing Requests for Action. In: The Proceeding of Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL-HLT, Los Angeles, pp. 984–992 (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer and Systems Sciences (DSV), Stockholm University, Forum 100, SE-164 40, Kista, Sweden
Hercules Dalianis & Eriks Sneiders
KTH CSC, SE-100 44, Stockholm, Sweden
Jonas Sjöbergh

Authors

Hercules Dalianis
View author publications
You can also search for this author in PubMed Google Scholar
Jonas Sjöbergh
View author publications
You can also search for this author in PubMed Google Scholar
Eriks Sneiders
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Computing Research, National Polytechnic Institute, Mexico
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dalianis, H., Sjöbergh, J., Sneiders, E. (2011). Comparing Manual Text Patterns and Machine Learning for Classification of E-Mails for Automatic Answering by a Government Agency. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2011. Lecture Notes in Computer Science, vol 6609. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19437-5_19

Download citation

DOI: https://doi.org/10.1007/978-3-642-19437-5_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19436-8
Online ISBN: 978-3-642-19437-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics