Content classification of accounting documents using machine learning: The relevant facts case

Authors

  • Brunna Hisla da Silva Sena UnB
  • César Augusto Tibúrcio Silva UnB
  • Roberto Ternes Arrial UnB

DOI:

https://doi.org/10.17524/repec.v4i2.199

Keywords:

Machine learning, Bayesian learning, Relevant facts, Content analysis

Abstract

The analysis of narrative texts content has been more often studied in recent years. In several works research is noticed in relation to readability, comprehensiveness and level of optimism, pessimism or neutrality. However, the classification analysis regarding their optimistic, pessimistic or neutral trends has been proven burdensome, because it demands human analysis of texts, justifying the creation of more rapid and objective text analysis procedures, besides the attempt to reduce subjectivity. Therefore, the objective of this work is to propose an automatic classification of the accounting relevant facts , by making an analysis of narrative texts content using computational tools for text reading and classification. The idea is to try to contribute with an example of machine learning application to Accounting Science. The analysis in this work used relevant facts previously analyzed in the study by Pereira and Silva (2008). The already classified facts were used as training set for the program, so that it could classify other unknown and not-classified data.

Published

2010-08-17

How to Cite

Sena, B. H. da S., Silva, C. A. T., & Arrial, R. T. (2010). Content classification of accounting documents using machine learning: The relevant facts case. Journal of Education and Research in Accounting (REPeC), 4(2), 23–42. https://doi.org/10.17524/repec.v4i2.199

Issue

Section

Articles