A text mining system for deviation detection in financial documents

Attempts to mine text documents to discover deviations or anomalies have increased in recent years due to the elevated amount of textual data in today's data repositories. Text mining assists in uncovering hidden information contents across multiple documents.Although various text mining tools...

全面介紹

Saved in:
書目詳細資料
Main Authors: Kamaruddin, Siti Sakira, Abu Bakar, Azuraliza, Hamdan, Abdul Razak, Mat Nor, Fauzias, Ahmad Nazri, Mohd Zakree, Ali Othman, Zulaiha, Hussein, Ghassan Saleh
格式: Article
出版: IOS Press 2015
主題:
在線閱讀:http://repo.uum.edu.my/16453/
http://doi.org/10.3233/IDA-150768
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:Attempts to mine text documents to discover deviations or anomalies have increased in recent years due to the elevated amount of textual data in today's data repositories. Text mining assists in uncovering hidden information contents across multiple documents.Although various text mining tools are available, their focus is mainly to assist in data summarization or document classification. These tasks proved to be helpful, however; they do not provide semantic analysis and rigorous textual comparison to detect abnormal sentences that exist in the documents. In this paper, we describe a text mining system that is able to detect sentence deviations from a collection of financial documents.The system implements a dissimilarity function to compare sentences represented as graphs. Our evaluation on the proposed system revolves around experiments using financial statements of a bank. The findings provide valid evidence that the proposed system is able to identify deviating sentences occurring in the documents. The detected deviations can be beneficial for the authorities in order to improve their business decisions.