Hostname: page-component-78c5997874-g7gxr Total loading time: 0 Render date: 2024-11-15T16:36:14.819Z Has data issue: false hasContentIssue false

Information services for novelty mining

Published online by Cambridge University Press:  21 March 2014

Flora S. Tsai
Affiliation:
Northwest Indian College, Bellingham, WA 98226, USA; e-mail: fst1@columbia.edu, atkwee@yahoo.ca
Agus T. Kwee
Affiliation:
Northwest Indian College, Bellingham, WA 98226, USA; e-mail: fst1@columbia.edu, atkwee@yahoo.ca

Abstract

Information services facilitate users to exploit applications over the network and access them from the remote system at the client side. In this paper, we describe the design and development of information services for novelty mining, which allows users to access the novel yet relevant information of a given topic. Several methodologies regarding novelty mining such as novelty scoring, novelty threshold, novelty feedback, and document-to-sentence technique are described. In addition to Web services, mobile information services are also described. Modelling and implementing information services for novelty mining are especially useful for users to reduce their information overload. We describe the challenging issue of decomposing the complex novelty mining application into several smaller and simpler modules, which are later implemented as services on the Web as well as mobile devices. After deploying our information services for novelty mining, test cases are provided to demonstrate the system. Our information services for novelty mining are confirmed to be helpful in increasing the efficiency of enterprise users in gathering novel information from incoming text. By studying the design and development of information services for novelty mining, we can benefit other developers in investigating effective techniques for developing enterprise services for other real-world applications.

Type
Articles
Copyright
Copyright © Cambridge University Press 2014 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Allan, J., Wade, C., Bolivar, A. 2003. Retrieval and novelty detection at the sentence level. In SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada, 314–321.Google Scholar
Chen, Y., Tsai, F. S., Chan, K. L. 2007. Blog search and mining in the business domain. In DDDM ‘07: Proceedings of the 2007 International Workshop on Domain Driven Data Mining, ACM, New York, NY, USA, 55–60.Google Scholar
Hugo, H., Allan, B. 2004. Web Services Glossary. http://www.w3.org/TR/ws-gloss/Google Scholar
Kwee, A. T., Tsai, F. S., Tang, W. 2009. Sentence-level novelty detection in English and Malay. Lecture Notes in Computer Science (LNCS), Springer Berlin/Heidelberg, 5476, 40–51.Google Scholar
Liang, H., Tsai, F. S., Kwee, A. T. 2009. Detecting novel business blogs. In ICICS 2009—Conference Proceedings of the 7th International Conference on Information, Communications and Signal Processing, Macau, China, 1–5.Google Scholar
Ng, K. W., Tsai, F. S., Chen, L., Goh, K. C. 2007. Novelty detection for text documents using named entity recognition. In 2007 6th International Conference on Information, Communications and Signal Processing, ICICS, Singapore, Republic of Singapore, 1–5.Google Scholar
Ong, C. L., Kwee, A., Tsai, F. 2009. Database optimization for novelty detection. In ICICS 2009—Conference Proceedings of the 7th International Conference on Information, Communications and Signal Processing, Macau, China, 1–5.Google Scholar
Perez-Marin, D., Pascual-Nieto, I., Rodriguez, P. 2009. Computer-assisted assessment of free-text answers. The Knowledge Engineering Review 24(4), 353374.CrossRefGoogle Scholar
Soboroff, I. 2004. Overview of the TREC 2004 Novelty Track. In Proceedings of TREC 2004—the 13th Text Retrieval Conference, Gaithersburg, Maryland, USA, 1–16.Google Scholar
Tang, W., Tsai, F. S. 2009. Threshold setting and performance monitoring for novel text mining. In Society for Industrial and Applied Mathematics—9th SIAM International Conference on Data Mining, Proceedings in Applied Mathematics 3, Sparks, Nevada, USA, 1310–1319.Google Scholar
Tang, W., Tsai, F. S., Chen, L. 2010. Blended metrics for novel sentence mining. Expert Systems with Applications 37(7), 51725177.CrossRefGoogle Scholar
Tsai, F. S. 2010a. Comparative study of dimensionality reduction techniques for data visualization. Journal of Artificial Intelligence 3(3), 119134.CrossRefGoogle Scholar
Tsai, F. S. 2010b. Review of techniques for intelligent novelty mining. Information Technology Journal 9(6), 12551261.CrossRefGoogle Scholar
Tsai, F. S., Etoh, M., Xie, X., Lee, W.-C., Yang, Q. 2010a. Introduction to mobile information retrieval. IEEE Intelligent Systems 25(1), 1115.CrossRefGoogle Scholar
Tsai, F. S., Han, W., Xu, J., Chua, H. C. 2009. Design and development of a mobile peer-to-peer social networking application. Expert Systems with Applications 36(8), 1107711087.CrossRefGoogle Scholar
Tsai, F. S., Tang, W., Chan, K. L. 2010b. Evaluation of metrics for sentence-level novelty mining. Information Sciences 180(12), 23592374.CrossRefGoogle Scholar
Tsai, F. S., Zhang, Y. 2011. D2S: document-to-sentence framework for novelty detection. Knowledge and Information Systems 29(2), 419433.CrossRefGoogle Scholar
Zhang, Y., Callan, J., Minka, T. 2002. Novelty and redundancy detection in adaptive filtering. In SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland, 81–88.Google Scholar
Zhang, Y., Tsai, F. S. 2009a. Combining named entities and tags for novel sentence detection. In Proceedings of the WSDM'2009 ACM Workshop on Exploiting Semantic Annotations in Information Retrieval, ESAIR 2009, Barcelona, Spain, 30–34.Google Scholar
Zhang, Y., Tsai, F. S. 2009b. Chinese novelty mining. In EMNLP 2009: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Singapore, Republic of Singapore, 1561–1570.Google Scholar
Zhang, Y., Tsai, F. S., Kwee, A. T. 2011. Multilingual sentence categorization and novelty mining. Information Processing and Management: An International Journal 47(5), 667675.CrossRefGoogle Scholar