TY - JOUR
T1 - Multiagent-based processing and integration of system data
AU - Kalegele, Khamisi
AU - Sveholm, Johan
AU - Takahashi, Hideyuki
AU - Sasai, Kazuto
AU - Kitagata, Gen
AU - Kinoshita, Tetsuo
N1 - Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2013
Y1 - 2013
N2 - This paper presents a multiagent-based ETL (Extract, Transform, Load) unit for the processing and integration of system operational data in order to improve its value. Operational data plays a vital role in managing and optimising systems. Although KDD (Knowledge Discovery and Data Mining) techniques and concepts have long existed, it is only now that we are seeing real applications being extended onto network and systems management. However, the massive data pre-processing (e.g. feature extraction and data integration) which is needed prior to putting KDD tools in action, is still limiting the extent of exploitation. We propose and design the multiagent-based ETL unit which uses Support Vector Machine and Natural Language Processing techniques to efficiently extract information features from operational data. The unit uses an mSPIDER algorithm to discover INclusion Dependencies (INDs) which are used to integrate data across its peers within the system. We demonstrate efficiency of the unit and the used approaches using operational data from a mailing system.
AB - This paper presents a multiagent-based ETL (Extract, Transform, Load) unit for the processing and integration of system operational data in order to improve its value. Operational data plays a vital role in managing and optimising systems. Although KDD (Knowledge Discovery and Data Mining) techniques and concepts have long existed, it is only now that we are seeing real applications being extended onto network and systems management. However, the massive data pre-processing (e.g. feature extraction and data integration) which is needed prior to putting KDD tools in action, is still limiting the extent of exploitation. We propose and design the multiagent-based ETL unit which uses Support Vector Machine and Natural Language Processing techniques to efficiently extract information features from operational data. The unit uses an mSPIDER algorithm to discover INclusion Dependencies (INDs) which are used to integrate data across its peers within the system. We demonstrate efficiency of the unit and the used approaches using operational data from a mailing system.
KW - Data integration
KW - Multiagent system
KW - Network and systems management
UR - http://www.scopus.com/inward/record.url?scp=84891946667&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84891946667&partnerID=8YFLogxK
U2 - 10.1504/IJISTA.2013.056207
DO - 10.1504/IJISTA.2013.056207
M3 - Article
AN - SCOPUS:84891946667
SN - 1740-8865
VL - 12
SP - 128
EP - 155
JO - International Journal of Intelligent Systems Technologies and Applications
JF - International Journal of Intelligent Systems Technologies and Applications
IS - 2
ER -