A data quality management of chain stores based on outlier detection

Linh Nguyen, Tsukasa Ishigaki

研究成果: Conference contribution

抄録

For successfully analyzing data in the business of chain stores, the quality of data recorded in their shops or factories is a key factor. Data quality management is an important practical issue because data qualities widely vary depending on the managers or workers of many stores in the chain. In this paper, we present a data quality evaluation method for shops in chain businesses based on outlier detection and then, we apply this method to a dataset observed in real chain stores, which provide tire maintenance for vehicles. To evaluate the data quality of each shop, we use data about trucks tire information such as tread depth, tread pattern, and distance which was recorded by the shops at maintenance time to calculate low-quality data by using outlier detection methods with reliable experimental data and practical knowledge. Some outlier detection methods such as Isolation Forest and one-class Support Vector Machine are applied to detect anomalous tire information, which is used to calculate datas abnormal rate in each shop. Our result showed that with this kind of data, Isolation Forest is outstanding than other methods because Isolation Forest is designed to detect few and different outliers. The proposed method can support better maintenance services for customers as well as be able to get more correct data from these shops, which will be useful for the next research.

本文言語English
ホスト出版物のタイトルAdvanced Studies in Classification and Data Science, IFCS 2017
編集者Tadashi Imaizumi, Akinori Okada, Sadaaki Miyamoto, Fumitake Sakaori, Yoshiro Yamamoto, Maurizio Vichi
出版社Springer Science and Business Media Deutschland GmbH
ページ341-353
ページ数13
ISBN(印刷版)9789811533105
DOI
出版ステータスPublished - 2020
イベントBiennial Conference of the International Federation of Classification Societies, IFCS 2017 - Tokyo, Japan
継続期間: 2017 8月 82017 8月 10

出版物シリーズ

名前Studies in Classification, Data Analysis, and Knowledge Organization
ISSN(印刷版)1431-8814
ISSN(電子版)2198-3321

Conference

ConferenceBiennial Conference of the International Federation of Classification Societies, IFCS 2017
国/地域Japan
CityTokyo
Period17/8/817/8/10

ASJC Scopus subject areas

  • コンピュータ サイエンスの応用
  • 情報システム
  • 情報システムおよび情報管理
  • 分析

フィンガープリント

「A data quality management of chain stores based on outlier detection」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル