TY - GEN
T1 - Handling missing values via decomposition of the conditioned set
AU - Shyu, Mei Ling
AU - Kuruppu-Appuhamilage, Indika Priyantha
AU - Chen, Shu Ching
AU - Chang, Li Wu
N1 - Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2005
Y1 - 2005
AB - Because real-world databases are seldom complete, this paper proposes a framework for replacing missing values in a database. Good data quality can directly improve the performance of data mining algorithms in various applications. The framework adopts basic concepts from conditional probability theory and develops an algorithm that handles both nominal and numerical values, addressing the inability of existing algorithms to handle both types with a high degree of accuracy. Experimental results demonstrate that the framework achieves high accuracy compared with most commonly used approaches, such as replacing missing values with the average, maximum, or minimum value.
UR - http://www.scopus.com/inward/record.url?scp=33745697521&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33745697521&partnerID=8YFLogxK
U2 - 10.1109/IRI-05.2005.1506473
DO - 10.1109/IRI-05.2005.1506473
M3 - Conference contribution
AN - SCOPUS:33745697521
SN - 0780390938
SN - 9780780390935
T3 - Proceedings of the 2005 IEEE International Conference on Information Reuse and Integration, IRI - 2005
SP - 199
EP - 204
BT - Proceedings of the 2005 IEEE International Conference on Information Reuse and Integration, IRI - 2005
T2 - 2005 IEEE International Conference on Information Reuse and Integration, IRI - 2005
Y2 - 15 August 2005 through 17 August 2005
ER -