Please use this identifier to cite or link to this item: http://repositorio.ufla.br/jspui/handle/1/30140
Title: Incremental missing data imputation via modified granular evolving fuzzy model
Other Titles: Imputação incremental de dados faltantes via modelo granular fuzzy evolutivo modificado
Authors: Leite, Daniel Furtado
Esmin, Ahmed Ali Abdalla
Camargo, Heloisa de Arruda
Cintra, Marcos Evandro
Keywords: Evolving intelligence
Fuzzy systems
Data stream
Incremental learning
Missing data imputation
Inteligência em evolução
Sistemas Fuzzy
Fluxo de dados
Aprendizagem incremental
Imputação de dados perdidos
Issue Date: 22-Aug-2018
Publisher: Universidade Federal de Lavras
Citation: GARCIA, C. M. Incremental missing data imputation via modified granular evolving fuzzy model. 2018. 71 p. Dissertação (Mestrado em Engenharia de Sistemas e Automação)-Universidade Federal de Lavras, Lavras, 2018.
Abstract: Large amounts of data have been produced daily. Extracting information and knowledge from data is meaningful for many purposes and endeavors, such as prediction of future values of time series, classification, semi-supervised learning and control. Computational intelligence and machine learning methods, such as neural networks and fuzzy systems, usually require complete datasets to work properly. Real-world datasets may contain missing values due to, e.g., malfunctioning of sensors or data transfer problems. In online environments, the properties of the data may change over time so that offline model training based on multiple passes over data is prohibited due to its inherent time and memory constraints. This study proposes a method for incremental missing data imputation using a modified granular evolving fuzzy model, namely evolving Fuzzy Granular Predictor (eFGP). eFGP is equipped with an incremental learning algorithm that simultaneously impute missing data and adapt model parameters and structure. eFGP is able to handle single and multiple missing values on data samples by developing reduced-term consequent polynomials and relying on information of time-varying granules. The method is evaluated in prediction and function approximation problems considering the constraints of online data stream. Particularly, the underlying data streams may be subject to missing at random (MAR) and missing completely at random (MCAR) types of missing values. Predictions given by the model evolved after data imputation are compared to those provided by state-of-the-art fuzzy and neuro-fuzzy evolving modeling methods in the sense of accuracy. Results and statistical comparisons with other approaches corroborate to conclude that eFGP is competitive as a general evolving intelligent method and overcomes its counterparts in MAR and MCAR scenarios according to an ANOVA-Tukey statistical hypothesis test.
URI: http://repositorio.ufla.br/jspui/handle/1/30140
Appears in Collections:Engenharia de Sistemas e automação (Dissertações)



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.