Academia.eduAcademia.edu

Assessing Performances of Learning Algorithms when Trained using Incomplete Data

2006

Abstract

A common practice in pre-processing step of hydrological modeling is to ignore observations with any missing variable values at any given time step even if it is only one of the independent variable that is missing. These rows of data are labeled incomplete and would not be used in either model building or subsequent testing and verification steps. This is not necessarily the best way of doing it as information is lost when incomplete rows of data are thrown out. Learning algorithms are affected by such problems more than physically-based models as they rely heavily on the data to learn the underlying input/output relationships. In this study, the extent of damage to the performance of the learning algorithm due to missing data is explored in a field-scale application. We have tested and compared the performance of two well- known learning algorithms, namely Artificial Neural Networks (ANNs) and Support Vector Machines (SVMs) for short-term prediction of groundwater levels in a well...