A latent information function to extend domain attributes to improve the accuracy of small-data-set forecasting
Introduction
The manufacturing environment is now changing rapidly, cutting product life cycles and causing increasingly intense competition among enterprises. How to control a manufacturing system effectively and efficiently is thus very important for manufacturing firms, especially in the early stages of such systems [1]. This is because businesses can raise their competitive ability if managers can quickly discover problems in manufacturing processes and take appropriate actions [2]. For this reason, suitable forecasting techniques are needed to boost managerial efficiency.
However, few observations are usually available in the early stages of manufacturing systems, so it is difficult to find robust results using prediction methods that depend on large data sets, such as multivariate analysis, time series models, and data mining techniques [3]. The purpose of this research is thus to establish a forecasting model based on small data sets to help engineers and decision makers to make better predictions under non-deterministic conditions.
Academically, the related uncertainty problems can be divided into roughly three categories: stochastic phenomena [4], cognitive uncertainties [5], and insufficient information [6]. The problems arising in the early stages of a manufacturing system stem from insufficient information caused by a limited sample size, which cannot completely reflect the features of the population [7], [8]. To overcome this, virtual sample generation (VSG) techniques have been adopted to provide more stable learning and to build robust, precise models.
In the literature, prior knowledge obtained from a given small training set is used to create virtual samples that improve learning results [9], [10], [11]. Li et al. [12] developed a Functional Virtual Population to expand the domain of the system attributes and generate virtual samples for scheduling problems with small data sets. A bootstrap procedure was later proposed to enhance statistical inference in simulation experiments by generating bootstrap samples for training [13], [14]. Other VSG algorithms have emerged in recent years, based on the principle of information diffusion [15] derived from fuzzy theory, and the related approaches have been used in many fields, such as medicine, management, and manufacturing [16], [17], [18], [19], [20], [21].
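The bootstrap idea cited above can be sketched in a few lines: resampling a small training set with replacement yields many same-sized "virtual" sets whose spread reveals sampling variability that a single small set cannot. This is a minimal illustration, not the cited authors' implementation; the function name, the five pilot-run values, and the choice of 200 resamples are assumptions made for the example.

```python
import random

def bootstrap_samples(train, n_sets=100, seed=0):
    """Draw n_sets bootstrap resamples (with replacement) from a small training set."""
    rng = random.Random(seed)
    n = len(train)
    return [[train[rng.randrange(n)] for _ in range(n)] for _ in range(n_sets)]

small_set = [4.2, 4.8, 5.1, 4.5, 5.0]            # e.g. five pilot-run measurements
virtual_sets = bootstrap_samples(small_set, n_sets=200)

# Each resample has the same size as the original; the spread of their means
# estimates the sampling variability hidden inside the single small set.
means = [sum(s) / len(s) for s in virtual_sets]
```

Each virtual set can then be used as a training set in its own right, which is how bootstrap-based VSG stabilizes model learning under limited data.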
However, virtual sample generation is usually not applied directly to time series data. The developing trend of such data is closely tied to the order of observations, and it is difficult to maintain the appropriate relations among the generated virtual data, so the approach cannot effectively improve model learning. Fig. 1 provides a simple illustration of this issue. If we create virtual samples and fit a trend line to them, shown as the dotted line in the figure, a significant accumulated difference appears between the dotted line of the virtual samples and the solid line of the real data. To minimize this difference, the virtual samples must be located close to the real trend line, which weakens the effectiveness of the approach.
This study thus proposes a Latent Information (LI) function to analyze data characteristics and extract information to assist knowledge acquisition with small data sets. The approach can acquire extra information by analyzing the data features to extend the domain attributes. To verify the effectiveness of the proposed method, this study employs the Synthetic Control Chart Time Series (SCCTS) dataset from the Knowledge Discovery Database and the monthly average price of aluminum for cash buyers from the London Metal Exchange (LME) to implement the experimental analysis. The experimental results show that the LI function is an appropriate technique for small-sample learning, because it can improve forecasting accuracy.
The remainder of this paper is organized as follows. In Section 2, the concept of the LI function is introduced. In Section 3, a demonstration of the procedure and experimental results are given. Finally, the conclusions are presented in Section 4.
Section snippets
Methodology
The samples collected in the early stages of manufacturing systems are usually not adequate for effective model learning, and past research has found that increasing amounts of information gradually lead to more stable forecasting results. Correspondingly, the gathering of time series data can be treated as a successive and incremental collection procedure, in which the amount of incoming data rises and the information is gradually updated. Therefore, an LI function based on
Experimental studies
In this section, we employ one artificial data set and one real data set (the Synthetic Control Chart Time Series data set and the aluminum price data set, respectively) to demonstrate the use of the LI function. The detailed experimental process is described in the following sub-sections.
Conclusions and discussion
In order to control operating costs in an effective manner, enterprises require appropriate forecasting technology, especially in the early stages of manufacturing systems. However, during these early stages the sample sizes are restricted by considerations of cost and time, and thus only insufficient information can be used to acquire knowledge, and traditional forecasting methods often fail to produce useful results. Therefore, it is very important to develop better small-data-set learning
Acknowledgments
This research is partially supported by the National Science Council of Taiwan under grant NSC 101-22188-E-033-004-.
References (27)
- An improved grey-based approach for early manufacturing data forecasting, Comput. Ind. Eng. (2009)
- A neural network-based framework for the reconstruction of incomplete data sets, Neurocomputing (2010)
- Effective feature selection scheme using mutual information, Neurocomputing (2005)
- Adaptive nonparametric density estimation with missing observations, J. Stat. Plan. Inference (2013)
- Learning from hints in neural networks, J. Complexity (1990)
- Utilize bootstrap in small data set learning for pilot run modeling of manufacturing systems, Expert Syst. Appl. (2008)
- A diffusion-neural-network for learning from small samples, Int. J. Approx. Reason. (2004)
- Using virtual sample generation to build up management knowledge in the early manufacturing stages, Eur. J. Oper. Res. (2006)
- Utilization of virtual samples to facilitate cancer identification for DNA microarray data in the early stages of an investigation, Inform. Sci. (2009)
- Forecasting short-term electricity consumption using the adaptive grey-based approach: an Asian case, Omega-Int. J. Manage. Sci. (2012)
- Sales forecasting practices: results from a United States survey, Int. J. Forecast.
- A New Reliability Prediction Model in Manufacturing Systems, IEEE Trans. Reliab.
- Statistics: Theory and Methods
Cited by (24)
- A transferred hybrid surrogate model integrating Gaussian membership virtual sample generation for small sample prediction: Applications in metal tube bending, Engineering Applications of Artificial Intelligence (2024)
- Small-sample continual learning classification method with vaccine to update memory cells based on the artificial immune system, BioSystems (2022)
  Citation excerpt: "However, obtaining sufficient training samples is expensive and sometimes difficult to complete, such as aircraft failure data at runtime. The machine learning problem with few training sample data is called small sample learning problem (Raudys, 2006; Zhu et al., 2016; Chang et al., 2014a). In practice, the sample threshold of small sample problem is usually set at 30 (Cadini et al., 2019; Fukunada et al., 1990)."
- Dimensionality reduction for multi-criteria problems: An application to the decommissioning of oil and gas installations, Expert Systems with Applications (2020)
- A novel and effective nonlinear interpolation virtual sample generation method for enhancing energy prediction and analysis on small data problem: A case study of Ethylene industry, Energy (2018)
  Citation excerpt: "In other words, if one wants to build an accurate and reliable data-driven model, sufficient data and a good distribution assumption are two necessary conditions [1]. In the references of [2] and [3], small data problems refer to the case where the number of samples is less than 50 concerning engineering applications or less than 30 regarding academic researches. The whole features of a population are hard to completely be revealed by the small data because of the insufficient information [4]."
- A PSO based virtual sample generation method for small sample sets: Applications to regression datasets, Engineering Applications of Artificial Intelligence (2017)
  Citation excerpt: "Such algorithms are data-dependent, i.e., sufficient data and a good distribution assumption are the two necessary conditions to ensure a more accurate model in the applications of classification and regression. Small sample sets problems (Zhu et al., 2016; Chang et al., 2014a, 2015; Li et al., 2012, 2014; Li and Lin, 2013) refer to the case of small amount of samples, where the number of samples is less than 50 in respect to engineering applications or less than 30 in regard to academic researches. Such small number of sample sets cannot completely reveal the whole features of a population due to the insufficient information (Li and Fang, 2009)."
Che-Jung Chang received his PhD degree in management science from National Cheng Kung University, Taiwan in 2011. He is currently an assistant professor in the Department of Business Administration at Chung Yuan Christian University, Taiwan. His recent research interests include grey system theory, production management and small-data-set learning. His articles have appeared in Omega, Applied Mathematical Modelling, Computers & Industrial Engineering and Journal of Grey System.
Der-Chiang Li is a distinguished professor at the Department of Industrial and Information Management, National Cheng Kung University, Taiwan. He received his PhD degree at the Department of Industrial Engineering at Lamar University, Beaumont, Texas, USA, in 1985. As a research professor, his current interests focus on learning with small data sets. His articles have appeared in Decision Support Systems, Information Sciences, European Journal of Operational Research, Computer & Operations Research, International Journal of Production Research, and other publications.
Wen-Li Dai is an associate professor at the Department of Information Management at Tainan University of Technology. He received his PhD degree from National Cheng Kung University. His primary research interests focus on the information system management, supply chain management and operations research. His research has been published in International Journal of Production Research, Expert Systems with Applications, and Web Journal of Chinese Management Review.
Chien-Chih Chen received his PhD degree from the Department of Industrial and Information Management at National Cheng Kung University, Taiwan in 2011. His current research interests are in the area of forecasting and data mining. His research has been published in International Journal of Production Research and Expert Systems with Applications.