Journal Search Engine
Search Advanced Search Adode Reader(link)
Download PDF Export Citaion korean bibliography PMC previewer
ISSN : 2005-0461(Print)
ISSN : 2287-7975(Online)
Journal of Society of Korea Industrial and Systems Engineering Vol.36 No.4 pp.59-63
DOI : https://doi.org/10.11627/jkise.2013.36.4.59

RBF 뉴럴네트워크를 사용한 바이오매스 에너지문제의 계량적 분석

백승현, 황승준
한양대학교 에리카 경영학부

Quantitative Analysis for Biomass Energy Problem Using a Radial Basis Function Neural Network

Seung-June Hwang, Seung Hyun Baek
Division of Business Administration, Hanyang University ERICA
Corresponding Author sjh@hanyang.ac.kr
Received 4 November 2013; Accepted 5 December 2013

Abstract

In biomass gasification, efficiency of energy quantification is a difficult part without finishing the process. In this article,a radial basis function neural network (RBFN) is proposed to predict biomass efficiency before gasification. RBFN will be comparedwith a principal component regression (PCR) and a multilayer perceptron neural network (MLPN). Due to the high dimensionalityof data, principal component transform is first used in PCR and afterwards, ordinary regression is applied to selected principalcomponents for modeling. Multilayer perceptron neural network (MLPN) is also used without any preprocessing. For this research,3 wood samples and 3 other feedstock are used and they are near infrared (NIR) spectrum data with high-dimensionality. Ashand char are used as response variables. The comparison results of two responses will be shown.

0021-01-0036-0004-8.pdf368.7KB

1. Introduction

The use of biomass as an energy feedstock is emerging as a potentially viable alternative to address U.S. energy security concerns, foreign oil dependence, rural economic development, and diminishing sources of conventional energy. Biomass (organic matter that can be converted into energy) may include food crops, crops for energy (e.g., switchgrass or prairie perennials), crop residues, wood waste and byproducts, and animal manure [4]. There are three reasons why the biomass is increasingly demanding. They are conventional fossil energy makes pollutions, excessive agricultural products and rural development. For the biomass gasification process, quantitative analysis is done in this article to measure efficiency of biomass product. To measure biomass efficiency, near infrared (NIR) spectrum is used. 

Since NIR is a rapid, cost-effective, non-destructive, and efficient analytical method. The interest in NIR spectrospecopy lies in its advantages over alternative instrumental techniques. Thus, it can record spectra for solid and liquid samples with no pretreatment, implement continuous methodologies, provide spectra quickly and predict physical and chemical parameters from a single spectrum. These attributes make it especially attractive for straightforward, speedy characterization of samples [3]. The spectral data have remarkable characteristics, which make necessary their treatment by specific methods. The data maybe comprised of more variables than observations [2].  

For the methodology viewpoint, spectral data needs data transformation methods to handle high dimensionality. There are various data transformation techniques such as singular values decomposition (SVD), principal component analysis (PCA), independent component analysis (ICA), wavelet transform (WT), and Fourier transform (FT).  

After transformation, the data mining approaches are applied to the transformed data for modeling. In some cases, the spectral data and the target property are not linearly related as a result of instrumental factors or the physico-chemical nature of the sample. These cases can be addressed by using non-linear calibration methods, particularly artificial neural network [3]. Neural networks are not the only tools to handle non-linear multivariate data. However, their flexibility is often a decisive asset compared with parametric techniques that require the assumption of a specific hard model form. Hard models cannot be developed with NIR data due to the significant overlap of combination and overtone bands in the spectra [5].  

In section 2, the methodologies are presented. Experiments and results are described in section 3. This article is concluded in section 4.  

2. Methodology

2.1 Principal Component Regression

The principal component regression (PCR) methodology [8] consists of two steps. The first step is principal component analysis (PCA) [6]. The second step is a regression model constructed by minimizing sum of square error between predicted and original response. 

The principal component analysis (PCA) will be used to reduce the dimension without losing any important information. The PCA will use the maximum variance technique to choose better principal component, and normalize each principal component and orthogonality of each principal component. Using the PCA, the collinearity will be removed.  

The PCA term is T = XP where T is a score matrix (m × k), X is an original matrix (m × n), and P is a loading matrix (n × k). 

For example, m = 3, n = 3 and k = 2 then,  

 

The procedure of PCR is shown in <Figure 1>. The strength of PCR is not only the transformation of lower dimensional subspace but also the prevention for over-fitting the data since possible regression function is restricted. 

<Figure 1> Principal Component Regression Model

2.2 Multilayer Perceptron Neural Network (MLPN)

There are three layers of neurons in MLPN process [1], which are input, hidden and output layers. The inputs pass through the input layer to hidden nodes. The hidden layer multiplies the inputs by weights, sums them, and operates on the sum with a function such as a sigmoid or tangent sigmoid one. The result is passed to each output nodes when there is a hidden layer. The output layer multiplies the inputs by weights, sums them and operates on the sum with a function such as a linear one. The result is the output. The hidden layer and output layer can have either linear or non-linear function. The activation function is needed to be continuous, monotonically increasing, and differentiable. The activation function signals propagate forward and error signals propagate backward through the network. <Figure 2> shows architecture of typical three-layer feed-forward multilayer perceptron network.  

<Figure 2> Multilayer Feed-Forward Neural Networks

2.3 Radial Basis Function Neural Network (RBFN)

The radial basis function neural network [7] has two layers, which are hidden and output layer. The activation function of hidden layer is the radial basis function called the Gaussian activation function, which is a non-linear and local mapping layer. The activation function of output layer is the standard linear function and performs a linear transformation of the output of hidden nodes.  

The Gaussian Activation Function is as follows : 

 

where X is the input vector, μ is the center of a region called a receptive field, σi is the width of the receptive field and fi(X) is the output of the ith neuron. 

3. Experiments

3.1 Data Description

Three wood samples; red oak, yellow popular, hickory and three other biomass feedstock; switch grass, corn stover and bagasse are selected and used for this experiment. Wood samples are collected from different trees. Switch grass are collected in different location as same as corn stover. There are 54 samples and 375 variables (wavelengths). In <Figure 3>, the NIR spectrum is shown. The samples are divided into training (27 odd number samples) and test sets (27 even number samples).  

<Figure 3> NIR Spectra with Ash and Char Contents

Ash and char contents are used as response variables. Ash content measurement was conducted based on the method (similar to ASTM standard method, E1755-01) recommended by National Resources and Energy Laboratory (NREL). Simply described, appropriately marked crucibles were placed in the muffle furnace at 575 ± 25℃ for four hours. Then they were taken out them directly into a desiccator, cooled down to room. 

Temperature weighed to the nearest 0.1mg. Again, the crucibles were placed back in the muffle furnace at 575 ± 25℃ and dried to constant weight. Constant weight is defined as a less than ± 0.3mg change in the weight upon one hour of re-heating the crucibles.  

Oven-dried biomass samples at 105℃ weighed about 1.5g and placed into the tared crucible. Then an ashing burner burned the samples until no more smoke or flame appeared. The burned samples were placed in the muffle furnace for 24 ± 6hours. Then the crucibles were removed from furnace directly into a desiccator and weighed to the nearest 0.1mg. Again, the samples were placed back to the furnace, ashed to constant weight, and cooled down to room temperature in desiccator.  

Char content were measured by using thermal gravity analysis (TGA) (Perkin-Elmer Pyris 1 TGA). Samples of 6-7 mg were first heated from 50℃ to 105℃ at a rate of 25℃ min-1 and kept at 105℃ for 10 min to remove the moisture. Then samples were heated again to 750℃ at the same heating rate under nitrogen atmosphere (flow rate of 20 mL min-1). Char content was calculated from the final weight loss at 750℃ by using the initial weight of sample without moisture content. <Figure 4> displayed an example of biomass gasification. As shown in <Figure 4>, the byproducts of a gasification process are ash and char. When the efficiency of the by-products is known before gasification, there are many benefits such as cost. 

<Figure 4> Example of Biomass Gasification [9]

3.2 Experiment Results

For the experiments, a training set used for a modeling and a test set used as the validating without cross-validation. Mean squared errors are obtained with a test set.  

With over 1% variance, principal components were collected and transformed for principal component regression. In this experiment, 9 principal components was collected and used for prediction.  

For the multilayer perceptron neural network, it needs exhaustive computational time. Input variables were transformed with principal component transform. After that, the transformed input data underwent feed-forward back-propagation with the multilayer perceptron neural network. As same as principal component regression, 9 principal components were used for the multilayer perceptron neural network. For the hidden and output layer function, tangent sigmoid and linear functions were used. 2 responses were used as output and 4 hidden neurons were used in the multilayer perceptron neural network.

In radial basis function neural network, the network modeling was fast even though, the variables were larger than the samples. As seen in <Table 1>, 15 spread constants were selected and gave least mean squared error of test set for the Gaussian activation function. For the hidden layer, 22 neurons were used. 375 variables were used as input and 2 responses were used as output. 

<Table 1> Spread Constants Selection in Radial Basis Function Network

For the performance measure, mean squared error (MSE) was used. As shown in <Table 2>, the smallest mean squared error of ash and char contents for prediction of training data were found by RBFN. Also, for prediction of the test set, the smallest mean squared error for both ash and char contents were collected from RBFN. From the <Figure 5>, the graphical representations were provided for test set.  

<Table 2> Mean Squared Error

<Figure 5> Predicted vs. Observed Plots : (a) Ash with PCR (b) Char with PCR (c) Ash with MLPN (d) Char with MLPN (e) Ash with RBFN (f) Char with RBFN

4. Conclusion

RBFN was used to forecast energy efficiency of biomass and compared with the PCR and the MPNN. The conventional regression could not be used in the high-dimensional case due to the bigger size of variables compared to observations. The PCR was applied with a principal component transform in the first stage and ordinary regression was applied in the second stage. For the MPNN, the network was constructed after transform because of exhaustive computational problem. 9 principal components were used for the PCR and the MPNN. As shown in <Table 2>, the MPNN showed the better results than the PCR. Even if the MPNN used same number of principal components as the PCR, the performance showed better results.  

For biomass efficiency prediction, the best method is the RBFN which gave the smallest mean squared error for the prediction of test set. From the above results, it is easier to conclude that the RBFN can be suitable approach for energy- efficiency forecasting in biomass application areas with NIR spectrums. 

Reference

1.Araujo, P., Astray, G., Ferrerio-Lage, J.A., Mejuto, J.C., Rodriguez-Suarez, J.A. and Soto, B., Multilayer perceptron neural network for flow prediction. Journal of Environmental Monitoring, 2011, Vol. 13, No. 1, p 35-41.
2.Benoudjit, N., Cools, E., Meurens, M., and Verleysen, M., Chemometric calibration of infrared spectrometers : selection and validation of variables by non-linear models. Chemometrics and intelligent laboratory systems, 2004, Vol. 70, p 47-53.
3.Blanco, M. and Villarroya, I., NIR spectroscopy : a rapidresponse analytical too. Trends in analytical chemistry, 2002, Vol. 21, No. 4, p 240-250.
4.Bracmort, B.B., Comparison of definitions in legislation through the 112th congress. CRS Report for Congress R40529, Congressional Research Service, 2012.
5.Despagne, F. and Massart, D.L., Neural networks in multivariate calibration, Analyst 1998, Vol. 123, p 157-178
6.Dunia, R., Qin, S.J., Edgar, T.F., and Mcavoy, T.J., Sensor fault identification and reconstruction using principal component analysis, Proceedings of the IFAC World Congress 96, San Francisco, June 30-July 5, 1996, p 259-264.
7.Ghosh, J. and Arindam, N., An overview of radial basis function networks, in Radial Basis Function Networks 2. Physica-Verlag GmbH, 2001.
8.Jolliffe, I.T., Principal component analysis, M. Lovric, eds. International Encyclopedia of Statistical Science, Springer, 2011, p 1094-1096.
9.Smeenk, J., Biomass processes and technologies : biomass as a renewable energy resource for natural gas displacement, Wood-to-Energy : Iowa Wood By-Products Workshop, June 7, Ainsworth, 2006.