GRNN stands for Generalized Regression Neural Network.
1. What does GRNN do?
2. How does it work?
3. Network Architecture
4. Main Principle
5. Training Procedure
6. Advantages of GRNN
7. Example
1. What does GRNN do?
GRNN is basically a neural-network-based function approximation (regression) algorithm: given an input sample, it predicts the corresponding output.
2. How does it work?
As with any neural network, GRNN needs training data, and that data must contain input-output pairs. Once the network has been given the training set, feeding it a new test sample produces a predicted output.
In GRNN, the output is estimated as a weighted average of the training outputs, where each weight is computed from the Euclidean distance between the training sample and the test sample. If the distance is large, the weight is very small; if the distance is small, that training sample's output gets more weight.
3. Network Architecture
The network architecture contains four basic layers: the input layer, the pattern layer, the summation layer, and the output layer.
Input layer:
The input layer simply passes the input to the next layer.
Pattern layer:
The pattern layer calculates the Euclidean distance between the test sample and each training sample, and applies the activation function to it.
Summation layer:
The summation layer has two subparts: a numerator part and a denominator part. The numerator is the sum of each training output multiplied by its activation value; the denominator is the sum of all activation values. This layer feeds both the numerator and the denominator to the output layer.
Output layer:
The output layer contains a single neuron, which calculates the output by dividing the numerator part of the summation layer by the denominator part.
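The four layers map one-to-one onto code. Below is a hedged sketch assuming training samples stored as rows of a NumPy array; the name `grnn_forward` is illustrative, not from the article:

```python
import numpy as np

def grnn_forward(x, X_train, Y_train, sigma=1.0):
    # Input layer: the test sample x is passed through unchanged.
    # Pattern layer: squared Euclidean distance to every training sample,
    # followed by the activation exp(-d^2 / (2 * sigma^2)).
    d2 = np.sum((X_train - x) ** 2, axis=1)
    a = np.exp(-d2 / (2 * sigma ** 2))
    # Summation layer: numerator sum(Y_i * a_i) and denominator sum(a_i).
    numerator = np.dot(Y_train, a)
    denominator = np.sum(a)
    # Output layer: one neuron dividing numerator by denominator.
    return numerator / denominator
```

Each comment marks the layer it implements, so the code can be read against the architecture description above.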
4. Main Principle
GRNN stands on the equation below:

Y(x) = Σi Yi · exp(-di² / (2σ²)) / Σi exp(-di² / (2σ²))

where di² = (x - xi)ᵀ(x - xi).
Here x is the input (test) sample, xi is the i-th training sample, and Yi is the output associated with xi. di² is the squared Euclidean distance between x and xi, and exp(-di²/(2σ²)) is the activation function, which theoretically acts as the weight of that training sample.
Looking closely, the value of di² signifies how much a training sample contributes to the output for that particular test sample. If di² is small, the sample contributes more to the output; if it is large, the sample contributes very little. The term exp(-di²/(2σ²)) decides exactly how much weight the training sample gets:
If di² is small, exp(-di²/(2σ²)) returns a relatively large value.
If di² is large, exp(-di²/(2σ²)) returns a relatively small value.
If di² is zero, exp(-di²/(2σ²)) returns one, which means the test sample coincides with the training sample, and the output of the test sample is that training sample's output.
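These three cases can be checked numerically. A quick sanity check with σ = 1 (the helper name `activation` is my own):

```python
import math

def activation(d2, sigma=1.0):
    """The Gaussian weight exp(-d^2 / (2 * sigma^2))."""
    return math.exp(-d2 / (2 * sigma ** 2))

print(activation(0.0))  # 1.0: test sample coincides with a training sample
print(activation(1.0))  # ~0.607: small distance, large weight
print(activation(9.0))  # ~0.011: large distance, tiny weight
```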
The only unknown parameter here is the spread constant σ, which can be tuned during the training process to an optimum value where the error is minimal.
5. Training Procedure
The training procedure is to find the optimum value of σ. Best practice is to find the value where the MSE (mean squared error) is minimum.
First divide the available samples into two parts: a training set and a validation set. Apply GRNN to the validation data based on the training data and compute the MSE for different values of σ. Then pick the σ with the minimum MSE.
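The split-and-sweep procedure might look like this in Python. This is a sketch for 1-D data; the candidate grid and the helper names (`grnn_predict`, `tune_sigma`) are illustrative assumptions:

```python
import math

def grnn_predict(x_new, X_train, Y_train, sigma):
    """GRNN prediction: Gaussian-weighted average of the training outputs."""
    w = [math.exp(-((x_new - xi) ** 2) / (2 * sigma ** 2)) for xi in X_train]
    return sum(wi * yi for wi, yi in zip(w, Y_train)) / sum(w)

def tune_sigma(X_train, Y_train, X_val, Y_val, candidates):
    """Return the sigma from `candidates` with the lowest MSE on the held-out split."""
    def mse(sigma):
        errs = [(grnn_predict(x, X_train, Y_train, sigma) - y) ** 2
                for x, y in zip(X_val, Y_val)]
        return sum(errs) / len(errs)
    return min(candidates, key=mse)
```

In practice the candidate σ values are just a coarse grid; a finer grid (or cross-validation over several splits) refines the choice.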
6. Advantages of GRNN
1) The main advantage of GRNN is fast training: the network is built directly from the training data, with no iterative weight updates.
2) The network learns from the training data in a single pass ("1-pass" training), in a fraction of the time it takes to train standard feed-forward networks.
3) The spread, sigma (σ), is the only free parameter in the network, and it can often be identified by V-fold or split-sample cross-validation.
4) Unlike standard feed-forward networks, GRNN estimation always converges to a global solution and will not be trapped in a local minimum.
7. Example
input output
2 3
4 5
6 7
8 9
What will be the output for input 5?
Step 1
Calculate the squared distances: d1² = (5-2)² = 9, d2² = (5-4)² = 1, d3² = (5-6)² = 1, d4² = (5-8)² = 9.
Step 2
Calculate the weights using the activation function exp(-di²/(2σ²)). Let's say σ = 1. The weights are then (rounded):
w1 = exp(-9/2) ≈ 0.01
w2 = exp(-1/2) ≈ 0.6
w3 = exp(-1/2) ≈ 0.6
w4 = exp(-9/2) ≈ 0.01
Step 3
Sum of the weights: W = w1 + w2 + w3 + w4 = 1.22, so the denominator is 1.22.
The numerator is YW = w1·y1 + w2·y2 + w3·y3 + w4·y4
= 0.01·3 + 0.6·5 + 0.6·7 + 0.01·9
= 7.32
Step 4
So the output is numerator/denominator:
output = YW/W = 7.32/1.22 = 6.
So the predicted output is 6.
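The four steps above can be replayed in Python. Same numbers as the hand calculation, except that the weights are kept at full precision here, so the intermediate sums differ slightly from the rounded hand values while the final answer agrees:

```python
import math

# Worked example: training pairs (2,3), (4,5), (6,7), (8,9),
# test input x = 5, spread sigma = 1.
X, Y = [2, 4, 6, 8], [3, 5, 7, 9]
x, sigma = 5, 1.0

# Steps 1-2: squared distances and Gaussian weights.
weights = [math.exp(-((x - xi) ** 2) / (2 * sigma ** 2)) for xi in X]
# Step 3: numerator (weighted outputs) and denominator (sum of weights).
numerator = sum(w * y for w, y in zip(weights, Y))
denominator = sum(weights)
# Step 4: divide.
print(round(numerator / denominator, 6))  # 6.0
```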