Models
Four model types are defined for Data analytics applications. They are:
- PCA Models – Principal Component Analysis (PCA) models are the statistical engine of the EED application.
- Fuzzy Models – Fuzzy Logic models are used as post processors to normalize and shape and tune the results of the PCA models.
- DA Models – This model type is really only a shell and is used to wrap the PCA and Fuzzy models.
- Neural Models – Neural network models are generic models that use a Stochastic Gradient Decent (SGD) solver. The models used here incorporate an activation function similar to RELU function but has a continuous derivative and no zero outputs and is sometimes referred to as a rectifier. Capability also exists for an SGD solution using a Long-Term-Short-Term (LSTM) formulation
In the preceding screen capture, the evolution of the scores as a function of time is plotted against the first two principal components. The confidence ellipse represents an area of acceptable behavior. Green points represent data from the training set while blue points represent fresh data never seen by the model. In addition scores may also be displayed in higher dimension spaces (hypercube) as shown below.
Predicting Events - In the event or indicator plot shown below, X-axis represents data indexes and Y-axis represents indicator value. The brown trend plot is the trend for the indicator values calculated from the PCA model for the selected data ranges for the fuzzy model. The user entered events are shown in green boxes and predicted events are shown in red boxes. The fuzzy lower limit, fuzzy upper limit and final fuzzy limit are shown in purple lines. The blue line is for the QLimit if the indicator type is QResidual or T2Limit if the indicator type is T2Residual. The light blue trend is for fuzzy output values by applying fuzzy member function to the indicator values. The fuzzy output values are between 0 and 1 and the fuzzy output trend has been normalized according to the original indicator values
Neural Network Models - These models can be used either alone or in conjunction with other models. An important feature of these models is the activation function. The activation function and its derivative are shown in the following figure.Predicting Events - In the event or indicator plot shown below, X-axis represents data indexes and Y-axis represents indicator value. The brown trend plot is the trend for the indicator values calculated from the PCA model for the selected data ranges for the fuzzy model. The user entered events are shown in green boxes and predicted events are shown in red boxes. The fuzzy lower limit, fuzzy upper limit and final fuzzy limit are shown in purple lines. The blue line is for the QLimit if the indicator type is QResidual or T2Limit if the indicator type is T2Residual. The light blue trend is for fuzzy output values by applying fuzzy member function to the indicator values. The fuzzy output values are between 0 and 1 and the fuzzy output trend has been normalized according to the original indicator values
The functions are completely defined in terms of the activation length (in this case 15) and the activation ratio (in this case .2). The neural net is defined in terms of six parameters; network dimensions, learning rate, number of epochs, number of mini-batches, activation ratio and the activation length. Application to the well known benchmark MNIST data is shown below.
MNIST Benchmark data
The setup of the neural network model for the MNIST data set is shown below. The data is composed of 60,000 elements. Of these elements 50,000 are used for training and 10,000 for test validation. Each element is defined in terms of a matrix of 28x28 pixels representing numbers 0 through 9.
As shown, the network is composed of an input layer of 784 neurons (to accommodate the 28x28 pixels of each number), an output layer of 10 neurons to represent the numbers o through 9, and two arbitrary hidden layers both of which use 150 neurons. The activation parameters correspond to the rectifier shown above. The number of mini-batches is 10 and the learning rate 3.0. With this very simple setup it is easy to obtain accuracies of greater than 98% for this data set. The solution at each epoch is shown in the following illustration. The first value represents the number correct predictions while the second is the total number of the validation data. The highest efficiency is stored as the final value.
A few selected predictions are given in the following figures. It can be seen that the hand written numbers can be rather difficult to identify correctly. The hand written numerals can vary dramatically.
The first figure is the numeral 3.
The next two figures are the numeral 4.
The next four figures are the all the numeral 5.
And finally, the next two figures are the numeral 8 followed by a figure of the numeral 9.
This set of data is available in the open literature and shows the ability to classification tasks at least on a toy set of data. In the following figure we can see that there are two classes of data represented by the two different colored circles
The next figure shows a contour map of the neural net classification. Here we can see that the classification missed only point
Another classification problem with three different different classes is shown below.
Finally, the contour map for this data shows a high quality classification






























