File size: 7,161 Bytes
d459c0d |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 |
A collection or series of simple Python Scripts to train and test MLP Neural Nets, with various synthetic data (equations, such as y = np.sin(np.pi * X1) + np.square(X2)).
Model Hyperparamters are changeable (except the number of inputs)
Some versions use TORCH.NN.
Some Versions use ONLY-NUMPY for easier modification.
Some scripts save/load checkpoint files.
The loss is printed periodically to the CMD console.
All version generally use graphics at end of train and test to display the tranined model's performance.
Python Scripts for Training and Inference/Test of a MLP (using TORCH.NN library features)
y = np.sin(np.pi * X1) + np.square(X2) # Example target function. Hardcoded but equation y= can be user-modified in the script.
Or, you can comment out the second part and use sin() only: y = np.sin(np.pi * X1) #+ np.square(X2) # Example target function
The provided code indeed implements a standard feedforward neural network layer, commonly known as a perceptron layer, rather than a Knot Algebra Network (MLP) layer. Let's clarify the difference and discuss how a MLP layer might be implemented:
Perceptron Layer:
The code defines a single or multi layer in a feedforward neural network. Each hidden layer consists of a linear transformation followed by a non-linear activation function (ReLU in this case).
This type of layer is often used in traditional neural network architectures for tasks such as classification and regression.
Sometimes the loss falls to
Epoch [1000/1000], Loss: 0.0006
or
Epoch [1000/1000], Loss: 0.0009
or
Epoch [1000/1000], Loss: 0.0021
But other times
Epoch [1000/1000], Loss: 0.0614 (looks like a horizontal spread).
Having fixed the input train data for evaluation, I infer that the randomness of the model results seems to be caused by the random weights and bias at initialization?
Explanation
MLP Layer Definition:
A custom neural network layer MLPLayer is created, which includes a linear transformation followed by an activation function (ReLU in this example).
The HYPERPARAMETERS are user-specified in the MAIN function such as:
input_dim = 2 (must be 2 in this version? because of how the train data and text data is stored, but you can still comment out the second variable part of the y function and use sin(X1) only: y = np.sin(np.pi * X1) #+ np.square(X2) # Example target function)
output_dim = 1
hidden_layers = [5, 5] # Example hidden layers
X_train, y_train = generate_train_data(200)
X_test, y_test = generate_test_data(100)
MLP Model Definition:
The MLP class defines the network architecture with a sequence of MLP_Layer objects. The final layer is a linear layer to produce the output.
Data Preparation:
The generate_data function creates example input data and corresponding target values based on a specified function.
Training Function:
The train_MLP function trains the MLP model using mean squared error loss and the Adam optimizer. Training runs for a specified number of epochs with periodic loss reporting.
Inference Function:
The inference_MLP function generates predictions using the trained MLP model on new test data.
Main Script:
This part initializes the model, criterion, optimizer, and data. It then trains the model and performs inference, finally plotting the true vs. predicted values for visualization.
This script encapsulates the basic workflow for using MLPs in a machine learning context. It can be adapted to different datasets and network configurations depending on the specific requirements of the task.
A custom neural network layer MLP_Layer is created, which includes a linear transformation followed by an activation function (ReLU in this example).
ChatGPT
The lines that define the custom neural network layer MLPLayer, which includes a linear transformation followed by an activation function (ReLU in this example), are:
class MLPLayer(nn.Module):
Line defining the class: class MLPLayer(nn.Module):
Initialization method: def __init__(self, in_features, out_features):
Calling the superclass constructor: super(MLPLayer, self).__init__()
Defining the linear layer: self.linear = nn.Linear(in_features, out_features)
Defining the activation function: self.activation = nn.ReLU()
Forward method: def forward(self, x):
Applying the linear transformation and activation in the forward method: return self.activation(self.linear(x))
# Data preparation (example data) function encapsulates the process of generating synthetic example data for training and testing the MLP model, providing input-output pairs that can be used to train and evaluate the network's performance.
def generate_data(n_samples):
print("Data preparation (example data)")
X = np.random.rand(n_samples, 2) # Example: 2 input features #n_samples samples are generated, each with 2 input features. These features are randomly distributed between 0 and 1.
y = np.sin(np.pi * X[:, 0]) + np.square(X[:, 1]) # Example target function # target values are calculated using a specific function. In this case, it's a combination of the sine of the first input feature multiplied by π and the square of the second input feature. This function is an example and can be replaced with any desired function depending on the problem domain.
return torch.tensor(X, dtype=torch.float32), torch.tensor(y, dtype=torch.float32) #This line converts the generated NumPy arrays X and y into PyTorch tensors. PyTorch tensors are the preferred data type for working with neural networks in PyTorch. Both X and y are converted to torch.float32 data type for consistency.
The train data and test data will be saved in the text files X1.txt and X2.txt
Separating the input data, saving it to file for later re-use (comment out the un-used code lines):
def generate_data(n_samples):
# Generate random data
X1 = np.random.rand(n_samples, 1) # Example: 1st input feature
X2 = np.random.rand(n_samples, 1) # Example: 2nd input feature
# Save data to CSV files with commas as delimiters
np.savetxt('X1.txt', X1, delimiter=',')
np.savetxt('X2.txt', X2, delimiter=',')
# Read data back into variables X1 and X2
X1 = np.loadtxt('X1.txt', delimiter=',').reshape(-1, 1)
X2 = np.loadtxt('X2.txt', delimiter=',').reshape(-1, 1)
# Print loaded data for verification
print("Loaded X1:", X1)
print("Loaded X2:", X2)
# Generate y based on X1 and X2
y = np.sin(np.pi * X1) + np.square(X2) # Example target function
return torch.tensor(np.concatenate((X1, X2), axis=1), dtype=torch.float32), torch.tensor(y, dtype=torch.float32)
The train data and test data will be saved in the text files X1.txt and X2.txt in follwing format:
# Read good random test data into variables X1 and X2
loaded 100 test data X1: [[0.61322319]
[0.11602832]
[0.48507701]
[0.92841799]
...
# Read good random test data into variables X1 and X2
loaded 100 test data X1: [[0.61322319]
[0.11602832]
[0.48507701]
[0.92841799]
...
# Read good random test data into variables X1 and X2
loaded 100 test data X1: [[0.61322319]
[0.11602832]
[0.48507701]
[0.92841799]
...
---
license: apache-2.0
---
|