Homepage | Course content |
Feedforward neural networks
Recapituation of supervised learning systems studied thus far
-
We have studied how linear regression can help us predict continuous values via a linear relationship with a set of features.
-
We have also studied how we can use logistic regression and softmax to classify datapoints into different discrete categories.
- The equations for these systems are:
- Linear regression
- Logistic regression
- Softmax
- If you notice, we have
- used and to transform the input data .
- the result of this transformation gets further processed by a function to get the output .
- However, we can apply more transformations to the input data before reaching the final output.
Neural network with one hidden layer
-
Instead of computing directly, we can use input data to compute a “hidden state” .
-
The function that gives the hidden state is non-linear and arbitrary (although some non-linearities are more commonly used).
- If we used this hidden state with the systems we have studied so far in this class, their equations would be:
- Linear regression neural network
- Logistic regression neural network
- Softmax neural network
- These equations represent three different neural networks, each of which could be used for different tasks and applications.
The choice of non-linearity
-
Today, the most commonly used non-linearity is ReLU (Rectified Linear Unit) or some of its variants.
-
Other useful non-linearities include sigmoid and the hyperbolic tangent (tanh).
-
Advantages of ReLU over other non-linearities include:
- the fact that it does not “saturate”
- its computation does not involve an exponentiation and instead is a “thresholding” operator
- it results in the network learning a sparse representation
Feedforward neural networks
-
In the previous sections we talked about a neural network with one hidden layer.
-
However, we can have more hidden layers. In fact, you can have as many as necessary (how do you know how many are necessary?).
-
The resulting compuational architecture is known as a feedforward neural network.
Feedforward neural network
© Iran R. Roman & Camille Noufi 2022