---
title: "The Perceptron: The First Neural Network"
sidebar_label: Perceptron
description: "Understanding the building block of Deep Learning: Weights, Bias, and Step Functions."
tags:
  - deep-learning
  - neural-networks
  - perceptron
  - artificial-intelligence
---
The Perceptron is the simplest form of a neural network, introduced by Frank Rosenblatt in 1958. It is a mathematical model inspired by the biological neuron, designed to perform binary classification (predicting whether an input belongs to one of two categories).

## 1. Biological Inspiration

Just as a biological neuron receives signals through dendrites and "fires" an impulse through the axon once a threshold is reached, the artificial perceptron sums up its inputs and triggers an output based on a threshold.

## 2. Anatomy of a Perceptron

A perceptron consists of four main parts:

  1. Input Values ($x_1, x_2, ... x_n$): The features of your data.
  2. Weights ($w_1, w_2, ... w_n$): Values that represent the "strength" or importance of each input.
  3. Bias ($b$): An additional parameter that allows the model to shift the activation function left or right.
  4. Activation Function: A rule (usually a Step Function) that decides the final output.
```mermaid
graph LR
    %% Inputs
    X1["$$x_1$$"] -->|"$$w_1$$"| SUM
    X2["$$x_2$$"] -->|"$$w_2$$"| SUM
    XN["$$x_n$$"] -->|"$$w_n$$"| SUM

    %% Bias
    BIAS["$$b$$"] --> SUM

    %% Summation
    SUM["$$z = \sum_{i=1}^{n} w_i x_i + b$$"]

    %% Activation
    SUM --> ACT["$$\text{Activation Function}$$"]
    ACT --> OUT["$$\hat{y} \in \{0,1\}$$"]

    %% Labels
    ACT -.-> STEP["$$\text{Step Function}$$"]
```


## 3. The Mathematics of "Firing"

The perceptron calculates a weighted sum of the inputs and adds a bias. This result is passed through a Step Function.

**The Weighted Sum ($z$):**

$$ z = \sum_{i=1}^{n} w_i x_i + b $$

**The Activation (Heaviside Step Function):**

The output is determined by whether $z$ is non-negative:

$$ y = \begin{cases} 1 & \text{if } z \geq 0 \\ 0 & \text{if } z < 0 \end{cases} $$
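To make the firing rule concrete, here is a small worked example. The input, weight, and bias values are illustrative, chosen only for the arithmetic:

```python
# Illustrative values (not from the article): x = [1.0, 0.5], w = [0.4, -0.2], b = 0.1
x = [1.0, 0.5]
w = [0.4, -0.2]
b = 0.1

# Weighted sum: z = w1*x1 + w2*x2 + b = 0.4 + (-0.1) + 0.1 ≈ 0.4
z = sum(wi * xi for wi, xi in zip(w, x)) + b

# Heaviside step: the neuron "fires" (outputs 1) when z >= 0
y = 1 if z >= 0 else 0
print(z, y)  # z ≈ 0.4, so y = 1
```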

## 4. How the Perceptron Learns

The learning process is an iterative adjustment of weights and bias. If the model makes a mistake, the weights are updated using the Perceptron Learning Rule:

$$ w_{new} = w_{old} + \eta(y_{actual} - y_{predicted})x_i $$

Where:

  • $\eta$ (Learning Rate): A small value (e.g., 0.01) that controls how drastically we update the weights.
  • $y_{actual}$: The true label (0 or 1).
  • $y_{predicted}$: The output from the perceptron.

The bias follows the same rule with $x_i$ replaced by 1: $b_{new} = b_{old} + \eta(y_{actual} - y_{predicted})$.
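The update rule above can be sketched as a short training loop. This is a minimal sketch, using the AND gate as training data and a learning rate of 1.0 with 10 epochs (both illustrative choices, not prescribed by the rule itself):

```python
import numpy as np

def train_perceptron(X, y, eta=1.0, epochs=10):
    """Perceptron Learning Rule: w += eta * (y_actual - y_predicted) * x."""
    weights = np.zeros(X.shape[1])
    bias = 0.0
    for _ in range(epochs):
        for xi, target in zip(X, y):
            predicted = 1 if np.dot(xi, weights) + bias >= 0 else 0
            error = target - predicted       # 0 if correct, +1 or -1 if wrong
            weights += eta * error * xi      # update each weight
            bias += eta * error              # bias updated with x_i replaced by 1
    return weights, bias

# AND gate training data
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 0, 0, 1])

w, b = train_perceptron(X, y)
print([1 if np.dot(xi, w) + b >= 0 else 0 for xi in X])  # [0, 0, 0, 1]
```

Because AND is linearly separable, the loop settles on weights and a bias that classify all four inputs correctly.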

## 5. The Critical Limitation: Linearity

The Perceptron can only solve problems that are Linearly Separable. This means it can only learn to classify data that can be separated by a straight line (or, in higher dimensions, a hyperplane).

### The XOR Problem

In 1969, Minsky and Papert proved that a single-layer perceptron could not solve the XOR logic gate because the points cannot be separated by a single straight line. This discovery led to the first "AI Winter," which only ended when researchers began stacking perceptrons to create Multi-Layer Perceptrons (MLP).
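A quick experiment makes this tangible: however long we train a single perceptron on the XOR truth table, at least one of the four points stays misclassified. This is a minimal sketch (learning rate and epoch count are arbitrary):

```python
import numpy as np

# XOR truth table: not linearly separable
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 1, 1, 0])

# Train with the Perceptron Learning Rule
w, b = np.zeros(2), 0.0
for _ in range(1000):
    for xi, target in zip(X, y):
        pred = 1 if np.dot(xi, w) + b >= 0 else 0
        w += 1.0 * (target - pred) * xi
        b += 1.0 * (target - pred)

preds = [1 if np.dot(xi, w) + b >= 0 else 0 for xi in X]
correct = sum(p == t for p, t in zip(preds, y))
print(f"{correct}/4 correct")  # never reaches 4/4: no single line separates XOR
```

Since no choice of $w$ and $b$ can classify all four XOR points, the weights simply oscillate instead of converging.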

## 6. Implementation with NumPy

```python
import numpy as np

def step_function(z):
    return 1 if z >= 0 else 0

def perceptron(inputs, weights, bias):
    # Calculate z = (w1*x1 + w2*x2 + ... + wn*xn) + b
    z = np.dot(inputs, weights) + bias
    return step_function(z)

# Example: AND Gate logic
# Inputs: [0, 0], [0, 1], [1, 0], [1, 1]
weights = np.array([1, 1])
bias = -1.5

print(perceptron([1, 1], weights, bias))  # Output: 1 (True)
print(perceptron([0, 1], weights, bias))  # Output: 0 (False)
```

One neuron can't solve complex problems. But what happens when we connect thousands of them in layers?