PyDyNet：NumPy-based Dynamic Deep Learning Framework

PyDyNet已被多个技术公众号和社区分享：居然用Numpy实现了一个深度学习框架.

Towards Large Language Model

2025.8.12: 实现了纯推理的llama3 (6-layer Transformer, vocab-size=32000). 参考了这里的NumPy实现和数据集. 将数据集下载到llama文件夹即可运行:

>>> python -m llama.infer
There was a boy named Timmy. He loved to play with hi toy and run around outside. One day, Timmy' mom asked him to help her with the laundry. Timmy didn't want to help because he wanted to play. But hi mom said, "Timmy, you need to help me. It' important to help out."
Timmy didn't want to help, but he knew he had to. So, he put on hi shoe and went outside to help hi mom. A they were folding the clothe, Timmy saw a big pile of laundry on the floor. He wanted to help, so he started to pick it up. But then, he accidentally knocked over a pile of clothe and they fell on him. Timmy wa okay, but he felt bad.
Hi mom saw what happened and said, "Timmy, you need to be more careful. You could have hurt yourself." Timmy felt bad and said sorry. Hi mom hugged him and said, "It' okay, accident happen. Let' clean up the laundry together." Timmy learned that it' important to be careful and help out when you need it.

Token count: 262, elapsed: 0.87s, 300 tokens/s

Overview

PyDyNet也是纯NumPy(0.0.7版本后加入CuPy，其用法和NumPy一致)实现的神经网络，语法受PyTorch的启发，大致结构如下：

graph LR
   N(numpy/cupy.ndarray)--Backend--> A(Tensor) --> ds(Dataset) ---> Data(DataLoader)---> Mission
   A  --Eager execution--> B(Basic operators:<br> add, exp, etc)
   B -.Autograd-.-> A

   B --> CO(Complex<br>operators)
   --> f(Function:<br>img2col, etc) 
   --> M(Basic Module:<br>Linear, etc)
   --> CM(Advanced Module: CNN, RNN, Transformer, etc)
   --> Mission(Learning task)
   A --> GD(Optimizer:<br> SGD, Adam, etc) ---> LS(lr_scheduler: <br>StepLR, etc)---> Mission

虚线表示用户可以通过no_grad来关闭自动微分功能.

Install

git clone https://github.com/Kaslanarian/PyDyNet
cd PyDyNet
python setup.py install

Example

examples/pydynet中是一些例子，examples/pytorch给出等价的pytorch实现. 运行python examples.pydynet.xxx即可:

AutoDiff

autodiff1d.py利用自动微分，对一个一维凸函数进行梯度下降：

以及一个多元凸函数的例子: autodiff2d.py

MLP & LeNet

mlp_cnn.py使用MLP和LeNet对MNIST进行分类. 训练准确率和测试准确率：

Dropout & BN

mlp_dropout_bn.py使用三种网络对fetch_olivetti_faces人脸(64×64)数据集进行分类并进行性能对比：

三层MLP;
三层MLP + Dropout;
三层MLP + BatchNormalization.

学习效果对比：

RNN

ts_prediction中是一个用GRU做时序预测例子:

Transformer

transformer.py中是一个用Transformer训练文本分类模型的例子. 训练结果:

数据集 (CoLA) 链接: https://nyu-mll.github.io/CoLA/cola_public_1.1.zip

cuda加速

在训练batch size为256, 测试batch size为1024情况下，模型在CPU和GPU上的训练速度比较:

Network structure	Dataset	CPU time (s) per epoch	GPU time (s) per epoch
3-layer MLP	MNIST (80000×574)	7.256±0.138	1.203±.0181
LeNet	MNIST (80000×574)	239.664±2.108	2.841±0.026
1-layer Transformer (dim=512, head=4)	CoLA (8551×45×64)	17.503±0.251	1.075±0.002

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PyDyNet：NumPy-based Dynamic Deep Learning Framework

Towards Large Language Model

Overview

Install

Example

AutoDiff

MLP & LeNet

Dropout & BN

RNN

Transformer

cuda加速

FilesExpand file tree

cnREADME.md

Latest commit

History

cnREADME.md

File metadata and controls

PyDyNet：NumPy-based Dynamic Deep Learning Framework

Towards Large Language Model

Overview

Install

Example

AutoDiff

MLP & LeNet

Dropout & BN

RNN

Transformer

cuda加速