Skip to content

Latest commit

 

History

History
109 lines (70 loc) · 4.88 KB

File metadata and controls

109 lines (70 loc) · 4.88 KB

PyDyNet:NumPy-based Dynamic Deep Learning Framework

PyDyNet已被多个技术公众号和社区分享居然用Numpy实现了一个深度学习框架.

Downloads Downloads x x x x

Towards Large Language Model

2025.8.12: 实现了纯推理的llama3 (6-layer Transformer, vocab-size=32000). 参考了这里的NumPy实现和数据集. 将数据集下载到llama文件夹即可运行:

>>> python -m llama.infer
There was a boy named Timmy. He loved to play with hi toy and run around outside. One day, Timmy' mom asked him to help her with the laundry. Timmy didn't want to help because he wanted to play. But hi mom said, "Timmy, you need to help me. It' important to help out."
Timmy didn't want to help, but he knew he had to. So, he put on hi shoe and went outside to help hi mom. A they were folding the clothe, Timmy saw a big pile of laundry on the floor. He wanted to help, so he started to pick it up. But then, he accidentally knocked over a pile of clothe and they fell on him. Timmy wa okay, but he felt bad.
Hi mom saw what happened and said, "Timmy, you need to be more careful. You could have hurt yourself." Timmy felt bad and said sorry. Hi mom hugged him and said, "It' okay, accident happen. Let' clean up the laundry together." Timmy learned that it' important to be careful and help out when you need it.

Token count: 262, elapsed: 0.87s, 300 tokens/s

Overview

PyDyNet也是纯NumPy(0.0.7版本后加入CuPy,其用法和NumPy一致)实现的神经网络,语法受PyTorch的启发,大致结构如下:

graph LR
   N(numpy/cupy.ndarray)--Backend--> A(Tensor) --> ds(Dataset) ---> Data(DataLoader)---> Mission
   A  --Eager execution--> B(Basic operators:<br> add, exp, etc)
   B -.Autograd-.-> A

   B --> CO(Complex<br>operators)
   --> f(Function:<br>img2col, etc) 
   --> M(Basic Module:<br>Linear, etc)
   --> CM(Advanced Module: CNN, RNN, Transformer, etc)
   --> Mission(Learning task)
   A --> GD(Optimizer:<br> SGD, Adam, etc) ---> LS(lr_scheduler: <br>StepLR, etc)---> Mission
Loading

虚线表示用户可以通过no_grad来关闭自动微分功能.

Install

git clone https://github.com/Kaslanarian/PyDyNet
cd PyDyNet
python setup.py install

Example

examples/pydynet中是一些例子,examples/pytorch给出等价的pytorch实现. 运行python examples.pydynet.xxx即可:

AutoDiff

autodiff1d.py利用自动微分,对一个一维凸函数进行梯度下降:

ad1

以及一个多元凸函数的例子: autodiff2d.py

ad2

MLP & LeNet

mlp_cnn.py使用MLP和LeNet对MNIST进行分类. 训练准确率和测试准确率:

dnn

Dropout & BN

mlp_dropout_bn.py使用三种网络对fetch_olivetti_faces人脸(64×64)数据集进行分类并进行性能对比:

  1. 三层MLP;
  2. 三层MLP + Dropout;
  3. 三层MLP + BatchNormalization.

学习效果对比:

cnn

RNN

ts_prediction中是一个用GRU做时序预测例子:

RNN

Transformer

transformer.py中是一个用Transformer训练文本分类模型的例子. 训练结果:

transformer

数据集 (CoLA) 链接: https://nyu-mll.github.io/CoLA/cola_public_1.1.zip

cuda加速

在训练batch size为256, 测试batch size为1024情况下,模型在CPU和GPU上的训练速度比较:

Network structure Dataset CPU time (s) per epoch GPU time (s) per epoch
3-layer MLP MNIST (80000×574) 7.256±0.138 1.203±.0181
LeNet MNIST (80000×574) 239.664±2.108 2.841±0.026
1-layer Transformer (dim=512, head=4) CoLA (8551×45×64) 17.503±0.251 1.075±0.002

设备: Nvidia GeForce RTX 4090.