Commit 2c089c1

Merge branch 'develop_930' of github.com:ModelEngine-Group/data-platform into develop_930
2 parents eaf378e + c942fc4 commit 2c089c1

388 files changed

Lines changed: 1508 additions & 2978 deletions

LICENSE

Lines changed: 20 additions & 20 deletions
@@ -1,20 +1,20 @@
-MIT License
-
-Copyright (c) [2025] [ModelEngine Data-Engine Platform]
-
-Permission is hereby granted, free of charge, to any person obtaining a copy
-of this software and associated documentation files (the "Software"), to deal
-in the Software without restriction, including without limitation the rights
-to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
-copies of the Software, subject to the following conditions:
-
-The above copyright notice and this permission notice shall be included in all
-copies or substantial portions of the Software.
-
-THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
-IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
-FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
-AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
-LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
-OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
-SOFTWARE.
+# DataMate Open Source License
+
+DataMate is licensed under the MIT License, with the following additional conditions:
+
+DataMate is permitted to be used commercially, including as a backend service for other applications or as an application development platform for enterprises. However, when the following conditions are met, you must contact the producer to obtain a commercial license:
+
+a. Multi-tenant SaaS service: Unless explicitly authorized by DataMate in writing, you may not use the DataMate source code to operate a multi-tenant SaaS service.
+b. LOGO and copyright information: In the process of using DataMate's frontend, you may not remove or modify the LOGO or copyright information in the DataMate console or applications. This restriction is inapplicable to uses of Nexent that do not involve its frontend.
+
+Please contact zhangyafeng2@huawei.com by email to inquire about licensing matters.
+
+As a contributor, you should agree that:
+
+a. The producer can adjust the open-source agreement to be more strict or relaxed as deemed necessary.
+b. Your contributed code may be used for commercial purposes, such as DataMate's cloud business.
+
+Apart from the specific conditions mentioned above, all other rights and restrictions follow the MIT License.
+Detailed information about the MIT License can be found at: https://opensource.org/licenses/MIT
+
+Copyright © 2025 Huawei Technologies Co., Ltd.

Makefile

Lines changed: 53 additions & 50 deletions
@@ -1,12 +1,19 @@
 MAKEFLAGS += --no-print-directory
 
+VERSION ?= latest
+NAMESPACE ?= datamate
+
 .PHONY: build-%
 build-%:
 	$(MAKE) $*-docker-build
 
 .PHONY: build
 build: backend-docker-build frontend-docker-build runtime-docker-build
 
+.PHONY: create-namespace
+create-namespace:
+	@kubectl get namespace $(NAMESPACE) > /dev/null 2>&1 || kubectl create namespace $(NAMESPACE)
+
 .PHONY: install-%
 install-%:
 ifeq ($(origin INSTALLER), undefined)
@@ -26,7 +33,7 @@ else
 endif
 
 .PHONY: install
-install: install-data-platform
+install: install-data-mate
 
 .PHONY: uninstall-%
 uninstall-%:
@@ -47,115 +54,111 @@ else
 endif
 
 .PHONY: uninstall
-uninstall: uninstall-data-platform
+uninstall: uninstall-data-mate
 
 # build
 .PHONY: mineru-docker-build
 mineru-docker-build:
-	sh scripts/images/mineru/build.sh
+	docker build -t mineru:$(VERSION) . -f scripts/images/mineru/Dockerfile
 
 .PHONY: datax-docker-build
 datax-docker-build:
-	sh scripts/images/datax/build.sh
-
-.PHONY: data-juicer-docker-build
-data-juicer-docker-build:
-	sh scripts/images/data-juicer/build.sh
+	docker build -t datax:$(VERSION) . -f scripts/images/datax/Dockerfile
 
 .PHONY: unstructured-docker-build
 unstructured-docker-build:
-	sh scripts/images/unstructured/build.sh
+	docker build -t unstructured:$(VERSION) . -f scripts/images/unstructured/Dockerfile
 
 .PHONY: backend-docker-build
 backend-docker-build:
-	sh scripts/images/backend/build.sh
+	docker build -t backend:$(VERSION) . -f scripts/images/backend/Dockerfile
 
 .PHONY: frontend-docker-build
 frontend-docker-build:
-	sh scripts/images/frontend/build.sh
+	docker build -t frontend:$(VERSION) . -f scripts/images/frontend/Dockerfile
 
 .PHONY: runtime-docker-build
 runtime-docker-build:
-	sh scripts/images/runtime/build.sh
+	docker build -t runtime:$(VERSION) . -f scripts/images/runtime/Dockerfile
 
 .PHONY: backend-docker-install
 backend-docker-install:
-	cd deployment/docker/data-platform && docker-compose up -d backend
+	cd deployment/docker/data-mate && docker-compose up -d backend
 
 .PHONY: backend-docker-uninstall
 backend-docker-uninstall:
-	cd deployment/docker/data-platform && docker-compose down backend
+	cd deployment/docker/data-mate && docker-compose down backend
 
 .PHONY: frontend-docker-install
 frontend-docker-install:
-	cd deployment/docker/data-platform && docker-compose up -d frontend
+	cd deployment/docker/data-mate && docker-compose up -d frontend
 
 .PHONY: frontend-docker-uninstall
 frontend-docker-uninstall:
-	cd deployment/docker/data-platform && docker-compose down frontend
+	cd deployment/docker/data-mate && docker-compose down frontend
 
 .PHONY: runtime-docker-install
 runtime-docker-install:
-	cd deployment/docker/data-platform && docker-compose up -d runtime
+	cd deployment/docker/data-mate && docker-compose up -d runtime
 
 .PHONY: runtime-docker-uninstall
 runtime-docker-uninstall:
-	cd deployment/docker/data-platform && docker-compose down runtime
+	cd deployment/docker/data-mate && docker-compose down runtime
 
 .PHONY: runtime-k8s-install
-runtime-k8s-install:
-	helm upgrade kuberay-operator deployment/helm/ray/kuberay-operator --install
-	helm upgrade raycluster deployment/helm/ray/ray-cluster/ --install
-	kubectl apply -f deployment/helm/ray/service.yaml
+runtime-k8s-install: create-namespace
+	helm upgrade kuberay-operator deployment/helm/ray/kuberay-operator --install -n $(NAMESPACE)
+	helm upgrade raycluster deployment/helm/ray/ray-cluster/ --install -n $(NAMESPACE)
+	kubectl apply -f deployment/helm/ray/service.yaml -n $(NAMESPACE)
 
 .PHONY: runtime-k8s-uninstall
 runtime-k8s-uninstall:
-	helm uninstall raycluster
-	helm uninstall kuberay-operator
-	kubectl delete -f deployment/helm/ray/service.yaml
+	helm uninstall raycluster -n $(NAMESPACE)
+	helm uninstall kuberay-operator -n $(NAMESPACE)
+	kubectl delete -f deployment/helm/ray/service.yaml -n $(NAMESPACE)
 
 .PHONY: unstructured-k8s-install
-unstructured-k8s-install:
-	kubectl apply -f deployment/kubernetes/unstructured/deploy.yaml
+unstructured-k8s-install: create-namespace
+	kubectl apply -f deployment/kubernetes/unstructured/deploy.yaml -n $(NAMESPACE)
 
 .PHONY: mysql-k8s-install
-mysql-k8s-install:
-	kubectl create configmap init-sql --from-file=scripts/db/ --dry-run=client -o yaml | kubectl apply -f -
-	kubectl apply -f deployment/kubernetes/mysql/configmap.yaml
-	kubectl apply -f deployment/kubernetes/mysql/deploy.yaml
+mysql-k8s-install: create-namespace
+	kubectl create configmap init-sql --from-file=scripts/db/ --dry-run=client -o yaml | kubectl apply -f - -n $(NAMESPACE)
+	kubectl apply -f deployment/kubernetes/mysql/configmap.yaml -n $(NAMESPACE)
+	kubectl apply -f deployment/kubernetes/mysql/deploy.yaml -n $(NAMESPACE)
 
 .PHONY: mysql-k8s-uninstall
 mysql-k8s-uninstall:
-	kubectl delete configmap init-sql
-	kubectl delete -f deployment/kubernetes/mysql/configmap.yaml
-	kubectl delete -f deployment/kubernetes/mysql/deploy.yaml
+	kubectl delete configmap init-sql -n $(NAMESPACE)
+	kubectl delete -f deployment/kubernetes/mysql/configmap.yaml -n $(NAMESPACE)
+	kubectl delete -f deployment/kubernetes/mysql/deploy.yaml -n $(NAMESPACE)
 
 .PHONY: backend-k8s-install
-backend-k8s-install:
-	kubectl apply -f deployment/kubernetes/backend/deploy.yaml
+backend-k8s-install: create-namespace
+	kubectl apply -f deployment/kubernetes/backend/deploy.yaml -n $(NAMESPACE)
 
 .PHONY: backend-k8s-uninstall
 backend-k8s-uninstall:
-	kubectl delete -f deployment/kubernetes/backend/deploy.yaml
+	kubectl delete -f deployment/kubernetes/backend/deploy.yaml -n $(NAMESPACE)
 
 .PHONY: frontend-k8s-install
-frontend-k8s-install:
-	kubectl apply -f deployment/kubernetes/frontend/deploy.yaml
+frontend-k8s-install: create-namespace
+	kubectl apply -f deployment/kubernetes/frontend/deploy.yaml -n $(NAMESPACE)
 
 .PHONY: frontend-k8s-uninstall
 frontend-k8s-uninstall:
-	kubectl delete -f deployment/kubernetes/frontend/deploy.yaml
+	kubectl delete -f deployment/kubernetes/frontend/deploy.yaml -n $(NAMESPACE)
 
-.PHONY: data-platform-docker-install
-data-platform-docker-install:
-	cd deployment/docker/data-platform && docker-compose up -d
+.PHONY: data-mate-docker-install
+data-mate-docker-install:
+	cd deployment/docker/datamate && docker-compose up -d
 
-.PHONY: data-platform-docker-uninstall
-data-platform-docker-uninstall:
-	cd deployment/docker/data-platform && docker-compose down
+.PHONY: data-mate-docker-uninstall
+data-mate-docker-uninstall:
+	cd deployment/docker/datamate && docker-compose down
 
-.PHONY: data-platform-k8s-install
-data-platform-k8s-install: mysql-k8s-install backend-k8s-install frontend-k8s-install runtime-k8s-install
+.PHONY: data-mate-k8s-install
+data-mate-k8s-install: create-namespace mysql-k8s-install backend-k8s-install frontend-k8s-install runtime-k8s-install
 
-.PHONY: data-platform-k8s-uninstall
-data-platform-k8s-uninstall: mysql-k8s-uninstall backend-k8s-uninstall frontend-k8s-uninstall runtime-k8s-uninstall
+.PHONY: data-mate-k8s-uninstall
+data-mate-k8s-uninstall: mysql-k8s-uninstall backend-k8s-uninstall frontend-k8s-uninstall runtime-k8s-uninstall
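
The `VERSION ?= latest` and `NAMESPACE ?= datamate` lines introduced in this Makefile use Make's conditional assignment, so a value supplied on the command line (or already present in the environment) replaces the default. The following is a minimal sketch of that behaviour using a throwaway Makefile rather than the project's own; the `show` target and the `staging` value are hypothetical and exist only for the demonstration.

```shell
# Write a throwaway Makefile that mirrors the NAMESPACE default from the diff.
# The "show" target is hypothetical; it only echoes the variable.
tmpdir=$(mktemp -d)
printf 'NAMESPACE ?= datamate\nshow:\n\t@echo $(NAMESPACE)\n' > "$tmpdir/Makefile"

# Without an override, the ?= default applies.
default_ns=$(make -s -f "$tmpdir/Makefile" show)

# A command-line assignment takes precedence over the ?= default.
override_ns=$(make -s -f "$tmpdir/Makefile" show NAMESPACE=staging)

echo "default:  $default_ns"
echo "override: $override_ns"

# Clean up the temporary directory.
rm -rf "$tmpdir"
```

Applied to the Makefile in this commit, the same mechanism suggests invocations such as `make build VERSION=1.0.0`, which would tag the backend, frontend and runtime images with `1.0.0` instead of `latest`, or `make runtime-k8s-install NAMESPACE=staging`; both values are illustrative.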

README-zh.md

Lines changed: 72 additions & 0 deletions
@@ -0,0 +1,72 @@
+# DataMate: A One-Stop Data Work Platform
+
+<div align="center">
+
+[![Backend CI](https://github.com/ModelEngine-Group/DataMate/actions/workflows/docker-image-backend.yml/badge.svg)](https://github.com/ModelEngine-Group/DataMate/actions/workflows/docker-image-backend.yml)
+[![Frontend CI](https://github.com/ModelEngine-Group/DataMate/actions/workflows/docker-image-frontend.yml/badge.svg)](https://github.com/ModelEngine-Group/DataMate/actions/workflows/docker-image-frontend.yml)
+![GitHub Stars](https://img.shields.io/github/stars/ModelEngine-Group/DataMate)
+![GitHub Forks](https://img.shields.io/github/forks/ModelEngine-Group/DataMate)
+![GitHub Issues](https://img.shields.io/github/issues/ModelEngine-Group/DataMate)
+![GitHub License](https://img.shields.io/github/license/ModelEngine-Group/DataMate)
+
+**DataMate is an enterprise-grade data processing platform for model fine-tuning and RAG retrieval. Its core features include data collection, data management, an operator marketplace, data cleaning, data synthesis, data annotation, data evaluation, and knowledge generation.
+**
+
+[简体中文](./README-zh.md) | [English](./README.md)
+
+If you like this project, please give us a Star⭐️!
+
+</div>
+
+## 🌟 Core Features
+
+- **Core modules**: data collection, data management, operator marketplace, data cleaning, data synthesis, data annotation, data evaluation, knowledge generation
+- **Visual orchestration**: drag-and-drop design of data processing pipelines
+- **Operator ecosystem**: a rich set of built-in operators plus support for custom operators
+
+## 🚀 Quick Start
+
+### Prerequisites
+
+- Git (for fetching the source code)
+- Make (for building and installing)
+- Docker (for building images and deploying services)
+- Docker-Compose (for deploying services with Docker)
+- Kubernetes (for deploying services on Kubernetes)
+- Helm (for deploying services on Kubernetes)
+
+### Clone the Code
+
+```bash
+git clone git@github.com:ModelEngine-Group/DataMate.git
+```
+
+### Build Images
+
+```bash
+make build
+```
+
+### Docker Installation
+
+```bash
+make install INSTALLER=docker
+```
+
+### Kubernetes Installation
+
+```bash
+make install INSTALLER=k8s
+```
+
+## 🤝 Contributing
+
+Thank you for your interest in this project! We warmly welcome community contributions. Filing bug reports, proposing features, and contributing code all help make the project better.
+
+• 📮 [GitHub Issues](../../issues): submit bug reports or feature suggestions.
+
+• 🔧 [GitHub Pull Requests](../../pulls): contribute code improvements.
+
+## 📄 License
+
+DataMate is open source under the [MIT](LICENSE) license. You are free to use, modify, and distribute the project's code, provided you comply with the license terms.
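
The prerequisites listed in this README can be verified before running `make build`. The following is a minimal sketch, assuming the tools are on `PATH` under the binary names the Makefile invokes (`docker-compose`, `kubectl`, `helm`); the `missing_tools` helper is hypothetical and not part of the repository.

```shell
# Print the name of every tool in the argument list that is not on PATH.
# Hypothetical helper for illustration; the repository does not ship it.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" > /dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Tools needed for both installation modes.
missing_tools git make docker

# Needed only for INSTALLER=docker.
missing_tools docker-compose

# Needed only for INSTALLER=k8s.
missing_tools kubectl helm
```

An empty result means every listed tool was found; any name printed still needs to be installed before the corresponding `make install` mode is attempted.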
