You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/COMPETITIVE_ANALYSIS.md
+28-21Lines changed: 28 additions & 21 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,20 +1,21 @@
1
1
# Competitive Analysis
2
2
3
-
**Last Updated**: 2025-12-26
3
+
**Last Updated**: 2026-05-07
4
4
**Purpose**: Comprehensive competitive landscape and feature opportunity analysis
5
5
6
6
## Executive Summary
7
7
8
-
sql-splitter occupies a **unique position** in the SQL dump processing ecosystem by combining multiple capabilities that currently require separate tools. As of v1.9.0, we offer: **split + merge + analyze + validate + sample (FK-preserving) + shard + convert + diff + redact**.
8
+
sql-splitter occupies a **unique position** in the SQL dump processing ecosystem by combining multiple capabilities that currently require separate tools. As of v1.13.5, we offer: **split + merge + analyze + validate + sample (FK-preserving) + shard + convert + diff + redact + graph + order + query (DuckDB)**.
9
9
10
10
No existing tool offers this combination in a single, streaming, CLI-first, multi-dialect binary.
11
11
12
12
**Key differentiators:**
13
13
14
14
- Works on dump files directly (no database connection required)
15
15
- Streaming architecture handles 10GB+ dumps
16
-
- Multi-dialect support (MySQL, PostgreSQL, SQLite)
16
+
- Multi-dialect support (MySQL, PostgreSQL, SQLite, MSSQL)
17
17
- 600+ MB/s throughput
18
+
- Embedded DuckDB for SQL analytics on dumps without import
18
19
19
20
---
20
21
@@ -35,9 +36,14 @@ No existing tool offers this combination in a single, streaming, CLI-first, mult
|**DuckDB**| C++ | 34.8k | Query SQL/CSV/JSON/Parquet directly |
168
+
|**sqlglot**| Python | 7k+ | Parse/transpile, not filter |
163
169
164
-
**[DuckDB](https://github.com/duckdb/duckdb)** could solve querying but is overkill for simple dump filtering.
170
+
sql-splitter embeds DuckDB to give full SQL analytics on dumps without an import step (in-memory or disk-backed for >2GB dumps), with persistent caching that delivers a 400× speedup on repeat queries.
165
171
166
172
---
167
173
168
174
### MSSQL Support
169
175
170
176
| Tool | MSSQL |
171
177
| ---------------- | ----------------- |
172
-
|**sql-splitter**|🟡 Planned |
178
+
|**sql-splitter**|✅ (v1.12.x)|
173
179
| Jailer | ✅ (via JDBC) |
174
180
| pynonymizer | ✅ |
175
181
| sqlglot | ✅ (parsing only) |
176
182
| pgloader | ❌ |
177
183
178
-
**Gap**: Major gap in ecosystem for MSSQL dump processing CLI tools.
184
+
sql-splitter is now the only **streaming, file-based, multi-dialect** CLI with SQL Server support — Jailer/pynonymizer require live DB connections.
179
185
180
186
---
181
187
@@ -247,13 +253,13 @@ No existing tool offers this combination in a single, streaming, CLI-first, mult
3. **dbt** — Bootstrap data transformation projects (planned v1.20.0)
806
+
4. **Great Expectations** — Data quality testing (planned v1.18.0)
807
+
5. **GitHub Actions** — CI/CD automation (future)
885
808
886
809
These integrations position sql-splitter as the **Swiss Army knife that plays well with others** rather than trying to replace every tool in the ecosystem.
0 commit comments