Skip to content

Commit 52b7609

Browse files
authored
Merge pull request #10 from reeshabh90/reeshabh-web-changes
Reeshabh web changes
2 parents 46a5a32 + 4cf72f6 commit 52b7609

33 files changed

Lines changed: 2194 additions & 216 deletions

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@
44
/node_modules
55
/.pnp
66
.pnp.js
7+
.docusaurus
78

89
# testing
910
/coverage

blog/2026-05-05-docwire-returns.md

Lines changed: 62 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,62 @@
1+
---
2+
title: Docwire - The Engine Behind Secure Data Pipelines
3+
authors: krzysztof
4+
tags: [C++20, technology, document processing, data security]
5+
---
6+
*“Persistence turns a bar of iron into a needle”*, Nguyễn Phan Quế Mai, *Dust Child*
7+
8+
![Docwire return](/img/blog/2026-05-05-docwire-returns/DocwireReturn.PNG)
9+
10+
In course of human history, there has not been a time like the present era when Information has become of paramount importance. As James Gleick observes in his book “The Information”, information is what our world runs on: the blood and the fuel, the vital principle. Information pervades all domains of sciences, transforming every branch of knowledge. And Information needs to be inferred in its purest form to act as Intelligence.
11+
12+
Away from the noise, contemplating in silence, working through perseverance, Docwire SDK was being evolved into an infrastructure layer for modern data workflows to sustain the demands of modern day information processing. The journey has been long and arduous but the result has been equally satisfying. And it is time we present to you the much more evolved Docwire SDK, along with anecdotes about the philosophy behind it and the direction forward.
13+
<!-- truncate -->
14+
## The Invisible Engine
15+
Every interface we interact with is a surface level reality, but the entity itself is supported by an engine which remains invisible. Same is true for the virtual world. Some of the most important software is never seen. When you tap a payment card or rely on an embedded medical device, complex logic runs quietly in the background. Document processing plays a similar role in many systems — critical, yet hidden. And Docwire has been designed to serve exactly the same purpose. An engine that extracts, normalizes, and transforms unstructured documents into structured, usable data — locally, securely and reliably. And it still adheres to its “Plug and Play” philosophy. Developers integrate it once and it simply works! Only better this time, with support for more file formats and a fluent ingestion layer for building data processing pipeline.
16+
17+
18+
## What Changed since 2023
19+
We started this journey of evolving Docwire with following ideas as underlying philosophy:
20+
- Broader format support and improved parsing stability.
21+
- Supporting real-world usage in enterprise environments.
22+
- Deeper involvement in client projects as core processing layer.
23+
- Easy integrations with local LLM runtimes and AI pipelines.
24+
25+
💡And with each execution, one insight (or intuition) became clear:<br/>
26+
<u>*The need for secure and reliable data processing on local premises is increasing in the LLM era.*</u>
27+
28+
And this is where Docwire shines. Docwire is not simply AI-based or AI-driven, but AI-integrated SDK, which handles your document data processing requirements. It gives the user enough flexibility to process the data across various file formats and integrate it further with AI models of their choice, be it local or through APIs.
29+
30+
## From Files to Pipelines
31+
DocWire's evolution is not limited to supporting various file formats but integrating more workflows. In its inception, it was a file parsing tool and in its evolution, it is becoming pipeline construction SDK with various tools at your disposal:<br/>
32+
<b>Documents → Extraction → Normalization → Transformation → Output</b><br/>
33+
34+
These pipelines can feed search, analytics, knowledge bases, or AI systems, often running entirely within controlled environments.
35+
This shift defines the next phase of DocWire.
36+
37+
## Watch: The Invisible Engine Concept
38+
To explain this direction visually, we prepared a short concept video introducing DocWire as the invisible engine behind secure data pipelines.
39+
40+
<iframe
41+
width="100%"
42+
height="400"
43+
src="https://www.youtube.com/embed/dlLMHZlULi4"
44+
title="Demo video"
45+
allowFullScreen
46+
/>
47+
48+
## What’s Next
49+
Now that we are back, look out for communications from the Docwire team frequently on this forum. We will start sharing:
50+
- Engineering deep dives
51+
- Integration experiments
52+
- Lessons from real deployments
53+
- Previews of upcoming features
54+
55+
DocWire is evolving from SDK toward a broader processing framework — and this space will document that journey.
56+
## Explore DocWire
57+
If you’re building systems that depend on reliable document processing or secure data pipelines, we invite you to explore:
58+
- [GitHub repository](https://github.com/docwire/docwire)
59+
- [Documentation](https://docwire.readthedocs.io/en/latest/)
60+
- [Recent releases](https://github.com/docwire/docwire/releases)
61+
62+
Or simply reach out — we are always curious about what others are building behind the scenes, and eager to support the cause in any way possible.

blog/authors.yml

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,5 @@
1-
ferid:
2-
name: Ferid Obeidat
3-
title: Author
4-
image_url: /img/Ferid-headshot.png
1+
krzysztof:
2+
name: Krzysztof Nowicki
3+
title: Chief Business Officer, Founder - Docwire
4+
image_url: /img/kryz-headshot.png
5+

docusaurus.config.js

Lines changed: 21 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
// @ts-check
22

3-
const {themes} = require('prism-react-renderer');
3+
const { themes } = require('prism-react-renderer');
44
const lightCodeTheme = themes.github;
55
const darkCodeTheme = themes.dracula;
66

@@ -79,6 +79,21 @@ const config = {
7979
}),
8080
],
8181
],
82+
plugins: [
83+
[
84+
'@docusaurus/plugin-content-blog',
85+
{
86+
id: 'tech-dive',
87+
routeBasePath: 'tech-dive',
88+
path: './tech-dive',
89+
showReadingTime: true,
90+
beforeDefaultRemarkPlugins: [
91+
require('./src/plugins/remark-youtube'),
92+
require('./src/plugins/remark-static-images'),
93+
],
94+
},
95+
],
96+
],
8297

8398
themeConfig:
8499
/** @type {import('@docusaurus/preset-classic').ThemeConfig} */
@@ -96,8 +111,8 @@ const config = {
96111
src: 'img/logoDocWire.JPG',
97112
},
98113
items: [
99-
{to: '/about-us', label: 'About Us', position: 'left'},
100-
{to: '/showcases', label: 'Showcases', position: 'left'},
114+
{ to: '/about-us', label: 'About Us', position: 'left' },
115+
{ to: '/showcases', label: 'Showcases', position: 'left' },
101116
{
102117
href: 'https://docwire.readthedocs.io/',
103118
position: 'left',
@@ -108,8 +123,9 @@ const config = {
108123
label: 'Download',
109124
position: 'left',
110125
},
111-
{to: '/blog', label: 'Blog', position: 'left'},
112-
{to: '/contact-us', label: 'Contact Us', position: 'right', className: 'button-pill nav-cta'},
126+
{ to: '/blog', label: 'Blog', position: 'left' },
127+
{ to: '/tech-dive', label: 'Tech Dive', position: 'left' },
128+
{ to: '/contact-us', label: 'Contact Us', position: 'right', className: 'button-pill nav-cta' },
113129
{
114130
href: 'https://github.com/docwire/docwire',
115131
label: 'GitHub',

src/assets/data-extraction.png

221 KB
Loading

src/assets/email-parse.png

362 KB
Loading

src/assets/ocr.png

836 KB
Loading
231 KB
Loading

src/assets/on-premise-security.png

321 KB
Loading

src/assets/parse-data.png

792 KB
Loading

0 commit comments

Comments
 (0)