Skip to content

Commit 2d99652

Browse files
Initial Commit
Initial Commit
1 parent 0c4e8d3 commit 2d99652

3 files changed

Lines changed: 137 additions & 0 deletions

File tree

assets/style.css

Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,43 @@
1+
body {
2+
font-family: Arial, sans-serif;
3+
margin: 0;
4+
line-height: 1.6;
5+
background: #f9f9f9;
6+
color: #333;
7+
}
8+
9+
header, footer {
10+
text-align: center;
11+
padding: 2rem;
12+
background: #282c34;
13+
color: white;
14+
}
15+
16+
h1, h2 {
17+
margin-bottom: 1rem;
18+
}
19+
20+
section {
21+
max-width: 800px;
22+
margin: 2rem auto;
23+
padding: 1rem;
24+
background: white;
25+
border-radius: 10px;
26+
box-shadow: 0 2px 6px rgba(0,0,0,0.1);
27+
}
28+
29+
.btn {
30+
display: inline-block;
31+
padding: 0.5rem 1rem;
32+
margin: 0.5rem;
33+
border-radius: 5px;
34+
background: #61dafb;
35+
color: black;
36+
text-decoration: none;
37+
font-weight: bold;
38+
}
39+
40+
.btn.secondary {
41+
background: #ffa500;
42+
color: white;
43+
}

docs/index.html

Lines changed: 52 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,52 @@
1+
<!DOCTYPE html>
2+
<html lang="en">
3+
<head>
4+
<meta charset="UTF-8" />
5+
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
6+
<title>ExtractPDF4J Docs</title>
7+
<link rel="stylesheet" href="../assets/style.css" />
8+
</head>
9+
<body>
10+
<header>
11+
<h1>ExtractPDF4J Documentation</h1>
12+
<a class="btn" href="../index.html">⬅ Back to Home</a>
13+
</header>
14+
15+
<section>
16+
<h2>Installation</h2>
17+
<p>Add ExtractPDF4J to your Maven/Gradle project:</p>
18+
<pre>
19+
Maven:
20+
&lt;dependency&gt;
21+
&lt;groupId&gt;com.extractpdf4j&lt;/groupId&gt;
22+
&lt;artifactId&gt;extractpdf4j&lt;/artifactId&gt;
23+
&lt;version&gt;1.0.0&lt;/version&gt;
24+
&lt;/dependency&gt;
25+
26+
Gradle:
27+
implementation 'com.extractpdf4j:extractpdf4j:1.0.0'
28+
</pre>
29+
</section>
30+
31+
<section>
32+
<h2>Usage Example</h2>
33+
<pre>
34+
PdfHandler handler = new PdfHandler("sample.pdf");
35+
List<Table> tables = handler.parse("lattice");
36+
37+
for (Table t : tables) {
38+
System.out.println(t.toCsv());
39+
}
40+
</pre>
41+
</section>
42+
43+
<section>
44+
<h2>Contributing</h2>
45+
<p>We welcome contributions! Fork the repo, open issues, and raise PRs 🚀</p>
46+
</section>
47+
48+
<footer>
49+
<p>ExtractPDF4J © 2025 | Apache License 2.0</p>
50+
</footer>
51+
</body>
52+
</html>

index.html

Lines changed: 42 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,42 @@
1+
<!DOCTYPE html>
2+
<html lang="en">
3+
<head>
4+
<meta charset="UTF-8" />
5+
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
6+
<title>ExtractPDF4J</title>
7+
<link rel="stylesheet" href="assets/style.css" />
8+
</head>
9+
<body>
10+
<header>
11+
<h1>ExtractPDF4J</h1>
12+
<p>A powerful Java library for extracting tables from PDFs (Stream + Lattice + OCR)</p>
13+
<a class="btn" href="https://github.com/yourusername/ExtractPDF4J" target="_blank">⭐ View on GitHub</a>
14+
<a class="btn secondary" href="docs/index.html">📖 Documentation</a>
15+
</header>
16+
17+
<section>
18+
<h2>✨ Features</h2>
19+
<ul>
20+
<li>Stream and Lattice parsing (like Camelot, but in Java!)</li>
21+
<li>OCR-based extraction for scanned PDFs</li>
22+
<li>Multi-page & complex table handling</li>
23+
<li>Open-source under Apache 2.0</li>
24+
</ul>
25+
</section>
26+
27+
<section>
28+
<h2>🚀 Quick Start</h2>
29+
<pre>
30+
&lt;dependency&gt;
31+
&lt;groupId&gt;com.extractpdf4j&lt;/groupId&gt;
32+
&lt;artifactId&gt;extractpdf4j&lt;/artifactId&gt;
33+
&lt;version&gt;1.0.0&lt;/version&gt;
34+
&lt;/dependency&gt;
35+
</pre>
36+
</section>
37+
38+
<footer>
39+
<p>Made with ❤️ by Mehuli Mukherjee | <a href="docs/index.html">Read Docs</a></p>
40+
</footer>
41+
</body>
42+
</html>

0 commit comments

Comments
 (0)