Skip to content

Commit b2d73e1

Browse files
committed
Merge branch 'dj/fix-docs' into dj/docs
2 parents 85b043f + 13d3673 commit b2d73e1

40 files changed

Lines changed: 7252 additions & 2364 deletions

.env_example

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,4 @@
1+
GOOGLE_API_KEY=""
2+
OPENAI_API_KEY=""
3+
HUGGINGFACEHUB_API_TOKEN=""
4+
TOGETHER_API_KEY=""

.github/workflows/deploy_docs.yml

Lines changed: 36 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -4,20 +4,48 @@ on:
44
push:
55
branches:
66
- main
7-
- dj/docs
7+
- '**docs**'
88
paths:
99
- 'docs/**'
1010
- '.github/workflows/deploy_docs.yml'
1111

1212
jobs:
13-
pages:
14-
runs-on: ubuntu-20.04
15-
environment:
16-
name: github-pages
17-
url: ${{ steps.deployment.outputs.page_url }}
13+
build-docs:
14+
runs-on: ubuntu-latest
15+
steps:
16+
- name: Checkout repository
17+
uses: actions/checkout@v4
18+
19+
- name: Set up Python
20+
uses: actions/setup-python@v5
21+
with:
22+
python-version: "3.11"
23+
24+
- name: Install dependencies
25+
run: |
26+
python -m pip install --upgrade pip
27+
pip install sphinx docutils
28+
pip install -r docs/requirements.txt || true
29+
30+
- name: Build Sphinx documentation
31+
run: |
32+
sphinx-build -b html docs/ docs/_build/html
33+
34+
- name: Upload documentation artifact
35+
uses: actions/upload-pages-artifact@v3
36+
with:
37+
path: docs/_build/html
38+
39+
deploy:
40+
needs: build-docs
41+
runs-on: ubuntu-latest
1842
permissions:
1943
pages: write
2044
id-token: write
45+
environment:
46+
name: github-pages
47+
url: ${{ steps.deployment.outputs.page_url }}
2148
steps:
22-
- id: deployment
23-
uses: sphinx-notes/pages@v3
49+
- name: Deploy to GitHub Pages
50+
id: deployment
51+
uses: actions/deploy-pages@v2

.gitignore

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -164,4 +164,7 @@ cython_debug/
164164
# Custom
165165
tests/outputs/
166166
outputs/
167-
inputs/
167+
inputs/
168+
169+
# Others
170+
.DS_Store

CHANGELOG.md

Lines changed: 79 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -80,7 +80,85 @@
8080
## [0.1.8] - 2025-01-23
8181

8282
### Added
83-
* Rretry and error handling for LLM_PARSE
83+
* Retry and error handling for LLM_PARSE
8484

8585
### Changed
8686
* Remove together Python client dependency and use REST API calls instead
87+
88+
## [0.1.8.post1] - 2025-01-28
89+
90+
### Added
91+
* Documentation
92+
93+
### Changed
94+
* Specify headers for Playwright web page retrieval
95+
96+
## [0.1.9] - 2025-02-17
97+
98+
### Added
99+
- Parameters to specify intermediate PDF save path when `as_pdf=True`.
100+
- Return `token_uage` and `pdf_path` with `parse()` output where applicable
101+
102+
### Changed
103+
- Switched back to together Python client
104+
- Improved `parse()` function return format to be a dictionary.
105+
106+
107+
## [0.1.10] - 2025-02-23
108+
109+
### Added
110+
- Parameter to specify page numbers for parsing
111+
112+
### Fixed
113+
- Errors caused by empty token_usage
114+
115+
## [0.1.11] - 2025-02-27
116+
117+
### Added
118+
- Priority setting to AUTO routing
119+
- More models to benchmark
120+
121+
### Changed
122+
- Set default parse_type to AUTO
123+
- Set default LLM to Gemini 2.0 Flash
124+
- Updated benchmark script to aggregate over multiple runs
125+
126+
### Fixed
127+
- Incorrect title when `as_pdf=True`
128+
129+
130+
## [0.1.11.post1] - 2025-03-05
131+
132+
### Added
133+
- Code of Conduct
134+
135+
### Fixed
136+
- Segmentation fault when PyQT app is reinitialized
137+
138+
## [0.1.12] - 2025-04-11
139+
140+
### Added
141+
* Support for OpenRouter models
142+
* Return token cost when cost mapping is provided
143+
* Support for custom prompts
144+
* Support for parsing Excel and PowerPoint files
145+
146+
### Changed
147+
* Set default `router_priority` to `speed`
148+
149+
## [0.1.13] - 2025-04-20
150+
151+
### Added
152+
* `STATIC_PARSE` improvements
153+
* Horizontal line detection
154+
* Strikethrough text detection
155+
* Email address formatting
156+
* Improved heading level detection
157+
* Monospace font detection
158+
* Indentation detection
159+
160+
## [0.1.14] - 2025-06-05
161+
162+
### Added
163+
* Add support for Fireworks API
164+
* Add support for matching data in document to pre-defined schema or template

CODE_OF_CONDUCT.md

Lines changed: 134 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,134 @@
1+
At Oid Labs we are committed to enabling a safe, welcoming and collaborative environment for everyone.
2+
3+
# Contributor Covenant Code of Conduct
4+
5+
## Our Pledge
6+
7+
We as members, contributors, and leaders pledge to make participation in our
8+
community a harassment-free experience for everyone, regardless of age, body
9+
size, visible or invisible disability, ethnicity, sex characteristics, gender
10+
identity and expression, level of experience, education, socio-economic status,
11+
nationality, personal appearance, race, caste, color, religion, or sexual
12+
identity and orientation.
13+
14+
We pledge to act and interact in ways that contribute to an open, welcoming,
15+
diverse, inclusive, and healthy community.
16+
17+
## Our Standards
18+
19+
Examples of behavior that contributes to a positive environment for our
20+
community include:
21+
22+
- Demonstrating empathy and kindness toward other people
23+
- Being respectful of differing opinions, viewpoints, and experiences
24+
- Giving and gracefully accepting constructive feedback
25+
- Accepting responsibility and apologizing to those affected by our mistakes,
26+
and learning from the experience
27+
- Focusing on what is best not just for us as individuals, but for the overall
28+
community
29+
30+
Examples of unacceptable behavior include:
31+
32+
- The use of sexualized language or imagery, and sexual attention or advances of
33+
any kind
34+
- Trolling, insulting or derogatory comments, and personal or political attacks
35+
- Public or private harassment
36+
- Publishing others' private information, such as a physical or email address,
37+
without their explicit permission
38+
- Other conduct which could reasonably be considered inappropriate in a
39+
professional setting
40+
41+
## Enforcement Responsibilities
42+
43+
Community leaders are responsible for clarifying and enforcing our standards of
44+
acceptable behavior and will take appropriate and fair corrective action in
45+
response to any behavior that they deem inappropriate, threatening, offensive,
46+
or harmful.
47+
48+
Community leaders have the right and responsibility to remove, edit, or reject
49+
comments, commits, code, wiki edits, issues, and other contributions that are
50+
not aligned to this Code of Conduct, and will communicate reasons for moderation
51+
decisions when appropriate.
52+
53+
## Scope
54+
55+
This Code of Conduct applies within all community spaces, and also applies when
56+
an individual is officially representing the community in public spaces.
57+
Examples of representing our community include using an official email address,
58+
posting via an official social media account, or acting as an appointed
59+
representative at an online or offline event.
60+
61+
## Enforcement
62+
63+
Instances of abusive, harassing, or otherwise unacceptable behavior may be
64+
reported to the community leaders responsible for enforcement at
65+
[INSERT CONTACT METHOD].
66+
All complaints will be reviewed and investigated promptly and fairly.
67+
68+
All community leaders are obligated to respect the privacy and security of the
69+
reporter of any incident.
70+
71+
## Enforcement Guidelines
72+
73+
Community leaders will follow these Community Impact Guidelines in determining
74+
the consequences for any action they deem in violation of this Code of Conduct:
75+
76+
### 1. Correction
77+
78+
**Community Impact**: Use of inappropriate language or other behavior deemed
79+
unprofessional or unwelcome in the community.
80+
81+
**Consequence**: A private, written warning from community leaders, providing
82+
clarity around the nature of the violation and an explanation of why the
83+
behavior was inappropriate. A public apology may be requested.
84+
85+
### 2. Warning
86+
87+
**Community Impact**: A violation through a single incident or series of
88+
actions.
89+
90+
**Consequence**: A warning with consequences for continued behavior. No
91+
interaction with the people involved, including unsolicited interaction with
92+
those enforcing the Code of Conduct, for a specified period of time. This
93+
includes avoiding interactions in community spaces as well as external channels
94+
like social media. Violating these terms may lead to a temporary or permanent
95+
ban.
96+
97+
### 3. Temporary Ban
98+
99+
**Community Impact**: A serious violation of community standards, including
100+
sustained inappropriate behavior.
101+
102+
**Consequence**: A temporary ban from any sort of interaction or public
103+
communication with the community for a specified period of time. No public or
104+
private interaction with the people involved, including unsolicited interaction
105+
with those enforcing the Code of Conduct, is allowed during this period.
106+
Violating these terms may lead to a permanent ban.
107+
108+
### 4. Permanent Ban
109+
110+
**Community Impact**: Demonstrating a pattern of violation of community
111+
standards, including sustained inappropriate behavior, harassment of an
112+
individual, or aggression toward or disparagement of classes of individuals.
113+
114+
**Consequence**: A permanent ban from any sort of public interaction within the
115+
community.
116+
117+
## Attribution
118+
119+
This Code of Conduct is adapted from the [Contributor Covenant][homepage],
120+
version 2.1, available at
121+
[https://www.contributor-covenant.org/version/2/1/code_of_conduct.html][v2.1].
122+
123+
Community Impact Guidelines were inspired by
124+
[Mozilla's code of conduct enforcement ladder][Mozilla CoC].
125+
126+
For answers to common questions about this code of conduct, see the FAQ at
127+
[https://www.contributor-covenant.org/faq][FAQ]. Translations are available at
128+
[https://www.contributor-covenant.org/translations][translations].
129+
130+
[homepage]: https://www.contributor-covenant.org
131+
[v2.1]: https://www.contributor-covenant.org/version/2/1/code_of_conduct.html
132+
[Mozilla CoC]: https://github.com/mozilla/diversity
133+
[FAQ]: https://www.contributor-covenant.org/faq
134+
[translations]: https://www.contributor-covenant.org/translations

0 commit comments

Comments
 (0)