This package provides helper files, functions, and classes for generic PySpark processes.

For reference, these URLs are used:

| Type | Source | URL |
|---|---|---|
| Git Repo | GitHub | https://github.com/data-science-extensions/toolbox-pyspark |
| Python Package | PyPI | https://pypi.org/project/toolbox-pyspark |
| Package Docs | Pages | https://data-science-extensions.com/toolbox-pyspark |

You can install this package with whichever tool you prefer: pip, pipenv, poetry, or uv.

Using pip:
- In your terminal, run:

      python3 -m pip install --upgrade pip
      python3 -m pip install toolbox-pyspark

- Or, in your `requirements.txt` file, add:

      toolbox-pyspark

  Then run:

      python3 -m pip install --upgrade pip
      python3 -m pip install --requirement=requirements.txt

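
After installing, it can be useful to confirm that the package is visible to the interpreter you intend to use. The following is a minimal sketch using only the standard library (`importlib.metadata`, Python 3.8+). The helper name `installed_version` is hypothetical and is not part of toolbox-pyspark.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str = "toolbox-pyspark") -> Optional[str]:
    """Return the installed version of `distribution`, or None if it is absent.

    Hypothetical helper for illustration; not part of toolbox-pyspark itself.
    """
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    if found is None:
        print("toolbox-pyspark is not installed in this environment")
    else:
        print(f"toolbox-pyspark {found} is installed")
```

Run it with the same `python3` you used for the install, since each virtual environment keeps its own set of installed distributions.
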
Using pipenv:
- Install using environment variables:

  In your `Pipfile`, add:

      [[source]]
      url = "https://pypi.org/simple"
      verify_ssl = false
      name = "pypi"

      [packages]
      toolbox-pyspark = "*"

  Then run:

      python3 -m pip install pipenv
      python3 -m pipenv install --verbose --skip-lock --categories=root --index=pypi toolbox-pyspark

- Or, in your `requirements.txt` file, add:

      toolbox-pyspark

  Then run:

      python3 -m pipenv install --verbose --skip-lock --requirements=requirements.txt

- Or just run this:

      python3 -m pipenv install --verbose --skip-lock toolbox-pyspark

Using poetry:
- In your `pyproject.toml` file, add:

      [project]
      dependencies = [
          "toolbox-pyspark==1.*",
      ]

  Then run:

      poetry sync
      poetry install

- Or just run this:

      poetry add "toolbox-pyspark==1.*"
      poetry sync
      poetry install

Using uv:
- In your `pyproject.toml` file, add:

      [project]
      dependencies = [
          "toolbox-pyspark==1.*",
      ]

  Then run:

      uv sync

- Or run this:

      uv add "toolbox-pyspark==1.*"
      uv sync

- Or just run this:

      uv pip install "toolbox-pyspark==1.*"

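
The `==1.*` specifier used in the poetry and uv examples is a prefix match: it accepts any release in the 1.x series and rejects other major versions. The sketch below illustrates that rule in a deliberately simplified form. The function `matches_major_one` is hypothetical, and it assumes plain dotted release numbers; it does not handle epochs, pre-releases, or local version labels the way a real installer does.

```python
def matches_major_one(version_string: str) -> bool:
    """Simplified check for the `==1.*` specifier: the first release number must be 1.

    Assumption: `version_string` is a plain dotted release such as "1.4.2".
    Hypothetical helper for illustration only; installers use a full
    version-specifier implementation instead.
    """
    first_component = version_string.split(".")[0]
    return first_component.isdigit() and int(first_component) == 1


# Show which candidate versions the simplified rule accepts.
for candidate in ["1.0.0", "1.12.3", "2.0.0", "0.9.1"]:
    print(candidate, matches_major_one(candidate))
```

In practice this means a future 2.0 release would not be picked up automatically; the constraint would have to be changed by hand.
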
To contribute, check the `CONTRIBUTING.md` file or the Contributing page.