Commit 23df384

Fix spelling mistakes in documentation and source code comments
Fix ~219 spelling errors across 92 files including documentation (.rst, .md), source code comments/docstrings, and corresponding test fixtures.

Key corrections: compatibility, maintenance, dependency, environment, comparison, simultaneously, unnecessary, bootstrapping, and many others.

No code logic changes - only human-readable text in comments, docstrings, and documentation. Test fixtures updated where source string changes affect serialized output.

Identified using codespell with project-specific ignore list.

Fixes #4719

Signed-off-by: Mrityunjay Raj <mr.raj.earth@gmail.com>
1 parent 022ddc8 commit 23df384

93 files changed

Lines changed: 220 additions & 219 deletions
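
The commit message says the typos were identified using codespell with a project-specific ignore list. As a rough illustration of that kind of check (this is not codespell's implementation), a minimal sketch might look like the following. The `CORRECTIONS` table, the `IGNORE` set, and the `find_typos` helper are hypothetical names introduced here; the sample entries are taken from typos fixed in this commit.

```python
import re

# Hypothetical correction table; the entries are typos fixed in this commit.
CORRECTIONS = {
    "unneccesary": "unnecessary",
    "maintainence": "maintenance",
    "dependecy": "dependency",
    "comparision": "comparison",
}

# Hypothetical project-specific ignore list: words that should never be flagged.
IGNORE = {"nexb"}

WORD_RE = re.compile(r"[A-Za-z]+")


def find_typos(text, corrections=CORRECTIONS, ignore=IGNORE):
    """Return (line_number, typo, suggestion) tuples for known misspellings."""
    findings = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        for word in WORD_RE.findall(line):
            key = word.lower()
            if key in ignore:
                # Ignored words are skipped even if a correction exists.
                continue
            if key in corrections:
                findings.append((line_number, word, corrections[key]))
    return findings


sample = "Delete sections if unneccesary\nresolve poetry dependecy graphs"
for line_number, typo, suggestion in find_typos(sample):
    print(f"line {line_number}: {typo} -> {suggestion}")
```

The ignore list is checked before the correction table, so a project can keep an intentional spelling even when a general-purpose dictionary would flag it.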

.github/ISSUE_TEMPLATE/--feature-request.md

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@ assignees: ''

 ---

-<!-- Please fill out as much of the template as you can. Delete sections if unneccesary -->
+<!-- Please fill out as much of the template as you can. Delete sections if unnecessary -->
 <!--
 Make Sure you've checked out
 Existing Issues,

.github/PULL_REQUEST_TEMPLATE.md

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-<!-- Delete Template sections if unneccesary -->
+<!-- Delete Template sections if unnecessary -->
 <!-- Add issue number here (We encourage you to create the Issue First) -->
 <!-- You can also link the issue in Commit Messages -->

AUTHORS.rst

Lines changed: 1 addition & 0 deletions
@@ -54,6 +54,7 @@ The following organizations or individuals have contributed to ScanCode:
 - Michael Rupprecht @michaelrup
 - Mike Rombout @mrombout
 - Mrinal Paliwal @mnpw
+- Mrityunjay Raj @mr-raj12
 - nexB Inc. @nexB
 - Nirmal Sarswat @vivonk
 - Nisha Kumar @nishakm

CHANGELOG.rst

Lines changed: 20 additions & 20 deletions
@@ -98,7 +98,7 @@ This is a patch release with bugfixes and improvements::
 improvements by removing unnecessary regex substitutions.
 https://github.com/aboutcode-org/scancode-toolkit/issues/4063

-- Fix misc scancode doumentation related issues
+- Fix misc scancode documentation related issues
 https://github.com/aboutcode-org/scancode-toolkit/pull/4457
 https://github.com/aboutcode-org/scancode-toolkit/pull/4458
 https://github.com/aboutcode-org/scancode-toolkit/pull/4462
@@ -127,7 +127,7 @@ from ``4.0.0`` to ``4.1.0``. The changes in Output Data Structure are:
 directories. This is returned optionally with the ``--info`` plugin.

 - A new resource level attribute ``is_community`` is added, which is
-True from commonly used files used for community/project maintainence.
+True from commonly used files used for community/project maintenance.
 This is returned optionally with the ``--classify`` plugin.

 These are the details for the most important changes introduced::
@@ -340,7 +340,7 @@ v32.2.1 - 2024-07-02
 - Add support for poetry packages, with poetry specific pyproject.toml
 support, poetry.lock and package assembly support. Also add support
 for parsing and storing resolved packages and dependency relationships
-required to statically resolve poetry dependecy graphs.
+required to statically resolve poetry dependency graphs.
 See https://github.com/nexB/scancode-toolkit/issues/2109

 - Add support for pyproject.toml files in python projects.
@@ -378,7 +378,7 @@ v32.2.0 - 2024-06-19
 - Fix a bug in dependency parsing (we were not returning any dependencies)
 - Also support getting dependency versions from workspace
 - Support more attributes from cargo
-- Better handle workspace data thorugh extra_data attribute
+- Better handle workspace data through extra_data attribute
 See https://github.com/nexB/scancode-toolkit/pull/3783

 - We now support parsing the Swift manifest JSON dump and the
@@ -455,7 +455,7 @@ Changes in Output Data Structure:
 - in package ``license_detections`` and ``other_license_detections``
 - ``matches`` for ``license_detections`` everywhere

-- Adds all rule atrribute level info in codebase level ``todo``
+- Adds all rule attribute level info in codebase level ``todo``
 data, to assist in review. This includes length, text, notes,
 referenced_filenames, and the boolean attributes (like
 is_license_notice, is_license_intro etc, as applicable).
@@ -815,7 +815,7 @@ License detection:
 license ``key`` of a matched license expression. We now report instead one
 single match for each matched license expression, and list the license keys
 as a ``licenses`` attribute. This avoids data duplication.
-Inside each match, we list each match and matched rule attributred directly
+Inside each match, we list each match and matched rule attributed directly
 avoiding nesting. See `license updates doc <https://scancode-toolkit.readthedocs.io/en/latest/reference/license-detection-reference.html#licensematch-result-data>`_
 for examples and details.

@@ -841,7 +841,7 @@ License detection:
 - There is an ``--additional-directory`` option with the ``scancode-reindex-licenses``
 command to add the licenses from a directory.

-- There is also a ``--only-builtin`` option to use ony builtin licenses
+- There is also a ``--only-builtin`` option to use only builtin licenses
 ignoring any additional license plugins.

 - See https://github.com/nexB/scancode-toolkit/issues/480 for more details.
@@ -887,7 +887,7 @@ v31.2.6 - 2023-04-25

 This is a minor hotfix release.

-This fix a crash when parsing a .deb Dbeian package filename
+This fix a crash when parsing a .deb Debian package filename
 reported in https://github.com/nexB/scancode-toolkit/issues/3259


@@ -1005,7 +1005,7 @@ Important API changes:
 instances that can be aggregating data from multiple manifests.

 - There is a a new top-level "dependencies" attribute that contains each
-dependency instance, these can be standalone or releated to a package.
+dependency instance, these can be standalone or related to a package.
 These contain a new "extra_data" object.

 - There is a new resource-level attribute "for_packages" which refers to
@@ -1035,9 +1035,9 @@ Important API changes:
 - ``DatafileHandler.assemble()``, ``DatafileHandler.assemble_from_many()``, and
 the other ``.assemble()`` methods from the other Package handlers from
 packagedcode, have been updated to yield Package items before Dependency or
-Resource items. This is particulary important in the case where we are calling
+Resource items. This is particularly important in the case where we are calling
 the ``assemble()`` method outside of the scancode-toolkit context, where we
-need to ensure that a Package exists before we assocate a Resource or
+need to ensure that a Package exists before we associate a Resource or
 Dependency to it.

 Copyright detection:
@@ -1373,7 +1373,7 @@ v30.0.0 - 2021-09-23
 This is a major release with new features, and several bug fixes and
 improvements including major updates to the license detection.

-We have droped using calendar-based versions and are now switched back to semver
+We have dropped using calendar-based versions and are now switched back to semver
 versioning. To ensure that there is no ambiguity, the new major version has been
 updated from 21 to 30. The primary reason is that calver was not helping
 integrators to track major version changes like semver does.
@@ -1940,7 +1940,7 @@ v3.2.0rc1 (2020-09-08)
 - Improve license detection #1999 - Bryan Sutula
 - Correct CC0 license #1984 - Carmen Bianca Bakker
 - Add documentation for the usage of `cpp_includes` plugin - Chin Yeung Li
-- Improve andling of npm package-lock.json #1993 - Chin Yeung Li
+- Improve handling of npm package-lock.json #1993 - Chin Yeung Li
 - Add new license detection rules - Gaupeng
 - Improve documentation - Issei Horie
 - Improve consolidation plugin - Jono Yang @JonoYang
@@ -2027,7 +2027,7 @@ Other features and fixes:
 v3.1.0 (2019-08-12)
 -------------------

-- Add partial suport for Python 3.6+ #295 @Abhishek-Dev09
+- Add partial support for Python 3.6+ #295 @Abhishek-Dev09
 - Add plugin to collect dwarf references #1167 @licodeli
 - Add fingerprint plugin #1651 @arnav-mandal1234
 - Add summary and consolidation plugin #1673
@@ -2208,8 +2208,8 @@ v2.9.6 (2018-10-25)

 - Add declared license normalization #1092
 - Add new and improved license rules
-- Add mising and clean up ABOUT files for all embedded third-party libraries
-- Improve npm package.json handling (better keuword support)
+- Add missing and clean up ABOUT files for all embedded third-party libraries
+- Improve npm package.json handling (better keyword support)
 - Update thirdparty libraries #1224

 Credits: Many thanks to everyone that contributed to this release with code and bug reports
@@ -2288,7 +2288,7 @@ Misc:
 - Add facet, classification and summarizer plugins #357
 - Fix file counts #1055
 - Fix corrupted license cache error
-- Upgrade all thridparty libraries #1070
+- Upgrade all third party libraries #1070
 - De-vendor prebuilt binaries to ease packaging for Linux distros #469

 Credits: Many thanks to everyone that contributed to this release with code and bug reports
@@ -2429,8 +2429,8 @@ Licenses:
 words compared to the number of words in the matched rule.
 - The license cache is not checked anymore for consistency once created which
 improved startup times. (unless you are using a Git checkout and you are
-developping with a SCANCODE_DEV_MODE tag file present)
-- License catagory names have been improved
+developing with a SCANCODE_DEV_MODE tag file present)
+- License category names have been improved

 Copyrights:
 - Copyright detection in binary files has been improved
@@ -2453,7 +2453,7 @@ Misc:
 - Move essential configuration to a scancode_config.py module
 - Only read a few pages from PDF files by default
 - Improve handling of files with weird characters in their names on all OSses
-- Improve detection of archive vs. comrpessed files
+- Improve detection of archive vs. compressed files
 - Make all copyright tests data driven using YAML files like for license tests


README.rst

Lines changed: 3 additions & 3 deletions
@@ -31,7 +31,7 @@ Build and tests status
 ======================

 We run 30,000+ tests on each commit on multiple CIs to ensure a good platform
-compabitility with multiple versions of Windows, Linux and macOS.
+compatibility with multiple versions of Windows, Linux and macOS.

 +------------+--------------+-------------------------+----------------------------+
 | **Azure** | **RTD Build**| **GitHub actions Docs** | **GitHub actions Release** |
@@ -64,7 +64,7 @@ Benefits of ScanCode

 - ScanCode detects licenses, copyrights, package manifests, direct dependencies,
 and more both in **source code** and **binary** files and is considered as the
-best-in-class and reference tool in this domain, re-used as the core tools for
+best-in-class and reference tool in this domain, reused as the core tools for
 software composition data collection by several open source tools.

 - ScanCode provides the **most accurate license detection engine** and does a
@@ -92,7 +92,7 @@ If you have a specific problem, suggestion or bug, please submit a

 For quick questions or socializing, join the AboutCode community discussions on `Slack <https://join.slack.com/t/aboutcode-org/shared_invite/zt-3li3bfs78-mmtKG0Qhv~G2dSlNCZW2pA>`_.

-Interested in commercial suppport? Contact the `AboutCode team <mailto:hello@aboutcode.org>`_.
+Interested in commercial support? Contact the `AboutCode team <mailto:hello@aboutcode.org>`_.

 License
 =======

ROADMAP-ABOUTCODE.rst

Lines changed: 3 additions & 3 deletions
@@ -94,7 +94,7 @@ License detection quality improvements
 Improve package detection
 ~~~~~~~~~~~~~~~~~~~~~~~~~~

-- Create synthethic, private packages from non-packaged files based on license and copyright
+- Create synthetic, private packages from non-packaged files based on license and copyright
 - Create simplified purl-only lightweight package detection
 - Evolve model for dependencies towards requirements and true dependencies
 - Track private non-published packages
@@ -142,7 +142,7 @@ PURLDB: PurlDB
 - MatchCode matching engine

 - embed a SCIO with a matching pipeline for match a whole codebase at once
-- expore new endpoint for matching whole codebase
+- explore new endpoint for matching whole codebase
 - support multiple SCIO workers for indexing
 - implement proper ranking of matched code results
 - refactor directory matching to be a pre-matching step to file matching
@@ -162,7 +162,7 @@ VCIO: VulnerableCode.io
 PURL: purl and vers specs
 --------------------------

-- Merge and advertize vers spec.
+- Merge and advertise vers spec.
 - Standardize purl with ECMA


docs/source/explanation/scancode-license-detection-updates.rst

Lines changed: 3 additions & 3 deletions
@@ -381,7 +381,7 @@ report here). We are now just reporting a flat mapping here, and all the
 rule details are also not present in the license match, and only available
 as an optional reference.

-See this before/after comparision to see how the license data in results has
+See this before/after comparison to see how the license data in results has
 evolved.

 Before::
@@ -536,12 +536,12 @@ This is now default with the CLI option ``--license``, which references from
 the match License-level Data and LicenseDB-level Data, and removes the actual data from
 the matches, and adds them to two top-level lists.

-Comparision: Before/After license references
+Comparison: Before/After license references
 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

 To compare how the license output data changes between when license references are not collected
 vs when they are collected (which is default from version 32.x), check out the before/after
-comparision below.
+comparison below.

 Before::

docs/source/getting-started/contribute/contributing-code.rst

Lines changed: 6 additions & 6 deletions
@@ -8,7 +8,7 @@ TL;DR:
 - Contributions comes as bugs/questions/issues and as pull requests.
 - Source code and runtime data are in the /src/ directory.
 - Test code and test data are in the /tests/ directory.
-- Datasets (inluding licenses) and test data are in /data/ sub-directories.
+- Datasets (including licenses) and test data are in /data/ sub-directories.
 - We use DCO signoff in commit messages, like Linux does.
 - Porting ScanCode to other OS (FreeBSD is supported, etc.) is possible. Enter an issue for help.

@@ -62,7 +62,7 @@ Running tests
 -------------

 ScanCode comes with over 29,000 unit tests to ensure detection accuracy and
-stability across Linux, Windows and macOS OSes: we kinda love tests, do we?
+stability across Linux, Windows and macOS OSes: we kind of love tests, do we?

 We use pytest to run the tests: call the ``pytest`` script to run the whole
 test suite. This is installed with the ``pytest`` package which is installed
@@ -99,7 +99,7 @@ Another useful option after a test run with some failures is to re-run only the
 failed tests with the ``--lf`` option, for instance: ``pytest -vvs --lf`` would
 only run only test functions that failed in the previous run.

-Because we have a lot of tests (over 29,000), we organized theses in test suites
+Because we have a lot of tests (over 29,000), we organized these in test suites
 using pytest markers that are defined in the ``conftest.py`` pytest plugin.
 These are enabled by adding a ``--test-suite`` option to the pytest command.

@@ -109,8 +109,8 @@ These are enabled by adding a ``--test-suite`` option to the pytest command.
 extensive data-driven and data validations (for package, copyright and license
 detection)

-In some cases we need to regenerate test data when expected behavious/result data
-structures change, and we have an environement variable to regenerate test data.
+In some cases we need to regenerate test data when expected behaviour/result data
+structures change, and we have an environment variable to regenerate test data.
 `SCANCODE_REGEN_TEST_FIXTURES` is present in `scancode_config` and this can be
 set to regenerate test data for specific tests like this:

@@ -124,7 +124,7 @@ If test data is regenerated, it is important to review the diff for test files a
 carefully go through all of it to make sure there are no unintended changes there,
 and then commit all the regenerated test data.

-To help debug in scancode, we use logging. There are different environement variables
+To help debug in scancode, we use logging. There are different environment variables
 you need to set to turn on logging. In packagedcode::

 ``SCANCODE_DEBUG_PACKAGE=yes pytest -vvs tests/packagedcode/ --lf``

docs/source/getting-started/contribute/contributing-docs.rst

Lines changed: 3 additions & 3 deletions
@@ -128,7 +128,7 @@ AboutCode documentation uses
 `Intersphinx <https://www.sphinx-doc.org/en/master/usage/extensions/intersphinx.html>`_
 to link to other Sphinx Documentations, to maintain links to other Aboutcode Projects.

-To link sections in the same documentation, standart reST labels are used. Refer
+To link sections in the same documentation, standard reST labels are used. Refer
 `Cross-Referencing <https://www.sphinx-doc.org/en/master/usage/restructuredtext/roles.html>`_
 for more information.

@@ -143,7 +143,7 @@ For example::

 It refers to the section itself, see :ref:`my-reference-label`.

-Now, using Intersphinx, you can create these labels in one Sphinx Documentation and then referance
+Now, using Intersphinx, you can create these labels in one Sphinx Documentation and then reference
 these labels from another Sphinx Documentation, hosted in different locations.

 You just have to add the following in the ``conf.py`` file for your Sphinx Documentation, where you
@@ -183,7 +183,7 @@ For more information, refer this tutorial named

 .. _contributing-docs-style-conventions:

-Style Conventions for the Documentaion
+Style Conventions for the Documentation
 --------------------------------------

 1. Headings

docs/source/getting-started/contribute/roadmap.rst

Lines changed: 2 additions & 2 deletions
@@ -65,7 +65,7 @@ Core features
 ^^^^^^^^^^^^^

 - |white_check_mark| pre scan filtering (ignore binaries, etc)
-- |white_check_mark| pre/post/ouput plugins! (worked as part of the GSoC by @yadsharaf )
+- |white_check_mark| pre/post/output plugins! (worked as part of the GSoC by @yadsharaf )
 - |white_check_mark| scan plugins (e.g. plugins that run a scan to collect data)
 - |white_check_mark| support Python 3 #295
 - |clock1030| transparent archive extraction (as opposed to on-demand with extractcode)
@@ -110,7 +110,7 @@ Packaging

 - |white_large_square| simpler installation, automated installer
 - |white_check_mark| distro-friendly packaging
-- |clock1030| unbundle and package as multiple libaries (commoncode, extractcode, etc)
+- |clock1030| unbundle and package as multiple libraries (commoncode, extractcode, etc)

 Documentation
 ^^^^^^^^^^^^^
