Add download archiving system with LocalFilesystem provider#1815
Closed
VarshaUN wants to merge 20 commits into
Closed
Add download archiving system with LocalFilesystem provider#1815VarshaUN wants to merge 20 commits into
VarshaUN wants to merge 20 commits into
Conversation
…iders Signed-off-by: Varsha U N <varshaun58@gmail.com>
Member
|
@VarshaUN thanks for the PR, you need to address a few overall issues before we can start reviewing the code in more details, see comments below for these:
Couple of issues with general direction of the PR as discussed in #1685 (comment):
Presently input archives are downloaded with scancode.io/scanpipe/pipelines/__init__.py Line 130 in 9d41ad3 /input/ directory for each project as specified in scancode.io/scanpipe/models.py Line 573 in 9d41ad3 |
Signed-off-by: Varsha U N <varshaun58@gmail.com>
Signed-off-by: Varsha U N <varshaun58@gmail.com>
Signed-off-by: Varsha U N <varshaun58@gmail.com>
Signed-off-by: Varsha U N <varshaun58@gmail.com>
This reverts commit 87c81bd.
This reverts commit cd04f3f1062f3ac8c78af3a7b0ed042633f5b375.
This reverts commit b6d2342873168e53865e8f39185a9602de191b7f.
This reverts commit ca2f49f505bd5c951b5f270d4b218a69848a6de9.
Signed-off-by: Varsha U N <varshaun58@gmail.com>
Signed-off-by: Varsha U N <varshaun58@gmail.com>
Signed-off-by: Varsha U N <varshaun58@gmail.com>
Signed-off-by: Varsha U N <varshaun58@gmail.com>
Signed-off-by: Varsha U N <varshaun58@gmail.com>
Signed-off-by: Varsha U N <varshaun58@gmail.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
As per discussed with mentors , I have added the following ,
archiving.py
Added DownloadStore abstract base class and implementations (LocalFilesystemProvider, S3LikeProvider, SftpProvider) for storing downloads with SHA256-based deduplication and metadata.
settings.py
Initialized download_store based on DOWNLOAD_ARCHIVING_PROVIDER (localstorage, s3, sftp) with configuration validation and error logging.
input.py
Added add_input_from_url and add_input_from_upload to archive URL downloads and uploaded files using download_store, with fallback to project input directory when archiving is disabled. Integrate with InputSource model for metadata storage.
Enhances input handling for pipelines, supporting deduplicated storage and retrieval of inputs across local, S3, and SFTP backends.
Still in progress ,
Signed-off-by : Varsha U N varshaun58@gmail.com