Skip to content

Commit 0b390a3

Browse files
Testing metadata bot, editing README, no pitfalls addressed
1 parent eafadc5 commit 0b390a3

1 file changed

Lines changed: 21 additions & 95 deletions

File tree

README.md

Lines changed: 21 additions & 95 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
[![DOI](https://zenodo.org/badge/878193479.svg)](https://doi.org/10.5281/zenodo.14803005)
22

3-
43
> [!WARNING]
54
> This is still in progress and subject to change, as additional requirements may arise.
5+
66
# Metadata-Adoption-Quantify
77

88
This software extracts relevant data from SOMEF (Software Metadata Extraction Framework) results to answer specific research questions. The extracted insights are returned as structured JSON files, allowing easy integration and analysis.
@@ -32,6 +32,7 @@ This software is tailored to answer the following research questions:
3232
5. RQ5: What are the most common citation practices among the communities?
3333

3434
## Features
35+
3536
- Extract metadata from repositories using SOMEF.
3637
- Extracts and processes metadata from SOMEF results.
3738
- Filters information relevant to specific research questions.
@@ -68,127 +69,51 @@ The tool is accessible via the `quantify` command. All commands should be prefix
6869
### Available Commands
6970

7071
#### 1. Run SoMEF on Repositories
72+
7173
Extracts metadata from a list of GitHub repositories provided in a JSON file.
74+
7275
```bash
7376
poetry run quantify somef --input repos.json --output-dir somef_outputs --threshold 0.8
7477
```
7578

7679
#### 2. Run RQ Analysis
80+
7781
Analyzes SoMEF outputs to answer specific research questions.
82+
7883
```bash
7984
poetry run quantify rqs --somef-dir somef_outputs --input-repos repos.json --output-dir rq_results --cluster default
8085
```
81-
*Note: `--input-repos` is required for RQ2 analysis.*
86+
87+
_Note: `--input-repos` is required for RQ2 analysis._
8288

8389
The input (in the case of the example that would be `repos.json`) should be a JSON file with the following format:
90+
8491
```json
8592
[
86-
{
87-
"github_url": "https://github.com/foo/bar"
88-
},
89-
{
90-
"github_url": "https://github.com/dgarijo/Widoco/"
91-
}
93+
{
94+
"github_url": "https://github.com/foo/bar"
95+
},
96+
{
97+
"github_url": "https://github.com/dgarijo/Widoco/"
98+
}
9299
]
93100
```
101+
94102
#### 3. Calculate Final Results
103+
95104
Calculates the final percentages and insights for each RQ.
105+
96106
```bash
97107
poetry run quantify calculate --input repos.json --rq-results-dir rq_results --results-dir final_results --cluster default
98108
```
99109

100110
### Main Menu (Alternative)
111+
101112
You can still access help for any command by running:
113+
102114
```bash
103115
poetry run quantify --help
104116
```
105-
## Output
106-
After running the `rqs` command for RQ1, you get a result like this:
107-
```bash
108-
{
109-
"citation.cff": {
110-
"count": 1
111-
},
112-
"readme_url": {
113-
"count": 17
114-
},
115-
"package": {
116-
"count": 7,
117-
"files": [
118-
"temp_analysis/escape2020/gammapy_gammapy/gammapy-1.2/setup.cfg",
119-
"temp_analysis/escape2020/cosimoNigro_agnpy/agnpy-master/setup.py",
120-
"temp_analysis/escape2020/IndexedConv_IndexedConv/IndexedConv-1.3.2/setup.py",
121-
"temp_analysis/escape2020/cds-astro_cds-moc-rust/cds-moc-rust-main/Cargo.toml",
122-
"temp_analysis/escape2020/cds-astro_mocpy/mocpy-0.15.0/Cargo.toml",
123-
"temp_analysis/escape2020/cds-astro_aladin-lite/aladin-lite-3.3.2/package.json",
124-
"temp_analysis/escape2020/escape2020_school2022/school2022-1.0/docs/themes/dream/package.json"
125-
]
126-
},
127-
"authors": {
128-
"count": 2,
129-
"files": [
130-
"temp_analysis/escape2020/R3BRootGroup_R3BRoot/R3BRoot-jun24/AUTHORS",
131-
"temp_analysis/escape2020/FairRootGroup_FairMQ/FairMQ-master/AUTHORS"
132-
]
133-
},
134-
"contributors": {
135-
"count": 3,
136-
"files": [
137-
"output_4.json",
138-
"output_11.json",
139-
"output_5.json"
140-
]
141-
},
142-
"license": {
143-
"count": 17
144-
},
145-
"codemeta.json": {
146-
"count": 15,
147-
"files": [
148-
"temp_analysis/escape2020/gammapy_gammapy/gammapy-1.2/codemeta.json",
149-
"temp_analysis/escape2020/cosimoNigro_agnpy/agnpy-master/codemeta.json",
150-
"temp_analysis/escape2020/IndexedConv_IndexedConv/IndexedConv-1.3.2/codemeta.json",
151-
"temp_analysis/escape2020/cds-astro_cds-moc-rust/cds-moc-rust-main/codemeta.json",
152-
"temp_analysis/escape2020/aardk_jupyter-casa/jupyter-casa-master/codemeta.json",
153-
"temp_analysis/escape2020/cds-astro_tutorials/tutorials-1.0.3/codemeta.json",
154-
"temp_analysis/escape2020/R3BRootGroup_R3BRoot/R3BRoot-jun24/codemeta.json",
155-
"temp_analysis/escape2020/AMIGA-IAA_hcg-16/hcg-16-1.2.3/codemeta.json",
156-
"temp_analysis/escape2020/repo/eossr-master/codemeta.json",
157-
"temp_analysis/escape2020/JColl88_sdc1-solution-binder/sdc1-solution-binder-1.0.0/codemeta.json",
158-
"temp_analysis/escape2020/explore-platform_g-tomo/g-tomo-2/codemeta.json",
159-
"temp_analysis/escape2020/cds-astro_mocpy/mocpy-0.15.0/codemeta.json",
160-
"temp_analysis/escape2020/javierrico_gLike/gLike-master/codemeta.json",
161-
"temp_analysis/escape2020/cds-astro_aladin-lite/aladin-lite-3.3.2/codemeta.json",
162-
"temp_analysis/escape2020/FairRootGroup_FairMQ/FairMQ-master/codemeta.json"
163-
]
164-
},
165-
"zenodo.json": {
166-
"count": 5,
167-
"files": [
168-
"temp_analysis/escape2020/cosimoNigro_agnpy/agnpy-master/.zenodo.json",
169-
"temp_analysis/escape2020/cds-astro_cds-moc-rust/cds-moc-rust-main/.zenodo.json",
170-
"temp_analysis/escape2020/aardk_jupyter-casa/jupyter-casa-master/.zenodo.json",
171-
"temp_analysis/escape2020/javierrico_gLike/gLike-master/.zenodo.json",
172-
"temp_analysis/escape2020/FairRootGroup_FairMQ/FairMQ-master/.zenodo.json"
173-
]
174-
},
175-
"identifier_extract": {
176-
"count": 7,
177-
"extracted_values": [
178-
"10.5281/zenodo.3967385",
179-
"10.5281/zenodo.7544514",
180-
"https://zenodo.org/badge/latestdoi/224865065",
181-
"10.5281/zenodo.1689985",
182-
"10.5281/zenodo.3967385",
183-
"10.5281/zenodo.10405177",
184-
"10.5281/zenodo.4055175"
185-
]
186-
},
187-
"None": {
188-
"count": 0
189-
}
190-
}
191-
```
192117

193118
After running the `calculate` command for a cluster, you get the final summary:
194119

@@ -210,6 +135,7 @@ After running the `calculate` command for a cluster, you get the final summary:
210135
```
211136

212137
## Ackowlegement:
138+
213139
The authors acknowledge the OSCARS project, which has received funding from the European Commission's Horizon Europe Research and Innovation programme under grant agreement No. 101129751
214140

215141
<img src="logo.png" alt="logo"/>

0 commit comments

Comments
 (0)