Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
BioFlow-Insight-Study
Manage
Activity
Members
Labels
Plan
Issues
0
Issue boards
Milestones
Wiki
Code
Merge requests
0
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package Registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
shareFAIR
BioFlow-Insight-Study
Commits
3e04f662
Commit
3e04f662
authored
1 year ago
by
George Marchment
Browse files
Options
Downloads
Patches
Plain Diff
update
parent
4e97b08c
No related branches found
No related tags found
No related merge requests found
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
README.md
+2
-0
2 additions, 0 deletions
README.md
github-crawler
+1
-1
1 addition, 1 deletion
github-crawler
study.ipynb
+280
-0
280 additions, 0 deletions
study.ipynb
with
283 additions
and
1 deletion
README.md
+
2
−
0
View file @
3e04f662
...
@@ -2,6 +2,8 @@
...
@@ -2,6 +2,8 @@
`git clone --recurse-submodules https://gitlab.liris.cnrs.fr/sharefair/bioflow-insight-study.git`
`git clone --recurse-submodules https://gitlab.liris.cnrs.fr/sharefair/bioflow-insight-study.git`
`git submodule update --recursive --remote`
tets
tets
## Getting started
## Getting started
...
...
This diff is collapsed.
Click to expand it.
github-crawler
@
bf632200
Compare
53754093
...
bf632200
Subproject commit
537540936dd0a3fc07e5f0143d74fa7cf93735bb
Subproject commit
bf6322002948c7b3df3657e0bd0e5fd82e7c8bd5
This diff is collapsed.
Click to expand it.
study.ipynb
0 → 100644
+
280
−
0
View file @
3e04f662
{
"cells": [
{
"cell_type": "code",
"execution_count": 1,
"metadata": {},
"outputs": [],
"source": [
"import os\n",
"import json"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"from github_crawler.download_corpus import"
]
},
{
"cell_type": "code",
"execution_count": 7,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"* 1/752\n",
"Downloading...'HadrienG/nextflow_rna'\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"mkdir: opérande manquant\n",
"Saisissez « mkdir --help » pour plus d'informations.\n",
"Clonage dans '../../../Workflow-Corpus-Open-License/HadrienG/nextflow_rna'...\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\n",
"* 2/752\n",
"Downloading...'IARCbioinfo/CODEX-nf'\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"mkdir: opérande manquant\n",
"Saisissez « mkdir --help » pour plus d'informations.\n",
"Clonage dans '../../../Workflow-Corpus-Open-License/IARCbioinfo/CODEX-nf'...\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\n",
"* 3/752\n",
"Downloading...'AndersenLab/ril-nf'\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"mkdir: opérande manquant\n",
"Saisissez « mkdir --help » pour plus d'informations.\n",
"Clonage dans '../../../Workflow-Corpus-Open-License/AndersenLab/ril-nf'...\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\n",
"* 4/752\n",
"Downloading...'robsyme/markerdev'\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"mkdir: opérande manquant\n",
"Saisissez « mkdir --help » pour plus d'informations.\n",
"Clonage dans '../../../Workflow-Corpus-Open-License/robsyme/markerdev'...\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\n",
"* 5/752\n",
"\n",
"\n",
"\n",
"* 6/752\n",
"Downloading...'ctmrbio/BACTpipe'\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"mkdir: opérande manquant\n",
"Saisissez « mkdir --help » pour plus d'informations.\n",
"Clonage dans '../../../Workflow-Corpus-Open-License/ctmrbio/BACTpipe'...\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\n",
"* 7/752\n",
"Downloading...'danielecook/nextflow-tutorial'\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"mkdir: opérande manquant\n",
"Saisissez « mkdir --help » pour plus d'informations.\n",
"Clonage dans '../../../Workflow-Corpus-Open-License/danielecook/nextflow-tutorial'...\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\n",
"* 8/752\n",
"Downloading...'IARCbioinfo/bamsurgeon-nf'\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"mkdir: opérande manquant\n",
"Saisissez « mkdir --help » pour plus d'informations.\n",
"Clonage dans '../../../Workflow-Corpus-Open-License/IARCbioinfo/bamsurgeon-nf'...\n"
]
},
{
"name": "stdout",
"output_type": "stream",
"text": [
"\n",
"\n",
"\n",
"* 9/752\n",
"Downloading...'nextflow-io/cwl2nxf'\n"
]
},
{
"name": "stderr",
"output_type": "stream",
"text": [
"mkdir: opérande manquant\n",
"Saisissez « mkdir --help » pour plus d'informations.\n",
"Clonage dans '../../../Workflow-Corpus-Open-License/nextflow-io/cwl2nxf'...\n"
]
}
],
"source": [
"\n",
"current_path= os.getcwd()\n",
"os.chdir(\"github-crawler/\")\n",
"from download_corpus import *\n",
"os.chdir(current_path)\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"download_files = True\n",
"repo_to_download = \"../Workflow-Corpus-Open-Licence/\""
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {},
"outputs": [],
"source": [
"#Read crawler results\n",
"with open(\"./github-crawler/wf_crawl_nextflow.json\") as json_file:\n",
" crawler = json.load(json_file)\n",
"\n",
"_ = crawler.pop(\"last_date\")"
]
},
{
"cell_type": "code",
"execution_count": 2,
"metadata": {},
"outputs": [],
"source": [
"if(download_files):\n",
" current_directory = os.getcwd()\n",
" os.chdir(current_directory+'/'+repo_to_download)"
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"'/home/gmarchment/Documents/These/Work/Workflow-Corpus-Open-Licence'"
]
},
"execution_count": 4,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"os.getcwd()"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.12"
}
},
"nbformat": 4,
"nbformat_minor": 2
}
%% Cell type:code id: tags:
```
python
import
os
import
json
```
%% Cell type:code id: tags:
```
python
from
github_crawler.download_corpus
import
```
%% Cell type:code id: tags:
```
python
current_path
=
os
.
getcwd
()
os
.
chdir
(
"
github-crawler/
"
)
from
download_corpus
import
*
os
.
chdir
(
current_path
)
```
%% Output
* 1/752
Downloading...'HadrienG/nextflow_rna'
mkdir: opérande manquant
Saisissez « mkdir --help » pour plus d'informations.
Clonage dans '../../../Workflow-Corpus-Open-License/HadrienG/nextflow_rna'...
* 2/752
Downloading...'IARCbioinfo/CODEX-nf'
mkdir: opérande manquant
Saisissez « mkdir --help » pour plus d'informations.
Clonage dans '../../../Workflow-Corpus-Open-License/IARCbioinfo/CODEX-nf'...
* 3/752
Downloading...'AndersenLab/ril-nf'
mkdir: opérande manquant
Saisissez « mkdir --help » pour plus d'informations.
Clonage dans '../../../Workflow-Corpus-Open-License/AndersenLab/ril-nf'...
* 4/752
Downloading...'robsyme/markerdev'
mkdir: opérande manquant
Saisissez « mkdir --help » pour plus d'informations.
Clonage dans '../../../Workflow-Corpus-Open-License/robsyme/markerdev'...
* 5/752
* 6/752
Downloading...'ctmrbio/BACTpipe'
mkdir: opérande manquant
Saisissez « mkdir --help » pour plus d'informations.
Clonage dans '../../../Workflow-Corpus-Open-License/ctmrbio/BACTpipe'...
* 7/752
Downloading...'danielecook/nextflow-tutorial'
mkdir: opérande manquant
Saisissez « mkdir --help » pour plus d'informations.
Clonage dans '../../../Workflow-Corpus-Open-License/danielecook/nextflow-tutorial'...
* 8/752
Downloading...'IARCbioinfo/bamsurgeon-nf'
mkdir: opérande manquant
Saisissez « mkdir --help » pour plus d'informations.
Clonage dans '../../../Workflow-Corpus-Open-License/IARCbioinfo/bamsurgeon-nf'...
* 9/752
Downloading...'nextflow-io/cwl2nxf'
mkdir: opérande manquant
Saisissez « mkdir --help » pour plus d'informations.
Clonage dans '../../../Workflow-Corpus-Open-License/nextflow-io/cwl2nxf'...
%% Cell type:code id: tags:
```
python
download_files
=
True
repo_to_download
=
"
../Workflow-Corpus-Open-Licence/
"
```
%% Cell type:code id: tags:
```
python
#Read crawler results
with
open
(
"
./github-crawler/wf_crawl_nextflow.json
"
)
as
json_file
:
crawler
=
json
.
load
(
json_file
)
_
=
crawler
.
pop
(
"
last_date
"
)
```
%% Cell type:code id: tags:
```
python
if
(
download_files
):
current_directory
=
os
.
getcwd
()
os
.
chdir
(
current_directory
+
'
/
'
+
repo_to_download
)
```
%% Cell type:code id: tags:
```
python
os
.
getcwd
()
```
%% Output
'/home/gmarchment/Documents/These/Work/Workflow-Corpus-Open-Licence'
%% Cell type:code id: tags:
```
python
```
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment