Skip to content
Snippets Groups Projects
Commit 352ebe5c authored by Alice Brenon's avatar Alice Brenon
Browse files

Duplicate extraction script for EDdA; fix shebang because scripts actually uses features from bash

parent 3e1224b8
No related merge requests found
#!/bin/bash
INPUT_METADATA="${1}"
SOURCE_TEXT_ARTICLES="${2}"
OUTPUT="${3}"
if [ -d "${OUTPUT}" ]
then
N=1
while [ -d "${OUTPUT}.${N}" ]
do
N=$((N+1))
done
mv "${OUTPUT}" "${OUTPUT}.${N}"
fi
WORKDIR=$(mktemp -d /tmp/parallel-EDdA.XXX)
for T in {1..17}
do
mkdir -p "${WORKDIR}/T${T}"
done
while read LINE
do
LINE="${LINE#*,}"
LINE="${LINE#*,}"
LINE="${LINE#*,}"
LINE="${LINE#*,}"
T="${LINE%%,*}"
LINE="${LINE#*,}"
RANK="${LINE%%,*}"
cp "${SOURCE_TEXT_ARTICLES}/T${T}/article${RANK}."* "${WORKDIR}/T${T}"
done < <(tail -n +2 ${INPUT_METADATA})
mv ${WORKDIR} ${OUTPUT}
#!/bin/sh
#!/bin/bash
INPUT_METADATA="${1}"
SOURCE_TEXT_ARTICLES="${2}"
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment