Skip to content

e2e

spacesavers2_e2e

This is a wrapper to run all the spacesavers2 commands in the correct order. It automatically:

  • adds appropriate prefixes to output files (including time)
  • catalogs files in the folder provided
  • finds duplicates
  • finds high-value duplicates
  • creates blamematrix

This is ideal wrapper to be added as a cronjob.

 % spacesavers2_e2e --help
usage: spacesavers2_e2e [-h] -i INFOLDER [-p THREADS] [-q QUOTA] -o OUTFOLDER

End-to-end run of spacesavers2

options:
  -h, --help            show this help message and exit
  -f FOLDER, --folder FOLDER
                        Folder to run spacesavers_catalog on.
  -p THREADS, --threads THREADS
                        number of threads to use
  -d MAXDEPTH, --maxdepth MAXDEPTH
                        maxdepth for mimeo
  -l LIMIT, --limit LIMIT
                        limit for running spacesavers_grubbers
  -q QUOTA, --quota QUOTA
                        total size of the volume (default = 200 for /data/CCBR)
  -o OUTFOLDER, --outfolder OUTFOLDER
                        Folder where all spacesavers_e2e output files will be saved