Skip to content

parkit Documentation

parkit is a CLI toolkit for archiving CCBR project data to HPC-DME (/CCBR_Archive/GRIDFTP/...).

For most users, the recommended interface is projark, which provides guided deposit and retrieve workflows for entire CCBR project folder(s).

In This Version (v3.0.1)

  • New Python-native projark command with structured subcommands.
  • projark deposit for project archival with sync/host/session preflight checks.
  • projark retrieve with selective file retrieval or full-collection retrieval.
  • --unsplit support for merging downloaded split tar parts.
  • Archived legacy bash projark workflow.

Quick Start

projark --version
projark deposit --help
projark retrieve --help
  1. Complete environment setup from Getting Started.
  2. Use projark Deposit (Recommended) to archive data.
  3. Use projark Retrieve when you need data back.

Notes

  • projark is intended for Helix (helix.nih.gov).
  • All runs should be executed in tmux, screen, or an Open OnDemand graphical session.
  • Disclaimer: Open OnDemand is currently available only on Biowulf compute nodes, not directly on Helix. Since projark is Helix-only today, use tmux/screen on Helix; Open OnDemand support is future-facing until Helix access is available.
  • projark logs include ISO 8601 timestamps.
  • projark sends completion/failure email to $USER@nih.gov from NCICCBR@mail.nih.gov.
  • Docs are versioned; this set describes v3.0.1 behavior.