Bug #755

Add script for unpacking packaged and compressed files

Added by Evelyn McLellan over 10 years ago. Updated over 7 years ago.

Status:VerifiedStart date:
Priority:HighDue date:
Assignee:Joseph Perry% Done:


Target version:Release 0.6
Google Code Legacy ID:archivematica-100 Pull Request:
Sponsored: Requires documentation:


Script needs to run before quarantine in order for packaged files to be

[g] Legacy categories: Ingest


#1 Updated by Evelyn McLellan over 10 years ago

Actually the script needs to run before the sanitizing script is run in order for any
irregular filenames to be sanitized.

#2 Updated by Austin Trask over 10 years ago


easy-extract is a python script that recursively scans directories and unpacks a
number of different archive formats, including: .RAR, .ARJ, .CAB, .CHM, .CPIO, .DMG,

This could be integrated to run on directories prior to sanitizing

#3 Updated by Joseph Perry over 10 years ago

  • Status changed from New to Verified

Created include based on easy-extract from the python repository, to extract various
archives (see comment 2 by austin).
The original easy extract extracted everything to the working directory.
I modified it to create directories for each of the archives it extracts.
The directory name is based on the archive with the UTC clock time appended to
ensure a unique file name when extracting.

Also available in: Atom PDF