Bug #755

Add script for unpacking packaged and compressed files

Added by Evelyn McLellan over 10 years ago. Updated over 7 years ago.

Status: Verified
Start date:
Priority: High
Due date:
Assignee: Joseph Perry
% Done: 0%
Category: -
Target version: Release 0.6
Google Code Legacy ID: archivematica-100
Pull Request:
Sponsored:
Requires documentation:

Description

The script needs to run before quarantine so that the contents of packaged files
are virus-checked.

Legacy categories: Ingest

History

#1 Updated by Evelyn McLellan over 10 years ago

Actually, the script needs to run before the sanitizing script so that any
irregular filenames in the unpacked files are sanitized.

#2 Updated by Austin Trask over 10 years ago

http://pypi.python.org/pypi/easy-extract/0.1.0

easy-extract is a Python script that recursively scans directories and unpacks a
number of different archive formats, including: .RAR, .ARJ, .CAB, .CHM, .CPIO, .DMG,
.HFS, .LZH, .LZMA, .NSIS, .UDF, .WIM, .XAR, .Z, .ZIP, .GZIP, .TAR, .XTM.

This could be integrated to run on directories prior to sanitizing.
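
As a rough illustration of the recursive scan described above, here is a minimal sketch using only the Python standard library. It is not easy-extract's code: the function name `find_archives` and the short `ARCHIVE_EXTENSIONS` tuple are hypothetical, and matching on file extension alone is an assumption made for brevity.

```python
import os

# Hypothetical subset of extensions for illustration; the list of formats
# easy-extract handles is longer (see above).
ARCHIVE_EXTENSIONS = (".zip", ".tar", ".gz", ".rar")


def find_archives(root, extensions=ARCHIVE_EXTENSIONS):
    """Return the sorted paths of files under root that look like archives."""
    suffixes = tuple(ext.lower() for ext in extensions)
    found = []
    # os.walk visits root and every subdirectory below it.
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            # Compare case-insensitively so "B.TAR" matches ".tar".
            if name.lower().endswith(suffixes):
                found.append(os.path.join(dirpath, name))
    return sorted(found)
```

Each returned path could then be handed to whatever unpacking step runs before sanitizing.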

#3 Updated by Joseph Perry over 10 years ago

  • Status changed from New to Verified

Created an include based on easy-extract from the Python repository, to extract
various archives (see comment #2 by Austin).
The original easy-extract extracted everything to the working directory.
I modified it to create a separate directory for each archive it extracts.
The directory name is based on the archive name, with the UTC clock time appended
to ensure a unique directory name when extracting.
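
The per-archive directory naming could look roughly like the sketch below. This is not the modified easy-extract code: it swaps in `shutil.unpack_archive` from the standard library (which only handles zip and tar variants), and the function name `extract_to_unique_dir`, the `when` parameter, and the exact timestamp format are assumptions for illustration.

```python
import os
import shutil
from datetime import datetime, timezone


def extract_to_unique_dir(archive_path, when=None):
    """Unpack archive_path into a new directory next to it and return that directory."""
    # Assumed format: archive name, a dash, then the UTC time with microseconds.
    when = when or datetime.now(timezone.utc)
    stamp = when.strftime("%Y%m%dT%H%M%S.%fZ")
    parent = os.path.dirname(os.path.abspath(archive_path))
    target = os.path.join(parent, f"{os.path.basename(archive_path)}-{stamp}")
    # makedirs raises FileExistsError if the name is already taken,
    # so an existing extraction is never silently overwritten.
    os.makedirs(target)
    shutil.unpack_archive(archive_path, target)
    return target
```

Keeping each archive's contents in its own directory avoids files from different archives overwriting each other in the working directory.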
