ui-libraries/archiveit_aws_scripts: Scripts that I use(d) while storing copies of Archive-It collections in Amazon S3, using the Glacier Deep Archive storage class.

Archive-It Scripts

`archiveit_date_script.sh`

Downloads WARC files from an Archive-It collection (via WASAPI) and uploads any missing files to an S3 bucket.
To filter by date, set the optional AFTER_VALUE variable to a timestamp or a date.

Example full timestamp: `2025-09-01T00:00:00Z`
Or just a simple date: `2025-09-01` (interpreted as midnight UTC)
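
The two accepted forms can be related with a small helper. This is only a sketch: `normalize_after` is a hypothetical name, not something taken from the script, and the service may well accept the bare date as-is.

```shell
# Hypothetical helper (not part of the repository's scripts):
# expand a bare YYYY-MM-DD date to midnight UTC, pass anything else through.
normalize_after() {
  case "$1" in
    [0-9][0-9][0-9][0-9]-[0-9][0-9]-[0-9][0-9])
      printf '%sT00:00:00Z\n' "$1" ;;
    *)
      printf '%s\n' "$1" ;;
  esac
}

# Usage: both forms end up as a full timestamp.
AFTER_VALUE=$(normalize_after "2025-09-01")   # 2025-09-01T00:00:00Z
```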

Run it with:

export COLL_ID=###
export ARCHIVEIT_USER=###
export ARCHIVEIT_PASS=###
export S3_PREFIX=some/path
./archiveit_date_script.sh
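
For orientation, the WASAPI listing request behind the date filter might be built roughly as follows. This is a sketch under assumptions: the function name `build_wasapi_url` is hypothetical, the script is assumed to read AFTER_VALUE from the environment like the other settings, and the choice of the `crawl-time-after` query parameter is a guess (the script may filter on a different WASAPI time field).

```shell
# Sketch only: build_wasapi_url is a hypothetical name, and crawl-time-after
# is an assumed choice of filter parameter.
build_wasapi_url() {
  url="https://warcs.archive-it.org/wasapi/v1/webdata?collection=${COLL_ID}"
  # Append the date filter only when AFTER_VALUE is set and non-empty.
  if [ -n "${AFTER_VALUE:-}" ]; then
    url="${url}&crawl-time-after=${AFTER_VALUE}"
  fi
  printf '%s\n' "$url"
}

# Usage (requires valid Archive-It credentials):
#   curl -sS --fail -u "${ARCHIVEIT_USER}:${ARCHIVEIT_PASS}" "$(build_wasapi_url)"
```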

`archiveit_recheck_script.sh`

Rechecks all WARC files in an Archive-It collection (via WASAPI) and uploads any that are missing from an S3 bucket.
No date filter is applied; every file is processed. Large files are handled by passing the expected content length to the upload, which improves upload reliability.
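
One way the large-file handling could look is to stream the download straight into `aws s3 cp` and tell it the size up front. This is a sketch, not the script's actual code: `upload_warc_stream`, its argument order, and the `S3_BUCKET` variable are assumptions, and the size is taken as an argument because the README does not say where the script obtains it.

```shell
# Sketch only: function name, argument order, and S3_BUCKET are assumptions.
# Streams one WARC from Archive-It into S3 without a local copy.
upload_warc_stream() {
  url="$1"    # download location of the WARC file
  key="$2"    # destination object key, e.g. "${S3_PREFIX}/file.warc.gz"
  size="$3"   # expected size in bytes

  curl -sS --fail -u "${ARCHIVEIT_USER}:${ARCHIVEIT_PASS}" "$url" |
    aws s3 cp - "s3://${S3_BUCKET}/${key}" \
      --expected-size "$size" \
      --storage-class DEEP_ARCHIVE
}
```

When uploading from a stream, the AWS CLI cannot know the total size in advance, so `--expected-size` lets it pick multipart chunk sizes that fit very large objects. Note that in a plain pipeline a failed `curl` is not detected by the function's exit status, so a real script would want an additional check.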

Run it with:

export COLL_ID=###
export ARCHIVEIT_USER=###
export ARCHIVEIT_PASS=###
export S3_PREFIX=some/path
./archiveit_recheck_script.sh

Summary:

Ensures S3 and Archive-It are fully in sync
Skips files that already exist in S3
Logs new uploads to new_uploads.txt
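
The skip-and-log behavior summarized above can be sketched as a single function. The names `ensure_in_s3`, `S3_BUCKET`, and `UPLOAD_LOG` are hypothetical and only illustrate the idea; the upload step itself is supplied by the caller.

```shell
# Sketch only: ensure_in_s3, S3_BUCKET and UPLOAD_LOG are hypothetical names.
# Usage: ensure_in_s3 KEY UPLOAD_COMMAND [ARGS...]
ensure_in_s3() {
  key="$1"; shift

  # head-object exits 0 when the object exists. A non-zero exit can also mean
  # a permissions or network error, which this sketch treats as "missing".
  if aws s3api head-object --bucket "$S3_BUCKET" --key "$key" >/dev/null 2>&1; then
    printf 'skip %s\n' "$key"
    return 0
  fi

  # Run the caller-supplied upload command; log the key only on success.
  if "$@"; then
    printf '%s\n' "$key" >> "${UPLOAD_LOG:-new_uploads.txt}"
    printf 'uploaded %s\n' "$key"
  else
    printf 'failed %s\n' "$key" >&2
    return 1
  fi
}
```

Because the log is only appended to after a successful upload, `new_uploads.txt` lists exactly the files added in a run, and re-running is safe since existing keys are skipped.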
