WARC file for University Senate - University at Albany-SUNY, 2016 July 17

Acquisition information:

crawl: 225859

Crawl Rules

Ignore Robots.txt for www.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for www.alumni.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for www.ualbanysports.com (last updated 2016-02-11)

Ignore Robots.txt for library.albany.edu (last updated 2017-05-19)

Ignore Robots.txt for alumni.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for asrc.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for atmos.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for bioinformatics.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for cela.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for choose.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for cs.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for csda.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for imls.ctg.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for www.ctg.albany.edu (last updated 2017-05-19)

Ignore Robots.txt for cwig.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for events.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for hr.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for ibl.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for illiad.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for liblogs.albany.edu (last updated 2017-05-19)

Ignore Robots.txt for libguides.library.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for scholarsarchive.library.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for listserv.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for m.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for math.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for omega.math.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for mumford.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for nyjm.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for pdp.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for resnet.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for rit.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for cyberphysics.rit.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for rna.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for slsc.albany.edu (last updated 2017-05-19)

Ignore Robots.txt for uaems.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for uapps.albany.edu (last updated 2016-02-11)

Ignore Robots.txt for wiki.albany.edu (last updated 2016-02-11)

Crawl Times

start_date: 2016-07-17T16:55:03Z

original_start_date: 2016-07-17T16:55:03Z

last_resumption: None

processing_end_date: 2016-07-22T22:52:35Z

end_date: 2016-07-22T17:55:28Z

elapsed_ms: 435383503
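
The timestamps above can be compared with elapsed_ms directly. The following is a minimal sketch, assuming that elapsed_ms is a duration in milliseconds and that the dates are UTC (as the trailing "Z" suggests); the record itself does not define these fields, and the helper names are hypothetical.

```python
from datetime import datetime, timezone

# Values copied from the Crawl Times fields above.
START = "2016-07-17T16:55:03Z"
END = "2016-07-22T17:55:28Z"
PROCESSING_END = "2016-07-22T22:52:35Z"
ELAPSED_MS = 435383503  # assumed to be milliseconds


def parse_utc(stamp):
    """Parse a 'YYYY-MM-DDTHH:MM:SSZ' string as a timezone-aware UTC datetime."""
    parsed = datetime.strptime(stamp, "%Y-%m-%dT%H:%M:%SZ")
    return parsed.replace(tzinfo=timezone.utc)


def seconds_between(first, second):
    """Whole seconds from the first timestamp to the second."""
    return int((parse_utc(second) - parse_utc(first)).total_seconds())


# Wall-clock time from start_date to end_date.
wall_clock_s = seconds_between(START, END)
# Time between end_date and processing_end_date.
processing_s = seconds_between(END, PROCESSING_END)
# Difference between the wall-clock span and the reported elapsed time.
gap_s = wall_clock_s - ELAPSED_MS / 1000

print("start to end (s):", wall_clock_s)
print("end to processing end (s):", processing_s)
print("wall clock minus elapsed_ms (s):", round(gap_s, 3))
```

Under these assumptions the span from start_date to end_date is about four minutes longer than elapsed_ms; the record does not explain the difference.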

Crawl Types

type: MONTHLY

recurrence_type: MONTHLY

pdfs_only: False

test: False

Crawl Limits

time_limit: 432000

document_limit: None

byte_limit: None

crawl_stop_requested: None

Crawl Results

status: FINISHED_TIME_LIMIT

discovered_count: 6792375

novel_count: 212820

duplicate_count: 1098346

resumption_count: 0

queued_count: 5481209

downloaded_count: 1311166

download_failures: 256

warc_revisit_count: 1098317

warc_url_count: 1311042

total_data_in_kbs: 212249210

duplicate_bytes: 207609316437

warc_compressed_bytes: 675349961

Crawl Technical Details

doc_rate: 3.01

kb_rate: 487.0
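
The counts and rates reported above can be cross-checked against one another. The following is a minimal sketch, assuming relationships inferred from the numbers rather than stated in this record: downloaded = novel + duplicate, discovered = downloaded + queued, time_limit is in seconds, and the rates are totals divided by elapsed seconds. The function name and dictionary keys for derived values are hypothetical.

```python
# Values copied from the crawl report above.
crawl = {
    "discovered_count": 6792375,
    "novel_count": 212820,
    "duplicate_count": 1098346,
    "queued_count": 5481209,
    "downloaded_count": 1311166,
    "total_data_in_kbs": 212249210,
    "elapsed_ms": 435383503,
    "time_limit": 432000,  # assumed to be seconds (5 days)
}


def check_crawl(c):
    """Return consistency checks and derived rates for a crawl report."""
    elapsed_s = c["elapsed_ms"] / 1000
    return {
        # Assumed identity: every downloaded URL is either novel or duplicate.
        "downloaded_matches": (
            c["novel_count"] + c["duplicate_count"] == c["downloaded_count"]
        ),
        # Assumed identity: every discovered URL was downloaded or left queued.
        "discovered_matches": (
            c["downloaded_count"] + c["queued_count"] == c["discovered_count"]
        ),
        # The crawl status is FINISHED_TIME_LIMIT, so elapsed time should
        # reach the limit.
        "reached_time_limit": elapsed_s >= c["time_limit"],
        "docs_per_second": round(c["downloaded_count"] / elapsed_s, 2),
        "kb_per_second": round(c["total_data_in_kbs"] / elapsed_s, 1),
        "time_limit_days": c["time_limit"] / 86400,
    }


for name, value in check_crawl(crawl).items():
    print(f"{name}: {value}")
```

With these inputs both count identities hold and the computed documents-per-second figure agrees with the reported doc_rate. The computed KB-per-second figure is close to, but not identical with, the reported kb_rate of 487.0; the record does not say how that value was rounded.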

Physical / technical requirements:
Researchers interested in data analysis with web archives may request a WARC file. WARC files are very large and can be difficult to work with, so a request may take time to process, and we may be unable to deliver the file remotely. Please consult an archivist if you are interested in advanced research with web archives.

Using these materials

Access:
The archives are open to the public and anyone is welcome to visit and view the collections.

Collection restrictions:
Access to this record group is unrestricted.

Collection terms of access:
Records in this collection were created by the University at Albany, SUNY, and are public records.

Access options

Ask an Archivist

Ask a question or schedule an individualized meeting to discuss archival materials and potential research needs.

Schedule a Visit

Archival materials can be viewed in person in our reading room. We recommend making an appointment to ensure materials are available when you arrive.

Make a Remote Request

We may also be able to deliver digital scans remotely for a fee.