WARC file for Pride Center of the Capital Region, 2016 November 7

Acquisition information:

crawl: 247622

Crawl Rules

Limit host facebook.com to 500 documents

Limit host twimg.com to 500 documents

Limit host twitter.com to 500 documents

Crawl Times

start_date: 2016-11-07T20:17:54Z

original_start_date: 2016-11-07T20:17:54Z

last_resumption: None

processing_end_date: 2016-11-10T20:34:30Z

end_date: 2016-11-10T20:19:34Z

elapsed_ms: 259295153
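
The crawl times above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming that elapsed_ms is a duration in milliseconds and that it covers roughly the span from start_date to end_date; the record itself does not define these fields, and the helper name parse_utc is hypothetical.

```python
from datetime import datetime, timezone

# Timestamp layout used by the record, e.g. 2016-11-07T20:17:54Z
FMT = "%Y-%m-%dT%H:%M:%SZ"


def parse_utc(stamp: str) -> datetime:
    """Parse a 'Z'-suffixed timestamp from the record as a UTC datetime."""
    return datetime.strptime(stamp, FMT).replace(tzinfo=timezone.utc)


start = parse_utc("2016-11-07T20:17:54Z")           # start_date
end = parse_utc("2016-11-10T20:19:34Z")             # end_date
processing_end = parse_utc("2016-11-10T20:34:30Z")  # processing_end_date

# Wall-clock span between start_date and end_date, in seconds
wall_clock_s = (end - start).total_seconds()

# elapsed_ms converted to seconds (assumption: the unit is milliseconds)
elapsed_s = 259295153 / 1000

# Gap between the end of the crawl and the end of processing
post_processing_s = (processing_end - end).total_seconds()

print(f"wall clock:      {wall_clock_s:.0f} s")
print(f"elapsed_ms:      {elapsed_s:.3f} s")
print(f"post-processing: {post_processing_s:.0f} s")
```

Under these assumptions the two durations agree to within a few seconds, and processing finished about 15 minutes after the crawl ended.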

Crawl Types

type: MONTHLY

recurrence_type: MONTHLY

pdfs_only: False

test: False

Crawl Limits

time_limit: 259200

document_limit: None

byte_limit: None

crawl_stop_requested: None

Crawl Results

status: FINISHED_TIME_LIMIT

discovered_count: 239435

novel_count: 60745

duplicate_count: 6469

resumption_count: 0

queued_count: 172221

downloaded_count: 67214

download_failures: 2

warc_revisit_count: 6469

warc_url_count: 67207

total_data_in_kbs: 3427608

duplicate_bytes: 461867411

warc_compressed_bytes: 794115439

Crawl Technical Details

doc_rate: 0.26

kb_rate: 13.0
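
The counts and rates reported above appear to be related by simple arithmetic. The sketch below checks those relationships for this crawl only; the interpretations in the comments (for example, that doc_rate is documents per second and that time_limit is in seconds) are assumptions inferred from the numbers, not definitions given in the record.

```python
# Values copied from the crawl record above
novel_count = 60745
duplicate_count = 6469
downloaded_count = 67214
queued_count = 172221
discovered_count = 239435
total_data_in_kbs = 3427608
elapsed_ms = 259295153
time_limit = 259200

# Assumption: elapsed_ms is in milliseconds, time_limit in seconds
elapsed_s = elapsed_ms / 1000
time_limit_hours = time_limit / 3600

# Downloaded documents split into novel and duplicate captures
counts_add_up = novel_count + duplicate_count == downloaded_count

# Discovered URLs split into downloaded and still-queued
discovery_adds_up = downloaded_count + queued_count == discovered_count

# Assumption: the rates are per second of elapsed crawl time
doc_rate = downloaded_count / elapsed_s
kb_rate = total_data_in_kbs / elapsed_s

print(f"novel + duplicate == downloaded:  {counts_add_up}")
print(f"downloaded + queued == discovered: {discovery_adds_up}")
print(f"time limit: {time_limit_hours:.0f} hours")
print(f"doc_rate:   {doc_rate:.2f} documents per second")
print(f"kb_rate:    {kb_rate:.1f} KB per second")
```

The computed document rate rounds to the reported 0.26. The computed KB rate comes out slightly above the reported 13.0, which suggests the record's figure is truncated or calculated over a slightly different interval. The large queued_count is consistent with the FINISHED_TIME_LIMIT status: the crawl stopped at its time limit with URLs still waiting.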

Physical / technical requirements:
Researchers interested in data analysis with web archives may request a WARC file. WARC files are very large and difficult to work with. Your request may take time to process, and we may be unable to deliver your request remotely. Please consult an archivist if you are interested in advanced research with web archives.

Using these materials

Access:
The archives are open to the public and anyone is welcome to visit and view the collections.
Collection restrictions:
Access to this collection is unrestricted. Some folders in the Administrative Files and Events series are restricted due to personal information. Consult a staff member for further details.
Collection terms of access:
The researcher assumes full responsibility for conforming with the laws of copyright. Whenever possible, the M.E. Grenander Department of Special Collections and Archives will provide information about copyright owners and other restrictions, but the legal determination ultimately rests with the researcher. Requests for permission to publish material from this collection should be discussed with the Head of Special Collections and Archives.

Access options

Ask an Archivist

Ask a question or schedule an individualized meeting to discuss archival materials and potential research needs.

Schedule a Visit

Archival materials can be viewed in person in our reading room. We recommend making an appointment to ensure materials are available when you arrive.

Make a Remote Request

We may also be able to deliver digital scans remotely for a fee.