WARC file for New York Civil Liberties Union (NYCLU) - American Civil Liberties Union of New York State, 2016 May 26
- Acquisition information:
-
crawl: 214791
Crawl RulesLimit host twitter.com to 1000 documents
Limit host upload.wikimedia.org to 1000 documents
Limit host en.wikipedia.org to 1000 documents
Crawl Timesstart_date: 2016-05-26T14:53:30Z
original_start_date: 2016-05-26T14:53:30Z
last_resumption: None
processing_end_date: 2016-05-26T16:05:22Z
end_date: 2016-05-26T15:53:59Z
elapsed_ms: 3616451
Crawl Typestype: TEST_SAVED
recurrence_type: NONE
pdfs_only: False
test: True
Crawl Limitstime_limit: 3600
document_limit: None
byte_limit: None
crawl_stop_requested: None
Crawl Resultsstatus: FINISHED_TIME_LIMIT
discovered_count: 6340
novel_count: 4471
duplicate_count: 723
resumption_count: 0
queued_count: 1146
downloaded_count: 5194
download_failures: 0
warc_revisit_count: 723
warc_url_count: 5187
total_data_in_kbs: 377382
duplicate_bytes: 13335053
warc_compressed_bytes: 211623060
Crawl Technical Detailsdoc_rate: 1.44
kb_rate: 104.0
- Physical / technical requirements:
- Researchers interested in data analysis with web archives may request a WARC file. WARC files are very large and difficult to work with. Your request may take time to process, and we may be unable to deliver your request remotely. Please consult an archivist if you are interested in advanced research with web archives.
Using these materials
- Access:
- The archives are open to the public and anyone is welcome to visit and view the collections.
- Collection restrictions:
- Access to this collection is restricted because it is unprocessed. Portions of the collection may contain recent administrative records and/or personally identifiable information. Please contact an archivist for more information. Certain restrictions may apply.
- Collection terms of access:
- The University Archives are eager to hear from any copyright owners who are not properly identified so that appropriate information may be provided in the future.