Internet Archive crawldata from Webwide Crawl, captured by crawl410.us.archive.org:wide from Tue Jun 7 19:29:45 2011 to Tue Jun 7 20:43:41 2011 (times per Firstfiledate and Lastfiledate).
This item is part of the collection: Wide Crawl started March 2011
Identifier: WIDE-20110607192945-crawl410
Contributor: Internet Archive
Crawler: Heritrix/3.1.0-SNAPSHOT-20110511.082841
Crawljob: wide
Creator: Internet Archive
Date: 2011
Firstfiledate: 20110607192945
Firstfileserial: 02343
Identifier-access: http://www.archive.org/details/WIDE-20110607192945-crawl410
Lastdate: 20110607134341
Lastfiledate: 20110607204341
Lastfileserial: 02352
Mediatype: web
Numwarcs: 10
Operator: kenji@archive.org
Scandate: 20110607192945
Scanner: crawl410.us.archive.org
Scanningcenter: sanfrancisco
Sizehint: 10550371919
Sponsor: Internet Archive
Publicdate: 2011-06-08 00:15:07
Addeddate: 2011-06-08 00:15:07
Imagecount: 180283
Keywords: crawldata
| Information | Format | Size |
| WIDE-20110607192945-crawl410_files.xml | Metadata | [file] |
| WIDE-20110607192945-crawl410_meta.xml | Metadata | 1.5 KB |
| WIDE-20110607192945-crawl410_reviews.xml | Metadata | 189.0 B |
| Other Files | Text | Web ARChive GZ | WARC CDX Index |
| MANIFEST.txt | 680.0 B | | |
| WIDE-20110607192945-02343.warc.gz | | 953.7 MB | 1.3 MB |
| WIDE-20110607193804-02344.warc.gz | | 1.2 GB | 1.1 MB |
| WIDE-20110607194630-02345.warc.gz | | 1.1 GB | 1.1 MB |
| WIDE-20110607195738-02346.warc.gz | | 1,002.9 MB | 1.5 MB |
| WIDE-20110607200424-02347.warc.gz | | 972.3 MB | 897.5 KB |
| WIDE-20110607202254-02348.warc.gz | | 954.2 MB | 476.6 KB |
| WIDE-20110607202453-02349.warc.gz | | 953.7 MB | 986.2 KB |
| WIDE-20110607202835-02350.warc.gz | | 955.9 MB | 773.4 KB |
| WIDE-20110607203551-02351.warc.gz | | 954.9 MB | 1.0 MB |
| WIDE-20110607204341-02352.warc.gz | | 953.7 MB | 1.4 MB |