Digital's Web Proxy Traces: Currently Available Traces
V1.0 format
The files that were present here until
February 7, 1997 were produced by slightly buggy software, and
had an incorrect mapping between UIDs and the underlying values.
If you have already retrieved files from this data set, and
did not see this warning before retrieving the files, then you
probably have bogus data.
We apologize for any problems caused by these errors.
The current traces contain data taken
between 29 August 1996 and 22 September 1996. (Some traces may not
cover entire days.)
This is a total of approximately 24,477,674 references.
For a subset containing only the references from clients
that were within a short distance of the proxy where the trace
was made, see the
subset traces.
Each day's data is in a separate file (the sizes given are for the
compressed form of the file):
The "headers" file for each trace contains statistics about the trace,
including the number of references.
In addition, we provide files mapping between the unique IDs (UIDs)
in these traces and keyed-MD5 signatures of the original fields.
These mappings may be useful, in the future,
for correlating between traces from different sites.
Note: some of these mapping files are quite large!
The complete set of traces
is about 526 MBytes; the complete set of mapping files is about 107 MBytes.
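As a rough illustration of the cross-site correlation mentioned above, the sketch below joins two mapping files on their signatures. It rests on assumptions not stated on this page: that each mapping line has the hypothetical form "<uid> <signature>" (see the documentation for the real layout), and that both sites computed their signatures with the same key. The function names are invented for this example.

```python
def load_mapping(lines):
    """Parse lines of the ASSUMED form '<uid> <signature>' into {signature: uid}."""
    sig_to_uid = {}
    for line in lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        uid, signature = parts
        sig_to_uid[signature] = uid
    return sig_to_uid


def correlate(site_a_lines, site_b_lines):
    """Return {uid_at_site_a: uid_at_site_b} for signatures found in both mappings.

    Only meaningful if both sites used the same key (an assumption).
    """
    a = load_mapping(site_a_lines)
    b = load_mapping(site_b_lines)
    return {a[sig]: b[sig] for sig in a.keys() & b.keys()}


# Made-up sample data, not taken from the real mapping files.
site_a = ["1 aaa", "2 bbb"]
site_b = ["7 bbb", "9 ccc"]
shared = correlate(site_a, site_b)
```

With the made-up data above, only the signature "bbb" appears at both sites, so UID 2 at the first site pairs with UID 7 at the second. The join works purely on the opaque signatures and does not require, or reveal, the original field values.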
We will not release the original mapping between these fields and
keyed-MD5 signatures, nor will we release the MD5 key. Please do
not ask.
For information about file formats, etc., please see the
documentation.
In addition to the trace files listed here, we have another
set of files that cover the same trace but with a few additional
fields. These are listed as the
V1.2 format trace files,
and their format is described in the
documentation for the V1.2 format.
Anyone using these traces agrees to these conditions:
- These traces may not be used for any commercial purposes without
the express prior, written consent of Digital Equipment Corporation.
For information about licensing the use of these traces, please contact
Director of Licensing
Western Research Laboratory
Digital Equipment Corporation
250 University Avenue
Palo Alto, California 94301
- You may make no attempt to deduce or otherwise discover the
actual client addresses, server names, server paths, or query strings
used in the underlying raw traces, nor may you attempt to deduce or
otherwise discover any encryption key that we have used to conceal these
fields.
- We encourage the use of these traces for bona fide research,
but we ask that all publications and public disclosures based on these
traces give explicit credit to Digital Equipment Corporation for providing
these traces.
- These traces and the associated software may not be distributed
to third parties.
- Digital makes no promises that these traces are accurate, representative,
or properly formatted. Use them entirely at your own risk. In particular,
we cannot take responsibility for any incorrect conclusions that you may
draw from these traces, even if the traces appear to justify these
conclusions.
- Concerning any software provided with these traces:
THE SOFTWARE IS PROVIDED "AS IS" AND DIGITAL EQUIPMENT CORP. DISCLAIMS ALL
WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL IMPLIED WARRANTIES
OF MERCHANTABILITY AND FITNESS. IN NO EVENT SHALL DIGITAL EQUIPMENT
CORPORATION BE LIABLE FOR ANY SPECIAL, DIRECT, INDIRECT, OR CONSEQUENTIAL
DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR
PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS
ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS
SOFTWARE.
For more information about the traces, please contact
Jeffrey Mogul (mogul@pa.dec.com).