diff --git a/wayback-cdx-server/README.md b/wayback-cdx-server/README.md index 3e20a1809f..1224ed288d 100644 --- a/wayback-cdx-server/README.md +++ b/wayback-cdx-server/README.md @@ -82,6 +82,10 @@ At this time, the following cdx fields are publicly available: `["urlkey","timestamp","original","mimetype","statuscode","digest","length"]` +To get the HTML of the capture, the URL is formatted as follows: `http://web.archive.org/web//` + +To get the original page back (without the Wayback Machine rewriting URLs on the page to point into the Archive), you can suffix the timestamp with `id_`, as follows: `http://web.archive.org/web/id_/` + It is possible to customize the [Field Order](#field-order) as well. The the **url=** value should be [url encoded](http://en.wikipedia.org/wiki/Percent-encoding) if the url itself contains a query.