A website copy with HTTrack

httrack

Alton Towers October 2002

project name: altontowers
Web(URL) address: www.altontowers.com
tick: Attempt to detect all links
do not tick: DOS names or ISO9660 names
amount of time: 3 hours (56k modem)
in the scan rules, add:
-*.exe -*.zip
+http://press.altontowers.com/
+http://www.peoplebank.com/pb3/corporate/Tussauds/Alton_Towers/alton_popup.htm?
+http://www.rialtoticketing.com/app/rialto/ControlServlet/home/initialise?sellingAgentId=1
+www.altontowers.com/air/main.htm
+http://alton-towers.custhelp.com/
+www.altontowers.com/terror/index.htm?
+www.altontowers.com/map_areas/index.htm?
+http://www.scan.com/VQMobile/

problems:

Javascript, Flash, mirrors, site often and completely redesigned...

Other examples with similar difficulties: Herberton | Extreme World | Martin Luther King 2004
Flash Discovery Cove | Ratanga | The Engine Room | Wild Waters Park | Rapids Water Park | Gulliver's Theme Park | Camelot Theme Park

solutions:

Updating a capture is difficult with this type of site as everything is constantly modified: java applets, Flash, Javascript, images, videos...
This site may be with or without java applet, written in html and javascript, html javascript and flash, html javascript or flash ,php html javascript and flash, flash only...
Moreover, the links can send you to a mirror as www.alton-towers.co.uk.
All the problems may be there. In October 2002, javascript was the main problem as it calculates almost everything.

For images, here is one of the functions. This one calculates in nav.htm the image names of buttons.

Another example of function that computes the filename to call in "pop-ups".

var areapop = false; // variable needed for popup control of map areas.

function wopen(uri,width,height){ // generic window opening function used across whole site.
    newWin = window.open(uri,'','status=no,resizable=no,toolbars=no,scrollbars=yes,
    width='+width+',height='+height);
    return newWin;
}

function areaOpen(uri,width,height){ // function used to control the area pop-up from the map page
    if(areapop.closed){areapop = wopen('map_areas/'+uri,width,height);}
    if(areapop){areapop.location='map_areas/'+uri;areapop.focus();}
    else{areapop = wopen('map_areas/'+uri,width,height);}
}

No website mirror utility can interpret them and download the files.

For the browser, here is a routine that will make the capture useless with the next version of Internet Explorer or Netscape:

    var ie = (navigator.appName.indexOf('Microsoft')!=-1)?true:false;
    var ns = (navigator.appName.indexOf('Netscape')!=-1)?true:false;
    var ie4 = (document.all&&!document.getElementById)?true:false;
    var ns4 = document.layers?true:false;
    var ns6 = (this.ns&&(parseInt(navigator.appVersion)>4))?true:false;
    var ie5 = (this.ie&&document.getElementById)?true:false;

Adding a few scan rules will allow you to capture the calculated file names, but Flash and PHP files will make the capture even more difficult.
The option -*.exe et -*.zip will exclude very big and uninteresting files. You can also add -*.mpg to leave videos out.
The option +http://alton-towers.custhelp.com writes more than 3000 files.
The option +www.alton-towers.co.uk simplifies the capture but doubles the number of files.

To get a satisfying capture, the simpler is to use Internet Explorer cache.

First, finish the capture and take note of pages with problems.

Then visit these pages and click on all the links.
All the pages must be completely loaded.
Point the mouse on all the images, wait for image changes on "mouseovers".

In the home page, click on "AIR" or on the image in the centre. A window opens with Flash animations.
In the end of the animation an html page is loaded, click on all the links.

Back to the home page, click on "SEE THE PARK".
The links in the menu "AREA INFO" have not been captured. Click on one of the links.
In the window which opens,
AREA INFO
you will have to click on all the links.
Wait until each window is completely loaded before clicking the next link.

Now most of the missing files are in the Internet Explorer cache: Temporary Internet Files.

Sort the files by address to find them. Here is a short part of the list.
Temporary Internet Files

Copy the files folder by folder (create the missing ones) into the capture.
They are copied with a figure between brackets. Rename them if they were missing and delete the others.

In the folder map_areas (see over), delete the files index[ ].html.

Modify the left and right menus in the file maptest.htm:

Here is the menu on the left

The names given by WinHTTrack may be different as the pages are generated by PHP and values may have changed.
Replace indexf438.htm with thrillseekers.htm, indexdbcd.htm with familyfun.htm, index9353a.htm with youngchildren.htm

Here is the menu on the right

Replace index0797.htm with valley.htm, indexe1ed.htm with xsector.htm
(then ugland.htm, katanga.htm etc.) until the end of the menu.

Here you are.
topTop of the page

With javascript

W3C XHTML 1.0
W3C CSS