A website copy with HTTrack
Herberton State School December 2002
project name: herbertonWeb(URL) address: www.herbertonss.qld.edu.au
tick: Attempt to detect all links
do not tick: DOS names or ISO9660 names
amount of time: 1 hour (56k modem)
In the Web addresses, add
http://www.herbertonss.qld.edu.au/magazine/year10camp.htm
http://www.herbertonss.qld.edu.au/magazine/menu.htm
http://www.herbertonss.qld.edu.au/magazine/year5full.htm
http://www.herbertonss.qld.edu.au/magazine/successmaker.htm
http://www.herbertonss.qld.edu.au/magazine/germancamp.htm
and for hundreds of young people contributions
http://www.schools.ash.org.au/herberton/bushscene
problems:
Missing files and images, java applet, javascript functionsOther examples with similar difficulties: Adventure City | Firstenergy | Alton Towers | Extreme World | Martin Luther King 2002 | Martin Luther King 2004 | Recycling | Herberton | Canobie
solutions:
At the end of the capture, almost all the files have been downloaded, there are some errors as in any school website, some images are missing and the "School Mag 98" menu does not work.For the menu, it is because of a java applet.
Thus you have to visit the page (www.herbertonss.qld.edu.au/school/school.htm) and wait until it is completely loaded.
In the cache (Temporary Internet Files) you will find:
Copy the file fphoverx.class in the capture folder, then remove [1] in the file name.
The HTML code in the frame www.herbertonss.qld.edu.au/magazine/index.html, shows the file names called by the menu:
<applet width="120" height="24" codebase="_fpclass/" code="fphover.class"> <param name="text" value="Year 10 Camp"> <param name="color" value="#000080"> <param name="hovercolor" value="#0000FF"> <param name="textcolor" value="#FFFFFF"> <param name="effect" value="glow"> <param name="url" value="year10camp.htm" valuetype="ref"> </applet>Here, it is year10camp.htm. This is why I added, after reading the source of the file, all the addresses to be captured in the WEB addresses.
This method is better than copying the missing files from the cache because it allows a complete capture of the links.
At the top of the website homepage there is a navigation bar (which will be useless) which images disappear when the Internet cache is cleared.
A javascript routine finds them in the cache and displays them.
If you want to have them displayed (and as an example), modify the absolute address in the routine you can locate in the page www.herbertonss.qld.edu.au/index.html:
<SCRIPT LANGUAGE="JavaScript" SRC="../sina.schools.net.au/sina3-internal-static/icons/navbar.js"> </SCRIPT>Edit the file navbar.js, replace var imagedir="http://navbar.schools.net.au/sina3-internal-static/icons/"; by
var imagedir="../sina.schools.net.au/sina3-internal-static/icons/"; in all the functions.
The images downloaded by WinHTTrack in the folder are unusable, so you will have to look for them in the cache (Temporary Internet Files) where you should have:
Then copy the files into the mirror folder and remove [1] in the file names.
You will have:
and the (useless) navigation bar will now display the images.
Now, you can browse the mirror offline.