Hi Brian -
I tried out downloading the manual using
HTTrack Website Copier. I used the Windows version in GUI mode. There is command line mode and there is Linux/Mac version.
It took over 3 hours to download the 300 MB site. Log summary line below.
HTTrack Website Copier/3.48-21 mirror complete in 3 hours 24 minutes 13 seconds : 2703 links scanned, 2389 files written (296627796 bytes overall), 2389 files updated [303084543 bytes received at 24735 bytes/sec], 4.1 requests per connection
(304 errors, 868 warnings, 0 messages)While it mentions errors, I didn't notice any issues other than the styling. The results looked just fine to me without the styling. I used all default settings except for Browser ID. With the default Browser ID I was immediately getting a 403. I changed it to "Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/60.0.3112.101 Safari/537.36" and it worked.
This first line from the logs shows the options that were used.
HTTrack3.48-21+htsswf+htsjava launched on Sat, 26 Aug 2017 17:36:31 at http://zeromanual.com/index.php/Main_Page +*.png +*.gif +*.jpg +*.jpeg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar
(winhttrack -qiC2%Ps2u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F " Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/60.0.3112.101 Safari/537.36" -%F "<!-- Mirrored from %s%s by HTTrack Website Copier/3.x [XR&CO'2014], %s -->" -%l "en, *" http://zeromanual.com/index.php/Main_Page -O1 "C:\My Web Sites\Zero_Unofficial_Manual" +*.png +*.gif +*.jpg +*.jpeg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar )To test the results I disabled my network adapter and opened the following file in Chrome:
file:///C:/My%20Web%20Sites/Zero_Unofficial_Manual/zeromanual.com/index.php/Main_Page.html
Then clicked around at the links and made sure the images all appeared, which they did.
Again, only issue was there was no styling. You could add a post-processing step to fix that but I don't think it is worth the effort as it looked fine without the styling. But it might be worth adding a post-processing step to strip out the
link rel="stylesheet" lines as they just timeout in the background.
-Jerome
I am trying to make an offline version of the manual that one can load onto a phone/tablet/laptop/etc. but am having trouble getting everything working in harmony.
I'm having the closest experience to success on this stack (after too many "yak-shaving" experiences) but it really doesn't work without the visualeditor plugin:
https://github.com/openzim/mwoffliner
I do need to update the UI, I just forgot about updating it. New kid, new job, etc... makes it easy to forget about things like that.
I think I started to look at what my options were, but there was some kind of conflict between what I wanted and the version of something I had.. I'll try to look at it soon.
I hope we can clear this hurdle soon, because it seems to unlock a bunch of benefits at once. Honestly, I assume that for some cloud/virtualized hosting services, there's a software image that has this all set up, as long as one is willing to put up with some particular Linux distro.
If that doesn't work, what might?