by gilitelem on Sat Aug 30, 2008 8:27 am
Well... here's the full story.
We have an old online forum which, believe it or not, lacks a search feature.
I used an offline browsing program to download a couple of hundreds of forum pages, and I would like to let some desktop search program (i.e. Google Desktop, Copernic) index them for me.
The problem is the pages are not indexed properly in their original HTML format. I guess it has something to do with the "ugly" HTML format of the forum, which makes the desktop search programs disregard most of the text.
Luckily, indexing works fine after saving each forum page as a text file (using Firefox File>Save Page As).
Unfortunately, I can't do that manually for all the files, so I created a list of links to all the downloaded pages (local links) and would like to run some simple script to go over the list and do that for me.
Thanks!