On Thu, 2003-08-28 at 20:06, Stephen Liu wrote: > Hi William, > > Thanks for your advice. > > On Wed, 2003-08-27 at 20:07, William Hooper wrote: > > Joshua Legbandt said: > > > The Washington Post requires (free) registration. It allows lynx (or > > > anything using the lynx user-agent header) through unhindered. > > > > > > -josh > > > > So one could tell wget to identify itself as lynx. > > > > man wget and look for the "--user-agent" arguement. > > $ wget -U --no-parent > http://www.washingtonpost.com/wp-dyn/articles/A34978-2003Aug23.html > > Downloaded the file and browsed it starting the (free) registration > form. > > Having completed the said form and clicked "Go" following warning pop-up > > "Unable to run the command specified. The file or directory > file:/ac2/wp-dyn/IncrementalRegServlet?node=admin/registration/incremental&destination=incremental&nextstep=display&application=3-Point-technology&applicationURL=http%3A%2F%2Fwww.washingtonpost.com%2Fwp-dyn%2Farticles%2FA34978-2003Aug23.html does not exist" > > Could not proceed further. try: wget -U 'Lynx/2.8.4' --no-parent http://www.washingtonpost.com/wp-dyn/articles/A34978-2003Aug23.html or wget -U 'Lynx/2.8.4' http://www.washingtonpost.com/ac2/wp-dyn/A34978-2003Aug23?language=printer To get the text on a single page with no graphics and no dead links :) -josh > > B.Regards > Stephen > > > To Get Your Own iCareHK.com Email Address? Go To www.iCareHK.com. -- Joshua Legbandt <jtlegbandt@xxxxxxxxxxxxx> -- Shrike-list mailing list Shrike-list@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/shrike-list