Re: finally.... and wget questioon

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, 2003-08-28 at 20:06, Stephen Liu wrote:
> Hi William,
> 
> Thanks for your advice.
> 
> On Wed, 2003-08-27 at 20:07, William Hooper wrote:
> > Joshua Legbandt said:
> > > The Washington Post requires (free) registration. It allows lynx (or
> > > anything using the lynx user-agent header) through unhindered.
> > >
> > > -josh
> > 
> > So one could tell wget to identify itself as lynx.
> > 
> > man wget and look for the "--user-agent" arguement.
> 
> $ wget -U --no-parent
> http://www.washingtonpost.com/wp-dyn/articles/A34978-2003Aug23.html
> 
> Downloaded the file and browsed it starting the (free) registration
> form.
> 
> Having completed the said form and clicked "Go" following warning pop-up
> 
> "Unable to run the command specified. The file or directory
> file:/ac2/wp-dyn/IncrementalRegServlet?node=admin/registration/incremental&destination=incremental&nextstep=display&application=3-Point-technology&applicationURL=http%3A%2F%2Fwww.washingtonpost.com%2Fwp-dyn%2Farticles%2FA34978-2003Aug23.html does not exist"
> 
> Could not proceed further.

try:
 wget -U 'Lynx/2.8.4' --no-parent
http://www.washingtonpost.com/wp-dyn/articles/A34978-2003Aug23.html

or  
wget -U 'Lynx/2.8.4'
http://www.washingtonpost.com/ac2/wp-dyn/A34978-2003Aug23?language=printer

To get the text on a single page with no graphics and no dead links :)

-josh

> 
> B.Regards
> Stephen
> 
> 
> To Get Your Own iCareHK.com Email Address?  Go To www.iCareHK.com.
-- 
Joshua Legbandt <jtlegbandt@xxxxxxxxxxxxx>


-- 
Shrike-list mailing list
Shrike-list@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/shrike-list

[Index of Archives]     [Fedora Users]     [Centos Users]     [Kernel Development]     [Red Hat Install]     [Red Hat Watch]     [Red Hat Development]     [Red Hat Phoebe Beta]     [Yosemite Forum]     [Fedora Discussion]     [Gimp]     [Stuff]     [Yosemite News]

  Powered by Linux