On Thu, 2003-08-28 at 21:51, Stephen Liu wrote: > Hi Josh, > > On Fri, 2003-08-29 at 11:56, Joshua Legbandt wrote: > > > try: > > wget -U 'Lynx/2.8.4' --no-parent > > http://www.washingtonpost.com/wp-dyn/articles/A34978-2003Aug23.html > > > > or > > wget -U 'Lynx/2.8.4' > > http://www.washingtonpost.com/ac2/wp-dyn/A34978-2003Aug23?language=printer > > Both work, downloading only 1st page including pictures. How to get > this file including 2nd page? There are 2 pages in this file. > The second statement (ending in =printer) should grab the entire article as html formatted text, at least it did for me... note that the url's are slightly different. > What will be the syntax 'Lynx/2.8.4' used for? > $ rpm -q lynx > lynx-2.8.5-7 > the -U option allows you to specify a user-agent string. This string is used by the http server to to differentiate one browser from another. I specified lynx version 2.8.4, it really doesn't matter what version you specify as long is it works. The string 'Lynx/2.8.4' happened to show up in the logs of one of my websites, so I used that. Hope that helps clarify a bit. cheers, josh -- Joshua Legbandt <jtlegbandt@xxxxxxxxxxxxx> -- Shrike-list mailing list Shrike-list@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/shrike-list