Re: wget issue

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



aha.. we have success...

steps to repeat:

generate the base page to get the initial content, as well as the cookie

wget -S "https://isiscc.smc.edu/pls/apex/f?p=123:1:12101780574999:::::";
 -O  smc.dat

extract the required cookie:
ORA_WWV_APP_123=ORA_WWV-HtXgmY8ZHOxc5O9zc8N8j6tP
>>>>ORA_WWV-HtXgmY8ZHOxc5O9zc8N8j6tP<<<

this goes in the wget as the cookie header

use the returned smc.dat file to determine the required attributes
that change.  In this case, we need


&p_instance=257747591540
&p_page_submission_id=4686527047365
&p_page_checksum=DB5BF68CB0895706468BB180F90F4C3B

search/copy these vals from the data

in order to generate the post data, you can do it in a couple of ways.
I viewed the returned post data from a few different pages of the site
in my fav viewer/debugger, and determined that all of the variables
appear to be the same/consistent.  For real accuracy, and in a
crawler, it would be useful to simply do a xpath on the hidden
name/vals  and use that data to generate the post-data.

but we cheat for now:
so the base post-data consists of
p_flow_id=123&p_flow_step_id=1&p_request=SUBMIT&p_arg_names=6371933991931405031&p_t01=Spring+2015&p_arg_names=6362920808091113318&p_t02=1+%2C+2+%2C+3+%2C+4+%2C+5+%2C+6+%2C+7+%2C+12&p_arg_names=6362811294857854001&p_t03=%27OPEN%27+%2C+%27CLOSED%27&p_arg_names=6374897696943587288&p_t04=20151&p_arg_names=3851224476948135099&p_t05=1&p_arg_names=3851225084220137224&p_t06=2&p_arg_names=6361666501512343398&p_arg_names=6362829299321902607&p_arg_names=6363404907668258054&p_arg_names=6363528795332614067&p_arg_names=6363575091232717019&p_arg_names=6363649096707888902&p_arg_names=3851458085236906098&p_t13=April+++++02%2C+2015+%40+12%3A05+pm&p_arg_names=6372068906550664708&p_t14=cls_web_list_20151&p_arg_names=6372277789412047762&p_t15=web_cat_sched_20151&p_arg_names=6361721707633487104&p_t16=where+cat_grp_seq_num+in+%28cat_grp_seq_num%29+and+cls_status+in+%28%27OPEN%27+%2C+%27CLOSED%27%29+and+print_seq+in+%28print_seq%29+and+print_seq+in+%281+%2C+2+%2C+3+%2C+4+%2C+5+%2C+6+%2C+7+%2C+12%29+and+nvl%28gnr_emply_name%2C%27*%27%29+in+%28nvl%28gnr_emply_name%2C%27*%27%29%29+and+nvl%28ssn_days%2C%27*%27%29+in+%28nvl%28ssn_days%2C%27*%27%29%29+and+sct_begin_week+in+%28sct_begin_week%29+and+nvl%28begin_time%2C%270%27%29+in+%28nvl%28begin_time%2C%270%27%29%29&p_md5_checksum=&p_instance=257747591540&p_page_submission_id=4686527047365&p_page_checksum=DB5BF68CB0895706468BB180F90F4C3B&p_v07=80

the class ID p_v07 changes based on the sel/opt within the page -80
happens to be for chem.
&p_v07=80

so combining everything together...

we get the wget
wget --no-cookies --header "Cookie:
ORA_WWV_APP_123=ORA_WWV-HtXgmY8ZHOxc5O9zc8N8j6tP"
--user-agent="Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.0.11)
Gecko/2009061118 Fedora/3.0.11-1.fc9 Firefox/3.0.11" -O -
--post-data="p_flow_id=123&p_flow_step_id=1&p_request=SUBMIT&p_arg_names=6371933991931405031&p_t01=Spring+2015&p_arg_names=6362920808091113318&p_t02=1+%2C+2+%2C+3+%2C+4+%2C+5+%2C+6+%2C+7+%2C+12&p_arg_names=6362811294857854001&p_t03=%27OPEN%27+%2C+%27CLOSED%27&p_arg_names=6374897696943587288&p_t04=20151&p_arg_names=3851224476948135099&p_t05=1&p_arg_names=3851225084220137224&p_t06=2&p_arg_names=6361666501512343398&p_arg_names=6362829299321902607&p_arg_names=6363404907668258054&p_arg_names=6363528795332614067&p_arg_names=6363575091232717019&p_arg_names=6363649096707888902&p_arg_names=3851458085236906098&p_t13=April+++++02%2C+2015+%40+12%3A05+pm&p_arg_names=6372068906550664708&p_t14=cls_web_list_20151&p_arg_names=6372277789412047762&p_t15=web_cat_sched_20151&p_arg_names=6361721707633487104&p_t16=where+cat_grp_seq_num+in+%28cat_grp_seq_num%29+and+cls_status+in+%28%27OPEN%27+%2C+%27CLOSED%27%29+and+print_seq+in+%28print_seq%29+and+print_seq+in+%281+%2C+2+%2C+3+%2C+4+%2C+5+%2C+6+%2C+7+%2C+12%29+and+nvl%28gnr_emply_name%2C%27*%27%29+in+%28nvl%28gnr_emply_name%2C%27*%27%29%29+and+nvl%28ssn_days%2C%27*%27%29+in+%28nvl%28ssn_days%2C%27*%27%29%29+and+sct_begin_week+in+%28sct_begin_week%29+and+nvl%28begin_time%2C%270%27%29+in+%28nvl%28begin_time%2C%270%27%29%29&p_md5_checksum=&p_instance=257747591540&p_page_submission_id=4686527047365&p_page_checksum=DB5BF68CB0895706468BB180F90F4C3B&p_v07=80"
  -L "https://isiscc.smc.edu/pls/apex/wwv_flow.accept";

now, it might be useful to test to see if this will also work against
the base initial page of

wget -S "https://isiscc.smc.edu/pls/apex/f?p=123:1:0::NO:RP:"; -O smc.dat

I suspect it should


thanks for your help on this.. I'm reposting back to the fed list in
case anyone needs something like this in the future!

I was never able to get this working for the curl/pycurl process.

It appears to have something to do with the cert/ssl issues...

thanks

On Thu, Apr 2, 2015 at 4:49 PM, Alchemist <raimiiic@xxxxxxxxx> wrote:
>
>
> 2015-04-02 23:30 GMT+03:00 bruce <badouglas@xxxxxxxxx>:
>>
>> Hmm..
>>
>> ok..here;s the process I'm going to test... (and by the way. thanks
>> much for the cycles you've used to help with this. Helps to sometimes
>> have someone to bounce things off of! much appreciation)
>>
>> it appears that we can generate the "cookies" (or at least you can)
>> from the initial 2 line cookie script/test - and that this can then be
>> inserted in the wget, using "post data"
>>
>> So the issue then appears to be how to generate the required post data.
>>
>> If we assume the site generates key attributes in the base/top level
>> page, and that these attributes can then be used in the post-data for
>> the subsequent calls, it becomes a matter of then determining which
>> attributes are required to be used from the base page (as it changes),
>> and which attributes can simply be "set" and not changed.
>>
>> I'm going to test this by taking the base page, ripping off the xpath
>> to get the name/val data, and to see if this works..
>>
>> By the way, with the updated data you replied with, just how did you
>> regenerate the post data?
>>
>> thanks
>>
>
> Yes. A quick look uncovered, that wwvFlowForm contains some (perhaps all of
> required to send valid request) of theese attributes. Also take a look at
> pPageChecksum, and apex_layout_***longint*** in html code.
>
> JS responsible to generate request is located here
> https://isiscc.smc.edu/i/libraries/apex/minified/legacy.min.js
>
> and some other here
> https://isiscc.smc.edu/i/libraries/apex/minified/desktop_all.min.js
> https://isiscc.smc.edu/i/libraries/apex/minified/widget.checkboxAndRadio.min.js
>
> It's captured resquest from site generated request, just to show differences
> in POST fields. Yes and both (ORA_WWV-nFa8PnpRSIhvaZ1y372N5oNh and still
> working for me ORA_WWV_APP_123=ORA_WWV-GZ16jkW5CQEUTFD0Qi9Xis6Y)
-- 
users mailing list
users@xxxxxxxxxxxxxxxxxxxxxxx
To unsubscribe or change subscription options:
https://admin.fedoraproject.org/mailman/listinfo/users
Fedora Code of Conduct: http://fedoraproject.org/code-of-conduct
Guidelines: http://fedoraproject.org/wiki/Mailing_list_guidelines
Have a question? Ask away: http://ask.fedoraproject.org




[Index of Archives]     [Older Fedora Users]     [Fedora Announce]     [Fedora Package Announce]     [EPEL Announce]     [EPEL Devel]     [Fedora Magazine]     [Fedora Summer Coding]     [Fedora Laptop]     [Fedora Cloud]     [Fedora Advisory Board]     [Fedora Education]     [Fedora Security]     [Fedora Scitech]     [Fedora Robotics]     [Fedora Infrastructure]     [Fedora Websites]     [Anaconda Devel]     [Fedora Devel Java]     [Fedora Desktop]     [Fedora Fonts]     [Fedora Marketing]     [Fedora Management Tools]     [Fedora Mentors]     [Fedora Package Review]     [Fedora R Devel]     [Fedora PHP Devel]     [Kickstart]     [Fedora Music]     [Fedora Packaging]     [Fedora SELinux]     [Fedora Legal]     [Fedora Kernel]     [Fedora OCaml]     [Coolkey]     [Virtualization Tools]     [ET Management Tools]     [Yum Users]     [Yosemite News]     [Gnome Users]     [KDE Users]     [Fedora Art]     [Fedora Docs]     [Fedora Sparc]     [Libvirt Users]     [Fedora ARM]

  Powered by Linux