HTTPRequest to Access Subscribed Contents
Moderators: JRL, Dorian (MJT support)
How to use HTTPRequest to retrieve content from password-protected paid web pages?
- CyberCitizen
- Automation Wizard
- Posts: 724
- Joined: Sun Jun 20, 2004 7:06 am
- Location: Adelaide, South Australia
I believe that it's only available that way for basic web auth; for anything else you would need to use WebRecorder.
For basic authentication, where a username and password is required by the server before the request will complete, put the username and password in the URL using the following format: http://username:password@hostname/etc.etc
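To illustrate what the credentials-in-URL form amounts to, here is a rough sketch in Python (not Macro Scheduler script). It assumes standard HTTP Basic authentication, where the client sends an `Authorization: Basic <base64 of username:password>` header. The function name `split_basic_auth` and the example host are made up for illustration.

```python
import base64
from urllib.parse import urlsplit, urlunsplit


def split_basic_auth(url):
    """Return (url_without_credentials, authorization_header_value).

    The header value is None when the URL carries no credentials.
    """
    parts = urlsplit(url)
    if parts.username is None:
        return url, None

    # Basic auth is just "username:password", base64-encoded.
    credentials = f"{parts.username}:{parts.password or ''}"
    token = base64.b64encode(credentials.encode("utf-8")).decode("ascii")

    # Rebuild the network location without the "username:password@" prefix.
    host = parts.hostname
    if parts.port is not None:
        host = f"{host}:{parts.port}"
    clean_url = urlunsplit(
        (parts.scheme, host, parts.path, parts.query, parts.fragment)
    )
    return clean_url, f"Basic {token}"


if __name__ == "__main__":
    # example.com is a placeholder host, not a site from this thread.
    print(split_basic_auth("http://user:pass@example.com/page"))
```

This only helps when the server itself answers with a Basic auth challenge (the browser's own logon prompt). A login form inside the web page is a different mechanism.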
FIREFIGHTER
CyberCitizen,
Good morning Australia!
I did try out http://username:password@hostname/etc.etc.
It doesn't return the paid content. I guess the content is protected by some Java tricks.
Thanks for your help.
- CyberCitizen
- Automation Wizard
- Posts: 724
- Joined: Sun Jun 20, 2004 7:06 am
- Location: Adelaide, South Australia
CyberCitizen wrote: HTTPRequest only really works on sites that give you this type of logon prompts (not web based ones)
Ok, now I can see what you see.
BTW, it isn't a major issue. MS can simulate keyboard typing. Without HTTPRequest, I still can use mouse+keyboard to extract the contents.
Are you an actual fire fighter as signed?
- CyberCitizen
- Automation Wizard
- Posts: 724
- Joined: Sun Jun 20, 2004 7:06 am
- Location: Adelaide, South Australia
- Marcus Tettmar
- Site Admin
- Posts: 7395
- Joined: Thu Sep 19, 2002 3:00 pm
- Location: Dorset, UK
Not entirely true. HTTPRequest can perform POST operations. So if the website login is a FORM which does a POST, then you can reconstruct that POST using HTTPRequest.
That will get you logged in to the next page. You may then need to "scrape" a session ID out of the page to use with subsequent operations to get data further down the line.
But if the site uses a browser session state or cookie then you won't be able to use HTTPRequest.
In this case it is better to use the WebRecorder functions to control an IE instance which will maintain the session state.
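The POST-then-scrape approach described above can be sketched roughly as follows, in Python rather than Macro Scheduler script. Everything site-specific here is an assumption: the form field names `username` and `password`, the `sessionid=` parameter name, and the URLs passed to `login_and_fetch` would all have to be taken from the real login form and response page.

```python
import re
import urllib.parse
import urllib.request


def build_login_body(username, password):
    """Encode the login form fields as a browser would for a POST."""
    # "username" and "password" are hypothetical field names; use the
    # name= attributes of the <input> elements in the real form.
    fields = {"username": username, "password": password}
    return urllib.parse.urlencode(fields).encode("ascii")


def extract_session_id(html):
    """Return the first sessionid=... value found in the page, or None."""
    # Assumes the site embeds the ID in links as "sessionid=<alphanumerics>".
    match = re.search(r"sessionid=([A-Za-z0-9]+)", html)
    return match.group(1) if match else None


def login_and_fetch(login_url, data_url, username, password):
    """POST the login form, scrape the session ID, then request the data page."""
    body = build_login_body(username, password)
    # Passing data= makes urllib issue a POST instead of a GET.
    with urllib.request.urlopen(login_url, data=body) as resp:
        page = resp.read().decode("utf-8", errors="replace")

    session_id = extract_session_id(page)
    if session_id is None:
        raise RuntimeError("no session ID found in the login response")

    query = urllib.parse.urlencode({"sessionid": session_id})
    with urllib.request.urlopen(f"{data_url}?{query}") as resp:
        return resp.read().decode("utf-8", errors="replace")
```

This sketch does nothing about cookies or browser session state, so it only fits sites that carry the session ID in the page or URL. For the cookie case the advice above still applies.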
Marcus Tettmar
http://mjtnet.com/blog/ | http://twitter.com/marcustettmar
Did you know we are now offering affordable monthly subscriptions for Macro Scheduler Standard?
Marcus Tettmar wrote: But if the site uses a browser session state or cookie then you won't be able to use HTTPRequest.
That's exactly the problem I encountered. The websites in question, such as Bloomberg, provide paid financial information. When I look at their source code (HTML), besides a bunch of JavaScript, it doesn't contain any of the data that is shown on my monitor. That's amazing!