USED: C #, WebDriver (if necessary)

I want to write a parser that will download information programmatically (without a browser).
But some sites have a hard-boiled captcha.

In order not to tinker with the passage of a captcha programmatically, is it possible to implement the following scenario?

SCENARIO:

  1. We go to the site through a browser (either with a regular C # browser or using WebDriver);
  2. Pass the captcha in the manual;
  3. Copy cookies;
  4. Next, the program begins to parse. Requests for parsing are sent with previously generated cookies.

QUESTION
1. Is it possible to implement this scenario?

PS Only I master, therefore there can be inaccuracies in the formulation of a question and terminology.

  • the short answer is yes. But there may be problems. For example, the site checks for a cookie browser / ip and the like. - KoVadim
  • @KoVadim If not difficult, could you give an example for internet explorer. Those. via internet explorer logged onto ru.stackoverflow.com. How can the code take cookies of my entry through internet explorer and enter any page of en.stackoverflow.com and then parse the page? If I understand this process correctly. Cookies are here c: \ Users \ admin \ AppData \ Roaming \ Microsoft \ Windows \ Cookies \. C: \ Program Files \ Internet Explorer \ iexplore.exe - koverflow
  • I do not write on sharpe. - KoVadim

1 answer 1

Yes you can. But in your case it is better to go the other way, namely:

  1. Create a separate browser profile
  2. Use it when initializing selenium.

This will allow you not to suffer with memorizing cookies through crutches like save cookies in XML or serialization. All the necessary data will already be inside the browser profile.

Important: I do NOT recommend using IE if possible. Selenium works with him just disgusting.