Project Description:
Software capability:
We need an email ID extraction program in which the user enters the captcha.
Steps:
1. The software opens http://www.craigslist.org
2. As input, we provide the city (chosen from a list box; multiple cities can be selected), the category (only one category can be selected at a time), and the date. The date must match the format Craigslist uses: if we provide “Sun Mar 14”, the extraction must cover the ads posted on that date.
3. The user then clicks on start extraction.
4. The software opens the ads one by one.
For example (City: Atlanta; Category: Personals > Man seeking women; Date: Sun Mar 14):
The software needs to find the ads in Personals > Man seeking women that were posted on Sun Mar 14. Suppose the first ad is http://atlanta.craigslist.org/nat/m4w/1643452888.html; the software then needs to click on “{Reply to this Post}”. We then have to enter a captcha. After submitting the captcha we get the email ID.
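A minimal sketch of how the date input could be checked against a label such as “Sun Mar 14”. The function names are hypothetical, and zero-padding of single-digit days (for example “Mon Mar 08”) is an assumption that should be verified against the labels the site actually shows.

```python
from datetime import date

# English abbreviations are hard-coded so the result does not depend on locale.
DAYS = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]
MONTHS = ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
          "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"]


def listing_label(d: date) -> str:
    """Format a date in the style of the example label "Sun Mar 14".

    Assumption: single-digit days are zero-padded.
    """
    return f"{DAYS[d.weekday()]} {MONTHS[d.month - 1]} {d.day:02d}"


def matches_label(d: date, label: str) -> bool:
    """Compare a date with a user-supplied label, ignoring extra whitespace."""
    return listing_label(d) == " ".join(label.split())
```

For instance, listing_label(date(2010, 3, 14)) gives the label used in the example above.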
The software will perform these steps in the background; the user only enters the captcha. As output we need the ad title, the email ID, and the date as given in the input.
Example:
WM seeking beautiful BF - 45 (OTP), pers-kwk8k-1643452888#craigslist.org, Sun Mar 14
The results will be stored in a CSV file, which is updated automatically.
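One possible way to keep the CSV file updated automatically is to append one row per result as soon as it is available. This is only a sketch: the function name and the column names (title, email, date) are assumptions based on the example line above.

```python
import csv
from pathlib import Path


def append_result(csv_path, title, email, date_label):
    """Append one result row, writing a header first if the file is new or empty."""
    path = Path(csv_path)
    is_new = not path.exists() or path.stat().st_size == 0
    # newline="" lets the csv module control line endings itself.
    with path.open("a", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        if is_new:
            writer.writerow(["title", "email", "date"])  # assumed column names
        writer.writerow([title, email, date_label])
```

Because every row is written and the file closed immediately, the file stays usable even if the program stops midway.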
Another add-on: if the software closes midway, or we need to restart the PC because of a technical issue, then the next time it starts the software needs to offer an option to resume the previous extraction without any issue.
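The resume option could be supported by recording which ads are already finished in a small state file. The sketch below assumes each ad is identified by a string ID (such as the number in the ad URL); the file layout and function names are hypothetical.

```python
import json
import os
from pathlib import Path


def load_done(state_path):
    """Return the set of ad IDs already processed, or an empty set on first run."""
    path = Path(state_path)
    if not path.exists():
        return set()
    with path.open(encoding="utf-8") as f:
        return set(json.load(f))


def save_done(state_path, done):
    """Write the processed IDs to a temporary file, then swap it into place.

    The swap via os.replace means a crash during saving does not leave a
    half-written state file behind.
    """
    tmp = f"{state_path}.tmp"
    with open(tmp, "w", encoding="utf-8") as f:
        json.dump(sorted(done), f)
    os.replace(tmp, state_path)


def pending(all_ids, done):
    """Return the IDs still to be processed, in their original order."""
    return [ad_id for ad_id in all_ids if ad_id not in done]
```

On startup the program would call load_done, ask the user whether to resume, and continue with pending(...) only.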
The user will only enter the captcha. We don’t need to change IP, and no proxy system is required. The software needs to run 20 threads at a time so that the user gets captchas continuously.
If you have any doubts, please reply and we will clarify them.
You can build it as a desktop application or a web application. Please give me a quote for both desktop and web.
Thanks
Joy
Additional Project Description:
03/16/2010 at 5:24 CET
I want to use http://decaptcher.com/client/ to outsource the captcha work.