We need to scrape content from a particular site and export the content into a MYSQL database.
The scraper will need to achieve the following:
- Sign into an account using a username and password (given)
- Navigate to the transactions page
- Search for a date range
- Scrape the pages (average of 10 pages a day) of content and bring the information to MYSQL
- Each page of the target site is laid out identically
- Be scheduled to run, or be run manually
- New data is added to the database, not replacing old data
- Data is not duplicated when date ranges are overlapped
The application (PHP) will be controlled by a web admin (GUI) area and will need the following functionality:
- Allow us to control the date ranges for the scraper
- Ability to search the database
- Ability to display the results in a GUI
- Ability to search the database from within the GUI
Please bear the following conditions in mind:
- We own copyright for the tool
- Tool or functionality is not resalable
- Bid and completion time are critical factors in this project
- Completed tool is to be completely confidential
Time of completion will be a major factor in deciding who will be awarded the project.
Please note we are in the process of putting another project together which will run along similar lines. Any winner will have a strong possibility of getting further work.